Skip to main content

Runtime validation matrix

ContractForge separates connector compatibility from runtime validation.

Compatibility means the contract shape and implementation path are supported. Validation means the path has been exercised against a real source, runtime or file layout. Production readiness still depends on the user's network, identity, cloud policy, provider limits and data volume.

Connector maturity levels

StatusMeaningProduction guidance
stableCore connector path with broad automated coverage and low platform-specific variability.Suitable for normal use after environment validation.
validatedTested against at least one real external service/runtime path.Review documented prerequisites and repeat validation in the target workspace.
experimentalLimited or adapter-specific behavior that should be validated in the target runtime.Use for pilots or controlled onboarding; add real-source validation before production.
plannedIntended connector or validation path not yet supported.Track as roadmap only.

validated is intentionally scoped. It does not certify every authentication mode, provider edition, network topology, workspace policy or data-volume profile for that connector. It means ContractForge has evidence for at least one representative real-runtime path.

Current maturity summary

Connector familyConnectorsMaturityEvidence summary
Spark catalog and filestable, sql, csv, json, parquet, orc, delta, textstableCovered by automated tests and real file-folder validation patterns.
Object/file formatssource.format=avro, xml, jsonl, ndjsonvalidatedUsed through object-storage style connectors when the runtime provides the required Spark readers.
HTTP fileshttp_file, http_csv, http_json, http_textvalidatedUsed for public HTTP file ingestion where direct Spark https:// reads are not reliable.
Object storages3, azure_blob, adls, object_storage, blobvalidatedValidated through governed external locations and selected direct credential paths where the runtime allows them.
Auto LoaderautoloadervalidatedValidated with available-now ingestion, checkpoints, child runs and stream metric aggregation.
JDBCjdbc, postgres, postgresql, mysql, sqlserver, oracle, redshiftvalidatedValidated with real PostgreSQL/Supabase and RDS/Aurora IAM scenarios; other JDBC engines share the same Spark JDBC contract and require driver/runtime validation.
Bounded RESTrest_apivalidatedValidated with nested public API payloads, raw response preservation and transform.shape.parse_json.
Externalized sourcesSaaS, ERP, CRM, drives, SFTP/FTP, Kafka/Event Hubs, Snowflake, BigQuery, MongoDB/searchexternalizedUse native platform/vendor connectors or custom resolver packages, then expose data to ContractForge through official primitives.

Runtime evidence

ScenarioRuntimeStatusWhat was validated
Public CSV over HTTPDatabricks serverlessvalidatedDriver-side HTTP download, explicit CSV schema/options, overwrite target and control-table metadata.
Nested REST JSONDatabricks serverless/classic patternvalidatedRaw API payload preservation and transform.shape.parse_json for nested structs/arrays.
Azure Blob via External LocationAzure Databricks serverlessvalidatedGoverned storage access without direct SAS/Hadoop credential injection in the contract.
Azure Blob via SAS/direct filesystem configAzure Databricks classic clustervalidatedDirect storage credential behavior where runtime filesystem configuration is allowed.
AWS S3 via External LocationAWS Databricks serverlessvalidatedUnity Catalog-governed S3 access and object storage ingestion patterns.
AWS S3 direct credentialsClassic clustervalidatedS3A credential setup with access key/session token from secrets.
Auto Loader available-nowDatabricks classic/serverless patternsvalidatedCheckpoints, child runs and aggregate stream metrics in ctrl_ingestion_streams.
JDBC PostgreSQL/SupabaseDatabricksvalidatedJDBC reads, pushdown, partitioning, watermark and SCD/hash-diff patterns.
RDS/Aurora IAM JDBCAWS Databricks serverless and classic diagnosticsvalidatedIAM token generation, SSL parameters, network diagnosis and Spark JDBC options.
Public USGS GeoJSON shapeAzure Databricks classic clustervalidatedREST/raw GeoJSON with schema_ref, array-of-struct explosion, coordinate extraction, SCD hash diff and gold marts. Latest evidence: execution run 516033488155314, validation run 1040635449894221; 1 bronze payload, 1,807 silver events, 31 daily gold rows and 4 magnitude-band rows.
Large known dataset TPCHAzure Databricks classic clustervalidatedsamples.tpch.lineitem source with 29,999,795 observed rows and a 250,000-row processed validation. Validated sql -> table -> sql, hash_diff_upsert, transform derivation/deduplication, gold aggregation and control-table evidence. Latest evidence: execution run 714754107506943, validation run 369877249106831.
Native connector handoffDatabricksvalidated patternExternal systems can be landed or federated by Databricks/native tooling, then ingested by ContractForge as table, sql, files or object storage.

Production adoption rule

Before marking a new connector path as production-ready in a project, record:

  1. Runtime type and Databricks workspace constraints.
  2. Connector version and external library version when applicable.
  3. Authentication mode and secret redaction behavior.
  4. Network path and provider-side allowlists/policies.
  5. Representative data volume, pagination and file-count limits.
  6. Control-table evidence: run status, row counts, errors, quality status and stream metrics when applicable.