Catch schema drift, quality violations, and SLA breaches before they reach production. The open-source data guardian for modern pipelines.
See how DCVPG handles common pipeline failures automatically, with visualizations you can act on.
Stop silent failures. When upstream sends nulls or schema changes, DCVPG isolates the bad batch in PostgreSQL and prevents downstream BI dashboards from displaying corrupted metrics.
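To make the idea of catching nulls and schema changes concrete, here is a minimal standalone sketch of a batch check. It is not DCVPG's implementation or API: the `CONTRACT` dictionary, the column names, and `find_violations` are hypothetical and use only the Python standard library. A real gate would quarantine the batch whenever the returned list is non-empty.

```python
# Hypothetical contract (not DCVPG's YAML format):
# column name -> (expected Python type, nullable?)
CONTRACT = {
    "order_id": (int, False),
    "amount": (float, False),
    "coupon": (str, True),
}


def find_violations(rows, contract=CONTRACT):
    """Return human-readable violations for a batch of dict rows."""
    violations = []
    for index, row in enumerate(rows):
        # Schema drift: columns that disappeared or appeared upstream.
        for column in sorted(contract.keys() - row.keys()):
            violations.append(f"row {index}: missing column '{column}'")
        for column in sorted(row.keys() - contract.keys()):
            violations.append(f"row {index}: unexpected column '{column}'")
        # Quality checks: nullability and type for columns that are present.
        for column, (expected_type, nullable) in contract.items():
            if column not in row:
                continue
            value = row[column]
            if value is None:
                if not nullable:
                    violations.append(
                        f"row {index}: null in non-nullable '{column}'"
                    )
            elif not isinstance(value, expected_type):
                violations.append(
                    f"row {index}: '{column}' is {type(value).__name__}, "
                    f"expected {expected_type.__name__}"
                )
    return violations


good_batch = [{"order_id": 1, "amount": 9.5, "coupon": None}]
bad_batch = [{"order_id": 2, "amount": None, "coupon": "SPRING"}]

print(find_violations(good_batch))  # empty list: batch may proceed
print(find_violations(bad_batch))   # one violation: batch should be held back
```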
When a CRITICAL violation occurs, our LLM agent analyzes the schema drift, proposes a fix to the YAML contract, and opens a GitHub PR automatically for your review.
Track row-count anomalies and freshness SLA breaches. If an overnight load produces a tenth of the rows its history predicts, get alerted immediately via Slack or PagerDuty.
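One common way to decide whether a row count is "statistically normal" is a z-score against recent history. The sketch below illustrates that general technique only. The function name, the threshold of 3 standard deviations, and the sample history are assumptions, and DCVPG may use a different method.

```python
from statistics import mean, stdev


def is_row_count_anomaly(history, current, z_threshold=3.0):
    """Flag `current` if it lies more than `z_threshold` sample standard
    deviations away from the mean of `history` (a list of past row counts)."""
    if len(history) < 2:
        return False  # too little history to estimate spread
    mu = mean(history)
    sigma = stdev(history)
    if sigma == 0:
        # All past loads were identical, so any change is a deviation.
        return current != mu
    return abs(current - mu) / sigma > z_threshold


# Hypothetical nightly row counts for the past week.
history = [10_200, 9_950, 10_100, 10_050, 9_900, 10_300, 10_000]

print(is_row_count_anomaly(history, 1_000))   # True: roughly a tenth of normal
print(is_row_count_anomaly(history, 10_120))  # False: within normal variation
```

A z-score is simple but sensitive to outliers in the history itself. A median-based measure is a reasonable alternative when past loads are noisy.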
Integrates directly into Claude Desktop or Cursor. Ask questions like "What pipelines are failing?" or "Replay batch abc-123" and manage your platform using natural language.
DCVPG wraps around your existing pipelines. No rewrites needed. You simply add it as a quality gate.
# Add a single operator to your existing DAG
from dcvpg.orchestrators.airflow import DataContractValidatorOperator

validate = DataContractValidatorOperator(
    task_id="validate_orders",
    contract_name="orders_raw",
    config_path="/opt/airflow/dcvpg.config.yaml",
)

# Gate your existing load task
extract >> transform >> validate >> load_to_prod
Explore the complete Data Contract Validator & Pipeline Guardian ecosystem.
Full YAML field reference and capabilities for defining nullability, types, and SLAs.
Write generic Python validation extensions to augment the built-in rule sets.
Configure internal connections for PostgreSQL, Snowflake, S3, GCS, and more.
Integrate directly with Claude Desktop or Cursor for natural language pipeline management.
Expose and consume backend REST services for orchestrators, scripts, and webhook alerts.
Fast-track installation, setup, and deployment of your first data contract validator.