Pipeline catalog
Register pipelines, datasets, and connections in a central metadata store. Query specs, owners, schedules, and environment tags from a versioned catalog API.
DataXPipe turns declarative pipeline specs into runnable artifacts and keeps a live catalog of pipelines, datasets, lineage edges, run history, and quality check results — all through one API.
No credit card required · Generate Airflow DAGs, SQL, and checks from YAML specs
GET /api/v1/pipelines/orders_sync
GET /api/v1/lineage/orders_raw
POST /api/v1/checks → { "status": "pass", "check_id": "row_count" }
Catalog: 12 pipelines · 34 datasets · 89 lineage edges
Last run: success · 3 quality checks passed
1 API
Catalog, runs, checks & lineage
Spec-first
Validate before you generate
SaaS-ready
Multi-tenant orgs & billing
From spec validation to production runs, DataXPipe connects generation, cataloging, lineage, and quality in one workflow.
Capture source-to-target edges as pipelines are registered. Trace upstream and downstream dependencies for any dataset to understand blast radius before you change a transform.
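To make "blast radius" concrete, here is a minimal sketch of a downstream trace over lineage edges, written as a breadth-first walk. It is not DataXPipe's implementation: the `downstream` helper and every dataset name except `orders_raw` are hypothetical, and edges are assumed to be simple (source, target) pairs.

```python
from collections import defaultdict, deque

def downstream(edges, start):
    """Return every dataset reachable downstream of `start`, in BFS order."""
    children = defaultdict(list)
    for source, target in edges:
        children[source].append(target)

    seen = {start}
    order = []
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for child in children[node]:
            if child not in seen:  # guard against revisiting shared or cyclic nodes
                seen.add(child)
                order.append(child)
                queue.append(child)
    return order

# Hypothetical edges for illustration; only orders_raw appears on this page.
EDGES = [
    ("orders_raw", "orders_clean"),
    ("orders_clean", "orders_daily"),
    ("orders_clean", "revenue_report"),
    ("customers_raw", "revenue_report"),
]

print(downstream(EDGES, "orders_raw"))
# ['orders_clean', 'orders_daily', 'revenue_report']
```

Swapping each (source, target) pair before calling the same function gives the upstream dependencies instead.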
Attach SQL and runnable checks to pipeline runs. Store pass/fail results with row counts and sample rows so stakeholders can verify data health after every execution.
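As a rough illustration of a stored check result, the sketch below runs a row-count check against an in-memory SQLite table and returns a record with status, row count, and sample rows. The `status` and `check_id` fields mirror the sample response shown earlier on this page; `row_count`, `sample_rows`, the `run_row_count_check` helper, and the table contents are assumptions made for the example.

```python
import sqlite3

def run_row_count_check(conn, table, min_rows, sample_size=3):
    """Run a row-count check and return a result record as a dict."""
    # The table name is interpolated directly, so pass trusted names only.
    row_count = conn.execute(f"SELECT COUNT(*) FROM {table}").fetchone()[0]
    sample = conn.execute(
        f"SELECT * FROM {table} LIMIT ?", (sample_size,)
    ).fetchall()
    return {
        "check_id": "row_count",
        "status": "pass" if row_count >= min_rows else "fail",
        "row_count": row_count,
        "sample_rows": sample,
    }

# Illustrative data in an in-memory database.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders_raw (id INTEGER, amount REAL)")
conn.executemany(
    "INSERT INTO orders_raw VALUES (?, ?)",
    [(1, 9.5), (2, 20.0), (3, 4.25)],
)

result = run_row_count_check(conn, "orders_raw", min_rows=1)
print(result["status"], result["row_count"])
```

Keeping a few sample rows next to the pass/fail flag lets a reviewer see what the check actually looked at without rerunning the query.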
Validate YAML or JSON specs against a JSON Schema, then generate Airflow DAGs, SQL transforms, test scripts, and metadata bundles — ready to deploy.
Every pipeline run records status, timing, row counts, and linked check results. Prometheus metrics and structured logging integrate with your existing monitoring stack.
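One way to picture a run record is as a single JSON object per log line, which most log shippers can parse. This is a sketch only: the `RunRecord` class, its field names, and the values are hypothetical and do not describe DataXPipe's actual schema.

```python
import json
from dataclasses import dataclass, field, asdict

@dataclass
class RunRecord:
    """Hypothetical shape for one pipeline run."""
    pipeline: str
    status: str
    duration_seconds: float
    row_count: int
    check_ids: list = field(default_factory=list)

    def to_log_line(self):
        """Serialize the record as one JSON object with stable key order."""
        return json.dumps(asdict(self), sort_keys=True)

# Illustrative values, not real run data.
record = RunRecord("orders_sync", "success", 12.4, 1500, ["row_count"])
print(record.to_log_line())
```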
Organizations get isolated API keys, plan-based limits, and role-aware permissions. Platform and admin roles control production deployments and sensitive operations.
A repeatable workflow from declarative specs to observable production pipelines.
Author a YAML or JSON spec with sources, targets, lineage edges, and quality checks. DataXPipe validates it against a JSON Schema before anything runs.
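The idea of validating before anything runs can be sketched with a simplified required-field and type check. This stands in for real JSON Schema validation and is much weaker than it. The field names (`name`, `sources`, `targets`, `lineage`, `checks`) and the `validate_spec` helper are assumptions, not DataXPipe's published schema.

```python
import json

# Hypothetical required top-level fields and their expected types.
REQUIRED_FIELDS = {
    "name": str,
    "sources": list,
    "targets": list,
    "lineage": list,
    "checks": list,
}

def validate_spec(spec):
    """Return a list of problems; an empty list means the spec passed."""
    errors = []
    for key, expected in REQUIRED_FIELDS.items():
        if key not in spec:
            errors.append(f"missing required field: {key}")
        elif not isinstance(spec[key], expected):
            errors.append(f"{key} must be of type {expected.__name__}")
    return errors

good = json.loads("""
{
  "name": "orders_sync",
  "sources": ["orders_raw"],
  "targets": ["orders_clean"],
  "lineage": [{"from": "orders_raw", "to": "orders_clean"}],
  "checks": [{"check_id": "row_count"}]
}
""")
bad = {"name": "orders_sync", "sources": "orders_raw"}

print(validate_spec(good))  # []
for problem in validate_spec(bad):
    print(problem)
```

Collecting every problem rather than stopping at the first one lets an author fix a spec in a single pass.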
Get Airflow DAGs, SQL transforms, runnable check scripts, and metadata bundles. Generated DAGs notify the catalog on every run.
Register pipelines in the catalog API, query lineage for any dataset, and review check results tied to each run — all from one place.
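As a sketch of what querying the catalog might look like from a script, the snippet below builds requests for the two GET endpoints shown earlier on this page without sending them. The host `dataxpipe.example.com`, the Bearer-token header, and the helper names are assumptions; check the actual API documentation for the real base URL and authentication scheme.

```python
from urllib.parse import quote
from urllib.request import Request

BASE_URL = "https://dataxpipe.example.com"  # hypothetical host

def catalog_request(path, api_key, method="GET"):
    """Build (but do not send) an authenticated catalog request."""
    return Request(
        BASE_URL + path,
        method=method,
        headers={
            "Authorization": f"Bearer {api_key}",  # auth scheme is assumed
            "Accept": "application/json",
        },
    )

def pipeline_request(name, api_key):
    """Request for GET /api/v1/pipelines/<name>."""
    return catalog_request(f"/api/v1/pipelines/{quote(name, safe='')}", api_key)

def lineage_request(dataset, api_key):
    """Request for GET /api/v1/lineage/<dataset>."""
    return catalog_request(f"/api/v1/lineage/{quote(dataset, safe='')}", api_key)

req = pipeline_request("orders_sync", "demo-key")
print(req.get_method(), req.full_url)
```

Passing a built request to `urllib.request.urlopen` would send it. Keeping construction separate makes the URLs and headers easy to inspect or test first.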
Start with two pipelines on the Free plan. Upgrade when your team needs more connections, retention, and support.