Skip to main content

Lineage

The Lineage API lets you manage data lineage relationships between any two entities in Collate. Lineage captures how data flows from sources to destinations, including column-level mappings and pipeline references.

Available Endpoints

Supported Entity Types

Lineage can be established between any entity types in Collate. Common combinations include:
FromToExample
TableTableETL transformations between source and target tables
TableDashboardDashboards consuming data from tables
PipelineTablePipelines producing data into tables
TopicTableStreaming data from Kafka topics into tables
TableML ModelTables used as training data for ML models

Key Concepts

  • Edge: A directional lineage relationship from one entity to another
  • Column-level lineage: Mapping specific columns from a source entity to columns in a target entity
  • Pipeline reference: Associating a pipeline entity that powers the transformation
  • Upstream/Downstream depth: Controls how many hops to traverse when querying the lineage graph