Metadata Ingestion - Incremental Extraction
The default Metadata Ingestion roughly follows these steps:- Fetch all the information from the Source.
- Compare the information with the OpenMetadata information to update it properly.
- Compare the information with the OpenMetadata information to delete entities that were deleted.
- Fetch the last successful pipeline run.
- Add a small safety margin.
- Get all the structural changes since then.
- Flag deleted entities.
- Fetch/Compare only the entities with structural changes.
- Delete entities flagged for deletion.
External Ingestion
When using the Incremental Extraction feature with External Ingestions (ingesting using YAML files instead of setting it up from the UI), you must pass the ingestion pipeline fully qualified name to the configuration. This should be{service_name}{pipeline_name}
Example:
Feature available for
Databases

BigQuery
BETA | OpenMetadata

Redshift
BETA | OpenMetadata

Snowflake
BETA | OpenMetadata