IngestionPipeline
Ingestion Pipeline Config is used to set up a DAG and deploy. This entity is used to setup metadata/quality pipelines on Apache Airflow.
Properties
id: Unique identifier that identifies this pipeline. Refer to ../../../type/basic.json#/definitions/uuid.name: Name that identifies this pipeline instance uniquely. Refer to ../../../type/basic.json#/definitions/entityName.displayName(string): Display Name that identifies this Pipeline.description: Description of the Pipeline. Refer to ../../../type/basic.json#/definitions/markdown.pipelineType: Refer to #/definitions/pipelineType.owners: Owners of this Pipeline. Refer to ../../../type/entityReferenceList.json. Default:null.fullyQualifiedName: Name that uniquely identifies a Pipeline. Refer to ../../../type/basic.json#/definitions/fullyQualifiedEntityName.sourceConfig: Refer to ../../../metadataIngestion/workflow.json#/definitions/sourceConfig.openMetadataServerConnection: Refer to ../connections/metadata/openMetadataConnection.json.airflowConfig: Refer to #/definitions/airflowConfig.service: Link to the service (such as database, messaging, storage services, etc. for which this ingestion pipeline ingests the metadata from. Refer to ../../../type/entityReference.json.pipelineStatuses: Last of executions and status for the Pipeline. Refer to #/definitions/pipelineStatus.loggerLevel: Set the logging level for the workflow. Refer to ../../../metadataIngestion/workflow.json#/definitions/logLevels.deployed(boolean): Indicates if the workflow has been successfully deployed to Airflow. Default:false.enabled(boolean): True if the pipeline is ready to be run in the next schedule. False if it is paused. Default:true.href: Link to this ingestion pipeline resource. Refer to ../../../type/basic.json#/definitions/href.version: Metadata version of the entity. Refer to ../../../type/entityHistory.json#/definitions/entityVersion.updatedAt: Last update time corresponding to the new version of the entity in Unix epoch time milliseconds. Refer to ../../../type/basic.json#/definitions/timestamp.updatedBy(string): User who made the update.changeDescription: Change that led to this version of the entity. Refer to ../../../type/entityHistory.json#/definitions/changeDescription.deleted(boolean): Whentrueindicates the entity has been soft deleted. Default:false.provider: Refer to ../../../type/basic.json#/definitions/providerType.domain: Domain the asset belongs to. When not set, the asset inherits the domain from the parent it belongs to. Refer to ../../../type/entityReference.json.applicationType(string): Type of the application when pipelineType is 'application'.
Definitions
pipelineType(string): Type of Pipeline - metadata, usage. Must be one of:["metadata", "usage", "lineage", "profiler", "autoClassification", "TestSuite", "dataInsight", "elasticSearchReindex", "dbt", "application"].pipelineStatus(object): This defines runtime status of Pipeline. Cannot contain additional properties.runId(string): Pipeline unique run ID.pipelineState(string): Pipeline status denotes if its failed or succeeded. Must be one of:["queued", "success", "failed", "running", "partialSuccess"].startDate: startDate of the pipeline run for this particular execution. Refer to ../../../type/basic.json#/definitions/timestamp.timestamp: executionDate of the pipeline run for this particular execution. Refer to ../../../type/basic.json#/definitions/timestamp.endDate: endDate of the pipeline run for this particular execution. Refer to ../../../type/basic.json#/definitions/timestamp.status: Ingestion Pipeline summary status. Informed at the end of the execution. Refer to status.json#/definitions/ingestionStatus.
airflowConfig(object): Properties to configure the Airflow pipeline that will run the workflow. Cannot contain additional properties.pausePipeline(boolean): pause the pipeline from running once the deploy is finished successfully. Default:false.concurrency(integer): Concurrency of the Pipeline. Default:1.startDate: Start date of the pipeline. Refer to ../../../type/basic.json#/definitions/dateTime.endDate: End Date of the pipeline. Refer to ../../../type/basic.json#/definitions/dateTime.pipelineTimezone(string): Timezone in which pipeline going to be scheduled. Default:"UTC".retries(integer): Retry pipeline in case of failure. Default:0.retryDelay(integer): Delay between retries in seconds. Default:300.pipelineCatchup(boolean): Run past executions if the start date is in the past. Default:false.scheduleInterval(string): Scheduler Interval for the pipeline in cron format.maxActiveRuns(integer): Maximum Number of active runs. Default:1.workflowTimeout(integer): Timeout for the workflow in seconds. Default:null.workflowDefaultView(string): Default view in Airflow. Default:"tree".workflowDefaultViewOrientation(string): Default view Orientation in Airflow. Default:"LR".email: Email to notify workflow status. Refer to ../../../type/basic.json#/definitions/email.
Documentation file automatically generated at 2025-01-15 09:05:41.923720+00:00.