IngestionPipeline
Ingestion Pipeline Config is used to set up a DAG and deploy. This entity is used to setup metadata/quality pipelines on Apache Airflow.
Properties
- id: Unique identifier that identifies this pipeline. Refer to ../../../type/basic.json#/definitions/uuid.
- name: Name that identifies this pipeline instance uniquely. Refer to ../../../type/basic.json#/definitions/entityName.
- displayName(string): Display Name that identifies this Pipeline.
- description: Description of the Pipeline. Refer to ../../../type/basic.json#/definitions/markdown.
- pipelineType: Refer to #/definitions/pipelineType.
- owners: Owners of this Pipeline. Refer to ../../../type/entityReferenceList.json. Default:- None.
- fullyQualifiedName: Name that uniquely identifies a Pipeline. Refer to ../../../type/basic.json#/definitions/fullyQualifiedEntityName.
- sourceConfig: Refer to ../../../metadataIngestion/workflow.json#/definitions/sourceConfig.
- openMetadataServerConnection: Refer to ../connections/metadata/openMetadataConnection.json.
- airflowConfig: Refer to #/definitions/airflowConfig.
- service: Link to the service (such as database, messaging, storage services, etc. for which this ingestion pipeline ingests the metadata from. Refer to ../../../type/entityReference.json.
- pipelineStatuses: Last of executions and status for the Pipeline. Refer to #/definitions/pipelineStatus.
- loggerLevel: Set the logging level for the workflow. Refer to ../../../metadataIngestion/workflow.json#/definitions/logLevels.
- raiseOnError(boolean): Control if we want to flag the workflow as failed if we encounter any processing errors. Default:- True.
- deployed(boolean): Indicates if the workflow has been successfully deployed to Airflow. Default:- False.
- enabled(boolean): True if the pipeline is ready to be run in the next schedule. False if it is paused. Default:- True.
- href: Link to this ingestion pipeline resource. Refer to ../../../type/basic.json#/definitions/href.
- version: Metadata version of the entity. Refer to ../../../type/entityHistory.json#/definitions/entityVersion.
- updatedAt: Last update time corresponding to the new version of the entity in Unix epoch time milliseconds. Refer to ../../../type/basic.json#/definitions/timestamp.
- updatedBy(string): User who made the update.
- changeDescription: Change that led to this version of the entity. Refer to ../../../type/entityHistory.json#/definitions/changeDescription.
- incrementalChangeDescription: Change that lead to this version of the entity. Refer to ../../../type/entityHistory.json#/definitions/changeDescription.
- followers: Followers of this entity. Refer to ../../../type/entityReferenceList.json.
- deleted(boolean): When- trueindicates the entity has been soft deleted. Default:- False.
- provider: Refer to ../../../type/basic.json#/definitions/providerType.
- domains: Domains the asset belongs to. When not set, the asset inherits the domain from the parent it belongs to. Refer to ../../../type/entityReferenceList.json.
- applicationType(string): Type of the application when pipelineType is 'application'.
- ingestionRunner: The ingestion agent responsible for executing the ingestion pipeline. Refer to ../../../type/entityReference.json.
- processingEngine: The processing engine responsible for executing the ingestion pipeline logic. Refer to ../../../type/entityReference.json.
Definitions
- pipelineType(string): Type of Pipeline - metadata, usage. Must be one of:- ['metadata', 'usage', 'lineage', 'profiler', 'autoClassification', 'TestSuite', 'dataInsight', 'elasticSearchReindex', 'dbt', 'application'].
- pipelineStatus(object): This defines runtime status of Pipeline. Cannot contain additional properties.- runId(string): Pipeline unique run ID.
- pipelineState(string): Pipeline status denotes if its failed or succeeded. Must be one of:- ['queued', 'success', 'failed', 'running', 'partialSuccess'].
- startDate: startDate of the pipeline run for this particular execution. Refer to ../../../type/basic.json#/definitions/timestamp.
- timestamp: executionDate of the pipeline run for this particular execution. Refer to ../../../type/basic.json#/definitions/timestamp.
- endDate: endDate of the pipeline run for this particular execution. Refer to ../../../type/basic.json#/definitions/timestamp.
- status: Ingestion Pipeline summary status. Informed at the end of the execution. Refer to status.json#/definitions/ingestionStatus.
- config: Pipeline configuration for this particular execution. Refer to ../../../type/basic.json#/definitions/map.
- metadata: Metadata for the pipeline status. Refer to ../../../type/basic.json#/definitions/map.
 
- airflowConfig(object): Properties to configure the Airflow pipeline that will run the workflow. Cannot contain additional properties.- pausePipeline(boolean): pause the pipeline from running once the deploy is finished successfully. Default:- False.
- concurrency(integer): Concurrency of the Pipeline. Default:- 1.
- startDate: Start date of the pipeline. Refer to ../../../type/basic.json#/definitions/dateTime.
- endDate: End Date of the pipeline. Refer to ../../../type/basic.json#/definitions/dateTime.
- pipelineTimezone(string): Timezone in which pipeline going to be scheduled. Default:- UTC.
- retries(integer): Retry pipeline in case of failure. Default:- 0.
- retryDelay(integer): Delay between retries in seconds. Default:- 300.
- pipelineCatchup(boolean): Run past executions if the start date is in the past. Default:- False.
- scheduleInterval(string): Scheduler Interval for the pipeline in cron format.
- maxActiveRuns(integer): Maximum Number of active runs. Default:- 1.
- workflowTimeout(integer): Timeout for the workflow in seconds. Default:- None.
- workflowDefaultView(string): Default view in Airflow. Default:- tree.
- workflowDefaultViewOrientation(string): Default view Orientation in Airflow. Default:- LR.
- email: Email to notify workflow status. Refer to ../../../type/basic.json#/definitions/email.
 
Documentation file automatically generated at 2025-08-12 05:39:47.683420+00:00.