connectors

No menu items for this category
OpenMetadata Documentation
Iceberg

Iceberg

BETA
Available In
Feature List
Metadata
Owners
Query Usage
Data Profiler
Data Quality
Lineage
Column-level Lineage
dbt
Tags
Stored Procedures
Sample Data
Auto-Classification

In this section, we provide guides and references to use the Iceberg connector.

Configure and schedule Iceberg metadata workflows from the OpenMetadata UI:

To run the Ingestion via the UI you'll need to use the OpenMetadata Ingestion Container, which comes shipped with custom Airflow plugins to handle the workflow deployment. If you want to install it manually in an already existing Airflow host, you can follow this guide.

If you don't want to use the OpenMetadata Ingestion container to configure the workflows via the UI, then you can check the following docs to run the Ingestion Framework in any orchestrator externally.

The requirements actually depend on the Catalog and the FileSystem used. In a nutshell, the used credentials must have access to reading the Catalog and the Metadata File.

Must have glue:GetDatabases, and glue:GetTables permissions to be able to read the Catalog.

Must also have the s3:GetObject permission for the location of the Iceberg tables.

Must have dynamodb:DescribeTable and dynamodb:GetItem permissions on the Iceberg Catalog table.

Must also have the s3:GetObject permission for the location of the Iceberg tables.

It depends on where and how the Hive / Rest Catalog is setup and where the Iceberg files are stored.

Click Settings in the side navigation bar and then Services.

The first step is to ingest the metadata from your sources. To do that, you first need to create a Service connection first.

This Service will be the bridge between OpenMetadata and your source system.

Once a Service is created, it can be used to configure your ingestion workflows.

Visit Services Page

Select your Service Type and Add a New Service

Click on Add New Service to start the Service creation.

Create a new Service

Add a new Service from the Services page

Select Iceberg as the Service type and click Next.

Select Service

Select your Service from the list

Provide a name and description for your Service.

OpenMetadata uniquely identifies Services by their Service Name. Provide a name that distinguishes your deployment from other Services, including the other Iceberg Services that you might be ingesting metadata from.

Note that when the name is set, it cannot be changed.

Add New Service

Provide a Name and description for your Service

In this step, we will configure the connection settings required for Iceberg.

Please follow the instructions below to properly configure the Service to read from your sources. You will also find helper documentation on the right-hand side panel in the UI.

Configure Service connection

Configure the Service connection by filling the form