Skip to content

prefect-openmetadata

Welcome!

Using Prefect and OpenMetadata together will help you build and maintain a data platform you can trust.

Prefect allows you to coordinate your dataflow and provides visibility into the health of your workflow execution and workflow lineage. With OpenMetadata integration, you can enrich your orchestration system with metadata about data lineage, data catalog, data quality and governance, giving you more information about the health of your system.

Getting Started

Python setup

Requires an installation of Python 3.8+.

We recommend using a Python virtual environment manager such as pipenv, conda or virtualenv.

Installation

Install prefect-openmetadata with pip:

pip install prefect-openmetadata

Install OpenMetadata and Prefect

Head over to the install OpenMetadata page for detailed instructions on how to install and configure both platforms.

Write and run metadata ingestion flow

from prefect_openmetadata.flows import ingest_metadata

config = """See an example in the section: Run ingestion flow"""

if __name__ == "__main__":
    ingest_metadata(config)

For more details, check the run ingestion flow section.

Schedule a metadata ingestion flow

Simple example:

prefect deployment build -a -n dev myflow.py:ingest_metadata --interval 900

For more details, check the schedule ingestion flow section.

Resources

If you encounter any bugs while using prefect-openmetadata, feel free to open an issue in the prefect-openmetadata repository.

If you have any questions or issues while using prefect-openmetadata, you can find help in either the Prefect Discourse forum or the Prefect Slack community.

Development

If you'd like to install a version of prefect-openmetadata for development, clone the repository and perform an editable install with pip:

git clone https://github.com/PrefectHQ/prefect-openmetadata.git

cd prefect-openmetadata/

pip install -e ".[dev]"

# Install linting pre-commit hooks
pre-commit install