The Metadata Platform for your Data and AI Stack
Updated Nov 23, 2024 - Java
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance, powered by a central metadata repository, in-depth column-level lineage, and seamless team collaboration.
Amundsen is a metadata-driven application for improving the productivity of data analysts, data scientists, and engineers when interacting with data.
First open-source data discovery and observability platform. We make life easy for data practitioners so you can focus on your business.
World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.
Intake is a lightweight package for finding, investigating, loading and disseminating data.
📙 Awesome Data Catalogs and Observability Platforms.
🐳 The stupidly simple CLI workspace for your data warehouse.
Work with your web service, database, and streaming schemas in a single format.
Scan databases and data warehouses for PII. Tag tables and columns in data catalogs like Amundsen and DataHub.
Meteor is an easy-to-use, plugin-driven metadata collection framework that extracts data from different sources and sinks it to any data catalog.
Reference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow.
An intake plugin for parsing an Earth System Model (ESM) catalog and loading assets into xarray datasets.
Reference Architectures for Data Lakes on AWS
Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data.
Sample code with integration between Data Catalog and RDBMS data sources.
End-to-end DataOps platform deployed by Terraform.
Data catalog for everything in your company
Tag Engine automates the process of creating, updating, deleting, and populating metadata in bulk with the Google Cloud services Data Catalog and Dataplex. Tag Engine is licensed under the Apache 2 license terms. Please make sure to read, understand and agree to the terms of the LICENSE and CONTRIBUTING files before proceeding.
The documentation repository is part of the Corporate Linked Data Catalog (COLID) application.