Skip to content
Change the repository type filter

All

    Repositories list

    • List of entity resolution software and resources.
      Other
      12000Updated Feb 22, 2025Feb 22, 2025
    • zerox

      Public
      PDF to Markdown with vision models
      Python
      MIT License
      841000Updated Nov 17, 2024Nov 17, 2024
    • SDG is a specialized framework designed to generate high-quality structured tabular data.
      Python
      Apache License 2.0
      389000Updated Nov 2, 2024Nov 2, 2024
    • An open-source framework that simplifies implementation of data solutions.
      TypeScript
      Apache License 2.0
      28000Updated Oct 31, 2024Oct 31, 2024
    • In this repository you may find KQL (Kusto Query Language) queries and Watchlist schemes for data sources related to Microsoft Sentinel (a SIEM tool).
      MIT License
      25000Updated Oct 31, 2024Oct 31, 2024
    • A utility for Migrating Data between Oracle, Postgres, MySQL MariaDB, Snowflake. Stage Data from supported database to Amazon S3 and Azure Blob Storage in JSON …
      JavaScript
      MIT License
      10000Updated Oct 31, 2024Oct 31, 2024
    • Data Pipeline based on Medallion Architecture using Azure Data Factory, Databricks and DBT.
      Python
      4000Updated Oct 31, 2024Oct 31, 2024
    • AI-in-a-Box leverages the expertise of Microsoft across the globe to develop and provide AI and ML solutions to the technical community. Our intent is to prese…
      Jupyter Notebook
      MIT License
      197000Updated Oct 28, 2024Oct 28, 2024
    • One framework to develop, deploy and operate data workflows with Python and SQL.
      Python
      Apache License 2.0
      66000Updated Oct 28, 2024Oct 28, 2024
    • Fast data quality framework for modern data infrastructure
      Scala
      GNU Lesser General Public License v3.0
      6000Updated Oct 24, 2024Oct 24, 2024
    • In this project we are going to create an end-to-end data platform right from Data Ingestion, Data Transformation, Data Loading and Reporting.
      Jupyter Notebook
      5000Updated Oct 19, 2024Oct 19, 2024
    • RLS (Row-level Security) Implementation on Unity Catalog initiated Databricks. Using ROW FILTER
      1000Updated Oct 18, 2024Oct 18, 2024
    • Deploy a multi-account cloud foundation to support highly-regulated workloads and complex compliance requirements.
      TypeScript
      Apache License 2.0
      644000Updated Oct 17, 2024Oct 17, 2024
    • Databricks Platform - Architecture, Security, Automation and much more!!
      Jupyter Notebook
      31000Updated Oct 16, 2024Oct 16, 2024
    • Bicep
      MIT License
      21000Updated Oct 16, 2024Oct 16, 2024
    • The Security Reference Architecture (SRA) implements typical security features as Terraform Templates that are deployed by most high-security organizations, and…
      HCL
      Other
      92000Updated Oct 15, 2024Oct 15, 2024
    • This Sample Datawarehouse Project is an integration from Informatica Cloud(IICS) to Snowflake and vice versa for ETL/ELT.
      1000Updated Oct 14, 2024Oct 14, 2024
    • Azure Cognitive Search + Azure OpenAI Accelerator
      Jupyter Notebook
      MIT License
      977000Updated Oct 7, 2024Oct 7, 2024
    • Lakehouse Medallion Architecture using modern Data Stack tools such as Fivetran, Snowflake and dbt.
      Python
      8000Updated Sep 30, 2024Sep 30, 2024
    • This project implements a Lakehouse Medallion Architecture using modern Data Stack tools such as Fivetran, Snowflake and dbt. The ficticious organization is an …
      Python
      8100Updated Sep 30, 2024Sep 30, 2024
    • A Streamlit app for assessing data quality in Snowflake
      Python
      MIT License
      2000Updated Sep 29, 2024Sep 29, 2024
    • The ADF Universal Framework is an open-source project designed to provide a comprehensive and flexible solution for building scalable and efficient data integra…
      TSQL
      The Unlicense
      4000Updated Sep 6, 2024Sep 6, 2024
    • The ADF Universal Framework is an open-source project designed to provide a comprehensive and flexible solution for building scalable and efficient data integra…
      TSQL
      The Unlicense
      4000Updated Sep 6, 2024Sep 6, 2024
    • Building a Data Lakehouse using the Medallion architecture.
      Jupyter Notebook
      6000Updated Sep 1, 2024Sep 1, 2024
    • Python scripts for Azure Blob Storage data ingestion into Snowflake. Includes a manual version and an HTTP request version for Azure Functions.
      Python
      2000Updated Aug 24, 2024Aug 24, 2024
    • A collection of awesome resources regarding Record Linkage.
      MIT License
      1000Updated Aug 16, 2024Aug 16, 2024
    • Azure MLOps (v2) solution accelerators. Enterprise ready templates to deploy your machine learning models on the Azure Platform.
      Python
      MIT License
      801000Updated Aug 7, 2024Aug 7, 2024
    • Data Pipeline with Delta Lake using Medallion architecture
      Jupyter Notebook
      2000Updated Aug 6, 2024Aug 6, 2024
    • ML Ops Accelerator: Databricks & Azure Machine Learning Unification
      Python
      MIT License
      70000Updated Aug 5, 2024Aug 5, 2024
    • Azure Analytics End to End with Azure Synapse - Deployment Accelerator
      Bicep
      MIT License
      125000Updated Jul 16, 2024Jul 16, 2024
    ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.