You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
End-to-end Azure data engineering pipeline ingesting real-time earthquake data from the USGS API. Implements a Bronze–Silver–Gold lakehouse using Azure Data Factory, Databricks, ADLS Gen2, and Synapse Analytics, with both manual execution and fully automated daily-triggered workflows.
In this project we will orchestrate the movement of raw nyc taxi data from staging layer to presentation layer and to semantic model for creating visualizations.
Real-time streaming data pipeline using Apache Kafka, Spark Structured Streaming, and Delta Lake on Azure. Secure SSL Kafka integration, ADLS storage with OAuth2, and ML-driven anomaly detection with automated email alerts. Modular, scalable, and configurable for IoT and log analytics pipelines.
End-to-end data engineering pipeline implementing Medallion Architecture (Bronze-Silver-Gold) for trip transaction analytics. Automated ETL using Azure Data Factory, Databricks, and Delta Lake with real-time monitoring and email notifications via Logic Apps.
🚀📊 This project demonstrates a complete ETL pipeline for retail sales data using ☁️ Azure Databricks, 📦 ADLS Gen2, and PySpark, following the 🟫 Bronze → 🟦 Silver → 🟨 Gold architecture.
🚀 Build an automated data pipeline with Azure Data Factory and Databricks to efficiently process and analyze trip transaction data using Medallion Architecture.