Skip to content
View sreyas-lankala's full-sized avatar

Block or report sreyas-lankala

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
sreyas-lankala/README.md

Hi, I'm Sreyas Lankala 👋

Data Quality & Governance Engineer | Building trusted data platforms at scale

LinkedIn Email GitHub


About Me

I'm a Data Quality & Governance Engineer with 5+ years of professional experience building the infrastructure that makes data teams trust their data.

  • 🏢 Previously: Amazon (Data Operations) · Hexaware Technologies (Data Engineer – DQ & Governance) · Mphasis (Data Specialist)
  • 🎓 MS Computer Science · Concordia University of Wisconsin · May 2026
  • 📍 Madison, WI · Open to relocation across the USA
  • 🛂 Authorized to work in the USA on F-1 OPT starting June 2026 (STEM OPT eligible – 3-year authorization)
  • 💡 Obsessed with one question: How do you make data trustworthy at scale?

🛠️ Tech Stack

Data Quality & Governance

Great Expectations dbt Collibra Apache Atlas Microsoft Purview

Data Engineering & Pipelines

Apache Airflow Apache Spark Snowflake

Languages & Tools

Python SQL PySpark Power BI

Cloud

AWS Azure GCP


🚀 Featured Projects

Airflow · Azure Data Lake · Microsoft Purview · dbt · Python · SQL

Enterprise-grade data governance platform processing 2M+ synthetic clinical records (Synthea). Automated validation frameworks, full metadata catalog, data lineage tracking, and observability dashboards built to meet real-world healthcare compliance standards.

Key highlights: Data quality rules engine · Metadata cataloging · Schema drift detection · Lineage mapping · SLA monitoring


Snowflake · dbt · Apache Airflow · GitHub Actions · SQL · Python

Full-stack enterprise data platform with medallion architecture (RAW → STAGING → MART), metadata-driven quality rule engine, and operational observability layer.

By the numbers: 40+ SQL scripts · 25+ dbt models · 65+ data quality rules · 6 governance schemas · CI/CD via GitHub Actions


PostgreSQL · SQL · Data Analysis

Advanced SQL analytics — customer segmentation, revenue analysis, cohort queries, and business KPI reporting.


📈 Professional Impact

Company Role Key Achievement
Hexaware Technologies Data Engineer – DQ & Governance 200+ quality rules · 99.9% cross-system accuracy · MTTR ↓25%
Amazon Data Operations Associate 99%+ accuracy · 500+ incidents resolved
Mphasis Data Specialist Dataset accuracy ↑15% · 100K+ records profiled

🎯 Currently Seeking

Full-time roles in the USA (OPT starting May 27, 2026) in:

  • Data Quality Engineer
  • Data Governance Analyst
  • Data Engineer
  • Analytics Engineer

📬 Let's connect: [email protected] · LinkedIn

Pinned Loading

  1. enterprise-data-platform enterprise-data-platform Public

    Enterprise data quality & governance platform: Snowflake · dbt · Airflow · medallion architecture · 65+ quality rules · metadata governance · observability

    Python 1