Skip to content
View JingYou-data's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report JingYou-data

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
JingYou-data/README.md

Jing You — Data Analytics & Data Engineering

Transforming data into automation, intelligence, and business value

Portfolio LinkedIn Tableau Email


👋 Hi there! I'm Jing You

I'm a Data Engineer & Analytics Engineer completing a Data Engineering Apprenticeship at Nashville Software School.
I build end-to-end data pipelines, design medallion architectures, and turn raw data into decisions that drive business outcomes.


🧠 About Me

  • 🎓 Dual-track training in Data Analytics and Data Engineering — Python, SQL, dbt, Snowflake, Airflow, AWS, Databricks
  • 🏗️ Passionate about pipeline architecture, data modeling, and BI storytelling
  • 🚀 Background in e-commerce operations & digital marketing — I understand the business side of data
  • 🌎 Based in Tennessee, actively targeting Seattle / Pacific Northwest opportunities

⚙️ Tech Stack

Languages: Python | SQL
Data Platforms: Snowflake | Databricks | AWS S3 | PostgreSQL | DuckDB
Orchestration & Pipelines: Apache Airflow | dbt (Core) | Docker | GitHub Actions
BI & Visualization: Power BI (Advanced DAX) | Tableau | Power Query
Libraries: Polars | Pandas | NumPy | Scikit-learn | Matplotlib | Seaborn | Folium
Other: MinIO | FastAPI | AWS CLI | Linux


📊 Featured Projects

Project Description Tools
🏥 NPPES Healthcare Provider Analytics Production-grade ELT pipeline processing 8.85M records (9.9 GB) through a medallion architecture. Analyzes geographic distribution, specialty mix, and provider density across US counties. Orchestrated with Airflow, modeled in dbt, served from Snowflake. Python, Polars, DuckDB, dbt, Snowflake, Airflow, AWS S3
🛒 Brazilian E-Commerce Analytics Multi-page Power BI dashboard analyzing 99K+ orders across 27 states. Features dynamic DAX narratives, min-max normalized radar scoring, and RANKX-based state performance rankings. Power BI, DAX, Power Query
📊 Global Online Retail Strategic Intelligence End-to-end BI solution transforming 541K+ raw records into executive insights. Automated Python ETL feeding a multi-page Power BI report with advanced DAX measures and $8.91M revenue analysis. Python, Pandas, Power BI, DAX
🔹 From Calls to Crimes: Nashville Public Safety Capstone analytics project joining 911 call data with crime records to surface spatial and temporal public safety trends across Nashville neighborhoods. Python, Pandas, Folium, Power BI
💬 Telco Customer Churn Prediction End-to-end ML pipeline predicting customer churn with classification models. Includes feature engineering, model evaluation, and business-framed findings. Python, Scikit-learn, Pandas

🌱 Currently Building

  • ⚙️ Medallion architecture pipelines on Snowflake + dbt (staging → intermediate → marts, incremental models, snapshots)
  • 🔥 Databricks / Apache Spark — distributed processing and large-scale transformation
  • 📡 Event-driven data systems — S3 listeners and trigger-based pipeline patterns
  • ☁️ Preparing for DP-700 Microsoft Fabric Data Engineer certification

📈 GitHub Stats

JingYou's GitHub stats Top Langs


"Turning Data into Decisions, and Decisions into Impact."
💬 Open to connecting with data professionals and hiring managers — let's talk pipelines, architecture, and business impact.

Pinned Loading

  1. brazilian-ecommerce-powerbi brazilian-ecommerce-powerbi Public

    Interactive Power BI dashboard analyzing Brazilian e-commerce performance across 27 states

    1

  2. NPPES NPPES Public

    Python 1

  3. Global_Retail_Analysis Global_Retail_Analysis Public

    Included Power BI Star Schema model, Python EDA scripts, and interactive dashboard documentation. Focused on analyzing 1.45M+ sales records.

    Jupyter Notebook 1

  4. Nashville-Public-Safety Nashville-Public-Safety Public

    Python 1