Skip to content
View BlaiseMarvin's full-sized avatar
🎯
Focusing
🎯
Focusing
  • United Kingdom

Block or report BlaiseMarvin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
BlaiseMarvin/README.md

Blaise Rusoke — Data Scientist · ML Engineer · AI Engineer 👋

LinkedIn GitHub followers Email

Results-driven Data Scientist and Machine Learning Engineer with hands-on experience in designing, building, and deploying production-ready ML systems. I focus on turning data into reliable, scalable models and features that drive product value. My work spans research-grade experiments, production ML, and data engineering for analytics at scale.

Summary

  • Practical experience with end-to-end ML workflows: data ingestion, feature engineering, model training, hyperparameter tuning, evaluation, and deployment.
  • Strong Python-first stack (pandas, NumPy, scikit-learn) and deep learning using TensorFlow / PyTorch.
  • Production experience with model serving, containerization (Docker), MLOps tooling (MLflow, DVC), and cloud platforms (AWS / GCP).

Core Skills

  • Languages & Libraries: Python, SQL, pandas, NumPy, scikit-learn, TensorFlow, PyTorch
  • Data Engineering: ETL pipelines, data validation, BigQuery / PostgreSQL, Airflow
  • MLOps & Deployment: Docker, Kubernetes, MLflow, DVC, CI/CD, model monitoring
  • Cloud & Infra: AWS (S3, EC2, SageMaker), GCP (BigQuery, Cloud Run), REST APIs
  • Tools: Jupyter, Git, Docker, Bash, VS Code
  • Other: Experiment tracking, A/B testing, model explainability, hyperparameter optimization

Quick Links

Selected Projects (highlights)

Quick links to dive in — see my pinned repositories on my profile for full code and READMEs.

Curated highlights (click each title for the code and notebooks). The projects below are chosen to showcase my work across ML research, production ML, feature engineering at scale, and end-to-end data platforms.

  • Machine-Learning-and-Big-Data-Analytics — Primary AI & big-data repo. Contains Jupyter notebooks and pipeline examples for large-scale data processing and analytics using Apache Spark, end-to-end model training (feature engineering, model selection, hyperparameter tuning), and examples of scaling experiments. This repo is where most of my ML and big-data work lives: Spark-based ETL/feature pipelines, distributed experiments, and notebooks demonstrating model training and evaluation on large datasets.

  • FaceRecognitionPaymentSystem — Edge-AI face-recognition payment prototype. Deep-learning based face recognition (Siamese model on Inception-Resnet V1), deployed for real-time inference on Raspberry Pi with Intel NCS2 via OpenVINO. This project powered the paper "Edge AI Face Recognition for Public Transport Fare Payment" and won an award (see announcement: https://x.com/UCC_Official/status/1539631099923546113). The published e-print includes model metrics (validation accuracy ~93.8%) and deployment details: https://www.techrxiv.org/users/685096/articles/679153-edge-ai-face-recognition-for-public-transport-fare-payment

  • analytics_engineering — Analytics engineering and transformation pipelines (dbt + orchestration). Contains dbt models and Dagster pipelines used to build and maintain analytics-ready data marts and transformation workflows.

  • data_engineering_zoomcamp — My exercises and projects from the Data Engineering Zoomcamp. Includes hands-on work with ingestion, storage, Airflow orchestration, and analytics stacks (BigQuery/Postgres, Docker, and related tooling).

Other notable projects (selected):

  • Sasanya — A barter trading app built with Flutter (mobile prototyping).
  • socialMediaApp-fastapi — Full-stack prototype using FastAPI for backend APIs.

For more repositories and small utilities, see my GitHub profile: https://github.com/BlaiseMarvin

Example link format you can use for additional projects:

- [Project Title](https://github.com/BlaiseMarvin/REPO) — One-line summary + tech used + measurable impact

How to explore these projects

  • Open the repo link and start with the top-level README or the notebooks/ (or similarly named) folder to run example experiments.
  • Many repos include Jupyter notebooks — use a Python 3.8+ environment, install requirements from requirements.txt, and open the notebooks in Jupyter or VS Code.
  • For the FaceRecognitionPaymentSystem, see the deployment/ and README.md for edge deployment notes (OpenVINO, Raspberry Pi + NCS2).

Hire / Contact

If you'd like to collaborate or discuss a role, email: [email protected] (replace with your preferred address). I'm open to consulting, short-term contracts, and full-time opportunities.

How I work

  • Value-driven: I prioritize models and features that produce measurable product improvements.
  • Reproducible experiments: I track experiments, keep data lineage, and automate retraining where appropriate.
  • Production-first mindset: I design models with serving, latency, and monitoring in mind from day one.

Education & Certifications

  • B.Sc. / M.Sc. in [Your Field] — [University Name] (add years)
  • Certifications: (e.g., Coursera TensorFlow, AWS Certified ML Specialist) — add specifics here

Contact & Links

GitHub Stats

Blaise's GitHub stats Top Langs


Pinned Loading

  1. FaceRecognitionPaymentSystem FaceRecognitionPaymentSystem Public

    Design of a Face recognition payment system prototype

    Jupyter Notebook 2

  2. Sasanya Sasanya Public

    A barter trading app built with flutter

    Dart 2

  3. socialMediaApp-fastapi socialMediaApp-fastapi Public

    An app where you share your views, read your friends' views and vote on read views

    Python

  4. data_engineering_zoomcamp data_engineering_zoomcamp Public

    My code for the Datatalks club data engineering zoomcamp

    Jupyter Notebook

  5. analytics_engineering analytics_engineering Public

    Using dbt and Dagster to build and orchestrate data mart building pipelines

    Python

  6. Machine-Learning-and-Big-Data-Analytics Machine-Learning-and-Big-Data-Analytics Public

    Jupyter Notebook