Projects

Below are selected case studies (academic + applied) with a data-science focus.

Job Intelligence Engine

Published:

Flagship end-to-end data science system that converts messy job ads into structured skill signals, salary forecasts, and interpretable career positioning. Built as a reproducible pipeline with modular feature engineering, probabilistic skill models, an XGBoost salary model, and graph-based job–skill landscape outputs designed for decision-ready insights.

MLB Analytics with SQL

Published:

End-to-end SQL analytics project using the Lahman Baseball Database. Designed a complete relational workflow with schema creation, reusable views, advanced CTEs, window functions, and business-focused analyses on talent pipelines, salary dynamics, and player careers.

Machine Learning Projects in Python

Published:

Implemented core machine learning algorithms — from regression and classification to clustering and deep learning — through applied projects in Python. Focused on building intuition for model training, evaluation, and interpretability using Scikit-learn and Jupyter Notebooks.

Exploratory Data Analysis (EDA) Projects in Python

Published:

Developed an applied EDA framework combining real-world case studies — emergency call records and financial time series — to demonstrate data wrangling, feature extraction, and visualisation workflows using pandas, seaborn, and plotly.

Python Coding Challenges

Published:

Created a growing collection of short, narrative-style coding projects designed to strengthen Python reasoning, design thinking, and problem-solving fluency through realistic mini-scenarios.

SQL Coding Challenges

Published:

Ongoing SQL interview-practice log (DataLemur-style) to keep querying sharp alongside flagship projects, with emphasis on core analytical patterns (joins, CTEs, window functions, time series, cohorts).