Explore My Portfolio

A selection of data analytics projects using Python, SQL, Tableau, Power BI, EDA, and more.
(Click on a skill to filter.)

Filter by Skill

Projects

NYU Faculty Promotion Data Project

NYU Faculty Promotion: Data Analytics & Visualization ($15K Raise)

End-to-end data analytics for an NYU faculty member’s promotion, including advanced data cleaning, EDA, custom Excel dashboards, and stakeholder-driven reporting.
Delivered actionable insights from messy, incomplete datasets—directly contributing to a successful promotion and a $15,000 raise.

Excel EDA Python Data Cleaning Stakeholder Mgmt Visualization
US Logistics Data Optimization Project Thumbnail

US Logistics Data Transformation & Optimization

Led data transformation and delivery optimization for a 300-truck fleet in Tulsa, OK.
Rebuilt 5 years of fragmented Excel/CSV data (100K+ rows) with Python and SQL, engineered new features, and built Power BI dashboards. Uncovered actionable insights that enabled $50K/year in savings, improved data quality, and guided executive order policy decisions.

Python SQL PowerBI EDA Machine Learning
House Price Prediction Project Thumbnail

House Price Prediction: Ames Housing (XGBoost Regression)

Built a robust house price prediction model on the Ames Housing dataset using advanced EDA, feature engineering, and XGBoost.
Project highlights the real-world complexity of predictive modeling—balancing performance, data quality, and creative experimentation.

Python EDA Data Cleaning Machine Learning Visualization XGBoost Regression
Smart City Data Management Strategy Thumbnail

Smart City Data Strategy: Campo Belo, Brazil

Led the design of a comprehensive data management and analytics strategy for a Smart City initiative in Campo Belo, Minas Gerais. Delivered ELT pipelines, hybrid data modeling, and GDPR-compliant governance, laying the foundation for real-time analytics in urban mobility, public safety, and energy. Designed with phased implementation—starting in small neighborhoods and scaling to the entire city over the next decade.

Python EDA Visualization Data Engineering SQL Azure Governance AI/ML
Telco Churn XGBoost Project Thumbnail

Telco Customer Churn Prediction (XGBoost & EDA)

End-to-end analysis and predictive modeling for a California telecom using Python.
Combined advanced EDA, detailed feature engineering, and XGBoost classification (93% accuracy, ROC-AUC 0.98) to reveal why customers leave.
Interpreted model results with SHAP, delivering actionable business recommendations to reduce churn.

Python EDA Machine Learning Visualization SHAP
NBA Champions DNA Analysis

NBA Champions' DNA Analysis

(Ongoing...) Analysis of NBA championship teams' performance using advanced EDA, machine learning, SQL, and Tableau dashboards to reveal the hidden patterns behind winning rosters.

SQL Python EDA Machine Learning Visualization Tableau
Biochemical Process

Bayesian State Estimation in Biochemical Processes

Evaluating Bayesian estimators for biochemical process optimization using Python, Matlab, and statistical modeling, producing actionable insights for process control.

Python Excel Matlab Machine Learning Advanced Statistic Visualization
Tennis Odds Collection

Automating Tennis Odds Collection

Automated extraction of tennis betting odds using Python, image processing, OCR, and statistical models to fuel predictive analytics.

Python Machine Learning Advanced Statistic Visualization