DATA PORTFOLIO
Welcome to my data portfolio! Here you can find a selection of my projects and skills in data science and analytics.
Table of Contents
Achievements
- Recipient of Graduate Merit Scholarship Package at the American University, Washington D.C (2024-)
- BSc in Economics Graduate from the London School of Economics (LSE), United Kingdom (2021-2024)
I have a website on which you can see some of projects and blogs as a MS Data Science student.
Projects
Projects I have worked on:
Scrollytelling with Quarto: Close Read Prize Contest
This project analyzes the financial risk of semiconductor stocks using O-GARCH and Value-at-Risk (VaR) models in R to assess stock volatility and investment risks. The findings are presented on a personal website built with the R library ‘qmd-lab/closeread’, HTML and CSS. Examining stock volatility and investment risks highlights key factors influencing market fluctuations and helps investors make informed decisions.
Assessing Bias in Mortgage Lending Using Supervised Machine Learning Methods
I applied Python-supervised machine learning algorithms to assess bias in mortgage lending decisions. The project predicts loan approval outcomes based on applicant data, mitigating bias and improving decision-making processes in mortgage lending through data-driven insights.
SmartRetail: Customer Segmentation for Micro-Targeting
I implemented customer segmentation techniques to enhance marketing strategies using R. By analyzing consumer data, the project can identify distinct customer groups, enabling more targeted and effective marketing campaigns tailored to specific audience segments.
Airbnb Housing Factors Influencing Prices Project
The objective of this Python project is to analyze various factors affecting Airbnb pricing. By examining data on property features, locations, and host attributes, the study identifies key determinants that influence rental prices, providing insights for hosts to optimize their listings.
Micro Projects / Job Simulations
Micro project I have worked on:
-
Predicting Medical Insurance Premiums with Machine Learning
- This class group project- for my STAT-615 Regression class - is designed to predict medical insurance premium prices based on customer health profiles. By combining statistical modeling, tree-based machine learning, and unsupervised clustering, this project identifies key drivers of premium variation and proposes data-driven strategies for personalized pricing. As future work, I created an interactive R Shiny Dashboard to help predict medical insurance prices. The individual can enter his/her personal and medical information, including age, weight, height, transplant history, chronic conditions, cancer history in the family, and number of major surgeries.
-
Financial News Sentiment Analysis for Stock Insights
- This financial news analytics project - for my CS680 Data Mining class - is designed to predict the sentiment of stock headlines and assess how well different machine learning and deep learning models can classify market sentiment. By combining traditional methods like TF-IDF with SVMs and cutting-edge deep learning models like BiLSTM, BERT, and FinBERT, this project evaluates which approaches best uncover the emotional tone behind stock market news.
-
Food Consumer Price Index Inflation R Shiny Dashboard
- This class group project - for my DATA-615 Data Science class - aims to develop an interactive Shiny app that visualizes the Monthly Cost of Goods in the U.S. over the past 10 years, focusing on food Consumer Price Index (CPI) inflation using data from the Federal Reserve Economic Data (FRED). The app automates CPI data collection and applies a range of econometric and time series forecasting models—including ARIMA, SARIMA (Seasonal ARIMA), Holt-Winters, Prophet, and Simple Exponential Smoothing (SES)—to analyze trends and predict future inflation for categories like “Dairy,” “Meat, “Fruit,” “Baked Goods” and “Alcoholic Beverages.” It also incorporates volatility modeling techniques such as GARCH (Generalized Autoregressive Conditional Heteroskedasticity) and EWMA (Exponentially Weighted Moving Average) to assess fluctuations in food prices over time, offering users an interactive, data-driven exploration of food inflation dynamics in the U.S.
-
Lloyds Banking Data Science Forage Job Simulation: Customer Retention Enhancement through Predictive Analytics
- This project simulates a business-critical engagement with Lloyds Banking Group, aimed at reducing customer churn for its subsidiary, SmartBank. It leverages predictive analytics and supervised machine learning techniques to identify at-risk customers, enabling the delivery of strategic retention interventions. The case study demonstrates how data science can support financial institutions in protecting revenue, improving client satisfaction, and optimizing digital engagement channels.
-
Accenture Data Analytics and Visualization Forage Job Simulation: Social Buzz Content Popularity Data Analysis
- This project simulates a real-world data analytics engagement with Accenture, in partnership with the fast-growing social media platform Social Buzz. The objective was to analyze the platform’s massive volume of user-generated content and determine the top-performing content categories based on aggregate popularity scores.
-
API scraping, Data Cleaning and Visualisation
- Mortgage Rate Delinquency vs. Unemployment Analysis: In this file, I investigated the relationship between mortgage delinquency rates and unemployment trends using Python Plotly and FRED API. Utilizing economic data, the analysis explores correlations and potential causations, offering insights into how employment fluctuations impact mortgage repayments.
Certifications
- IBM Data Science Professional Certificate
- Microsoft Azure Fundamentals (AZ-900) Cert Prep by Microsoft Press -> Passed the AZ-900 exam at June 2025
Skills
- Programming Languages: Python, R, SQL, Tableau, PowerBI
- Libraries & Tools: Pandas, NumPy, Scikit-learn
- Data Visualization: Matplotlib, Seaborn, ggplot2
- Machine Learning: Regression, Classification, Clustering
- Other: Git
- Medium writing: Medium
Contact
You can reach out to me via email or connect with me on LinkedIn.