Portfolio
Dashboards
Power BI Dashboards and Data Visualizations
SQL Projects
Exploratory Data Analysis and Data Queries
- Auditing on Maji Ndogo water project
- Water Accessibility Insights with Complex Queries
- Water access project reporting using advanced SQL Queries
- Auditing the Maji Ndogo water project Data Analysis
This project was developed using SQL to clean and standardize inconsistent data, identify and correct inaccurate records, and prepare the dataset for reliable data modeling. The process involved data validation, outlier detection, handling missing values, and transforming raw data into a structured format suitable for in-depth analysis and decision-making.
This project involves advanced SQL queries to clustering data to unveil Maji Ndogo's water and This survey aimed to identify the water sources people use and determine both the total and average number of users for each source. Additionally, it examined the duration citizens typically spend in queues to access water crisis
This project utilizes advanced SQL techniques—including Common Table Expressions (CTEs), window functions, subqueries, and complex joins with aggregations—to analyze water access data. The goal is to uncover trends and patterns that help identify areas with clean water access, guide infrastructure installations, and support improvements to existing systems.
This project involves advanced SQL queries to analayze sales data, uncovering trends and patterns to optimize sales strategies, and improve forecassting accuracy
Python data analysis intership Projects
Exploratory Data Analysis
- International Restaurants Data Analysis
- Python programming language
- Scikit-learn
- Pandas and NumPy
- Matplotlib and Seaborn
- plotly
Overview:- This analysis explores international restaurants across various countries and cities, focusing on key metrics such as cuisine types, ratings, pricing, popularity, and geographic distribution. The goal is to identify trends, high-performing markets, and potential opportunities for investment or expansion.
🐍Programing languages and uselfull libraries i have used during EDA analysis
Scikit-learn is important for classic ML algorithms (regression, clustering, etc.)
Pandas are used for data manipulation and analysis using DataFrames and NumPy is used for numerical computing and array manipulation
Matplotlib is a basic plotting ,charting and seaborn is for statistical data visualization (built on matplotlib)
plotly is for interactive and web-based visualizations







