Scenario A data analyst at an online store.
“Revenue has been unstable. We want to understand what's happening and how to improve it.”
This project analyzes e-commerce data to identify key drivers of revenue, conversion performance, and business growth opportunities.
The dataset initially contained messy and inconsistent data, which was cleaned and transformed into an analysis-ready format.
- Understand revenue trends over time
- Identify key drivers of revenue changes
- Analyze conversion rate and user behavior
- Detect data quality issues
- Provide actionable business recommendations
The raw dataset contained several issues:
- Missing values (country, device, sessions, product)
- Inconsistent categories (e.g., "Germany" vs "germany")
- Invalid date formats
- Non-numeric revenue values
-
Standardized text fields (lowercase)
-
Replaced missing values with:
unknown(categorical fields)- median (sessions)
- 0 (revenue/conversions where necessary)
-
Removed invalid dates
-
Converted all numeric fields properly
- Conversion Rate = conversions / sessions
- Average Order Value (AOV) = revenue / conversions
- Revenue analyzed over time to detect patterns and anomalies
- Compared across devices and countries
- Identified underperforming segments
- Identified top products by revenue
- Analyzed AOV for pricing insights
- Measured impact of unknown categories
- Highlighted tracking/data issues
- Revenue fluctuations driven primarily by changes in conversion rate
- Mobile users show lower conversion compared to desktop
- Some products generate high revenue but low AOV
- A portion of revenue is linked to unknown categories, indicating tracking issues
- Improve mobile user experience to increase conversions
- Focus marketing on high-performing segments
- Promote high AOV products
- Fix tracking issues for missing product and country data
- Excel (data cleaning, pivot tables, charts)
- Python (data generation and preprocessing)
data/→ raw and cleaned datasetsimages/→ charts and visualizationsanalysis_summary.md→ detailed insights
This project demonstrates how messy data can be transformed into actionable insights that drive business decisions.
Reshma Chougule