This project analyzes sales data to uncover insights about revenue, performance across regions, and trends over time.
It was developed in Python (Jupyter Notebook) using Pandas and Matplotlib for data analysis and visualization.
The main goal is to make data speak clearly β #MAKE_DATA_TALK π
- File:
large_sales_data.csv - Columns include:
Dateβ Transaction dateRegionβ Geographic regionSalesPersonβ Sales representativeProductβ Product soldUnitsSoldβ Number of units soldUnitPriceβ Price per unit
- Checked data types for each column
- Converted
Datecolumn to datetime format - Verified no missing values (all columns clean β )
- Calculated a new column
TotalRevenue = UnitsSold * UnitPrice
Key insights and visuals created using Matplotlib:
- Average Units Sold per Region β using
groupby()and.mean() - Total Revenue per Region β using
.sum()and visualized as a Pie Chart - Daily Revenue Trend β plotted as a Line Chart to observe changes over time
- Pie Chart β Revenue distribution per region
- Line Chart β Total daily revenue trend
- The highest revenue was generated in top-performing regions (identified through the pie chart).
- Daily revenue trend shows fluctuations and peaks on specific dates β useful for forecasting and sales planning.
- Clean data ensured accurate aggregation and visualization.
- Python 3
- Pandas
- Matplotlib
- Jupyter Notebook
Sales Analysis Project/
β
βββ sales_analysis_project.ipynb # Main notebook (code & analysis)
βββ sales_analysis_project.html # Exported HTML version for viewing
βββ data/
β βββ large_sales_data.csv # Original dataset
βββ README.md # Project documentation
- Open the
.ipynbnotebook in Jupyter for interactive exploration. - Or view the HTML version directly in your browser.
- You can also preview it online using nbviewer:
π https://nbviewer.org
Mohamed Heta
Data Analyst | Power BI | Python | SQL
Hashtags: #MAKE_DATA_TALK #Ω
ΨΉΩΩΩ
Ψ©_ΩΩ_Ψ§ΩΨ³ΩΨ©