A data analysis project exploring consumer behavior and sales trends through EDA using Python. Includes visualizations and insights derived from retail shopping data.
Updated Jul 17, 2025 - Jupyter Notebook
This project demonstrates an ETL pipeline built with PySpark: raw data is ingested, processed, and transformed into a star schema stored in PostgreSQL, then visualized in Power BI.
Exploratory analysis of the Online Retail dataset using Python, Pandas, and Matplotlib. Includes data cleaning, identification of missing and duplicate values, analysis of sales by country, customer, and period, and visualization of shopping patterns.
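As a rough illustration of the cleaning and aggregation steps this description mentions, here is a minimal Pandas sketch. It is not taken from the repository: the sample rows are made up, and the column names (`InvoiceNo`, `Country`, `CustomerID`, `Quantity`, `UnitPrice`) are assumptions modeled on the public Online Retail dataset.

```python
import pandas as pd

# Made-up sample rows; column names are assumed, not taken from the repository.
df = pd.DataFrame({
    "InvoiceNo": ["1", "1", "2", "3", "3"],
    "Country": ["UK", "UK", "France", "UK", "UK"],
    "CustomerID": [10.0, 10.0, None, 11.0, 11.0],
    "Quantity": [2, 2, 1, 5, 3],
    "UnitPrice": [1.5, 1.5, 4.0, 2.0, 1.0],
})

# Identify missing values per column and count fully duplicated rows.
missing_per_column = df.isna().sum()
duplicate_rows = int(df.duplicated().sum())

# Clean: drop exact duplicates and rows without a customer identifier.
clean = df.drop_duplicates().dropna(subset=["CustomerID"]).copy()

# Derive a sales amount per line and aggregate it by country.
clean["Sales"] = clean["Quantity"] * clean["UnitPrice"]
sales_by_country = (
    clean.groupby("Country")["Sales"].sum().sort_values(ascending=False)
)

print(missing_per_column)
print("duplicate rows:", duplicate_rows)
print(sales_by_country)
```

Whether rows with a missing customer identifier should be dropped or kept depends on the question being asked; dropping them here is only one possible choice.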