Skip to content

Aboudlal/datafun-04-eda

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

5 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

DataFun-04-EDA πŸ“Œ Project Overview

This repository is for CC4.3: Create a Jupyter Notebook and Select a Kernel. The project shows how to create and organize a Jupyter Notebook, run Python and Markdown cells, and perform a simple Exploratory Data Analysis (EDA) using the Iris dataset.

πŸ“‚ Files in this Repo

cc4_3_notebook.ipynb β†’ The main Jupyter Notebook with code, Markdown, charts, and insights.

README.md β†’ Project description and workflow notes.

requirements.txt β†’ List of Python packages used.

.gitignore β†’ Files and folders excluded from Git.

πŸ› οΈ What I Did

Created a new notebook and selected the correct .venv kernel in VS Code.

Added Markdown cells for structure and explanations.

Imported libraries: pandas, seaborn, matplotlib.

Loaded the Iris dataset and explored rows, columns, and dtypes.

Generated descriptive statistics and distributions.

Created new features (Sepal Area, Petal Area).

Built visualizations (histograms, scatter plots, pairplots).

Added written observations after each chart.

πŸ“Š Key Insights

Setosa species is clearly distinct due to smaller petal size.

Versicolor and Virginica overlap but still show noticeable differences.

Feature engineering (Sepal Area & Petal Area) provides extra ways to analyze the flowers.

πŸš€ Tools Used

Python

Jupyter Notebook (VS Code)

Pandas

Seaborn

Matplotlib

πŸ”— Repository

➑️ GitHub Repo – datafun-04-eda

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors