📚 Books Scraping Project

📌 Overview

This project demonstrates web scraping with Python by extracting book data from the Books to Scrape website (books.toscrape.com).

The scraper collects detailed information about books and stores it in structured CSV files for analysis and practice.

🚀 Project Objective

The goal of this project is to practice:

Web scraping fundamentals
HTML parsing
Data extraction and structuring
Working with real-world scraped datasets

All scraping logic lives in a single Jupyter Notebook (BooksScraping.ipynb), so each step can be read, run, and modified on its own.

🛠️ Technologies Used

Python 🐍
Requests – HTTP requests
BeautifulSoup – HTML parsing
Pandas – Data handling
Jupyter Notebook

📂 Files Included

📓 BooksScraping.ipynb – Main notebook with scraping logic
📊 Books.csv – Basic extracted dataset
📊 BookInfo.csv – Detailed book information
📊 BookDataSet(Scraped).csv – Combined and cleaned dataset

📊 Data Extracted

The scraper collects:

Book title
Price
Availability
Rating
Category
Additional book details (where available)
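
As a rough illustration of how the fields above might be pulled from one listing card, here is a minimal sketch with BeautifulSoup. It is not the notebook's actual code. The sample markup, the class names (`product_pod`, `star-rating`, `price_color`, `availability`), and the helper `parse_book` are assumptions modelled on how Books to Scrape listing cards are commonly structured; check them against the live page. Category is left out because it is assumed to come from a different part of the page.

```python
from bs4 import BeautifulSoup

# Hypothetical sample markup modelled on a Books to Scrape listing card.
# The class names are assumptions; verify them against the live page.
SAMPLE_HTML = """
<article class="product_pod">
  <p class="star-rating Three"></p>
  <h3><a href="catalogue/example-book_1/index.html" title="Example Book">Example ...</a></h3>
  <div class="product_price">
    <p class="price_color">£12.34</p>
    <p class="instock availability">
      In stock
    </p>
  </div>
</article>
"""


def parse_book(card):
    """Extract title, price, availability and rating from one listing card."""
    link = card.h3.a
    # The class attribute is a list such as ["star-rating", "Three"];
    # the rating word is whichever class is not the generic marker.
    rating_classes = card.find("p", class_="star-rating")["class"]
    rating = next(c for c in rating_classes if c != "star-rating")
    return {
        "title": link["title"],  # the full title sits in the title attribute
        "price": card.find("p", class_="price_color").get_text(strip=True),
        "availability": card.find("p", class_="availability").get_text(strip=True),
        "rating": rating,
    }


soup = BeautifulSoup(SAMPLE_HTML, "html.parser")
books = [parse_book(card) for card in soup.find_all("article", class_="product_pod")]
print(books)
```

Parsing an inline string keeps the example runnable offline; in practice the HTML would come from a Requests response body.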

🔍 What This Project Does

Sends requests to web pages
Parses HTML content using BeautifulSoup
Extracts structured book data
Cleans and organizes data using Pandas
Stores results in CSV format
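
The cleaning step in the list above could look something like the sketch below. It assumes prices arrive as strings with a currency symbol and ratings arrive as words such as "Three". Both the input shapes and the helper names `parse_price` and `rating_to_int` are hypothetical, not taken from the notebook.

```python
import re

# Assumed encoding: ratings are scraped as words and mapped to 1-5.
RATING_WORDS = {"One": 1, "Two": 2, "Three": 3, "Four": 4, "Five": 5}


def parse_price(text):
    """Turn a price string such as '£12.34' into a float."""
    match = re.search(r"\d+(?:\.\d+)?", text)
    if match is None:
        raise ValueError(f"no numeric price in {text!r}")
    return float(match.group())


def rating_to_int(word):
    """Convert a rating word to an integer, or None if it is unrecognised."""
    return RATING_WORDS.get(word)


# One raw record as it might come out of the parsing step (hypothetical values).
raw = {"title": "Example Book", "price": "£12.34", "rating": "Three"}
clean = {
    **raw,
    "price": parse_price(raw["price"]),
    "rating": rating_to_int(raw["rating"]),
}
print(clean)
```

With Pandas, the same helpers can be applied column-wise, for example `df["price"].map(parse_price)`, before writing the result with `DataFrame.to_csv`.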

🎯 Purpose

This project is built to:

Learn and practice web scraping
Work with structured and unstructured web data
Improve Python and data handling skills
Build beginner-friendly data projects

⚠️ Disclaimer

This project is for educational purposes only.
The target website (Books to Scrape) is designed specifically for practicing web scraping.

🚀 Future Improvements

Scrape multiple categories automatically
Store data in a database (SQLite / MongoDB)
Add data visualization
Build a scraping pipeline
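
For the database idea above, one possible starting point is Python's built-in `sqlite3` module. This is only a sketch of a future direction: the table name, column layout, and sample row are all hypothetical.

```python
import sqlite3

# Hypothetical cleaned rows: (title, price, availability, rating).
rows = [("Example Book", 12.34, "In stock", 3)]

# ":memory:" keeps the example self-contained; pass a file path to persist data.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE IF NOT EXISTS books ("
    "title TEXT, price REAL, availability TEXT, rating INTEGER)"
)
# Parameter placeholders avoid building SQL strings by hand.
conn.executemany("INSERT INTO books VALUES (?, ?, ?, ?)", rows)
conn.commit()

stored = conn.execute("SELECT title, price, rating FROM books").fetchall()
print(stored)
conn.close()
```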

✨ Author

Anupam Singh
Aspiring Data Analyst & Developer
