Skip to content

NandiniMonpara/BookStore_web_scrapping

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

3 Commits
Β 
Β 
Β 
Β 

Repository files navigation

πŸ“˜ Books Web Scraper

A Python project to scrape book information from Books to Scrape, a website designed for practicing web scraping. This project extracts book titles, prices, availability, and star ratings using BeautifulSoup, and applies machine learning models for classification and regression tasks on the scraped data.


πŸ” Features

  • πŸ“š Scrapes book details from multiple pages
  • πŸ’° Extracts price, availability, and star rating
  • πŸ“Š Applies classification to predict star rating categories
  • πŸ“ˆ Uses regression to analyze or predict book prices
  • πŸ“¦ Saves data to CSV for further analysis
  • πŸ’‘ Easy-to-read and beginner-friendly code

πŸ› οΈ Tech Stack

  • Python
  • BeautifulSoup – HTML parsing
  • Requests – fetching web content
  • Pandas – data manipulation
  • Scikit-learn – classification and regression models
  • Jupyter Notebook

🧠 ML Tasks Performed

βœ… Classification

  • Goal: Predict book rating category (e.g., β˜…β˜…β˜…β˜†β˜†, β˜…β˜…β˜…β˜…β˜†) using features like price, availability, and title length.
  • Model Used: Random Forest / Logistic Regression (customizable)

βœ… Regression

  • Goal: Predict the price of a book based on its features (e.g., rating, title features, availability)
  • Model Used: Linear Regression / Decision Tree Regressor

πŸ“ˆ Example ML Insights

  • πŸ’¬ β€œBooks with higher star ratings tend to be priced slightly higher on average.”
  • πŸ“Š β€œOut-of-stock books have lower average ratings.”

About

A Python-based web scraping project that extracts book details like title, price, availability, and rating from the "Books to Scrape" website using BeautifulSoup.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors