This project explores the supplements and vitamins e-commerce market through web scraping and exploratory data analysis (EDA).
An online pharmacy website, one of the most visited in Greece, was scraped using BeautifulSoup to collect product-level data. The dataset was then analyzed to identify pricing patterns, discount strategies, popular product categories, and consumer preferences.
Key insights include:
- Top 10 most inexpensive products
- Top 10 products with the highest discounts
- Discount distribution across the pharmacy
- Top-rated and most reviewed products
- Most common ingredients and supplement categories
Pharmacy24webscrape_final.ipynb Web scraping pipeline for collecting product data from the online pharmacy.
EDA_SupVitDataset.ipynb Data cleaning, exploratory analysis, and visualization of the extracted dataset.
SupVit_Dataset.csv Final dataset containing detailed information on supplements and vitamins products.
product_name— Product name and short descriptioncategory— Supplement categorynormal_price— Original listed pricediscount_price— Price after discountreviews_count— Number of user reviewsrating— Average customer ratingdiscount%— Relative discount: (normal − discount) / normalweighted_average_metric— Popularity/quality score combining rating and reviews