Skip to content

IvayloP0709/Sentiment-Analysis-using-VADER-and-roBERTa

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 

Repository files navigation

📊 Sentiment Analysis using VADER and roBERTa

A comparative NLP project that analyzes sentiment in textual reviews using both lexicon-based (VADER) and transformer-based (roBERTa) approaches.

Python Jupyter NLP

📋 Overview

This project compares two sentiment analysis techniques on a large corpus of text reviews:

Approach Model Type
VADER Valence Aware Dictionary and sEntiment Reasoner Lexicon-based, rule-based
roBERTa Robustly optimized BERT pretraining approach Transformer-based, deep learning

Both models produce polarity scores (positive, negative, neutral) for comparative analysis.

📊 Dataset

  • Size: ~500,000 textual reviews
  • Task: Sentiment classification and polarity scoring

⚠️ The dataset is too large to host on GitHub (even when compressed). Please obtain the review dataset from your course materials or the original source.

🏗️ Project Structure

├── Sentiment.ipynb    # Main notebook with implementation and analysis
└── README.md

📈 Contents

The notebook includes:

  • Data preprocessing and exploratory analysis
  • Data distribution visualizations
  • VADER sentiment scoring and polarity outputs
  • roBERTa sentiment scoring and polarity outputs
  • Model comparison and polarity score analysis
  • Limitations of each approach

🚀 Getting Started

Prerequisites

pip install pandas numpy transformers torch vaderSentiment

Usage

  1. Add your review dataset to the project directory
  2. Open Sentiment.ipynb in Jupyter
  3. Update the data path in the notebook
  4. Run all cells to preprocess, analyze, and compare sentiment scores

📝 Key Findings

  • Comparative performance of lexicon-based vs. transformer-based sentiment analysis
  • Polarity score outputs from both VADER and roBERTa
  • Discussion of strengths and limitations of each approach

👤 Author

Ivaylo Papazov

📄 License

This project is available for educational purposes.


⭐ If you find this project useful, please consider giving it a star!

About

Comparative study of sentiment analysis techniques using VADER (rule-based) and roBERTa (transformer-based) models. Demonstrates the performance differences between traditional and deep learning approaches for text sentiment classification.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors