Sentiment Analysis Pipeline

📌 Project Overview

This project is a Sentiment Analysis Pipeline that processes textual data to determine sentiment polarity (positive, negative, or neutral). It uses Natural Language Processing (NLP) techniques and machine learning models to analyze the sentiment of given text data.

📂 Project Structure

📦 Sentiment-Analysis-Pipeline
├── app.py               # Main application file
├── data_cleaning.py     # Data preprocessing script
├── data_ingestion.py    # Data loading and processing script
├── model.py             # Model training and evaluation script
├── sentiment_analysis.iml  # Project configuration file
├── requirements.txt     # Python dependencies
├── sentiment_analysis.postman_collection.json  # API collection for testing
├── vectorizer.pkl       # Vectorizer file for text transformation
└── README.md            # Project documentation

🚀 Features

✅ Preprocesses textual data (removes noise, tokenization, lemmatization) ✅ Supports multiple machine learning models (Logistic Regression, Naive Bayes, etc.) ✅ Uses TF-IDF vectorization for feature extraction ✅ Provides a REST API for sentiment analysis ✅ Outputs sentiment as Positive, Negative, or Neutral

🛠 Installation & Setup

Clone the Repository

git clone https://github.com/manu0312/Sentiment-Analysis-Pipeline.git
cd Sentiment-Analysis-Pipeline

Create a Virtual Environment (Optional but Recommended)

python -m venv venv
source venv/bin/activate  # On macOS/Linux
venv\Scripts\activate     # On Windows

Install Dependencies
```
pip install -r requirements.txt
```

🔍 Usage

1️⃣ Data Preprocessing

python data_cleaning.py

2️⃣ Train the Model

python model.py

3️⃣ Run the API

python app.py

The API will be available at: http://127.0.0.1:5000

🎯 Example API Usage

Send a POST request to the API:

curl -X POST http://127.0.0.1:5000/predict -H "Content-Type: application/json" -d '{"text": "I love this product!"}'

Response:

{
  "sentiment": "positive"
}

🛠 Technologies Used

Python 🐍
Flask (for API development)
Scikit-learn (for machine learning models)
NLTK (for text preprocessing)
Pandas & NumPy (for data manipulation)

🚀 Happy Coding! 😊

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.idea		.idea
IMDB Dataset.csv		IMDB Dataset.csv
IMDB_Cleaned_Dataset.csv		IMDB_Cleaned_Dataset.csv
README.md		README.md
app.py		app.py
data_cleaning.py		data_cleaning.py
data_ingestion.py		data_ingestion.py
imdb.sql		imdb.sql
model.pkl		model.pkl
requirements.txt		requirements.txt
sentimental analysis.postman_collection.json		sentimental analysis.postman_collection.json
train_model.py		train_model.py
vectorizer.pkl		vectorizer.pkl

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Sentiment Analysis Pipeline

📌 Project Overview

📂 Project Structure

🚀 Features

🛠 Installation & Setup

🔍 Usage

1️⃣ Data Preprocessing

2️⃣ Train the Model

3️⃣ Run the API

🎯 Example API Usage

🛠 Technologies Used

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Sentiment Analysis Pipeline

📌 Project Overview

📂 Project Structure

🚀 Features

🛠 Installation & Setup

🔍 Usage

1️⃣ Data Preprocessing

2️⃣ Train the Model

3️⃣ Run the API

🎯 Example API Usage

🛠 Technologies Used

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages