Sentiment Analysis with BiLSTM

📌 Project Overview

This project implements Sentiment Analysis using a Bidirectional LSTM (BiLSTM) neural network. The goal is to classify text data into positive or negative sentiment, leveraging the ability of BiLSTMs to capture contextual information from both past and future tokens in a sequence.

The pipeline includes:

  • Data preprocessing and train/validation/test splitting.
  • Model training with early stopping to prevent overfitting.
  • Performance visualization (accuracy and loss).
  • Evaluation on test data.
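The early-stopping step above follows the usual pattern: halt training once the validation loss has not improved for a fixed number of epochs. A minimal plain-Python sketch of that logic (the `patience` value is illustrative; the actual callback configuration lives in `train.py`):

```python
def early_stopping(val_losses, patience=3):
    """Return the epoch index at which training should stop,
    i.e. after `patience` epochs with no improvement in validation loss."""
    best = float("inf")
    epochs_without_improvement = 0
    for epoch, loss in enumerate(val_losses):
        if loss < best:
            best = loss
            epochs_without_improvement = 0
        else:
            epochs_without_improvement += 1
        if epochs_without_improvement >= patience:
            return epoch  # stop here; best weights were seen earlier
    return len(val_losses) - 1  # trained to the end

# Validation loss improves for three epochs, then plateaus:
# training stops `patience` epochs after the best epoch.
stop = early_stopping([0.9, 0.7, 0.6, 0.65, 0.66, 0.67, 0.68], patience=3)
```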

βš™οΈ Installation and Setup

1. Clone the repository

git clone https://github.com/anniemburu/Sentimental-Analysis-with-BiLSTM

2. Create and activate a virtual environment (Anaconda or Miniconda recommended)

conda create -n myenv python=3.9

conda activate myenv

3. Install dependencies

All dependencies are listed in requirements.txt. Install them with:

pip install -r requirements.txt

4. Data setup

The processed dataset is expected at datasets/processed/sentiment_data.csv. The data used for this project was sourced from the Movie Review Polarity Dataset. You can modify data_split in src/data/preprocessing.py if you wish to use a different dataset.
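The load-and-split step can be sketched with the standard library alone (a simplified illustration; the column names and 80/10/10 ratios here are assumptions, and the split actually used by the pipeline is defined in src/data/preprocessing.py):

```python
import csv
import random

def load_and_split(path, train=0.8, val=0.1, seed=42):
    """Read the sentiment CSV and split its rows into
    train/validation/test portions after a seeded shuffle."""
    with open(path, newline="", encoding="utf-8") as f:
        rows = list(csv.DictReader(f))
    random.Random(seed).shuffle(rows)
    n_train = int(len(rows) * train)
    n_val = int(len(rows) * val)
    return (rows[:n_train],                    # training set
            rows[n_train:n_train + n_val],     # validation set
            rows[n_train + n_val:])            # test set
```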

🚀 Training the Model

Run the training pipeline with:

python train.py

This will:

  • Train the BiLSTM model on the training data.
  • Validate it on the validation set.
  • Save the trained model to src/models/model_final.h5.
  • Generate training performance plots at src/results/model_performance.png.

📊 Data Source

The dataset consists of labeled text samples with binary sentiment labels (0 = Negative, 1 = Positive).

Source: Movie Review Polarity Dataset.

Preprocessing: The data has been tokenized, padded to fixed sequence length, and split into training, validation, and test sets.
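The tokenize-and-pad step can be illustrated in plain Python (a simplified sketch: the real pipeline builds its vocabulary in src/data/preprocessing.py, and the fixed sequence length used below is an arbitrary example):

```python
def build_vocab(texts):
    """Map each distinct token to an integer id; 0 is reserved for padding."""
    vocab = {}
    for text in texts:
        for token in text.lower().split():
            vocab.setdefault(token, len(vocab) + 1)
    return vocab

def encode_and_pad(text, vocab, maxlen=20):
    """Convert text to token ids, then pad (or truncate) to a fixed length."""
    ids = [vocab.get(tok, 0) for tok in text.lower().split()][:maxlen]
    return ids + [0] * (maxlen - len(ids))

vocab = build_vocab(["a great movie", "a terrible movie"])
# → {"a": 1, "great": 2, "movie": 3, "terrible": 4}
padded = encode_and_pad("a great movie", vocab, maxlen=5)
# → [1, 2, 3, 0, 0]
```

Fixed-length sequences like `padded` are what the BiLSTM consumes, reading them left-to-right and right-to-left to capture context from both directions.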

📂 Project Structure

├── datasets/
│   └── processed/
│       └── sentiment_data.csv
├── src/
│   ├── data/
│   │   ├── preprocessing.py
│   │   └── data_loader.py
│   ├── models/
│   │   ├── base_model.py   # BiLSTM model architecture
│   │   └── model_final.h5   # Saved model
│   └── results/
│       └── model_performance.png
├── train.py                # Training pipeline
├── requirements.txt
└── README.md

🔎 Findings & Results

TBA
