Sentiment-Analysis

In the following code we built an SVM classifier for IMDB movie reviews using the NLTK, scikit-learn (sklearn), and pandas libraries.

Data-Sets
We used a data set that contains 49,950 IMDB reviews tagged as positive or negative.
The training set consists of 70% of the reviews, and the test set consists of the remaining 30%.
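A 70/30 split like the one described above can be sketched as follows. This is only an illustration on toy data: the use of `train_test_split`, the `random_state` value, and the stratification by label are assumptions, not details taken from the notebooks.

```python
from sklearn.model_selection import train_test_split

# Toy stand-in for the review data set (the real one has 49,950 reviews).
reviews = [f"review {i}" for i in range(10)]
labels = ["positive", "negative"] * 5

# test_size=0.3 keeps 70% for training and 30% for testing.
# random_state and stratify are assumptions added for reproducibility
# and to keep the class balance the same in both parts.
train_x, test_x, train_y, test_y = train_test_split(
    reviews, labels, test_size=0.3, random_state=0, stratify=labels
)

print(len(train_x), len(test_x))
```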

Analyzing & Pre-Processing The Data
We tried to find and score the most significant set of tokens, meaning those that contribute the most information about a review's sentiment. We then chose TF-IDF to score the tokens and normalize them into vectors. For the TF-IDF step we used an n-gram model with the range set to (1,5), so that each feature can also carry information from the preceding words, and a stop-word list to filter out noisy words. We also wrote a custom function that tokenizes the reviews using WordNetLemmatizer, so that different inflections of the same word map to a single form. The SVM classifier, which receives the output of the TF-IDF step, uses a linear kernel with C set to 15.
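The setup described above could look roughly like the sketch below. The parameter values (`ngram_range=(1, 5)`, a linear kernel, `C=15`) come from the description; everything else is an assumption. In particular, `simple_tokenize` is a hypothetical stand-in for the lemmatizing tokenizer (it does no lemmatization, so the example runs without downloading NLTK data), the built-in English stop-word list may differ from the one we used, and the toy reviews are invented for illustration.

```python
import re

from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.pipeline import make_pipeline
from sklearn.svm import SVC


def simple_tokenize(text):
    # Hypothetical stand-in for the custom WordNetLemmatizer-based tokenizer:
    # lowercases the text and keeps runs of letters, without lemmatizing.
    return re.findall(r"[a-z]+", text.lower())


vectorizer = TfidfVectorizer(
    tokenizer=simple_tokenize,
    token_pattern=None,      # unused because a tokenizer is supplied
    ngram_range=(1, 5),      # unigrams up to 5-grams, as described above
    stop_words="english",    # assumption: scikit-learn's built-in list
)

# Linear kernel with C=15, as described above.
model = make_pipeline(vectorizer, SVC(kernel="linear", C=15))

# Invented toy reviews, only to show the pipeline end to end.
train_texts = [
    "A wonderful film with a brilliant cast",
    "Great story and lovely performances",
    "Truly enjoyable from start to finish",
    "A boring plot and terrible dialogue",
    "Dull, slow and painfully predictable",
    "Awful script, wasted two hours",
]
train_labels = ["positive"] * 3 + ["negative"] * 3

model.fit(train_texts, train_labels)
print(model.predict(["a brilliant film with a wonderful cast"]))
```

Wrapping both steps in one pipeline means the vectorizer is fitted only on the training reviews and then reused unchanged on the test reviews.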

Performance
We saw that when we tried to "clean" the positive and negative training sets individually, the accuracy was poor (77%). We then used TF-IDF to give each token in each document a unique score, according to its appearance in negative or positive reviews.

We could do so because TF-IDF is a numerical statistic intended to reflect how important a word is to a document in our text collection. With it we reached a satisfying accuracy of 88%.
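To make the statistic concrete, here is a small worked sketch using the textbook definition, term frequency times log(N / document frequency). The toy documents and the `tf_idf` helper are made up for illustration, and scikit-learn's TfidfVectorizer uses a smoothed IDF with L2 normalization by default, so its numbers will differ from these.

```python
import math
from collections import Counter

# Three tiny, already tokenized documents (invented for illustration).
docs = [
    ["great", "movie", "great", "cast"],
    ["boring", "movie"],
    ["great", "fun"],
]


def tf_idf(term, doc, corpus):
    # Term frequency: share of the document's tokens equal to `term`.
    tf = Counter(doc)[term] / len(doc)
    # Document frequency: number of documents containing `term`.
    df = sum(1 for d in corpus if term in d)
    if df == 0:
        return 0.0
    # Rare terms get a larger inverse document frequency.
    idf = math.log(len(corpus) / df)
    return tf * idf


for term in ["great", "movie", "cast"]:
    print(term, round(tf_idf(term, docs[0], docs), 4))
```

In the first document, "cast" scores higher than "movie" even though both occur once, because "cast" appears in fewer documents overall.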
