GitHub - SuryaPradeepM/Comprehensive-Sentiment-Analysis-of-Movie-Reviews-IMDB-dataset: Comprehensive Sentiment Analysis of Movie Reviews using models from LogisticRegression to Bert

Automated Sentiment Analysis of Movie Reviews using several approaches, including scikit-learn models, Keras models, and transfer learning

The goal of this analysis is to predict whether a review rates a movie positively or negatively. The dataset contains 25,000 labeled movie reviews for training, 50,000 unlabeled reviews for training, and 25,000 reviews for testing.

IMDB movie reviews dataset
http://ai.stanford.edu/~amaas/data/sentiment
https://www.kaggle.com/lakshmi25npathi/imdb-dataset-of-50k-movie-reviews
Contains 25,000 positive and 25,000 negative reviews
Contains at most reviews per movie
At least 7 stars out of 10 → positive (label = 1)
At most 4 stars out of 10 → negative (label = 0)
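
The labeling rule above can be sketched as a small function. This is a minimal illustration, not code from the notebooks: the name `label_from_stars` is hypothetical, and returning `None` for 5 or 6 stars is an assumption based on those ratings falling under neither threshold.

```python
from typing import Optional


def label_from_stars(stars: int) -> Optional[int]:
    """Map a 1-10 star rating to a binary sentiment label.

    Returns 1 (positive) for 7 stars or more, 0 (negative) for 4 stars
    or fewer, and None for 5 or 6 stars, which neither rule covers.
    """
    if not 1 <= stars <= 10:
        raise ValueError(f"stars must be between 1 and 10, got {stars}")
    if stars >= 7:
        return 1
    if stars <= 4:
        return 0
    # Assumption: ratings in the middle of the scale get no label.
    return None


# Label a few example ratings.
for rating in (10, 7, 5, 4, 1):
    print(rating, label_from_stars(rating))
```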

Notebooks

Dataset

The data used for this problem can be found at: https://www.kaggle.com/lakshmi25npathi/imdb-dataset-of-50k-movie-reviews
The preprocessed data can be found at: https://drive.google.com/file/d/1-KrTwLg3b2NcHFafK_lrKeyTvUK3NhiL/view?usp=sharing
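
As a rough sketch of how the downloaded data might be read, the snippet below parses CSV rows into `(text, label)` pairs using only the standard library. It assumes the Kaggle file has a `review` column and a `sentiment` column holding `positive` or `negative`. Check the header of your copy before relying on this. The helper name `read_reviews` and the two sample rows are made up for illustration.

```python
import csv
import io
from typing import Iterable, List, Tuple

# Assumed encoding of the sentiment column: 1 = positive, 0 = negative.
LABELS = {"positive": 1, "negative": 0}


def read_reviews(lines: Iterable[str]) -> List[Tuple[str, int]]:
    """Parse CSV lines with 'review' and 'sentiment' columns into (text, label) pairs."""
    reader = csv.DictReader(lines)
    pairs = []
    for row in reader:
        label = LABELS[row["sentiment"].strip().lower()]
        pairs.append((row["review"], label))
    return pairs


# Tiny in-memory sample standing in for the real file (illustrative rows only).
sample = io.StringIO(
    "review,sentiment\n"
    '"A great, moving film.",positive\n'
    '"Dull and far too long.",negative\n'
)
pairs = read_reviews(sample)
print(pairs)
```

To read a real file, pass an open file object instead, for example `open(path, newline="", encoding="utf-8")`, where `path` points at your local copy.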

Accuracies Achieved:

Model | Accuracy
Logistic Regression | 90.79%
Support Vector Machine | 91.08%
Multinomial Naive Bayes | 91.32%
Simple Neural Net Keras | 92.83%
RNN LSTM PyTorch | 86.04%
BERT Fine Tuning | 91.68%
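
For reference, accuracy is the fraction of reviews whose predicted label matches the true label. The sketch below computes it on made-up labels. It assumes the figures above are plain classification accuracy on held-out reviews, which the README does not state explicitly, and the label lists are illustrative, not results from the notebooks.

```python
from typing import Sequence


def accuracy(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Return the fraction of predictions that equal the true labels."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("cannot compute accuracy on empty input")
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


# Hypothetical labels: 1 = positive, 0 = negative.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]
score = accuracy(y_true, y_pred)
print(f"{score:.2%}")  # 6 of 8 predictions match
```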

WordClouds

Positive Reviews WordCloud

Negative Reviews WordCloud
