Skip to content

Comprehensive Sentiment Analysis of Movie Reviews using models from LogisticRegression to Bert

License

Notifications You must be signed in to change notification settings

SuryaPradeepM/Comprehensive-Sentiment-Analysis-of-Movie-Reviews-IMDB-dataset

Repository files navigation

Automated Sentiment Analysis of Movie Reviews using various approaches including sklearn models, keras models & transfer learning

The goal for this analysis is to predict if a review rates the movie positively or negatively. Inside this dataset, there are 25,000 labelled movies reviews for training, 50,000 unlabeled reviews for training, and 25,000 reviews for testing.

Notebooks

  1. Exploration and Preprocessing
  2. Base Models (Logistic Regression, Multinomial NB)
  3. Keras Models
  4. PyTorch RNN Model
  5. BERT Fine Tuned Model

Dataset

Accuracies Achieved:

  • Logistic Regression | 90.79 %
  • Support Vector Machine | 91.08 %
  • Multinomial Naive Bayes | 91.32 %
  • Simple Neural Net Keras | 92.83 %
  • RNN LSTM PyTorch | 86.04 %
  • BERT Fine Tuning | 91.68 %

WordClouds

Positive Reviews WordCloud

Positive WordCLoud

Negative Reviews WordCloud

Negative WordCLoud

About

Comprehensive Sentiment Analysis of Movie Reviews using models from LogisticRegression to Bert

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published