This study presents a comparative analysis of machine learning and deep learning algorithms for sentiment classification in movie reviews. Three benchmark datasets—IMDb (50K and 20K reviews) and Rotten Tomatoes—were used to evaluate six classifiers: Na¨ıve Bayes, Logistic Regression, Support Vector Machine, Decision Tree, Random Forest, and BERT. Preprocessing included tokenization, stop-word removal, stemming, and feature extraction using Count Vectorizer and TF-IDF. Evaluation metrics such as accuracy, precision, recall, sensitivity, specificity, and F1-score were used to assess model performance. Logistic Regression achieved 88% accuracy on the IMDb dataset, while Random Forest exhibited the highest specificity. BERT outperformed traditional models in both accuracy and F1-score across all datasets, particularly in handling informal and context-heavy language. The results highlight the impact of dataset characteristics on classification performance and provide insights for deploying sentiment analysis in real-world applications like recommendation systems and audience profiling.
Building similarity graph...
Analyzing shared references across papers
Loading...
Mirza Jahanzaib
Md. Shafiur Raihan Shafi
Saeed Hossain Moheb
Building similarity graph...
Analyzing shared references across papers
Loading...
Jahanzaib et al. (Thu,) studied this question.
www.synapsesocial.com/papers/68af4eb9ad7bf08b1ead7a05 — DOI: https://doi.org/10.64494/jfst/v3i1/jm/2025/07/147-170