Key points are not available for this paper at this time.
Current lexica and machine learning based sentiment analysis approaches still suffer from a two-fold limitation. First, manual lexicon construction and machine training is time consuming and error-prone. Second, the prediction’s accuracy entails sentences and their corresponding training text should fall under the same domain. In this article, we experimentally evaluate four sentiment classifiers, namely Support Vector Machines, Naive Bayes, Logistic Regression and Random Forest. We quantify the quality of each of these models using three real-world datasets that comprise 50,000 movie reviews, 10,662 sentences, and 300 generic movie reviews. Specifically, we study the impact of a variety of natural language processing (NLP) pipelines on the quality of the predicted sentiment orientations. Additionally, we measure the impact of incorporating lexical semantic knowledge captured by WordNet on expanding original words in sentences. Findings demonstrate that the utilizing different NLP pipelines and semantic relationships impacts the quality of the sentiment analyzers. In particular, results indicate that coupling lemmatization and knowledge-based n-gram features proved to produce higher accuracy results. With this coupling, the accuracy of the support vector machine (SVM) classifier has improved to 90.43%, while it was 86.83%, 90.11%, 86.20%, respectively using the three other classifiers.
Building similarity graph...
Analyzing shared references across papers
Loading...
Mohammed Maree
Arab American University
Mujahed Eleyat
Arab American University
Shatha Rabayah
IAES International Journal of Artificial Intelligence
Université Claude Bernard Lyon 1
Université de Lyon
Arab American University
Building similarity graph...
Analyzing shared references across papers
Loading...
Maree et al. (Sun,) studied this question.
synapsesocial.com/papers/6a1bf37c00ee29383e9d331e — DOI: https://doi.org/10.11591/ijai.v12.i1.pp284-294