We consider the problem of classifying documents not by topic, but by overall sentiment, e.g., determining whether a review is positive or negative. Using movie reviews as data, we find that standard machine learning techniques definitively outperform human-produced baselines. However, the three machine learning methods we employed (Naive Bayes, maximum entropy classification, and support vector machines) do not perform as well on sentiment classification as on traditional topic-based categorization. We conclude by examining factors that make the sentiment classification problem more challenging.
Building similarity graph...
Analyzing shared references across papers
Loading...
Bo Pang
Tianjin University of Technology
Lillian Lee
Cornell University
Shivakumar Vaithyanathan
IBM (United States)
Cornell University
IBM Research - Almaden
Building similarity graph...
Analyzing shared references across papers
Loading...
Pang et al. (Tue,) studied this question.
synapsesocial.com/papers/69d8c02017a1cc0598d18216 — DOI: https://doi.org/10.3115/1118693.1118704