What does this research mean for the field?

A hybrid framework integrating TF-IDF, VADER lexicon features, and IGM weightings with advanced oversampling and ensemble models substantially improves the detection of minority negative sentiments in highly imbalanced online game reviews. Novelty: ClaimNovelty.METHODOLOGICAL. Consensus alignment: ConsensusAlignment.NEUTRAL.

What question did this study set out to answer?

The aim is to develop a model for better sentiment analysis of imbalanced online game reviews.

April 15, 2026Open Access

View Full Paper

A Hybrid Feature-Weighting and Resampling Model for Imbalanced Sentiment Analysis in User Game Reviews

Key Points

The aim is to develop a model for better sentiment analysis of imbalanced online game reviews.
Integrated TF-IDF and VADER lexicon features with IGM weightings.
Utilized advanced oversampling methods like ADASYN and Borderline-SMOTE.
Employed ensemble models including XGBoost and LightGBM.
Analyzed a large dataset of Steam game reviews.
Significant improvement in detecting negative sentiments was achieved.
The framework effectively addressed the classification challenges associated with imbalanced feedback.

Abstract

Sentiment analysis of online game reviews has increasingly become important in understanding player experiences and supporting data-driven game development. However, research in this domain has continuously faced two unresolved challenges: (1) the extreme imbalance between positive and negative feedback, and (2) the inefficiency of existing feature-weighting schemes in capturing sentiment signals embedded in informal gaming discourses. Prior works demonstrated that negative feedback—though a few in number are highly influential—usually contain richer emotional content and longer textual structures; yet, prevailing classification models often perform poorly for these minorities (i.e., negative feedback). Numerous studies explored multimodal imbalance issues, class imbalance in cross-lingual ABSA (Aspect-Based Sentiment Analysis), reinforcement-learning-based architectures for imbalanced extraction tasks, and oversampling strategies like SMOTE (Synthetic Minority Over-sampling Technique) variants. Few investigations specifically addressed imbalanced sentiment classification in the contexts of online game reviews, where user-generated content exhibits unique lexical, structural, and emotional characteristics. To address these gaps, this study integrated TF-IDF (Term Frequency-Inverse Document Frequency), VADER (Valence Aware Dictionary and Sentiment Reasoner) lexicon features, and IGM (Inverse Gravity Moment) weightings with advanced oversampling methods such as ADASYN (Adaptive Synthetic Sampling Approach for Imbalanced Learning) and Borderline-SMOTE to improve the detection of minority sentiment classes. Ensemble models, including XGBoost (Extreme Gradient Boosting) and LightGBM (Light Gradient-Boosting Machine), were further employed to enhance the robustness of imbalance. Using a large-scale dataset of Steam game reviews, the proposed framework demonstrated substantial improvement in identifying negative sentiments, addressing a critical limitation in the existing computational game-analysis literature, and advancing the modeling for detecting the emotion-rich but imbalance-prone user feedback.

Ask AI

Helpful

Bookmark

View Full Paper

Ask AI

Helpful

Bookmark

View Full Paper

A Hybrid Feature-Weighting and Resampling Model for Imbalanced Sentiment Analysis in User Game Reviews

Key Points

Abstract

Cite This Study