What question did this study set out to answer?

How can a GenAI-powered framework improve the reliability of sentiment labeling in drug safety monitoring?

April 22, 2026 | Open Access

GenAI-Powered Framework for Reliable Sentiment Labeling in Drug Safety Monitoring

Key Points

Proposed a GenAI-powered labeling framework to improve the reliability of sentiment labeling in drug safety monitoring.
Processed 213,869 user-generated drug reviews using a hybrid labeling pipeline.
Employed a Random Forest classifier and 10-fold stratified cross-validation for evaluation.
Conducted cross-source validation on an independent dataset of 4091 reviews.
Achieved a classification accuracy of 96.45% across sentiment categories.
Confirmed consistent performance across all sentiment categories through per-class analysis.
Showed robustness and generalizability through threshold sensitivity analysis.
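
To make the evaluation protocol in the points above concrete, here is a minimal sketch of how stratified folds can be built so that each fold keeps roughly the same class mix as the full dataset. It is an illustration only, not the authors' code: the function name `stratified_folds`, the round-robin assignment, and the fixed seed are assumptions. In practice a library routine such as scikit-learn's `StratifiedKFold` is the usual choice.

```python
import random
from collections import defaultdict

def stratified_folds(labels, k=10, seed=0):
    """Assign sample indices to k folds with near-equal class proportions.

    Hypothetical helper for illustration; not taken from the paper.
    """
    rng = random.Random(seed)

    # Group sample indices by their sentiment label.
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)

    folds = [[] for _ in range(k)]
    for indices in by_class.values():
        rng.shuffle(indices)
        # Deal each class round-robin so every fold gets a near-equal share.
        for position, idx in enumerate(indices):
            folds[position % k].append(idx)
    return folds
```

Each fold then serves once as the held-out test set while the remaining nine are used for training, and the reported accuracy is the average over the ten runs.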

Abstract

The analysis of medical data presents an opportunity for healthcare systems to support decision-making and improve patient outcomes. In this context, the automated analysis of user-generated drug reviews offers a promising approach for monitoring medication safety, understanding patient experiences, and detecting potential adverse effects in real time. This study advances sentiment analysis for pharmacovigilance by introducing a data-centric framework that incorporates a GenAI-powered labeling system for reliable and interpretable data annotation. A corpus of 213,869 user-generated drug reviews was processed through a hybrid labeling pipeline that reconciles user ratings, lexicon-based polarity, zero-shot transformer predictions, and GPT-5.2 as a fallback mechanism. This strategy enables the resolution of sentiment ambiguity, particularly the frequent misalignment between user-assigned ratings and underlying textual sentiment, by leveraging contextual understanding rather than relying solely on numerical scores. Drug review representations are enhanced using the Qwen3-Embedding-0.6B model, allowing improved capture of semantic nuances. Evaluated through 10-fold stratified cross-validation, the proposed labeling framework combined with a Random Forest classifier achieves a classification accuracy of 96.45%, with per-class analysis confirming consistent performance across all sentiment categories. Cross-source validation on an independent drug review dataset of 4091 reviews and a threshold sensitivity analysis further support the robustness and generalizability of the proposed approach.
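
The abstract does not spell out how the four signals are reconciled, so the following is only a sketch of one plausible rule: accept a label when at least two of the three inexpensive signals agree, and defer to the fallback model otherwise. The 1 to 10 rating scale, the thresholds, the majority rule, and the names `rating_to_label` and `reconcile` are all assumptions made for illustration. The `fallback` argument stands in for a call to a large language model and is passed as a plain function so the example stays self-contained.

```python
from collections import Counter
from typing import Callable, Tuple

def rating_to_label(rating: float, low: float = 4.0, high: float = 7.0) -> str:
    """Map a user rating to a coarse sentiment label.

    Assumes a 1-10 scale; the thresholds are illustrative, not from the paper.
    """
    if rating <= low:
        return "negative"
    if rating >= high:
        return "positive"
    return "neutral"

def reconcile(
    rating: float,
    lexicon_label: str,
    zero_shot_label: str,
    text: str,
    fallback: Callable[[str], str],
) -> Tuple[str, str]:
    """Return (label, source) using an assumed majority-vote rule."""
    votes = [rating_to_label(rating), lexicon_label, zero_shot_label]
    label, count = Counter(votes).most_common(1)[0]
    if count >= 2:
        # At least two of the three signals agree, so accept their label.
        return label, "agreement"
    # All three signals disagree: defer to the fallback model on the raw text.
    return fallback(text), "fallback"
```

A review rated 9 whose text is scored negative by the lexicon and neutral by the zero-shot model would be routed to the fallback under this rule, which is the kind of rating-versus-text mismatch the abstract describes.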

Cite This Study

Vouzis et al. studied this question.

synapsesocial.com/papers/69e865926e0dea528ddea0f2 https://doi.org/10.3390/app16083942