Key points are not available for this paper at this time.
ChatGPT has shown the potential of emerging general artificial intelligence capabilities, as it has demonstrated competent performance across many natural language processing tasks. In this work, we evaluate the capabilities of ChatGPT to perform text classification on three affective computing problems, namely, big-five personality prediction, sentiment analysis, and suicide tendency detection. We utilize three baselines, a robust language model (RoBERTa-base), a legacy word model with pretrained embeddings (Word2Vec), and a simple bag-of-words (BoW) baseline. Results show that the RoBERTa model trained for a specific downstream task generally has a superior performance. On the other hand, ChatGPT provides decent results and is relatively comparable to the Word2Vec and BoW baselines. ChatGPT further shows robustness against noisy data, where the Word2Vec model achieves worse results due to noise. Results indicate that ChatGPT is a good generalist model that is capable of achieving good results across various problems without any specialized training; however, it is not as good as a specialized model for a downstream task.
Building similarity graph...
Analyzing shared references across papers
Loading...
Mostafa M. Amin
University of Augsburg
Erik Cambria
University of Stirling
Björn W. Schuller
University of Notre Dame
IEEE Intelligent Systems
Imperial College London
Nanyang Technological University
University of Augsburg
Building similarity graph...
Analyzing shared references across papers
Loading...
Amin et al. (Wed,) studied this question.
synapsesocial.com/papers/69e0026cf3745b374825579d — DOI: https://doi.org/10.1109/mis.2023.3254179
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: