Key points are not available for this paper at this time.
This research explores a psychometric method for analyzing textual data using contextualembeddings generated by large language models. The proposed method leveragescontextual embeddings to create contextual scores, which are then used as input to createfactor analysis models. By treating documents as individuals and words as items, thisapproach provides a natural psychometric interpretation, unveiling a vast, unexploredlandscape for future research. It assumes that certain words, whose contextual meaningsvary significantly across documents, can be employed to differentiate documents within acorpus. The modeling process consists of two stages: obtaining contextual scores andperforming factor analysis. The first stage involves utilizing natural language processingtechniques and encoder-based transformer models to find common keywords and generatecontextual scores. The second stage involves employing different types of factor analysis toextract and define factors, obtain factor correlations, and identify top words of the factors.Tested on the Wiki STEM corpus, the experimental results demonstrate the new method’schances and challenges for analyzing textual data by combining both large language modelsand psychometric modeling.
Building similarity graph...
Analyzing shared references across papers
Loading...
Jinsong Chen (Fri,) studied this question.
www.synapsesocial.com/papers/68e572d6b6db64358751387e — DOI: https://doi.org/10.31219/osf.io/2nfts
Jinsong Chen
Building similarity graph...
Analyzing shared references across papers
Loading...