Here we compare the performance and cost of four language models (GPT 4, Llama 3, Gemma 2 and Mixtral 8x7b) in the lightweight task of population group curation. Our findings provide insight into potential sustainable curation practices in the presence of limited resources.
Landry et al. (Wed,) studied this question.