April 14, 2024Open Access

Mutation-based Consistency Testing for Evaluating the Code Understanding Capability of LLMs

Puntos clave

Los puntos clave no están disponibles para este artículo en este momento.

Resumen

Large Language Models (LLMs) have shown remarkable capabilities in processing both natural and programming languages, which have enabled various applications in software engineering, such as requirement engineering, code generation, and software testing. However, existing code generation benchmarks do not necessarily assess the code understanding performance of LLMs, especially for the subtle inconsistencies that may arise between code and its semantics described in natural language.

Leer artículo completoexternamente

Me gusta

Guardar

Ver artículo completo

Cite This Study

Li et al. (Sun,) studied this question.

synapsesocial.com/papers/68e6f3b2b6db64358766e6e1 https://doi.org/https://doi.org/10.1145/3644815.3644946

Me gusta

Guardar

Ver artículo completo