Key points are not available for this paper at this time.
Large language models have become increasingly utilized in programming contexts. However, due to the recent emergence of this trend, some aspects have been overlooked. We propose a research approach that investigates the inner mechanics of transformer networks, on a neuron, layer, and output representation level, to understand whether there is a theoretical limitation that prevents large language models from performing optimally in a multilingual setting. We propose to approach the investigation into the theoretical limitations, by addressing open problems in machine learning for the software engineering community. This will contribute to a greater understanding of large language models for programming-related tasks, making the findings more approachable to practitioners, and simply their implementation in future models.
Building similarity graph...
Analyzing shared references across papers
Loading...
Jonathan Katzy (Sun,) studied this question.
www.synapsesocial.com/papers/68e6f4b7b6db64358766f063 — DOI: https://doi.org/10.1145/3639478.3639787
Jonathan Katzy
Delft University of Technology
Building similarity graph...
Analyzing shared references across papers
Loading...