May 1, 2019

When Code Completion Fails: A Case Study on Real-World Completions

Vincent J. Hellendoorn (Google, United States), Sebastian Proksch (Tokyo Institute of Technology), Harald C. Gall (University of Zurich)

Abstract

Code completion is commonly used by software developers and is integrated into all major IDEs. Good completion tools can not only save time and effort but may also help avoid incorrect API usage. Many proposed completion tools have shown promising results on synthetic benchmarks, but these benchmarks make no claims about the realism of the completions they test. This lack of grounding in real-world data could hinder our scientific understanding of developer needs and of the efficacy of completion models. This paper presents a case study on 15,000 code completions that were applied by 66 real developers, which we study and contrast with artificial completions to inform future research and tools in this area. We find that synthetic benchmarks misrepresent many aspects of real-world completions; tested completion tools were far less accurate on real-world data. Worse, on the few completions that consumed most of the developers' time, prediction accuracy was less than 20%, an effect that is invisible in synthetic benchmarks. Our findings have ramifications for future benchmarks, tool design and real-world efficacy: benchmarks must account for completions that developers use most, such as intra-project APIs; models should be designed to be amenable to intra-project data; and real-world developer trials are essential to quantifying performance on the least predictable completions, which are both most time-consuming and far more typical than artificial data suggests. We publicly release our preprint (https://doi.org/10.5281/zenodo.2565673) and replication data and materials (https://doi.org/10.5281/zenodo.2562249).

Cite This Study

Hellendoorn et al. (2019) studied this question.

synapsesocial.com/papers/6a08cd525686deba6901f26c https://doi.org/10.1109/icse.2019.00101
