Key points are not available for this paper at this time.
Code completion is commonly used by software developers and is integrated into all major IDE's. Good completion tools can not only save time and effort but may also help avoid incorrect API usage. Many proposed completion tools have shown promising results on synthetic benchmarks, but these benchmarks make no claims about the realism of the completions they test. This lack of grounding in real-world data could hinder our scientific understanding of developer needs and of the efficacy of completion models. This paper presents a case study on 15,000 code completions that were applied by 66 real developers, which we study and contrast with artificial completions to inform future research and tools in this area. We find that synthetic benchmarks misrepresent many aspects of real-world completions; tested completion tools were far less accurate on real-world data. Worse, on the few completions that consumed most of the developers' time, prediction accuracy was less than 20% -- an effect that is invisible in synthetic benchmarks. Our findings have ramifications for future benchmarks, tool design and real-world efficacy: Benchmarks must account for completions that developers use most, such as intra-project APIs; models should be designed to be amenable to intra-project data; and real-world developer trials are essential to quantifying performance on the least predictable completions, which are both most time-consuming and far more typical than artificial data suggests. We publicly release our preprint https://doi.org/10.5281/zenodo.2565673 and replication data and materials https://doi.org/10.5281/zenodo.2562249.
Building similarity graph...
Analyzing shared references across papers
Loading...
Hellendoorn et al. (Wed,) studied this question.
synapsesocial.com/papers/6a08cd525686deba6901f26c — DOI: https://doi.org/10.1109/icse.2019.00101
Vincent J. Hellendoorn
Google (United States)
Sebastian Proksch
Tokyo Institute of Technology
Harald C. Gall
University of Zurich
University of California, Davis
University of Zurich
Building similarity graph...
Analyzing shared references across papers
Loading...
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: