Key points are not available for this paper at this time.
Large language models (LLMs) have shown remarkable abilities to generate code. However, their ability to develop software for physical computing and embedded systems, which requires cross-domain hardware and software knowledge, has not been thoroughly studied. We observe through our experiments and a 15-user pilot study that even when LLMs fail to produce working code, they can generate helpful reasoning about embedded design tasks, as well as specific debugging suggestions for both novice and expert developers. These results highlight the potential to develop AI assistants to dramatically lower the barrier to entry for working with hardware. To evaluate the capabilities and limitations of LLMs, we develop an automated testbench to quantify LLM performance on embedded programming tasks and perform 450 trials. We leverage these findings to analyze how programmers interact with these tools including their productivity and sense of fulfillment and outline a human-AI collaborative workflow for developing and debugging embedded systems.
Building similarity graph...
Analyzing shared references across papers
Loading...
Zachary Englhardt
Richard Li
Dilini Nissanka
University of Washington
Building similarity graph...
Analyzing shared references across papers
Loading...
Englhardt et al. (Sat,) studied this question.
www.synapsesocial.com/papers/68e6a891b6db64358762b980 — DOI: https://doi.org/10.1145/3613905.3650764