What question did this study set out to answer?

The aim is to enhance on-device generative AI applications by managing thermal constraints effectively.

January 22, 2026Open Access

Sustainable Edge Intelligence: An Adaptive Thermal Scheduling Architecture for On-Device Generative AI and Neural Speech Synthesis

Key Points

The aim is to enhance on-device generative AI applications by managing thermal constraints effectively.
Introduced an Adaptive Thermal Scheduling Architecture.
Analyzed the Duty Cycle Manager for micro-pausing during token generation.
Detailed the Energy-Aware State Machine for downclocking background threads during peak inference.
Conducted long-duration stress tests to evaluate effectiveness.
Reduced peak device temperature by 12%.
Extended continuous inference time by 40%.
Improved user experience by preventing thermal throttling.

Abstract

Abstract: The deployment of Generative AI on mobile devices is severely constrained by the thermodynamic limits of passive cooling. Prolonged inference sessions, particularly for Large Language Models (LLMs) and Neural Text-to-Speech (TTS), frequently trigger thermal throttling, degrading user experience and shortening battery life. This technical report introduces a novel "Adaptive Thermal Scheduling Architecture" designed to decouple neural processing from thermal saturation. We analyze the implementation of a "Duty Cycle Manager" that introduces imperceptible micro-pauses between token generation bursts, allowing for rapid heat dissipation without breaking conversational flow. Furthermore, we detail an "Energy-Aware State Machine" that dynamically downclocks non-essential background threads during peak inference. Experimental data from long-duration stress tests demonstrates that this architecture reduces peak device temperature by 12% while extending continuous inference time by 40% compared to standard execution. These engineering optimizations provide a sustainable pathway for deploying always-on, empathetic AI companions on consumer hardware, aligning with green computing principles.

Sustainable Edge Intelligence: An Adaptive Thermal Scheduling Architecture for On-Device Generative AI and Neural Speech Synthesis

Key Points

Abstract

Cite This Study