In recent years, diffusion models have made remarkable success in generating realistic human motions. However, existing robot pose-learning approaches are largely focused on single-task and one-to-one scenarios, failing to account for multi-person social interactions. This limitation leads to rigid, context-insensitive behaviors that are ill-suited for real-world service scenarios. Consequently, current systems often produce robotic behaviors incapable of the fluidity and responsiveness expected in human-centered environments, a shortcoming underscored by affordance theory in robotics. To address this issue, we propose RoboActor, an innovative human-robot interaction behavior planner, draws inspiration from theatrical acting to orchestrate both deliberate and automatic actions. Our framework leverages Large Language Models (LLMs) to disentangle primary command-driven tasks from secondary, context-induced subtasks. By this means, RoboActor generates lifelike and socially appropriate behaviors in multi-person settings, significantly enhancing the naturalness, engagement, and realism of service robots in everyday social applications.
Chen et al. (Sun,) studied this question.