What question did this study set out to answer?

To develop an intuitive interface that allows optimal planning for autonomous systems using minimal user input in uncertain environments.

March 30, 2026Open Access

Data sparse inference of operator spatial reward models in uncertain environments

Key Points

To develop an intuitive interface that allows optimal planning for autonomous systems using minimal user input in uncertain environments.
Developed a multimodal algorithmic and visual interface called RINAO.
Integrated geographic database information with user-defined features for mission-specific planning.
Validated the approach with experts in public safety to assess usability and effectiveness.
The system showed effective alignment of geospatial interests.
User evaluations indicated above-average usability.
Our approach performed inference faster and with less input than the inverse reinforcement learning baseline.

Abstract

Human–machine teaming allows people to leverage the impressive capabilities of autonomous robotic teammates to safely accomplish challenging tasks. Although users may be experts in their fields, robotic interfaces need to be intuitive to the general population and able to quickly interpret minimal user input from multiple modalities in directing autonomous teammates toward key locations for information-based tasking. This work presents a flexible multimodal algorithmic and visual interface that enables dynamic reprogramming of autonomous planning algorithms, focusing on the use of uncrewed aerial systems engaged in outdoor search and rescue. The Responsive Interface for iNtuitive Aircraft Operation (RINAO) leverages known geographic database information, such as trail networks, in conjunction with a variable set of user-defined features, such as search areas and landmarks, to efficiently infer a mission-specific, uncertainty-aware geospatial interest distribution that informs optimal planning algorithms through reward shaping. The approach is validated using 10 experts in public safety with 13.5 years of median operational experience. Results of this user evaluation show that the system enables effective and efficient alignment of geospatial interest and above-average usability. Evaluating the system’s performance against an inverse reinforcement learning (IRL) baseline, we find that our approach meets or exceeds the baseline’s value alignment while performing inference in substantially less time and with less user input. These results demonstrate that multimodal preference inference can enable rapid and intuitive mission specification for human—robot teams operating in time-critical environments.

Bookmark

View Full Paper

Bookmark

View Full Paper

Data sparse inference of operator spatial reward models in uncertain environments

Key Points

Abstract

Cite This Study