What question did this study set out to answer?

This research aims to develop a reinforcement learning model capable of providing real-time navigation cues for individuals with visual impairments using synthetic navigation data.

May 8, 2026Open Access

From synthetic navigation data to real-world mobility cues: Reinforcement learning for sensory substitution in visual impairment

Key Points

This research aims to develop a reinforcement learning model capable of providing real-time navigation cues for individuals with visual impairments using synthetic navigation data.
Developed NavIndoor, a Unity-based environment for generating navigation data.
Trained a blind 'digital twin' using reinforcement learning with progressive difficulty and semantic mask augmentation.
Tested model performance against human players and evaluated its safety prediction accuracy using linear classifiers.
The best model achieved a mean reward of 74.3, which is 73% of the human score.
Attained an AUC of 0.92 on the Active Vision Dataset for safe forward pathway detection.
Successfully guided a blindfolded participant through a 38-meter corridor without collisions.

Abstract

Limited access to large-scale, task-specific navigation data remains a major obstacle to designing lightweight, real-time sensory-substitution devices (SSDs) for blind and low-vision travellers. Current datasets rarely contain collision examples or temporal context, forcing most AI pipelines to rely on generic scene-understanding tasks whose output is too high-dimensional for haptic or auditory feedback. We created NavIndoor , a Unity-based, procedurally-generated maze environment that delivers paired RGB and semantic-segmentation streams together with collision-aware rewards. A head-mounted blind “digital twin” was trained end-to-end with reinforcement learning to estimate a four-way action/value vector. Training used 250 episodes with a progressive increase in difficulty and semantic mask augmentation. Performance was (i) benchmarked against human players in-sim; (ii) probed on five unseen houses from the Active Vision Dataset (AVD) via linear classifiers that predict forward-path safety thresholds; (iii) timed on a Jetson Orin Nano powered with a portable power supply; (iv) evaluated on an early-stage haptic prototype during a proof-of-concept trial with a single blindfolded participant. In-simulation the best model attained a mean reward of 74.3 (73 % of human score). On the Active Vision Dataset (AVD) it achieved an AUC of 0.92 for detecting safe forward pathways and outperformed larger self-supervised or supervised backbones. The estimated value function V θ ( s ) correlated monotonically with the walkable distance, validating its interpretation as a one-dimensional safety signal. The full pipeline ran between 7 and 16 frames per second (FPS) on the Jetson with an end-to-end (image-to-motor) latency of 1.1 s. The model’s outputs were able to guide the blindfolded participant to navigate without collisions in a 38-meter curved corridor. Training on procedurally-generated semantic maps yields compact RL policies whose safety cues transfer directly to real indoor scenes without photorealistic rendering or domain adaptation. The approach enables portable, low-power SSD prototypes and lays the groundwork for forthcoming clinical validation of safe, real-time navigation aids for the visually impaired. • The limited availability of navigation data for visually impaired individuals restricts the development of AI-driven assistive technologies. • NavIndoor is a new software tool that generates procedurally created mazes, enabling the rapid extraction of synthetic, human-like navigation data. • Synthetic virtual-environment data, combined with reinforcement learning, enables real-time extraction of low-dimensional cues for safe pathway identification.

Read Full Paperexternally

Bookmark

View Full Paper

Cite This Study

Sarbout et al. (Fri,) studied this question.

synapsesocial.com/papers/69fd7d94bfa21ec5bbf05e65 https://doi.org/https://doi.org/10.1016/j.array.2026.100861

Bookmark

View Full Paper