What question did this study set out to answer?

This work aims to develop an integrated AI assistive system to enhance the independence of visually impaired users through audio interactions.

March 2, 2026Open Access

AI-powered BlindSpot VisionGuide system on raspberry Pi for enhancing independence of visually impaired users

Key Points

This work aims to develop an integrated AI assistive system to enhance the independence of visually impaired users through audio interactions.
Integrated AI-based system developed for Raspberry Pi.
Included modules for face recognition, image captioning, and reading online newspapers.
Utilized deep facial embeddings for facial recognition and a transformer-based model for image captioning.
Implemented a text-to-speech engine for audio output and interaction.
Evaluated system performance regarding recognition accuracy, response time, and memory consumption.
The system exhibits reliable performance across all modules tested.
Demonstrated effective real-time audio feedback for face recognition.
Provided natural language descriptions for captured images using the BLIP model.
Successfully converted structured news content to speech through its integrated modules.

Abstract

This work describes BlindSpot-VisionGuide, an integrated, AI-based assistive system that aims to empower visually impaired people towards independence through real-time audio interaction. The system incorporates three fundamental capabilities-face recognition, image captioning, and reading online newspapers-into a voice-based platform deployable in Raspberry Pi hardware. The face recognition capability recognizes known people using deep facial embeddings and returns instant voice feedback. The image captioning module uses a transformer-based BLIP model to produce natural language descriptions of scenes captured. The online newspaper module fetches structured news content through APIs and converts it into speech through a text-to-speech engine. The voice interface is centralized for all the modules, enabling users to interact with their surroundings without their hands. The system has been tested for recognition accuracy, response time, and memory consumption on a Raspberry Pi 5. Experiments indicate that the platform operates reliably in all modules, striking a balance between computation and user-friendliness. Optimized for offline use and low-power devices, BlindSpot illustrates the practical applicability of embedded AI towards the creation of inclusive, scalable assistive technology. The authors conclude by noting potential extensions, such as object detection, multi-language support, and caregiver incorporation, making BlindSpot a fundamental model for vision-based accessibility systems of the next generation.

Bookmark

View Full Paper

Cite This Study

Sudha et al. (Fri,) studied this question.

synapsesocial.com/papers/69a528ecf1e85e5c73bf060b https://doi.org/https://doi.org/10.1038/s41598-026-39724-9

Bookmark

View Full Paper