What question did this study set out to answer?

The study aims to design a real-time Visual AI Agent optimized for offline performance on the NVIDIA Jetson Orin Nano.

June 3, 2026

Off-line visual AI agent on NVIDIA Jetson Orin Nano

Key Points

The study aims to design a real-time Visual AI Agent optimized for offline performance on the NVIDIA Jetson Orin Nano.
Developed a system utilizing YOLOv8 for object detection, BLIP for image captioning, and Places365 for scene recognition.
Integrated GPT-4V for scene understanding and optimized for offline, GPU-accelerated inference with ONNX models.
Conducted experiments to validate the system's efficiency in various applications including smart surveillance.
Achieved real-time object detection with high accuracy and rapid contextual captioning.
Validated effective scene labeling capabilities in various surveillance scenarios.
Demonstrated the Jetson Orin Nano as a suitable platform for privacy-preserving embedded vision systems.

Abstract

This paper presents the design and implementation of a real-time Visual AI Agent on the NVIDIA Jetson Orin Nano. The system integrates YOLOv8 for object detection, BLIP for image captioning, and Places365 for contextual scene recognition, forming a robust pipeline capable of not only detecting objects in video streams but also describing their context in natural language. Initially leveraging GPT-4V for rich scene understanding, we optimized our solution for a fully off-line, GPU-accelerated inference with ONNX models. Our experiments demonstrate real-time object detection, rapid contextual captioning, and accurate scene labelling, validating the Jetson Orin Nano as an effective edge AI platform for smart surveillance and assistive technologies. The proposed system demonstrates real-world applicability in smart surveillance environments, assistive navigation tools, and privacy-preserving embedded vision systems.

Bookmark

Cite This Study

Maestre et al. (Mon,) studied this question.

synapsesocial.com/papers/6a1fc696dee9eb8c0dce78dc https://doi.org/https://doi.org/10.1049/icp.2026.1959

Bookmark