The automotive industry is a representative of the manufacturing industry, as the demand has shifted to customization, and the production system faces severe challenges of multi-variety, small batch, and personalized customization. Under this background, the industrial robot, as the core execution unit in manufacturing, has undergone a profound transformation from a single repetitive task to multi-task collaboration, automation, and intelligence since it first moved from the laboratory to the production line in 1961, driven by rising labor costs and safety factors. Nowadays, modern automobile assembly lines usually deploy thousands of industrial robots, aiming at multi-dimensional optimization of scale, reliability, and cost.However, although robot technology has made great progress, the traditional robot system still faces many bottlenecks in the process of stepping into the era of embodied intelligence. For now, the automotive production systems always rely on static environment assumptions, exhibiting an ability of insufficient generalization, especially in autonomous decision making when handling unstructured dynamic disturbances (Xie et al., 2024), such as mixed-model production lines, frequent process changes, and material shortages (Gao et al., 2022). Traditional control strategies achieved millimeter level precision (Wang et al., 2023), and it is a struggle to keep the millisecond-and second-level of real time dynamic responses, required by the actual production system, and exhibits a poor robustness (Zhang et al., 2025). Furthermore, the traditional frame-driven vision systems often suffer from issues of latency, consuming nearly half the total energy of a robot under the dual carbon goals.Nevertheless, breakthroughs in neurorobotics offer new ways to overcome these challenges. Traditional industrial intelligence is limited by environmental models and relies on static, hard-coded logic, whereas embodied intelligence achieves smart perception, decision-making, and execution in robots through dynamic interaction with environment, combined with nonlinear modeling in neurorobotics and feature learning driven by data (Andrea and Alessandro, 2025;Hu et al., 2025). In recent years, the framework of deep learning has been widely applied to path planning (Sun et al., 2024), visual inspection, and production process optimization (Liu et al., 2025). In contrast, while traditional optimization techniques face limitations when processing massive heterogeneous data, neuromorphic sensing systems (Zhao et al., 2022) and the control strategies incorporating physical information embedding (Eisa et al., 2025) demonstrate great potential, which achieved reducing energy consumption and enhancing system robustness. Figure 1 shows the evolution of industrial robots in automotive manufacturing. This study conducted a systematic search of academic literature published between January 2021 and December 2025. We searched major academic databases such as IEEE Xplore, ScienceDirect, ACM Digital Library, SpringerLink, arXiv, and PubMed. The search topics encompassed neural model and automotive manufacturing. A total of 253 relevant records were obtained in the initial search. Through progressive screening of titles and abstracts, after removing duplicates and studies unrelated to automotive manufacturing scenarios, 29 articles were selected as the analysis objects of this mini review. Current literature lacks research on automotive manufacturing scenarios from the perspective of embodied intelligence, while focusing on how neuro-mechanistic approaches reconstruct the perception, decision, and action process of robots. The following chapters are organized as follows.We review neuromorphic sensing and event-driven environment modeling techniques, discuss how to achieve efficient spatial and semantic representation by addressing pixel redundancy in Section 2. Section 3 analyzes neural control and planning with high dynamic adaptability, covering biologically inspired reinforcement learning, neurodynamic models, and adaptive strategies for physical information embedding. Section 4 focuses on the human-machine integration under green low-carbon manufacturing and industry 5.0 and discusses the research trends of energy efficiency optimization algorithms, model lightweight technology, and mutual cognitive human-machine collaboration architecture. Section 5 summarizes the challenges faced by the current research and looks forward to the future evolution direction of embodied intelligence in the field of automobile manufacturing. Section 6 concludes this study.For automobile manufacturing of flexible flow line and frequent process changes, traditional environmental modeling based on geometric models, point cloud processing, or predefined states are inadequate. Leveraging knowledge graphs and graph neural networks (GNNs) to achieve semantic representation of the environment has become a research hotspot (Zhuang et al., 2023;Karthikeyan and Subashini, 2021). Systems infer latent causal relationships between equipment by modeling workshop states as graph structures, and then generating global decisions. Concurrently, digital twin technology integrating real-time data from physical entities into high-fidelity simulation environments, enables real-time monitoring (Song, 2023;Fatmah et al., 2025). There are some limitations in generalization of unstructured features, prompting exploration in few-shot learning, domain adaptation, and unsupervised frameworks. Nevertheless, nature of black-box possessed by neural networks conflicts with reliability demands of automotive manufacturing. Physics-informed neural networks (PINNs) embed physical laws as constraints within loss functions, exhibited a superior robustness in closed loop mechanical systems compared to traditional models (Eisa et al., 2025). Visual inspection always performed redundant computations on every pixel in each frame in traditional painting, in which over 95% of pixels remain static between frames. This rate-encoding typically resulted in high energy consumption and latency. Addressing this issue, researchers proposed a faster R-CNN-based visual system, which demonstrated giant efficiency advantages in automotive body painting defect detection (Rawat et al., 2021). However, there is some controversy about its scalability in complex automotive manufacturing scenarios. For instance, while the proposed model reduces pixel redundancy effectively, certain training methods face severe vanishing gradient issues. This leads to a decline in accuracy when handling long sequence tasks with high semantic complexity. Perception facilitates the evolution from image acquisition to interaction between the subject and environment, it will accelerate the transition from full computation to sparse perception.Automotive production environments possess strong dynamics like emergency order insertions, equipment failures, and process changes, while traditional physics-based strategies struggle to balance dynamic responsiveness and system robustness, when confronted with random disturbances (Julian et al., 2024). In the issue of scheduling problems, traditional heuristic algorithms easily get stuck in local optima caused by high-dimensional state space, while reinforcement learning algorithms based on Q-learning showed a superior performance (Waseem and Chang, 2023). Moreover, for task planning of collaborative robots, a deep reinforcement learning (DRL)-based framework was proposed, incorporating multi-attribute hierarchical task networks and a revival mechanism, achieving task completion time reduction in complex assembly scenarios. (Hou et al., 2024). However, the randomness of the framework in decision-making process still conflicts with the certainty requirement for actual automotive production. Industrial robots shall possess brain-like decision flexibility, rather than relying on pre-programmed rigid body dynamics. Considering issue of speed enhancement in data communication, Petrut et al., 2021 conducted a strategic development of the brainstem model, a model of the cerebellum was simulated on brain-inspired hardware, the verification was conducted through the SpiNNaker brain-like system and passed the benchmark verification of NEST software, results demonstrated that the model reduced the peak communication load by 41%. Furthermore, addressing nonlinear dynamic control challenges in assembly tasks, a fuzzy PID system integrated with spider monkey optimization (SMO) was proposed, achieving precise end effector control (Alka et al., 2021). There are some successful cases for achieving embodied intelligence of robots, such as acquired skills through observation or demonstration. (Park et al., 2023). Zhao et al., 2022 developed a brain-inspired hierarchical system for online gesture recognition and action learning, named spiking gating flow (SGF). By integrating feature extraction and event-driven training, it showed exceptionally high spatio-temporal feature utilization efficiency. However, the end-to-end models rely heavily on high-fidelity simulations, even slight modeling deviations caused by physical parameter influences often show a bottleneck when transferring from virtual environment to real physical space, due to a policy performance variation. Currently, unstructured disturbances in real scenarios are complex, the neural control strategies still require indepth exploration for multi-objective trade-offs and transfer implementation when addressing this problem.Manufacturing of automotive industry in industry 5.0Driven by industry 5.0 and carbon neutrality, the goals of synergy, sustainability, and human centricity are of equal importance. Energy reduction generally relies on hardware improvements, offline planning based on rigid body dynamics, which neglects the nonlinear interactive effects in dynamic environments. In contrast, the neural intelligence algorithm-based trajectory energy consumption models achieved over 30% energy consumption reduction according to Yao et al., (2024), it was integrated into industrial internet of things (IIoT) platforms and digital twin systems to accomplish full lifecycle energy consumption management (Chandrasekaran&Rajesh, 2024.). Wu and Corves, 2025 proposed an adaptive optimal trajectory tracking control framework, achieved a dynamic equilibrium of control performance and energy consumption while maintained operational precision. Addressing computational energy consumption when deploying deep learning models at the edge, Arijit et al., 2024 applied a structured large-scale lottery hypothesis (LTH) to remove sparse layers from neural networks, achieved overall system energy loss reduction.Industry 5.0 emphasizes deep integration between human and robots, nevertheless, due to the inherent uncertainty in human operations, establishing efficient mutual cognition mechanism remains a challenge. To solve unpredictable human interactions, such as turn-taking prediction in humanrobot collaborative assembly, Xu et al. (2023) proposed a spiking neural network (SNN) based on the Izhi neuron model, the proposed spatio-temporal prediction and planning framework (STAP-PPF) enhanced coordination through dynamically updated paths. Although this method showed certain advantages when handling standard collaborative actions, when confronted with worker fatigue, variations in worker skill levels, and other non-standard disruptive movements, the predictive accuracy will decline exponentially. For achieving deep reasoning for complex collaboration, a vision reasoning-based mutual cognition HRC approach was proposed (Jared and Gloria, 2023), which constructed domain-specific knowledge graphs and leveraged graph embedding techniques, results showed a good performance. The introduction of knowledge graph and semantic reasoning enhances the depth of understanding, but the resulting massive data significantly increases time delays, and adding production risk. There is still a certain gap between the simulation in the laboratory and the actual industrial deployment, embodied intelligence must be integrated into mature industrial internet architectures to meet the real-time and reliability demands of industry 5.0. For instance, in practical engineering applications within automotive manufacturing, He et al., 2024 proposed a long short-term memory neural network (LSTM) for the state recognition of the industrial robot, this model was deployed in a real-time production environment for continuous evaluation for state recognition of industrial robots. This application delivers data support for enhancing production process transparency. By identifying abnormal conditions during manufacturing, it strengthens resilience within systems, achieves full traceability across the entire process under industry 5.0. Robots are evolving from production tools to cognitive collaborators with deep semantic understanding capabilities. This transformation goes beyond improvements in technical metrics, but lies in how to achieve a deep integration of domain prior knowledge and data-driven models in the context of high-dimensional, heterogeneous, and massive data. A unified cross-modal cognitive framework that integrates multi-source heterogeneous information is not to break the perception barriers of machine collaboration, but the ability to realize the semantic understanding and flexible decision-making ability of embodied agents under complex production conditions. Table 1 shows a multi-dimensional summary of neurorobotic methodologies in automotive manufacturing (2021)(2022)(2023)(2024)(2025). Even if there are some potentials in reconfiguring perception, decision-making, and execution in automotive manufacturing, transferring neurorobotics from laboratory to industrial scenarios faces challenges.Although neurally grounded intelligent algorithms provide theoretical support for industrial robots in practical applications, they still face profound challenges in the nonlinear, strongly coupled, and highly dynamic industrial environment of automotive manufacturing.The automobile manufacturing requires traceability of the whole process, while the neural network relies on the black box feature in the production system, in key workstations, it is difficult to deconstruct the weighted decision-making schemes into explanations that align with actual production rules, therefore, this lack of transparency adding traceability analysis and troubleshooting of failure. At the same time, manufacturing systems have not established a comprehensive certification framework, thus, deploying end-to-end learning models in real-world production scenarios involving human-machine interaction still faces significant barriers.Robot often fails to comprehend underlying environment intentions, the systemic capabilities akin to human experts, such as cognitive coordination and decision still need further exploration (Lian et al., 2024). It is a challenge for achieving truly safe and efficient human-robot deep collaboration without bidirectional mutual cognition mechanism.Advanced architectures such as transformers or graph neural networks enhance the depth of semantic perception, however, massive parameter counts cause inference delays and the addition of response time. Besides, deploying such models on computationally constrained edge devices requires sacrificing inference accuracy or incurring costly investments. At the same time, research often overlooks the application costs of models, including data management, offline training, and energy consumption from hardware upgrades. For instance, cycle time savings by algorithm may be offset by maintenance costs when the consumption is not effectively optimized.Existing deep learning models heavily rely on training data with specific distributions. Confronted with unstructured perturbations, such as new vehicle models, workshop setting changes, or deformations in flexible materials, the generalization error of the model increases exponentially.Moreover, the robust strategies trained in simulated environments cannot fully achieve good performance in real physical scenarios.Industry 5.0 is not merely an iteration of technology but a reshaping of the workforce structure.Currently, there is a significant cognitive gap between engineers and AI experts. The self-learning capabilities of neural networks require skill in data processing and parameter tuning for the operator, which increases personnel training expenditures, moreover, integrating expert knowledge with blackbox strategies to achieve collaborative operations remains a significant technical challenge.Automotive manufacturing requires high cycle time, low latency requirements, future research shall focus on the computational efficiency of biological neural systems and advanced cognitive frameworks. Brain-inspired control systems and dynamic compensation integrated with neural networks enable microsecond-level monitoring of anomalies, significantly enhancing system robustness.To address the trust deficit in human-robot collaboration, it is necessary to enhance the cross-modal understanding and knowledge representation of robots. Developing large language models to extract deep semantic information, transforming non-standard gestures, voice commands, and ambiguous operational intentions of workers into structured task graphs, thus achieving the improvement of collaborative efficiency. Simultaneously, neural networks are employed to fuse the acquired highdimensional features with digitized expert knowledge, thereby enhancing the logical reasoning capabilities and improving the interpretability of decisions of the system.Mixed-line production of multiple vehicles increases downtime for training, the exploration of industrial foundation models (IFMs) shows advantages in dealing with this issue. Through continuous learning and experience accumulation across workstations, physical laws and expert knowledge are integrated into neural networks, the framework integrated structured knowledge shortens deployment cycles for new tasks with generalization capabilities.This mini review systematically argued the industrial robots in automotive manufacturing, different from the previous review that focused on the accuracy of algorithms, we outlined how robots in neurally grounded intelligent algorithms affecting the performance of the production system. Although the path to fully autonomous, self-evolving faces numerous challenges, such as computational bottlenecks and safety certification, with the integration of brain-inspired architectures and development of industrial large language models, future robots shall achieve a truly humancentered, green, and sustainable intelligent manufacturing system in the era of industry 5.0.
Zhang et al. (Thu,) studied this question.
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: