The deployment of Motor Imagery Brain–Computer Interfaces (MI-BCI) is constrained by the inherent physiological variabilities of Electroencephalography (EEG) and parametric opacity. This paper presents a targeted technical audit of ten high-density MI-BCI computational pipelines, evaluating how existing literature addresses low Signal-to-Noise Ratio (SNR), intra-subject variability, and session-to-session instability. The investigation focuses on the contamination of data by ocular and muscular artifacts that overlap with the spectral components of Mu and Beta rhythms, often leading to algorithmic overfitting. Furthermore, the paper evaluates the impact of manifold drift where fluctuations in user state necessitate frequent recalibration as a primary hurdle for BCI portability. By applying a forensic evaluation framework to standardize the analysis across the ten selected studies, this paper identifies a high-performance landscape within standardized benchmarks, with classification accuracies reaching peak values of 95.42%. The audit specifically identifies a performance-reporting gap; while hybrid architectures demonstrate superior noise-rejection, they are frequently characterized by undocumented computational overhead. Additionally, while Neighborhood Component Analysis (NCA) emerges as a stable feature selection algorithm across the sampled literature, the systemic absence of reported execution times prevents a verified assessment of its low-latency viability. A critical technical finding is the widespread issue of Parametric Opacity, particularly regarding the omission of essential deterministic variables such as filter orders, windowing constants, and the final dimensionality of feature vectors. The audit reveals that the frequent failure to report the exact number of features utilized for classification masks potential overfitting and prevents an accurate assessment of the system’s generalization capabilities. Furthermore, only a specialized subset of the reviewed literature validates performance through formal statistical testing, such as Friedman ANOVA or Wilcoxon Signed-Rank tests, with most studies relying on peak accuracy metrics that may disguise filtered artifact residuals. This lack of granular documentation disguises the computational complexity of proposed methods and complicates their feasibility for hardware-in-the-loop validation. The findings establish that standardizing the reporting of preprocessing variables and feature-space dimensions is a prerequisite for overcoming current performance plateaus in universal BCI architectures.
Ramsoonder et al. (Thu,) studied this question.