This study tests whether modern machine-learning models can generate economically meaningful alpha when forecasting daily U.S. large-cap returns under realistic trading frictions. Using 11 years of high-resolution data (2013-2023) on 423 liquid S an equal-weight ensemble of TFT, LSTM and XGBoost delivers 1.42 Sharpe with a maximum draw-down of –9.6 %. Ablation studies show technical indicators dominate predictive power, sentiment adds 0.08 Sharpe, and macro variables contribute 0.05. The edge persists at transaction costs up to 10 bps and across the COVID-19 and GFC regimes. While GPU inference and nightly retraining introduce operational overhead, the incremental 0.4 Sharpe remains economically large. Our open-source pipeline and dataset enable full replication and extension.
Kaleem et al. (Thu,) studied this question.