What question did this study set out to answer?

This research investigates how the timing between narrated text and animations affects learning in a virtual reality context.

March 25, 2026

Continuous Temporal Contiguity Narration‐Animation for Learning in Virtual Reality

Key Points

This research investigates how the timing between narrated text and animations affects learning in a virtual reality context.
Conducted an experimental pre-post-test design with junior secondary school students.
Examined five temporal contiguity conditions with varying time lags between narration and animations.
Controlled for participants' prior knowledge, working memory capacity, and spatial abilities.
Used a multimedia lesson in VR to teach about organic matter decomposition in European forests.
Significant negative effects on recall and comprehension were observed with time lags of even −2 seconds.
Performance varied based on participants' working memory capacity.
Episodic memory tasks indicated better text-picture integration than previously reported.
Learners displayed awareness of the time lags during learning.

Abstract

ABSTRACT Background Temporal contiguity between visual and verbal (narrated) information could strongly influence learning from multimedia animations, especially in the case of a rich visual context such as Virtual Reality. Compared to spatial contiguity, less research has been carried out on the impact of the time lag, for example, the de‐synchronisation, between visual (pictures) and verbal (text) information during animation‐based learning and mainly on adults. Further, the results of this research were mixed, especially in the case of continuous temporal contiguity. The material conventionally used was composed of short, schematic animations and not of a rich, visual, learning context and no studies have been carried out on temporal contiguity in virtual reality. Objectives In order to help satisfy this need for research, we used a multimedia lesson, in VR (desktop) about organic matter decomposition in a forest environment to investigate the influence of the (de)‐synchronisation of, for example, time lag between, narrated text and dynamic pictures in rich PC‐VR video. The theoretical framework of the study involved a consistent combination of the Cognitive Theory of Multimedia Learning (CTML, Mayer 2014) and of the Time‐Based Resource‐Sharing model of working memory (TBRS, Barrouillet and Camos 2021). Method In an experimental pre‐post‐test learning paradigm, the effects of five new temporal contiguity conditions on various recall measures, comprehension, and also new episodic memory tasks were tested on 227 junior secondary school participants (second year: 12 years old). They were invited to study an on‐screen Virtual Reality lesson on the European forest soil decomposition mechanism. The delays between the narrated words and the corresponding animations were: −6, −2, 0, +2, +6 s, respectively. Prior knowledge, working memory capacity, and spatial abilities were controlled. Results and Conclusions Temporal gaps, even −2 s had a significant negative effect on recall and comprehension. Performances were influenced by working memory capacity. However, the episodic memory task results suggested that text‐picture integration with long text segments was greater than previous literature has shown. Learners were relatively aware of time lags. Implications In the future, precise time‐locked correspondence between spoken text and dynamic pictures might turn out to be of high importance for immersive VR, because of the high degree of interactivity, for example, user control of the environment dynamics. These results are consistent with the CTML and the TBRS models, and they suggest that it would be worthwhile to incorporate episodic memory measures into the CTML.

Bookmark

Cite This Study

Porte et al. (Mon,) studied this question.

synapsesocial.com/papers/69c37ba2b34aaaeb1a67e465 https://doi.org/https://doi.org/10.1002/jcal.70218

Bookmark