What does this research mean for the field?

Sequential annotations of embodied actions can identify typical patterns of movement and utterance combinations that facilitate understanding gestures in spoken conversations.

What question did this study set out to answer?

This research aims to develop a framework for annotating gestures in spoken conversations to understand interaction dynamics.

March 3, 2026 · Open Access

Sequential Annotations of Embodied Actions in Movement Scenes from a Multimodal Corpus: A Case Study of the Miraikan SC Corpus

Key Points

This research aims to develop a framework for annotating gestures in spoken conversations to understand interaction dynamics.
Conducted sequential annotations of embodied actions within a multimodal corpus.
Focused on interactions between science communicators and visitors.
Annotated first actions by science communicators and subsequent actions by visitors.
Performed both quantitative and qualitative analyses of the annotation data.
Identified typical patterns of movement and utterance combinations used to prompt visitor actions.
Highlighted relevant cases with time lags between actions for deeper analysis.
Showed variations in causes of time lags in interactions.

Abstract

To construct a framework for annotating the process of understanding the meaning of gestures in spoken conversations in a straightforward and versatile way, we conducted sequential annotations of embodied actions on movement scenes from a multimodal corpus of conversations between science communicators and visitors at the National Museum of Emerging Science and Innovation (Miraikan SC corpus). This paper introduces the purpose and outline of the sequential annotations of embodied actions and presents the results of quantitative and qualitative analyses based on these annotations. For a sequence in which a science communicator (SC) prompts a visitor to move using some movement or utterance and the visitor follows and begins to move, the SC’s first action and the visitor’s second action were annotated. The SC’s first actions were annotated as walking, pointing, changing body orientation, speech, and/or gestures. The quantitative analysis results suggest that by reviewing the annotation data, it is possible to identify typical patterns of movement and utterance combinations that constitute the “gestures” that the SC used to prompt visitors to move. Meanwhile, the qualitative analysis results suggest that a detailed analysis of cases with a significant time lag between the onset of the first and second actions would enable the identification of the relevance and diversity of the causes of the time lag.
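The annotation scheme described in the abstract can be pictured as a small data structure: each sequence pairs the SC's first action (one or more of the listed action types) with the onset of the visitor's second action. The Python sketch below is only an illustration under stated assumptions. The record layout, the field names, the label spellings, the 2-second threshold, and the sample values are all hypothetical and are not taken from the Miraikan SC corpus or the paper's tooling.

```python
from collections import Counter
from dataclasses import dataclass

# Action types the abstract lists for the SC's first action.
# The label spellings are an assumption made for this sketch.
FIRST_ACTION_TYPES = {"walking", "pointing", "body_orientation", "speech", "gesture"}


@dataclass(frozen=True)
class Sequence:
    """One prompt-and-follow sequence (hypothetical record layout)."""
    first_types: frozenset  # action types combined in the SC's first action
    first_onset: float      # onset of the SC's first action, in seconds
    second_onset: float     # onset of the visitor's second action, in seconds

    def __post_init__(self):
        # Reject labels outside the assumed label set.
        unknown = self.first_types - FIRST_ACTION_TYPES
        if unknown:
            raise ValueError(f"unknown action types: {sorted(unknown)}")

    @property
    def lag(self) -> float:
        """Time lag between the onsets of the first and second actions."""
        return self.second_onset - self.first_onset


def combination_counts(seqs):
    """Count how often each combination of first-action types occurs."""
    return Counter(tuple(sorted(s.first_types)) for s in seqs)


def long_lag_cases(seqs, threshold):
    """Return sequences whose lag exceeds the threshold (in seconds)."""
    return [s for s in seqs if s.lag > threshold]


# Made-up sample records, for illustration only.
sequences = [
    Sequence(frozenset({"speech", "pointing"}), 10.0, 10.8),
    Sequence(frozenset({"speech", "pointing"}), 42.5, 46.0),
    Sequence(frozenset({"walking"}), 70.0, 70.5),
]

counts = combination_counts(sequences)               # quantitative view
flagged = long_lag_cases(sequences, threshold=2.0)   # candidates for qualitative review
```

Counting combinations corresponds to the quantitative step of looking for typical patterns, and flagging long lags corresponds to selecting cases for closer qualitative analysis. What counts as a "significant" lag is not specified in the abstract, so the threshold here is an arbitrary placeholder.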
