What question did this study set out to answer?

The main aim is to forecast future state of scene graphs based on past graph sequences from video input.

March 5, 2026

Scene Graph Forecasting Using Neural Network-Based Methods

Key Points

The main aim is to forecast future state of scene graphs based on past graph sequences from video input.
Developed GraphCast model architecture using object-centric encoding and transformer model.
Employed biaffine relation classification for interaction modeling.
Implemented temporal convolution module for feature extraction and noise robustness.
Conducted experiments on STAR and Action Genome datasets to validate performance.
GraphCast outperforms existing methods for scene graph forecasting.
Improvements in predicting object relations and presence were observed over baseline models.

Abstract

Forecasting the future state of a scene is a key computer vision task needed to build systems capable of proactive perception and decision-making in changing environments. This work addresses the problem of forecasting future scene graphs, where, given a video and a sequence of past graphs, one must predict objects and their relations in subsequent frames. Unlike existing approaches limited to static perception, the proposed method, GraphCast, takes into account semantic vision-language features of objects and their temporal dynamics. We introduce a model architecture based on object-centric encoding with a foundation transformer model, interaction modeling via a biaffine relation classification head, and a specialized object presence classifier. In addition, a temporal convolution module is used to extract features and improve robustness to noise. Experiments on the STAR and Action Genome datasets demonstrate that the proposed architecture outperforms existing baselines.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

A. M. Trunova

D. A. Yudin

Journals

Doklady Mathematics

Actions

Institutions

Moscow Institute of Physics and Technology

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Scene Graph Forecasting Using Neural Network-Based Methods

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study