Key points are not available for this paper at this time.
We propose a novel architecture design for video prediction in order to utilize procedural domain knowledge directly as part of the computational graph of data-driven models. On the basis of new challenging scenarios we show that state-of-the-art video predictors struggle in complex dynamical settings, and highlight that the introduction of prior process knowledge makes their learning problem feasible. Our approach results in the learning of a symbolically addressable interface between data-driven aspects in the model and our dedicated procedural knowledge module, which we utilize in downstream control tasks.
Building similarity graph...
Analyzing shared references across papers
Loading...
Takenaka et al. (Wed,) studied this question.
www.synapsesocial.com/papers/68e634d1b6db6435875c676f — DOI: https://doi.org/10.48550/arxiv.2407.09537
Patrick Takenaka
Johannes Maucher
Marco F. Huber
Building similarity graph...
Analyzing shared references across papers
Loading...