January 10, 2023 · Open Access

Mastering Diverse Domains through World Models

Abstract

Developing a general algorithm that learns to solve tasks across a wide range of applications has been a fundamental challenge in artificial intelligence. Although current reinforcement learning algorithms can be readily applied to tasks similar to those they were developed for, configuring them for new application domains requires significant human expertise and experimentation. We present DreamerV3, a general algorithm that outperforms specialized methods on more than 150 diverse tasks with a single configuration. Dreamer learns a model of the environment and improves its behavior by imagining future scenarios. Robustness techniques based on normalization, balancing, and transformations enable stable learning across domains. Applied out of the box, Dreamer is the first algorithm to collect diamonds in Minecraft from scratch without human data or curricula. This achievement has been posed as a significant challenge in artificial intelligence that requires exploring farsighted strategies from pixels and sparse rewards in an open world. Our work allows solving challenging control problems without extensive experimentation, making reinforcement learning broadly applicable.

Authors

Danijar Hafner

Google (United States)

Jurgis Pašukonis

Google (United States)

Jimmy Ba

University of Toronto
