Key points are not available for this paper at this time.
We introduce Argoverse 2 (AV2) - a collection of three datasets for and forecasting research in the self-driving domain. The annotated Dataset contains 1, 000 sequences of multimodal data, encompassing-resolution imagery from seven ring cameras, and two stereo cameras in to lidar point clouds, and 6-DOF map-aligned pose. Sequences contain3D cuboid annotations for 26 object categories, all of which are-sampled to support training and evaluation of 3D perception. The Lidar Dataset contains 20, 000 sequences of unlabeled lidar point and map-aligned pose. This dataset is the largest ever collection of sensor data and supports self-supervised learning and the emerging task point cloud forecasting. Finally, the Motion Forecasting Dataset contains250, 000 scenarios mined for interesting and challenging interactions between autonomous vehicle and other actors in each local scene. Models are tasked the prediction of future motion for "scored actors" in each scenario and provided with track histories that capture object location, heading, , and category. In all three datasets, each scenario contains its own Map with 3D lane and crosswalk geometry - sourced from data captured in six cities. We believe these datasets will support new and existing learning research problems in ways that existing datasets do not. All are released under the CC BY-NC-SA 4. 0 license.
Wilson et al. (Sun,) studied this question.