Santa Clara, California – ROVR, a leading decentralized physical infrastructure network (DePIN), has announced the launch of the ROVR Open Dataset, a high-resolution, multi-modal dataset designed to accelerate innovation in Spatial AI, autonomous driving, robotics, and digital twin applications. The announcement was made at the ADAS & Autonomous Vehicle Technology Summit North America. This release signifies a major step in democratizing access to high-quality, real-world data for the AI community.
The ROVR Open Dataset initially comprises 1,500 fully synchronized clips, totaling over 1TB of data. These clips feature diverse coverage across urban, suburban, and highway environments, including challenging scenarios like construction zones and heavy traffic. Each clip integrates raw LiDAR point clouds, high-resolution RGB video, high-frequency IMU data, and centimeter-level RTK GPS localization, providing a comprehensive view for advanced AI training.
Unlike traditional datasets focused solely on machine vision, the ROVR Open Dataset aims to capture the world as seen by human drivers, including their interactions with surroundings. "Spatial AI is the next frontier of artificial intelligence," stated Guang Ling, Founder of ROVR, emphasizing the dataset's role in empowering researchers to build safer, smarter, and more generalizable AI systems. All data is anonymized to ensure privacy and ethical AI development.
ROVR operates on a decentralized model, leveraging a global network of contributors equipped with custom-built mobile perception units. This community-driven approach has enabled the collection of over 20 million kilometers of road coverage and the deployment of more than 3,500 devices globally. This scalable infrastructure supports the continuous generation of diverse and high-fidelity data essential for robust AI models.
The dataset is released under a permissive license for non-commercial use, with future plans to offer extended versions, including detailed annotations and commercial licensing options. ROVR's initiative reflects its commitment to open infrastructure, fostering collaboration, reproducibility, and transparency across the global AI and robotics communities. The company aims to build the world’s largest open-access driving dataset, targeting 1 million 30-second clips.