Collaborative 3D Semantic Occupancy Prediction for V2X-Enabled Autonomous Driving

2026-05-16

[2025 - 2026]

Building an open synthetic benchmark and reference baselines for V2X cooperative 3D semantic occupancy prediction, so that connected vehicles and roadside infrastructure can jointly reconstruct dense, semantically rich representations of the driving scene.

A single autonomous vehicle is fundamentally limited in how completely it can understand a traffic scene. Onboard LiDAR and cameras are constrained by their physical viewpoint, so distant regions, blind corners, and occluded objects routinely fall outside any reliable sensing envelope. Vehicle-to-Everything (V2X) communication offers a way out of this single-agent bottleneck: when connected vehicles and roadside units share what they perceive, the resulting collaborative view can cover regions no individual sensor can reach. This project investigates how V2X cooperation can be applied to one of the most expressive forms of driving-scene understanding — 3D semantic occupancy prediction — and what is needed to make research in this direction reproducible and comparable.

Collaborative perception has so far been studied mainly for sparse outputs such as 3D bounding boxes and Bird’s-Eye-View segmentation, while 3D semantic occupancy — a dense voxel-level description of free space, occupied space, and per-voxel semantic class — has remained largely a single-vehicle problem. One important reason is the lack of a suitable benchmark: existing cooperative perception datasets are not designed around dense semantic voxel supervision, and existing occupancy datasets are not designed around multi-agent V2X scenarios. We address this gap by constructing a synthetic benchmark in which both sides are jointly considered from the start.

The benchmark is generated in the CARLA simulator with multiple time-synchronized agents — connected vehicles and roadside units — each equipped with realistic onboard sensors and a high-resolution semantic voxel sensor that produces dense ground truth around the agent. Scenes are designed to stress the cooperative setting: heavy occlusion in urban traffic, long-range perception requirements, and viewpoints that are clearly complementary across agents. Alongside the dataset, we define a unified evaluation protocol that allows ego-only and cooperative configurations to be compared on the same scenes under the same metrics, so that the contribution of V2X cooperation can be measured rather than assumed.

On top of this benchmark we provide reference baselines for collaborative 3D semantic occupancy prediction that perform inter-agent feature fusion before voxel-level decoding. These baselines demonstrate that cooperation yields clear, consistent gains over single-agent perception, especially in regions that are occluded or far from the ego vehicle, where single-agent occupancy prediction degrades the most. The dataset generation pipeline, ground-truth tools, and baseline implementations are released as open source so that future methods — including more efficient communication strategies, robustness studies, and learned cooperation policies — can be evaluated against a common, reproducible reference. The work is positioned as a foundation for the broader research agenda on cooperative perception in next-generation V2X-enabled autonomous driving.

Publication

Wu, Hanlin, Lin, Pengfei, Javanmardi, Ehsan, Bao, Naren, Qian, Bo, Si, Hao, Tsukada, Manabu, "A Synthetic Benchmark for Collaborative 3D Semantic Occupancy Prediction in V2X-Enabled Autonomous Driving ", In: IEEE International Conference on Robotics & Automation (ICRA 2026), Vienna, Austria, 2026.Proceedings Article | Abstract | Links | BibTeX

Category:

Project

Tags:

Autonomous Driving, Machine Learning

Related Projects:

Collaborative 3D Semantic Occupancy Prediction for V2X-Enabled Autonomous Driving

autonomous driving machine learning

Collaborative 3D Semantic Occupancy Prediction for V2X-Enabled Autonomous Driving

autonomous driving machine learning

Collaborative 3D Semantic Occupancy Prediction for V2X-Enabled Autonomous Driving

autonomous driving machine learning

Collaborative 3D Semantic Occupancy Prediction for V2X-Enabled Autonomous Driving

2026-05-16

Trajectory Planning for 6G Next-Generation UAVs/AMRs

machine learning uav

Trajectory Planning for 6G Next-Generation UAVs/AMRs

machine learning uav

Trajectory Planning for 6G Next-Generation UAVs/AMRs

machine learning uav

Trajectory Planning for 6G Next-Generation UAVs/AMRs

2026-05-16

Smart Pole Interaction Unit (SPIU): Infrastructure-Side Communication for Pedestrian-AV Interaction in Shared Spaces

autonomous driving v2x

Smart Pole Interaction Unit (SPIU): Infrastructure-Side Communication for Pedestrian-AV Interaction in Shared Spaces

autonomous driving v2x

Smart Pole Interaction Unit (SPIU): Infrastructure-Side Communication for Pedestrian-AV Interaction in Shared Spaces

autonomous driving v2x

Smart Pole Interaction Unit (SPIU): Infrastructure-Side Communication for Pedestrian-AV Interaction in Shared Spaces

2026-04-06

Reliable GPS-Free Vehicle Localization Framework for Next-Generation V2X Systems

v2x

Reliable GPS-Free Vehicle Localization Framework for Next-Generation V2X Systems

v2x

Reliable GPS-Free Vehicle Localization Framework for Next-Generation V2X Systems

v2x

Reliable GPS-Free Vehicle Localization Framework for Next-Generation V2X Systems

2026-02-27

Spatial ID as a Scalable Spatial Index for Urban Digital Twins and Spatial Computing

digital twins extended reality

Spatial ID as a Scalable Spatial Index for Urban Digital Twins and Spatial Computing

digital twins extended reality

Spatial ID as a Scalable Spatial Index for Urban Digital Twins and Spatial Computing

digital twins extended reality

Spatial ID as a Scalable Spatial Index for Urban Digital Twins and Spatial Computing

2025-12-26

4D Path Planning, World Models, Reinforcement Learning, and VLM/VLA Integration for Autonomous Drones

digital twins uav

4D Path Planning, World Models, Reinforcement Learning, and VLM/VLA Integration for Autonomous Drones

digital twins uav

4D Path Planning, World Models, Reinforcement Learning, and VLM/VLA Integration for Autonomous Drones

digital twins uav

4D Path Planning, World Models, Reinforcement Learning, and VLM/VLA Integration for Autonomous Drones

2025-12-26

Multi-PrefDrive: Advancing LLM-Based Autonomous Driving via Multi-Preference Learning

autonomous driving machine learning

Multi-PrefDrive: Advancing LLM-Based Autonomous Driving via Multi-Preference Learning

2025-08-13

Progressive Heterogeneous Collaborative Perception (PHCP): A Communication Technology for Autonomous Vehicles to Adapt and Connect on the Fly

machine learning v2x

Progressive Heterogeneous Collaborative Perception (PHCP): A Communication Technology for Autonomous Vehicles to Adapt and Connect on the Fly

machine learning v2x

Progressive Heterogeneous Collaborative Perception (PHCP): A Communication Technology for Autonomous Vehicles to Adapt and Connect on the Fly

machine learning v2x

Progressive Heterogeneous Collaborative Perception (PHCP): A Communication Technology for Autonomous Vehicles to Adapt and Connect on the Fly

2025-08-13

Collaborative 3D Semantic Occupancy Prediction for V2X-Enabled Autonomous Driving

[2025 - 2026]

Building an open synthetic benchmark and reference baselines for V2X cooperative 3D semantic occupancy prediction, so that connected vehicles and roadside infrastructure can jointly reconstruct dense, semantically rich representations of the driving scene.

Publication

Related Projects:

Collaborative 3D Semantic Occupancy Prediction for V2X-Enabled Autonomous Driving

Collaborative 3D Semantic Occupancy Prediction for V2X-Enabled Autonomous Driving

autonomous driving machine learning

autonomous driving machine learning

Trajectory Planning for 6G Next-Generation UAVs/AMRs

Trajectory Planning for 6G Next-Generation UAVs/AMRs

machine learning uav

machine learning uav

Smart Pole Interaction Unit (SPIU): Infrastructure-Side Communication for Pedestrian-AV Interaction in Shared Spaces

Smart Pole Interaction Unit (SPIU): Infrastructure-Side Communication for Pedestrian-AV Interaction in Shared Spaces

autonomous driving v2x

autonomous driving v2x

Reliable GPS-Free Vehicle Localization Framework for Next-Generation V2X Systems

Reliable GPS-Free Vehicle Localization Framework for Next-Generation V2X Systems

v2x

v2x

Spatial ID as a Scalable Spatial Index for Urban Digital Twins and Spatial Computing

Spatial ID as a Scalable Spatial Index for Urban Digital Twins and Spatial Computing

digital twins extended reality

digital twins extended reality

4D Path Planning, World Models, Reinforcement Learning, and VLM/VLA Integration for Autonomous Drones

4D Path Planning, World Models, Reinforcement Learning, and VLM/VLA Integration for Autonomous Drones

digital twins uav

digital twins uav

Multi-PrefDrive: Advancing LLM-Based Autonomous Driving via Multi-Preference Learning

Multi-PrefDrive: Advancing LLM-Based Autonomous Driving via Multi-Preference Learning

autonomous driving machine learning

autonomous driving machine learning

Progressive Heterogeneous Collaborative Perception (PHCP): A Communication Technology for Autonomous Vehicles to Adapt and Connect on the Fly

Progressive Heterogeneous Collaborative Perception (PHCP): A Communication Technology for Autonomous Vehicles to Adapt and Connect on the Fly

machine learning v2x

machine learning v2x

Tsukada Laboratory

Topics

Access & Contact

Language