
Collaborative perception allows autonomous vehicles to use Vehicle-to-Everything (V2X) communication to share sensor information, enabling them to overcome individual limitations by seeing further and through occlusions. This capability is expected to dramatically improve the accuracy and safety of environmental perception. However, a major real-world challenge is “heterogeneity,” where vehicles from different manufacturers use varied sensors and perception models. This results in a “domain gap”—differences in the features of shared data—making effective information fusion difficult. Existing solutions have been impractical for real-world applications, as they require vehicles to undergo joint training on a large dataset beforehand, which is not feasible in dynamic traffic environments with new, unknown collaborators.
To address this challenge, our research introduces Progressive Heterogeneous Collaborative Perception (PHCP), a novel framework designed to solve this problem during inference (i.e., during actual driving) without any need for pre-training. PHCP formulates the problem as “few-shot unsupervised domain adaptation,” an approach where an ego vehicle dynamically aligns features by self-training with a small amount of unlabeled data from its collaborator. This allows for flexible and on-the-fly adaptation to any new vehicle it encounters.
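To make the idea concrete, the sketch below shows what such a lightweight, self-trainable adapter could look like in PyTorch. It is an illustrative assumption, not the paper's released architecture: the class name, the 1x1-convolution structure, and all channel sizes are invented for this example.

import torch
import torch.nn as nn

class FeatureAdapter(nn.Module):
    """Hypothetical lightweight adapter: projects a collaborator's
    intermediate feature map into the ego vehicle's feature space.
    The structure and sizes are illustrative assumptions, not the
    paper's architecture."""

    def __init__(self, agent_channels: int, ego_channels: int):
        super().__init__()
        # Two 1x1 convolutions keep the module small enough to be
        # fine-tuned online from only a few frames of collaborator data.
        self.align = nn.Sequential(
            nn.Conv2d(agent_channels, ego_channels, kernel_size=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(ego_channels, ego_channels, kernel_size=1),
        )

    def forward(self, agent_feat: torch.Tensor) -> torch.Tensor:
        return self.align(agent_feat)

# Example: translate a 128-channel agent feature map into the ego's
# 256-channel space (all shapes are placeholders).
adapter = FeatureAdapter(agent_channels=128, ego_channels=256)
aligned = adapter(torch.randn(1, 128, 100, 100))

Because the adapter is small relative to a perception backbone, fine-tuning it online from a handful of collaborator frames is cheap compared with retraining any part of the detector itself.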
The PHCP process consists of two stages. In Stage I, which lasts for the first few frames after a collaborative relationship is established, the agent vehicle transmits both its intermediate features and its own detection results, which serve as “pseudo labels.” The ego vehicle uses these pseudo labels to self-train a lightweight “adapter” that learns to translate the agent’s data into a format it can understand. Once the adapter is fine-tuned, Stage II begins. In this stage, the agent only needs to send its feature data, and the ego vehicle uses the trained adapter to transform and fuse the information, achieving efficient and highly accurate collaborative perception.
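The two-stage protocol can be summarized in a self-contained PyTorch sketch. Everything here is a hedged stand-in: EgoPipeline, the MSE loss, the channel sizes, and the five-frame budget are placeholders for the ego vehicle's real fusion module, detection loss, and training schedule; none of it is taken from the released code.

import torch
import torch.nn as nn
import torch.nn.functional as F

# Stand-in for the ego vehicle's frozen perception stack; in the real
# system this would be its pretrained fusion module and detection head.
class EgoPipeline(nn.Module):
    def __init__(self, channels: int):
        super().__init__()
        self.fuse = nn.Conv2d(2 * channels, channels, kernel_size=1)
        self.head = nn.Conv2d(channels, 8, kernel_size=1)  # toy detection head

    def forward(self, ego_feat, agent_feat):
        fused = self.fuse(torch.cat([ego_feat, agent_feat], dim=1))
        return self.head(fused)

ego = EgoPipeline(channels=256)
ego.requires_grad_(False)  # the ego model stays fixed; only the adapter learns
adapter = nn.Conv2d(128, 256, kernel_size=1)  # lightweight adapter, as above
opt = torch.optim.Adam(adapter.parameters(), lr=1e-3)

# Stage I: for the first few frames the agent sends features AND its own
# detections; the ego treats those detections as pseudo labels and
# self-trains the adapter against them.
for _ in range(5):  # a handful of frames, no human labels involved
    ego_feat = torch.randn(1, 256, 100, 100)
    agent_feat = torch.randn(1, 128, 100, 100)
    pseudo_label = torch.randn(1, 8, 100, 100)  # the agent's detection output
    pred = ego(ego_feat, adapter(agent_feat))
    loss = F.mse_loss(pred, pseudo_label)  # toy stand-in for a detection loss
    opt.zero_grad()
    loss.backward()
    opt.step()

# Stage II: the agent now sends features only; the fine-tuned adapter
# aligns them before fusion, with no further training.
with torch.no_grad():
    detections = ego(torch.randn(1, 256, 100, 100),
                     adapter(torch.randn(1, 128, 100, 100)))

The design point the sketch illustrates is that only the adapter receives gradients: the ego model stays frozen, so self-training on a few frames of pseudo labels leaves the vehicle's own perception stack untouched.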
We conducted extensive experiments on OPV2V, an open benchmark dataset for autonomous driving, to validate the framework's effectiveness. The results show that PHCP consistently outperforms the direct-fusion baseline by approximately 30% in perception accuracy. Furthermore, it achieves performance comparable to state-of-the-art (SOTA) methods trained on the entire dataset while using only a small amount of unlabeled data. This demonstrates that PHCP is a practical and effective solution for robust collaborative perception in the diverse, unpredictable traffic of the real world.
@inproceedings{Si2025,
title = {You Share Beliefs, I Adapt: Progressive Heterogeneous Collaborative Perception},
author = {Hao Si and Ehsan Javanmardi and Manabu Tsukada},
url = {https://sihaoo1.github.io/PHCP_Page/, https://arxiv.org/abs/2509.09310, https://github.com/sihaoo1/PHCP},
year = {2025},
date = {2025-10-19},
urldate = {2025-10-19},
booktitle = {International Conference on Computer Vision (ICCV 2025)},
address = {Honolulu, Hawai'i},
abstract = {Collaborative perception enables vehicles to overcome individual perception limitations by sharing information, allowing them to see further and through occlusions. In real-world scenarios, models on different vehicles are often heterogeneous due to manufacturer variations. Existing methods for heterogeneous collaborative perception address this challenge by fine-tuning adapters or the entire network to bridge the domain gap. However, these methods are impractical in real-world applications, as each new collaborator must undergo joint training with the ego vehicle on a dataset before inference, or the ego vehicle stores models for all potential collaborators in advance. Therefore, we pose a new question: Can we tackle this challenge directly during inference, eliminating the need for joint training? To answer this, we introduce Progressive Heterogeneous Collaborative Perception (PHCP), a novel framework that formulates the problem as few-shot unsupervised domain adaptation. Unlike previous work, PHCP dynamically aligns features by self-training an adapter during inference, eliminating the need for labeled data and joint training. Extensive experiments on the OPV2V dataset demonstrate that PHCP achieves strong performance across diverse heterogeneous scenarios. Notably, PHCP achieves performance comparable to SOTA methods trained on the entire dataset while using only a small amount of unlabeled data.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}