塚田研究室は2019年、東京大学大学院 情報理工学系研究科 創造情報学専攻に設立されました。コンピュータネットワークとサイバーフィジカルシステムを基盤に、協調型自動運転、混合現実、次世代通信、没入型メディアなど幅広い研究に取り組んでいます。
塚田研究室は2019年、東京大学大学院 情報理工学系研究科 創造情報学専攻に設立されました。コンピュータネットワークとサイバーフィジカルシステムを基盤に、協調型自動運転、混合現実、次世代通信、没入型メディアなど幅広い研究に取り組んでいます。
Highlights from CIV Summer School 2025 in Blonay, Switzerland 🇨🇭✨
A week full of inspiring lectures, poster sessions, hackathons, and cultural exchanges — combining academic excellence with great community spirit.
Looking forward to the next edition! 🚀
#CIVSummerSchool #ResearchLife #CooperativeIntelligence #AcademicEvents
9月 5
We present two works at IEEE MetaCom 2025 at Seoul, Republic of Korea.
- Shangkai Zhang, Alex Orsholits, Ehsan Javanmardi, Manabu Tsukada, “AWSIM-VR: A Tightly-Coupled Virtual Reality Extension for Human-in-the-Loop Pedestrian-Autonomous Vehicle Interaction”, In: 3rd Annual IEEE International Conference on Metaverse Computing, Networking, and Applications (IEEE MetaCom 2025), Seoul, Republic of Korea, 2025.
- Naren Bao, Alex Orsholits, Manabu Tsukada, “4D Path Planning via Spatiotemporal Voxels in Urban Airspaces”, In: 3rd Annual IEEE International Conference on Metaverse Computing, Networking, and Applications (IEEE MetaCom 2025), Seoul, Republic of Korea, 2025.
9月 2
Experiments in the Kashiwa campus
8月 5
We have two presentations and two workshop keynote talks in IEEE IV2025 @ Cluj - Napoca, Romania.
- Yun Li, Ehsan Javanmardi, Simon Thompson, Kai Katsumata, Alex Orsholits, Manabu Tsukada, “PrefDrive: Enhancing Autonomous Driving through Preference-Guided Large Language Models”, In: 36th IEEE Intelligent Vehicles Symposium (IV2025), Cluj-Napoca, Romania, 2025.
- Yuze Jiang, Ehsan Javanmardi, Manabu Tsukada, Hiroshi Esaki, “Towards Efficient Roadside LiDAR Deployment: A Fast Surrogate Metric Based on Entropy-Guided Visibility”, In: 36th IEEE Intelligent Vehicles Symposium (IV2025), Cluj-Napoca, Romania, 2025.
- Manabu Tsukada, “V2X Communication Technologies in the Era of End-to-End Autonomous Driving”, Keynote talk at The 2nd Workshop on Secure connected vehicles: Digital Twin, UAVs, and Smart Transportation, at IEEE IV 2025, Cluj-Napoca, Romania, 22.06.2025.
- Ehsan Javanmardi, “From Lab to Road: Advances and Challenges in V2X Cooperative Perception for AVs”, Keynote talk at 14th Workshop and Industry Panel on Cooperative Automated Driving and Future Mobility Systems at IEEE IV2025, Cluj - Napoca, Romania, 22.06.2025.
6月 25
Prof. Tsukada will present “Enabling Cooperative End-to-End Autonomous Driving: Feature Sharing, Adaptation, and Preference Learning” at Workshop on Foundation Models for V2X-Based Cooperative Autonomous Driving In conjunction with CVPR 2025 at Nashville, USA on 12.06.2025. https://drivex-workshop.github.io/
6月 10
Research camp for CREST Internet of Realities Project
6月 4
The visit of WIDE project members #utokyo #東京大学
5月 16
#UTokyo Kashiwa campus autonomous driving experiment 東京大学
4月 18
Farewell party #utokyo #東京大学
4月 2
情報処理学会第87回全国大会 情報通信科学のグランドチャレンジ #UTokyo #東京大学
3月 14
私たちの研究室では、自動運転、混合現実、次世代通信、デジタルツインなど、分野横断的なプロジェクトを推進しています。いずれのテーマも、理論研究から社会実装まで一貫して取り組み、実証実験や国際標準化活動を通じて社会に還元しています。現在は以下のようなプロジェクトに注力しています。
@article{Hu2025c,
title = {DoA Estimation and Kalman Filter based Multi-Antenna System for Vehicle Position in mmWave Network},
author = {Dou Hu and Jin Nakazato and Javanmardi Ehsan and Kazuki Maruta and Rui Dinis and Manabu Tsukada},
doi = {10.1109/OJVT.2025.3608747},
year = {2025},
date = {2025-09-14},
urldate = {2025-09-14},
journal = {IEEE Open Journal of Vehicular Technology},
abstract = {With the advancement of Beyond 5G/6G technologies, accurate positioning and velocity estimation in Internet of Vehicles (IoV) systems has become increasingly critical. Although GPS can provide real-time location information, its performance degrades significantly in environments with heavy obstructions, such as urban areas surrounded by skyscrapers. To address this limitation, this study proposes a positioning framework that relies on channel parameter estimation derived from multi-antenna signal processing. Specifically, we adopt an adaptive low-complexity 2D MUSIC (ALC2D-MUSIC) algorithm to estimate signal directions, and further apply an unscented Kalman filter (UKF) using extracted Direction of Arrival (DoA) and Time of Arrival (ToA) information to estimate vehicle positions and velocities. The proposed system is robust to variations in road geometry, making it suitable for deployment in diverse traffic environments. Simulation results demonstrate that our method achieves high estimation accuracy and outperforms a compressive sensing-based approach across different SNR levels, angular search resolutions, and antenna array sizes. Furthermore, the UKF-based tracking algorithm shows superior performance in curved road scenarios, validating its effectiveness under realistic mobility conditions.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
@article{Zhou2025b,
title = {A Feature-Aware Elite-Imitation MARL for Multi-UAV Trajectory Optimization in Mountain Terrain Detection},
author = {Quanxi Zhou and Ye Tao and Qianxiao Su and Manabu Tsukada},
year = {2025},
date = {2025-09-13},
journal = {Drones},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
@article{Chauhan2025,
title = {Towards the Future of Pedestrian-AV Interaction: Human Perception vs. LLM Insights on Smart Pole Interaction Unit in Shared Spaces},
author = {Vishal Chauhan and Anubhav Anubhav and Chia-Ming Chang and Xiang Su and Jin Nakazato and Ehsan Javanmardi and Alex Orsholits and Takeo Igarashi and Kantaro Fujiwara and Manabu Tsukada},
isbn = {1071-5819},
year = {2025},
date = {2025-09-13},
journal = {International Journal of Human–Computer Studies (IJHCS)},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
@workshop{Hu2025,
title = {A Low PAPR Layered Multi-User OTFS Modulation},
author = {Dou Hu and Jin Nakazato and Kazuki Maruta and Omid Abbassi Aghda and Rui Dinis and Manabu Tsukada},
year = {2025},
date = {2025-06-17},
urldate = {2025-06-17},
booktitle = {AI-Driven Connectivity for Vehicular and Wireless Networks in VTC2025-Spring},
address = {Oslo, Norway},
abstract = {In modern communication systems, meeting the growing demand for high-capacity transmission requires developing efficient and robust modulation techniques. To address this, we propose a low-PAPR page-style Orthogonal Time Frequency Space (OTFS) modulation framework that enhances communication capacity while maintaining a low peak-to-average power ratio (PAPR). The proposed design introduces a novel pilot signal placement and analysis method, improving channel estimation accuracy and system performance in high-mobility multi-user scenarios. This paper provides an overview of recent advancements in OTFS-based multi-user communication systems, emphasizing their contributions to enhancing spectral efficiency, reliability, and robustness. Through extensive simulations, we demonstrate the effectiveness of the proposed framework in achieving superior BER performance, improved interference mitigation, and robust transmission capabilities compared to traditional methods, validating its suitability for next-generation communication networks.},
howpublished = {Workshop on AI-Driven Connectivity for Vehicular and Wireless Networks in VTC2025-Spring},
note = {IEEE VTS Tokyo/Japan Chapter 2025 Young Researcher's Encouragement Award},
keywords = {},
pubstate = {published},
tppubtype = {workshop}
}
@misc{Orsholits2025b,
title = {Edge Vision AI Co-Processing for Dynamic Context Awareness in Mixed Reality},
author = {Alex Orsholits and Manabu Tsukada},
url = {https://www.youtube.com/watch?v=xxahKZl4K9w
https://ieeevr.org/2025/awards/conference-awards/#poster-honorable},
doi = {10.1109/VRW66409.2025.00293},
year = {2025},
date = {2025-03-08},
urldate = {2025-03-08},
booktitle = {2025 IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and Workshops (VRW)},
address = {Saint-Malo, France},
abstract = {Spatial computing is evolving towards leveraging data streaming for computationally demanding applications, facilitating a shift to lightweight, untethered, and standalone devices. These devices are ideal candidates for co-processing, where real-time scene context understanding and low-latency data streaming are fundamental for general-purpose Mixed Reality (MR) experiences. This poster demonstrates and evaluates a scalable approach to augmented contextual understanding in MR by implementing edge AI co-processing through a Hailo-8 AI accelerator, a low-power ARM-based single board computer (SBC), and the Magic Leap 2 AR headset. The resulting inferences are streamed back to the headset for spatial reprojection into the user’s vision.},
howpublished = {IEEE VR 2025, Poster},
note = {Honorable mention},
keywords = {},
pubstate = {published},
tppubtype = {misc}
}
@conference{山根那夢達2025,
title = {低遅延パケット冗長化手法の移動環境におけるフィールド実験評価},
author = {山根那夢達 and 中里仁 and 伊藤広記 and 三上学 and 土屋貴寛 and 塚田学 and 江崎浩},
year = {2025},
date = {2025-01-30},
urldate = {2025-01-30},
booktitle = {コミュニケーションクオリティ研究会},
address = {博多},
abstract = {本研究では, 筆者らが研究してきたIP層以上のパケットをGENEVEヘッダーでカプセル化し複数経路から送信する手法を使い, 単一の通信事業者上で異なる周波数帯域を持つ複数のモバイル経路を用いて実車両によるフィールド実験を行い, 同手法の評価を行った.},
note = {学生優秀発表賞},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}
@inproceedings{Meng2025b,
title = {Patch Exploration-Based Route Planning for Autonomous Vehicles},
author = {Huan Meng and Jinhui Zhang and Xiaobing Huang and Ehsan Javanmardi and Manabu Tsukada},
year = {2025},
date = {2025-11-18},
urldate = {2025-11-18},
booktitle = {28th IEEE International Conference on Intelligent Transportation Systems (ITSC2025)},
address = {Gold Coast, Australia},
abstract = {Route planning is a crucial component of autonomous
driving, making it essential to develop efficient planning
methods tailored to different scenarios. To address the
limitations of the existing RRT* methods, we propose a
Patch Exploration RRT* (PE-RRT*). First, separating
hyperplanes and half-spaces are utilized to construct patch
for each node in the tree structure, allowing adaptation to
the environmental density. Additionally, during the
sampling process, the patches guide the expansion of the
random tree, enabling rapid searches and efficient use of
sampled points. Moreover, the patches are applied for route
pruning, simplifying the route representation. Simulation
results demonstrate that the proposed method achieves high
search efficiency across various scenarios. },
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
@inproceedings{Si2025,
title = {You Share Beliefs, I Adapt: Progressive Heterogeneous Collaborative Perception},
author = {Hao Si and Ehsan Javanmardi and Manabu Tsukada},
url = {https://sihaoo1.github.io/PHCP_Page/
https://arxiv.org/abs/2509.09310
https://github.com/sihaoo1/PHCP},
year = {2025},
date = {2025-10-19},
urldate = {2025-10-19},
booktitle = {International Conference on Computer Vision (ICCV2025)},
address = {Honolulu, Hawai'i},
abstract = {Collaborative perception enables vehicles to overcome individual perception limitations by sharing information, allowing them to see further and through occlusions. In real-world scenarios, models on different vehicles are often heterogeneous due to manufacturer variations. Existing methods for heterogeneous collaborative perception address this challenge by fine-tuning adapters or the entire network to bridge the domain gap. However, these methods are impractical in real-world applications, as each new collaborator must undergo joint training with the ego vehicle on a dataset before inference, or the ego vehicle stores models for all potential collaborators in advance. Therefore, we pose a new question: Can we tackle this challenge directly during inference, eliminating the need for joint training? To answer this, we introduce Progressive Heterogeneous Collaborative Perception (PHCP), a novel framework that formulates the problem as few-shot unsupervised domain adaptation. Unlike previous work, PHCP dynamically aligns features by self-training an adapter during inference, eliminating the need for labeled data and joint training. Extensive experiments on the OPV2V dataset demonstrate that PHCP achieves strong performance across diverse heterogeneous scenarios. Notably, PHCP achieves performance comparable to SOTA methods trained on the entire dataset while using only a small amount of unlabeled data.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
@inproceedings{Li2025d,
title = {Multi-PrefDrive: Optimizing Large Language Models for Autonomous Driving Through Multi-Preference Tuning},
author = {Yun Li and Ehsan Javanmardi and Simon Thompson and Kai Katsumata and Alex Orsholits and Manabu Tsukada},
url = {https://liyun0607.github.io/},
year = {2025},
date = {2025-10-19},
booktitle = {2025 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)},
address = {Hangzhou, China},
abstract = {This paper introduces Multi-PrefDrive, a framework that significantly enhances LLM-based autonomous driving through multidimensional preference tuning. Aligning LLMs with human driving preferences is crucial yet challenging, as driving scenarios involve complex decisions where multiple incorrect actions can correspond to a single correct choice. Traditional binary preference tuning fails to capture this complexity. Our approach pairs each chosen action with multiple rejected alternatives, better reflecting real-world driving decisions. By implementing the Plackett-Luce preference model, we enable nuanced ranking of actions across the spectrum of possible errors. Experiments in the CARLA simulator demonstrate that our algorithm achieves an 11.0% improvement in overall score and an 83.6% reduction in
infrastructure collisions, while showing perfect compliance with traffic signals in certain environments. Comparative analysis against DPO and its variants reveals that Multi-PrefDrive’s superior discrimination between chosen and rejected actions, which achieving a margin value of 25, and such ability has been directly translates to enhanced driving performance. We implement memory-efficient techniques including LoRA and 4-bit quantization to enable deployment on consumer-grade hardware and will open-source our training code and multi-rejected dataset to advance research in LLM-based autonomous driving systems. Project Page (https://liyun0607.github.io/)},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
@inproceedings{Meng2025,
title = {Robust composite control strategy for constrained continuous-time nonlinear systems},
author = {Huan Meng and Jinhui Zhang and Manabu Tsukada},
year = {2025},
date = {2025-10-05},
booktitle = {IEEE International Conference on Systems, Man, and Cybernetics (SMC2025)},
address = {Vienna, Austria},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
@inproceedings{Li2025b,
title = {State-Guided Spatial Cross-Attention for Enhanced End-to-End Autonomous Driving},
author = {Dongyang Li and Ehsan Javanmardi and Manabu Tsukada},
year = {2025},
date = {2025-09-30},
urldate = {2025-09-30},
booktitle = {IEEE International Automated Vehicle Validation Conference (IAVVC 2025)},
address = {Baden-Baden, Germany},
abstract = {Handling near-accident scenarios is a significant challenge for end-to-end autonomous driving (E2E-AD), as these situations often involve sudden environmental changes, complex interactions with other road users, and high-risk decision-making under uncertainty. Unlike routine driving tasks, near-accident scenarios require rapid and precise responses based on external perception and internal vehicle dynamics. Successfully navigating such situations demands not only a comprehensive understanding of the surrounding environment but also an accurate assessment of the ego vehicle's state, including speed, acceleration, and steering angle, to ensure safe and reliable control. However, conventional E2E-AD models struggle to handle these safety-critical situations effectively. Standard approaches primarily rely on raw sensor inputs to learn driving policies, often overlooking the crucial role of vehicle state information in decision-making. Since many near-accident scenarios involve conditions where the same environmental observation could require vastly different responses depending on the ego vehicle's motion state-such as whether the vehicle is braking, accelerating, or experiencing traction loss-ignoring these internal dynamics can lead to unsafe or suboptimal actions. Furthermore, E2E-AD models typically learn a direct mapping from sensory inputs to control outputs, making it difficult to generalize to highly dynamic and unpredictable interactions, such as emergency evasive maneuvers or sudden braking events. To address these challenges, we propose a state-guided cross-attention mechanism that explicitly models the interaction between the ego vehicle's states and its perception of the environment. By incorporating vehicle state information into the decision-making process, our approach ensures that the model can dynamically adjust its attention to critical sensory inputs based on real-time driving conditions. This allows the autonomous system to make more context-aware decisions, improving its ability to respond effectively to complex and safety-critical scenarios.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
@inproceedings{Bao2025,
title = {4D Path Planning via Spatiotemporal Voxels in Urban Airspaces},
author = {Naren Bao and Alex Orsholits and Manabu Tsukada},
year = {2025},
date = {2025-08-27},
booktitle = {3rd Annual IEEE International Conference on Metaverse Computing, Networking, and Applications (IEEE MetaCom 2025)},
address = {Seoul, Republic of Korea},
abstract = {This paper presents an approach to four-dimensional (4D) path planning for unmanned aerial vehicles (UAVs) in complex urban environments. We introduce a spatiotemporal voxel-based representation that effectively models both spatial and temporal dimensions of urban airspaces. By integrating the 4D spatio-temporal ID framework with reinforcement learning techniques, our system generates efficient and safe flight paths while considering dynamic obstacles and environmental constraints. The proposed method combines off-line pretraining and online fine-tuning of reinforcement learning models to achieve computational efficiency without compromising path quality. Experiments conducted using PLATEAU datasets in various urban scenarios demonstrate that our approach outperforms traditional path planning algorithms by 24% in safety metrics and 18% in efficiency metrics. Our framework advances the state-of-the-art in urban air mobility by providing a scalable solution for airspace management in increasingly congested urban environments.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Highlights from CIV Summer School 2025 in Blonay, Switzerland 🇨🇭✨
A week full of inspiring lectures, poster sessions, hackathons, and cultural exchanges — combining academic excellence with great community spirit.
Looking forward to the next edition! 🚀
#CIVSummerSchool #ResearchLife #CooperativeIntelligence #AcademicEvents
9月 5
We present two works at IEEE MetaCom 2025 at Seoul, Republic of Korea.
- Shangkai Zhang, Alex Orsholits, Ehsan Javanmardi, Manabu Tsukada, “AWSIM-VR: A Tightly-Coupled Virtual Reality Extension for Human-in-the-Loop Pedestrian-Autonomous Vehicle Interaction”, In: 3rd Annual IEEE International Conference on Metaverse Computing, Networking, and Applications (IEEE MetaCom 2025), Seoul, Republic of Korea, 2025.
- Naren Bao, Alex Orsholits, Manabu Tsukada, “4D Path Planning via Spatiotemporal Voxels in Urban Airspaces”, In: 3rd Annual IEEE International Conference on Metaverse Computing, Networking, and Applications (IEEE MetaCom 2025), Seoul, Republic of Korea, 2025.
9月 2
Experiments in the Kashiwa campus
8月 5
We have two presentations and two workshop keynote talks in IEEE IV2025 @ Cluj - Napoca, Romania.
- Yun Li, Ehsan Javanmardi, Simon Thompson, Kai Katsumata, Alex Orsholits, Manabu Tsukada, “PrefDrive: Enhancing Autonomous Driving through Preference-Guided Large Language Models”, In: 36th IEEE Intelligent Vehicles Symposium (IV2025), Cluj-Napoca, Romania, 2025.
- Yuze Jiang, Ehsan Javanmardi, Manabu Tsukada, Hiroshi Esaki, “Towards Efficient Roadside LiDAR Deployment: A Fast Surrogate Metric Based on Entropy-Guided Visibility”, In: 36th IEEE Intelligent Vehicles Symposium (IV2025), Cluj-Napoca, Romania, 2025.
- Manabu Tsukada, “V2X Communication Technologies in the Era of End-to-End Autonomous Driving”, Keynote talk at The 2nd Workshop on Secure connected vehicles: Digital Twin, UAVs, and Smart Transportation, at IEEE IV 2025, Cluj-Napoca, Romania, 22.06.2025.
- Ehsan Javanmardi, “From Lab to Road: Advances and Challenges in V2X Cooperative Perception for AVs”, Keynote talk at 14th Workshop and Industry Panel on Cooperative Automated Driving and Future Mobility Systems at IEEE IV2025, Cluj - Napoca, Romania, 22.06.2025.
6月 25
Prof. Tsukada will present “Enabling Cooperative End-to-End Autonomous Driving: Feature Sharing, Adaptation, and Preference Learning” at Workshop on Foundation Models for V2X-Based Cooperative Autonomous Driving In conjunction with CVPR 2025 at Nashville, USA on 12.06.2025. https://drivex-workshop.github.io/
6月 10
Research camp for CREST Internet of Realities Project
6月 4
The visit of WIDE project members #utokyo #東京大学
5月 16
#UTokyo Kashiwa campus autonomous driving experiment 東京大学
4月 18
Farewell party #utokyo #東京大学
4月 2
情報処理学会第87回全国大会 情報通信科学のグランドチャレンジ #UTokyo #東京大学
3月 14