
UAV(ドローン)とAMR(地上ロボット)は、互いに補完的な自律プラットフォームである。UAVは到達困難な領域へのアクセス、人命リスクの低減、迅速な展開といった利点を備える一方、バッテリー、ペイロード、気象、規制といった制約に直面する。AMRは工場内において高い稼働持続性、ペイロード、柔軟性を提供するが、垂直方向への到達範囲、地形上の制約、システム統合の手間といった代償を伴う。両者はいずれも6Gネットワークにおける第一級のエンドポイントとなる。
本研究では、新規の強化学習(RL)アルゴリズムを核とするエンドツーエンドの計画フレームワークを提案するとともに、CARLA/AirSimによるシミュレーションパイプラインおよび小規模な物理テストベッドを構築する。本フレームワークは閉ループ評価を対象とし、計画器の出力がエージェントを駆動して逐次的な誤差が累積する状況を扱う。設計は次の三つの性質に基づいて導かれる。すなわち、サイトやタスクを跨いだ転移を可能とする汎化性、学習時および推論時の計算効率、ならびに認識・ダイナミクス・目的関数を差し替え可能とするカスタマイズ性である。
四つのアルゴリズムはマルチエージェントRLという共通の骨格を持ちつつ、それぞれ異なる軸において特化されている。UA-MARL(Uncertainty-Aware Multi-Agent RL)はサンプル効率の向上を目的とする。ITDQN(Imitation-based Triple Deep Q-Learning)は探索と活用のバランスを取ることを意図して設計されている。FM-EAC(Feature Model-based Enhanced Actor-Critic)は学習効率と汎化性の改善を目指す。そして、EIA-SEC(Elite Imitation Actor-Shared Ensemble Critic)は学習効率とカスタマイズ性の向上を目標とする。
アルゴリズム開発は二つのシミュレータによって支えられる。一つはCARLAであり、近代的なレンダリングパイプライン、事前構築された都市マップ、TCP経由で遠隔制御可能なカメラ/LiDARなどのシミュレートセンサを備えたオープンソースの自動運転シミュレータで、AMR側の実験対象として自然な選択肢となる。もう一つはAirSimであり、Unreal Engineを基盤とし、プラットフォーム非依存のAPIを提供することからUAV向けの深層学習およびRL研究で広く用いられている。物理テストベッドは、DJI Tello UAV 4機、Raspberry Piコントローラ 4台、地上AMR 4台で構成され、今後さらにカメラ、IMU、LiDARの追加が計画されている。このテストベッドは、オペレータの介入が方策更新へとフィードバックされるヒューマン・イン・ザ・ループ実験を支援することを念頭に設計されている。
@inproceedings{Zhou2026b,
  title     = {Trajectory Planning for {UAV}-Based Smart Farming Using Imitation-Based Triple Deep {Q}-Learning},
  author    = {Zhou, Quanxi and Mao, Wencan and Couso Coddou, Tomás and Tsukada, Manabu and Liu, Yunling and Ji, Yusheng},
  year      = {2026},
  date      = {2026-06-01},
  urldate   = {2026-06-01},
  booktitle = {IEEE International Conference on Robotics \& Automation ({ICRA} 2026)},
  address   = {Vienna, Austria},
  abstract  = {Unmanned aerial vehicles (UAVs) have emerged as a promising auxiliary platform for smart agriculture, capable of simultaneously performing weed detection, recognition, and data collection from wireless sensors. However, trajectory planning for UAV-based smart agriculture is challenging due to the high uncertainty of the environment, partial observations, and limited battery capacity of UAVs. To address these issues, we formulate the trajectory planning problem as a Markov decision process (MDP) and leverage multi-agent reinforcement learning (MARL) to solve it. Furthermore, we propose a novel imitation-based triple deep Q-network (ITDQN) algorithm, which employs an elite imitation mechanism to reduce exploration costs and utilizes a mediator Q-network over a double deep Q-network (DDQN) to accelerate and stabilize training and improve performance. Experimental results in both simulated and real-world environments demonstrate the effectiveness of our solution. Moreover, our proposed ITDQN outperforms DDQN by 4.43\% in weed recognition rate and 6.94\% in data collection rate.},
  pubstate  = {published},
  tppubtype = {inproceedings}
}
@inproceedings{Zhou2026,
  title     = {Deep Reinforcement Learning for Automated Guided Vehicle Trajectory Planning in {Industry 4.0}},
  author    = {Zhou, Quanxi and Mao, Wencan and Xiao, Yu and Tsukada, Manabu and Ji, Yusheng},
  year      = {2026},
  date      = {2026-05-18},
  booktitle = {{INFOCOM} 2026 International Workshop on Fusion of Data, Operation, Information, and Communication Technology for Industry 4.0 and Society 5.0 ({DOICT-IndSoc})},
  abstract  = {Automated Guided Vehicles (AGVs) play a vital role in the Fourth Industrial Revolution (Industry 4.0), improving safety, time efficiency, and cost-effectiveness. While existing works focused on centralized or independent AGV control, we propose a distributed strategy for the large-scale, dynamic, and multi-functional environments of Industry 4.0. The proposed strategy enables AGVs to autonomously generate their material delivery trajectories while sharing information to support collaborative searching. Moreover, to enhance effectiveness and efficiency, we propose a Sub-task Agent Triple Deep Q-Network (SA-TDQN) algorithm, which decouples the actors for each sub-task mode, while incorporating a mediator Q-network between the online and target Q-networks. Experiments demonstrate that the proposed strategy is both feasible and effective. Furthermore, SA-TDQN consistently outperforms Deep Q-Network (DQN), Double DQN, and Triple DQN in terms of reward, training efficiency, and convergence stability, with comparable time complexity.},
  pubstate  = {published},
  tppubtype = {workshop}
}
@article{Zhou2025b,
  title         = {A Feature-Aware Elite-Imitation {MARL} for Multi-{UAV} Trajectory Optimization in Mountain Terrain Detection},
  author        = {Zhou, Quanxi and Tao, Ye and Su, Qianxiao and Tsukada, Manabu},
  url           = {https://www.mdpi.com/2504-446X/9/9/645/pdf},
  doi           = {10.3390/drones9090645},
  year          = {2025},
  date          = {2025-09-13},
  urldate       = {2025-09-13},
  journal       = {Drones},
  volume        = {9},
  number        = {9},
  pages         = {645},
  internal-note = {volume/number/article-number inferred from the MDPI URL path (2504-446X/9/9/645) — verify against the published record},
  pubstate      = {published},
  tppubtype     = {article}
}
@article{Zhou2025,
  title     = {Uncertainty-Aware Multi-Agent Reinforcement Learning for Anti-Interference Trajectory Planning of Cellular-Connected {UAVs}},
  author    = {Zhou, Quanxi and Mao, Wencan and Nakazato, Jin and Ji, Yusheng and Tsukada, Manabu},
  doi       = {10.1109/TVT.2025.3606201},
  issn      = {0018-9545},
  year      = {2025},
  date      = {2025-09-04},
  urldate   = {2025-09-09},
  journal   = {IEEE Transactions on Vehicular Technology},
  pages     = {1--17},
  abstract  = {Cellular-connected unmanned aerial vehicles (C-UAVs) will be an integral component of future wireless networks. Thanks to the mobility and maneuverability of UAVs, we can transform the interference management and route scheduling problems of C-UAVs into an anti-interference trajectory planning problem, aiming to jointly minimize the UAV mission time and transmission outage time. However, none of the existing methods have taken both the spatio-temporal uncertainty of interference sources and multi-UAV trajectory planning into consideration. To address this issue, we propose a novel method, referred to as uncertainty-aware multi-agent reinforcement learning (UA-MARL), for anti-interference trajectory planning of C-UAVs. In UA-MARL, a transmission outage probability (TOP) has been introduced to improve the robustness of the model. A transmission outage probability experience memory (TOPEM) has been designed to increase sample efficiency and reduce inference time. MARL algorithms integrated with an adaptive post-decision state (PDS) have been introduced to accelerate the convergence and stabilize the training. Experimental results show that UA-MARL outperforms baselines in average reward, convergence efficiency, and convergence stability. Furthermore, we find that higher residential density and wider considered area will lead to a decrease in training efficiency and stability.},
  pubstate  = {published},
  tppubtype = {article}
}
@article{Liu2025,
  title     = {Multi-Modal Trajectory Planning for Emergency-Oriented Air-Ground Collaborative Sensing and Communication},
  author    = {Liu, Yaxi and Zhou, Quanxi and Mao, Wencan and Li, Xulong and Huangfu, Wei and Tsukada, Manabu and Ji, Yusheng and Long, Keping},
  doi       = {10.1109/TCCN.2025.3585254},
  issn      = {2332-7731},
  year      = {2025},
  date      = {2025-07-04},
  urldate   = {2025-07-04},
  journal   = {IEEE Transactions on Cognitive Communications and Networking},
  volume    = {11},
  number    = {5},
  pages     = {3094--3111},
  abstract  = {To obtain real-time situational awareness of the world, air-ground collaborative sensing and communication provide a promising solution to form a pervasive cognitive communications and networking system. However, existing schemes struggle to cope with emergencies where ground base stations and Internet of Things devices are temporarily out-of-service. Motivated by this, we envision a novel emergency-oriented air-ground collaborative sensing and communication network where multi-modal cognitive entities (i.e., static/dynamic ground/aerial nodes) cooperatively collect data from IoT devices and simultaneously perform sensing functionality. In such a novel network, an optimization for joint trajectory planning and resource allocation is established to minimize both data transmission task delay and sensing task delay under the constraints of boundary, moving distance, accessible region, and energy consumption for network nodes. To tackle the problem, we propose a transfer learning-based deep reinforcement learning (DRL) framework where three advanced DRL algorithms are included. Such a framework can rapidly adapt to potentially updated environments by facilitating knowledge transfer across tasks for emergency rescue activities. The proposed framework outperforms three state-of-the-art baselines. Moreover, the newly introduced auxiliary cognitive entities facilitate the improvement of sensing and communication functionalities, and the proposed transfer learning-based scheme boosts convergence in fast-changing environments.},
  pubstate  = {published},
  tppubtype = {article}
}
@article{Zhou2024,
  title     = {Cellular Connected {UAV} Anti-Interference Path Planning Based on {PDS-DDPG} and {TOPEM}},
  author    = {Zhou, Quanxi and Wang, Yongjing and Shen, Ruiyu and Nakazato, Jin and Tsukada, Manabu and Guan, Zhenyu},
  doi       = {10.1109/JMASS.2024.3490762},
  issn      = {2576-3164},
  year      = {2024},
  date      = {2024-11-04},
  urldate   = {2024-11-04},
  journal   = {IEEE Journal on Miniaturization for Air and Space Systems},
  abstract  = {Due to the randomness of channel fading, communication devices, and malicious interference sources, unmanned aerial vehicles (UAVs) face a complex and ever-changing task scenario, which poses significant communication security challenges, such as transmission outages. Fortunately, these communication security challenges can be transformed into path planning problems that minimize the weighted sum of UAV mission time and transmission outage time. In order to design the complex communication environment faced by UAVs in actual scenarios, we propose a system model, including building distribution, communication channel, and antenna design in this paper. Besides, we introduce other UAVs with fixed flight paths and ground interference resources with random locations to ensure mission UAVs have better anti-interference ability. However, it is challenging for classical search algorithms and heuristic algorithms to cope with the complex path problems mentioned above. In this paper, we propose an improved deep deterministic policy gradient (DDPG) algorithm with better performance compared with basic DDPG and DDQN algorithms. Specifically, a post-decision state (PDS) mechanism has been introduced to accelerate the convergence rate and enhance the stability of the training process. In addition, a transmission outage probability experience memory (TOPEM) has been designed to quickly generate wireless communication quality maps and provide temporary experience for the post-decision process, resulting in better training results. Simulation experiments have proven that, compared to basic DDPG, the improved algorithm increases training speed by at least 50\%, significantly improves convergence rate, and reduces the episode required for convergence to 20\%. It can also help UAVs choose better paths than basic DDPG and DDQN algorithms.},
  pubstate  = {published},
  tppubtype = {article}
}
@comment{Keyword tags exported alongside the entries above (not BibTeX entries; preserved verbatim):
autonomous driving v2x
digital twins extended reality
digital twins
autonomous driving machine learning
machine learning v2x
autonomous driving v2x
}