
UAVs (drones) and AMRs (ground robots) are complementary autonomy platforms. UAVs offer access to hard-to-reach areas, reduced risk to human life, and rapid deployment, but face battery, payload, weather, and regulatory limits. AMRs deliver high endurance, payload, and flexibility on the factory floor at the cost of vertical reach, terrain limits, and integration effort. Both will be first-class endpoints in 6G networks.
This research presents an end-to-end planning framework with novel reinforcement-learning (RL) algorithms, together with a CARLA/AirSim simulation pipeline and a small physical testbed. The framework targets closed-loop evaluation, where planner outputs drive the agent and successive errors compound, and is guided by three properties: generality (transfer across sites and tasks), efficiency (sample- and inference-time), and customizability (swappable perception, dynamics, and objectives).
Four algorithms share a multi-agent RL skeleton and specialize it along different axes. UA-MARL (Uncertainty-Aware Multi-Agent RL) aims to increase sample efficiency. ITDQN (Imitation-based Triple Deep Q-Learning) is designed for balancing exploration and exploitation. FM-EAC (Feature Model-based Enhanced Actor-Critic) targets improving training efficiency and generalizability. Finally, EIA-SEC (Elite Imitation Actor-Shared Ensemble Critic) has the goal of improving training efficiency and customizability.
Two simulators support algorithm development: CARLA, an open-source autonomous-driving simulator with a modern rendering pipeline, pre-made urban maps, and simulated camera/LiDAR sensors controlled remotely over TCP — the natural target for AMR-side experiments; and AirSim, an Unreal-Engine-based simulator with platform-independent APIs widely used for UAV deep-learning and RL research. The physical testbed comprises four DJI Tello UAVs, four Raspberry-Pi controllers, and four ground AMRs, with additional cameras, IMUs, and LiDAR planned. The setup is designed to support human-in-the-loop experimentation in which operator interventions feed back into policy updates.
@inproceedings{Zhou2026b,
  title     = {Trajectory Planning for {UAV}-Based Smart Farming Using Imitation-Based Triple Deep {Q}-Learning},
  author    = {Zhou, Quanxi and Mao, Wencan and Couso Coddou, Tomás and Tsukada, Manabu and Liu, Yunling and Ji, Yusheng},
  year      = {2026},
  date      = {2026-06-01},
  urldate   = {2026-06-01},
  booktitle = {IEEE International Conference on Robotics \& Automation (ICRA 2026)},
  address   = {Vienna, Austria},
  abstract  = {Unmanned aerial vehicles (UAVs) have emerged as a promising auxiliary platform for smart agriculture, capable of simultaneously performing weed detection, recognition, and data collection from wireless sensors. However, trajectory planning for UAV-based smart agriculture is challenging due to the high uncertainty of the environment, partial observations, and limited battery capacity of UAVs. To address these issues, we formulate the trajectory planning problem as a Markov decision process (MDP) and leverage multi-agent reinforcement learning (MARL) to solve it. Furthermore, we propose a novel imitation-based triple deep Q-network (ITDQN) algorithm, which employs an elite imitation mechanism to reduce exploration costs and utilizes a mediator Q-network over a double deep Q-network (DDQN) to accelerate and stabilize training and improve performance. Experimental results in both simulated and real-world environments demonstrate the effectiveness of our solution. Moreover, our proposed ITDQN outperforms DDQN by 4.43% in weed recognition rate and 6.94% in data collection rate.},
  keywords  = {},
  pubstate  = {published},
  tppubtype = {inproceedings}
}
@workshop{Zhou2026,
  title        = {Deep Reinforcement Learning for Automated Guided Vehicle Trajectory Planning in {Industry 4.0}},
  author       = {Zhou, Quanxi and Mao, Wencan and Xiao, Yu and Tsukada, Manabu and Ji, Yusheng},
  year         = {2026},
  date         = {2026-05-18},
  booktitle    = {INFOCOM 2026 International Workshop on Fusion of Data, Operation, Information, and Communication Technology for Industry 4.0 and Society 5.0 (DOICT-IndSoc)},
  abstract     = {Automated Guided Vehicles (AGVs) play a vital role in the Fourth Industrial Revolution (Industry 4.0), improving safety, time efficiency, and cost-effectiveness. While existing works focused on centralized or independent AGV control, we propose a distributed strategy for the large-scale, dynamic, and multi-functional environments of Industry 4.0. The proposed strategy enables AGVs to autonomously generate their material delivery trajectories while sharing information to support collaborative searching. Moreover, to enhance effectiveness and efficiency, we propose a Sub-task Agent Triple Deep Q-Network (SA-TDQN) algorithm, which decouples the actors for each sub-task mode, while incorporating a mediator Q-network between the online and target Q-networks. Experiments demonstrate that the proposed strategy is both feasible and effective. Furthermore, SA-TDQN consistently outperforms Deep Q-Network (DQN), Double DQN, and Triple DQN in terms of reward, training efficiency, and convergence stability, with comparable time complexity.},
  howpublished = {INFOCOM 2026 International Workshop on Fusion of Data, Operation, Information, and Communication Technology for Industry 4.0 and Society 5.0 (DOICT-IndSoc)},
  keywords     = {},
  pubstate     = {published},
  tppubtype    = {workshop}
}
@article{Zhou2025b,
  title     = {A Feature-Aware Elite-Imitation {MARL} for Multi-{UAV} Trajectory Optimization in Mountain Terrain Detection},
  author    = {Zhou, Quanxi and Tao, Ye and Su, Qianxiao and Tsukada, Manabu},
  url       = {https://www.mdpi.com/2504-446X/9/9/645/pdf},
  doi       = {10.3390/drones9090645},
  year      = {2025},
  date      = {2025-09-13},
  urldate   = {2025-09-13},
  journal   = {Drones},
  volume    = {9},
  number    = {9},
  pages     = {645},
  keywords  = {},
  pubstate  = {published},
  tppubtype = {article}
}
@article{Zhou2025,
  title     = {Uncertainty-Aware Multi-Agent Reinforcement Learning for Anti-Interference Trajectory Planning of Cellular-Connected {UAVs}},
  author    = {Zhou, Quanxi and Mao, Wencan and Nakazato, Jin and Ji, Yusheng and Tsukada, Manabu},
  doi       = {10.1109/TVT.2025.3606201},
  issn      = {0018-9545},
  year      = {2025},
  date      = {2025-09-04},
  urldate   = {2025-09-09},
  journal   = {IEEE Transactions on Vehicular Technology},
  pages     = {1--17},
  abstract  = {Cellular-connected unmanned aerial vehicles (C-UAVs) will be an integral component of future wireless networks. Thanks to the mobility and maneuverability of UAVs, we can transform the interference management and route scheduling problems of C-UAVs into an anti-interference trajectory planning problem, aiming to jointly minimize the UAV mission time and transmission outage time. However, none of the existing methods have taken both the spatio-temporal uncertainty of interference sources and multi-UAV trajectory planning into consideration. To address this issue, we propose a novel method, referred to as uncertainty-aware multi-agent reinforcement learning (UA-MARL), for anti-interference trajectory planning of C-UAVs. In UA-MARL, a transmission outage probability (TOP) has been introduced to improve the robustness of the model. A transmission outage probability experience memory (TOPEM) has been designed to increase sample efficiency and reduce inference time. MARL algorithms integrated with an adaptive post-decision state (PDS) have been introduced to accelerate the convergence and stabilize the training. Experimental results show that UA-MARL outperforms baselines in average reward, convergence efficiency, and convergence stability. Furthermore, we find that higher residential density and wider considered area will lead to a decrease in training efficiency and stability.},
  keywords  = {},
  pubstate  = {published},
  tppubtype = {article}
}
@article{Liu2025,
  title     = {Multi-Modal Trajectory Planning for Emergency-Oriented Air-Ground Collaborative Sensing and Communication},
  author    = {Liu, Yaxi and Zhou, Quanxi and Mao, Wencan and Li, Xulong and Huangfu, Wei and Tsukada, Manabu and Ji, Yusheng and Long, Keping},
  doi       = {10.1109/TCCN.2025.3585254},
  issn      = {2332-7731},
  year      = {2025},
  date      = {2025-07-04},
  urldate   = {2025-07-04},
  journal   = {IEEE Transactions on Cognitive Communications and Networking},
  volume    = {11},
  number    = {5},
  pages     = {3094--3111},
  abstract  = {To obtain real-time situational awareness of the world, air-ground collaborative sensing and communication provide a promising solution to form a pervasive cognitive communications and networking system. However, existing schemes struggle to cope with emergencies where ground base stations and Internet of Things devices are temporarily out-of-service. Motivated by this, we envision a novel emergency-oriented air-ground collaborative sensing and communication network where multi-modal cognitive entities (i.e., static/dynamic ground/aerial nodes) cooperatively collect data from IoT devices and simultaneously perform sensing functionality. In such a novel network, an optimization for joint trajectory planning and resource allocation is established to minimize both data transmission task delay and sensing task delay under the constraints of boundary, moving distance, accessible region, and energy consumption for network nodes. To tackle the problem, we propose a transfer learning-based deep reinforcement learning (DRL) framework where three advanced DRL algorithms are included. Such a framework can rapidly adapt to potentially updated environments by facilitating knowledge transfer across tasks for emergency rescue activities. The proposed framework outperforms three state-of-the-art baselines. Moreover, the newly introduced auxiliary cognitive entities facilitate the improvement of sensing and communication functionalities, and the proposed transfer learning-based scheme boosts convergence in fast-changing environments.},
  keywords  = {},
  pubstate  = {published},
  tppubtype = {article}
}
@article{Zhou2024,
  title     = {Cellular Connected {UAV} Anti-Interference Path Planning Based on {PDS-DDPG} and {TOPEM}},
  author    = {Zhou, Quanxi and Wang, Yongjing and Shen, Ruiyu and Nakazato, Jin and Tsukada, Manabu and Guan, Zhenyu},
  doi       = {10.1109/JMASS.2024.3490762},
  issn      = {2576-3164},
  year      = {2024},
  date      = {2024-11-04},
  urldate   = {2024-11-04},
  journal   = {IEEE Journal on Miniaturization for Air and Space Systems},
  abstract  = {Due to the randomness of channel fading, communication devices, and malicious interference sources, unmanned aerial vehicles (UAVs) face a complex and ever-changing task scenario, which poses significant communication security challenges, such as transmission outages. Fortunately, these communication security challenges can be transformed into path planning problems that minimize the weighted sum of UAV mission time and transmission outage time. In order to design the complex communication environment faced by UAVs in actual scenarios, we propose a system model, including building distribution, communication channel, and antenna design in this paper. Besides, we introduce other UAVs with fixed flight paths and ground interference resources with random locations to ensure mission UAVs have better anti-interference ability. However, it is challenging for classical search algorithms and heuristic algorithms to cope with the complex path problems mentioned above. In this paper, we propose an improved deep deterministic policy gradient (DDPG) algorithm with better performance compared with basic DDPG and DDQN algorithms. Specifically, a post-decision state (PDS) mechanism has been introduced to accelerate the convergence rate and enhance the stability of the training process. In addition, a transmission outage probability experience memory (TOPEM) has been designed to quickly generate wireless communication quality maps and provide temporary experience for the post-decision process, resulting in better training results. Simulation experiments have proven that, compared to basic DDPG, the improved algorithm increases training speed by at least 50%, significantly improves convergence rate, and reduces the episode required for convergence to 20%. It can also help UAVs choose better paths than basic DDPG and DDQN algorithms.},
  keywords  = {},
  pubstate  = {published},
  tppubtype = {article}
}
uav
autonomous driving v2x
v2x
digital twins extended reality
digital twins
autonomous driving machine learning
machine learning v2x
autonomous driving v2x
We are part of the University of Tokyo’s Graduate School of Information Science and Technology, Department of Creative Informatics, and focus on computer networks and cyber-physical systems.
Address
4F, I-REF building, Graduate School of Information Science and Technology, The University of Tokyo, 1-1-1, Yayoi, Bunkyo-ku, Tokyo, 113-8657 Japan
Room 91B1, Bld 2 of Engineering Department, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-8656, Japan
Mail: