Cai Panpan's Home Page

August 2023 IEEE Transactions on Robotics

Closing the Planning-Learning Loop with Application to Autonomous Driving

Real-time planning under uncertainty is critical for robots operating in complex dynamic environments. Consider, for example, an autonomous robot vehicle driving in dense, unregulated urban traffic of cars, motorcycles, buses, etc.. The robot vehicle has to plan in both short and long terms, in order to interact with many traffic participants of uncertain intentions and drive effectively. Planning explicitly over a long time horizon, however, incurs prohibitive computational cost and is impractical under real-time constraints. To achieve real-time performance for large-scale planning, this work introduces a new algorithm Learning from Tree Search for Driving (LeTS-Drive), which integrates planning and learning in a closed loop, and applies it to autonomous driving in crowded urban traffic in simulation. Specifically, LeTS-Drive learns a policy and its value function from data provided by an online planner, which searches a sparsely-sampled belief tree; the online planner in turn uses the learned policy and value functions as heuristics to scale up its run-time performance for real-time robot control. These two steps are repeated to form a closed loop so that the planner and the learner inform each other and improve in synchrony. The algorithm learns on its own in a self-supervised manner, without human effort on explicit data labeling. Experimental results demonstrate that LeTS-Drive outperforms either planning or learning alone, as well as open-loop integration of planning and learning.

PDF

Mohamad H. Danesh*, Panpan Cai* (corresponding author), David Hsu

December 2022 Conference on Robot Learning (CORL) (Best paper finalist)

LEADER: Learning Attention over Driving Behaviors for Planning under Uncertainty

Uncertainty on human behaviors poses a significant challenge to autonomous driving in crowded urban environments. The partially observable Markov decision processes (POMDPs) offer a principled framework for planning under uncertainty, often leveraging Monte Carlo sampling to achieve online performance for complex tasks. However, sampling also raises safety concerns by potentially missing critical events. To address this, we propose a new algorithm, LEarning Attention over Driving bEhavioRs (LEADER), that learns to attend to critical human behaviors during planning. LEADER learns a neural network generator to provide attention over human behaviors in real-time situations. It integrates the attention into a belief-space planner, using importance sampling to bias reasoning towards critical events. To train the algorithm, we let the attention generator and the planner form a min-max game. By solving the min-max game, LEADER learns to perform risk-aware planning without human labeling.

PDF

Yuanfu Luo*, Panpan Cai* (co-first and corresponding author), Yiyuan Lee, David Hsu

February 2022 Robotics and Automation Letter

GAMMA: A General Agent Motion Model for Autonomous Driving

This paper presents GAMMA, a general motion prediction model that enables large-scale real-time simulation and planning for autonomous driving. GAMMA models heterogeneous, interactive traffic agents that operate under diverse road conditions, with various geometric and kinematic constraints. GAMMA treats the prediction task as constrained optimization in traffic agents’ velocity space. The objective is to optimize an agent’s driving performance, while obeying all the constraints resulting from the agent’s kinematics, collision avoidance with other agents, and the environmental context. Further, GAMMA explicitly conditions the prediction on human behavioral states as parameters of the optimization model, in order to account for versatile human behaviors. We evaluated GAMMA on a set of real-world benchmark datasets. The results show that GAMMA achieves high prediction accuracy on both homogeneous and heterogeneous traffic datasets, with sub-millisecond execution time. Further, the computational efficiency and the flexibility of GAMMA enable (i) simulation of mixed urban traffic at many locations worldwide and (ii) planning for autonomous driving in dense traffic with uncertain driver behaviors, both in real-time. The open-source code of GAMMA is available online.

PDF Code Dataset

Yiyuan Lee, Panpan Cai, David Hsu

July 2021 Robotics: Science & Systems

MAGIC: Learning Macro-Actions for Online POMDP Planning using Generator-Critic

When robots operate in the real world, they need to handle uncertainties in sensing, acting, and the environment dynamics. Many tasks also require reasoning about long-term consequences of robot decisions. The partially observable Markov decision process (POMDP) offers a principled approach for planning under uncertainty. However, its computational complexity grows exponentially with the planning horizon. We propose to use temporally-extended macro-actions to cut down the effective planning horizon and thus the exponential factor of the complexity. We propose Macro-Action Generator-Critic (MAGIC), an algorithm that learns a macro-action generator using feedback from a planner, and in turn uses the learned macro-actions to condition long-horizon planning. Importantly, the generator is learned to directly maximize the down-stream planning performance. We evaluate MAGIC on several long-term planning tasks, showing that it significantly outperforms planning using primitive actions and hand-crafted macro-actions in both simulation and on a real robot.

PDF Code

Panpan Cai*, Yiyuan Lee* (co-first author), Yuanfu Luo, David Hsu

September 2020 Presented at ICRA 2020

SUMMIT: A Simulator for Urban Driving in Massive Mixed Traffic

Autonomous driving in an unregulated urban crowd is an outstanding challenge, especially, in the presence of many aggressive, high-speed traffic participants. This paper presents SUMMIT, a high-fidelity simulator that facilitates the development and testing of crowd-driving algorithms. By leveraging the open-source OpenStreetMap map database and a heterogeneous multi-agent motion prediction model developed in our earlier work, SUMMIT simulates dense, unregulated urban traffic for heterogeneous agents at any worldwide loca- tions that OpenStreetMap supports. SUMMIT is built as an extension of CARLA and inherits from it the physical and visual realism for autonomous driving simulation. SUMMIT supports a wide range of applications, including perception, vehicle control and planning, end-to-end learning. We provide a context-aware planner together with benchmark scenarios and show that SUMMIT generates complex, realistic traffic behaviors in challenging crowd-driving settings.

PDF Code

Panpan Cai, Yuanfu Luo, Aseem Saxena, David Hsu, Wee Sun Lee

June 2019 Robotics: Science & Systems

LeTS-Drive: Driving in a Crowd by Learning from Tree Search

Autonomous driving in a crowded environment, e.g., a busy traffic intersection, is an unsolved challenge for robotics. The robot vehicle must contend with a dynamic and partially observable environment, noisy sensors, and many agents. A principled approach is to formalize it as a Partially Observable Markov Decision Process (POMDP) and solve it through online belief-tree search. To handle a large crowd and achieve realtime performance in this very challenging setting, we propose LeTS-Drive, which integrates online POMDP planning and deep learning. It consists of two phases. In the offline phase, we learn a policy and the corresponding value function by imitating the belief tree search. In the online phase, the learned policy and value function guide the belief tree search. LeTS-Drive leverages the robustness of planning and the runtime efficiency of learning to enhance the performance of both. Experimental results in simulation show that LeTS-Drive outperforms either planning or imitation learning alone and develops sophisticated driving skills.

PDF

Panpan Cai, Yuanfu Luo, David Hsu, Wee Sun Lee

June 2018 Robotics: Science & Systems

HyP-DESPOT: A hybrid parallel algorithm for online planning under uncertainty

Planning under uncertainty is critical for robust robot performance in uncertain, dynamic environments, but it incurs high computational cost. State-of-the-art online search algorithms, such as DESPOT, have vastly improved the computational efficiency of planning under uncertainty and made it a valuable tool for robotics in practice. This work takes one step further by leveraging both CPU and GPU parallelization in order to achieve near real-time online planning performance for complex tasks with large state, action, and observation spaces. Specifically, we propose Hybrid Parallel DESPOT (HyPDESPOT), a massively parallel online planning algorithm that integrates CPU and GPU parallelism in a multi-level scheme. It performs parallel DESPOT tree search by simultaneously traversing multiple independent paths using multi-core CPUs and performs parallel Monte-Carlo simulations at the leaf nodes of the search tree using GPUs. Experimental results show that HyPDESPOT speeds up online planning by up to several hundred times, compared with the original DESPOT algorithm, in several challenging robotic tasks in simulation.

PDF Code

Panpan Cai, Indhumathi Chandrasekaran, Jianmin Zheng, Yiyu Cai

January 2017 IEEE Transactions on Industrial Informatics

Automatic path planning for dual-crane lifting in complex environments using a prioritized multiobjective PGA

Cooperative dual-crane lifting is an challenging and critical task in industrial sites. In this paper, we aim to automatically generate optimized dual-crane lifting paths under highly complex constraints, i.e., collision avoidance, coordination between the two cranes, and balance of the lifting target. We propose a mathematical modeling of the cooperative lifting system. Based on the formulation, we devleop a massively parallel solver based on a multi-objective Genetic Algorithm to compute highly-optimized lifting trajectories that satisfy continous collision-avoidance, coordination, and load-balancing constraints in complex industrial envirnoments. Our results show that the planner generate lifting paths that are safe, efficient, and easy for conduction for any complex environments.

PDF

Recent service

New T-RO paper

Best paper finalist at CoRL 2022

Presented at ICRA 2022

WIPL 2021

Presented at RSS 2021

Past lecturing

Panpan Cai

Associate Professor

Shanghai Jiao Tong University (SJTU)

Biography

Interests

Education

Experience

Associate Professor

Shanghai Jiao Tong University

Senior Research fellow

National University of Singapore

Research fellow

National University of Singapore

PhD

Nanyang Technological University

Undergraduate student

Zhejiang University

Selected Publications

Talks

Publications

Contact