4.6 Article Proceedings Paper

Distributed Wildfire Surveillance with Autonomous Aircraft Using Deep Reinforcement Learning

Journal

JOURNAL OF GUIDANCE CONTROL AND DYNAMICS
Volume 42, Issue 8, Pages 1768-1778

Publisher

AMER INST AERONAUTICS ASTRONAUTICS
DOI: 10.2514/1.G004106

Keywords

-

Funding

  1. National Science Foundation Graduate Research Fellowship [DGE1656518]

Ask authors/readers for more resources

Teams of autonomous unmanned aircraft can be used to monitor wildfires, enabling firefighters to make informed decisions. However, controlling multiple autonomous fixed-wing aircraft to maximize forest fire coverage is a complex problem. The state space is high dimensional, the fire propagates stochastically, the sensor information is imperfect, and the aircraft must coordinate with each other to accomplish their mission. This work presents two deep reinforcement learning approaches for training decentralized controllers that accommodate the high dimensionality and uncertainty inherent in the problem. The first approach controls the aircraft using immediate observations of the individual aircraft. The second approach allows aircraft to collaborate on a map of the wildfire's state and maintain a time history of locations visited, which are used as inputs to the controller. Simulation results show that both approaches allow the aircraft to accurately track wildfire expansions and outperform an online receding-horizon controller. Additional simulations demonstrate that the approach scales with different numbers of aircraft and generalizes to different wildfire shapes.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Computer Science, Theory & Methods

Reluplex: a calculus for reasoning about deep neural networks

Guy Katz, Clark Barrett, David L. Dill, Kyle Julian, Mykel J. Kochenderfer

Summary: Deep neural networks are widely used for solving complex real-world problems, but providing formal guarantees for their behavior in safety-critical systems is challenging. Researchers have developed a novel technique based on the simplex method to verify properties of deep neural networks efficiently and at scale.

FORMAL METHODS IN SYSTEM DESIGN (2022)

Article Computer Science, Artificial Intelligence

Dynamic multi-robot task allocation under uncertainty and temporal constraints

Shushman Choudhury, Jayesh K. Gupta, Mykel J. Kochenderfer, Dorsa Sadigh, Jeannette Bohg

Summary: The study focuses on dynamically allocating tasks to multiple agents under time window constraints and task completion uncertainty, presenting a multi-agent allocation algorithm that excels in minimizing the number of unsuccessful tasks. The SCoBA algorithm effectively addresses key computational challenges through a hierarchical approach, demonstrating superior performance in practice and outperforming baseline methods in various metrics.

AUTONOMOUS ROBOTS (2022)

Article Computer Science, Artificial Intelligence

Generating probabilistic safety guarantees for neural network controllers

Sydney M. Katz, Kyle D. Julian, Christopher A. Strong, Mykel J. Kochenderfer

Summary: Neural networks are effective controllers in complex settings, but their difficult-to-verify outputs restrict their use in safety-critical applications. Recent research focuses on using formal methods to verify neural network outputs. This study proposes a method to provide probabilistic safety guarantees for neural network controllers using results from neural network verification tools.

MACHINE LEARNING (2023)

Article Computer Science, Artificial Intelligence

Global optimization of objective functions represented by ReLU networks

Christopher A. Strong, Haoze Wu, Aleksandar Zeljic, Kyle D. Julian, Guy Katz, Clark Barrett, Mykel J. Kochenderfer

Summary: Neural network failures in safety-critical contexts are difficult to guarantee, and verification algorithms can provide formal guarantees but cannot answer quantitative questions. This study proposes strategies to extend existing verifiers for optimization and finding extreme failures and minimum input perturbations. The proposed approach achieves better runtime performance than traditional methods and shows complementary performance with optimization-based verifiers.

MACHINE LEARNING (2023)

Review Ergonomics

A Review of Incident Prediction, Resource Allocation, and Dispatch Models for Emergency Management

Ayan Mukhopadhyay, Geoffrey Pettet, Sayyed Mohsen Vazirizade, Di Lu, Alejandro Jaimes, Said El Said, Hiba Baroud, Yevgeniy Vorobeychik, Mykel Kochenderfer, Abhishek Dubey

Summary: In the past fifty years, researchers have developed various statistical, data-driven, analytical, and algorithmic approaches for designing and improving emergency response management systems. This survey provides a detailed review of these approaches, focusing on the key challenges and issues in incident prediction, incident detection, resource allocation, and computer-aided dispatch. It highlights the strengths and weaknesses of prior work and explores the similarities and differences between different modeling paradigms. The survey concludes by discussing open challenges and opportunities for future research in this complex domain.

ACCIDENT ANALYSIS AND PREVENTION (2022)

Article Neurosciences

Towards assessing subcortical deep brain biomarkers of PTSD with functional near-infrared spectroscopy

Stephanie Balters, Marc R. Schlichting, Lara Foland-Ross, Sabrina Brigadoi, Jonas G. Miller, Mykel J. Kochenderfer, Amy S. Garrett, Allan L. Reiss

Summary: This study validates the feasibility of inferring activity in subcortical regions associated with posttraumatic stress disorder (PTSD) using cortical functional magnetic resonance imaging (fMRI) and simulated functional near-infrared spectroscopy (fNIRS) activity. Linear regression and neural network models provided the best prediction performance.

CEREBRAL CORTEX (2023)

Article Engineering, Multidisciplinary

Portfolio construction as linearly constrained separable optimization

Nicholas Moehle, Jack Gindi, Stephen Boyd, Mykel J. Kochenderfer

Summary: This paper presents a heuristic algorithm based on the ADMM method to solve separable nonconvex terms in mean-variance portfolio optimization problems and empirically demonstrates its effectiveness in tax-aware portfolio construction.

OPTIMIZATION AND ENGINEERING (2023)

Article Engineering, Civil

Modeling Human Driving Behavior Through Generative Adversarial Imitation Learning

Raunak Bhattacharyya, Blake Wulfe, Derek J. Phillips, Alex Kuefler, Jeremy Morton, Ransalu Senanayake, Mykel J. Kochenderfer

Summary: An open problem in autonomous vehicle safety validation is to build reliable models of human driving behavior in simulation. This work presents an approach to learn neural driving policies from real-world driving demonstration data. The approach uses imitation learning and Generative Adversarial Imitation Learning (GAIL) to effectively model human driving behavior.

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS (2023)

Proceedings Paper Automation & Control Systems

How Do We Fail? Stress Testing Perception in Autonomous Vehicles

Harrison Delecki, Masha Itkina, Bernard Lange, Ransalu Senanayake, Mykel J. Kochenderfer

Summary: This paper presents a method for characterizing failures of LiDAR-based perception systems for autonomous vehicles (AVs) in adverse weather conditions. By using reinforcement learning to introduce disturbances and simulate LiDAR point clouds in adverse weather conditions, the likely failures in object tracking and trajectory prediction are identified. Experimental results show that the proposed approach can find high likelihood failures with smaller input disturbances compared to baselines, while remaining computationally tractable. The identified failures can inform the future development of robust perception systems for AVs.

2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) (2022)

Proceedings Paper Automation & Control Systems

Dynamics-Aware Spatiotemporal Occupancy Prediction in Urban Environments

Maneekwan Toyungyernsub, Esen Yel, Jiachen Li, Mykel J. Kochenderfer

Summary: In this paper, a framework that integrates the detection and segmentation of moving obstacles with the prediction of future occupancy states of the local environment is proposed using deep neural network architectures. By utilizing occupancy-based environment representations, the problem of integrating static-dynamic object segmentation and environment prediction models directly is addressed. The method is validated on the real-world Waymo Open Dataset and exhibits higher prediction accuracy compared to baseline methods.

2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) (2022)

Proceedings Paper Automation & Control Systems

Multi-Objective Policy Gradients with Topological Constraints

Kyle Hollins Wray, Stas Tiomkin, Mykel J. Kochenderfer, Pieter Abbeel

Summary: This paper introduces a multi-objective optimization model that encodes ordered sequential constraints to solve various challenging problems. By extending topological Markov decision processes (TMDPs) to continuous spaces and unknown transition dynamics, the policy gradient theorem for TMDPs is formulated, proven, and implemented, enabling the creation of TMDP learning algorithms that can generalize existing deep reinforcement learning (DRL) approaches.

2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) (2022)

Proceedings Paper Automation & Control Systems

FIG-OP: Exploring Large-Scale Unknown Environments on a Fixed Time Budget

Oriana Peltzer, Amanda Bouman, Sung-Kyun Kim, Ransalu Senanayake, Joshua Ott, Harrison Delecki, Mamoru Sobue, Mykel J. Kochenderfer, Mac Schwager, Joel Burdick, Ali-akbar Agha-mohammadi

Summary: We present a method for autonomous exploration of large-scale unknown environments under time constraints. Our approach addresses model uncertainty by frontloading expected information gain, leading to improved coverage efficiency over traditional methods.

2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) (2022)

Proceedings Paper Automation & Control Systems

Adaptive Coverage Path Planning for Efficient Exploration of Unknown Environments

Amanda Bouman, Joshua Ott, Sung-Kyun Kim, Kenny Chen, Mykel J. Kochenderfer, Brett Lopez, Ali-akbar Agha-Mohammadi, Joel Burdick

Summary: This method presents a solution to the coverage problem, aiming to autonomously explore an unknown environment under mission time constraints. It formulates the problem as a tree-based sequential decision making process to evaluate the effects of the robot's actions on future coverage states, considering traversability risk and dynamic constraints. An effective approximation to the coverage sensor model is proposed to quickly find near-optimal solutions.

2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) (2022)

Article Engineering, Aerospace

Verification of Image-Based Neural Network Controllers Using Generative Models

Sydney M. Katz, Anthony L. Corso, Christopher A. Strong, Mykel J. Kochenderfer

Summary: This research proposes a method to enhance the safety of image-based neural network controllers by combining generative adversarial networks and control networks to transform the complex input space into a low-dimensional space, enabling the use of existing verification tools to provide formal guarantees on their performance.

JOURNAL OF AEROSPACE INFORMATION SYSTEMS (2022)

Article Computer Science, Artificial Intelligence

Scalable Online Planning for Multi-Agent MDPs

Shushman Choudhury, Jayesh K. Gupta, Peter Morales, Mykel J. Kochenderfer

Summary: We present a scalable tree search planning algorithm for large multi-agent sequential decision problems that require dynamic collaboration. Our algorithm allows for trading computation for approximation quality and dynamically coordinating actions.

JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH (2022)

No Data Available