4.7 Article

Deep Drone Racing: From Simulation to Reality With Domain Randomization

期刊

IEEE TRANSACTIONS ON ROBOTICS
卷 36, 期 1, 页码 1-14

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TRO.2019.2942989

关键词

Drones; Navigation; Trajectory; State estimation; Training; Robot sensing systems; Drone racing; learning agile flight; learning for control

类别

资金

  1. Intel Network on Intelligent Systems
  2. Swiss National Center of Competence Research Robotics, through the Swiss National Science Foundation
  3. Swiss National Science Foundation European Research Council Starting Grant

向作者/读者索取更多资源

Dynamically changing environments, unreliable state estimation, and operation under severe resource constraints are fundamental challenges that limit the deployment of small autonomous drones. We address these challenges in the context of autonomous, vision-based drone racing in dynamic environments. A racing drone must traverse a track with possibly moving gates at high speed. We enable this functionality by combining the performance of a state-of-the-art planning and control system with the perceptual awareness of a convolutional neural network. The resulting modular system is both platform independent and domain independent: it is trained in simulation and deployed on a physical quadrotor without any fine-tuning. The abundance of simulated data, generated via domain randomization, makes our system robust to changes of illumination and gate appearance. To the best of our knowledge, our approach is the first to demonstrate zero-shot sim-to-real transfer on the task of agile drone flight. We extensively test the precision and robustness of our system, both in simulation and on a physical platform, and show significant improvements over the state of the art.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Computer Science, Artificial Intelligence

AlphaPilot: autonomous drone racing

Philipp Foehn, Dario Brescianini, Elia Kaufmann, Titus Cieslewski, Mathias Gehrig, Manasi Muglikar, Davide Scaramuzza

Summary: This paper presents a novel system for autonomous, vision-based drone racing, achieving second place at the 2019 AlphaPilot Challenge by combining learned data abstraction, nonlinear filtering, and time-optimal trajectory planning.

AUTONOMOUS ROBOTS (2022)

Article Robotics

Bridging the Gap Between Events and Frames Through Unsupervised Domain Adaptation

Nico Messikommer, Daniel Gehrig, Mathias Gehrig, Davide Scaramuzza

Summary: Reliable perception during fast motion maneuvers or in high dynamic range environments is crucial for robotic systems. Event cameras, due to their robustness to challenging conditions, have great potential to increase the reliability of robot vision. However, the scarcity of labeled datasets has hindered the development of event-based vision. To address this issue, the proposed task transfer method trains models directly with labeled images and unlabeled event data.

IEEE ROBOTICS AND AUTOMATION LETTERS (2022)

Article Robotics

Performance, Precision, and Payloads: Adaptive Nonlinear MPC for Quadrotors

Drew Hanover, Philipp Foehn, Sihao Sun, Elia Kaufmann, Davide Scaramuzza

Summary: This paper proposes a novel hybrid adaptive nonlinear model predictive control (NMPC) method called L-1-NMPC, which learns and compensates for model uncertainties online to improve performance of quadrotor flight in challenging environments. The method demonstrates high flexibility and robustness, achieving significant tracking error reduction and improved adaptability in the presence of complex aerodynamic effects and unknown disturbances.

IEEE ROBOTICS AND AUTOMATION LETTERS (2022)

Article Robotics

Continuous-Time Vs. Discrete-Time Vision-Based SLAM: A Comparative Study

Giovanni Cioffi, Titus Cieslewski, Davide Scaramuzza

Summary: Robotic practitioners generally approach the vision-based SLAM problem through discrete-time formulations, but these formulations require tailored algorithms and simplifying assumptions in the presence of high-rate and/or asynchronous measurements. On the other hand, continuous-time SLAM allows the fusion of multiple sensor modalities in an intuitive fashion, but may result in worse trajectory estimates.

IEEE ROBOTICS AND AUTOMATION LETTERS (2022)

Article Robotics

Range, Endurance, and Optimal Speed Estimates for Multicopters

Leonard Bauersfeld, Davide Scaramuzza

Summary: This article introduces the importance of multicopters in various application areas, and proposes a method for estimating the range, endurance, and optimal speed of multicopters. The accuracy and feasibility of this method are validated through experiments and flights. The article also provides a pen-and-paper algorithm to assist future researchers in building drones with maximum range and endurance.

IEEE ROBOTICS AND AUTOMATION LETTERS (2022)

Article Robotics

Nonlinear MPC for Quadrotor Fault-Tolerant Control

Fang Nan, Sihao Sun, Philipp Foehn, Davide Scaramuzza

Summary: This study proposes a fault-tolerant controller using nonlinear model predictive control (NMPC) to stabilize and control a quadrotor in the event of complete failure of a single rotor. Unlike existing methods, this approach considers the full nonlinear dynamics of the damaged quadrotor and the thrust constraint of each rotor.

IEEE ROBOTICS AND AUTOMATION LETTERS (2022)

Article Robotics

Policy Search for Model Predictive Control With Application to Agile Drone Flight

Yunlong Song, Davide Scaramuzza

Summary: A novel framework of policy search for model predictive control is proposed in the study, utilizing policy search to automatically select high-level decision variables for MPC. The formulation of a parameterized controller allows optimizing policies in a self-supervised manner.

IEEE TRANSACTIONS ON ROBOTICS (2022)

Article Robotics

Hilti-Oxford Dataset: A Millimeter-Accurate Benchmark for Simultaneous Localization and Mapping

Lintong Zhang, Michael Helmberger, Lanke Frank Tarimo Fu, David Wisth, Marco Camurri, Davide Scaramuzza, Maurice Fallon

Summary: To drive the advancement of SLAM systems, we created the Hilti-Oxford Dataset, which includes various challenges to test the performance of SLAM algorithms in different scenarios. We implemented a novel ground truth collection method to accurately measure pose errors with millimeter accuracy. The dataset attracted a large number of researchers to participate in the Hilti SLAM challenge.

IEEE ROBOTICS AND AUTOMATION LETTERS (2023)

Article Robotics

Real-Time Neural MPC: Deep Learning Model Predictive Control for Quadrotors and Agile Robotic Platforms

Tim Salzmann, Elia Kaufmann, Jon Arrizabalaga, Marco Pavone, Davide Scaramuzza, Markus Ryll

Summary: Model Predictive Control (MPC) is a popular framework in embedded control for high-performance autonomous systems. However, the use of accurate dynamics models is crucial for achieving good control performance. This study presents Real-time Neural MPC, a framework that efficiently integrates large neural network architectures as dynamics models within a model-predictive control pipeline. The experiments conducted in simulation and on a quadrotor platform show the capabilities of the system to run learned models with large modeling capacity using gradient-based online optimization MPC, resulting in significant improvements compared to state-of-the-art MPC approaches.

IEEE ROBOTICS AND AUTOMATION LETTERS (2023)

Article Robotics

Learned Inertial Odometry for Autonomous Drone Racing

Giovanni Cioffi, Leonard Bauersfeld, Elia Kaufmann, Davide Scaramuzza

Summary: Inertial odometry is a promising solution for state estimation in agile quadrotor flight. However, relying solely on inertial measurements leads to drift in pose estimates. This study proposes a learning-based odometry algorithm using an IMU as the only sensor for autonomous drone racing. The algorithm combines a model-based filter driven by inertial measurements with a learning-based module that uses thrust measurements. The results show the superiority of this inertial odometry algorithm in pose estimation compared to other visual-inertial odometry methods, indicating its potential in agile quadrotor flight.

IEEE ROBOTICS AND AUTOMATION LETTERS (2023)

Proceedings Paper Automation & Control Systems

COVERED, CollabOratiVE Robot Environment Dataset for 3D Semantic segmentation

Charith Munasinghe, Fatemeh Mohammadi Amin, Davide Scaramuzza, Hans Wernher van de Venn

Summary: Safe human-robot collaboration is crucial in the emerging Industry 5.0 paradigm, where conventional robots are being replaced by more intelligent and flexible collaborative robots. However, the lack of research and dedicated datasets for 3D semantic segmentation of collaborative robot workspaces hinders safe and efficient collaboration. This work addresses this limitation by developing a new dataset named COVERED and benchmarking state-of-the-art algorithm performance for real-time semantic segmentation.

2022 IEEE 27TH INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION (ETFA) (2022)

Proceedings Paper Automation & Control Systems

Improving safety in physical human-robot collaboration via deep metric learning

Maryam Rezayati, Grammatiki Zanni, Ying Zaoshi, Davide Scaramuzza, Hans Wernher van de Venn

Summary: Direct physical interaction with robots is important in flexible production scenarios, but poses risks to operators. Simple measures can prevent injuries, but hinder true human-robot cooperation. More sophisticated solutions are needed in human-robot collaboration scenarios.

2022 IEEE 27TH INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION (ETFA) (2022)

Proceedings Paper Computer Science, Artificial Intelligence

ESS: Learning Event-Based Semantic Segmentation from Still Images

Zhaoning Sun, Nico Messikommer, Daniel Gehrig, Davide Scaramuzza

Summary: This research introduces ESS (Event-based Semantic Segmentation), a method that transfers the semantic segmentation task from labeled image datasets to unlabeled events through unsupervised domain adaptation (UDA). Compared to existing methods, our approach aligns event embeddings with image embeddings without the need for video data or per-pixel alignment between images and events, and without the need to infer motion from still images. We also introduce DSEC-Semantic, a large-scale event-based dataset with fine-grained labels. Experimental results show that ESS outperforms existing UDA approaches using image labels alone and even surpasses state-of-the-art supervised approaches when combined with event labels in both DDD17 and DSEC-Semantic datasets. ESS is a general-purpose method that opens up new and previously inaccessible research directions for event cameras.

COMPUTER VISION, ECCV 2022, PT XXXIV (2022)

Proceedings Paper Computer Science, Artificial Intelligence

Time Lens plus plus : Event-based Frame Interpolation with Parametric Non-linear Flow and Multi-scale Fusion

Stepan Tulyakov, Alfredo Bochicchio, Daniel Gehrig, Stamatios Georgoulis, Yuanyou Li, Davide Scaramuzza

Summary: This study addresses several issues in video frame interpolation, including unstable fusion process of complementary interpolation results, inefficient motion estimation process, and potential artifacts in low contrast regions. Additionally, a large-scale dataset with challenging scenes is constructed, and the reconstruction quality is improved through multi-scale feature-level fusion and one-shot non-linear inter-frame motion computation.

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022) (2022)

暂无数据