4.5 Article

Pitfalls and Best Practices in Algorithm Configuration

Journal

JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH
Volume 64, Issue -, Pages 861-893

Publisher

AI ACCESS FOUNDATION
DOI: 10.1613/jair.1.11420

Keywords

-

Funding

  1. DFG (German Research Foundation) under Emmy Noether grant [HU 1900/2-1]
  2. State Graduate Funding Program of Baden-Wurttemberg

Ask authors/readers for more resources

Good parameter settings are crucial to achieve high performance in many areas of artificial intelligence (AI), such as propositional satisfiability solving, AI planning, scheduling, and machine learning (in particular deep learning). Automated algorithm con figuration methods have recently received much attention in the AI community since they replace tedious, irreproducible and error-prone manual parameter tuning and can lead to new state-of-the-art performance. However, practical applications of algorithm configuration are prone to several (often subtle) pitfalls in the experimental design that can render the procedure ineffective. We identify several common issues and propose best practices for avoiding them. As one possibility for automatically handling as many of these as possible, we also propose a tool called GenericWrapper4AC

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Computer Science, Artificial Intelligence

Winning Solutions and Post-Challenge Analyses of the ChaLearn AutoDL Challenge 2019

Zhengying Liu, Adrien Pavao, Zhen Xu, Sergio Escalera, Fabio Ferreira, Isabelle Guyon, Sirui Hong, Frank Hutter, Rongrong Ji, Julio C. S. Jacques, Ge Li, Marius Lindauer, Zhipeng Luo, Meysam Madadi, Thomas Nierhoff, Kangning Niu, Chunguang Pan, Danny Stoll, Sebastien Treguer, Jin Wang, Peng Wang, Chenglin Wu, Youcheng Xiong, Arber Zela, Yang Zhang

Summary: This paper reports the results and post-challenge analyses of ChaLearn's AutoDL challenge series, where deep learning methods dominated in the setting that pushed for quick results through code submissions on hidden tasks. The study found that popular Neural Architecture Search (NAS) was impractical in this context.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2021)

Article Computer Science, Artificial Intelligence

Auto-Pytorch: Multi-Fidelity MetaLearning for Efficient and Robust AutoDL

Lucas Zimmer, Marius Lindauer, Frank Hutter

Summary: Auto-PyTorch combines neural architecture search and hyperparameter optimization to enable fully automated deep learning, achieving state-of-the-art performance on tabular benchmarks. Additionally, a new benchmark on DNN learning curves called LCBench is introduced, along with extensive ablation studies of Auto-PyTorch on typical AutoML benchmarks.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2021)

Article Computer Science, Artificial Intelligence

reinforcement learning, autonomous agents, neural networks

Jack Parker-Holder, Raghu Rajan, Xingyou Song, Andre Biedenkapp, Yingjie Miao, Theresa Eimer, Baohe Zhang, Vu Nguyen, Roberto Calandra, Aleksandra Faust, Frank Hutter, Marius Lindauer

Summary: The combination of Reinforcement Learning (RL) with deep learning has led to impressive achievements, but the success of RL agents is sensitive to design choices and manual tuning. AutoML has shown promise in automating design choices, and AutoRL is emerging as an important research area. This survey aims to unify the field of AutoRL, provide a common taxonomy, and discuss open problems for future researchers.

JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH (2022)

Proceedings Paper Automation & Control Systems

T3VIP: Transformation-based 3D Video Prediction

Iman Nematollahi, Erick Rosete-Beas, Seyed Mahdi B. Azad, Raghu Rajan, Frank Hutter, Wolfram Burgard

Summary: To achieve autonomous skill acquisition, a transformation-based 3D video prediction (T3VIP) approach is proposed, which learns the physical rules governing the 3D world dynamics and is able to predict and reason about future outcomes. The model captures observational cues from image and point cloud domains, and incorporates automatic hyperparameter optimization to leverage the 2D and 3D observational signals. The model produces interpretable 3D models for predicting future depth videos and outperforms 2D baselines in RGB video prediction and visuomotor control.

2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) (2022)

Article Computer Science, Artificial Intelligence

Automated Dynamic Algorithm Configuration

Steven Adriaensen, Andre Biedenkapp, Gresa Shala, Noor Awad, Theresa Eimer, Marius Lindauer, Frank Flutter

Summary: The performance of algorithms often relies on their parameter configuration. Automated algorithm configuration methods can alleviate the task of manually tuning parameters, but the learned configuration remains static. A promising approach is to automatically learn dynamic parameter adaptation policies from data. This article provides a comprehensive account of this new field of automated dynamic algorithm configuration, presenting recent advances and laying the foundation for future research.

JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH (2022)

Proceedings Paper Computer Science, Cybernetics

Theory-inspired Parameter Control Benchmarks for Dynamic Algorithm Configuration

Andre Biedenkapp, Nguyen Dang, Martin S. Krejca, Frank Hutter, Carola Doerr

Summary: The performance of evolutionary algorithms and other randomized search heuristics can be improved by choosing non-static parameters. However, we still lack understanding of the best approaches for dynamic parameter setting. This study extends a benchmark with known control policies and demonstrates its usefulness in analyzing the behavior of a reinforcement learning approach for dynamic algorithm configuration.

PROCEEDINGS OF THE 2022 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'22) (2022)

Article Automation & Control Systems

SMAC3: A Versatile Bayesian Optimization Package for Hyperparameter Optimization

Marius Lindauer, Katharina Eggensperger, Matthias Feurer, Andre Biedenkapp, Difan Deng, Carolin Benjamins, Tim Ruhkopf, Rene Sass, Frank Hutter

Summary: Algorithm parameters, especially hyperparameters, play a crucial role in the performance of machine learning algorithms. SMAC3 offers a versatile Bayesian Optimization framework to help users determine optimal hyperparameter configurations. It can significantly improve performance with just a few evaluations and is suitable for various use cases.

JOURNAL OF MACHINE LEARNING RESEARCH (2022)

Proceedings Paper Computer Science, Artificial Intelligence

TrivialAugment: Tuning-free Yet State-of-the-Art Data Augmentation

Samuel G. Mueller, Frank Hutter

Summary: Automatic augmentation methods have become crucial for strong model performance in vision tasks. TrivialAugment, a simple baseline without parameters, outperforms previous methods almost for free. Through experiments and ablation studies, the key requirements for TrivialAugment's performance are revealed, providing a simple interface for widespread adoption and proposing best practices for future progress.

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021) (2021)

Proceedings Paper Computer Science, Artificial Intelligence

Bayesian Optimization with a Prior for the Optimum

Artur Souza, Luigi Nardi, Leonardo B. Oliveira, Kunle Olukotun, Marius Lindauer, Frank Hutter

Summary: BOPrO is a Bayesian Optimization method that allows users to inject their knowledge about the input space to improve optimization performance. It outperforms state-of-the-art methods in terms of speed and achieves new performance levels in real-world applications.

MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2021: RESEARCH TRACK, PT III (2021)

Proceedings Paper Computer Science, Artificial Intelligence

TempoRL: Learning When to Act

Andre Biedenkapp, Raghu Rajan, Frank Hutter, Marius Lindauer

Summary: Reinforcement learning is a powerful method of learning behavior through interaction with the environment, but traditional methods may struggle to make decisions proactively. The TempoRL approach introduces skip connections between states and skip-policy learning for repeated actions, showing significant acceleration in learning successful policies compared to vanilla Q-learning.

INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139 (2021)

Proceedings Paper Computer Science, Artificial Intelligence

On the Importance of Hyperparameter Optimization for Model-based Reinforcement Learning

Baohe Zhang, Raghu Rajan, Luis Pineda, Nathan Lambert, Andre Biedenkapp, Kurtland Chua, Frank Hutter, Roberto Calandra

Summary: Model-based Reinforcement Learning (MBRL) is a promising framework for learning control efficiently. Automatic hyperparameter optimization (HPO) can significantly improve performance, and dynamically tuning multiple MBRL hyperparameters during training further enhances performance. These insights contribute to understanding the effects of various hyperparameters on training stability and rewards.

24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS) (2021)

Proceedings Paper Computer Science, Artificial Intelligence

Dynamic Algorithm Configuration: Foundation of a New Meta-Algorithmic Framework

Andre Biedenkapp, H. Furkan Bozkurt, Theresa Eimer, Frank Hutter, Marius Lindauer

ECAI 2020: 24TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE (2020)

No Data Available