Robust and efficient automated machine learning: systems, infrastructure and advances in hyperparameter optimization

to related objects

Abstract: Automated Machine Learning (AutoML) is a new paradigm that democratizes machine learning and that will enable its widespread use. AutoML systems provide hands-free machine learning and automatically search for the best models, tune their hyperparameters, and ensemble them. Hyperparameter optimization (HPO) is a core part of AutoML research and also a research field on its own as models and datasets increase in size and complexity. However, current AutoML systems and HPO methods are resource-hungry and not yet robust enough for all the settings in which we would like to deploy them.

While machine learning automates programming by learning programs from data, AutoML goes one step further and allows us to leave such tasks entirely to the computer. It reduces the burden on the human expert and automates many steps required to build well-performing machine learning models. We are in dire need of such systems because of the increased usage of machine learning and the simultaneous shortage of machine learning practitioners and experts.

In this thesis, we make three contributions to increase the accessibility of machine learning by developing efficient and robust AutoML methods.

First, we address the problem of optimizing hyperparameters efficiently. Concretely, we survey the field of HPO, focusing on so-called multi-fidelity methods that efficiently optimize expensive machine learning algorithms by using cheaper approximations. Then, we develop a new method for transfer HPO, i.e., an HPO method that leverages knowledge gained on previous optimization tasks. It features strong empirical performance, worst-case performance guarantees, and is hyperparameter-free.

Second, we introduce our work on the efficient and robust AutoML system Auto-sklearn and its successor Auto-sklearn 2.0. We start by proposing Auto-sklearn, an extensible AutoML system that constructs linear machine learning pipelines from 15 classifiers, 16 preprocessing methods, and four data cleaning methods from scikit-learn. To improve over previous AutoML systems, we propose meta-learning and ensembling.

We extend on this with Auto-sklearn 2.0, in which we improve the meta-learning component and add new options to make Auto-sklearn suitable for a broader number of use cases. However, this opens up an HPO problem on the AutoML system level, and we introduce a meta-level meta-learning approach that adapts the AutoML system itself to the task at hand. Moreover, we describe how we used Auto-sklearn to win the 1st and 2nd ChaLearn AutoML competition.

Third, we enable AutoML research by contributing to the OpenML platform. Concretely, we create the OpenML-Python API, which gives us access to the OpenML platform with all its datasets. Then, to simplify access to datasets on OpenML and organize the thousands of datasets on OpenML, we develop OpenML benchmarking suites. These are curated datasets to standardize benchmarking practices in machine learning, and we have also used them in our works on HPO and AutoML.

Finally, we discuss how to improve the Auto-sklearn AutoML system further, pose several open questions to the field of AutoML to further increase the automation in machine learning practice and better understand the developed AutoML methods, and give suggestions on how to facilitate better benchmarking in AutoML using the OpenML platform.

In total, we demonstrate how machine learning can be made accessible by robust and efficient Automated Machine Learning and demonstrate substantial performance gains compared to previous AutoML systems

Location: Deutsche Nationalbibliothek Frankfurt am Main

Extent: Online-Ressource

Language: Englisch

Notes: Universität Freiburg, Dissertation, 2022

Keyword: Maschinelles Lernen
Künstliche Intelligenz
Maschinelles Lernen
Überwachtes Lernen
Benchmark
API
Python

Event: Veröffentlichung

(where): Freiburg

(who): Universität

(when): 2022

Creator: Feurer, Matthias

Contributor: Hutter, Frank
Vanschoren, Joaquin
Albert-Ludwigs-Universität Freiburg. Institut für Informatik
Maschinelles Lernen und Natürlichsprachliche Systeme, Professur Frank Hutter
Albert-Ludwigs-Universität Freiburg. Fakultät für Angewandte Wissenschaften

DOI: 10.6094/UNIFR/230516

URN: urn:nbn:de:bsz:25-freidok-2305168

Rights: Open Access; Der Zugriff auf das Objekt ist unbeschränkt möglich.

Last update: 15.08.2025, 7:34 AM CEST

Data provider

This object is provided by:
Deutsche Nationalbibliothek. If you have any questions about the object, please contact the data provider.

Show original at data provider

Associated

Feurer, Matthias
Hutter, Frank
Vanschoren, Joaquin
Albert-Ludwigs-Universität Freiburg. Institut für Informatik
Maschinelles Lernen und Natürlichsprachliche Systeme, Professur Frank Hutter
Albert-Ludwigs-Universität Freiburg. Fakultät für Angewandte Wissenschaften
Universität

Time of origin

2022

Other Objects (12)

Hyperparameter optimization

Streamlining hyperparameter optimization for radiation emulator training with automated Sherpa

ESCHR: a hyperparameter-randomized ensemble approach for robust clustering across diverse datasets

Efficient bayesian hyperparameter optimization

Bayesian Hyperparameter Optimization - Relational and Scalable Surrogate Models for Hyperparameter Optimization Across Problem Instances

TPUXtract: An Exhaustive Hyperparameter Extraction Framework

A hyperparameter study for quantum kernel methods

Be aware of overfitting by hyperparameter optimization!

Fast Bayesian hyperparameter optimization on large datasets

Aufsatzsammlung

Advances in agent based complex automated negotiations

Infrastructure-based localisation of automated coal mining equipment

Using sequential statistical tests for efficient hyperparameter tuning

Hyperparameter optimization

Streamlining hyperparameter optimization for radiation emulator training with automated Sherpa

ESCHR: a hyperparameter-randomized ensemble approach for robust clustering across diverse datasets

Efficient bayesian hyperparameter optimization

Bayesian Hyperparameter Optimization - Relational and Scalable Surrogate Models for Hyperparameter Optimization Across Problem Instances

TPUXtract: An Exhaustive Hyperparameter Extraction Framework

A hyperparameter study for quantum kernel methods

Be aware of overfitting by hyperparameter optimization!

Fast Bayesian hyperparameter optimization on large datasets

Aufsatzsammlung

Advances in agent based complex automated negotiations

Infrastructure-based localisation of automated coal mining equipment

Using sequential statistical tests for efficient hyperparameter tuning

Hyperparameter optimization

Streamlining hyperparameter optimization for radiation emulator training with automated Sherpa

ESCHR: a hyperparameter-randomized ensemble approach for robust clustering across diverse datasets

Efficient bayesian hyperparameter optimization

Bayesian Hyperparameter Optimization - Relational and Scalable Surrogate Models for Hyperparameter Optimization Across Problem Instances

TPUXtract: An Exhaustive Hyperparameter Extraction Framework

A hyperparameter study for quantum kernel methods

Be aware of overfitting by hyperparameter optimization!

Fast Bayesian hyperparameter optimization on large datasets

Aufsatzsammlung

Advances in agent based complex automated negotiations

Infrastructure-based localisation of automated coal mining equipment

Using sequential statistical tests for efficient hyperparameter tuning

Cultural heritage institutions wishing to register will find more information here.

Fields marked * need to be filled in.

Username*

Please enter your username

Email*

Please enter your email address

Please do not fill this field

First name

Last name

Password*

Please enter your password

Confirm password*

Please enter the same password

I have read the terms of use and the privacy policy for the collection of personal data and accept them. *

This field is required.

I would like to subscribe to the newsletter of the Deutsche Digitale Bibliothek. See newsletter subscription info.

Account created

Your "My DDB" account has been successfully created. Before you can log in to your account, you must click the confirmation link in the message we just sent to the email address you provided.

Robust and efficient automated machine learning: systems, infrastructure and advances in hyperparameter optimization

Object Details

Classification and Topics

Contributors, Places and Time

Further information

Data provider

Associated

Time of origin

Other Objects (12)

Hyperparameter optimization

Streamlining hyperparameter optimization for radiation emulator training with automated Sherpa

ESCHR: a hyperparameter-randomized ensemble approach for robust clustering across diverse datasets

Efficient bayesian hyperparameter optimization

Bayesian Hyperparameter Optimization - Relational and Scalable Surrogate Models for Hyperparameter Optimization Across Problem Instances

TPUXtract: An Exhaustive Hyperparameter Extraction Framework

A hyperparameter study for quantum kernel methods

Be aware of overfitting by hyperparameter optimization!

Fast Bayesian hyperparameter optimization on large datasets

Advances in agent based complex automated negotiations

Infrastructure-based localisation of automated coal mining equipment

Using sequential statistical tests for efficient hyperparameter tuning

Hyperparameter optimization

Streamlining hyperparameter optimization for radiation emulator training with automated Sherpa

ESCHR: a hyperparameter-randomized ensemble approach for robust clustering across diverse datasets

Efficient bayesian hyperparameter optimization

Bayesian Hyperparameter Optimization - Relational and Scalable Surrogate Models for Hyperparameter Optimization Across Problem Instances

TPUXtract: An Exhaustive Hyperparameter Extraction Framework

A hyperparameter study for quantum kernel methods

Be aware of overfitting by hyperparameter optimization!

Fast Bayesian hyperparameter optimization on large datasets

Advances in agent based complex automated negotiations

Infrastructure-based localisation of automated coal mining equipment

Using sequential statistical tests for efficient hyperparameter tuning

Hyperparameter optimization

Streamlining hyperparameter optimization for radiation emulator training with automated Sherpa

ESCHR: a hyperparameter-randomized ensemble approach for robust clustering across diverse datasets

Efficient bayesian hyperparameter optimization

Bayesian Hyperparameter Optimization - Relational and Scalable Surrogate Models for Hyperparameter Optimization Across Problem Instances

TPUXtract: An Exhaustive Hyperparameter Extraction Framework

A hyperparameter study for quantum kernel methods

Be aware of overfitting by hyperparameter optimization!

Fast Bayesian hyperparameter optimization on large datasets

Advances in agent based complex automated negotiations

Infrastructure-based localisation of automated coal mining equipment

Using sequential statistical tests for efficient hyperparameter tuning

Related objects

Reset password