Maintained by Steven Adriaensen.
The following list collects papers related to dynamic algorithm configuration. It is by no means complete. If you notice a paper missing from the list, please let me know.
Please note that dynamic configuration has been studied in many different communities (under many different names), and each community has developed a slightly different focus and evaluation criteria. Our criteria for maintaining this literature list are as follows:
- Does the presented work change (hyper-)parameters on the fly (i.e., during the run of a target algorithm)?
- Is this done in an automated fashion (e.g., via a learned update policy)?
- Does it have a meta-learning component (i.e., can the learned configuration policies be transferred to problems they have not been trained on)?
2023
Sabbioni, Luca; Corda, Francesco; Restelli, Marcello
Stepsize Learning for Policy Gradient Methods in Contextual Markov Decision Processes Unpublished
2023.
@unpublished{sabbioni-arxiv23a,
title = {Stepsize Learning for Policy Gradient Methods in Contextual Markov Decision Processes},
author = {Luca Sabbioni and Francesco Corda and Marcello Restelli},
url = {https://arxiv.org/abs/2306.07741},
year = {2023},
date = {2023-06-13},
abstract = {Policy-based algorithms are among the most widely adopted techniques in model-free RL, thanks to their strong theoretical groundings and good properties in continuous action spaces. Unfortunately, these methods require precise and problem-specific hyperparameter tuning to achieve good performance, and tend to struggle when asked to accomplish a series of heterogeneous tasks. In particular, the selection of the step size has a crucial impact on their ability to learn a highly performing policy, affecting the speed and the stability of the training process, and often being the main culprit for poor results. In this paper, we tackle these issues with a Meta Reinforcement Learning approach, by introducing a new formulation, known as meta-MDP, that can be used to solve any hyperparameter selection problem in RL with contextual processes. After providing a theoretical Lipschitz bound to the difference of performance in different tasks, we adopt the proposed framework to train a batch RL algorithm to dynamically recommend the most adequate step size for different policies and tasks. In conclusion, we present an experimental campaign to show the advantages of selecting an adaptive learning rate in heterogeneous environments.},
keywords = {},
pubstate = {published},
tppubtype = {unpublished}
}
Chen, Deyao; Buzdalov, Maxim; Doerr, Carola; Dang, Nguyen
Using Automated Algorithm Configuration for Parameter Control Conference
Proceedings of the 17th ACM/SIGEVO Conference on Foundations of Genetic Algorithms, 2023.
@conference{chen2023using,
title = {Using Automated Algorithm Configuration for Parameter Control},
author = {Deyao Chen and Maxim Buzdalov and Carola Doerr and Nguyen Dang},
url = {https://arxiv.org/abs/2302.12334},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
booktitle = {Proceedings of the 17th ACM/SIGEVO Conference on Foundations of Genetic Algorithms},
journal = {arXiv preprint arXiv:2302.12334},
pages = {38-49},
abstract = {Dynamic Algorithm Configuration (DAC) tackles the question of how to automatically learn policies to control parameters of algorithms in a data-driven fashion. This question has received considerable attention from the evolutionary community in recent years. Having a good benchmark collection to gain structural understanding on the effectiveness and limitations of different solution methods for DAC is therefore strongly desirable. Following recent work on proposing DAC benchmarks with well-understood theoretical properties and ground truth information, in this work, we suggest as a new DAC benchmark the controlling of the key parameter λ in the (1+(λ,λ)) Genetic Algorithm for solving OneMax problems. We conduct a study on how to solve the DAC problem via the use of (static) automated algorithm configuration on the benchmark, and propose techniques to significantly improve the performance of the approach. Our approach is able to consistently outperform the default parameter control policy of the benchmark derived from previous theoretical work on sufficiently large problem sizes. We also present new findings on the landscape of the parameter-control search policies and propose methods to compute stronger baselines for the benchmark via numerical approximations of the true optimal policies.},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}
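The entry above turns a static configurator into a parameter-control method by tuning the few numbers that define a fitness-dependent policy. A minimal sketch of that idea, assuming a simple fitness-binned policy and a simplified (1+1)-style EA on OneMax as stand-ins (the paper controls λ in the (1+(λ,λ)) GA; the names, binning, and loop below are illustrative only):

```python
import random

N = 100  # OneMax problem size (illustrative)

def onemax(bits):
    return sum(bits)

# A fitness-dependent control policy represented by a handful of static numbers:
# one parameter value per fitness bin. Tuning these numbers with a *static*
# configurator (irace, SMAC, ...) yields a *dynamic* control policy.
def policy(fitness, bin_values):
    bin_idx = min(int(fitness / N * len(bin_values)), len(bin_values) - 1)
    return bin_values[bin_idx]

def run_ea(bin_values, max_evals=10_000):
    """Simplified (1+1)-style EA whose mutation strength follows the policy.
    (The paper controls lambda in the (1+(lambda,lambda)) GA; this loop is
    only a stand-in to show how the tuned numbers act as a policy.)"""
    x = [random.randint(0, 1) for _ in range(N)]
    evals = 0
    while onemax(x) < N and evals < max_evals:
        strength = policy(onemax(x), bin_values)
        y = [1 - b if random.random() < strength / N else b for b in x]
        evals += 1
        if onemax(y) >= onemax(x):
            x = y
    return evals

# Evaluate one candidate policy; the static configurator would search over these values.
print(run_ea(bin_values=[4, 3, 2, 1]))
```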
2022
Adriaensen, Steven; Biedenkapp, André; Shala, Gresa; Awad, Noor; Eimer, Theresa; Lindauer, Marius; Hutter, Frank
Automated Dynamic Algorithm Configuration Journal Article
In: Journal of Artificial Intelligence Research (JAIR), vol. 75, pp. 1633-1699, 2022.
@article{adriaens-arxiv22a,
title = {Automated Dynamic Algorithm Configuration},
author = {Steven Adriaensen and André Biedenkapp and Gresa Shala and Noor Awad and Theresa Eimer and Marius Lindauer and Frank Hutter},
url = {https://doi.org/10.1613/jair.1.13922
https://github.com/automl/2022_JAIR_DAC_experiments, code},
year = {2022},
date = {2022-12-30},
urldate = {2022-05-30},
journal = {Journal of Artificial Intelligence Research (JAIR)},
volume = {75},
pages = {1633-1699},
abstract = {The performance of an algorithm often critically depends on its parameter configuration. While a variety of automated algorithm configuration methods have been proposed to relieve users from the tedious and error-prone task of manually tuning parameters, there is still a lot of untapped potential as the learned configuration is static, i.e., parameter settings remain fixed throughout the run. However, it has been shown that some algorithm parameters are best adjusted dynamically during execution. Thus far, this is most commonly achieved through hand-crafted heuristics. A promising recent alternative is to automatically learn such dynamic parameter adaptation policies from data. In this article, we give the first comprehensive account of this new field of automated dynamic algorithm configuration (DAC), present a series of recent advances, and provide a solid foundation for future research in this field. Specifically, we (i) situate DAC in the broader historical context of AI research; (ii) formalize DAC as a computational problem; (iii) identify the methods used in prior art to tackle this problem; and (iv) conduct empirical case studies for using DAC in evolutionary optimization, AI planning, and machine learning.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
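The article formalizes DAC as learning a single configuration policy for a contextual MDP: at every step of the target algorithm's run, the policy observes state features and the instance (context) and sets the parameters for the next step. A self-contained sketch of this interaction loop, assuming a toy target algorithm (gradient descent on f(x) = x²) and a hand-crafted policy purely for illustration:

```python
import random

# Toy "target algorithm": gradient descent on f(x) = x^2, whose only
# dynamic parameter is the step size chosen anew at every iteration.
def run_target_algorithm(policy, instance, steps=50):
    x = instance                      # the instance (context) is the starting point
    for t in range(steps):
        state = (t, abs(x))           # state features exposed to the DAC policy
        step_size = policy(state, instance)
        x = x - step_size * 2 * x     # one gradient step on f(x) = x^2
    return abs(x)                     # final error = cost of this run

# A hand-crafted dynamic policy: large steps early, small steps late.
def decaying_policy(state, instance):
    t, _ = state
    return 0.5 / (1 + t)

# DAC evaluates one policy across a whole set of instances (contexts).
instances = [random.uniform(-10.0, 10.0) for _ in range(20)]
costs = [run_target_algorithm(decaying_policy, inst) for inst in instances]
print(f"mean final error: {sum(costs) / len(costs):.4f}")
```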
Xue, Ke; Xu, Jiacheng; Yuan, Lei; Li, Miqing; Qian, Chao; Zhang, Zongzhang; Yu, Yang
Multi-agent Dynamic Algorithm Configuration Proceedings Article
In: Proceedings of the 36th International Conference on Advances in Neural Information Processing Systems (NeurIPS'22), 2022.
@inproceedings{xue-neurips22a,
title = {Multi-agent Dynamic Algorithm Configuration},
author = {Ke Xue and Jiacheng Xu and Lei Yuan and Miqing Li and Chao Qian and Zongzhang Zhang and Yang Yu},
url = {https://arxiv.org/abs/2210.06835v1, arxiv},
year = {2022},
date = {2022-11-28},
urldate = {2022-11-28},
booktitle = {Proceedings of the 36th International Conference on Advances in Neural Information Processing Systems (NeurIPS'22)},
abstract = {Automated algorithm configuration relieves users from tedious, trial-and-error tuning tasks. A popular algorithm configuration tuning paradigm is dynamic algorithm configuration (DAC), in which an agent learns dynamic configuration policies across instances by reinforcement learning (RL). However, in many complex algorithms, there may exist different types of configuration hyperparameters, and such heterogeneity may bring difficulties for classic DAC which uses a single-agent RL policy. In this paper, we aim to address this issue and propose multi-agent DAC (MA-DAC), with one agent working for one type of configuration hyperparameter. MA-DAC formulates the dynamic configuration of a complex algorithm with multiple types of hyperparameters as a contextual multi-agent Markov decision process and solves it by a cooperative multi-agent RL (MARL) algorithm. To instantiate, we apply MA-DAC to a well-known optimization algorithm for multi-objective optimization problems. Experimental results show the effectiveness of MA-DAC in not only achieving superior performance compared with other configuration tuning approaches based on heuristic rules, multi-armed bandits, and single-agent RL, but also being capable of generalizing to different problem classes. Furthermore, we release the environments in this paper as a benchmark for testing MARL algorithms, with the hope of facilitating the application of MARL.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Biedenkapp, André
Dynamic Algorithm Configuration by Reinforcement Learning PhD Thesis
2022.
@phdthesis{biedenkapp22,
title = {Dynamic Algorithm Configuration by Reinforcement Learning},
author = {André Biedenkapp},
url = {https://freidok.uni-freiburg.de/data/230869, UniFreiburg
https://ml.informatik.uni-freiburg.de/wp-content/uploads/2022/11/2022_Dissertation_Andre_Biedenkapp.pdf, pdf},
year = {2022},
date = {2022-10-14},
abstract = {The performance of algorithms, be it in the domain of machine learning, hard combinatorial problem solving or AI in general depends on their many parameters. Tuning an algorithm manually, however, is error-prone and very time-consuming. Many, if not most, algorithms are iterative in nature. Thus, they traverse a potentially diverse solution space, which might require different parameter settings at different stages to behave optimally. Further, algorithms are often used for solving a diverse set of problem instances, which by themselves might require different parameters. Taking all of this into account is infeasible for a human designer. Automated methods have therefore been proposed to mitigate human errors and minimize manual efforts. While such meta-algorithmic methods have shown large successes, there is still a lot of untapped potentials as prior approaches typically only consider configurations that do not change during an algorithm’s run or do not adapt to the problem instance.
In this dissertation, we present the first framework that is capable of dynamically configuring algorithms, in other words, capable of adapting configurations to the problem instance at hand during an algorithm’s solving process. To this end, we model the dynamic algorithm configuration (DAC) problem as a contextual Markov decision process. This enables us to learn dynamic configuration policies in a data-driven way by means of reinforcement learning.
We empirically demonstrate the effectiveness of our framework on a diverse set of problem settings consisting of artificial benchmarks, evolutionary algorithms, AI planning systems, as well as deep learning. We show that DAC outperforms previous meta-algorithmic approaches. Building on these successes, we formulate the first standardized interface for dynamic configuration and an extensive benchmark to facilitate reproducibility and lower the barrier of entry for new researchers into this novel research field. Lastly, our work on DAC feeds back into the reinforcement learning paradigm. Through the lens of DAC, we identify shortcomings in current state-of-the-art approaches and demonstrate how to solve these. In particular, intending to learn general policies for DAC, our work pushes the boundaries of generalization in reinforcement learning. We demonstrate how to efficiently incorporate domain knowledge when training general agents and propose to move from a reactive way of doing reinforcement learning to a proactive way by learning when to make new decisions.},
howpublished = {https://freidok.uni-freiburg.de/data/230869},
keywords = {},
pubstate = {published},
tppubtype = {phdthesis}
}
Tessari, Michele; Iacca, Giovanni
Reinforcement learning based adaptive metaheuristics Workshop
Genetic and Evolutionary Computation Conference (GECCO) 2022, Companion Proceedings, 2022.
@workshop{tessari-geccocompanion22a,
title = {Reinforcement learning based adaptive metaheuristics},
author = {Michele Tessari and Giovanni Iacca},
url = {https://arxiv.org/abs/2206.12233, arxiv},
year = {2022},
date = {2022-07-10},
urldate = {2022-07-10},
booktitle = {Genetic and Evolutionary Computation Conference (GECCO) 2022, Companion Proceedings},
abstract = {Parameter adaptation, that is the capability to automatically adjust an algorithm's hyperparameters depending on the problem being faced, is one of the main trends in evolutionary computation applied to numerical optimization. While several handcrafted adaptation policies have been proposed over the years to address this problem, only few attempts have been done so far at applying machine learning to learn such policies. Here, we introduce a general-purpose framework for performing parameter adaptation in continuous-domain metaheuristics based on state-of-the-art reinforcement learning algorithms. We demonstrate the applicability of this framework on two algorithms, namely Covariance Matrix Adaptation Evolution Strategies (CMA-ES) and Differential Evolution (DE), for which we learn, respectively, adaptation policies for the step-size (for CMA-ES), and the scale factor and crossover rate (for DE). We train these policies on a set of 46 benchmark functions at different dimensionalities, with various inputs to the policies, in two settings: one policy per function, and one global policy for all functions. Compared, respectively, to the Cumulative Step-size Adaptation (CSA) policy and to two well-known adaptive DE variants (iDE and jDE), our policies are able to produce competitive results in the majority of cases, especially in the case of DE. },
keywords = {},
pubstate = {published},
tppubtype = {workshop}
}
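In this framework the metaheuristic is exposed as an environment in which, once per generation, an adaptation policy observes search statistics and sets parameters such as DE's scale factor F and crossover rate CR. A sketch of that plumbing with a static baseline policy plugged in (the DE variant, observation, and reward choice here are assumptions, not the authors' implementation):

```python
import random

def sphere(x):
    return sum(xi * xi for xi in x)

def de_generation(pop, F, CR, fn):
    """One generation of DE/rand/1/bin using the externally supplied F and CR."""
    new_pop = []
    for i, target in enumerate(pop):
        a, b, c = random.sample([p for j, p in enumerate(pop) if j != i], 3)
        trial = [ai + F * (bi - ci) if random.random() < CR else ti
                 for ti, ai, bi, ci in zip(target, a, b, c)]
        new_pop.append(trial if fn(trial) < fn(target) else target)
    return new_pop

def run(policy, dim=10, pop_size=20, generations=100):
    """Environment-style loop: per generation the policy observes simple
    statistics (here just the generation counter and best fitness) and
    returns (F, CR) for the next generation."""
    pop = [[random.uniform(-5, 5) for _ in range(dim)] for _ in range(pop_size)]
    best = min(map(sphere, pop))
    for g in range(generations):
        F, CR = policy(g, best)
        pop = de_generation(pop, F, CR, sphere)
        best = min(map(sphere, pop))
    return best

# A static baseline policy; an RL agent would replace this function.
print(run(lambda g, best: (0.5, 0.9)))
```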
Biedenkapp, André; Dang, Nguyen; Krejca, Martin S.; Hutter, Frank; Doerr, Carola
Theory-inspired Parameter Control Benchmarks for Dynamic Algorithm Configuration Proceedings Article
In: Proceedings of the Genetic and Evolutionary Computation Conference (GECCO'22), 2022.
@inproceedings{biedenkapp-gecco22a,
title = {Theory-inspired Parameter Control Benchmarks for Dynamic Algorithm Configuration},
author = {André Biedenkapp and Nguyen Dang and Martin S. Krejca and Frank Hutter and Carola Doerr},
url = {https://arxiv.org/abs/2202.03259},
year = {2022},
date = {2022-07-09},
urldate = {2022-07-09},
booktitle = {Proceedings of the Genetic and Evolutionary Computation Conference (GECCO'22)},
journal = {arXiv:2202.03259 [cs.NE]},
abstract = {It has long been observed that the performance of evolutionary algorithms and other randomized search heuristics can benefit from a non-static choice of the parameters that steer their optimization behavior. Mechanisms that identify suitable configurations on the fly ("parameter control") or via a dedicated training process ("dynamic algorithm configuration") are therefore an important component of modern evolutionary computation frameworks. Several approaches to address the dynamic parameter setting problem exist, but we barely understand which ones to prefer for which applications. As in classical benchmarking, problem collections with a known ground truth can offer very meaningful insights in this context. Unfortunately, settings with well-understood control policies are very rare.
One of the few exceptions for which we know which parameter settings minimize the expected runtime is the LeadingOnes problem. We extend this benchmark by analyzing optimal control policies that can select the parameters only from a given portfolio of possible values. This also allows us to compute optimal parameter portfolios of a given size. We demonstrate the usefulness of our benchmarks by analyzing the behavior of the DDQN reinforcement learning approach for dynamic algorithm configuration. },
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Biedenkapp, André; Speck, David; Sievers, Silvan; Hutter, Frank; Lindauer, Marius; Seipp, Jendrik
Learning Domain-Independent Policies for Open List Selection Workshop
Workshop on Bridging the Gap Between AI Planning and Reinforcement Learning (PRL @ ICAPS'22), 2022.
@workshop{biedenkapp-prl22a,
title = {Learning Domain-Independent Policies for Open List Selection},
author = {André Biedenkapp and David Speck and Silvan Sievers and Frank Hutter and Marius Lindauer and Jendrik Seipp},
url = {http://ml.informatik.uni-freiburg.de/wp-content/uploads/2022/06/22-PRL-DAC4AIPlanning.pdf, pdf},
year = {2022},
date = {2022-06-13},
urldate = {2022-06-13},
booktitle = {Workshop on Bridging the Gap Between AI Planning and Reinforcement Learning (PRL @ ICAPS'22)},
abstract = {Since its proposal over a decade ago, LAMA has been considered one of the best-performing satisficing classical planners. Its key component is heuristic search with multiple open lists, each using a different heuristic function to order states. Even with a very simple, ad-hoc policy for open list selection, LAMA achieves state-of-the-art results. In this paper, we propose to use dynamic algorithm configuration to learn such policies in a principled and data-driven manner. On the learning side, we show how to train a reinforcement learning agent over several heterogeneous environments, aiming at zero-shot generalization to new related domains. On the planning side, our experimental results show that the trained policies often reach the performance of LAMA, and sometimes even perform better. Furthermore, our analysis of different policies shows that prioritizing states reached via preferred operators is crucial, explaining the strong performance of LAMA.},
keywords = {},
pubstate = {published},
tppubtype = {workshop}
}
Bhatia, Abhinav; Svegliato, Justin; Nashed, Samer B.; Zilberstein, Shlomo
Tuning the Hyperparameters of Anytime Planning: A Metareasoning Approach with Deep Reinforcement Learning Proceedings Article
In: Proceedings of the 32nd International Conference on Automated Planning and Scheduling (ICAPS'22), 2022.
@inproceedings{bathia-icaps22a,
title = {Tuning the Hyperparameters of Anytime Planning: A Metareasoning Approach with Deep Reinforcement Learning},
author = {Abhinav Bhatia and Justin Svegliato and Samer B. Nashed and Shlomo Zilberstein},
url = {https://ojs.aaai.org/index.php/ICAPS/article/view/19842/19601},
year = {2022},
date = {2022-06-13},
urldate = {2022-06-13},
booktitle = {Proceedings of the 32nd International Conference on Automated Planning and Scheduling (ICAPS'22)},
abstract = {Anytime planning algorithms often have hyperparameters that can be tuned at runtime to optimize their performance. While work on metareasoning has focused on when to interrupt an anytime planner and act on the current plan, the scope of metareasoning can be expanded to tuning the hyperparameters of the anytime planner at runtime. This paper introduces a general, decision-theoretic metareasoning approach that optimizes both the stopping point and hyperparameters of anytime planning. We begin by proposing a generalization of the standard meta-level control problem for anytime algorithms. We then offer a meta-level control technique that monitors and controls an anytime algorithm using deep reinforcement learning. Finally, we show that our approach boosts performance on a common benchmark domain that uses anytime weighted A* to solve a range of heuristic search problems and a mobile robot application that uses RRT* to solve motion planning problems. },
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Mandhane, Amol; Zhernov, Anton; Rauh, Maribeth; Gu, Chenjie; Wang, Miaosen; Xue, Flora; Shang, Wendy; Pang, Derek; Claus, Rene; Chiang, Ching-Han; et al.
MuZero with Self-competition for Rate Control in VP9 Video Compression Unpublished
2022.
@unpublished{mandhane-arxiv22a,
title = {MuZero with Self-competition for Rate Control in VP9 Video Compression},
author = {Amol Mandhane and Anton Zhernov and Maribeth Rauh and Chenjie Gu and Miaosen Wang and Flora Xue and Wendy Shang and Derek Pang and Rene Claus and Ching-Han Chiang and others},
url = {https://arxiv.org/abs/2202.06626},
year = {2022},
date = {2022-02-14},
urldate = {2023-06-13},
abstract = {Video streaming usage has seen a significant rise as entertainment, education, and business increasingly rely on online video. Optimizing video compression has the potential to increase access and quality of content to users, and reduce energy use and costs overall. In this paper, we present an application of the MuZero algorithm to the challenge of video compression. Specifically, we target the problem of learning a rate control policy to select the quantization parameters (QP) in the encoding process of libvpx, an open source VP9 video compression library widely used by popular video-on-demand (VOD) services. We treat this as a sequential decision making problem to maximize the video quality with an episodic constraint imposed by the target bitrate. Notably, we introduce a novel self-competition based reward mechanism to solve constrained RL with variable constraint satisfaction difficulty, which is challenging for existing constrained RL methods. We demonstrate that the MuZero-based rate control achieves an average 6.28% reduction in size of the compressed videos for the same delivered video quality level (measured as PSNR BD-rate) compared to libvpx's two-pass VBR rate control policy, while having better constraint satisfaction behavior.},
keywords = {},
pubstate = {published},
tppubtype = {unpublished}
}
2021
Getzelman, Grant; Balaprakash, Prasanna
Learning to Switch Optimizers for Quadratic Programming Proceedings Article
In: Balasubramanian, Vineeth N.; Tsang, Ivor (Ed.): Proceedings of The 13th Asian Conference on Machine Learning, pp. 1553–1568, PMLR, 2021.
@inproceedings{getzelman-acml21a,
title = {Learning to Switch Optimizers for Quadratic Programming},
author = {Getzelman, Grant and Balaprakash, Prasanna},
editor = {Balasubramanian, Vineeth N. and Tsang, Ivor},
url = {https://proceedings.mlr.press/v157/getzelman21a.html},
year = {2021},
date = {2021-11-17},
booktitle = {Proceedings of The 13th Asian Conference on Machine Learning},
volume = {157},
pages = {1553--1568},
publisher = {PMLR},
series = {Proceedings of Machine Learning Research},
abstract = {Quadratic programming (QP) seeks to solve optimization problems involving quadratic functions that can include complex boundary constraints. QP in the unrestricted form is $\mathcal{NP}$-hard; but when restricted to the convex case, it becomes tractable. Active set and interior point methods are used to solve convex problems, and in the nonconvex case various heuristics or relaxations are used to produce high-quality solutions in finite time. Learning to optimize (L2O) is an emerging approach to design solvers for optimization problems. We develop an L2O approach that uses reinforcement learning to learn a stochastic policy to switch between pre-existing optimization algorithms to solve QP problem instances. In particular, our agent switches between three simple optimizers: Adam, gradient descent, and random search. Our experiments show that the learned optimizer minimizes quadratic functions faster and finds better-quality solutions in the long term than do any of the possible optimizers switched between. We also compare our solver with the standard QP algorithms in MATLAB and find better performance in fewer function evaluations.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
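The idea here is to treat the choice of optimizer itself as the controlled parameter: at each iteration a policy decides which optimizer performs the next step on the quadratic objective. A small sketch with a hand-written switching rule standing in for the learned stochastic policy (only gradient descent and random search are included, and the switching criterion is illustrative):

```python
import numpy as np

rng = np.random.default_rng(0)
n = 20
M = rng.standard_normal((n, n))
A = M.T @ M + n * np.eye(n)          # symmetric positive definite -> convex quadratic
b = rng.standard_normal(n)

def f(x):
    return 0.5 * x @ A @ x + b @ x

def grad(x):
    return A @ x + b

def gd_step(x, lr=1e-3):
    return x - lr * grad(x)

def random_search_step(x, sigma=0.1):
    cand = x + sigma * rng.standard_normal(n)
    return cand if f(cand) < f(x) else x

x = rng.standard_normal(n)
for t in range(500):
    # Hand-written stand-in for the learned switching policy: follow the
    # gradient while it is informative, fall back to random search otherwise.
    step = gd_step if np.linalg.norm(grad(x)) > 1e-3 else random_search_step
    x = step(x)
print("final objective:", f(x))
```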
Malashin, Roman Olegovich
Sparsely Ensembled Convolutional Neural Network Classifiers via Reinforcement Learning Proceedings Article
In: 2021 6th International Conference on Machine Learning Technologies, pp. 102–110, 2021, ISBN: 9781450389402.
@inproceedings{malashin-acm21,
title = {Sparsely Ensembled Convolutional Neural Network Classifiers via Reinforcement Learning},
author = {Roman Olegovich Malashin},
url = {https://dl.acm.org/doi/10.1145/3468891.3468906
https://arxiv.org/abs/2102.03921},
doi = {10.1145/3468891.3468906},
isbn = {9781450389402},
year = {2021},
date = {2021-09-06},
booktitle = {2021 6th International Conference on Machine Learning Technologies},
pages = {102–110},
series = {ICMLT 2021},
abstract = {We consider convolutional neural network (CNN) ensemble learning with the objective function inspired by the least action principle; it includes resource consumption component. We teach an agent to perceive images through the set of pre-trained classifiers and want the resulting dynamically configured system to unfold the computational graph with the trajectory that refers to the minimal number of operations and maximal expected accuracy. The proposed agent's architecture implicitly approximates the required classifier selection function with the help of reinforcement learning. Our experimental results prove, that if the agent exploits the dynamic (and context-dependent) structure of computations, it outperforms conventional ensemble learning.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Nguyen, Manh Hung; Grinsztajn, Nathan; Guyon, Isabelle; Sun-Hosoya, Lisheng
MetaREVEAL: RL-based Meta-learning from Learning Curves Proceedings Article
In: Workshop on Interactive Adaptive Learning co-located with European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD 2021), 2021.
@inproceedings{nguyen-ial21a,
title = {MetaREVEAL: RL-based Meta-learning from Learning Curves},
author = {Manh Hung Nguyen and Nathan Grinsztajn and Isabelle Guyon and Lisheng Sun-Hosoya},
url = {https://hal.inria.fr/hal-03502358v2/document},
year = {2021},
date = {2021-09-01},
urldate = {2021-09-01},
booktitle = {Workshop on Interactive Adaptive Learning co-located with European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD 2021)},
abstract = {This paper addresses a cornerstone of Automated Machine Learning: the problem of rapidly uncovering which machine learning algorithm performs best on a new dataset. Our approach leverages performances of such algorithms on datasets to which they have been previously exposed, i.e., implementing a form of meta-learning. More specifically, the problem is cast as a REVEAL Reinforcement Learning (RL) game: the meta-learning problem is wrapped into a RL environment in which an agent can start, pause, or resume training various machine learning algorithms to progressively “reveal” their learning curves. The learned policy is then applied to quickly uncover the best algorithm on a new dataset. While other similar approaches, such as Freeze-Thaw, were proposed in the past, using Bayesian optimization, our methodology is, to the best of our knowledge, the first that trains a RL agent to do this task on previous datasets. Using real and artificial data, we show that our new RL-based meta-learning paradigm outperforms Free-Thaw and other baseline methods, with respect to the Area under the Learning curve metric, a form of evaluation of Any-time learning (i.e., the capability of interrupting the algorithm at any time while obtaining good performance).},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Ichnowski, Jeffrey; Jain, Paras; Stellato, Bartolomeo; Banjac, Goran; Luo, Michael; Borrelli, Francesco; Gonzalez, Joseph E.; Stoica, Ion; Goldberg, Ken
Accelerating Quadratic Optimization with Reinforcement Learning Unpublished
2021.
@unpublished{ichnowski-arxiv21a,
title = {Accelerating Quadratic Optimization with Reinforcement Learning},
author = {Jeffrey Ichnowski and Paras Jain and Bartolomeo Stellato and Goran Banjac and Michael Luo and Francesco Borrelli and Joseph E. Gonzalez and Ion Stoica and Ken Goldberg},
url = {https://arxiv.org/abs/2107.10847},
year = {2021},
date = {2021-07-22},
journal = {arXiv},
abstract = {First-order methods for quadratic optimization such as OSQP are widely used for large-scale machine learning and embedded optimal control, where many related problems must be rapidly solved. These methods face two persistent challenges: manual hyperparameter tuning and convergence time to high-accuracy solutions. To address these, we explore how Reinforcement Learning (RL) can learn a policy to tune parameters to accelerate convergence. In experiments with well-known QP benchmarks we find that our RL policy, RLQP, significantly outperforms state-of-the-art QP solvers by up to 3x. RLQP generalizes surprisingly well to previously unseen problems with varying dimension and structure from different applications, including the QPLIB, Netlib LP and Maros-Meszaros problems. Code for RLQP is available online.},
keywords = {},
pubstate = {published},
tppubtype = {unpublished}
}
Speck, D; Biedenkapp, A; Hutter, F; Mattmüller, R; Lindauer, M
Learning Heuristic Selection with Dynamic Algorithm Configuration Proceedings Article
In: Zhuo, H H; Yang, Q; Do, M; Goldman, R; Biundo, S; Katz, M (Ed.): Proceedings of the 31st International Conference on Automated Planning and Scheduling (ICAPS'21), pp. 597–605, AAAI, 2021.
@inproceedings{speck-icaps21b,
title = {Learning Heuristic Selection with Dynamic Algorithm Configuration},
author = {D Speck and A Biedenkapp and F Hutter and R Mattmüller and M Lindauer},
editor = {H H Zhuo and Q Yang and M Do and R Goldman and S Biundo and M Katz},
url = {https://ojs.aaai.org/index.php/ICAPS/article/view/16008},
year = {2021},
date = {2021-01-01},
booktitle = {Proceedings of the 31st International Conference on Automated Planning and Scheduling (ICAPS'21)},
pages = {597--605},
publisher = {AAAI},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Eimer, T; Biedenkapp, A; Reimer, M; Adriaensen, S; Hutter, F; Lindauer, M
DACBench: A Benchmark Library for Dynamic Algorithm Configuration Proceedings Article
In: Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence (IJCAI'21), ijcai.org, 2021.
@inproceedings{eimer-ijcai21,
title = {DACBench: A Benchmark Library for Dynamic Algorithm Configuration},
author = {T Eimer and A Biedenkapp and M Reimer and S Adriaensen and F Hutter and M Lindauer},
url = {https://ml.informatik.uni-freiburg.de/papers/21-IJCAI-DACBench.pdf},
year = {2021},
date = {2021-01-01},
booktitle = {Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence (IJCAI'21)},
publisher = {ijcai.org},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
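DACBench exposes its benchmarks behind a common gym-style interface. The sketch below follows the library's documented usage pattern as I recall it; the benchmark class, import path, and the exact reset/step signatures are assumptions that should be checked against the installed version:

```python
# Assumed API, modeled on DACBench's gym-style interface; verify the import
# path and the reset/step signatures against the version you have installed.
from dacbench.benchmarks import SigmoidBenchmark

bench = SigmoidBenchmark()
env = bench.get_environment()

state = env.reset()
done, total_reward = False, 0.0
while not done:
    action = env.action_space.sample()          # a random configuration policy
    state, reward, done, info = env.step(action)
    total_reward += reward
print("episode return of the random policy:", total_reward)
```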
Almeida, Diogo; Winter, Clemens; Tang, Jie; Zaremba, Wojciech
A Generalizable Approach to Learning Optimizers Unpublished
2021.
@unpublished{almeida-arxiv21,
title = {A Generalizable Approach to Learning Optimizers},
author = {Diogo Almeida and Clemens Winter and Jie Tang and Wojciech Zaremba},
url = {https://arxiv.org/pdf/2106.00958.pdf},
year = {2021},
date = {2021-01-01},
urldate = {2021-01-01},
journal = {arXiv preprint arXiv:2106.00958},
keywords = {},
pubstate = {published},
tppubtype = {unpublished}
}
Bhatia, Abhinav; Svegliato, Justin; Zilberstein, Shlomo
Tuning the Hyperparameters of Anytime Planning: A Deep Reinforcement Learning Approach Proceedings Article
In: ICAPS 2021 Workshop on Heuristics and Search for Domain-independent Planning, 2021.
@inproceedings{bhatia-icaps21,
title = {Tuning the Hyperparameters of Anytime Planning: A Deep Reinforcement Learning Approach},
author = {Abhinav Bhatia and Justin Svegliato and Shlomo Zilberstein},
url = {https://openreview.net/forum?id=c7hpFp_eRCo},
year = {2021},
date = {2021-01-01},
urldate = {2021-01-01},
booktitle = {ICAPS 2021 Workshop on Heuristics and Search for Domain-independent Planning},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
2020
Biedenkapp, André; Bozkurt, H. Furkan; Eimer, Theresa; Hutter, Frank; Lindauer, Marius
Dynamic Algorithm Configuration: Foundation of a New Meta-Algorithmic Framework Conference
Proceedings of the Twenty-fourth European Conference on Artificial Intelligence (ECAI'20), 2020.
@conference{biedenka-ecai,
title = {Dynamic Algorithm Configuration: Foundation of a New Meta-Algorithmic Framework},
author = {André Biedenkapp and H. Furkan Bozkurt and Theresa Eimer and Frank Hutter and Marius Lindauer},
url = {http://ecai2020.eu/papers/1237_paper.pdf},
year = {2020},
date = {2020-08-29},
booktitle = {Proceedings of the Twenty-fourth European Conference on Artificial Intelligence (ECAI'20)},
abstract = {The performance of many algorithms in the fields of hard combinatorial problem solving, machine learning or AI in general depends on parameter tuning. Automated methods have been proposed to alleviate users from the tedious and error-prone task of manually searching for performance-optimized configurations across a set of problem instances. However, there is still a lot of untapped potential through adjusting an algorithm’s parameters online since different parameter values can be optimal at different stages of the algorithm. Prior work showed that reinforcement learning is an effective approach to learn policies for online adjustments of algorithm parameters in a data-driven way. We extend that approach by formulating the resulting dynamic algorithm configuration as a contextual MDP, such that RL not only learns a policy for a single instance, but across a set of instances. To lay the foundation for studying dynamic algorithm configuration with RL in a controlled setting, we propose white-box benchmarks covering major aspects that make dynamic algorithm configuration a hard problem in practice and study the performance of various types of configuration strategies for them. On these white-box benchmarks, we show that (i) RL is a robust candidate for learning configuration policies, outperforming standard parameter optimization approaches, such as classical algorithm configuration; (ii) based on function approximation, RL agents can learn to generalize to new types of instances; and (iii) self-paced learning can substantially improve the performance by selecting a useful sequence of training instances automatically.},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}
Gomoluch, Pawel; Alrajeh, Dalal; Russo, Alessandra; Bucchiarone, Antonio
Learning Neural Search Policies for Classical Planning Proceedings Article
In: Proceedings of the International Conference on Automated Planning and Scheduling, pp. 522–530, 2020.
@inproceedings{gomoluch-icaps20b,
title = {Learning Neural Search Policies for Classical Planning},
author = {Pawel Gomoluch and Dalal Alrajeh and Alessandra Russo and Antonio Bucchiarone},
url = {https://icaps20.icaps-conference.org/paper191.html},
year = {2020},
date = {2020-01-01},
urldate = {2020-01-01},
booktitle = {Proceedings of the International Conference on Automated Planning and Scheduling},
volume = {30},
pages = {522--530},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Shala, G; Biedenkapp, A; Awad, N; Adriaensen, S; Lindauer, M; Hutter, F
Learning Step-Size Adaptation in CMA-ES Proceedings Article
In: Proceedings of the Sixteenth International Conference on Parallel Problem Solving from Nature (PPSN'20), pp. 691–706, Springer, 2020.
@inproceedings{shala-ppsn20b,
title = {Learning Step-Size Adaptation in CMA-ES},
author = {G Shala and A Biedenkapp and N Awad and S Adriaensen and M Lindauer and F Hutter},
url = {https://ml.informatik.uni-freiburg.de/papers/20-PPSN-LTO-CMA.pdf},
year = {2020},
date = {2020-01-01},
urldate = {2020-01-01},
booktitle = {Proceedings of the Sixteenth International Conference on Parallel Problem Solving from Nature (PPSN'20)},
pages = {691--706},
publisher = {Springer},
series = {Lecture Notes in Computer Science},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Sae-Dan, Weerapan; Kessaci, Marie-Eléonore; Veerapen, Nadarajen; Jourdan, Laetitia
Time-Dependent Automatic Parameter Configuration of a Local Search Algorithm Proceedings Article
In: Proceedings of the 2020 Genetic and Evolutionary Computation Conference Companion, pp. 1898–1905, Association for Computing Machinery, Cancún, Mexico, 2020, ISBN: 9781450371278.
@inproceedings{sae-dan-gecco20b,
title = {Time-Dependent Automatic Parameter Configuration of a Local Search Algorithm},
author = {Weerapan Sae-Dan and Marie-Eléonore Kessaci and Nadarajen Veerapen and Laetitia Jourdan},
url = {https://doi.org/10.1145/3377929.3398107},
doi = {10.1145/3377929.3398107},
isbn = {9781450371278},
year = {2020},
date = {2020-01-01},
booktitle = {Proceedings of the 2020 Genetic and Evolutionary Computation Conference Companion},
pages = {1898–1905},
publisher = {Association for Computing Machinery},
address = {Cancún, Mexico},
series = {GECCO '20},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
2019
Xu, Z; Dai, A M; Kemp, J; Metz, L
Learning an Adaptive Learning Rate Schedule Unpublished
2019, (arXiv:1909.09712 [cs.LG]).
@unpublished{xu-arxiv19b,
title = {Learning an Adaptive Learning Rate Schedule},
author = {Z Xu and A M Dai and J Kemp and L Metz},
url = {https://arxiv.org/abs/1909.09712},
year = {2019},
date = {2019-01-01},
urldate = {2019-01-01},
note = {arXiv:1909.09712 [cs.LG]},
keywords = {},
pubstate = {published},
tppubtype = {unpublished}
}
Sharma, Mudita; Komninos, Alexandros; López-Ibáñez, Manuel; Kazakov, Dimitar
Deep reinforcement learning based parameter control in differential evolution Proceedings Article
In: Auger, A; Stützle, T (Ed.): Proceedings of the Genetic and Evolutionary Computation Conference (GECCO'19), pp. 709–717, ACM, 2019.
@inproceedings{sharma-gecco19b,
title = {Deep reinforcement learning based parameter control in differential evolution},
author = {Mudita Sharma and Alexandros Komninos and Manuel López-Ibáñez and Dimitar Kazakov},
editor = {A Auger and T Stützle},
url = {https://dl.acm.org/doi/10.1145/3321707.3321813},
year = {2019},
date = {2019-01-01},
urldate = {2019-01-01},
booktitle = {Proceedings of the Genetic and Evolutionary Computation Conference (GECCO'19)},
pages = {709--717},
publisher = {ACM},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Gomoluch, Paweł; Alrajeh, Dalal; Russo, Alessandra
Learning classical planning strategies with policy gradient Proceedings Article
In: Proceedings of the International Conference on Automated Planning and Scheduling, pp. 637–645, 2019.
@inproceedings{gomoluch-icaps19b,
title = {Learning classical planning strategies with policy gradient},
author = {Pawe{ł} Gomoluch and Dalal Alrajeh and Alessandra Russo},
url = {https://ojs.aaai.org/index.php/ICAPS/article/view/3531},
year = {2019},
date = {2019-01-01},
urldate = {2019-01-01},
booktitle = {Proceedings of the International Conference on Automated Planning and Scheduling},
volume = {29},
pages = {637--645},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
2017
Ansótegui, Carlos; Pon, Josep; Sellmann, Meinolf; Tierney, Kevin
Reactive Dialectic Search Portfolios for MaxSAT Proceedings Article
In: Singh, S.; Markovitch, S. (Ed.): Proceedings of the Conference on Artificial Intelligence (AAAI'17), pp. 765–772, AAAI Press, 2017.
@inproceedings{ansotegui-aaai17b,
title = {Reactive Dialectic Search Portfolios for MaxSAT},
author = {Carlos Ansótegui and Josep Pon and Meinolf Sellmann and Kevin Tierney},
editor = {S. Singh and S. Markovitch},
url = {https://ojs.aaai.org/index.php/AAAI/article/view/10660},
year = {2017},
date = {2017-01-01},
urldate = {2017-01-01},
booktitle = {Proceedings of the Conference on Artificial Intelligence (AAAI'17)},
pages = {765--772},
publisher = {AAAI Press},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Xu, Chang; Qin, Tao; Wang, Gang; Liu, Tie-Yan
Reinforcement learning for learning rate control Journal Article
In: arXiv preprint arXiv:1705.11159, 2017.
@article{xu-arxiv17b,
title = {Reinforcement learning for learning rate control},
author = {Chang Xu and Tao Qin and Gang Wang and Tie-Yan Liu},
url = {https://arxiv.org/abs/1705.11159},
year = {2017},
date = {2017-01-01},
urldate = {2017-01-01},
journal = {arXiv preprint arXiv:1705.11159},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Kadioglu, S; Sellmann, M; Wagner, M
Learning a reactive restart strategy to improve stochastic search Proceedings Article
In: International Conference on Learning and Intelligent Optimization, pp. 109–123, Springer 2017.
@inproceedings{kadioglu-lion17b,
title = {Learning a reactive restart strategy to improve stochastic search},
author = {S Kadioglu and M Sellmann and M Wagner},
url = {https://cs.adelaide.edu.au/~markus/pub/2017lion-reactiveRestarts.pdf},
year = {2017},
date = {2017-01-01},
urldate = {2017-01-01},
booktitle = {International Conference on Learning and Intelligent Optimization},
pages = {109--123},
organization = {Springer},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
2016
Adriaensen, S; Nowé, A
Towards a White Box Approach to Automated Algorithm Design Proceedings Article
In: Kambhampati, S (Ed.): Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI'16), pp. 554–560, 2016.
@inproceedings{adriaensen-ijcai16b,
title = {Towards a White Box Approach to Automated Algorithm Design},
author = {S Adriaensen and A Nowé},
editor = {S Kambhampati},
url = {https://www.ijcai.org/Proceedings/16/Papers/085.pdf},
year = {2016},
date = {2016-01-01},
urldate = {2016-01-01},
booktitle = {Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI'16)},
pages = {554--560},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Hansen, Samantha
Using Deep Q-Learning to Control Optimization Hyperparameters Journal Article
In: arXiv preprint arXiv:1602.04062, 2016.
@article{hansen-arxiv16b,
title = {Using Deep Q-Learning to Control Optimization Hyperparameters},
author = {Samantha Hansen},
url = {https://arxiv.org/pdf/1602.04062.pdf},
year = {2016},
date = {2016-01-01},
urldate = {2016-01-01},
journal = {arXiv preprint arXiv:1602.04062},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Andersson, Martin; Bandaru, Sunith; Ng, Amos HC
Tuning of Multiple Parameter Sets in Evolutionary Algorithms Proceedings Article
In: Proceedings of the Genetic and Evolutionary Computation Conference 2016, pp. 533–540, 2016.
@inproceedings{andersson-gecco16b,
title = {Tuning of Multiple Parameter Sets in Evolutionary Algorithms},
author = {Martin Andersson and Sunith Bandaru and Amos HC Ng},
url = {https://dl.acm.org/doi/10.1145/2908812.2908899},
year = {2016},
date = {2016-01-01},
urldate = {2016-01-01},
booktitle = {Proceedings of the Genetic and Evolutionary Computation Conference 2016},
pages = {533--540},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Daniel, C; Taylor, J; Nowozin, S
Learning Step Size Controllers for Robust Neural Network Training Proceedings Article
In: Schuurmans, D; Wellman, M (Ed.): Proceedings of the Thirtieth National Conference on Artificial Intelligence (AAAI'16), AAAI Press, 2016.
@inproceedings{daniel-aaai16b,
title = {Learning Step Size Controllers for Robust Neural Network Training},
author = {C Daniel and J Taylor and S Nowozin},
editor = {D Schuurmans and M Wellman},
url = {https://ojs.aaai.org/index.php/AAAI/article/view/10187},
year = {2016},
date = {2016-01-01},
urldate = {2016-01-01},
booktitle = {Proceedings of the Thirtieth National Conference on Artificial Intelligence (AAAI'16)},
publisher = {AAAI Press},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
2014
López-Ibáñez, Manuel; Stützle, Thomas
Automatically Improving the Anytime Behaviour of Optimisation Algorithms Journal Article
In: European Journal of Operational Research, vol. 235, no. 3, pp. 569–582, 2014.
@article{lopez-ejor14b,
title = {Automatically Improving the Anytime Behaviour of Optimisation Algorithms},
author = {Manuel López-Ibáñez and Thomas Stützle},
url = {https://www.sciencedirect.com/science/article/abs/pii/S0377221713008667?via%3Dihub},
year = {2014},
date = {2014-01-01},
urldate = {2014-01-01},
journal = {European Journal of Operational Research},
volume = {235},
number = {3},
pages = {569--582},
publisher = {Elsevier},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
2012
Battiti, R; Campigotto, P
An Investigation of Reinforcement Learning for Reactive Search Optimization Book Section
In: Hamadi, Y; Monfroy, E; Saubion, F (Ed.): Autonomous Search, pp. 131–160, Springer, 2012.
@incollection{battiti-as12ab,
title = {An Investigation of Reinforcement Learning for Reactive Search Optimization},
author = {R Battiti and P Campigotto},
editor = {Y Hamadi and E Monfroy and F Saubion},
url = {https://link.springer.com/chapter/10.1007%2F978-3-642-21434-9_6},
year = {2012},
date = {2012-01-01},
urldate = {2012-01-01},
booktitle = {Autonomous Search},
pages = {131--160},
publisher = {Springer},
keywords = {},
pubstate = {published},
tppubtype = {incollection}
}
2010
Xu, Yuehua; Fern, Alan; Yoon, Sungwook
Iterative Learning of Weighted Rule Sets for Greedy Search Conference
Proceedings of the 20th International Conference on Automated Planning and Scheduling (ICAPS'10), 2010.
@conference{xu-icaps10a,
title = {Iterative Learning of Weighted Rule Sets for Greedy Search},
author = {Yuehua Xu and Alan Fern and Sungwook Yoon},
url = {https://www.aaai.org/ocs/index.php/ICAPS/ICAPS10/paper/view/1444},
year = {2010},
date = {2010-04-20},
booktitle = {Proceedings of the 20th International Conference on Automated Planning and Scheduling (ICAPS'10)},
abstract = {Greedy search is commonly used in an attempt to generate solutions quickly at the expense of completeness and optimality. In this work, we consider learning sets of weighted action-selection rules for guiding greedy search with application to automated planning. We make two primary contributions over prior work on learning for greedy search. First, we introduce weighted sets of action-selection rules as a new form of control knowledge for greedy search. Prior work has shown the utility of action-selection rules for greedy search, but has treated the rules as hard constraints, resulting in brittleness. Our weighted rule sets allow multiple rules to vote, helping to improve robustness to noisy rules. Second, we give a new iterative learning algorithm for learning weighted rule sets based on RankBoost, an efficient boosting algorithm for ranking. Each iteration considers the actual performance of the current rule set and directs learning based on the observed search errors. This is in contrast to most prior approaches, which learn control knowledge independently of the search process. Our empirical results have shown significant promise for this approach in a number of domains.},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}
Sakurai, Y; Takada, K; Kawabe, T; Tsuruta, S
A Method to Control Parameters of Evolutionary Algorithms by Using Reinforcement Learning Proceedings Article
In: Yétongnon, K; Dipanda, A; Chbeir, R (Ed.): Proceedings of Sixth International Conference on Signal-Image Technology and Internet-Based Systems (SITIS), pp. 74–79, IEEE Computer Society, 2010.
@inproceedings{sakurai-sitis10ab,
title = {A Method to Control Parameters of Evolutionary Algorithms by Using Reinforcement Learning},
author = {Y Sakurai and K Takada and T Kawabe and S Tsuruta},
editor = {K Yétongnon and A Dipanda and R Chbeir},
url = {https://ieeexplore.ieee.org/document/5714532},
year = {2010},
date = {2010-01-01},
urldate = {2010-01-01},
booktitle = {Proceedings of Sixth International Conference on Signal-Image Technology and Internet-Based Systems (SITIS)},
pages = {74--79},
publisher = {IEEE Computer Society},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Fialho, Alvaro; Costa, Luis Da; Schoenauer, Marc; Sebag, Michele
Analyzing Bandit-Based Adaptive Operator Selection Mechanisms Journal Article
In: Annals of Mathematics and Artificial Intelligence, vol. 60, no. 1, pp. 25–64, 2010.
@article{fialho-amai10b,
title = {Analyzing Bandit-Based Adaptive Operator Selection Mechanisms},
author = {Alvaro Fialho and Luis Da Costa and Marc Schoenauer and Michele Sebag},
url = {https://link.springer.com/article/10.1007%2Fs10472-010-9213-y},
year = {2010},
date = {2010-01-01},
urldate = {2010-01-01},
journal = {Annals of Mathematics and Artificial Intelligence},
volume = {60},
number = {1},
pages = {25--64},
publisher = {Springer},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
2008
Aine, Sandip; Kumar, Rajeev; Chakrabarti, P. P.
Adaptive parameter control of evolutionary algorithms to improve quality-time trade-off Journal Article
In: Applied Soft Computing, vol. 9, no. 2, pp. 527-540, 2008.
@article{aine-jasoc08a,
title = {Adaptive parameter control of evolutionary algorithms to improve quality-time trade-off},
author = {Sandip Aine and Rajeev Kumar and P.P. Chakrabarti},
url = {https://doi.org/10.1016/j.asoc.2008.07.001},
year = {2008},
date = {2008-08-05},
urldate = {2008-08-05},
journal = {Applied Soft Computing},
volume = {9},
number = {2},
pages = {527-540},
abstract = {Parameter control of evolutionary algorithms (EAs) poses special challenges as EA uses a population and requires many parameters to be controlled for an effective search. Quality improvement is dependent on several factors, such as, fitness estimation, population diversity and convergence rate. A widely practiced approach to identify a good set of parameters for a particular class of problem is through experimentation. Ideally, the parameter selection should depend on the resource availability, and thus, a rigid choice may not be suitable. In this work, we propose an automated framework for parameter selection, which can adapt according to the constraints specified. To condition the parameter choice through resource constraint/utilization, we consider two typical scenarios, one where maximum available run-time is pre-specified and the other in which a utility function modeling the quality-time trade-off is used instead of a rigid deadline. We present static and dynamic parameter selection strategies based on a probabilistic profiling method. Experiments performed with traveling salesman problem (TSP) and standard cell placement problem show that an informed adaptive parameter control mechanism can yield better results than a static selection.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
2002
Pettinger, J; Everson, R
Controlling Genetic Algorithms with Reinforcement Learning Proceedings Article
In: Proceedings of the 4th Annual Conference on Genetic and Evolutionary Computation, pp. 692–692, 2002.
@inproceedings{pettinger-gecco02b,
title = {Controlling Genetic Algorithms with Reinforcement Learning},
author = {J Pettinger and R Everson},
url = {https://dl.acm.org/doi/10.5555/2955491.2955607},
year = {2002},
date = {2002-01-01},
urldate = {2002-01-01},
booktitle = {Proceedings of the 4th Annual Conference on Genetic and Evolutionary Computation},
pages = {692--692},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
2001
Lagoudakis, M; Littman, M
Learning to Select Branching Rules in the DPLL Procedure for Satisfiability Journal Article
In: Electronic Notes in Discrete Mathematics, vol. 9, pp. 344–359, 2001.
@article{lagoudakis-endm01ab,
title = {Learning to Select Branching Rules in the DPLL Procedure for Satisfiability},
author = {M Lagoudakis and M Littman},
url = {https://www.sciencedirect.com/science/article/abs/pii/S1571065304003324?via%3Dihub},
year = {2001},
date = {2001-01-01},
urldate = {2001-01-01},
journal = {Electronic Notes in Discrete Mathematics},
volume = {9},
pages = {344--359},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
2000
Lagoudakis, Michail G.; Littman, Michael L.
Algorithm Selection using Reinforcement Learning Conference
Proceedings of the 17th International Conference on Machine Learning (ICML 2000), 2000.
@conference{lagoudakis-icml2000a,
title = {Algorithm Selection using Reinforcement Learning},
author = {Michail G. Lagoudakis and Michael L. Littman},
url = {http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.472.7494},
year = {2000},
date = {2000-07-01},
booktitle = {Proceedings of the 17th International Conference on Machine Learning (ICML 2000)},
abstract = {Many computational problems can be solved by multiple algorithms, with different algorithms fastest for different problem sizes, input distributions, and hardware characteristics. We consider the problem of algorithm selection: dynamically choose an algorithm to attack an instance of a problem with the goal of minimizing the overall execution time. We formulate the problem as a kind of Markov decision process (MDP), and use ideas from reinforcement learning to solve it. This paper introduces a kind of MDP that models the algorithm selection problem by allowing multiple state transitions. The well known Q-learning algorithm is adapted for this case in a way that combines both Monte-Carlo and Temporal Difference methods. Also, this work uses, and extends in a way to control problems, the Least-Squares Temporal Difference algorithm (LSTD(0)) of Boyan. The experimental study focuses on the classic problems of order statistic selection and sorting. The encouraging results reveal the potential of applying learning methods to traditional computational problems.},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}
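The paper models recursive algorithm selection, e.g. for sorting and order statistics, as an MDP in which the size of each (sub)problem defines the state and the action picks which algorithm to run on it. A toy illustration with a fixed size threshold standing in for the policy that the paper learns with Q-learning/LSTD (the threshold and the two candidate algorithms are illustrative choices):

```python
import random

THRESHOLD = 32   # stands in for the learned selection policy

def insertion_sort(a):
    # Fast on small inputs despite quadratic worst case.
    for i in range(1, len(a)):
        key, j = a[i], i - 1
        while j >= 0 and a[j] > key:
            a[j + 1] = a[j]
            j -= 1
        a[j + 1] = key
    return a

def adaptive_sort(a):
    # State = subproblem size; action = which algorithm handles it.
    if len(a) <= THRESHOLD:
        return insertion_sort(a)
    pivot = a[len(a) // 2]
    left = [x for x in a if x < pivot]
    mid = [x for x in a if x == pivot]
    right = [x for x in a if x > pivot]
    return adaptive_sort(left) + mid + adaptive_sort(right)

data = [random.randint(0, 10_000) for _ in range(1_000)]
assert adaptive_sort(data) == sorted(data)
```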