Safe and Adaptive Decision-Making for Optimization of Safety-Critical Systems:The ARTEO Algorithm
Published in arXiV.org. Under review for ICML 2023., 2022
We consider the problem of decision-making under uncertainty in an environment with safety constraints. We propose the ARTEO algorithm, where we cast multiarmed bandits as a mathematical programming problem subject to safety constraints and learn the unknown characteristics through exploration while optimizing the targets. We quantify the uncertainty in unknown characteristics by using Gaussian processes and incorporate it into the cost function as a contribution which drives exploration. We adaptively control the size of this contribution in accordance with the requirements of the environment. We guarantee the safety of our algorithm with a high probability through confidence bounds constructed under the regularity assumptions of Gaussian processes.
Recommended citation: B.S. Korkmaz, M. Zagorowska, M. Mercangoz, (2022). "Safe and Adaptive Decision-Making for Optimization of Safety-Critical Systems: The ARTEO Algorithm." https://arxiv.org/abs/2211.05495. https://arxiv.org/abs/2211.05495