Bayesian Optimization With Formal Safety Guarantees via Online Conformal Prediction

Research output: Contribution to journalArticlepeer-review

1 Citation (Scopus)
19 Downloads (Pure)

Abstract

Black-box zero-th order optimization is a central primitive for applications in fields as diverse as finance, physics, and engineering. In a common formulation of this problem, a designer sequentially attempts candidate solutions, receiving noisy feedback on the value of each attempt from the system. In this paper, we study scenarios in which feedback is also provided on the <italic>safety</italic> of the attempted solution, and the optimizer is constrained to limit the number of unsafe solutions that are tried throughout the optimization process. Focusing on methods based on Bayesian optimization (BO), prior art has introduced an optimization scheme &#x2013; referred to as <sc>SafeOpt</sc> &#x2013; that is guaranteed not to select <italic>any</italic> unsafe solution with a controllable probability over feedback noise as long as strict assumptions on the safety constraint function are met. In this paper, a novel BO-based approach is introduced that satisfies safety requirements irrespective of properties of the constraint function. This strong theoretical guarantee is obtained at the cost of allowing for an arbitrary, controllable but non-zero, rate of violation of the safety constraint. The proposed method, referred to as <sc>Safe-Bocp</sc>, builds on online conformal prediction (CP) and is specialized to the cases in which feedback on the safety constraint is either noiseless or noisy. Experimental results on synthetic and real- world data validate the advantages and flexibility of the proposed <sc>Safe-Bocp</sc>.

Original languageEnglish
Pages (from-to)1-15
Number of pages15
JournalIeee Journal Of Selected Topics In Signal Processing
Early online date3 Jul 2024
DOIs
Publication statusE-pub ahead of print - 3 Jul 2024

Fingerprint

Dive into the research topics of 'Bayesian Optimization With Formal Safety Guarantees via Online Conformal Prediction'. Together they form a unique fingerprint.

Cite this