Bicocca Open Archive

Safe-Exploration of Control Policies from Safe-Experience via Gaussian Processes

Candelieri, A; Ponti, A; Archetti, F

2022

Abstract

Control of many real-life systems relies strongly on the knowledge of a domain expert, who usually adopts a safe control policy to deal with uncertainty. Here, safe means that the policy aims to avoid system disruptions or significant deviations from the desired behaviour, usually at the cost of sub-optimal performance. This paper proposes a statistically sound approach that exploits the collected experience to safe-explore new policies, accepting a reasonable risk in terms of safety while improving performance. Gaussian Process regression is the core of the approach: it provides a probabilistic approximation of both the system's dynamics and its performance, based on historical data from the application of the safe policy. Being a probabilistic model, a Gaussian Process provides both an estimate of the level of safety and, more importantly, the associated predictive uncertainty, which is crucial for implementing the safe-exploration of new, efficient policies. The approach avoids the typically expensive implementation of a digital twin of the system, which simulation-optimization approaches require, as well as the formulation of a stochastic programming problem. Results on two case studies inspired by real-life systems are presented, showing improved performance with respect to the initial safe policy while maintaining reasonable safety of the systems.
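The idea in the abstract, using a Gaussian Process posterior (mean and predictive uncertainty) fitted on safe-policy data to decide which new policies are acceptably safe, can be illustrated with a minimal sketch. This record does not give the paper's kernel, safety criterion, or selection rule, so everything below is an assumption made for illustration: a one-dimensional policy parameter, a squared-exponential kernel, a zero prior mean, the hypothetical data `X_hist`, `safety_hist`, `perf_hist`, and the rule "maximise predicted performance among candidates whose probability of exceeding a safety threshold is at least `min_prob`".

```python
import math

def rbf(a, b, length_scale=1.0, signal_var=1.0):
    """Squared-exponential kernel between two scalar inputs (assumed kernel)."""
    return signal_var * math.exp(-0.5 * ((a - b) / length_scale) ** 2)

def cholesky(A):
    """Lower-triangular Cholesky factor of a symmetric positive-definite matrix."""
    n = len(A)
    L = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = sum(L[i][k] * L[j][k] for k in range(j))
            if i == j:
                L[i][j] = math.sqrt(A[i][i] - s)
            else:
                L[i][j] = (A[i][j] - s) / L[j][j]
    return L

def solve_lower(L, b):
    """Forward substitution: solve L y = b."""
    y = []
    for i in range(len(b)):
        y.append((b[i] - sum(L[i][k] * y[k] for k in range(i))) / L[i][i])
    return y

def solve_upper_from_lower(L, y):
    """Back substitution: solve L^T x = y."""
    n = len(y)
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (y[i] - sum(L[k][i] * x[k] for k in range(i + 1, n))) / L[i][i]
    return x

def gp_posterior(X, y, x_new, noise_var=1e-4, length_scale=1.0, signal_var=1.0):
    """Zero-mean GP posterior mean and standard deviation at a single point."""
    K = [[rbf(a, b, length_scale, signal_var) + (noise_var if i == j else 0.0)
          for j, b in enumerate(X)] for i, a in enumerate(X)]
    L = cholesky(K)
    alpha = solve_upper_from_lower(L, solve_lower(L, y))
    k_star = [rbf(a, x_new, length_scale, signal_var) for a in X]
    mean = sum(k * a for k, a in zip(k_star, alpha))
    v = solve_lower(L, k_star)
    var = rbf(x_new, x_new, length_scale, signal_var) - sum(vi * vi for vi in v)
    return mean, math.sqrt(max(var, 0.0))

def prob_safe(mean, std, threshold):
    """P(safety value >= threshold) under a Gaussian N(mean, std^2)."""
    if std == 0.0:
        return 1.0 if mean >= threshold else 0.0
    z = (mean - threshold) / std
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

def select_policy(X, safety_obs, perf_obs, candidates, threshold, min_prob,
                  length_scale=1.0):
    """Pick the candidate with the best predicted performance among those
    judged safe with probability >= min_prob (hypothetical selection rule)."""
    best, best_perf = None, -math.inf
    for x in candidates:
        s_mean, s_std = gp_posterior(X, safety_obs, x, length_scale=length_scale)
        if prob_safe(s_mean, s_std, threshold) < min_prob:
            continue  # too risky: low predicted safety or too much uncertainty
        p_mean, _ = gp_posterior(X, perf_obs, x, length_scale=length_scale)
        if p_mean > best_perf:
            best, best_perf = x, p_mean
    return best

# Hypothetical historical data collected under the expert's safe policy.
X_hist = [0.0, 0.1, 0.2, 0.3]                    # policy parameter values tried
safety_hist = [1.0 - x * x for x in X_hist]      # observed safety margin (made up)
perf_hist = list(X_hist)                         # observed performance (made up)

candidates = [i * 0.05 for i in range(21)]       # new policies to consider
chosen = select_policy(X_hist, safety_hist, perf_hist, candidates,
                       threshold=0.5, min_prob=0.95, length_scale=0.5)
print("selected policy parameter:", chosen)
```

Far from the historical data the predictive standard deviation grows back towards the prior, so the probability of being safe drops and such candidates are rejected; this is the role the abstract assigns to predictive uncertainty.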

Contribution type: paper
Keywords: Gaussian processes; Optimal control; Safe-exploration; Test
Content language: English
Conference name: 16th International Conference on Learning and Intelligent Optimization, LION 16 2022 - 5 June 2022 through 10 June 2022
Conference year: 2022
Volume editors: Simos, D; Rasskazova, V; Archetti, F; Kotsireas, I; Pardalos, P
Proceedings title: Learning and Intelligent Optimization: 16th International Conference, LION 16, Milos Island, Greece, June 5–10, 2022, Revised Selected Papers
Proceedings volume ISBN: 978-3-031-24865-8
Series: Lecture Notes in Computer Science
Ahead-of-print or first online publication date: 5 February 2023
Publication date: 2022
Volume number: 13621 LNCS
First page: 232
Last page: 247
Contribution DOI: https://dx.doi.org/10.1007/978-3-031-24866-5_18
Full text: none
Citation: Candelieri, A., Ponti, A., Archetti, F. (2022). Safe-Exploration of Control Policies from Safe-Experience via Gaussian Processes. In Learning and Intelligent Optimization: 16th International Conference, LION 16, Milos Island, Greece, June 5–10, 2022, Revised Selected Papers (pp. 232-247). Springer Science and Business Media Deutschland GmbH [10.1007/978-3-031-24866-5_18].
Appears in types: 02 - Conference contribution

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/10281/408465
