Learning Robust Output Control Barrier Functions From Safe Expert Demonstrations

This paper addresses learning safe output feedback control laws from partial observations of expert demonstrations. We assume that a model of the system dynamics and a state estimator are available along with corresponding error bounds, e.g., estimated from data in practice. We first propose robust...

Full description

Saved in:
Bibliographic Details
Main Authors: Lars Lindemann, Alexander Robey, Lejun Jiang, Satyajeet Das, Stephen Tu, Nikolai Matni
Format: Article
Language:English
Published: IEEE 2024-01-01
Series:IEEE Open Journal of Control Systems
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10491341/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841554025786376192
author Lars Lindemann
Alexander Robey
Lejun Jiang
Satyajeet Das
Stephen Tu
Nikolai Matni
author_facet Lars Lindemann
Alexander Robey
Lejun Jiang
Satyajeet Das
Stephen Tu
Nikolai Matni
author_sort Lars Lindemann
collection DOAJ
description This paper addresses learning safe output feedback control laws from partial observations of expert demonstrations. We assume that a model of the system dynamics and a state estimator are available along with corresponding error bounds, e.g., estimated from data in practice. We first propose robust output control barrier functions (ROCBFs) as a means to guarantee safety, as defined through controlled forward invariance of a safe set. We then formulate an optimization problem to learn ROCBFs from expert demonstrations that exhibit safe system behavior, e.g., data collected from a human operator or an expert controller. When the parametrization of the ROCBF is linear, then we show that, under mild assumptions, the optimization problem is convex. Along with the optimization problem, we provide verifiable conditions in terms of the density of the data, smoothness of the system model and state estimator, and the size of the error bounds that guarantee validity of the obtained ROCBF. Towards obtaining a practical control algorithm, we propose an algorithmic implementation of our theoretical framework that accounts for assumptions made in our framework in practice. We validate our algorithm in the autonomous driving simulator CARLA and demonstrate how to learn safe control laws from simulated RGB camera images.
format Article
id doaj-art-3c2fe38e54d74bdf8cb9d1e737ce7f38
institution Kabale University
issn 2694-085X
language English
publishDate 2024-01-01
publisher IEEE
record_format Article
series IEEE Open Journal of Control Systems
spelling doaj-art-3c2fe38e54d74bdf8cb9d1e737ce7f382025-01-09T00:03:00ZengIEEEIEEE Open Journal of Control Systems2694-085X2024-01-01315817210.1109/OJCSYS.2024.338534810491341Learning Robust Output Control Barrier Functions From Safe Expert DemonstrationsLars Lindemann0https://orcid.org/0000-0003-3430-6625Alexander Robey1https://orcid.org/0009-0003-5693-2819Lejun Jiang2https://orcid.org/0000-0001-9051-2218Satyajeet Das3https://orcid.org/0009-0001-6075-2664Stephen Tu4https://orcid.org/0000-0001-5178-8326Nikolai Matni5https://orcid.org/0000-0003-4936-3921Thomas Lord Department of Computer Science, University of Southern California, Los Angeles, CA, USADepartment of Electrical and Systems Engineering, University of Pennsylvania, Philadelphia, PA, USANuro, Inc., Mountain View, CA, USADepartment of Electrical and Systems Engineering, University of Pennsylvania, Philadelphia, PA, USAMing Hsieh Department of Electrical and Computer Engineering, University of Southern California, Los Angeles, CA, USADepartment of Electrical and Systems Engineering, University of Pennsylvania, Philadelphia, PA, USAThis paper addresses learning safe output feedback control laws from partial observations of expert demonstrations. We assume that a model of the system dynamics and a state estimator are available along with corresponding error bounds, e.g., estimated from data in practice. We first propose robust output control barrier functions (ROCBFs) as a means to guarantee safety, as defined through controlled forward invariance of a safe set. We then formulate an optimization problem to learn ROCBFs from expert demonstrations that exhibit safe system behavior, e.g., data collected from a human operator or an expert controller. When the parametrization of the ROCBF is linear, then we show that, under mild assumptions, the optimization problem is convex. Along with the optimization problem, we provide verifiable conditions in terms of the density of the data, smoothness of the system model and state estimator, and the size of the error bounds that guarantee validity of the obtained ROCBF. Towards obtaining a practical control algorithm, we propose an algorithmic implementation of our theoretical framework that accounts for assumptions made in our framework in practice. We validate our algorithm in the autonomous driving simulator CARLA and demonstrate how to learn safe control laws from simulated RGB camera images.https://ieeexplore.ieee.org/document/10491341/Control barrier functionsdata-driven robust controloutput feedback control
spellingShingle Lars Lindemann
Alexander Robey
Lejun Jiang
Satyajeet Das
Stephen Tu
Nikolai Matni
Learning Robust Output Control Barrier Functions From Safe Expert Demonstrations
IEEE Open Journal of Control Systems
Control barrier functions
data-driven robust control
output feedback control
title Learning Robust Output Control Barrier Functions From Safe Expert Demonstrations
title_full Learning Robust Output Control Barrier Functions From Safe Expert Demonstrations
title_fullStr Learning Robust Output Control Barrier Functions From Safe Expert Demonstrations
title_full_unstemmed Learning Robust Output Control Barrier Functions From Safe Expert Demonstrations
title_short Learning Robust Output Control Barrier Functions From Safe Expert Demonstrations
title_sort learning robust output control barrier functions from safe expert demonstrations
topic Control barrier functions
data-driven robust control
output feedback control
url https://ieeexplore.ieee.org/document/10491341/
work_keys_str_mv AT larslindemann learningrobustoutputcontrolbarrierfunctionsfromsafeexpertdemonstrations
AT alexanderrobey learningrobustoutputcontrolbarrierfunctionsfromsafeexpertdemonstrations
AT lejunjiang learningrobustoutputcontrolbarrierfunctionsfromsafeexpertdemonstrations
AT satyajeetdas learningrobustoutputcontrolbarrierfunctionsfromsafeexpertdemonstrations
AT stephentu learningrobustoutputcontrolbarrierfunctionsfromsafeexpertdemonstrations
AT nikolaimatni learningrobustoutputcontrolbarrierfunctionsfromsafeexpertdemonstrations