Pure-Past Action Masking

Varricchione, Giovanni; Alechina, Natasha; Dastani, Mehdi; De Giacomo, Giuseppe; Logan, Brian; Perelli, Giuseppe

doi:https://doi.org/10.1609/aaai.v38i19.30163

Pure-Past Action Masking

Files

30163-Article_Text-34217-1-2-20240324.pdf (294.49 KB)

Publication date

2024-03-24

Authors

Varricchione, Giovanni

Alechina, Natasha

Dastani, Mehdi

De Giacomo, Giuseppe

Logan, Brian

Perelli, Giuseppe

DOI

https://doi.org/10.1609/aaai.v38i19.30163

Document Type

/dk/atira/pure/researchoutput/researchoutputtypes/contributiontojournal/conferencearticle

Metadata

Show full item record

Collections

Utrecht University Repository

License

taverne

Abstract

We present Pure-Past Action Masking (PPAM), a lightweight approach to action masking for safe reinforcement learning. In PPAM, actions are disallowed (“masked”) according to specifications expressed in Pure-Past Linear Temporal Logic (PPLTL). PPAM can enforce non-Markovian constraints, i.e., constraints based on the history of the system, rather than just the current state of the (possibly hidden) MDP. The features used in the safety constraint need not be the same as those used by the learning agent, allowing a clear separation of concerns between the safety constraints and reward specifications of the (learning) agent. We prove formally that an agent trained with PPAM can learn any optimal policy that satisfies the safety constraints, and that they are as expressive as shields, another approach to enforce non-Markovian constraints in RL. Finally, we provide empirical results showing how PPAM can guarantee constraint satisfaction in practice.

Keywords

Taverne, Artificial Intelligence

Citation

Varricchione, G, Alechina, N, Dastani, M, De Giacomo, G, Logan, B & Perelli, G 2024, 'Pure-Past Action Masking', Proceedings of the AAAI Conference on Artificial Intelligence, vol. 38, no. 19, pp. 21646-21655. https://doi.org/10.1609/aaai.v38i19.30163

URI

https://dspace.library.uu.nl/handle/1874/452240

Pure-Past Action Masking

Files

Publication date

Authors

Editors

Advisors

Supervisors

DOI

Document Type

Metadata

Collections

License

Abstract

Keywords

Citation

URI