Encoding formulas as deep networks: Reinforcement learning for zero-shot execution of LTL formulas

Kuo, Yen-Ling; Katz, Boris; Barbu, Andrei

dc.contributor.author	Kuo, Yen-Ling
dc.contributor.author	Katz, Boris
dc.contributor.author	Barbu, Andrei
dc.date.accessioned	2022-03-24T16:59:34Z
dc.date.available	2022-03-24T16:59:34Z
dc.date.issued	2020-10-25
dc.identifier.uri	https://hdl.handle.net/1721.1/141355
dc.description.abstract	We demonstrate a reinforcement learning agent which uses a compositional recurrent neural network that takes as input an LTL formula and determines satisfying actions. The input LTL formulas have never been seen before, yet the network performs zero-shot generalization to satisfy them. This is a novel form of multi-task learning for RL agents where agents learn from one diverse set of tasks and generalize to a new set of diverse tasks. The formulation of the network enables this capacity to generalize. We demonstrate this ability in two domains. In a symbolic domain, the agent finds a sequence of letters that is accepted. In a Minecraft-like environment, the agent finds a sequence of actions that conform to the formula. While prior work could learn to execute one formula reliably given examples of that formula, we demonstrate how to encode all formulas reliably. This could form the basis of new multi- task agents that discover sub-tasks and execute them without any additional training, as well as the agents which follow more complex linguistic commands. The structures required for this generalization are specific to LTL formulas, which opens up an interesting theoretical question: what structures are required in neural networks for zero-shot generalization to different logics?	en_US
dc.description.sponsorship	This material is based upon work supported by the Center for Brains, Minds and Machines (CBMM), funded by NSF STC award CCF-1231216.	en_US
dc.publisher	Center for Brains, Minds and Machines (CBMM), The Ninth International Conference on Learning Representations (ICLR)	en_US
dc.relation.ispartofseries	CBMM Memo;125
dc.title	Encoding formulas as deep networks: Reinforcement learning for zero-shot execution of LTL formulas	en_US
dc.type	Article	en_US
dc.type	Technical Report	en_US
dc.type	Working Paper	en_US

Files in this item

Name:: CBMM-Memo-125.pdf
Size:: 2.124Mb
Format:: PDF

View/Open

This item appears in the following Collection(s)

CBMM Memo Series

Show simple item record