Safe Policy Optimization with Local Generalized Linear Function Approximations. Akifumi Wachi, Yunyue Wei, et al. 2021. NeurIPS 2021.
A Text-based Safety Benchmark for Reinforcement Learning Problems. Ngoc Lan Hoang, Nicolas Galichet, et al. 2022. NeurIPS 2022.
Neuro-Symbolic Reinforcement Learning with First-Order Logic. Daiki Kimura, Masaki Ono, et al. 2021. EMNLP 2021.
Q-learning with Language Model for Edit-based Unsupervised Summarization. Ryosuke Kohita, Akifumi Wachi, et al. 2020. EMNLP 2020.
Reinforcement Learning with External Knowledge by using Logical Neural Networks. Daiki Kimura, Subhajit Chaudhury, et al. 2021. IJCAI 2020.