From ebacd01df64a4c8a7aa93a0f14af34704a649705 Mon Sep 17 00:00:00 2001
From: Gabriele
Date: Tue, 3 Jun 2025 16:24:00 +0200
Subject: [PATCH 1/2] Add published papers

---
 README.md | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/README.md b/README.md
index 89d65af..2fd1351 100644
--- a/README.md
+++ b/README.md
@@ -50,6 +50,8 @@ We thank [ngoodger](https://github.com/ngoodger) for implementing the [NLE Langu
 
 # Papers using MiniHack
 
+- De Smet et al. [Relational Neurosymbolic Markov Models](https://ojs.aaai.org/index.php/AAAI/article/view/33777) (KU Leuven, AAAI 2025)
+- Debot et al. [Neurosymbolic Reinforcement Learning: Playing MiniHack with Probabilistic Logic Shields] (KU Leuven, Demo Tack AAAI 2025)
 - Paglieri et al. [BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games](https://arxiv.org/abs/2411.13543) (UCL, Oxford, NYU, ICLR 2025)
 - Gumbsch et al. [Learning Hierarchical World Models with Adaptive Temporal Abstractions from Discrete Latent Dynamics](https://openreview.net/forum?id=TjCDNssXKU) (Uni Tübingen, MPI-IS, UCL, ICLR 2024)
 - Raparthy et al. [Generalization to New Sequential Decision Making Tasks with In-Context Learning](https://arxiv.org/abs/2312.03801) (Meta AI, UCL, ICML 2024)

From baaed1f3943d73b5d3e508f17346f2ea80bf93c6 Mon Sep 17 00:00:00 2001
From: Gabriele
Date: Tue, 3 Jun 2025 16:28:02 +0200
Subject: [PATCH 2/2] Add link to paper I forgot

---
 README.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README.md b/README.md
index 2fd1351..a9ac730 100644
--- a/README.md
+++ b/README.md
@@ -51,7 +51,7 @@ We thank [ngoodger](https://github.com/ngoodger) for implementing the [NLE Langu
 # Papers using MiniHack
 
 - De Smet et al. [Relational Neurosymbolic Markov Models](https://ojs.aaai.org/index.php/AAAI/article/view/33777) (KU Leuven, AAAI 2025)
-- Debot et al. [Neurosymbolic Reinforcement Learning: Playing MiniHack with Probabilistic Logic Shields] (KU Leuven, Demo Tack AAAI 2025)
+- Debot et al. [Neurosymbolic Reinforcement Learning: Playing MiniHack with Probabilistic Logic Shields](https://ojs.aaai.org/index.php/AAAI/article/view/35349) (KU Leuven, Demo Tack AAAI 2025)
 - Paglieri et al. [BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games](https://arxiv.org/abs/2411.13543) (UCL, Oxford, NYU, ICLR 2025)
 - Gumbsch et al. [Learning Hierarchical World Models with Adaptive Temporal Abstractions from Discrete Latent Dynamics](https://openreview.net/forum?id=TjCDNssXKU) (Uni Tübingen, MPI-IS, UCL, ICLR 2024)
 - Raparthy et al. [Generalization to New Sequential Decision Making Tasks with In-Context Learning](https://arxiv.org/abs/2312.03801) (Meta AI, UCL, ICML 2024)