Saved in:
| Main Authors: | Volovikova, Zoya, Gorbov, Gregory, Kuderov, Petr, Panov, Aleksandr I., Skrynnik, Alexey |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.11962 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Instruction Following with Goal-Conditioned Reinforcement Learning in Virtual Environments
by: Volovikova, Zoya, et al.
Published: (2024)
by: Volovikova, Zoya, et al.
Published: (2024)
Self-Guided Plan Extraction for Instruction-Following Tasks with Goal-Conditional Reinforcement Learning
by: Volovikova, Zoya, et al.
Published: (2026)
by: Volovikova, Zoya, et al.
Published: (2026)
Safe Planning and Policy Optimization via World Model Learning
by: Latyshev, Artem, et al.
Published: (2025)
by: Latyshev, Artem, et al.
Published: (2025)
Learning Successor Features with Distributed Hebbian Temporal Memory
by: Dzhivelikian, Evgenii, et al.
Published: (2023)
by: Dzhivelikian, Evgenii, et al.
Published: (2023)
AmbiK: Dataset of Ambiguous Tasks in Kitchen Environment
by: Ivanova, Anastasiia, et al.
Published: (2025)
by: Ivanova, Anastasiia, et al.
Published: (2025)
CAMAR: Continuous Actions Multi-Agent Routing
by: Pshenitsyn, Artem, et al.
Published: (2025)
by: Pshenitsyn, Artem, et al.
Published: (2025)
Advancing Learnable Multi-Agent Pathfinding Solvers with Active Fine-Tuning
by: Andreychuk, Anton, et al.
Published: (2025)
by: Andreychuk, Anton, et al.
Published: (2025)
Revisiting Tree Search for LLMs: Gumbel and Sequential Halving for Budget-Scalable Reasoning
by: Ugadiarov, Leonid, et al.
Published: (2026)
by: Ugadiarov, Leonid, et al.
Published: (2026)
MAPF-GPT: Imitation Learning for Multi-Agent Pathfinding at Scale
by: Andreychuk, Anton, et al.
Published: (2024)
by: Andreychuk, Anton, et al.
Published: (2024)
POGEMA: A Benchmark Platform for Cooperative Multi-Agent Pathfinding
by: Skrynnik, Alexey, et al.
Published: (2024)
by: Skrynnik, Alexey, et al.
Published: (2024)
LookPlanGraph: Embodied Instruction Following Method with VLM Graph Augmentation
by: Onishchenko, Anatoly O., et al.
Published: (2025)
by: Onishchenko, Anatoly O., et al.
Published: (2025)
Memory, Benchmark & Robots: A Benchmark for Solving Complex Tasks with Reinforcement Learning
by: Cherepanov, Egor, et al.
Published: (2025)
by: Cherepanov, Egor, et al.
Published: (2025)
MARL-GPT: Foundation Model for Multi-Agent Reinforcement Learning
by: Nesterova, Maria, et al.
Published: (2026)
by: Nesterova, Maria, et al.
Published: (2026)
Learning to Communicate Locally for Large-Scale Multi-Agent Pathfinding
by: Vyaltsev, Valeriy, et al.
Published: (2026)
by: Vyaltsev, Valeriy, et al.
Published: (2026)
VerifyLLM: LLM-Based Pre-Execution Task Plan Verification for Robots
by: Grigorev, Danil S., et al.
Published: (2025)
by: Grigorev, Danil S., et al.
Published: (2025)
Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation
by: Cherepanov, Egor, et al.
Published: (2024)
by: Cherepanov, Egor, et al.
Published: (2024)
Recurrent Action Transformer with Memory
by: Cherepanov, Egor, et al.
Published: (2023)
by: Cherepanov, Egor, et al.
Published: (2023)
ELMUR: External Layer Memory with Update/Rewrite for Long-Horizon RL Problems
by: Cherepanov, Egor, et al.
Published: (2025)
by: Cherepanov, Egor, et al.
Published: (2025)
Safe Policy Exploration Improvement via Subgoals
by: Angulo, Brian, et al.
Published: (2024)
by: Angulo, Brian, et al.
Published: (2024)
PuzzleWorld: A Benchmark for Multimodal, Open-Ended Reasoning in Puzzlehunts
by: Li, Hengzhi, et al.
Published: (2025)
by: Li, Hengzhi, et al.
Published: (2025)
Symbolic Disentangled Representations for Images
by: Korchemnyi, Alexandr, et al.
Published: (2024)
by: Korchemnyi, Alexandr, et al.
Published: (2024)
Memory Retention Is Not Enough to Master Memory Tasks in Reinforcement Learning
by: Shchendrigin, Oleg, et al.
Published: (2026)
by: Shchendrigin, Oleg, et al.
Published: (2026)
Re:Frame -- Retrieving Experience From Associative Memory
by: Zelezetsky, Daniil, et al.
Published: (2025)
by: Zelezetsky, Daniil, et al.
Published: (2025)
Spatial Traces: Enhancing VLA Models with Spatial-Temporal Understanding
by: Patratskiy, Maxim A., et al.
Published: (2025)
by: Patratskiy, Maxim A., et al.
Published: (2025)
Object-Centric World Models Meet Monte Carlo Tree Search
by: Vakhitov, Rodion, et al.
Published: (2026)
by: Vakhitov, Rodion, et al.
Published: (2026)
Hi Robot: Open-Ended Instruction Following with Hierarchical Vision-Language-Action Models
by: Shi, Lucy Xiaoyang, et al.
Published: (2025)
by: Shi, Lucy Xiaoyang, et al.
Published: (2025)
KAGE-Bench: Fast Known-Axis Visual Generalization Evaluation for Reinforcement Learning
by: Cherepanov, Egor, et al.
Published: (2026)
by: Cherepanov, Egor, et al.
Published: (2026)
Object-Centric Learning with Slot Mixture Module
by: Kirilenko, Daniil, et al.
Published: (2023)
by: Kirilenko, Daniil, et al.
Published: (2023)
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
by: Xie, Tianbao, et al.
Published: (2024)
by: Xie, Tianbao, et al.
Published: (2024)
A New Perspective on Transformers in Online Reinforcement Learning for Continuous Control
by: Kachaev, Nikita, et al.
Published: (2025)
by: Kachaev, Nikita, et al.
Published: (2025)
Don't Blind Your VLA: Aligning Visual Representations for OOD Generalization
by: Kachaev, Nikita, et al.
Published: (2025)
by: Kachaev, Nikita, et al.
Published: (2025)
Relational Object-Centric Actor-Critic
by: Ugadiarov, Leonid, et al.
Published: (2023)
by: Ugadiarov, Leonid, et al.
Published: (2023)
MATEval: A Multi-Agent Discussion Framework for Advancing Open-Ended Text Evaluation
by: Li, Yu, et al.
Published: (2024)
by: Li, Yu, et al.
Published: (2024)
Benchmarking Complex Instruction-Following with Multiple Constraints Composition
by: Wen, Bosi, et al.
Published: (2024)
by: Wen, Bosi, et al.
Published: (2024)
A Mixture-of-Experts Approach to Few-Shot Task Transfer in Open-Ended Text Worlds
by: Cui, Christopher Z., et al.
Published: (2024)
by: Cui, Christopher Z., et al.
Published: (2024)
CoRL-MPPI: Enhancing MPPI With Learnable Behaviours For Efficient And Provably-Safe Multi-Robot Collision Avoidance
by: Dergachev, Stepan, et al.
Published: (2025)
by: Dergachev, Stepan, et al.
Published: (2025)
On Creativity and Open-Endedness
by: Soros, L. B., et al.
Published: (2024)
by: Soros, L. B., et al.
Published: (2024)
StarDojo: Benchmarking Open-Ended Behaviors of Agentic Multimodal LLMs in Production-Living Simulations with Stardew Valley
by: Tan, Weihao, et al.
Published: (2025)
by: Tan, Weihao, et al.
Published: (2025)
Dreaming in Code for Curriculum Learning in Open-Ended Worlds
by: Mitsides, Konstantinos, et al.
Published: (2026)
by: Mitsides, Konstantinos, et al.
Published: (2026)
HELP: Hierarchical Embodied Language Planner for Household Tasks
by: Korchemnyi, Alexandr V., et al.
Published: (2025)
by: Korchemnyi, Alexandr V., et al.
Published: (2025)
Similar Items
-
Instruction Following with Goal-Conditioned Reinforcement Learning in Virtual Environments
by: Volovikova, Zoya, et al.
Published: (2024) -
Self-Guided Plan Extraction for Instruction-Following Tasks with Goal-Conditional Reinforcement Learning
by: Volovikova, Zoya, et al.
Published: (2026) -
Safe Planning and Policy Optimization via World Model Learning
by: Latyshev, Artem, et al.
Published: (2025) -
Learning Successor Features with Distributed Hebbian Temporal Memory
by: Dzhivelikian, Evgenii, et al.
Published: (2023) -
AmbiK: Dataset of Ambiguous Tasks in Kitchen Environment
by: Ivanova, Anastasiia, et al.
Published: (2025)