Saved in:
| Main Authors: | Ferraro, Stefano, Nakano, Akihiro, Suzuki, Masahiro, Matsuo, Yutaka |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2511.06136 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
SSM Meets Video Diffusion Models: Efficient Long-Term Video Generation with Structured State Spaces
by: Oshima, Yuta, et al.
Published: (2024)
by: Oshima, Yuta, et al.
Published: (2024)
The Embodied World Model Based on LLM with Visual Information and Prediction-Oriented Prompts
by: Haijima, Wakana, et al.
Published: (2024)
by: Haijima, Wakana, et al.
Published: (2024)
Enhancing Unimodal Latent Representations in Multimodal VAEs through Iterative Amortized Inference
by: Oshima, Yuta, et al.
Published: (2024)
by: Oshima, Yuta, et al.
Published: (2024)
Does "Do Differentiable Simulators Give Better Policy Gradients?'' Give Better Policy Gradients?
by: Onoda, Ku, et al.
Published: (2026)
by: Onoda, Ku, et al.
Published: (2026)
Emergence of Exploration in Policy Gradient Reinforcement Learning via Retrying
by: Nishimori, Soichiro, et al.
Published: (2026)
by: Nishimori, Soichiro, et al.
Published: (2026)
Object-Centric World Models Meet Monte Carlo Tree Search
by: Vakhitov, Rodion, et al.
Published: (2026)
by: Vakhitov, Rodion, et al.
Published: (2026)
Representing Positional Information in Generative World Models for Object Manipulation
by: Ferraro, Stefano, et al.
Published: (2024)
by: Ferraro, Stefano, et al.
Published: (2024)
From Pixels to Policies: Reinforcing Spatial Reasoning in Language Models for Content-Aware Layout Design
by: Li, Sha, et al.
Published: (2026)
by: Li, Sha, et al.
Published: (2026)
GenDOM: Generalizable One-shot Deformable Object Manipulation with Parameter-Aware Policy
by: Kuroki, So, et al.
Published: (2023)
by: Kuroki, So, et al.
Published: (2023)
Object-Centric Temporal Consistency via Conditional Autoregressive Inductive Biases
by: Meo, Cristian, et al.
Published: (2024)
by: Meo, Cristian, et al.
Published: (2024)
Out-of-Distribution Recovery with Object-Centric Keypoint Inverse Policy for Visuomotor Imitation Learning
by: Gao, George Jiayuan, et al.
Published: (2024)
by: Gao, George Jiayuan, et al.
Published: (2024)
Double Horizon Model-Based Policy Optimization
by: Kubo, Akihiro, et al.
Published: (2025)
by: Kubo, Akihiro, et al.
Published: (2025)
Object-Centric Representations Improve Policy Generalization in Robot Manipulation
by: Chapin, Alexandre, et al.
Published: (2025)
by: Chapin, Alexandre, et al.
Published: (2025)
Object-Centric World Models for Causality-Aware Reinforcement Learning
by: Nishimoto, Yosuke, et al.
Published: (2025)
by: Nishimoto, Yosuke, et al.
Published: (2025)
SOLD: Slot Object-Centric Latent Dynamics Models for Relational Manipulation Learning from Pixels
by: Mosbach, Malte, et al.
Published: (2024)
by: Mosbach, Malte, et al.
Published: (2024)
Enhancing Policy Learning with World-Action Model
by: Han, Yuci, et al.
Published: (2026)
by: Han, Yuci, et al.
Published: (2026)
When is Off-Policy Evaluation (Reward Modeling) Useful in Contextual Bandits? A Data-Centric Perspective
by: Sun, Hao, et al.
Published: (2023)
by: Sun, Hao, et al.
Published: (2023)
Efficient Exploration and Discriminative World Model Learning with an Object-Centric Abstraction
by: GX-Chen, Anthony, et al.
Published: (2024)
by: GX-Chen, Anthony, et al.
Published: (2024)
Unmasking On-Policy Distillation: Where It Helps, Where It Hurts, and Why
by: Armandpour, Mohammadreza, et al.
Published: (2026)
by: Armandpour, Mohammadreza, et al.
Published: (2026)
SimToolReal: An Object-Centric Policy for Zero-Shot Dexterous Tool Manipulation
by: Kedia, Kushal, et al.
Published: (2026)
by: Kedia, Kushal, et al.
Published: (2026)
AdaWorldPolicy: World-Model-Driven Diffusion Policy with Online Adaptive Learning for Robotic Manipulation
by: Yuan, Ge, et al.
Published: (2026)
by: Yuan, Ge, et al.
Published: (2026)
Zipping the Thought: When and How Compressed Reasoning Data Works in LLM Post-Training
by: Matsutani, Kohsei, et al.
Published: (2026)
by: Matsutani, Kohsei, et al.
Published: (2026)
Composing Pre-Trained Object-Centric Representations for Robotics From "What" and "Where" Foundation Models
by: Shi, Junyao, et al.
Published: (2024)
by: Shi, Junyao, et al.
Published: (2024)
Bridging Reasoning Trajectories in On-Policy Distillation via Near-Future Guidance
by: Jiang, Yuxuan, et al.
Published: (2026)
by: Jiang, Yuxuan, et al.
Published: (2026)
Adaptive Experimental Design for Policy Learning
by: Kato, Masahiro, et al.
Published: (2024)
by: Kato, Masahiro, et al.
Published: (2024)
Safe Planning and Policy Optimization via World Model Learning
by: Latyshev, Artem, et al.
Published: (2025)
by: Latyshev, Artem, et al.
Published: (2025)
Language Models Do Hard Arithmetic Tasks Easily and Hardly Do Easy Arithmetic Tasks
by: Gambardella, Andrew, et al.
Published: (2024)
by: Gambardella, Andrew, et al.
Published: (2024)
GenORM: Generalizable One-shot Rope Manipulation with Parameter-Aware Policy
by: Kuroki, So, et al.
Published: (2023)
by: Kuroki, So, et al.
Published: (2023)
WorldGym: World Model as An Environment for Policy Evaluation
by: Quevedo, Julian, et al.
Published: (2025)
by: Quevedo, Julian, et al.
Published: (2025)
Object-Centric World Model for Language-Guided Manipulation
by: Jeong, Youngjoon, et al.
Published: (2025)
by: Jeong, Youngjoon, et al.
Published: (2025)
PWM: Policy Learning with Multi-Task World Models
by: Georgiev, Ignat, et al.
Published: (2024)
by: Georgiev, Ignat, et al.
Published: (2024)
Off-Policy Actor-Critic for Adversarial Observation Robustness: Virtual Alternative Training via Symmetric Policy Evaluation
by: Nakanishi, Kosuke, et al.
Published: (2025)
by: Nakanishi, Kosuke, et al.
Published: (2025)
Data-Centric AI Governance: Addressing the Limitations of Model-Focused Policies
by: Gupta, Ritwik, et al.
Published: (2024)
by: Gupta, Ritwik, et al.
Published: (2024)
Act2Goal: From World Model To General Goal-conditioned Policy
by: Zhou, Pengfei, et al.
Published: (2025)
by: Zhou, Pengfei, et al.
Published: (2025)
Learning General Policies From Examples
by: Bonet, Blai, et al.
Published: (2025)
by: Bonet, Blai, et al.
Published: (2025)
Learning Efficiency Meets Symmetry Breaking
by: Bai, Yingbin, et al.
Published: (2025)
by: Bai, Yingbin, et al.
Published: (2025)
Social World Model-Augmented Mechanism Design Policy Learning
by: Zhang, Xiaoyuan, et al.
Published: (2025)
by: Zhang, Xiaoyuan, et al.
Published: (2025)
Large Language Models as Theory of Mind Aware Generative Agents with Counterfactual Reflection
by: Yang, Bo, et al.
Published: (2025)
by: Yang, Bo, et al.
Published: (2025)
MeetBench-XL: Calibrated Multi-Dimensional Evaluation and Learned Dual-Policy Agents for Real-Time Meetings
by: Hu, Yuelin, et al.
Published: (2026)
by: Hu, Yuelin, et al.
Published: (2026)
Generative Emergent Communication: Large Language Model is a Collective World Model
by: Taniguchi, Tadahiro, et al.
Published: (2024)
by: Taniguchi, Tadahiro, et al.
Published: (2024)
Similar Items
-
SSM Meets Video Diffusion Models: Efficient Long-Term Video Generation with Structured State Spaces
by: Oshima, Yuta, et al.
Published: (2024) -
The Embodied World Model Based on LLM with Visual Information and Prediction-Oriented Prompts
by: Haijima, Wakana, et al.
Published: (2024) -
Enhancing Unimodal Latent Representations in Multimodal VAEs through Iterative Amortized Inference
by: Oshima, Yuta, et al.
Published: (2024) -
Does "Do Differentiable Simulators Give Better Policy Gradients?'' Give Better Policy Gradients?
by: Onoda, Ku, et al.
Published: (2026) -
Emergence of Exploration in Policy Gradient Reinforcement Learning via Retrying
by: Nishimori, Soichiro, et al.
Published: (2026)