| Main Authors: | Park, Jaehyun, Kim, Yunho, Kim, Sejin, Lee, Byung-Jun, Kim, Sundong |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Online Access: | https://arxiv.org/abs/2410.11338 |
Similar Items
Diffusion-Based Offline RL for Improved Decision-Making in Augmented ARC Task
by: Kim, Yunho, et al.
Published: (2024)
ARCLE: The Abstraction and Reasoning Corpus Learning Environment for Reinforcement Learning
by: Lee, Hosung, et al.
Published: (2024)
Learning-augmented robotic automation for real-world manufacturing
by: Kim, Yunho, et al.
Published: (2026)
System 2 Reasoning for Human-AI Alignment: Generality and Adaptivity via ARC-AGI
by: Kim, Sejin, et al.
Published: (2024)
TRACED: Transition-aware Regret Approximation with Co-learnability for Environment Design
by: Cho, Geonwoo, et al.
Published: (2025)
Trust Region Q Adjoint Matching
by: Dong, Yonghoon, et al.
Published: (2026)
Human Implicit Preference-Based Policy Fine-tuning for Multi-Agent Reinforcement Learning in USV Swarm
by: Kim, Hyeonjun, et al.
Published: (2025)
Not Only Rewards But Also Constraints: Applications on Legged Robot Locomotion
by: Kim, Yunho, et al.
Published: (2023)
Addressing and Visualizing Misalignments in Human Task-Solving Trajectories
by: Kim, Sejin, et al.
Published: (2024)
RoVerFly: Robust and Versatile Implicit Hybrid Control of Quadrotor-Payload Systems
by: Kim, Mintae, et al.
Published: (2025)
Learning to Transfer Human Hand Skills for Robot Manipulations
by: Park, Sungjae, et al.
Published: (2025)
DEAS: DEtached value learning with Action Sequence for Scalable Offline RL
by: Kim, Changyeon, et al.
Published: (2025)
RLDX-1 Technical Report
by: Kim, Dongyoung, et al.
Published: (2026)
AMPED: Adaptive Multi-objective Projection for balancing Exploration and skill Diversification
by: Cho, Geonwoo, et al.
Published: (2025)
Navigation with QPHIL: Quantizing Planner for Hierarchical Implicit Q-Learning
by: Canesse, Alexi, et al.
Published: (2024)
Causal-Paced Deep Reinforcement Learning
by: Cho, Geonwoo, et al.
Published: (2025)
Attention-Based Neural-Augmented Kalman Filter for Legged Robot State Estimation
by: Lee, Seokju, et al.
Published: (2026)
Robust Policy Learning via Offline Skill Diffusion
by: Kim, Woo Kyung, et al.
Published: (2024)
Enhancing Analogical Reasoning in the Abstraction and Reasoning Corpus via Model-Based RL
by: Lee, Jihwan, et al.
Published: (2024)
In-Context Policy Adaptation via Cross-Domain Skill Diffusion
by: Yoo, Minjong, et al.
Published: (2025)
Mitigating Suboptimality of Deterministic Policy Gradients in Complex Q-functions
by: Jain, Ayush, et al.
Published: (2024)
Q-learning with Adjoint Matching
by: Li, Qiyang, et al.
Published: (2026)
Graph-Assisted Stitching for Offline Hierarchical Reinforcement Learning
by: Baek, Seungho, et al.
Published: (2025)
Verifier-free Test-Time Sampling for Vision Language Action Models
by: Jang, Suhyeok, et al.
Published: (2025)
Compositional Conservatism: A Transductive Approach in Offline Reinforcement Learning
by: Song, Yeda, et al.
Published: (2024)
Decoupled Q-Chunking
by: Li, Qiyang, et al.
Published: (2025)
Let Multimodal Embedders Learn When to Augment Query via Adaptive Query Augmentation
by: Kim, Wongyu, et al.
Published: (2025)
Multidimensional Adaptive Coefficient for Inference Trajectory Optimization in Flow and Diffusion
by: Lee, Dohoon, et al.
Published: (2024)
ARCTraj: A Dataset and Benchmark of Human Reasoning Trajectories for Abstract Problem Solving
by: Kim, Sejin, et al.
Published: (2025)
Implicit Contact Diffuser: Sequential Contact Reasoning with Latent Point Cloud Diffusion
by: Huang, Zixuan, et al.
Published: (2024)
ImplicitRDP: An End-to-End Visual-Force Diffusion Policy with Structural Slow-Fast Learning
by: Chen, Wendi, et al.
Published: (2025)
Belief Aided Navigation using Bayesian Reinforcement Learning for Avoiding Humans in Blind Spots
by: Kim, Jinyeob, et al.
Published: (2024)
Causal Disentanglement Learning for Accurate Anomaly Detection in Multivariate Time Series
by: Kim, Wonah, et al.
Published: (2025)
Meta-Controller: Few-Shot Imitation of Unseen Embodiments and Tasks in Continuous Control
by: Cho, Seongwoong, et al.
Published: (2024)
EDMP: Ensemble-of-costs-guided Diffusion for Motion Planning
by: Saha, Kallol, et al.
Published: (2023)
Beyond Needle(s) in the Embodied Haystack: Environment, Architecture, and Training Considerations for Long Context Reasoning
by: Kim, Bosung, et al.
Published: (2025)
WOMBET: World Model-based Experience Transfer for Robust and Sample-efficient Reinforcement Learning
by: Kim, Mintae, et al.
Published: (2026)
REFINE-DP: Diffusion Policy Fine-tuning for Humanoid Loco-manipulation via Reinforcement Learning
by: Gu, Zhaoyuan, et al.
Published: (2026)
WarmPrior: Straightening Flow-Matching Policies with Temporal Priors
by: Kang, Sinjae, et al.
Published: (2026)
Denoising Heat-inspired Diffusion with Insulators for Collision Free Motion Planning
by: Chang, Junwoo, et al.
Published: (2023)