| Main Authors: | Jin, Emily, Huang, Zhuoyi, Fränken, Jan-Philipp, Liu, Weiyu, Cha, Hannah, Brockbank, Erik, Wu, Sarah, Zhang, Ruohan, Wu, Jiajun, Gerstenberg, Tobias |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2410.01926 |
Similar Items
STaR-GATE: Teaching Language Models to Ask Clarifying Questions
by: Andukuri, Chinmaya, et al.
Published: (2024)
Spot The Ball: A Benchmark for Visual Social Inference
by: Balamurugan, Neha, et al.
Published: (2025)
Understanding Human Limits in Pattern Recognition: A Computational Model of Sequential Reasoning in Rock, Paper, Scissors
by: Cross, Logan, et al.
Published: (2025)
Self-Supervised Alignment with Mutual Information: Learning to Follow Principles without Preference Labels
by: Fränken, Jan-Philipp, et al.
Published: (2024)
Procedural Dilemma Generation for Evaluating Moral Reasoning in Humans and Language Models
by: Fränken, Jan-Philipp, et al.
Published: (2024)
Why Someone Asked "Why": Foil Inference in Human and LLM Question Interpretation
by: Besch, Britt, et al.
Published: (2026)
Learning Compositional Behaviors from Demonstration and Language
by: Liu, Weiyu, et al.
Published: (2025)
Naturally Supervised 3D Visual Grounding with Language-Regularized Concept Learners
by: Feng, Chun, et al.
Published: (2024)
Causal-PIK: Causality-based Physical Reasoning with a Physics-Informed Kernel
by: Parés-Morlans, Carlota, et al.
Published: (2025)
CRAFT: Designing Creative and Functional 3D Objects
by: Guo, Michelle, et al.
Published: (2024)
Predicate Hierarchies Improve Few-Shot State Classification
by: Jin, Emily, et al.
Published: (2025)
Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making
by: Li, Manling, et al.
Published: (2024)
Post Reinforcement Learning Inference
by: Syrgkanis, Vasilis, et al.
Published: (2023)
Human-like Affective Cognition in Foundation Models
by: Gandhi, Kanishk, et al.
Published: (2024)
TRANSIC: Sim-to-Real Policy Transfer by Learning from Online Correction
by: Jiang, Yunfan, et al.
Published: (2024)
Composable Part-Based Manipulation
by: Liu, Weiyu, et al.
Published: (2024)
LongCoT: Benchmarking Long-Horizon Chain-of-Thought Reasoning
by: Motwani, Sumeet Ramesh, et al.
Published: (2026)
Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL
by: Wu, Ian, et al.
Published: (2026)
Multi-Omics Analysis for Cancer Subtype Inference via Unrolling Graph Smoothness Priors
by: Lu, Jielong, et al.
Published: (2025)
MoMaGen: Generating Demonstrations under Soft and Hard Constraints for Multi-Step Bimanual Mobile Manipulation
by: Li, Chengshu, et al.
Published: (2025)
Hearing Anything Anywhere
by: Wang, Mason, et al.
Published: (2024)
Deep Active Inference Agents for Delayed and Long-Horizon Environments
by: Yeganeh, Yavar Taheri, et al.
Published: (2025)
Learning-Guided Rolling Horizon Optimization for Long-Horizon Flexible Job-Shop Scheduling
by: Li, Sirui, et al.
Published: (2025)
TRIP-Bench: A Benchmark for Long-Horizon Interactive Agents in Real-World Scenarios
by: Shen, Yuanzhe, et al.
Published: (2026)
DiffSound: Differentiable Modal Sound Rendering and Inverse Rendering for Diverse Inference Tasks
by: Jin, Xutong, et al.
Published: (2024)
Odysseys: Benchmarking Web Agents on Realistic Long Horizon Tasks
by: Jang, Lawrence Keunho, et al.
Published: (2026)
On Policy Evaluation Algorithms in Distributional Reinforcement Learning
by: Gerstenberg, Julian, et al.
Published: (2024)
HoTPP Benchmark: Are We Good at the Long Horizon Events Forecasting?
by: Karpukhin, Ivan, et al.
Published: (2024)
Effective Explanations Support Planning Under Uncertainty
by: Zhou, Hanqi, et al.
Published: (2026)
Predicting Outcomes in Video Games with Long Short Term Memory Networks
by: Chulajata, Kittimate, et al.
Published: (2024)
A Communication-First Account of Explanation
by: Harding, Jacqueline, et al.
Published: (2025)
Reinforcement Learning for Long-Horizon Interactive LLM Agents
by: Chen, Kevin, et al.
Published: (2025)
Learning to Ball: Composing Policies for Long-Horizon Basketball Moves
by: Xu, Pei, et al.
Published: (2025)
Horizon causality from holographic scattering in asymptotically dS$_3$
by: Franken, Victor, et al.
Published: (2024)
Demystifying Reinforcement Learning for Long-Horizon Tool-Using Agents: A Comprehensive Recipe
by: Wu, Xixi, et al.
Published: (2026)
λ: A Benchmark for Data-Efficiency in Long-Horizon Indoor Mobile Manipulation Robotics
by: Jaafar, Ahmed, et al.
Published: (2024)
$\boldsymbol{f}$-OPD: Stabilizing Long-Horizon On-Policy Distillation with Freshness-Aware Control
by: Chen, Xianwei, et al.
Published: (2026)
Probe and Skip: Self-Predictive Token Skipping for Efficient Long-Context LLM Inference
by: Wu, Zimeng, et al.
Published: (2026)
GR-RL: Going Dexterous and Precise for Long-Horizon Robotic Manipulation
by: Li, Yunfei, et al.
Published: (2025)
The Best of Both Worlds: Hybridizing Neural Operators and Solvers for Stable Long-Horizon Inference
by: Roy, Rajyasri, et al.
Published: (2025)