:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Jin, Emily, Huang, Zhuoyi, Fränken, Jan-Philipp, Liu, Weiyu, Cha, Hannah, Brockbank, Erik, Wu, Sarah, Zhang, Ruohan, Wu, Jiajun, Gerstenberg, Tobias
Format:	Preprint
Published:	2024
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2410.01926
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

STaR-GATE: Teaching Language Models to Ask Clarifying Questions
by: Andukuri, Chinmaya, et al.
Published: (2024)

Spot The Ball: A Benchmark for Visual Social Inference
by: Balamurugan, Neha, et al.
Published: (2025)

Understanding Human Limits in Pattern Recognition: A Computational Model of Sequential Reasoning in Rock, Paper, Scissors
by: Cross, Logan, et al.
Published: (2025)

Self-Supervised Alignment with Mutual Information: Learning to Follow Principles without Preference Labels
by: Fränken, Jan-Philipp, et al.
Published: (2024)

Procedural Dilemma Generation for Evaluating Moral Reasoning in Humans and Language Models
by: Fränken, Jan-Philipp, et al.
Published: (2024)

Why Someone Asked "Why": Foil Inference in Human and LLM Question Interpretation
by: Besch, Britt, et al.
Published: (2026)

Learning Compositional Behaviors from Demonstration and Language
by: Liu, Weiyu, et al.
Published: (2025)

Naturally Supervised 3D Visual Grounding with Language-Regularized Concept Learners
by: Feng, Chun, et al.
Published: (2024)

Causal-PIK: Causality-based Physical Reasoning with a Physics-Informed Kernel
by: Parés-Morlans, Carlota, et al.
Published: (2025)

CRAFT: Designing Creative and Functional 3D Objects
by: Guo, Michelle, et al.
Published: (2024)

Predicate Hierarchies Improve Few-Shot State Classification
by: Jin, Emily, et al.
Published: (2025)

Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making
by: Li, Manling, et al.
Published: (2024)

Post Reinforcement Learning Inference
by: Syrgkanis, Vasilis, et al.
Published: (2023)

Human-like Affective Cognition in Foundation Models
by: Gandhi, Kanishk, et al.
Published: (2024)

TRANSIC: Sim-to-Real Policy Transfer by Learning from Online Correction
by: Jiang, Yunfan, et al.
Published: (2024)

Composable Part-Based Manipulation
by: Liu, Weiyu, et al.
Published: (2024)

LongCoT: Benchmarking Long-Horizon Chain-of-Thought Reasoning
by: Motwani, Sumeet Ramesh, et al.
Published: (2026)

Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL
by: Wu, Ian, et al.
Published: (2026)

Multi-Omics Analysis for Cancer Subtype Inference via Unrolling Graph Smoothness Priors
by: Lu, Jielong, et al.
Published: (2025)

MoMaGen: Generating Demonstrations under Soft and Hard Constraints for Multi-Step Bimanual Mobile Manipulation
by: Li, Chengshu, et al.
Published: (2025)

Hearing Anything Anywhere
by: Wang, Mason, et al.
Published: (2024)

Deep Active Inference Agents for Delayed and Long-Horizon Environments
by: Yeganeh, Yavar Taheri, et al.
Published: (2025)

Learning-Guided Rolling Horizon Optimization for Long-Horizon Flexible Job-Shop Scheduling
by: Li, Sirui, et al.
Published: (2025)

TRIP-Bench: A Benchmark for Long-Horizon Interactive Agents in Real-World Scenarios
by: Shen, Yuanzhe, et al.
Published: (2026)

DiffSound: Differentiable Modal Sound Rendering and Inverse Rendering for Diverse Inference Tasks
by: Jin, Xutong, et al.
Published: (2024)

Odysseys: Benchmarking Web Agents on Realistic Long Horizon Tasks
by: Jang, Lawrence Keunho, et al.
Published: (2026)

On Policy Evaluation Algorithms in Distributional Reinforcement Learning
by: Gerstenberg, Julian, et al.
Published: (2024)

HoTPP Benchmark: Are We Good at the Long Horizon Events Forecasting?
by: Karpukhin, Ivan, et al.
Published: (2024)

Effective Explanations Support Planning Under Uncertainty
by: Zhou, Hanqi, et al.
Published: (2026)

Predicting Outcomes in Video Games with Long Short Term Memory Networks
by: Chulajata, Kittimate, et al.
Published: (2024)

A Communication-First Account of Explanation
by: Harding, Jacqueline, et al.
Published: (2025)

Reinforcement Learning for Long-Horizon Interactive LLM Agents
by: Chen, Kevin, et al.
Published: (2025)

Learning to Ball: Composing Policies for Long-Horizon Basketball Moves
by: Xu, Pei, et al.
Published: (2025)

Horizon causality from holographic scattering in asymptotically dS$_3$
by: Franken, Victor, et al.
Published: (2024)

Demystifying Reinforcement Learning for Long-Horizon Tool-Using Agents: A Comprehensive Recipe
by: Wu, Xixi, et al.
Published: (2026)

λ: A Benchmark for Data-Efficiency in Long-Horizon Indoor Mobile Manipulation Robotics
by: Jaafar, Ahmed, et al.
Published: (2024)

$\boldsymbol{f}$-OPD: Stabilizing Long-Horizon On-Policy Distillation with Freshness-Aware Control
by: Chen, Xianwei, et al.
Published: (2026)

Probe and Skip: Self-Predictive Token Skipping for Efficient Long-Context LLM Inference
by: Wu, Zimeng, et al.
Published: (2026)

GR-RL: Going Dexterous and Precise for Long-Horizon Robotic Manipulation
by: Li, Yunfei, et al.
Published: (2025)

The Best of Both Worlds: Hybridizing Neural Operators and Solvers for Stable Long-Horizon Inference
by: Roy, Rajyasri, et al.
Published: (2025)