:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Towers, Mark, Du, Yali, Freeman, Christopher, Norman, Timothy J.
Format:	Preprint
Published:	2024
Subjects:	Artificial Intelligence Machine Learning
Online Access:	https://arxiv.org/abs/2408.08230
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

A Comparative User Evaluation of XRL Explanations using Goal Identification
by: Towers, Mark, et al.
Published: (2025)

FutureX: An Advanced Live Benchmark for LLM Agents in Future Prediction
by: Zeng, Zhiyuan, et al.
Published: (2025)

ConformaDecompose: Explaining Uncertainty via Calibration Localization
by: Yapicioglu, Fatima Rabia, et al.
Published: (2026)

Beyond The Rainbow: High Performance Deep Reinforcement Learning on a Desktop PC
by: Clark, Tyler, et al.
Published: (2024)

FutureWorld: A Live Reinforcement Learning Environment for Predictive Agents with Real-World Outcome Rewards
by: Han, Zhixin, et al.
Published: (2026)

STAS: Spatial-Temporal Return Decomposition for Multi-agent Reinforcement Learning
by: Chen, Sirui, et al.
Published: (2023)

Distributional Process Reward Models: Calibrated Prediction of Future Rewards via Conditional Optimal Transport
by: Ma, Rachel, et al.
Published: (2026)

Predicting the Future by Retrieving the Past
by: Du, Dazhao, et al.
Published: (2025)

Explaining Learned Reward Functions with Counterfactual Trajectories
by: Wehner, Jan, et al.
Published: (2024)

LLMs for XAI: Future Directions for Explaining Explanations
by: Zytek, Alexandra, et al.
Published: (2024)

Integrating Policy Summaries with Reward Decomposition for Explaining Reinforcement Learning Agents
by: Septon, Yael, et al.
Published: (2022)

Spatio-Temporal Trajectory Foundation Model - Recent Advances and Future Directions
by: Yang, Sean Bin, et al.
Published: (2025)

A Survey on Context-Aware Multi-Agent Systems: Techniques, Challenges and Future Directions
by: Du, Hung, et al.
Published: (2024)

It's About Time: Temporal References in Emergent Communication
by: Lipinski, Olaf, et al.
Published: (2023)

RDAR: Reward-Driven Agent Relevance Estimation for Autonomous Driving
by: Bosio, Carlo, et al.
Published: (2025)

MAESTRO: Multi-Agent Environment Shaping through Task and Reward Optimization
by: Wu, Boyuan
Published: (2025)

ABBEL: LLM Agents Acting through Belief Bottlenecks Expressed in Language
by: Lidayan, Aly, et al.
Published: (2025)

Complementary Recommendation in E-commerce: Definition, Approaches, and Future Directions
by: Li, Linyue, et al.
Published: (2024)

Belief or Circuitry? Causal Evidence for In-Context Graph Learning
by: Kowalyshyn, Katharine, et al.
Published: (2026)

Bench to the Future: A Pastcasting Benchmark for Forecasting Agents
by: FutureSearch, et al.
Published: (2025)

ACA-Net: Future Graph Learning for Logistical Demand-Supply Forecasting
by: Shi, Jiacheng, et al.
Published: (2025)

Belief States for Cooperative Multi-Agent Reinforcement Learning under Partial Observability
by: Pritz, Paul J., et al.
Published: (2025)

On the Curses of Future and History in Future-dependent Value Functions for Off-policy Evaluation
by: Zhang, Yuheng, et al.
Published: (2024)

ATLaS: Agent Tuning via Learning Critical Steps
by: Chen, Zhixun, et al.
Published: (2025)

AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories
by: Lù, Xing Han, et al.
Published: (2025)

RAT: Adversarial Attacks on Deep Reinforcement Agents for Targeted Behaviors
by: Bai, Fengshuo, et al.
Published: (2024)

Explaining Concept Drift through the Evolution of Group Counterfactuals
by: Stępka, Ignacy, et al.
Published: (2025)

FutureSim: Replaying World Events to Evaluate Adaptive Agents
by: Goel, Shashwat, et al.
Published: (2026)

VAM: Verbalized Action Masking for Controllable Exploration in RL Post-Training -- A Chess Case Study
by: Zhang, Zhicheng, et al.
Published: (2026)

Back To The Future: A Hybrid Transformer-XGBoost Model for Action-oriented Future-proofing Nowcasting
by: Sun, Ziheng
Published: (2024)

Explaining and Improving Information Complementarities in Multi-Agent Decision-making
by: Guo, Ziyang, et al.
Published: (2025)

GraphTool-Instruction: Revolutionizing Graph Reasoning in LLMs through Decomposed Subtask Instruction
by: Wang, Rongzheng, et al.
Published: (2024)

Support Sufficiency as Consequence-Sensitive Compression in Belief Arbitration
by: Walsh, Mark
Published: (2026)

All Language Models Large and Small
by: Chen, Zhixun, et al.
Published: (2024)

Explaining Robustness to Catastrophic Forgetting Through Incremental Concept Formation
by: Barari, Nicki, et al.
Published: (2025)

Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency
by: Liu, Zhihan, et al.
Published: (2023)

Latent State Estimation Helps UI Agents to Reason
by: Bishop, William E, et al.
Published: (2024)

ProgAgent:A Continual RL Agent with Progress-Aware Rewards
by: Tan, Jinzhou, et al.
Published: (2026)

Robot Policy Learning with Temporal Optimal Transport Reward
by: Fu, Yuwei, et al.
Published: (2024)

BET: Explaining Deep Reinforcement Learning through The Error-Prone Decisions
by: Liu, Xiao, et al.
Published: (2024)