Saved in:
| Main Authors: | Towers, Mark, Du, Yali, Freeman, Christopher, Norman, Timothy J. |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2408.08230 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
A Comparative User Evaluation of XRL Explanations using Goal Identification
by: Towers, Mark, et al.
Published: (2025)
by: Towers, Mark, et al.
Published: (2025)
FutureX: An Advanced Live Benchmark for LLM Agents in Future Prediction
by: Zeng, Zhiyuan, et al.
Published: (2025)
by: Zeng, Zhiyuan, et al.
Published: (2025)
ConformaDecompose: Explaining Uncertainty via Calibration Localization
by: Yapicioglu, Fatima Rabia, et al.
Published: (2026)
by: Yapicioglu, Fatima Rabia, et al.
Published: (2026)
Beyond The Rainbow: High Performance Deep Reinforcement Learning on a Desktop PC
by: Clark, Tyler, et al.
Published: (2024)
by: Clark, Tyler, et al.
Published: (2024)
FutureWorld: A Live Reinforcement Learning Environment for Predictive Agents with Real-World Outcome Rewards
by: Han, Zhixin, et al.
Published: (2026)
by: Han, Zhixin, et al.
Published: (2026)
STAS: Spatial-Temporal Return Decomposition for Multi-agent Reinforcement Learning
by: Chen, Sirui, et al.
Published: (2023)
by: Chen, Sirui, et al.
Published: (2023)
Distributional Process Reward Models: Calibrated Prediction of Future Rewards via Conditional Optimal Transport
by: Ma, Rachel, et al.
Published: (2026)
by: Ma, Rachel, et al.
Published: (2026)
Predicting the Future by Retrieving the Past
by: Du, Dazhao, et al.
Published: (2025)
by: Du, Dazhao, et al.
Published: (2025)
Explaining Learned Reward Functions with Counterfactual Trajectories
by: Wehner, Jan, et al.
Published: (2024)
by: Wehner, Jan, et al.
Published: (2024)
LLMs for XAI: Future Directions for Explaining Explanations
by: Zytek, Alexandra, et al.
Published: (2024)
by: Zytek, Alexandra, et al.
Published: (2024)
Integrating Policy Summaries with Reward Decomposition for Explaining Reinforcement Learning Agents
by: Septon, Yael, et al.
Published: (2022)
by: Septon, Yael, et al.
Published: (2022)
Spatio-Temporal Trajectory Foundation Model - Recent Advances and Future Directions
by: Yang, Sean Bin, et al.
Published: (2025)
by: Yang, Sean Bin, et al.
Published: (2025)
A Survey on Context-Aware Multi-Agent Systems: Techniques, Challenges and Future Directions
by: Du, Hung, et al.
Published: (2024)
by: Du, Hung, et al.
Published: (2024)
It's About Time: Temporal References in Emergent Communication
by: Lipinski, Olaf, et al.
Published: (2023)
by: Lipinski, Olaf, et al.
Published: (2023)
RDAR: Reward-Driven Agent Relevance Estimation for Autonomous Driving
by: Bosio, Carlo, et al.
Published: (2025)
by: Bosio, Carlo, et al.
Published: (2025)
MAESTRO: Multi-Agent Environment Shaping through Task and Reward Optimization
by: Wu, Boyuan
Published: (2025)
by: Wu, Boyuan
Published: (2025)
ABBEL: LLM Agents Acting through Belief Bottlenecks Expressed in Language
by: Lidayan, Aly, et al.
Published: (2025)
by: Lidayan, Aly, et al.
Published: (2025)
Complementary Recommendation in E-commerce: Definition, Approaches, and Future Directions
by: Li, Linyue, et al.
Published: (2024)
by: Li, Linyue, et al.
Published: (2024)
Belief or Circuitry? Causal Evidence for In-Context Graph Learning
by: Kowalyshyn, Katharine, et al.
Published: (2026)
by: Kowalyshyn, Katharine, et al.
Published: (2026)
Bench to the Future: A Pastcasting Benchmark for Forecasting Agents
by: FutureSearch, et al.
Published: (2025)
by: FutureSearch, et al.
Published: (2025)
ACA-Net: Future Graph Learning for Logistical Demand-Supply Forecasting
by: Shi, Jiacheng, et al.
Published: (2025)
by: Shi, Jiacheng, et al.
Published: (2025)
Belief States for Cooperative Multi-Agent Reinforcement Learning under Partial Observability
by: Pritz, Paul J., et al.
Published: (2025)
by: Pritz, Paul J., et al.
Published: (2025)
On the Curses of Future and History in Future-dependent Value Functions for Off-policy Evaluation
by: Zhang, Yuheng, et al.
Published: (2024)
by: Zhang, Yuheng, et al.
Published: (2024)
ATLaS: Agent Tuning via Learning Critical Steps
by: Chen, Zhixun, et al.
Published: (2025)
by: Chen, Zhixun, et al.
Published: (2025)
AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories
by: Lù, Xing Han, et al.
Published: (2025)
by: Lù, Xing Han, et al.
Published: (2025)
RAT: Adversarial Attacks on Deep Reinforcement Agents for Targeted Behaviors
by: Bai, Fengshuo, et al.
Published: (2024)
by: Bai, Fengshuo, et al.
Published: (2024)
Explaining Concept Drift through the Evolution of Group Counterfactuals
by: Stępka, Ignacy, et al.
Published: (2025)
by: Stępka, Ignacy, et al.
Published: (2025)
FutureSim: Replaying World Events to Evaluate Adaptive Agents
by: Goel, Shashwat, et al.
Published: (2026)
by: Goel, Shashwat, et al.
Published: (2026)
VAM: Verbalized Action Masking for Controllable Exploration in RL Post-Training -- A Chess Case Study
by: Zhang, Zhicheng, et al.
Published: (2026)
by: Zhang, Zhicheng, et al.
Published: (2026)
Back To The Future: A Hybrid Transformer-XGBoost Model for Action-oriented Future-proofing Nowcasting
by: Sun, Ziheng
Published: (2024)
by: Sun, Ziheng
Published: (2024)
Explaining and Improving Information Complementarities in Multi-Agent Decision-making
by: Guo, Ziyang, et al.
Published: (2025)
by: Guo, Ziyang, et al.
Published: (2025)
GraphTool-Instruction: Revolutionizing Graph Reasoning in LLMs through Decomposed Subtask Instruction
by: Wang, Rongzheng, et al.
Published: (2024)
by: Wang, Rongzheng, et al.
Published: (2024)
Support Sufficiency as Consequence-Sensitive Compression in Belief Arbitration
by: Walsh, Mark
Published: (2026)
by: Walsh, Mark
Published: (2026)
All Language Models Large and Small
by: Chen, Zhixun, et al.
Published: (2024)
by: Chen, Zhixun, et al.
Published: (2024)
Explaining Robustness to Catastrophic Forgetting Through Incremental Concept Formation
by: Barari, Nicki, et al.
Published: (2025)
by: Barari, Nicki, et al.
Published: (2025)
Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency
by: Liu, Zhihan, et al.
Published: (2023)
by: Liu, Zhihan, et al.
Published: (2023)
Latent State Estimation Helps UI Agents to Reason
by: Bishop, William E, et al.
Published: (2024)
by: Bishop, William E, et al.
Published: (2024)
ProgAgent:A Continual RL Agent with Progress-Aware Rewards
by: Tan, Jinzhou, et al.
Published: (2026)
by: Tan, Jinzhou, et al.
Published: (2026)
Robot Policy Learning with Temporal Optimal Transport Reward
by: Fu, Yuwei, et al.
Published: (2024)
by: Fu, Yuwei, et al.
Published: (2024)
BET: Explaining Deep Reinforcement Learning through The Error-Prone Decisions
by: Liu, Xiao, et al.
Published: (2024)
by: Liu, Xiao, et al.
Published: (2024)
Similar Items
-
A Comparative User Evaluation of XRL Explanations using Goal Identification
by: Towers, Mark, et al.
Published: (2025) -
FutureX: An Advanced Live Benchmark for LLM Agents in Future Prediction
by: Zeng, Zhiyuan, et al.
Published: (2025) -
ConformaDecompose: Explaining Uncertainty via Calibration Localization
by: Yapicioglu, Fatima Rabia, et al.
Published: (2026) -
Beyond The Rainbow: High Performance Deep Reinforcement Learning on a Desktop PC
by: Clark, Tyler, et al.
Published: (2024) -
FutureWorld: A Live Reinforcement Learning Environment for Predictive Agents with Real-World Outcome Rewards
by: Han, Zhixin, et al.
Published: (2026)