Saved in:
| Main Authors: | Zheng, Chongyi, Tuyls, Jens, Peng, Joanne, Eysenbach, Benjamin |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2412.08021 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
UpSkill: Mutual Information Skill Learning for Structured Response Diversity in LLMs
by: Shah, Devan, et al.
Published: (2026)
by: Shah, Devan, et al.
Published: (2026)
Can We Really Learn One Representation to Optimize All Rewards?
by: Zheng, Chongyi, et al.
Published: (2026)
by: Zheng, Chongyi, et al.
Published: (2026)
Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making
by: Myers, Vivek, et al.
Published: (2024)
by: Myers, Vivek, et al.
Published: (2024)
Contrastive Difference Predictive Coding
by: Zheng, Chongyi, et al.
Published: (2023)
by: Zheng, Chongyi, et al.
Published: (2023)
Training LLM Agents to Empower Humans
by: Ellis, Evan, et al.
Published: (2025)
by: Ellis, Evan, et al.
Published: (2025)
Intention-Conditioned Flow Occupancy Models
by: Zheng, Chongyi, et al.
Published: (2025)
by: Zheng, Chongyi, et al.
Published: (2025)
Value Flows
by: Dong, Perry, et al.
Published: (2025)
by: Dong, Perry, et al.
Published: (2025)
Stabilizing Contrastive RL: Techniques for Robotic Goal Reaching from Offline Data
by: Zheng, Chongyi, et al.
Published: (2023)
by: Zheng, Chongyi, et al.
Published: (2023)
A Single Goal is All You Need: Skills and Exploration Emerge from Contrastive RL without Rewards, Demonstrations, or Subgoals
by: Liu, Grace, et al.
Published: (2024)
by: Liu, Grace, et al.
Published: (2024)
Unifying Goal-Conditioned RL and Unsupervised Skill Learning via Control-Maximization
by: Modirshanechi, Alireza, et al.
Published: (2026)
by: Modirshanechi, Alireza, et al.
Published: (2026)
Skill Learning via Policy Diversity Yields Identifiable Representations for Reinforcement Learning
by: Reizinger, Patrik, et al.
Published: (2025)
by: Reizinger, Patrik, et al.
Published: (2025)
Horizon Generalization in Reinforcement Learning
by: Myers, Vivek, et al.
Published: (2025)
by: Myers, Vivek, et al.
Published: (2025)
Representation-Based Exploration for Language Models: From Test-Time to Post-Training
by: Tuyls, Jens, et al.
Published: (2025)
by: Tuyls, Jens, et al.
Published: (2025)
Scaling Laws for Imitation Learning in Single-Agent Games
by: Tuyls, Jens, et al.
Published: (2023)
by: Tuyls, Jens, et al.
Published: (2023)
1000 Layer Networks for Self-Supervised RL: Scaling Depth Can Enable New Goal-Reaching Capabilities
by: Wang, Kevin, et al.
Published: (2025)
by: Wang, Kevin, et al.
Published: (2025)
Consistent Zero-Shot Imitation with Contrastive Goal Inference
by: Wantlin, Kathryn, et al.
Published: (2025)
by: Wantlin, Kathryn, et al.
Published: (2025)
Is Temporal Difference Learning the Gold Standard for Stitching in RL?
by: Bortkiewicz, Michał, et al.
Published: (2025)
by: Bortkiewicz, Michał, et al.
Published: (2025)
Temporal Representations for Exploration: Learning Complex Exploratory Behavior without Extrinsic Rewards
by: Mohamed, Faisal, et al.
Published: (2026)
by: Mohamed, Faisal, et al.
Published: (2026)
Skill-aware Mutual Information Optimisation for Generalisation in Reinforcement Learning
by: Yu, Xuehui, et al.
Published: (2024)
by: Yu, Xuehui, et al.
Published: (2024)
Language-Guided World Models: A Model-Based Approach to AI Control
by: Zhang, Alex, et al.
Published: (2024)
by: Zhang, Alex, et al.
Published: (2024)
A Rate-Distortion View of Uncertainty Quantification
by: Apostolopoulou, Ifigeneia, et al.
Published: (2024)
by: Apostolopoulou, Ifigeneia, et al.
Published: (2024)
OGBench: Benchmarking Offline Goal-Conditioned RL
by: Park, Seohong, et al.
Published: (2024)
by: Park, Seohong, et al.
Published: (2024)
Self-Supervised Goal-Reaching Results in Multi-Agent Cooperation and Exploration
by: Nimonkar, Chirayu, et al.
Published: (2025)
by: Nimonkar, Chirayu, et al.
Published: (2025)
Behavior-Consistent Deep Reinforcement Learning
by: Hussing, Marcel, et al.
Published: (2026)
by: Hussing, Marcel, et al.
Published: (2026)
HIQL: Offline Goal-Conditioned RL with Latent States as Actions
by: Park, Seohong, et al.
Published: (2023)
by: Park, Seohong, et al.
Published: (2023)
Game-Theoretic Robust Reinforcement Learning Handles Temporally-Coupled Perturbations
by: Liang, Yongyuan, et al.
Published: (2023)
by: Liang, Yongyuan, et al.
Published: (2023)
Contrastive Representations for Temporal Reasoning
by: Ziarko, Alicja, et al.
Published: (2025)
by: Ziarko, Alicja, et al.
Published: (2025)
Robust Multi-Agent Reinforcement Learning by Mutual Information Regularization
by: Li, Simin, et al.
Published: (2023)
by: Li, Simin, et al.
Published: (2023)
Simple Ingredients for Offline Reinforcement Learning
by: Cetin, Edoardo, et al.
Published: (2024)
by: Cetin, Edoardo, et al.
Published: (2024)
MemFly: On-the-Fly Memory Optimization via Information Bottleneck
by: Zhang, Zhenyuan, et al.
Published: (2026)
by: Zhang, Zhenyuan, et al.
Published: (2026)
Mutual Information Regularized Offline Reinforcement Learning
by: Ma, Xiao, et al.
Published: (2022)
by: Ma, Xiao, et al.
Published: (2022)
Horizon Reduction Makes RL Scalable
by: Park, Seohong, et al.
Published: (2025)
by: Park, Seohong, et al.
Published: (2025)
Learning to Assist Humans without Inferring Rewards
by: Myers, Vivek, et al.
Published: (2024)
by: Myers, Vivek, et al.
Published: (2024)
Accelerating Goal-Conditioned RL Algorithms and Research
by: Bortkiewicz, Michał, et al.
Published: (2024)
by: Bortkiewicz, Michał, et al.
Published: (2024)
The Scaling Law for LoRA Base on Mutual Information Upper Bound
by: Zhang, Jing, et al.
Published: (2025)
by: Zhang, Jing, et al.
Published: (2025)
Towards a Pretrained Model for Restless Bandits via Multi-arm Generalization
by: Zhao, Yunfan, et al.
Published: (2023)
by: Zhao, Yunfan, et al.
Published: (2023)
Mutual Information Tracks Policy Coherence in Reinforcement Learning
by: Reid, Cameron, et al.
Published: (2025)
by: Reid, Cameron, et al.
Published: (2025)
Fairness in Survival Analysis: A Novel Conditional Mutual Information Augmentation Approach
by: Xie, Tianyang, et al.
Published: (2025)
by: Xie, Tianyang, et al.
Published: (2025)
MOLE: MOdular Learning FramEwork via Mutual Information Maximization
by: Li, Tianchao, et al.
Published: (2023)
by: Li, Tianchao, et al.
Published: (2023)
Bridging State and History Representations: Understanding Self-Predictive RL
by: Ni, Tianwei, et al.
Published: (2024)
by: Ni, Tianwei, et al.
Published: (2024)
Similar Items
-
UpSkill: Mutual Information Skill Learning for Structured Response Diversity in LLMs
by: Shah, Devan, et al.
Published: (2026) -
Can We Really Learn One Representation to Optimize All Rewards?
by: Zheng, Chongyi, et al.
Published: (2026) -
Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making
by: Myers, Vivek, et al.
Published: (2024) -
Contrastive Difference Predictive Coding
by: Zheng, Chongyi, et al.
Published: (2023) -
Training LLM Agents to Empower Humans
by: Ellis, Evan, et al.
Published: (2025)