:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Zheng, Chongyi, Tuyls, Jens, Peng, Joanne, Eysenbach, Benjamin
Format:	Preprint
Published:	2024
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2412.08021
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

UpSkill: Mutual Information Skill Learning for Structured Response Diversity in LLMs
by: Shah, Devan, et al.
Published: (2026)

Can We Really Learn One Representation to Optimize All Rewards?
by: Zheng, Chongyi, et al.
Published: (2026)

Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making
by: Myers, Vivek, et al.
Published: (2024)

Contrastive Difference Predictive Coding
by: Zheng, Chongyi, et al.
Published: (2023)

Training LLM Agents to Empower Humans
by: Ellis, Evan, et al.
Published: (2025)

Intention-Conditioned Flow Occupancy Models
by: Zheng, Chongyi, et al.
Published: (2025)

Value Flows
by: Dong, Perry, et al.
Published: (2025)

Stabilizing Contrastive RL: Techniques for Robotic Goal Reaching from Offline Data
by: Zheng, Chongyi, et al.
Published: (2023)

A Single Goal is All You Need: Skills and Exploration Emerge from Contrastive RL without Rewards, Demonstrations, or Subgoals
by: Liu, Grace, et al.
Published: (2024)

Unifying Goal-Conditioned RL and Unsupervised Skill Learning via Control-Maximization
by: Modirshanechi, Alireza, et al.
Published: (2026)

Skill Learning via Policy Diversity Yields Identifiable Representations for Reinforcement Learning
by: Reizinger, Patrik, et al.
Published: (2025)

Horizon Generalization in Reinforcement Learning
by: Myers, Vivek, et al.
Published: (2025)

Representation-Based Exploration for Language Models: From Test-Time to Post-Training
by: Tuyls, Jens, et al.
Published: (2025)

Scaling Laws for Imitation Learning in Single-Agent Games
by: Tuyls, Jens, et al.
Published: (2023)

1000 Layer Networks for Self-Supervised RL: Scaling Depth Can Enable New Goal-Reaching Capabilities
by: Wang, Kevin, et al.
Published: (2025)

Consistent Zero-Shot Imitation with Contrastive Goal Inference
by: Wantlin, Kathryn, et al.
Published: (2025)

Is Temporal Difference Learning the Gold Standard for Stitching in RL?
by: Bortkiewicz, Michał, et al.
Published: (2025)

Temporal Representations for Exploration: Learning Complex Exploratory Behavior without Extrinsic Rewards
by: Mohamed, Faisal, et al.
Published: (2026)

Skill-aware Mutual Information Optimisation for Generalisation in Reinforcement Learning
by: Yu, Xuehui, et al.
Published: (2024)

Language-Guided World Models: A Model-Based Approach to AI Control
by: Zhang, Alex, et al.
Published: (2024)

A Rate-Distortion View of Uncertainty Quantification
by: Apostolopoulou, Ifigeneia, et al.
Published: (2024)

OGBench: Benchmarking Offline Goal-Conditioned RL
by: Park, Seohong, et al.
Published: (2024)

Self-Supervised Goal-Reaching Results in Multi-Agent Cooperation and Exploration
by: Nimonkar, Chirayu, et al.
Published: (2025)

Behavior-Consistent Deep Reinforcement Learning
by: Hussing, Marcel, et al.
Published: (2026)

HIQL: Offline Goal-Conditioned RL with Latent States as Actions
by: Park, Seohong, et al.
Published: (2023)

Game-Theoretic Robust Reinforcement Learning Handles Temporally-Coupled Perturbations
by: Liang, Yongyuan, et al.
Published: (2023)

Contrastive Representations for Temporal Reasoning
by: Ziarko, Alicja, et al.
Published: (2025)

Robust Multi-Agent Reinforcement Learning by Mutual Information Regularization
by: Li, Simin, et al.
Published: (2023)

Simple Ingredients for Offline Reinforcement Learning
by: Cetin, Edoardo, et al.
Published: (2024)

MemFly: On-the-Fly Memory Optimization via Information Bottleneck
by: Zhang, Zhenyuan, et al.
Published: (2026)

Mutual Information Regularized Offline Reinforcement Learning
by: Ma, Xiao, et al.
Published: (2022)

Horizon Reduction Makes RL Scalable
by: Park, Seohong, et al.
Published: (2025)

Learning to Assist Humans without Inferring Rewards
by: Myers, Vivek, et al.
Published: (2024)

Accelerating Goal-Conditioned RL Algorithms and Research
by: Bortkiewicz, Michał, et al.
Published: (2024)

The Scaling Law for LoRA Base on Mutual Information Upper Bound
by: Zhang, Jing, et al.
Published: (2025)

Towards a Pretrained Model for Restless Bandits via Multi-arm Generalization
by: Zhao, Yunfan, et al.
Published: (2023)

Mutual Information Tracks Policy Coherence in Reinforcement Learning
by: Reid, Cameron, et al.
Published: (2025)

Fairness in Survival Analysis: A Novel Conditional Mutual Information Augmentation Approach
by: Xie, Tianyang, et al.
Published: (2025)

MOLE: MOdular Learning FramEwork via Mutual Information Maximization
by: Li, Tianchao, et al.
Published: (2023)

Bridging State and History Representations: Understanding Self-Predictive RL
by: Ni, Tianwei, et al.
Published: (2024)