:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Author:	Spigler, Giacomo
Format:	Preprint
Published:	2024
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2407.15134
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

TAVIS: A Benchmark for Egocentric Active Vision and Anticipatory Gaze in Imitation Learning
by: Spigler, Giacomo
Published: (2026)

Predicting Depression and Anxiety Risk in Dutch Neighborhoods from Street-View Images
by: Khodorivsko, Nin, et al.
Published: (2024)

Imitation of human motion achieves natural head movements for humanoid robots in an active-speaker detection task
by: Ding, Bosong, et al.
Published: (2024)

Reparameterization Proximal Policy Optimization
by: Zhong, Hai, et al.
Published: (2025)

On-Policy Optimization of ANFIS Policies Using Proximal Policy Optimization
by: Shankar, Kaaustaaub, et al.
Published: (2025)

Beyond the Boundaries of Proximal Policy Optimization
by: Tan, Charlie B., et al.
Published: (2024)

Proximal Policy Optimization with Adaptive Exploration
by: Lixandru, Andrei
Published: (2024)

Complexity-Regularized Proximal Policy Optimization
by: Serfilippi, Luca, et al.
Published: (2025)

KIPPO: Koopman-Inspired Proximal Policy Optimization
by: Cozma, Andrei, et al.
Published: (2025)

ESPO: Early-Stopping Proximal Policy Optimization
by: Li, Zihang, et al.
Published: (2026)

Learning Branching Policies for MILPs with Proximal Policy Optimization
by: Mhamed, Abdelouahed Ben, et al.
Published: (2025)

Extreme Region Policy Distillation
by: Chen, Changyu, et al.
Published: (2026)

Proximal Policy Gradient Arborescence for Quality Diversity Reinforcement Learning
by: Batra, Sumeet, et al.
Published: (2023)

PACED: Distillation and On-Policy Self-Distillation at the Frontier of Student Competence
by: Xu, Yuanda, et al.
Published: (2026)

Asymmetric Proximal Policy Optimization: mini-critics boost LLM reasoning
by: Liu, Jiashun, et al.
Published: (2025)

CIM-PPO:Proximal Policy Optimization with Liu-Correntropy Induced Metric
by: Guo, Yunxiao, et al.
Published: (2021)

A dynamical clipping approach with task feedback for Proximal Policy Optimization
by: Zhang, Ziqi, et al.
Published: (2023)

PROMA: Projected Microbatch Accumulation for Reference-Free Proximal Policy Updates
by: Abrahamsen, Nilin
Published: (2026)

Efficient Deep Reinforcement Learning with Predictive Processing Proximal Policy Optimization
by: Küçükoğlu, Burcu, et al.
Published: (2022)

Online Policy Distillation with Decision-Attention
by: Yu, Xinqiang, et al.
Published: (2024)

HDPO: Hybrid Distillation Policy Optimization via Privileged Self-Distillation
by: Ding, Ken
Published: (2026)

TIP: Token Importance in On-Policy Distillation
by: Xu, Yuanda, et al.
Published: (2026)

ExO-PPO: an Extended Off-policy Proximal Policy Optimization Algorithm
by: Wang, Hanyong, et al.
Published: (2026)

OPD+: Rethinking the Advantage Design for On-Policy Distillation
by: Zhao, Hanyang, et al.
Published: (2026)

Trust-Region Behavior Blending for On-Policy Distillation
by: Plyusov, Daniil, et al.
Published: (2026)

The Illusion of Certainty: Decoupling Capability and Calibration in On-Policy Distillation
by: Zhang, Jiaxin, et al.
Published: (2026)

Evaluating Interpretable Reinforcement Learning by Distilling Policies into Programs
by: Kohler, Hector, et al.
Published: (2025)

ADWIN: Adaptive Windows for Horizon-Aware On-Policy Distillation
by: Liang, Kun, et al.
Published: (2026)

Stable On-Policy Distillation through Adaptive Target Reformulation
by: Jang, Ijun, et al.
Published: (2026)

Interpretable Policy Distillation for Power Grid Topology Control
by: Dmitruka, Aleksandra, et al.
Published: (2026)

Explainable RL Policies by Distilling to Locally-Specialized Linear Policies with Voronoi State Partitioning
by: Deproost, Senne, et al.
Published: (2025)

Accelerating Proximal Policy Optimization Learning Using Task Prediction for Solving Environments with Delayed Rewards
by: Ahmad, Ahmad, et al.
Published: (2024)

Variational Distillation of Diffusion Policies into Mixture of Experts
by: Zhou, Hongyi, et al.
Published: (2024)

Asymmetric On-Policy Distillation: Bridging Exploitation and Imitation at the Token Level
by: Jia, Nan, et al.
Published: (2026)

Unmasking On-Policy Distillation: Where It Helps, Where It Hurts, and Why
by: Armandpour, Mohammadreza, et al.
Published: (2026)

How Ensembles of Distilled Policies Improve Generalisation in Reinforcement Learning
by: Weltevrede, Max, et al.
Published: (2025)

A Deep Reinforcement Learning Approach to Battery Management in Dairy Farming via Proximal Policy Optimization
by: Ali, Nawazish, et al.
Published: (2024)

Proximity Matters: Local Proximity Enhanced Balancing for Treatment Effect Estimation
by: Wang, Hao, et al.
Published: (2024)

From Generic Correlation to Input-Specific Credit in On-Policy Self Distillation
by: Shen, Guobin, et al.
Published: (2026)

Prune-OPD: Efficient and Reliable On-Policy Distillation for Long-Horizon Reasoning
by: Yang, Zhicheng, et al.
Published: (2026)