Library Catalog

Bibliographic Details
Main Authors:	Xie, Zhengpeng; Cao, Jiahang; Wang, Changwei; Yang, Fan; Hutter, Marco; Zhang, Qiang; Zhang, Jianxiong; Xu, Renjing
Format:	Preprint
Published:	2025
Subjects:	Machine Learning; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2501.02481

Similar Items

Simple Policy Optimization
by: Xie, Zhengpeng, et al.
Published: (2024)

A Dual-Agent Adversarial Framework for Robust Generalization in Deep Reinforcement Learning
by: Xie, Zhengpeng, et al.
Published: (2025)

MoSA: Motion-constrained Stress Adaptation for Mitigating Real-to-Sim Gap in Continuum Dynamics via Learning Residual Anisotropy
by: Wang, Jiaxu, et al.
Published: (2026)

Reinforcement Learning with Generalizable Gaussian Splatting
by: Wang, Jiaxu, et al.
Published: (2024)

MoDE: A Mixture-of-Experts Model with Mutual Distillation among the Experts
by: Xie, Zhitian, et al.
Published: (2024)

Mutual Information Regularized Offline Reinforcement Learning
by: Ma, Xiao, et al.
Published: (2022)

Robust Multi-Agent Reinforcement Learning by Mutual Information Regularization
by: Li, Simin, et al.
Published: (2023)

Graph is a Natural Regularization: Revisiting Vector Quantization for Graph Representation Learning
by: Zhai, Zian, et al.
Published: (2025)

Adaptive Regularization of Representation Rank as an Implicit Constraint of Bellman Equation
by: He, Qiang, et al.
Published: (2024)

Shifting Attention to Relevance: Towards the Predictive Uncertainty Quantification of Free-Form Large Language Models
by: Duan, Jinhao, et al.
Published: (2023)

Fully Spiking Neural Network for Legged Robots
by: Jiang, Xiaoyang, et al.
Published: (2023)

Learning to Open and Traverse Doors with a Legged Manipulator
by: Zhang, Mike, et al.
Published: (2024)

Quantile Geometry Regularization for Distributional Reinforcement Learning
by: Zhang, Zhaofan, et al.
Published: (2026)

DeepStock: Reinforcement Learning with Policy Regularizations for Inventory Management
by: Xie, Yaqi, et al.
Published: (2026)

Preference-Based Self-Distillation: Beyond KL Matching via Reward Regularization
by: Yu, Xin, et al.
Published: (2026)

Enhancing Time Series Forecasting via Logic-Inspired Regularization
by: Zhang, Jianqi, et al.
Published: (2025)

From Generalist to Specialist Representation
by: Zheng, Yujia, et al.
Published: (2026)

Flora: Low-Rank Adapters Are Secretly Gradient Compressors
by: Hao, Yongchang, et al.
Published: (2024)

Representation Learning with Mutual Influence of Modalities for Node Classification in Multi-Modal Heterogeneous Networks
by: Li, Jiafan, et al.
Published: (2025)

The Scaling Law for LoRA Base on Mutual Information Upper Bound
by: Zhang, Jing, et al.
Published: (2025)

PPC-GPT: Federated Task-Specific Compression of Large Language Models via Pruning and Chain-of-Thought Distillation
by: Fan, Tao, et al.
Published: (2025)

Using large language models for embodied planning introduces systematic safety risks
by: Zhang, Tao, et al.
Published: (2026)

Enhancing Modality Representation and Alignment for Multimodal Cold-start Active Learning
by: Shen, Meng, et al.
Published: (2024)

Distilled Protein Backbone Generation
by: Xie, Liyang, et al.
Published: (2025)

Convergent Linear Representations of Emergent Misalignment
by: Soligo, Anna, et al.
Published: (2025)

Convergent World Representations and Divergent Tasks
by: Park, Core Francisco
Published: (2026)

Behavior-Regularized Diffusion Policy Optimization for Offline Reinforcement Learning
by: Gao, Chen-Xiao, et al.
Published: (2025)

Rényi Divergence Deep Mutual Learning
by: Huang, Weipeng, et al.
Published: (2022)

Robotic World Model: A Neural Network Simulator for Robust Policy Optimization in Robotics
by: Li, Chenhao, et al.
Published: (2025)

Uncertainty-Aware Robotic World Model Makes Offline Model-Based Reinforcement Learning Work on Real Robots
by: Li, Chenhao, et al.
Published: (2025)

Bridging the Gap: Enabling Soft Actor Critic for High Performance Legged Locomotion
by: Sabatini, Gianluca, et al.
Published: (2026)

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information
by: Shen, Guobin, et al.
Published: (2026)

DeepCell: Self-Supervised Multiview Fusion for Circuit Representation Learning
by: Shi, Zhengyuan, et al.
Published: (2025)

Adaptive Guidance for Local Training in Heterogeneous Federated Learning
by: Zhang, Jianqing, et al.
Published: (2024)

Fairness in Survival Analysis: A Novel Conditional Mutual Information Augmentation Approach
by: Xie, Tianyang, et al.
Published: (2025)

Linear $Q$-Learning Does Not Diverge in $L^2$: Convergence Rates to a Bounded Set
by: Liu, Xinyu, et al.
Published: (2025)

Multi-Head Spectral-Adaptive Graph Anomaly Detection
by: Cao, Qingyue, et al.
Published: (2025)

Large-Small Model Collaborative Framework for Federated Continual Learning
by: Yu, Hao, et al.
Published: (2025)

Context Distillation as Latent Memory Management
by: Zheng, Ziyang, et al.
Published: (2026)

Feature-Based vs. GAN-Based Learning from Demonstrations: When and Why
by: Li, Chenhao, et al.
Published: (2025)