| Main Authors: | Xie, Zhengpeng, Cao, Jiahang, Wang, Changwei, Yang, Fan, Hutter, Marco, Zhang, Qiang, Zhang, Jianxiong, Xu, Renjing |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2501.02481 |
Similar Items
Simple Policy Optimization
by: Xie, Zhengpeng, et al.
Published: (2024)
A Dual-Agent Adversarial Framework for Robust Generalization in Deep Reinforcement Learning
by: Xie, Zhengpeng, et al.
Published: (2025)
MoSA: Motion-constrained Stress Adaptation for Mitigating Real-to-Sim Gap in Continuum Dynamics via Learning Residual Anisotropy
by: Wang, Jiaxu, et al.
Published: (2026)
Reinforcement Learning with Generalizable Gaussian Splatting
by: Wang, Jiaxu, et al.
Published: (2024)
MoDE: A Mixture-of-Experts Model with Mutual Distillation among the Experts
by: Xie, Zhitian, et al.
Published: (2024)
Mutual Information Regularized Offline Reinforcement Learning
by: Ma, Xiao, et al.
Published: (2022)
Robust Multi-Agent Reinforcement Learning by Mutual Information Regularization
by: Li, Simin, et al.
Published: (2023)
Graph is a Natural Regularization: Revisiting Vector Quantization for Graph Representation Learning
by: Zhai, Zian, et al.
Published: (2025)
Adaptive Regularization of Representation Rank as an Implicit Constraint of Bellman Equation
by: He, Qiang, et al.
Published: (2024)
Shifting Attention to Relevance: Towards the Predictive Uncertainty Quantification of Free-Form Large Language Models
by: Duan, Jinhao, et al.
Published: (2023)
Fully Spiking Neural Network for Legged Robots
by: Jiang, Xiaoyang, et al.
Published: (2023)
Learning to Open and Traverse Doors with a Legged Manipulator
by: Zhang, Mike, et al.
Published: (2024)
Quantile Geometry Regularization for Distributional Reinforcement Learning
by: Zhang, Zhaofan, et al.
Published: (2026)
DeepStock: Reinforcement Learning with Policy Regularizations for Inventory Management
by: Xie, Yaqi, et al.
Published: (2026)
Preference-Based Self-Distillation: Beyond KL Matching via Reward Regularization
by: Yu, Xin, et al.
Published: (2026)
Enhancing Time Series Forecasting via Logic-Inspired Regularization
by: Zhang, Jianqi, et al.
Published: (2025)
From Generalist to Specialist Representation
by: Zheng, Yujia, et al.
Published: (2026)
Flora: Low-Rank Adapters Are Secretly Gradient Compressors
by: Hao, Yongchang, et al.
Published: (2024)
Representation Learning with Mutual Influence of Modalities for Node Classification in Multi-Modal Heterogeneous Networks
by: Li, Jiafan, et al.
Published: (2025)
The Scaling Law for LoRA Base on Mutual Information Upper Bound
by: Zhang, Jing, et al.
Published: (2025)
PPC-GPT: Federated Task-Specific Compression of Large Language Models via Pruning and Chain-of-Thought Distillation
by: Fan, Tao, et al.
Published: (2025)
Using large language models for embodied planning introduces systematic safety risks
by: Zhang, Tao, et al.
Published: (2026)
Enhancing Modality Representation and Alignment for Multimodal Cold-start Active Learning
by: Shen, Meng, et al.
Published: (2024)
Distilled Protein Backbone Generation
by: Xie, Liyang, et al.
Published: (2025)
Convergent Linear Representations of Emergent Misalignment
by: Soligo, Anna, et al.
Published: (2025)
Convergent World Representations and Divergent Tasks
by: Park, Core Francisco
Published: (2026)
Behavior-Regularized Diffusion Policy Optimization for Offline Reinforcement Learning
by: Gao, Chen-Xiao, et al.
Published: (2025)
Rényi Divergence Deep Mutual Learning
by: Huang, Weipeng, et al.
Published: (2022)
Robotic World Model: A Neural Network Simulator for Robust Policy Optimization in Robotics
by: Li, Chenhao, et al.
Published: (2025)
Uncertainty-Aware Robotic World Model Makes Offline Model-Based Reinforcement Learning Work on Real Robots
by: Li, Chenhao, et al.
Published: (2025)
Bridging the Gap: Enabling Soft Actor Critic for High Performance Legged Locomotion
by: Sabatini, Gianluca, et al.
Published: (2026)
Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information
by: Shen, Guobin, et al.
Published: (2026)
DeepCell: Self-Supervised Multiview Fusion for Circuit Representation Learning
by: Shi, Zhengyuan, et al.
Published: (2025)
Adaptive Guidance for Local Training in Heterogeneous Federated Learning
by: Zhang, Jianqing, et al.
Published: (2024)
Fairness in Survival Analysis: A Novel Conditional Mutual Information Augmentation Approach
by: Xie, Tianyang, et al.
Published: (2025)
Linear $Q$-Learning Does Not Diverge in $L^2$: Convergence Rates to a Bounded Set
by: Liu, Xinyu, et al.
Published: (2025)
Multi-Head Spectral-Adaptive Graph Anomaly Detection
by: Cao, Qingyue, et al.
Published: (2025)
Large-Small Model Collaborative Framework for Federated Continual Learning
by: Yu, Hao, et al.
Published: (2025)
Context Distillation as Latent Memory Management
by: Zheng, Ziyang, et al.
Published: (2026)
Feature-Based vs. GAN-Based Learning from Demonstrations: When and Why
by: Li, Chenhao, et al.
Published: (2025)