| Main Authors: | Xie, Zhengpeng, Zhang, Qiang, Yang, Fan, Hutter, Marco, Xu, Renjing |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2401.16025 |
Similar Items
Representation Convergence: Mutual Distillation is Secretly a Form of Regularization
by: Xie, Zhengpeng, et al.
Published: (2025)
Zeroth-Order Optimization is Secretly Single-Step Policy Optimization
by: Qiu, Junbin, et al.
Published: (2025)
A Dual-Agent Adversarial Framework for Robust Generalization in Deep Reinforcement Learning
by: Xie, Zhengpeng, et al.
Published: (2025)
Robotic World Model: A Neural Network Simulator for Robust Policy Optimization in Robotics
by: Li, Chenhao, et al.
Published: (2025)
CRAFT: Aligning Diffusion Models with Fine-Tuning Is Easier Than You Think
by: Sun, Zening, et al.
Published: (2026)
Adaptive Scaling of Policy Constraints for Offline Reinforcement Learning
by: Jing, Tan, et al.
Published: (2025)
A Progressive Image Restoration Network for High-order Degradation Imaging in Remote Sensing
by: Feng, Yujie, et al.
Published: (2024)
3D-U-SAM Network For Few-shot Tooth Segmentation in CBCT Images
by: Zhang, Yifu, et al.
Published: (2023)
Applying Self-supervised Learning to Network Intrusion Detection for Network Flows with Graph Neural Network
by: Xu, Renjie, et al.
Published: (2024)
Pretraining in Actor-Critic Reinforcement Learning for Robot Locomotion
by: Fan, Jiale, et al.
Published: (2025)
Large Language Models Engineer Too Many Simple Features For Tabular Data
by: Küken, Jaris, et al.
Published: (2024)
UCPO: Uncertainty-Aware Policy Optimization
by: Zeng, Xianzhou, et al.
Published: (2026)
Rethinking Robustness Assessment: Adversarial Attacks on Learning-based Quadrupedal Locomotion Controllers
by: Shi, Fan, et al.
Published: (2024)
DEL: Discrete Element Learner for Learning 3D Particle Dynamics with Neural Rendering
by: Wang, Jiaxu, et al.
Published: (2024)
A General Framework for User-Guided Bayesian Optimization
by: Hvarfner, Carl, et al.
Published: (2023)
Policy Optimization in RLHF: The Impact of Out-of-preference Data
by: Li, Ziniu, et al.
Published: (2023)
c-TPE: Tree-structured Parzen Estimator with Inequality Constraints for Expensive Hyperparameter Optimization
by: Watanabe, Shuhei, et al.
Published: (2022)
Client Selection for Federated Policy Optimization with Environment Heterogeneity
by: Xie, Zhijie, et al.
Published: (2023)
DCPO: Dynamic Clipping Policy Optimization
by: Yang, Shihui, et al.
Published: (2025)
A Simple Mixture Policy Parameterization for Improving Sample Efficiency of CVaR Optimization
by: Luo, Yudong, et al.
Published: (2024)
3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations
by: Ze, Yanjie, et al.
Published: (2024)
Relative Policy-Transition Optimization for Fast Policy Transfer
by: Xu, Jiawei, et al.
Published: (2022)
Actor-Critic Pretraining for Proximal Policy Optimization
by: Kernbach, Andreas, et al.
Published: (2026)
MoSA: Motion-constrained Stress Adaptation for Mitigating Real-to-Sim Gap in Continuum Dynamics via Learning Residual Anisotropy
by: Wang, Jiaxu, et al.
Published: (2026)
Learning to Open and Traverse Doors with a Legged Manipulator
by: Zhang, Mike, et al.
Published: (2024)
Self-Correcting Bayesian Optimization through Bayesian Active Learning
by: Hvarfner, Carl, et al.
Published: (2023)
Uncertainty-Aware Robotic World Model Makes Offline Model-Based Reinforcement Learning Work on Real Robots
by: Li, Chenhao, et al.
Published: (2025)
Bridging the Gap: Enabling Soft Actor Critic for High Performance Legged Locomotion
by: Sabatini, Gianluca, et al.
Published: (2026)
Demystifying Group Relative Policy Optimization: Its Policy Gradient is a U-Statistic
by: Zhou, Hongyi, et al.
Published: (2026)
In-Context Freeze-Thaw Bayesian Optimization for Hyperparameter Optimization
by: Rakotoarison, Herilalaina, et al.
Published: (2024)
LEPO: Latent Reasoning Policy Optimization for Large Language Models
by: Zhou, Yuyan, et al.
Published: (2026)
Variance Reduction Based Experience Replay for Policy Optimization
by: Zheng, Hua, et al.
Published: (2026)
Hierarchical Multi-Label Contrastive Learning for Protein-Protein Interaction Prediction Across Organisms
by: Liu, Shiyi, et al.
Published: (2025)
BiLoRA: A Bi-level Optimization Framework for Overfitting-Resilient Low-Rank Adaptation of Large Pre-trained Models
by: Qiang, Rushi, et al.
Published: (2024)
Simple Optimizers for Convex Aligned Multi-Objective Optimization
by: Kretzu, Ben, et al.
Published: (2025)
Simple Denoising Diffusion Language Models
by: Zhu, Huaisheng, et al.
Published: (2025)
FedGRPO: Privately Optimizing Foundation Models with Group-Relative Rewards from Domain Client
by: Zhu, Gongxi, et al.
Published: (2026)
dFlowGRPO: Rate-Aware Policy Optimization for Discrete Flow Models
by: Wan, Zhengyan, et al.
Published: (2026)
ES-Parkour: Advanced Robot Parkour with Bio-inspired Event Camera and Spiking Neural Network
by: Zhang, Qiang, et al.
Published: (2025)
Proactive Constrained Policy Optimization with Preemptive Penalty
by: Yang, Ning, et al.
Published: (2025)