Saved in:
| Main Authors: | Zhao, Weiye, Li, Feihan, Sun, Yifan, Wang, Yujie, Chen, Rui, Wei, Tianhao, Liu, Changliu |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2410.01212 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
State-wise Constrained Policy Optimization
by: Zhao, Weiye, et al.
Published: (2023)
by: Zhao, Weiye, et al.
Published: (2023)
Absolute Policy Optimization
by: Zhao, Weiye, et al.
Published: (2023)
by: Zhao, Weiye, et al.
Published: (2023)
Learn With Imagination: Safe Set Guided State-wise Constrained Policy Optimization
by: Sun, Yifan, et al.
Published: (2023)
by: Sun, Yifan, et al.
Published: (2023)
GUARD: A Safe Reinforcement Learning Benchmark
by: Zhao, Weiye, et al.
Published: (2023)
by: Zhao, Weiye, et al.
Published: (2023)
Implicit Safe Set Algorithm for Provably Safe Reinforcement Learning
by: Zhao, Weiye, et al.
Published: (2024)
by: Zhao, Weiye, et al.
Published: (2024)
Continual Learning and Lifting of Koopman Dynamics for Linear Control of Legged Robots
by: Li, Feihan, et al.
Published: (2024)
by: Li, Feihan, et al.
Published: (2024)
Physics-Aware Combinatorial Assembly Sequence Planning using Data-free Action Masking
by: Liu, Ruixuan, et al.
Published: (2024)
by: Liu, Ruixuan, et al.
Published: (2024)
Augmented Lagrangian Multiplier Network for State-wise Safety in Reinforcement Learning
by: Zhang, Jiaming, et al.
Published: (2026)
by: Zhang, Jiaming, et al.
Published: (2026)
Verification of Neural Control Barrier Functions with Symbolic Derivative Bounds Propagation
by: Hu, Hanjiang, et al.
Published: (2024)
by: Hu, Hanjiang, et al.
Published: (2024)
Safety Index Synthesis with State-dependent Control Space
by: Chen, Rui, et al.
Published: (2023)
by: Chen, Rui, et al.
Published: (2023)
AdaTrans: Feature-wise and Sample-wise Adaptive Transfer Learning for High-dimensional Regression
by: He, Zelin, et al.
Published: (2024)
by: He, Zelin, et al.
Published: (2024)
Contextual Bandits with Stage-wise Constraints
by: Pacchiano, Aldo, et al.
Published: (2024)
by: Pacchiano, Aldo, et al.
Published: (2024)
Landscape of Policy Optimization for Finite Horizon MDPs with General State and Action
by: Chen, Xin, et al.
Published: (2024)
by: Chen, Xin, et al.
Published: (2024)
Meta-Control: Automatic Model-based Control Synthesis for Heterogeneous Robot Skills
by: Wei, Tianhao, et al.
Published: (2024)
by: Wei, Tianhao, et al.
Published: (2024)
SPARK: Safe Protective and Assistive Robot Kit
by: Sun, Yifan, et al.
Published: (2025)
by: Sun, Yifan, et al.
Published: (2025)
Conditional Policy Generator for Dynamic Constraint Satisfaction and Optimization
by: Lee, Wook, et al.
Published: (2025)
by: Lee, Wook, et al.
Published: (2025)
A Lightweight and Transferable Design for Robust LEGO Manipulation
by: Liu, Ruixuan, et al.
Published: (2023)
by: Liu, Ruixuan, et al.
Published: (2023)
Partition-wise Graph Filtering: A Unified Perspective Through the Lens of Graph Coarsening
by: Li, Guoming, et al.
Published: (2025)
by: Li, Guoming, et al.
Published: (2025)
A Layer-wise Analysis of Supervised Fine-Tuning
by: Zhao, Qinghua, et al.
Published: (2026)
by: Zhao, Qinghua, et al.
Published: (2026)
Verifiable Safety Q-Filters via Hamilton-Jacobi Reachability and Multiplicative Q-Networks
by: Li, Jiaxing, et al.
Published: (2025)
by: Li, Jiaxing, et al.
Published: (2025)
Mitigating the Safety Alignment Tax with Null-Space Constrained Policy Optimization
by: Niu, Yifan, et al.
Published: (2025)
by: Niu, Yifan, et al.
Published: (2025)
Step-wise Rubric Rewards for LLM Reasoning
by: Xie, Weichu, et al.
Published: (2026)
by: Xie, Weichu, et al.
Published: (2026)
Memory-adaptive Depth-wise Heterogeneous Federated Learning
by: Zhang, Kai, et al.
Published: (2023)
by: Zhang, Kai, et al.
Published: (2023)
Universally Empowering Zeroth-Order Optimization via Adaptive Layer-wise Sampling
by: Wang, Fei, et al.
Published: (2026)
by: Wang, Fei, et al.
Published: (2026)
SCFormer: Structured Channel-wise Transformer with Cumulative Historical State for Multivariate Time Series Forecasting
by: Guo, Shiwei, et al.
Published: (2025)
by: Guo, Shiwei, et al.
Published: (2025)
HELENE: Hessian Layer-wise Clipping and Gradient Annealing for Accelerating Fine-tuning LLM with Zeroth-order Optimization
by: Zhao, Huaqin, et al.
Published: (2024)
by: Zhao, Huaqin, et al.
Published: (2024)
Improving Policy Optimization via $\varepsilon$-Retrain
by: Marzari, Luca, et al.
Published: (2024)
by: Marzari, Luca, et al.
Published: (2024)
Layer-wise dynamic rank for compressing large language models
by: Mi, Zhendong, et al.
Published: (2025)
by: Mi, Zhendong, et al.
Published: (2025)
ShuffleGate: Scalable Feature Optimization for Recommender Systems via Batch-wise Sensitivity Learning
by: Huang, Yihong, et al.
Published: (2025)
by: Huang, Yihong, et al.
Published: (2025)
Node-wise Filtering in Graph Neural Networks: A Mixture of Experts Approach
by: Han, Haoyu, et al.
Published: (2024)
by: Han, Haoyu, et al.
Published: (2024)
Adversarial Curriculum Graph Contrastive Learning with Pair-wise Augmentation
by: Zhao, Xinjian, et al.
Published: (2024)
by: Zhao, Xinjian, et al.
Published: (2024)
PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training
by: Zhu, Dawei, et al.
Published: (2023)
by: Zhu, Dawei, et al.
Published: (2023)
Patch-wise Structural Loss for Time Series Forecasting
by: Kudrat, Dilfira, et al.
Published: (2025)
by: Kudrat, Dilfira, et al.
Published: (2025)
MISA: Memory-Efficient LLMs Optimization with Module-wise Importance Sampling
by: Liu, Yuxi, et al.
Published: (2025)
by: Liu, Yuxi, et al.
Published: (2025)
Hierarchical Group-wise Ranking Framework for Recommendation Models
by: Yan, YaChen, et al.
Published: (2025)
by: Yan, YaChen, et al.
Published: (2025)
The Procrustean Bed of Time Series: The Optimization Bias in Point-wise Loss Functions
by: Cai, Rongyao, et al.
Published: (2025)
by: Cai, Rongyao, et al.
Published: (2025)
Training Long-Context LLMs Efficiently via Chunk-wise Optimization
by: Li, Wenhao, et al.
Published: (2025)
by: Li, Wenhao, et al.
Published: (2025)
Is Aggregation the Only Choice? Federated Learning via Layer-wise Model Recombination
by: Hu, Ming, et al.
Published: (2023)
by: Hu, Ming, et al.
Published: (2023)
Revisiting the Initial Steps in Adaptive Gradient Descent Optimization
by: Abuduweili, Abulikemu, et al.
Published: (2024)
by: Abuduweili, Abulikemu, et al.
Published: (2024)
Layer-wise Linear Mode Connectivity
by: Adilova, Linara, et al.
Published: (2023)
by: Adilova, Linara, et al.
Published: (2023)
Similar Items
-
State-wise Constrained Policy Optimization
by: Zhao, Weiye, et al.
Published: (2023) -
Absolute Policy Optimization
by: Zhao, Weiye, et al.
Published: (2023) -
Learn With Imagination: Safe Set Guided State-wise Constrained Policy Optimization
by: Sun, Yifan, et al.
Published: (2023) -
GUARD: A Safe Reinforcement Learning Benchmark
by: Zhao, Weiye, et al.
Published: (2023) -
Implicit Safe Set Algorithm for Provably Safe Reinforcement Learning
by: Zhao, Weiye, et al.
Published: (2024)