:: Library Catalog

Bibliographic Details
Main Authors:	Zhao, Weiye; Li, Feihan; Sun, Yifan; Wang, Yujie; Chen, Rui; Wei, Tianhao; Liu, Changliu
Format:	Preprint
Published:	2024
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2410.01212

Similar Items

State-wise Constrained Policy Optimization
by: Zhao, Weiye, et al.
Published: (2023)

Absolute Policy Optimization
by: Zhao, Weiye, et al.
Published: (2023)

Learn With Imagination: Safe Set Guided State-wise Constrained Policy Optimization
by: Sun, Yifan, et al.
Published: (2023)

GUARD: A Safe Reinforcement Learning Benchmark
by: Zhao, Weiye, et al.
Published: (2023)

Implicit Safe Set Algorithm for Provably Safe Reinforcement Learning
by: Zhao, Weiye, et al.
Published: (2024)

Continual Learning and Lifting of Koopman Dynamics for Linear Control of Legged Robots
by: Li, Feihan, et al.
Published: (2024)

Physics-Aware Combinatorial Assembly Sequence Planning using Data-free Action Masking
by: Liu, Ruixuan, et al.
Published: (2024)

Augmented Lagrangian Multiplier Network for State-wise Safety in Reinforcement Learning
by: Zhang, Jiaming, et al.
Published: (2026)

Verification of Neural Control Barrier Functions with Symbolic Derivative Bounds Propagation
by: Hu, Hanjiang, et al.
Published: (2024)

Safety Index Synthesis with State-dependent Control Space
by: Chen, Rui, et al.
Published: (2023)

AdaTrans: Feature-wise and Sample-wise Adaptive Transfer Learning for High-dimensional Regression
by: He, Zelin, et al.
Published: (2024)

Contextual Bandits with Stage-wise Constraints
by: Pacchiano, Aldo, et al.
Published: (2024)

Landscape of Policy Optimization for Finite Horizon MDPs with General State and Action
by: Chen, Xin, et al.
Published: (2024)

Meta-Control: Automatic Model-based Control Synthesis for Heterogeneous Robot Skills
by: Wei, Tianhao, et al.
Published: (2024)

SPARK: Safe Protective and Assistive Robot Kit
by: Sun, Yifan, et al.
Published: (2025)

Conditional Policy Generator for Dynamic Constraint Satisfaction and Optimization
by: Lee, Wook, et al.
Published: (2025)

A Lightweight and Transferable Design for Robust LEGO Manipulation
by: Liu, Ruixuan, et al.
Published: (2023)

Partition-wise Graph Filtering: A Unified Perspective Through the Lens of Graph Coarsening
by: Li, Guoming, et al.
Published: (2025)

A Layer-wise Analysis of Supervised Fine-Tuning
by: Zhao, Qinghua, et al.
Published: (2026)

Verifiable Safety Q-Filters via Hamilton-Jacobi Reachability and Multiplicative Q-Networks
by: Li, Jiaxing, et al.
Published: (2025)

Mitigating the Safety Alignment Tax with Null-Space Constrained Policy Optimization
by: Niu, Yifan, et al.
Published: (2025)

Step-wise Rubric Rewards for LLM Reasoning
by: Xie, Weichu, et al.
Published: (2026)

Memory-adaptive Depth-wise Heterogeneous Federated Learning
by: Zhang, Kai, et al.
Published: (2023)

Universally Empowering Zeroth-Order Optimization via Adaptive Layer-wise Sampling
by: Wang, Fei, et al.
Published: (2026)

SCFormer: Structured Channel-wise Transformer with Cumulative Historical State for Multivariate Time Series Forecasting
by: Guo, Shiwei, et al.
Published: (2025)

HELENE: Hessian Layer-wise Clipping and Gradient Annealing for Accelerating Fine-tuning LLM with Zeroth-order Optimization
by: Zhao, Huaqin, et al.
Published: (2024)

Improving Policy Optimization via $\varepsilon$-Retrain
by: Marzari, Luca, et al.
Published: (2024)

Layer-wise dynamic rank for compressing large language models
by: Mi, Zhendong, et al.
Published: (2025)

ShuffleGate: Scalable Feature Optimization for Recommender Systems via Batch-wise Sensitivity Learning
by: Huang, Yihong, et al.
Published: (2025)

Node-wise Filtering in Graph Neural Networks: A Mixture of Experts Approach
by: Han, Haoyu, et al.
Published: (2024)

Adversarial Curriculum Graph Contrastive Learning with Pair-wise Augmentation
by: Zhao, Xinjian, et al.
Published: (2024)

PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training
by: Zhu, Dawei, et al.
Published: (2023)

Patch-wise Structural Loss for Time Series Forecasting
by: Kudrat, Dilfira, et al.
Published: (2025)

MISA: Memory-Efficient LLMs Optimization with Module-wise Importance Sampling
by: Liu, Yuxi, et al.
Published: (2025)

Hierarchical Group-wise Ranking Framework for Recommendation Models
by: Yan, YaChen, et al.
Published: (2025)

The Procrustean Bed of Time Series: The Optimization Bias in Point-wise Loss Functions
by: Cai, Rongyao, et al.
Published: (2025)

Training Long-Context LLMs Efficiently via Chunk-wise Optimization
by: Li, Wenhao, et al.
Published: (2025)

Is Aggregation the Only Choice? Federated Learning via Layer-wise Model Recombination
by: Hu, Ming, et al.
Published: (2023)

Revisiting the Initial Steps in Adaptive Gradient Descent Optimization
by: Abuduweili, Abulikemu, et al.
Published: (2024)

Layer-wise Linear Mode Connectivity
by: Adilova, Linara, et al.
Published: (2023)