Saved in:
| Main Authors: | Wei, Xiyuan, Zhou, Linli, Wang, Bokun, Lin, Chih-Jen, Yang, Tianbao |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.02877 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
NeuCLIP: Efficient Large-Scale CLIP Training with Neural Normalizer Optimization
by: Wei, Xiyuan, et al.
Published: (2025)
by: Wei, Xiyuan, et al.
Published: (2025)
A Near-Optimal Single-Loop Stochastic Algorithm for Convex Finite-Sum Coupled Compositional Optimization
by: Wang, Bokun, et al.
Published: (2023)
by: Wang, Bokun, et al.
Published: (2023)
Stochastic Primal-Dual Double Block-Coordinate for Two-way Partial AUC Maximization
by: Zhou, Linli, et al.
Published: (2025)
by: Zhou, Linli, et al.
Published: (2025)
Stochastic Momentum Methods for Non-smooth Non-Convex Finite-Sum Coupled Compositional Optimization
by: Chen, Xingyu, et al.
Published: (2025)
by: Chen, Xingyu, et al.
Published: (2025)
On Discriminative Probabilistic Modeling for Self-Supervised Representation Learning
by: Wang, Bokun, et al.
Published: (2024)
by: Wang, Bokun, et al.
Published: (2024)
Discovering Global False Negatives On the Fly for Self-supervised Contrastive Learning
by: Balmaseda, Vicente, et al.
Published: (2025)
by: Balmaseda, Vicente, et al.
Published: (2025)
Breaking the Limits of Open-Weight CLIP: An Optimization Framework for Self-supervised Fine-tuning of CLIP
by: Mehta, Anant, et al.
Published: (2026)
by: Mehta, Anant, et al.
Published: (2026)
Entropic Risk-Aware Monte Carlo Tree Search
by: Santos, Pedro P., et al.
Published: (2026)
by: Santos, Pedro P., et al.
Published: (2026)
DARE: Diffusion Language Model Activation Reuse for Efficient Inference
by: Frumkin, Natalia, et al.
Published: (2026)
by: Frumkin, Natalia, et al.
Published: (2026)
Single-loop Algorithms for Stochastic Non-convex Optimization with Weakly-Convex Constraints
by: Yang, Ming, et al.
Published: (2025)
by: Yang, Ming, et al.
Published: (2025)
Model Steering: Learning with a Reference Model Improves Generalization Bounds and Scaling Laws
by: Wei, Xiyuan, et al.
Published: (2025)
by: Wei, Xiyuan, et al.
Published: (2025)
FastCLIP: A Suite of Optimization Techniques to Accelerate CLIP Training with Limited Resources
by: Wei, Xiyuan, et al.
Published: (2024)
by: Wei, Xiyuan, et al.
Published: (2024)
Risk-Entropic Flow Matching
by: Ramezani, Vahid R., et al.
Published: (2025)
by: Ramezani, Vahid R., et al.
Published: (2025)
Compositional Risk Minimization
by: Mahajan, Divyat, et al.
Published: (2024)
by: Mahajan, Divyat, et al.
Published: (2024)
Communication-Efficient Federated Group Distributionally Robust Optimization
by: Guo, Zhishuai, et al.
Published: (2024)
by: Guo, Zhishuai, et al.
Published: (2024)
DRPO: Efficient Reasoning via Decoupled Reward Policy Optimization
by: Li, Gang, et al.
Published: (2025)
by: Li, Gang, et al.
Published: (2025)
Efficient Risk-sensitive Planning via Entropic Risk Measures
by: Marthe, Alexandre, et al.
Published: (2025)
by: Marthe, Alexandre, et al.
Published: (2025)
Efficient Sharpness-Aware Minimization for Molecular Graph Transformer Models
by: Wang, Yili, et al.
Published: (2024)
by: Wang, Yili, et al.
Published: (2024)
Provably Efficient Action-Manipulation Attack Against Continuous Reinforcement Learning
by: Luo, Zhi, et al.
Published: (2024)
by: Luo, Zhi, et al.
Published: (2024)
Federated Compositional Deep AUC Maximization
by: Zhang, Xinwen, et al.
Published: (2023)
by: Zhang, Xinwen, et al.
Published: (2023)
Non-Smooth Weakly-Convex Finite-sum Coupled Compositional Optimization
by: Hu, Quanqi, et al.
Published: (2023)
by: Hu, Quanqi, et al.
Published: (2023)
Entropic Auto-Encoding via Implicit Free-Energy Minimization
by: Aliahmadi, Hazhir, et al.
Published: (2026)
by: Aliahmadi, Hazhir, et al.
Published: (2026)
RASR: Risk-Averse Soft-Robust MDPs with EVaR and Entropic Risk
by: Hau, Jia Lin, et al.
Published: (2022)
by: Hau, Jia Lin, et al.
Published: (2022)
AdFair-CLIP: Adversarial Fair Contrastive Language-Image Pre-training for Chest X-rays
by: Yi, Chenlang, et al.
Published: (2025)
by: Yi, Chenlang, et al.
Published: (2025)
Memory-Efficient Continual Learning with CLIP Models
by: King, Ryan, et al.
Published: (2026)
by: King, Ryan, et al.
Published: (2026)
Navigating Potholes with Geometry-Aware Sharpness Minimization
by: Dufort-Labbé, Simon, et al.
Published: (2026)
by: Dufort-Labbé, Simon, et al.
Published: (2026)
Dual Adaptivity: Universal Algorithms for Minimizing the Adaptive Regret of Convex Functions
by: Zhang, Lijun, et al.
Published: (2025)
by: Zhang, Lijun, et al.
Published: (2025)
Efficient and Effective Implicit Dynamic Graph Neural Network
by: Zhong, Yongjian, et al.
Published: (2024)
by: Zhong, Yongjian, et al.
Published: (2024)
DRTriton: Large-Scale Synthetic Data Driven Reinforcement Learning for Triton Kernel Generation
by: Guo, Siqi, et al.
Published: (2026)
by: Guo, Siqi, et al.
Published: (2026)
A Universal Class of Sharpness-Aware Minimization Algorithms
by: Tahmasebi, Behrooz, et al.
Published: (2024)
by: Tahmasebi, Behrooz, et al.
Published: (2024)
Sharpness-Aware Minimization Efficiently Selects Flatter Minima Late in Training
by: Zhou, Zhanpeng, et al.
Published: (2024)
by: Zhou, Zhanpeng, et al.
Published: (2024)
Single-Loop Stochastic Algorithms for Difference of Max-Structured Weakly Convex Functions
by: Hu, Quanqi, et al.
Published: (2024)
by: Hu, Quanqi, et al.
Published: (2024)
Which LLMs are Difficult to Detect? A Detailed Analysis of Potential Factors Contributing to Difficulties in LLM Text Detection
by: Thorat, Shantanu, et al.
Published: (2024)
by: Thorat, Shantanu, et al.
Published: (2024)
Computing Pure-Strategy Nash Equilibria in a Two-Party Policy Competition: Existence and Algorithmic Approaches
by: Lin, Chuang-Chieh, et al.
Published: (2025)
by: Lin, Chuang-Chieh, et al.
Published: (2025)
Sparse Layer Sharpness-Aware Minimization for Efficient Fine-Tuning
by: Cheng, Yifei, et al.
Published: (2026)
by: Cheng, Yifei, et al.
Published: (2026)
An Information-Minimal Geometry for Qubit-Efficient Optimization
by: Ma, Gordon, et al.
Published: (2025)
by: Ma, Gordon, et al.
Published: (2025)
SOREL: A Stochastic Algorithm for Spectral Risks Minimization
by: Ge, Yuze, et al.
Published: (2024)
by: Ge, Yuze, et al.
Published: (2024)
Optimal Algorithms for Stochastic Complementary Composite Minimization
by: d'Aspremont, Alexandre, et al.
Published: (2022)
by: d'Aspremont, Alexandre, et al.
Published: (2022)
A Geometry-Aware Algorithm to Learn Hierarchical Embeddings in Hyperbolic Space
by: Wang, Zhangyu, et al.
Published: (2024)
by: Wang, Zhangyu, et al.
Published: (2024)
Exploring space efficiency in a tree-based linear model for extreme multi-label classification
by: Lin, He-Zhe, et al.
Published: (2024)
by: Lin, He-Zhe, et al.
Published: (2024)
Similar Items
-
NeuCLIP: Efficient Large-Scale CLIP Training with Neural Normalizer Optimization
by: Wei, Xiyuan, et al.
Published: (2025) -
A Near-Optimal Single-Loop Stochastic Algorithm for Convex Finite-Sum Coupled Compositional Optimization
by: Wang, Bokun, et al.
Published: (2023) -
Stochastic Primal-Dual Double Block-Coordinate for Two-way Partial AUC Maximization
by: Zhou, Linli, et al.
Published: (2025) -
Stochastic Momentum Methods for Non-smooth Non-Convex Finite-Sum Coupled Compositional Optimization
by: Chen, Xingyu, et al.
Published: (2025) -
On Discriminative Probabilistic Modeling for Self-Supervised Representation Learning
by: Wang, Bokun, et al.
Published: (2024)