| Main Authors: | Chen, Xuxi, Wang, Zhendong, Sow, Daouda, Yang, Junjie, Chen, Tianlong, Liang, Yingbin, Zhou, Mingyuan, Wang, Zhangyang |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2402.14270 |
Similar Items
Rethinking PGD Attack: Is Sign Function Necessary?
by: Yang, Junjie, et al.
Published: (2023)
Dynamic Loss-Based Sample Reweighting for Improved Large Language Model Pretraining
by: Sow, Daouda, et al.
Published: (2025)
Algorithm Design for Online Meta-Learning with Task Boundary Detection
by: Sow, Daouda, et al.
Published: (2023)
Training Dynamics of Transformers to Recognize Word Co-occurrence via Gradient Flow Analysis
by: Yang, Hongru, et al.
Published: (2024)
Meta ControlNet: Enhancing Task Adaptation via Meta Learning
by: Yang, Junjie, et al.
Published: (2023)
LoX: Low-Rank Extrapolation Robustifies LLM Safety Against Fine-tuning
by: Perin, Gabriel J., et al.
Published: (2025)
Neural Networks with Sparse Activation Induced by Large Bias: Tighter Analysis with Bias-Generalized NTK
by: Yang, Hongru, et al.
Published: (2023)
Make Optimization Once and for All with Fine-grained Guidance
by: Shi, Mingjia, et al.
Published: (2025)
Diffusion Policies creating a Trust Region for Offline Reinforcement Learning
by: Chen, Tianyu, et al.
Published: (2024)
Enhancing and Accelerating Diffusion-Based Inverse Problem Solving through Measurements Optimization
by: Chen, Tianyu, et al.
Published: (2024)
Improving Data Efficiency for LLM Reinforcement Fine-tuning Through Difficulty-targeted Online Data Selection and Rollout Replay
by: Sun, Yifan, et al.
Published: (2025)
Sparse Mixture-of-Experts for Compositional Generalization: Empirical Evidence and Theoretical Foundations of Optimal Sparsity
by: Zhao, Jinze, et al.
Published: (2024)
Enhancing Adversarial Training via Reweighting Optimization Trajectory
by: Huang, Tianjin, et al.
Published: (2023)
Few-Step Diffusion via Score identity Distillation
by: Zhou, Mingyuan, et al.
Published: (2025)
Improving Sample Efficiency of Model-Free Algorithms for Zero-Sum Markov Games
by: Feng, Songtao, et al.
Published: (2023)
Guided Score identity Distillation for Data-Free One-Step Text-to-Image Generation
by: Zhou, Mingyuan, et al.
Published: (2024)
Relative Preference Optimization: Enhancing LLM Alignment through Contrasting Responses across Identical and Diverse Prompts
by: Yin, Yueqin, et al.
Published: (2024)
DANCE: DAta-Network Co-optimization for Efficient Segmentation Model Training and Inference
by: Li, Chaojian, et al.
Published: (2021)
Neon: Negative Extrapolation From Self-Training Improves Image Generation
by: Alemohammad, Sina, et al.
Published: (2025)
In-Context Learning with Representations: Contextual Generalization of Trained Transformers
by: Yang, Tong, et al.
Published: (2024)
You Only Debias Once: Towards Flexible Accuracy-Fairness Trade-offs at Inference Time
by: Han, Xiaotian, et al.
Published: (2025)
Distilled Protein Backbone Generation
by: Xie, Liyang, et al.
Published: (2025)
Self-Augmented Preference Optimization: Off-Policy Paradigms for Language Model Alignment
by: Yin, Yueqin, et al.
Published: (2024)
Non-asymptotic Convergence of Training Transformers for Next-token Prediction
by: Huang, Ruiquan, et al.
Published: (2024)
Score Distillation Beyond Acceleration: Generative Modeling from Corrupted Data
by: Zhang, Yasi, et al.
Published: (2025)
On the Continuity of Schur-Horn Mapping
by: Chen, Hengzhun, et al.
Published: (2024)
Denoising Score Distillation: From Noisy Diffusion Pretraining to One-Step High-Quality Generation
by: Chen, Tianyu, et al.
Published: (2025)
Score identity Distillation: Exponentially Fast Distillation of Pretrained Diffusion Models for One-Step Generation
by: Zhou, Mingyuan, et al.
Published: (2024)
How Transformers Learn Regular Language Recognition: A Theoretical Study on Training Dynamics and Implicit Bias
by: Huang, Ruiquan, et al.
Published: (2025)
Deep Ensembling with No Overhead for either Training or Testing: The All-Round Blessings of Dynamic Sparsity
by: Liu, Shiwei, et al.
Published: (2021)
One Token Embedding Is Enough to Deadlock Your Large Reasoning Model
by: Zhang, Mohan, et al.
Published: (2025)
LLM-AutoDiff: Auto-Differentiate Any LLM Workflow
by: Yin, Li, et al.
Published: (2025)
Adaptive NNs asymptotic tracking control for high‐order nonlinear systems under prescribed performance and asymmetric output constraints
by: Kun Jiang, et al.
Published: (2024)
CPL: Critical Plan Step Learning Boosts LLM Generalization in Reasoning Tasks
by: Wang, Tianlong, et al.
Published: (2024)
Improving Accuracy-robustness Trade-off via Pixel Reweighted Adversarial Training
by: Zhang, Jiacheng, et al.
Published: (2024)
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
by: Zhao, Jiawei, et al.
Published: (2024)
Rethinking Data Curation in LLM Training: Online Reweighting Offers Better Generalization than Offline Methods
by: Zhao, Wanru, et al.
Published: (2026)
IFFair: Influence Function-driven Sample Reweighting for Fair Classification
by: Yang, Jingran, et al.
Published: (2025)
Model-GLUE: Democratized LLM Scaling for A Large Model Zoo in the Wild
by: Zhao, Xinyu, et al.
Published: (2024)
CSR and sustainable development: Multinationals are they socially responsible in Sub-Saharan Africa? The case of Areva in Niger
by: Youssoufou Hamadou Daouda
Published: (2014)