:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Chen, Xuxi, Wang, Zhendong, Sow, Daouda, Yang, Junjie, Chen, Tianlong, Liang, Yingbin, Zhou, Mingyuan, Wang, Zhangyang
Format:	Preprint
Published:	2024
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2402.14270
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Rethinking PGD Attack: Is Sign Function Necessary?
by: Yang, Junjie, et al.
Published: (2023)

Dynamic Loss-Based Sample Reweighting for Improved Large Language Model Pretraining
by: Sow, Daouda, et al.
Published: (2025)

Algorithm Design for Online Meta-Learning with Task Boundary Detection
by: Sow, Daouda, et al.
Published: (2023)

Training Dynamics of Transformers to Recognize Word Co-occurrence via Gradient Flow Analysis
by: Yang, Hongru, et al.
Published: (2024)

Meta ControlNet: Enhancing Task Adaptation via Meta Learning
by: Yang, Junjie, et al.
Published: (2023)

LoX: Low-Rank Extrapolation Robustifies LLM Safety Against Fine-tuning
by: Perin, Gabriel J., et al.
Published: (2025)

Neural Networks with Sparse Activation Induced by Large Bias: Tighter Analysis with Bias-Generalized NTK
by: Yang, Hongru, et al.
Published: (2023)

Make Optimization Once and for All with Fine-grained Guidance
by: Shi, Mingjia, et al.
Published: (2025)

Diffusion Policies creating a Trust Region for Offline Reinforcement Learning
by: Chen, Tianyu, et al.
Published: (2024)

Enhancing and Accelerating Diffusion-Based Inverse Problem Solving through Measurements Optimization
by: Chen, Tianyu, et al.
Published: (2024)

Improving Data Efficiency for LLM Reinforcement Fine-tuning Through Difficulty-targeted Online Data Selection and Rollout Replay
by: Sun, Yifan, et al.
Published: (2025)

Sparse Mixture-of-Experts for Compositional Generalization: Empirical Evidence and Theoretical Foundations of Optimal Sparsity
by: Zhao, Jinze, et al.
Published: (2024)

Enhancing Adversarial Training via Reweighting Optimization Trajectory
by: Huang, Tianjin, et al.
Published: (2023)

Few-Step Diffusion via Score identity Distillation
by: Zhou, Mingyuan, et al.
Published: (2025)

Improving Sample Efficiency of Model-Free Algorithms for Zero-Sum Markov Games
by: Feng, Songtao, et al.
Published: (2023)

Guided Score identity Distillation for Data-Free One-Step Text-to-Image Generation
by: Zhou, Mingyuan, et al.
Published: (2024)

Relative Preference Optimization: Enhancing LLM Alignment through Contrasting Responses across Identical and Diverse Prompts
by: Yin, Yueqin, et al.
Published: (2024)

DANCE: DAta-Network Co-optimization for Efficient Segmentation Model Training and Inference
by: Li, Chaojian, et al.
Published: (2021)

Neon: Negative Extrapolation From Self-Training Improves Image Generation
by: Alemohammad, Sina, et al.
Published: (2025)

In-Context Learning with Representations: Contextual Generalization of Trained Transformers
by: Yang, Tong, et al.
Published: (2024)

You Only Debias Once: Towards Flexible Accuracy-Fairness Trade-offs at Inference Time
by: Han, Xiaotian, et al.
Published: (2025)

Distilled Protein Backbone Generation
by: Xie, Liyang, et al.
Published: (2025)

Self-Augmented Preference Optimization: Off-Policy Paradigms for Language Model Alignment
by: Yin, Yueqin, et al.
Published: (2024)

Non-asymptotic Convergence of Training Transformers for Next-token Prediction
by: Huang, Ruiquan, et al.
Published: (2024)

Score Distillation Beyond Acceleration: Generative Modeling from Corrupted Data
by: Zhang, Yasi, et al.
Published: (2025)

On the Continuity of Schur-Horn Mapping
by: Chen, Hengzhun, et al.
Published: (2024)

Denoising Score Distillation: From Noisy Diffusion Pretraining to One-Step High-Quality Generation
by: Chen, Tianyu, et al.
Published: (2025)

Score identity Distillation: Exponentially Fast Distillation of Pretrained Diffusion Models for One-Step Generation
by: Zhou, Mingyuan, et al.
Published: (2024)

How Transformers Learn Regular Language Recognition: A Theoretical Study on Training Dynamics and Implicit Bias
by: Huang, Ruiquan, et al.
Published: (2025)

Deep Ensembling with No Overhead for either Training or Testing: The All-Round Blessings of Dynamic Sparsity
by: Liu, Shiwei, et al.
Published: (2021)

One Token Embedding Is Enough to Deadlock Your Large Reasoning Model
by: Zhang, Mohan, et al.
Published: (2025)

LLM-AutoDiff: Auto-Differentiate Any LLM Workflow
by: Yin, Li, et al.
Published: (2025)

Adaptive NNs asymptotic tracking control for high‐order nonlinear systems under prescribed performance and asymmetric output constraints
by: Kun Jiang, et al.
Published: (2024)

CPL: Critical Plan Step Learning Boosts LLM Generalization in Reasoning Tasks
by: Wang, Tianlong, et al.
Published: (2024)

Improving Accuracy-robustness Trade-off via Pixel Reweighted Adversarial Training
by: Zhang, Jiacheng, et al.
Published: (2024)

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
by: Zhao, Jiawei, et al.
Published: (2024)

Rethinking Data Curation in LLM Training: Online Reweighting Offers Better Generalization than Offline Methods
by: Zhao, Wanru, et al.
Published: (2026)

IFFair: Influence Function-driven Sample Reweighting for Fair Classification
by: Yang, Jingran, et al.
Published: (2025)

Model-GLUE: Democratized LLM Scaling for A Large Model Zoo in the Wild
by: Zhao, Xinyu, et al.
Published: (2024)

CSR and sustainable development: Multinationals are they socially responsible in Sub-Saharan Africa? The case of Areva in Niger
by: Youssoufou Hamadou Daouda
Published: (2014)