| Main Authors: | Liu, Liming, Zhang, Zixuan, Du, Simon, Zhao, Tuo |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2503.02809 |
Similar Items
BackPlay: Head-Only Look-Back Self-Correction for Diffusion Language Models
by: Liu, Liming, et al.
Published: (2026)
COSMOS: A Hybrid Adaptive Optimizer for Memory-Efficient Training of LLMs
by: Liu, Liming, et al.
Published: (2025)
On the Interplay Between Stepsize Tuning and Progressive Sharpening
by: Roulet, Vincent, et al.
Published: (2023)
Diffusion Model for Manifold Data: Score Decomposition, Curvature, and Statistical Complexity
by: Zhang, Zixuan, et al.
Published: (2026)
NorMuon: Making Muon more efficient and scalable
by: Li, Zichong, et al.
Published: (2025)
Understanding Sharpness Dynamics in NN Training with a Minimalist Example: The Effects of Dataset Difficulty, Depth, Stochasticity, and More
by: Yoo, Geonhui, et al.
Published: (2025)
Minimalist Softmax Attention Provably Learns Constrained Boolean Functions
by: Hu, Jerry Yao-Chieh, et al.
Published: (2025)
Differential Smoothing Mitigates Sharpening and Improves LLM Reasoning
by: Gai, Jingchu, et al.
Published: (2025)
Provably Unlearnable Data Examples
by: Wang, Derui, et al.
Published: (2024)
Robust Reinforcement Learning from Corrupted Human Feedback
by: Bukharin, Alexander, et al.
Published: (2024)
Nonparametric Classification on Low Dimensional Manifolds using Overparameterized Convolutional Residual Networks
by: Zhang, Zixuan, et al.
Published: (2023)
Self-Consistency via Marginal Sharpening
by: Arzhantsev, Aleksei, et al.
Published: (2026)
Understanding Pan-Sharpening via Generalized Inverse
by: Liu, Shiqi, et al.
Published: (2023)
A Minimalist Prompt for Zero-Shot Policy Learning
by: Song, Meng, et al.
Published: (2024)
Rewarding the Unlikely: Lifting GRPO Beyond Distribution Sharpening
by: He, Andre, et al.
Published: (2025)
Self-Improvement in Language Models: The Sharpening Mechanism
by: Huang, Audrey, et al.
Published: (2024)
Minimalist Concept Erasure in Generative Models
by: Zhang, Yang, et al.
Published: (2025)
Beyond Distribution Sharpening: The Importance of Task Rewards
by: Mittal, Sarthak, et al.
Published: (2026)
Sharpened Lazy Incremental Quasi-Newton Method
by: Lahoti, Aakash, et al.
Published: (2023)
MINTS: Minimalist Thompson Sampling
by: Wang, Kaizheng
Published: (2026)
Soft-Label Caching and Sharpening for Communication-Efficient Federated Distillation
by: Azuma, Kitsuya, et al.
Published: (2025)
Graph Transductive Sharpening: Leveraging Unlabeled Predictions in Node Classification
by: Zaz, Brown, et al.
Published: (2026)
Minimalist Visual Inertial Odometry
by: Pasti, Francesco, et al.
Published: (2026)
SlimMoE: Structured Compression of Large MoE Models via Expert Slimming and Distillation
by: Li, Zichong, et al.
Published: (2025)
A Minimalist Bayesian Framework for Stochastic Optimization
by: Wang, Kaizheng
Published: (2025)
Laplacian Score Sharpening for Mitigating Hallucination in Diffusion Models
by: C, Barath Chandran., et al.
Published: (2025)
Edge of Stochastic Stability: Revisiting the Edge of Stability for SGD
by: Andreyev, Arseniy, et al.
Published: (2024)
The Origin of Edge of Stability
by: Litman, Elon
Published: (2026)
Persistent Classification: A New Approach to Stability of Data and Adversarial Examples
by: Bell, Brian, et al.
Published: (2024)
Laplacian Canonization: A Minimalist Approach to Sign and Basis Invariant Spectral Embedding
by: Ma, Jiangyan, et al.
Published: (2023)
Sharpen Your Flow: Sharpness-Aware Sampling for Flow Matching
by: Gupta, Aditi, et al.
Published: (2026)
RAZOR: Sharpening Knowledge by Cutting Bias with Unsupervised Text Rewriting
by: Yang, Shuo, et al.
Published: (2024)
SEEN: Sharpening Explanations for Graph Neural Networks using Explanations from Neighborhoods
by: Cho, Hyeoncheol, et al.
Published: (2021)
A Minimalist Method for Fine-tuning Text-to-Image Diffusion Models
by: Miao, Yanting, et al.
Published: (2025)
SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters
by: Xiao, Teng, et al.
Published: (2025)
Gaussian Match-and-Copy: A Minimalist Benchmark for Studying Transformer Induction
by: Gonon, Antoine, et al.
Published: (2026)
When Sharpening Becomes Collapse: Sampling Bias and Semantic Coupling in RL with Verifiable Rewards
by: Fan, Mingyuan, et al.
Published: (2026)
B3C: A Minimalist Approach to Offline Multi-Agent Reinforcement Learning
by: Kim, Woojun, et al.
Published: (2025)
A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce
by: Xiong, Wei, et al.
Published: (2025)
Efficient Edge LLMs Deployment via HessianAware Quantization and CPU GPU Collaborative
by: Zhang, Tuo, et al.
Published: (2025)