| Main Authors: | Yao, Wei, Xu, Gengze, Tang, Huayi, Yang, Wenkai, Di, Donglin, Wang, Ziqiao, Liu, Yong |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.03109 |
Similar Items
The Capabilities and Limitations of Weak-to-Strong Generalization: Generalization and Calibration
by: Yao, Wei, et al.
Published: (2025)
On the Emergence of Weak-to-Strong Generalization: A Bias-Variance Perspective
by: Xu, Gengze, et al.
Published: (2025)
On the Blessing of Pre-training in Weak-to-Strong Generalization
by: Yao, Wei, et al.
Published: (2026)
Revisiting Weak-to-Strong Generalization in Theory and Practice: Reverse KL vs. Forward KL
by: Yao, Wei, et al.
Published: (2025)
On $f$-Divergence Principled Domain Adaptation: An Improved Framework
by: Wang, Ziqiao, et al.
Published: (2024)
Information-Theoretic Generalization Bounds for Transductive Learning and its Applications
by: Tang, Huayi, et al.
Published: (2023)
PAC-Bayesian Generalization Bounds for Graph Convolutional Networks on Inductive Node Classification
by: Tang, Huayi, et al.
Published: (2025)
Understanding Model Ensemble in Transferable Adversarial Attack
by: Yao, Wei, et al.
Published: (2024)
Theoretical Insights into Fine-Tuning Attention Mechanism: Generalization and Optimization
by: Yao, Xinhao, et al.
Published: (2024)
Generalization Bounds via Conditional $f$-Information
by: Wang, Ziqiao, et al.
Published: (2024)
Perfect Alignment May be Poisonous to Graph Contrastive Learning
by: Liu, Jingyu, et al.
Published: (2023)
Towards Auto-Regressive Next-Token Prediction: In-Context Learning Emerges from Generalization
by: Gong, Zixuan, et al.
Published: (2025)
Transformers as Intrinsic Optimizers: Forward Inference through the Energy Principle
by: Ren, Ruifeng, et al.
Published: (2025)
Sparsity is Combinatorial Depth: Quantifying MoE Expressivity via Tropical Geometry
by: Su, Ye, et al.
Published: (2026)
Robust Offline Reinforcement Learning with Linearly Structured f-Divergence Regularization
by: Tang, Cheng, et al.
Published: (2024)
Loss Functions and Operators Generated by f-Divergences
by: Roulet, Vincent, et al.
Published: (2025)
CLID-MU: Cross-Layer Information Divergence Based Meta Update Strategy for Learning with Noisy Labels
by: Hu, Ruofan, et al.
Published: (2025)
$f$-Divergence Regularized RLHF: Two Tales of Sampling and Unified Analyses
by: Wu, Di, et al.
Published: (2026)
Two Facets of SDE Under an Information-Theoretic Lens: Generalization of SGD via Training Trajectories and via Terminal States
by: Wang, Ziqiao, et al.
Published: (2022)
Minimizing $f$-Divergences by Interpolating Velocity Fields
by: Liu, Song, et al.
Published: (2023)
SteinGen: Generating Fidelitous and Diverse Graph Samples
by: Reinert, Gesine, et al.
Published: (2024)
Weak-to-Strong Generalization with Failure Trajectories: A Tree-based Approach to Elicit Optimal Policy in Strong Models
by: Ye, Ruimeng, et al.
Published: (2025)
Efficient 4D fMRI ASD Classification using Spatial-Temporal-Omics-based Learning Framework
by: Weng, Ziqiao, et al.
Published: (2025)
Generalization Error of $f$-Divergence Stabilized Algorithms via Duality
by: Daunas, Francisco, et al.
Published: (2025)
Effective Frontiers: A Unification of Neural Scaling Laws
by: Zou, Jiaxuan, et al.
Published: (2026)
Theoretical Analysis of Weak-to-Strong Generalization
by: Lang, Hunter, et al.
Published: (2024)
Quantifying the Gain in Weak-to-Strong Generalization
by: Charikar, Moses, et al.
Published: (2024)
Regularized $f$-Divergence Kernel Tests
by: Ribero, Mónica, et al.
Published: (2026)
On the Mechanisms of Weak-to-Strong Generalization: A Theoretical Perspective
by: Moniri, Behrad, et al.
Published: (2025)
Provable Weak-to-Strong Generalization via Benign Overfitting
by: Wu, David X., et al.
Published: (2024)
Weak-to-Strong Generalization is Nearly Inevitable (in Linear Models)
by: Geng, Scott, et al.
Published: (2026)
Generalization in Federated Learning: A Conditional Mutual Information Framework
by: Wang, Ziqiao, et al.
Published: (2025)
Weak-to-Strong Generalization under Distribution Shifts
by: Jeon, Myeongho, et al.
Published: (2025)
Empirical Risk Minimization with $f$-Divergence Regularization
by: Daunas, Francisco, et al.
Published: (2026)
Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization
by: Yang, Wenkai, et al.
Published: (2024)
Rethinking Training Dynamics in Scale-wise Autoregressive Generation
by: Zhou, Gengze, et al.
Published: (2025)
Weak-to-Strong Generalization Even in Random Feature Networks, Provably
by: Medvedev, Marko, et al.
Published: (2025)
DIDI: Diffusion-Guided Diversity for Offline Behavioral Generation
by: Liu, Jinxin, et al.
Published: (2024)
Learning Invariant Representations of Graph Neural Networks via Cluster Generalization
by: Xia, Donglin, et al.
Published: (2024)
Generalizing Trust: Weak-to-Strong Trustworthiness in Language Models
by: Pawelczyk, Martin, et al.
Published: (2024)