Saved in:
| Main Authors: | Ji, Yuliang, Wu, Jian, Xi, Yuanzhe |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.00642 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
I can't see it but I can Fine-tune it: On Encrypted Fine-tuning of Transformers using Fully Homomorphic Encryption
by: Panzade, Prajwal, et al.
Published: (2024)
by: Panzade, Prajwal, et al.
Published: (2024)
You can't handle the (dirty) truth: Data-centric insights improve pseudo-labeling
by: Seedat, Nabeel, et al.
Published: (2024)
by: Seedat, Nabeel, et al.
Published: (2024)
Why can't Epidemiology be automated (yet)?
by: Bann, David, et al.
Published: (2025)
by: Bann, David, et al.
Published: (2025)
FlexAct: Why Learn when you can Pick?
by: Kumar, Ramnath, et al.
Published: (2026)
by: Kumar, Ramnath, et al.
Published: (2026)
Why social sciences are natural, and why they can't
by: JESÚS ZAMORA-BONILLA
Published: (2012)
by: JESÚS ZAMORA-BONILLA
Published: (2012)
Influence-based Attributions can be Manipulated
by: Yadav, Chhavi, et al.
Published: (2024)
by: Yadav, Chhavi, et al.
Published: (2024)
Why can't we be friends? Untangling conjoined polarization in America
by: Julie M. Norman, et al.
Published: (2025)
by: Julie M. Norman, et al.
Published: (2025)
Optimization for Neural Operators can Benefit from Width
by: Cisneros-Velarde, Pedro, et al.
Published: (2025)
by: Cisneros-Velarde, Pedro, et al.
Published: (2025)
Unraveling the Rainbow: can value-based methods schedule?
by: Corrêa, Arthur, et al.
Published: (2025)
by: Corrêa, Arthur, et al.
Published: (2025)
When, Where and Why to Average Weights?
by: Ajroldi, Niccolò, et al.
Published: (2025)
by: Ajroldi, Niccolò, et al.
Published: (2025)
Unmasking On-Policy Distillation: Where It Helps, Where It Hurts, and Why
by: Armandpour, Mohammadreza, et al.
Published: (2026)
by: Armandpour, Mohammadreza, et al.
Published: (2026)
Incorruptible Neural Networks: Training Models that can Generalize to Large Internal Perturbations
by: Jacobson, Philip, et al.
Published: (2026)
by: Jacobson, Philip, et al.
Published: (2026)
Embedding-based classifiers can detect prompt injection attacks
by: Ayub, Md. Ahsan, et al.
Published: (2024)
by: Ayub, Md. Ahsan, et al.
Published: (2024)
VAR: Visual Analysis for Rashomon Set of Machine Learning Models' Performance
by: Jin, Yuanzhe
Published: (2025)
by: Jin, Yuanzhe
Published: (2025)
HiGP: A high-performance Python package for Gaussian Process
by: Huang, Hua, et al.
Published: (2025)
by: Huang, Hua, et al.
Published: (2025)
If you can distinguish, you can express: Galois theory, Stone--Weierstrass, machine learning, and linguistics
by: Blum-Smith, Ben, et al.
Published: (2025)
by: Blum-Smith, Ben, et al.
Published: (2025)
Optimistic critics can empower small actors
by: Mastikhina, Olya, et al.
Published: (2025)
by: Mastikhina, Olya, et al.
Published: (2025)
Posterior Covariance Structures in Gaussian Processes
by: Cai, Difeng, et al.
Published: (2024)
by: Cai, Difeng, et al.
Published: (2024)
Convolutional Neural Networks can achieve binary bail judgement classification
by: Barman, Amit, et al.
Published: (2024)
by: Barman, Amit, et al.
Published: (2024)
Transformers can do Bayesian Clustering
by: Bhaskaran, Prajit, et al.
Published: (2025)
by: Bhaskaran, Prajit, et al.
Published: (2025)
Graph Diffusion that can Insert and Delete
by: Ninniri, Matteo, et al.
Published: (2025)
by: Ninniri, Matteo, et al.
Published: (2025)
Post-Norm can Resharpen Attention
by: Zsámboki, Pál, et al.
Published: (2025)
by: Zsámboki, Pál, et al.
Published: (2025)
Why Line Search when you can Plane Search? SO-Friendly Neural Networks allow Per-Iteration Optimization of Learning and Momentum Rates for Every Layer
by: Shea, Betty, et al.
Published: (2024)
by: Shea, Betty, et al.
Published: (2024)
You can't rely on fireman forensics
by: Mejía, Robin
Published: (2005)
by: Mejía, Robin
Published: (2005)
Rethinking Non-Negative Matrix Factorization with Implicit Neural Representations
by: Subramani, Krishna, et al.
Published: (2024)
by: Subramani, Krishna, et al.
Published: (2024)
When can we Approximate Wide Contrastive Models with Neural Tangent Kernels and Principal Component Analysis?
by: Anil, Gautham Govind, et al.
Published: (2024)
by: Anil, Gautham Govind, et al.
Published: (2024)
Text embedding models can be great data engineers
by: Kazemian, Iman, et al.
Published: (2025)
by: Kazemian, Iman, et al.
Published: (2025)
Clustering with minimum spanning trees: How good can it be?
by: Gagolewski, Marek, et al.
Published: (2023)
by: Gagolewski, Marek, et al.
Published: (2023)
Scaling can lead to compositional generalization
by: Redhardt, Florian, et al.
Published: (2025)
by: Redhardt, Florian, et al.
Published: (2025)
Neural networks can detect model-free static arbitrage strategies
by: Neufeld, Ariel, et al.
Published: (2023)
by: Neufeld, Ariel, et al.
Published: (2023)
Learning Regularizers: Learning Optimizers that can Regularize
by: Sahoo, Suraj Kumar, et al.
Published: (2025)
by: Sahoo, Suraj Kumar, et al.
Published: (2025)
Sublinear iterations can suffice even for DDPMs
by: Zhang, Matthew S., et al.
Published: (2025)
by: Zhang, Matthew S., et al.
Published: (2025)
Graph Neural Networks can Recover the Hidden Features Solely from the Graph Structure
by: Sato, Ryoma
Published: (2023)
by: Sato, Ryoma
Published: (2023)
Efficient Two-Stage Gaussian Process Regression Via Automatic Kernel Search and Subsampling
by: Zhao, Shifan, et al.
Published: (2024)
by: Zhao, Shifan, et al.
Published: (2024)
Differential Subgroup Discovery: Characterizing Where Two Populations Differ, and Why
by: Xu, Sascha, et al.
Published: (2026)
by: Xu, Sascha, et al.
Published: (2026)
How can representation dimension dominate structurally pruned LLMs?
by: Xu, Mingxue, et al.
Published: (2025)
by: Xu, Mingxue, et al.
Published: (2025)
Smoothed Online Classification can be Harder than Batch Classification
by: Raman, Vinod, et al.
Published: (2024)
by: Raman, Vinod, et al.
Published: (2024)
Sample what you cant compress
by: Birodkar, Vighnesh, et al.
Published: (2024)
by: Birodkar, Vighnesh, et al.
Published: (2024)
When can transformers compositionally generalize in-context?
by: Kobayashi, Seijin, et al.
Published: (2024)
by: Kobayashi, Seijin, et al.
Published: (2024)
Neural networks can be FLOP-efficient integrators of 1D oscillatory integrands
by: Sinha, Anshuman, et al.
Published: (2024)
by: Sinha, Anshuman, et al.
Published: (2024)
Similar Items
-
I can't see it but I can Fine-tune it: On Encrypted Fine-tuning of Transformers using Fully Homomorphic Encryption
by: Panzade, Prajwal, et al.
Published: (2024) -
You can't handle the (dirty) truth: Data-centric insights improve pseudo-labeling
by: Seedat, Nabeel, et al.
Published: (2024) -
Why can't Epidemiology be automated (yet)?
by: Bann, David, et al.
Published: (2025) -
FlexAct: Why Learn when you can Pick?
by: Kumar, Ramnath, et al.
Published: (2026) -
Why social sciences are natural, and why they can't
by: JESÚS ZAMORA-BONILLA
Published: (2012)