:: Library Catalog

Bibliographic Details
Main Authors:	Ji, Yuliang, Wu, Jian, Xi, Yuanzhe
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2506.00642

Similar Items

I can't see it but I can Fine-tune it: On Encrypted Fine-tuning of Transformers using Fully Homomorphic Encryption
by: Panzade, Prajwal, et al.
Published: (2024)

You can't handle the (dirty) truth: Data-centric insights improve pseudo-labeling
by: Seedat, Nabeel, et al.
Published: (2024)

Why can't Epidemiology be automated (yet)?
by: Bann, David, et al.
Published: (2025)

FlexAct: Why Learn when you can Pick?
by: Kumar, Ramnath, et al.
Published: (2026)

Why social sciences are natural, and why they can't
by: Zamora-Bonilla, Jesús
Published: (2012)

Influence-based Attributions can be Manipulated
by: Yadav, Chhavi, et al.
Published: (2024)

Why can't we be friends? Untangling conjoined polarization in America
by: Norman, Julie M., et al.
Published: (2025)

Optimization for Neural Operators can Benefit from Width
by: Cisneros-Velarde, Pedro, et al.
Published: (2025)

Unraveling the Rainbow: can value-based methods schedule?
by: Corrêa, Arthur, et al.
Published: (2025)

When, Where and Why to Average Weights?
by: Ajroldi, Niccolò, et al.
Published: (2025)

Unmasking On-Policy Distillation: Where It Helps, Where It Hurts, and Why
by: Armandpour, Mohammadreza, et al.
Published: (2026)

Incorruptible Neural Networks: Training Models that can Generalize to Large Internal Perturbations
by: Jacobson, Philip, et al.
Published: (2026)

Embedding-based classifiers can detect prompt injection attacks
by: Ayub, Md. Ahsan, et al.
Published: (2024)

VAR: Visual Analysis for Rashomon Set of Machine Learning Models' Performance
by: Jin, Yuanzhe
Published: (2025)

HiGP: A high-performance Python package for Gaussian Process
by: Huang, Hua, et al.
Published: (2025)

If you can distinguish, you can express: Galois theory, Stone--Weierstrass, machine learning, and linguistics
by: Blum-Smith, Ben, et al.
Published: (2025)

Optimistic critics can empower small actors
by: Mastikhina, Olya, et al.
Published: (2025)

Posterior Covariance Structures in Gaussian Processes
by: Cai, Difeng, et al.
Published: (2024)

Convolutional Neural Networks can achieve binary bail judgement classification
by: Barman, Amit, et al.
Published: (2024)

Transformers can do Bayesian Clustering
by: Bhaskaran, Prajit, et al.
Published: (2025)

Graph Diffusion that can Insert and Delete
by: Ninniri, Matteo, et al.
Published: (2025)

Post-Norm can Resharpen Attention
by: Zsámboki, Pál, et al.
Published: (2025)

Why Line Search when you can Plane Search? SO-Friendly Neural Networks allow Per-Iteration Optimization of Learning and Momentum Rates for Every Layer
by: Shea, Betty, et al.
Published: (2024)

You can't rely on fireman forensics
by: Mejía, Robin
Published: (2005)

Rethinking Non-Negative Matrix Factorization with Implicit Neural Representations
by: Subramani, Krishna, et al.
Published: (2024)

When can we Approximate Wide Contrastive Models with Neural Tangent Kernels and Principal Component Analysis?
by: Anil, Gautham Govind, et al.
Published: (2024)

Text embedding models can be great data engineers
by: Kazemian, Iman, et al.
Published: (2025)

Clustering with minimum spanning trees: How good can it be?
by: Gagolewski, Marek, et al.
Published: (2023)

Scaling can lead to compositional generalization
by: Redhardt, Florian, et al.
Published: (2025)

Neural networks can detect model-free static arbitrage strategies
by: Neufeld, Ariel, et al.
Published: (2023)

Learning Regularizers: Learning Optimizers that can Regularize
by: Sahoo, Suraj Kumar, et al.
Published: (2025)

Sublinear iterations can suffice even for DDPMs
by: Zhang, Matthew S., et al.
Published: (2025)

Graph Neural Networks can Recover the Hidden Features Solely from the Graph Structure
by: Sato, Ryoma
Published: (2023)

Efficient Two-Stage Gaussian Process Regression Via Automatic Kernel Search and Subsampling
by: Zhao, Shifan, et al.
Published: (2024)

Differential Subgroup Discovery: Characterizing Where Two Populations Differ, and Why
by: Xu, Sascha, et al.
Published: (2026)

How can representation dimension dominate structurally pruned LLMs?
by: Xu, Mingxue, et al.
Published: (2025)

Smoothed Online Classification can be Harder than Batch Classification
by: Raman, Vinod, et al.
Published: (2024)

Sample what you cant compress
by: Birodkar, Vighnesh, et al.
Published: (2024)

When can transformers compositionally generalize in-context?
by: Kobayashi, Seijin, et al.
Published: (2024)

Neural networks can be FLOP-efficient integrators of 1D oscillatory integrands
by: Sinha, Anshuman, et al.
Published: (2024)