Library Catalog

Bibliographic Details
Main Authors:	Rabanser, Stephan; Papernot, Nicolas
Format:	Preprint
Published:	2025
Subjects:	Machine Learning; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2510.20242

Similar Items

Suitability Filter: A Statistical Framework for Classifier Evaluation in Real-World Deployment Settings
by: Pouget, Angéline, et al.
Published: (2025)

Uncertainty-Driven Reliability: Selective Prediction and Trustworthy Deployment in Modern Machine Learning
by: Rabanser, Stephan
Published: (2025)

Confidential Guardian: Cryptographically Prohibiting the Abuse of Model Abstention
by: Rabanser, Stephan, et al.
Published: (2025)

Towards a Science of AI Agent Reliability
by: Rabanser, Stephan, et al.
Published: (2026)

Selective Prediction via Training Dynamics
by: Rabanser, Stephan, et al.
Published: (2022)

Have it your way: Individualized Privacy Assignment for DP-SGD
by: Boenisch, Franziska, et al.
Published: (2023)

Fast Exact Unlearning for In-Context Learning Data for LLMs
by: Muresanu, Andrei I., et al.
Published: (2024)

Gradients Look Alike: Sensitivity is Often Overestimated in DP-SGD
by: Thudi, Anvith, et al.
Published: (2023)

What Does it Take to Generalize SER Model Across Datasets? A Comprehensive Benchmark
by: Ibrahim, Adham, et al.
Published: (2024)

Backdoor Detection through Replicated Execution of Outsourced Training
by: Jia, Hengrui, et al.
Published: (2025)

Creative Problem Solving in Large Language and Vision Models -- What Would it Take?
by: Nair, Lakshmi, et al.
Published: (2024)

Architectural Neural Backdoors from First Principles
by: Langford, Harry, et al.
Published: (2024)

What Does Flow Matching Bring To TD Learning?
by: Agrawalla, Bhavya, et al.
Published: (2026)

Improving Continual Learning Performance and Efficiency with Auxiliary Classifiers
by: Szatkowski, Filip, et al.
Published: (2024)

What You See is What You Classify: Black Box Attributions
by: Stalder, Steven, et al.
Published: (2022)

Taking the GP Out of the Loop
by: Bafna, Mehul, et al.
Published: (2025)

Does the Model Say What the Data Says? A Simple Heuristic for Model Data Alignment
by: Salgado, Henry, et al.
Published: (2025)

Who Does What in Deep Learning? Multidimensional Game-Theoretic Attribution of Function of Neural Units
by: Dixit, Shrey, et al.
Published: (2025)

Does Refusal Training in LLMs Generalize to the Past Tense?
by: Andriushchenko, Maksym, et al.
Published: (2024)

How to Connect Speech Foundation Models and Large Language Models? What Matters and What Does Not
by: Verdini, Francesco, et al.
Published: (2024)

Position: Model Collapse Does Not Mean What You Think
by: Schaeffer, Rylan, et al.
Published: (2025)

Autonomous Data Selection with Zero-shot Generative Classifiers for Mathematical Texts
by: Zhang, Yifan, et al.
Published: (2024)

Language Models May Verbatim Complete Text They Were Not Explicitly Trained On
by: Liu, Ken Ziyu, et al.
Published: (2025)

SR-Reward: Taking The Path More Traveled
by: Azad, Seyed Mahdi B., et al.
Published: (2025)

The Curse of Recursion: Training on Generated Data Makes Models Forget
by: Shumailov, Ilia, et al.
Published: (2023)

Classifying Overlapping Gaussian Mixtures in High Dimensions: From Optimal Classifiers to Neural Nets
by: Cohen, Khen, et al.
Published: (2024)

Decomposing Elements of Problem Solving: What "Math" Does RL Teach?
by: Qin, Tian, et al.
Published: (2025)

On Continuity of Robust and Accurate Classifiers
by: Barati, Ramin, et al.
Published: (2023)

What If Consensus Lies? Selective-Complementary Reinforcement Learning at Test Time
by: Yan, Dong, et al.
Published: (2026)

From Supervision to Exploration: What Does Protein Language Model Learn During Reinforcement Learning?
by: Cao, Hanqun, et al.
Published: (2025)

Active Timepoint Selection for Learning Measure-Valued Trajectories
by: Huynh, Nicolas, et al.
Published: (2026)

Know What You Don't Know: Selective Prediction for Early Exit DNNs
by: Bajpai, Divya Jyoti, et al.
Published: (2025)

Interpretable and Fair Mechanisms for Abstaining Classifiers
by: Lenders, Daphne, et al.
Published: (2025)

Confidence Calibration of Classifiers with Many Classes
by: LeCoz, Adrien, et al.
Published: (2024)

Classifier-free guidance in LLMs Safety
by: Smirnov, Roman
Published: (2024)

It Just Takes Two: Scaling Amortized Inference to Large Sets
by: Wehenkel, Antoine, et al.
Published: (2026)

Pragmatist Intelligence: Where the Principle of Usefulness Can Take ANNs
by: Bikić, Antonio, et al.
Published: (2024)

Does Biomedical Training Lead to Better Medical Performance?
by: Dada, Amin, et al.
Published: (2024)

UnUnlearning: Unlearning is not sufficient for content regulation in advanced generative AI
by: Shumailov, Ilia, et al.
Published: (2024)

Take a Step and Reconsider: Sequence Decoding for Self-Improved Neural Combinatorial Optimization
by: Pirnay, Jonathan, et al.
Published: (2024)