| Main Authors: | Rabanser, Stephan, Papernot, Nicolas |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2510.20242 |
Similar Items
Suitability Filter: A Statistical Framework for Classifier Evaluation in Real-World Deployment Settings
by: Pouget, Angéline, et al.
Published: (2025)
Uncertainty-Driven Reliability: Selective Prediction and Trustworthy Deployment in Modern Machine Learning
by: Rabanser, Stephan
Published: (2025)
Confidential Guardian: Cryptographically Prohibiting the Abuse of Model Abstention
by: Rabanser, Stephan, et al.
Published: (2025)
Towards a Science of AI Agent Reliability
by: Rabanser, Stephan, et al.
Published: (2026)
Selective Prediction via Training Dynamics
by: Rabanser, Stephan, et al.
Published: (2022)
Have it your way: Individualized Privacy Assignment for DP-SGD
by: Boenisch, Franziska, et al.
Published: (2023)
Fast Exact Unlearning for In-Context Learning Data for LLMs
by: Muresanu, Andrei I., et al.
Published: (2024)
Gradients Look Alike: Sensitivity is Often Overestimated in DP-SGD
by: Thudi, Anvith, et al.
Published: (2023)
What Does it Take to Generalize SER Model Across Datasets? A Comprehensive Benchmark
by: Ibrahim, Adham, et al.
Published: (2024)
Backdoor Detection through Replicated Execution of Outsourced Training
by: Jia, Hengrui, et al.
Published: (2025)
Creative Problem Solving in Large Language and Vision Models -- What Would it Take?
by: Nair, Lakshmi, et al.
Published: (2024)
Architectural Neural Backdoors from First Principles
by: Langford, Harry, et al.
Published: (2024)
What Does Flow Matching Bring To TD Learning?
by: Agrawalla, Bhavya, et al.
Published: (2026)
Improving Continual Learning Performance and Efficiency with Auxiliary Classifiers
by: Szatkowski, Filip, et al.
Published: (2024)
What You See is What You Classify: Black Box Attributions
by: Stalder, Steven, et al.
Published: (2022)
Taking the GP Out of the Loop
by: Bafna, Mehul, et al.
Published: (2025)
Does the Model Say What the Data Says? A Simple Heuristic for Model Data Alignment
by: Salgado, Henry, et al.
Published: (2025)
Who Does What in Deep Learning? Multidimensional Game-Theoretic Attribution of Function of Neural Units
by: Dixit, Shrey, et al.
Published: (2025)
Does Refusal Training in LLMs Generalize to the Past Tense?
by: Andriushchenko, Maksym, et al.
Published: (2024)
How to Connect Speech Foundation Models and Large Language Models? What Matters and What Does Not
by: Verdini, Francesco, et al.
Published: (2024)
Position: Model Collapse Does Not Mean What You Think
by: Schaeffer, Rylan, et al.
Published: (2025)
Autonomous Data Selection with Zero-shot Generative Classifiers for Mathematical Texts
by: Zhang, Yifan, et al.
Published: (2024)
Language Models May Verbatim Complete Text They Were Not Explicitly Trained On
by: Liu, Ken Ziyu, et al.
Published: (2025)
SR-Reward: Taking The Path More Traveled
by: Azad, Seyed Mahdi B., et al.
Published: (2025)
The Curse of Recursion: Training on Generated Data Makes Models Forget
by: Shumailov, Ilia, et al.
Published: (2023)
Classifying Overlapping Gaussian Mixtures in High Dimensions: From Optimal Classifiers to Neural Nets
by: Cohen, Khen, et al.
Published: (2024)
Decomposing Elements of Problem Solving: What "Math" Does RL Teach?
by: Qin, Tian, et al.
Published: (2025)
On Continuity of Robust and Accurate Classifiers
by: Barati, Ramin, et al.
Published: (2023)
What If Consensus Lies? Selective-Complementary Reinforcement Learning at Test Time
by: Yan, Dong, et al.
Published: (2026)
From Supervision to Exploration: What Does Protein Language Model Learn During Reinforcement Learning?
by: Cao, Hanqun, et al.
Published: (2025)
Active Timepoint Selection for Learning Measure-Valued Trajectories
by: Huynh, Nicolas, et al.
Published: (2026)
Know What You Don't Know: Selective Prediction for Early Exit DNNs
by: Bajpai, Divya Jyoti, et al.
Published: (2025)
Interpretable and Fair Mechanisms for Abstaining Classifiers
by: Lenders, Daphne, et al.
Published: (2025)
Confidence Calibration of Classifiers with Many Classes
by: LeCoz, Adrien, et al.
Published: (2024)
Classifier-free guidance in LLMs Safety
by: Smirnov, Roman
Published: (2024)
It Just Takes Two: Scaling Amortized Inference to Large Sets
by: Wehenkel, Antoine, et al.
Published: (2026)
Pragmatist Intelligence: Where the Principle of Usefulness Can Take ANNs
by: Bikić, Antonio, et al.
Published: (2024)
Does Biomedical Training Lead to Better Medical Performance?
by: Dada, Amin, et al.
Published: (2024)
UnUnlearning: Unlearning is not sufficient for content regulation in advanced generative AI
by: Shumailov, Ilia, et al.
Published: (2024)
Take a Step and Reconsider: Sequence Decoding for Self-Improved Neural Combinatorial Optimization
by: Pirnay, Jonathan, et al.
Published: (2024)