Saved in:
| Main Authors: | Wang, Jiachen T., Wu, Tong, Lyu, Kaifeng, Zou, James, Song, Dawn, Jia, Ruoxi, Mittal, Prateek |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2512.24503 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Data Shapley in One Training Run
by: Wang, Jiachen T., et al.
Published: (2024)
by: Wang, Jiachen T., et al.
Published: (2024)
Capturing the Temporal Dependence of Training Data Influence
by: Wang, Jiachen T., et al.
Published: (2024)
by: Wang, Jiachen T., et al.
Published: (2024)
Efficient Data Shapley for Weighted Nearest Neighbor Algorithms
by: Wang, Jiachen T., et al.
Published: (2024)
by: Wang, Jiachen T., et al.
Published: (2024)
Rethinking Data Shapley for Data Selection Tasks: Misleads and Merits
by: Wang, Jiachen T., et al.
Published: (2024)
by: Wang, Jiachen T., et al.
Published: (2024)
Effectively Controlling Reasoning Models through Thinking Intervention
by: Wu, Tong, et al.
Published: (2025)
by: Wu, Tong, et al.
Published: (2025)
A Sustainable AI Economy Needs Data Deals That Work for Generators
by: Jia, Ruoxi, et al.
Published: (2026)
by: Jia, Ruoxi, et al.
Published: (2026)
ReliabilityRAG: Effective and Provably Robust Defense for RAG-based Web-Search
by: Shen, Zeyu, et al.
Published: (2025)
by: Shen, Zeyu, et al.
Published: (2025)
Benchmarking Zero-Shot Robustness of Multimodal Foundation Models: A Pilot Study
by: Wang, Chenguang, et al.
Published: (2024)
by: Wang, Chenguang, et al.
Published: (2024)
AutoScale: Scale-Aware Data Mixing for Pre-Training LLMs
by: Kang, Feiyang, et al.
Published: (2024)
by: Kang, Feiyang, et al.
Published: (2024)
Does More Inference-Time Compute Really Help Robustness?
by: Wu, Tong, et al.
Published: (2025)
by: Wu, Tong, et al.
Published: (2025)
Position: Towards Resilience Against Adversarial Examples
by: Dai, Sihui, et al.
Published: (2024)
by: Dai, Sihui, et al.
Published: (2024)
Safety Alignment Should Be Made More Than Just a Few Tokens Deep
by: Qi, Xiangyu, et al.
Published: (2024)
by: Qi, Xiangyu, et al.
Published: (2024)
Data Mixing Can Induce Phase Transitions in Knowledge Acquisition
by: Gu, Xinran, et al.
Published: (2025)
by: Gu, Xinran, et al.
Published: (2025)
Can Aha Moments Be Fake? Towards Quantifying Decorative and True Thinking in Chain-of-Thought
by: Zhao, Jiachen, et al.
Published: (2025)
by: Zhao, Jiachen, et al.
Published: (2025)
Data-Attributed Adaptive Control Barrier Functions: Safety-Certified Training Data Curation via Influence Analysis
by: Li, Jiachen, et al.
Published: (2026)
by: Li, Jiachen, et al.
Published: (2026)
Boosting Alignment for Post-Unlearning Text-to-Image Generative Models
by: Ko, Myeongseob, et al.
Published: (2024)
by: Ko, Myeongseob, et al.
Published: (2024)
Beyond Images: Adaptive Fusion of Visual and Textual Data for Food Classification
by: Mittal, Prateek, et al.
Published: (2023)
by: Mittal, Prateek, et al.
Published: (2023)
Adversarial Déjà Vu: Jailbreak Dictionary Learning for Stronger Generalization to Unseen Attacks
by: Dabas, Mahavir, et al.
Published: (2025)
by: Dabas, Mahavir, et al.
Published: (2025)
Adaptive and Stratified Subsampling for High-Dimensional Robust Estimation
by: Mittal, Prateek, et al.
Published: (2024)
by: Mittal, Prateek, et al.
Published: (2024)
PatchCURE: Improving Certifiable Robustness, Model Utility, and Computation Efficiency of Adversarial Patch Defenses
by: Xiang, Chong, et al.
Published: (2023)
by: Xiang, Chong, et al.
Published: (2023)
Rethinking Data Value: Asymmetric Data Shapley for Structure-Aware Valuation in Data Markets and Machine Learning Pipelines
by: Zheng, Xi, et al.
Published: (2025)
by: Zheng, Xi, et al.
Published: (2025)
BEEAR: Embedding-based Adversarial Removal of Safety Backdoors in Instruction-tuned Language Models
by: Zeng, Yi, et al.
Published: (2024)
by: Zeng, Yi, et al.
Published: (2024)
How Johnny Can Persuade LLMs to Jailbreak Them: Rethinking Persuasion to Challenge AI Safety by Humanizing LLMs
by: Zeng, Yi, et al.
Published: (2024)
by: Zeng, Yi, et al.
Published: (2024)
Adapting to Evolving Adversaries with Regularized Continual Robust Training
by: Dai, Sihui, et al.
Published: (2025)
by: Dai, Sihui, et al.
Published: (2025)
RigorLLM: Resilient Guardrails for Large Language Models against Undesired Content
by: Yuan, Zhuowen, et al.
Published: (2024)
by: Yuan, Zhuowen, et al.
Published: (2024)
LLMs Can Plan Only If We Tell Them
by: Sel, Bilgehan, et al.
Published: (2025)
by: Sel, Bilgehan, et al.
Published: (2025)
Certifiably Robust RAG against Retrieval Corruption
by: Xiang, Chong, et al.
Published: (2024)
by: Xiang, Chong, et al.
Published: (2024)
ELFS: Label-Free Coreset Selection with Proxy Training Dynamics
by: Zheng, Haizhong, et al.
Published: (2024)
by: Zheng, Haizhong, et al.
Published: (2024)
The Evolution of LLM Adoption in Industry Data Curation Practices
by: Qian, Crystal, et al.
Published: (2024)
by: Qian, Crystal, et al.
Published: (2024)
PatchDEMUX: A Certifiably Robust Framework for Multi-label Classifiers Against Adversarial Patches
by: Jacob, Dennis, et al.
Published: (2025)
by: Jacob, Dennis, et al.
Published: (2025)
Rethinking Data Curation in LLM Training: Online Reweighting Offers Better Generalization than Offline Methods
by: Zhao, Wanru, et al.
Published: (2026)
by: Zhao, Wanru, et al.
Published: (2026)
Autoguided Online Data Curation for Diffusion Model Training
by: Pais, Valeria, et al.
Published: (2025)
by: Pais, Valeria, et al.
Published: (2025)
The Marginal Value of Momentum for Small Learning Rate SGD
by: Wang, Runzhe, et al.
Published: (2023)
by: Wang, Runzhe, et al.
Published: (2023)
Confidence-Aware Alignment Makes Reasoning LLMs More Reliable
by: Chen, Kejia, et al.
Published: (2026)
by: Chen, Kejia, et al.
Published: (2026)
On the Reliability of Code Comprehension Proxies
by: Arvan, Erfan, et al.
Published: (2026)
by: Arvan, Erfan, et al.
Published: (2026)
Displaced Fermionic Gaussian States and their Classical Simulation
by: Lyu, Xingjian, et al.
Published: (2024)
by: Lyu, Xingjian, et al.
Published: (2024)
Fermionic Gaussian Testing and Non-Gaussian Measures via Convolution
by: Lyu, Xingjian, et al.
Published: (2024)
by: Lyu, Xingjian, et al.
Published: (2024)
Rethinking the Role of Proxy Rewards in Language Model Alignment
by: Kim, Sungdong, et al.
Published: (2024)
by: Kim, Sungdong, et al.
Published: (2024)
Code-Guided Reasoning for Small Language Models: Evaluating Executable MCQA Scaffolds
by: Biswas, Prateek, et al.
Published: (2026)
by: Biswas, Prateek, et al.
Published: (2026)
MLAN: Language-Based Instruction Tuning Preserves and Transfers Knowledge in Multimodal Language Models
by: Tu, Jianhong, et al.
Published: (2024)
by: Tu, Jianhong, et al.
Published: (2024)
Similar Items
-
Data Shapley in One Training Run
by: Wang, Jiachen T., et al.
Published: (2024) -
Capturing the Temporal Dependence of Training Data Influence
by: Wang, Jiachen T., et al.
Published: (2024) -
Efficient Data Shapley for Weighted Nearest Neighbor Algorithms
by: Wang, Jiachen T., et al.
Published: (2024) -
Rethinking Data Shapley for Data Selection Tasks: Misleads and Merits
by: Wang, Jiachen T., et al.
Published: (2024) -
Effectively Controlling Reasoning Models through Thinking Intervention
by: Wu, Tong, et al.
Published: (2025)