Saved in:
| Main Authors: | Hu, Kai, Leino, Klas, Wang, Zifan, Fredrikson, Matt |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2310.02513 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
LipNeXt: Scaling up Lipschitz-based Certified Robustness to Billion-parameter Models
by: Hu, Kai, et al.
Published: (2026)
by: Hu, Kai, et al.
Published: (2026)
LLM Whisperer: An Inconspicuous Attack to Bias LLM Responses
by: Lin, Weiran, et al.
Published: (2024)
by: Lin, Weiran, et al.
Published: (2024)
Improving Alignment and Robustness with Circuit Breakers
by: Zou, Andy, et al.
Published: (2024)
by: Zou, Andy, et al.
Published: (2024)
Transferable Adversarial Attacks on Black-Box Vision-Language Models
by: Hu, Kai, et al.
Published: (2025)
by: Hu, Kai, et al.
Published: (2025)
When the Same Coefficients Reach Different Places: Asymmetric Realizability in Transplanting Tokenizers across Large Language Models
by: Liu, Xiaoze, et al.
Published: (2025)
by: Liu, Xiaoze, et al.
Published: (2025)
Evaluating Language Model Reasoning about Confidential Information
by: Sam, Dylan, et al.
Published: (2025)
by: Sam, Dylan, et al.
Published: (2025)
Efficient LLM Jailbreak via Adaptive Dense-to-sparse Constrained Optimization
by: Hu, Kai, et al.
Published: (2024)
by: Hu, Kai, et al.
Published: (2024)
Jailbreak-Zero: A Path to Pareto Optimal Red Teaming for Large Language Models
by: Hu, Kai, et al.
Published: (2025)
by: Hu, Kai, et al.
Published: (2025)
Refusal-Trained LLMs Are Easily Jailbroken As Browser Agents
by: Kumar, Priyanshu, et al.
Published: (2024)
by: Kumar, Priyanshu, et al.
Published: (2024)
Collective Certified Robustness against Graph Injection Attacks
by: Lai, Yuni, et al.
Published: (2024)
by: Lai, Yuni, et al.
Published: (2024)
Principal Eigenvalue Regularization for Improved Worst-Class Certified Robustness of Smoothed Classifiers
by: Jin, Gaojie, et al.
Published: (2025)
by: Jin, Gaojie, et al.
Published: (2025)
Certified Robustness via Dynamic Margin Maximization and Improved Lipschitz Regularization
by: Fazlyab, Mahyar, et al.
Published: (2023)
by: Fazlyab, Mahyar, et al.
Published: (2023)
Certifiably Robust Image Watermark
by: Jiang, Zhengyuan, et al.
Published: (2024)
by: Jiang, Zhengyuan, et al.
Published: (2024)
CluCERT: Certifying LLM Robustness via Clustering-Guided Denoising Smoothing
by: Wang, Zixia, et al.
Published: (2025)
by: Wang, Zixia, et al.
Published: (2025)
Certifiably Robust Encoding Schemes
by: Saxena, Aman, et al.
Published: (2024)
by: Saxena, Aman, et al.
Published: (2024)
Enhancing Certifiable Semantic Robustness via Robust Pruning of Deep Neural Networks
by: Hu, Hanjiang, et al.
Published: (2025)
by: Hu, Hanjiang, et al.
Published: (2025)
Certifying Robustness via Topological Representations
by: Agerberg, Jens, et al.
Published: (2025)
by: Agerberg, Jens, et al.
Published: (2025)
Certified $\ell_2$ Attribution Robustness via Uniformly Smoothed Attributions
by: Wang, Fan, et al.
Published: (2024)
by: Wang, Fan, et al.
Published: (2024)
TimeRecipe: A Time-Series Forecasting Recipe via Benchmarking Module Level Effectiveness
by: Zhao, Zhiyuan, et al.
Published: (2025)
by: Zhao, Zhiyuan, et al.
Published: (2025)
Sparse Model Soups: A Recipe for Improved Pruning via Model Averaging
by: Zimmer, Max, et al.
Published: (2023)
by: Zimmer, Max, et al.
Published: (2023)
Laplace-Bridged Randomized Smoothing for Fast Certified Robustness
by: Lin, Miao, et al.
Published: (2026)
by: Lin, Miao, et al.
Published: (2026)
Kernel-Smith: A Unified Recipe for Evolutionary Kernel Optimization
by: Du, He, et al.
Published: (2026)
by: Du, He, et al.
Published: (2026)
Certified Causal Defense with Generalizable Robustness
by: Qiao, Yiran, et al.
Published: (2024)
by: Qiao, Yiran, et al.
Published: (2024)
A Recipe for Charge Density Prediction
by: Fu, Xiang, et al.
Published: (2024)
by: Fu, Xiang, et al.
Published: (2024)
On Using Certified Training towards Empirical Robustness
by: De Palma, Alessandro, et al.
Published: (2024)
by: De Palma, Alessandro, et al.
Published: (2024)
CEAR: Certified Ensemble Adversarial Robustness in DNNs
by: Sadig, Daniel, et al.
Published: (2026)
by: Sadig, Daniel, et al.
Published: (2026)
Certifying Global Robustness for Deep Neural Networks
by: Li, You, et al.
Published: (2024)
by: Li, You, et al.
Published: (2024)
Lipschitz-aware Linearity Grafting for Certified Robustness
by: Han, Yongjin, et al.
Published: (2025)
by: Han, Yongjin, et al.
Published: (2025)
Certifiably Byzantine-Robust Federated Conformal Prediction
by: Kang, Mintong, et al.
Published: (2024)
by: Kang, Mintong, et al.
Published: (2024)
On the Extreme Variance of Certified Local Robustness Across Model Seeds
by: Le, Minh, et al.
Published: (2026)
by: Le, Minh, et al.
Published: (2026)
Learning Better Certified Models from Empirically-Robust Teachers
by: De Palma, Alessandro
Published: (2026)
by: De Palma, Alessandro
Published: (2026)
Achieving Domain-Independent Certified Robustness via Knowledge Continuity
by: Sun, Alan, et al.
Published: (2024)
by: Sun, Alan, et al.
Published: (2024)
Fortifying Time Series: DTW-Certified Robust Anomaly Detection
by: Liu, Shijie, et al.
Published: (2026)
by: Liu, Shijie, et al.
Published: (2026)
Certifiably-Robust Federated Adversarial Learning via Randomized Smoothing
by: Chen, Cheng, et al.
Published: (2021)
by: Chen, Cheng, et al.
Published: (2021)
Safety Pretraining: Toward the Next Generation of Safe AI
by: Maini, Pratyush, et al.
Published: (2025)
by: Maini, Pratyush, et al.
Published: (2025)
VeriSplit: Secure and Practical Offloading of Machine Learning Inferences across IoT Devices
by: Zhang, Han, et al.
Published: (2024)
by: Zhang, Han, et al.
Published: (2024)
Pixel-wise Smoothing for Certified Robustness against Camera Motion Perturbations
by: Hu, Hanjiang, et al.
Published: (2023)
by: Hu, Hanjiang, et al.
Published: (2023)
Quamba: A Post-Training Quantization Recipe for Selective State Space Models
by: Chiang, Hung-Yueh, et al.
Published: (2024)
by: Chiang, Hung-Yueh, et al.
Published: (2024)
Augment then Smooth: Reconciling Differential Privacy with Certified Robustness
by: Wu, Jiapeng, et al.
Published: (2023)
by: Wu, Jiapeng, et al.
Published: (2023)
AuditVotes: A Framework Towards More Deployable Certified Robustness for Graph Neural Networks
by: Lai, Yuni, et al.
Published: (2025)
by: Lai, Yuni, et al.
Published: (2025)
Similar Items
-
LipNeXt: Scaling up Lipschitz-based Certified Robustness to Billion-parameter Models
by: Hu, Kai, et al.
Published: (2026) -
LLM Whisperer: An Inconspicuous Attack to Bias LLM Responses
by: Lin, Weiran, et al.
Published: (2024) -
Improving Alignment and Robustness with Circuit Breakers
by: Zou, Andy, et al.
Published: (2024) -
Transferable Adversarial Attacks on Black-Box Vision-Language Models
by: Hu, Kai, et al.
Published: (2025) -
When the Same Coefficients Reach Different Places: Asymmetric Realizability in Transplanting Tokenizers across Large Language Models
by: Liu, Xiaoze, et al.
Published: (2025)