Saved in:
| Main Authors: | Mao, Yuhao, Balauca, Stefan, Vechev, Martin |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2406.04848 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Understanding Certified Training with Interval Bound Propagation
by: Mao, Yuhao, et al.
Published: (2023)
by: Mao, Yuhao, et al.
Published: (2023)
Gaussian Loss Smoothing Enables Certified Training with Tight Convex Relaxations
by: Balauca, Stefan, et al.
Published: (2024)
by: Balauca, Stefan, et al.
Published: (2024)
MixAT: Combining Continuous and Discrete Adversarial Training for LLMs
by: Dékány, Csaba, et al.
Published: (2025)
by: Dékány, Csaba, et al.
Published: (2025)
Expressiveness of Multi-Neuron Convex Relaxations in Neural Network Certification
by: Mao, Yuhao, et al.
Published: (2024)
by: Mao, Yuhao, et al.
Published: (2024)
Dual Randomized Smoothing: Beyond Global Noise Variance
by: Sun, Chenhao, et al.
Published: (2025)
by: Sun, Chenhao, et al.
Published: (2025)
Learning Compact Boolean Networks
by: Wang, Shengpu, et al.
Published: (2026)
by: Wang, Shengpu, et al.
Published: (2026)
BrokenMath: A Benchmark for Sycophancy in Theorem Proving with LLMs
by: Petrov, Ivo, et al.
Published: (2025)
by: Petrov, Ivo, et al.
Published: (2025)
Recovered in Translation: Efficient Pipeline for Automated Translation of Benchmarks and Datasets
by: Yukhymenko, Hanna, et al.
Published: (2026)
by: Yukhymenko, Hanna, et al.
Published: (2026)
Average Certified Radius is a Poor Metric for Randomized Smoothing
by: Sun, Chenhao, et al.
Published: (2024)
by: Sun, Chenhao, et al.
Published: (2024)
Watermark Stealing in Large Language Models
by: Jovanović, Nikola, et al.
Published: (2024)
by: Jovanović, Nikola, et al.
Published: (2024)
AlphaIntegrator: Transformer Action Search for Symbolic Integration Proofs
by: Ünsal, Mert, et al.
Published: (2024)
by: Ünsal, Mert, et al.
Published: (2024)
AutoBaxBuilder: Bootstrapping Code Security Benchmarking
by: von Arx, Tobias, et al.
Published: (2025)
by: von Arx, Tobias, et al.
Published: (2025)
A Unified Framework for LLM Watermarks
by: Gloaguen, Thibaud, et al.
Published: (2026)
by: Gloaguen, Thibaud, et al.
Published: (2026)
A Synthetic Dataset for Personal Attribute Inference
by: Yukhymenko, Hanna, et al.
Published: (2024)
by: Yukhymenko, Hanna, et al.
Published: (2024)
Back to the Drawing Board for Fair Representation Learning
by: Pouget, Angéline, et al.
Published: (2024)
by: Pouget, Angéline, et al.
Published: (2024)
Widening the Gap: Exploiting LLM Quantization via Outlier Injection
by: Zhan, Xiaohua, et al.
Published: (2026)
by: Zhan, Xiaohua, et al.
Published: (2026)
Practical Hybrid Quantum Language Models with Observable Readout on Real Hardware
by: Balauca, Stefan, et al.
Published: (2025)
by: Balauca, Stefan, et al.
Published: (2025)
COMPL-AI Framework: A Technical Interpretation and LLM Benchmarking Suite for the EU Artificial Intelligence Act
by: Guldimann, Philipp, et al.
Published: (2024)
by: Guldimann, Philipp, et al.
Published: (2024)
Discovering Spoofing Attempts on Language Model Watermarks
by: Gloaguen, Thibaud, et al.
Published: (2024)
by: Gloaguen, Thibaud, et al.
Published: (2024)
Ward: Provable RAG Dataset Inference via LLM Watermarks
by: Jovanović, Nikola, et al.
Published: (2024)
by: Jovanović, Nikola, et al.
Published: (2024)
Watch your steps: Dormant Adversarial Behaviors that Activate upon LLM Finetuning
by: Gloaguen, Thibaud, et al.
Published: (2025)
by: Gloaguen, Thibaud, et al.
Published: (2025)
Self-contradictory Hallucinations of Large Language Models: Evaluation, Detection and Mitigation
by: Mündler, Niels, et al.
Published: (2023)
by: Mündler, Niels, et al.
Published: (2023)
Watermarking Diffusion Language Models
by: Gloaguen, Thibaud, et al.
Published: (2025)
by: Gloaguen, Thibaud, et al.
Published: (2025)
CodeTaste: Can LLMs Generate Human-Level Code Refactorings?
by: Thillen, Alex, et al.
Published: (2026)
by: Thillen, Alex, et al.
Published: (2026)
Mind the Gap: A Practical Attack on GGUF Quantization
by: Egashira, Kazuki, et al.
Published: (2025)
by: Egashira, Kazuki, et al.
Published: (2025)
Fewer Weights, More Problems: A Practical Attack on LLM Pruning
by: Egashira, Kazuki, et al.
Published: (2025)
by: Egashira, Kazuki, et al.
Published: (2025)
SWT-Bench: Testing and Validating Real-World Bug-Fixes with Code Agents
by: Mündler, Niels, et al.
Published: (2024)
by: Mündler, Niels, et al.
Published: (2024)
Pay Attention to the Triggers: Constructing Backdoors That Survive Distillation
by: De Muri, Giovanni, et al.
Published: (2025)
by: De Muri, Giovanni, et al.
Published: (2025)
Beyond Memorization: Violating Privacy Via Inference with Large Language Models
by: Staab, Robin, et al.
Published: (2023)
by: Staab, Robin, et al.
Published: (2023)
Delay, Plateau, or Collapse: Evaluating the Impact of Systematic Verification Error on RLVR
by: Egashira, Kazuki, et al.
Published: (2026)
by: Egashira, Kazuki, et al.
Published: (2026)
Exploiting LLM Quantization
by: Egashira, Kazuki, et al.
Published: (2024)
by: Egashira, Kazuki, et al.
Published: (2024)
MIBP-Cert: Certified Training against Data Perturbations with Mixed-Integer Bilinear Programs
by: Lorenz, Tobias, et al.
Published: (2024)
by: Lorenz, Tobias, et al.
Published: (2024)
Private Attribute Inference from Images with Vision-Language Models
by: Tömekçe, Batuhan, et al.
Published: (2024)
by: Tömekçe, Batuhan, et al.
Published: (2024)
Instruction Tuning for Secure Code Generation
by: He, Jingxuan, et al.
Published: (2024)
by: He, Jingxuan, et al.
Published: (2024)
On Distinguishing Capability Elicitation from Capability Creation in Post-Training: A Free-Energy Perspective
by: Li, Yuhao, et al.
Published: (2026)
by: Li, Yuhao, et al.
Published: (2026)
Automated Classification of Model Errors on ImageNet
by: Peychev, Momchil, et al.
Published: (2023)
by: Peychev, Momchil, et al.
Published: (2023)
Certifying Counterfactual Bias in LLMs
by: Chaudhary, Isha, et al.
Published: (2024)
by: Chaudhary, Isha, et al.
Published: (2024)
Certified Signed Graph Unlearning
by: Zhao, Junpeng, et al.
Published: (2025)
by: Zhao, Junpeng, et al.
Published: (2025)
Certified Training with Branch-and-Bound for Lyapunov-stable Neural Control
by: Shi, Zhouxing, et al.
Published: (2024)
by: Shi, Zhouxing, et al.
Published: (2024)
Rethinking Evaluation Paradigms in IBP-based Certified Training
by: Kaulen, Konstantin, et al.
Published: (2026)
by: Kaulen, Konstantin, et al.
Published: (2026)
Similar Items
-
Understanding Certified Training with Interval Bound Propagation
by: Mao, Yuhao, et al.
Published: (2023) -
Gaussian Loss Smoothing Enables Certified Training with Tight Convex Relaxations
by: Balauca, Stefan, et al.
Published: (2024) -
MixAT: Combining Continuous and Discrete Adversarial Training for LLMs
by: Dékány, Csaba, et al.
Published: (2025) -
Expressiveness of Multi-Neuron Convex Relaxations in Neural Network Certification
by: Mao, Yuhao, et al.
Published: (2024) -
Dual Randomized Smoothing: Beyond Global Noise Variance
by: Sun, Chenhao, et al.
Published: (2025)