Library Catalog

Bibliographic Details
Main Authors:	Mao, Yuhao; Balauca, Stefan; Vechev, Martin
Format:	Preprint
Published:	2024
Subjects:	Machine Learning; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2406.04848

Similar Items

Understanding Certified Training with Interval Bound Propagation
by: Mao, Yuhao, et al.
Published: (2023)

Gaussian Loss Smoothing Enables Certified Training with Tight Convex Relaxations
by: Balauca, Stefan, et al.
Published: (2024)

MixAT: Combining Continuous and Discrete Adversarial Training for LLMs
by: Dékány, Csaba, et al.
Published: (2025)

Expressiveness of Multi-Neuron Convex Relaxations in Neural Network Certification
by: Mao, Yuhao, et al.
Published: (2024)

Dual Randomized Smoothing: Beyond Global Noise Variance
by: Sun, Chenhao, et al.
Published: (2025)

Learning Compact Boolean Networks
by: Wang, Shengpu, et al.
Published: (2026)

BrokenMath: A Benchmark for Sycophancy in Theorem Proving with LLMs
by: Petrov, Ivo, et al.
Published: (2025)

Recovered in Translation: Efficient Pipeline for Automated Translation of Benchmarks and Datasets
by: Yukhymenko, Hanna, et al.
Published: (2026)

Average Certified Radius is a Poor Metric for Randomized Smoothing
by: Sun, Chenhao, et al.
Published: (2024)

Watermark Stealing in Large Language Models
by: Jovanović, Nikola, et al.
Published: (2024)

AlphaIntegrator: Transformer Action Search for Symbolic Integration Proofs
by: Ünsal, Mert, et al.
Published: (2024)

AutoBaxBuilder: Bootstrapping Code Security Benchmarking
by: von Arx, Tobias, et al.
Published: (2025)

A Unified Framework for LLM Watermarks
by: Gloaguen, Thibaud, et al.
Published: (2026)

A Synthetic Dataset for Personal Attribute Inference
by: Yukhymenko, Hanna, et al.
Published: (2024)

Back to the Drawing Board for Fair Representation Learning
by: Pouget, Angéline, et al.
Published: (2024)

Widening the Gap: Exploiting LLM Quantization via Outlier Injection
by: Zhan, Xiaohua, et al.
Published: (2026)

Practical Hybrid Quantum Language Models with Observable Readout on Real Hardware
by: Balauca, Stefan, et al.
Published: (2025)

COMPL-AI Framework: A Technical Interpretation and LLM Benchmarking Suite for the EU Artificial Intelligence Act
by: Guldimann, Philipp, et al.
Published: (2024)

Discovering Spoofing Attempts on Language Model Watermarks
by: Gloaguen, Thibaud, et al.
Published: (2024)

Ward: Provable RAG Dataset Inference via LLM Watermarks
by: Jovanović, Nikola, et al.
Published: (2024)

Watch your steps: Dormant Adversarial Behaviors that Activate upon LLM Finetuning
by: Gloaguen, Thibaud, et al.
Published: (2025)

Self-contradictory Hallucinations of Large Language Models: Evaluation, Detection and Mitigation
by: Mündler, Niels, et al.
Published: (2023)

Watermarking Diffusion Language Models
by: Gloaguen, Thibaud, et al.
Published: (2025)

CodeTaste: Can LLMs Generate Human-Level Code Refactorings?
by: Thillen, Alex, et al.
Published: (2026)

Mind the Gap: A Practical Attack on GGUF Quantization
by: Egashira, Kazuki, et al.
Published: (2025)

Fewer Weights, More Problems: A Practical Attack on LLM Pruning
by: Egashira, Kazuki, et al.
Published: (2025)

SWT-Bench: Testing and Validating Real-World Bug-Fixes with Code Agents
by: Mündler, Niels, et al.
Published: (2024)

Pay Attention to the Triggers: Constructing Backdoors That Survive Distillation
by: De Muri, Giovanni, et al.
Published: (2025)

Beyond Memorization: Violating Privacy Via Inference with Large Language Models
by: Staab, Robin, et al.
Published: (2023)

Delay, Plateau, or Collapse: Evaluating the Impact of Systematic Verification Error on RLVR
by: Egashira, Kazuki, et al.
Published: (2026)

Exploiting LLM Quantization
by: Egashira, Kazuki, et al.
Published: (2024)

MIBP-Cert: Certified Training against Data Perturbations with Mixed-Integer Bilinear Programs
by: Lorenz, Tobias, et al.
Published: (2024)

Private Attribute Inference from Images with Vision-Language Models
by: Tömekçe, Batuhan, et al.
Published: (2024)

Instruction Tuning for Secure Code Generation
by: He, Jingxuan, et al.
Published: (2024)

On Distinguishing Capability Elicitation from Capability Creation in Post-Training: A Free-Energy Perspective
by: Li, Yuhao, et al.
Published: (2026)

Automated Classification of Model Errors on ImageNet
by: Peychev, Momchil, et al.
Published: (2023)

Certifying Counterfactual Bias in LLMs
by: Chaudhary, Isha, et al.
Published: (2024)

Certified Signed Graph Unlearning
by: Zhao, Junpeng, et al.
Published: (2025)

Certified Training with Branch-and-Bound for Lyapunov-stable Neural Control
by: Shi, Zhouxing, et al.
Published: (2024)

Rethinking Evaluation Paradigms in IBP-based Certified Training
by: Kaulen, Konstantin, et al.
Published: (2026)