:: Library Catalog

Bibliographic Details
Main Author:	Zamir, Or
Format:	Preprint
Published:	2024
Subjects:	Cryptography and Security; Machine Learning
Online Access:	https://arxiv.org/abs/2401.10360

Similar Items

A Note on Non-Composability of Layerwise Approximate Verification for Neural Inference
by: Zamir, Or
Published: (2026)

Undetectable Conversations Between AI Agents via Pseudorandom Noise-Resilient Key Exchange
by: Vaikuntanathan, Vinod, et al.
Published: (2026)

Planting Undetectable Backdoors in Machine Learning Models
by: Goldwasser, Shafi, et al.
Published: (2022)

Harnessing large-language models to generate private synthetic text
by: Kurakin, Alexey, et al.
Published: (2023)

Can large language models be privacy preserving and fair medical coders?
by: Dadsetan, Ali, et al.
Published: (2024)

Seal Your Backdoor with Variational Defense
by: Sabolić, Ivan, et al.
Published: (2025)

Leveraging large language models for SQL behavior-based database intrusion detection
by: Shlezinger, Meital, et al.
Published: (2025)

Beyond Crash: Hijacking Your Autonomous Vehicle for Fun and Profit
by: Sun, Qi, et al.
Published: (2026)

Quantamination: Dynamic Quantization Leaks Your Data Across the Batch
by: Foerster, Hanna, et al.
Published: (2026)

Membership Inference Attacks Cannot Prove that a Model Was Trained On Your Data
by: Zhang, Jie, et al.
Published: (2024)

Towards the generation of hierarchical attack models from cybersecurity vulnerabilities using language models
by: Sowka, Kacper, et al.
Published: (2024)

Exploring the limits of strong membership inference attacks on large language models
by: Hayes, Jamie, et al.
Published: (2025)

Learning diverse attacks on large language models for robust red-teaming and safety tuning
by: Lee, Seanie, et al.
Published: (2024)

SINBAD: Saliency-informed detection of breakage caused by ad blocking
by: Chehade, Saiid El Hajj, et al.
Published: (2024)

Is merging worth it? Securely evaluating the information gain for causal dataset acquisition
by: Fawkes, Jake, et al.
Published: (2024)

Explaining the Model, Protecting Your Data: Revealing and Mitigating the Data Privacy Risks of Post-Hoc Model Explanations via Membership Inference
by: Huang, Catherine, et al.
Published: (2024)

Know Your Scientist: KYC as Biosecurity Infrastructure
by: Feldman, Jonathan, et al.
Published: (2026)

The Jailbreak Tax: How Useful are Your Jailbreak Outputs?
by: Nikolić, Kristina, et al.
Published: (2025)

How Catastrophic is Your LLM? Certifying Risk in Conversation
by: Wang, Chengxiao, et al.
Published: (2025)

Tell me about yourself: LLMs are aware of their learned behaviors
by: Betley, Jan, et al.
Published: (2025)

Your Agent Can Defend Itself against Backdoor Attacks
by: Changjiang, Li, et al.
Published: (2025)

Protect Your Score: Contact Tracing With Differential Privacy Guarantees
by: Romijnders, Rob, et al.
Published: (2023)

Robust image classification with multi-modal large language models
by: Villani, Francesco, et al.
Published: (2024)

One Token Embedding Is Enough to Deadlock Your Large Reasoning Model
by: Zhang, Mohan, et al.
Published: (2025)

Your Privacy Depends on Others: Collusion Vulnerabilities in Individual Differential Privacy
by: Kaiser, Johannes, et al.
Published: (2026)

Sanitize Your Responses: Mitigating Privacy Leakage in Large Language Models
by: Fu, Wenjie, et al.
Published: (2025)

STAMP Your Content: Proving Dataset Membership via Watermarked Rephrasings
by: Rastogi, Saksham, et al.
Published: (2025)

BLIA: Detect model memorization in binary classification model through passive Label Inference attack
by: Khan, Mohammad Wahiduzzaman, et al.
Published: (2025)

Private-RAG: Answering Multiple Queries with LLMs while Keeping Your Data Private
by: Wu, Ruihan, et al.
Published: (2025)

Safety Tax: Safety Alignment Makes Your Large Reasoning Models Less Reasonable
by: Huang, Tiansheng, et al.
Published: (2025)

Attack and defense techniques in large language models: A survey and new perspectives
by: Liao, Zhiyu, et al.
Published: (2025)

SSCAE -- Semantic, Syntactic, and Context-aware natural language Adversarial Examples generator
by: Asl, Javad Rafiei, et al.
Published: (2024)

Multi-Layer Privacy-Preserving Record Linkage with Clerical Review based on gradual information disclosure
by: Rohde, Florens, et al.
Published: (2024)

Identity information based on human magnetocardiography signals
by: Zhang, Pengju, et al.
Published: (2024)

Correlation inference attacks against machine learning models
by: Creţu, Ana-Maria, et al.
Published: (2021)

AED: An black-box NLP classifier model attacker
by: Liu, Yueyang, et al.
Published: (2021)

Watch Out for Your Guidance on Generation! Exploring Conditional Backdoor Attacks against Large Language Models
by: He, Jiaming, et al.
Published: (2024)

How to DP-fy Your Data: A Practical Guide to Generating Synthetic Data With Differential Privacy
by: Ponomareva, Natalia, et al.
Published: (2025)

Beyond Labeling Oracles: What does it mean to steal ML models?
by: Shafran, Avital, et al.
Published: (2023)

Your Compiler is Backdooring Your Model: Understanding and Exploiting Compilation Inconsistency Vulnerabilities in Deep Learning Compilers
by: Chen, Simin, et al.
Published: (2025)