Saved in:
| Main Author: | Zamir, Or |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2401.10360 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
A Note on Non-Composability of Layerwise Approximate Verification for Neural Inference
by: Zamir, Or
Published: (2026)
by: Zamir, Or
Published: (2026)
Undetectable Conversations Between AI Agents via Pseudorandom Noise-Resilient Key Exchange
by: Vaikuntanathan, Vinod, et al.
Published: (2026)
by: Vaikuntanathan, Vinod, et al.
Published: (2026)
Planting Undetectable Backdoors in Machine Learning Models
by: Goldwasser, Shafi, et al.
Published: (2022)
by: Goldwasser, Shafi, et al.
Published: (2022)
Harnessing large-language models to generate private synthetic text
by: Kurakin, Alexey, et al.
Published: (2023)
by: Kurakin, Alexey, et al.
Published: (2023)
Can large language models be privacy preserving and fair medical coders?
by: Dadsetan, Ali, et al.
Published: (2024)
by: Dadsetan, Ali, et al.
Published: (2024)
Seal Your Backdoor with Variational Defense
by: Sabolić, Ivan, et al.
Published: (2025)
by: Sabolić, Ivan, et al.
Published: (2025)
Leveraging large language models for SQL behavior-based database intrusion detection
by: Shlezinger, Meital, et al.
Published: (2025)
by: Shlezinger, Meital, et al.
Published: (2025)
Beyond Crash: Hijacking Your Autonomous Vehicle for Fun and Profit
by: Sun, Qi, et al.
Published: (2026)
by: Sun, Qi, et al.
Published: (2026)
Quantamination: Dynamic Quantization Leaks Your Data Across the Batch
by: Foerster, Hanna, et al.
Published: (2026)
by: Foerster, Hanna, et al.
Published: (2026)
Membership Inference Attacks Cannot Prove that a Model Was Trained On Your Data
by: Zhang, Jie, et al.
Published: (2024)
by: Zhang, Jie, et al.
Published: (2024)
Towards the generation of hierarchical attack models from cybersecurity vulnerabilities using language models
by: Sowka, Kacper, et al.
Published: (2024)
by: Sowka, Kacper, et al.
Published: (2024)
Exploring the limits of strong membership inference attacks on large language models
by: Hayes, Jamie, et al.
Published: (2025)
by: Hayes, Jamie, et al.
Published: (2025)
Learning diverse attacks on large language models for robust red-teaming and safety tuning
by: Lee, Seanie, et al.
Published: (2024)
by: Lee, Seanie, et al.
Published: (2024)
SINBAD: Saliency-informed detection of breakage caused by ad blocking
by: Chehade, Saiid El Hajj, et al.
Published: (2024)
by: Chehade, Saiid El Hajj, et al.
Published: (2024)
Is merging worth it? Securely evaluating the information gain for causal dataset acquisition
by: Fawkes, Jake, et al.
Published: (2024)
by: Fawkes, Jake, et al.
Published: (2024)
Explaining the Model, Protecting Your Data: Revealing and Mitigating the Data Privacy Risks of Post-Hoc Model Explanations via Membership Inference
by: Huang, Catherine, et al.
Published: (2024)
by: Huang, Catherine, et al.
Published: (2024)
Know Your Scientist: KYC as Biosecurity Infrastructure
by: Feldman, Jonathan, et al.
Published: (2026)
by: Feldman, Jonathan, et al.
Published: (2026)
The Jailbreak Tax: How Useful are Your Jailbreak Outputs?
by: Nikolić, Kristina, et al.
Published: (2025)
by: Nikolić, Kristina, et al.
Published: (2025)
How Catastrophic is Your LLM? Certifying Risk in Conversation
by: Wang, Chengxiao, et al.
Published: (2025)
by: Wang, Chengxiao, et al.
Published: (2025)
Tell me about yourself: LLMs are aware of their learned behaviors
by: Betley, Jan, et al.
Published: (2025)
by: Betley, Jan, et al.
Published: (2025)
Your Agent Can Defend Itself against Backdoor Attacks
by: Changjiang, Li, et al.
Published: (2025)
by: Changjiang, Li, et al.
Published: (2025)
Protect Your Score: Contact Tracing With Differential Privacy Guarantees
by: Romijnders, Rob, et al.
Published: (2023)
by: Romijnders, Rob, et al.
Published: (2023)
Robust image classification with multi-modal large language models
by: Villani, Francesco, et al.
Published: (2024)
by: Villani, Francesco, et al.
Published: (2024)
One Token Embedding Is Enough to Deadlock Your Large Reasoning Model
by: Zhang, Mohan, et al.
Published: (2025)
by: Zhang, Mohan, et al.
Published: (2025)
Your Privacy Depends on Others: Collusion Vulnerabilities in Individual Differential Privacy
by: Kaiser, Johannes, et al.
Published: (2026)
by: Kaiser, Johannes, et al.
Published: (2026)
Sanitize Your Responses: Mitigating Privacy Leakage in Large Language Models
by: Fu, Wenjie, et al.
Published: (2025)
by: Fu, Wenjie, et al.
Published: (2025)
STAMP Your Content: Proving Dataset Membership via Watermarked Rephrasings
by: Rastogi, Saksham, et al.
Published: (2025)
by: Rastogi, Saksham, et al.
Published: (2025)
BLIA: Detect model memorization in binary classification model through passive Label Inference attack
by: Khan, Mohammad Wahiduzzaman, et al.
Published: (2025)
by: Khan, Mohammad Wahiduzzaman, et al.
Published: (2025)
Private-RAG: Answering Multiple Queries with LLMs while Keeping Your Data Private
by: Wu, Ruihan, et al.
Published: (2025)
by: Wu, Ruihan, et al.
Published: (2025)
Safety Tax: Safety Alignment Makes Your Large Reasoning Models Less Reasonable
by: Huang, Tiansheng, et al.
Published: (2025)
by: Huang, Tiansheng, et al.
Published: (2025)
Attack and defense techniques in large language models: A survey and new perspectives
by: Liao, Zhiyu, et al.
Published: (2025)
by: Liao, Zhiyu, et al.
Published: (2025)
SSCAE -- Semantic, Syntactic, and Context-aware natural language Adversarial Examples generator
by: Asl, Javad Rafiei, et al.
Published: (2024)
by: Asl, Javad Rafiei, et al.
Published: (2024)
Multi-Layer Privacy-Preserving Record Linkage with Clerical Review based on gradual information disclosure
by: Rohde, Florens, et al.
Published: (2024)
by: Rohde, Florens, et al.
Published: (2024)
Identity information based on human magnetocardiography signals
by: Zhang, Pengju, et al.
Published: (2024)
by: Zhang, Pengju, et al.
Published: (2024)
Correlation inference attacks against machine learning models
by: Creţu, Ana-Maria, et al.
Published: (2021)
by: Creţu, Ana-Maria, et al.
Published: (2021)
AED: An black-box NLP classifier model attacker
by: Liu, Yueyang, et al.
Published: (2021)
by: Liu, Yueyang, et al.
Published: (2021)
Watch Out for Your Guidance on Generation! Exploring Conditional Backdoor Attacks against Large Language Models
by: He, Jiaming, et al.
Published: (2024)
by: He, Jiaming, et al.
Published: (2024)
How to DP-fy Your Data: A Practical Guide to Generating Synthetic Data With Differential Privacy
by: Ponomareva, Natalia, et al.
Published: (2025)
by: Ponomareva, Natalia, et al.
Published: (2025)
Beyond Labeling Oracles: What does it mean to steal ML models?
by: Shafran, Avital, et al.
Published: (2023)
by: Shafran, Avital, et al.
Published: (2023)
Your Compiler is Backdooring Your Model: Understanding and Exploiting Compilation Inconsistency Vulnerabilities in Deep Learning Compilers
by: Chen, Simin, et al.
Published: (2025)
by: Chen, Simin, et al.
Published: (2025)
Similar Items
-
A Note on Non-Composability of Layerwise Approximate Verification for Neural Inference
by: Zamir, Or
Published: (2026) -
Undetectable Conversations Between AI Agents via Pseudorandom Noise-Resilient Key Exchange
by: Vaikuntanathan, Vinod, et al.
Published: (2026) -
Planting Undetectable Backdoors in Machine Learning Models
by: Goldwasser, Shafi, et al.
Published: (2022) -
Harnessing large-language models to generate private synthetic text
by: Kurakin, Alexey, et al.
Published: (2023) -
Can large language models be privacy preserving and fair medical coders?
by: Dadsetan, Ali, et al.
Published: (2024)