Saved in:
| Main Authors: | Weiss, Christopher, Kreuter, Frauke, Habernal, Ivan |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2307.06708 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
How reparametrization trick broke differentially-private text representation learning
by: Habernal, Ivan
Published: (2022)
by: Habernal, Ivan
Published: (2022)
DP-BART for Privatized Text Rewriting under Local Differential Privacy
by: Igamberdiev, Timour, et al.
Published: (2023)
by: Igamberdiev, Timour, et al.
Published: (2023)
Is Your Prompt Safe? Investigating Prompt Injection Attacks Against Open-Source LLMs
by: Wang, Jiawen, et al.
Published: (2025)
by: Wang, Jiawen, et al.
Published: (2025)
What Was Your Prompt? A Remote Keylogging Attack on AI Assistants
by: Weiss, Roy, et al.
Published: (2024)
by: Weiss, Roy, et al.
Published: (2024)
Universal share based quantum multi secret image sharing scheme
by: Rabari, Dipak K., et al.
Published: (2025)
by: Rabari, Dipak K., et al.
Published: (2025)
Fooling LLM graders into giving better grades through neural activity guided adversarial prompting
by: Yamamura, Atsushi, et al.
Published: (2024)
by: Yamamura, Atsushi, et al.
Published: (2024)
CLIBE: Detecting Dynamic Backdoors in Transformer-based NLP Models
by: Zeng, Rui, et al.
Published: (2024)
by: Zeng, Rui, et al.
Published: (2024)
Data-centric NLP Backdoor Defense from the Lens of Memorization
by: Wang, Zhenting, et al.
Published: (2024)
by: Wang, Zhenting, et al.
Published: (2024)
Efficient derandomization of differentially private counting queries
by: Ghentiyala, Surendra
Published: (2025)
by: Ghentiyala, Surendra
Published: (2025)
Semantic Stealth: Adversarial Text Attacks on NLP Using Several Methods
by: Dey, Roopkatha, et al.
Published: (2024)
by: Dey, Roopkatha, et al.
Published: (2024)
Emoti-Attack: Zero-Perturbation Adversarial Attacks on NLP Systems via Emoji Sequences
by: Zhang, Yangshijie
Published: (2025)
by: Zhang, Yangshijie
Published: (2025)
Demo: TOSense -- What Did You Just Agree to?
by: Chen, Xinzhang, et al.
Published: (2025)
by: Chen, Xinzhang, et al.
Published: (2025)
Do Reasoning LLMs Refuse What They Infer in Long Contexts?
by: Fu, Yu, et al.
Published: (2026)
by: Fu, Yu, et al.
Published: (2026)
NLP Privacy Risk Identification in Social Media (NLP-PRISM): A Survey
by: Goswami, Dhiman, et al.
Published: (2026)
by: Goswami, Dhiman, et al.
Published: (2026)
Efficient patient-centric EMR sharing block tree
by: Hu, Xiaohan, et al.
Published: (2025)
by: Hu, Xiaohan, et al.
Published: (2025)
What Really Matters in Many-Shot Attacks? An Empirical Study of Long-Context Vulnerabilities in LLMs
by: Kim, Sangyeop, et al.
Published: (2025)
by: Kim, Sangyeop, et al.
Published: (2025)
Differentially Private aggregate hints in mev-share
by: Passerat-Palmbach, Jonathan, et al.
Published: (2025)
by: Passerat-Palmbach, Jonathan, et al.
Published: (2025)
Advance sharing for stabilizer-based quantum secret sharing schemes
by: Shibata, Mamoru
Published: (2025)
by: Shibata, Mamoru
Published: (2025)
Quantum function secret sharing
by: Grilo, Alex B., et al.
Published: (2025)
by: Grilo, Alex B., et al.
Published: (2025)
dabih -- encrypted data storage and sharing platform
by: Huttner, Michael, et al.
Published: (2024)
by: Huttner, Michael, et al.
Published: (2024)
An indicator for effectiveness of text-to-image guardrails utilizing the Single-Turn Crescendo Attack (STCA)
by: Kwartler, Ted, et al.
Published: (2024)
by: Kwartler, Ted, et al.
Published: (2024)
On secret sharing from extended norm-trace curves
by: Geil, Olav
Published: (2026)
by: Geil, Olav
Published: (2026)
Training a General Purpose Automated Red Teaming Model
by: Padmakumar, Aishwarya, et al.
Published: (2026)
by: Padmakumar, Aishwarya, et al.
Published: (2026)
What Matters For Safety Alignment?
by: Li, Xing, et al.
Published: (2026)
by: Li, Xing, et al.
Published: (2026)
PBa-LLM: Privacy- and Bias-aware NLP using Named-Entity Recognition (NER)
by: Mancera, Gonzalo, et al.
Published: (2025)
by: Mancera, Gonzalo, et al.
Published: (2025)
CPE-Identifier: Automated CPE identification and CVE summaries annotation with Deep Learning and NLP
by: Hu, Wanyu, et al.
Published: (2024)
by: Hu, Wanyu, et al.
Published: (2024)
Practically adaptable CPABE based Health-Records sharing framework
by: Imam, Raza, et al.
Published: (2024)
by: Imam, Raza, et al.
Published: (2024)
Privacy Ripple Effects from Adding or Removing Personal Information in Language Model Training
by: Borkar, Jaydeep, et al.
Published: (2025)
by: Borkar, Jaydeep, et al.
Published: (2025)
Two-layer consensus based on master-slave consortium chain data sharing for Internet of Vehicles
by: Zhao, Feng, et al.
Published: (2024)
by: Zhao, Feng, et al.
Published: (2024)
Proof-of-Guardrail in AI Agents and What (Not) to Trust from It
by: Jin, Xisen, et al.
Published: (2026)
by: Jin, Xisen, et al.
Published: (2026)
Efficient Fuzzy Private Set Intersection from Secret-shared OPRF
by: Yang, Xinpeng, et al.
Published: (2026)
by: Yang, Xinpeng, et al.
Published: (2026)
What Features in Prompts Jailbreak LLMs? Investigating the Mechanisms Behind Attacks
by: Kirch, Nathalie, et al.
Published: (2024)
by: Kirch, Nathalie, et al.
Published: (2024)
On the Impossibility of Separating Intelligence from Judgment: The Computational Intractability of Filtering for AI Alignment
by: Ball, Sarah, et al.
Published: (2025)
by: Ball, Sarah, et al.
Published: (2025)
Are You Getting What You Pay For? Auditing Model Substitution in LLM APIs
by: Cai, Will, et al.
Published: (2025)
by: Cai, Will, et al.
Published: (2025)
Emergent misalignment as prompt sensitivity: A research note
by: Wyse, Tim, et al.
Published: (2025)
by: Wyse, Tim, et al.
Published: (2025)
FLAME: Flexible LLM-Assisted Moderation Engine
by: Bakulin, Ivan, et al.
Published: (2025)
by: Bakulin, Ivan, et al.
Published: (2025)
Data sharing in the metaverse with key abuse resistance based on decentralized CP-ABE
by: Zhang, Liang, et al.
Published: (2024)
by: Zhang, Liang, et al.
Published: (2024)
Learning from Negative Examples: Why Warning-Framed Training Data Teaches What It Warns Against
by: Enkhbayar, Tsogt-Ochir
Published: (2025)
by: Enkhbayar, Tsogt-Ochir
Published: (2025)
What Does the Server See? Understanding Privacy Leakage from Large Language Models in Split Inference
by: Fan, Mingyuan, et al.
Published: (2026)
by: Fan, Mingyuan, et al.
Published: (2026)
A symmetric extensible protocol for quantum secret sharing
by: Ampatzis, Michael, et al.
Published: (2022)
by: Ampatzis, Michael, et al.
Published: (2022)
Similar Items
-
How reparametrization trick broke differentially-private text representation learning
by: Habernal, Ivan
Published: (2022) -
DP-BART for Privatized Text Rewriting under Local Differential Privacy
by: Igamberdiev, Timour, et al.
Published: (2023) -
Is Your Prompt Safe? Investigating Prompt Injection Attacks Against Open-Source LLMs
by: Wang, Jiawen, et al.
Published: (2025) -
What Was Your Prompt? A Remote Keylogging Attack on AI Assistants
by: Weiss, Roy, et al.
Published: (2024) -
Universal share based quantum multi secret image sharing scheme
by: Rabari, Dipak K., et al.
Published: (2025)