Saved in:
| Main Authors: | Chen, Yingquan, Li, Qianmu, Wu, Xiaocong, Li, Huifeng, Chang, Qing |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.00977 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Hidden in Plain Text: Emergence & Mitigation of Steganographic Collusion in LLMs
by: Mathew, Yohan, et al.
Published: (2024)
by: Mathew, Yohan, et al.
Published: (2024)
Zero-shot Generative Linguistic Steganography
by: Lin, Ke, et al.
Published: (2024)
by: Lin, Ke, et al.
Published: (2024)
TrojanStego: Your Language Model Can Secretly Be A Steganographic Privacy Leaking Agent
by: Meier, Dominik, et al.
Published: (2025)
by: Meier, Dominik, et al.
Published: (2025)
Fight Poison with Poison: Enhancing Robustness in Few-shot Machine-Generated Text Detection with Adversarial Training
by: Duan, Wenjing, et al.
Published: (2026)
by: Duan, Wenjing, et al.
Published: (2026)
Safe Text-to-Image Generation: Simply Sanitize the Prompt Embedding
by: Qiu, Huming, et al.
Published: (2024)
by: Qiu, Huming, et al.
Published: (2024)
TSCheater: Generating High-Quality Tibetan Adversarial Texts via Visual Similarity
by: Cao, Xi, et al.
Published: (2024)
by: Cao, Xi, et al.
Published: (2024)
StegoStylo: Squelching Stylometric Scrutiny through Steganographic Stitching
by: Dilworth, Robert
Published: (2026)
by: Dilworth, Robert
Published: (2026)
GradEscape: A Gradient-Based Evader Against AI-Generated Text Detectors
by: Meng, Wenlong, et al.
Published: (2025)
by: Meng, Wenlong, et al.
Published: (2025)
SeqAR: Jailbreak LLMs with Sequential Auto-Generated Characters
by: Yang, Yan, et al.
Published: (2024)
by: Yang, Yan, et al.
Published: (2024)
Early Signs of Steganographic Capabilities in Frontier LLMs
by: Zolkowski, Artur, et al.
Published: (2025)
by: Zolkowski, Artur, et al.
Published: (2025)
MGTEVAL: An Interactive Platform for Systemtic Evaluation of Machine-Generated Text Detectors
by: Li, Yuanfan, et al.
Published: (2026)
by: Li, Yuanfan, et al.
Published: (2026)
A High-Capacity and Secure Disambiguation Algorithm for Neural Linguistic Steganography
by: Feng, Yapei, et al.
Published: (2025)
by: Feng, Yapei, et al.
Published: (2025)
A General Pseudonymization Framework for Cloud-Based LLMs: Replacing Privacy Information in Controlled Text Generation
by: Hou, Shilong, et al.
Published: (2025)
by: Hou, Shilong, et al.
Published: (2025)
Iron Sharpens Iron: Defending Against Attacks in Machine-Generated Text Detection with Adversarial Training
by: Li, Yuanfan, et al.
Published: (2025)
by: Li, Yuanfan, et al.
Published: (2025)
Less is More: Sparse Watermarking in LLMs with Enhanced Text Quality
by: Hoang, Duy C., et al.
Published: (2024)
by: Hoang, Duy C., et al.
Published: (2024)
Beyond Text: Unveiling Privacy Vulnerabilities in Multi-modal Retrieval-Augmented Generation
by: Zhang, Jiankun, et al.
Published: (2025)
by: Zhang, Jiankun, et al.
Published: (2025)
LLMs for Domain Generation Algorithm Detection
by: La O, Reynier Leyva, et al.
Published: (2024)
by: La O, Reynier Leyva, et al.
Published: (2024)
Adversarial Text Generation with Dynamic Contextual Perturbation
by: Waghela, Hetvi, et al.
Published: (2025)
by: Waghela, Hetvi, et al.
Published: (2025)
GIFDL: Generated Image Fluctuation Distortion Learning for Enhancing Steganographic Security
by: Wang, Xiangkun, et al.
Published: (2025)
by: Wang, Xiangkun, et al.
Published: (2025)
Provably Secure Disambiguating Neural Linguistic Steganography
by: Qi, Yuang, et al.
Published: (2024)
by: Qi, Yuang, et al.
Published: (2024)
From Thinking to Output: Chain-of-Thought and Text Generation Characteristics in Reasoning Language Models
by: Liu, Junhao, et al.
Published: (2025)
by: Liu, Junhao, et al.
Published: (2025)
LingoLoop Attack: Trapping MLLMs via Linguistic Context and State Entrapment into Endless Loops
by: Fu, Jiyuan, et al.
Published: (2025)
by: Fu, Jiyuan, et al.
Published: (2025)
Mitigating Fine-tuning based Jailbreak Attack with Backdoor Enhanced Safety Alignment
by: Wang, Jiongxiao, et al.
Published: (2024)
by: Wang, Jiongxiao, et al.
Published: (2024)
A Content-Preserving Secure Linguistic Steganography
by: Xiang, Lingyun, et al.
Published: (2025)
by: Xiang, Lingyun, et al.
Published: (2025)
Block-wise Codeword Embedding for Reliable Multi-bit Text Watermarking
by: Kim, Joeun, et al.
Published: (2026)
by: Kim, Joeun, et al.
Published: (2026)
Purified and Unified Steganographic Network
by: Li, Guobiao, et al.
Published: (2024)
by: Li, Guobiao, et al.
Published: (2024)
Text Embedding Inversion Security for Multilingual Language Models
by: Chen, Yiyi, et al.
Published: (2024)
by: Chen, Yiyi, et al.
Published: (2024)
Human-in-the-Loop Generation of Adversarial Texts: A Case Study on Tibetan Script
by: Cao, Xi, et al.
Published: (2024)
by: Cao, Xi, et al.
Published: (2024)
GenBreak: Red Teaming Text-to-Image Generators Using Large Language Models
by: Wang, Zilong, et al.
Published: (2025)
by: Wang, Zilong, et al.
Published: (2025)
RegionMarker: A Region-Triggered Semantic Watermarking Framework for Embedding-as-a-Service Copyright Protection
by: Yang, Shufan, et al.
Published: (2025)
by: Yang, Shufan, et al.
Published: (2025)
Simulate and Eliminate: Revoke Backdoors for Generative Large Language Models
by: Li, Haoran, et al.
Published: (2024)
by: Li, Haoran, et al.
Published: (2024)
One Model Transfer to All: On Robust Jailbreak Prompts Generation against LLMs
by: Li, Linbao, et al.
Published: (2025)
by: Li, Linbao, et al.
Published: (2025)
TWGuard: A Case Study of LLM Safety Guardrails for Localized Linguistic Contexts
by: Chu, Hua-Rong, et al.
Published: (2026)
by: Chu, Hua-Rong, et al.
Published: (2026)
Differentially Private Synthetic Text Generation for Retrieval-Augmented Generation (RAG)
by: Mori, Junki, et al.
Published: (2025)
by: Mori, Junki, et al.
Published: (2025)
AEIOU: A Unified Defense Framework against NSFW Prompts in Text-to-Image Models
by: Wang, Yiming, et al.
Published: (2024)
by: Wang, Yiming, et al.
Published: (2024)
Efficient Provably Secure Linguistic Steganography via Range Coding
by: Yan, Ruiyi, et al.
Published: (2026)
by: Yan, Ruiyi, et al.
Published: (2026)
Watermarking Conditional Text Generation for AI Detection: Unveiling Challenges and a Semantic-Aware Watermark Remedy
by: Fu, Yu, et al.
Published: (2023)
by: Fu, Yu, et al.
Published: (2023)
Secret-Protected Evolution for Differentially Private Synthetic Text Generation
by: Wang, Tianze, et al.
Published: (2025)
by: Wang, Tianze, et al.
Published: (2025)
Disabling Self-Correction in Retrieval-Augmented Generation via Stealthy Retriever Poisoning
by: Dai, Yanbo, et al.
Published: (2025)
by: Dai, Yanbo, et al.
Published: (2025)
GASLITEing the Retrieval: Exploring Vulnerabilities in Dense Embedding-based Search
by: Ben-Tov, Matan, et al.
Published: (2024)
by: Ben-Tov, Matan, et al.
Published: (2024)
Similar Items
-
Hidden in Plain Text: Emergence & Mitigation of Steganographic Collusion in LLMs
by: Mathew, Yohan, et al.
Published: (2024) -
Zero-shot Generative Linguistic Steganography
by: Lin, Ke, et al.
Published: (2024) -
TrojanStego: Your Language Model Can Secretly Be A Steganographic Privacy Leaking Agent
by: Meier, Dominik, et al.
Published: (2025) -
Fight Poison with Poison: Enhancing Robustness in Few-shot Machine-Generated Text Detection with Adversarial Training
by: Duan, Wenjing, et al.
Published: (2026) -
Safe Text-to-Image Generation: Simply Sanitize the Prompt Embedding
by: Qiu, Huming, et al.
Published: (2024)