Saved in:
| Main Authors: | Cao, Xi, Gesang, Quzong, Sun, Yuan, Qun, Nuo, Nyima, Tashi |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2412.02371 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Human-in-the-Loop Generation of Adversarial Texts: A Case Study on Tibetan Script
by: Cao, Xi, et al.
Published: (2024)
by: Cao, Xi, et al.
Published: (2024)
Multi-Granularity Tibetan Textual Adversarial Attack Method Based on Masked Language Model
by: Cao, Xi, et al.
Published: (2024)
by: Cao, Xi, et al.
Published: (2024)
Pay Attention to the Robustness of Chinese Minority Language Models! Syllable-level Textual Adversarial Attack on Tibetan Script
by: Cao, Xi, et al.
Published: (2024)
by: Cao, Xi, et al.
Published: (2024)
Adversarial Text Generation with Dynamic Contextual Perturbation
by: Waghela, Hetvi, et al.
Published: (2025)
by: Waghela, Hetvi, et al.
Published: (2025)
Humanizing Machine-Generated Content: Evading AI-Text Detection through Adversarial Attack
by: Zhou, Ying, et al.
Published: (2024)
by: Zhou, Ying, et al.
Published: (2024)
Iron Sharpens Iron: Defending Against Attacks in Machine-Generated Text Detection with Adversarial Training
by: Li, Yuanfan, et al.
Published: (2025)
by: Li, Yuanfan, et al.
Published: (2025)
Text-CRS: A Generalized Certified Robustness Framework against Textual Adversarial Attacks
by: Zhang, Xinyu, et al.
Published: (2023)
by: Zhang, Xinyu, et al.
Published: (2023)
Fight Poison with Poison: Enhancing Robustness in Few-shot Machine-Generated Text Detection with Adversarial Training
by: Duan, Wenjing, et al.
Published: (2026)
by: Duan, Wenjing, et al.
Published: (2026)
A Character-based Diffusion Embedding Algorithm for Enhancing the Generation Quality of Generative Linguistic Steganographic Texts
by: Chen, Yingquan, et al.
Published: (2025)
by: Chen, Yingquan, et al.
Published: (2025)
Saliency Attention and Semantic Similarity-Driven Adversarial Perturbation
by: Waghela, Hetvi, et al.
Published: (2024)
by: Waghela, Hetvi, et al.
Published: (2024)
MaskSQL: Safeguarding Privacy for LLM-Based Text-to-SQL via Abstraction
by: Abedini, Sepideh, et al.
Published: (2025)
by: Abedini, Sepideh, et al.
Published: (2025)
Modeling the Attack: Detecting AI-Generated Text by Quantifying Adversarial Perturbations
by: Teja, Lekkala Sai, et al.
Published: (2025)
by: Teja, Lekkala Sai, et al.
Published: (2025)
Chain-of-Code Collapse: Reasoning Failures in LLMs via Adversarial Prompting in Code Generation
by: Roh, Jaechul, et al.
Published: (2025)
by: Roh, Jaechul, et al.
Published: (2025)
Graded Suspiciousness of Adversarial Texts to Human
by: Tonni, Shakila Mahjabin, et al.
Published: (2024)
by: Tonni, Shakila Mahjabin, et al.
Published: (2024)
Adversarial Decoding: Generating Readable Documents for Adversarial Objectives
by: Zhang, Collin, et al.
Published: (2024)
by: Zhang, Collin, et al.
Published: (2024)
Finding a Wolf in Sheep's Clothing: Combating Adversarial Text-To-Image Prompts with Text Summarization
by: Cooper, Portia, et al.
Published: (2024)
by: Cooper, Portia, et al.
Published: (2024)
Efficient and Stealthy Jailbreak Attacks via Adversarial Prompt Distillation from LLMs to SLMs
by: Li, Xiang, et al.
Published: (2025)
by: Li, Xiang, et al.
Published: (2025)
SafeReview: Defending LLM-based Review Systems Against Adversarial Hidden Prompts
by: Xin, Yuan, et al.
Published: (2026)
by: Xin, Yuan, et al.
Published: (2026)
Semantic Stealth: Adversarial Text Attacks on NLP Using Several Methods
by: Dey, Roopkatha, et al.
Published: (2024)
by: Dey, Roopkatha, et al.
Published: (2024)
Enhancing Adversarial Text Attacks on BERT Models with Projected Gradient Descent
by: Waghela, Hetvi, et al.
Published: (2024)
by: Waghela, Hetvi, et al.
Published: (2024)
Adversarial Attacks on Parts of Speech: An Empirical Study in Text-to-Image Generation
by: Shahariar, G M, et al.
Published: (2024)
by: Shahariar, G M, et al.
Published: (2024)
On Evaluating The Performance of Watermarked Machine-Generated Texts Under Adversarial Attacks
by: Liu, Zesen, et al.
Published: (2024)
by: Liu, Zesen, et al.
Published: (2024)
MGTEVAL: An Interactive Platform for Systemtic Evaluation of Machine-Generated Text Detectors
by: Li, Yuanfan, et al.
Published: (2026)
by: Li, Yuanfan, et al.
Published: (2026)
Learning-Based Automated Adversarial Red-Teaming for Robustness Evaluation of Large Language Models
by: Wei, Zhang, et al.
Published: (2025)
by: Wei, Zhang, et al.
Published: (2025)
A General Pseudonymization Framework for Cloud-Based LLMs: Replacing Privacy Information in Controlled Text Generation
by: Hou, Shilong, et al.
Published: (2025)
by: Hou, Shilong, et al.
Published: (2025)
A Modified Word Saliency-Based Adversarial Attack on Text Classification Models
by: Waghela, Hetvi, et al.
Published: (2024)
by: Waghela, Hetvi, et al.
Published: (2024)
RTD-Guard: A Black-Box Textual Adversarial Detection Framework via Replacement Token Detection
by: Zhu, He, et al.
Published: (2026)
by: Zhu, He, et al.
Published: (2026)
Differentially Private Knowledge Distillation via Synthetic Text Generation
by: Flemings, James, et al.
Published: (2024)
by: Flemings, James, et al.
Published: (2024)
Beyond Text: Unveiling Privacy Vulnerabilities in Multi-modal Retrieval-Augmented Generation
by: Zhang, Jiankun, et al.
Published: (2025)
by: Zhang, Jiankun, et al.
Published: (2025)
GraphSteal: Structural Knowledge Stealing from Graph RAG via Traversal Reconstruction
by: Gu, Jinze, et al.
Published: (2026)
by: Gu, Jinze, et al.
Published: (2026)
Less is More: Sparse Watermarking in LLMs with Enhanced Text Quality
by: Hoang, Duy C., et al.
Published: (2024)
by: Hoang, Duy C., et al.
Published: (2024)
from Benign import Toxic: Jailbreaking the Language Model via Adversarial Metaphors
by: Yan, Yu, et al.
Published: (2025)
by: Yan, Yu, et al.
Published: (2025)
InvisibleInk: High-Utility and Low-Cost Text Generation with Differential Privacy
by: Vinod, Vishnu, et al.
Published: (2025)
by: Vinod, Vishnu, et al.
Published: (2025)
GenBreak: Red Teaming Text-to-Image Generators Using Large Language Models
by: Wang, Zilong, et al.
Published: (2025)
by: Wang, Zilong, et al.
Published: (2025)
GradEscape: A Gradient-Based Evader Against AI-Generated Text Detectors
by: Meng, Wenlong, et al.
Published: (2025)
by: Meng, Wenlong, et al.
Published: (2025)
Adversarial Robustness through Dynamic Ensemble Learning
by: Waghela, Hetvi, et al.
Published: (2024)
by: Waghela, Hetvi, et al.
Published: (2024)
Subspace Defense: Discarding Adversarial Perturbations by Learning a Subspace for Clean Signals
by: Zheng, Rui, et al.
Published: (2024)
by: Zheng, Rui, et al.
Published: (2024)
Groot: Adversarial Testing for Generative Text-to-Image Models with Tree-based Semantic Transformation
by: Liu, Yi, et al.
Published: (2024)
by: Liu, Yi, et al.
Published: (2024)
Watermarking Conditional Text Generation for AI Detection: Unveiling Challenges and a Semantic-Aware Watermark Remedy
by: Fu, Yu, et al.
Published: (2023)
by: Fu, Yu, et al.
Published: (2023)
Text2VLM: Adapting Text-Only Datasets to Evaluate Alignment Training in Visual Language Models
by: Downer, Gabriel, et al.
Published: (2025)
by: Downer, Gabriel, et al.
Published: (2025)
Similar Items
-
Human-in-the-Loop Generation of Adversarial Texts: A Case Study on Tibetan Script
by: Cao, Xi, et al.
Published: (2024) -
Multi-Granularity Tibetan Textual Adversarial Attack Method Based on Masked Language Model
by: Cao, Xi, et al.
Published: (2024) -
Pay Attention to the Robustness of Chinese Minority Language Models! Syllable-level Textual Adversarial Attack on Tibetan Script
by: Cao, Xi, et al.
Published: (2024) -
Adversarial Text Generation with Dynamic Contextual Perturbation
by: Waghela, Hetvi, et al.
Published: (2025) -
Humanizing Machine-Generated Content: Evading AI-Text Detection through Adversarial Attack
by: Zhou, Ying, et al.
Published: (2024)