:: Library Catalog

कवर छवि

में बचाया:

ग्रंथसूची विवरण
मुख्य लेखकों:	Zheng, Jingyi, Wang, Junfeng, Sun, Zhen, Dong, Wenhan, Liu, Yule, He, Xinlei
स्वरूप:	Preprint
प्रकाशित:	2025
विषय:	Cryptography and Security Artificial Intelligence
ऑनलाइन पहुंच:	https://arxiv.org/abs/2503.08708
टैग:	टैग जोड़ें कोई टैग नहीं, इस रिकॉर्ड को टैग करने वाले पहले व्यक्ति बनें!

समान संसाधन

Humanizing Machine-Generated Content: Evading AI-Text Detection through Adversarial Attack
द्वारा: Zhou, Ying, और अन्य
प्रकाशित: (2024)

MASH: Evading Black-Box AI-Generated Text Detectors via Style Humanization
द्वारा: Gu, Yongtong, और अन्य
प्रकाशित: (2026)

On Evaluating The Performance of Watermarked Machine-Generated Texts Under Adversarial Attacks
द्वारा: Liu, Zesen, और अन्य
प्रकाशित: (2024)

Unsafe LLM-Based Search: Quantitative Analysis and Mitigation of Safety Risks in AI Web Search
द्वारा: Luo, Zeren, और अन्य
प्रकाशित: (2025)

AuthorMist: Evading AI Text Detectors with Reinforcement Learning
द्वारा: David, Isaac, और अन्य
प्रकाशित: (2025)

Are We in the AI-Generated Text World Already? Quantifying and Monitoring AIGT on Social Media
द्वारा: Sun, Zhen, और अन्य
प्रकाशित: (2024)

CL-Attack: Textual Backdoor Attacks via Cross-Lingual Triggers
द्वारा: Zheng, Jingyi, और अन्य
प्रकाशित: (2024)

PEFTGuard: Detecting Backdoor Attacks Against Parameter-Efficient Fine-Tuning
द्वारा: Sun, Zhen, और अन्य
प्रकाशित: (2024)

Quantized Delta Weight Is Safety Keeper
द्वारा: Liu, Yule, और अन्य
प्रकाशित: (2024)

Stego Battlefield: Evaluating Image Steganography Attacks and Steganalysis Defenses
द्वारा: Sun, Zhen, और अन्य
प्रकाशित: (2026)

MGTBench: Benchmarking Machine-Generated Text Detection
द्वारा: He, Xinlei, और अन्य
प्रकाशित: (2023)

MGTEVAL: An Interactive Platform for Systemtic Evaluation of Machine-Generated Text Detectors
द्वारा: Li, Yuanfan, और अन्य
प्रकाशित: (2026)

JALMBench: Benchmarking Jailbreak Vulnerabilities in Audio Language Models
द्वारा: Peng, Zifan, और अन्य
प्रकाशित: (2025)

Backdoor Attacks on Prompt-Driven Video Segmentation Foundation Models
द्वारा: Zhang, Zongmin, और अन्य
प्रकाशित: (2025)

Tarallo: Evading Behavioral Malware Detectors in the Problem Space
द्वारा: Digregorio, Gabriele, और अन्य
प्रकाशित: (2025)

SoK: Benchmarking Poisoning Attacks and Defenses in Federated Learning
द्वारा: Zhang, Heyi, और अन्य
प्रकाशित: (2025)

Jailbreak Attacks and Defenses Against Large Language Models: A Survey
द्वारा: Yi, Sibo, और अन्य
प्रकाशित: (2024)

Privacy-Preserving Federated Learning via Homomorphic Adversarial Networks
द्वारा: Dong, Wenhan, और अन्य
प्रकाशित: (2024)

StealthRL: Reinforcement Learning Paraphrase Attacks for Multi-Detector Evasion of AI-Text Detectors
द्वारा: Ranganath, Suraj, और अन्य
प्रकाशित: (2026)

"To Survive, I Must Defect": Jailbreaking LLMs via the Game-Theory Scenarios
द्वारा: Sun, Zhen, और अन्य
प्रकाशित: (2025)

Attacks on Approximate Caches in Text-to-Image Diffusion Models
द्वारा: Sun, Desen, और अन्य
प्रकाशित: (2025)

Backdoor Attack on Vision Language Models with Stealthy Semantic Manipulation
द्वारा: Zhong, Zhiyuan, और अन्य
प्रकाशित: (2025)

On the Generation and Mitigation of Harmful Geometry in Image-to-3D Models
द्वारा: Liu, Yule, और अन्य
प्रकाशित: (2026)

Auditing Data Membership in Reinforcement Learning With Verifiable Rewards
द्वारा: Liu, Yule, और अन्य
प्रकाशित: (2025)

FC-Attack: Jailbreaking Multimodal Large Language Models via Auto-Generated Flowcharts
द्वारा: Zhang, Ziyi, और अन्य
प्रकाशित: (2025)

Humanizing the Machine: Proxy Attacks to Mislead LLM Detectors
द्वारा: Wang, Tianchun, और अन्य
प्रकाशित: (2024)

Global BGP Attacks that Evade Route Monitoring
द्वारा: Birge-Lee, Henry, और अन्य
प्रकाशित: (2024)

Asymmetric Bias in Text-to-Image Generation with Adversarial Attacks
द्वारा: Shahgir, Haz Sameen, और अन्य
प्रकाशित: (2023)

GradEscape: A Gradient-Based Evader Against AI-Generated Text Detectors
द्वारा: Meng, Wenlong, और अन्य
प्रकाशित: (2025)

Explainable and Transferable Adversarial Attack for ML-Based Network Intrusion Detectors
द्वारा: Zhang, Hangsheng, और अन्य
प्रकाशित: (2024)

EvadeDroid: A Practical Evasion Attack on Machine Learning for Black-box Android Malware Detection
द्वारा: Bostani, Hamid, और अन्य
प्रकाशित: (2021)

Iron Sharpens Iron: Defending Against Attacks in Machine-Generated Text Detection with Adversarial Training
द्वारा: Li, Yuanfan, और अन्य
प्रकाशित: (2025)

JailbreakEval: An Integrated Toolkit for Evaluating Jailbreak Attempts Against Large Language Models
द्वारा: Ran, Delong, और अन्य
प्रकाशित: (2024)

Provably Robust Multi-bit Watermarking for AI-generated Text
द्वारा: Qu, Wenjie, और अन्य
प्रकाशित: (2024)

BELT: Old-School Backdoor Attacks can Evade the State-of-the-Art Defense with Backdoor Exclusivity Lifting
द्वारा: Qiu, Huming, और अन्य
प्रकाशित: (2023)

Nightshade: Prompt-Specific Poisoning Attacks on Text-to-Image Generative Models
द्वारा: Shan, Shawn, और अन्य
प्रकाशित: (2023)

Combinational Backdoor Attack against Customized Text-to-Image Models
द्वारा: Jiang, Wenbo, और अन्य
प्रकाशित: (2024)

Membership Inference Attack Against Masked Image Modeling
द्वारा: Li, Zheng, और अन्य
प्रकाशित: (2024)

DMTG: A Human-Like Mouse Trajectory Generation Bot Based on Entropy-Controlled Diffusion Networks
द्वारा: Liu, Jiahua, और अन्य
प्रकाशित: (2024)

Evading Deep Learning-Based Malware Detectors via Obfuscation: A Deep Reinforcement Learning Approach
द्वारा: Etter, Brian, और अन्य
प्रकाशित: (2024)