| Main Authors: | Zhang, Liu; Yao, Yiran; Shi, Danping; Chai, Dongchen; Guo, Jian; Wang, Zilong |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.10790 |
Similar Items
Malla: Demystifying Real-world Large Language Model Integrated Malicious Services
by: Lin, Zilong, et al.
Published: (2024)
Cryptanalysis and improvement of multimodal data encryption by machine-learning-based system
by: Tolba, Zakaria
Published: (2024)
Watermarking Graph Neural Networks via Explanations for Ownership Protection
by: Downer, Jane, et al.
Published: (2025)
MEASER: Malware embedding attacks on open-source LLMs
by: Tan, Ming, et al.
Published: (2025)
NCCR: to Evaluate the Robustness of Neural Networks and Adversarial Examples
by: Pu, Shi, et al.
Published: (2025)
In-Context Autonomous Network Incident Response: An End-to-End Large Language Model Agent Approach
by: Gao, Yiran, et al.
Published: (2026)
A Privacy-Preserving Federated Learning Method with Homomorphic Encryption in Omics Data
by: Negoya, Yusaku, et al.
Published: (2025)
GhostEI-Bench: Do Mobile Agents Resilience to Environmental Injection in Dynamic On-Device Environments?
by: Chen, Chiyu, et al.
Published: (2025)
TEESlice: Protecting Sensitive Neural Network Models in Trusted Execution Environments When Attackers have Pre-Trained Models
by: Li, Ding, et al.
Published: (2024)
Obscure but Effective: Classical Chinese Jailbreak Prompt Optimization via Bio-Inspired Search
by: Huang, Xun, et al.
Published: (2026)
MAWSEO: Adversarial Wiki Search Poisoning for Illicit Online Promotion
by: Lin, Zilong, et al.
Published: (2023)
KnowledgeSG: Privacy-Preserving Synthetic Text Generation with Knowledge Distillation from Server
by: Wang, Wenhao, et al.
Published: (2024)
The Attack and Defense Landscape of Agentic AI: A Comprehensive Survey
by: Kim, Juhee, et al.
Published: (2026)
BadEdit: Backdooring large language models by model editing
by: Li, Yanzhou, et al.
Published: (2024)
Human Society-Inspired Approaches to Agentic AI Security: The 4C Framework
by: Abuadbba, Alsharif, et al.
Published: (2026)
AESP: A Human-Sovereign Economic Protocol for AI Agents with Privacy-Preserving Settlement
by: Wang, Jian Sheng
Published: (2026)
SecPE: Secure Prompt Ensembling for Private and Robust Large Language Models
by: Zhang, Jiawen, et al.
Published: (2025)
Cryptanalysis of Pseudorandom Error-Correcting Codes
by: Wang, Tianrui, et al.
Published: (2025)
WGLE: Backdoor-free and Multi-bit Black-box Watermarking for Graph Neural Networks
by: Li, Tingzhi, et al.
Published: (2025)
CSC: Turning the Adversary's Poison against Itself
by: Shi, Yuchen, et al.
Published: (2026)
TBDetector: Transformer-Based Detector for Advanced Persistent Threats with Provenance Graph
by: Wang, Nan, et al.
Published: (2023)
OptiLeak: Efficient Prompt Reconstruction via Reinforcement Learning in Multi-tenant LLM Services
by: Wang, Longxiang, et al.
Published: (2026)
Backdoors in RLVR: Jailbreak Backdoors in LLMs From Verifiable Reward
by: Guo, Weiyang, et al.
Published: (2026)
DNF: Dual-Layer Nested Fingerprinting for Large Language Model Intellectual Property Protection
by: Xu, Zhenhua, et al.
Published: (2026)
False Claims against Model Ownership Resolution
by: Liu, Jian, et al.
Published: (2023)
Unlearning Inversion Attacks for Graph Neural Networks
by: Zhang, Jiahao, et al.
Published: (2025)
Recent Advances in Attack and Defense Approaches of Large Language Models
by: Cui, Jing, et al.
Published: (2024)
Mobile GUI Agent Privacy Personalization with Trajectory Induced Preference Optimization
by: Lin, Zhixin, et al.
Published: (2026)
Conflicts Make Large Reasoning Models Vulnerable to Attacks
by: Liu, Honghao, et al.
Published: (2026)
CryptoX : Compositional Reasoning Evaluation of Large Language Models
by: Shi, Jiajun, et al.
Published: (2025)
Neural Honeytrace: Plug&Play Watermarking Framework against Model Extraction Attacks
by: Xu, Yixiao, et al.
Published: (2025)
Neural Stringology Based Cryptanalysis of EChaCha20
by: Kebande, Victor
Published: (2026)
The Stronger the Diffusion Model, the Easier the Backdoor: Data Poisoning to Induce Copyright Breaches Without Adjusting Finetuning Pipeline
by: Wang, Haonan, et al.
Published: (2024)
Progent: Securing AI Agents with Privilege Control
by: Shi, Tianneng, et al.
Published: (2025)
Safeguarding Multimodal Knowledge Copyright in the RAG-as-a-Service Environment
by: Chen, Tianyu, et al.
Published: (2025)
Invisible Textual Backdoor Attacks based on Dual-Trigger
by: Hou, Yang, et al.
Published: (2024)
Invisible Prompts, Visible Threats: Malicious Font Injection in External Resources for Large Language Models
by: Xiong, Junjie, et al.
Published: (2025)
Activation Approximations Can Incur Safety Vulnerabilities Even in Aligned LLMs: Comprehensive Analysis and Defense
by: Zhang, Jiawen, et al.
Published: (2025)
CUBA: Controlled Untargeted Backdoor Attack against Deep Neural Networks
by: Wu, Yinghao, et al.
Published: (2025)
Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM-based Agents
by: Zhang, Hanrong, et al.
Published: (2024)