Saved in:
| Main Authors: | Lin, Zilong, Cui, Jian, Liao, Xiaojing, Wang, XiaoFeng |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2401.03315 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
MAWSEO: Adversarial Wiki Search Poisoning for Illicit Online Promotion
by: Lin, Zilong, et al.
Published: (2023)
by: Lin, Zilong, et al.
Published: (2023)
Consiglieres in the Shadow: Understanding the Use of Uncensored Large Language Models in Cybercrimes
by: Lin, Zilong, et al.
Published: (2025)
by: Lin, Zilong, et al.
Published: (2025)
Leveraging Large Language Models to Detect npm Malicious Packages
by: Zahan, Nusrat, et al.
Published: (2024)
by: Zahan, Nusrat, et al.
Published: (2024)
Invisible Prompts, Visible Threats: Malicious Font Injection in External Resources for Large Language Models
by: Xiong, Junjie, et al.
Published: (2025)
by: Xiong, Junjie, et al.
Published: (2025)
Hallucinating AI Hijacking Attack: Large Language Models and Malicious Code Recommenders
by: Noever, David, et al.
Published: (2024)
by: Noever, David, et al.
Published: (2024)
Hide Your Malicious Goal Into Benign Narratives: Jailbreak Large Language Models through Carrier Articles
by: Wang, Zhilong, et al.
Published: (2024)
by: Wang, Zhilong, et al.
Published: (2024)
Hidden You Malicious Goal Into Benign Narratives: Jailbreak Large Language Models through Logic Chain Injection
by: Wang, Zhilong, et al.
Published: (2024)
by: Wang, Zhilong, et al.
Published: (2024)
A-MemGuard: A Proactive Defense Framework for LLM-Based Agent Memory
by: Wei, Qianshan, et al.
Published: (2025)
by: Wei, Qianshan, et al.
Published: (2025)
The Janus Interface: How Fine-Tuning in Large Language Models Amplifies the Privacy Risks
by: Chen, Xiaoyi, et al.
Published: (2023)
by: Chen, Xiaoyi, et al.
Published: (2023)
SafeMLRM: Demystifying Safety in Multi-modal Large Reasoning Models
by: Fang, Junfeng, et al.
Published: (2025)
by: Fang, Junfeng, et al.
Published: (2025)
Automatically Generating Rules of Malicious Software Packages via Large Language Model
by: Zhang, XiangRui, et al.
Published: (2025)
by: Zhang, XiangRui, et al.
Published: (2025)
Paladin: Defending LLM-enabled Phishing Emails with a New Trigger-Tag Paradigm
by: Pang, Yan, et al.
Published: (2025)
by: Pang, Yan, et al.
Published: (2025)
Stealthy Backdoor Attack to Real-world Models in Android Apps
by: Wei, Jiali, et al.
Published: (2025)
by: Wei, Jiali, et al.
Published: (2025)
Real-time ML-based Defense Against Malicious Payload in Reconfigurable Embedded Systems
by: Stahle-Smith, Rye, et al.
Published: (2025)
by: Stahle-Smith, Rye, et al.
Published: (2025)
SecPE: Secure Prompt Ensembling for Private and Robust Large Language Models
by: Zhang, Jiawen, et al.
Published: (2025)
by: Zhang, Jiawen, et al.
Published: (2025)
SDD: Self-Degraded Defense against Malicious Fine-tuning
by: Chen, Zixuan, et al.
Published: (2025)
by: Chen, Zixuan, et al.
Published: (2025)
Mobile GUI Agents under Real-world Threats: Are We There Yet?
by: Liu, Guohong, et al.
Published: (2025)
by: Liu, Guohong, et al.
Published: (2025)
A Graph-Attentive LSTM Model for Malicious URL Detection
by: Hossain, Md. Ifthekhar, et al.
Published: (2025)
by: Hossain, Md. Ifthekhar, et al.
Published: (2025)
Beyond Content Safety: Real-Time Monitoring for Reasoning Vulnerabilities in Large Language Models
by: Wang, Xunguang, et al.
Published: (2026)
by: Wang, Xunguang, et al.
Published: (2026)
Selective Amnesia: On Efficient, High-Fidelity and Blind Suppression of Backdoor Effects in Trojaned Machine Learning Models
by: Zhu, Rui, et al.
Published: (2022)
by: Zhu, Rui, et al.
Published: (2022)
Large Language Models powered Malicious Traffic Detection: Architecture, Opportunities and Case Study
by: Zhang, Xinggong, et al.
Published: (2025)
by: Zhang, Xinggong, et al.
Published: (2025)
Evolving Jailbreaks: Automated Multi-Objective Long-Tail Attacks on Large Language Models
by: Hong, Wenjing, et al.
Published: (2026)
by: Hong, Wenjing, et al.
Published: (2026)
Blockchain Data Analysis in the Era of Large-Language Models
by: Toyoda, Kentaroh, et al.
Published: (2024)
by: Toyoda, Kentaroh, et al.
Published: (2024)
Benchmarking Safety Risks of Knowledge-Intensive Reasoning under Malicious Knowledge Editing
by: Mao, Qinghua, et al.
Published: (2026)
by: Mao, Qinghua, et al.
Published: (2026)
A Vision for Access Control in LLM-based Agent Systems
by: Li, Xinfeng, et al.
Published: (2025)
by: Li, Xinfeng, et al.
Published: (2025)
Distilled Large Language Model in Confidential Computing Environment for System-on-Chip Design
by: Ben, Dong, et al.
Published: (2025)
by: Ben, Dong, et al.
Published: (2025)
EnJa: Ensemble Jailbreak on Large Language Models
by: Zhang, Jiahao, et al.
Published: (2024)
by: Zhang, Jiahao, et al.
Published: (2024)
SafePickle: Robust and Generic ML Detection of Malicious Pickle-based ML Models
by: Ohayon, Hillel, et al.
Published: (2026)
by: Ohayon, Hillel, et al.
Published: (2026)
Recent Advances in Attack and Defense Approaches of Large Language Models
by: Cui, Jing, et al.
Published: (2024)
by: Cui, Jing, et al.
Published: (2024)
Neural-Inspired Advances in Integral Cryptanalysis
by: Zhang, Liu, et al.
Published: (2025)
by: Zhang, Liu, et al.
Published: (2025)
Watermarking Techniques for Large Language Models: A Survey
by: Liang, Yuqing, et al.
Published: (2024)
by: Liang, Yuqing, et al.
Published: (2024)
Enhancing GraphQL Security by Detecting Malicious Queries Using Large Language Models, Sentence Transformers, and Convolutional Neural Networks
by: Perera, Irash, et al.
Published: (2025)
by: Perera, Irash, et al.
Published: (2025)
VeriLoRA: Fine-Tuning Large Language Models with Verifiable Security via Zero-Knowledge Proofs
by: Liao, Guofu, et al.
Published: (2025)
by: Liao, Guofu, et al.
Published: (2025)
AI Safety vs. AI Security: Demystifying the Distinction and Boundaries
by: Lin, Zhiqiang, et al.
Published: (2025)
by: Lin, Zhiqiang, et al.
Published: (2025)
A Survey on Data Security in Large Language Models
by: Chen, Kang, et al.
Published: (2025)
by: Chen, Kang, et al.
Published: (2025)
A Sentence Relation-Based Approach to Sanitizing Malicious Instructions
by: Datta, Soumil, et al.
Published: (2026)
by: Datta, Soumil, et al.
Published: (2026)
RedAgent: Red Teaming Large Language Models with Context-aware Autonomous Language Agent
by: Xu, Huiyu, et al.
Published: (2024)
by: Xu, Huiyu, et al.
Published: (2024)
Model-based Large Language Model Customization as Service
by: Wu, Zhaomin, et al.
Published: (2024)
by: Wu, Zhaomin, et al.
Published: (2024)
DrLLM: Prompt-Enhanced Distributed Denial-of-Service Resistance Method with Large Language Models
by: Yin, Zhenyu, et al.
Published: (2024)
by: Yin, Zhenyu, et al.
Published: (2024)
CryptoX : Compositional Reasoning Evaluation of Large Language Models
by: Shi, Jiajun, et al.
Published: (2025)
by: Shi, Jiajun, et al.
Published: (2025)
Similar Items
-
MAWSEO: Adversarial Wiki Search Poisoning for Illicit Online Promotion
by: Lin, Zilong, et al.
Published: (2023) -
Consiglieres in the Shadow: Understanding the Use of Uncensored Large Language Models in Cybercrimes
by: Lin, Zilong, et al.
Published: (2025) -
Leveraging Large Language Models to Detect npm Malicious Packages
by: Zahan, Nusrat, et al.
Published: (2024) -
Invisible Prompts, Visible Threats: Malicious Font Injection in External Resources for Large Language Models
by: Xiong, Junjie, et al.
Published: (2025) -
Hallucinating AI Hijacking Attack: Large Language Models and Malicious Code Recommenders
by: Noever, David, et al.
Published: (2024)