Saved in:
| Main Authors: | Alexiou, Michail S., Mertoguno, J. Sukarno |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.09343 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
A Method for Fast Autonomy Transfer in Reinforcement Learning
by: Sahabandu, Dinuka, et al.
Published: (2024)
by: Sahabandu, Dinuka, et al.
Published: (2024)
Brain Tumor Classifiers Under Attack: Robustness of ResNet Variants Against Transferable FGSM and PGD Attacks
by: Deem, Ryan, et al.
Published: (2026)
by: Deem, Ryan, et al.
Published: (2026)
Adversarial Tuning: Defending Against Jailbreak Attacks for LLMs
by: Liu, Fan, et al.
Published: (2024)
by: Liu, Fan, et al.
Published: (2024)
Adversarial Attacks Against Automated Fact-Checking: A Survey
by: Liu, Fanzhen, et al.
Published: (2025)
by: Liu, Fanzhen, et al.
Published: (2025)
HQA-Attack: Toward High Quality Black-Box Hard-Label Adversarial Attack on Text
by: Liu, Han, et al.
Published: (2024)
by: Liu, Han, et al.
Published: (2024)
Kyrtos: A methodology for automatic deep analysis of graphic charts with curves in technical documents
by: Alexiou, Michail S., et al.
Published: (2026)
by: Alexiou, Michail S., et al.
Published: (2026)
Chain Association-based Attacking and Shielding Natural Language Processing Systems
by: Huang, Jiacheng, et al.
Published: (2024)
by: Huang, Jiacheng, et al.
Published: (2024)
Fast Adversarial Training against Textual Adversarial Attacks
by: Yang, Yichen, et al.
Published: (2024)
by: Yang, Yichen, et al.
Published: (2024)
API-BLEND: A Comprehensive Corpora for Training and Benchmarking API LLMs
by: Basu, Kinjal, et al.
Published: (2024)
by: Basu, Kinjal, et al.
Published: (2024)
A Statistical and Multi-Perspective Revisiting of the Membership Inference Attack in Large Language Models
by: Chen, Bowen, et al.
Published: (2024)
by: Chen, Bowen, et al.
Published: (2024)
Evaluating Implicit Bias in Large Language Models by Attacking From a Psychometric Perspective
by: Wen, Yuchen, et al.
Published: (2024)
by: Wen, Yuchen, et al.
Published: (2024)
STACK: Adversarial Attacks on LLM Safeguard Pipelines
by: McKenzie, Ian R., et al.
Published: (2025)
by: McKenzie, Ian R., et al.
Published: (2025)
Towards Analyzing and Understanding the Limitations of DPO: A Theoretical Perspective
by: Feng, Duanyu, et al.
Published: (2024)
by: Feng, Duanyu, et al.
Published: (2024)
Towards Trustworthy Knowledge Graph Reasoning: An Uncertainty Aware Perspective
by: Ni, Bo, et al.
Published: (2024)
by: Ni, Bo, et al.
Published: (2024)
An Adversarial Perspective on Machine Unlearning for AI Safety
by: Łucki, Jakub, et al.
Published: (2024)
by: Łucki, Jakub, et al.
Published: (2024)
Robust Vision-Language Models via Tensor Decomposition: A Defense Against Adversarial Attacks
by: Patel, Het, et al.
Published: (2025)
by: Patel, Het, et al.
Published: (2025)
Green Shielding: A User-Centric Approach Towards Trustworthy AI
by: Li, Aaron J., et al.
Published: (2026)
by: Li, Aaron J., et al.
Published: (2026)
X-Transfer Attacks: Towards Super Transferable Adversarial Attacks on CLIP
by: Huang, Hanxun, et al.
Published: (2025)
by: Huang, Hanxun, et al.
Published: (2025)
Towards Cross-lingual Values Judgment: A Consensus-Pluralism Perspective
by: Chen, Yukun, et al.
Published: (2026)
by: Chen, Yukun, et al.
Published: (2026)
Perspective Dial: Measuring Perspective of Text and Guiding LLM Outputs
by: Kim, Taejin, et al.
Published: (2025)
by: Kim, Taejin, et al.
Published: (2025)
Combating Adversarial Attacks with Multi-Agent Debate
by: Chern, Steffi, et al.
Published: (2024)
by: Chern, Steffi, et al.
Published: (2024)
Adversarial Attacks and Defense for Conversation Entailment Task
by: Yang, Zhenning, et al.
Published: (2024)
by: Yang, Zhenning, et al.
Published: (2024)
Merging Improves Self-Critique Against Jailbreak Attacks
by: Gallego, Victor
Published: (2024)
by: Gallego, Victor
Published: (2024)
GoEX: Perspectives and Designs Towards a Runtime for Autonomous LLM Applications
by: Patil, Shishir G., et al.
Published: (2024)
by: Patil, Shishir G., et al.
Published: (2024)
SEP-Attack: A Simple and Effective Paradigm for Transfer-Based Textual Adversarial Attack
by: Liu, Han, et al.
Published: (2026)
by: Liu, Han, et al.
Published: (2026)
ShieldLearner: A New Paradigm for Jailbreak Attack Defense in LLMs
by: Ni, Ziyi, et al.
Published: (2025)
by: Ni, Ziyi, et al.
Published: (2025)
A Generative Adversarial Attack for Multilingual Text Classifiers
by: Roth, Tom, et al.
Published: (2024)
by: Roth, Tom, et al.
Published: (2024)
Dual-Modality Multi-Stage Adversarial Safety Training: Robustifying Multimodal Web Agents Against Cross-Modal Attacks
by: Liu, Haoyu, et al.
Published: (2026)
by: Liu, Haoyu, et al.
Published: (2026)
Temperature Matters: Enhancing Watermark Robustness Against Paraphrasing Attacks
by: Idrissi, Badr Youbi, et al.
Published: (2025)
by: Idrissi, Badr Youbi, et al.
Published: (2025)
Perspectives in Play: A Multi-Perspective Approach for More Inclusive NLP Systems
by: Muscato, Benedetta, et al.
Published: (2025)
by: Muscato, Benedetta, et al.
Published: (2025)
Robust Neural Information Retrieval: An Adversarial and Out-of-distribution Perspective
by: Liu, Yu-An, et al.
Published: (2024)
by: Liu, Yu-An, et al.
Published: (2024)
LLM-Generated Negative News Headlines Dataset: Creation and Benchmarking Against Real Journalism
by: Babalola, Olusola, et al.
Published: (2025)
by: Babalola, Olusola, et al.
Published: (2025)
FraudShield: Knowledge Graph Empowered Defense for LLMs against Fraud Attacks
by: Xu, Naen, et al.
Published: (2026)
by: Xu, Naen, et al.
Published: (2026)
API Pack: A Massive Multi-Programming Language Dataset for API Call Generation
by: Guo, Zhen, et al.
Published: (2024)
by: Guo, Zhen, et al.
Published: (2024)
Towards Adaptive, Scalable, and Robust Coordination of LLM Agents: A Dynamic Ad-Hoc Networking Perspective
by: Li, Rui, et al.
Published: (2026)
by: Li, Rui, et al.
Published: (2026)
Towards a More Inclusive AI: Progress and Perspectives in Large Language Model Training for the Sámi Language
by: Paul, Ronny, et al.
Published: (2024)
by: Paul, Ronny, et al.
Published: (2024)
Robustness of Prompting: Enhancing Robustness of Large Language Models Against Prompting Attacks
by: Mu, Lin, et al.
Published: (2025)
by: Mu, Lin, et al.
Published: (2025)
Benchmarking and Defending Against Indirect Prompt Injection Attacks on Large Language Models
by: Yi, Jingwei, et al.
Published: (2023)
by: Yi, Jingwei, et al.
Published: (2023)
An Audit on the Perspectives and Challenges of Hallucinations in NLP
by: Venkit, Pranav Narayanan, et al.
Published: (2024)
by: Venkit, Pranav Narayanan, et al.
Published: (2024)
Large Knowledge Model: Perspectives and Challenges
by: Chen, Huajun
Published: (2023)
by: Chen, Huajun
Published: (2023)
Similar Items
-
A Method for Fast Autonomy Transfer in Reinforcement Learning
by: Sahabandu, Dinuka, et al.
Published: (2024) -
Brain Tumor Classifiers Under Attack: Robustness of ResNet Variants Against Transferable FGSM and PGD Attacks
by: Deem, Ryan, et al.
Published: (2026) -
Adversarial Tuning: Defending Against Jailbreak Attacks for LLMs
by: Liu, Fan, et al.
Published: (2024) -
Adversarial Attacks Against Automated Fact-Checking: A Survey
by: Liu, Fanzhen, et al.
Published: (2025) -
HQA-Attack: Toward High Quality Black-Box Hard-Label Adversarial Attack on Text
by: Liu, Han, et al.
Published: (2024)