:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Alexiou, Michail S., Mertoguno, J. Sukarno
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence Computation and Language
Online Access:	https://arxiv.org/abs/2602.09343
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

A Method for Fast Autonomy Transfer in Reinforcement Learning
by: Sahabandu, Dinuka, et al.
Published: (2024)

Brain Tumor Classifiers Under Attack: Robustness of ResNet Variants Against Transferable FGSM and PGD Attacks
by: Deem, Ryan, et al.
Published: (2026)

Adversarial Tuning: Defending Against Jailbreak Attacks for LLMs
by: Liu, Fan, et al.
Published: (2024)

Adversarial Attacks Against Automated Fact-Checking: A Survey
by: Liu, Fanzhen, et al.
Published: (2025)

HQA-Attack: Toward High Quality Black-Box Hard-Label Adversarial Attack on Text
by: Liu, Han, et al.
Published: (2024)

Kyrtos: A methodology for automatic deep analysis of graphic charts with curves in technical documents
by: Alexiou, Michail S., et al.
Published: (2026)

Chain Association-based Attacking and Shielding Natural Language Processing Systems
by: Huang, Jiacheng, et al.
Published: (2024)

Fast Adversarial Training against Textual Adversarial Attacks
by: Yang, Yichen, et al.
Published: (2024)

API-BLEND: A Comprehensive Corpora for Training and Benchmarking API LLMs
by: Basu, Kinjal, et al.
Published: (2024)

A Statistical and Multi-Perspective Revisiting of the Membership Inference Attack in Large Language Models
by: Chen, Bowen, et al.
Published: (2024)

Evaluating Implicit Bias in Large Language Models by Attacking From a Psychometric Perspective
by: Wen, Yuchen, et al.
Published: (2024)

STACK: Adversarial Attacks on LLM Safeguard Pipelines
by: McKenzie, Ian R., et al.
Published: (2025)

Towards Analyzing and Understanding the Limitations of DPO: A Theoretical Perspective
by: Feng, Duanyu, et al.
Published: (2024)

Towards Trustworthy Knowledge Graph Reasoning: An Uncertainty Aware Perspective
by: Ni, Bo, et al.
Published: (2024)

An Adversarial Perspective on Machine Unlearning for AI Safety
by: Łucki, Jakub, et al.
Published: (2024)

Robust Vision-Language Models via Tensor Decomposition: A Defense Against Adversarial Attacks
by: Patel, Het, et al.
Published: (2025)

Green Shielding: A User-Centric Approach Towards Trustworthy AI
by: Li, Aaron J., et al.
Published: (2026)

X-Transfer Attacks: Towards Super Transferable Adversarial Attacks on CLIP
by: Huang, Hanxun, et al.
Published: (2025)

Towards Cross-lingual Values Judgment: A Consensus-Pluralism Perspective
by: Chen, Yukun, et al.
Published: (2026)

Perspective Dial: Measuring Perspective of Text and Guiding LLM Outputs
by: Kim, Taejin, et al.
Published: (2025)

Combating Adversarial Attacks with Multi-Agent Debate
by: Chern, Steffi, et al.
Published: (2024)

Adversarial Attacks and Defense for Conversation Entailment Task
by: Yang, Zhenning, et al.
Published: (2024)

Merging Improves Self-Critique Against Jailbreak Attacks
by: Gallego, Victor
Published: (2024)

GoEX: Perspectives and Designs Towards a Runtime for Autonomous LLM Applications
by: Patil, Shishir G., et al.
Published: (2024)

SEP-Attack: A Simple and Effective Paradigm for Transfer-Based Textual Adversarial Attack
by: Liu, Han, et al.
Published: (2026)

ShieldLearner: A New Paradigm for Jailbreak Attack Defense in LLMs
by: Ni, Ziyi, et al.
Published: (2025)

A Generative Adversarial Attack for Multilingual Text Classifiers
by: Roth, Tom, et al.
Published: (2024)

Dual-Modality Multi-Stage Adversarial Safety Training: Robustifying Multimodal Web Agents Against Cross-Modal Attacks
by: Liu, Haoyu, et al.
Published: (2026)

Temperature Matters: Enhancing Watermark Robustness Against Paraphrasing Attacks
by: Idrissi, Badr Youbi, et al.
Published: (2025)

Perspectives in Play: A Multi-Perspective Approach for More Inclusive NLP Systems
by: Muscato, Benedetta, et al.
Published: (2025)

Robust Neural Information Retrieval: An Adversarial and Out-of-distribution Perspective
by: Liu, Yu-An, et al.
Published: (2024)

LLM-Generated Negative News Headlines Dataset: Creation and Benchmarking Against Real Journalism
by: Babalola, Olusola, et al.
Published: (2025)

FraudShield: Knowledge Graph Empowered Defense for LLMs against Fraud Attacks
by: Xu, Naen, et al.
Published: (2026)

API Pack: A Massive Multi-Programming Language Dataset for API Call Generation
by: Guo, Zhen, et al.
Published: (2024)

Towards Adaptive, Scalable, and Robust Coordination of LLM Agents: A Dynamic Ad-Hoc Networking Perspective
by: Li, Rui, et al.
Published: (2026)

Towards a More Inclusive AI: Progress and Perspectives in Large Language Model Training for the Sámi Language
by: Paul, Ronny, et al.
Published: (2024)

Robustness of Prompting: Enhancing Robustness of Large Language Models Against Prompting Attacks
by: Mu, Lin, et al.
Published: (2025)

Benchmarking and Defending Against Indirect Prompt Injection Attacks on Large Language Models
by: Yi, Jingwei, et al.
Published: (2023)

An Audit on the Perspectives and Challenges of Hallucinations in NLP
by: Venkit, Pranav Narayanan, et al.
Published: (2024)

Large Knowledge Model: Perspectives and Challenges
by: Chen, Huajun
Published: (2023)