:: Library Catalog

Bibliographic Details
Main Authors:	Kumar, Prasanna; Soni, Nishank; Munje, Gaurang
Format:	Preprint
Published:	2026
Subjects:	Cryptography and Security; Artificial Intelligence; Computational Engineering, Finance, and Science; I.2.7
Online Access:	https://arxiv.org/abs/2602.22237

Similar Items

AI Safeguards, Generative AI and the Pandora Box: AI Safety Measures to Protect Businesses and Personal Reputation
by: Kumar, Prasanna
Published: (2026)

VeriGuard: Enhancing LLM Agent Safety via Verified Code Generation
by: Miculicich, Lesly, et al.
Published: (2025)

A Validated Prompt Bank for Malicious Code Generation: Separating Executable Weapons from Security Knowledge in 1,554 Consensus-Labeled Prompts
by: Young, Richard J., et al.
Published: (2026)

FORGE: An LLM-driven Framework for Large-Scale Smart Contract Vulnerability Dataset Construction
by: Chen, Jiachi, et al.
Published: (2025)

LIPPEN: A Lightweight In-Place Pointer Encryption Architecture for Pointer Integrity
by: Iravani, Erfan, et al.
Published: (2026)

Evaluating the Robustness of Large Language Model Safety Guardrails Against Adversarial Attacks
by: Young, Richard J.
Published: (2025)

DWFS-Obfuscation: Dynamic Weighted Feature Selection for Robust Malware Familial Classification under Obfuscation
by: Wei, Xingyuan, et al.
Published: (2025)

POISONCRAFT: Practical Poisoning of Retrieval-Augmented Generation for Large Language Models
by: Shao, Yangguang, et al.
Published: (2025)

Show Me Your Code! Kill Code Poisoning: A Lightweight Method Based on Code Naturalness
by: Sun, Weisong, et al.
Published: (2025)

Prompt Fencing: A Cryptographic Approach to Establishing Security Boundaries in Large Language Model Prompts
by: Peh, Steven
Published: (2025)

Benchmarking Large Language Models for IoC Recovery under Adversarial Code Obfuscation and Encryption
by: Morales, Jaime, et al.
Published: (2026)

Reducing Information Overload: Because Even Security Experts Need to Blink
by: Kuehn, Philipp, et al.
Published: (2022)

JavelinGuard: Low-Cost Transformer Architectures for LLM Security
by: Datta, Yash, et al.
Published: (2025)

Temporal Attack Pattern Detection in Multi-Agent AI Workflows: An Open Framework for Training Trace-Based Security Models
by: Del Rosario, Ron F.
Published: (2025)

Real AI Agents with Fake Memories: Fatal Context Manipulation Attacks on Web3 Agents
by: Patlan, Atharv Singh, et al.
Published: (2025)

SecEmb: Sparsity-Aware Secure Federated Learning of On-Device Recommender System with Large Embedding
by: Mai, Peihua, et al.
Published: (2025)

ConfusionPrompt: Practical Private Inference for Online Large Language Models
by: Mai, Peihua, et al.
Published: (2023)

sudoLLM: On Multi-role Alignment of Language Models
by: Saha, Soumadeep, et al.
Published: (2025)

Split-and-Denoise: Protect large language model inference with local differential privacy
by: Mai, Peihua, et al.
Published: (2023)

Super Suffixes: Bypassing Text Generation Alignment and Guard Models Simultaneously
by: Adiletta, Andrew, et al.
Published: (2025)

UniC-RAG: Universal Knowledge Corruption Attacks to Retrieval-Augmented Generation
by: Geng, Runpeng, et al.
Published: (2025)

Operationalizing a Threat Model for Red-Teaming Large Language Models (LLMs)
by: Verma, Apurv, et al.
Published: (2024)

Efficient LLM Safety Evaluation through Multi-Agent Debate
by: Lin, Dachuan, et al.
Published: (2025)

PromptSAM+: Malware Detection based on Prompt Segment Anything Model
by: Wei, Xingyuan, et al.
Published: (2024)

Countermind: A Multi-Layered Security Architecture for Large Language Models
by: Schwarz, Dominik
Published: (2025)

Lightweight LLMs for Network Attack Detection in IoT Networks
by: Sudasinghe, Piyumi Bhagya, et al.
Published: (2026)

PoTS: Proof-of-Training-Steps for Backdoor Detection in Large Language Models
by: Seddik, Issam, et al.
Published: (2025)

Mitigating Trojanized Prompt Chains in Educational LLM Use Cases: Experimental Findings and Detection Tool Design
by: Charles, Richard M., et al.
Published: (2025)

Defending against Backdoor Attacks via Module Switching
by: Li, Weijun, et al.
Published: (2025)

CVE-Bench: A Benchmark for AI Agents' Ability to Exploit Real-World Web Application Vulnerabilities
by: Zhu, Yuxuan, et al.
Published: (2025)

MASH: Evading Black-Box AI-Generated Text Detectors via Style Humanization
by: Gu, Yongtong, et al.
Published: (2026)

Mitigating the Impact of Malware Evolution on API Sequence-based Windows Malware Detector
by: Wei, Xingyuan, et al.
Published: (2024)

Large Language Models for Combinatorial Optimization of Design Structure Matrix
by: Jiang, Shuo, et al.
Published: (2025)

Cyber Defense Benchmark: Agentic Threat Hunting Evaluation for LLMs in SecOps
by: Chona, Alankrit, et al.
Published: (2026)

Train to Defend: First Defense Against Cryptanalytic Neural Network Parameter Extraction Attacks
by: Kurian, Ashley, et al.
Published: (2025)

Terrarium: Revisiting the Blackboard for Multi-Agent Safety, Privacy, and Security Studies
by: Nakamura, Mason, et al.
Published: (2025)

VectorSmuggle: Steganographic Exfiltration in Embedding Stores and a Cryptographic Provenance Defense
by: Wanger, Jascha
Published: (2026)

Predicting Known Vulnerabilities from Attack Descriptions Using Sentence Transformers
by: Othman, Refat
Published: (2026)

Accelerating Suffix Jailbreak attacks with Prefix-Shared KV-cache
by: Wang, Xinhai, et al.
Published: (2026)

Ignore Me But Don't Replace Me: Utilizing Non-Linguistic Elements for Pretraining on the Cybersecurity Domain
by: Jang, Eugene, et al.
Published: (2024)