:: Library Catalog

Bibliographic Details
Main Authors:	Han, Junxiao; Yu, Zheng; Bao, Lingfeng; Liu, Jiakun; Wan, Yao; Yin, Jianwei; Deng, Shuiguang; Han, Song
Format:	Preprint
Published:	2025
Subjects:	Cryptography and Security; Software Engineering
Online Access:	https://arxiv.org/abs/2511.08060

Similar Items

PATCHEVAL: A New Benchmark for Evaluating LLMs on Patching Real-World Vulnerabilities
by: Wei, Zichao, et al.
Published: (2025)

What Makes a Good LLM Agent for Real-world Penetration Testing?
by: Deng, Gelei, et al.
Published: (2026)

When "Correct" Is Not Safe: Can We Trust Functionally Correct Patches Generated by Code Agents?
by: Peng, Yibo, et al.
Published: (2025)

Evaluating LLMs for One-Shot Patching of Real and Artificial Vulnerabilities
by: Garg, Aayush, et al.
Published: (2025)

From Reviewers' Lens: Understanding Bug Bounty Report Invalid Reasons with LLMs
by: Zheng, Jiangrui, et al.
Published: (2025)

ChainFuzzer: Greybox Fuzzing for Workflow-Level Multi-Tool Vulnerabilities in LLM Agents
by: Wu, Jiangrong, et al.
Published: (2026)

LLMs for Cyber Security: New Opportunities
by: Divakaran, Dinil Mon, et al.
Published: (2024)

LLMs as Firmware Experts: A Runtime-Grown Tree-of-Agents Framework
by: Zhang, Xiangrui, et al.
Published: (2025)

SIR-Bench: Evaluating Investigation Depth in Security Incident Response Agents
by: Begimher, Daniel, et al.
Published: (2026)

LLMs + Security = Trouble
by: Livshits, Benjamin
Published: (2026)

VulInstruct: Teaching LLMs Root-Cause Reasoning for Vulnerability Detection via Security Specifications
by: Zhu, Hao, et al.
Published: (2025)

LLM4Vuln: A Unified Evaluation Framework for Decoupling and Enhancing LLMs' Vulnerability Reasoning
by: Sun, Yuqiang, et al.
Published: (2024)

Automated Repair of TEE Partitioning Issues via DSL-Guided and LLM-Assisted Patching
by: Ma, Chengyan, et al.
Published: (2026)

Reentrancy Detection in the Age of LLMs
by: Ressi, Dalila, et al.
Published: (2026)

PatchSeeker: Mapping NVD Records to their Vulnerability-fixing Commits with LLM Generated Commits and Embeddings
by: Nguyen, Huu Hung, et al.
Published: (2025)

Towards Secure Logging: Characterizing and Benchmarking Logging Code Security Issues with LLMs
by: Yuan, He Yang, et al.
Published: (2026)

Evaluating and Improving the Robustness of Security Attack Detectors Generated by LLMs
by: Pasini, Samuele, et al.
Published: (2024)

StriderSPD: Structure-Guided Joint Representation Learning for Binary Security Patch Detection
by: Li, Qingyuan, et al.
Published: (2026)

Identifying Adversary Tactics and Techniques in Malware Binaries with an LLM Agent
by: Xuan, Zhou, et al.
Published: (2026)

Using LLMs for Security Advisory Investigations: How Far Are We?
by: Abdullah, Bayu Fedra, et al.
Published: (2025)

SecureFixAgent: A Hybrid LLM Agent for Automated Python Static Vulnerability Repair
by: Gajjar, Jugal, et al.
Published: (2025)

Security Is Relative: Training-Free Vulnerability Detection via Multi-Agent Behavioral Contract Synthesis
by: Wang, Yongchao, et al.
Published: (2026)

SkillProbe: Security Auditing for Emerging Agent Skill Marketplaces via Multi-Agent Collaboration
by: Guo, Zihan, et al.
Published: (2026)

FLAMES: Fine-tuning LLMs to Synthesize Invariants for Smart Contract Security
by: Eshghie, Mojtaba, et al.
Published: (2025)

Give LLMs a Security Course: Securing Retrieval-Augmented Code Generation via Knowledge Injection
by: Lin, Bo, et al.
Published: (2025)

Does Teaming-Up LLMs Improve Secure Code Generation? A Comprehensive Evaluation with Multi-LLMSecCodeEval
by: Sabir, Bushra, et al.
Published: (2026)

QASecClaw: A Multi-Agent LLM Approach for False Positive Reduction in Static Application Security Testing
by: Ameen, Mohd Ruhul, et al.
Published: (2026)

Revisiting Vulnerability Patch Localization: An Empirical Study and LLM-Based Solution
by: Xu, Haoran, et al.
Published: (2025)

Do Fine-Tuned LLMs Understand Vulnerabilities? An Investigation into the Semantic Trap
by: Huang, Feiyang, et al.
Published: (2026)

An Investigation of Patch Porting Practices of the Linux Kernel Ecosystem
by: Li, Xingyu, et al.
Published: (2024)

How to Compare the Security of Code Written by Humans to LLM-generated Code
by: Balebako, Rebecca, et al.
Published: (2026)

Chimera: Harnessing Multi-Agent LLMs for Automatic Insider Threat Simulation
by: Yu, Jiongchi, et al.
Published: (2025)

From Lab to Reality: A Practical Evaluation of Deep Learning Models and LLMs for Vulnerability Detection
by: Lu, Chaomeng, et al.
Published: (2025)

SABER: Benchmarking Operational Safety of LLM Coding Agents in Stateful Project Workspaces
by: Hu, Qi, et al.
Published: (2026)

Secure Coding with AI -- From Detection to Repair
by: Belozerov, Vladislav, et al.
Published: (2025)

Repository-Level Graph Representation Learning for Enhanced Security Patch Detection
by: Wen, Xin-Cheng, et al.
Published: (2024)

Out of Distribution, Out of Luck: How Well Can LLMs Trained on Vulnerability Datasets Detect Top 25 CWE Weaknesses?
by: Li, Yikun, et al.
Published: (2025)

An Empirical Security Evaluation of LLM-Generated Cryptographic Rust Code
by: Elsayed, Mohamed, et al.
Published: (2026)

Agent Skills in the Wild: An Empirical Study of Security Vulnerabilities at Scale
by: Liu, Yi, et al.
Published: (2026)

Automated TEE Adaptation with LLMs: Identifying, Transforming, and Porting Sensitive Functions in Programs
by: Han, Ruidong, et al.
Published: (2025)