:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Okada, Hiroyuki, Oba, Tatsumi, Yanai, Naoto
Format:	Preprint
Published:	2026
Subjects:	Cryptography and Security
Online Access:	https://arxiv.org/abs/2601.03013
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Decoding BACnet Packets: A Large Language Model Approach for Packet Interpretation
by: Sharma, Rashi, et al.
Published: (2024)

Before You Hand Over the Wheel: Evaluating LLMs for Security Incident Analysis
by: Jajodia, Sourov, et al.
Published: (2026)

Using LLMs to Automate Threat Intelligence Analysis Workflows in Security Operation Centers
by: Tseng, PeiYu, et al.
Published: (2024)

Exploring Advanced Methodologies in Security Evaluation for LLMs
by: Huang, Jun, et al.
Published: (2024)

An Empirical Evaluation of LLMs for Solving Offensive Security Challenges
by: Shao, Minghao, et al.
Published: (2024)

Can Agents Secure Hardware? Evaluating Agentic LLM-Driven Obfuscation for IP Protection
by: Ghimire, Sujan, et al.
Published: (2026)

Can Developers rely on LLMs for Secure IaC Development?
by: Firouzi, Ehsan, et al.
Published: (2026)

Can LLMs Patch Security Issues?
by: Alrashedy, Kamel, et al.
Published: (2023)

Can We Trust Large Language Models Generated Code? A Framework for In-Context Learning, Security Patterns, and Code Evaluations Across Diverse LLMs
by: Mohsin, Ahmad, et al.
Published: (2024)

You Can't Eat Your Cake and Have It Too: The Performance Degradation of LLMs with Jailbreak Defense
by: Mai, Wuyuao, et al.
Published: (2025)

LLMs in the SOC: An Empirical Study of Human-AI Collaboration in Security Operations Centres
by: Singh, Ronal, et al.
Published: (2025)

A Novel Cipher for Enhancing MAVLink Security: Design, Security Analysis, and Performance Evaluation Using a Drone Testbed
by: Dixit, Bhavya, et al.
Published: (2025)

Security Steerability is All You Need
by: Hazan, Itay, et al.
Published: (2025)

Model-Driven Security Analysis of Self-Sovereign Identity Systems
by: Ding, Yepeng, et al.
Published: (2024)

Evaluating the Influence of Multi-Factor Authentication and Recovery Settings on the Security and Accessibility of User Accounts
by: Büttner, Andre, et al.
Published: (2024)

LLMs Cannot Reliably Identify and Reason About Security Vulnerabilities (Yet?): A Comprehensive Evaluation, Framework, and Benchmarks
by: Ullah, Saad, et al.
Published: (2023)

LLMs Can Covertly Sandbag on Capability Evaluations Against Chain-of-Thought Monitoring
by: Li, Chloe, et al.
Published: (2025)

From LLMs to Agents: A Comparative Evaluation of LLMs and LLM-based Agents in Security Patch Detection
by: Han, Junxiao, et al.
Published: (2025)

Large Language Models for Security Operations Centers: A Comprehensive Survey
by: Habibzadeh, Ali, et al.
Published: (2025)

Does Teaming-Up LLMs Improve Secure Code Generation? A Comprehensive Evaluation with Multi-LLMSecCodeEval
by: Sabir, Bushra, et al.
Published: (2026)

Ruling the Unruly: Designing Effective, Low-Noise Network Intrusion Detection Rules for Security Operations Centers
by: Teuwen, Koen T. W., et al.
Published: (2025)

Can LLMs Hack Enterprise Networks? -- Replicated Computational Results (RCR) Report
by: Happe, Andreas, et al.
Published: (2026)

Considerations for Cloud Security Operations
by: Cusick, James
Published: (2016)

A Systematic Evaluation of Parameter-Efficient Fine-Tuning Methods for the Security of Code LLMs
by: Lee, Kiho, et al.
Published: (2025)

Cracking IoT Security: Can LLMs Outsmart Static Analysis Tools?
by: Quantrill, Jason, et al.
Published: (2026)

Evaluating and Improving the Robustness of Security Attack Detectors Generated by LLMs
by: Pasini, Samuele, et al.
Published: (2024)

AI-Driven Guided Response for Security Operation Centers with Microsoft Copilot for Security
by: Freitas, Scott, et al.
Published: (2024)

How Can We Effectively Use LLMs for Phishing Detection?: Evaluating the Effectiveness of Large Language Model-based Phishing Detection Models
by: Ji, Fujiao, et al.
Published: (2025)

Design Principles for the Construction of a Benchmark Evaluating Security Operation Capabilities of Multi-agent AI Systems
by: Cai, Yicheng, et al.
Published: (2026)

LITE-SOC: Lightweight Security Operations Center Simulator for Cybersecurity Education
by: Higgins, Martin, et al.
Published: (2026)

strideSEA: A STRIDE-centric Security Evaluation Approach
by: Jawad, Alvi, et al.
Published: (2025)

SecReEvalBench: A Multi-turned Security Resilience Evaluation Benchmark for Large Language Models
by: Cui, Huining, et al.
Published: (2025)

A Unified Framework for Human AI Collaboration in Security Operations Centers with Trusted Autonomy
by: Mohsin, Ahmad, et al.
Published: (2025)

Succinct Oblivious Tensor Evaluation and Applications: Adaptively-Secure Laconic Function Evaluation and Trapdoor Hashing for All Circuits
by: Abram, Damiano, et al.
Published: (2025)

Jailbreaking LLMs & VLMs: Mechanisms, Evaluation, and Unified Defense
by: Chen, Zejian, et al.
Published: (2026)

Evaluating Large Language Models for Security Bug Report Prediction
by: Soltaniani, Farnaz, et al.
Published: (2026)

Breaking Agent Backbones: Evaluating the Security of Backbone LLMs in AI Agents
by: Bazinska, Julia, et al.
Published: (2025)

Strategic Dishonesty Can Undermine AI Safety Evaluations of Frontier LLMs
by: Panfilov, Alexander, et al.
Published: (2025)

Leveraging Trustworthy AI for Automotive Security in Multi-Domain Operations: Towards a Responsive Human-AI Multi-Domain Task Force for Cyber Social Security
by: Barletta, Vita Santa, et al.
Published: (2025)

Governing AI-Assisted Security Operations: A Design Science Framework for Operational Decision Support
by: De La Cruz, Elyson A., et al.
Published: (2026)