Saved in:
| Main Authors: | Okada, Hiroyuki, Oba, Tatsumi, Yanai, Naoto |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.03013 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Decoding BACnet Packets: A Large Language Model Approach for Packet Interpretation
by: Sharma, Rashi, et al.
Published: (2024)
by: Sharma, Rashi, et al.
Published: (2024)
Before You Hand Over the Wheel: Evaluating LLMs for Security Incident Analysis
by: Jajodia, Sourov, et al.
Published: (2026)
by: Jajodia, Sourov, et al.
Published: (2026)
Using LLMs to Automate Threat Intelligence Analysis Workflows in Security Operation Centers
by: Tseng, PeiYu, et al.
Published: (2024)
by: Tseng, PeiYu, et al.
Published: (2024)
Exploring Advanced Methodologies in Security Evaluation for LLMs
by: Huang, Jun, et al.
Published: (2024)
by: Huang, Jun, et al.
Published: (2024)
An Empirical Evaluation of LLMs for Solving Offensive Security Challenges
by: Shao, Minghao, et al.
Published: (2024)
by: Shao, Minghao, et al.
Published: (2024)
Can Agents Secure Hardware? Evaluating Agentic LLM-Driven Obfuscation for IP Protection
by: Ghimire, Sujan, et al.
Published: (2026)
by: Ghimire, Sujan, et al.
Published: (2026)
Can Developers rely on LLMs for Secure IaC Development?
by: Firouzi, Ehsan, et al.
Published: (2026)
by: Firouzi, Ehsan, et al.
Published: (2026)
Can LLMs Patch Security Issues?
by: Alrashedy, Kamel, et al.
Published: (2023)
by: Alrashedy, Kamel, et al.
Published: (2023)
Can We Trust Large Language Models Generated Code? A Framework for In-Context Learning, Security Patterns, and Code Evaluations Across Diverse LLMs
by: Mohsin, Ahmad, et al.
Published: (2024)
by: Mohsin, Ahmad, et al.
Published: (2024)
You Can't Eat Your Cake and Have It Too: The Performance Degradation of LLMs with Jailbreak Defense
by: Mai, Wuyuao, et al.
Published: (2025)
by: Mai, Wuyuao, et al.
Published: (2025)
LLMs in the SOC: An Empirical Study of Human-AI Collaboration in Security Operations Centres
by: Singh, Ronal, et al.
Published: (2025)
by: Singh, Ronal, et al.
Published: (2025)
A Novel Cipher for Enhancing MAVLink Security: Design, Security Analysis, and Performance Evaluation Using a Drone Testbed
by: Dixit, Bhavya, et al.
Published: (2025)
by: Dixit, Bhavya, et al.
Published: (2025)
Security Steerability is All You Need
by: Hazan, Itay, et al.
Published: (2025)
by: Hazan, Itay, et al.
Published: (2025)
Model-Driven Security Analysis of Self-Sovereign Identity Systems
by: Ding, Yepeng, et al.
Published: (2024)
by: Ding, Yepeng, et al.
Published: (2024)
Evaluating the Influence of Multi-Factor Authentication and Recovery Settings on the Security and Accessibility of User Accounts
by: Büttner, Andre, et al.
Published: (2024)
by: Büttner, Andre, et al.
Published: (2024)
LLMs Cannot Reliably Identify and Reason About Security Vulnerabilities (Yet?): A Comprehensive Evaluation, Framework, and Benchmarks
by: Ullah, Saad, et al.
Published: (2023)
by: Ullah, Saad, et al.
Published: (2023)
LLMs Can Covertly Sandbag on Capability Evaluations Against Chain-of-Thought Monitoring
by: Li, Chloe, et al.
Published: (2025)
by: Li, Chloe, et al.
Published: (2025)
From LLMs to Agents: A Comparative Evaluation of LLMs and LLM-based Agents in Security Patch Detection
by: Han, Junxiao, et al.
Published: (2025)
by: Han, Junxiao, et al.
Published: (2025)
Large Language Models for Security Operations Centers: A Comprehensive Survey
by: Habibzadeh, Ali, et al.
Published: (2025)
by: Habibzadeh, Ali, et al.
Published: (2025)
Does Teaming-Up LLMs Improve Secure Code Generation? A Comprehensive Evaluation with Multi-LLMSecCodeEval
by: Sabir, Bushra, et al.
Published: (2026)
by: Sabir, Bushra, et al.
Published: (2026)
Ruling the Unruly: Designing Effective, Low-Noise Network Intrusion Detection Rules for Security Operations Centers
by: Teuwen, Koen T. W., et al.
Published: (2025)
by: Teuwen, Koen T. W., et al.
Published: (2025)
Can LLMs Hack Enterprise Networks? -- Replicated Computational Results (RCR) Report
by: Happe, Andreas, et al.
Published: (2026)
by: Happe, Andreas, et al.
Published: (2026)
Considerations for Cloud Security Operations
by: Cusick, James
Published: (2016)
by: Cusick, James
Published: (2016)
A Systematic Evaluation of Parameter-Efficient Fine-Tuning Methods for the Security of Code LLMs
by: Lee, Kiho, et al.
Published: (2025)
by: Lee, Kiho, et al.
Published: (2025)
Cracking IoT Security: Can LLMs Outsmart Static Analysis Tools?
by: Quantrill, Jason, et al.
Published: (2026)
by: Quantrill, Jason, et al.
Published: (2026)
Evaluating and Improving the Robustness of Security Attack Detectors Generated by LLMs
by: Pasini, Samuele, et al.
Published: (2024)
by: Pasini, Samuele, et al.
Published: (2024)
AI-Driven Guided Response for Security Operation Centers with Microsoft Copilot for Security
by: Freitas, Scott, et al.
Published: (2024)
by: Freitas, Scott, et al.
Published: (2024)
How Can We Effectively Use LLMs for Phishing Detection?: Evaluating the Effectiveness of Large Language Model-based Phishing Detection Models
by: Ji, Fujiao, et al.
Published: (2025)
by: Ji, Fujiao, et al.
Published: (2025)
Design Principles for the Construction of a Benchmark Evaluating Security Operation Capabilities of Multi-agent AI Systems
by: Cai, Yicheng, et al.
Published: (2026)
by: Cai, Yicheng, et al.
Published: (2026)
LITE-SOC: Lightweight Security Operations Center Simulator for Cybersecurity Education
by: Higgins, Martin, et al.
Published: (2026)
by: Higgins, Martin, et al.
Published: (2026)
strideSEA: A STRIDE-centric Security Evaluation Approach
by: Jawad, Alvi, et al.
Published: (2025)
by: Jawad, Alvi, et al.
Published: (2025)
SecReEvalBench: A Multi-turned Security Resilience Evaluation Benchmark for Large Language Models
by: Cui, Huining, et al.
Published: (2025)
by: Cui, Huining, et al.
Published: (2025)
A Unified Framework for Human AI Collaboration in Security Operations Centers with Trusted Autonomy
by: Mohsin, Ahmad, et al.
Published: (2025)
by: Mohsin, Ahmad, et al.
Published: (2025)
Succinct Oblivious Tensor Evaluation and Applications: Adaptively-Secure Laconic Function Evaluation and Trapdoor Hashing for All Circuits
by: Abram, Damiano, et al.
Published: (2025)
by: Abram, Damiano, et al.
Published: (2025)
Jailbreaking LLMs & VLMs: Mechanisms, Evaluation, and Unified Defense
by: Chen, Zejian, et al.
Published: (2026)
by: Chen, Zejian, et al.
Published: (2026)
Evaluating Large Language Models for Security Bug Report Prediction
by: Soltaniani, Farnaz, et al.
Published: (2026)
by: Soltaniani, Farnaz, et al.
Published: (2026)
Breaking Agent Backbones: Evaluating the Security of Backbone LLMs in AI Agents
by: Bazinska, Julia, et al.
Published: (2025)
by: Bazinska, Julia, et al.
Published: (2025)
Strategic Dishonesty Can Undermine AI Safety Evaluations of Frontier LLMs
by: Panfilov, Alexander, et al.
Published: (2025)
by: Panfilov, Alexander, et al.
Published: (2025)
Leveraging Trustworthy AI for Automotive Security in Multi-Domain Operations: Towards a Responsive Human-AI Multi-Domain Task Force for Cyber Social Security
by: Barletta, Vita Santa, et al.
Published: (2025)
by: Barletta, Vita Santa, et al.
Published: (2025)
Governing AI-Assisted Security Operations: A Design Science Framework for Operational Decision Support
by: De La Cruz, Elyson A., et al.
Published: (2026)
by: De La Cruz, Elyson A., et al.
Published: (2026)
Similar Items
-
Decoding BACnet Packets: A Large Language Model Approach for Packet Interpretation
by: Sharma, Rashi, et al.
Published: (2024) -
Before You Hand Over the Wheel: Evaluating LLMs for Security Incident Analysis
by: Jajodia, Sourov, et al.
Published: (2026) -
Using LLMs to Automate Threat Intelligence Analysis Workflows in Security Operation Centers
by: Tseng, PeiYu, et al.
Published: (2024) -
Exploring Advanced Methodologies in Security Evaluation for LLMs
by: Huang, Jun, et al.
Published: (2024) -
An Empirical Evaluation of LLMs for Solving Offensive Security Challenges
by: Shao, Minghao, et al.
Published: (2024)