Saved in:
| Main Authors: | Bertram, Johannes, Geiping, Jonas |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.16756 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
How to Secure Existing C and C++ Software without Memory Safety
by: Erlingsson, Úlfar
Published: (2025)
by: Erlingsson, Úlfar
Published: (2025)
SABER: Benchmarking Operational Safety of LLM Coding Agents in Stateful Project Workspaces
by: Hu, Qi, et al.
Published: (2026)
by: Hu, Qi, et al.
Published: (2026)
SafeToolBench: Pioneering a Prospective Benchmark to Evaluating Tool Utilization Safety in LLMs
by: Xia, Hongfei, et al.
Published: (2025)
by: Xia, Hongfei, et al.
Published: (2025)
Benchmark of Benchmarks: Unpacking Influence and Code Repository Quality in LLM Safety Benchmarks
by: Chu, Junjie, et al.
Published: (2026)
by: Chu, Junjie, et al.
Published: (2026)
Jailbreak Distillation: Renewable Safety Benchmarking
by: Zhang, Jingyu, et al.
Published: (2025)
by: Zhang, Jingyu, et al.
Published: (2025)
Identifying Adversary Tactics and Techniques in Malware Binaries with an LLM Agent
by: Xuan, Zhou, et al.
Published: (2026)
by: Xuan, Zhou, et al.
Published: (2026)
Managing Security Evidence in Safety-Critical Organizations
by: Mohamad, Mazen, et al.
Published: (2024)
by: Mohamad, Mazen, et al.
Published: (2024)
DITING: A Static Analyzer for Identifying Bad Partitioning Issues in TEE Applications
by: Ma, Chengyan, et al.
Published: (2025)
by: Ma, Chengyan, et al.
Published: (2025)
Automated TEE Adaptation with LLMs: Identifying, Transforming, and Porting Sensitive Functions in Programs
by: Han, Ruidong, et al.
Published: (2025)
by: Han, Ruidong, et al.
Published: (2025)
Uncover the Premeditated Attacks: Detecting Exploitable Reentrancy Vulnerabilities by Identifying Attacker Contracts
by: Yang, Shuo, et al.
Published: (2024)
by: Yang, Shuo, et al.
Published: (2024)
AutoFirm: Automatically Identifying Reused Libraries inside IoT Firmware at Large-Scale
by: Chen, YongLe, et al.
Published: (2024)
by: Chen, YongLe, et al.
Published: (2024)
Understanding, Implementing, and Supporting Security Assurance Cases in Safety-Critical Domains
by: Mohamad, Mazen
Published: (2025)
by: Mohamad, Mazen
Published: (2025)
Safety Interventions against Adversarial Patches in an Open-Source Driver Assistance System
by: Chen, Cheng, et al.
Published: (2025)
by: Chen, Cheng, et al.
Published: (2025)
Towards a Benchmark for Dependency Decision-Making
by: Singla, Tanmay, et al.
Published: (2026)
by: Singla, Tanmay, et al.
Published: (2026)
Match & Mend: Minimally Invasive Local Reassembly for Patching N-day Vulnerabilities in ARM Binaries
by: Jänich, Sebastian, et al.
Published: (2025)
by: Jänich, Sebastian, et al.
Published: (2025)
Evaluating Nova 2.0 Lite model under Amazon's Frontier Model Safety Framework
by: Krishna, Satyapriya, et al.
Published: (2026)
by: Krishna, Satyapriya, et al.
Published: (2026)
CNT: Safety-oriented Function Reuse across LLMs via Cross-Model Neuron Transfer
by: Zhao, Yue, et al.
Published: (2026)
by: Zhao, Yue, et al.
Published: (2026)
Bridging Safety and Security in Complex Systems: A Model-Based Approach with SAFT-GT Toolchain
by: Pekaric, Irdin, et al.
Published: (2026)
by: Pekaric, Irdin, et al.
Published: (2026)
Usability as a Weapon: Attacking the Safety of LLM-Based Code Generation via Usability Requirements
by: Li, Yue, et al.
Published: (2026)
by: Li, Yue, et al.
Published: (2026)
Who Tests the Testers? Systematic Enumeration and Coverage Audit of LLM Agent Tool Call Safety
by: Chen, Xuan, et al.
Published: (2026)
by: Chen, Xuan, et al.
Published: (2026)
Failure Analysis of Safety Controllers in Autonomous Vehicles Under Object-Based LiDAR Attacks
by: Ganiuly, Daniyal, et al.
Published: (2025)
by: Ganiuly, Daniyal, et al.
Published: (2025)
Uncovering Hidden Inclusions of Vulnerable Dependencies in Real-World Java Projects
by: Schott, Stefan, et al.
Published: (2026)
by: Schott, Stefan, et al.
Published: (2026)
Bytecode-centric Detection of Known-to-be-vulnerable Dependencies in Java Projects
by: Schott, Stefan, et al.
Published: (2025)
by: Schott, Stefan, et al.
Published: (2025)
FirmReBugger: A Benchmark Framework for Monolithic Firmware Fuzzers
by: Duong, Mathew, et al.
Published: (2026)
by: Duong, Mathew, et al.
Published: (2026)
CAShift: Benchmarking Log-Based Cloud Attack Detection under Normality Shift
by: Yu, Jiongchi, et al.
Published: (2025)
by: Yu, Jiongchi, et al.
Published: (2025)
PATCHEVAL: A New Benchmark for Evaluating LLMs on Patching Real-World Vulnerabilities
by: Wei, Zichao, et al.
Published: (2025)
by: Wei, Zichao, et al.
Published: (2025)
RealSec-bench: A Benchmark for Evaluating Secure Code Generation in Real-World Repositories
by: Wang, Yanlin, et al.
Published: (2026)
by: Wang, Yanlin, et al.
Published: (2026)
AutoDFBench 1.0: A Benchmarking Framework for Digital Forensic Tool Testing and Generated Code Evaluation
by: Wickramasekara, Akila, et al.
Published: (2025)
by: Wickramasekara, Akila, et al.
Published: (2025)
SoK: Where to Fuzz? Assessing Target Selection Methods in Directed Fuzzing
by: Weissberg, Felix, et al.
Published: (2025)
by: Weissberg, Felix, et al.
Published: (2025)
Challenges in the Safety-Security Co-Assurance of Collaborative Industrial Robots
by: Gleirscher, Mario, et al.
Published: (2020)
by: Gleirscher, Mario, et al.
Published: (2020)
A Holistic Approach to E-Commerce Innovation: Redefining Security and User Experience
by: Akash, Mohammad Olid Ali, et al.
Published: (2025)
by: Akash, Mohammad Olid Ali, et al.
Published: (2025)
Identifier-Free Code Embedding Models for Scalable Search
by: Wolos, Eric, et al.
Published: (2026)
by: Wolos, Eric, et al.
Published: (2026)
Identifying the Supply Chain of AI for Trustworthiness and Risk Management in Critical Applications
by: Sheh, Raymond K., et al.
Published: (2025)
by: Sheh, Raymond K., et al.
Published: (2025)
Containment Verification: AI Safety Guarantees Independent of Alignment
by: Moon, Royce, et al.
Published: (2026)
by: Moon, Royce, et al.
Published: (2026)
Poisoned Identifiers Survive LLM Deobfuscation: A Case Study on Claude Opus 4.6
by: Lorenzo, Luis Guzmán
Published: (2026)
by: Lorenzo, Luis Guzmán
Published: (2026)
Inverting the Shield: Systematically Generating Safety Tests from Policy Specifications
by: Lu, Xiaoyue, et al.
Published: (2026)
by: Lu, Xiaoyue, et al.
Published: (2026)
AdaptiveGuard: Towards Adaptive Runtime Safety for LLM-Powered Software
by: Yang, Rui, et al.
Published: (2025)
by: Yang, Rui, et al.
Published: (2025)
Symbolic Guardrails for Domain-Specific Agents: Stronger Safety and Security Guarantees Without Sacrificing Utility
by: Hong, Yining, et al.
Published: (2026)
by: Hong, Yining, et al.
Published: (2026)
deepSURF: Detecting Memory Safety Vulnerabilities in Rust Through Fuzzing LLM-Augmented Harnesses
by: Androutsopoulos, Georgios, et al.
Published: (2025)
by: Androutsopoulos, Georgios, et al.
Published: (2025)
SCDBench: A Benchmark for LLM-Based Smart Contract Decompilers
by: Qin, Kaihua, et al.
Published: (2026)
by: Qin, Kaihua, et al.
Published: (2026)
Similar Items
-
How to Secure Existing C and C++ Software without Memory Safety
by: Erlingsson, Úlfar
Published: (2025) -
SABER: Benchmarking Operational Safety of LLM Coding Agents in Stateful Project Workspaces
by: Hu, Qi, et al.
Published: (2026) -
SafeToolBench: Pioneering a Prospective Benchmark to Evaluating Tool Utilization Safety in LLMs
by: Xia, Hongfei, et al.
Published: (2025) -
Benchmark of Benchmarks: Unpacking Influence and Code Repository Quality in LLM Safety Benchmarks
by: Chu, Junjie, et al.
Published: (2026) -
Jailbreak Distillation: Renewable Safety Benchmarking
by: Zhang, Jingyu, et al.
Published: (2025)