Saved in:
| Main Authors: | Li, Hongwei, Wang, Zhun, Dai, Qinrun, Nie, Yuzhou, Peng, Jinjun, Liu, Ruitong, Zhang, Jingyang, Zhu, Kaijie, He, Jingxuan, Wang, Lun, Ding, Yangruibo, Chen, Yueqi, Guo, Wenbo, Song, Dawn |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.16891 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
VulnLLM-R: Specialized Reasoning LLM with Agent Scaffold for Vulnerability Detection
by: Nie, Yuzhou, et al.
Published: (2025)
by: Nie, Yuzhou, et al.
Published: (2025)
LeakAgent: RL-based Red-teaming Agent for LLM Privacy Leakage
by: Nie, Yuzhou, et al.
Published: (2024)
by: Nie, Yuzhou, et al.
Published: (2024)
Co-PatcheR: Collaborative Software Patching with Component(s)-specific Small Reasoning Models
by: Tang, Yuheng, et al.
Published: (2025)
by: Tang, Yuheng, et al.
Published: (2025)
Progent: Securing AI Agents with Privilege Control
by: Shi, Tianneng, et al.
Published: (2025)
by: Shi, Tianneng, et al.
Published: (2025)
AgentVigil: Generic Black-Box Red-teaming for Indirect Prompt Injection against LLM Agents
by: Wang, Zhun, et al.
Published: (2025)
by: Wang, Zhun, et al.
Published: (2025)
When eBPF Meets Machine Learning: On-the-fly OS Kernel Compartmentalization
by: Wang, Zicheng, et al.
Published: (2024)
by: Wang, Zicheng, et al.
Published: (2024)
DevOps-Gym: Benchmarking AI Agents in Software DevOps Cycle
by: Tang, Yuheng, et al.
Published: (2026)
by: Tang, Yuheng, et al.
Published: (2026)
To Defend Against Cyber Attacks, We Must Teach AI Agents to Hack
by: Zhuo, Terry Yue, et al.
Published: (2026)
by: Zhuo, Terry Yue, et al.
Published: (2026)
CyberGym: Evaluating AI Agents' Real-World Cybersecurity Capabilities at Scale
by: Wang, Zhun, et al.
Published: (2025)
by: Wang, Zhun, et al.
Published: (2025)
A Framework for Formalizing LLM Agent Security
by: Siu, Vincent, et al.
Published: (2026)
by: Siu, Vincent, et al.
Published: (2026)
SeCodePLT: A Unified Platform for Evaluating the Security of Code GenAI
by: Nie, Yuzhou, et al.
Published: (2024)
by: Nie, Yuzhou, et al.
Published: (2024)
TermiGen: High-Fidelity Environment and Robust Trajectory Synthesis for Terminal Agents
by: Zhu, Kaijie, et al.
Published: (2026)
by: Zhu, Kaijie, et al.
Published: (2026)
SemCoder: Training Code Language Models with Comprehensive Semantics Reasoning
by: Ding, Yangruibo, et al.
Published: (2024)
by: Ding, Yangruibo, et al.
Published: (2024)
Horizontal ground‐motion model for subduction slab earthquakes using offshore ground motions in the Japan Trench area
by: Jingyang Tan, et al.
Published: (2024)
by: Jingyang Tan, et al.
Published: (2024)
rePIRL: Learn PRM with Inverse RL for LLM Reasoning
by: Wu, Xian, et al.
Published: (2026)
by: Wu, Xian, et al.
Published: (2026)
CYCLE: Learning to Self-Refine the Code Generation
by: Ding, Yangruibo, et al.
Published: (2024)
by: Ding, Yangruibo, et al.
Published: (2024)
WebSentinel: Detecting and Localizing Prompt Injection Attacks for Web Agents
by: Wang, Xilong, et al.
Published: (2026)
by: Wang, Xilong, et al.
Published: (2026)
ExploitGym: Can AI Agents Turn Security Vulnerabilities into Real Attacks?
by: Wang, Zhun, et al.
Published: (2026)
by: Wang, Zhun, et al.
Published: (2026)
BlueCodeAgent: A Blue Teaming Agent Enabled by Automated Red Teaming for CodeGen AI
by: Guo, Chengquan, et al.
Published: (2025)
by: Guo, Chengquan, et al.
Published: (2025)
Effects of source, path, and site conditions on damping modification factor for the horizontal response spectrum using offshore ground motions in the Japan Trench area
by: Mingji Liu, et al.
Published: (2025)
by: Mingji Liu, et al.
Published: (2025)
TrojFM: Resource-efficient Backdoor Attacks against Very Large Foundation Models
by: Nie, Yuzhou., et al.
Published: (2024)
by: Nie, Yuzhou., et al.
Published: (2024)
Frontier AI's Impact on the Cybersecurity Landscape
by: Potter, Yujin, et al.
Published: (2025)
by: Potter, Yujin, et al.
Published: (2025)
MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents
by: Zhu, Kaijie, et al.
Published: (2025)
by: Zhu, Kaijie, et al.
Published: (2025)
When LLM Meets DRL: Advancing Jailbreaking Efficiency via DRL-guided Search
by: Chen, Xuan, et al.
Published: (2024)
by: Chen, Xuan, et al.
Published: (2024)
SWE-Spot: Building Small Repo-Experts with Repository-Centric Learning
by: Peng, Jinjun, et al.
Published: (2026)
by: Peng, Jinjun, et al.
Published: (2026)
The Attack and Defense Landscape of Agentic AI: A Comprehensive Survey
by: Kim, Juhee, et al.
Published: (2026)
by: Kim, Juhee, et al.
Published: (2026)
InfEngine: A Self-Verifying and Self-Optimizing Intelligent Engine for Infrared Radiation Computing
by: Ding, Kun, et al.
Published: (2026)
by: Ding, Kun, et al.
Published: (2026)
PatchPilot: A Cost-Efficient Software Engineering Agent with Early Attempts on Formal Verification
by: Li, Hongwei, et al.
Published: (2025)
by: Li, Hongwei, et al.
Published: (2025)
PromptArmor: Simple yet Effective Prompt Injection Defenses
by: Shi, Tianneng, et al.
Published: (2025)
by: Shi, Tianneng, et al.
Published: (2025)
Beyond Accuracy: Evaluating Self-Consistency of Code Large Language Models with IdentityChain
by: Min, Marcus J., et al.
Published: (2023)
by: Min, Marcus J., et al.
Published: (2023)
Ambient-pressure superconductivity above 22 K in hole-doped YB2
by: Li, Xuejie, et al.
Published: (2025)
by: Li, Xuejie, et al.
Published: (2025)
The Essence of Balance for Self-Improving Agents in Vision-and-Language Navigation
by: Liu, Zhen, et al.
Published: (2026)
by: Liu, Zhen, et al.
Published: (2026)
RL-JACK: Reinforcement Learning-powered Black-box Jailbreaking Attack against LLMs
by: Chen, Xuan, et al.
Published: (2024)
by: Chen, Xuan, et al.
Published: (2024)
Scientific Computing with Open SageMath not only for Physics Education
by: Borovský, Dominik, et al.
Published: (2023)
by: Borovský, Dominik, et al.
Published: (2023)
Automated Code Editing with Search-Generate-Modify
by: Liu, Changshu, et al.
Published: (2023)
by: Liu, Changshu, et al.
Published: (2023)
DecodingTrust-Agent Platform (DTap): A Controllable and Interactive Red-Teaming Platform for AI Agents
by: Chen, Zhaorun, et al.
Published: (2026)
by: Chen, Zhaorun, et al.
Published: (2026)
Self-Sovereign Agent
by: Qu, Wenjie, et al.
Published: (2026)
by: Qu, Wenjie, et al.
Published: (2026)
Relating Events and Frames Based on Self-Supervised Learning and Uncorrelated Conditioning for Unsupervised Domain Adaptation
by: Rostami, Mohammad, et al.
Published: (2024)
by: Rostami, Mohammad, et al.
Published: (2024)
Does Personalized Nudging Wear Off? A Longitudinal Study of AI Self-Modeling for Behavioral Engagement
by: He, Qing, et al.
Published: (2026)
by: He, Qing, et al.
Published: (2026)
SciSage: A Multi-Agent Framework for High-Quality Scientific Survey Generation
by: Shi, Xiaofeng, et al.
Published: (2025)
by: Shi, Xiaofeng, et al.
Published: (2025)
Similar Items
-
VulnLLM-R: Specialized Reasoning LLM with Agent Scaffold for Vulnerability Detection
by: Nie, Yuzhou, et al.
Published: (2025) -
LeakAgent: RL-based Red-teaming Agent for LLM Privacy Leakage
by: Nie, Yuzhou, et al.
Published: (2024) -
Co-PatcheR: Collaborative Software Patching with Component(s)-specific Small Reasoning Models
by: Tang, Yuheng, et al.
Published: (2025) -
Progent: Securing AI Agents with Privilege Control
by: Shi, Tianneng, et al.
Published: (2025) -
AgentVigil: Generic Black-Box Red-teaming for Indirect Prompt Injection against LLM Agents
by: Wang, Zhun, et al.
Published: (2025)