Saved in:
| Main Authors: | Wang, Chengjia, Tang, Lanling, Yuan, Ming, Yu, Jiongchi, Xie, Xiaofei, Bu, Jiajun |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.22170 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
AutoEmpirical: LLM-Based Automated Research for Empirical Software Fault Analysis
by: Yu, Jiongchi, et al.
Published: (2025)
by: Yu, Jiongchi, et al.
Published: (2025)
ARGUS: Defending LLM Agents Against Context-Aware Prompt Injection
by: Weng, Shihao, et al.
Published: (2026)
by: Weng, Shihao, et al.
Published: (2026)
The Foundation Cracks: A Comprehensive Study on Bugs and Testing Practices in LLM Libraries
by: Jiang, Weipeng, et al.
Published: (2025)
by: Jiang, Weipeng, et al.
Published: (2025)
Human in the Loop for Fuzz Testing: Literature Review and the Road Ahead
by: Yu, Jiongchi, et al.
Published: (2026)
by: Yu, Jiongchi, et al.
Published: (2026)
Chimera: Harnessing Multi-Agent LLMs for Automatic Insider Threat Simulation
by: Yu, Jiongchi, et al.
Published: (2025)
by: Yu, Jiongchi, et al.
Published: (2025)
Defects4C: Benchmarking Large Language Model Repair Capability with C/C++ Bugs
by: Wang, Jian, et al.
Published: (2025)
by: Wang, Jian, et al.
Published: (2025)
Understanding the Supply Chain and Risks of Large Language Model Applications
by: Ma, Yujie, et al.
Published: (2025)
by: Ma, Yujie, et al.
Published: (2025)
Bias Testing and Mitigation in LLM-based Code Generation
by: Huang, Dong, et al.
Published: (2023)
by: Huang, Dong, et al.
Published: (2023)
SpecGen: Automated Generation of Formal Program Specifications via Large Language Models
by: Ma, Lezhi, et al.
Published: (2024)
by: Ma, Lezhi, et al.
Published: (2024)
CXXCrafter: An LLM-Based Agent for Automated C/C++ Open Source Software Building
by: Yu, Zhengmin, et al.
Published: (2025)
by: Yu, Zhengmin, et al.
Published: (2025)
MoDitector: Module-Directed Testing for Autonomous Driving Systems
by: Wang, Renzhi, et al.
Published: (2025)
by: Wang, Renzhi, et al.
Published: (2025)
CAShift: Benchmarking Log-Based Cloud Attack Detection under Normality Shift
by: Yu, Jiongchi, et al.
Published: (2025)
by: Yu, Jiongchi, et al.
Published: (2025)
Intention is All You Need: Refining Your Code from Your Intention
by: Guo, Qi, et al.
Published: (2025)
by: Guo, Qi, et al.
Published: (2025)
Beyond Accuracy: Policy Invariance as a Reliability Test for LLM Safety Judges
by: Weng, Shihao, et al.
Published: (2026)
by: Weng, Shihao, et al.
Published: (2026)
From Exploration to Specification: LLM-Based Property Generation for Mobile App Testing
by: Xiong, Yiheng, et al.
Published: (2026)
by: Xiong, Yiheng, et al.
Published: (2026)
LogiAgent: Automated Logical Testing for REST Systems with LLM-Based Multi-Agents
by: Zhang, Ke, et al.
Published: (2025)
by: Zhang, Ke, et al.
Published: (2025)
DriveTester: A Unified Platform for Simulation-Based Autonomous Driving Testing
by: Cheng, Mingfei, et al.
Published: (2024)
by: Cheng, Mingfei, et al.
Published: (2024)
What Makes a Good LLM Agent for Real-world Penetration Testing?
by: Deng, Gelei, et al.
Published: (2026)
by: Deng, Gelei, et al.
Published: (2026)
CA2: Code-Aware Agent for Automated Game Testing
by: Adaikkappan, Valliappan Chidambaram, et al.
Published: (2026)
by: Adaikkappan, Valliappan Chidambaram, et al.
Published: (2026)
ContrastRepair: Enhancing Conversation-Based Automated Program Repair via Contrastive Test Case Pairs
by: Kong, Jiaolong, et al.
Published: (2024)
by: Kong, Jiaolong, et al.
Published: (2024)
Temac: Multi-Agent Collaboration for Automated Web GUI Testing
by: Liu, Chenxu, et al.
Published: (2025)
by: Liu, Chenxu, et al.
Published: (2025)
Weaponizing the Commons: A Taxonomy and Detection Framework of Abuse on GitHub
by: Cheng, Yuli, et al.
Published: (2026)
by: Cheng, Yuli, et al.
Published: (2026)
Towards Automated Crowdsourced Testing via Personified-LLM
by: Yu, Shengcheng, et al.
Published: (2026)
by: Yu, Shengcheng, et al.
Published: (2026)
Enhancing Automated Program Repair via Faulty Token Localization and Quality-Aware Patch Refinement
by: Kong, Jiaolong, et al.
Published: (2025)
by: Kong, Jiaolong, et al.
Published: (2025)
LLM-Agents Driven Automated Simulation Testing and Analysis of small Uncrewed Aerial Systems
by: Duvvuru, Venkata Sai Aswath, et al.
Published: (2025)
by: Duvvuru, Venkata Sai Aswath, et al.
Published: (2025)
TrajAudit: Automated Failure Diagnosis for Agentic Coding Systems
by: Wang, Minxing, et al.
Published: (2026)
by: Wang, Minxing, et al.
Published: (2026)
STCLocker: Deadlock Avoidance Testing for Autonomous Driving Systems
by: Cheng, Mingfei, et al.
Published: (2025)
by: Cheng, Mingfei, et al.
Published: (2025)
KTester: Leveraging Domain and Testing Knowledge for More Effective LLM-based Test Generation
by: Li, Anji, et al.
Published: (2025)
by: Li, Anji, et al.
Published: (2025)
Deploy-Master: Automating the Deployment of 50,000+ Agent-Ready Scientific Tools in One Day
by: Wang, Yi, et al.
Published: (2026)
by: Wang, Yi, et al.
Published: (2026)
FT2Ra: A Fine-Tuning-Inspired Approach to Retrieval-Augmented Code Completion
by: Guo, Qi, et al.
Published: (2024)
by: Guo, Qi, et al.
Published: (2024)
Themis: Automatic and Efficient Deep Learning System Testing with Strong Fault Detection Capability
by: Huang, Dong, et al.
Published: (2024)
by: Huang, Dong, et al.
Published: (2024)
Model-Enhanced LLM-Driven VUI Testing of VPA Apps
by: Li, Suwan, et al.
Published: (2024)
by: Li, Suwan, et al.
Published: (2024)
LLM Agents for Automated Dependency Upgrades
by: Tawosi, Vali, et al.
Published: (2025)
by: Tawosi, Vali, et al.
Published: (2025)
UTFix: Change Aware Unit Test Repairing using LLM
by: Rahman, Shanto, et al.
Published: (2025)
by: Rahman, Shanto, et al.
Published: (2025)
AgentRaft: Automated Detection of Data Over-Exposure in LLM Agents
by: Lin, Yixi, et al.
Published: (2026)
by: Lin, Yixi, et al.
Published: (2026)
AutoMT: A Multi-Agent LLM Framework for Automated Metamorphic Testing of Autonomous Driving Systems
by: Liang, Linfeng, et al.
Published: (2025)
by: Liang, Linfeng, et al.
Published: (2025)
A Comprehensive Study on Static Application Security Testing (SAST) Tools for Android
by: Zhu, Jingyun, et al.
Published: (2024)
by: Zhu, Jingyun, et al.
Published: (2024)
iPanda: An LLM-based Agent for Automated Conformance Testing of Communication Protocols
by: Sun, Xikai, et al.
Published: (2025)
by: Sun, Xikai, et al.
Published: (2025)
Hybrid Automated Program Repair by Combining Large Language Models and Program Analysis
by: Li, Fengjie, et al.
Published: (2024)
by: Li, Fengjie, et al.
Published: (2024)
MemoCoder: Automated Function Synthesis using LLM-Supported Agents
by: Jia, Yiping, et al.
Published: (2025)
by: Jia, Yiping, et al.
Published: (2025)
Similar Items
-
AutoEmpirical: LLM-Based Automated Research for Empirical Software Fault Analysis
by: Yu, Jiongchi, et al.
Published: (2025) -
ARGUS: Defending LLM Agents Against Context-Aware Prompt Injection
by: Weng, Shihao, et al.
Published: (2026) -
The Foundation Cracks: A Comprehensive Study on Bugs and Testing Practices in LLM Libraries
by: Jiang, Weipeng, et al.
Published: (2025) -
Human in the Loop for Fuzz Testing: Literature Review and the Road Ahead
by: Yu, Jiongchi, et al.
Published: (2026) -
Chimera: Harnessing Multi-Agent LLMs for Automatic Insider Threat Simulation
by: Yu, Jiongchi, et al.
Published: (2025)