:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Wang, Chengjia, Tang, Lanling, Yuan, Ming, Yu, Jiongchi, Xie, Xiaofei, Bu, Jiajun
Format:	Preprint
Published:	2025
Subjects:	Software Engineering
Online Access:	https://arxiv.org/abs/2509.22170
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

AutoEmpirical: LLM-Based Automated Research for Empirical Software Fault Analysis
by: Yu, Jiongchi, et al.
Published: (2025)

ARGUS: Defending LLM Agents Against Context-Aware Prompt Injection
by: Weng, Shihao, et al.
Published: (2026)

The Foundation Cracks: A Comprehensive Study on Bugs and Testing Practices in LLM Libraries
by: Jiang, Weipeng, et al.
Published: (2025)

Human in the Loop for Fuzz Testing: Literature Review and the Road Ahead
by: Yu, Jiongchi, et al.
Published: (2026)

Chimera: Harnessing Multi-Agent LLMs for Automatic Insider Threat Simulation
by: Yu, Jiongchi, et al.
Published: (2025)

Defects4C: Benchmarking Large Language Model Repair Capability with C/C++ Bugs
by: Wang, Jian, et al.
Published: (2025)

Understanding the Supply Chain and Risks of Large Language Model Applications
by: Ma, Yujie, et al.
Published: (2025)

Bias Testing and Mitigation in LLM-based Code Generation
by: Huang, Dong, et al.
Published: (2023)

SpecGen: Automated Generation of Formal Program Specifications via Large Language Models
by: Ma, Lezhi, et al.
Published: (2024)

CXXCrafter: An LLM-Based Agent for Automated C/C++ Open Source Software Building
by: Yu, Zhengmin, et al.
Published: (2025)

MoDitector: Module-Directed Testing for Autonomous Driving Systems
by: Wang, Renzhi, et al.
Published: (2025)

CAShift: Benchmarking Log-Based Cloud Attack Detection under Normality Shift
by: Yu, Jiongchi, et al.
Published: (2025)

Intention is All You Need: Refining Your Code from Your Intention
by: Guo, Qi, et al.
Published: (2025)

Beyond Accuracy: Policy Invariance as a Reliability Test for LLM Safety Judges
by: Weng, Shihao, et al.
Published: (2026)

From Exploration to Specification: LLM-Based Property Generation for Mobile App Testing
by: Xiong, Yiheng, et al.
Published: (2026)

LogiAgent: Automated Logical Testing for REST Systems with LLM-Based Multi-Agents
by: Zhang, Ke, et al.
Published: (2025)

DriveTester: A Unified Platform for Simulation-Based Autonomous Driving Testing
by: Cheng, Mingfei, et al.
Published: (2024)

What Makes a Good LLM Agent for Real-world Penetration Testing?
by: Deng, Gelei, et al.
Published: (2026)

CA2: Code-Aware Agent for Automated Game Testing
by: Adaikkappan, Valliappan Chidambaram, et al.
Published: (2026)

ContrastRepair: Enhancing Conversation-Based Automated Program Repair via Contrastive Test Case Pairs
by: Kong, Jiaolong, et al.
Published: (2024)

Temac: Multi-Agent Collaboration for Automated Web GUI Testing
by: Liu, Chenxu, et al.
Published: (2025)

Weaponizing the Commons: A Taxonomy and Detection Framework of Abuse on GitHub
by: Cheng, Yuli, et al.
Published: (2026)

Towards Automated Crowdsourced Testing via Personified-LLM
by: Yu, Shengcheng, et al.
Published: (2026)

Enhancing Automated Program Repair via Faulty Token Localization and Quality-Aware Patch Refinement
by: Kong, Jiaolong, et al.
Published: (2025)

LLM-Agents Driven Automated Simulation Testing and Analysis of small Uncrewed Aerial Systems
by: Duvvuru, Venkata Sai Aswath, et al.
Published: (2025)

TrajAudit: Automated Failure Diagnosis for Agentic Coding Systems
by: Wang, Minxing, et al.
Published: (2026)

STCLocker: Deadlock Avoidance Testing for Autonomous Driving Systems
by: Cheng, Mingfei, et al.
Published: (2025)

KTester: Leveraging Domain and Testing Knowledge for More Effective LLM-based Test Generation
by: Li, Anji, et al.
Published: (2025)

Deploy-Master: Automating the Deployment of 50,000+ Agent-Ready Scientific Tools in One Day
by: Wang, Yi, et al.
Published: (2026)

FT2Ra: A Fine-Tuning-Inspired Approach to Retrieval-Augmented Code Completion
by: Guo, Qi, et al.
Published: (2024)

Themis: Automatic and Efficient Deep Learning System Testing with Strong Fault Detection Capability
by: Huang, Dong, et al.
Published: (2024)

Model-Enhanced LLM-Driven VUI Testing of VPA Apps
by: Li, Suwan, et al.
Published: (2024)

LLM Agents for Automated Dependency Upgrades
by: Tawosi, Vali, et al.
Published: (2025)

UTFix: Change Aware Unit Test Repairing using LLM
by: Rahman, Shanto, et al.
Published: (2025)

AgentRaft: Automated Detection of Data Over-Exposure in LLM Agents
by: Lin, Yixi, et al.
Published: (2026)

AutoMT: A Multi-Agent LLM Framework for Automated Metamorphic Testing of Autonomous Driving Systems
by: Liang, Linfeng, et al.
Published: (2025)

A Comprehensive Study on Static Application Security Testing (SAST) Tools for Android
by: Zhu, Jingyun, et al.
Published: (2024)

iPanda: An LLM-based Agent for Automated Conformance Testing of Communication Protocols
by: Sun, Xikai, et al.
Published: (2025)

Hybrid Automated Program Repair by Combining Large Language Models and Program Analysis
by: Li, Fengjie, et al.
Published: (2024)

MemoCoder: Automated Function Synthesis using LLM-Supported Agents
by: Jia, Yiping, et al.
Published: (2025)