Saved in:
| Main Author: | Masserini, Elena |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.03311 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Towards Multi-Platform Mutation Testing of Task-based Chatbots
by: Clerissi, Diego, et al.
Published: (2025)
by: Clerissi, Diego, et al.
Published: (2025)
Automated Testing of Task-based Chatbots: How Far Are We?
by: Clerissi, Diego, et al.
Published: (2026)
by: Clerissi, Diego, et al.
Published: (2026)
Anonymizing Test Data in Android: Does It Hurt?
by: Masserini, Elena, et al.
Published: (2024)
by: Masserini, Elena, et al.
Published: (2024)
Bug Whispering: Towards Audio Bug Reporting
by: Masserini, Elena, et al.
Published: (2025)
by: Masserini, Elena, et al.
Published: (2025)
Assessing Task-based Chatbots: Snapshot and Curated Datasets for Dialogflow
by: Masserini, Elena, et al.
Published: (2026)
by: Masserini, Elena, et al.
Published: (2026)
Towards the Assessment of Task-based Chatbots: From the TOFU-R Snapshot to the BRASATO Curated Dataset
by: Masserini, Elena, et al.
Published: (2025)
by: Masserini, Elena, et al.
Published: (2025)
Metamorphic Testing of Image Captioning Systems via Image-Level Reduction
by: Xie, Xiaoyuan, et al.
Published: (2023)
by: Xie, Xiaoyuan, et al.
Published: (2023)
MultiTest: Physical-Aware Object Insertion for Testing Multi-sensor Fusion Perception Systems
by: Gao, Xinyu, et al.
Published: (2024)
by: Gao, Xinyu, et al.
Published: (2024)
Bita: A Conversational Assistant for Fairness Testing
by: Johnson, Keeryn, et al.
Published: (2025)
by: Johnson, Keeryn, et al.
Published: (2025)
Improving Deep Learning Framework Testing with Model-Level Metamorphic Testing
by: Mu, Yanzhou, et al.
Published: (2025)
by: Mu, Yanzhou, et al.
Published: (2025)
A Multi-Year Grey Literature Review on AI-assisted Test Automation
by: Ricca, Filippo, et al.
Published: (2024)
by: Ricca, Filippo, et al.
Published: (2024)
Efficient Black-Box Fault Localization for System-Level Test Code Using Large Language Models
by: Yaraghi, Ahmadreza Saboor, et al.
Published: (2025)
by: Yaraghi, Ahmadreza Saboor, et al.
Published: (2025)
Can AI Generate more Comprehensive Test Scenarios? Review on Automated Driving Systems Test Scenario Generation Methods
by: Zhou, Ji, et al.
Published: (2025)
by: Zhou, Ji, et al.
Published: (2025)
Reconsidering Conversational Norms in LLM Chatbots for Sustainable AI
by: Santos, Ronnie de Souza, et al.
Published: (2025)
by: Santos, Ronnie de Souza, et al.
Published: (2025)
MultiFileTest: A Multi-File-Level LLM Unit Test Generation Benchmark and Impact of Error Fixing Mechanisms
by: Wang, Yibo, et al.
Published: (2025)
by: Wang, Yibo, et al.
Published: (2025)
Test Design and Review Argumentation in AI-Assisted Test Generation
by: Enoiu, Eduard Paul, et al.
Published: (2026)
by: Enoiu, Eduard Paul, et al.
Published: (2026)
Scaling Mobile Chaos Testing with AI-Driven Test Execution
by: Marcano, Juan, et al.
Published: (2026)
by: Marcano, Juan, et al.
Published: (2026)
TestBench: Evaluating Class-Level Test Case Generation Capability of Large Language Models
by: Zhang, Quanjun, et al.
Published: (2024)
by: Zhang, Quanjun, et al.
Published: (2024)
Test Amplification for REST APIs via Single and Multi-Agent LLM Systems
by: Nooyens, Robbe, et al.
Published: (2025)
by: Nooyens, Robbe, et al.
Published: (2025)
AI Assurance: A Comprehensive Testing Strategy for Enterprise AI Systems
by: Badagi, Chitra, et al.
Published: (2026)
by: Badagi, Chitra, et al.
Published: (2026)
IDOL: Improved Different Optimization Levels Testing for Solidity Compilers
by: Li, Lantian, et al.
Published: (2025)
by: Li, Lantian, et al.
Published: (2025)
Bridging HCI and AI Research for the Evaluation of Conversational SE Assistants
by: Richards, Jonan, et al.
Published: (2025)
by: Richards, Jonan, et al.
Published: (2025)
The Role of AI in Modern Penetration Testing
by: Curtis, J. Alexander, et al.
Published: (2025)
by: Curtis, J. Alexander, et al.
Published: (2025)
Human-AI Collaboration for Scaling Agile Regression Testing: An Agentic-AI Teammate from Manual to Automated Testing
by: Outmani, Moustapha El, et al.
Published: (2026)
by: Outmani, Moustapha El, et al.
Published: (2026)
WebMAC: A Multi-Agent Collaborative Framework for Scenario Testing of Web Systems
by: Wan, Zhenyu, et al.
Published: (2026)
by: Wan, Zhenyu, et al.
Published: (2026)
LogiAgent: Automated Logical Testing for REST Systems with LLM-Based Multi-Agents
by: Zhang, Ke, et al.
Published: (2025)
by: Zhang, Ke, et al.
Published: (2025)
Targeted Testing of Compiler Optimizations via Grammar-Level Composition Styles
by: Zhou, Zitong, et al.
Published: (2025)
by: Zhou, Zitong, et al.
Published: (2025)
Testing with AI Agents: An Empirical Study of Test Generation Frequency, Quality, and Coverage
by: Yoshimoto, Suzuka, et al.
Published: (2026)
by: Yoshimoto, Suzuka, et al.
Published: (2026)
Generative AI in Simulation-Based Test Environments for Large-Scale Cyber-Physical Systems: An Industrial Study
by: Sadrnezhaad, Masoud, et al.
Published: (2025)
by: Sadrnezhaad, Masoud, et al.
Published: (2025)
Agentic AI in Industry: Adoption Level and Deployment Barriers
by: Apostolou, Spyridon Alvanakis, et al.
Published: (2026)
by: Apostolou, Spyridon Alvanakis, et al.
Published: (2026)
Penetration Testing and Legacy Systems
by: Smyth, Sandra
Published: (2023)
by: Smyth, Sandra
Published: (2023)
Ethics Testing: Proactive Identification of Generative AI System Harms
by: Tan, Shin Hwei, et al.
Published: (2026)
by: Tan, Shin Hwei, et al.
Published: (2026)
Generative AI for Testing of Autonomous Driving Systems: A Survey
by: Song, Qunying, et al.
Published: (2025)
by: Song, Qunying, et al.
Published: (2025)
Breaking, Stale, or Missing? Benchmarking Coding Agents on Project-Level Test Evolution
by: Shang, Ye, et al.
Published: (2026)
by: Shang, Ye, et al.
Published: (2026)
Automatic High-Level Test Case Generation using Large Language Models
by: Hasan, Navid Bin, et al.
Published: (2025)
by: Hasan, Navid Bin, et al.
Published: (2025)
Designing Secure AI-based Systems: a Multi-Vocal Literature Review
by: Schneider, Simon, et al.
Published: (2024)
by: Schneider, Simon, et al.
Published: (2024)
ContrastRepair: Enhancing Conversation-Based Automated Program Repair via Contrastive Test Case Pairs
by: Kong, Jiaolong, et al.
Published: (2024)
by: Kong, Jiaolong, et al.
Published: (2024)
Data-driven Test Generation for Fuzzing AI Compiler
by: Shen, Qingchao
Published: (2026)
by: Shen, Qingchao
Published: (2026)
ABTest: Behavior-Driven Testing for AI Coding Agents
by: Dai, Wuyang, et al.
Published: (2026)
by: Dai, Wuyang, et al.
Published: (2026)
Test Schedule Generation for Acceptance Testing of Mission-Critical Satellite Systems
by: Ollando, Raphaël, et al.
Published: (2025)
by: Ollando, Raphaël, et al.
Published: (2025)
Similar Items
-
Towards Multi-Platform Mutation Testing of Task-based Chatbots
by: Clerissi, Diego, et al.
Published: (2025) -
Automated Testing of Task-based Chatbots: How Far Are We?
by: Clerissi, Diego, et al.
Published: (2026) -
Anonymizing Test Data in Android: Does It Hurt?
by: Masserini, Elena, et al.
Published: (2024) -
Bug Whispering: Towards Audio Bug Reporting
by: Masserini, Elena, et al.
Published: (2025) -
Assessing Task-based Chatbots: Snapshot and Curated Datasets for Dialogflow
by: Masserini, Elena, et al.
Published: (2026)