Guardado en:
| Autores principales: | Shahbandeh, Mobina, Alian, Parsa, Nashid, Noor, Mesbah, Ali |
|---|---|
| Formato: | Preprint |
| Publicado: |
2024
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2409.10741 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
Semantic Constraint Inference for Web Form Test Generation
por: Alian, Parsa, et al.
Publicado: (2024)
por: Alian, Parsa, et al.
Publicado: (2024)
Feature-Driven End-To-End Test Generation
por: Alian, Parsa, et al.
Publicado: (2024)
por: Alian, Parsa, et al.
Publicado: (2024)
Dockerfile Flakiness: Characterization and Repair
por: Shabani, Taha, et al.
Publicado: (2024)
por: Shabani, Taha, et al.
Publicado: (2024)
Contextual API Completion for Unseen Repositories Using LLMs
por: Nashid, Noor, et al.
Publicado: (2024)
por: Nashid, Noor, et al.
Publicado: (2024)
VISCA: Inferring Component Abstractions for Automated End-to-End Testing
por: Alian, Parsa, et al.
Publicado: (2025)
por: Alian, Parsa, et al.
Publicado: (2025)
LLM Test Generation via Iterative Hybrid Program Analysis
por: Gu, Sijia, et al.
Publicado: (2025)
por: Gu, Sijia, et al.
Publicado: (2025)
Issue2Test: Generating Reproducing Test Cases from Issue Reports
por: Nashid, Noor, et al.
Publicado: (2025)
por: Nashid, Noor, et al.
Publicado: (2025)
Beyond Accuracy: Behavioral Dynamics of Agentic Multi-Hunk Repair
por: Nashid, Noor, et al.
Publicado: (2025)
por: Nashid, Noor, et al.
Publicado: (2025)
Characterizing Multi-Hunk Patches: Divergence, Proximity, and LLM Repair Challenges
por: Nashid, Noor, et al.
Publicado: (2025)
por: Nashid, Noor, et al.
Publicado: (2025)
CallNavi, A Challenge and Empirical Study on LLM Function Calling and Routing
por: Song, Yewei, et al.
Publicado: (2025)
por: Song, Yewei, et al.
Publicado: (2025)
Self-reflection in Automated Qualitative Coding: Improving Text Annotation through Secondary LLM Critique
por: Dunivin, Zackary Okun, et al.
Publicado: (2026)
por: Dunivin, Zackary Okun, et al.
Publicado: (2026)
Effective Black Box Testing of Sentiment Analysis Classification Networks
por: Karbasizadeh, Parsa, et al.
Publicado: (2024)
por: Karbasizadeh, Parsa, et al.
Publicado: (2024)
Guided Search Strategies in Non-Serializable Environments with Applications to Software Engineering Agents
por: Zainullina, Karina, et al.
Publicado: (2025)
por: Zainullina, Karina, et al.
Publicado: (2025)
From Code Foundation Models to Agents and Applications: A Comprehensive Survey and Practical Guide to Code Intelligence
por: Yang, Jian, et al.
Publicado: (2025)
por: Yang, Jian, et al.
Publicado: (2025)
Scalable Similarity-Aware Test Suite Minimization with Reinforcement Learning
por: Gu, Sijia, et al.
Publicado: (2024)
por: Gu, Sijia, et al.
Publicado: (2024)
Fine-Grained Assertion-Based Test Selection
por: Gu, Sijia, et al.
Publicado: (2024)
por: Gu, Sijia, et al.
Publicado: (2024)
WebGen-R1: Incentivizing Large Language Models to Generate Functional and Aesthetic Websites with Reinforcement Learning
por: Jiang, Juyong, et al.
Publicado: (2026)
por: Jiang, Juyong, et al.
Publicado: (2026)
Choosing the Right Communication Protocol for your Web Application
por: Hassan, Mohamed
Publicado: (2024)
por: Hassan, Mohamed
Publicado: (2024)
Dataflow-Guided Retrieval Augmentation for Repository-Level Code Completion
por: Cheng, Wei, et al.
Publicado: (2024)
por: Cheng, Wei, et al.
Publicado: (2024)
Seed-Guided Fine-Grained Entity Typing in Science and Engineering Domains
por: Zhang, Yu, et al.
Publicado: (2024)
por: Zhang, Yu, et al.
Publicado: (2024)
REPOT: Recoverable Program-of-Thought via Checkpoint Repair
por: Mazaheri, Parsa
Publicado: (2026)
por: Mazaheri, Parsa
Publicado: (2026)
Using Large Language Models for Student-Code Guided Test Case Generation in Computer Science Education
por: Kumar, Nischal Ashok, et al.
Publicado: (2024)
por: Kumar, Nischal Ashok, et al.
Publicado: (2024)
Tool-Aware Planning in Contact Center AI: Evaluating LLMs through Lineage-Guided Query Decomposition
por: Nathan, Varun, et al.
Publicado: (2026)
por: Nathan, Varun, et al.
Publicado: (2026)
Divide-and-Conquer Meets Consensus: Unleashing the Power of Functions in Code Generation
por: Chen, Jingchang, et al.
Publicado: (2024)
por: Chen, Jingchang, et al.
Publicado: (2024)
Benchmarking LLMs for Unit Test Generation from Real-World Functions
por: Huang, Dong, et al.
Publicado: (2025)
por: Huang, Dong, et al.
Publicado: (2025)
Remember Your Trace: Memory-Guided Long-Horizon Agentic Framework for Consistent and Hierarchical Repository-Level Code Documentation
por: Bae, Suyoung, et al.
Publicado: (2026)
por: Bae, Suyoung, et al.
Publicado: (2026)
FireBench: Evaluating Instruction Following in Enterprise and API-Driven LLM Applications
por: Zhang, Yunfan, et al.
Publicado: (2026)
por: Zhang, Yunfan, et al.
Publicado: (2026)
How Effectively Do LLMs Extract Feature-Sentiment Pairs from App Reviews?
por: Shah, Faiz Ali, et al.
Publicado: (2024)
por: Shah, Faiz Ali, et al.
Publicado: (2024)
A Case Study of Web App Coding with OpenAI Reasoning Models
por: Cui, Yi
Publicado: (2024)
por: Cui, Yi
Publicado: (2024)
Functional Consistency of LLM Code Embeddings: A Self-Evolving Data Synthesis Framework for Benchmarking
por: Li, Zhuohao, et al.
Publicado: (2025)
por: Li, Zhuohao, et al.
Publicado: (2025)
MacroBench: A Novel Testbed for Web Automation Scripts via Large Language Models
por: Kim, Hyunjun, et al.
Publicado: (2025)
por: Kim, Hyunjun, et al.
Publicado: (2025)
From Prediction to Application: Language Model-based Code Knowledge Tracing with Domain Adaptive Pre-Training and Automatic Feedback System with Pedagogical Prompting for Comprehensive Programming Education
por: Lee, Unggi, et al.
Publicado: (2024)
por: Lee, Unggi, et al.
Publicado: (2024)
Vibe Code Bench: Evaluating AI Models on End-to-End Web Application Development
por: Tran, Hung, et al.
Publicado: (2026)
por: Tran, Hung, et al.
Publicado: (2026)
EHR-Based Mobile and Web Platform for Chronic Disease Risk Prediction Using Large Language Multimodal Models
por: Liao, Chun-Chieh, et al.
Publicado: (2024)
por: Liao, Chun-Chieh, et al.
Publicado: (2024)
WebTestBench: Evaluating Computer-Use Agents towards End-to-End Automated Web Testing
por: Kong, Fanheng, et al.
Publicado: (2026)
por: Kong, Fanheng, et al.
Publicado: (2026)
Nissist: An Incident Mitigation Copilot based on Troubleshooting Guides
por: An, Kaikai, et al.
Publicado: (2024)
por: An, Kaikai, et al.
Publicado: (2024)
LocAgent: Graph-Guided LLM Agents for Code Localization
por: Chen, Zhaoling, et al.
Publicado: (2025)
por: Chen, Zhaoling, et al.
Publicado: (2025)
New Solutions on LLM Acceleration, Optimization, and Application
por: Huang, Yingbing, et al.
Publicado: (2024)
por: Huang, Yingbing, et al.
Publicado: (2024)
Automated Web Application Testing: End-to-End Test Case Generation with Large Language Models and Screen Transition Graphs
por: Le, Nguyen-Khang, et al.
Publicado: (2025)
por: Le, Nguyen-Khang, et al.
Publicado: (2025)
AURORA: Navigating UI Tarpits via Automated Neural Screen Understanding
por: Khan, Safwat Ali, et al.
Publicado: (2024)
por: Khan, Safwat Ali, et al.
Publicado: (2024)
Ejemplares similares
-
Semantic Constraint Inference for Web Form Test Generation
por: Alian, Parsa, et al.
Publicado: (2024) -
Feature-Driven End-To-End Test Generation
por: Alian, Parsa, et al.
Publicado: (2024) -
Dockerfile Flakiness: Characterization and Repair
por: Shabani, Taha, et al.
Publicado: (2024) -
Contextual API Completion for Unseen Repositories Using LLMs
por: Nashid, Noor, et al.
Publicado: (2024) -
VISCA: Inferring Component Abstractions for Automated End-to-End Testing
por: Alian, Parsa, et al.
Publicado: (2025)