Saved in:
| Main Authors: | Khamsepour, Parham, Cole, Mark, Ashraf, Ish, Puri, Sandeep, Sabetzadeh, Mehrdad, Nejati, Shiva |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.02345 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
The Impact of Critique on LLM-Based Model Generation from Natural Language: The Case of Activity Diagrams
by: Khamsepour, Parham, et al.
Published: (2025)
by: Khamsepour, Parham, et al.
Published: (2025)
Developing a Llama-Based Chatbot for CI/CD Question Answering: A Case Study at Ericsson
by: Chaudhary, Daksh, et al.
Published: (2024)
by: Chaudhary, Daksh, et al.
Published: (2024)
DSL or Code? Evaluating the Quality of LLM-Generated Algebraic Specifications: A Case Study in Optimization at Kinaxis
by: Ayoughi, Negin, et al.
Published: (2026)
by: Ayoughi, Negin, et al.
Published: (2026)
Genetic Programming for Self-Adaptive Auto-Scaling of Microservices
by: Li, Jia, et al.
Published: (2026)
by: Li, Jia, et al.
Published: (2026)
Requirements-driven Slicing of Simulink Models Using LLMs
by: Luitel, Dipeeka, et al.
Published: (2024)
by: Luitel, Dipeeka, et al.
Published: (2024)
Enhancing Automata Learning with Statistical Machine Learning: A Network Security Case Study
by: Ayoughi, Negin, et al.
Published: (2024)
by: Ayoughi, Negin, et al.
Published: (2024)
Effort-Optimized, Accuracy-Driven Labelling and Validation of Test Inputs for DL Systems: A Mixed-Integer Linear Programming Approach
by: Amini, Mohammad Hossein, et al.
Published: (2025)
by: Amini, Mohammad Hossein, et al.
Published: (2025)
Automated Test Validators for Flaky Cyber-Physical System Simulators: Approach and Evaluation
by: Jodat, Baharin A., et al.
Published: (2025)
by: Jodat, Baharin A., et al.
Published: (2025)
Simulink Mutation Testing using CodeBERT
by: Zhang, Jingfan, et al.
Published: (2025)
by: Zhang, Jingfan, et al.
Published: (2025)
A Lean Simulation Framework for Stress Testing IoT Cloud Systems
by: Li, Jia, et al.
Published: (2024)
by: Li, Jia, et al.
Published: (2024)
Test Input Validation for Vision-based DL Systems: An Active Learning Approach
by: Ghobari, Delaram, et al.
Published: (2025)
by: Ghobari, Delaram, et al.
Published: (2025)
Practical Guidelines for the Selection and Evaluation of Natural Language Processing Techniques in Requirements Engineering
by: Sabetzadeh, Mehrdad, et al.
Published: (2024)
by: Sabetzadeh, Mehrdad, et al.
Published: (2024)
An Empirical Study on LLM-based Classification of Requirements-related Provisions in Food-safety Regulations
by: Hassani, Shabnam, et al.
Published: (2025)
by: Hassani, Shabnam, et al.
Published: (2025)
From Law to Gherkin: A Human-Centred Quasi-Experiment on the Quality of LLM-Generated Behavioural Specifications from Food-Safety Regulations
by: Hassani, Shabnam, et al.
Published: (2025)
by: Hassani, Shabnam, et al.
Published: (2025)
Improving Requirements Completeness: Automated Assistance through Large Language Models
by: Luitel, Dipeeka, et al.
Published: (2023)
by: Luitel, Dipeeka, et al.
Published: (2023)
Bridging the Gap between Real-world and Synthetic Images for Testing Autonomous Driving Systems
by: Amini, Mohammad Hossein, et al.
Published: (2024)
by: Amini, Mohammad Hossein, et al.
Published: (2024)
Rethinking Legal Compliance Automation: Opportunities with Large Language Models
by: Hassani, Shabnam, et al.
Published: (2024)
by: Hassani, Shabnam, et al.
Published: (2024)
Can Search-Based Testing with Pareto Optimization Effectively Cover Failure-Revealing Test Inputs?
by: Sorokin, Lev, et al.
Published: (2024)
by: Sorokin, Lev, et al.
Published: (2024)
Generating Minimalist Adversarial Perturbations to Test Object-Detection Models: An Adaptive Multi-Metric Evolutionary Search Approach
by: McIntyre-Garcia, Cristopher, et al.
Published: (2024)
by: McIntyre-Garcia, Cristopher, et al.
Published: (2024)
Testing Question Answering Software with Context-Driven Question Generation
by: Liu, Shuang, et al.
Published: (2025)
by: Liu, Shuang, et al.
Published: (2025)
A Comparison of Conversational Models and Humans in Answering Technical Questions: the Firefox Case
by: Correia, Joao, et al.
Published: (2025)
by: Correia, Joao, et al.
Published: (2025)
Exploring React Library Related Questions on Stack Overflow: Answered vs. Unanswered
by: Ardity, Vanesya Aura, et al.
Published: (2025)
by: Ardity, Vanesya Aura, et al.
Published: (2025)
Towards LLM-generated explanations for Component-based Knowledge Graph Question Answering Systems
by: Schiese, Dennis, et al.
Published: (2025)
by: Schiese, Dennis, et al.
Published: (2025)
Agentic DAG-Orchestrated Planner Framework for Multi-Modal, Multi-Hop Question Answering in Hybrid Data Lakes
by: B, Kirushikesh D, et al.
Published: (2026)
by: B, Kirushikesh D, et al.
Published: (2026)
CoReQA: Uncovering Potentials of Language Models in Code Repository Question Answering
by: Chen, Jialiang, et al.
Published: (2025)
by: Chen, Jialiang, et al.
Published: (2025)
Does Documentation Matter? An Empirical Study of Practitioners' Perspective on Open-Source Software Adoption
by: Imani, Aaron, et al.
Published: (2024)
by: Imani, Aaron, et al.
Published: (2024)
LLM Based Web Accessibility Repair: An Empirical Study of Detection, Remediation, and Cost
by: Oyelayo, Oluwatoyosi, et al.
Published: (2026)
by: Oyelayo, Oluwatoyosi, et al.
Published: (2026)
Beyond Code Snippets: Benchmarking LLMs on Repository-Level Question Answering
by: Alebachew, Yoseph Berhanu, et al.
Published: (2026)
by: Alebachew, Yoseph Berhanu, et al.
Published: (2026)
CodeRepoQA: A Large-scale Benchmark for Software Engineering Question Answering
by: Hu, Ruida, et al.
Published: (2024)
by: Hu, Ruida, et al.
Published: (2024)
A Serverless Architecture for Real-Time Stock Analysis using Large Language Models: An Iterative Development and Debugging Case Study
by: Ashraf, Taniv
Published: (2025)
by: Ashraf, Taniv
Published: (2025)
InfiBench: Evaluating the Question-Answering Capabilities of Code Large Language Models
by: Li, Linyi, et al.
Published: (2024)
by: Li, Linyi, et al.
Published: (2024)
DialogAgent: An Auto-engagement Agent for Code Question Answering Data Production
by: Liang, Xiaoyun, et al.
Published: (2024)
by: Liang, Xiaoyun, et al.
Published: (2024)
LogRouter: Adaptive Two-Level LLM Routing for Log Question Answering in Big Data Systems
by: Coskuner, Mert, et al.
Published: (2026)
by: Coskuner, Mert, et al.
Published: (2026)
The Cost of Downgrading Build Systems: A Case Study of Kubernetes
by: Ranjan, Gareema, et al.
Published: (2025)
by: Ranjan, Gareema, et al.
Published: (2025)
Supporting System Testing with a Multi-Agent LLM-based Framework for Knowledge Graph Extraction: A Case Study with Ethernet Switch Systems
by: Pan, Rongqi, et al.
Published: (2026)
by: Pan, Rongqi, et al.
Published: (2026)
Evaluating Repository-level Software Documentation via Question Answering and Feature-Driven Development
by: Wang, Xinchen, et al.
Published: (2026)
by: Wang, Xinchen, et al.
Published: (2026)
Robustness and Reasoning Fidelity of Large Language Models in Long-Context Code Question Answering
by: Maharaj, Kishan, et al.
Published: (2026)
by: Maharaj, Kishan, et al.
Published: (2026)
An exploratory analysis of Community-based Question-Answering Platforms and GPT-3-driven Generative AI: Is it the end of online community-based learning?
by: Hasan, Mohammed Mehedi, et al.
Published: (2024)
by: Hasan, Mohammed Mehedi, et al.
Published: (2024)
Faster Releases, Fewer Risks: A Study on Maven Artifact Vulnerabilities and Lifecycle Management
by: Shafin, Md Shafiullah, et al.
Published: (2025)
by: Shafin, Md Shafiullah, et al.
Published: (2025)
ReleaseEval: A Benchmark for Evaluating Language Models in Automated Release Note Generation
by: Meng, Qianru, et al.
Published: (2025)
by: Meng, Qianru, et al.
Published: (2025)
Similar Items
-
The Impact of Critique on LLM-Based Model Generation from Natural Language: The Case of Activity Diagrams
by: Khamsepour, Parham, et al.
Published: (2025) -
Developing a Llama-Based Chatbot for CI/CD Question Answering: A Case Study at Ericsson
by: Chaudhary, Daksh, et al.
Published: (2024) -
DSL or Code? Evaluating the Quality of LLM-Generated Algebraic Specifications: A Case Study in Optimization at Kinaxis
by: Ayoughi, Negin, et al.
Published: (2026) -
Genetic Programming for Self-Adaptive Auto-Scaling of Microservices
by: Li, Jia, et al.
Published: (2026) -
Requirements-driven Slicing of Simulink Models Using LLMs
by: Luitel, Dipeeka, et al.
Published: (2024)