:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Khamsepour, Parham, Cole, Mark, Ashraf, Ish, Puri, Sandeep, Sabetzadeh, Mehrdad, Nejati, Shiva
Format:	Preprint
Published:	2026
Subjects:	Software Engineering
Online Access:	https://arxiv.org/abs/2601.02345
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

The Impact of Critique on LLM-Based Model Generation from Natural Language: The Case of Activity Diagrams
by: Khamsepour, Parham, et al.
Published: (2025)

Developing a Llama-Based Chatbot for CI/CD Question Answering: A Case Study at Ericsson
by: Chaudhary, Daksh, et al.
Published: (2024)

DSL or Code? Evaluating the Quality of LLM-Generated Algebraic Specifications: A Case Study in Optimization at Kinaxis
by: Ayoughi, Negin, et al.
Published: (2026)

Genetic Programming for Self-Adaptive Auto-Scaling of Microservices
by: Li, Jia, et al.
Published: (2026)

Requirements-driven Slicing of Simulink Models Using LLMs
by: Luitel, Dipeeka, et al.
Published: (2024)

Enhancing Automata Learning with Statistical Machine Learning: A Network Security Case Study
by: Ayoughi, Negin, et al.
Published: (2024)

Effort-Optimized, Accuracy-Driven Labelling and Validation of Test Inputs for DL Systems: A Mixed-Integer Linear Programming Approach
by: Amini, Mohammad Hossein, et al.
Published: (2025)

Automated Test Validators for Flaky Cyber-Physical System Simulators: Approach and Evaluation
by: Jodat, Baharin A., et al.
Published: (2025)

Simulink Mutation Testing using CodeBERT
by: Zhang, Jingfan, et al.
Published: (2025)

A Lean Simulation Framework for Stress Testing IoT Cloud Systems
by: Li, Jia, et al.
Published: (2024)

Test Input Validation for Vision-based DL Systems: An Active Learning Approach
by: Ghobari, Delaram, et al.
Published: (2025)

Practical Guidelines for the Selection and Evaluation of Natural Language Processing Techniques in Requirements Engineering
by: Sabetzadeh, Mehrdad, et al.
Published: (2024)

An Empirical Study on LLM-based Classification of Requirements-related Provisions in Food-safety Regulations
by: Hassani, Shabnam, et al.
Published: (2025)

From Law to Gherkin: A Human-Centred Quasi-Experiment on the Quality of LLM-Generated Behavioural Specifications from Food-Safety Regulations
by: Hassani, Shabnam, et al.
Published: (2025)

Improving Requirements Completeness: Automated Assistance through Large Language Models
by: Luitel, Dipeeka, et al.
Published: (2023)

Bridging the Gap between Real-world and Synthetic Images for Testing Autonomous Driving Systems
by: Amini, Mohammad Hossein, et al.
Published: (2024)

Rethinking Legal Compliance Automation: Opportunities with Large Language Models
by: Hassani, Shabnam, et al.
Published: (2024)

Can Search-Based Testing with Pareto Optimization Effectively Cover Failure-Revealing Test Inputs?
by: Sorokin, Lev, et al.
Published: (2024)

Generating Minimalist Adversarial Perturbations to Test Object-Detection Models: An Adaptive Multi-Metric Evolutionary Search Approach
by: McIntyre-Garcia, Cristopher, et al.
Published: (2024)

Testing Question Answering Software with Context-Driven Question Generation
by: Liu, Shuang, et al.
Published: (2025)

A Comparison of Conversational Models and Humans in Answering Technical Questions: the Firefox Case
by: Correia, Joao, et al.
Published: (2025)

Exploring React Library Related Questions on Stack Overflow: Answered vs. Unanswered
by: Ardity, Vanesya Aura, et al.
Published: (2025)

Towards LLM-generated explanations for Component-based Knowledge Graph Question Answering Systems
by: Schiese, Dennis, et al.
Published: (2025)

Agentic DAG-Orchestrated Planner Framework for Multi-Modal, Multi-Hop Question Answering in Hybrid Data Lakes
by: B, Kirushikesh D, et al.
Published: (2026)

CoReQA: Uncovering Potentials of Language Models in Code Repository Question Answering
by: Chen, Jialiang, et al.
Published: (2025)

Does Documentation Matter? An Empirical Study of Practitioners' Perspective on Open-Source Software Adoption
by: Imani, Aaron, et al.
Published: (2024)

LLM Based Web Accessibility Repair: An Empirical Study of Detection, Remediation, and Cost
by: Oyelayo, Oluwatoyosi, et al.
Published: (2026)

Beyond Code Snippets: Benchmarking LLMs on Repository-Level Question Answering
by: Alebachew, Yoseph Berhanu, et al.
Published: (2026)

CodeRepoQA: A Large-scale Benchmark for Software Engineering Question Answering
by: Hu, Ruida, et al.
Published: (2024)

A Serverless Architecture for Real-Time Stock Analysis using Large Language Models: An Iterative Development and Debugging Case Study
by: Ashraf, Taniv
Published: (2025)

InfiBench: Evaluating the Question-Answering Capabilities of Code Large Language Models
by: Li, Linyi, et al.
Published: (2024)

DialogAgent: An Auto-engagement Agent for Code Question Answering Data Production
by: Liang, Xiaoyun, et al.
Published: (2024)

LogRouter: Adaptive Two-Level LLM Routing for Log Question Answering in Big Data Systems
by: Coskuner, Mert, et al.
Published: (2026)

The Cost of Downgrading Build Systems: A Case Study of Kubernetes
by: Ranjan, Gareema, et al.
Published: (2025)

Supporting System Testing with a Multi-Agent LLM-based Framework for Knowledge Graph Extraction: A Case Study with Ethernet Switch Systems
by: Pan, Rongqi, et al.
Published: (2026)

Evaluating Repository-level Software Documentation via Question Answering and Feature-Driven Development
by: Wang, Xinchen, et al.
Published: (2026)

Robustness and Reasoning Fidelity of Large Language Models in Long-Context Code Question Answering
by: Maharaj, Kishan, et al.
Published: (2026)

An exploratory analysis of Community-based Question-Answering Platforms and GPT-3-driven Generative AI: Is it the end of online community-based learning?
by: Hasan, Mohammed Mehedi, et al.
Published: (2024)

Faster Releases, Fewer Risks: A Study on Maven Artifact Vulnerabilities and Lifecycle Management
by: Shafin, Md Shafiullah, et al.
Published: (2025)

ReleaseEval: A Benchmark for Evaluating Language Models in Automated Release Note Generation
by: Meng, Qianru, et al.
Published: (2025)