Saved in:
| Main Authors: | Li, Minxiao, Yan, Shuying, Zhang, Li, Liu, Yang, Liu, Fang |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.06683 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
CodeIF-Bench: Evaluating Instruction-Following Capabilities of Large Language Models in Interactive Code Generation
by: Wang, Peiding, et al.
Published: (2025)
by: Wang, Peiding, et al.
Published: (2025)
A System Model Generation Benchmark from Natural Language Requirements
by: Jin, Dongming, et al.
Published: (2025)
by: Jin, Dongming, et al.
Published: (2025)
Incorporating Verification Standards for Security Requirements Generation from Functional Specifications
by: Lian, Xiaoli, et al.
Published: (2025)
by: Lian, Xiaoli, et al.
Published: (2025)
A Scalable Benchmark for Repository-Oriented Long-Horizon Conversational Context Management
by: Liu, Yang, et al.
Published: (2026)
by: Liu, Yang, et al.
Published: (2026)
EfficientEdit: Accelerating Code Editing via Edit-Oriented Speculative Decoding
by: Wang, Peiding, et al.
Published: (2025)
by: Wang, Peiding, et al.
Published: (2025)
MRG-Bench: Evaluating and Exploring the Requirements of Context for Repository-Level Code Generation
by: Li, Haiyang
Published: (2025)
by: Li, Haiyang
Published: (2025)
ReqElicitGym: An Evaluation Environment for Interview Competence in Conversational Requirements Elicitation
by: Jin, Dongming, et al.
Published: (2026)
by: Jin, Dongming, et al.
Published: (2026)
Bridging Requirements and Architecture: Multi-Agent Orchestration with External Knowledge and Hierarchical Memory
by: Li, Ruiyin, et al.
Published: (2026)
by: Li, Ruiyin, et al.
Published: (2026)
Benchmarking and Evaluating VLMs for Software Architecture Diagram Understanding
by: Ouyang, Shuyin, et al.
Published: (2026)
by: Ouyang, Shuyin, et al.
Published: (2026)
Requirements Development and Formalization for Reliable Code Generation: A Multi-Agent Vision
by: Lu, Xu, et al.
Published: (2025)
by: Lu, Xu, et al.
Published: (2025)
UserTrace: User-Level Requirements Generation and Traceability Recovery from Software Project Repositories
by: Jin, Dongming, et al.
Published: (2025)
by: Jin, Dongming, et al.
Published: (2025)
Are They All Good? Evaluating the Quality of CoTs in LLM-based Code Generation
by: Zhang, Binquan, et al.
Published: (2025)
by: Zhang, Binquan, et al.
Published: (2025)
RealBench: A Repo-Level Code Generation Benchmark Aligned with Real-World Software Development Practices
by: Li, Jia, et al.
Published: (2026)
by: Li, Jia, et al.
Published: (2026)
An Evaluation of Requirements Modeling for Cyber-Physical Systems via LLMs
by: Jin, Dongming, et al.
Published: (2024)
by: Jin, Dongming, et al.
Published: (2024)
RepoScope: Leveraging Call Chain-Aware Multi-View Context for Repository-Level Code Generation
by: Liu, Yang, et al.
Published: (2025)
by: Liu, Yang, et al.
Published: (2025)
REprompt: Prompt Generation for Intelligent Software Development Guided by Requirements Engineering
by: Shi, Junjie, et al.
Published: (2026)
by: Shi, Junjie, et al.
Published: (2026)
Towards Requirements Engineering for GenAI-Enabled Software: Bridging Responsibility Gaps through Human Oversight Requirements
by: Mao, Zhenyu, et al.
Published: (2025)
by: Mao, Zhenyu, et al.
Published: (2025)
Requirements for Active Assistance of Natural Questions in Software Architecture
by: Lemos, Diogo, et al.
Published: (2025)
by: Lemos, Diogo, et al.
Published: (2025)
From Chat to Interview: Agentic Requirements Elicitation with an Experience Ontology
by: Jin, Dongming, et al.
Published: (2026)
by: Jin, Dongming, et al.
Published: (2026)
SolContractEval: A Benchmark for Evaluating Contract-Level Solidity Code Generation
by: Ye, Zhifan, et al.
Published: (2025)
by: Ye, Zhifan, et al.
Published: (2025)
Bridging the Gap between User Intent and LLM: A Requirement Alignment Approach for Code Generation
by: Li, Jia, et al.
Published: (2026)
by: Li, Jia, et al.
Published: (2026)
Towards Realistic Project-Level Code Generation via Multi-Agent Collaboration and Semantic Architecture Modeling
by: Zhao, Qianhui, et al.
Published: (2025)
by: Zhao, Qianhui, et al.
Published: (2025)
Usability as a Weapon: Attacking the Safety of LLM-Based Code Generation via Usability Requirements
by: Li, Yue, et al.
Published: (2026)
by: Li, Yue, et al.
Published: (2026)
Knowledge-Based Multi-Agent Framework for Automated Software Architecture Design
by: Zhang, Yiran, et al.
Published: (2025)
by: Zhang, Yiran, et al.
Published: (2025)
Requirements Volatility in Software Architecture Design: An Exploratory Case Study
by: Aaramaa, Sanja, et al.
Published: (2026)
by: Aaramaa, Sanja, et al.
Published: (2026)
WebCoderBench: Benchmarking Web Application Generation with Comprehensive and Interpretable Evaluation Metrics
by: Liu, Chenxu, et al.
Published: (2026)
by: Liu, Chenxu, et al.
Published: (2026)
Enhancing User-Feedback Driven Requirements Prioritization
by: Chattopadhyay, Aurek, et al.
Published: (2026)
by: Chattopadhyay, Aurek, et al.
Published: (2026)
RobuNFR: Evaluating the Robustness of Large Language Models on Non-Functional Requirements Aware Code Generation
by: Lin, Feng, et al.
Published: (2025)
by: Lin, Feng, et al.
Published: (2025)
AdaptiveLLM: A Framework for Selecting Optimal Cost-Efficient LLM for Code-Generation Based on CoT Length
by: Cheng, Junhang, et al.
Published: (2025)
by: Cheng, Junhang, et al.
Published: (2025)
ProjectEval: A Benchmark for Programming Agents Automated Evaluation on Project-Level Code Generation
by: Liu, Kaiyuan, et al.
Published: (2025)
by: Liu, Kaiyuan, et al.
Published: (2025)
LONGCODEU: Benchmarking Long-Context Language Models on Long Code Understanding
by: Li, Jia, et al.
Published: (2025)
by: Li, Jia, et al.
Published: (2025)
SR-Eval: Evaluating LLMs on Code Generation under Stepwise Requirement Refinement
by: Zhan, Zexun, et al.
Published: (2025)
by: Zhan, Zexun, et al.
Published: (2025)
A New Benchmark for the Appropriate Evaluation of RTL Code Optimization
by: Lu, Yao, et al.
Published: (2026)
by: Lu, Yao, et al.
Published: (2026)
ArchBench: Benchmarking Generative-AI for Software Architecture Tasks
by: Adnan, Bassam, et al.
Published: (2026)
by: Adnan, Bassam, et al.
Published: (2026)
A Study to Evaluate the Impact of LoRA Fine-tuning on the Performance of Non-functional Requirements Classification
by: Li, Xia, et al.
Published: (2025)
by: Li, Xia, et al.
Published: (2025)
Generating Project-Specific Test Cases with Requirement Validation Intention
by: Qi, Binhang, et al.
Published: (2025)
by: Qi, Binhang, et al.
Published: (2025)
Requirements-Based Test Generation: A Comprehensive Survey
by: Yang, Zhenzhen, et al.
Published: (2025)
by: Yang, Zhenzhen, et al.
Published: (2025)
QUPER-MAn: Benchmark-Guided Target Setting for Maintainability Requirements
by: Borg, Markus, et al.
Published: (2025)
by: Borg, Markus, et al.
Published: (2025)
FrontendBench: A Benchmark for Evaluating LLMs on Front-End Development via Automatic Evaluation
by: Zhu, Hongda, et al.
Published: (2025)
by: Zhu, Hongda, et al.
Published: (2025)
From Requirements to Architecture: Semi-Automatically Generating Software Architectures
by: Eisenreich, Tobias
Published: (2025)
by: Eisenreich, Tobias
Published: (2025)
Similar Items
-
CodeIF-Bench: Evaluating Instruction-Following Capabilities of Large Language Models in Interactive Code Generation
by: Wang, Peiding, et al.
Published: (2025) -
A System Model Generation Benchmark from Natural Language Requirements
by: Jin, Dongming, et al.
Published: (2025) -
Incorporating Verification Standards for Security Requirements Generation from Functional Specifications
by: Lian, Xiaoli, et al.
Published: (2025) -
A Scalable Benchmark for Repository-Oriented Long-Horizon Conversational Context Management
by: Liu, Yang, et al.
Published: (2026) -
EfficientEdit: Accelerating Code Editing via Edit-Oriented Speculative Decoding
by: Wang, Peiding, et al.
Published: (2025)