Guardado en:
| Autores principales: | Guo, Lianghong, Wang, Yanlin, Li, Caihua, Tao, Wei, Yang, Pengyu, Chen, Jiachi, Song, Haoyu, Tang, Duyu, Zheng, Zibin |
|---|---|
| Formato: | Preprint |
| Publicado: |
2025
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2506.10954 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
Advances and Frontiers of LLM-based Issue Resolution in Software Engineering: A Comprehensive Survey
por: Li, Caihua, et al.
Publicado: (2026)
por: Li, Caihua, et al.
Publicado: (2026)
OmniGIRL: A Multilingual and Multimodal Benchmark for GitHub Issue Resolution
por: Guo, Lianghong, et al.
Publicado: (2025)
por: Guo, Lianghong, et al.
Publicado: (2025)
Towards an Understanding of Large Language Models in Software Engineering Tasks
por: Zheng, Zibin, et al.
Publicado: (2023)
por: Zheng, Zibin, et al.
Publicado: (2023)
SimpleDevQA: Benchmarking Large Language Models on Development Knowledge QA
por: Zhang, Jing, et al.
Publicado: (2025)
por: Zhang, Jing, et al.
Publicado: (2025)
When to Stop? Towards Efficient Code Generation in LLMs with Excess Token Prevention
por: Guo, Lianghong, et al.
Publicado: (2024)
por: Guo, Lianghong, et al.
Publicado: (2024)
You Augment Me: Exploring ChatGPT-based Data Augmentation for Semantic Code Search
por: Wang, Yanlin, et al.
Publicado: (2024)
por: Wang, Yanlin, et al.
Publicado: (2024)
A Survey of Large Language Models for Code: Evolution, Benchmarking, and Future Trends
por: Zheng, Zibin, et al.
Publicado: (2023)
por: Zheng, Zibin, et al.
Publicado: (2023)
Identifying Smart Contract Security Issues in Code Snippets from Stack Overflow
por: Chen, Jiachi, et al.
Publicado: (2024)
por: Chen, Jiachi, et al.
Publicado: (2024)
RealSec-bench: A Benchmark for Evaluating Secure Code Generation in Real-World Repositories
por: Wang, Yanlin, et al.
Publicado: (2026)
por: Wang, Yanlin, et al.
Publicado: (2026)
Efficiently Detecting Reentrancy Vulnerabilities in Complex Smart Contracts
por: Wang, Zexu, et al.
Publicado: (2024)
por: Wang, Zexu, et al.
Publicado: (2024)
RLCoder: Reinforcement Learning for Repository-Level Code Completion
por: Wang, Yanlin, et al.
Publicado: (2024)
por: Wang, Yanlin, et al.
Publicado: (2024)
LLM Hallucinations in Practical Code Generation: Phenomena, Mechanism, and Mitigation
por: Zhang, Ziyao, et al.
Publicado: (2024)
por: Zhang, Ziyao, et al.
Publicado: (2024)
An Empirical Study of Agent Developer Practices in AI Agent Frameworks
por: Wang, Yanlin, et al.
Publicado: (2025)
por: Wang, Yanlin, et al.
Publicado: (2025)
ToolFactory: Automating Tool Generation by Leveraging LLM to Understand REST API Documentations
por: Ni, Xinyi, et al.
Publicado: (2025)
por: Ni, Xinyi, et al.
Publicado: (2025)
An Empirical Study on Low-Code Programming using Traditional vs Large Language Model Support
por: Liu, Yongkun, et al.
Publicado: (2024)
por: Liu, Yongkun, et al.
Publicado: (2024)
NumScout: Unveiling Numerical Defects in Smart Contracts using LLM-Pruning Symbolic Execution
por: Chen, Jiachi, et al.
Publicado: (2025)
por: Chen, Jiachi, et al.
Publicado: (2025)
Hyperion: Unveiling DApp Inconsistencies using LLM and Dataflow-Guided Symbolic Execution
por: Yang, Shuo, et al.
Publicado: (2024)
por: Yang, Shuo, et al.
Publicado: (2024)
AlignCoder: Aligning Retrieval with Target Intent for Repository-Level Code Completion
por: Jiang, Tianyue, et al.
Publicado: (2026)
por: Jiang, Tianyue, et al.
Publicado: (2026)
HumanEvo: An Evolution-aware Benchmark for More Realistic Evaluation of Repository-level Code Generation
por: Zheng, Dewu, et al.
Publicado: (2024)
por: Zheng, Dewu, et al.
Publicado: (2024)
Safety Factories - a Manifesto
por: Cârlan, Carmen, et al.
Publicado: (2025)
por: Cârlan, Carmen, et al.
Publicado: (2025)
SWE-Cycle: Benchmarking Code Agents across the Complete Issue Resolution Cycle
por: Guan, Hao, et al.
Publicado: (2026)
por: Guan, Hao, et al.
Publicado: (2026)
SWE Atlas: Benchmarking Coding Agents Beyond Issue Resolution
por: Raghavendra, Mohit, et al.
Publicado: (2026)
por: Raghavendra, Mohit, et al.
Publicado: (2026)
EffiReasonTrans: RL-Optimized Reasoning for Code Translation
por: Wang, Yanlin, et al.
Publicado: (2025)
por: Wang, Yanlin, et al.
Publicado: (2025)
Definition and Detection of Centralization Defects in Smart Contracts
por: Lin, Zewei, et al.
Publicado: (2024)
por: Lin, Zewei, et al.
Publicado: (2024)
Exploration of Approaches for Robustness and Safety in a Low Code Open Environment for Factory Automation
por: A., Gustavo Quiros, et al.
Publicado: (2025)
por: A., Gustavo Quiros, et al.
Publicado: (2025)
Beyond Functional Correctness: Investigating Coding Style Inconsistencies in Large Language Models
por: Wang, Yanlin, et al.
Publicado: (2024)
por: Wang, Yanlin, et al.
Publicado: (2024)
SparseCoder: Identifier-Aware Sparse Transformer for File-Level Code Summarization
por: Wang, Yanlin, et al.
Publicado: (2024)
por: Wang, Yanlin, et al.
Publicado: (2024)
RustRepoTrans: Repository-level Code Translation Benchmark Targeting Rust
por: Ou, Guangsheng, et al.
Publicado: (2024)
por: Ou, Guangsheng, et al.
Publicado: (2024)
Copy-and-Paste? Identifying EVM-Inequivalent Code Smells in Multi-chain Reuse Contracts
por: Wang, Zexu, et al.
Publicado: (2025)
por: Wang, Zexu, et al.
Publicado: (2025)
A Hierarchical and Evolvable Benchmark for Fine-Grained Code Instruction Following with Multi-Turn Feedback
por: Duan, Guoliang, et al.
Publicado: (2025)
por: Duan, Guoliang, et al.
Publicado: (2025)
CoSQA+: Pioneering the Multi-Choice Code Search Benchmark with Test-Driven Agents
por: Gong, Jing, et al.
Publicado: (2024)
por: Gong, Jing, et al.
Publicado: (2024)
RepoTransBench: A Real-World Multilingual Benchmark for Repository-Level Code Translation
por: Wang, Yanli, et al.
Publicado: (2024)
por: Wang, Yanli, et al.
Publicado: (2024)
SmartOracle: Generating Smart Contract Oracle via Fine-Grained Invariant Detection
por: Su, Jianzhong, et al.
Publicado: (2024)
por: Su, Jianzhong, et al.
Publicado: (2024)
DRAINCODE: Stealthy Energy Consumption Attacks on Retrieval-Augmented Code Generation via Context Poisoning
por: Wang, Yanlin, et al.
Publicado: (2026)
por: Wang, Yanlin, et al.
Publicado: (2026)
Trace: Securing Smart Contract Repository Against Access Control Vulnerability
por: Chen, Chong, et al.
Publicado: (2025)
por: Chen, Chong, et al.
Publicado: (2025)
When ChatGPT Meets Smart Contract Vulnerability Detection: How Far Are We?
por: Chen, Chong, et al.
Publicado: (2023)
por: Chen, Chong, et al.
Publicado: (2023)
FeedbackEval: A Benchmark for Evaluating Large Language Models in Feedback-Driven Code Repair Tasks
por: Dai, Dekun, et al.
Publicado: (2025)
por: Dai, Dekun, et al.
Publicado: (2025)
What's in a Benchmark? The Case of SWE-Bench in Automated Program Repair
por: Martinez, Matias, et al.
Publicado: (2026)
por: Martinez, Matias, et al.
Publicado: (2026)
DAppSCAN: Building Large-Scale Datasets for Smart Contract Weaknesses in DApp Projects
por: Zheng, Zibin, et al.
Publicado: (2023)
por: Zheng, Zibin, et al.
Publicado: (2023)
CRPWarner: Warning the Risk of Contract-related Rug Pull in DeFi Smart Contracts
por: Lin, Zewei, et al.
Publicado: (2024)
por: Lin, Zewei, et al.
Publicado: (2024)
Ejemplares similares
-
Advances and Frontiers of LLM-based Issue Resolution in Software Engineering: A Comprehensive Survey
por: Li, Caihua, et al.
Publicado: (2026) -
OmniGIRL: A Multilingual and Multimodal Benchmark for GitHub Issue Resolution
por: Guo, Lianghong, et al.
Publicado: (2025) -
Towards an Understanding of Large Language Models in Software Engineering Tasks
por: Zheng, Zibin, et al.
Publicado: (2023) -
SimpleDevQA: Benchmarking Large Language Models on Development Knowledge QA
por: Zhang, Jing, et al.
Publicado: (2025) -
When to Stop? Towards Efficient Code Generation in LLMs with Excess Token Prevention
por: Guo, Lianghong, et al.
Publicado: (2024)