:: Library Catalog

Copertina

Salvato in:

Dettagli Bibliografici
Autori principali:	Gaurav, Nishant, Akarsh, Adit, Ravishankar, Tejas, Bajaj, Manoj
Natura:	Preprint
Pubblicazione:	2025
Soggetti:	Software Engineering Artificial Intelligence
Accesso online:	https://arxiv.org/abs/2512.15813
Tags:	Aggiungi Tag Nessun Tag, puoi essere il primo ad aggiungerne!!

Documenti analoghi

Dynamic ReAct: Scalable Tool Selection for Large-Scale MCP Environments
di: Gaurav, Nishant, et al.
Pubblicazione: (2025)

Synthesizing Procedural Memory: Challenges and Architectures in Automated Workflow Generation
di: Gaurav, Nishant, et al.
Pubblicazione: (2025)

Architecture Without Architects: How AI Coding Agents Shape Software Architecture
di: Konrad, Phongsakon Mark, et al.
Pubblicazione: (2026)

MemGovern: Enhancing Code Agents through Learning from Governed Human Experiences
di: Wang, Qihao, et al.
Pubblicazione: (2026)

MCP-Atlas: A Large-Scale Benchmark for Tool-Use Competency with Real MCP Servers
di: Bandi, Chaithanya, et al.
Pubblicazione: (2026)

DeltaMCP: Incremental Regeneration via Spec-Aware Transformation for MCP servers
di: Pujara, Aditya, et al.
Pubblicazione: (2026)

ComplexMCP: Evaluation of LLM Agents in Dynamic, Interdependent, and Large-Scale Tool Sandbox
di: Li, Yuanyang, et al.
Pubblicazione: (2026)

MCP-Zero: Active Tool Discovery for Autonomous LLM Agents
di: Fei, Xiang, et al.
Pubblicazione: (2025)

Can Coding Agents Reproduce Findings in Computational Materials Science?
di: Huang, Ziyang, et al.
Pubblicazione: (2026)

Towards a Declarative Agentic Layer for Intelligent Agents in MCP-Based Server Ecosystems
di: Rodriguez-Sanchez, Maria Jesus, et al.
Pubblicazione: (2026)

AgentHub: A Registry for Discoverable, Verifiable, and Reproducible AI Agents
di: Pautsch, Erik, et al.
Pubblicazione: (2025)

AI-Generated Code Is Not Reproducible (Yet): An Empirical Study of Dependency Gaps in LLM-Based Coding Agents
di: Vangala, Bhanu Prakash, et al.
Pubblicazione: (2025)

MemRepair: Hierarchical Memory for Agentic Repository-Level Vulnerability Repair
di: Liu, Simiao, et al.
Pubblicazione: (2026)

Scaling Coding Agents via Atomic Skills
di: Ma, Yingwei, et al.
Pubblicazione: (2026)

RAG-MCP: Mitigating Prompt Bloat in LLM Tool Selection via Retrieval-Augmented Generation
di: Gan, Tiantian, et al.
Pubblicazione: (2025)

LMR-BENCH: Evaluating LLM Agent's Ability on Reproducing Language Modeling Research
di: Yan, Shuo, et al.
Pubblicazione: (2025)

Learning Correct Behavior from Examples: Validating Sequential Execution in Autonomous Agents
di: Sharma, Reshabh K, et al.
Pubblicazione: (2026)

ConvCodeWorld: Benchmarking Conversational Code Generation in Reproducible Feedback Environments
di: Han, Hojae, et al.
Pubblicazione: (2025)

Vibe Coding vs. Agentic Coding: Fundamentals and Practical Implications of Agentic AI
di: Sapkota, Ranjan, et al.
Pubblicazione: (2025)

We Urgently Need Privilege Management in MCP: A Measurement of API Usage in MCP Ecosystems
di: Li, Zhihao, et al.
Pubblicazione: (2025)

AgentLens: Revealing The Lucky Pass Problem in SWE-Agent Evaluation
di: Sahoo, Priyam, et al.
Pubblicazione: (2026)

TDD Governance for Multi-Agent Code Generation via Prompt Engineering
di: Hasanli, Tarlan, et al.
Pubblicazione: (2026)

RedCode: Risky Code Execution and Generation Benchmark for Code Agents
di: Guo, Chengquan, et al.
Pubblicazione: (2024)

Your Code Agent Can Grow Alongside You with Structured Memory
di: Deng, Yi-Xuan, et al.
Pubblicazione: (2026)

FailureMem: A Failure-Aware Multimodal Framework for Autonomous Software Repair
di: Ma, Ruize, et al.
Pubblicazione: (2026)

Code Review Agent Benchmark
di: Zhang, Yuntong, et al.
Pubblicazione: (2026)

HarnessAPI: A Skill-First Framework for Unified Streaming APIs and MCP Tools
di: Jose, Edwin
Pubblicazione: (2026)

Workflows vs Agents for Code Translation
di: Gray, Henry, et al.
Pubblicazione: (2025)

Theory of Code Space: Do Code Agents Understand Software Architecture?
di: Sapunov, Grigory
Pubblicazione: (2026)

SpecAgent: A Speculative Retrieval and Forecasting Agent for Code Completion
di: Ma, George, et al.
Pubblicazione: (2025)

Semantic Tool Discovery for Large Language Models: A Vector-Based Approach to MCP Tool Selection
di: Mudunuri, Sarat, et al.
Pubblicazione: (2026)

Impact and Implications of Generative AI for Enterprise Architects in Agile Environments: A Systematic Literature Review
di: Kooy, Stefan Julian, et al.
Pubblicazione: (2025)

Code Researcher: Deep Research Agent for Large Systems Code and Commit History
di: Singh, Ramneet, et al.
Pubblicazione: (2025)

Analyzing Message-Code Inconsistency in AI Coding Agent-Authored Pull Requests
di: Gong, Jingzhi, et al.
Pubblicazione: (2026)

Correctness isnt Efficiency: Runtime Memory Divergence in LLM-Generated Code
di: Rajput, Prateek, et al.
Pubblicazione: (2026)

GA4GC: Greener Agent for Greener Code via Multi-Objective Configuration Optimization
di: Gong, Jingzhi, et al.
Pubblicazione: (2025)

TransAgent: Enhancing LLM-Based Code Translation via Fine-Grained Execution Alignment
di: Yuan, Zhiqiang, et al.
Pubblicazione: (2024)

Imitation Game: Reproducing Deep Learning Bugs Leveraging an Intelligent Agent
di: Shah, Mehil B, et al.
Pubblicazione: (2025)

HyperAgent: Generalist Software Engineering Agents to Solve Coding Tasks at Scale
di: Phan, Huy Nhat, et al.
Pubblicazione: (2024)

DialogAgent: An Auto-engagement Agent for Code Question Answering Data Production
di: Liang, Xiaoyun, et al.
Pubblicazione: (2024)