:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Huotala, Aleksi, Kuutila, Miikka, Turtio, Olli-Pekka, Sipilä, Simo, Mäntylä, Mika
Format:	Preprint
Published:	2025
Subjects:	Software Engineering Artificial Intelligence
Online Access:	https://arxiv.org/abs/2510.06708
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

SESR-Eval: Dataset for Evaluating LLMs in the Title-Abstract Screening of Systematic Reviews
by: Huotala, Aleksi, et al.
Published: (2025)

Research Artifacts in Secondary Studies: A Systematic Mapping in Software Engineering
by: Huotala, Aleksi, et al.
Published: (2025)

The Promise and Challenges of Using LLMs to Accelerate the Screening Process of Systematic Reviews
by: Huotala, Aleksi, et al.
Published: (2024)

What Makes Programmers Laugh? Exploring the Submissions of the Subreddit r/ProgrammerHumor
by: Kuutila, Miikka, et al.
Published: (2024)

Individual Differences Limit Predicting Well-being and Productivity Using Software Repositories: A Longitudinal Industrial Study
by: Kuutila, Miikka, et al.
Published: (2021)

Teaching Software Metrology: The Science of Measurement for Software Engineering
by: Ralph, Paul, et al.
Published: (2024)

Detection, Classification and Prevalence of Self-Admitted Aging Debt
by: Sridharan, Murali, et al.
Published: (2025)

Cross-System Categorization of Abnormal Traces in Microservice-Based Systems via Meta-Learning
by: Wang, Yuqing, et al.
Published: (2024)

User Personas Improve Social Sustainability by Encouraging Software Developers to Deprioritize Antisocial Features
by: Ayoola, Bimpe, et al.
Published: (2024)

Token Interdependency Parsing (Tipping) -- Fast and Accurate Log Parsing
by: Hashemi, Shayan, et al.
Published: (2024)

Speed and Performance of Parserless and Unsupervised Anomaly Detection Methods on Software Logs
by: Nyyssölä, Jesse, et al.
Published: (2023)

Detecting Anomalies in Software Execution Logs with Siamese Network
by: Hashemi, Shayan, et al.
Published: (2021)

LLM-based agents for automating the enhancement of user story quality: An early report
by: Zhang, Zheying, et al.
Published: (2024)

Agentic Frameworks for Reasoning Tasks: An Empirical Study
by: Rasheed, Zeeshan, et al.
Published: (2026)

OneLog: Towards End-to-End Training in Software Log Anomaly Detection
by: Hashemi, Shayan, et al.
Published: (2021)

Assessing REST API Test Generation Strategies with Log Coverage
by: Reinikainen, Nana, et al.
Published: (2026)

Breaking the Illusion of Identity in LLM Tooling
by: Miller, Marek
Published: (2026)

EvolveTool-Bench: Evaluating the Quality of LLM-Generated Tool Libraries as Software Artifacts
by: Kaliyev, Alibek T., et al.
Published: (2026)

Developing Retrieval Augmented Generation (RAG) based LLM Systems from PDFs: An Experience Report
by: Khan, Ayman Asad, et al.
Published: (2024)

Multi-Agent Systems for Root Cause Analysis in Microservices
by: Naakka, Alexander, et al.
Published: (2026)

The Tool-Overuse Illusion: Why Does LLM Prefer External Tools over Internal Knowledge?
by: Zeng, Yirong, et al.
Published: (2026)

Beyond Accuracy: LLM Variability in Evidence Screening for Software Engineering SLRs
by: Hida, Gilberto Sussumu, et al.
Published: (2026)

Towards AI Evaluation in Domain-Specific RAG Systems: The AgriHubi Case Study
by: Hasan, Md. Toufique, et al.
Published: (2026)

MCP-Zero: Active Tool Discovery for Autonomous LLM Agents
by: Fei, Xiang, et al.
Published: (2025)

Schema First Tool APIs for LLM Agents: A Controlled Study of Tool Misuse, Recovery, and Budgeted Performance
by: Sigdel, Akshey, et al.
Published: (2026)

LogLead -- Fast and Integrated Log Loader, Enhancer, and Anomaly Detector
by: Mäntylä, Mika, et al.
Published: (2023)

Solver-Aided Verification of Policy Compliance in Tool-Augmented LLM Agents
by: Winston, Cailin, et al.
Published: (2026)

Graph-Based Self-Healing Tool Routing for Cost-Efficient LLM Agents
by: Bholani, Neeraj
Published: (2026)

DynamicsLLM: a Dynamic Analysis-based Tool for Generating Intelligent Execution Traces Using LLMs to Detect Android Behavioural Code Smells
by: Cherief, Houcine Abdelkader, et al.
Published: (2026)

ComplexMCP: Evaluation of LLM Agents in Dynamic, Interdependent, and Large-Scale Tool Sandbox
by: Li, Yuanyang, et al.
Published: (2026)

LogDx-CI: Benchmarking Log Reduction Tools for LLM Root-Cause Diagnosis
by: Qin, Bowen
Published: (2026)

Automatic Red Teaming LLM-based Agents with Model Context Protocol Tools
by: He, Ping, et al.
Published: (2025)

Z-Space: A Multi-Agent Tool Orchestration Framework for Enterprise-Grade LLM Automation
by: He, Qingsong, et al.
Published: (2025)

RAG-MCP: Mitigating Prompt Bloat in LLM Tool Selection via Retrieval-Augmented Generation
by: Gan, Tiantian, et al.
Published: (2025)

Evaluating LLM-Based 0-to-1 Software Generation in End-to-End CLI Tool Scenarios
by: Hu, Ruida, et al.
Published: (2026)

Enhancing LLM-Based Coding Tools through Native Integration of IDE-Derived Static Context
by: Li, Yichen, et al.
Published: (2024)

Cross-System Software Log-based Anomaly Detection Using Meta-Learning
by: Wang, Yuqing, et al.
Published: (2024)

AI and Agile Software Development: From Frustration to Success -- XP2025 Workshop Summary
by: Herda, Tomas, et al.
Published: (2025)

A Comparative Study of Semantic Log Representations for Software Log-based Anomaly Detection
by: Wang, Yuqing, et al.
Published: (2026)

Automating Android Build Repair: Bridging the Reasoning-Execution Gap in LLM Agents with Domain-Specific Tools
by: Son, Ha Min, et al.
Published: (2025)