Saved in:
| Main Authors: | Huotala, Aleksi, Kuutila, Miikka, Turtio, Olli-Pekka, Sipilä, Simo, Mäntylä, Mika |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.06708 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
SESR-Eval: Dataset for Evaluating LLMs in the Title-Abstract Screening of Systematic Reviews
by: Huotala, Aleksi, et al.
Published: (2025)
by: Huotala, Aleksi, et al.
Published: (2025)
Research Artifacts in Secondary Studies: A Systematic Mapping in Software Engineering
by: Huotala, Aleksi, et al.
Published: (2025)
by: Huotala, Aleksi, et al.
Published: (2025)
The Promise and Challenges of Using LLMs to Accelerate the Screening Process of Systematic Reviews
by: Huotala, Aleksi, et al.
Published: (2024)
by: Huotala, Aleksi, et al.
Published: (2024)
What Makes Programmers Laugh? Exploring the Submissions of the Subreddit r/ProgrammerHumor
by: Kuutila, Miikka, et al.
Published: (2024)
by: Kuutila, Miikka, et al.
Published: (2024)
Individual Differences Limit Predicting Well-being and Productivity Using Software Repositories: A Longitudinal Industrial Study
by: Kuutila, Miikka, et al.
Published: (2021)
by: Kuutila, Miikka, et al.
Published: (2021)
Teaching Software Metrology: The Science of Measurement for Software Engineering
by: Ralph, Paul, et al.
Published: (2024)
by: Ralph, Paul, et al.
Published: (2024)
Detection, Classification and Prevalence of Self-Admitted Aging Debt
by: Sridharan, Murali, et al.
Published: (2025)
by: Sridharan, Murali, et al.
Published: (2025)
Cross-System Categorization of Abnormal Traces in Microservice-Based Systems via Meta-Learning
by: Wang, Yuqing, et al.
Published: (2024)
by: Wang, Yuqing, et al.
Published: (2024)
User Personas Improve Social Sustainability by Encouraging Software Developers to Deprioritize Antisocial Features
by: Ayoola, Bimpe, et al.
Published: (2024)
by: Ayoola, Bimpe, et al.
Published: (2024)
Token Interdependency Parsing (Tipping) -- Fast and Accurate Log Parsing
by: Hashemi, Shayan, et al.
Published: (2024)
by: Hashemi, Shayan, et al.
Published: (2024)
Speed and Performance of Parserless and Unsupervised Anomaly Detection Methods on Software Logs
by: Nyyssölä, Jesse, et al.
Published: (2023)
by: Nyyssölä, Jesse, et al.
Published: (2023)
Detecting Anomalies in Software Execution Logs with Siamese Network
by: Hashemi, Shayan, et al.
Published: (2021)
by: Hashemi, Shayan, et al.
Published: (2021)
LLM-based agents for automating the enhancement of user story quality: An early report
by: Zhang, Zheying, et al.
Published: (2024)
by: Zhang, Zheying, et al.
Published: (2024)
Agentic Frameworks for Reasoning Tasks: An Empirical Study
by: Rasheed, Zeeshan, et al.
Published: (2026)
by: Rasheed, Zeeshan, et al.
Published: (2026)
OneLog: Towards End-to-End Training in Software Log Anomaly Detection
by: Hashemi, Shayan, et al.
Published: (2021)
by: Hashemi, Shayan, et al.
Published: (2021)
Assessing REST API Test Generation Strategies with Log Coverage
by: Reinikainen, Nana, et al.
Published: (2026)
by: Reinikainen, Nana, et al.
Published: (2026)
Breaking the Illusion of Identity in LLM Tooling
by: Miller, Marek
Published: (2026)
by: Miller, Marek
Published: (2026)
EvolveTool-Bench: Evaluating the Quality of LLM-Generated Tool Libraries as Software Artifacts
by: Kaliyev, Alibek T., et al.
Published: (2026)
by: Kaliyev, Alibek T., et al.
Published: (2026)
Developing Retrieval Augmented Generation (RAG) based LLM Systems from PDFs: An Experience Report
by: Khan, Ayman Asad, et al.
Published: (2024)
by: Khan, Ayman Asad, et al.
Published: (2024)
Multi-Agent Systems for Root Cause Analysis in Microservices
by: Naakka, Alexander, et al.
Published: (2026)
by: Naakka, Alexander, et al.
Published: (2026)
The Tool-Overuse Illusion: Why Does LLM Prefer External Tools over Internal Knowledge?
by: Zeng, Yirong, et al.
Published: (2026)
by: Zeng, Yirong, et al.
Published: (2026)
Beyond Accuracy: LLM Variability in Evidence Screening for Software Engineering SLRs
by: Hida, Gilberto Sussumu, et al.
Published: (2026)
by: Hida, Gilberto Sussumu, et al.
Published: (2026)
Towards AI Evaluation in Domain-Specific RAG Systems: The AgriHubi Case Study
by: Hasan, Md. Toufique, et al.
Published: (2026)
by: Hasan, Md. Toufique, et al.
Published: (2026)
MCP-Zero: Active Tool Discovery for Autonomous LLM Agents
by: Fei, Xiang, et al.
Published: (2025)
by: Fei, Xiang, et al.
Published: (2025)
Schema First Tool APIs for LLM Agents: A Controlled Study of Tool Misuse, Recovery, and Budgeted Performance
by: Sigdel, Akshey, et al.
Published: (2026)
by: Sigdel, Akshey, et al.
Published: (2026)
LogLead -- Fast and Integrated Log Loader, Enhancer, and Anomaly Detector
by: Mäntylä, Mika, et al.
Published: (2023)
by: Mäntylä, Mika, et al.
Published: (2023)
Solver-Aided Verification of Policy Compliance in Tool-Augmented LLM Agents
by: Winston, Cailin, et al.
Published: (2026)
by: Winston, Cailin, et al.
Published: (2026)
Graph-Based Self-Healing Tool Routing for Cost-Efficient LLM Agents
by: Bholani, Neeraj
Published: (2026)
by: Bholani, Neeraj
Published: (2026)
DynamicsLLM: a Dynamic Analysis-based Tool for Generating Intelligent Execution Traces Using LLMs to Detect Android Behavioural Code Smells
by: Cherief, Houcine Abdelkader, et al.
Published: (2026)
by: Cherief, Houcine Abdelkader, et al.
Published: (2026)
ComplexMCP: Evaluation of LLM Agents in Dynamic, Interdependent, and Large-Scale Tool Sandbox
by: Li, Yuanyang, et al.
Published: (2026)
by: Li, Yuanyang, et al.
Published: (2026)
LogDx-CI: Benchmarking Log Reduction Tools for LLM Root-Cause Diagnosis
by: Qin, Bowen
Published: (2026)
by: Qin, Bowen
Published: (2026)
Automatic Red Teaming LLM-based Agents with Model Context Protocol Tools
by: He, Ping, et al.
Published: (2025)
by: He, Ping, et al.
Published: (2025)
Z-Space: A Multi-Agent Tool Orchestration Framework for Enterprise-Grade LLM Automation
by: He, Qingsong, et al.
Published: (2025)
by: He, Qingsong, et al.
Published: (2025)
RAG-MCP: Mitigating Prompt Bloat in LLM Tool Selection via Retrieval-Augmented Generation
by: Gan, Tiantian, et al.
Published: (2025)
by: Gan, Tiantian, et al.
Published: (2025)
Evaluating LLM-Based 0-to-1 Software Generation in End-to-End CLI Tool Scenarios
by: Hu, Ruida, et al.
Published: (2026)
by: Hu, Ruida, et al.
Published: (2026)
Enhancing LLM-Based Coding Tools through Native Integration of IDE-Derived Static Context
by: Li, Yichen, et al.
Published: (2024)
by: Li, Yichen, et al.
Published: (2024)
Cross-System Software Log-based Anomaly Detection Using Meta-Learning
by: Wang, Yuqing, et al.
Published: (2024)
by: Wang, Yuqing, et al.
Published: (2024)
AI and Agile Software Development: From Frustration to Success -- XP2025 Workshop Summary
by: Herda, Tomas, et al.
Published: (2025)
by: Herda, Tomas, et al.
Published: (2025)
A Comparative Study of Semantic Log Representations for Software Log-based Anomaly Detection
by: Wang, Yuqing, et al.
Published: (2026)
by: Wang, Yuqing, et al.
Published: (2026)
Automating Android Build Repair: Bridging the Reasoning-Execution Gap in LLM Agents with Domain-Specific Tools
by: Son, Ha Min, et al.
Published: (2025)
by: Son, Ha Min, et al.
Published: (2025)
Similar Items
-
SESR-Eval: Dataset for Evaluating LLMs in the Title-Abstract Screening of Systematic Reviews
by: Huotala, Aleksi, et al.
Published: (2025) -
Research Artifacts in Secondary Studies: A Systematic Mapping in Software Engineering
by: Huotala, Aleksi, et al.
Published: (2025) -
The Promise and Challenges of Using LLMs to Accelerate the Screening Process of Systematic Reviews
by: Huotala, Aleksi, et al.
Published: (2024) -
What Makes Programmers Laugh? Exploring the Submissions of the Subreddit r/ProgrammerHumor
by: Kuutila, Miikka, et al.
Published: (2024) -
Individual Differences Limit Predicting Well-being and Productivity Using Software Repositories: A Longitudinal Industrial Study
by: Kuutila, Miikka, et al.
Published: (2021)