:: Library Catalog

Bibliographic Details
Main Authors:	Röder, Daniel; Juneja, Akhil; Roller, Roland; Schmeier, Sven
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2509.14382

Similar Items

Infherno: End-to-end Agent-based FHIR Resource Synthesis from Free-form Clinical Notes
by: Frei, Johann, et al.
Published: (2025)

Scaling Web Agent Training through Automatic Data Generation and Fine-grained Evaluation
by: Logeswaran, Lajanugen, et al.
Published: (2026)

Promoting Sustainable Web Agents: Benchmarking and Estimating Energy Consumption through Empirical and Theoretical Analysis
by: Krupp, Lars, et al.
Published: (2025)

FoodCHA: Multi-Modal LLM Agent for Fine-Grained Food Analysis
by: Lee, Woojin, et al.
Published: (2026)

Small Language Models Fine-tuned to Coordinate Larger Language Models improve Complex Reasoning
by: Juneja, Gurusha, et al.
Published: (2023)

POIROT: Interrogating Agents for Failure Detection in Multi-Agent Systems
by: Varela, Iñaki Dellibarda, et al.
Published: (2026)

FLAMES: Improving LLM Math Reasoning via a Fine-Grained Analysis of the Data Synthesis Pipeline
by: Seegmiller, Parker, et al.
Published: (2025)

In-depth Research Impact Summarization through Fine-Grained Temporal Citation Analysis
by: Arnaout, Hiba, et al.
Published: (2025)

Multi-class Classifier based Failure Prediction with Artificial and Anonymous Training for Data Privacy
by: Das, Dibakar, et al.
Published: (2022)

Recycling Failures: Salvaging Exploration in RLVR via Fine-Grained Off-Policy Guidance
by: Ren, Yanwei, et al.
Published: (2026)

Advancing Healthcare Automation: Multi-Agent System for Medical Necessity Justification
by: Pandey, Himanshu, et al.
Published: (2024)

$\texttt{LM}^\texttt{2}$: A Simple Society of Language Models Solves Complex Reasoning
by: Juneja, Gurusha, et al.
Published: (2024)

Enhancing Graph Attention Neural Network Performance for Marijuana Consumption Classification through Large-scale Augmented Granger Causality (lsAGC) Analysis of Functional MR Images
by: Vosoughi, Ali, et al.
Published: (2024)

InteractRank: Personalized Web-Scale Search Pre-Ranking with Cross Interaction Features
by: Khandagale, Sujay, et al.
Published: (2025)

Diagnosing Generalization Failures in Fine-Tuned LLMs: A Cross-Architectural Study on Phishing Detection
by: Bobe III, Frank, et al.
Published: (2026)

Fine-Grained Graph Generation through Latent Mixture Scheduling
by: Vakil, Nidhi, et al.
Published: (2026)

PRISM: Generation-Time Detection and Mitigation of Secret Leakage in Multi-Agent LLM Pipelines
by: Tapwal, Riya, et al.
Published: (2026)

Web Retrieval Agents for Evidence-Based Misinformation Detection
by: Tian, Jacob-Junqi, et al.
Published: (2024)

WebSentinel: Detecting and Localizing Prompt Injection Attacks for Web Agents
by: Wang, Xilong, et al.
Published: (2026)

AutoSurfer -- Teaching Web Agents through Comprehensive Surfing, Learning, and Modeling
by: Faisal, Fazle Elahi, et al.
Published: (2026)

A-MAR: Agent-based Multimodal Art Retrieval for Fine-Grained Artwork Understanding
by: Wang, Shuai, et al.
Published: (2026)

Fleet of Agents: Coordinated Problem Solving with Large Language Models
by: Klein, Lars, et al.
Published: (2024)

Agent-X: Full Pipeline Acceleration of On-device AI Agents
by: Chung, Jinha, et al.
Published: (2026)

Adversarial Training for Process Reward Models
by: Juneja, Gurusha, et al.
Published: (2025)

ELT-Bench: An End-to-End Benchmark for Evaluating AI Agents on ELT Pipelines
by: Jin, Tengjun, et al.
Published: (2025)

NeuroWeaver: An Autonomous Evolutionary Agent for Exploring the Programmatic Space of EEG Analysis Pipelines
by: Wang, Guoan, et al.
Published: (2026)

Cross-Layer Attention Probing for Fine-Grained Hallucination Detection
by: Suresh, Malavika, et al.
Published: (2025)

A Multi-Agent Rhizomatic Pipeline for Non-Linear Literature Analysis
by: Serrano, Julio C., et al.
Published: (2026)

Detecting Adversarial Fine-tuning with Auditing Agents
by: Egler, Sarah, et al.
Published: (2025)

Scilab-RL: A software framework for efficient reinforcement learning and cognitive modeling research
by: Dohmen, Jan, et al.
Published: (2024)

ST-WebAgentBench: A Benchmark for Evaluating Safety and Trustworthiness in Web Agents
by: Levy, Ido, et al.
Published: (2024)

Adapting LLMs for the Medical Domain in Portuguese: A Study on Fine-Tuning and Model Evaluation
by: Paiola, Pedro Henrique, et al.
Published: (2024)

AutoMLGen: Navigating Fine-Grained Optimization for Coding Agents
by: Du, Shangheng, et al.
Published: (2025)

OpAgent: Operator Agent for Web Navigation
by: Guo, Yuyu, et al.
Published: (2026)

Boosting Embodied AI Agents through Perception-Generation Disaggregation and Asynchronous Pipeline Execution
by: Zhang, Shulai, et al.
Published: (2025)

MagicGUI: A Foundational Mobile GUI Agent with Scalable Data Pipeline and Reinforcement Fine-tuning
by: Tang, Liujian, et al.
Published: (2025)

Robust and Fine-Grained Detection of AI Generated Texts
by: Kadiyala, Ram Mohan Rao, et al.
Published: (2025)

ChartCitor: Multi-Agent Framework for Fine-Grained Chart Visual Attribution
by: Goswami, Kanika, et al.
Published: (2025)

TAMO: Fine-Grained Root Cause Analysis via Tool-Assisted LLM Agent with Multi-Modality Observation Data in Cloud-Native Systems
by: Zhang, Xiao, et al.
Published: (2025)

EmbeWebAgent: Embedding Web Agents into Any Customized UI
by: Ma, Chenyang, et al.
Published: (2026)