Saved in:
| Main Authors: | Röder, Daniel, Juneja, Akhil, Roller, Roland, Schmeier, Sven |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.14382 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Infherno: End-to-end Agent-based FHIR Resource Synthesis from Free-form Clinical Notes
by: Frei, Johann, et al.
Published: (2025)
by: Frei, Johann, et al.
Published: (2025)
Scaling Web Agent Training through Automatic Data Generation and Fine-grained Evaluation
by: Logeswaran, Lajanugen, et al.
Published: (2026)
by: Logeswaran, Lajanugen, et al.
Published: (2026)
Promoting Sustainable Web Agents: Benchmarking and Estimating Energy Consumption through Empirical and Theoretical Analysis
by: Krupp, Lars, et al.
Published: (2025)
by: Krupp, Lars, et al.
Published: (2025)
FoodCHA: Multi-Modal LLM Agent for Fine-Grained Food Analysis
by: Lee, Woojin, et al.
Published: (2026)
by: Lee, Woojin, et al.
Published: (2026)
Small Language Models Fine-tuned to Coordinate Larger Language Models improve Complex Reasoning
by: Juneja, Gurusha, et al.
Published: (2023)
by: Juneja, Gurusha, et al.
Published: (2023)
POIROT: Interrogating Agents for Failure Detection in Multi-Agent Systems
by: Varela, Iñaki Dellibarda, et al.
Published: (2026)
by: Varela, Iñaki Dellibarda, et al.
Published: (2026)
FLAMES: Improving LLM Math Reasoning via a Fine-Grained Analysis of the Data Synthesis Pipeline
by: Seegmiller, Parker, et al.
Published: (2025)
by: Seegmiller, Parker, et al.
Published: (2025)
In-depth Research Impact Summarization through Fine-Grained Temporal Citation Analysis
by: Arnaout, Hiba, et al.
Published: (2025)
by: Arnaout, Hiba, et al.
Published: (2025)
Multi-class Classifier based Failure Prediction with Artificial and Anonymous Training for Data Privacy
by: Das, Dibakar, et al.
Published: (2022)
by: Das, Dibakar, et al.
Published: (2022)
Recycling Failures: Salvaging Exploration in RLVR via Fine-Grained Off-Policy Guidance
by: Ren, Yanwei, et al.
Published: (2026)
by: Ren, Yanwei, et al.
Published: (2026)
Advancing Healthcare Automation: Multi-Agent System for Medical Necessity Justification
by: Pandey, Himanshu, et al.
Published: (2024)
by: Pandey, Himanshu, et al.
Published: (2024)
$\texttt{LM}^\texttt{2}$: A Simple Society of Language Models Solves Complex Reasoning
by: Juneja, Gurusha, et al.
Published: (2024)
by: Juneja, Gurusha, et al.
Published: (2024)
Enhancing Graph Attention Neural Network Performance for Marijuana Consumption Classification through Large-scale Augmented Granger Causality (lsAGC) Analysis of Functional MR Images
by: Vosoughi, Ali, et al.
Published: (2024)
by: Vosoughi, Ali, et al.
Published: (2024)
InteractRank: Personalized Web-Scale Search Pre-Ranking with Cross Interaction Features
by: Khandagale, Sujay, et al.
Published: (2025)
by: Khandagale, Sujay, et al.
Published: (2025)
Diagnosing Generalization Failures in Fine-Tuned LLMs: A Cross-Architectural Study on Phishing Detection
by: Bobe III, Frank, et al.
Published: (2026)
by: Bobe III, Frank, et al.
Published: (2026)
Fine-Grained Graph Generation through Latent Mixture Scheduling
by: Vakil, Nidhi, et al.
Published: (2026)
by: Vakil, Nidhi, et al.
Published: (2026)
PRISM: Generation-Time Detection and Mitigation of Secret Leakage in Multi-Agent LLM Pipelines
by: Tapwal, Riya, et al.
Published: (2026)
by: Tapwal, Riya, et al.
Published: (2026)
Web Retrieval Agents for Evidence-Based Misinformation Detection
by: Tian, Jacob-Junqi, et al.
Published: (2024)
by: Tian, Jacob-Junqi, et al.
Published: (2024)
WebSentinel: Detecting and Localizing Prompt Injection Attacks for Web Agents
by: Wang, Xilong, et al.
Published: (2026)
by: Wang, Xilong, et al.
Published: (2026)
AutoSurfer -- Teaching Web Agents through Comprehensive Surfing, Learning, and Modeling
by: Faisal, Fazle Elahi, et al.
Published: (2026)
by: Faisal, Fazle Elahi, et al.
Published: (2026)
A-MAR: Agent-based Multimodal Art Retrieval for Fine-Grained Artwork Understanding
by: Wang, Shuai, et al.
Published: (2026)
by: Wang, Shuai, et al.
Published: (2026)
Fleet of Agents: Coordinated Problem Solving with Large Language Models
by: Klein, Lars, et al.
Published: (2024)
by: Klein, Lars, et al.
Published: (2024)
Agent-X: Full Pipeline Acceleration of On-device AI Agents
by: Chung, Jinha, et al.
Published: (2026)
by: Chung, Jinha, et al.
Published: (2026)
Adversarial Training for Process Reward Models
by: Juneja, Gurusha, et al.
Published: (2025)
by: Juneja, Gurusha, et al.
Published: (2025)
ELT-Bench: An End-to-End Benchmark for Evaluating AI Agents on ELT Pipelines
by: Jin, Tengjun, et al.
Published: (2025)
by: Jin, Tengjun, et al.
Published: (2025)
NeuroWeaver: An Autonomous Evolutionary Agent for Exploring the Programmatic Space of EEG Analysis Pipelines
by: Wang, Guoan, et al.
Published: (2026)
by: Wang, Guoan, et al.
Published: (2026)
Cross-Layer Attention Probing for Fine-Grained Hallucination Detection
by: Suresh, Malavika, et al.
Published: (2025)
by: Suresh, Malavika, et al.
Published: (2025)
A Multi-Agent Rhizomatic Pipeline for Non-Linear Literature Analysis
by: Serrano, Julio C., et al.
Published: (2026)
by: Serrano, Julio C., et al.
Published: (2026)
Detecting Adversarial Fine-tuning with Auditing Agents
by: Egler, Sarah, et al.
Published: (2025)
by: Egler, Sarah, et al.
Published: (2025)
Scilab-RL: A software framework for efficient reinforcement learning and cognitive modeling research
by: Dohmen, Jan, et al.
Published: (2024)
by: Dohmen, Jan, et al.
Published: (2024)
ST-WebAgentBench: A Benchmark for Evaluating Safety and Trustworthiness in Web Agents
by: Levy, Ido, et al.
Published: (2024)
by: Levy, Ido, et al.
Published: (2024)
Adapting LLMs for the Medical Domain in Portuguese: A Study on Fine-Tuning and Model Evaluation
by: Paiola, Pedro Henrique, et al.
Published: (2024)
by: Paiola, Pedro Henrique, et al.
Published: (2024)
AutoMLGen: Navigating Fine-Grained Optimization for Coding Agents
by: Du, Shangheng, et al.
Published: (2025)
by: Du, Shangheng, et al.
Published: (2025)
OpAgent: Operator Agent for Web Navigation
by: Guo, Yuyu, et al.
Published: (2026)
by: Guo, Yuyu, et al.
Published: (2026)
Boosting Embodied AI Agents through Perception-Generation Disaggregation and Asynchronous Pipeline Execution
by: Zhang, Shulai, et al.
Published: (2025)
by: Zhang, Shulai, et al.
Published: (2025)
MagicGUI: A Foundational Mobile GUI Agent with Scalable Data Pipeline and Reinforcement Fine-tuning
by: Tang, Liujian, et al.
Published: (2025)
by: Tang, Liujian, et al.
Published: (2025)
Robust and Fine-Grained Detection of AI Generated Texts
by: Kadiyala, Ram Mohan Rao, et al.
Published: (2025)
by: Kadiyala, Ram Mohan Rao, et al.
Published: (2025)
ChartCitor: Multi-Agent Framework for Fine-Grained Chart Visual Attribution
by: Goswami, Kanika, et al.
Published: (2025)
by: Goswami, Kanika, et al.
Published: (2025)
TAMO: Fine-Grained Root Cause Analysis via Tool-Assisted LLM Agent with Multi-Modality Observation Data in Cloud-Native Systems
by: Zhang, Xiao, et al.
Published: (2025)
by: Zhang, Xiao, et al.
Published: (2025)
EmbeWebAgent: Embedding Web Agents into Any Customized UI
by: Ma, Chenyang, et al.
Published: (2026)
by: Ma, Chenyang, et al.
Published: (2026)
Similar Items
-
Infherno: End-to-end Agent-based FHIR Resource Synthesis from Free-form Clinical Notes
by: Frei, Johann, et al.
Published: (2025) -
Scaling Web Agent Training through Automatic Data Generation and Fine-grained Evaluation
by: Logeswaran, Lajanugen, et al.
Published: (2026) -
Promoting Sustainable Web Agents: Benchmarking and Estimating Energy Consumption through Empirical and Theoretical Analysis
by: Krupp, Lars, et al.
Published: (2025) -
FoodCHA: Multi-Modal LLM Agent for Fine-Grained Food Analysis
by: Lee, Woojin, et al.
Published: (2026) -
Small Language Models Fine-tuned to Coordinate Larger Language Models improve Complex Reasoning
by: Juneja, Gurusha, et al.
Published: (2023)