:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Huang, Jing-En, Fang, I-Sheng, Huang, Tzuhsuan, Liu, Yu-Lun, Wang, Chih-Yu, Chen, Jun-Cheng
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence Machine Learning Multiagent Systems
Online Access:	https://arxiv.org/abs/2506.04676
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Hydra: An Agentic Reasoning Approach for Enhancing Adversarial Robustness and Mitigating Hallucinations in Vision-Language Models
by: Chung-En, et al.
Published: (2025)

ORCA: An Agentic Reasoning Framework for Hallucination and Adversarial Robustness in Vision-Language Models
by: Yu, Chung-En Johnny, et al.
Published: (2025)

SPAgent: Adaptive Task Decomposition and Model Selection for General Video Generation and Editing
by: Tu, Rong-Cheng, et al.
Published: (2024)

Agentic-J: An AI Agent for Biological Microscopy Image Analysis
by: Johanns, Lukas, et al.
Published: (2026)

MAG-3D: Multi-Agent Grounded Reasoning for 3D Understanding
by: Zheng, Henry, et al.
Published: (2026)

Visual Reasoning Agent: Robust Vision Systems in Remote Sensing via Inference-Time Scaling
by: Yu, Chung-En Johnny, et al.
Published: (2025)

Multi-Agent Amodal Completion: Direct Synthesis with Fine-Grained Semantic Guidance
by: Fan, Hongxing, et al.
Published: (2025)

FetalAgents: A Multi-Agent System for Fetal Ultrasound Image and Video Analysis
by: Hu, Xiaotian, et al.
Published: (2026)

AIDE: Agentically Improve Visual Language Model with Domain Experts
by: Chiu, Ming-Chang, et al.
Published: (2025)

AutoGen Driven Multi Agent Framework for Iterative Crime Data Analysis and Prediction
by: Fatima, Syeda Kisaa, et al.
Published: (2025)

See it. Say it. Sorted: Agentic System for Compositional Diagram Generation
by: Zhang, Hantao, et al.
Published: (2025)

GenCellAgent: Generalizable, Training-Free Cellular Image Segmentation via Large Language Model Agents
by: Yu, Xi, et al.
Published: (2025)

Towards Reliable Fetal Ultrasound Interpretation with Multi-Agent Collaboration
by: Hu, Xiaotian, et al.
Published: (2026)

VideoChat-M1: Collaborative Policy Planning for Video Understanding via Multi-Agent Reinforcement Learning
by: Chen, Boyu, et al.
Published: (2025)

MAViS: A Multi-Agent Framework for Long-Sequence Video Storytelling
by: Wang, Qian, et al.
Published: (2025)

Autonomous Computer Vision Development with Agentic AI
by: Kim, Jin, et al.
Published: (2025)

OctoTools: An Agentic Framework with Extensible Tools for Complex Reasoning
by: Lu, Pan, et al.
Published: (2025)

Visual Multi-Agent System: Mitigating Hallucination Snowballing via Visual Flow
by: Yu, Xinlei, et al.
Published: (2025)

PhotoFlow: Agentic 3D Virtual Photography Missions
by: Guo, Jiarui, et al.
Published: (2026)

TraF-Align: Trajectory-aware Feature Alignment for Asynchronous Multi-agent Perception
by: Song, Zhiying, et al.
Published: (2025)

ARGOS: Who, Where, and When in Agentic Multi-Camera Person Search
by: Kim, Myungchul, et al.
Published: (2026)

LogiStory: A Logic-Aware Framework for Multi-Image Story Visualization
by: Meng, Chutian, et al.
Published: (2026)

AstroVLM: Expert Multi-agent Collaborative Reasoning for Astronomical Imaging Quality Diagnosis
by: Han, Yaohui, et al.
Published: (2026)

A Multi-Agent Perception-Action Alliance for Efficient Long Video Reasoning
by: Xu, Yichang, et al.
Published: (2026)

An Agentic System for Rare Disease Diagnosis with Traceable Reasoning
by: Zhao, Weike, et al.
Published: (2025)

A Multi-Agent System Enables Versatile Information Extraction from the Chemical Literature
by: Chen, Yufan, et al.
Published: (2025)

Agentic Design Review System
by: Nag, Sayan, et al.
Published: (2025)

RadAgents: Multimodal Agentic Reasoning for Chest X-ray Interpretation with Radiologist-like Workflows
by: Zhang, Kai, et al.
Published: (2025)

AniMaker: Multi-Agent Animated Storytelling with MCTS-Driven Clip Generation
by: Shi, Haoyuan, et al.
Published: (2025)

Hollywood Town: Long-Video Generation via Cross-Modal Multi-Agent Orchestration
by: Wei, Zheng, et al.
Published: (2025)

SynWorld: Virtual Scenario Synthesis for Agentic Action Knowledge Refinement
by: Fang, Runnan, et al.
Published: (2025)

PreGSU-A Generalized Traffic Scene Understanding Model for Autonomous Driving based on Pre-trained Graph Attention Network
by: Wang, Yuning, et al.
Published: (2024)

StarCraftImage: A Dataset For Prototyping Spatial Reasoning Methods For Multi-Agent Environments
by: Kulinski, Sean, et al.
Published: (2024)

Agentic Knowledgeable Self-awareness
by: Qiao, Shuofei, et al.
Published: (2025)

End-to-End Autonomous Driving through V2X Cooperation
by: Yu, Haibao, et al.
Published: (2024)

MARIC: Multi-Agent Reasoning for Image Classification
by: Seo, Wonduk, et al.
Published: (2025)

MaCTG: Multi-Agent Collaborative Thought Graph for Automatic Programming
by: Zhao, Zixiao, et al.
Published: (2024)

OralAgent: Integrating Reasoning, Tools, and Knowledge for Interactive Dental Image Analysis
by: Hao, Jing, et al.
Published: (2026)

VQQA: An Agentic Approach for Video Evaluation and Quality Improvement
by: Song, Yiwen, et al.
Published: (2026)

COMIC: Agentic Sketch Comedy Generation
by: Hong, Susung, et al.
Published: (2026)