Saved in:
| Main Authors: | Huang, Jing-En, Fang, I-Sheng, Huang, Tzuhsuan, Liu, Yu-Lun, Wang, Chih-Yu, Chen, Jun-Cheng |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.04676 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Hydra: An Agentic Reasoning Approach for Enhancing Adversarial Robustness and Mitigating Hallucinations in Vision-Language Models
by: Chung-En, et al.
Published: (2025)
by: Chung-En, et al.
Published: (2025)
ORCA: An Agentic Reasoning Framework for Hallucination and Adversarial Robustness in Vision-Language Models
by: Yu, Chung-En Johnny, et al.
Published: (2025)
by: Yu, Chung-En Johnny, et al.
Published: (2025)
SPAgent: Adaptive Task Decomposition and Model Selection for General Video Generation and Editing
by: Tu, Rong-Cheng, et al.
Published: (2024)
by: Tu, Rong-Cheng, et al.
Published: (2024)
Agentic-J: An AI Agent for Biological Microscopy Image Analysis
by: Johanns, Lukas, et al.
Published: (2026)
by: Johanns, Lukas, et al.
Published: (2026)
MAG-3D: Multi-Agent Grounded Reasoning for 3D Understanding
by: Zheng, Henry, et al.
Published: (2026)
by: Zheng, Henry, et al.
Published: (2026)
Visual Reasoning Agent: Robust Vision Systems in Remote Sensing via Inference-Time Scaling
by: Yu, Chung-En Johnny, et al.
Published: (2025)
by: Yu, Chung-En Johnny, et al.
Published: (2025)
Multi-Agent Amodal Completion: Direct Synthesis with Fine-Grained Semantic Guidance
by: Fan, Hongxing, et al.
Published: (2025)
by: Fan, Hongxing, et al.
Published: (2025)
FetalAgents: A Multi-Agent System for Fetal Ultrasound Image and Video Analysis
by: Hu, Xiaotian, et al.
Published: (2026)
by: Hu, Xiaotian, et al.
Published: (2026)
AIDE: Agentically Improve Visual Language Model with Domain Experts
by: Chiu, Ming-Chang, et al.
Published: (2025)
by: Chiu, Ming-Chang, et al.
Published: (2025)
AutoGen Driven Multi Agent Framework for Iterative Crime Data Analysis and Prediction
by: Fatima, Syeda Kisaa, et al.
Published: (2025)
by: Fatima, Syeda Kisaa, et al.
Published: (2025)
See it. Say it. Sorted: Agentic System for Compositional Diagram Generation
by: Zhang, Hantao, et al.
Published: (2025)
by: Zhang, Hantao, et al.
Published: (2025)
GenCellAgent: Generalizable, Training-Free Cellular Image Segmentation via Large Language Model Agents
by: Yu, Xi, et al.
Published: (2025)
by: Yu, Xi, et al.
Published: (2025)
Towards Reliable Fetal Ultrasound Interpretation with Multi-Agent Collaboration
by: Hu, Xiaotian, et al.
Published: (2026)
by: Hu, Xiaotian, et al.
Published: (2026)
VideoChat-M1: Collaborative Policy Planning for Video Understanding via Multi-Agent Reinforcement Learning
by: Chen, Boyu, et al.
Published: (2025)
by: Chen, Boyu, et al.
Published: (2025)
MAViS: A Multi-Agent Framework for Long-Sequence Video Storytelling
by: Wang, Qian, et al.
Published: (2025)
by: Wang, Qian, et al.
Published: (2025)
Autonomous Computer Vision Development with Agentic AI
by: Kim, Jin, et al.
Published: (2025)
by: Kim, Jin, et al.
Published: (2025)
OctoTools: An Agentic Framework with Extensible Tools for Complex Reasoning
by: Lu, Pan, et al.
Published: (2025)
by: Lu, Pan, et al.
Published: (2025)
Visual Multi-Agent System: Mitigating Hallucination Snowballing via Visual Flow
by: Yu, Xinlei, et al.
Published: (2025)
by: Yu, Xinlei, et al.
Published: (2025)
PhotoFlow: Agentic 3D Virtual Photography Missions
by: Guo, Jiarui, et al.
Published: (2026)
by: Guo, Jiarui, et al.
Published: (2026)
TraF-Align: Trajectory-aware Feature Alignment for Asynchronous Multi-agent Perception
by: Song, Zhiying, et al.
Published: (2025)
by: Song, Zhiying, et al.
Published: (2025)
ARGOS: Who, Where, and When in Agentic Multi-Camera Person Search
by: Kim, Myungchul, et al.
Published: (2026)
by: Kim, Myungchul, et al.
Published: (2026)
LogiStory: A Logic-Aware Framework for Multi-Image Story Visualization
by: Meng, Chutian, et al.
Published: (2026)
by: Meng, Chutian, et al.
Published: (2026)
AstroVLM: Expert Multi-agent Collaborative Reasoning for Astronomical Imaging Quality Diagnosis
by: Han, Yaohui, et al.
Published: (2026)
by: Han, Yaohui, et al.
Published: (2026)
A Multi-Agent Perception-Action Alliance for Efficient Long Video Reasoning
by: Xu, Yichang, et al.
Published: (2026)
by: Xu, Yichang, et al.
Published: (2026)
An Agentic System for Rare Disease Diagnosis with Traceable Reasoning
by: Zhao, Weike, et al.
Published: (2025)
by: Zhao, Weike, et al.
Published: (2025)
A Multi-Agent System Enables Versatile Information Extraction from the Chemical Literature
by: Chen, Yufan, et al.
Published: (2025)
by: Chen, Yufan, et al.
Published: (2025)
Agentic Design Review System
by: Nag, Sayan, et al.
Published: (2025)
by: Nag, Sayan, et al.
Published: (2025)
RadAgents: Multimodal Agentic Reasoning for Chest X-ray Interpretation with Radiologist-like Workflows
by: Zhang, Kai, et al.
Published: (2025)
by: Zhang, Kai, et al.
Published: (2025)
AniMaker: Multi-Agent Animated Storytelling with MCTS-Driven Clip Generation
by: Shi, Haoyuan, et al.
Published: (2025)
by: Shi, Haoyuan, et al.
Published: (2025)
Hollywood Town: Long-Video Generation via Cross-Modal Multi-Agent Orchestration
by: Wei, Zheng, et al.
Published: (2025)
by: Wei, Zheng, et al.
Published: (2025)
SynWorld: Virtual Scenario Synthesis for Agentic Action Knowledge Refinement
by: Fang, Runnan, et al.
Published: (2025)
by: Fang, Runnan, et al.
Published: (2025)
PreGSU-A Generalized Traffic Scene Understanding Model for Autonomous Driving based on Pre-trained Graph Attention Network
by: Wang, Yuning, et al.
Published: (2024)
by: Wang, Yuning, et al.
Published: (2024)
StarCraftImage: A Dataset For Prototyping Spatial Reasoning Methods For Multi-Agent Environments
by: Kulinski, Sean, et al.
Published: (2024)
by: Kulinski, Sean, et al.
Published: (2024)
Agentic Knowledgeable Self-awareness
by: Qiao, Shuofei, et al.
Published: (2025)
by: Qiao, Shuofei, et al.
Published: (2025)
End-to-End Autonomous Driving through V2X Cooperation
by: Yu, Haibao, et al.
Published: (2024)
by: Yu, Haibao, et al.
Published: (2024)
MARIC: Multi-Agent Reasoning for Image Classification
by: Seo, Wonduk, et al.
Published: (2025)
by: Seo, Wonduk, et al.
Published: (2025)
MaCTG: Multi-Agent Collaborative Thought Graph for Automatic Programming
by: Zhao, Zixiao, et al.
Published: (2024)
by: Zhao, Zixiao, et al.
Published: (2024)
OralAgent: Integrating Reasoning, Tools, and Knowledge for Interactive Dental Image Analysis
by: Hao, Jing, et al.
Published: (2026)
by: Hao, Jing, et al.
Published: (2026)
VQQA: An Agentic Approach for Video Evaluation and Quality Improvement
by: Song, Yiwen, et al.
Published: (2026)
by: Song, Yiwen, et al.
Published: (2026)
COMIC: Agentic Sketch Comedy Generation
by: Hong, Susung, et al.
Published: (2026)
by: Hong, Susung, et al.
Published: (2026)
Similar Items
-
Hydra: An Agentic Reasoning Approach for Enhancing Adversarial Robustness and Mitigating Hallucinations in Vision-Language Models
by: Chung-En, et al.
Published: (2025) -
ORCA: An Agentic Reasoning Framework for Hallucination and Adversarial Robustness in Vision-Language Models
by: Yu, Chung-En Johnny, et al.
Published: (2025) -
SPAgent: Adaptive Task Decomposition and Model Selection for General Video Generation and Editing
by: Tu, Rong-Cheng, et al.
Published: (2024) -
Agentic-J: An AI Agent for Biological Microscopy Image Analysis
by: Johanns, Lukas, et al.
Published: (2026) -
MAG-3D: Multi-Agent Grounded Reasoning for 3D Understanding
by: Zheng, Henry, et al.
Published: (2026)