| Main Authors: | Liu, Zixian, Liu, Sihao, Zhao, Yuqi |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.01366 |
Similar Items
CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents
by: Xu, Tianqi, et al.
Published: (2024)
PokeLLMon: A Human-Parity Agent for Pokemon Battles with Large Language Models
by: Hu, Sihao, et al.
Published: (2024)
EgoBench: An Interactive Egocentric Multimodal Benchmark for Tool-Using Agents
by: Liu, Yunqi, et al.
Published: (2026)
SC-Arena: A Natural Language Benchmark for Single-Cell Reasoning with Knowledge-Augmented Evaluation
by: Zhao, Jiahao, et al.
Published: (2026)
Graph-Augmented Large Language Model Agents: Current Progress and Future Prospects
by: Liu, Yixin, et al.
Published: (2025)
MEDMKG: Benchmarking Medical Knowledge Exploitation with Multimodal Knowledge Graph
by: Wang, Xiaochen, et al.
Published: (2025)
Annotation Guidelines-Based Knowledge Augmentation: Towards Enhancing Large Language Models for Educational Text Classification
by: Liu, Shiqi, et al.
Published: (2024)
A Survey on Large Language Model-Based Game Agents
by: Hu, Sihao, et al.
Published: (2024)
Knowledge Graph Augmented Large Language Models for Disease Prediction
by: Wang, Ruiyu, et al.
Published: (2025)
Benchmarking Multimodal Knowledge Conflict for Large Multimodal Models
by: Jia, Yifan, et al.
Published: (2025)
UrbanKGent: A Unified Large Language Model Agent Framework for Urban Knowledge Graph Construction
by: Ning, Yansong, et al.
Published: (2024)
Integrating Object Detection Modality into Visual Language Model for Enhanced Autonomous Driving Agent
by: He, Linfeng, et al.
Published: (2024)
Cross-Modal Attention Network with Dual Graph Learning in Multimodal Recommendation
by: Dai, Ji, et al.
Published: (2026)
Assessing the Code Clone Detection Capability of Large Language Models
by: Zhang, Zixian, et al.
Published: (2024)
Automated Construction of Medical Indicator Knowledge Graphs Using Retrieval Augmented Large Language Models
by: Wang, Zhengda, et al.
Published: (2025)
KGPA: Robustness Evaluation for Large Language Models via Cross-Domain Knowledge Graphs
by: Pei, Aihua, et al.
Published: (2024)
Learning to Plan for Retrieval-Augmented Large Language Models from Knowledge Graphs
by: Wang, Junjie, et al.
Published: (2024)
Evaluating Tool-Augmented Agents in Remote Sensing Platforms
by: Singh, Simranjit, et al.
Published: (2024)
CrossVid: A Comprehensive Benchmark for Evaluating Cross-Video Reasoning in Multimodal Large Language Models
by: Li, Jingyao, et al.
Published: (2025)
DCGL: Dual-Channel Graph Learning with Large Language Models for Knowledge-Aware Recommendation
by: Zou, Xinchi, et al.
Published: (2026)
Enhancing Large Language Models (LLMs) for Telecom using Dynamic Knowledge Graphs and Explainable Retrieval-Augmented Generation
by: Yuan, Dun, et al.
Published: (2026)
MindBridge: Scalable and Cross-Model Knowledge Editing via Memory-Augmented Modality
by: Li, Shuaike, et al.
Published: (2025)
Graph2Eval: Automatic Multimodal Task Generation for Agents via Knowledge Graphs
by: Chen, Yurun, et al.
Published: (2025)
ElecBench: a Power Dispatch Evaluation Benchmark for Large Language Models
by: Zhou, Xiyuan, et al.
Published: (2024)
MPCI-Bench: A Benchmark for Multimodal Pairwise Contextual Integrity Evaluation of Language Model Agents
by: Wang, Shouju, et al.
Published: (2026)
Retrieval-Augmented Language Model for Extreme Multi-Label Knowledge Graph Link Prediction
by: Lin, Yu-Hsiang, et al.
Published: (2024)
Multimodal Cultural Heritage Knowledge Graph Extension with Language and Vision Models
by: Zhang, Yang, et al.
Published: (2026)
TeachAnything: A Multimodal Crowdsourcing Platform for Training Embodied AI Agents in Symmetrical Reality
by: Liu, Zidong, et al.
Published: (2026)
Open CaptchaWorld: A Comprehensive Web-based Platform for Testing and Benchmarking Multimodal LLM Agents
by: Luo, Yaxin, et al.
Published: (2025)
MAGNET: A Multi-Graph Attentional Network for Code Clone Detection
by: Zhang, Zixian, et al.
Published: (2025)
Knowledge Graph Enhanced Language Agents for Recommendation
by: Guo, Taicheng, et al.
Published: (2024)
ELMM: Efficient Lightweight Multimodal Large Language Models for Multimodal Knowledge Graph Completion
by: Huang, Wei, et al.
Published: (2025)
MediaClaw: Multimodal Intelligent-Agent Platform Technical Report
by: Zhao, Shaoan, et al.
Published: (2026)
USB: A Comprehensive and Unified Safety Evaluation Benchmark for Multimodal Large Language Models
by: Zheng, Baolin, et al.
Published: (2025)
Rethinking and Benchmarking Large Language Models for Graph Reasoning
by: Hu, Yuwei, et al.
Published: (2025)
Knowledge Graph-Guided Retrieval Augmented Generation
by: Zhu, Xiangrong, et al.
Published: (2025)
Evaluating Knowledge Graph Based Retrieval Augmented Generation Methods under Knowledge Incompleteness
by: Zhou, Dongzhuoran, et al.
Published: (2025)
Reasoning on Efficient Knowledge Paths: Knowledge Graph Guides Large Language Model for Domain Question Answering
by: Wang, Yuqi, et al.
Published: (2024)
AECBench: A Hierarchical Benchmark for Knowledge Evaluation of Large Language Models in the AEC Field
by: Liang, Chen, et al.
Published: (2025)
AST-Enhanced or AST-Overloaded? The Surprising Impact of Hybrid Graph Representations on Code Clone Detection
by: Zhang, Zixian, et al.
Published: (2025)