Saved in:
| Main Authors: | Yan, Hang, Che, Xinyu, Xu, Fangzhi, Sun, Qiushi, Ding, Zichen, Cheng, Kanzhi, Zhang, Jian, Qin, Tao, Liu, Jun, Lin, Qika |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.02196 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
OpenMobile: Building Open Mobile Agents with Task and Trajectory Synthesis
by: Cheng, Kanzhi, et al.
Published: (2026)
by: Cheng, Kanzhi, et al.
Published: (2026)
Interactive Evolution: A Neural-Symbolic Self-Training Framework For Large Language Models
by: Xu, Fangzhi, et al.
Published: (2024)
by: Xu, Fangzhi, et al.
Published: (2024)
OdysseyArena: Benchmarking Large Language Models For Long-Horizon, Active and Inductive Interactions
by: Xu, Fangzhi, et al.
Published: (2026)
by: Xu, Fangzhi, et al.
Published: (2026)
SeeClick: Harnessing GUI Grounding for Advanced Visual GUI Agents
by: Cheng, Kanzhi, et al.
Published: (2024)
by: Cheng, Kanzhi, et al.
Published: (2024)
Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning
by: Xu, Fangzhi, et al.
Published: (2025)
by: Xu, Fangzhi, et al.
Published: (2025)
OS-ATLAS: A Foundation Action Model for Generalist GUI Agents
by: Wu, Zhiyong, et al.
Published: (2024)
by: Wu, Zhiyong, et al.
Published: (2024)
OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis
by: Sun, Qiushi, et al.
Published: (2024)
by: Sun, Qiushi, et al.
Published: (2024)
CapArena: Benchmarking and Analyzing Detailed Image Captioning in the LLM Era
by: Cheng, Kanzhi, et al.
Published: (2025)
by: Cheng, Kanzhi, et al.
Published: (2025)
$ϕ$-Decoding: Adaptive Foresight Sampling for Balanced Inference-Time Exploration and Exploitation
by: Xu, Fangzhi, et al.
Published: (2025)
by: Xu, Fangzhi, et al.
Published: (2025)
Symbol-LLM: Towards Foundational Symbol-centric Interface For Large Language Models
by: Xu, Fangzhi, et al.
Published: (2023)
by: Xu, Fangzhi, et al.
Published: (2023)
OS-Sentinel: Towards Safety-Enhanced Mobile GUI Agents via Hybrid Validation in Realistic Workflows
by: Sun, Qiushi, et al.
Published: (2025)
by: Sun, Qiushi, et al.
Published: (2025)
OS-Symphony: A Holistic Framework for Robust and Generalist Computer-Using Agent
by: Yang, Bowen, et al.
Published: (2026)
by: Yang, Bowen, et al.
Published: (2026)
ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows
by: Sun, Qiushi, et al.
Published: (2025)
by: Sun, Qiushi, et al.
Published: (2025)
A Semantic Mention Graph Augmented Model for Document-Level Event Argument Extraction
by: Zhang, Jian, et al.
Published: (2024)
by: Zhang, Jian, et al.
Published: (2024)
Are Large Language Models Really Good Logical Reasoners? A Comprehensive Evaluation and Beyond
by: Xu, Fangzhi, et al.
Published: (2023)
by: Xu, Fangzhi, et al.
Published: (2023)
TIDE-Bench: Task-Aware and Diagnostic Evaluation of Tool-Integrated Reasoning
by: Li, Yize, et al.
Published: (2026)
by: Li, Yize, et al.
Published: (2026)
PathReasoner: Modeling Reasoning Path with Equivalent Extension for Logical Question Answering
by: Xu, Fangzhi, et al.
Published: (2024)
by: Xu, Fangzhi, et al.
Published: (2024)
Vision-Language Models Can Self-Improve Reasoning via Reflection
by: Cheng, Kanzhi, et al.
Published: (2024)
by: Cheng, Kanzhi, et al.
Published: (2024)
MAPS: Multi-Agent Personality Shaping for Collaborative Reasoning
by: Zhang, Jian, et al.
Published: (2025)
by: Zhang, Jian, et al.
Published: (2025)
$A^3$-Bench: Benchmarking Memory-Driven Scientific Reasoning via Anchor and Attractor Activation
by: Zhang, Jian, et al.
Published: (2026)
by: Zhang, Jian, et al.
Published: (2026)
MUR: Momentum Uncertainty guided Reasoning for Large Language Models
by: Yan, Hang, et al.
Published: (2025)
by: Yan, Hang, et al.
Published: (2025)
AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant
by: Jia, Chengyou, et al.
Published: (2024)
by: Jia, Chengyou, et al.
Published: (2024)
Breaking the Data Barrier -- Building GUI Agents Through Task Generalization
by: Zhang, Junlei, et al.
Published: (2025)
by: Zhang, Junlei, et al.
Published: (2025)
Geo-TIDE
by: Eamer, Danika, et al.
Published: (2025)
by: Eamer, Danika, et al.
Published: (2025)
OS-Copilot: Towards Generalist Computer Agents with Self-Improvement
by: Wu, Zhiyong, et al.
Published: (2024)
by: Wu, Zhiyong, et al.
Published: (2024)
MFC-EQ: Mean-Field Control with Envelope Q-Learning for Moving Decentralized Agents in Formation
by: Lin, Qiushi, et al.
Published: (2024)
by: Lin, Qiushi, et al.
Published: (2024)
MAXS: Meta-Adaptive Exploration with LLM Agents
by: Zhang, Jian, et al.
Published: (2026)
by: Zhang, Jian, et al.
Published: (2026)
TIDE: Tuning-Integrated Dynamic Evolution for LLM-Based Automated Heuristic Design
by: Chen, Chentong, et al.
Published: (2026)
by: Chen, Chentong, et al.
Published: (2026)
Towards Unified Neurosymbolic Reasoning on Knowledge Graphs
by: Lin, Qika, et al.
Published: (2025)
by: Lin, Qika, et al.
Published: (2025)
GKG-LLM: A Unified Framework for Generalized Knowledge Graph Construction
by: Zhang, Jian, et al.
Published: (2025)
by: Zhang, Jian, et al.
Published: (2025)
TIDE: Temporal Incremental Draft Engine for Self-Improving LLM Inference
by: Park, Jiyoung, et al.
Published: (2026)
by: Park, Jiyoung, et al.
Published: (2026)
AgentHER: Hindsight Experience Replay for LLM Agent Trajectory Relabeling
by: Ding, Liang
Published: (2026)
by: Ding, Liang
Published: (2026)
TIDE: Textual Identity Detection for Evaluating and Augmenting Classification and Language Models
by: Klu, Emmanuel, et al.
Published: (2023)
by: Klu, Emmanuel, et al.
Published: (2023)
Self-Improving LLM Agents at Test-Time
by: Acikgoz, Emre Can, et al.
Published: (2025)
by: Acikgoz, Emre Can, et al.
Published: (2025)
IMAGES OF DISASTER, TSUNAMI TIDE OF GRIEF
by: THOMAS, EVAN
Published: (2005)
by: THOMAS, EVAN
Published: (2005)
Self-supervised Quantized Representation for Seamlessly Integrating Knowledge Graphs with Large Language Models
by: Lin, Qika, et al.
Published: (2025)
by: Lin, Qika, et al.
Published: (2025)
MARS: Multi-Agent Adaptive Reasoning with Socratic Guidance for Automated Prompt Optimization
by: Zhang, Jian, et al.
Published: (2025)
by: Zhang, Jian, et al.
Published: (2025)
ARISE: An Adaptive Resolution-Aware Metric for Test-Time Scaling Evaluation in Large Reasoning Models
by: Yin, Zhangyue, et al.
Published: (2025)
by: Yin, Zhangyue, et al.
Published: (2025)
TIDE: Training Locally Interpretable Domain Generalization Models Enables Test-time Correction
by: Agarwal, Aishwarya, et al.
Published: (2024)
by: Agarwal, Aishwarya, et al.
Published: (2024)
Test-Time-Matching: Decouple Personality, Memory, and Linguistic Style in LLM-based Role-Playing Language Agent
by: Zhan, Xiaoyu, et al.
Published: (2025)
by: Zhan, Xiaoyu, et al.
Published: (2025)
Similar Items
-
OpenMobile: Building Open Mobile Agents with Task and Trajectory Synthesis
by: Cheng, Kanzhi, et al.
Published: (2026) -
Interactive Evolution: A Neural-Symbolic Self-Training Framework For Large Language Models
by: Xu, Fangzhi, et al.
Published: (2024) -
OdysseyArena: Benchmarking Large Language Models For Long-Horizon, Active and Inductive Interactions
by: Xu, Fangzhi, et al.
Published: (2026) -
SeeClick: Harnessing GUI Grounding for Advanced Visual GUI Agents
by: Cheng, Kanzhi, et al.
Published: (2024) -
Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning
by: Xu, Fangzhi, et al.
Published: (2025)