Saved in:
| Main Authors: | Xu, Xinbo, Yang, Ruihan, Shen, Haiyang, Xu, Wendong, Gao, Bofei, Wu, Ruoyu, Shi, Kean, Xie, Weichu, Chen, Xuanzhong, Wu, Ming, Zeng, Jason, Heinrich, Michael, Zhang, Elvis, Chen, Liang, Li, Kuan, Chang, Baobao |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.15846 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Step-wise Rubric Rewards for LLM Reasoning
by: Xie, Weichu, et al.
Published: (2026)
by: Xie, Weichu, et al.
Published: (2026)
Improving MLLM Training Efficiency via Stage-Aware Sparsity
by: Shi, Kean, et al.
Published: (2025)
by: Shi, Kean, et al.
Published: (2025)
BabyVision: Visual Reasoning Beyond Language
by: Chen, Liang, et al.
Published: (2026)
by: Chen, Liang, et al.
Published: (2026)
Agentic Software Engineering: Foundational Pillars and a Research Roadmap
by: Hassan, Ahmed E., et al.
Published: (2025)
by: Hassan, Ahmed E., et al.
Published: (2025)
RareBench: Can LLMs Serve as Rare Diseases Specialists?
by: Chen, Xuanzhong, et al.
Published: (2024)
by: Chen, Xuanzhong, et al.
Published: (2024)
Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think
by: Chen, Liang, et al.
Published: (2025)
by: Chen, Liang, et al.
Published: (2025)
Design, Synthesis, and Performance of Novel Nano‐CoO/NiO‐loaded and Sulfonation‐Modified ZSM‐5 Composite Catalyst for In Situ Conversion of Oil Shale
by: Aibin Wu, et al.
Published: (2024)
by: Aibin Wu, et al.
Published: (2024)
X-WebAgentBench: A Multilingual Interactive Web Benchmark for Evaluating Global Agentic System
by: Wang, Peng, et al.
Published: (2025)
by: Wang, Peng, et al.
Published: (2025)
AMA-Bench: Evaluating Long-Horizon Memory for Agentic Applications
by: Zhao, Yujie, et al.
Published: (2026)
by: Zhao, Yujie, et al.
Published: (2026)
Toward Agentic Software Project Management: A Vision and Roadmap
by: Assalaarachchi, Lakshana Iruni, et al.
Published: (2026)
by: Assalaarachchi, Lakshana Iruni, et al.
Published: (2026)
IterResearch: Rethinking Long-Horizon Agents with Interaction Scaling
by: Chen, Guoxin, et al.
Published: (2025)
by: Chen, Guoxin, et al.
Published: (2025)
A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image Generation
by: Chen, Liang, et al.
Published: (2024)
by: Chen, Liang, et al.
Published: (2024)
The Breakthrough and Confrontation of Mainland Chinese Opera Films in Hong Kong under the Cold War Framework (1953–1957)
by: Du, Jiachen, et al.
Published: (2025)
by: Du, Jiachen, et al.
Published: (2025)
Simple Vertex Algebras Arising From Congruence Subgroups
by: Dai, Xuanzhong, et al.
Published: (2022)
by: Dai, Xuanzhong, et al.
Published: (2022)
WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents
by: Qiao, Zile, et al.
Published: (2025)
by: Qiao, Zile, et al.
Published: (2025)
FeatureBench: Benchmarking Agentic Coding for Complex Feature Development
by: Zhou, Qixing, et al.
Published: (2026)
by: Zhou, Qixing, et al.
Published: (2026)
Inverse-free quantum state estimation with Heisenberg scaling
by: Chen, Kean
Published: (2025)
by: Chen, Kean
Published: (2025)
Upgrading Systems, Software, and Microcomputers.
by: Berry, John
Published: (1989)
by: Berry, John
Published: (1989)
Towards Emergency Scenarios: An Integrated Decision-making Framework of Multi-lane Platoon Reorganization
by: Kong, Aijing, et al.
Published: (2025)
by: Kong, Aijing, et al.
Published: (2025)
The Role of Lactate in Drug Addiction
by: Ruiqi Chen, et al.
Published: (2025)
by: Ruiqi Chen, et al.
Published: (2025)
Agentic Aggregation for Parallel Scaling of Long-Horizon Agentic Tasks
by: Lee, Yoonsang, et al.
Published: (2026)
by: Lee, Yoonsang, et al.
Published: (2026)
DecisionBench: A Benchmark for Emergent Delegation in Long-Horizon Agentic Workflows
by: Gao, Yuxuan, et al.
Published: (2026)
by: Gao, Yuxuan, et al.
Published: (2026)
NeuronMLP: Efficient LLM Inference via Singular Value Decomposition Compression and Tiling on AWS Trainium
by: Song, Dinghong, et al.
Published: (2025)
by: Song, Dinghong, et al.
Published: (2025)
Evolutionary Computation in the Era of Large Language Model: Survey and Roadmap
by: Wu, Xingyu, et al.
Published: (2024)
by: Wu, Xingyu, et al.
Published: (2024)
Advancing Radar Hand Gesture Recognition: A Hybrid Spectrum Synthetic Framework Merging Simulation with Neural Networks
by: Tang, Jiaqi, et al.
Published: (2025)
by: Tang, Jiaqi, et al.
Published: (2025)
HorizonBench: Long-Horizon Personalization with Evolving Preferences
by: Li, Shuyue Stella, et al.
Published: (2026)
by: Li, Shuyue Stella, et al.
Published: (2026)
Upgrading Application Software: Problems and Perspectives.
by: Corbly, James E.
Published: (1997)
by: Corbly, James E.
Published: (1997)
Evolaris: A Roadmap to Self-Evolving Software Intelligence Management
by: Liu, Chengwei, et al.
Published: (2025)
by: Liu, Chengwei, et al.
Published: (2025)
Roadmap of Anaphylaxis Registries Across the World
by: Guillaume Pouessel, et al.
Published: (2025)
by: Guillaume Pouessel, et al.
Published: (2025)
VividMed: Vision Language Model with Versatile Visual Grounding for Medicine
by: Luo, Lingxiao, et al.
Published: (2024)
by: Luo, Lingxiao, et al.
Published: (2024)
The Dark Side of Upgrades: Uncovering Security Risks in Smart Contract Upgrades
by: Wang, Dingding, et al.
Published: (2025)
by: Wang, Dingding, et al.
Published: (2025)
SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks?
by: Deng, Xiang, et al.
Published: (2025)
by: Deng, Xiang, et al.
Published: (2025)
HSCO-Bench: An Agent-Driven End-to-End Hardware-Software Co-design Benchmark for Systems-on-Chip
by: Tsai, Pei-Huan, et al.
Published: (2026)
by: Tsai, Pei-Huan, et al.
Published: (2026)
Limiting Spectral Distribution of High-dimensional Multivariate Kendall-$τ$
by: Wu, Ruoyu
Published: (2025)
by: Wu, Ruoyu
Published: (2025)
Roadmap on Advancements of the FHI-aims Software Package
by: Abbott, Joseph W., et al.
Published: (2025)
by: Abbott, Joseph W., et al.
Published: (2025)
Dual role of Glossy15 in regulating flowering by modulating gibberellins and floral organ gene expression in maize
by: Juan Yang, et al.
Published: (2025)
by: Juan Yang, et al.
Published: (2025)
LoRASuite: Efficient LoRA Adaptation Across Large Language Model Upgrades
by: Li, Yanan, et al.
Published: (2025)
by: Li, Yanan, et al.
Published: (2025)
Demystifying the Characteristics for Smart Contract Upgrades
by: Liu, Ye, et al.
Published: (2024)
by: Liu, Ye, et al.
Published: (2024)
Beyond Ten Turns: Unlocking Long-Horizon Agentic Search with Large-Scale Asynchronous RL
by: Gao, Jiaxuan, et al.
Published: (2025)
by: Gao, Jiaxuan, et al.
Published: (2025)
CookBench: A Long-Horizon Embodied Planning Benchmark for Complex Cooking Scenarios
by: Cai, Muzhen, et al.
Published: (2025)
by: Cai, Muzhen, et al.
Published: (2025)
Similar Items
-
Step-wise Rubric Rewards for LLM Reasoning
by: Xie, Weichu, et al.
Published: (2026) -
Improving MLLM Training Efficiency via Stage-Aware Sparsity
by: Shi, Kean, et al.
Published: (2025) -
BabyVision: Visual Reasoning Beyond Language
by: Chen, Liang, et al.
Published: (2026) -
Agentic Software Engineering: Foundational Pillars and a Research Roadmap
by: Hassan, Ahmed E., et al.
Published: (2025) -
RareBench: Can LLMs Serve as Rare Diseases Specialists?
by: Chen, Xuanzhong, et al.
Published: (2024)