| Main Authors: | Wei, Jingxuan, Li, Siyuan, Xu, Yuhang, Sun, Zheng, Jiang, Junjie, Jin, Hexuan, Jia, Caijun, He, Honghao, Xu, Xinglong, Bai, Xi, Yu, Chang, Liu, Yumou, Zhu, Junnan, Zhou, Xuanhe, Chen, Jintao, Hu, Xiaobin, Pang, Shancheng, Yu, Bihui, He, Ran, Lei, Zhen, Li, Stan Z., He, Conghui, Yan, Shuicheng, Tan, Cheng |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.23152 |
Similar Items
PaperFit: Vision-in-the-Loop Typesetting Optimization for Scientific Documents
by: Yu, Bihui, et al.
Published: (2026)
GGBench: A Geometric Generative Reasoning Benchmark for Unified Multimodal Models
by: Wei, Jingxuan, et al.
Published: (2025)
Geoint-R1: Formalizing Multimodal Geometric Reasoning with Dynamic Auxiliary Constructions
by: Wei, Jingxuan, et al.
Published: (2025)
PAGER: Bridging the Semantic-Execution Gap in Point-Precise Geometric GUI Control
by: Wei, Jingxuan, et al.
Published: (2026)
Programming with Data: Test-Driven Data Engineering for Self-Improving LLMs from Raw Corpora
by: Pan, Chenkai, et al.
Published: (2026)
Thinking with Drafting: Optical Decompression via Logical Reconstruction
by: Wei, Jingxuan, et al.
Published: (2026)
ChartReasoner: Code-Driven Modality Bridging for Long-Chain Reasoning in Chart Question Answering
by: Jia, Caijun, et al.
Published: (2025)
How RL Unlocks the Aha Moment in Geometric Interleaved Reasoning
by: Zhang, Xiangxiang, et al.
Published: (2026)
TRACER: Verifiable Generative Provenance for Multimodal Tool-Using Agents
by: Yu, Bihui, et al.
Published: (2026)
Intern-Atlas: A Methodological Evolution Graph as Research Infrastructure for AI Scientists
by: Wu, Yujun, et al.
Published: (2026)
GenProve: Learning to Generate Text with Fine-Grained Provenance
by: Wei, Jingxuan, et al.
Published: (2026)
Canvas-of-Thought: Grounding Reasoning via Mutable Structured States
by: Sun, Lingzhuang, et al.
Published: (2026)
BEATS: Optimizing LLM Mathematical Capabilities with BackVerify and Adaptive Disambiguate based Efficient Tree Search
by: Sun, Linzhuang, et al.
Published: (2024)
Synth-Empathy: Towards High-Quality Synthetic Empathy Data
by: Liang, Hao, et al.
Published: (2024)
ChartMind: A Comprehensive Benchmark for Complex Real-world Multimodal Chart Question Answering
by: Wei, Jingxuan, et al.
Published: (2025)
Boosting the Power of Small Multimodal Reasoning Models to Match Larger Models with Self-Consistency Training
by: Tan, Cheng, et al.
Published: (2023)
Retrieval Meets Reasoning: Even High-school Textbook Knowledge Benefits Multimodal Reasoning
by: Tan, Cheng, et al.
Published: (2024)
SketchAgent: Generating Structured Diagrams from Hand-Drawn Sketches
by: Tan, Cheng, et al.
Published: (2025)
Decouple to Generalize: Context-First Self-Evolving Learning for Data-Scarce Vision-Language Reasoning
by: Li, Tingyu, et al.
Published: (2025)
LLaVA-NeuMT: Selective Layer-Neuron Modulation for Efficient Multilingual Multimodal Translation
by: Wei, Jingxuan, et al.
Published: (2025)
MoDora: Tree-Based Semi-Structured Document Analysis System
by: Xu, Bangrui, et al.
Published: (2026)
A Triple-Bregman Balanced Primal-Dual Algorithm for Saddle Point Problems
by: Yu, Jintao, et al.
Published: (2025)
Rational Sensibility: LLM Enhanced Empathetic Response Generation Guided by Self-presentation Theory
by: Sun, Linzhuang, et al.
Published: (2023)
Sentence-Level or Token-Level? A Comprehensive Study on Knowledge Distillation
by: Wei, Jingxuan, et al.
Published: (2024)
ResearchPulse: Building Method-Experiment Chains through Multi-Document Scientific Inference
by: Chen, Qi, et al.
Published: (2025)
Envision: Benchmarking Unified Understanding & Generation for Causal World Process Insights
by: Tian, Juanxi, et al.
Published: (2025)
Automating Database-Native Function Code Synthesis with LLMs
by: Zhou, Wei, et al.
Published: (2026)
Boosting Reasoning in Large Multimodal Models via Activation Replay
by: Xing, Yun, et al.
Published: (2025)
Ambient Moisture as Energy Source: MEG Technology toward Self‐Powered Wearable Sensors
by: Na Li, et al.
Published: (2025)
TrinityDNA: A Bio-Inspired Foundational Model for Efficient Long-Sequence DNA Modeling
by: Yang, Qirong, et al.
Published: (2025)
From Words to Structured Visuals: A Benchmark and Framework for Text-to-Diagram Generation and Editing
by: Wei, Jingxuan, et al.
Published: (2024)
Lost in Tokenization: Context as the Key to Unlocking Biomolecular Understanding in Scientific LLMs
by: Zhuang, Kai, et al.
Published: (2025)
A symmetric primal-dual algorithmic framework for saddle point problems
by: He, Hongjin, et al.
Published: (2022)
Ridge Estimation of High Dimensional Two-Way Fixed Effect Regression
by: He, Junnan, et al.
Published: (2026)
Visual Multi-Agent System: Mitigating Hallucination Snowballing via Visual Flow
by: Yu, Xinlei, et al.
Published: (2025)
VisMem: Latent Vision Memory Unlocks Potential of Vision-Language Models
by: Yu, Xinlei, et al.
Published: (2025)
Trinity: A Modular Humanoid Robot AI System
by: Sun, Jingkai, et al.
Published: (2025)
Brain-inspired Computing Based on Deep Learning for Human-computer Interaction: A Review
by: Yu, Bihui, et al.
Published: (2023)
Molecularly Engineered Quaternized κ‐Carrageenan: a Multifunctional Platform for Atmospheric Water Harvesting, Moisture‐Electricity Generation, and Self‐powered Wearable Sensors
by: Na Li, et al.
Published: (2025)
Guided Verifier: Collaborative Multimodal Reasoning via Dynamic Process Supervision
by: Sun, Lingzhuang, et al.
Published: (2026)