:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Wei, Jingxuan, Li, Siyuan, Xu, Yuhang, Sun, Zheng, Jiang, Junjie, Jin, Hexuan, Jia, Caijun, He, Honghao, Xu, Xinglong, bai, Xi, Yu, Chang, Liu, Yumou, Zhu, Junnan, Zhou, Xuanhe, Chen, Jintao, Hu, Xiaobin, Pang, Shancheng, Yu, Bihui, He, Ran, Lei, Zhen, Li, Stan Z., He, Conghui, Yan, Shuicheng, Tan, Cheng
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2602.23152
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

PaperFit: Vision-in-the-Loop Typesetting Optimization for Scientific Documents
by: Yu, Bihui, et al.
Published: (2026)

GGBench: A Geometric Generative Reasoning Benchmark for Unified Multimodal Models
by: Wei, Jingxuan, et al.
Published: (2025)

Geoint-R1: Formalizing Multimodal Geometric Reasoning with Dynamic Auxiliary Constructions
by: Wei, Jingxuan, et al.
Published: (2025)

PAGER: Bridging the Semantic-Execution Gap in Point-Precise Geometric GUI Control
by: Wei, Jingxuan, et al.
Published: (2026)

Programming with Data: Test-Driven Data Engineering for Self-Improving LLMs from Raw Corpora
by: Pan, Chenkai, et al.
Published: (2026)

Thinking with Drafting: Optical Decompression via Logical Reconstruction
by: Wei, Jingxuan, et al.
Published: (2026)

ChartReasoner: Code-Driven Modality Bridging for Long-Chain Reasoning in Chart Question Answering
by: Jia, Caijun, et al.
Published: (2025)

How RL Unlocks the Aha Moment in Geometric Interleaved Reasoning
by: Zhang, Xiangxiang, et al.
Published: (2026)

TRACER: Verifiable Generative Provenance for Multimodal Tool-Using Agents
by: Yu, Bihui, et al.
Published: (2026)

Intern-Atlas: A Methodological Evolution Graph as Research Infrastructure for AI Scientists
by: Wu, Yujun, et al.
Published: (2026)

GenProve: Learning to Generate Text with Fine-Grained Provenance
by: Wei, Jingxuan, et al.
Published: (2026)

Canvas-of-Thought: Grounding Reasoning via Mutable Structured States
by: Sun, Lingzhuang, et al.
Published: (2026)

BEATS: Optimizing LLM Mathematical Capabilities with BackVerify and Adaptive Disambiguate based Efficient Tree Search
by: Sun, Linzhuang, et al.
Published: (2024)

Synth-Empathy: Towards High-Quality Synthetic Empathy Data
by: Liang, Hao, et al.
Published: (2024)

ChartMind: A Comprehensive Benchmark for Complex Real-world Multimodal Chart Question Answering
by: Wei, Jingxuan, et al.
Published: (2025)

Boosting the Power of Small Multimodal Reasoning Models to Match Larger Models with Self-Consistency Training
by: Tan, Cheng, et al.
Published: (2023)

Retrieval Meets Reasoning: Even High-school Textbook Knowledge Benefits Multimodal Reasoning
by: Tan, Cheng, et al.
Published: (2024)

SketchAgent: Generating Structured Diagrams from Hand-Drawn Sketches
by: Tan, Cheng, et al.
Published: (2025)

Decouple to Generalize: Context-First Self-Evolving Learning for Data-Scarce Vision-Language Reasoning
by: Li, Tingyu, et al.
Published: (2025)

LLaVA-NeuMT: Selective Layer-Neuron Modulation for Efficient Multilingual Multimodal Translation
by: Wei, Jingxuan, et al.
Published: (2025)

MoDora: Tree-Based Semi-Structured Document Analysis System
by: Xu, Bangrui, et al.
Published: (2026)

A Triple-Bregman Balanced Primal-Dual Algorithm for Saddle Point Problems
by: Yu, Jintao, et al.
Published: (2025)

Rational Sensibility: LLM Enhanced Empathetic Response Generation Guided by Self-presentation Theory
by: Sun, Linzhuang, et al.
Published: (2023)

Sentence-Level or Token-Level? A Comprehensive Study on Knowledge Distillation
by: Wei, Jingxuan, et al.
Published: (2024)

ResearchPulse: Building Method-Experiment Chains through Multi-Document Scientific Inference
by: Chen, Qi, et al.
Published: (2025)

Envision: Benchmarking Unified Understanding & Generation for Causal World Process Insights
by: Tian, Juanxi, et al.
Published: (2025)

Automating Database-Native Function Code Synthesis with LLMs
by: Zhou, Wei, et al.
Published: (2026)

Boosting Reasoning in Large Multimodal Models via Activation Replay
by: Xing, Yun, et al.
Published: (2025)

Ambient Moisture as Energy Source: MEG Technology toward Self‐Powered Wearable Sensors
by: Na Li, et al.
Published: (2025)

TrinityDNA: A Bio-Inspired Foundational Model for Efficient Long-Sequence DNA Modeling
by: Yang, Qirong, et al.
Published: (2025)

From Words to Structured Visuals: A Benchmark and Framework for Text-to-Diagram Generation and Editing
by: Wei, Jingxuan, et al.
Published: (2024)

Lost in Tokenization: Context as the Key to Unlocking Biomolecular Understanding in Scientific LLMs
by: Zhuang, Kai, et al.
Published: (2025)

A symmetric primal-dual algorithmic framework for saddle point problems
by: He, Hongjin, et al.
Published: (2022)

Ridge Estimation of High Dimensional Two-Way Fixed Effect Regression
by: He, Junnan, et al.
Published: (2026)

Visual Multi-Agent System: Mitigating Hallucination Snowballing via Visual Flow
by: Yu, Xinlei, et al.
Published: (2025)

VisMem: Latent Vision Memory Unlocks Potential of Vision-Language Models
by: Yu, Xinlei, et al.
Published: (2025)

Trinity: A Modular Humanoid Robot AI System
by: Sun, Jingkai, et al.
Published: (2025)

Brain-inspired Computing Based on Deep Learning for Human-computer Interaction: A Review
by: Yu, Bihui, et al.
Published: (2023)

Molecularly Engineered Quaternized κ ‐Carrageenan: a Multifunctional Platform for Atmospheric Water Harvesting, Moisture‐Electricity Generation, and Self‐powered Wearable Sensors
by: Na Li, et al.
Published: (2025)

Guided Verifier: Collaborative Multimodal Reasoning via Dynamic Process Supervision
by: Sun, Lingzhuang, et al.
Published: (2026)