Saved in:
| Main Authors: | Zhong, Qimin, Liao, Hao, Wang, Siwei, Zhou, Mingyang, Wu, Xiaoqun, Mao, Rui, Chen, Wei |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.23186 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Toward Consistent World Models with Multi-Token Prediction and Latent Semantic Enhancement
by: Zhong, Qimin, et al.
Published: (2026)
by: Zhong, Qimin, et al.
Published: (2026)
Exploring Loss Landscapes through the Lens of Spin Glass Theory
by: Liao, Hao, et al.
Published: (2024)
by: Liao, Hao, et al.
Published: (2024)
ALPINE: Unveiling the Planning Capability of Autoregressive Learning in Language Models
by: Wang, Siwei, et al.
Published: (2024)
by: Wang, Siwei, et al.
Published: (2024)
Fine-Tuning Medical Language Models for Enhanced Long-Contextual Understanding and Domain Expertise
by: Yang, Qimin, et al.
Published: (2024)
by: Yang, Qimin, et al.
Published: (2024)
How Transformers Learn to Plan via Multi-Token Prediction
by: Huang, Jianhao, et al.
Published: (2026)
by: Huang, Jianhao, et al.
Published: (2026)
Large Language Model's Multi-Capability Alignment in Biomedical Domain
by: Wu, Wentao, et al.
Published: (2025)
by: Wu, Wentao, et al.
Published: (2025)
PersonaArena: Dynamic Simulation for Evaluating and Enhancing Persona-Level Role-Playing in Large Language Models
by: Shi, Wenlong, et al.
Published: (2026)
by: Shi, Wenlong, et al.
Published: (2026)
CNSL-bench: Benchmarking the Sign Language Understanding Capabilities of MLLMs on Chinese National Sign Language
by: Zhao, Rui, et al.
Published: (2026)
by: Zhao, Rui, et al.
Published: (2026)
LLM-State: Open World State Representation for Long-horizon Task Planning with Large Language Model
by: Chen, Siwei, et al.
Published: (2023)
by: Chen, Siwei, et al.
Published: (2023)
Enhancing Emotional Generation Capability of Large Language Models via Emotional Chain-of-Thought
by: Li, Zaijing, et al.
Published: (2024)
by: Li, Zaijing, et al.
Published: (2024)
Revisiting the Travel Planning Capabilities of Large Language Models
by: Zhang, Bo-Wen, et al.
Published: (2026)
by: Zhang, Bo-Wen, et al.
Published: (2026)
Aligning Large Language Models for Controllable Recommendations
by: Lu, Wensheng, et al.
Published: (2024)
by: Lu, Wensheng, et al.
Published: (2024)
Scaling Capability in Token Space: An Analysis of Large Vision Language Model
by: Li, Tenghui, et al.
Published: (2024)
by: Li, Tenghui, et al.
Published: (2024)
Sheet as Token: A Graph-Enhanced Representation for Multi-Sheet Spreadsheet Understanding
by: Lei, Yiming, et al.
Published: (2026)
by: Lei, Yiming, et al.
Published: (2026)
Re-Search for The Truth: Multi-round Retrieval-augmented Large Language Models are Strong Fake News Detectors
by: Li, Guanghua, et al.
Published: (2024)
by: Li, Guanghua, et al.
Published: (2024)
GenSwarm: Scalable Multi-Robot Code-Policy Generation and Deployment via Language Models
by: Ji, Wenkang, et al.
Published: (2025)
by: Ji, Wenkang, et al.
Published: (2025)
Cooperative Strategic Planning Enhances Reasoning Capabilities in Large Language Models
by: Wang, Danqing, et al.
Published: (2024)
by: Wang, Danqing, et al.
Published: (2024)
Token-Supervised Value Models for Enhancing Mathematical Problem-Solving Capabilities of Large Language Models
by: Lee, Jung Hyun, et al.
Published: (2024)
by: Lee, Jung Hyun, et al.
Published: (2024)
Benefits and Pitfalls of Reinforcement Learning for Language Model Planning: A Theoretical Perspective
by: Wang, Siwei, et al.
Published: (2025)
by: Wang, Siwei, et al.
Published: (2025)
VocalNet-M2: Advancing Low-Latency Spoken Language Modeling via Integrated Multi-Codebook Tokenization and Multi-Token Prediction
by: Wang, Yuhao, et al.
Published: (2025)
by: Wang, Yuhao, et al.
Published: (2025)
DuoTok: Source-Aware Dual-Track Tokenization for Multi-Track Music Language Modeling
by: Lin, Rui, et al.
Published: (2025)
by: Lin, Rui, et al.
Published: (2025)
Unlocking Large Language Model's Planning Capabilities with Maximum Diversity Fine-tuning
by: Li, Wenjun, et al.
Published: (2024)
by: Li, Wenjun, et al.
Published: (2024)
Perceptio: Perception Enhanced Vision Language Models via Spatial Token Generation
by: Li, Yuchen, et al.
Published: (2026)
by: Li, Yuchen, et al.
Published: (2026)
Beyond Hard Masks: Progressive Token Evolution for Diffusion Language Models
by: Zhong, Linhao, et al.
Published: (2026)
by: Zhong, Linhao, et al.
Published: (2026)
Understanding Textual Capability Degradation in Speech LLMs via Parameter Importance Analysis
by: Wang, Chao, et al.
Published: (2025)
by: Wang, Chao, et al.
Published: (2025)
Eliminating Out-of-Domain Recommendations in LLM-based Recommender Systems: A Unified View
by: Liao, Hao, et al.
Published: (2025)
by: Liao, Hao, et al.
Published: (2025)
Enhancing Conflict Resolution in Language Models via Abstract Argumentation
by: Li, Zhaoqun, et al.
Published: (2024)
by: Li, Zhaoqun, et al.
Published: (2024)
Pre-Training Curriculum for Multi-Token Prediction in Language Models
by: Aynetdinov, Ansar, et al.
Published: (2025)
by: Aynetdinov, Ansar, et al.
Published: (2025)
X-MAS: Towards Building Multi-Agent Systems with Heterogeneous LLMs
by: Ye, Rui, et al.
Published: (2025)
by: Ye, Rui, et al.
Published: (2025)
Biology-Instructions: A Dataset and Benchmark for Multi-Omics Sequence Understanding Capability of Large Language Models
by: He, Haonan, et al.
Published: (2024)
by: He, Haonan, et al.
Published: (2024)
WorkflowLLM: Enhancing Workflow Orchestration Capability of Large Language Models
by: Fan, Shengda, et al.
Published: (2024)
by: Fan, Shengda, et al.
Published: (2024)
R-Capsule: Compressing High-Level Plans for Efficient Large Language Model Reasoning
by: Shan, Hongyu, et al.
Published: (2025)
by: Shan, Hongyu, et al.
Published: (2025)
Multi-RAG: A Multimodal Retrieval-Augmented Generation System for Adaptive Video Understanding
by: Mao, Mingyang, et al.
Published: (2025)
by: Mao, Mingyang, et al.
Published: (2025)
Learning Planning Abstractions from Language
by: Liu, Weiyu, et al.
Published: (2024)
by: Liu, Weiyu, et al.
Published: (2024)
How Confident Is the First Token? An Uncertainty-Calibrated Prompt Optimization Framework for Large Language Model Classification and Understanding
by: Chen, Wei, et al.
Published: (2026)
by: Chen, Wei, et al.
Published: (2026)
Tokenization Allows Multimodal Large Language Models to Understand, Generate and Edit Architectural Floor Plans
by: Qin, Sizhong, et al.
Published: (2026)
by: Qin, Sizhong, et al.
Published: (2026)
What Kind of Visual Tokens Do We Need? Training-free Visual Token Pruning for Multi-modal Large Language Models from the Perspective of Graph
by: Jiang, Yutao, et al.
Published: (2025)
by: Jiang, Yutao, et al.
Published: (2025)
Enhancing Text-to-SQL Capabilities of Large Language Models via Domain Database Knowledge Injection
by: Ma, Xingyu, et al.
Published: (2024)
by: Ma, Xingyu, et al.
Published: (2024)
Learning to Plan for Retrieval-Augmented Large Language Models from Knowledge Graphs
by: Wang, Junjie, et al.
Published: (2024)
by: Wang, Junjie, et al.
Published: (2024)
Preservation of Language Understanding Capabilities in Speech-aware Large Language Models
by: Kubis, Marek, et al.
Published: (2025)
by: Kubis, Marek, et al.
Published: (2025)
Similar Items
-
Toward Consistent World Models with Multi-Token Prediction and Latent Semantic Enhancement
by: Zhong, Qimin, et al.
Published: (2026) -
Exploring Loss Landscapes through the Lens of Spin Glass Theory
by: Liao, Hao, et al.
Published: (2024) -
ALPINE: Unveiling the Planning Capability of Autoregressive Learning in Language Models
by: Wang, Siwei, et al.
Published: (2024) -
Fine-Tuning Medical Language Models for Enhanced Long-Contextual Understanding and Domain Expertise
by: Yang, Qimin, et al.
Published: (2024) -
How Transformers Learn to Plan via Multi-Token Prediction
by: Huang, Jianhao, et al.
Published: (2026)