| Main Authors: | Tong, Junlong, Wang, Zilong, Ren, YuJie, Yin, Peiran, Wu, Hao, Zhang, Wei, Shen, Xiaoyu |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.04592 |
Similar Items
StreamingThinker: Large Language Models Can Think While Reading
by: Tong, Junlong, et al.
Published: (2025)
Speak While Watching: Unleashing TRUE Real-Time Video Understanding Capability of Multimodal Large Language Models
by: Lin, Junyan, et al.
Published: (2026)
LLM as Effective Streaming Processor: Bridging Streaming-Batch Mismatches with Group Position Encoding
by: Tong, Junlong, et al.
Published: (2025)
Large Language Models and Causal Inference in Collaboration: A Survey
by: Liu, Xiaoyu, et al.
Published: (2024)
Think-as-You-See: Streaming Chain-of-Thought Reasoning for Large Vision-Language Models
by: Zhang, Jialiang, et al.
Published: (2026)
HiDrop: Hierarchical Vision Token Reduction in MLLMs via Late Injection, Concave Pyramid Pruning, and Early Exit
by: Wu, Hao, et al.
Published: (2026)
The Few Govern the Many: Unveiling Few-Layer Dominance for Time Series Models
by: Qiu, Xin, et al.
Published: (2025)
Rethinking the Role of LLMs in Time Series Forecasting
by: Qiu, Xin, et al.
Published: (2026)
EFT-CoT: A Multi-Agent Chain-of-Thought Framework for Emotion-Focused Therapy
by: Du, Lanqing, et al.
Published: (2026)
SkipGPT: Dynamic Layer Pruning Reinvented with Token Awareness and Module Decoupling
by: Zhao, Anhao, et al.
Published: (2025)
Hybrid OCR-LLM Framework for Enterprise-Scale Document Information Extraction Under Copy-heavy Task
by: Wang, Zilong, et al.
Published: (2025)
LangSuitE: Planning, Controlling and Interacting with Large Language Models in Embodied Text Environments
by: Jia, Zixia, et al.
Published: (2024)
Large Language Model Sourcing: A Survey
by: Pang, Liang, et al.
Published: (2025)
$\mathcal{V}isi\mathcal{P}runer$: Decoding Discontinuous Cross-Modal Dynamics for Efficient Multimodal LLMs
by: Fan, Yingqi, et al.
Published: (2025)
UTPTrack: Towards Simple and Unified Token Pruning for Visual Tracking
by: Wu, Hao, et al.
Published: (2026)
A Survey on Data Synthesis and Augmentation for Large Language Models
by: Wang, Ke, et al.
Published: (2024)
What Do Visual Tokens Really Encode? Uncovering Sparsity and Redundancy in Multimodal Large Language Models
by: Fan, Yingqi, et al.
Published: (2026)
Dynamic Compressing Prompts for Efficient Inference of Large Language Models
by: Hu, Jinwu, et al.
Published: (2025)
Decoupling KL and Trajectories: A Unified Perspective for SFT, DAgger, Offline RL, and OPD in LLM Distillation
by: Zhao, Anhao, et al.
Published: (2026)
A Survey on Efficient Large Language Model Training: From Data-centric Perspectives
by: Luo, Junyu, et al.
Published: (2025)
Assessing "Implicit" Retrieval Robustness of Large Language Models
by: Shen, Xiaoyu, et al.
Published: (2024)
A Survey on Efficient Inference for Large Language Models
by: Zhou, Zixuan, et al.
Published: (2024)
A Survey of AIOps in the Era of Large Language Models
by: Zhang, Lingzhe, et al.
Published: (2025)
FineSteer: A Unified Framework for Fine-Grained Inference-Time Steering in Large Language Models
by: Weng, Zixuan, et al.
Published: (2026)
Mind the Gap! Static and Interactive Evaluations of Large Audio Models
by: Li, Minzhi, et al.
Published: (2025)
SentGuard: Sentence-Level Streaming Guardrails for Large Language Models
by: Yu, Jiaqi, et al.
Published: (2026)
What are Models Thinking about? Understanding Large Language Model Hallucinations "Psychology" through Model Inner State Analysis
by: Wang, Peiran, et al.
Published: (2025)
Large Language Models for Generative Information Extraction: A Survey
by: Xu, Derong, et al.
Published: (2023)
Retrieval Augmented Generation Evaluation in the Era of Large Language Models: A Comprehensive Survey
by: Gan, Aoran, et al.
Published: (2025)
A Survey on Multi-Turn Interaction Capabilities of Large Language Models
by: Zhang, Chen, et al.
Published: (2025)
A Survey of Recent Backdoor Attacks and Defenses in Large Language Models
by: Zhao, Shuai, et al.
Published: (2024)
Knowledge Is Not Static: Order-Aware Hypergraph RAG for Language Models
by: Wu, Keshu, et al.
Published: (2026)
Causal Inference with Large Language Model: A Survey
by: Ma, Jing
Published: (2024)
A Survey on Multimodal Large Language Models
by: Yin, Shukang, et al.
Published: (2023)
Information Suppression in Large Language Models: Auditing, Quantifying, and Characterizing Censorship in DeepSeek
by: Qiu, Peiran, et al.
Published: (2025)
Beyond Static Personas: Situational Personality Steering for Large Language Models
by: Wei, Zesheng, et al.
Published: (2026)
Model Compression and Efficient Inference for Large Language Models: A Survey
by: Wang, Wenxiao, et al.
Published: (2024)
Towards Tracing Trustworthiness Dynamics: Revisiting Pre-training Period of Large Language Models
by: Qian, Chen, et al.
Published: (2024)
Personalization of Large Language Models: A Survey
by: Zhang, Zhehao, et al.
Published: (2024)
Curved Inference: Concern-Sensitive Geometry in Large Language Model Residual Streams
by: Manson, Rob
Published: (2025)