:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Li, Li, Wu, Yongliang, Zhu, Jingze, Peng, Jiawei, Cai, Jianfei, Yang, Xu
Format:	Preprint
Published:	2025
Subjects:	Computation and Language Artificial Intelligence
Online Access:	https://arxiv.org/abs/2507.08021
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Wonderful Matrices: Combining for a More Efficient and Effective Foundation Model Architecture
by: Shi, Jingze, et al.
Published: (2024)

Demonstration Selection for In-Context Learning via Reinforcement Learning
by: Wang, Xubin, et al.
Published: (2024)

FlashBlock: Attention Caching for Efficient Long-Context Block Diffusion
by: Chen, Zhuokun, et al.
Published: (2026)

OmniMoE: An Efficient MoE by Orchestrating Atomic Experts at Scale
by: Shi, Jingze, et al.
Published: (2026)

Unveiling and Addressing Pseudo Forgetting in Large Language Models
by: Sun, Huashan, et al.
Published: (2024)

The Why Behind the Action: Unveiling Internal Drivers via Agentic Attribution
by: Qian, Chen, et al.
Published: (2026)

A Fusion Approach of Dependency Syntax and Sentiment Polarity for Feature Label Extraction in Commodity Reviews
by: Xu, Jianfei
Published: (2024)

To Trust or Not to Trust? Enhancing Large Language Models' Situated Faithfulness to External Contexts
by: Huang, Yukun, et al.
Published: (2024)

Unveiling the Invisible: Captioning Videos with Metaphors
by: Kalarani, Abisek Rajakumar, et al.
Published: (2024)

LAPO: Internalizing Reasoning Efficiency via Length-Adaptive Policy Optimization
by: Wu, Xingyu, et al.
Published: (2025)

UniBias: Unveiling and Mitigating LLM Bias through Internal Attention and FFN Manipulation
by: Zhou, Hanzhang, et al.
Published: (2024)

Pre-training Limited Memory Language Models with Internal and External Knowledge
by: Zhao, Linxi, et al.
Published: (2025)

WsiCaption: Multiple Instance Generation of Pathology Reports for Gigapixel Whole-Slide Images
by: Chen, Pingyi, et al.
Published: (2023)

Internal Knowledge Without External Expression: Probing the Generalization Boundary of a Classical Chinese Language Model
by: Chen, Jiuting, et al.
Published: (2026)

The Role of Data Curation in Image Captioning
by: Li, Wenyan, et al.
Published: (2023)

Text-only Synthesis for Image Captioning
by: Zhou, Qing, et al.
Published: (2024)

Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent
by: Huang, Ziyang, et al.
Published: (2025)

Exploring Diverse In-Context Configurations for Image Captioning
by: Yang, Xu, et al.
Published: (2023)

From Clicks to Preference: A Multi-stage Alignment Framework for Generative Query Suggestion in Conversational System
by: Yin, Junhao, et al.
Published: (2025)

TransXSSM: A Hybrid Transformer State Space Model with Unified Rotary Position Embedding
by: Wu, Bingheng, et al.
Published: (2025)

TempPerturb-Eval: On the Joint Effects of Internal Temperature and External Perturbations in RAG Robustness
by: Zhou, Yongxin, et al.
Published: (2025)

What External Knowledge is Preferred by LLMs? Characterizing and Exploring Chain of Evidence in Imperfect Context for Multi-Hop QA
by: Chang, Zhiyuan, et al.
Published: (2024)

InftyThink: Breaking the Length Limits of Long-Context Reasoning in Large Language Models
by: Yan, Yuchen, et al.
Published: (2025)

A Survey on Transformer Context Extension: Approaches and Evaluation
by: Liu, Yijun, et al.
Published: (2025)

Mastering Board Games by External and Internal Planning with Language Models
by: Schultz, John, et al.
Published: (2024)

TInR: Exploring Tool-Internalized Reasoning in Large Language Models
by: Xu, Qiancheng, et al.
Published: (2026)

FACT: Examining the Effectiveness of Iterative Context Rewriting for Multi-fact Retrieval
by: Wang, Jinlin, et al.
Published: (2024)

Rule-driven News Captioning
by: Xu, Ning, et al.
Published: (2024)

CoT Vectors: Transferring and Probing the Reasoning Mechanisms of LLMs
by: Li, Li, et al.
Published: (2025)

END: Early Noise Dropping for Efficient and Effective Context Denoising
by: Jin, Hongye, et al.
Published: (2025)

OTCE: Hybrid SSM and Attention with Cross Domain Mixture of Experts to construct Observer-Thinker-Conceiver-Expresser
by: Shi, Jingze, et al.
Published: (2024)

Doc-to-LoRA: Learning to Instantly Internalize Contexts
by: Charakorn, Rujikorn, et al.
Published: (2026)

Internal Reasoning vs. External Control: A Thermodynamic Analysis of Sycophancy in Large Language Models
by: Chang, Edward Y.
Published: (2025)

Bridging Internal Probability and Self-Consistency for Effective and Efficient LLM Reasoning
by: Zhou, Zhi, et al.
Published: (2025)

Context Engineering 2.0: The Context of Context Engineering
by: Hua, Qishuo, et al.
Published: (2025)

InftyThink+: Effective and Efficient Infinite-Horizon Reasoning via Reinforcement Learning
by: Yan, Yuchen, et al.
Published: (2026)

Identifying and Mitigating Social Bias Knowledge in Language Models
by: Chen, Ruizhe, et al.
Published: (2024)

Captions Speak Louder than Images: Generalizing Foundation Models for E-commerce from High-quality Multimodal Instruction Data
by: Ling, Xinyi, et al.
Published: (2024)

Enhancing Text Annotation through Rationale-Driven Collaborative Few-Shot Prompting
by: Wu, Jianfei, et al.
Published: (2024)

Trainable Dynamic Mask Sparse Attention
by: Shi, Jingze, et al.
Published: (2025)