Saved in:
| Main Authors: | Li, Li, Wu, Yongliang, Zhu, Jingze, Peng, Jiawei, Cai, Jianfei, Yang, Xu |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2507.08021 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Wonderful Matrices: Combining for a More Efficient and Effective Foundation Model Architecture
by: Shi, Jingze, et al.
Published: (2024)
by: Shi, Jingze, et al.
Published: (2024)
Demonstration Selection for In-Context Learning via Reinforcement Learning
by: Wang, Xubin, et al.
Published: (2024)
by: Wang, Xubin, et al.
Published: (2024)
FlashBlock: Attention Caching for Efficient Long-Context Block Diffusion
by: Chen, Zhuokun, et al.
Published: (2026)
by: Chen, Zhuokun, et al.
Published: (2026)
OmniMoE: An Efficient MoE by Orchestrating Atomic Experts at Scale
by: Shi, Jingze, et al.
Published: (2026)
by: Shi, Jingze, et al.
Published: (2026)
Unveiling and Addressing Pseudo Forgetting in Large Language Models
by: Sun, Huashan, et al.
Published: (2024)
by: Sun, Huashan, et al.
Published: (2024)
The Why Behind the Action: Unveiling Internal Drivers via Agentic Attribution
by: Qian, Chen, et al.
Published: (2026)
by: Qian, Chen, et al.
Published: (2026)
A Fusion Approach of Dependency Syntax and Sentiment Polarity for Feature Label Extraction in Commodity Reviews
by: Xu, Jianfei
Published: (2024)
by: Xu, Jianfei
Published: (2024)
To Trust or Not to Trust? Enhancing Large Language Models' Situated Faithfulness to External Contexts
by: Huang, Yukun, et al.
Published: (2024)
by: Huang, Yukun, et al.
Published: (2024)
Unveiling the Invisible: Captioning Videos with Metaphors
by: Kalarani, Abisek Rajakumar, et al.
Published: (2024)
by: Kalarani, Abisek Rajakumar, et al.
Published: (2024)
LAPO: Internalizing Reasoning Efficiency via Length-Adaptive Policy Optimization
by: Wu, Xingyu, et al.
Published: (2025)
by: Wu, Xingyu, et al.
Published: (2025)
UniBias: Unveiling and Mitigating LLM Bias through Internal Attention and FFN Manipulation
by: Zhou, Hanzhang, et al.
Published: (2024)
by: Zhou, Hanzhang, et al.
Published: (2024)
Pre-training Limited Memory Language Models with Internal and External Knowledge
by: Zhao, Linxi, et al.
Published: (2025)
by: Zhao, Linxi, et al.
Published: (2025)
WsiCaption: Multiple Instance Generation of Pathology Reports for Gigapixel Whole-Slide Images
by: Chen, Pingyi, et al.
Published: (2023)
by: Chen, Pingyi, et al.
Published: (2023)
Internal Knowledge Without External Expression: Probing the Generalization Boundary of a Classical Chinese Language Model
by: Chen, Jiuting, et al.
Published: (2026)
by: Chen, Jiuting, et al.
Published: (2026)
The Role of Data Curation in Image Captioning
by: Li, Wenyan, et al.
Published: (2023)
by: Li, Wenyan, et al.
Published: (2023)
Text-only Synthesis for Image Captioning
by: Zhou, Qing, et al.
Published: (2024)
by: Zhou, Qing, et al.
Published: (2024)
Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent
by: Huang, Ziyang, et al.
Published: (2025)
by: Huang, Ziyang, et al.
Published: (2025)
Exploring Diverse In-Context Configurations for Image Captioning
by: Yang, Xu, et al.
Published: (2023)
by: Yang, Xu, et al.
Published: (2023)
From Clicks to Preference: A Multi-stage Alignment Framework for Generative Query Suggestion in Conversational System
by: Yin, Junhao, et al.
Published: (2025)
by: Yin, Junhao, et al.
Published: (2025)
TransXSSM: A Hybrid Transformer State Space Model with Unified Rotary Position Embedding
by: Wu, Bingheng, et al.
Published: (2025)
by: Wu, Bingheng, et al.
Published: (2025)
TempPerturb-Eval: On the Joint Effects of Internal Temperature and External Perturbations in RAG Robustness
by: Zhou, Yongxin, et al.
Published: (2025)
by: Zhou, Yongxin, et al.
Published: (2025)
What External Knowledge is Preferred by LLMs? Characterizing and Exploring Chain of Evidence in Imperfect Context for Multi-Hop QA
by: Chang, Zhiyuan, et al.
Published: (2024)
by: Chang, Zhiyuan, et al.
Published: (2024)
InftyThink: Breaking the Length Limits of Long-Context Reasoning in Large Language Models
by: Yan, Yuchen, et al.
Published: (2025)
by: Yan, Yuchen, et al.
Published: (2025)
A Survey on Transformer Context Extension: Approaches and Evaluation
by: Liu, Yijun, et al.
Published: (2025)
by: Liu, Yijun, et al.
Published: (2025)
Mastering Board Games by External and Internal Planning with Language Models
by: Schultz, John, et al.
Published: (2024)
by: Schultz, John, et al.
Published: (2024)
TInR: Exploring Tool-Internalized Reasoning in Large Language Models
by: Xu, Qiancheng, et al.
Published: (2026)
by: Xu, Qiancheng, et al.
Published: (2026)
FACT: Examining the Effectiveness of Iterative Context Rewriting for Multi-fact Retrieval
by: Wang, Jinlin, et al.
Published: (2024)
by: Wang, Jinlin, et al.
Published: (2024)
Rule-driven News Captioning
by: Xu, Ning, et al.
Published: (2024)
by: Xu, Ning, et al.
Published: (2024)
CoT Vectors: Transferring and Probing the Reasoning Mechanisms of LLMs
by: Li, Li, et al.
Published: (2025)
by: Li, Li, et al.
Published: (2025)
END: Early Noise Dropping for Efficient and Effective Context Denoising
by: Jin, Hongye, et al.
Published: (2025)
by: Jin, Hongye, et al.
Published: (2025)
OTCE: Hybrid SSM and Attention with Cross Domain Mixture of Experts to construct Observer-Thinker-Conceiver-Expresser
by: Shi, Jingze, et al.
Published: (2024)
by: Shi, Jingze, et al.
Published: (2024)
Doc-to-LoRA: Learning to Instantly Internalize Contexts
by: Charakorn, Rujikorn, et al.
Published: (2026)
by: Charakorn, Rujikorn, et al.
Published: (2026)
Internal Reasoning vs. External Control: A Thermodynamic Analysis of Sycophancy in Large Language Models
by: Chang, Edward Y.
Published: (2025)
by: Chang, Edward Y.
Published: (2025)
Bridging Internal Probability and Self-Consistency for Effective and Efficient LLM Reasoning
by: Zhou, Zhi, et al.
Published: (2025)
by: Zhou, Zhi, et al.
Published: (2025)
Context Engineering 2.0: The Context of Context Engineering
by: Hua, Qishuo, et al.
Published: (2025)
by: Hua, Qishuo, et al.
Published: (2025)
InftyThink+: Effective and Efficient Infinite-Horizon Reasoning via Reinforcement Learning
by: Yan, Yuchen, et al.
Published: (2026)
by: Yan, Yuchen, et al.
Published: (2026)
Identifying and Mitigating Social Bias Knowledge in Language Models
by: Chen, Ruizhe, et al.
Published: (2024)
by: Chen, Ruizhe, et al.
Published: (2024)
Captions Speak Louder than Images: Generalizing Foundation Models for E-commerce from High-quality Multimodal Instruction Data
by: Ling, Xinyi, et al.
Published: (2024)
by: Ling, Xinyi, et al.
Published: (2024)
Enhancing Text Annotation through Rationale-Driven Collaborative Few-Shot Prompting
by: Wu, Jianfei, et al.
Published: (2024)
by: Wu, Jianfei, et al.
Published: (2024)
Trainable Dynamic Mask Sparse Attention
by: Shi, Jingze, et al.
Published: (2025)
by: Shi, Jingze, et al.
Published: (2025)
Similar Items
-
Wonderful Matrices: Combining for a More Efficient and Effective Foundation Model Architecture
by: Shi, Jingze, et al.
Published: (2024) -
Demonstration Selection for In-Context Learning via Reinforcement Learning
by: Wang, Xubin, et al.
Published: (2024) -
FlashBlock: Attention Caching for Efficient Long-Context Block Diffusion
by: Chen, Zhuokun, et al.
Published: (2026) -
OmniMoE: An Efficient MoE by Orchestrating Atomic Experts at Scale
by: Shi, Jingze, et al.
Published: (2026) -
Unveiling and Addressing Pseudo Forgetting in Large Language Models
by: Sun, Huashan, et al.
Published: (2024)