| Main Authors: | Luo, Lizhuo, Shi, Zhuoran, Luo, Jiajun, Wang, Zhi, Ren, Shen, Wang, Wenya, Zhang, Tianwei |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.06953 |
Similar Items
DSB: Dynamic Sliding Block Scheduling for Diffusion LLMs
by: Luo, Lizhuo, et al.
Published: (2026)
Staleness-Centric Optimizations for Parallel Diffusion MoE Inference
by: Luo, Jiajun, et al.
Published: (2024)
Are Machines Better at Complex Reasoning? Unveiling Human-Machine Inference Gaps in Entailment Verification
by: Sanyal, Soumya, et al.
Published: (2024)
DAWN-ICL: Strategic Planning of Problem-solving Trajectories for Zero-Shot In-Context Learning
by: Tang, Xinyu, et al.
Published: (2024)
Adaptive Detoxification: Safeguarding General Capabilities of LLMs through Toxicity-Aware Knowledge Editing
by: Lu, Yifan, et al.
Published: (2025)
TAD: Temporal-Aware Trajectory Self-Distillation for Fast and Accurate Diffusion LLM
by: Zhou, Haoyang, et al.
Published: (2026)
KV-CoRE: Benchmarking Data-Dependent Low-Rank Compressibility of KV-Caches in LLMs
by: Chen, Jian, et al.
Published: (2026)
LADM: Long-context Training Data Selection with Attention-based Dependency Measurement for LLMs
by: Chen, Jianghao, et al.
Published: (2025)
DART: Diffusion-Inspired Speculative Decoding for Fast LLM Inference
by: Liu, Fuliang, et al.
Published: (2026)
LLMs Could Autonomously Learn Without External Supervision
by: Ji, Ke, et al.
Published: (2024)
Can Language Models Act as Knowledge Bases at Scale?
by: He, Qiyuan, et al.
Published: (2024)
LLMs for Doctors: Leveraging Medical LLMs to Assist Doctors, Not Replace Them
by: Xie, Wenya, et al.
Published: (2024)
SecFormer: Fast and Accurate Privacy-Preserving Inference for Transformer Models via SMPC
by: Luo, Jinglong, et al.
Published: (2024)
Large Language Model-Enhanced Symbolic Reasoning for Knowledge Base Completion
by: He, Qiyuan, et al.
Published: (2025)
Weakly-supervised Domain Adaption for Aspect Extraction via Multi-level Interaction Transfer
by: Liang, Tao, et al.
Published: (2020)
Flux Attention: Context-Aware Hybrid Attention for Efficient LLMs Inference
by: Qiu, Quantong, et al.
Published: (2026)
FREE: Uncertainty-Aware Autoregression for Parallel Diffusion Transformers
by: Wen, Xinwan, et al.
Published: (2025)
ART: Attention Replacement Technique to Improve Factuality in LLMs
by: Luo, Ziqin, et al.
Published: (2026)
From Blind Guess to Informed Judgment: Teaching LLMs to Evaluate Materials by Building Knowledge-Augmented Preference Signals
by: Yu, Yeyong, et al.
Published: (2026)
Task-Aware LLM Routing with Multi-Level Task-Profile-Guided Data Synthesis for Cold-Start Scenarios
by: Liu, Hui, et al.
Published: (2026)
Personalizing LLMs with Binary Feedback: A Preference-Corrected Optimization Framework
by: Ma, Xilai, et al.
Published: (2026)
JRE-L: Journalist, Reader, and Editor LLMs in the Loop for Science Journalism for the General Audience
by: Jiang, Gongyao, et al.
Published: (2025)
d$^2$Cache: Accelerating Diffusion-Based LLMs via Dual Adaptive Caching
by: Jiang, Yuchu, et al.
Published: (2025)
Fast-MIA: Efficient and Scalable Membership Inference for LLMs
by: Takahashi, Hiromu, et al.
Published: (2025)
DiffCoT: Diffusion-styled Chain-of-Thought Reasoning in LLMs
by: Cao, Shidong, et al.
Published: (2026)
Continuously Steering LLMs Sensitivity to Contextual Knowledge with Proxy Models
by: Wang, Yilin, et al.
Published: (2025)
Fast Thinking for Large Language Models
by: Zheng, Haoyu, et al.
Published: (2025)
Why Not Act on What You Know? Unleashing Safety Potential of LLMs via Self-Aware Guard Enhancement
by: Ding, Peng, et al.
Published: (2025)
ProcTag: Process Tagging for Assessing the Efficacy of Document Instruction Data
by: Shen, Yufan, et al.
Published: (2024)
S2D2: Fast Decoding for Diffusion LLMs via Training-Free Self-Speculation
by: Han, Ligong, et al.
Published: (2026)
Mitigating Context-Memory Conflicts in LLMs through Dynamic Cognitive Reconciliation Decoding
by: Zhou, Yigeng, et al.
Published: (2026)
Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference
by: Song, Yuxuan, et al.
Published: (2025)
HUMORCHAIN: Theory-Guided Multi-Stage Reasoning for Interpretable Multimodal Humor Generation
by: Zhang, Jiajun, et al.
Published: (2025)
Understanding and Mitigating Numerical Sources of Nondeterminism in LLM Inference
by: Yuan, Jiayi, et al.
Published: (2025)
Adaptive Layer-skipping in Pre-trained LLMs
by: Luo, Xuan, et al.
Published: (2025)
Fast-Decoding Diffusion Language Models via Progress-Aware Confidence Schedules
by: Mohamed, Amr, et al.
Published: (2025)
A Pluggable Multi-Task Learning Framework for Sentiment-Aware Financial Relation Extraction
by: Luo, Jinming, et al.
Published: (2025)
Fast-dLLM v2: Efficient Block-Diffusion LLM
by: Wu, Chengyue, et al.
Published: (2025)
Implicit Word Reordering with Knowledge Distillation for Cross-Lingual Dependency Parsing
by: Li, Zhuoran, et al.
Published: (2025)
Interpretable Multimodal Misinformation Detection with Logic Reasoning
by: Liu, Hui, et al.
Published: (2023)