Saved in:
| Main Authors: | Sun, Si, Zhang, Hanqing, Liu, Zhiyuan, Bao, Jie, Song, Dawei |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2403.01999 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Controllable Text Generation with Residual Memory Transformer
by: Zhang, Hanqing, et al.
Published: (2023)
by: Zhang, Hanqing, et al.
Published: (2023)
Bi-DCSpell: A Bi-directional Detector-Corrector Interactive Framework for Chinese Spelling Check
by: Wu, Haiming, et al.
Published: (2024)
by: Wu, Haiming, et al.
Published: (2024)
ZigzagAttention: Efficient Long-Context Inference with Exclusive Retrieval and Streaming Heads
by: Liu, Zhuorui, et al.
Published: (2025)
by: Liu, Zhuorui, et al.
Published: (2025)
LLM-Oriented Information Retrieval: A Denoising-First Perspective
by: Dai, Lu, et al.
Published: (2026)
by: Dai, Lu, et al.
Published: (2026)
A Two-Stage GPU Kernel Tuner Combining Semantic Refactoring and Search-Based Optimization
by: Qu, Qiuyi, et al.
Published: (2026)
by: Qu, Qiuyi, et al.
Published: (2026)
A Survey on Knowledge-Oriented Retrieval-Augmented Generation
by: Cheng, Mingyue, et al.
Published: (2025)
by: Cheng, Mingyue, et al.
Published: (2025)
TCMIIES: A Browser-Based LLM-Powered Intelligent Information Extraction System for Academic Literature
by: Zhao, Hanqing
Published: (2026)
by: Zhao, Hanqing
Published: (2026)
Utility-Focused LLM Annotation for Retrieval and Retrieval-Augmented Generation
by: Zhang, Hengran, et al.
Published: (2025)
by: Zhang, Hengran, et al.
Published: (2025)
Towards Completeness-Oriented Tool Retrieval for Large Language Models
by: Qu, Changle, et al.
Published: (2024)
by: Qu, Changle, et al.
Published: (2024)
LoRA-Flow: Dynamic LoRA Fusion for Large Language Models in Generative Tasks
by: Wang, Hanqing, et al.
Published: (2024)
by: Wang, Hanqing, et al.
Published: (2024)
Utility-Oriented Visual Evidence Selection for Multimodal Retrieval-Augmented Generation
by: Luo, Weiqing, et al.
Published: (2026)
by: Luo, Weiqing, et al.
Published: (2026)
scAgent: Universal Single-Cell Annotation via a LLM Agent
by: Mao, Yuren, et al.
Published: (2025)
by: Mao, Yuren, et al.
Published: (2025)
Can we Retrieve Everything All at Once? ARM: An Alignment-Oriented LLM-based Retrieval Method
by: Chen, Peter Baile, et al.
Published: (2025)
by: Chen, Peter Baile, et al.
Published: (2025)
WindowKV: Task-Adaptive Group-Wise KV Cache Window Selection for Efficient LLM Inference
by: Zuo, Youhui, et al.
Published: (2025)
by: Zuo, Youhui, et al.
Published: (2025)
ConfTuner: Training Large Language Models to Express Their Confidence Verbally
by: Li, Yibo, et al.
Published: (2025)
by: Li, Yibo, et al.
Published: (2025)
FlexLLM: Exploring LLM Customization for Moving Target Defense on Black-Box LLMs Against Jailbreak Attacks
by: Chen, Bocheng, et al.
Published: (2024)
by: Chen, Bocheng, et al.
Published: (2024)
RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation
by: Zhang, Xuanwang, et al.
Published: (2024)
by: Zhang, Xuanwang, et al.
Published: (2024)
LLM-Rec: Personalized Recommendation via Prompting Large Language Models
by: Lyu, Hanjia, et al.
Published: (2023)
by: Lyu, Hanjia, et al.
Published: (2023)
GuideBench: Benchmarking Domain-Oriented Guideline Following for LLM Agents
by: Diao, Lingxiao, et al.
Published: (2025)
by: Diao, Lingxiao, et al.
Published: (2025)
MIA-Tuner: Adapting Large Language Models as Pre-training Text Detector
by: Fu, Wenjie, et al.
Published: (2024)
by: Fu, Wenjie, et al.
Published: (2024)
LLM-Specific Utility: A New Perspective for Retrieval-Augmented Generation
by: Zhang, Hengran, et al.
Published: (2025)
by: Zhang, Hengran, et al.
Published: (2025)
Logic-Oriented Retriever Enhancement via Contrastive Learning
by: Zhang, Wenxuan, et al.
Published: (2026)
by: Zhang, Wenxuan, et al.
Published: (2026)
Craw4LLM: Efficient Web Crawling for LLM Pretraining
by: Yu, Shi, et al.
Published: (2025)
by: Yu, Shi, et al.
Published: (2025)
RB-SQL: A Retrieval-based LLM Framework for Text-to-SQL
by: Wu, Zhenhe, et al.
Published: (2024)
by: Wu, Zhenhe, et al.
Published: (2024)
Beyond the Speculative Game: A Survey of Speculative Execution in Large Language Models
by: Zhang, Chen, et al.
Published: (2024)
by: Zhang, Chen, et al.
Published: (2024)
MALoRA: Mixture of Asymmetric Low-Rank Adaptation for Enhanced Multi-Task Learning
by: Wang, Xujia, et al.
Published: (2024)
by: Wang, Xujia, et al.
Published: (2024)
LLM-Confidence Reranker: A Training-Free Approach for Enhancing Retrieval-Augmented Generation Systems
by: Song, Zhipeng, et al.
Published: (2026)
by: Song, Zhipeng, et al.
Published: (2026)
SR-LLM: Rethinking the Structured Representation in Large Language Model
by: Zhang, Jiahuan, et al.
Published: (2025)
by: Zhang, Jiahuan, et al.
Published: (2025)
Assessing "Implicit" Retrieval Robustness of Large Language Models
by: Shen, Xiaoyu, et al.
Published: (2024)
by: Shen, Xiaoyu, et al.
Published: (2024)
LLM-Oriented Token-Adaptive Knowledge Distillation
by: Xie, Xurong, et al.
Published: (2025)
by: Xie, Xurong, et al.
Published: (2025)
OrcaRouter: A Production-Oriented LLM Router with Hybrid Offline-Online Learning
by: Bao, Zhenghua, et al.
Published: (2026)
by: Bao, Zhenghua, et al.
Published: (2026)
SurveyBench: Can LLM(-Agents) Write Academic Surveys that Align with Reader Needs?
by: Sun, Zhaojun, et al.
Published: (2025)
by: Sun, Zhaojun, et al.
Published: (2025)
Beyond Static Alignment: Hierarchical Policy Control for LLM Safety via Risk-Aware Chain-of-Thought
by: Si, Jianfeng, et al.
Published: (2026)
by: Si, Jianfeng, et al.
Published: (2026)
Generative Multi-Modal Knowledge Retrieval with Large Language Models
by: Long, Xinwei, et al.
Published: (2024)
by: Long, Xinwei, et al.
Published: (2024)
Delta-CoMe: Training-Free Delta-Compression with Mixed-Precision for Large Language Models
by: Ping, Bowen, et al.
Published: (2024)
by: Ping, Bowen, et al.
Published: (2024)
Align Documents to Questions: Question-Oriented Document Rewriting for Retrieval-Augmented Generation
by: Li, Jiaang, et al.
Published: (2026)
by: Li, Jiaang, et al.
Published: (2026)
Bootstrapping LLM-based Task-Oriented Dialogue Agents via Self-Talk
by: Ulmer, Dennis, et al.
Published: (2024)
by: Ulmer, Dennis, et al.
Published: (2024)
MiLoRA: Harnessing Minor Singular Components for Parameter-Efficient LLM Finetuning
by: Wang, Hanqing, et al.
Published: (2024)
by: Wang, Hanqing, et al.
Published: (2024)
RAS: Retrieval-And-Structuring for Knowledge-Intensive LLM Generation
by: Jiang, Pengcheng, et al.
Published: (2025)
by: Jiang, Pengcheng, et al.
Published: (2025)
Zero-shot Graph Reasoning via Retrieval Augmented Framework with LLMs
by: Li, Hanqing, et al.
Published: (2025)
by: Li, Hanqing, et al.
Published: (2025)
Similar Items
-
Controllable Text Generation with Residual Memory Transformer
by: Zhang, Hanqing, et al.
Published: (2023) -
Bi-DCSpell: A Bi-directional Detector-Corrector Interactive Framework for Chinese Spelling Check
by: Wu, Haiming, et al.
Published: (2024) -
ZigzagAttention: Efficient Long-Context Inference with Exclusive Retrieval and Streaming Heads
by: Liu, Zhuorui, et al.
Published: (2025) -
LLM-Oriented Information Retrieval: A Denoising-First Perspective
by: Dai, Lu, et al.
Published: (2026) -
A Two-Stage GPU Kernel Tuner Combining Semantic Refactoring and Search-Based Optimization
by: Qu, Qiuyi, et al.
Published: (2026)