Saved in:
| Main Authors: | Chen, Zhuo, Jiang, Chengyue, Tu, Kewei |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2404.02068 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Improving Retrieval Augmented Open-Domain Question-Answering with Vectorized Contexts
by: Chen, Zhuo, et al.
Published: (2024)
by: Chen, Zhuo, et al.
Published: (2024)
RoT: Enhancing Large Language Models with Reflection on Search Trees
by: Hui, Wenyang, et al.
Published: (2024)
by: Hui, Wenyang, et al.
Published: (2024)
Layer-Condensed KV Cache for Efficient Inference of Large Language Models
by: Wu, Haoyi, et al.
Published: (2024)
by: Wu, Haoyi, et al.
Published: (2024)
Detecting Knowledge Boundary of Vision Large Language Models by Sampling-Based Inference
by: Chen, Zhuo, et al.
Published: (2025)
by: Chen, Zhuo, et al.
Published: (2025)
Efficient Multimodal Planning Agent for Visual Question-Answering
by: Chen, Zhuo, et al.
Published: (2026)
by: Chen, Zhuo, et al.
Published: (2026)
Dependency Transformer Grammars: Integrating Dependency Structures into Transformer Language Models
by: Zhao, Yida, et al.
Published: (2024)
by: Zhao, Yida, et al.
Published: (2024)
GiLT: Augmenting Transformer Language Models with Dependency Graphs
by: Huang, Tianyu, et al.
Published: (2026)
by: Huang, Tianyu, et al.
Published: (2026)
A Systematic Study of Cross-Layer KV Sharing for Efficient LLM Inference
by: Wu, You, et al.
Published: (2024)
by: Wu, You, et al.
Published: (2024)
Parallel Continuous Chain-of-Thought with Jacobi Iteration
by: Wu, Haoyi, et al.
Published: (2025)
by: Wu, Haoyi, et al.
Published: (2025)
Scaling Probabilistic Transformer via Efficient Cross-Scale Hyperparameter Transfer
by: Kuang, Penghao, et al.
Published: (2026)
by: Kuang, Penghao, et al.
Published: (2026)
A Systematic Study of Compositional Syntactic Transformer Language Models
by: Zhao, Yida, et al.
Published: (2025)
by: Zhao, Yida, et al.
Published: (2025)
Learning Robust Named Entity Recognizers From Noisy Data With Retrieval Augmentation
by: Ai, Chaoyi, et al.
Published: (2024)
by: Ai, Chaoyi, et al.
Published: (2024)
Generative Pretrained Structured Transformers: Unsupervised Syntactic Language Models at Scale
by: Hu, Xiang, et al.
Published: (2024)
by: Hu, Xiang, et al.
Published: (2024)
Efficient Length-Generalizable Attention via Causal Retrieval for Long-Context Language Modeling
by: Hu, Xiang, et al.
Published: (2024)
by: Hu, Xiang, et al.
Published: (2024)
Sparser is Faster and Less is More: Efficient Sparse Attention for Long-Range Transformers
by: Lou, Chao, et al.
Published: (2024)
by: Lou, Chao, et al.
Published: (2024)
Augmenting Transformers with Recursively Composed Multi-grained Representations
by: Hu, Xiang, et al.
Published: (2023)
by: Hu, Xiang, et al.
Published: (2023)
Rethinking Weight Decay for Robust Fine-Tuning of Foundation Models
by: Tian, Junjiao, et al.
Published: (2024)
by: Tian, Junjiao, et al.
Published: (2024)
Unsupervised Morphological Tree Tokenizer
by: Zhu, Qingyang, et al.
Published: (2024)
by: Zhu, Qingyang, et al.
Published: (2024)
Hardware-aligned Hierarchical Sparse Attention for Efficient Long-term Memory Access
by: Hu, Xiang, et al.
Published: (2025)
by: Hu, Xiang, et al.
Published: (2025)
Comprehensive Evaluation of Multimodal AI Models in Medical Imaging Diagnosis: From Data Augmentation to Preference-Based Comparison
by: Ruan, Cailian, et al.
Published: (2024)
by: Ruan, Cailian, et al.
Published: (2024)
GRKV: Global Regression for Training-Free KV Cache Compression in Long-Context LLMs
by: Peng, Junjie, et al.
Published: (2026)
by: Peng, Junjie, et al.
Published: (2026)
Repurposing Synthetic Data for Fine-grained Search Agent Supervision
by: Zhao, Yida, et al.
Published: (2025)
by: Zhao, Yida, et al.
Published: (2025)
Instances Need More Care: Rewriting Prompts for Instances with LLMs in the Loop Yields Better Zero-Shot Performance
by: Srivastava, Saurabh, et al.
Published: (2023)
by: Srivastava, Saurabh, et al.
Published: (2023)
Interpretable Online Log Analysis Using Large Language Models with Prompt Strategies
by: Liu, Yilun, et al.
Published: (2023)
by: Liu, Yilun, et al.
Published: (2023)
YOCO++: Enhancing YOCO with KV Residual Connections for Efficient LLM Inference
by: Wu, You, et al.
Published: (2026)
by: Wu, You, et al.
Published: (2026)
A Large Language Model Based Method for Complex Logical Reasoning over Knowledge Graphs
by: Zhang, Ziyan, et al.
Published: (2025)
by: Zhang, Ziyan, et al.
Published: (2025)
Theory of Mind in Large Language Models: Assessment and Enhancement
by: Chen, Ruirui, et al.
Published: (2025)
by: Chen, Ruirui, et al.
Published: (2025)
Flash Multi-Head Feed-Forward Network
by: Zhang, Minshen, et al.
Published: (2025)
by: Zhang, Minshen, et al.
Published: (2025)
Structure Guided Prompt: Instructing Large Language Model in Multi-Step Reasoning by Exploring Graph Structure of the Text
by: Cheng, Kewei, et al.
Published: (2024)
by: Cheng, Kewei, et al.
Published: (2024)
EvolveSearch: An Iterative Self-Evolving Search Agent
by: Zhang, Dingchu, et al.
Published: (2025)
by: Zhang, Dingchu, et al.
Published: (2025)
Potential and Limitations of LLMs in Capturing Structured Semantics: A Case Study on SRL
by: Cheng, Ning, et al.
Published: (2024)
by: Cheng, Ning, et al.
Published: (2024)
Directional Gradient Projection for Robust Fine-Tuning of Foundation Models
by: Huang, Chengyue, et al.
Published: (2025)
by: Huang, Chengyue, et al.
Published: (2025)
Beyond Surface Reasoning: Unveiling the True Long Chain-of-Thought Capacity of Diffusion Large Language Models
by: Chen, Qiguang, et al.
Published: (2025)
by: Chen, Qiguang, et al.
Published: (2025)
RAVEL: Evaluating Interpretability Methods on Disentangling Language Model Representations
by: Huang, Jing, et al.
Published: (2024)
by: Huang, Jing, et al.
Published: (2024)
MPCC: A Novel Benchmark for Multimodal Planning with Complex Constraints in Multimodal Large Language Models
by: Ji, Yiyan, et al.
Published: (2025)
by: Ji, Yiyan, et al.
Published: (2025)
Language Rectified Flow: Advancing Diffusion Language Generation with Probabilistic Flows
by: Zhang, Shujian, et al.
Published: (2024)
by: Zhang, Shujian, et al.
Published: (2024)
Using Large Language Models for the Interpretation of Building Regulations
by: Fuchs, Stefan, et al.
Published: (2024)
by: Fuchs, Stefan, et al.
Published: (2024)
DAM: Dynamic Attention Mask for Long-Context Large Language Model Inference Acceleration
by: Zhang, Hanzhi, et al.
Published: (2025)
by: Zhang, Hanzhi, et al.
Published: (2025)
FRAMES-VQA: Benchmarking Fine-Tuning Robustness across Multi-Modal Shifts in Visual Question Answering
by: Huang, Chengyue, et al.
Published: (2025)
by: Huang, Chengyue, et al.
Published: (2025)
An Interpretable and Crosslingual Method for Evaluating Second-Language Dialogues
by: Gao, Rena, et al.
Published: (2024)
by: Gao, Rena, et al.
Published: (2024)
Similar Items
-
Improving Retrieval Augmented Open-Domain Question-Answering with Vectorized Contexts
by: Chen, Zhuo, et al.
Published: (2024) -
RoT: Enhancing Large Language Models with Reflection on Search Trees
by: Hui, Wenyang, et al.
Published: (2024) -
Layer-Condensed KV Cache for Efficient Inference of Large Language Models
by: Wu, Haoyi, et al.
Published: (2024) -
Detecting Knowledge Boundary of Vision Large Language Models by Sampling-Based Inference
by: Chen, Zhuo, et al.
Published: (2025) -
Efficient Multimodal Planning Agent for Visual Question-Answering
by: Chen, Zhuo, et al.
Published: (2026)