:: Library Catalog

Bibliographic Details
Main Authors:	Zou, Yuchen; Chen, Yineng; Li, Zuchao; Zhang, Lefei; Zhao, Hai
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2406.16722

Similar Items

Reference Trustable Decoding: A Training-Free Augmentation Paradigm for Large Language Models
by: Shi, Luohe, et al.
Published: (2024)

RACER: Retrieval-Augmented Contextual Rapid Speculative Decoding
by: Zhang, Zihong, et al.
Published: (2026)

Sparse is Enough in Fine-tuning Pre-trained Large Language Models
by: Song, Weixi, et al.
Published: (2023)

A Coin Has Two Sides: A Novel Detector-Corrector Framework for Chinese Spelling Correction
by: Zeng, Xiangke, et al.
Published: (2024)

KV-Latent: Dimensional-level KV Cache Reduction with Frequency-aware Rotary Positional Embedding
by: Shi, Luohe, et al.
Published: (2025)

Scaling LLM Speculative Decoding: Non-Autoregressive Forecasting in Large-Batch Scenarios
by: Shi, Luohe, et al.
Published: (2025)

Segment First or Comprehend First? Explore the Limit of Unsupervised Word Segmentation with Large Language Models
by: Zhang, Zihong, et al.
Published: (2025)

VHASR: A Multimodal Speech Recognition System With Vision Hotwords
by: Hu, Jiliang, et al.
Published: (2024)

From AR to Diffusion: Efficiently Adapting Large Language Models with Strictly Causal and Elastic Horizons
by: Ma, Xiangyu, et al.
Published: (2026)

ToM: Leveraging Tree-oriented MapReduce for Long-Context Reasoning in Large Language Models
by: Guo, Jiani, et al.
Published: (2025)

CoViPAL: Layer-wise Contextualized Visual Token Pruning for Large Vision-Language Models
by: Tang, Zicong, et al.
Published: (2025)

SirLLM: Streaming Infinite Retentive LLM
by: Yao, Yao, et al.
Published: (2024)

GKT: A Novel Guidance-Based Knowledge Transfer Framework For Efficient Cloud-edge Collaboration LLM Deployment
by: Yao, Yao, et al.
Published: (2024)

Beyond Chain-of-Thought, Effective Graph-of-Thought Reasoning in Language Models
by: Yao, Yao, et al.
Published: (2023)

Centroid-centered Modeling for Efficient Vision Transformer Pre-training
by: Yan, Xin, et al.
Published: (2023)

IAM: Efficient Inference through Attention Mapping between Different-scale LLMs
by: Zhao, Yi, et al.
Published: (2025)

SongSong: A Time Phonograph for Chinese SongCi Music from Thousand of Years Away
by: Li, Jiajia, et al.
Published: (2026)

Label Drop for Multi-Aspect Relation Modeling in Universal Information Extraction
by: Yang, Lu, et al.
Published: (2025)

SpindleKV: A Novel KV Cache Reduction Method Balancing Both Shallow and Deep Layers
by: Tang, Zicong, et al.
Published: (2025)

Keep the Cost Down: A Review on Methods to Optimize LLM's KV-Cache Consumption
by: Shi, Luohe, et al.
Published: (2024)

Model Hemorrhage and the Robustness Limits of Large Language Models
by: Ma, Ziyang, et al.
Published: (2025)

DAC: A Dynamic Attention-aware Approach for Task-Agnostic Prompt Compression
by: Zhao, Yi, et al.
Published: (2025)

MGIMM: Multi-Granularity Instruction Multimodal Model for Attribute-Guided Remote Sensing Image Detailed Description
by: Yang, Cong, et al.
Published: (2024)

Semantics-Preserved Distortion for Personal Privacy Protection in Information Management
by: Li, Jiajia, et al.
Published: (2022)

XQuant: Achieving Ultra-Low Bit KV Cache Quantization with Cross-Layer Compression
by: Yang, Haoqi, et al.
Published: (2025)

Multi-modal Auto-regressive Modeling via Visual Words
by: Peng, Tianshuo, et al.
Published: (2024)

How Deep is Love in LLMs' Hearts? Exploring Semantic Size in Human-like Cognition
by: Yao, Yao, et al.
Published: (2025)

AMIA: Automatic Masking and Joint Intention Analysis Makes LVLMs Robust Jailbreak Defenders
by: Zhang, Yuqi, et al.
Published: (2025)

Faster MoE LLM Inference for Extremely Large Models
by: Yang, Haoqi, et al.
Published: (2025)

DHI: Leveraging Diverse Hallucination Induction for Enhanced Contrastive Factuality Control in Large Language Models
by: Guo, Jiani, et al.
Published: (2026)

OmniBridge: Unified Multimodal Understanding, Generation, and Retrieval via Latent Space Alignment
by: Xiao, Teng, et al.
Published: (2025)

Intention Analysis Makes LLMs A Good Jailbreak Defender
by: Zhang, Yuqi, et al.
Published: (2024)

Enhancing Perception of Key Changes in Remote Sensing Image Change Captioning
by: Yang, Cong, et al.
Published: (2024)

Imitate Before Detect: Aligning Machine Stylistic Preference for Machine-Revised Text Detection
by: Chen, Jiaqi, et al.
Published: (2024)

ReMamba: Equip Mamba with Effective Long-Sequence Modeling
by: Yuan, Danlong, et al.
Published: (2024)

A Detailed Factor Analysis for the Political Compass Test: Navigating Ideologies of Large Language Models
by: Kamal, Sadia, et al.
Published: (2025)

Fixing the Broken Compass: Diagnosing and Improving Inference-Time Reward Modeling
by: Li, Jiachun, et al.
Published: (2025)

MLPs Compass: What is learned when MLPs are combined with PLMs?
by: Zhou, Li, et al.
Published: (2024)

RankMamba: Benchmarking Mamba's Document Ranking Performance in the Era of Transformers
by: Xu, Zhichao
Published: (2024)

End-to-end Contrastive Language-Speech Pretraining Model For Long-form Spoken Question Answering
by: Hu, Jiliang, et al.
Published: (2025)