:: Library Catalog

Bibliographic Details
Main authors:	Ben-Artzy, Amit; Schwartz, Roy
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online access:	https://arxiv.org/abs/2507.16323

Similar items

Attend First, Consolidate Later: On the Importance of Attention in Different LLM Layers
by: Ben-Artzy, Amit, et al.
Published: (2024)

SpeCache: Speculative Key-Value Caching for Efficient Generation of LLMs
by: Jie, Shibo, et al.
Published: (2025)

ViSpeR: Multilingual Audio-Visual Speech Recognition
by: Narayan, Sanath, et al.
Published: (2024)

KOALA: Enhancing Speculative Decoding for LLM via Multi-Layer Draft Heads with Adversarial Learning
by: Zhang, Kaiqi, et al.
Published: (2024)

Enhancing LLM Character-Level Manipulation via Divide and Conquer
by: Xiong, Zhen, et al.
Published: (2025)

LycheeDecode: Accelerating Long-Context LLM Inference via Hybrid-Head Sparse Decoding
by: Lin, Gang, et al.
Published: (2026)

Acceleration Multiple Heads Decoding for LLM via Dynamic Tree Attention
by: Zhang, Zhendong
Published: (2025)

Dynamic Speculation Lookahead Accelerates Speculative Decoding of Large Language Models
by: Mamou, Jonathan, et al.
Published: (2024)

Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads
by: Cai, Tianle, et al.
Published: (2024)

SpeCrawler: Generating OpenAPI Specifications from API Documentation Using Large Language Models
by: Lazar, Koren, et al.
Published: (2024)

Beyond Performance: Quantifying and Mitigating Label Bias in LLMs
by: Reif, Yuval, et al.
Published: (2024)

C-LLM: Learn to Check Chinese Spelling Errors Character by Character
by: Li, Kunting, et al.
Published: (2024)

Don't Overthink it. Preferring Shorter Thinking Chains for Improved LLM Reasoning
by: Hassid, Michael, et al.
Published: (2025)

CREFT: Sequential Multi-Agent LLM for Character Relation Extraction
by: Chun, Ye Eun, et al.
Published: (2025)

Inferring Functionality of Attention Heads from their Parameters
by: Elhelo, Amit, et al.
Published: (2024)

Transformers are Multi-State RNNs
by: Oren, Matanel, et al.
Published: (2024)

Understanding the Ability of LLMs to Handle Character-Level Perturbation
by: Zhuo, Anyuan, et al.
Published: (2025)

Exact Hard Monotonic Attention for Character-Level Transduction
by: Wu, Shijie, et al.
Published: (2019)

Cross-lingual, Character-Level Neural Morphological Tagging
by: Cotterell, Ryan, et al.
Published: (2017)

Hard Non-Monotonic Attention for Character-Level Transduction
by: Wu, Shijie, et al.
Published: (2018)

MathVC: An LLM-Simulated Multi-Character Virtual Classroom for Mathematics Education
by: Yue, Murong, et al.
Published: (2024)

Non-verbal information in spontaneous speech -- towards a new framework of analysis
by: Biron, Tirza, et al.
Published: (2024)

A Multi-Model Adaptation of Speculative Decoding for Classification
by: Roy, Somnath, et al.
Published: (2025)

Root Defence Strategies: Ensuring Safety of LLM at the Decoding Level
by: Zeng, Xinyi, et al.
Published: (2024)

Decoding Continuous Character-based Language from Non-invasive Brain Recordings
by: Zhang, Cenyuan, et al.
Published: (2024)

CharBench: Evaluating the Role of Tokenization in Character-Level Tasks
by: Uzan, Omri, et al.
Published: (2025)

How Do Language Models Acquire Character-Level Information?
by: Sato, Soma, et al.
Published: (2026)

Vocab Diet: Reshaping the Vocabulary of LLMs via Vector Arithmetic
by: Reif, Yuval, et al.
Published: (2025)

CharED: Character-wise Ensemble Decoding for Large Language Models
by: Gu, Kevin, et al.
Published: (2024)

Single Character Perturbations Break LLM Alignment
by: Lin, Leon, et al.
Published: (2024)

DISC: Plug-and-Play Decoding Intervention with Similarity of Characters for Chinese Spelling Check
by: Qiao, Ziheng, et al.
Published: (2024)

Jakiro: Boosting Speculative Decoding with Decoupled Multi-Head via MoE
by: Huang, Haiduo, et al.
Published: (2025)

Enhancing Character-Level Understanding in LLMs through Token Internal Structure Learning
by: Xu, Zhu, et al.
Published: (2024)

Word Recovery in Large Language Models Enables Character-Level Tokenization Robustness
by: Yang, Zhipeng, et al.
Published: (2026)

On Pruning State-Space LLMs
by: Ghattas, Tamer, et al.
Published: (2025)

Token-Level Marginalization for Multi-Label LLM Classifiers
by: Praharaj, Anjaneya, et al.
Published: (2025)

Effectively Compress KV Heads for LLM
by: Yu, Hao, et al.
Published: (2024)

The Larger the Better? Improved LLM Code-Generation via Budget Reallocation
by: Hassid, Michael, et al.
Published: (2024)

AdaDecode: Accelerating LLM Decoding with Adaptive Layer Parallelism
by: Wei, Zhepei, et al.
Published: (2025)

Character-Level Chinese Dependency Parsing via Modeling Latent Intra-Word Structure
by: Hou, Yang, et al.
Published: (2024)