| Main authors: | Ben-Artzy, Amit; Schwartz, Roy |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online access: | https://arxiv.org/abs/2507.16323 |
Similar items
Attend First, Consolidate Later: On the Importance of Attention in Different LLM Layers
by: Ben-Artzy, Amit, et al.
Published: (2024)
SpeCache: Speculative Key-Value Caching for Efficient Generation of LLMs
by: Jie, Shibo, et al.
Published: (2025)
ViSpeR: Multilingual Audio-Visual Speech Recognition
by: Narayan, Sanath, et al.
Published: (2024)
KOALA: Enhancing Speculative Decoding for LLM via Multi-Layer Draft Heads with Adversarial Learning
by: Zhang, Kaiqi, et al.
Published: (2024)
Enhancing LLM Character-Level Manipulation via Divide and Conquer
by: Xiong, Zhen, et al.
Published: (2025)
LycheeDecode: Accelerating Long-Context LLM Inference via Hybrid-Head Sparse Decoding
by: Lin, Gang, et al.
Published: (2026)
Acceleration Multiple Heads Decoding for LLM via Dynamic Tree Attention
by: Zhang, Zhendong
Published: (2025)
Dynamic Speculation Lookahead Accelerates Speculative Decoding of Large Language Models
by: Mamou, Jonathan, et al.
Published: (2024)
Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads
by: Cai, Tianle, et al.
Published: (2024)
SpeCrawler: Generating OpenAPI Specifications from API Documentation Using Large Language Models
by: Lazar, Koren, et al.
Published: (2024)
Beyond Performance: Quantifying and Mitigating Label Bias in LLMs
by: Reif, Yuval, et al.
Published: (2024)
C-LLM: Learn to Check Chinese Spelling Errors Character by Character
by: Li, Kunting, et al.
Published: (2024)
Don't Overthink it. Preferring Shorter Thinking Chains for Improved LLM Reasoning
by: Hassid, Michael, et al.
Published: (2025)
CREFT: Sequential Multi-Agent LLM for Character Relation Extraction
by: Chun, Ye Eun, et al.
Published: (2025)
Inferring Functionality of Attention Heads from their Parameters
by: Elhelo, Amit, et al.
Published: (2024)
Transformers are Multi-State RNNs
by: Oren, Matanel, et al.
Published: (2024)
Understanding the Ability of LLMs to Handle Character-Level Perturbation
by: Zhuo, Anyuan, et al.
Published: (2025)
Exact Hard Monotonic Attention for Character-Level Transduction
by: Wu, Shijie, et al.
Published: (2019)
Cross-lingual, Character-Level Neural Morphological Tagging
by: Cotterell, Ryan, et al.
Published: (2017)
Hard Non-Monotonic Attention for Character-Level Transduction
by: Wu, Shijie, et al.
Published: (2018)
MathVC: An LLM-Simulated Multi-Character Virtual Classroom for Mathematics Education
by: Yue, Murong, et al.
Published: (2024)
Non-verbal information in spontaneous speech -- towards a new framework of analysis
by: Biron, Tirza, et al.
Published: (2024)
A Multi-Model Adaptation of Speculative Decoding for Classification
by: Roy, Somnath, et al.
Published: (2025)
Root Defence Strategies: Ensuring Safety of LLM at the Decoding Level
by: Zeng, Xinyi, et al.
Published: (2024)
Decoding Continuous Character-based Language from Non-invasive Brain Recordings
by: Zhang, Cenyuan, et al.
Published: (2024)
CharBench: Evaluating the Role of Tokenization in Character-Level Tasks
by: Uzan, Omri, et al.
Published: (2025)
How Do Language Models Acquire Character-Level Information?
by: Sato, Soma, et al.
Published: (2026)
Vocab Diet: Reshaping the Vocabulary of LLMs via Vector Arithmetic
by: Reif, Yuval, et al.
Published: (2025)
CharED: Character-wise Ensemble Decoding for Large Language Models
by: Gu, Kevin, et al.
Published: (2024)
Single Character Perturbations Break LLM Alignment
by: Lin, Leon, et al.
Published: (2024)
DISC: Plug-and-Play Decoding Intervention with Similarity of Characters for Chinese Spelling Check
by: Qiao, Ziheng, et al.
Published: (2024)
Jakiro: Boosting Speculative Decoding with Decoupled Multi-Head via MoE
by: Huang, Haiduo, et al.
Published: (2025)
Enhancing Character-Level Understanding in LLMs through Token Internal Structure Learning
by: Xu, Zhu, et al.
Published: (2024)
Word Recovery in Large Language Models Enables Character-Level Tokenization Robustness
by: Yang, Zhipeng, et al.
Published: (2026)
On Pruning State-Space LLMs
by: Ghattas, Tamer, et al.
Published: (2025)
Token-Level Marginalization for Multi-Label LLM Classifiers
by: Praharaj, Anjaneya, et al.
Published: (2025)
Effectively Compress KV Heads for LLM
by: Yu, Hao, et al.
Published: (2024)
The Larger the Better? Improved LLM Code-Generation via Budget Reallocation
by: Hassid, Michael, et al.
Published: (2024)
AdaDecode: Accelerating LLM Decoding with Adaptive Layer Parallelism
by: Wei, Zhepei, et al.
Published: (2025)
Character-Level Chinese Dependency Parsing via Modeling Latent Intra-Word Structure
by: Hou, Yang, et al.
Published: (2024)