Saved in:
| Main Author: | Song, Yubo |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2405.08570 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Layer-wise Alignment: Examining Safety Alignment Across Image Encoder Layers in Vision Language Models
by: Bachu, Saketh, et al.
Published: (2024)
by: Bachu, Saketh, et al.
Published: (2024)
Diagnostic-Driven Layer-Wise Compensation for Post-Training Quantization of Encoder-Decoder ASR Models
by: Wang, Xinyu, et al.
Published: (2026)
by: Wang, Xinyu, et al.
Published: (2026)
Encoder-Decoder or Decoder-Only? Revisiting Encoder-Decoder Large Language Model
by: Zhang, Biao, et al.
Published: (2025)
by: Zhang, Biao, et al.
Published: (2025)
AdaDecode: Accelerating LLM Decoding with Adaptive Layer Parallelism
by: Wei, Zhepei, et al.
Published: (2025)
by: Wei, Zhepei, et al.
Published: (2025)
LEAD: Layer-wise Expert-aligned Decoding for Faithful Radiology Report Generation
by: Yang, Ruixiao, et al.
Published: (2026)
by: Yang, Ruixiao, et al.
Published: (2026)
ConfLayers: Adaptive Confidence-based Layer Skipping for Self-Speculative Decoding
by: Amer, Walaa, et al.
Published: (2026)
by: Amer, Walaa, et al.
Published: (2026)
A One-Layer Decoder-Only Transformer is a Two-Layer RNN: With an Application to Certified Robustness
by: Zhang, Yuhao, et al.
Published: (2024)
by: Zhang, Yuhao, et al.
Published: (2024)
Lower Layers Matter: Alleviating Hallucination via Multi-Layer Fusion Contrastive Decoding with Truthfulness Refocused
by: Chen, Dingwei, et al.
Published: (2024)
by: Chen, Dingwei, et al.
Published: (2024)
Measuring the Redundancy of Decoder Layers in SpeechLLMs
by: Moumen, Adel, et al.
Published: (2026)
by: Moumen, Adel, et al.
Published: (2026)
Efficient Encoder-Decoder Transformer Decoding for Decomposable Tasks
by: Lu, Bo-Ru, et al.
Published: (2024)
by: Lu, Bo-Ru, et al.
Published: (2024)
DecoderLens: Layerwise Interpretation of Encoder-Decoder Transformers
by: Langedijk, Anna, et al.
Published: (2023)
by: Langedijk, Anna, et al.
Published: (2023)
Rethinking Entropy Allocation in LLM-based ASR: Understanding the Dynamics between Speech Encoders and LLMs
by: Xie, Yuan, et al.
Published: (2026)
by: Xie, Yuan, et al.
Published: (2026)
Multilingual Contrastive Decoding via Language-Agnostic Layers Skipping
by: Zhu, Wenhao, et al.
Published: (2024)
by: Zhu, Wenhao, et al.
Published: (2024)
Shakespearean Sparks: The Dance of Hallucination and Creativity in LLMs' Decoding Layers
by: He, Zicong, et al.
Published: (2025)
by: He, Zicong, et al.
Published: (2025)
CLaSp: In-Context Layer Skip for Self-Speculative Decoding
by: Chen, Longze, et al.
Published: (2025)
by: Chen, Longze, et al.
Published: (2025)
WilKE: Wise-Layer Knowledge Editor for Lifelong Knowledge Editing
by: Hu, Chenhui, et al.
Published: (2024)
by: Hu, Chenhui, et al.
Published: (2024)
Frozen Transformers in Language Models Are Effective Visual Encoder Layers
by: Pang, Ziqi, et al.
Published: (2023)
by: Pang, Ziqi, et al.
Published: (2023)
LayerNorm Induces Recency Bias in Transformer Decoders
by: Kim, Junu, et al.
Published: (2025)
by: Kim, Junu, et al.
Published: (2025)
Decoders Laugh as Loud as Encoders
by: Borodach, Eli, et al.
Published: (2025)
by: Borodach, Eli, et al.
Published: (2025)
Improve Decoding Factuality by Token-wise Cross Layer Entropy of Large Language Models
by: Wu, Jialiang, et al.
Published: (2025)
by: Wu, Jialiang, et al.
Published: (2025)
KNN-SSD: Enabling Dynamic Self-Speculative Decoding via Nearest Neighbor Layer Set Optimization
by: Song, Mingbo, et al.
Published: (2025)
by: Song, Mingbo, et al.
Published: (2025)
Rethinking Layer Relevance in Large Language Models Beyond Cosine Similarity
by: Hinostroza, Cristian, et al.
Published: (2026)
by: Hinostroza, Cristian, et al.
Published: (2026)
From Layers to Submodules: Rethinking Granularity in Replacement-Based LLM Compression
by: Cunegatti, Elia, et al.
Published: (2026)
by: Cunegatti, Elia, et al.
Published: (2026)
Layer-Wise Evolution of Representations in Fine-Tuned Transformers: Insights from Sparse AutoEncoders
by: Nadipalli, Suneel
Published: (2025)
by: Nadipalli, Suneel
Published: (2025)
Emergence and Effectiveness of Task Vectors in In-Context Learning: An Encoder Decoder Perspective
by: Han, Seungwook, et al.
Published: (2024)
by: Han, Seungwook, et al.
Published: (2024)
When Less Is More? Diagnosing ASR Predictions in Sardinian via Layer-Wise Decoding
by: De Cristofaro, Domenico, et al.
Published: (2026)
by: De Cristofaro, Domenico, et al.
Published: (2026)
Rethinking the Multilingual Reasoning Gap with Layer Swap
by: Lasbordes, Maxence, et al.
Published: (2026)
by: Lasbordes, Maxence, et al.
Published: (2026)
Beyond Static Cropping: Layer-Adaptive Visual Localization and Decoding Enhancement
by: Zhu, Zipeng, et al.
Published: (2026)
by: Zhu, Zipeng, et al.
Published: (2026)
Layer-Order Inversion: Rethinking Latent Multi-Hop Reasoning in Large Language Models
by: Liu, Xukai, et al.
Published: (2026)
by: Liu, Xukai, et al.
Published: (2026)
DOPRA: Decoding Over-accumulation Penalization and Re-allocation in Specific Weighting Layer
by: Wei, Jinfeng, et al.
Published: (2024)
by: Wei, Jinfeng, et al.
Published: (2024)
DEL: Context-Aware Dynamic Exit Layer for Efficient Self-Speculative Decoding
by: Zarch, Hossein Entezari, et al.
Published: (2025)
by: Zarch, Hossein Entezari, et al.
Published: (2025)
LFD: Layer Fused Decoding to Exploit External Knowledge in Retrieval-Augmented Generation
by: Sun, Yang, et al.
Published: (2025)
by: Sun, Yang, et al.
Published: (2025)
Dynamic Encoder Size Based on Data-Driven Layer-wise Pruning for Speech Recognition
by: Xu, Jingjing, et al.
Published: (2024)
by: Xu, Jingjing, et al.
Published: (2024)
KOALA: Enhancing Speculative Decoding for LLM via Multi-Layer Draft Heads with Adversarial Learning
by: Zhang, Kaiqi, et al.
Published: (2024)
by: Zhang, Kaiqi, et al.
Published: (2024)
Spiralformer: Low Latency Encoder for Streaming Speech Recognition with Circular Layer Skipping and Early Exiting
by: Tsunoo, Emiru, et al.
Published: (2025)
by: Tsunoo, Emiru, et al.
Published: (2025)
Encoder vs Decoder: Comparative Analysis of Encoder and Decoder Language Models on Multilingual NLU Tasks
by: Nielsen, Dan Saattrup, et al.
Published: (2024)
by: Nielsen, Dan Saattrup, et al.
Published: (2024)
LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding
by: Elhoushi, Mostafa, et al.
Published: (2024)
by: Elhoushi, Mostafa, et al.
Published: (2024)
Romanized to Native Malayalam Script Transliteration Using an Encoder-Decoder Framework
by: Baiju, Bajiyo, et al.
Published: (2024)
by: Baiju, Bajiyo, et al.
Published: (2024)
Linguistic Knowledge Can Enhance Encoder-Decoder Models (If You Let It)
by: Miaschi, Alessio, et al.
Published: (2024)
by: Miaschi, Alessio, et al.
Published: (2024)
Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers
by: Bozic, Vukasin, et al.
Published: (2023)
by: Bozic, Vukasin, et al.
Published: (2023)
Similar Items
-
Layer-wise Alignment: Examining Safety Alignment Across Image Encoder Layers in Vision Language Models
by: Bachu, Saketh, et al.
Published: (2024) -
Diagnostic-Driven Layer-Wise Compensation for Post-Training Quantization of Encoder-Decoder ASR Models
by: Wang, Xinyu, et al.
Published: (2026) -
Encoder-Decoder or Decoder-Only? Revisiting Encoder-Decoder Large Language Model
by: Zhang, Biao, et al.
Published: (2025) -
AdaDecode: Accelerating LLM Decoding with Adaptive Layer Parallelism
by: Wei, Zhepei, et al.
Published: (2025) -
LEAD: Layer-wise Expert-aligned Decoding for Faithful Radiology Report Generation
by: Yang, Ruixiao, et al.
Published: (2026)