Saved in:
| Main Authors: | Kachuee, Sajjad, Sharifkhani, Mohammad |
|---|---|
| Format: | Preprint |
| Published: |
2022
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2201.03327 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Efficient Large Language Models with Zero-Shot Adjustable Acceleration
by: Kachuee, Sajjad, et al.
Published: (2025)
by: Kachuee, Sajjad, et al.
Published: (2025)
Geometry-Preserving Aggregation for Mixture-of-Experts Embedding Models
by: Kachuee, Sajjad, et al.
Published: (2026)
by: Kachuee, Sajjad, et al.
Published: (2026)
Improving Tool Retrieval by Leveraging Large Language Models for Query Generation
by: Kachuee, Mohammad, et al.
Published: (2024)
by: Kachuee, Mohammad, et al.
Published: (2024)
Fine-tuning Transformer-based Encoder for Turkish Language Understanding Tasks
by: Yildirim, Savas
Published: (2024)
by: Yildirim, Savas
Published: (2024)
Shallow Cross-Encoders for Low-Latency Retrieval
by: Petrov, Aleksandr V., et al.
Published: (2024)
by: Petrov, Aleksandr V., et al.
Published: (2024)
Understanding Syntactic Generalization in Structure-inducing Language Models
by: Arps, David, et al.
Published: (2025)
by: Arps, David, et al.
Published: (2025)
LLM-based Frameworks for API Argument Filling in Task-Oriented Conversational Systems
by: Mok, Jisoo, et al.
Published: (2024)
by: Mok, Jisoo, et al.
Published: (2024)
Long-Context Encoder Models for Polish Language Understanding
by: Dadas, Sławomir, et al.
Published: (2026)
by: Dadas, Sławomir, et al.
Published: (2026)
Skin-in-the-Game: Decision Making via Multi-Stakeholder Alignment in LLMs
by: Sel, Bilgehan, et al.
Published: (2024)
by: Sel, Bilgehan, et al.
Published: (2024)
Multilingual Nonce Dependency Treebanks: Understanding how Language Models represent and process syntactic structure
by: Arps, David, et al.
Published: (2023)
by: Arps, David, et al.
Published: (2023)
Understanding AI Evaluation Patterns: How Different GPT Models Assess Vision-Language Descriptions
by: Abdoli, Sajjad, et al.
Published: (2025)
by: Abdoli, Sajjad, et al.
Published: (2025)
Moonshine v2: Ergodic Streaming Encoder ASR for Latency-Critical Speech Applications
by: Kudlur, Manjunath, et al.
Published: (2026)
by: Kudlur, Manjunath, et al.
Published: (2026)
E2LLM: Encoder Elongated Large Language Models for Long-Context Understanding and Reasoning
by: Liao, Zihan, et al.
Published: (2024)
by: Liao, Zihan, et al.
Published: (2024)
Frozen Transformers in Language Models Are Effective Visual Encoder Layers
by: Pang, Ziqi, et al.
Published: (2023)
by: Pang, Ziqi, et al.
Published: (2023)
Encoder-Decoder or Decoder-Only? Revisiting Encoder-Decoder Large Language Model
by: Zhang, Biao, et al.
Published: (2025)
by: Zhang, Biao, et al.
Published: (2025)
Heterogeneous Encoders Scaling In The Transformer For Neural Machine Translation
by: Hu, Jia Cheng, et al.
Published: (2023)
by: Hu, Jia Cheng, et al.
Published: (2023)
DecoderLens: Layerwise Interpretation of Encoder-Decoder Transformers
by: Langedijk, Anna, et al.
Published: (2023)
by: Langedijk, Anna, et al.
Published: (2023)
Efficient Encoder-Decoder Transformer Decoding for Decomposable Tasks
by: Lu, Bo-Ru, et al.
Published: (2024)
by: Lu, Bo-Ru, et al.
Published: (2024)
Spiralformer: Low Latency Encoder for Streaming Speech Recognition with Circular Layer Skipping and Early Exiting
by: Tsunoo, Emiru, et al.
Published: (2025)
by: Tsunoo, Emiru, et al.
Published: (2025)
TURNA: A Turkish Encoder-Decoder Language Model for Enhanced Understanding and Generation
by: Uludoğan, Gökçe, et al.
Published: (2024)
by: Uludoğan, Gökçe, et al.
Published: (2024)
Attention or Convolution: Transformer Encoders in Audio Language Models for Inference Efficiency
by: Jeon, Sungho, et al.
Published: (2023)
by: Jeon, Sungho, et al.
Published: (2023)
DisCoCLIP: A Distributional Compositional Tensor Network Encoder for Vision-Language Understanding
by: Lo, Kin Ian, et al.
Published: (2025)
by: Lo, Kin Ian, et al.
Published: (2025)
Interference Matrix: Quantifying Cross-Lingual Interference in Transformer Encoders
by: Alastruey, Belen, et al.
Published: (2025)
by: Alastruey, Belen, et al.
Published: (2025)
Extreme Encoder Output Frame Rate Reduction: Improving Computational Latencies of Large End-to-End Models
by: Prabhavalkar, Rohit, et al.
Published: (2024)
by: Prabhavalkar, Rohit, et al.
Published: (2024)
Cross-Prompt Encoder for Low-Performing Languages
by: Mikaberidze, Beso, et al.
Published: (2025)
by: Mikaberidze, Beso, et al.
Published: (2025)
On Multilingual Encoder Language Model Compression for Low-Resource Languages
by: Gurgurov, Daniil, et al.
Published: (2025)
by: Gurgurov, Daniil, et al.
Published: (2025)
Understanding Public Perception of Crime in Bangladesh: A Transformer-Based Approach with Explainability
by: Hassan, Fatema Binte, et al.
Published: (2025)
by: Hassan, Fatema Binte, et al.
Published: (2025)
FiRST: Finetuning Router-Selective Transformers for Input-Adaptive Latency Reduction
by: Jain, Akriti, et al.
Published: (2024)
by: Jain, Akriti, et al.
Published: (2024)
LabelFusion: Fusing Large Language Models with Transformer Encoders for Robust Financial News Classification
by: Schlee, Michael, et al.
Published: (2025)
by: Schlee, Michael, et al.
Published: (2025)
An Empirical Evaluation of Encoder Architectures for Fast Real-Time Long Conversational Understanding
by: Senthilnathan, Annamalai, et al.
Published: (2025)
by: Senthilnathan, Annamalai, et al.
Published: (2025)
RAVEN: In-Context Learning with Retrieval-Augmented Encoder-Decoder Language Models
by: Huang, Jie, et al.
Published: (2023)
by: Huang, Jie, et al.
Published: (2023)
On Affine Homotopy between Language Encoders
by: Chan, Robin SM, et al.
Published: (2024)
by: Chan, Robin SM, et al.
Published: (2024)
Language Models as Hierarchy Encoders
by: He, Yuan, et al.
Published: (2024)
by: He, Yuan, et al.
Published: (2024)
A Combined Encoder and Transformer Approach for Coherent and High-Quality Text Generation
by: Chen, Jiajing, et al.
Published: (2024)
by: Chen, Jiajing, et al.
Published: (2024)
MaBERT:A Padding Safe Interleaved Transformer Mamba Hybrid Encoder for Efficient Extended Context Masked Language Modeling
by: Kim, Jinwoong, et al.
Published: (2026)
by: Kim, Jinwoong, et al.
Published: (2026)
Transformers and Cortical Waves: Encoders for Pulling In Context Across Time
by: Muller, Lyle, et al.
Published: (2024)
by: Muller, Lyle, et al.
Published: (2024)
Assessing Logical Reasoning Capabilities of Encoder-Only Transformer Models
by: Pirozelli, Paulo, et al.
Published: (2023)
by: Pirozelli, Paulo, et al.
Published: (2023)
Rethinking the Mixture of Vision Encoders Paradigm for Enhanced Visual Understanding in Multimodal LLMs
by: Azadani, Mozhgan Nasr, et al.
Published: (2025)
by: Azadani, Mozhgan Nasr, et al.
Published: (2025)
Utilizing Multilingual Encoders to Improve Large Language Models for Low-Resource Languages
by: Puranegedara, Imalsha, et al.
Published: (2025)
by: Puranegedara, Imalsha, et al.
Published: (2025)
Understanding and Enhancing Mamba-Transformer Hybrids for Memory Recall and Language Modeling
by: Lee, Hyunji, et al.
Published: (2025)
by: Lee, Hyunji, et al.
Published: (2025)
Similar Items
-
Efficient Large Language Models with Zero-Shot Adjustable Acceleration
by: Kachuee, Sajjad, et al.
Published: (2025) -
Geometry-Preserving Aggregation for Mixture-of-Experts Embedding Models
by: Kachuee, Sajjad, et al.
Published: (2026) -
Improving Tool Retrieval by Leveraging Large Language Models for Query Generation
by: Kachuee, Mohammad, et al.
Published: (2024) -
Fine-tuning Transformer-based Encoder for Turkish Language Understanding Tasks
by: Yildirim, Savas
Published: (2024) -
Shallow Cross-Encoders for Low-Latency Retrieval
by: Petrov, Aleksandr V., et al.
Published: (2024)