Guardado en:
| Autores principales: | Ding, Shaojin, Qiu, David, Rim, David, He, Yanzhang, Rybakov, Oleg, Li, Bo, Prabhavalkar, Rohit, Wang, Weiran, Sainath, Tara N., Han, Zhonglin, Li, Jian, Yazdanbakhsh, Amir, Agrawal, Shivani |
|---|---|
| Formato: | Preprint |
| Publicado: |
2023
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2312.08553 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
USM RNN-T model weights binarization
por: Rybakov, Oleg, et al.
Publicado: (2024)
por: Rybakov, Oleg, et al.
Publicado: (2024)
Extreme Encoder Output Frame Rate Reduction: Improving Computational Latencies of Large End-to-End Models
por: Prabhavalkar, Rohit, et al.
Publicado: (2024)
por: Prabhavalkar, Rohit, et al.
Publicado: (2024)
Text Injection for Neural Contextual Biasing
por: Meng, Zhong, et al.
Publicado: (2024)
por: Meng, Zhong, et al.
Publicado: (2024)
How to Estimate Model Transferability of Pre-Trained Speech Models?
por: Chen, Zih-Ching, et al.
Publicado: (2023)
por: Chen, Zih-Ching, et al.
Publicado: (2023)
Progressive Gradient Flow for Robust N:M Sparsity Training in Transformers
por: Bambhaniya, Abhimanyu Rajeshkumar, et al.
Publicado: (2024)
por: Bambhaniya, Abhimanyu Rajeshkumar, et al.
Publicado: (2024)
SLiM: One-shot Quantization and Sparsity with Low-rank Approximation for LLM Weight Compression
por: Mozaffari, Mohammad, et al.
Publicado: (2024)
por: Mozaffari, Mohammad, et al.
Publicado: (2024)
SimulTron: On-Device Simultaneous Speech to Speech Translation
por: Agranovich, Alex, et al.
Publicado: (2024)
por: Agranovich, Alex, et al.
Publicado: (2024)
Beyond Moore's Law: Harnessing the Redshift of Generative AI with Effective Hardware-Software Co-Design
por: Yazdanbakhsh, Amir
Publicado: (2025)
por: Yazdanbakhsh, Amir
Publicado: (2025)
Speech-Omni-Lite: Portable Speech Interfaces for Vision-Language Models
por: Tao, Dehua, et al.
Publicado: (2026)
por: Tao, Dehua, et al.
Publicado: (2026)
SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsity
por: Khaki, Samir, et al.
Publicado: (2025)
por: Khaki, Samir, et al.
Publicado: (2025)
Effective Interplay between Sparsity and Quantization: From Theory to Practice
por: Harma, Simla Burcu, et al.
Publicado: (2024)
por: Harma, Simla Burcu, et al.
Publicado: (2024)
LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation
por: Kamahori, Keisuke, et al.
Publicado: (2025)
por: Kamahori, Keisuke, et al.
Publicado: (2025)
Multistage Fine-tuning Strategies for Automatic Speech Recognition in Low-resource Languages
por: Pillai, Leena G, et al.
Publicado: (2024)
por: Pillai, Leena G, et al.
Publicado: (2024)
Hierarchical Recurrent Adapters for Efficient Multi-Task Adaptation of Large Speech Models
por: Munkhdalai, Tsendsuren, et al.
Publicado: (2024)
por: Munkhdalai, Tsendsuren, et al.
Publicado: (2024)
Improving the Inclusivity of Dutch Speech Recognition by Fine-tuning Whisper on the JASMIN-CGN Corpus
por: Shekoufandeh, Golshid, et al.
Publicado: (2025)
por: Shekoufandeh, Golshid, et al.
Publicado: (2025)
Enhancing Pre-trained ASR System Fine-tuning for Dysarthric Speech Recognition using Adversarial Data Augmentation
por: Wang, Huimeng, et al.
Publicado: (2024)
por: Wang, Huimeng, et al.
Publicado: (2024)
Not All NVFP4 QAT Recipes Are Equal: How Architecture and Scale Shape Model Quality for Anomaly Segmentation
por: Du, Zijian, et al.
Publicado: (2026)
por: Du, Zijian, et al.
Publicado: (2026)
Position-invariant Fine-tuning of Speech Enhancement Models with Self-supervised Speech Representations
por: Meghanani, Amit, et al.
Publicado: (2026)
por: Meghanani, Amit, et al.
Publicado: (2026)
Jurnal USM Law Review
Publicado: (2020)
Publicado: (2020)
Bangla Hate Speech Classification with Fine-tuned Transformer Models
por: Jafari, Yalda Keivan, et al.
Publicado: (2025)
por: Jafari, Yalda Keivan, et al.
Publicado: (2025)
Automatic Screening for Children with Speech Disorder using Automatic Speech Recognition: Opportunities and Challenges
por: Liu, Dancheng, et al.
Publicado: (2024)
por: Liu, Dancheng, et al.
Publicado: (2024)
On the Contribution of Lexical Features to Speech Emotion Recognition
por: Combei, David
Publicado: (2025)
por: Combei, David
Publicado: (2025)
Speech Robust Bench: A Robustness Benchmark For Speech Recognition
por: Shah, Muhammad A., et al.
Publicado: (2024)
por: Shah, Muhammad A., et al.
Publicado: (2024)
Speech Recognition Model Improves Text-to-Speech Synthesis using Fine-Grained Reward
por: Wang, Guansu, et al.
Publicado: (2025)
por: Wang, Guansu, et al.
Publicado: (2025)
Gradient Norm-based Fine-Tuning for Backdoor Defense in Automatic Speech Recognition
por: Zhou, Nanjun, et al.
Publicado: (2025)
por: Zhou, Nanjun, et al.
Publicado: (2025)
Aligner-Encoders: Self-Attention Transformers Can Be Self-Transducers
por: Stooke, Adam, et al.
Publicado: (2025)
por: Stooke, Adam, et al.
Publicado: (2025)
Persian Speech Emotion Recognition by Fine-Tuning Transformers
por: Shayaninasab, Minoo, et al.
Publicado: (2024)
por: Shayaninasab, Minoo, et al.
Publicado: (2024)
Joint Automatic Speech Recognition And Structure Learning For Better Speech Understanding
por: Hu, Jiliang, et al.
Publicado: (2025)
por: Hu, Jiliang, et al.
Publicado: (2025)
Speech Emotion Recognition with ASR Integration
por: Li, Yuanchao
Publicado: (2026)
por: Li, Yuanchao
Publicado: (2026)
Semantic Communications for Speech Recognition
por: Weng, Zhenzi, et al.
Publicado: (2021)
por: Weng, Zhenzi, et al.
Publicado: (2021)
Rubric-Guided Fine-tuning of SpeechLLMs for Multi-Aspect, Multi-Rater L2 Reading-Speech Assessment
por: Parikh, Aditya Kamlesh, et al.
Publicado: (2026)
por: Parikh, Aditya Kamlesh, et al.
Publicado: (2026)
Optimizing Contextual Speech Recognition Using Vector Quantization for Efficient Retrieval
por: Flemotomos, Nikolaos, et al.
Publicado: (2024)
por: Flemotomos, Nikolaos, et al.
Publicado: (2024)
DQ-Whisper: Joint Distillation and Quantization for Efficient Multilingual Speech Recognition
por: Shao, Hang, et al.
Publicado: (2023)
por: Shao, Hang, et al.
Publicado: (2023)
DQ-Data2vec: Decoupling Quantization for Multilingual Speech Recognition
por: Shao, Qijie, et al.
Publicado: (2025)
por: Shao, Qijie, et al.
Publicado: (2025)
Sparsity-Driven EEG Channel Selection for Brain-Assisted Speech Enhancement
por: Zhang, Jie, et al.
Publicado: (2023)
por: Zhang, Jie, et al.
Publicado: (2023)
Statistical Beamformer Exploiting Non-stationarity and Sparsity with Spatially Constrained ICA for Robust Speech Recognition
por: Shin, Ui-Hyeop, et al.
Publicado: (2023)
por: Shin, Ui-Hyeop, et al.
Publicado: (2023)
BrainWavLM: Fine-tuning Speech Representations with Brain Responses to Language
por: Vattikonda, Nishitha, et al.
Publicado: (2025)
por: Vattikonda, Nishitha, et al.
Publicado: (2025)
Fine-tuning Quantized Neural Networks with Zeroth-order Optimization
por: Shang, Sifeng, et al.
Publicado: (2025)
por: Shang, Sifeng, et al.
Publicado: (2025)
Efficient Adapter Finetuning for Tail Languages in Streaming Multilingual ASR
por: Bai, Junwen, et al.
Publicado: (2024)
por: Bai, Junwen, et al.
Publicado: (2024)
Tao: Re-Thinking DL-based Microarchitecture Simulation
por: Pandey, Santosh, et al.
Publicado: (2024)
por: Pandey, Santosh, et al.
Publicado: (2024)
Ejemplares similares
-
USM RNN-T model weights binarization
por: Rybakov, Oleg, et al.
Publicado: (2024) -
Extreme Encoder Output Frame Rate Reduction: Improving Computational Latencies of Large End-to-End Models
por: Prabhavalkar, Rohit, et al.
Publicado: (2024) -
Text Injection for Neural Contextual Biasing
por: Meng, Zhong, et al.
Publicado: (2024) -
How to Estimate Model Transferability of Pre-Trained Speech Models?
por: Chen, Zih-Ching, et al.
Publicado: (2023) -
Progressive Gradient Flow for Robust N:M Sparsity Training in Transformers
por: Bambhaniya, Abhimanyu Rajeshkumar, et al.
Publicado: (2024)