:: Library Catalog

Bibliographic Details
Main authors:	Ding, Shaojin, Qiu, David, Rim, David, He, Yanzhang, Rybakov, Oleg, Li, Bo, Prabhavalkar, Rohit, Wang, Weiran, Sainath, Tara N., Han, Zhonglin, Li, Jian, Yazdanbakhsh, Amir, Agrawal, Shivani
Format:	Preprint
Published:	2023
Subjects:	Audio and Speech Processing; Sound
Online access:	https://arxiv.org/abs/2312.08553

Similar items

USM RNN-T model weights binarization
by: Rybakov, Oleg, et al.
Published: (2024)

Extreme Encoder Output Frame Rate Reduction: Improving Computational Latencies of Large End-to-End Models
by: Prabhavalkar, Rohit, et al.
Published: (2024)

Text Injection for Neural Contextual Biasing
by: Meng, Zhong, et al.
Published: (2024)

How to Estimate Model Transferability of Pre-Trained Speech Models?
by: Chen, Zih-Ching, et al.
Published: (2023)

Progressive Gradient Flow for Robust N:M Sparsity Training in Transformers
by: Bambhaniya, Abhimanyu Rajeshkumar, et al.
Published: (2024)

SLiM: One-shot Quantization and Sparsity with Low-rank Approximation for LLM Weight Compression
by: Mozaffari, Mohammad, et al.
Published: (2024)

SimulTron: On-Device Simultaneous Speech to Speech Translation
by: Agranovich, Alex, et al.
Published: (2024)

Beyond Moore's Law: Harnessing the Redshift of Generative AI with Effective Hardware-Software Co-Design
by: Yazdanbakhsh, Amir
Published: (2025)

Speech-Omni-Lite: Portable Speech Interfaces for Vision-Language Models
by: Tao, Dehua, et al.
Published: (2026)

SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsity
by: Khaki, Samir, et al.
Published: (2025)

Effective Interplay between Sparsity and Quantization: From Theory to Practice
by: Harma, Simla Burcu, et al.
Published: (2024)

LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation
by: Kamahori, Keisuke, et al.
Published: (2025)

Multistage Fine-tuning Strategies for Automatic Speech Recognition in Low-resource Languages
by: Pillai, Leena G, et al.
Published: (2024)

Hierarchical Recurrent Adapters for Efficient Multi-Task Adaptation of Large Speech Models
by: Munkhdalai, Tsendsuren, et al.
Published: (2024)

Improving the Inclusivity of Dutch Speech Recognition by Fine-tuning Whisper on the JASMIN-CGN Corpus
by: Shekoufandeh, Golshid, et al.
Published: (2025)

Enhancing Pre-trained ASR System Fine-tuning for Dysarthric Speech Recognition using Adversarial Data Augmentation
by: Wang, Huimeng, et al.
Published: (2024)

Not All NVFP4 QAT Recipes Are Equal: How Architecture and Scale Shape Model Quality for Anomaly Segmentation
by: Du, Zijian, et al.
Published: (2026)

Position-invariant Fine-tuning of Speech Enhancement Models with Self-supervised Speech Representations
by: Meghanani, Amit, et al.
Published: (2026)

Jurnal USM Law Review
Published: (2020)

Bangla Hate Speech Classification with Fine-tuned Transformer Models
by: Jafari, Yalda Keivan, et al.
Published: (2025)

Automatic Screening for Children with Speech Disorder using Automatic Speech Recognition: Opportunities and Challenges
by: Liu, Dancheng, et al.
Published: (2024)

On the Contribution of Lexical Features to Speech Emotion Recognition
by: Combei, David
Published: (2025)

Speech Robust Bench: A Robustness Benchmark For Speech Recognition
by: Shah, Muhammad A., et al.
Published: (2024)

Speech Recognition Model Improves Text-to-Speech Synthesis using Fine-Grained Reward
by: Wang, Guansu, et al.
Published: (2025)

Gradient Norm-based Fine-Tuning for Backdoor Defense in Automatic Speech Recognition
by: Zhou, Nanjun, et al.
Published: (2025)

Aligner-Encoders: Self-Attention Transformers Can Be Self-Transducers
by: Stooke, Adam, et al.
Published: (2025)

Persian Speech Emotion Recognition by Fine-Tuning Transformers
by: Shayaninasab, Minoo, et al.
Published: (2024)

Joint Automatic Speech Recognition And Structure Learning For Better Speech Understanding
by: Hu, Jiliang, et al.
Published: (2025)

Speech Emotion Recognition with ASR Integration
by: Li, Yuanchao
Published: (2026)

Semantic Communications for Speech Recognition
by: Weng, Zhenzi, et al.
Published: (2021)

Rubric-Guided Fine-tuning of SpeechLLMs for Multi-Aspect, Multi-Rater L2 Reading-Speech Assessment
by: Parikh, Aditya Kamlesh, et al.
Published: (2026)

Optimizing Contextual Speech Recognition Using Vector Quantization for Efficient Retrieval
by: Flemotomos, Nikolaos, et al.
Published: (2024)

DQ-Whisper: Joint Distillation and Quantization for Efficient Multilingual Speech Recognition
by: Shao, Hang, et al.
Published: (2023)

DQ-Data2vec: Decoupling Quantization for Multilingual Speech Recognition
by: Shao, Qijie, et al.
Published: (2025)

Sparsity-Driven EEG Channel Selection for Brain-Assisted Speech Enhancement
by: Zhang, Jie, et al.
Published: (2023)

Statistical Beamformer Exploiting Non-stationarity and Sparsity with Spatially Constrained ICA for Robust Speech Recognition
by: Shin, Ui-Hyeop, et al.
Published: (2023)

BrainWavLM: Fine-tuning Speech Representations with Brain Responses to Language
by: Vattikonda, Nishitha, et al.
Published: (2025)

Fine-tuning Quantized Neural Networks with Zeroth-order Optimization
by: Shang, Sifeng, et al.
Published: (2025)

Efficient Adapter Finetuning for Tail Languages in Streaming Multilingual ASR
by: Bai, Junwen, et al.
Published: (2024)

Tao: Re-Thinking DL-based Microarchitecture Simulation
by: Pandey, Santosh, et al.
Published: (2024)