Saved in:
| Main Authors: | Xie, Xurong, Liu, Xunying, Lee, Tan, Wang, Lan |
|---|---|
| Format: | Preprint |
| Published: |
2020
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2012.07460 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Investigation of Deep Neural Network Acoustic Modelling Approaches for Low Resource Accented Mandarin Speech Recognition
by: Xie, Xurong, et al.
Published: (2022)
by: Xie, Xurong, et al.
Published: (2022)
Variational Auto-Encoder Based Variability Encoding for Dysarthric Speech Recognition
by: Xie, Xurong, et al.
Published: (2022)
by: Xie, Xurong, et al.
Published: (2022)
Unfolding A Few Structures for The Many: Memory-Efficient Compression of Conformer and Speech Foundation Models
by: Li, Zhaoqing, et al.
Published: (2025)
by: Li, Zhaoqing, et al.
Published: (2025)
Homogeneous Speaker Features for On-the-Fly Dysarthric and Elderly Speaker Adaptation
by: Geng, Mengzhe, et al.
Published: (2024)
by: Geng, Mengzhe, et al.
Published: (2024)
Perceiver-Prompt: Flexible Speaker Adaptation in Whisper for Chinese Disordered Speech Recognition
by: Jiang, Yicong, et al.
Published: (2024)
by: Jiang, Yicong, et al.
Published: (2024)
Structured Speaker-Deficiency Adaptation of Foundation Models for Dysarthric and Elderly Speech Recognition
by: Hu, Shujie, et al.
Published: (2024)
by: Hu, Shujie, et al.
Published: (2024)
On-the-fly Routing for Zero-shot MoE Speaker Adaptation of Speech Foundation Models for Dysarthric Speech Recognition
by: HU, Shujie, et al.
Published: (2025)
by: HU, Shujie, et al.
Published: (2025)
Phone-purity Guided Discrete Tokens for Dysarthric Speech Recognition
by: Wang, Huimeng, et al.
Published: (2025)
by: Wang, Huimeng, et al.
Published: (2025)
Joint Speaker Features Learning for Audio-visual Multichannel Speech Separation and Recognition
by: Li, Guinan, et al.
Published: (2024)
by: Li, Guinan, et al.
Published: (2024)
AbsoluteNet: A Deep Learning Neural Network to Classify Cerebral Hemodynamic Responses of Auditory Processing
by: Adeli, Behtom, et al.
Published: (2025)
by: Adeli, Behtom, et al.
Published: (2025)
Detection of Electric Motor Damage Through Analysis of Sound Signals Using Bayesian Neural Networks
by: Bauer, Waldemar, et al.
Published: (2024)
by: Bauer, Waldemar, et al.
Published: (2024)
Advancing Test-Time Adaptation in Wild Acoustic Test Settings
by: Liu, Hongfu, et al.
Published: (2023)
by: Liu, Hongfu, et al.
Published: (2023)
A Comprehensive Survey on Heart Sound Analysis in the Deep Learning Era
by: Ren, Zhao, et al.
Published: (2023)
by: Ren, Zhao, et al.
Published: (2023)
Cosine Scoring with Uncertainty for Neural Speaker Embedding
by: Wang, Qiongqiong, et al.
Published: (2024)
by: Wang, Qiongqiong, et al.
Published: (2024)
Combolutional Neural Networks
by: Churchwell, Cameron, et al.
Published: (2025)
by: Churchwell, Cameron, et al.
Published: (2025)
Joint Source-Environment Adaptation for Deep Learning-Based Underwater Acoustic Source Ranging
by: Kari, Dariush, et al.
Published: (2025)
by: Kari, Dariush, et al.
Published: (2025)
Dynamic Gated Recurrent Neural Network for Compute-efficient Speech Enhancement
by: Cheng, Longbiao, et al.
Published: (2024)
by: Cheng, Longbiao, et al.
Published: (2024)
Autoregressive Guidance of Deep Spatially Selective Filters using Bayesian Tracking for Efficient Extraction of Moving Speakers
by: Kienegger, Jakob, et al.
Published: (2026)
by: Kienegger, Jakob, et al.
Published: (2026)
Deep Feature Learning for Medical Acoustics
by: Poirè, Alessandro Maria, et al.
Published: (2022)
by: Poirè, Alessandro Maria, et al.
Published: (2022)
Music Emotion Prediction Using Recurrent Neural Networks
by: Chang, Xinyu, et al.
Published: (2024)
by: Chang, Xinyu, et al.
Published: (2024)
Quantifying Quanvolutional Neural Networks Robustness for Speech in Healthcare Applications
by: Tran, Ha, et al.
Published: (2026)
by: Tran, Ha, et al.
Published: (2026)
Feature Aggregation in Joint Sound Classification and Localization Neural Networks
by: Healy, Brendan, et al.
Published: (2023)
by: Healy, Brendan, et al.
Published: (2023)
Bayesian Low-Rank Factorization for Robust Model Adaptation
by: Ugan, Enes Yavuz, et al.
Published: (2025)
by: Ugan, Enes Yavuz, et al.
Published: (2025)
Parametric Neural Amp Modeling with Active Learning
by: Grötschla, Florian, et al.
Published: (2025)
by: Grötschla, Florian, et al.
Published: (2025)
Audio Classification of Low Feature Spectrograms Utilizing Convolutional Neural Networks
by: Elias, Noel
Published: (2024)
by: Elias, Noel
Published: (2024)
Automatic Equalization for Individual Instrument Tracks Using Convolutional Neural Networks
by: Mockenhaupt, Florian, et al.
Published: (2024)
by: Mockenhaupt, Florian, et al.
Published: (2024)
Test-Time Adaptation for Speech Emotion Recognition
by: Dong, Jiaheng, et al.
Published: (2026)
by: Dong, Jiaheng, et al.
Published: (2026)
Keyword-Guided Adaptation of Automatic Speech Recognition
by: Shamsian, Aviv, et al.
Published: (2024)
by: Shamsian, Aviv, et al.
Published: (2024)
Multi-stream Convolutional Neural Network with Frequency Selection for Robust Speaker Verification
by: Yao, Wei, et al.
Published: (2020)
by: Yao, Wei, et al.
Published: (2020)
Barwise Section Boundary Detection in Symbolic Music Using Convolutional Neural Networks
by: Eldeeb, Omar, et al.
Published: (2025)
by: Eldeeb, Omar, et al.
Published: (2025)
BigVSAN: Enhancing GAN-based Neural Vocoders with Slicing Adversarial Network
by: Shibuya, Takashi, et al.
Published: (2023)
by: Shibuya, Takashi, et al.
Published: (2023)
Vocal Melody Construction for Persian Lyrics Using LSTM Recurrent Neural Networks
by: Jafari, Farshad, et al.
Published: (2024)
by: Jafari, Farshad, et al.
Published: (2024)
Convolutional Neural Network Achieves Human-level Accuracy in Music Genre Classification
by: Dong, Mingwen
Published: (2018)
by: Dong, Mingwen
Published: (2018)
InterGridNet: An Electric Network Frequency Approach for Audio Source Location Classification Using Convolutional Neural Networks
by: Korgialas, Christos, et al.
Published: (2025)
by: Korgialas, Christos, et al.
Published: (2025)
CAK: Emergent Audio Effects from Minimal Deep Learning
by: Rockman, Austin
Published: (2025)
by: Rockman, Austin
Published: (2025)
Adaptive Control Attention Network for Underwater Acoustic Localization and Domain Adaptation
by: Vo, Quoc Thinh, et al.
Published: (2025)
by: Vo, Quoc Thinh, et al.
Published: (2025)
Deep Learning-based Non-Intrusive Multi-Objective Speech Assessment Model with Cross-Domain Features
by: Zezario, Ryandhimas E., et al.
Published: (2021)
by: Zezario, Ryandhimas E., et al.
Published: (2021)
High Resolution Guitar Transcription via Domain Adaptation
by: Riley, Xavier, et al.
Published: (2024)
by: Riley, Xavier, et al.
Published: (2024)
Investigation of Time-Frequency Feature Combinations with Histogram Layer Time Delay Neural Networks
by: Mohammadi, Amirmohammad, et al.
Published: (2024)
by: Mohammadi, Amirmohammad, et al.
Published: (2024)
Point Neuron Learning: A New Physics-Informed Neural Network Architecture
by: Bi, Hanwen, et al.
Published: (2024)
by: Bi, Hanwen, et al.
Published: (2024)
Similar Items
-
Investigation of Deep Neural Network Acoustic Modelling Approaches for Low Resource Accented Mandarin Speech Recognition
by: Xie, Xurong, et al.
Published: (2022) -
Variational Auto-Encoder Based Variability Encoding for Dysarthric Speech Recognition
by: Xie, Xurong, et al.
Published: (2022) -
Unfolding A Few Structures for The Many: Memory-Efficient Compression of Conformer and Speech Foundation Models
by: Li, Zhaoqing, et al.
Published: (2025) -
Homogeneous Speaker Features for On-the-Fly Dysarthric and Elderly Speaker Adaptation
by: Geng, Mengzhe, et al.
Published: (2024) -
Perceiver-Prompt: Flexible Speaker Adaptation in Whisper for Chinese Disordered Speech Recognition
by: Jiang, Yicong, et al.
Published: (2024)