:: Library Catalog

Bibliographic Details
Main Authors:	Xie, Xurong; Liu, Xunying; Lee, Tan; Wang, Lan
Format:	Preprint
Published:	2020
Subjects:	Sound; Audio and Speech Processing; Machine Learning
Online Access:	https://arxiv.org/abs/2012.07460

Similar Items

Investigation of Deep Neural Network Acoustic Modelling Approaches for Low Resource Accented Mandarin Speech Recognition
by: Xie, Xurong, et al.
Published: (2022)

Variational Auto-Encoder Based Variability Encoding for Dysarthric Speech Recognition
by: Xie, Xurong, et al.
Published: (2022)

Unfolding A Few Structures for The Many: Memory-Efficient Compression of Conformer and Speech Foundation Models
by: Li, Zhaoqing, et al.
Published: (2025)

Homogeneous Speaker Features for On-the-Fly Dysarthric and Elderly Speaker Adaptation
by: Geng, Mengzhe, et al.
Published: (2024)

Perceiver-Prompt: Flexible Speaker Adaptation in Whisper for Chinese Disordered Speech Recognition
by: Jiang, Yicong, et al.
Published: (2024)

Structured Speaker-Deficiency Adaptation of Foundation Models for Dysarthric and Elderly Speech Recognition
by: Hu, Shujie, et al.
Published: (2024)

On-the-fly Routing for Zero-shot MoE Speaker Adaptation of Speech Foundation Models for Dysarthric Speech Recognition
by: Hu, Shujie, et al.
Published: (2025)

Phone-purity Guided Discrete Tokens for Dysarthric Speech Recognition
by: Wang, Huimeng, et al.
Published: (2025)

Joint Speaker Features Learning for Audio-visual Multichannel Speech Separation and Recognition
by: Li, Guinan, et al.
Published: (2024)

AbsoluteNet: A Deep Learning Neural Network to Classify Cerebral Hemodynamic Responses of Auditory Processing
by: Adeli, Behtom, et al.
Published: (2025)

Detection of Electric Motor Damage Through Analysis of Sound Signals Using Bayesian Neural Networks
by: Bauer, Waldemar, et al.
Published: (2024)

Advancing Test-Time Adaptation in Wild Acoustic Test Settings
by: Liu, Hongfu, et al.
Published: (2023)

A Comprehensive Survey on Heart Sound Analysis in the Deep Learning Era
by: Ren, Zhao, et al.
Published: (2023)

Cosine Scoring with Uncertainty for Neural Speaker Embedding
by: Wang, Qiongqiong, et al.
Published: (2024)

Combolutional Neural Networks
by: Churchwell, Cameron, et al.
Published: (2025)

Joint Source-Environment Adaptation for Deep Learning-Based Underwater Acoustic Source Ranging
by: Kari, Dariush, et al.
Published: (2025)

Dynamic Gated Recurrent Neural Network for Compute-efficient Speech Enhancement
by: Cheng, Longbiao, et al.
Published: (2024)

Autoregressive Guidance of Deep Spatially Selective Filters using Bayesian Tracking for Efficient Extraction of Moving Speakers
by: Kienegger, Jakob, et al.
Published: (2026)

Deep Feature Learning for Medical Acoustics
by: Poirè, Alessandro Maria, et al.
Published: (2022)

Music Emotion Prediction Using Recurrent Neural Networks
by: Chang, Xinyu, et al.
Published: (2024)

Quantifying Quanvolutional Neural Networks Robustness for Speech in Healthcare Applications
by: Tran, Ha, et al.
Published: (2026)

Feature Aggregation in Joint Sound Classification and Localization Neural Networks
by: Healy, Brendan, et al.
Published: (2023)

Bayesian Low-Rank Factorization for Robust Model Adaptation
by: Ugan, Enes Yavuz, et al.
Published: (2025)

Parametric Neural Amp Modeling with Active Learning
by: Grötschla, Florian, et al.
Published: (2025)

Audio Classification of Low Feature Spectrograms Utilizing Convolutional Neural Networks
by: Elias, Noel
Published: (2024)

Automatic Equalization for Individual Instrument Tracks Using Convolutional Neural Networks
by: Mockenhaupt, Florian, et al.
Published: (2024)

Test-Time Adaptation for Speech Emotion Recognition
by: Dong, Jiaheng, et al.
Published: (2026)

Keyword-Guided Adaptation of Automatic Speech Recognition
by: Shamsian, Aviv, et al.
Published: (2024)

Multi-stream Convolutional Neural Network with Frequency Selection for Robust Speaker Verification
by: Yao, Wei, et al.
Published: (2020)

Barwise Section Boundary Detection in Symbolic Music Using Convolutional Neural Networks
by: Eldeeb, Omar, et al.
Published: (2025)

BigVSAN: Enhancing GAN-based Neural Vocoders with Slicing Adversarial Network
by: Shibuya, Takashi, et al.
Published: (2023)

Vocal Melody Construction for Persian Lyrics Using LSTM Recurrent Neural Networks
by: Jafari, Farshad, et al.
Published: (2024)

Convolutional Neural Network Achieves Human-level Accuracy in Music Genre Classification
by: Dong, Mingwen
Published: (2018)

InterGridNet: An Electric Network Frequency Approach for Audio Source Location Classification Using Convolutional Neural Networks
by: Korgialas, Christos, et al.
Published: (2025)

CAK: Emergent Audio Effects from Minimal Deep Learning
by: Rockman, Austin
Published: (2025)

Adaptive Control Attention Network for Underwater Acoustic Localization and Domain Adaptation
by: Vo, Quoc Thinh, et al.
Published: (2025)

Deep Learning-based Non-Intrusive Multi-Objective Speech Assessment Model with Cross-Domain Features
by: Zezario, Ryandhimas E., et al.
Published: (2021)

High Resolution Guitar Transcription via Domain Adaptation
by: Riley, Xavier, et al.
Published: (2024)

Investigation of Time-Frequency Feature Combinations with Histogram Layer Time Delay Neural Networks
by: Mohammadi, Amirmohammad, et al.
Published: (2024)

Point Neuron Learning: A New Physics-Informed Neural Network Architecture
by: Bi, Hanwen, et al.
Published: (2024)