:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Wang, Shuai, Zhang, Dehao, Belatreche, Ammar, Xiao, Yichen, Qing, Hongyu, We, Wenjie, Zhang, Malu, Yang, Yang
Format:	Preprint
Published:	2024
Subjects:	Signal Processing Neural and Evolutionary Computing Sound Audio and Speech Processing
Online Access:	https://arxiv.org/abs/2407.05310
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Global-Local Convolution with Spiking Neural Networks for Energy-efficient Keyword Spotting
by: Wang, Shuai, et al.
Published: (2024)

Spiketrum: An FPGA-based Implementation of a Neuromorphic Cochlea
by: Alsakkal, MHD Anas, et al.
Published: (2024)

Spiking-LEAF: A Learnable Auditory front-end for Spiking Neural Networks
by: Song, Zeyang, et al.
Published: (2023)

sVAD: A Robust, Low-Power, and Light-Weight Voice Activity Detection with Spiking Neural Networks
by: Yang, Qu, et al.
Published: (2024)

Automatic Voice Identification after Speech Resynthesis using PPG
by: Gaudier, Thibault, et al.
Published: (2024)

Delayed Memory Unit: Modelling Temporal Dependency Through Delay Gate
by: Sun, Pengfei, et al.
Published: (2023)

A Novel Transfer Learning Approach for Mental Stability Classification from Voice Signal
by: Islam, Rafiul, et al.
Published: (2026)

DeepSpeech models show Human-like Performance and Processing of Cochlear Implant Inputs
by: Steinhardt, Cynthia R., et al.
Published: (2024)

Spiking Music: Audio Compression with Event Based Auto-encoders
by: Lisboa, Martim, et al.
Published: (2024)

LACTOSE: Linear Array of Conditions, TOpologies with Separated Error-backpropagation -- The Differentiable "IF" Conditional for Differentiable Digital Signal Processing
by: Clarke, Christopher Johann
Published: (2025)

DPSNN: Spiking Neural Network for Low-Latency Streaming Speech Enhancement
by: Sun, Tao, et al.
Published: (2024)

Grammatical Structure and Grammatical Variations in Non-Metric Iranian Classical Music
by: Kanani, Maziar, et al.
Published: (2025)

Generative Voice Bursts during Phone Call
by: Ranjan, Paritosh, et al.
Published: (2025)

Neurobench: DCASE 2020 Acoustic Scene Classification benchmark on XyloAudio 2
by: Ke, Weijie, et al.
Published: (2024)

Resource-Efficient Speech Quality Prediction through Quantization Aware Training and Binary Activation Maps
by: Nilsson, Mattias, et al.
Published: (2024)

Low-power SNN-based audio source localisation using a Hilbert Transform spike encoding scheme
by: Haghighatshoar, Saeid, et al.
Published: (2024)

Accurate Mapping of RNNs on Neuromorphic Hardware with Adaptive Spiking Neurons
by: Boeshertz, Gauthier, et al.
Published: (2024)

Biomimetic Frontend for Differentiable Audio Processing
by: Famularo, Ruolan Leslie, et al.
Published: (2024)

How to Estimate Model Transferability of Pre-Trained Speech Models?
by: Chen, Zih-Ching, et al.
Published: (2023)

Artificial Neural Networks Trained on Noisy Speech Exhibit the McGurk Effect
by: Grasse, Lukas, et al.
Published: (2024)

Deep Photonic Reservoir Computer for Speech Recognition
by: Picco, Enrico, et al.
Published: (2023)

LMUFormer: Low Complexity Yet Powerful Spiking Model With Legendre Memory Units
by: Liu, Zeyu, et al.
Published: (2024)

Parsing Musical Structure to Enable Meaningful Variations
by: Kanani, Maziar, et al.
Published: (2025)

Parallel Stacked Aggregated Network for Voice Authentication in IoT-Enabled Smart Devices
by: Khan, Awais, et al.
Published: (2024)

Robust online reconstruction of continuous-time signals from a lean spike train ensemble code
by: Chattopadhyay, Anik, et al.
Published: (2024)

LVNS-RAVE: Diversified audio generation with RAVE and Latent Vector Novelty Search
by: Guo, Jinyue, et al.
Published: (2024)

HyperSound: Generating Implicit Neural Representations of Audio Signals with Hypernetworks
by: Szatkowski, Filip, et al.
Published: (2022)

NEUROSEC: FPGA-Based Neuromorphic Audio Security
by: Isik, Murat, et al.
Published: (2024)

A Comparison of Temporal Encoders for Neuromorphic Keyword Spotting with Few Neurons
by: Nilsson, Mattias, et al.
Published: (2023)

Spoken Conversational Agents with Large Language Models
by: Yang, Chao-Han Huck, et al.
Published: (2025)

Deformable Audio Transformer for Audio Event Detection
by: Zhu, Wentao
Published: (2023)

Long-Form Text-to-Music Generation with Adaptive Prompts: A Case Study in Tabletop Role-Playing Games Soundtracks
by: Marra, Felipe, et al.
Published: (2024)

Adaptive Per-Channel Energy Normalization Front-end for Robust Audio Signal Processing
by: Meng, Hanyu, et al.
Published: (2025)

Acoustic neural networks: Identifying design principles and exploring physical feasibility
by: Kalthoff, Ivan, et al.
Published: (2025)

Frequency-Based Alignment of EEG and Audio Signals Using Contrastive Learning and SincNet for Auditory Attention Detection
by: Liao, Yuan, et al.
Published: (2025)

Microphone Array Signal Processing and Deep Learning for Speech Enhancement
by: Haeb-Umbach, Reinhold, et al.
Published: (2025)

AI-Driven Cardiorespiratory Signal Processing: Separation, Clustering, and Anomaly Detection
by: Torabi, Yasaman
Published: (2026)

Generative Deep Learning and Signal Processing for Data Augmentation of Cardiac Auscultation Signals: Improving Model Robustness Using Synthetic Audio
by: Abbott, Leigh, et al.
Published: (2024)

Interfacing PDM MEMS microphones with PFM spiking systems: Application for Neuromorphic Auditory Sensors
by: Jimenez-Fernandez, Angel, et al.
Published: (2019)

Emotion Detection Using Conditional Generative Adversarial Networks (cGAN): A Deep Learning Approach
by: Srivastava, Anushka
Published: (2025)