| Main Authors: | Wang, Shuai, Zhang, Dehao, Belatreche, Ammar, Xiao, Yichen, Qing, Hongyu, Wei, Wenjie, Zhang, Malu, Yang, Yang |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2407.05310 |
Similar Items
Global-Local Convolution with Spiking Neural Networks for Energy-efficient Keyword Spotting
by: Wang, Shuai, et al.
Published: (2024)
Spiketrum: An FPGA-based Implementation of a Neuromorphic Cochlea
by: Alsakkal, MHD Anas, et al.
Published: (2024)
Spiking-LEAF: A Learnable Auditory front-end for Spiking Neural Networks
by: Song, Zeyang, et al.
Published: (2023)
sVAD: A Robust, Low-Power, and Light-Weight Voice Activity Detection with Spiking Neural Networks
by: Yang, Qu, et al.
Published: (2024)
Automatic Voice Identification after Speech Resynthesis using PPG
by: Gaudier, Thibault, et al.
Published: (2024)
Delayed Memory Unit: Modelling Temporal Dependency Through Delay Gate
by: Sun, Pengfei, et al.
Published: (2023)
A Novel Transfer Learning Approach for Mental Stability Classification from Voice Signal
by: Islam, Rafiul, et al.
Published: (2026)
DeepSpeech models show Human-like Performance and Processing of Cochlear Implant Inputs
by: Steinhardt, Cynthia R., et al.
Published: (2024)
Spiking Music: Audio Compression with Event Based Auto-encoders
by: Lisboa, Martim, et al.
Published: (2024)
LACTOSE: Linear Array of Conditions, TOpologies with Separated Error-backpropagation -- The Differentiable "IF" Conditional for Differentiable Digital Signal Processing
by: Clarke, Christopher Johann
Published: (2025)
DPSNN: Spiking Neural Network for Low-Latency Streaming Speech Enhancement
by: Sun, Tao, et al.
Published: (2024)
Grammatical Structure and Grammatical Variations in Non-Metric Iranian Classical Music
by: Kanani, Maziar, et al.
Published: (2025)
Generative Voice Bursts during Phone Call
by: Ranjan, Paritosh, et al.
Published: (2025)
Neurobench: DCASE 2020 Acoustic Scene Classification benchmark on XyloAudio 2
by: Ke, Weijie, et al.
Published: (2024)
Resource-Efficient Speech Quality Prediction through Quantization Aware Training and Binary Activation Maps
by: Nilsson, Mattias, et al.
Published: (2024)
Low-power SNN-based audio source localisation using a Hilbert Transform spike encoding scheme
by: Haghighatshoar, Saeid, et al.
Published: (2024)
Accurate Mapping of RNNs on Neuromorphic Hardware with Adaptive Spiking Neurons
by: Boeshertz, Gauthier, et al.
Published: (2024)
Biomimetic Frontend for Differentiable Audio Processing
by: Famularo, Ruolan Leslie, et al.
Published: (2024)
How to Estimate Model Transferability of Pre-Trained Speech Models?
by: Chen, Zih-Ching, et al.
Published: (2023)
Artificial Neural Networks Trained on Noisy Speech Exhibit the McGurk Effect
by: Grasse, Lukas, et al.
Published: (2024)
Deep Photonic Reservoir Computer for Speech Recognition
by: Picco, Enrico, et al.
Published: (2023)
LMUFormer: Low Complexity Yet Powerful Spiking Model With Legendre Memory Units
by: Liu, Zeyu, et al.
Published: (2024)
Parsing Musical Structure to Enable Meaningful Variations
by: Kanani, Maziar, et al.
Published: (2025)
Parallel Stacked Aggregated Network for Voice Authentication in IoT-Enabled Smart Devices
by: Khan, Awais, et al.
Published: (2024)
Robust online reconstruction of continuous-time signals from a lean spike train ensemble code
by: Chattopadhyay, Anik, et al.
Published: (2024)
LVNS-RAVE: Diversified audio generation with RAVE and Latent Vector Novelty Search
by: Guo, Jinyue, et al.
Published: (2024)
HyperSound: Generating Implicit Neural Representations of Audio Signals with Hypernetworks
by: Szatkowski, Filip, et al.
Published: (2022)
NEUROSEC: FPGA-Based Neuromorphic Audio Security
by: Isik, Murat, et al.
Published: (2024)
A Comparison of Temporal Encoders for Neuromorphic Keyword Spotting with Few Neurons
by: Nilsson, Mattias, et al.
Published: (2023)
Spoken Conversational Agents with Large Language Models
by: Yang, Chao-Han Huck, et al.
Published: (2025)
Deformable Audio Transformer for Audio Event Detection
by: Zhu, Wentao
Published: (2023)
Long-Form Text-to-Music Generation with Adaptive Prompts: A Case Study in Tabletop Role-Playing Games Soundtracks
by: Marra, Felipe, et al.
Published: (2024)
Adaptive Per-Channel Energy Normalization Front-end for Robust Audio Signal Processing
by: Meng, Hanyu, et al.
Published: (2025)
Acoustic neural networks: Identifying design principles and exploring physical feasibility
by: Kalthoff, Ivan, et al.
Published: (2025)
Frequency-Based Alignment of EEG and Audio Signals Using Contrastive Learning and SincNet for Auditory Attention Detection
by: Liao, Yuan, et al.
Published: (2025)
Microphone Array Signal Processing and Deep Learning for Speech Enhancement
by: Haeb-Umbach, Reinhold, et al.
Published: (2025)
AI-Driven Cardiorespiratory Signal Processing: Separation, Clustering, and Anomaly Detection
by: Torabi, Yasaman
Published: (2026)
Generative Deep Learning and Signal Processing for Data Augmentation of Cardiac Auscultation Signals: Improving Model Robustness Using Synthetic Audio
by: Abbott, Leigh, et al.
Published: (2024)
Interfacing PDM MEMS microphones with PFM spiking systems: Application for Neuromorphic Auditory Sensors
by: Jimenez-Fernandez, Angel, et al.
Published: (2019)
Emotion Detection Using Conditional Generative Adversarial Networks (cGAN): A Deep Learning Approach
by: Srivastava, Anushka
Published: (2025)