:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Kanani, Maziar, Leary, Sean O, McDermott, James
Format:	Preprint
Published:	2025
Subjects:	Neural and Evolutionary Computing Sound Audio and Speech Processing
Online Access:	https://arxiv.org/abs/2507.10708
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Parsing Musical Structure to Enable Meaningful Variations
by: Kanani, Maziar, et al.
Published: (2025)

Radif Corpus: A Symbolic Dataset for Non-Metric Iranian Classical Music
by: Kanani, Maziar, et al.
Published: (2025)

Spiking Music: Audio Compression with Event Based Auto-encoders
by: Lisboa, Martim, et al.
Published: (2024)

Generative Voice Bursts during Phone Call
by: Ranjan, Paritosh, et al.
Published: (2025)

Spiking-LEAF: A Learnable Auditory front-end for Spiking Neural Networks
by: Song, Zeyang, et al.
Published: (2023)

Neurobench: DCASE 2020 Acoustic Scene Classification benchmark on XyloAudio 2
by: Ke, Weijie, et al.
Published: (2024)

Resource-Efficient Speech Quality Prediction through Quantization Aware Training and Binary Activation Maps
by: Nilsson, Mattias, et al.
Published: (2024)

Low-power SNN-based audio source localisation using a Hilbert Transform spike encoding scheme
by: Haghighatshoar, Saeid, et al.
Published: (2024)

DeepSpeech models show Human-like Performance and Processing of Cochlear Implant Inputs
by: Steinhardt, Cynthia R., et al.
Published: (2024)

A Novel Transfer Learning Approach for Mental Stability Classification from Voice Signal
by: Islam, Rafiul, et al.
Published: (2026)

sVAD: A Robust, Low-Power, and Light-Weight Voice Activity Detection with Spiking Neural Networks
by: Yang, Qu, et al.
Published: (2024)

Artificial Neural Networks Trained on Noisy Speech Exhibit the McGurk Effect
by: Grasse, Lukas, et al.
Published: (2024)

Deep Photonic Reservoir Computer for Speech Recognition
by: Picco, Enrico, et al.
Published: (2023)

Ternary Spike-based Neuromorphic Signal Processing System
by: Wang, Shuai, et al.
Published: (2024)

Spiketrum: An FPGA-based Implementation of a Neuromorphic Cochlea
by: Alsakkal, MHD Anas, et al.
Published: (2024)

Long-Form Text-to-Music Generation with Adaptive Prompts: A Case Study in Tabletop Role-Playing Games Soundtracks
by: Marra, Felipe, et al.
Published: (2024)

How to Estimate Model Transferability of Pre-Trained Speech Models?
by: Chen, Zih-Ching, et al.
Published: (2023)

LACTOSE: Linear Array of Conditions, TOpologies with Separated Error-backpropagation -- The Differentiable "IF" Conditional for Differentiable Digital Signal Processing
by: Clarke, Christopher Johann
Published: (2025)

Parallel Stacked Aggregated Network for Voice Authentication in IoT-Enabled Smart Devices
by: Khan, Awais, et al.
Published: (2024)

DPSNN: Spiking Neural Network for Low-Latency Streaming Speech Enhancement
by: Sun, Tao, et al.
Published: (2024)

Biomimetic Frontend for Differentiable Audio Processing
by: Famularo, Ruolan Leslie, et al.
Published: (2024)

Global-Local Convolution with Spiking Neural Networks for Energy-efficient Keyword Spotting
by: Wang, Shuai, et al.
Published: (2024)

Robust online reconstruction of continuous-time signals from a lean spike train ensemble code
by: Chattopadhyay, Anik, et al.
Published: (2024)

LVNS-RAVE: Diversified audio generation with RAVE and Latent Vector Novelty Search
by: Guo, Jinyue, et al.
Published: (2024)

Deformable Audio Transformer for Audio Event Detection
by: Zhu, Wentao
Published: (2023)

Automatic Voice Identification after Speech Resynthesis using PPG
by: Gaudier, Thibault, et al.
Published: (2024)

Spoken Conversational Agents with Large Language Models
by: Yang, Chao-Han Huck, et al.
Published: (2025)

Acoustic neural networks: Identifying design principles and exploring physical feasibility
by: Kalthoff, Ivan, et al.
Published: (2025)

HyperSound: Generating Implicit Neural Representations of Audio Signals with Hypernetworks
by: Szatkowski, Filip, et al.
Published: (2022)

LMUFormer: Low Complexity Yet Powerful Spiking Model With Legendre Memory Units
by: Liu, Zeyu, et al.
Published: (2024)

Emotion Detection Using Conditional Generative Adversarial Networks (cGAN): A Deep Learning Approach
by: Srivastava, Anushka
Published: (2025)

Dilated Convolution with Learnable Spacings
by: Khalfaoui-Hassani, Ismail
Published: (2024)

Investigating Training Strategies and Model Robustness of Low-Rank Adaptation for Language Modeling in Speech Recognition
by: Yu, Yu, et al.
Published: (2024)

Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition
by: Yu, Yu, et al.
Published: (2023)

NEUROSEC: FPGA-Based Neuromorphic Audio Security
by: Isik, Murat, et al.
Published: (2024)

Accurate Mapping of RNNs on Neuromorphic Hardware with Adaptive Spiking Neurons
by: Boeshertz, Gauthier, et al.
Published: (2024)

A Comparison of Temporal Encoders for Neuromorphic Keyword Spotting with Few Neurons
by: Nilsson, Mattias, et al.
Published: (2023)

Simulation of Neural Responses to Classical Music Using Organoid Intelligence Methods
by: Szelogowski, Daniel
Published: (2024)

Scaling and Prompting for Improved End-to-End Spoken Grammatical Error Correction
by: Qian, Mengjie, et al.
Published: (2025)

Data Augmentation for Spoken Grammatical Error Correction
by: Karanasou, Penny, et al.
Published: (2025)