Saved in:
| Main Authors: | Kanani, Maziar, Leary, Sean O, McDermott, James |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2507.10708 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Parsing Musical Structure to Enable Meaningful Variations
by: Kanani, Maziar, et al.
Published: (2025)
by: Kanani, Maziar, et al.
Published: (2025)
Radif Corpus: A Symbolic Dataset for Non-Metric Iranian Classical Music
by: Kanani, Maziar, et al.
Published: (2025)
by: Kanani, Maziar, et al.
Published: (2025)
Spiking Music: Audio Compression with Event Based Auto-encoders
by: Lisboa, Martim, et al.
Published: (2024)
by: Lisboa, Martim, et al.
Published: (2024)
Generative Voice Bursts during Phone Call
by: Ranjan, Paritosh, et al.
Published: (2025)
by: Ranjan, Paritosh, et al.
Published: (2025)
Spiking-LEAF: A Learnable Auditory front-end for Spiking Neural Networks
by: Song, Zeyang, et al.
Published: (2023)
by: Song, Zeyang, et al.
Published: (2023)
Neurobench: DCASE 2020 Acoustic Scene Classification benchmark on XyloAudio 2
by: Ke, Weijie, et al.
Published: (2024)
by: Ke, Weijie, et al.
Published: (2024)
Resource-Efficient Speech Quality Prediction through Quantization Aware Training and Binary Activation Maps
by: Nilsson, Mattias, et al.
Published: (2024)
by: Nilsson, Mattias, et al.
Published: (2024)
Low-power SNN-based audio source localisation using a Hilbert Transform spike encoding scheme
by: Haghighatshoar, Saeid, et al.
Published: (2024)
by: Haghighatshoar, Saeid, et al.
Published: (2024)
DeepSpeech models show Human-like Performance and Processing of Cochlear Implant Inputs
by: Steinhardt, Cynthia R., et al.
Published: (2024)
by: Steinhardt, Cynthia R., et al.
Published: (2024)
A Novel Transfer Learning Approach for Mental Stability Classification from Voice Signal
by: Islam, Rafiul, et al.
Published: (2026)
by: Islam, Rafiul, et al.
Published: (2026)
sVAD: A Robust, Low-Power, and Light-Weight Voice Activity Detection with Spiking Neural Networks
by: Yang, Qu, et al.
Published: (2024)
by: Yang, Qu, et al.
Published: (2024)
Artificial Neural Networks Trained on Noisy Speech Exhibit the McGurk Effect
by: Grasse, Lukas, et al.
Published: (2024)
by: Grasse, Lukas, et al.
Published: (2024)
Deep Photonic Reservoir Computer for Speech Recognition
by: Picco, Enrico, et al.
Published: (2023)
by: Picco, Enrico, et al.
Published: (2023)
Ternary Spike-based Neuromorphic Signal Processing System
by: Wang, Shuai, et al.
Published: (2024)
by: Wang, Shuai, et al.
Published: (2024)
Spiketrum: An FPGA-based Implementation of a Neuromorphic Cochlea
by: Alsakkal, MHD Anas, et al.
Published: (2024)
by: Alsakkal, MHD Anas, et al.
Published: (2024)
Long-Form Text-to-Music Generation with Adaptive Prompts: A Case Study in Tabletop Role-Playing Games Soundtracks
by: Marra, Felipe, et al.
Published: (2024)
by: Marra, Felipe, et al.
Published: (2024)
How to Estimate Model Transferability of Pre-Trained Speech Models?
by: Chen, Zih-Ching, et al.
Published: (2023)
by: Chen, Zih-Ching, et al.
Published: (2023)
LACTOSE: Linear Array of Conditions, TOpologies with Separated Error-backpropagation -- The Differentiable "IF" Conditional for Differentiable Digital Signal Processing
by: Clarke, Christopher Johann
Published: (2025)
by: Clarke, Christopher Johann
Published: (2025)
Parallel Stacked Aggregated Network for Voice Authentication in IoT-Enabled Smart Devices
by: Khan, Awais, et al.
Published: (2024)
by: Khan, Awais, et al.
Published: (2024)
DPSNN: Spiking Neural Network for Low-Latency Streaming Speech Enhancement
by: Sun, Tao, et al.
Published: (2024)
by: Sun, Tao, et al.
Published: (2024)
Biomimetic Frontend for Differentiable Audio Processing
by: Famularo, Ruolan Leslie, et al.
Published: (2024)
by: Famularo, Ruolan Leslie, et al.
Published: (2024)
Global-Local Convolution with Spiking Neural Networks for Energy-efficient Keyword Spotting
by: Wang, Shuai, et al.
Published: (2024)
by: Wang, Shuai, et al.
Published: (2024)
Robust online reconstruction of continuous-time signals from a lean spike train ensemble code
by: Chattopadhyay, Anik, et al.
Published: (2024)
by: Chattopadhyay, Anik, et al.
Published: (2024)
LVNS-RAVE: Diversified audio generation with RAVE and Latent Vector Novelty Search
by: Guo, Jinyue, et al.
Published: (2024)
by: Guo, Jinyue, et al.
Published: (2024)
Deformable Audio Transformer for Audio Event Detection
by: Zhu, Wentao
Published: (2023)
by: Zhu, Wentao
Published: (2023)
Automatic Voice Identification after Speech Resynthesis using PPG
by: Gaudier, Thibault, et al.
Published: (2024)
by: Gaudier, Thibault, et al.
Published: (2024)
Spoken Conversational Agents with Large Language Models
by: Yang, Chao-Han Huck, et al.
Published: (2025)
by: Yang, Chao-Han Huck, et al.
Published: (2025)
Acoustic neural networks: Identifying design principles and exploring physical feasibility
by: Kalthoff, Ivan, et al.
Published: (2025)
by: Kalthoff, Ivan, et al.
Published: (2025)
HyperSound: Generating Implicit Neural Representations of Audio Signals with Hypernetworks
by: Szatkowski, Filip, et al.
Published: (2022)
by: Szatkowski, Filip, et al.
Published: (2022)
LMUFormer: Low Complexity Yet Powerful Spiking Model With Legendre Memory Units
by: Liu, Zeyu, et al.
Published: (2024)
by: Liu, Zeyu, et al.
Published: (2024)
Emotion Detection Using Conditional Generative Adversarial Networks (cGAN): A Deep Learning Approach
by: Srivastava, Anushka
Published: (2025)
by: Srivastava, Anushka
Published: (2025)
Dilated Convolution with Learnable Spacings
by: Khalfaoui-Hassani, Ismail
Published: (2024)
by: Khalfaoui-Hassani, Ismail
Published: (2024)
Investigating Training Strategies and Model Robustness of Low-Rank Adaptation for Language Modeling in Speech Recognition
by: Yu, Yu, et al.
Published: (2024)
by: Yu, Yu, et al.
Published: (2024)
Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition
by: Yu, Yu, et al.
Published: (2023)
by: Yu, Yu, et al.
Published: (2023)
NEUROSEC: FPGA-Based Neuromorphic Audio Security
by: Isik, Murat, et al.
Published: (2024)
by: Isik, Murat, et al.
Published: (2024)
Accurate Mapping of RNNs on Neuromorphic Hardware with Adaptive Spiking Neurons
by: Boeshertz, Gauthier, et al.
Published: (2024)
by: Boeshertz, Gauthier, et al.
Published: (2024)
A Comparison of Temporal Encoders for Neuromorphic Keyword Spotting with Few Neurons
by: Nilsson, Mattias, et al.
Published: (2023)
by: Nilsson, Mattias, et al.
Published: (2023)
Simulation of Neural Responses to Classical Music Using Organoid Intelligence Methods
by: Szelogowski, Daniel
Published: (2024)
by: Szelogowski, Daniel
Published: (2024)
Scaling and Prompting for Improved End-to-End Spoken Grammatical Error Correction
by: Qian, Mengjie, et al.
Published: (2025)
by: Qian, Mengjie, et al.
Published: (2025)
Data Augmentation for Spoken Grammatical Error Correction
by: Karanasou, Penny, et al.
Published: (2025)
by: Karanasou, Penny, et al.
Published: (2025)
Similar Items
-
Parsing Musical Structure to Enable Meaningful Variations
by: Kanani, Maziar, et al.
Published: (2025) -
Radif Corpus: A Symbolic Dataset for Non-Metric Iranian Classical Music
by: Kanani, Maziar, et al.
Published: (2025) -
Spiking Music: Audio Compression with Event Based Auto-encoders
by: Lisboa, Martim, et al.
Published: (2024) -
Generative Voice Bursts during Phone Call
by: Ranjan, Paritosh, et al.
Published: (2025) -
Spiking-LEAF: A Learnable Auditory front-end for Spiking Neural Networks
by: Song, Zeyang, et al.
Published: (2023)