Saved in:
| Main Authors: | Chhaglani, Bhawana, Gummeson, Jeremy, Shenoy, Prashant |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2404.18002 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
FeatureSense: Protecting Speaker Attributes in Always-On Audio Sensing System
by: Chhaglani, Bhawana, et al.
Published: (2025)
by: Chhaglani, Bhawana, et al.
Published: (2025)
NeckCare: Preventing Tech Neck using Hearable-based Multimodal Sensing
by: Chhaglani, Bhawana, et al.
Published: (2024)
by: Chhaglani, Bhawana, et al.
Published: (2024)
Quantum-Inspired Audio Unlearning: Towards Privacy-Preserving Voice Biometrics
by: Pathak, Shreyansh, et al.
Published: (2025)
by: Pathak, Shreyansh, et al.
Published: (2025)
UniAudio: An Audio Foundation Model Toward Universal Audio Generation
by: Yang, Dongchao, et al.
Published: (2023)
by: Yang, Dongchao, et al.
Published: (2023)
Network Modulation Synthesis: New Algorithms for Generating Musical Audio Using Autoencoder Networks
by: Hyrkas, Jeremy
Published: (2021)
by: Hyrkas, Jeremy
Published: (2021)
Beyond Classification: Towards Speech Emotion Reasoning with Multitask AudioLLMs
by: Zhang, Wenyu, et al.
Published: (2025)
by: Zhang, Wenyu, et al.
Published: (2025)
Angular Distance Distribution Loss for Audio Classification
by: Almudévar, Antonio, et al.
Published: (2024)
by: Almudévar, Antonio, et al.
Published: (2024)
PAT: Parameter-Free Audio-Text Aligner to Boost Zero-Shot Audio Classification
by: Seth, Ashish, et al.
Published: (2024)
by: Seth, Ashish, et al.
Published: (2024)
MACE: Leveraging Audio for Evaluating Audio Captioning Systems
by: Dixit, Satvik, et al.
Published: (2024)
by: Dixit, Satvik, et al.
Published: (2024)
Class-Incremental Learning for Multi-Label Audio Classification
by: Mulimani, Manjunath, et al.
Published: (2024)
by: Mulimani, Manjunath, et al.
Published: (2024)
Advancing Continual Learning for Robust Deepfake Audio Classification
by: Dong, Feiyi, et al.
Published: (2024)
by: Dong, Feiyi, et al.
Published: (2024)
Benchmarking Time-localized Explanations for Audio Classification Models
by: Bolaños, Cecilia, et al.
Published: (2025)
by: Bolaños, Cecilia, et al.
Published: (2025)
AudioComposer: Towards Fine-grained Audio Generation with Natural Language Descriptions
by: Wang, Yuanyuan, et al.
Published: (2024)
by: Wang, Yuanyuan, et al.
Published: (2024)
Towards Weakly Supervised Text-to-Audio Grounding
by: Xu, Xuenan, et al.
Published: (2024)
by: Xu, Xuenan, et al.
Published: (2024)
Toward a Sparse and Interpretable Audio Codec
by: Vinyard, John
Published: (2025)
by: Vinyard, John
Published: (2025)
Towards Neural Audio Codec Source Parsing
by: Phukan, Orchid Chetia, et al.
Published: (2025)
by: Phukan, Orchid Chetia, et al.
Published: (2025)
Structural and Statistical Audio Texture Knowledge Distillation for Acoustic Classification
by: Ritu, Jarin, et al.
Published: (2025)
by: Ritu, Jarin, et al.
Published: (2025)
Hyperbolic Embeddings for Order-Aware Classification of Audio Effect Chains
by: Wada, Aogu, et al.
Published: (2025)
by: Wada, Aogu, et al.
Published: (2025)
Towards Fusion of Neural Audio Codec-based Representations with Spectral for Heart Murmur Classification via Bandit-based Cross-Attention Mechanism
by: Phukan, Orchid Chetia, et al.
Published: (2025)
by: Phukan, Orchid Chetia, et al.
Published: (2025)
Adversarial Representation Learning for Robust Privacy Preservation in Audio
by: Gharib, Shayan, et al.
Published: (2023)
by: Gharib, Shayan, et al.
Published: (2023)
Code Drift: Towards Idempotent Neural Audio Codecs
by: O'Reilly, Patrick, et al.
Published: (2024)
by: O'Reilly, Patrick, et al.
Published: (2024)
Towards Spatial Audio Understanding via Question Answering
by: Sudarsanam, Parthasaarathy, et al.
Published: (2025)
by: Sudarsanam, Parthasaarathy, et al.
Published: (2025)
ASGIR: Audio Spectrogram Transformer Guided Classification And Information Retrieval For Birds
by: Chaudhuri, Yashwardhan, et al.
Published: (2024)
by: Chaudhuri, Yashwardhan, et al.
Published: (2024)
Source Tracing of Audio Deepfake Systems
by: Klein, Nicholas, et al.
Published: (2024)
by: Klein, Nicholas, et al.
Published: (2024)
DeFT-Mamba: Universal Multichannel Sound Separation and Polyphonic Audio Classification
by: Lee, Dongheon, et al.
Published: (2024)
by: Lee, Dongheon, et al.
Published: (2024)
TAME: Temporal Audio-based Mamba for Enhanced Drone Trajectory Estimation and Classification
by: Xiao, Zhenyuan, et al.
Published: (2024)
by: Xiao, Zhenyuan, et al.
Published: (2024)
Leveraging Self-supervised Audio Representations for Data-Efficient Acoustic Scene Classification
by: Cai, Yiqiang, et al.
Published: (2024)
by: Cai, Yiqiang, et al.
Published: (2024)
ADD 2023: Towards Audio Deepfake Detection and Analysis in the Wild
by: Yi, Jiangyan, et al.
Published: (2024)
by: Yi, Jiangyan, et al.
Published: (2024)
The T12 System for AudioMOS Challenge 2025: Audio Aesthetics Score Prediction System Using KAN- and VERSA-based Models
by: Yamamoto, Katsuhiko, et al.
Published: (2025)
by: Yamamoto, Katsuhiko, et al.
Published: (2025)
Evaluating CNN with Stacked Feature Representations and Audio Spectrogram Transformer Models for Sound Classification
by: Dehaghania, Parinaz Binandeh, et al.
Published: (2026)
by: Dehaghania, Parinaz Binandeh, et al.
Published: (2026)
BANC: Towards Efficient Binaural Audio Neural Codec for Overlapping Speech
by: Ratnarajah, Anton, et al.
Published: (2023)
by: Ratnarajah, Anton, et al.
Published: (2023)
Open-Set Source Tracing of Audio Deepfake Systems
by: Klein, Nicholas, et al.
Published: (2025)
by: Klein, Nicholas, et al.
Published: (2025)
Fully Few-shot Class-incremental Audio Classification Using Expandable Dual-embedding Extractor
by: Si, Yongjie, et al.
Published: (2024)
by: Si, Yongjie, et al.
Published: (2024)
Towards Robust Audio Deepfake Detection: A Evolving Benchmark for Continual Learning
by: Zhang, Xiaohui, et al.
Published: (2024)
by: Zhang, Xiaohui, et al.
Published: (2024)
ASTAR-NTU solution to AudioMOS Challenge 2025 Track1
by: Ritter-Gutierrez, Fabian, et al.
Published: (2025)
by: Ritter-Gutierrez, Fabian, et al.
Published: (2025)
Do Music Source Separation Models Preserve Spatial Information in Binaural Audio?
by: Namballa, Richa, et al.
Published: (2025)
by: Namballa, Richa, et al.
Published: (2025)
Enhancing Zero-shot Audio Classification using Sound Attribute Knowledge from Large Language Models
by: Xu, Xuenan, et al.
Published: (2024)
by: Xu, Xuenan, et al.
Published: (2024)
Audio-Based Classification of Insect Species Using Machine Learning Models: Cicada, Beetle, Termite, and Cricket
by: Shetty, Manas V, et al.
Published: (2025)
by: Shetty, Manas V, et al.
Published: (2025)
MOSS-Audio-Tokenizer: Scaling Audio Tokenizers for Future Audio Foundation Models
by: Gong, Yitian, et al.
Published: (2026)
by: Gong, Yitian, et al.
Published: (2026)
Analysis of ABC Frontend Audio Systems for the NIST-SRE24
by: Barahona, Sara, et al.
Published: (2025)
by: Barahona, Sara, et al.
Published: (2025)
Similar Items
-
FeatureSense: Protecting Speaker Attributes in Always-On Audio Sensing System
by: Chhaglani, Bhawana, et al.
Published: (2025) -
NeckCare: Preventing Tech Neck using Hearable-based Multimodal Sensing
by: Chhaglani, Bhawana, et al.
Published: (2024) -
Quantum-Inspired Audio Unlearning: Towards Privacy-Preserving Voice Biometrics
by: Pathak, Shreyansh, et al.
Published: (2025) -
UniAudio: An Audio Foundation Model Toward Universal Audio Generation
by: Yang, Dongchao, et al.
Published: (2023) -
Network Modulation Synthesis: New Algorithms for Generating Musical Audio Using Autoencoder Networks
by: Hyrkas, Jeremy
Published: (2021)