Saved in:
| Main Authors: | Stowell, Dan, Wood, Mike, Stylianou, Yannis, Glotin, Hervé |
|---|---|
| Format: | Preprint |
| Published: |
2016
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/1608.03417 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Automatic acoustic detection of birds through deep learning: the first Bird Audio Detection challenge
by: Stowell, Dan, et al.
Published: (2018)
by: Stowell, Dan, et al.
Published: (2018)
Computational bioacoustics with deep learning: a review and roadmap
by: Stowell, Dan
Published: (2021)
by: Stowell, Dan
Published: (2021)
Adaptive Representations of Sound for Automatic Insect Recognition
by: Faiß, Marius, et al.
Published: (2023)
by: Faiß, Marius, et al.
Published: (2023)
Rank-based loss for learning hierarchical representations
by: Nolasco, Ines, et al.
Published: (2021)
by: Nolasco, Ines, et al.
Published: (2021)
InsectSet459: an open dataset of insect sounds for bioacoustic machine learning
by: Faiß, Marius, et al.
Published: (2025)
by: Faiß, Marius, et al.
Published: (2025)
Investigating self-supervised representations for audio-visual deepfake detection
by: Boldisor, Dragos-Alexandru, et al.
Published: (2025)
by: Boldisor, Dragos-Alexandru, et al.
Published: (2025)
Audio-visual video-to-speech synthesis with synthesized input audio
by: Kefalas, Triantafyllos, et al.
Published: (2023)
by: Kefalas, Triantafyllos, et al.
Published: (2023)
Are audio DeepFake detection models polyglots?
by: Marek, Bartłomiej, et al.
Published: (2024)
by: Marek, Bartłomiej, et al.
Published: (2024)
Large-scale unsupervised audio pre-training for video-to-speech synthesis
by: Kefalas, Triantafyllos, et al.
Published: (2023)
by: Kefalas, Triantafyllos, et al.
Published: (2023)
Acoustic identification of individual animals with hierarchical contrastive learning
by: Nolasco, Ines, et al.
Published: (2024)
by: Nolasco, Ines, et al.
Published: (2024)
LHGNN: Local-Higher Order Graph Neural Networks For Audio Classification and Tagging
by: Singh, Shubhr, et al.
Published: (2025)
by: Singh, Shubhr, et al.
Published: (2025)
Towards generalizing deep-audio fake detection networks
by: Gasenzer, Konstantin, et al.
Published: (2023)
by: Gasenzer, Konstantin, et al.
Published: (2023)
Circumventing shortcuts in audio-visual deepfake detection datasets with unsupervised learning
by: Smeu, Stefan, et al.
Published: (2024)
by: Smeu, Stefan, et al.
Published: (2024)
An RFP dataset for Real, Fake, and Partially fake audio detection
by: AlAli, Abdulazeez, et al.
Published: (2024)
by: AlAli, Abdulazeez, et al.
Published: (2024)
Unsupervised outlier detection to improve bird audio dataset labels
by: Collins, Bruce
Published: (2025)
by: Collins, Bruce
Published: (2025)
A robust audio deepfake detection system via multi-view feature
by: Yang, Yujie, et al.
Published: (2024)
by: Yang, Yujie, et al.
Published: (2024)
Where are we in audio deepfake detection? A systematic analysis over generative and detection models
by: Li, Xiang, et al.
Published: (2024)
by: Li, Xiang, et al.
Published: (2024)
Scaling up masked audio encoder learning for general audio classification
by: Dinkel, Heinrich, et al.
Published: (2024)
by: Dinkel, Heinrich, et al.
Published: (2024)
animal2vec and MeerKAT: A self-supervised transformer for rare-event raw audio input and a large-scale reference dataset for bioacoustics
by: Schäfer-Zimmermann, Julian C., et al.
Published: (2024)
by: Schäfer-Zimmermann, Julian C., et al.
Published: (2024)
Forensic deepfake audio detection using segmental speech features
by: Yang, Tianle, et al.
Published: (2025)
by: Yang, Tianle, et al.
Published: (2025)
SMART: Tuning a symbolic music generation system with an audio domain aesthetic reward
by: Jonason, Nicolas, et al.
Published: (2025)
by: Jonason, Nicolas, et al.
Published: (2025)
Stage-adaptive audio diffusion modeling
by: Zhang, Xuanhao, et al.
Published: (2026)
by: Zhang, Xuanhao, et al.
Published: (2026)
Emoanti: audio anti-deepfake with refined emotion-guided representations
by: Li, Xiaokang, et al.
Published: (2025)
by: Li, Xiaokang, et al.
Published: (2025)
Visual and audio scene classification for detecting discrepancies in video: a baseline method and experimental protocol
by: Apostolidis, Konstantinos, et al.
Published: (2024)
by: Apostolidis, Konstantinos, et al.
Published: (2024)
GRAM: Spatial general-purpose audio representations for real-world environments
by: Yuksel, Goksenin, et al.
Published: (2026)
by: Yuksel, Goksenin, et al.
Published: (2026)
TQCodec: Towards neural audio codec for high-fidelity music streaming
by: He, Lixing, et al.
Published: (2026)
by: He, Lixing, et al.
Published: (2026)
Towards audio language modeling -- an overview
by: Wu, Haibin, et al.
Published: (2024)
by: Wu, Haibin, et al.
Published: (2024)
Mellow: a small audio language model for reasoning
by: Deshmukh, Soham, et al.
Published: (2025)
by: Deshmukh, Soham, et al.
Published: (2025)
Generalization in birdsong classification: impact of transfer learning methods and dataset characteristics
by: Ghani, Burooj, et al.
Published: (2024)
by: Ghani, Burooj, et al.
Published: (2024)
LibriVAD: A Scalable Open Dataset with Deep Learning Benchmarks for Voice Activity Detection
by: Stylianou, Ioannis, et al.
Published: (2025)
by: Stylianou, Ioannis, et al.
Published: (2025)
Training chord recognition models on artificially generated audio
by: Majchrzak, Martyna, et al.
Published: (2025)
by: Majchrzak, Martyna, et al.
Published: (2025)
Discriminant audio properties in deep learning based respiratory insufficiency detection in Brazilian Portuguese
by: Gauy, Marcelo Matheus, et al.
Published: (2024)
by: Gauy, Marcelo Matheus, et al.
Published: (2024)
Sustaining model performance for covid-19 detection from dynamic audio data: Development and evaluation of a comprehensive drift-adaptive framework
by: Ganitidis, Theofanis, et al.
Published: (2024)
by: Ganitidis, Theofanis, et al.
Published: (2024)
Exploring trends in audio mixes and masters: Insights from a dataset analysis
by: Mourgela, Angeliki, et al.
Published: (2024)
by: Mourgela, Angeliki, et al.
Published: (2024)
Modeling strategies for speech enhancement in the latent space of a neural audio codec
by: Kammoun, Sofiene, et al.
Published: (2025)
by: Kammoun, Sofiene, et al.
Published: (2025)
Spatial-CLAP: Learning Spatially-Aware audio--text Embeddings for Multi-Source Conditions
by: Seki, Kentaro, et al.
Published: (2025)
by: Seki, Kentaro, et al.
Published: (2025)
One Prompt, Many Sounds: Modeling Listener Variability in LLM-Based Equalization
by: Stylianou, Ioannis, et al.
Published: (2026)
by: Stylianou, Ioannis, et al.
Published: (2026)
ParaCLAP -- Towards a general language-audio model for computational paralinguistic tasks
by: Jing, Xin, et al.
Published: (2024)
by: Jing, Xin, et al.
Published: (2024)
Tweaking autoregressive methods for inpainting of gaps in audio signals
by: Mokrý, Ondřej, et al.
Published: (2024)
by: Mokrý, Ondřej, et al.
Published: (2024)
MBCodec:Thorough disentangle for high-fidelity audio compression
by: Zhang, Ruonan, et al.
Published: (2025)
by: Zhang, Ruonan, et al.
Published: (2025)
Similar Items
-
Automatic acoustic detection of birds through deep learning: the first Bird Audio Detection challenge
by: Stowell, Dan, et al.
Published: (2018) -
Computational bioacoustics with deep learning: a review and roadmap
by: Stowell, Dan
Published: (2021) -
Adaptive Representations of Sound for Automatic Insect Recognition
by: Faiß, Marius, et al.
Published: (2023) -
Rank-based loss for learning hierarchical representations
by: Nolasco, Ines, et al.
Published: (2021) -
InsectSet459: an open dataset of insect sounds for bioacoustic machine learning
by: Faiß, Marius, et al.
Published: (2025)