:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Stowell, Dan, Wood, Mike, Stylianou, Yannis, Glotin, Hervé
Format:	Preprint
Published:	2016
Subjects:	Sound
Online Access:	https://arxiv.org/abs/1608.03417
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Automatic acoustic detection of birds through deep learning: the first Bird Audio Detection challenge
by: Stowell, Dan, et al.
Published: (2018)

Computational bioacoustics with deep learning: a review and roadmap
by: Stowell, Dan
Published: (2021)

Adaptive Representations of Sound for Automatic Insect Recognition
by: Faiß, Marius, et al.
Published: (2023)

Rank-based loss for learning hierarchical representations
by: Nolasco, Ines, et al.
Published: (2021)

InsectSet459: an open dataset of insect sounds for bioacoustic machine learning
by: Faiß, Marius, et al.
Published: (2025)

Investigating self-supervised representations for audio-visual deepfake detection
by: Boldisor, Dragos-Alexandru, et al.
Published: (2025)

Audio-visual video-to-speech synthesis with synthesized input audio
by: Kefalas, Triantafyllos, et al.
Published: (2023)

Are audio DeepFake detection models polyglots?
by: Marek, Bartłomiej, et al.
Published: (2024)

Large-scale unsupervised audio pre-training for video-to-speech synthesis
by: Kefalas, Triantafyllos, et al.
Published: (2023)

Acoustic identification of individual animals with hierarchical contrastive learning
by: Nolasco, Ines, et al.
Published: (2024)

LHGNN: Local-Higher Order Graph Neural Networks For Audio Classification and Tagging
by: Singh, Shubhr, et al.
Published: (2025)

Towards generalizing deep-audio fake detection networks
by: Gasenzer, Konstantin, et al.
Published: (2023)

Circumventing shortcuts in audio-visual deepfake detection datasets with unsupervised learning
by: Smeu, Stefan, et al.
Published: (2024)

An RFP dataset for Real, Fake, and Partially fake audio detection
by: AlAli, Abdulazeez, et al.
Published: (2024)

Unsupervised outlier detection to improve bird audio dataset labels
by: Collins, Bruce
Published: (2025)

A robust audio deepfake detection system via multi-view feature
by: Yang, Yujie, et al.
Published: (2024)

Where are we in audio deepfake detection? A systematic analysis over generative and detection models
by: Li, Xiang, et al.
Published: (2024)

Scaling up masked audio encoder learning for general audio classification
by: Dinkel, Heinrich, et al.
Published: (2024)

animal2vec and MeerKAT: A self-supervised transformer for rare-event raw audio input and a large-scale reference dataset for bioacoustics
by: Schäfer-Zimmermann, Julian C., et al.
Published: (2024)

Forensic deepfake audio detection using segmental speech features
by: Yang, Tianle, et al.
Published: (2025)

SMART: Tuning a symbolic music generation system with an audio domain aesthetic reward
by: Jonason, Nicolas, et al.
Published: (2025)

Stage-adaptive audio diffusion modeling
by: Zhang, Xuanhao, et al.
Published: (2026)

Emoanti: audio anti-deepfake with refined emotion-guided representations
by: Li, Xiaokang, et al.
Published: (2025)

Visual and audio scene classification for detecting discrepancies in video: a baseline method and experimental protocol
by: Apostolidis, Konstantinos, et al.
Published: (2024)

GRAM: Spatial general-purpose audio representations for real-world environments
by: Yuksel, Goksenin, et al.
Published: (2026)

TQCodec: Towards neural audio codec for high-fidelity music streaming
by: He, Lixing, et al.
Published: (2026)

Towards audio language modeling -- an overview
by: Wu, Haibin, et al.
Published: (2024)

Mellow: a small audio language model for reasoning
by: Deshmukh, Soham, et al.
Published: (2025)

Generalization in birdsong classification: impact of transfer learning methods and dataset characteristics
by: Ghani, Burooj, et al.
Published: (2024)

LibriVAD: A Scalable Open Dataset with Deep Learning Benchmarks for Voice Activity Detection
by: Stylianou, Ioannis, et al.
Published: (2025)

Training chord recognition models on artificially generated audio
by: Majchrzak, Martyna, et al.
Published: (2025)

Discriminant audio properties in deep learning based respiratory insufficiency detection in Brazilian Portuguese
by: Gauy, Marcelo Matheus, et al.
Published: (2024)

Sustaining model performance for covid-19 detection from dynamic audio data: Development and evaluation of a comprehensive drift-adaptive framework
by: Ganitidis, Theofanis, et al.
Published: (2024)

Exploring trends in audio mixes and masters: Insights from a dataset analysis
by: Mourgela, Angeliki, et al.
Published: (2024)

Modeling strategies for speech enhancement in the latent space of a neural audio codec
by: Kammoun, Sofiene, et al.
Published: (2025)

Spatial-CLAP: Learning Spatially-Aware audio--text Embeddings for Multi-Source Conditions
by: Seki, Kentaro, et al.
Published: (2025)

One Prompt, Many Sounds: Modeling Listener Variability in LLM-Based Equalization
by: Stylianou, Ioannis, et al.
Published: (2026)

ParaCLAP -- Towards a general language-audio model for computational paralinguistic tasks
by: Jing, Xin, et al.
Published: (2024)

Tweaking autoregressive methods for inpainting of gaps in audio signals
by: Mokrý, Ondřej, et al.
Published: (2024)

MBCodec:Thorough disentangle for high-fidelity audio compression
by: Zhang, Ruonan, et al.
Published: (2025)