:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Simionato, Riccardo, Fasciani, Stefano
Format:	Preprint
Published:	2024
Subjects:	Sound Artificial Intelligence Audio and Speech Processing
Online Access:	https://arxiv.org/abs/2409.06513
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Modeling Time-Variant Responses of Optical Compressors with Selective State Space Models
by: Simionato, Riccardo, et al.
Published: (2024)

MIDI-VALLE: Improving Expressive Piano Performance Synthesis Through Neural Codec Language Modelling
by: Tang, Jingjing, et al.
Published: (2025)

PianoBART: Symbolic Piano Music Generation and Understanding with Large-Scale Pre-Training
by: Liang, Xiao, et al.
Published: (2024)

Expressive MIDI-format Piano Performance Generation
by: Liu, Jingwei
Published: (2024)

A Holistic Evaluation of Piano Sound Quality
by: Zhou, Monan, et al.
Published: (2023)

Emotion-driven Piano Music Generation via Two-stage Disentanglement and Functional Representation
by: Huang, Jingyue, et al.
Published: (2024)

End-to-End Real-World Polyphonic Piano Audio-to-Score Transcription with Hierarchical Decoding
by: Zeng, Wei, et al.
Published: (2024)

Dialogue in Resonance: An Interactive Music Piece for Piano and Real-Time Automatic Transcription System
by: Bang, Hayeon, et al.
Published: (2025)

PIAST: A Multimodal Piano Dataset with Audio, Symbolic and Text
by: Bang, Hayeon, et al.
Published: (2024)

FürElise: Capturing and Physically Synthesizing Hand Motions of Piano Performance
by: Wang, Ruocheng, et al.
Published: (2024)

Disentangling Score Content and Performance Style for Joint Piano Rendering and Transcription
by: Zeng, Wei, et al.
Published: (2025)

Exploring Classical Piano Performance Generation with Expressive Music Variational AutoEncoder
by: Luo, Jing, et al.
Published: (2025)

Piano Transcription by Hierarchical Language Modeling with Pretrained Roll-based Encoders
by: Li, Dichucheng, et al.
Published: (2025)

D3RM: A Discrete Denoising Diffusion Refinement Model for Piano Transcription
by: Kim, Hounsu, et al.
Published: (2025)

Scaling Self-Supervised Representation Learning for Symbolic Piano Performance
by: Bradshaw, Louis, et al.
Published: (2025)

Estimating Musical Surprisal from Audio in Autoregressive Diffusion Model Noise Spaces
by: Bjare, Mathias Rose, et al.
Published: (2025)

Comparative Study of State-based Neural Networks for Virtual Analog Audio Effects Modeling
by: Simionato, Riccardo, et al.
Published: (2024)

BNMusic: Blending Environmental Noises into Personalized Music
by: Zuo, Chi, et al.
Published: (2025)

PianoVAM: A Multimodal Piano Performance Dataset
by: Kim, Yonghyun, et al.
Published: (2025)

High-Resolution Sustain Pedal Depth Estimation from Piano Audio Across Room Acoustics
by: Fang, Kun, et al.
Published: (2025)

Effects of Dataset Sampling Rate for Noise Cancellation through Deep Learning
by: Colelough, Brandon, et al.
Published: (2024)

Segment-Factorized Full-Song Generation on Symbolic Piano Music
by: Chen, Ping-Yi, et al.
Published: (2025)

HuBERT-VIC: Improving Noise-Robust Automatic Speech Recognition of Speech Foundation Model via Variance-Invariance-Covariance Regularization
by: Ahn, Hyebin, et al.
Published: (2025)

Neuro-MSBG: An End-to-End Neural Model for Hearing Loss Simulation
by: Yuan, Hui-Guan, et al.
Published: (2025)

Noise-Agnostic Multitask Whisper Training for Reducing False Alarm Errors in Call-for-Help Detection
by: Ryu, Myeonghoon, et al.
Published: (2025)

Physics-Informed Neural Engine Sound Modeling with Differentiable Pulse-Train Synthesis
by: Doerfler, Robin, et al.
Published: (2026)

Improving Neural Diarization through Speaker Attribute Attractors and Local Dependency Modeling
by: Palzer, David, et al.
Published: (2025)

Probing the Information Encoded in Neural-based Acoustic Models of Automatic Speech Recognition Systems
by: Raymondaud, Quentin, et al.
Published: (2024)

HH-Codec: High Compression High-fidelity Discrete Neural Codec for Spoken Language Modeling
by: Xue, Rongkun, et al.
Published: (2025)

PianoMotion10M: Dataset and Benchmark for Hand Motion Generation in Piano Performance
by: Gan, Qijun, et al.
Published: (2024)

One-pass Multiple Conformer and Foundation Speech Systems Compression and Quantization Using An All-in-one Neural Model
by: Li, Zhaoqing, et al.
Published: (2024)

A Multi-task Learning Balanced Attention Convolutional Neural Network Model for Few-shot Underwater Acoustic Target Recognition
by: Huang, Wei, et al.
Published: (2025)

Leveraging Mixture of Experts for Improved Speech Deepfake Detection
by: Negroni, Viola, et al.
Published: (2024)

CONMOD: Controllable Neural Frame-based Modulation Effects
by: Lee, Gyubin, et al.
Published: (2024)

SPEAR: Receiver-to-Receiver Acoustic Neural Warping Field
by: He, Yuhang, et al.
Published: (2024)

Stage-Wise and Prior-Aware Neural Speech Phase Prediction
by: Liu, Fei, et al.
Published: (2024)

SpectroStream: A Versatile Neural Codec for General Audio
by: Li, Yunpeng, et al.
Published: (2025)

How Do Neural Spoofing Countermeasures Detect Partially Spoofed Audio?
by: Liu, Tianchi, et al.
Published: (2024)

A Real-Time Voice Activity Detection Based On Lightweight Neural
by: Jia, Jidong, et al.
Published: (2024)

Towards Leveraging Contrastively Pretrained Neural Audio Embeddings for Recommender Tasks
by: Grötschla, Florian, et al.
Published: (2024)