Saved in:
| Main Authors: | Simionato, Riccardo, Fasciani, Stefano |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2409.06513 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Modeling Time-Variant Responses of Optical Compressors with Selective State Space Models
by: Simionato, Riccardo, et al.
Published: (2024)
by: Simionato, Riccardo, et al.
Published: (2024)
MIDI-VALLE: Improving Expressive Piano Performance Synthesis Through Neural Codec Language Modelling
by: Tang, Jingjing, et al.
Published: (2025)
by: Tang, Jingjing, et al.
Published: (2025)
PianoBART: Symbolic Piano Music Generation and Understanding with Large-Scale Pre-Training
by: Liang, Xiao, et al.
Published: (2024)
by: Liang, Xiao, et al.
Published: (2024)
Expressive MIDI-format Piano Performance Generation
by: Liu, Jingwei
Published: (2024)
by: Liu, Jingwei
Published: (2024)
A Holistic Evaluation of Piano Sound Quality
by: Zhou, Monan, et al.
Published: (2023)
by: Zhou, Monan, et al.
Published: (2023)
Emotion-driven Piano Music Generation via Two-stage Disentanglement and Functional Representation
by: Huang, Jingyue, et al.
Published: (2024)
by: Huang, Jingyue, et al.
Published: (2024)
End-to-End Real-World Polyphonic Piano Audio-to-Score Transcription with Hierarchical Decoding
by: Zeng, Wei, et al.
Published: (2024)
by: Zeng, Wei, et al.
Published: (2024)
Dialogue in Resonance: An Interactive Music Piece for Piano and Real-Time Automatic Transcription System
by: Bang, Hayeon, et al.
Published: (2025)
by: Bang, Hayeon, et al.
Published: (2025)
PIAST: A Multimodal Piano Dataset with Audio, Symbolic and Text
by: Bang, Hayeon, et al.
Published: (2024)
by: Bang, Hayeon, et al.
Published: (2024)
FürElise: Capturing and Physically Synthesizing Hand Motions of Piano Performance
by: Wang, Ruocheng, et al.
Published: (2024)
by: Wang, Ruocheng, et al.
Published: (2024)
Disentangling Score Content and Performance Style for Joint Piano Rendering and Transcription
by: Zeng, Wei, et al.
Published: (2025)
by: Zeng, Wei, et al.
Published: (2025)
Exploring Classical Piano Performance Generation with Expressive Music Variational AutoEncoder
by: Luo, Jing, et al.
Published: (2025)
by: Luo, Jing, et al.
Published: (2025)
Piano Transcription by Hierarchical Language Modeling with Pretrained Roll-based Encoders
by: Li, Dichucheng, et al.
Published: (2025)
by: Li, Dichucheng, et al.
Published: (2025)
D3RM: A Discrete Denoising Diffusion Refinement Model for Piano Transcription
by: Kim, Hounsu, et al.
Published: (2025)
by: Kim, Hounsu, et al.
Published: (2025)
Scaling Self-Supervised Representation Learning for Symbolic Piano Performance
by: Bradshaw, Louis, et al.
Published: (2025)
by: Bradshaw, Louis, et al.
Published: (2025)
Estimating Musical Surprisal from Audio in Autoregressive Diffusion Model Noise Spaces
by: Bjare, Mathias Rose, et al.
Published: (2025)
by: Bjare, Mathias Rose, et al.
Published: (2025)
Comparative Study of State-based Neural Networks for Virtual Analog Audio Effects Modeling
by: Simionato, Riccardo, et al.
Published: (2024)
by: Simionato, Riccardo, et al.
Published: (2024)
BNMusic: Blending Environmental Noises into Personalized Music
by: Zuo, Chi, et al.
Published: (2025)
by: Zuo, Chi, et al.
Published: (2025)
PianoVAM: A Multimodal Piano Performance Dataset
by: Kim, Yonghyun, et al.
Published: (2025)
by: Kim, Yonghyun, et al.
Published: (2025)
High-Resolution Sustain Pedal Depth Estimation from Piano Audio Across Room Acoustics
by: Fang, Kun, et al.
Published: (2025)
by: Fang, Kun, et al.
Published: (2025)
Effects of Dataset Sampling Rate for Noise Cancellation through Deep Learning
by: Colelough, Brandon, et al.
Published: (2024)
by: Colelough, Brandon, et al.
Published: (2024)
Segment-Factorized Full-Song Generation on Symbolic Piano Music
by: Chen, Ping-Yi, et al.
Published: (2025)
by: Chen, Ping-Yi, et al.
Published: (2025)
HuBERT-VIC: Improving Noise-Robust Automatic Speech Recognition of Speech Foundation Model via Variance-Invariance-Covariance Regularization
by: Ahn, Hyebin, et al.
Published: (2025)
by: Ahn, Hyebin, et al.
Published: (2025)
Neuro-MSBG: An End-to-End Neural Model for Hearing Loss Simulation
by: Yuan, Hui-Guan, et al.
Published: (2025)
by: Yuan, Hui-Guan, et al.
Published: (2025)
Noise-Agnostic Multitask Whisper Training for Reducing False Alarm Errors in Call-for-Help Detection
by: Ryu, Myeonghoon, et al.
Published: (2025)
by: Ryu, Myeonghoon, et al.
Published: (2025)
Physics-Informed Neural Engine Sound Modeling with Differentiable Pulse-Train Synthesis
by: Doerfler, Robin, et al.
Published: (2026)
by: Doerfler, Robin, et al.
Published: (2026)
Improving Neural Diarization through Speaker Attribute Attractors and Local Dependency Modeling
by: Palzer, David, et al.
Published: (2025)
by: Palzer, David, et al.
Published: (2025)
Probing the Information Encoded in Neural-based Acoustic Models of Automatic Speech Recognition Systems
by: Raymondaud, Quentin, et al.
Published: (2024)
by: Raymondaud, Quentin, et al.
Published: (2024)
HH-Codec: High Compression High-fidelity Discrete Neural Codec for Spoken Language Modeling
by: Xue, Rongkun, et al.
Published: (2025)
by: Xue, Rongkun, et al.
Published: (2025)
PianoMotion10M: Dataset and Benchmark for Hand Motion Generation in Piano Performance
by: Gan, Qijun, et al.
Published: (2024)
by: Gan, Qijun, et al.
Published: (2024)
One-pass Multiple Conformer and Foundation Speech Systems Compression and Quantization Using An All-in-one Neural Model
by: Li, Zhaoqing, et al.
Published: (2024)
by: Li, Zhaoqing, et al.
Published: (2024)
A Multi-task Learning Balanced Attention Convolutional Neural Network Model for Few-shot Underwater Acoustic Target Recognition
by: Huang, Wei, et al.
Published: (2025)
by: Huang, Wei, et al.
Published: (2025)
Leveraging Mixture of Experts for Improved Speech Deepfake Detection
by: Negroni, Viola, et al.
Published: (2024)
by: Negroni, Viola, et al.
Published: (2024)
CONMOD: Controllable Neural Frame-based Modulation Effects
by: Lee, Gyubin, et al.
Published: (2024)
by: Lee, Gyubin, et al.
Published: (2024)
SPEAR: Receiver-to-Receiver Acoustic Neural Warping Field
by: He, Yuhang, et al.
Published: (2024)
by: He, Yuhang, et al.
Published: (2024)
Stage-Wise and Prior-Aware Neural Speech Phase Prediction
by: Liu, Fei, et al.
Published: (2024)
by: Liu, Fei, et al.
Published: (2024)
SpectroStream: A Versatile Neural Codec for General Audio
by: Li, Yunpeng, et al.
Published: (2025)
by: Li, Yunpeng, et al.
Published: (2025)
How Do Neural Spoofing Countermeasures Detect Partially Spoofed Audio?
by: Liu, Tianchi, et al.
Published: (2024)
by: Liu, Tianchi, et al.
Published: (2024)
A Real-Time Voice Activity Detection Based On Lightweight Neural
by: Jia, Jidong, et al.
Published: (2024)
by: Jia, Jidong, et al.
Published: (2024)
Towards Leveraging Contrastively Pretrained Neural Audio Embeddings for Recommender Tasks
by: Grötschla, Florian, et al.
Published: (2024)
by: Grötschla, Florian, et al.
Published: (2024)
Similar Items
-
Modeling Time-Variant Responses of Optical Compressors with Selective State Space Models
by: Simionato, Riccardo, et al.
Published: (2024) -
MIDI-VALLE: Improving Expressive Piano Performance Synthesis Through Neural Codec Language Modelling
by: Tang, Jingjing, et al.
Published: (2025) -
PianoBART: Symbolic Piano Music Generation and Understanding with Large-Scale Pre-Training
by: Liang, Xiao, et al.
Published: (2024) -
Expressive MIDI-format Piano Performance Generation
by: Liu, Jingwei
Published: (2024) -
A Holistic Evaluation of Piano Sound Quality
by: Zhou, Monan, et al.
Published: (2023)