:: Library Catalog

Bibliographic Details
Main Authors:	Gupta, Akshaj; Guzman, Andrea; Badriprasad, Anagha; Park, Hwi Joo; Puranik, Upasana; Netzorg, Robin; Lian, Jiachen; Anumanchipalli, Gopala Krishna
Format:	Preprint
Published:	2025
Subjects:	Sound
Online Access:	https://arxiv.org/abs/2510.02597

Similar Items

Speech After Gender: A Trans-Feminine Perspective on Next Steps for Speech Science and Technology
by: Netzorg, Robin, et al.
Published: (2024)

Unsupervised TTS Acoustic Modeling for TTS with Conditional Disentangled Sequential VAE
by: Lian, Jiachen, et al.
Published: (2022)

Audio Texture Manipulation by Exemplar-Based Analogy
by: Cheng, Kan Jen, et al.
Published: (2025)

AV-EMO-Reasoning: Benchmarking Emotional Reasoning Capabilities in Omni-modal LLMS with Audio-visual Cues
by: Zhou, Dingkun, et al.
Published: (2025)

SynthTab: Leveraging Synthesized Data for Guitar Tablature Transcription
by: Zang, Yongyi, et al.
Published: (2023)

StyleStream: Real-Time Zero-Shot Voice Style Conversion
by: Liu, Yisi, et al.
Published: (2026)

TapToTab: Video-Based Guitar Tabs Generation using AI and Audio Analysis
by: Ghaleb, Ali, et al.
Published: (2024)

Towards Generalizability to Tone and Content Variations in the Transcription of Amplifier Rendered Electric Guitar Audio
by: Chen, Yu-Hua, et al.
Published: (2025)

audio2chart: End to End Audio Transcription into playable Guitar Hero charts
by: Tripodi, Riccardo
Published: (2025)

Towards Hierarchical Spoken Language Dysfluency Modeling
by: Lian, Jiachen, et al.
Published: (2024)

Sylber: Syllabic Embedding Representation of Speech from Raw Audio
by: Cho, Cheol Jun, et al.
Published: (2024)

Leveraging Real Electric Guitar Tones and Effects to Improve Robustness in Guitar Tablature Transcription Modeling
by: Pedroza, Hegel, et al.
Published: (2024)

Self-Supervised Audio-Visual Soundscape Stylization
by: Li, Tingle, et al.
Published: (2024)

SSDM: Scalable Speech Dysfluency Modeling
by: Lian, Jiachen, et al.
Published: (2024)

High Resolution Guitar Transcription via Domain Adaptation
by: Riley, Xavier, et al.
Published: (2024)

Joint Transcription of Acoustic Guitar Strumming Directions and Chords
by: Murgul, Sebastian, et al.
Published: (2025)

Fast, High-Quality and Parameter-Efficient Articulatory Synthesis using Differentiable DSP
by: Liu, Yisi, et al.
Published: (2024)

MIDI-to-Tab: Guitar Tablature Inference via Masked Language Modeling
by: Edwards, Drew, et al.
Published: (2024)

GAPS: A Large and Diverse Classical Guitar Dataset and Benchmark Transcription Model
by: Riley, Xavier, et al.
Published: (2024)

Exploring Procedural Data Generation for Automatic Acoustic Guitar Fingerpicking Transcription
by: Murgul, Sebastian, et al.
Published: (2025)

DDSP Guitar Amp: Interpretable Guitar Amplifier Modeling
by: Yeh, Yen-Tung, et al.
Published: (2024)

Guitar-TECHS: An Electric Guitar Dataset Covering Techniques, Musical Excerpts, Chords and Scales Using a Diverse Array of Hardware
by: Pedroza, Hegel, et al.
Published: (2025)

Sound Check: Auditing Audio Datasets
by: Agnew, William, et al.
Published: (2024)

Sounding that Object: Interactive Object-Aware Image to Audio Generation
by: Li, Tingle, et al.
Published: (2025)

GOAT: A Large Dataset of Paired Guitar Audio Recordings and Tablatures
by: Loth, Jackson, et al.
Published: (2025)

K-Function: Joint Pronunciation Transcription and Feedback for Evaluating Kids Language Function
by: Li, Shuhe, et al.
Published: (2025)

Multimodal Segmentation for Vocal Tract Modeling
by: Jain, Rishi, et al.
Published: (2024)

Coding Speech through Vocal Tract Kinematics
by: Cho, Cheol Jun, et al.
Published: (2024)

A Semantic Timbre Dataset for the Electric Guitar
by: Cameron, Joseph, et al.
Published: (2026)

Analysis and Evaluation of Synthetic Data Generation in Speech Dysfluency Detection
by: Zhang, Jinming, et al.
Published: (2025)

Stutter-Solver: End-to-end Multi-lingual Dysfluency Detection
by: Zhou, Xuanru, et al.
Published: (2024)

Time and Tokens: Benchmarking End-to-End Speech Dysfluency Detection
by: Zhou, Xuanru, et al.
Published: (2024)

Guitar Pickups I: Analysis of the Effect of Winding and Wire Gauge on Single Coil Electric Guitar Pickups
by: Batchelor, Charles, et al.
Published: (2024)

Towards Accurate Phonetic Error Detection Through Phoneme Similarity Modeling
by: Zhou, Xuanru, et al.
Published: (2025)

Schrodinger Audio-Visual Editor: Object-Level Audiovisual Removal
by: Xu, Weihan, et al.
Published: (2025)

Beyond Transcription: Unified Audio Schema for Perception-Aware AudioLLMs
by: Zhang, Linhao, et al.
Published: (2026)

GuitarFlow: Realistic Electric Guitar Synthesis From Tablatures via Flow Matching and Style Transfer
by: Loth, Jackson, et al.
Published: (2025)

Production and Manufacturing of 3D Printed Acoustic Guitars
by: Tran, Timothy, et al.
Published: (2025)

A Machine Learning Approach for MIDI to Guitar Tablature Conversion
by: Kaliakatsos-Papakostas, Maximos, et al.
Published: (2025)

Enhancing Lie Detection Accuracy: A Comparative Study of Classic ML, CNN, and GCN Models using Audio-Visual Features
by: Abdelwahab, Abdelrahman, et al.
Published: (2024)