| Main Authors: | Gupta, Akshaj, Guzman, Andrea, Badriprasad, Anagha, Park, Hwi Joo, Puranik, Upasana, Netzorg, Robin, Lian, Jiachen, Anumanchipalli, Gopala Krishna |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.02597 |
Similar Items
Speech After Gender: A Trans-Feminine Perspective on Next Steps for Speech Science and Technology
by: Netzorg, Robin, et al.
Published: (2024)
Unsupervised TTS Acoustic Modeling for TTS with Conditional Disentangled Sequential VAE
by: Lian, Jiachen, et al.
Published: (2022)
Audio Texture Manipulation by Exemplar-Based Analogy
by: Cheng, Kan Jen, et al.
Published: (2025)
AV-EMO-Reasoning: Benchmarking Emotional Reasoning Capabilities in Omni-modal LLMS with Audio-visual Cues
by: Zhou, Dingkun, et al.
Published: (2025)
SynthTab: Leveraging Synthesized Data for Guitar Tablature Transcription
by: Zang, Yongyi, et al.
Published: (2023)
StyleStream: Real-Time Zero-Shot Voice Style Conversion
by: Liu, Yisi, et al.
Published: (2026)
TapToTab : Video-Based Guitar Tabs Generation using AI and Audio Analysis
by: Ghaleb, Ali, et al.
Published: (2024)
Towards Generalizability to Tone and Content Variations in the Transcription of Amplifier Rendered Electric Guitar Audio
by: Chen, Yu-Hua, et al.
Published: (2025)
audio2chart: End to End Audio Transcription into playable Guitar Hero charts
by: Tripodi, Riccardo
Published: (2025)
Towards Hierarchical Spoken Language Dysfluency Modeling
by: Lian, Jiachen, et al.
Published: (2024)
Sylber: Syllabic Embedding Representation of Speech from Raw Audio
by: Cho, Cheol Jun, et al.
Published: (2024)
Leveraging Real Electric Guitar Tones and Effects to Improve Robustness in Guitar Tablature Transcription Modeling
by: Pedroza, Hegel, et al.
Published: (2024)
Self-Supervised Audio-Visual Soundscape Stylization
by: Li, Tingle, et al.
Published: (2024)
SSDM: Scalable Speech Dysfluency Modeling
by: Lian, Jiachen, et al.
Published: (2024)
High Resolution Guitar Transcription via Domain Adaptation
by: Riley, Xavier, et al.
Published: (2024)
Joint Transcription of Acoustic Guitar Strumming Directions and Chords
by: Murgul, Sebastian, et al.
Published: (2025)
Fast, High-Quality and Parameter-Efficient Articulatory Synthesis using Differentiable DSP
by: Liu, Yisi, et al.
Published: (2024)
MIDI-to-Tab: Guitar Tablature Inference via Masked Language Modeling
by: Edwards, Drew, et al.
Published: (2024)
GAPS: A Large and Diverse Classical Guitar Dataset and Benchmark Transcription Model
by: Riley, Xavier, et al.
Published: (2024)
Exploring Procedural Data Generation for Automatic Acoustic Guitar Fingerpicking Transcription
by: Murgul, Sebastian, et al.
Published: (2025)
DDSP Guitar Amp: Interpretable Guitar Amplifier Modeling
by: Yeh, Yen-Tung, et al.
Published: (2024)
Guitar-TECHS: An Electric Guitar Dataset Covering Techniques, Musical Excerpts, Chords and Scales Using a Diverse Array of Hardware
by: Pedroza, Hegel, et al.
Published: (2025)
Sound Check: Auditing Audio Datasets
by: Agnew, William, et al.
Published: (2024)
Sounding that Object: Interactive Object-Aware Image to Audio Generation
by: Li, Tingle, et al.
Published: (2025)
GOAT: A Large Dataset of Paired Guitar Audio Recordings and Tablatures
by: Loth, Jackson, et al.
Published: (2025)
K-Function: Joint Pronunciation Transcription and Feedback for Evaluating Kids Language Function
by: Li, Shuhe, et al.
Published: (2025)
Multimodal Segmentation for Vocal Tract Modeling
by: Jain, Rishi, et al.
Published: (2024)
Coding Speech through Vocal Tract Kinematics
by: Cho, Cheol Jun, et al.
Published: (2024)
A Semantic Timbre Dataset for the Electric Guitar
by: Cameron, Joseph, et al.
Published: (2026)
Analysis and Evaluation of Synthetic Data Generation in Speech Dysfluency Detection
by: Zhang, Jinming, et al.
Published: (2025)
Stutter-Solver: End-to-end Multi-lingual Dysfluency Detection
by: Zhou, Xuanru, et al.
Published: (2024)
Time and Tokens: Benchmarking End-to-End Speech Dysfluency Detection
by: Zhou, Xuanru, et al.
Published: (2024)
Guitar Pickups I: Analysis of the Effect of Winding and Wire Gauge on Single Coil Electric Guitar Pickups
by: Batchelor, Charles, et al.
Published: (2024)
Towards Accurate Phonetic Error Detection Through Phoneme Similarity Modeling
by: Zhou, Xuanru, et al.
Published: (2025)
Schrodinger Audio-Visual Editor: Object-Level Audiovisual Removal
by: Xu, Weihan, et al.
Published: (2025)
Beyond Transcription: Unified Audio Schema for Perception-Aware AudioLLMs
by: Zhang, Linhao, et al.
Published: (2026)
GuitarFlow: Realistic Electric Guitar Synthesis From Tablatures via Flow Matching and Style Transfer
by: Loth, Jackson, et al.
Published: (2025)
Production and Manufacturing of 3D Printed Acoustic Guitars
by: Tran, Timothy, et al.
Published: (2025)
A Machine Learning Approach for MIDI to Guitar Tablature Conversion
by: Kaliakatsos-Papakostas, Maximos, et al.
Published: (2025)
Enhancing Lie Detection Accuracy: A Comparative Study of Classic ML, CNN, and GCN Models using Audio-Visual Features
by: Abdelwahab, Abdelrahman, et al.
Published: (2024)