| Main Authors: | Wu, Xinyi, Wang, Haohong, Katsaggelos, Aggelos K. |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2506.07076 |
Similar Items
Automatic Camera Trajectory Control with Enhanced Immersion for Virtual Cinematography
by: Wu, Xinyi, et al.
Published: (2023)
DIVA-VQA: Detecting Inter-frame Variations in UGC Video Quality
by: Wang, Xinyi, et al.
Published: (2025)
Multimodal Dataset Normalization and Perceptual Validation for Music-Taste Correspondences
by: Spanio, Matteo, et al.
Published: (2026)
SFQA: A Comprehensive Perceptual Quality Assessment Dataset for Singing Face Generation
by: Gao, Zhilin, et al.
Published: (2026)
MotionBeat: Motion-Aligned Music Representation via Embodied Contrastive Learning and Bar-Equivariant Contact-Aware Encoding
by: Wang, Xuanchen, et al.
Published: (2025)
PP-Motion: Physical-Perceptual Fidelity Evaluation for Human Motion Generation
by: Zhao, Sihan, et al.
Published: (2025)
Face Consistency Benchmark for GenAI Video
by: Podstawski, Michal, et al.
Published: (2025)
Avoiding Quality Saturation in UGC Compression Using Denoised References
by: Xiong, Xin, et al.
Published: (2025)
A Tri-Dynamic Preprocessing Framework for UGC Video Compression
by: Zhao, Fei, et al.
Published: (2025)
TPIFM: A Task-Aware Model for Evaluating Perceptual Interaction Fluency in Remote AR Collaboration
by: Song, Jiarun, et al.
Published: (2026)
Video Echoed in Music: Semantic, Temporal, and Rhythmic Alignment for Video-to-Music Generation
by: Tong, Xinyi, et al.
Published: (2025)
Perceptual-oriented Learned Image Compression with Dynamic Kernel
by: Fu, Nianxiang, et al.
Published: (2024)
Intelligent Text-Conditioned Music Generation
by: Xie, Zhouyao, et al.
Published: (2024)
HarmonyIQA: Pioneering Benchmark and Model for Image Harmonization Quality Assessment
by: Xu, Zitong, et al.
Published: (2025)
Harmony: A Unified Framework for Modality Incremental Learning
by: Song, Yaguang, et al.
Published: (2025)
Perceptual Quality Assessment of Octree-RAHT Encoded 3D Point Clouds
by: Duan, Dongshuai, et al.
Published: (2024)
MetaSR: Content-Adaptive Metadata Orchestration for Generative Super-Resolution
by: Guo, Jiaqi, et al.
Published: (2026)
Music4All A+A: A Multimodal Dataset for Music Information Retrieval Tasks
by: Geiger, Jonas, et al.
Published: (2025)
MusicScore: A Dataset for Music Score Modeling and Generation
by: Lin, Yuheng, et al.
Published: (2024)
MusicAOG: an Energy-Based Model for Learning and Sampling a Hierarchical Representation of Symbolic Music
by: Qian, Yikai, et al.
Published: (2024)
MusicWeaver: Composer-Style Structural Editing and Minute-Scale Coherent Music Generation
by: Wang, Xuanchen, et al.
Published: (2025)
Music Grounding by Short Video
by: Xin, Zijie, et al.
Published: (2024)
MusicSem: A Semantically Rich Language-Audio Dataset of Natural Music Descriptions
by: Salganik, Rebecca, et al.
Published: (2026)
MoMu-Diffusion: On Learning Long-Term Motion-Music Synchronization and Correspondence
by: You, Fuming, et al.
Published: (2024)
Multi Agents Semantic Emotion Aligned Music to Image Generation with Music Derived Captions
by: Shi, Junchang, et al.
Published: (2025)
Music-Aligned Holistic 3D Dance Generation via Hierarchical Motion Modeling
by: Li, Xiaojie, et al.
Published: (2025)
Towards Practical Real-Time Low-Latency Music Source Separation
by: Wu, Junyu, et al.
Published: (2025)
DanceCamera3D: 3D Camera Movement Synthesis with Music and Dance
by: Wang, Zixuan, et al.
Published: (2024)
SIDQL: An Efficient Keyframe Extraction and Motion Reconstruction Framework in Motion Capture
by: Zhang, Xuling, et al.
Published: (2024)
Embedded Blockchains: A Synthesis of Blockchains, Spread Spectrum Watermarking, Perceptual Hashing & Digital Signatures
by: Blake, Sam
Published: (2020)
An Inverse Partial Optimal Transport Framework for Music-guided Movie Trailer Generation
by: Wang, Yutong, et al.
Published: (2024)
SonicGauss: Position-Aware Physical Sound Synthesis for 3D Gaussian Representations
by: Wang, Chunshi, et al.
Published: (2025)
Detecting Notational Errors in Digital Music Scores
by: Léo, Géré, et al.
Published: (2025)
Jamendo-QA: A Large-Scale Music Question Answering Dataset
by: Koh, Junyoung, et al.
Published: (2025)
REArtGS: Reconstructing and Generating Articulated Objects via 3D Gaussian Splatting with Geometric and Motion Constraints
by: Wu, Di, et al.
Published: (2025)
Orthogonal Disentanglement with Projected Feature Alignment for Multimodal Emotion Recognition in Conversation
by: Che, Xinyi, et al.
Published: (2025)
Memo2496: Expert-Annotated Dataset and Dual-View Adaptive Framework for Music Emotion Recognition
by: Li, Qilin, et al.
Published: (2025)
Generating Attribute-Aware Human Motions from Textual Prompt
by: Wang, Xinghan, et al.
Published: (2025)
Socially Aware Music Recommendation: A Multi-Modal Graph Neural Networks for Collaborative Music Consumption and Community-Based Engagement
by: Ziaoddini, Kajwan
Published: (2025)
Music Arena: Live Evaluation for Text-to-Music
by: Kim, Yonghyun, et al.
Published: (2025)