Saved in:
| Main Authors: | Cui, Kai, Li, Jia, Liu, Yu, Zhang, Xuesong, Hu, Zhenzhen, Wang, Meng |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2504.17163 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Cross-domain EEG-based Emotion Recognition with Contrastive Learning
by: Yan, Rui, et al.
Published: (2025)
by: Yan, Rui, et al.
Published: (2025)
InfoSyncNet: Information Synchronization Temporal Convolutional Network for Visual Speech Recognition
by: Xue, Junxiao, et al.
Published: (2025)
by: Xue, Junxiao, et al.
Published: (2025)
SyncVP: Joint Diffusion for Synchronous Multi-Modal Video Prediction
by: Pallotta, Enrico, et al.
Published: (2025)
by: Pallotta, Enrico, et al.
Published: (2025)
VisioPhysioENet: Visual Physiological Engagement Detection Network
by: Singh, Alakhsimar, et al.
Published: (2024)
by: Singh, Alakhsimar, et al.
Published: (2024)
RocSync: Millisecond-Accurate Temporal Synchronization for Heterogeneous Camera Systems
by: Meyer, Jaro, et al.
Published: (2025)
by: Meyer, Jaro, et al.
Published: (2025)
SyncVIS: Synchronized Video Instance Segmentation
by: Zheng, Rongkun, et al.
Published: (2024)
by: Zheng, Rongkun, et al.
Published: (2024)
SyncDPO: Enhancing Temporal Synchronization in Video-Audio Joint Generation via Preference Learning
by: Cheng, Xin, et al.
Published: (2026)
by: Cheng, Xin, et al.
Published: (2026)
SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis
by: Peng, Ziqiao, et al.
Published: (2023)
by: Peng, Ziqiao, et al.
Published: (2023)
Adaptive Physical-Facial Representation Fusion via Subject-Invariant Cross-Modal Prompt Tuning for Video-Based Emotion Recognition
by: Luo, Xiwen, et al.
Published: (2026)
by: Luo, Xiwen, et al.
Published: (2026)
Partial Label Learning for Emotion Recognition from EEG
by: Zhang, Guangyi, et al.
Published: (2023)
by: Zhang, Guangyi, et al.
Published: (2023)
Seeing is Believing? Enhancing Vision-Language Navigation using Visual Perturbations
by: Zhang, Xuesong, et al.
Published: (2024)
by: Zhang, Xuesong, et al.
Published: (2024)
Disentangling Foreground and Background for vision-Language Navigation via Online Augmentation
by: Xu, Yunbo, et al.
Published: (2025)
by: Xu, Yunbo, et al.
Published: (2025)
SyncTweedies: A General Generative Framework Based on Synchronized Diffusions
by: Kim, Jaihoon, et al.
Published: (2024)
by: Kim, Jaihoon, et al.
Published: (2024)
VAEmo: Efficient Representation Learning for Visual-Audio Emotion with Knowledge Injection
by: Cheng, Hao, et al.
Published: (2025)
by: Cheng, Hao, et al.
Published: (2025)
Agent Journey Beyond RGB: Hierarchical Semantic-Spatial Representation Enrichment for Vision-and-Language Navigation
by: Zhang, Xuesong, et al.
Published: (2024)
by: Zhang, Xuesong, et al.
Published: (2024)
MGHFT: Multi-Granularity Hierarchical Fusion Transformer for Cross-Modal Sticker Emotion Recognition
by: Chen, Jian, et al.
Published: (2025)
by: Chen, Jian, et al.
Published: (2025)
Cross-Modal Consistency Learning for Sign Language Recognition
by: Wu, Kepeng, et al.
Published: (2025)
by: Wu, Kepeng, et al.
Published: (2025)
UniSync: Towards Generalizable and High-Fidelity Lip Synchronization for Challenging Scenarios
by: Fan, Ruidi, et al.
Published: (2026)
by: Fan, Ruidi, et al.
Published: (2026)
Visual Sync: Multi-Camera Synchronization via Cross-View Object Motion
by: Liu, Shaowei, et al.
Published: (2025)
by: Liu, Shaowei, et al.
Published: (2025)
MCN-CL: Multimodal Cross-Attention Network and Contrastive Learning for Multimodal Emotion Recognition
by: Li, Feng, et al.
Published: (2025)
by: Li, Feng, et al.
Published: (2025)
Region-aware Spatiotemporal Modeling with Collaborative Domain Generalization for Cross-Subject EEG Emotion Recognition
by: Wu, Weiwei, et al.
Published: (2026)
by: Wu, Weiwei, et al.
Published: (2026)
UniSync: A Unified Framework for Audio-Visual Synchronization
by: Feng, Tao, et al.
Published: (2025)
by: Feng, Tao, et al.
Published: (2025)
Detecting Lip-Syncing Deepfakes: Vision Temporal Transformer for Analyzing Mouth Inconsistencies
by: Datta, Soumyya Kanti, et al.
Published: (2025)
by: Datta, Soumyya Kanti, et al.
Published: (2025)
SyncSDE: A Probabilistic Framework for Diffusion Synchronization
by: Lee, Hyunjun, et al.
Published: (2025)
by: Lee, Hyunjun, et al.
Published: (2025)
EEG-based Multimodal Representation Learning for Emotion Recognition
by: Yin, Kang, et al.
Published: (2024)
by: Yin, Kang, et al.
Published: (2024)
OmniSync: Towards Universal Lip Synchronization via Diffusion Transformers
by: Peng, Ziqiao, et al.
Published: (2025)
by: Peng, Ziqiao, et al.
Published: (2025)
LEL: Lipschitz Continuity Constrained Ensemble Learning for Efficient EEG-Based Intra-subject Emotion Recognition
by: Gong, Shengyu, et al.
Published: (2025)
by: Gong, Shengyu, et al.
Published: (2025)
Cross-Modal Dual-Causal Learning for Long-Term Action Recognition
by: Shaowu, Xu, et al.
Published: (2025)
by: Shaowu, Xu, et al.
Published: (2025)
ECMF: Enhanced Cross-Modal Fusion for Multimodal Emotion Recognition in MER-SEMI Challenge
by: Hu, Juewen, et al.
Published: (2025)
by: Hu, Juewen, et al.
Published: (2025)
FreqDGT: Frequency-Adaptive Dynamic Graph Networks with Transformer for Cross-subject EEG Emotion Recognition
by: Li, Yueyang, et al.
Published: (2025)
by: Li, Yueyang, et al.
Published: (2025)
SyncTalk++: High-Fidelity and Efficient Synchronized Talking Heads Synthesis Using Gaussian Splatting
by: Peng, Ziqiao, et al.
Published: (2025)
by: Peng, Ziqiao, et al.
Published: (2025)
SyncVSR: Data-Efficient Visual Speech Recognition with End-to-End Crossmodal Audio Token Synchronization
by: Ahn, Young Jin, et al.
Published: (2024)
by: Ahn, Young Jin, et al.
Published: (2024)
FACE: Few-shot Adapter with Cross-view Fusion for Cross-subject EEG Emotion Recognition
by: Liu, Haiqi, et al.
Published: (2025)
by: Liu, Haiqi, et al.
Published: (2025)
DepthSync: Diffusion Guidance-Based Depth Synchronization for Scale- and Geometry-Consistent Video Depth Estimation
by: Dong, Yue-Jiang, et al.
Published: (2025)
by: Dong, Yue-Jiang, et al.
Published: (2025)
HighSync: High-Quality Lip Synchronization via Latent Diffusion Models
by: Daghigh, Saeed Firouzi, et al.
Published: (2026)
by: Daghigh, Saeed Firouzi, et al.
Published: (2026)
SyncFix: Fixing 3D Reconstructions via Multi-View Synchronization
by: Li, Deming, et al.
Published: (2026)
by: Li, Deming, et al.
Published: (2026)
StochSync: Stochastic Diffusion Synchronization for Image Generation in Arbitrary Spaces
by: Yeo, Kyeongmin, et al.
Published: (2025)
by: Yeo, Kyeongmin, et al.
Published: (2025)
CoSyncDiT: Cognitive Synchronous Diffusion Transformer for Movie Dubbing
by: Cong, Gaoxiang, et al.
Published: (2026)
by: Cong, Gaoxiang, et al.
Published: (2026)
SyncTrack4D: Cross-Video Motion Alignment and Video Synchronization for Multi-Video 4D Gaussian Splatting
by: Lee, Yonghan, et al.
Published: (2025)
by: Lee, Yonghan, et al.
Published: (2025)
SyncAnyone: Implicit Disentanglement via Progressive Self-Correction for Lip-Syncing in the wild
by: Zhang, Xindi, et al.
Published: (2025)
by: Zhang, Xindi, et al.
Published: (2025)
Similar Items
-
Cross-domain EEG-based Emotion Recognition with Contrastive Learning
by: Yan, Rui, et al.
Published: (2025) -
InfoSyncNet: Information Synchronization Temporal Convolutional Network for Visual Speech Recognition
by: Xue, Junxiao, et al.
Published: (2025) -
SyncVP: Joint Diffusion for Synchronous Multi-Modal Video Prediction
by: Pallotta, Enrico, et al.
Published: (2025) -
VisioPhysioENet: Visual Physiological Engagement Detection Network
by: Singh, Alakhsimar, et al.
Published: (2024) -
RocSync: Millisecond-Accurate Temporal Synchronization for Heterogeneous Camera Systems
by: Meyer, Jaro, et al.
Published: (2025)