Saved in:
| Main Authors: | Tang, Jiehui, Wang, Xiaofei, Xiao, Zhen, Liu, Jiayi, Liu, Xueliang, Hong, Richang |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2407.19875 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Emotion Separation and Recognition from a Facial Expression by Generating the Poker Face with Vision Transformers
by: Li, Jia, et al.
Published: (2022)
by: Li, Jia, et al.
Published: (2022)
Discrete to Continuous: Generating Smooth Transition Poses from Sign Language Observation
by: Tang, Shengeng, et al.
Published: (2024)
by: Tang, Shengeng, et al.
Published: (2024)
StgcDiff: Spatial-Temporal Graph Condition Diffusion for Sign Language Transition Generation
by: He, Jiashu, et al.
Published: (2025)
by: He, Jiashu, et al.
Published: (2025)
Domain Generalization for Face Anti-spoofing via Content-aware Composite Prompt Engineering
by: Guo, Jiabao, et al.
Published: (2025)
by: Guo, Jiabao, et al.
Published: (2025)
FaceMe: Robust Blind Face Restoration with Personal Identification
by: Liu, Siyu, et al.
Published: (2025)
by: Liu, Siyu, et al.
Published: (2025)
Sign-IDD: Iconicity Disentangled Diffusion for Sign Language Production
by: Tang, Shengeng, et al.
Published: (2024)
by: Tang, Shengeng, et al.
Published: (2024)
MGCA-Net: Multi-Grained Category-Aware Network for Open-Vocabulary Temporal Action Localization
by: Fang, Zhenying, et al.
Published: (2025)
by: Fang, Zhenying, et al.
Published: (2025)
Text2Lip: Progressive Lip-Synced Talking Face Generation from Text via Viseme-Guided Rendering
by: Wang, Xu, et al.
Published: (2025)
by: Wang, Xu, et al.
Published: (2025)
Face-voice Association in Multilingual Environments (FAME) 2026 Challenge Evaluation Plan
by: Moscati, Marta, et al.
Published: (2025)
by: Moscati, Marta, et al.
Published: (2025)
Viewport Prediction for Volumetric Video Streaming by Exploring Video Saliency and Trajectory Information
by: Li, Jie, et al.
Published: (2023)
by: Li, Jie, et al.
Published: (2023)
Boundary Discretization and Reliable Classification Network for Temporal Action Detection
by: Fang, Zhenying, et al.
Published: (2023)
by: Fang, Zhenying, et al.
Published: (2023)
Grid Jigsaw Representation with CLIP: A New Perspective on Image Clustering
by: Song, Zijie, et al.
Published: (2023)
by: Song, Zijie, et al.
Published: (2023)
CanonSLR: Canonical-View Guided Multi-View Continuous Sign Language Recognition
by: Wang, Xu, et al.
Published: (2026)
by: Wang, Xu, et al.
Published: (2026)
SignAligner: Harmonizing Complementary Pose Modalities for Coherent Sign Language Generation
by: Wang, Xu, et al.
Published: (2025)
by: Wang, Xu, et al.
Published: (2025)
Generalizable Face Landmarking Guided by Conditional Face Warping
by: Liang, Jiayi, et al.
Published: (2024)
by: Liang, Jiayi, et al.
Published: (2024)
GT-Mean Loss: A Simple Yet Effective Solution for Brightness Mismatch in Low-Light Image Enhancement
by: Liao, Jingxi, et al.
Published: (2025)
by: Liao, Jingxi, et al.
Published: (2025)
ROVER: Robust Loop Closure Verification with Trajectory Prior in Repetitive Environments
by: Yu, Jingwen, et al.
Published: (2025)
by: Yu, Jingwen, et al.
Published: (2025)
Iterative Adversarial Attack on Image-guided Story Ending Generation
by: Wang, Youze, et al.
Published: (2023)
by: Wang, Youze, et al.
Published: (2023)
Linguistics-Vision Monotonic Consistent Network for Sign Language Production
by: Wang, Xu, et al.
Published: (2024)
by: Wang, Xu, et al.
Published: (2024)
Dual-View Alignment Learning with Hierarchical-Prompt for Class-Imbalance Multi-Label Classification
by: Huang, Sheng, et al.
Published: (2025)
by: Huang, Sheng, et al.
Published: (2025)
Find Matching Faces Based On Face Parameters
by: Bhatt, Setu A., et al.
Published: (2025)
by: Bhatt, Setu A., et al.
Published: (2025)
From Inpainting to Editing: Unlocking Robust Mask-Free Visual Dubbing via Generative Bootstrapping
by: He, Xu, et al.
Published: (2025)
by: He, Xu, et al.
Published: (2025)
Text Proxy: Decomposing Retrieval from a 1-to-N Relationship into N 1-to-1 Relationships for Text-Video Retrieval
by: Xiao, Jian, et al.
Published: (2024)
by: Xiao, Jian, et al.
Published: (2024)
Multi-Stage Vision Token Dropping: Towards Efficient Multimodal Large Language Model
by: Liu, Ting, et al.
Published: (2024)
by: Liu, Ting, et al.
Published: (2024)
Controllable Relation Disentanglement for Few-Shot Class-Incremental Learning
by: Zhou, Yuan, et al.
Published: (2024)
by: Zhou, Yuan, et al.
Published: (2024)
SwimVG: Step-wise Multimodal Fusion and Adaption for Visual Grounding
by: Shi, Liangtao, et al.
Published: (2025)
by: Shi, Liangtao, et al.
Published: (2025)
SDTalk: Structured Facial Priors and Dual-Branch Motion Fields for Generalizable Gaussian Talking Head Synthesis
by: Jia, Peng, et al.
Published: (2026)
by: Jia, Peng, et al.
Published: (2026)
CoMatch: Dynamic Covisibility-Aware Transformer for Bilateral Subpixel-Level Semi-Dense Image Matching
by: Li, Zizhuo, et al.
Published: (2025)
by: Li, Zizhuo, et al.
Published: (2025)
From Static to Dynamic: Adapting Landmark-Aware Image Models for Facial Expression Recognition in Videos
by: Chen, Yin, et al.
Published: (2023)
by: Chen, Yin, et al.
Published: (2023)
HonestFace: Towards Honest Face Restoration with One-Step Diffusion Model
by: Wang, Jingkai, et al.
Published: (2025)
by: Wang, Jingkai, et al.
Published: (2025)
Rethinking Vision-Language Model in Face Forensics: Multi-Modal Interpretable Forged Face Detector
by: Guo, Xiao, et al.
Published: (2025)
by: Guo, Xiao, et al.
Published: (2025)
JoyType: A Robust Design for Multilingual Visual Text Creation
by: Li, Chao, et al.
Published: (2024)
by: Li, Chao, et al.
Published: (2024)
Dense-Face: Personalized Face Generation Model via Dense Annotation Prediction
by: Guo, Xiao, et al.
Published: (2024)
by: Guo, Xiao, et al.
Published: (2024)
Image Captioning via Compact Bidirectional Architecture
by: Song, Zijie, et al.
Published: (2022)
by: Song, Zijie, et al.
Published: (2022)
RFOP: Rethinking Fusion and Orthogonal Projection for Face-Voice Association
by: Hannan, Abdul, et al.
Published: (2025)
by: Hannan, Abdul, et al.
Published: (2025)
Face-Voice Association with Inductive Bias for Maximum Class Separation
by: Moscati, Marta, et al.
Published: (2026)
by: Moscati, Marta, et al.
Published: (2026)
Deep Frequency-Aware Functional Maps for Robust Shape Matching
by: Luo, Feifan, et al.
Published: (2024)
by: Luo, Feifan, et al.
Published: (2024)
Motion Manipulation via Unsupervised Keypoint Positioning in Face Animation
by: Li, Hong, et al.
Published: (2026)
by: Li, Hong, et al.
Published: (2026)
Benchmarking Unified Face Attack Detection via Hierarchical Prompt Tuning
by: Liu, Ajian, et al.
Published: (2025)
by: Liu, Ajian, et al.
Published: (2025)
Image Deblurring by Exploring In-depth Properties of Transformer
by: Liang, Pengwei, et al.
Published: (2023)
by: Liang, Pengwei, et al.
Published: (2023)
Similar Items
-
Emotion Separation and Recognition from a Facial Expression by Generating the Poker Face with Vision Transformers
by: Li, Jia, et al.
Published: (2022) -
Discrete to Continuous: Generating Smooth Transition Poses from Sign Language Observation
by: Tang, Shengeng, et al.
Published: (2024) -
StgcDiff: Spatial-Temporal Graph Condition Diffusion for Sign Language Transition Generation
by: He, Jiashu, et al.
Published: (2025) -
Domain Generalization for Face Anti-spoofing via Content-aware Composite Prompt Engineering
by: Guo, Jiabao, et al.
Published: (2025) -
FaceMe: Robust Blind Face Restoration with Personal Identification
by: Liu, Siyu, et al.
Published: (2025)