Bibliographic Details
Main Authors:	Perov, Ivan; Gao, Daiheng; Chervoniy, Nikolay; Liu, Kunlin; Marangonda, Sugasa; Umé, Chris; Dpfks; Facenheim, Carl Shift; RP, Luis; Jiang, Jian; Zhang, Sheng; Wu, Pingyu; Zhou, Bo; Zhang, Weiming
Format:	Preprint
Published:	2020
Subjects:	Computer Vision and Pattern Recognition; Machine Learning; Multimedia; Image and Video Processing
Online Access:	https://arxiv.org/abs/2005.05535

Similar Items

Identity-Aware Vision-Language Model for Explainable Face Forgery Detection
by: Xu, Junhao, et al.
Published: (2025)

Multi-Reference Generative Face Video Compression with Contrastive Learning
by: Konuko, Goluck, et al.
Published: (2024)

Audio-Visual Cross-Modal Compression for Generative Face Video Coding
by: Xu, Youmin, et al.
Published: (2025)

SFQA: A Comprehensive Perceptual Quality Assessment Dataset for Singing Face Generation
by: Gao, Zhilin, et al.
Published: (2026)

Clean Image May be Dangerous: Data Poisoning Attacks Against Deep Hashing
by: Li, Shuai, et al.
Published: (2025)

Voices, Faces, and Feelings: Multi-modal Emotion-Cognition Captioning for Mental Health Understanding
by: Zhou, Zhiyuan, et al.
Published: (2026)

CEM-Net: Cross-Emotion Memory Network for Emotional Talking Face Generation
by: Wu, Kangyi, et al.
Published: (2025)

OpFlowTalker: Realistic and Natural Talking Face Generation via Optical Flow Guidance
by: Ge, Shuheng, et al.
Published: (2024)

Ensembling Synchronisation-based and Face-Voice Association Paradigms for Robust Active Speaker Detection in Egocentric Recordings
by: Clarke, Jason, et al.
Published: (2025)

Provably Secure Robust Image Steganography via Cross-Modal Error Correction
by: Qi, Yuang, et al.
Published: (2024)

SyncLipMAE: Contrastive Masked Pretraining for Audio-Visual Talking-Face Representation
by: Ling, Zeyu, et al.
Published: (2025)

Efficient Low-Resolution Face Recognition via Bridge Distillation
by: Ge, Shiming, et al.
Published: (2024)

ASAP: Advancing Semantic Alignment Promotes Multi-Modal Manipulation Detecting and Grounding
by: Zhang, Zhenxing, et al.
Published: (2024)

Band-Attention Modulated RetNet for Face Forgery Detection
by: Zhang, Zhida, et al.
Published: (2024)

Face Consistency Benchmark for GenAI Video
by: Podstawski, Michal, et al.
Published: (2025)

Reference-Guided Identity Preserving Face Restoration
by: Zhou, Mo, et al.
Published: (2025)

Dance-to-Music Generation with Encoder-based Textual Inversion
by: Li, Sifei, et al.
Published: (2024)

A Clustering-Based Method for Automatic Educational Video Recommendation Using Deep Face-Features of Lecturers
by: Mendes, Paulo R. C., et al.
Published: (2020)

G4G: A Generic Framework for High Fidelity Talking Face Generation with Fine-grained Intra-modal Alignment
by: Zhang, Juan, et al.
Published: (2024)

Distilling Generative-Discriminative Representations for Very Low-Resolution Face Recognition
by: Zhang, Junzheng, et al.
Published: (2024)

Beyond Semantic Search: Towards Referential Anchoring in Composed Image Retrieval
by: Yang, Yuxin, et al.
Published: (2026)

Stickers on Facebook: Multifunctionality and face-enhancing politeness in everyday social interaction
by: Porrino-Moscoso, Laura M.
Published: (2026)

Inclusion 2024 Global Multimedia Deepfake Detection Challenge: Towards Multi-dimensional Face Forgery Detection
by: Zhang, Yi, et al.
Published: (2024)

Adaptive 3D Mesh Steganography Based on Feature-Preserving Distortion
by: Zhang, Yushu, et al.
Published: (2022)

AdaDPCC: Adaptive Rate Control and Rate-Distortion-Complexity Optimization for Dynamic Point Cloud Compression
by: Zhang, Chenhao, et al.
Published: (2025)

Clinical Multi-modal Fusion with Heterogeneous Graph and Disease Correlation Learning for Multi-Disease Prediction
by: Jiang, Yueheng, et al.
Published: (2025)

High-Fidelity 3D Gaussian Human Reconstruction via Region-Aware Initialization and Geometric Priors
by: Liu, Yang, et al.
Published: (2026)

Sec2Sec Co-attention for Video-Based Apparent Affective Prediction
by: Sun, Mingwei, et al.
Published: (2024)

SIDQL: An Efficient Keyframe Extraction and Motion Reconstruction Framework in Motion Capture
by: Zhang, Xuling, et al.
Published: (2024)

Uncertainty-Aware 3D Emotional Talking Face Synthesis with Emotion Prior Distillation
by: Shen, Nanhan, et al.
Published: (2026)

PixelatedScatter: Arbitrary-level Visual Abstraction for Large-scale Multiclass Scatterplots
by: Guo, Ziheng, et al.
Published: (2025)

Memories are One-to-Many Mapping Alleviators in Talking Face Generation
by: Tang, Anni, et al.
Published: (2022)

MultimodalHugs: Enabling Sign Language Processing in Hugging Face
by: Sant, Gerard, et al.
Published: (2025)

Trailer Reimagined: An Innovative, Llm-DRiven, Expressive Automated Movie Summary framework (TRAILDREAMS)
by: Balestri, Roberto, et al.
Published: (2026)

Beyond Sliders: Mastering the Art of Diffusion-based Image Manipulation
by: Tang, Yufei, et al.
Published: (2025)

ConvBench: A Multi-Turn Conversation Evaluation Benchmark with Hierarchical Capability for Large Vision-Language Models
by: Liu, Shuo, et al.
Published: (2024)

Fostering Emotional Perspective-Taking: An Exploration of Affective Face-Tracking Interactions in the VR Narrative Rekindle
by: Fan, Hector, et al.
Published: (2026)

Recognizing Everything from All Modalities at Once: Grounded Multimodal Universal Information Extraction
by: Zhang, Meishan, et al.
Published: (2024)

Break-for-Make: Modular Low-Rank Adaptations for Composable Content-Style Customization
by: Xu, Yu, et al.
Published: (2024)

ISMAF: Intrinsic-Social Modality Alignment and Fusion for Multimodal Rumor Detection
by: Yu, Zihao, et al.
Published: (2025)