:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Ren, Jingjing, Xu, Cheng, Chen, Haoyu, Qin, Xinran, Zhu, Lei
Format:	Preprint
Published:	2023
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2312.16274
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Turbo2K: Towards Ultra-Efficient and High-Quality 2K Video Synthesis
by: Ren, Jingjing, et al.
Published: (2025)

UltraPixel: Advancing Ultra-High-Resolution Image Synthesis to New Peaks
by: Ren, Jingjing, et al.
Published: (2024)

AdaptiveFusion: Adaptive Multi-Modal Multi-View Fusion for 3D Human Body Reconstruction
by: Chen, Anjun, et al.
Published: (2024)

Suppress and Rebalance: Towards Generalized Multi-Modal Face Anti-Spoofing
by: Lin, Xun, et al.
Published: (2024)

Self-Supervised Flow Matching for Scalable Multi-Modal Synthesis
by: Chefer, Hila, et al.
Published: (2026)

Robust Multi-Modal Face Anti-Spoofing with Domain Adaptation: Tackling Missing Modalities, Noisy Pseudo-Labels, and Model Degradation
by: Hsu, Ming-Tsung, et al.
Published: (2025)

Prompt-Free Conditional Diffusion for Multi-object Image Augmentation
by: Wang, Haoyu, et al.
Published: (2025)

Semi-Supervised Video Desnowing Network via Temporal Decoupling Experts and Distribution-Driven Contrastive Regularization
by: Wu, Hongtao, et al.
Published: (2024)

EchoingPixels: Cross-Modal Adaptive Token Reduction for Efficient Audio-Visual LLMs
by: Gong, Chao, et al.
Published: (2025)

Towards Personalized Multi-Modal MRI Synthesis across Heterogeneous Datasets
by: Zhang, Yue, et al.
Published: (2026)

Domain-Adaptive Full-Face Gaze Estimation via Novel-View-Synthesis and Feature Disentanglement
by: Qin, Jiawei, et al.
Published: (2023)

Media2Face: Co-speech Facial Animation Generation With Multi-Modality Guidance
by: Zhao, Qingcheng, et al.
Published: (2024)

OmniSegmentor: A Flexible Multi-Modal Learning Framework for Semantic Segmentation
by: Yin, Bo-Wen, et al.
Published: (2025)

Towards Real Zero-Shot Camouflaged Object Segmentation without Camouflaged Annotations
by: Lei, Cheng, et al.
Published: (2024)

IdentiFace: Multi-Modal Iterative Diffusion Framework for Identifiable Suspect Face Generation in Crime Investigations
by: Liu, Weichen, et al.
Published: (2026)

FaceChain-FACT: Face Adapter with Decoupled Training for Identity-preserved Personalization
by: Yu, Cheng, et al.
Published: (2024)

MM-MovieDubber: Towards Multi-Modal Learning for Multi-Modal Movie Dubbing
by: Zheng, Junjie, et al.
Published: (2025)

SpatialImaginer: Towards Adaptive Visual Imagination for Spatial Reasoning
by: Li, Yian, et al.
Published: (2026)

UMDATrack: Unified Multi-Domain Adaptive Tracking Under Adverse Weather Conditions
by: Yao, Siyuan, et al.
Published: (2025)

ModalPrompt: Towards Efficient Multimodal Continual Instruction Tuning with Dual-Modality Guided Prompt
by: Zeng, Fanhu, et al.
Published: (2024)

Multi-Modal Face Anti-Spoofing via Cross-Modal Feature Transitions
by: Chong, Jun-Xiong, et al.
Published: (2025)

Exploring Conditional Multi-Modal Prompts for Zero-shot HOI Detection
by: Lei, Ting, et al.
Published: (2024)

Toward Scalable, Flexible Scene Flow for Point Clouds
by: Vedder, Kyle
Published: (2025)

MultiWorld: Scalable Multi-Agent Multi-View Video World Models
by: Wu, Haoyu, et al.
Published: (2026)

Overcoming False Illusions in Real-World Face Restoration with Multi-Modal Guided Diffusion Model
by: Tao, Keda, et al.
Published: (2024)

IDRetracor: Towards Visual Forensics Against Malicious Face Swapping
by: Cheng, Jikang, et al.
Published: (2024)

Rethinking Vision-Language Model in Face Forensics: Multi-Modal Interpretable Forged Face Detector
by: Guo, Xiao, et al.
Published: (2025)

Multi-Channel Cross Modal Detection of Synthetic Face Images
by: Ibsen, M., et al.
Published: (2023)

PixelWizard: Towards Efficient High-Fidelity Video Generation at Ultra-Large Spatial Resolution
by: Li, Wenxue, et al.
Published: (2026)

Towards Stable Self-Supervised Object Representations in Unconstrained Egocentric Video
by: Tan, Yuting, et al.
Published: (2026)

Adaptive Domain Shift in Diffusion Models for Cross-Modality Image Translation
by: Wang, Zihao, et al.
Published: (2026)

Adaptive Context Matters: Towards Provable Multi-Modality Guidance for Super-Resolution
by: Luo, Jinyi, et al.
Published: (2026)

Towards Consistent and Controllable Image Synthesis for Face Editing
by: Wei, Mengting, et al.
Published: (2025)

QuantFace: Efficient Quantization for Face Restoration
by: Li, Jiatong, et al.
Published: (2025)

Is Extending Modality The Right Path Towards Omni-Modality?
by: Zhu, Tinghui, et al.
Published: (2025)

Hierarchically-Structured Open-Vocabulary Indoor Scene Synthesis with Pre-trained Large Language Model
by: Sun, Weilin, et al.
Published: (2025)

G2Face: High-Fidelity Reversible Face Anonymization via Generative and Geometric Priors
by: Yang, Haoxin, et al.
Published: (2024)

Diffusion-driven GAN Inversion for Multi-Modal Face Image Generation
by: Kim, Jihyun, et al.
Published: (2024)

Towards Multimodal Video Paragraph Captioning Models Robust to Missing Modality
by: Chen, Sishuo, et al.
Published: (2024)

Adaptive Multi-Modal Cross-Entropy Loss for Stereo Matching
by: Xu, Peng, et al.
Published: (2023)