| Main Authors: | Ren, Jingjing, Xu, Cheng, Chen, Haoyu, Qin, Xinran, Zhu, Lei |
|---|---|
| Format: | Preprint |
| Published: | 2023 |
| Online Access: | https://arxiv.org/abs/2312.16274 |
Similar Items
Turbo2K: Towards Ultra-Efficient and High-Quality 2K Video Synthesis
by: Ren, Jingjing, et al.
Published: (2025)
UltraPixel: Advancing Ultra-High-Resolution Image Synthesis to New Peaks
by: Ren, Jingjing, et al.
Published: (2024)
AdaptiveFusion: Adaptive Multi-Modal Multi-View Fusion for 3D Human Body Reconstruction
by: Chen, Anjun, et al.
Published: (2024)
Suppress and Rebalance: Towards Generalized Multi-Modal Face Anti-Spoofing
by: Lin, Xun, et al.
Published: (2024)
Self-Supervised Flow Matching for Scalable Multi-Modal Synthesis
by: Chefer, Hila, et al.
Published: (2026)
Robust Multi-Modal Face Anti-Spoofing with Domain Adaptation: Tackling Missing Modalities, Noisy Pseudo-Labels, and Model Degradation
by: Hsu, Ming-Tsung, et al.
Published: (2025)
Prompt-Free Conditional Diffusion for Multi-object Image Augmentation
by: Wang, Haoyu, et al.
Published: (2025)
Semi-Supervised Video Desnowing Network via Temporal Decoupling Experts and Distribution-Driven Contrastive Regularization
by: Wu, Hongtao, et al.
Published: (2024)
EchoingPixels: Cross-Modal Adaptive Token Reduction for Efficient Audio-Visual LLMs
by: Gong, Chao, et al.
Published: (2025)
Towards Personalized Multi-Modal MRI Synthesis across Heterogeneous Datasets
by: Zhang, Yue, et al.
Published: (2026)
Domain-Adaptive Full-Face Gaze Estimation via Novel-View-Synthesis and Feature Disentanglement
by: Qin, Jiawei, et al.
Published: (2023)
Media2Face: Co-speech Facial Animation Generation With Multi-Modality Guidance
by: Zhao, Qingcheng, et al.
Published: (2024)
OmniSegmentor: A Flexible Multi-Modal Learning Framework for Semantic Segmentation
by: Yin, Bo-Wen, et al.
Published: (2025)
Towards Real Zero-Shot Camouflaged Object Segmentation without Camouflaged Annotations
by: Lei, Cheng, et al.
Published: (2024)
IdentiFace: Multi-Modal Iterative Diffusion Framework for Identifiable Suspect Face Generation in Crime Investigations
by: Liu, Weichen, et al.
Published: (2026)
FaceChain-FACT: Face Adapter with Decoupled Training for Identity-preserved Personalization
by: Yu, Cheng, et al.
Published: (2024)
MM-MovieDubber: Towards Multi-Modal Learning for Multi-Modal Movie Dubbing
by: Zheng, Junjie, et al.
Published: (2025)
SpatialImaginer: Towards Adaptive Visual Imagination for Spatial Reasoning
by: Li, Yian, et al.
Published: (2026)
UMDATrack: Unified Multi-Domain Adaptive Tracking Under Adverse Weather Conditions
by: Yao, Siyuan, et al.
Published: (2025)
ModalPrompt: Towards Efficient Multimodal Continual Instruction Tuning with Dual-Modality Guided Prompt
by: Zeng, Fanhu, et al.
Published: (2024)
Multi-Modal Face Anti-Spoofing via Cross-Modal Feature Transitions
by: Chong, Jun-Xiong, et al.
Published: (2025)
Exploring Conditional Multi-Modal Prompts for Zero-shot HOI Detection
by: Lei, Ting, et al.
Published: (2024)
Toward Scalable, Flexible Scene Flow for Point Clouds
by: Vedder, Kyle
Published: (2025)
MultiWorld: Scalable Multi-Agent Multi-View Video World Models
by: Wu, Haoyu, et al.
Published: (2026)
Overcoming False Illusions in Real-World Face Restoration with Multi-Modal Guided Diffusion Model
by: Tao, Keda, et al.
Published: (2024)
IDRetracor: Towards Visual Forensics Against Malicious Face Swapping
by: Cheng, Jikang, et al.
Published: (2024)
Rethinking Vision-Language Model in Face Forensics: Multi-Modal Interpretable Forged Face Detector
by: Guo, Xiao, et al.
Published: (2025)
Multi-Channel Cross Modal Detection of Synthetic Face Images
by: Ibsen, M., et al.
Published: (2023)
PixelWizard: Towards Efficient High-Fidelity Video Generation at Ultra-Large Spatial Resolution
by: Li, Wenxue, et al.
Published: (2026)
Towards Stable Self-Supervised Object Representations in Unconstrained Egocentric Video
by: Tan, Yuting, et al.
Published: (2026)
Adaptive Domain Shift in Diffusion Models for Cross-Modality Image Translation
by: Wang, Zihao, et al.
Published: (2026)
Adaptive Context Matters: Towards Provable Multi-Modality Guidance for Super-Resolution
by: Luo, Jinyi, et al.
Published: (2026)
Towards Consistent and Controllable Image Synthesis for Face Editing
by: Wei, Mengting, et al.
Published: (2025)
QuantFace: Efficient Quantization for Face Restoration
by: Li, Jiatong, et al.
Published: (2025)
Is Extending Modality The Right Path Towards Omni-Modality?
by: Zhu, Tinghui, et al.
Published: (2025)
Hierarchically-Structured Open-Vocabulary Indoor Scene Synthesis with Pre-trained Large Language Model
by: Sun, Weilin, et al.
Published: (2025)
G2Face: High-Fidelity Reversible Face Anonymization via Generative and Geometric Priors
by: Yang, Haoxin, et al.
Published: (2024)
Diffusion-driven GAN Inversion for Multi-Modal Face Image Generation
by: Kim, Jihyun, et al.
Published: (2024)
Towards Multimodal Video Paragraph Captioning Models Robust to Missing Modality
by: Chen, Sishuo, et al.
Published: (2024)
Adaptive Multi-Modal Cross-Entropy Loss for Stereo Matching
by: Xu, Peng, et al.
Published: (2023)