Saved in:
| Main Authors: | Ki, Taekyung, Min, Dongchan |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2305.00521 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Style-Preserving Lip Sync via Audio-Aware Style Reference
by: Zhong, Weizhi, et al.
Published: (2024)
by: Zhong, Weizhi, et al.
Published: (2024)
FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait
by: Ki, Taekyung, et al.
Published: (2024)
by: Ki, Taekyung, et al.
Published: (2024)
Learning to Generate Conditional Tri-plane for 3D-aware Expression Controllable Portrait Animation
by: Ki, Taekyung, et al.
Published: (2024)
by: Ki, Taekyung, et al.
Published: (2024)
AV-Lip-Sync+: Leveraging AV-HuBERT to Exploit Multimodal Inconsistency for Deepfake Detection of Frontal Face Videos
by: Shahzad, Sahibzada Adil, et al.
Published: (2023)
by: Shahzad, Sahibzada Adil, et al.
Published: (2023)
Text2Lip: Progressive Lip-Synced Talking Face Generation from Text via Viseme-Guided Rendering
by: Wang, Xu, et al.
Published: (2025)
by: Wang, Xu, et al.
Published: (2025)
High-fidelity and Lip-synced Talking Face Synthesis via Landmark-based Diffusion Model
by: Zhong, Weizhi, et al.
Published: (2024)
by: Zhong, Weizhi, et al.
Published: (2024)
LipShiFT: A Certifiably Robust Shift-based Vision Transformer
by: Menon, Rohan, et al.
Published: (2025)
by: Menon, Rohan, et al.
Published: (2025)
NeuroLip: An Event-driven Spatiotemporal Learning Framework for Cross-Scene Lip-Motion-based Visual Speaker Recognition
by: Yao, Junguang, et al.
Published: (2026)
by: Yao, Junguang, et al.
Published: (2026)
StyleTalker: One-shot Style-based Audio-driven Talking Head Video Generation
by: Min, Dongchan, et al.
Published: (2022)
by: Min, Dongchan, et al.
Published: (2022)
FastCLIPstyler: Optimisation-free Text-based Image Style Transfer Using Style Representations
by: Suresh, Ananda Padhmanabhan, et al.
Published: (2022)
by: Suresh, Ananda Padhmanabhan, et al.
Published: (2022)
VQ-Style: Disentangling Style and Content in Motion with Residual Quantized Representations
by: Zargarbashi, Fatemeh, et al.
Published: (2026)
by: Zargarbashi, Fatemeh, et al.
Published: (2026)
Regional Style and Color Transfer
by: Ding, Zhicheng, et al.
Published: (2024)
by: Ding, Zhicheng, et al.
Published: (2024)
Exploring Phonetic Context-Aware Lip-Sync For Talking Face Generation
by: Park, Se Jin, et al.
Published: (2023)
by: Park, Se Jin, et al.
Published: (2023)
Exposing Lip-syncing Deepfakes from Mouth Inconsistencies
by: Datta, Soumyya Kanti, et al.
Published: (2024)
by: Datta, Soumyya Kanti, et al.
Published: (2024)
Evaluation of Randomization through Style Transfer for Enhanced Domain Generalization
by: Eisenhardt, Dustin, et al.
Published: (2026)
by: Eisenhardt, Dustin, et al.
Published: (2026)
DiffInject: Revisiting Debias via Synthetic Data Generation using Diffusion-based Style Injection
by: Ko, Donggeun, et al.
Published: (2024)
by: Ko, Donggeun, et al.
Published: (2024)
FonTS: Text Rendering with Typography and Style Controls
by: Shi, Wenda, et al.
Published: (2024)
by: Shi, Wenda, et al.
Published: (2024)
Character-based Outfit Generation with Vision-augmented Style Extraction via LLMs
by: Forouzandehmehr, Najmeh, et al.
Published: (2024)
by: Forouzandehmehr, Najmeh, et al.
Published: (2024)
StyleMotif: Multi-Modal Motion Stylization using Style-Content Cross Fusion
by: Guo, Ziyu, et al.
Published: (2025)
by: Guo, Ziyu, et al.
Published: (2025)
Avatar Forcing: Real-Time Interactive Head Avatar Generation for Natural Conversation
by: Ki, Taekyung, et al.
Published: (2026)
by: Ki, Taekyung, et al.
Published: (2026)
Style-Extracting Diffusion Models for Semi-Supervised Histopathology Segmentation
by: Öttl, Mathias, et al.
Published: (2024)
by: Öttl, Mathias, et al.
Published: (2024)
SCFlow: Implicitly Learning Style and Content Disentanglement with Flow Models
by: Ma, Pingchuan, et al.
Published: (2025)
by: Ma, Pingchuan, et al.
Published: (2025)
A Billion-scale Foundation Model for Remote Sensing Images
by: Cha, Keumgang, et al.
Published: (2023)
by: Cha, Keumgang, et al.
Published: (2023)
LoRA.rar: Learning to Merge LoRAs via Hypernetworks for Subject-Style Conditioned Image Generation
by: Shenaj, Donald, et al.
Published: (2024)
by: Shenaj, Donald, et al.
Published: (2024)
StyleCrafter: Enhancing Stylized Text-to-Video Generation with Style Adapter
by: Liu, Gongye, et al.
Published: (2023)
by: Liu, Gongye, et al.
Published: (2023)
An Interpretable X-ray Style Transfer via Trainable Local Laplacian Filter
by: Eckert, Dominik, et al.
Published: (2024)
by: Eckert, Dominik, et al.
Published: (2024)
Style-Based Neural Architectures for Real-Time Weather Classification
by: Ouattara, Hamed, et al.
Published: (2026)
by: Ouattara, Hamed, et al.
Published: (2026)
AnimateLCM: Computation-Efficient Personalized Style Video Generation without Personalized Video Data
by: Wang, Fu-Yun, et al.
Published: (2024)
by: Wang, Fu-Yun, et al.
Published: (2024)
Enhancing Nighttime Vehicle Detection with Day-to-Night Style Transfer and Labeling-Free Augmentation
by: Yang, Yunxiang, et al.
Published: (2024)
by: Yang, Yunxiang, et al.
Published: (2024)
Lips Are Lying: Spotting the Temporal Inconsistency between Audio and Visual in Lip-Syncing DeepFakes
by: Liu, Weifeng, et al.
Published: (2024)
by: Liu, Weifeng, et al.
Published: (2024)
OrthoFuse: Training-free Riemannian Fusion of Orthogonal Style-Concept Adapters for Diffusion Models
by: Aliev, Ali, et al.
Published: (2026)
by: Aliev, Ali, et al.
Published: (2026)
SyncMapV2: Robust and Adaptive Unsupervised Segmentation
by: Zhang, Heng, et al.
Published: (2025)
by: Zhang, Heng, et al.
Published: (2025)
Removing Averaging: Personalized Lip-Sync Driven Characters Based on Identity Adapter
by: Zhu, Yanyu, et al.
Published: (2025)
by: Zhu, Yanyu, et al.
Published: (2025)
FluentLip: A Phonemes-Based Two-stage Approach for Audio-Driven Lip Synthesis with Optical Flow Consistency
by: Liu, Shiyan, et al.
Published: (2025)
by: Liu, Shiyan, et al.
Published: (2025)
Personalized Image Generation from an Author Writing Style
by: Gandhi, Sagar, et al.
Published: (2025)
by: Gandhi, Sagar, et al.
Published: (2025)
Minecraft-ify: Minecraft Style Image Generation with Text-guided Image Editing for In-Game Application
by: Kim, Bumsoo, et al.
Published: (2024)
by: Kim, Bumsoo, et al.
Published: (2024)
BioLip: Language-Generalizable Lip-Sync Deepfake Detection via Biomechanical Constraint Violation Modeling
by: Chen, Hao, et al.
Published: (2026)
by: Chen, Hao, et al.
Published: (2026)
Magic Insert: Style-Aware Drag-and-Drop
by: Ruiz, Nataniel, et al.
Published: (2024)
by: Ruiz, Nataniel, et al.
Published: (2024)
LASER: Lip Landmark Assisted Speaker Detection for Robustness
by: Nguyen, Le Thien Phuc, et al.
Published: (2025)
by: Nguyen, Le Thien Phuc, et al.
Published: (2025)
Dynamic Neural Style Transfer for Artistic Image Generation using VGG19
by: Kashyap, Kapil, et al.
Published: (2025)
by: Kashyap, Kapil, et al.
Published: (2025)
Similar Items
-
Style-Preserving Lip Sync via Audio-Aware Style Reference
by: Zhong, Weizhi, et al.
Published: (2024) -
FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait
by: Ki, Taekyung, et al.
Published: (2024) -
Learning to Generate Conditional Tri-plane for 3D-aware Expression Controllable Portrait Animation
by: Ki, Taekyung, et al.
Published: (2024) -
AV-Lip-Sync+: Leveraging AV-HuBERT to Exploit Multimodal Inconsistency for Deepfake Detection of Frontal Face Videos
by: Shahzad, Sahibzada Adil, et al.
Published: (2023) -
Text2Lip: Progressive Lip-Synced Talking Face Generation from Text via Viseme-Guided Rendering
by: Wang, Xu, et al.
Published: (2025)