:: Library Catalog

Bibliographic Details
Main Authors:	Ki, Taekyung; Min, Dongchan
Format:	Preprint
Published:	2023
Subjects:	Computer Vision and Pattern Recognition; Artificial Intelligence; Machine Learning
Online Access:	https://arxiv.org/abs/2305.00521

Similar Items

Style-Preserving Lip Sync via Audio-Aware Style Reference
by: Zhong, Weizhi, et al.
Published: (2024)

FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait
by: Ki, Taekyung, et al.
Published: (2024)

Learning to Generate Conditional Tri-plane for 3D-aware Expression Controllable Portrait Animation
by: Ki, Taekyung, et al.
Published: (2024)

AV-Lip-Sync+: Leveraging AV-HuBERT to Exploit Multimodal Inconsistency for Deepfake Detection of Frontal Face Videos
by: Shahzad, Sahibzada Adil, et al.
Published: (2023)

Text2Lip: Progressive Lip-Synced Talking Face Generation from Text via Viseme-Guided Rendering
by: Wang, Xu, et al.
Published: (2025)

High-fidelity and Lip-synced Talking Face Synthesis via Landmark-based Diffusion Model
by: Zhong, Weizhi, et al.
Published: (2024)

LipShiFT: A Certifiably Robust Shift-based Vision Transformer
by: Menon, Rohan, et al.
Published: (2025)

NeuroLip: An Event-driven Spatiotemporal Learning Framework for Cross-Scene Lip-Motion-based Visual Speaker Recognition
by: Yao, Junguang, et al.
Published: (2026)

StyleTalker: One-shot Style-based Audio-driven Talking Head Video Generation
by: Min, Dongchan, et al.
Published: (2022)

FastCLIPstyler: Optimisation-free Text-based Image Style Transfer Using Style Representations
by: Suresh, Ananda Padhmanabhan, et al.
Published: (2022)

VQ-Style: Disentangling Style and Content in Motion with Residual Quantized Representations
by: Zargarbashi, Fatemeh, et al.
Published: (2026)

Regional Style and Color Transfer
by: Ding, Zhicheng, et al.
Published: (2024)

Exploring Phonetic Context-Aware Lip-Sync For Talking Face Generation
by: Park, Se Jin, et al.
Published: (2023)

Exposing Lip-syncing Deepfakes from Mouth Inconsistencies
by: Datta, Soumyya Kanti, et al.
Published: (2024)

Evaluation of Randomization through Style Transfer for Enhanced Domain Generalization
by: Eisenhardt, Dustin, et al.
Published: (2026)

DiffInject: Revisiting Debias via Synthetic Data Generation using Diffusion-based Style Injection
by: Ko, Donggeun, et al.
Published: (2024)

FonTS: Text Rendering with Typography and Style Controls
by: Shi, Wenda, et al.
Published: (2024)

Character-based Outfit Generation with Vision-augmented Style Extraction via LLMs
by: Forouzandehmehr, Najmeh, et al.
Published: (2024)

StyleMotif: Multi-Modal Motion Stylization using Style-Content Cross Fusion
by: Guo, Ziyu, et al.
Published: (2025)

Avatar Forcing: Real-Time Interactive Head Avatar Generation for Natural Conversation
by: Ki, Taekyung, et al.
Published: (2026)

Style-Extracting Diffusion Models for Semi-Supervised Histopathology Segmentation
by: Öttl, Mathias, et al.
Published: (2024)

SCFlow: Implicitly Learning Style and Content Disentanglement with Flow Models
by: Ma, Pingchuan, et al.
Published: (2025)

A Billion-scale Foundation Model for Remote Sensing Images
by: Cha, Keumgang, et al.
Published: (2023)

LoRA.rar: Learning to Merge LoRAs via Hypernetworks for Subject-Style Conditioned Image Generation
by: Shenaj, Donald, et al.
Published: (2024)

StyleCrafter: Enhancing Stylized Text-to-Video Generation with Style Adapter
by: Liu, Gongye, et al.
Published: (2023)

An Interpretable X-ray Style Transfer via Trainable Local Laplacian Filter
by: Eckert, Dominik, et al.
Published: (2024)

Style-Based Neural Architectures for Real-Time Weather Classification
by: Ouattara, Hamed, et al.
Published: (2026)

AnimateLCM: Computation-Efficient Personalized Style Video Generation without Personalized Video Data
by: Wang, Fu-Yun, et al.
Published: (2024)

Enhancing Nighttime Vehicle Detection with Day-to-Night Style Transfer and Labeling-Free Augmentation
by: Yang, Yunxiang, et al.
Published: (2024)

Lips Are Lying: Spotting the Temporal Inconsistency between Audio and Visual in Lip-Syncing DeepFakes
by: Liu, Weifeng, et al.
Published: (2024)

OrthoFuse: Training-free Riemannian Fusion of Orthogonal Style-Concept Adapters for Diffusion Models
by: Aliev, Ali, et al.
Published: (2026)

SyncMapV2: Robust and Adaptive Unsupervised Segmentation
by: Zhang, Heng, et al.
Published: (2025)

Removing Averaging: Personalized Lip-Sync Driven Characters Based on Identity Adapter
by: Zhu, Yanyu, et al.
Published: (2025)

FluentLip: A Phonemes-Based Two-stage Approach for Audio-Driven Lip Synthesis with Optical Flow Consistency
by: Liu, Shiyan, et al.
Published: (2025)

Personalized Image Generation from an Author Writing Style
by: Gandhi, Sagar, et al.
Published: (2025)

Minecraft-ify: Minecraft Style Image Generation with Text-guided Image Editing for In-Game Application
by: Kim, Bumsoo, et al.
Published: (2024)

BioLip: Language-Generalizable Lip-Sync Deepfake Detection via Biomechanical Constraint Violation Modeling
by: Chen, Hao, et al.
Published: (2026)

Magic Insert: Style-Aware Drag-and-Drop
by: Ruiz, Nataniel, et al.
Published: (2024)

LASER: Lip Landmark Assisted Speaker Detection for Robustness
by: Nguyen, Le Thien Phuc, et al.
Published: (2025)

Dynamic Neural Style Transfer for Artistic Image Generation using VGG19
by: Kashyap, Kapil, et al.
Published: (2025)