Library Catalog

Bibliographic Details
Main Authors:	Seo, Sunyong; Kim, Semin; Lee, Jongha
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2507.01290

Similar Items

Data Augmentation For Small Object using Fast AutoAugment
by: Yoon, DaeEun, et al.
Published: (2025)

Color Universal Design Neural Network for the Color Vision Deficiencies
by: Seo, Sunyong, et al.
Published: (2025)

TabFlash: Efficient Table Understanding with Progressive Question Conditioning and Token Focusing
by: Kim, Jongha, et al.
Published: (2025)

Full-scale Representation Guided Network for Retinal Vessel Segmentation
by: Seo, Sunyong, et al.
Published: (2025)

VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning
by: Lee, Ji Soo, et al.
Published: (2025)

Exploiting Diffusion Prior for Task-driven Image Restoration
by: Kim, Jaeha, et al.
Published: (2025)

DocPrune: Efficient Document Question Answering via Background, Question, and Comprehension-aware Token Pruning
by: Choi, Joonmyung, et al.
Published: (2026)

NeRFFaceSpeech: One-shot Audio-driven 3D Talking Head Synthesis via Generative Prior
by: Kim, Gihoon, et al.
Published: (2024)

Implementation of a Skin Lesion Detection System for Managing Children with Atopic Dermatitis Based on Ensemble Learning
by: Jeon, Soobin, et al.
Published: (2025)

Groupwise Query Specialization and Quality-Aware Multi-Assignment for Transformer-based Visual Relationship Detection
by: Kim, Jongha, et al.
Published: (2024)

Relevance-aware Multi-context Contrastive Decoding for Retrieval-augmented Visual Question Answering
by: Kim, Jongha, et al.
Published: (2026)

Bridging the gap to real-world language-grounded visual concept learning
by: Jung, Whie, et al.
Published: (2025)

Learning a Delighting Prior for Facial Appearance Capture in the Wild
by: Han, Yuxuan, et al.
Published: (2026)

Planning in 8 Tokens: A Compact Discrete Tokenizer for Latent World Model
by: Kim, Dongwon, et al.
Published: (2026)

Chameleon: A Data-Efficient Generalist for Dense Visual Prediction in the Wild
by: Kim, Donggyun, et al.
Published: (2024)

Discrete Facial Encoding: A Framework for Data-driven Facial Display Discovery
by: Tran, Minh, et al.
Published: (2025)

FRIDAY: Mitigating Unintentional Facial Identity in Deepfake Detectors Guided by Facial Recognizers
by: Kim, Younhun, et al.
Published: (2024)

RealTalk: Real-time and Realistic Audio-driven Face Generation with 3D Facial Prior-guided Identity Alignment Network
by: Ji, Xiaozhong, et al.
Published: (2024)

4D Facial Expression Diffusion Model
by: Zou, Kaifeng, et al.
Published: (2023)

RA-Touch: Retrieval-Augmented Touch Understanding with Enriched Visual Data
by: Cho, Yoorhim, et al.
Published: (2025)

Multi-Task Multi-Modal Self-Supervised Learning for Facial Expression Recognition
by: Halawa, Marah, et al.
Published: (2024)

Polyglot: Multilingual Style Preserving Speech-Driven Facial Animation
by: Nocentini, Federico, et al.
Published: (2026)

ERASE: Eliminating Redundant Visual Tokens via Adaptive Two-Stage Token Pruning
by: Lee, Yuna, et al.
Published: (2026)

Deep Learning Based Facial Retargeting Using Local Patches
by: Choi, Yeonsoo, et al.
Published: (2026)

Navigating Label Ambiguity for Facial Expression Recognition in the Wild
by: Lee, JunGyu, et al.
Published: (2025)

Superpixel Tokenization for Vision Transformers: Preserving Semantic Integrity in Visual Tokens
by: Lew, Jaihyun, et al.
Published: (2024)

Expressive Speech-driven Facial Animation with controllable emotions
by: Chen, Yutong, et al.
Published: (2023)

Masked Autoregressive Model for Weather Forecasting
by: Kim, Doyi, et al.
Published: (2024)

VQTalker: Towards Multilingual Talking Avatars through Facial Motion Tokenization
by: Liu, Tao, et al.
Published: (2024)

PropFly: Learning to Propagate via On-the-Fly Supervision from Pre-trained Video Diffusion Models
by: Seo, Wonyong, et al.
Published: (2026)

Facial Appearance Capture at Home with Patch-Level Reflectance Prior
by: Han, Yuxuan, et al.
Published: (2025)

Leveraging 3D Geometric Priors in 2D Rotation Symmetry Detection
by: Seo, Ahyun, et al.
Published: (2025)

Semantic-Aware Reconstruction Error for Detecting AI-Generated Images
by: Kang, Ju Yeon, et al.
Published: (2025)

Analysis of Bias in Deep Learning Facial Beauty Regressors
by: Hamel, Chandon, et al.
Published: (2025)

TokTalk: Expressive Real-time Facial Animation from Audio-LLM Tokens
by: Zhao, Qingcheng, et al.
Published: (2026)

v1: Learning to Point Visual Tokens for Multimodal Grounded Reasoning
by: Chung, Jiwan, et al.
Published: (2025)

Prior-based Objective Inference Mining Potential Uncertainty for Facial Expression Recognition
by: Liu, Hanwei, et al.
Published: (2024)

LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior
by: Wang, Hanyu, et al.
Published: (2024)

Neural Face Skinning for Mesh-agnostic Facial Expression Cloning
by: Cha, Sihun, et al.
Published: (2025)

Makeup Prior Models for 3D Facial Makeup Estimation and Applications
by: Yang, Xingchao, et al.
Published: (2024)