Saved in:
| Main Authors: | Seo, Sunyong, Kim, Semin, Lee, Jongha |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2507.01290 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Data Augmentation For Small Object using Fast AutoAugment
by: Yoon, DaeEun, et al.
Published: (2025)
by: Yoon, DaeEun, et al.
Published: (2025)
Color Universal Design Neural Network for the Color Vision Deficiencies
by: Seo, Sunyong, et al.
Published: (2025)
by: Seo, Sunyong, et al.
Published: (2025)
TabFlash: Efficient Table Understanding with Progressive Question Conditioning and Token Focusing
by: Kim, Jongha, et al.
Published: (2025)
by: Kim, Jongha, et al.
Published: (2025)
Full-scale Representation Guided Network for Retinal Vessel Segmentation
by: Seo, Sunyong, et al.
Published: (2025)
by: Seo, Sunyong, et al.
Published: (2025)
VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning
by: Lee, Ji Soo, et al.
Published: (2025)
by: Lee, Ji Soo, et al.
Published: (2025)
Exploiting Diffusion Prior for Task-driven Image Restoration
by: Kim, Jaeha, et al.
Published: (2025)
by: Kim, Jaeha, et al.
Published: (2025)
DocPrune:Efficient Document Question Answering via Background, Question, and Comprehension-aware Token Pruning
by: Choi, Joonmyung, et al.
Published: (2026)
by: Choi, Joonmyung, et al.
Published: (2026)
NeRFFaceSpeech: One-shot Audio-driven 3D Talking Head Synthesis via Generative Prior
by: Kim, Gihoon, et al.
Published: (2024)
by: Kim, Gihoon, et al.
Published: (2024)
Implementation of a Skin Lesion Detection System for Managing Children with Atopic Dermatitis Based on Ensemble Learning
by: Jeon, Soobin, et al.
Published: (2025)
by: Jeon, Soobin, et al.
Published: (2025)
Groupwise Query Specialization and Quality-Aware Multi-Assignment for Transformer-based Visual Relationship Detection
by: Kim, Jongha, et al.
Published: (2024)
by: Kim, Jongha, et al.
Published: (2024)
Relevance-aware Multi-context Contrastive Decoding for Retrieval-augmented Visual Question Answering
by: Kim, Jongha, et al.
Published: (2026)
by: Kim, Jongha, et al.
Published: (2026)
Bridging the gap to real-world language-grounded visual concept learning
by: Jung, Whie, et al.
Published: (2025)
by: Jung, Whie, et al.
Published: (2025)
Learning a Delighting Prior for Facial Appearance Capture in the Wild
by: Han, Yuxuan, et al.
Published: (2026)
by: Han, Yuxuan, et al.
Published: (2026)
Planning in 8 Tokens: A Compact Discrete Tokenizer for Latent World Model
by: Kim, Dongwon, et al.
Published: (2026)
by: Kim, Dongwon, et al.
Published: (2026)
Chameleon: A Data-Efficient Generalist for Dense Visual Prediction in the Wild
by: Kim, Donggyun, et al.
Published: (2024)
by: Kim, Donggyun, et al.
Published: (2024)
Discrete Facial Encoding: : A Framework for Data-driven Facial Display Discovery
by: Tran, Minh, et al.
Published: (2025)
by: Tran, Minh, et al.
Published: (2025)
FRIDAY: Mitigating Unintentional Facial Identity in Deepfake Detectors Guided by Facial Recognizers
by: Kim, Younhun, et al.
Published: (2024)
by: Kim, Younhun, et al.
Published: (2024)
RealTalk: Real-time and Realistic Audio-driven Face Generation with 3D Facial Prior-guided Identity Alignment Network
by: Ji, Xiaozhong, et al.
Published: (2024)
by: Ji, Xiaozhong, et al.
Published: (2024)
4D Facial Expression Diffusion Model
by: Zou, Kaifeng, et al.
Published: (2023)
by: Zou, Kaifeng, et al.
Published: (2023)
RA-Touch: Retrieval-Augmented Touch Understanding with Enriched Visual Data
by: Cho, Yoorhim, et al.
Published: (2025)
by: Cho, Yoorhim, et al.
Published: (2025)
Multi-Task Multi-Modal Self-Supervised Learning for Facial Expression Recognition
by: Halawa, Marah, et al.
Published: (2024)
by: Halawa, Marah, et al.
Published: (2024)
Polyglot: Multilingual Style Preserving Speech-Driven Facial Animation
by: Nocentini, Federico, et al.
Published: (2026)
by: Nocentini, Federico, et al.
Published: (2026)
ERASE: Eliminating Redundant Visual Tokens via Adaptive Two-Stage Token Pruning
by: Lee, Yuna, et al.
Published: (2026)
by: Lee, Yuna, et al.
Published: (2026)
Deep Learning Based Facial Retargeting Using Local Patches
by: Choi, Yeonsoo, et al.
Published: (2026)
by: Choi, Yeonsoo, et al.
Published: (2026)
Navigating Label Ambiguity for Facial Expression Recognition in the Wild
by: Lee, JunGyu, et al.
Published: (2025)
by: Lee, JunGyu, et al.
Published: (2025)
Superpixel Tokenization for Vision Transformers: Preserving Semantic Integrity in Visual Tokens
by: Lew, Jaihyun, et al.
Published: (2024)
by: Lew, Jaihyun, et al.
Published: (2024)
Expressive Speech-driven Facial Animation with controllable emotions
by: Chen, Yutong, et al.
Published: (2023)
by: Chen, Yutong, et al.
Published: (2023)
Masked Autoregressive Model for Weather Forecasting
by: Kim, Doyi, et al.
Published: (2024)
by: Kim, Doyi, et al.
Published: (2024)
VQTalker: Towards Multilingual Talking Avatars through Facial Motion Tokenization
by: Liu, Tao, et al.
Published: (2024)
by: Liu, Tao, et al.
Published: (2024)
PropFly: Learning to Propagate via On-the-Fly Supervision from Pre-trained Video Diffusion Models
by: Seo, Wonyong, et al.
Published: (2026)
by: Seo, Wonyong, et al.
Published: (2026)
Facial Appearance Capture at Home with Patch-Level Reflectance Prior
by: Han, Yuxuan, et al.
Published: (2025)
by: Han, Yuxuan, et al.
Published: (2025)
Leveraging 3D Geometric Priors in 2D Rotation Symmetry Detection
by: Seo, Ahyun, et al.
Published: (2025)
by: Seo, Ahyun, et al.
Published: (2025)
Semantic-Aware Reconstruction Error for Detecting AI-Generated Images
by: Kang, Ju Yeon, et al.
Published: (2025)
by: Kang, Ju Yeon, et al.
Published: (2025)
Analysis of Bias in Deep Learning Facial Beauty Regressors
by: Hamel, Chandon, et al.
Published: (2025)
by: Hamel, Chandon, et al.
Published: (2025)
TokTalk: Expressive Real-time Facial Animation from Audio-LLM Tokens
by: Zhao, Qingcheng, et al.
Published: (2026)
by: Zhao, Qingcheng, et al.
Published: (2026)
v1: Learning to Point Visual Tokens for Multimodal Grounded Reasoning
by: Chung, Jiwan, et al.
Published: (2025)
by: Chung, Jiwan, et al.
Published: (2025)
Prior-based Objective Inference Mining Potential Uncertainty for Facial Expression Recognition
by: Liu, Hanwei, et al.
Published: (2024)
by: Liu, Hanwei, et al.
Published: (2024)
LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior
by: Wang, Hanyu, et al.
Published: (2024)
by: Wang, Hanyu, et al.
Published: (2024)
Neural Face Skinning for Mesh-agnostic Facial Expression Cloning
by: Cha, Sihun, et al.
Published: (2025)
by: Cha, Sihun, et al.
Published: (2025)
Makeup Prior Models for 3D Facial Makeup Estimation and Applications
by: Yang, Xingchao, et al.
Published: (2024)
by: Yang, Xingchao, et al.
Published: (2024)
Similar Items
-
Data Augmentation For Small Object using Fast AutoAugment
by: Yoon, DaeEun, et al.
Published: (2025) -
Color Universal Design Neural Network for the Color Vision Deficiencies
by: Seo, Sunyong, et al.
Published: (2025) -
TabFlash: Efficient Table Understanding with Progressive Question Conditioning and Token Focusing
by: Kim, Jongha, et al.
Published: (2025) -
Full-scale Representation Guided Network for Retinal Vessel Segmentation
by: Seo, Sunyong, et al.
Published: (2025) -
VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning
by: Lee, Ji Soo, et al.
Published: (2025)