:: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Kim, Jihyeon, Kim, Sohee, Lee, Soosan, Jung, Souhwan, Rehg, James Matthew, Choi, Hyesong
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence
Online Access:	https://arxiv.org/abs/2605.27348
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

In the Eye of Transformer: Global-Local Correlation for Egocentric Gaze Estimation
by: Lai, Bolin, et al.
Published: (2022)

Cue3D: Quantifying the Role of Image Cues in Single-Image 3D Generation
by: Li, Xiang, et al.
Published: (2025)

Supporting Mitosis Detection AI Training with Inter-Observer Eye-Gaze Consistencies
by: Gu, Hongyan, et al.
Published: (2024)

DiffEye: Diffusion-Based Continuous Eye-Tracking Data Generation Conditioned on Natural Images
by: Kara, Ozgur, et al.
Published: (2025)

The Rescue Effect: Spatio-Semantic Early Exit Bypasses Quantization Collapse in CLIP
by: Nam, Kahyeon, et al.
Published: (2026)

MaDis-Stereo: Enhanced Stereo Matching via Distilled Masked Image Modeling
by: Ahn, Jihye, et al.
Published: (2024)

EyeCue: Driver Cognitive Distraction Detection via Gaze-Empowered Egocentric Video Understanding
by: Zhang, Lang, et al.
Published: (2026)

Enhancing Alignment for Unified Multimodal Models via Semantically-Grounded Supervision
by: Kim, Jiyeong, et al.
Published: (2026)

Improving Generative Pre-Training: An In-depth Study of Masked Image Modeling and Denoising Models
by: Choi, Hyesong, et al.
Published: (2024)

FUSE: Unifying Spectral and Semantic Cues for Robust AI-Generated Image Detection
by: Hossain, Md. Zahid, et al.
Published: (2025)

Eyes Tell the Truth: GazeVal Highlights Shortcomings of Generative AI in Medical Imaging
by: Wong, David, et al.
Published: (2025)

Semantic-Aware Reconstruction Error for Detecting AI-Generated Images
by: Kang, Ju Yeon, et al.
Published: (2025)

STRIDE: When to Speak Meets Sequence Denoising for Streaming Video Understanding
by: Kim, Junho, et al.
Published: (2026)

UniTT-Stereo: Unified Training of Transformer for Enhanced Stereo Matching
by: Kim, Soomin, et al.
Published: (2024)

Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders
by: Ryan, Fiona, et al.
Published: (2024)

Aggregating Diverse Cue Experts for AI-Generated Image Detection
by: Tan, Lei, et al.
Published: (2026)

Emerging Property of Masked Token for Effective Pre-training
by: Choi, Hyesong, et al.
Published: (2024)

CoT-Pose: Chain-of-Thought Reasoning for 3D Pose Generation from Abstract Prompts
by: Cha, Junuk, et al.
Published: (2025)

SG-MIM: Structured Knowledge Guided Efficient Pre-training for Dense Prediction
by: Son, Sumin, et al.
Published: (2024)

Roll Your Eyes: Gaze Redirection via Explicit 3D Eyeball Rotation
by: Choi, YoungChan, et al.
Published: (2025)

Decoupling Augmentation Bias in Prompt Learning for Vision-Language Models
by: Kim, Gahyeon, et al.
Published: (2025)

AAPL: Adding Attributes to Prompt Learning for Vision-Language Models
by: Kim, Gahyeon, et al.
Published: (2024)

Listen to Look into the Future: Audio-Visual Egocentric Gaze Anticipation
by: Lai, Bolin, et al.
Published: (2023)

Gaze Prediction in Virtual Reality Without Eye Tracking Using Visual and Head Motion Cues
by: Petrou, Christos, et al.
Published: (2026)

When LoRA Betrays: Backdooring Text-to-Image Models by Masquerading as Benign Adapters
by: Lyu, Liangwei, et al.
Published: (2026)

Symmetry Strikes Back: From Single-Image Symmetry Detection to 3D Generation
by: Li, Xiang, et al.
Published: (2024)

SDDGR: Stable Diffusion-based Deep Generative Replay for Class Incremental Object Detection
by: Kim, Junsu, et al.
Published: (2024)

When Detectors Forget Forensics: Blocking Semantic Shortcuts for Generalizable AI-Generated Image Detection
by: Shuai, Chao, et al.
Published: (2026)

Seeing Eye to AI: Comparing Human Gaze and Model Attention in Video Memorability
by: Kumar, Prajneya, et al.
Published: (2023)

Reciprocal Attention Mixing Transformer for Lightweight Image Restoration
by: Choi, Haram, et al.
Published: (2023)

Eyes on VLM: Benchmarking Gaze Following and Social Gaze Prediction in Vision Language Models
by: Wang, Hengfei, et al.
Published: (2026)

TADFormer : Task-Adaptive Dynamic Transformer for Efficient Multi-Task Learning
by: Baek, Seungmin, et al.
Published: (2025)

TCAN: Animating Human Images with Temporally Consistent Pose Guidance using Diffusion Models
by: Kim, Jeongho, et al.
Published: (2024)

Eye Gaze as a Signal for Conveying User Attention in Contextual AI Systems
by: Wilson, Ethan, et al.
Published: (2025)

Three Forensic Cues for JPEG AI Images
by: Bergmann, Sandra, et al.
Published: (2025)

iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation
by: Jo, Hayeon, et al.
Published: (2024)

RobIA: Robust Instance-aware Continual Test-time Adaptation for Deep Stereo
by: Ko, Jueun, et al.
Published: (2025)

Collaborative Learning for Enhanced Unsupervised Domain Adaptation
by: Cho, Minhee, et al.
Published: (2024)

OpenFS: Multi-Hand-Capable Fingerspelling Recognition with Implicit Signing-Hand Detection and Frame-Wise Letter-Conditioned Synthesis
by: Cha, Junuk, et al.
Published: (2026)

CLIP Can Understand Depth
by: Kim, Sohee, et al.
Published: (2024)