Saved in:
| Main Authors: | Kim, Jihyeon, Kim, Sohee, Lee, Soosan, Jung, Souhwan, Rehg, James Matthew, Choi, Hyesong |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.27348 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
In the Eye of Transformer: Global-Local Correlation for Egocentric Gaze Estimation
by: Lai, Bolin, et al.
Published: (2022)
by: Lai, Bolin, et al.
Published: (2022)
Cue3D: Quantifying the Role of Image Cues in Single-Image 3D Generation
by: Li, Xiang, et al.
Published: (2025)
by: Li, Xiang, et al.
Published: (2025)
Supporting Mitosis Detection AI Training with Inter-Observer Eye-Gaze Consistencies
by: Gu, Hongyan, et al.
Published: (2024)
by: Gu, Hongyan, et al.
Published: (2024)
DiffEye: Diffusion-Based Continuous Eye-Tracking Data Generation Conditioned on Natural Images
by: Kara, Ozgur, et al.
Published: (2025)
by: Kara, Ozgur, et al.
Published: (2025)
The Rescue Effect: Spatio-Semantic Early Exit Bypasses Quantization Collapse in CLIP
by: Nam, Kahyeon, et al.
Published: (2026)
by: Nam, Kahyeon, et al.
Published: (2026)
MaDis-Stereo: Enhanced Stereo Matching via Distilled Masked Image Modeling
by: Ahn, Jihye, et al.
Published: (2024)
by: Ahn, Jihye, et al.
Published: (2024)
EyeCue: Driver Cognitive Distraction Detection via Gaze-Empowered Egocentric Video Understanding
by: Zhang, Lang, et al.
Published: (2026)
by: Zhang, Lang, et al.
Published: (2026)
Enhancing Alignment for Unified Multimodal Models via Semantically-Grounded Supervision
by: Kim, Jiyeong, et al.
Published: (2026)
by: Kim, Jiyeong, et al.
Published: (2026)
Improving Generative Pre-Training: An In-depth Study of Masked Image Modeling and Denoising Models
by: Choi, Hyesong, et al.
Published: (2024)
by: Choi, Hyesong, et al.
Published: (2024)
FUSE: Unifying Spectral and Semantic Cues for Robust AI-Generated Image Detection
by: Hossain, Md. Zahid, et al.
Published: (2025)
by: Hossain, Md. Zahid, et al.
Published: (2025)
Eyes Tell the Truth: GazeVal Highlights Shortcomings of Generative AI in Medical Imaging
by: Wong, David, et al.
Published: (2025)
by: Wong, David, et al.
Published: (2025)
Semantic-Aware Reconstruction Error for Detecting AI-Generated Images
by: Kang, Ju Yeon, et al.
Published: (2025)
by: Kang, Ju Yeon, et al.
Published: (2025)
STRIDE: When to Speak Meets Sequence Denoising for Streaming Video Understanding
by: Kim, Junho, et al.
Published: (2026)
by: Kim, Junho, et al.
Published: (2026)
UniTT-Stereo: Unified Training of Transformer for Enhanced Stereo Matching
by: Kim, Soomin, et al.
Published: (2024)
by: Kim, Soomin, et al.
Published: (2024)
Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders
by: Ryan, Fiona, et al.
Published: (2024)
by: Ryan, Fiona, et al.
Published: (2024)
Aggregating Diverse Cue Experts for AI-Generated Image Detection
by: Tan, Lei, et al.
Published: (2026)
by: Tan, Lei, et al.
Published: (2026)
Emerging Property of Masked Token for Effective Pre-training
by: Choi, Hyesong, et al.
Published: (2024)
by: Choi, Hyesong, et al.
Published: (2024)
CoT-Pose: Chain-of-Thought Reasoning for 3D Pose Generation from Abstract Prompts
by: Cha, Junuk, et al.
Published: (2025)
by: Cha, Junuk, et al.
Published: (2025)
SG-MIM: Structured Knowledge Guided Efficient Pre-training for Dense Prediction
by: Son, Sumin, et al.
Published: (2024)
by: Son, Sumin, et al.
Published: (2024)
Roll Your Eyes: Gaze Redirection via Explicit 3D Eyeball Rotation
by: Choi, YoungChan, et al.
Published: (2025)
by: Choi, YoungChan, et al.
Published: (2025)
Decoupling Augmentation Bias in Prompt Learning for Vision-Language Models
by: Kim, Gahyeon, et al.
Published: (2025)
by: Kim, Gahyeon, et al.
Published: (2025)
AAPL: Adding Attributes to Prompt Learning for Vision-Language Models
by: Kim, Gahyeon, et al.
Published: (2024)
by: Kim, Gahyeon, et al.
Published: (2024)
Listen to Look into the Future: Audio-Visual Egocentric Gaze Anticipation
by: Lai, Bolin, et al.
Published: (2023)
by: Lai, Bolin, et al.
Published: (2023)
Gaze Prediction in Virtual Reality Without Eye Tracking Using Visual and Head Motion Cues
by: Petrou, Christos, et al.
Published: (2026)
by: Petrou, Christos, et al.
Published: (2026)
When LoRA Betrays: Backdooring Text-to-Image Models by Masquerading as Benign Adapters
by: Lyu, Liangwei, et al.
Published: (2026)
by: Lyu, Liangwei, et al.
Published: (2026)
Symmetry Strikes Back: From Single-Image Symmetry Detection to 3D Generation
by: Li, Xiang, et al.
Published: (2024)
by: Li, Xiang, et al.
Published: (2024)
SDDGR: Stable Diffusion-based Deep Generative Replay for Class Incremental Object Detection
by: Kim, Junsu, et al.
Published: (2024)
by: Kim, Junsu, et al.
Published: (2024)
When Detectors Forget Forensics: Blocking Semantic Shortcuts for Generalizable AI-Generated Image Detection
by: Shuai, Chao, et al.
Published: (2026)
by: Shuai, Chao, et al.
Published: (2026)
Seeing Eye to AI: Comparing Human Gaze and Model Attention in Video Memorability
by: Kumar, Prajneya, et al.
Published: (2023)
by: Kumar, Prajneya, et al.
Published: (2023)
Reciprocal Attention Mixing Transformer for Lightweight Image Restoration
by: Choi, Haram, et al.
Published: (2023)
by: Choi, Haram, et al.
Published: (2023)
Eyes on VLM: Benchmarking Gaze Following and Social Gaze Prediction in Vision Language Models
by: Wang, Hengfei, et al.
Published: (2026)
by: Wang, Hengfei, et al.
Published: (2026)
TADFormer : Task-Adaptive Dynamic Transformer for Efficient Multi-Task Learning
by: Baek, Seungmin, et al.
Published: (2025)
by: Baek, Seungmin, et al.
Published: (2025)
TCAN: Animating Human Images with Temporally Consistent Pose Guidance using Diffusion Models
by: Kim, Jeongho, et al.
Published: (2024)
by: Kim, Jeongho, et al.
Published: (2024)
Eye Gaze as a Signal for Conveying User Attention in Contextual AI Systems
by: Wilson, Ethan, et al.
Published: (2025)
by: Wilson, Ethan, et al.
Published: (2025)
Three Forensic Cues for JPEG AI Images
by: Bergmann, Sandra, et al.
Published: (2025)
by: Bergmann, Sandra, et al.
Published: (2025)
iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation
by: Jo, Hayeon, et al.
Published: (2024)
by: Jo, Hayeon, et al.
Published: (2024)
RobIA: Robust Instance-aware Continual Test-time Adaptation for Deep Stereo
by: Ko, Jueun, et al.
Published: (2025)
by: Ko, Jueun, et al.
Published: (2025)
Collaborative Learning for Enhanced Unsupervised Domain Adaptation
by: Cho, Minhee, et al.
Published: (2024)
by: Cho, Minhee, et al.
Published: (2024)
OpenFS: Multi-Hand-Capable Fingerspelling Recognition with Implicit Signing-Hand Detection and Frame-Wise Letter-Conditioned Synthesis
by: Cha, Junuk, et al.
Published: (2026)
by: Cha, Junuk, et al.
Published: (2026)
CLIP Can Understand Depth
by: Kim, Sohee, et al.
Published: (2024)
by: Kim, Sohee, et al.
Published: (2024)
Similar Items
-
In the Eye of Transformer: Global-Local Correlation for Egocentric Gaze Estimation
by: Lai, Bolin, et al.
Published: (2022) -
Cue3D: Quantifying the Role of Image Cues in Single-Image 3D Generation
by: Li, Xiang, et al.
Published: (2025) -
Supporting Mitosis Detection AI Training with Inter-Observer Eye-Gaze Consistencies
by: Gu, Hongyan, et al.
Published: (2024) -
DiffEye: Diffusion-Based Continuous Eye-Tracking Data Generation Conditioned on Natural Images
by: Kara, Ozgur, et al.
Published: (2025) -
The Rescue Effect: Spatio-Semantic Early Exit Bypasses Quantization Collapse in CLIP
by: Nam, Kahyeon, et al.
Published: (2026)