:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Guan, Yiran, Chen, Zhuoguang, Zeng, Wenzheng, Cao, Zhiguo, Xiao, Yang
Format:	Preprint
Published:	2023
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2310.18131
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

GazeHTA: End-to-end Gaze Target Detection with Head-Target Association
by: Lin, Zhi-Yi, et al.
Published: (2024)

DICE: End-to-end Deformation Capture of Hand-Face Interactions from a Single Image
by: Wu, Qingxuan, et al.
Published: (2024)

StyGazeTalk: Learning Stylized Generation of Gaze and Head Dynamics
by: Shi, Chengwei, et al.
Published: (2025)

CLIP-Gaze: Towards General Gaze Estimation via Visual-Linguistic Model
by: Yin, Pengwei, et al.
Published: (2024)

CrossGLG: LLM Guides One-shot Skeleton-based 3D Action Recognition in a Cross-level Manner
by: Yan, Tingbing, et al.
Published: (2024)

Identity-Preserving Image-to-Video Generation via Reward-Guided Optimization
by: Shen, Liao, et al.
Published: (2025)

GazeDETR: Gaze Detection using Disentangled Head and Gaze Representations
by: de Belen, Ryan Anthony Jalova, et al.
Published: (2025)

DepthVLA: Enhancing Vision-Language-Action Models with Depth-Aware Spatial Reasoning
by: Yuan, Tianyuan, et al.
Published: (2025)

DHECA-SuperGaze: Dual Head-Eye Cross-Attention and Super-Resolution for Unconstrained Gaze Estimation
by: Šikić, Franko, et al.
Published: (2025)

NVDS+: Towards Efficient and Versatile Neural Stabilizer for Video Depth Estimation
by: Wang, Yiran, et al.
Published: (2023)

UncAD: Towards Safe End-to-end Autonomous Driving via Online Map Uncertainty
by: Yang, Pengxuan, et al.
Published: (2025)

Gaze Label Alignment: Alleviating Domain Shift for Gaze Estimation
by: Zeng, Guanzhong, et al.
Published: (2024)

PPAD: Iterative Interactions of Prediction and Planning for End-to-end Autonomous Driving
by: Chen, Zhili, et al.
Published: (2023)

GazeFormer-MoE: Context-Aware Gaze Estimation via CLIP and MoE Transformer
by: Zhao, Xinyuan, et al.
Published: (2026)

HOIGaze: Gaze Estimation During Hand-Object Interactions in Extended Reality Exploiting Eye-Hand-Head Coordination
by: Hu, Zhiming, et al.
Published: (2025)

COMICS: End-to-end Bi-grained Contrastive Learning for Multi-face Forgery Detection
by: Zhang, Cong, et al.
Published: (2023)

Enhancing Space-time Video Super-resolution via Spatial-temporal Feature Interaction
by: Yue, Zijie, et al.
Published: (2022)

GaTector+: A Unified Head-free Framework for Gaze Object and Gaze Following Prediction
by: Jin, Yang, et al.
Published: (2025)

REST: Diffusion-based Real-time End-to-end Streaming Talking Head Generation via ID-Context Caching and Asynchronous Streaming Distillation
by: Wang, Haotian, et al.
Published: (2025)

Towards Robust Monocular Depth Estimation in Non-Lambertian Surfaces
by: Zhang, Junrui, et al.
Published: (2024)

LG-Gaze: Learning Geometry-aware Continuous Prompts for Language-Guided Gaze Estimation
by: Yin, Pengwei, et al.
Published: (2024)

DFIMat: Decoupled Flexible Interactive Matting in Multi-Person Scenarios
by: Jiao, Siyi, et al.
Published: (2024)

TacoDepth: Towards Efficient Radar-Camera Depth Estimation with One-stage Fusion
by: Wang, Yiran, et al.
Published: (2025)

End-to-end Semantic-centric Video-based Multimodal Affective Computing
by: Lin, Ronghao, et al.
Published: (2024)

In-Context Matting
by: Guo, He, et al.
Published: (2024)

DREAM: Document Reconstruction via End-to-end Autoregressive Model
by: Li, Xin, et al.
Published: (2025)

GA3CE: Unconstrained 3D Gaze Estimation with Gaze-Aware 3D Context Encoding
by: Kawana, Yuki, et al.
Published: (2025)

Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders
by: Ryan, Fiona, et al.
Published: (2024)

GazeD: Context-Aware Diffusion for Accurate 3D Gaze Estimation
by: Catalini, Riccardo, et al.
Published: (2026)

HALO: Human-Aligned End-to-end Image Retargeting with Layered Transformations
by: Xu, Yiran, et al.
Published: (2025)

End-to-End PET Image Reconstruction via a Posterior-Mean Diffusion Model
by: Sun, Yiran, et al.
Published: (2025)

Context as Memory: Scene-Consistent Interactive Long Video Generation with Memory Retrieval
by: Yu, Jiwen, et al.
Published: (2025)

UniGaze: Towards Universal Gaze Estimation via Large-scale Pre-Training
by: Qin, Jiawei, et al.
Published: (2025)

Capturing Head Avatar with Hand Contacts from a Monocular Video
by: He, Haonan, et al.
Published: (2025)

TrackOcc: Camera-based 4D Panoptic Occupancy Tracking
by: Chen, Zhuoguang, et al.
Published: (2025)

Complet4R: Geometric Complete 4D Reconstruction
by: Wang, Weibang, et al.
Published: (2026)

LONG3R: Long Sequence Streaming 3D Reconstruction
by: Chen, Zhuoguang, et al.
Published: (2025)

Semi-Supervised Gaze Estimation via Disentangled Subspace Contrastive Learning
by: Tan, Qida, et al.
Published: (2026)

GazeShift: Unsupervised Gaze Estimation and Dataset for VR
by: Shapira, Gil, et al.
Published: (2026)

RoboTAG: End-to-end Robot Configuration Estimation via Topological Alignment Graph
by: Liu, Yifan, et al.
Published: (2025)