Saved in:
| Main Authors: | Guan, Yiran, Chen, Zhuoguang, Zeng, Wenzheng, Cao, Zhiguo, Xiao, Yang |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2310.18131 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
GazeHTA: End-to-end Gaze Target Detection with Head-Target Association
by: Lin, Zhi-Yi, et al.
Published: (2024)
by: Lin, Zhi-Yi, et al.
Published: (2024)
DICE: End-to-end Deformation Capture of Hand-Face Interactions from a Single Image
by: Wu, Qingxuan, et al.
Published: (2024)
by: Wu, Qingxuan, et al.
Published: (2024)
StyGazeTalk: Learning Stylized Generation of Gaze and Head Dynamics
by: Shi, Chengwei, et al.
Published: (2025)
by: Shi, Chengwei, et al.
Published: (2025)
CLIP-Gaze: Towards General Gaze Estimation via Visual-Linguistic Model
by: Yin, Pengwei, et al.
Published: (2024)
by: Yin, Pengwei, et al.
Published: (2024)
CrossGLG: LLM Guides One-shot Skeleton-based 3D Action Recognition in a Cross-level Manner
by: Yan, Tingbing, et al.
Published: (2024)
by: Yan, Tingbing, et al.
Published: (2024)
Identity-Preserving Image-to-Video Generation via Reward-Guided Optimization
by: Shen, Liao, et al.
Published: (2025)
by: Shen, Liao, et al.
Published: (2025)
GazeDETR: Gaze Detection using Disentangled Head and Gaze Representations
by: de Belen, Ryan Anthony Jalova, et al.
Published: (2025)
by: de Belen, Ryan Anthony Jalova, et al.
Published: (2025)
DepthVLA: Enhancing Vision-Language-Action Models with Depth-Aware Spatial Reasoning
by: Yuan, Tianyuan, et al.
Published: (2025)
by: Yuan, Tianyuan, et al.
Published: (2025)
DHECA-SuperGaze: Dual Head-Eye Cross-Attention and Super-Resolution for Unconstrained Gaze Estimation
by: Šikić, Franko, et al.
Published: (2025)
by: Šikić, Franko, et al.
Published: (2025)
NVDS+: Towards Efficient and Versatile Neural Stabilizer for Video Depth Estimation
by: Wang, Yiran, et al.
Published: (2023)
by: Wang, Yiran, et al.
Published: (2023)
UncAD: Towards Safe End-to-end Autonomous Driving via Online Map Uncertainty
by: Yang, Pengxuan, et al.
Published: (2025)
by: Yang, Pengxuan, et al.
Published: (2025)
Gaze Label Alignment: Alleviating Domain Shift for Gaze Estimation
by: Zeng, Guanzhong, et al.
Published: (2024)
by: Zeng, Guanzhong, et al.
Published: (2024)
PPAD: Iterative Interactions of Prediction and Planning for End-to-end Autonomous Driving
by: Chen, Zhili, et al.
Published: (2023)
by: Chen, Zhili, et al.
Published: (2023)
GazeFormer-MoE: Context-Aware Gaze Estimation via CLIP and MoE Transformer
by: Zhao, Xinyuan, et al.
Published: (2026)
by: Zhao, Xinyuan, et al.
Published: (2026)
HOIGaze: Gaze Estimation During Hand-Object Interactions in Extended Reality Exploiting Eye-Hand-Head Coordination
by: Hu, Zhiming, et al.
Published: (2025)
by: Hu, Zhiming, et al.
Published: (2025)
COMICS: End-to-end Bi-grained Contrastive Learning for Multi-face Forgery Detection
by: Zhang, Cong, et al.
Published: (2023)
by: Zhang, Cong, et al.
Published: (2023)
Enhancing Space-time Video Super-resolution via Spatial-temporal Feature Interaction
by: Yue, Zijie, et al.
Published: (2022)
by: Yue, Zijie, et al.
Published: (2022)
GaTector+: A Unified Head-free Framework for Gaze Object and Gaze Following Prediction
by: Jin, Yang, et al.
Published: (2025)
by: Jin, Yang, et al.
Published: (2025)
REST: Diffusion-based Real-time End-to-end Streaming Talking Head Generation via ID-Context Caching and Asynchronous Streaming Distillation
by: Wang, Haotian, et al.
Published: (2025)
by: Wang, Haotian, et al.
Published: (2025)
Towards Robust Monocular Depth Estimation in Non-Lambertian Surfaces
by: Zhang, Junrui, et al.
Published: (2024)
by: Zhang, Junrui, et al.
Published: (2024)
LG-Gaze: Learning Geometry-aware Continuous Prompts for Language-Guided Gaze Estimation
by: Yin, Pengwei, et al.
Published: (2024)
by: Yin, Pengwei, et al.
Published: (2024)
DFIMat: Decoupled Flexible Interactive Matting in Multi-Person Scenarios
by: Jiao, Siyi, et al.
Published: (2024)
by: Jiao, Siyi, et al.
Published: (2024)
TacoDepth: Towards Efficient Radar-Camera Depth Estimation with One-stage Fusion
by: Wang, Yiran, et al.
Published: (2025)
by: Wang, Yiran, et al.
Published: (2025)
End-to-end Semantic-centric Video-based Multimodal Affective Computing
by: Lin, Ronghao, et al.
Published: (2024)
by: Lin, Ronghao, et al.
Published: (2024)
In-Context Matting
by: Guo, He, et al.
Published: (2024)
by: Guo, He, et al.
Published: (2024)
DREAM: Document Reconstruction via End-to-end Autoregressive Model
by: Li, Xin, et al.
Published: (2025)
by: Li, Xin, et al.
Published: (2025)
GA3CE: Unconstrained 3D Gaze Estimation with Gaze-Aware 3D Context Encoding
by: Kawana, Yuki, et al.
Published: (2025)
by: Kawana, Yuki, et al.
Published: (2025)
Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders
by: Ryan, Fiona, et al.
Published: (2024)
by: Ryan, Fiona, et al.
Published: (2024)
GazeD: Context-Aware Diffusion for Accurate 3D Gaze Estimation
by: Catalini, Riccardo, et al.
Published: (2026)
by: Catalini, Riccardo, et al.
Published: (2026)
HALO: Human-Aligned End-to-end Image Retargeting with Layered Transformations
by: Xu, Yiran, et al.
Published: (2025)
by: Xu, Yiran, et al.
Published: (2025)
End-to-End PET Image Reconstruction via a Posterior-Mean Diffusion Model
by: Sun, Yiran, et al.
Published: (2025)
by: Sun, Yiran, et al.
Published: (2025)
Context as Memory: Scene-Consistent Interactive Long Video Generation with Memory Retrieval
by: Yu, Jiwen, et al.
Published: (2025)
by: Yu, Jiwen, et al.
Published: (2025)
UniGaze: Towards Universal Gaze Estimation via Large-scale Pre-Training
by: Qin, Jiawei, et al.
Published: (2025)
by: Qin, Jiawei, et al.
Published: (2025)
Capturing Head Avatar with Hand Contacts from a Monocular Video
by: He, Haonan, et al.
Published: (2025)
by: He, Haonan, et al.
Published: (2025)
TrackOcc: Camera-based 4D Panoptic Occupancy Tracking
by: Chen, Zhuoguang, et al.
Published: (2025)
by: Chen, Zhuoguang, et al.
Published: (2025)
Complet4R: Geometric Complete 4D Reconstruction
by: Wang, Weibang, et al.
Published: (2026)
by: Wang, Weibang, et al.
Published: (2026)
LONG3R: Long Sequence Streaming 3D Reconstruction
by: Chen, Zhuoguang, et al.
Published: (2025)
by: Chen, Zhuoguang, et al.
Published: (2025)
Semi-Supervised Gaze Estimation via Disentangled Subspace Contrastive Learning
by: Tan, Qida, et al.
Published: (2026)
by: Tan, Qida, et al.
Published: (2026)
GazeShift: Unsupervised Gaze Estimation and Dataset for VR
by: Shapira, Gil, et al.
Published: (2026)
by: Shapira, Gil, et al.
Published: (2026)
RoboTAG: End-to-end Robot Configuration Estimation via Topological Alignment Graph
by: Liu, Yifan, et al.
Published: (2025)
by: Liu, Yifan, et al.
Published: (2025)
Similar Items
-
GazeHTA: End-to-end Gaze Target Detection with Head-Target Association
by: Lin, Zhi-Yi, et al.
Published: (2024) -
DICE: End-to-end Deformation Capture of Hand-Face Interactions from a Single Image
by: Wu, Qingxuan, et al.
Published: (2024) -
StyGazeTalk: Learning Stylized Generation of Gaze and Head Dynamics
by: Shi, Chengwei, et al.
Published: (2025) -
CLIP-Gaze: Towards General Gaze Estimation via Visual-Linguistic Model
by: Yin, Pengwei, et al.
Published: (2024) -
CrossGLG: LLM Guides One-shot Skeleton-based 3D Action Recognition in a Cross-level Manner
by: Yan, Tingbing, et al.
Published: (2024)