Saved in:
Bibliographic Details
Main Authors: Shi, Chengwei, Cao, Chong
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2509.17168
Tags: Add Tag
No Tags, Be the first to tag this record!
Table of Contents:
  • Gaze and head movements play a central role in expressive 3D media, human-agent interaction, and immersive communication. Existing works often model facial components in isolation and lack mechanisms for generating personalized, style-aware gaze behaviors. We propose StyGazeTalk, a multimodal framework that synthesizes synchronized gaze-head dynamics with controllable styles. To support high-fidelity training, we construct HAGE, a high-precision multimodal dataset containing eye-tracking data, audio, head pose, and 3D facial parameters. Experiments show that our method produces temporally coherent, style-consistent gaze-head motions, enhancing realism in 3D face generation.