| Main Authors: | Zhen, Dingcheng, Zheng, Xu, Zhang, Ruixin, Jiang, Zhiqi, Yan, Yichao, Tao, Ming, Yin, Shunshun |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.11746 |
Similar Items
SoulX-Podcast: Towards Realistic Long-form Podcasts with Dialectal and Paralinguistic Diversity
by: Xie, Hanke, et al.
Published: (2025)
SoulX-FlashTalk: Real-Time Infinite Streaming of Audio-Driven Avatars via Self-Correcting Bidirectional Distillation
by: Shen, Le, et al.
Published: (2025)
SoulX-FlashHead: Oracle-guided Generation of Infinite Real-time Streaming Talking Heads
by: Yu, Tan, et al.
Published: (2026)
SoulX-Transcriber: A Robust End-to-End Framework for Multi-Speaker Speech Transcription
by: Dai, Yuhang, et al.
Published: (2026)
SoulX-Singer: Towards High-Quality Zero-Shot Singing Voice Synthesis
by: Qian, Jiale, et al.
Published: (2026)
Teller: Real-Time Streaming Audio-Driven Portrait Animation with Autoregressive Motion Generation
by: Zhen, Dingcheng, et al.
Published: (2025)
SoulX-Duplug: Plug-and-Play Streaming State Prediction Module for Realtime Full-Duplex Speech Conversation
by: Yan, Ruiqi, et al.
Published: (2026)
RAP: Real-time Audio-driven Portrait Animation with Video Diffusion Transformer
by: Du, Fangyu, et al.
Published: (2025)
Marrying Autoregressive Transformer and Diffusion with Multi-Reference Autoregression
by: Zhen, Dingcheng, et al.
Published: (2025)
Soul: Breathe Life into Digital Human for High-fidelity Long-term Multimodal Animation
by: Zhang, Jiangning, et al.
Published: (2025)
Of Humans, Pigs, and Souls
by: Mimica, Jadran
Published: (2023)
Relax Forcing: Relaxed KV-Memory for Consistent Long Video Generation
by: Zhao, Zengqun, et al.
Published: (2026)
InterActHuman: Multi-Concept Human Animation with Layout-Aligned Audio Conditions
by: Wang, Zhenzhi, et al.
Published: (2025)
The Soul in the System (Consciousness, Memory, and the Golden Margin)
by: Flank, George
Published: (2025)
EvAnimate: Event-conditioned Image-to-Video Generation for Human Animation
by: Qu, Qiang, et al.
Published: (2025)
AnimateAnywhere: Rouse the Background in Human Image Animation
by: Liu, Xiaoyu, et al.
Published: (2025)
NameBERT: Scaling Name-Based Nationality Classification with LLM-Augmented Open Academic Data
by: Ming, Cong, et al.
Published: (2026)
EverAnimate: Minute-Scale Human Animation via Latent Flow Restoration
by: Li, Wuyang, et al.
Published: (2026)
StableAnimator: High-Quality Identity-Preserving Human Image Animation
by: Tu, Shuyuan, et al.
Published: (2024)
UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer
by: Wang, Xiang, et al.
Published: (2025)
Predicting Human Chess Moves: An AI Assisted Analysis of Chess Games Using Skill-group Specific n-gram Language Models
by: Zhong, Daren, et al.
Published: (2025)
Relevance-driven Decision Making for Safer and More Efficient Human Robot Collaboration
by: Zhang, Xiaotong, et al.
Published: (2024)
StableAnimator++: Overcoming Pose Misalignment and Face Distortion for Human Image Animation
by: Tu, Shuyuan, et al.
Published: (2025)
Animate-X: Universal Character Image Animation with Enhanced Motion Representation
by: Tan, Shuai, et al.
Published: (2024)
Selfhood and the Soul
Published: (2021)
Snapshots of the Soul
by: Blasing, Molly Thomasy
Published: (2024)
Time and Soul
by: Zachhuber, Johannes
Published: (2022)
Soul Liberty
by: Turner, Nicole Myers
Published: (2023)
The Politics of the Soul
by: John Milbank
Published: (2015)
Medicine for the Soul.
by: Tietjen, Mildred C.
Published: (1980)
X-Dyna: Expressive Dynamic Human Image Animation
by: Chang, Di, et al.
Published: (2025)
ConvScale: Conversational Interviews for Scale-Aligned Measurement
by: Qin, Peinuan, et al.
Published: (2026)
Human Gaussian Splatting: Real-time Rendering of Animatable Avatars
by: Moreau, Arthur, et al.
Published: (2023)
PersonaLive! Expressive Portrait Image Animation for Live Streaming
by: Li, Zhiyuan, et al.
Published: (2025)
Implicit Preference Alignment for Human Image Animation
by: Wang, Yuanzhi, et al.
Published: (2026)
The Cost of Living in a Conflict Zone: A Study of Shahnaz Bashir’s Scattered Souls
by: Ashaq Hussain Parray
Published: (2017)
Forcing-KV: Hybrid KV Cache Compression for Efficient Autoregressive Video Diffusion Models
by: Ji, Yicheng, et al.
Published: (2026)
FlexKV: Flexible Index Offloading for Memory-Disaggregated Key-Value Store
by: Hu, Zhisheng, et al.
Published: (2025)
Memory-Efficient Visual Autoregressive Modeling with Scale-Aware KV Cache Compression
by: Li, Kunjun, et al.
Published: (2025)
Going Down Memory Lane: Scaling Tokens for Video Stream Understanding with Dynamic KV-Cache Memory
by: Agarwal, Vatsal, et al.
Published: (2026)