Saved in:
| Main Authors: | Lin, Jing, Feng, Yao, Liu, Weiyang, Black, Michael J. |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2405.04533 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
ChatPose: Chatting about 3D Human Pose
by: Feng, Yao, et al.
Published: (2023)
by: Feng, Yao, et al.
Published: (2023)
Ghost on the Shell: An Expressive Representation of General 3D Shapes
by: Liu, Zhen, et al.
Published: (2023)
by: Liu, Zhen, et al.
Published: (2023)
Multi-Track Timeline Control for Text-Driven 3D Human Motion Generation
by: Petrovich, Mathis, et al.
Published: (2024)
by: Petrovich, Mathis, et al.
Published: (2024)
Online Video Understanding: OVBench and VideoChat-Online
by: Huang, Zhenpeng, et al.
Published: (2024)
by: Huang, Zhenpeng, et al.
Published: (2024)
FaceGPT: Self-supervised Learning to Chat about 3D Human Faces
by: Wang, Haoran, et al.
Published: (2024)
by: Wang, Haoran, et al.
Published: (2024)
How Good is ChatGPT at Audiovisual Deepfake Detection: A Comparative Study of ChatGPT, AI Models and Human Perception
by: Shahzad, Sahibzada Adil, et al.
Published: (2024)
by: Shahzad, Sahibzada Adil, et al.
Published: (2024)
Multimodal Neurodegenerative Disease Subtyping Explained by ChatGPT
by: Reyes, Diego Machado, et al.
Published: (2024)
by: Reyes, Diego Machado, et al.
Published: (2024)
Chatting about Upper-Body Expressive Human Pose and Shape Estimation
by: Zhao, Yuxiang, et al.
Published: (2026)
by: Zhao, Yuxiang, et al.
Published: (2026)
ChatVLA: Unified Multimodal Understanding and Robot Control with Vision-Language-Action Model
by: Zhou, Zhongyi, et al.
Published: (2025)
by: Zhou, Zhongyi, et al.
Published: (2025)
Ani3DHuman: Photorealistic 3D Human Animation with Self-guided Stochastic Sampling
by: Sun, Qi, et al.
Published: (2026)
by: Sun, Qi, et al.
Published: (2026)
GraphDreamer: Compositional 3D Scene Synthesis from Scene Graphs
by: Gao, Gege, et al.
Published: (2023)
by: Gao, Gege, et al.
Published: (2023)
VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling
by: Li, Xinhao, et al.
Published: (2024)
by: Li, Xinhao, et al.
Published: (2024)
Contrastive Learning for Multimodal Human Activity Recognition with Limited Labeled Data
by: Jing, Long, et al.
Published: (2026)
by: Jing, Long, et al.
Published: (2026)
MuseChat: A Conversational Music Recommendation System for Videos
by: Dong, Zhikang, et al.
Published: (2023)
by: Dong, Zhikang, et al.
Published: (2023)
ChatGarment: Garment Estimation, Generation and Editing via Large Language Models
by: Bian, Siyuan, et al.
Published: (2024)
by: Bian, Siyuan, et al.
Published: (2024)
Occlusion Resilient 3D Human Pose Estimation
by: Roy, Soumava Kumar, et al.
Published: (2024)
by: Roy, Soumava Kumar, et al.
Published: (2024)
Half-Physics: Enabling Kinematic 3D Human Model with Physical Interactions
by: Siyao, Li, et al.
Published: (2025)
by: Siyao, Li, et al.
Published: (2025)
Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization
by: Liu, Weiyang, et al.
Published: (2023)
by: Liu, Weiyang, et al.
Published: (2023)
SCULPT: Shape-Conditioned Unpaired Learning of Pose-dependent Clothed and Textured Human Meshes
by: Sanyal, Soubhik, et al.
Published: (2023)
by: Sanyal, Soubhik, et al.
Published: (2023)
DreamReward: Text-to-3D Generation with Human Preference
by: Ye, Junliang, et al.
Published: (2024)
by: Ye, Junliang, et al.
Published: (2024)
Learning Mutual Excitation for Hand-to-Hand and Human-to-Human Interaction Recognition
by: Liu, Mengyuan, et al.
Published: (2024)
by: Liu, Mengyuan, et al.
Published: (2024)
Hierarchical Abstraction Enables Human-Like 3D Object Recognition in Deep Learning Models
by: Fu, Shuhao, et al.
Published: (2025)
by: Fu, Shuhao, et al.
Published: (2025)
Can Large Language Models Understand Symbolic Graphics Programs?
by: Qiu, Zeju, et al.
Published: (2024)
by: Qiu, Zeju, et al.
Published: (2024)
HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation
by: Wang, Zhenzhi, et al.
Published: (2024)
by: Wang, Zhenzhi, et al.
Published: (2024)
Hoi3DGen: Generating High-Quality Human-Object-Interactions in 3D
by: Sharma, Agniv, et al.
Published: (2026)
by: Sharma, Agniv, et al.
Published: (2026)
Spatio-Temporal Multi-Subgraph GCN for 3D Human Motion Prediction
by: Wang, Jiexin, et al.
Published: (2024)
by: Wang, Jiexin, et al.
Published: (2024)
Human Fall Detection using Transfer Learning-based 3D CNN
by: Alam, Ekram, et al.
Published: (2025)
by: Alam, Ekram, et al.
Published: (2025)
Efficient Diversity-Preserving Diffusion Alignment via Gradient-Informed GFlowNets
by: Liu, Zhen, et al.
Published: (2024)
by: Liu, Zhen, et al.
Published: (2024)
VERA: Explainable Video Anomaly Detection via Verbalized Learning of Vision-Language Models
by: Ye, Muchao, et al.
Published: (2024)
by: Ye, Muchao, et al.
Published: (2024)
EPOCH: Jointly Estimating the 3D Pose of Cameras and Humans
by: Garau, Nicola, et al.
Published: (2024)
by: Garau, Nicola, et al.
Published: (2024)
Modeling Collaborator: Enabling Subjective Vision Classification With Minimal Human Effort via LLM Tool-Use
by: Toubal, Imad Eddine, et al.
Published: (2024)
by: Toubal, Imad Eddine, et al.
Published: (2024)
Open-Vocabulary Functional 3D Human-Scene Interaction Generation
by: Liu, Jie, et al.
Published: (2026)
by: Liu, Jie, et al.
Published: (2026)
Value Gradient Guidance for Flow Matching Alignment
by: Liu, Zhen, et al.
Published: (2025)
by: Liu, Zhen, et al.
Published: (2025)
CLImage: Human-Annotated Datasets for Complementary-Label Learning
by: Wang, Hsiu-Hsuan, et al.
Published: (2023)
by: Wang, Hsiu-Hsuan, et al.
Published: (2023)
Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation
by: Li, Wenhao, et al.
Published: (2023)
by: Li, Wenhao, et al.
Published: (2023)
Verbalized Machine Learning: Revisiting Machine Learning with Language Models
by: Xiao, Tim Z., et al.
Published: (2024)
by: Xiao, Tim Z., et al.
Published: (2024)
ErgoChat: a Visual Query System for the Ergonomic Risk Assessment of Construction Workers
by: Fan, Chao, et al.
Published: (2024)
by: Fan, Chao, et al.
Published: (2024)
Unsupervised Machine Learning for Detecting and Locating Human-Made Objects in 3D Point Cloud
by: Zhao, Hong, et al.
Published: (2024)
by: Zhao, Hong, et al.
Published: (2024)
Enhancing 3D Human Pose Estimation Amidst Severe Occlusion with Dual Transformer Fusion
by: Ghafoor, Mehwish, et al.
Published: (2024)
by: Ghafoor, Mehwish, et al.
Published: (2024)
Uncertainty-Aware Testing-Time Optimization for 3D Human Pose Estimation
by: Wang, Ti, et al.
Published: (2024)
by: Wang, Ti, et al.
Published: (2024)
Similar Items
-
ChatPose: Chatting about 3D Human Pose
by: Feng, Yao, et al.
Published: (2023) -
Ghost on the Shell: An Expressive Representation of General 3D Shapes
by: Liu, Zhen, et al.
Published: (2023) -
Multi-Track Timeline Control for Text-Driven 3D Human Motion Generation
by: Petrovich, Mathis, et al.
Published: (2024) -
Online Video Understanding: OVBench and VideoChat-Online
by: Huang, Zhenpeng, et al.
Published: (2024) -
FaceGPT: Self-supervised Learning to Chat about 3D Human Faces
by: Wang, Haoran, et al.
Published: (2024)