:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Lin, Jing, Feng, Yao, Liu, Weiyang, Black, Michael J.
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition Machine Learning
Online Access:	https://arxiv.org/abs/2405.04533
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

ChatPose: Chatting about 3D Human Pose
by: Feng, Yao, et al.
Published: (2023)

Ghost on the Shell: An Expressive Representation of General 3D Shapes
by: Liu, Zhen, et al.
Published: (2023)

Multi-Track Timeline Control for Text-Driven 3D Human Motion Generation
by: Petrovich, Mathis, et al.
Published: (2024)

Online Video Understanding: OVBench and VideoChat-Online
by: Huang, Zhenpeng, et al.
Published: (2024)

FaceGPT: Self-supervised Learning to Chat about 3D Human Faces
by: Wang, Haoran, et al.
Published: (2024)

How Good is ChatGPT at Audiovisual Deepfake Detection: A Comparative Study of ChatGPT, AI Models and Human Perception
by: Shahzad, Sahibzada Adil, et al.
Published: (2024)

Multimodal Neurodegenerative Disease Subtyping Explained by ChatGPT
by: Reyes, Diego Machado, et al.
Published: (2024)

Chatting about Upper-Body Expressive Human Pose and Shape Estimation
by: Zhao, Yuxiang, et al.
Published: (2026)

ChatVLA: Unified Multimodal Understanding and Robot Control with Vision-Language-Action Model
by: Zhou, Zhongyi, et al.
Published: (2025)

Ani3DHuman: Photorealistic 3D Human Animation with Self-guided Stochastic Sampling
by: Sun, Qi, et al.
Published: (2026)

GraphDreamer: Compositional 3D Scene Synthesis from Scene Graphs
by: Gao, Gege, et al.
Published: (2023)

VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling
by: Li, Xinhao, et al.
Published: (2024)

Contrastive Learning for Multimodal Human Activity Recognition with Limited Labeled Data
by: Jing, Long, et al.
Published: (2026)

MuseChat: A Conversational Music Recommendation System for Videos
by: Dong, Zhikang, et al.
Published: (2023)

ChatGarment: Garment Estimation, Generation and Editing via Large Language Models
by: Bian, Siyuan, et al.
Published: (2024)

Occlusion Resilient 3D Human Pose Estimation
by: Roy, Soumava Kumar, et al.
Published: (2024)

Half-Physics: Enabling Kinematic 3D Human Model with Physical Interactions
by: Siyao, Li, et al.
Published: (2025)

Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization
by: Liu, Weiyang, et al.
Published: (2023)

SCULPT: Shape-Conditioned Unpaired Learning of Pose-dependent Clothed and Textured Human Meshes
by: Sanyal, Soubhik, et al.
Published: (2023)

DreamReward: Text-to-3D Generation with Human Preference
by: Ye, Junliang, et al.
Published: (2024)

Learning Mutual Excitation for Hand-to-Hand and Human-to-Human Interaction Recognition
by: Liu, Mengyuan, et al.
Published: (2024)

Hierarchical Abstraction Enables Human-Like 3D Object Recognition in Deep Learning Models
by: Fu, Shuhao, et al.
Published: (2025)

Can Large Language Models Understand Symbolic Graphics Programs?
by: Qiu, Zeju, et al.
Published: (2024)

HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation
by: Wang, Zhenzhi, et al.
Published: (2024)

Hoi3DGen: Generating High-Quality Human-Object-Interactions in 3D
by: Sharma, Agniv, et al.
Published: (2026)

Spatio-Temporal Multi-Subgraph GCN for 3D Human Motion Prediction
by: Wang, Jiexin, et al.
Published: (2024)

Human Fall Detection using Transfer Learning-based 3D CNN
by: Alam, Ekram, et al.
Published: (2025)

Efficient Diversity-Preserving Diffusion Alignment via Gradient-Informed GFlowNets
by: Liu, Zhen, et al.
Published: (2024)

VERA: Explainable Video Anomaly Detection via Verbalized Learning of Vision-Language Models
by: Ye, Muchao, et al.
Published: (2024)

EPOCH: Jointly Estimating the 3D Pose of Cameras and Humans
by: Garau, Nicola, et al.
Published: (2024)

Modeling Collaborator: Enabling Subjective Vision Classification With Minimal Human Effort via LLM Tool-Use
by: Toubal, Imad Eddine, et al.
Published: (2024)

Open-Vocabulary Functional 3D Human-Scene Interaction Generation
by: Liu, Jie, et al.
Published: (2026)

Value Gradient Guidance for Flow Matching Alignment
by: Liu, Zhen, et al.
Published: (2025)

CLImage: Human-Annotated Datasets for Complementary-Label Learning
by: Wang, Hsiu-Hsuan, et al.
Published: (2023)

Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation
by: Li, Wenhao, et al.
Published: (2023)

Verbalized Machine Learning: Revisiting Machine Learning with Language Models
by: Xiao, Tim Z., et al.
Published: (2024)

ErgoChat: a Visual Query System for the Ergonomic Risk Assessment of Construction Workers
by: Fan, Chao, et al.
Published: (2024)

Unsupervised Machine Learning for Detecting and Locating Human-Made Objects in 3D Point Cloud
by: Zhao, Hong, et al.
Published: (2024)

Enhancing 3D Human Pose Estimation Amidst Severe Occlusion with Dual Transformer Fusion
by: Ghafoor, Mehwish, et al.
Published: (2024)

Uncertainty-Aware Testing-Time Optimization for 3D Human Pose Estimation
by: Wang, Ti, et al.
Published: (2024)