:: Library Catalog

Bibliographic Details
Main Authors:	Qu, Kehua; Ding, Rui; Tang, Jin
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2411.03729

Similar Items

UnityGraph: Unified Learning of Spatio-temporal features for Multi-person Motion Prediction
by: Qu, Kehua, et al.
Published: (2024)

ChronoForge-RL: Chronological Forging through Reinforcement Learning for Enhanced Video Understanding
by: Chen, Kehua
Published: (2025)

Semore: VLM-guided Enhanced Semantic Motion Representations for Visual Reinforcement Learning
by: Wang, Wentao, et al.
Published: (2025)

SemiHMER: Semi-supervised Handwritten Mathematical Expression Recognition using pseudo-labels
by: Chen, Kehua, et al.
Published: (2025)

VERHallu: Evaluating and Mitigating Event Relation Hallucination in Video Large Language Models
by: Zhang, Zefan, et al.
Published: (2026)

MARS: Paying more attention to visual attributes for text-based person search
by: Ergasti, Alex, et al.
Published: (2024)

Temporal Continual Learning with Prior Compensation for Human Motion Prediction
by: Tang, Jianwei, et al.
Published: (2025)

E3AD: An Emotion-Aware Vision-Language-Action Model for Human-Centric End-to-End Autonomous Driving
by: Tang, Yihong, et al.
Published: (2025)

MS-Net: A Multi-Path Sparse Model for Motion Prediction in Multi-Scenes
by: Tang, Xiaqiang, et al.
Published: (2024)

Monkey See, Monkey Do: Harnessing Self-attention in Motion Diffusion for Zero-shot Motion Transfer
by: Raab, Sigal, et al.
Published: (2024)

GCA-ResUNet: Image segmentation in medical images using grouped coordinate attention
by: Ding, Jun, et al.
Published: (2025)

Capturing More: Learning Multi-Domain Representations for Robust Online Handwriting Verification
by: Zhang, Peirong, et al.
Published: (2025)

Stochastic Human Motion Prediction with Memory of Action Transition and Action Characteristic
by: Tang, Jianwei, et al.
Published: (2025)

ChemVA: Advancing Large Language Models on Chemical Reaction Diagrams Understanding
by: Rao, Mingyang, et al.
Published: (2026)

TrajFlow: Multi-modal Motion Prediction via Flow Matching
by: Yan, Qi, et al.
Published: (2025)

CNN-based Multi-In-Multi-Out Model for Efficient Spatiotemporal Prediction
by: Jin, Hyeonseok
Published: (2026)

EMO-R3: Reflective Reinforcement Learning for Emotional Reasoning in Multimodal Large Language Models
by: Fang, Yiyang, et al.
Published: (2026)

Multi-modal user interface control detection using cross-attention
by: Moradi, Milad, et al.
Published: (2026)

MulCPred: Learning Multi-modal Concepts for Explainable Pedestrian Action Prediction
by: Feng, Yan, et al.
Published: (2024)

AdaptGCD: Multi-Expert Adapter Tuning for Generalized Category Discovery
by: Qu, Yuxun, et al.
Published: (2024)

EvRainDrop: HyperGraph-guided Completion for Effective Frame and Event Stream Aggregation
by: Wang, Futian, et al.
Published: (2025)

SAM Guided Semantic and Motion Changed Region Mining for Remote Sensing Change Captioning
by: Wang, Futian, et al.
Published: (2025)

RPT-SR: Regional Prior attention Transformer for infrared image Super-Resolution
by: Jin, Youngwan, et al.
Published: (2026)

UniPINN: A Unified PINN Framework for Multi-task Learning of Diverse Navier-Stokes Equations
by: Sun, Dengdi, et al.
Published: (2026)

KDMOS: Knowledge Distillation for Motion Segmentation
by: Cao, Chunyu, et al.
Published: (2025)

FluentLip: A Phonemes-Based Two-stage Approach for Audio-Driven Lip Synthesis with Optical Flow Consistency
by: Liu, Shiyan, et al.
Published: (2025)

MogaNet: Multi-order Gated Aggregation Network
by: Li, Siyuan, et al.
Published: (2022)

Vision-Core Guided Contrastive Learning for Balanced Multi-modal Prognosis Prediction of Stroke
by: Chen, Liren, et al.
Published: (2026)

FlowCoMotion: Text-to-Motion Generation via Token-Latent Flow Modeling
by: Guan, Dawei, et al.
Published: (2026)

An Efficient and Multi-private Key Secure Aggregation for Federated Learning
by: Yang, Xue, et al.
Published: (2023)

Hyperbolic Space Learning Method Leveraging Temporal Motion Priors for Human Mesh Recovery
by: Zhang, Xiang, et al.
Published: (2025)

Coordinating Multiple Conditions for Trajectory-Controlled Human Motion Generation
by: Cai, Deli, et al.
Published: (2026)

LTMSformer: A Local Trend-Aware Attention and Motion State Encoding Transformer for Multi-Agent Trajectory Prediction
by: Yan, Yixin, et al.
Published: (2025)

Predictive Reasoning with Augmented Anomaly Contrastive Learning for Compositional Visual Relations
by: Li, Chengtai, et al.
Published: (2026)

Think Before You Drive: World Model-Inspired Multimodal Grounding for Autonomous Vehicles
by: Liao, Haicheng, et al.
Published: (2025)

ART: Adaptive Relation Tuning for Generalized Relation Prediction
by: Sudhakaran, Gopika, et al.
Published: (2025)

Quantifying Uncertainty in Motion Prediction with Variational Bayesian Mixture
by: Lu, Juanwu, et al.
Published: (2024)

HumanCM: One Step Human Motion Prediction
by: Haojie, Liu, et al.
Published: (2025)

Translution: Unifying Self-attention and Convolution for Adaptive and Relative Modeling
by: Fan, Hehe, et al.
Published: (2025)

MotionGPT: Finetuned LLMs Are General-Purpose Motion Generators
by: Zhang, Yaqi, et al.
Published: (2023)