| Main Authors: | Qu, Kehua, Ding, Rui, Tang, Jin |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Online Access: | https://arxiv.org/abs/2411.03729 |
Similar Items
UnityGraph: Unified Learning of Spatio-temporal features for Multi-person Motion Prediction
by: Qu, Kehua, et al.
Published: (2024)
ChronoForge-RL: Chronological Forging through Reinforcement Learning for Enhanced Video Understanding
by: Chen, Kehua
Published: (2025)
Semore: VLM-guided Enhanced Semantic Motion Representations for Visual Reinforcement Learning
by: Wang, Wentao, et al.
Published: (2025)
SemiHMER: Semi-supervised Handwritten Mathematical Expression Recognition using pseudo-labels
by: Chen, Kehua, et al.
Published: (2025)
VERHallu: Evaluating and Mitigating Event Relation Hallucination in Video Large Language Models
by: Zhang, Zefan, et al.
Published: (2026)
MARS: Paying more attention to visual attributes for text-based person search
by: Ergasti, Alex, et al.
Published: (2024)
Temporal Continual Learning with Prior Compensation for Human Motion Prediction
by: Tang, Jianwei, et al.
Published: (2025)
E3AD: An Emotion-Aware Vision-Language-Action Model for Human-Centric End-to-End Autonomous Driving
by: Tang, Yihong, et al.
Published: (2025)
MS-Net: A Multi-Path Sparse Model for Motion Prediction in Multi-Scenes
by: Tang, Xiaqiang, et al.
Published: (2024)
Monkey See, Monkey Do: Harnessing Self-attention in Motion Diffusion for Zero-shot Motion Transfer
by: Raab, Sigal, et al.
Published: (2024)
GCA-ResUNet:Image segmentation in medical images using grouped coordinate attention
by: Ding, Jun, et al.
Published: (2025)
Capturing More: Learning Multi-Domain Representations for Robust Online Handwriting Verification
by: Zhang, Peirong, et al.
Published: (2025)
Stochastic Human Motion Prediction with Memory of Action Transition and Action Characteristic
by: Tang, Jianwei, et al.
Published: (2025)
ChemVA: Advancing Large Language Models on Chemical Reaction Diagrams Understanding
by: Rao, Mingyang, et al.
Published: (2026)
TrajFlow: Multi-modal Motion Prediction via Flow Matching
by: Yan, Qi, et al.
Published: (2025)
CNN-based Multi-In-Multi-Out Model for Efficient Spatiotemporal Prediction
by: Jin, Hyeonseok
Published: (2026)
EMO-R3: Reflective Reinforcement Learning for Emotional Reasoning in Multimodal Large Language Models
by: Fang, Yiyang, et al.
Published: (2026)
Multi-modal user interface control detection using cross-attention
by: Moradi, Milad, et al.
Published: (2026)
MulCPred: Learning Multi-modal Concepts for Explainable Pedestrian Action Prediction
by: Feng, Yan, et al.
Published: (2024)
AdaptGCD: Multi-Expert Adapter Tuning for Generalized Category Discovery
by: Qu, Yuxun, et al.
Published: (2024)
EvRainDrop: HyperGraph-guided Completion for Effective Frame and Event Stream Aggregation
by: Wang, Futian, et al.
Published: (2025)
SAM Guided Semantic and Motion Changed Region Mining for Remote Sensing Change Captioning
by: Wang, Futian, et al.
Published: (2025)
RPT-SR: Regional Prior attention Transformer for infrared image Super-Resolution
by: Jin, Youngwan, et al.
Published: (2026)
UniPINN: A Unified PINN Framework for Multi-task Learning of Diverse Navier-Stokes Equations
by: Sun, Dengdi, et al.
Published: (2026)
KDMOS:Knowledge Distillation for Motion Segmentation
by: Cao, Chunyu, et al.
Published: (2025)
FluentLip: A Phonemes-Based Two-stage Approach for Audio-Driven Lip Synthesis with Optical Flow Consistency
by: Liu, Shiyan, et al.
Published: (2025)
MogaNet: Multi-order Gated Aggregation Network
by: Li, Siyuan, et al.
Published: (2022)
Vision-Core Guided Contrastive Learning for Balanced Multi-modal Prognosis Prediction of Stroke
by: Chen, Liren, et al.
Published: (2026)
FlowCoMotion: Text-to-Motion Generation via Token-Latent Flow Modeling
by: Guan, Dawei, et al.
Published: (2026)
An Efficient and Multi-private Key Secure Aggregation for Federated Learning
by: Yang, Xue, et al.
Published: (2023)
Hyperbolic Space Learning Method Leveraging Temporal Motion Priors for Human Mesh Recovery
by: Zhang, Xiang, et al.
Published: (2025)
Coordinating Multiple Conditions for Trajectory-Controlled Human Motion Generation
by: Cai, Deli, et al.
Published: (2026)
LTMSformer: A Local Trend-Aware Attention and Motion State Encoding Transformer for Multi-Agent Trajectory Prediction
by: Yan, Yixin, et al.
Published: (2025)
Predictive Reasoning with Augmented Anomaly Contrastive Learning for Compositional Visual Relations
by: Li, Chengtai, et al.
Published: (2026)
Think Before You Drive: World Model-Inspired Multimodal Grounding for Autonomous Vehicles
by: Liao, Haicheng, et al.
Published: (2025)
ART: Adaptive Relation Tuning for Generalized Relation Prediction
by: Sudhakaran, Gopika, et al.
Published: (2025)
Quantifying Uncertainty in Motion Prediction with Variational Bayesian Mixture
by: Lu, Juanwu, et al.
Published: (2024)
HumanCM: One Step Human Motion Prediction
by: Haojie, Liu, et al.
Published: (2025)
Translution: Unifying Self-attention and Convolution for Adaptive and Relative Modeling
by: Fan, Hehe, et al.
Published: (2025)
MotionGPT: Finetuned LLMs Are General-Purpose Motion Generators
by: Zhang, Yaqi, et al.
Published: (2023)