Saved in:
| Main Authors: | Kim, Minchul, Ye, Dingqiang, Su, Yiyang, Liu, Feng, Liu, Xiaoming |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2504.04708 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
KeyPoint Relative Position Encoding for Face Recognition
by: Kim, Minchul, et al.
Published: (2024)
by: Kim, Minchul, et al.
Published: (2024)
A Quality-Guided Mixture of Score-Fusion Experts Framework for Human Recognition
by: Zhu, Jie, et al.
Published: (2025)
by: Zhu, Jie, et al.
Published: (2025)
Open-Set Biometrics: Beyond Good Closed-Set Models
by: Su, Yiyang, et al.
Published: (2024)
by: Su, Yiyang, et al.
Published: (2024)
50 Years of Automated Face Recognition
by: Kim, Minchul, et al.
Published: (2025)
by: Kim, Minchul, et al.
Published: (2025)
LocalScore: Local Density-Aware Similarity Scoring for Biometrics
by: Su, Yiyang, et al.
Published: (2026)
by: Su, Yiyang, et al.
Published: (2026)
HAMoBE: Hierarchical and Adaptive Mixture of Biometric Experts for Video-based Person ReID
by: Su, Yiyang, et al.
Published: (2025)
by: Su, Yiyang, et al.
Published: (2025)
Sapiens: Foundation for Human Vision Models
by: Khirodkar, Rawal, et al.
Published: (2024)
by: Khirodkar, Rawal, et al.
Published: (2024)
Person Recognition at Altitude and Range: Fusion of Face, Body Shape and Gait
by: Liu, Feng, et al.
Published: (2025)
by: Liu, Feng, et al.
Published: (2025)
FusionAgent: A Multimodal Agent with Dynamic Model Selection for Human Recognition
by: Zhu, Jie, et al.
Published: (2026)
by: Zhu, Jie, et al.
Published: (2026)
BiggerGait: Unlocking Gait Recognition with Layer-wise Representations from Large Vision Models
by: Ye, Dingqiang, et al.
Published: (2025)
by: Ye, Dingqiang, et al.
Published: (2025)
BigGait: Learning Gait Representation You Want by Large Vision Models
by: Ye, Dingqiang, et al.
Published: (2024)
by: Ye, Dingqiang, et al.
Published: (2024)
Interpretable Perception and Reasoning for Audiovisual Geolocation
by: Su, Yiyang, et al.
Published: (2026)
by: Su, Yiyang, et al.
Published: (2026)
Pedestrian Attribute Editing for Gait Recognition and Anonymization
by: Ma, Jingzhe, et al.
Published: (2023)
by: Ma, Jingzhe, et al.
Published: (2023)
Can Textual Reasoning Improve the Performance of MLLMs on Fine-grained Visual Classification?
by: Zhu, Jie, et al.
Published: (2026)
by: Zhu, Jie, et al.
Published: (2026)
Sapiens2
by: Khirodkar, Rawal, et al.
Published: (2026)
by: Khirodkar, Rawal, et al.
Published: (2026)
Silhouette-based Gait Foundation Model
by: Ye, Dingqiang, et al.
Published: (2025)
by: Ye, Dingqiang, et al.
Published: (2025)
Person Recognition in Aerial Surveillance: A Decade Survey
by: Nguyen, Kien, et al.
Published: (2025)
by: Nguyen, Kien, et al.
Published: (2025)
Visual Persona: Foundation Model for Full-Body Human Customization
by: Nam, Jisu, et al.
Published: (2025)
by: Nam, Jisu, et al.
Published: (2025)
MixCut:A Data Augmentation Method for Facial Expression Recognition
by: Yu, Jiaxiang, et al.
Published: (2024)
by: Yu, Jiaxiang, et al.
Published: (2024)
InstantHDR: Single-forward Gaussian Splatting for High Dynamic Range 3D Reconstruction
by: Ye, Dingqiang, et al.
Published: (2026)
by: Ye, Dingqiang, et al.
Published: (2026)
MotivNet: Evolving Meta-Sapiens into an Emotionally Intelligent Foundation Model
by: Medicharla, Rahul, et al.
Published: (2025)
by: Medicharla, Rahul, et al.
Published: (2025)
Foundation Cures Personalization: Improving Personalized Models' Prompt Consistency via Hidden Foundation Knowledge
by: Cai, Yiyang, et al.
Published: (2024)
by: Cai, Yiyang, et al.
Published: (2024)
Arc2Face: A Foundation Model for ID-Consistent Human Faces
by: Papantoniou, Foivos Paraperas, et al.
Published: (2024)
by: Papantoniou, Foivos Paraperas, et al.
Published: (2024)
UniMo: Unifying 2D Video and 3D Human Motion with an Autoregressive Framework
by: Pang, Youxin, et al.
Published: (2025)
by: Pang, Youxin, et al.
Published: (2025)
SelaVPR++: Towards Seamless Adaptation of Foundation Models for Efficient Place Recognition
by: Lu, Feng, et al.
Published: (2025)
by: Lu, Feng, et al.
Published: (2025)
H-MoRe: Learning Human-centric Motion Representation for Action Analysis
by: Huang, Zhanbo, et al.
Published: (2025)
by: Huang, Zhanbo, et al.
Published: (2025)
On the Holistic Approach for Detecting Human Image Forgery
by: Guo, Xiao, et al.
Published: (2026)
by: Guo, Xiao, et al.
Published: (2026)
H-Flow: Self-supervised Human Scene Flow via Physics-inspired Joint Multi-modal Learning
by: Huang, Zhanbo, et al.
Published: (2026)
by: Huang, Zhanbo, et al.
Published: (2026)
Read Pointer Meters in complex environments based on a Human-like Alignment and Recognition Algorithm
by: Shu, Yan, et al.
Published: (2023)
by: Shu, Yan, et al.
Published: (2023)
DreamID-Omni: Unified Framework for Controllable Human-Centric Audio-Video Generation
by: Guo, Xu, et al.
Published: (2026)
by: Guo, Xu, et al.
Published: (2026)
Explore Human Parsing Modality for Action Recognition
by: Liu, Jinfu, et al.
Published: (2024)
by: Liu, Jinfu, et al.
Published: (2024)
SARATR-X: Toward Building A Foundation Model for SAR Target Recognition
by: Li, Weijie, et al.
Published: (2024)
by: Li, Weijie, et al.
Published: (2024)
Advancing Human Action Recognition with Foundation Models trained on Unlabeled Public Videos
by: Qian, Yang, et al.
Published: (2024)
by: Qian, Yang, et al.
Published: (2024)
Cross-Model Cross-Stream Learning for Self-Supervised Human Action Recognition
by: Liu, Mengyuan, et al.
Published: (2023)
by: Liu, Mengyuan, et al.
Published: (2023)
Towards Implicit Aggregation: Robust Image Representation for Place Recognition in the Transformer Era
by: Lu, Feng, et al.
Published: (2025)
by: Lu, Feng, et al.
Published: (2025)
SkateFormer: Skeletal-Temporal Transformer for Human Action Recognition
by: Do, Jeonghyeok, et al.
Published: (2024)
by: Do, Jeonghyeok, et al.
Published: (2024)
MedSapiens: Taking a Pose to Rethink Medical Imaging Landmark Detection
by: Elbatel, Marawan, et al.
Published: (2025)
by: Elbatel, Marawan, et al.
Published: (2025)
Source-Free Domain Adaptation with Frozen Multimodal Foundation Model
by: Tang, Song, et al.
Published: (2023)
by: Tang, Song, et al.
Published: (2023)
Active Generation Network of Human Skeleton for Action Recognition
by: Liu, Long, et al.
Published: (2024)
by: Liu, Long, et al.
Published: (2024)
ID-Animator: Zero-Shot Identity-Preserving Human Video Generation
by: He, Xuanhua, et al.
Published: (2024)
by: He, Xuanhua, et al.
Published: (2024)
Similar Items
-
KeyPoint Relative Position Encoding for Face Recognition
by: Kim, Minchul, et al.
Published: (2024) -
A Quality-Guided Mixture of Score-Fusion Experts Framework for Human Recognition
by: Zhu, Jie, et al.
Published: (2025) -
Open-Set Biometrics: Beyond Good Closed-Set Models
by: Su, Yiyang, et al.
Published: (2024) -
50 Years of Automated Face Recognition
by: Kim, Minchul, et al.
Published: (2025) -
LocalScore: Local Density-Aware Similarity Scoring for Biometrics
by: Su, Yiyang, et al.
Published: (2026)