Library Catalog

Bibliographic Details
Main Authors:	Kim, Minchul, Ye, Dingqiang, Su, Yiyang, Liu, Feng, Liu, Xiaoming
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2504.04708

Similar Items

KeyPoint Relative Position Encoding for Face Recognition
by: Kim, Minchul, et al.
Published: (2024)

A Quality-Guided Mixture of Score-Fusion Experts Framework for Human Recognition
by: Zhu, Jie, et al.
Published: (2025)

Open-Set Biometrics: Beyond Good Closed-Set Models
by: Su, Yiyang, et al.
Published: (2024)

50 Years of Automated Face Recognition
by: Kim, Minchul, et al.
Published: (2025)

LocalScore: Local Density-Aware Similarity Scoring for Biometrics
by: Su, Yiyang, et al.
Published: (2026)

HAMoBE: Hierarchical and Adaptive Mixture of Biometric Experts for Video-based Person ReID
by: Su, Yiyang, et al.
Published: (2025)

Sapiens: Foundation for Human Vision Models
by: Khirodkar, Rawal, et al.
Published: (2024)

Person Recognition at Altitude and Range: Fusion of Face, Body Shape and Gait
by: Liu, Feng, et al.
Published: (2025)

FusionAgent: A Multimodal Agent with Dynamic Model Selection for Human Recognition
by: Zhu, Jie, et al.
Published: (2026)

BiggerGait: Unlocking Gait Recognition with Layer-wise Representations from Large Vision Models
by: Ye, Dingqiang, et al.
Published: (2025)

BigGait: Learning Gait Representation You Want by Large Vision Models
by: Ye, Dingqiang, et al.
Published: (2024)

Interpretable Perception and Reasoning for Audiovisual Geolocation
by: Su, Yiyang, et al.
Published: (2026)

Pedestrian Attribute Editing for Gait Recognition and Anonymization
by: Ma, Jingzhe, et al.
Published: (2023)

Can Textual Reasoning Improve the Performance of MLLMs on Fine-grained Visual Classification?
by: Zhu, Jie, et al.
Published: (2026)

Sapiens2
by: Khirodkar, Rawal, et al.
Published: (2026)

Silhouette-based Gait Foundation Model
by: Ye, Dingqiang, et al.
Published: (2025)

Person Recognition in Aerial Surveillance: A Decade Survey
by: Nguyen, Kien, et al.
Published: (2025)

Visual Persona: Foundation Model for Full-Body Human Customization
by: Nam, Jisu, et al.
Published: (2025)

MixCut: A Data Augmentation Method for Facial Expression Recognition
by: Yu, Jiaxiang, et al.
Published: (2024)

InstantHDR: Single-forward Gaussian Splatting for High Dynamic Range 3D Reconstruction
by: Ye, Dingqiang, et al.
Published: (2026)

MotivNet: Evolving Meta-Sapiens into an Emotionally Intelligent Foundation Model
by: Medicharla, Rahul, et al.
Published: (2025)

Foundation Cures Personalization: Improving Personalized Models' Prompt Consistency via Hidden Foundation Knowledge
by: Cai, Yiyang, et al.
Published: (2024)

Arc2Face: A Foundation Model for ID-Consistent Human Faces
by: Papantoniou, Foivos Paraperas, et al.
Published: (2024)

UniMo: Unifying 2D Video and 3D Human Motion with an Autoregressive Framework
by: Pang, Youxin, et al.
Published: (2025)

SelaVPR++: Towards Seamless Adaptation of Foundation Models for Efficient Place Recognition
by: Lu, Feng, et al.
Published: (2025)

H-MoRe: Learning Human-centric Motion Representation for Action Analysis
by: Huang, Zhanbo, et al.
Published: (2025)

On the Holistic Approach for Detecting Human Image Forgery
by: Guo, Xiao, et al.
Published: (2026)

H-Flow: Self-supervised Human Scene Flow via Physics-inspired Joint Multi-modal Learning
by: Huang, Zhanbo, et al.
Published: (2026)

Read Pointer Meters in complex environments based on a Human-like Alignment and Recognition Algorithm
by: Shu, Yan, et al.
Published: (2023)

DreamID-Omni: Unified Framework for Controllable Human-Centric Audio-Video Generation
by: Guo, Xu, et al.
Published: (2026)

Explore Human Parsing Modality for Action Recognition
by: Liu, Jinfu, et al.
Published: (2024)

SARATR-X: Toward Building A Foundation Model for SAR Target Recognition
by: Li, Weijie, et al.
Published: (2024)

Advancing Human Action Recognition with Foundation Models trained on Unlabeled Public Videos
by: Qian, Yang, et al.
Published: (2024)

Cross-Model Cross-Stream Learning for Self-Supervised Human Action Recognition
by: Liu, Mengyuan, et al.
Published: (2023)

Towards Implicit Aggregation: Robust Image Representation for Place Recognition in the Transformer Era
by: Lu, Feng, et al.
Published: (2025)

SkateFormer: Skeletal-Temporal Transformer for Human Action Recognition
by: Do, Jeonghyeok, et al.
Published: (2024)

MedSapiens: Taking a Pose to Rethink Medical Imaging Landmark Detection
by: Elbatel, Marawan, et al.
Published: (2025)

Source-Free Domain Adaptation with Frozen Multimodal Foundation Model
by: Tang, Song, et al.
Published: (2023)

Active Generation Network of Human Skeleton for Action Recognition
by: Liu, Long, et al.
Published: (2024)

ID-Animator: Zero-Shot Identity-Preserving Human Video Generation
by: He, Xuanhua, et al.
Published: (2024)