Library Catalog

Bibliographic Details
Main Authors:	Wang, Zhicheng; Liang, Wensheng; Zhuang, Ruiyan; Li, Shuai; Tan, Jianwei; Ma, Xiaoguang
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2403.08420

Similar Items

Leveraging Foundation Model Automatic Data Augmentation Strategies and Skeletal Points for Hands Action Recognition in Industrial Assembly Lines
by: Wu, Liang, et al.
Published: (2024)

Are Foundation Models Ready for Industrial Defect Recognition? A Reality Check on Real-World Data
by: Baeuerle, Simon, et al.
Published: (2025)

LORTSAR: Low-Rank Transformer for Skeleton-based Action Recognition
by: Oraki, Soroush, et al.
Published: (2024)

A Real-Time Human Action Recognition Model for Assisted Living
by: Wang, Yixuan, et al.
Published: (2025)

Action Recognition based Industrial Safety Violation Detection
by: Reddy, Surya N, et al.
Published: (2024)

Leveraging Foundation Models for Multimodal Graph-Based Action Recognition
by: Ziaeetabar, Fatemeh, et al.
Published: (2025)

EITNet: An IoT-Enhanced Framework for Real-Time Basketball Action Recognition
by: Liu, Jingyu, et al.
Published: (2024)

ZSG-IAD: A Multimodal Framework for Zero-Shot Grounded Industrial Anomaly Detection
by: Chen, Qiuhui, et al.
Published: (2026)

Transcending Adversarial Perturbations: Manifold-Aided Adversarial Examples with Legitimate Semantics
by: Li, Shuai, et al.
Published: (2024)

Real-Time Human Action Recognition on Embedded Platforms
by: Wang, Ruiqi, et al.
Published: (2024)

Efficient Spatial-Temporal Modeling for Real-Time Video Analysis: A Unified Framework for Action Recognition and Object Tracking
by: John, Shahla
Published: (2025)

Towards an Effective Action-Region Tracking Framework for Fine-grained Video Action Recognition
by: Sun, Baoli, et al.
Published: (2025)

Advancing Human Action Recognition with Foundation Models trained on Unlabeled Public Videos
by: Qian, Yang, et al.
Published: (2024)

A Renaissance of Explicit Motion Information Mining from Transformers for Action Recognition
by: Zhuang, Peiqin, et al.
Published: (2025)

OBSeg: Accurate and Fast Instance Segmentation Framework Using Segmentation Foundation Models with Oriented Bounding Box Prompts
by: Zhou, Zhen, et al.
Published: (2024)

OpenMarcie: Dataset for Multimodal Action Recognition in Industrial Environments
by: Bello, Hymalai, et al.
Published: (2026)

ActionSink: Toward Precise Robot Manipulation with Dynamic Integration of Action Flow
by: Guo, Shanshan, et al.
Published: (2025)

Foundation Model for Skeleton-Based Human Action Understanding
by: Wang, Hongsong, et al.
Published: (2025)

Real-Time Manipulation Action Recognition with a Factorized Graph Sequence Encoder
by: Erdogan, Enes, et al.
Published: (2025)

Unsupervised Spatial-Temporal Feature Enrichment and Fidelity Preservation Network for Skeleton based Action Recognition
by: Li, Chuankun, et al.
Published: (2024)

DialBench: Towards Accurate Reading Recognition of Pointer Meter using Large Foundation Models
by: Wang, Futian, et al.
Published: (2025)

Selective Volume Mixup for Video Action Recognition
by: Tan, Yi, et al.
Published: (2023)

A Breast Vision Pathology Foundation Model for Real-world Clinical Utility
by: Xu, Yingxue, et al.
Published: (2026)

PRENet: A Plane-Fit Redundancy Encoding Point Cloud Sequence Network for Real-Time 3D Action Recognition
by: He, Shenglin, et al.
Published: (2024)

Spatial-Temporal Perception with Causal Inference for Naturalistic Driving Action Recognition
by: Chang, Qing, et al.
Published: (2025)

EPRecon: An Efficient Framework for Real-Time Panoptic 3D Reconstruction from Monocular Video
by: Zhou, Zhen, et al.
Published: (2024)

A Survey on Foundation-Model-Based Industrial Defect Detection
by: Yang, Tianle, et al.
Published: (2025)

An Information Compensation Framework for Zero-Shot Skeleton-based Action Recognition
by: Xu, Haojun, et al.
Published: (2024)

DPSeg: Dual-Prompt Cost Volume Learning for Open-Vocabulary Semantic Segmentation
by: Zhao, Ziyu, et al.
Published: (2025)

Towards Unified Facial Action Unit Recognition Framework by Large Language Models
by: Hu, Guohong, et al.
Published: (2024)

AURA: A Hybrid Spatiotemporal-Chromatic Framework for Robust, Real-Time Detection of Industrial Smoke Emissions
by: Bychkov, Mikhail, et al.
Published: (2025)

EdgeOAR: Real-time Online Action Recognition On Edge Devices
by: Luo, Wei, et al.
Published: (2024)

Lane Change Classification and Prediction with Action Recognition Networks
by: Liang, Kai, et al.
Published: (2022)

KAConvNet: Kolmogorov-Arnold Convolutional Networks for Vision Recognition
by: Liu, Zhaoxiang, et al.
Published: (2026)

Industrial Language-Image Dataset (ILID): Adapting Vision Foundation Models for Industrial Settings
by: Moenck, Keno, et al.
Published: (2024)

Dynamic Perturbation-Adaptive Adversarial Training on Medical Image Classification
by: Li, Shuai, et al.
Published: (2024)

Temporal Action Detection Model Compression by Progressive Block Drop
by: Chen, Xiaoyong, et al.
Published: (2025)

Taylor Videos for Action Recognition
by: Wang, Lei, et al.
Published: (2024)

ActionHub: A Large-scale Action Video Description Dataset for Zero-shot Action Recognition
by: Zhou, Jiaming, et al.
Published: (2024)

Pailitao-VL: Unified Embedding and Reranker for Real-Time Multi-Modal Industrial Search
by: Chen, Lei, et al.
Published: (2026)