Saved in:
| Main Authors: | Wang, Zhicheng, Liang, Wensheng, Zhuang, Ruiyan, Li, Shuai, Tan, Jianwei, Ma, Xiaoguang |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2403.08420 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Leveraging Foundation Model Automatic Data Augmentation Strategies and Skeletal Points for Hands Action Recognition in Industrial Assembly Lines
by: Wu, Liang, et al.
Published: (2024)
by: Wu, Liang, et al.
Published: (2024)
Are Foundation Models Ready for Industrial Defect Recognition? A Reality Check on Real-World Data
by: Baeuerle, Simon, et al.
Published: (2025)
by: Baeuerle, Simon, et al.
Published: (2025)
LORTSAR: Low-Rank Transformer for Skeleton-based Action Recognition
by: Oraki, Soroush, et al.
Published: (2024)
by: Oraki, Soroush, et al.
Published: (2024)
A Real-Time Human Action Recognition Model for Assisted Living
by: Wang, Yixuan, et al.
Published: (2025)
by: Wang, Yixuan, et al.
Published: (2025)
Action Recognition based Industrial Safety Violation Detection
by: Reddy, Surya N, et al.
Published: (2024)
by: Reddy, Surya N, et al.
Published: (2024)
Leveraging Foundation Models for Multimodal Graph-Based Action Recognition
by: Ziaeetabar, Fatemeh, et al.
Published: (2025)
by: Ziaeetabar, Fatemeh, et al.
Published: (2025)
EITNet: An IoT-Enhanced Framework for Real-Time Basketball Action Recognition
by: Liu, Jingyu, et al.
Published: (2024)
by: Liu, Jingyu, et al.
Published: (2024)
ZSG-IAD: A Multimodal Framework for Zero-Shot Grounded Industrial Anomaly Detection
by: Chen, Qiuhui, et al.
Published: (2026)
by: Chen, Qiuhui, et al.
Published: (2026)
Transcending Adversarial Perturbations: Manifold-Aided Adversarial Examples with Legitimate Semantics
by: Li, Shuai, et al.
Published: (2024)
by: Li, Shuai, et al.
Published: (2024)
Real-Time Human Action Recognition on Embedded Platforms
by: Wang, Ruiqi, et al.
Published: (2024)
by: Wang, Ruiqi, et al.
Published: (2024)
Efficient Spatial-Temporal Modeling for Real-Time Video Analysis: A Unified Framework for Action Recognition and Object Tracking
by: John, Shahla
Published: (2025)
by: John, Shahla
Published: (2025)
Towards an Effective Action-Region Tracking Framework for Fine-grained Video Action Recognition
by: Sun, Baoli, et al.
Published: (2025)
by: Sun, Baoli, et al.
Published: (2025)
Advancing Human Action Recognition with Foundation Models trained on Unlabeled Public Videos
by: Qian, Yang, et al.
Published: (2024)
by: Qian, Yang, et al.
Published: (2024)
A Renaissance of Explicit Motion Information Mining from Transformers for Action Recognition
by: Zhuang, Peiqin, et al.
Published: (2025)
by: Zhuang, Peiqin, et al.
Published: (2025)
OBSeg: Accurate and Fast Instance Segmentation Framework Using Segmentation Foundation Models with Oriented Bounding Box Prompts
by: Zhou, Zhen, et al.
Published: (2024)
by: Zhou, Zhen, et al.
Published: (2024)
OpenMarcie: Dataset for Multimodal Action Recognition in Industrial Environments
by: Bello, Hymalai, et al.
Published: (2026)
by: Bello, Hymalai, et al.
Published: (2026)
ActionSink: Toward Precise Robot Manipulation with Dynamic Integration of Action Flow
by: Guo, Shanshan, et al.
Published: (2025)
by: Guo, Shanshan, et al.
Published: (2025)
Foundation Model for Skeleton-Based Human Action Understanding
by: Wang, Hongsong, et al.
Published: (2025)
by: Wang, Hongsong, et al.
Published: (2025)
Real-Time Manipulation Action Recognition with a Factorized Graph Sequence Encoder
by: Erdogan, Enes, et al.
Published: (2025)
by: Erdogan, Enes, et al.
Published: (2025)
Unsupervised Spatial-Temporal Feature Enrichment and Fidelity Preservation Network for Skeleton based Action Recognition
by: Li, Chuankun, et al.
Published: (2024)
by: Li, Chuankun, et al.
Published: (2024)
DialBench: Towards Accurate Reading Recognition of Pointer Meter using Large Foundation Models
by: Wang, Futian, et al.
Published: (2025)
by: Wang, Futian, et al.
Published: (2025)
Selective Volume Mixup for Video Action Recognition
by: Tan, Yi, et al.
Published: (2023)
by: Tan, Yi, et al.
Published: (2023)
A Breast Vision Pathology Foundation Model for Real-world Clinical Utility
by: Xu, Yingxue, et al.
Published: (2026)
by: Xu, Yingxue, et al.
Published: (2026)
PRENet: A Plane-Fit Redundancy Encoding Point Cloud Sequence Network for Real-Time 3D Action Recognition
by: He, Shenglin, et al.
Published: (2024)
by: He, Shenglin, et al.
Published: (2024)
Spatial-Temporal Perception with Causal Inference for Naturalistic Driving Action Recognition
by: Chang, Qing, et al.
Published: (2025)
by: Chang, Qing, et al.
Published: (2025)
EPRecon: An Efficient Framework for Real-Time Panoptic 3D Reconstruction from Monocular Video
by: Zhou, Zhen, et al.
Published: (2024)
by: Zhou, Zhen, et al.
Published: (2024)
A Survey on Foundation-Model-Based Industrial Defect Detection
by: Yang, Tianle, et al.
Published: (2025)
by: Yang, Tianle, et al.
Published: (2025)
An Information Compensation Framework for Zero-Shot Skeleton-based Action Recognition
by: Xu, Haojun, et al.
Published: (2024)
by: Xu, Haojun, et al.
Published: (2024)
DPSeg: Dual-Prompt Cost Volume Learning for Open-Vocabulary Semantic Segmentation
by: Zhao, Ziyu, et al.
Published: (2025)
by: Zhao, Ziyu, et al.
Published: (2025)
Towards Unified Facial Action Unit Recognition Framework by Large Language Models
by: Hu, Guohong, et al.
Published: (2024)
by: Hu, Guohong, et al.
Published: (2024)
AURA: A Hybrid Spatiotemporal-Chromatic Framework for Robust, Real-Time Detection of Industrial Smoke Emissions
by: Bychkov, Mikhail, et al.
Published: (2025)
by: Bychkov, Mikhail, et al.
Published: (2025)
EdgeOAR: Real-time Online Action Recognition On Edge Devices
by: Luo, Wei, et al.
Published: (2024)
by: Luo, Wei, et al.
Published: (2024)
Lane Change Classification and Prediction with Action Recognition Networks
by: Liang, Kai, et al.
Published: (2022)
by: Liang, Kai, et al.
Published: (2022)
KAConvNet: Kolmogorov-Arnold Convolutional Networks for Vision Recognition
by: Liu, Zhaoxiang, et al.
Published: (2026)
by: Liu, Zhaoxiang, et al.
Published: (2026)
Industrial Language-Image Dataset (ILID): Adapting Vision Foundation Models for Industrial Settings
by: Moenck, Keno, et al.
Published: (2024)
by: Moenck, Keno, et al.
Published: (2024)
Dynamic Perturbation-Adaptive Adversarial Training on Medical Image Classification
by: Li, Shuai, et al.
Published: (2024)
by: Li, Shuai, et al.
Published: (2024)
Temporal Action Detection Model Compression by Progressive Block Drop
by: Chen, Xiaoyong, et al.
Published: (2025)
by: Chen, Xiaoyong, et al.
Published: (2025)
Taylor Videos for Action Recognition
by: Wang, Lei, et al.
Published: (2024)
by: Wang, Lei, et al.
Published: (2024)
ActionHub: A Large-scale Action Video Description Dataset for Zero-shot Action Recognition
by: Zhou, Jiaming, et al.
Published: (2024)
by: Zhou, Jiaming, et al.
Published: (2024)
Pailitao-VL: Unified Embedding and Reranker for Real-Time Multi-Modal Industrial Search
by: Chen, Lei, et al.
Published: (2026)
by: Chen, Lei, et al.
Published: (2026)
Similar Items
-
Leveraging Foundation Model Automatic Data Augmentation Strategies and Skeletal Points for Hands Action Recognition in Industrial Assembly Lines
by: Wu, Liang, et al.
Published: (2024) -
Are Foundation Models Ready for Industrial Defect Recognition? A Reality Check on Real-World Data
by: Baeuerle, Simon, et al.
Published: (2025) -
LORTSAR: Low-Rank Transformer for Skeleton-based Action Recognition
by: Oraki, Soroush, et al.
Published: (2024) -
A Real-Time Human Action Recognition Model for Assisted Living
by: Wang, Yixuan, et al.
Published: (2025) -
Action Recognition based Industrial Safety Violation Detection
by: Reddy, Surya N, et al.
Published: (2024)