| Main Authors: | Reddy, Surya N, Kurrey, Vaibhav, Nagar, Mayank, Gupta, Gagan Raj |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2412.05531 |
Similar Items
Process Integrated Computer Vision for Real-Time Failure Prediction in Steel Rolling Mill
by: Kurrey, Vaibhav, et al.
Published: (2025)
PatchAlign: Fair and Accurate Skin Disease Image Classification by Alignment with Clinical Labels
by: Aayushman, et al.
Published: (2024)
Leveraging Vision-Language Pre-training for Human Activity Recognition in Still Images
by: Mahanta, Cristina, et al.
Published: (2025)
FOCUS: Bridging Fine-Grained Recognition and Open-World Discovery across Domains
by: Rathore, Vaibhav, et al.
Published: (2026)
TechING: Towards Real World Technical Image Understanding via VLMs
by: Nadeem, Tafazzul, et al.
Published: (2026)
Gaslight, Gatekeep, V1-V3: Early Visual Cortex Alignment Shields Vision-Language Models from Sycophantic Manipulation
by: Shah, Arya, et al.
Published: (2026)
OpenMarcie: Dataset for Multimodal Action Recognition in Industrial Environments
by: Bello, Hymalai, et al.
Published: (2026)
A Low-Cost Real-Time Framework for Industrial Action Recognition Using Foundation Models
by: Wang, Zhicheng, et al.
Published: (2024)
Occlusion Aware Student Emotion Recognition based on Facial Action Unit Detection
by: Wally, Shrouk, et al.
Published: (2023)
Safe-Construct: Redefining Construction Safety Violation Recognition as 3D Multi-View Engagement Task
by: Chharia, Aviral, et al.
Published: (2025)
HIDISC: A Hyperbolic Framework for Domain Generalization with Generalized Category Discovery
by: Rathore, Vaibhav, et al.
Published: (2025)
Active Learning for GCN-based Action Recognition
by: Sahbi, Hichem
Published: (2025)
TC-OCR: TableCraft OCR for Efficient Detection & Recognition of Table Structure & Content
by: Anand, Avinash, et al.
Published: (2024)
Exploring Explainability in Video Action Recognition
by: Saha, Avinab, et al.
Published: (2024)
Autoregressive Adaptive Hypergraph Transformer for Skeleton-based Activity Recognition
by: Ray, Abhisek, et al.
Published: (2024)
Simultaneous Detection and Interaction Reasoning for Object-Centric Action Recognition
by: Li, Xunsong, et al.
Published: (2024)
Analysis and Evaluation of Kinect-based Action Recognition Algorithms
by: Wang, Lei
Published: (2021)
Do You See What I Say? Generalizable Deepfake Detection based on Visual Speech Recognition
by: Bora, Maheswar, et al.
Published: (2025)
SV3.3B: A Sports Video Understanding Model for Action Recognition
by: Kodathala, Sai Varun, et al.
Published: (2025)
StretchySnake: Flexible SSM Training Unlocks Action Recognition Across Spatio-Temporal Scales
by: Siddiqui, Nyle, et al.
Published: (2025)
MonitorVLM: A Vision Language Framework for Safety Violation Detection in Mining Operations
by: Wu, Jiang, et al.
Published: (2025)
Understanding and Mitigating Toxicity in Image-Text Pretraining Datasets: A Case Study on LLaVA
by: Kanjula, Karthik Reddy, et al.
Published: (2025)
GRITv2: Efficient and Light-weight Social Relation Recognition
by: Reddy, N K Sagar, et al.
Published: (2024)
Synthetic-to-Real Domain Adaptation for Action Recognition: A Dataset and Baseline Performances
by: Reddy, Arun V., et al.
Published: (2023)
Leveraging Foundation Model Automatic Data Augmentation Strategies and Skeletal Points for Hands Action Recognition in Industrial Assembly Lines
by: Wu, Liang, et al.
Published: (2024)
Continual Learning Improves Zero-Shot Action Recognition
by: Gowda, Shreyank N, et al.
Published: (2024)
YOWOv3: An Efficient and Generalized Framework for Human Action Detection and Recognition
by: Dang, Duc Manh Nguyen, et al.
Published: (2024)
LORTSAR: Low-Rank Transformer for Skeleton-based Action Recognition
by: Oraki, Soroush, et al.
Published: (2024)
Joint Temporal Pooling for Improving Skeleton-based Action Recognition
by: Gunasekara, Shanaka Ramesh, et al.
Published: (2024)
Differentiable Frequency-based Disentanglement for Aerial Video Action Recognition
by: Kothandaraman, Divya, et al.
Published: (2022)
Benchmarking Recurrent Event-Based Object Detection for Industrial Multi-Class Recognition on MTevent
by: Manohar, Lokeshwaran, et al.
Published: (2026)
Telling Stories for Common Sense Zero-Shot Action Recognition
by: Gowda, Shreyank N, et al.
Published: (2023)
Discerning the Chaos: Detecting Adversarial Perturbations while Disentangling Intentional from Unintentional Noises
by: Jain, Anubhooti, et al.
Published: (2024)
Exploring Vision-Language Models for Open-Vocabulary Zero-Shot Action Segmentation
by: Unmesh, Asim, et al.
Published: (2026)
UltrasODM: A Dual Stream Optical Flow Mamba Network for 3D Freehand Ultrasound Reconstruction
by: Anand, Mayank, et al.
Published: (2025)
Transformer-based Fusion of 2D-pose and Spatio-temporal Embeddings for Distracted Driver Action Recognition
by: Akdag, Erkut, et al.
Published: (2024)
Sign Language Recognition based on YOLOv5 Algorithm for the Telugu Sign Language
by: P, Vipul Reddy., et al.
Published: (2024)
Low-Resolution Action Recognition for Tiny Actions Challenge
by: Chen, Boyu, et al.
Published: (2022)
Multi-Modality Co-Learning for Efficient Skeleton-based Action Recognition
by: Liu, Jinfu, et al.
Published: (2024)
An Information Compensation Framework for Zero-Shot Skeleton-based Action Recognition
by: Xu, Haojun, et al.
Published: (2024)