:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	VS, Balaji, AR, Mahi, PS, Anirudh Ganapathy, M, Manju
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2407.01435
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Interpreting Hand gestures using Object Detection and Digits Classification
by: K, Sangeetha, et al.
Published: (2024)

PICS in Pics: Physics Informed Contour Selection for Rapid Image Segmentation
by: Dwivedi, Vikas, et al.
Published: (2023)

SoGAR: Self-supervised Spatiotemporal Attention-based Social Group Activity Recognition
by: Chappa, Naga VS Raviteja, et al.
Published: (2023)

SegFace: Face Segmentation of Long-Tail Classes
by: Narayan, Kartik, et al.
Published: (2024)

FaceXBench: Evaluating Multimodal LLMs on Face Understanding
by: Narayan, Kartik, et al.
Published: (2025)

Certainty and Uncertainty Guided Active Domain Adaptation
by: Safaei, Bardia, et al.
Published: (2025)

Animalbooth: multimodal feature enhancement for animal subject personalization
by: Liu, Chen, et al.
Published: (2025)

SegFormer Fine-Tuning with Dropout: Advancing Hair Artifact Removal in Skin Lesion Analysis
by: Saad, Asif Mohammed, et al.
Published: (2025)

FaceXFormer: A Unified Transformer for Facial Analysis
by: Narayan, Kartik, et al.
Published: (2024)

A study of animal action segmentation algorithms across supervised, unsupervised, and semi-supervised learning paradigms
by: Blau, Ari, et al.
Published: (2024)

Diversity-enhanced Collaborative Mamba for Semi-supervised Medical Image Segmentation
by: Li, Shumeng, et al.
Published: (2025)

Text2Place: Affordance-aware Text Guided Human Placement
by: Parihar, Rishubh, et al.
Published: (2024)

Compass Control: Multi Object Orientation Control for Text-to-Image Generation
by: Parihar, Rishubh, et al.
Published: (2025)

Hybrid-supervised Hypergraph-enhanced Transformer for Micro-gesture Based Emotion Recognition
by: Xia, Zhaoqiang, et al.
Published: (2025)

Zero-Shot Scene Understanding for Automatic Target Recognition Using Large Vision-Language Models
by: Ranasinghe, Yasiru, et al.
Published: (2025)

VisionTrap: Unanswerable Questions On Visual Data
by: Saadat, Asir, et al.
Published: (2025)

Template-based Multi-Domain Face Recognition
by: Nanduri, Anirudh, et al.
Published: (2024)

FLAASH: Flow-Attention Adaptive Semantic Hierarchical Fusion for Multi-Modal Tobacco Content Analysis
by: Chappa, Naga VS Raviteja, et al.
Published: (2024)

PreciseControl: Enhancing Text-To-Image Diffusion Models with Fine-Grained Attribute Control
by: Parihar, Rishubh, et al.
Published: (2024)

REACT: Recognize Every Action Everywhere All At Once
by: Chappa, Naga VS Raviteja, et al.
Published: (2023)

An Evidential-enhanced Tri-Branch Consistency Learning Method for Semi-supervised Medical Image Segmentation
by: Zhang, Zhenxi, et al.
Published: (2024)

Measuring Train Driver Performance as Key to Approval of Driverless Trains
by: Tagiew, Rustam, et al.
Published: (2025)

UCATSC: Uncertainty-Aware Constrained Traffic Signal Control Under Vision-Based Partial Observability
by: Bodagala, Jayawant, et al.
Published: (2026)

HAtt-Flow: Hierarchical Attention-Flow Mechanism for Group Activity Scene Graph Generation in Videos
by: Chappa, Naga VS Raviteja, et al.
Published: (2023)

CLIP-Fields: Weakly Supervised Semantic Fields for Robotic Memory
by: Shafiullah, Nur Muhammad Mahi, et al.
Published: (2022)

Analysis of Object Detection Models for Tiny Object in Satellite Imagery: A Dataset-Centric Approach
by: PS, Kailas, et al.
Published: (2024)

Cross-Spectral Body Recognition with Side Information Embedding: Benchmarks on LLCM and Analyzing Range-Induced Occlusions on IJB-MDF
by: Nanduri, Anirudh, et al.
Published: (2025)

Stay-Positive: A Case for Ignoring Real Image Features in Fake Image Detection
by: Rajan, Anirudh Sundara, et al.
Published: (2025)

Multi-Domain Biometric Recognition using Body Embeddings
by: Nanduri, Anirudh, et al.
Published: (2025)

PosSAM: Panoptic Open-vocabulary Segment Anything
by: VS, Vibashan, et al.
Published: (2024)

One Identity, Many Roles: Multimodal Entity Coreference for Enhanced Video Situation Recognition
by: Darur, Balaji, et al.
Published: (2026)

Evaluating the Impact of Adversarial Attacks on Traffic Sign Classification using the LISA Dataset
by: Tadessa, Nabeyou, et al.
Published: (2025)

A Computer Vision Hybrid Approach: CNN and Transformer Models for Accurate Alzheimer's Detection from Brain MRI Scans
by: Hoque, Md Mahmudul, et al.
Published: (2026)

Public Health Advocacy Dataset: A Dataset of Tobacco Usage Videos from Social Media
by: Chappa, Naga VS Raviteja, et al.
Published: (2024)

Robustness Analysis on Foundational Segmentation Models
by: Schiappa, Madeline Chantry, et al.
Published: (2023)

Color-$S^{4}L$: Self-supervised Semi-supervised Learning with Image Colorization
by: Chen, Hanxiao
Published: (2024)

Dual-supervised Asymmetric Co-training for Semi-supervised Medical Domain Generalization
by: Song, Jincai, et al.
Published: (2025)

Test-time Conditional Text-to-Image Synthesis Using Diffusion Models
by: Shukla, Tripti, et al.
Published: (2024)

Training-free Color-Style Disentanglement for Constrained Text-to-Image Synthesis
by: Agarwal, Aishwarya, et al.
Published: (2024)

AlignIT: Enhancing Prompt Alignment in Customization of Text-to-Image Models
by: Agarwal, Aishwarya, et al.
Published: (2024)