| Main Authors: | VS, Balaji; AR, Mahi; PS, Anirudh Ganapathy; M, Manju |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2407.01435 |
Similar Items
Interpreting Hand gestures using Object Detection and Digits Classification
by: K, Sangeetha, et al.
Published: (2024)
PICS in Pics: Physics Informed Contour Selection for Rapid Image Segmentation
by: Dwivedi, Vikas, et al.
Published: (2023)
SoGAR: Self-supervised Spatiotemporal Attention-based Social Group Activity Recognition
by: Chappa, Naga VS Raviteja, et al.
Published: (2023)
SegFace: Face Segmentation of Long-Tail Classes
by: Narayan, Kartik, et al.
Published: (2024)
FaceXBench: Evaluating Multimodal LLMs on Face Understanding
by: Narayan, Kartik, et al.
Published: (2025)
Certainty and Uncertainty Guided Active Domain Adaptation
by: Safaei, Bardia, et al.
Published: (2025)
Animalbooth: multimodal feature enhancement for animal subject personalization
by: Liu, Chen, et al.
Published: (2025)
SegFormer Fine-Tuning with Dropout: Advancing Hair Artifact Removal in Skin Lesion Analysis
by: Saad, Asif Mohammed, et al.
Published: (2025)
FaceXFormer: A Unified Transformer for Facial Analysis
by: Narayan, Kartik, et al.
Published: (2024)
A study of animal action segmentation algorithms across supervised, unsupervised, and semi-supervised learning paradigms
by: Blau, Ari, et al.
Published: (2024)
Diversity-enhanced Collaborative Mamba for Semi-supervised Medical Image Segmentation
by: Li, Shumeng, et al.
Published: (2025)
Text2Place: Affordance-aware Text Guided Human Placement
by: Parihar, Rishubh, et al.
Published: (2024)
Compass Control: Multi Object Orientation Control for Text-to-Image Generation
by: Parihar, Rishubh, et al.
Published: (2025)
Hybrid-supervised Hypergraph-enhanced Transformer for Micro-gesture Based Emotion Recognition
by: Xia, Zhaoqiang, et al.
Published: (2025)
Zero-Shot Scene Understanding for Automatic Target Recognition Using Large Vision-Language Models
by: Ranasinghe, Yasiru, et al.
Published: (2025)
VisionTrap: Unanswerable Questions On Visual Data
by: Saadat, Asir, et al.
Published: (2025)
Template-based Multi-Domain Face Recognition
by: Nanduri, Anirudh, et al.
Published: (2024)
FLAASH: Flow-Attention Adaptive Semantic Hierarchical Fusion for Multi-Modal Tobacco Content Analysis
by: Chappa, Naga VS Raviteja, et al.
Published: (2024)
PreciseControl: Enhancing Text-To-Image Diffusion Models with Fine-Grained Attribute Control
by: Parihar, Rishubh, et al.
Published: (2024)
REACT: Recognize Every Action Everywhere All At Once
by: Chappa, Naga VS Raviteja, et al.
Published: (2023)
An Evidential-enhanced Tri-Branch Consistency Learning Method for Semi-supervised Medical Image Segmentation
by: Zhang, Zhenxi, et al.
Published: (2024)
Measuring Train Driver Performance as Key to Approval of Driverless Trains
by: Tagiew, Rustam, et al.
Published: (2025)
UCATSC: Uncertainty-Aware Constrained Traffic Signal Control Under Vision-Based Partial Observability
by: Bodagala, Jayawant, et al.
Published: (2026)
HAtt-Flow: Hierarchical Attention-Flow Mechanism for Group Activity Scene Graph Generation in Videos
by: Chappa, Naga VS Raviteja, et al.
Published: (2023)
CLIP-Fields: Weakly Supervised Semantic Fields for Robotic Memory
by: Shafiullah, Nur Muhammad Mahi, et al.
Published: (2022)
Analysis of Object Detection Models for Tiny Object in Satellite Imagery: A Dataset-Centric Approach
by: PS, Kailas, et al.
Published: (2024)
Cross-Spectral Body Recognition with Side Information Embedding: Benchmarks on LLCM and Analyzing Range-Induced Occlusions on IJB-MDF
by: Nanduri, Anirudh, et al.
Published: (2025)
Stay-Positive: A Case for Ignoring Real Image Features in Fake Image Detection
by: Rajan, Anirudh Sundara, et al.
Published: (2025)
Multi-Domain Biometric Recognition using Body Embeddings
by: Nanduri, Anirudh, et al.
Published: (2025)
PosSAM: Panoptic Open-vocabulary Segment Anything
by: VS, Vibashan, et al.
Published: (2024)
One Identity, Many Roles: Multimodal Entity Coreference for Enhanced Video Situation Recognition
by: Darur, Balaji, et al.
Published: (2026)
Evaluating the Impact of Adversarial Attacks on Traffic Sign Classification using the LISA Dataset
by: Tadessa, Nabeyou, et al.
Published: (2025)
A Computer Vision Hybrid Approach: CNN and Transformer Models for Accurate Alzheimer's Detection from Brain MRI Scans
by: Hoque, Md Mahmudul, et al.
Published: (2026)
Public Health Advocacy Dataset: A Dataset of Tobacco Usage Videos from Social Media
by: Chappa, Naga VS Raviteja, et al.
Published: (2024)
Robustness Analysis on Foundational Segmentation Models
by: Schiappa, Madeline Chantry, et al.
Published: (2023)
Color-$S^{4}L$: Self-supervised Semi-supervised Learning with Image Colorization
by: Chen, Hanxiao
Published: (2024)
Dual-supervised Asymmetric Co-training for Semi-supervised Medical Domain Generalization
by: Song, Jincai, et al.
Published: (2025)
Test-time Conditional Text-to-Image Synthesis Using Diffusion Models
by: Shukla, Tripti, et al.
Published: (2024)
Training-free Color-Style Disentanglement for Constrained Text-to-Image Synthesis
by: Agarwal, Aishwarya, et al.
Published: (2024)
AlignIT: Enhancing Prompt Alignment in Customization of Text-to-Image Models
by: Agarwal, Aishwarya, et al.
Published: (2024)