:: Library Catalog

Bibliographic Details
Main Authors:	Pokala, Praveen Kumar; Patibandla, Jaya Sai Kiran; Pandey, Naveen Kumar; Pailla, Balakrishna Reddy
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2402.00918

Similar Items

AQUALLM: Audio Question Answering Data Generation Using Large Language Models
by: Behera, Swarup Ranjan, et al.
Published: (2023)

SNIFR : Boosting Fine-Grained Child Harmful Content Detection Through Audio-Visual Alignment with Cascaded Cross-Transformer
by: Phukan, Orchid Chetia, et al.
Published: (2025)

Capsule Endoscopy Multi-classification via Gated Attention and Wavelet Transformations
by: Panchananam, Lakshmi Srinivas, et al.
Published: (2024)

Exploring deep learning for Event-Based Saliency Prediction with a Transformer-based model
by: Mazna, Romaric, et al.
Published: (2026)

LTCA: Long-range Temporal Context Attention for Referring Video Object Segmentation
by: Yan, Cilin, et al.
Published: (2025)

Diabetic Retinopathy Lesion Segmentation through Attention Mechanisms
by: Jithesh, Aruna, et al.
Published: (2026)

Multi-Context Temporal Consistent Modeling for Referring Video Object Segmentation
by: Choi, Sun-Hyuk, et al.
Published: (2025)

Multi-Scale Foreground-Background Confidence for Out-of-Distribution Segmentation
by: Marschall, Samuel, et al.
Published: (2024)

WB LUTs: Contrastive Learning for White Balancing Lookup Tables
by: Manne, Sai Kumar Reddy, et al.
Published: (2024)

Curvature Informed Furthest Point Sampling
by: Bhardwaj, Shubham, et al.
Published: (2024)

Enhancing Pneumonia Diagnosis and Severity Assessment through Deep Learning: A Comprehensive Approach Integrating CNN Classification and Infection Segmentation
by: Mallidi, S Kumar Reddy
Published: (2025)

AADNet: Attention aware Demoiréing Network
by: Reddy, M Rakesh, et al.
Published: (2024)

Turn-by-Turn Indoor Navigation for the Visually Impaired
by: Srinivasaiah, Santosh, et al.
Published: (2024)

Self-supervised Video Object Segmentation with Distillation Learning of Deformable Attention
by: Truong, Quang-Trung, et al.
Published: (2024)

Spatio-Temporal Attention for Consistent Video Semantic Segmentation in Automated Driving
by: Varghese, Serin, et al.
Published: (2026)

Biomechanical-phase based Temporal Segmentation in Sports Videos: a Demonstration on Javelin-Throw
by: Badatya, Bikash Kumar, et al.
Published: (2025)

Overcoming Small Data Limitations in Video-Based Infant Respiration Estimation
by: Song, Liyang, et al.
Published: (2025)

Chitranuvad: Adapting Multi-Lingual LLMs for Multimodal Translation
by: Khan, Shaharukh, et al.
Published: (2025)

Learning to Refocus with Video Diffusion Models
by: Tedla, SaiKiran, et al.
Published: (2025)

Estimating Vehicle Speed on Roadways Using RNNs and Transformers: A Video-based Approach
by: Mareddy, Sai Krishna Reddy, et al.
Published: (2025)

FOCUS: Towards Universal Foreground Segmentation
by: You, Zuyao, et al.
Published: (2025)

Language-Guided Temporal Token Pruning for Efficient VideoLLM Processing
by: Kumar, Yogesh
Published: (2025)

Learning Local and Global Temporal Contexts for Video Semantic Segmentation
by: Sun, Guolei, et al.
Published: (2022)

Towards Inclusive Face Recognition Through Synthetic Ethnicity Alteration
by: Chandaliya, Praveen Kumar, et al.
Published: (2024)

TexTAR : Textual Attribute Recognition in Multi-domain and Multi-lingual Document Images
by: Kumar, Rohan, et al.
Published: (2025)

UnSegGNet: Unsupervised Image Segmentation using Graph Neural Networks
by: Reddy, Kovvuri Sai Gopal, et al.
Published: (2024)

UnSeGArmaNet: Unsupervised Image Segmentation using Graph Neural Networks with Convolutional ARMA Filters
by: Reddy, Kovvuri Sai Gopal, et al.
Published: (2024)

Embodiment: Self-Supervised Depth Estimation Based on Camera Models
by: Zhang, Jinchang, et al.
Published: (2024)

Multi-Stain Multi-Level Convolutional Network for Multi-Tissue Breast Cancer Image Segmentation
by: Modi, Akash, et al.
Published: (2024)

Joint Flow And Feature Refinement Using Attention For Video Restoration
by: Merugu, Ranjith, et al.
Published: (2025)

Point Tracking as a Temporal Cue for Robust Myocardial Segmentation in Echocardiography Videos
by: Khodabakhshian, Bahar, et al.
Published: (2026)

Foreground-Covering Prototype Generation and Matching for SAM-Aided Few-Shot Segmentation
by: Park, Suho, et al.
Published: (2025)

Chitrarth: Bridging Vision and Language for a Billion People
by: Khan, Shaharukh, et al.
Published: (2025)

Multi-dimension Transformer with Attention-based Filtering for Medical Image Segmentation
by: Wang, Wentao, et al.
Published: (2024)

SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic Segmentation
by: Fu, Yunxiang, et al.
Published: (2024)

Sparsity-Aware Voxel Attention and Foreground Modulation for 3D Semantic Scene Completion
by: Xue, Yu, et al.
Published: (2026)

CEM-FBGTinyDet: Context-Enhanced Foreground Balance with Gradient Tuning for tiny Objects
by: Liu, Tao, et al.
Published: (2025)

Certified vs. Empirical Adversarial Robustness via Hybrid Convolutions with Attention Stochasticity
by: Dhar, Joy, et al.
Published: (2026)

Modelling Visual Semantics via Image Captioning to extract Enhanced Multi-Level Cross-Modal Semantic Incongruity Representation with Attention for Multimodal Sarcasm Detection
by: Aggarwal, Sajal, et al.
Published: (2024)

Robust Foreground-Background Separation for Severely-Degraded Videos Using Convolutional Sparse Representation Modeling
by: Naganuma, Kazuki, et al.
Published: (2025)