Saved in:
| Main Authors: | Pokala, Praveen Kumar, Patibandla, Jaya Sai Kiran, Pandey, Naveen Kumar, Pailla, Balakrishna Reddy |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2402.00918 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
AQUALLM: Audio Question Answering Data Generation Using Large Language Models
by: Behera, Swarup Ranjan, et al.
Published: (2023)
by: Behera, Swarup Ranjan, et al.
Published: (2023)
SNIFR : Boosting Fine-Grained Child Harmful Content Detection Through Audio-Visual Alignment with Cascaded Cross-Transformer
by: Phukan, Orchid Chetia, et al.
Published: (2025)
by: Phukan, Orchid Chetia, et al.
Published: (2025)
Capsule Endoscopy Multi-classification via Gated Attention and Wavelet Transformations
by: Panchananam, Lakshmi Srinivas, et al.
Published: (2024)
by: Panchananam, Lakshmi Srinivas, et al.
Published: (2024)
Exploring deep learning for Event-Based Saliency Prediction with a Transformer-based model
by: Mazna, Romaric, et al.
Published: (2026)
by: Mazna, Romaric, et al.
Published: (2026)
LTCA: Long-range Temporal Context Attention for Referring Video Object Segmentation
by: Yan, Cilin, et al.
Published: (2025)
by: Yan, Cilin, et al.
Published: (2025)
Diabetic Retinopathy Lesion Segmentation through Attention Mechanisms
by: Jithesh, Aruna, et al.
Published: (2026)
by: Jithesh, Aruna, et al.
Published: (2026)
Multi-Context Temporal Consistent Modeling for Referring Video Object Segmentation
by: Choi, Sun-Hyuk, et al.
Published: (2025)
by: Choi, Sun-Hyuk, et al.
Published: (2025)
Multi-Scale Foreground-Background Confidence for Out-of-Distribution Segmentation
by: Marschall, Samuel, et al.
Published: (2024)
by: Marschall, Samuel, et al.
Published: (2024)
WB LUTs: Contrastive Learning for White Balancing Lookup Tables
by: Manne, Sai Kumar Reddy, et al.
Published: (2024)
by: Manne, Sai Kumar Reddy, et al.
Published: (2024)
Curvature Informed Furthest Point Sampling
by: Bhardwaj, Shubham, et al.
Published: (2024)
by: Bhardwaj, Shubham, et al.
Published: (2024)
Enhancing Pneumonia Diagnosis and Severity Assessment through Deep Learning: A Comprehensive Approach Integrating CNN Classification and Infection Segmentation
by: Mallidi, S Kumar Reddy
Published: (2025)
by: Mallidi, S Kumar Reddy
Published: (2025)
AADNet: Attention aware Demoiréing Network
by: Reddy, M Rakesh, et al.
Published: (2024)
by: Reddy, M Rakesh, et al.
Published: (2024)
Turn-by-Turn Indoor Navigation for the Visually Impaired
by: Srinivasaiah, Santosh, et al.
Published: (2024)
by: Srinivasaiah, Santosh, et al.
Published: (2024)
Self-supervised Video Object Segmentation with Distillation Learning of Deformable Attention
by: Truong, Quang-Trung, et al.
Published: (2024)
by: Truong, Quang-Trung, et al.
Published: (2024)
Spatio-Temporal Attention for Consistent Video Semantic Segmentation in Automated Driving
by: Varghese, Serin, et al.
Published: (2026)
by: Varghese, Serin, et al.
Published: (2026)
Biomechanical-phase based Temporal Segmentation in Sports Videos: a Demonstration on Javelin-Throw
by: Badatya, Bikash Kumar, et al.
Published: (2025)
by: Badatya, Bikash Kumar, et al.
Published: (2025)
Overcoming Small Data Limitations in Video-Based Infant Respiration Estimation
by: Song, Liyang, et al.
Published: (2025)
by: Song, Liyang, et al.
Published: (2025)
Chitranuvad: Adapting Multi-Lingual LLMs for Multimodal Translation
by: Khan, Shaharukh, et al.
Published: (2025)
by: Khan, Shaharukh, et al.
Published: (2025)
Learning to Refocus with Video Diffusion Models
by: Tedla, SaiKiran, et al.
Published: (2025)
by: Tedla, SaiKiran, et al.
Published: (2025)
Estimating Vehicle Speed on Roadways Using RNNs and Transformers: A Video-based Approach
by: Mareddy, Sai Krishna Reddy, et al.
Published: (2025)
by: Mareddy, Sai Krishna Reddy, et al.
Published: (2025)
FOCUS: Towards Universal Foreground Segmentation
by: You, Zuyao, et al.
Published: (2025)
by: You, Zuyao, et al.
Published: (2025)
Language-Guided Temporal Token Pruning for Efficient VideoLLM Processing
by: Kumar, Yogesh
Published: (2025)
by: Kumar, Yogesh
Published: (2025)
Learning Local and Global Temporal Contexts for Video Semantic Segmentation
by: Sun, Guolei, et al.
Published: (2022)
by: Sun, Guolei, et al.
Published: (2022)
Towards Inclusive Face Recognition Through Synthetic Ethnicity Alteration
by: Chandaliya, Praveen Kumar, et al.
Published: (2024)
by: Chandaliya, Praveen Kumar, et al.
Published: (2024)
TexTAR : Textual Attribute Recognition in Multi-domain and Multi-lingual Document Images
by: Kumar, Rohan, et al.
Published: (2025)
by: Kumar, Rohan, et al.
Published: (2025)
UnSegGNet: Unsupervised Image Segmentation using Graph Neural Networks
by: Reddy, Kovvuri Sai Gopal, et al.
Published: (2024)
by: Reddy, Kovvuri Sai Gopal, et al.
Published: (2024)
UnSeGArmaNet: Unsupervised Image Segmentation using Graph Neural Networks with Convolutional ARMA Filters
by: Reddy, Kovvuri Sai Gopal, et al.
Published: (2024)
by: Reddy, Kovvuri Sai Gopal, et al.
Published: (2024)
Embodiment: Self-Supervised Depth Estimation Based on Camera Models
by: Zhang, Jinchang, et al.
Published: (2024)
by: Zhang, Jinchang, et al.
Published: (2024)
Multi-Stain Multi-Level Convolutional Network for Multi-Tissue Breast Cancer Image Segmentation
by: Modi, Akash, et al.
Published: (2024)
by: Modi, Akash, et al.
Published: (2024)
Joint Flow And Feature Refinement Using Attention For Video Restoration
by: Merugu, Ranjith, et al.
Published: (2025)
by: Merugu, Ranjith, et al.
Published: (2025)
Point Tracking as a Temporal Cue for Robust Myocardial Segmentation in Echocardiography Videos
by: Khodabakhshian, Bahar, et al.
Published: (2026)
by: Khodabakhshian, Bahar, et al.
Published: (2026)
Foreground-Covering Prototype Generation and Matching for SAM-Aided Few-Shot Segmentation
by: Park, Suho, et al.
Published: (2025)
by: Park, Suho, et al.
Published: (2025)
Chitrarth: Bridging Vision and Language for a Billion People
by: Khan, Shaharukh, et al.
Published: (2025)
by: Khan, Shaharukh, et al.
Published: (2025)
Multi-dimension Transformer with Attention-based Filtering for Medical Image Segmentation
by: Wang, Wentao, et al.
Published: (2024)
by: Wang, Wentao, et al.
Published: (2024)
SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic Segmentation
by: Fu, Yunxiang, et al.
Published: (2024)
by: Fu, Yunxiang, et al.
Published: (2024)
Sparsity-Aware Voxel Attention and Foreground Modulation for 3D Semantic Scene Completion
by: Xue, Yu, et al.
Published: (2026)
by: Xue, Yu, et al.
Published: (2026)
CEM-FBGTinyDet: Context-Enhanced Foreground Balance with Gradient Tuning for tiny Objects
by: Liu, Tao, et al.
Published: (2025)
by: Liu, Tao, et al.
Published: (2025)
Certified vs. Empirical Adversarial Robust-ness via Hybrid Convolutions with Attention Stochasticity
by: Dhar, Joy, et al.
Published: (2026)
by: Dhar, Joy, et al.
Published: (2026)
Modelling Visual Semantics via Image Captioning to extract Enhanced Multi-Level Cross-Modal Semantic Incongruity Representation with Attention for Multimodal Sarcasm Detection
by: Aggarwal, Sajal, et al.
Published: (2024)
by: Aggarwal, Sajal, et al.
Published: (2024)
Robust Foreground-Background Separation for Severely-Degraded Videos Using Convolutional Sparse Representation Modeling
by: Naganuma, Kazuki, et al.
Published: (2025)
by: Naganuma, Kazuki, et al.
Published: (2025)
Similar Items
-
AQUALLM: Audio Question Answering Data Generation Using Large Language Models
by: Behera, Swarup Ranjan, et al.
Published: (2023) -
SNIFR : Boosting Fine-Grained Child Harmful Content Detection Through Audio-Visual Alignment with Cascaded Cross-Transformer
by: Phukan, Orchid Chetia, et al.
Published: (2025) -
Capsule Endoscopy Multi-classification via Gated Attention and Wavelet Transformations
by: Panchananam, Lakshmi Srinivas, et al.
Published: (2024) -
Exploring deep learning for Event-Based Saliency Prediction with a Transformer-based model
by: Mazna, Romaric, et al.
Published: (2026) -
LTCA: Long-range Temporal Context Attention for Referring Video Object Segmentation
by: Yan, Cilin, et al.
Published: (2025)