:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Das, Aryan, Biswas, Koushik, Roy, Swalpa Kumar, Patro, Badri Narayana, Verma, Vinay Kumar
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2602.14514
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Uncertainty-Aware Vision-Language Segmentation for Medical Imaging
by: Das, Aryan, et al.
Published: (2026)

HyperCap: Hyperspectral Land Cover Captioning Dataset for Vision Language Models
by: Das, Aryan, et al.
Published: (2025)

Counting Without Numbers and Finding Without Words
by: Patro, Badri Narayana
Published: (2026)

Reliable or Deceptive? Investigating Gated Features for Smooth Visual Explanations in CNNs
by: Mitra, Soham, et al.
Published: (2024)

Mamba-360: Survey of State Space Models as Transformer Alternative for Long Sequence Modelling: Methods, Applications, and Challenges
by: Patro, Badri Narayana, et al.
Published: (2024)

SceneMixer: Exploring Convolutional Mixing Networks for Remote Sensing Scene Classification
by: Alkhatib, Mohammed Q., et al.
Published: (2025)

Leveraging Task-Specific Knowledge from LLM for Semi-Supervised 3D Medical Image Segmentation
by: Kumari, Suruchi, et al.
Published: (2024)

MixerSENet: A Lightweight Framework for Efficient Hyperspectral Image Classification
by: Alkhatib, Mohammed Q., et al.
Published: (2026)

Convolutional Prompting meets Language Models for Continual Learning
by: Roy, Anurag, et al.
Published: (2024)

Heracles: A Hybrid SSM-Transformer Model for High-Resolution Image and Time-Series Analysis
by: Patro, Badri N., et al.
Published: (2024)

SlimDiff: Training-Free, Activation-Guided Hands-free Slimming of Diffusion Models
by: Roy, Arani, et al.
Published: (2025)

HAMSA: Scanning-Free Vision State Space Models via SpectralPulseNet
by: Patro, Badri N., et al.
Published: (2026)

NAKUL-Med: Spectral-Graph State Space Models with Dynamics Kernels for Medical Signals
by: Patro, Badri N., et al.
Published: (2026)

ShotAdapter: Text-to-Multi-Shot Video Generation with Diffusion Models
by: Kara, Ozgur, et al.
Published: (2025)

How to Learn More? Exploring Kolmogorov-Arnold Networks for Hyperspectral Image Classification
by: Jamali, Ali, et al.
Published: (2024)

CURE: Concept Unlearning via Orthogonal Representation Editing in Diffusion Models
by: Biswas, Shristi Das, et al.
Published: (2025)

HEART: Hyperspherical Embedding Alignment via Kent-Representation Traversal in Diffusion Models
by: Roy, Arani, et al.
Published: (2026)

Are Vision xLSTM Embedded UNet More Reliable in Medical 3D Image Segmentation?
by: Dutta, Pallabi, et al.
Published: (2024)

ADROIT: A Self-Supervised Framework for Learning Robust Representations for Active Learning
by: Banerjee, Soumya, et al.
Published: (2025)

LLMOrbit: A Circular Taxonomy of Large Language Models -From Scaling Walls to Agentic AI Systems
by: Patro, Badri N., et al.
Published: (2026)

SiMBA: Simplified Mamba-Based Architecture for Vision and Multivariate Time series
by: Patro, Badri N., et al.
Published: (2024)

Now You See It, Now You Don't - Instant Concept Erasure for Safe Text-to-Image and Video Generation
by: Biswas, Shristi Das, et al.
Published: (2025)

Towards Scalable Modeling of Compressed Videos for Efficient Action Recognition
by: Biswas, Shristi Das, et al.
Published: (2025)

Spatial Gated Multi-Layer Perceptron for Land Use and Land Cover Mapping
by: Jamali, Ali, et al.
Published: (2023)

Motion-Adapter: A Diffusion Model Adapter for Text-to-Motion Generation of Compound Actions
by: Jiang, Yue, et al.
Published: (2026)

Scaling Concept With Text-Guided Diffusion Models
by: Huang, Chao, et al.
Published: (2024)

CAD: Memory Efficient Convolutional Adapter for Segment Anything
by: Kim, Joohyeok, et al.
Published: (2024)

DiffSTR: Controlled Diffusion Models for Scene Text Removal
by: Pathak, Sanhita, et al.
Published: (2024)

FOCUS: Forcing In-Context Object Localization through Visual Support Constraints and Policy Optimization
by: Karim, Mohammed Asad, et al.
Published: (2026)

Forecasting formation of a Tropical Cyclone Using Reanalysis Data
by: Kumar, Sandeep, et al.
Published: (2022)

AttriStory: Fine-grained Attribute Realization for Visual Storytelling with Diffusion Models
by: Sreenivas, Manogna, et al.
Published: (2026)

MAGIC: Multimodal Alignment & Grounding-aware Instruction Coreset for Vision-Language Models
by: Biswas, Shristi Das, et al.
Published: (2026)

Sharing the Learned Knowledge-base to Estimate Convolutional Filter Parameters for Continual Image Restoration
by: Kar, Aupendu, et al.
Published: (2025)

Multi-dimension Transformer with Attention-based Filtering for Medical Image Segmentation
by: Wang, Wentao, et al.
Published: (2024)

TD-RD: A Top-Down Benchmark with Real-Time Framework for Road Damage Detection
by: Xiao, Xi, et al.
Published: (2025)

Resource Efficient Perception for Vision Systems
by: Subramanyam, A V, et al.
Published: (2024)

SAM-PTx: Text-Guided Fine-Tuning of SAM with Parameter-Efficient, Parallel-Text Adapters
by: Jalilian, Shayan, et al.
Published: (2025)

ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models
by: Cheng, Jiaxiang, et al.
Published: (2024)

Rethinking Test Time Scaling for Flow-Matching Generative Models
by: Yu, Qingtao, et al.
Published: (2025)

PEA-Diffusion: Parameter-Efficient Adapter with Knowledge Distillation in non-English Text-to-Image Generation
by: Ma, Jian, et al.
Published: (2023)