Saved in:
| Main Authors: | Das, Aryan, Biswas, Koushik, Roy, Swalpa Kumar, Patro, Badri Narayana, Verma, Vinay Kumar |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.14514 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Uncertainty-Aware Vision-Language Segmentation for Medical Imaging
by: Das, Aryan, et al.
Published: (2026)
by: Das, Aryan, et al.
Published: (2026)
HyperCap: Hyperspectral Land Cover Captioning Dataset for Vision Language Models
by: Das, Aryan, et al.
Published: (2025)
by: Das, Aryan, et al.
Published: (2025)
Counting Without Numbers and Finding Without Words
by: Patro, Badri Narayana
Published: (2026)
by: Patro, Badri Narayana
Published: (2026)
Reliable or Deceptive? Investigating Gated Features for Smooth Visual Explanations in CNNs
by: Mitra, Soham, et al.
Published: (2024)
by: Mitra, Soham, et al.
Published: (2024)
Mamba-360: Survey of State Space Models as Transformer Alternative for Long Sequence Modelling: Methods, Applications, and Challenges
by: Patro, Badri Narayana, et al.
Published: (2024)
by: Patro, Badri Narayana, et al.
Published: (2024)
SceneMixer: Exploring Convolutional Mixing Networks for Remote Sensing Scene Classification
by: Alkhatib, Mohammed Q., et al.
Published: (2025)
by: Alkhatib, Mohammed Q., et al.
Published: (2025)
Leveraging Task-Specific Knowledge from LLM for Semi-Supervised 3D Medical Image Segmentation
by: Kumari, Suruchi, et al.
Published: (2024)
by: Kumari, Suruchi, et al.
Published: (2024)
MixerSENet: A Lightweight Framework for Efficient Hyperspectral Image Classification
by: Alkhatib, Mohammed Q., et al.
Published: (2026)
by: Alkhatib, Mohammed Q., et al.
Published: (2026)
Convolutional Prompting meets Language Models for Continual Learning
by: Roy, Anurag, et al.
Published: (2024)
by: Roy, Anurag, et al.
Published: (2024)
Heracles: A Hybrid SSM-Transformer Model for High-Resolution Image and Time-Series Analysis
by: Patro, Badri N., et al.
Published: (2024)
by: Patro, Badri N., et al.
Published: (2024)
SlimDiff: Training-Free, Activation-Guided Hands-free Slimming of Diffusion Models
by: Roy, Arani, et al.
Published: (2025)
by: Roy, Arani, et al.
Published: (2025)
HAMSA: Scanning-Free Vision State Space Models via SpectralPulseNet
by: Patro, Badri N., et al.
Published: (2026)
by: Patro, Badri N., et al.
Published: (2026)
NAKUL-Med: Spectral-Graph State Space Models with Dynamics Kernels for Medical Signals
by: Patro, Badri N., et al.
Published: (2026)
by: Patro, Badri N., et al.
Published: (2026)
ShotAdapter: Text-to-Multi-Shot Video Generation with Diffusion Models
by: Kara, Ozgur, et al.
Published: (2025)
by: Kara, Ozgur, et al.
Published: (2025)
How to Learn More? Exploring Kolmogorov-Arnold Networks for Hyperspectral Image Classification
by: Jamali, Ali, et al.
Published: (2024)
by: Jamali, Ali, et al.
Published: (2024)
CURE: Concept Unlearning via Orthogonal Representation Editing in Diffusion Models
by: Biswas, Shristi Das, et al.
Published: (2025)
by: Biswas, Shristi Das, et al.
Published: (2025)
HEART: Hyperspherical Embedding Alignment via Kent-Representation Traversal in Diffusion Models
by: Roy, Arani, et al.
Published: (2026)
by: Roy, Arani, et al.
Published: (2026)
Are Vision xLSTM Embedded UNet More Reliable in Medical 3D Image Segmentation?
by: Dutta, Pallabi, et al.
Published: (2024)
by: Dutta, Pallabi, et al.
Published: (2024)
ADROIT: A Self-Supervised Framework for Learning Robust Representations for Active Learning
by: Banerjee, Soumya, et al.
Published: (2025)
by: Banerjee, Soumya, et al.
Published: (2025)
LLMOrbit: A Circular Taxonomy of Large Language Models -From Scaling Walls to Agentic AI Systems
by: Patro, Badri N., et al.
Published: (2026)
by: Patro, Badri N., et al.
Published: (2026)
SiMBA: Simplified Mamba-Based Architecture for Vision and Multivariate Time series
by: Patro, Badri N., et al.
Published: (2024)
by: Patro, Badri N., et al.
Published: (2024)
Now You See It, Now You Don't - Instant Concept Erasure for Safe Text-to-Image and Video Generation
by: Biswas, Shristi Das, et al.
Published: (2025)
by: Biswas, Shristi Das, et al.
Published: (2025)
Towards Scalable Modeling of Compressed Videos for Efficient Action Recognition
by: Biswas, Shristi Das, et al.
Published: (2025)
by: Biswas, Shristi Das, et al.
Published: (2025)
Spatial Gated Multi-Layer Perceptron for Land Use and Land Cover Mapping
by: Jamali, Ali, et al.
Published: (2023)
by: Jamali, Ali, et al.
Published: (2023)
Motion-Adapter: A Diffusion Model Adapter for Text-to-Motion Generation of Compound Actions
by: Jiang, Yue, et al.
Published: (2026)
by: Jiang, Yue, et al.
Published: (2026)
Scaling Concept With Text-Guided Diffusion Models
by: Huang, Chao, et al.
Published: (2024)
by: Huang, Chao, et al.
Published: (2024)
CAD: Memory Efficient Convolutional Adapter for Segment Anything
by: Kim, Joohyeok, et al.
Published: (2024)
by: Kim, Joohyeok, et al.
Published: (2024)
DiffSTR: Controlled Diffusion Models for Scene Text Removal
by: Pathak, Sanhita, et al.
Published: (2024)
by: Pathak, Sanhita, et al.
Published: (2024)
FOCUS: Forcing In-Context Object Localization through Visual Support Constraints and Policy Optimization
by: Karim, Mohammed Asad, et al.
Published: (2026)
by: Karim, Mohammed Asad, et al.
Published: (2026)
Forecasting formation of a Tropical Cyclone Using Reanalysis Data
by: Kumar, Sandeep, et al.
Published: (2022)
by: Kumar, Sandeep, et al.
Published: (2022)
AttriStory: Fine-grained Attribute Realization for Visual Storytelling with Diffusion Models
by: Sreenivas, Manogna, et al.
Published: (2026)
by: Sreenivas, Manogna, et al.
Published: (2026)
MAGIC: Multimodal Alignment & Grounding-aware Instruction Coreset for Vision-Language Models
by: Biswas, Shristi Das, et al.
Published: (2026)
by: Biswas, Shristi Das, et al.
Published: (2026)
Sharing the Learned Knowledge-base to Estimate Convolutional Filter Parameters for Continual Image Restoration
by: Kar, Aupendu, et al.
Published: (2025)
by: Kar, Aupendu, et al.
Published: (2025)
Multi-dimension Transformer with Attention-based Filtering for Medical Image Segmentation
by: Wang, Wentao, et al.
Published: (2024)
by: Wang, Wentao, et al.
Published: (2024)
TD-RD: A Top-Down Benchmark with Real-Time Framework for Road Damage Detection
by: Xiao, Xi, et al.
Published: (2025)
by: Xiao, Xi, et al.
Published: (2025)
Resource Efficient Perception for Vision Systems
by: Subramanyam, A V, et al.
Published: (2024)
by: Subramanyam, A V, et al.
Published: (2024)
SAM-PTx: Text-Guided Fine-Tuning of SAM with Parameter-Efficient, Parallel-Text Adapters
by: Jalilian, Shayan, et al.
Published: (2025)
by: Jalilian, Shayan, et al.
Published: (2025)
ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models
by: Cheng, Jiaxiang, et al.
Published: (2024)
by: Cheng, Jiaxiang, et al.
Published: (2024)
Rethinking Test Time Scaling for Flow-Matching Generative Models
by: Yu, Qingtao, et al.
Published: (2025)
by: Yu, Qingtao, et al.
Published: (2025)
PEA-Diffusion: Parameter-Efficient Adapter with Knowledge Distillation in non-English Text-to-Image Generation
by: Ma, Jian, et al.
Published: (2023)
by: Ma, Jian, et al.
Published: (2023)
Similar Items
-
Uncertainty-Aware Vision-Language Segmentation for Medical Imaging
by: Das, Aryan, et al.
Published: (2026) -
HyperCap: Hyperspectral Land Cover Captioning Dataset for Vision Language Models
by: Das, Aryan, et al.
Published: (2025) -
Counting Without Numbers and Finding Without Words
by: Patro, Badri Narayana
Published: (2026) -
Reliable or Deceptive? Investigating Gated Features for Smooth Visual Explanations in CNNs
by: Mitra, Soham, et al.
Published: (2024) -
Mamba-360: Survey of State Space Models as Transformer Alternative for Long Sequence Modelling: Methods, Applications, and Challenges
by: Patro, Badri Narayana, et al.
Published: (2024)