Saved in:
| Main Authors: | Nasiri-Sarvi, Ali, Nguyen, Anh Tien, Rivaz, Hassan, Samaras, Dimitris, Hosseini, Mahdi S. |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.12403 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
SPARC: Concept-Aligned Sparse Autoencoders for Cross-Model and Cross-Modal Interpretability
by: Nasiri-Sarvi, Ali, et al.
Published: (2025)
by: Nasiri-Sarvi, Ali, et al.
Published: (2025)
Vision Mamba for Classification of Breast Ultrasound Images
by: Nasiri-Sarvi, Ali, et al.
Published: (2024)
by: Nasiri-Sarvi, Ali, et al.
Published: (2024)
Vim4Path: Self-Supervised Vision Mamba for Histopathology Images
by: Nasiri-Sarvi, Ali, et al.
Published: (2024)
by: Nasiri-Sarvi, Ali, et al.
Published: (2024)
Ultrasound Image Generation using Latent Diffusion Models
by: Freiche, Benoit, et al.
Published: (2025)
by: Freiche, Benoit, et al.
Published: (2025)
2DMamba: Efficient State Space Model for Image Representation with Applications on Giga-Pixel Whole Slide Image Classification
by: Zhang, Jingwei, et al.
Published: (2024)
by: Zhang, Jingwei, et al.
Published: (2024)
LBMamba: Locally Bi-directional Mamba
by: Zhang, Jingwei, et al.
Published: (2025)
by: Zhang, Jingwei, et al.
Published: (2025)
CLIP-SVD: Efficient and Interpretable Vision-Language Adaptation via Singular Values
by: Koleilat, Taha, et al.
Published: (2025)
by: Koleilat, Taha, et al.
Published: (2025)
Self-supervised co-salient object detection via feature correspondence at multiple scales
by: Chakraborty, Souradeep, et al.
Published: (2024)
by: Chakraborty, Souradeep, et al.
Published: (2024)
Vision Transformer for Classification of Breast Ultrasound Images
by: Gheflati, Behnaz, et al.
Published: (2021)
by: Gheflati, Behnaz, et al.
Published: (2021)
What about gravity in video generation? Post-Training Newton's Laws with Verifiable Rewards
by: Le, Minh-Quan, et al.
Published: (2025)
by: Le, Minh-Quan, et al.
Published: (2025)
Comparative Analysis of Diffusion Generative Models in Computational Pathology
by: Thakkar, Denisha, et al.
Published: (2024)
by: Thakkar, Denisha, et al.
Published: (2024)
TopoDiffusionNet: A Topology-aware Diffusion Model
by: Gupta, Saumya, et al.
Published: (2024)
by: Gupta, Saumya, et al.
Published: (2024)
Weighting Pseudo-Labels via High-Activation Feature Index Similarity and Object Detection for Semi-Supervised Segmentation
by: Howlader, Prantik, et al.
Published: (2024)
by: Howlader, Prantik, et al.
Published: (2024)
Assessing Sample Quality via the Latent Space of Generative Models
by: Xu, Jingyi, et al.
Published: (2024)
by: Xu, Jingyi, et al.
Published: (2024)
Evi-Steer: Learning to Steer Biomedical Vision-Language Models through Efficient and Generalizable Evidential Tuning
by: Koleilat, Taha, et al.
Published: (2026)
by: Koleilat, Taha, et al.
Published: (2026)
MI-NeRF: Learning a Single Face NeRF from Multiple Identities
by: Chatziagapi, Aggelina, et al.
Published: (2024)
by: Chatziagapi, Aggelina, et al.
Published: (2024)
MIGS: Multi-Identity Gaussian Splatting via Tensor Decomposition
by: Chatziagapi, Aggelina, et al.
Published: (2024)
by: Chatziagapi, Aggelina, et al.
Published: (2024)
VLEER: Vision and Language Embeddings for Explainable Whole Slide Image Representation
by: Nguyen, Anh Tien, et al.
Published: (2025)
by: Nguyen, Anh Tien, et al.
Published: (2025)
Phrase-Instance Alignment for Generalized Referring Segmentation
by: Nguyen, E-Ro, et al.
Published: (2024)
by: Nguyen, E-Ro, et al.
Published: (2024)
Fast constrained sampling in pre-trained diffusion models
by: Graikos, Alexandros, et al.
Published: (2024)
by: Graikos, Alexandros, et al.
Published: (2024)
Grounding DINO-US-SAM: Text-Prompted Multi-Organ Segmentation in Ultrasound with LoRA-Tuned Vision-Language Models
by: Rasaee, Hamza, et al.
Published: (2025)
by: Rasaee, Hamza, et al.
Published: (2025)
Learning 3D Reconstruction with Priors in Test Time
by: Zhou, Lei, et al.
Published: (2026)
by: Zhou, Lei, et al.
Published: (2026)
Beyond Pixels: Semi-Supervised Semantic Segmentation with a Multi-scale Patch-based Multi-Label Classifier
by: Howlader, Prantik, et al.
Published: (2024)
by: Howlader, Prantik, et al.
Published: (2024)
Importance-Based Token Merging for Efficient Image and Video Generation
by: Wu, Haoyu, et al.
Published: (2024)
by: Wu, Haoyu, et al.
Published: (2024)
JEAN: Joint Expression and Audio-guided NeRF-based Talking Face Generation
by: Chakkera, Sai Tanmay Reddy, et al.
Published: (2024)
by: Chakkera, Sai Tanmay Reddy, et al.
Published: (2024)
PISCES: Annotation-free Text-to-Video Post-Training via Optimal Transport-Aligned Rewards
by: Le, Minh-Quan, et al.
Published: (2026)
by: Le, Minh-Quan, et al.
Published: (2026)
Improving Contrastive Learning for Referring Expression Counting
by: Triaridis, Kostas, et al.
Published: (2025)
by: Triaridis, Kostas, et al.
Published: (2025)
CORA: Consistency-Guided Semi-Supervised Framework for Reasoning Segmentation
by: Howlader, Prantik, et al.
Published: (2025)
by: Howlader, Prantik, et al.
Published: (2025)
Reliability of deep learning models for anatomical landmark detection: The role of inter-rater variability
by: Salari, Soorena, et al.
Published: (2024)
by: Salari, Soorena, et al.
Published: (2024)
Rig3DGS: Creating Controllable Portraits from Casual Monocular Videos
by: Rivero, Alfredo, et al.
Published: (2024)
by: Rivero, Alfredo, et al.
Published: (2024)
Talking Head Generation via AU-Guided Landmark Prediction
by: Chang, Shao-Yu, et al.
Published: (2025)
by: Chang, Shao-Yu, et al.
Published: (2025)
PathSegDiff: Pathology Segmentation using Diffusion model representations
by: Danisetty, Sachin Kumar, et al.
Published: (2025)
by: Danisetty, Sachin Kumar, et al.
Published: (2025)
MedCLIP-SAMv2: Towards Universal Text-Driven Medical Image Segmentation
by: Koleilat, Taha, et al.
Published: (2024)
by: Koleilat, Taha, et al.
Published: (2024)
BiomedCoOp: Learning to Prompt for Biomedical Vision-Language Models
by: Koleilat, Taha, et al.
Published: (2024)
by: Koleilat, Taha, et al.
Published: (2024)
Training Deep Visual Networks Beyond Loss and Accuracy Through a Dynamical Systems Approach
by: La Quang, Hai, et al.
Published: (2026)
by: La Quang, Hai, et al.
Published: (2026)
Explainable AI and susceptibility to adversarial attacks: a case study in classification of breast ultrasound images
by: Rasaee, Hamza, et al.
Published: (2021)
by: Rasaee, Hamza, et al.
Published: (2021)
One Attention, One Scale: Phase-Aligned Rotary Positional Embeddings for Mixed-Resolution Diffusion Transformer
by: Wu, Haoyu, et al.
Published: (2025)
by: Wu, Haoyu, et al.
Published: (2025)
Embedding Physical Reasoning into Diffusion-Based Shadow Generation
by: Hu, Shilin, et al.
Published: (2025)
by: Hu, Shilin, et al.
Published: (2025)
Cast and Attached Shadow Detection via Iterative Light and Geometry Reasoning
by: Hu, Shilin, et al.
Published: (2025)
by: Hu, Shilin, et al.
Published: (2025)
Efficient INT8 Single-Image Super-Resolution via Deployment-Aware Quantization and Teacher-Guided Training
by: Nguyen, Pham Phuong Nam, et al.
Published: (2026)
by: Nguyen, Pham Phuong Nam, et al.
Published: (2026)
Similar Items
-
SPARC: Concept-Aligned Sparse Autoencoders for Cross-Model and Cross-Modal Interpretability
by: Nasiri-Sarvi, Ali, et al.
Published: (2025) -
Vision Mamba for Classification of Breast Ultrasound Images
by: Nasiri-Sarvi, Ali, et al.
Published: (2024) -
Vim4Path: Self-Supervised Vision Mamba for Histopathology Images
by: Nasiri-Sarvi, Ali, et al.
Published: (2024) -
Ultrasound Image Generation using Latent Diffusion Models
by: Freiche, Benoit, et al.
Published: (2025) -
2DMamba: Efficient State Space Model for Image Representation with Applications on Giga-Pixel Whole Slide Image Classification
by: Zhang, Jingwei, et al.
Published: (2024)