| Main Authors: | Apurba, Md Shifatul Ahsan, Selim, Md, Chen, Jin |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.00901 |
Similar Items
Overcoming the Curvature Bottleneck in MeanFlow
by: Zhang, Xinxi, et al.
Published: (2025)
Understanding, Accelerating, and Improving MeanFlow Training
by: Kim, Jin-Young, et al.
Published: (2025)
MeanFlow Transformers with Representation Autoencoders
by: Hu, Zheyuan, et al.
Published: (2025)
VLM6D: VLM based 6Dof Pose Estimation based on RGB-D Images
by: Sarowar, Md Selim, et al.
Published: (2025)
MFI-ResNet: Efficient ResNet Architecture Optimization via MeanFlow Compression and Selective Incubation
by: Sun, Nuolin, et al.
Published: (2025)
VFM-VLM: Vision Foundation Model and Vision Language Model based Visual Comparison for 3D Pose Estimation
by: Sarowar, Md Selim, et al.
Published: (2025)
Explainable Parkinsons Disease Gait Recognition Using Multimodal RGB-D Fusion and Large Language Models
by: Alnaasan, Manar, et al.
Published: (2025)
GST-VLA: Structured Gaussian Spatial Tokens for 3D Depth-Aware Vision-Language-Action Models
by: Sarowar, Md Selim, et al.
Published: (2026)
An Explainable Vision-Language Model Framework with Adaptive PID-Tversky Loss for Lumbar Spinal Stenosis Diagnosis
by: Sk., Md. Sajeebul Islam, et al.
Published: (2026)
HANS-Net: Hyperbolic Convolution and Adaptive Temporal Attention for Accurate and Generalizable Liver and Tumor Segmentation in CT Imaging
by: Abian, Arefin Ittesafun, et al.
Published: (2025)
ReHARK: Refined Hybrid Adaptive RBF Kernels for Robust One-Shot Vision-Language Adaptation
by: Islam, Md Jahidul
Published: (2026)
Large Language Model with Region-guided Referring and Grounding for CT Report Generation
by: Chen, Zhixuan, et al.
Published: (2024)
CBM-RAG: Demonstrating Enhanced Interpretability in Radiology Report Generation with Multi-Agent RAG and Concept Bottleneck Models
by: Alam, Hasan Md Tusfiqur, et al.
Published: (2025)
RegionE: Adaptive Region-Aware Generation for Efficient Image Editing
by: Chen, Pengtao, et al.
Published: (2025)
ChatAlchemy: AI-enabled Chat Assistant For PharmAlchemy
by: Md Shifatul Ahsan, Apurba
Published: (2025)
Edge-Enhanced Vision Transformer Framework for Accurate AI-Generated Image Detection
by: Das, Dabbrata, et al.
Published: (2025)
C^2DA: Contrastive and Context-aware Domain Adaptive Semantic Segmentation
by: Khan, Md. Al-Masrur, et al.
Published: (2024)
Virtual-Eyes: Quantitative Validation of a Lung CT Quality-Control Pipeline for Foundation-Model Cancer Risk Prediction
by: Hoq, Md. Enamul, et al.
Published: (2025)
SRLoRA: Subspace Recomposition in Low-Rank Adaptation via Importance-Based Fusion and Reinitialization
by: Yang, Haodong, et al.
Published: (2025)
Region-to-Region: Enhancing Generative Image Harmonization with Adaptive Regional Injection
by: Zhang, Zhiqiu, et al.
Published: (2025)
MeanFuser: Fast One-Step Multi-Modal Trajectory Generation and Adaptive Reconstruction via MeanFlow for End-to-End Autonomous Driving
by: Wang, Junli, et al.
Published: (2026)
From Image to Language: A Critical Analysis of Visual Question Answering (VQA) Approaches, Challenges, and Opportunities
by: Ishmam, Md Farhan, et al.
Published: (2023)
AlphaFlow: Understanding and Improving MeanFlow Models
by: Zhang, Huijie, et al.
Published: (2025)
Advancing AI-Powered Medical Image Synthesis: Insights from MedVQA-GI Challenge Using CLIP, Fine-Tuned Stable Diffusion, and Dream-Booth + LoRA
by: Peter, Ojonugwa Oluwafemi Ejiga, et al.
Published: (2025)
PLOT-CT: Pre-log Voronoi Decomposition Assisted Generation for Low-dose CT Reconstruction
by: Huang, Bin, et al.
Published: (2026)
Redefining the Down-Sampling Scheme of U-Net for Precision Biomedical Image Segmentation
by: Li, Mingjie, et al.
Published: (2026)
Deep Learning-Based Segmentation of Peritoneal Cancer Index Regions from CT Imaging
by: Gort, Pieter C., et al.
Published: (2026)
Zero-shot System for Automatic Body Region Detection for Volumetric CT and MR Images
by: Jush, Farnaz Khun, et al.
Published: (2026)
Efficient Flow Matching for Sparse-View CT Reconstruction
by: Shi, Jiayang, et al.
Published: (2026)
Decoupled MeanFlow: Turning Flow Models into Flow Maps for Accelerated Sampling
by: Lee, Kyungmin, et al.
Published: (2025)
MambaLiteUNet: Cross-Gated Adaptive Feature Fusion for Robust Skin Lesion Segmentation
by: Rahman, Md Maklachur, et al.
Published: (2026)
FUSED-Net: Detecting Traffic Signs with Limited Data
by: Rahman, Md. Atiqur, et al.
Published: (2024)
Unsupervised Domain Adaptation for Action Recognition via Self-Ensembling and Conditional Embedding Alignment
by: Ghosh, Indrajeet, et al.
Published: (2024)
MFSR: MeanFlow Distillation for One Step Real-World Image Super Resolution
by: Wang, Ruiqing, et al.
Published: (2026)
Modular Deep Active Learning Framework for Image Annotation: A Technical Report for the Ophthalmo-AI Project
by: Kadir, Md Abdul, et al.
Published: (2024)
CT-Flow: Orchestrating CT Interpretation Workflow with Model Context Protocol Servers
by: Gu, Yannian, et al.
Published: (2026)
MedGemma vs GPT-4: Open-Source and Proprietary Zero-shot Medical Disease Classification from Images
by: Prottasha, Md. Sazzadul Islam, et al.
Published: (2025)
FreqSelect: Frequency-Aware fMRI-to-Image Reconstruction
by: Ye, Junliang, et al.
Published: (2025)
A Tiered GAN Approach for Monet-Style Image Generation
by: Neha, FNU, et al.
Published: (2024)
MPFlow: Multi-modal Posterior-Guided Flow Matching for Zero-Shot MRI Reconstruction
by: Kim, Seunghoi, et al.
Published: (2026)