| Main Authors: | Chen, Xiwen, Zhu, Wenhui, Qiu, Peijie, Razi, Abolfazl |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2405.02944 |
Similar Items
DGR-MIL: Exploring Diverse Global Representation in Multiple Instance Learning for Whole Slide Image Classification
by: Zhu, Wenhui, et al.
Published: (2024)
SelfReg-UNet: Self-Regularized UNet for Medical Image Segmentation
by: Zhu, Wenhui, et al.
Published: (2024)
Fast 2DGS: Efficient Image Representation with Deep Gaussian Prior
by: Wang, Hao, et al.
Published: (2025)
How Effective Can Dropout Be in Multiple Instance Learning ?
by: Zhu, Wenhui, et al.
Published: (2025)
RBAD: A Dataset and Benchmark for Retinal Vessels Branching Angle Detection
by: Wang, Hao, et al.
Published: (2024)
Toward Effective Reinforcement Learning Fine-Tuning for Medical VQA in Vision-Language Models
by: Zhu, Wenhui, et al.
Published: (2025)
Prompt-OT: An Optimal Transport Regularization Paradigm for Knowledge Preservation in Vision-Language Model Adaptation
by: Chen, Xiwen, et al.
Published: (2025)
Enhancing Digital Hologram Reconstruction Using Reverse-Attention Loss for Untrained Physics-Driven Deep Learning Models with Uncertain Distance
by: Chen, Xiwen, et al.
Published: (2024)
Multimodal Variational Autoencoder: a Barycentric View
by: Qiu, Peijie, et al.
Published: (2024)
Cracking Instance Jigsaw Puzzles: An Alternative to Multiple Instance Learning for Whole Slide Image Analysis
by: Chen, Xiwen, et al.
Published: (2025)
FLAME Diffuser: Wildfire Image Synthesis using Mask Guided Diffusion
by: Wang, Hao, et al.
Published: (2024)
Many-MobileNet: Multi-Model Augmentation for Robust Retinal Disease Classification
by: Wang, Hao, et al.
Published: (2024)
OTPrune: Distribution-Aligned Visual Token Pruning via Optimal Transport
by: Chen, Xiwen, et al.
Published: (2026)
PDL: Regularizing Multiple Instance Learning with Progressive Dropout Layers
by: Zhu, Wenhui, et al.
Published: (2023)
LLaDA-MedV: Exploring Large Language Diffusion Models for Biomedical Image Understanding
by: Dong, Xuanzhao, et al.
Published: (2025)
Diffusion Prism: Enhancing Diversity and Morphology Consistency in Mask-to-Image Diffusion
by: Wang, Hao, et al.
Published: (2025)
STA-Unet: Rethink the semantic redundant for Medical Imaging Segmentation
by: Vasa, Vamsi Krishna, et al.
Published: (2024)
CUNSB-RFIE: Context-aware Unpaired Neural Schrödinger Bridge in Retinal Fundus Image Enhancement
by: Dong, Xuanzhao, et al.
Published: (2024)
AtomDiffuser: Time-Aware Degradation Modeling for Drift and Beam Damage in STEM Imaging
by: Wang, Hao, et al.
Published: (2025)
Motor Focus: Fast Ego-Motion Prediction for Assistive Visual Navigation
by: Wang, Hao, et al.
Published: (2024)
nnMobileNet: Rethinking CNN for Retinopathy Research
by: Zhu, Wenhui, et al.
Published: (2023)
VisionGPT: LLM-Assisted Real-Time Anomaly Detection for Safe Visual Navigation
by: Wang, Hao, et al.
Published: (2024)
SC-MIL: Sparsely Coded Multiple Instance Learning for Whole Slide Image Classification
by: Qiu, Peijie, et al.
Published: (2023)
A BERT-Style Self-Supervised Learning CNN for Disease Identification from Retinal Images
by: Li, Xin, et al.
Published: (2025)
Context-Aware Optimal Transport Learning for Retinal Fundus Image Enhancement
by: Vasa, Vamsi Krishna, et al.
Published: (2024)
Schrödinger Diffusion Driven Signal Recovery in 3T BOLD fMRI Using Unmatched 7T Observations
by: Xiong, Yujian, et al.
Published: (2025)
EyeBench: A Call for More Rigorous Evaluation of Retinal Image Enhancement
by: Zhu, Wenhui, et al.
Published: (2025)
RetinalGPT: A Retinal Clinical Preference Conversational Assistant Powered by Large Vision-Language Models
by: Zhu, Wenhui, et al.
Published: (2025)
Mags-RL: Wearing Multimodal LLMs a Magnifying Glass via Agentic Reinforcement Learning For Complex Scene Reasoning
by: Dong, Xuanzhao, et al.
Published: (2026)
Real-World Scene Recovery for Scattering-Degraded Images Using Spatial and Frequency Priors
by: Liu, Yun, et al.
Published: (2025)
Don't Waste Bits! Adaptive KV-Cache Quantization for Lightweight On-Device LLMs
by: Boroujeni, Sayed Pedram Haeri, et al.
Published: (2026)
All You Need for Object Detection: From Pixels, Points, and Prompts to Next-Gen Fusion and Multimodal LLMs/VLMs in Autonomous Vehicles
by: Boroujeni, Sayed Pedram Haeri, et al.
Published: (2025)
Enhanced Cooperative Perception for Autonomous Vehicles Using Imperfect Communication
by: Sarlak, Ahmad, et al.
Published: (2024)
RobustFormer: Noise-Robust Pre-training for images and videos
by: Bastola, Ashish, et al.
Published: (2024)
D2-MLP: Dynamic Decomposed MLP Mixer for Medical Image Segmentation
by: Yang, Jin, et al.
Published: (2024)
Using Powerful Prior Knowledge of Diffusion Model in Deep Unfolding Networks for Image Compressive Sensing
by: Liao, Chen, et al.
Published: (2025)
Comparative Analysis of Patch Attack on VLM-Based Autonomous Driving Architectures
by: Fernandez, David, et al.
Published: (2026)
Encoding Semantic Priors into the Weights of Implicit Neural Representation
by: Cai, Zhicheng, et al.
Published: (2024)
ImageRAGTurbo: Towards One-step Text-to-Image Generation with Retrieval-Augmented Diffusion Models
by: Qiu, Peijie, et al.
Published: (2026)
Training Convolutional Neural Networks with the Forward-Forward algorithm
by: Scodellaro, Riccardo, et al.
Published: (2023)