:: Library Catalog

Bibliographic Details
Main Authors:	Yun, Sukwon, Peng, Jie, Trevino, Alexandro E., Park, Chanyoung, Chen, Tianlong
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2407.17857

Similar Items

I2MoE: Interpretable Multimodal Interaction-aware Mixture-of-Experts
by: Xin, Jiayi, et al.
Published: (2025)

SPATIA: Multimodal Generation and Prediction of Spatial Cell Phenotypes
by: Kong, Zhenglun, et al.
Published: (2025)

Sparse Mixture-of-Experts for Multi-Channel Imaging: Are All Channel Interactions Required?
by: Yun, Sukwon, et al.
Published: (2025)

Semantic Diversity-aware Prototype-based Learning for Unbiased Scene Graph Generation
by: Jeon, Jaehyeong, et al.
Published: (2024)

CompoDistill: Attention Distillation for Compositional Reasoning in Multimodal LLMs
by: Kim, Jiwan, et al.
Published: (2025)

SIMPLOT: Enhancing Chart Question Answering by Distilling Essentials
by: Kim, Wonjoong, et al.
Published: (2024)

Adaptive Self-training Framework for Fine-grained Scene Graph Generation
by: Kim, Kibum, et al.
Published: (2024)

(PASS) Visual Prompt Locates Good Structure Sparsity through a Recurrent HyperNetwork
by: Huang, Tianjin, et al.
Published: (2024)

Unlearning Sensitive Information in Multimodal LLMs: Benchmark and Attack-Defense Evaluation
by: Patil, Vaidehi, et al.
Published: (2025)

LF-ViT: Reducing Spatial Redundancy in Vision Transformer for Efficient Image Recognition
by: Hu, Youbing, et al.
Published: (2024)

Efficient AI-Driven Multi-Section Whole Slide Image Analysis for Biochemical Recurrence Prediction in Prostate Cancer
by: Cho, Yesung, et al.
Published: (2026)

Thickness-aware E(3)-Equivariant 3D Mesh Neural Networks
by: Kim, Sungwon, et al.
Published: (2025)

HYDRA: Hybrid Data Multiplexing and Run-time Layer Configurable DNN Accelerator
by: Kumar, Sonu, et al.
Published: (2024)

Enhancing Multimodal In-Context Learning for Image Classification through Coreset Optimization
by: Chen, Huiyi, et al.
Published: (2025)

FEAST: Fully Connected Expressive Attention for Spatial Transcriptomics
by: Jeong, Taejin, et al.
Published: (2026)

Incentivizing Tool-augmented Thinking with Images for Medical Image Analysis
by: Jiang, Yankai, et al.
Published: (2025)

LLM4SGG: Large Language Models for Weakly Supervised Scene Graph Generation
by: Kim, Kibum, et al.
Published: (2023)

You Only Need One Color Space: An Efficient Network for Low-light Image Enhancement
by: Yan, Qingsen, et al.
Published: (2024)

Empirical Analysis of Anomaly Detection on Hyperspectral Imaging Using Dimension Reduction Methods
by: Kim, Dongeon, et al.
Published: (2024)

Label-Efficient Deep Learning in Medical Image Analysis: Challenges and Future Directions
by: Jin, Cheng, et al.
Published: (2023)

Sampling-Aware 3D Spatial Analysis in Multiplexed Imaging
by: Harlev, Ido, et al.
Published: (2026)

MERIT: Multi-domain Efficient RAW Image Translation
by: Huang, Wenjun, et al.
Published: (2026)

RegionE: Adaptive Region-Aware Generation for Efficient Image Editing
by: Chen, Pengtao, et al.
Published: (2025)

Sim2Real within 5 Minutes: Efficient Domain Transfer with Stylized Gaussian Splatting for Endoscopic Images
by: Wu, Junyang, et al.
Published: (2024)

Memory-Efficient Fine-Tuning Diffusion Transformers via Dynamic Patch Sampling and Block Skipping
by: Park, Sunghyun, et al.
Published: (2026)

PriorNet: A Novel Lightweight Network with Multidimensional Interactive Attention for Efficient Image Dehazing
by: Chen, Yutong, et al.
Published: (2024)

Data Upcycling Knowledge Distillation for Image Super-Resolution
by: Zhang, Yun, et al.
Published: (2023)

DiffusionAgent: Navigating Expert Models for Agentic Image Generation
by: Qin, Jie, et al.
Published: (2024)

QuRe: Query-Relevant Retrieval through Hard Negative Sampling in Composed Image Retrieval
by: Kwak, Jaehyun, et al.
Published: (2025)

AI-Driven Virtual Teacher for Enhanced Educational Efficiency: Leveraging Large Pretrain Models for Autonomous Error Analysis and Correction
by: Xu, Tianlong, et al.
Published: (2024)

RadCLIP: Enhancing Radiologic Image Analysis through Contrastive Language-Image Pre-training
by: Lu, Zhixiu, et al.
Published: (2024)

RoCOCO: Robustness Benchmark of MS-COCO to Stress-test Image-Text Matching Models
by: Park, Seulki, et al.
Published: (2023)

DANet: Enhancing Small Object Detection through an Efficient Deformable Attention Network
by: Mia, Md Sohag, et al.
Published: (2023)

Efficiently Training A Flat Neural Network Before It has been Quantizated
by: Xia, Peng, et al.
Published: (2025)

Flash-DMD: Towards High-Fidelity Few-Step Image Generation with Efficient Distillation and Joint Reinforcement Learning
by: Chen, Guanjie, et al.
Published: (2025)

EdgeSync: Accelerating Edge-Model Updates for Data Drift through Adaptive Continuous Learning
by: Donga, Runchu, et al.
Published: (2025)

Parameter-Efficient Fine-Tuning for Medical Image Analysis: The Missed Opportunity
by: Dutt, Raman, et al.
Published: (2023)

PATHS: A Hierarchical Transformer for Efficient Whole Slide Image Analysis
by: Buzzard, Zak, et al.
Published: (2024)

Understanding the Role of Hallucination in Reinforcement Post-Training of Multimodal Reasoning Models
by: Zhang, Gengwei, et al.
Published: (2026)

Development of Image Collection Method Using YOLO and Siamese Network
by: Shin, Chan Young, et al.
Published: (2024)