| Main Authors: | Yun, Sukwon, Peng, Jie, Trevino, Alexandro E., Park, Chanyoung, Chen, Tianlong |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2407.17857 |
Similar Items
I2MoE: Interpretable Multimodal Interaction-aware Mixture-of-Experts
by: Xin, Jiayi, et al.
Published: (2025)
SPATIA: Multimodal Generation and Prediction of Spatial Cell Phenotypes
by: Kong, Zhenglun, et al.
Published: (2025)
Sparse Mixture-of-Experts for Multi-Channel Imaging: Are All Channel Interactions Required?
by: Yun, Sukwon, et al.
Published: (2025)
Semantic Diversity-aware Prototype-based Learning for Unbiased Scene Graph Generation
by: Jeon, Jaehyeong, et al.
Published: (2024)
CompoDistill: Attention Distillation for Compositional Reasoning in Multimodal LLMs
by: Kim, Jiwan, et al.
Published: (2025)
SIMPLOT: Enhancing Chart Question Answering by Distilling Essentials
by: Kim, Wonjoong, et al.
Published: (2024)
Adaptive Self-training Framework for Fine-grained Scene Graph Generation
by: Kim, Kibum, et al.
Published: (2024)
(PASS) Visual Prompt Locates Good Structure Sparsity through a Recurrent HyperNetwork
by: Huang, Tianjin, et al.
Published: (2024)
Unlearning Sensitive Information in Multimodal LLMs: Benchmark and Attack-Defense Evaluation
by: Patil, Vaidehi, et al.
Published: (2025)
LF-ViT: Reducing Spatial Redundancy in Vision Transformer for Efficient Image Recognition
by: Hu, Youbing, et al.
Published: (2024)
Efficient AI-Driven Multi-Section Whole Slide Image Analysis for Biochemical Recurrence Prediction in Prostate Cancer
by: Cho, Yesung, et al.
Published: (2026)
Thickness-aware E(3)-Equivariant 3D Mesh Neural Networks
by: Kim, Sungwon, et al.
Published: (2025)
HYDRA: Hybrid Data Multiplexing and Run-time Layer Configurable DNN Accelerator
by: Kumar, Sonu, et al.
Published: (2024)
Enhancing Multimodal In-Context Learning for Image Classification through Coreset Optimization
by: Chen, Huiyi, et al.
Published: (2025)
FEAST: Fully Connected Expressive Attention for Spatial Transcriptomics
by: Jeong, Taejin, et al.
Published: (2026)
Incentivizing Tool-augmented Thinking with Images for Medical Image Analysis
by: Jiang, Yankai, et al.
Published: (2025)
LLM4SGG: Large Language Models for Weakly Supervised Scene Graph Generation
by: Kim, Kibum, et al.
Published: (2023)
You Only Need One Color Space: An Efficient Network for Low-light Image Enhancement
by: Yan, Qingsen, et al.
Published: (2024)
Empirical Analysis of Anomaly Detection on Hyperspectral Imaging Using Dimension Reduction Methods
by: Kim, Dongeon, et al.
Published: (2024)
Label-Efficient Deep Learning in Medical Image Analysis: Challenges and Future Directions
by: Jin, Cheng, et al.
Published: (2023)
Sampling-Aware 3D Spatial Analysis in Multiplexed Imaging
by: Harlev, Ido, et al.
Published: (2026)
MERIT: Multi-domain Efficient RAW Image Translation
by: Huang, Wenjun, et al.
Published: (2026)
RegionE: Adaptive Region-Aware Generation for Efficient Image Editing
by: Chen, Pengtao, et al.
Published: (2025)
Sim2Real within 5 Minutes: Efficient Domain Transfer with Stylized Gaussian Splatting for Endoscopic Images
by: Wu, Junyang, et al.
Published: (2024)
Memory-Efficient Fine-Tuning Diffusion Transformers via Dynamic Patch Sampling and Block Skipping
by: Park, Sunghyun, et al.
Published: (2026)
PriorNet: A Novel Lightweight Network with Multidimensional Interactive Attention for Efficient Image Dehazing
by: Chen, Yutong, et al.
Published: (2024)
Data Upcycling Knowledge Distillation for Image Super-Resolution
by: Zhang, Yun, et al.
Published: (2023)
DiffusionAgent: Navigating Expert Models for Agentic Image Generation
by: Qin, Jie, et al.
Published: (2024)
QuRe: Query-Relevant Retrieval through Hard Negative Sampling in Composed Image Retrieval
by: Kwak, Jaehyun, et al.
Published: (2025)
AI-Driven Virtual Teacher for Enhanced Educational Efficiency: Leveraging Large Pretrain Models for Autonomous Error Analysis and Correction
by: Xu, Tianlong, et al.
Published: (2024)
RadCLIP: Enhancing Radiologic Image Analysis through Contrastive Language-Image Pre-training
by: Lu, Zhixiu, et al.
Published: (2024)
RoCOCO: Robustness Benchmark of MS-COCO to Stress-test Image-Text Matching Models
by: Park, Seulki, et al.
Published: (2023)
DANet: Enhancing Small Object Detection through an Efficient Deformable Attention Network
by: Mia, Md Sohag, et al.
Published: (2023)
Efficiently Training A Flat Neural Network Before It has been Quantizated
by: Xia, Peng, et al.
Published: (2025)
Flash-DMD: Towards High-Fidelity Few-Step Image Generation with Efficient Distillation and Joint Reinforcement Learning
by: Chen, Guanjie, et al.
Published: (2025)
EdgeSync: Accelerating Edge-Model Updates for Data Drift through Adaptive Continuous Learning
by: Donga, Runchu, et al.
Published: (2025)
Parameter-Efficient Fine-Tuning for Medical Image Analysis: The Missed Opportunity
by: Dutt, Raman, et al.
Published: (2023)
PATHS: A Hierarchical Transformer for Efficient Whole Slide Image Analysis
by: Buzzard, Zak, et al.
Published: (2024)
Understanding the Role of Hallucination in Reinforcement Post-Training of Multimodal Reasoning Models
by: Zhang, Gengwei, et al.
Published: (2026)
Development of Image Collection Method Using YOLO and Siamese Network
by: Shin, Chan Young, et al.
Published: (2024)