| Main Authors: | She, Chengying, Chen, Chengwei, Zhang, Xinran, Wang, Ben, Liu, Lizhuang, Shao, Chengwei, Bian, Yun |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2601.20347 |
Similar Items
EfficientMIL: Efficient Linear-Complexity MIL Method for WSI Classification
by: She, Chengying, et al.
Published: (2025)
InfoMatch: Entropy Neural Estimation for Semi-Supervised Image Classification
by: Han, Qi, et al.
Published: (2024)
Enhancing WSI-Based Survival Analysis with Report-Auxiliary Self-Distillation
by: Wang, Zheng, et al.
Published: (2025)
Graph-Driven Multimodal Feature Learning Framework for Apparent Personality Assessment
by: Wang, Kangsheng, et al.
Published: (2025)
PromptAD: Learning Prompts with only Normal Samples for Few-Shot Anomaly Detection
by: Li, Xiaofan, et al.
Published: (2024)
MergeUp-augmented Semi-Weakly Supervised Learning for WSI Classification
by: Ouyang, Mingxi, et al.
Published: (2024)
StyGazeTalk: Learning Stylized Generation of Gaze and Head Dynamics
by: Shi, Chengwei, et al.
Published: (2025)
Adversarial-Guided Diffusion for Multimodal LLM Attacks
by: Xia, Chengwei, et al.
Published: (2025)
Histomorphology-Guided Prototypical Multi-Instance Learning for Breast Cancer WSI Classification
by: Wang, Baizhi, et al.
Published: (2025)
Rethinking Vision Transformer Depth via Structural Reparameterization
by: Zhou, Chengwei, et al.
Published: (2025)
Adaptive Intra-Class Variation Contrastive Learning for Unsupervised Person Re-Identification
by: Liu, Lingzhi, et al.
Published: (2024)
SpecGaussian with Latent Features: A High-quality Modeling of the View-dependent Appearance for 3D Gaussian Splatting
by: Wang, Zhiru, et al.
Published: (2024)
Towards Privacy-preserved Pre-training of Remote Sensing Foundation Models with Federated Mutual-guidance Learning
by: Tan, Jieyi, et al.
Published: (2025)
DualSplat: Robust 3D Gaussian Splatting via Pseudo-Mask Bootstrapping from Reconstruction Failures
by: Wang, Xu, et al.
Published: (2026)
VAT: Vision Action Transformer by Unlocking Full Representation of ViT
by: Li, Wenhao, et al.
Published: (2025)
Weakly Supervised Multimodal Temporal Forgery Localization via Multitask Learning
by: Xu, Wenbo, et al.
Published: (2025)
Hierarchical Consensus Network for Multiview Feature Learning
by: Xia, Chengwei, et al.
Published: (2025)
M$^3$GPT: An Advanced Multimodal, Multitask Framework for Motion Comprehension and Generation
by: Luo, Mingshuang, et al.
Published: (2024)
GAME: Learning Multimodal Interactions via Graph Structures for Personality Trait Estimation
by: Wang, Kangsheng, et al.
Published: (2025)
WSI-LLaVA: A Multimodal Large Language Model for Whole Slide Image
by: Liang, Yuci, et al.
Published: (2024)
GauSTAR: Gaussian Surface Tracking and Reconstruction
by: Zheng, Chengwei, et al.
Published: (2025)
Efficient Human-Object-Interaction (EHOI) Detection via Interaction Label Coding and Conditional Decision
by: Yang, Tsung-Shan, et al.
Published: (2024)
Multitask Multimodal Self-Supervised Learning for Medical Images
by: Simionescu, Cristian
Published: (2025)
MVInverse: Feed-forward Multi-view Inverse Rendering in Seconds
by: Wu, Xiangzuo, et al.
Published: (2025)
Enhancing Implicit Neural Representations via Symmetric Power Transformation
by: Zhang, Weixiang, et al.
Published: (2024)
Promptable Representation Distribution Learning and Data Augmentation for Gigapixel Histopathology WSI Analysis
by: Tang, Kunming, et al.
Published: (2024)
Transformer-based Multimodal Change Detection with Multitask Consistency Constraints
by: Liu, Biyuan, et al.
Published: (2023)
GaMNet: A Hybrid Network with Gabor Fusion and NMamba for Efficient 3D Glioma Segmentation
by: Ye, Chengwei, et al.
Published: (2025)
SU-SAM: A Simple Unified Framework for Adapting Segment Anything Model in Underperformed Scenes
by: Song, Yiran, et al.
Published: (2024)
SuperGS: Consistent and Detailed 3D Super-Resolution Scene Reconstruction via Gaussian Splatting
by: Xie, Shiyun, et al.
Published: (2025)
SuperGS: Super-Resolution 3D Gaussian Splatting Enhanced by Variational Residual Features and Uncertainty-Augmented Learning
by: Xie, Shiyun, et al.
Published: (2024)
HyperPath: Knowledge-Guided Hyperbolic Semantic Hierarchy Modeling for WSI Analysis
by: Huang, Peixiang, et al.
Published: (2025)
Removing Motion Artifact in MRI by Using a Perceptual Loss Driven Deep Learning Framework
by: Guo, Ziheng, et al.
Published: (2026)
MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI
by: Ying, Kaining, et al.
Published: (2024)
Decouple, Reorganize, and Fuse: A Multimodal Framework for Cancer Survival Prediction
by: Wang, Huayi, et al.
Published: (2025)
VideoAfford: Grounding 3D Affordance from Human-Object-Interaction Videos via Multimodal Large Language Model
by: Wang, Hanqing, et al.
Published: (2026)
MMNavAgent: Multi-Magnification WSI Navigation Agent for Clinically Consistent Whole-Slide Analysis
by: Xu, Zhengyang, et al.
Published: (2026)
Diff$^2$I2P: Differentiable Image-to-Point Cloud Registration with Diffusion Prior
by: Mu, Juncheng, et al.
Published: (2025)
EgoM2P: Egocentric Multimodal Multitask Pretraining
by: Li, Gen, et al.
Published: (2025)
Class-Imbalanced Semi-Supervised Learning for Large-Scale Point Cloud Semantic Segmentation via Decoupling Optimization
by: Li, Mengtian, et al.
Published: (2024)