| Main Authors: | Wang, Zijia, Yang, Wenbin, Liu, Zhisong, Jia, Zhen |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2405.01175 |
Similar Items
StyleMamba : State Space Model for Efficient Text-driven Image Style Transfer
by: Wang, Zijia, et al.
Published: (2024)
UltraSeP: Sequence-aware Pre-training for Echocardiography Probe Movement Guidance
by: Jiang, Haojun, et al.
Published: (2024)
An Efficient Aerial Image Detection with Variable Receptive Fields
by: Wenbin, Liu
Published: (2025)
MoCa: Modality-aware Continual Pre-training Makes Better Bidirectional Multimodal Embeddings
by: Chen, Haonan, et al.
Published: (2025)
DiTraj: training-free trajectory control for video diffusion transformer
by: Lei, Cheng, et al.
Published: (2025)
Structure-aware World Model for Probe Guidance via Large-scale Self-supervised Pre-train
by: Jiang, Haojun, et al.
Published: (2024)
Multi-view Phase-aware Pedestrian-Vehicle Incident Reasoning Framework with Vision-Language Models
by: Zhen, Hao, et al.
Published: (2025)
CBPNet: A Continual Backpropagation Prompt Network for Alleviating Plasticity Loss on Edge Devices
by: Shao, Runjie, et al.
Published: (2025)
Semantic Discrepancy-aware Detector for Image Forgery Identification
by: Wang, Ziye, et al.
Published: (2025)
UC-NeRF: Uncertainty-aware Conditional Neural Radiance Fields from Endoscopic Sparse Views
by: Guo, Jiaxin, et al.
Published: (2024)
Potential Energy based Mixture Model for Noisy Label Learning
by: Wang, Zijia, et al.
Published: (2024)
Self-training Room Layout Estimation via Geometry-aware Ray-casting
by: Solarte, Bolivar, et al.
Published: (2024)
Self Pre-training with Topology- and Spatiality-aware Masked Autoencoders for 3D Medical Image Segmentation
by: Gu, Pengfei, et al.
Published: (2024)
URMF: Uncertainty-aware Robust Multimodal Fusion for Multimodal Sarcasm Detection
by: Wang, Zhenyu, et al.
Published: (2026)
RoofSeg: An edge-aware transformer-based network for end-to-end roof plane segmentation
by: You, Siyuan, et al.
Published: (2025)
Uncertainty-aware Evidential Fusion-based Learning for Semi-supervised Medical Image Segmentation
by: He, Yuanpeng, et al.
Published: (2024)
Uncertainty-aware Prototype Learning with Variational Inference for Few-shot Point Cloud Segmentation
by: Zhao, Yifei, et al.
Published: (2026)
M$^3$-VQA: A Benchmark for Multimodal, Multi-Entity, Multi-Hop Visual Question Answering
by: Ma, Jiatong, et al.
Published: (2026)
Agentic Knowledgeable Self-awareness
by: Qiao, Shuofei, et al.
Published: (2025)
Self-Supervised Multi-Object Tracking with Path Consistency
by: Lu, Zijia, et al.
Published: (2024)
Uni-Mlip: Unified Self-supervision for Medical Vision Language Pre-training
by: Bawazir, Ameera, et al.
Published: (2024)
Bridge 2D-3D: Uncertainty-aware Hierarchical Registration Network with Domain Alignment
by: Cheng, Zhixin, et al.
Published: (2025)
Uncertainty-aware Long-tailed Weights Model the Utility of Pseudo-labels for Semi-supervised Learning
by: Wu, Jiaqi, et al.
Published: (2025)
The Finer the Better: Towards Granular-aware Open-set Domain Generalization
by: Wang, Yunyun, et al.
Published: (2025)
SCMIL: Sparse Context-aware Multiple Instance Learning for Predicting Cancer Survival Probability Distribution in Whole Slide Images
by: Yang, Zekang, et al.
Published: (2024)
One-shot synthesis of rare gastrointestinal lesions improves diagnostic accuracy and clinical training
by: Yu, Jia, et al.
Published: (2025)
STORM: End-to-End Referring Multi-Object Tracking in Videos
by: Lu, Zijia, et al.
Published: (2026)
MUSE: Model-based Uncertainty-aware Similarity Estimation for zero-shot 2D Object Detection and Segmentation
by: Cho, Sungmin, et al.
Published: (2025)
Uncertainty-aware Efficient Subgraph Isomorphism using Graph Topology
by: Kusari, Arpan, et al.
Published: (2022)
Order-aware Interactive Segmentation
by: Wang, Bin, et al.
Published: (2024)
Attention Mechanism based Cognition-level Scene Understanding
by: Tang, Xuejiao, et al.
Published: (2022)
PersGuard: Preventing Malicious Personalization via Backdoor Attacks on Pre-trained Text-to-Image Diffusion Models
by: Liu, Xinwei, et al.
Published: (2025)
TextEditBench: Evaluating Reasoning-aware Text Editing Beyond Rendering
by: Gui, Rui, et al.
Published: (2025)
ALA: Naturalness-aware Adversarial Lightness Attack
by: Huang, Yihao, et al.
Published: (2022)
Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control
by: Gu, Zekai, et al.
Published: (2025)
DINO-R1: Incentivizing Reasoning Capability in Vision Foundation Models
by: Pan, Chenbin, et al.
Published: (2025)
SDTalk: Structured Facial Priors and Dual-Branch Motion Fields for Generalizable Gaussian Talking Head Synthesis
by: Jia, Peng, et al.
Published: (2026)
QVD: Post-training Quantization for Video Diffusion Models
by: Tian, Shilong, et al.
Published: (2024)
Relation-aware Hierarchical Prompt for Open-vocabulary Scene Graph Generation
by: Liu, Tao, et al.
Published: (2024)
Challenge Summary U-MedSAM: Uncertainty-aware MedSAM for Medical Image Segmentation
by: Wang, Xin, et al.
Published: (2024)