Saved in:
| Main Authors: | Zhang, Wentian, Liu, Haozhe, Li, Bing, Xie, Jinheng, Huang, Yawen, Li, Yuexiang, Zheng, Yefeng, Ghanem, Bernard |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2306.07716 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Lazy Layers to Make Fine-Tuned Diffusion Models More Traceable
by: Liu, Haozhe, et al.
Published: (2024)
by: Liu, Haozhe, et al.
Published: (2024)
X-ray Insights Unleashed: Pioneering the Enhancement of Multi-Label Long-Tail Data
by: Yang, Xinquan, et al.
Published: (2025)
by: Yang, Xinquan, et al.
Published: (2025)
Learning Spectral-Decomposed Tokens for Domain Generalized Semantic Segmentation
by: Yi, Jingjun, et al.
Published: (2024)
by: Yi, Jingjun, et al.
Published: (2024)
Adaptive Convolutional Dictionary Network for CT Metal Artifact Reduction
by: Wang, Hong, et al.
Published: (2022)
by: Wang, Hong, et al.
Published: (2022)
Robust Source-Free Domain Adaptation for Medical Image Segmentation based on Curriculum Learning
by: Zhang, Ziqi, et al.
Published: (2025)
by: Zhang, Ziqi, et al.
Published: (2025)
Can Video Diffusion Model Reconstruct 4D Geometry?
by: Mai, Jinjie, et al.
Published: (2025)
by: Mai, Jinjie, et al.
Published: (2025)
Learning Long-form Video Prior via Generative Pre-Training
by: Xie, Jinheng, et al.
Published: (2024)
by: Xie, Jinheng, et al.
Published: (2024)
Faster Diffusion via Temporal Attention Decomposition
by: Liu, Haozhe, et al.
Published: (2024)
by: Liu, Haozhe, et al.
Published: (2024)
A Hybrid Framework Bridging CNN and ViT based on Theory of Evidence for Diabetic Retinopathy Grading
by: Qiu, Junlai, et al.
Published: (2025)
by: Qiu, Junlai, et al.
Published: (2025)
Prototype Correlation Matching and Class-Relation Reasoning for Few-Shot Medical Image Segmentation
by: Zhang, Yumin, et al.
Published: (2024)
by: Zhang, Yumin, et al.
Published: (2024)
Vivid-ZOO: Multi-View Video Generation with Diffusion Model
by: Li, Bing, et al.
Published: (2024)
by: Li, Bing, et al.
Published: (2024)
DGFamba: Learning Flow Factorized State Space for Visual Domain Generalization
by: Bi, Qi, et al.
Published: (2025)
by: Bi, Qi, et al.
Published: (2025)
URoadNet: Dual Sparse Attentive U-Net for Multiscale Road Network Extraction
by: Song, Jie, et al.
Published: (2024)
by: Song, Jie, et al.
Published: (2024)
Dual Teacher Knowledge Distillation with Domain Alignment for Face Anti-spoofing
by: Kong, Zhe, et al.
Published: (2024)
by: Kong, Zhe, et al.
Published: (2024)
Structure Observation Driven Image-Text Contrastive Learning for Computed Tomography Report Generation
by: Liu, Hong, et al.
Published: (2026)
by: Liu, Hong, et al.
Published: (2026)
Fingerprint Presentation Attack Detector Using Global-Local Model
by: Liu, Haozhe, et al.
Published: (2024)
by: Liu, Haozhe, et al.
Published: (2024)
CTNeRF: Cross-Time Transformer for Dynamic Neural Radiance Field from Monocular Video
by: Miao, Xingyu, et al.
Published: (2024)
by: Miao, Xingyu, et al.
Published: (2024)
SMILE: Infusing Spatial and Motion Semantics in Masked Video Learning
by: Thoker, Fida Mohammad, et al.
Published: (2025)
by: Thoker, Fida Mohammad, et al.
Published: (2025)
Show-o2: Improved Native Unified Multimodal Models
by: Xie, Jinheng, et al.
Published: (2025)
by: Xie, Jinheng, et al.
Published: (2025)
AnomalyXFusion: Multi-modal Anomaly Synthesis with Diffusion
by: Hu, Jie, et al.
Published: (2024)
by: Hu, Jie, et al.
Published: (2024)
TrackMAE: Video Representation Learning via Track Mask and Predict
by: Vandeghen, Renaud, et al.
Published: (2026)
by: Vandeghen, Renaud, et al.
Published: (2026)
ColorMAE: Exploring data-independent masking strategies in Masked AutoEncoders
by: Hinojosa, Carlos, et al.
Published: (2024)
by: Hinojosa, Carlos, et al.
Published: (2024)
K-Space-Aware Cross-Modality Score for Synthesized Neuroimage Quality Assessment
by: Xie, Guoyang, et al.
Published: (2023)
by: Xie, Guoyang, et al.
Published: (2023)
Masked Face Recognition with Generative-to-Discriminative Representations
by: Ge, Shiming, et al.
Published: (2024)
by: Ge, Shiming, et al.
Published: (2024)
Video Self-Stitching Graph Network for Temporal Action Localization
by: Zhao, Chen, et al.
Published: (2020)
by: Zhao, Chen, et al.
Published: (2020)
Radiology Report Generation for Low-Quality X-Ray Images
by: Zhu, Hongze, et al.
Published: (2026)
by: Zhu, Hongze, et al.
Published: (2026)
OmniResponse: Online Multimodal Conversational Response Generation in Dyadic Interactions
by: Luo, Cheng, et al.
Published: (2025)
by: Luo, Cheng, et al.
Published: (2025)
ConRF: Zero-shot Stylization of 3D Scenes with Conditioned Radiation Fields
by: Miao, Xingyu, et al.
Published: (2024)
by: Miao, Xingyu, et al.
Published: (2024)
Rethinking Brain Tumor Segmentation from the Frequency Domain Perspective
by: Shao, Minye, et al.
Published: (2025)
by: Shao, Minye, et al.
Published: (2025)
UniHead: Unifying Multi-Perception for Detection Heads
by: Zhou, Hantao, et al.
Published: (2023)
by: Zhou, Hantao, et al.
Published: (2023)
Unlocking the Potential of MLLMs in Referring Expression Segmentation via a Light-weight Mask Decoder
by: Wang, Jingchao, et al.
Published: (2025)
by: Wang, Jingchao, et al.
Published: (2025)
Dynamic Analysis and Adaptive Discriminator for Fake News Detection
by: Su, Xinqi, et al.
Published: (2024)
by: Su, Xinqi, et al.
Published: (2024)
ADVMEM: Adversarial Memory Initialization for Realistic Test-Time Adaptation via Tracklet-Based Benchmarking
by: Alhuwaider, Shyma, et al.
Published: (2025)
by: Alhuwaider, Shyma, et al.
Published: (2025)
SP-SLAM: Neural Real-Time Dense SLAM With Scene Priors
by: Hong, Zhen, et al.
Published: (2025)
by: Hong, Zhen, et al.
Published: (2025)
GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning
by: Li, Xiaojie, et al.
Published: (2024)
by: Li, Xiaojie, et al.
Published: (2024)
Few-shot Image Generation via Masked Discrimination
by: Zhu, Jingyuan, et al.
Published: (2022)
by: Zhu, Jingyuan, et al.
Published: (2022)
Mind-the-Glitch: Visual Correspondence for Detecting Inconsistencies in Subject-Driven Generation
by: Eldesokey, Abdelrahman, et al.
Published: (2025)
by: Eldesokey, Abdelrahman, et al.
Published: (2025)
Question-Answer Cross Language Image Matching for Weakly Supervised Semantic Segmentation
by: Deng, Songhe, et al.
Published: (2024)
by: Deng, Songhe, et al.
Published: (2024)
TRACE: Temporally Reliable Anatomically-Conditioned 3D CT Generation with Enhanced Efficiency
by: Shao, Minye, et al.
Published: (2025)
by: Shao, Minye, et al.
Published: (2025)
Wearable-based behaviour interpolation for semi-supervised human activity recognition
by: Duan, Haoran, et al.
Published: (2024)
by: Duan, Haoran, et al.
Published: (2024)
Similar Items
-
Lazy Layers to Make Fine-Tuned Diffusion Models More Traceable
by: Liu, Haozhe, et al.
Published: (2024) -
X-ray Insights Unleashed: Pioneering the Enhancement of Multi-Label Long-Tail Data
by: Yang, Xinquan, et al.
Published: (2025) -
Learning Spectral-Decomposed Tokens for Domain Generalized Semantic Segmentation
by: Yi, Jingjun, et al.
Published: (2024) -
Adaptive Convolutional Dictionary Network for CT Metal Artifact Reduction
by: Wang, Hong, et al.
Published: (2022) -
Robust Source-Free Domain Adaptation for Medical Image Segmentation based on Curriculum Learning
by: Zhang, Ziqi, et al.
Published: (2025)