:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Zhang, Wentian, Liu, Haozhe, Li, Bing, Xie, Jinheng, Huang, Yawen, Li, Yuexiang, Zheng, Yefeng, Ghanem, Bernard
Format:	Preprint
Published:	2023
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2306.07716
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Lazy Layers to Make Fine-Tuned Diffusion Models More Traceable
by: Liu, Haozhe, et al.
Published: (2024)

X-ray Insights Unleashed: Pioneering the Enhancement of Multi-Label Long-Tail Data
by: Yang, Xinquan, et al.
Published: (2025)

Learning Spectral-Decomposed Tokens for Domain Generalized Semantic Segmentation
by: Yi, Jingjun, et al.
Published: (2024)

Adaptive Convolutional Dictionary Network for CT Metal Artifact Reduction
by: Wang, Hong, et al.
Published: (2022)

Robust Source-Free Domain Adaptation for Medical Image Segmentation based on Curriculum Learning
by: Zhang, Ziqi, et al.
Published: (2025)

Can Video Diffusion Model Reconstruct 4D Geometry?
by: Mai, Jinjie, et al.
Published: (2025)

Learning Long-form Video Prior via Generative Pre-Training
by: Xie, Jinheng, et al.
Published: (2024)

Faster Diffusion via Temporal Attention Decomposition
by: Liu, Haozhe, et al.
Published: (2024)

A Hybrid Framework Bridging CNN and ViT based on Theory of Evidence for Diabetic Retinopathy Grading
by: Qiu, Junlai, et al.
Published: (2025)

Prototype Correlation Matching and Class-Relation Reasoning for Few-Shot Medical Image Segmentation
by: Zhang, Yumin, et al.
Published: (2024)

Vivid-ZOO: Multi-View Video Generation with Diffusion Model
by: Li, Bing, et al.
Published: (2024)

DGFamba: Learning Flow Factorized State Space for Visual Domain Generalization
by: Bi, Qi, et al.
Published: (2025)

URoadNet: Dual Sparse Attentive U-Net for Multiscale Road Network Extraction
by: Song, Jie, et al.
Published: (2024)

Dual Teacher Knowledge Distillation with Domain Alignment for Face Anti-spoofing
by: Kong, Zhe, et al.
Published: (2024)

Structure Observation Driven Image-Text Contrastive Learning for Computed Tomography Report Generation
by: Liu, Hong, et al.
Published: (2026)

Fingerprint Presentation Attack Detector Using Global-Local Model
by: Liu, Haozhe, et al.
Published: (2024)

CTNeRF: Cross-Time Transformer for Dynamic Neural Radiance Field from Monocular Video
by: Miao, Xingyu, et al.
Published: (2024)

SMILE: Infusing Spatial and Motion Semantics in Masked Video Learning
by: Thoker, Fida Mohammad, et al.
Published: (2025)

Show-o2: Improved Native Unified Multimodal Models
by: Xie, Jinheng, et al.
Published: (2025)

AnomalyXFusion: Multi-modal Anomaly Synthesis with Diffusion
by: Hu, Jie, et al.
Published: (2024)

TrackMAE: Video Representation Learning via Track Mask and Predict
by: Vandeghen, Renaud, et al.
Published: (2026)

ColorMAE: Exploring data-independent masking strategies in Masked AutoEncoders
by: Hinojosa, Carlos, et al.
Published: (2024)

K-Space-Aware Cross-Modality Score for Synthesized Neuroimage Quality Assessment
by: Xie, Guoyang, et al.
Published: (2023)

Masked Face Recognition with Generative-to-Discriminative Representations
by: Ge, Shiming, et al.
Published: (2024)

Video Self-Stitching Graph Network for Temporal Action Localization
by: Zhao, Chen, et al.
Published: (2020)

Radiology Report Generation for Low-Quality X-Ray Images
by: Zhu, Hongze, et al.
Published: (2026)

OmniResponse: Online Multimodal Conversational Response Generation in Dyadic Interactions
by: Luo, Cheng, et al.
Published: (2025)

ConRF: Zero-shot Stylization of 3D Scenes with Conditioned Radiation Fields
by: Miao, Xingyu, et al.
Published: (2024)

Rethinking Brain Tumor Segmentation from the Frequency Domain Perspective
by: Shao, Minye, et al.
Published: (2025)

UniHead: Unifying Multi-Perception for Detection Heads
by: Zhou, Hantao, et al.
Published: (2023)

Unlocking the Potential of MLLMs in Referring Expression Segmentation via a Light-weight Mask Decoder
by: Wang, Jingchao, et al.
Published: (2025)

Dynamic Analysis and Adaptive Discriminator for Fake News Detection
by: Su, Xinqi, et al.
Published: (2024)

ADVMEM: Adversarial Memory Initialization for Realistic Test-Time Adaptation via Tracklet-Based Benchmarking
by: Alhuwaider, Shyma, et al.
Published: (2025)

SP-SLAM: Neural Real-Time Dense SLAM With Scene Priors
by: Hong, Zhen, et al.
Published: (2025)

GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning
by: Li, Xiaojie, et al.
Published: (2024)

Few-shot Image Generation via Masked Discrimination
by: Zhu, Jingyuan, et al.
Published: (2022)

Mind-the-Glitch: Visual Correspondence for Detecting Inconsistencies in Subject-Driven Generation
by: Eldesokey, Abdelrahman, et al.
Published: (2025)

Question-Answer Cross Language Image Matching for Weakly Supervised Semantic Segmentation
by: Deng, Songhe, et al.
Published: (2024)

TRACE: Temporally Reliable Anatomically-Conditioned 3D CT Generation with Enhanced Efficiency
by: Shao, Minye, et al.
Published: (2025)

Wearable-based behaviour interpolation for semi-supervised human activity recognition
by: Duan, Haoran, et al.
Published: (2024)