Saved in:
| Main Authors: | He, Zongyao, Jin, Zhi |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2404.16451 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Uncertainties of Latent Representations in Computer Vision
by: Kirchhof, Michael
Published: (2024)
by: Kirchhof, Michael
Published: (2024)
OT-ALD: Aligning Latent Distributions with Optimal Transport for Accelerated Image-to-Image Translation
by: Wang, Zhanpeng, et al.
Published: (2025)
by: Wang, Zhanpeng, et al.
Published: (2025)
Robust Latent Representation Tuning for Image-text Classification
by: Sun, Hao, et al.
Published: (2024)
by: Sun, Hao, et al.
Published: (2024)
Accelerating Learned Video Compression via Low-Resolution Representation Learning
by: Qiu, Zidian, et al.
Published: (2024)
by: Qiu, Zidian, et al.
Published: (2024)
Towards Latent Masked Image Modeling for Self-Supervised Visual Representation Learning
by: Wei, Yibing, et al.
Published: (2024)
by: Wei, Yibing, et al.
Published: (2024)
Feature Map Convergence Evaluation for Functional Module
by: Zhang, Ludan, et al.
Published: (2024)
by: Zhang, Ludan, et al.
Published: (2024)
Taxonomy-Aware Representation Alignment for Hierarchical Visual Recognition with Large Multimodal Models
by: He, Hulingxiao, et al.
Published: (2026)
by: He, Hulingxiao, et al.
Published: (2026)
Self-Corrected Image Generation with Explainable Latent Rewards
by: Luo, Yinyi, et al.
Published: (2026)
by: Luo, Yinyi, et al.
Published: (2026)
KFS-Bench: Comprehensive Evaluation of Key Frame Sampling in Long Video Understanding
by: Li, Zongyao, et al.
Published: (2025)
by: Li, Zongyao, et al.
Published: (2025)
BiGR: Harnessing Binary Latent Codes for Image Generation and Improved Visual Representation Capabilities
by: Hao, Shaozhe, et al.
Published: (2024)
by: Hao, Shaozhe, et al.
Published: (2024)
Prompt-SID: Learning Structural Representation Prompt via Latent Diffusion for Single-Image Denoising
by: Li, Huaqiu, et al.
Published: (2025)
by: Li, Huaqiu, et al.
Published: (2025)
Domain-Specific Latent Representations Improve the Fidelity of Diffusion-Based Medical Image Super-Resolution
by: Cajas, Sebastian, et al.
Published: (2026)
by: Cajas, Sebastian, et al.
Published: (2026)
Conditional Latent Coding with Learnable Synthesized Reference for Deep Image Compression
by: Wu, Siqi, et al.
Published: (2025)
by: Wu, Siqi, et al.
Published: (2025)
SocialCVAE: Predicting Pedestrian Trajectory via Interaction Conditioned Latents
by: Xiang, Wei, et al.
Published: (2024)
by: Xiang, Wei, et al.
Published: (2024)
Learning Latent Representations for Image Translation using Frequency Distributed CycleGAN
by: Nigam, Shivangi, et al.
Published: (2025)
by: Nigam, Shivangi, et al.
Published: (2025)
Latent Distillation for Continual Object Detection at the Edge
by: Pasti, Francesco, et al.
Published: (2024)
by: Pasti, Francesco, et al.
Published: (2024)
GOAL: Geometrically Optimal Alignment for Continual Generalized Category Discovery
by: Han, Jizhou, et al.
Published: (2026)
by: Han, Jizhou, et al.
Published: (2026)
DGAE: Diffusion-Guided Autoencoder for Efficient Latent Representation Learning
by: Liu, Dongxu, et al.
Published: (2025)
by: Liu, Dongxu, et al.
Published: (2025)
MT-Mark: Rethinking Image Watermarking via Mutual-Teacher Collaboration with Adaptive Feature Modulation
by: Ge, Fei, et al.
Published: (2025)
by: Ge, Fei, et al.
Published: (2025)
MARR: Module-Adaptive Residual Reconstruction for Low-Bit Post-Training Quantization
by: Su, Le, et al.
Published: (2026)
by: Su, Le, et al.
Published: (2026)
Efficient Neural Video Representation with Temporally Coherent Modulation
by: Shin, Seungjun, et al.
Published: (2025)
by: Shin, Seungjun, et al.
Published: (2025)
PROSPECT: Unified Streaming Vision-Language Navigation via Semantic--Spatial Fusion and Latent Predictive Representation
by: Fan, Zehua, et al.
Published: (2026)
by: Fan, Zehua, et al.
Published: (2026)
Latent Bias Alignment for High-Fidelity Diffusion Inversion in Real-World Image Reconstruction and Manipulation
by: Chen, Weiming, et al.
Published: (2026)
by: Chen, Weiming, et al.
Published: (2026)
Joint Imaging-ROI Representation Learning via Cross-View Contrastive Alignment for Brain Disorder Classification
by: Liang, Wei, et al.
Published: (2026)
by: Liang, Wei, et al.
Published: (2026)
Contrastive Learning Guided Latent Diffusion Model for Image-to-Image Translation
by: Si, Qi, et al.
Published: (2025)
by: Si, Qi, et al.
Published: (2025)
Large Images are Gaussians: High-Quality Large Image Representation with Levels of 2D Gaussian Splatting
by: Zhu, Lingting, et al.
Published: (2025)
by: Zhu, Lingting, et al.
Published: (2025)
Narrowing Information Bottleneck Theory for Multimodal Image-Text Representations Interpretability
by: Zhu, Zhiyu, et al.
Published: (2025)
by: Zhu, Zhiyu, et al.
Published: (2025)
Adaptive Clinical-Aware Latent Diffusion for Multimodal Brain Image Generation and Missing Modality Imputation
by: Zhou, Rong, et al.
Published: (2026)
by: Zhou, Rong, et al.
Published: (2026)
UnfoldLDM: Degradation-Aware Unfolding with Iterative Latent Diffusion Priors for Blind Image Restoration
by: He, Chunming, et al.
Published: (2025)
by: He, Chunming, et al.
Published: (2025)
Latent Diffusion Models for Attribute-Preserving Image Anonymization
by: Piano, Luca, et al.
Published: (2024)
by: Piano, Luca, et al.
Published: (2024)
Spatial-Aware Latent Initialization for Controllable Image Generation
by: Sun, Wenqiang, et al.
Published: (2024)
by: Sun, Wenqiang, et al.
Published: (2024)
Latent Expression Generation for Referring Image Segmentation and Grounding
by: Yu, Seonghoon, et al.
Published: (2025)
by: Yu, Seonghoon, et al.
Published: (2025)
Diffuse and Disperse: Image Generation with Representation Regularization
by: Wang, Runqian, et al.
Published: (2025)
by: Wang, Runqian, et al.
Published: (2025)
LVDrive: Latent Visual Representation Enhanced Vision-Language-Action Autonomous Driving Model
by: Mei, Xiaodong, et al.
Published: (2026)
by: Mei, Xiaodong, et al.
Published: (2026)
Introducing 3D Representation for Medical Image Volume-to-Volume Translation via Score Fusion
by: Zhu, Xiyue, et al.
Published: (2025)
by: Zhu, Xiyue, et al.
Published: (2025)
LatentEdit: Adaptive Latent Control for Consistent Semantic Editing
by: Liu, Siyi, et al.
Published: (2025)
by: Liu, Siyi, et al.
Published: (2025)
Probing the Latent World: Emergent Discrete Symbols and Physical Structure in Latent Representations
by: ming, Liu hung
Published: (2026)
by: ming, Liu hung
Published: (2026)
Latent Drifting in Diffusion Models for Counterfactual Medical Image Synthesis
by: Yeganeh, Yousef, et al.
Published: (2024)
by: Yeganeh, Yousef, et al.
Published: (2024)
Diffusion Model in Latent Space for Medical Image Segmentation Task
by: Ngoc, Huynh Trinh, et al.
Published: (2025)
by: Ngoc, Huynh Trinh, et al.
Published: (2025)
Cluster and Predict Latent Patches for Improved Masked Image Modeling
by: Darcet, Timothée, et al.
Published: (2025)
by: Darcet, Timothée, et al.
Published: (2025)
Similar Items
-
Uncertainties of Latent Representations in Computer Vision
by: Kirchhof, Michael
Published: (2024) -
OT-ALD: Aligning Latent Distributions with Optimal Transport for Accelerated Image-to-Image Translation
by: Wang, Zhanpeng, et al.
Published: (2025) -
Robust Latent Representation Tuning for Image-text Classification
by: Sun, Hao, et al.
Published: (2024) -
Accelerating Learned Video Compression via Low-Resolution Representation Learning
by: Qiu, Zidian, et al.
Published: (2024) -
Towards Latent Masked Image Modeling for Self-Supervised Visual Representation Learning
by: Wei, Yibing, et al.
Published: (2024)