:: Library Catalog

Bibliographic Details
Main Author:	Zare, Mohammad
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition; Machine Learning
Online Access:	https://arxiv.org/abs/2512.00088

Similar Items

Symbolic Disentangled Representations for Images
by: Korchemnyi, Alexandr, et al.
Published: (2024)

Hyperbolic Image-Text Representations
by: Desai, Karan, et al.
Published: (2023)

ARETE: Attention-based Rasterized Encoding for Topology Estimation using HSV-transformed Crowdsourced Vehicle Fleet Data
by: Fritz, Daniel, et al.
Published: (2026)

Improved Probabilistic Image-Text Representations
by: Chun, Sanghyuk
Published: (2023)

Leveraging Stable Diffusion for Monocular Depth Estimation via Image Semantic Encoding
by: Xia, Jingming, et al.
Published: (2025)

Text-Conditional JEPA for Learning Semantically Rich Visual Representations
by: Huang, Chen, et al.
Published: (2026)

ARGENT: Adaptive Hierarchical Image-Text Representations
by: Huynh, Chuong, et al.
Published: (2026)

Neural Image Compression with Text-guided Encoding for both Pixel-level and Perceptual Fidelity
by: Lee, Hagyeong, et al.
Published: (2024)

Disentangled Representation Learning with Transmitted Information Bottleneck
by: Dang, Zhuohang, et al.
Published: (2023)

Disentangled Representation Learning with the Gromov-Monge Gap
by: Uscidda, Théo, et al.
Published: (2024)

Semantic Positive Pairs for Enhancing Visual Representation Learning of Instance Discrimination Methods
by: Alkhalefi, Mohammad, et al.
Published: (2023)

StableSemantics: A Synthetic Language-Vision Dataset of Semantic Representations in Naturalistic Images
by: Zawar, Rushikesh, et al.
Published: (2024)

Disentangled Representation Learning via Modular Compositional Bias
by: Jung, Whie, et al.
Published: (2025)

Text Embedding is Not All You Need: Attention Control for Text-to-Image Semantic Alignment with Text Self-Attention Maps
by: Kim, Jeeyung, et al.
Published: (2024)

Tripod: Three Complementary Inductive Biases for Disentangled Representation Learning
by: Hsu, Kyle, et al.
Published: (2024)

Isometric Representation Learning for Disentangled Latent Space of Diffusion Models
by: Hahm, Jaehoon, et al.
Published: (2024)

CLIPTime: Time-Aware Multimodal Representation Learning from Images and Text
by: Rani, Anju, et al.
Published: (2025)

LASERS: LAtent Space Encoding for Representations with Sparsity for Generative Modeling
by: Li, Xin, et al.
Published: (2024)

Text-to-Image GAN with Pretrained Representations
by: You, Xiaozhou, et al.
Published: (2024)

Prompt Disentanglement via Language Guidance and Representation Alignment for Domain Generalization
by: Cheng, De, et al.
Published: (2025)

Domain Generalization in-the-Wild: Disentangling Classification from Domain-Aware Representations
by: Son, Ha Min, et al.
Published: (2025)

L-VAE: Variational Auto-Encoder with Learnable Beta for Disentangled Representation
by: Ozcan, Hazal Mogultay, et al.
Published: (2025)

Improving the Reconstruction of Disentangled Representation Learners via Multi-Stage Modeling
by: Srivastava, Akash, et al.
Published: (2020)

Fine-Grained Alignment and Noise Refinement for Compositional Text-to-Image Generation
by: Izadi, Amir Mohammad, et al.
Published: (2025)

Lego: Learning to Disentangle and Invert Personalized Concepts Beyond Object Appearance in Text-to-Image Diffusion Models
by: Motamed, Saman, et al.
Published: (2023)

Fill the Gap: Quantifying and Reducing the Modality Gap in Image-Text Representation Learning
by: Role, François, et al.
Published: (2025)

Disentangle and Regularize: Sign Language Production with Articulator-Based Disentanglement and Channel-Aware Regularization
by: Tasyurek, Sumeyye Meryem, et al.
Published: (2025)

Neighbour-level Message Interaction Encoding for Improved Representation Learning on Graphs
by: Zhang, Haimin, et al.
Published: (2024)

Disentangling Disentangled Representations: Towards Improved Latent Units via Diffusion Models
by: Jun, Youngjun, et al.
Published: (2024)

Disentanglement and Assessment of Shortcuts in Ophthalmological Retinal Imaging Exams
by: Fernandes, Leonor, et al.
Published: (2025)

DRESS: Disentangled Representation-based Self-Supervised Meta-Learning for Diverse Tasks
by: Cui, Wei, et al.
Published: (2025)

Disentangling Mean Embeddings for Better Diagnostics of Image Generators
by: Gruber, Sebastian G., et al.
Published: (2024)

Disentanglement-Based Equivariant Learning for Compositional VQA
by: Du, Zhou, et al.
Published: (2026)

Anisotropic Fourier Features for Positional Encoding in Medical Imaging
by: Jabareen, Nabil, et al.
Published: (2025)

Learning Visual-Semantic Subspace Representations
by: Moreira, Gabriel, et al.
Published: (2024)

Closed-Loop Unsupervised Representation Disentanglement with β-VAE Distillation and Diffusion Probabilistic Feedback
by: Jin, Xin, et al.
Published: (2024)

Graph-based Unsupervised Disentangled Representation Learning via Multimodal Large Language Models
by: Xie, Baao, et al.
Published: (2024)

Unsupervised Learning of Disentangled Representations from Video
by: Denton, Remi, et al.
Published: (2017)

Explicitly Disentangled Representations in Object-Centric Learning
by: Majellaro, Riccardo, et al.
Published: (2024)

Disentangled World Models: Learning to Transfer Semantic Knowledge from Distracting Videos for Reinforcement Learning
by: Wang, Qi, et al.
Published: (2025)