Saved in:
| Main Author: | Zare, Mohammad |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2512.00088 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Symbolic Disentangled Representations for Images
by: Korchemnyi, Alexandr, et al.
Published: (2024)
by: Korchemnyi, Alexandr, et al.
Published: (2024)
Hyperbolic Image-Text Representations
by: Desai, Karan, et al.
Published: (2023)
by: Desai, Karan, et al.
Published: (2023)
ARETE: Attention-based Rasterized Encoding for Topology Estimation using HSV-transformed Crowdsourced Vehicle Fleet Data
by: Fritz, Daniel, et al.
Published: (2026)
by: Fritz, Daniel, et al.
Published: (2026)
Improved Probabilistic Image-Text Representations
by: Chun, Sanghyuk
Published: (2023)
by: Chun, Sanghyuk
Published: (2023)
Leveraging Stable Diffusion for Monocular Depth Estimation via Image Semantic Encoding
by: Xia, Jingming, et al.
Published: (2025)
by: Xia, Jingming, et al.
Published: (2025)
Text-Conditional JEPA for Learning Semantically Rich Visual Representations
by: Huang, Chen, et al.
Published: (2026)
by: Huang, Chen, et al.
Published: (2026)
ARGENT: Adaptive Hierarchical Image-Text Representations
by: Huynh, Chuong, et al.
Published: (2026)
by: Huynh, Chuong, et al.
Published: (2026)
Neural Image Compression with Text-guided Encoding for both Pixel-level and Perceptual Fidelity
by: Lee, Hagyeong, et al.
Published: (2024)
by: Lee, Hagyeong, et al.
Published: (2024)
Disentangled Representation Learning with Transmitted Information Bottleneck
by: Dang, Zhuohang, et al.
Published: (2023)
by: Dang, Zhuohang, et al.
Published: (2023)
Disentangled Representation Learning with the Gromov-Monge Gap
by: Uscidda, Théo, et al.
Published: (2024)
by: Uscidda, Théo, et al.
Published: (2024)
Semantic Positive Pairs for Enhancing Visual Representation Learning of Instance Discrimination Methods
by: Alkhalefi, Mohammad, et al.
Published: (2023)
by: Alkhalefi, Mohammad, et al.
Published: (2023)
StableSemantics: A Synthetic Language-Vision Dataset of Semantic Representations in Naturalistic Images
by: Zawar, Rushikesh, et al.
Published: (2024)
by: Zawar, Rushikesh, et al.
Published: (2024)
Disentangled Representation Learning via Modular Compositional Bias
by: Jung, Whie, et al.
Published: (2025)
by: Jung, Whie, et al.
Published: (2025)
Text Embedding is Not All You Need: Attention Control for Text-to-Image Semantic Alignment with Text Self-Attention Maps
by: Kim, Jeeyung, et al.
Published: (2024)
by: Kim, Jeeyung, et al.
Published: (2024)
Tripod: Three Complementary Inductive Biases for Disentangled Representation Learning
by: Hsu, Kyle, et al.
Published: (2024)
by: Hsu, Kyle, et al.
Published: (2024)
Isometric Representation Learning for Disentangled Latent Space of Diffusion Models
by: Hahm, Jaehoon, et al.
Published: (2024)
by: Hahm, Jaehoon, et al.
Published: (2024)
CLIPTime: Time-Aware Multimodal Representation Learning from Images and Text
by: Rani, Anju, et al.
Published: (2025)
by: Rani, Anju, et al.
Published: (2025)
LASERS: LAtent Space Encoding for Representations with Sparsity for Generative Modeling
by: Li, Xin, et al.
Published: (2024)
by: Li, Xin, et al.
Published: (2024)
Text-to-Image GAN with Pretrained Representations
by: You, Xiaozhou, et al.
Published: (2024)
by: You, Xiaozhou, et al.
Published: (2024)
Prompt Disentanglement via Language Guidance and Representation Alignment for Domain Generalization
by: Cheng, De, et al.
Published: (2025)
by: Cheng, De, et al.
Published: (2025)
Domain Generalization in-the-Wild: Disentangling Classification from Domain-Aware Representations
by: Son, Ha Min, et al.
Published: (2025)
by: Son, Ha Min, et al.
Published: (2025)
L-VAE: Variational Auto-Encoder with Learnable Beta for Disentangled Representation
by: Ozcan, Hazal Mogultay, et al.
Published: (2025)
by: Ozcan, Hazal Mogultay, et al.
Published: (2025)
Improving the Reconstruction of Disentangled Representation Learners via Multi-Stage Modeling
by: Srivastava, Akash, et al.
Published: (2020)
by: Srivastava, Akash, et al.
Published: (2020)
Fine-Grained Alignment and Noise Refinement for Compositional Text-to-Image Generation
by: Izadi, Amir Mohammad, et al.
Published: (2025)
by: Izadi, Amir Mohammad, et al.
Published: (2025)
Lego: Learning to Disentangle and Invert Personalized Concepts Beyond Object Appearance in Text-to-Image Diffusion Models
by: Motamed, Saman, et al.
Published: (2023)
by: Motamed, Saman, et al.
Published: (2023)
Fill the Gap: Quantifying and Reducing the Modality Gap in Image-Text Representation Learning
by: Role, François, et al.
Published: (2025)
by: Role, François, et al.
Published: (2025)
Disentangle and Regularize: Sign Language Production with Articulator-Based Disentanglement and Channel-Aware Regularization
by: Tasyurek, Sumeyye Meryem, et al.
Published: (2025)
by: Tasyurek, Sumeyye Meryem, et al.
Published: (2025)
Neighbour-level Message Interaction Encoding for Improved Representation Learning on Graphs
by: Zhang, Haimin, et al.
Published: (2024)
by: Zhang, Haimin, et al.
Published: (2024)
Disentangling Disentangled Representations: Towards Improved Latent Units via Diffusion Models
by: Jun, Youngjun, et al.
Published: (2024)
by: Jun, Youngjun, et al.
Published: (2024)
Disentanglement and Assessment of Shortcuts in Ophthalmological Retinal Imaging Exams
by: Fernandes, Leonor, et al.
Published: (2025)
by: Fernandes, Leonor, et al.
Published: (2025)
DRESS: Disentangled Representation-based Self-Supervised Meta-Learning for Diverse Tasks
by: Cui, Wei, et al.
Published: (2025)
by: Cui, Wei, et al.
Published: (2025)
Disentangling Mean Embeddings for Better Diagnostics of Image Generators
by: Gruber, Sebastian G., et al.
Published: (2024)
by: Gruber, Sebastian G., et al.
Published: (2024)
Disentanglement-Based Equivariant Learning for Compositional VQA
by: Du, Zhou, et al.
Published: (2026)
by: Du, Zhou, et al.
Published: (2026)
Anisotropic Fourier Features for Positional Encoding in Medical Imaging
by: Jabareen, Nabil, et al.
Published: (2025)
by: Jabareen, Nabil, et al.
Published: (2025)
Learning Visual-Semantic Subspace Representations
by: Moreira, Gabriel, et al.
Published: (2024)
by: Moreira, Gabriel, et al.
Published: (2024)
Closed-Loop Unsupervised Representation Disentanglement with $β$-VAE Distillation and Diffusion Probabilistic Feedback
by: Jin, Xin, et al.
Published: (2024)
by: Jin, Xin, et al.
Published: (2024)
Graph-based Unsupervised Disentangled Representation Learning via Multimodal Large Language Models
by: Xie, Baao, et al.
Published: (2024)
by: Xie, Baao, et al.
Published: (2024)
Unsupervised Learning of Disentangled Representations from Video
by: Denton, Remi, et al.
Published: (2017)
by: Denton, Remi, et al.
Published: (2017)
Explicitly Disentangled Representations in Object-Centric Learning
by: Majellaro, Riccardo, et al.
Published: (2024)
by: Majellaro, Riccardo, et al.
Published: (2024)
Disentangled World Models: Learning to Transfer Semantic Knowledge from Distracting Videos for Reinforcement Learning
by: Wang, Qi, et al.
Published: (2025)
by: Wang, Qi, et al.
Published: (2025)
Similar Items
-
Symbolic Disentangled Representations for Images
by: Korchemnyi, Alexandr, et al.
Published: (2024) -
Hyperbolic Image-Text Representations
by: Desai, Karan, et al.
Published: (2023) -
ARETE: Attention-based Rasterized Encoding for Topology Estimation using HSV-transformed Crowdsourced Vehicle Fleet Data
by: Fritz, Daniel, et al.
Published: (2026) -
Improved Probabilistic Image-Text Representations
by: Chun, Sanghyuk
Published: (2023) -
Leveraging Stable Diffusion for Monocular Depth Estimation via Image Semantic Encoding
by: Xia, Jingming, et al.
Published: (2025)