Saved in:
| Main Authors: | Sap, Duygu, Lotz, Martin, Mattinson, Connor |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.16514 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Performance-Efficiency Trade-off for Fashion Image Retrieval
by: Hurtado, Julio, et al.
Published: (2025)
by: Hurtado, Julio, et al.
Published: (2025)
Enhancing Leaf Disease Classification Using GAT-GCN Hybrid Model
by: Sundhar, Shyam, et al.
Published: (2025)
by: Sundhar, Shyam, et al.
Published: (2025)
The Role of Data Curation in Image Captioning
by: Li, Wenyan, et al.
Published: (2023)
by: Li, Wenyan, et al.
Published: (2023)
Personalized Image Editing in Text-to-Image Diffusion Models via Collaborative Direct Preference Optimization
by: Dunlop, Connor, et al.
Published: (2025)
by: Dunlop, Connor, et al.
Published: (2025)
Enhancing Hyperspectral Images via Diffusion Model and Group-Autoencoder Super-resolution Network
by: Wang, Zhaoyang, et al.
Published: (2024)
by: Wang, Zhaoyang, et al.
Published: (2024)
Detection, Retrieval, and Explanation Unified: A Violence Detection System Based on Knowledge Graphs and GAT
by: Jiang, Wen-Dong, et al.
Published: (2025)
by: Jiang, Wen-Dong, et al.
Published: (2025)
SuperiorGAT: Graph Attention Networks for Sparse LiDAR Point Cloud Reconstruction in Autonomous Systems
by: Awedat, Khalfalla, et al.
Published: (2025)
by: Awedat, Khalfalla, et al.
Published: (2025)
Local Representative Token Guided Merging for Text-to-Image Generation
by: Lee, Min-Jeong, et al.
Published: (2025)
by: Lee, Min-Jeong, et al.
Published: (2025)
The Algorithmic Gaze of Image Quality Assessment: An Audit and Trace Ethnography of the LAION-Aesthetics Predictor
by: Taylor, Jordan, et al.
Published: (2026)
by: Taylor, Jordan, et al.
Published: (2026)
REPrune: Channel Pruning via Kernel Representative Selection
by: Park, Mincheol, et al.
Published: (2024)
by: Park, Mincheol, et al.
Published: (2024)
Revisiting Diffusion Autoencoder Training for Image Reconstruction Quality
by: Khungurn, Pramook, et al.
Published: (2025)
by: Khungurn, Pramook, et al.
Published: (2025)
GAT-NeRF: Geometry-Aware-Transformer Enhanced Neural Radiance Fields for High-Fidelity 4D Facial Avatars
by: Chang, Zhe, et al.
Published: (2026)
by: Chang, Zhe, et al.
Published: (2026)
Blocks as Probes: Dissecting Categorization Ability of Large Multimodal Models
by: Fu, Bin, et al.
Published: (2024)
by: Fu, Bin, et al.
Published: (2024)
Uncovering Conceptual Blindspots in Generative Image Models Using Sparse Autoencoders
by: Bohacek, Matyas, et al.
Published: (2025)
by: Bohacek, Matyas, et al.
Published: (2025)
Generalizable Sensor-Based Activity Recognition via Categorical Concept Invariant Learning
by: Xiong, Di, et al.
Published: (2024)
by: Xiong, Di, et al.
Published: (2024)
Diffusion Autoencoder for Unsupervised Artifact Restoration in Handheld Fundus Images
by: Palani, Mathumetha, et al.
Published: (2026)
by: Palani, Mathumetha, et al.
Published: (2026)
DAE-Fuse: An Adaptive Discriminative Autoencoder for Multi-Modality Image Fusion
by: Guo, Yuchen, et al.
Published: (2024)
by: Guo, Yuchen, et al.
Published: (2024)
MAESIL: Masked Autoencoder for Enhanced Self-supervised Medical Image Learning
by: Kim, Kyeonghun, et al.
Published: (2026)
by: Kim, Kyeonghun, et al.
Published: (2026)
Latent Diffusion Model without Variational Autoencoder
by: Shi, Minglei, et al.
Published: (2025)
by: Shi, Minglei, et al.
Published: (2025)
MaeFuse: Transferring Omni Features with Pretrained Masked Autoencoders for Infrared and Visible Image Fusion via Guided Training
by: Li, Jiayang, et al.
Published: (2024)
by: Li, Jiayang, et al.
Published: (2024)
Exploring Social Media Image Categorization Using Large Models with Different Adaptation Methods: A Case Study on Cultural Nature's Contributions to People
by: Khaldi, Rohaifa, et al.
Published: (2024)
by: Khaldi, Rohaifa, et al.
Published: (2024)
Representing Animatable Avatar via Factorized Neural Fields
by: Song, Chunjin, et al.
Published: (2024)
by: Song, Chunjin, et al.
Published: (2024)
Unsupervised Tomato Split Anomaly Detection using Hyperspectral Imaging and Variational Autoencoders
by: Abdulsalam, Mahmoud, et al.
Published: (2025)
by: Abdulsalam, Mahmoud, et al.
Published: (2025)
Latent-Compressed Variational Autoencoder for Video Diffusion Models
by: Guan, Jiarui, et al.
Published: (2026)
by: Guan, Jiarui, et al.
Published: (2026)
Diffusion Autoencoders are Scalable Image Tokenizers
by: Chen, Yinbo, et al.
Published: (2025)
by: Chen, Yinbo, et al.
Published: (2025)
CLIMB: Controllable Longitudinal Brain Image Generation using Mamba-based Latent Diffusion Model and Gaussian-aligned Autoencoder
by: Dao, Duy-Phuong, et al.
Published: (2026)
by: Dao, Duy-Phuong, et al.
Published: (2026)
Open Ad-hoc Categorization with Contextualized Feature Learning
by: Wang, Zilin, et al.
Published: (2025)
by: Wang, Zilin, et al.
Published: (2025)
Gaussian Masked Autoencoders
by: Rajasegaran, Jathushan, et al.
Published: (2025)
by: Rajasegaran, Jathushan, et al.
Published: (2025)
One-Step is Enough: Sparse Autoencoders for Text-to-Image Diffusion Models
by: Surkov, Viacheslav, et al.
Published: (2024)
by: Surkov, Viacheslav, et al.
Published: (2024)
Backdoor Defense in Diffusion Models via Spatial Attention Unlearning
by: Jha, Abha, et al.
Published: (2025)
by: Jha, Abha, et al.
Published: (2025)
SAEmnesia: Erasing Concepts in Diffusion Models with Supervised Sparse Autoencoders
by: Cassano, Enrico, et al.
Published: (2025)
by: Cassano, Enrico, et al.
Published: (2025)
Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models
by: Chen, Junyu, et al.
Published: (2024)
by: Chen, Junyu, et al.
Published: (2024)
Do Vision and Language Encoders Represent the World Similarly?
by: Maniparambil, Mayug, et al.
Published: (2024)
by: Maniparambil, Mayug, et al.
Published: (2024)
Text-based Person Search in Full Images via Semantic-Driven Proposal Generation
by: Zhang, Shizhou, et al.
Published: (2021)
by: Zhang, Shizhou, et al.
Published: (2021)
Sparse Autoencoder as a Zero-Shot Classifier for Concept Erasing in Text-to-Image Diffusion Models
by: Tian, Zhihua, et al.
Published: (2025)
by: Tian, Zhihua, et al.
Published: (2025)
Representing 3D Shapes With 64 Latent Vectors for 3D Diffusion Models
by: Cho, In, et al.
Published: (2025)
by: Cho, In, et al.
Published: (2025)
Spotlighter: Revisiting Prompt Tuning from a Representative Mining View
by: Gao, Yutong, et al.
Published: (2025)
by: Gao, Yutong, et al.
Published: (2025)
SAUCE: Selective Concept Unlearning in Vision-Language Models with Sparse Autoencoders
by: Li, Qing, et al.
Published: (2025)
by: Li, Qing, et al.
Published: (2025)
Mind the Gesture: Evaluating AI Sensitivity to Culturally Offensive Non-Verbal Gestures
by: Yerukola, Akhila, et al.
Published: (2025)
by: Yerukola, Akhila, et al.
Published: (2025)
Ladder Bottom-up Convolutional Bidirectional Variational Autoencoder for Image Translation of Dotted Arabic Expiration Dates
by: Zidane, Ahmed, et al.
Published: (2023)
by: Zidane, Ahmed, et al.
Published: (2023)
Similar Items
-
Performance-Efficiency Trade-off for Fashion Image Retrieval
by: Hurtado, Julio, et al.
Published: (2025) -
Enhancing Leaf Disease Classification Using GAT-GCN Hybrid Model
by: Sundhar, Shyam, et al.
Published: (2025) -
The Role of Data Curation in Image Captioning
by: Li, Wenyan, et al.
Published: (2023) -
Personalized Image Editing in Text-to-Image Diffusion Models via Collaborative Direct Preference Optimization
by: Dunlop, Connor, et al.
Published: (2025) -
Enhancing Hyperspectral Images via Diffusion Model and Group-Autoencoder Super-resolution Network
by: Wang, Zhaoyang, et al.
Published: (2024)