Saved in:
| Main Authors: | Singh, Silky, Jandial, Surgan, Shahid, Simra, Java, Abhinav |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2405.16330 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
ReEdit: Multimodal Exemplar-Based Image Editing with Diffusion Models
by: Srivastava, Ashutosh, et al.
Published: (2024)
by: Srivastava, Ashutosh, et al.
Published: (2024)
Towards Efficient Exemplar Based Image Editing with Multimodal VLMs
by: Jadhav, Avadhoot, et al.
Published: (2025)
by: Jadhav, Avadhoot, et al.
Published: (2025)
S2H-DPO: Hardness-Aware Preference Optimization for Vision-Language Models
by: Shukla, Nitish, et al.
Published: (2026)
by: Shukla, Nitish, et al.
Published: (2026)
Thinking Fair and Slow: On the Efficacy of Structured Prompts for Debiasing Language Models
by: Furniturewala, Shaz, et al.
Published: (2024)
by: Furniturewala, Shaz, et al.
Published: (2024)
Towards Operationalizing Right to Data Protection
by: Java, Abhinav, et al.
Published: (2024)
by: Java, Abhinav, et al.
Published: (2024)
Understanding Task Transfer in Vision-Language Models
by: Sachdeva, Bhuvan, et al.
Published: (2025)
by: Sachdeva, Bhuvan, et al.
Published: (2025)
Artistic-style text detector and a new Movie-Poster dataset
by: Ning, Aoxiang, et al.
Published: (2024)
by: Ning, Aoxiang, et al.
Published: (2024)
Procedural terrain generation with style transfer
by: Merizzi, Fabio
Published: (2024)
by: Merizzi, Fabio
Published: (2024)
Multiscale style transfer based on a Laplacian pyramid for traditional Chinese painting
by: Liu, Kunxiao, et al.
Published: (2025)
by: Liu, Kunxiao, et al.
Published: (2025)
On dataset transferability in medical image classification
by: Juodelyte, Dovile, et al.
Published: (2024)
by: Juodelyte, Dovile, et al.
Published: (2024)
Stylized Meta-Album: Group-bias injection with style transfer to study robustness against distribution shifts
by: Mussard, Romain, et al.
Published: (2025)
by: Mussard, Romain, et al.
Published: (2025)
Does resistance to style-transfer equal Global Shape Bias? Measuring network sensitivity to global shape configuration
by: Wen, Ziqi, et al.
Published: (2023)
by: Wen, Ziqi, et al.
Published: (2023)
Exploring text-to-image generation for historical document image retrieval
by: Cote, Melissa, et al.
Published: (2025)
by: Cote, Melissa, et al.
Published: (2025)
The persistence of painting styles
by: Munnangi, Reetikaa Reddy, et al.
Published: (2025)
by: Munnangi, Reetikaa Reddy, et al.
Published: (2025)
Do text-free diffusion models learn discriminative visual representations?
by: Mukhopadhyay, Soumik, et al.
Published: (2023)
by: Mukhopadhyay, Soumik, et al.
Published: (2023)
Learning text-to-video retrieval from image captioning
by: Ventura, Lucas, et al.
Published: (2024)
by: Ventura, Lucas, et al.
Published: (2024)
Hyper-parameter tuning for text guided image editing
by: Zhang, Shiwen
Published: (2024)
by: Zhang, Shiwen
Published: (2024)
CMRNext: Camera to LiDAR Matching in the Wild for Localization and Extrinsic Calibration
by: Cattaneo, Daniele, et al.
Published: (2024)
by: Cattaneo, Daniele, et al.
Published: (2024)
Exploring scalable medical image encoders beyond text supervision
by: Pérez-García, Fernando, et al.
Published: (2024)
by: Pérez-García, Fernando, et al.
Published: (2024)
Efficient scene text image super-resolution with semantic guidance
by: TomyEnrique, LeoWu, et al.
Published: (2024)
by: TomyEnrique, LeoWu, et al.
Published: (2024)
Ranking-aware adapter for text-driven image ordering with CLIP
by: Yu, Wei-Hsiang, et al.
Published: (2024)
by: Yu, Wei-Hsiang, et al.
Published: (2024)
Consistent text-to-image generation via scene de-contextualization
by: Tang, Song, et al.
Published: (2025)
by: Tang, Song, et al.
Published: (2025)
IMMA: Immunizing text-to-image Models against Malicious Adaptation
by: Zheng, Amber Yijia, et al.
Published: (2023)
by: Zheng, Amber Yijia, et al.
Published: (2023)
Architecture inside the mirage: evaluating generative image models on architectural style, elements, and typologies
by: Magrill, Jamie, et al.
Published: (2026)
by: Magrill, Jamie, et al.
Published: (2026)
VideoWeave: A Data-Centric Approach for Efficient Video Understanding
by: Durante, Zane, et al.
Published: (2026)
by: Durante, Zane, et al.
Published: (2026)
Visual question answering based evaluation metrics for text-to-image generation
by: Miyamoto, Mizuki, et al.
Published: (2024)
by: Miyamoto, Mizuki, et al.
Published: (2024)
Concept Corrector: Erase concepts on the fly for text-to-image diffusion models
by: Meng, Zheling, et al.
Published: (2025)
by: Meng, Zheling, et al.
Published: (2025)
IMAGE-ALCHEMY: Advancing subject fidelity in personalised text-to-image generation
by: Tiwari, Amritanshu, et al.
Published: (2025)
by: Tiwari, Amritanshu, et al.
Published: (2025)
UPLiFT: Efficient Pixel-Dense Feature Upsampling with Local Attenders
by: Walmer, Matthew, et al.
Published: (2026)
by: Walmer, Matthew, et al.
Published: (2026)
Multi-style conversion for semantic segmentation of lesions in fundus images by adversarial attacks
by: Playout, Clément, et al.
Published: (2024)
by: Playout, Clément, et al.
Published: (2024)
Can language-guided unsupervised adaptation improve medical image classification using unpaired images and texts?
by: Rahman, Umaima, et al.
Published: (2024)
by: Rahman, Umaima, et al.
Published: (2024)
A spatiotemporal style transfer algorithm for dynamic visual stimulus generation
by: Greco, Antonino, et al.
Published: (2024)
by: Greco, Antonino, et al.
Published: (2024)
Mitigating analytical variability in fMRI results with style transfer
by: Germani, Elodie, et al.
Published: (2024)
by: Germani, Elodie, et al.
Published: (2024)
Dark Miner: Defend against undesirable generation for text-to-image diffusion models
by: Meng, Zheling, et al.
Published: (2024)
by: Meng, Zheling, et al.
Published: (2024)
PointT2I: LLM-based text-to-image generation via keypoints
by: Lee, Taekyung, et al.
Published: (2025)
by: Lee, Taekyung, et al.
Published: (2025)
RaLF: Flow-based Global and Metric Radar Localization in LiDAR Maps
by: Nayak, Abhijeet, et al.
Published: (2023)
by: Nayak, Abhijeet, et al.
Published: (2023)
A large-scale image-text dataset benchmark for farmland segmentation
by: Tao, Chao, et al.
Published: (2025)
by: Tao, Chao, et al.
Published: (2025)
TurboEdit: Instant text-based image editing
by: Wu, Zongze, et al.
Published: (2024)
by: Wu, Zongze, et al.
Published: (2024)
Generalizing Monocular 3D Object Detection
by: Kumar, Abhinav
Published: (2025)
by: Kumar, Abhinav
Published: (2025)
Exploring connections of spectral analysis and transfer learning in medical imaging
by: Lu, Yucheng, et al.
Published: (2024)
by: Lu, Yucheng, et al.
Published: (2024)
Similar Items
-
ReEdit: Multimodal Exemplar-Based Image Editing with Diffusion Models
by: Srivastava, Ashutosh, et al.
Published: (2024) -
Towards Efficient Exemplar Based Image Editing with Multimodal VLMs
by: Jadhav, Avadhoot, et al.
Published: (2025) -
S2H-DPO: Hardness-Aware Preference Optimization for Vision-Language Models
by: Shukla, Nitish, et al.
Published: (2026) -
Thinking Fair and Slow: On the Efficacy of Structured Prompts for Debiasing Language Models
by: Furniturewala, Shaz, et al.
Published: (2024) -
Towards Operationalizing Right to Data Protection
by: Java, Abhinav, et al.
Published: (2024)