:: Library Catalog

Bibliographic Details
Main Authors:	Singh, Silky; Jandial, Surgan; Shahid, Simra; Java, Abhinav
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2405.16330

Similar Items

ReEdit: Multimodal Exemplar-Based Image Editing with Diffusion Models
by: Srivastava, Ashutosh, et al.
Published: (2024)

Towards Efficient Exemplar Based Image Editing with Multimodal VLMs
by: Jadhav, Avadhoot, et al.
Published: (2025)

S2H-DPO: Hardness-Aware Preference Optimization for Vision-Language Models
by: Shukla, Nitish, et al.
Published: (2026)

Thinking Fair and Slow: On the Efficacy of Structured Prompts for Debiasing Language Models
by: Furniturewala, Shaz, et al.
Published: (2024)

Towards Operationalizing Right to Data Protection
by: Java, Abhinav, et al.
Published: (2024)

Understanding Task Transfer in Vision-Language Models
by: Sachdeva, Bhuvan, et al.
Published: (2025)

Artistic-style text detector and a new Movie-Poster dataset
by: Ning, Aoxiang, et al.
Published: (2024)

Procedural terrain generation with style transfer
by: Merizzi, Fabio
Published: (2024)

Multiscale style transfer based on a Laplacian pyramid for traditional Chinese painting
by: Liu, Kunxiao, et al.
Published: (2025)

On dataset transferability in medical image classification
by: Juodelyte, Dovile, et al.
Published: (2024)

Stylized Meta-Album: Group-bias injection with style transfer to study robustness against distribution shifts
by: Mussard, Romain, et al.
Published: (2025)

Does resistance to style-transfer equal Global Shape Bias? Measuring network sensitivity to global shape configuration
by: Wen, Ziqi, et al.
Published: (2023)

Exploring text-to-image generation for historical document image retrieval
by: Cote, Melissa, et al.
Published: (2025)

The persistence of painting styles
by: Munnangi, Reetikaa Reddy, et al.
Published: (2025)

Do text-free diffusion models learn discriminative visual representations?
by: Mukhopadhyay, Soumik, et al.
Published: (2023)

Learning text-to-video retrieval from image captioning
by: Ventura, Lucas, et al.
Published: (2024)

Hyper-parameter tuning for text guided image editing
by: Zhang, Shiwen
Published: (2024)

CMRNext: Camera to LiDAR Matching in the Wild for Localization and Extrinsic Calibration
by: Cattaneo, Daniele, et al.
Published: (2024)

Exploring scalable medical image encoders beyond text supervision
by: Pérez-García, Fernando, et al.
Published: (2024)

Efficient scene text image super-resolution with semantic guidance
by: TomyEnrique, LeoWu, et al.
Published: (2024)

Ranking-aware adapter for text-driven image ordering with CLIP
by: Yu, Wei-Hsiang, et al.
Published: (2024)

Consistent text-to-image generation via scene de-contextualization
by: Tang, Song, et al.
Published: (2025)

IMMA: Immunizing text-to-image Models against Malicious Adaptation
by: Zheng, Amber Yijia, et al.
Published: (2023)

Architecture inside the mirage: evaluating generative image models on architectural style, elements, and typologies
by: Magrill, Jamie, et al.
Published: (2026)

VideoWeave: A Data-Centric Approach for Efficient Video Understanding
by: Durante, Zane, et al.
Published: (2026)

Visual question answering based evaluation metrics for text-to-image generation
by: Miyamoto, Mizuki, et al.
Published: (2024)

Concept Corrector: Erase concepts on the fly for text-to-image diffusion models
by: Meng, Zheling, et al.
Published: (2025)

IMAGE-ALCHEMY: Advancing subject fidelity in personalised text-to-image generation
by: Tiwari, Amritanshu, et al.
Published: (2025)

UPLiFT: Efficient Pixel-Dense Feature Upsampling with Local Attenders
by: Walmer, Matthew, et al.
Published: (2026)

Multi-style conversion for semantic segmentation of lesions in fundus images by adversarial attacks
by: Playout, Clément, et al.
Published: (2024)

Can language-guided unsupervised adaptation improve medical image classification using unpaired images and texts?
by: Rahman, Umaima, et al.
Published: (2024)

A spatiotemporal style transfer algorithm for dynamic visual stimulus generation
by: Greco, Antonino, et al.
Published: (2024)

Mitigating analytical variability in fMRI results with style transfer
by: Germani, Elodie, et al.
Published: (2024)

Dark Miner: Defend against undesirable generation for text-to-image diffusion models
by: Meng, Zheling, et al.
Published: (2024)

PointT2I: LLM-based text-to-image generation via keypoints
by: Lee, Taekyung, et al.
Published: (2025)

RaLF: Flow-based Global and Metric Radar Localization in LiDAR Maps
by: Nayak, Abhijeet, et al.
Published: (2023)

A large-scale image-text dataset benchmark for farmland segmentation
by: Tao, Chao, et al.
Published: (2025)

TurboEdit: Instant text-based image editing
by: Wu, Zongze, et al.
Published: (2024)

Generalizing Monocular 3D Object Detection
by: Kumar, Abhinav
Published: (2025)

Exploring connections of spectral analysis and transfer learning in medical imaging
by: Lu, Yucheng, et al.
Published: (2024)