:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Khan, Faizan Farooq, Joseph, K J, Goswami, Koustava, Elhoseiny, Mohamed, Srinivasan, Balaji Vasan
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition Machine Learning
Online Access:	https://arxiv.org/abs/2512.03335
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Agentic Design Review System
by: Nag, Sayan, et al.
Published: (2025)

Neural Catalog: Scaling Species Recognition with Catalog of Life-Augmented Generation
by: Khan, Faizan Farooq, et al.
Published: (2025)

Category-level Text-to-Image Retrieval Improved: Bridging the Domain Gap with Diffusion Models and Vision Encoders
by: Khan, Faizan Farooq, et al.
Published: (2025)

SafaRi:Adaptive Sequence Transformer for Weakly Supervised Referring Expression Segmentation
by: Nag, Sayan, et al.
Published: (2024)

AI Art Neural Constellation: Revealing the Collective and Contrastive State of AI-Generated and Human Art
by: Khan, Faizan Farooq, et al.
Published: (2024)

PICS in Pics: Physics Informed Contour Selection for Rapid Image Segmentation
by: Dwivedi, Vikas, et al.
Published: (2023)

Domain-Aware Continual Zero-Shot Learning
by: Yi, Kai, et al.
Published: (2021)

Towards Design Compositing
by: Mahajan, Abhinav, et al.
Published: (2026)

How Well Can Vision Language Models See Image Details?
by: Gou, Chenhui, et al.
Published: (2024)

Test-time Conditional Text-to-Image Synthesis Using Diffusion Models
by: Shukla, Tripti, et al.
Published: (2024)

Training-free Color-Style Disentanglement for Constrained Text-to-Image Synthesis
by: Agarwal, Aishwarya, et al.
Published: (2024)

AlignIT: Enhancing Prompt Alignment in Customization of Text-to-Image Models
by: Agarwal, Aishwarya, et al.
Published: (2024)

Continual Learning on a Diet: Learning from Sparsely Labeled Streams Under Constrained Computation
by: Zhang, Wenxuan, et al.
Published: (2024)

FloAt: Flow Warping of Self-Attention for Clothing Animation Generation
by: Mishra, Swasti Shreya, et al.
Published: (2024)

Crossing Borders: A Multimodal Challenge for Indian Poetry Translation and Image Generation
by: Jamil, Sofia, et al.
Published: (2025)

Design-o-meter: Towards Evaluating and Refining Graphic Designs
by: Goyal, Sahil, et al.
Published: (2024)

Post-Hoc Answer Attribution for Grounded and Trustworthy Long Document Comprehension: Task, Insights, and Challenges
by: Sancheti, Abhilasha, et al.
Published: (2024)

Poetry in Pixels: Prompt Tuning for Poem Image Generation via Diffusion Models
by: Jamil, Sofia, et al.
Published: (2025)

FishNet++: Analyzing the capabilities of Multimodal Large Language Models in marine biology
by: Khan, Faizan Farooq, et al.
Published: (2025)

PoemTale Diffusion: Minimising Information Loss in Poem to Image Generation with Multi-Stage Prompt Refinement
by: Jamil, Sofia, et al.
Published: (2025)

Do It Yourself (DIY): Modifying Images for Poems in a Zero-Shot Setting Using Weighted Prompt Manipulation
by: Jamil, Sofia, et al.
Published: (2025)

Mean Flows for One-step Generative Modeling
by: Geng, Zhengyang, et al.
Published: (2025)

Analytic Convolutional Layer: A Step to Analytic Neural Network
by: Cui, Jingmao, et al.
Published: (2024)

ImPoster: Text and Frequency Guidance for Subject Driven Action Personalization using Diffusion Models
by: Kothandaraman, Divya, et al.
Published: (2024)

MeLFusion: Synthesizing Music from Image and Language Cues using Diffusion Models
by: Chowdhury, Sanjoy, et al.
Published: (2024)

On the Design of One-step Diffusion via Shortcutting Flow Paths
by: Lin, Haitao, et al.
Published: (2025)

General and Efficient Visual Goal-Conditioned Reinforcement Learning using Object-Agnostic Masks
by: Shahriar, Fahim, et al.
Published: (2025)

Ensemble everything everywhere: Multi-scale aggregation for adversarial robustness
by: Fort, Stanislav, et al.
Published: (2024)

Social Media Ready Caption Generation for Brands
by: Maheshwari, Himanshu, et al.
Published: (2024)

Aurelia: Test-time Reasoning Distillation in Audio-Visual LLMs
by: Chowdhury, Sanjoy, et al.
Published: (2025)

Flows and Diffusions on the Neural Manifold
by: Saragih, Daniel, et al.
Published: (2025)

Optimizing Few-Step Generation with Adaptive Matching Distillation
by: Bai, Lichen, et al.
Published: (2026)

Drifting Preference Optimization for One-Step Generative Models
by: Jiang, Zhou, et al.
Published: (2026)

Design and Behavior of Sparse Mixture-of-Experts Layers in CNN-based Semantic Segmentation
by: Pavlitska, Svetlana, et al.
Published: (2026)

Extracting Usable Predictions from Quantized Networks through Uncertainty Quantification for OOD Detection
by: Singhal, Rishi, et al.
Published: (2024)

One Step Learning, One Step Review
by: Huang, Xiaolong, et al.
Published: (2024)

StoryGPT-V: Large Language Models as Consistent Story Visualizers
by: Shen, Xiaoqian, et al.
Published: (2023)

Generate in Reconstruction Space, Match in Semantic Space: Transport Geometry for One-Step Generation
by: Van Assel, Hugues, et al.
Published: (2026)

Filter Images First, Generate Instructions Later: Pre-Instruction Data Selection for Visual Instruction Tuning
by: Safaei, Bardia, et al.
Published: (2025)

SOLAR: Switchable Output Layer for Accuracy and Robustness in Once-for-All Training
by: Tareen, Shaharyar Ahmed Khan, et al.
Published: (2025)