Saved in:
| Main Authors: | Khan, Faizan Farooq, Joseph, K J, Goswami, Koustava, Elhoseiny, Mohamed, Srinivasan, Balaji Vasan |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2512.03335 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Agentic Design Review System
by: Nag, Sayan, et al.
Published: (2025)
by: Nag, Sayan, et al.
Published: (2025)
Neural Catalog: Scaling Species Recognition with Catalog of Life-Augmented Generation
by: Khan, Faizan Farooq, et al.
Published: (2025)
by: Khan, Faizan Farooq, et al.
Published: (2025)
Category-level Text-to-Image Retrieval Improved: Bridging the Domain Gap with Diffusion Models and Vision Encoders
by: Khan, Faizan Farooq, et al.
Published: (2025)
by: Khan, Faizan Farooq, et al.
Published: (2025)
SafaRi:Adaptive Sequence Transformer for Weakly Supervised Referring Expression Segmentation
by: Nag, Sayan, et al.
Published: (2024)
by: Nag, Sayan, et al.
Published: (2024)
AI Art Neural Constellation: Revealing the Collective and Contrastive State of AI-Generated and Human Art
by: Khan, Faizan Farooq, et al.
Published: (2024)
by: Khan, Faizan Farooq, et al.
Published: (2024)
PICS in Pics: Physics Informed Contour Selection for Rapid Image Segmentation
by: Dwivedi, Vikas, et al.
Published: (2023)
by: Dwivedi, Vikas, et al.
Published: (2023)
Domain-Aware Continual Zero-Shot Learning
by: Yi, Kai, et al.
Published: (2021)
by: Yi, Kai, et al.
Published: (2021)
Towards Design Compositing
by: Mahajan, Abhinav, et al.
Published: (2026)
by: Mahajan, Abhinav, et al.
Published: (2026)
How Well Can Vision Language Models See Image Details?
by: Gou, Chenhui, et al.
Published: (2024)
by: Gou, Chenhui, et al.
Published: (2024)
Test-time Conditional Text-to-Image Synthesis Using Diffusion Models
by: Shukla, Tripti, et al.
Published: (2024)
by: Shukla, Tripti, et al.
Published: (2024)
Training-free Color-Style Disentanglement for Constrained Text-to-Image Synthesis
by: Agarwal, Aishwarya, et al.
Published: (2024)
by: Agarwal, Aishwarya, et al.
Published: (2024)
AlignIT: Enhancing Prompt Alignment in Customization of Text-to-Image Models
by: Agarwal, Aishwarya, et al.
Published: (2024)
by: Agarwal, Aishwarya, et al.
Published: (2024)
Continual Learning on a Diet: Learning from Sparsely Labeled Streams Under Constrained Computation
by: Zhang, Wenxuan, et al.
Published: (2024)
by: Zhang, Wenxuan, et al.
Published: (2024)
FloAt: Flow Warping of Self-Attention for Clothing Animation Generation
by: Mishra, Swasti Shreya, et al.
Published: (2024)
by: Mishra, Swasti Shreya, et al.
Published: (2024)
Crossing Borders: A Multimodal Challenge for Indian Poetry Translation and Image Generation
by: Jamil, Sofia, et al.
Published: (2025)
by: Jamil, Sofia, et al.
Published: (2025)
Design-o-meter: Towards Evaluating and Refining Graphic Designs
by: Goyal, Sahil, et al.
Published: (2024)
by: Goyal, Sahil, et al.
Published: (2024)
Post-Hoc Answer Attribution for Grounded and Trustworthy Long Document Comprehension: Task, Insights, and Challenges
by: Sancheti, Abhilasha, et al.
Published: (2024)
by: Sancheti, Abhilasha, et al.
Published: (2024)
Poetry in Pixels: Prompt Tuning for Poem Image Generation via Diffusion Models
by: Jamil, Sofia, et al.
Published: (2025)
by: Jamil, Sofia, et al.
Published: (2025)
FishNet++: Analyzing the capabilities of Multimodal Large Language Models in marine biology
by: Khan, Faizan Farooq, et al.
Published: (2025)
by: Khan, Faizan Farooq, et al.
Published: (2025)
PoemTale Diffusion: Minimising Information Loss in Poem to Image Generation with Multi-Stage Prompt Refinement
by: Jamil, Sofia, et al.
Published: (2025)
by: Jamil, Sofia, et al.
Published: (2025)
Do It Yourself (DIY): Modifying Images for Poems in a Zero-Shot Setting Using Weighted Prompt Manipulation
by: Jamil, Sofia, et al.
Published: (2025)
by: Jamil, Sofia, et al.
Published: (2025)
Mean Flows for One-step Generative Modeling
by: Geng, Zhengyang, et al.
Published: (2025)
by: Geng, Zhengyang, et al.
Published: (2025)
Analytic Convolutional Layer: A Step to Analytic Neural Network
by: Cui, Jingmao, et al.
Published: (2024)
by: Cui, Jingmao, et al.
Published: (2024)
ImPoster: Text and Frequency Guidance for Subject Driven Action Personalization using Diffusion Models
by: Kothandaraman, Divya, et al.
Published: (2024)
by: Kothandaraman, Divya, et al.
Published: (2024)
MeLFusion: Synthesizing Music from Image and Language Cues using Diffusion Models
by: Chowdhury, Sanjoy, et al.
Published: (2024)
by: Chowdhury, Sanjoy, et al.
Published: (2024)
On the Design of One-step Diffusion via Shortcutting Flow Paths
by: Lin, Haitao, et al.
Published: (2025)
by: Lin, Haitao, et al.
Published: (2025)
General and Efficient Visual Goal-Conditioned Reinforcement Learning using Object-Agnostic Masks
by: Shahriar, Fahim, et al.
Published: (2025)
by: Shahriar, Fahim, et al.
Published: (2025)
Ensemble everything everywhere: Multi-scale aggregation for adversarial robustness
by: Fort, Stanislav, et al.
Published: (2024)
by: Fort, Stanislav, et al.
Published: (2024)
Social Media Ready Caption Generation for Brands
by: Maheshwari, Himanshu, et al.
Published: (2024)
by: Maheshwari, Himanshu, et al.
Published: (2024)
Aurelia: Test-time Reasoning Distillation in Audio-Visual LLMs
by: Chowdhury, Sanjoy, et al.
Published: (2025)
by: Chowdhury, Sanjoy, et al.
Published: (2025)
Flows and Diffusions on the Neural Manifold
by: Saragih, Daniel, et al.
Published: (2025)
by: Saragih, Daniel, et al.
Published: (2025)
Optimizing Few-Step Generation with Adaptive Matching Distillation
by: Bai, Lichen, et al.
Published: (2026)
by: Bai, Lichen, et al.
Published: (2026)
Drifting Preference Optimization for One-Step Generative Models
by: Jiang, Zhou, et al.
Published: (2026)
by: Jiang, Zhou, et al.
Published: (2026)
Design and Behavior of Sparse Mixture-of-Experts Layers in CNN-based Semantic Segmentation
by: Pavlitska, Svetlana, et al.
Published: (2026)
by: Pavlitska, Svetlana, et al.
Published: (2026)
Extracting Usable Predictions from Quantized Networks through Uncertainty Quantification for OOD Detection
by: Singhal, Rishi, et al.
Published: (2024)
by: Singhal, Rishi, et al.
Published: (2024)
One Step Learning, One Step Review
by: Huang, Xiaolong, et al.
Published: (2024)
by: Huang, Xiaolong, et al.
Published: (2024)
StoryGPT-V: Large Language Models as Consistent Story Visualizers
by: Shen, Xiaoqian, et al.
Published: (2023)
by: Shen, Xiaoqian, et al.
Published: (2023)
Generate in Reconstruction Space, Match in Semantic Space: Transport Geometry for One-Step Generation
by: Van Assel, Hugues, et al.
Published: (2026)
by: Van Assel, Hugues, et al.
Published: (2026)
Filter Images First, Generate Instructions Later: Pre-Instruction Data Selection for Visual Instruction Tuning
by: Safaei, Bardia, et al.
Published: (2025)
by: Safaei, Bardia, et al.
Published: (2025)
SOLAR: Switchable Output Layer for Accuracy and Robustness in Once-for-All Training
by: Tareen, Shaharyar Ahmed Khan, et al.
Published: (2025)
by: Tareen, Shaharyar Ahmed Khan, et al.
Published: (2025)
Similar Items
-
Agentic Design Review System
by: Nag, Sayan, et al.
Published: (2025) -
Neural Catalog: Scaling Species Recognition with Catalog of Life-Augmented Generation
by: Khan, Faizan Farooq, et al.
Published: (2025) -
Category-level Text-to-Image Retrieval Improved: Bridging the Domain Gap with Diffusion Models and Vision Encoders
by: Khan, Faizan Farooq, et al.
Published: (2025) -
SafaRi:Adaptive Sequence Transformer for Weakly Supervised Referring Expression Segmentation
by: Nag, Sayan, et al.
Published: (2024) -
AI Art Neural Constellation: Revealing the Collective and Contrastive State of AI-Generated and Human Art
by: Khan, Faizan Farooq, et al.
Published: (2024)