Saved in:
| Main Author: | Asokan, Raghul |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2508.17037 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
FineLIP: Extending CLIP's Reach via Fine-Grained Alignment with Longer Text Inputs
by: Asokan, Mothilal, et al.
Published: (2025)
by: Asokan, Mothilal, et al.
Published: (2025)
Fine-grained Text to Image Synthesis
by: Ouyang, Xu, et al.
Published: (2024)
by: Ouyang, Xu, et al.
Published: (2024)
Robust Fusion Controller: Degradation-aware Image Fusion with Fine-grained Language Instructions
by: Zhang, Hao, et al.
Published: (2025)
by: Zhang, Hao, et al.
Published: (2025)
Espresso: Robust Concept Filtering in Text-to-Image Models
by: Das, Anudeep, et al.
Published: (2024)
by: Das, Anudeep, et al.
Published: (2024)
BFA: Best-Feature-Aware Fusion for Multi-View Fine-grained Manipulation
by: Lan, Zihan, et al.
Published: (2025)
by: Lan, Zihan, et al.
Published: (2025)
Multi-modal Reference Learning for Fine-grained Text-to-Image Retrieval
by: Ma, Zehong, et al.
Published: (2025)
by: Ma, Zehong, et al.
Published: (2025)
FITA: Fine-grained Image-Text Aligner for Radiology Report Generation
by: Yang, Honglong, et al.
Published: (2024)
by: Yang, Honglong, et al.
Published: (2024)
Cross-modal Full-mode Fine-grained Alignment for Text-to-Image Person Retrieval
by: Yin, Hao, et al.
Published: (2025)
by: Yin, Hao, et al.
Published: (2025)
FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models
by: Wu, Tong, et al.
Published: (2024)
by: Wu, Tong, et al.
Published: (2024)
FINEMATCH: Aspect-based Fine-grained Image and Text Mismatch Detection and Correction
by: Hua, Hang, et al.
Published: (2024)
by: Hua, Hang, et al.
Published: (2024)
MVAM: Multi-View Attention Method for Fine-grained Image-Text Matching
by: Cui, Wanqing, et al.
Published: (2024)
by: Cui, Wanqing, et al.
Published: (2024)
Feature-Enhanced TResNet for Fine-Grained Food Image Classification
by: Liu, Lulu, et al.
Published: (2025)
by: Liu, Lulu, et al.
Published: (2025)
FineFACE: Fair Facial Attribute Classification Leveraging Fine-grained Features
by: Manzoor, Ayesha, et al.
Published: (2024)
by: Manzoor, Ayesha, et al.
Published: (2024)
3rd Place Solution to Large-scale Fine-grained Food Recognition
by: Zhong, Yang, et al.
Published: (2025)
by: Zhong, Yang, et al.
Published: (2025)
Fine-Tuning Text-To-Image Diffusion Models for Class-Wise Spurious Feature Generation
by: MaungMaung, AprilPyone, et al.
Published: (2024)
by: MaungMaung, AprilPyone, et al.
Published: (2024)
Identity-Preserving Text-to-Image Generation via Dual-Level Feature Decoupling and Expert-Guided Fusion
by: Chen, Kewen, et al.
Published: (2025)
by: Chen, Kewen, et al.
Published: (2025)
MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis
by: He, Wanggui, et al.
Published: (2024)
by: He, Wanggui, et al.
Published: (2024)
AutoLoRA: Automatic LoRA Retrieval and Fine-Grained Gated Fusion for Text-to-Image Generation
by: Li, Zhiwen, et al.
Published: (2025)
by: Li, Zhiwen, et al.
Published: (2025)
Leveraging Vision-Language Models for Improving Domain Generalization in Image Classification
by: Addepalli, Sravanti, et al.
Published: (2023)
by: Addepalli, Sravanti, et al.
Published: (2023)
Fine-grained Defocus Blur Control for Generative Image Models
by: Shrivastava, Ayush, et al.
Published: (2025)
by: Shrivastava, Ayush, et al.
Published: (2025)
FusionMamba: Dynamic Feature Enhancement for Multimodal Image Fusion with Mamba
by: Xie, Xinyu, et al.
Published: (2024)
by: Xie, Xinyu, et al.
Published: (2024)
ETVA: Evaluation of Text-to-Video Alignment via Fine-grained Question Generation and Answering
by: Guan, Kaisi, et al.
Published: (2025)
by: Guan, Kaisi, et al.
Published: (2025)
KETA: Kinematic-Phrases-Enhanced Text-to-Motion Generation via Fine-grained Alignment
by: Jiang, Yu, et al.
Published: (2025)
by: Jiang, Yu, et al.
Published: (2025)
Hybrid-Tower: Fine-grained Pseudo-query Interaction and Generation for Text-to-Video Retrieval
by: Lan, Bangxiang, et al.
Published: (2025)
by: Lan, Bangxiang, et al.
Published: (2025)
T2I-VeRW: Part-level Fine-grained Perception for Text-to-Image Vehicle Retrieval
by: Wang, Xiao, et al.
Published: (2026)
by: Wang, Xiao, et al.
Published: (2026)
Vision Mamba Distillation for Low-resolution Fine-grained Image Classification
by: Chen, Yao, et al.
Published: (2024)
by: Chen, Yao, et al.
Published: (2024)
Enhancing Fine-grained Image Classification through Attentive Batch Training
by: Le, Duy M., et al.
Published: (2024)
by: Le, Duy M., et al.
Published: (2024)
Learning to Align Generative Appearance Priors for Fine-grained Image Retrieval
by: Wang, Shijie, et al.
Published: (2026)
by: Wang, Shijie, et al.
Published: (2026)
Language-guided Hierarchical Fine-grained Image Forgery Detection and Localization
by: Guo, Xiao, et al.
Published: (2024)
by: Guo, Xiao, et al.
Published: (2024)
F-HOI: Toward Fine-grained Semantic-Aligned 3D Human-Object Interactions
by: Yang, Jie, et al.
Published: (2024)
by: Yang, Jie, et al.
Published: (2024)
Advancing Food Nutrition Estimation via Visual-Ingredient Feature Fusion
by: Qi, Huiyan, et al.
Published: (2025)
by: Qi, Huiyan, et al.
Published: (2025)
FineDiffusion: Scaling up Diffusion Models for Fine-grained Image Generation with 10,000 Classes
by: Pan, Ziying, et al.
Published: (2024)
by: Pan, Ziying, et al.
Published: (2024)
UB-FineNet: Urban Building Fine-grained Classification Network for Open-access Satellite Images
by: He, Zhiyi, et al.
Published: (2024)
by: He, Zhiyi, et al.
Published: (2024)
Task Adaptive Feature Distribution Based Network for Few-shot Fine-grained Target Classification
by: Li, Ping, et al.
Published: (2024)
by: Li, Ping, et al.
Published: (2024)
Text-IF: Leveraging Semantic Text Guidance for Degradation-Aware and Interactive Image Fusion
by: Yi, Xunpeng, et al.
Published: (2024)
by: Yi, Xunpeng, et al.
Published: (2024)
ChefFusion: Multimodal Foundation Model Integrating Recipe and Food Image Generation
by: Li, Peiyu, et al.
Published: (2024)
by: Li, Peiyu, et al.
Published: (2024)
Fine-grained Image Retrieval via Dual-Vision Adaptation
by: Jiang, Xin, et al.
Published: (2025)
by: Jiang, Xin, et al.
Published: (2025)
CTRL-F: Pairing Convolution with Transformer for Image Classification via Multi-Level Feature Cross-Attention and Representation Learning Fusion
by: EL-Assiouti, Hosam S., et al.
Published: (2024)
by: EL-Assiouti, Hosam S., et al.
Published: (2024)
SRFNet: Monocular Depth Estimation with Fine-grained Structure via Spatial Reliability-oriented Fusion of Frames and Events
by: Pan, Tianbo, et al.
Published: (2023)
by: Pan, Tianbo, et al.
Published: (2023)
Fine Tuning Text-to-Image Diffusion Models for Correcting Anomalous Images
by: Yoo, Hyunwoo
Published: (2024)
by: Yoo, Hyunwoo
Published: (2024)
Similar Items
-
FineLIP: Extending CLIP's Reach via Fine-Grained Alignment with Longer Text Inputs
by: Asokan, Mothilal, et al.
Published: (2025) -
Fine-grained Text to Image Synthesis
by: Ouyang, Xu, et al.
Published: (2024) -
Robust Fusion Controller: Degradation-aware Image Fusion with Fine-grained Language Instructions
by: Zhang, Hao, et al.
Published: (2025) -
Espresso: Robust Concept Filtering in Text-to-Image Models
by: Das, Anudeep, et al.
Published: (2024) -
BFA: Best-Feature-Aware Fusion for Multi-View Fine-grained Manipulation
by: Lan, Zihan, et al.
Published: (2025)