:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Xiong, Zhinan, Yuan, Shunqi
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2603.10785
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Geometry-Aware Image Flow Matching
by: Lee, Junho, et al.
Published: (2026)

CritiFusion: Semantic Critique and Spectral Alignment for Faithful Text-to-Image Generation
by: Chen, ZhenQi, et al.
Published: (2025)

Semantic Granularity Navigation in Image Editing
by: Lu, Liangsi, et al.
Published: (2026)

Aligning Latent Geometry for Spherical Flow Matching in Image Generation
by: Meral, Tuna Han Salih, et al.
Published: (2026)

Instruction-augmented Multimodal Alignment for Image-Text and Element Matching
by: Yue, Xinli, et al.
Published: (2025)

SenseFlow: Scaling Distribution Matching for Flow-based Text-to-Image Distillation
by: Ge, Xingtong, et al.
Published: (2025)

Modeling Multi-Granularity Context Information Flow for Pavement Crack Detection
by: Pang, Junbiao, et al.
Published: (2024)

Visual Semantic Description Generation with MLLMs for Image-Text Matching
by: Chen, Junyu, et al.
Published: (2025)

Appearance Matching Adapter for Exemplar-based Semantic Image Synthesis in-the-Wild
by: Jin, Siyoon, et al.
Published: (2024)

$β$-CLIP: Text-Conditioned Contrastive Learning for Multi-Granular Vision-Language Alignment
by: Zohra, Fatimah, et al.
Published: (2025)

MotionFlux: Efficient Text-Guided Motion Generation through Rectified Flow Matching and Preference Alignment
by: Gao, Zhiting, et al.
Published: (2025)

Flow Matching for Conditional MRI-CT and CBCT-CT Image Synthesis
by: Hadzic, Arnela, et al.
Published: (2025)

Noise Diffusion for Enhancing Semantic Faithfulness in Text-to-Image Synthesis
by: Miao, Boming, et al.
Published: (2024)

SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow
by: Wang, Chaoyang, et al.
Published: (2024)

Alignment-Guided Score Matching for Text-to-Image Alignment in Diffusion Models
by: Lee, Jaa-Yeon, et al.
Published: (2026)

Dual-Granularity Cross-Modal Identity Association for Weakly-Supervised Text-to-Person Image Matching
by: Zhang, Yafei, et al.
Published: (2025)

DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization
by: Nam, Jisu, et al.
Published: (2024)

Cross-View Meets Diffusion: Aerial Image Synthesis with Geometry and Text Guidance
by: Arrabi, Ahmad, et al.
Published: (2024)

Subject-Aware Multi-Granularity Alignment for Zero-Shot EEG-to-Image Retrieval
by: Jiang, Lin, et al.
Published: (2026)

Medical Image Synthesis via Fine-Grained Image-Text Alignment and Anatomy-Pathology Prompting
by: Chen, Wenting, et al.
Published: (2024)

FusionFM: All-in-One Multi-Modal Image Fusion with Flow Matching
by: Zhu, Huayi, et al.
Published: (2025)

Semantic Alignment of Unimodal Medical Text and Vision Representations
by: Di Folco, Maxime, et al.
Published: (2025)

InstructEngine: Instruction-driven Text-to-Image Alignment
by: Lu, Xingyu, et al.
Published: (2025)

Enhancing Semantic Fidelity in Text-to-Image Synthesis: Attention Regulation in Diffusion Models
by: Zhang, Yang, et al.
Published: (2024)

Semantic Image Synthesis via Diffusion Models
by: Zhou, Wengang, et al.
Published: (2022)

Value Gradient Guidance for Flow Matching Alignment
by: Liu, Zhen, et al.
Published: (2025)

Image-to-Image Matching via Foundation Models: A New Perspective for Open-Vocabulary Semantic Segmentation
by: Wang, Yuan, et al.
Published: (2024)

RFMI: Estimating Mutual Information on Rectified Flow for Text-to-Image Alignment
by: Wang, Chao, et al.
Published: (2025)

Multi-Head Attention Driven Dynamic Visual-Semantic Embedding for Enhanced Image-Text Matching
by: Chen, Wenjing
Published: (2024)

FullFlow: Upgrading Text-to-Image Flow Matching Models for Bidirectional Vision--Language Generation
by: Bill, Eric Tillmann, et al.
Published: (2026)

OS-HGAdapter: Open Semantic Hypergraph Adapter for Large Language Models Assisted Entropy-Enhanced Image-Text Alignment
by: Chen, Rongjun, et al.
Published: (2025)

TCSA-UDA: Text-Driven Cross-Semantic Alignment for Unsupervised Domain Adaptation in Medical Image Segmentation
by: Maurya, Lalit, et al.
Published: (2025)

Reconciling Semantic Controllability and Diversity for Remote Sensing Image Synthesis with Hybrid Semantic Embedding
by: Liu, Junde, et al.
Published: (2024)

Novel Object Synthesis via Adaptive Text-Image Harmony
by: Xiong, Zeren, et al.
Published: (2024)

Ensemble Quadratic Assignment Network for Graph Matching
by: Tan, Haoru, et al.
Published: (2024)

FlowInOne:Unifying Multimodal Generation as Image-in, Image-out Flow Matching
by: Yi, Junchao, et al.
Published: (2026)

Through the Magnifying Glass: Adaptive Perception Magnification for Hallucination-Free VLM Decoding
by: Mao, Shunqi, et al.
Published: (2025)

Text-guided Image Restoration and Semantic Enhancement for Text-to-Image Person Retrieval
by: Liu, Delong, et al.
Published: (2023)

NIFTY: a Non-Local Image Flow Matching for Texture Synthesis
by: Chatillon, Pierrick, et al.
Published: (2025)

VisualPrompter: Semantic-Aware Prompt Optimization with Visual Feedback for Text-to-Image Synthesis
by: Wu, Shiyu, et al.
Published: (2025)