| Main Authors: | Xiong, Zhinan; Yuan, Shunqi |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.10785 |
Similar Items
Geometry-Aware Image Flow Matching
by: Lee, Junho, et al.
Published: (2026)
CritiFusion: Semantic Critique and Spectral Alignment for Faithful Text-to-Image Generation
by: Chen, ZhenQi, et al.
Published: (2025)
Semantic Granularity Navigation in Image Editing
by: Lu, Liangsi, et al.
Published: (2026)
Aligning Latent Geometry for Spherical Flow Matching in Image Generation
by: Meral, Tuna Han Salih, et al.
Published: (2026)
Instruction-augmented Multimodal Alignment for Image-Text and Element Matching
by: Yue, Xinli, et al.
Published: (2025)
SenseFlow: Scaling Distribution Matching for Flow-based Text-to-Image Distillation
by: Ge, Xingtong, et al.
Published: (2025)
Modeling Multi-Granularity Context Information Flow for Pavement Crack Detection
by: Pang, Junbiao, et al.
Published: (2024)
Visual Semantic Description Generation with MLLMs for Image-Text Matching
by: Chen, Junyu, et al.
Published: (2025)
Appearance Matching Adapter for Exemplar-based Semantic Image Synthesis in-the-Wild
by: Jin, Siyoon, et al.
Published: (2024)
$β$-CLIP: Text-Conditioned Contrastive Learning for Multi-Granular Vision-Language Alignment
by: Zohra, Fatimah, et al.
Published: (2025)
MotionFlux: Efficient Text-Guided Motion Generation through Rectified Flow Matching and Preference Alignment
by: Gao, Zhiting, et al.
Published: (2025)
Flow Matching for Conditional MRI-CT and CBCT-CT Image Synthesis
by: Hadzic, Arnela, et al.
Published: (2025)
Noise Diffusion for Enhancing Semantic Faithfulness in Text-to-Image Synthesis
by: Miao, Boming, et al.
Published: (2024)
SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow
by: Wang, Chaoyang, et al.
Published: (2024)
Alignment-Guided Score Matching for Text-to-Image Alignment in Diffusion Models
by: Lee, Jaa-Yeon, et al.
Published: (2026)
Dual-Granularity Cross-Modal Identity Association for Weakly-Supervised Text-to-Person Image Matching
by: Zhang, Yafei, et al.
Published: (2025)
DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization
by: Nam, Jisu, et al.
Published: (2024)
Cross-View Meets Diffusion: Aerial Image Synthesis with Geometry and Text Guidance
by: Arrabi, Ahmad, et al.
Published: (2024)
Subject-Aware Multi-Granularity Alignment for Zero-Shot EEG-to-Image Retrieval
by: Jiang, Lin, et al.
Published: (2026)
Medical Image Synthesis via Fine-Grained Image-Text Alignment and Anatomy-Pathology Prompting
by: Chen, Wenting, et al.
Published: (2024)
FusionFM: All-in-One Multi-Modal Image Fusion with Flow Matching
by: Zhu, Huayi, et al.
Published: (2025)
Semantic Alignment of Unimodal Medical Text and Vision Representations
by: Di Folco, Maxime, et al.
Published: (2025)
InstructEngine: Instruction-driven Text-to-Image Alignment
by: Lu, Xingyu, et al.
Published: (2025)
Enhancing Semantic Fidelity in Text-to-Image Synthesis: Attention Regulation in Diffusion Models
by: Zhang, Yang, et al.
Published: (2024)
Semantic Image Synthesis via Diffusion Models
by: Zhou, Wengang, et al.
Published: (2022)
Value Gradient Guidance for Flow Matching Alignment
by: Liu, Zhen, et al.
Published: (2025)
Image-to-Image Matching via Foundation Models: A New Perspective for Open-Vocabulary Semantic Segmentation
by: Wang, Yuan, et al.
Published: (2024)
RFMI: Estimating Mutual Information on Rectified Flow for Text-to-Image Alignment
by: Wang, Chao, et al.
Published: (2025)
Multi-Head Attention Driven Dynamic Visual-Semantic Embedding for Enhanced Image-Text Matching
by: Chen, Wenjing
Published: (2024)
FullFlow: Upgrading Text-to-Image Flow Matching Models for Bidirectional Vision-Language Generation
by: Bill, Eric Tillmann, et al.
Published: (2026)
OS-HGAdapter: Open Semantic Hypergraph Adapter for Large Language Models Assisted Entropy-Enhanced Image-Text Alignment
by: Chen, Rongjun, et al.
Published: (2025)
TCSA-UDA: Text-Driven Cross-Semantic Alignment for Unsupervised Domain Adaptation in Medical Image Segmentation
by: Maurya, Lalit, et al.
Published: (2025)
Reconciling Semantic Controllability and Diversity for Remote Sensing Image Synthesis with Hybrid Semantic Embedding
by: Liu, Junde, et al.
Published: (2024)
Novel Object Synthesis via Adaptive Text-Image Harmony
by: Xiong, Zeren, et al.
Published: (2024)
Ensemble Quadratic Assignment Network for Graph Matching
by: Tan, Haoru, et al.
Published: (2024)
FlowInOne:Unifying Multimodal Generation as Image-in, Image-out Flow Matching
by: Yi, Junchao, et al.
Published: (2026)
Through the Magnifying Glass: Adaptive Perception Magnification for Hallucination-Free VLM Decoding
by: Mao, Shunqi, et al.
Published: (2025)
Text-guided Image Restoration and Semantic Enhancement for Text-to-Image Person Retrieval
by: Liu, Delong, et al.
Published: (2023)
NIFTY: a Non-Local Image Flow Matching for Texture Synthesis
by: Chatillon, Pierrick, et al.
Published: (2025)
VisualPrompter: Semantic-Aware Prompt Optimization with Visual Feedback for Text-to-Image Synthesis
by: Wu, Shiyu, et al.
Published: (2025)