Saved in:
| Main Authors: | Wang, Shuting, Tang, Haihong, Dou, Zhicheng, Xiong, Chenyan |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.06812 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Nabla-R2D3: Effective and Efficient 3D Diffusion Alignment with 2D Rewards
by: Liu, Qingming, et al.
Published: (2025)
by: Liu, Qingming, et al.
Published: (2025)
GraphShaper: Geometry-aware Alignment for Improving Transfer Learning in Text-Attributed Graphs
by: Zhang, Heng, et al.
Published: (2025)
by: Zhang, Heng, et al.
Published: (2025)
CAD-Coder: Text-to-CAD Generation with Chain-of-Thought and Geometric Reward
by: Guan, Yandong, et al.
Published: (2025)
by: Guan, Yandong, et al.
Published: (2025)
LumiGen: An LVLM-Enhanced Iterative Framework for Fine-Grained Text-to-Image Generation
by: Dong, Xiaoqi, et al.
Published: (2025)
by: Dong, Xiaoqi, et al.
Published: (2025)
FlexControl: Computation-Aware ControlNet with Differentiable Router for Text-to-Image Generation
by: Fang, Zheng, et al.
Published: (2025)
by: Fang, Zheng, et al.
Published: (2025)
FlowMotion: Target-Predictive Conditional Flow Matching for Jitter-Reduced Text-Driven Human Motion Generation
by: Cuba, Manolo Canales, et al.
Published: (2025)
by: Cuba, Manolo Canales, et al.
Published: (2025)
LumiX: Structured and Coherent Text-to-Intrinsic Generation
by: Han, Xu, et al.
Published: (2025)
by: Han, Xu, et al.
Published: (2025)
MESA: Text-Driven Terrain Generation Using Latent Diffusion and Global Copernicus Data
by: Borne--Pons, Paul, et al.
Published: (2025)
by: Borne--Pons, Paul, et al.
Published: (2025)
Adaptive Hybrid Caching for Efficient Text-to-Video Diffusion Model Acceleration
by: Wei, Yuanxin, et al.
Published: (2025)
by: Wei, Yuanxin, et al.
Published: (2025)
GCTAM: Global and Contextual Truncated Affinity Combined Maximization Model For Unsupervised Graph Anomaly Detection
by: Zhang, Xiong, et al.
Published: (2026)
by: Zhang, Xiong, et al.
Published: (2026)
Efficient Diffusion Models: A Survey
by: Shen, Hui, et al.
Published: (2025)
by: Shen, Hui, et al.
Published: (2025)
MidSurfNet: Learnable Face Pairing and Interference Implicit Fields for Generalized Mid-surface Abstraction
by: Ye, Li, et al.
Published: (2026)
by: Ye, Li, et al.
Published: (2026)
Multimodal Benchmarking and Recommendation of Text-to-Image Generation Models
by: Wanaskar, Kapil, et al.
Published: (2025)
by: Wanaskar, Kapil, et al.
Published: (2025)
LGCC: Enhancing Flow Matching Based Text-Guided Image Editing with Local Gaussian Coupling and Context Consistency
by: Liu, Fangbing, et al.
Published: (2025)
by: Liu, Fangbing, et al.
Published: (2025)
PosterO: Structuring Layout Trees to Enable Language Models in Generalized Content-Aware Layout Generation
by: Hsu, HsiaoYuan, et al.
Published: (2025)
by: Hsu, HsiaoYuan, et al.
Published: (2025)
Expressive Text-to-Image Generation with Rich Text
by: Ge, Songwei, et al.
Published: (2023)
by: Ge, Songwei, et al.
Published: (2023)
A Few-Step Generative Model on Cumulative Flow Maps
by: Li, Zhiqi, et al.
Published: (2026)
by: Li, Zhiqi, et al.
Published: (2026)
Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning
by: Girdhar, Rohit, et al.
Published: (2023)
by: Girdhar, Rohit, et al.
Published: (2023)
Learning to Synthesize Compatible Fashion Items Using Semantic Alignment and Collocation Classification: An Outfit Generation Framework
by: Zhou, Dongliang, et al.
Published: (2025)
by: Zhou, Dongliang, et al.
Published: (2025)
Emotion Knowledge Enhancement for Vision Large Language Models: A Self-Verification Approach for High-Quality Emotion Instruction Data Generation
by: Wang, Feifan, et al.
Published: (2025)
by: Wang, Feifan, et al.
Published: (2025)
Alice v1: Distillation-Enhanced Video Generation Surpassing Closed-Source Models
by: Xiaoyu, Wang, et al.
Published: (2026)
by: Xiaoyu, Wang, et al.
Published: (2026)
LayerCraft: Enhancing Text-to-Image Generation with CoT Reasoning and Layered Object Integration
by: Zhang, Yuyao, et al.
Published: (2025)
by: Zhang, Yuyao, et al.
Published: (2025)
PoissonNet: A Local-Global Approach for Learning on Surfaces
by: Maesumi, Arman, et al.
Published: (2025)
by: Maesumi, Arman, et al.
Published: (2025)
LUSD: Localized Update Score Distillation for Text-Guided Image Editing
by: Chinchuthakun, Worameth, et al.
Published: (2025)
by: Chinchuthakun, Worameth, et al.
Published: (2025)
MicroRicci: A Greedy and Local Ricci Flow Solver for Self-Tuning Mesh Smoothing
by: Anh, Le Vu, et al.
Published: (2025)
by: Anh, Le Vu, et al.
Published: (2025)
DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization
by: Zhou, Zhenglin, et al.
Published: (2025)
by: Zhou, Zhenglin, et al.
Published: (2025)
MeshFIM: Local Low-Poly Mesh Editing via Fill-in-the-Middle Autoregressive Generation
by: Yang, Dingdong, et al.
Published: (2026)
by: Yang, Dingdong, et al.
Published: (2026)
HACD: Harnessing Attribute Semantics and Mesoscopic Structure for Community Detection
by: Zhang, Anran, et al.
Published: (2024)
by: Zhang, Anran, et al.
Published: (2024)
What matters for Representation Alignment: Global Information or Spatial Structure?
by: Singh, Jaskirat, et al.
Published: (2025)
by: Singh, Jaskirat, et al.
Published: (2025)
Don't Mesh with Me: Generating Constructive Solid Geometry Instead of Meshes by Fine-Tuning a Code-Generation LLM
by: Mews, Maximilian, et al.
Published: (2024)
by: Mews, Maximilian, et al.
Published: (2024)
MCMC: Bridging Rendering, Optimization and Generative AI
by: Singh, Gurprit, et al.
Published: (2025)
by: Singh, Gurprit, et al.
Published: (2025)
VFusion3D: Learning Scalable 3D Generative Models from Video Diffusion Models
by: Han, Junlin, et al.
Published: (2024)
by: Han, Junlin, et al.
Published: (2024)
Gen-C: Populating Virtual Worlds with Generative Crowds
by: Panayiotou, Andreas, et al.
Published: (2025)
by: Panayiotou, Andreas, et al.
Published: (2025)
Evaluating Machine Learning Approaches for ASCII Art Generation
by: Coumar, Sai, et al.
Published: (2025)
by: Coumar, Sai, et al.
Published: (2025)
Customizing Text-to-Image Models with a Single Image Pair
by: Jones, Maxwell, et al.
Published: (2024)
by: Jones, Maxwell, et al.
Published: (2024)
Harnessing Adaptive Topology Representations for Zero-Shot Graph Question Answering
by: Wei, Yanbin, et al.
Published: (2025)
by: Wei, Yanbin, et al.
Published: (2025)
Explorable INR: An Implicit Neural Representation for Ensemble Simulation Enabling Efficient Spatial and Parameter Exploration
by: Chen, Yi-Tang, et al.
Published: (2025)
by: Chen, Yi-Tang, et al.
Published: (2025)
Mitigating Semantic Collapse in Generative Personalization with Test-Time Embedding Adjustment
by: Bui, Anh, et al.
Published: (2025)
by: Bui, Anh, et al.
Published: (2025)
Generating Multi-Image Synthetic Data for Text-to-Image Customization
by: Kumari, Nupur, et al.
Published: (2025)
by: Kumari, Nupur, et al.
Published: (2025)
CAD-Coder:Text-Guided CAD Files Code Generation
by: He, Changqi, et al.
Published: (2025)
by: He, Changqi, et al.
Published: (2025)
Similar Items
-
Nabla-R2D3: Effective and Efficient 3D Diffusion Alignment with 2D Rewards
by: Liu, Qingming, et al.
Published: (2025) -
GraphShaper: Geometry-aware Alignment for Improving Transfer Learning in Text-Attributed Graphs
by: Zhang, Heng, et al.
Published: (2025) -
CAD-Coder: Text-to-CAD Generation with Chain-of-Thought and Geometric Reward
by: Guan, Yandong, et al.
Published: (2025) -
LumiGen: An LVLM-Enhanced Iterative Framework for Fine-Grained Text-to-Image Generation
by: Dong, Xiaoqi, et al.
Published: (2025) -
FlexControl: Computation-Aware ControlNet with Differentiable Router for Text-to-Image Generation
by: Fang, Zheng, et al.
Published: (2025)