| Main Authors: | Jia, Zexi; Luo, Pengcheng; Zhong, Yijia; Zhang, Jinchao; Zhou, Jie |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.08064 |
Similar Items
Manifold-Optimal Guidance: A Unified Riemannian Control View of Diffusion Guidance
by: Jia, Zexi, et al.
Published: (2026)
Too Vivid to Be Real? Benchmarking and Calibrating Generative Color Fidelity
by: Fang, Zhengyao, et al.
Published: (2026)
StyleDecoupler: Generalizable Artistic Style Disentanglement
by: Jia, Zexi, et al.
Published: (2026)
CoDA: Color Distribution Probing for Efficient and Generalizable AI-Generated Image Detection
by: Jia, Zexi, et al.
Published: (2026)
Exploring Specular Reflection Inconsistency for Generalizable Face Forgery Detection
by: Fei, Hongyan, et al.
Published: (2026)
RDTF: Resource-efficient Dual-mask Training Framework for Multi-frame Animated Sticker Generation
by: Yuan, Zhiqiang, et al.
Published: (2025)
Control-CLIP: Decoupling Category and Style Guidance in CLIP for Specific-Domain Generation
by: Jia, Zexi, et al.
Published: (2025)
Semantic to Structure: Learning Structural Representations for Infringement Detection
by: Huang, Chuanwei, et al.
Published: (2025)
A Visual Leap in CLIP Compositionality Reasoning through Generation of Counterfactual Sets
by: Jia, Zexi, et al.
Published: (2025)
WalkVLM: Aid Visually Impaired People Walking by Vision Language Model
by: Yuan, Zhiqiang, et al.
Published: (2024)
F2RVLM: Boosting Fine-grained Fragment Retrieval for Multi-Modal Long-form Dialogue with Vision Language Model
by: Bi, Hanbo, et al.
Published: (2025)
From Imitation to Innovation: The Emergence of AI Unique Artistic Styles and the Challenge of Copyright Protection
by: Jia, Zexi, et al.
Published: (2025)
Mobius: A High Efficient Spatial-Temporal Parallel Training Paradigm for Text-to-Video Generation Task
by: Yang, Yiran, et al.
Published: (2024)
A Synthetic-to-Real Dehazing Method based on Domain Unification
by: Yuan, Zhiqiang, et al.
Published: (2025)
Generative Video Compression with One-Dimensional Latent Representation
by: Zheng, Zihan, et al.
Published: (2026)
Real2Code: Reconstruct Articulated Objects via Code Generation
by: Mandi, Zhao, et al.
Published: (2024)
Scalable Geometric Fracture Assembly via Co-creation Space among Assemblers
by: Zhang, Ruiyuan, et al.
Published: (2023)
Video2LoRA: Unified Semantic-Controlled Video Generation via Per-Reference-Video LoRA
by: Wu, Zexi, et al.
Published: (2026)
Flying Bird Object Detection Algorithm in Surveillance Video Based on Motion Information
by: Sun, Ziwei, et al.
Published: (2023)
Semantic One-Dimensional Tokenizer for Image Reconstruction and Generation
by: Qu, Yunpeng, et al.
Published: (2026)
Generative Semantic Coding for Ultra-Low Bitrate Visual Communication and Analysis
by: Chen, Weiming, et al.
Published: (2025)
Enhancing Visual Reliance in Text Generation: A Bayesian Perspective on Mitigating Hallucination in Large Vision-Language Models
by: Hu, Nanxing, et al.
Published: (2025)
HIR-ALIGN: Enhancing Hyperspectral Image Restoration via Diffusion-Based Data Generation
by: Pang, Li, et al.
Published: (2026)
Improving the Generalization of Segmentation Foundation Model under Distribution Shift via Weakly Supervised Adaptation
by: Zhang, Haojie, et al.
Published: (2023)
Why Settle for One? Text-to-ImageSet Generation and Evaluation
by: Jia, Chengyou, et al.
Published: (2025)
Unleashing Video Language Models for Fine-grained HRCT Report Generation
by: Fang, Yingying, et al.
Published: (2026)
ILDiff: Generate Transparent Animated Stickers by Implicit Layout Distillation
by: Zhang, Ting, et al.
Published: (2024)
Understanding the Implicit User Intention via Reasoning with Large Language Model for Image Editing
by: Wang, Yijia, et al.
Published: (2025)
MemGS: Memory-Efficient Gaussian Splatting for Real-Time SLAM
by: Bai, Yinlong, et al.
Published: (2025)
Low-Dimensional Gradient Helps Out-of-Distribution Detection
by: Wu, Yingwen, et al.
Published: (2023)
Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens
by: Kim, Dongwon, et al.
Published: (2025)
Diffusion Models Learn Low-Dimensional Distributions via Subspace Clustering
by: Wang, Peng, et al.
Published: (2024)
Perceptual Video Coding for Machines via Satisfied Machine Ratio Modeling
by: Zhang, Qi, et al.
Published: (2022)
Towards One-step Causal Video Generation via Adversarial Self-Distillation
by: Yang, Yongqi, et al.
Published: (2025)
Multi-concept Model Immunization through Differentiable Model Merging
by: Zheng, Amber Yijia, et al.
Published: (2024)
OneViewAll: Semantic Prior Guided One-View 6D Pose Estimation for Novel Objects
by: Luo, Yang, et al.
Published: (2026)
Boosting Semi-Supervised Medical Image Segmentation via Masked Image Consistency and Discrepancy Learning
by: Zhou, Pengcheng, et al.
Published: (2025)
CoDoL: Conditional Domain Prompt Learning for Out-of-Distribution Generalization
by: Zhang, Min, et al.
Published: (2025)
HGP-Mamba: Integrating Histology and Generated Protein Features for Mamba-based Multimodal Survival Risk Prediction
by: Dai, Jing, et al.
Published: (2026)
Collaborative Low-Rank Adaptation for Pre-Trained Vision Transformers
by: Liu, Zheng, et al.
Published: (2025)