Saved in:
| Main Authors: | Yang, Yang, Meng, Feifan, Fang, Han, Zhang, Weiming |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.26348 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Latent Guidance in Diffusion Models for Perceptual Evaluations
by: Saini, Shreshth, et al.
Published: (2025)
by: Saini, Shreshth, et al.
Published: (2025)
T2SMark: Balancing Robustness and Diversity in Noise-as-Watermark for Diffusion Models
by: Yang, Jindong, et al.
Published: (2025)
by: Yang, Jindong, et al.
Published: (2025)
SKeDA: A Generative Watermarking Framework for Text-to-video Diffusion Models
by: Yang, Yang, et al.
Published: (2026)
by: Yang, Yang, et al.
Published: (2026)
Diffusion Model with Perceptual Loss
by: Lin, Shanchuan, et al.
Published: (2023)
by: Lin, Shanchuan, et al.
Published: (2023)
Scaling-up Perceptual Video Quality Assessment
by: Jia, Ziheng, et al.
Published: (2025)
by: Jia, Ziheng, et al.
Published: (2025)
EPiC: Efficient Video Camera Control Learning with Precise Anchor-Video Guidance
by: Wang, Zun, et al.
Published: (2025)
by: Wang, Zun, et al.
Published: (2025)
RefBench-PRO: Perceptual and Reasoning Oriented Benchmark for Referring Expression Comprehension
by: Gao, Tianyi, et al.
Published: (2025)
by: Gao, Tianyi, et al.
Published: (2025)
Scaling Properties of Diffusion Models for Perceptual Tasks
by: Ravishankar, Rahul, et al.
Published: (2024)
by: Ravishankar, Rahul, et al.
Published: (2024)
MAGREF: Masked Guidance for Any-Reference Video Generation with Subject Disentanglement
by: Deng, Yufan, et al.
Published: (2025)
by: Deng, Yufan, et al.
Published: (2025)
Gradient-Free Classifier Guidance for Diffusion Model Sampling
by: Shenoy, Rahul, et al.
Published: (2024)
by: Shenoy, Rahul, et al.
Published: (2024)
BudgetFusion: Perceptually-Guided Adaptive Diffusion Models
by: Li, Qinchan, et al.
Published: (2024)
by: Li, Qinchan, et al.
Published: (2024)
PixelGen: Improving Pixel Diffusion with Perceptual Supervision
by: Ma, Zehong, et al.
Published: (2026)
by: Ma, Zehong, et al.
Published: (2026)
Perceptual Evaluation of GANs and Diffusion Models for Generating X-rays
by: Schuit, Gregory, et al.
Published: (2025)
by: Schuit, Gregory, et al.
Published: (2025)
Predicting and Enhancing the Fairness of DNNs with the Curvature of Perceptual Manifolds
by: Ma, Yanbiao, et al.
Published: (2023)
by: Ma, Yanbiao, et al.
Published: (2023)
Refine-IQA: Multi-Stage Reinforcement Finetuning for Perceptual Image Quality Assessment
by: Jia, Ziheng, et al.
Published: (2025)
by: Jia, Ziheng, et al.
Published: (2025)
MAP-Diff: Multi-Anchor Guided Diffusion for Progressive 3D Whole-Body Low-Dose PET Denoising
by: Jing, Peiyuan, et al.
Published: (2026)
by: Jing, Peiyuan, et al.
Published: (2026)
Entropy Rectifying Guidance for Diffusion and Flow Models
by: Ifriqi, Tariq Berrada, et al.
Published: (2025)
by: Ifriqi, Tariq Berrada, et al.
Published: (2025)
Beyond Fixed Anchors: Precisely Erasing Concepts with Sibling Exclusive Counterparts
by: Zhang, Tong, et al.
Published: (2025)
by: Zhang, Tong, et al.
Published: (2025)
Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Models
by: Jang, Sangwon, et al.
Published: (2025)
by: Jang, Sangwon, et al.
Published: (2025)
Perceptual Group Tokenizer: Building Perception with Iterative Grouping
by: Deng, Zhiwei, et al.
Published: (2023)
by: Deng, Zhiwei, et al.
Published: (2023)
Unveiling and Mitigating Generalized Biases of DNNs through the Intrinsic Dimensions of Perceptual Manifolds
by: Ma, Yanbiao, et al.
Published: (2024)
by: Ma, Yanbiao, et al.
Published: (2024)
PMG: Progressive Motion Generation via Sparse Anchor Postures Curriculum Learning
by: Xi, Yingjie, et al.
Published: (2025)
by: Xi, Yingjie, et al.
Published: (2025)
How Much To Guide: Revisiting Adaptive Guidance in Classifier-Free Guidance Text-to-Vision Diffusion Models
by: Zhang, Huixuan, et al.
Published: (2025)
by: Zhang, Huixuan, et al.
Published: (2025)
TAR-TVG: Enhancing VLMs with Timestamp Anchor-Constrained Reasoning for Temporal Video Grounding
by: Guo, Chaohong, et al.
Published: (2025)
by: Guo, Chaohong, et al.
Published: (2025)
MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance
by: Yesiltepe, Hidir, et al.
Published: (2024)
by: Yesiltepe, Hidir, et al.
Published: (2024)
Semantic Guidance Tuning for Text-To-Image Diffusion Models
by: Kang, Hyun, et al.
Published: (2023)
by: Kang, Hyun, et al.
Published: (2023)
Tell Me What to Track: Infusing Robust Language Guidance for Enhanced Referring Multi-Object Tracking
by: Huang, Wenjun, et al.
Published: (2024)
by: Huang, Wenjun, et al.
Published: (2024)
Inversion-DPO: Precise and Efficient Post-Training for Diffusion Models
by: Li, Zejian, et al.
Published: (2025)
by: Li, Zejian, et al.
Published: (2025)
Training-free Dense-Aligned Diffusion Guidance for Modular Conditional Image Synthesis
by: Wang, Zixuan, et al.
Published: (2025)
by: Wang, Zixuan, et al.
Published: (2025)
AttAnchor: Guiding Cross-Modal Token Alignment in VLMs with Attention Anchors
by: Zhang, Junyang, et al.
Published: (2025)
by: Zhang, Junyang, et al.
Published: (2025)
Causal Disentanglement-Inspired Degradation Representation Learning for Full-Reference Image Quality Assessment
by: Zhang, Zhen, et al.
Published: (2026)
by: Zhang, Zhen, et al.
Published: (2026)
Leveraging Geometric Visual Illusions as Perceptual Inductive Biases for Vision Models
by: Yang, Haobo, et al.
Published: (2025)
by: Yang, Haobo, et al.
Published: (2025)
Perceptual Quality-based Model Training under Annotator Label Uncertainty
by: Zhou, Chen, et al.
Published: (2024)
by: Zhou, Chen, et al.
Published: (2024)
Kaleido: Open-Sourced Multi-Subject Reference Video Generation Model
by: Zhang, Zhenxing, et al.
Published: (2025)
by: Zhang, Zhenxing, et al.
Published: (2025)
Parallel Rescaling: Rebalancing Consistency Guidance for Personalized Diffusion Models
by: Chae, JungWoo, et al.
Published: (2025)
by: Chae, JungWoo, et al.
Published: (2025)
Upsample Guidance: Scale Up Diffusion Models without Training
by: Hwang, Juno, et al.
Published: (2024)
by: Hwang, Juno, et al.
Published: (2024)
AnchorDiff: Training-Free Concept Grounding for MM-DiTs via Anchor-Based Graph Propagation
by: Zhang, Jian, et al.
Published: (2026)
by: Zhang, Jian, et al.
Published: (2026)
ContextGS: Compact 3D Gaussian Splatting with Anchor Level Context Model
by: Wang, Yufei, et al.
Published: (2024)
by: Wang, Yufei, et al.
Published: (2024)
A Simple Background Augmentation Method for Object Detection with Diffusion Model
by: Li, Yuhang, et al.
Published: (2024)
by: Li, Yuhang, et al.
Published: (2024)
Infusion: Preventing Customized Text-to-Image Diffusion from Overfitting
by: Zeng, Weili, et al.
Published: (2024)
by: Zeng, Weili, et al.
Published: (2024)
Similar Items
-
Latent Guidance in Diffusion Models for Perceptual Evaluations
by: Saini, Shreshth, et al.
Published: (2025) -
T2SMark: Balancing Robustness and Diversity in Noise-as-Watermark for Diffusion Models
by: Yang, Jindong, et al.
Published: (2025) -
SKeDA: A Generative Watermarking Framework for Text-to-video Diffusion Models
by: Yang, Yang, et al.
Published: (2026) -
Diffusion Model with Perceptual Loss
by: Lin, Shanchuan, et al.
Published: (2023) -
Scaling-up Perceptual Video Quality Assessment
by: Jia, Ziheng, et al.
Published: (2025)