:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Yang, Yang, Meng, Feifan, Fang, Han, Zhang, Weiming
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence
Online Access:	https://arxiv.org/abs/2604.26348
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Latent Guidance in Diffusion Models for Perceptual Evaluations
by: Saini, Shreshth, et al.
Published: (2025)

T2SMark: Balancing Robustness and Diversity in Noise-as-Watermark for Diffusion Models
by: Yang, Jindong, et al.
Published: (2025)

SKeDA: A Generative Watermarking Framework for Text-to-video Diffusion Models
by: Yang, Yang, et al.
Published: (2026)

Diffusion Model with Perceptual Loss
by: Lin, Shanchuan, et al.
Published: (2023)

Scaling-up Perceptual Video Quality Assessment
by: Jia, Ziheng, et al.
Published: (2025)

EPiC: Efficient Video Camera Control Learning with Precise Anchor-Video Guidance
by: Wang, Zun, et al.
Published: (2025)

RefBench-PRO: Perceptual and Reasoning Oriented Benchmark for Referring Expression Comprehension
by: Gao, Tianyi, et al.
Published: (2025)

Scaling Properties of Diffusion Models for Perceptual Tasks
by: Ravishankar, Rahul, et al.
Published: (2024)

MAGREF: Masked Guidance for Any-Reference Video Generation with Subject Disentanglement
by: Deng, Yufan, et al.
Published: (2025)

Gradient-Free Classifier Guidance for Diffusion Model Sampling
by: Shenoy, Rahul, et al.
Published: (2024)

BudgetFusion: Perceptually-Guided Adaptive Diffusion Models
by: Li, Qinchan, et al.
Published: (2024)

PixelGen: Improving Pixel Diffusion with Perceptual Supervision
by: Ma, Zehong, et al.
Published: (2026)

Perceptual Evaluation of GANs and Diffusion Models for Generating X-rays
by: Schuit, Gregory, et al.
Published: (2025)

Predicting and Enhancing the Fairness of DNNs with the Curvature of Perceptual Manifolds
by: Ma, Yanbiao, et al.
Published: (2023)

Refine-IQA: Multi-Stage Reinforcement Finetuning for Perceptual Image Quality Assessment
by: Jia, Ziheng, et al.
Published: (2025)

MAP-Diff: Multi-Anchor Guided Diffusion for Progressive 3D Whole-Body Low-Dose PET Denoising
by: Jing, Peiyuan, et al.
Published: (2026)

Entropy Rectifying Guidance for Diffusion and Flow Models
by: Ifriqi, Tariq Berrada, et al.
Published: (2025)

Beyond Fixed Anchors: Precisely Erasing Concepts with Sibling Exclusive Counterparts
by: Zhang, Tong, et al.
Published: (2025)

Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Models
by: Jang, Sangwon, et al.
Published: (2025)

Perceptual Group Tokenizer: Building Perception with Iterative Grouping
by: Deng, Zhiwei, et al.
Published: (2023)

Unveiling and Mitigating Generalized Biases of DNNs through the Intrinsic Dimensions of Perceptual Manifolds
by: Ma, Yanbiao, et al.
Published: (2024)

PMG: Progressive Motion Generation via Sparse Anchor Postures Curriculum Learning
by: Xi, Yingjie, et al.
Published: (2025)

How Much To Guide: Revisiting Adaptive Guidance in Classifier-Free Guidance Text-to-Vision Diffusion Models
by: Zhang, Huixuan, et al.
Published: (2025)

TAR-TVG: Enhancing VLMs with Timestamp Anchor-Constrained Reasoning for Temporal Video Grounding
by: Guo, Chaohong, et al.
Published: (2025)

MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance
by: Yesiltepe, Hidir, et al.
Published: (2024)

Semantic Guidance Tuning for Text-To-Image Diffusion Models
by: Kang, Hyun, et al.
Published: (2023)

Tell Me What to Track: Infusing Robust Language Guidance for Enhanced Referring Multi-Object Tracking
by: Huang, Wenjun, et al.
Published: (2024)

Inversion-DPO: Precise and Efficient Post-Training for Diffusion Models
by: Li, Zejian, et al.
Published: (2025)

Training-free Dense-Aligned Diffusion Guidance for Modular Conditional Image Synthesis
by: Wang, Zixuan, et al.
Published: (2025)

AttAnchor: Guiding Cross-Modal Token Alignment in VLMs with Attention Anchors
by: Zhang, Junyang, et al.
Published: (2025)

Causal Disentanglement-Inspired Degradation Representation Learning for Full-Reference Image Quality Assessment
by: Zhang, Zhen, et al.
Published: (2026)

Leveraging Geometric Visual Illusions as Perceptual Inductive Biases for Vision Models
by: Yang, Haobo, et al.
Published: (2025)

Perceptual Quality-based Model Training under Annotator Label Uncertainty
by: Zhou, Chen, et al.
Published: (2024)

Kaleido: Open-Sourced Multi-Subject Reference Video Generation Model
by: Zhang, Zhenxing, et al.
Published: (2025)

Parallel Rescaling: Rebalancing Consistency Guidance for Personalized Diffusion Models
by: Chae, JungWoo, et al.
Published: (2025)

Upsample Guidance: Scale Up Diffusion Models without Training
by: Hwang, Juno, et al.
Published: (2024)

AnchorDiff: Training-Free Concept Grounding for MM-DiTs via Anchor-Based Graph Propagation
by: Zhang, Jian, et al.
Published: (2026)

ContextGS: Compact 3D Gaussian Splatting with Anchor Level Context Model
by: Wang, Yufei, et al.
Published: (2024)

A Simple Background Augmentation Method for Object Detection with Diffusion Model
by: Li, Yuhang, et al.
Published: (2024)

Infusion: Preventing Customized Text-to-Image Diffusion from Overfitting
by: Zeng, Weili, et al.
Published: (2024)