:: Library Catalog

Bibliographic Details
Main authors:	Zhang, Lingyun; Xie, Yu; Fang, Zhongli; Liu, Yu; Chen, Ping
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online access:	https://arxiv.org/abs/2604.03941

Similar Items

SafeCtrl: Region-Based Safety Control for Text-to-Image Diffusion via Detect-Then-Suppress
by: Zhang, Lingyun, et al.
Published: (2025)

NSFW-Classifier Guided Prompt Sanitization for Safe Text-to-Image Generation
by: Xie, Yu, et al.
Published: (2025)

TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control
by: Zeng, Weichao, et al.
Published: (2024)

Concept Replacer: Replacing Sensitive Concepts in Diffusion Models via Precision Localization
by: Zhang, Lingyun, et al.
Published: (2024)

IdentityGuard: Context-Aware Restriction and Provenance for Personalized Synthesis
by: Zhang, Lingyun, et al.
Published: (2026)

Ctrl-Room: Controllable Text-to-3D Room Meshes Generation with Layout Constraints
by: Fang, Chuan, et al.
Published: (2023)

ConfCtrl: Enabling Precise Camera Control in Video Diffusion via Confidence-Aware Interpolation
by: Yang, Liudi, et al.
Published: (2026)

DualCamCtrl: Dual-Branch Diffusion Model for Geometry-Aware Camera-Controlled Video Generation
by: Zhang, Hongfei, et al.
Published: (2025)

CtrlNeRF: The Generative Neural Radiation Fields for the Controllable Synthesis of High-fidelity 3D-Aware Images
by: Liu, Jian, et al.
Published: (2024)

UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified Attention Control
by: Xia, Tian, et al.
Published: (2024)

Ctrl-Z Sampling: Scaling Diffusion Sampling with Controlled Random Zigzag Explorations
by: Mao, Shunqi, et al.
Published: (2025)

RelaCtrl: Relevance-Guided Efficient Control for Diffusion Transformers
by: Cao, Ke, et al.
Published: (2025)

CtrlVDiff: Controllable Video Generation via Unified Multimodal Video Diffusion
by: Xi, Dianbing, et al.
Published: (2025)

Mitigating Memorization in Text-to-Image Diffusion via Region-Aware Prompt Augmentation and Multimodal Copy Detection
by: Chen, Yunzhuo, et al.
Published: (2026)

LumiCtrl : Learning Illuminant Prompts for Lighting Control in Personalized Text-to-Image Models
by: Butt, Muhammad Atif, et al.
Published: (2025)

CameraCtrl: Enabling Camera Control for Text-to-Video Generation
by: He, Hao, et al.
Published: (2024)

AttriCtrl: Fine-Grained Control of Aesthetic Attribute Intensity in Diffusion Models
by: Chen, Die, et al.
Published: (2025)

Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance
by: Lin, Kuan Heng, et al.
Published: (2024)

CtrlSynth: Controllable Image Text Synthesis for Data-Efficient Multimodal Learning
by: Cao, Qingqing, et al.
Published: (2024)

CFG-Ctrl: Control-Based Classifier-Free Diffusion Guidance
by: Wang, Hanyang, et al.
Published: (2026)

EmoCtrl: Controllable Emotional Image Content Generation
by: Yang, Jingyuan, et al.
Published: (2025)

RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths
by: Xue, Zeyue, et al.
Published: (2023)

SafeGuider: Robust and Practical Content Safety Control for Text-to-Image Models
by: Qi, Peigui, et al.
Published: (2025)

LightCtrl: Training-free Controllable Video Relighting
by: Peng, Yizuo, et al.
Published: (2026)

BSDM: Background Suppression Diffusion Model for Hyperspectral Anomaly Detection
by: Ma, Jitao, et al.
Published: (2023)

PhysCtrl: Generative Physics for Controllable and Physics-Grounded Video Generation
by: Wang, Chen, et al.
Published: (2025)

Detect-and-Guide: Self-regulation of Diffusion Models for Safe Text-to-Image Generation via Guideline Token Optimization
by: Li, Feifei, et al.
Published: (2025)

Ctrl-Crash: Controllable Diffusion for Realistic Car Crashes
by: Gosselin, Anthony, et al.
Published: (2025)

Layer-wise Instance Binding for Regional and Occlusion Control in Text-to-Image Diffusion Transformers
by: Chen, Ruidong, et al.
Published: (2026)

Using Powerful Prior Knowledge of Diffusion Model in Deep Unfolding Networks for Image Compressive Sensing
by: Liao, Chen, et al.
Published: (2025)

DynamiCtrl: Rethinking the Basic Structure and the Role of Text for High-quality Human Image Animation
by: Zhao, Haoyu, et al.
Published: (2025)

Personalized Safety Alignment for Text-to-Image Diffusion Models
by: Lei, Yu, et al.
Published: (2025)

Ctrl-VI: Controllable Video Synthesis via Variational Inference
by: Duan, Haoyi, et al.
Published: (2025)

TIDE: Text-Informed Dynamic Extrapolation with Step-Aware Temperature Control for Diffusion Transformers
by: Liu, Yihua, et al.
Published: (2026)

SafeGen-Bench: Benchmarking Safety in Image-Conditioned Text-to-Video Generation
by: Ma, Yingzi, et al.
Published: (2026)

Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation
by: Gou, Yunhao, et al.
Published: (2024)

Beyond Detection: A Structure-Aware Framework for Scene Text Tracking
by: Yu, Chenmin, et al.
Published: (2026)

PROBE: Diagnosing Residual Concept Capacity in Erased Text-to-Video Diffusion Models
by: Xie, Yiwei, et al.
Published: (2026)

CtrlFuse: Mask-Prompt Guided Controllable Infrared and Visible Image Fusion
by: Sun, Yiming, et al.
Published: (2026)

Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement
by: Chen, Zhennan, et al.
Published: (2024)