Guardado en:
| Autores principales: | Zhang, Lingyun, Xie, Yu, Fang, Zhongli, Liu, Yu, Chen, Ping |
|---|---|
| Formato: | Preprint |
| Publicado: |
2026
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2604.03941 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
SafeCtrl: Region-Based Safety Control for Text-to-Image Diffusion via Detect-Then-Suppress
por: Zhang, Lingyun, et al.
Publicado: (2025)
por: Zhang, Lingyun, et al.
Publicado: (2025)
NSFW-Classifier Guided Prompt Sanitization for Safe Text-to-Image Generation
por: Xie, Yu, et al.
Publicado: (2025)
por: Xie, Yu, et al.
Publicado: (2025)
TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control
por: Zeng, Weichao, et al.
Publicado: (2024)
por: Zeng, Weichao, et al.
Publicado: (2024)
Concept Replacer: Replacing Sensitive Concepts in Diffusion Models via Precision Localization
por: Zhang, Lingyun, et al.
Publicado: (2024)
por: Zhang, Lingyun, et al.
Publicado: (2024)
IdentityGuard: Context-Aware Restriction and Provenance for Personalized Synthesis
por: Zhang, Lingyun, et al.
Publicado: (2026)
por: Zhang, Lingyun, et al.
Publicado: (2026)
Ctrl-Room: Controllable Text-to-3D Room Meshes Generation with Layout Constraints
por: Fang, Chuan, et al.
Publicado: (2023)
por: Fang, Chuan, et al.
Publicado: (2023)
ConfCtrl: Enabling Precise Camera Control in Video Diffusion via Confidence-Aware Interpolation
por: Yang, Liudi, et al.
Publicado: (2026)
por: Yang, Liudi, et al.
Publicado: (2026)
DualCamCtrl: Dual-Branch Diffusion Model for Geometry-Aware Camera-Controlled Video Generation
por: Zhang, Hongfei, et al.
Publicado: (2025)
por: Zhang, Hongfei, et al.
Publicado: (2025)
CtrlNeRF: The Generative Neural Radiation Fields for the Controllable Synthesis of High-fidelity 3D-Aware Images
por: Liu, Jian, et al.
Publicado: (2024)
por: Liu, Jian, et al.
Publicado: (2024)
UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified Attention Control
por: Xia, Tian, et al.
Publicado: (2024)
por: Xia, Tian, et al.
Publicado: (2024)
Ctrl-Z Sampling: Scaling Diffusion Sampling with Controlled Random Zigzag Explorations
por: Mao, Shunqi, et al.
Publicado: (2025)
por: Mao, Shunqi, et al.
Publicado: (2025)
RelaCtrl: Relevance-Guided Efficient Control for Diffusion Transformers
por: Cao, Ke, et al.
Publicado: (2025)
por: Cao, Ke, et al.
Publicado: (2025)
CtrlVDiff: Controllable Video Generation via Unified Multimodal Video Diffusion
por: Xi, Dianbing, et al.
Publicado: (2025)
por: Xi, Dianbing, et al.
Publicado: (2025)
Mitigating Memorization in Text-to-Image Diffusion via Region-Aware Prompt Augmentation and Multimodal Copy Detection
por: Chen, Yunzhuo, et al.
Publicado: (2026)
por: Chen, Yunzhuo, et al.
Publicado: (2026)
LumiCtrl : Learning Illuminant Prompts for Lighting Control in Personalized Text-to-Image Models
por: Butt, Muhammad Atif, et al.
Publicado: (2025)
por: Butt, Muhammad Atif, et al.
Publicado: (2025)
CameraCtrl: Enabling Camera Control for Text-to-Video Generation
por: He, Hao, et al.
Publicado: (2024)
por: He, Hao, et al.
Publicado: (2024)
AttriCtrl: Fine-Grained Control of Aesthetic Attribute Intensity in Diffusion Models
por: Chen, Die, et al.
Publicado: (2025)
por: Chen, Die, et al.
Publicado: (2025)
Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance
por: Lin, Kuan Heng, et al.
Publicado: (2024)
por: Lin, Kuan Heng, et al.
Publicado: (2024)
CtrlSynth: Controllable Image Text Synthesis for Data-Efficient Multimodal Learning
por: Cao, Qingqing, et al.
Publicado: (2024)
por: Cao, Qingqing, et al.
Publicado: (2024)
CFG-Ctrl: Control-Based Classifier-Free Diffusion Guidance
por: Wang, Hanyang, et al.
Publicado: (2026)
por: Wang, Hanyang, et al.
Publicado: (2026)
EmoCtrl: Controllable Emotional Image Content Generation
por: Yang, Jingyuan, et al.
Publicado: (2025)
por: Yang, Jingyuan, et al.
Publicado: (2025)
RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths
por: Xue, Zeyue, et al.
Publicado: (2023)
por: Xue, Zeyue, et al.
Publicado: (2023)
SafeGuider: Robust and Practical Content Safety Control for Text-to-Image Models
por: Qi, Peigui, et al.
Publicado: (2025)
por: Qi, Peigui, et al.
Publicado: (2025)
LightCtrl: Training-free Controllable Video Relighting
por: Peng, Yizuo, et al.
Publicado: (2026)
por: Peng, Yizuo, et al.
Publicado: (2026)
BSDM: Background Suppression Diffusion Model for Hyperspectral Anomaly Detection
por: Ma, Jitao, et al.
Publicado: (2023)
por: Ma, Jitao, et al.
Publicado: (2023)
PhysCtrl: Generative Physics for Controllable and Physics-Grounded Video Generation
por: Wang, Chen, et al.
Publicado: (2025)
por: Wang, Chen, et al.
Publicado: (2025)
Detect-and-Guide: Self-regulation of Diffusion Models for Safe Text-to-Image Generation via Guideline Token Optimization
por: Li, Feifei, et al.
Publicado: (2025)
por: Li, Feifei, et al.
Publicado: (2025)
Ctrl-Crash: Controllable Diffusion for Realistic Car Crashes
por: Gosselin, Anthony, et al.
Publicado: (2025)
por: Gosselin, Anthony, et al.
Publicado: (2025)
Layer-wise Instance Binding for Regional and Occlusion Control in Text-to-Image Diffusion Transformers
por: Chen, Ruidong, et al.
Publicado: (2026)
por: Chen, Ruidong, et al.
Publicado: (2026)
Using Powerful Prior Knowledge of Diffusion Model in Deep Unfolding Networks for Image Compressive Sensing
por: Liao, Chen, et al.
Publicado: (2025)
por: Liao, Chen, et al.
Publicado: (2025)
DynamiCtrl: Rethinking the Basic Structure and the Role of Text for High-quality Human Image Animation
por: Zhao, Haoyu, et al.
Publicado: (2025)
por: Zhao, Haoyu, et al.
Publicado: (2025)
Personalized Safety Alignment for Text-to-Image Diffusion Models
por: Lei, Yu, et al.
Publicado: (2025)
por: Lei, Yu, et al.
Publicado: (2025)
Ctrl-VI: Controllable Video Synthesis via Variational Inference
por: Duan, Haoyi, et al.
Publicado: (2025)
por: Duan, Haoyi, et al.
Publicado: (2025)
TIDE: Text-Informed Dynamic Extrapolation with Step-Aware Temperature Control for Diffusion Transformers
por: Liu, Yihua, et al.
Publicado: (2026)
por: Liu, Yihua, et al.
Publicado: (2026)
SafeGen-Bench: Benchmarking Safety in Image-Conditioned Text-to-Video Generation
por: Ma, Yingzi, et al.
Publicado: (2026)
por: Ma, Yingzi, et al.
Publicado: (2026)
Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation
por: Gou, Yunhao, et al.
Publicado: (2024)
por: Gou, Yunhao, et al.
Publicado: (2024)
Beyond Detection: A Structure-Aware Framework for Scene Text Tracking
por: Yu, Chenmin, et al.
Publicado: (2026)
por: Yu, Chenmin, et al.
Publicado: (2026)
PROBE: Diagnosing Residual Concept Capacity in Erased Text-to-Video Diffusion Models
por: Xie, Yiwei, et al.
Publicado: (2026)
por: Xie, Yiwei, et al.
Publicado: (2026)
CtrlFuse: Mask-Prompt Guided Controllable Infrared and Visible Image Fusion
por: Sun, Yiming, et al.
Publicado: (2026)
por: Sun, Yiming, et al.
Publicado: (2026)
Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement
por: Chen, Zhennan, et al.
Publicado: (2024)
por: Chen, Zhennan, et al.
Publicado: (2024)
Ejemplares similares
-
SafeCtrl: Region-Based Safety Control for Text-to-Image Diffusion via Detect-Then-Suppress
por: Zhang, Lingyun, et al.
Publicado: (2025) -
NSFW-Classifier Guided Prompt Sanitization for Safe Text-to-Image Generation
por: Xie, Yu, et al.
Publicado: (2025) -
TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control
por: Zeng, Weichao, et al.
Publicado: (2024) -
Concept Replacer: Replacing Sensitive Concepts in Diffusion Models via Precision Localization
por: Zhang, Lingyun, et al.
Publicado: (2024) -
IdentityGuard: Context-Aware Restriction and Provenance for Personalized Synthesis
por: Zhang, Lingyun, et al.
Publicado: (2026)