:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Kim, Mingyu, Kim, Young-Heon, Park, Mijung
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence
Online Access:	https://arxiv.org/abs/2603.13300
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Bayesian Principles Improve Prompt Learning In Vision-Language Models
by: Kim, Mingyu, et al.
Published: (2025)

Guiding What Not to Generate: Automated Negative Prompting for Text-Image Alignment
by: Park, Sangha, et al.
Published: (2025)

LaMoGen: Laban Movement-Guided Diffusion for Text-to-Motion Generation
by: Kim, Heechang, et al.
Published: (2025)

Synergistic Integration of Coordinate Network and Tensorial Feature for Improving Neural Radiance Fields from Sparse Inputs
by: Kim, Mingyu, et al.
Published: (2024)

GuidNoise: Single-Pair Guided Diffusion for Generalized Noise Synthesis
by: Kim, Changjin, et al.
Published: (2025)

Dynamic VLM-Guided Negative Prompting for Diffusion Models
by: Chang, Hoyeon, et al.
Published: (2025)

HoliSafe: Holistic Safety Benchmarking and Modeling for Vision-Language Model
by: Lee, Youngwan, et al.
Published: (2025)

Thinking Diffusion: Penalize and Guide Visual-Grounded Reasoning in Diffusion Multimodal Language Models
by: Kim, Keuntae, et al.
Published: (2026)

UniSAFE: A Comprehensive Benchmark for Safety Evaluation of Unified Multimodal Models
by: Lee, Segyu, et al.
Published: (2026)

Reflexive Guidance: Improving OoDD in Vision-Language Models via Self-Guided Image-Adaptive Concept Generation
by: Kim, Jihyo, et al.
Published: (2024)

CharDiff-LP: A Diffusion Model with Character-Level Guidance for License Plate Image Restoration
by: Na, Kihyun, et al.
Published: (2025)

Zero-Shot Industrial Anomaly Segmentation with Image-Aware Prompt Generation
by: Park, SoYoung, et al.
Published: (2025)

MatteViT: High-Frequency-Aware Document Shadow Removal with Shadow Matte Guidance
by: Kim, Chaewon, et al.
Published: (2025)

3DPhysVideo: Consistency-Guided Flow SDE for Video Generation via 3D Scene Reconstruction and Physical Simulation
by: Kim, Hwidong, et al.
Published: (2026)

FPANet: Frequency-based Video Demoireing using Frame-level Post Alignment
by: Oh, Gyeongrok, et al.
Published: (2023)

Adaptive Self-training Framework for Fine-grained Scene Graph Generation
by: Kim, Kibum, et al.
Published: (2024)

Text-Guided Variational Image Generation for Industrial Anomaly Detection and Segmentation
by: Lee, Mingyu, et al.
Published: (2024)

SafeEditor: Unified MLLM for Efficient Post-hoc T2I Safety Editing
by: Zhang, Ruiyang, et al.
Published: (2025)

Rare-to-Frequent: Unlocking Compositional Generation Power of Diffusion Models on Rare Concepts with LLM Guidance
by: Park, Dongmin, et al.
Published: (2024)

MAGIC: Few-Shot Mask-Guided Anomaly Inpainting with Prompt Perturbation, Spatially Adaptive Guidance, and Context Awareness
by: Choi, JaeHyuck, et al.
Published: (2025)

Toward the Frontiers of Reliable Diffusion Sampling via Adversarial Sinkhorn Attention Guidance
by: Kim, Kwanyoung
Published: (2025)

Contribution-based Low-Rank Adaptation with Pre-training Model for Real Image Restoration
by: Park, Donwon, et al.
Published: (2024)

FALCON: False-Negative Aware Learning of Contrastive Negatives in Vision-Language Alignment
by: Kim, Myunsoo, et al.
Published: (2025)

H2-Cache: A Novel Hierarchical Dual-Stage Cache for High-Performance Acceleration of Generative Diffusion Models
by: Sung, Mingyu, et al.
Published: (2025)

VideoMaMa: Mask-Guided Video Matting via Generative Prior
by: Lim, Sangbeom, et al.
Published: (2026)

CellCLIP -- Learning Perturbation Effects in Cell Painting via Text-Guided Contrastive Learning
by: Lu, Mingyu, et al.
Published: (2025)

Where and How to Perturb: On the Design of Perturbation Guidance in Diffusion and Flow Models
by: Ahn, Donghoon, et al.
Published: (2025)

Denoising Task Routing for Diffusion Models
by: Park, Byeongjun, et al.
Published: (2023)

VisAgent: Narrative-Preserving Story Visualization Framework
by: Kim, Seungkwon, et al.
Published: (2025)

Explainable Adversarial-Robust Vision-Language-Action Model for Robotic Manipulation
by: Kim, Ju-Young, et al.
Published: (2025)

Self-Rectifying Diffusion Sampling with Perturbed-Attention Guidance
by: Ahn, Donghoon, et al.
Published: (2024)

TCAN: Animating Human Images with Temporally Consistent Pose Guidance using Diffusion Models
by: Kim, Jeongho, et al.
Published: (2024)

SAIL: Similarity-Aware Guidance and Inter-Caption Augmentation-based Learning for Weakly-Supervised Dense Video Captioning
by: Kim, Ye-Chan, et al.
Published: (2026)

DriveSafe: A Framework for Risk Detection and Safety Suggestions in Driving Scenarios
by: Artham, Sainithin, et al.
Published: (2026)

Boost-and-Skip: A Simple Guidance-Free Diffusion for Minority Generation
by: Um, Soobin, et al.
Published: (2025)

LEAP:D -- A Novel Prompt-based Approach for Domain-Generalized Aerial Object Detection
by: Park, Chanyeong, et al.
Published: (2024)

EditSplat: Multi-View Fusion and Attention-Guided Optimization for View-Consistent 3D Scene Editing with 3D Gaussian Splatting
by: Lee, Dong In, et al.
Published: (2024)

Clustering-based Image-Text Graph Matching for Domain Generalization
by: Park, Nokyung, et al.
Published: (2023)

Empowering Bridge Digital Twins by Bridging the Data Gap with a Unified Synthesis Framework
by: Wang, Wang, et al.
Published: (2025)

Parallel Rescaling: Rebalancing Consistency Guidance for Personalized Diffusion Models
by: Chae, JungWoo, et al.
Published: (2025)