:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Kim, Hyunseung, Choi, Chiho, Malla, Srikanth, Padmanabhan, Sai Prahladh, Bagchi, Saurabh, Choi, Joon Hee
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2509.19731
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

COPAL: Continual Pruning in Large Language Generative Models
by: Malla, Srikanth, et al.
Published: (2024)

Accelerating Conditional Prompt Learning via Masked Image Modeling for Vision-Language Models
by: Bui, Phuoc-Nguyen, et al.
Published: (2025)

Deep Understanding of Sign Language for Sign to Subtitle Alignment
by: Jang, Youngjoon, et al.
Published: (2025)

ADEPT: Adaptive Dynamic Early-Exit Process for Transformers
by: Yoo, Sangmin, et al.
Published: (2026)

MATRIX: Mask Track Alignment for Interaction-aware Video Generation
by: Jin, Siyoon, et al.
Published: (2025)

Mask2Map: Vectorized HD Map Construction Using Bird's Eye View Segmentation Masks
by: Choi, Sehwan, et al.
Published: (2024)

MAGIC: Few-Shot Mask-Guided Anomaly Inpainting with Prompt Perturbation, Spatially Adaptive Guidance, and Context Awareness
by: Choi, JaeHyuck, et al.
Published: (2025)

Evaluating Demographic Misrepresentation in Image-to-Image Portrait Editing
by: Seo, Huichan, et al.
Published: (2026)

SceneNAT: Masked Generative Modeling for Language-Guided Indoor Scene Synthesis
by: Choi, Jeongjun, et al.
Published: (2026)

OPRO: Orthogonal Panel-Relative Operators for Panel-Aware In-Context Image Generation
by: Lee, Sanghyeon, et al.
Published: (2026)

MaDis-Stereo: Enhanced Stereo Matching via Distilled Masked Image Modeling
by: Ahn, Jihye, et al.
Published: (2024)

OphEdit: Training-Free Text-Guided Editing of Ophthalmic Surgical Videos
by: Jangir, Ritul, et al.
Published: (2026)

GEM: Gaussian Evolution Model for Occupancy Forecasting and Motion Planning
by: Chen, Cheng, et al.
Published: (2026)

Training-Free Image Editing with Visual Context Integration and Concept Alignment
by: Song, Rui, et al.
Published: (2026)

InstantDrag: Improving Interactivity in Drag-based Image Editing
by: Shin, Joonghyuk, et al.
Published: (2024)

Re-Align: Structured Reasoning-guided Alignment for In-Context Image Generation and Editing
by: He, Runze, et al.
Published: (2026)

Vision-Only Gaussian Splatting for Collaborative Semantic Occupancy Prediction
by: Chen, Cheng, et al.
Published: (2025)

Unlocking the Capabilities of Masked Generative Models for Image Synthesis via Self-Guidance
by: Hur, Jiwan, et al.
Published: (2024)

Audio-Guided Visual Editing with Complex Multi-Modal Prompts
by: Kim, Hyeonyu, et al.
Published: (2025)

Good Noise Makes Good Edits: A Training-Free Diffusion-Based Video Editing with Image and Text Prompts
by: Choi, Saemee, et al.
Published: (2025)

CAVIS: Context-Aware Video Instance Segmentation
by: Lee, Seunghun, et al.
Published: (2024)

Fork-Merge Decoding: Enhancing Multimodal Understanding in Audio-Visual Large Language Models
by: Jung, Chaeyoung, et al.
Published: (2025)

Beyond Corner Patches: Semantics-Aware Backdoor Attack in Federated Learning
by: Herath, Kavindu, et al.
Published: (2026)

Geometry-Aware Scene Configurations for Novel View Synthesis
by: Kim, Minkwan, et al.
Published: (2025)

Preserve or Modify? Context-Aware Evaluation for Balancing Preservation and Modification in Text-Guided Image Editing
by: Kim, Yoonjeon, et al.
Published: (2024)

Edit Where You Mean: Region-Aware Adapter Injection for Mask-Free Local Image Editing
by: Cai, Honghao, et al.
Published: (2026)

Universal Image Immunization against Diffusion-based Image Editing via Semantic Injection
by: Lee, Chanhui, et al.
Published: (2026)

SmartFreeEdit: Mask-Free Spatial-Aware Image Editing with Complex Instruction Understanding
by: Sun, Qianqian, et al.
Published: (2025)

Improving Generative Pre-Training: An In-depth Study of Masked Image Modeling and Denoising Models
by: Choi, Hyesong, et al.
Published: (2024)

Beyond General Prompts: Automated Prompt Refinement using Contrastive Class Alignment Scores for Disambiguating Objects in Vision-Language Models
by: Choi, Lucas, et al.
Published: (2025)

ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling
by: Mao, Chaojie, et al.
Published: (2025)

Lesion-Aware Post-Training of Latent Diffusion Models for Synthesizing Diffusion MRI from CT Perfusion
by: Lee, Junhyeok, et al.
Published: (2025)

Guiding What Not to Generate: Automated Negative Prompting for Text-Image Alignment
by: Park, Sangha, et al.
Published: (2025)

Visual Representation Alignment for Multimodal Large Language Models
by: Yoon, Heeji, et al.
Published: (2025)

Periodic-MAE: Periodic Video Masked Autoencoder for rPPG Estimation
by: Choi, Jiho, et al.
Published: (2025)

Emerging Property of Masked Token for Effective Pre-training
by: Choi, Hyesong, et al.
Published: (2024)

Hands-off Image Editing: Language-guided Editing without any Task-specific Labeling, Masking or even Training
by: Santos, Rodrigo, et al.
Published: (2025)

Document Haystack: A Long Context Multimodal Image/Document Understanding Vision LLM Benchmark
by: Huybrechts, Goeric, et al.
Published: (2025)

MDE-Edit: Masked Dual-Editing for Multi-Object Image Editing via Diffusion Models
by: Zhu, Hongyang, et al.
Published: (2025)

Noise Map Guidance: Inversion with Spatial Context for Real Image Editing
by: Cho, Hansam, et al.
Published: (2024)