| Main Authors: | Kim, Hyunseung, Choi, Chiho, Malla, Srikanth, Padmanabhan, Sai Prahladh, Bagchi, Saurabh, Choi, Joon Hee |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2509.19731 |
Similar Items
COPAL: Continual Pruning in Large Language Generative Models
by: Malla, Srikanth, et al.
Published: (2024)
Accelerating Conditional Prompt Learning via Masked Image Modeling for Vision-Language Models
by: Bui, Phuoc-Nguyen, et al.
Published: (2025)
Deep Understanding of Sign Language for Sign to Subtitle Alignment
by: Jang, Youngjoon, et al.
Published: (2025)
ADEPT: Adaptive Dynamic Early-Exit Process for Transformers
by: Yoo, Sangmin, et al.
Published: (2026)
MATRIX: Mask Track Alignment for Interaction-aware Video Generation
by: Jin, Siyoon, et al.
Published: (2025)
Mask2Map: Vectorized HD Map Construction Using Bird's Eye View Segmentation Masks
by: Choi, Sehwan, et al.
Published: (2024)
MAGIC: Few-Shot Mask-Guided Anomaly Inpainting with Prompt Perturbation, Spatially Adaptive Guidance, and Context Awareness
by: Choi, JaeHyuck, et al.
Published: (2025)
Evaluating Demographic Misrepresentation in Image-to-Image Portrait Editing
by: Seo, Huichan, et al.
Published: (2026)
SceneNAT: Masked Generative Modeling for Language-Guided Indoor Scene Synthesis
by: Choi, Jeongjun, et al.
Published: (2026)
OPRO: Orthogonal Panel-Relative Operators for Panel-Aware In-Context Image Generation
by: Lee, Sanghyeon, et al.
Published: (2026)
MaDis-Stereo: Enhanced Stereo Matching via Distilled Masked Image Modeling
by: Ahn, Jihye, et al.
Published: (2024)
OphEdit: Training-Free Text-Guided Editing of Ophthalmic Surgical Videos
by: Jangir, Ritul, et al.
Published: (2026)
GEM: Gaussian Evolution Model for Occupancy Forecasting and Motion Planning
by: Chen, Cheng, et al.
Published: (2026)
Training-Free Image Editing with Visual Context Integration and Concept Alignment
by: Song, Rui, et al.
Published: (2026)
InstantDrag: Improving Interactivity in Drag-based Image Editing
by: Shin, Joonghyuk, et al.
Published: (2024)
Re-Align: Structured Reasoning-guided Alignment for In-Context Image Generation and Editing
by: He, Runze, et al.
Published: (2026)
Vision-Only Gaussian Splatting for Collaborative Semantic Occupancy Prediction
by: Chen, Cheng, et al.
Published: (2025)
Unlocking the Capabilities of Masked Generative Models for Image Synthesis via Self-Guidance
by: Hur, Jiwan, et al.
Published: (2024)
Audio-Guided Visual Editing with Complex Multi-Modal Prompts
by: Kim, Hyeonyu, et al.
Published: (2025)
Good Noise Makes Good Edits: A Training-Free Diffusion-Based Video Editing with Image and Text Prompts
by: Choi, Saemee, et al.
Published: (2025)
CAVIS: Context-Aware Video Instance Segmentation
by: Lee, Seunghun, et al.
Published: (2024)
Fork-Merge Decoding: Enhancing Multimodal Understanding in Audio-Visual Large Language Models
by: Jung, Chaeyoung, et al.
Published: (2025)
Beyond Corner Patches: Semantics-Aware Backdoor Attack in Federated Learning
by: Herath, Kavindu, et al.
Published: (2026)
Geometry-Aware Scene Configurations for Novel View Synthesis
by: Kim, Minkwan, et al.
Published: (2025)
Preserve or Modify? Context-Aware Evaluation for Balancing Preservation and Modification in Text-Guided Image Editing
by: Kim, Yoonjeon, et al.
Published: (2024)
Edit Where You Mean: Region-Aware Adapter Injection for Mask-Free Local Image Editing
by: Cai, Honghao, et al.
Published: (2026)
Universal Image Immunization against Diffusion-based Image Editing via Semantic Injection
by: Lee, Chanhui, et al.
Published: (2026)
SmartFreeEdit: Mask-Free Spatial-Aware Image Editing with Complex Instruction Understanding
by: Sun, Qianqian, et al.
Published: (2025)
Improving Generative Pre-Training: An In-depth Study of Masked Image Modeling and Denoising Models
by: Choi, Hyesong, et al.
Published: (2024)
Beyond General Prompts: Automated Prompt Refinement using Contrastive Class Alignment Scores for Disambiguating Objects in Vision-Language Models
by: Choi, Lucas, et al.
Published: (2025)
ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling
by: Mao, Chaojie, et al.
Published: (2025)
Lesion-Aware Post-Training of Latent Diffusion Models for Synthesizing Diffusion MRI from CT Perfusion
by: Lee, Junhyeok, et al.
Published: (2025)
Guiding What Not to Generate: Automated Negative Prompting for Text-Image Alignment
by: Park, Sangha, et al.
Published: (2025)
Visual Representation Alignment for Multimodal Large Language Models
by: Yoon, Heeji, et al.
Published: (2025)
Periodic-MAE: Periodic Video Masked Autoencoder for rPPG Estimation
by: Choi, Jiho, et al.
Published: (2025)
Emerging Property of Masked Token for Effective Pre-training
by: Choi, Hyesong, et al.
Published: (2024)
Hands-off Image Editing: Language-guided Editing without any Task-specific Labeling, Masking or even Training
by: Santos, Rodrigo, et al.
Published: (2025)
Document Haystack: A Long Context Multimodal Image/Document Understanding Vision LLM Benchmark
by: Huybrechts, Goeric, et al.
Published: (2025)
MDE-Edit: Masked Dual-Editing for Multi-Object Image Editing via Diffusion Models
by: Zhu, Hongyang, et al.
Published: (2025)
Noise Map Guidance: Inversion with Spatial Context for Real Image Editing
by: Cho, Hansam, et al.
Published: (2024)