:: Library Catalog

Bibliographic Details
Main Authors:	Jo, Kyungmin, Yun, Jooyeol, Choo, Jaegul
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2508.02004

Similar Items

Vector Prism: Animating Vector Graphics by Stratifying Semantic Structure
by: Yun, Jooyeol, et al.
Published: (2025)

Scaling Up Personalized Image Aesthetic Assessment via Task Vector Customization
by: Yun, Jooyeol, et al.
Published: (2024)

Skip-and-Play: Depth-Driven Pose-Preserved Image Generation for Any Objects
by: Jo, Kyungmin, et al.
Published: (2024)

Enabling Region-Specific Control via Lassos in Point-Based Colorization
by: Lee, Sanghyeon, et al.
Published: (2024)

SphereDiff: Tuning-free 360° Static and Dynamic Panorama Generation via Spherical Latent Representation
by: Park, Minho, et al.
Published: (2025)

Regularized Training with Generated Datasets for Name-Only Transfer of Vision-Language Models
by: Park, Minho, et al.
Published: (2024)

DetailCLIP: Injecting Image Details into CLIP's Feature Space
by: Zhang, Zilun, et al.
Published: (2022)

The Devil is in the Details: Simple Remedies for Image-to-LiDAR Representation Learning
by: Jo, Wonjun, et al.
Published: (2025)

Imagining the Unseen: Generative Location Modeling for Object Placement
by: Yun, Jooyeol, et al.
Published: (2024)

The Devil is in the EOS: Sequence Training for Detailed Image Captioning
by: Mohamed, Abdelrahman, et al.
Published: (2025)

DesignLab: Designing Slides Through Iterative Detection and Correction
by: Yun, Jooyeol, et al.
Published: (2025)

Differentiable JPEG: The Devil is in the Details
by: Reich, Christoph, et al.
Published: (2023)

The Devil is in the Details: StyleFeatureEditor for Detail-Rich StyleGAN Inversion and High Quality Image Editing
by: Bobkov, Denis, et al.
Published: (2024)

DetailFlow: 1D Coarse-to-Fine Autoregressive Image Generation via Next-Detail Prediction
by: Liu, Yiheng, et al.
Published: (2025)

ReflectCAP: Detailed Image Captioning with Reflective Memory
by: Min, Kyungmin, et al.
Published: (2026)

PromptDresser: Improving the Quality and Controllability of Virtual Try-On via Generative Textual Prompt and Prompt-aware Mask
by: Kim, Jeongho, et al.
Published: (2024)

TCAN: Animating Human Images with Temporally Consistent Pose Guidance using Diffusion Models
by: Kim, Jeongho, et al.
Published: (2024)

LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts
by: Gani, Hanan, et al.
Published: (2023)

MVSFormer++: Revealing the Devil in Transformer's Details for Multi-View Stereo
by: Cao, Chenjie, et al.
Published: (2024)

GaussianMotion: End-to-End Learning of Animatable Gaussian Avatars with Pose Guidance from Text
by: Shim, Gyumin, et al.
Published: (2025)

Attention to Detail: Global-Local Attention for High-Resolution AI-Generated Image Detection
by: Han, Lawrence
Published: (2026)

Investigating Pre-Training Objectives for Generalization in Vision-Based Reinforcement Learning
by: Kim, Donghu, et al.
Published: (2024)

Fair Generation without Unfair Distortions: Debiasing Text-to-Image Generation with Entanglement-Free Attention
by: Park, Jeonghoon, et al.
Published: (2025)

From Wardrobe to Canvas: Wardrobe Polyptych LoRA for Part-level Controllable Human Image Generation
by: Kim, Jeongho, et al.
Published: (2025)

Good Noise Makes Good Edits: A Training-Free Diffusion-Based Video Editing with Image and Text Prompts
by: Choi, Saemee, et al.
Published: (2025)

Progressive Prompt Detailing for Improved Alignment in Text-to-Image Generative Models
by: Saichandran, Ketan Suhaas, et al.
Published: (2025)

Temporal In-Context Fine-Tuning with Temporal Reasoning for Versatile Control of Video Diffusion Models
by: Kim, Kinam, et al.
Published: (2025)

DetailCLIP: Detail-Oriented CLIP for Fine-Grained Tasks
by: Monsefi, Amin Karimi, et al.
Published: (2024)

First Place Solution to the MLCAS 2025 GWFSS Challenge: The Devil is in the Detail and Minority
by: Cao, Songliang, et al.
Published: (2025)

Finding Needles in Images: Can Multimodal LLMs Locate Fine Details?
by: Thakkar, Parth, et al.
Published: (2025)

Block and Detail: Scaffolding Sketch-to-Image Generation
by: Sarukkai, Vishnu, et al.
Published: (2024)

Benchmarking and Improving Detail Image Caption
by: Dong, Hongyuan, et al.
Published: (2024)

The Devil is in the Details: Boosting Guided Depth Super-Resolution via Rethinking Cross-Modal Alignment and Aggregation
by: Jiang, Xinni, et al.
Published: (2024)

Missing Fine Details in Images: Last Seen in High Frequencies
by: Medi, Tejaswini, et al.
Published: (2025)

The Devil is in the Details -- From OCR for Old Church Slavonic to Purely Visual Stemma Reconstruction
by: Hoenen, Armin
Published: (2026)

Beyond Illumination: Fine-Grained Detail Preservation in Extreme Dark Image Restoration
by: Zhang, Tongshun, et al.
Published: (2025)

No Detail Left Behind: Revisiting Self-Retrieval for Fine-Grained Image Captioning
by: Gaur, Manu, et al.
Published: (2024)

Detail++: Training-Free Detail Enhancer for Text-to-Image Diffusion Models
by: Chen, Lifeng, et al.
Published: (2025)

MM-SeR: Multimodal Self-Refinement for Lightweight Image Captioning
by: Song, Junha, et al.
Published: (2025)

Generating Fine Details of Entity Interactions
by: Gu, Xinyi, et al.
Published: (2025)