Saved in:
| Main Authors: | Jo, Kyungmin, Yun, Jooyeol, Choo, Jaegul |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2508.02004 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Vector Prism: Animating Vector Graphics by Stratifying Semantic Structure
by: Yun, Jooyeol, et al.
Published: (2025)
by: Yun, Jooyeol, et al.
Published: (2025)
Scaling Up Personalized Image Aesthetic Assessment via Task Vector Customization
by: Yun, Jooyeol, et al.
Published: (2024)
by: Yun, Jooyeol, et al.
Published: (2024)
Skip-and-Play: Depth-Driven Pose-Preserved Image Generation for Any Objects
by: Jo, Kyungmin, et al.
Published: (2024)
by: Jo, Kyungmin, et al.
Published: (2024)
Enabling Region-Specific Control via Lassos in Point-Based Colorization
by: Lee, Sanghyeon, et al.
Published: (2024)
by: Lee, Sanghyeon, et al.
Published: (2024)
SphereDiff: Tuning-free 360° Static and Dynamic Panorama Generation via Spherical Latent Representation
by: Park, Minho, et al.
Published: (2025)
by: Park, Minho, et al.
Published: (2025)
Regularized Training with Generated Datasets for Name-Only Transfer of Vision-Language Models
by: Park, Minho, et al.
Published: (2024)
by: Park, Minho, et al.
Published: (2024)
DetailCLIP: Injecting Image Details into CLIP's Feature Space
by: Zhang, Zilun, et al.
Published: (2022)
by: Zhang, Zilun, et al.
Published: (2022)
The Devil is in the Details: Simple Remedies for Image-to-LiDAR Representation Learning
by: Jo, Wonjun, et al.
Published: (2025)
by: Jo, Wonjun, et al.
Published: (2025)
Imagining the Unseen: Generative Location Modeling for Object Placement
by: Yun, Jooyeol, et al.
Published: (2024)
by: Yun, Jooyeol, et al.
Published: (2024)
The Devil is in the EOS: Sequence Training for Detailed Image Captioning
by: Mohamed, Abdelrahman, et al.
Published: (2025)
by: Mohamed, Abdelrahman, et al.
Published: (2025)
DesignLab: Designing Slides Through Iterative Detection and Correction
by: Yun, Jooyeol, et al.
Published: (2025)
by: Yun, Jooyeol, et al.
Published: (2025)
Differentiable JPEG: The Devil is in the Details
by: Reich, Christoph, et al.
Published: (2023)
by: Reich, Christoph, et al.
Published: (2023)
The Devil is in the Details: StyleFeatureEditor for Detail-Rich StyleGAN Inversion and High Quality Image Editing
by: Bobkov, Denis, et al.
Published: (2024)
by: Bobkov, Denis, et al.
Published: (2024)
DetailFlow: 1D Coarse-to-Fine Autoregressive Image Generation via Next-Detail Prediction
by: Liu, Yiheng, et al.
Published: (2025)
by: Liu, Yiheng, et al.
Published: (2025)
ReflectCAP: Detailed Image Captioning with Reflective Memory
by: Min, Kyungmin, et al.
Published: (2026)
by: Min, Kyungmin, et al.
Published: (2026)
PromptDresser: Improving the Quality and Controllability of Virtual Try-On via Generative Textual Prompt and Prompt-aware Mask
by: Kim, Jeongho, et al.
Published: (2024)
by: Kim, Jeongho, et al.
Published: (2024)
TCAN: Animating Human Images with Temporally Consistent Pose Guidance using Diffusion Models
by: Kim, Jeongho, et al.
Published: (2024)
by: Kim, Jeongho, et al.
Published: (2024)
LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts
by: Gani, Hanan, et al.
Published: (2023)
by: Gani, Hanan, et al.
Published: (2023)
MVSFormer++: Revealing the Devil in Transformer's Details for Multi-View Stereo
by: Cao, Chenjie, et al.
Published: (2024)
by: Cao, Chenjie, et al.
Published: (2024)
GaussianMotion: End-to-End Learning of Animatable Gaussian Avatars with Pose Guidance from Text
by: Shim, Gyumin, et al.
Published: (2025)
by: Shim, Gyumin, et al.
Published: (2025)
Attention to Detail: Global-Local Attention for High-Resolution AI-Generated Image Detection
by: Han, Lawrence
Published: (2026)
by: Han, Lawrence
Published: (2026)
Investigating Pre-Training Objectives for Generalization in Vision-Based Reinforcement Learning
by: Kim, Donghu, et al.
Published: (2024)
by: Kim, Donghu, et al.
Published: (2024)
Fair Generation without Unfair Distortions: Debiasing Text-to-Image Generation with Entanglement-Free Attention
by: Park, Jeonghoon, et al.
Published: (2025)
by: Park, Jeonghoon, et al.
Published: (2025)
From Wardrobe to Canvas: Wardrobe Polyptych LoRA for Part-level Controllable Human Image Generation
by: Kim, Jeongho, et al.
Published: (2025)
by: Kim, Jeongho, et al.
Published: (2025)
Good Noise Makes Good Edits: A Training-Free Diffusion-Based Video Editing with Image and Text Prompts
by: Choi, Saemee, et al.
Published: (2025)
by: Choi, Saemee, et al.
Published: (2025)
Progressive Prompt Detailing for Improved Alignment in Text-to-Image Generative Models
by: Saichandran, Ketan Suhaas, et al.
Published: (2025)
by: Saichandran, Ketan Suhaas, et al.
Published: (2025)
Temporal In-Context Fine-Tuning with Temporal Reasoning for Versatile Control of Video Diffusion Models
by: Kim, Kinam, et al.
Published: (2025)
by: Kim, Kinam, et al.
Published: (2025)
DetailCLIP: Detail-Oriented CLIP for Fine-Grained Tasks
by: Monsefi, Amin Karimi, et al.
Published: (2024)
by: Monsefi, Amin Karimi, et al.
Published: (2024)
First Place Solution to the MLCAS 2025 GWFSS Challenge: The Devil is in the Detail and Minority
by: Cao, Songliang, et al.
Published: (2025)
by: Cao, Songliang, et al.
Published: (2025)
Finding Needles in Images: Can Multimodal LLMs Locate Fine Details?
by: Thakkar, Parth, et al.
Published: (2025)
by: Thakkar, Parth, et al.
Published: (2025)
Block and Detail: Scaffolding Sketch-to-Image Generation
by: Sarukkai, Vishnu, et al.
Published: (2024)
by: Sarukkai, Vishnu, et al.
Published: (2024)
Benchmarking and Improving Detail Image Caption
by: Dong, Hongyuan, et al.
Published: (2024)
by: Dong, Hongyuan, et al.
Published: (2024)
The Devil is in the Details: Boosting Guided Depth Super-Resolution via Rethinking Cross-Modal Alignment and Aggregation
by: Jiang, Xinni, et al.
Published: (2024)
by: Jiang, Xinni, et al.
Published: (2024)
Missing Fine Details in Images: Last Seen in High Frequencies
by: Medi, Tejaswini, et al.
Published: (2025)
by: Medi, Tejaswini, et al.
Published: (2025)
The Devil is in the Details -- From OCR for Old Church Slavonic to Purely Visual Stemma Reconstruction
by: Hoenen, Armin
Published: (2026)
by: Hoenen, Armin
Published: (2026)
Beyond Illumination: Fine-Grained Detail Preservation in Extreme Dark Image Restoration
by: Zhang, Tongshun, et al.
Published: (2025)
by: Zhang, Tongshun, et al.
Published: (2025)
No Detail Left Behind: Revisiting Self-Retrieval for Fine-Grained Image Captioning
by: Gaur, Manu, et al.
Published: (2024)
by: Gaur, Manu, et al.
Published: (2024)
Detail++: Training-Free Detail Enhancer for Text-to-Image Diffusion Models
by: Chen, Lifeng, et al.
Published: (2025)
by: Chen, Lifeng, et al.
Published: (2025)
MM-SeR: Multimodal Self-Refinement for Lightweight Image Captioning
by: Song, Junha, et al.
Published: (2025)
by: Song, Junha, et al.
Published: (2025)
Generating Fine Details of Entity Interactions
by: Gu, Xinyi, et al.
Published: (2025)
by: Gu, Xinyi, et al.
Published: (2025)
Similar Items
-
Vector Prism: Animating Vector Graphics by Stratifying Semantic Structure
by: Yun, Jooyeol, et al.
Published: (2025) -
Scaling Up Personalized Image Aesthetic Assessment via Task Vector Customization
by: Yun, Jooyeol, et al.
Published: (2024) -
Skip-and-Play: Depth-Driven Pose-Preserved Image Generation for Any Objects
by: Jo, Kyungmin, et al.
Published: (2024) -
Enabling Region-Specific Control via Lassos in Point-Based Colorization
by: Lee, Sanghyeon, et al.
Published: (2024) -
SphereDiff: Tuning-free 360° Static and Dynamic Panorama Generation via Spherical Latent Representation
by: Park, Minho, et al.
Published: (2025)