| Main Author: | Hoenen, Armin |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.11724 |
Similar Items
Differentiable JPEG: The Devil is in the Details
by: Reich, Christoph, et al.
Published: (2023)
The Devil is in the EOS: Sequence Training for Detailed Image Captioning
by: Mohamed, Abdelrahman, et al.
Published: (2025)
MVSFormer++: Revealing the Devil in Transformer's Details for Multi-View Stereo
by: Cao, Chenjie, et al.
Published: (2024)
The Devil is in the Details: Simple Remedies for Image-to-LiDAR Representation Learning
by: Jo, Wonjun, et al.
Published: (2025)
The Devil is in the Details: StyleFeatureEditor for Detail-Rich StyleGAN Inversion and High Quality Image Editing
by: Bobkov, Denis, et al.
Published: (2024)
First Place Solution to the MLCAS 2025 GWFSS Challenge: The Devil is in the Detail and Minority
by: Cao, Songliang, et al.
Published: (2025)
Devil is in the Detail: Towards Injecting Fine Details of Image Prompt in Image Generation via Conflict-free Guidance and Stratified Attention
by: Jo, Kyungmin, et al.
Published: (2025)
The Devil is in the Details: Boosting Guided Depth Super-Resolution via Rethinking Cross-Modal Alignment and Aggregation
by: Jiang, Xinni, et al.
Published: (2024)
DevilSight: Augmenting Monocular Human Avatar Reconstruction through a Virtual Perspective
by: Chen, Yushuo, et al.
Published: (2025)
The Devil is in the Few Shots: Iterative Visual Knowledge Completion for Few-shot Learning
by: Li, Yaohui, et al.
Published: (2024)
DeepSeek-OCR 2: Visual Causal Flow
by: Wei, Haoran, et al.
Published: (2026)
Cell Instance Segmentation: The Devil Is in the Boundaries
by: Liang, Peixian, et al.
Published: (2025)
Devil is in Details: Locality-Aware 3D Abdominal CT Volume Generation for Self-Supervised Organ Segmentation
by: Wang, Yuran, et al.
Published: (2024)
Hierarchical Visual Feature Aggregation for OCR-Free Document Understanding
by: Park, Jaeyoo, et al.
Published: (2024)
OCR-Agent: Agentic OCR with Capability and Memory Reflection
by: Wen, Shimin, et al.
Published: (2026)
OmniOCR: Generalist OCR for Ethnic Minority Languages
by: Liu, Bonan, et al.
Published: (2026)
Monocular Online Reconstruction with Enhanced Detail Preservation
by: Wu, Songyin, et al.
Published: (2025)
Preserving Old Memories in Vivid Detail: Human-Interactive Photo Restoration Framework
by: Back, Seung-Yeon, et al.
Published: (2024)
3D Surface Reconstruction with Enhanced High-Frequency Details
by: Zhang, Shikun, et al.
Published: (2025)
Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation
by: Li, Yongkang, et al.
Published: (2024)
SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis
by: Peng, Ziqiao, et al.
Published: (2023)
3D Face Reconstruction With Geometry Details From a Single Color Image Under Occluded Scenes
by: Zhao, Dapeng, et al.
Published: (2024)
Context-Independent OCR with Multimodal LLMs: Effects of Image Resolution and Visual Complexity
by: Inoue, Kotaro
Published: (2025)
FinCriticalED: A Visual Benchmark for Financial Fact-Level OCR
by: He, Yueru, et al.
Published: (2025)
DEVICE: Depth and Visual Concepts Aware Transformer for OCR-based Image Captioning
by: Xu, Dongsheng, et al.
Published: (2023)
Agentar-Fin-OCR
by: Qian, Siyi, et al.
Published: (2026)
MILo: Mesh-In-the-Loop Gaussian Splatting for Detailed and Efficient Surface Reconstruction
by: Guédon, Antoine, et al.
Published: (2025)
DebSDF: Delving into the Details and Bias of Neural Indoor Scene Reconstruction
by: Xiao, Yuting, et al.
Published: (2023)
ROSA: Reconstructing Object Shape and Appearance Textures by Adaptive Detail Transfer
by: Kaltheuner, Julian, et al.
Published: (2025)
When Good OCR Is Not Enough: Benchmarking OCR Robustness for Retrieval-Augmented Generation
by: Sun, Lin, et al.
Published: (2026)
olmOCR 2: Unit Test Rewards for Document OCR
by: Poznanski, Jake, et al.
Published: (2025)
OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation
by: Zhang, Junyuan, et al.
Published: (2024)
The Devil is in the Spurious Correlations: Boosting Moment Retrieval with Dynamic Learning
by: Zhou, Xinyang, et al.
Published: (2025)
Counterfeit Answers: Adversarial Forgery against OCR-Free Document Visual Question Answering
by: Pintore, Marco, et al.
Published: (2025)
CorrDetail: Visual Detail Enhanced Self-Correction for Face Forgery Detection
by: Zhou, Binjia, et al.
Published: (2025)
MME-VideoOCR: Evaluating OCR-Based Capabilities of Multimodal LLMs in Video Scenarios
by: Shi, Yang, et al.
Published: (2025)
Ocean-OCR: Towards General OCR Application via a Vision-Language Model
by: Chen, Song, et al.
Published: (2025)
TC-OCR: TableCraft OCR for Efficient Detection & Recognition of Table Structure & Content
by: Anand, Avinash, et al.
Published: (2024)
Visual Merit or Linguistic Crutch? A Close Look at DeepSeek-OCR
by: Liang, Yunhao, et al.
Published: (2026)
SimpleOCR: Rendering Visualized Questions to Teach MLLMs to Read
by: Peng, Yibo, et al.
Published: (2026)