Saved in:
| Main Authors: | Cao, Songliang, Hu, Tianqi, Lu, Hao |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2508.17305 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Plant Taxonomy Meets Plant Counting: A Fine-Grained, Taxonomic Dataset for Counting Hundreds of Plant Species
by: Xu, Jinyu, et al.
Published: (2026)
by: Xu, Jinyu, et al.
Published: (2026)
MVSFormer++: Revealing the Devil in Transformer's Details for Multi-View Stereo
by: Cao, Chenjie, et al.
Published: (2024)
by: Cao, Chenjie, et al.
Published: (2024)
Differentiable JPEG: The Devil is in the Details
by: Reich, Christoph, et al.
Published: (2023)
by: Reich, Christoph, et al.
Published: (2023)
The Devil is in the EOS: Sequence Training for Detailed Image Captioning
by: Mohamed, Abdelrahman, et al.
Published: (2025)
by: Mohamed, Abdelrahman, et al.
Published: (2025)
SAMSON: 3rd Place Solution of LSVOS 2025 VOS Challenge
by: Xie, Yujie, et al.
Published: (2025)
by: Xie, Yujie, et al.
Published: (2025)
First-Place Solution to NeurIPS 2024 Invisible Watermark Removal Challenge
by: Shamshad, Fahad, et al.
Published: (2025)
by: Shamshad, Fahad, et al.
Published: (2025)
The Devil is in the Details: Simple Remedies for Image-to-LiDAR Representation Learning
by: Jo, Wonjun, et al.
Published: (2025)
by: Jo, Wonjun, et al.
Published: (2025)
The Devil is in the Details: StyleFeatureEditor for Detail-Rich StyleGAN Inversion and High Quality Image Editing
by: Bobkov, Denis, et al.
Published: (2024)
by: Bobkov, Denis, et al.
Published: (2024)
Multimodal Reasoning for Science: Technical Report and 1st Place Solution to the ICML 2025 SeePhys Challenge
by: Liang, Hao, et al.
Published: (2025)
by: Liang, Hao, et al.
Published: (2025)
First Place Solution to the ECCV 2024 ROAD++ Challenge @ ROAD++ Spatiotemporal Agent Detection 2024
by: Zhang, Tengfei, et al.
Published: (2024)
by: Zhang, Tengfei, et al.
Published: (2024)
First Place Solution to the ECCV 2024 ROAD++ Challenge @ ROAD++ Atomic Activity Recognition 2024
by: Li, Ruyang, et al.
Published: (2024)
by: Li, Ruyang, et al.
Published: (2024)
Second Place Solution of WSDM2023 Toloka Visual Question Answering Challenge
by: Wu, Xiangyu, et al.
Published: (2024)
by: Wu, Xiangyu, et al.
Published: (2024)
The Devil is in the Details -- From OCR for Old Church Slavonic to Purely Visual Stemma Reconstruction
by: Hoenen, Armin
Published: (2026)
by: Hoenen, Armin
Published: (2026)
The Instance-centric Transformer for the RVOS Track of LSVOS Challenge: 3rd Place Solution
by: Cao, Bin, et al.
Published: (2024)
by: Cao, Bin, et al.
Published: (2024)
Devil is in the Detail: Towards Injecting Fine Details of Image Prompt in Image Generation via Conflict-free Guidance and Stratified Attention
by: Jo, Kyungmin, et al.
Published: (2025)
by: Jo, Kyungmin, et al.
Published: (2025)
DepthCropSeg++: Scaling a Crop Segmentation Foundation Model With Depth-Labeled Data
by: Zhang, Jiafei, et al.
Published: (2026)
by: Zhang, Jiafei, et al.
Published: (2026)
The Devil is in the Details: Boosting Guided Depth Super-Resolution via Rethinking Cross-Modal Alignment and Aggregation
by: Jiang, Xinni, et al.
Published: (2024)
by: Jiang, Xinni, et al.
Published: (2024)
Cell Instance Segmentation: The Devil Is in the Boundaries
by: Liang, Peixian, et al.
Published: (2025)
by: Liang, Peixian, et al.
Published: (2025)
1st Place Solution to the 1st SkatingVerse Challenge
by: Sun, Tao, et al.
Published: (2024)
by: Sun, Tao, et al.
Published: (2024)
SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis
by: Peng, Ziqiao, et al.
Published: (2023)
by: Peng, Ziqiao, et al.
Published: (2023)
SVC 2025: the First Multimodal Deception Detection Challenge
by: Lin, Xun, et al.
Published: (2025)
by: Lin, Xun, et al.
Published: (2025)
3rd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation
by: Wu, Ruipu, et al.
Published: (2024)
by: Wu, Ruipu, et al.
Published: (2024)
2nd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation
by: Wu, Biao, et al.
Published: (2024)
by: Wu, Biao, et al.
Published: (2024)
3rd Place Solution for MeViS Track in CVPR 2024 PVUW workshop: Motion Expression guided Video Segmentation
by: Pan, Feiyu, et al.
Published: (2024)
by: Pan, Feiyu, et al.
Published: (2024)
First Place Solution to the Multiple-choice Video QA Track of The Second Perception Test Challenge
by: Peng, Yingzhe, et al.
Published: (2024)
by: Peng, Yingzhe, et al.
Published: (2024)
FVOS for MOSE Track of 4th PVUW Challenge: 3rd Place Solution
by: Wang, Mengjiao, et al.
Published: (2025)
by: Wang, Mengjiao, et al.
Published: (2025)
Evaluation of Winning Solutions of 2025 Low Power Computer Vision Challenge
by: Ye, Zihao, et al.
Published: (2026)
by: Ye, Zihao, et al.
Published: (2026)
X-Restormer++: 1st Place Solution for the UG2+ CVPR 2026 All-Weather Restoration Challenge
by: Pan, Youwei, et al.
Published: (2026)
by: Pan, Youwei, et al.
Published: (2026)
2nd Place Report of MOSEv2 Challenge 2025: Concept Guided Video Object Segmentation via SeC
by: Zhang, Zhixiong, et al.
Published: (2025)
by: Zhang, Zhixiong, et al.
Published: (2025)
3rd Place Solution for VisDA 2021 Challenge -- Universally Domain Adaptive Image Recognition
by: Liao, Haojin, et al.
Published: (2021)
by: Liao, Haojin, et al.
Published: (2021)
Devil is in Details: Locality-Aware 3D Abdominal CT Volume Generation for Self-Supervised Organ Segmentation
by: Wang, Yuran, et al.
Published: (2024)
by: Wang, Yuran, et al.
Published: (2024)
UNINEXT-Cutie: The 1st Solution for LSVOS Challenge RVOS Track
by: Fang, Hao, et al.
Published: (2024)
by: Fang, Hao, et al.
Published: (2024)
1st Place Solution for ICCV 2023 OmniObject3D Challenge: Sparse-View Reconstruction
by: Du, Hang, et al.
Published: (2024)
by: Du, Hang, et al.
Published: (2024)
Task adaptation of Vision-Language-Action model: 1st Place Solution for the 2025 BEHAVIOR Challenge
by: Larchenko, Ilia, et al.
Published: (2025)
by: Larchenko, Ilia, et al.
Published: (2025)
The Devil is in the Few Shots: Iterative Visual Knowledge Completion for Few-shot Learning
by: Li, Yaohui, et al.
Published: (2024)
by: Li, Yaohui, et al.
Published: (2024)
The Devil is in Temporal Token: High Quality Video Reasoning Segmentation
by: Gong, Sitong, et al.
Published: (2025)
by: Gong, Sitong, et al.
Published: (2025)
1st Place Solution of Multiview Egocentric Hand Tracking Challenge ECCV2024
by: Zou, Minqiang, et al.
Published: (2024)
by: Zou, Minqiang, et al.
Published: (2024)
The Devil is in Fine-tuning and Long-tailed Problems:A New Benchmark for Scene Text Detection
by: Cao, Tianjiao, et al.
Published: (2025)
by: Cao, Tianjiao, et al.
Published: (2025)
First Place Solution to the ECCV 2024 BRAVO Challenge: Evaluating Robustness of Vision Foundation Models for Semantic Segmentation
by: Kerssies, Tommie, et al.
Published: (2024)
by: Kerssies, Tommie, et al.
Published: (2024)
Adapting SAM 2 for Visual Object Tracking: 1st Place Solution for MMVPR Challenge Multi-Modal Tracking
by: Yang, Cheng-Yen, et al.
Published: (2025)
by: Yang, Cheng-Yen, et al.
Published: (2025)
Similar Items
-
Plant Taxonomy Meets Plant Counting: A Fine-Grained, Taxonomic Dataset for Counting Hundreds of Plant Species
by: Xu, Jinyu, et al.
Published: (2026) -
MVSFormer++: Revealing the Devil in Transformer's Details for Multi-View Stereo
by: Cao, Chenjie, et al.
Published: (2024) -
Differentiable JPEG: The Devil is in the Details
by: Reich, Christoph, et al.
Published: (2023) -
The Devil is in the EOS: Sequence Training for Detailed Image Captioning
by: Mohamed, Abdelrahman, et al.
Published: (2025) -
SAMSON: 3rd Place Solution of LSVOS 2025 VOS Challenge
by: Xie, Yujie, et al.
Published: (2025)