Saved in:
| Main Authors: | Ju, Rui-Yang, Yamashita, Kohei, Kameko, Hirotaka, Mori, Shinsuke |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.19086 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
DKDS: A Benchmark Dataset of Degraded Kuzushiji Documents with Seals for Detection and Binarization
by: Ju, Rui-Yang, et al.
Published: (2025)
by: Ju, Rui-Yang, et al.
Published: (2025)
Recipe Generation from Unsegmented Cooking Videos
by: Nishimura, Taichi, et al.
Published: (2022)
by: Nishimura, Taichi, et al.
Published: (2022)
Character Recognition in Byzantine Seals with Deep Neural Networks
by: Rageau, Théophile, et al.
Published: (2024)
by: Rageau, Théophile, et al.
Published: (2024)
EgoOops: A Dataset for Mistake Action Detection from Egocentric Videos referring to Procedural Texts
by: Haneji, Yuto, et al.
Published: (2024)
by: Haneji, Yuto, et al.
Published: (2024)
BioVL-QR: Egocentric Biochemical Vision-and-Language Dataset Using Micro QR Codes
by: Nishimoto, Tomohiro, et al.
Published: (2024)
by: Nishimoto, Tomohiro, et al.
Published: (2024)
PhyG-MoE: A Physics-Guided Mixture-of-Experts Framework for Energy-Efficient GNSS Interference Recognition
by: Zeng, Zhihan, et al.
Published: (2026)
by: Zeng, Zhihan, et al.
Published: (2026)
Correspondences of the Third Kind: Camera Pose Estimation from Object Reflection
by: Yamashita, Kohei, et al.
Published: (2023)
by: Yamashita, Kohei, et al.
Published: (2023)
LOCR: Location-Guided Transformer for Optical Character Recognition
by: Sun, Yu, et al.
Published: (2024)
by: Sun, Yu, et al.
Published: (2024)
MovieCharacter: A Tuning-Free Framework for Controllable Character Video Synthesis
by: Qiu, Di, et al.
Published: (2024)
by: Qiu, Di, et al.
Published: (2024)
Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision
by: Yoshida, Tomoya, et al.
Published: (2025)
by: Yoshida, Tomoya, et al.
Published: (2025)
Focus on the Whole Character: Discriminative Character Modeling for Scene Text Recognition
by: Zhou, Bangbang, et al.
Published: (2024)
by: Zhou, Bangbang, et al.
Published: (2024)
Confidence-Guided Diffusion Augmentation for Enhanced Bangla Compound Character Recognition
by: Rayhan, Md. Sultan Al
Published: (2026)
by: Rayhan, Md. Sultan Al
Published: (2026)
M-PhyGs: Multi-Material Object Dynamics from Video
by: Wada, Norika, et al.
Published: (2025)
by: Wada, Norika, et al.
Published: (2025)
Hierarchical Object Detection and Recognition Framework for Practical Plant Disease Diagnosis
by: Iwano, Kohei, et al.
Published: (2024)
by: Iwano, Kohei, et al.
Published: (2024)
Image Over Text: Transforming Formula Recognition Evaluation with Character Detection Matching
by: Wang, Bin, et al.
Published: (2024)
by: Wang, Bin, et al.
Published: (2024)
Text-driven Affordance Learning from Egocentric Vision
by: Yoshida, Tomoya, et al.
Published: (2024)
by: Yoshida, Tomoya, et al.
Published: (2024)
MAtCha Gaussians: Atlas of Charts for High-Quality Geometry and Photorealism From Sparse Views
by: Guédon, Antoine, et al.
Published: (2024)
by: Guédon, Antoine, et al.
Published: (2024)
Vision-Language Model Guided Image Restoration
by: Yang, Cuixin, et al.
Published: (2025)
by: Yang, Cuixin, et al.
Published: (2025)
AllRestorer: All-in-One Transformer for Image Restoration under Composite Degradations
by: Mao, Jiawei, et al.
Published: (2024)
by: Mao, Jiawei, et al.
Published: (2024)
InstantCharacter: Personalize Any Characters with a Scalable Diffusion Transformer Framework
by: Tao, Jiale, et al.
Published: (2025)
by: Tao, Jiale, et al.
Published: (2025)
ORB-SfMLearner: ORB-Guided Self-supervised Visual Odometry with Selective Online Adaptation
by: Jin, Yanlin, et al.
Published: (2024)
by: Jin, Yanlin, et al.
Published: (2024)
Through the Lens of Character: Resolving Modality-Role Interference in Multimodal Role-Playing Agent
by: Tang, Yihong, et al.
Published: (2026)
by: Tang, Yihong, et al.
Published: (2026)
Invizo: Arabic Handwritten Document Optical Character Recognition Solution
by: Waly, Alhossien, et al.
Published: (2025)
by: Waly, Alhossien, et al.
Published: (2025)
AMR-CCR: Anchored Modular Retrieval for Continual Chinese Character Recognition
by: Wu, Yuchuan, et al.
Published: (2026)
by: Wu, Yuchuan, et al.
Published: (2026)
Unsupervised Attention Regularization Based Domain Adaptation for Oracle Character Recognition
by: Wang, Mei, et al.
Published: (2024)
by: Wang, Mei, et al.
Published: (2024)
High Cursive Complex Character Recognition using GAN External Classifier
by: Rafiuddin, S M
Published: (2025)
by: Rafiuddin, S M
Published: (2025)
Dual Associated Encoder for Face Restoration
by: Tsai, Yu-Ju, et al.
Published: (2023)
by: Tsai, Yu-Ju, et al.
Published: (2023)
Character-Adapter: Prompt-Guided Region Control for High-Fidelity Character Customization
by: Ma, Yuhang, et al.
Published: (2024)
by: Ma, Yuhang, et al.
Published: (2024)
Rotation-free Online Handwritten Character Recognition Using Linear Recurrent Units
by: Ling, Zhe, et al.
Published: (2026)
by: Ling, Zhe, et al.
Published: (2026)
W-Net: One-Shot Arbitrary-Style Chinese Character Generation with Deep Neural Networks
by: Jiang, Haochuan, et al.
Published: (2024)
by: Jiang, Haochuan, et al.
Published: (2024)
Detailed Geometry and Appearance from Opportunistic Motion
by: Hirai, Ryosuke, et al.
Published: (2026)
by: Hirai, Ryosuke, et al.
Published: (2026)
Benchmarking Vision-Language Models on Optical Character Recognition in Dynamic Video Environments
by: Nagaonkar, Sankalp, et al.
Published: (2025)
by: Nagaonkar, Sankalp, et al.
Published: (2025)
ICAL: Implicit Character-Aided Learning for Enhanced Handwritten Mathematical Expression Recognition
by: Zhu, Jianhua, et al.
Published: (2024)
by: Zhu, Jianhua, et al.
Published: (2024)
Seal2Real: Prompt Prior Learning on Diffusion Model for Unsupervised Document Seal Data Generation and Realisation
by: Yan, Mingfu, et al.
Published: (2023)
by: Yan, Mingfu, et al.
Published: (2023)
Multi-task Image Restoration Guided By Robust DINO Features
by: Lin, Xin, et al.
Published: (2023)
by: Lin, Xin, et al.
Published: (2023)
Logios : An open source Greek Polytonic Optical Character Recognition system
by: Konstantinos, Perifanos, et al.
Published: (2025)
by: Konstantinos, Perifanos, et al.
Published: (2025)
OneRestore: A Universal Restoration Framework for Composite Degradation
by: Guo, Yu, et al.
Published: (2024)
by: Guo, Yu, et al.
Published: (2024)
Improving Chinese Character Representation with Formation Tree
by: Hong, Yang, et al.
Published: (2024)
by: Hong, Yang, et al.
Published: (2024)
Fracture Detection in Pediatric Wrist Trauma X-ray Images Using YOLOv8 Algorithm
by: Ju, Rui-Yang, et al.
Published: (2023)
by: Ju, Rui-Yang, et al.
Published: (2023)
Seeing the Unseen: A Frequency Prompt Guided Transformer for Image Restoration
by: Zhou, Shihao, et al.
Published: (2024)
by: Zhou, Shihao, et al.
Published: (2024)
Similar Items
-
DKDS: A Benchmark Dataset of Degraded Kuzushiji Documents with Seals for Detection and Binarization
by: Ju, Rui-Yang, et al.
Published: (2025) -
Recipe Generation from Unsegmented Cooking Videos
by: Nishimura, Taichi, et al.
Published: (2022) -
Character Recognition in Byzantine Seals with Deep Neural Networks
by: Rageau, Théophile, et al.
Published: (2024) -
EgoOops: A Dataset for Mistake Action Detection from Egocentric Videos referring to Procedural Texts
by: Haneji, Yuto, et al.
Published: (2024) -
BioVL-QR: Egocentric Biochemical Vision-and-Language Dataset Using Micro QR Codes
by: Nishimoto, Tomohiro, et al.
Published: (2024)