Saved in:
| Main Authors: | Chan, Adrian, Mijar, Anupam, Saeed, Mehreen, Wong, Chau-Wai, Khater, Akram |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2410.02179 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Muharaf: Manuscripts of Handwritten Arabic Dataset for Cursive Text Recognition
by: Saeed, Mehreen, et al.
Published: (2024)
by: Saeed, Mehreen, et al.
Published: (2024)
Handwritten Text Recognition of Historical Manuscripts Using Transformer-Based Models
by: Meoded, Erez
Published: (2025)
by: Meoded, Erez
Published: (2025)
Unlocking the Archives: Using Large Language Models to Transcribe Handwritten Historical Documents
by: Humphries, Mark, et al.
Published: (2024)
by: Humphries, Mark, et al.
Published: (2024)
HTR-ConvText: Leveraging Convolution and Textual Information for Handwritten Text Recognition
by: Truc, Pham Thach Thanh, et al.
Published: (2025)
by: Truc, Pham Thach Thanh, et al.
Published: (2025)
WriteViT: Handwritten Text Generation with Vision Transformer
by: Nam, Dang Hoai, et al.
Published: (2025)
by: Nam, Dang Hoai, et al.
Published: (2025)
Optimal Transport for Handwritten Text Recognition in a Low-Resource Regime
by: Wraight, Petros Georgoulas, et al.
Published: (2025)
by: Wraight, Petros Georgoulas, et al.
Published: (2025)
Lumos : Empowering Multimodal LLMs with Scene Text Recognition
by: Shenoy, Ashish, et al.
Published: (2024)
by: Shenoy, Ashish, et al.
Published: (2024)
Arabic Handwritten Text for Person Biometric Identification: A Deep Learning Approach
by: Balat, Mazen, et al.
Published: (2024)
by: Balat, Mazen, et al.
Published: (2024)
Multimodal Arabic Captioning with Interpretable Visual Concept Integration
by: Elchafei, Passant, et al.
Published: (2025)
by: Elchafei, Passant, et al.
Published: (2025)
GatedLexiconNet: A Comprehensive End-to-End Handwritten Paragraph Text Recognition System
by: Kumari, Lalita, et al.
Published: (2024)
by: Kumari, Lalita, et al.
Published: (2024)
A Framework For Refining Text Classification and Object Recognition from Academic Articles
by: Li, Jinghong, et al.
Published: (2023)
by: Li, Jinghong, et al.
Published: (2023)
Efficient Scaling of Diffusion Transformers for Text-to-Image Generation
by: Li, Hao, et al.
Published: (2024)
by: Li, Hao, et al.
Published: (2024)
Graph Neural Network based Handwritten Trajectories Recognition
by: Sharma, Anuj, et al.
Published: (2024)
by: Sharma, Anuj, et al.
Published: (2024)
Text Role Classification in Scientific Charts Using Multimodal Transformers
by: Kim, Hye Jin, et al.
Published: (2024)
by: Kim, Hye Jin, et al.
Published: (2024)
Seeing Justice Clearly: Handwritten Legal Document Translation with OCR and Vision-Language Models
by: Nigam, Shubham Kumar, et al.
Published: (2025)
by: Nigam, Shubham Kumar, et al.
Published: (2025)
NAMER: Non-Autoregressive Modeling for Handwritten Mathematical Expression Recognition
by: Liu, Chenyu, et al.
Published: (2024)
by: Liu, Chenyu, et al.
Published: (2024)
A Sobel-Gradient MLP Baseline for Handwritten Character Recognition
by: Nouri, Azam
Published: (2025)
by: Nouri, Azam
Published: (2025)
JSTR: Judgment Improves Scene Text Recognition
by: Fujitake, Masato
Published: (2024)
by: Fujitake, Masato
Published: (2024)
Revisiting N-Gram Models: Their Impact in Modern Neural Networks for Handwritten Text Recognition
by: Tarride, Solène, et al.
Published: (2024)
by: Tarride, Solène, et al.
Published: (2024)
Quilt-1M: One Million Image-Text Pairs for Histopathology
by: Ikezogwo, Wisdom Oluchi, et al.
Published: (2023)
by: Ikezogwo, Wisdom Oluchi, et al.
Published: (2023)
Improving MLLM Historical Record Extraction with Test-Time Image
by: Archibald, Taylor, et al.
Published: (2025)
by: Archibald, Taylor, et al.
Published: (2025)
Advancing Offline Handwritten Text Recognition: A Systematic Review of Data Augmentation and Generation Techniques
by: Rassul, Yassin Hussein, et al.
Published: (2025)
by: Rassul, Yassin Hussein, et al.
Published: (2025)
Different Strokes for Different Folks: Writer Identification for Historical Arabic Manuscripts
by: Abushahla, Hamza A., et al.
Published: (2026)
by: Abushahla, Hamza A., et al.
Published: (2026)
KNN and ANN-based Recognition of Handwritten Pashto Letters using Zoning Features
by: Khan, Sulaiman, et al.
Published: (2019)
by: Khan, Sulaiman, et al.
Published: (2019)
Rotation-free Online Handwritten Character Recognition Using Linear Recurrent Units
by: Ling, Zhe, et al.
Published: (2026)
by: Ling, Zhe, et al.
Published: (2026)
Symbol-Aware Reasoning with Masked Discrete Diffusion for Handwritten Mathematical Expression Recognition
by: Kawakatsu, Takaya, et al.
Published: (2026)
by: Kawakatsu, Takaya, et al.
Published: (2026)
MathWriting: A Dataset For Handwritten Mathematical Expression Recognition
by: Gervais, Philippe, et al.
Published: (2024)
by: Gervais, Philippe, et al.
Published: (2024)
MediFact at MEDIQA-M3G 2024: Medical Question Answering in Dermatology with Multimodal Learning
by: Saeed, Nadia
Published: (2024)
by: Saeed, Nadia
Published: (2024)
BanglaNet: Bangla Handwritten Character Recognition using Ensembling of Convolutional Neural Network
by: Saha, Chandrika, et al.
Published: (2024)
by: Saha, Chandrika, et al.
Published: (2024)
HAND: Hierarchical Attention Network for Multi-Scale Handwritten Document Recognition and Layout Analysis
by: Hamdan, Mohammed, et al.
Published: (2024)
by: Hamdan, Mohammed, et al.
Published: (2024)
Beyond Thumbs Up/Down: Untangling Challenges of Fine-Grained Feedback for Text-to-Image Generation
by: Collins, Katherine M., et al.
Published: (2024)
by: Collins, Katherine M., et al.
Published: (2024)
Zero-Shot Vehicle Model Recognition via Text-Based Retrieval-Augmented Generation
by: Chang, Wei-Chia, et al.
Published: (2025)
by: Chang, Wei-Chia, et al.
Published: (2025)
CAMEL-Bench: A Comprehensive Arabic LMM Benchmark
by: Ghaboura, Sara, et al.
Published: (2024)
by: Ghaboura, Sara, et al.
Published: (2024)
Mostly Text, Smart Visuals: Asymmetric Text-Visual Pruning for Large Vision-Language Models
by: Li, Sijie, et al.
Published: (2026)
by: Li, Sijie, et al.
Published: (2026)
Indian Sign Language Recognition Using Mediapipe Holistic
by: G, Velmathi, et al.
Published: (2023)
by: G, Velmathi, et al.
Published: (2023)
Transformer-VQ: Linear-Time Transformers via Vector Quantization
by: Lingle, Lucas D.
Published: (2023)
by: Lingle, Lucas D.
Published: (2023)
PQPP: A Joint Benchmark for Text-to-Image Prompt and Query Performance Prediction
by: Poesina, Eduard, et al.
Published: (2024)
by: Poesina, Eduard, et al.
Published: (2024)
Remote Blood Oxygen Estimation From Videos Using Neural Networks
by: Mathew, Joshua, et al.
Published: (2021)
by: Mathew, Joshua, et al.
Published: (2021)
Generative Technology for Human Emotion Recognition: A Scope Review
by: Ma, Fei, et al.
Published: (2024)
by: Ma, Fei, et al.
Published: (2024)
Low-Resource Heuristics for Bahnaric Optical Character Recognition Improvement
by: Tran, Phat, et al.
Published: (2026)
by: Tran, Phat, et al.
Published: (2026)
Similar Items
-
Muharaf: Manuscripts of Handwritten Arabic Dataset for Cursive Text Recognition
by: Saeed, Mehreen, et al.
Published: (2024) -
Handwritten Text Recognition of Historical Manuscripts Using Transformer-Based Models
by: Meoded, Erez
Published: (2025) -
Unlocking the Archives: Using Large Language Models to Transcribe Handwritten Historical Documents
by: Humphries, Mark, et al.
Published: (2024) -
HTR-ConvText: Leveraging Convolution and Textual Information for Handwritten Text Recognition
by: Truc, Pham Thach Thanh, et al.
Published: (2025) -
WriteViT: Handwritten Text Generation with Vision Transformer
by: Nam, Dang Hoai, et al.
Published: (2025)