:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Zhang, Yuyi, Zhang, Peirong, Yang, Zhenhua, Yan, Pengyu, Shi, Yongxin, Liu, Pengwei, Guo, Fengjun, Jin, Lianwen
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence Computation and Language
Online Access:	https://arxiv.org/abs/2507.05108
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Predicting the Original Appearance of Damaged Historical Documents
by: Yang, Zhenhua, et al.
Published: (2024)

MegaHan97K: A Large-Scale Dataset for Mega-Category Chinese Character Recognition with over 97K Categories
by: Zhang, Yuyi, et al.
Published: (2025)

PosterVerse: A Full-Workflow Framework for Commercial-Grade Poster Generation with HTML-Based Scalable Typography
by: Liu, Junle, et al.
Published: (2026)

TongGu: Mastering Classical Chinese Understanding with Knowledge-Grounded Large Language Models
by: Cao, Jiahuan, et al.
Published: (2024)

C$^{3}$Bench: A Comprehensive Classical Chinese Understanding Benchmark for Large Language Models
by: Cao, Jiahuan, et al.
Published: (2024)

Online Writer Retrieval with Chinese Handwritten Phrases: A Synergistic Temporal-Frequency Representation Learning Approach
by: Zhang, Peirong, et al.
Published: (2024)

UPOCR: Towards Unified Pixel-Level OCR Interface
by: Peng, Dezhi, et al.
Published: (2023)

DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks
by: Zhang, Jiaxin, et al.
Published: (2024)

OCRGenBench: A Comprehensive Benchmark for Evaluating OCR Generative Capabilities
by: Zhang, Peirong, et al.
Published: (2025)

URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding
by: Shi, Yongxin, et al.
Published: (2025)

HierCode: A Lightweight Hierarchical Codebook for Zero-shot Chinese Text Recognition
by: Zhang, Yuyi, et al.
Published: (2024)

Capturing More: Learning Multi-Domain Representations for Robust Online Handwriting Verification
by: Zhang, Peirong, et al.
Published: (2025)

MCCD: A Multi-Attribute Chinese Calligraphy Character Dataset Annotated with Script Styles, Dynasties, and Calligraphers
by: Zhao, Yixin, et al.
Published: (2025)

Shared Heritage, Distinct Writing: Rethinking Resource Selection for East Asian Historical Documents
by: Song, Seyoung, et al.
Published: (2024)

Datasets for Large Language Models: A Comprehensive Survey
by: Liu, Yang, et al.
Published: (2024)

Revisiting Tampered Scene Text Detection in the Era of Generative AI
by: Qu, Chenfan, et al.
Published: (2024)

Omni-IML: Towards Unified Image Manipulation Localization
by: Qu, Chenfan, et al.
Published: (2024)

Musical Heritage Historical Entity Linking
by: Graciotti, Arianna, et al.
Published: (2025)

Privacy-Preserving Biometric Verification with Handwritten Random Digit String
by: Zhang, Peirong, et al.
Published: (2025)

Bi-VLDoc: Bidirectional Vision-Language Modeling for Visually-Rich Document Understanding
by: Luo, Chuwei, et al.
Published: (2022)

PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction
by: Lin, Zening, et al.
Published: (2024)

DiffChat: Learning to Chat with Text-to-Image Synthesis Models for Interactive Image Creation
by: Wang, Jiapeng, et al.
Published: (2024)

VideoVista-CulturalLingo: 360$^\circ$ Horizons-Bridging Cultures, Languages, and Domains in Video Comprehension
by: Chen, Xinyu, et al.
Published: (2025)

Learning to Erase Private Knowledge from Multi-Documents for Retrieval-Augmented Large Language Models
by: Wang, Yujing, et al.
Published: (2025)

VideoCLIP-XL: Advancing Long Description Understanding for Video CLIP Models
by: Wang, Jiapeng, et al.
Published: (2024)

SEAL: Can Saturated Benchmarks Be Revived by LLM-as-a-Meta-Judge?
by: Chen, Jiamin, et al.
Published: (2026)

Multi-Physics: A Comprehensive Benchmark for Multimodal LLMs Reasoning on Chinese Multi-Subject Physics Problems
by: Luo, Zhongze, et al.
Published: (2025)

Innovating China's Intangible Cultural Heritage with DeepSeek + MidJourney: The Case of Yangliuqing theme Woodblock Prints
by: Yang, RuiKun, et al.
Published: (2025)

Response Attack: Exploiting Contextual Priming to Jailbreak Large Language Models
by: Miao, Ziqi, et al.
Published: (2025)

Semisupervised Neural Proto-Language Reconstruction
by: Lu, Liang, et al.
Published: (2024)

Don't Erase, Inform! Detecting and Contextualizing Harmful Language in Cultural Heritage Collections
by: Mastromichalakis, Orfeas Menis, et al.
Published: (2025)

HERITAGE: An End-to-End Web Platform for Processing Korean Historical Documents in Hanja
by: Song, Seyoung, et al.
Published: (2025)

An Investigation into Value Misalignment in LLM-Generated Texts for Cultural Heritage
by: Bu, Fan, et al.
Published: (2025)

Reviving Any-Subset Autoregressive Models with Principled Parallel Sampling and Speculative Decoding
by: Guo, Gabe, et al.
Published: (2025)

ICH-Qwen: A Large Language Model Towards Chinese Intangible Cultural Heritage
by: Ye, Wenhao, et al.
Published: (2025)

Deciphering Oracle Bone Language with Diffusion Models
by: Guan, Haisu, et al.
Published: (2024)

Translating Hanja Historical Documents to Contemporary Korean and English
by: Son, Juhee, et al.
Published: (2022)

Cultural Biases of Large Language Models and Humans in Historical Interpretation
by: Celli, Fabio, et al.
Published: (2025)

OCRBench: On the Hidden Mystery of OCR in Large Multimodal Models
by: Liu, Yuliang, et al.
Published: (2023)

Read as You See: Guiding Unimodal LLMs for Low-Resource Explainable Harmful Meme Detection
by: Pan, Fengjun, et al.
Published: (2025)