Saved in:
Bibliographic Details
Main Authors: Chen, Hong Cai, Wu, Longchang, Zhang, Yang
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2502.16541
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866917934399488000
author Chen, Hong Cai
Wu, Longchang
Zhang, Yang
author_facet Chen, Hong Cai
Wu, Longchang
Zhang, Yang
contents When designing circuits, engineers obtain the information of electronic devices by browsing a large number of documents, which is low efficiency and heavy workload. The use of artificial intelligence technology to automatically parse documents can greatly improve the efficiency of engineers. However, the current document layout analysis model is aimed at various types of documents and is not suitable for electronic device documents. This paper proposes to use EDocNet to realize the document layout analysis function for document analysis, and use the electronic device document data set created by myself for training. The training method adopts the focus and global knowledge distillation method, and a model suitable for electronic device documents is obtained, which can divide the contents of electronic device documents into 21 categories. It has better average accuracy and average recall rate. It also greatly improves the speed of model checking.
format Preprint
id arxiv_https___arxiv_org_abs_2502_16541
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle EDocNet: Efficient Datasheet Layout Analysis Based on Focus and Global Knowledge Distillation
Chen, Hong Cai
Wu, Longchang
Zhang, Yang
Computer Vision and Pattern Recognition
Machine Learning
When designing circuits, engineers obtain the information of electronic devices by browsing a large number of documents, which is low efficiency and heavy workload. The use of artificial intelligence technology to automatically parse documents can greatly improve the efficiency of engineers. However, the current document layout analysis model is aimed at various types of documents and is not suitable for electronic device documents. This paper proposes to use EDocNet to realize the document layout analysis function for document analysis, and use the electronic device document data set created by myself for training. The training method adopts the focus and global knowledge distillation method, and a model suitable for electronic device documents is obtained, which can divide the contents of electronic device documents into 21 categories. It has better average accuracy and average recall rate. It also greatly improves the speed of model checking.
title EDocNet: Efficient Datasheet Layout Analysis Based on Focus and Global Knowledge Distillation
topic Computer Vision and Pattern Recognition
Machine Learning
url https://arxiv.org/abs/2502.16541