| Main Authors: | Hu, Kai, Wang, Jiawei, Lin, Weihong, Zhong, Zhuoyao, Sun, Lei, Huo, Qiang |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2401.09220 |
Similar Items
Detect-Order-Construct: A Tree Construction based Approach for Hierarchical Document Structure Analysis
by: Wang, Jiawei, et al.
Published: (2024)
UniHDSA: A Unified Relation Prediction Approach for Hierarchical Document Structure Analysis
by: Wang, Jiawei, et al.
Published: (2025)
Dynamic Relation Transformer for Contextual Text Block Detection
by: Wang, Jiawei, et al.
Published: (2024)
Efficient Medical VIE via Reinforcement Learning
by: Liu, Lijun, et al.
Published: (2025)
PAGED: A Benchmark for Procedural Graphs Extraction from Documents
by: Du, Weihong, et al.
Published: (2024)
UniFine: A Unified and Fine-grained Approach for Zero-shot Vision-Language Understanding
by: Wang, Zhecan, et al.
Published: (2023)
DLAFormer: An End-to-End Transformer For Document Layout Analysis
by: Wang, Jiawei, et al.
Published: (2024)
Uni-MoE: Scaling Unified Multimodal LLMs with Mixture of Experts
by: Li, Yunxin, et al.
Published: (2024)
READoc: A Unified Benchmark for Realistic Document Structured Extraction
by: Li, Zichao, et al.
Published: (2024)
UNIKIE-BENCH: Benchmarking Large Multimodal Models for Key Information Extraction in Visual Documents
by: Ji, Yifan, et al.
Published: (2026)
UniMEEC: Towards Unified Multimodal Emotion Recognition and Emotion Cause
by: Hu, Guimin, et al.
Published: (2024)
Information Extraction from Heterogeneous Documents without Ground Truth Labels using Synthetic Label Generation and Knowledge Distillation
by: Bhattacharyya, Aniket, et al.
Published: (2024)
RoboUniView: Visual-Language Model with Unified View Representation for Robotic Manipulation
by: Liu, Fanfan, et al.
Published: (2024)
PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction
by: Lin, Zening, et al.
Published: (2024)
CausalEmbed: Auto-Regressive Multi-Vector Generation in Latent Space for Visual Document Embedding
by: Huo, Jiahao, et al.
Published: (2026)
UniVS: Unified and Universal Video Segmentation with Prompts as Queries
by: Li, Minghan, et al.
Published: (2024)
Lightweight Spatial Modeling for Combinatorial Information Extraction From Documents
by: Dong, Yanfei, et al.
Published: (2024)
A Positive-Unlabeled Metric Learning Framework for Document-Level Relation Extraction with Incomplete Labeling
by: Wang, Ye, et al.
Published: (2023)
UniCL: A Universal Contrastive Learning Framework for Large Time Series Models
by: Li, Jiawei, et al.
Published: (2024)
UniVLR: Unifying Text and Vision in Visual Latent Reasoning for Multimodal LLMs
by: Jiang, Houcheng, et al.
Published: (2026)
Span-Oriented Information Extraction -- A Unifying Perspective on Information Extraction
by: Ding, Yifan, et al.
Published: (2024)
UniICL: An Efficient Unified Framework Unifying Compression, Selection, and Generation
by: Gao, Jun, et al.
Published: (2024)
UniWorld-V1: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation
by: Lin, Bin, et al.
Published: (2025)
UniBridge: A Unified Approach to Cross-Lingual Transfer Learning for Low-Resource Languages
by: Pham, Trinh, et al.
Published: (2024)
Beyond Static Alignment: Hierarchical Policy Control for LLM Safety via Risk-Aware Chain-of-Thought
by: Si, Jianfeng, et al.
Published: (2026)
LMDX: Language Model-based Document Information Extraction and Localization
by: Perot, Vincent, et al.
Published: (2023)
UniMem: Towards a Unified View of Long-Context Large Language Models
by: Fang, Junjie, et al.
Published: (2024)
Joint Extraction Matters: Prompt-Based Visual Question Answering for Multi-Field Document Information Extraction
by: Loem, Mengsay, et al.
Published: (2025)
Sculpting the Vector Space: Towards Efficient Multi-Vector Visual Document Retrieval via Prune-then-Merge Framework
by: Yan, Yibo, et al.
Published: (2026)
UniVer: A Unified Perspective for Multi-step and Multi-draft Speculative Decoding
by: Weng, Yepeng, et al.
Published: (2026)
UniHR: Hierarchical Representation Learning for Unified Knowledge Graph Link Prediction
by: Liu, Zhiqiang, et al.
Published: (2024)
Problem Solved? Information Extraction Design Space for Layout-Rich Documents using LLMs
by: Colakoglu, Gaye, et al.
Published: (2025)
LDP: Generalizing to Multilingual Visual Information Extraction by Language Decoupled Pretraining
by: Shen, Huawen, et al.
Published: (2024)
GenRES: Rethinking Evaluation for Generative Relation Extraction in the Era of Large Language Models
by: Jiang, Pengcheng, et al.
Published: (2024)
Neurosymbolic Information Extraction from Transactional Documents
by: Hemmer, Arthur, et al.
Published: (2025)
SLIDE: Sliding Localized Information for Document Extraction
by: Singh, Divyansh, et al.
Published: (2025)
UniECG: Understanding and Generating ECG in One Unified Model
by: Jin, Jiarui, et al.
Published: (2025)
PairUni: Pairwise Training for Unified Multimodal Language Models
by: Zheng, Jiani, et al.
Published: (2025)
Guideline Learning for In-context Information Extraction
by: Pang, Chaoxu, et al.
Published: (2023)
ChatUIE: Exploring Chat-based Unified Information Extraction using Large Language Models
by: Xu, Jun, et al.
Published: (2024)