:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Hu, Kai, Wang, Jiawei, Lin, Weihong, Zhong, Zhuoyao, Sun, Lei, Huo, Qiang
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2401.09220
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Detect-Order-Construct: A Tree Construction based Approach for Hierarchical Document Structure Analysis
by: Wang, Jiawei, et al.
Published: (2024)

UniHDSA: A Unified Relation Prediction Approach for Hierarchical Document Structure Analysis
by: Wang, Jiawei, et al.
Published: (2025)

Dynamic Relation Transformer for Contextual Text Block Detection
by: Wang, Jiawei, et al.
Published: (2024)

Efficient Medical VIE via Reinforcement Learning
by: Liu, Lijun, et al.
Published: (2025)

PAGED: A Benchmark for Procedural Graphs Extraction from Documents
by: Du, Weihong, et al.
Published: (2024)

UniFine: A Unified and Fine-grained Approach for Zero-shot Vision-Language Understanding
by: Wang, Zhecan, et al.
Published: (2023)

DLAFormer: An End-to-End Transformer For Document Layout Analysis
by: Wang, Jiawei, et al.
Published: (2024)

Uni-MoE: Scaling Unified Multimodal LLMs with Mixture of Experts
by: Li, Yunxin, et al.
Published: (2024)

READoc: A Unified Benchmark for Realistic Document Structured Extraction
by: Li, Zichao, et al.
Published: (2024)

UNIKIE-BENCH: Benchmarking Large Multimodal Models for Key Information Extraction in Visual Documents
by: Ji, Yifan, et al.
Published: (2026)

UniMEEC: Towards Unified Multimodal Emotion Recognition and Emotion Cause
by: Hu, Guimin, et al.
Published: (2024)

Information Extraction from Heterogeneous Documents without Ground Truth Labels using Synthetic Label Generation and Knowledge Distillation
by: Bhattacharyya, Aniket, et al.
Published: (2024)

RoboUniView: Visual-Language Model with Unified View Representation for Robotic Manipulation
by: Liu, Fanfan, et al.
Published: (2024)

PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction
by: Lin, Zening, et al.
Published: (2024)

CausalEmbed: Auto-Regressive Multi-Vector Generation in Latent Space for Visual Document Embedding
by: Huo, Jiahao, et al.
Published: (2026)

UniVS: Unified and Universal Video Segmentation with Prompts as Queries
by: Li, Minghan, et al.
Published: (2024)

Lightweight Spatial Modeling for Combinatorial Information Extraction From Documents
by: Dong, Yanfei, et al.
Published: (2024)

A Positive-Unlabeled Metric Learning Framework for Document-Level Relation Extraction with Incomplete Labeling
by: Wang, Ye, et al.
Published: (2023)

UniCL: A Universal Contrastive Learning Framework for Large Time Series Models
by: Li, Jiawei, et al.
Published: (2024)

UniVLR: Unifying Text and Vision in Visual Latent Reasoning for Multimodal LLMs
by: Jiang, Houcheng, et al.
Published: (2026)

Span-Oriented Information Extraction -- A Unifying Perspective on Information Extraction
by: Ding, Yifan, et al.
Published: (2024)

UniICL: An Efficient Unified Framework Unifying Compression, Selection, and Generation
by: Gao, Jun, et al.
Published: (2024)

UniWorld-V1: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation
by: Lin, Bin, et al.
Published: (2025)

UniBridge: A Unified Approach to Cross-Lingual Transfer Learning for Low-Resource Languages
by: Pham, Trinh, et al.
Published: (2024)

Beyond Static Alignment: Hierarchical Policy Control for LLM Safety via Risk-Aware Chain-of-Thought
by: Si, Jianfeng, et al.
Published: (2026)

LMDX: Language Model-based Document Information Extraction and Localization
by: Perot, Vincent, et al.
Published: (2023)

UniMem: Towards a Unified View of Long-Context Large Language Models
by: Fang, Junjie, et al.
Published: (2024)

Joint Extraction Matters: Prompt-Based Visual Question Answering for Multi-Field Document Information Extraction
by: Loem, Mengsay, et al.
Published: (2025)

Sculpting the Vector Space: Towards Efficient Multi-Vector Visual Document Retrieval via Prune-then-Merge Framework
by: Yan, Yibo, et al.
Published: (2026)

UniVer: A Unified Perspective for Multi-step and Multi-draft Speculative Decoding
by: Weng, Yepeng, et al.
Published: (2026)

UniHR: Hierarchical Representation Learning for Unified Knowledge Graph Link Prediction
by: Liu, Zhiqiang, et al.
Published: (2024)

Problem Solved? Information Extraction Design Space for Layout-Rich Documents using LLMs
by: Colakoglu, Gaye, et al.
Published: (2025)

LDP: Generalizing to Multilingual Visual Information Extraction by Language Decoupled Pretraining
by: Shen, Huawen, et al.
Published: (2024)

GenRES: Rethinking Evaluation for Generative Relation Extraction in the Era of Large Language Models
by: Jiang, Pengcheng, et al.
Published: (2024)

Neurosymbolic Information Extraction from Transactional Documents
by: Hemmer, Arthur, et al.
Published: (2025)

SLIDE: Sliding Localized Information for Document Extraction
by: Singh, Divyansh, et al.
Published: (2025)

UniECG: Understanding and Generating ECG in One Unified Model
by: Jin, Jiarui, et al.
Published: (2025)

PairUni: Pairwise Training for Unified Multimodal Language Models
by: Zheng, Jiani, et al.
Published: (2025)

Guideline Learning for In-context Information Extraction
by: Pang, Chaoxu, et al.
Published: (2023)

ChatUIE: Exploring Chat-based Unified Information Extraction using Large Language Models
by: Xu, Jun, et al.
Published: (2024)