| Main Authors: | Zhao, Zhiyuan, Kang, Hengrui, Wang, Bin, He, Conghui |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2410.12628 |
Similar Items
OmniDocLayout: Towards Diverse Document Layout Generation via Coarse-to-Fine LLM Learning
by: Kang, Hengrui, et al.
Published: (2025)
PP-DocLayout: A Unified Document Layout Detection Model to Accelerate Large-Scale Data Construction
by: Sun, Ting, et al.
Published: (2025)
DocCogito: Aligning Layout Cognition and Step-Level Grounded Reasoning for Document Understanding
by: Wu, Yuchuan, et al.
Published: (2026)
LED Benchmark: Diagnosing Structural Layout Errors for Document Layout Analysis
by: Heo, Inbum, et al.
Published: (2025)
SciPostLayout: A Dataset for Layout Analysis and Layout Generation of Scientific Posters
by: Tanaka, Shohei, et al.
Published: (2024)
STAY Diffusion: Styled Layout Diffusion Model for Diverse Layout-to-Image Generation
by: Wang, Ruyu, et al.
Published: (2025)
PARL: Position-Aware Relation Learning Network for Document Layout Analysis
by: Liu, Fuyuan, et al.
Published: (2026)
SFDLA: Source-Free Document Layout Analysis
by: Tewes, Sebastian, et al.
Published: (2025)
Diachronic Document Dataset for Semantic Layout Analysis
by: Clérice, Thibault, et al.
Published: (2024)
Cross-Domain Document Layout Analysis Using Document Style Guide
by: Wu, Xingjiao, et al.
Published: (2022)
A Hybrid Approach for Document Layout Analysis in Document images
by: Shehzadi, Tahira, et al.
Published: (2024)
DLAFormer: An End-to-End Transformer For Document Layout Analysis
by: Wang, Jiawei, et al.
Published: (2024)
HybriDLA: Hybrid Generation for Document Layout Analysis
by: Chen, Yufan, et al.
Published: (2025)
From Codicology to Code: A Comparative Study of Transformer and YOLO-based Detectors for Layout Analysis in Historical Documents
by: Aguilar, Sergio Torres
Published: (2025)
OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations
by: Ouyang, Linke, et al.
Published: (2024)
LayoutFlow: Flow Matching for Layout Generation
by: Guerreiro, Julian Jorge Andrade, et al.
Published: (2024)
ReLayout: Towards Real-World Document Understanding via Layout-enhanced Pre-training
by: Jiang, Zhouqiang, et al.
Published: (2024)
360 Layout Estimation via Orthogonal Planes Disentanglement and Multi-view Geometric Consistency Perception
by: Shen, Zhijie, et al.
Published: (2023)
LayoutLLM: Layout Instruction Tuning with Large Language Models for Document Understanding
by: Luo, Chuwei, et al.
Published: (2024)
UnSupDLA: Towards Unsupervised Document Layout Analysis
by: Sheikh, Talha Uddin, et al.
Published: (2024)
RoDLA: Benchmarking the Robustness of Document Layout Analysis Models
by: Chen, Yufan, et al.
Published: (2024)
Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization
by: Zhao, Zhiyuan, et al.
Published: (2023)
Chat2Layout: Interactive 3D Furniture Layout with a Multimodal LLM
by: Wang, Can, et al.
Published: (2024)
Video2Layout: Recall and Reconstruct Metric-Grounded Cognitive Map for Spatial Reasoning
by: Huang, Yibin, et al.
Published: (2025)
Towards Khmer Scene Document Layout Detection
by: Kong, Marry, et al.
Published: (2026)
Retrieval-Augmented Layout Transformer for Content-Aware Layout Generation
by: Horita, Daichi, et al.
Published: (2023)
Layout Anything: One Transformer for Universal Room Layout Estimation
by: Mia, Md Sohag, et al.
Published: (2025)
LayoutDiffusion: Controllable Diffusion Model for Layout-to-image Generation
by: Zheng, Guangcong, et al.
Published: (2023)
Accurate Fine-grained Layout Analysis for the Historical Tibetan Document Based on the Instance Segmentation
by: Zhao, Penghai, et al.
Published: (2021)
RSGen: Enhancing Layout-Driven Remote Sensing Image Generation with Diverse Edge Guidance
by: Hou, Xianbao, et al.
Published: (2026)
Generating Synthetic Invoices via Layout-Preserving Content Replacement
by: V, Bevin, et al.
Published: (2025)
LEGION: Learning to Ground and Explain for Synthetic Image Detection
by: Kang, Hengrui, et al.
Published: (2025)
Advanced Layout Analysis Models for Docling
by: Livathinos, Nikolaos, et al.
Published: (2025)
LLM-Guided Probabilistic Fusion for Label-Efficient Document Layout Analysis
by: Shihab, Ibne Farabi, et al.
Published: (2025)
Parser-Oriented Structural Refinement for a Stable Layout Interface in Document Parsing
by: Liu, Fuyuan, et al.
Published: (2026)
GLDesigner: Leveraging Multi-Modal LLMs as Designer for Enhanced Aesthetic Text Glyph Layouts
by: He, Junwen, et al.
Published: (2024)
LoCo: Locally Constrained Training-Free Layout-to-Image Synthesis
by: Zhao, Peiang, et al.
Published: (2023)
uLayout: Unified Room Layout Estimation for Perspective and Panoramic Images
by: Lee, Jonathan, et al.
Published: (2025)
DogLayout: Denoising Diffusion GAN for Discrete and Continuous Layout Generation
by: Gan, Zhaoxing, et al.
Published: (2024)
LTSim: Layout Transportation-based Similarity Measure for Evaluating Layout Generation
by: Otani, Mayu, et al.
Published: (2024)