:: Library Catalog

Bibliographic Details
Main Authors:	Fu, ShunLiang; Zhang, Yanxin; Xiang, Yixin; Du, Xiaoyu; Tang, Jinhui
Format:	Preprint
Published:	2026
Subjects:	Information Retrieval
Online Access:	https://arxiv.org/abs/2601.18203

Similar Items

MAB-DQA: Addressing Query Aspect Importance in Document Question Answering with Multi-Armed Bandits
by: Xiang, Yixin, et al.
Published: (2026)

Precision Profile Pollution Attack on Sequential Recommenders via Influence Function
by: Du, Xiaoyu, et al.
Published: (2024)

FACE: A General Framework for Mapping Collaborative Filtering Embeddings into LLM Tokens
by: Wang, Chao, et al.
Published: (2025)

Structured Attention Matters to Multimodal LLMs in Document Understanding
by: Liu, Chang, et al.
Published: (2025)

Frozen LVLMs for Micro-Video Recommendation: A Systematic Study of Feature Extraction and Fusion
by: Sun, Huatuan, et al.
Published: (2025)

Rank4Gen: RAG-Preference-Aligned Document Set Selection and Ranking
by: Fan, Yongqi, et al.
Published: (2026)

AnnoRetrieve: Efficient Structured Retrieval for Unstructured Document Analysis
by: Lin, Teng, et al.
Published: (2026)

Multimodal Search in Chemical Documents and Reactions
by: Shah, Ayush Kumar, et al.
Published: (2025)

eSapiens: A Real-World NLP Framework for Multimodal Document Understanding and Enterprise Knowledge Processing
by: Shi, Isaac, et al.
Published: (2025)

SumRank: Aligning Summarization Models for Long-Document Listwise Reranking
by: Feng, Jincheng, et al.
Published: (2026)

Markup Language Modeling for Web Document Understanding
by: Liu, Su, et al.
Published: (2025)

AlignRec: Aligning and Training in Multimodal Recommendations
by: Liu, Yifan, et al.
Published: (2024)

LFRAG: Layout-oriented Fine-grained Retrieval-Augmented Generation on Multimodal Document Understanding
by: Zhu, Yifan, et al.
Published: (2026)

RosePO: Aligning LLM-based Recommenders with Human Values
by: Liao, Jiayi, et al.
Published: (2024)

ReAlign: Optimizing the Visual Document Retriever with Reasoning-Guided Fine-Grained Alignment
by: Yang, Hao, et al.
Published: (2026)

Unifying Multimodal Retrieval via Document Screenshot Embedding
by: Ma, Xueguang, et al.
Published: (2024)

Rethinking Reasoning in Document Ranking: Why Chain-of-Thought Falls Short
by: Lu, Xuan, et al.
Published: (2025)

Tools are under-documented: Simple Document Expansion Boosts Tool Retrieval
by: Lu, Xuan, et al.
Published: (2025)

MultiCBR: Multi-view Contrastive Learning for Bundle Recommendation
by: Ma, Yunshan, et al.
Published: (2023)

Beyond Text: Aligning Vision and Language for Multimodal E-Commerce Retrieval
by: Zhang, Qujiaheng, et al.
Published: (2026)

CAT-ID$^2$: Category-Tree Integrated Document Identifier Learning for Generative Retrieval In E-commerce
by: Liu, Xiaoyu, et al.
Published: (2025)

Read and Think: An Efficient Step-wise Multimodal Language Model for Document Understanding and Reasoning
by: Zhang, Jinxu
Published: (2024)

Understanding Internal Representations of Recommendation Models with Sparse Autoencoders
by: Wang, Jiayin, et al.
Published: (2024)

Unlocking Multimodal Document Intelligence: From Current Triumphs to Future Frontiers of Visual Document Retrieval
by: Yan, Yibo, et al.
Published: (2026)

Vision-Guided Chunking Is All You Need: Enhancing RAG with Multimodal Document Understanding
by: Tripathi, Vishesh, et al.
Published: (2025)

Document Similarity Enhanced IPS Estimation for Unbiased Learning to Rank
by: Liang, Zeyan, et al.
Published: (2025)

Cross-Document Topic-Aligned Chunking for Retrieval-Augmented Generation
by: Stankovic, Mile
Published: (2025)

ELCoRec: Enhance Language Understanding with Co-Propagation of Numerical and Categorical Features for Recommendation
by: Chen, Jizheng, et al.
Published: (2024)

From Documents to Dialogue: Building KG-RAG Enhanced AI Assistants
by: Mukherjee, Manisha, et al.
Published: (2025)

Doc-Researcher: A Unified System for Multimodal Document Parsing and Deep Research
by: Dong, Kuicai, et al.
Published: (2025)

Information-Theoretic Generative Clustering of Documents
by: Du, Xin, et al.
Published: (2024)

Financial Sentiment Analysis on News and Reports Using Large Language Models and FinBERT
by: Shen, Yanxin, et al.
Published: (2024)

MARS: Modality-Aligned Retrieval for Sequence Augmented CTR Prediction
by: Xiao, Yutian, et al.
Published: (2025)

HKRAG: Holistic Knowledge Retrieval-Augmented Generation Over Visually-Rich Documents
by: Tong, Anyang, et al.
Published: (2025)

CSQL: Mapping Documents into Causal Databases
by: Mahadevan, Sridhar
Published: (2026)

DREQ: Document Re-Ranking Using Entity-based Query Understanding
by: Chatterjee, Shubham, et al.
Published: (2024)

SSEmb: A Joint Structural and Semantic Embedding Framework for Mathematical Formula Retrieval
by: Li, Ruyin, et al.
Published: (2025)

Structural and Disentangled Adaptation of Large Vision Language Models for Multimodal Recommendation
by: Rao, Zhongtao, et al.
Published: (2025)

Leveraging LLMs to Evaluate Usefulness of Document
by: Wang, Xingzhu, et al.
Published: (2025)

MoLoRAG: Bootstrapping Document Understanding via Multi-modal Logic-aware Retrieval
by: Wu, Xixi, et al.
Published: (2025)