Saved in:
| Main Authors: | Fu, ShunLiang, Zhang, Yanxin, Xiang, Yixin, Du, Xiaoyu, Tang, Jinhui |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.18203 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
MAB-DQA: Addressing Query Aspect Importance in Document Question Answering with Multi-Armed Bandits
by: Xiang, Yixin, et al.
Published: (2026)
by: Xiang, Yixin, et al.
Published: (2026)
Precision Profile Pollution Attack on Sequential Recommenders via Influence Function
by: Du, Xiaoyu, et al.
Published: (2024)
by: Du, Xiaoyu, et al.
Published: (2024)
FACE: A General Framework for Mapping Collaborative Filtering Embeddings into LLM Tokens
by: Wang, Chao, et al.
Published: (2025)
by: Wang, Chao, et al.
Published: (2025)
Structured Attention Matters to Multimodal LLMs in Document Understanding
by: Liu, Chang, et al.
Published: (2025)
by: Liu, Chang, et al.
Published: (2025)
Frozen LVLMs for Micro-Video Recommendation: A Systematic Study of Feature Extraction and Fusion
by: Sun, Huatuan, et al.
Published: (2025)
by: Sun, Huatuan, et al.
Published: (2025)
Rank4Gen: RAG-Preference-Aligned Document Set Selection and Ranking
by: Fan, Yongqi, et al.
Published: (2026)
by: Fan, Yongqi, et al.
Published: (2026)
AnnoRetrieve: Efficient Structured Retrieval for Unstructured Document Analysis
by: Lin, Teng, et al.
Published: (2026)
by: Lin, Teng, et al.
Published: (2026)
Multimodal Search in Chemical Documents and Reactions
by: Shah, Ayush Kumar, et al.
Published: (2025)
by: Shah, Ayush Kumar, et al.
Published: (2025)
eSapiens: A Real-World NLP Framework for Multimodal Document Understanding and Enterprise Knowledge Processing
by: Shi, Isaac, et al.
Published: (2025)
by: Shi, Isaac, et al.
Published: (2025)
SumRank: Aligning Summarization Models for Long-Document Listwise Reranking
by: Feng, Jincheng, et al.
Published: (2026)
by: Feng, Jincheng, et al.
Published: (2026)
Markup Language Modeling for Web Document Understanding
by: Liu, Su, et al.
Published: (2025)
by: Liu, Su, et al.
Published: (2025)
AlignRec: Aligning and Training in Multimodal Recommendations
by: Liu, Yifan, et al.
Published: (2024)
by: Liu, Yifan, et al.
Published: (2024)
LFRAG: Layout-oriented Fine-grained Retrieval-Augmented Generation on Multimodal Document Understanding
by: Zhu, Yifan, et al.
Published: (2026)
by: Zhu, Yifan, et al.
Published: (2026)
RosePO: Aligning LLM-based Recommenders with Human Values
by: Liao, Jiayi, et al.
Published: (2024)
by: Liao, Jiayi, et al.
Published: (2024)
ReAlign: Optimizing the Visual Document Retriever with Reasoning-Guided Fine-Grained Alignment
by: Yang, Hao, et al.
Published: (2026)
by: Yang, Hao, et al.
Published: (2026)
Unifying Multimodal Retrieval via Document Screenshot Embedding
by: Ma, Xueguang, et al.
Published: (2024)
by: Ma, Xueguang, et al.
Published: (2024)
Rethinking Reasoning in Document Ranking: Why Chain-of-Thought Falls Short
by: Lu, Xuan, et al.
Published: (2025)
by: Lu, Xuan, et al.
Published: (2025)
Tools are under-documented: Simple Document Expansion Boosts Tool Retrieval
by: Lu, Xuan, et al.
Published: (2025)
by: Lu, Xuan, et al.
Published: (2025)
MultiCBR: Multi-view Contrastive Learning for Bundle Recommendation
by: Ma, Yunshan, et al.
Published: (2023)
by: Ma, Yunshan, et al.
Published: (2023)
Beyond Text: Aligning Vision and Language for Multimodal E-Commerce Retrieval
by: Zhang, Qujiaheng, et al.
Published: (2026)
by: Zhang, Qujiaheng, et al.
Published: (2026)
CAT-ID$^2$: Category-Tree Integrated Document Identifier Learning for Generative Retrieval In E-commerce
by: Liu, Xiaoyu, et al.
Published: (2025)
by: Liu, Xiaoyu, et al.
Published: (2025)
Read and Think: An Efficient Step-wise Multimodal Language Model for Document Understanding and Reasoning
by: Zhang, Jinxu
Published: (2024)
by: Zhang, Jinxu
Published: (2024)
Understanding Internal Representations of Recommendation Models with Sparse Autoencoders
by: Wang, Jiayin, et al.
Published: (2024)
by: Wang, Jiayin, et al.
Published: (2024)
Unlocking Multimodal Document Intelligence: From Current Triumphs to Future Frontiers of Visual Document Retrieval
by: Yan, Yibo, et al.
Published: (2026)
by: Yan, Yibo, et al.
Published: (2026)
Vision-Guided Chunking Is All You Need: Enhancing RAG with Multimodal Document Understanding
by: Tripathi, Vishesh, et al.
Published: (2025)
by: Tripathi, Vishesh, et al.
Published: (2025)
Document Similarity Enhanced IPS Estimation for Unbiased Learning to Rank
by: Liang, Zeyan, et al.
Published: (2025)
by: Liang, Zeyan, et al.
Published: (2025)
Cross-Document Topic-Aligned Chunking for Retrieval-Augmented Generation
by: Stankovic, Mile
Published: (2025)
by: Stankovic, Mile
Published: (2025)
ELCoRec: Enhance Language Understanding with Co-Propagation of Numerical and Categorical Features for Recommendation
by: Chen, Jizheng, et al.
Published: (2024)
by: Chen, Jizheng, et al.
Published: (2024)
From Documents to Dialogue: Building KG-RAG Enhanced AI Assistants
by: Mukherjee, Manisha, et al.
Published: (2025)
by: Mukherjee, Manisha, et al.
Published: (2025)
Doc-Researcher: A Unified System for Multimodal Document Parsing and Deep Research
by: Dong, Kuicai, et al.
Published: (2025)
by: Dong, Kuicai, et al.
Published: (2025)
Information-Theoretic Generative Clustering of Documents
by: Du, Xin, et al.
Published: (2024)
by: Du, Xin, et al.
Published: (2024)
Financial Sentiment Analysis on News and Reports Using Large Language Models and FinBERT
by: Shen, Yanxin, et al.
Published: (2024)
by: Shen, Yanxin, et al.
Published: (2024)
MARS: Modality-Aligned Retrieval for Sequence Augmented CTR Prediction
by: Xiao, Yutian, et al.
Published: (2025)
by: Xiao, Yutian, et al.
Published: (2025)
HKRAG: Holistic Knowledge Retrieval-Augmented Generation Over Visually-Rich Documents
by: Tong, Anyang, et al.
Published: (2025)
by: Tong, Anyang, et al.
Published: (2025)
CSQL: Mapping Documents into Causal Databases
by: Mahadevan, Sridhar
Published: (2026)
by: Mahadevan, Sridhar
Published: (2026)
DREQ: Document Re-Ranking Using Entity-based Query Understanding
by: Chatterjee, Shubham, et al.
Published: (2024)
by: Chatterjee, Shubham, et al.
Published: (2024)
SSEmb: A Joint Structural and Semantic Embedding Framework for Mathematical Formula Retrieval
by: Li, Ruyin, et al.
Published: (2025)
by: Li, Ruyin, et al.
Published: (2025)
Structural and Disentangled Adaptation of Large Vision Language Models for Multimodal Recommendation
by: Rao, Zhongtao, et al.
Published: (2025)
by: Rao, Zhongtao, et al.
Published: (2025)
Leveraging LLMs to Evaluate Usefulness of Document
by: Wang, Xingzhu, et al.
Published: (2025)
by: Wang, Xingzhu, et al.
Published: (2025)
MoLoRAG: Bootstrapping Document Understanding via Multi-modal Logic-aware Retrieval
by: Wu, Xixi, et al.
Published: (2025)
by: Wu, Xixi, et al.
Published: (2025)
Similar Items
-
MAB-DQA: Addressing Query Aspect Importance in Document Question Answering with Multi-Armed Bandits
by: Xiang, Yixin, et al.
Published: (2026) -
Precision Profile Pollution Attack on Sequential Recommenders via Influence Function
by: Du, Xiaoyu, et al.
Published: (2024) -
FACE: A General Framework for Mapping Collaborative Filtering Embeddings into LLM Tokens
by: Wang, Chao, et al.
Published: (2025) -
Structured Attention Matters to Multimodal LLMs in Document Understanding
by: Liu, Chang, et al.
Published: (2025) -
Frozen LVLMs for Micro-Video Recommendation: A Systematic Study of Feature Extraction and Fusion
by: Sun, Huatuan, et al.
Published: (2025)