| Main Authors: | Wang, Yu; Tan, Tianhao; Wang, Yifei |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.09553 |
Similar Items
KwaiChat: A Large-Scale Video-Driven Multilingual Mixed-Type Dialogue Corpus
by: Shi, Xiaoming, et al.
Published: (2025)
Hierarchical Indexing for Retrieval-Augmented Opinion Summarization
by: Hosking, Tom, et al.
Published: (2024)
ALOHA: Empowering Multilingual Agent for University Orientation with Hierarchical Retrieval
by: Tao, Mingxu, et al.
Published: (2025)
LEMUR: A Corpus for Robust Fine-Tuning of Multilingual Law Embedding Models for Retrieval
by: Ahmadi, Narges Baba, et al.
Published: (2026)
PETra: A Multilingual Corpus of Pragmatic Explicitation in Translation
by: Osmelak, Doreen, et al.
Published: (2025)
Targum -- A Multilingual New Testament Translation Corpus
by: Rapacz, Maciej, et al.
Published: (2026)
VideoRAG: Retrieval-Augmented Generation over Video Corpus
by: Jeong, Soyeong, et al.
Published: (2025)
Cross-Lingual Consensus: Aligning Multilingual Cultural Knowledge via Multilingual Self-Consistency
by: Soegeng, Andrew Ivan, et al.
Published: (2026)
Multilingual Knowledge Graph Completion via Efficient Multilingual Knowledge Sharing
by: Mao, Cunli, et al.
Published: (2025)
Multilingual Retrieval-Augmented Generation for Knowledge-Intensive Task
by: Ranaldi, Leonardo, et al.
Published: (2025)
EuroSpeech: A Multilingual Speech Corpus
by: Pfisterer, Samuel, et al.
Published: (2025)
Retrieval-Augmented Generation with Hierarchical Knowledge
by: Huang, Haoyu, et al.
Published: (2025)
Hierarchical Retrieval-Augmented Generation Model with Rethink for Multi-hop Question Answering
by: Zhang, Xiaoming, et al.
Published: (2024)
Towards Corpus-Grounded Agentic LLMs for Multilingual Grammatical Analysis
by: Klemen, Matej, et al.
Published: (2025)
Multilingual Information Retrieval with a Monolingual Knowledge Base
by: Zhuang, Yingying, et al.
Published: (2025)
Not All Languages are Equal: Insights into Multilingual Retrieval-Augmented Generation
by: Wu, Suhang, et al.
Published: (2024)
A Longitudinal, Multinational, and Multilingual Corpus of News Coverage of the Russo-Ukrainian War
by: Mohanty, Dikshya, et al.
Published: (2026)
ZAEBUC-Spoken: A Multilingual Multidialectal Arabic-English Speech Corpus
by: Hamed, Injy, et al.
Published: (2024)
Multilingual Pretraining Using a Large Corpus Machine-Translated from a Single Source Language
by: Wang, Jiayi, et al.
Published: (2024)
mRAKL: Multilingual Retrieval-Augmented Knowledge Graph Construction for Low-Resourced Languages
by: Nigatu, Hellina Hailu, et al.
Published: (2025)
Detecting Corpus-Level Knowledge Inconsistencies in Wikipedia with Large Language Models
by: Semnani, Sina J., et al.
Published: (2025)
COSMMIC: Comment-Sensitive Multimodal Multilingual Indian Corpus for Summarization and Headline Generation
by: Kumar, Raghvendra, et al.
Published: (2025)
Hierarchical Semantic Retrieval with Cobweb
by: Gupta, Anant, et al.
Published: (2025)
A-RAG: Scaling Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces
by: Du, Mingxuan, et al.
Published: (2026)
Tracing Multilingual Factual Knowledge Acquisition in Pretraining
by: Liu, Yihong, et al.
Published: (2025)
MITRA: A Large-Scale Parallel Corpus and Multilingual Pretrained Language Model for Machine Translation and Semantic Retrieval for Pāli, Sanskrit, Buddhist Chinese, and Tibetan
by: Nehrdich, Sebastian, et al.
Published: (2026)
Training the Knowledge Base through Evidence Distillation and Write-Back Enrichment
by: Lu, Yuxing, et al.
Published: (2026)
Explanation based In-Context Demonstrations Retrieval for Multilingual Grammatical Error Correction
by: Li, Wei, et al.
Published: (2025)
GraphER: An Efficient Graph-Based Enrichment and Reranking Method for Retrieval-Augmented Generation
by: Miao, Ruizhong, et al.
Published: (2026)
BOUTEF: A Multilingual Corpus for FakeNews in North Africa -- Language as a Weapon
by: Smaili, Kamel, et al.
Published: (2026)
GlossLM: A Massively Multilingual Corpus and Pretrained Model for Interlinear Glossed Text
by: Ginn, Michael, et al.
Published: (2024)
mOSCAR: A Large-scale Multilingual and Multimodal Document-level Corpus
by: Futeral, Matthieu, et al.
Published: (2024)
KARMA: Leveraging Multi-Agent LLMs for Automated Knowledge Graph Enrichment
by: Lu, Yuxing, et al.
Published: (2025)
PRoH: Dynamic Planning and Reasoning over Knowledge Hypergraphs for Retrieval-Augmented Generation
by: Zai, Xiangjun, et al.
Published: (2025)
Multilingual Generative Retrieval via Cross-lingual Semantic Compression
by: Huang, Yuxin, et al.
Published: (2025)
Language-Coupled Reinforcement Learning for Multilingual Retrieval-Augmented Generation
by: Qi, Rui, et al.
Published: (2026)
Multilingual Knowledge Graph Completion from Pretrained Language Models with Knowledge Constraints
by: Song, Ran, et al.
Published: (2024)
HIRAG: Hierarchical-Thought Instruction-Tuning Retrieval-Augmented Generation
by: Jiao, YiHan, et al.
Published: (2025)
Distillation for Multilingual Information Retrieval
by: Yang, Eugene, et al.
Published: (2024)
Retrieval-style In-Context Learning for Few-shot Hierarchical Text Classification
by: Chen, Huiyao, et al.
Published: (2024)