| Main Authors: | Shaukat, Muhammad Arslan; Adnan, Muntasir; Kuhn, Carlos C. N. |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.06976 |
Similar Items
The Debugging Decay Index: Rethinking Debugging Strategies for Code LLMs
by: Adnan, Muntasir, et al.
Published: (2025)
Adaptive Hierarchical Evaluation of LLMs and SAST tools for CWE Prediction in Python
by: Adnan, Muntasir, et al.
Published: (2026)
Enhancing Depressive Post Detection in Bangla: A Comparative Study of TF-IDF, BERT and FastText Embeddings
by: Sazan, Saad Ahmed, et al.
Published: (2024)
Large Language Model Guided Self-Debugging Code Generation
by: Adnan, Muntasir, et al.
Published: (2025)
HiChunk: Evaluating and Enhancing Retrieval-Augmented Generation with Hierarchical Chunking
by: Lu, Wensheng, et al.
Published: (2025)
SmartChunk Retrieval: Query-Aware Chunk Compression with Planning for Efficient Document RAG
by: Zhang, Xuechen, et al.
Published: (2025)
Investigating OCR-Sensitive Neurons to Improve Entity Recognition in Historical Documents
by: Boros, Emanuela, et al.
Published: (2024)
Adaptive Chunking: Optimizing Chunking-Method Selection for RAG
by: Júnior, Paulo Roberto de Moura, et al.
Published: (2026)
Chunking German Legal Code
by: Prior, Max, et al.
Published: (2026)
Chunk-Distilled Language Modeling
by: Li, Yanhong, et al.
Published: (2024)
Cross-Document Topic-Aligned Chunking for Retrieval-Augmented Generation
by: Stankovic, Mile
Published: (2025)
Reconstructing Context: Evaluating Advanced Chunking Strategies for Retrieval-Augmented Generation
by: Merola, Carlo, et al.
Published: (2025)
Toward General Semantic Chunking: A Discriminative Framework for Ultra-Long Documents
by: Wu, Kaifeng, et al.
Published: (2025)
MultiDocFusion: Hierarchical and Multimodal Chunking Pipeline for Enhanced RAG on Long Industrial Documents
by: Shin, Joongmin, et al.
Published: (2026)
ChunkNorris: A High-Performance and Low-Energy Approach to PDF Parsing and Chunking
by: Ciancone, Mathieu, et al.
Published: (2025)
Evaluating Financial Sentiment Analysis with Annotators Instruction Assisted Prompting: Enhancing Contextual Interpretation and Stock Prediction Accuracy
by: Rahman, A M Muntasir, et al.
Published: (2025)
Beyond Chunking: Discourse-Aware Hierarchical Retrieval for Long Document Question Answering
by: Chen, Huiyao, et al.
Published: (2025)
Political Events using RAG with LLMs
by: Arslan, Muhammad, et al.
Published: (2025)
Sustainable Digitalization of Business with Multi-Agent RAG and LLM
by: Arslan, Muhammad, et al.
Published: (2025)
Contextual Document Embeddings
by: Morris, John X., et al.
Published: (2024)
A New HOPE: Domain-agnostic Automatic Evaluation of Text Chunking
by: Brådland, Henrik, et al.
Published: (2025)
ChunkLLM: A Lightweight Pluggable Framework for Accelerating LLMs Inference
by: Ouyang, Haojie, et al.
Published: (2025)
Unleashing Artificial Cognition: Integrating Multiple AI Systems
by: Adnan, Muntasir, et al.
Published: (2024)
Chunking, Retrieval, and Re-ranking: An Empirical Evaluation of RAG Architectures for Policy Document Question Answering
by: Maharjan, Anuj, et al.
Published: (2026)
Enhancing Retrieval Augmented Generation with Hierarchical Text Segmentation Chunking
by: Nguyen, Hai Toan, et al.
Published: (2025)
Chunk, Align, Select: A Simple Long-sequence Processing Method for Transformers
by: Xie, Jiawen, et al.
Published: (2023)
VISLA Benchmark: Evaluating Embedding Sensitivity to Semantic and Lexical Alterations
by: Dumpala, Sri Harsha, et al.
Published: (2024)
Adaptive Token Boundaries: Integrating Human Chunking Mechanisms into Multimodal LLMs
by: Yu, Dongxing
Published: (2025)
Grounding Language Model with Chunking-Free In-Context Retrieval
by: Qian, Hongjin, et al.
Published: (2024)
Revealing the Numeracy Gap: An Empirical Investigation of Text Embedding Models
by: Deng, Ningyuan, et al.
Published: (2025)
Same Patient, Different Words, Different Diagnosis? Evaluating Semantic Stability in Clinical LLMs
by: Alkaeed, Mahdi, et al.
Published: (2026)
Trustworthy AI for Medicine: Continuous Hallucination Detection and Elimination with CHECK
by: Garcia-Fernandez, Carlos, et al.
Published: (2025)
Factuality of Large Language Models: A Survey
by: Wang, Yuxia, et al.
Published: (2024)
H-Net++: Hierarchical Dynamic Chunking for Tokenizer-Free Language Modelling in Morphologically-Rich Languages
by: Zakershahrak, Mehrdad, et al.
Published: (2025)
Chunks as Arms: Multi-Armed Bandit-Guided Sampling for Long-Context LLM Preference Optimization
by: Duan, Shaohua, et al.
Published: (2025)
QCG-Rerank: Chunks Graph Rerank with Query Expansion in Retrieval-Augmented LLMs for Tourism Domain
by: Wei, Qikai, et al.
Published: (2024)
The Chronicles of RAG: The Retriever, the Chunk and the Generator
by: Finardi, Paulo, et al.
Published: (2024)
A Multi-Strategy Approach for AI-Generated Text Detection
by: Zain, Ali, et al.
Published: (2025)
Mix-of-Granularity: Optimize the Chunking Granularity for Retrieval-Augmented Generation
by: Zhong, Zijie, et al.
Published: (2024)
Measuring Embedding Sensitivity to Authorial Style in French: Comparing Literary Texts with Language Model Rewritings
by: Icard, Benjamin, et al.
Published: (2026)