Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Hevia, Juan Segundo, Arredondo, Facundo, Kumar, Vishesh
Format:	Preprint
Published:	2025
Subjects:	Computers and Society
Online Access:	https://arxiv.org/abs/2510.06255
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866916995865247744
author	Hevia, Juan Segundo Arredondo, Facundo Kumar, Vishesh
author_facet	Hevia, Juan Segundo Arredondo, Facundo Kumar, Vishesh
contents	The integration of large language models (LLMs) into education offers significant potential to enhance accessibility and engagement, yet their high computational demands limit usability in low-resource settings, exacerbating educational inequities. To address this, we propose an offline Retrieval-Augmented Generation (RAG) pipeline that pairs a small language model (SLM) with a robust retrieval mechanism, enabling factual, contextually relevant responses without internet connectivity. We evaluate the efficacy of this pipeline using domain-specific educational content, focusing on biology coursework. Our analysis highlights key challenges: smaller models, such as SmolLM, struggle to effectively leverage extended contexts provided by the RAG pipeline, particularly when noisy or irrelevant chunks are included. To improve performance, we propose exploring advanced chunking techniques, alternative small or quantized versions of larger models, and moving beyond traditional metrics like MMLU to a holistic evaluation framework assessing free-form response. This work demonstrates the feasibility of deploying AI tutors in constrained environments, laying the groundwork for equitable, offline, and device-based educational tools.
format	Preprint
id	arxiv_https___arxiv_org_abs_2510_06255
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Towards an Efficient, Customizable, and Accessible AI Tutor Hevia, Juan Segundo Arredondo, Facundo Kumar, Vishesh Computers and Society The integration of large language models (LLMs) into education offers significant potential to enhance accessibility and engagement, yet their high computational demands limit usability in low-resource settings, exacerbating educational inequities. To address this, we propose an offline Retrieval-Augmented Generation (RAG) pipeline that pairs a small language model (SLM) with a robust retrieval mechanism, enabling factual, contextually relevant responses without internet connectivity. We evaluate the efficacy of this pipeline using domain-specific educational content, focusing on biology coursework. Our analysis highlights key challenges: smaller models, such as SmolLM, struggle to effectively leverage extended contexts provided by the RAG pipeline, particularly when noisy or irrelevant chunks are included. To improve performance, we propose exploring advanced chunking techniques, alternative small or quantized versions of larger models, and moving beyond traditional metrics like MMLU to a holistic evaluation framework assessing free-form response. This work demonstrates the feasibility of deploying AI tutors in constrained environments, laying the groundwork for equitable, offline, and device-based educational tools.
title	Towards an Efficient, Customizable, and Accessible AI Tutor
topic	Computers and Society
url	https://arxiv.org/abs/2510.06255

Similar Items