Saved in:
Bibliographic Details
Main Authors: Raorane, Tanisha, Kole, Prasenjit
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2601.06607
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866918281404743680
author Raorane, Tanisha
Kole, Prasenjit
author_facet Raorane, Tanisha
Kole, Prasenjit
contents Sanskrit Subhasitas encapsulate centuries of cultural and philosophical wisdom, yet remain underutilized in the digital age due to linguistic and contextual barriers. In this work, we present Pragya, a retrieval-augmented generation (RAG) framework for semantic recommendation of Subhasitas. We curate a dataset of 200 verses annotated with thematic tags such as motivation, friendship, and compassion. Using sentence embeddings (IndicBERT), the system retrieves top-k verses relevant to user queries. The retrieved results are then passed to a generative model (Mistral LLM) to produce transliterations, translations, and contextual explanations. Experimental evaluation demonstrates that semantic retrieval significantly outperforms keyword matching in precision and relevance, while user studies highlight improved accessibility through generated summaries. To our knowledge, this is the first attempt at integrating retrieval and generation for Sanskrit Subhasitas, bridging cultural heritage with modern applied AI.
format Preprint
id arxiv_https___arxiv_org_abs_2601_06607
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Pragya: An AI-Based Semantic Recommendation System for Sanskrit Subhasitas
Raorane, Tanisha
Kole, Prasenjit
Computation and Language
Artificial Intelligence
Machine Learning
Sanskrit Subhasitas encapsulate centuries of cultural and philosophical wisdom, yet remain underutilized in the digital age due to linguistic and contextual barriers. In this work, we present Pragya, a retrieval-augmented generation (RAG) framework for semantic recommendation of Subhasitas. We curate a dataset of 200 verses annotated with thematic tags such as motivation, friendship, and compassion. Using sentence embeddings (IndicBERT), the system retrieves top-k verses relevant to user queries. The retrieved results are then passed to a generative model (Mistral LLM) to produce transliterations, translations, and contextual explanations. Experimental evaluation demonstrates that semantic retrieval significantly outperforms keyword matching in precision and relevance, while user studies highlight improved accessibility through generated summaries. To our knowledge, this is the first attempt at integrating retrieval and generation for Sanskrit Subhasitas, bridging cultural heritage with modern applied AI.
title Pragya: An AI-Based Semantic Recommendation System for Sanskrit Subhasitas
topic Computation and Language
Artificial Intelligence
Machine Learning
url https://arxiv.org/abs/2601.06607