Saved in:
Bibliographic Details
Main Authors: Revankar, Kush, Deshpande, Shreyas, Sayeed, Araham, Tandale, Ansh, Bobde, Sarika
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2512.06485
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909947423358976
author Revankar, Kush
Deshpande, Shreyas
Sayeed, Araham
Tandale, Ansh
Bobde, Sarika
author_facet Revankar, Kush
Deshpande, Shreyas
Sayeed, Araham
Tandale, Ansh
Bobde, Sarika
contents Communication between deaf users, visually im paired users, and the general hearing population often relies on tools that support only one direction of interaction. To address this limitation, this work presents Sanvaad, a lightweight multimodal accessibility framework designed to support real time, two-way communication. For deaf users, Sanvaad includes an ISL recognition module built on MediaPipe landmarks. MediaPipe is chosen primarily for its efficiency and low computational load, enabling the system to run smoothly on edge devices without requiring dedicated hardware. Spoken input from a phone can also be translated into sign representations through a voice-to-sign component that maps detected speech to predefined phrases and produces corresponding GIFs or alphabet-based visualizations. For visually impaired users, the framework provides a screen free voice interface that integrates multilingual speech recognition, text summarization, and text-to-speech generation. These components work together through a Streamlit-based interface, making the system usable on both desktop and mobile environments. Overall, Sanvaad aims to offer a practical and accessible pathway for inclusive communication by combining lightweight computer vision and speech processing tools within a unified framework.
format Preprint
id arxiv_https___arxiv_org_abs_2512_06485
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Sanvaad: A Multimodal Accessibility Framework for ISL Recognition and Voice-Based Interaction
Revankar, Kush
Deshpande, Shreyas
Sayeed, Araham
Tandale, Ansh
Bobde, Sarika
Computer Vision and Pattern Recognition
Communication between deaf users, visually im paired users, and the general hearing population often relies on tools that support only one direction of interaction. To address this limitation, this work presents Sanvaad, a lightweight multimodal accessibility framework designed to support real time, two-way communication. For deaf users, Sanvaad includes an ISL recognition module built on MediaPipe landmarks. MediaPipe is chosen primarily for its efficiency and low computational load, enabling the system to run smoothly on edge devices without requiring dedicated hardware. Spoken input from a phone can also be translated into sign representations through a voice-to-sign component that maps detected speech to predefined phrases and produces corresponding GIFs or alphabet-based visualizations. For visually impaired users, the framework provides a screen free voice interface that integrates multilingual speech recognition, text summarization, and text-to-speech generation. These components work together through a Streamlit-based interface, making the system usable on both desktop and mobile environments. Overall, Sanvaad aims to offer a practical and accessible pathway for inclusive communication by combining lightweight computer vision and speech processing tools within a unified framework.
title Sanvaad: A Multimodal Accessibility Framework for ISL Recognition and Voice-Based Interaction
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2512.06485