Salvato in:
| Autori principali: | , , |
|---|---|
| Natura: | Recurso digital |
| Lingua: | |
| Pubblicazione: |
Zenodo
2026
|
| Soggetti: | |
| Accesso online: | https://doi.org/10.5281/zenodo.20124605 |
| Tags: |
Aggiungi Tag
Nessun Tag, puoi essere il primo ad aggiungerne!!
|
Sommario:
- Podcasts have emerged as a widely adopted medium for information dissemination across domains such as technology, education, health, and entertainment. However, their extended duration and unstructured conversational format make efficient content discovery challenging. This paper presents an automated Podcast Summarization and Topic Classification system that integrates Automatic Speech Recognition (ASR) and transformer-based Natural Language Processing (NLP) models to convert raw audio into structured textual insights. The proposed framework employs Whisper ASR for accurate speech-to-text transcription, followed by preprocessing techniques including tokenization, stop-word removal, and normalization. Abstractive summarization is performed using the BART transformer model to generate concise and semantically coherent summaries. Topic classification is achieved using supervised machine learning algorithms, including Logistic Regression, Decision Tree, Random Forest, and contextual modeling BERT. Experimental evaluation demonstrates reliable transcription accuracy and improved classification performance across multiple podcast genres. The system significantly reduces listening time, enhances accessibility, and improves content discoverability. The modular architecture ensures scalability and provides a practical solution for intelligent audio content analysis in academic and real-world applications.