Saved in:
Bibliographic Details
Main Authors: Rahmun, Mahieyin, Khan, Rafat Hasan, Aurpa, Tanjim Taharat, Khan, Sadia, Nahiyan, Zulker Nayeen, Almas, Mir Sayad Bin, Rajib, Rakibul Hasan, Hassan, Syeda Sakira
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2412.13279
Tags: Add Tag
No Tags, Be the first to tag this record!
Table of Contents:
  • The aim of this project is to implement and design arobust synthetic speech classifier for the IEEE Signal ProcessingCup 2022 challenge. Here, we learn a synthetic speech attributionmodel using the speech generated from various text-to-speech(TTS) algorithms as well as unknown TTS algorithms. Weexperiment with both the classical machine learning methodssuch as support vector machine, Gaussian mixture model, anddeep learning based methods such as ResNet, VGG16, and twoshallow end-to-end networks. We observe that deep learningbased methods with raw data demonstrate the best performance.