Saved in:
Bibliographic Details
Main Authors: Kumar, Mayank, Rathore, Ravindranath Singh
Format: Recurso digital
Language:English
Published: Zenodo 2026
Online Access:https://doi.org/10.5281/zenodo.19178462
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866901635682271232
author Kumar, Mayank
Rathore, Ravindranath Singh
author_facet Kumar, Mayank
Rathore, Ravindranath Singh
contents <p>Accurate prediction of protein secondary structure is essential for reliable tertiary structure prediction and peptide design. To address this, multiple algorithms have been proposed. The conformation of a residue within a polypeptide chain is strongly influenced by its immediate neighbors. The conformational tendencies of 20 residues in the presence of their first neighbors, constituting a total of 8000 tripeptides, were calculated. Using the propensity values of 8000 tripeptide variants, we propose an accurate method for secondary structure prediction. Various machine learning (ML) models were built using propensities, position-specific scoring matrices (PSSM) and amino acid binary features to predict three-state (Q3) and eight-state (Q8) secondary structures. The results suggest that the XGBoost ML model produced highly accurate predictions, achieving accuracies of 93% and 88% for Q3 and Q8 state, respectively for validation dataset CB513. A program for prediction called S8kPred is developed. This utility along with consensus secondary structure prediction is available online at <a href="https://www.s8kpred.in">https://www.s8kpred.in</a>.</p>
format Recurso digital
id zenodo_https___doi_org_10_5281_zenodo_19178462
institution Zenodo
language eng
publishDate 2026
publisher Zenodo
record_format zenodo
spellingShingle S8kPred: A Novel Approach for Protein Secondary Structure Prediction Using 8000 Tripeptide Propensities
Kumar, Mayank
Rathore, Ravindranath Singh
<p>Accurate prediction of protein secondary structure is essential for reliable tertiary structure prediction and peptide design. To address this, multiple algorithms have been proposed. The conformation of a residue within a polypeptide chain is strongly influenced by its immediate neighbors. The conformational tendencies of 20 residues in the presence of their first neighbors, constituting a total of 8000 tripeptides, were calculated. Using the propensity values of 8000 tripeptide variants, we propose an accurate method for secondary structure prediction. Various machine learning (ML) models were built using propensities, position-specific scoring matrices (PSSM) and amino acid binary features to predict three-state (Q3) and eight-state (Q8) secondary structures. The results suggest that the XGBoost ML model produced highly accurate predictions, achieving accuracies of 93% and 88% for Q3 and Q8 state, respectively for validation dataset CB513. A program for prediction called S8kPred is developed. This utility along with consensus secondary structure prediction is available online at <a href="https://www.s8kpred.in">https://www.s8kpred.in</a>.</p>
title S8kPred: A Novel Approach for Protein Secondary Structure Prediction Using 8000 Tripeptide Propensities
url https://doi.org/10.5281/zenodo.19178462