Kaydedildi:
Detaylı Bibliyografya
Yazar: Shokhrukh Sariyev
Materyal Türü: Recurso digital
Dil:İngilizce
Baskı/Yayın Bilgisi: Zenodo 2025
Konular:
Online Erişim:https://doi.org/10.5281/zenodo.17994364
Etiketler: Etiketle
Etiket eklenmemiş, İlk siz ekleyin!
İçindekiler:
  • This article proposes a genetic algorithm-based approach to optimize the filling of missing NaN values in a dataset. The focus is on selecting NaN values in the dataset directly corresponding to the results of the classification task. In the proposed method, each individual is represented as a chromosome in the form of a vector of all missing values. The search space is bounded by the given intervals for numerical attributes, and by the set of appropriate categories for categorical attributes. The accuracy indicator of the Random Forest ensemble model was used as the fitness function in the genetic algorithm.