Saved in:
| Hovedforfatter: | |
|---|---|
| Format: | Recurso digital |
| Sprog: | engelsk |
| Udgivet: |
Zenodo
2026
|
| Fag: | |
| Online adgang: | https://doi.org/10.5281/zenodo.20177527 |
| Tags: |
Tilføj Tag
Ingen Tags, Vær først til at tagge denne postø!
|
| _version_ | 1866901852418736128 |
|---|---|
| author | D'Ambrosio, Alessia |
| author_facet | D'Ambrosio, Alessia |
| contents | <p>This dataset contains 180 simulated binary classification datasets designed to study the interaction between class imbalance and local neighborhood geometry. The simulation design combines three imbalance levels, defined as the proportion of the positive class (pi = 0.005, 0.05, 0.5), three local geometric configurations (clean_local, fragmented_minor, majority_intrusion), and 20 replications for each regime-by-imbalance combination.</p> <p>Each dataset contains 2500 observations, 7 numerical predictors, and a binary class label with levels "pos" and "neg". The positive class is treated as the minority class when pi < 0.5. The simulated configurations are intended to represent different local difficulty mechanisms: well-separated local structure, fragmented minority structure, and majority intrusion into the minority region.</p> |
| format | Recurso digital |
| id | zenodo_https___doi_org_10_5281_zenodo_20177527 |
| institution | Zenodo |
| language | eng |
| publishDate | 2026 |
| publisher | Zenodo |
| record_format | zenodo |
| spellingShingle | Local Geometry in Imbalanced Classification: Simulated Datasets D'Ambrosio, Alessia binary classification local geometry class imbalance <p>This dataset contains 180 simulated binary classification datasets designed to study the interaction between class imbalance and local neighborhood geometry. The simulation design combines three imbalance levels, defined as the proportion of the positive class (pi = 0.005, 0.05, 0.5), three local geometric configurations (clean_local, fragmented_minor, majority_intrusion), and 20 replications for each regime-by-imbalance combination.</p> <p>Each dataset contains 2500 observations, 7 numerical predictors, and a binary class label with levels "pos" and "neg". The positive class is treated as the minority class when pi < 0.5. The simulated configurations are intended to represent different local difficulty mechanisms: well-separated local structure, fragmented minority structure, and majority intrusion into the minority region.</p> |
| title | Local Geometry in Imbalanced Classification: Simulated Datasets |
| topic | binary classification local geometry class imbalance |
| url | https://doi.org/10.5281/zenodo.20177527 |