Gespeichert in:
| Hauptverfasser: | , , , |
|---|---|
| Format: | Recurso digital |
| Sprache: | Englisch |
| Veröffentlicht: |
Zenodo
2022
|
| Schlagworte: | |
| Online-Zugang: | https://doi.org/10.5281/zenodo.6580171 |
| Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Inhaltsangabe:
- <p>This dataset contains the posts of Open Source Stack Exchange site, collected at the end of 2020, along with the categorization of the posts. For each post a category, and potentially a second one is indicated, along with the cluster (generic group) each category belongs to. The coding task of assigning each question to a category was performed by two independent coders for each question (the categorization of each coder is also provided in the dataset). The dataset contains also (in a separate file) a dictionary of the most correlated unigrams and bigrams per category.</p>