| Main Author: | |
|---|---|
| Format: | Digital resource |
| Language: | Urdu |
| Published: | Zenodo, 2025 |
| Online Access: | https://doi.org/10.5281/zenodo.16148140 |
Table of Contents:
- The Mushtaq Urdu Named Entities Corpus (**MUNEC**) is a dataset of 119,276 labelled tokens, comprising **8,759 Named Entities** and 110,517 non-Named Entity tokens drawn from seven domains of Urdu news text: Entertainment, Finance, General, Health, Politics, Science, and Sports. The Named Entities are tagged into five classes: **Person, Location, Organization, Date, and Number**. The dataset is freely available for research and academic purposes, provided that the authors' work is duly referenced.
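
The token counts in the description imply a strong class imbalance between entity and non-entity tokens, which matters when choosing evaluation metrics for a tagger. The following is a minimal sketch that checks the stated totals and derives the entity share. It uses only the figures quoted in the record; the variable names are illustrative and do not reflect the dataset's actual file layout, which the record does not describe.

```python
# Figures quoted in the catalog record for MUNEC.
NAMED_ENTITY_TOKENS = 8_759
NON_ENTITY_TOKENS = 110_517
STATED_TOTAL = 119_276

# Domain and class lists as given in the description (names are illustrative constants).
DOMAINS = ["Entertainment", "Finance", "General", "Health",
           "Politics", "Science", "Sports"]
ENTITY_TYPES = ["Person", "Location", "Organization", "Date", "Number"]

# Check that the two token groups add up to the stated corpus size.
total_tokens = NAMED_ENTITY_TOKENS + NON_ENTITY_TOKENS
counts_consistent = total_tokens == STATED_TOTAL

# Share of tokens that belong to a named entity, and the imbalance ratio.
ne_share = NAMED_ENTITY_TOKENS / total_tokens
imbalance_ratio = NON_ENTITY_TOKENS / NAMED_ENTITY_TOKENS

print(f"Counts consistent with stated total: {counts_consistent}")
print(f"Named-entity share of tokens: {ne_share:.2%}")
print(f"Non-entity tokens per entity token: {imbalance_ratio:.2f}")
```

Because only a small fraction of tokens carry an entity label, token-level accuracy would be dominated by the non-entity class; per-class precision, recall, and F1 are likely to be more informative when evaluating on this corpus.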