Saved in:
| Main Author: | |
|---|---|
| Format: | Recurso digital |
| Language: | English |
| Published: |
Zenodo
2025
|
| Subjects: | |
| Online Access: | https://doi.org/10.5281/zenodo.14814699 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Table of Contents:
- <p>The dataset extracted by the script contains news article metadata from the <em>TV BRICS</em> website, spanning multiple languages. The dataset is structured as a CSV file.<br><br></p> <p><strong>Coliumns</strong></p> <div> <div> <div> <div> <ul> <li><strong>Language</strong>: The language in which the news article is published. The dataset includes news articles in Russian, English, Portuguese, Chinese, Spanish, and Arabic.</li> <li><strong>Title</strong>: The headline or title of the news article.</li> <li><strong>Path</strong>: The relative URL path of the article on <em>TV BRICS</em> (e.g., <code>/news/iran-zapustil-pervuyu-v-islamskom-mire-platformu-dlya-nauchnogo-obmena/</code>).</li> <li><strong>Link</strong>: The full URL to the news article, constructed using the website domain (e.g., <code>https://tvbrics.com/news/iran-zapustil-pervuyu-v-islamskom-mire-platformu-dlya-nauchnogo-obmena/</code>).</li> </ul> <h3><strong>Characteristics of the Dataset</strong></h3> <ul> <li><strong>Multilingual Scope</strong>: The dataset includes articles from different linguistic sections of the website, making it suitable for comparative media analysis across languages.</li> <li><strong>Structured and Uniform Format</strong>: Each entry contains a standardized format with a title, relative path, and absolute URL.</li> <li><strong>Pagination-Based Extraction</strong>: Articles are fetched from multiple pages per language, ensuring a broad coverage of news over time.</li> <li><strong>Chronologically Ordered</strong>: The scraping script sorts the results by publication date in descending order, capturing the most recent articles first.</li> <li><strong>Deduplication Considerations</strong>: The script prevents redundant entries by checking if the first scraped article on a page already exists in the dataset.</li> </ul> <h3><strong>Potential Uses</strong></h3> <ul> <li><strong>Comparative News Analysis</strong>: Investigating how different linguistic versions of <em>TV BRICS</em> report on the same events.</li> <li><strong>Disinformation and Influence Studies</strong>: Analyzing narratives and framing across language-specific editions.</li> <li><strong>International Media Monitoring</strong>: Tracking coverage trends across geopolitical regions.</li> <li><strong>Linguistic and Sentiment Analysis</strong>: Evaluating tone, sentiment, and framing variations by language.</li> </ul> <p>This dataset serves as a structured repository of <em>TV BRICS</em> articles, facilitating further research in media studies, information operations, and digital propaganda analysis.</p> </div> </div> </div> </div> <div> <div> <div> </div> </div> </div> <p> </p>