| Main Author: | Benzoni, Peter |
|---|---|
| Format: | Digital resource |
| Language: | English |
| Published: | Zenodo, 2025 |
| Subjects: | TV BRICS; World wide web; Web Archives as Topic |
| Online Access: | https://doi.org/10.5281/zenodo.14814699 |
| _version_ | 1866902245197479936 |
|---|---|
| author | Benzoni, Peter |
| author_facet | Benzoni, Peter |
| contents | <p>The dataset extracted by the script contains news article metadata from the <em>TV BRICS</em> website, spanning multiple languages. The dataset is structured as a CSV file.</p> <p><strong>Columns</strong></p> <ul> <li><strong>Language</strong>: The language in which the news article is published. The dataset includes news articles in Russian, English, Portuguese, Chinese, Spanish, and Arabic.</li> <li><strong>Title</strong>: The headline or title of the news article.</li> <li><strong>Path</strong>: The relative URL path of the article on <em>TV BRICS</em> (e.g., <code>/news/iran-zapustil-pervuyu-v-islamskom-mire-platformu-dlya-nauchnogo-obmena/</code>).</li> <li><strong>Link</strong>: The full URL to the news article, constructed by joining the website domain and the path (e.g., <code>https://tvbrics.com/news/iran-zapustil-pervuyu-v-islamskom-mire-platformu-dlya-nauchnogo-obmena/</code>).</li> </ul> <h3><strong>Characteristics of the Dataset</strong></h3> <ul> <li><strong>Multilingual Scope</strong>: The dataset includes articles from the different language sections of the website, making it suitable for comparative media analysis across languages.</li> <li><strong>Structured and Uniform Format</strong>: Each entry follows a standardized format: language, title, relative path, and absolute URL.</li> <li><strong>Pagination-Based Extraction</strong>: Articles are fetched from multiple pages per language, ensuring broad coverage of news over time.</li> <li><strong>Chronologically Ordered</strong>: The scraping script sorts the results by publication date in descending order, capturing the most recent articles first.</li> <li><strong>Deduplication Considerations</strong>: The script prevents redundant entries by checking whether the first scraped article on a page already exists in the dataset.</li> </ul> <h3><strong>Potential Uses</strong></h3> <ul> <li><strong>Comparative News Analysis</strong>: Investigating how different language versions of <em>TV BRICS</em> report on the same events.</li> <li><strong>Disinformation and Influence Studies</strong>: Analyzing narratives and framing across language-specific editions.</li> <li><strong>International Media Monitoring</strong>: Tracking coverage trends across geopolitical regions.</li> <li><strong>Linguistic and Sentiment Analysis</strong>: Evaluating tone, sentiment, and framing variations by language.</li> </ul> <p>This dataset serves as a structured repository of <em>TV BRICS</em> articles, facilitating further research in media studies, information operations, and digital propaganda analysis.</p> |
| format | Digital resource |
| id | zenodo_https___doi_org_10_5281_zenodo_14814699 |
| institution | Zenodo |
| language | eng |
| publishDate | 2025 |
| publisher | Zenodo |
| record_format | zenodo |
| title | TV BRICS Archive of Titles and Links in English, Russian, Portuguese, Chinese, Spanish and Arabic |
| topic | TV BRICS; World wide web; Web Archives as Topic |
| url | https://doi.org/10.5281/zenodo.14814699 |
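
The description above mentions two mechanical steps: building the Link column from the site domain plus the Path column, and skipping a page when its first article is already in the dataset. The scraping script itself is not part of this record, so the following is only a minimal sketch under stated assumptions. The function names (`build_link`, `page_is_new`, `add_page`) and the row layout are hypothetical; only the column names and the example domain come from the description.

```python
from urllib.parse import urljoin

# Domain taken from the example Link in the record description.
BASE_URL = "https://tvbrics.com"


def build_link(path, base_url=BASE_URL):
    """Join the site domain and a relative Path into an absolute Link."""
    return urljoin(base_url, path)


def page_is_new(page_rows, seen_paths):
    """Return True if the first article on the page is not yet in the dataset.

    Assumption: as in the description, only the first row of a page is
    compared, so a page counts as a duplicate when its first Path is known.
    """
    if not page_rows:
        return False
    return page_rows[0]["Path"] not in seen_paths


def add_page(dataset, seen_paths, language, page_rows):
    """Append one scraped page to the dataset unless it looks like a repeat."""
    if not page_is_new(page_rows, seen_paths):
        return False
    for row in page_rows:
        path = row["Path"]
        dataset.append(
            {
                "Language": language,
                "Title": row["Title"],
                "Path": path,
                "Link": build_link(path),
            }
        )
        seen_paths.add(path)
    return True
```

Checking only the first article keeps the test cheap, but a page that overlaps an earlier one partway down would still be added in full. A per-row check against `seen_paths` would be stricter, at the cost of departing from the behaviour the description reports.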