Збережено в:
Бібліографічні деталі
Автори: Criado, Alvaro, Carnerero, Cristina, Soret Miravet, Albert, Guevara, Marc, Jorba, Oriol, Mateu Armengol, Jan
Формат: Recurso digital
Мова:Англійська
Опубліковано: Zenodo 2025
Предмети:
Онлайн доступ:https://doi.org/10.5281/zenodo.16737066
Теги: Додати тег
Немає тегів, Будьте першим, хто поставить тег для цього запису!
Зміст:
  • <h1><strong>Summary</strong></h1> <p>Air pollution is the leading environmental threat to global public health [1]. As a result, accurately characterizing air quality levels is a priority. The city of Barcelona (Spain) faces a persistent NO₂ pollution problem [2,3], mainly driven by high vehicle density [4], a compact urban morphology, and its coastal location. </p> <p>We present a database consisting of daily and annual NO₂ surface concentrations with associated uncertainty estimates from 2019 to 2024 in Barcelona city, as summarized in <em>Table 1</em>. The database is obtained by combining high-resolution outputs from the CALIOPE-Urban air quality model [5] with multiple observational datasets using a data-fusion method based on Universal Kriging (UK) [6].  Exceedance probability maps are also provided by combining concentration levels with their associated uncertainty. These maps are based on daily and annual NO₂ thresholds from European Air Quality Directives 2008/50/EC [7] and 2024/2881 [8], as well as the 2021 WHO guidelines [1], and serve as a valuable tool for policymakers and regulatory assessments by offering information directly linked to specific NO₂ limits.</p> <p>The first dataset corresponds to annual averages of the variables in high-spatial resolution maps, at a grid of 25m x 25m. The NO₂ annual thresholds for computing the probability maps are (i) 10 µg/m³, the WHO recommended limit [1], (ii) 20  µg/m³, the European annual limit value set by the current air quality Directive 2024/2881 [8], and (iii) 40  µg/m³, the European annual limit value set by the air quality Directive 2008/50/c [7] (the one in force at the period of the database). </p> <p>The second dataset provides daily and annual averages of the variables aggregated at the census tract level. The exceedance probability maps are calculated using the NO₂ levels and uncertainty already expressed in the census tracts. The annual limits are the same as above, while the NO₂ daily thresholds for computing the probability maps are (i) 25 µg/m³, the WHO-recommended limit [1], and (ii) 50  µg/m³, the European daily limit value set by the air quality Directive 2024/2881 [7]. </p> <p>For more information on the methodology, see reference [6]. For the code repository, refer to [12]. For more details about this database, refer to [13].</p> <table style="border-collapse: collapse; width: 98.7805%; height: 464.688px;"><colgroup><col style="width: 12.1432%;"><col style="width: 38.2174%;"><col style="width: 11.1222%;"><col style="width: 16.4891%;"><col style="width: 10.4815%;"><col style="width: 11.504%;"></colgroup> <tbody> <tr style="height: 47.5938px;"> <td style="height: 47.5938px;"><strong>Dataset</strong></td> <td style="height: 47.5938px;"><strong>Variable</strong></td> <td style="height: 47.5938px;"><strong>Temporal resolution</strong></td> <td style="height: 47.5938px;"> <p><strong>Spatial resolution</strong></p> </td> <td style="height: 47.5938px;"><strong>Coverage </strong></td> <td style="height: 47.5938px;"><strong>Formats</strong></td> </tr> <tr style="height: 184.75px;"> <td style="height: 184.75px;">Dataset 1</td> <td style="height: 184.75px;"> <ul> <li>NO₂</li> <li>Uncertainty</li> <li>Probability of exceeding 10 µg/m³</li> <li>Probability of exceeding 20 µg/m³</li> <li>Probability of exceeding 40 µg/m³</li> </ul> </td> <td style="height: 184.75px;">Annual</td> <td style="height: 184.75px;"> <p>High</p> <p>(25m x 25m)</p> </td> <td style="height: 184.75px;"> <p>2019-2024</p> </td> <td style="height: 184.75px;"> <p>SHP,</p> <p>raster</p> </td> </tr> <tr style="height: 125.969px;"> <td style="height: 125.969px;">Dataset 2 (A)</td> <td style="height: 125.969px;"> <ul> <li>NO₂</li> <li>Uncertainty</li> <li>Probability of exceeding 10 µg/m³</li> <li>Probability of exceeding 20 µg/m³</li> <li>Probability of exceeding 40 µg/m³</li> </ul> </td> <td style="height: 125.969px;"> <p>Annual</p> </td> <td style="height: 125.969px;">Census tract</td> <td style="height: 125.969px;"> <p>2019-2024</p> </td> <td style="height: 125.969px;"> <p>SHP,</p> <p>CSV</p> </td> </tr> <tr style="height: 106.375px;"> <td style="height: 106.375px;"> <p>Dataset 2 (B)</p> </td> <td style="height: 106.375px;"> <ul> <li>NO₂</li> <li>Uncertainty</li> <li>Probability of exceeding 25 µg/m³</li> <li>Probability of exceeding 50 µg/m³</li> </ul> </td> <td style="height: 106.375px;">Daily</td> <td style="height: 106.375px;">Census tract</td> <td style="height: 106.375px;"> <p>2019-2024</p> </td> <td style="height: 106.375px;"> <p>SHP, </p> <p>CSV</p> </td> </tr> </tbody> </table> <p><em>Table 1: Scheme of the two datasets provided, showing air quality variables, temporal and spatial resolutions, formats, and</em><br><em>temporal coverage.</em></p> <h1><strong>Structure</strong></h1> <p>The Zenodo repository is organized based on the structure presented in <em>Table 1</em>. Each dataset is a <em>.zip</em> file, composed of folders based on the different formats.</p> <h2><strong>Dataset 1</strong></h2> <p>This dataset is composed of <strong>annual</strong> values in <strong>high-spatial resolution (25m x 25m)</strong>. The zip file is <em>Dataset1.zip</em>.</p> <h3><strong>1) SHP format</strong></h3> <p>The folder here is named <em>Dataset1_SHP</em>. The files are named as <strong><em>shp_reso_$variable$_annualmean_$year$.shp</em></strong>, where <strong><em>$variable$</em></strong> can be <em>NO2, sdrel </em>or<em> exc_together</em>; and <strong><em>$year$</em></strong> is a number from 2019 to 2024. In addition to the <em>.shp </em>extension<em>, </em>the corresponding <em>.shx, .prj, </em>and <em>.dbf </em>files<em> </em>are also present.</p> <p>The files are processed using the R package <em>sf</em> [9], and each contains approximately 250,000 features as geometry type <em>MULTIPOLYGON</em>. The projection is EPSG:4326, using the geodetic coordinate reference system WGS 84. In addition to the geometry, the attribute fields vary depending on the variable:</p> <ul> <li> <p>For the <em>NO2</em> variable, the field <em>no2</em> represents the NO₂ annual mean concentration, with units of µg/m³.</p> </li> <li> <p>For <em>sdrel</em>, the field <em>sd_rel</em> indicates the relative uncertainty of the NO₂ annual mean, expressed as a %.</p> </li> <li> <p>For <em>exc_together</em>, the fields <em>exc_40, exc_20,</em> and <em>exc_10</em> correspond to the probability of exceeding (expressed as % probability):</p> <ul> <li> <p>40 µg/m³ (the annual limit value set by the European Air Quality Directive 2008/50/EC [7]),</p> </li> <li> <p>20 µg/m³ (the annual limit value set by the European Air Quality Directive 2024/2881 [8]), and</p> </li> <li> <p>10 µg/m³ (the annual limit value recommended by the WHO 2021 guidelines [1]).</p> </li> </ul> </li> </ul> <p>These exceedance probabilities are calculated by combining the annual mean NO₂ values with their associated uncertainties and selecting a NO₂ threshold.</p> <h3><strong>2) Raster format</strong></h3> <p>The folder here is named <em>Dataset1_raster</em>. The files are named as <strong><em>raster_together_annualmean_$year$.tif</em></strong>, where <strong><em>$year$</em></strong> is a number from 2019 to 2024.</p> <p>The files are processed using the R packages <em>raster</em> [10] and <em>terra</em> [11]. Each file contains approximately 600,000 cells, as a mask is applied to the city of Barcelona. Of these, approximately 250,000 cells have valid (non-NA) values. The projection is EPSG:4326, using the geodetic coordinate reference system WGS 84. The rasters contain five bands, with the following names:</p> <ul> <li> <p><em>no2</em> represents the NO₂ annual mean concentration, with units of µg/m³.</p> </li> <li> <p><em>sd rel </em>indicates the relative uncertainty of the NO₂ annual mean, expressed as a %.</p> </li> <li> <p>The bands <em>exc40, exc20,</em> and <em>exc10</em> correspond to the probability of exceeding (expressed as a % probability):</p> <ul> <li> <p>40 µg/m³ (the annual limit value set by the European Air Quality Directive 2008/50/EC [7]),</p> </li> <li> <p>20 µg/m³ (the annual limit value set by the European Air Quality Directive 2024/2881 [8]), and</p> </li> <li> <p>10 µg/m³ (the annual limit value recommended by the WHO 2021 guidelines [1]).</p> </li> </ul> </li> </ul> <p>These exceedance probabilities are calculated by combining the annual mean NO₂ values with their associated uncertainties and selecting a NO₂ threshold.</p> <h2><strong>Dataset 2 (A)</strong></h2> <p>This dataset is composed of <strong>annual</strong> values in the <strong>census track</strong> aggregation. The zip file is <em>Dataset2A.zip</em>.</p> <h3><strong>1) SHP format</strong></h3> <p>The folder here is named <em>Dataset2A_SHP</em>. The files are named as <strong><em>shp_census_annualmean_$year$.shp</em></strong>, where <strong><em>$year$</em></strong> is a number from 2019 to 2024. In addition to the <em>.shp </em>extension<em>, </em>the corresponding <em>.shx, .prj, </em>and <em>.dbf </em>files<em> </em>are also present.</p> <p>The files are processed using the R package <em>sf</em> [9], and each contains 1,068 features as geometry type <em>MULTIPOLYGON</em>. The projection is EPSG:4326, using the geodetic coordinate reference system WGS 84. In addition to the geometry, the files contain a total of 7 fields:</p> <ul> <li><em>OBJECTID</em>, a number identification from 1 to 1,068.</li> <li><em>ID_census</em>, the official census tract identifier in Spain, based on the classification by the <em>Instituto Nacional de Estadística</em> (INE).</li> <li> <p><em>no2</em> represents the NO₂ annual mean concentration, with units of µg/m³.</p> </li> <li> <p><em>sd_rel</em> indicates the relative uncertainty of the NO₂ annual mean, expressed as a %.</p> </li> <li> <p>Fields <em>exc_40, exc_20,</em> and <em>exc_10</em> correspond to the probability of exceeding (expressed as a % probability):</p> <ul> <li> <p>40 µg/m³ (the annual limit value set by the European Air Quality Directive 2008/50/EC [7]),</p> </li> <li> <p>20 µg/m³ (the annual limit value set by the European Air Quality Directive 2024/2881 [8]), and</p> </li> <li> <p>10 µg/m³ (the annual limit value recommended by the WHO 2021 guidelines [1]).</p> </li> </ul> </li> </ul> <p>These exceedance probabilities are calculated by combining the annual mean NO₂ values with their associated uncertainties and selecting a NO₂ threshold.</p> <h3><strong>2) CSV format</strong></h3> <p>The folder here is named <em>Dataset2A_CSV</em>. The files are named as <strong><em>csv</em><em>_census_annualmean_$year$.csv</em></strong>, where <strong><em>$year$</em></strong> is a number from 2019 to 2024. The files contain 1,068 rows and, in addition to a column called <em>X</em> (which represents the row numbers), they include the same information as the SHP files mentioned above, but without the geometry column. To associate each row with its corresponding spatial area, please refer to the geometry in the SHP files from the same dataset.</p> <h2><strong>Dataset 2 (B)</strong></h2> <p>This dataset is composed of <strong>daily</strong> values in the <strong>census track</strong> aggregation. The zip file is <em>Dataset2B.zip</em>.</p> <h3><strong>1) SHP format</strong></h3> <p>The folder here is named <em>Dataset2B_SHP</em>. The files are named as <strong><em>census_daily_$year$_$variable$.shp</em></strong>, where <strong><em>$variable$</em></strong> can be <em>no2, sd_rel, </em><em>exc_25, </em>or<em> exc_50</em>; and <strong><em>$year$</em></strong> is a number from 2019 to 2024. In addition to the <em>.shp </em>extension<em>, </em>the corresponding <em>.shx, .prj, </em>and <em>.dbf </em>files<em> </em>are also present.</p> <p>The files are processed using the R package <em>sf</em> [9], and each contains 1,068 features as geometry type <em>MULTIPOLYGON</em>. The projection is EPSG:4326, using the geodetic coordinate reference system WGS 84. In addition to the geometry, the files contain a total of 367 fields (368 for 2020):</p> <ul> <li><em>OBJECTID</em>, a number identification from 1 to 1,068.</li> <li><em>ID_census</em>, the official census tract identifier in Spain, based on the classification by the <em>Instituto Nacional de Estadística</em> (INE).</li> <li> <p>From<em> <strong>$year$</strong>0101 </em>to<em> <strong>$year$</strong>1231 </em>(i.e.,<em> 20190101, 20190102, ... 20191231</em>). The meaning of the dates columns varies depending on the <strong><em>$variable$</em></strong> specified in the file:</p> <ul> <li>If <strong><em>$variable$</em></strong> is <em>no2</em>, they represent the daily mean NO₂ value for that day, with units of µg/m³.</li> <li>If <strong><em>$variable$</em></strong> is <em>sd_rel</em>, they represent the relative uncertainty of the daily mean NO₂ value for that day, expressed as a %.</li> <li>If <strong><em>$variable$</em></strong><em>e</em> is <em>exc_25</em>, they correspond to the probability of exceeding 25 µg/m³, the daily limit value recommended by the WHO 2021 guidelines [1], expressed as a % probability.</li> <li>If <strong><em>$variable$</em></strong> is <em>exc_50</em>, they correspond to the probability of exceeding 50 µg/m³, the daily limit value set by the European Air Quality Directive 2024/2881 [8], expressed as a % probability.</li> </ul> </li> </ul> <p>These exceedance probabilities are calculated by combining the annual mean NO₂ values with their associated uncertainties and selecting a NO₂ threshold.</p> <h3><strong>2) CSV format</strong></h3> <p>The folder here is named <em>Dataset2B_CSV</em>. The files are named as <strong><em>csv</em><em>_daily_$year$_$variable$.csv</em></strong>, where <strong><em>$variable$</em></strong> can be <em>no2, sd_rel, </em><em>exc_25, </em>or<em> exc_50</em>; and <strong><em>$year$</em></strong> is a number from 2019 to 2024. The files contain 1,068 rows, and they include the same information as the SHP files mentioned above, depending on the <strong><em>$variable$</em></strong>, but without the geometry column. To associate each row with its corresponding spatial area, please refer to the geometry in the SHP files from the same dataset. Note that the date columns start with <em>X</em>, from<em> X<strong>$year$</strong>0101 </em>to<em> X<strong>$year$</strong>1231.</em></p> <h1><strong><em>uncertAIR</em> platform</strong></h1> <p>In addition, the datasets are accessible through the<strong> <em>uncertAIR</em> platform (https://earth.bsc.es/shiny/uncertAIR/).</strong>  <em>uncertAIR</em> is a research project funded by the Barcelona City Council. The project emphasized the importance of addressing uncertainty in air quality simulations and aimed to create an accessible tool for data dissemination and visualization.</p> <h1><strong>Legal Notice</strong></h1> <p>Note: for more information on the methodology, see reference [6]. For the code repository, refer to [12]. For more details about this database, refer to [13].</p> <h2>How should I use the information and products available in this repository or the <em>uncertAIR</em> platform?</h2> <ol> <li>Data, images, and other products from this repository or the <em>uncertAIR</em> platform available on this server may be used solely for research and education purposes. Products may not be used for commercial purposes.</li> <li>In case of establishing links to the contents of this website, kindly inform us via email <strong>alvaro.criado@bsc.es</strong></li> <li>The authors cannot guarantee that the data are correct in all circumstances. Neither do accepts any liability whatsoever for any error or omission in the data, or any loss or damage arising from its use.</li> <li>Numerical data must not be supplied as a whole or in part to any third party without prior authorization.</li> <li>Articles, papers, or written scientific works of any form, that are based in whole or in part on data, images or other products supplied by this database should contain an acknowledgement, giving credit to the Center every time data/images/products are used. <strong>Please use the reference [12] as well as the citation to this repository</strong>.</li> </ol>