variables: 935647
Data license: CC-BY
This data as json
id | name | unit | description | createdAt | updatedAt | code | coverage | timespan | datasetId | sourceId | shortUnit | display | columnOrder | originalMetadata | grapherConfigAdmin | shortName | catalogPath | dimensions | schemaVersion | processingLevel | processingLog | titlePublic | titleVariant | attributionShort | attribution | descriptionShort | descriptionFromProducer | descriptionKey | descriptionProcessing | licenses | license | grapherConfigETL | type | sort | dataChecksum | metadataChecksum |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
935647 | Share of pages in the Guardian with a country tag (excludes UK, US, Australia) (10-year average) | pages per 100,000 pages | 2024-06-20 15:38:03 | 2024-07-25 23:08:47 | 2023-2023 | 6533 | { "unit": "pages per 100,000 pages" } |
0 | relative_pages_tags_excluded_10y_avg | grapher/news/2024-05-08/guardian_mentions/avg_10y#relative_pages_tags_excluded_10y_avg | 2 | Share of pages in The Guardian that are tagged with a country-related label. Excludes US, UK and Australia.. It reflects a 10-year average (2014 - 2023). | [] |
Getting the number of articles/entries talking about a certain country has no straightforward answer, since there can be different strategies. The strategy for this indicator is based on first defining a set of country name variations for each country, and then look for content on The Guardian with an explicit mention to these names. 1. Get all country name variations: - Obtain all the country name variations using our standard name list. - Our list may not cover all cases, and may contain some names that are not valid on The Guardian API (e.g. names with symbols like ';' are not supported). Therefore, we clean this list. 2. For each country, obtain the number of pages using each set of name variations. Steps: - For each country and year we get all content metadata: a query like "https://content.guardianapis.com/search?q=...&from-date=2020-01-01&to-date=2020-12-31" for year 2020. The count of pages is in the property `response.total`. For mor details, please refer to the snapshot script. This estimates exclude the UK, US, and Australia from the total number of pages. The reason for this is because the Guardian is a UK-based newspaper, and it is expected to have a higher number of articles about the UK, US, and Australia. | float | [] |
f4865861d6793f175a39e1945cdaa2c6 | ab302c23557b5e18f58b58cbb59bbb71 |