variables: 902138
Data license: CC-BY
id | name | unit | description | createdAt | updatedAt | code | coverage | timespan | datasetId | sourceId | shortUnit | display | columnOrder | originalMetadata | grapherConfigAdmin | shortName | catalogPath | dimensions | schemaVersion | processingLevel | processingLog | titlePublic | titleVariant | attributionShort | attribution | descriptionShort | descriptionFromProducer | descriptionKey | descriptionProcessing | licenses | license | grapherConfigETL | type | sort | dataChecksum | metadataChecksum |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
902138 | Share of pages in the Guardian with a country tag | pages per 100,000 pages | 2024-05-23 12:19:23 | 2024-07-25 23:08:48 | 2013-2023 | 6533 | { "unit": "pages per 100,000 pages" } |
0 | relative_pages_tags | grapher/news/2024-05-08/guardian_mentions/guardian_mentions#relative_pages_tags | 2 | Share of pages in The Guardian that are tagged with a country-related label. | [] |
There is no single way to count the articles or entries that discuss a given country; several strategies are possible. The strategy for this indicator is to first collect all the tags for a country and then count the pages that carry those tags. 1. Obtain all tags that concern a country: - Obtain all the tag pages whose title starts with a country name, using a query like "https://content.guardianapis.com/tags?web-title=spain" for Spain. The result is a mapping from each country to the list of tags in use (e.g. "Spain: [world/spain, travel/spain, etc.]"). - We work with a list of ~240 countries. - Getting the right country names has been an iterative process of aligning our standard country names with the Guardian's. 2. For each country, obtain the number of pages using each set of tags: - For each country and year, we get all content metadata with a query like "https://content.guardianapis.com/search?tags=...&from-date=2020-01-01&to-date=2020-12-31" for the year 2020. The count of pages is in the property `response.total`. For more details, please refer to the snapshot script. | float | [] |
82ef4f90325641e0e6c50576506b75ec | 3dd4775634566260d70d42e094b0c559 |
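
The two-step strategy in the processing description can be sketched in Python as follows. This is an illustrative sketch, not the snapshot script itself: the function names are hypothetical, the `tags` parameter name is copied from the example URL in the description and should be checked against the Guardian's API documentation, and the format of the tag filter is left to the caller because the description elides it. The sketch only builds URLs and parses a response; a real request would likely also need an API key, which is omitted here. The normalization to pages per 100,000 assumes the denominator is the total number of Guardian pages in the same year, which the description does not state explicitly.

```python
from urllib.parse import urlencode

BASE_URL = "https://content.guardianapis.com"


def tags_url(country_name: str) -> str:
    """Step 1: URL listing the tag pages whose title matches a country name."""
    return f"{BASE_URL}/tags?" + urlencode({"web-title": country_name.lower()})


def search_url(tag_filter: str, year: int, tag_param: str = "tags") -> str:
    """Step 2: URL for all content carrying the given tag filter in one year.

    `tag_param` follows the example URL in the description (assumption);
    `tag_filter` is passed through as-is because its format is not specified.
    """
    params = {
        tag_param: tag_filter,
        "from-date": f"{year}-01-01",
        "to-date": f"{year}-12-31",
    }
    return f"{BASE_URL}/search?" + urlencode(params, safe="/")


def page_count(payload: dict) -> int:
    """Read the page count from the `response.total` property of a reply."""
    return int(payload["response"]["total"])


def pages_per_100k(country_pages: int, all_pages: int) -> float:
    """Express a count as pages per 100,000 pages (assumed denominator:
    all Guardian pages in the same year)."""
    if all_pages <= 0:
        raise ValueError("all_pages must be positive")
    return 100_000 * country_pages / all_pages
```

For example, `tags_url("Spain")` reproduces the tag query quoted in the description, and `pages_per_100k(50, 200_000)` expresses 50 tagged pages out of 200,000 in the indicator's unit.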