bigquery
The entire quarter-billion-record GDELT Event Database is now available as a public dataset in Google BigQuery.
完整的25亿记录的GDELT事件数据库现在可以作为公共数据集在Google BigQuery中使用。
This is the sentence at the top of the release post, and it’s a really big deal.
这是发布文章顶部的句子,这确实很重要。
加特 (GDELT)
The Global Database of Events, Language and Tone is one of the largest datasets on the planet. It is the quantitative database of human society, relying on thousands of news sources from every corner of the globe dating back to 1979.
全球事件,语言和语气数据库是地球上最大的数据集之一。 它是人类社会的定量数据库,它依赖于追溯到1979年的全球各个角落的数千个新闻来源。
It was thought up by Kalev Leetaru, who is also the author of the Google release post referenced above. The GDELT covers all countries globally spanning a third of a century, and consists of daily updates during that time period. Hundreds of millions of records, each with 59 fields narrating into detail the actors and events having taken place. Every record is georeferenced, so you can globally place it, and all actors are tagged with appropriate ethnic and religious affiliation. All this – free and available for your perusal, and you don’t even have to have the computing power to handle it.
Kalev Leetaru曾想过&