Source Data and Data Sources
by Hiren Patel
What Is Open Data?
In simple terms, open data is data that anyone can access, modify, reuse, and share.
Open data has its roots in various “open movements” such as open source, open hardware, open government, and open science.
Governments, independent organizations, and agencies have stepped forward to open the floodgates, publishing more and more data for free and easy access.
Why Is Open Data Important?
Open data is important because the world has grown increasingly data-driven. But if access to and use of data are restricted, the idea of data-driven business and governance will not materialize.
Therefore, open data has its own unique place. It can allow a fuller understanding of global problems and universal issues. It can give a big boost to businesses. It can be a great impetus for machine learning. It can help fight global problems such as disease, crime, or famine. Open data can empower citizens and hence strengthen democracy. It can streamline the processes and systems that society and governments have built. It can help transform the way we understand and engage with the world.
So here’s my list of 15 awesome Open Data sources:
1. World Bank Open Data
As a repository of the world’s most comprehensive data on what’s happening in countries across the world, World Bank Open Data is a vital source of open data. It also provides access to other datasets, which are listed in its data catalog.
World Bank Open Data is massive: it has 3,000 datasets and 14,000 indicators encompassing microdata, time series statistics, and geospatial data.
Accessing and discovering the data you want is also quite easy. All you need to do is specify indicator names, countries, or topics, and it will open up the treasure-house of open data for you. It also allows you to download data in different formats such as CSV, Excel, and XML.
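Once a CSV file is downloaded, reshaping it is often the first step. The sketch below is only an illustration: it assumes a wide layout with one column per year, and the sample rows, country names, and the `wide_to_long` helper are hypothetical, not taken from a real World Bank export (which may carry extra metadata rows or columns).

```python
import csv
import io

# Hypothetical sample in a wide layout (one column per year).
# A real export may differ, so treat the column names as assumptions.
SAMPLE_CSV = """Country Name,Country Code,Indicator Name,Indicator Code,2019,2020
Exampleland,EXL,"Population, total",SP.POP.TOTL,1000,1010
Samplestan,SMP,"Population, total",SP.POP.TOTL,2000,
"""


def wide_to_long(csv_text):
    """Turn one-column-per-year rows into (country_code, year, value) tuples."""
    reader = csv.DictReader(io.StringIO(csv_text))
    records = []
    for row in reader:
        for column, cell in row.items():
            # Year columns are the ones whose header consists only of digits.
            # Empty cells are treated as missing observations and skipped.
            if column and column.isdigit() and cell not in ("", None):
                records.append((row["Country Code"], int(column), float(cell)))
    return records


if __name__ == "__main__":
    for record in wide_to_long(SAMPLE_CSV):
        print(record)
```

The long format (one row per country and year) tends to be easier to filter, join, and plot than the wide one.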
If you are a journalist or academic, you will be enthralled by the array of tools available to you. You can get access to analysis and visualization tools that can bolster your research and facilitate a deeper and better understanding of global problems.
You can also get access to the API, which can help you create the data visualizations you need, build live combinations with other data sources, and much more.
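As a rough illustration of what working with the API can look like, here is a minimal sketch using only the Python standard library. It assumes the v2 endpoint layout `country/{code}/indicator/{code}`, a JSON response shaped as a two-element list (paging metadata, then observation records), and annual data. The helper names are my own, and `SP.POP.TOTL` (total population) is used only as an example indicator.

```python
import json
import urllib.parse
import urllib.request

# Assumed base URL of the World Bank API (version 2).
BASE_URL = "https://api.worldbank.org/v2"


def build_indicator_url(country, indicator, per_page=100):
    """Build a query URL for one indicator in one country."""
    query = urllib.parse.urlencode({"format": "json", "per_page": per_page})
    return f"{BASE_URL}/country/{country}/indicator/{indicator}?{query}"


def parse_observations(payload):
    """Extract sorted (year, value) pairs, skipping missing values.

    Assumes the payload is a two-element list: paging metadata first,
    then a list of records that each carry "date" and "value" keys.
    """
    if not isinstance(payload, list) or len(payload) < 2 or payload[1] is None:
        return []
    pairs = [
        (int(record["date"]), record["value"])
        for record in payload[1]
        if record["value"] is not None
    ]
    return sorted(pairs)


def fetch_indicator(country, indicator):
    """Download and parse one series (requires network access)."""
    url = build_indicator_url(country, indicator)
    with urllib.request.urlopen(url, timeout=30) as response:
        return parse_observations(json.load(response))
```

Calling `fetch_indicator("BR", "SP.POP.TOTL")` would then return year and value pairs that can be handed to any plotting library or joined with other data sources.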
Therefore, it’s no surprise that World Bank Open Data tops any list of Open Data sources!
2. WHO (World Health Organization): Open Data Repository
WHO’s Open Data repository is how WHO keeps track of health-specific statistics of its 194 Member States.
The repository keeps the data systematically organized, and it can be accessed according to different needs. For instance, whether you are interested in mortality or the burden of disease, you can access data classified under 100 or more categories such as the Millennium Development Goals (child nutrition, child health, maternal and reproductive health, immunization, HIV/AIDS, tuberculosis, malaria, neglected diseases, water and sanitation), noncommunicable diseases and risk factors, epidemic-prone diseases, health systems, environmental health, violence and injuries, and equity.
For your specific needs, you can go through the datasets according to themes, category, indicator, and country.
The good thing is that you can download whatever data you need in Excel format. You can also monitor and analyze data using its data portal.
An API to the World Health Organization’s data and statistics content is also available.
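The article does not say which API this is. The sketch below assumes WHO's Global Health Observatory OData service: the base URL, the indicator code `WHOSIS_000001`, and the field names `SpatialDim`, `TimeDim`, and `NumericValue` are all assumptions to verify against WHO's current documentation, and the helper names are hypothetical.

```python
import json
import urllib.parse
import urllib.request

# Assumed base URL of WHO's GHO OData service; verify before relying on it.
GHO_BASE_URL = "https://ghoapi.azureedge.net/api"


def build_gho_url(indicator_code, country_iso3=None):
    """Build a URL for one indicator, optionally filtered to one country."""
    url = f"{GHO_BASE_URL}/{indicator_code}"
    if country_iso3:
        # OData-style filter on the (assumed) country dimension field.
        odata_filter = f"SpatialDim eq '{country_iso3}'"
        url += "?$filter=" + urllib.parse.quote(odata_filter)
    return url


def parse_gho_values(payload):
    """Extract (country, year, value) tuples from an OData-style payload.

    Assumes records sit under a top-level "value" key and use the field
    names SpatialDim, TimeDim, and NumericValue.
    """
    rows = []
    for item in payload.get("value", []):
        if item.get("NumericValue") is not None:
            rows.append(
                (item.get("SpatialDim"), item.get("TimeDim"), item["NumericValue"])
            )
    return rows


def fetch_gho(indicator_code, country_iso3=None):
    """Download and parse one indicator (requires network access)."""
    url = build_gho_url(indicator_code, country_iso3)
    with urllib.request.urlopen(url, timeout=30) as response:
        return parse_gho_values(json.load(response))
```

Keeping URL construction and parsing separate from the network call makes it easy to swap in different field names if the service turns out to use another schema.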
3. Google Public Data Explorer
Launched in 2010, Google Public Data Explorer can help you explore vast amounts of public-interest datasets.