Data repositories
· AWS (Amazon Web Services) Public Data Sets, provides a centralized repository of public data sets that can be seamlessly integrated into AWS cloud-based applications.
· BigML big list of public data sources.
· Bioassay data, described in Virtual screening of bioassay data, by Amanda Schierz, J. of Cheminformatics, with 21 Bioassay datasets (Active / Inactive compounds) available for download.
· Bitly 1.usa.gov data, anonymized clicks on gov links.
· Canada Open Data, pilot project with many government and geospatial datasets.
· Causality Workbench data repository.
· Corral Big Data repository at Texas Advanced Computing Center, supporting data-centric science.
· Data Source Handbook, A Guide to Public Data, by Pete Warden, O'Reilly (Jan 2011).
· Datacatalogs.org, open government data from US, EU, Canada, CKAN, and more.
· Data.gov.uk, publicly available data from UK (also London datastore.)
· Data.gov/Education, central guide for education data resources including high-value data sets, data visualization tools, resources for the classroom, applications created from open data and more.
· DataMarket, visua