Several government, collaborative, and research efforts are continuously going on to develop and maintain standard datasets for different subjects and domains inside subjects. These datasets are available for the public to download or to work offline, or they can also have the facility of online computations over these datasets. One such notable effort is named Open Science Data Cloud (OSDC), which has several datasets on each subject. This list, compiled from various open data sources, is available. They also host data on their web portal (https://www.opensciencedatacloud.org/publicdata/). A subject-wise list of selected datasets from OSDC is as follows:
Agriculture:
The U.S. Department of Agriculture's plants database
Biology:
1,000 genomes
Gene Expression Omnibus (GEO)
MIT cancer genomics data
Protein data bank
Climate/weather:
Australian weather
Canadian Meteorological Centre
Climate data from UEA (updated monthly)
Global climate data Since 1929
Complex networks:
CrossRef...