📜 ⬆️ ⬇️

Toolbox for Researchers - Second Edition: a selection of 15 thematic data banks

Data banks help to share the results of experiments and measurements, play an important role in shaping the academic environment and in the development process of specialists.

We will tell you about datasets obtained with the help of expensive equipment (sources of these data are often large international organizations and scientific programs, most often associated with the natural sciences), and about state data banks.



Photo by Jan Antonin Kolar - Unsplash
')
Data.gov.ru is a well-known public project in the field of open data. Its Moscow counterpart is Data.mos.ru. Of the foreign options worth noting Data.gov - a platform with open data from the US government (a single catalog with filters).

The University Information System is a MSU project that combines databases with statistical information about the social and economic situation in the country, as well as publications from government and scientific sources. The data are taken both from Rosstat and from studies conducted at the Moscow State University. The resource can be used without prior registration, but for full access you will need to submit an application.

Cartographic database of the All-Russian Geological Institute. Karpinsky. Information on the country's natural resources, collected during the existence of the institution, was marked on digital maps. The interface of the site allows you to compare OpenStreetMap or Ya.Map with a number of additional. layers with information about the magnetic field, minerals, etc.

GEOSS is a portal for searching Earth observation data from satellites and drones of various types. The resource archive is collected by 90 organizations around the world. To find information of interest, just select the desired area on the map or type keywords into the search.

MAST is an archive sponsored by NASA. The presented data is collected by orbital telescopes - you can study and download studies using a filter search .


Photo Max Bender - Unsplash

OpenEI is a platform for searching open data on energy use, in particular, on renewable energy resources and new technologies in the industry. The site is organized on the principle of wiki - the accuracy of the data is verified by the community .

Experimental Nuclear Reaction Data (EXFOR) is a library containing data of 22615 experiments with elementary particles. Complete with the CINDA (Computer Index of Nuclear Reaction Data) and IBANDL (Ion Beam Analysis Nuclear Data Library) databases is one of the largest data banks on nuclear physics. Supervised by the Brookhaven National Laboratory in the United States, but contains experiments from around the world - including Russia and China .

National Centers for Environmental Information - archive of environmental data. Here you get access to twenty petabytes of oceanic and geophysical data, as well as information about the atmosphere and coastal zones. In particular, there is information about the depth of the ocean, the surface of the sun, records of sedimentary rocks and satellite images. To find the desired dataset, you can use the catalog .

ADS is a repository for searching archaeological data managed by the University of York. There are old and new scientific publications, information about the excavations and artifacts. Three categories are offered for search: ArchSearch, Archives and Library. The first is stored data about the excavations and artifacts. In the second - an archive of all downloaded materials. In the third - publications from magazines, books and studies. There are search options by country, epoch, and object type.

DRYAD - this service helps to search for information for research on the data bank of 80 thousand files. Studies and articles from the bank can be used under license CC0 . Subject materials include different areas of knowledge, but most of the research is related to medicine and computer science. According to internal statistics , in 2018, users of the site were most interested in whale songs, temperature tolerance of marine life, and neural activity in the temporal lobe of the human brain.


In the laboratory " Advanced nanomaterials and optoelectronic devices " ITMO University

GenBank is a DNA library provided by the National Center for Biotechnology Information USA (NCBI), as well as data banks in Europe and Japan. Search by identifier is available in a special search engine, using the BLAST tool or programmatically .

PubChem is a database of compounds and bioassays that the US National Center for Biotechnology Information contains. There is a web interface with advanced search (an example about the side effects of water ). Data apply to public domain rights.

Protein Data Bank (RCSB PDB) is a bank of images of proteins and nucleic acids, the history of which dates back to 1971. Originally developed as an internal project of the Brookhaven National Laboratory, but later turned into the largest international database of its type. Most of the academic journals related to biochemistry, require authors to post on the site obtained in the course of research protein models.

InterPro - a database that combines many dataset various scientific projects. Includes SMART - a program for analyzing domains in protein sequences, based on machine learning technologies and a dataset of 1200 models. Supported by the European Bioinformatics Institute.



Photo excursions in the laboratories of the ITMO University:

Source: https://habr.com/ru/post/453408/


All Articles