American multinational technology company Google has announced a new search engine, Dataset Search, to help scientists and journalists find the datasets they need.
The launch of Dataset Search comes nearly 14 years after the launch of Google Scholar, the Mountain View-headquartered tech behemoth’s freely accessible web search engine for academic studies and reports.
Datasets are typically scattered across many places: individual research papers, private research organization websites, government agencies, data-driven publications, and many other primary and secondary sources.
With Dataset Search, the technology company intends to adapt the Google Scholar approach to help scientists, researchers, journalists, and analysts sift through this data from a single search bar.
Organizations that publish their data online, such as governments and universities, will need to include metadata tags in their web pages that describe their data, including who created it, how it was collected, when it was published, and so on.
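As a rough illustration of what such a metadata tag might look like, the Python sketch below wraps a dataset description in a JSON-LD script tag that a publisher could embed in a web page. The property names (`creator`, `measurementTechnique`, `datePublished`) come from the schema.org `Dataset` vocabulary; the helper name `dataset_script_tag` and all example values are hypothetical, and this is one possible way to produce the markup rather than a procedure Google prescribes.

```python
import json


def dataset_script_tag(metadata: dict) -> str:
    """Wrap a metadata record in a JSON-LD <script> tag (hypothetical helper)."""
    record = {"@context": "https://schema.org", "@type": "Dataset", **metadata}
    body = json.dumps(record, indent=2)
    # Escape "</" so a value containing "</script>" cannot close the tag early.
    # "\/" is a valid JSON escape for "/", so the record still parses the same.
    body = body.replace("</", "<\\/")
    return f'<script type="application/ld+json">\n{body}\n</script>'


# Illustrative values only; none of these describe a real dataset.
example_tag = dataset_script_tag(
    {
        "name": "Example city air quality readings",
        "description": "Hourly particulate readings from a hypothetical sensor network.",
        "creator": {"@type": "Organization", "name": "Example University"},
        "measurementTechnique": "Hourly sensor readings",
        "datePublished": "2018-09-05",
    }
)

if __name__ == "__main__":
    print(example_tag)
```

The resulting tag would sit in the HTML of the page that hosts the dataset, where a crawler can read it alongside the visible content.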
The data will then be indexed by the company’s search engine and combined with information from the Knowledge Graph.
The tech giant has outlined guidelines for dataset providers that will help the company, as well as others, identify their content and direct it to the scientists, researchers, and journalists who need it.
Google’s approach to the new search engine is based on an open standard from the collaborative data community Schema.org, and asks data providers to supply information on the creator of the dataset, the source of its collection, the date and time of its publication, and the terms and conditions under which the data can be used.
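A publisher could check its own records against these four items before publishing. The sketch below is a minimal example under stated assumptions: the mapping of the four items to the schema.org properties `creator`, `isBasedOn`, `datePublished`, and `license` is an interpretation made for illustration, not a list taken from Google's guidelines, and the function name `missing_fields` is hypothetical.

```python
from typing import Dict, List

# Assumed mapping from the four requested items to schema.org property names.
REQUESTED_FIELDS: Dict[str, str] = {
    "creator": "who created the dataset",
    "isBasedOn": "the source the data was collected from",
    "datePublished": "when the dataset was published",
    "license": "the terms under which the data can be used",
}


def missing_fields(record: dict) -> List[str]:
    """Return the requested property names that are absent or empty in a record."""
    return [name for name in REQUESTED_FIELDS if not record.get(name)]


# Illustrative record that supplies only two of the four items.
example_record = {
    "@type": "Dataset",
    "name": "Example city air quality readings",
    "creator": {"@type": "Organization", "name": "Example University"},
    "datePublished": "2018-09-05",
}

if __name__ == "__main__":
    for name in missing_fields(example_record):
        print(f"Missing {name}: {REQUESTED_FIELDS[name]}")
```

For the example record, the check reports `isBasedOn` and `license` as missing, which are the gaps a provider would fill in before listing the dataset.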
Google intends to bring together the tens of thousands of diverse dataset repositories online and make their information discoverable.
At present, dataset publication is highly fragmented, and various scientific organizations, government organizations, and local bodies maintain their own preferred repositories. Dataset Search is meant to give scientists, researchers, and journalists a single place to find the data and information they need.
The initial launch of Dataset Search will cover the environmental and social sciences, government data, and datasets from news organizations such as ProPublica. As the service gains popularity, the volume of data that Dataset Search indexes is expected to grow rapidly as organizations, researchers, scientists, and journalists join the race to make their data accessible.