D-Lib MagazineJanuary/February 2014 Opening and Linking Agricultural Research Data
Esther Dzalé Yeumo Kaboré AbstractA Research Data Alliance (RDA) Interest Group has formed around a community of scientists and researchers who wish to make agricultural data, information, and knowledge more accessible. The objective of the Agricultural Data Interest Group is to get together active representatives of the major international institutions that work on agricultural research and innovation worldwide, in order to address issues related to data that are important to the development of global agriculture. The Interest Group is moving toward this goal by advancing the formation of a Wheat Data Interoperability Working Group that will address diverse data problems of 'wheat data' and, within a time period of 18 months, provide a framework for wheat data interoperability. The framework will foster the adoption of common standards and vocabularies for wheat data management, and facilitate access, discovery, reuse, and integration of that data. IntroductionAgricultural data tends to be complex owing to its origin from different purposes, task oriented systems, organizations and projects. One of the main topics for discussion is the infrastructural requirement for scientific agricultural data and its management in standard forms and methods. There are several worldwide data initiatives in the agricultural domain that need to be interoperable. For example, agricultural datasets may be information resources or numerical data regarding yield, market prices or other associated data such as soil analysis data. The challenge is being able to provide intuitive services to end users that span continents with widely varying requirements in terms of data-based services. There are various stakeholders, from data producers to consumers, and their needs have to be satisfied in terms of services and researched products. In recent years, efforts to make agricultural data, information and knowledge more accessible have increased. For more than 5 years now, Global Forum on Agricultural Research (GFAR), the Cooperative Group on International Agricultural Research (CGIAR) and numerous other partners have been promoting the Coherence in Information for Agricultural Research for Development (CIARD) movement to open up access to agricultural knowledge worldwide. Within the CIARD movement the Routemap to Information Nodes and Gateways (RING) was established, which already contains 986 data sources. In April 2013 the G8 group organized an event on open data in agriculture, with participation far broader than the G8 organizers. At that conference, Secretary Vilsack (Agriculture, US Government) said that "open data" means "available without restrictions" and "machine readable". Since then, responding to an initiative of the UK and US governments, another international alliance was formed under the title of "Global Open Data for Agriculture and Nutrition" (GODAN). RDA Interest Group on Agriculture DataAn Interest Group (IG) on agriculture data has been initiated under the Research Data Alliance (RDA). The Agricultural Data Interest Group will link up with the international undertakings described above and will try to integrate them into the interdisciplinary framework of RDA. The objective is to get an active representation of the major international institutions that work on agricultural research and innovation worldwide and address issues related to data that are important to the development of global agriculture. The group intends to bring together leaders and stakeholders in agricultural data initiatives to work towards feasible services in a technology-independent and standards-compliant platform for worldwide use by communities in agricultural sectors, and to represent all stakeholders producing, managing, aggregating, sharing and consuming data for agricultural research and innovation. The IG has already begun to gather key information needed to support the development of infrastructure within a proposed Wheat Data Interoperability Working Group that will address diverse data problems of 'wheat data' and, within a time period of 18 months, provide a framework for wheat data interoperability. As a first practical step, the IG is preparing a Case Statement for the Wheat Data Interoperability Working Group. Meetings to define the group have already been held. Wheat Data InteroperabilityInteroperability is a wide concept that encompasses the ability of organisations to work together towards mutually beneficial and commonly agreed goals. The Agricultural Data Interest Group is using the following definition from the European Interoperability Framework (EIF): 'An interoperability framework is an agreed approach to interoperability for organisations that wish to work together towards the joint delivery of public services. Within its scope of applicability, it specifies a set of common elements such as vocabulary, concepts, principles, policies, guidelines, recommendations, standards, specifications and practices.' The proposed Wheat Data Interoperability Working Group will aim to provide a common framework for describing, representing linking and publishing wheat data with respect to open standards. Such a framework will promote and sustain wheat data sharing, reusability and operability. Specifying the wheat linked data framework will require answering many questions, including which (minimal) metadata to describe which type of data; which vocabularies/ontologies/formats; and which good practices. Mainly based on the needs of the Wheat Initiative Information System (WheatIS) in terms of functionalities and data types, the Agricultural Research Interest Group and the proposed Working Group will identify relevant use cases in order to produce a "cookbook" on how to produce "wheat data" that are easily shareable, reusable and interoperable. To do so, the Working Group will:
Based on a survey report performed in June 2012, the Working Group will focus on the following data types, by order of priority: SNP, Genomic annotations, Phenotypes, Genetic Maps, Physical Maps, Germplasm. Implementing the framework will help cultivate a wheat ecosystem with people familiar with interoperability, organisations ready to collaborate, and common tools and services. The WheatIS (Wheat Information System of the Global Wheat Initiative) will be provided with a linked data framework based on community-accepted standards (an RDA Wheat Interoperability proposed WG deliverable), which will ensure data analysis and data integration facilities. Such a framework is a great asset for the WheatIS to provide the analysis functions and other services expected by the researchers. The cookbook will ensure that:
The cookbook might be adapted for other crops such as RICE, MAIZE which are also very important for food security. There is a variety of wheat data sources with various types of data, formats and structures. Integrating these data is an important part of current research, specifically when answering questions such as "What genes and traits are relevant for understanding the impact of climate change on wheat plant productivity?". The wheat data interoperability framework should have an impact and be helpful by a) fostering the adoption of common standards and vocabularies for wheat data management, and b) facilitating access, discovery, reuse and integration of wheat data. Additional Discussion Areas Within the Agricultural Data Interest GroupData Policies Proposal for a Germplasm Working Group Support Sustainability Data sources As a next important step, stakeholder involvement in the Agricultural Data IG should be enlarged. The goal is not to secure a comprehensive coverage of all institutes and organizations, but rather representative coverage of practitioners who will able to work with, and for, the community. About the Authors
|
||||||||||||
|