TY - JOUR
T1 - A data management workflow of biodiversity data from the field to data users
AU - Hackett, Rachel A.
AU - Belitz, Michael W.
AU - Gilbert, Edward E.
AU - Monfils, Anna K.
N1 - Funding Information:
The authors thank G. Nelson and M. Monfils for providing their expertise at multiple steps throughout the project, and Blake Cahill for reviewing the initial stages of this work. Support for these projects was provided by the Botanical Society of America, Central Michigan University, Hanes Trust Foundation, Michigan Garden Club Inc., NatureServe, and The Nature Conservancy. Additional funding was provided by the U.S. National Science Foundation (award DBI-1730526) to develop educational materials for biodiversity data literacy (https://www.biodiversityliteracy.com/poweshiek-skipperling). This is contribution 133 of the Central Michigan University Institute for Great Lakes Research.
Publisher Copyright:
© 2019 Hackett et al. Applications in Plant Sciences is published by Wiley Periodicals, Inc. on behalf of the Botanical Society of America
PY - 2019/12/1
Y1 - 2019/12/1
N2 - Premise: Heterogeneity of biodiversity data from the collections, research, and management communities presents challenges for data findability, accessibility, interoperability, and reusability. Workflows designed with data collection, standards, dissemination, and reuse in mind will generate better information across geopolitical, administrative, and institutional boundaries. Here, we present our data workflow as a case study of how we collected, shared, and used data from multiple sources. Methods: In 2012, we initiated the collection of biodiversity data relating to Michigan prairie fens, including data on plant communities and the federally endangered Poweshiek skipperling (Oarisma poweshiek). Results: Over 23,000 occurrence records were compiled in a database following Darwin Core standards. The records were linked with media and biological, chemical, and geometric measurements. We published the data as Global Biodiversity Information Facility data sets and in Symbiota SEINet portals. Discussion: We highlight data collection techniques that optimized transcription time, including the use of predetermined and controlled vocabulary, Darwin Core terms, and data dictionaries. The validity and longevity of our data were supported by voucher specimens, metadata with measurement records, and published manuscripts detailing methods and data sets. Key to our data dissemination was cooperation among partners and the utilization of dynamic tools. To increase data interoperability, we need flexible and customizable data collection templates, coding, and enhanced communication among communities using biodiversity data.
AB - Premise: Heterogeneity of biodiversity data from the collections, research, and management communities presents challenges for data findability, accessibility, interoperability, and reusability. Workflows designed with data collection, standards, dissemination, and reuse in mind will generate better information across geopolitical, administrative, and institutional boundaries. Here, we present our data workflow as a case study of how we collected, shared, and used data from multiple sources. Methods: In 2012, we initiated the collection of biodiversity data relating to Michigan prairie fens, including data on plant communities and the federally endangered Poweshiek skipperling (Oarisma poweshiek). Results: Over 23,000 occurrence records were compiled in a database following Darwin Core standards. The records were linked with media and biological, chemical, and geometric measurements. We published the data as Global Biodiversity Information Facility data sets and in Symbiota SEINet portals. Discussion: We highlight data collection techniques that optimized transcription time, including the use of predetermined and controlled vocabulary, Darwin Core terms, and data dictionaries. The validity and longevity of our data were supported by voucher specimens, metadata with measurement records, and published manuscripts detailing methods and data sets. Key to our data dissemination was cooperation among partners and the utilization of dynamic tools. To increase data interoperability, we need flexible and customizable data collection templates, coding, and enhanced communication among communities using biodiversity data.
M3 - Article
AN - SCOPUS:85077073474
VL - 7
JO - Applications in Plant Sciences
JF - Applications in Plant Sciences
SN - 2168-0450
IS - 12
ER -