|Title||Accessing Digital Collections Data Sources for Research: A Tour of iDigBio Data Services |
|Publication Type||Conference Paper |
|Year of Publication||2016 |
|Authors||Collins, M |
|Editor||Paul, D |
|Conference Name||Island Biology 2016, II International Conference on Island Evolution, Ecology, and Conservation |
|Date Published||07/2016 |
|Publisher||Island Biology 2016, II International Conference on Island Evolution, Ecology, and Conservation |
|Conference Location||University of the Azores at Angra do Heroísmo, Terceira Island, Azores, Portugal |
|Keywords||biodiversity, GUODA, iDigBio, Spark |
|Abstract||Digitized collection objects are an important resource for research analyses such as ecological niche modeling, morphological studies, and data gaps. National and international initiatives have resulted in an enormous number of data records, hundreds of millions, becoming aggregated and made available for study in the last few years. Aggregators like the Global Biodiversity Information Facility (GBIF) internationally, and the Integrated Digitized Biocollections (iDigBio) in the United States, provide several interfaces to their data sets: web based portals, application programming interfaces (APIs), and data processing services. This talk will provide an overview of what kinds of use cases these interfaces can serve, the trade-offs they make, and how to get started using them.
As a concrete example, the iDigBio portal’s features will be demonstrated. The programming API will be described and examples of its use from the R programming language will be shown. iDigBio’s data processing service, Global Unified Open Data Access (GUODA), which provides a high-performance method for analyzing entire biodiversity data sets, will be described as well.