Approaches to estimating the universe of natural history collections data
DOI:
https://doi.org/10.17161/bi.v7i2.3991Palabras clave:
Natural history collections, size, estimates, primary biodiversity dataResumen
This contribution explores the problem of recognizing and measuring the universe of specimen-level data existing in Natural History Collections around the world, in absence of a complete, world-wide census or register. Estimates of size seem necessary to plan for resource allocation for digitization or data capture, and may help represent how many vouchered primary biodiversity data (in terms of collections, specimens or curatorial units) might remain to be mobilized. Three general approaches are proposed for further development, and initial estimates are given. Probabilistic models involve crossing data from a set of biodiversity datasets, finding commonalities and estimating the likelihood of totally obscure data from the fraction of known data missing from specific datasets in the set. Distribution models aim to find the underlying distribution of collections’ compositions, figuring out the occult sector of the distributions. Finally, case studies seek to compare digitized data from collections known to the world to the amount of data known to exist in the collection but not generally available or not digitized. Preliminary estimates range from 1.2 to 2.1 gigaunits, of which a mere 3% at most is currently web-accessible through GBIF’s mobilization efforts. However, further data and analyses, along with other approaches relying more heavily on surveys, might change the picture and possibly help narrow the estimate. In particular, unknown collections not having emerged through literature are the major source of uncertainty.Métricas
File downloads
3,074
Crossref
3
DataCite
1
Descargas
Descargas
Archivos adicionales
Publicado
2010-10-09
Número
Sección
Articles (peer-reviewed)
Licencia
Copyright for articles published in this journal is retained by the authors, with first publication rights granted to the journal. All articles are licensed under a Creative Commons Attribution Non-Commercial license.
Competing Interests: The authors have declared that no competing interests exist.
Cómo citar
Ariño, Arturo H. 2010. “Approaches to Estimating the Universe of Natural History Collections Data”. Biodiversity Informatics 7 (2). https://doi.org/10.17161/bi.v7i2.3991.