A Quantitative Comparison of XML Schemas for Taxonomic Publications

Authors

  • Guido Sautter
  • Klemens Böhm
  • Donat Agosti

DOI:

https://doi.org/10.17161/bi.v4i0.36

Keywords:

sysetmatics, taxonomy, xml schema, taxonx, quantiative analysis, heritage literature

Abstract

Large numbers of legacy taxonomic publications are currently being digitized to make them online available and ready for full text search. The documents are being marked up with XML for two purposes: To preserve the document structure, and to facilitate access via standard query languages like XQuery. With regard to the second aspect, the choice of an appropriate XML schema is crucial. It affects both query performance and the correctness of query results. Over the last few years, several different XML schemas have been proposed as markup standards for taxonomic publications. In this paper, we report on a thorough evaluation and com¬parison of these schemas. We have examined if they facilitate formulation and correct processing of queries that are common when it comes to taxonomic literature. We also compare the performance of these queries on documents that are marked up with the different schemas. Finally, we propose extensions to the schemas that enhance correctness of query results.

Metrics

File downloads
1,185
Jan 2008Jul 2008Jan 2009Jul 2009Jan 2010Jul 2010Jan 2011Jul 2011Jan 2012Jul 2012Jan 2013Jul 2013Jan 2014Jul 2014Jan 2015Jul 2015Jan 2016Jul 2016Jan 2017Jul 2017Jan 2018Jul 2018Jan 2019Jul 2019Jan 2020Jul 2020Jan 2021Jul 2021Jan 2022Jul 2022Jan 2023Jul 2023Jan 2024Jul 2024Jan 2025Jul 2025Jan 20269
|

Downloads

Author Biography

  • Donat Agosti
    http://antbase.org/agosticv_2003.html

Downloads

Published

2007-08-21

Issue

Section

Articles (peer-reviewed)

How to Cite

Sautter, Guido, Klemens Böhm, and Donat Agosti. 2007. “A Quantitative Comparison of XML Schemas for Taxonomic Publications”. Biodiversity Informatics 4 (August). https://doi.org/10.17161/bi.v4i0.36.