TaxonGrab: Extracting Taxonomic Names From Text

Drew Koning; Indra Neil Sarkar; Thomas Moritz

doi:10.17161/bi.v2i0.17

TaxonGrab: Extracting Taxonomic Names From Text

Autores/as

Drew Koning American Museum of Natural History
Indra Neil Sarkar American Museum of Natural History
Thomas Moritz American Museum of Natural History

DOI:

https://doi.org/10.17161/bi.v2i0.17

Palabras clave:

Named Entity Recognition, Taxonomic Name Extraction

Resumen

Identification of organism names in biological texts is essential for the management of archival resources to facilitate comparative biological investigation. Because organism nomenclature conforms closely to prescribed rules, automated techniques may be useful for identifying organism names from existing documents, and may also support the completion of comprehensive indices of taxonomic names; such comprehensive lists are not yet available. Using a combination of contextual rules and a language lexicon, we have developed a set of simple computational techniques for extracting taxonomic names from biological text. Our proposed method consistently performs at greater than 96% Precision and 94% Recall, and at a much higher speed than manual extraction techniques. An implementation of the described method is available as a Web based tool written in PHP. Additionally, the PHP source code is available from SourceForge: http://sourceforge.net/projects/taxongrab, and the project website is http://research.amnh.org/informatics/taxlit/apps/.

Descargas

Los datos de descarga aún no están disponibles.

Descargas

PDF (Inglés)

Publicado

2005-11-16

Número

Vol. 2 (2005)

Sección

Articles (peer-reviewed)

Licencia

Copyright for articles published in this journal is retained by the authors, with first publication rights granted to the journal. All articles are licensed under a Creative Commons Attribution Non-Commercial license.

Competing Interests: The authors have declared that no competing interests exist.

Cómo citar

Koning, Drew, Indra Neil Sarkar, and Thomas Moritz. 2005. “TaxonGrab: Extracting Taxonomic Names From Text”. Biodiversity Informatics 2 (November). https://doi.org/10.17161/bi.v2i0.17.

Descargar cita

TaxonGrab: Extracting Taxonomic Names From Text

Autores/as

DOI:

Palabras clave:

Resumen

Descargas

Descargas

Publicado

Número

Sección

Licencia

Cómo citar

Enviar un artículo

Información

Desarrollado por

Idioma