Training Workshop on Biodiversity Data Quality, Cleaning, Georeferencing and Delivery 7 - 11, July, 2008

Organiza

Instituto Nacional de Biodiversidad.

 

The objective of this workshop is to contribute to regional capacity building in the use of best practices and information technologies for the administration and dissemination of biodiversity information.

The workshop comprises the following four modules:

  • Data Quality
  • Data Cleaning
  • Georeferencing
  • Data Integration and Delivery

1. Data Quality  (4 hours)

This module is based on the guidelines presented by Dr. Arthur Chapman in the document published by GBIF “GBIF Training Manual 1: Digitisation of Natural History Collection” and INBio’s experience in the execution of the National Biodiversity Inventory and others projects that have had a strong component of data management. It covers the following main topics:

  • Principles of data quality
  • Quality of different types of biodiversity data (taxonomic and nomenclatural, spatial, specimen and observation, species, among others).
  • Protocols for managing biological collections, Information systems and data quality.
  • Documenting data.

2. Data cleaning (1.5 day)

This module is based on the guidelines presented by Dr. Arthur Chapman, in the document published by GBIF “GBIF Training Manual 1: Digitisation of Natural History Collection”.  INBio has applied those and other principles in the cleaning of its own data and has also assisted and advised other institutions in the cleaning of theirs. It covers the following main topics:

  • Principles of data cleaning
  • Methods of data cleaning
  • Taxonomic and nomenclatural data
    • Scientific names
    • Common names
  • Time data
  • Descriptive data
  • Person names
    • Collectors
    • Determinators
    • Authors
  • Types of errors
  • Software resources
  • On-line resources

 

3. Georeferencing  (2 days)

This module is based on the methodology developed by University of Berkeley and CONABIO (Comisión Nacional de Biodiversidad de México).  INBio has used this methodology in the past to georeference large sets of data with missing, incorrect or inaccurate coordinates. It covers the following main topics:

  • Point-radius method
  • Cartographic variables for the assessment of uncertainty
  • Uncertainty assessment
  • Locality description
  • Information normalization
  • Cartography and Geographic Information Systems
  • Coordinates assignation
  • Geographic information validation

 

4. Data Integration and Sharing  (1 day)

This module is based on the How to become a GBIF data provider? workshop that GBIF has helped to design and implement worldwide.  It has been taught several times by INBio’s staff.  It covers the following main topics:

  • Main initiatives developing standards and tools (TDWG, GBIF, IABIN, among others)
  • GBIF network architecture
  • DiGIR and TAPIR protocols overview
  • How to set a data provider for GBIF and other networks?
    • Data provider packages download
    • DiGIR Installation
    • TapirLink installation
    • Registration of the new providers

For more information contact to:

Manuel Vargas
Informatics Development, INBio
Tel. (506) 2507-8277  Fax. (506) 2507-8274
Correo electrónico: mvargas@inbio.ac.cr
Página web: http://www.inbio.ac.cr


 
Home | INBio | Biodiversity | Services | INBioparque | INBio Editorial | Site map | Web Survey

Copyright © 2008 National Biodiversity Institute.