The Species 2000 & ITIS Catalogue of Life

Frank Bisby "The Catalogue of Life is the most comprehensive and authoritative global index of species currently available. It consists of a single integrated species checklist and taxonomic hierarchy."
signature
Frank Bisby (1945—2011) was leader of the Catalogue of Life project from its very beginning. His inspirational vision and spirit remains with all of us.

The Species 2000 & ITIS Catalogue of Life is planned to become a comprehensive catalogue of all known species of organisms on Earth. Rapid progress has been made recently and this Catalogue of Life edition contains 1,352,112 species. Please note that this is probably just slightly over 2/3 of the world's known species. This means that for many groups it continues to be deficient, and users will notice that many species are still missing from the Catalogue.

The present Catalogue is compiled with sectors provided by 132 taxonomic databases from around the world. Many of these contain taxonomic data and opinions from extensive networks of specialists, so that the complete work contains contributions from more than 3,000 specialists from throughout the taxonomic profession. Species 2000 and ITIS teams peer review databases, select appropriate sectors and integrate the sectors into a single coherent catalogue with a single hierarchical classification. It is planned to introduce alternative taxonomic treatments and alternative classifications, but an important feature is that for those users who wish to use it, a single preferred catalogue, based on peer reviews, will continue to be provided.

As a new development started in 2010, there has been a change in data assembly procedure and in published editions of the Catalogue. This is part of a process in the EC 4D4Life and i4Life projects to bring the Annual Checklist and Dynamic Checklist into unified workflow – The Catalogue of Life. The CoL Workbench tool has been used for data harvesting and final processing of the Catalogue of Life editions since September 2011.

• The Catalogue of Life (monthly updated editions around a year)
These are progressively enhanced editions in what becomes a dynamically developing system, made available online with additional web-services. We switched to monthly editions in February 2012. At all times the latest release is available for online access at www.catalogueoflife.org/col.

• Annual Checklist of the Catalogue of Life
Published each April as fixed online edition with web-services, and on DVD. At all times the latest Annual Checklist is online at www.catalogueoflife.org/annual-checklist. A fixed edition that is published, archived and deposited in public libraries, that can be cited, and that can be used as a common catalogue for comparative purposes by many organisations. A copy is on this DVD, which is distributed free of charge, and identical copies are viewable and downloadable on the website. The archive edition for 2013 is at www.catalogueoflife.org/annual-checklist/2013. Archived editions can be found by changing the year in the URL (e.g. /2011, /2010 etc.), and downloadable versions and web-services can be found at www.catalogueoflife.org/services.

The Catalogue of Life partnership

In June 2001 the Species 2000 and ITIS organisations, that had previously worked separately, decided to work together to create the Catalogue of Life, now estimated at 1.9 million species (Chapman, 2009). The two organisations remain separate and different in structure. However, by working together in creating a common product, the partnership has enabled them to reduce duplication of effort, make better use of resources, and to accelerate production. The combined Annual Checklist has become well established as a cited reference used for data compilation and comparison. For instance, it is used as the principal taxonomic index in the GBIF and EoL data portals and recognised by the CBD.

The scientific policy for the Catalogue of Life programme is developed by the CoL Global Team: David Eades (Convenor, USA), Tom Orrell (USA), Richard Pyle (USA), Mike Ruggiero (USA), Richard White (UK), Jeya Kathirithamby (UK), Alastair Culham (UK), Jerry Cooper (New Zealand), Mark Costello (New Zealand), Nicolas Bailly (Philippines), Thierry Bourgoin (France), Christina Flann (The Netherlands), Li-Qiang Ji (China), Patricia Mergen (Belgium), Heimo Rainer (Austria), Tony Rees (Australia), with further assistance from the Regional Hubs.

About ITIS

The Integrated Taxonomic Information System (ITIS) is a partnership of federal agencies and other organisations from the United States, Canada, and Mexico, with data stewards and experts from around the world (see www.itis.gov). The ITIS database is an automated reference of scientific and common names of biota of interest to North America. ITIS contains more than 727,000 scientific and common names in all kingdoms, is accessible via the World Wide Web in English, French, Spanish, and Portuguese (itis.gbif.net) and is an associate member of GBIF.

ITIS is managed by Gerald Guala (Director), Thomas Orrell (Deputy Director), Michael Ruggiero (Senior Scientific Advisor), David Nicolson (Data Development Leader), Mike Frame (Information Technology Leader), Roy McDiarmid (Taxonomy Leader), James Macklin (ITIS-Canada Director), and Patricia Koleff (SIIT-Mexico Director). They are advised and supported by the ITIS Steering Committee and ITIS Data Stewards.

About Species 2000

Species 2000 (www.sp2000.org) is an autonomous federation of taxonomic database custodians, involving taxonomists throughout the world. The goal is to collate a uniform and validated index to the world's known species. Species 2000 is registered as a not-for-profit company limited by guarantee (registered in England No. 3479405) with six directors, and taxonomic database and relevant software organisations from around the world as members. It started from a TDWG Task Group; sponsored by CODATA, IUBS and IUMS; is an associate participant in GBIF, a data provider to EC LifeWatch; and is recognised by UNEP and the CBD. Its Phase II Programme includes EC 4D4Life and i4Life e-infrastructure projects. 4D4Life includes the establishment of a Global Multi-Hub Network with Regional Centres in China, New Zealand, Australia, Brazil and N. America. i4Life includes pipelines for data exchange and checklist harmonisation with GBIF, EBI/NSDC, Barcode of Life, IUCN Red List and EoL.

Species 2000 is governed by an elected Board of Directors that deals with legal and financial matters, and that is advised by the CoL Global Team on scientific policy. The Directors are presently: Peter Schalk (Chair, The Netherlands), Alex Gray (UK), David Eades (Convenor of the CoL Global Team, USA), Guy Baillargeon (Canada), Vanderlei Canhos (Brazil) and Keping Ma (China).

Structure of the Catalogue of Life

The goal is to list every distinct species in each group of organisms. At present, some groups are globally complete, some are represented by global sectors that are nearing completion, and others are represented by partial sectors. The global sectors, whether complete or not, are provided by selected, peer reviewed global species databases (GSDs - see definition below) in the Species 2000 federation or by equivalent global sectors of ITIS.

The partial sectors in the Catalogue of Life are supplied by regional species databases as ITIS Regional (North America), NZIB (New Zealand), Catalogue of Life China, Australian Faunal Directory, and by thematic databases as WoRMS (marine species) and FADA (freshwater species). Sometime, regional and thematic checklists are the only available resources for these incomplete groups at the time. Regional and thematic checklists have been assembled in CoL proto-GSDs, a new development of this year cycle. Proto-GSD is a taxonomic database contains a draft species checklist, which is built from regional, thematic and nomenclatural databases, using semi-automatic procedures for merging species lists under single management classification, without detailed scrutiny by taxon specialists. 2013 Annual Checklist contains experimental proto-GSDs taken in the Catalogue of Life to fill major gaps in the phyla Mollusca, Platyhelminthes, Myxosoa, in the class Clitellata (Annelida) and in some flowering plant families. Proto-GSD is temporarily solution to fill gap areas.

Each species in the Catalogue of Life is listed with an accepted scientific name, a cited reference and its position in the hierarchical classification. Additional common names and synonyms may be provided, but these data are not complete, and for some species none may exist. The complete list of fields (known as the "Catalogue of Life Standard Dataset") is given below:

More detailed information about the Standard Dataset is available on the Catalogue of Life website (www.catalogueoflife.org)

Each species is linked via the genus and family to the taxonomic classification. Above the node of attachment of each data sector this classification has been agreed by Species 2000 and ITIS as a practical management tool to provide access to the Catalogue of Life. The top levels of this management classification were set in 2005, as the CoL Taxonomic Classification Edition 1. A second edition has now been prepared by the CoL Hierarchy Panel, and implemented as Part A for the Animalia and Fungi, Part B for Plantae and Part C still to follow.

The CoL Hierarchy Panel is composed of experts who make recommendations on the higher level classification, led by Michael Ruggiero, Dennis Gordon, Nicolas Bailly and David Nicolson. Please read the draft discussion document "Towards a management hierarchy (classification) for the Catalogue of Life" by Dennis Gordon, appended to this edition of the Annual Checklist (www.catalogueoflife.org/annual-checklist/2013/info/hierarchy).

The CoL Taxonomic Classification Edition 2 Part A re-arranged the Fungi following the 10th edition of the Dictionary of the Fungi (Kirk, Cannon, Minter & Stalper, 2008 - CABI). We are grateful to Paul Kirk for his work with the CoL Hierarchy Panel.

The CoL Taxonomic Classification Edition 2 Part A re-arranged the phyla of Animalia in a new system, following the recommendations of the CoL Hierarchy Panel, with a detailed arrangement carried out by Michael Ruggiero, Dennis Gordon, Nicolas Bailly and with technical assistance from David Nicolson.

The CoL Taxonomic Classification Edition 2 Part B re-arranged the phyla, classes and orders of Plantae in a new system, following the implementations in ITIS (with one exception for traditional monocot class Liliopsida, which is kept in the CoL for management convenience).

Where available from the suppliers, infraspecific taxa such as subspecies and varieties have also been included but this coverage is variable between taxonomic sectors.

Where possible, a web link back to the supplier's own database is provided at the bottom of each species detail page.

CoL Taxon Identifiers (Taxon LSIDs)

The Catalogue of Life programme started to issue permanent CoL Taxon Identifiers in the 2008 Annual Checklist. Every taxon recognised in the Catalogue, including species and higher taxa, is given a globally unique identifier (GUID) using the Life Science Identifier (LSID) system (http://sourceforge.net/projects/lsids).

Where a taxon record remains unchanged in this 2013 Annual Checklist edition, it carries the same LSID as in 2008, except that the revision part changes from AC2009 to AC2012. New or changed taxa in the 2013 Annual Checklist edition receive completely new LSIDs. This will enable the introduction in the next few years of change tracking between editions, so that relationships to taxa in previous editions are recorded, and web-services to alert users to changes in name, classification, circumscription or data. Technical users are already able to inspect changes in the metadata provided by the CoL LSID Resolution Service, and to build their own services using them. For more information, see http://www.catalogueoflife.org/lsid. LSIDs in the 2013 Annual Checklist Annual Checklist have been generated by Richard White, Cardiff University, UK.

Our LSIDs are long strings of symbols intended only for computer usage. However, the user interface does show LSID buttons (click, and the LSID is displayed in full), and the LSID is shown at the base of each Species Details screen. This is done so that a user can copy and paste the LSIDs into other software or services: it is NOT intended for them to be read or typed.

Functionality of the Catalogue of Life

• Species (and infraspecific taxa) can be located either by searching by name or by tracking down through the hierarchical classification.
• Searching by name can be done using accepted scientific name, synonym or common name. Automatic synonymic and common name indexing takes the user directly to the species under its accepted name. The search can use part names, or be restricted to complete words.
• Tracking down the tree or classification uses accepted names for taxa.
• On each species details page the relevant higher taxa are listed, and provide a link to the relevant node of the hierarchical classification.
• The species details pages link to the source database, usually showing further information.
• A full species list for each higher taxon in the classification can be accessed via option Browse Taxonomic Classification

During the 4D4Life project (2009-2012), functionality of the Catalogue of Life interface has been enhanced (interface version 1.8):

• Number of species for each taxon in the Catalogue can be found in the Taxonomic Tree. We are planning to add species estimation figures and display completeness of the checklists by taxa in next editions of the Catalogue.
• Names of supplier databases are visible with each taxon in the Taxonomic Tree.
• Thumbnail images are shown with kingdoms and phyla in the Tree.
• The operational menu of the Catalogue now can be shown in different languages. The list of available languages will be expanded in next editions.
• A standard bibliographic citation for each contributing database can be now copied and passed from How To Cite page

The DVD contains the Annual Checklist dataset and the software identical to that used on the Web. The structure of the Annual Checklist database has been optimised for performance with the user interface but is not ideal for importing to other systems. The content may be copied subject to the copyright conditions given on the inside cover of this booklet.