CETAF Stable Identifier Guide: Difference between revisions
| bwf>Anton Güntsch | bwf>Fabian Reimeier   (Restructured page and added more documentation) | ||
| Line 1: | Line 1: | ||
| = CETAF ISTC Stable Identifier Initiative = | = CETAF ISTC Stable Identifier Initiative = | ||
| CETAF ISTC Stable Identifiers are globally unique, consistent and reliable identifiers for specimens in natural and botanical collections. These identifiers are used in the world wide web to redirect users and systems to images, websites and metadata of the physical objects and to integrate them with the semantic web. | |||
| == How do CETAF ISTC Stable Identifier look like? == | |||
| [[File:Stable identifier example.png|thumb|Example for the syntax of a Stable Identifier.]] | |||
| The identifier system is based on HTTP-URIs and Linked Data principles. It is simple and future-proof.   | |||
| Each collection object as well as its associated information resources (e.g. multimedia, RDF, webpages) are identified by stable HTTP-URIs that will never change. The URI Syntax for the objects is chosen and maintained by the institution owning them. This flexibility is one of the main advantages of the CETAF Stable Identifier system as it allows e.g. to include branding and local scope identifiers into the Stable Identifier URI. There are however some [https://wiki.pro-ibiosphere.eu/wiki/Best_practices_for_stable_URIs best practices for stable URIs]. Examples are: | |||
| [http://herbarium.bgbm.org/object/B100277113 http://herbarium.bgbm.org/object/B100277113] | |||
| [http://www.botanicalcollections.be/specimen/BR000000551633 http://www.botanicalcollections.be/specimen/BR000000551633] | |||
| == How are CETAF ISTC Stable Identifiers resolved? == | |||
| A CETAF ISTC Stable Identifier allows the access of information about the corresponding collection object in various ways. | |||
| If a human user tries to access a collection object by typing it’s CETAF ISTC Stable Identifier into a web-browser, he will be redirected to a human-readable representation (e.g. html web-page) of it. | |||
| If a software-system tries to access the collection object via the same identifier, it will be redirected to a machine-processable RDF-encoded metadata record. The identifier is therefore integrated with the semantic web and can also be used in other RDF representations to link to the belonging collection object. | |||
| == What can CETAF ISTC Stable Identifiers be used for? == | |||
| As described above, CETAF ISTC Identifiers can first of all be used to redirect users and systems to images, websites and metadata of the physical objects they belong to. | |||
| They can also be used to precisely reference specimens needed in scientific studies and serve as basis for data retrieval, integration and reproducibility of data experiments. | |||
| Additionally, the stable identifiers enable new applications in the semantic web domain. An example for this is the Biology Pilot. The [https://www.bgbm.org/ Botanic Garden and Botanical Museum Berlin], [https://www.plantentuinmeise.be/en/home/ Meise Botanic Garden] and the [https://www.kew.org/ Royal Botanic Gardens Kew] annotated thousands of specimens with the [https://kiki.huh.harvard.edu/databases/botanist_index.html HUH] and [https://www.wikidata.org/wiki/Wikidata:Main_Page WikiData] IDs of their collectors. The stable identifiers of the annotated specimens are available on [https://www.gbif.org/ GBIF] and a server is crawling the identifiers to organize the RDF information in a Blaze Graph. This graph enables us to search for specimen by their collector HUH or WikiData ID, which is invariant to the different abbreviations the institutions may be using. The query will return all relevant specimens available in the joined set of specimens regardless of their origin institution. If the number of institutions using stable identifiers grows and the amount of machine readable annotations increases, this technology could be used to basically create a “google for specimen”. | |||
| == How can I implement CETAF ISTC Stable Identifiers for my collection? == | |||
| The CETAF ISTC Stable Identifiers can be implemented in three levels. They are described in detail [http://herbal.rbge.info/md.php?q=documentation here]. Following conditions have to be met to reach the corresponding implementation levels: | |||
| '''Level 1''' | |||
| * you assigned a stable URI to each object of your collection, which will be never changed and preferably follows the [https://wiki.pro-ibiosphere.eu/wiki/Best_practices_for_stable_URIs best practices for stable URIs] | |||
| * there exists a human-readable representation (web-page) for each of your collection objects | |||
| * a user trying to access a collection object by typing the stable URI of it into a web-browser will be redirected to the human-readable representation (web-page) of the object (you can test this by using the [http://herbal.rbge.info/search.php CETAF URI Tester]) | |||
| '''Level 2''' | |||
| * you reached level 1 | |||
| * there exists a machine-readable RDF metadata record for each of your collection objects | |||
| * a machine trying to access a collection object via its identifier with  'application/rdf+xml' header will be redirected to the objects machine-readable RDF metadata record (you can test this by using the [http://herbal.rbge.info/search.php CETAF URI Tester]) | |||
| '''Level 3''' | |||
| * you reached level 2 | |||
| * the machine-readable RDF metadata record of each of your collection objects encodes application specific data (e.g. is compliant to the [[CSPP|CETAF Specimen Preview Profile (CSPP)]]) | |||
| == Useful Links == | |||
| * [http://wiki.pro-ibiosphere.eu/wiki/Best_practices_for_stable_URIs Best practices for stable URIs] | |||
| * [[CSPP|CETAF Specimen Preview Profile (CSPP)]] | |||
| * [https://sourceforge.net/projects/stablecollectionidentifiers/ Source code and example documents] | |||
| * [http://herbal.rbge.info/ CETAF URI Tester] | |||
| == Further reading == | |||
| * [http://www.pro-ibiosphere.eu/news/4296_stable_identifiers_for_specimens_-_a_cetaf_istc_initiative_supported_by_pro-ibiosphere/ Stable identifiers for specimens – A CETAF ISTC initiative supported by pro-iBiosphere] | |||
| * [https://academic.oup.com/database/article/3053443/Actionable, Paper in Database (Oxford) 2017 (1): Actionable, long-term stable and semantic web compatible identifiers for access to biological collection objects] | |||
| * [http://www.nature.com/nature/journal/v546/n7656/full/546033d.html?foxtrotcallback=true Paper in Nature 546, 33 (01 June 2017): Data management: Stable identifiers for collection specimens] | |||
| * [http://www.cetaf.org/sites/default/files/cetaf-istc_stable_identifiers_poster50x70.pdf Poster: CETAF stable identifiers for specimens] | |||
| == Meetings == | == Meetings == | ||
| Line 9: | Line 56: | ||
| * [http://cetafidentifiers.biowikifarm.net/wiki/IDs_and_LOD_Discussion (Virtual) LOD Hackathon (October 2017)] | * [http://cetafidentifiers.biowikifarm.net/wiki/IDs_and_LOD_Discussion (Virtual) LOD Hackathon (October 2017)] | ||
| * [http://cetafdigitization.biowikifarm.net/cdig/ISTC_DWG_Meeting_Spring_2018_Copenhagen Joint CETAF-ISTC / CETAF-DWG meeting (February 2018) ] | * [http://cetafdigitization.biowikifarm.net/cdig/ISTC_DWG_Meeting_Spring_2018_Copenhagen Joint CETAF-ISTC / CETAF-DWG meeting (February 2018) ] | ||
| * [https://itsworldcongress.com/programme/topics/ Quality of Life 25th ITS World Congress (September 2018)]   | |||
| * [https://cetafdigitization.biowikifarm.net/cdig/ISTC_DWG_Meeting_Spring_2019_Vienna Joint CETAF-ISTC / CETAF-DWG meeting (February 2019)] | |||
| * [https:// | |||
| * [ | |||
Revision as of 15:26, 10 April 2019
CETAF ISTC Stable Identifier Initiative
CETAF ISTC Stable Identifiers are globally unique, consistent and reliable identifiers for specimens in natural and botanical collections. These identifiers are used in the world wide web to redirect users and systems to images, websites and metadata of the physical objects and to integrate them with the semantic web.
How do CETAF ISTC Stable Identifier look like?
The identifier system is based on HTTP-URIs and Linked Data principles. It is simple and future-proof. Each collection object as well as its associated information resources (e.g. multimedia, RDF, webpages) are identified by stable HTTP-URIs that will never change. The URI Syntax for the objects is chosen and maintained by the institution owning them. This flexibility is one of the main advantages of the CETAF Stable Identifier system as it allows e.g. to include branding and local scope identifiers into the Stable Identifier URI. There are however some best practices for stable URIs. Examples are:
http://herbarium.bgbm.org/object/B100277113
http://www.botanicalcollections.be/specimen/BR000000551633
How are CETAF ISTC Stable Identifiers resolved?
A CETAF ISTC Stable Identifier allows the access of information about the corresponding collection object in various ways. If a human user tries to access a collection object by typing it’s CETAF ISTC Stable Identifier into a web-browser, he will be redirected to a human-readable representation (e.g. html web-page) of it. If a software-system tries to access the collection object via the same identifier, it will be redirected to a machine-processable RDF-encoded metadata record. The identifier is therefore integrated with the semantic web and can also be used in other RDF representations to link to the belonging collection object.
What can CETAF ISTC Stable Identifiers be used for?
As described above, CETAF ISTC Identifiers can first of all be used to redirect users and systems to images, websites and metadata of the physical objects they belong to. They can also be used to precisely reference specimens needed in scientific studies and serve as basis for data retrieval, integration and reproducibility of data experiments. Additionally, the stable identifiers enable new applications in the semantic web domain. An example for this is the Biology Pilot. The Botanic Garden and Botanical Museum Berlin, Meise Botanic Garden and the Royal Botanic Gardens Kew annotated thousands of specimens with the HUH and WikiData IDs of their collectors. The stable identifiers of the annotated specimens are available on GBIF and a server is crawling the identifiers to organize the RDF information in a Blaze Graph. This graph enables us to search for specimen by their collector HUH or WikiData ID, which is invariant to the different abbreviations the institutions may be using. The query will return all relevant specimens available in the joined set of specimens regardless of their origin institution. If the number of institutions using stable identifiers grows and the amount of machine readable annotations increases, this technology could be used to basically create a “google for specimen”.
How can I implement CETAF ISTC Stable Identifiers for my collection?
The CETAF ISTC Stable Identifiers can be implemented in three levels. They are described in detail here. Following conditions have to be met to reach the corresponding implementation levels:
Level 1
- you assigned a stable URI to each object of your collection, which will be never changed and preferably follows the best practices for stable URIs
- there exists a human-readable representation (web-page) for each of your collection objects
- a user trying to access a collection object by typing the stable URI of it into a web-browser will be redirected to the human-readable representation (web-page) of the object (you can test this by using the CETAF URI Tester)
Level 2
- you reached level 1
- there exists a machine-readable RDF metadata record for each of your collection objects
- a machine trying to access a collection object via its identifier with 'application/rdf+xml' header will be redirected to the objects machine-readable RDF metadata record (you can test this by using the CETAF URI Tester)
Level 3
- you reached level 2
- the machine-readable RDF metadata record of each of your collection objects encodes application specific data (e.g. is compliant to the CETAF Specimen Preview Profile (CSPP))
Useful Links
- Best practices for stable URIs
- CETAF Specimen Preview Profile (CSPP)
- Source code and example documents
- CETAF URI Tester
Further reading
- Stable identifiers for specimens – A CETAF ISTC initiative supported by pro-iBiosphere
- Paper in Database (Oxford) 2017 (1): Actionable, long-term stable and semantic web compatible identifiers for access to biological collection objects
- Paper in Nature 546, 33 (01 June 2017): Data management: Stable identifiers for collection specimens
- Poster: CETAF stable identifiers for specimens
Meetings
- Edinburgh Meeting (June 2013)
- Joint ISTC/pro-iBiosphere workshop Berlin (October 2013)
- Geneva Meeting (October 2015)
- Joint CETAF-ISTC / CETAF-DWG meeting (May 2016)
- Joint CETAF-ISTC / CETAF-DWG meeting (March 2017)
- (Virtual) LOD Hackathon (October 2017)
- Joint CETAF-ISTC / CETAF-DWG meeting (February 2018)
- Quality of Life 25th ITS World Congress (September 2018)
- Joint CETAF-ISTC / CETAF-DWG meeting (February 2019)



