Submission Instructions
Submitting to the Phenotype Library
Submission materials
The following materials are required to submit phenotype definitions to the library:
- Code list file
- A .csv or .xlsx file containing the clinical terms for each phenotype definition. A separate file is needed for each coding terminology (e.g. SNOMED, ICD-10).
- Files must be formatted with two columns: a ‘Code’ column containing the codes and a ‘Description’ column containing the clinical terms.
- Metadata: a description of the phenotyping definition, including:
- authors
- data sources used in its development
- associated publications
- any restrictions (eg. date or sex) on its use
An example of a submitted phenotype definition with metadata can be found here.
Submission methods
Phenotype definitions can be submitted using the Phenotype Library interface, or via the API.
Submission via the Phenotype Library interface
- Request a user account by filling in the form here.
- Login and access the creation interface here.
- Click Create in the BHF Data Science Centre Phenotype section
- Complete the submission form:
- Some fields are optional, but providing as much information/metadata as possible is useful
- Ensure the Collections field is populated with BHF Data Science Centre and Phenotype Library
- Code lists are uploaded in the Clinical Code List section by creating a new concept.
- Once you have finalised the form click Create Phenotype
- You will be able to view the phenotype definitions you have created in the My collection area of your account
- To publish the phenotype definition click into it from the My collection area and click Request publication in the top right hand corner. This allows users without a Phenotype Library account to view, interact with, or download (but not edit) the phenotype definition.
- The phenotype definition can be edited after it is published, but you will need to re-request its publication in order for the new version to be visible. Editing after publication will generate a new version ID.
More detailed documentation on using the Phenotype Library interface for can be found here.
Submission via the Phenotype Library API
The HDR UK Phenotype Library API documentation page can be found here, and its reference data can be found here. The Client packages are available in both Python and R.
To use the API, the metadata for each phenotype definition needs to be compiled into individual YAML files. Instructions for using the API and example files can be found here. Please include the BHF Data Science Centre and Phenotype Library collection IDs (20 and 18, respectively) in each YAML file.
We have additionally created scripts for formatting codelist files, automating the creation of YAML files, and batch uploading phenotypes. These may be particularly useful for projects with a large number of phenotypes.
Additional support
Additional documentation on the Phenotype Library can be found here. Any questions or problems that occur during the submission process should be directed to the Phenotype Library, via their contact page.