diff options
| author | Terry Truong <terry06890@gmail.com> | 2022-08-30 12:27:42 +1000 |
|---|---|---|
| committer | Terry Truong <terry06890@gmail.com> | 2022-08-30 12:27:42 +1000 |
| commit | e8e58a3bb9dc233dacf573973457c5b48d369503 (patch) | |
| tree | 242500ca304c5afbb7e6506e61da4c4dfff0b175 /backend/tolData/otol | |
| parent | 930c12d33e1093f874a4beb4d6376621e464e8c0 (diff) | |
Add scripts for generating eol/enwiki mappings
- New data sources: OTOL taxonomy, EOL provider-ids, Wikidata dump
- Add 'node_iucn' table
- Remove 'redirected' field from 'wiki_ids' table
- Make 'eol_ids' table have 'name' as the primary key
- Combine name-generation scripts into genNameData.py
- Combine description-generation scripts into genDescData.py
Diffstat (limited to 'backend/tolData/otol')
| -rw-r--r-- | backend/tolData/otol/README.md | 19 |
1 files changed, 14 insertions, 5 deletions
diff --git a/backend/tolData/otol/README.md b/backend/tolData/otol/README.md index 4be2fd2..e018369 100644 --- a/backend/tolData/otol/README.md +++ b/backend/tolData/otol/README.md @@ -1,10 +1,19 @@ -Files -===== -- opentree13.4tree.tgz <br> +This directory holds files obtained via the +[Open Tree of Life](https://tree.opentreeoflife.org/about/open-tree-of-life). + +# Tree Data Files +- `opentree13.4tree.tgz` <br> Obtained from <https://tree.opentreeoflife.org/about/synthesis-release/v13.4>. Contains tree data from the [Open Tree of Life](https://tree.opentreeoflife.org/about/open-tree-of-life). -- labelled\_supertree\_ottnames.tre <br> +- `labelled_supertree_ottnames.tre` <br> Extracted from the .tgz file. Describes the structure of the tree. -- annotations.json +- `annotations.json` <br> Extracted from the .tgz file. Contains additional attributes of tree nodes. Used for finding out which nodes have 'phylogenetic support'. + +# Taxonomy Data Files +- `ott3.3.tgz` <br> + Obtained from <https://tree.opentreeoflife.org/about/taxonomy-version/ott3.3>. + Contains taxonomy data from the Open Tree of Life. +- `otol/taxonomy.tsv` <br> + Extracted from the .tgz file. Holds taxon IDs from sources like NCBI, used to map between datasets. |
