aboutsummaryrefslogtreecommitdiff
path: root/backend/tolData/otol/README.md
diff options
context:
space:
mode:
authorTerry Truong <terry06890@gmail.com>2022-08-30 12:27:42 +1000
committerTerry Truong <terry06890@gmail.com>2022-08-30 12:27:42 +1000
commite8e58a3bb9dc233dacf573973457c5b48d369503 (patch)
tree242500ca304c5afbb7e6506e61da4c4dfff0b175 /backend/tolData/otol/README.md
parent930c12d33e1093f874a4beb4d6376621e464e8c0 (diff)
Add scripts for generating eol/enwiki mappings
- New data sources: OTOL taxonomy, EOL provider-ids, Wikidata dump - Add 'node_iucn' table - Remove 'redirected' field from 'wiki_ids' table - Make 'eol_ids' table have 'name' as the primary key - Combine name-generation scripts into genNameData.py - Combine description-generation scripts into genDescData.py
Diffstat (limited to 'backend/tolData/otol/README.md')
-rw-r--r--backend/tolData/otol/README.md19
1 files changed, 14 insertions, 5 deletions
diff --git a/backend/tolData/otol/README.md b/backend/tolData/otol/README.md
index 4be2fd2..e018369 100644
--- a/backend/tolData/otol/README.md
+++ b/backend/tolData/otol/README.md
@@ -1,10 +1,19 @@
-Files
-=====
-- opentree13.4tree.tgz <br>
+This directory holds files obtained via the
+[Open Tree of Life](https://tree.opentreeoflife.org/about/open-tree-of-life).
+
+# Tree Data Files
+- `opentree13.4tree.tgz` <br>
Obtained from <https://tree.opentreeoflife.org/about/synthesis-release/v13.4>.
Contains tree data from the [Open Tree of Life](https://tree.opentreeoflife.org/about/open-tree-of-life).
-- labelled\_supertree\_ottnames.tre <br>
+- `labelled_supertree_ottnames.tre` <br>
Extracted from the .tgz file. Describes the structure of the tree.
-- annotations.json
+- `annotations.json` <br>
Extracted from the .tgz file. Contains additional attributes of tree
nodes. Used for finding out which nodes have 'phylogenetic support'.
+
+# Taxonomy Data Files
+- `ott3.3.tgz` <br>
+ Obtained from <https://tree.opentreeoflife.org/about/taxonomy-version/ott3.3>.
+ Contains taxonomy data from the Open Tree of Life.
+- `otol/taxonomy.tsv` <br>
+ Extracted from the .tgz file. Holds taxon IDs from sources like NCBI, used to map between datasets.