From 94a8ad9b067e5a2c442ce47ce72d1a53eb444160 Mon Sep 17 00:00:00 2001 From: Terry Truong Date: Mon, 23 Jan 2023 18:00:43 +1100 Subject: Clean up some docs and naming inconsistencies --- backend/tol_data/README.md | 6 ++---- 1 file changed, 2 insertions(+), 4 deletions(-) (limited to 'backend/tol_data/README.md') diff --git a/backend/tol_data/README.md b/backend/tol_data/README.md index a21418b..f1bbf6b 100644 --- a/backend/tol_data/README.md +++ b/backend/tol_data/README.md @@ -3,7 +3,7 @@ This directory holds files used to generate the tree-of-life database data.db. # Database Tables ## Tree Structure - `nodes`
- Format : `name TEXT PRIMARY KEY, id TEXT UNIQUE, tips INT`
+ Format: `name TEXT PRIMARY KEY, id TEXT UNIQUE, tips INT`
Represents a tree-of-life node. `tips` holds the number of no-child descendants - `edges`
Format: `parent TEXT, child TEXT, p_support INT, PRIMARY KEY (parent, child)`
@@ -59,10 +59,8 @@ takes several days, and occupies over 200 GB. ## Environment Some of the scripts use third-party packages: - `indexed_bzip2`: For parallelised bzip2 processing. -- `jsonpickle`: For encoding class objects as JSON. - `requests`: For downloading data. -- `PIL`: For image processing. -- `tkinter`: For providing a basic GUI to review images. +- `Pillow`: For image processing. - `mwxml`, `mwparserfromhell`: For parsing Wikipedia dumps. ## Generate Tree Structure Data -- cgit v1.2.3