aboutsummaryrefslogtreecommitdiff
path: root/backend/tolData/eol/README.md
blob: 1a9dbdf3f7a570742939220c42ce9aafd31d9f8d (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
This directory holds files obtained from/using the [Encyclopedia of Life](https://eol.org/).

# Name Data Files
-   vernacularNames.csv <br>
    Obtained from <https://opendata.eol.org/dataset/vernacular-names> on 24/04/2022 (last updated on 27/10/2020).
    Contains alternative-node-names data from EOL.

# Image Metadata Files
-   imagesList.tgz <br>
    Obtained from <https://opendata.eol.org/dataset/images-list> on 24/04/2022 (last updated on 05/02/2020).
    Contains metadata for images from EOL.
-   imagesList/ <br>
    Extracted from imagesList.tgz.
-   genImagesListDb.py <br>
    Creates a database, and imports imagesList/*.csv files into it.
-   imagesList.db <br>
    Created by running genImagesListDb.py <br>
    Tables: <br>
    -   `images`:
        `content_id INT PRIMARY KEY, page_id INT, source_url TEXT, copy_url TEXT, license TEXT, copyright_owner TEXT`

# Image Generation Files
-   downloadImgs.py <br>
    Used to download image files into imgsForReview/.
-   reviewImgs.py <br>
    Used to review images in imgsForReview/, moving acceptable ones into imgs/.