From 8caab74cbdeabbf4ebc223562825b8c999cf3e19 Mon Sep 17 00:00:00 2001
From: Robert Bossy <Robert.Bossy@inra.fr>
Date: Wed, 31 Mar 2021 11:43:35 +0200
Subject: [PATCH] updated README.md

---
 README.md | 30 ++++++++++++------------------
 1 file changed, 12 insertions(+), 18 deletions(-)

diff --git a/README.md b/README.md
index 809ebdc..26623a4 100644
--- a/README.md
+++ b/README.md
@@ -63,25 +63,19 @@ This step uses the following files: `dsmz-match.plan`, `bacdive2alvisnlp.xslt` a
 
 The DSMZ match results are written in the `dsmz-match` directory in the output directory.
 
-#### `report.txt`
+#### `dispatch-report.txt`
 
-A tabular file where each line represents a match.
+A tabular file where each line represents a DSMZ catalog entry.
 
-* `BACDIVE ID`: BacDive entry identifier, **this is different from the DSM catalogue number**.
-* `FIELD`: strain form, either `catalog-number`, `species`, or `species-and-number`.
-* `NAME`: strain form that matches.
-* `NCBI TAXID`: matched taxon identifier in the NCBI taxonomy.
-* ̀ǸCBI CANONICAL`: canonical name of the matched taxon in the NCBI taxonomy.
-* `NCBI RANK`: taxonomic rank of the matched taxon in the NCBI taxonomy.
+* `ENTRY`: name of the catalog entry file.
+* `DISPATCH`: dispatch decision
+    * `append`: new strain appended to NCBI species or NCBI subspecies
+    * `append-species`: new strain and lineage appended to NCBI superspecific taxon
+    * `equivalent`: equivalence with NCBI strain
+    * `type material`: equivalence with NCBI species or subspecies given as type strain
+    * `no-number`: strain has no designation or number
+    * `fail`: no match at any taxonomic level
 
-The three last columns are empty if there is no match.
+#### `dsmz-nodes.txt`
 
-#### The decision tree
-
-1. Try to match
-
-
-
-#### `equivalent-strains.txt`
-
-This file shows equivalence between catalog entries and the NCBI
\ No newline at end of file
+Files in the format of NCBI Taxonomy `nodes.dmp` and `names.dmp` that contain all additions.
-- 
GitLab
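
To illustrate the `dispatch-report.txt` layout that this patch documents, here is a minimal Python sketch that tallies the `DISPATCH` decisions. The patch only says the file is "tabular", so the tab separator and the header row naming the `ENTRY` and `DISPATCH` columns are assumptions. The function name `count_dispatch` and the sample entry names are hypothetical and are not part of the project.

```python
import csv
from collections import Counter
from typing import Iterable

# Decision values listed in the README section added by this patch.
KNOWN_DECISIONS = {
    "append",
    "append-species",
    "equivalent",
    "type material",
    "no-number",
    "fail",
}


def count_dispatch(lines: Iterable[str]) -> Counter:
    """Tally DISPATCH values; unrecognized values are grouped under 'unknown'."""
    # Assumption: tab-separated columns with a header row (ENTRY, DISPATCH).
    reader = csv.DictReader(lines, delimiter="\t")
    counts: Counter = Counter()
    for row in reader:
        decision = (row.get("DISPATCH") or "").strip()
        counts[decision if decision in KNOWN_DECISIONS else "unknown"] += 1
    return counts


# Made-up sample rows, for illustration only.
sample = [
    "ENTRY\tDISPATCH\n",
    "entry-0001.xml\tequivalent\n",
    "entry-0002.xml\tappend\n",
    "entry-0003.xml\tfail\n",
    "entry-0004.xml\tappend\n",
]
print(dict(count_dispatch(sample)))
```

For a real report, an open file handle can be passed in place of the sample list, for example `count_dispatch(open(path, newline=""))`, provided the separator and header assumptions hold.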