Skip to content
Snippets Groups Projects
Commit 1e8e026e authored by Robert Bossy's avatar Robert Bossy
Browse files

NCBI Taxonomy download snakefile

parent 81a9aa77
No related branches found
No related tags found
No related merge requests found
......@@ -8,12 +8,15 @@ ALVISNLP: '~/code/alvisnlp/.test/alvisnlp/bin/alvisnlp'
REWRITE_TAXONOMY: '~/code/bibliome-java-utils/test/install/bin/rewrite-taxonomy'
# NCBI Taxonomy files
NCBI_DIR: 'ncbi-taxonomy_2021-03-26'
# Output and working directories
OUTDIR: 'test'
DSMZ_STRAINS_DIR: 'dsmz-strains'
DSMZ_MATCH_DIR: 'dsmz-match'
NCBI_DIR: 'ncbi-taxonomy'
NCBI_ZIP_URL: 'https://ftp.ncbi.nlm.nih.gov/pub/taxonomy/taxdmp.zip'
configfile: 'config.yaml'
import glob
rule match:
'''
......
configfile: 'config.yaml'
rule unzip:
output:
config['OUTDIR'] + '/' + config['NCBI_DIR'] + '/nodes.dmp'
input:
config['OUTDIR'] + '/' + config['NCBI_DIR'] + '/taxdmp.zip'
shell:
'''unzip -d {config[OUTDIR]}/{config[NCBI_DIR]} {input}'''
rule download:
output:
config['OUTDIR'] + '/' + config['NCBI_DIR'] + '/taxdmp.zip'
shell:
'''curl -o {output} '{config[NCBI_ZIP_URL]}' '''
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment