# About

This project is designed to extract entities (i.e., `taxa`, `phenotypes`, `habitats`, `disease names`, `hosts`, `pathogen`, `vector`, `dates` and `geographic names`) from textual data for the purpose of scientific watch.

It contains a workflow based on the [AlvisNLP](https://github.com/Bibliome/alvisnlp) framework and uses the OntoBiotope ontology and the NCBI taxonomy.


## Usage
> To run the workflow, `GNU bash, version 4.4.x` and `singularity version 3.4.x` must be installed on your machine. The workflow is compatible with `AlvisNLP version 0.7.x`.
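
The version requirements above can be checked before going further. The following is a minimal sketch, not part of the project: the helper names `version_in_series` and `report` are hypothetical, and it assumes that `singularity --version` prints the version number as the last word of its output.

```shell
#!/usr/bin/env bash
# Hypothetical helper: succeeds when version string $1 belongs to series $2
# (for example 4.4.20 belongs to 4.4, but 4.40.1 does not).
version_in_series() {
  case "$1" in
    "$2" | "$2".*) return 0 ;;
    *) return 1 ;;
  esac
}

# Hypothetical helper: print whether a detected version matches the expected series.
report() {
  local name="$1" found="$2" wanted="$3"
  if version_in_series "$found" "$wanted"; then
    echo "$name $found: OK"
  else
    echo "$name ${found:-not found}: expected $wanted.x"
  fi
}

# BASH_VERSION looks like 4.4.20(1)-release; keep only the leading digits and dots.
bash_version="${BASH_VERSION%%[^0-9.]*}"
report "bash" "$bash_version" "4.4"

if command -v singularity >/dev/null 2>&1; then
  # Assumption: the version number is the last word printed by `singularity --version`.
  sing_version="$(singularity --version | awk '{print $NF}')"
else
  sing_version=""
fi
report "singularity" "$sing_version" "3.4"
```

A mismatch is only reported, not treated as fatal, since other versions may work even though they are not the ones listed here.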


1. Clone the project.

```
git clone https://forgemia.inra.fr/mandiayba/pesv-tm.git
cd pesv-tm
```

2. Pull the Singularity image of AlvisNLP.
> A `login` and `password` are required to pull the AlvisNLP Singularity image from forgemia; contact the maintainer if you do not have access.

```
cd softwares
singularity pull --docker-login alvisnlp.sif oras:registry.forgemia.inra.fr/migale/tm-tools-packages/sif/alvisnlp:v0.0.4
cd ..
```

3. Run the workflow from the project root.
> You may run the workflow on the test corpus `corpus/pesv/Xylella-test/txt/`; results are stored in `corpus/pesv/Xylella-test/`.

```
softwares/alvisnlp.sif -J-Xmx32G -verbose -cleanTmp \
-alias input corpus/pesv/Xylella-test/txt/ \
-outputDir corpus/pesv/Xylella-test/ \
-entity ontobiotope resources/BioNLP-OST+EnovFood \
-feat inhibit-syntax inhibit-syntax \
plans/PESV_workflow.plan
```

4. View the results in `corpus/Xylella/visualisation_html`.
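
Before launching step 3, it can help to confirm that every path passed to the command is in place. The sketch below is an illustration rather than part of the project: `missing_paths` is a hypothetical helper, and the script assumes it is run from the project root with the same paths as the test-corpus command in step 3.

```shell
#!/usr/bin/env bash
# Hypothetical helper: print each given path that does not exist,
# and return non-zero if at least one is missing.
missing_paths() {
  local status=0 path
  for path in "$@"; do
    if [ ! -e "$path" ]; then
      echo "missing: $path"
      status=1
    fi
  done
  return "$status"
}

# Paths copied from the step 3 command, relative to the project root.
if missing_paths \
    softwares/alvisnlp.sif \
    corpus/pesv/Xylella-test/txt/ \
    resources/BioNLP-OST+EnovFood \
    plans/PESV_workflow.plan; then
  echo "all inputs found, the workflow can be started"
else
  echo "fix the paths listed above before running the workflow"
fi
```

A missing `softwares/alvisnlp.sif` usually means that step 2 was skipped or that the image was pulled into another directory.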

## Maintainer
Mouhamadou Ba : mouhamadou.ba@inrae.fr