Commit 1eb811b2 authored by Robert Bossy

refactored reject file

parent cebf0ba2
[0-9/\.\-]+ ### only symbols
[ -@]+
### countries
Argentina
Bulgaria
China
Japan
Mali
Namibia
niger
Panama
Tanzania
Togo
Tonga
turkey
### Common English words
A
AD
AH
Album
all
ALL
AM
AND
AT
BAG
bear
Camera
Cancer
Car
cat
collection
Data
fish
flag
Gas
Hero
Hi
HI
I
Idea
IE
Lap
Major
man
ME
MR
MS
name
none
OR
other
permit
PM
red
root
Sea
Ship
spot
TeSt
This
unknown
white
### Excluded nodes
# root
ncbi:1 ncbi:1
Be ncbi:1587
Bd ncbi:1613 # transposons
unclassified bacterium ncbi:2338
unidentified bacteria ncbi:2338
unidentified bacterium ncbi:2338
unknown bacteria ncbi:2338
ncbi:2387 ncbi:2387
# insertion sequences
ncbi:2673 ncbi:2673
unidentified proteobacterium ncbi:2722
unknown proteobacterium ncbi:2722 # other sequences
rape ncbi:3708 ncbi:28384
Glycine ncbi:3846
rays ncbi:7858 # expression vectors
A hybrid ncbi:8307 ncbi:29278
monitors ncbi:8555
Ara ncbi:9225 # synthetic construct
euro ncbi:9319
man ncbi:9606
bear ncbi:9632
bears ncbi:9632
cat ncbi:9685
pig ncbi:9823
Axis ncbi:9855
Vira ncbi:10239
unidentified poxvirus ncbi:10283
unidentified entomopoxvirus ncbi:10291
ASFV ncbi:10497
degu ncbi:10160
LGT ncbi:11085
LI ncbi:11086
PVA ncbi:12215
GA-1 ncbi:12345
other sequences ncbi:28384
29278
Spea ncbi:30316
A glycine ncbi:307491
ncbi:32630 ncbi:32630
# unidentified
ncbi:32644 ncbi:32644
flag ncbi:34205
plasmids ncbi:36549 # plasmids
hybrid ncbi:37965 ncbi:36549
bacteriophage ncbi:38018
bacteriophages ncbi:38018 # hybrid
unidentified bacteriophage ncbi:38018 ncbi:37965
unidentified phage ncbi:38018
mum ncbi:41568 # cloning vector
Arca ncbi:44596
ncbi:45196 ncbi:45196
ncbi:45197
4ncbi:5328 # shuttle vector
Thymus ncbi:49990 shuttle vector
ncbi:52958
Bacillus ncbi:55087 # expression vector
ncbi:187 ncbi:55511 ncbi:81076
name ncbi:55581
spot ncbi:59837 # human gut metagenome
Laser ncbi:62990 ncbi:408170
Idea ncbi:76236
Codon ncbi:79338
expression vector ncbi:81076 ### Excluded synonyms
unidentified expression vector ncbi:81076 unidentified proteobacterium
Dina ncbi:83994 unknown proteobacterium
gag ncbi:103820 rape
Later ncbi:123504 Glycine
Ada ncbi:125078 ray
Side ncbi:145724 rays
Aa ncbi:152839 BD
tipa ncbi:162890 monitors
This ncbi:169495 Ara
aka ncbi:172644 euro
permit ncbi:173331 bears
Car ncbi:201850 pig
Mene ncbi:206144 Axis
Pero ncbi:214303 Vira
3A ncbi:215167 unidentified poxvirus
Luria ncbi:218032 unidentified entomopoxvirus
Iso ncbi:238707 ASFV
Cis ncbi:245896 degu
ray ncbi:255564 LGT
Pera ncbi:256812 LI
Mops ncbi:258862 PVA
Bias ncbi:272805 GA-1
Sige ncbi:328602 GA 1
Span ncbi:333408 Spea
California ncbi:337343 bacteriophage
teta ncbi:338092 bacteriophages
Circe ncbi:345438 unidentified bacteriophage
Tasa ncbi:381831 unidentified phage
Nusa ncbi:468772 mum
A bacterium ncbi:494443 Arca
--> ncbi:545367 Thymus
phototrophic bacterium
ncbi:55087 Bacillus
Laser
Codon
Dina
gag
Ada
Aa
tipa
aka
Mene
Pero
3A
3a
Luria
Iso
Cis
Pera
Mops
Bias
Sige
California
teta
Circe
Nusa
[A-Z]\. alpha [A-Z]\. alpha
[A-Z]\. beta [A-Z]\. beta
[A-Z]\. gamma [A-Z]\. gamma
[A-Z]\. delta [A-Z]\. delta
[A-Z]\. epsilon [A-Z]\. epsilon
[A-Z]\. group [A-Z]\. group
[A-Z] complex
A hybrid
A glycine
A group A group
A major A major
A minor A minor
@@ -109,30 +187,54 @@ A minimum
S medium S medium
A mouse A mouse
A flagellum A flagellum
S complex
Asp
Beta Beta
Helix Helix
rat rat
Tor Tor
Bio ncbi:463801 Chen
Chen ncbi:8842 Dialysis
Color ncbi:8869 Ideas
Dialysis ncbi:124307 Indicator
Ideas ncbi:76236 Phyla
Indicator ncbi:189528
Phyla ncbi:86858
163164
374463
tetra tetra
408170 Tetra
Delta ncbi:998453 Delta
is ncbi:159382 Electron
Are ncbi:695398
Electron ncbi:1118549
environmental samples environmental samples
E ncbi:178505 clinical samples
AND ncbi:1481724
clinical samples ncbi:88229
clinical samples ncbi:191496 ### Obsolete
clinical samples ncbi:226901 # Be ncbi:1587
# Bd ncbi:1613
# unclassified bacterium ncbi:2338
# unidentified bacteria ncbi:2338
# unidentified bacterium ncbi:2338
# unknown bacteria ncbi:2338
# flag ncbi:34205
# ncbi:5328
# 187 ncbi:55511
# name ncbi:55581
# spot ncbi:59837
# Laser ncbi:62990
# Idea ncbi:76236
# Later ncbi:123504
# Side ncbi:145724
# This ncbi:169495
# permit ncbi:173331
# Car ncbi:201850
# Span ncbi:333408
# Tasa ncbi:381831
# Nusa ncbi:468772
# A bacterium ncbi:494443
# --> ncbi:545367
# A group
# Asp
# Bio ncbi:463801
# Color ncbi:8869
# 163164
# 374463
# is ncbi:159382
# Are ncbi:695398
# E ncbi:178505
# AND ncbi:1481724