PredictBias

Identification of genomic and pathogenicity islands in prokaryotic genomes
 
A) Input parameters
Genome: 1937.gbk
Threshold dinucleotide bias: 2
Threshold codon bias: 4
Threshold %GC bias: 3
E-value (RPSBlast): 0.05
Genome (non-pathogenic):
 
C) Potential GIs and PAIs in NC_008819
S.No  Start        End          Bias  Virulence  Insertion elements  Prediction
1     NATL1_00001  NATL1_00121  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
NATL1_00001  -3      17       -4.030802  DNA polymerase III subunit beta
NATL1_00011  -3      16       -3.811649  hypothetical protein
NATL1_00021  -3      15       -3.412112  phosphoribosylformylglycinamidine synthase II
NATL1_00031  -3      15       -4.079769  amidophosphoribosyltransferase
NATL1_00041  -1      15       -3.712204  DNA gyrase/topoisomerase IV, subunit A
NATL1_00051  0       14       -1.952539  TPR-repeat pilus assembly protein TadD
NATL1_00061  -1      14       -0.725844  hypothetical protein
NATL1_00071  0       20       2.169767   hypothetical protein
NATL1_00081  0       18       1.641962   transcription antitermination protein NusB
NATL1_00091  0       17       2.378436   signal recognition particle docking protein
NATL1_00101  0       19       3.032369   protein phosphatase 2C
NATL1_00111  -1      19       3.767031   argininosuccinate lyase
NATL1_00121  0       20       4.671953   RNA recognition motif-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_00051  SYCDCHAPRONE  30     0.009    Gram-negative bacterial type III secretion SycD cha...
>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.
Length = 168

Score = 29.5 bits (66), Expect = 0.009
Identities = 18/114 (15%), Positives = 43/114 (37%)

Query: 101 LSLDKVLVINPKNASIYFAKGSIYMNLKNLENAILMLNQGLLLDNKNESGYFQLGNAYIM 160
++ + I+ ++ E+A + +LD+ + + LG
Sbjct: 23 GTIAMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQA 82

Query: 161 LKEYKKALHTYNKVTKLNPNFWQVINNQGLILYEINKKDEALSKFKLAAKLSNN 214
+ +Y A+H+Y+ ++ + + L + + EA S LA +L +
Sbjct: 83 MGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIAD 136



Score = 29.5 bits (66), Expect = 0.009
Identities = 15/86 (17%), Positives = 31/86 (36%)

Query: 48 QIGKTAKQLIQFGEYKEAIKILKLALKLNPTEETLWTTLADAQFKSKDSNNALLSLDKVL 107
Q+ A Q G+Y++A K+ + L+ + + L + + A+ S
Sbjct: 38 QLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGA 97

Query: 108 VINPKNASIYFAKGSIYMNLKNLENA 133
+++ K F + L A
Sbjct: 98 IMDIKEPRFPFHAAECLLQKGELAEA 123


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_00121  cloacin       45     1e-07    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 45.1 bits (106), Expect = 1e-07
Identities = 28/79 (35%), Positives = 32/79 (40%)

Query: 91 GGGGGGYGGGGYGGGGQGGYGGGGQGGYGGGGQGGYGGGGQGGYGGGGQGGYGGGGQGGY 150
GG G G+ G + G G G G GG G +GGG G GG G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 151 GGGGQGGYGGGGQGGYGGG 169
G GG G GGG G G
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 42.8 bits (100), Expect = 6e-07
Identities = 32/83 (38%), Positives = 34/83 (40%), Gaps = 1/83 (1%)

Query: 116 GGYGGGGQGGYGGGGQGGYGGGGQGGYGGGGQGGYGGGGQGGYGGGGQG-GYGGGGQGGY 174
GG G G G GG G GGG G G + GGG G G GG G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 175 GGGGQGGYGGGGYGGGGQADQSA 197
G GG G GGG G GG A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVA 85



Score = 42.0 bits (98), Expect = 1e-06
Identities = 30/79 (37%), Positives = 32/79 (40%), Gaps = 1/79 (1%)

Query: 100 GGYGGGGQGGYGGGGQGGYGGGGQGGYGGGGQGGYGGGGQGGYGGGGQG-GYGGGGQGGY 158
GG G G G GG G GGG G G + GGG G G GG G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 159 GGGGQGGYGGGGQGGYGGG 177
G GG G GGG G G
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 42.0 bits (98), Expect = 1e-06
Identities = 30/79 (37%), Positives = 32/79 (40%), Gaps = 1/79 (1%)

Query: 108 GGYGGGGQGGYGGGGQGGYGGGGQGGYGGGGQGGYGGGGQGGYGGGGQG-GYGGGGQGGY 166
GG G G G GG G GGG G G + GGG G G GG G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 167 GGGGQGGYGGGGQGGYGGG 185
G GG G GGG G G
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 40.5 bits (94), Expect = 4e-06
Identities = 27/65 (41%), Positives = 29/65 (44%), Gaps = 1/65 (1%)

Query: 90 GGGGGGGYGGGGYGGGGQGGYGGGGQGGYGGGGQG-GYGGGGQGGYGGGGQGGYGGGGQG 148
G GG G G GGG G G + GGG G G GG G+G GG G GGG G
Sbjct: 17 SGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSG 76

Query: 149 GYGGG 153
G
Sbjct: 77 TGGNL 81



Score = 35.1 bits (80), Expect = 3e-04
Identities = 24/60 (40%), Positives = 25/60 (41%), Gaps = 1/60 (1%)

Query: 87 GGYGGGGGGGYGGGGYGGGGQGGYGGGGQGGYGGGGQGGYGGGGQGGYGGGGQGGYGGGG 146
GG G G GG G G + GGG G G GG G G GG G G G GG
Sbjct: 22 GGPTGLGVGGGASDGSGWSSENNPWGGGSG-SGIHWGGGSGHGNGGGNGNSGGGSGTGGN 80



Score = 34.7 bits (79), Expect = 3e-04
Identities = 23/58 (39%), Positives = 25/58 (43%), Gaps = 4/58 (6%)

Query: 84 PRRGGYGGGGGGGYGGGG----YGGGGQGGYGGGGQGGYGGGGQGGYGGGGQGGYGGG 137
P G GGG G G +GGG G GG G+G GG G GGG G G
Sbjct: 24 PTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNL 81



Score = 34.7 bits (79), Expect = 3e-04
Identities = 27/84 (32%), Positives = 30/84 (35%), Gaps = 3/84 (3%)

Query: 124 GGYGGGGQGGYGGGGQGGYGGGGQGGYGGGGQGGYGGGGQGGYGGGGQG---GYGGGGQG 180
GG G G G GG G GGG G G + GGG G +GGG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 181 GYGGGGYGGGGQADQSAQDRPSGA 204
G GGG GG + A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 29.7 bits (66), Expect = 0.011
Identities = 21/58 (36%), Positives = 22/58 (37%)

Query: 79 PRGSAPRRGGYGGGGGGGYGGGGYGGGGQGGYGGGGQGGYGGGGQGGYGGGGQGGYGG 136
P G G G G GG G G + GGG G GGG G GGG G
Sbjct: 24 PTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNL 81



Score = 29.3 bits (65), Expect = 0.017
Identities = 15/36 (41%), Positives = 16/36 (44%)

Query: 88 GYGGGGGGGYGGGGYGGGGQGGYGGGGQGGYGGGGQ 123
G G G G +GGG G G G GG G GG
Sbjct: 47 GGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLS 82



Score = 28.5 bits (63), Expect = 0.032
Identities = 21/60 (35%), Positives = 23/60 (38%), Gaps = 1/60 (1%)

Query: 89 YGGGGGGGYGGGGYGGGGQGGYGGGGQGGYGGGGQGGYGGGGQGGYGGGGQGGYGGGGQG 148
+GGG G G GG G G GG G G GG G G +G G GG
Sbjct: 46 WGGGSGSGIHWGGGSGHGNGG-GNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104


2     NATL1_00271  NATL1_00451  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
NATL1_00271  3       15       -1.330886  4-hydroxythreonine-4-phosphate dehydrogenase
NATL1_00281  2       17       -0.865458  nucleoside-diphosphate-sugar epimerases
NATL1_00291  1       14       -0.140001  hypothetical protein
NATL1_00301  0       13       0.531015   HNH endonuclease:HNH nuclease
NATL1_00311  0       14       -1.378373  type II secretion system protein-like protein
NATL1_00321  0       13       -1.909196  hypothetical protein
NATL1_00331  1       12       -2.391410  soluble hydrogenase small subunit
NATL1_00341  0       12       -2.658078  cobalt-precorrin-6A synthase
NATL1_00351  1       12       -3.500613  GMP synthase
NATL1_00361  1       13       -5.138808  hypothetical protein
NATL1_00371  0       14       -5.174057  site-specific DNA methylase
NATL1_00381  -1      11       -4.939728  hypothetical protein
NATL1_00391  0       11       -5.885401  glutathione S-transferase
NATL1_00401  -1      10       -6.292972  hypothetical protein
NATL1_00411  -1      9        -5.480240  hypothetical protein
NATL1_00421  -1      10       -5.324898  hypothetical protein
NATL1_00431  0       11       -4.490242  hypothetical protein
NATL1_00441  0       12       -4.708162  hypothetical protein
NATL1_00451  2       20       -1.169283  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_00421  SYCDCHAPRONE  45     1e-07    Gram-negative bacterial type III secretion SycD cha...
>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.
Length = 168

Score = 44.9 bits (106), Expect = 1e-07
Identities = 21/97 (21%), Positives = 35/97 (36%)

Query: 48 EQIINQALKFHSKGNISEATKYYQYFINQGFKDHRVFSNYGAILRKQGKVKESGFFMRKA 107
EQ+ + A + G +A K +Q D R F GA + G+ +
Sbjct: 37 EQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYG 96

Query: 108 IEIQPNIPIINFNMGNILKDLGKLKEAEIFIRKSIRM 144
+ P F+ L G+L EAE + + +
Sbjct: 97 AIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQEL 133


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_00431  SYCDCHAPRONE  46     5e-08    Gram-negative bacterial type III secretion SycD cha...
>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.
Length = 168

Score = 46.1 bits (109), Expect = 5e-08
Identities = 30/131 (22%), Positives = 45/131 (34%), Gaps = 3/131 (2%)

Query: 46 EQIINQAIQFHLKGNIPKATKYYQQLINQECNDYRVFSNYGAILQGLGKSKEAEASLRKA 105
EQ+ + A + G A K +Q L + D R F GA Q +G+ A S
Sbjct: 37 EQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYG 96

Query: 106 VELNPDLAESHSYLGNLLNDLGKFKEAEASLRKAVEL---NPNLALAHAYLGILLNDLGQ 162
++ + L G+ EAE+ L A EL + +L +
Sbjct: 97 AIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTEFKELSTRVSSMLEAIKL 156

Query: 163 LKEAEASLKKA 173
KE E
Sbjct: 157 KKEMEHECVDN 167



Score = 38.4 bits (89), Expect = 2e-05
Identities = 25/118 (21%), Positives = 43/118 (36%), Gaps = 5/118 (4%)

Query: 93 GKSKEAEASLRKAVELNPDLAESHSYLGNLLNDLGKFKEAEASLRKAVELNPNLALAHAY 152
GK ++A + L+ + LG +G++ A S ++ +
Sbjct: 50 GKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFH 109

Query: 153 LGILLNDLGQLKEAEASLKKAIKLKFGSVKAYDAL----SNVLNKLGRKKEAEESSKK 206
L G+L EAE+ L A +L + L S++L + KKE E
Sbjct: 110 AAECLLQKGELAEAESGLFLAQEL-IADKTEFKELSTRVSSMLEAIKLKKEMEHECVD 166


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_00441  SYCDCHAPRONE  43     1e-06    Gram-negative bacterial type III secretion SycD cha...
>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.
Length = 168

Score = 42.6 bits (100), Expect = 1e-06
Identities = 21/91 (23%), Positives = 32/91 (35%)

Query: 46 EQIISQAIRFHEQGKIIEATKYYQYCLNHDFNDPIVFLNYGTILRSIGKLKKAEIFIRKA 105
EQ+ S A ++ GK +A K +Q D D FL G +++G+ A
Sbjct: 37 EQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYG 96

Query: 106 INIDPNLQDVHFKLGVVLNELNRPKEAIKYF 136
+D F L + EA
Sbjct: 97 AIMDIKEPRFPFHAAECLLQKGELAEAESGL 127


3     NATL1_00651  NATL1_01111  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
NATL1_00651  3       17       -2.073717  hypothetical protein
NATL1_00661  3       20       -2.046681  hypothetical protein
NATL1_00671  5       23       -0.713883  hypothetical protein
NATL1_00681  0       16       -3.186629  hypothetical protein
NATL1_00691  0       15       -2.963280  hypothetical protein
NATL1_00701  1       15       -2.577789  cyanate hydratase
NATL1_00711  3       13       -2.074261  hypothetical protein
NATL1_00721  2       14       -1.823352  hypothetical protein
NATL1_00731  1       15       -1.630439  hypothetical protein
NATL1_00741  4       24       2.450626   hypothetical protein
NATL1_00751  4       25       2.071278   hypothetical protein
NATL1_00761  4       30       2.730413   hypothetical protein
NATL1_00771  4       31       2.522210   hypothetical protein
NATL1_00781  4       16       1.993110   hypothetical protein
NATL1_00791  4       20       8.143434   hypothetical protein
NATL1_00801  4       19       7.576214   hypothetical protein
NATL1_00811  4       19       7.479795   hypothetical protein
NATL1_00821  4       18       7.280529   hypothetical protein
NATL1_00831  4       18       7.045419   hypothetical protein
NATL1_00841  5       21       8.142725   hypothetical protein
NATL1_00851  7       24       -2.187308  hypothetical protein
NATL1_00861  6       25       -2.266149  hypothetical protein
NATL1_00871  4       20       -2.692881  hypothetical protein
NATL1_00881  5       20       -2.579049  hypothetical protein
NATL1_00891  4       18       -2.800375  hypothetical protein
NATL1_00901  3       18       -4.046776  hypothetical protein
NATL1_00911  1       18       -3.178404  hypothetical protein
NATL1_00921  0       16       -4.120456  hypothetical protein
NATL1_00931  0       15       -5.575225  hypothetical protein
NATL1_00941  -1      17       -5.968333  hypothetical protein
NATL1_00951  -1      17       -5.360140  hypothetical protein
NATL1_00961  -1      17       -3.787638  hypothetical protein
NATL1_00971  0       13       -3.442134  hypothetical protein
NATL1_00981  0       14       -2.805675  hypothetical protein
NATL1_00991  1       13       -0.688194  hypothetical protein
NATL1_01001  2       12       -0.755845  hypothetical protein
NATL1_01011  2       14       -2.168849  hypothetical protein
NATL1_01021  2       15       -2.164720  hypothetical protein
NATL1_01031  0       17       -3.327247  hypothetical protein
NATL1_01041  -2      16       -1.878597  hypothetical protein
NATL1_01051  0       12       -2.101195  hypothetical protein
NATL1_01061  -1      12       -3.062955  hypothetical protein
NATL1_01071  -1      13       -3.039224  Short-chain dehydrogenases of various substrate
NATL1_01081  -1      14       -3.601508  hypothetical protein
NATL1_01091  -1      13       -2.823163  hypothetical protein
NATL1_01101  0       12       -3.881315  hypothetical protein
NATL1_01111  1       13       -3.826522  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_00731  SYCDCHAPRONE  48     2e-08    Gram-negative bacterial type III secretion SycD cha...
>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.
Length = 168

Score = 48.0 bits (114), Expect = 2e-08
Identities = 22/138 (15%), Positives = 51/138 (36%), Gaps = 3/138 (2%)

Query: 106 ELNPNFADAHYNLGNTLRDLGKLKEAELSYRKAIEISPNYANTLYNLGTILSDLGKLQDA 165
E++ + + Y+L GK ++A ++ + + LG +G+ A
Sbjct: 30 EISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLA 89

Query: 166 EFSYRQAIIINPNYTEAHYNLGNTLRDLGKLKDAE---LSYRKAIKISPNYAKVHCNLGT 222
SY I++ ++ L G+L +AE ++ I + ++ + +
Sbjct: 90 IHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTEFKELSTRVSS 149

Query: 223 ILRDLGKLKDAELYTRKA 240
+L + K+ E
Sbjct: 150 MLEAIKLKKEMEHECVDN 167



Score = 44.1 bits (104), Expect = 4e-07
Identities = 21/111 (18%), Positives = 37/111 (33%), Gaps = 3/111 (2%)

Query: 38 KLSELSKGQIINQAIQFHAQGNIQKAAKYYQYFIDQGFKDPRVLANYGVILKGFGNSQEA 97
++S + Q+ + A + G + A K +Q D R G + G A
Sbjct: 30 EISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLA 89

Query: 98 ELLYRKAIELNPNFADAHYNLGNTLRDLGKLKEAELSYRKAIEI---SPNY 145
Y ++ ++ L G+L EAE A E+ +
Sbjct: 90 IHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTEF 140



Score = 40.3 bits (94), Expect = 9e-06
Identities = 29/137 (21%), Positives = 53/137 (38%), Gaps = 3/137 (2%)

Query: 140 EISPNYANTLYNLGTILSDLGKLQDAEFSYRQAIIINPNYTEAHYNLGNTLRDLGKLKDA 199
EIS + LY+L GK +DA ++ +++ + LG + +G+ A
Sbjct: 30 EISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLA 89

Query: 200 ELSYRKAIKISPNYAKVHCNLGTILRDLGKLKDAELYTRKAIQL---NPDFAEAYSNLGN 256
SY + + + L G+L +AE A +L +F E + + +
Sbjct: 90 IHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTEFKELSTRVSS 149

Query: 257 ILSDLGNLKEAEISQKK 273
+L + KE E
Sbjct: 150 MLEAIKLKKEMEHECVD 166



Score = 39.5 bits (92), Expect = 1e-05
Identities = 21/133 (15%), Positives = 44/133 (33%), Gaps = 3/133 (2%)

Query: 77 DPRVLANYGVILKGFGNSQEAELLYRKAIELNPNFADAHYNLGNTLRDLGKLKEAELSYR 136
L + G ++A +++ L+ + LG + +G+ A SY
Sbjct: 35 TLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYS 94

Query: 137 KAIEISPNYANTLYNLGTILSDLGKLQDAE---FSYRQAIIINPNYTEAHYNLGNTLRDL 193
+ ++ L G+L +AE F ++ I + E + + L +
Sbjct: 95 YGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTEFKELSTRVSSMLEAI 154

Query: 194 GKLKDAELSYRKA 206
K+ E
Sbjct: 155 KLKKEMEHECVDN 167



Score = 36.4 bits (84), Expect = 2e-04
Identities = 20/103 (19%), Positives = 35/103 (33%)

Query: 175 INPNYTEAHYNLGNTLRDLGKLKDAELSYRKAIKISPNYAKVHCNLGTILRDLGKLKDAE 234
I+ + E Y+L GK +DA ++ + ++ LG + +G+ A
Sbjct: 31 ISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAI 90

Query: 235 LYTRKAIQLNPDFAEAYSNLGNILSDLGNLKEAEISQKKTIEL 277
++ + L G L EAE EL
Sbjct: 91 HSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQEL 133


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_00751  SYCDCHAPRONE  37     1e-05    Gram-negative bacterial type III secretion SycD cha...
>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.
Length = 168

Score = 37.2 bits (86), Expect = 1e-05
Identities = 24/121 (19%), Positives = 42/121 (34%), Gaps = 2/121 (1%)

Query: 86 LELSPTEICLVYSMRGNAKRNSGDFDGAISDQNKALDFDPLYADGYFNRGIAKFKKGDFD 145
E+S + +YS + SG ++ A D + + G + G +D
Sbjct: 29 NEISSDTLEQLYS-LAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYD 87

Query: 146 GAIQDYSQVLKINPKDSDAFFNRANVKKEIDDMKGACEDWRTAADL-GDDDAKKFLRENC 204
AI YS ++ K+ F+ A + ++ A A +L D K L
Sbjct: 88 LAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTEFKELSTRV 147

Query: 205 E 205

Sbjct: 148 S 148



Score = 28.4 bits (63), Expect = 0.012
Identities = 23/131 (17%), Positives = 46/131 (35%), Gaps = 7/131 (5%)

Query: 60 EYFFNRAQDKFELADYEEAILDYNKALELSPTEICLVYSMRGNAKRNSGDFDGAISDQNK 119
E ++ A ++++ YE+A + L + + G ++ G +D AI +
Sbjct: 37 EQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGL-GACRQAMGQYDLAIHSYSY 95

Query: 120 ALDFDPLYADGYFNRGIAKFKKGDFDGAIQ--DYSQVLKINPKDSDAFFNRANVKKE--- 174
D F+ +KG+ A +Q L + + R + E
Sbjct: 96 GAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTEFKELSTRVSSMLEAIK 155

Query: 175 -IDDMKGACED 184
+M+ C D
Sbjct: 156 LKKEMEHECVD 166


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_00841  ICENUCLEATIN  236    3e-65    Ice nucleation protein signature.
		>ICENUCLEATIN#Ice nucleation protein signature.

Length = 1258

Score = 236 bits (604), Expect = 3e-65
Identities = 154/715 (21%), Positives = 283/715 (39%)

Query: 292 ANLDSDVIEDLKEDQVAALDATAVGGFKEDQVAALDATAVGGFKEDQVAALDATAVGGFQ 351
A DS +I Q A ++T G+ Q A + G+ A D++ + G+
Sbjct: 298 AGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYG 357

Query: 352 EDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAV 411
Q A D++ G+ Q A + G+ A D++ + G+ Q A ++T
Sbjct: 358 STQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQT 417

Query: 412 GGFQEDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQVAALDATAVGGFQEDQIAALD 471
G+ Q A + G+ A D++ + G+ Q A D++ G+ Q A
Sbjct: 418 AGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKG 477

Query: 472 ATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQI 531
+ G+ A +++ + G+ Q A +T G+ Q A ++ + G+
Sbjct: 478 SDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTST 537

Query: 532 AALDATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAVAGFDETHVAAFDPLAVAGFD 591
A +++ + G+ Q A+ ++ G+ Q A + AG+ T A D +AG+
Sbjct: 538 AGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYG 597

Query: 592 ETHVAAFDPTAVAGFDETHVAAFDPLAVAGFDETHVAAFDPLAVAGFDEKHIAAIDTQAI 651
T A++ + AG+ T A + G+ T A D +AG+ A ++
Sbjct: 598 STQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILT 657

Query: 652 TGFNADHVAALDAQAMTGLAKDQFVAFEPTAMAGFNADHIAAIDHSYMTGLGKDHVAGFD 711
G+ + A + G + + +AG+ + A + G G A
Sbjct: 658 AGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEG 717

Query: 712 PTAVAGFDEAHVAAFDPLAVAGFDETHFAAFDPLAVAGFDETHVAAFDPLAVAGFDETHV 771
+G+ A D +AG+ T A++ AG+ T A + G+ T
Sbjct: 718 SDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTST 777

Query: 772 AAFDPTAVAGFDETHVAAFDPLAVAGFDEKHIAAIDTQAITGFNADHVAALDAQAMTGLA 831
A D + +AG+ T A + + AG+ A + TG+ + A D+ + G
Sbjct: 778 AGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDLTTGYGSTSTAGADSSLIAGYG 837

Query: 832 KDQFVAFEPTAMAGFNADHIAAIDHSYMTGLGKDHVAGFDPTAVAGFDEAHVAAFDPLAV 891
Q + AG+ + A + TG G AG+D + +AG+ A ++ +
Sbjct: 838 STQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYDSSLIAGYGSTQTAGYNSILT 897

Query: 892 AGFDETHFAAFDPTAVAGFDETHVAAFDPTAVAGFDETHVAAFDPLAVAGFDETHVAAFE 951
AG+ T A + G+ T A ++ + +AG+ T A+F +AG+ + A +
Sbjct: 898 AGYGSTQTAQENSDLTTGYGSTSTAGYESSLIAGYGSTQTASFKSTLMAGYGSSQTAREQ 957

Query: 952 ISAMEGFNPTHVASFNPEAMAGFKGTQLKELDPESFAALTTEQAAEMTPDAAAAF 1006
S G+ T +A ++ +AG+ TQ A + Q AE + A +
Sbjct: 958 SSLTAGYGSTSMAGYDSSLIAGYGSTQTAGYQSTLTAGYGSTQTAEHSSTLTAGY 1012



Score = 229 bits (585), Expect = 5e-63
Identities = 154/687 (22%), Positives = 271/687 (39%)

Query: 292 ANLDSDVIEDLKEDQVAALDATAVGGFKEDQVAALDATAVGGFKEDQVAALDATAVGGFQ 351
A DS + Q A + G+ A D++ + G+ Q A ++T G+
Sbjct: 266 AGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYG 325

Query: 352 EDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAV 411
Q A + G+ A D++ + G+ Q A D++ G+ Q A +
Sbjct: 326 STQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLT 385

Query: 412 GGFQEDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQVAALDATAVGGFQEDQIAALD 471
G+ A D++ + G+ Q A ++T G+ Q A + G+ A D
Sbjct: 386 AGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDD 445

Query: 472 ATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQI 531
++ + G+ Q A D++ G+ Q A + G+ A +++ + G+ Q
Sbjct: 446 SSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQT 505

Query: 532 AALDATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAVAGFDETHVAAFDPLAVAGFD 591
A +T G+ Q A ++ + G+ A +++ +AG+ T A+++ + AG+
Sbjct: 506 AGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYG 565

Query: 592 ETHVAAFDPTAVAGFDETHVAAFDPLAVAGFDETHVAAFDPLAVAGFDEKHIAAIDTQAI 651
T A AG+ T A D +AG+ T A++ AG+ A +
Sbjct: 566 STQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLT 625

Query: 652 TGFNADHVAALDAQAMTGLAKDQFVAFEPTAMAGFNADHIAAIDHSYMTGLGKDHVAGFD 711
TG+ + A D+ + G Q + AG+ + A G G AG D
Sbjct: 626 TGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGAD 685

Query: 712 PTAVAGFDEAHVAAFDPLAVAGFDETHFAAFDPLAVAGFDETHVAAFDPLAVAGFDETHV 771
+ +AG+ A ++ + AG+ T A +G+ T A D +AG+ T
Sbjct: 686 SSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQT 745

Query: 772 AAFDPTAVAGFDETHVAAFDPLAVAGFDEKHIAAIDTQAITGFNADHVAALDAQAMTGLA 831
A++ + AG+ T A + G+ A D+ I G+ + A + G
Sbjct: 746 ASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYG 805

Query: 832 KDQFVAFEPTAMAGFNADHIAAIDHSYMTGLGKDHVAGFDPTAVAGFDEAHVAAFDPLAV 891
Q G+ + A D S + G G AG++ AG+ A +
Sbjct: 806 STQTAQERSDLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLT 865

Query: 892 AGFDETHFAAFDPTAVAGFDETHVAAFDPTAVAGFDETHVAAFDPLAVAGFDETHVAAFE 951
G+ T A +D + +AG+ T A ++ AG+ T A + G+ T A +E
Sbjct: 866 TGYGSTSTAGYDSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYE 925

Query: 952 ISAMEGFNPTHVASFNPEAMAGFKGTQ 978
S + G+ T ASF MAG+ +Q
Sbjct: 926 SSLIAGYGSTQTASFKSTLMAGYGSSQ 952



Score = 228 bits (582), Expect = 1e-62
Identities = 161/754 (21%), Positives = 281/754 (37%), Gaps = 8/754 (1%)

Query: 231 VAPDQSGDPAPTGDAATAPVSDAVAEAAAADPDAAAAAAAPVPPPAEIGGVAPSELAKDD 290
+ ++ A T A D +E + E G P++ +
Sbjct: 105 ILHHRADYVACTEMQAGPGSPDVTSEVKVGNRSLPVTDDID--ATIESGSTQPTQTIEIA 162

Query: 291 I------ANLDSDVIEDLKEDQVAALDATAVGGFKEDQVAALDATAVGGFKEDQVAALDA 344
S +I + A +T + G+ A D+T V G+ Q A ++
Sbjct: 163 TYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTAGEES 222

Query: 345 TAVGGFQEDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQIAALDATAVGGFQEDQIA 404
+ + G+ Q + G+ A D++ + G+ Q A D++ G+ Q A
Sbjct: 223 SQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTA 282

Query: 405 ALDATAVGGFQEDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQVAALDATAVGGFQE 464
+ G+ A D++ + G+ Q A ++T G+ Q A + G+
Sbjct: 283 QKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGS 342

Query: 465 DQIAALDATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAVGGFQEDQVAALDATAVG 524
A D++ + G+ Q A D++ G+ Q A + G+ A D++ +
Sbjct: 343 TGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIA 402

Query: 525 GFQEDQIAALDATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAVAGFDETHVAAFDP 584
G+ Q A ++T G+ Q A + G+ A D++ +AG+ T A D
Sbjct: 403 GYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDS 462

Query: 585 LAVAGFDETHVAAFDPTAVAGFDETHVAAFDPLAVAGFDETHVAAFDPLAVAGFDEKHIA 644
AG+ T A AG+ T A ++ +AG+ T A + AG+ A
Sbjct: 463 SLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTA 522

Query: 645 AIDTQAITGFNADHVAALDAQAMTGLAKDQFVAFEPTAMAGFNADHIAAIDHSYMTGLGK 704
++ ITG+ + A ++ + G Q ++ AG+ + A G G
Sbjct: 523 QNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGS 582

Query: 705 DHVAGFDPTAVAGFDEAHVAAFDPLAVAGFDETHFAAFDPLAVAGFDETHVAAFDPLAVA 764
AG D + +AG+ A++ AG+ T A + G+ T A D +A
Sbjct: 583 TGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIA 642

Query: 765 GFDETHVAAFDPTAVAGFDETHVAAFDPLAVAGFDEKHIAAIDTQAITGFNADHVAALDA 824
G+ T A ++ AG+ T A AG+ A D+ I G+ + A ++
Sbjct: 643 GYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNS 702

Query: 825 QAMTGLAKDQFVAFEPTAMAGFNADHIAAIDHSYMTGLGKDHVAGFDPTAVAGFDEAHVA 884
G Q +G+ + A D S + G G A + + AG+ A
Sbjct: 703 ILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQTA 762

Query: 885 AFDPLAVAGFDETHFAAFDPTAVAGFDETHVAAFDPTAVAGFDETHVAAFDPLAVAGFDE 944
+ G+ T A D + +AG+ T A + AG+ T A G+
Sbjct: 763 REQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDLTTGYGS 822

Query: 945 THVAAFEISAMEGFNPTHVASFNPEAMAGFKGTQ 978
T A + S + G+ T A +N AG+ TQ
Sbjct: 823 TSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQ 856



Score = 226 bits (577), Expect = 5e-62
Identities = 148/715 (20%), Positives = 277/715 (38%)

Query: 292 ANLDSDVIEDLKEDQVAALDATAVGGFKEDQVAALDATAVGGFKEDQVAALDATAVGGFQ 351
A SD+ A D++ + G+ Q A D++ G+ Q A + G+
Sbjct: 330 AQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYG 389

Query: 352 EDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAV 411
A D++ + G+ Q A ++T G+ Q A + G+ A D++ +
Sbjct: 390 STGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLI 449

Query: 412 GGFQEDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQVAALDATAVGGFQEDQIAALD 471
G+ Q A D++ G+ Q A + G+ A +++ + G+ Q A
Sbjct: 450 AGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYG 509

Query: 472 ATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQI 531
+T G+ Q A ++ + G+ A +++ + G+ Q A+ ++ G+ Q
Sbjct: 510 STLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQT 569

Query: 532 AALDATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAVAGFDETHVAAFDPLAVAGFD 591
A + G+ A D++ + G+ Q A+ ++ AG+ T A + G+
Sbjct: 570 AREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYG 629

Query: 592 ETHVAAFDPTAVAGFDETHVAAFDPLAVAGFDETHVAAFDPLAVAGFDEKHIAAIDTQAI 651
T A D + +AG+ T A ++ + AG+ T A AG+ A D+ I
Sbjct: 630 STSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLI 689

Query: 652 TGFNADHVAALDAQAMTGLAKDQFVAFEPTAMAGFNADHIAAIDHSYMTGLGKDHVAGFD 711
G+ + A ++ G Q +G+ + A D S + G G A +
Sbjct: 690 AGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYH 749

Query: 712 PTAVAGFDEAHVAAFDPLAVAGFDETHFAAFDPLAVAGFDETHVAAFDPLAVAGFDETHV 771
+ AG+ A + G+ T A D +AG+ T A + + AG+ T
Sbjct: 750 SSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQT 809

Query: 772 AAFDPTAVAGFDETHVAAFDPLAVAGFDEKHIAAIDTQAITGFNADHVAALDAQAMTGLA 831
A G+ T A D +AG+ A ++ G+ + A ++ TG
Sbjct: 810 AQERSDLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYG 869

Query: 832 KDQFVAFEPTAMAGFNADHIAAIDHSYMTGLGKDHVAGFDPTAVAGFDEAHVAAFDPLAV 891
++ + +AG+ + A + G G A + G+ A ++ +
Sbjct: 870 STSTAGYDSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYESSLI 929

Query: 892 AGFDETHFAAFDPTAVAGFDETHVAAFDPTAVAGFDETHVAAFDPLAVAGFDETHVAAFE 951
AG+ T A+F T +AG+ + A + AG+ T +A +D +AG+ T A ++
Sbjct: 930 AGYGSTQTASFKSTLMAGYGSSQTAREQSSLTAGYGSTSMAGYDSSLIAGYGSTQTAGYQ 989

Query: 952 ISAMEGFNPTHVASFNPEAMAGFKGTQLKELDPESFAALTTEQAAEMTPDAAAAF 1006
+ G+ T A + AG+ T D A + + + A +
Sbjct: 990 STLTAGYGSTQTAEHSSTLTAGYGSTATAGADSSLIAGYGSSLTSGIRSFLTAGY 1044



Score = 226 bits (576), Expect = 7e-62
Identities = 146/688 (21%), Positives = 271/688 (39%)

Query: 292 ANLDSDVIEDLKEDQVAALDATAVGGFKEDQVAALDATAVGGFKEDQVAALDATAVGGFQ 351
A DS +I Q A ++T G+ Q A + G+ A D++ + G+
Sbjct: 394 AGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYG 453

Query: 352 EDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAV 411
Q A D++ G+ Q A + G+ A +++ + G+ Q A +T
Sbjct: 454 STQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLT 513

Query: 412 GGFQEDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQVAALDATAVGGFQEDQIAALD 471
G+ Q A ++ + G+ A +++ + G+ Q A+ ++ G+ Q A
Sbjct: 514 AGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREG 573

Query: 472 ATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQI 531
+ G+ A D++ + G+ Q A+ ++ G+ Q A + G+
Sbjct: 574 SDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTST 633

Query: 532 AALDATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAVAGFDETHVAAFDPLAVAGFD 591
A D++ + G+ Q A ++ G+ Q A + AG+ T A D +AG+
Sbjct: 634 AGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYG 693

Query: 592 ETHVAAFDPTAVAGFDETHVAAFDPLAVAGFDETHVAAFDPLAVAGFDEKHIAAIDTQAI 651
T A ++ AG+ T A +G+ T A D +AG+ A+ +
Sbjct: 694 STQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLT 753

Query: 652 TGFNADHVAALDAQAMTGLAKDQFVAFEPTAMAGFNADHIAAIDHSYMTGLGKDHVAGFD 711
G+ + A + TG + + +AG+ + A G G A
Sbjct: 754 AGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQER 813

Query: 712 PTAVAGFDEAHVAAFDPLAVAGFDETHFAAFDPLAVAGFDETHVAAFDPLAVAGFDETHV 771
G+ A D +AG+ T A ++ + AG+ T A + G+ T
Sbjct: 814 SDLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTST 873

Query: 772 AAFDPTAVAGFDETHVAAFDPLAVAGFDEKHIAAIDTQAITGFNADHVAALDAQAMTGLA 831
A +D + +AG+ T A ++ + AG+ A ++ TG+ + A ++ + G
Sbjct: 874 AGYDSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYESSLIAGYG 933

Query: 832 KDQFVAFEPTAMAGFNADHIAAIDHSYMTGLGKDHVAGFDPTAVAGFDEAHVAAFDPLAV 891
Q +F+ T MAG+ + A S G G +AG+D + +AG+ A +
Sbjct: 934 STQTASFKSTLMAGYGSSQTAREQSSLTAGYGSTSMAGYDSSLIAGYGSTQTAGYQSTLT 993

Query: 892 AGFDETHFAAFDPTAVAGFDETHVAAFDPTAVAGFDETHVAAFDPLAVAGFDETHVAAFE 951
AG+ T A T AG+ T A D + +AG+ + + AG+ T ++
Sbjct: 994 AGYGSTQTAEHSSTLTAGYGSTATAGADSSLIAGYGSSLTSGIRSFLTAGYGSTLISGLR 1053

Query: 952 ISAMEGFNPTHVASFNPEAMAGFKGTQL 979
G+ + ++ AG+ Q+
Sbjct: 1054 SVLTAGYGSSLISGRRSSLTAGYGSNQI 1081



Score = 225 bits (575), Expect = 9e-62
Identities = 155/705 (21%), Positives = 274/705 (38%)

Query: 292 ANLDSDVIEDLKEDQVAALDATAVGGFKEDQVAALDATAVGGFKEDQVAALDATAVGGFQ 351
A DS +I Q A D++ G+ Q A + G+ A D++ + G+
Sbjct: 250 AGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYG 309

Query: 352 EDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAV 411
Q A ++T G+ Q A + G+ A D++ + G+ Q A D++
Sbjct: 310 STQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLT 369

Query: 412 GGFQEDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQVAALDATAVGGFQEDQIAALD 471
G+ Q A + G+ A D++ + G+ Q A ++T G+ Q A
Sbjct: 370 AGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKG 429

Query: 472 ATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQI 531
+ G+ A D++ + G+ Q A D++ G+ Q A + G+
Sbjct: 430 SDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTST 489

Query: 532 AALDATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAVAGFDETHVAAFDPLAVAGFD 591
A +++ + G+ Q A +T G+ Q A ++ + G+ T A + +AG+
Sbjct: 490 AGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYG 549

Query: 592 ETHVAAFDPTAVAGFDETHVAAFDPLAVAGFDETHVAAFDPLAVAGFDEKHIAAIDTQAI 651
T A+++ AG+ T A AG+ T A D +AG+ A+ +
Sbjct: 550 STQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLT 609

Query: 652 TGFNADHVAALDAQAMTGLAKDQFVAFEPTAMAGFNADHIAAIDHSYMTGLGKDHVAGFD 711
G+ + A + TG + + +AG+ + A + G G A
Sbjct: 610 AGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEG 669

Query: 712 PTAVAGFDEAHVAAFDPLAVAGFDETHFAAFDPLAVAGFDETHVAAFDPLAVAGFDETHV 771
AG+ A D +AG+ T A ++ + AG+ T A +G+ T
Sbjct: 670 SDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTST 729

Query: 772 AAFDPTAVAGFDETHVAAFDPLAVAGFDEKHIAAIDTQAITGFNADHVAALDAQAMTGLA 831
A D + +AG+ T A++ AG+ A + TG+ + A D+ + G
Sbjct: 730 AGADSSLIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYG 789

Query: 832 KDQFVAFEPTAMAGFNADHIAAIDHSYMTGLGKDHVAGFDPTAVAGFDEAHVAAFDPLAV 891
Q + AG+ + A TG G AG D + +AG+ A ++ +
Sbjct: 790 STQTAGYHSILTAGYGSTQTAQERSDLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILT 849

Query: 892 AGFDETHFAAFDPTAVAGFDETHVAAFDPTAVAGFDETHVAAFDPLAVAGFDETHVAAFE 951
AG+ T A + G+ T A +D + +AG+ T A ++ + AG+ T A
Sbjct: 850 AGYGSTQTAQENSDLTTGYGSTSTAGYDSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEN 909

Query: 952 ISAMEGFNPTHVASFNPEAMAGFKGTQLKELDPESFAALTTEQAA 996
G+ T A + +AG+ TQ A + Q A
Sbjct: 910 SDLTTGYGSTSTAGYESSLIAGYGSTQTASFKSTLMAGYGSSQTA 954



Score = 225 bits (575), Expect = 9e-62
Identities = 151/705 (21%), Positives = 270/705 (38%)

Query: 292 ANLDSDVIEDLKEDQVAALDATAVGGFKEDQVAALDATAVGGFKEDQVAALDATAVGGFQ 351
A SD+ A D++ + G+ Q A ++T G+ Q A + G+
Sbjct: 282 AQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYG 341

Query: 352 EDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAV 411
A D++ + G+ Q A D++ G+ Q A + G+ A D++ +
Sbjct: 342 STGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLI 401

Query: 412 GGFQEDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQVAALDATAVGGFQEDQIAALD 471
G+ Q A ++T G+ Q A + G+ A D++ + G+ Q A D
Sbjct: 402 AGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGED 461

Query: 472 ATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQI 531
++ G+ Q A + G+ A +++ + G+ Q A +T G+ Q
Sbjct: 462 SSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQT 521

Query: 532 AALDATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAVAGFDETHVAAFDPLAVAGFD 591
A ++ + G+ A +++ + G+ Q A+ ++ AG+ T A AG+
Sbjct: 522 AQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYG 581

Query: 592 ETHVAAFDPTAVAGFDETHVAAFDPLAVAGFDETHVAAFDPLAVAGFDEKHIAAIDTQAI 651
T A D + +AG+ T A++ AG+ T A + G+ A D+ I
Sbjct: 582 STGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLI 641

Query: 652 TGFNADHVAALDAQAMTGLAKDQFVAFEPTAMAGFNADHIAAIDHSYMTGLGKDHVAGFD 711
G+ + A ++ G Q AG+ + A D S + G G AG++
Sbjct: 642 AGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYN 701

Query: 712 PTAVAGFDEAHVAAFDPLAVAGFDETHFAAFDPLAVAGFDETHVAAFDPLAVAGFDETHV 771
AG+ A +G+ T A D +AG+ T A++ AG+ T
Sbjct: 702 SILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQT 761

Query: 772 AAFDPTAVAGFDETHVAAFDPLAVAGFDEKHIAAIDTQAITGFNADHVAALDAQAMTGLA 831
A G+ T A D +AG+ A + G+ + A + TG
Sbjct: 762 AREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDLTTGYG 821

Query: 832 KDQFVAFEPTAMAGFNADHIAAIDHSYMTGLGKDHVAGFDPTAVAGFDEAHVAAFDPLAV 891
+ + +AG+ + A + G G A + G+ A +D +
Sbjct: 822 STSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYDSSLI 881

Query: 892 AGFDETHFAAFDPTAVAGFDETHVAAFDPTAVAGFDETHVAAFDPLAVAGFDETHVAAFE 951
AG+ T A ++ AG+ T A + G+ T A ++ +AG+ T A+F+
Sbjct: 882 AGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYESSLIAGYGSTQTASFK 941

Query: 952 ISAMEGFNPTHVASFNPEAMAGFKGTQLKELDPESFAALTTEQAA 996
+ M G+ + A AG+ T + D A + Q A
Sbjct: 942 STLMAGYGSSQTAREQSSLTAGYGSTSMAGYDSSLIAGYGSTQTA 986



Score = 225 bits (574), Expect = 1e-61
Identities = 149/689 (21%), Positives = 269/689 (39%)

Query: 295 DSDVIEDLKEDQVAALDATAVGGFKEDQVAALDATAVGGFKEDQVAALDATAVGGFQEDQ 354
DS + Q A + G+ A D++ + G+ Q A ++T G+ Q
Sbjct: 365 DSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQ 424

Query: 355 IAALDATAVGGFQEDQVAALDATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAVGGF 414
A + G+ A D++ + G+ Q A D++ G+ Q A + G+
Sbjct: 425 TAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGY 484

Query: 415 QEDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQVAALDATAVGGFQEDQIAALDATA 474
A +++ + G+ Q A +T G+ Q A ++ + G+ A +++
Sbjct: 485 GSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSL 544

Query: 475 VGGFQEDQIAALDATAVGGFQEDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQIAAL 534
+ G+ Q A+ ++ G+ Q A + G+ A D++ + G+ Q A+
Sbjct: 545 IAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASY 604

Query: 535 DATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAVAGFDETHVAAFDPLAVAGFDETH 594
++ G+ Q A + G+ A D++ +AG+ T A ++ + AG+ T
Sbjct: 605 HSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQ 664

Query: 595 VAAFDPTAVAGFDETHVAAFDPLAVAGFDETHVAAFDPLAVAGFDEKHIAAIDTQAITGF 654
A AG+ T A D +AG+ T A ++ + AG+ A + +G+
Sbjct: 665 TAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGY 724

Query: 655 NADHVAALDAQAMTGLAKDQFVAFEPTAMAGFNADHIAAIDHSYMTGLGKDHVAGFDPTA 714
+ A D+ + G Q ++ + AG+ + A TG G AG D +
Sbjct: 725 GSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSL 784

Query: 715 VAGFDEAHVAAFDPLAVAGFDETHFAAFDPLAVAGFDETHVAAFDPLAVAGFDETHVAAF 774
+AG+ A + + AG+ T A G+ T A D +AG+ T A +
Sbjct: 785 IAGYGSTQTAGYHSILTAGYGSTQTAQERSDLTTGYGSTSTAGADSSLIAGYGSTQTAGY 844

Query: 775 DPTAVAGFDETHVAAFDPLAVAGFDEKHIAAIDTQAITGFNADHVAALDAQAMTGLAKDQ 834
+ AG+ T A + G+ A D+ I G+ + A ++ G Q
Sbjct: 845 NSILTAGYGSTQTAQENSDLTTGYGSTSTAGYDSSLIAGYGSTQTAGYNSILTAGYGSTQ 904

Query: 835 FVAFEPTAMAGFNADHIAAIDHSYMTGLGKDHVAGFDPTAVAGFDEAHVAAFDPLAVAGF 894
G+ + A + S + G G A F T +AG+ + A AG+
Sbjct: 905 TAQENSDLTTGYGSTSTAGYESSLIAGYGSTQTASFKSTLMAGYGSSQTAREQSSLTAGY 964

Query: 895 DETHFAAFDPTAVAGFDETHVAAFDPTAVAGFDETHVAAFDPLAVAGFDETHVAAFEISA 954
T A +D + +AG+ T A + T AG+ T A AG+ T A + S
Sbjct: 965 GSTSMAGYDSSLIAGYGSTQTAGYQSTLTAGYGSTQTAEHSSTLTAGYGSTATAGADSSL 1024

Query: 955 MEGFNPTHVASFNPEAMAGFKGTQLKELD 983
+ G+ + + AG+ T + L
Sbjct: 1025 IAGYGSSLTSGIRSFLTAGYGSTLISGLR 1053



Score = 224 bits (572), Expect = 2e-61
Identities = 153/715 (21%), Positives = 280/715 (39%)

Query: 292 ANLDSDVIEDLKEDQVAALDATAVGGFKEDQVAALDATAVGGFKEDQVAALDATAVGGFQ 351
A DS ++ Q A +++ + G+ Q + G+ A D++ + G+
Sbjct: 202 AGADSTLVAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYG 261

Query: 352 EDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAV 411
Q A D++ G+ Q A + G+ A D++ + G+ Q A ++T
Sbjct: 262 STQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQT 321

Query: 412 GGFQEDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQVAALDATAVGGFQEDQIAALD 471
G+ Q A + G+ A D++ + G+ Q A D++ G+ Q A
Sbjct: 322 AGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKG 381

Query: 472 ATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQI 531
+ G+ A D++ + G+ Q A ++T G+ Q A + G+
Sbjct: 382 SDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGT 441

Query: 532 AALDATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAVAGFDETHVAAFDPLAVAGFD 591
A D++ + G+ Q A D++ G+ Q A + AG+ T A ++ +AG+
Sbjct: 442 AGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYG 501

Query: 592 ETHVAAFDPTAVAGFDETHVAAFDPLAVAGFDETHVAAFDPLAVAGFDEKHIAAIDTQAI 651
T A + T AG+ T A + + G+ T A + +AG+ A+ ++
Sbjct: 502 STQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLT 561

Query: 652 TGFNADHVAALDAQAMTGLAKDQFVAFEPTAMAGFNADHIAAIDHSYMTGLGKDHVAGFD 711
G+ + A + G + + +AG+ + A+ S G G A
Sbjct: 562 AGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQ 621

Query: 712 PTAVAGFDEAHVAAFDPLAVAGFDETHFAAFDPLAVAGFDETHVAAFDPLAVAGFDETHV 771
G+ A D +AG+ T A ++ + AG+ T A AG+ T
Sbjct: 622 SVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTST 681

Query: 772 AAFDPTAVAGFDETHVAAFDPLAVAGFDEKHIAAIDTQAITGFNADHVAALDAQAMTGLA 831
A D + +AG+ T A ++ + AG+ A + +G+ + A D+ + G
Sbjct: 682 AGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYG 741

Query: 832 KDQFVAFEPTAMAGFNADHIAAIDHSYMTGLGKDHVAGFDPTAVAGFDEAHVAAFDPLAV 891
Q ++ + AG+ + A TG G AG D + +AG+ A + +
Sbjct: 742 STQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILT 801

Query: 892 AGFDETHFAAFDPTAVAGFDETHVAAFDPTAVAGFDETHVAAFDPLAVAGFDETHVAAFE 951
AG+ T A G+ T A D + +AG+ T A ++ + AG+ T A
Sbjct: 802 AGYGSTQTAQERSDLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEN 861

Query: 952 ISAMEGFNPTHVASFNPEAMAGFKGTQLKELDPESFAALTTEQAAEMTPDAAAAF 1006
G+ T A ++ +AG+ TQ + A + Q A+ D +
Sbjct: 862 SDLTTGYGSTSTAGYDSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGY 916



Score = 222 bits (566), Expect = 1e-60
Identities = 142/687 (20%), Positives = 270/687 (39%)

Query: 292 ANLDSDVIEDLKEDQVAALDATAVGGFKEDQVAALDATAVGGFKEDQVAALDATAVGGFQ 351
A DS + Q A + G+ A +++ + G+ Q A +T G+
Sbjct: 458 AGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYG 517

Query: 352 EDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAV 411
Q A ++ + G+ A +++ + G+ Q A+ ++ G+ Q A +
Sbjct: 518 STQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLT 577

Query: 412 GGFQEDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQVAALDATAVGGFQEDQIAALD 471
G+ A D++ + G+ Q A+ ++ G+ Q A + G+ A D
Sbjct: 578 AGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGAD 637

Query: 472 ATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQI 531
++ + G+ Q A ++ G+ Q A + G+ A D++ + G+ Q
Sbjct: 638 SSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQT 697

Query: 532 AALDATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAVAGFDETHVAAFDPLAVAGFD 591
A ++ G+ Q A + G+ A D++ +AG+ T A++ AG+
Sbjct: 698 AGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYG 757

Query: 592 ETHVAAFDPTAVAGFDETHVAAFDPLAVAGFDETHVAAFDPLAVAGFDEKHIAAIDTQAI 651
T A G+ T A D +AG+ T A + + AG+ A +
Sbjct: 758 STQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDLT 817

Query: 652 TGFNADHVAALDAQAMTGLAKDQFVAFEPTAMAGFNADHIAAIDHSYMTGLGKDHVAGFD 711
TG+ + A D+ + G Q + AG+ + A + TG G AG+D
Sbjct: 818 TGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYD 877

Query: 712 PTAVAGFDEAHVAAFDPLAVAGFDETHFAAFDPLAVAGFDETHVAAFDPLAVAGFDETHV 771
+ +AG+ A ++ + AG+ T A + G+ T A ++ +AG+ T
Sbjct: 878 SSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYESSLIAGYGSTQT 937

Query: 772 AAFDPTAVAGFDETHVAAFDPLAVAGFDEKHIAAIDTQAITGFNADHVAALDAQAMTGLA 831
A+F T +AG+ + A AG+ +A D+ I G+ + A + G
Sbjct: 938 ASFKSTLMAGYGSSQTAREQSSLTAGYGSTSMAGYDSSLIAGYGSTQTAGYQSTLTAGYG 997

Query: 832 KDQFVAFEPTAMAGFNADHIAAIDHSYMTGLGKDHVAGFDPTAVAGFDEAHVAAFDPLAV 891
Q T AG+ + A D S + G G +G AG+ ++ +
Sbjct: 998 STQTAEHSSTLTAGYGSTATAGADSSLIAGYGSSLTSGIRSFLTAGYGSTLISGLRSVLT 1057

Query: 892 AGFDETHFAAFDPTAVAGFDETHVAAFDPTAVAGFDETHVAAFDPLAVAGFDETHVAAFE 951
AG+ + + + AG+ +A+ + +AG + T + + +AG + A +
Sbjct: 1058 AGYGSSLISGRRSSLTAGYGSNQIASHRSSLIAGPESTQITGNRSMLIAGKGSSQTAGYR 1117

Query: 952 ISAMEGFNPTHVASFNPEAMAGFKGTQ 978
+ + G + +A + +AG TQ
Sbjct: 1118 STLISGADSVQMAGERGKLIAGADSTQ 1144



Score = 219 bits (558), Expect = 8e-60
Identities = 136/688 (19%), Positives = 267/688 (38%)

Query: 292 ANLDSDVIEDLKEDQVAALDATAVGGFKEDQVAALDATAVGGFKEDQVAALDATAVGGFQ 351
A DS +I Q A D++ G+ Q A + G+ A +++ + G+
Sbjct: 442 AGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYG 501

Query: 352 EDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAV 411
Q A +T G+ Q A ++ + G+ A +++ + G+ Q A+ ++
Sbjct: 502 STQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLT 561

Query: 412 GGFQEDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQVAALDATAVGGFQEDQIAALD 471
G+ Q A + G+ A D++ + G+ Q A+ ++ G+ Q A
Sbjct: 562 AGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQ 621

Query: 472 ATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQI 531
+ G+ A D++ + G+ Q A ++ G+ Q A + G+
Sbjct: 622 SVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTST 681

Query: 532 AALDATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAVAGFDETHVAAFDPLAVAGFD 591
A D++ + G+ Q A ++ G+ Q A + +G+ T A D +AG+
Sbjct: 682 AGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYG 741

Query: 592 ETHVAAFDPTAVAGFDETHVAAFDPLAVAGFDETHVAAFDPLAVAGFDEKHIAAIDTQAI 651
T A++ + AG+ T A + G+ T A D +AG+ A +
Sbjct: 742 STQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILT 801

Query: 652 TGFNADHVAALDAQAMTGLAKDQFVAFEPTAMAGFNADHIAAIDHSYMTGLGKDHVAGFD 711
G+ + A + TG + + +AG+ + A + G G A +
Sbjct: 802 AGYGSTQTAQERSDLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEN 861

Query: 712 PTAVAGFDEAHVAAFDPLAVAGFDETHFAAFDPLAVAGFDETHVAAFDPLAVAGFDETHV 771
G+ A +D +AG+ T A ++ + AG+ T A + G+ T
Sbjct: 862 SDLTTGYGSTSTAGYDSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTST 921

Query: 772 AAFDPTAVAGFDETHVAAFDPLAVAGFDEKHIAAIDTQAITGFNADHVAALDAQAMTGLA 831
A ++ + +AG+ T A+F +AG+ A + G+ + +A D+ + G
Sbjct: 922 AGYESSLIAGYGSTQTASFKSTLMAGYGSSQTAREQSSLTAGYGSTSMAGYDSSLIAGYG 981

Query: 832 KDQFVAFEPTAMAGFNADHIAAIDHSYMTGLGKDHVAGFDPTAVAGFDEAHVAAFDPLAV 891
Q ++ T AG+ + A + G G AG D + +AG+ + +
Sbjct: 982 STQTAGYQSTLTAGYGSTQTAEHSSTLTAGYGSTATAGADSSLIAGYGSSLTSGIRSFLT 1041

Query: 892 AGFDETHFAAFDPTAVAGFDETHVAAFDPTAVAGFDETHVAAFDPLAVAGFDETHVAAFE 951
AG+ T + AG+ + ++ + AG+ +A+ +AG + T +
Sbjct: 1042 AGYGSTLISGLRSVLTAGYGSSLISGRRSSLTAGYGSNQIASHRSSLIAGPESTQITGNR 1101

Query: 952 ISAMEGFNPTHVASFNPEAMAGFKGTQL 979
+ G + A + ++G Q+
Sbjct: 1102 SMLIAGKGSSQTAGYRSTLISGADSVQM 1129



Score = 217 bits (554), Expect = 3e-59
Identities = 150/705 (21%), Positives = 266/705 (37%)

Query: 292 ANLDSDVIEDLKEDQVAALDATAVGGFKEDQVAALDATAVGGFKEDQVAALDATAVGGFQ 351
SD+ A D++ + G+ Q A D++ G+ Q A + G+
Sbjct: 234 GMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYG 293

Query: 352 EDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAV 411
A D++ + G+ Q A ++T G+ Q A + G+ A D++ +
Sbjct: 294 STGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLI 353

Query: 412 GGFQEDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQVAALDATAVGGFQEDQIAALD 471
G+ Q A D++ G+ Q A + G+ A D++ + G+ Q A +
Sbjct: 354 AGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEE 413

Query: 472 ATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQI 531
+T G+ Q A + G+ A D++ + G+ Q A D++ G+ Q
Sbjct: 414 STQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQT 473

Query: 532 AALDATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAVAGFDETHVAAFDPLAVAGFD 591
A + G+ A +++ + G+ Q A +T AG+ T A + + G+
Sbjct: 474 AQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGYG 533

Query: 592 ETHVAAFDPTAVAGFDETHVAAFDPLAVAGFDETHVAAFDPLAVAGFDEKHIAAIDTQAI 651
T A + + +AG+ T A+++ + AG+ T A AG+ A D+ I
Sbjct: 534 STSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSII 593

Query: 652 TGFNADHVAALDAQAMTGLAKDQFVAFEPTAMAGFNADHIAAIDHSYMTGLGKDHVAGFD 711
G+ + A+ + G Q + G+ + A D S + G G AG++
Sbjct: 594 AGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYN 653

Query: 712 PTAVAGFDEAHVAAFDPLAVAGFDETHFAAFDPLAVAGFDETHVAAFDPLAVAGFDETHV 771
AG+ A AG+ T A D +AG+ T A ++ + AG+ T
Sbjct: 654 SILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQT 713

Query: 772 AAFDPTAVAGFDETHVAAFDPLAVAGFDEKHIAAIDTQAITGFNADHVAALDAQAMTGLA 831
A +G+ T A D +AG+ A+ + G+ + A + TG
Sbjct: 714 AQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYG 773

Query: 832 KDQFVAFEPTAMAGFNADHIAAIDHSYMTGLGKDHVAGFDPTAVAGFDEAHVAAFDPLAV 891
+ + +AG+ + A G G A G+ A D +
Sbjct: 774 STSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDLTTGYGSTSTAGADSSLI 833

Query: 892 AGFDETHFAAFDPTAVAGFDETHVAAFDPTAVAGFDETHVAAFDPLAVAGFDETHVAAFE 951
AG+ T A ++ AG+ T A + G+ T A +D +AG+ T A +
Sbjct: 834 AGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYDSSLIAGYGSTQTAGYN 893

Query: 952 ISAMEGFNPTHVASFNPEAMAGFKGTQLKELDPESFAALTTEQAA 996
G+ T A N + G+ T + A + Q A
Sbjct: 894 SILTAGYGSTQTAQENSDLTTGYGSTSTAGYESSLIAGYGSTQTA 938



Score = 208 bits (530), Expect = 2e-56
Identities = 134/682 (19%), Positives = 261/682 (38%)

Query: 292 ANLDSDVIEDLKEDQVAALDATAVGGFKEDQVAALDATAVGGFKEDQVAALDATAVGGFQ 351
A SD+ A +++ + G+ Q A +T G+ Q A ++ + G+
Sbjct: 474 AQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGYG 533

Query: 352 EDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAV 411
A +++ + G+ Q A+ ++ G+ Q A + G+ A D++ +
Sbjct: 534 STSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSII 593

Query: 412 GGFQEDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQVAALDATAVGGFQEDQIAALD 471
G+ Q A+ ++ G+ Q A + G+ A D++ + G+ Q A +
Sbjct: 594 AGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYN 653

Query: 472 ATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQI 531
+ G+ Q A + G+ A D++ + G+ Q A ++ G+ Q
Sbjct: 654 SILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQT 713

Query: 532 AALDATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAVAGFDETHVAAFDPLAVAGFD 591
A + G+ A D++ + G+ Q A+ ++ AG+ T A + G+
Sbjct: 714 AQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYG 773

Query: 592 ETHVAAFDPTAVAGFDETHVAAFDPLAVAGFDETHVAAFDPLAVAGFDEKHIAAIDTQAI 651
T A D + +AG+ T A + + AG+ T A G+ A D+ I
Sbjct: 774 STSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDLTTGYGSTSTAGADSSLI 833

Query: 652 TGFNADHVAALDAQAMTGLAKDQFVAFEPTAMAGFNADHIAAIDHSYMTGLGKDHVAGFD 711
G+ + A ++ G Q G+ + A D S + G G AG++
Sbjct: 834 AGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYDSSLIAGYGSTQTAGYN 893

Query: 712 PTAVAGFDEAHVAAFDPLAVAGFDETHFAAFDPLAVAGFDETHVAAFDPLAVAGFDETHV 771
AG+ A + G+ T A ++ +AG+ T A+F +AG+ +
Sbjct: 894 SILTAGYGSTQTAQENSDLTTGYGSTSTAGYESSLIAGYGSTQTASFKSTLMAGYGSSQT 953

Query: 772 AAFDPTAVAGFDETHVAAFDPLAVAGFDEKHIAAIDTQAITGFNADHVAALDAQAMTGLA 831
A + AG+ T +A +D +AG+ A + G+ + A + G
Sbjct: 954 AREQSSLTAGYGSTSMAGYDSSLIAGYGSTQTAGYQSTLTAGYGSTQTAEHSSTLTAGYG 1013

Query: 832 KDQFVAFEPTAMAGFNADHIAAIDHSYMTGLGKDHVAGFDPTAVAGFDEAHVAAFDPLAV 891
+ + +AG+ + + I G G ++G AG+ + ++
Sbjct: 1014 STATAGADSSLIAGYGSSLTSGIRSFLTAGYGSTLISGLRSVLTAGYGSSLISGRRSSLT 1073

Query: 892 AGFDETHFAAFDPTAVAGFDETHVAAFDPTAVAGFDETHVAAFDPLAVAGFDETHVAAFE 951
AG+ A+ + +AG + T + +AG + A + ++G D +A
Sbjct: 1074 AGYGSNQIASHRSSLIAGPESTQITGNRSMLIAGKGSSQTAGYRSTLISGADSVQMAGER 1133

Query: 952 ISAMEGFNPTHVASFNPEAMAG 973
+ G + T A + +AG
Sbjct: 1134 GKLIAGADSTQTAGDRSKLLAG 1155



Score = 206 bits (525), Expect = 8e-56
Identities = 139/700 (19%), Positives = 263/700 (37%)

Query: 260 ADPDAAAAAAAPVPPPAEIGGVAPSELAKDDIANLDSDVIEDLKEDQVAALDATAVGGFK 319
A D++ A A+ G + A +S +I Q A +T G+
Sbjct: 458 AGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYG 517

Query: 320 EDQVAALDATAVGGFKEDQVAALDATAVGGFQEDQIAALDATAVGGFQEDQVAALDATAV 379
Q A ++ + G+ A +++ + G+ Q A+ ++ G+ Q A +
Sbjct: 518 STQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLT 577

Query: 380 GGFQEDQIAALDATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAVGGFQEDQVAALD 439
G+ A D++ + G+ Q A+ ++ G+ Q A + G+ A D
Sbjct: 578 AGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGAD 637

Query: 440 ATAVGGFQEDQVAALDATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAVGGFQEDQI 499
++ + G+ Q A ++ G+ Q A + G+ A D++ + G+ Q
Sbjct: 638 SSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQT 697

Query: 500 AALDATAVGGFQEDQVAALDATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAVGGFQ 559
A ++ G+ Q A + G+ A D++ + G+ Q A+ ++ G+
Sbjct: 698 AGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYG 757

Query: 560 EDQIAALDATAVAGFDETHVAAFDPLAVAGFDETHVAAFDPTAVAGFDETHVAAFDPLAV 619
Q A + G+ T A D +AG+ T A + AG+ T A
Sbjct: 758 STQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDLT 817

Query: 620 AGFDETHVAAFDPLAVAGFDEKHIAAIDTQAITGFNADHVAALDAQAMTGLAKDQFVAFE 679
G+ T A D +AG+ A ++ G+ + A ++ TG ++
Sbjct: 818 TGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYD 877

Query: 680 PTAMAGFNADHIAAIDHSYMTGLGKDHVAGFDPTAVAGFDEAHVAAFDPLAVAGFDETHF 739
+ +AG+ + A + G G A + G+ A ++ +AG+ T
Sbjct: 878 SSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYESSLIAGYGSTQT 937

Query: 740 AAFDPLAVAGFDETHVAAFDPLAVAGFDETHVAAFDPTAVAGFDETHVAAFDPLAVAGFD 799
A+F +AG+ + A AG+ T +A +D + +AG+ T A + AG+
Sbjct: 938 ASFKSTLMAGYGSSQTAREQSSLTAGYGSTSMAGYDSSLIAGYGSTQTAGYQSTLTAGYG 997

Query: 800 EKHIAAIDTQAITGFNADHVAALDAQAMTGLAKDQFVAFEPTAMAGFNADHIAAIDHSYM 859
A + G+ + A D+ + G AG+ + I+ +
Sbjct: 998 STQTAEHSSTLTAGYGSTATAGADSSLIAGYGSSLTSGIRSFLTAGYGSTLISGLRSVLT 1057

Query: 860 TGLGKDHVAGFDPTAVAGFDEAHVAAFDPLAVAGFDETHFAAFDPTAVAGFDETHVAAFD 919
G G ++G + AG+ +A+ +AG + T +AG + A +
Sbjct: 1058 AGYGSSLISGRRSSLTAGYGSNQIASHRSSLIAGPESTQITGNRSMLIAGKGSSQTAGYR 1117

Query: 920 PTAVAGFDETHVAAFDPLAVAGFDETHVAAFEISAMEGFN 959
T ++G D +A +AG D T A + G N
Sbjct: 1118 STLISGADSVQMAGERGKLIAGADSTQTAGDRSKLLAGNN 1157



Score = 197 bits (501), Expect = 6e-53
Identities = 131/644 (20%), Positives = 252/644 (39%), Gaps = 1/644 (0%)

Query: 292 ANLDSDVIEDLKEDQVAALDATAVGGFKEDQVAALDATAVGGFKEDQVAALDATAVGGFQ 351
A +S +I Q A+ ++ G+ Q A + G+ A D++ + G+
Sbjct: 538 AGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYG 597

Query: 352 EDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAV 411
Q A+ ++ G+ Q A + G+ A D++ + G+ Q A ++
Sbjct: 598 STQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILT 657

Query: 412 GGFQEDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQVAALDATAVGGFQEDQIAALD 471
G+ Q A + G+ A D++ + G+ Q A ++ G+ Q A
Sbjct: 658 AGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEG 717

Query: 472 ATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQI 531
+ G+ A D++ + G+ Q A+ ++ G+ Q A + G+
Sbjct: 718 SDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTST 777

Query: 532 AALDATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAVAGFDETHVAAFDPLAVAGFD 591
A D++ + G+ Q A + G+ Q A + G+ T A D +AG+
Sbjct: 778 AGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDLTTGYGSTSTAGADSSLIAGYG 837

Query: 592 ETHVAAFDPTAVAGFDETHVAAFDPLAVAGFDETHVAAFDPLAVAGFDEKHIAAIDTQAI 651
T A ++ AG+ T A + G+ T A +D +AG+ A ++
Sbjct: 838 STQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYDSSLIAGYGSTQTAGYNSILT 897

Query: 652 TGFNADHVAALDAQAMTGLAKDQFVAFEPTAMAGFNADHIAAIDHSYMTGLGKDHVAGFD 711
G+ + A ++ TG +E + +AG+ + A+ + M G G A
Sbjct: 898 AGYGSTQTAQENSDLTTGYGSTSTAGYESSLIAGYGSTQTASFKSTLMAGYGSSQTAREQ 957

Query: 712 PTAVAGFDEAHVAAFDPLAVAGFDETHFAAFDPLAVAGFDETHVAAFDPLAVAGFDETHV 771
+ AG+ +A +D +AG+ T A + AG+ T A AG+ T
Sbjct: 958 SSLTAGYGSTSMAGYDSSLIAGYGSTQTAGYQSTLTAGYGSTQTAEHSSTLTAGYGSTAT 1017

Query: 772 AAFDPTAVAGFDETHVAAFDPLAVAGFDEKHIAAIDTQAITGFNADHVAALDAQAMTGLA 831
A D + +AG+ + + AG+ I+ + + G+ + ++ + G
Sbjct: 1018 AGADSSLIAGYGSSLTSGIRSFLTAGYGSTLISGLRSVLTAGYGSSLISGRRSSLTAGYG 1077

Query: 832 KDQFVAFEPTAMAGFNADHIAAIDHSYMTGLGKDHVAGFDPTAVAGFDEAHVAAFDPLAV 891
+Q + + +AG + I + G G AG+ T ++G D +A +
Sbjct: 1078 SNQIASHRSSLIAGPESTQITGNRSMLIAGKGSSQTAGYRSTLISGADSVQMAGERGKLI 1137

Query: 892 AGFDETHFAAFDPTAVAGFDETHVAAFDPTAVAGFDETHVAAFD 935
AG D T A +AG + +++ A D + + ++ + A D
Sbjct: 1138 AGADSTQTAGDRSKLLAG-NNSYLTAGDRSKLTAGNDCILMAGD 1180



Score = 196 bits (499), Expect = 1e-52
Identities = 138/657 (21%), Positives = 249/657 (37%)

Query: 292 ANLDSDVIEDLKEDQVAALDATAVGGFKEDQVAALDATAVGGFKEDQVAALDATAVGGFQ 351
A +SD+I A +++ + G+ Q A+ ++ G+ Q A + G+
Sbjct: 522 AQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYG 581

Query: 352 EDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAV 411
A D++ + G+ Q A+ ++ G+ Q A + G+ A D++ +
Sbjct: 582 STGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLI 641

Query: 412 GGFQEDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQVAALDATAVGGFQEDQIAALD 471
G+ Q A ++ G+ Q A + G+ A D++ + G+ Q A +
Sbjct: 642 AGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYN 701

Query: 472 ATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQI 531
+ G+ Q A + G+ A D++ + G+ Q A+ ++ G+ Q
Sbjct: 702 SILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQT 761

Query: 532 AALDATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAVAGFDETHVAAFDPLAVAGFD 591
A + G+ A D++ + G+ Q A + AG+ T A G+
Sbjct: 762 AREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDLTTGYG 821

Query: 592 ETHVAAFDPTAVAGFDETHVAAFDPLAVAGFDETHVAAFDPLAVAGFDEKHIAAIDTQAI 651
T A D + +AG+ T A ++ + AG+ T A + G+ A D+ I
Sbjct: 822 STSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYDSSLI 881

Query: 652 TGFNADHVAALDAQAMTGLAKDQFVAFEPTAMAGFNADHIAAIDHSYMTGLGKDHVAGFD 711
G+ + A ++ G Q G+ + A + S + G G A F
Sbjct: 882 AGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYESSLIAGYGSTQTASFK 941

Query: 712 PTAVAGFDEAHVAAFDPLAVAGFDETHFAAFDPLAVAGFDETHVAAFDPLAVAGFDETHV 771
T +AG+ + A AG+ T A +D +AG+ T A + AG+ T
Sbjct: 942 STLMAGYGSSQTAREQSSLTAGYGSTSMAGYDSSLIAGYGSTQTAGYQSTLTAGYGSTQT 1001

Query: 772 AAFDPTAVAGFDETHVAAFDPLAVAGFDEKHIAAIDTQAITGFNADHVAALDAQAMTGLA 831
A T AG+ T A D +AG+ + I + G+ + ++ L + G
Sbjct: 1002 AEHSSTLTAGYGSTATAGADSSLIAGYGSSLTSGIRSFLTAGYGSTLISGLRSVLTAGYG 1061

Query: 832 KDQFVAFEPTAMAGFNADHIAAIDHSYMTGLGKDHVAGFDPTAVAGFDEAHVAAFDPLAV 891
+ AG+ ++ IA+ S + G + G +AG + A + +
Sbjct: 1062 SSLISGRRSSLTAGYGSNQIASHRSSLIAGPESTQITGNRSMLIAGKGSSQTAGYRSTLI 1121

Query: 892 AGFDETHFAAFDPTAVAGFDETHVAAFDPTAVAGFDETHVAAFDPLAVAGFDETHVA 948
+G D A +AG D T A +AG + A AG D +A
Sbjct: 1122 SGADSVQMAGERGKLIAGADSTQTAGDRSKLLAGNNSYLTAGDRSKLTAGNDCILMA 1178



Score = 194 bits (494), Expect = 5e-52
Identities = 158/727 (21%), Positives = 271/727 (37%), Gaps = 24/727 (3%)

Query: 296 SDVIEDLKEDQVAALDATAVGGFKEDQVAALDATAVGGFKEDQVAALDATAVGGFQEDQI 355
I + D VA + A G + +D A +++ + Q +I
Sbjct: 102 MQFILHHRADYVACTEMQAGPGSPDVTSEVKVGNRSLPVTDDIDATIESGSTQPTQTIEI 161

Query: 356 AALDATAVGGFQEDQVAALDATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAVGGFQ 415
A +T G Q +A +T G IA +T G D+T V G+
Sbjct: 162 ATYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGTAG--------ADSTLVAGYG 213

Query: 416 EDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQVAALDATAVGGFQEDQIAALDATAV 475
Q A +++ + G+ Q + G+ A D++ + G+ Q A D++
Sbjct: 214 STQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLT 273

Query: 476 GGFQEDQIAALDATAVGGFQEDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQIAALD 535
G+ Q A + G+ A D++ + G+ Q A ++T G+ Q A
Sbjct: 274 AGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKG 333

Query: 536 ATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAVAGFDETHVAAFDPLAVAGFDETHV 595
+ G+ A D++ + G+ Q A D++ AG+ T A AG+ T
Sbjct: 334 SDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGT 393

Query: 596 AAFDPTAVAGFDETHVAAFDPLAVAGFDETHVAAFDPLAVAGFDEKHIAAIDTQAITGFN 655
A D + +AG+ T A + AG+ T A AG+ A D+ I G+
Sbjct: 394 AGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYG 453

Query: 656 ADHVAALDAQAMTGLAKDQFVAFEPTAMAGFNADHIAAIDHSYMTGLGKDHVAGFDPTAV 715
+ A D+ G Q AG+ + A + S + G G AG+ T
Sbjct: 454 STQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLT 513

Query: 716 AGFDEAHVAAFDPLAVAGFDETHFAAFDPLAVAGFDETHVAAFDPLAVAGFDETHVAAFD 775
AG+ A + + G+ T A + +AG+ T A+++ + AG+ T A
Sbjct: 514 AGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREG 573

Query: 776 PTAVAGFDETHVAAFDPLAVAGFDEKHIAAIDTQAITGFNADHVAALDAQAMTGLAKDQF 835
AG+ T A D +AG+ A+ + G+ + A + TG
Sbjct: 574 SDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTST 633

Query: 836 VAFEPTAMAGFNADHIAAIDHSYMTGLGK--------DHVAGFDPTAVAGFDEAHVAAFD 887
+ + +AG+ + A + G G D AG+ T+ AG D + +A +
Sbjct: 634 AGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYG 693

Query: 888 PLAVAGFDETHFAAFDPTA--------VAGFDETHVAAFDPTAVAGFDETHVAAFDPLAV 939
AG++ A + T +G+ T A D + +AG+ T A++
Sbjct: 694 STQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLT 753

Query: 940 AGFDETHVAAFEISAMEGFNPTHVASFNPEAMAGFKGTQLKELDPESFAALTTEQAAEMT 999
AG+ T A + G+ T A + +AG+ TQ A + Q A+
Sbjct: 754 AGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQER 813

Query: 1000 PDAAAAF 1006
D +
Sbjct: 814 SDLTTGY 820



Score = 183 bits (464), Expect = 2e-48
Identities = 134/654 (20%), Positives = 244/654 (37%), Gaps = 8/654 (1%)

Query: 292 ANLDSDVIEDLKEDQVAALDATAVGGFKEDQVAALDATAVGGFKEDQVAALDATAVGGFQ 351
A SD+ A D++ + G+ Q A+ ++ G+ Q A + G+
Sbjct: 570 AREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYG 629

Query: 352 EDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAV 411
A D++ + G+ Q A ++ G+ Q A + G+ A D++ +
Sbjct: 630 STSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLI 689

Query: 412 GGFQEDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQVAALDATAVGGFQEDQIAALD 471
G+ Q A ++ G+ Q A + G+ A D++ + G+ Q A+
Sbjct: 690 AGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYH 749

Query: 472 ATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQI 531
++ G+ Q A + G+ A D++ + G+ Q A + G+ Q
Sbjct: 750 SSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQT 809

Query: 532 AALDATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAVAGFDETHVAAFDPLAVAGFD 591
A + G+ A D++ + G+ Q A ++ AG+ T A + G+
Sbjct: 810 AQERSDLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYG 869

Query: 592 ETHVAAFDPTAVAGFDETHVAAFDPLAVAGFDETHVAAFDPLAVAGFDEKHIAAIDTQAI 651
T A +D + +AG+ T A ++ + AG+ T A + G+ A ++ I
Sbjct: 870 STSTAGYDSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYESSLI 929

Query: 652 TGFNADHVAALDAQAMTGLAKDQFVAFEPTAMAGFNADHIAAIDHSYMTGLGKDHVAGFD 711
G+ + A+ + M G Q + + AG+ + +A D S + G G AG+
Sbjct: 930 AGYGSTQTASFKSTLMAGYGSSQTAREQSSLTAGYGSTSMAGYDSSLIAGYGSTQTAGYQ 989

Query: 712 PTAVAGFDEAHVAAFDPLAVAGFDETHFAAFDPLAVAGFDETHVAAFDPLAVAGFDETHV 771
T AG+ A AG+ T A D +AG+ + + AG+ T +
Sbjct: 990 STLTAGYGSTQTAEHSSTLTAGYGSTATAGADSSLIAGYGSSLTSGIRSFLTAGYGSTLI 1049

Query: 772 AAFDPTAVAGFDETHVAAFDPLAVAGFDEKHIAAIDTQAITGFNADHVAALDAQAMTGLA 831
+ AG+ + ++ AG+ IA+ + I G + + + + G
Sbjct: 1050 SGLRSVLTAGYGSSLISGRRSSLTAGYGSNQIASHRSSLIAGPESTQITGNRSMLIAGKG 1109

Query: 832 KDQFVAFEPTAMAGFNADHIAAIDHSYMTGLGKDHVAGFDPTAVAGFDEAHVAAFDPLAV 891
Q AG+ + I+ D M G +AG D T AG +A +
Sbjct: 1110 SSQ--------TAGYRSTLISGADSVQMAGERGKLIAGADSTQTAGDRSKLLAGNNSYLT 1161

Query: 892 AGFDETHFAAFDPTAVAGFDETHVAAFDPTAVAGFDETHVAAFDPLAVAGFDET 945
AG A D +AG A + AG + + AG +
Sbjct: 1162 AGDRSKLTAGNDCILMAGDRSKLTAGINSILTAGCRSKLIGSNGSTLTAGENSV 1215



Score = 149 bits (376), Expect = 4e-38
Identities = 110/536 (20%), Positives = 203/536 (37%)

Query: 276 AEIGGVAPSELAKDDIANLDSDVIEDLKEDQVAALDATAVGGFKEDQVAALDATAVGGFK 335
A+ G + A DS +I Q A ++ G+ Q A + G+
Sbjct: 666 AQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYG 725

Query: 336 EDQVAALDATAVGGFQEDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQIAALDATAV 395
A D++ + G+ Q A+ ++ G+ Q A + G+ A D++ +
Sbjct: 726 STSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLI 785

Query: 396 GGFQEDQIAALDATAVGGFQEDQIAALDATAVGGFQEDQVAALDATAVGGFQEDQVAALD 455
G+ Q A + G+ Q A + G+ A D++ + G+ Q A +
Sbjct: 786 AGYGSTQTAGYHSILTAGYGSTQTAQERSDLTTGYGSTSTAGADSSLIAGYGSTQTAGYN 845

Query: 456 ATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAVGGFQEDQV 515
+ G+ Q A ++ G+ A D++ + G+ Q A ++ G+ Q
Sbjct: 846 SILTAGYGSTQTAQENSDLTTGYGSTSTAGYDSSLIAGYGSTQTAGYNSILTAGYGSTQT 905

Query: 516 AALDATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAVGGFQEDQIAALDATAVAGFD 575
A ++ G+ A +++ + G+ Q A+ +T + G+ Q A ++ AG+
Sbjct: 906 AQENSDLTTGYGSTSTAGYESSLIAGYGSTQTASFKSTLMAGYGSSQTAREQSSLTAGYG 965

Query: 576 ETHVAAFDPLAVAGFDETHVAAFDPTAVAGFDETHVAAFDPLAVAGFDETHVAAFDPLAV 635
T +A +D +AG+ T A + T AG+ T A AG+ T A D +
Sbjct: 966 STSMAGYDSSLIAGYGSTQTAGYQSTLTAGYGSTQTAEHSSTLTAGYGSTATAGADSSLI 1025

Query: 636 AGFDEKHIAAIDTQAITGFNADHVAALDAQAMTGLAKDQFVAFEPTAMAGFNADHIAAID 695
AG+ + I + G+ + ++ L + G + AG+ ++ IA+
Sbjct: 1026 AGYGSSLTSGIRSFLTAGYGSTLISGLRSVLTAGYGSSLISGRRSSLTAGYGSNQIASHR 1085

Query: 696 HSYMTGLGKDHVAGFDPTAVAGFDEAHVAAFDPLAVAGFDETHFAAFDPLAVAGFDETHV 755
S + G + G +AG + A + ++G D A +AG D T
Sbjct: 1086 SSLIAGPESTQITGNRSMLIAGKGSSQTAGYRSTLISGADSVQMAGERGKLIAGADSTQT 1145

Query: 756 AAFDPLAVAGFDETHVAAFDPTAVAGFDETHVAAFDPLAVAGFDEKHIAAIDTQAI 811
A +AG + A AG D +A AG + A ++ I
Sbjct: 1146 AGDRSKLLAGNNSYLTAGDRSKLTAGNDCILMAGDRSKLTAGINSILTAGCRSKLI 1201


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
NATL1_01021  CABNDNGRPT  41     2e-05    NodO calcium binding signature.
		>CABNDNGRPT#NodO calcium binding signature.

Length = 479

Score = 40.7 bits (95), Expect = 2e-05
Identities = 21/74 (28%), Positives = 30/74 (40%), Gaps = 5/74 (6%)

Query: 312 EVLDGIDGPDVLKGGLGNDTFKGNGGNDTIDGGSDFDIATYSGNFSDYTFTIANKVVTIS 371
++L G ++L+GG GND G G DT+ GG+ D Y I+
Sbjct: 350 DILVGNSADNILQGGAGNDVLYGGAGADTLYGGAGRDTFVYGSGQDSTVAAY----DWIA 405

Query: 372 DNRLSENDGIDTLS 385
D D ID +
Sbjct: 406 DF-QKGIDKIDLSA 418



Score = 31.5 bits (71), Expect = 0.011
Identities = 9/32 (28%), Positives = 15/32 (46%)

Query: 314 LDGIDGPDVLKGGLGNDTFKGNGGNDTIDGGS 345
+ G+ G + G+ + G GND + G S
Sbjct: 325 VGGLKGNVSIAHGVTIENAIGGSGNDILVGNS 356


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_01071  DHBDHDRGNASE  50     2e-09    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 49.7 bits (118), Expect = 2e-09
Identities = 40/186 (21%), Positives = 72/186 (38%), Gaps = 14/186 (7%)

Query: 5 LITGSNRGIGLELVRQLKDRGDDVIAT-CRSASPELNSLSVRVETN------IDITSGDS 57
ITG+ +GIG + R L +G + A E S++ E D+ +
Sbjct: 12 FITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSAA 71

Query: 58 VVKLRDNLKDNS--VDVLIQNAGIAEFNSLSNLDPQSIVHQFEVNALSPLCCVHTLLSKL 115
+ ++ ++ +D+L+ AG+ + +L + F VN+ ++ +
Sbjct: 72 IDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSKYM 131

Query: 116 --SKSAKIALISSRMGSIEDNNSGGSYGYRMSKVALCMAGKSLSVDLMPRGISVGILHPG 173
+S I + S + S +Y SK A M K L ++L I I+ PG
Sbjct: 132 MDRRSGSIVTVGSNPAGVP-RTSMAAYA--SSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 174 LVSTRM 179
T M
Sbjct: 189 STETDM 194


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits  Score  E-value  Comments
NATL1_01091  SECA  32     0.002    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 32.2 bits (73), Expect = 0.002
Identities = 10/38 (26%), Positives = 18/38 (47%)

Query: 104 RSVELTVELASLLKQLNIPHAVLLVKVDFRQRRIANEA 141
S+E + +++ L + I H VL K + I +A
Sbjct: 457 ISIEKSELVSNELTKAGIKHNVLNAKFHANEAAIVAQA 494
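
The summary lines that precede each alignment ("Score = ... bits (...), Expect = ..." and "Identities = a/b (p%), Positives = ...") can be read mechanically. The following is a minimal sketch, not part of the PredictBias output: the function names `parse_hsp_summary` and `truncated_percent` are hypothetical, and the observation that the reported percentages are truncated rather than rounded is inferred only from the values shown in this report (for example 30/74 is printed as 40%).

```python
import re

# Patterns for the two summary lines printed above each alignment block.
HSP_SCORE = re.compile(r"Score = ([\d.]+) bits \((\d+)\), Expect = (\S+)")
HSP_COUNT = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_hsp_summary(score_line, counts_line):
    """Return bit score, raw score, E-value and the count triples of one HSP."""
    match = HSP_SCORE.search(score_line)
    if match is None:
        raise ValueError("not a Score/Expect line: %r" % score_line)
    expect = match.group(3)
    if expect.startswith("e"):
        # Some BLAST versions print very small E-values as "e-100".
        expect = "1" + expect
    summary = {
        "bits": float(match.group(1)),
        "raw_score": int(match.group(2)),
        "evalue": float(expect),
    }
    # Each entry becomes (count, alignment length, reported percent).
    for label, num, den, pct in HSP_COUNT.findall(counts_line):
        summary[label.lower()] = (int(num), int(den), int(pct))
    return summary


def truncated_percent(num, den):
    """Integer percentage without rounding (assumed from the report's values)."""
    return 100 * num // den


hsp = parse_hsp_summary(
    "Score = 32.2 bits (73), Expect = 0.002",
    "Identities = 10/38 (26%), Positives = 18/38 (47%)",
)
print(hsp["bits"], hsp["evalue"], hsp["identities"], hsp["positives"])
```

The "Gaps" entry is only present when the alignment contains gaps, so callers should look it up with `summary.get("gaps")`.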


4  NATL1_01921  NATL1_01971  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
NATL1_01921  3       15       -0.521860  cell division inhibitor
NATL1_01931  4       15       -1.045085  hypothetical protein
NATL1_01941  5       16       -0.527807  heat shock protein DnaJ
NATL1_01951  4       14       0.997724   O-acetylserine (thiol)-lyase A
NATL1_01961  4       13       -1.769206  hypothetical protein
NATL1_01971  2       11       -1.555934  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_01921  NUCEPIMERASE  44     4e-07    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 44.0 bits (104), Expect = 4e-07
Identities = 31/154 (20%), Positives = 63/154 (40%), Gaps = 20/154 (12%)

Query: 1 MKLLLTGCTGFIGRELIPLLINEGHSLTVIS--------RQSQGKVQTIANDQNINFIQM 52
MK L+TG GFIG + L+ GH + I Q +++ +A F ++
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQP-GFQFHKI 59

Query: 53 NPAESSSWDKEEIQNSLKSCE--GVINLAGEPIAEKRWTTDHCKEITNSRIETTKNLIKS 110
+ A D+E + + S V R++ ++ +S + N+++
Sbjct: 60 DLA-----DREGMTDLFASGHFERVFISPHR--LAVRYSLENPHAYADSNLTGFLNILEG 112

Query: 111 LRNLRKSPKVLINASAIGFYGSHPQTEFTEENIP 144
R+ + L+ AS+ YG + + F+ ++
Sbjct: 113 CRHN--KIQHLLYASSSSVYGLNRKMPFSTDDSV 144


5  NATL1_03661  NATL1_04061  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
NATL1_03661  -2      10       3.143734   NUDIX hydrolase
NATL1_03671  -2      10       2.892177   2-amino-4-hydroxy-6-
NATL1_03681  -2      9        2.985468   protoporphyrin IX magnesium chelatase subunit
NATL1_03691  -2      10       0.999898   ABC transporter
NATL1_03701  -1      13       2.024519   ABC transporter ATP-binding protein
NATL1_03711  0       13       2.396744   hypothetical protein
NATL1_03721  -2      15       2.568202   NADH dehydrogenase subunit J
NATL1_03731  -1      16       3.052939   NADH dehydrogenase subunit B
NATL1_03741  0       18       3.659074   NADH dehydrogenase subunit A
NATL1_03751  -2      14       2.645505   rubredoxin
NATL1_03761  -1      14       -0.471769  hypothetical protein
NATL1_03771  -1      18       -2.896859  cytochrome b559 subunit alpha
NATL1_03781  -2      15       -2.169922  photosystem II reaction center L
NATL1_03791  -2      15       -2.423661  photosystem II reaction center protein J
NATL1_03801  -1      15       -2.095040  5'-methylthioadenosine phosphorylase
NATL1_03811  -1      15       -2.156101  selenide,water dikinase
NATL1_03821  2       14       -1.926828  tRNA nucleotidyltransferase/poly(A) polymerase
NATL1_03831  1       11       -0.893702  UvrD/REP helicase
NATL1_03841  4       12       -4.191330  hypothetical protein
NATL1_03851  3       11       -4.322199  phycobilisome protein
NATL1_03861  3       13       -5.761520  phycobilisome protein (phycoerythrin,
NATL1_03871  2       12       -6.499569  bilin biosynthesis protein cpeZ
NATL1_03881  1       13       -5.064464  HEAT repeat-containing protein
NATL1_03891  0       13       -5.305501  bilin biosynthesis protein CpeY
NATL1_03901  0       13       -3.640238  CpeT
NATL1_03911  0       12       -3.385914  phycoerythrin linker protein CpeS
NATL1_03921  1       11       -2.996082  hypothetical protein
NATL1_03931  1       13       -2.331323  hypothetical protein
NATL1_03941  3       13       -2.681573  nucleoside-diphosphate-sugar epimerases
NATL1_03951  -1      14       0.155074   nucleotide sugar epimerase
NATL1_03961  -1      15       -0.641956  hypothetical protein
NATL1_03971  -3      15       0.338588   hypothetical protein
NATL1_03981  -2      14       0.031559   *hypothetical protein
NATL1_03991  -3      15       0.341622   carbohydrate kinase
NATL1_04001  -2      13       3.496265   S-adenosylmethionine synthetase
NATL1_04011  -1      13       2.327899   HAD family hydrolase
NATL1_04021  0       17       4.163786   30S ribosomal protein S1
NATL1_04031  -3      18       3.712321   transcriptional regulator NrdR
NATL1_04041  -2      14       3.741398   photosystem II reaction center protein T
NATL1_04051  0       13       3.544850   photosystem II PsbB protein (CP47)
NATL1_04061  5       17       -1.409668  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_03681  TONBPROTEIN   32     0.005    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 32.3 bits (73), Expect = 0.005
Identities = 15/70 (21%), Positives = 24/70 (34%), Gaps = 6/70 (8%)

Query: 355 LKAAVRLVIAPRAMQMPSEEEMEPPAPEDQQPPPPPPEDSDDNNDQEEDQEEDQEEEQDE 414
L +V VI A P M PA + PP + E + E + +
Sbjct: 28 LYTSVHQVIELPAPAQPISVTMVTPADLEPPQAVQPPPEP-----VVEPEPEPEPIPEPP 82

Query: 415 ESSP-PIPEE 423
+ +P I +
Sbjct: 83 KEAPVVIEKP 92


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_03701  PF05272       34     8e-04    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 33.5 bits (76), Expect = 8e-04
Identities = 12/44 (27%), Positives = 20/44 (45%)

Query: 36 LAIVGPSGSGKSTILRLLAGLLLPSEGSLKISGIDQNYLRLDQN 79
+ + G G GKST++ L GL S+ I +Y ++
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGI 642


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_03941  NUCEPIMERASE  136    7e-40    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 136 bits (345), Expect = 7e-40
Identities = 70/337 (20%), Positives = 131/337 (38%), Gaps = 45/337 (13%)

Query: 7 KNLVTGGAGFVGSHLIDRLMKSGEKVICLDNFFTGSKENIEH----WIGHPSFELI---- 58
K LVTG AGF+G H+ RL+++G +V+ +DN +++ + P F+
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 59 -DHDVIEPIKLD--VDRIWHLACPASPIHYQF-NPIKTAKTSFLGTYNMLGLARKVG-AR 113
D + + + +R++ + + Y NP A ++ G N+L R
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLA-VRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 114 ILLASTSEVYGNPEIHPQPEKYNGNVNPIGIRSCYDEGKRVAESLCYDYMRMHGLEIRIA 173
+L AS+S VYG P + + +P+ S Y K+ E + + Y ++GL
Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVD-HPV---SLYAATKKANELMAHTYSHLYGLPATGL 176

Query: 174 RIFNTYGPRMLLNDGR---LISNLLVQSIHGNDLTIYGNGKQTRSFCFVDDLIDGLTLFM 230
R F YGP GR + + G + +Y GK R F ++DD+ + +
Sbjct: 177 RFFTVYGPW-----GRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQ 231

Query: 231 NSL----------NVGP---------MNLGNPEELSILQITNFIRNISIEKVNLKFLKAL 271
+ + P N+GN + ++ + + + L
Sbjct: 232 DVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQ 291

Query: 272 DDDPLRRKPDIYLAKKELNWEPKIMFKEGLAITRKYF 308
D L D + + + P+ K+G+ ++
Sbjct: 292 PGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWY 328


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_03951  NUCEPIMERASE  475    e-172    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 475 bits (1224), Expect = e-172
Identities = 172/337 (51%), Positives = 227/337 (67%), Gaps = 5/337 (1%)

Query: 9 TILVTGAAGFIGAALVKALLNLDFKVIGIDNLNDYYSTSLKRSRLTEIEKVSTVNGEWFF 68
LVTGAAGFIG + K LL +V+GIDNLNDYY SLK++RL + + + F
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQ-----PGFQF 56

Query: 69 YEIPIEDNKVLQDIINRYNPQVFVHLAAQAGVRYSITNPAAYIQSNLVGFANVLEGCRQN 128
++I + D + + D+ + + + VRYS+ NP AY SNL GF N+LEGCR N
Sbjct: 57 HKIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHN 116

Query: 129 QIPHLIYASSSSVYGGNKNLPFYEEQAVNHPVSLYAATKKSNELMAHTYSHLYDLPTTGL 188
+I HL+YASSSSVYG N+ +PF + +V+HPVSLYAATKK+NELMAHTYSHLY LP TGL
Sbjct: 117 KIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGL 176

Query: 189 RFFTVYGPWGRPDMAPMIFARSILNNEPIQVFNYGKMQRDFTYIDDVVEGIIRCCFKKAS 248
RFFTVYGPWGRPDMA F +++L + I V+NYGKM+RDFTYIDD+ E IIR
Sbjct: 177 RFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPH 236

Query: 249 IDDEFNPLVPNPSTSSAPYRIFNIGNSRPTQLTYFIELLEKNLGKKAIKNFQPMQPGDVV 308
D ++ P+ S APYR++NIGNS P +L +I+ LE LG +A KN P+QPGDV+
Sbjct: 237 ADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQPGDVL 296

Query: 309 STAARMDLLNSWVDYKPITSIENGIKLFSEWYLDYFK 345
T+A L + + P T++++G+K F WY D++K
Sbjct: 297 ETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDFYK 333


6  NATL1_04211  NATL1_04341  Y  N  N  Genomic Island
LocusTag     DNBias  CDNBias  %GCBias    Product
NATL1_04211  -1      14       -3.258490  photosystem I reaction center subunit IV
NATL1_04221  -1      14       -3.566820  hypothetical protein
NATL1_04231  -2      14       -4.618927  LysM domain-containing protein
NATL1_04241  -2      11       -2.056678  aldehyde dehydrogenase
NATL1_04251  0       15       -2.496686  hypothetical protein
NATL1_04261  0       17       -2.312638  hypothetical protein
NATL1_04271  3       25       2.301752   hypothetical protein
NATL1_04281  3       27       3.419507   hypothetical protein
NATL1_04291  4       29       3.750216   hypothetical protein
NATL1_04301  5       33       3.563703   hypothetical protein
NATL1_04311  5       39       4.040913   hypothetical protein
NATL1_04321  8       44       5.997739   hypothetical protein
NATL1_04331  4       31       2.936793   hypothetical protein
NATL1_04341  2       22       2.831684   hypothetical protein
7  NATL1_04521  NATL1_04571  Y  N  N  Genomic Island
LocusTag     DNBias  CDNBias  %GCBias    Product
NATL1_04521  -1      14       3.179514   light repressed protein A-like protein
NATL1_04531  -2      14       3.700180   lipoate-protein ligase B
NATL1_04541  -3      11       3.072397   long-chain-fatty-acid--CoA ligase
NATL1_04551  -2      12       3.400381   hypothetical protein
NATL1_04561  -3      13       3.815451   branched-chain alpha-keto acid dehydrogenase
NATL1_04571  -3      13       3.113196   S-adenosylmethionine:tRNA
8  NATL1_04961  NATL1_05181  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
NATL1_04961  1       13       3.050866   riboflavin synthase subunit alpha
NATL1_04971  0       13       3.896980   hypothetical protein
NATL1_04981  -1      12       3.218167   cytochrome c oxidase, subunit III
NATL1_04991  -1      12       3.040180   cytochrome c oxidase, subunit I
NATL1_05001  -2      15       2.116494   cytochrome c oxidase, subunit 2
NATL1_05011  -2      12       2.718185   hypothetical protein
NATL1_05021  -1      12       2.233765   protoheme IX farnesyltransferase
NATL1_05031  -1      11       1.838696   multidrug efflux ABC transporter
NATL1_05041  0       11       2.502159   multidrug efflux ABC transporter
NATL1_05051  -1      12       3.298019   hypothetical protein
NATL1_05061  -2      12       3.369086   molecular chaperone GroEL
NATL1_05071  -2      12       1.406411   hypothetical protein
NATL1_05081  -2      12       0.965049   3-oxoacyl-ACP reductase
NATL1_05091  -3      10       0.414427   2-C-methyl-D-erythritol 4-phosphate
NATL1_05101  0       13       -0.214682  carboxypeptidase
NATL1_05111  1       13       -1.191129  prenyltransferase
NATL1_05121  -1      12       -0.416287  exopolyphosphatase
NATL1_05131  2       12       0.200462   hypothetical protein
NATL1_05141  2       12       0.790385   precorrin-4 C11-methyltransferase
NATL1_05151  5       10       0.881999   prolipoprotein diacylglyceryl transferase
NATL1_05161  1       12       -0.491845  apocytochrome f
NATL1_05171  1       11       -0.638427  cytochrome b6-f complex iron-sulfur subunit
NATL1_05181  2       13       -1.675519  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_04991  PF05272       31     0.016    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.8 bits (69), Expect = 0.016
Identities = 17/91 (18%), Positives = 29/91 (31%), Gaps = 12/91 (13%)

Query: 458 YDPQFQLINQISSVGALLMALSTLPFLWNILQSILFG---EEAGDNPWNALTPEWLTSSP 514
++ L AL+ AL + P L + PW +
Sbjct: 442 LRGRWLLKP---RRAALIEALRSAPALAGCVAFDELREQPVAVRAFPWRKAPGPLEDADV 498

Query: 515 PPVENWDGEAPLVLEPYGYGEKDSNETQEAI 545
+ ++ V YG GE + T++AI
Sbjct: 499 LRLADY------VETTYGTGEASAQTTEQAI 523


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_05041  ABC2TRNSPORT  57     1e-11    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 56.9 bits (137), Expect = 1e-11
Identities = 62/247 (25%), Positives = 97/247 (39%), Gaps = 21/247 (8%)

Query: 40 PSTLFAGILQPLIWLLLFAALFSKAPIDFLPGSSSYGEFLGAGLIVFTAFSGALNAGLPL 99
++L + +PLI+L F + G SY FL AG++ +A + A +
Sbjct: 32 LASLLGHLAEPLIYL--FGLGAGLGVMVGRVGGVSYTAFLAAGMVATSAMTAATFETIYA 89

Query: 100 MFDR--EFGFLNRLLVAPLSSRSSIVISSVIYITSISLLQSFAIMATSALLGYG-WPSLG 156
F R +L L IV+ + + + + L I +A LGY W SL
Sbjct: 90 AFGRMEGQRTWEAMLYTQLRL-GDIVLGEMAWAATKAALAGAGIGVVAAALGYTQWLSL- 147

Query: 157 GFLLIAITLLLLVFAITGLSLGLAFVLPGHIELIAIIFIANLPVLFASTALAPISFMPEW 216
L + L A L + + + P + I + P+LF S A+ P+ +P
Sbjct: 148 --LYALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIV 205

Query: 217 LGWIASINPLTFAIEPIRMAYHSSVDLRAILIDAPYGEVNGYSCLAILLILTVGLFFLIR 276
A PL HS +R I++ P +V + + L I V FFL
Sbjct: 206 FQTAARFLPL----------SHSIDLIRPIMLGHPVVDVCQH--VGALCIYIVIPFFLST 253

Query: 277 PLLNRKL 283
LL R+L
Sbjct: 254 ALLRRRL 260


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_05081  DHBDHDRGNASE  139    1e-42    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 139 bits (352), Expect = 1e-42
Identities = 80/256 (31%), Positives = 133/256 (51%), Gaps = 15/256 (5%)

Query: 4 SKLLEGQTAIVTGASRGIGKAIAIFLAKEGAEVIINYSSSLENANKVVSEINSFGGKAYP 63
+K +EG+ A +TGA++GIG+A+A LA +GA + + E KVVS + + A
Sbjct: 3 AKGIEGKIAFITGAAQGIGEAVARTLASQGAHIA-AVDYNPEKLEKVVSSLKAEARHAEA 61

Query: 64 LQADISNENSVNDLIKTVLEKNNKIDVLVNNAGITKDGLLMRMKTDDWQKVLDLNLSGVF 123
AD+ + +++++ + + ID+LVN AG+ + GL+ + ++W+ +N +GVF
Sbjct: 62 FPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVF 121

Query: 124 YCTRAVSRQMLKQKKGRIINITSVVGLMGNPGQANYSAAKAGVVGLTQSAAKEFASRGIT 183
+R+VS+ M+ ++ G I+ + S + A Y+++KA V T+ E A I
Sbjct: 122 NASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIR 181

Query: 184 VNAVAPGFISTDMTKDL-------------DSESILSAIPLGRFGNPEDVAGAVRFLAAD 230
N V+PG TDM L E+ + IPL + P D+A AV FL +
Sbjct: 182 CNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSG 241

Query: 231 PSASYITGQVIQVDGG 246
A +IT + VDGG
Sbjct: 242 -QAGHITMHNLCVDGG 256


9  NATL1_05951  NATL1_06081  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
NATL1_05951  0       11       3.275856   hypothetical protein
NATL1_05961  -1      11       4.072620   protochlorophyllide oxidoreductase
NATL1_05971  -1      11       3.845572   protochlorophyllide reductase iron-sulfur
NATL1_05981  0       12       4.592072   light-independent protochlorophyllide reductase
NATL1_05991  2       17       5.688431   light-independent protochlorophyllide reductase
NATL1_06001  4       25       6.835695   hypothetical protein
NATL1_06011  4       19       6.731638   hypothetical protein
NATL1_06021  3       18       5.174630   HAM1 family protein
NATL1_06031  4       19       5.492739   carboxysome shell protein CsoS1
NATL1_06041  3       17       5.023802   ribulose bisphosphate carboxylase
NATL1_06051  2       16       4.382930   ribulose bisphosphate carboxylase, small chain
NATL1_06061  2       16       3.633321   carboxysome shell protein CsoS2
NATL1_06071  -2      16       1.922867   carboxysome shell protein CsoS3
NATL1_06081  0       17       3.407251   carboxysome peptide A
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_05961  DHBDHDRGNASE  46     8e-08    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 45.8 bits (108), Expect = 8e-08
Identities = 25/120 (20%), Positives = 46/120 (38%), Gaps = 5/120 (4%)

Query: 11 VLITGTTSGVGLYATKSLVERGWRVITANRCSVRAEASASAVGLPTNSPRQLKHIQIDLG 70
ITG G+G ++L +G + + + E S++ D+
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP---ADVR 67

Query: 71 DLDSVRNGAKSLLEDLEKPLDALVCNAAVYLPRLKKPLRSPQGYEISMATNHFGHFLLIQ 130
D ++ + ++ P+D LV A V P L L S + +E + + N G F +
Sbjct: 68 DSAAIDEITARIEREMG-PIDILVNVAGVLRPGLIHSL-SDEEWEATFSVNSTGVFNASR 125


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_06001  PF05272       26     0.043    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 26.2 bits (57), Expect = 0.043
Identities = 9/47 (19%), Positives = 19/47 (40%)

Query: 73 PLSTTEVSKLLGVRPGSSKVERGGLIAKKLSRNVWRLIKSSQESSHW 119
++ ++ + LG PG S G + L+ N W ++ +
Sbjct: 797 FVTIADLVQALGADPGKSSPMLEGQVRDWLNENGWEYLRETSGQRRR 843


10  NATL1_07441  NATL1_07661  Y  N  N  Genomic Island
LocusTag     DNBias  CDNBias  %GCBias    Product
NATL1_07441  2       20       -1.269900  hypothetical protein
NATL1_07451  3       19       -3.071490  hypothetical protein
NATL1_07461  4       17       -2.857839  hypothetical protein
NATL1_07471  4       18       -3.168882  hypothetical protein
NATL1_07481  3       17       -3.114385  hypothetical protein
NATL1_07491  2       15       -4.154912  hypothetical protein
NATL1_07501  1       13       -3.278585  hypothetical protein
NATL1_07511  0       16       -5.191621  hypothetical protein
NATL1_07521  0       19       -4.595093  hypothetical protein
NATL1_07531  0       19       -4.162826  hypothetical protein
NATL1_07541  1       18       -4.973647  hypothetical protein
NATL1_07551  1       18       -4.537175  hypothetical protein
NATL1_07561  2       18       -4.899043  hypothetical protein
NATL1_07571  0       17       -2.527525  hypothetical protein
NATL1_07581  -1      17       -2.837706  hypothetical protein
NATL1_07591  -1      15       -3.383977  hypothetical protein
NATL1_07601  0       16       -2.303351  hypothetical protein
NATL1_07611  1       15       -2.784390  hypothetical protein
NATL1_07621  1       13       -3.257786  hypothetical protein
NATL1_07631  0       19       -3.719297  hypothetical protein
NATL1_07641  -1      18       -2.721022  hypothetical protein
NATL1_07651  -1      16       -2.583400  hypothetical protein
NATL1_07661  -1      16       -3.102376  permease
11  NATL1_08031  NATL1_08121  Y  N  N  Genomic Island
LocusTag     DNBias  CDNBias  %GCBias    Product
NATL1_08031  -2      13       4.114546   heme transporter
NATL1_08041  -1      13       4.086419   ribulose-phosphate 3-epimerase
NATL1_08051  -2      13       3.574852   fructose 1,6-bisphosphatase II
NATL1_08061  -2      13       2.819171   glutamyl-tRNA reductase
NATL1_08071  -2      11       3.491052   glucose-1-phosphate adenylyltransferase
NATL1_08081  -2      12       4.006629   6-phosphogluconate dehydrogenase
NATL1_08091  -3      14       2.995136   6-phosphogluconolactonase (DevB, Pgl)
NATL1_08101  -3      14       3.120585   hypothetical protein
NATL1_08111  -2      14       2.818758   coat protein
NATL1_08121  -2      14       3.096547   dihydroxy-acid dehydratase
12  NATL1_08481  NATL1_08821  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
NATL1_08481  2       14       -4.739141  hypothetical protein
NATL1_08491  3       15       -4.084534  ABC transporter ATPase
NATL1_08501  1       12       -2.394257  Rieske iron-sulfur protein 2Fe-2S subunit
NATL1_08511  -1      12       0.486961   hypothetical protein
NATL1_08521  -1      10       0.089316   hypothetical protein
NATL1_08531  -3      10       -0.872819  UDP-glucose 4-epimerase
NATL1_08541  -2      13       -2.276265  hypothetical protein
NATL1_08551  -2      13       -2.851643  dTDP-4-dehydrorhamnose 3,5-epimerase
NATL1_08561  0       15       -3.713627  dTDP-4-dehydrorhamnose reductase
NATL1_08571  2       17       -5.200044  dTDP-D-glucose 4,6-dehydratase
NATL1_08581  3       19       -6.264840  UDP-N-acetylglucosamine 2-epimerase
NATL1_08591  3       19       -6.567951  nucleotide-diphosphate-sugar epimerase, membrane
NATL1_08601  3       19       -7.083385  UDP-N-acetylmuramyl pentapeptide
NATL1_08611  4       20       -7.487306  hypothetical protein
NATL1_08621  3       18       -7.608992  glycosyltransferase-like protein
NATL1_08631  3       18       -8.298131  hypothetical protein
NATL1_08641  3       18       -8.870924  UDP-glucose 6-dehydrogenase
NATL1_08651  2       20       -9.180006  asparagine synthase
NATL1_08661  3       19       -8.456328  hypothetical protein
NATL1_08671  2       19       -7.797004  hypothetical protein
NATL1_08681  1       19       -7.060124  hypothetical protein
NATL1_08691  1       16       -6.086882  hypothetical protein
NATL1_08701  0       17       -4.553437  hypothetical protein
NATL1_08711  -1      16       -3.989796  hypothetical protein
NATL1_08721  0       13       -4.614313  sialic acid synthase
NATL1_08731  -1      11       -4.505354  UDP-N-acetylglucosamine 2-epimerase
NATL1_08741  0       14       -5.855961  hypothetical protein
NATL1_08751  1       16       -6.197989  imidazole glycerol-phosphate synthase
NATL1_08761  2       14       -6.404483  glutamine amidotransferase
NATL1_08771  3       14       -6.392941  hypothetical protein
NATL1_08781  6       19       -7.072644  hypothetical protein
NATL1_08791  5       21       -7.693901  hypothetical protein
NATL1_08801  4       20       -5.185998  hypothetical protein
NATL1_08811  3       20       -3.551445  hypothetical protein
NATL1_08821  2       18       -3.662233  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_08531  NUCEPIMERASE  178    2e-55    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 178 bits (452), Expect = 2e-55
Identities = 80/346 (23%), Positives = 151/346 (43%), Gaps = 44/346 (12%)

Query: 1 MRVLLTGGAGFIGSHIALLLLERGYDVLILDSFANSSSNVIERIENFLDNKALKYK-LRV 59
M+ L+TG AGFIG H++ LLE G+ V+ +D+ + +++ + L +
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQ----ARLELLAQPGFQF 56

Query: 60 INGDIRDKQILESIFSKCVKENKPIEVVIHLAGVKSVCESLTNPLYYWDVNVSGTLNLLL 119
D+ D++ + +F + E V +V SL NP Y D N++G LN+L
Sbjct: 57 HKIDLADREGMTDLF-----ASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILE 111

Query: 120 TMKDYQCYSLVFSSSATIYGLSDYVPILEEQKIS-PITPYGQTKVAVENLFYDLYKSNVN 178
+ + L+++SS+++YGL+ +P + + P++ Y TK A E + + Y
Sbjct: 112 GCRHNKIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAH-TYSHLYG 170

Query: 179 LWKICSLRYFNPVGAHPSGLIGEDPRGIPNNLFPFITQVAIGRQKILNIYGDDWETKDGS 238
L LR+F G P G P+ T+ A+ K +++Y G
Sbjct: 171 L-PATGLRFFTVYG----------PWGRPDMALFKFTK-AMLEGKSIDVYN------YGK 212

Query: 239 GIRDYVHIIDLAEGHLASIDYLNTSESC--------------LEFINLGSGKGYSVFQII 284
RD+ +I D+AE + D + +++ N+G+ + I
Sbjct: 213 MKRDFTYIDDIAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYI 272

Query: 285 RQFELSTGCSIPFSIESRRDGDVAVSYADISKAKRLLSWTPKRSLE 330
+ E + G ++ + GDV + AD ++ +TP+ +++
Sbjct: 273 QALEDALGIEAKKNMLPLQPGDVLETSADTKALYEVIGFTPETTVK 318


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_08561  NUCEPIMERASE  44     3e-07    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 44.4 bits (105), Expect = 3e-07
Identities = 37/159 (23%), Positives = 55/159 (34%), Gaps = 27/159 (16%)

Query: 1 MKVLLTGASGQLGQAIIKS----------------------KPSFVELIAT---TRRELD 35
MK L+TGA+G +G + K K + +EL+A ++D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 36 LADDEACRRAVRQHQPDWVINSGAYTAVDKAEDEKELAMSINTIAPKMFAEELSQTG-GK 94
LAD E + V S AV + + N E
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 95 LLQLSTDFVFDGEQNFPYKTGQK-KKPLGVYGATKAAGE 132
LL S+ V+ + P+ T P+ +Y ATK A E
Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANE 159


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_08571  NUCEPIMERASE  164    3e-50    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 164 bits (418), Expect = 3e-50
Identities = 82/353 (23%), Positives = 147/353 (41%), Gaps = 51/353 (14%)

Query: 16 RILVTGGAGFIGGAVIRKLLKESTSKIFNIDKIGYASDLT---AIDEILRTKDYSDRYDF 72
+ LVTG AGFIG V ++LL+ ++ ID + D++ A E+L + F
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEA-GHQVVGIDNLNDYYDVSLKQARLELLAQPGFQ----F 56

Query: 73 AKIDLSIPDETAKAISDSDPDLIMHLAAESHVDRSIQGPEAFINSNIFGTFNLLEATRKH 132
KIDL+ + + + + V S++ P A+ +SN+ G N+LE R +
Sbjct: 57 HKIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHN 116

Query: 133 YENLSNKRKNDFRFLHISTDEVFGSLGLNGK--FTESTSYD-PRSPYSASKASSDHLVRS 189
L+ S+ V+ GLN K F+ S D P S Y+A+K +++ + +
Sbjct: 117 ---------KIQHLLYASSSSVY---GLNRKMPFSTDDSVDHPVSLYAATKKANELMAHT 164

Query: 190 WHHTFQLPIVITNCSNNFGPWQFPEKLIPVAINKALALKSIPLYGDGENIRDWLYVDDHV 249
+ H + LP +GPW P+ + L KSI +Y G+ RD+ Y+DD
Sbjct: 165 YSHLYGLPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIA 224

Query: 250 DALFLAANKGKIGDS------------------YCVGGYGERKNIEILKIICKILD-EIY 290
+A+ + D+ Y +G + ++ ++ + L E
Sbjct: 225 EAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAK 284

Query: 291 PKHSPFERLITKVQDRKGHDRRYAIDPSKIRNELGWEPKYSLEDRLETTVQWY 343
P + G + D + +G+ P+ +++D ++ V WY
Sbjct: 285 KNMLPL---------QPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWY 328


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_08591  NUCEPIMERASE  58     4e-11    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 57.9 bits (140), Expect = 4e-11
Identities = 50/321 (15%), Positives = 102/321 (31%), Gaps = 75/321 (23%)

Query: 285 TVCITGAGGSIGSELSKQ----------IYNLNPYKMILIDHSESHLYNINKQITSYPDN 334
+TGA G IG +SK+ I NLN Y + + + ++ + P
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQA-------RLELLAQPG- 53

Query: 335 GIEVKAILGSTTDLPFINKVFTDNNVDIIFHAAAYKHVPLVESNPLKGLFNNVFSTEIVC 394
+ D + +F + + +F + V NP +N+ +
Sbjct: 54 ---FQFHKIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNIL 110

Query: 395 KAALEAGANNLVLIST---------------DKAVRPTNVMGASKRLSELVVQAIAEKSK 439
+ +L+ S+ D P ++ A+K+ +EL+ +
Sbjct: 111 EGCRHNKIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYG 170

Query: 440 ENSIAKKTCFSMVRFGNVLGSSGS---VLPLFQEQIDNGGPITL-THPRIIRYFMTISEA 495
+ +RF V G G L F + + G I + + ++ R F I +
Sbjct: 171 LPATG-------LRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDI 223

Query: 496 SQLVIQ------------------SKVLAEGGDVFHLDMGKPVSIKSLAEQLILLNGLSI 537
++ +I+ V+++ PV + + L
Sbjct: 224 AEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQAL-------- 275

Query: 538 KDNKNLEGDIEIKFTGLRPGE 558
L + + L+PG+
Sbjct: 276 --EDALGIEAKKNMLPLQPGD 294


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_08611  NUCEPIMERASE  50     4e-09    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 49.8 bits (119), Expect = 4e-09
Identities = 58/352 (16%), Positives = 114/352 (32%), Gaps = 94/352 (26%)

Query: 10 VIVSGANGFTGKFVCKELIKNKINFIAL----------LRPGSI-----PDW-FNKNKIE 53
+V+GA GF G V K L++ + + L+ + P + F+K +
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDLA 62

Query: 54 FR------FADLNSYDELHS--------QLNGCRALINVASIGFGSAKNIIKSCYKSNIE 99
R FA + S L A + GF NI++ C + I+
Sbjct: 63 DREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGF---LNILEGCRHNKIQ 119

Query: 100 RVIFISTTAI--------FTRLNASSKTIRLEAENDIINSK----------LKWTIIRPT 141
+++ S++++ F+ ++ + L A N L T +R
Sbjct: 120 HLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFF 179

Query: 142 MIYGSPKDRNM--IKLIKWIDNMPIIPIFGNGKSLQQPVNVKDVAWSLVKIIDKKSTY-- 197
+YG +M K K + I ++ GK + + D+A +++++ D
Sbjct: 180 TVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHADT 239

Query: 198 ---------------YRSFNISGKEPLTFTQIVDIIEKMLNKSIIKIYLSKNITLLFIGL 242
YR +NI P+ + +E L K L
Sbjct: 240 QWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNML----------- 288

Query: 243 LERLRIKFPIKSEQVHRLNENKDFIHEKAKRAFNYNP-LSFEEGINIEIESY 293
P++ V + + + P + ++G+ + Y
Sbjct: 289 --------PLQPGDVLETSADTK----ALYEVIGFTPETTVKDGVKNFVNWY 328


13  NATL1_09021  NATL1_09191  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
NATL1_09021  3       13       -2.628102  multidrug efflux ABC transporter
NATL1_09031  4       13       -2.857132  membrane protein, multidrug efflux associated
NATL1_09041  4       14       -3.268328  hypothetical protein
NATL1_09051  4       18       -5.509914  josephin
NATL1_09061  3       18       -4.167021  purine phosphoribosyltransferase-related
NATL1_09071  4       16       -3.737190  hypothetical protein
NATL1_09081  1       13       -1.728126  hypothetical protein
NATL1_09091  -1      12       -1.940904  hypothetical protein
NATL1_09101  -1      13       -1.819713  hypothetical protein
NATL1_09111  1       13       -2.224696  hypothetical protein
NATL1_09121  1       13       -2.332771  hypothetical protein
NATL1_09131  1       13       -2.166434  hypothetical protein
NATL1_09141  3       15       -2.933097  glycoside hydrolase family protein
NATL1_09151  4       16       -4.104947  mannosyl-3-phosphoglycerate phosphatase
NATL1_09161  4       15       -4.551335  kinase
NATL1_09171  0       17       -4.958298  hypothetical protein
NATL1_09181  2       16       -3.838572  hypothetical protein
NATL1_09191  4       15       -4.007903  GRAM domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_09021  ABC2TRNSPORT  29     0.016    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 29.1 bits (65), Expect = 0.016
Identities = 36/142 (25%), Positives = 53/142 (37%), Gaps = 7/142 (4%)

Query: 107 YVASHLAEQATRLPFA-IAIAITFFILNPSSFWLPSLPQLFLAYLSTHFAFTIAFLLQSL 165
V +A AT+ A I + L + + SL T AF L +
Sbjct: 113 IVLGEMAWAATKAALAGAGIGVVAAALGYTQWL--SLLYALPVIALTGLAFAS---LGMV 167

Query: 166 VAALCFWTEKSSALERLIFVPYLFLSGLLVPLSAFPSNVLKVAMLTPFPYLINFPAKILS 225
V AL + + L+ P LFLSG + P+ P A P + I+ I+
Sbjct: 168 VTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQTAARFLPLSHSIDLIRPIML 227

Query: 226 GMPV-DIFNGFLAQILWISILV 246
G PV D+ A ++I I
Sbjct: 228 GHPVVDVCQHVGALCIYIVIPF 249


14  NATL1_09511  NATL1_09581  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
NATL1_09511  1       12       -4.118001  pyruvate dehydrogenase E1 beta subunit
NATL1_09521  3       14       -5.211283  preprotein translocase subunit SecD
NATL1_09531  2       19       -6.816414  preprotein translocase subunit SecF
NATL1_09541  1       18       -7.449575  hypothetical protein
NATL1_09551  2       18       -6.805524  hypothetical protein
NATL1_09561  2       16       -6.397776  hypothetical protein
NATL1_09571  0       20       -2.683893  hypothetical protein
NATL1_09581  -1      23       -3.596810  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_09521  SECFTRNLCASE  72     7e-16    Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 71.8 bits (176), Expect = 7e-16
Identities = 39/243 (16%), Positives = 98/243 (40%), Gaps = 5/243 (2%)

Query: 233 DLTKSIAGTSRLLGIVIDGISYSEASVGKQFETAGITGGAATISGNFTADDARNLEVQLR 292
+ ++ L ++I + + I + L ++
Sbjct: 66 GVYRAALEPLELGDVIISEVRDPSFREDQHVAMIRIQMQEDGQGAEGQGAQGQELVNKVE 125

Query: 293 GGALPL--PIEIIQIRTIGPSLGTQNIRLSLFAALTGLFFVGIFMIFIYKLA-GFVAVLA 349
+ ++I ++GP + + + ++++ L + ++ ++ AV+A
Sbjct: 126 TALTAVDPALKITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVVA 185

Query: 350 LSCYSLFTLSIYALLPVTLTLPGIAGFILSIGMAVDANVLIFERLKEEL--LNGNTLIRS 407
L L T+ ++A+L + L +A + G +++ V++F+RL+E L L
Sbjct: 186 LVHDVLLTVGLFAVLQLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRDV 245

Query: 408 IDASFKNAFSSIIDGHLTTLISCITLFYLGTGFVKGFAATLGIGVMLSLFTALTCTKAIL 467
++ S S + +TTL++ + + G ++GF + GV ++++ K I+
Sbjct: 246 MNLSVNETLSRTVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAKNIV 305

Query: 468 KFL 470
F+
Sbjct: 306 LFI 308



Score = 33.3 bits (76), Expect = 0.002
Identities = 22/136 (16%), Positives = 41/136 (30%), Gaps = 24/136 (17%)

Query: 7 WFLFVILFAIFSIFICTNIPFQLGLDLRGGSQLTLEVQPTKEITKITPNEIEGVKAVLDK 66
F I+ I S+ + I G+D +GG+ + E ++
Sbjct: 23 TFGAAIVMMIASVILPLVIGLNFGIDFKGGTTIRTESTTAIDVGVYR------------A 70

Query: 67 RVNGLGVSDSQLQTVGTNQLLLELPGEQDPSGAAKVLGETALLEFRIQKNGTAAQYRDLQ 126
+ L + D + V DPS ++ + G Q Q
Sbjct: 71 ALEPLELGDVIISEVR------------DPSFREDQHVAMIRIQMQEDGQGAEGQGAQGQ 118

Query: 127 TQRNSVESIINLLEEN 142
N VE+ + ++
Sbjct: 119 ELVNKVETALTAVDPA 134


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_09531  SECFTRNLCASE  206    9e-67    Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 206 bits (526), Expect = 9e-67
Identities = 90/312 (28%), Positives = 160/312 (51%), Gaps = 20/312 (6%)

Query: 4 KFNVQIYKNRKYVWLVSFSLCLISIIGMLICLKSSSIKAPLNLGLDYTGGTQITLERSCT 63
K N ++ + + + + + S+I L+ LN G+D+ GGT I E +
Sbjct: 11 KTNFDFFRWQWATFGAAIVMMIASVILPLV--------IGLNFGIDFKGGTTIRTESTTA 62

Query: 64 -DECISLNTSEISN--NIIALKKQDKSFSSNTSPNLSRSQIQLLDNSQLISIRLPFLSAD 120
D + E ++I + +D SF + + R Q+Q + +
Sbjct: 63 IDVGVYRAALEPLELGDVIISEVRDPSFREDQHVAMIRIQMQEDGQG---AEGQGAQGQE 119

Query: 121 QSDSVVAEVNKSFGPFNNENTSVEIIGPSLGRQLLKSSLISLFFAFLGIALYINFRYDRR 180
+ V + + TS E +GP + +L+ +++ SL A + I YI R++ +
Sbjct: 120 LVNKVETALTAVDPAL--KITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQ 177

Query: 181 YSFLALFALLHDILIVCGVFAWLGYFFNVEVDSLFAVSLLTIAGYSVNNTVVVFDRIREK 240
++ A+ AL+HD+L+ G+FA L ++ D +LLTI GYS+N+TVVVFDR+RE
Sbjct: 178 FALGAVVALVHDVLLTVGLFAVLQ----LKFDLTTVAALLTITGYSINDTVVVFDRLREN 233

Query: 241 SLLENQLSYKYQIDKAVGATLTRTIYTSLTTLLPLICILIWGGSTLYWFAFALLIGVIIG 300
+ + + ++ +V TL+RT+ T +TTLL L+ +LIWGG + F FA++ GV G
Sbjct: 234 LIKYKTMPLRDVMNLSVNETLSRTVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTG 293

Query: 301 SWSSIALAPSLL 312
++SS+ +A +++
Sbjct: 294 TYSSVYVAKNIV 305


15  NATL1_09701  NATL1_09761  Y  N  N  Genomic Island
LocusTag     DNBias  CDNBias  %GCBias    Product
NATL1_09701  2       13       2.110069   transcription regulator
NATL1_09711  3       13       0.839080   threonine dehydratase
NATL1_09721  4       11       -0.023457  1-deoxy-D-xylulose-5-phosphate synthase
NATL1_09731  6       21       -3.207683  photosystem I PsaK protein (subunit X)
NATL1_09741  -1      17       -3.910562  hypothetical protein
NATL1_09751  -1      17       -3.630253  hypothetical protein
NATL1_09761  -2      16       -3.106535  alkyl hydroperoxide reductase
16  NATL1_10651  NATL1_11001  Y  N  N  Genomic Island
LocusTag     DNBias  CDNBias  %GCBias    Product
NATL1_10651  2       23       -4.806407  S4 domain-containing protein
NATL1_10661  3       22       -4.517323  ABC transporter
NATL1_10671  4       23       -4.646598  hypothetical protein
NATL1_10681  4       23       -4.288605  hypothetical protein
NATL1_10691  5       21       -4.412119  hypothetical protein
NATL1_10701  1       18       -3.131607  uridine kinase
NATL1_10711  2       19       -0.413291  hypothetical protein
NATL1_10721  1       20       -1.700783  hypothetical protein
NATL1_10731  0       19       -1.953146  hypothetical protein
NATL1_10741  0       19       -2.502225  hypothetical protein
NATL1_10751  0       22       -2.102033  hypothetical protein
NATL1_10761  3       29       -3.582753  hypothetical protein
NATL1_10771  3       16       -0.857528  hypothetical protein
NATL1_10781  3       18       -0.265542  hypothetical protein
NATL1_10791  4       15       -0.944536  hypothetical protein
NATL1_10801  3       13       -0.927873  hypothetical protein
NATL1_10811  3       13       -1.707050  hypothetical protein
NATL1_10821  3       13       -1.547560  Fatty acid desaturase
NATL1_10831  1       17       -3.633121  hypothetical protein
NATL1_10841  0       16       -3.248285  hypothetical protein
NATL1_10851  -1      17       -3.438989  hypothetical protein
NATL1_10861  1       18       -3.656785  hypothetical protein
NATL1_10871  5       24       -3.109229  hypothetical protein
NATL1_10881  6       24       -2.948847  hypothetical protein
NATL1_10891  0       19       -0.938332  hypothetical protein
NATL1_10901  1       18       -1.335575  hypothetical protein
NATL1_10911  -2      16       -0.689220  hypothetical protein
NATL1_10921  -1      19       -2.022150  hypothetical protein
NATL1_10931  -2      20       -2.402387  hypothetical protein
NATL1_10941  -1      20       -2.405358  hypothetical protein
NATL1_10951  0       15       -6.088782  hypothetical protein
NATL1_10961  1       14       -5.836176  hypothetical protein
NATL1_10971  2       13       -6.064512  hypothetical protein
NATL1_10981  2       10       -5.041311  hypothetical protein
NATL1_10991  2       10       -5.180779  hypothetical protein
NATL1_11001  1       12       -5.208227  hypothetical protein
17  NATL1_11121  NATL1_11251  Y  N  N  Genomic Island
LocusTag     DNBias  CDNBias  %GCBias    Product
NATL1_11121  3       38       3.154271   hypothetical protein
NATL1_11131  7       48       4.708290   hypothetical protein
NATL1_11141  7       42       5.016198   hypothetical protein
NATL1_11151  2       26       2.440349   hypothetical protein
NATL1_11161  0       22       1.081100   hypothetical protein
NATL1_11171  1       14       -1.926776  hypothetical protein
NATL1_11181  2       14       -2.358095  high light inducible protein
NATL1_11191  4       15       -3.299820  hypothetical protein
NATL1_11201  4       16       -3.561023  hypothetical protein
NATL1_11211  2       18       -3.599039  hypothetical protein
NATL1_11221  2       22       -3.683918  NAD-dependent DNA ligase N-terminus
NATL1_11231  1       25       -1.296934  hypothetical protein
NATL1_11241  0       22       -2.013698  hypothetical protein
NATL1_11251  4       22       -1.373168  hypothetical protein
18  NATL1_11591  NATL1_12051  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
NATL1_11591  1       17       -3.007663  arsenite efflux pump ACR3 and related permeases
NATL1_11601  2       19       -1.828282  hypothetical protein
NATL1_11611  2       20       -3.754017  hypothetical protein
NATL1_11621  0       19       -4.464217  hypothetical protein
NATL1_11631  0       21       -5.199562  hypothetical protein
NATL1_11641  0       21       -5.751665  hypothetical protein
NATL1_11651  -1      23       -4.209744  hypothetical protein
NATL1_11661  -3      21       -2.822530  hypothetical protein
NATL1_11671  -1      20       -2.271640  hypothetical protein
NATL1_11681  -2      20       -1.240906  hypothetical protein
NATL1_11691  -2      21       0.425146   hypothetical protein
NATL1_11701  2       23       1.061210   hypothetical protein
NATL1_11711  2       26       1.067922   hypothetical protein
NATL1_11721  3       27       1.159776   hypothetical protein
NATL1_11731  3       25       1.080674   hypothetical protein
NATL1_11741  2       22       2.097685   hypothetical protein
NATL1_11751  3       23       2.246924   hypothetical protein
NATL1_11761  2       21       2.714228   hypothetical protein
NATL1_11771  3       19       2.035845   Rossmann fold nucleotide-binding protein
NATL1_11781  1       18       2.692286   hypothetical protein
NATL1_11791  2       20       2.881999   hypothetical protein
NATL1_11801  4       21       0.832738   hypothetical protein
NATL1_11811  2       18       -0.368001  hypothetical protein
NATL1_11821  -1      18       -0.298019  hypothetical protein
NATL1_11831  0       23       0.054931   hypothetical protein
NATL1_11841  6       24       1.924470   hypothetical protein
NATL1_11851  5       28       3.353263   hypothetical protein
NATL1_11861  1       23       2.520563   hypothetical protein
NATL1_11871  0       20       2.560933   hypothetical protein
NATL1_11881  2       19       3.098292   hypothetical protein
NATL1_11891  1       19       3.082131   hypothetical protein
NATL1_11901  2       18       2.109288   hypothetical protein
NATL1_11911  0       20       1.826939   hypothetical protein
NATL1_11921  1       23       1.892248   hypothetical protein
NATL1_11931  0       21       1.801243   hypothetical protein
NATL1_11941  0       22       1.327859   hypothetical protein
NATL1_11951  2       19       0.828156   hypothetical protein
NATL1_11961  1       19       0.703882   hypothetical protein
NATL1_11971  -1      20       -0.560465  hypothetical protein
NATL1_11981  -2      17       -0.484461  hypothetical protein
NATL1_11991  2       22       -2.451335  hypothetical protein
NATL1_12001  2       24       -2.451335  hypothetical protein
NATL1_12011  0       23       -2.066423  hypothetical protein
NATL1_12021  -2      21       -2.111776  hypothetical protein
NATL1_12031  -2      20       -1.027284  hypothetical protein
NATL1_12041  4       21       0.162145   hypothetical protein
NATL1_12051  2       19       1.065149   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_11701  ANTHRAXTOXNA  27     0.004    Anthrax toxin LF subunit signature.
		>ANTHRAXTOXNA#Anthrax toxin LF subunit signature.

Length = 800

Score = 27.4 bits (60), Expect = 0.004
Identities = 18/49 (36%), Positives = 24/49 (48%), Gaps = 3/49 (6%)

Query: 12 AVKREGYRHFEVKSYGGKKDERWVELFPVNNNEILIRVPWSELKTYSKW 60
AVK GY +V ++G ++D E FP +NEI I P E W
Sbjct: 563 AVKYTGYTGGDVVNHGTEQDN---EEFPEKDNEIFIINPEGEFILTKNW 608


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_11961  PERTACTIN     32     2e-04    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 32.4 bits (73), Expect = 2e-04
Identities = 12/42 (28%), Positives = 21/42 (50%), Gaps = 1/42 (2%)

Query: 59 HPEGNIEKFVEANGDWMASHR-VDLSTMEKSAWTWTDNSNVQ 99
G ++ + + W + R VD +++ + W TDNSNV
Sbjct: 404 ASSGPLDVALASQARWTGATRAVDSLSIDNATWVMTDNSNVG 445


19  NATL1_12331  NATL1_13641  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
NATL1_12331  1       32       3.485900   hypothetical protein
NATL1_12341  2       33       2.411887   hypothetical protein
NATL1_12351  -1      19       2.347628   hypothetical protein
NATL1_12361  0       17       1.111467   hypothetical protein
NATL1_12371  -1      16       0.121975   hypothetical protein
NATL1_12381  0       16       -0.726237  hypothetical protein
NATL1_12391  0       16       -2.570170  hypothetical protein
NATL1_12401  -2      17       -1.158928  hypothetical protein
NATL1_12411  2       20       -3.160555  hypothetical protein
NATL1_12421  1       21       -4.282836  hypothetical protein
NATL1_12431  1       22       -3.061835  hypothetical protein
NATL1_12441  -1      21       -2.701491  hypothetical protein
NATL1_12451  -2      20       -1.569500  hypothetical protein
NATL1_12461  -1      21       -2.451335  hypothetical protein
NATL1_12471  2       22       -1.959330  hypothetical protein
NATL1_12481  4       22       1.317346   hypothetical protein
NATL1_12491  5       24       0.829275   hypothetical protein
NATL1_12501  6       24       1.225136   hypothetical protein
NATL1_12511  5       26       1.309022   hypothetical protein
NATL1_12521  5       25       0.999186   hypothetical protein
NATL1_12531  4       22       0.391594   hypothetical protein
NATL1_12541  1       27       0.536087   hypothetical protein
NATL1_12551  2       23       0.141258   hypothetical protein
NATL1_12561  5       23       -1.081472  hypothetical protein
NATL1_12571  3       23       -0.329319  hypothetical protein
NATL1_12581  2       18       -0.702363  hypothetical protein
NATL1_12591  1       16       0.153663   hypothetical protein
NATL1_12601  -1      15       -1.635840  hypothetical protein
NATL1_12611  -1      13       -1.586195  hypothetical protein
NATL1_12621  0       15       -0.386013  hypothetical protein
NATL1_12631  0       15       -0.848317  hydrolases or acyltransferases
NATL1_12641  2       19       -0.193998  hypothetical protein
NATL1_12651  3       19       0.578968   hypothetical protein
NATL1_12661  4       23       1.308064   hypothetical protein
NATL1_12671  5       24       0.945190   hypothetical protein
NATL1_12681  9       29       -0.801170  hypothetical protein
NATL1_12691  4       21       -0.447996  hypothetical protein
NATL1_12701  4       20       -0.683264  hypothetical protein
NATL1_12711  3       21       -1.183506  hypothetical protein
NATL1_12721  2       21       -1.343740  hypothetical protein
NATL1_12731  2       22       -1.048961  hypothetical protein
NATL1_12741  0       15       -1.514253  imidazoleglycerol-phosphate dehydratase
NATL1_12751  1       20       -2.389029  hypothetical protein
NATL1_12761  0       20       -1.310194  hypothetical protein
NATL1_12771  -1      21       -1.974292  hypothetical protein
NATL1_12781  -2      22       -1.870615  hypothetical protein
NATL1_12791  0       23       2.596372   hypothetical protein
NATL1_12801  1       24       3.445825   hypothetical protein
NATL1_12811  2       25       3.473783   *hypothetical protein
NATL1_12821  3       26       2.948821   hypothetical protein
NATL1_12831  3       26       3.377237   hypothetical protein
NATL1_12841  3       25       3.519405   hypothetical protein
NATL1_12851  1       20       -2.260313  hypothetical protein
NATL1_12861  1       23       -1.270590  hypothetical protein
NATL1_12871  5       23       -1.157575  hypothetical protein
NATL1_12881  4       24       -0.641380  hypothetical protein
NATL1_12891  4       18       -1.541653  hypothetical protein
NATL1_12901  4       22       -0.987595  hypothetical protein
NATL1_12911  6       22       -1.422528  hypothetical protein
NATL1_12921  6       21       -1.958030  hypothetical protein
NATL1_12931  4       18       -1.883614  hypothetical protein
NATL1_12941  3       22       -0.503283  hypothetical protein
NATL1_12951  2       25       0.269754   hypothetical protein
NATL1_12961  3       21       1.518889   hypothetical protein
NATL1_12971  3       22       0.848995   hypothetical protein
NATL1_12981  0       17       2.416195   hypothetical protein
NATL1_12991  0       18       2.373227   hypothetical protein
NATL1_13001  1       22       1.191277   hypothetical protein
NATL1_13011  3       21       0.284034   hypothetical protein
NATL1_13021  1       20       -0.583519  hypothetical protein
NATL1_13031  2       16       -0.566994  hypothetical protein
NATL1_13041  7       20       -3.625680  hypothetical protein
NATL1_13051  7       21       -3.476976  hypothetical protein
NATL1_13061  7       23       -2.718001  hypothetical protein
NATL1_13071  8       25       -2.821705  hypothetical protein
NATL1_13081  6       22       -3.650375  hypothetical protein
NATL1_13091  3       21       -2.966799  hypothetical protein
NATL1_13101  2       23       -2.980435  hypothetical protein
NATL1_13111  1       25       -2.451335  hypothetical protein
NATL1_13121  0       25       -1.971718  hypothetical protein
NATL1_13131  -2      30       0.054931   hypothetical protein
NATL1_13141  0       37       2.985385   hypothetical protein
NATL1_13151  2       40       3.536689   hypothetical protein
NATL1_13161  8       38       5.162407   hypothetical protein
NATL1_13171  5       31       3.809688   hypothetical protein
NATL1_13181  4       28       1.662141   hypothetical protein
NATL1_13191  2       20       0.021965   hypothetical protein
NATL1_13201  3       38       3.708907   hypothetical protein
NATL1_13211  1       31       2.229247   hypothetical protein
NATL1_13221  1       25       1.328452   hypothetical protein
NATL1_13231  0       23       0.941879   peptidyl-tRNA hydrolase domain protein
NATL1_13241  1       24       0.843976   hypothetical protein
NATL1_13251  2       27       1.211976   hypothetical protein
NATL1_13261  2       17       -2.632889  hypothetical protein
NATL1_13271  2       17       -2.003087  endonuclease VIII
NATL1_13281  2       24       -1.663046  hypothetical protein
NATL1_13291  3       29       -1.059393  hypothetical protein
NATL1_13301  3       29       0.455642   hypothetical protein
NATL1_13311  2       26       0.155432   hypothetical protein
NATL1_13321  2       22       -0.217658  hypothetical protein
NATL1_13331  5       19       0.022624   hypothetical protein
NATL1_13341  3       21       -0.781328  hypothetical protein
NATL1_13351  0       17       -1.006890  hypothetical protein
NATL1_13361  1       18       -1.475045  hypothetical protein
NATL1_13371  0       24       -1.079260  hypothetical protein
NATL1_13381  1       24       0.454234   hypothetical protein
NATL1_13391  3       25       -0.144787  hypothetical protein
NATL1_13401  2       20       0.489842   hypothetical protein
NATL1_13411  3       20       1.166236   hypothetical protein
NATL1_13421  2       20       -0.064293  hypothetical protein
NATL1_13431  1       20       -0.788824  hypothetical protein
NATL1_13441  0       21       -2.603890  hypothetical protein
NATL1_13451  0       22       -1.498954  hypothetical protein
NATL1_13461  -1      22       -1.707589  hypothetical protein
NATL1_13471  -1      22       -2.026706  hypothetical protein
NATL1_13481  -1      21       -1.347582  hypothetical protein
NATL1_13491  1       21       0.173964   hypothetical protein
NATL1_13501  3       22       2.030458   hypothetical protein
NATL1_13511  3       25       2.219578   hypothetical protein
NATL1_13521  7       42       4.487441   hypothetical protein
NATL1_13531  6       47       6.468853   hypothetical protein
NATL1_13541  9       48       6.848182   hypothetical protein
NATL1_13551  6       40       3.996753   hypothetical protein
NATL1_13561  4       32       1.999423   hypothetical protein
NATL1_13571  3       30       -0.325731  hypothetical protein
NATL1_13581  3       21       -1.227229  hypothetical protein
NATL1_13591  5       19       -3.315888  hypothetical protein
NATL1_13601  6       18       -2.523329  hypothetical protein
NATL1_13611  4       22       -3.371363  hypothetical protein
NATL1_13621  5       20       -2.803695  hypothetical protein
NATL1_13631  3       21       -1.901506  hypothetical protein
NATL1_13641  3       18       -1.097369  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_12941  MALTOSEBP     27     0.045    Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 26.6 bits (58), Expect = 0.045
Identities = 18/54 (33%), Positives = 28/54 (51%), Gaps = 2/54 (3%)

Query: 86 ELLIDQEDNSLLDEPSILEDNGDNPLGIVIEPSLPEVIEENLPEISEAPLDNDQ 139
EL + +N LL + + N D PLG V S E + ++ P I+ A ++N Q
Sbjct: 300 ELAKEFLENYLLTDEGLEAVNKDKPLGAVALKSYEEELAKD-PRIA-ATMENAQ 351


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_13411  SHIGARICIN    24     0.032    Ribosome inactivating protein family signature.
		>SHIGARICIN#Ribosome inactivating protein family signature.

Length = 289

Score = 24.0 bits (52), Expect = 0.032
Identities = 11/57 (19%), Positives = 21/57 (36%), Gaps = 7/57 (12%)

Query: 2 SEPKFLKVKL----GDTVLVGEDEIAKVLSFVVGARDPAASKLFKSQTLIQAKSNLF 54
++ + L +T+ V I +V+G R S F + +A +F
Sbjct: 66 GSQRYALIHLTNYADETISVA---IDVTNVYVMGYRAGDTSYFFNEASATEAAKYVF 119


20  NATL1_13741  NATL1_14181  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
NATL1_13741  3       28       0.495258   hypothetical protein
NATL1_13751  3       29       1.154903   hypothetical protein
NATL1_13761  8       31       1.417713   hypothetical protein
NATL1_13771  9       30       1.865934   hypothetical protein
NATL1_13781  8       35       1.424634   hypothetical protein
NATL1_13791  7       31       1.287982   hypothetical protein
NATL1_13801  5       30       1.527160   hypothetical protein
NATL1_13811  5       33       1.512080   hypothetical protein
NATL1_13821  4       33       1.435935   hypothetical protein
NATL1_13831  3       31       1.085359   hypothetical protein
NATL1_13841  3       29       1.912042   hypothetical protein
NATL1_13851  2       27       1.631813   hypothetical protein
NATL1_13861  2       23       1.781470   hypothetical protein
NATL1_13871  2       21       1.329530   hypothetical protein
NATL1_13881  2       18       0.390447   hypothetical protein
NATL1_13891  4       18       0.788222   hypothetical protein
NATL1_13901  6       26       -0.909909  hypothetical protein
NATL1_13911  4       28       0.112768   hypothetical protein
NATL1_13921  0       25       -0.502017  hypothetical protein
NATL1_13931  0       25       -0.222922  hypothetical protein
NATL1_13941  0       20       2.446326   hypothetical protein
NATL1_13951  1       21       2.518844   ATP-dependent Clp protease adaptor
NATL1_13961  1       19       2.787696   hypothetical protein
NATL1_13971  2       14       2.075414   hypothetical protein
NATL1_13981  3       14       2.028952   hypothetical protein
NATL1_13991  3       16       2.705898   hypothetical protein
NATL1_14001  3       18       1.591381   hypothetical protein
NATL1_14011  2       18       1.966336   hypothetical protein
NATL1_14021  2       23       1.949414   hypothetical protein
NATL1_14031  2       24       -0.108631  hypothetical protein
NATL1_14041  3       24       0.453230   hypothetical protein
NATL1_14051  1       25       0.585103   hypothetical protein
NATL1_14061  2       27       0.951443   hypothetical protein
NATL1_14071  3       30       0.698848   hypothetical protein
NATL1_14081  2       27       -0.404551  pseudouridylate synthase specific to ribosomal
NATL1_14091  5       33       1.006879   hypothetical protein
NATL1_14101  6       33       0.148054   hypothetical protein
NATL1_14111  6       28       -0.704610  hypothetical protein
NATL1_14121  4       26       -0.319552  hypothetical protein
NATL1_14131  4       24       -0.725144  hypothetical protein
NATL1_14141  5       26       -0.083393  hypothetical protein
NATL1_14151  6       28       0.989526   hypothetical protein
NATL1_14161  4       29       2.457612   hypothetical protein
NATL1_14171  0       30       2.664177   hypothetical protein
NATL1_14181  2       27       1.510018   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_14181  BONTOXILYSIN  26     0.026    Bontoxilysin signature.
		>BONTOXILYSIN#Bontoxilysin signature.

Length = 1196

Score = 25.6 bits (56), Expect = 0.026
Identities = 12/52 (23%), Positives = 20/52 (38%), Gaps = 1/52 (1%)

Query: 26 EMMPQEEIVEAPALEPITEKQLNDLFGEEMYLGTLDL-RGNKVDTKRDYSKK 76
M Q + + ++ I + + DL + TL L R T D S +
Sbjct: 685 CMAKQSILAQESLVKQIVQNKFTDLSKASIPPDTLKLIRETTEKTFIDLSNE 736


21  NATL1_14281  NATL1_14761  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
NATL1_14281  2       21       -1.319473  hypothetical protein
NATL1_14291  2       24       -1.403892  hypothetical protein
NATL1_14301  3       24       -2.122604  hypothetical protein
NATL1_14311  4       22       -1.883959  hypothetical protein
NATL1_14321  4       25       -1.542879  hypothetical protein
NATL1_14331  4       23       -2.667785  hypothetical protein
NATL1_14341  4       21       -3.853204  hypothetical protein
NATL1_14351  2       21       -3.797336  hypothetical protein
NATL1_14361  2       18       -1.998846  hypothetical protein
NATL1_14371  2       21       -1.481909  hypothetical protein
NATL1_14381  2       16       -0.449795  hypothetical protein
NATL1_14391  2       18       2.044000   hypothetical protein
NATL1_14401  2       22       2.846139   hypothetical protein
NATL1_14411  3       21       2.847811   hypothetical protein
NATL1_14421  4       29       1.929007   hypothetical protein
NATL1_14431  2       26       3.032071   hypothetical protein
NATL1_14441  0       23       2.660256   hypothetical protein
NATL1_14451  -1      18       1.936234   hypothetical protein
NATL1_14461  -1      20       2.301395   hypothetical protein
NATL1_14471  -1      21       1.269839   hypothetical protein
NATL1_14481  0       22       0.811718   hypothetical protein
NATL1_14491  4       23       -1.192342  *hypothetical protein
NATL1_14501  2       24       -3.413417  hypothetical protein
NATL1_14511  4       23       -5.187653  hypothetical protein
NATL1_14521  5       20       -6.187989  hypothetical protein
NATL1_14531  6       18       -7.458193  hypothetical protein
NATL1_14541  5       17       -6.813314  hypothetical protein
NATL1_14551  6       19       -5.907058  hypothetical protein
NATL1_14561  3       17       -0.110829  hypothetical protein
NATL1_14571  2       17       0.704939   hypothetical protein
NATL1_14581  1       16       1.109605   hypothetical protein
NATL1_14591  1       20       2.305422   hypothetical protein
NATL1_14601  0       20       2.058045   hypothetical protein
NATL1_14611  -1      14       2.484181   sulfate transporter
NATL1_14621  -2      15       1.844962   hypothetical protein
NATL1_14631  -2      16       3.536560   hypothetical protein
NATL1_14641  -2      15       3.876128   hypothetical protein
NATL1_14651  -2      15       3.332531   hypothetical protein
NATL1_14671  -2      14       3.429727   *tRNA (uracil-5-)-methyltransferase Gid
NATL1_14681  0       15       2.999891   carotenoid isomerase
NATL1_14691  0       17       3.530719   two-component response regulator
NATL1_14701  -2      10       -1.313740  hypothetical protein
NATL1_14711  -1      10       -2.262526  glutaredoxin-like protein
NATL1_14721  0       11       -3.700258  BolA-like protein
NATL1_14731  -1      11       -3.416174  phospholipid/glycerol acyltransferase
NATL1_14741  -1      11       -3.074860  pyridoxine 5'-phosphate synthase
NATL1_14751  -1      12       -3.428878  exodeoxyribonuclease V subunit C 125 kD
NATL1_14761  -1      13       -3.598310  S-isoprenylcysteine methyltransferase-like
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_14561  ACRIFLAVINRP  25     0.028    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 25.2 bits (55), Expect = 0.028
Identities = 12/29 (41%), Positives = 16/29 (55%), Gaps = 2/29 (6%)

Query: 3 IFLLLSLTFLTIFFVFKKVFYSAKRNLFK 31
+ ++S T L IFFV VF+ R FK
Sbjct: 1007 MGGMVSATLLAIFFV--PVFFVVIRRCFK 1033


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_14691  HTHFIS        60     1e-12    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 60.2 bits (146), Expect = 1e-12
Identities = 29/130 (22%), Positives = 51/130 (39%), Gaps = 5/130 (3%)

Query: 30 SPSKVLVVEPHPTLRTVLVQRLRQDGQLAAAVGSAEEAVDLCRDQSPDLLVSAEILEKSS 89
+ + +LV + +RTVL Q L + G +A DL+V+ ++ +
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 90 ALRLAQQL-----GCSVIVLTARTGVEPVVGLLDDGADDVLRKPFGLEELAARCRTLLKR 144
A L ++ V+V++A+ + + GA D L KPF L EL L
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 145 GRIGLQERVA 154
+ +
Sbjct: 122 PKRRPSKLED 131


22  NATL1_15361  NATL1_15781  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
NATL1_15361  2       14       -2.479667  homoserine dehydrogenase
NATL1_15371  2       15       -3.782039  hypothetical protein
NATL1_15381  2       14       -2.779155  ABC transporter substrate-binding protein
NATL1_15391  1       13       -2.713507  ABC transporter, oligopeptides
NATL1_15401  0       13       -3.168882  dienelactone hydrolase
NATL1_15411  2       20       -3.780869  major facilitator superfamily multidrug-efflux
NATL1_15421  0       18       -2.682175  hypothetical protein
NATL1_15431  -3      20       -1.227706  pfkB family carbohydrate kinase
NATL1_15441  0       16       -4.269516  hypothetical protein
NATL1_15451  -1      16       -3.002805  lipoprotein
NATL1_15461  2       19       -1.458163  hypothetical protein
NATL1_15471  2       16       -1.541757  hypothetical protein
NATL1_15481  3       22       0.315635   hypothetical protein
NATL1_15491  4       24       2.665936   hypothetical protein
NATL1_15501  3       22       4.620169   hypothetical protein
NATL1_15511  2       24       5.737304   hypothetical protein
NATL1_15521  1       23       5.893066   hypothetical protein
NATL1_15531  1       24       6.721242   hypothetical protein
NATL1_15541  2       25       6.942847   hypothetical protein
NATL1_15551  2       29       6.543374   hypothetical protein
NATL1_15561  2       28       6.096361   hypothetical protein
NATL1_15571  1       26       4.963631   hypothetical protein
NATL1_15581  1       25       5.018124   hypothetical protein
NATL1_15591  0       16       1.000202   hypothetical protein
NATL1_15601  0       14       0.410279   hypothetical protein
NATL1_15611  1       13       -2.629112  hypothetical protein
NATL1_15621  0       14       -4.687388  hypothetical protein
NATL1_15631  -1      12       -5.189888  hypothetical protein
NATL1_15641  -1      12       -4.395779  hypothetical protein
NATL1_15651  -2      11       -3.229385  peroxiredoxin
NATL1_15661  -1      11       -2.212753  DNA photolyase-like protein
NATL1_15671  -3      15       0.976257   short-chain dehydrogenase/reductase
NATL1_15681  -2      16       1.336544   hypothetical protein
NATL1_15691  -2      13       -0.357416  transglutaminase-like superfamily protein
NATL1_15701  -2      13       -0.751522  hypothetical protein
NATL1_15711  -1      14       -0.256176  hypothetical protein
NATL1_15721  -1      20       -0.378506  hypothetical protein
NATL1_15731  -3      10       -3.204995  hypothetical protein
NATL1_15741  -2      9        -4.231486  hypothetical protein
NATL1_15751  -1      10       -1.466368  hypothetical protein
NATL1_15761  -1      10       -1.811256  hypothetical protein
NATL1_15771  -1      11       -4.680634  hypothetical protein
NATL1_15781  -1      10       -3.740469  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_15531  IGASERPTASE   47     2e-08    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 47.4 bits (112), Expect = 2e-08
Identities = 32/164 (19%), Positives = 49/164 (29%), Gaps = 9/164 (5%)

Query: 68 TQLKDESTESQPEATPEPEATPEPEATPEPEATPEPEATPEPEVIPEPEVIPEPEVIPEP 127
Q E+ E+Q T E + E + +V P+ E + EP
Sbjct: 1086 AQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEP 1145

Query: 128 EVIPEPEVIPEPEVIPEPEVITEPQATSEPEATSEPEVMPEPEVMPEPEVIPEPEVITEP 187
+P V + Q E + E V V V+ PE T
Sbjct: 1146 ARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPA 1205

Query: 188 EVITEPEVITEPEVMPEPE-------VITEPEANSENSIDDENI 224
T+P V +E P+ V E + +S D +
Sbjct: 1206 T--TQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTV 1247



Score = 39.3 bits (91), Expect = 9e-06
Identities = 25/165 (15%), Positives = 46/165 (27%), Gaps = 22/165 (13%)

Query: 67 PTQLKDESTESQPEATPEPEAT---PEPEATPEPEATPEPEATPEPEVIPEPEVIPEP-E 122
P Q + E+ + Q E E + T EP++ A E A + +P
Sbjct: 1131 PKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVN 1190

Query: 123 VIPEPEVIPEPEVIPEPEVIPEPEVITEPQATSE----PEATSEPEVMPEP-EVMPEPEV 177
PE T+P SE P+ V P V P
Sbjct: 1191 TGNSVVENPENTT----------PATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTS 1240

Query: 178 IPEPEVITEPEVITEPEVITEPEVMPEPEVITEPEANSENSIDDE 222
+ + ++ + + + + + N ++
Sbjct: 1241 SNDRSTVALCDLTSTNTNAVLSDARAKAQFVA---LNVGKAVSQH 1282



Score = 38.5 bits (89), Expect = 2e-05
Identities = 22/127 (17%), Positives = 35/127 (27%), Gaps = 1/127 (0%)

Query: 62 KLDQGPTQLKDESTESQPEATPEPEATPEPEATPEPEATPEPEATPEPEVIPEPEVIPEP 121
K + P S + + T +P+A P E P T +P
Sbjct: 1118 KTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSS 1177

Query: 122 EVIPEPEVIPEPEVIPEPEVIPEPEVITEPQATSEPEATSEPEVMPEPEVMPEPEVIPEP 181
V PE Q T E++++P+ V P EP
Sbjct: 1178 NVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHN-VEP 1236

Query: 182 EVITEPE 188
+ +
Sbjct: 1237 ATTSSND 1243



Score = 30.8 bits (69), Expect = 0.005
Identities = 11/100 (11%), Positives = 23/100 (23%), Gaps = 1/100 (1%)

Query: 63 LDQGPTQLKDESTESQPEATPEPEATPEPEATPEPEATPEPEATPEPEVIPEPEVIPEPE 122
+ + +Q + QP + PE + E
Sbjct: 1155 IKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSE 1214

Query: 123 VIPEPEVIPEPEVIPEPEVIPEPEVITE-PQATSEPEATS 161
+P+ V P + + + + TS
Sbjct: 1215 SSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTS 1254


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_15661  LPSBIOSNTHSS  30     0.010    Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature.

Length = 166

Score = 29.8 bits (67), Expect = 0.010
Identities = 18/77 (23%), Positives = 29/77 (37%), Gaps = 5/77 (6%)

Query: 245 MDIMSRSTLFNSYSKVYILRNIAINNEFDLSENVSQFKRGLIDKVNKLIPNSEVLKSTDL 304
+DI+ R V +LRN F + E + Q K +PN++V L
Sbjct: 17 LDIIERGCRLFDQVYVAVLRNPNKQPMFSVQERLEQIA-----KAIAHLPNAQVDSFEGL 71

Query: 305 GINLSGHNFIDVIYPGV 321
+N + I G+
Sbjct: 72 TVNYARQRQAGAILRGL 88


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_15671  DHBDHDRGNASE  65     2e-14    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 65.1 bits (158), Expect = 2e-14
Identities = 50/205 (24%), Positives = 86/205 (41%), Gaps = 18/205 (8%)

Query: 11 DGKVFLITGANSGLGYETSKFLLERGATVIMSCRDLIKGEKAKQELLKFNFSGKIELVEL 70
+GK+ ITGA G+G ++ L +GA + + K EK L E
Sbjct: 7 EGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHA--EAFPA 64

Query: 71 DLSDLINVKKFAESIKNKFDYLDVLINNAGI--MAPPKTFSKQGFEIQFAVNHLAHMFLT 128
D+ D + + I+ + +D+L+N AG+ + S + +E F+VN +
Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNAS 124

Query: 129 LELLPMLEEKNNSRVVTVTSGVQYFGKIQWADLQGNLKYDRWASYAQSKLANVMFGLELD 188
+ + ++ + +VTV S + A+YA SK A VMF L
Sbjct: 125 RSVSKYMMDRRSGSIVTVGSNPAGVPRTS------------MAAYASSKAAAVMFTKCLG 172

Query: 189 SKLKESNSKTSSLLAHPGFARTNLQ 213
+L E N + + + PG T++Q
Sbjct: 173 LELAEYNIRCN--IVSPGSTETDMQ 195


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_15741  SYCDCHAPRONE  43     8e-07    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 42.6 bits (100), Expect = 8e-07
Identities = 29/147 (19%), Positives = 52/147 (35%), Gaps = 17/147 (11%)

Query: 38 NYLSETPEEEIINQAIKFHLQGKISEAIKYYKHCLIKGFHDEKVFCNYGIILKNLGKTKE 97
N +S E++ + A + GK +A K ++ + +D + F G + +G+
Sbjct: 29 NEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDL 88

Query: 98 AELLQLKAIEIKPNYAEAYSNLGVIYKDKGKLKEAELSLKKAIEI---RPNFANAHNNLG 154
A + + KG+L EAE L A E+ + F
Sbjct: 89 AIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTEFK------- 141

Query: 155 IIFNDLGKFKEAELSYLKAIELKPDFA 181
+ S L+AI+LK +
Sbjct: 142 -------ELSTRVSSMLEAIKLKKEME 161



Score = 38.4 bits (89), Expect = 2e-05
Identities = 13/79 (16%), Positives = 25/79 (31%)

Query: 107 EIKPNYAEAYSNLGVIYKDKGKLKEAELSLKKAIEIRPNFANAHNNLGIIFNDLGKFKEA 166
EI + E +L GK ++A + + + LG +G++ A
Sbjct: 30 EISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLA 89

Query: 167 ELSYLKAIELKPDFAEPYY 185
SY + +
Sbjct: 90 IHSYSYGAIMDIKEPRFPF 108


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_15761  CABNDNGRPT    40     3e-05    NodO calcium binding signature.
		>CABNDNGRPT#NodO calcium binding signature.

Length = 479

Score = 39.6 bits (92), Expect = 3e-05
Identities = 19/93 (20%), Positives = 33/93 (35%), Gaps = 6/93 (6%)

Query: 294 GLSGYIANTGSSSNDELTGSSSNETFFASEGSDIINGKGGNDTSIYSGKFSDYSFTREDN 353
GL G ++ + + G S N+ + +I+ G GND +Y G +D +
Sbjct: 327 GLKGNVSIAHGVTIENAIGGSGNDILVGNSADNILQGGAGNDV-LYGGAGADTLYGGAGR 385

Query: 354 SL----AIADQRTGKNN-GTDTLSNIEYIQFSD 381
+ D + D I+ I S
Sbjct: 386 DTFVYGSGQDSTVAAYDWIADFQKGIDKIDLSA 418



Score = 29.9 bits (67), Expect = 0.029
Identities = 27/149 (18%), Positives = 34/149 (22%), Gaps = 34/149 (22%)

Query: 301 NTGSSSNDELTGSSSNETFFASEGSDIINGKGGNDTSIYSGKFSDYSFTREDNSLAIADQ 360
N S ++ G N + + G GND + + N +
Sbjct: 316 NLNEGSFSDVGGLKGNVSIAHGVTIENAIGGSGNDI-LVGNSADNILQGGAGNDVLYGG- 373

Query: 361 RTGKNNGTDTLSNIEYIQFSDQKVEESKVDVVKTYSGKFSDYKFYNKGNGVYQIKTDSGY 420
G DTL Y G D Y G D
Sbjct: 374 -----AGADTL-----------------------YGGAGRDTFVYGSGQDSTVAAYDWIA 405

Query: 421 DDITGFP---LLTFTGEGTTSSFKDISAI 446
D G L F EG SF
Sbjct: 406 DFQKGIDKIDLSAFRNEG-QLSFVQDQFT 433



Score = 29.9 bits (67), Expect = 0.033
Identities = 20/130 (15%), Positives = 40/130 (30%), Gaps = 18/130 (13%)

Query: 293 QGLSGYIANTGSSSNDELTGSSSNETFFASEGSDIINGKGGNDTSIYSGKFSDYSFTRED 352
G S G + ND L G + +T + G D G D+++ + +
Sbjct: 353 VGNSADNILQGGAGNDVLYGGAGADTLYGGAGRDTFVYGSGQDSTVAAYDW--------- 403

Query: 353 NSLAIADQRTGKN-NGTDTLSNIEYIQFSDQKVEESKVDVVKTYSGKFSD----YKFYNK 407
IAD + G + N + F + +V+ + S
Sbjct: 404 ----IADFQKGIDKIDLSAFRNEGQLSFVQDQFTGKGQEVMLQWDAANSITNLWLHEAGH 459

Query: 408 GNGVYQIKTD 417
+ + ++
Sbjct: 460 SSVDFLVRIV 469


23  NATL1_16061  NATL1_16541  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
NATL1_16061  0       24       4.234298   photosystem I assembly protein Ycf4
NATL1_16071  1       19       2.892921   photosystem II PsbD protein (D2)
NATL1_16081  2       12       1.516919   photosystem II PsbC protein (CP43)
NATL1_16091  3       15       -3.467423  Maf-like protein
NATL1_16101  4       15       -3.346189  hypothetical protein
NATL1_16111  3       14       -3.030748  hypothetical protein
NATL1_16121  3       16       -4.381304  cobyric acid synthase
NATL1_16131  1       16       -3.556133  hypothetical protein
NATL1_16141  1       15       -3.756890  hypothetical protein
NATL1_16151  0       14       -1.702271  hypothetical protein
NATL1_16161  1       13       0.834717   hypothetical protein
NATL1_16171  -1      14       1.529978   hypothetical protein
NATL1_16181  -2      15       1.949176   *iron ABC transporter, substrate binding protein
NATL1_16191  -3      16       0.667168   hydroxylase
NATL1_16201  -1      13       -0.617674  M20/M25/M40 family peptidase-like protein
NATL1_16211  -2      14       -0.045412  porin
NATL1_16221  2       12       -2.292982  glycyl-tRNA synthetase subunit alpha
NATL1_16231  3       17       -4.274010  hypothetical protein
NATL1_16241  4       21       -4.973489  hypothetical protein
NATL1_16251  5       26       -4.428736  hypothetical protein
NATL1_16261  6       29       -1.824554  hypothetical protein
NATL1_16271  5       24       -6.470739  hypothetical protein
NATL1_16281  6       30       -3.917133  hypothetical protein
NATL1_16291  6       27       -3.889445  hypothetical protein
NATL1_16301  4       27       -0.820346  hypothetical protein
NATL1_16311  5       24       -1.447319  hypothetical protein
NATL1_16321  6       23       -2.451335  hypothetical protein
NATL1_16331  4       21       -0.813570  hypothetical protein
NATL1_16341  -1      17       -1.787731  hypothetical protein
NATL1_16351  1       16       -6.195202  hypothetical protein
NATL1_16361  1       15       -6.685862  hypothetical protein
NATL1_16371  1       18       -6.718645  hypothetical protein
NATL1_16381  1       18       -6.915149  hypothetical protein
NATL1_16391  3       17       -7.353295  peroxiredoxin
NATL1_16401  4       16       -7.609085  hypothetical protein
NATL1_16411  7       18       -2.513874  hypothetical protein
NATL1_16421  3       16       -1.226628  hypothetical protein
NATL1_16431  1       18       -0.518118  hypothetical protein
NATL1_16441  -1      21       1.941052   hypothetical protein
NATL1_16451  -3      18       1.519118   hypothetical protein
NATL1_16461  0       16       -0.298091  cytochrome b559 subunit beta
NATL1_16471  0       13       -1.176289  hypothetical protein
NATL1_16481  1       13       -0.653949  hypothetical protein
NATL1_16491  1       12       -0.134044  flavodoxin FldA
NATL1_16501  2       13       -0.513567  CopG family protein
NATL1_16511  2       12       -0.738764  glycosyltransferase
NATL1_16521  1       13       0.248640   glycosyl transferases group 1
NATL1_16531  2       15       1.221505   SMR family transporter
NATL1_16541  2       17       0.622739   signal peptide peptidase SppA (protease IV)
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_16321  PF04647       25     0.030    Accessory gene regulator B
		>PF04647#Accessory gene regulator B

Length = 212

Score = 24.7 bits (54), Expect = 0.030
Identities = 10/51 (19%), Positives = 18/51 (35%), Gaps = 2/51 (3%)

Query: 12 SKPYWCQPWSIISFGVLVLIFSFKLLNNIIITSILGFFILVWWILFLIIAP 62
K Y C S++ F VL I + ++ + + L + P
Sbjct: 75 EKYYRCTLTSLLVFNVLAYIAHLIDPAYFQL--LILIAFITSLLALLFLVP 123


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_16331  TONBPROTEIN   37     2e-05    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 37.3 bits (86), Expect = 2e-05
Identities = 20/103 (19%), Positives = 33/103 (32%), Gaps = 2/103 (1%)

Query: 76 PIEEDKPVEEDKPVEEDKPVEEDKPVEEDKPVEEDKPVEEDKPVEEDKPVEEDKPVEEDK 135
PI D + + VE + E ++ PV +KP + KP +
Sbjct: 44 PISVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPV 103

Query: 136 PVEEDKPVEEDKPVEE--DKPVEEDKPVEEDKPVEEDKPVEED 176
+++P + KPVE P E P +
Sbjct: 104 KKVQEQPKRDVKPVESRPASPFENTAPARLTSSTATAATSKPV 146



Score = 33.8 bits (77), Expect = 2e-04
Identities = 19/89 (21%), Positives = 33/89 (37%), Gaps = 2/89 (2%)

Query: 104 DKPVEEDKPVEEDKPVEEDKPVEEDKPVEEDKPVEEDKPVEEDKPVEEDKPVEEDKPVEE 163
+P+ D + + VE + E ++ PV +KP + KP +
Sbjct: 42 AQPISVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPK 101

Query: 164 DKPVEEDKPVEEDKPVEE--DKPVEEDKP 190
+++P + KPVE P E P
Sbjct: 102 PVKKVQEQPKRDVKPVESRPASPFENTAP 130


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_16401  SYCDCHAPRONE  42     2e-06    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 41.8 bits (98), Expect = 2e-06
Identities = 15/73 (20%), Positives = 30/73 (41%)

Query: 140 KYDESRLSFEKAIELDKNYFDAYINLGLLNKDSNKYNEAEECYLKALEINNKSAIAHLNL 199
KY+++ F+ LD ++ LG + +Y+ A Y ++ K +
Sbjct: 51 KYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHA 110

Query: 200 GACYKEKQDLDKA 212
C +K +L +A
Sbjct: 111 AECLLQKGELAEA 123



Score = 34.1 bits (78), Expect = 8e-04
Identities = 26/130 (20%), Positives = 44/130 (33%), Gaps = 5/130 (3%)

Query: 88 PNHIYSKLNLSFLYYKLNQLEIAEKIIEEAIQLKPSMPNGHCIRGLILKGLDKYDESRLS 147
+YS L+F Y+ + E A K+ + L G + + +YD + S
Sbjct: 36 LEQLYS---LAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHS 92

Query: 148 FEKAIELDKNYFDAYINLGLLNKDSNKYNEAEECYLKALE-INNKSAIAHLNLGA-CYKE 205
+ +D + + EAE A E I +K+ L+ E
Sbjct: 93 YSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTEFKELSTRVSSMLE 152

Query: 206 KQDLDKAILH 215
L K + H
Sbjct: 153 AIKLKKEMEH 162



Score = 32.6 bits (74), Expect = 0.003
Identities = 20/119 (16%), Positives = 41/119 (34%), Gaps = 3/119 (2%)

Query: 11 NKGSSKKLQKLSEKDLK---AKSINNHIKGNLDEAEKGYIAFLRNGYSDADIISNYALIC 67
G+ L ++S L+ + + N + G ++A K + A + D+
Sbjct: 21 GGGTIAMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACR 80

Query: 68 EGKGENEKAIRLYKKCAKSFPNHIYSKLNLSFLYYKLNQLEIAEKIIEEAIQLKPSMPN 126
+ G+ + AI Y A + + + +L AE + A +L
Sbjct: 81 QAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTE 139



Score = 31.8 bits (72), Expect = 0.005
Identities = 19/94 (20%), Positives = 36/94 (38%)

Query: 153 ELDKNYFDAYINLGLLNKDSNKYNEAEECYLKALEINNKSAIAHLNLGACYKEKQDLDKA 212
E+ + + +L S KY +A + + +++ + L LGAC + D A
Sbjct: 30 EISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLA 89

Query: 213 ILHTKMAIEIDNKLENCYLNLATIYNQIGDYKKS 246
I +D K + A Q G+ ++
Sbjct: 90 IHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEA 123


24  NATL1_16671  NATL1_16741  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
NATL1_16671  -2      15       -3.310298  tetrapyrrole methylase family protein
NATL1_16681  3       14       -5.423896  hypothetical protein
NATL1_16691  1       14       -4.573012  hypothetical protein
NATL1_16701  0       16       -3.753660  *hypothetical protein
NATL1_16711  0       17       -4.061979  hypothetical protein
NATL1_16721  1       18       -3.416958  hypothetical protein
NATL1_16741  3       18       -3.208910  *hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_16741  SYCDCHAPRONE  41     1e-06    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.
Length = 168

Score = 41.5 bits (97), Expect = 1e-06
Identities = 24/125 (19%), Positives = 44/125 (35%), Gaps = 3/125 (2%)

Query: 50 EQIINQAFKFHSQGNISKATKYYQICIKQGFNNPQVFSNFGILLKEIDQLKEAEKMIKQA 109
EQ+ + AF + G A K +Q + + F G + + Q A
Sbjct: 37 EQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYG 96

Query: 110 IKLKPDYAIAYNNLGNILIDLGRLKEAEIYTKKAIDL---KPDYANAYNTLGNILKELDN 166
+ + L+ G L EAE A +L K ++ + ++L+ +
Sbjct: 97 AIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTEFKELSTRVSSMLEAIKL 156

Query: 167 LKDAE 171
K+ E
Sbjct: 157 KKEME 161



Score = 34.9 bits (80), Expect = 3e-04
Identities = 15/105 (14%), Positives = 32/105 (30%)

Query: 106 IKQAIKLKPDYAIAYNNLGNILIDLGRLKEAEIYTKKAIDLKPDYANAYNTLGNILKELD 165
I ++ D +L G+ ++A + L + + LG + +
Sbjct: 25 IAMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMG 84

Query: 166 NLKDAEICFSKAISLEPDHESAIINRGQLYFDKGEFKKALKDSDL 210
A +S ++ + + KGE +A L
Sbjct: 85 QYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFL 129


25  NATL1_17771  NATL1_18211  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
NATL1_17771  1       18       3.264396   pentapeptide repeat-containing protein
NATL1_17781  0       14       2.659441   hypothetical protein
NATL1_17791  -2      11       1.838444   hypothetical protein
NATL1_17801  -2      11       1.242109   ferredoxin
NATL1_17811  -3      8        -0.411643  ribosomal protein L11 methyltransferase
NATL1_17821  -2      9        -0.668582  D-3-phosphoglycerate dehydrogenase
NATL1_17831  1       13       -3.216362  hypothetical protein
NATL1_17841  2       15       -3.559451  S4-like domain-containing protein
NATL1_17851  1       14       -5.055959  *hypothetical protein
NATL1_17861  -1      12       -3.420038  UDP-N-acetylmuramoyl-L-alanyl-D-glutamate
NATL1_17871  -1      14       -2.207007  MATH domain-containing protein
NATL1_17881  -1      13       -2.722613  hypothetical protein
NATL1_17891  -4      14       -2.520419  hypothetical protein
NATL1_17901  -3      15       -3.404724  hypothetical protein
NATL1_17911  -1      15       -2.579704  GAF domain-containing protein
NATL1_17921  -1      18       -5.389188  hypothetical protein
NATL1_17931  -2      19       -5.029272  hypothetical protein
NATL1_17941  -1      22       -3.389286  hypothetical protein
NATL1_17951  -1      28       -3.073557  hypothetical protein
NATL1_17961  1       30       -1.504365  hypothetical protein
NATL1_17971  6       35       1.702352   hypothetical protein
NATL1_17981  3       27       1.610290   hypothetical protein
NATL1_17991  1       23       1.386385   hypothetical protein
NATL1_18001  1       21       1.561506   hypothetical protein
NATL1_18011  2       18       1.371295   hypothetical protein
NATL1_18021  1       19       1.317633   hypothetical protein
NATL1_18031  -1      17       0.491415   hypothetical protein
NATL1_18041  1       18       0.092697   hypothetical protein
NATL1_18051  -1      18       -0.001641  hypothetical protein
NATL1_18061  1       21       0.249087   hypothetical protein
NATL1_18071  4       23       -1.767574  hypothetical protein
NATL1_18081  4       23       -1.488955  hypothetical protein
NATL1_18091  -1      17       -0.325829  hypothetical protein
NATL1_18101  -2      17       0.261014   hypothetical protein
NATL1_18111  -2      16       -0.724907  hypothetical protein
NATL1_18121  -1      16       -1.256244  hypothetical protein
NATL1_18131  0       16       -0.949539  hypothetical protein
NATL1_18141  0       11       -2.858715  hypothetical protein
NATL1_18151  1       14       -6.151900  hypothetical protein
NATL1_18161  0       14       -5.360776  hypothetical protein
NATL1_18171  0       15       -5.664186  hypothetical protein
NATL1_18181  0       15       -5.284573  hypothetical protein
NATL1_18191  0       15       -5.676029  hypothetical protein
NATL1_18201  0       15       -4.854877  hypothetical protein
NATL1_18211  1       14       -3.405989  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_17821  NUCEPIMERASE  30     0.023    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 29.8 bits (67), Expect = 0.023
Identities = 10/24 (41%), Positives = 13/24 (54%)

Query: 148 GKIGSHVAKVANAMGMEVIGFDPF 171
G IG HV+K G +V+G D
Sbjct: 10 GFIGFHVSKRLLEAGHQVVGIDNL 33


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
NATL1_17921  PF06917  27     0.007    Periplasmic pectate lyase
		>PF06917#Periplasmic pectate lyase

Length = 555

Score = 27.2 bits (60), Expect = 0.007
Identities = 15/44 (34%), Positives = 23/44 (52%), Gaps = 4/44 (9%)

Query: 13 VASKAGKLFEKMTPEMIDQKLVESQVIQQMID----QLQLEGLK 52
+A +A LF M P +ID L +++Q D Q ++GLK
Sbjct: 302 IAREANVLFRDMRPLLIDNPLAMLDILRQQPDAEVLQWVIDGLK 345


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_18171  TYPE3IMSPROT  29     0.002    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.
Length = 354

Score = 29.3 bits (66), Expect = 0.002
Identities = 11/70 (15%), Positives = 28/70 (40%), Gaps = 5/70 (7%)

Query: 14 LVIACLTVSLFI----GFLISINLIVDPIIYWLASTEGARIIISCILSWFGVSTLFDFLY 69
+V+ + + + I L+ + + R ++ F V ++ D+ +
Sbjct: 147 VVLLSILIWIIIKGNLVTLLQLP-TCGIECITPLLGQILRQLMVICTVGFVVISIADYAF 205

Query: 70 KRYRRRKKLK 79
+ Y+ K+LK
Sbjct: 206 EYYQYIKELK 215


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_18191  SYCDCHAPRONE  36     1e-04    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.
Length = 168

Score = 35.7 bits (82), Expect = 1e-04
Identities = 19/149 (12%), Positives = 41/149 (27%), Gaps = 12/149 (8%)

Query: 34 INTNTPSKSSKEKIINQALDSHSEGNIQEAKKLYQYLINQGFNDHRVFSNYGVILQNLGK 93
+ T + + L G I ++ + Q + GK
Sbjct: 1 MQQETTDTQEYQLAMESFLKGG--GTIAMLNEISSDTLEQ-------LYSLAFNQYQSGK 51

Query: 94 LKEAKISFRKAIELNPNYHEAHANLGNILRDLGKLEEAEVSTLKAIELNPNFASAHCNLG 153
++A F+ L+ LG + +G+ + A S ++ +
Sbjct: 52 YEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAA 111

Query: 154 ---LILEGLDKIEQSVFSFKRALETNPND 179
L L + E +F + +
Sbjct: 112 ECLLQKGELAEAESGLFLAQELIADKTEF 140


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_18201  SYCDCHAPRONE  44     3e-07    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.
Length = 168

Score = 43.8 bits (103), Expect = 3e-07
Identities = 28/131 (21%), Positives = 48/131 (36%), Gaps = 3/131 (2%)

Query: 45 EQIINQAFKFHSQGNISEAAKYYQYCINQAFKDYRVFTNYGVILKKFGKLQEAEKFQREA 104
EQ+ + AF + G +A K +Q D R F G + G+ A
Sbjct: 37 EQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYG 96

Query: 105 IQINPNFAEAYSNLGNILRDLGQLKEAELSFRKAIEI---KSDYAEAYSNLGNILRDLGQ 161
++ + L G+L EAE A E+ K+++ E + + ++L +
Sbjct: 97 AIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTEFKELSTRVSSMLEAIKL 156

Query: 162 LKEAELSFRKA 172
KE E
Sbjct: 157 KKEMEHECVDN 167



Score = 43.0 bits (101), Expect = 5e-07
Identities = 25/118 (21%), Positives = 49/118 (41%), Gaps = 3/118 (2%)

Query: 92 GKLQEAEKFQREAIQINPNFAEAYSNLGNILRDLGQLKEAELSFRKAIEIKSDYAEAYSN 151
GK ++A K + ++ + + LG + +GQ A S+ + +
Sbjct: 50 GKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFH 109

Query: 152 LGNILRDLGQLKEAELSFRKAIEI---KPDYAEAHSNLGNILSDLGIKKEAKLEKQKS 206
L G+L EAE A E+ K ++ E + + ++L + +KKE + E +
Sbjct: 110 AAECLLQKGELAEAESGLFLAQELIADKTEFKELSTRVSSMLEAIKLKKEMEHECVDN 167


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_18211  SYCDCHAPRONE  47     3e-08    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.
Length = 168

Score = 46.8 bits (111), Expect = 3e-08
Identities = 30/136 (22%), Positives = 48/136 (35%), Gaps = 11/136 (8%)

Query: 46 EQIINQAFKFHSQGNISEAAKYYQYCINQAFKDYRVFTNYGVILKKFGKLKEAEKCQREA 105
EQ+ + AF + G +A K +Q D R F G + G+ A
Sbjct: 37 EQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYG 96

Query: 106 IQINPNFAEAYSNLGNILSDLGQLKEAELSFRKAIEIKSDYAEAHSNLGNILRDFGQLKE 165
++ + L G+L EAE A E+ +D E F +L
Sbjct: 97 AIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTE-----------FKELST 145

Query: 166 AELSFRKAIEIKSDYA 181
S +AI++K +
Sbjct: 146 RVSSMLEAIKLKKEME 161



Score = 38.8 bits (90), Expect = 2e-05
Identities = 23/118 (19%), Positives = 39/118 (33%), Gaps = 3/118 (2%)

Query: 127 GQLKEAELSFRKAIEIKSDYAEAHSNLGNILRDFGQLKEAELSFRKAIEIKSDYAEAHSN 186
G+ ++A F+ + + LG + GQ A S+ + +
Sbjct: 50 GKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFH 109

Query: 187 LGNILNDLGQLKEAELSFRKAIEI---KPDFANTHNNLGIILSDLDQLKEAELSFRKA 241
L G+L EAE A E+ K +F + +L + KE E
Sbjct: 110 AAECLLQKGELAEAESGLFLAQELIADKTEFKELSTRVSSMLEAIKLKKEMEHECVDN 167



Score = 38.4 bits (89), Expect = 3e-05
Identities = 23/113 (20%), Positives = 40/113 (35%), Gaps = 3/113 (2%)

Query: 175 EIKSDYAEAHSNLGNILNDLGQLKEAELSFRKAIEIKPDFANTHNNLGIILSDLDQLKEA 234
EI SD E +L G+ ++A F+ + + LG + Q A
Sbjct: 30 EISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLA 89

Query: 235 ELSFRKAIEIKPDFIKAYSNLGNILRDLGQLKEAELSFRKAIKI---KPDYAE 284
S+ + + + L G+L EAE A ++ K ++ E
Sbjct: 90 IHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTEFKE 142



Score = 36.4 bits (84), Expect = 1e-04
Identities = 22/118 (18%), Positives = 41/118 (34%), Gaps = 3/118 (2%)

Query: 161 GQLKEAELSFRKAIEIKSDYAEAHSNLGNILNDLGQLKEAELSFRKAIEIKPDFANTHNN 220
G+ ++A F+ + + LG +GQ A S+ + +
Sbjct: 50 GKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFH 109

Query: 221 LGIILSDLDQLKEAELSFRKAIEI---KPDFIKAYSNLGNILRDLGQLKEAELSFRKA 275
L +L EAE A E+ K +F + + + ++L + KE E
Sbjct: 110 AAECLLQKGELAEAESGLFLAQELIADKTEFKELSTRVSSMLEAIKLKKEMEHECVDN 167



Score = 35.7 bits (82), Expect = 2e-04
Identities = 24/118 (20%), Positives = 45/118 (38%), Gaps = 3/118 (2%)

Query: 93 GKLKEAEKCQREAIQINPNFAEAYSNLGNILSDLGQLKEAELSFRKAIEIKSDYAEAHSN 152
GK ++A K + ++ + + LG +GQ A S+ + +
Sbjct: 50 GKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFH 109

Query: 153 LGNILRDFGQLKEAELSFRKAIEI---KSDYAEAHSNLGNILNDLGQLKEAELSFRKA 207
L G+L EAE A E+ K+++ E + + ++L + KE E
Sbjct: 110 AAECLLQKGELAEAESGLFLAQELIADKTEFKELSTRVSSMLEAIKLKKEMEHECVDN 167



Score = 32.6 bits (74), Expect = 0.002
Identities = 17/91 (18%), Positives = 29/91 (31%), Gaps = 13/91 (14%)

Query: 221 LGIILSDLDQL-------------KEAELSFRKAIEIKPDFIKAYSNLGNILRDLGQLKE 267
I L+QL ++A F+ + + + LG + +GQ
Sbjct: 29 NEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDL 88

Query: 268 AELSFRKAIKIKPDYAEAYFNLAYLELLKGN 298
A S+ + F+ A L KG
Sbjct: 89 AIHSYSYGAIMDIKEPRFPFHAAECLLQKGE 119


26  NATL1_18461  NATL1_18571  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
NATL1_18461  2       14       2.375118   hypothetical protein
NATL1_18471  4       17       2.693284   hypothetical protein
NATL1_18481  4       18       3.823730   F0F1 ATP synthase subunit gamma
NATL1_18491  4       21       4.399687   F0F1 ATP synthase subunit alpha
NATL1_18501  5       24       4.263294   F0F1 ATP synthase subunit delta
NATL1_18511  6       27       4.207452   F0F1 ATP synthase subunit B
NATL1_18521  0       21       2.582428   F0F1 ATP synthase subunit B'
NATL1_18531  2       18       2.044000   F0F1 ATP synthase subunit C
NATL1_18541  1       13       -0.459652  F0F1 ATP synthase subunit A
NATL1_18551  2       12       -1.718474  H+-transporting ATP synthase
NATL1_18561  2       14       -2.302746  hypothetical protein
NATL1_18571  2       13       -1.881128  cell division protein FtsW
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
NATL1_18471  PF03944  25     0.050    delta endotoxin
		>PF03944#delta endotoxin

Length = 633

Score = 24.6 bits (53), Expect = 0.050
Identities = 10/20 (50%), Positives = 13/20 (65%), Gaps = 1/20 (5%)

Query: 11 DWPFELSLFDLK-NYILSKL 29
DWPF SLF + NY+L+
Sbjct: 292 DWPFLYSLFQVNSNYVLNGF 311


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
NATL1_18501  INTIMIN  28     0.023    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 28.1 bits (62), Expect = 0.023
Identities = 29/105 (27%), Positives = 49/105 (46%), Gaps = 16/105 (15%)

Query: 63 LEKLFSSQVTPSFLNLLKLLADRQRIGLL----NSVLERLLEIYREQRNIALATITSASA 118
+K +S Q+ P ++N L+ L+ R L+ N +LE Y++Q ++L I
Sbjct: 411 FDKPWSQQIEPQYVNELRTLSG-SRYDLVQRNNNIILE-----YKKQDILSL-NIPHDIN 463

Query: 119 LNEDQQSELLKKVQSIAGTDNLEIDLKVDSELL--GGFVVNVGSK 161
E ++ V+S G D + D DS L GG + + GS+
Sbjct: 464 GTERSTQKIQLIVKSKYGLDRIVWD---DSALRSQGGQIQHSGSQ 505


27  NATL1_19241  NATL1_19301  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
NATL1_19241  -1      16       3.455838   urease subunit gamma
NATL1_19251  2       18       5.037404   urease subunit beta
NATL1_19261  2       17       4.708499   urease subunit alpha
NATL1_19271  5       23       2.481168   hypothetical protein
NATL1_19281  4       26       1.658254   hypothetical protein
NATL1_19291  3       27       2.669576   hypothetical protein
NATL1_19301  4       27       3.506112   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
NATL1_19261  UREASE  1104   0.0      Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 1104 bits (2856), Expect = 0.0
Identities = 404/569 (71%), Positives = 471/569 (82%), Gaps = 1/569 (0%)

Query: 1 MPFKISRQAYAETYGPTKGDRIRLADTDLILEVEQDHTHYGDEVKFGGGKVIRDGMGQSQ 60
M +++SR AYA +GPT GD++RLADT+L +EVE+D T +G+EVKFGGGKVIRDGMGQSQ
Sbjct: 1 MSYRMSRAAYANMFGPTVGDKVRLADTELFIEVEKDFTTHGEEVKFGGGKVIRDGMGQSQ 60

Query: 61 QSRDNGVVDTVITNALILDWWGIVKADIGIKDGKISGIGKAGNPDTQEGVNIIVGASTEA 120
+R+ G VDTVITNALILD WGIVKADIG+KDG+I+ IGKAGNPD Q GV IIVG TE
Sbjct: 61 VTREGGAVDTVITNALILDHWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEV 120

Query: 121 IAGEGSIITAGAIDSHIHFICPQQIETALASGVTTMLGGGTGPATGTNATTCTPGAFHIS 180
IAGEG I+TAG +DSHIHFICPQQIE AL SG+T MLGGGTGPA GT ATTCTPG +HI+
Sbjct: 121 IAGEGKIVTAGGMDSHIHFICPQQIEEALMSGLTCMLGGGTGPAHGTLATTCTPGPWHIA 180

Query: 181 RMLQSAEGFPVNLGFFGKGNATNKAALEEQVRAGACGLKLHEDWGTTPACIDSCLSVADQ 240
RM+++A+ FP+NL F GKGNA+ AL E V GA LKLHEDWGTTPA ID CLSVAD+
Sbjct: 181 RMIEAADAFPMNLAFAGKGNASLPGALVEMVLGGATSLKLHEDWGTTPAAIDCCLSVADE 240

Query: 241 LDVQVCIHTDTLNEAGFVEDTIKAIKGRTIHTFHTEGAGGGHAPDIIKICGESNVIPSST 300
DVQV IHTDTLNE+GFVEDTI AIKGRTIH +HTEGAGGGHAPDII+ICG+ NVIPSST
Sbjct: 241 YDVQVMIHTDTLNESGFVEDTIAAIKGRTIHAYHTEGAGGGHAPDIIRICGQPNVIPSST 300

Query: 301 NPTRPFTLNTLEEHLDMLMVCHHLDPKIPEDVAFAESRIRRETIAAEDILHDLGAFSIIA 360
NPTRP+T+NTL EHLDMLMVCHHL P IPED+AFAESRIR+ETIAAEDILHD+GAFSII+
Sbjct: 301 NPTRPYTVNTLAEHLDMLMVCHHLSPTIPEDIAFAESRIRKETIAAEDILHDIGAFSIIS 360

Query: 361 SDSQAMGRVGEVISRTFQTAHKMKVQRGALPEDNQRNDNHRLKRYISKVTINPAIAHGIS 420
SDSQAMGRVGEV RT+QTA KMK QRG L E+ NDN R+KRYI+K TINPAIAHG+S
Sbjct: 361 SDSQAMGRVGEVAIRTWQTADKMKRQRGRLKEETGDNDNFRVKRYIAKYTINPAIAHGLS 420

Query: 421 AHVGSVEVGKLADLVLWKPGFFGIKPDLVVKGGCIAWAQMGDANASIPTPQPVHGRPMFS 480
+GS+EVGK ADLVLW P FFG+KPD+V+ GG IA A MGD NASIPTPQPVH RPMF
Sbjct: 421 HEIGSLEVGKRADLVLWNPAFFGVKPDMVLLGGTIAAAPMGDPNASIPTPQPVHYRPMFG 480

Query: 481 SFGKAISPTCLTFLSENAIDAGVPERLKLERTCAPVKDTR-KISKQSMKLNDARPKIEVD 539
++G++ + + +TF+S+ ++DAG+ RL + + V++TR I K SM N P IEVD
Sbjct: 481 AYGRSRTNSSVTFVSQASLDAGLAGRLGVAKELVAVQNTRGGIGKASMIHNSLTPHIEVD 540

Query: 540 PQTYEVFANGELLTCEPAESLPLAQRYLL 568
P+TYEV A+GELLTCEPA LP+AQRY L
Sbjct: 541 PETYEVRADGELLTCEPATVLPMAQRYFL 569


28  NATL1_20571  NATL1_20691  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
NATL1_20571  -1      11       3.045643   mannose-1-phosphate guanylyltransferase
NATL1_20581  1       11       4.422248   glucosamine--fructose-6-phosphate
NATL1_20591  1       13       4.956073   photosystem I subunit VII
NATL1_20601  0       10       4.567043   acyl carrier protein
NATL1_20611  -1      11       4.168671   3-oxoacyl-ACP synthase
NATL1_20621  -2      12       3.371147   transketolase
NATL1_20631  -1      14       2.078282   thiamine biosynthesis protein ThiC
NATL1_20641  0       14       2.154092   hypothetical protein
NATL1_20651  1       12       2.345368   zinc metallopeptidase
NATL1_20661  2       12       2.862379   hypothetical protein
NATL1_20671  0       12       1.975749   Holliday junction DNA helicase RuvB
NATL1_20681  -1      11       1.531894   SsrA-binding protein
NATL1_20691  -3      14       3.282989   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_20641  TYPE3OMGPROT  26     0.009    Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature.
Length = 607

Score = 26.4 bits (58), Expect = 0.009
Identities = 12/48 (25%), Positives = 18/48 (37%), Gaps = 6/48 (12%)

Query: 19 LAITGFLHREGKDKIQAIPAL----VVGSGLVFTGAIRRFRRRRMLFL 62
L I G E + +P L +G+ +F RR LF+
Sbjct: 458 LIIGGIYRDELSVALSKVPLLGDIPYIGA--LFRRKSELTRRTVRLFI 503


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_20661  SYCDCHAPRONE  41     9e-07    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.
Length = 168

Score = 41.1 bits (96), Expect = 9e-07
Identities = 20/96 (20%), Positives = 36/96 (37%)

Query: 92 EISPEDLDPYLNRGIAEEALQRWEDASKDYNYVLNNNPKDVSALYNLGNVMGSMDNWIEA 151
EIS + L+ + + ++EDA K + + + D LG +M + A
Sbjct: 30 EISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLA 89

Query: 152 KKLFAQAASSNNAIAMASSSEALAIYQLGDLELAEK 187
++ A + A + Q G+L AE
Sbjct: 90 IHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAES 125



Score = 28.4 bits (63), Expect = 0.021
Identities = 16/111 (14%), Positives = 36/111 (32%)

Query: 45 DFIRAEKDWSSYLNDYPDDAAALSNRGNIRLALGDPKGAIKDQTKSIEISPEDLDPYLNR 104
F++ + D L + + G + A K + D +L
Sbjct: 17 SFLKGGGTIAMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGL 76

Query: 105 GIAEEALQRWEDASKDYNYVLNNNPKDVSALYNLGNVMGSMDNWIEAKKLF 155
G +A+ +++ A Y+Y + K+ ++ + EA+
Sbjct: 77 GACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGL 127


29  NATL1_20961  NATL1_21081  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
NATL1_20961  -2      12       -3.654000  GDP-D-mannose dehydratase
NATL1_20971  -1      12       -4.232750  hypothetical protein
NATL1_20981  0       12       -3.917938  hypothetical protein
NATL1_20991  -2      11       -3.433247  hypothetical protein
NATL1_21001  7       14       1.887172   hypothetical protein
NATL1_21011  6       13       1.513917   ATPase
NATL1_21021  5       14       0.486742   leukotoxin secretion protein-like protein
NATL1_21031  5       15       -0.012471  glycosyltransferase
NATL1_21041  5       16       0.338799   hypothetical protein
NATL1_21051  5       15       0.411247   hypothetical protein
NATL1_21061  1       15       -5.320133  GDP-D-mannose dehydratase
NATL1_21071  0       15       -5.077597  hypothetical protein
NATL1_21081  -2      9        -3.033504  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_20961  NUCEPIMERASE  96     1e-24    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 96.0 bits (239), Expect = 1e-24
Identities = 64/327 (19%), Positives = 122/327 (37%), Gaps = 15/327 (4%)

Query: 8 LITGITGQDGSYLAELLLDKGYKVHGLVRRSSQKNTNLLDNILNSQHNKNLELHYGDLTQ 67
L+TG G G ++++ LL+ G++V G+ + + +L L + H DL
Sbjct: 4 LVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDLAD 63

Query: 68 STNILRIIENIQPDEIYNLGAQSHVQVSFETPEYTAQTDALGPLRILEAIRILQLTKKTK 127
+ + + + ++ + V+ S E P A ++ G L ILE R ++
Sbjct: 64 REGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKI---QH 120

Query: 128 IYQASTSELYGLVQETPQNERTPF-YPRSPYGVAKLYAYWITINYRESYGIFACNGILFN 186
+ AS+S +YGL ++ P + +P S Y K + Y YG+ A F
Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFFT 180

Query: 187 HESPRRGENFVTRKITKGLCEINRGSTDCLYLGNIDSLRDWGHAKDYVEMQWMMLQQEKP 246
P + K TK + E G + +Y RD+ + D E +
Sbjct: 181 VYGPWGRPDMALFKFTKAMLE---GKSIDVY-NYGKMKRDFTYIDDIAEAIIRLQDVIPH 236

Query: 247 EDYVISTGKQTSVRRFVELCAEHLNWGGIIWEGKGIDEIGKRKDNKQVIIRIDPNL--FR 304
D + T ++ + I + + I N+ +
Sbjct: 237 ADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQAL-----EDALGIEAKKNMLPLQ 291

Query: 305 PAEVNSLLGDSEKAYKKLGWKPKYNIE 331
P +V D++ Y+ +G+ P+ ++
Sbjct: 292 PGDVLETSADTKALYEVIGFTPETTVK 318


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
NATL1_20971  CABNDNGRPT  29     0.028    NodO calcium binding signature.
		>CABNDNGRPT#NodO calcium binding signature.

Length = 479

Score = 29.2 bits (65), Expect = 0.028
Identities = 20/46 (43%), Positives = 24/46 (52%), Gaps = 11/46 (23%)

Query: 147 KNTIIHEIGHALGLAHP-----------FNDPFNKNYTTQDTIMSY 181
+ T HEIGHALGLAHP +ND + Q +IMSY
Sbjct: 183 RQTFTHEIGHALGLAHPGEYNAGEGDPSYNDAVYAEDSYQFSIMSY 228


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
NATL1_21021  RTXTOXIND  107    5e-28    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 107 bits (270), Expect = 5e-28
Identities = 59/263 (22%), Positives = 106/263 (40%), Gaps = 55/263 (20%)

Query: 161 DTEITEANQISLIETLKINQEILDNLKYLSEEGAASRIQYLQQSNKV------------- 207
+ A ++ + LD+ L + A ++ L+Q NK
Sbjct: 215 ERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQ 274

Query: 208 -----------------------QEIKTKLKE--------------NEVQMKYQKIISPV 230
EI KL++ NE + + I +PV
Sbjct: 275 LEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPV 334

Query: 231 NGIVFDMQPKGPGYVARTSEPILKVVPLD-KLQAEIEIDSSDIGFISLGKDTDISIDSFP 289
+ V ++ G V T+E ++ +VP D L+ + + DIGFI++G++ I +++FP
Sbjct: 335 SVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFP 394

Query: 290 SSDFGVVEGKIIRIGSDALPPDPRINKGYRFPAIIKLNQQNLILKSGKTLPLQAGMSITA 349
+ +G + GK+ I DA+ D R+ + II + + L K +PL +GM++TA
Sbjct: 395 YTRYGYLVGKVKNINLDAI-EDQRLGLVFN--VIISIEENCLSTG-NKNIPLSSGMAVTA 450

Query: 350 NIKLRKVSYLQLLLNNFSDKADS 372
IK S + LL+ +
Sbjct: 451 EIKTGMRSVISYLLSPLEESVTE 473



Score = 86.0 bits (213), Expect = 1e-20
Identities = 29/133 (21%), Positives = 52/133 (39%), Gaps = 2/133 (1%)

Query: 89 WAEAITWTLIGGTSFGIAWLALAKTEEIVIAQGKLEPKTGVIEVQMPLEGITKEILVKEG 148
+ + ++G L + E + A GKL E++ I KEI+VKEG
Sbjct: 56 RPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEG 115

Query: 149 DRVEKGQILIHLDTEITEANQISLIETLKINQEILDNLKYLSEEGAASRIQYLQQSNKVQ 208
+ V KG +L+ L EA+ + +L + L+ +Y + + + +
Sbjct: 116 ESVRKGDVLLKLTALGAEADTLKTQSSLLQAR--LEQTRYQILSRSIELNKLPELKLPDE 173

Query: 209 EIKTKLKENEVQM 221
+ E EV
Sbjct: 174 PYFQNVSEEEVLR 186


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
NATL1_21051  RTXTOXINA  78     3e-16    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.
Length = 1024

Score = 78.5 bits (193), Expect = 3e-16
Identities = 37/92 (40%), Positives = 50/92 (54%)

Query: 655 GGTKADTFTGGAGDDSIDGDAGDDSLVGAAGADDLDGGAGNDVISGGAGNDTITAGSGND 714
G D F G GDD I+G+ G+D L G G D L GG G+D + GG GND + +GN+
Sbjct: 733 GSKFTDIFHGADGDDLIEGNDGNDRLYGDKGNDTLSGGNGDDQLYGGDGNDKLIGVAGNN 792

Query: 715 NLDGGADADTFTLSTNYTTDDTIVGGAGEDSV 746
L+GG D F + N + + GG G D +
Sbjct: 793 YLNGGDGDDEFQVQGNSLAKNVLFGGKGNDKL 824



Score = 76.5 bits (188), Expect = 1e-15
Identities = 39/104 (37%), Positives = 53/104 (50%)

Query: 1544 LTGGSGADTLTGGSVADTITGGAGGDTITTNAGDDTVAGGGGNDTITGGAGDNVITGGAG 1603
L G + AD G D G G D I N G+D + G GNDT++GG GD+ + GG G
Sbjct: 722 LIGTTRADKFFGSKFTDIFHGADGDDLIEGNDGNDRLYGDKGNDTLSGGNGDDQLYGGDG 781

Query: 1604 NDSITAGAGFDNIDSGAGDDTIVFAANMSLSDTVAGGDGNDTIT 1647
ND + AG + ++ G GDD N + + GG GND +
Sbjct: 782 NDKLIGVAGNNYLNGGDGDDEFQVQGNSLAKNVLFGGKGNDKLY 825



Score = 66.5 bits (162), Expect = 1e-12
Identities = 30/90 (33%), Positives = 43/90 (47%)

Query: 655 GGTKADTFTGGAGDDSIDGDAGDDSLVGAAGADDLDGGAGNDVISGGAGNDTITAGSGND 714
G D G G+D + GD G+D+L G G D L GG GND + G AGN+ + G G+D
Sbjct: 742 GADGDDLIEGNDGNDRLYGDKGNDTLSGGNGDDQLYGGDGNDKLIGVAGNNYLNGGDGDD 801

Query: 715 NLDGGADADTFTLSTNYTTDDTIVGGAGED 744
++ + +D + G G D
Sbjct: 802 EFQVQGNSLAKNVLFGGKGNDKLYGSEGAD 831



Score = 64.6 bits (157), Expect = 4e-12
Identities = 39/115 (33%), Positives = 51/115 (44%), Gaps = 24/115 (20%)

Query: 1543 TLTGGSGADTLTGGSVADTITGGAGGDTITTNAGDDTVAGGGGNDTITGGAGD------- 1595
+ G G D L G DT++GG G D + G+D + G GN+ + GG GD
Sbjct: 748 LIEGNDGNDRLYGDKGNDTLSGGNGDDQLYGGDGNDKLIGVAGNNYLNGGDGDDEFQVQG 807

Query: 1596 -----NVITGGAGNDSITAGAGFDNIDSGAGDDTIVFAANMSLSDTVAGGDGNDT 1645
NV+ GG GND + G D +D G GDD + GG GND
Sbjct: 808 NSLAKNVLFGGKGNDKLYGSEGADLLDGGEGDDLL------------KGGYGNDI 850



Score = 60.8 bits (147), Expect = 6e-11
Identities = 36/115 (31%), Positives = 47/115 (40%), Gaps = 6/115 (5%)

Query: 633 DTANGSVVPMTITAGSGGYTGSGGTKADTFTGGAGDDSIDGDAGDDSLVGAAGADDLDGG 692
D GS G G D G G+D++ G GDD L G G D L G
Sbjct: 729 DKFFGSKFTDIFHGADGDDLIEGNDGNDRLYGDKGNDTLSGGNGDDQLYGGDGNDKLIGV 788

Query: 693 AGNDVISGGAGNDTITAGSGN---DNLDGGADADTFTLSTNYTTDDTIVGGAGED 744
AGN+ ++GG G+D + + L GG D S D + GG G+D
Sbjct: 789 AGNNYLNGGDGDDEFQVQGNSLAKNVLFGGKGNDKLYGSEG---ADLLDGGEGDD 840



Score = 60.0 bits (145), Expect = 1e-10
Identities = 33/115 (28%), Positives = 51/115 (44%), Gaps = 12/115 (10%)

Query: 1544 LTGGSGADTLTGGSVADTITGGAGGDTITTNAGDDTVAGGGGND------------TITG 1591
L G G DTL+GG+ D + GG G D + AG++ + GG G+D + G
Sbjct: 758 LYGDKGNDTLSGGNGDDQLYGGDGNDKLIGVAGNNYLNGGDGDDEFQVQGNSLAKNVLFG 817

Query: 1592 GAGDNVITGGAGNDSITAGAGFDNIDSGAGDDTIVFAANMSLSDTVAGGDGNDTI 1646
G G++ + G G D + G G D + G G+D + + G D +
Sbjct: 818 GKGNDKLYGSEGADLLDGGEGDDLLKGGYGNDIYRYLSGYGHHIIDDDGGKEDKL 872



Score = 57.7 bits (139), Expect = 6e-10
Identities = 40/129 (31%), Positives = 54/129 (41%), Gaps = 12/129 (9%)

Query: 633 DTANGSVVPMTITAGSGGYTGSGGTKADTFTGGAGDDSIDGDAGDDSLVGAAGADD---- 688
D G+ + G T SGG D GG G+D + G AG++ L G G D+
Sbjct: 747 DLIEGNDGNDRLYGDKGNDTLSGGNGDDQLYGGDGNDKLIGVAGNNYLNGGDGDDEFQVQ 806

Query: 689 --------LDGGAGNDVISGGAGNDTITAGSGNDNLDGGADADTFTLSTNYTTDDTIVGG 740
L GG GND + G G D + G G+D L GG D + + Y G
Sbjct: 807 GNSLAKNVLFGGKGNDKLYGSEGADLLDGGEGDDLLKGGYGNDIYRYLSGYGHHIIDDDG 866

Query: 741 AGEDSVSMT 749
ED +S+
Sbjct: 867 GKEDKLSLA 875



Score = 52.7 bits (126), Expect = 2e-08
Identities = 37/127 (29%), Positives = 52/127 (40%), Gaps = 16/127 (12%)

Query: 647 GSGGYTGSGGTKADTFTGGAGDDS------------IDGDAGDDSLVGAAGADDLDGGAG 694
G G G + GG GDD + G G+D L G+ GAD LDGG G
Sbjct: 779 GDGNDKLIGVAGNNYLNGGDGDDEFQVQGNSLAKNVLFGGKGNDKLYGSEGADLLDGGEG 838

Query: 695 NDVISGGAGNDTIT--AGSGNDNL-DGGADADTFTLSTNYTTDDTIVGGAGEDSVSMTIA 751
+D++ GG GND +G G+ + D G D +L+ + D G D +
Sbjct: 839 DDLLKGGYGNDIYRYLSGYGHHIIDDDGGKEDKLSLA-DIDFRDVAFKREGNDLIMYKGE 897

Query: 752 GGTTTYT 758
G +
Sbjct: 898 GNVLSIG 904



Score = 52.3 bits (125), Expect = 2e-08
Identities = 39/128 (30%), Positives = 56/128 (43%), Gaps = 25/128 (19%)

Query: 1543 TLTGGSGADTLTGGSVADTITGGAGGDTITTNAGDD------------TVAGGGGNDTIT 1590
TL+GG+G D L GG D + G AG + + GDD + GG GND +
Sbjct: 766 TLSGGNGDDQLYGGDGNDKLIGVAGNNYLNGGDGDDEFQVQGNSLAKNVLFGGKGNDKLY 825

Query: 1591 GGAGDNVITGGAGNDSITAGAGFDN------------IDSGAGDDTIVFAANMSLSDTVA 1638
G G +++ GG G+D + G G D D G +D + A++ D
Sbjct: 826 GSEGADLLDGGEGDDLLKGGYGNDIYRYLSGYGHHIIDDDGGKEDKLSL-ADIDFRDVAF 884

Query: 1639 GGDGNDTI 1646
+GND I
Sbjct: 885 KREGNDLI 892



Score = 51.9 bits (124), Expect = 3e-08
Identities = 29/78 (37%), Positives = 39/78 (50%), Gaps = 2/78 (2%)

Query: 1116 SISGGGGADTILGSAGAETIIGGAGNDSLDGAAGNDSISGGAGNDTFSFAATGQLNNGDT 1175
+ G G DT+ G G + + GG GND L G AGN+ ++GG G+D F N
Sbjct: 757 RLYGDKGNDTLSGGNGDDQLYGGDGNDKLIGVAGNNYLNGGDGDDEFQVQGNSLAKN--V 814

Query: 1176 IDGGDGTDNLDVTTAVDL 1193
+ GG G D L + DL
Sbjct: 815 LFGGKGNDKLYGSEGADL 832



Score = 41.9 bits (98), Expect = 4e-05
Identities = 25/97 (25%), Positives = 40/97 (41%), Gaps = 4/97 (4%)

Query: 1089 DGNITVLGGSAGDDITLATAIATDKVHSISGGGGADTILGSAGAETIIGGAGNDSLDGAA 1148
DGN ++G + + L D+ + + G G + + G G D LDG
Sbjct: 780 DGNDKLIGVAGNN--YLNGGDGDDEFQVQGNSLAKNVLFGGKGNDKLYGSEGADLLDGGE 837

Query: 1149 GNDSISGGAGNDTFSFAATGQLNNGDTIDGGDGTDNL 1185
G+D + GG GND + + + + D G D L
Sbjct: 838 GDDLLKGGYGNDIYRYLSG--YGHHIIDDDGGKEDKL 872



Score = 35.7 bits (82), Expect = 0.003
Identities = 38/178 (21%), Positives = 63/178 (35%), Gaps = 15/178 (8%)

Query: 1479 VSGGGDTDFQDLIASGTVAVTASGSGNLTIDDAN-------IDGTSASLSVDASA---MS 1528
S GD D + +++G+ + A G G+ + IDGT A+ + + + +
Sbjct: 613 ESHLGDGDDKVFLSAGSANIYA-GKGHDVVYYDKTDTGYLTIDGTKATEAGNYTVTRVLG 671

Query: 1529 GTVSAVATNTSVATTLTGGSGADTLTGGSVADTITGGAGGDTITTNAGDDTVAGGGGNDT 1588
G V + G + S T G + + G D
Sbjct: 672 GDVKVLQEVVKEQEVSVG-KRTEKTQYRSYEFTHINGKNLTETDNLYSVEELIGTTRADK 730

Query: 1589 ITGGAGDNVITGGAGNDSITAGAGFDNIDSGAGDDTIVFAANMSLSDTVAGGDGNDTI 1646
G ++ G G+D I G D + G+DT+ D + GGDGND +
Sbjct: 731 FFGSKFTDIFHGADGDDLIEGNDGNDRLYGDKGNDTLSGGNG---DDQLYGGDGNDKL 785


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_21061  NUCEPIMERASE  67     1e-14    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 66.7 bits (163), Expect = 1e-14
Identities = 68/333 (20%), Positives = 127/333 (38%), Gaps = 49/333 (14%)

Query: 1 MVFGCTGQDGSLISNSLLRKGYEVLGI---TRSSKPNYQY--LQQLGINQSFEIKKIGVE 55
+V G G G +S LL G++V+GI + + L+ L F+ KI +
Sbjct: 4 LVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLA-QPGFQFHKIDLA 62

Query: 56 D-DLLSFQKLIEFYDPIEIYNLSAQSSVGLSFKEPYNSIKSIYHQTLILLEATRTIEFNG 114
D + ++ L ++ + +V S + P+ S L +LE R +
Sbjct: 63 DREGMT--DLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQ- 119

Query: 115 TIFLAGSSEMFG-ETKVAADINHP-RKPLSPYAHAKEASFSLVKQYRRIYNINCVTGVLF 172
+ A SS ++G K+ + P+S YA K+A+ + Y +Y + TG+ F
Sbjct: 120 HLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLP-ATGLRF 178

Query: 173 NHESTL-----RTDRFVIPKIIKGAIQCKKNKTYKLSLGNIDIYRDWGWAEDYVEAMQRI 227
T+ R D + K K ++ K Y + RD+ + +D EA+ R+
Sbjct: 179 ---FTVYGPWGRPD-MALFKFTKAMLEGKSIDVY----NYGKMKRDFTYIDDIAEAIIRL 230

Query: 228 TRADKKQDHVICTGNLTS-------------------LKDFIKKVFDKMDLNWQDHVTIN 268
D T L D+I+ + D + + N
Sbjct: 231 QDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIE----AKKN 286

Query: 269 KGNKRPHDILKSFGSPQALKLELDWENKKSIED 301
+P D+L++ +AL + + + +++D
Sbjct: 287 MLPLQPGDVLETSADTKALYEVIGFTPETTVKD 319


30  NATL1_00421  NATL1_00471  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
NATL1_00421  -1      10       -5.324898  hypothetical protein
NATL1_00431  0       11       -4.490242  hypothetical protein
NATL1_00441  0       12       -4.708162  hypothetical protein
NATL1_00451  2       20       -1.169283  hypothetical protein
NATL1_00461  1       18       -0.570457  hypothetical protein
NATL1_00471  0       18       -0.520551  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_00421  SYCDCHAPRONE  45     1e-07    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.
Length = 168

Score = 44.9 bits (106), Expect = 1e-07
Identities = 21/97 (21%), Positives = 35/97 (36%)

Query: 48 EQIINQALKFHSKGNISEATKYYQYFINQGFKDHRVFSNYGAILRKQGKVKESGFFMRKA 107
EQ+ + A + G +A K +Q D R F GA + G+ +
Sbjct: 37 EQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYG 96

Query: 108 IEIQPNIPIINFNMGNILKDLGKLKEAEIFIRKSIRM 144
+ P F+ L G+L EAE + + +
Sbjct: 97 AIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQEL 133


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_00431  SYCDCHAPRONE  46     5e-08    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.
Length = 168

Score = 46.1 bits (109), Expect = 5e-08
Identities = 30/131 (22%), Positives = 45/131 (34%), Gaps = 3/131 (2%)

Query: 46 EQIINQAIQFHLKGNIPKATKYYQQLINQECNDYRVFSNYGAILQGLGKSKEAEASLRKA 105
EQ+ + A + G A K +Q L + D R F GA Q +G+ A S
Sbjct: 37 EQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYG 96

Query: 106 VELNPDLAESHSYLGNLLNDLGKFKEAEASLRKAVEL---NPNLALAHAYLGILLNDLGQ 162
++ + L G+ EAE+ L A EL + +L +
Sbjct: 97 AIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTEFKELSTRVSSMLEAIKL 156

Query: 163 LKEAEASLKKA 173
KE E
Sbjct: 157 KKEMEHECVDN 167



Score = 38.4 bits (89), Expect = 2e-05
Identities = 25/118 (21%), Positives = 43/118 (36%), Gaps = 5/118 (4%)

Query: 93 GKSKEAEASLRKAVELNPDLAESHSYLGNLLNDLGKFKEAEASLRKAVELNPNLALAHAY 152
GK ++A + L+ + LG +G++ A S ++ +
Sbjct: 50 GKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFH 109

Query: 153 LGILLNDLGQLKEAEASLKKAIKLKFGSVKAYDAL----SNVLNKLGRKKEAEESSKK 206
L G+L EAE+ L A +L + L S++L + KKE E
Sbjct: 110 AAECLLQKGELAEAESGLFLAQEL-IADKTEFKELSTRVSSMLEAIKLKKEMEHECVD 166


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_00441  SYCDCHAPRONE  43     1e-06    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.
Length = 168

Score = 42.6 bits (100), Expect = 1e-06
Identities = 21/91 (23%), Positives = 32/91 (35%)

Query: 46 EQIISQAIRFHEQGKIIEATKYYQYCLNHDFNDPIVFLNYGTILRSIGKLKKAEIFIRKA 105
EQ+ S A ++ GK +A K +Q D D FL G +++G+ A
Sbjct: 37 EQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYG 96

Query: 106 INIDPNLQDVHFKLGVVLNELNRPKEAIKYF 136
+D F L + EA
Sbjct: 97 AIMDIKEPRFPFHAAECLLQKGELAEAESGL 127


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
NATL1_00471  PF07472  29     0.014    Fucose-binding lectin II
		>PF07472#Fucose-binding lectin II

Length = 245

Score = 29.2 bits (65), Expect = 0.014
Identities = 11/34 (32%), Positives = 20/34 (58%)

Query: 30 VTIFISIFFAVIVTENSIDYQFGDQIDLLDWIIG 63
V IF +F ++ +E+ D + D I +L+W +G
Sbjct: 212 VDIFKKTYFGLVGSEDGTDGDYNDGIAILNWPLG 245


31  NATL1_05761  NATL1_05811  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias   Product
NATL1_05761  -2      12       0.445559  transaldolase B
NATL1_05771  -2      12       0.664447  NAD binding site
NATL1_05781  -3      11       1.659611  ribosome recycling factor
NATL1_05791  -3      11       1.305496  uridylate kinase
NATL1_05801  -3      11       0.500572  cob(I)alamin adenosyltransferase
NATL1_05811  -2      9        0.859727  phage integrase family protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_05761  TYPE3OMGPROT  29     0.025    Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein

family signature.
Length = 607

Score = 29.1 bits (65), Expect = 0.025
Identities = 15/53 (28%), Positives = 25/53 (47%), Gaps = 2/53 (3%)

Query: 31 DATTNPSLIL-AAAQMPAYQSLIDQALTTSREMLGTSAAKVDVVKEALDELCV 82
D + N ++ + +MP YQ LI L + + + VD+ + L EL V
Sbjct: 250 DPSLNAIIVRDSPERMPMYQRLIHA-LDKPSARIEVALSIVDINADQLTELGV 301


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits      Score  E-value  Comments
NATL1_05781  MYCMG045  27     0.039    Hypothetical mycoplasma lipoprotein (MG045) signature.
		>MYCMG045#Hypothetical mycoplasma lipoprotein (MG045) signature.

Length = 483

Score = 27.4 bits (60), Expect = 0.039
Identities = 10/56 (17%), Positives = 26/56 (46%)

Query: 125 LRNIRREAIDRVKKSEKDGDLSEDQSRDEQEKIQKETDNFIKDIEKKLSEKEAEIL 180
L+ I + V + + ++ Q +Q +KE D + + ++ L ++++ L
Sbjct: 375 LKVISDPSTGIVSSKKNNAEMKSKQMSTDQMTSEKEFDYYTETLKALLEKEDSAEL 430


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_05791  CARBMTKINASE  29     0.018    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 28.6 bits (64), Expect = 0.018
Identities = 17/65 (26%), Positives = 25/65 (38%), Gaps = 14/65 (21%)

Query: 115 RAIRHL-EKGRVVVFGGGCGNPFFTT-------------DTTAALRAAEINAEVVFKATK 160
I+ L E+G +V+ GG G P D A E+NA++ T
Sbjct: 177 ETIKKLVERGVIVIASGGGGVPVILEDGEIKGVEAVIDKDLAGEKLAEEVNADIFMILTD 236

Query: 161 VDGVY 165
V+G
Sbjct: 237 VNGAA 241


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
NATL1_05811  INTIMIN  29     0.048    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 28.9 bits (64), Expect = 0.048
Identities = 13/26 (50%), Positives = 18/26 (69%)

Query: 46 KVQRISLKLPHDINGLEEARKAIELI 71
K +SL +PHDING E + + I+LI
Sbjct: 450 KQDILSLNIPHDINGTERSTQKIQLI 475


32  NATL1_06931  NATL1_06991  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
NATL1_06931  0       13       -2.496594  hypothetical protein
NATL1_06941  0       13       -3.205104  short-chain dehydrogenase/reductase
NATL1_06951  0       10       -2.607098  lycopene epsilon cyclase
NATL1_06961  1       11       -4.354032  hypothetical protein
NATL1_06971  -1      11       -4.210753  light-dependent protochlorophyllide
NATL1_06981  0       9        -4.420308  small mechanosensitive ion channel
NATL1_06991  0       10       -4.299646  isochorismatase hydrolase family protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_06931  TYPE3IMRPROT  25     0.027    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein

family signature.
Length = 261

Score = 25.5 bits (56), Expect = 0.027
Identities = 8/41 (19%), Positives = 17/41 (41%), Gaps = 1/41 (2%)

Query: 8 QFLEPIQEQLNKVYSIA-SLALTVLVCLWLFNFIVGLIQRT 47
+ + + ++ LAL ++ L N +GL+ R
Sbjct: 167 NAFLALTKAGSLIFLNGLMLALPLITLLLTLNLALGLLNRM 207


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_06941  DHBDHDRGNASE  63     4e-14    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 63.1 bits (153), Expect = 4e-14
Identities = 40/195 (20%), Positives = 86/195 (44%), Gaps = 7/195 (3%)

Query: 2 RKILISGASRGIGKAIAIKLLKEGHSLSL--GVRERDDLLNTPLDPKINNSDSFLVHTYD 59
+ I+GA++GIG+A+A L +G ++ E+ + + + L + ++++F D
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAF---PAD 65

Query: 60 ATDQNSSKKWVDKTFETFKSVDTIIHCAGIFKTTKLLFNDNEMKDIEDLWKVNVMGPWIL 119
D + + + +D +++ AG+ + L + ++ E + VN G +
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPG--LIHSLSDEEWEATFSVNSTGVFNA 123

Query: 120 TKHAWKYLSLSNSARIIVLVSMSGKRSKGNLCGYSMSKFALMSLCQTMRNEGWGNGIRVT 179
++ KY+ S I+ + S + ++ Y+ SK A + + + E IR
Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183

Query: 180 AICPGWVNTDMAKEI 194
+ PG TDM +
Sbjct: 184 IVSPGSTETDMQWSL 198


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_06971  DHBDHDRGNASE  52     6e-10    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 52.4 bits (125), Expect = 6e-10
Identities = 43/175 (24%), Positives = 67/175 (38%), Gaps = 19/175 (10%)

Query: 16 VMITGGSSGIGFQAVLKLISLGHNIILPCKNISRANEVLTNLFNHSLDESSNKGEIYTP- 74
ITG + GIG L S G +I V N SS K E
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIA----------AVDYNPEKLEKVVSSLKAEARHAE 60

Query: 75 --IMDLSDLSSIDSLCSEVKNRRWTIDVLILNAGLQYTGSKTPRRSTQGIELTFAVNHLS 132
D+ D ++ID + + ++ ID+L+ AG+ G S + E TF+VN
Sbjct: 61 AFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGL-IHSLSDEEWEATFSVNSTG 119

Query: 133 HFYLTQKILP-FIDIRNDPKIIITSSEVHNPNSGGGKVGA-KA---SLGKLKGLE 182
F ++ + +D R+ + + S+ P + + KA K GLE
Sbjct: 120 VFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLE 174


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_06991  ISCHRISMTASE  40     2e-06    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 40.4 bits (94), Expect = 2e-06
Identities = 38/203 (18%), Positives = 69/203 (33%), Gaps = 31/203 (15%)

Query: 5 IKTNPNPINTNNNNNKI---IVEDETLLLIVDVQQKLIKNIKDN----QQLLFNIKKLTD 57
I+ P ++ NK+ + +LLI D+Q + +L NI+KL +
Sbjct: 6 IQPYQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANIRKLKN 65

Query: 58 TCNLLNVRVAIT----EQNPLKLG-----------------KTLESITDNNEYPKFEKME 96
C L + V T QNP K + + ++ K
Sbjct: 66 QCVQLGIPVVYTAQPGSQNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLVLTKWR 125

Query: 97 FSCIYNKNFIKYINDYNFKNIIVSGIESHICVLQTSMDLLQKGLNILIPRDAIGSRNEMD 156
+S N ++ + +I++GI +HI L T+ + + + DA+ +
Sbjct: 126 YSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVADFSLEK 185

Query: 157 NDTAFLRLILS--GAVASTTESL 177
+ A L T L
Sbjct: 186 HQMA-LEYAAGRCAFTVMTDSLL 207


33  NATL1_08531  NATL1_08611  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
NATL1_08531  -3      10       -0.872819  UDP-glucose 4-epimerase
NATL1_08541  -2      13       -2.276265  hypothetical protein
NATL1_08551  -2      13       -2.851643  dTDP-4-dehydrorhamnose 3,5-epimerase
NATL1_08561  0       15       -3.713627  dTDP-4-dehydrorhamnose reductase
NATL1_08571  2       17       -5.200044  dTDP-D-glucose 4,6-dehydratase
NATL1_08581  3       19       -6.264840  UDP-N-acetylglucosamine 2-epimerase
NATL1_08591  3       19       -6.567951  nucleotide-diphosphate-sugar epimerase, membrane
NATL1_08601  3       19       -7.083385  UDP-N-acetylmuramyl pentapeptide
NATL1_08611  4       20       -7.487306  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_08531  NUCEPIMERASE  178    2e-55    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 178 bits (452), Expect = 2e-55
Identities = 80/346 (23%), Positives = 151/346 (43%), Gaps = 44/346 (12%)

Query: 1 MRVLLTGGAGFIGSHIALLLLERGYDVLILDSFANSSSNVIERIENFLDNKALKYK-LRV 59
M+ L+TG AGFIG H++ LLE G+ V+ +D+ + +++ + L +
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQ----ARLELLAQPGFQF 56

Query: 60 INGDIRDKQILESIFSKCVKENKPIEVVIHLAGVKSVCESLTNPLYYWDVNVSGTLNLLL 119
D+ D++ + +F + E V +V SL NP Y D N++G LN+L
Sbjct: 57 HKIDLADREGMTDLF-----ASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILE 111

Query: 120 TMKDYQCYSLVFSSSATIYGLSDYVPILEEQKIS-PITPYGQTKVAVENLFYDLYKSNVN 178
+ + L+++SS+++YGL+ +P + + P++ Y TK A E + + Y
Sbjct: 112 GCRHNKIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAH-TYSHLYG 170

Query: 179 LWKICSLRYFNPVGAHPSGLIGEDPRGIPNNLFPFITQVAIGRQKILNIYGDDWETKDGS 238
L LR+F G P G P+ T+ A+ K +++Y G
Sbjct: 171 L-PATGLRFFTVYG----------PWGRPDMALFKFTK-AMLEGKSIDVYN------YGK 212

Query: 239 GIRDYVHIIDLAEGHLASIDYLNTSESC--------------LEFINLGSGKGYSVFQII 284
RD+ +I D+AE + D + +++ N+G+ + I
Sbjct: 213 MKRDFTYIDDIAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYI 272

Query: 285 RQFELSTGCSIPFSIESRRDGDVAVSYADISKAKRLLSWTPKRSLE 330
+ E + G ++ + GDV + AD ++ +TP+ +++
Sbjct: 273 QALEDALGIEAKKNMLPLQPGDVLETSADTKALYEVIGFTPETTVK 318


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_08561  NUCEPIMERASE  44     3e-07    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 44.4 bits (105), Expect = 3e-07
Identities = 37/159 (23%), Positives = 55/159 (34%), Gaps = 27/159 (16%)

Query: 1 MKVLLTGASGQLGQAIIKS----------------------KPSFVELIAT---TRRELD 35
MK L+TGA+G +G + K K + +EL+A ++D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 36 LADDEACRRAVRQHQPDWVINSGAYTAVDKAEDEKELAMSINTIAPKMFAEELSQTG-GK 94
LAD E + V S AV + + N E
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 95 LLQLSTDFVFDGEQNFPYKTGQK-KKPLGVYGATKAAGE 132
LL S+ V+ + P+ T P+ +Y ATK A E
Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANE 159


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_08571  NUCEPIMERASE  164    3e-50    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 164 bits (418), Expect = 3e-50
Identities = 82/353 (23%), Positives = 147/353 (41%), Gaps = 51/353 (14%)

Query: 16 RILVTGGAGFIGGAVIRKLLKESTSKIFNIDKIGYASDLT---AIDEILRTKDYSDRYDF 72
+ LVTG AGFIG V ++LL+ ++ ID + D++ A E+L + F
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEA-GHQVVGIDNLNDYYDVSLKQARLELLAQPGFQ----F 56

Query: 73 AKIDLSIPDETAKAISDSDPDLIMHLAAESHVDRSIQGPEAFINSNIFGTFNLLEATRKH 132
KIDL+ + + + + V S++ P A+ +SN+ G N+LE R +
Sbjct: 57 HKIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHN 116

Query: 133 YENLSNKRKNDFRFLHISTDEVFGSLGLNGK--FTESTSYD-PRSPYSASKASSDHLVRS 189
L+ S+ V+ GLN K F+ S D P S Y+A+K +++ + +
Sbjct: 117 ---------KIQHLLYASSSSVY---GLNRKMPFSTDDSVDHPVSLYAATKKANELMAHT 164

Query: 190 WHHTFQLPIVITNCSNNFGPWQFPEKLIPVAINKALALKSIPLYGDGENIRDWLYVDDHV 249
+ H + LP +GPW P+ + L KSI +Y G+ RD+ Y+DD
Sbjct: 165 YSHLYGLPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIA 224

Query: 250 DALFLAANKGKIGDS------------------YCVGGYGERKNIEILKIICKILD-EIY 290
+A+ + D+ Y +G + ++ ++ + L E
Sbjct: 225 EAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAK 284

Query: 291 PKHSPFERLITKVQDRKGHDRRYAIDPSKIRNELGWEPKYSLEDRLETTVQWY 343
P + G + D + +G+ P+ +++D ++ V WY
Sbjct: 285 KNMLPL---------QPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWY 328


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_08591  NUCEPIMERASE  58     4e-11    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 57.9 bits (140), Expect = 4e-11
Identities = 50/321 (15%), Positives = 102/321 (31%), Gaps = 75/321 (23%)

Query: 285 TVCITGAGGSIGSELSKQ----------IYNLNPYKMILIDHSESHLYNINKQITSYPDN 334
+TGA G IG +SK+ I NLN Y + + + ++ + P
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQA-------RLELLAQPG- 53

Query: 335 GIEVKAILGSTTDLPFINKVFTDNNVDIIFHAAAYKHVPLVESNPLKGLFNNVFSTEIVC 394
+ D + +F + + +F + V NP +N+ +
Sbjct: 54 ---FQFHKIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNIL 110

Query: 395 KAALEAGANNLVLIST---------------DKAVRPTNVMGASKRLSELVVQAIAEKSK 439
+ +L+ S+ D P ++ A+K+ +EL+ +
Sbjct: 111 EGCRHNKIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYG 170

Query: 440 ENSIAKKTCFSMVRFGNVLGSSGS---VLPLFQEQIDNGGPITL-THPRIIRYFMTISEA 495
+ +RF V G G L F + + G I + + ++ R F I +
Sbjct: 171 LPATG-------LRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDI 223

Query: 496 SQLVIQ------------------SKVLAEGGDVFHLDMGKPVSIKSLAEQLILLNGLSI 537
++ +I+ V+++ PV + + L
Sbjct: 224 AEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQAL-------- 275

Query: 538 KDNKNLEGDIEIKFTGLRPGE 558
L + + L+PG+
Sbjct: 276 --EDALGIEAKKNMLPLQPGD 294


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_08611  NUCEPIMERASE  50     4e-09    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 49.8 bits (119), Expect = 4e-09
Identities = 58/352 (16%), Positives = 114/352 (32%), Gaps = 94/352 (26%)

Query: 10 VIVSGANGFTGKFVCKELIKNKINFIAL----------LRPGSI-----PDW-FNKNKIE 53
+V+GA GF G V K L++ + + L+ + P + F+K +
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDLA 62

Query: 54 FR------FADLNSYDELHS--------QLNGCRALINVASIGFGSAKNIIKSCYKSNIE 99
R FA + S L A + GF NI++ C + I+
Sbjct: 63 DREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGF---LNILEGCRHNKIQ 119

Query: 100 RVIFISTTAI--------FTRLNASSKTIRLEAENDIINSK----------LKWTIIRPT 141
+++ S++++ F+ ++ + L A N L T +R
Sbjct: 120 HLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFF 179

Query: 142 MIYGSPKDRNM--IKLIKWIDNMPIIPIFGNGKSLQQPVNVKDVAWSLVKIIDKKSTY-- 197
+YG +M K K + I ++ GK + + D+A +++++ D
Sbjct: 180 TVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHADT 239

Query: 198 ---------------YRSFNISGKEPLTFTQIVDIIEKMLNKSIIKIYLSKNITLLFIGL 242
YR +NI P+ + +E L K L
Sbjct: 240 QWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNML----------- 288

Query: 243 LERLRIKFPIKSEQVHRLNENKDFIHEKAKRAFNYNP-LSFEEGINIEIESY 293
P++ V + + + P + ++G+ + Y
Sbjct: 289 --------PLQPGDVLETSADTK----ALYEVIGFTPETTVKDGVKNFVNWY 328


34  NATL1_18171  NATL1_18211  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
NATL1_18171  0       15       -5.664186  hypothetical protein
NATL1_18181  0       15       -5.284573  hypothetical protein
NATL1_18191  0       15       -5.676029  hypothetical protein
NATL1_18201  0       15       -4.854877  hypothetical protein
NATL1_18211  1       14       -3.405989  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_18171  TYPE3IMSPROT  29     0.002    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein

family signature.
Length = 354

Score = 29.3 bits (66), Expect = 0.002
Identities = 11/70 (15%), Positives = 28/70 (40%), Gaps = 5/70 (7%)

Query: 14 LVIACLTVSLFI----GFLISINLIVDPIIYWLASTEGARIIISCILSWFGVSTLFDFLY 69
+V+ + + + I L+ + + R ++ F V ++ D+ +
Sbjct: 147 VVLLSILIWIIIKGNLVTLLQLP-TCGIECITPLLGQILRQLMVICTVGFVVISIADYAF 205

Query: 70 KRYRRRKKLK 79
+ Y+ K+LK
Sbjct: 206 EYYQYIKELK 215


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_18191  SYCDCHAPRONE  36     1e-04    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD

chaperone signature.
Length = 168

Score = 35.7 bits (82), Expect = 1e-04
Identities = 19/149 (12%), Positives = 41/149 (27%), Gaps = 12/149 (8%)

Query: 34 INTNTPSKSSKEKIINQALDSHSEGNIQEAKKLYQYLINQGFNDHRVFSNYGVILQNLGK 93
+ T + + L G I ++ + Q + GK
Sbjct: 1 MQQETTDTQEYQLAMESFLKGG--GTIAMLNEISSDTLEQ-------LYSLAFNQYQSGK 51

Query: 94 LKEAKISFRKAIELNPNYHEAHANLGNILRDLGKLEEAEVSTLKAIELNPNFASAHCNLG 153
++A F+ L+ LG + +G+ + A S ++ +
Sbjct: 52 YEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAA 111

Query: 154 ---LILEGLDKIEQSVFSFKRALETNPND 179
L L + E +F + +
Sbjct: 112 ECLLQKGELAEAESGLFLAQELIADKTEF 140


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_18201  SYCDCHAPRONE  44     3e-07    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD

chaperone signature.
Length = 168

Score = 43.8 bits (103), Expect = 3e-07
Identities = 28/131 (21%), Positives = 48/131 (36%), Gaps = 3/131 (2%)

Query: 45 EQIINQAFKFHSQGNISEAAKYYQYCINQAFKDYRVFTNYGVILKKFGKLQEAEKFQREA 104
EQ+ + AF + G +A K +Q D R F G + G+ A
Sbjct: 37 EQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYG 96

Query: 105 IQINPNFAEAYSNLGNILRDLGQLKEAELSFRKAIEI---KSDYAEAYSNLGNILRDLGQ 161
++ + L G+L EAE A E+ K+++ E + + ++L +
Sbjct: 97 AIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTEFKELSTRVSSMLEAIKL 156

Query: 162 LKEAELSFRKA 172
KE E
Sbjct: 157 KKEMEHECVDN 167



Score = 43.0 bits (101), Expect = 5e-07
Identities = 25/118 (21%), Positives = 49/118 (41%), Gaps = 3/118 (2%)

Query: 92 GKLQEAEKFQREAIQINPNFAEAYSNLGNILRDLGQLKEAELSFRKAIEIKSDYAEAYSN 151
GK ++A K + ++ + + LG + +GQ A S+ + +
Sbjct: 50 GKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFH 109

Query: 152 LGNILRDLGQLKEAELSFRKAIEI---KPDYAEAHSNLGNILSDLGIKKEAKLEKQKS 206
L G+L EAE A E+ K ++ E + + ++L + +KKE + E +
Sbjct: 110 AAECLLQKGELAEAESGLFLAQELIADKTEFKELSTRVSSMLEAIKLKKEMEHECVDN 167


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_18211  SYCDCHAPRONE  47     3e-08    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD

chaperone signature.
Length = 168

Score = 46.8 bits (111), Expect = 3e-08
Identities = 30/136 (22%), Positives = 48/136 (35%), Gaps = 11/136 (8%)

Query: 46 EQIINQAFKFHSQGNISEAAKYYQYCINQAFKDYRVFTNYGVILKKFGKLKEAEKCQREA 105
EQ+ + AF + G +A K +Q D R F G + G+ A
Sbjct: 37 EQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYG 96

Query: 106 IQINPNFAEAYSNLGNILSDLGQLKEAELSFRKAIEIKSDYAEAHSNLGNILRDFGQLKE 165
++ + L G+L EAE A E+ +D E F +L
Sbjct: 97 AIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTE-----------FKELST 145

Query: 166 AELSFRKAIEIKSDYA 181
S +AI++K +
Sbjct: 146 RVSSMLEAIKLKKEME 161



Score = 38.8 bits (90), Expect = 2e-05
Identities = 23/118 (19%), Positives = 39/118 (33%), Gaps = 3/118 (2%)

Query: 127 GQLKEAELSFRKAIEIKSDYAEAHSNLGNILRDFGQLKEAELSFRKAIEIKSDYAEAHSN 186
G+ ++A F+ + + LG + GQ A S+ + +
Sbjct: 50 GKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFH 109

Query: 187 LGNILNDLGQLKEAELSFRKAIEI---KPDFANTHNNLGIILSDLDQLKEAELSFRKA 241
L G+L EAE A E+ K +F + +L + KE E
Sbjct: 110 AAECLLQKGELAEAESGLFLAQELIADKTEFKELSTRVSSMLEAIKLKKEMEHECVDN 167



Score = 38.4 bits (89), Expect = 3e-05
Identities = 23/113 (20%), Positives = 40/113 (35%), Gaps = 3/113 (2%)

Query: 175 EIKSDYAEAHSNLGNILNDLGQLKEAELSFRKAIEIKPDFANTHNNLGIILSDLDQLKEA 234
EI SD E +L G+ ++A F+ + + LG + Q A
Sbjct: 30 EISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLA 89

Query: 235 ELSFRKAIEIKPDFIKAYSNLGNILRDLGQLKEAELSFRKAIKI---KPDYAE 284
S+ + + + L G+L EAE A ++ K ++ E
Sbjct: 90 IHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTEFKE 142



Score = 36.4 bits (84), Expect = 1e-04
Identities = 22/118 (18%), Positives = 41/118 (34%), Gaps = 3/118 (2%)

Query: 161 GQLKEAELSFRKAIEIKSDYAEAHSNLGNILNDLGQLKEAELSFRKAIEIKPDFANTHNN 220
G+ ++A F+ + + LG +GQ A S+ + +
Sbjct: 50 GKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFH 109

Query: 221 LGIILSDLDQLKEAELSFRKAIEI---KPDFIKAYSNLGNILRDLGQLKEAELSFRKA 275
L +L EAE A E+ K +F + + + ++L + KE E
Sbjct: 110 AAECLLQKGELAEAESGLFLAQELIADKTEFKELSTRVSSMLEAIKLKKEMEHECVDN 167



Score = 35.7 bits (82), Expect = 2e-04
Identities = 24/118 (20%), Positives = 45/118 (38%), Gaps = 3/118 (2%)

Query: 93 GKLKEAEKCQREAIQINPNFAEAYSNLGNILSDLGQLKEAELSFRKAIEIKSDYAEAHSN 152
GK ++A K + ++ + + LG +GQ A S+ + +
Sbjct: 50 GKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFH 109

Query: 153 LGNILRDFGQLKEAELSFRKAIEI---KSDYAEAHSNLGNILNDLGQLKEAELSFRKA 207
L G+L EAE A E+ K+++ E + + ++L + KE E
Sbjct: 110 AAECLLQKGELAEAESGLFLAQELIADKTEFKELSTRVSSMLEAIKLKKEMEHECVDN 167



Score = 32.6 bits (74), Expect = 0.002
Identities = 17/91 (18%), Positives = 29/91 (31%), Gaps = 13/91 (14%)

Query: 221 LGIILSDLDQL-------------KEAELSFRKAIEIKPDFIKAYSNLGNILRDLGQLKE 267
I L+QL ++A F+ + + + LG + +GQ
Sbjct: 29 NEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDL 88

Query: 268 AELSFRKAIKIKPDYAEAYFNLAYLELLKGN 298
A S+ + F+ A L KG
Sbjct: 89 AIHSYSYGAIMDIKEPRFPFHAAECLLQKGE 119


35  NATL1_20951  NATL1_21021  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
NATL1_20951  0       6        -1.724942  preprotein translocase subunit SecA
NATL1_20961  -2      12       -3.654000  GDP-D-mannose dehydratase
NATL1_20971  -1      12       -4.232750  hypothetical protein
NATL1_20981  0       12       -3.917938  hypothetical protein
NATL1_20991  -2      11       -3.433247  hypothetical protein
NATL1_21001  7       14       1.887172   hypothetical protein
NATL1_21011  6       13       1.513917   ATPase
NATL1_21021  5       14       0.486742   leukotoxin secretion protein-like protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits  Score  E-value  Comments
NATL1_20951  SECA  788    0.0      SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 788 bits (2036), Expect = 0.0
Identities = 288/529 (54%), Positives = 362/529 (68%), Gaps = 17/529 (3%)

Query: 1 MFKKLLGDPNTRKLKRYFPLVSDVNIFEEDLLSLSDDDLRTRTSEFRSKLEKVSSPNEEL 60
+ K+ G N R L+R +V+ +N E ++ LSD++L+ +T+EFR++LEK
Sbjct: 5 LLTKVFGSRNDRTLRRMRKVVNIINAMEPEMEKLSDEELKGKTAEFRARLEKGEV----- 59

Query: 61 SLLDELLPEAFAVVREASKRVLGMRHFDVQLIGGMVLHEGQIAEMKTGEGKTLVATLPSY 120
L+ L+PEAFAVVREASKRV GMRHFDVQL+GGMVL+E IAEM+TGEGKTL ATLP+Y
Sbjct: 60 --LENLIPEAFAVVREASKRVFGMRHFDVQLLGGMVLNERCIAEMRTGEGKTLTATLPAY 117

Query: 121 LNALTGRGVHVVTVNDYLARRDAEWMGQIHRFLGLSVGLVQQSMAPLERKKNYECDITYA 180
LNALTG+GVHVVTVNDYLA+RDAE + FLGL+VG+ M +++ Y DITY
Sbjct: 118 LNALTGKGVHVVTVNDYLAQRDAENNRPLFEFLGLTVGINLPGMPAPAKREAYAADITYG 177

Query: 181 TNSELGFDYLRDNMAADKSEIVQRDFQFCVIDEVDSILIDEARTPLIISGQVERSQEKYK 240
TN+E GFDYLRDNMA E VQR + ++DEVDSILIDEARTPLIISG E S E YK
Sbjct: 178 TNNEYGFDYLRDNMAFSPEERVQRKLHYALVDEVDSILIDEARTPLIISGPAEDSSEMYK 237

Query: 241 QAAQVVENLKRAIDTSKDGIDPEGDYEVDEKQRSCILTDEGFANTEKLLNVQDLFDPKE- 299
+ +++ +L R + EG + VDEK R LT+ G E+LL + + D E
Sbjct: 238 RVNKIIPHLIRQEKEDSETFQGEGHFSVDEKSRQVNLTERGLVLIEELLVKEGIMDEGES 297

Query: 300 -------PWAHYVTNALKAKELFIKDVNYIVRNDEAVIVDEFTGRVMPGRRWSDGQHQAI 352
H+VT AL+A LF +DV+YIV++ E +IVDE TGR M GRRWSDG HQA+
Sbjct: 298 LYSPANIMLMHHVTAALRAHALFTRDVDYIVKDGEVIIVDEHTGRTMQGRRWSDGLHQAV 357

Query: 353 EAKENLSIQPETQTLASITYQNFFLLYPRLSGMTGTAKTEEVEFEKTYKLQTTVVPTNRK 412
EAKE + IQ E QTLASIT+QN+F LY +L+GMTGTA TE EF YKL T VVPTNR
Sbjct: 358 EAKEGVQIQNENQTLASITFQNYFRLYEKLAGMTGTADTEAFEFSSIYKLDTVVVPTNRP 417

Query: 413 ISRQDWVDQVFKTEAAKWRAVAKETADIHQKGRPVLVGTTSVEKSELLSTLLSEQQVPHN 472
+ R+D D V+ TEA K +A+ ++ + KG+PVLVGT S+EKSEL+S L++ + HN
Sbjct: 418 MIRKDLPDLVYMTEAEKIQAIIEDIKERTAKGQPVLVGTISIEKSELVSNELTKAGIKHN 477

Query: 473 LLNAKPENVEREAEIVAQAGRAGAVTIATNMAGRGTDIILGGNSDYMAR 521
+LNAK EA IVAQAG AVTIATNMAGRGTDI+LGG+
Sbjct: 478 VLNAKFH--ANEAAIVAQAGYPAAVTIATNMAGRGTDIVLGGSWQAEVA 524



Score = 392 bits (1008), Expect = e-124
Identities = 118/314 (37%), Positives = 184/314 (58%), Gaps = 3/314 (0%)

Query: 628 IKELRIAIQLIKNEYEEVLSQEETNVRRVGGLHVIGTERHESRRVDNQLRGRAGRQGDLG 687
+ L + + V GGLH+IGTERHESRR+DNQLRGR+GRQGD G
Sbjct: 523 VAALENPTAEQIEKIKADWQVRHDAVLEAGGLHIIGTERHESRRIDNQLRGRSGRQGDAG 582

Query: 688 STRFFLSLEDNLLRIFGGDRVAGLMNAFRVEEDMPIESGMLTRSLEGAQKKVETYYYDIR 747
S+RF+LS+ED L+RIF DRV+G+M ++ IE +T+++ AQ+KVE+ +DIR
Sbjct: 583 SSRFYLSMEDALMRIFASDRVSGMMRKLGMKPGEAIEHPWVTKAIANAQRKVESRNFDIR 642

Query: 748 KQIFEYDEVMNNQRKAVYSERRRVLDGRELKLQVIGYGQRTMEEIVEAYVNEDLPPEEWN 807
KQ+ EYD+V N+QR+A+YS+R +LD ++ + + + ++AY+ E W+
Sbjct: 643 KQLLEYDDVANDQRRAIYSQRNELLDVSDVSETINSIREDVFKATIDAYIPPQSLEEMWD 702

Query: 808 LTNLVSKVKEFIYLLEDLKPEDLLGLNKNELKDFLKEQLRN-AYDMKEAKVEQSHPGIMR 866
+ L ++K L DL + L ++ L+E++ + ++ + K E +MR
Sbjct: 703 IPGLQERLKNDFDL--DLPIAEWLDKEPELHEETLRERILAQSIEVYQRKEEVVGAEMMR 760

Query: 867 QAERFFILQQLDTLWREHLQSMDSLKESVGLRGYGQKDPLIEYKNEGYDMFLEMMVNMRR 926
E+ +LQ LD+LW+EHL +MD L++ + LRGY QKDP EYK E + MF M+ +++
Sbjct: 761 HFEKGVMLQTLDSLWKEHLAAMDYLRQGIHLRGYAQKDPKQEYKRESFSMFAAMLESLKY 820

Query: 927 NVIYSMFMFQPAQK 940
VI ++ Q
Sbjct: 821 EVISTLSKVQVRMP 834


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_20961  NUCEPIMERASE  96     1e-24    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 96.0 bits (239), Expect = 1e-24
Identities = 64/327 (19%), Positives = 122/327 (37%), Gaps = 15/327 (4%)

Query: 8 LITGITGQDGSYLAELLLDKGYKVHGLVRRSSQKNTNLLDNILNSQHNKNLELHYGDLTQ 67
L+TG G G ++++ LL+ G++V G+ + + +L L + H DL
Sbjct: 4 LVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDLAD 63

Query: 68 STNILRIIENIQPDEIYNLGAQSHVQVSFETPEYTAQTDALGPLRILEAIRILQLTKKTK 127
+ + + + ++ + V+ S E P A ++ G L ILE R ++
Sbjct: 64 REGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKI---QH 120

Query: 128 IYQASTSELYGLVQETPQNERTPF-YPRSPYGVAKLYAYWITINYRESYGIFACNGILFN 186
+ AS+S +YGL ++ P + +P S Y K + Y YG+ A F
Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFFT 180

Query: 187 HESPRRGENFVTRKITKGLCEINRGSTDCLYLGNIDSLRDWGHAKDYVEMQWMMLQQEKP 246
P + K TK + E G + +Y RD+ + D E +
Sbjct: 181 VYGPWGRPDMALFKFTKAMLE---GKSIDVY-NYGKMKRDFTYIDDIAEAIIRLQDVIPH 236

Query: 247 EDYVISTGKQTSVRRFVELCAEHLNWGGIIWEGKGIDEIGKRKDNKQVIIRIDPNL--FR 304
D + T ++ + I + + I N+ +
Sbjct: 237 ADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQAL-----EDALGIEAKKNMLPLQ 291

Query: 305 PAEVNSLLGDSEKAYKKLGWKPKYNIE 331
P +V D++ Y+ +G+ P+ ++
Sbjct: 292 PGDVLETSADTKALYEVIGFTPETTVK 318


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
NATL1_20971  CABNDNGRPT  29     0.028    NodO calcium binding signature.
		>CABNDNGRPT#NodO calcium binding signature.

Length = 479

Score = 29.2 bits (65), Expect = 0.028
Identities = 20/46 (43%), Positives = 24/46 (52%), Gaps = 11/46 (23%)

Query: 147 KNTIIHEIGHALGLAHP-----------FNDPFNKNYTTQDTIMSY 181
+ T HEIGHALGLAHP +ND + Q +IMSY
Sbjct: 183 RQTFTHEIGHALGLAHPGEYNAGEGDPSYNDAVYAEDSYQFSIMSY 228


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
NATL1_21021  RTXTOXIND  107    5e-28    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 107 bits (270), Expect = 5e-28
Identities = 59/263 (22%), Positives = 106/263 (40%), Gaps = 55/263 (20%)

Query: 161 DTEITEANQISLIETLKINQEILDNLKYLSEEGAASRIQYLQQSNKV------------- 207
+ A ++ + LD+ L + A ++ L+Q NK
Sbjct: 215 ERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQ 274

Query: 208 -----------------------QEIKTKLKE--------------NEVQMKYQKIISPV 230
EI KL++ NE + + I +PV
Sbjct: 275 LEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPV 334

Query: 231 NGIVFDMQPKGPGYVARTSEPILKVVPLD-KLQAEIEIDSSDIGFISLGKDTDISIDSFP 289
+ V ++ G V T+E ++ +VP D L+ + + DIGFI++G++ I +++FP
Sbjct: 335 SVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFP 394

Query: 290 SSDFGVVEGKIIRIGSDALPPDPRINKGYRFPAIIKLNQQNLILKSGKTLPLQAGMSITA 349
+ +G + GK+ I DA+ D R+ + II + + L K +PL +GM++TA
Sbjct: 395 YTRYGYLVGKVKNINLDAI-EDQRLGLVFN--VIISIEENCLSTG-NKNIPLSSGMAVTA 450

Query: 350 NIKLRKVSYLQLLLNNFSDKADS 372
IK S + LL+ +
Sbjct: 451 EIKTGMRSVISYLLSPLEESVTE 473



Score = 86.0 bits (213), Expect = 1e-20
Identities = 29/133 (21%), Positives = 52/133 (39%), Gaps = 2/133 (1%)

Query: 89 WAEAITWTLIGGTSFGIAWLALAKTEEIVIAQGKLEPKTGVIEVQMPLEGITKEILVKEG 148
+ + ++G L + E + A GKL E++ I KEI+VKEG
Sbjct: 56 RPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEG 115

Query: 149 DRVEKGQILIHLDTEITEANQISLIETLKINQEILDNLKYLSEEGAASRIQYLQQSNKVQ 208
+ V KG +L+ L EA+ + +L + L+ +Y + + + +
Sbjct: 116 ESVRKGDVLLKLTALGAEADTLKTQSSLLQAR--LEQTRYQILSRSIELNKLPELKLPDE 173

Query: 209 EIKTKLKENEVQM 221
+ E EV
Sbjct: 174 PYFQNVSEEEVLR 186


36  NATL1_21051  NATL1_21111  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
NATL1_21051  5       15       0.411247   hypothetical protein
NATL1_21061  1       15       -5.320133  GDP-D-mannose dehydratase
NATL1_21071  0       15       -5.077597  hypothetical protein
NATL1_21081  -2      9        -3.033504  hypothetical protein
NATL1_21091  -3      10       -1.042121  dTDP-glucose pyrophosphorylase
NATL1_21101  -2      12       -1.605221  GNAT family acetyltransferase
NATL1_21111  -2      11       -0.597886  *6,7-dimethyl-8-ribityllumazine synthase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
NATL1_21051  RTXTOXINA  78     3e-16    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family

signature.
Length = 1024

Score = 78.5 bits (193), Expect = 3e-16
Identities = 37/92 (40%), Positives = 50/92 (54%)

Query: 655 GGTKADTFTGGAGDDSIDGDAGDDSLVGAAGADDLDGGAGNDVISGGAGNDTITAGSGND 714
G D F G GDD I+G+ G+D L G G D L GG G+D + GG GND + +GN+
Sbjct: 733 GSKFTDIFHGADGDDLIEGNDGNDRLYGDKGNDTLSGGNGDDQLYGGDGNDKLIGVAGNN 792

Query: 715 NLDGGADADTFTLSTNYTTDDTIVGGAGEDSV 746
L+GG D F + N + + GG G D +
Sbjct: 793 YLNGGDGDDEFQVQGNSLAKNVLFGGKGNDKL 824



Score = 76.5 bits (188), Expect = 1e-15
Identities = 39/104 (37%), Positives = 53/104 (50%)

Query: 1544 LTGGSGADTLTGGSVADTITGGAGGDTITTNAGDDTVAGGGGNDTITGGAGDNVITGGAG 1603
L G + AD G D G G D I N G+D + G GNDT++GG GD+ + GG G
Sbjct: 722 LIGTTRADKFFGSKFTDIFHGADGDDLIEGNDGNDRLYGDKGNDTLSGGNGDDQLYGGDG 781

Query: 1604 NDSITAGAGFDNIDSGAGDDTIVFAANMSLSDTVAGGDGNDTIT 1647
ND + AG + ++ G GDD N + + GG GND +
Sbjct: 782 NDKLIGVAGNNYLNGGDGDDEFQVQGNSLAKNVLFGGKGNDKLY 825



Score = 66.5 bits (162), Expect = 1e-12
Identities = 30/90 (33%), Positives = 43/90 (47%)

Query: 655 GGTKADTFTGGAGDDSIDGDAGDDSLVGAAGADDLDGGAGNDVISGGAGNDTITAGSGND 714
G D G G+D + GD G+D+L G G D L GG GND + G AGN+ + G G+D
Sbjct: 742 GADGDDLIEGNDGNDRLYGDKGNDTLSGGNGDDQLYGGDGNDKLIGVAGNNYLNGGDGDD 801

Query: 715 NLDGGADADTFTLSTNYTTDDTIVGGAGED 744
++ + +D + G G D
Sbjct: 802 EFQVQGNSLAKNVLFGGKGNDKLYGSEGAD 831



Score = 64.6 bits (157), Expect = 4e-12
Identities = 39/115 (33%), Positives = 51/115 (44%), Gaps = 24/115 (20%)

Query: 1543 TLTGGSGADTLTGGSVADTITGGAGGDTITTNAGDDTVAGGGGNDTITGGAGD------- 1595
+ G G D L G DT++GG G D + G+D + G GN+ + GG GD
Sbjct: 748 LIEGNDGNDRLYGDKGNDTLSGGNGDDQLYGGDGNDKLIGVAGNNYLNGGDGDDEFQVQG 807

Query: 1596 -----NVITGGAGNDSITAGAGFDNIDSGAGDDTIVFAANMSLSDTVAGGDGNDT 1645
NV+ GG GND + G D +D G GDD + GG GND
Sbjct: 808 NSLAKNVLFGGKGNDKLYGSEGADLLDGGEGDDLL------------KGGYGNDI 850



Score = 60.8 bits (147), Expect = 6e-11
Identities = 36/115 (31%), Positives = 47/115 (40%), Gaps = 6/115 (5%)

Query: 633 DTANGSVVPMTITAGSGGYTGSGGTKADTFTGGAGDDSIDGDAGDDSLVGAAGADDLDGG 692
D GS G G D G G+D++ G GDD L G G D L G
Sbjct: 729 DKFFGSKFTDIFHGADGDDLIEGNDGNDRLYGDKGNDTLSGGNGDDQLYGGDGNDKLIGV 788

Query: 693 AGNDVISGGAGNDTITAGSGN---DNLDGGADADTFTLSTNYTTDDTIVGGAGED 744
AGN+ ++GG G+D + + L GG D S D + GG G+D
Sbjct: 789 AGNNYLNGGDGDDEFQVQGNSLAKNVLFGGKGNDKLYGSEG---ADLLDGGEGDD 840



Score = 60.0 bits (145), Expect = 1e-10
Identities = 33/115 (28%), Positives = 51/115 (44%), Gaps = 12/115 (10%)

Query: 1544 LTGGSGADTLTGGSVADTITGGAGGDTITTNAGDDTVAGGGGND------------TITG 1591
L G G DTL+GG+ D + GG G D + AG++ + GG G+D + G
Sbjct: 758 LYGDKGNDTLSGGNGDDQLYGGDGNDKLIGVAGNNYLNGGDGDDEFQVQGNSLAKNVLFG 817

Query: 1592 GAGDNVITGGAGNDSITAGAGFDNIDSGAGDDTIVFAANMSLSDTVAGGDGNDTI 1646
G G++ + G G D + G G D + G G+D + + G D +
Sbjct: 818 GKGNDKLYGSEGADLLDGGEGDDLLKGGYGNDIYRYLSGYGHHIIDDDGGKEDKL 872



Score = 57.7 bits (139), Expect = 6e-10
Identities = 40/129 (31%), Positives = 54/129 (41%), Gaps = 12/129 (9%)

Query: 633 DTANGSVVPMTITAGSGGYTGSGGTKADTFTGGAGDDSIDGDAGDDSLVGAAGADD---- 688
D G+ + G T SGG D GG G+D + G AG++ L G G D+
Sbjct: 747 DLIEGNDGNDRLYGDKGNDTLSGGNGDDQLYGGDGNDKLIGVAGNNYLNGGDGDDEFQVQ 806

Query: 689 --------LDGGAGNDVISGGAGNDTITAGSGNDNLDGGADADTFTLSTNYTTDDTIVGG 740
L GG GND + G G D + G G+D L GG D + + Y G
Sbjct: 807 GNSLAKNVLFGGKGNDKLYGSEGADLLDGGEGDDLLKGGYGNDIYRYLSGYGHHIIDDDG 866

Query: 741 AGEDSVSMT 749
ED +S+
Sbjct: 867 GKEDKLSLA 875



Score = 52.7 bits (126), Expect = 2e-08
Identities = 37/127 (29%), Positives = 52/127 (40%), Gaps = 16/127 (12%)

Query: 647 GSGGYTGSGGTKADTFTGGAGDDS------------IDGDAGDDSLVGAAGADDLDGGAG 694
G G G + GG GDD + G G+D L G+ GAD LDGG G
Sbjct: 779 GDGNDKLIGVAGNNYLNGGDGDDEFQVQGNSLAKNVLFGGKGNDKLYGSEGADLLDGGEG 838

Query: 695 NDVISGGAGNDTIT--AGSGNDNL-DGGADADTFTLSTNYTTDDTIVGGAGEDSVSMTIA 751
+D++ GG GND +G G+ + D G D +L+ + D G D +
Sbjct: 839 DDLLKGGYGNDIYRYLSGYGHHIIDDDGGKEDKLSLA-DIDFRDVAFKREGNDLIMYKGE 897

Query: 752 GGTTTYT 758
G +
Sbjct: 898 GNVLSIG 904



Score = 52.3 bits (125), Expect = 2e-08
Identities = 39/128 (30%), Positives = 56/128 (43%), Gaps = 25/128 (19%)

Query: 1543 TLTGGSGADTLTGGSVADTITGGAGGDTITTNAGDD------------TVAGGGGNDTIT 1590
TL+GG+G D L GG D + G AG + + GDD + GG GND +
Sbjct: 766 TLSGGNGDDQLYGGDGNDKLIGVAGNNYLNGGDGDDEFQVQGNSLAKNVLFGGKGNDKLY 825

Query: 1591 GGAGDNVITGGAGNDSITAGAGFDN------------IDSGAGDDTIVFAANMSLSDTVA 1638
G G +++ GG G+D + G G D D G +D + A++ D
Sbjct: 826 GSEGADLLDGGEGDDLLKGGYGNDIYRYLSGYGHHIIDDDGGKEDKLSL-ADIDFRDVAF 884

Query: 1639 GGDGNDTI 1646
+GND I
Sbjct: 885 KREGNDLI 892



Score = 51.9 bits (124), Expect = 3e-08
Identities = 29/78 (37%), Positives = 39/78 (50%), Gaps = 2/78 (2%)

Query: 1116 SISGGGGADTILGSAGAETIIGGAGNDSLDGAAGNDSISGGAGNDTFSFAATGQLNNGDT 1175
+ G G DT+ G G + + GG GND L G AGN+ ++GG G+D F N
Sbjct: 757 RLYGDKGNDTLSGGNGDDQLYGGDGNDKLIGVAGNNYLNGGDGDDEFQVQGNSLAKN--V 814

Query: 1176 IDGGDGTDNLDVTTAVDL 1193
+ GG G D L + DL
Sbjct: 815 LFGGKGNDKLYGSEGADL 832



Score = 41.9 bits (98), Expect = 4e-05
Identities = 25/97 (25%), Positives = 40/97 (41%), Gaps = 4/97 (4%)

Query: 1089 DGNITVLGGSAGDDITLATAIATDKVHSISGGGGADTILGSAGAETIIGGAGNDSLDGAA 1148
DGN ++G + + L D+ + + G G + + G G D LDG
Sbjct: 780 DGNDKLIGVAGNN--YLNGGDGDDEFQVQGNSLAKNVLFGGKGNDKLYGSEGADLLDGGE 837

Query: 1149 GNDSISGGAGNDTFSFAATGQLNNGDTIDGGDGTDNL 1185
G+D + GG GND + + + + D G D L
Sbjct: 838 GDDLLKGGYGNDIYRYLSG--YGHHIIDDDGGKEDKL 872



Score = 35.7 bits (82), Expect = 0.003
Identities = 38/178 (21%), Positives = 63/178 (35%), Gaps = 15/178 (8%)

Query: 1479 VSGGGDTDFQDLIASGTVAVTASGSGNLTIDDAN-------IDGTSASLSVDASA---MS 1528
S GD D + +++G+ + A G G+ + IDGT A+ + + + +
Sbjct: 613 ESHLGDGDDKVFLSAGSANIYA-GKGHDVVYYDKTDTGYLTIDGTKATEAGNYTVTRVLG 671

Query: 1529 GTVSAVATNTSVATTLTGGSGADTLTGGSVADTITGGAGGDTITTNAGDDTVAGGGGNDT 1588
G V + G + S T G + + G D
Sbjct: 672 GDVKVLQEVVKEQEVSVG-KRTEKTQYRSYEFTHINGKNLTETDNLYSVEELIGTTRADK 730

Query: 1589 ITGGAGDNVITGGAGNDSITAGAGFDNIDSGAGDDTIVFAANMSLSDTVAGGDGNDTI 1646
G ++ G G+D I G D + G+DT+ D + GGDGND +
Sbjct: 731 FFGSKFTDIFHGADGDDLIEGNDGNDRLYGDKGNDTLSGGNG---DDQLYGGDGNDKL 785


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_21061  NUCEPIMERASE  67     1e-14    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 66.7 bits (163), Expect = 1e-14
Identities = 68/333 (20%), Positives = 127/333 (38%), Gaps = 49/333 (14%)

Query: 1 MVFGCTGQDGSLISNSLLRKGYEVLGI---TRSSKPNYQY--LQQLGINQSFEIKKIGVE 55
+V G G G +S LL G++V+GI + + L+ L F+ KI +
Sbjct: 4 LVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLA-QPGFQFHKIDLA 62

Query: 56 D-DLLSFQKLIEFYDPIEIYNLSAQSSVGLSFKEPYNSIKSIYHQTLILLEATRTIEFNG 114
D + ++ L ++ + +V S + P+ S L +LE R +
Sbjct: 63 DREGMT--DLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQ- 119

Query: 115 TIFLAGSSEMFG-ETKVAADINHP-RKPLSPYAHAKEASFSLVKQYRRIYNINCVTGVLF 172
+ A SS ++G K+ + P+S YA K+A+ + Y +Y + TG+ F
Sbjct: 120 HLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLP-ATGLRF 178

Query: 173 NHESTL-----RTDRFVIPKIIKGAIQCKKNKTYKLSLGNIDIYRDWGWAEDYVEAMQRI 227
T+ R D + K K ++ K Y + RD+ + +D EA+ R+
Sbjct: 179 ---FTVYGPWGRPD-MALFKFTKAMLEGKSIDVY----NYGKMKRDFTYIDDIAEAIIRL 230

Query: 228 TRADKKQDHVICTGNLTS-------------------LKDFIKKVFDKMDLNWQDHVTIN 268
D T L D+I+ + D + + N
Sbjct: 231 QDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIE----AKKN 286

Query: 269 KGNKRPHDILKSFGSPQALKLELDWENKKSIED 301
+P D+L++ +AL + + + +++D
Sbjct: 287 MLPLQPGDVLETSADTKALYEVIGFTPETTVKD 319


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_21101  SACTRNSFRASE  34     1e-04    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 34.1 bits (78), Expect = 1e-04
Identities = 16/74 (21%), Positives = 30/74 (40%), Gaps = 4/74 (5%)

Query: 70 DCLIGFGRATSDRIFRAVLWDIVVKSEFKGVGIGKLIVENLINKKSIKNVEKIYLMTTTK 129
+ IG + S+ A++ DI V +++ G+G ++ I + + L T
Sbjct: 74 NNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDI 133

Query: 130 S----SFYTKFGFK 139
+ FY K F
Sbjct: 134 NISACHFYAKHHFI 147


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
NATL1_21111  ADHESNFAMILY  26     0.048    Adhesin family signature.
		>ADHESNFAMILY#Adhesin family signature.

Length = 309

Score = 26.4 bits (58), Expect = 0.048
Identities = 10/34 (29%), Positives = 21/34 (61%), Gaps = 4/34 (11%)

Query: 96 VVVSEAS---KGIATVSRETGVPIIFGVLTTDTM 126
+ E+S + + TVS++T +P I+ + TD++
Sbjct: 250 SLFVESSVDDRPMKTVSQDTNIP-IYAQIFTDSI 282



 
Contact Sachin Pundhir for Bugs/Comments.