PredictBias

Identification of genomic and pathogenicity islands in prokaryotic genomes
 
A) Input parameters
Genome: NC_002678.gb
Threshold dinucleotide bias: 2
Threshold codon bias: 4
Threshold %GC bias: 3
E-value (RPSBlast): 0.05
Genome (non-pathogenic): (none given)
 
 
C) Potential GIs and PAIs in NC_002678
S.No  Start    End      Bias  Virulence  Insertion elements  Prediction
1     mll0428  mll0489  Y     Y          Y                   Pathogenicity Island (biased-composition)

LocusTag DNBias  CDNBias  %GCBias     Product
mll0428  3       27       -4.576654   hypothetical protein
mll0429  2       27       -4.627238   hypothetical protein
msl0430  3       29       -6.484282   hypothetical protein
msl0431  2       29       -5.773141   hypothetical protein
mlr0432  4       27       -4.628707   hypothetical protein
mll0433  3       25       -4.347999   hypothetical protein
msl0434  3       27       -4.240913   hypothetical protein
msl0435  2       22       -1.701691   hypothetical protein
mll0437  3       21       -1.394085   hypothetical protein
msl0438  6       19       -0.995860   hypothetical protein
msl0439  6       19       -2.082540   hypothetical protein
msl0440  5       22       -0.721976   hypothetical protein
mll0441  2       19       0.246681    hypothetical protein
mll0442  2       20       -0.016025   hypothetical protein
mll0443  2       20       -0.084159   hypothetical protein
mlr0445  2       19       -0.309024   hypothetical protein
mll0446  1       17       -0.279053   hypothetical protein
mll0448  0       13       -0.714389   hypothetical protein
mll0449  2       12       -1.479684   hypothetical protein
mll0450  3       12       -1.969612   hypothetical protein
mll0452  2       11       -1.321034   hypothetical protein
mll0453  1       14       -1.601022   hypothetical protein
mll0454  2       17       -0.408619   hypothetical protein
mll0455  3       18       -0.491949   hypothetical protein
mll0457  0       13       -1.835880   hypothetical protein
mll0458  0       15       -1.524909   hypothetical protein
mll0459  0       16       -2.253575   hypothetical protein
mll0460  0       17       -2.102811   hypothetical protein
mll0461  -1      15       -3.118147   hypothetical protein
mll0462  -2      15       -3.322401   hypothetical protein
mll0463  2       23       -4.127887   bacteriophage terminase large subunit-like
mll0464  1       26       -5.443513   bacteriophage packaging protein gp3-like
mll0465  2       29       -5.384195   hypothetical protein
msr0466  4       30       -5.758629   hypothetical protein
mll0467  4       31       -4.685372   hypothetical protein
mll0468  4       32       -5.082205   hypothetical protein
mll0469  4       35       -5.361563   hypothetical protein
msl0470  4       30       -3.825515   hypothetical protein
msr0471  5       32       -5.233315   hypothetical protein
mll0472  4       33       -5.214601   hypothetical protein
mll0473  4       33       -6.153386   hypothetical protein
mlr0474  5       33       -5.813411   hypothetical protein
mlr0475  4       32       -5.822611   bacteriophage integrase
msl8587  5       39       -7.815846   *hypothetical protein
mll0476  3       38       -7.284698   hypothetical protein
mlr0478  4       39       -8.254031   hypothetical protein
mlr0479  3       40       -8.266280   succinoglycan biosynthesis protein exoI
mlr0480  3       41       -8.950849   hypothetical protein
mll0481  5       42       -9.041974   hypothetical protein
mll0482  4       41       -8.769946   hypothetical protein
mll0483  4       41       -8.488517   hypothetical protein
mll0485  4       41       -8.096209   hypothetical protein
mll0486  3       35       -7.338926   hypothetical protein
mll0487  3       32       -6.374090   prophage integrase
mlr0488  0       24       -3.859940   hypothetical protein
mll0489  0       21       -3.491494   transcriptional regulator

ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
msl0435   ACRIFLAVINRP  24     0.045    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 24.4 bits (53), Expect = 0.045
Identities = 9/53 (16%), Positives = 20/53 (37%), Gaps = 3/53 (5%)

Query: 17 VPKTAGADIISVADLTEKFFADAKA---KGMSSTEIEEDTGSVYEAILDAIVH 66
+ GA+ + A + A+ + +GM + T V +I + +
Sbjct: 291 IKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKT 343
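
The statistics lines in these alignment blocks (`Score = ..., Expect = ...` and `Identities = ...`) can be checked mechanically. The following is a minimal Python sketch, not part of PredictBias. `parse_hsp` and `E_VALUE_CUTOFF` are hypothetical names, and the cutoff is taken from the E-value (RPSBlast) setting in the input parameters. The percentages are recomputed with integer truncation, which reproduces the figures printed for this hit.

```python
import re

# Assumption: the cutoff equals the "E-value (RPSBlast)" input parameter (0.05).
E_VALUE_CUTOFF = 0.05

# Matches e.g. "Score = 24.4 bits (53), Expect = 0.045"
HSP_RE = re.compile(
    r"Score = (?P<bits>[\d.]+) bits \((?P<raw>\d+)\), Expect = (?P<evalue>[\d.eE+-]+)"
)
# Matches e.g. "Identities = 9/53", "Positives = 20/53", "Gaps = 3/53"
COUNT_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+)")


def parse_hsp(text):
    """Parse one alignment's statistics lines into a dict (hypothetical helper)."""
    match = HSP_RE.search(text)
    if match is None:
        raise ValueError("no Score/Expect line found")
    hsp = {
        "bits": float(match["bits"]),
        "raw": int(match["raw"]),
        "evalue": float(match["evalue"]),
    }
    for name, num, den in COUNT_RE.findall(text):
        # Integer division truncates, matching the percentages shown in the report.
        hsp[name.lower()] = (int(num), int(den), 100 * int(num) // int(den))
    return hsp


example = """Score = 24.4 bits (53), Expect = 0.045
Identities = 9/53 (16%), Positives = 20/53 (37%), Gaps = 3/53 (5%)"""

hsp = parse_hsp(example)
print(hsp)
print("within cutoff:", hsp["evalue"] <= E_VALUE_CUTOFF)
```

Blocks that omit the `Gaps` field, as some of the longer alignments on this page do, simply yield no `gaps` key.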


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll0446   BCTERIALGSPC  30     0.045    Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C signature.

Length = 272

Score = 29.5 bits (66), Expect = 0.045
Identities = 19/61 (31%), Positives = 27/61 (44%), Gaps = 4/61 (6%)

Query: 141 SGNIVGAVAGTLPAVVAAPELFG----AGEAGLGMRSVVSALTGGTINAADSGVRSGGDP 196
S I A A P + LFG +AG S +S L T+N + +GV +G D
Sbjct: 47 SVQITPAQARQQPVTLNDFTLFGVSPEKNKAGALDASQMSNLPPSTLNLSLTGVMAGDDD 106

Query: 197 A 197
+
Sbjct: 107 S 107


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll0448   PF03544       33     0.003    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 33.4 bits (76), Expect = 0.003
Identities = 23/129 (17%), Positives = 35/129 (27%), Gaps = 4/129 (3%)

Query: 269 AQTPDQVPLPPAMPGQTQASAAPPVQTASLDPSIGIAPATKPAPDPGILAAAANAAPTDP 328
AQ + PA QA PP +P P I P
Sbjct: 47 AQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPK 106

Query: 329 GILASPSLPASAPIPTPPPNAGYVPGAPGATSPAGQRVLSTMMQEDPLSGPGGVVQALSA 388
+ P + + AP + + ++ P++ +ALS
Sbjct: 107 PVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATS----KPVTSVASGPRALSR 162

Query: 389 ANPPAPAGA 397
P PA A
Sbjct: 163 NQPQYPARA 171


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll0458   IGASERPTASE   42     6e-06    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 41.6 bits (97), Expect = 6e-06
Identities = 28/189 (14%), Positives = 54/189 (28%), Gaps = 7/189 (3%)

Query: 27 AETNPVAAEPVSTPNPISTDPKTVEAKPEPKADKAPTTREALKAAAAKVAEKAKADEGDE 86
E +E T + + TVE + + K + T + K ++ +
Sbjct: 1083 NEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVS----PKQEQSET 1138

Query: 87 GKKPAPVQSQPKTGEKPADKAALPDPKPTKGA---ETTTTAKPADTTMRAEPKATSHHEA 143
+ A + + + + ET++ + T S E
Sbjct: 1139 VQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVEN 1198

Query: 144 PARFKSDAAAMAEWEKAPEPVKAAVHRSIRELEAGIEKHRVSAEAFEQVKDFDDLAKRNN 203
P ++ K RS+R + +E S+ V D + N
Sbjct: 1199 PENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTN 1258

Query: 204 TSLRDAMTR 212
L DA +
Sbjct: 1259 AVLSDARAK 1267



Score = 33.1 bits (75), Expect = 0.002
Identities = 27/165 (16%), Positives = 42/165 (25%), Gaps = 28/165 (16%)

Query: 31 PVAAEPVSTPNPISTDPKTVEAKPE-----------PKADKAPTTREALKAAAAKVAEKA 79
V ++TPN I D +V + E P A P+ A +K K
Sbjct: 991 TVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKT 1050

Query: 80 KADEGDEGKKPAPVQSQPKTGEKPADKAALPDPKPTKGAETTTTAKPADTTMRA------ 133
+ + + K KA + + T + +T A
Sbjct: 1051 VEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEE 1110

Query: 134 -----------EPKATSHHEAPARFKSDAAAMAEWEKAPEPVKAA 167
PK TS AE + +P
Sbjct: 1111 KAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNI 1155



Score = 32.7 bits (74), Expect = 0.003
Identities = 23/163 (14%), Positives = 43/163 (26%), Gaps = 6/163 (3%)

Query: 5 NNQQSLRRSNPMDDMNGGASAPAETNPVAAEPVSTPNPISTDPKTVEAKPEPKADKAPTT 64
Q + + + A E + S +P +TV+ + EP + PT
Sbjct: 1094 ETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPT- 1152

Query: 65 REALKAAAAKVAEKAKADEGDEGKKPAPVQSQPKTGEKPADKAALPDPKPTKGAETTTTA 124
+K ++ AD K+ + QP T + T T
Sbjct: 1153 -VNIKEPQSQ--TNTTADTEQPAKETSSNVEQPVTESTTVN--TGNSVVENPENTTPATT 1207

Query: 125 KPADTTMRAEPKATSHHEAPARFKSDAAAMAEWEKAPEPVKAA 167
+P + + H + + V
Sbjct: 1208 QPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALC 1250


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll0460   cloacin       39     4e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.5 bits (89), Expect = 4e-05
Identities = 25/81 (30%), Positives = 32/81 (39%), Gaps = 1/81 (1%)

Query: 111 GLANGGGGGAAGPHGNGAAGGSGGGSGGGGNGGGSTGAASSFPSGGAGGNNFGGTGGGAG 170
G G GA GN GG G GGG GS ++ + P GG G+ GG
Sbjct: 4 GDGRGHNTGAHSTSGN-INGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 171 GAATALPGAAGTNGGGGGGGS 191
G + G +G GG +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSA 83



Score = 37.4 bits (86), Expect = 1e-04
Identities = 32/90 (35%), Positives = 39/90 (43%), Gaps = 5/90 (5%)

Query: 97 ASGIGDVKYSGGARGLANGGGGGAAGPHGNGAAGGSGGGS-----GGGGNGGGSTGAASS 151
+ G G +G N GG G GA+ GSG S GGG G G S
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 152 FPSGGAGGNNFGGTGGGAGGAATALPGAAG 181
+GG GN+ GG+G G +A A P A G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 36.2 bits (83), Expect = 2e-04
Identities = 33/95 (34%), Positives = 39/95 (41%), Gaps = 16/95 (16%)

Query: 139 GGNGGGSTGAASSFPSGGAGGNNFGGTGGGAGGAATALPGAAGTNGGGGGGGSPDDSDTP 198
GG+G G A S +G N G TG G GG A+ G + N GGG
Sbjct: 3 GGDGRGHNTGAHS----TSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSG----I 54

Query: 199 GVGGNGGNGTEWDATHGSGGGAGAGGDASTAGGSG 233
GG G HG+GGG G G S GG+
Sbjct: 55 HWGGGSG--------HGNGGGNGNSGGGSGTGGNL 81



Score = 34.7 bits (79), Expect = 6e-04
Identities = 34/100 (34%), Positives = 44/100 (44%), Gaps = 4/100 (4%)

Query: 84 AAKAGGASLGGDAASGIGDVKYSGGARGLANG--GGGGAAGPH-GNGAAGGSGGGSGGGG 140
A G GG G+G G N GGG +G H G G+ G+GGG+G G
Sbjct: 13 AHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSG 72

Query: 141 NGGGSTGAASSFPSGGAGGNNFGGTGGGAGGAATALPGAA 180
G G+ G S+ + A G T GAGG A ++ A
Sbjct: 73 GGSGTGGNLSAVAAPVAFGFPALST-PGAGGLAVSISAGA 111



Score = 34.7 bits (79), Expect = 6e-04
Identities = 31/111 (27%), Positives = 45/111 (40%), Gaps = 10/111 (9%)

Query: 165 TGGGAGGAATALPGAAGTNGGGGGGGSPDDSDTPGVGGNGGNGTEWDATHGS-GGGAGAG 223
+GG G T +G GG G GVGG +G+ W + + GGG+G+G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGL--------GVGGGASDGSGWSSENNPWGGGSGSG 53

Query: 224 -GDASTAGGSGGLYGGGAGRSNLSGGQGIIVITYTPGGAPDITGTLAATLA 273
+G G G +G + +GG V G P ++ A LA
Sbjct: 54 IHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll0462   IGASERPTASE   34     0.003    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 33.9 bits (77), Expect = 0.003
Identities = 32/176 (18%), Positives = 64/176 (36%), Gaps = 8/176 (4%)

Query: 619 KAAAAQAKPPPPDPAMLKAQAEIQALQAKSEAEAQSHAQEHAYKMQTLAAERDVKVQEAQ 678
A +A PPP PA E A +K E++ ++ A +T A R+V +EA+
Sbjct: 1017 IARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDA--TETTAQNREV-AKEAK 1073

Query: 679 IRVGNENARHARDMQVKDADMRSKLAEAGYPPDFSIDGANQMNQAQFQ---QIMAELTAT 735
V N + Q ++ E + ++ + Q ++ ++++
Sbjct: 1074 SNV-KANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPK 1132

Query: 736 REQSVQQGQQLGQTLVQALQLVVQSNQDTAKAMIAAATAPKRIVRDHAGRPIGAET 791
+EQS + Q + + V + A P + + +P+ T
Sbjct: 1133 QEQS-ETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTEST 1187


S.No  Start    End      Bias  Virulence  Insertion elements  Prediction
2     mll0558  mlr0587  Y     Y          N                   Pathogenicity Island (biased-composition)

LocusTag DNBias  CDNBias  %GCBias     Product
mll0558  3       12       1.399614    hypothetical protein
mll0559  2       13       1.605638    glutathione S-transferase
mll0560  2       14       1.618138    succinoglycan biosynthesis protein ExoI
mlr0562  3       14       1.382687    histidine kinase sensor protein
mll0563  3       17       1.544246    nicotinate-nucleotide--dimethylbenzimidazole
mlr0564  2       17       -0.563757   hypothetical protein
mlr0565  2       21       -1.631321   hypothetical protein
mll0566  1       24       -3.041876   tRNA-dihydrouridine synthase A
msl0567  1       30       -4.339945   hypothetical protein
mlr0568  1       30       -4.254815   hypothetical protein
mlr0569  1       29       -3.246301   hypothetical protein
mlr0571  0       30       -3.426117   hypothetical protein
mll0572  1       27       -2.554421   hypothetical protein
mll0573  2       28       -2.630815   hypothetical protein
mll0574  1       27       -2.384012   hypothetical protein
mll0576  2       31       -4.583318   hypothetical protein
msl0577  1       31       -6.756512   hypothetical protein
mll0578  0       30       -6.562297   hypothetical protein
mll0579  1       33       -8.444016   hypothetical protein
msl8590  7       45       -0.754762   hypothetical protein
mlr0581  11      45       1.553384    hypothetical protein
mll0582  10      40       2.362820    hypothetical protein
mll0583  11      40       3.404974    hypothetical protein
msr0584  11      39       3.522976    hypothetical protein
mlr0585  10      37       2.854673    hypothetical protein
mlr0587  8       31       1.884512    hypothetical protein

ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr0562   PF06580       33     0.003    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 32.5 bits (74), Expect = 0.003
Identities = 23/104 (22%), Positives = 41/104 (39%), Gaps = 26/104 (25%)

Query: 387 LLSNAIKF----TATGGEIRVRVGWTAGGGQYISVKDNGPGIPEDEIPVVLSAFGQGSIA 442
L+ N IK GG+I ++ G G + V++ G A
Sbjct: 263 LVENGIKHGIAQLPQGGKILLK-GTKDNGTVTLEVENTGSL------------------A 303

Query: 443 IKSAEQGTGLGLPIV-QGLLAMHGGEFELH--SKLREGTEAIAI 483
+K+ ++ TG GL V + L ++G E ++ K + + I
Sbjct: 304 LKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLI 347


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr0587   ICENUCLEATIN  35     0.006    Ice nucleation protein signature.
		>ICENUCLEATIN#Ice nucleation protein signature.

Length = 1258

Score = 35.1 bits (80), Expect = 0.006
Identities = 170/901 (18%), Positives = 272/901 (30%)

Query: 808 ATGSTGATGSTGATGSTGATGSTGATGSTGATGATGATGSTGATGATGSTGATGSTGATG 867
A + S G+ + + + AT + +G S G
Sbjct: 120 AGPGSPDVTSEVKVGNRSLPVTDDIDATIESGSTQPTQTIEIATYGSTLSGTHQSQLIAG 179

Query: 868 STGATGATGATGSTGATGATGSTGATGATGSTGATGSTGVTGATGSTGATGATGATGSTG 927
A ++ G+TG+ GA + + T ++ G +
Sbjct: 180 YGSTETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTAGEESSQMAGYGSTQTGMKGSD 239

Query: 928 ATGSTGNTGATGSTGATGATGATGDTGATGSTGATGSTGATGATGDTGATGSTGATGSTG 987
T G+TG G + A + T S+ G A + T G+TG+ G
Sbjct: 240 LTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAG 299

Query: 988 ATGATGDTGATGSTGATGATGATGDTGATGSTGATGSTGATGDTGATGSTGATGATGATG 1047
A + + T +T G + + T G TG G + A +
Sbjct: 300 ADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGST 359

Query: 1048 DTGATGSTGATGATGATGATGATGDTGATGSTGATGATGDTGATGSTGATGSTGATGDTG 1107
T S+ G A + T GSTG GA A + T +T G
Sbjct: 360 QTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAG 419

Query: 1108 TTGSTGATGATGATGDTGATGATGATGSTGATGATGATGDTGATGSTGATGATGATGDTG 1167
+ A + T G+TG G S A + T ++ + G A +
Sbjct: 420 YGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSD 479

Query: 1168 ATGDTGVTGSTGATGATGATGATGDTGATGSTGATGATGSTGTTGATGATGDTGATGDTG 1227
T G T + G + A + T GST G + + G+T G
Sbjct: 480 LTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAG 539

Query: 1228 ATGSTGATGATGATGDTGATGSTGATGSTGATGATGDTGATGSTGDTGATGATGATGATG 1287
A S A + T + + G + A + T GSTG G+ + A +
Sbjct: 540 ANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGST 599

Query: 1288 ATGDTGATGSTGATGATGATGDTGATGSTGATGATGATGATGDTGATGATGATGATGATG 1347
T ++ + G A + T G+T GA + + T + G
Sbjct: 600 QTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAG 659

Query: 1348 DTGATGSTGATGSTGATGDTGATGSTGATGATGATGDTGATGSTGATGATGATGDTGATG 1407
+ + T G T G+ + A + T S G +
Sbjct: 660 YGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSD 719

Query: 1408 STGATGATGATGDTGATGSTGATGSTGATGDTGATGSTGATGATGDTGATGSTGATGATG 1467
T G+T G + + + T + + G A + T G+T G
Sbjct: 720 LTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAG 779

Query: 1468 ATGSTGATGDTGATGSTGATGATGATGDTGATGSTGATGATGATGDTGSTGATGATGATG 1527
A S A + T + G A + T G+T G+ + A +
Sbjct: 780 ADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDLTTGYGSTSTAGADSSLIAGYGST 839

Query: 1528 DTGATGSTGATGATGATGATGATGDTGATGSTGATGATGATGDTGATGSTGATGATGDTG 1587
T S G A + T GST G + + T + G
Sbjct: 840 QTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYDSSLIAGYGSTQTAGYNSILTAG 899

Query: 1588 ATGSTGATGATGATGDTGATGSTGATGATGATGDTGATGSTGATGATGATGATGATGDTG 1647
+ A + T G+T + G + A + T S +T G + A +
Sbjct: 900 YGSTQTAQENSDLTTGYGSTSTAGYESSLIAGYGSTQTASFKSTLMAGYGSSQTAREQSS 959

Query: 1648 ATGSTGATGSTGATGDTGATGATGATGDTGATGATGATGATGATGDTGATGATGSTGATG 1707
T G+T G A + T +T G A + T GST G
Sbjct: 960 LTAGYGSTSMAGYDSSLIAGYGSTQTAGYQSTLTAGYGSTQTAEHSSTLTAGYGSTATAG 1019

Query: 1708 A 1708
A
Sbjct: 1020 A 1020



Score = 35.1 bits (80), Expect = 0.006
Identities = 168/877 (19%), Positives = 270/877 (30%)

Query: 1450 ATGDTGATGSTGATGATGATGSTGATGDTGATGSTGATGATGATGDTGATGSTGATGATG 1509
+ + AT + +G + G A + G+TG G
Sbjct: 144 IDATIESGSTQPTQTIEIATYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGTAG 203

Query: 1510 ATGDTGSTGATGATGATGDTGATGSTGATGATGATGATGATGDTGATGSTGATGATGATG 1569
A + + T + G + T G TG G + A +
Sbjct: 204 ADSTLVAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGST 263

Query: 1570 DTGATGSTGATGATGDTGATGSTGATGATGATGDTGATGSTGATGATGATGDTGATGSTG 1629
T S+ G A + T G+TG GA S A + T +T + G
Sbjct: 264 QTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAG 323

Query: 1630 ATGATGATGATGATGDTGATGSTGATGSTGATGDTGATGATGATGDTGATGATGATGATG 1689
A + T G+TG+ G S A + T ++ G A +
Sbjct: 324 YGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSD 383

Query: 1690 ATGDTGATGATGSTGATGATGDTGATGSTGATGATGATGDTGATGSTGATGATGATGATG 1749
T G+TG G+ + A + T +T G A + T G+TG G
Sbjct: 384 LTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAG 443

Query: 1750 DTGATGSTGATGATGATGDTGATGSTGATGATGATGDTGATGSTGATGATGDTGATGSTG 1809
D + + + T + G A + T GST G A +
Sbjct: 444 DDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGST 503

Query: 1810 ATGATGATGATGATGDTGATGSTGATGSTGATGDTGATGSTGATGATGATGDTGATGSTG 1869
T G+T G A + G+T GA S A + T + + G
Sbjct: 504 QTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAG 563

Query: 1870 ATGATGATGDTGATGSTGATGATGATGDTGATGSTGATGATGATGDTGATGSTGATGATG 1929
A + T G+TG G+ A + T + ++ G + A +
Sbjct: 564 YGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSV 623

Query: 1930 ATGDTGATGSTGATGATGATGDTGATGSTGATGATGATGDTGATGSTGATGATGATGDTG 1989
T G+T + GA + A + T + G A + T G+T G
Sbjct: 624 LTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAG 683

Query: 1990 ATGSTGATGATGSTGDTGATGSTGATGVTGATGDTGATGATGATGATGATGSTGATGATG 2049
A S A + T + + G A + T G+T GA S A +
Sbjct: 684 ADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGST 743

Query: 2050 STGDTGATGATGDTGATGSTGATGSTGATGATGATGATGATGDTGATGSTGDTGATGSTG 2109
T ++ G + + T G+T GA + + T + + G
Sbjct: 744 QTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAG 803

Query: 2110 ATGATGATGATGATGATGATGATGATGDTGATGSTGATGATGATGDTGATGSTGATGATG 2169
A + T G+T GA A + T + G + A +
Sbjct: 804 YGSTQTAQERSDLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSD 863

Query: 2170 ATGDTGATGSTGATGSTGATGDTGATGSTGATGATGDTGATGSTGATGATGSTGATGATG 2229
T G+T + G S A + T + G + + T G+T G
Sbjct: 864 LTTGYGSTSTAGYDSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAG 923

Query: 2230 DTGATGSTGATGDTGATGSTGATGATGDTGATGSTGATGATGATGATGATGDTGATGSTG 2289
+ + + T + ST G A + T G+T G A +
Sbjct: 924 YESSLIAGYGSTQTASFKSTLMAGYGSSQTAREQSSLTAGYGSTSMAGYDSSLIAGYGST 983

Query: 2290 ATGSTGATGDTGATGSTGATGATGATGDTGATGSTGA 2326
T +T G + A ++ T G+T + GA
Sbjct: 984 QTAGYQSTLTAGYGSTQTAEHSSTLTAGYGSTATAGA 1020



Score = 34.7 bits (79), Expect = 0.008
Identities = 167/910 (18%), Positives = 275/910 (30%)

Query: 1105 DTGTTGSTGATGATGATGDTGATGATGATGSTGATGATGATGDTGATGSTGATGATGATG 1164
D A + G + + AT + +G
Sbjct: 111 DYVACTEMQAGPGSPDVTSEVKVGNRSLPVTDDIDATIESGSTQPTQTIEIATYGSTLSG 170

Query: 1165 DTGATGDTGVTGSTGATGATGATGATGDTGATGSTGATGATGSTGTTGATGATGDTGATG 1224
+ G + A ++ G TG G+ A + T ++ G
Sbjct: 171 THQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTAGEESSQMAGYGS 230

Query: 1225 DTGATGSTGATGATGATGDTGATGSTGATGSTGATGATGDTGATGSTGDTGATGATGATG 1284
+ T G+TG G S A + T + G A + T
Sbjct: 231 TQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTA 290

Query: 1285 ATGATGDTGATGSTGATGATGATGDTGATGSTGATGATGATGATGDTGATGATGATGATG 1344
G+TG GA S A + T +T + G A + T G+TG G
Sbjct: 291 GYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDS 350

Query: 1345 ATGDTGATGSTGATGSTGATGDTGATGSTGATGATGATGDTGATGSTGATGATGATGDTG 1404
+ + T S+ G + + T G TG G+ + A + T
Sbjct: 351 SLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTA 410

Query: 1405 ATGSTGATGATGATGDTGATGSTGATGSTGATGDTGATGSTGATGATGDTGATGSTGATG 1464
ST G + T GSTG GD + + + T ++ + G
Sbjct: 411 GEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGS 470

Query: 1465 ATGATGSTGATGDTGATGSTGATGATGATGDTGATGSTGATGATGATGDTGSTGATGATG 1524
A + T G+T + G + A + T G+T G + +
Sbjct: 471 TQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDLIT 530

Query: 1525 ATGDTGATGSTGATGATGATGATGATGDTGATGSTGATGATGATGDTGATGSTGATGATG 1584
G T G+ + A + T + G A + T GSTG G+
Sbjct: 531 GYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDS 590

Query: 1585 DTGATGSTGATGATGATGDTGATGSTGATGATGATGDTGATGSTGATGATGATGATGATG 1644
A + T + ++ G + A + T G+T + GA + A + T
Sbjct: 591 SIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTA 650

Query: 1645 DTGATGSTGATGSTGATGDTGATGATGATGDTGATGATGATGATGATGDTGATGATGSTG 1704
+ + G + A + T G+T GA + A + T + G
Sbjct: 651 GYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGS 710

Query: 1705 ATGATGDTGATGSTGATGATGATGDTGATGSTGATGATGATGATGDTGATGSTGATGATG 1764
A + T G+T GA A + T + ++ G + + T
Sbjct: 711 TQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTT 770

Query: 1765 ATGDTGATGSTGATGATGATGDTGATGSTGATGATGDTGATGSTGATGATGATGATGATG 1824
G T G+ + A + T S G A + T G+T GA
Sbjct: 771 GYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDLTTGYGSTSTAGADS 830

Query: 1825 DTGATGSTGATGSTGATGDTGATGSTGATGATGATGDTGATGSTGATGATGATGDTGATG 1884
A + T + G + A + T G+T + G + A + T
Sbjct: 831 SLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYDSSLIAGYGSTQTA 890

Query: 1885 STGATGATGATGDTGATGSTGATGATGATGDTGATGSTGATGATGATGDTGATGSTGATG 1944
+ G A ++ T G+T G S A + T +T G
Sbjct: 891 GYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYESSLIAGYGSTQTASFKSTLMAGYGS 950

Query: 1945 ATGATGDTGATGSTGATGATGATGDTGATGSTGATGATGATGDTGATGSTGATGATGSTG 2004
+ A + T G+T G A + T +T G + A ++ T
Sbjct: 951 SQTAREQSSLTAGYGSTSMAGYDSSLIAGYGSTQTAGYQSTLTAGYGSTQTAEHSSTLTA 1010

Query: 2005 DTGATGSTGA 2014
G+T + GA
Sbjct: 1011 GYGSTATAGA 1020



Score = 33.6 bits (76), Expect = 0.018
Identities = 170/879 (19%), Positives = 275/879 (31%)

Query: 1427 TGATGSTGATGDTGATGSTGATGATGDTGATGSTGATGATGATGSTGATGDTGATGSTGA 1486
S +T + +G + G A S+ G+TG+ GA
Sbjct: 145 DATIESGSTQPTQTIEIATYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGA 204

Query: 1487 TGATGATGDTGATGSTGATGATGATGDTGSTGATGATGATGDTGATGSTGATGATGATGA 1546
A + T ++ G + T G TG G + A +
Sbjct: 205 DSTLVAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQ 264

Query: 1547 TGATGDTGATGSTGATGATGATGDTGATGSTGATGATGDTGATGSTGATGATGATGDTGA 1606
T + G A + T GSTG GA A + T +T G
Sbjct: 265 TAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGY 324

Query: 1607 TGSTGATGATGATGDTGATGSTGATGATGATGATGATGDTGATGSTGATGSTGATGDTGA 1666
+ A + T G+TG+ G + A + T ++ + G + A +
Sbjct: 325 GSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDL 384

Query: 1667 TGATGATGDTGATGATGATGATGATGDTGATGATGSTGATGATGDTGATGSTGATGATGA 1726
T G+TG GA + A + T +T G A + T G+TG G
Sbjct: 385 TAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGD 444

Query: 1727 TGDTGATGSTGATGATGATGATGDTGATGSTGATGATGATGDTGATGSTGATGATGATGD 1786
A + T ++ G + + T G T G + A +
Sbjct: 445 DSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQ 504

Query: 1787 TGATGSTGATGATGDTGATGSTGATGATGATGATGATGDTGATGSTGATGSTGATGDTGA 1846
T GST G A + G+T GA A + T S + G
Sbjct: 505 TAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGY 564

Query: 1847 TGSTGATGATGATGDTGATGSTGATGATGATGDTGATGSTGATGATGATGDTGATGSTGA 1906
+ A + T G+TG+ G+ + A + T S ++ G A +
Sbjct: 565 GSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVL 624

Query: 1907 TGATGATGDTGATGSTGATGATGATGDTGATGSTGATGATGATGDTGATGSTGATGATGA 1966
T G+T GA S A + T + + G A + T G+T GA
Sbjct: 625 TTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGA 684

Query: 1967 TGDTGATGSTGATGATGATGDTGATGSTGATGATGSTGDTGATGSTGATGVTGATGDTGA 2026
A + T + G + A + T G+T + GA A +
Sbjct: 685 DSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQ 744

Query: 2027 TGATGATGATGATGSTGATGATGSTGDTGATGATGDTGATGSTGATGSTGATGATGATGA 2086
T + ++ G + A + T G+T G + + + T + G
Sbjct: 745 TASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGY 804

Query: 2087 TGATGDTGATGSTGDTGATGSTGATGATGATGATGATGATGATGATGATGDTGATGSTGA 2146
+ T G+T + GA + A + T + G A ++
Sbjct: 805 GSTQTAQERSDLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDL 864

Query: 2147 TGATGATGDTGATGSTGATGATGATGDTGATGSTGATGSTGATGDTGATGSTGATGATGD 2206
T G+T G S A + T + + G + A ++ T G+T G
Sbjct: 865 TTGYGSTSTAGYDSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGY 924

Query: 2207 TGATGSTGATGATGSTGATGATGDTGATGSTGATGDTGATGSTGATGATGDTGATGSTGA 2266
+ + + T S +T G + + + T GST G A +
Sbjct: 925 ESSLIAGYGSTQTASFKSTLMAGYGSSQTAREQSSLTAGYGSTSMAGYDSSLIAGYGSTQ 984

Query: 2267 TGATGATGATGATGDTGATGSTGATGSTGATGDTGATGS 2305
T +T G A S+ T G+T GA S
Sbjct: 985 TAGYQSTLTAGYGSTQTAEHSSTLTAGYGSTATAGADSS 1023



Score = 33.2 bits (75), Expect = 0.020
Identities = 167/909 (18%), Positives = 280/909 (30%)

Query: 1801 DTGATGSTGATGATGATGATGATGDTGATGSTGATGSTGATGDTGATGSTGATGATGATG 1860
D A A + + G+ + + + AT + +G
Sbjct: 111 DYVACTEMQAGPGSPDVTSEVKVGNRSLPVTDDIDATIESGSTQPTQTIEIATYGSTLSG 170

Query: 1861 DTGATGSTGATGATGATGDTGATGSTGATGATGATGDTGATGSTGATGATGATGDTGATG 1920
+ G A + G+TG GA A + T ++ G
Sbjct: 171 THQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTAGEESSQMAGYGS 230

Query: 1921 STGATGATGATGDTGATGSTGATGATGATGDTGATGSTGATGATGATGDTGATGSTGATG 1980
+ + T G+TG+ G + A + T ++ G A + T
Sbjct: 231 TQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTA 290

Query: 1981 ATGATGDTGATGSTGATGATGSTGDTGATGSTGATGVTGATGDTGATGATGATGATGATG 2040
G+TG GA S A + T +T + G A + T G+TG G
Sbjct: 291 GYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDS 350

Query: 2041 STGATGATGSTGDTGATGATGDTGATGSTGATGSTGATGATGATGATGATGDTGATGSTG 2100
S A + T ++ G + + T G+TG GA + + T
Sbjct: 351 SLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTA 410

Query: 2101 DTGATGSTGATGATGATGATGATGATGATGATGATGDTGATGSTGATGATGATGDTGATG 2160
+T + G A + T G+TG G A + T ++ G
Sbjct: 411 GEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGS 470

Query: 2161 STGATGATGATGDTGATGSTGATGSTGATGDTGATGSTGATGATGDTGATGSTGATGATG 2220
+ A + T G+T + G S A + T G+T G + +
Sbjct: 471 TQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDLIT 530

Query: 2221 STGATGATGDTGATGSTGATGDTGATGSTGATGATGDTGATGSTGATGATGATGATGATG 2280
G+T G + + + T + S G A + T G+TG G+
Sbjct: 531 GYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDS 590

Query: 2281 DTGATGSTGATGSTGATGDTGATGSTGATGATGATGDTGATGSTGATGATGATGDTGATG 2340
A + T S ++ G + A + T G+T + GA + A + T
Sbjct: 591 SIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTA 650

Query: 2341 STGATGSTGATGDTGATGSTGATGSTGDTGATGSTGATGATGATGVTGDRGATGSTGATG 2400
+ + G A + T G T G+ + A + T + + G
Sbjct: 651 GYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGS 710

Query: 2401 STGATGATGATGASGATGDTGATGSTGATGATGDTGATGSTGATGDTGATGSTGATGDTG 2460
+ A + T G+T GA S A + T + S+ G + + T
Sbjct: 711 TQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTT 770

Query: 2461 ATGSTGATGATGSAGATGATGATGDTGATGSTGATGATGATGDTGATGATGDTGATGSTG 2520
GST GA S A + T + + G A + T G T G+
Sbjct: 771 GYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDLTTGYGSTSTAGADS 830

Query: 2521 ATGSTGATGDTGATGSTGATGSTGATGATGATGASGPTGDTGATGSTGATGATGDTGSTG 2580
+ + + T S G A + + G T G + A + T
Sbjct: 831 SLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYDSSLIAGYGSTQTA 890

Query: 2581 ATGSTGATGDTGATGSTGATGATGAPGATGATGDTGATGSTGDTGATGSTGATGATGDTG 2640
S G + + T G+T G + + + T S +T G
Sbjct: 891 GYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYESSLIAGYGSTQTASFKSTLMAGYGS 950

Query: 2641 ATGSTGATGATGSAGATGSTGATGATGATGNTGATGSTGATGSTGATGATGSTGPKGDTG 2700
+ + + T G+T G + A + T +T + G + T
Sbjct: 951 SQTAREQSSLTAGYGSTSMAGYDSSLIAGYGSTQTAGYQSTLTAGYGSTQTAEHSSTLTA 1010

Query: 2701 STGSTGPKG 2709
GST G
Sbjct: 1011 GYGSTATAG 1019



Score = 33.2 bits (75), Expect = 0.025
Identities = 166/909 (18%), Positives = 278/909 (30%)

Query: 1603 DTGATGSTGATGATGATGDTGATGSTGATGATGATGATGATGDTGATGSTGATGSTGATG 1662
D A A + G+ + AT + +G
Sbjct: 111 DYVACTEMQAGPGSPDVTSEVKVGNRSLPVTDDIDATIESGSTQPTQTIEIATYGSTLSG 170

Query: 1663 DTGATGATGATGDTGATGATGATGATGATGDTGATGATGSTGATGATGDTGATGSTGATG 1722
+ G A ++ G+TG GA + + T ++ G
Sbjct: 171 THQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTAGEESSQMAGYGS 230

Query: 1723 ATGATGDTGATGSTGATGATGATGATGDTGATGSTGATGATGATGDTGATGSTGATGATG 1782
+ T G+TG G + + T ++ G + + T
Sbjct: 231 TQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTA 290

Query: 1783 ATGDTGATGSTGATGATGDTGATGSTGATGATGATGATGATGDTGATGSTGATGSTGATG 1842
G TG G+ + A + T +T G A + T G+TG+ G
Sbjct: 291 GYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDS 350

Query: 1843 DTGATGSTGATGATGATGDTGATGSTGATGATGATGDTGATGSTGATGATGATGDTGATG 1902
A + T ++ G + A + T G+TG+ GA + A + T
Sbjct: 351 SLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTA 410

Query: 1903 STGATGATGATGDTGATGSTGATGATGATGDTGATGSTGATGATGATGDTGATGSTGATG 1962
+T G A + T G+TG G S A + T ++ + G
Sbjct: 411 GEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGS 470

Query: 1963 ATGATGDTGATGSTGATGATGATGDTGATGSTGATGATGSTGDTGATGSTGATGVTGATG 2022
A + T G+T G A + T GST G + A +
Sbjct: 471 TQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDLIT 530

Query: 2023 DTGATGATGATGATGATGSTGATGATGSTGDTGATGATGDTGATGSTGATGSTGATGATG 2082
G+T GA + A + T + S G + T GSTG G+
Sbjct: 531 GYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDS 590

Query: 2083 ATGATGATGDTGATGSTGDTGATGSTGATGATGATGATGATGATGATGATGATGDTGATG 2142
+ A + T + S+ G + A + T G+T GA + A + T
Sbjct: 591 SIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTA 650

Query: 2143 STGATGATGATGDTGATGSTGATGATGATGDTGATGSTGATGSTGATGDTGATGSTGATG 2202
+ G A + T G+T GA S A + T + + G
Sbjct: 651 GYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGS 710

Query: 2203 ATGDTGATGSTGATGATGSTGATGATGDTGATGSTGATGDTGATGSTGATGATGDTGATG 2262
+ T G+T + GA + + T + + G A + T
Sbjct: 711 TQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTT 770

Query: 2263 STGATGATGATGATGATGDTGATGSTGATGSTGATGDTGATGSTGATGATGATGDTGATG 2322
G+T GA + A + T + + G A + T G+T GA
Sbjct: 771 GYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDLTTGYGSTSTAGADS 830

Query: 2323 STGATGATGATGDTGATGSTGATGSTGATGDTGATGSTGATGSTGDTGATGSTGATGATG 2382
S A + T + + G + A ++ T G+T + G + + + T
Sbjct: 831 SLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYDSSLIAGYGSTQTA 890

Query: 2383 ATGVTGDRGATGSTGATGSTGATGATGATGASGATGDTGATGSTGATGATGDTGATGSTG 2442
G + A ++ T G+T +G A + T + T G
Sbjct: 891 GYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYESSLIAGYGSTQTASFKSTLMAGYGS 950

Query: 2443 ATGDTGATGSTGATGDTGATGSTGATGATGSAGATGATGATGDTGATGSTGATGATGATG 2502
+ + T G T G + A + T +T G + A ++ T
Sbjct: 951 SQTAREQSSLTAGYGSTSMAGYDSSLIAGYGSTQTAGYQSTLTAGYGSTQTAEHSSTLTA 1010

Query: 2503 DTGATGATG 2511
G+T G
Sbjct: 1011 GYGSTATAG 1019


S.No  Start    End      Bias  Virulence  Insertion elements  Prediction
3     mll0691  mll0702  Y     N          Y                   Genomic Island

LocusTag DNBias  CDNBias  %GCBias     Product
mll0691  -1      16       -3.338026   trehalose-6-phosphate phosphatase
mll0692  0       21       -3.728480   hypothetical protein
mll0693  1       25       -5.197979   sugar transferase
mlr0694  2       22       -4.805576   hypothetical protein
mlr0695  2       17       -4.713179   O-antigen acetylase
mll0696  0       12       -3.152747   hypothetical protein
mlr0698  -1      10       -2.685408   hypothetical protein
mlr0699  1       13       -2.448842   *hypothetical protein
mll0700  1       14       -1.755510   glycerol kinase
mll0702  2       17       -1.660526   hypothetical protein

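
In the island summary rows listed on this page, the Prediction label follows from the Bias and Virulence flags. Every row with both flags set to Y is labelled a pathogenicity island (biased-composition), and the one row with Bias Y and Virulence N is labelled a genomic island. The sketch below encodes only that observed pattern. `predict_label` is a hypothetical helper, not PredictBias code, and flag combinations that do not occur on this page are deliberately left undecided.

```python
def predict_label(bias: bool, virulence: bool) -> str:
    """Reproduce the Prediction column for the flag combinations seen on this page.

    Assumption: this mapping is inferred from the five summary rows shown here;
    it is not the tool's documented decision rule.
    """
    if bias and virulence:
        return "Pathogenicity Island (biased-composition)"
    if bias:
        return "Genomic Island"
    return "undetermined (combination not shown on this page)"


# (S.No, Bias, Virulence, Insertion elements, Prediction) as listed in section C.
summary_rows = [
    (1, "Y", "Y", "Y", "Pathogenicity Island (biased-composition)"),
    (2, "Y", "Y", "N", "Pathogenicity Island (biased-composition)"),
    (3, "Y", "N", "Y", "Genomic Island"),
    (4, "Y", "Y", "N", "Pathogenicity Island (biased-composition)"),
    (5, "Y", "Y", "Y", "Pathogenicity Island (biased-composition)"),
]

for number, bias, virulence, _insertion, reported in summary_rows:
    inferred = predict_label(bias == "Y", virulence == "Y")
    print(number, inferred, "(matches)" if inferred == reported else "(differs)")
```

The Insertion elements flag is carried along but unused, because it does not change the label in any of the rows shown.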
S.No  Start    End      Bias  Virulence  Insertion elements  Prediction
4     mlr0789  mlr0796  Y     Y          N                   Pathogenicity Island (biased-composition)

LocusTag DNBias  CDNBias  %GCBias     Product
mlr0789  2       15       1.354434    hypothetical protein
mlr0791  2       14       1.378216    membrane transporter
mll0792  3       14       1.032397    thioredoxin reductase
msl0793  4       14       1.034398    ferredoxin
mlr0794  4       13       0.884021    hypothetical protein
mlr0796  3       14       1.164080    kinesin-like protein

ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr0789   PF04183       28     0.032    IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 27.9 bits (62), Expect = 0.032
Identities = 12/54 (22%), Positives = 27/54 (50%), Gaps = 10/54 (18%)

Query: 109 KLFWDTLAADIKRHEERHVEIAKNYG----------RELENALKATYPRTDCGT 152
+ F+ LAA + + ++H ++++ + R + N +K T+P D G+
Sbjct: 504 RRFYQLLAAVLSDYMKKHPQMSERFALFSLFRPQIIRVVLNPVKLTWPDLDGGS 557


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr0791   TCRTETB       120    9e-32    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 120 bits (303), Expect = 9e-32
Identities = 89/401 (22%), Positives = 166/401 (41%), Gaps = 26/401 (6%)

Query: 38 LLAALDQTIIAPAMPTIARALGHAE-YLPWMVTGYLLTATAVAPLYGKISDVYGRRPTIY 96
+ L++ ++ ++P IA W+ T ++LT + +YGK+SD G + +
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 97 AAILIFLVGSLVSAMAPNMF-VLVIGRAIQGAGGGGLFALAQTVIGDLVPPRERARYAAW 155
I+I GS++ + + F +L++ R IQGAG AL V+ +P R +
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 156 VSGTWAVASIAGPLLGGTFAEHLHWSLIFWINIPLGLLAMAIINKPLKKLPIAAKHHR-- 213
+ A+ GP +GG A ++HWS + + IP+ + L KL +
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPF---LMKLLKKEVRIKGH 198

Query: 214 IDGLGALLLVIATALLLLALNWGGSTYAWFSREIVGLVAGSAVFWALFAIRLMGATEPLI 273
D G +L+ + +L +Y+ + S + + +F + T+P +
Sbjct: 199 FDIKGIILMSVGIVFFMLFTT----SYSI------SFLIVSVLSFLIFVKHIRKVTDPFV 248

Query: 274 SLEVLGNPIVLAGALSMFLLQAANIGASVYLPVYLQTVIGLSVSESGMAMLGLLLGTVAG 333
+ N + G L ++ G +P ++ V LS +E G + + GT++
Sbjct: 249 DPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVI--IFPGTMSV 306

Query: 334 AATS---GRLIPRFVHYKRIAMIGITLAIVSIGLLSAIAGHASLLEVEILTTLIGLGSGT 390
G L+ R + IG+T VS S + S I+ ++G G
Sbjct: 307 IIFGYIGGILVDRRGPLY-VLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLG-GLSF 364

Query: 391 TFPVATVSVQNAVDRTHLGVATGVLTFLRTLGGALGVALLG 431
T V + V +++ + G +L F L G+A++G
Sbjct: 365 TKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVG 405


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mlr0796   PF03544  38     3e-04    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 37.6 bits (87), Expect = 3e-04
Identities = 21/118 (17%), Positives = 32/118 (27%), Gaps = 17/118 (14%)

Query: 1902 DTSEPRNLRPAPQAQAARPAEPPRRPPAPQPELRTPQPPLGGTALRGTLDLERPAEPRQR 1961
D P+ ++P P+ P P P P E +P +
Sbjct: 59 DLEPPQAVQPPPEPVV-EPEPEPEPIPEPPKEAPVVIEK------------PKPKPKPKP 105

Query: 1962 PEAGART-PQGGWVRDLLTAASNDADLRPATPPSPPAEAPRAAPA---QRSPLHVVES 2015
P+ AS + PA P S A A + P P + +
Sbjct: 106 KPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRN 163


5  msl0939  mlr0980  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
msl0939   -2      28       -4.756498  *hypothetical protein
mll0941   -1      18       -3.594570  hypothetical protein
msr0944   4       15       2.941967   hypothetical protein
mlr0945   4       15       3.051677   heat shock protein
mll0946   4       15       3.443741   hypothetical protein
mll0947   4       16       3.565989   hypothetical protein
mll0948   4       16       3.563231   hypothetical protein
mll0950   4       15       3.117235   hypothetical protein
mll0951   -3      20       -2.560411  hypothetical protein
msl0952   -1      20       -3.246826  hypothetical protein
mlr0954   0       29       -5.894128  methionine aminopeptidase
mlr0956   4       41       -8.788412  DNA repair protein RadC
mlr0958   0       33       -5.799922  integrase
msr0959   1       33       -5.501907  hypothetical protein
msr0960   1       32       -5.002747  hypothetical protein
mlr0961   1       32       -4.902382  glycosyltransferase
mll0962   0       26       -3.055964  acetyltransferase
mll0964   -1      20       -0.915782  conjugal transfer relaxase TraA
msr0965   1       24       -4.662688  conjugal transfer protein traD
mlr0967   2       28       -4.886422  allantoate amidohydrolase
msl0968   2       31       -5.551309  hypothetical protein
mll0969   2       31       -5.692561  transposase
mlr0970   3       31       -5.815550  ubiquinol-cytochrome C reductase iron-sulfur
mlr0971   3       29       -5.336761  ubiquinol-cytochrome C reductase iron-sulfur
mlr0973   3       28       -3.825664  hypothetical protein
mlr0974   3       26       -3.199956  hypothetical protein
mll0975   2       22       -2.572211  calpastatin
mll0976   2       21       -2.150452  sensor/response regulator hybrid
mlr0977   3       19       -1.937376  hypothetical protein
mlr0978   1       21       -2.993940  non-heme chloroperoxidase
mlr0979   1       27       -3.170518  hypothetical protein
mlr0980   1       27       -3.714887  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mll0950   cloacin  35     0.006    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 35.1 bits (80), Expect = 0.006
Identities = 30/108 (27%), Positives = 36/108 (33%), Gaps = 3/108 (2%)

Query: 1008 GGGGGNGGFSVAGTFTTGALGASVAVGGSGGSGQSAGEVTVTSAGNIQTHGDQSIGIMAQ 1067
G G G S +G G G V G S GSG S+ G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSS---ENNPWGGGSGSGIHWGGGSGH 62

Query: 1068 SVGGGGGNGGFAGAGAITLQGVSAAVGLGGSGAGGGSAKTVRVTSTGD 1115
GGG GN G L V+A V G A + V+ +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 33.5 bits (76), Expect = 0.019
Identities = 35/118 (29%), Positives = 41/118 (34%), Gaps = 8/118 (6%)

Query: 227 GGDG-GDGGDAKGISGD--AGDGGLGGSGGNATVNFNSGSVETQGNYSAGIAAISQGGHG 283
GGDG G A SG+ G GLG GG + GS + N G + S G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGA-----SDGSGWSSENNPWGGGSGSGIHWG 57

Query: 284 GNGGGGGGLVFNPGGGSPAGAGGNANVFTGVGTTITTYGIYSHGIAAQSIGGGGGGSA 341
G G G G GG G + V V G A SI G +A
Sbjct: 58 GGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 33.1 bits (75), Expect = 0.024
Identities = 31/98 (31%), Positives = 41/98 (41%), Gaps = 8/98 (8%)

Query: 1130 SVGGGGGNGGSTVSLALAKDAGIGVALGGKGGAAGNGLDVTVISTGNISTGAGFISGGVR 1189
S G G G+ S + + G G G + G+G G S G+G GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGS-GSGIHWGGGS 60

Query: 1190 GTGAAGILAQSVGGGGGNGGFAGTLGGGKSIAVGVAFG 1227
G G G G G +GG +GT G ++A VAFG
Sbjct: 61 GHGNGG-------GNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 33.1 bits (75), Expect = 0.026
Identities = 26/89 (29%), Positives = 37/89 (41%)

Query: 443 IGGGGGNGGNSGGLVSLGGDGASTTNGGIVEVTNTSAGSISTDGKQSAGIFAQSVGGGGG 502
+ GG G G N+G + G T G+ + +G S + G + GGG
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 503 NGGTSGGLFSAGGKGGAGGNGARVTVTNA 531
G GG ++GG G GGN + V A
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVA 89



Score = 32.8 bits (74), Expect = 0.032
Identities = 30/113 (26%), Positives = 41/113 (36%), Gaps = 3/113 (2%)

Query: 941 IGGGGGNGGFAVTLSVSGSYEGAGGAAAVSVGGSGATGGVGKDVFVTSYGNIATYGKQSD 1000
+ GG G G S SG+ G V G S +G ++ +G + G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSEN---NPWGGGSGSGIHWG 57

Query: 1001 GILAQSIGGGGGNGGFSVAGTFTTGALGASVAVGGSGGSGQSAGEVTVTSAGN 1053
G GGG GN G A+ A VA G S AG + V+ +
Sbjct: 58 GGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mll0976   PF06580  46     8e-08    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 46.0 bits (109), Expect = 8e-08
Identities = 29/178 (16%), Positives = 62/178 (34%), Gaps = 25/178 (14%)

Query: 119 ARTALQRADTLVRDTIIRARKS----SRETEHADINACLEVIEKLTGWAGDANIRLEVAS 174
AR L L+R ++ + + E +++ L+ + + D ++ E
Sbjct: 193 AREMLTSLSELMRYSLRYSNARQVSLADELTV--VDSYLQ-LASIQ--FEDR-LQFENQI 246

Query: 175 ATDLPMVRCDPLGLQNAVLNLVFNARDAMPNGGVISISVAEVVVGPAHQIEARVKDNGVG 234
+ V+ P+ +Q V N + + +P GG I + + + V++ G
Sbjct: 247 NPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKD----NGTVTLEVENTGSL 302

Query: 235 MSPETVVRAFEPFFTTKGNGLGGVGLPMVKRFVEEHGGTIEVESSFGSGTTVILRLPA 292
T + G GL V + + E I++ G ++ +P
Sbjct: 303 ALKNTK--------ESTGTGLQNVRERLQMLYGTEAQ--IKLSEKQG-KVNAMVLIPG 349


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr0977   NUCEPIMERASE  41     2e-06    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 40.9 bits (96), Expect = 2e-06
Identities = 24/125 (19%), Positives = 37/125 (29%), Gaps = 40/125 (32%)

Query: 1 MKIVVIGGTGLIGSKTVERLRKKGHDV------------------LAASPNGGVNTITG- 41
MK +V G G IG +RL + GH V L G
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 42 ----EGLAEALA--GAQVVIDLA----------NSPSFEDKAVLEFFETSGRNLLAAEKR 85
EG+ + A + V N ++ D + F N+L +
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFL-----NILEGCRH 115

Query: 86 AGVKH 90
++H
Sbjct: 116 NKIQH 120


6  mll1017  msr1055  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mll1017   -1      25       -3.381067  3-ketoacyl-ACP reductase
mll1018   0       24       -3.899610  racemase
mlr1020   1       26       -4.412208  glucose-resistance amylase regulator
mll1021   0       21       -3.500968  phosphoglycerate dehydrogenase
mll1022   0       17       -2.380817  adenylate cyclase
mlr1023   1       15       -2.321597  adenylate cyclase
mll1024   2       13       -1.892132  polyamine transport protein
mlr1025   2       13       -0.834174  transcriptional regulatory protein, nodulation
mll1026   2       12       -0.455549  rhizobiocin secretion protein rspE
mll1027   2       11       -0.706335  rhizobiocin secretion protein rspD
mll1028   3       12       -1.534526  rhizobiocin rzcA
msr1029   -1      11       1.614964   DNA repair protein
mlr1030   -1      9        1.474237   hypothetical protein
msl1031   0       12       1.179364   hypothetical protein
mll1033   0       11       0.494617   hypothetical protein
mll1034   2       9        -0.465795  3-hydroxyacyl-CoA dehydrogenase
mll1036   2       10       -0.906691  3-ketoacyl-ACP reductase
mll1037   2       10       -2.073327  hypothetical protein
mlr1038   3       11       -2.206159  transcriptional regulator
mlr1039   4       11       -2.600330  cytochrome C oxidase subunit II
mlr1041   3       12       -2.937681  cytochrome C oxidase subunit I
mlr1042   -1      14       -2.815116  cytochrome C oxidase subunit III
mlr1043   -2      15       -3.772228  cytochrome C oxidase subunit III
mlr1044   0       21       -3.233171  cytochrome C oxidase subunit IV
mlr1045   0       14       -2.255613  hypothetical protein
mll1046   1       15       -2.297440  cytochrome C oxidase subunit I
mll1047   1       14       -2.567233  protoheme IX farnesyltransferase
mll1048   1       15       -3.510313  cytochrome C
mlr1050   0       17       -3.959228  hypothetical protein
mll1054   -1      11       -2.857348  trigger factor
msr1055   0       20       -4.002712  *hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll1017   DHBDHDRGNASE  83     7e-21    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 82.8 bits (204), Expect = 7e-21
Identities = 72/247 (29%), Positives = 120/247 (48%), Gaps = 17/247 (6%)

Query: 18 AASINRGLAIVTGGRRGIGRAICCELAQAGFDIAVIDIVDDENASETVNLVRRHGRNAAF 77
A I +A +TG +GIG A+ LA G IA +D + E + V+ ++ R+A
Sbjct: 3 AKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVD-YNPEKLEKVVSSLKAEARHAEA 61

Query: 78 YRKDISETSDNTALIERIEADLGGATCLVNNAGVQVSVRGDLLD-VSEESFDRLVGINLR 136
+ D+ +++ + RIE ++G LVN AGV +R L+ +S+E ++ +N
Sbjct: 62 FPADVRDSAAIDEITARIEREMGPIDILVNVAGV---LRPGLIHSLSDEEWEATFSVNST 118

Query: 137 GTFFFTQAVARAMISRPHVRLERSVVTITSANAGLVSPEKGPYCISKAGLSMASQQFAIR 196
G F +++V++ M+ R S+VT+ S AG+ Y SKA M ++ +
Sbjct: 119 GVFNASRSVSKYMMDRR----SGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLE 174

Query: 197 LADTGIRVHEIRPGLIQTDMTADVY--EKYSAQVEAGQLSAIR------RWGQPEDIARG 248
LA+ IR + + PG +TDM ++ E + QV G L + + +P DIA
Sbjct: 175 LAEYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADA 234

Query: 249 VATLAVG 255
V L G
Sbjct: 235 VLFLVSG 241


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr1023   SYCDCHAPRONE  38     4e-05    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 38.0 bits (88), Expect = 4e-05
Identities = 19/76 (25%), Positives = 33/76 (43%), Gaps = 6/76 (7%)

Query: 473 GERLQRLNPLEDRDIHEL--LAFTHYLLGDYEASLRSFRR---WDNNNYDRGFANLAACL 527
G + LN + + +L LAF Y G YE + + F+ D+ + F L AC
Sbjct: 22 GGTIAMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRF-FLGLGACR 80

Query: 528 GQLGRAEEARSAWGRC 543
+G+ + A ++
Sbjct: 81 QAMGQYDLAIHSYSYG 96


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
mll1026   RTXTOXIND  338    e-115    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 338 bits (869), Expect = e-115
Identities = 100/427 (23%), Positives = 172/427 (40%), Gaps = 3/427 (0%)

Query: 9 RTIRRYLLGGVAACIFLVGGAGSLAAVTELSGAVIAPGKLVVDSSVKKVQHPTGGVVGDI 68
RR L FLV A L+ + ++ A GKL K+++ +V +I
Sbjct: 52 PVSRRPRLVAYFIMGFLVI-AFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEI 110

Query: 69 LAREGDAVKSGQVLIRLDETVTRANLAIVTKGLDEFEARLARLEAERDDRAGIAFPASLT 128
+ +EG++V+ G VL++L A+ L + R + P
Sbjct: 111 IVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKL 170

Query: 129 SRRDDPAVARA--MAGEQSLFEFRRQARAGQKAQLEERIAQLAEEASGLTEQRTAKSREI 186
+ SL + + QK Q E + + E + +
Sbjct: 171 PDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLS 230

Query: 187 ELIGTELESIRTLWLKKLVSIDRMTALERDAVRLDGEHGQLTASIAQSKGRIAETRLQII 246
+ + L+ +L K+ ++ + E V E + + Q + I + +
Sbjct: 231 RVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQ 290

Query: 247 QVDQDLRSEVATELRDVQGKISEFVERKVSAEDQLKRIDIRSPQDGVVHQLAVHTIGGVI 306
V Q ++E+ +LR I E++ + IR+P V QL VHT GGV+
Sbjct: 291 LVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVV 350

Query: 307 SPGEVIMLVVPVADDLTVEARIAPQDIDQLSLGQDVALKLSAFNQRVTPELSGVVSEISA 366
+ E +M++VP D L V A + +DI +++GQ+ +K+ AF L G V I+
Sbjct: 351 TTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINL 410

Query: 367 DLSVDERSGASFYTVRVSLPRTELKKLKGLTLAPGMPVEAFFATGSRTMLSYLVKPLADQ 426
D D+R G F + K + L+ GM V A TG R+++SYL+ PL +
Sbjct: 411 DAIEDQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVISYLLSPLEES 470

Query: 427 IARAFRE 433
+ + RE
Sbjct: 471 VTESLRE 477


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
mll1028   RTXTOXINA  78     1e-16    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 77.7 bits (191), Expect = 1e-16
Identities = 64/280 (22%), Positives = 103/280 (36%), Gaps = 63/280 (22%)

Query: 384 GDDTITGSNSPDTITGGRGNDTLNGVGGNDTYIYARGDGNDTVTDGSGNGINDRLVFTDI 443
GDD I G++ D + G +GNDTL+G G+D +Y GDGND + +GN
Sbjct: 745 GDDLIEGNDGNDRLYGDKGNDTLSGGNGDD-QLYG-GDGNDKLIGVAGNNY--------- 793

Query: 444 DPSMVSLVRIGSDVKVVIAESSPGAGDAGSIVLKDILGDFISQGVDKILFADGTVWTRPT 503
+ G GD D+ ++
Sbjct: 794 --------------------LNGGDGD------------------DEFQVQGNSLAKNV- 814

Query: 504 IVGKLVDLLGTTGNDSINGTSATDIIRGAAGNDTLNGASGDDTYLYARGDGNDTVNEGFW 563
L G GND + G+ D++ G G+D L G G+D Y Y G G+ +++
Sbjct: 815 -------LFGGKGNDKLYGSEGADLLDGGEGDDLLKGGYGNDIYRYLSGYGHHIIDDDGG 867

Query: 564 DVNDRLVFTNINRSGVSLVRNGNDLTVVIAES-APGAGDGGSVLIKNTLDDNNSWG---- 618
D+L +I+ V+ R GNDL + E G + +N + +
Sbjct: 868 K-EDKLSLADIDFRDVAFKREGNDLIMYKGEGNVLSIGHKNGITFRNWFEKESGDISNHE 926

Query: 619 VEKVVFADGTAWTRADIRVALLDQAGTTGNDTIIGFNVAD 658
+E++ G T ++ AL Q + G +
Sbjct: 927 IEQIFDKSGRIITPDSLKKALEYQQRNNKASYVYGNDALA 966



Score = 75.0 bits (184), Expect = 1e-15
Identities = 74/285 (25%), Positives = 111/285 (38%), Gaps = 44/285 (15%)

Query: 510 DLLGTTGNDSINGTSATDIIRGAAGNDTLNGASGDDTYLYARGDGNDTVNEGFWDVNDRL 569
+L+GTT D G+ TDI GA G+D + G G+D LY GNDT++ G + +D+L
Sbjct: 721 ELIGTTRADKFFGSKFTDIFHGADGDDLIEGNDGNDR-LYG-DKGNDTLSGG--NGDDQL 776

Query: 570 VFTNINRSGVSLVRNGNDLTVVIAESAPGAGDGGSVLIKNTLDDNNSWGVEKVVFADGTA 629
+ GND + G + L DD E V + A
Sbjct: 777 YGGD-----------GNDKLI--------GVAGNNYLNGGDGDD------EFQVQGNSLA 811

Query: 630 WTRADIRVALLDQAGTTGNDTIIGFNVADTLHGRAGNDTLNGAGGDDTYLYARGDGNDTV 689
G GND + G AD L G G+D L G G+D Y Y G G+ +
Sbjct: 812 KNVLF---------GGKGNDKLYGSEGADLLDGGEGDDLLKGGYGNDIYRYLSGYGHHII 862

Query: 690 NEGFWDVNDRLVFTDINPSGVSLVRNGNDLTVVIAES-APGAGDGGSVLIKNTLDDNNSW 748
++ D+L DI+ V+ R GNDL + E G + +N + +
Sbjct: 863 DDDGGK-EDKLSLADIDFRDVAFKREGNDLIMYKGEGNVLSIGHKNGITFRNWFEKESGD 921

Query: 749 G----VEKVVFADGTTWTRADIRVALLNQADTAGNDTITGFNVAD 789
+E++ G T ++ AL Q + G +
Sbjct: 922 ISNHEIEQIFDKSGRIITPDSLKKALEYQQRNNKASYVYGNDALA 966



Score = 71.2 bits (174), Expect = 2e-14
Identities = 73/287 (25%), Positives = 108/287 (37%), Gaps = 44/287 (15%)

Query: 644 GTTGNDTIIGFNVADTLHGRAGNDTLNGAGGDDTYLYARGDGNDTVNEGFWDVNDRLVFT 703
GTT D G D HG G+D + G G+D LY GNDT++ G + +D+L
Sbjct: 724 GTTRADKFFGSKFTDIFHGADGDDLIEGNDGNDR-LYG-DKGNDTLSGG--NGDDQLYGG 779

Query: 704 DINPSGVSLVRNGNDLTVVIAESAPGAGDGGSVLIKNTLDDNNSWGVEKVVFADGTTWTR 763
D N + GN+ G GD + N+L N +G
Sbjct: 780 DGNDKLIG--VAGNNYLN------GGDGDDEFQVQGNSLAKNVLFG-------------- 817

Query: 764 ADIRVALLNQADTAGNDTITGFNVADTLHGRAGNDTLNGAGGDDTYLYARGDGNDTVNEG 823
GND + G AD L G G+D L G G+D Y Y G G+ +++
Sbjct: 818 ------------GKGNDKLYGSEGADLLDGGEGDDLLKGGYGNDIYRYLSGYGHHIIDDD 865

Query: 824 FWDVNDRLVFTDINPSGVSLVRNGNDLTVVIAES-APGAGDGGSVLIKNTLDDNNSWG-- 880
D+L DI+ V+ R GNDL + E G + +N + +
Sbjct: 866 GGK-EDKLSLADIDFRDVAFKREGNDLIMYKGEGNVLSIGHKNGITFRNWFEKESGDISN 924

Query: 881 --VEKVVFADGTTWTRADIRVALLDQAGTTGNDTITGFNVADRISGG 925
+E++ G T ++ AL Q + G + S G
Sbjct: 925 HEIEQIFDKSGRIITPDSLKKALEYQQRNNKASYVYGNDALAYGSQG 971



Score = 68.8 bits (168), Expect = 8e-14
Identities = 68/258 (26%), Positives = 99/258 (38%), Gaps = 44/258 (17%)

Query: 8 TDGDDVLVGSAASGIMHGGKGNDTLDGASGNDNYVYARGDGNDLITDGYNDVGDRLTFTD 67
T D GS + I HG G+D ++G GND +Y GND ++ G D D+L D
Sbjct: 725 TTRADKFFGSKFTDIFHGADGDDLIEGNDGNDR-LYG-DKGNDTLSGGNGD--DQLYGGD 780

Query: 68 INSSTVSLVRSGNDVTIVIAESAPGAGDGGSVRLKDALDDDHNRGVDQVVFADGTIWTRA 127
GND I G G+ L DD + + +
Sbjct: 781 -----------GNDKLI---------GVAGNNYLNGGDGDDEFQVQGNSLAKN------- 813

Query: 128 GIRVMLLDQTATVGNDTITGFNVADTISGKAGNDTIDGAGGNDNYVYARGDGNDTLTEGY 187
+L GND + G AD + G G+D + G GND Y Y G G+ + +
Sbjct: 814 -----VLFGGK--GNDKLYGSEGADLLDGGEGDDLLKGGYGNDIYRYLSGYGHHIIDDDG 866

Query: 188 NDYGDRLTFTDIDSSAVSIVRNGNDVTVVIAES-APGAGDGGSVVLKDTLEDNAGRG--- 243
D+L+ DID V+ R GND+ + E G + ++ E +G
Sbjct: 867 GK-EDKLSLADIDFRDVAFKREGNDLIMYKGEGNVLSIGHKNGITFRNWFEKESGDISNH 925

Query: 244 -IDQIVFADGTNWSRAQL 260
I+QI G + L
Sbjct: 926 EIEQIFDKSGRIITPDSL 943



Score = 65.8 bits (160), Expect = 8e-13
Identities = 62/203 (30%), Positives = 84/203 (41%), Gaps = 27/203 (13%)

Query: 265 LTGTTADETLVGFSRDDTFHYARGGGDDTIIDGVNNGYNDQLVFSDINPDDVTLVGIGND 324
L GTT + G D FH G D +I+G N ND+L + D +D G G+D
Sbjct: 722 LIGTTRADKFFGSKFTDIFH---GADGDDLIEG--NDGNDRL-YGD-KGNDTLSGGNGDD 774

Query: 325 VKVVVAESTTGAGDGGSILLKDALANYY--GQGIDKIVFADGTAWTRDDFRAAILGLGAT 382
GDG L+ A NY G G D+ + L
Sbjct: 775 QLY--------GGDGNDKLIGVAGNNYLNGGDGDDEFQVQGNSLAKNV--------LFGG 818

Query: 383 AGDDTITGSNSPDTITGGRGNDTLNGVGGNDTYIYARGDGNDTVTDGSGNGINDRLVFTD 442
G+D + GS D + GG G+D L G GND Y Y G G+ + D G D+L D
Sbjct: 819 KGNDKLYGSEGADLLDGGEGDDLLKGGYGNDIYRYLSGYGHHIIDDDGGK--EDKLSLAD 876

Query: 443 IDPSMVSLVRIGSDVKVVIAESS 465
ID V+ R G+D+ + E +
Sbjct: 877 IDFRDVAFKREGNDLIMYKGEGN 899



Score = 60.4 bits (146), Expect = 3e-11
Identities = 62/244 (25%), Positives = 94/244 (38%), Gaps = 49/244 (20%)

Query: 141 GNDTITGFNVADTISGKAGNDTIDGAGGNDNYVYARGDGNDTLTEGYNDYGDRLTFTDID 200
G+D I G + D + G GNDT+ G G+D GDGND L
Sbjct: 745 GDDLIEGNDGNDRLYGDKGNDTLSGGNGDDQLY--GGDGNDKLIGV-------------- 788

Query: 201 SSAVSIVRNGNDVTVVIAESAPGAGDGGSVVLKDTLEDN---AGRGIDQIVFADGTNWSR 257
GN+ G GD V ++L N G+G D++ ++G +
Sbjct: 789 --------AGNNYLN------GGDGDDEFQVQGNSLAKNVLFGGKGNDKLYGSEGAD--- 831

Query: 258 AQLRDILLTGTTADETLVGFSRDDTFHYARGGGDDTIIDGVNNGYNDQLVFSDINPDDVT 317
LL G D+ L G +D + Y G G I D + G D+L +DI+ DV
Sbjct: 832 ------LLDGGEGDDLLKGGYGNDIYRYLSGYGHHIIDD--DGGKEDKLSLADIDFRDVA 883

Query: 318 LVGIGNDVKVVVAES---TTGAGDGGSI--LLKDALANYYGQGIDKIVFADGTAWTRDDF 372
GND+ + E + G +G + + + I++I G T D
Sbjct: 884 FKREGNDLIMYKGEGNVLSIGHKNGITFRNWFEKESGDISNHEIEQIFDKSGRIITPDSL 943

Query: 373 RAAI 376
+ A+
Sbjct: 944 KKAL 947



Score = 53.8 bits (129), Expect = 3e-09
Identities = 57/227 (25%), Positives = 86/227 (37%), Gaps = 46/227 (20%)

Query: 776 TAGNDTITGFNVADTLHGRAGNDTLNGAGGDDTYLYARGDGNDTVNEGFWDVNDRLVFTD 835
T D G D HG G+D + G G+D LY GNDT++ G + +D+L D
Sbjct: 725 TTRADKFFGSKFTDIFHGADGDDLIEGNDGNDR-LYG-DKGNDTLSGG--NGDDQLYGGD 780

Query: 836 INPSGVSLVRNGNDLTVVIAESAPGAGDGGSVLIKNTLDDNNSWGVEKVVFADGTTWTRA 895
GND + G + L DD + +
Sbjct: 781 -----------GNDKLI--------GVAGNNYLNGGDGDD-------EFQVQGNSLAKNV 814

Query: 896 DIRVALLDQAGTTGNDTITGFNVADRISGGGGNDTLTGGAGSDTFIFHTNFGSDKITDFV 955
G GND + G AD + GG G+D L GG G+D + + + +G I D
Sbjct: 815 --------LFGGKGNDKLYGSEGADLLDGGEGDDLLKGGYGNDIYRYLSGYGHHIIDD-- 864

Query: 956 VGAGSQDVIQFGNDVFADFASVLAAATQVGADTVITHDAGNTLTLKN 1002
G +D + + F D A + G D ++ GN L++ +
Sbjct: 865 -DGGKEDKLSLADIDFRDV-----AFKREGNDLIMYKGEGNVLSIGH 905


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mlr1030   cloacin  37     8e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 36.6 bits (84), Expect = 8e-05
Identities = 19/42 (45%), Positives = 21/42 (50%)

Query: 25 NGNGGGNGGGHGNGGGHGNGGGNAGSNGKGNSGASHGKSASA 66
N GGG+G G GGG G+G G N G SG SA A
Sbjct: 44 NPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVA 85



Score = 36.2 bits (83), Expect = 1e-04
Identities = 18/37 (48%), Positives = 26/37 (70%)

Query: 23 AGNGNGGGNGGGHGNGGGHGNGGGNAGSNGKGNSGAS 59
+G+G G G GHGNGGG+GN GG +G+ G ++ A+
Sbjct: 50 SGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 35.1 bits (80), Expect = 3e-04
Identities = 20/47 (42%), Positives = 25/47 (53%), Gaps = 1/47 (2%)

Query: 25 NGNGGGNGGGHGNGGGHGNGGGNAGSNGKGNSGASHGKSASAPGQVG 71
G G G+G G G GHGNGGGN S G G+ + + +AP G
Sbjct: 46 WGGGSGSGIHWGGGSGHGNGGGNGNS-GGGSGTGGNLSAVAAPVAFG 91



Score = 32.0 bits (72), Expect = 0.002
Identities = 18/41 (43%), Positives = 22/41 (53%)

Query: 19 SPALAGNGNGGGNGGGHGNGGGHGNGGGNAGSNGKGNSGAS 59
+P G+G+G GGG G+G G GNG GS GN A
Sbjct: 44 NPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAV 84



Score = 28.1 bits (62), Expect = 0.036
Identities = 22/66 (33%), Positives = 27/66 (40%), Gaps = 6/66 (9%)

Query: 19 SPALAGNGNGGGNGGGHG-----NGGGHGNGGGNAGSNGKGNSGASHGKSASAPGQVGKV 73
P G G G +G G GGG G+G G +G GN G + G S G G +
Sbjct: 23 GPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGN-GNSGGGSGTGGNL 81

Query: 74 DADATA 79
A A
Sbjct: 82 SAVAAP 87


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll1033   VACJLIPOPROT  28     0.047    VacJ lipoprotein signature.
		>VACJLIPOPROT#VacJ lipoprotein signature.

Length = 251

Score = 27.9 bits (62), Expect = 0.047
Identities = 12/45 (26%), Positives = 19/45 (42%), Gaps = 2/45 (4%)

Query: 204 FFVQSVFGLLGGIGTHPEDVAHMKRTADRLFGDQF-RWSVLGAGA 247
FF+ ++ G+ G I ++RT FG + V G G
Sbjct: 103 FFLNTILGMGGFIDVAGMANPKLQRTEPHRFGSTLGHYGV-GYGP 146


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll1036   DHBDHDRGNASE  105    3e-29    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 105 bits (262), Expect = 3e-29
Identities = 75/257 (29%), Positives = 117/257 (45%), Gaps = 14/257 (5%)

Query: 15 GLKGQRVLVTAGAGGIGFAIADTLSRLGARIIVCDISDEALAAAPGKIDLVA----AVKA 70
G++G+ +T A GIG A+A TL+ GA I D + E L + A A A
Sbjct: 5 GIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPA 64

Query: 71 DVSRDEDVDRLFETVKEKLGGLDALVNNAGIAGPTGGVDEIEPDDWRRCIDICLTGQFLC 130
DV +D + ++ ++G +D LVN AG+ P G + + ++W + TG F
Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAGVLRP-GLIHSLSDEEWEATFSVNSTGVFNA 123

Query: 131 ARRAVPLIKAAGGGSIVSMSSAAGRHGYAFRTPYSAAKFGVIGFAQSLAKELGPHGIRVN 190
+R + GSIV++ S Y+++K + F + L EL + IR N
Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183

Query: 191 AILPGIIEGPRIEGVIAAR--AKQV--GISHEEMTGRYLQNISLRRMTSPYDVASMVAFL 246
+ PG E + A A+QV G TG I L+++ P D+A V FL
Sbjct: 184 IVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTG-----IPLKKLAKPSDIADAVLFL 238

Query: 247 LSDAGINISGQSLGVDG 263
+S +I+ +L VDG
Sbjct: 239 VSGQAGHITMHNLCVDG 255


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr1041   ACRIFLAVINRP  30     0.040    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 29.8 bits (67), Expect = 0.040
Identities = 25/108 (23%), Positives = 44/108 (40%), Gaps = 9/108 (8%)

Query: 14 EVAEVELYHPHSWWTKYVFSQDAKVIAVQYSATATAIGLVALVLSWLMRLQLGFPGTFDF 73
+VA VEL + + + A + ++ + A A+ + + L LQ FP
Sbjct: 264 DVARVELGGENYNVIARINGKPAAGLGIKLATGANALDTAKAIKAKLAELQPFFPQGMKV 323

Query: 74 ITPEAYYQFI---------TMHGMIMVIYLLTALFLGGFGNYLIPLMV 112
+ P F+ T+ IM+++L+ LFL LIP +
Sbjct: 324 LYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIA 371


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mlr1050   PF05616  29     0.013    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 28.6 bits (63), Expect = 0.013
Identities = 19/75 (25%), Positives = 30/75 (40%), Gaps = 8/75 (10%)

Query: 47 LEPGHAQGGSVRQLPAICST--------ASSSPASSKSAWPRPDNKPNRQESDSRLAGTR 98
L PG A+ + + LP + + +P + + P PD P+ GTR
Sbjct: 315 LTPGSAEAPNAQPLPEVSPAENPANNPAPNENPGTRPNPEPDPDLNPDANPDTDGQPGTR 374

Query: 99 RLMPSAPSRTSIRPR 113
P+ P R + R R
Sbjct: 375 PDSPAVPDRPNGRHR 389


7  mll1107  mll1116  Y  N  Y  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias    Product
mll1107   2       18       -1.386374  outer membrane protein, nodT
mll1108   2       29       -3.715584  protein-L-isoaspartate O-methyltransferase
msl1109   4       38       -7.712351  *hypothetical protein
msl1110   5       39       -8.097925  hypothetical protein
msr1111   5       40       -8.653200  hypothetical protein
mlr1112   6       34       -7.651071  *hypothetical protein
msr1113   5       29       -6.550006  hypothetical protein
mll1116   3       23       -4.252649  hypothetical protein
8  mll1371  mlr1397  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mll1371   -1      17       3.048914   NADH dehydrogenase subunit B
mll1372   -1      13       3.414623   NADH dehydrogenase subunit A
mlr1373   -1      14       3.894373   hypothetical protein
mlr1374   -1      14       3.736025   hypothetical protein
mlr1375   -1      14       3.862888   cobalamin synthesis protein cobW
mlr1377   0       12       4.193832   cobaltochelatase subunit CobN
mlr1378   1       10       3.142019   cobalamin synthesis protein cobG
mlr1379   1       13       2.912315   precorrin-8X methylmutase
mlr1380   1       14       2.751104   precorrin-2 C20-methyltransferase
mlr1381   3       13       3.759956   precorrin-3B C17-methyltransferase
mll1382   3       12       3.738667   cobalt-precorrin-6x reductase
mlr1383   4       12       3.755523   precorrin 6y methylase
mlr1384   3       15       4.150709   cobalamin biosynthesis protein G cbiG
mlr1385   1       13       2.990257   precorrin 3 methylase
mlr1386   2       14       2.823927   uroporphyrin-III C-methyltransferase
mlr1387   2       14       1.818745   cobyrinic acid a,c-diamide synthase
mlr1388   -1      13       1.424533   cobalamin (5'-phosphate) synthase
mlr1389   -2      12       1.480817   nicotinate-nucleotide-dimethylbenzimidazole
mll1390   -1      12       1.372842   hypothetical protein
msl1391   1       16       2.328058   hypothetical protein
mll1393   2       17       2.330205   *hypothetical protein
mlr1394   3       15       2.351103   hypothetical protein
mlr1396   4       13       1.427810   ABC transporter ATP-binding protein
mlr1397   2       12       0.569956   ABC transporter permease
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
mll1390   OMPADOMAIN  46     3e-08    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 46.5 bits (110), Expect = 3e-08
Identities = 53/207 (25%), Positives = 80/207 (38%), Gaps = 27/207 (13%)

Query: 20 ALAADATVELPVASSYNWTGGYIGAQIGGGWSKVDQPWGFTEASPNIFDQDNADGSGVVG 79
ALA ATV W Y GA GWS+ GF + +N G+G G
Sbjct: 11 ALAGFATVAQAAPKDNTW---YTGA--KLGWSQYHDT-GFINNNGPT--HENQLGAGAFG 62

Query: 80 GLHAGYNWQSGSFVFGGEADINATGIDGDDGGSGGDINGFKARWVASVRARAGFAF-DRV 138
G Y G E + G G + +KA+ V + A+ G+ D +
Sbjct: 63 G----YQVNPY---VGFEMGYDWLGRMPYKGSV--ENGAYKAQGV-QLTAKLGYPITDDL 112

Query: 139 LIYGTGGYAYLNGKADTRDVGRQESHSASFNGWTIGAGAEYALTDNITVRGEYRYADFGS 198
IY G +ADT+ ++H + G EYA+T I R EY++
Sbjct: 113 DIYTRLGGMVW--RADTKSNVYGKNHDTGVSP-VFAGGVEYAITPEIATRLEYQWT---- 165

Query: 199 KTVVFNNYVEDISPALHTVTIGVSYKF 225
+ + + P +++GVSY+F
Sbjct: 166 -NNIGDAHTIGTRPDNGMLSLGVSYRF 191


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
mlr1394   RTXTOXIND  68     2e-14    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 68.3 bits (167), Expect = 2e-14
Identities = 45/258 (17%), Positives = 80/258 (31%), Gaps = 51/258 (19%)

Query: 121 DTAKLQVQIERAEASAKGAAANVEDATVTLAENESALVRAAALTKRGMATDQSLEAATAT 180
+ ++ +++ A A + +S L ++L + ++
Sbjct: 201 QKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENK 260

Query: 181 RDRSKAALDSAKANLA-----IANADLKSQQT---------------------------- 207
+ L K+ L I +A + Q
Sbjct: 261 YVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAK 320

Query: 208 ---DLANSTIYAPIDGIVLTRSV-DPGQTVASSLQAPVLFIIAADLRNMELVAAVDEADI 263
S I AP+ V V G V A L +I + +E+ A V DI
Sbjct: 321 NEERQQASVIRAPVSVKVQQLKVHTEGGVVT---TAETLMVIVPEDDTLEVTALVQNKDI 377

Query: 264 GAVKTGQHARFTVDAFPDR---PFDAEIRDISYASVT--TDGVVTYNAR------LEVDN 312
G + GQ+A V+AFP ++++I+ ++ G+V L N
Sbjct: 378 GFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEENCLSTGN 437

Query: 313 NELLLRPGMTATVSVVTR 330
+ L GM T + T
Sbjct: 438 KNIPLSSGMAVTAEIKTG 455



Score = 59.1 bits (143), Expect = 1e-11
Identities = 23/143 (16%), Positives = 48/143 (33%), Gaps = 1/143 (0%)

Query: 69 AAKADLTVKVSATGTLQPLTQV-DISSQLSGIIRSVSVKENQQVKKGDVLAALDTAKLQV 127
+ + + +A G L + +I + I++ + VKE + V+KGDVL L +
Sbjct: 75 SVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEA 134

Query: 128 QIERAEASAKGAAANVEDATVTLAENESALVRAAALTKRGMATDQSLEAATATRDRSKAA 187
+ ++S A + E + L + S E K
Sbjct: 135 DTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQ 194

Query: 188 LDSAKANLAIANADLKSQQTDLA 210
+ + +L ++ +
Sbjct: 195 FSTWQNQKYQKELNLDKKRAERL 217


9  mll1771  mlr1797  Y  N  Y  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias    Product
mll1771   2       18       1.791476   hypothetical protein
mll1773   2       18       1.919939   hypothetical protein
mll1775   1       20       1.581268   hypothetical protein
msr1776   -2      20       1.818387   hypothetical protein
mlr1777   0       16       1.487887   hypothetical protein
mll1779   3       14       2.247759   hypothetical protein
mlr1780   3       14       3.120746   hypothetical protein
msr1782   3       17       1.875621   hypothetical protein
mlr1783   5       19       0.944633   hypothetical protein
mll1786   6       20       -0.145856  hypothetical protein
msl1788   7       20       -0.540648  hypothetical protein
mlr1789   1       15       -0.842660  epoxide hydrolase
mll1791   1       14       -2.544278  *hypothetical protein
mll1793   1       16       -2.485978  hypothetical protein
msr1795   1       15       -0.474170  hypothetical protein
mlr1797   2       14       -0.477063  hypothetical protein
10  mlr1836  mlr1843  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mlr1836   1       16       4.468590   hypothetical protein
mlr1837   2       15       4.425086   hypothetical protein
msr1838   0       14       4.416175   hypothetical protein
mll1839   0       14       4.037821   hypothetical protein
mlr1841   -1      13       3.553729   p-hydroxycinnamoyl CoA hydratase/lyase
mlr1843   -1      12       3.678529   acyl-CoA synthetase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr1837   PilS_PF08805  26     0.047    PilS N terminal
		>PilS_PF08805#PilS N terminal

Length = 185

Score = 26.0 bits (57), Expect = 0.047
Identities = 7/49 (14%), Positives = 20/49 (40%), Gaps = 6/49 (12%)

Query: 77 RGHQALGVLLSIKLALLVIGAALAIRFGPFASGDS------AAAIVTGM 119
+G + VLL + + +++ +A + ++ S ++ M
Sbjct: 26 KGATLMEVLLVVGVIVVLAASAYKLYSMVQSNIQSSNEQNNVLTVIANM 74


11  mlr1935  mlr1976  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mlr1935   2       11       2.071231   serine proteinase
mll1936   2       14       1.733379   hypothetical protein
mll1937   0       13       1.221583   hypothetical protein
mlr1939   1       15       1.593436   DNA-binding response regulator
mll1940   2       16       1.707861   hypothetical protein
mlr1941   2       16       2.248770   hypothetical protein
mlr1944   3       18       2.909474   glycosyl transferase
mlr1946   1       14       1.661796   glycosyl transferase
mlr1948   0       14       0.653101   ABC transporter ATP-binding protein
msl1949   -1      15       -0.616792  cell division topological specificity factor
mll1950   -1      18       -0.221242  cell division inhibitor MinD
mll1951   0       22       -1.044272  septum formation inhibitor
mll1952   2       27       -3.232834  norsolorinic acid reductase
mlr1953   -1      25       -1.936842  transcriptional regulator
mll1954   4       24       0.857444   hypothetical protein
mlr1955   5       23       2.042616   hypothetical protein
msr1956   6       23       1.085280   hypothetical protein
msl1957   3       18       0.924501   hypothetical protein
mll1958   4       19       1.652072   hypothetical protein
mll1959   5       18       1.317514   hypothetical protein
mll1960   0       14       0.491437   hypothetical protein
mll1962   0       13       0.450260   hypothetical protein
mll1963   0       12       1.275183   hypothetical protein
msl1965   1       13       1.970495   hypothetical protein
mll1966   0       13       1.938551   hypothetical protein
mlr1968   0       12       2.190720   peptide synthetase
mlr1969   1       12       3.997921   spermidine acetyltransferase
mlr1970   0       14       4.053880   hypothetical protein
mll1971   1       14       3.306917   L-lysine 2,3-aminomutase
msl1972   0       14       3.429892   hypothetical protein
mlr1973   0       14       3.630579   glycine cleavage system transcription activator
mll1975   0       14       4.069438   cinnamoyl-CoA reductase
mlr1976   1       16       3.366254   transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
mlr1935   FLAGELLIN  36     0.001    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 35.8 bits (82), Expect = 0.001
Identities = 36/269 (13%), Positives = 66/269 (24%), Gaps = 5/269 (1%)

Query: 352 TGIDAHNFGTGATSVTANGTVTGSFAEGIKVVGNAAVTVSVADTVTGATRGLSLVGGTGG 411
ID + G +V T + T +V V
Sbjct: 160 QKIDVKSLGLDGFNVNGPKEATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTT 219

Query: 412 SGDISVTGTGGFAGGSGDAANILNNGSGTVTIDISGASSSTGGEGIVVRDVTTSTGISVT 471
+ + A G + NN + + + + + I G +
Sbjct: 220 APTVPDKVYVNAANGQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFD 279

Query: 472 TGAVTALTAGKDAIDVQSQSLTGNITEVANGNLQAGNAGMVAAILNAAGIGNIDVTANGS 531
VT K D ++ I A A + A + +V
Sbjct: 280 YKGVTFTIDTKTGNDGN-GKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYT-SV 337

Query: 532 LDARFGIDAENFGSGSTKVTTVGPVTVTTGNGIFALSTGGDVTVNAGDVTSTGNTAIIAR 591
++ +F D + + V + I VT G T I +
Sbjct: 338 VNGQFTFDDKTKNESAKLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDK 397

Query: 592 QTGAGAGA---IAITTSGTVEGAIAAIDA 617
+ A + +A+ID+
Sbjct: 398 TASGVSTLINEDAAAAKKSTANPLASIDS 426


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll1936   TYPE3IMSPROT  29     0.019    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.
Length = 354

Score = 29.3 bits (66), Expect = 0.019
Identities = 5/41 (12%), Positives = 11/41 (26%)

Query: 62 QMRPSRATLPWLVAYGVTLCALNLLFYAALARIPLGVAVAL 102
+ LP + + A +P+ + L
Sbjct: 272 LYKRGETPLPLVTFKYTDAQVQTVRKIAEEEGVPILQRIPL 312


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
mlr1939   HTHFIS  58     8e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 57.9 bits (140), Expect = 8e-13
Identities = 24/116 (20%), Positives = 46/116 (39%), Gaps = 2/116 (1%)

Query: 3 TTVFIVDDHPLLLRGLADLIARDSGYRVIGTALDGRSALTMIRQDLPDVAVIDLNMPGFS 62
T+ + DD + L ++R +GY V + + I D+ V D+ MP +
Sbjct: 4 ATILVADDDAAIRTVLNQALSR-AGYDVR-ITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 63 GLDLAFELGKETPTTRCVMLTAGASQSQLYEVIKAGVAGIVLKEAAIGTLLRCIHR 118
DL + K P ++++A + + + G + K + L+ I R
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mlr1953   HTHTETR  63     2e-14    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 62.7 bits (152), Expect = 2e-14
Identities = 36/210 (17%), Positives = 71/210 (33%), Gaps = 25/210 (11%)

Query: 1 MRVSKEKASANRDALLKAASRLFRQRGIEGVGVAEIAKEAGLTHGALYAHFSSKDELAAA 60
R +K++A R +L A RLF Q+G+ + EIAK AG+T GA+Y HF K +L +
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 61 AFSYGFARNMADTRAWAGDRNPSFQDYMGGLL------------SPFMRDKLETGCPMAA 108
+ + + + +L + + + C
Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVG 121

Query: 109 SASEIGRQDCGVSASFTDAFQEMAAMLEGSIETIIPAAEKR-----KLAIAAVAAEIGAM 163
+ + + + E +E +++ I A + A + I +
Sbjct: 122 EMAVVQQAQ-------RNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGL 174

Query: 164 AVSRAIAKTDVALADEVLQAVLETVAAAYR 193
+ A L + + + + Y
Sbjct: 175 MENWLFAPQSFDLK-KEARDYVAILLEMYL 203


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr1970   ENTSNTHTASED  67     1e-15    Enterobactin synthetase component D signature.
		>ENTSNTHTASED#Enterobactin synthetase component D signature.

Length = 234

Score = 66.6 bits (162), Expect = 1e-15
Identities = 37/152 (24%), Positives = 58/152 (38%), Gaps = 13/152 (8%)

Query: 38 LPEEARSIPARQPAMRRASGAARWVAHRLLADTGISDLAIPRAPSGAPLWPNGIVGSLAH 97
LP R + + + A R A L + G+ + PLWP+G+ GS++H
Sbjct: 33 LPHHDR-LRSAGRKRKAEHLAGRIAAVHALREVGVRTVPGM-GDKRQPLWPDGLFGSISH 90

Query: 98 DDDMAVAAVAPVGGIVSLGIDVEP------AEPLPDDIFAIVATGADRTGAADPRLAGRI 151
A+A + +GID+E A L I + LA +
Sbjct: 91 CATTALAVI----SRQRIGIDIEKIMSQHTATELAPSIIDSDERQILQASLLPFPLALTL 146

Query: 152 LFAAKEAVYKAAYPLDREVLGYEDIAVDLDAG 183
F+AKE+VYK A+ + G+ V
Sbjct: 147 AFSAKESVYK-AFSDRVTLPGFNSAKVTSLTA 177


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mll1971   PF07520  30     0.024    Virulence protein SrfB
		>PF07520#Virulence protein SrfB

Length = 1041

Score = 29.6 bits (66), Expect = 0.024
Identities = 17/84 (20%), Positives = 34/84 (40%), Gaps = 7/84 (8%)

Query: 20 VRQGVRHVRDLDRLPLSPVERAAAQAAAAHHKVRAPKAYLDLIDWNDPADP------IRA 73
V G R + L+R +P+ R + K++ P + + +D + +RA
Sbjct: 936 VYIGARQL-PLERWTTTPLYRLDFANDSIAGKIKLPVKVELVREDDDFDEAETSLEKLRA 994

Query: 74 QVIPSPDELEEAEGELGDPIADHD 97
+ + ++ AE G I + D
Sbjct: 995 ERVREVFRVDAAEDAEGTMIKNDD 1018


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll1975   NUCEPIMERASE  55     1e-10    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 55.2 bits (133), Expect = 1e-10
Identities = 37/175 (21%), Positives = 65/175 (37%), Gaps = 27/175 (15%)

Query: 15 VLVTGGSGFIASHCMLKLLDAGYRLRTTVRSLEREAEVRAMLREGGAE--PGDRLSFVAA 72
LVTG +GFI H +LL+AG+++ + +L +V L++ E F
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVV-GIDNLNDYYDVS--LKQARLELLAQPGFQFHKI 59

Query: 73 DLTADAGWAEAV---AGCAYVMH-----GASPTPSGSQTREEDWVRPAVDGVLRVLKAAR 124
DL AD + V + + + G L +L+ R
Sbjct: 60 DL-ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHA----YADSNLTGFLNILEGCR 114

Query: 125 DAGIKRVVL--TSAIGAVAMGHAPQTRPFNETDWSDLSGAVAPYQRSKTLSERAA 177
I+ ++ +S++ + PF+ D D V+ Y +K +E A
Sbjct: 115 HNKIQHLLYASSSSVYG-----LNRKMPFSTDDSVD--HPVSLYAATKKANELMA 162


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mlr1976   HTHTETR  28     0.030    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 28.4 bits (63), Expect = 0.030
Identities = 19/96 (19%), Positives = 37/96 (38%), Gaps = 8/96 (8%)

Query: 218 LEELARAAAMSRTSFAFHFRQTAGVAPLTY---LTQWRMHLAERALREEDTPVAVLARSL 274
L E+A+AA ++R + +HF+ + + + + E + P++VL L
Sbjct: 34 LGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQAKFPGDPLSVLREIL 93

Query: 275 GYTSESAFSNAFKRATGTAPKRYRTAGKAERSGDAE 310
+ ES + +R K E G+
Sbjct: 94 IHVLESTVTEERRRLLMEIIFH-----KCEFVGEMA 124


12  mlr2037  mll2061  Y        Y        N        Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
mlr2037        3       15   2.985905  hypothetical protein
mlr2038        1       13   2.291992  hypothetical protein
mll2039        0       11   1.910997  hypothetical protein
mll2041        0       13   1.167602  short chain oxidoreductase
mlr2042        0       14   0.974504  glutathione S-transferase
mll2043        1       14   2.263753  hypothetical protein
mlr2044       -1       11   3.535352  hypothetical protein
mll2045       -1       11   3.665099  hypothetical protein
mlr2046       -2       14   3.521097  hypothetical protein
mlr2047       -1       14   3.194985  short chain dehydrogenase
mll2048       -1       15   2.022823  transcriptional regulator
mll2051       -1       16   0.594512  serine protease
mlr2052        1       16  -0.901850  glutathione S-transferase
msl2054        2       18  -2.428989  hypothetical protein
msl8612        0       12   0.121699  hypothetical protein
mlr2056        0       12   0.853387  hypothetical protein
msr2057        0       15   0.703576  hypothetical protein
msr2059        2       16   0.335508  hypothetical protein
mll2060        2       12   1.989474  hypothetical protein
mll2061        1       14   3.176237  cytosine deaminase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll2041   DHBDHDRGNASE  77     6e-19    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.
Length = 261

Score = 76.6 bits (188), Expect = 6e-19
Identities = 52/191 (27%), Positives = 76/191 (39%), Gaps = 9/191 (4%)

Query: 7 KVALITGANRGIGLETGRQLAKLGFTVL---LGVRDLAKGEAAAKGLEGHVEAIALDVAA 63
K+A ITGA +GIG R LA G + L K ++ K H EA DV
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 64 PDAATTAADEVQRRFGRLDVLINNAAIHYDTGSRAL-RPDWTVIREAFETNVFGAWRVAA 122
A ++R G +D+L+N A + +L +W F N G + +
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEAT---FSVNSTGVFNASR 125

Query: 123 AFAPLLKAGGHGRLVNVSSEGGSLASMGAGAPAYSTSKATLNALTCVLAAELRGSGVLVN 182
+ + + G +V V S + A Y++SKA T L EL + N
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAA--YASSKAAAVMFTKCLGLELAEYNIRCN 183

Query: 183 AICPGWVATDM 193
+ PG TDM
Sbjct: 184 IVSPGSTETDM 194


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr2047   DHBDHDRGNASE  82     1e-20    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.
Length = 261

Score = 81.6 bits (201), Expect = 1e-20
Identities = 61/256 (23%), Positives = 109/256 (42%), Gaps = 19/256 (7%)

Query: 3 LSNQKILIVGGGSGMGLALARRCVEAGATVIIAGRSDDRLRQARETLG----NPAGLEVA 58
+ + I G G+G A+AR GA + + ++L + +L +
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 59 VVDIAREDQVAA-LFAEVGGLDHIVSTAADIEGAYRLLPELDLKAAQRMVNSKLFGPLLL 117
V D A D++ A + E+G +D +V+ A + L+ L + + + G
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPG--LIHSLSDEEWEATFSVNSTGVFNA 123

Query: 118 AKHGAPRLAA--SGSMTLISGIAAYRPAARGSVVAAVNAALEGLVRALAVELAP--LRVN 173
++ + + SGS+ + A P + A+ AA + L +ELA +R N
Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183

Query: 174 AVSPGWVDTEIWAQVAGDRKAE--MLAAMAER----LPVGRVGQPEDIADAIFFLIGN-- 225
VSPG +T++ + D ++ E +P+ ++ +P DIADA+ FL+
Sbjct: 184 IVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQA 243

Query: 226 GFTTGTTLHVEGGHRL 241
G T L V+GG L
Sbjct: 244 GHITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr2056   56KDTSANTIGN  31     0.006    Rickettsia 56kDa type-specific antigen protein sign...
		>56KDTSANTIGN#Rickettsia 56kDa type-specific antigen protein signature.
Length = 533

Score = 30.7 bits (69), Expect = 0.006
Identities = 14/53 (26%), Positives = 27/53 (50%), Gaps = 3/53 (5%)

Query: 170 KAAVPPHKLLVYKVTEGWA---PLCDFLGVALPNEPFPNLNDRETIKKIIRDI 219
K + P K+L K+ + ++ P D G+ +P+ PN E I+ I+++
Sbjct: 255 KPSASPVKVLSDKIIQIYSDIKPFADIAGINVPDTGLPNSASIEQIQSKIQEL 307


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
mll2060   OMPADOMAIN  41     1e-06    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 41.5 bits (97), Expect = 1e-06
Identities = 44/192 (22%), Positives = 67/192 (34%), Gaps = 38/192 (19%)

Query: 2 KLALLCSVAFLAAASPVLAADIVEPVAAMPFSWTGFYGGVQAGGGW-NDSRWSGATFDSF 60
K A+ +VA A+ AA P +Y G + G +D+ + +
Sbjct: 3 KTAIAIAVALAGFATVAQAA----PKD------NTWYTGAKLGWSQYHDTGFINNNGPTH 52

Query: 61 NTNGSGGIFGGQIGYNYQINQFVIGIE---GDLAGSTVKG--DGQCSTALGTTCETKQDY 115
G FGG YQ+N +V G E L KG + A G
Sbjct: 53 ENQLGAGAFGG-----YQVNPYV-GFEMGYDWLGRMPYKGSVENGAYKAQGVQ------- 99

Query: 116 LGSVRGRLGYAI-DRILIYGDAGVAFTKY-KIAEVDGFHQSFGGGSRVGWTAGLGAEYAL 173
+ +LGY I D + IY G + + V G + + V G EYA+
Sbjct: 100 ---LTAKLGYPITDDLDIYTRLGGMVWRADTKSNVYGKNHD----TGVSPVFAGGVEYAI 152

Query: 174 TDHWTAGVEWNY 185
T +E+ +
Sbjct: 153 TPEIATRLEYQW 164


13  mll2083  mlr2105  Y        Y        N        Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
mll2083        2       15   3.961675  hypothetical protein
mll2084        0       13   4.186099  hypothetical protein
mlr2085        0       12   3.595901  hypothetical protein
mlr2086       -1       12   3.613833  cellulase
mll2088       -1       12   3.434448  hypothetical protein
mll2090       -1       10   3.291364  hypothetical protein
mlr2091       -1        9   2.405808  hypothetical protein
mlr2092        0       11   1.716111  hypothetical protein
mll2094        1       10   1.427308  transcriptional regulator
mlr2095        2       12   0.562809  hypothetical protein
msr2097        2       17  -0.566771  hypothetical protein
msl8613        2       17  -0.140915  hypothetical protein
mlr2098        3       16  -0.467510  hypothetical protein
mlr2101        1       16  -0.716326  catalase
msl2102        3       23  -2.146896  hypothetical protein
mll2104        2       19  -1.808615  hypothetical protein
mlr2105        2       16  -1.106091  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
mll2090   GPOSANCHOR  32     0.013    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 31.6 bits (71), Expect = 0.013
Identities = 32/224 (14%), Positives = 74/224 (33%), Gaps = 10/224 (4%)

Query: 171 TAKSADKAAKLANAIAQAYLADQASARAKMATDASDSITARLEEQRKRV----QQAENAV 226
+AK A+ A A+ ++A A + A + LE ++ + + E A+
Sbjct: 140 SAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKAL 199

Query: 227 EAYKSAHNMVMAAGNLVSDQELTEINTQLSAAQSRTAALKAQVDQLRRSGGAPDATSEAM 286
E + A + E + + + + +A A+
Sbjct: 200 EGAMNFSTADSAKIKTLEA-EKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAAL 258

Query: 287 RSSVISSLRAQEATLVDQVSQLGTELGPRHPSMIAAQQQLRDTRALIARELGRIGAAAET 346
+ + L ++ + ++ A + + D +
Sbjct: 259 EAR-QAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANR-QSLRR 316

Query: 347 DYERALANQQALEAKVAGMKSKSLDTDQASVRLRELQRDLEAVR 390
D + + ++ LEA+ ++ + +AS + L+RDL+A R
Sbjct: 317 DLDASREAKKQLEAEHQKLEEQ-NKISEAS--RQSLRRDLDASR 357


14  msr2119  mlr2141  Y        Y        N        Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
msr2119        3       14   2.181734  hypothetical protein
mlr2120        4       14   3.077614  transcriptional regulator
msl8614        4       14   3.349187  hypothetical protein
mlr2123        3       13   3.031929  hypothetical protein
mlr2124        4       14   2.928926  lipopolysaccharide modification acyltransferase
mlr2125        4       25   2.301402  hypothetical protein
mll2126        4       20   2.706584  hypothetical protein
msl2128        3       22   2.668807  hypothetical protein
mll2129        3       21   2.320422  hypothetical protein
mll2134        2       21   2.839192  hypothetical protein
mll2135        1       14   2.528318  hypothetical protein
mlr2136        1       15   1.884369  phage repressor protein C
mll2137        3       15   2.144756  hypothetical protein
mlr2138        2       14   2.233069  hypothetical protein
mlr2141        2       14   2.542949  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mlr2120   HTHTETR  40     2e-06    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 40.4 bits (94), Expect = 2e-06
Identities = 23/85 (27%), Positives = 37/85 (43%)

Query: 1 MRYDKGRKDASRSRIMEVASHRFRGDGIAASGLASIMSDAGMTNGAFYPHFQSKADLVRE 60
R K +R I++VA F G++++ L I AG+T GA Y HF+ K+DL E
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 61 SMASALETQSQQLQQALASGGLELA 85
+ + + A +
Sbjct: 62 IWELSESNIGELELEYQAKFPGDPL 86


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mll2129   SECA  29     0.013    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 29.5 bits (66), Expect = 0.013
Identities = 13/55 (23%), Positives = 20/55 (36%)

Query: 130 DSRNQSEKASAFSPENAQHFSAREEKGMGVSSPENAQHLPPAGRNTPTPGFSGKK 184
+ Q + A Q S +++ ++ GRN P P SGKK
Sbjct: 838 EELEQQRRMEAERLAQMQQLSHQDDDSAAAAALAAQTGERKVGRNDPCPCGSGKK 892


15  mlr2155  mll2171  Y        N        N        Genomic Island
LocusTag  DNBias  CDNBias    %GCBias  Product
mlr2155        2       21   0.435051  DNA-binding protein
mll2156        2       24  -0.294787  hypothetical protein
mlr2158        3       24  -0.726584  hypothetical protein
mlr2159        2       22   1.163329  transcriptional regulator
msr8615        1       15   2.930134  hypothetical protein
mlr2160        0       13   3.194985  hypothetical protein
mlr2161        0       12   3.405025  hypothetical protein
mll2162        0       13   3.988125  hypothetical protein
mll2163        2       14   4.581033  hypothetical protein
mll2164        1       16   5.164458  hypothetical protein
mll2166        2       15   4.011426  hypothetical protein
mll2167        2       15   4.232271  arsenate reductase
mll2168        2       15   4.653834  hypothetical protein
mll2170        2       18   5.069148  arsenate reductase
mll2171        2       19   5.409184  transcriptional regulator
16  mll2207  msl2214  Y        Y        N        Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
mll2207        2       12  -0.950952  hypothetical protein
mll2208        2       12  -0.441379  hypothetical protein
mll2209        2       12   0.363421  hypothetical protein
mll2211        2       12   1.504844  morphinone reductase
msl2212        2       11   0.937011  hypothetical protein
msl2214        2       10   0.906347  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll2208   SYCDCHAPRONE  48     2e-08    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.
Length = 168

Score = 47.6 bits (113), Expect = 2e-08
Identities = 17/94 (18%), Positives = 31/94 (32%)

Query: 89 GINPELSAAYYNNGIILVLKGDYDRAITYLDQAIFLDPDNAEFYYNRGVAWSYKGNDERA 148
I+ + Y+ G Y+ A LD ++ F+ G G + A
Sbjct: 30 EISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLA 89

Query: 149 IADYDAAIKLNPGDARAYHNRGLNWARKGDKERA 182
I Y ++ + R + +KG+ A
Sbjct: 90 IHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEA 123



Score = 41.5 bits (97), Expect = 2e-06
Identities = 21/93 (22%), Positives = 32/93 (34%)

Query: 124 LDPDNAEFYYNRGVAWSYKGNDERAIADYDAAIKLNPGDARAYHNRGLNWARKGDKERAI 183
+ D E Y+ G E A + A L+ D+R + G G + AI
Sbjct: 31 ISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAI 90

Query: 184 ADYSQAISLDPKNASSYNNRGDAWDSKGDDDRA 216
YS +D K + + KG+ A
Sbjct: 91 HSYSYGAIMDIKEPRFPFHAAECLLQKGELAEA 123



Score = 38.8 bits (90), Expect = 2e-05
Identities = 17/94 (18%), Positives = 35/94 (37%)

Query: 157 KLNPGDARAYHNRGLNWARKGDKERAIADYSQAISLDPKNASSYNNRGDAWDSKGDDDRA 216
+++ ++ N + G E A + LD ++ + G + G D A
Sbjct: 30 EISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLA 89

Query: 217 MADYNQVIILDTKNAHAYYRRGLIWSRKGDDSRA 250
+ Y+ I+D K + +KG+ + A
Sbjct: 90 IHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEA 123



Score = 38.0 bits (88), Expect = 3e-05
Identities = 23/116 (19%), Positives = 35/116 (30%), Gaps = 7/116 (6%)

Query: 43 IAACTAIIEDKAEASDNRAAAYFNRAGALIRRGDNDDAFADYDKAIGINPELSAAYYNNG 102
IA I D E + A + G +DA + ++ S + G
Sbjct: 25 IAMLNEISSDTLEQLYSLAFNQYQS-------GKYEDAHKVFQALCVLDHYDSRFFLGLG 77

Query: 103 IILVLKGDYDRAITYLDQAIFLDPDNAEFYYNRGVAWSYKGNDERAIADYDAAIKL 158
G YD AI +D F ++ KG A + A +L
Sbjct: 78 ACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQEL 133



Score = 38.0 bits (88), Expect = 3e-05
Identities = 18/95 (18%), Positives = 33/95 (34%)

Query: 200 YNNRGDAWDSKGDDDRAMADYNQVIILDTKNAHAYYRRGLIWSRKGDDSRAIADYSQVIS 259
+ G + A + + +LD ++ + G G AI YS
Sbjct: 39 LYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAI 98

Query: 260 LDPTDPSIRYNKGLAWLRKGDGDRAIADFDEAIRL 294
+D +P ++ L+KG+ A + A L
Sbjct: 99 MDIKEPRFPFHAAECLLQKGELAEAESGLFLAQEL 133



Score = 36.1 bits (83), Expect = 1e-04
Identities = 17/109 (15%), Positives = 37/109 (33%)

Query: 294 LDPKMAAAYYDRGTEWLRKGDRDRAITDYSEVITLEPTNAMALNDRGFVLNELGEYERAL 353
+ Y + G + A + + L+ ++ G +G+Y+ A+
Sbjct: 31 ISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAI 90

Query: 354 ADLNRAIGLDPKQAKIYSNRAIARAAKGDFAPALADYNQAIALDPNFPN 402
+ +D K+ + + A KG+ A A + A L +
Sbjct: 91 HSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTE 139



Score = 35.7 bits (82), Expect = 2e-04
Identities = 17/85 (20%), Positives = 25/85 (29%)

Query: 234 YYRRGLIWSRKGDDSRAIADYSQVISLDPTDPSIRYNKGLAWLRKGDGDRAIADFDEAIR 293
Y + G A + + LD D G G D AI +
Sbjct: 39 LYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAI 98

Query: 294 LDPKMAAAYYDRGTEWLRKGDRDRA 318
+D K + L+KG+ A
Sbjct: 99 MDIKEPRFPFHAAECLLQKGELAEA 123



Score = 31.8 bits (72), Expect = 0.003
Identities = 22/96 (22%), Positives = 32/96 (33%)

Query: 339 RGFVLNELGEYERALADLNRAIGLDPKQAKIYSNRAIARAAKGDFAPALADYNQAIALDP 398
F + G+YE A LD ++ + R A G + A+ Y+ +D
Sbjct: 42 LAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDI 101

Query: 399 NFPNAYAGRGFVNFYSGMLAKAEPDFAKAAALAPDN 434
P G LA+AE A L D
Sbjct: 102 KEPRFPFHAAECLLQKGELAEAESGLFLAQELIADK 137



Score = 30.3 bits (68), Expect = 0.012
Identities = 20/111 (18%), Positives = 32/111 (28%)

Query: 252 ADYSQVISLDPTDPSIRYNKGLAWLRKGDGDRAIADFDEAIRLDPKMAAAYYDRGTEWLR 311
+ + + Y+ + G + A F LD + + G
Sbjct: 23 GTIAMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQA 82

Query: 312 KGDRDRAITDYSEVITLEPTNAMALNDRGFVLNELGEYERALADLNRAIGL 362
G D AI YS ++ L + GE A + L A L
Sbjct: 83 MGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQEL 133


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll2209   SYCDCHAPRONE  54     1e-10    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.
Length = 168

Score = 53.8 bits (129), Expect = 1e-10
Identities = 25/89 (28%), Positives = 34/89 (38%)

Query: 151 DYVKAIADFDKAIRLDPENNGLYNLRGNAYLRKGDYDQAITSYSQAIFLDSQDPNQYFNL 210
Y A F LD ++ + G G YD AI SYS +D ++P F+
Sbjct: 51 KYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHA 110

Query: 211 GLAWTTKGNLERAIADYSQAISLDANHAE 239
KG L A + A L A+ E
Sbjct: 111 AECLLQKGELAEAESGLFLAQELIADKTE 139



Score = 48.4 bits (115), Expect = 1e-08
Identities = 28/112 (25%), Positives = 42/112 (37%), Gaps = 3/112 (2%)

Query: 156 IADFDKAIRLDPENNGLYNLRGNAYLRKGDYDQAITSYSQAIFLDSQDPNQYFNLGLAWT 215
IA ++ E LY+L N Y G Y+ A + LD D + LG
Sbjct: 25 IAMLNEISSDTLEQ--LYSLAFNQYQ-SGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQ 81

Query: 216 TKGNLERAIADYSQAISLDANHAEAYRWRADAWVKRGDTDQALSDYTEAIRL 267
G + AI YS +D A+ +++G+ +A S A L
Sbjct: 82 AMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQEL 133



Score = 39.5 bits (92), Expect = 1e-05
Identities = 19/92 (20%), Positives = 34/92 (36%)

Query: 345 NLGLAWWDKGDLDRAISAFDQAVIVDPKYAPAYNDRGLARMDKNQYDLAIADYNMAILID 404
+L + G + A F ++D + + G R QYDLAI Y+ ++D
Sbjct: 41 SLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMD 100

Query: 405 AGFVSAYRNRGNAWNRKGQFDYAIADFDQAID 436
+ +KG+ A + A +
Sbjct: 101 IKEPRFPFHAAECLLQKGELAEAESGLFLAQE 132



Score = 35.3 bits (81), Expect = 3e-04
Identities = 18/100 (18%), Positives = 28/100 (28%)

Query: 267 LDPGDAETFRNRARIWERKRDYDRAIADYDQAIAFAPNDAVAYNGRGWMWSLKHETDRAI 326
+ E + A + Y+ A + D+ + G G + D AI
Sbjct: 31 ISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAI 90

Query: 327 VDYVKATAFDPNYVLAYDNLGLAWWDKGDLDRAISAFDQA 366
Y D + KG+L A S A
Sbjct: 91 HSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLA 130



Score = 32.6 bits (74), Expect = 0.003
Identities = 14/108 (12%), Positives = 29/108 (26%), Gaps = 1/108 (0%)

Query: 98 PKDAEAFNNRGLIWGHKKDFDRALADYDKAIELNPQIAIAYANRGLIWNDIKHDYVKAIA 157
E + ++ A + L+ + + G Y AI
Sbjct: 33 SDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQA-MGQYDLAIH 91

Query: 158 DFDKAIRLDPENNGLYNLRGNAYLRKGDYDQAITSYSQAIFLDSQDPN 205
+ +D + L+KG+ +A + A L +
Sbjct: 92 SYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTE 139



Score = 28.7 bits (64), Expect = 0.045
Identities = 15/96 (15%), Positives = 31/96 (32%), Gaps = 2/96 (2%)

Query: 377 YNDRGLARMDKNQYDLAIADYNMAILIDAGFVSAYRNR-GNAWNRKGQFDYAIADFDQAI 435
+ +Y+ A + ++D + S + G GQ+D AI +
Sbjct: 39 LYSLAFNQYQSGKYEDAHKVFQALCVLDH-YDSRFFLGLGACRQAMGQYDLAIHSYSYGA 97

Query: 436 DHDPDDADAYVGRGRSRIYKADYTKAIADLDQAIRI 471
D + + K + +A + L A +
Sbjct: 98 IMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQEL 133


17  mll2226  mll2256  Y        N        N        Genomic Island
LocusTag  DNBias  CDNBias    %GCBias  Product
mll2226        2       19  -0.207662  hypothetical protein
mll2227        1       21  -0.134951  hypothetical protein
msl2230        2       23  -0.310098  hypothetical protein
mll2232        1       22  -0.270019  molecular chaperone GroEL
mll2233        2       20  -2.112459  co-chaperonin GroES
mlr2234        0       12   1.155054  hypothetical protein
msr2235       -1       12   2.899418  hypothetical protein
msl2237       -3       13   2.201443  hypothetical protein
msl2238       -2       15   2.222674  hypothetical protein
msl2239       -2       16   2.120314  hypothetical protein
mll2240       -1       17   2.377508  hypothetical protein
mlr2242        1       17   1.743282  transcriptional regulator
mlr2244        2       17   1.152613  sugar binding protein of sugar ABC transporter
mlr2245        3       14   1.198675  sugar ABC transporter permease
mlr2246        3       14   1.277581  sugar ABC transporter permease
mlr2247        2       14   1.174176  hypothetical protein
mll2248        2       15   0.434709  hypothetical protein
mll2249        0       15   1.518051  hypothetical protein
mll2250       -2       13   0.900655  hypothetical protein
mll2252        0       14   0.452164  lactoylglutathione lyase
mll2253        0       16   0.482457  hypothetical protein
mll2255        1       14   1.667515  transcriptional regulator
mll2256        1       10   3.033662  cation transport protein
18  mlr2365  msr2423  Y        Y        Y        Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
mlr2365        2        9   0.330181  hypothetical protein
mlr2366        1       12  -0.405960  hypothetical protein
mll2368        1       14  -1.258643  hypothetical protein
mlr2370        1       17  -2.367413  hypothetical protein
mll2372        1       18  -3.073930  hypothetical protein
mll2374        0       19  -3.183616  hypothetical protein
mlr2375        1       26  -4.678319  hypothetical protein
mlr2376        0       28  -4.779980  hypothetical protein
mll2378        2       21  -4.539043  hypothetical protein
msr2379        2       18  -4.188182  hypothetical protein
mlr2380        2       17  -4.138574  transporter
mlr2382        2       18  -4.190726  hypothetical protein
mll2384        2       18  -4.253675  sensor/response regulator hybrid protein
mll2385        3       16  -4.228724  sensory histidine kinase
mll2386        2       24  -4.255996  small heat shock protein
mll2387        3       22  -3.757396  small heat shock protein
msr2388        3       15  -1.435497  hypothetical protein
mlr2389        2       13  -1.693168  hypothetical protein
msl2390        3       11  -1.445493  hypothetical protein
mll2392        3       11  -1.405549  hypothetical protein
mlr2393        3       10  -1.661402  co-chaperonin GroES
mlr2394        3       12  -1.260461  molecular chaperone GroEL
mll2397        2       17  -1.714782  transcriptional regulator
mlr2398        3       18  -1.805015  hypothetical protein
mlr2399        3       16  -1.706067  acetoacetate decarboxylase
mlr2400        1       18  -2.179048  3-hydroxybutyrate dehydrogenase
mlr2403        1       18  -1.676365  RND efflux membrane fusion protein
mlr2404        0       17  -1.867505  RND efflux transporter
msr2405        0       19  -1.882820  hypothetical protein
mlr2406        2       13   0.361120  hypothetical protein
mlr2407        1       12   0.625815  hypothetical protein
mlr2408        1       14   2.677616  *hypothetical protein
mll2410        0       15   3.335188  hypothetical protein
mll2411        0       13   3.658392  hypothetical protein
mlr2412       -1       13   3.547718  hypothetical protein
mll2413        2       15   3.890788  short chain dehydrogenase
mll2414        2       14   3.728147  response regulatory protein
mll2416        1       11   3.152304  serine protease
mlr2417        0       12   2.049072  serine protease
mll2418        2       13   0.477593  hypothetical protein
mlr2419        2       15   0.903710  transcriptional regulator
mll2420        2       13   0.523461  transcriptional regulator
mlr2421        1       11   0.219427  ACP phosphodiesterase
msr2423        4       11   0.925607  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mlr2365   PF03544  31     0.012    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 31.1 bits (70), Expect = 0.012
Identities = 17/101 (16%), Positives = 26/101 (25%), Gaps = 1/101 (0%)

Query: 149 KAQRQTAAKPSTAAPDDQQPTSPAPAPAAPNDQPGDLANDLQAPPDAGEIPAGS-APKAD 207
A P + P P P P + P + P + PK D
Sbjct: 58 ADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRD 117

Query: 208 ADDPEAGWGETIDQGEAALPAFKKTAIENNTTVATVTSEYQ 248
E+ + A P + V +V S +
Sbjct: 118 VKPVESRPASPFENTAPARPTSSTATAATSKPVTSVASGPR 158


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
mll2368   OMPADOMAIN  99     1e-27    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 99.2 bits (247), Expect = 1e-27
Identities = 40/113 (35%), Positives = 59/113 (52%), Gaps = 12/113 (10%)

Query: 65 FALDSAQLDQTARAELDEFAKALKDNRLSTFSFVVEGHTDATGPDRYNQDLSQRRAQSVA 124
F + A L +A LD+ L + S VV G+TD G D YNQ LS+RRAQSV
Sbjct: 223 FNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQSVV 282

Query: 125 AFLEANGVESVRLEAIGLGKSHPRVANPYDPV------------NRRVEMRIR 165
+L + G+ + ++ A G+G+S+P N D V +RRVE+ ++
Sbjct: 283 DYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEVK 335


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
mll2374   OMPADOMAIN  87     4e-21    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 86.5 bits (214), Expect = 4e-21
Identities = 39/113 (34%), Positives = 54/113 (47%), Gaps = 11/113 (9%)

Query: 259 IYFRPASARLDAKSRPLLTEVEGVVGKC--PTLKVEVSGYTDSDGSPEANKALSERRAQA 316
+ F A L + + L ++ + V V GYTD GS N+ LSERRAQ+
Sbjct: 221 VLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQS 280

Query: 317 VAEALVAGGVPRQQISAAGHGEENPVAANDTPKNK---------ALNRRIEFS 360
V + L++ G+P +ISA G GE NPV N K A +RR+E
Sbjct: 281 VVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIE 333


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mlr2382   PF06580  28     0.017    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 28.3 bits (63), Expect = 0.017
Identities = 25/139 (17%), Positives = 48/139 (34%), Gaps = 15/139 (10%)

Query: 40 LLGWVLIVSGGLQGISLIGAGHVPHFWLQLISVILALLIGLLF---LRHPGNGLLTITLL 96
+GW + G SL G +I I L+GL+ R + L
Sbjct: 17 GIGWGVYTLTGFGFASLYG----SPKLHSMIFNIAISLMGLVLTHAYRSFIKRQGWLKLN 72

Query: 97 LIVFFMIEGIAKVVFALTIRPFPNWGWVLGSGLIGILLSVTLWVRIPVTAVCLIGLLLGI 156
+ + A VV + W + + I LL+ + T + ++ +
Sbjct: 73 MGQIILRVLPACVVIGMV--------WFVANTSIWRLLAFINTKPVAFTLPLALSIIFNV 124

Query: 157 ELISVGAAIAYLAWHVRKS 175
+++ ++ Y WH K+
Sbjct: 125 VVVTFMWSLLYFGWHFFKN 143


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
mll2384   HTHFIS  70     8e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 70.2 bits (172), Expect = 8e-15
Identities = 29/124 (23%), Positives = 54/124 (43%), Gaps = 4/124 (3%)

Query: 564 PSILLIDDSSVLRQLTAQSLQQRGFVVTCAAGSAEALAIIERAPHEFDVIVTDFAMPLVS 623
+IL+ DD + +R + Q+L + G+ V + +A I D++VTD MP +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD--GDLVVTDVVMPDEN 61

Query: 624 GVEVIRFARNLRSDWPAIIITGYADADSI--ADRPSDVPLLNKPFREKDLIESIFHVIAH 681
+++ + R D P ++++ + A L KPF +LI I +A
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 682 ASAK 685
+
Sbjct: 122 PKRR 125



Score = 67.2 bits (164), Expect = 1e-13
Identities = 32/154 (20%), Positives = 62/154 (40%), Gaps = 26/154 (16%)

Query: 23 KARVLAVDDDERNLLAIQEVLAPI-----AEIVAVRSGEEALRCLLKQDFAVILLDVLMP 77
A +L DDD AI+ VL ++ + R + D +++ DV+MP
Sbjct: 3 GATILVADDDA----AIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMP 58

Query: 78 GLDGYETAGLIRQREQSKRTPLIFLTAINKEDAHMLRGYDAGAVDYVFKPFDPVMLRSKV 137
+ ++ I++ P++ ++A N ++ + GA DY+ KPFD
Sbjct: 59 DENAFDLLPRIKKAR--PDLPVLVMSAQNTFMT-AIKASEKGAYDYLPKPFDL------- 108

Query: 138 AVFVELHEKTLEIQRKAIAEQALLAKALQAEKEK 171
+ + I +A+AE L+ + +
Sbjct: 109 -------TELIGIIGRALAEPKRRPSKLEDDSQD 135


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
mll2385   HTHFIS  74     3e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 73.7 bits (181), Expect = 3e-15
Identities = 29/131 (22%), Positives = 56/131 (42%), Gaps = 3/131 (2%)

Query: 1485 GAKVLIVDDDIRNIYSLTSVLETYDIEVMHAERGREGIALLEQVPDVDAALIDIMMPEMD 1544
GA +L+ DDD L L +V + D + D++MP+ +
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD-GDLVVTDVVMPDEN 61

Query: 1545 GYETMRQIRNTPAIAHIPLISVTAKAMKGDRQKCLDAGASDYIAKPVDLDLLLALLRVWI 1604
++ + +I+ A +P++ ++A+ K + GA DY+ KP DL L+ ++ +
Sbjct: 62 AFDLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119

Query: 1605 GRSRARVETSE 1615
+ R E
Sbjct: 120 AEPKRRPSKLE 130



Score = 44.8 bits (106), Expect = 2e-06
Identities = 33/185 (17%), Positives = 57/185 (30%), Gaps = 31/185 (16%)

Query: 1227 VLIVEDDPTFGGLLLGLARSAGLKGVLSTAGSGTLALA--RKLVPDAITLDLGLSDIDGW 1284
+L+ +DD +L AG + + D + D+ + D + +
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYD--VRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 1285 VLFDLLR-HDPKTSGIPVHVISGAEDIDDLAS---KGASSISTKPVSSDELMNVFQDIHS 1340
L ++ P +PV V+S KGA KP E
Sbjct: 64 DLLPRIKKARP---DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTE---------- 110

Query: 1341 RKLRIQRRVLVADPDPERRLSLVEAIRDGVTSVTAIGRVAANADDIEL----ASYDAVVL 1396
+ I R L E + + D + +GR AA + + D ++
Sbjct: 111 -LIGIIGRAL-----AEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLM 164

Query: 1397 GFGRS 1401
G S
Sbjct: 165 ITGES 169


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mll2397   HTHTETR  67     7e-16    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 67.0 bits (163), Expect = 7e-16
Identities = 35/214 (16%), Positives = 72/214 (33%), Gaps = 18/214 (8%)

Query: 10 AEIGREKRERTRTLIVEAGAMLLAERPREGLTVDAVVEAAGVAKGTFYYHFQSIDELASA 69
A +++ + TR I++ L +++ ++ + +AAGV +G Y+HF+ +L S
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 70 VGEKLGESF-DAVLTPARLELQDPVERLTFAFTRFLEKAISDSNWARLVVQSSHSP---- 124
+ E + + L DP+ L LE +++ L+ H
Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVG 121

Query: 125 ---------TEFARGIRNNLKADIAEAIVQGRL-SLRDAELAVDIVIGIWLQVTRGILER 174
+ ++ + I L + A I+ G + L
Sbjct: 122 EMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFA 181

Query: 175 GARPELT---GQAVEAVLRALGSSQSEQRKATKK 205
+L V +L + + AT +
Sbjct: 182 PQSFDLKKEARDYVAILLEMYLLCPTLRNPATNE 215


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr2400   DHBDHDRGNASE  107    2e-30    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.
Length = 261

Score = 107 bits (269), Expect = 2e-30
Identities = 62/253 (24%), Positives = 112/253 (44%), Gaps = 8/253 (3%)

Query: 3 KVVVVTGAASGIGKEIALTFARKGAKVVIADLDLDAAEETAREIDPAALRALGVGMDVSN 62
K+ +TGAA GIG+ +A T A +GA + D + + E+ + A A DV +
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 63 EDQVESGISRVVETFGRIDVLVSNAGVQTVAPLVEFDFDKWRKLLSIHLDGAFLTTRAAL 122
++ +R+ G ID+LV+ AGV + ++W S++ G F +R+
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 123 RQMYRQNSGSIIYMGSVHSKEASPFKAPYVTAKHGLIGLAKVVAKEGAAHGVRANVICPG 182
+ M + SGSI+ +GS + A Y ++K + K + E A + +R N++ PG
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 183 FVRTPLVEKQIPEQARELGISPEDVVKTMMLRETVD---GEFTTVQDVAETALFLAAFPS 239
T + ++ E V+K + + D+A+ LFL + +
Sbjct: 189 STETDMQWSLWADENGA-----EQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQA 243

Query: 240 NALTGQSIVVSHG 252
+T ++ V G
Sbjct: 244 GHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr2403   RTXTOXIND     54     3e-10    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 54.4 bits (131), Expect = 3e-10
Identities = 32/209 (15%), Positives = 71/209 (33%), Gaps = 15/209 (7%)

Query: 77 DMGAIVKKGQKLAELSAVDYQNKVTAAEADVDAAKAALAQA--SAQEERFRILLGKGFAT 134
D +++ K +A+ + ++ +NK A ++ K+ L Q + L T
Sbjct: 239 DFSSLLHKQA-IAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQL----VT 293

Query: 135 HSQYDEALKSLQSARAQVQATEANLRIARNQLSYTQLTATDDGVVTATGA-DPGQVVAAG 193
+E L L+ + L + + + A V G VV
Sbjct: 294 QLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTA 353

Query: 194 QMVVEVSGNDAREAVFA-VATSDVTRAKLGMAVNVSLQ---GRLDIAVTGTIREISPEA- 248
+ ++ + D V A V D+ +G + ++ + G ++ I+ +A
Sbjct: 354 ETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAI 413

Query: 249 -DSATGT-YQVKVALASPPSEMRLGAVVI 275
D G + V +++ + +
Sbjct: 414 EDQRLGLVFNVIISIEENCLSTGNKNIPL 442



Score = 38.3 bits (89), Expect = 4e-05
Identities = 21/109 (19%), Positives = 37/109 (33%), Gaps = 9/109 (8%)

Query: 67 VGGRMLSRQVDMGAIVKKGQKLAELSAVDYQNKVTAAEADVDAAKAALAQASAQEERFRI 126
+ V G V+KG L +L+A AEAD +++L QA ++ R++I
Sbjct: 103 ENSIVKEIIVKEGESVRKGDVLLKLTA-------LGAEADTLKTQSSLLQARLEQTRYQI 155

Query: 127 LLG--KGFATHSQYDEALKSLQSARAQVQATEANLRIARNQLSYTQLTA 173
L + Q+ + +L + Q
Sbjct: 156 LSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQ 204


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr2404   ACRIFLAVINRP  462    e-148    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 462 bits (1189), Expect = e-148
Identities = 223/1047 (21%), Positives = 420/1047 (40%), Gaps = 64/1047 (6%)

Query: 7 LSEWAVHNRALVVFLMLICVIGGVSAYERLGRQEDPDFTVQTMVVQANWPGATTADTLKQ 66
++ + + L +I ++ G A +L + P + V AN+PGA
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 67 VTDRIEKKLEETPNLDYIKSYT-KPGQATIFVYLKESTPKRDLSDIWYQVRKKVSDIGPT 125
VT IE+ + NL Y+ S + G TI + + T D QV+ K+ P
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGT---DPDIAQVQVQNKLQLATPL 117

Query: 126 LPQGVVGP-FFNDEFGDVFGTVYGITYDG--FSAREARDFAE-TARGEFLRAPDVGKVDI 181
LPQ V ++ + V G D + + D+ + R VG V +
Sbjct: 118 LPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQL 177

Query: 182 YGDQDEKVYLNFSPQKLANLKLNLDDVLAAIARQNAVAPSGIINTPQE------NMLVDV 235
+G Q + + L KL DV+ + QN +G + N +
Sbjct: 178 FGAQYA-MRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIA 236

Query: 236 TGSLLSSDGIANLNLWI--DGRFYKLTDIAQVQRGYSDPPSKMFRINGKPAIGIGVNMRE 293
+ + + L + DG +L D+A+V+ G + + RINGKPA G+G+ +
Sbjct: 237 QTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNV-IARINGKPAAGLGIKLAT 295

Query: 294 GGNNLDFGKGLHEAAERLKQRFPVGIELNLVSDQPEVVHEAIGGFTEALVEAIVIVLVVS 353
G N LD K + L+ FP G+++ D V +I + L EAI++V +V
Sbjct: 296 GANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVM 355

Query: 354 FLSLG-FRAGLVVALSIPLVLAIVFVAMDALGISLQRISLGALIIALGLLVDDAMITIEM 412
+L L RA L+ +++P+VL F + A G S+ +++ +++A+GLLVDDA++ +E
Sbjct: 356 YLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVEN 415

Query: 413 MISKI--EEGMEKIKAATFAYTSTAFPMLTGTLITILGFLPIGFANSNTGQYCFSLFVVI 470
+ ++ E+ + +A + + ++ ++ F+P+ F +TG + I
Sbjct: 416 -VERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITI 474

Query: 471 AVALVASWFVAVVFAPVIGLSILPSHTKKAQANAEPGRFMRAFERLLGFAM--------- 521
A+ S VA++ P + ++L + + N G F F ++
Sbjct: 475 VSAMALSVLVALILTPALCATLLKPVSAEHHEN--KGGFFGWFNTTFDHSVNHYTNSVGK 532

Query: 522 --RHRWPTIAAALILFSASLYGMGFVQQQFFPTSNRPELLVTMTLPKNASIAATQAQTER 579
+ ++ + + + F P ++ L + LP A+ TQ ++
Sbjct: 533 ILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQ 592

Query: 580 LEKALAGDPDIARFSSYVGGGAIRFYLPLDVQLDNDFMAETVVVTKDLKARDRVQARLET 639
+ + S + G Q N MA V K + R+ + E
Sbjct: 593 VTDYYLKNEKANVESVFTVNG-----FSFSGQAQNAGMA--FVSLKPWEERNGDENSAEA 645

Query: 640 LFA------GSFPD---VAVRISR-LELGPPVGWPVQ-YRVSAPTTEEARQYAEQVAQTL 688
+ G D + + +ELG G+ + + + Q Q+
Sbjct: 646 VIHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMA 705

Query: 689 R-ASGLVRNVNYDWAEKNKALRIVVDQDRVRQAGLSSEELAQALNRVISGSTVTQIRDSI 747
+ +V + E ++ VDQ++ + G+S ++ Q ++ + G+ V D
Sbjct: 706 AQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRG 765

Query: 748 YLVDVVARAESDERSSVEALRNLQITTPTGASVPLRELAQFQYDLDDGYVWRRGRLPTIT 807
+ + +A++ R E + L + + G VP + + R LP++
Sbjct: 766 RVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSME 825

Query: 808 VQAEPLPGLQPASVHGRIAGAIEGLRKSMPAGTLLETGGTVEKSAQSNAALLAQFPLMIT 867
+Q E PG + G +E L +PAG + G + S A +
Sbjct: 826 IQGEAAPG----TSSGDAMALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFV 881

Query: 868 LMLTVLMVQLGSFRQLAMVISVAPLGLIGVAAALLTTNTPMGFIATLGIIALAGMIIRNS 927
++ L S+ V+ V PLG++GV A N +G++ G+ +N+
Sbjct: 882 VVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNA 941

Query: 928 VILVHQIEH-ERAQGIEPWKAVIDATTHRFRPIMLTAAAAILGMIPIMHDVFWG-----P 981
+++V + +G +A + A R RPI++T+ A ILG++P+ G
Sbjct: 942 ILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNA 1001

Query: 982 MAFAIVGGLAVATVLTLVFLPALYVAV 1008
+ ++GG+ AT+L + F+P +V +
Sbjct: 1002 VGIGVMGGMVSATLLAIFFVPVFFVVI 1028


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll2410   VACCYTOTOXIN  27     0.037    Helicobacter pylori vacuolating cytotoxin signature.
		>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature.

Length = 1291

Score = 27.3 bits (60), Expect = 0.037
Identities = 19/69 (27%), Positives = 31/69 (44%), Gaps = 8/69 (11%)

Query: 6 VNSSGRLAHSPIIAFKRMGA----SPQQGEKTMFNTVKTAALSALIGLGALTAVPAHADS 61
+ + R + P+++ +GA +PQQ F TV + A++G G T S
Sbjct: 3 IQQTHRKINRPLVSLALVGALVSITPQQSHAAFFTTV---IIPAIVG-GIATGAAVGTVS 58

Query: 62 LYLGFGNNQ 70
LG+G Q
Sbjct: 59 GLLGWGLKQ 67


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr2412   OMPADOMAIN    71     4e-15    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 70.7 bits (173), Expect = 4e-15
Identities = 28/114 (24%), Positives = 51/114 (44%), Gaps = 14/114 (12%)

Query: 639 FEFGSSSISDTEVQKLEGVASAMEKLLKKNPAETFLIEGHTDAVGTPEANLALSDRRAEA 698
F F +++ L+ + S + L K+ + + G+TD +G+ N LS+RRA++
Sbjct: 223 FNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVV--VLGYTDRIGSDAYNQGLSERRAQS 280

Query: 699 VAEALTNAFGIPPENLTTQGYGEQY-----------LKVNTQAPNRENRRVAIR 741
V + L + GIP + ++ +G GE + +RRV I
Sbjct: 281 VVDYLI-SKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIE 333


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll2413   DHBDHDRGNASE  65     2e-14    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 65.1 bits (158), Expect = 2e-14
Identities = 51/209 (24%), Positives = 88/209 (42%), Gaps = 22/209 (10%)

Query: 3 LKGKTLFISGGSRGIGLAIALRAARDGANVTIAAKTAEPHPKLPGTIYSAAQEIEQAGGK 62
++GK FI+G ++GIG A+A A GA++ E K+ ++ + A+ E
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEA---- 61

Query: 63 ALPVLCDIREEAQVAEAVAKTVEKFGGIDICVNNASAIQLTGTLQTDMKRYDLMHQINTR 122
P D+R+ A + E A+ + G IDI VN A ++ + ++ +N+
Sbjct: 62 -FPA--DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNST 118

Query: 123 GTFLVSKMCIPHLKLADNPHILNLA------PPLDMKAKWFKNHVAYTMAKFGMSMCTLG 176
G F S+ ++ + I+ + P M AY +K M T
Sbjct: 119 GVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMA--------AYASSKAAAVMFTKC 170

Query: 177 MSAEFAKDGIAVNSLWPISTIDTAAVRNL 205
+ E A+ I N + P ST +T +L
Sbjct: 171 LGLELAEYNIRCNIVSPGST-ETDMQWSL 198


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll2416   DNABINDINGHU  27     0.042    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 27.0 bits (60), Expect = 0.042
Identities = 14/55 (25%), Positives = 20/55 (36%), Gaps = 13/55 (23%)

Query: 89 VTAEEAVEA-GEEIELTLASGVTV------------KAELVGRDPSTGVALLKPA 130
+ AV+A + LA G V +A GR+P TG + A
Sbjct: 20 KDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGRNPQTGEEIKIKA 74


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr2417   V8PROTEASE    81     2e-19    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 81.2 bits (200), Expect = 2e-19
Identities = 44/211 (20%), Positives = 77/211 (36%), Gaps = 30/211 (14%)

Query: 49 TTVADAVDRIGPAVCRIERIGGQGGH-GSGFVIAPDGLVVTNFHVV----GDARTVRV-- 101
+ D + V I+ G SG V+ D ++TN HVV GD ++
Sbjct: 77 HQITDTTNGHYAPVTYIQVEAPTGTFIASGVVVGKD-TLLTNKHVVDATHGDPHALKAFP 135

Query: 102 ------SMPDGASSEGRVLGRDPDTDIALV--------RADGSFTDVAPLGDSKRLRRGQ 147
+ P+G + ++ + D+A+V + G A + ++ + Q
Sbjct: 136 SAINQDNYPNGGFTAEQITKYSGEGDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQ 195

Query: 148 IAIAIGNPLGFEWTVTSGVVSALGRSMRASTGRLIDDVIQTDAALNPGNSGGPLVSSAGE 207
G P + + S T L + +Q D + GNSG P+ + E
Sbjct: 196 NITVTGYPGDKPV-------ATMWESKGKIT-YLKGEAMQYDLSTTGGNSGSPVFNEKNE 247

Query: 208 VIGVNTAMIHGAQGIAFAVASNTANFVISEI 238
VIG++ + A + N NF+ I
Sbjct: 248 VIGIHWGGVPNEFNGAVFINENVRNFLKQNI 278


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr2419   HTHTETR       65     3e-15    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 65.0 bits (158), Expect = 3e-15
Identities = 33/108 (30%), Positives = 53/108 (49%), Gaps = 1/108 (0%)

Query: 8 RSNRDRTEATRADLIAAARKLFTEKSYAETGTPEIVTAAGVTRGALYHHFADKQALFAAV 67
R + + TR ++ A +LF+++ + T EI AAGVTRGA+Y HF DK LF+ +
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 68 VEQEAQAVAQ-EIERASPSSLEARDALIAGSDAYLDAMRAPGRTRLLL 114
E + + E+E + + L L++ R RLL+
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLM 110


19  mll2454  mll2472  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mll2454   2       15       2.607243   transcriptional regulator
mlr2456   3       15       2.040178   hypothetical protein
msr2457   3       15       1.337504   hypothetical protein
mll2459   0       14       0.888106   hypothetical protein
mll2461   0       12       0.784954   hypothetical protein
mll2462   0       12       0.677411   hypothetical protein
msl2463   1       13       0.621729   hypothetical protein
mll2465   0       15       1.322767   hypothetical protein
mll2466   0       13       1.380170   RNA polymerase sigma factor RpoD
mll2467   -1      8        1.804889   DNA primase
mll2469   2       11       3.181090   hypothetical protein
mlr2470   2       12       3.222578   hypothetical protein
mlr2471   1       10       3.285688   transporter
mll2472   0       11       3.011284   transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll2454   HTHTETR       55     2e-11    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 54.6 bits (131), Expect = 2e-11
Identities = 30/210 (14%), Positives = 70/210 (33%), Gaps = 19/210 (9%)

Query: 1 MALDKEETGERVLAIAEVLLNEGGMDNLKARTIAEQAGISVGSVYNLFSDLDGVHRAVNM 60
+ +ET + +L +A L ++ G+ + IA+ AG++ G++Y F D + +
Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64

Query: 61 RLLDRLGAAGAAAMADLSQRGITDVRQRLLALAGAYVNFVEGHPGSWPALLAFNRRRPTL 120
+G A ++ +R+ L+ + + V L+ +
Sbjct: 65 LSESNIGELELEYQAKFPGDPLSVLREILIHVLESTV-----TEERRRLLMEIIFHKCEF 119

Query: 121 AEPDAYEARLDQLFE---------IIAGVLAGGDFDLDDDTRRIAARTLWSSVHGIVTSG 171
A + + + + D TRR A + G ++
Sbjct: 120 VGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAA-----IIMRGYISGL 174

Query: 172 YAGRSVRRQAGEIDQQIELLVAVFIRGLER 201
Q+ ++ ++ VA+ +
Sbjct: 175 MENWLFAPQSFDLKKEARDYVAILLEMYLL 204


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll2469   PF06776       28     0.011    Invasion associated locus B
		>PF06776#Invasion associated locus B

Length = 214

Score = 27.6 bits (61), Expect = 0.011
Identities = 11/38 (28%), Positives = 17/38 (44%), Gaps = 3/38 (7%)

Query: 6 LSVLAVSMMAATSLAGQAAPTNAPVAPQSNYTKVDWQK 43
+LA +M A S + +A A +S + DWQ
Sbjct: 51 RLMLAGAMAIALSFG-WSDRADAQGAVRSVHG--DWQI 85


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr2471   TCRTETB       46     2e-07    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 46.0 bits (109), Expect = 2e-07
Identities = 38/165 (23%), Positives = 65/165 (39%), Gaps = 1/165 (0%)

Query: 5 LIALFIAAFAFGTTEFVIAGVLPQVAEGLGVSVPSAGYLVSGYACGIAIGGPLLALVTKS 64
LI L I +F E V+ LP +A S ++ + + +IG + ++
Sbjct: 16 LIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ 75

Query: 65 LPRKTLLLGLAIAFTIGQAACALAPDFTSMLLL-RIAVAVAHGAYFGVAMVVAVGLVPED 123
L K LLL I G + F S+L++ R A+ + MVV +P++
Sbjct: 76 LGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKE 135

Query: 124 KRGMAVAVILSGLTVSNVIGVPAGTAIGNIWGWRATFWVMCALGV 168
RG A +I S + + +G G I + W + +
Sbjct: 136 NRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITII 180


20  mlr2503  msr2529  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mlr2503   1       13       3.433855   hypothetical protein
mll2505   3       15       4.337975   transcriptional regulator
mlr2506   2       12       3.920749   hypothetical protein
mll2507   2       14       4.066282   short-chain dehydrogenase
mlr2509   0       13       4.783415   transcriptional regulator
mll2510   0       18       3.621629   transporter
mlr2511   -1      11       -0.401753  regulatory protein
mll2512   -1      9        -0.570817  hypothetical protein
mlr2514   -1      9        -0.785115  hypothetical protein
mlr2516   -1      10       -1.086485  hypothetical protein
mlr2517   -1      10       -1.728943  carbamoyl phosphate synthase large subunit
mll2518   0       23       -4.491848  hypothetical protein
mll2519   2       13       1.556054   translation initiation inhibitor
mll2520   1       12       2.036404   translation initiation inhibitor
mll2521   1       11       2.203928   hypothetical protein
mll2522   1       12       1.953263   hypothetical protein
mll2524   3       13       0.511240   acetyltransferase
mll2525   4       16       -0.257742  hypothetical protein
mlr2526   2       18       -1.626818  methylated-DNA-protein-cystein
mll2527   2       24       -2.721489  hypothetical protein
msr2529   2       21       -1.523891  cold shock protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll2507   DHBDHDRGNASE  65     1e-14    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 65.5 bits (159), Expect = 1e-14
Identities = 55/193 (28%), Positives = 90/193 (46%), Gaps = 4/193 (2%)

Query: 10 AVVTGASSGIGAIYADRLAGQGYDLVLVARRADRLEELAEKLRYAYDRKVSVISADLSDD 69
A +TGA+ GIG A LA QG + V ++LE++ L+ A R AD+ D
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLK-AEARHAEAFPADVRDS 69

Query: 70 DDVRRVEQAISAD-DSVTLLVNNAGLGGQQVVATADADAAERMIKVNVIALTRLTRAVLP 128
+ + I + + +LVN AG+ ++ + + E VN + +R+V
Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSK 129

Query: 129 GLLARNRGAIVNIASVLAYETSFG-GIYSGTKAYVVNFTEALHREVAGTGVKVQVVLPGA 187
++ R G+IV + S A Y+ +KA V FT+ L E+A ++ +V PG+
Sbjct: 130 YMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGS 189

Query: 188 TRTDF-WELAGSD 199
T TD W L +
Sbjct: 190 TETDMQWSLWADE 202


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll2510   TCRTETB       66     7e-14    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 66.1 bits (161), Expect = 7e-14
Identities = 38/153 (24%), Positives = 69/153 (45%), Gaps = 1/153 (0%)

Query: 5 LFWLALGSFTISTEGFVISSLLPDIARDAGISIPLAGTLITAFALAYAIGTPILATLTGE 64
L WL + SF V++ LPDIA D + TAF L ++IGT + L+ +
Sbjct: 16 LIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ 75

Query: 65 WDRRRVILWTLVFFVIGNIAAALS-SSFEVLLIARVIMALSSGLFAATAQGTAVALVDDH 123
+R++L+ ++ G++ + S F +L++AR I + F A +
Sbjct: 76 LGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKE 135

Query: 124 HRARAIAVVVGGTTVAVAVGAPLGALVATIAGW 156
+R +A ++ + VG +G ++A W
Sbjct: 136 NRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHW 168


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll2524   SACTRNSFRASE  50     2e-10    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 49.9 bits (119), Expect = 2e-10
Identities = 16/57 (28%), Positives = 30/57 (52%), Gaps = 1/57 (1%)

Query: 98 AFVKDLAVHPEARGKGIGEALMWQAFATFRDRGAVHVDLKTNTVENAAAIRLYERLG 154
A ++D+AV + R KG+G AL+ +A ++ + L+T + N +A Y +
Sbjct: 90 ALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDI-NISACHFYAKHH 145


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll2525   PF06776       29     0.031    Invasion associated locus B
		>PF06776#Invasion associated locus B

Length = 214

Score = 28.7 bits (64), Expect = 0.031
Identities = 22/84 (26%), Positives = 27/84 (32%), Gaps = 14/84 (16%)

Query: 120 PETVSVLPQSSGIAPIQTAELTPATA---------PALDAAAPPAAATAAFAGSTPAAPA 170
P T +P I + AEL+P A A A A +F S A
Sbjct: 15 PVTNHAVPALKAI-QMGPAELSPMLASCRRLARRNGARLMLAGAMAIALSFGWSDRADAQ 73

Query: 171 VPAVMTANAWQ----APPPARAGQ 190
WQ PP A+A Q
Sbjct: 74 GAVRSVHGDWQIRCDTPPGAKAEQ 97


21  mll2726  mll2746  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mll2726   0       16       3.841033   poly(A) polymerase
mll2727   -1      18       4.010706   hypothetical protein
mll2728   -2      18       4.055946   hypothetical protein
mlr2730   -2      17       3.922498   regulatory protein
mlr2732   -2      17       3.854325   hypothetical protein
mlr2734   -2      18       3.467712   hypothetical protein
mlr2735   1       18       2.462384   hypothetical protein
mll2736   2       18       1.539973   ATP-dependent Clp protease adaptor
mll2737   2       17       1.513789   hypothetical protein
msl2738   3       22       1.340010   hypothetical protein
mlr2740   2       21       1.293241   two-component response regulator
mlr2741   2       20       1.437005   two-component sensor histidine kinase
mll2742   2       17       1.048092   ABC transporter ATP-binding protein
mll2744   1       14       0.561076   ABC transporter permease
mll2746   2       16       -0.492424  ABC transporter binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr2730   HTHFIS        34     0.001    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 33.7 bits (77), Expect = 0.001
Identities = 15/52 (28%), Positives = 24/52 (46%), Gaps = 1/52 (1%)

Query: 124 LMADEINRASPRTQSALLQAMQEYHVTIAGARYDLPAPFHVLATQN-PLEQE 174
L DEI Q+ LL+ +Q+ T G R + + ++A N L+Q
Sbjct: 235 LFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQS 286


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr2734   56KDTSANTIGN  31     0.029    Rickettsia 56kDa type-specific antigen protein sign...
		>56KDTSANTIGN#Rickettsia 56kDa type-specific antigen protein signature.

Length = 533

Score = 30.7 bits (69), Expect = 0.029
Identities = 12/37 (32%), Positives = 20/37 (54%)

Query: 761 PAAPMPSQAAIARIDAYMQQGGTVLFDTRDQFANGIG 797
P +P+ A+I +I + +Q+ G L + RD F I
Sbjct: 287 PDTGLPNSASIEQIQSKIQELGDTLEELRDSFDGYIN 323


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr2740   HTHFIS        100    3e-26    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 99.5 bits (248), Expect = 3e-26
Identities = 35/133 (26%), Positives = 61/133 (45%), Gaps = 1/133 (0%)

Query: 3 SDAHILIVDDDKGIRDLLQEFFQKRGLHTSVAADGTEMEAVLRRAQVDLIVLDVMLPGKS 62
+ A IL+ DDD IR +L + + G + ++ + + DL+V DV++P ++
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 63 GLELCRDLRAQYS-TPIIMLTAVTETTDRVVGLEMGADDYVPKPFDPRELLARIRAVLRR 121
+L ++ P+++++A + E GA DY+PKPFD EL+ I L
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 122 NGAAEPKRATAKQ 134
K Q
Sbjct: 122 PKRRPSKLEDDSQ 134


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr2741   BCTERIALGSPG  28     0.032    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 28.3 bits (63), Expect = 0.032
Identities = 21/94 (22%), Positives = 37/94 (39%), Gaps = 20/94 (21%)

Query: 1 MKRLLPQ---TLPAWVLLIVIAGLLISQVATLYIVSRDRAVANDVV----------DLYR 47
M+ Q TL +++IVI G+L S V + ++++A V D+Y+
Sbjct: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYK 60

Query: 48 LNDRAF-----SLVQLMSGAT--PEERKATAAGL 74
L++ + L L+ T P G
Sbjct: 61 LDNHHYPTTNQGLESLVEAPTLPPLAANYNKEGY 94


22  mll3238  mll3246  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mll3238   2       21       -3.186931  penicillin-binding protein
msl3240   3       32       -5.982181  hypothetical protein
mll3241   3       33       -5.893109  hypothetical protein
mll3242   2       32       -6.568866  cyclase
mll3243   2       28       -5.105060  hypothetical protein
mll3244   1       26       -4.149875  hypothetical protein
mll3246   -1      23       -3.566649  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll3244   OMPADOMAIN    54     1e-09    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 54.2 bits (130), Expect = 1e-09
Identities = 30/129 (23%), Positives = 49/129 (37%), Gaps = 27/129 (20%)

Query: 852 TDIFFYSGKVVPGPEQTPVLDRLADQIKEFAENARKSGVTARFMLTGHSDATGRETANAS 911
+D+ F K PE LD+L Q+ ++ G++D G + N
Sbjct: 219 SDVLFNFNKATLKPEGQAALDQLYSQLSNLDPK------DGSVVVLGYTDRIGSDAYNQG 272

Query: 912 ISAARAETVRALLNKRGVAPELLLVRGAGTFEPLVPENSQTGSST--------------- 956
+S RA++V L +G+ + + RG G N TG++
Sbjct: 273 LSERRAQSVVDYLISKGIPADKISARGMG------ESNPVTGNTCDNVKQRAALIDCLAP 326

Query: 957 NRRVSITVN 965
+RRV I V
Sbjct: 327 DRRVEIEVK 335


23  msr3514  mll3521  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
msr3514   3       12       0.584922   hypothetical protein
mll3515   3       13       1.529729   transcriptional regulator
mlr3516   2       13       1.000816   hypothetical protein
msl3517   4       14       1.712185   response regulator
msr3518   2       16       1.396810   hypothetical protein
mll3520   2       16       1.314032   hypoxanthine-guanine phosphoribosyltransferase
mll3521   3       15       1.717271   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
msl3517   HTHFIS        85     1e-22    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 84.9 bits (210), Expect = 1e-22
Identities = 34/118 (28%), Positives = 60/118 (50%), Gaps = 2/118 (1%)

Query: 2 AKLLIVEDDESVRTLAARALERAGHAIDIAADGAQGLALIRAAHGGYDLVVSDIRMPEMD 61
A +L+ +DD ++RT+ +AL RAG+ + I ++ A I A G DLVV+D+ MP+ +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDG--DLVVTDVVMPDEN 61

Query: 62 GIEMAIAAAALFPAMKIMLMTGYADQRERAEELNGIILDVVQKPFTLAEIRSRVERAL 119
++ P + +++M+ + D + KPF L E+ + RAL
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119


24  msr3579  mll3600  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
msr3579   2       9        0.113181   hypothetical protein
mll3580   2       9        0.178492   hypothetical protein
mll3581   2       7        0.468987   penicillin-binding protein
msr3582   1       10       0.422304   hypothetical protein
mlr3583   0       10       0.130469   hippurate hydrolase
mlr3584   0       14       -0.245512  *cytochrome B561
mll3585   1       13       -0.567654  hypothetical protein
mll3586   1       14       -0.661997  oxidoreductase
mlr3587   2       15       -1.864813  alpha-glucoside ABC transporter ATP-binding
mll3588   2       14       -2.314514  sugar transporter permease
mll3589   1       12       -1.787039  sugar transporter permease
mll3590   1       12       -1.421961  sugar transporter sugar binding protein
mll3591   0       13       -0.256954  alpha-L-arabinofuranosidase
mll3592   2       14       0.751243   hypothetical protein
mll3593   3       14       2.380785   hypothetical protein
mll3595   4       14       2.603269   D-tagatose 3-epimerase-related protein
mll3596   3       14       3.358165   ribose ABC transporter substrate-binding
mll3597   3       13       4.047136   ribose ABC transporter permease
mll3598   2       12       3.919622   ribose ABC transporter ATP-binding protein
mll3599   1       12       3.881776   hypothetical protein
mll3600   0       12       3.564964   transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
msr3579   PF01206       85     4e-26    SirA family protein
		>PF01206#SirA family protein

Length = 76

Score = 85.2 bits (211), Expect = 4e-26
Identities = 31/68 (45%), Positives = 41/68 (60%)

Query: 9 DLKGLNCPLPVLKAKKRLAGMQPGSRLWLETTDPLAVIDIPAFCSDAGHQLVETAAVSGG 68
D GLNCPLP+LKAKK LA M G L++ TDP +V D +F GH+L+E G
Sbjct: 9 DATGLNCPLPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHELLEQKEEDGT 68

Query: 69 HRFLVERG 76
+ F ++R
Sbjct: 69 YHFRLKRA 76


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll3590   MALTOSEBP     42     2e-06    Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 42.4 bits (99), Expect = 2e-06
Identities = 60/232 (25%), Positives = 80/232 (34%), Gaps = 58/232 (25%)

Query: 57 TLEWGTPFYAKVQTSAAVGEAPDVMTYHASRIPLAVSQDLLEEITADDMSKMGLSASDFA 116
T+E K AA G+ PD++ + R LL EIT D + L
Sbjct: 62 TVEHPDKLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEITPDKAFQDKLYP---- 117

Query: 117 QTTMGAVTVDGKQYAVPLDTHPIVLYYNRVLLKKAGVLGDDGRPVGMKNKEEFTATLQKL 176
T AV +GK A P+ + L YN+ LL P K EE A ++L
Sbjct: 118 -FTWDAVRYNGKLIAYPIAVEALSLIYNKDLL-----------PNPPKTWEEIPALDKEL 165

Query: 177 KDAGVE----------FPLGSVTADGNFMYRTIYSLVCQQGGELLTGNEFLAGDNGKKLA 226
K G F + ADG + ++ +NGK
Sbjct: 166 KAKGKSALMFNLQEPYFTWPLIAADGGYAFKY---------------------ENGKYDI 204

Query: 227 NALAVLQGWTKAGL-----------QSTYTDYPATVALFTSGKAAMMINGVW 267
+ V KAGL + TDY A F G+ AM ING W
Sbjct: 205 KDVGVDNAGAKAGLTFLVDLIKNKHMNADTDYSIAEAAFNKGETAMTINGPW 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll3593   MICOLLPTASE   30     0.003    Microbial collagenase metalloprotease (M9) signature.
		>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature.

Length = 1104

Score = 29.7 bits (66), Expect = 0.003
Identities = 9/36 (25%), Positives = 17/36 (47%), Gaps = 1/36 (2%)

Query: 23 WIVVEGHPTMQTAVQHTTEDGKVMSGTWRASPGTYH 58
W++ + V + DG +S T + +PG Y+
Sbjct: 1049 WLLYSAD-DLSNYVDYANADGNKLSNTCKLNPGKYY 1083


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll3597   RTXTOXINA     34     0.001    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 33.8 bits (77), Expect = 0.001
Identities = 18/50 (36%), Positives = 28/50 (56%)

Query: 75 TGGIDLSVGSILAASAMVAVLVSLVPDWGLLGVPAAILVGLGFGLVNGLL 124
TG ID S+ +I A V+ +S L+G P + LVG G+++G+L
Sbjct: 361 TGAIDASLTTISTVLASVSSGISAAATTSLVGAPVSALVGAVTGIISGIL 410


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll3600   ARGREPRESSOR  33     6e-04    Bacterial arginine repressor signature.
		>ARGREPRESSOR#Bacterial arginine repressor signature.

Length = 149

Score = 32.5 bits (74), Expect = 6e-04
Identities = 13/44 (29%), Positives = 20/44 (45%), Gaps = 5/44 (11%)

Query: 12 GRQRQIVELLRDRPFASVRELQERL-----GVSAATVRRDIDRI 50
R +I E++ + EL + L V+ ATV RDI +
Sbjct: 5 QRHIKIREIITANEIETQDELVDILKKDGYNVTQATVSRDIKEL 48


25  mlr3617  mll3631  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mlr3617   4       22       -1.246241  hypothetical protein
mlr3618   3       19       -0.339691  hydrolase
mll3619   3       16       0.194439   hypothetical protein
msr3620   2       12       0.243500   hypothetical protein
msr3621   0       11       0.341724   pseudo hydrolase
mll3623   -1      10       0.333580   hypothetical protein
mll3624   -1      13       0.883651   ABC-transporter ATP-binding protein system
mll3625   0       15       0.753328   ABC transporter permease
mll3626   1       12       0.282656   ABC transporter substrate-binding protein
mll3627   3       11       0.612935   dihydrolipoamide acetyltransferase homoserine
mll3628   3       13       0.401497   TPP-dependent acetoin dehydrogenase subunit
mll3629   1       14       0.020381   TPP-dependent acetoin dehydrogenase subunit
mll3630   2       8        0.438874   hypothetical protein
mll3631   2       8        0.071534   3',5'-cyclic-nucleotide phosphodiesterase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll3619   PF03544       41     3e-06    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 41.1 bits (96), Expect = 3e-06
Identities = 35/145 (24%), Positives = 50/145 (34%), Gaps = 14/145 (9%)

Query: 11 RNLMWGIPASLILH-VLVATLLVYGLPVAPQQPREEQPVNVAIVPPPDQPKP-------- 61
R W S+ +H +VA LL + + P QP++V +V P D P
Sbjct: 12 RRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELPAPAQPISVTMVAPADLEPPQAVQPPPE 71

Query: 62 ---KPVPPAPKPPEPKVEKPPEQKVEKQPPSEKQPKAPPVEVLKPVFQYGDKDTGPRKSL 118
+P P PEP E P + K P K VE K ++ P
Sbjct: 72 PVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKR--DVKPVESRPASPF 129

Query: 119 DGASAQDSSPSPAKDDDSKPPAVPK 143
+ + + S A SKP
Sbjct: 130 ENTAPARPTSSTATAATSKPVTSVA 154


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll3623   IGASERPTASE   54     8e-10    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 53.9 bits (129), Expect = 8e-10
Identities = 51/269 (18%), Positives = 84/269 (31%), Gaps = 50/269 (18%)

Query: 35 PRSEQQPDQQEQPVNVAIVPPPEKPKPKPAPKPPEPTPEKKAEKPPEQKPPPEPPKPPDD 94
P + Q N I E P P PAP P T E AE ++
Sbjct: 1000 PNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQE------------ 1047

Query: 95 HVLKRVFQYGQKDTGPEKSLDGNSAKPNTPSPAKDEAVKPPITPTPVPTRPAPVATPQQK 154
K+++ N + E K + T+ VA +
Sbjct: 1048 ----------------SKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSE 1091

Query: 155 AEPSKPDEK--PATIAPDGKP-AQGEEKQEAA--APDAKPAQNE------------EKQA 197
+ ++ E AT+ + K + E+ QE P Q + E
Sbjct: 1092 TKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDP 1151

Query: 198 TEDADKQQADKRELAP--QPAEKQAVVTPKPLAAETGDK--PAPPPSAEKAKPKPAK-TM 252
T + + Q+ A QPA++ + +P+ T + + E P + T+
Sbjct: 1152 TVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTV 1211

Query: 253 NFKSAKAFKAPSGNARRPSPTNDAAAAGS 281
N +S+ K + R P N A S
Sbjct: 1212 NSESSNKPKNRHRRSVRSVPHNVEPATTS 1240



Score = 43.5 bits (102), Expect = 2e-06
Identities = 42/261 (16%), Positives = 82/261 (31%), Gaps = 55/261 (21%)

Query: 42 DQQEQPVNVAIVPPPEKPKPKPAPKPPEPTPEKKAEKPPEQKPPPEPPKPPDDHVLKRVF 101
+++ Q V+ + P A P P+ ++ + E PP P P +
Sbjct: 986 EKRNQTVDTTNITTPNN---IQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAE 1042

Query: 102 QYGQKDTGPEKSLDGNSAKPNTPSPAKDEAVKPPITPTPVPTRPAPVATPQQKAEPSKPD 161
Q+ K+++ N + ++ A+ +K +
Sbjct: 1043 NSKQE----SKTVEKNEQDATETTAQN-----------------------REVAKEAKSN 1075

Query: 162 EKPATIAPDGKPAQGEEKQEAAAPDAKPAQNEEKQATEDADKQQADKRELAPQPAEKQAV 221
K T + + E K+ + + +E E +K + + + P + +
Sbjct: 1076 VKANTQTNEVAQSGSETKE------TQTTETKETATVEKEEKAKVETEKTQEVP-KVTSQ 1128

Query: 222 VTPKPLAAETGDKPAPPPSAEKAKPKPAKTMNFKSAKAFKAPSGNARRPSPTNDAAAAGS 281
V+PK +ET A P + T+N K + S TN A
Sbjct: 1129 VSPKQEQSETVQPQAEP------ARENDPTVNIKEPQ------------SQTNTTADTEQ 1170

Query: 282 PIYSGLPGVRKLYSQGATGNA 302
P V + ++ T N
Sbjct: 1171 PAKETSSNVEQPVTESTTVNT 1191


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll3624   PERTACTIN     29     0.030    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 29.3 bits (65), Expect = 0.030
Identities = 45/168 (26%), Positives = 60/168 (35%), Gaps = 18/168 (10%)

Query: 208 IAVMDHGRLAQLATPRELYHEPANEMVASFISQGILLPADVLTGEDGGHCKVRVLGTELV 267
+A MD + PA V G +P DG + V V + +
Sbjct: 243 VAAMDGAIVHLQRATIRRGDAPAGGAVPGGAVPGGAVPGGFGPLLDGWY-GVDVSDSTVD 301

Query: 268 VRCRAGEPPRAGAKICC-RSADLDVSTDGPGFDGLVKRVIYQGGGARIEFAPAAGPDLTL 326
+ E P+ GA I R A + VS G VI GGGAR PA+ +TL
Sbjct: 302 LAQSIVEAPQLGAAIRAGRGARVTVS--GGSLSAPHGNVIETGGGARRFPPPASPLSITL 359

Query: 327 --------------HFEQPDPLTLESGAQARLRIKSGWLIPAAVAVAG 360
+P LTL GAQ + I + L P A +G
Sbjct: 360 QAGARAQGRALLYRVLPEPVKLTLAGGAQGQGDIVATELPPIPGASSG 407


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll3627   RTXTOXIND     31     0.011    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 31.0 bits (70), Expect = 0.011
Identities = 11/43 (25%), Positives = 19/43 (44%)

Query: 14 MATGQISRWFAEEGARVKKGDVLFEIETDKAAMEIDAPASGVL 56
+ + +EG V+KGDVL ++ A + S +L
Sbjct: 102 IENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLL 144


26  mll3710  mll3731  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mll3710   3       14       1.433398   ABC-transport system ATP binding protein
mll3711   2       16       1.272568   xylose transport permease xylH
mll3713   2       18       1.062549   xylose binding protein transport system, xylF
mlr3714   3       16       0.945032   xyl repressor
mll3716   2       19       -0.201168  phosphate regulatory protein, PhoB
mll3718   2       15       0.071442   phosphate uptake regulatory protein PhoU
mll3719   0       14       0.086302   phosphate ABC transporter ATP-binding protein
mll3720   0       12       0.117259   phosphate ABC transporter permease
mll3722   -1      10       0.023031   phosphate ABC transporter permease
mll3723   -3      9        0.133615   phosphate-binding protein
mll3725   -3      9        0.186892   hybrid sensory histidine kinase
mlr3726   3       17       -0.396621  hypothetical protein
msl3727   0       15       -1.106091  hypothetical protein
mll3729   1       16       -1.299397  hypothetical protein
mll3731   2       15       -0.943966  glutathione S-transferase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mll3716  HTHFIS  76  3e-18  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 76.0 bits (187), Expect = 3e-18
Identities = 32/142 (22%), Positives = 60/142 (42%), Gaps = 5/142 (3%)

Query: 1 MIAPRIMVVEDEEPLGVLLRYNLESEGYQVEVVTRGDEAEIRLQENVPDLLVLDWMVPAV 60
M I+V +D+ + +L L GY V + + + DL+V D ++P
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 SGIELCRRLRMRPETERLPIIMLTARGEESDRVRGLSTGADDYLVKPFSTPEFMA---RV 117
+ +L R++ LP+++++A+ ++ GA DYL KPF E + R
Sbjct: 61 NAFDLLPRIKKA--RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 118 KALLRRAKPEVLSSVLKVGDIV 139
A +R ++ +V
Sbjct: 119 LAEPKRRPSKLEDDSQDGMPLV 140


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mll3725  HTHFIS  66  7e-13  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 65.6 bits (160), Expect = 7e-13
Identities = 30/127 (23%), Positives = 52/127 (40%), Gaps = 4/127 (3%)

Query: 1263 ILVAEDNEVNQMVFTQILGETGYGFEIVGNGRKALDAFGKLNPCMILMDVSMPEMSGLEA 1322
ILVA+D+ + V Q L GY I N + +++ DV MP+ + +
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 1323 TAAIRQLEQQTGTHVPIVGVTAHALKGDRERCLEAGMDDYLPKPISPRALLEKVERWLGA 1382
I++ +P++ ++A + E G DYLPKP L+ + R L
Sbjct: 66 LPRIKKA----RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 1383 GRQVQRN 1389
++
Sbjct: 122 PKRRPSK 128



Score = 58.3 bits (141), Expect = 1e-10
Identities = 51/257 (19%), Positives = 91/257 (35%), Gaps = 56/257 (21%)

Query: 1081 TGARVLIVDDNAVNRAILTEQMTSWTFDSCAAESGAEGLKVLIAAAAYGVPVDCVVLDYQ 1140
TGA +L+ DD+A R +L + ++ +D + A + + A D VV D
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAG-----DGDLVVTDVV 56

Query: 1141 MPEMSGAEMARIVRNTGGLADTPIIMLTSVDQSLANTSYRDLGIDAQLIKPARSSVLLET 1200
MP+ + ++ ++ D P++++++ + + + G L KP L
Sbjct: 57 MPDENAFDLLPRIKKAR--PDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFD----LTE 110

Query: 1201 LVATIQRHRHATDSHAVQPLAAERPDVPQPPPLALSEQRAQLQPPPVRPRL--------P 1252
L+ I R AL+E + + +
Sbjct: 111 LIGIIGR--------------------------ALAEPKRRPSKLEDDSQDGMPLVGRSA 144

Query: 1253 ATGGDGHRLDILVAEDNEVNQMVFTQILGETGYGFEIVG-----NGRKALDAFGKLNPCM 1307
A L L+ D + I GE+G G E+V G++ F +N
Sbjct: 145 AMQEIYRVLARLMQTDLTL------MITGESGTGKELVARALHDYGKRRNGPFVAINMAA 198

Query: 1308 ILMDVSMPEMSGLEATA 1324
I D+ E+ G E A
Sbjct: 199 IPRDLIESELFGHEKGA 215


27  mlr3797  mlr3808  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
mlr3797  2  15  0.974372  glutamine ABC transporter integral membrane
mlr3799  2  14  0.963909  glutamine ABC transporter integral membrane
mlr3801  2  14  1.361378  glutamine ABC transporter ATP-binding protein
mlr3802  2  12  1.866963  transcriptional regulator
mlr3804  2  11  2.432028  selenocysteine synthase
mll3805  2  12  2.094609  hypothetical protein
msr3806  2  12  2.954407  hypothetical protein
mlr3807  2  13  3.139103  RNA polymerase sigma factor
mlr3808  2  13  2.235555  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mlr3801  PF05272  29  0.022  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.9 bits (64), Expect = 0.022
Identities = 13/54 (24%), Positives = 20/54 (37%)

Query: 2 SDATNASPALLDVRDVSKAFGTVEVLRSVSLQVKRGEVVTVIGPSGSGKTTLLR 55
+ L ++ V K V R + K V + G G GK+TL+
Sbjct: 561 TPDDYKPRRLRYLQLVGKYILMGHVARVMEPGCKFDYSVVLEGTGGIGKSTLIN 614


28  mll3890  mll3904  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
mll3890  2  12  -0.033362  enterochelin uptake protein TolR
mll3891  2  14  0.928952  TolQ protein, inner membrane protein, tolerance
msr8645  2  15  1.159361  hypothetical protein
mll3892  3  13  0.913943  hypothetical protein
mlr3893  3  12  0.585898  glycosyl hydrolase, cellulase, family
mll3894  3  14  0.983294  hypothetical protein
mll3895  2  13  0.719299  Holliday junction DNA helicase RuvB
mll3898  1  12  -0.642735  Holliday junction DNA helicase RuvA
mll3899  2  11  -1.229462  hypothetical protein
msl3900  3  11  0.577357  hypothetical protein
mll3901  1  12  1.440599  Holliday junction resolvase
mll3903  0  15  1.326917  hypothetical protein
mll3904  2  16  1.009624  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mll3895  PHPHTRNFRASE  36  3e-04  Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 35.5 bits (82), Expect = 3e-04
Identities = 21/131 (16%), Positives = 52/131 (39%), Gaps = 26/131 (19%)

Query: 65 GKTTLAQIMARELGVNFRSTSGPVIAKAGDLAALLTNLEEGDVLFIDEIHRLNPAVEEIL 124
G+T+ + IM+R L + P + + + ++ GD++ +D +E I+
Sbjct: 186 GRTSHSAIMSRSLEI-------PAVVGTKE---VTEKIQHGDMVIVD-------GIEGIV 228

Query: 125 YPAMEDFQLDLIIGEGPAARSVKIDLARFTLVAATTRLGLLTNPLRDRFGIPVRLNFYTV 184
+ ++ + A K + A+ +TT+ G + + N T
Sbjct: 229 IVNPTEEEVKAYEEKRAAFEKQKQEWAKLVGEPSTTKDGAH---------VELAANIGTP 279

Query: 185 EELEQIVRRGA 195
++++ ++ G
Sbjct: 280 KDVDGVLANGG 290


29  mll4004  msl4042  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
mll4004  3  15  2.519724  rRNA large subunit methyltransferase
mll4005  3  14  2.815992  hypothetical protein
mlr4006  3  15  2.936712  hypothetical protein
mll4007  2  16  2.985746  hypothetical protein
mll4008  2  16  2.740604  nicotinic acid mononucleotide
mll4009  1  15  2.423380  gamma-glutamyl phosphate reductase
mll4011  -1  15  1.706694  gamma-glutamyl kinase
mll4013  -1  16  0.385996  GTPase ObgE
mlr4014  -2  13  -0.960054  hypothetical protein
mll4016  0  13  -0.879905  acetyltransferase
msl4017  1  17  -2.289758  50S ribosomal protein L27
mll4019  3  25  -4.612823  50S ribosomal protein L21
mlr4020  1  23  -5.104971  *hypothetical protein
mll4023  2  27  -5.719744  transposase
mlr4024  3  26  -5.662220  IS3 family transposase orfB
mlr4025  3  27  -6.073723  hypothetical protein
mlr4028  3  26  -5.831490  hypothetical protein
mll4029  4  28  -5.808705  porin
mll4030  5  39  -6.959921  hypothetical protein
msl4031  4  36  -6.382384  two-component response regulator
mll4032  4  37  -6.293811  hypothetical protein
mlr4033  6  39  -6.390601  DNA invertase RlgA
msl8646  7  36  -3.331699  hypothetical protein
mll4037  7  36  -2.253347  hypothetical protein
msl4038  7  36  -1.410750  hypothetical protein
msl4039  7  36  -1.520712  hypothetical protein
mll4040  6  35  -1.674493  hypothetical protein
mll4041  6  33  -2.165547  hypothetical protein
msl4042  3  32  -4.756669  excisionase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mll4008  LPSBIOSNTHSS  29  0.006  Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature.

Length = 166

Score = 29.4 bits (66), Expect = 0.006
Identities = 24/73 (32%), Positives = 37/73 (50%), Gaps = 8/73 (10%)

Query: 11 GLFGGSFNPPHAGHALVAEIALRRLAL-DQLWWMVTPGNPLKSTRELAPLAERLQLSEQ- 68
++ GSF+P GH +I R L DQ++ V NP K + + + ERL+ +
Sbjct: 3 AIYPGSFDPITFGH---LDIIERGCRLFDQVYVAVL-RNPNK--QPMFSVQERLEQIAKA 56

Query: 69 IARNPKIKVTAFE 81
IA P +V +FE
Sbjct: 57 IAHLPNAQVDSFE 69


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mll4011  CARBMTKINASE  45  3e-07  Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 44.8 bits (106), Expect = 3e-07
Identities = 26/128 (20%), Positives = 45/128 (35%), Gaps = 11/128 (8%)

Query: 133 VPVINENDTVATSEIRYGDNDRLAARVATMMGADLLVLLSDIDGLYTAPPARDPKAKFIP 192
VPVI E+ + E D D ++A + AD+ ++L+D++G +
Sbjct: 197 VPVILEDGEIKGVEAVI-DKDLAGEKLAEEVNADIFMILTDVNGAALYYGTEKEQW---- 251

Query: 193 VVDRITPDIEAMAGAAASELSRGGMRTKLDAG-KIATAAGTAMIITSGTRLSPLMAIERG 251
+ + + E G M K+ A + G II L + G
Sbjct: 252 -LREVKVE-ELRKYYEEGHFKAGSMGPKVLAAIRFIEWGGERAIIAH---LEKAVEALEG 306

Query: 252 ERATFFRP 259
+ T P
Sbjct: 307 KTGTQVLP 314


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mll4019  FLGHOOKFLIK  31  0.005  Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 30.6 bits (68), Expect = 0.005
Identities = 18/53 (33%), Positives = 25/53 (47%)

Query: 106 AKPAKKAAVKAEAKAEVAAEAAPKEAKAKKEAAPKADVTAETAAAPLFKAPKG 158
A+P +A++KAEV + +P A A P T AAP+ AP G
Sbjct: 184 AQPLTPLVAEAQSKAEVISTPSPVTAAASPLITPHQTQPLPTVAAPVLSAPLG 236


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mlr4028  SYCDCHAPRONE  32  0.003  Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 32.2 bits (73), Expect = 0.003
Identities = 14/74 (18%), Positives = 28/74 (37%), Gaps = 9/74 (12%)

Query: 75 NHPLAYYILGTLGIGYDN----EKALRYFARAVAEEPQNPYYHLSLGETYLKVSEFTPAI 130
+H + + LG LG + A+ ++ + + P + E L+ E A
Sbjct: 66 DHYDSRFFLG-LGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEA- 123

Query: 131 RHIQQALDLKPDLV 144
+ L L +L+
Sbjct: 124 ---ESGLFLAQELI 134



Score = 29.9 bits (67), Expect = 0.015
Identities = 19/83 (22%), Positives = 30/83 (36%)

Query: 133 IQQALDLKPDLVEALCALGDAYNEFDKGELALPLFEKALKIDRYHPLARLGLPYALASLG 192
I ++ D +E L +L + K E A +F+ +D Y LGL ++G
Sbjct: 25 IAMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMG 84

Query: 193 RMDEAAVLLKEAIDRRIALPTAY 215
+ D A I P
Sbjct: 85 QYDLAIHSYSYGAIMDIKEPRFP 107



Score = 28.7 bits (64), Expect = 0.037
Identities = 12/77 (15%), Positives = 22/77 (28%), Gaps = 3/77 (3%)

Query: 93 EKALRYFARAVAEEPQNPYYHLSLGETYLKVSEFTPAIRHIQQALDLKPDLVEALCALGD 152
E A + F + + + L LG + ++ AI + +
Sbjct: 53 EDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAE 112

Query: 153 AY---NEFDKGELALPL 166
E + E L L
Sbjct: 113 CLLQKGELAEAESGLFL 129


30  mlr4215  mlr4224  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
mlr4215  2  12  3.346672  hypothetical protein
mll4216  2  11  4.128454  glutamine amidotransferase
mll4217  3  11  4.384894  hypothetical protein
mll4218  3  10  4.794163  hypothetical protein
mll4219  4  11  4.282532  phage tail protein
mll4221  4  13  3.818871  uroporphyrinogen-III synthase
mll4223  2  13  3.646656  porphobilinogen deaminase
mlr4224  2  12  2.916151  DNA-binding/iron metalloprotein/AP endonuclease
31  mll4356  mlr4379  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
mll4356  2  16  1.937525  chloride channel protein
mll4358  1  13  0.643322  hypothetical protein
mlr4359  1  15  1.067986  hypothetical protein
mlr4361  0  12  0.715353  hypothetical protein
mlr4362  -2  11  0.116660  hypothetical protein
mll4363  -2  10  0.059850  hypothetical protein
mll4364  -2  9  -0.048165  hypothetical protein
mlr4366  3  12  -0.508719  argininosuccinate synthase
mlr4368  7  15  -0.100437  hypothetical protein
mlr4369  4  14  0.667605  hypothetical protein
mlr4370  0  13  1.693261  hypothetical protein
mll4372  1  14  2.198502  transglycosylase
msr4373  0  17  2.669706  hypothetical protein
mlr4376  -1  20  2.819515  hypothetical protein
mlr4377  -1  16  3.194985  hypothetical protein
mlr4378  1  17  2.898500  hypothetical protein
mlr4379  1  18  3.004780  signal recognition particle protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mlr4379  PF07132  30  0.032  Harpin protein (HrpN)
		>PF07132#Harpin protein (HrpN)

Length = 356

Score = 29.7 bits (66), Expect = 0.032
Identities = 33/132 (25%), Positives = 43/132 (32%), Gaps = 24/132 (18%)

Query: 422 GMADMMKAMGGKGKGGGLMRGMMGGLASKMG------LGGMMPGGMGGMGGMPDLSKMDP 475
G+ + +GG GGGL G+ L S +G LGG + GM M + +
Sbjct: 75 GLGSSLGGLGGGLLGGGLGGGLGSSLGSGLGSALGGGLGGALGAGMNAMNPSAMMGSLLF 134

Query: 476 KQLEAL----QKQAQAAGLGGMKGLPG--------------GLPGGGLPGLPGGMKLPGL 517
LE L Q Q G + + G GL G L
Sbjct: 135 SALEDLLGGGMSQQQGGLFGNKQPSSPEISAYTQGVNDALSAILGNGLSQTKGQTSPLQL 194

Query: 518 PGLGGGGLPGLG 529
G GL G G
Sbjct: 195 GNNGLQGLSGAG 206


32  mlr4438  mll4451  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
mlr4438  2  22  -3.606362  hypothetical protein
mlr4439  6  12  0.136243  hypothetical protein
mll4440  6  12  0.167838  hypothetical protein
mll4441  6  11  0.549588  microcystin dependent protein MdpB
mll4442  4  13  0.814927  microcystin dependent protein MdpB
mll4443  4  13  1.361771  microcystin dependent protein MdpB
mll4444  4  15  1.630408  hypothetical protein
mlr4445  -2  14  2.794756  hypothetical protein
mlr4446  1  15  3.807730  proline dipeptidase
mlr4448  0  17  3.309075  hypothetical protein
mll4451  -1  14  3.051307  dihydroorotase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mlr4438  PF03544  27  0.016  Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 27.2 bits (60), Expect = 0.016
Identities = 16/65 (24%), Positives = 27/65 (41%), Gaps = 1/65 (1%)

Query: 48 VFTVAIGGR-AVSLELTAVHQIASSPRPGGGFTLLFKGPRDISLPQAIYHLAGDAITDDI 106
+ +V I G L T+VHQ+ P P ++ P D+ PQA+ + +
Sbjct: 19 LLSVCIHGAVVAGLLYTSVHQVIELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEP 78

Query: 107 FIVPV 111
P+
Sbjct: 79 EPEPI 83


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mll4440  SACTRNSFRASE  42  1e-07  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 42.2 bits (99), Expect = 1e-07
Identities = 32/122 (26%), Positives = 45/122 (36%), Gaps = 31/122 (25%)

Query: 44 AHYIKHYPNADWLVIMRDGDD------------IGRLYIER-WPTQHRIIDIALLPTYRG 90
Y K Y + D V + + IGR+ I W I DIA+ YR
Sbjct: 44 KPYFKQYEDDDMDVSYVEEEGKAAFLYYLENNCIGRIKIRSNWNGYALIEDIAVAKDYRK 103

Query: 91 RGLGAALLGDLIDEA--------WLAGKSVSIHVEKNNPARQLYARLGFAVAEDKGVYDL 142
+G+G ALL I+ A L + + N A YA+ F + G D
Sbjct: 104 KGVGTALLHKAIEWAKENHFCGLMLETQDI------NISACHFYAKHHFII----GAVDT 153

Query: 143 MV 144
M+
Sbjct: 154 ML 155


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mll4444  cloacin  38  0.001  Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 37.8 bits (87), Expect = 0.001
Identities = 31/106 (29%), Positives = 45/106 (42%), Gaps = 1/106 (0%)

Query: 1254 NNTGTIGIGGTITNSGTGNGVVVSGGSAAITVSADISSSATAPGTAVKVDGITGGSVTFS 1313
+NTG G I N G V G S S++ + G+ + G +G
Sbjct: 9 HNTGAHSTSGNI-NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 1314 GLITSTGTGTGVSVSNTAAGSGVGFGAVTVSGAAGNGIGISGNAGS 1359
+ G+GTG ++S AA GF A++ GA G + IS A S
Sbjct: 68 NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALS 113


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mlr4445  PF06776  126  1e-38  Invasion associated locus B
		>PF06776#Invasion associated locus B

Length = 214

Score = 126 bits (317), Expect = 1e-38
Identities = 31/132 (23%), Positives = 53/132 (40%)

Query: 63 PWAVNCSSGSTANELQCQVSQNLTEAKTGQRVLTVTVRRDNANGSFAMLLALPHGLFLPS 122
W + C + A QC + Q++ LTV + + S M + P G+ LPS
Sbjct: 82 DWQIRCDTPPGAKAEQCALIQSVVAEDRSNAGLTVIILKTADQKSKLMRVVAPLGVLLPS 141

Query: 123 GVSYQIDSGKKVTVAIQTSDQNGAYAAVPVAPELAKAMKSGTTLNIGMESVTRKPVTIPV 182
G+ ++D+ NG A V + +L +++ T + + + P+
Sbjct: 142 GLGLKLDNVDVGRAGFVRCLPNGCVAEVVMDDKLLGQLRTAKTATFIIFETPEEGIGFPL 201

Query: 183 SLKGFGAAVAKL 194
SL G G KL
Sbjct: 202 SLNGIGEGYDKL 213


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mll4451  UREASE  32  0.004  Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 32.0 bits (73), Expect = 0.004
Identities = 23/104 (22%), Positives = 39/104 (37%), Gaps = 18/104 (17%)

Query: 4 DLLLKGGRLIDPASGIDAPRDVAIANGRVAAI----------DADIPADRAEQIVDATGC 53
D ++ ++D + A D+ + +GR+AAI I +++ G
Sbjct: 69 DTVITNALILDHWGIVKA--DIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGK 126

Query: 54 IVAPGLVDLHSHVYWGGTSLGVDADRLAAKSGTTTFIDAGSAGA 97
IV G +D H H + A SG T + G+ A
Sbjct: 127 IVTAGGMDSHIHF------ICPQQIEEALMSGLTCMLGGGTGPA 164


33  mlr4491  mlr4529  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
mlr4491  -1  12  3.306170  Maf-like protein
mlr4492  -2  14  2.803519  shikimate 5-dehydrogenase
mlr4493  0  16  2.774736  dephospho-CoA kinase
mlr4495  0  16  2.491243  DNA polymerase III subunit epsilon
mll4497  -1  16  2.155706  lipase
mll4498  0  16  1.941223  hypothetical protein
mll4500  1  18  1.515678  proline dipeptidase
mll4501  1  15  2.169653  transcriptional regulator
mll4503  2  16  1.910286  cystathionine gamma-synthase
mll4505  1  16  2.206392  cystathionine beta-synthase
mll4506  3  18  3.194985  hypothetical protein
mlr4508  1  17  2.635188  hypothetical protein
mlr4509  1  16  2.717865  hypothetical protein
mll4511  1  16  3.089433  cinnamoyl ester hydrolase
mlr4512  0  15  3.347540  hypothetical protein
mll4514  0  16  3.452219  hypothetical protein
mll4516  1  16  2.409516  hypothetical protein
mlr4517  2  19  2.223682  taurine ABC transporter substrate-binding
mlr4518  2  20  2.005973  taurine ABC transporter ATP-binding protein
mlr4519  2  22  0.944055  taurine transport system permease
msr4520  2  14  1.512960  hypothetical protein
mll4521  1  13  1.115237  permease, ABC-2-type transport system
mll4523  1  12  1.258732  ABC transporter ATP-binding protein
mlr4524  0  12  1.140083  quinol oxidase subunit I
mlr4525  0  11  1.327571  quinol oxidase subunit II
mll4526  0  10  2.212859  transcriptional regulatory
mlr4527  1  15  0.720612  hypothetical protein
mlr4529  4  14  0.104477  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mll4498  ISCHRISMTASE  55  4e-11  Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 55.4 bits (133), Expect = 4e-11
Identities = 34/154 (22%), Positives = 60/154 (38%), Gaps = 5/154 (3%)

Query: 69 ILANVQRLQEAARANHVPLQHWAYIVDLDKQDRPFHPLGADGKSAFSDKSDPLTEICHEV 128
+ AN+++L+ +P+ + A + DR L D + +I E+
Sbjct: 56 LSANIRKLKNQCVQLGIPVVYTAQPGSQNPDDRA---LLTDFWGPGLNSGPYEEKIITEL 112

Query: 129 APARDEALLVKAEASAFRSGPAADQLKAAGIEWLVVAGVWTEACIDATVKDAVARGFRVL 188
AP D+ +L K SAF+ + ++ G + L++ G++ T +A +
Sbjct: 113 APEDDDLVLTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAF 172

Query: 189 LVKDACGSGSAAMHQTGILNLANRLYGGAVTDTD 222
V DA S HQ + A R TD
Sbjct: 173 FVGDAVADFSLEKHQMALEYAAGR--CAFTVMTD 204


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mlr4509  ISCHRISMTASE  50  2e-09  Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 49.6 bits (118), Expect = 2e-09
Identities = 43/187 (22%), Positives = 67/187 (35%), Gaps = 17/187 (9%)

Query: 9 EIPTSLVEWC-DPHRMALVVYDMQIGICRQVAGAA----DIVERTGIVLEAARSAGMRLA 63
++P + V W DP+R L+++DMQ A ++ + G+ +
Sbjct: 16 DMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANIRKLKNQCVQLGIPVV 75

Query: 64 FTRHLSLPRKWMGATQLRTAMAWQRRDSPDAVEPWFLRDADATRIIPELAPRADEAVFDK 123
+T Q + R D P +II ELAP D+ V K
Sbjct: 76 YT------------AQPGSQNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLVLTK 123

Query: 124 LTMSAFDSTALSFALRDCGVRAIALAGIAMEIGIEPTVRQATDNGFTAVVIEDACGFGNR 183
SAF T L +R G + + GI IG T +A A + DA +
Sbjct: 124 WRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVADFSL 183

Query: 184 EARDRSM 190
E ++
Sbjct: 184 EKHQMAL 190


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mll4521  ABC2TRNSPORT  70  2e-16  ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 70.3 bits (172), Expect = 2e-16
Identities = 51/240 (21%), Positives = 107/240 (44%), Gaps = 1/240 (0%)

Query: 8 AIYRVEMARAFRTVLQSIISPVISTSLYFVVFGSAIGSRITEIDGISYGAFIVPGLIMLS 67
A++R + L S++ + +Y G+ +G + + G+SY AF+ G++ S
Sbjct: 18 AVWRRNYIAWKKAALASLLGHLAEPLIYLFGLGAGLGVMVGRVGGVSYTAFLAAGMVATS 77

Query: 68 LLTQSISNASFAIYFPKFVGSIYE-LLSAPVSYLEIVIAYVGGAATKSIILGLIILATAS 126
+T + +A + +E +L + +IV+ + AATK+ + G I A+
Sbjct: 78 AMTAATFETIYAAFGRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAGIGVVAA 137

Query: 127 LFVPLRIEHPFWMIAFLVLTAVTFSLFGFIIGIWARSFEQLQLVPLLIVTPLTFLGGSFY 186
+ + + + LT + F+ G ++ A S++ L++TP+ FL G+ +
Sbjct: 138 ALGYTQWLSLLYALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVF 197

Query: 187 SIHMLPGIWKTITLFNPVVYLISGFRWSFYGKADVSVGISLGMTLVFLAVCIAIVAWIFR 246
+ LP +++T F P+ + I R G V V +G +++ + + + R
Sbjct: 198 PVDQLPIVFQTAARFLPLSHSIDLIRPIMLGHPVVDVCQHVGALCIYIVIPFFLSTALLR 257


34  mll4541  msr8655  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
mll4541  3  20  3.277425  preprotein translocase subunit SecB
mll4543  1  19  3.933101  FxsA protein
mlr4544  -1  17  3.737222  hypothetical protein
mlr4546  0  17  4.254237  outer membrane lipoprotein GNA33, membrane-bound
mlr4547  1  17  1.944203  hypothetical protein
mlr4548  1  16  1.251638  transcriptional regulator
msl4549  3  16  0.979795  hypothetical protein
mll4550  0  15  2.015044  hypothetical protein
msl4551  0  15  2.519309  hypothetical protein
mll4552  -1  18  2.821932  intracellular PHB depolymerase
mll4553  0  19  3.738335  rare lipoprotein A
msr4554  1  19  2.684989  hypothetical protein
mll4555  2  19  2.561815  aliphatic sulfonate transport ATP-binding
mll4557  1  17  2.416701  aliphatic sulfonate transport membrane
mll4558  2  17  2.177173  sulfonate monooxygenase
mll4559  2  16  1.369714  aliphatic sulfonate binding protein
mll4560  3  14  1.405647  hypothetical protein
msr8655  3  13  1.629388  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mll4541  SECBCHAPRONE  144  7e-47  Bacterial protein-transport SecB chaperone protein ...
		>SECBCHAPRONE#Bacterial protein-transport SecB chaperone protein signature.

Length = 170

Score = 144 bits (365), Expect = 7e-47
Identities = 48/162 (29%), Positives = 86/162 (53%), Gaps = 5/162 (3%)

Query: 4 NDDAPVGAANGNGNTGAQPSLNVLAQYVKDLSFESPGAPNSLRGRDKAPGIAINVNVNAN 63
+++ V AA+ QP L + YVKD+SFE+P P+ + +D P ++ +++ A
Sbjct: 2 SEENQVNAADTQATQ--QPVLQIQRIYVKDVSFEAPNLPHIFQ-QDWEPKLSFDLSTEAK 58

Query: 64 PLSDKQFDVNLTLNAKASFDQE--VLFNVELVYGGVFAISGFPQEHMLPILFIECPRLLF 121
+ D ++V L ++ + + + V F E+ GVF ISG + M L +CP +LF
Sbjct: 59 QVGDDLYEVCLNISVETTMESSGDVAFICEVKQAGVFTISGLEEMQMAHCLTSQCPNMLF 118

Query: 122 PFARQIIAEATRNGGFPPLMLDPIDFAQMFQQKIAEDQAASK 163
P+AR++++ G FP L L P++F +F + + A +
Sbjct: 119 PYARELVSSLVNRGTFPALNLSPVNFDALFMDYLQRQEQAEQ 160


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mlr4546  ENTEROTOXINA  29  0.030  Heat-labile enterotoxin A chain signature.
		>ENTEROTOXINA#Heat-labile enterotoxin A chain signature.

Length = 258

Score = 28.8 bits (64), Expect = 0.030
Identities = 18/54 (33%), Positives = 28/54 (51%), Gaps = 7/54 (12%)

Query: 217 LSELGEIPLAQVTMQSIRAWFKAHPQRIDEILWQNRSYI--FFREADVEDAALG 268
+S LG IP +Q I W++ + IDE L +NR Y ++R ++ A G
Sbjct: 131 VSALGGIPYSQ-----IYGWYRVNFGVIDERLHRNREYRDRYYRNLNIAPAEDG 179


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mll4555  PF05272  29  0.025  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.9 bits (64), Expect = 0.025
Identities = 12/32 (37%), Positives = 17/32 (53%)

Query: 63 GKSGCGKSTLLRLLAGLDRPTSGSLTLGAEEE 94
G G GKSTL+ L GLD + +G ++
Sbjct: 603 GTGGIGKSTLINTLVGLDFFSDTHFDIGTGKD 634


35  mll4604  mll4634  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
mll4604  2  11  -0.367070  hypothetical protein
mlr4605  1  11  -0.287324  transcriptional regulator
mll4606  2  12  0.854917  ATP-dependent DNA ligase
mll4607  -1  13  0.987479  hypothetical protein
mll4608  -1  14  2.097395  hypothetical protein
mll4611  -2  15  3.334455  hypothetical protein
mlr4612  -2  15  3.424870  hypothetical protein
mlr4613  -2  15  3.412850  ECF family RNA polymerase sigma factor
mll4614  -2  17  2.008162  glutathione S-transferase
mll4616  -2  19  2.029624  cysteine synthase
mll4618  -2  17  1.810823  hypothetical protein
mll4619  -1  16  1.375578  L-sorbosone dehydrogenase
msl4620  0  15  1.971743  hypothetical protein
mll4621  0  15  1.689309  heat-inducible transcription repressor
mlr4622  1  15  1.296066  ribonuclease PH
mlr4624  2  14  0.912827  lactoylglutathione lyase
mlr4626  1  13  2.333428  deoxyribonucleotide triphosphate
mlr4627  0  16  2.935329  coproporphyrinogen III oxidase
mlr4629  0  16  1.607683  hypothetical protein
msl4630  0  17  2.589503  hypothetical protein
msl4631  0  17  3.219351  hypothetical protein
mlr4632  1  17  3.049696  hypothetical protein
mlr4633  1  16  2.706799  hypothetical protein
mll4634  2  14  2.640296  UDP pyrophosphate phosphatase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mll4616  VACCYTOTOXIN  29  0.031  Helicobacter pylori vacuolating cytotoxin signature.
		>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature.

Length = 1291

Score = 28.8 bits (64), Expect = 0.031
Identities = 22/74 (29%), Positives = 36/74 (48%), Gaps = 2/74 (2%)

Query: 241 AGFAPKILDTTIYDEIIKVSNEDSVANARLVARLEGVPVGISSGAALQAAIVVGSRPENK 300
AG+A ++D T +EI K N + +A LE G+ + +L A+++ SR N
Sbjct: 922 AGYARTMIDATSANEITKQLN-TATTTLNNIASLEHKTSGLQT-LSLSNAMILNSRLVNL 979

Query: 301 GKTLVVVIPDFAER 314
+ I FA+R
Sbjct: 980 SRRHTNHIDSFAKR 993


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mlr4624  ECOLIPORIN  28  0.014  E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 28.0 bits (62), Expect = 0.014
Identities = 17/38 (44%), Positives = 20/38 (52%), Gaps = 1/38 (2%)

Query: 5 AILESALYVTDLA-AAEQFYVGVLGLDLLGKVDGRHLF 41
A++ AL A AAE + LDL GKVDG H F
Sbjct: 7 ALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYF 44


36  mlr4645  msr4728  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
mlr4645  -1  14  3.103598  *hypothetical protein
mlr4646  0  14  3.194985  hypothetical protein
mlr4647  -1  11  3.338046  cytochrome b561
msl4648  -2  12  3.430877  hypothetical protein
msl4649  -2  11  3.238825  hypothetical protein
mll4650  -2  10  3.020769  adenylate cyclase
mlr4651  -2  10  1.814762  cysteine synthase
mlr4653  -2  10  1.392623  aromatic-L-amino-acid decarboxylase
mlr4654  -2  13  0.007065  hypothetical protein
mll4655  -1  11  2.613589  adenylate cyclase
msr4656  1  14  2.593300  hypothetical protein
mll4659  1  13  2.592477  hypothetical protein
mlr4660  1  14  2.688014  hypothetical protein
mlr4661  1  14  3.164571  hypothetical protein
mlr4662  -2  10  0.760528  hydantoinase
mlr4664  -1  12  -1.273845  malate synthase G
mlr4665  1  22  -3.703044  hypothetical protein
mlr4666  1  24  -4.205273  TetR family transcriptional regulator
mll4667  3  27  -5.079399  hypothetical protein
mlr4668  4  36  -7.618916  site-specific recombinase
mlr4669  4  36  -7.357365  hypothetical protein
mlr4670  4  36  -7.897358  hypothetical protein
mlr4671  6  43  -8.436850  hypothetical protein
mlr4672  6  46  -9.387547  hypothetical protein
mll4674  5  44  -10.066377  hypothetical protein
mll4676  5  43  -9.904672  hypothetical protein
msl4677  5  42  -9.820167  hypothetical protein
mll4680  6  42  -9.700178  glycosyltransferase
mlr4682  5  39  -9.375637  hypothetical protein
mlr4683  5  34  -8.701433  DNA-directed DNA polymerase
mlr4684  7  33  -6.771040  hypothetical protein
mlr4685  6  36  -7.068331  hypothetical protein
mlr4686  6  37  -6.946185  hypothetical protein
mlr4687  5  40  -6.320921  hypothetical protein
mlr4688  4  40  -6.654922  hypothetical protein
mll4689  2  39  -6.894771  transcriptional regulator
mlr4691  2  41  -6.908699  hypothetical protein
mlr4692  2  39  -7.207270  transcriptional regulator
mll4695  2  39  -7.213234  hypothetical protein
msl4696  3  36  -7.681639  hypothetical protein
mll4697  3  35  -7.178349  two-component system response regulator
mll4698  2  30  -6.465223  two-component system histidine protein kinase
mll4699  2  29  -6.693205  large-conductance mechanosensitive channel
msl4700  3  31  -6.216410  hypothetical protein
mll4701  2  33  -6.712656  chloramphenicol acetyltransferase
msl4702  1  27  -5.643792  hypothetical protein
mll4705  1  30  -6.383286  DNA ligase
mlr4706  2  33  -6.861673  hypothetical protein
mll4707  2  21  -3.840380  hypothetical protein
mll4708  2  20  -4.181243  DNA invertase
mll4709  2  17  -3.763050  hypothetical protein
mll4710  3  22  -4.288490  NADPH:quinone oxidoreductase
mlr4711  2  21  -3.941099  hypothetical protein
mll4712  2  22  -3.751435  integral membrane transport protein
mlr4713  3  34  -6.661051  large-conductance mechanosensitive channel
msl4714  3  36  -6.151142  transglycosylase
mlr4715  4  34  -5.370944  hypothetical protein
mlr4717  4  32  -5.902096  outer membrane lipoprotein
mlr4720  3  34  -5.346210  small heat-shock protein
mlr4721  4  33  -4.911958  small heat shock protein (class I)
mlr4722  2  31  -5.288049  protoporphyrinogen oxidase
mlr4723  2  30  -4.434885  hypothetical protein
mll4724  0  21  -2.935876  hypothetical protein
msr4725  1  21  -1.631053  hypothetical protein
msl4726  2  18  -0.077393  hypothetical protein
msr4728  2  18  -0.686294  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mlr4645  AEROLYSIN  31  0.009  Aerolysin signature.
		>AEROLYSIN#Aerolysin signature.

Length = 493

Score = 30.8 bits (69), Expect = 0.009
Identities = 13/41 (31%), Positives = 20/41 (48%)

Query: 77 ITGMALMARAYNQAQKFGVEMVIPDEAKLLSAAADNTGARY 117
+TG++L+ AQ E V PD+ +L S G +Y
Sbjct: 6 LTGLSLIISGLLMAQAQAAEPVYPDQLRLFSLGQGVCGDKY 46


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mlr4666  HTHTETR  70  2e-17  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 70.4 bits (172), Expect = 2e-17
Identities = 38/210 (18%), Positives = 67/210 (31%), Gaps = 20/210 (9%)

Query: 3 KPSNTADEILAAARTFIVAGGYNGFSYADIAEVVGIRKASIHHHFPSKVDLVQTLLKRYL 62
+ T IL A G + S +IA+ G+ + +I+ HF K DL + +
Sbjct: 8 EAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSE 67

Query: 63 EDAVIGLAGLERDVPG-PPELLRTYA-GIWARCIEDASMPFCVCALLASELPA--LPPQL 118
+ + PG P +LR + + + + + +
Sbjct: 68 SNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQ 127

Query: 119 AVEVRAYFQFLSGWLTGVIERGAEKGTLVISASPRIEAEAFMATVHGAMLS--------- 169
+ R + ++ E L R A + G M +
Sbjct: 128 QAQ-RNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFD 186

Query: 170 ----ARAYGDVLTFGMILTPTLQKLIPATD 195
AR Y +L +L PTL+ PAT+
Sbjct: 187 LKKEARDYVAILLEMYLLCPTLRN--PATN 214


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mll4667  HTHFIS  33  0.002  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 32.9 bits (75), Expect = 0.002
Identities = 17/65 (26%), Positives = 25/65 (38%), Gaps = 2/65 (3%)

Query: 67 GGLRARPGEVSLAHHGVLFLDEFPEFTPQTLDALRQPLETGDCMIARANHRVTYPARIQL 126
G G A G LFLDE + L + L+ G+ R + +++
Sbjct: 218 GAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGE--YTTVGGRTPIRSDVRI 275

Query: 127 VAAMN 131
VAA N
Sbjct: 276 VAATN 280


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mlr4688  TONBPROTEIN  27  0.037  Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 27.3 bits (60), Expect = 0.037
Identities = 16/69 (23%), Positives = 23/69 (33%), Gaps = 9/69 (13%)

Query: 10 SPSADQPDVASQVPPNTQPPPSAPPVLASQPPTTGPSLFIPDARSGRIGHPRGAAPLKAL 69
+P+ +P A Q PP P P +PP P + I P+ K
Sbjct: 51 TPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVV---------IEKPKPKPKPKPK 101

Query: 70 PEAKSENVA 78
P K +
Sbjct: 102 PVKKVQEQP 110


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mll4697  HTHFIS  55  4e-11  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 54.8 bits (132), Expect = 4e-11
Identities = 25/117 (21%), Positives = 44/117 (37%), Gaps = 2/117 (1%)

Query: 4 PRIKIVIADDHPIVRSGIRAVLEQRAEWWVCGEADNGETAVRLAREHAAKIVILDYSLPV 63
I++ADD +R+ + L + N T R +V+ D +P
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGY--DVRITSNAATLWRWIAAGDGDLVVTDVVMPD 59

Query: 64 LNGLEATRIIRRTMPETEVLIYTMHEDESLIRETLRAGARGYLLKIEDDSELVAAVA 120
N + I++ P+ VL+ + + GA YL K D +EL+ +
Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
mll4699   MECHCHANNEL  129    1e-41    Bacterial mechano-sensitive ion channel signature.
		>MECHCHANNEL#Bacterial mechano-sensitive ion channel signature.

Length = 136

Score = 129 bits (325), Expect = 1e-41
Identities = 68/139 (48%), Positives = 91/139 (65%), Gaps = 9/139 (6%)

Query: 6 MLKEFQEFISKGNVMDLAVGVIIGAAFGKIVDSLVNDIIMPIIGAIFGGLDFNNYFVGLS 65
++KEF+EF +GNV+DLAVGVIIGAAFGKIV SLV DIIMP +G + GG+DF + V L
Sbjct: 3 IIKEFREFAMRGNVVDLAVGVIIGAAFGKIVSSLVADIIMPPLGLLIGGIDFKQFAVTL- 61

Query: 66 SAVNATSLADARKQGAVLAYGSFITVALNFVILAFIIFLMVKAVNNLRKRLEREKPAAAA 125
A V+ YG FI +F+I+AF IF+ +K +N L + ++E+PAAA
Sbjct: 62 ------RDAQGDIPAVVMHYGVFIQNVFDFLIVAFAIFMAIKLINKLNR--KKEEPAAAP 113

Query: 126 PPPADIALLTQIRDLLARK 144
P + LLT+IRDLL +
Sbjct: 114 APTKEEVLLTEIRDLLKEQ 132


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
mlr4713   MECHCHANNEL  129    1e-41    Bacterial mechano-sensitive ion channel signature.
		>MECHCHANNEL#Bacterial mechano-sensitive ion channel signature.

Length = 136

Score = 129 bits (325), Expect = 1e-41
Identities = 69/138 (50%), Positives = 88/138 (63%), Gaps = 10/138 (7%)

Query: 1 MLKEFQEFISKGNVMDLAVGVIIGAAFGKIVDSLVNDIIMPVIGAIFGGLDFNNYFVGLS 60
++KEF+EF +GNV+DLAVGVIIGAAFGKIV SLV DIIMP +G + GG+DF + V L
Sbjct: 3 IIKEFREFAMRGNVVDLAVGVIIGAAFGKIVSSLVADIIMPPLGLLIGGIDFKQFAVTL- 61

Query: 61 SAVNATSLADAKKQGAVFAYGSFITVALNFVILAFIIFLMVKAVNNLRRRLEREKPATPA 120
A V YG FI +F+I+AF IF+ +K +N L R ++E+PA
Sbjct: 62 ------RDAQGDIPAVVMHYGVFIQNVFDFLIVAFAIFMAIKLINKLNR--KKEEPAAAP 113

Query: 121 APPPADVALLTEIRDLLA 138
A P + LLTEIRDLL
Sbjct: 114 A-PTKEEVLLTEIRDLLK 130


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr4717   BCTLIPOCALIN  93     3e-26    Bacterial lipocalin signature.
		>BCTLIPOCALIN#Bacterial lipocalin signature.

Length = 171

Score = 92.8 bits (230), Expect = 3e-26
Identities = 51/153 (33%), Positives = 82/153 (53%), Gaps = 11/153 (7%)

Query: 18 AVTAITSLNLSRYLGKWYELCRLPIKYEDETATDITANYSLNDSVTVRVDNRCFD-KHGK 76
+V ++ L+ YLGKWYE+ RL +E + +TA Y + + + V NR + + G+
Sbjct: 22 SVKPVSDFELNNYLGKWYEVARLDHSFE-RGLSQVTAEYRVRNDGGISVLNRGYSEEKGE 80

Query: 77 PFRAIGEATPVDD-ARSRLKVTFLPKYIRWIPFTSGDYWVLKLDPE-YKVSLVGSPDRQY 134
A G+A V+ LKV+F PF G Y V +LD E Y + V P+ +Y
Sbjct: 81 WKEAEGKAYFVNGSTDGYLKVSFFG------PFY-GSYVVFELDRENYSYAFVSGPNTEY 133

Query: 135 LWLLARSPDLAQDTRERYLAEAKRQGFDLTNLI 167
LWLL+R+P + + ++++ +K +GFD LI
Sbjct: 134 LWLLSRTPTVERGILDKFIEMSKERGFDTNRLI 166


37  mll4938  mlr4951  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mll4938   2       18       1.165149   hypothetical protein
mll4940   2       18       1.861651   urease subunit alpha
mlr4941   2       18       2.083874   hypothetical protein
mlr4942   1       14       2.908246   hypothetical protein
msl4944   2       15       2.489102   urease subunit beta
mll4945   2       14       2.615778   urease accessory protein UreJ
msl4947   0       13       1.402268   hypothetical protein
mll4948   1       12       1.648451   urease subunit gamma
mll4949   1       12       1.829279   urease accessory protein D
mlr4951   2       13       2.011410   NodF
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
mll4940   UREASE  1100   0.0      Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 1100 bits (2848), Expect = 0.0
Identities = 498/568 (87%), Positives = 536/568 (94%), Gaps = 1/568 (0%)

Query: 3 RISRAAYAQMYGPTVGDKVRLADTELFIEVEKDLTIHGEEVKFGGGKVIRDGMGQSQVSR 62
R+SRAAYA M+GPTVGDKVRLADTELFIEVEKD T HGEEVKFGGGKVIRDGMGQSQV+R
Sbjct: 4 RMSRAAYANMFGPTVGDKVRLADTELFIEVEKDFTTHGEEVKFGGGKVIRDGMGQSQVTR 63

Query: 63 AQGAVDTVITNALVVDAGAGIFKADIGLKDGRIAAIGKAGNPDTQDGVTIIIGPGTEIIA 122
GAVDTVITNAL++D GI KADIGLKDGRIAAIGKAGNPD Q GVTII+GPGTE+IA
Sbjct: 64 EGGAVDTVITNALILDH-WGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIA 122

Query: 123 GEGKILTAGGFDAHIHFICPQQIEEALMSGITTMLGGGTGPAHGTLATTCTPGPWHMARM 182
GEGKI+TAGG D+HIHFICPQQIEEALMSG+T MLGGGTGPAHGTLATTCTPGPWH+ARM
Sbjct: 123 GEGKIVTAGGMDSHIHFICPQQIEEALMSGLTCMLGGGTGPAHGTLATTCTPGPWHIARM 182

Query: 183 IQSFDAFPMNIGLSGKGNASLPAALEEMVLGGACSLKLHEDWGTTPAAIDCCLSVADDYD 242
I++ DAFPMN+ +GKGNASLP AL EMVLGGA SLKLHEDWGTTPAAIDCCLSVAD+YD
Sbjct: 183 IEAADAFPMNLAFAGKGNASLPGALVEMVLGGATSLKLHEDWGTTPAAIDCCLSVADEYD 242

Query: 243 VQVMIHTDTLNESGFVENTVAAIKGRTIHAFHTEGAGGGHAPDIIKVCGLPNVIPSSTNP 302
VQVMIHTDTLNESGFVE+T+AAIKGRTIHA+HTEGAGGGHAPDII++CG PNVIPSSTNP
Sbjct: 243 VQVMIHTDTLNESGFVEDTIAAIKGRTIHAYHTEGAGGGHAPDIIRICGQPNVIPSSTNP 302

Query: 303 TRPYTVNTLAEHLDMLMVCHHLSPSIPEDIAFAESRIRKETIAAEDILHDIGAFSIISSD 362
TRPYTVNTLAEHLDMLMVCHHLSP+IPEDIAFAESRIRKETIAAEDILHDIGAFSIISSD
Sbjct: 303 TRPYTVNTLAEHLDMLMVCHHLSPTIPEDIAFAESRIRKETIAAEDILHDIGAFSIISSD 362

Query: 363 SQAMGRVGEVAIRTWQTADKMKRQRGALPQETGDNDNFRVRRYIAKYTINPAIAHGLSKD 422
SQAMGRVGEVAIRTWQTADKMKRQRG L +ETGDNDNFRV+RYIAKYTINPAIAHGLS +
Sbjct: 363 SQAMGRVGEVAIRTWQTADKMKRQRGRLKEETGDNDNFRVKRYIAKYTINPAIAHGLSHE 422

Query: 423 IGSIAVGKRADLVLWNPAFFGVKPDMVLVGGMIAAAPMGDPNASIPTPQPMHYRPMFGAY 482
IGS+ VGKRADLVLWNPAFFGVKPDMVL+GG IAAAPMGDPNASIPTPQP+HYRPMFGAY
Sbjct: 423 IGSLEVGKRADLVLWNPAFFGVKPDMVLLGGTIAAAPMGDPNASIPTPQPVHYRPMFGAY 482

Query: 483 GKARTNSSVTFVSKAALESGLHGRLGVEKQFVAVENTRGGIGKHSMVLNDATPHVEVDPE 542
G++RTNSSVTFVS+A+L++GL GRLGV K+ VAV+NTRGGIGK SM+ N TPH+EVDPE
Sbjct: 483 GRSRTNSSVTFVSQASLDAGLAGRLGVAKELVAVQNTRGGIGKASMIHNSLTPHIEVDPE 542

Query: 543 TYEVRADGELLTCEPATVLPMAQRYFLF 570
TYEVRADGELLTCEPATVLPMAQRYFLF
Sbjct: 543 TYEVRADGELLTCEPATVLPMAQRYFLF 570


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
mlr4941   IGASERPTASE  30     0.002    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 30.4 bits (68), Expect = 0.002
Identities = 19/115 (16%), Positives = 29/115 (25%), Gaps = 5/115 (4%)

Query: 20 LAGAASAQQQPAPATPAPKATTPAAG-----GQQAAPAIQSVNIVDITELPKDTQTQVNQ 74
+Q T P T + SV P TQ VN
Sbjct: 1154 NIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNS 1213

Query: 75 VIAQRGDAGLQKLRSSIDATPKVKSALQAKGMTSAQVVAASMEPNGALTLITKKA 129
+ + ++ S+ + + T A S N L+ KA
Sbjct: 1214 ESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKA 1268


38  mlr4987  mlr5016  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mlr4987   2       16       0.510421   hypothetical protein
mll4988   1       17       0.055655   phospho-2-dehydro-3-deoxyheptonate aldolase
mll4989   0       16       -0.334427  short chain dehydrogenase
mll4991   1       17       -0.123113  myo-inositol dehydrogenase (idhA)
mll4992   1       16       -0.440683  sugar ABC transporter ATP-binding protein
mll4993   3       13       0.474254   ribose ABC transporter permease
mll4996   0       12       0.120811   sugar ABC transporter substrate-binding protein
mll4997   1       11       0.087215   hypothetical protein
mlr4998   2       11       -0.743433  hydrocarbon oxygenase MocD
mlr4999   2       10       -1.390342  Rieske-like ferredoxin MocE
mlr5000   1       11       -0.988396  ferredoxin reductase
mll5001   0       13       -1.961673  hypothetical protein
mll5004   1       15       -0.977783  ATP-dependent protease ATP-binding protein HslU
mll5005   3       15       0.005052   hypothetical protein
mll5006   1       16       0.638350   aminoglycoside 6'-N-acetyltransferase
mll5007   1       14       1.836289   ATP-dependent protease peptidase subunit
mlr5008   2       11       1.867184   imidazoleglycerol-phosphate dehydratase
mlr5010   2       13       2.301893   hypothetical protein
mlr5011   3       14       1.853988   imidazole glycerol phosphate synthase subunit
mlr5013   1       14       0.772577   hypothetical protein
mlr5014   2       15       1.540820   1-(5-phosphoribosyl)-5-[(5-
mlr5015   2       14       0.990808   arginase
mlr5016   2       16       1.286588   imidazole glycerol phosphate synthase subunit
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll4989   DHBDHDRGNASE  104    6e-29    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 104 bits (260), Expect = 6e-29
Identities = 73/242 (30%), Positives = 114/242 (47%), Gaps = 11/242 (4%)

Query: 13 AIVTGGAQGIGFAVAEALADEGCRALALIGRSQEKGDKAVAHFKKAGVDAIFISADVSKV 72
A +TG AQGIG AVA LA +G A + + EK +K V+ K A ADV
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAA-VDYNPEKLEKVVSSLKAEARHAEAFPADVRDS 69

Query: 73 ADCKRAVATALAHFGTLNALVNAAATSARGSLVETSEELFDQIFATNVRGPFFLMQGLVA 132
A A G ++ LVN A G + S+E ++ F+ N G F + +
Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSK 129

Query: 133 HLLERKAPGSIVNVLSMSAHCGQSFLTPYSTSKGALMTLTKNVANAYRFDRIRCNAVLPG 192
++++R++ GSIV V S A ++ + Y++SK A + TK + IRCN V PG
Sbjct: 130 YMMDRRS-GSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 193 WMDTEGEDIVQKKW-HDAPDDWLAKAEAAQ-----PMGQLVKPDQLARLISYMVSPQSGV 246
+T D+ W + + + K P+ +L KP +A + ++VS Q+G
Sbjct: 189 STET---DMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGH 245

Query: 247 MT 248
+T
Sbjct: 246 IT 247


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
mll4996   HTHFIS  31     0.007    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 31.0 bits (70), Expect = 0.007
Identities = 11/42 (26%), Positives = 20/42 (47%)

Query: 222 DESAIGAIQAMKAANIDMKSVVVGGVDATQDALAAMQAGDLD 263
DE+A + +K A D+ +V+ + A+ A + G D
Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYD 100


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mll5004   PF06580  36     3e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 35.6 bits (82), Expect = 3e-04
Identities = 11/115 (9%), Positives = 38/115 (33%), Gaps = 21/115 (18%)

Query: 107 IRDLVEIAIGLVREKMREDVKARAHINAEERVLEALVGK-TASP----ATRDSFRKKLRD 161
+ +++ ++ + + + +++ V + + +
Sbjct: 225 VDSYLQL------ASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQG 278

Query: 162 GEL------DDKEIEIEVADTGNGGMPGFEIPGMPGANIGVLNINDMLSKAMGGK 210
G++ D+ + +EV +TG+ + G+ N+ + L G +
Sbjct: 279 GKILLKGTKDNGTVTLEVENTGSLALKN----TKESTGTGLQNVRERLQMLYGTE 329


39  mll5078  mll5088  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias    Product
mll5078   2       10       3.094545   plasmid stabilization protein
msl5079   2       10       3.400299   hypothetical protein
mll5080   2       11       3.447749   hypothetical protein
mll5082   1       11       3.449892   thioredoxin
mll5083   0       12       3.156062   ATP-dependent nuclease subunit A
mll5084   0       12       2.893842   hypothetical protein
mll5085   -2      16       1.411601   hypothetical protein
mll5086   -2      15       1.596191   hypothetical protein
mll5087   0       17       1.471097   two-component sensor histidine kinase
mll5088   2       17       0.640023   S-adenosyl-L-homocysteine hydrolase
40  mll5277  mll5293  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mll5277   3       17       -0.596251  transcriptional regulator
mlr5278   2       15       -0.495659  glucose dehydrogenase
mlr5280   1       12       -0.402666  glucose dehydrogenase
mlr5281   0       12       0.179303   hypothetical protein
mlr5282   2       12       0.992670   hypothetical protein
mlr5283   2       12       0.965268   hypothetical protein
mll5285   0       12       1.731880   hypothetical protein
mll5286   1       13       1.851484   acetyl-coa synthetase
msr5287   1       13       2.036003   hypothetical protein
mll5289   1       14       2.752404   dihydroxyacetone kinase subunit DhaK
mll5290   2       13       3.154581   hypothetical protein
mll5291   1       13       2.888015   phosphoenolpyruvate-protein phosphotransferase
msl5292   2       13       2.168736   phosphocarrier protein HP
mll5293   2       13       1.867284   PTS system mannnose-specific transporter subunit
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr5278   DHBDHDRGNASE  108    6e-31    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 108 bits (272), Expect = 6e-31
Identities = 74/251 (29%), Positives = 111/251 (44%), Gaps = 14/251 (5%)

Query: 4 LNGKTAVITGGATGIGRAAATRFIEEGAFVFIYGRRQEALDAAVADLGANAR---AVKGS 60
+ GK A ITG A GIG A A +GA + E L+ V+ L A AR A
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 61 VSDLADLDRLYAAVKAERGTLDIVFANAGAGGPLPLGQITAEHIDETFDTNVKGTIFTVQ 120
V D A +D + A ++ E G +DI+ AG P + ++ E + TF N G +
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 121 QALPLMGP--GGSIILTGSSAGTTGAPGFTAYSASKAAVRNLARTWAEDLKGTGIRVNVL 178
M GSI+ GS+ AY++SKAA + +L IR N++
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 179 SPGATATELAKEALGEEGQKA---------YGAMTPLQRMADPAEIGAVAAFLASSDSSF 229
SPG+T T++ +E + PL+++A P++I FL S +
Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGH 245

Query: 230 MTASEVAVDGG 240
+T + VDGG
Sbjct: 246 ITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr5280   DHBDHDRGNASE  124    8e-37    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 124 bits (313), Expect = 8e-37
Identities = 80/256 (31%), Positives = 125/256 (48%), Gaps = 14/256 (5%)

Query: 2 KKLEGKIAVITGGSSGIGLATAKRFVEEGAHVV---ITGRREKELKEAAAFIMRNVTTVV 58
K +EGKIA ITG + GIG A A+ +GAH+ + +++ + R+
Sbjct: 4 KGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP 63

Query: 59 GDVSLLEDLDRLYAVVKEKHGHIDVLFANAGAGTIAPLAAATEAHFDQTFDVNVKGLFFT 118
DV +D + A ++ + G ID+L AG + + ++ ++ TF VN G+F
Sbjct: 64 ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNA 123

Query: 119 VQKALPLFKD--GGSIILNSSVSNVLGLP--GFSTYAASKAAVRNFSRAWTLELKDRKIR 174
+ D GSI+ S N G+P + YA+SKAA F++ LEL + IR
Sbjct: 124 SRSVSKYMMDRRSGSIVTVGS--NPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIR 181

Query: 175 VNTMSPGAIETPALETTTGLTPEQAEQAVAQFASQ----IPMGRRGKPEEIAAAVTFLAS 230
N +SPG+ ET ++ + AEQ + IP+ + KP +IA AV FL S
Sbjct: 182 CNIVSPGSTET-DMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVS 240

Query: 231 DDSSYVTGVDLAVDGG 246
+ ++T +L VDGG
Sbjct: 241 GQAGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll5291   PHPHTRNFRASE  428    e-147    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 428 bits (1102), Expect = e-147
Identities = 192/540 (35%), Positives = 290/540 (53%), Gaps = 17/540 (3%)

Query: 5 RIEGVPASAGYAEGPLFDLDRPPAVYAGKAT--LAEEKTSLQAAIGTAVSRLSAMAEAAD 62
+I G+ AS+G A F P + ++ E L AA+ + L A+ + +
Sbjct: 4 KITGIAASSGVAIAKAFIHLEPNVDIEKTSITDVSTEIEKLTAALEKSKEELRAIKDQTE 63

Query: 63 S----DAAGILEFHIAMLEDDALRGPAFASI-TSGQTADVAWREALDAEIAGYEASDQDY 117
+ D A I H+ +L+D L I A+ A +E D ++ +E+ D +Y
Sbjct: 64 ASMGADKAEIFAAHLLVLDDPELVDGIKGKIENEQMNAEYALKEVSDMFVSMFESMDNEY 123

Query: 118 FRARATDLRDIRDQVLRALTEDVDSAAPT---GAILCGEDIAPTRFLETDWSHGGGIALK 174
+ RA D+RD+ +VL L + T ++ ED+ P+ + + G A
Sbjct: 124 MKERAADIRDVSKRVLGHLIGVETGSLATIAEETVIIAEDLTPSDTAQLNKQFVKGFATD 183

Query: 175 AGSTASHVAMLARSRGVPMVVGLREMSASPAG--MALLDAEHGSIVLAPSPAEIGAFRQS 232
G SH A+++RS +P VVG +E++ M ++D G +++ P+ E+ A+ +
Sbjct: 184 IGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVNPTEEEVKAYEEK 243

Query: 233 SASFAARQGKAQRFLARPAVTKAGTAVRVQVNIADPSDVDGIDVATCDGVGLMRTEFLF- 291
A+F ++ + + + P+ TK G V + NI P DVDG+ +G+GL RTEFL+
Sbjct: 244 RAAFEKQKQEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTEFLYM 303

Query: 292 -GKNLPDEETQYRAYRKVLEWAGEKPVTIRTVDAGGDKPVPGFTV-EEGNPFLGLRGIRL 349
LP EE Q+ AY++V++ KPV IRT+D GGDK + + +E NPFLG R IRL
Sbjct: 304 DRDQLPTEEEQFEAYKEVVQRMDGKPVVIRTLDIGGDKELSYLQLPKELNPFLGFRAIRL 363

Query: 350 SLARLDIFRIQIRALLRAAPHGNLRVMFPMIAMADEYERAAALFAEEQAALAAGGVAQKM 409
L + DIFR Q+RALLRA+ +GNL+VMFPMIA +E +A A+ EE+ L + GV
Sbjct: 364 CLEKQDIFRTQLRALLRASTYGNLKVMFPMIATLEELRQAKAIMQEEKDKLLSEGVDVSD 423

Query: 410 P-PLGIMVEVPSVAIAPEAFAG-VAFFSIGSNDLTQYVMAAARDNAAVAHFNSVRHPAVL 467
+GIMVE+PS A+A FA V FFSIG+NDL QY MAA R N V++ HPA+L
Sbjct: 424 SIEVGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAADRMNERVSYLYQPYHPAIL 483

Query: 468 RLIGSVAAFGRENGIPVSLCGDAGGDPAAIPSLLEAGLRDLSVAPAQLAMAKAAIADVSV 527
RL+ V G V +CG+ GD AIP LL GL + S++ + A++ + +S
Sbjct: 484 RLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLDEFSMSATSILPARSQLLKLSK 543


41  mlr5450  mlr5478  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mlr5450   -1      11       3.491687   indolepyruvate oxidoreductase subunit B
mlr5451   -1      11       3.633411   acylglycerophosphoethanolamine acyltransferase
mll5452   2       10       5.083735   hypothetical protein
mll5453   2       12       4.254416   hypothetical protein
mll5454   0       16       1.890020   hypothetical protein
mll5455   0       20       1.054312   hypothetical protein
mll5456   0       21       -0.200077  RNA polymerase sigma factor
mll5457   0       25       -1.704872  acid shock protein
mll5459   -1      33       -3.016370  hypothetical protein
mlr5460   0       31       -3.265986  CP4-like integrase
msr5461   1       29       -3.348325  hypothetical protein
msr5462   2       27       -3.378099  hypothetical protein
mlr5463   2       25       -3.075849  hypothetical protein
mlr5464   2       22       -3.184503  hypothetical protein
mll5465   5       18       -2.464656  hypothetical protein
mll5466   6       16       -2.275265  hypothetical protein
mll5467   5       16       -2.357825  hypothetical protein
mll5468   6       16       -1.916249  hypothetical protein
mll5469   4       17       -2.336547  hypothetical protein
mll5470   4       18       -2.331051  hypothetical protein
mll5471   3       20       -3.684622  arylsulfatase
mlr5472   3       28       -5.241323  hypothetical protein
mlr5473   2       24       -4.173912  hypothetical protein
mll5475   3       19       -3.203599  hypothetical protein
mlr5477   2       16       -2.264162  hypothetical protein
mlr5478   2       15       -1.820799  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mlr5451   TCRTETB  37     6e-04    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 36.8 bits (85), Expect = 6e-04
Identities = 41/185 (22%), Positives = 72/185 (38%), Gaps = 9/185 (4%)

Query: 18 LFWTQFLSAFNDNFLKNTLVFLILFTLAADQAAS--LVTLAGAVFMAPFLLLSALGGEIA 75
L W LS F+ L ++ + L +A D FM F + +A+ G+++
Sbjct: 16 LIWLCILSFFS--VLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLS 73

Query: 76 DRFDKALIARRLKFAEIAAAAVSVAG-IALSSIPVLMTALLMFGIISALFGPIKYGILPD 134
D+ I R L F I SV G + S +L+ A + G +A F + ++
Sbjct: 74 DQLG---IKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVAR 130

Query: 135 HLERKELPKANAWIESATFAAILGGTIAGGVVSADGIGVTVFGPIMMALAVGCWFVSRYI 194
++ ++ KA I S G GG++ A I + I M + F+ + +
Sbjct: 131 YIPKENRGKAFGLIGSIVAMGEGVGPAIGGMI-AHYIHWSYLLLIPMITIITVPFLMKLL 189

Query: 195 PPTGS 199

Sbjct: 190 KKEVR 194


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mll5453   SECA  30     0.030    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 29.8 bits (67), Expect = 0.030
Identities = 9/14 (64%), Positives = 10/14 (71%)

Query: 280 GREGRLGEPGSSVF 293
GR GR G+ GSS F
Sbjct: 573 GRSGRQGDAGSSRF 586


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mll5455   PF04335  32     7e-04    VirB8 type IV secretion protein
		>PF04335#VirB8 type IV secretion protein

Length = 227

Score = 32.1 bits (73), Expect = 7e-04
Identities = 14/54 (25%), Positives = 25/54 (46%), Gaps = 3/54 (5%)

Query: 115 ADRYVVRRRRQRFWWLGLGLAGIGLAGAVAGLALVTVVTPDVQPDHYVLDANAT 168
D+ R ++ W+ +AG+ A A AG+ V +TP + YV+ +
Sbjct: 22 RDKLAAAERSKKLAWV---VAGVAGALATAGVVAVAALTPLKTVEPYVITVDRN 72


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
msr5461   HTHFIS  24     0.033    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 24.4 bits (53), Expect = 0.033
Identities = 6/18 (33%), Positives = 14/18 (77%)

Query: 14 EAAAILGMSKPTFWRRVR 31
+AA +LG+++ T +++R
Sbjct: 454 KAADLLGLNRNTLRKKIR 471


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll5468   BCTERIALGSPC  27     0.030    Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C signature.

Length = 272

Score = 27.2 bits (60), Expect = 0.030
Identities = 17/60 (28%), Positives = 27/60 (45%), Gaps = 10/60 (16%)

Query: 52 NPERDALYLNVTPSKNDGKTV-YRLAVKDVPVDAFWSISLYNAEGHFQKNDLNAYSLNSI 110
+ Y++ +P ND K YRL D+F+ + L Q ND+ A +LN +
Sbjct: 180 ASTTMSDYVSFSPIMNDNKLQGYRLNPGPKS-DSFYRVGL-------QDNDM-AVALNGL 230


42  mlr5721  mll5816  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mlr5721   -1      19       -5.090179  transcriptional regulator
msr5722   0       17       -1.576612  hypothetical protein
mlr5723   0       19       -1.638786  hypothetical protein
msl5725   0       24       -1.907511  hypothetical protein
msl5727   -1      23       -2.140112  hypothetical protein
msr5728   -1      24       -2.008953  hypothetical protein
mll5729   0       23       -1.503274  conjugal transfer relaxase TraA
mll5731   2       30       -4.018379  hypothetical protein
mlr5732   1       21       -3.305818  hypothetical protein
msr5733   0       18       -2.954675  hypothetical protein
msr8675   0       16       -1.878447  hypothetical protein
msr5734   -1      14       -1.418556  hypothetical protein
mll5735   -1      15       -1.774924  methyltransferase
mlr5737   -1      14       -2.032487  two-component sensor protein
mlr5738   2       20       -2.626326  two-component response regulator
mlr5739   1       21       -2.746946  two-component sensor protein
mlr5740   2       25       -4.066262  ribose ABC transporter
mll5741   4       31       -5.344110  hypothetical protein
msr5742   4       34       -5.138349  hypothetical protein
mll5743   4       34       -5.421203  hypothetical protein
mlr5745   5       33       -5.867417  hypothetical protein
mlr5746   5       32       -6.272934  hypothetical protein
msl5749   5       31       -5.986913  transglycosylase
mlr5750   4       30       -5.976091  hypothetical protein
mlr5751   4       27       -6.488714  outer membrane lipoprotein
mlr5752   3       25       -4.299569  hypothetical protein
mll5753   0       26       -4.254872  hypothetical protein
msr5754   2       28       -5.013632  hypothetical protein
msr5755   2       29       -5.236132  hypothetical protein
msl5756   3       25       -4.655368  hypothetical protein
mlr5757   1       21       -3.402495  transposase
mll5758   1       20       -3.203062  hypothetical protein
mll5759   1       18       -2.424256  magnesium/cobalt transport protein
msl5762   2       16       -2.123732  transcriptional regulatory protein
mll5763   1       14       -1.892456  symbiosis island integrase
mll5764   0       11       -0.985373  transketolase
mll5765   -1      13       -1.063532  transketolase
mll5766   -1      13       -1.720380  short-chain type dehydrogenase/reductase
mll5767   -1      14       -1.979250  hypothetical protein
mll5768   0       14       -1.945294  ABC transporter substrate-binding protein
mll5769   -1      18       -2.371794  ABC transporter permease
mll5770   0       23       -3.044140  ABC transporter ATP-binding protein
mlr5771   -1      26       -4.034553  transcriptional regulatory protein
msr5773   0       30       -3.809235  hypothetical protein
mll5774   -1      24       -3.324370  transposase
mlr5775   0       24       -3.152272  thioredoxin
mlr5776   0       24       -3.027422  hypothetical protein
mll5777   0       24       -3.029915  two-component sensor
mll5778   0       24       -3.430593  two-component response regulator
mll5779   0       24       -3.227521  RND efflux transporter
mll5780   1       33       -6.242601  RND efflux membrane fusion protein
msr5782   1       43       -7.994236  hypothetical protein
msl5783   1       42       -6.723838  hypothetical protein
msl5784   0       39       -6.214566  hypothetical protein
mlr5785   0       34       -4.556953  Fis family transcriptional regulator
mlr5786   -1      33       -4.881767  aminotransferase
mlr5787   -2      26       -1.514663  hypothetical protein
mll5788   -1      13       0.124557   phosphomethylpyrimidine kinase
mll5789   -1      13       -0.223324  thiamine-phosphate pyrophosphorylase
mll5790   -1      13       -0.434375  thiazole synthase
msl5792   -1      14       -1.128672  sulfur carrier protein ThiS
mll5793   0       16       -1.541440  thiamine biosynthesis oxidoreductase THIO
mll5795   -1      15       -2.733959  thiamine biosynthesis protein ThiC
mll5796   0       29       -5.044434  hypothetical protein
mll5797   1       30       -5.368492  6-pyruvoyl tetrahydrobiopterin synthase
mll5798   0       31       -5.409263  succinoglycan biosynthesis regulator (exsB)
mll5799   1       33       -5.887412  hypothetical protein
mll5800   2       32       -5.958103  hypothetical protein
mlr5801   2       32       -6.025312  NoeK, phosphomannomutase (PMM)
mlr5802   2       30       -5.853739  NoeJ; phosphomannose isomerase/GDP-mannose
mll5803   2       32       -6.484065  hypothetical protein
mll5804   1       21       -4.484293  hypothetical protein
mlr5805   1       20       -4.210308  cytochrome C peroxidase
msr5806   1       19       -3.459088  hypothetical protein
mlr5807   1       21       -4.110180  transcriptional regulator
mll5809   0       18       -3.795307  succinate-semialdehyde dehydrogenase
mll5810   0       16       -2.968464  molecular chaperone GroEL
msl5812   0       26       -4.464048  co-chaperonin GroES
mll5813   1       29       -4.761334  transposase
mll5814   2       30       -4.818441  N-hydroxyarylamine O-acetyltransferase
mll5815   3       28       -3.882801  coproporphyrinogen III oxidase
mll5816   2       25       -2.872162  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
mlr5738   HTHFIS  69     1e-16    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 68.7 bits (168), Expect = 1e-16
Identities = 31/126 (24%), Positives = 58/126 (46%), Gaps = 13/126 (10%)

Query: 9 TIVMIEDDEGHARLIEKNIRRAGVNNDVVAFTNGSSALAYLLGPDGSGDASVGRHLLVLL 68
TI++ +DD ++ + + RAG DV +N ++ ++ G GD LV+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGY--DVRITSNAATLWRWIA--AGDGD-------LVVT 53

Query: 69 DLNLPDMTGLDILQQIKANQHLKRIPVVVLTTTDDSREIQRCYDLGANVYITKPVNYEGF 128
D+ +PD D+L +IK +PV+V++ + + + GA Y+ KP +
Sbjct: 54 DVVMPDENAFDLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTEL 111

Query: 129 ANAIRQ 134
I +
Sbjct: 112 IGIIGR 117


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
mlr5739   HTHFIS  88     2e-21    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 88.3 bits (219), Expect = 2e-21
Identities = 32/164 (19%), Positives = 65/164 (39%), Gaps = 2/164 (1%)

Query: 1 MPETRVLYIDDDDALARLVQKKLGRLGFVVEHASSPEQALTRLEEGGFDVLALDHYLGAG 60
M +L DDD A+ ++ + L R G+ V S+ + G D++ D +
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 TGLEFLARLATRGAAPPAVYVTGSSEMSVAVAALKAGASDFVPKTIG-DDFIALLASALD 119
+ L R+ P + ++ + A+ A + GA D++PK + I ++ AL
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 120 QAVAKARLVAEKEAAEAQVRAARDRAELLLAEVNHRVANSLAMV 163
+ + E ++ + R A + V R+ + +
Sbjct: 121 EPKRRPS-KLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTL 163


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr5745   PHPHTRNFRASE  30     0.012    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 30.1 bits (68), Expect = 0.012
Identities = 17/64 (26%), Positives = 26/64 (40%), Gaps = 15/64 (23%)

Query: 250 SRPFEMNPFLRFIGQRLAQLKPHL---QRRVEMRVALHKHPGVAGIVEAATESLKMIPGI 306
P E+NPFL F RL K + Q R +R + + + V M P I
Sbjct: 347 QLPKELNPFLGFRAIRLCLEKQDIFRTQLRALLRASTYGNLKV------------MFPMI 394

Query: 307 SLVD 310
+ ++
Sbjct: 395 ATLE 398


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr5751   BCTLIPOCALIN  92     5e-26    Bacterial lipocalin signature.
		>BCTLIPOCALIN#Bacterial lipocalin signature.

Length = 171

Score = 92.0 bits (228), Expect = 5e-26
Identities = 50/153 (32%), Positives = 80/153 (52%), Gaps = 11/153 (7%)

Query: 18 AVTAITSLNLSQYLGKWYELCRLPIKYEDETATDITANYSLNDSVTVRVDNRCFD-KHGK 76
+V ++ L+ YLGKWYE+ RL +E + +TA Y + + + V NR + + G+
Sbjct: 22 SVKPVSDFELNNYLGKWYEVARLDHSFE-RGLSQVTAEYRVRNDGGISVLNRGYSEEKGE 80

Query: 77 PSRAIGEAT-PADDARSRLKVTFLPKYIRWIPFTSGDYWVLKLDPE-YKVSLVGSPDRQY 134
A G+A LKV+F PF G Y V +LD E Y + V P+ +Y
Sbjct: 81 WKEAEGKAYFVNGSTDGYLKVSFFG------PFY-GSYVVFELDRENYSYAFVSGPNTEY 133

Query: 135 LWLLARSPDLAQDTRERYLAEAKRQGFDLTNLI 167
LWLL+R+P + + ++++ +K +GFD LI
Sbjct: 134 LWLLSRTPTVERGILDKFIEMSKERGFDTNRLI 166


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll5766   DHBDHDRGNASE  113    2e-32    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 113 bits (284), Expect = 2e-32
Identities = 77/255 (30%), Positives = 119/255 (46%), Gaps = 16/255 (6%)

Query: 3 LKGKAVVISGAASPRGIGRSTATLMAEQGARIAILDLDEEQARDAAASLGPEHI---GLA 59
++GK I+GAA +GIG + A +A QGA IA +D + E+ +SL E
Sbjct: 6 IEGKIAFITGAA--QGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP 63

Query: 60 CNVADLDSCKRAAAEVTEAFGRVEVLCNIAGITQPVKTLDIGPADWDRILDVNLRGVLYL 119
+V D + A + G +++L N+AG+ +P + +W+ VN GV
Sbjct: 64 ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNA 123

Query: 120 SQAFIPHMRENGGGSIICMSSVSAQRGGGIFGGPHYSAAKAGVLGLAKAMAREFGPDGIR 179
S++ +M + GSI+ + S A G Y+++KA + K + E IR
Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPA--GVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIR 181

Query: 180 INCVTPGLIQTDITGGKLTDAM-RADIIK--------GIPLSRLGDARDVAGAYLFLASD 230
N V+PG +TD+ D +IK GIPL +L D+A A LFL S
Sbjct: 182 CNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSG 241

Query: 231 LASYITGAVIDVNGG 245
A +IT + V+GG
Sbjct: 242 QAGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll5770   PREPILNPTASE  31     0.010    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 30.9 bits (70), Expect = 0.010
Identities = 23/81 (28%), Positives = 34/81 (41%), Gaps = 4/81 (4%)

Query: 282 ELLGPGDVRIAISSGTILGLAGAPAGPTGLIAPLVGAGLGAGWRLSGEGFADRFS--SPA 339
E +G GD ++ + G LG P L++ LVGA +G G L + P
Sbjct: 209 EGMGYGDFKLLAALGAWLGWQALPI--VLLLSSLVGAFMGIGLILLRNHHQSKPIPFGPY 266

Query: 340 AAARAGIGFVSGDRATKGILS 360
A I + GD T+ L+
Sbjct: 267 LAIAGWIALLWGDSITRWYLT 287


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
mll5778   HTHFIS  76     3e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 76.0 bits (187), Expect = 3e-18
Identities = 33/140 (23%), Positives = 58/140 (41%)

Query: 2 IKVLVVEDDADTADEIVDEFCAAGFEVERAATGPDGLTKAKTQAFDVITLDRLLPGLDGL 61
+LV +DDA + AG++V + D++ D ++P +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 SLLENLRGGGVDTPVMVLSALSSVDERIRGLRAGGDDYLVKPFSLAELRTRIEVLARRTP 121
LL ++ D PV+V+SA ++ I+ G DYL KPF L EL I
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 122 DQRATVLRLADLELDLLTRT 141
+ + + + + L+ R+
Sbjct: 124 RRPSKLEDDSQDGMPLVGRS 143


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll5779   ACRIFLAVINRP  745    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 745 bits (1924), Expect = 0.0
Identities = 295/1031 (28%), Positives = 498/1031 (48%), Gaps = 29/1031 (2%)

Query: 3 VSNIFVQRPIATGLLTLGIVLVGMVAYMLLPISSLPQVDFPTIAVESTLPGAKAETMAST 62
++N F++RPI +L + +++ G +A + LP++ P + P ++V + PGA A+T+ T
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 63 VASPLEQQFSTIPGLAQMTSTST-LGKTDMTLQFDLSRNIDSAAQDVQAAINAASGSLPK 121
V +EQ + I L M+STS G +TL F + D A VQ + A+ LP+
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 122 AMQSPP-TYHKVNPALFTVISIVLQSDTIPLPLVMDAASNMVAQTLSQIPGAGFVDMPGS 180
+Q + K + + V V + + D ++ V TLS++ G G V + G+
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 181 SKSAIRIQLDPMKLASLGMSLEDVRSALVVGTTNGPKGTL------EGAQRSVTLDANNQ 234
A+RI LD L ++ DV + L V G L G Q + ++ A +
Sbjct: 181 QY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 235 LLTSADASAAII-AYRNGSPIRISDIGRAIDSVENTTLGAWYNNRKAVLIDVHLQSGANA 293
+ + +GS +R+ D+ R EN + A N + A + + L +GANA
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 294 VDVVNGVKAALPGLRLRLPPSVQIMTAGDSTVAVRAAVADVQFTLMITIGLVVLTIFLFL 353
+D +KA L L+ P ++++ D+T V+ ++ +V TL I LV L ++LFL
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 354 KSMSATIIPAVTIPVSLIGTFGVMYLLGYSLDNISLMGLTIAVGFVVDDAIVVIENIVRH 413
++M AT+IP + +PV L+GTF ++ GYS++ +++ G+ +A+G +VDDAIVV+EN+ R
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 414 I-EGGESPLEATLKGASEIGFTVVSMTASLIAVFIPLLLMSSMVGRLFREFSVTIAVALI 472
+ E P EAT K S+I +V + L AVFIP+ G ++R+FS+TI A+
Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479

Query: 473 ISALVSLTLTPMMCALMIKGH--GHEAKPNRLSSLLERGFDFIQRGYSRSLRVVVGHPRL 530
+S LV+L LTP +CA ++K H FD Y+ S+ ++G
Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 531 VLLCFFATLAVTAELFLAIPKGFFPQQDLGLISGSTQAAQDISFQAMAAKQQAVVDLILK 590
LL + +A LFL +P F P++D G+ Q + + V D LK
Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 591 DPAVD----TVRSTLGGSGRM-NSGNLQIVLKPLSERSV---SADQVIARLRKETAGVSG 642
+ + + SG+ N+G + LKP ER+ SA+ VI R + E +
Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659

Query: 643 VSVGMQAVQGINVGARGSQTQFQYTLQD---PNLPELYRWADTMTAELRKLP-QVRDVAN 698
V + G+ T F + L D L + + + + P + V
Sbjct: 660 GFV--IPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRP 717

Query: 699 DLQAVAPHASIAVDRDTASRLGITPQAIDDTLYDAFGQRQVTTIFTQADQHKIVMEIDPK 758
+ + VD++ A LG++ I+ T+ A G V + K+ ++ D K
Sbjct: 718 NGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAK 777

Query: 759 YRVDAGALDAIYVGSNSGKQVPLSAFAKVSSSVAPLTINHQGLFPSVTLSFNLAPNVALG 818
+R+ +D +YV S +G+ VP SAF + PS+ + AP + G
Sbjct: 778 FRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSG 837

Query: 819 DAVTAVEAMQARIRMSATVQTGFQGTAQAFQASLSTQPMLIVAALIAVYIVLGMLYESVI 878
DA+ +E + ++ + A + + G + + S + P L+ + + V++ L LYES
Sbjct: 838 DAMALMENLASK--LPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWS 895

Query: 879 HPITILSTLPSAGVGALATLMLVGGQLDVMGLVGIILLIGIVKKNAIMMIDFALSAERDR 938
P++++ +P VG L L + DV +VG++ IG+ KNAI++++FA
Sbjct: 896 IPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKE 955

Query: 939 GLPPKEAIIQACILRFRPITMTTLCALLGALPLALGTGVGSELRRPLGIAIVGGLCVSQL 998
G EA + A +R RPI MT+L +LG LPLA+ G GS + +GI ++GG+ + L
Sbjct: 956 GKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATL 1015

Query: 999 LTLYTTPVIYL 1009
L ++ PV ++
Sbjct: 1016 LAIFFVPVFFV 1026



Score = 86.0 bits (213), Expect = 4e-19
Identities = 78/503 (15%), Positives = 167/503 (33%), Gaps = 25/503 (4%)

Query: 5 NIFVQRPIATGLLTLGIVLVGMVAYMLLPISSLPQVDFPTIAVESTLP-GAKAETMASTV 63
+ L+ IV +V ++ LP S LP+ D LP GA E +
Sbjct: 531 GKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVL 590

Query: 64 A----------SPLEQQFSTIPGLAQMTSTSTLGKTDMTLQ-FDLSRNIDSAAQDVQAAI 112
+ T+ G + G ++L+ ++ +++A+ V
Sbjct: 591 DQVTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRA 650

Query: 113 NAASGSLPKAMQSPPTYHKVN-PALFTVISIVLQSDTIPLPLVMDAASNMVAQTLSQIPG 171
G + P + T L + A N + +Q P
Sbjct: 651 KMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPA 710

Query: 172 AGFVDMPGSSKSA--IRIQLDPMKLASLGMSLEDVRSALVVGTTNGPKGTL--EGAQRSV 227
+ P + ++++D K +LG+SL D+ + G + +
Sbjct: 711 SLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKL 770

Query: 228 TLDANNQLLTSADASAAI-IAYRNGSPIRISDIGRAIDSVENTTLGAWYNNRKAVLIDVH 286
+ A+ + + + + NG + S + + L R L +
Sbjct: 771 YVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRL-----ERYNGLPSME 825

Query: 287 LQSGANAVDVVNGVKAALPGLRLRLPPSVQIMTAGDSTVAVRAAVADVQFTLMITIGLVV 346
+Q A A + L +LP + G S + + I+ +V
Sbjct: 826 IQGEAAPGTSSGDAMALMENLASKLPAGIGYDWTGMSYQERLSG-NQAPALVAISFVVVF 884

Query: 347 LTIFLFLKSMSATIIPAVTIPVSLIGTFGVMYLLGYSLDNISLMGLTIAVGFVVDDAIVV 406
L + +S S + + +P+ ++G L D ++GL +G +AI++
Sbjct: 885 LCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILI 944

Query: 407 IENIV-RHIEGGESPLEATLKGASEIGFTVVSMTASLIAVFIPLLLMSSMVGRLFREFSV 465
+E + G+ +EATL ++ + + I +PL + + +
Sbjct: 945 VEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGI 1004

Query: 466 TIAVALIISALVSLTLTPMMCAL 488
+ ++ + L+++ P+ +
Sbjct: 1005 GVMGGMVSATLLAIFFVPVFFVV 1027


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mll5780  RTXTOXIND  48  7e-08  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 47.5 bits (113), Expect = 7e-08
Identities = 27/89 (30%), Positives = 38/89 (42%), Gaps = 8/89 (8%)

Query: 75 AQLGNVPIWVTGLGTVQPF-NSVTVKPRANGQINDIVFTEGQMVHAHDVLARLDPKPFLG 133
+ LG V I T G + S +KP N + +I+ EG+ V DVL +L
Sbjct: 75 SVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTAL---- 130

Query: 134 PLHQAEATLAKDQAQLTNAQADLQRVSSL 162
AEA K Q+ L A+ + R L
Sbjct: 131 ---GAEADTLKTQSSLLQARLEQTRYQIL 156



Score = 32.9 bits (75), Expect = 0.002
Identities = 36/195 (18%), Positives = 79/195 (40%), Gaps = 24/195 (12%)

Query: 135 LHQAEATLAKDQAQLTNAQADLQRVSSLAQKGFATGQVLDTQKAQIAQTEATILADKAAI 194
L ++ L + ++++ +A+ + Q V+ L + ++LD ++ QT I +
Sbjct: 268 LRVYKSQLEQIESEILSAKEEYQLVTQLFKN-----EILD----KLRQTTDNIGLLTLEL 318

Query: 195 ENAQTQIDYTTITSPIDG-VAGIRMIDAGNMITSTDLGIVTINQLEPISVVFTVPADAVA 253
+ + + I +P+ V +++ G ++T+ + +V + + + + V V +
Sbjct: 319 AKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIG 378

Query: 254 DWPVGTPASAISVTALAANDSRMLD-TGTLSLV--DNHIDPAT-----ATIKLK---ATF 302
VG A I V A +R G + + D D I ++ +
Sbjct: 379 FINVGQNAI-IKVEAF--PYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEENCLST 435

Query: 303 PNKDHQLKPGQFVSA 317
NK+ L G V+A
Sbjct: 436 GNKNIPLSSGMAVTA 450


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mlr5801  DHBDHDRGNASE  30  0.025  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 29.6 bits (66), Expect = 0.025
Identities = 31/115 (26%), Positives = 49/115 (42%), Gaps = 21/115 (18%)

Query: 190 QVLRSLGASVVAVGKTATFVPVDTEAVDAATIAKLKKWVREFGLDA--------IVS--- 238
++ S+V VG VP + A A++ A + + GL+ IVS
Sbjct: 130 YMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGS 189

Query: 239 TDADADRPLIADENGD--LLRGDLVGLATALFLK--------ADTIVTPVTSNSG 283
T+ D L ADENG +++G L T + LK AD ++ V+ +G
Sbjct: 190 TETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAG 244


43  mll5835  mlr5888  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
mll5835  -1  20  -3.646154  quinolinate synthetase
mll5836  1  24  -3.493213  hypothetical protein
mll5837  1  23  -3.770189  Fis family transcriptional regulator
mll5838  2  25  -3.514772  hypothetical protein
mll5839  2  27  -3.896485  hypothetical protein
mll5840  2  21  -2.757396  C4-dicarboxylate transporter DctA
mlr5841  1  24  -3.220110  dicarboxylate sensor protein
mlr5842  1  30  -5.339637  C4-dicarboxylate transport system regulatory
mlr5843  2  34  -6.573801  hypothetical protein
mll5844  3  34  -6.412858  hypothetical protein
mlr5845  4  33  -6.344549  hypothetical protein
mlr5848  4  41  -8.130798  nodulation protein NodZ
mlr5849  4  38  -7.781283  GDP-mannose 4,6-dehydratase
mlr8749  1  24  -4.777625  GDP-L-fucose synthetase(nodulation protein
mll5853  -1  19  -4.050500  hypothetical protein
msl5852  -1  17  -4.160368  hypothetical protein
mll5854  -1  14  -3.796067  hypothetical protein
msl8750  0  15  -3.559573  (4Fe-4S) ferredoxin
mll5855  -1  13  -3.068800  nitrogen fixation protein nifB
mll5857  0  14  -3.292829  nif-specific regulatory protein,nifA
msl5859  0  14  -2.603786  ferredoxin like protein, fixX
mll5860  0  14  -2.713199  nitrogen fixation protein,fixC
mll5861  -1  15  -2.164072  nitrogen fixation protein,fixB
mll5862  0  16  -2.154292  nitrogen fixation protein,fixA
mll5864  -1  16  -2.025899  nitrogenase stabilizing/protective protein
mll5865  0  15  -2.164144  nitrogenase cofactor synthesis protein nifS
msr8678  0  18  -2.981799  hypothetical protein
mlr5867  0  15  -3.047722  ABC transporter ATP-binding protein exsA
msr5868  2  20  -3.955641  hypothetical protein
mlr5869  1  20  -3.861292  (4Fe-4S) ferredoxin
mlr5871  2  22  -4.388908  nitrogen fixation protein nifQ
mll5872  1  23  -4.370352  RNA polymerase factor sigma-54
mll5873  -2  21  -2.453500  peroxiredoxin 2 family protein
mlr5875  -1  24  -2.172043  hypothetical protein
mlr5876  -2  20  -2.741663  cytochrome P450
msr5877  0  21  -2.345616  hypothetical protein
mlr5878  -1  19  -2.241979  gamma-BHC dehydrochlorinase
mlr5879  0  20  -2.256404  short chain dehydrogenase
mlr5880  0  22  -3.475572  hypothetical protein
mlr5881  0  21  -4.654595  hypothetical protein
mlr5882  1  29  -5.687582  phosphoenolpyruvate mutase
mlr5883  1  32  -6.022975  aspartate transaminase
mlr5884  2  36  -6.826802  hemolysin erythrocyte lysis protein 2
mlr5886  1  37  -7.157888  hypothetical protein
mlr5887  1  31  -6.369010  hypothetical protein
mlr5888  -1  30  -4.754121  asparagine synthetase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mll5837  HTHFIS  444  e-153  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 444 bits (1143), Expect = e-153
Identities = 133/384 (34%), Positives = 201/384 (52%), Gaps = 16/384 (4%)

Query: 222 MAEKIRMQKQFGELKQPGHEGKRAHVKGIIGDSPALRGLLEKVALVARSNSTVLLRGESG 281
+ I + + E ++G S A++ + +A + +++ T+++ GESG
Sbjct: 111 LIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESG 170

Query: 282 TGKELVAKAVHEMSGRAKRPFIKLNCAALPETVLESELFGHEKGAFTSAFNSRKGRFELA 341
TGKELVA+A+H+ R PF+ +N AA+P ++ESELFGHEKGAFT A GRFE A
Sbjct: 171 TGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQA 230

Query: 342 DKGTLFLDEIGEISASFQAKLLRVLQEQEFERVGSNQTIKVDVRVIAATNRNLEDAVARN 401
+ GTLFLDEIG++ Q +LLRVLQ+ E+ VG I+ DVR++AATN++L+ ++ +
Sbjct: 231 EGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQG 290

Query: 402 EFRADLYYRISVVPLLVPPLRERRGDIPLLAAEFLKNFNSENGRTMVFDASATEVLMNCA 461
FR DLYYR++VVPL +PPLR+R DIP L F++ E FD A E++
Sbjct: 291 LFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHP 350

Query: 462 FPGNVRELENCVQRTATLAAGASIGRNDFDCCHGRCLSAMLWKHASKETAPKSEPIAPSP 521
+PGNVRELEN V+R L I R + + +
Sbjct: 351 WPGNVRELENLVRRLTALYPQDVITREIIENELRSEI-------PDSPIEKAAARSGSLS 403

Query: 522 LNPAMQSSGAFASAAIGVPHDDSEQALPAPVRLGLVSDAKMTDRERLIAAMERSGWVQAK 581
++ A++ + A+ G ALP V + ++AA+ + Q K
Sbjct: 404 ISQAVEENMRQYFASFG-------DALPPSGLYDRVLAE--MEYPLILAALTATRGNQIK 454

Query: 582 AARLLGLTPRQIGYALRKYGIEIK 605
AA LLGL + +R+ G+ +
Sbjct: 455 AADLLGLNRNTLRKKIRELGVSVY 478


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mlr5842  HTHFIS  426  e-149  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 426 bits (1098), Expect = e-149
Identities = 160/492 (32%), Positives = 241/492 (48%), Gaps = 47/492 (9%)

Query: 1 MSAESGPVIFIDDDEDVLRAATQMLKLASFSPSVFGSAEAALARIDGNFDGPVVSDIRMP 60
M+ ++ DDD + Q L A + + +A I VV+D+ MP
Sbjct: 1 MTG--ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMP 58

Query: 61 GLNGLQLFERVKAIDPEIPVVLITGHADVELAVAAIQDGAYDFISKPYANDRLLVTLHRA 120
N L R+K P++PV++++ A+ A + GAYD++ KP+ L+ + RA
Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 121 SEKRRLVLENRRLRDAVVRSAGDIPLIGEAPTMIRLRETLRQIADTDVDVLVEGETGTGK 180
+ + RR S +PL+G + M + L ++ TD+ +++ GE+GTGK
Sbjct: 119 LAEPK-----RRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGK 173

Query: 181 EVVADLLHRWSRRRTKPFVALNCGALPESVIDSELFGHEAGAFTGAQRRRTGRIEHSNGG 240
E+VA LH + +RR PFVA+N A+P +I+SELFGHE GAFTGAQ R TGR E + GG
Sbjct: 174 ELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGG 233

Query: 241 TLFLDEIESMPPALQVKLLRVLETRQITPLGTNETRSIDLRVVSATKADLGDPAARGDFR 300
TLFLDEI MP Q +LLRVL+ + T +G D+R+V+AT DL +G FR
Sbjct: 234 TLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFR 293

Query: 301 EDLYFRLNVVTLRIPPLRERREDIPMLFGHFLERASKRFSRPVPDISAGVRDRLVSHNWP 360
EDLY+RLNVV LR+PPLR+R EDIP L HF+++A + V + + +H WP
Sbjct: 294 EDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAE-KEGLDVKRFDQEALELMKAHPWP 352

Query: 361 GNVRELVHFADRVAL-------GLERLGTPIASGRKEQAVSSLH---------------- 397
GNVREL + R+ E + + S + +
Sbjct: 353 GNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENM 412

Query: 398 ----------------LSEKVSLYEATIIRDALQECGGDVRRTIEVLGVPRKTFYDKLKR 441
++ E +I AL G+ + ++LG+ R T K++
Sbjct: 413 RQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRE 472

Query: 442 HGIDTADYRKTA 453
G+ ++A
Sbjct: 473 LGVSVYRSSRSA 484


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mlr5849  NUCEPIMERASE  102  1e-26  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 102 bits (255), Expect = 1e-26
Identities = 77/339 (22%), Positives = 127/339 (37%), Gaps = 22/339 (6%)

Query: 8 LITGVTGQDGSYLAELLLEKGYSVHGIKRRSSLFNTGRIDHLYHDPHESGVDLTLHHGDL 67
L+TG G G ++++ LLE G+ V GI + ++ + G H DL
Sbjct: 4 LVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGF--QFHKIDL 61

Query: 68 TDSSSLTRVIQLVQPDEIYNLAAQSHVAVSFEEPEYTANSDALGALRILEAIRILGLVKH 127
D +T + + ++ + V S E P A+S+ G L ILE R ++H
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNK-IQH 120

Query: 128 TRYYQASTSELYGLVRETPQTETTPF-YPRSPYAVAKLYAHWITVNYRESYNLYACNGIL 186
Y AS+S +YGL R+ P + +P S YA K + Y Y L A
Sbjct: 121 LLY--ASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRF 178

Query: 187 FNHESPARGETFVTRKITRALTRIKLGMQRTLFLGNLNARRDWGHARDYVQMQWLMLQQQ 246
F P K T+A + G ++ +RD+ + D + +
Sbjct: 179 FTVYGPWGRPDMALFKFTKA---MLEGKSIDVY-NYGKMKRDFTYIDDIAEAIIRLQD-- 232

Query: 247 QPEDFVIASGEQHSVREFVTVAAAELGIDVRWVGEGVDE---EGYDAKTGALIVKVDPRY 303
VI + E T AA+ V +G + A AL ++
Sbjct: 233 -----VIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNM 287

Query: 304 --FRPAEVETLLGDARKAKEKLGWEPKISFIELVREMVR 340
+P +V D + E +G+ P+ + + V+ V
Sbjct: 288 LPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVN 326


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mlr8749  NUCEPIMERASE  91  3e-23  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 91.4 bits (227), Expect = 3e-23
Identities = 64/348 (18%), Positives = 128/348 (36%), Gaps = 51/348 (14%)

Query: 8 KVYVAGHRGLVGSATMRALEALGSYEI----ITRTHD-------------------ELDL 44
K V G G +G + L G + + +D ++DL
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 45 FDRSETRRFFMSQRPDYVVMCAAKVGGILANASSPVDFLHNNLAIQVSVFDAAYASGVER 104
DR F S + V + + + + +P + +NL +++ + + ++
Sbjct: 62 ADREGMTDLFASGHFERVFISPHR-LAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 105 MIFLGSSCIYPRDCPQPIREEYLLTGPLEATNRPYALAKIAGVESCWSFNRQYKARYLAL 164
+++ SS +Y + P + + P+ YA K A +++ Y L
Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVS----LYAATKKANELMAHTYSHLYGLPATGL 176

Query: 165 MPTNLYGP-GDNYHPENCHVLPALIRRFHQAKMNGDSSVGVWGSGNPRREFMYSSDVGDA 223
+YGP G P+ + +F +A + G S+ V+ G +R+F Y D+ +A
Sbjct: 177 RFFTVYGPWGR---PD------MALFKFTKAMLEGK-SIDVYNYGKMKRDFTYIDDIAEA 226

Query: 224 IAFLLGLPDSDFDALTAPDTAP--------LINVGVGEDVTIREVAELVKAAVGWEGNLV 275
I L + T P + N+G V + + + ++ A+G E
Sbjct: 227 IIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKN 286

Query: 276 FDTTKPDGTPRKLLDVTRLRN-LGWKAKTSLGAGLQAT---YEDFLRL 319
+P D L +G+ +T++ G++ Y DF ++
Sbjct: 287 MLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDFYKV 334


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mll5857  HTHFIS  440  e-152  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 440 bits (1133), Expect = e-152
Identities = 141/389 (36%), Positives = 201/389 (51%), Gaps = 18/389 (4%)

Query: 197 LIEKQPRPEQSLDEDRTHPARHPH--VKIDGIIGESPALKQVLEIVSVVARTNSTVLLRG 254
L E ++L E + P++ ++G S A++++ +++ + +T+ T+++ G
Sbjct: 108 LTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITG 167

Query: 255 ESGTGKEFFAQAIHKLSHRREKSFVKLNCAALPESVLESELFGHEKGAFTGAILQRAGRF 314
ESGTGKE A+A+H RR FV +N AA+P ++ESELFGHEKGAFTGA + GRF
Sbjct: 168 ESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRF 227

Query: 315 ELANGGTLLLDEIGEISPAFQAKLLRVLQEGELERVGGTRTLKVDVRLICATNKDLEMAV 374
E A GGTL LDEIG++ Q +LLRVLQ+GE VGG ++ DVR++ ATNKDL+ ++
Sbjct: 228 EQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSI 287

Query: 375 RNGEFRADLYYRINVVPIVLPPLRERPGDIPRLAKALLDRFNKENHRDLAFTPSALDLIS 434
G FR DLYYR+NVVP+ LPPLR+R DIP L + + + KE F AL+L+
Sbjct: 288 NQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMK 347

Query: 435 QCYFPGNVRELENCVRRTATLARSMTITPSDFACQNSQCLSSLLWKGVGRSHGAYAVDEF 494
+PGNVRELEN VRR L IT + + + G+
Sbjct: 348 AHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGS------ 401

Query: 495 ARGNMMPVGSPLPAMRVGPSQNEASPGETCDPNHPACPAINQRLTERDRLIDAMEKAGWV 554
S A+ Q AS G+ P E ++ A+
Sbjct: 402 --------LSISQAVEENMRQYFASFGDALPP--SGLYDRVLAEMEYPLILAALTATRGN 451

Query: 555 QAKAARFLGLTPRQVGYALRRHHIEVKKF 583
Q KAA LGL + +R + V +
Sbjct: 452 QIKAADLLGLNRNTLRKKIRELGVSVYRS 480


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mlr5879  DHBDHDRGNASE  112  7e-31  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 112 bits (282), Expect = 7e-31
Identities = 78/250 (31%), Positives = 111/250 (44%), Gaps = 13/250 (5%)

Query: 5 GRVVIITGAAGGIGRALVEIVAADGDIVVAVDLPGSGVLELAGGL---GHPHLGLECDVS 61
G++ ITGAA GIG A+ +A+ G + AVD + ++ L DV
Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVR 67

Query: 62 REEDIVALYGRIEAQFAKIDVLVNNAAIGPAMAATIDTGFEAFRRVLATNLIGPFIMAGE 121
I + RIE + ID+LVN A + E + + N G F +
Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGV-LRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126

Query: 122 AARRM--QPGAAIVNVASLAGVLGNPKRNAYASSKAGLIALTRSLACEWASRGIRVTAVA 179
++ M + +IV V S + AYASSKA + T+ L E A IR V+
Sbjct: 127 VSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186

Query: 180 PGYVRTPMVAEL-------ERAGKMDLAAVRRRVPMGRMARPDEIARAVRFLASAQTGYI 232
PG T M L E+ K L + +P+ ++A+P +IA AV FL S Q G+I
Sbjct: 187 PGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHI 246

Query: 233 TGSVLTVDGG 242
T L VDGG
Sbjct: 247 TMHNLCVDGG 256



Score = 43.5 bits (102), Expect = 6e-07
Identities = 32/127 (25%), Positives = 50/127 (39%), Gaps = 3/127 (2%)

Query: 274 RTVVVTGGANGIGAAVVRRFAANSDTVVIADKDGAGAAELADLLG--GRHV-AKSVDLAV 330
+ +TG A GIG AV R A+ + D + ++ L RH A D+
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 331 ESDVVALFEEIRGRFGRIEVLVNCAAIADTFVRGIEIPQQIERVLDVNLTGTFTCARGDQ 390
+ + + I G I++LVN A + + ++ E VN TG F +R
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 391 VDGRRRR 397
RR
Sbjct: 129 KYMMDRR 135


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mlr5880  DHBDHDRGNASE  68  4e-16  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 67.8 bits (165), Expect = 4e-16
Identities = 41/134 (30%), Positives = 58/134 (43%), Gaps = 9/134 (6%)

Query: 33 REAIKSMDA--GGVILNLGSINSFLPFVPRHAYGASTAGMNILTRCMAAELGSVGIRTAT 90
R K M G I+ +GS + +P AY +S A + T+C+ EL IR
Sbjct: 125 RSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184

Query: 91 VALGYIRTPDIAQL-----VESGCI--DSVAIKRRIPMGRMGEPEDVAEAVFFLASPDAS 143
V+ G T L I K IP+ ++ +P D+A+AV FL S A
Sbjct: 185 VSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAG 244

Query: 144 YVNGSTLYVDGGLT 157
++ L VDGG T
Sbjct: 245 HITMHNLCVDGGAT 258


44  mll5898  mlr6035  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
mll5898  2  28  0.259791  hypothetical protein
mll5900  1  21  -0.491290  hypothetical protein
mll5901  1  17  -1.806147  hypothetical protein
msr5902  2  15  -3.003912  hypothetical protein
mlr5903  1  13  -2.727496  plasmid stabilization protein
mll8741  1  14  -2.730046  hypothetical protein
mlr5905  0  12  -3.025961  nitrogenase reductase
mlr5906  -1  13  -2.920957  nitrogenase molybdenum-iron protein alpha chain,
mlr5907  -1  13  -2.428888  nitrogenase molybdenum-iron protein beta chain,
mlr5908  -1  19  -2.167733  nitrogenase molybdenum-cofactor biosynthesis
mlr5909  0  20  -2.190936  nitrogenase molybdenum-cofactor biosynthesis
mlr5911  0  22  -2.911886  nitrogenase molybdenum-iron protein nifX
mlr5912  2  21  -2.600034  hypothetical protein
mlr5913  2  22  -2.990316  phosphate regulatory protein
mlr5914  1  26  -3.996898  hypothetical protein
mlr5915  1  23  -4.122907  hypothetical protein
mll5916  2  26  -4.814724  orotate phosphoribosyltransferase
mll5917  1  24  -5.388927  acetoacetate decarboxylase
msl5919  1  26  -4.796023  hypothetical protein
msl5920  1  27  -4.940540  hypothetical protein
mll5922  0  25  -4.559714  GDP-D-mannose dehydratase(nodulation protein
mlr5923  0  24  -4.314470  lysine:N6-hydroxylase
mlr5924  0  27  -4.548890  hypothetical protein
mlr5926  0  28  -3.200907  tRNA synthetase
msr5928  -1  31  -4.654255  hypothetical protein
mlr5930  -1  31  -4.416327  (4Fe-4S) ferredoxin
mlr5932  0  31  -4.763493  1-aminocyclopropane-1-carboxylate deaminase
mlr5935  0  30  -4.913123  hypothetical protein
mll5936  2  31  -6.537182  hypothetical protein
mll8751  3  34  -7.382183  hypothetical protein
msl8680  3  33  -6.484782  hypothetical protein
mll5941  1  30  -6.583859  nitrogen fixation protein nifU
mll5942  2  31  -6.464423  hypothetical protein
mlr5943  3  32  -7.050570  diaminobutyrate--2-oxoglutarate
mlr5944  2  28  -4.703730  hypothetical protein
mlr5945  2  25  -4.287030  hypothetical protein
mlr5946  3  23  -4.356723  transposase
msr5947  3  32  -7.316934  transposase
mll5948  2  36  -7.474991  transposase
mll5949  1  29  -4.843293  transposase
mll5951  2  25  -3.838160  hypothetical protein
mlr5952  2  26  -2.723205  transposase
mll8752  2  28  -2.955809  hypothetical protein
mlr5955  0  21  -0.072989  hypothetical protein
mll5956  -1  19  -0.161072  integrase/recombinase
mll5957  -1  25  -1.763832  integrase/recombinase
mll5958  1  29  -3.021625  integrase/recombinase
mll5960  0  27  -2.916126  transposase
mll5961  -1  27  -2.651755  transposase
mlr5962  1  28  -3.072480  hypothetical protein
mll5963  1  29  -2.720896  hypothetical protein
mll5964  0  26  -1.121400  transposase
mlr5965  1  25  0.747954  transposase
msl5966  1  32  -2.171742  hypothetical protein
mll5967  1  33  -2.716634  transposase
mlr5968  2  42  -6.508036  transposase
msr8681  1  41  -7.361956  hypothetical protein
msr5969  1  36  -6.116860  hypothetical protein
mll5970  1  34  -5.808705  peptidase
msl8683  1  29  -4.084709  hypothetical protein
mlr5971  1  27  -3.010144  L-lysine 2,3-aminomutase
msr5972  1  21  0.958563  transposase
mlr5973  2  21  1.484650  integrase/recombinase
msr5974  1  19  0.628414  integrase/recombinase
msr5976  -1  23  -0.923716  transposase
mlr5977  -1  24  -1.393102  transposase
msr5978  0  27  -1.049709  transposase
msr5979  1  27  -1.912174  transposase
mlr5980  1  28  -2.061080  transposase
mll5981  1  31  -2.814983  transposase
msl5982  3  25  -1.235226  hypothetical protein
mlr5983  2  24  -0.095614  transposase
mll5985  3  25  -0.180264  hypothetical protein
msr8684  2  23  0.957594  hypothetical protein
msr5986  2  23  1.058233  hypothetical protein
mlr5987  3  23  1.105661  aminotransferase
mlr5988  3  23  1.819323  hypothetical protein
mlr5989  2  22  1.500953  hypothetical protein
mlr5990  2  22  1.593960  hypothetical protein
msr5991  3  25  1.663591  hypothetical protein
msr5992  4  22  1.073773  transposase
mlr5993  1  23  -2.175173  transposase
mlr5994  2  29  -3.934645  transposase
msr5995  1  30  -4.905163  transposase
msl5996  0  21  -3.634662  transposase
msr5997  -1  20  -3.386548  transposase
mll5998  -1  19  -0.962565  hypothetical protein
mll6001  -1  15  0.925292  transposase
msl6002  -2  14  0.779232  hypothetical protein
mll6003  0  13  0.535490  adenosylmethionine-8-amino-7-oxononanoate
mll6005  -1  14  0.658322  dithiobiotin synthetase
mll6006  0  16  0.366387  8-amino-7-oxononanoate synthase
mll6007  2  17  -1.186459  biotin synthase
mll6008  3  23  -2.620363  citrate lyase beta-subunit
mll6011  3  25  -2.911199  3-methylcrotonyl-CoA carboxylase
mll6012  3  30  -3.630728  acetyl/propionyl CoA carboxylase subunit beta
mll6015  5  39  -5.455081  carnitine racemase
mll6017  3  38  -5.499584  acetyl-CoA synthetase
mll6018  2  44  -6.376246  hypothetical protein
mll6019  2  40  -5.754787  aminotransferase
msl6020  3  36  -5.333280  hypothetical protein
msl6021  3  37  -5.635508  hypothetical protein
msr6022  2  35  -5.143953  hypothetical protein
mlr6024  3  34  -5.439686  aminotransferase
mlr6025  2  27  -4.443418  hypothetical protein
mlr6027  0  26  -3.723687  hypothetical protein
mlr6028  0  24  -3.207711  hypothetical protein
mlr6030  0  20  -2.460419  transposase
mlr6031  0  20  -2.379610  transposase
mlr6032  1  21  -1.881268  transposase
mll6033  0  21  -2.438162  transposase
mlr6034  1  20  -2.706272  transposase
mlr6035  0  21  -3.369303  transposase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mlr5913  HTHFIS  42  1e-06  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 41.7 bits (98), Expect = 1e-06
Identities = 23/123 (18%), Positives = 40/123 (32%), Gaps = 2/123 (1%)

Query: 2 KPRVLICSQDAEFYLFLSHILEVDGFVSEPAGGAKEALAMADDRDFQAVVLDCGSTSLTG 61
+L+ DA L+ L G+ A D VV D
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 62 SAICARLKREPRTGGLPIIALIAPGAENQHLDLLKAGIDESFVRPVAPAKLLDCLRTRLA 121
+ R+K+ LP++ + A + + G + +P +L+ + LA
Sbjct: 63 FDLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 122 LPK 124
PK
Sbjct: 121 EPK 123


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mlr5932  PYOCINKILLER  29  0.024  Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 29.4 bits (65), Expect = 0.024
Identities = 33/152 (21%), Positives = 55/152 (36%), Gaps = 20/152 (13%)

Query: 137 RRSWEKALYEVKARGGRPYAIPA-GASVHPNGGLGYVGFAEEVRAQEEQLGFAFDYMVVC 195
R++ E+A + R YA+PA G+ V G G + A+ + + + A V+
Sbjct: 232 RKAEEQARQQAAIRAANTYAMPANGSVVATAAGRGLIQVAQGAASLAQAISDAIA--VLG 289

Query: 196 TVTGSTHAGMLVGFAK---------------DGRQRNVIGIDAS--ATPAKTKAQVLSIA 238
V S + M VGFA R +G+DA+ P ++ A
Sbjct: 290 RVLASAPSVMAVGFASLTYSSRTAEQWQDQTPDSVRYALGMDAAKLGLPPSVNLNAVAKA 349

Query: 239 RHTATLVELGSELAEDDVVLLEDYAHPRYGIP 270
T L + A + L + +P
Sbjct: 350 SGTVDLPMRLTNEARGNTTTLSVVSTDGVSVP 381


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mlr5946  HTHTETR  31  3e-04  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 31.1 bits (70), Expect = 3e-04
Identities = 13/60 (21%), Positives = 26/60 (43%), Gaps = 4/60 (6%)

Query: 8 DAQKAFILKQGADGIPVAEICRRAGISQAT-YFNWKKK---YDGLLPTEMKRLKQLEDEN 63
D +QG + EI + AG+++ Y+++K K + + + +LE E
Sbjct: 18 DVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEY 77


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mlr5965  TYPE3IMSPROT  29  0.033  Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 29.0 bits (65), Expect = 0.033
Identities = 14/69 (20%), Positives = 25/69 (36%), Gaps = 18/69 (26%)

Query: 177 QLANAIRGYAAEYGLIAARGMCKIEPLLERIAADKMLPDLARELFALHAKEYAQLQTQLK 236
+R A E G+ P+L+R LAR L+ A + +
Sbjct: 290 AQVQTVRKIAEEEGV----------PILQR-------IPLARALYW-DALVDHYIPAEQI 331

Query: 237 DVDAKLMAW 245
+ A+++ W
Sbjct: 332 EATAEVLRW 340


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mll5985  SACTRNSFRASE  45  2e-08  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 45.3 bits (107), Expect = 2e-08
Identities = 20/72 (27%), Positives = 33/72 (45%), Gaps = 5/72 (6%)

Query: 94 CLLGNAEDR--WG--TLVDNLHVLPTAKGRGVGRHLIRVAAGWSAENYPGVGLHLWVYEV 149
+G + R W L++++ V + +GVG L+ A W+ EN GL L ++
Sbjct: 75 NCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKEN-HFCGLMLETQDI 133

Query: 150 NAPARAFYERMG 161
N A FY +
Sbjct: 134 NISACHFYAKHH 145


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mlr5989  FLGMOTORFLIG  28  0.043  Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 28.2 bits (63), Expect = 0.043
Identities = 13/55 (23%), Positives = 23/55 (41%), Gaps = 5/55 (9%)

Query: 8 ALILEMLPDMPRQLALAARHIADHPDQVLVHSMRDLASRASVSPATLLRLTQTLK 62
ALIL L A+ ++ P +V + R +A SP + + + L+
Sbjct: 141 ALILSYLDP-----QKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVLE 190


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mlr6035  PF05043  29  0.029  Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 28.8 bits (64), Expect = 0.029
Identities = 9/40 (22%), Positives = 16/40 (40%)

Query: 4 RHVTDHQMRLFMKLRQEHTTEVAAAKASISRATAYRIKKN 43
+H T + F+ + E + IS ++ YRI
Sbjct: 84 KHSTHFSILEFIFFNEGCQAESICKEFYISSSSLYRIISQ 123


45  msl6054  mlr6115  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
msl6054  1  37  -4.809589  hypothetical protein
mll6055  1  37  -4.543111  hypothetical protein
mlr6056  1  40  -6.414028  hypothetical protein
msl6057  0  38  -5.989107  hypothetical protein
mll6059  1  30  -4.179383  hypothetical protein
msl6060  0  30  -3.822559  hypothetical protein
msr6061  -1  29  -3.061671  hypothetical protein
msr8685  0  26  -2.742834  hypothetical protein
mll6062  0  25  -2.646492  hypothetical protein
mlr6063  -1  25  -2.126573  transposase
mll6064  0  26  -2.351463  transposase
mll6066  0  31  -3.874198  transposase
mll6067  0  33  -3.499693  transposase
mll6068  1  34  -3.857457  transposase
mlr6069  2  32  -2.955809  transposase
mlr6070  3  35  -3.532053  beta-ketoacyl-ACP synthase
mll8753  6  37  -6.547602  hypothetical protein
msr6073  2  29  -4.904985  hypothetical protein
msr6074  2  26  -5.418332  hypothetical protein
mll6075  2  29  -5.350383  transposase
mlr6076  2  29  -5.878790  hypothetical protein
mlr6078  3  30  -5.750821  ABC transporter binding protein component
mlr6079  2  31  -4.847504  ABC transporter ATP-binding protein
mlr6080  3  31  -4.750604  ABC transporter permease
mlr6081  2  30  -4.405128  amidase
mlr6082  3  32  -4.369662  ABC transporter binding protein component
mlr6083  3  32  -3.985172  ABC transporter permease
mlr6084  3  28  -3.497610  ABC transporter permease
mlr6085  4  27  -3.492635  ABC transporter ATP-binding protein
mlr6086  5  27  -3.356408  ABC transporter ATP-binding protein
mlr6087  5  31  -4.637964  dipeptidyl aminopeptidase
mll6088  4  26  -4.791754  transposase
mll6089  1  34  -6.619979  transposase
mll6090  2  35  -6.186771  transposase
mll6092  2  36  -6.180015  transposase
mlr6093  2  34  -5.931999  hypothetical protein
msr6094  2  32  -5.096663  hypothetical protein
mlr6095  1  32  -5.265993  3-methylaspartate ammonia-lyase
mlr6096  2  29  -4.692621  hypothetical protein
mlr6097  0  28  -4.898732  nitrogen assimilation control protein
mlr6098  0  27  -4.611529  aldehyde dehydrogenase
mlr6099  -1  29  -5.084395  fumarate hydratase, class I
mlr6100  -1  31  -4.742719  transcriptional regulator
mlr6101  -1  29  -4.270841  hypothetical protein
mlr6102  -1  33  -3.901295  transcriptional regulator
mlr6103  -1  34  -3.385845  conjugation factor synthetase, traI
msl8686  1  31  -4.324656  hypothetical protein
mlr6104  0  32  -3.566769  hypothetical protein
mll6105  0  30  -4.340901  hypothetical protein
msr6106  0  28  -4.317155  transcriptional regulator
msr6108  -1  29  -4.170137  transcriptional regulator
mll6109  -1  26  -2.793716  hypothetical protein
mll6110  0  25  -2.267925  transposase
mll6112  -1  24  -2.506247  transposase
mlr6114  2  23  -2.506474  serine hydroxymethyltransferase
mlr6115  2  24  -1.988215  S-adenosylmethionine synthetase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mlr6063  TYPE3IMSPROT  29  0.034  Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 29.0 bits (65), Expect = 0.034
Identities = 14/69 (20%), Positives = 25/69 (36%), Gaps = 18/69 (26%)

Query: 159 QLANAIRGYAAEYGLIAARGMCKIEPLLERIAADKMLPDLARELFALHAKEYAQLQTQLK 218
+R A E G+ P+L+R LAR L+ A + +
Sbjct: 290 AQVQTVRKIAEEEGV----------PILQR-------IPLARALYW-DALVDHYIPAEQI 331

Query: 219 DVDAKLMAW 227
+ A+++ W
Sbjct: 332 EATAEVLRW 340


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mll8753  PYOCINKILLER  30  0.014  Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 29.8 bits (66), Expect = 0.014
Identities = 22/87 (25%), Positives = 35/87 (40%), Gaps = 11/87 (12%)

Query: 136 GLPEAKRGLVAGAGGALRLGEMLPPVLANEILLTGLLFEAPRAYQLGLVNRLVPEHFLLE 195
A RGL+ A GA L + + +A +L +L AP +G + L
Sbjct: 259 VATAAGRGLIQVAQGAASLAQAISDAIA---VLGRVLASAPSVMAVGFAS--------LT 307

Query: 196 AAMSLADSIAQNAPLSVRASLALVKAQ 222
+ A+ P SVR +L + A+
Sbjct: 308 YSSRTAEQWQDQTPDSVRYALGMDAAK 334


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mlr6079  PF05272  28  0.046  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.5 bits (63), Expect = 0.046
Identities = 7/20 (35%), Positives = 11/20 (55%)

Query: 38 LTILGPSGSGKTTALMLLAG 57
+ + G G GK+T + L G
Sbjct: 599 VVLEGTGGIGKSTLINTLVG 618


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mll6089  HTHTETR  31  3e-04  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 31.1 bits (70), Expect = 3e-04
Identities = 13/60 (21%), Positives = 26/60 (43%), Gaps = 4/60 (6%)

Query: 8 DAQKAFILKQGADGIPVAEICRRAGISQAT-YFNWKKK---YDGLLPTEMKRLKQLEDEN 63
D +QG + EI + AG+++ Y+++K K + + + +LE E
Sbjct: 18 DVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEY 77


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mlr6103  AUTOINDCRSYN  158  8e-51  Autoinducer synthesis protein signature.
		>AUTOINDCRSYN#Autoinducer synthesis protein signature.

Length = 216

Score = 158 bits (400), Expect = 8e-51
Identities = 44/164 (26%), Positives = 69/164 (42%), Gaps = 3/164 (1%)

Query: 1 MIELIAPGWYGAFADELHEMHRLRYRVFKERLDWNVRTTGGFEIDSFDSLKPHYLVLRDS 60
M+E+ + E+ LR FK+RL+W V+ T G E D +D+ YL
Sbjct: 1 MLEIFDVNHTLLSETKSGELFTLRKETFKDRLNWAVQCTDGMEFDQYDNNNTTYLFGIKD 60

Query: 61 AGRVRGGVRLLPSTGPTMLRDVFSRLLEGRAAPEEPSVWESSRFALDLPPSAPKDSGSIA 120
V +R + + P M+ F + P E + ESSRF +D A G+
Sbjct: 61 -NTVICSLRFIETKYPNMITGTFFPYFKEINIP-EGNYLESSRFFVD-KSRAKDILGNEY 117

Query: 121 VATYELLAGMIEFGLSRLLTHIVTVTDLRMERILRRAGWPLDRI 164
+ L MI + + I T+ M IL+R+GW + +
Sbjct: 118 PISSMLFLSMINYSKDKGYDGIYTIVSHPMLTILKRSGWGIRVV 161


46  mlr6129  mlr8757  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mlr6129   -2      22       -3.529083  transcriptional regulator
mlr6130   0       23       -4.546951  L-threonine aldolase
mlr6131   2       30       -6.712054  hypothetical protein
mlr6132   1       26       -6.561725  outer membrane protease
mll6134   0       27       -6.427253  hypothetical protein
mll6136   1       26       -5.610047  hypothetical protein
mll6137   0       23       -3.988228  hypothetical protein
msl6138   0       24       -3.602869  transcriptional regulator
mlr6139   1       29       -4.874674  hypothetical protein
mlr6140   -1      29       -4.590903  hypothetical protein
mlr6141   2       34       -4.898732  hypothetical protein
mll6142   2       35       -3.588527  succinate-semialdehyde dehydrogenase
msr6143   3       37       -4.212423  transposase
mlr6144   4       35       -3.963367  hypothetical protein
mlr6145   3       26       -0.549917  hypothetical protein
mll6146   3       26       -0.386576  transposase
mll8754   2       24       -0.238016  hypothetical protein
msl6147   3       22       -0.751162  transposase
msr6149   3       21       -0.886648  transposase
mlr6150   2       19       0.338837   transposase
mll6151   5       21       1.263166   transposase
msl8688   4       17       1.469553   hypothetical protein
msl6152   2       16       1.310563   hypothetical protein
mll6153   1       15       0.936258   succinate-semialdehyde dehydrogenase
mlr6154   1       14       0.457937   antirestriction protein
mlr6156   0       14       -0.518138  hypothetical protein
mlr6157   0       16       -1.863857  partitioning protein
mlr6158   -1      22       -2.838902  hypothetical protein
mlr6159   1       27       -3.557427  hypothetical protein
mlr6161   1       30       -4.819927  methyltransferase, nodulation protein, nodS
mlr8755   1       30       -4.898571  acyltransferase
mlr6163   2       28       -4.542456  N-acetylglucosaminyltransferase
mlr6164   -1      24       -2.841482  nodulation factor exporter subunit NodI
mlr6166   -1      26       -3.175751  nodulation protein nodJ
mlr6171   -2      27       -3.243690  nodulation protein NOLO
msr8743   -1      25       -1.390553  hypothetical protein
msl6173   -1      21       -0.419113  hypothetical protein
mlr6174   0       21       -0.125068  transcriptional regulator
msr8756   -1      24       -0.526091  hypothetical protein
mlr6175   -1      25       -1.621823  chitooligosaccharide deacetylase, nodulation
msr8689   2       31       -5.029529  hypothetical protein
mlr6176   2       32       -5.269391  transcriptional regulator
mlr6177   1       32       -5.100565  hypothetical protein
mlr6178   0       33       -5.925361  hypothetical protein
mll6179   1       33       -6.564051  transcriptional regulator, nodulation protein
mlr8757   1       30       -6.142994  nodulation protein acetyl transferase NolL
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6132   OMPTIN        105    2e-28    Omptin serine protease signature.
		>OMPTIN#Omptin serine protease signature.

Length = 317

Score = 105 bits (263), Expect = 2e-28
Identities = 76/323 (23%), Positives = 135/323 (41%), Gaps = 25/323 (7%)

Query: 13 ALLAAWPAMASDAVASEPAPYNP-NFSFVGGVGVIDIQANELVYQAPGSGSKLSHLIWE- 70
++ P S ++E + P N + +G + + E VY A G K+S L W+
Sbjct: 7 GIVLTTPIAISSFASTETLSFTPDNINADISLGTLSGKTKERVYLAEEGGRKVSQLDWKF 66

Query: 71 STSAVISAEMT-------ARSEAGWTAKLSGQIAFSGNSYMEDYDWNPSFSTNDGWNDWS 123
+ +A+I + + AGWT S GN M D DW S + W+
Sbjct: 67 NNAAIIKGAINWDLMPQISIGAAGWTTLGSR----GGN--MVDQDWMDSSNPGT----WT 116

Query: 124 NRSRHCDTDLDYFYSATVALGHDFQFSDQLLINVNGGFKYTSVKWTAYGGSYVYSV-AGF 182
+ SRH DT L+Y + + + + G++ + +TA GGSY+YS GF
Sbjct: 117 DESRHPDTQLNYANEFDLNIKGWLLNEPNYRLGLMAGYQESRYSFTARGGSYIYSSEEGF 176

Query: 183 RDSVGDFPGDAKAISYQQKLPVAFAGIDATYVQDRWNFGFALKGGTTFSAGATDHHW--M 240
RD +G FP +AI Y+Q+ + + G+ +Y + + G K + D H+
Sbjct: 177 RDDIGSFPNGERAIGYKQRFKMPYIGLTGSYRYEDFELGGTFKYSGWVESSDNDEHYDPG 236

Query: 241 RDLRFEDDFESAPVLMVGGSVGYQYNERTSFFVSGSYVKVFTARGDYNTYDIATG-AETG 299
+ + + + V + GY +V G++ +V +G+ + YD ++
Sbjct: 237 KRITYRSKVKDQNYYSVAVNAGYYVTPNAKVYVEGAWNRVTNKKGNTSLYDHNNNTSDYS 296

Query: 300 GEDDWIGGDFAAATVMVGIKTSF 322
+ G + G+K +F
Sbjct: 297 K--NGAGIENYNFITTAGLKYTF 317


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6139   CHLAMIDIAOMP  28     0.033    Chlamydia major outer membrane protein signature.
		>CHLAMIDIAOMP#Chlamydia major outer membrane protein signature.

Length = 393

Score = 28.0 bits (62), Expect = 0.033
Identities = 15/40 (37%), Positives = 23/40 (57%), Gaps = 4/40 (10%)

Query: 82 DRAFSFTVNERG--WEWGSATISAADIYRYASIDEDLELI 119
D AFS++V R WE G AT+ A+ ++YA +E +
Sbjct: 188 DTAFSWSVGARAALWECGCATLGAS--FQYAQSKPKVEEL 225


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6150   SALSPVBPROT   30     0.012    Salmonella virulence plasmid 65kDa B protein signature.
		>SALSPVBPROT#Salmonella virulence plasmid 65kDa B protein signature.

Length = 591

Score = 30.1 bits (67), Expect = 0.012
Identities = 24/94 (25%), Positives = 47/94 (50%), Gaps = 5/94 (5%)

Query: 41 SQHDEILVSR--IDRSFKSSDRTYGARRVWHDVLAEGLSCGLHRIERLMRENGLRARPRR 98
+Q E L+SR + + S + A V+ + +AEGLS L + + GL+ +
Sbjct: 421 TQAKETLLSRDYLSTNEPSDEEFKNAMSVYINDIAEGLS-SLPETDHRVVYRGLKLD--K 477

Query: 99 RGLPKDTGERAAVSDNLLDRAFEASAPNQKWVAD 132
L E + + ++D+AF +++P++ W+ D
Sbjct: 478 PALSDVLKEYTTIGNIIIDKAFMSTSPDKAWIND 511


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6166   ABC2TRNSPORT  350    e-125    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 350 bits (900), Expect = e-125
Identities = 181/255 (70%), Positives = 212/255 (83%)

Query: 8 ALPANAWNWVAVWRRNYLAWKKVALVSLLGNLADPMIYLFGLGTGLGIMVGRVDGASYIA 67
ALP + NW+AVWRRNY+AWKK AL SLLG+LA+P+IYLFGLG GLG+MVGRV G SY A
Sbjct: 8 ALPGGSLNWIAVWRRNYIAWKKAALASLLGHLAEPLIYLFGLGAGLGVMVGRVGGVSYTA 67

Query: 68 FLAAGMVAVSAMTASTLETLYAAFARMHSQRTWEAMLYTHVTLGDIVLGELAWAATKAFM 127
FLAAGMVA SAMTA+T ET+YAAF RM QRTWEAMLYT + LGDIVLGE+AWAATKA +
Sbjct: 68 FLAAGMVATSAMTAATFETIYAAFGRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAAL 127

Query: 128 AGTAITIVTATLGYAAWPSVLYALPIIALTGCVFASLAMIVTALSPSYDYFVFYQTLVLT 187
AG I +V A LGY W S+LYALP+IALTG FASL M+VTAL+PSYDYF+FYQTLV+T
Sbjct: 128 AGAGIGVVAAALGYTQWLSLLYALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVIT 187

Query: 188 PMLFLSGAVFPLNQLPGAFQQIARCMPLSHAIDLIRPVMLDRPISGIALHVGVLGLYALL 247
P+LFLSGAVFP++QLP FQ AR +PLSH+IDLIRP+ML P+ + HVG L +Y ++
Sbjct: 188 PILFLSGAVFPVDQLPIVFQTAARFLPLSHSIDLIRPIMLGHPVVDVCQHVGALCIYIVI 247

Query: 248 PFFLSMALLRRRLMR 262
PFFLS ALLRRRL+R
Sbjct: 248 PFFLSTALLRRRLLR 262


47  mlr6191  mll6391  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mlr6191   2       21       -1.013528  hypothetical protein
mll6192   4       24       -1.382130  hypothetical protein
mll6193   3       23       -1.372917  hypothetical protein
mll8758   4       21       -0.789303  hypothetical protein
mll6195   3       20       -1.540774  hypothetical protein
mlr6196   3       21       -2.677499  hypothetical protein
mlr6199   3       22       -4.613798  hypothetical protein
mlr6200   2       22       -4.712981  hypothetical protein
msl6202   1       25       -5.020265  hypothetical protein
mlr6203   1       29       -6.173207  hypothetical protein
mlr6204   1       31       -6.268416  hypothetical protein
msl6205   2       34       -6.630543  5-methyltetrahydrofolate--homocysteine
mll6206   2       35       -6.719391  5-methyltetrahydrofolate--homocysteine
mll6207   4       41       -7.814846  DNA-damage-inducible protein
mlr6209   4       44       -8.284182  histidine decarboxylase
mlr6210   3       44       -6.723521  glutamine synthetase
mll6211   3       42       -6.308031  alanine racemase
mll6212   3       40       -4.952574  hypothetical protein
msl6213   3       33       -3.917986  hypothetical protein
mlr6215   2       32       -3.799167  hypothetical protein
mlr6217   2       28       -3.420268  hypothetical protein
mlr6218   1       20       -2.460294  dipeptidase
msr6219   0       18       -1.007914  transposase
mll6220   0       17       -0.811101  transposase
msl6221   -1      22       -2.821863  transposase
mll6222   0       22       -2.968037  transposase
mll6223   0       19       -2.511811  transposase
mll6224   0       22       -2.671682  transposase
mll6225   1       28       -4.021975  transposase
mlr6226   3       29       -4.514613  methyltransferase
mll6227   3       24       -2.200951  transposase
mll6228   2       24       -1.614672  integrase/recombinase
mll6229   3       27       -2.683033  integrase/recombinase
mll6230   2       26       -2.916797  integrase/recombinase
mll6231   1       21       -1.845060  integrase/recombinase
mll6232   0       21       -1.708668  transposase
mll6233   -1      18       -1.570681  transposase
mll6236   -1      18       -1.918010  transcriptional regulator
mll6237   -1      19       -2.049573  hypothetical protein
mll6238   -1      21       -1.903559  sarcosine oxidase subunit alpha
msl6239   1       27       -3.047895  sarcosine oxidase subunit delta
mll6240   0       27       -3.108131  sarcosine oxidase subunit beta
mlr6241   -1      30       -3.098387  transcriptional regulator
mll6243   0       35       -3.195720  hypothetical protein
mlr8759   -1      28       -3.801636  hypothetical protein
mlr6244   1       30       -4.976939  hypothetical protein
msl6246   0       33       -5.414699  hypothetical protein
mlr6247   0       34       -5.762368  hypothetical protein
mll6249   1       42       -7.393420  hypothetical protein
mll6250   3       44       -8.876188  tyramine oxidase
mll8746   4       50       -9.428036  hypothetical protein
mll6251   4       46       -9.064763  hypothetical protein
mll8760   5       47       -9.642653  hypothetical protein
mll6252   5       47       -9.804701  hypothetical protein
mll6253   5       46       -9.615617  ABC transporter permease
mll6255   4       40       -8.005996  ABC transporter ATP-binding protein
mll6256   3       36       -7.636014  ABC transporter binding protein component
mlr8761   2       38       -7.351890  hypothetical protein
mlr6258   0       30       -5.817147  hypothetical protein
msr6259   -1      29       -4.707751  cold shock protein
mll6260   0       32       -4.784992  transcriptional regulator
mll6261   -1      34       -5.253556  hypothetical protein
msl6262   1       31       -4.283276  hypothetical protein
msr6263   1       31       -3.995122  hypothetical protein
mll6264   -1      29       -3.796759  hypothetical protein
mlr6265   -2      25       -2.496072  hypothetical protein
mlr6266   -2      22       -1.304655  hypothetical protein
mll6267   -1      24       -1.931334  hypothetical protein
msl6268   -2      25       -2.562184  citrate synthase
mll6270   -2      29       -3.182167  dithiobiotin synthetase
mll6272   -2      32       -3.560911  nicotinate-nucleotide pyrophosphorylase
msl6271   -1      37       -4.220746  hypothetical protein
mlr6273   -1      35       -4.212423  transposase
mlr6274   0       31       -3.614031  NTP-binding protein
mlr6275   0       30       -3.080735  hypothetical protein
msr6276   1       26       -2.557652  hypothetical protein
msr6277   3       32       -5.319797  transposase
mlr6278   2       33       -6.417050  transposase
mll6279   2       33       -5.687614  hypothetical protein
mll6280   2       34       -6.117096  hypothetical protein
msr6281   3       35       -6.388841  hypothetical protein
mlr6282   3       31       -5.750976  phosphinothricin tripeptide synthetase B
mlr6283   2       27       -5.217714  L-proline 3-hydroxylase
mll6284   2       28       -5.005691  transporter
mll6285   2       31       -5.872857  D-amino acid dehydrogenase small subunit
mlr6286   2       31       -6.645326  dipeptidase
mlr6287   1       28       -6.464521  ABC transporter ATP-binding protein
mlr6288   3       38       -7.626933  ABC transporter substrate-binding protein
mlr6289   2       41       -7.292269  ABC transporter permease
mlr6290   1       40       -6.791918  ABC transporter permease
mll6291   1       42       -7.916126  transcriptional regulator
mlr6292   0       39       -7.383556  dihydrodipicolinate synthetase
mlr6294   0       44       -7.916126  hypothetical protein
mlr6296   0       33       -7.001874  carboxylase
mlr6298   1       26       -5.940098  gamma-glutamyl kinase
mlr6299   2       20       -5.138349  transcriptional regulator
mlr6301   0       18       -3.439056  transposase
msl6302   2       19       -4.126788  hypothetical protein
mll6303   2       19       -4.059506  hypothetical protein
mll6304   1       21       -5.147894  hypothetical protein
mll6306   2       24       -6.054910  hypothetical protein
mlr6307   3       27       -6.593198  phage-related replication protein
mlr6308   4       31       -4.027055  hypothetical protein
mlr6309   4       33       -3.630522  endonuclease
mlr6311   3       33       -3.435450  nuclease
mlr6313   3       36       -2.547440  hypothetical protein
msl6314   3       35       -2.332577  transposase
mlr6316   1       28       -2.044936  hypothetical protein
msr6318   -1      24       -2.194398  hypothetical protein
mll6319   0       22       -2.014929  transposase
msr6320   0       19       -1.809946  hypothetical protein
msr6321   1       19       -1.770875  transposase
mlr6323   0       18       -1.738756  resolvase
mlr6324   2       28       -3.199113  hypothetical protein
mlr6325   3       28       -3.158202  hypothetical protein
mlr6326   2       30       -3.791289  DNA invertase
mlr6327   2       31       -4.155443  hypothetical protein
mlr6328   3       32       -4.518514  hypothetical protein
mlr6331   3       32       -5.029570  hypothetical protein
msl6332   3       36       -7.029265  hypothetical protein
mlr6334   3       37       -7.047699  two-component response regulator
mlr6335   4       35       -6.177850  type II secretion system protein
mlr8762   4       37       -6.331619  hypothetical protein
mll6337   3       38       -5.829170  nodulation protein NOLX
mll6338   1       40       -4.646882  nodulation protein NolW
mlr8763   1       40       -4.338349  hypothetical protein
mlr6339   1       39       -4.352931  nodulation protein NolT
mlr8764   0       42       -5.501557  nodulation protein NolU
mlr6341   0       41       -5.722071  nodulation protein NolV
mlr6342   1       42       -6.843379  ATP synthase in type III secretion system, hrcN
mlr6343   2       43       -7.717850  hypothetical protein
mlr6344   2       43       -7.930227  translocation protein in type III secretion
mlr8766   2       40       -8.155515  type III secretion system protein
msr8694   2       38       -7.398771  hypothetical protein
mlr6345   1       36       -6.714445  translocation protein in type III secretion
mlr6346   1       34       -6.161740  translocation protein in type III secretion
mlr6347   1       31       -5.809958  hypothetical protein
mlr6348   0       30       -5.426094  type III secretion inner membrane protein, HrcV
mlr8765   3       28       -3.258916  hypothetical protein
mll6350   2       22       -1.088575  hypothetical protein
msr6351   2       23       -1.124264  hypothetical protein
mll6352   3       27       -1.962661  transposase
mll6353   3       27       -1.980886  transposase
mll6354   4       27       -2.023401  hypothetical protein
mll6355   4       27       -2.229907  transposase
msl6356   4       27       -2.520757  hypothetical protein
mlr6358   3       25       -2.097854  hypothetical protein
mll6359   3       22       -1.418472  transposase
mlr6361   2       21       -0.565222  hypothetical protein
mll6362   3       17       2.966786   transposase
mll6363   4       21       4.058543   resolvase-like
mlr6364   5       22       4.598916   cytochrome P-450
mlr6365   6       23       4.866527   cytochrome P-450
mlr6366   7       23       5.498800   short-chain type dehydrogenase/reductase
mlr6367   2       19       3.314602   cytochrome P450
mlr6368   2       19       2.569540   geranyltranstransferase
mlr6369   1       19       1.729467   hypothetical protein
mlr6370   1       21       0.284932   hypothetical protein
mlr6371   0       24       -1.038092  isopentenyl pyrophosphate isomerase
mlr6372   1       31       -3.018720  transporter
mll6374   1       30       -2.684059  integrase/recombinase
mll6376   1       30       -2.821746  dihydrodipicolinate synthetase
mll6377   1       28       -3.303332  threonine efflux protein
mll6378   1       28       -3.146317  hypothetical protein
mll6379   -1      22       -2.955522  transposase
msl6380   -1      22       -2.659874  hypothetical protein
mll6383   -1      22       -3.954774  transposase
mll6384   -1      22       -3.665347  a-type carbonic anhydrase
mlr6385   -1      23       -3.749460  conjugation factor synthase traI
mlr6386   -1      22       -3.872153  glucosamine--fructose-6-phosphate
mlr6387   1       26       -4.719656  hypothetical protein
mll6389   0       22       -3.978463  porin
mlr6390   -2      23       -3.084283  coproporphyrinogen III oxidase
mll6391   -1      25       -3.679314  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll6211   ALARACEMASE   191    1e-59    Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 191 bits (487), Expect = 1e-59
Identities = 94/364 (25%), Positives = 150/364 (41%), Gaps = 22/364 (6%)

Query: 10 DVVLEIDLAAIRANFQKISTLVGDKVKVAAVVKSDAYGLGLVDIARTLIDAGCDLLFVAN 69
+ +DL A++ N + +V +VVK++AYG G+ I + + N
Sbjct: 4 PIQASLDLQALKQNLSIVRQA-ATHARVWSVVKANAYGHGIERIWSAIGATDG--FALLN 60

Query: 70 LDEALLLRSSFSRVAI----AVFRDEFDRFGTWYRSHGLIPVVNNCKELHAVGTA--GEP 123
L+EA+ LR + I F + +R L V++ +L A+ A P
Sbjct: 61 LEEAITLRERGWKGPILMLEGFFHAQDLEIYDQHR---LTTCVHSNWQLKALQNARLKAP 117

Query: 124 QSYFLNVETGFSRFGLSVGDIQREY-LLRTFERYRPSIVLSHLACGECISDPMNQLQRDR 182
+L V +G +R G + + LR ++SH A E + R
Sbjct: 118 LDIYLKVNSGMNRLGFQPDRVLTVWQQLRAMANVGEMTLMSHFAEAEHPDGISGAMAR-- 175

Query: 183 FRTVYDLLKPTRGSLSASAGVWLGKSYHFDMVRVGSALYGI----HNAGVQTNPLKPVVK 238
+ L R SLS SA HFD VR G LYG + L+PV+
Sbjct: 176 IEQAAEGL-ECRRSLSNSAATLWHPEAHFDWVRPGIILYGASPSGQWRDIANTGLRPVMT 234

Query: 239 LRARILDVRSVPAGEAVGYGATFRTDRASRVAIVGIGYKHGLPWSCANKIFVRLAEYSAP 298
L + I+ V+++ AGE VGYG + R+ IV GY G P V +
Sbjct: 235 LSSEIIGVQTLKAGERVGYGGRYTARDEQRIGIVAAGYADGYPRHAPTGTPVLVDGVRTM 294

Query: 299 SIGRISMEYMIIDITDVPARRCSPGTFAELLSEDFTVNDLGAAAGVSPQEALTRLGAGCT 358
++G +SM+ + +D+T P + GT EL ++ ++D+ AAAG E + L
Sbjct: 295 TVGTVSMDMLAVDLTPCP--QAGIGTPVELWGKEIKIDDVAAAAGTVGYELMCALALRVP 352

Query: 359 RKYL 362
+
Sbjct: 353 VVTV 356


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll6232   IGASERPTASE   30     0.007    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 30.0 bits (67), Expect = 0.007
Identities = 16/59 (27%), Positives = 26/59 (44%), Gaps = 3/59 (5%)

Query: 38 DGW-PAANLLACLLEIEMAERASRRIQRHREQSGLPAGKTFATFDFDF-PS-GIRKPHL 93
+ W +L + ++ + + RH Q GL AGK F +F P G+R +L
Sbjct: 1371 NHWYLGIDLGYGKFQSKLQTNHNAKFARHTAQFGLTAGKAFNLGNFGITPIVGVRYSYL 1429


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll6233   SECA          31     0.015    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 31.0 bits (70), Expect = 0.015
Identities = 18/57 (31%), Positives = 24/57 (42%), Gaps = 11/57 (19%)

Query: 255 HLKRRLDQALRRRS--------SRDFPSIED--YRRFVDQEVAKQNRRRARLVDDER 301
H RR+D LR RS SR + S+ED R F V+ R+ + E
Sbjct: 562 HESRRIDNQLRGRSGRQGDAGSSRFYLSMEDALMRIFASDRVSG-MMRKLGMKPGEA 617


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll6252   PF07675       29     0.033    Cleaved Adhesin
		>PF07675#Cleaved Adhesin

Length = 1358

Score = 29.3 bits (65), Expect = 0.033
Identities = 12/48 (25%), Positives = 21/48 (43%)

Query: 320 AGEFERELVPGAEWTFEAGMVFHVLMMAQGIGFSETVLITDNGPERLT 367
AG+ + ++ FEAG + M G+G + + D+ P T
Sbjct: 488 AGDGGNQPARYDDFAFEAGKKYTFTMRRAGMGDGTDMEVEDDSPASYT 535


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll6256   SECA          30     0.019    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 29.8 bits (67), Expect = 0.019
Identities = 11/52 (21%), Positives = 28/52 (53%), Gaps = 5/52 (9%)

Query: 82 KLKAQVD-SNNVQWDVMELSNPAFRPDANKYLEEIDYAAFDKETLDNLVPEA 132
+++ V+ N ++ ++ +LS+ + ++ ++ E L+NL+PEA
Sbjct: 20 RMRKVVNIINAMEPEMEKLSDEELKGKTAEFRARLE----KGEVLENLIPEA 67


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6258   PHAGEIV       27     0.021    Gene IV protein signature.
		>PHAGEIV#Gene IV protein signature.

Length = 426

Score = 26.8 bits (59), Expect = 0.021
Identities = 11/40 (27%), Positives = 16/40 (40%)

Query: 47 PDVMGCVLSPSVQLAASAYPSVSLPVLRSFAFAALCYEPA 86
PDV G V S + + VLR+ F + P+
Sbjct: 50 PDVKGTVTVYSSDVKPENLRDFFISVLRANNFDMVGSIPS 89


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
msr6263   LUXSPROTEIN   28     0.003    Bacterial autoinducer-2 (AI-2) production protein Lu...
		>LUXSPROTEIN#Bacterial autoinducer-2 (AI-2) production protein LuxS signature.

Length = 171

Score = 28.0 bits (62), Expect = 0.003
Identities = 13/49 (26%), Positives = 22/49 (44%), Gaps = 4/49 (8%)

Query: 18 APYTYVTTKHFLLEFGLETLRDL----PDFEALEDAGLLSKEKLLAGYI 62
AP V + T+ DL P+ + L + G+ + E L AG++
Sbjct: 15 APAVRVAKTMQTPKGDTITVFDLRFTAPNKDILSEKGIHTLEHLYAGFM 63


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll6284   TCRTETA       46     2e-07    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 46.4 bits (110), Expect = 2e-07
Identities = 56/266 (21%), Positives = 96/266 (36%), Gaps = 13/266 (4%)

Query: 69 FILSVFVGAIADNFSRRRVMFAGWCLMAIASAMLATSVALGFVAPWMILGFSFLIACGSA 128
F + +GA++D F RR V+ A+ A++AT+ L W+ L ++A +
Sbjct: 57 FACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFL-----WV-LYIGRIVAGITG 110

Query: 129 VNDPAWQASVGDIVDRRDVPAAVTLLTVGFNTVRTVGPALGGL---VVASFGLVTAFAVT 185
A + DI D + ++ F GP LGGL A A+
Sbjct: 111 ATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALN 170

Query: 186 ALSYLVPVCTIWRSKWKVRSSPLPREPLRTALHDGLRFTAMSSEIKASIARGTLFGLASI 245
L++L C + K PL RE L R+ + + A +A + L
Sbjct: 171 GLNFLT-GCFLLPESHKGERRPLRREALNPL--ASFRWARGMTVVAALMAVFFIMQLVGQ 227

Query: 246 AILALLPLVVRDQLGGGPIAYGTLMGGFG-TGAVLAGVSNSMLRRSLQLERLMRLACVAC 304
AL + D+ G + FG ++ + + L R + L +A
Sbjct: 228 VPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIAD 287

Query: 305 AACCLSLALTSSIAVAAIALAFGGAG 330
+ LA + +A + +G
Sbjct: 288 GTGYILLAFATRGWMAFPIMVLLASG 313


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6287   TYPE4SSCAGA   31     0.016    Type IV secretion system CagA exotoxin signature.
		>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature.

Length = 1147

Score = 31.2 bits (70), Expect = 0.016
Identities = 31/108 (28%), Positives = 41/108 (37%), Gaps = 27/108 (25%)

Query: 365 LSGSILLDGVDFLNLSQHELRKQRSRMQMIFQDPFASLNPRMNVGT-------------- 410
L GS+ DGV F++ S F+ AS NP VG
Sbjct: 489 LQGSLKHDGVMFVDYSN-------------FKYTNASKNPNKGVGVTNGVSHLEVGFNKV 535

Query: 411 AIVEPLLINNLASRSEARDKVADLLQRVGLSPDMVNRFPHEFSGGQRQ 458
AI +NNLA S R + D L GLSP N+ +F ++
Sbjct: 536 AIFNLPDLNNLAITSFVRRNLEDKLTTKGLSPQEANKLIKDFLSSNKE 583


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6298   CARBMTKINASE  43     5e-07    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 43.3 bits (102), Expect = 5e-07
Identities = 34/134 (25%), Positives = 52/134 (38%), Gaps = 19/134 (14%)

Query: 145 GAVPVINENDATATPEVCLGDNDRLAARVAQIAKADLLILLSDVDGLF----TEDPHDNP 200
G VPVI E+ E + D D ++A+ AD+ ++L+DV+G TE
Sbjct: 195 GGVPVILEDGEIKGVEAVI-DKDLAGEKLAEEVNADIFMILTDVNGAALYYGTEKEQ--- 250

Query: 201 LARMIPEVRRITPEIEIMASLSPARHGSGGMVTKLMAA-RIAMEAGCNVVIAKGSKSYPL 259
+R + E E+ +G M K++AA R G +IA K
Sbjct: 251 ------WLREVKVE-ELRKYYEEGHFKAGSMGPKVLAAIRFIEWGGERAIIAHLEK---A 300

Query: 260 AAIENGAPSTWFIP 273
G T +P
Sbjct: 301 VEALEGKTGTQVLP 314


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
msl6302   PF07132       26     0.006    Harpin protein (HrpN)
		>PF07132#Harpin protein (HrpN)

Length = 356

Score = 26.2 bits (57), Expect = 0.006
Identities = 12/34 (35%), Positives = 18/34 (52%), Gaps = 1/34 (2%)

Query: 2 QFTTGKPEM-AQPETAKADVDKLRTNEKKWTKAL 34
QF PE+ +PE K + + ++K W KAL
Sbjct: 256 QFMDQYPEVFGKPEYQKDNWQTAKQDDKSWAKAL 289


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll6306   RTXTOXIND     30     0.005    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 29.8 bits (67), Expect = 0.005
Identities = 13/110 (11%), Positives = 31/110 (28%), Gaps = 2/110 (1%)

Query: 35 ATREEYERRLAQSNAAMTAREEELQRKQAAIDAAREDIDRQVAEKLKLERAGIAVEEARK 94
A E + QS+ E+ R Q + + ++ + ++ EE +
Sbjct: 129 ALGAEADTLKTQSSLLQARLEQT--RYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLR 186

Query: 95 AKILVSTDLEDKDRKLGELEATLRARDEKLAAAQLQQAEFMKQQRALDDE 144
L+ + + E L + + + + R
Sbjct: 187 LTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSR 236



Score = 28.6 bits (64), Expect = 0.009
Identities = 14/127 (11%), Positives = 35/127 (27%), Gaps = 7/127 (5%)

Query: 25 NESLAAPLIAATREEYERRLAQSNAAMTAREEELQRKQAAIDAAREDIDRQVAEKLKLER 84
LI ++ + Q + + E A I+ R +L
Sbjct: 183 EVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLS-RVEKSRLD--- 238

Query: 85 AGIAVEEARKAKILVSTDLEDKDRKLGELEATLRARDEKLAAAQLQQAEFMKQQRALDDE 144
+ + + +++ K E LR +L + + ++ + +
Sbjct: 239 ---DFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQL 295

Query: 145 KREMALT 151
+ L
Sbjct: 296 FKNEILD 302


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6311   PF07132       30     0.019    Harpin protein (HrpN)
		>PF07132#Harpin protein (HrpN)

Length = 356

Score = 30.4 bits (68), Expect = 0.019
Identities = 10/21 (47%), Positives = 16/21 (76%)

Query: 565 WLKKYPDTFGKPLSEKDDWQE 585
++ +YP+ FGKP +KD+WQ
Sbjct: 257 FMDQYPEVFGKPEYQKDNWQT 277


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6325   SYCDCHAPRONE  29     0.007    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 28.7 bits (64), Expect = 0.007
Identities = 12/66 (18%), Positives = 24/66 (36%), Gaps = 6/66 (9%)

Query: 6 GLAAVLWRLGQHQEAIDHYQSMLKLNPND-----NQGIRYVLAGHLL-ARDDIKALRKLL 59
GL A +GQ+ AI Y ++ + + + G L A + ++L+
Sbjct: 75 GLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELI 134

Query: 60 KQHEDD 65
+
Sbjct: 135 ADKTEF 140


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6334   HTHFIS        74     2e-17    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 73.7 bits (181), Expect = 2e-17
Identities = 28/138 (20%), Positives = 55/138 (39%)

Query: 2 RTLFVDHHADLTRAVGVALGDSGFAVDVVPTLEQASSAFSCASYEILLLELVLPDGDGLD 61
L D A + + AL +G+ V + + ++++ ++V+PD + D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 WLKQLRGEGHSVPALILSDVDDLEKRIAIFNGGADDFLLKPVYTNELIARMRAVLRRSTQ 121
L +++ +P L++S + I GA D+L KP ELI + L +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 122 MTAPIIVFGNLHFDPIGR 139
+ + +GR
Sbjct: 125 RPSKLEDDSQDGMPLVGR 142


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6335   BCTERIALGSPD  130    9e-35    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 130 bits (327), Expect = 9e-35
Identities = 58/264 (21%), Positives = 115/264 (43%), Gaps = 27/264 (10%)

Query: 153 QVNLSVRVAEVSRSAMKALGVNLS-AFGQIDNFRVGLLSGGGTGSGAAQGGGTAGIGFNN 211
QV + +AEV + LG+ + + F SG + A G +
Sbjct: 346 QVLVEAIIAEVQDADGLNLGIQWANKNAGMTQFTN---SGLPISTAIAGANQYNKDGTVS 402

Query: 212 GAV-----------------NIGAVLDALAKEHIASVLAEPNLTAMSGETASFLAGGEFP 254
++ N +L AL+ +LA P++ + A+F G E P
Sbjct: 403 SSLASALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVP 462

Query: 255 IPVLQ-----ENKQVSVEFRHFGVSLEFVPTVLNNNRINIHVKPEVSELSSQGAVQINGI 309
+ +N +VE + G+ L+ P + + + + ++ EVS ++ A +
Sbjct: 463 VLTGSQTTSGDNIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSVA-DAASSTSSD 521

Query: 310 SVPAVSTRRADTVVELASGQSFAIGGLIRRNVNNNVSAFPWLGEMPILGALFRSSSFQKE 369
+TR + V + SG++ +GGL+ ++V++ P LG++P++GALFRS+S +
Sbjct: 522 LGATFNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVS 581

Query: 370 ESELIILVTPYIVKPGSSPNQMSA 393
+ L++ + P +++ Q S+
Sbjct: 582 KRNLMLFIRPTVIRDRDEYRQASS 605


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll6338   TYPE3OMGPROT  124    4e-35    Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature.

Length = 607

Score = 124 bits (312), Expect = 4e-35
Identities = 51/155 (32%), Positives = 76/155 (49%), Gaps = 3/155 (1%)

Query: 3 LLCAGFFLSAGTNGTLGVPLSLSKTPYRYTVLDQDISEALQQFGNNLNIRVNISAEVKGR 62
L LS + + L PY Y + + + L FG N + V +S ++ +
Sbjct: 13 LTGTLLLLS---SYSWAQELDWLPIPYVYVAKGESLRDLLTDFGANYDATVVVSDKINDK 69

Query: 63 IRGSMPDLPPREFLDRLANMYGLQWYYDGLVLYVSAAKESQTRMLVLTSIRFDTFKGALD 122
+ G P++FL +A++Y L WYYDG VLY+ E +R++ L K AL
Sbjct: 70 VSGQFEHDNPQDFLQHIASLYNLVWYYDGNVLYIFKNSEVASRLIRLQESEAAELKQALQ 129

Query: 123 KLEISDDRYVVRPAPGDGLVLVSGPPRFTALVEQT 157
+ I + R+ RP + LV VSGPPR+ LVEQT
Sbjct: 130 RSGIWEPRFGWRPDASNRLVYVSGPPRYLELVEQT 164


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6339   FLGMRINGFLIF  82     9e-20    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 81.9 bits (202), Expect = 9e-20
Identities = 40/165 (24%), Positives = 70/165 (42%), Gaps = 7/165 (4%)

Query: 28 LYTQLQEREANEMLALLMDNGVHAVRVAAKDGTSTVQVDEKLLAYSIDLLNGKGLPRQSF 87
L++ L +++ ++A L + +G+ ++V + L +GLP+
Sbjct: 53 LFSNLSDQDGGAIVAQLTQMNIPYR---FANGSGAIEVPADKVHELRLRLAQQGLPKGG- 108

Query: 88 KNLG-EIFQGSGLIASPTEERARYVYALSEELSRTISDIDGVFSVRVHVVLPHNDLLRAG 146
+G E+ S E+ Y AL EL+RTI + V S RVH+ +P L
Sbjct: 109 -AVGFELLDQEKFGISQFSEQVNYQRALEGELARTIETLGPVKSARVHLAMPKPSLFVRE 167

Query: 147 ATPSSASVFIRHDAKADLS-VLLPKIKMLVADSIEGLSYDKVEVV 190
SASV + + L + + LV+ ++ GL V +V
Sbjct: 168 QKSPSASVTVTLEPGRALDEGQISAVVHLVSSAVAGLPPGNVTLV 212


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6341   FLGFLIH       27     0.045    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 27.1 bits (59), Expect = 0.045
Identities = 38/179 (21%), Positives = 73/179 (40%), Gaps = 33/179 (18%)

Query: 40 ERHQQHVRSWARAAYQRELARGHTEGLNAGAEE---------------MAALISQAVAEV 84
+ H+Q ++ Q+ +G+ EGL G E+ M L+S+ +
Sbjct: 50 QAHEQGYQAGIAEGRQQGHKQGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTL 109

Query: 85 ARRKAVLEQQLPQLVLEILSELLG---AFDPGELLVMAVRHAIERQYSGAEVCLHVYPTQ 141
+V+ +L Q+ LE +++G D L+ + + + L V+P
Sbjct: 110 DALDSVIASRLMQMALEAARQVIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDD 169

Query: 142 V----DMLAREFA--GWDGQDGRPRVRIKPDPTLSPRRCVLWSEYGNVDLGLDAQMRAL 194
+ DML + GW R++ DPTL P C + ++ G++D + + + L
Sbjct: 170 LQRVDDMLGATLSLHGW---------RLRGDPTLHPGGCKVSADEGDLDASVATRWQEL 219


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6344   TYPE3OMOPROT  131    7e-38    Type III secretion system outer membrane O protein ...
		>TYPE3OMOPROT#Type III secretion system outer membrane O protein family signature.

Length = 303

Score = 131 bits (331), Expect = 7e-38
Identities = 46/182 (25%), Positives = 79/182 (43%), Gaps = 19/182 (10%)

Query: 172 FRALGELFGQLPRQPRGLLSDLPIVVAGEIGTLHVPAAILRKACAGDALLPDLAPFGRGE 231
F L EL +P+ L L V IG+ ++L + GD LL + R E
Sbjct: 131 FEHLPELPAVGGGRPKMLRWPLRFV----IGSSDTQRSLLGRIGIGDVLLIRTS---RAE 183

Query: 232 IALSLGQLWASADLEGDQLVLHGPFRPRSYSLENAHMTQLGSQLGPTE---DLDDVEIML 288
+ +L +EG +V +L+ H+ + + E L+ + + L
Sbjct: 184 VYCYAKKLGHFNRVEGGIIV---------ETLDIQHIEEENNTTETAETLPGLNQLPVKL 234

Query: 289 VFECGRWPIPLGELRSAGEGHIFELGRPIQDPVDILANGQCIGRGDIVRIGDTLGIRLRG 348
F R + L EL + G+ + L + V+I+ANG +G G++V++ DTLG+ +
Sbjct: 235 EFVLYRKNVTLAELEAMGQQQLLSLPTNAELNVEIMANGVLLGNGELVQMNDTLGVEIHE 294

Query: 349 RL 350
L
Sbjct: 295 WL 296


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr8766   TYPE3IMPPROT  233    3e-80    Type III secretion system inner membrane P protein ...
		>TYPE3IMPPROT#Type III secretion system inner membrane P protein family signature.

Length = 224

Score = 233 bits (596), Expect = 3e-80
Identities = 84/215 (39%), Positives = 130/215 (60%), Gaps = 7/215 (3%)

Query: 9 LALLAVTAGLGLLVLVVVTTTAFVKVSVVLFLVRNALGTQTIPPNIALYAVALILTMFLS 68
++L+A+ A LL ++ + T FVK S+V +VRNALG Q IP N+ L VAL+L+MF+
Sbjct: 5 ISLIALLAFSTLLPFIIASGTCFVKFSIVFVMVRNALGLQQIPSNMTLNGVALLLSMFVM 64

Query: 69 APVVEQTYDRMTDPKLHYQTFDDWVSAAKSGSEPLRDHLKKFTNEEQRQFFLSSTEKVWP 128
P++ Y D + + G + RD+L K+++ E QFF ++ K
Sbjct: 65 WPIMHDAYVYFEDEDVTFNDISSLSKHVDEGLDGYRDYLIKYSDRELVQFFENAQLKRQY 124

Query: 129 AEM-------RAKATVDDLSILVPSFLISELKRAFEIGFLLYLPFIVIDLIVTTILMAMG 181
E + + + L+P++ +SE+K AF+IGF LYLPF+V+DL+V+++L+A+G
Sbjct: 125 GEETETVKRDKDEIEKPSIFALLPAYALSEIKSAFKIGFYLYLPFVVVDLVVSSVLLALG 184

Query: 182 MSMVSPTLISVPFKLFVFVAIDGWSKLMHGLVLSY 216
M M+SP IS P KL +FVA+DGW+ L GL+L Y
Sbjct: 185 MMMMSPVTISTPIKLVLFVALDGWTLLSKGLILQY 219


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
msr8694   TYPE3IMQPROT  56     3e-14    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 55.9 bits (135), Expect = 3e-14
Identities = 20/72 (27%), Positives = 40/72 (55%)

Query: 1 MSQSLVVFMIWILPPLIASVVVGLVIGIIQAATQIQDESLPLTVKLLVVVAVIGLFAPVL 60
+++L + +I P I + ++GL++G+ Q TQ+Q+++LP +KLL V + L +
Sbjct: 8 GNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFLLSGWY 67

Query: 61 SAPLIELTDQIF 72
L+ Q+
Sbjct: 68 GEVLLSYGRQVI 79


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6345   TYPE3IMRPROT  162    2e-51    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.

Length = 261

Score = 162 bits (413), Expect = 2e-51
Identities = 39/226 (17%), Positives = 89/226 (39%), Gaps = 7/226 (3%)

Query: 24 LGAARAIGIMMILPVFTRSQTDGLIRGCLAVGFGLPCLAHVSDALQALDPETRLIEVALL 83
R + ++ P+ + ++ LA+ + + L L
Sbjct: 18 WPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPANDVPVFSFFAL----WL 73

Query: 84 GLKEVLVGALLGTFLGIPLWGLQAAGEFIDNQRGVTNPSAPTDPATNSQASAMGVFLGIT 143
++++L+G LG + ++ AGE I Q G++ + DPA++ + + +
Sbjct: 74 AVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATF-VDPASHLNMPVLARIMDML 132

Query: 144 AIAIFVASGGLETLIGALYGSYLIWPVYKFYPTLSTQGAMEVLGLLDQIMRTALLVSGPV 203
A+ +F+ G LI L ++ P+ L++ + + I L+++ P+
Sbjct: 133 ALLLFLTFNGHLWLISLLVDTFHTLPI--GGEPLNSNAFLALTKAGSLIFLNGLMLALPL 190

Query: 204 VFFMTLIDVSFMLLRRFAPQFKLTQLSPAIKNLVFPILMVTYAGYL 249
+ + ++++ LL R APQ + + + V LM +
Sbjct: 191 ITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLI 236


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6346   TYPE3IMSPROT  310    e-106    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 310 bits (795), Expect = e-106
Identities = 96/338 (28%), Positives = 160/338 (47%), Gaps = 4/338 (1%)

Query: 5 SEEKTHAATPKKLNDARKKGQLPHSSDFVRAVGTCAGLGYLWLRGSAIEDKCREALLFVD 64
S EKT TPKK+ DARKKGQ+ S + V A L + + +L
Sbjct: 2 SGEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPA 61

Query: 65 KLQNLPFDFAVRQALVVLAELTLATVGPLLGTLVAAVLLASILANGGFVFSLEPMTPNFD 124
+ LPF A+ + + PLL + A + +AS + GF+ S E + P+
Sbjct: 62 EQSYLPFSQALSYVVDNVLLEFFYLCFPLLT-VAALMAIASHVVQYGFLISGEAIKPDIK 120

Query: 125 KINPFQGLKRLASARSMVELGKTLFKVFVLGATFSFCLLGMWKTMVYLPFCGMGCLGLVV 184
KINP +G KR+ S +S+VE K++ KV +L + G T++ LP CG+ C+ ++
Sbjct: 121 KINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLL 180

Query: 185 TGA-KLLIGIGAGALLAAGLIDLLVQRALFLREMRMTKTEVTRELKDQQGAPELKSERRR 243
+ L+ I + + D + +++E++M+K E+ RE K+ +G+PE+KS+RR+
Sbjct: 181 GQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQ 240

Query: 244 IRDESADEPPL-GVHHATLIFKG-TAILIGLRYVRGETGVPVLVCRADGERASHLLSEAR 301
E V ++++ T I IG+ Y RGET +P++ + + + A
Sbjct: 241 FHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAE 300

Query: 302 ALRLEIVDNDVLAHQLIGKTQLGRPIPMQYFEPVARAL 339
+ I+ LA L + IP + E A L
Sbjct: 301 EEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVL 338


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6366   DHBDHDRGNASE  130    1e-37    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.
Length = 261

Score = 130 bits (327), Expect = 1e-37
Identities = 81/254 (31%), Positives = 117/254 (46%), Gaps = 13/254 (5%)

Query: 98 EGKVAVVTGAGAGIGKACALAIAREGGRVVVADIDGSAAVACTAQIAAEAGHALALAMDI 157
EGK+A +TGA GIG+A A +A +G + D + + + AEA HA A D+
Sbjct: 7 EGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADV 66

Query: 158 ADAQAVAALFETAERHFGGVDLLVNNASAMHLTPRDRTILDLDLAVWDQTMATNLRGTLL 217
D+ A+ + ER G +D+LVN A + L W+ T + N G
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIH----SLSDEEWEATFSVNSTGVFN 122

Query: 218 CCRQAIPRMIARGGGAIVNMSSCQGLSGDTAQTSYAASKAAMNMLSASLATQYGHAQIRC 277
R M+ R G+IV + S T+ +YA+SKAA M + L + IRC
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 278 NAVAPGLI---MTERLLAKLDECMQR---HLSRHQL---LQRVGRPEDVAALVAFLLSDD 328
N V+PG M L A + Q L + L+++ +P D+A V FL+S
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 329 AAFITGQVLCIDGG 342
A IT LC+DGG
Sbjct: 243 AGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6372   TCRTETA       51     9e-09    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 50.6 bits (121), Expect = 9e-09
Identities = 59/267 (22%), Positives = 96/267 (35%), Gaps = 15/267 (5%)

Query: 89 FLLSIIAGALADNYSRRNLMFAGWCVIASSSTMLTVLAGLGIFNPWMVLAFSCLAGVGAA 148
F + + GAL+D + RR ++ A ++ L W++ +AG+ A
Sbjct: 57 FACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFL-----WVLYIGRIVAGITGA 111

Query: 149 FTDPAWHASVGDILRKRDVPAAVTLISVGYNAVRSIGPALGGVVVASFGPLTAFAVAT-- 206
T A + DI + +S + GP LGG++ F P F A
Sbjct: 112 -TGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLM-GGFSPHAPFFAAAAL 169

Query: 207 --LTYLMLLWTIGRCKWQVRPSPLPSEPLTTAIHDGARFTALSSEIKAAIARGALFGLTS 264
L +L + + R PL E L R+ + + A +A + L
Sbjct: 170 NGLNFLTGCFLLPESHKGER-RPLRREALNPLA--SFRWARGMTVVAALMAVFFIMQLVG 226

Query: 265 ISILALLPLVARDQLGGGPVVYGILMAGFG-TGALFAGICNNILRRRLSQERLTTLSCIA 323
AL + D+ GI +A FG +L + + RL + R L IA
Sbjct: 227 QVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIA 286

Query: 324 CAACCLSLAFTPSVAVAAIALALGGAG 350
+ LAF +A + L +G
Sbjct: 287 DGTGYILLAFATRGWMAFPIMVLLASG 313


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6385   AUTOINDCRSYN  152    2e-48    Autoinducer synthesis protein signature.
		>AUTOINDCRSYN#Autoinducer synthesis protein signature.

Length = 216

Score = 152 bits (385), Expect = 2e-48
Identities = 41/164 (25%), Positives = 65/164 (39%), Gaps = 3/164 (1%)

Query: 12 MIQLITPGLYSEFAGELKEMHGLRYRVFKERLDWEVQTGGEMETDTFDDLKPVYLLLKGS 71
M+++ + E+ LR FK+RL+W VQ ME D +D+ YL
Sbjct: 1 MLEIFDVNHTLLSETKSGELFTLRKETFKDRLNWAVQCTDGMEFDQYDNNNTTYLFGIKD 60

Query: 72 DWRIRGCVRLLPTTGPTMLRDTFPALLGEAVAPASPDIWESSRFALDLPPSTPKAAGGLA 131
+ + +R + T P M+ TF E + + ESSRF +D G
Sbjct: 61 N-TVICSLRFIETKYPNMITGTFFPYFKE-INIPEGNYLESSRFFVD-KSRAKDILGNEY 117

Query: 132 QATYELFAGMIEFGLANNLTRIVTVTDTRMERILRLATWPLSRI 175
+ LF MI + I T+ M IL+ + W + +
Sbjct: 118 PISSMLFLSMINYSKDKGYDGIYTIVSHPMLTILKRSGWGIRVV 161


48  mlr6403  mlr6427  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mlr6403   0       20       -3.107773  conjugal transfer protein TrbF
mlr6404   0       23       -3.607736  conjugal transfer protein trbG
mlr6405   -1      19       -3.822559  conjugal transfer protein trbI
msr6406   -1      15       -4.011709  hypothetical protein
mll6408   -2      15       -3.790810  coproporphyrinogen III oxidase
mlr6409   -2      14       -3.617824  transcriptional regulator
mll6410   -1      12       -3.184865  hypothetical protein
mlr6411   0       11       -3.012167  cbb3-type cytochrome C oxidase subunit I
mlr6412   -1      10       -1.101312  cbb3-type cytochrome C oxidase subunit II
msr6413   -2      12       -0.882933  cytochrome-c oxidase subunit FixQ
mlr6414   -2      12       -0.989565  cytochrome-c oxidase subunit FixP
mlr6415   -2      13       -0.912520  nitrogen fixation protein fixG
mlr6416   -2      15       0.016550   nitrogen fixation protein fixH
mlr6417   -1      14       0.904639   nitrogen fixation protein fixI
msr6418   0       18       0.188172   nitrogen fixation protein fixS
msl6419   2       18       0.891757   symbiosis island integrase
mll6421   3       20       0.133455   hypothetical protein
mll6423   4       21       1.359712   hypothetical protein
mll6424   4       20       1.270263   hypothetical protein
mll6426   3       21       0.070423   hypothetical protein
mlr6427   2       18       -0.012298  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6403   PF04335       60     3e-13    VirB8 type IV secretion protein
		>PF04335#VirB8 type IV secretion protein

Length = 227

Score = 60.2 bits (146), Expect = 3e-13
Identities = 36/209 (17%), Positives = 70/209 (33%), Gaps = 12/209 (5%)

Query: 15 EPVTPYQKAAQLWD-ERIGSSRVQARNWRFMALGCLTLAAGLSGGLVWQSMQSRVVPYVV 73
+ + Y + A W+ +++ ++ + +A LA + + V PYV+
Sbjct: 8 DELKAYFEEAASWERDKLAAAERSKKLAWVVAGVAGALATAGVVAVAALTPLKTVEPYVI 67

Query: 74 EVDGF-GEARAVAP--AIRDYEPSDAQIAWHLGRFIQNVRSVSTDPVLVRQNWLSAYDFA 130
VD GEA A +A + L +++ + + + +
Sbjct: 68 TVDRNTGEASIAAKLHGDATITYDEAVRKYFLATYVRYREGW--IAAAREEYFDAVMVMS 125

Query: 131 TDRAA-LFLNEYAKAN--DPFGQIGT-RSVSVQVTSVVRASDSSFQVKWAEQVFERGSLA 186
+ Y N P + V V++ V + QV + + GS +
Sbjct: 126 ARPEQDRWSRFYKTDNPQSPQNILANRTDVFVEIKRVSFLGGNVAQVYF-TKESVTGSNS 184

Query: 187 STTRWTAILTIVIRP-PSNTDQLRNNPLG 214
+ T A + + PS NPLG
Sbjct: 185 TKTDAVATIKYKVDGTPSKEVDRFKNPLG 213


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6405   PRTACTNFAMLY  30     0.026    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 29.6 bits (66), Expect = 0.026
Identities = 28/93 (30%), Positives = 39/93 (41%), Gaps = 13/93 (13%)

Query: 94 GIMPKAPVLGPPLPGDLG----RPILERQRQLGIAPGQDISAEEQRLAQQAIEARESQVL 149
G +P V G +PG G P+L+ G D+S LAQ +EA E
Sbjct: 267 GAVPGGAVPGGAVPGGFGPGGFGPVLDG------WYGVDVSGSSVELAQSIVEAPELGAA 320

Query: 150 FRIDNRPRQTDVAGGGPSAQQPSEALPQSGATR 182
R+ R T V+GG S P + ++G R
Sbjct: 321 IRVGRGARVT-VSGG--SLSAPHGNVIETGGAR 350


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6427   INTIMIN       30     0.016    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 30.4 bits (68), Expect = 0.016
Identities = 23/102 (22%), Positives = 36/102 (35%), Gaps = 1/102 (0%)

Query: 3 HSGNLSHLAPPDFRKDATLDRLAERFNVAQFVSFSPSPSGPRQEYCRLAGLPANHQFATA 62
H+ L+ ++P D K D A + Q S A
Sbjct: 141 HTNKLTKMSP-DVTKSNMTDDKALNYAAQQAASLGSQLQSRSLNGDYAKDTALGIAGNQA 199

Query: 63 SEAVQALFERSGEGTVNIRTFSEASSQSREFLYALQDSEAVL 104
S +QA + G VN+++ + S +FL DSE +L
Sbjct: 200 SSQLQAWLQHYGTAEVNLQSGNNFDGSSLDFLLPFYDSEKML 241


49  mll6476  mlr6488  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mll6476   2       11       -2.037361  hypothetical protein
mll6477   2       10       -2.323033  hypothetical protein
mll6478   0       12       -2.968367  hypothetical protein
mll6479   0       12       -2.870129  pilus assembly protein
mll6480   -1      15       -2.751171  pilus assembly protein
msl6482   -2      19       -4.193220  pilin subunit
mlr6483   -3      18       -4.532812  secretory protein kinase
mlr6484   1       24       -5.293628  hypothetical protein
mlr6486   1       22       -4.989334  hypothetical protein
mlr6487   2       18       -5.067496  hypothetical protein
mlr6488   0       15       -3.000338  type IV prepilin peptidase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll6479   BCTERIALGSPD  156    2e-43    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.
Length = 660

Score = 156 bits (395), Expect = 2e-43
Identities = 67/262 (25%), Positives = 115/262 (43%), Gaps = 14/262 (5%)

Query: 175 QQVNLEVRILEAKRNAGRDLGVSIRSNNSRSTTIVGTGIA---AVDKDNVVLGTGGFLSD 231
QV +E I E + G +LG+ + N+ T +G+ A+ N G S
Sbjct: 345 PQVLVEAIIAEVQDADGLNLGIQWANKNAGMTQFTNSGLPISTAIAGANQYNKDGTVSSS 404

Query: 232 LLSTSTPFGALLTRVIDNNIKVDLYIEALEAKGAVRTLANPNLTTLSGEQASFNAGGEVP 291
L S + F + N + + AL + LA P++ TL +A+FN G EVP
Sbjct: 405 LASALSSFNGIAAGFYQGNW--AMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVP 462

Query: 292 -----IRTLDKNGEISIVYKQFGVNLLFTPVVLDDGKIHMNLAPEVSDLN----GFTTAG 342
T N ++ K G+ L P + + + + + EVS + ++
Sbjct: 463 VLTGSQTTSGDNIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSVADAASSTSSDL 522

Query: 343 DPIFTNRKLSTVVELRDGQSFAVGGLLSSRTTKLQNQVPWLGQVPVIGTLFRNSSNQKEE 402
F R ++ V + G++ VGGLL + ++VP LG +PVIG LFR++S + +
Sbjct: 523 GATFNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSK 582

Query: 403 TELVVIVTPHIVRPVKPGEQLA 424
L++ + P ++R Q +
Sbjct: 583 RNLMLFIRPTVIRDRDEYRQAS 604


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6484   BCTERIALGSPF  30     0.016    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.
Length = 408

Score = 29.8 bits (67), Expect = 0.016
Identities = 23/95 (24%), Positives = 45/95 (47%), Gaps = 6/95 (6%)

Query: 163 LRAGHPTTVAIALVAREMPDP-LGTEFGIVSDEISFGLSLEQAVRKLSERVGFEGLHLLS 221
+ A P A+ VA++ P L V ++ G SL A++ FE L+
Sbjct: 81 VAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLADAMKCFPGS--FERLY--- 135

Query: 222 VSLSIQAKTGGNLTEILSNLSSVLRERRKLRMKIR 256
++ +T G+L +L+ L+ +R+++R +I+
Sbjct: 136 CAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQ 170


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6486   BCTERIALGSPF  32     0.002    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.
Length = 408

Score = 32.5 bits (74), Expect = 0.002
Identities = 27/129 (20%), Positives = 48/129 (37%), Gaps = 26/129 (20%)

Query: 193 HKWLGIQLSIVTLQL----RAGKPLREALRELADRIGLDEARALAVLFRQSEELGTSLTD 248
+ L+++T QL A PL EAL +A + L R G SL D
Sbjct: 63 IRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLAD 122

Query: 249 ALR--------IYSDEMRT-----------QRIMNAEERANALPVKMMIPLGLCIFPVVM 289
A++ +Y + R+ + E+ + ++ + I+P V+
Sbjct: 123 AMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAM---IYPCVL 179

Query: 290 MVVMLPVLI 298
VV + V+
Sbjct: 180 TVVAIAVVS 188


50  mlr6541  mlr6568  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mlr6541   2       16       -1.995443  hypothetical protein
mlr6543   2       16       -1.775279  *two-component regulator
mlr6544   2       15       -1.848875  ATPase AAA
mlr6545   2       13       -2.165459  hypothetical protein
mlr6546   1       14       -2.498806  hypothetical protein
mlr6547   1       12       -2.687368  hypothetical protein
mlr6548   0       19       -1.278320  hypothetical protein
mlr6550   0       27       -1.789725  hypothetical protein
mlr6551   2       24       -1.991896  hypothetical protein
msr6553   2       22       -1.880069  hypothetical protein
mlr6554   1       21       -1.070198  hypothetical protein
mlr6556   0       20       -0.968774  hypothetical protein
mlr6557   2       17       -0.218593  hypothetical protein
mlr6558   1       16       0.716465   hypothetical protein
mlr6559   0       17       1.158831   hypothetical protein
msr6560   0       18       0.847692   hypothetical protein
mlr6561   0       18       0.647160   hypothetical protein
mlr6562   0       18       -0.385718  hypothetical protein
mlr6564   -1      19       -1.298758  hypothetical protein
mlr6565   -1      21       -2.362335  hypothetical protein
mlr6566   1       23       -3.308770  hypothetical protein
mlr6567   1       21       -3.267600  hypothetical protein
mlr6568   0       20       -3.367380  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6547   RTXTOXIND     31     0.007    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 31.3 bits (71), Expect = 0.007
Identities = 6/49 (12%), Positives = 19/49 (38%), Gaps = 3/49 (6%)

Query: 306 RYLPAKNQLAAATSDLAATEAAI---KRATTDLEQKKSSVEADYKANVS 351
L +N+ A ++L ++ + + +++ V +K +
Sbjct: 253 AVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEIL 301


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6558   NAFLGMOTY     29     0.031    Sodium-type flagellar protein MotY precursor signature.
		>NAFLGMOTY#Sodium-type flagellar protein MotY precursor signature.

Length = 293

Score = 28.9 bits (64), Expect = 0.031
Identities = 22/78 (28%), Positives = 40/78 (51%), Gaps = 4/78 (5%)

Query: 250 IVGVARAGEESGKDSKGRSAGERLKAFIKDPDKQPVLRLRQPVFTQAEAEKRASAALNER 309
I+ R G++ K SK R A ++ +++ Q + + +T + K S +L+ER
Sbjct: 184 ILHYERQGDQLTKASKKRLA--QIADYVRH--NQDIDLVLVATYTDSTDGKSESQSLSER 239

Query: 310 AKEFLKGEAEAIGLPDIR 327
E L+ E++GLP+ R
Sbjct: 240 RAESLRTYFESLGLPEDR 257


51  mlr6601  mll6610  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mlr6601   -1      20       -3.434246  hypothetical protein
msr6604   -1      21       -3.405519  hypothetical protein
mll6606   -1      21       -3.253296  response regulator FixJ
mll6607   -1      23       -3.704136  two-component, nitrogen fixation sensor protein
mlr6608   -1      23       -3.408083  hypothetical protein
mlr6609   -1      23       -3.364752  Mg2+ transport ATPase
mll6610   -1      31       -3.676051  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll6606   HTHFIS        118    2e-33    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 118 bits (297), Expect = 2e-33
Identities = 39/144 (27%), Positives = 66/144 (45%)

Query: 9 VVDDDVDVRKSLGFLLATADFAVRLYESATAFLSTATGKLEGCIVTDVRMPGIDGIEFLR 68
V DDD +R L L+ A + VR+ +A +VTDV MP + + L
Sbjct: 8 VADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLP 67

Query: 69 QLRASGHTIPVIVMTGHADVALAVQAMKEGAADFIEKPFDDEMLIEAIRSALANRNQAHA 128
+++ + +PV+VM+ A++A ++GA D++ KPFD LI I ALA + +
Sbjct: 68 RIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRRPS 127

Query: 129 AHPQSADIRDRLSTLSERERQVLD 152
+ L S +++
Sbjct: 128 KLEDDSQDGMPLVGRSAAMQEIYR 151


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll6607   PF06580       32     0.007    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 31.8 bits (72), Expect = 0.007
Identities = 30/158 (18%), Positives = 60/158 (37%), Gaps = 28/158 (17%)

Query: 341 MLRDAVERAAEQALRAGDVIRHLRDFVARGESERQVERLPVLIEE----AASLALVGARE 396
++ + +A E +++R+ R + RQV L +E + L L +
Sbjct: 185 LILEDPTKAREMLTSLSELMRY----SLRYSNARQV----SLADELTVVDSYLQLASIQF 236

Query: 397 INLL-VSYKLDPAAELVLTDRIQIQQVLLNLMRNAVEAMQGSPRRELKVTTVARDDGMAE 455
+ L +++PA V + +Q ++ N +++ + + + LK T +D+G
Sbjct: 237 EDRLQFENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGT---KDNGTVT 293

Query: 456 VSVIDTGPGLAPEVSAQLFQPFVTTKKHGMGVGLSICR 493
+ V +TG K G GL R
Sbjct: 294 LEVENTGSLALKNT------------KESTGTGLQNVR 319


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6608   SECA          29     0.023    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 29.1 bits (65), Expect = 0.023
Identities = 28/111 (25%), Positives = 38/111 (34%), Gaps = 17/111 (15%)

Query: 93 VCSPTRFLSVSARAADLIVTGQAGDNVFRAVDVGSLTLGAGRPVLVAATNVE-------- 144
V PT + DL+ +A D+ T G+PVLV ++E
Sbjct: 410 VVVPTNRPMIRKDLPDLVYMTEAEKIQAIIEDIKERTA-KGQPVLVGTISIEKSELVSNE 468

Query: 145 ---HVLAKTVLVAWKDTREARRAMADALPFLAKASEVVIATIDTERGESIR 192
+ VL A EA +A A + V IAT RG I
Sbjct: 469 LTKAGIKHNVLNAKFHANEA-AIVAQA----GYPAAVTIATNMAGRGTDIV 514


52  mll6640  mlr6656  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mll6640   -1      26       -3.829364  transcriptional regulator
mlr6641   0       25       -5.032156  transcriptional regulator
mll6642   0       17       -3.263011  N-amidino-scyllo-inosamine-4-phosphate
mll6643   1       16       -2.496072  hypothetical protein
mll6645   -1      14       -3.299846  oxidoreductase
mlr6646   -1      12       -3.734525  hypothetical protein
mll6647   0       12       -3.486081  hypothetical protein
mlr6648   0       13       -3.438583  acetolactate synthase, large subunit
mlr6649   0       15       -4.058901  histidinol dehydrogenase
mlr6651   0       21       -4.918689  ABC transporter substrate-binding protein
mlr6652   0       23       -4.603048  ABC transporter permease
mlr6653   0       22       -4.303806  ABC transporter permease
mlr6654   0       23       -3.965910  ABC transporter ATP-binding protein
mlr6655   0       24       -3.920400  dehydrogenase subunit
mlr6656   1       24       -3.667760  dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll6640   HTHFIS        54     4e-10    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 53.7 bits (129), Expect = 4e-10
Identities = 19/57 (33%), Positives = 28/57 (49%), Gaps = 4/57 (7%)

Query: 259 PMPAADLLGWTGPEILAEAERGVLQRALARADGNVSAAAQALGISRATLHRKLNRLD 315
+P + L +LAE E ++ AL GN AA LG++R TL +K+ L
Sbjct: 422 ALPPSGLYD----RVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELG 474


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6654   PF05272       34     0.001    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 33.9 bits (77), Expect = 0.001
Identities = 15/56 (26%), Positives = 21/56 (37%), Gaps = 9/56 (16%)

Query: 32 LVLLGPSGCGKSTLLNMIAGLESITSGEIRIAHEVVNDLPPKDRDIAMVFQSYALY 87
+VL G G GKSTL+N + GL+ + I +D Y
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDI---------GTGKDSYEQIAGIVAY 645


53  mll6677  mll6708  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mll6677   0       27       -3.153533  transcriptional activator
mll6679   0       25       -2.975694  stress-induced protein
mlr6681   0       24       -2.612888  transcriptional regulator
mlr6682   -1      20       -1.094231  epoxide hydrolase
mlr6683   -1      21       -1.246133  epoxide hydrolase
mlr6684   1       14       -1.284755  hypothetical protein
mll6685   2       14       -1.068324  hypothetical protein
mll6686   3       13       -1.052724  short-chain dehydrogenase
mll6687   1       13       0.558030   hypothetical protein
mll6688   0       12       1.560337   hypothetical protein
mlr6689   1       13       1.546395   two-component sensor
mlr6690   1       15       2.375618   two-component response regulator
mlr6691   1       14       1.950026   two-component response regulator
mlr6692   2       14       1.430383   O-linked GlcNAc transferase
mlr6693   0       13       -0.608752  hypothetical protein
mlr6694   0       14       -3.440497  hypothetical protein
mll6695   1       19       -4.319239  hypothetical protein
mll6696   2       20       -4.997424  DNA ligase
mll6698   3       22       -5.766897  hypothetical protein
mll6699   3       21       -5.047219  hypothetical protein
mll6700   3       21       -4.867130  hypothetical protein
mll6702   2       26       -4.695018  menaquinone biosynthesis methyltransferase
mll6703   1       29       -5.074111  hypothetical protein
mlr6704   1       28       -4.643676  hypothetical protein
mlr6706   0       26       -4.290670  transposase
mlr6707   1       25       -3.986683  transposase
mll6708   0       21       -3.204251  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll6686   DHBDHDRGNASE  79     3e-19    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.
Length = 261

Score = 78.6 bits (193), Expect = 3e-19
Identities = 54/184 (29%), Positives = 93/184 (50%), Gaps = 2/184 (1%)

Query: 6 AVITGASGGIGAVYVDRLAERGYDLVLVARNGDKLTQVANRVRAKTGRKIDTLSADLANA 65
A ITGA+ GIG LA +G + V N +KL +V + ++A+ R + AD+ ++
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAE-ARHAEAFPADVRDS 69

Query: 66 SDLARVEA-FLRETPDVTLLVNNAGLGGALKLLDSDVDQMTSLISLNVTALTRLTYAIVP 124
+ + + A RE + +LVN AG+ + ++ + S+N T + + ++
Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSK 129

Query: 125 GFVARAAGTIINIASIVAINPESLNGVYGGSKAFVVAFSQNLRHELAGTGVRVQVVLPGA 184
+ R +G+I+ + S A P + Y SKA V F++ L ELA +R +V PG+
Sbjct: 130 YMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGS 189

Query: 185 TATD 188
T TD
Sbjct: 190 TETD 193


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll6687   SECA          27     0.034    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 27.1 bits (60), Expect = 0.034
Identities = 20/65 (30%), Positives = 35/65 (53%), Gaps = 4/65 (6%)

Query: 89 RDVPLPAWSEGLRQA-KYPEHV-VRHLS-AMAELTKQGRYDRMTDTLRKLTGEAPTNMRD 145
R + WS+GL QA + E V +++ + +A +T Q Y R+ + L +TG A T +
Sbjct: 342 RTMQGRRWSDGLHQAVEAKEGVQIQNENQTLASITFQN-YFRLYEKLAGMTGTADTEAFE 400

Query: 146 FVKLH 150
F ++
Sbjct: 401 FSSIY 405


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6689   HTHFIS        60     1e-11    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 60.2 bits (146), Expect = 1e-11
Identities = 29/136 (21%), Positives = 56/136 (41%), Gaps = 13/136 (9%)

Query: 388 RLRVLICEDETDVATVIAALLDSEGFSSDVAPDIATAKALLQSRDYAALTLDIKLAEESG 447
+L+ +D+ + TV+ L G+ + + AT + + D + D+ + +E+
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 448 IKLFHDIRASPVNSDIAVIVISAVADEARRSLNGTAV-----GIVDWLEKPVDSGRLHAA 502
L I+ D+ V+V+SA TA+ G D+L KP D L
Sbjct: 63 FDLLPRIKK--ARPDLPVLVMSAQNTFM------TAIKASEKGAYDYLPKPFDLTELIGI 114

Query: 503 LAKIVASRNEQRPKIL 518
+ + +A + K+
Sbjct: 115 IGRALAEPKRRPSKLE 130



Score = 55.6 bits (134), Expect = 3e-10
Identities = 17/68 (25%), Positives = 28/68 (41%), Gaps = 1/68 (1%)

Query: 513 QRPKILHVEDDEGVLAVMSAGLGS-DVSIISAKTLQEARRAVAKRHFDLVILDIALPDGS 571
IL +DD + V++ L + R +A DLV+ D+ +PD +
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 572 GLDLLADL 579
DLL +
Sbjct: 62 AFDLLPRI 69


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6690   HTHFIS        78     4e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 78.3 bits (193), Expect = 4e-20
Identities = 37/125 (29%), Positives = 58/125 (46%), Gaps = 3/125 (2%)

Query: 6 ARILYVDDEDDIREIAQMSLELDPEFEVRSCSSGAAALTDAAAWHPDLILLDVMMPDMDG 65
A IL DD+ IR + +L ++VR S+ A AA DL++ DV+MPD +
Sbjct: 4 ATILVADDDAAIRTVLNQALS-RAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 66 PETLKRLAASPLTASIPVAFITARTQTHQVERYLAMGAVGVIAKPFDPLALAGEVRKLLS 125
+ L R+ +PV ++A+ + GA + KPFD L G + + L+
Sbjct: 63 FDLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 126 EHPGR 130
E R
Sbjct: 121 EPKRR 125


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6691   HTHFIS        90     2e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 90.3 bits (224), Expect = 2e-24
Identities = 37/126 (29%), Positives = 61/126 (48%), Gaps = 2/126 (1%)

Query: 5 KARVLICDDDPLLLELMEFRLRAKGYEVITAVDGAEALAKAEQHGPDIIVLDAMMPKADG 64
A +L+ DDD + ++ L GY+V + A D++V D +MP +
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 65 LEVLARLKGDPVLSDTPVVMLTARKAERDIVSALEKGADDYLVKPFIPEELLARLARLIA 124
++L R+K D PV++++A+ + A EKGA DYL KPF EL+ + R +A
Sbjct: 63 FDLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 125 RKNGKR 130
+
Sbjct: 121 EPKRRP 126


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6692   SYCDCHAPRONE  38     3e-05    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.
Length = 168

Score = 38.0 bits (88), Expect = 3e-05
Identities = 25/125 (20%), Positives = 54/125 (43%), Gaps = 7/125 (5%)

Query: 27 QTVDELYASAVKARQARHFDEAVDLLRRALALKPDNADALVQLG--FAELGRNDLAAARD 84
T+++LY+ A Q+ +++A + + L ++ + LG +G+ DLA
Sbjct: 34 DTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAI--H 91

Query: 85 AFSKALSLAPTYQDASFGLAEIEFRSGNLDAA---LPLAESVARAQPGNTDAAALVENIR 141
++S + F AE + G L A L LA+ + + + + V ++
Sbjct: 92 SYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTEFKELSTRVSSML 151

Query: 142 KAMQA 146
+A++
Sbjct: 152 EAIKL 156



Score = 31.8 bits (72), Expect = 0.004
Identities = 18/75 (24%), Positives = 31/75 (41%)

Query: 261 DVLASSPDNVEALDLDAKVALLEADYTRAGQSFQRVLAIDPRNAEALVGIGDVRRAQGDD 320
+ S D +E L A Y A + FQ + +D ++ +G+G R+A G
Sbjct: 27 MLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQY 86

Query: 321 DAARQAYREALAIEP 335
D A +Y ++
Sbjct: 87 DLAIHSYSYGAIMDI 101



Score = 31.4 bits (71), Expect = 0.005
Identities = 11/53 (20%), Positives = 23/53 (43%)

Query: 181 AGKLPEAEKVYRRALGLAPKNTDILVALGLIVGSSQRFDEAGHFFDRALAIKP 233
+GK +A KV++ L ++ + LG + ++D A H + +
Sbjct: 49 SGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDI 101


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll6702   TCRTETOQM     28     0.041    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.
Length = 639

Score = 27.9 bits (62), Expect = 0.041
Identities = 19/116 (16%), Positives = 37/116 (31%), Gaps = 26/116 (22%)

Query: 144 VAKLFGHFEMDVELGPKSRRYLVEAGFEDIRV----------ESFMVTNLDGDPQDFADV 193
+ G +M+V +Y VE ++ V E + + +P
Sbjct: 387 ILSFLGKVQMEVTCALLQEKYHVEIEIKEPTVIYMERPLKKAEYTIHIEVPPNPF----- 441

Query: 194 IVAWADVYAGEMATRRGDGPEFIARFRQG-----FQDHIFAALH---PKGYAGWPI 241
WA + G G ++ + G FQ+ + + +G GW +
Sbjct: 442 ---WASIGLSVSPLPLGSGMQYESSVSLGYLNQSFQNAVMEGIRYGCEQGLYGWNV 494


54  mll6735  msl6763  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mll6735   0       17       -3.493187  arginine/ornithine antiporter
msr6734   -2      23       -3.865645  hypothetical protein
mlr6736   -2      24       -4.568602  arginine/ornithine antiporter
msl6737   -1      24       -3.832834  hypothetical protein
mll6738   -1      18       -2.332313  hypothetical protein
mlr6739   -1      17       -1.848467  oxidoreductase
mlr6740   -1      18       -1.332813  methyl transferase
msl6741   -1      19       -1.822546  acyl carrier protein
mll6742   -1      21       -1.838932  long chain acyl-CoA synthetase
mll6743   0       20       -1.963603  asparagine synthetase
mll6744   2       22       -2.486958  NAD synthetase
mll6746   2       24       -2.425972  pristinamycin I synthase 3
mlr6747   2       23       -2.866767  sulfate adenylyltransferase
mlr6748   1       20       -2.051024  hypothetical protein
mll6750   0       18       -1.454066  HlyA protein
mll6752   0       22       -1.560523  CDA peptide synthetase III
mlr6753   0       21       -1.438703  sugar transferase
mlr6754   0       19       -1.380598  hypothetical protein
mlr6755   -1      20       -1.228180  asparagine synthase
mlr6756   -1      23       -2.346873  exopolysaccharide biosynthesis protein
mlr6757   -1      27       -3.400151  exopolysaccharide biosynthesis protein
mlr6758   0       25       -3.406394  exopolysaccharide biosynthesis protein
mll6759   1       25       -4.093587  nucleotide sugar epimerase
mll6760   0       24       -3.908840  two component system response regulator
mlr6761   0       24       -3.894273  two component system sensor histidine kinase
mll6762   0       20       -3.092579  two component system response regulator
msl6763   2       18       -2.867465  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6747   PF05272       28     0.040    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.1 bits (62), Expect = 0.040
Identities = 14/65 (21%), Positives = 20/65 (30%), Gaps = 7/65 (10%)

Query: 149 AFLPRPDLLIQLDAPQETLR-RRLMDRHAQQPLLEVLLFELGVDRGLRQADISRDIGKCL 207
A L R A Q+ A L+ LG D G + + L
Sbjct: 773 ALLTREGAPAAEGAAQKGYSVNTTFVTIAD------LVQALGADPGKSSPMLEGQVRDWL 826

Query: 208 RQDGW 212
++GW
Sbjct: 827 NENGW 831


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll6750   CABNDNGRPT    63     9e-13    NodO calcium binding signature.
		>CABNDNGRPT#NodO calcium binding signature.

Length = 479

Score = 63.1 bits (153), Expect = 9e-13
Identities = 42/178 (23%), Positives = 65/178 (36%), Gaps = 14/178 (7%)

Query: 272 PTSTVPSSSDHTFYGTGGADALHGTTGADTMVG----GGYNDTYYVNNVGDKVVELAGGG 327
+T S + F D T + ++ G DT+ + + G
Sbjct: 261 NMTTRTGDSVYGFNSNTDRDFYTATDSSKALIFSVWDAGGTDTFDFSGYSNNQRINLNEG 320

Query: 328 NDTVLASVSY--ALSAGSEIEHLAIASKSDTTTMNLKGNEFSQTIDGNAGNNVINGGGGK 385
+ + + + +++ G IE+ S +D L GN + G AGN+V+ GG G
Sbjct: 321 SFSDVGGLKGNVSIAHGVTIENAIGGSGNDI----LVGNSADNILQGGAGNDVLYGGAGA 376

Query: 386 DVLTGNGGHDYFFFNAA--LKTGNVDKITDFNVAQDKIVLDHSVFTGLQTGALPTSAF 441
D L G G D F + + D I DF DKI D S F + F
Sbjct: 377 DTLYGGAGRDTFVYGSGQDSTVAAYDWIADFQKGIDKI--DLSAFRNEGQLSFVQDQF 432


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll6759   NUCEPIMERASE  525    0.0      Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 525 bits (1354), Expect = 0.0
Identities = 181/331 (54%), Positives = 224/331 (67%), Gaps = 1/331 (0%)

Query: 10 IVVTGTAGFIGFHVASRLLRRGLAVIGVDNFTPYYDVGLKEARFAQLCAEPGFTPMQMDL 69
+VTG AGFIGFHV+ RLL G V+G+DN YYDV LK+AR L A+PGF ++DL
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELL-AQPGFQFHKIDL 61

Query: 70 ADQALVKALFSDFQPSHFVHLAAQAGVRYSLADPHAYVQSNIVAFLNVLEGCRHAGVSHL 129
AD+ + LF+ + VRYSL +PHAY SN+ FLN+LEGCRH + HL
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121

Query: 130 VYASSSSVYGANRSIPFSEHHGASHPVSFYAATKSANECMAHSYSHLFGLPVTGLRFFTV 189
+YASSSSVYG NR +PFS HPVS YAATK ANE MAH+YSHL+GLP TGLRFFTV
Sbjct: 122 LYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFFTV 181

Query: 190 YGPWGRPDMAVYTFTHAIAEGRTIEIANAGRVWRDFTYIDDIVEGVVRVLAAPPRPDPDW 249
YGPWGRPDMA++ FT A+ EG++I++ N G++ RDFTYIDDI E ++R+ P D W
Sbjct: 182 YGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHADTQW 241

Query: 250 DSRAAAPATSSAPYRIYNIGNDRPEEINRLIAIIETALGRRAVRVNVPLPPGDVLKTRAD 309
PA S APYR+YNIGN P E+ I +E ALG A + +PL PGDVL+T AD
Sbjct: 242 TVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQPGDVLETSAD 301

Query: 310 VSDLRGAVGFAPATALEDGVQRFVEWYRDFH 340
L +GF P T ++DGV+ FV WYRDF+
Sbjct: 302 TKALYEVIGFTPETTVKDGVKNFVNWYRDFY 332


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6761   NAFLGMOTY     30     0.026    Sodium-type flagellar protein MotY precursor signature.
		>NAFLGMOTY#Sodium-type flagellar protein MotY precursor signature.

Length = 293

Score = 29.7 bits (66), Expect = 0.026
Identities = 22/63 (34%), Positives = 31/63 (49%), Gaps = 8/63 (12%)

Query: 303 PLEELSAVVHP-ADWPKLAASAKPSSAVNYDVEIRVRRTDGHIRWVALRG-----RQEEH 356
PLE +VHP + S++ S +N D E+++RR G R V+L R EH
Sbjct: 43 PLE--CQLVHPIPSFGDAVFSSRASKKINLDFELKMRRPMGETRNVSLISMPPPWRPGEH 100

Query: 357 GDR 359
DR
Sbjct: 101 ADR 103


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll6762   HTHFIS        48     1e-08    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 47.9 bits (114), Expect = 1e-08
Identities = 32/155 (20%), Positives = 54/155 (34%), Gaps = 16/155 (10%)

Query: 3 RIVIADDHGLYRRGLRLALMAGIPSVEIFEAACFDAVVNLLEEQASIDLAILDLNMPGLF 62
I++ADD R L AL V I A + + DL + D+ MP
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNA--ATLWRWIAAGDG-DLVVTDVVMPDEN 61

Query: 63 NQEVLSDVLAAYPDTRFAIVSGDDSRSEILTALSIGLHGYIVKSQKDEEVVLAVNEILAG 122
++L + A PD ++S ++ + A G + Y+ K E++ + L
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL-- 119

Query: 123 RIYVPALLSRTSADQRSYAALPSARNPIRQRVGSS 157
+ +R + L VG S
Sbjct: 120 -----------AEPKRRPSKLEDDSQDGMPLVGRS 143


55  msl6830  mll6838  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias    Product
msl6830   6       18       -2.345362  hypothetical protein
mll6832   5       18       -2.101420  hypothetical protein
msl6833   5       17       -2.007257  hypothetical protein
mll6834   5       13       -1.583279  hypothetical protein
mll6836   4       12       -1.393963  hypothetical protein
mll6838   3       10       -1.209281  hypothetical protein
56  mlr6950  msl8708  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mlr6950   1       15       -3.197709  GMP synthase
mlr6952   2       22       -3.846315  CP4-like integrase
mll6953   4       23       -3.845245  hypothetical protein
msl6954   5       24       -3.808145  hypothetical protein
mlr6955   5       23       -2.796673  conjugal transfer protein TRBI
msr6956   3       23       -3.421744  hypothetical protein
mll6957   0       23       -4.495218  hypothetical protein
mlr6958   0       28       -7.122476  transcriptional regulator
mlr6959   1       25       -6.735480  hypothetical protein
mlr6961   1       24       -6.861645  transposase
mlr6962   1       26       -7.164650  transposase
mlr6963   1       27       -6.918437  hypothetical protein
mlr6964   2       28       -6.652794  ABC transporter binding protein
mlr6965   1       30       -6.173521  ABC transporter ATP-binding protein
mlr6966   1       30       -5.938117  ABC transporter permease
mlr6967   1       29       -5.425705  ABC transporter permease
mlr6968   1       29       -5.115467  acetylpolyamine aminohydrolase
mlr6969   1       26       -4.014010  aldehyde dehydrogenase
mll6970   2       25       -3.367070  transcriptional regulator
mll6971   2       24       -2.725221  transcriptional regulator
mlr6972   2       24       -2.671092  glycine cleavage system protein T
mlr6973   1       21       -2.543923  glycine cleavage system protein T
mlr6974   0       20       -2.679402  acyl-CoA synthetase
mlr6975   -1      18       -3.561018  acyl-CoA dehydrogenase
mlr6976   0       19       -3.798022  enoyl-CoA hydratase
msl8708   0       21       -4.099280  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mll6957  NUCEPIMERASE  62  3e-13  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 62.5 bits (152), Expect = 3e-13
Identities = 63/354 (17%), Positives = 123/354 (34%), Gaps = 75/354 (21%)

Query: 3 RVVIIGGSGHVGTYLVPRLVEAGYEVVNVS----------RGQRAAYTLNAAWKSVEPVV 52
+ ++ G +G +G ++ RL+EAG++VV + + R ++ + +
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 53 IDRDTEEKAGTFGEKVRALKADIVVDM-----ISFTLDSTK-----QIVGAL-------R 95
DR+ + + V + ++L++ + G L
Sbjct: 62 ADREGMTDL------FASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRH 115

Query: 96 GEVQHFLHCGTIWVYGHNTAIP-ATEDQPKNPFGSYGTQKAEIESWLLNEARRNGFPATV 154
++QH L+ + VYG N +P +T+D +P Y K E + G PAT
Sbjct: 116 NKIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATG 175

Query: 155 FRPGHIVGPGWEPLNPAGHFDVGVF---SQIARGEPLVLPNLGNETVHHVHADDVAQMVM 211
R + GP P D+ +F + G+ + + N G + DD+A+ ++
Sbjct: 176 LRFFTVYGPWGRP-------DMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAII 228

Query: 212 RAI----------------VSWSNAVGEAFNTVSPQAINLRGYAEALYNWFGHAPRLSYE 255
R + S A +N + + L Y +AL + G + +
Sbjct: 229 RLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNML 288

Query: 256 PFDTWKGKQTEENWRATWEHIARSPSHSIAKARNLLGYDPRYSSLQAVYESVEW 309
P +T + ++G+ P + V V W
Sbjct: 289 PLQPGDVLETSAD---------------TKALYEVIGFTPETTVKDGVKNFVNW 327


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mlr6964  MALTOSEBP  46  1e-07  Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 45.9 bits (108), Expect = 1e-07
Identities = 85/364 (23%), Positives = 141/364 (38%), Gaps = 43/364 (11%)

Query: 28 LILGLSTTVALAEGNLNIY---NWGEYTSPELIDKFSKTYNIHVTQTDFDSNDTALAKVR 84
++ S + EG L I+ + G E+ KF K I VT D + +V
Sbjct: 18 MMFSASALAKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIKVTVEHPDKLEEKFPQVA 77

Query: 85 QGSSGFDVVVPSQSFIPTYIQEGLLAETNPGQMENAKNLEERWRNPAFDPGRKYSVPWLW 144
G D++ + Y Q GLLAE P + K W ++ G+ + P
Sbjct: 78 ATGDGPDIIFWAHDRFGGYAQSGLLAEITPDKAFQDKLYPFTWDAVRYN-GKLIAYPIAV 136

Query: 145 YTSGVSVNTSVFKGDINTWKVI--LDPPAELKGKINIVPEMNDIMFA----------AIK 192
+ N + TW+ I LD + KGK ++ + + F A K
Sbjct: 137 EALSLIYNKDLLPNPPKTWEEIPALDKELKAKGKSALMFNLQEPYFTWPLIAADGGYAFK 196

Query: 193 FEGGTWCTSD--------KALLTKVRDRLLEAKKSWLSIDYS-GTLKMASGDVSASLD-- 241
+E G + D KA LT + D L++ K DYS G+ + +++
Sbjct: 197 YENGKYDIKDVGVDNAGAKAGLTFLVD-LIKNKHMNADTDYSIAEAAFNKGETAMTINGP 255

Query: 242 WSGSALKRRTQNHSIAY-----GLPKEGFTYGSDNVVVLKDAPNLENAKLF-QNFIMAPE 295
W+ S + N+ + G P + F G + + +PN E AK F +N+++ E
Sbjct: 256 WAWSNIDTSKVNYGVTVLPTFKGQPSKPFV-GVLSAGINAASPNKELAKEFLENYLLTDE 314

Query: 296 NAALNSTFAKYGAAIIGAEKYYSDDMKGAPELTIPDDMKSKGELLTLCDPKITQLYSRIW 355
+ GA A K Y +++ P + + KGE++ P I Q+ S W
Sbjct: 315 GLEAVNKDKPLGAV---ALKSYEEELAKDPRIAATMENAQKGEIM----PNIPQM-SAFW 366

Query: 356 QDVQ 359
V+
Sbjct: 367 YAVR 370


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mlr6965  PF05272  31  0.011  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.8 bits (69), Expect = 0.011
Identities = 9/31 (29%), Positives = 16/31 (51%)

Query: 51 TLLGPSGCGKTTLLRLIGGFEYPTAGTILLG 81
L G G GK+TL+ + G ++ + +G
Sbjct: 600 VLEGTGGIGKSTLINTLVGLDFFSDTHFDIG 630


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mlr6967  PF06580  31  0.007  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 30.6 bits (69), Expect = 0.007
Identities = 16/98 (16%), Positives = 37/98 (37%), Gaps = 12/98 (12%)

Query: 53 WGGFSLRWFQ--SAANNQQVISASILSLKLAAISATLSTALATLAALAMSRTPRFRGWTL 110
WG ++L F S + ++ S I ++ ++ + L+ A + + +GW L
Sbjct: 20 WGVYTLTGFGFASLYGSPKLHSM-IFNIAISLMGLVLTHAYRSF--------IKRQGW-L 69

Query: 111 AYSAISVPLMVPEIVTAVALLIVTATIRGWTGYSGLGY 148
+ + L V + ++ A W + +
Sbjct: 70 KLNMGQIILRVLPACVVIGMVWFVANTSIWRLLAFINT 107


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mlr6969  RTXTOXINA  30  0.041  Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 29.5 bits (66), Expect = 0.041
Identities = 20/62 (32%), Positives = 29/62 (46%), Gaps = 1/62 (1%)

Query: 306 LLIPADRHAEALDIARRVAAATRVGDPSSEDTDMGPVISQQQFDKIQRMIGLGIEEGATL 365
LLIP D + + V A +G D G I++Q F +++IGL E G T+
Sbjct: 51 LLIPKDYKGQGSSLNDLVRTADELGIEVQYDEKNGTAITKQVFGTAEKLIGL-TERGVTI 109

Query: 366 VA 367
A
Sbjct: 110 FA 111


57  msr8709  mlr6993  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias     Product
msr8709   -1      21       -3.346838   hypothetical protein
mll6985   -1      18       -3.241406   ABC transporter permease
mll6986   0       15       -3.889006   ABC transporter permease
mll6987   0       14       -4.104530   ABC transporter substrate-binding protein
mll6988   0       17       -3.889736   transcriptional regulator
mlr6990   1       19       -4.588735   transcriptional regulator
mlr6991   1       18       -4.313696   aminotransferase
mlr6992   2       17       -4.491469   ABC transporter substrate-binding protein
mlr6993   2       20       -3.181235   acetyl xylan esterase
58  mll7332  mlr7341  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
mll7332   2       23       -5.392273   ABC transporter binding protein
msl7333   3       34       -7.675105   hypothetical protein
mll7334   2       37       -8.885277   short chain dehydrogenase
msl7335   3       49       -12.220035  hypothetical protein
mll7336   3       44       -11.372474  acetyltransferase
mll7337   1       34       -9.310424   hypothetical protein
mll7338   0       32       -8.855790   hypothetical protein
mlr7339   0       23       -6.696548   glycosyl transferase
mlr7341   -1      11       -3.013229   O-antigen methyl transferase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mll7334  DHBDHDRGNASE  137  1e-41  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 137 bits (347), Expect = 1e-41
Identities = 82/259 (31%), Positives = 129/259 (49%), Gaps = 12/259 (4%)

Query: 15 MDLFKLDGDVALVTGAGSGIGQAIAIGLAEAGADVACFGHASKGGLEETAQQVTTLGRKA 74
M+ ++G +A +TGA GIG+A+A LA GA +A + + LE+ + R A
Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEK-LEKVVSSLKAEARHA 59

Query: 75 LVLTGTVTSQSDLAAAIDRVEAELGALTVAVNNAGIAGSEPAETLSLEKWQKVHEVNVAG 134
V + + R+E E+G + + VN AG+ +LS E+W+ VN G
Sbjct: 60 EAFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTG 119

Query: 135 VFLSCQAEARKMLARRKGSIINIASMSGTIVNRGLTQAHYNSSKAAVIHMSKSLAMEWAD 194
VF + ++ ++ M+ RR GSI+ + S + + A Y SSKAA + +K L +E A+
Sbjct: 120 VFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSM--AAYASSKAAAVMFTKCLGLELAE 177

Query: 195 RGLRVNVVSPGYTLTPMNKR---PEVAEEIKI------FKRDTPMGRMAAPEEMVGPTVF 245
+R N+VSPG T T M E E I FK P+ ++A P ++ +F
Sbjct: 178 YNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLF 237

Query: 246 LASRASSFVTGLDLIVDGG 264
L S + +T +L VDGG
Sbjct: 238 LVSGQAGHITMHNLCVDGG 256


59  mll7422  mlr7432  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
mll7422   2       11       2.292421    4-diphosphocytidyl-2-C-methyl-D-erythritol
msl7423   2       11       1.469395    hypothetical protein
mll7424   2       11       1.544618    proteinase
mll7427   1       11       1.089544    hypothetical protein
mlr7426   1       12       0.409257    octaprenyl-diphosphate synthase
mlr7428   0       12       0.254377    transcriptional regulator
mlr7429   2       10       -0.049720   hypothetical protein
mlr7430   3       10       -0.101005   hypothetical protein
mlr7432   2       10       -0.310402   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mlr7429  ACRIFLAVINRP  32  0.003  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 31.7 bits (72), Expect = 0.003
Identities = 24/152 (15%), Positives = 45/152 (29%), Gaps = 33/152 (21%)

Query: 83 QVGAVVLGTLFLALSSYIEVPMVPVPV--TMQTFAVTLIGALYGWRLGAVTIAAWLVEGA 140
Q+ ++G + + +I + + F++T++ A+ L A+ + L
Sbjct: 437 QIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALC--- 493

Query: 141 AGFPVLAGGAAGVAHFVGPTGGYLFAFPITGALVGW-------LAERGWNGNRVVLAFAA 193
A + P G GW N +L
Sbjct: 494 -------------ATLLKPVSA--EHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTG 538

Query: 194 MLLGNLACLVLGTAWLAVMIGTEKAITFGFLP 225
L A +V G L + + + FLP
Sbjct: 539 RYLLIYALIVAGMVVLFLRLPSS------FLP 564


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mlr7432  SYCDCHAPRONE  43  8e-07  Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 43.0 bits (101), Expect = 8e-07
Identities = 22/127 (17%), Positives = 46/127 (36%), Gaps = 4/127 (3%)

Query: 482 LELQPDQPQVLNYLGYSWVDMNTNLKEGLAMIQKAVDLRPSDGYIVDSLGWAYFRLGRFD 541
E+ D + L L ++ ++ + Q L D LG +G++D
Sbjct: 29 NEISSDTLEQLYSLAFNQYQSG-KYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYD 87

Query: 542 DAVREMERAVSLKPEDPVLNDHLGDAYWRVGRKLEATFQWNQARDL---KPDPDVLATLQ 598
A+ + ++P H + + G EA A++L K + L+T
Sbjct: 88 LAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTEFKELSTRV 147

Query: 599 QKLMKGL 605
+++ +
Sbjct: 148 SSMLEAI 154



Score = 37.6 bits (87), Expect = 4e-05
Identities = 30/139 (21%), Positives = 46/139 (33%), Gaps = 12/139 (8%)

Query: 354 QLAAVAEQLKDGEGAIALYRRIPDSSPLKELSDL-QLGLNLADLDRHDEAITHLKAFVDA 412
A+ LK G G IA+ I L L L N ++++A +A
Sbjct: 11 YQLAMESFLKGG-GTIAMLNEISS----DTLEQLYSLAFNQYQSGKYEDAHKVFQALCVL 65

Query: 413 HPNDMRAYLALGGVYSSKEDFRSAASLYDKAVEA-LKTPTAANWNIFYQRGIAYERLKEW 471
D R +L LG + + A Y +K P + + E
Sbjct: 66 DHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRF-----PFHAAECLLQKGEL 120

Query: 472 PKAEPNFRKALELQPDQPQ 490
+AE A EL D+ +
Sbjct: 121 AEAESGLFLAQELIADKTE 139



Score = 30.7 bits (69), Expect = 0.011
Identities = 25/137 (18%), Positives = 40/137 (29%), Gaps = 7/137 (5%)

Query: 316 EILLDLATALNRGGGEPFVRLYLQYALALRPDSDAALVQLAAVAEQLKDGEGAIALYRRI 375
E L + + L GG + + D+ L LA Q E A +++ +
Sbjct: 10 EYQLAMESFLKGGGT-------IAMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQAL 62

Query: 376 PDSSPLKELSDLQLGLNLADLDRHDEAITHLKAFVDAHPNDMRAYLALGGVYSSKEDFRS 435
L LG + ++D AI + R K +
Sbjct: 63 CVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAE 122

Query: 436 AASLYDKAVEALKTPTA 452
A S A E + T
Sbjct: 123 AESGLFLAQELIADKTE 139


60  mll7476  mlr7495  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
mll7476   2       9        1.510452    hypothetical protein
mlr7477   3       9        1.612651    hemolysin-like protein
mlr7480   2       9        1.507221    RNA methyltransferase
mlr7482   3       11       1.244445    hypothetical protein
mlr7483   2       11       -0.005747   hypothetical protein
mll7484   1       15       -0.338771   hypothetical protein
msl7485   0       18       0.231265    hypothetical protein
mll7487   1       18       0.000540    TldD protein, suppresses inhibitory activity of
mll7488   1       18       -1.011940   hypothetical protein
mlr7490   3       18       -1.724294   cytochrome C oxidase subunit II
mlr7491   5       18       -1.684090   cytochrome C oxidase subunit I
mlr7493   4       14       -1.827846   protoheme IX farnesyltransferase
msr7494   2       12       -2.265045   hypothetical protein
mlr7495   2       11       -1.178964   cytochrome C oxidase assembly protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mlr7482  PERTACTIN  34  0.002  Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 34.3 bits (78), Expect = 0.002
Identities = 33/114 (28%), Positives = 44/114 (38%), Gaps = 9/114 (7%)

Query: 671 QNSGQNPPNGKFKKL-HNQGGGQANAQANGRGNEAQGNGNGPKFHRLPASGNG-----GG 724
+NSG P +G L G A + + G +RL A+GNG G
Sbjct: 509 RNSGSEPASGNTMLLVQTPRGSAATFTLANKDGKVDI---GTYRYRLAANGNGQWSLVGA 565

Query: 725 NVQQKFKVNNGNPQLRVQGQPKPRRPEFHAQPNQPVQRQAQPRPPQPPAVKKPS 778
K P+P +P QP QP QRQ + PQPPA ++ S
Sbjct: 566 KAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAGRELS 619


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mlr7483  RTXTOXINA  29  0.004  Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 28.8 bits (64), Expect = 0.004
Identities = 21/102 (20%), Positives = 32/102 (31%), Gaps = 4/102 (3%)

Query: 14 ADT-QSAPGGFNGVSTIASLLTSIALAGFIATGVAAATTTTAPAA---ATTAAPAKKPAA 69
ADT A G + + + IA A +T+A AA A+ A P +
Sbjct: 263 ADTRTKAAAGVELTTKVLGNVGKGISQYIIAQRAAQGLSTSAAAAGLIASAVTLAISPLS 322

Query: 70 SAMTPQKTAISKQCSALADAKKLHGKAREKFRADCKKNGGKA 111
K + + + K G + A K G
Sbjct: 323 FLSIADKFKRANKIEEYSQRFKKLGYDGDSLLAAFHKETGAI 364


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mll7488  PF06776  183  2e-61  Invasion associated locus B
		>PF06776#Invasion associated locus B

Length = 214

Score = 183 bits (467), Expect = 2e-61
Identities = 87/165 (52%), Positives = 118/165 (71%)

Query: 29 AMFLAALSAAGLFVAGQANAAQPSGTVRSTHGAWSIICDTPAGATSEQCVMMQNVVAEDR 88
A + A + A G ++ A G VRS HG W I CDTP GA +EQC ++Q+VVAEDR
Sbjct: 50 ARLMLAGAMAIALSFGWSDRADAQGAVRSVHGDWQIRCDTPPGAKAEQCALIQSVVAEDR 109

Query: 89 PEMGLSVVVLRTADNKAEILRVLAPLGVLLPNGLGLNVDGKDIGRAYFVRCFQDGCYAEV 148
GL+V++L+TAD K++++RV+APLGVLLP+GLGL +D D+GRA FVRC +GC AEV
Sbjct: 110 SNAGLTVIILKTADQKSKLMRVVAPLGVLLPSGLGLKLDNVDVGRAGFVRCLPNGCVAEV 169

Query: 149 ILEKPLLDTLKTGTSATFIVFQTPEEGIGIPVDLKGFADGFAALP 193
+++ LL L+T +ATFI+F+TPEEGIG P+ L G +G+ LP
Sbjct: 170 VMDDKLLGQLRTAKTATFIIFETPEEGIGFPLSLNGIGEGYDKLP 214


61  mlr7549  mll7564  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
mlr7549   1       18       -3.220909   nucleotide sugar epimerase
mlr7550   2       24       -4.154643   glucose-1-phosphate thymidylyltransferase
mlr7551   2       25       -4.252186   dTDP-6-deoxy-D-glucose-3,5-epimerase
mlr7552   2       28       -4.797967   dTDP-D-glucose-4,6-dehydratase
mlr7553   3       35       -5.872581   dTDP-6-deoxy-L-mannose-dehydrogenase
mlr7554   2       38       -8.497070   hypothetical protein
mlr7555   1       34       -9.276580   hypothetical protein
mlr7556   1       35       -9.839945   sugar transferase
mlr7557   0       37       -9.782973   hypothetical protein
mlr7558   1       34       -9.554525   sugar nucleotide epimerase/dehydratase
mlr7559   1       33       -9.900253   hypothetical protein
mlr7560   1       31       -8.132401   hypothetical protein
mlr7561   1       31       -6.003401   hypothetical protein
mll7563   -1      23       -3.739476   hypothetical protein
mll7564   0       21       -3.126049   ABC transporter ATP-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mlr7549  NUCEPIMERASE  546  0.0  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 546 bits (1408), Expect = 0.0
Identities = 191/339 (56%), Positives = 240/339 (70%), Gaps = 6/339 (1%)

Query: 1 MKVLVTGAAGFIGYHVARRLLERGDEVVGIDSVNDYYDPRIKQARLRLLAEASRGSNAGY 60
MK LVTGAAGFIG+HV++RLLE G +VVGID++NDYYD +KQARL LLA+ G+
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQP------GF 54

Query: 61 HFIHGNLAEREIVDGCFADHDFDRVIHLAAQAGVRYSLENPRAYVESNIVAFTNMLEACR 120
F +LA+RE + FA F+RV + VRYSLENP AY +SN+ F N+LE CR
Sbjct: 55 QFHKIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCR 114

Query: 121 NAGMAHLTYASTSSVYGANTDMPFSEHRPADHPLQFYAATKRANELMAHSYSHLFGLPTT 180
+ + HL YAS+SSVYG N MPFS DHP+ YAATK+ANELMAH+YSHL+GLP T
Sbjct: 115 HNKIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPAT 174

Query: 181 GLRFFTVYGPWGRPDMALFLFTRSILAGEPIKLFNNGNHTRDFTYIDDIAEGVIRASDSP 240
GLRFFTVYGPWGRPDMALF FT+++L G+ I ++N G RDFTYIDDIAE +IR D
Sbjct: 175 GLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVI 234

Query: 241 AAGNPAWDSGHPDPATSSAPWRIFNIGNNNPVKLTAYVEALESALGRKAVIELLPLQAGD 300
+ W PA S AP+R++NIGN++PV+L Y++ALE ALG +A +LPLQ GD
Sbjct: 235 PHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQPGD 294

Query: 301 VPDTFADTTALQEAVGYRPGTSVSDGVGRFVEWYKAYFG 339
V +T ADT AL E +G+ P T+V DGV FV WY+ ++
Sbjct: 295 VLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDFYK 333


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mlr7552  NUCEPIMERASE  159  6e-48  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 159 bits (403), Expect = 6e-48
Identities = 81/357 (22%), Positives = 144/357 (40%), Gaps = 52/357 (14%)

Query: 1 MNFLVTGGAGFIGSAVCRHLCANPAYRVTNLDKLT--YAGNLASLRQIENAH-NYRFAHA 57
M +LVTG AGFIG V + L ++V +D L Y +L R A ++F
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAG-HQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKI 59

Query: 58 DICDERAVLDIMRRDDIDIVMNLAAESHVDRSIDGPGAFIETNIVGTYRILNAALEYWRG 117
D+ D + D+ + V V S++ P A+ ++N+ G IL
Sbjct: 60 DLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGC------ 113

Query: 118 LPDDRKSRFRFHHV--STDEVFGD---LPFDGGMFVEETPYAPSSPYSASKAASDHLVRA 172
R ++ + H + S+ V+G +PF +++ P S Y+A+K A++ +
Sbjct: 114 ----RHNKIQ-HLLYASSSSVYGLNRKMPFS----TDDSVDHPVSLYAATKKANELMAHT 164

Query: 173 WHETYGLPVVLSNCSNNYGPYHFPEKLIPLVILNALDEKPLPVYGAGANVRDWLFVEDHA 232
+ YGLP YGP+ P+ + L+ K + VY G RD+ +++D A
Sbjct: 165 YSHLYGLPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIA 224

Query: 233 ----RALELV--------------ATKGTPGESYNVGGNSERTNLAVVETICDLLDIRRP 274
R +++ A P YN+G +S + ++ + D L I
Sbjct: 225 EAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAK 284

Query: 275 RAGGRCYRDLISFVTDRPGHDRRYAIDASKIGRELGWAPSENFDSGLARTVDWFLDN 331
+ + + +PG + D + +G+ P G+ V+W+ D
Sbjct: 285 K----------NMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDF 331


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mlr7553  NUCEPIMERASE  55  8e-11  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 55.2 bits (133), Expect = 8e-11
Identities = 53/240 (22%), Positives = 90/240 (37%), Gaps = 46/240 (19%)

Query: 1 MRLAVTGRDG----QVVSSLLEAGQFA-GVDVIA--------------IGRP-----QLD 36
M+ VTG G V LLEAG G+D + + +P ++D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 37 LANPDTVIEAIAAARPDIVVSAAAYTAVDQAEDEPDLAFRVNAVGAGKVAQAAARLGVP- 95
LA+ + + + A+ + V + AV + + P N G + + +
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 96 VIHLSTDYVFDGSASGAYVETDATA-PASVYGASKLAGEQTVAAAGPRHL------ILRT 148
+++ S+ V+ + + D+ P S+Y A+K A E HL LR
Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYS--HLYGLPATGLRF 178

Query: 149 AWVYSPFGK------NFVKTMLRLAADRDEISVVAD--QWGNPSSALDIADAILHAAATL 200
VY P+G+ F K ML + I V + + DIA+AI+ +
Sbjct: 179 FTVYGPWGRPDMALFKFTKAML----EGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVI 234


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mlr7558  NUCEPIMERASE  159  2e-48  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 159 bits (405), Expect = 2e-48
Identities = 78/364 (21%), Positives = 132/364 (36%), Gaps = 57/364 (15%)

Query: 6 VVTGGAGFIGCALSTKLANRFDRVVVIDSLHP--QIHAERKRPADL-DPRVELVVADVTE 62
+VTG AGFIG +S +L +VV ID+L+ + ++ R L P + D+ +
Sbjct: 4 LVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDLAD 63

Query: 63 ASTWVPALETIRPEVVVHLAAETGTGQSLTEASRHAIANVLGTTRMLDAFATASHVPERF 122
+ E V SL +A +N+ G +L+ +
Sbjct: 64 REGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNK--IQHL 121

Query: 123 VLASSRAVYGEGAWRSRKAGEITYPGQRSKQQLERAEWDFPGLEPLPMDAEKTIPNPTSI 182
+ ASS +VYG +P + ++ +P S+
Sbjct: 122 LYASSSSVYGLN-------------------------------RKMPFSTDDSVDHPVSL 150

Query: 183 YGATKLTQEQILRAWALSFGTAVNVLRLQNVYGPGQSLTNSYTGIVSLFIRMAKEGKSIP 242
Y ATK E + ++ +G LR VYGP + F + EGKSI
Sbjct: 151 YAATKKANELMAHTYSHLYGLPATGLRFFTVYGPWGRPDMAL----FKFTKAMLEGKSID 206

Query: 243 IYEDGDIGRDFVFIDDVASALDKATAT-------------DLSQSLA----YDVGTGSKT 285
+Y G + RDF +IDD+A A+ + + S+A Y++G S
Sbjct: 207 VYNYGKMKRDFTYIDDIAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPV 266

Query: 286 TVLDLARIVAKRYEAPEPHVNGMFRNGDVRCAVANIDRTVRELEWRPLKSVDEGIGQLSE 345
++D + + + GDV A+ + + P +V +G+
Sbjct: 267 ELMDYIQALEDALGIEAKKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVN 326

Query: 346 WVDS 349
W
Sbjct: 327 WYRD 330


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mll7563  GPOSANCHOR  30  0.015  Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 30.4 bits (68), Expect = 0.015
Identities = 22/112 (19%), Positives = 37/112 (33%), Gaps = 4/112 (3%)

Query: 265 DEDNGLLDQLDIMRTRFDDACETFSAAMATEELAVVQLRGELSTRSTETERARQENSKLA 324
+ L + + R D + AM ++ T E ++L
Sbjct: 211 AKIKTLEAEKAALAARKADLEKALEGAMNFST----ADSAKIKTLEAEKAALEARQAELE 266

Query: 325 SFLEQQSLEVRGLAAERDALVGTRGTLIAERDALVNAQASLIAERDALLRDM 376
LE +A+ L + L AE+ L + L A R +L RD+
Sbjct: 267 KALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDL 318


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mll7564  PF05272  32  0.002  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 32.0 bits (72), Expect = 0.002
Identities = 12/37 (32%), Positives = 17/37 (45%)

Query: 60 LGLVGPNGAGKTTLLKVLYGIYQPSGGTISITGKVDA 96
+ L G G GK+TL+ L G+ S I D+
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDS 635


62  mlr7894  mll7904  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
mlr7894   2       18       1.610241    hypothetical protein
mlr7895   3       17       1.361275    transcriptional regulator
mlr7896   4       17       1.559964    hypothetical protein
mlr7900   5       17       0.869403    hypothetical protein
mll7902   5       18       0.483661    hypothetical protein
mll7904   4       12       0.621122    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mlr7895  HTHTETR  62  5e-14  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 61.6 bits (149), Expect = 5e-14
Identities = 33/191 (17%), Positives = 61/191 (31%), Gaps = 13/191 (6%)

Query: 7 SDKRQHVVETAYALFKRAGFHATGVDRIIAEADVAKMTMYRHFPSKDELIVAVLDYRARR 66
+ RQH+++ A LF + G +T + I A V + +Y HF K +L + +
Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESN 69

Query: 67 FDDQLDKLAQKSI-TPEQKIAEIFDWHGRW---------FRSPDFHGCLFAHALAEFGDP 116
+ + K P + EI FH C F +A
Sbjct: 70 IGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQA 129

Query: 117 GHPVFQAVARQKNGLRQRMRS---ILSEVMPRGRAENVAATLLMLIEGATLMAQMGQADT 173
+ + + + +++M R A + + L+E Q
Sbjct: 130 QRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKK 189

Query: 174 ALREARKTALD 184
R+ L+
Sbjct: 190 EARDYVAILLE 200


63  mll7977  mlr8083  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
mll7977   0       20       -3.581421   hypothetical protein
mll7978   2       26       -4.212423   hypothetical protein
msl7979   4       24       -3.827401   hypothetical protein
msl7980   4       27       -4.088966   hypothetical protein
mlr7981   3       28       -3.859077   ABC transporter ATP-binding protein
mll7982   4       28       -3.273126   hypothetical protein
mll7983   3       27       -1.885425   hypothetical protein
mlr7984   3       30       -0.313787   hypothetical protein
msr7985   2       33       -2.120186   hypothetical protein
mlr7987   2       32       -1.981486   hypothetical protein
mlr7988   3       32       -2.640285   hypothetical protein
msr7989   2       32       -2.541204   hypothetical protein
mlr7992   1       30       -2.444564   DNA modification methylase
mll7993   1       31       -5.288376   hypothetical protein
msl8725   1       25       -4.180494   hypothetical protein
mlr7995   2       23       -3.551607   hypothetical protein
mlr7996   1       20       -3.362711   hypothetical protein
mlr7998   2       21       -3.985474   hypothetical protein
mll8000   1       19       -3.324290   hypothetical protein
mlr8001   1       16       -1.826763   hypothetical protein
mlr8003   1       16       -1.650909   hypothetical protein
mlr8004   3       17       -1.805015   hypothetical protein
msl8005   3       16       -1.131546   hypothetical protein
mlr8006   3       16       -0.703498   hypothetical protein
mlr8007   2       17       -1.436141   hypothetical protein
mlr8009   2       17       -1.812964   hypothetical protein
mlr8010   1       19       -1.981783   hypothetical protein
mlr8011   3       17       -1.542890   hypothetical protein
mlr8012   3       16       -1.606239   hypothetical protein
mlr8013   2       19       -2.014318   hypothetical protein
mlr8014   2       20       -2.324913   hypothetical protein
mlr8015   -1      18       -1.867936   hypothetical protein
mlr8016   -1      15       -1.163642   hypothetical protein
msr8017   -1      16       -1.884809   hypothetical protein
mlr8018   -2      15       -1.689544   hypothetical protein
msr8019   -2      14       -1.047440   hypothetical protein
mlr8020   -1      13       -0.695079   hypothetical protein
mlr8022   2       15       -1.802967   hypothetical protein
mlr8023   3       15       -2.583010   hypothetical protein
mlr8025   3       17       -3.332667   hypothetical protein
mlr8026   4       19       -4.102427   hypothetical protein
mlr8028   5       22       -5.262928   hypothetical protein
mlr8029   6       29       -7.170086   hypothetical protein
msl8030   5       28       -6.322471   hypothetical protein
mlr8031   4       27       -5.759911   hypothetical protein
mlr8032   3       25       -4.687716   exopolysaccharide production protein EXOZ
mll8033   3       30       -6.012176   hypothetical protein
mll8034   3       27       -4.749572   hypothetical protein
mlr8035   4       27       -5.097217   hypothetical protein
msr8036   4       29       -6.515019   hypothetical protein
mlr8037   3       30       -7.089772   hypothetical protein
mlr8038   3       33       -7.857647   hypothetical protein
msl8039   0       22       -5.002479   hypothetical protein
mll8040   1       26       -4.602995   hypothetical protein
mlr8042   1       28       -5.053315   ATP-dependent DNA ligase
msr8043   0       38       -5.152077   hypothetical protein
msr8044   -1      26       -2.625121   hypothetical protein
msr8045   0       26       -3.183394   hypothetical protein
mll8047   -1      22       -3.246096   hypothetical protein
msr8048   0       22       -3.281381   hypothetical protein
mll8049   1       21       -2.666866   hypothetical protein
mll8050   2       22       -3.736902   ortho-methyltransferase
mlr8051   6       27       -5.722213   hypothetical protein
mlr8052   5       24       -4.264986   hypothetical protein
mlr8053   3       28       -5.238312   hypothetical protein
msr8054   3       26       -5.393251   hypothetical protein
mll8055   1       25       -5.181797   hypothetical protein
mlr8056   2       23       -3.252812   hypothetical protein
mlr8058   2       23       -4.756614   hypothetical protein
mll8059   2       26       -5.465289   hypothetical protein
msl8061   2       24       -4.690797   hypothetical protein
mll8062   2       24       -4.629741   hypothetical protein
mll8063   1       24       -4.661769   ATP-dependent DNA ligase
mll8064   1       26       -5.286278   outer membrane lipoprotein carrier protein
msr8065   1       26       -4.667423   hypothetical protein
mll8067   1       25       -4.684695   two-component sensor histidine kinase
mll8068   1       30       -6.043641   hypothetical protein
msl8069   1       31       -7.243429   hypothetical protein
msr8070   1       28       -6.748518   hypothetical protein
msl8073   0       26       -6.208327   hypothetical protein
msl8074   0       24       -5.470660   hypothetical protein
msl8075   0       26       -5.190613   hypothetical protein
mlr8076   -1      26       -5.312611   hypothetical protein
mlr8077   -1      26       -5.245645   cardiolipin synthetase
mll8078   0       26       -5.167677   3',5'-cyclic-nucleotide phosphodiesterase
mll8080   0       26       -5.201962   hypothetical protein
mll8081   -1      25       -5.243602   hypothetical protein
mlr8082   -2      22       -4.455825   circadian clock gene kaiC
mlr8083   -1      21       -3.486825   two-component, sensor histidine kinase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mlr8012  TETREPRESSOR  27  0.019  Tetracycline repressor protein signature.
		>TETREPRESSOR#Tetracycline repressor protein signature.

Length = 218

Score = 27.2 bits (60), Expect = 0.019
Identities = 10/45 (22%), Positives = 18/45 (40%), Gaps = 1/45 (2%)

Query: 9 CDVGFYDEMAGVATDLLTEFNQGVVKLKRETPGVVDPEQPWMPVE 53
+ GF A ++ F G V ++E + ++P P E
Sbjct: 121 TENGFSLRDGLYAISAVSHFTLGAVLEQQEHTAAL-TDRPAAPDE 164


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mlr8026  PF04183  28  0.020  IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 27.5 bits (61), Expect = 0.020
Identities = 10/25 (40%), Positives = 13/25 (52%), Gaps = 1/25 (4%)

Query: 4 EEYIRLP-HRWQWGYTDCTLFAADW 27
++ LP H WQW T F AD+
Sbjct: 213 HNWLPLPVHPWQWQQKIATDFIADF 237


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mlr8037  SECA  28  0.014  SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 27.9 bits (62), Expect = 0.014
Identities = 12/84 (14%), Positives = 27/84 (32%), Gaps = 5/84 (5%)

Query: 32 RDIKESDARAALSYEQSEQRAAASRAKMYQKTDELVERVTATESAVSKLNADMTSVKEVT 91
R ++ + + S ++ KT E R+ E + + V+E +
Sbjct: 16 RTLRRMRKVVNIINAMEPEMEKLSDEELKGKTAEFRARLEKGEVLENLIPEAFAVVREAS 75

Query: 92 AEVTRWKLMGLGALGVTGMAAAAL 115
++ G+ V + L
Sbjct: 76 -----KRVFGMRHFDVQLLGGMVL 94


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mll8067  HTHFIS  74  1e-15  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 74.5 bits (183), Expect = 1e-15
Identities = 34/119 (28%), Positives = 52/119 (43%), Gaps = 1/119 (0%)

Query: 1268 VMVVEDEDRVRAVSAEALRELGYSVVEASGPNEAIKMIEAGQQLSLLFTDVVMPEMSGRQ 1327
++V +D+ +R V +AL GY V S + I AG L+ TDVVMP+ +
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDG-DLVVTDVVMPDENAFD 64

Query: 1328 LVDILRRGNPKLKVLYTTGYTRNAIVHNGILDPVTQLLPKPFSLEDLAEKVRTILDDPS 1386
L+ +++ P L VL + LPKPF L +L + L +P
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123


64  mlr8092  mll8116  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
mlr8092   1       28       -3.283326   hypothetical protein
mll8093   0       27       -3.683754   hypothetical protein
mll8094   0       28       -3.645406   blue light photoreceptor cryptochrome
mll8095   2       34       -4.565369   hypothetical protein
mll8096   2       33       -4.032906   hypothetical protein
mlr8097   2       33       -4.491099   hypothetical protein
mll8100   2       29       -3.890969   hypothetical protein
mlr8101   1       27       -3.907517   hypothetical protein
msr8102   1       26       -4.464408   hypothetical protein
msr8103   1       24       -4.100613   hypothetical protein
mlr8105   1       25       -4.497323   hypothetical protein
msl8106   -1      25       -3.489507   hypothetical protein
mlr8107   0       25       -4.518802   hypothetical protein
msr8108   -1      34       -6.533078   hypothetical protein
msl8112   0       32       -5.382648   hypothetical protein
msr8113   2       28       -4.573692   hypothetical protein
mlr8114   0       27       -4.627671   hypothetical protein
mll8115   0       28       -4.545702   hypothetical protein
mll8116   1       28       -4.031906   site-specific recombinase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
msl8112  PF07675  25  0.044  Cleaved Adhesin
		>PF07675#Cleaved Adhesin

Length = 1358

Score = 25.4 bits (55), Expect = 0.044
Identities = 12/41 (29%), Positives = 17/41 (41%), Gaps = 5/41 (12%)

Query: 26 MAEDMKDGRN----LMRVDADGNILWKALPPATQDCFTGMN 62
++E ++G + D DGN W PP F G N
Sbjct: 633 LSESFENGIPASWKTIDADGDGNN-WTTTPPPGGSSFAGHN 672


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mll8115  VACCYTOTOXIN  29  0.024  Helicobacter pylori vacuolating cytotoxin signature.
		>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature.

Length = 1291

Score = 29.2 bits (65), Expect = 0.024
Identities = 27/138 (19%), Positives = 52/138 (37%), Gaps = 20/138 (14%)

Query: 162 MLAMNRVEFRVARLLIELTPRSQLTNPLARRKRYEGISPAQMAAMEADIAEVSHDYLSAA 221
++A+N+ +F + EL RS + L +G Q +++ A + + A
Sbjct: 873 LVAINQHDFGTIESVFELANRSNDIDTLYANSGAQGRDLLQTLLIDSHDAGYARTMIDAT 932

Query: 222 STH--------GSEMLNLIAATSYFDRLLN----------NPKLVRYLARNFARQLEVF- 262
S + + LN IA+ + L N +LV L+R ++ F
Sbjct: 933 SANEITKQLNTATTTLNNIASLEHKTSGLQTLSLSNAMILNSRLVN-LSRRHTNHIDSFA 991

Query: 263 QNLLDFREARYKEHPPQA 280
+ L ++ R+ A
Sbjct: 992 KRLQALKDQRFASLESAA 1009


65  mlr8156  mll8176  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
mlr8156   -1      16       -3.053676   hypothetical protein
mll8158   -1      17       -3.430698   salicylate hydroxylase
mlr8161   0       17       -4.227773   polyphosphate kinase
mlr8162   1       27       -5.142594   exopolyphosphatase
mlr8165   2       35       -6.792453   overcoming lysogenization defect protein
mlr8166   0       20       -3.706327   ATP-dependent DNA helicase
msr8167   -2      14       1.135898    hypothetical protein
mll8168   -1      13       1.716490    hypothetical protein
msr8169   0       11       1.467491    hypothetical protein
mll8170   0       9        1.891204    hypothetical protein
mll8172   0       10       2.117919    fumarate hydratase
mll8174   1       12       2.524487    dehydrogenase
mlr8175   2       14       2.141837    hypothetical protein
mll8176   2       13       1.853395    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mll8174  DHBDHDRGNASE  128  5e-38  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 128 bits (322), Expect = 5e-38
Identities = 68/256 (26%), Positives = 118/256 (46%), Gaps = 8/256 (3%)

Query: 6 KRTLVTGGSDGIGLAIAEAFLSEGADVLIVGRDAAKLEAARQKLAALGQAGAVETSSADL 65
K +TG + GIG A+A S+GA + V + KLE L A + E AD+
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHA--EAFPADV 66

Query: 66 ATSLGVATVVEQVKETGRPLDIPINNAGVADLVPFESVSEAQFQHSFALNVAAAFFLTQG 125
S + + +++ P+DI +N AGV S+S+ +++ +F++N F ++
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126

Query: 126 LLPHFGA--GASIINISSYFARKMIPKRPSSVYSLSKGALNSLTRSLAFELGPRGIRVNA 183
+ + SI+ + S A +P+ + Y+ SK A T+ L EL IR N
Sbjct: 127 VSKYMMDRRSGSIVTVGSNPAG--VPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184

Query: 184 IAPGTVDTAMRRK--TVDNLPAEAKAELKAYVERSYPLGRIGRPDDLAGMAVYLASDEAA 241
++PG+ +T M+ +N + + PL ++ +P D+A ++L S +A
Sbjct: 185 VSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAG 244

Query: 242 WTSGGIFAVDGGYTAG 257
+ VDGG T G
Sbjct: 245 HITMHNLCVDGGATLG 260


66  mlr8491  msr8510  Y  N  Y  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias    Product
mlr8491   3       11       1.703304   hypothetical protein
mlr8492   2       12       2.340284   ribokinase
mll8493   3       15       1.380926   hypothetical protein
msr8494   3       16       -0.218460  hypothetical protein
mll8495   0       16       0.694249   **integrase
msl8496   0       22       0.093417   hypothetical protein
msl8497   1       19       0.008143   hypothetical protein
msl8499   0       22       -1.386047  hypothetical protein
mll8500   1       23       -1.223243  repressor protein C
mll8503   2       21       -0.543092  hypothetical protein
mll8506   2       21       -1.031206  hypothetical protein
mlr8507   4       21       -1.435557  hypothetical protein
mll8511   4       20       -1.260461  hypothetical protein
msr8509   5       18       1.908811   hypothetical protein
msr8510   2       17       2.004508   hypothetical protein
67  mlr0397  mlr0400  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mlr0397   3       13       -0.136977  nitrogen regulation protein nirB protein
mlr0398   2       13       0.025498   nitrogen assimilation regulatory protein ntrC
mlr0399   1       11       -0.241139  nitrogen regulation protein ntrY
mlr0400   -1      12       0.477781   nitrogen assimilation regulatory protein ntrX
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mlr0397   PF06580  38     5e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 37.9 bits (88), Expect = 5e-05
Identities = 26/163 (15%), Positives = 51/163 (31%), Gaps = 48/163 (29%)

Query: 214 LDHVKA---IAKNGFAKKIRILEDYDPSL-----PPVFANRDQLIQVFLNLVKNAAEAIG 265
L V + +A F +++ +P++ PP+ L+Q LV+N +
Sbjct: 222 LTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPM------LVQT---LVENGIK--- 269

Query: 266 TDPHGEIVLSTAFRPG-IRVSVPGTQDRVSLPLEFCVRDNGPGVSEDILPILFDPFITTK 324
HG ++ + G I + + V + G ++
Sbjct: 270 ---HG---IAQLPQGGKILLKGTKDNGT----VTLEVENTGSLALKN------------T 307

Query: 325 PNGSGLGLALV----AKIVGEHGGIIECESTPRGTTFRILMPA 363
+G GL V + G I + +L+P
Sbjct: 308 KESTGTGLQNVRERLQMLYGTEAQI-KLSEKQGKVNAMVLIPG 349


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
mlr0398   HTHFIS  588    0.0      FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 588 bits (1517), Expect = 0.0
Identities = 367/482 (76%), Positives = 420/482 (87%), Gaps = 1/482 (0%)

Query: 4 RGNILVADDDAAIRTVLNQALSRVGHEVRVTSNASTLWRWVAAGEGDLVITDVVMPDENA 63
ILVADDDAAIRTVLNQALSR G++VR+TSNA+TLWRW+AAG+GDLV+TDVVMPDENA
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 64 FDMLPRIKKARPELPVIVMSAQNTFMTAIRASETGAYEYLPKPFDLTELLNIVNRALSEP 123
FD+LPRIKKARP+LPV+VMSAQNTFMTAI+ASE GAY+YLPKPFDLTEL+ I+ RAL+EP
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 124 RRPKIDARPDEQPETMPLVGRSAAMQDIYRMLARMMQTDLTVMISGESGTGKELVARALH 183
+R + D+ + MPLVGRSAAMQ+IYR+LAR+MQTDLT+MI+GESGTGKELVARALH
Sbjct: 123 KR-RPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALH 181

Query: 184 EYGRRRGGPFVAINMAAIPRDLIESELFGHEKGAFTGAQNRSTGRFEQAEGGTLFLDEIG 243
+YG+RR GPFVAINMAAIPRDLIESELFGHEKGAFTGAQ RSTGRFEQAEGGTLFLDEIG
Sbjct: 182 DYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIG 241

Query: 244 DMPMEAQTRLLRVLQQGEYTTVGGRTPIKTDVRIVAATNKDLRTLINQGLFREDLFYRLN 303
DMPM+AQTRLLRVLQQGEYTTVGGRTPI++DVRIVAATNKDL+ INQGLFREDL+YRLN
Sbjct: 242 DMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLN 301

Query: 304 VVPLRLPALRERSEDVPDLVRHFFKLGEMEGLQTKRISSGGIELMKRYPWPGNVRELENL 363
VVPLRLP LR+R+ED+PDLVRHF + E EGL KR +ELMK +PWPGNVRELENL
Sbjct: 302 VVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGNVRELENL 361

Query: 364 VRRLAALYSQDEISAEIIEAELKTGERPVVPGGGNLIPDDLSIGQAVEHFLQRYFASFAG 423
VRRL ALY QD I+ EIIE EL++ LSI QAVE +++YFASF
Sbjct: 362 VRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGD 421

Query: 424 ELPPAGLYQRILSEVEYPLVLASMTATRGNQIKAAELLGLNRNTLRKKIRELGVNVYKSS 483
LPP+GLY R+L+E+EYPL+LA++TATRGNQIKAA+LLGLNRNTLRKKIRELGV+VY+SS
Sbjct: 422 ALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVSVYRSS 481

Query: 484 RP 485
R
Sbjct: 482 RS 483


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mlr0399   PF06580  39     7e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 38.7 bits (90), Expect = 7e-05
Identities = 25/154 (16%), Positives = 52/154 (33%), Gaps = 25/154 (16%)

Query: 534 IIRQVEDIGRMVDEFSAFARM--PKPEMKAIDLRESLREASFLVEVSRA----DITFERI 587
I+ M+ S R + + L + L ++++ + FE
Sbjct: 186 ILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFE-N 244

Query: 588 FGNEPLKGTFDSRLLAQAFGNVIKNAAEAIDGLEQKDGSHGIIRIQAGRQNGAIRIDVID 647
N + +L Q ++N G+ Q G I ++ + NG + ++V +
Sbjct: 245 QINPAIMDVQVPPMLVQTL---VENGI--KHGIAQLP-QGGKILLKGTKDNGTVTLEVEN 298

Query: 648 NGKGLPRENRQRLLEPYMTTREKGTGLGLAIVKK 681
G + ++ TG GL V++
Sbjct: 299 TGSLALKNTKE------------STGTGLQNVRE 320


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
mlr0400   HTHFIS  418    e-145    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 418 bits (1077), Expect = e-145
Identities = 164/478 (34%), Positives = 256/478 (53%), Gaps = 31/478 (6%)

Query: 2 ASDILIVDDEEDIRELVAGILSDEGHETRTAFDADSALAAIADRAPRLIFLDIWLQGSRL 61
+ IL+ DD+ IR ++ LS G++ R +A + IA L+ D+ +
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPD--E 60

Query: 62 DGLALLDEIKTMHPTLPVVMISGHGNIETAVSAIRRGAYDFIEKPFKADRLILIAERALE 121
+ LL IK P LPV+++S TA+ A +GAYD++ KPF LI I RAL
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 122 TSKLRREVSDLKQRSGETFDLIGMSSAMSQLRQTIERVAPTNSRVMIIGPSGSGKELAAR 181
+R S L+ S + L+G S+AM ++ + + R+ T+ +MI G SG+GKEL AR
Sbjct: 121 E--PKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVAR 178

Query: 182 AIHTLSARKGAPFVTLSAANITPERMEIELFGTE----SNGVERKVGALEEAHRGILYID 237
A+H R+ PFV ++ A I + +E ELFG E + R G E+A G L++D
Sbjct: 179 ALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLD 238

Query: 238 EVADMPRETQNKILRVLVEQQFERVGGTKRVKVDVRIISSTSQNLEAMIADGRFREDLYH 297
E+ DMP + Q ++LRVL + ++ VGG ++ DVRI+++T+++L+ I G FREDLY+
Sbjct: 239 EIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYY 298

Query: 298 RLAVVPVMVPGLAERREDIPYLVDNFMKQIARQAGIKPRRIGDDALAVLQAHNWPGNVRQ 357
RL VVP+ +P L +R EDIP LV +F++Q ++ G+ +R +AL +++AH WPGNVR+
Sbjct: 299 RLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKE-GLDVKRFDQEALELMKAHPWPGNVRE 357

Query: 358 LRNNVERLMILARGDDVDAPITADLLPSEI---------GDVMPRTPNQSDQHIMALP-- 406
L N V RL L D + I + L SEI + +Q+ + M
Sbjct: 358 LENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFA 417

Query: 407 -----------LREAREQFEKDYLIAQINRFGGNISKTAEFIGMERSALHRKLKSLGV 453
+ E ++A + GN K A+ +G+ R+ L +K++ LGV
Sbjct: 418 SFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475


68  mll0804  mll0817  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mll0804   -3      10       0.495354   transporter of 3-phenylpropionic acid
mll0806   -2      9        0.536155   hypothetical protein
mlr0808   -2      8        0.388238   short-chain dehydrogenase/reductase
mlr0809   -3      7        0.385932   malic enzyme
mlr0810   -2      9        0.040456   hypothetical protein
mlr0812   -2      8        0.198159   hypothetical protein
mll0813   -1      8        -0.174889  glutamyl-tRNA synthetase
mll0814   0       10       -0.465991  hypothetical protein
mll0815   1       12       -0.059193  ABC transporter ATP-binding protein
mll0817   1       13       -0.721033  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mll0804   TCRTETA  31     0.012    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 30.6 bits (69), Expect = 0.012
Identities = 61/341 (17%), Positives = 118/341 (34%), Gaps = 42/341 (12%)

Query: 29 FISLGTHLPYFPLWLQ---AKGFHAEQIAVILAAPMFLRVVTTPLLTTLADRARDRANVY 85
+ +G +P P L+ ++LA ++ P+L L+DR R +
Sbjct: 18 AVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLL 77

Query: 86 VALVAASLLLSAGYFLTPTYAMVLAVSLGLTIVWTPHSPIADSLALSGVRRFGSNYASMR 145
V+L A++ Y + T + + +G + + A + A G A
Sbjct: 78 VSLAGAAV----DYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHF 133

Query: 146 KWGSICYLCANVAG---GFILAATGPQAVPVIIVLALAA-ALVAGLLAPRMGRPRKASPL 201
+ S C+ VAG G ++ P A P AL + G + PL
Sbjct: 134 GFMSACFGFGMVAGPVLGGLMGGFSPHA-PFFAAAALNGLNFLTGCFLLPESHKGERRPL 192

Query: 202 SAAEIQHAAPSLFN------AYFLYFTFGVGIITASHAFLYGFVSIY---WKSIGISDSV 252
+ A + A + F + ++ A L+ W + I S+
Sbjct: 193 RREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISL 252

Query: 253 VGLLWAWGVVSEVCMFLLFNRIFASVSVVKVMVIAGIGSIVRWILF------PLVWPLGL 306
G++ + ++ + A + + +++ I +IL + +P+ +
Sbjct: 253 AAF----GILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMV 308

Query: 307 GIAGFFAVQSLHSVSVAMVLIGLQKMIAETVSEERTGAAQG 347
+A + + LQ M++ V EER G QG
Sbjct: 309 LLASG-----------GIGMPALQAMLSRQVDEERQGQLQG 338


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr0808   DHBDHDRGNASE  72     4e-17    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 72.0 bits (176), Expect = 4e-17
Identities = 61/253 (24%), Positives = 95/253 (37%), Gaps = 4/253 (1%)

Query: 4 GIRGKKAIVCASSKGLGKGCAMALAEAGCDIVVNGRNAE--LVAKTAAELRERFDVTVTE 61
GI GK A + +++G+G+ A LA G I N E ++ + R
Sbjct: 5 GIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPA 64

Query: 62 VVGDVSKPDVQKALLAACPEP-DILVNNNGGPPLRDFRELDRAKILEGVTQNMVTPIELV 120
V D + D A + P DILVN G L + + N
Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNAS 124

Query: 121 QAVLDGMAKRGFGRIVNITSLSVYVPIPGLDLSSGARAGLTSFLAGVARTVIDRNVTINS 180
++V M R G IV + S VP + + ++A F + + + N+ N
Sbjct: 125 RSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184

Query: 181 LLPGKLDTDRLRGPHEAGTSEAPAAAAARKTRISADVPAKRLGTPEEFGQICAFLCSVHA 240
+ PG +TD + +T +P K+L P + FL S A
Sbjct: 185 VSPGSTETDMQWSLWADENGAEQVIKGSLET-FKTGIPLKKLAKPSDIADAVLFLVSGQA 243

Query: 241 GYLTGQNIPVDGG 253
G++T N+ VDGG
Sbjct: 244 GHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
mlr0810   OMPADOMAIN  43     4e-07    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 43.0 bits (101), Expect = 4e-07
Identities = 52/213 (24%), Positives = 77/213 (36%), Gaps = 36/213 (16%)

Query: 18 ALAADAISAEPA-TAHDWSGVYVGGQIGYGFGKTDATYNLPNTPTIRGSQDYDTDGFLGG 76
ALA A A+ A + W Y G ++G+ D + N PT G GG
Sbjct: 11 ALAGFATVAQAAPKDNTW---YTGAKLGWSQYH-DTGFINNNGPTHENQLGA---GAFGG 63

Query: 77 VQIGYNYQINSAVLGVEADVSGADIKGHSDEITSGLGDRYDTKVDWFGTLRARAGYAFDR 136
YQ+N +G E G D G S Y + L A+ GY
Sbjct: 64 ------YQVN-PYVGFEM---GYDWLGRMPYKGSVENGAYKAQG---VQLTAKLGYPITD 110

Query: 137 TL-IYGTGGLAFGSVENRYVDGPFDTFSEKNTKVGWTIGAGLEQAITDHWSAKFEYQYV- 194
L IY G + +T V G+E AIT + + EYQ+
Sbjct: 111 DLDIYTRLGGMV--WRADTKSNVYG--KNHDTGVSPVFAGGVEYAITPEIATRLEYQWTN 166

Query: 195 DLRD-QTIDYAPNSNTTFDNTFNAVKIGMNYKF 226
++ D TI P++ + +G++Y+F
Sbjct: 167 NIGDAHTIGTRPDNGM--------LSLGVSYRF 191


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mlr0812   PF03544  30     0.013    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 29.9 bits (67), Expect = 0.013
Identities = 12/102 (11%), Positives = 21/102 (20%), Gaps = 9/102 (8%)

Query: 308 PPPPQQSLPETEAAPLVPVPASKPDPGADPETLANREG---------GLDREAVKRLSAK 358
PP P P P P+P + + + E KR
Sbjct: 61 EPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKP 120

Query: 359 PVTSPVSALPPEQRKVRVVGPTFLPDPSAAINLQAPAPKAVQ 400
+ P S ++ + +
Sbjct: 121 VESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRALSR 162


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll0817   SACTRNSFRASE  36     2e-05    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 36.1 bits (83), Expect = 2e-05
Identities = 18/93 (19%), Positives = 33/93 (35%), Gaps = 4/93 (4%)

Query: 33 DEFHHRAFANSLCAAAYIDGKQVGFGRAITDRTVFAYLADIIVWPQNRGQGIGQRLVQAL 92
+ + Y++ +G + ++ +A + DI V R +G+G L+
Sbjct: 55 MDVSYVEEEGKAAFLYYLENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKA 114

Query: 93 IDHPELGSVSHWSLSTGD----AHGVYEKLGFK 121
I+ + L T D A Y K F
Sbjct: 115 IEWAKENHFCGLMLETQDINISACHFYAKHHFI 147


69  mll0989  mll1002  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mll0989   -1      17       -2.455024  two component sensor-kinase
mll0990   1       17       -1.987651  two component response regulator
mll0991   1       18       -1.795540  hypothetical protein
mlr0992   0       18       -2.303297  hypothetical protein
msr0993   0       20       -2.514787  hypothetical protein
mll0994   1       20       -1.820447  transporter
mll0995   1       24       -1.366894  secretion protein
msl0996   1       24       -2.104031  hypothetical protein
mll0997   0       20       -1.920728  sensor/response regulator hybrid
msl0999   0       18       -1.359510  hypothetical protein
mll1000   0       17       -1.776235  dehydrogenase
mll1001   -1      17       -2.280314  hypothetical protein
mll1002   -1      17       -2.903545  sugar ABC transporter ATP-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mll0989   PF06580  34     0.002    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 33.7 bits (77), Expect = 0.002
Identities = 25/180 (13%), Positives = 66/180 (36%), Gaps = 24/180 (13%)

Query: 423 VAQKIIRNADAAAQVIGRIRSLFSKTEGEPQPLDLN-----ALIREVCELMGDRLASSRV 477
+ I+ + A +++ + L + ++ ++ +L + R+
Sbjct: 182 IRALILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQF-EDRL 240

Query: 478 MLELDLDPELPATAADHVQMEQVVLNLVRNGIEAMQDVSTTARSLRIVSRGQDDGTVEVE 537
E ++P + + ++ +V N +++GI + + + +D+GTV +E
Sbjct: 241 QFENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGK----ILLKGT-KDNGTVTLE 295

Query: 538 VRDRGRGLSDPERIFEAFYTTKPDGMGMGLPICRSIVEAHYG---RVWAKNVEGGGASII 594
V + G + G GL R ++ YG ++ +G +++
Sbjct: 296 VENTGSLALK----------NTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMV 345


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
mll0990   HTHFIS  98     4e-26    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 98.4 bits (245), Expect = 4e-26
Identities = 31/145 (21%), Positives = 58/145 (40%)

Query: 15 ILVDDDAEVRDALKELLNSVGIESISFSSTQEVLDAELPDRPACFVLDVRMPGQSGLDLQ 74
++ DDDA +R L + L+ G + S+ + V DV MP ++ DL
Sbjct: 7 LVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLL 66

Query: 75 HLLLTKGIQTPVVFLTGHGDIAMSVQAMKTGAVDFLTKPVRDQTFLDAVSVAVATDKARR 134
+ PV+ ++ +++A + GA D+L KP + + A+A K R
Sbjct: 67 PRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRRP 126

Query: 135 AASTAARKTMALYETLTPREREVLR 159
+ + + +E+ R
Sbjct: 127 SKLEDDSQDGMPLVGRSAAMQEIYR 151


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mll0994   TCRTETB  106    1e-26    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 106 bits (265), Expect = 1e-26
Identities = 82/402 (20%), Positives = 160/402 (39%), Gaps = 17/402 (4%)

Query: 41 AFMAVLNIQIVNASLADIQGAIGAGIDDGGWISTAYLIAEIVVIPLTGWLSQVFSLRNYL 100
+F +VLN ++N SL DI W++TA+++ + + G LS ++ L
Sbjct: 23 SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLL 82

Query: 101 IANAVLFLAFSVACAFAANLGQMIVL-RAIQGFTGGVLIPMAFTIVITLLPKARQPIGLA 159
+ ++ SV + ++++ R IQG + +V +PK +
Sbjct: 83 LFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFG 142

Query: 160 LFAMSATFAPAIGPTIGGYLTENWGWQYIFYVNIVPGALMVGMLWFSLDRQPMRLALLGE 219
L +GP IGG + W Y+ ++P ++ + F + + + G
Sbjct: 143 LIGSIVAMGEGVGPAIGGMIAHYIHWSYLL---LIPMITIITVP-FLMKLLKKEVRIKGH 198

Query: 220 GDWPGIATMAIGLAALQTVLEEGNKNDWFGSLFVLRLAIVAAVSLSLFVWIELTSSHPLL 279
D GI M++G+ +L + + F + VL S +FV + P +
Sbjct: 199 FDIKGIILMSVGIVFF--MLFTTSYSISFLIVSVL--------SFLIFVKHIRKVTDPFV 248

Query: 280 NLRLLLRRNFGFGILANFMLGTALYGSVFILPVYLARIQGYNAEQIGLVLAWTG-LPQLL 338
+ L F G+L ++ + G V ++P + + + +IG V+ + G + ++
Sbjct: 249 DPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVII 308

Query: 339 LIPLVPYLMQRFDVRLVIVTGFALFAASNFMNVHMTAGYASDQLFWPNIVRAIGQALVFA 398
+ L+ R V+ G + S F+ S + + G +
Sbjct: 309 FGYIGGILVDRRGPLYVLNIGVTFLSVS-FLTASFLLETTSWFMTIIIVFVLGGLSFTKT 367

Query: 399 PLSAIAVSGIEQENAGSASSLFNMIRNLGGAVGIALLQTFLT 440
+S I S ++Q+ AG+ SL N L GIA++ L+
Sbjct: 368 VISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
mll0995   RTXTOXIND  129    2e-35    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 129 bits (327), Expect = 2e-35
Identities = 55/372 (14%), Positives = 102/372 (27%), Gaps = 82/372 (22%)

Query: 88 VKADSTTIAPKVSGYIAEVLVRDNQKVTVGQVLARIDDRDFRAALDQAQADMRAAEATVR 147
S I P + + E++V++ + V G VL ++ A + Q+ + A
Sbjct: 92 HSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQT 151

Query: 148 NLDAQIVLQRALIEQARATVAATQASLRFAAVDADRYATLAKSGTGTTQKAEA-SRAGAD 206
L + + + + R +L K T Q + D
Sbjct: 152 RYQILSR-SIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLD 210

Query: 207 QLAAGLARDQAAVVAAEV----------------------RIDVLATERDKALAQVDRAQ 244
+ A A + E + VL E A +
Sbjct: 211 KKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRV 270

Query: 245 AAGEQARL---------------------------------------------NLSYATI 259
+ ++ + I
Sbjct: 271 YKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVI 330

Query: 260 TAPVDGTV-GARTLRIGQYVGAGTQLMAVVPQNAVYVV-ANFKETQLTYVRGGQPVRVAI 317
APV V + G V LM +VP++ V A + + ++ GQ + +
Sbjct: 331 RAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKV 390

Query: 318 DGFPGVE---LEGHVDSLSPASGLEFALLPPDNATGNFTKIVQRIPVKIMIEDQELGGLL 374
+ FP L G V +++ + D G ++ I + + L
Sbjct: 391 EAFPYTRYGYLVGKVKNINLDA-------IEDQRLGLVFNVIISIEENCLSTGNK-NIPL 442

Query: 375 RAGMSVEPTIDT 386
+GM+V I T
Sbjct: 443 SSGMAVTAEIKT 454


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mll0997   PF06580  37     6e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 37.2 bits (86), Expect = 6e-05
Identities = 25/158 (15%), Positives = 50/158 (31%), Gaps = 30/158 (18%)

Query: 121 RQELTDICQLLEQFQVVLANVLSPSIRLTLHVAGSLPLAFIDRELVERALLNLVLNARDA 180
ELT + L+ + + L ++ + + + LVE N + +
Sbjct: 219 ADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVE----NGIKHGIAQ 274

Query: 181 MPAGGDISIAAALEFPPSSKTGPPKQMIRLSISDNGVGMDDGTLKMAGRKNFSTKANGSG 240
+P GG I + + + L + + G T +G
Sbjct: 275 LPQGGKILLKGTKD----------NGTVTLEVENTGSLALKNT------------KESTG 312

Query: 241 LGLAVVRRIVESLSG---RFSIISTLGHGTTIDLWLPA 275
GL VR ++ L G + + G + + +P
Sbjct: 313 TGLQNVRERLQMLYGTEAQIKLSEKQGKVNAM-VLIPG 349


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mll1002   PF05272  30     0.016    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.7 bits (66), Expect = 0.016
Identities = 13/21 (61%), Positives = 14/21 (66%)

Query: 36 VLCLLGDNGAGKSTLINTLAG 56
+ L G G GKSTLINTL G
Sbjct: 598 SVVLEGTGGIGKSTLINTLVG 618


70  mlr1023  mll1036  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mlr1023   1       15       -2.321597  adenylate cyclase
mll1024   2       13       -1.892132  polyamine transport protein
mlr1025   2       13       -0.834174  transcriptional regulatory protein, nodulation
mll1026   2       12       -0.455549  rhizobiocin secretion protein rspE
mll1027   2       11       -0.706335  rhizobiocin secretion protein rspD
mll1028   3       12       -1.534526  rhizobiocin rzcA
msr1029   -1      11       1.614964   DNA repair protein
mlr1030   -1      9        1.474237   hypothetical protein
msl1031   0       12       1.179364   hypothetical protein
mll1033   0       11       0.494617   hypothetical protein
mll1034   2       9        -0.465795  3-hydroxyacyl-CoA dehydrogenase
mll1036   2       10       -0.906691  3-ketoacyl-ACP reductase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr1023   SYCDCHAPRONE  38     4e-05    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 38.0 bits (88), Expect = 4e-05
Identities = 19/76 (25%), Positives = 33/76 (43%), Gaps = 6/76 (7%)

Query: 473 GERLQRLNPLEDRDIHEL--LAFTHYLLGDYEASLRSFRR---WDNNNYDRGFANLAACL 527
G + LN + + +L LAF Y G YE + + F+ D+ + F L AC
Sbjct: 22 GGTIAMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRF-FLGLGACR 80

Query: 528 GQLGRAEEARSAWGRC 543
+G+ + A ++
Sbjct: 81 QAMGQYDLAIHSYSYG 96


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
mll1026   RTXTOXIND  338    e-115    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 338 bits (869), Expect = e-115
Identities = 100/427 (23%), Positives = 172/427 (40%), Gaps = 3/427 (0%)

Query: 9 RTIRRYLLGGVAACIFLVGGAGSLAAVTELSGAVIAPGKLVVDSSVKKVQHPTGGVVGDI 68
RR L FLV A L+ + ++ A GKL K+++ +V +I
Sbjct: 52 PVSRRPRLVAYFIMGFLVI-AFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEI 110

Query: 69 LAREGDAVKSGQVLIRLDETVTRANLAIVTKGLDEFEARLARLEAERDDRAGIAFPASLT 128
+ +EG++V+ G VL++L A+ L + R + P
Sbjct: 111 IVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKL 170

Query: 129 SRRDDPAVARA--MAGEQSLFEFRRQARAGQKAQLEERIAQLAEEASGLTEQRTAKSREI 186
+ SL + + QK Q E + + E + +
Sbjct: 171 PDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLS 230

Query: 187 ELIGTELESIRTLWLKKLVSIDRMTALERDAVRLDGEHGQLTASIAQSKGRIAETRLQII 246
+ + L+ +L K+ ++ + E V E + + Q + I + +
Sbjct: 231 RVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQ 290

Query: 247 QVDQDLRSEVATELRDVQGKISEFVERKVSAEDQLKRIDIRSPQDGVVHQLAVHTIGGVI 306
V Q ++E+ +LR I E++ + IR+P V QL VHT GGV+
Sbjct: 291 LVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVV 350

Query: 307 SPGEVIMLVVPVADDLTVEARIAPQDIDQLSLGQDVALKLSAFNQRVTPELSGVVSEISA 366
+ E +M++VP D L V A + +DI +++GQ+ +K+ AF L G V I+
Sbjct: 351 TTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINL 410

Query: 367 DLSVDERSGASFYTVRVSLPRTELKKLKGLTLAPGMPVEAFFATGSRTMLSYLVKPLADQ 426
D D+R G F + K + L+ GM V A TG R+++SYL+ PL +
Sbjct: 411 DAIEDQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVISYLLSPLEES 470

Query: 427 IARAFRE 433
+ + RE
Sbjct: 471 VTESLRE 477


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
mll1028   RTXTOXINA  78     1e-16    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 77.7 bits (191), Expect = 1e-16
Identities = 64/280 (22%), Positives = 103/280 (36%), Gaps = 63/280 (22%)

Query: 384 GDDTITGSNSPDTITGGRGNDTLNGVGGNDTYIYARGDGNDTVTDGSGNGINDRLVFTDI 443
GDD I G++ D + G +GNDTL+G G+D +Y GDGND + +GN
Sbjct: 745 GDDLIEGNDGNDRLYGDKGNDTLSGGNGDD-QLYG-GDGNDKLIGVAGNNY--------- 793

Query: 444 DPSMVSLVRIGSDVKVVIAESSPGAGDAGSIVLKDILGDFISQGVDKILFADGTVWTRPT 503
+ G GD D+ ++
Sbjct: 794 --------------------LNGGDGD------------------DEFQVQGNSLAKNV- 814

Query: 504 IVGKLVDLLGTTGNDSINGTSATDIIRGAAGNDTLNGASGDDTYLYARGDGNDTVNEGFW 563
L G GND + G+ D++ G G+D L G G+D Y Y G G+ +++
Sbjct: 815 -------LFGGKGNDKLYGSEGADLLDGGEGDDLLKGGYGNDIYRYLSGYGHHIIDDDGG 867

Query: 564 DVNDRLVFTNINRSGVSLVRNGNDLTVVIAES-APGAGDGGSVLIKNTLDDNNSWG---- 618
D+L +I+ V+ R GNDL + E G + +N + +
Sbjct: 868 K-EDKLSLADIDFRDVAFKREGNDLIMYKGEGNVLSIGHKNGITFRNWFEKESGDISNHE 926

Query: 619 VEKVVFADGTAWTRADIRVALLDQAGTTGNDTIIGFNVAD 658
+E++ G T ++ AL Q + G +
Sbjct: 927 IEQIFDKSGRIITPDSLKKALEYQQRNNKASYVYGNDALA 966



Score = 75.0 bits (184), Expect = 1e-15
Identities = 74/285 (25%), Positives = 111/285 (38%), Gaps = 44/285 (15%)

Query: 510 DLLGTTGNDSINGTSATDIIRGAAGNDTLNGASGDDTYLYARGDGNDTVNEGFWDVNDRL 569
+L+GTT D G+ TDI GA G+D + G G+D LY GNDT++ G + +D+L
Sbjct: 721 ELIGTTRADKFFGSKFTDIFHGADGDDLIEGNDGNDR-LYG-DKGNDTLSGG--NGDDQL 776

Query: 570 VFTNINRSGVSLVRNGNDLTVVIAESAPGAGDGGSVLIKNTLDDNNSWGVEKVVFADGTA 629
+ GND + G + L DD E V + A
Sbjct: 777 YGGD-----------GNDKLI--------GVAGNNYLNGGDGDD------EFQVQGNSLA 811

Query: 630 WTRADIRVALLDQAGTTGNDTIIGFNVADTLHGRAGNDTLNGAGGDDTYLYARGDGNDTV 689
G GND + G AD L G G+D L G G+D Y Y G G+ +
Sbjct: 812 KNVLF---------GGKGNDKLYGSEGADLLDGGEGDDLLKGGYGNDIYRYLSGYGHHII 862

Query: 690 NEGFWDVNDRLVFTDINPSGVSLVRNGNDLTVVIAES-APGAGDGGSVLIKNTLDDNNSW 748
++ D+L DI+ V+ R GNDL + E G + +N + +
Sbjct: 863 DDDGGK-EDKLSLADIDFRDVAFKREGNDLIMYKGEGNVLSIGHKNGITFRNWFEKESGD 921

Query: 749 G----VEKVVFADGTTWTRADIRVALLNQADTAGNDTITGFNVAD 789
+E++ G T ++ AL Q + G +
Sbjct: 922 ISNHEIEQIFDKSGRIITPDSLKKALEYQQRNNKASYVYGNDALA 966



Score = 71.2 bits (174), Expect = 2e-14
Identities = 73/287 (25%), Positives = 108/287 (37%), Gaps = 44/287 (15%)

Query: 644 GTTGNDTIIGFNVADTLHGRAGNDTLNGAGGDDTYLYARGDGNDTVNEGFWDVNDRLVFT 703
GTT D G D HG G+D + G G+D LY GNDT++ G + +D+L
Sbjct: 724 GTTRADKFFGSKFTDIFHGADGDDLIEGNDGNDR-LYG-DKGNDTLSGG--NGDDQLYGG 779

Query: 704 DINPSGVSLVRNGNDLTVVIAESAPGAGDGGSVLIKNTLDDNNSWGVEKVVFADGTTWTR 763
D N + GN+ G GD + N+L N +G
Sbjct: 780 DGNDKLIG--VAGNNYLN------GGDGDDEFQVQGNSLAKNVLFG-------------- 817

Query: 764 ADIRVALLNQADTAGNDTITGFNVADTLHGRAGNDTLNGAGGDDTYLYARGDGNDTVNEG 823
GND + G AD L G G+D L G G+D Y Y G G+ +++
Sbjct: 818 ------------GKGNDKLYGSEGADLLDGGEGDDLLKGGYGNDIYRYLSGYGHHIIDDD 865

Query: 824 FWDVNDRLVFTDINPSGVSLVRNGNDLTVVIAES-APGAGDGGSVLIKNTLDDNNSWG-- 880
D+L DI+ V+ R GNDL + E G + +N + +
Sbjct: 866 GGK-EDKLSLADIDFRDVAFKREGNDLIMYKGEGNVLSIGHKNGITFRNWFEKESGDISN 924

Query: 881 --VEKVVFADGTTWTRADIRVALLDQAGTTGNDTITGFNVADRISGG 925
+E++ G T ++ AL Q + G + S G
Sbjct: 925 HEIEQIFDKSGRIITPDSLKKALEYQQRNNKASYVYGNDALAYGSQG 971



Score = 68.8 bits (168), Expect = 8e-14
Identities = 68/258 (26%), Positives = 99/258 (38%), Gaps = 44/258 (17%)

Query: 8 TDGDDVLVGSAASGIMHGGKGNDTLDGASGNDNYVYARGDGNDLITDGYNDVGDRLTFTD 67
T D GS + I HG G+D ++G GND +Y GND ++ G D D+L D
Sbjct: 725 TTRADKFFGSKFTDIFHGADGDDLIEGNDGNDR-LYG-DKGNDTLSGGNGD--DQLYGGD 780

Query: 68 INSSTVSLVRSGNDVTIVIAESAPGAGDGGSVRLKDALDDDHNRGVDQVVFADGTIWTRA 127
GND I G G+ L DD + + +
Sbjct: 781 -----------GNDKLI---------GVAGNNYLNGGDGDDEFQVQGNSLAKN------- 813

Query: 128 GIRVMLLDQTATVGNDTITGFNVADTISGKAGNDTIDGAGGNDNYVYARGDGNDTLTEGY 187
+L GND + G AD + G G+D + G GND Y Y G G+ + +
Sbjct: 814 -----VLFGGK--GNDKLYGSEGADLLDGGEGDDLLKGGYGNDIYRYLSGYGHHIIDDDG 866

Query: 188 NDYGDRLTFTDIDSSAVSIVRNGNDVTVVIAES-APGAGDGGSVVLKDTLEDNAGRG--- 243
D+L+ DID V+ R GND+ + E G + ++ E +G
Sbjct: 867 GK-EDKLSLADIDFRDVAFKREGNDLIMYKGEGNVLSIGHKNGITFRNWFEKESGDISNH 925

Query: 244 -IDQIVFADGTNWSRAQL 260
I+QI G + L
Sbjct: 926 EIEQIFDKSGRIITPDSL 943



Score = 65.8 bits (160), Expect = 8e-13
Identities = 62/203 (30%), Positives = 84/203 (41%), Gaps = 27/203 (13%)

Query: 265 LTGTTADETLVGFSRDDTFHYARGGGDDTIIDGVNNGYNDQLVFSDINPDDVTLVGIGND 324
L GTT + G D FH G D +I+G N ND+L + D +D G G+D
Sbjct: 722 LIGTTRADKFFGSKFTDIFH---GADGDDLIEG--NDGNDRL-YGD-KGNDTLSGGNGDD 774

Query: 325 VKVVVAESTTGAGDGGSILLKDALANYY--GQGIDKIVFADGTAWTRDDFRAAILGLGAT 382
GDG L+ A NY G G D+ + L
Sbjct: 775 QLY--------GGDGNDKLIGVAGNNYLNGGDGDDEFQVQGNSLAKNV--------LFGG 818

Query: 383 AGDDTITGSNSPDTITGGRGNDTLNGVGGNDTYIYARGDGNDTVTDGSGNGINDRLVFTD 442
G+D + GS D + GG G+D L G GND Y Y G G+ + D G D+L D
Sbjct: 819 KGNDKLYGSEGADLLDGGEGDDLLKGGYGNDIYRYLSGYGHHIIDDDGGK--EDKLSLAD 876

Query: 443 IDPSMVSLVRIGSDVKVVIAESS 465
ID V+ R G+D+ + E +
Sbjct: 877 IDFRDVAFKREGNDLIMYKGEGN 899



Score = 60.4 bits (146), Expect = 3e-11
Identities = 62/244 (25%), Positives = 94/244 (38%), Gaps = 49/244 (20%)

Query: 141 GNDTITGFNVADTISGKAGNDTIDGAGGNDNYVYARGDGNDTLTEGYNDYGDRLTFTDID 200
G+D I G + D + G GNDT+ G G+D GDGND L
Sbjct: 745 GDDLIEGNDGNDRLYGDKGNDTLSGGNGDDQLY--GGDGNDKLIGV-------------- 788

Query: 201 SSAVSIVRNGNDVTVVIAESAPGAGDGGSVVLKDTLEDN---AGRGIDQIVFADGTNWSR 257
GN+ G GD V ++L N G+G D++ ++G +
Sbjct: 789 --------AGNNYLN------GGDGDDEFQVQGNSLAKNVLFGGKGNDKLYGSEGAD--- 831

Query: 258 AQLRDILLTGTTADETLVGFSRDDTFHYARGGGDDTIIDGVNNGYNDQLVFSDINPDDVT 317
LL G D+ L G +D + Y G G I D + G D+L +DI+ DV
Sbjct: 832 ------LLDGGEGDDLLKGGYGNDIYRYLSGYGHHIIDD--DGGKEDKLSLADIDFRDVA 883

Query: 318 LVGIGNDVKVVVAES---TTGAGDGGSI--LLKDALANYYGQGIDKIVFADGTAWTRDDF 372
GND+ + E + G +G + + + I++I G T D
Sbjct: 884 FKREGNDLIMYKGEGNVLSIGHKNGITFRNWFEKESGDISNHEIEQIFDKSGRIITPDSL 943

Query: 373 RAAI 376
+ A+
Sbjct: 944 KKAL 947



Score = 53.8 bits (129), Expect = 3e-09
Identities = 57/227 (25%), Positives = 86/227 (37%), Gaps = 46/227 (20%)

Query: 776 TAGNDTITGFNVADTLHGRAGNDTLNGAGGDDTYLYARGDGNDTVNEGFWDVNDRLVFTD 835
T D G D HG G+D + G G+D LY GNDT++ G + +D+L D
Sbjct: 725 TTRADKFFGSKFTDIFHGADGDDLIEGNDGNDR-LYG-DKGNDTLSGG--NGDDQLYGGD 780

Query: 836 INPSGVSLVRNGNDLTVVIAESAPGAGDGGSVLIKNTLDDNNSWGVEKVVFADGTTWTRA 895
GND + G + L DD + +
Sbjct: 781 -----------GNDKLI--------GVAGNNYLNGGDGDD-------EFQVQGNSLAKNV 814

Query: 896 DIRVALLDQAGTTGNDTITGFNVADRISGGGGNDTLTGGAGSDTFIFHTNFGSDKITDFV 955
G GND + G AD + GG G+D L GG G+D + + + +G I D
Sbjct: 815 --------LFGGKGNDKLYGSEGADLLDGGEGDDLLKGGYGNDIYRYLSGYGHHIIDD-- 864

Query: 956 VGAGSQDVIQFGNDVFADFASVLAAATQVGADTVITHDAGNTLTLKN 1002
G +D + + F D A + G D ++ GN L++ +
Sbjct: 865 -DGGKEDKLSLADIDFRDV-----AFKREGNDLIMYKGEGNVLSIGH 905


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mlr1030   cloacin  37     8e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 36.6 bits (84), Expect = 8e-05
Identities = 19/42 (45%), Positives = 21/42 (50%)

Query: 25 NGNGGGNGGGHGNGGGHGNGGGNAGSNGKGNSGASHGKSASA 66
N GGG+G G GGG G+G G N G SG SA A
Sbjct: 44 NPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVA 85



Score = 36.2 bits (83), Expect = 1e-04
Identities = 18/37 (48%), Positives = 26/37 (70%)

Query: 23 AGNGNGGGNGGGHGNGGGHGNGGGNAGSNGKGNSGAS 59
+G+G G G GHGNGGG+GN GG +G+ G ++ A+
Sbjct: 50 SGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 35.1 bits (80), Expect = 3e-04
Identities = 20/47 (42%), Positives = 25/47 (53%), Gaps = 1/47 (2%)

Query: 25 NGNGGGNGGGHGNGGGHGNGGGNAGSNGKGNSGASHGKSASAPGQVG 71
G G G+G G G GHGNGGGN S G G+ + + +AP G
Sbjct: 46 WGGGSGSGIHWGGGSGHGNGGGNGNS-GGGSGTGGNLSAVAAPVAFG 91



Score = 32.0 bits (72), Expect = 0.002
Identities = 18/41 (43%), Positives = 22/41 (53%)

Query: 19 SPALAGNGNGGGNGGGHGNGGGHGNGGGNAGSNGKGNSGAS 59
+P G+G+G GGG G+G G GNG GS GN A
Sbjct: 44 NPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAV 84



Score = 28.1 bits (62), Expect = 0.036
Identities = 22/66 (33%), Positives = 27/66 (40%), Gaps = 6/66 (9%)

Query: 19 SPALAGNGNGGGNGGGHG-----NGGGHGNGGGNAGSNGKGNSGASHGKSASAPGQVGKV 73
P G G G +G G GGG G+G G +G GN G + G S G G +
Sbjct: 23 GPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGN-GNSGGGSGTGGNL 81

Query: 74 DADATA 79
A A
Sbjct: 82 SAVAAP 87


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll1033   VACJLIPOPROT  28     0.047    VacJ lipoprotein signature.
		>VACJLIPOPROT#VacJ lipoprotein signature.

Length = 251

Score = 27.9 bits (62), Expect = 0.047
Identities = 12/45 (26%), Positives = 19/45 (42%), Gaps = 2/45 (4%)

Query: 204 FFVQSVFGLLGGIGTHPEDVAHMKRTADRLFGDQF-RWSVLGAGA 247
FF+ ++ G+ G I ++RT FG + V G G
Sbjct: 103 FFLNTILGMGGFIDVAGMANPKLQRTEPHRFGSTLGHYGV-GYGP 146


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll1036   DHBDHDRGNASE  105    3e-29    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 105 bits (262), Expect = 3e-29
Identities = 75/257 (29%), Positives = 117/257 (45%), Gaps = 14/257 (5%)

Query: 15 GLKGQRVLVTAGAGGIGFAIADTLSRLGARIIVCDISDEALAAAPGKIDLVA----AVKA 70
G++G+ +T A GIG A+A TL+ GA I D + E L + A A A
Sbjct: 5 GIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPA 64

Query: 71 DVSRDEDVDRLFETVKEKLGGLDALVNNAGIAGPTGGVDEIEPDDWRRCIDICLTGQFLC 130
DV +D + ++ ++G +D LVN AG+ P G + + ++W + TG F
Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAGVLRP-GLIHSLSDEEWEATFSVNSTGVFNA 123

Query: 131 ARRAVPLIKAAGGGSIVSMSSAAGRHGYAFRTPYSAAKFGVIGFAQSLAKELGPHGIRVN 190
+R + GSIV++ S Y+++K + F + L EL + IR N
Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183

Query: 191 AILPGIIEGPRIEGVIAAR--AKQV--GISHEEMTGRYLQNISLRRMTSPYDVASMVAFL 246
+ PG E + A A+QV G TG I L+++ P D+A V FL
Sbjct: 184 IVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTG-----IPLKKLAKPSDIADAVLFL 238

Query: 247 LSDAGINISGQSLGVDG 263
+S +I+ +L VDG
Sbjct: 239 VSGQAGHITMHNLCVDG 255


71  mlr1449  mll1455  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mlr1449   1       13       2.001845   phosphoserine phosphatase
mlr1450   0       12       1.860830   acetyltransferase
mll1451   0       11       0.900026   serine protease
msl1453   -2      11       0.355317   hypothetical protein
mll1454   -2      11       0.637386   ftsH protease activity modulator hflC
mll1455   -3      10       1.352233   protease subunit hflK
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
mlr1449   LIPOLPP20  29     0.024    LPP20 lipoprotein precursor signature.
		>LIPOLPP20#LPP20 lipoprotein precursor signature.

Length = 175

Score = 28.6 bits (63), Expect = 0.024
Identities = 32/113 (28%), Positives = 53/113 (46%), Gaps = 18/113 (15%)

Query: 80 GIACDLVLPQEAD--TANTTAALRAALAAEPVDVIVQQAQTRRKKILIADMDSTM--IDQ 135
G A DL+ + D T TA RA LAA + + + + + + A ++ D
Sbjct: 67 GRAEDLITNNDVDYSTNQATAKARANLAANLKSTLQKDLENEKTRTVDASGKRSISGTDT 126

Query: 136 ECIDELADEIGVKEHVAA-ITARSMNGEIAFEPALRERVALLKGLDAAVVDRI 187
E I +L D KE +A+ + AR + ++RV +L GLD +VD++
Sbjct: 127 EKISQLVD----KELIASKMLARYVG---------KDRVFVLVGLDKQIVDKV 166


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
mll1451   V8PROTEASE  72     8e-16    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 72.0 bits (176), Expect = 8e-16
Identities = 33/169 (19%), Positives = 59/169 (34%), Gaps = 25/169 (14%)

Query: 107 QSLGSGFVIDAEQGIVVTNNHVIADADDIEV------------NFSDGVTLKATLVGTDT 154
+ SG V+ + ++TN HV+ N+ +G +
Sbjct: 101 TFIASGVVVG--KDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSG 158

Query: 155 KTDVAVLKVDPK------GHKLTAVKFGDSTKMRVGDWVMAVGNPFGLGGTVTVGIVSAR 208
+ D+A++K P G + ++ + +V + G P V +
Sbjct: 159 EGDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGDK----PVATMWES 214

Query: 209 NRDINSGPYDDFIQTDAAINRGNSGGPLFNSAGEVIGINTAIISPSGGS 257
I + +Q D + GNSG P+FN EVIGI+ +
Sbjct: 215 KGKITYLK-GEAMQYDLSTTGGNSGSPVFNEKNEVIGIHWGGVPNEFNG 262


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
mll1454   RTXTOXIND  33     0.001    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 33.3 bits (76), Expect = 0.001
Identities = 14/125 (11%), Positives = 35/125 (28%), Gaps = 11/125 (8%)

Query: 111 QIELAEARLRTRLDAALRRVYGLRDFEAALSEQRAVMMREVRDQLRPDATSLGLQIEDVR 170
Q EL + R L R+ + + + Q + +
Sbjct: 204 QKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAK-----HAVLEQE 258

Query: 171 IRRTDLTAEVSQQTYDRMKAERLAEAARLRARGNEAAQRITARADREVVEIVAEAQKESE 230
+ + E+ + E +A+ + +T E+++ + +
Sbjct: 259 NKYVEAVNELRVYKSQLEQIESEILSAKEEYQ------LVTQLFKNEILDKLRQTTDNIG 312

Query: 231 ILRGE 235
+L E
Sbjct: 313 LLTLE 317


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mll1455   cloacin  30     0.017    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 30.1 bits (67), Expect = 0.017
Identities = 29/104 (27%), Positives = 42/104 (40%), Gaps = 16/104 (15%)

Query: 2 PWNDKSGGGGGPWGGGGNNQGPWGQGPKGPSGPQGSPPDLEDIIRRGQDRLRRALPGGGG 61
PW SG G GG G+ G G SG G+ + + G L + PG GG
Sbjct: 45 PWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPAL--STPGAGG 102

Query: 62 ASPAVFG-----LIAAVLVAL--------WAFQAVYTVQPDEVA 92
+ ++ IA ++ AL W A+Y V P ++A
Sbjct: 103 LAVSISAGALSAAIADIMAALKGPFKFGLWGV-ALYGVLPSQIA 145


72  mll1539  mll1549  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mll1539   0       12       2.066741   DNA ligase
mll1541   -1      13       2.573516   DNA repair protein RecN
mlr1540   0       14       2.013510   hypothetical protein
mll1543   -1      13       0.888905   hypothetical protein
mll1545   -1      13       1.334107   UDP-3-O-[3-hydroxymyristoyl] N-acetylglucosamine
mll1546   -1      15       1.533165   cell division protein FtsZ
mll1549   0       13       1.305646   cell division protein FtsA
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
mll1539   ECOLIPORIN  33     0.004    E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 33.0 bits (75), Expect = 0.004
Identities = 38/169 (22%), Positives = 59/169 (34%), Gaps = 24/169 (14%)

Query: 128 IDGLSASLRYENGVFVQGATRGDGAVGEDITANLRTIADIPAKLKGSGWPDVIEIRGEVY 187
+DGL+ +L+Y QG A +I N R D G G+ +
Sbjct: 163 VDGLNFALQY------QGKNESQSADDVNIGTNNRNNGDDIRYDNGDGF--------GIS 208

Query: 188 MTYAEFEALKARSAAAGGQDYVNPRNTAAGSLRQKDASVTASRNLKFFA---YAWGFTTA 244
TY A AA D N + A G++ D + + LK+ A Y +
Sbjct: 209 TTYDIGMGFSA-GAAYTTSDRTNEQVNAGGTIAGGDKADAWTAGLKYDANNIYLATMYSE 267

Query: 245 DPAPTQYESVQKFADWGFKISPLMVRAKSVEELVAQYHLIEEQRSSLGY 293
T Y K D G + ++ E + AQY R ++ +
Sbjct: 268 TRNMTPYGKTDKGYDGGVA-----NKTQNFE-VTAQYQFDFGLRPAVSF 310


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
mll1541   GPOSANCHOR  33     0.003    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 33.1 bits (75), Expect = 0.003
Identities = 41/221 (18%), Positives = 67/221 (30%), Gaps = 10/221 (4%)

Query: 170 EQELTRHRAKVAAAAREADYLRAAVTELTKLDPQPGEETELAELRAHMMRAEKIASEIHD 229
E+ L A + + L A L + E L +
Sbjct: 161 EKALEGAMNFSTADSAKIKTLEAEKAALEARQAEL--EKALEGAMNFSTADSAKIKTLEA 218

Query: 230 AQDVLSGPSSPLPQLASLLRRLQRKATEAPGLLEDVVKSLDEAMLSLDAAQSGVEAALRA 289
+ L+ + L + + LE +L+ L+ A G A
Sbjct: 219 EKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTA 278

Query: 290 TEYDPQRLEKAEERLFSLRAASRKHSVAVDDLAQLRDTMVADLADLDAGEERLHGLEKQA 349
+ LE + L + +A S ++ A + DLDA E LE +
Sbjct: 279 DSAKIKTLEAEKAALEAEKADLEHQSQVLN--ANRQSL----RRDLDASREAKKQLEAEH 332

Query: 350 AAAREAYDIAAAQLSSLR--HAAAVGLTKAVMAELPALKLE 388
E I+ A SLR A+ K + AE L+ +
Sbjct: 333 QKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQ 373


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll1543   SYCDCHAPRONE  29     0.019    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 28.7 bits (64), Expect = 0.019
Identities = 23/116 (19%), Positives = 41/116 (35%), Gaps = 11/116 (9%)

Query: 52 DVLYNQGLANLNAGRLDEASKKFDAVDRQHPYSEWARKSMVMGAFADYRKGSYDEAISSA 111
+ LY+ +G+ ++A K F A+ Y +R + +GA G YD AI S
Sbjct: 37 EQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYD--SRFFLGLGA-CRQAMGQYDLAIHSY 93

Query: 112 KRYLALYPSTDDAPYAQYIIGLSYYRQIKDVTQDQKEARQTLQTMQDLVTRWPTSE 167
+ P + ++ + EA L Q+L+ +
Sbjct: 94 SYGAIMDI---KEPRFPFHAAECLLQK-----GELAEAESGLFLAQELIADKTEFK 141


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mll1546   PF03544  33     0.002    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 33.0 bits (75), Expect = 0.002
Identities = 28/149 (18%), Positives = 44/149 (29%), Gaps = 7/149 (4%)

Query: 306 LEGVIRVSVVATGIDKSAAEIAAAPISIRTAPPKPAVRPAVAAVESRPAPVQQPVYEP-R 364
+ G + ++ T E+ A I PA AV+ P PV +P EP
Sbjct: 24 IHGAVVAGLLYT-SVHQVIELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEP 82

Query: 365 AADPVAEAIQLAEANAAAMAQARPAPVAHADDFRPQSKIFQAPPQQPMPQPVVQQMQPAP 424
+P EA + E +P P +P+ + P + P P
Sbjct: 83 IPEPPKEAPVVIEKPKPK---PKPKPKPVKKVEQPKRDV--KPVESRPASPFENTAPARP 137

Query: 425 QPREMLREVQQPVAMAPQRMPRVEDFPPV 453
+PV + P
Sbjct: 138 TSSTATAATSKPVTSVASGPRALSRNQPQ 166


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll1549   SHAPEPROTEIN  49     1e-08    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 49.4 bits (118), Expect = 1e-08
Identities = 50/191 (26%), Positives = 77/191 (40%), Gaps = 24/191 (12%)

Query: 202 RMVATPYASGLAALVDDELELGAACIDMGGGTTTISVFSEGKFVHGDAIAIGGNHVTLDM 261
++ P A+ + A + G+ +D+GGGTT ++V S V+ ++ IGG+ D
Sbjct: 139 FLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNGVVYSSSVRIGGD--RFDE 196

Query: 262 A--------KGLSTSLDAAERLKVMHGSALPGSADD------RDLVSIQP--IGDDGDVP 305
A G AER+K GSA PG R+L P + +
Sbjct: 197 AIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVPRGFTLNSNEI 256

Query: 306 LQIPRSVMTRIVRARIDETLELLRDRLNKSGYGNAVGKRVVLTGGASQLAGLPEAARRIL 365
L+ + +T IV A + LE L + + +VLTGG + L L
Sbjct: 257 LEALQEPLTGIVSA-VMVALEQCPPELA----SDISERGMVLTGGGALLRNLDRLLMEET 311

Query: 366 GRNVRIG-RPL 375
G V + PL
Sbjct: 312 GIPVVVAEDPL 322


73  mll1629  mlr1638  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mll1629   0       15       0.844256   phenylhydantoinase
mll1631   -1      14       1.266464   allantoate amidohydrolase
mll1632   -1      14       1.374169   beta alanine--pyruvate transaminase
mlr1634   1       13       1.453879   transcriptional regulator
mll1636   1       14       1.585431   hypothetical protein
mlr1637   0       13       1.098549   hypothetical protein
mlr1638   1       10       0.960348   oxidoreductase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
mll1629   UREASE  40     2e-05    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 39.7 bits (93), Expect = 2e-05
Identities = 29/94 (30%), Positives = 37/94 (39%), Gaps = 16/94 (17%)

Query: 4 VIKNGTIVTADRTWKADVLVKHGKIVAIGSDLHGDHEFDAT-------------GCYVMP 50
VI N I+ KAD+ +K G+I AIG + D + T G V
Sbjct: 71 VITNALILDHWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGKIVTA 130

Query: 51 GGIDPHTHLEMPFMGTYSADDFESGTRAALAGGT 84
GG+D H H P + SG L GGT
Sbjct: 131 GGMDSHIHFICP---QQIEEALMSGLTCMLGGGT 161


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mlr1634   HTHTETR  71     2e-17    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 71.2 bits (174), Expect = 2e-17
Identities = 34/147 (23%), Positives = 64/147 (43%), Gaps = 4/147 (2%)

Query: 8 PRRTRIQ-QEKRELILEAALEVFSTHGFRGSTIDQIAEAAGMSKPNLLYYFRRKEDIHET 66
R+T+ + QE R+ IL+ AL +FS G +++ +IA+AAG+++ + ++F+ K D+
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 67 LMQRLLDTWLAPLREL--DDIGDPMTELRSYIRRKLEMARDFPRESRLFA-NEILQGAPR 123
+ + E GDP++ LR + LE R L
Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVG 121

Query: 124 IMPLLAGELKTLVDEKAAVIKGWMRAG 150
M ++ + L E I+ ++
Sbjct: 122 EMAVVQQAQRNLCLESYDRIEQTLKHC 148


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr1637   DHBDHDRGNASE  68     2e-15    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 67.8 bits (165), Expect = 2e-15
Identities = 50/203 (24%), Positives = 83/203 (40%), Gaps = 11/203 (5%)

Query: 3 LKDKTILITGSTDGVGRVVAQRLGADGARVLVHGRDAARGKAAVAEIEAAGGRAEFFAAD 62
++ K ITG+ G+G VA+ L + GA + + + + V+ ++A AE F AD
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 63 LASLAEIRHLAEAVRARTNRLDILINNAGIGTAAAKRQVSADGYELRFAVNYLAGFLLTS 122
+ A I + + +DIL+N AG+ +S + +E F+VN F +
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 123 ELLPLLKASAPARIVNVASAGQQAIDFGDVMLTHGYSGVRAYCQSKLAQILFTVDLAEQL 182
+ + IV V S + + AY SK A ++FT L +L
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGV----------PRTSMAAYASSKAAAVMFTKCLGLEL 175

Query: 183 KGTGVTVNALHPASYMNTTMVRQ 205
+ N + P S T M
Sbjct: 176 AEYNIRCNIVSPGS-TETDMQWS 197


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr1638   DHBDHDRGNASE  84     1e-21    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 84.3 bits (208), Expect = 1e-21
Identities = 59/245 (24%), Positives = 97/245 (39%), Gaps = 20/245 (8%)

Query: 10 IIGGSSGIGLATARKLLGPGMKVTITGRN---QDKLISAWKSLGGAADKAAFDASKPDEV 66
I G + GIG A AR L G + N +K++S+ K+ A+ D +
Sbjct: 13 ITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSAAI 72

Query: 67 RQFFERL----GPFDHLVLAASGGKGLGPFETLDLADIGSGVEEKVRPQLSCLQAALPTL 122
+ R+ GP D LV A G G +L + + + ++ +
Sbjct: 73 DEITARIEREMGPIDILVNVA-GVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSKYM 131

Query: 123 --NKSGSVTFISAVSAQATMPGIAGIGAINGMLLTVAPILAVELKP--LRVNVVAPGVID 178
+SGS+ + + A +A + + L +EL +R N+V+PG +
Sbjct: 132 MDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGSTE 191

Query: 179 TP-----WWDFLPDEQRQAVFAE-YAGKTPVGRIGRAEDVASAIAFLVSN--GFMTGQVL 230
T W D EQ E + P+ ++ + D+A A+ FLVS G +T L
Sbjct: 192 TDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHITMHNL 251

Query: 231 TCDGG 235
DGG
Sbjct: 252 CVDGG 256


74  mlr1970  mll1981  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mlr1970   0       14       4.053880   hypothetical protein
mll1971   1       14       3.306917   L-lysine 2,3-aminomutase
msl1972   0       14       3.429892   hypothetical protein
mlr1973   0       14       3.630579   glycine cleavage system transcription activator
mll1975   0       14       4.069438   cinnamoyl-CoA reductase
mlr1976   1       16       3.366254   transcriptional regulator
msl1978   1       16       2.787860   tautomerase
mll1982   -1      15       2.037003   short chain oxidoreductase
mll1981   -1      16       1.273276   short chain dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr1970   ENTSNTHTASED  67     1e-15    Enterobactin synthetase component D signature.
		>ENTSNTHTASED#Enterobactin synthetase component D signature.

Length = 234

Score = 66.6 bits (162), Expect = 1e-15
Identities = 37/152 (24%), Positives = 58/152 (38%), Gaps = 13/152 (8%)

Query: 38 LPEEARSIPARQPAMRRASGAARWVAHRLLADTGISDLAIPRAPSGAPLWPNGIVGSLAH 97
LP R + + + A R A L + G+ + PLWP+G+ GS++H
Sbjct: 33 LPHHDR-LRSAGRKRKAEHLAGRIAAVHALREVGVRTVPGM-GDKRQPLWPDGLFGSISH 90

Query: 98 DDDMAVAAVAPVGGIVSLGIDVEP------AEPLPDDIFAIVATGADRTGAADPRLAGRI 151
A+A + +GID+E A L I + LA +
Sbjct: 91 CATTALAVI----SRQRIGIDIEKIMSQHTATELAPSIIDSDERQILQASLLPFPLALTL 146

Query: 152 LFAAKEAVYKAAYPLDREVLGYEDIAVDLDAG 183
F+AKE+VYK A+ + G+ V
Sbjct: 147 AFSAKESVYK-AFSDRVTLPGFNSAKVTSLTA 177


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mll1971   PF07520  30     0.024    Virulence protein SrfB
		>PF07520#Virulence protein SrfB

Length = 1041

Score = 29.6 bits (66), Expect = 0.024
Identities = 17/84 (20%), Positives = 34/84 (40%), Gaps = 7/84 (8%)

Query: 20 VRQGVRHVRDLDRLPLSPVERAAAQAAAAHHKVRAPKAYLDLIDWNDPADP------IRA 73
V G R + L+R +P+ R + K++ P + + +D + +RA
Sbjct: 936 VYIGARQL-PLERWTTTPLYRLDFANDSIAGKIKLPVKVELVREDDDFDEAETSLEKLRA 994

Query: 74 QVIPSPDELEEAEGELGDPIADHD 97
+ + ++ AE G I + D
Sbjct: 995 ERVREVFRVDAAEDAEGTMIKNDD 1018


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll1975   NUCEPIMERASE  55     1e-10    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 55.2 bits (133), Expect = 1e-10
Identities = 37/175 (21%), Positives = 65/175 (37%), Gaps = 27/175 (15%)

Query: 15 VLVTGGSGFIASHCMLKLLDAGYRLRTTVRSLEREAEVRAMLREGGAE--PGDRLSFVAA 72
LVTG +GFI H +LL+AG+++ + +L +V L++ E F
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVV-GIDNLNDYYDVS--LKQARLELLAQPGFQFHKI 59

Query: 73 DLTADAGWAEAV---AGCAYVMH-----GASPTPSGSQTREEDWVRPAVDGVLRVLKAAR 124
DL AD + V + + + G L +L+ R
Sbjct: 60 DL-ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHA----YADSNLTGFLNILEGCR 114

Query: 125 DAGIKRVVL--TSAIGAVAMGHAPQTRPFNETDWSDLSGAVAPYQRSKTLSERAA 177
I+ ++ +S++ + PF+ D D V+ Y +K +E A
Sbjct: 115 HNKIQHLLYASSSSVYG-----LNRKMPFSTDDSVD--HPVSLYAATKKANELMA 162


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mlr1976   HTHTETR  28     0.030    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 28.4 bits (63), Expect = 0.030
Identities = 19/96 (19%), Positives = 37/96 (38%), Gaps = 8/96 (8%)

Query: 218 LEELARAAAMSRTSFAFHFRQTAGVAPLTY---LTQWRMHLAERALREEDTPVAVLARSL 274
L E+A+AA ++R + +HF+ + + + + E + P++VL L
Sbjct: 34 LGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQAKFPGDPLSVLREIL 93

Query: 275 GYTSESAFSNAFKRATGTAPKRYRTAGKAERSGDAE 310
+ ES + +R K E G+
Sbjct: 94 IHVLESTVTEERRRLLMEIIFH-----KCEFVGEMA 124


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll1982   DHBDHDRGNASE  75     7e-18    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 75.1 bits (184), Expect = 7e-18
Identities = 50/188 (26%), Positives = 81/188 (43%), Gaps = 8/188 (4%)

Query: 3 RTILITGASSGFGAMTARALARAGHTVFASMRDPSARGGAAAAEMEALARDEGVVLKPIA 62
+ ITGA+ G G AR LA G + A +P ++ + + E +
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKL-----EKVVSSLKAEARHAEAFP 63

Query: 63 LDVTSDGSAEAAIRRILGEAGRLDVLIHNAGHMGFGPAEAFSPEQLTQLYDVNVVGTQRV 122
DV + + RI E G +D+L++ AG + G + S E+ + VN G
Sbjct: 64 ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNA 123

Query: 123 NRAALPHMRSLGRAQMIWVGSSSTRGGTPPF-LAPYFAAKAGMDALAQSYALELARFGIE 181
+R+ +M ++ VGS+ G P +A Y ++KA + LELA + I
Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNP--AGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIR 181

Query: 182 TTIVVPGA 189
IV PG+
Sbjct: 182 CNIVSPGS 189


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll1981   DHBDHDRGNASE  104    4e-29    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 104 bits (261), Expect = 4e-29
Identities = 80/249 (32%), Positives = 120/249 (48%), Gaps = 12/249 (4%)

Query: 19 RTAIVTGASKGIGAAIAQRLARDGLAVVVNYARGRAEADAVRGAIEAGGGKAIAVQADIA 78
+ A +TGA++GIG A+A+ LA G A + + + V +++A A A AD+
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQG-AHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVR 67

Query: 79 DPTGIATLFDAGEKAFGGVDILVNNAGIMKLSPIAGTDDASFDAQIAVNLGGVFRGTREG 138
D I + E+ G +DILVN AG+++ I D ++A +VN GVF +R
Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 139 AKRLRD--GGRIVNFSSSVVGLYQPGYGVYAATKAAVEAMTHILAKELGARRVTVNAVAP 196
+K + D G IV S+ G+ + YA++KAA T L EL + N V+P
Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187

Query: 197 GPVETA----LFMDGKSATQI-----EAIGKMIPLGRLGQPDDIAGVVSFLAGPDSGWVN 247
G ET L+ D A Q+ E IPL +L +P DIA V FL +G +
Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHIT 247

Query: 248 GQIIRANGG 256
+ +GG
Sbjct: 248 MHNLCVDGG 256


75  mlr2294  mll2301  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mlr2294   -1      14       -0.413802  hypothetical protein
mlr2295   -1      12       -0.351115  transporter
mlr2296   -1      12       -1.182782  hydrolase
mll2297   0       13       -2.080046  hypothetical protein
mlr2298   0       16       -1.653500  hypothetical protein
mll2299   -1      17       -0.405217  two-component response regulator
mll2300   -2      16       -0.853183  response regulator
mll2301   -2      15       -0.432948  sensory transduction regulatory protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
mlr2294   ECOLIPORIN  27     0.035    E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 27.2 bits (60), Expect = 0.035
Identities = 17/34 (50%), Positives = 21/34 (61%), Gaps = 2/34 (5%)

Query: 15 LLLVLPALLLATSAEARTIYYGNKVGMELTIVKK 48
L LV+PALL A +A A IY NK G +L + K
Sbjct: 6 LALVIPALLAAGAAHAAEIY--NKDGNKLDLYGK 37


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
mll2299   HTHFIS  78     4e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 77.9 bits (192), Expect = 4e-20
Identities = 31/125 (24%), Positives = 58/125 (46%), Gaps = 5/125 (4%)

Query: 1 MSNGAVLVVEDEQLILLDVESALEEAGFEVVAAHNAAKALAAFDAEPGKFKGLVTDIRLG 60
M+ +LV +D+ I + AL AG++V NAA A G +VTD+ +
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDL--VVTDVVMP 58

Query: 61 AGKSGWDVARHLRQANPTIPVIYMSGDSAIHWGAEGVPESVM--ITKPFFLPQIITALST 118
++ +D+ +++A P +PV+ MS + + + + KPF L ++I +
Sbjct: 59 -DENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117

Query: 119 LLNQQ 123
L +
Sbjct: 118 ALAEP 122


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
mll2300   HTHFIS  59     2e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 59.1 bits (143), Expect = 2e-13
Identities = 34/127 (26%), Positives = 55/127 (43%), Gaps = 5/127 (3%)

Query: 6 KAVILIVEDSAIIRMGAVDLVVHAGYEALEASNADEAIRLLEARTDIVLVFTDVGMPGTM 65
A IL+ +D A IR + AGY+ SNA R + A LV TDV MP
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD-GDLVVTDVVMPD-E 60

Query: 66 DGVKLAHHIRNRWPPVKLIVASGRSIIEQ--SRLPEGS-QFFPKPYSDVTIVEQMRRMLS 122
+ L I+ P + ++V S ++ +G+ + PKP+ ++ + R L+
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 123 PIETRRK 129
+ R
Sbjct: 121 EPKRRPS 127


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mll2301   PF06580  46     9e-08    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 46.4 bits (110), Expect = 9e-08
Identities = 38/185 (20%), Positives = 75/185 (40%), Gaps = 21/185 (11%)

Query: 128 ARADARQKDDLIRDKAILMQEVQ---HRVANSLQIIASVLMQSARRVQSEETRGHLHDAH 184
A D + + ++ ++ + Q H + N+L I +++++ + A
Sbjct: 147 AEIDQWKMASMAQEAQLMALKAQINPHFMFNALNNIRALILEDPTK------------AR 194

Query: 185 NRVMSIAAVQRH-LAQSGVENVSLRTYFTQLSESLGASMISDKDRLSIVVSVDDSVVKSD 243
+ S++ + R+ L S VSL T + L + I +DRL ++ +++
Sbjct: 195 EMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQ 254

Query: 244 ISISLGLIVTELVINALKHAF-PQERSGKISVDYRSHGQDWTLSVRDDGIGMGDSAAKAG 302
+ ++V LV N +KH + GKI + TL V + G + K
Sbjct: 255 V---PPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKN-TKES 310

Query: 303 LGTGI 307
GTG+
Sbjct: 311 TGTGL 315


76  mll2357  mll2374  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mll2357   -1      10       -0.250386  hypothetical protein
mlr2358   -1      11       0.526725   hypothetical protein
mlr2359   0       10       0.765799   hypothetical protein
mlr2360   0       10       0.859244   hypothetical protein
mlr2361   0       11       1.070443   phosphoprotein phosphatase
mlr2363   1       10       0.977830   serine/threonine kinase
mlr2364   1       9        0.142256   hypothetical protein
mlr2365   2       9        0.330181   hypothetical protein
mlr2366   1       12       -0.405960  hypothetical protein
mll2368   1       14       -1.258643  hypothetical protein
mlr2370   1       17       -2.367413  hypothetical protein
mll2372   1       18       -3.073930  hypothetical protein
mll2374   0       19       -3.183616  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll2357   ICENUCLEATIN  44     2e-06    Ice nucleation protein signature.
		>ICENUCLEATIN#Ice nucleation protein signature.

Length = 1258

Score = 44.0 bits (103), Expect = 2e-06
Identities = 50/248 (20%), Positives = 84/248 (33%)

Query: 503 LLIKHDRNKTVQHDQSDRIDHDAKHSVGHNLDEDVGNNKTVKIGVDQTTNIGSNDTETVG 562
L + +T Q + + + + G+ G T T G ++T
Sbjct: 896 LTAGYGSTQTAQENSDLTTGYGSTSTAGYESSLIAGYGSTQTASFKSTLMAGYGSSQTAR 955

Query: 563 ANRSLTVMANETIHVVANSTENIDASHSQTVGLVQTVTVGAARVDTVGATETRSVGAAQM 622
SLT T +S+ +QT G T+T G T + T + G
Sbjct: 956 EQSSLTAGYGSTSMAGYDSSLIAGYGSTQTAGYQSTLTAGYGSTQTAEHSSTLTAGYGST 1015

Query: 623 NTIGASRSVTVGAGQSHDIGADDGWNVEASQSIEIGADQGLKIGGAQNTEIGKTWTVKVG 682
T GA S+ G G S G ++ G L G + G+ ++ G
Sbjct: 1016 ATAGADSSLIAGYGSSLTSGIRSFLTAGYGSTLISGLRSVLTAGYGSSLISGRRSSLTAG 1075

Query: 683 QDASTQIDGAHELKIGKKSLTQVGEDAGMVVGKNLTIEAKDSITIKTGSAEILMKKDGTI 742
++ L G +S G + ++ GK + A T+ +G+ + M +
Sbjct: 1076 YGSNQIASHRSSLIAGPESTQITGNRSMLIAGKGSSQTAGYRSTLISGADSVQMAGERGK 1135

Query: 743 TIKGKDIT 750
I G D T
Sbjct: 1136 LIAGADST 1143



Score = 42.8 bits (100), Expect = 5e-06
Identities = 48/217 (22%), Positives = 71/217 (32%), Gaps = 2/217 (0%)

Query: 530 GHNLDEDVGNNKTVKIGVDQTTNIGSNDTETVGANRSLTVMANETIHVVANSTENIDASH 589
G L G+ +T + D T GS T T GAN SL T NS
Sbjct: 509 GSTLTAGYGSTQTAQNESDLITGYGS--TSTAGANSSLIAGYGSTQTASYNSVLTAGYGS 566

Query: 590 SQTVGLVQTVTVGAARVDTVGATETRSVGAAQMNTIGASRSVTVGAGQSHDIGADDGWNV 649
+QT +T G T G+ + G T S+T G G +
Sbjct: 567 TQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTT 626

Query: 650 EASQSIEIGADQGLKIGGAQNTEIGKTWTVKVGQDASTQIDGAHELKIGKKSLTQVGEDA 709
+ GAD L G G + G ++ +L G S + G D+
Sbjct: 627 GYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADS 686

Query: 710 GMVVGKNLTIEAKDSITIKTGSAEILMKKDGTITIKG 746
++ G T A + + G ++G+ G
Sbjct: 687 SLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSG 723



Score = 39.7 bits (92), Expect = 5e-05
Identities = 62/328 (18%), Positives = 108/328 (32%), Gaps = 25/328 (7%)

Query: 461 GNATQSGWKSNSSKGGG-----GYNELMFEDKAGSELVNFQAQKDHHLLIKHDRNKTVQH 515
G+ + +G+ S+ G G GYN ++ ++ AQ++ L + T +
Sbjct: 869 GSTSTAGYDSSLIAGYGSTQTAGYNSILTAGYGSTQ----TAQENSDLTTGYGSTSTAGY 924

Query: 516 DQSDRIDHDAKHSVGHNLDEDVGNNKTVKIGVDQTTNIGSNDTETVGANRSLTVMANETI 575
+ S + + + G + + G T G + SL T
Sbjct: 925 ESSLIAGYGSTQTASFKSTLMAGYGSSQTAREQSSLTAGYGSTSMAGYDSSLIAGYGSTQ 984

Query: 576 HVVANSTENIDASHSQTVGLVQTVTVGAARVDTVGATETRSVGAAQMNTIGASRSVTVGA 635
ST +QT T+T G T GA + G T G +T G
Sbjct: 985 TAGYQSTLTAGYGSTQTAEHSSTLTAGYGSTATAGADSSLIAGYGSSLTSGIRSFLTAGY 1044

Query: 636 GQSHDIGADDGWNVEASQSIEIGADQGLK---------------IGGAQNTEIGKTWTVK 680
G + G S+ G L I G ++T+I ++
Sbjct: 1045 GSTLISGLRSVLTAGYGSSLISGRRSSLTAGYGSNQIASHRSSLIAGPESTQITGNRSML 1104

Query: 681 VGQDASTQIDGAHELKI-GKKSLTQVGEDAGMVVGKNLTIEAKDSITIKTGSAEILMKKD 739
+ S+Q G I G S+ GE ++ G + T A D + G+ L D
Sbjct: 1105 IAGKGSSQTAGYRSTLISGADSVQMAGERGKLIAGADSTQTAGDRSKLLAGNNSYLTAGD 1164

Query: 740 GTITIKGKDITVKGSGKINVDASSDIVM 767
+ G D + + + A + ++
Sbjct: 1165 RSKLTAGNDCILMAGDRSKLTAGINSIL 1192



Score = 36.7 bits (84), Expect = 4e-04
Identities = 49/256 (19%), Positives = 80/256 (31%), Gaps = 12/256 (4%)

Query: 497 AQKDHHLLIKHDRNKTVQHDQSDRIDHDAKHSVGHNLDEDVGNNKTVKIGVDQTTNIGSN 556
A + L+ + +T + + + + + + D G T G + + G
Sbjct: 490 AGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYG 549

Query: 557 DTETVGANRSLTVMANETIHVVANSTENIDASHSQTVGLVQTVTVGAARVDTVGATETRS 616
T+T N LT T S + T G ++ G T + +
Sbjct: 550 STQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLT 609

Query: 617 VGAAQMNTIGASRSVTVGAGQSHDIGADDGWNVEASQSIEIGADQGLKIGGAQNTEIGKT 676
G T +T G G + GAD + G + L G G T
Sbjct: 610 AGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAG------YGST 663

Query: 677 WTVKVGQD-----ASTQIDGAHELKIGKKSLTQV-GEDAGMVVGKNLTIEAKDSITIKTG 730
T + G D ST GA I TQ G ++ + G T A++ + +G
Sbjct: 664 QTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSG 723

Query: 731 SAEILMKKDGTITIKG 746
+ I G
Sbjct: 724 YGSTSTAGADSSLIAG 739



Score = 36.3 bits (83), Expect = 5e-04
Identities = 47/235 (20%), Positives = 75/235 (31%), Gaps = 2/235 (0%)

Query: 497 AQKDHHLLIKHDRNKTVQHDQSDRIDHDAKHSVGHNLDEDVGNNKTVKIGVDQTTNIGSN 556
A D ++ + +T + S + + + G T G D + G
Sbjct: 586 AGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYG 645

Query: 557 DTETVGANRSLTVMANETIHVVANSTENIDASHSQTVGLVQTVTVGAARVDTVGATETRS 616
T+T G N LT T S + T G ++ G T G +
Sbjct: 646 STQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSILT 705

Query: 617 VGAAQMNTIGASRSVTVGAGQSHDIGADDGWNVEASQSIEIGADQGLKIGGAQNTEIGKT 676
G T +T G G + GAD + S + + G +T+ +
Sbjct: 706 AGYGSTQTAQEGSDLTSGYGSTSTAGADSSL-IAGYGSTQTASYHSSLTAGYGSTQTARE 764

Query: 677 WTVKVGQDASTQIDGAHELKIGKKSLTQV-GEDAGMVVGKNLTIEAKDSITIKTG 730
+V ST GA I TQ G + + G T A++ + TG
Sbjct: 765 QSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDLTTG 819



Score = 35.9 bits (82), Expect = 7e-04
Identities = 41/210 (19%), Positives = 65/210 (30%), Gaps = 2/210 (0%)

Query: 538 GNNKTVKIGVDQTTNIGSNDTETVGANRSLTVMANETIHVVANSTENIDASHSQTVGLVQ 597
G T + G T+T LT T A+S+ +QT G
Sbjct: 739 GYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHS 798

Query: 598 TVTVGAARVDTVGATETRSVGAAQMNTIGASRSVTVGAGQSHDIGADDGWNVEASQSIEI 657
+T G T + G +T GA S+ G G + G + +
Sbjct: 799 ILTAGYGSTQTAQERSDLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTA 858

Query: 658 GADQGLKIGGAQNTEIGKTWTVKVGQDASTQIDGAHE-LKIGKKSLTQVGEDAGMVVGKN 716
+ L G + G ++ G STQ G + L G S E++ + G
Sbjct: 859 QENSDLTTGYGSTSTAGYDSSLIAGYG-STQTAGYNSILTAGYGSTQTAQENSDLTTGYG 917

Query: 717 LTIEAKDSITIKTGSAEILMKKDGTITIKG 746
T A ++ G + + G
Sbjct: 918 STSTAGYESSLIAGYGSTQTASFKSTLMAG 947



Score = 35.9 bits (82), Expect = 8e-04
Identities = 58/279 (20%), Positives = 90/279 (32%), Gaps = 19/279 (6%)

Query: 497 AQKDHHLLIKHDRNKTVQHDQSDRIDHDAKHSVGHNLDEDVGNNKTVKIGVDQTTNIGSN 556
A ++ + + +T + + + G + G T G D + G
Sbjct: 218 AGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYG 277

Query: 557 DTETVGANRSLTVMANETIHVVANSTENIDASHSQTVGLVQTVTVGAARVDTVGATETRS 616
T+T LT T A+S+ +QT G T T G T +
Sbjct: 278 STQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLT 337

Query: 617 VGAAQMNTIGASRSVTVGAGQSHDIGADDGWNVEASQSIEIGADQGLKIGGAQNTEIGKT 676
G T G S+ G G + G D +S + G+ Q + G G T
Sbjct: 338 AGYGSTGTAGDDSSLIAGYGSTQTAGED------SSLTAGYGSTQTAQKGSDLTAGYGST 391

Query: 677 WTVKVGQDASTQIDGAHELKIGKKSLTQVGEDAGMVVGKNLTIEAKDSITIKTGSAEILM 736
T G D+S L G S GE++ G T A+ + G
Sbjct: 392 GT--AGADSS--------LIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGT 441

Query: 737 KKDGTITIKGKDITVKGSGKINVDA---SSDIVMKGSNI 772
D + I G T ++ A S+ KGS++
Sbjct: 442 AGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDL 480



Score = 34.7 bits (79), Expect = 0.002
Identities = 43/236 (18%), Positives = 70/236 (29%), Gaps = 4/236 (1%)

Query: 497 AQKDHHLLIKHDRNKTVQHDQSDRIDHDAKHSVGHNLDEDVGNNKTVKIGVDQTTNIGSN 556
A D L+ + +T D S + + + D G T G + + G
Sbjct: 442 AGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYG 501

Query: 557 DTETVGANRSLTVMANETIHVVANSTENIDASHSQTVGLVQTVTVGAARVDTVGATETRS 616
T+T G +LT T S + T G ++ G T +
Sbjct: 502 STQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLT 561

Query: 617 VGAAQMNTIGASRSVTVGAGQSHDIGADDGWNVEASQSIEIGADQGLKI--GGAQNTEIG 674
G T +T G G + G+D + L G Q
Sbjct: 562 AGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQ 621

Query: 675 KTWTVKVGQDASTQIDGAHELKIGKKSLTQVGEDAGMVVGKNLTIEAKDSITIKTG 730
T G ++ D + L G S G ++ + G T A++ + G
Sbjct: 622 SVLTTGYGSTSTAGADSS--LIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAG 675



Score = 34.0 bits (77), Expect = 0.003
Identities = 47/239 (19%), Positives = 75/239 (31%), Gaps = 18/239 (7%)

Query: 497 AQKDHHLLIKHDRNKTVQHDQSDRIDHDAKHSVGHNLDEDVGNNKTVKIGVDQTTNIGSN 556
A + L+ + +T ++ + + + D G T G D + G
Sbjct: 538 AGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYG 597

Query: 557 DTETVGANRSLTVMANETIHVVANSTENIDASHSQTVGLVQTVTVGAARVDTVGATETRS 616
T+T + SLT T QT +T G T GA +
Sbjct: 598 STQTASYHSSLTAGYGST----------------QTAREQSVLTTGYGSTSTAGADSSLI 641

Query: 617 VGAAQMNTIGASRSVTVGAGQSHDIGADDGWNVEASQSIEIGADQGLKIGGAQNTEIGKT 676
G T G + +T G G + + GAD L G G
Sbjct: 642 AGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYN 701

Query: 677 WTVKVGQDASTQIDGAHELKIGKKSLTQVGEDAGMVVGKNLTIEA--KDSITIKTGSAE 733
+ G ++ +L G S + G D+ ++ G T A S+T GS +
Sbjct: 702 SILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQ 760



Score = 34.0 bits (77), Expect = 0.003
Identities = 51/281 (18%), Positives = 93/281 (33%), Gaps = 25/281 (8%)

Query: 511 KTVQHDQSDRIDHDAKHSVGHNLDEDVGNNKTVKIGVDQTTNIGSNDTETVGANRSLTVM 570
+T Q + + + + G++ G T G + G T+T N LT
Sbjct: 856 QTAQENSDLTTGYGSTSTAGYDSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTG 915

Query: 571 ANETIHVVANSTENIDASHSQTVGLVQTVTVGAARVDTVGATETRSVGAAQMNTIGASRS 630
T S+ +QT T+ G T + + G + G S
Sbjct: 916 YGSTSTAGYESSLIAGYGSTQTASFKSTLMAGYGSSQTAREQSSLTAGYGSTSMAGYDSS 975

Query: 631 VTVGAGQSHDIGADDGWNVEASQSIEIGADQGLKIGGAQNTEIGKTWTVKVGQDASTQID 690
+ G G + G +++ + G+ Q + G T T G D+S
Sbjct: 976 LIAGYGSTQTAG------YQSTLTAGYGSTQTAEHSSTLTAGYGSTAT--AGADSSLIAG 1027

Query: 691 GAHELKIGKKSLTQVGEDAGMVV----------GKNLTIEAKDSITIKTGSAEI------ 734
L G +S G + ++ G +L + S+T GS +I
Sbjct: 1028 YGSSLTSGIRSFLTAGYGSTLISGLRSVLTAGYGSSLISGRRSSLTAGYGSNQIASHRSS 1087

Query: 735 LMKKDGTITIKG-KDITVKGSGKINVDASSDIVMKGSNILQ 774
L+ + I G + + + G G ++ G++ +Q
Sbjct: 1088 LIAGPESTQITGNRSMLIAGKGSSQTAGYRSTLISGADSVQ 1128



Score = 33.6 bits (76), Expect = 0.004
Identities = 38/233 (16%), Positives = 72/233 (30%), Gaps = 10/233 (4%)

Query: 523 HDAKHSVGHNLDEDVGNNKTVKIGVDQTTNIGSN--------DTETVGANRSLTVMANET 574
+ + + G+ + + G T+ G++ T+T G + LT T
Sbjct: 748 YHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGST 807

Query: 575 IHVVANSTENIDASHSQTVGLVQTVTVGAARVDTVGATETRSVGAAQMNTIGASRSVTVG 634
S + T G ++ G T G + G T + +T G
Sbjct: 808 QTAQERSDLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTG 867

Query: 635 AGQSHDIGADDGWNVEASQSIEIGADQGLKIGGAQNTEIGKTWTVKVGQDASTQIDG-AH 693
G + G D + S + + G +T+ + + ST G
Sbjct: 868 YGSTSTAGYDSSL-IAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYES 926

Query: 694 ELKIGKKSLTQVGEDAGMVVGKNLTIEAKDSITIKTGSAEILMKKDGTITIKG 746
L G S + ++ G + A++ ++ G M + I G
Sbjct: 927 SLIAGYGSTQTASFKSTLMAGYGSSQTAREQSSLTAGYGSTSMAGYDSSLIAG 979



Score = 33.2 bits (75), Expect = 0.005
Identities = 43/212 (20%), Positives = 68/212 (32%), Gaps = 8/212 (3%)

Query: 538 GNNKTVKIGVDQTTNIGSNDTETVGANRSLTVMANETIHVVANSTENIDASHSQTVGLVQ 597
+ + G T G + T G + T A+ T+ ST+ SQ G
Sbjct: 171 THQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTAGEESSQMAGYGS 230

Query: 598 T--------VTVGAARVDTVGATETRSVGAAQMNTIGASRSVTVGAGQSHDIGADDGWNV 649
T +T G T G + G T G S+T G G +
Sbjct: 231 TQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTA 290

Query: 650 EASQSIEIGADQGLKIGGAQNTEIGKTWTVKVGQDASTQIDGAHELKIGKKSLTQVGEDA 709
+ GAD L G G+ T G ++ +L G S G+D+
Sbjct: 291 GYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDS 350

Query: 710 GMVVGKNLTIEAKDSITIKTGSAEILMKKDGT 741
++ G T A + ++ G + G+
Sbjct: 351 SLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGS 382



Score = 33.2 bits (75), Expect = 0.005
Identities = 58/278 (20%), Positives = 93/278 (33%), Gaps = 43/278 (15%)

Query: 522 DHDAKHSVGHNLDEDVGNNKTVKIGVDQTTNIGSND--------TETVGANRSLT----- 568
H ++ G+ E G++ T+ G T G++ T+T G S
Sbjct: 171 THQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTAGEESSQMAGYGS 230

Query: 569 -----VMANETIHVVANSTENIDAS------HSQTVGLVQTVTVGAARVDTVGATETRSV 617
++ T + T D+S +QT G ++T G T +
Sbjct: 231 TQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTA 290

Query: 618 GAAQMNTIGASRSVTVGAGQSHDIGADDGWNVEASQSIEIGADQGLKIGGAQNTEIGKTW 677
G T GA S+ G G + G + ++Q+ G+ Q + G G T
Sbjct: 291 GYGSTGTAGADSSLIAGYGSTQTAGEE------STQTAGYGSTQTAQKGSDLTAGYGSTG 344

Query: 678 TVKVGQDASTQIDGAHELKIGKKSLTQVGEDAGMVVGKNLTIEAKDSITIKTGSAEILMK 737
T G D+S L G S GED+ + G T A+ + G
Sbjct: 345 T--AGDDSS--------LIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTA 394

Query: 738 KDGTITIKGKDITVKGSGKINVDA---SSDIVMKGSNI 772
+ I G T + A S+ KGS++
Sbjct: 395 GADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDL 432



Score = 30.5 bits (68), Expect = 0.036
Identities = 50/250 (20%), Positives = 84/250 (33%), Gaps = 32/250 (12%)

Query: 497 AQKDHHLLIKHDRNKTVQHDQSDRIDHDAKHSVGHNLDEDVGNNKTVKIGVDQTTNIGSN 556
A D L+ + +T + + + + + D G T G D + G
Sbjct: 394 AGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYG 453

Query: 557 DTETVGANRSLTVM----------ANETIHVVANSTENIDAS------HSQTVGLVQTVT 600
T+T G + SLT ++ T + ST ++S +QT G T+T
Sbjct: 454 STQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLT 513

Query: 601 VGAARVDTVGATETRSVGAAQMNTIGASRSVTVGAGQSHDIGADDGWNVEASQSIEIGAD 660
G T G +T GA+ S+ G G + + + + G+
Sbjct: 514 AGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYN------SVLTAGYGST 567

Query: 661 QGLKIGGAQNTEIGKTWTVKVGQDASTQIDGAHELKIGKKSLTQVGEDAGMVVGKNLTIE 720
Q + G G T T G D+S + G S + + G T
Sbjct: 568 QTAREGSDLTAGYGSTGT--AGSDSS--------IIAGYGSTQTASYHSSLTAGYGSTQT 617

Query: 721 AKDSITIKTG 730
A++ + TG
Sbjct: 618 AREQSVLTTG 627


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
mlr2359   OMPADOMAIN  65     1e-13    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 65.3 bits (159), Expect = 1e-13
Identities = 30/123 (24%), Positives = 55/123 (44%), Gaps = 17/123 (13%)

Query: 379 ISNQLLFASGQAELKAQFQPI---AADIAKALDAEAGPIKIVGHTDNVKPKKSSAFKSNF 435
+ + +LF +A LK + Q LD + G + ++G+TD + S A+ N
Sbjct: 217 LKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIG---SDAY--NQ 271

Query: 436 DLSVARAKAVQAMIARQLKDPSRLSVDGKGEDEPIADNATADGR---------AKNRRVD 486
LS RA++V + + ++S G GE P+ N + + A +RRV+
Sbjct: 272 GLSERRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVE 331

Query: 487 VMI 489
+ +
Sbjct: 332 IEV 334


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
mlr2363   IGASERPTASE  64     2e-12    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 63.9 bits (155), Expect = 2e-12
Identities = 55/290 (18%), Positives = 81/290 (27%), Gaps = 15/290 (5%)

Query: 407 PVVEAPKPVAETPQPDESNPPPPVQPQPEPSKPDNAATI----ETVNPPALPDAQVKPPV 462
P VE +T P PS P N I E PP P +
Sbjct: 983 PEVEKRNQTVDTTNI----TTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTE 1038

Query: 463 PAENQAPGVPPATEAR--QPAEQAPVAPPVGKPASPTVQPETTANSQAQAPPVEIKPAQN 520
+ E E V K A V+ T N AQ+ +
Sbjct: 1039 TVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTT 1098

Query: 521 PTANPPKTSAPSQAGAESKPAQQANPPATEPSA-TELVEILSKLARPEASPPPASESRLP 579
T +A E++ Q+ ++ S E E + A P P + P
Sbjct: 1099 ETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEP 1158

Query: 580 AQPTSPPVKPAAPPP-TASQPTPPTSPSEPTQTA-QPSQQPQA--PVTTRPENTEVAINV 635
T+ P T+S P + S T + P+ P TT+P + N
Sbjct: 1159 QSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNK 1218

Query: 636 PKPAVPPAKPVDEAAQRASWVRDFSGGDCFYASLTSQTASSAAIEGLATA 685
PK + + LTS ++ + A A
Sbjct: 1219 PKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKA 1268



Score = 55.5 bits (133), Expect = 8e-10
Identities = 52/306 (16%), Positives = 84/306 (27%), Gaps = 50/306 (16%)

Query: 337 LSQPRPAPAPAPASTPMPTAPRKSSRMPAVAGGLVTLAVLLGGGLYFSGILAPPPAEEKL 396
+Q + P S +A PPPA
Sbjct: 988 RNQTVDTTNITTPNNIQADVPSVPSNNEEIAR--------------VDEAPVPPPA---- 1029

Query: 397 KPLTPKPVPKPVVEAPKPVAETPQPDESNPPPPVQPQPEPSKPDNAATIETVNPPALPDA 456
P TP + V E K ++T + +E + E +K + N A
Sbjct: 1030 -PATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKA--NTQTNEVA 1086

Query: 457 QVKPPVPAENQAPGVPPATEARQPAEQAPVAPPVGKPASPTVQPETTANSQAQAPPVEIK 516
Q E Q TE ++ A K T + + +Q P + +
Sbjct: 1087 QSGSET-KETQT------TETKETATVEKE----EKAKVETEKTQEVPKVTSQVSPKQEQ 1135

Query: 517 P-AQNPTANPPKTSAPSQAGAESKPAQQANPPATEPSATELVEILSKLARPEASPPPASE 575
P A P + + P+ + +Q TE A E P +E
Sbjct: 1136 SETVQPQAEPARENDPTVN-IKEPQSQTNTTADTEQPAKET---------SSNVEQPVTE 1185

Query: 576 SRLPAQPTSPPVKPAAPPPTASQPT-------PPTSPSEPTQTAQPSQQPQAPVTTRPEN 628
S S P P +QPT P + + + P A ++ +
Sbjct: 1186 STTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRS 1245

Query: 629 TEVAIN 634
T +
Sbjct: 1246 TVALCD 1251



Score = 43.9 bits (103), Expect = 3e-06
Identities = 37/190 (19%), Positives = 54/190 (28%), Gaps = 19/190 (10%)

Query: 392 AEEKLKPLTPKPVPKPVVEAPKPVAETPQPDESNPPPPVQPQPEPSKPDNAATIETVNPP 451
+E K T V + K ET + E P V Q P + ETV P
Sbjct: 1090 SETKETQTTETKETATVEKEEKAKVETEKTQEV---PKVTSQVSPKQ----EQSETVQPQ 1142

Query: 452 ALPDAQVKPPVPAENQAPGVPPATEARQPAEQAP--VAPPVGKPASPTVQPETTANSQAQ 509
A P + P V + P + + P + T NS +
Sbjct: 1143 AEPARENDPTVNIKE-----PQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVE 1197

Query: 510 APPVEIKPAQNPT-----ANPPKTSAPSQAGAESKPAQQANPPATEPSATELVEILSKLA 564
P PT +N PK + + A + + S L ++ S
Sbjct: 1198 NPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNT 1257

Query: 565 RPEASPPPAS 574
S A
Sbjct: 1258 NAVLSDARAK 1267


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mlr2365   PF03544  31     0.012    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 31.1 bits (70), Expect = 0.012
Identities = 17/101 (16%), Positives = 26/101 (25%), Gaps = 1/101 (0%)

Query: 149 KAQRQTAAKPSTAAPDDQQPTSPAPAPAAPNDQPGDLANDLQAPPDAGEIPAGS-APKAD 207
A P + P P P P + P + P + PK D
Sbjct: 58 ADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRD 117

Query: 208 ADDPEAGWGETIDQGEAALPAFKKTAIENNTTVATVTSEYQ 248
E+ + A P + V +V S +
Sbjct: 118 VKPVESRPASPFENTAPARPTSSTATAATSKPVTSVASGPR 158


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
mll2368   OMPADOMAIN  99     1e-27    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 99.2 bits (247), Expect = 1e-27
Identities = 40/113 (35%), Positives = 59/113 (52%), Gaps = 12/113 (10%)

Query: 65 FALDSAQLDQTARAELDEFAKALKDNRLSTFSFVVEGHTDATGPDRYNQDLSQRRAQSVA 124
F + A L +A LD+ L + S VV G+TD G D YNQ LS+RRAQSV
Sbjct: 223 FNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQSVV 282

Query: 125 AFLEANGVESVRLEAIGLGKSHPRVANPYDPV------------NRRVEMRIR 165
+L + G+ + ++ A G+G+S+P N D V +RRVE+ ++
Sbjct: 283 DYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEVK 335


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
mll2374   OMPADOMAIN  87     4e-21    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 86.5 bits (214), Expect = 4e-21
Identities = 39/113 (34%), Positives = 54/113 (47%), Gaps = 11/113 (9%)

Query: 259 IYFRPASARLDAKSRPLLTEVEGVVGKC--PTLKVEVSGYTDSDGSPEANKALSERRAQA 316
+ F A L + + L ++ + V V GYTD GS N+ LSERRAQ+
Sbjct: 221 VLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQS 280

Query: 317 VAEALVAGGVPRQQISAAGHGEENPVAANDTPKNK---------ALNRRIEFS 360
V + L++ G+P +ISA G GE NPV N K A +RR+E
Sbjct: 281 VVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIE 333


77  mll2397  mlr2419  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
mll2397    2      17       -1.714782   transcriptional regulator
mlr2398    3      18       -1.805015   hypothetical protein
mlr2399    3      16       -1.706067   acetoacetate decarboxylase
mlr2400    1      18       -2.179048   3-hydroxybutyrate dehydrogenase
mlr2403    1      18       -1.676365   RND efflux membrane fusion protein
mlr2404    0      17       -1.867505   RND efflux transporter
msr2405    0      19       -1.882820   hypothetical protein
mlr2406    2      13        0.361120   hypothetical protein
mlr2407    1      12        0.625815   hypothetical protein
mlr2408    1      14        2.677616*  hypothetical protein
mll2410    0      15        3.335188   hypothetical protein
mll2411    0      13        3.658392   hypothetical protein
mlr2412   -1      13        3.547718   hypothetical protein
mll2413    2      15        3.890788   short chain dehydrogenase
mll2414    2      14        3.728147   response regulatory protein
mll2416    1      11        3.152304   serine protease
mlr2417    0      12        2.049072   serine protease
mll2418    2      13        0.477593   hypothetical protein
mlr2419    2      15        0.903710   transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mll2397   HTHTETR  67     7e-16    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 67.0 bits (163), Expect = 7e-16
Identities = 35/214 (16%), Positives = 72/214 (33%), Gaps = 18/214 (8%)

Query: 10 AEIGREKRERTRTLIVEAGAMLLAERPREGLTVDAVVEAAGVAKGTFYYHFQSIDELASA 69
A +++ + TR I++ L +++ ++ + +AAGV +G Y+HF+ +L S
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 70 VGEKLGESF-DAVLTPARLELQDPVERLTFAFTRFLEKAISDSNWARLVVQSSHSP---- 124
+ E + + L DP+ L LE +++ L+ H
Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVG 121

Query: 125 ---------TEFARGIRNNLKADIAEAIVQGRL-SLRDAELAVDIVIGIWLQVTRGILER 174
+ ++ + I L + A I+ G + L
Sbjct: 122 EMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFA 181

Query: 175 GARPELT---GQAVEAVLRALGSSQSEQRKATKK 205
+L V +L + + AT +
Sbjct: 182 PQSFDLKKEARDYVAILLEMYLLCPTLRNPATNE 215


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr2400   DHBDHDRGNASE  107    2e-30    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 107 bits (269), Expect = 2e-30
Identities = 62/253 (24%), Positives = 112/253 (44%), Gaps = 8/253 (3%)

Query: 3 KVVVVTGAASGIGKEIALTFARKGAKVVIADLDLDAAEETAREIDPAALRALGVGMDVSN 62
K+ +TGAA GIG+ +A T A +GA + D + + E+ + A A DV +
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 63 EDQVESGISRVVETFGRIDVLVSNAGVQTVAPLVEFDFDKWRKLLSIHLDGAFLTTRAAL 122
++ +R+ G ID+LV+ AGV + ++W S++ G F +R+
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 123 RQMYRQNSGSIIYMGSVHSKEASPFKAPYVTAKHGLIGLAKVVAKEGAAHGVRANVICPG 182
+ M + SGSI+ +GS + A Y ++K + K + E A + +R N++ PG
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 183 FVRTPLVEKQIPEQARELGISPEDVVKTMMLRETVD---GEFTTVQDVAETALFLAAFPS 239
T + ++ E V+K + + D+A+ LFL + +
Sbjct: 189 STETDMQWSLWADENGA-----EQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQA 243

Query: 240 NALTGQSIVVSHG 252
+T ++ V G
Sbjct: 244 GHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
mlr2403   RTXTOXIND  54     3e-10    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 54.4 bits (131), Expect = 3e-10
Identities = 32/209 (15%), Positives = 71/209 (33%), Gaps = 15/209 (7%)

Query: 77 DMGAIVKKGQKLAELSAVDYQNKVTAAEADVDAAKAALAQA--SAQEERFRILLGKGFAT 134
D +++ K +A+ + ++ +NK A ++ K+ L Q + L T
Sbjct: 239 DFSSLLHKQA-IAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQL----VT 293

Query: 135 HSQYDEALKSLQSARAQVQATEANLRIARNQLSYTQLTATDDGVVTATGA-DPGQVVAAG 193
+E L L+ + L + + + A V G VV
Sbjct: 294 QLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTA 353

Query: 194 QMVVEVSGNDAREAVFA-VATSDVTRAKLGMAVNVSLQ---GRLDIAVTGTIREISPEA- 248
+ ++ + D V A V D+ +G + ++ + G ++ I+ +A
Sbjct: 354 ETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAI 413

Query: 249 -DSATGT-YQVKVALASPPSEMRLGAVVI 275
D G + V +++ + +
Sbjct: 414 EDQRLGLVFNVIISIEENCLSTGNKNIPL 442



Score = 38.3 bits (89), Expect = 4e-05
Identities = 21/109 (19%), Positives = 37/109 (33%), Gaps = 9/109 (8%)

Query: 67 VGGRMLSRQVDMGAIVKKGQKLAELSAVDYQNKVTAAEADVDAAKAALAQASAQEERFRI 126
+ V G V+KG L +L+A AEAD +++L QA ++ R++I
Sbjct: 103 ENSIVKEIIVKEGESVRKGDVLLKLTA-------LGAEADTLKTQSSLLQARLEQTRYQI 155

Query: 127 LLG--KGFATHSQYDEALKSLQSARAQVQATEANLRIARNQLSYTQLTA 173
L + Q+ + +L + Q
Sbjct: 156 LSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQ 204


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr2404   ACRIFLAVINRP  462    e-148    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 462 bits (1189), Expect = e-148
Identities = 223/1047 (21%), Positives = 420/1047 (40%), Gaps = 64/1047 (6%)

Query: 7 LSEWAVHNRALVVFLMLICVIGGVSAYERLGRQEDPDFTVQTMVVQANWPGATTADTLKQ 66
++ + + L +I ++ G A +L + P + V AN+PGA
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 67 VTDRIEKKLEETPNLDYIKSYT-KPGQATIFVYLKESTPKRDLSDIWYQVRKKVSDIGPT 125
VT IE+ + NL Y+ S + G TI + + T D QV+ K+ P
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGT---DPDIAQVQVQNKLQLATPL 117

Query: 126 LPQGVVGP-FFNDEFGDVFGTVYGITYDG--FSAREARDFAE-TARGEFLRAPDVGKVDI 181
LPQ V ++ + V G D + + D+ + R VG V +
Sbjct: 118 LPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQL 177

Query: 182 YGDQDEKVYLNFSPQKLANLKLNLDDVLAAIARQNAVAPSGIINTPQE------NMLVDV 235
+G Q + + L KL DV+ + QN +G + N +
Sbjct: 178 FGAQYA-MRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIA 236

Query: 236 TGSLLSSDGIANLNLWI--DGRFYKLTDIAQVQRGYSDPPSKMFRINGKPAIGIGVNMRE 293
+ + + L + DG +L D+A+V+ G + + RINGKPA G+G+ +
Sbjct: 237 QTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNV-IARINGKPAAGLGIKLAT 295

Query: 294 GGNNLDFGKGLHEAAERLKQRFPVGIELNLVSDQPEVVHEAIGGFTEALVEAIVIVLVVS 353
G N LD K + L+ FP G+++ D V +I + L EAI++V +V
Sbjct: 296 GANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVM 355

Query: 354 FLSLG-FRAGLVVALSIPLVLAIVFVAMDALGISLQRISLGALIIALGLLVDDAMITIEM 412
+L L RA L+ +++P+VL F + A G S+ +++ +++A+GLLVDDA++ +E
Sbjct: 356 YLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVEN 415

Query: 413 MISKI--EEGMEKIKAATFAYTSTAFPMLTGTLITILGFLPIGFANSNTGQYCFSLFVVI 470
+ ++ E+ + +A + + ++ ++ F+P+ F +TG + I
Sbjct: 416 -VERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITI 474

Query: 471 AVALVASWFVAVVFAPVIGLSILPSHTKKAQANAEPGRFMRAFERLLGFAM--------- 521
A+ S VA++ P + ++L + + N G F F ++
Sbjct: 475 VSAMALSVLVALILTPALCATLLKPVSAEHHEN--KGGFFGWFNTTFDHSVNHYTNSVGK 532

Query: 522 --RHRWPTIAAALILFSASLYGMGFVQQQFFPTSNRPELLVTMTLPKNASIAATQAQTER 579
+ ++ + + + F P ++ L + LP A+ TQ ++
Sbjct: 533 ILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQ 592

Query: 580 LEKALAGDPDIARFSSYVGGGAIRFYLPLDVQLDNDFMAETVVVTKDLKARDRVQARLET 639
+ + S + G Q N MA V K + R+ + E
Sbjct: 593 VTDYYLKNEKANVESVFTVNG-----FSFSGQAQNAGMA--FVSLKPWEERNGDENSAEA 645

Query: 640 LFA------GSFPD---VAVRISR-LELGPPVGWPVQ-YRVSAPTTEEARQYAEQVAQTL 688
+ G D + + +ELG G+ + + + Q Q+
Sbjct: 646 VIHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMA 705

Query: 689 R-ASGLVRNVNYDWAEKNKALRIVVDQDRVRQAGLSSEELAQALNRVISGSTVTQIRDSI 747
+ +V + E ++ VDQ++ + G+S ++ Q ++ + G+ V D
Sbjct: 706 AQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRG 765

Query: 748 YLVDVVARAESDERSSVEALRNLQITTPTGASVPLRELAQFQYDLDDGYVWRRGRLPTIT 807
+ + +A++ R E + L + + G VP + + R LP++
Sbjct: 766 RVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSME 825

Query: 808 VQAEPLPGLQPASVHGRIAGAIEGLRKSMPAGTLLETGGTVEKSAQSNAALLAQFPLMIT 867
+Q E PG + G +E L +PAG + G + S A +
Sbjct: 826 IQGEAAPG----TSSGDAMALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFV 881

Query: 868 LMLTVLMVQLGSFRQLAMVISVAPLGLIGVAAALLTTNTPMGFIATLGIIALAGMIIRNS 927
++ L S+ V+ V PLG++GV A N +G++ G+ +N+
Sbjct: 882 VVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNA 941

Query: 928 VILVHQIEH-ERAQGIEPWKAVIDATTHRFRPIMLTAAAAILGMIPIMHDVFWG-----P 981
+++V + +G +A + A R RPI++T+ A ILG++P+ G
Sbjct: 942 ILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNA 1001

Query: 982 MAFAIVGGLAVATVLTLVFLPALYVAV 1008
+ ++GG+ AT+L + F+P +V +
Sbjct: 1002 VGIGVMGGMVSATLLAIFFVPVFFVVI 1028


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll2410   VACCYTOTOXIN  27     0.037    Helicobacter pylori vacuolating cytotoxin signature.
		>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature.

Length = 1291

Score = 27.3 bits (60), Expect = 0.037
Identities = 19/69 (27%), Positives = 31/69 (44%), Gaps = 8/69 (11%)

Query: 6 VNSSGRLAHSPIIAFKRMGA----SPQQGEKTMFNTVKTAALSALIGLGALTAVPAHADS 61
+ + R + P+++ +GA +PQQ F TV + A++G G T S
Sbjct: 3 IQQTHRKINRPLVSLALVGALVSITPQQSHAAFFTTV---IIPAIVG-GIATGAAVGTVS 58

Query: 62 LYLGFGNNQ 70
LG+G Q
Sbjct: 59 GLLGWGLKQ 67


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
mlr2412   OMPADOMAIN  71     4e-15    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 70.7 bits (173), Expect = 4e-15
Identities = 28/114 (24%), Positives = 51/114 (44%), Gaps = 14/114 (12%)

Query: 639 FEFGSSSISDTEVQKLEGVASAMEKLLKKNPAETFLIEGHTDAVGTPEANLALSDRRAEA 698
F F +++ L+ + S + L K+ + + G+TD +G+ N LS+RRA++
Sbjct: 223 FNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVV--VLGYTDRIGSDAYNQGLSERRAQS 280

Query: 699 VAEALTNAFGIPPENLTTQGYGEQY-----------LKVNTQAPNRENRRVAIR 741
V + L + GIP + ++ +G GE + +RRV I
Sbjct: 281 VVDYLI-SKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIE 333


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll2413   DHBDHDRGNASE  65     2e-14    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 65.1 bits (158), Expect = 2e-14
Identities = 51/209 (24%), Positives = 88/209 (42%), Gaps = 22/209 (10%)

Query: 3 LKGKTLFISGGSRGIGLAIALRAARDGANVTIAAKTAEPHPKLPGTIYSAAQEIEQAGGK 62
++GK FI+G ++GIG A+A A GA++ E K+ ++ + A+ E
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEA---- 61

Query: 63 ALPVLCDIREEAQVAEAVAKTVEKFGGIDICVNNASAIQLTGTLQTDMKRYDLMHQINTR 122
P D+R+ A + E A+ + G IDI VN A ++ + ++ +N+
Sbjct: 62 -FPA--DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNST 118

Query: 123 GTFLVSKMCIPHLKLADNPHILNLA------PPLDMKAKWFKNHVAYTMAKFGMSMCTLG 176
G F S+ ++ + I+ + P M AY +K M T
Sbjct: 119 GVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMA--------AYASSKAAAVMFTKC 170

Query: 177 MSAEFAKDGIAVNSLWPISTIDTAAVRNL 205
+ E A+ I N + P ST +T +L
Sbjct: 171 LGLELAEYNIRCNIVSPGST-ETDMQWSL 198


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll2416   DNABINDINGHU  27     0.042    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 27.0 bits (60), Expect = 0.042
Identities = 14/55 (25%), Positives = 20/55 (36%), Gaps = 13/55 (23%)

Query: 89 VTAEEAVEA-GEEIELTLASGVTV------------KAELVGRDPSTGVALLKPA 130
+ AV+A + LA G V +A GR+P TG + A
Sbjct: 20 KDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGRNPQTGEEIKIKA 74


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
mlr2417   V8PROTEASE  81     2e-19    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 81.2 bits (200), Expect = 2e-19
Identities = 44/211 (20%), Positives = 77/211 (36%), Gaps = 30/211 (14%)

Query: 49 TTVADAVDRIGPAVCRIERIGGQGGH-GSGFVIAPDGLVVTNFHVV----GDARTVRV-- 101
+ D + V I+ G SG V+ D ++TN HVV GD ++
Sbjct: 77 HQITDTTNGHYAPVTYIQVEAPTGTFIASGVVVGKD-TLLTNKHVVDATHGDPHALKAFP 135

Query: 102 ------SMPDGASSEGRVLGRDPDTDIALV--------RADGSFTDVAPLGDSKRLRRGQ 147
+ P+G + ++ + D+A+V + G A + ++ + Q
Sbjct: 136 SAINQDNYPNGGFTAEQITKYSGEGDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQ 195

Query: 148 IAIAIGNPLGFEWTVTSGVVSALGRSMRASTGRLIDDVIQTDAALNPGNSGGPLVSSAGE 207
G P + + S T L + +Q D + GNSG P+ + E
Sbjct: 196 NITVTGYPGDKPV-------ATMWESKGKIT-YLKGEAMQYDLSTTGGNSGSPVFNEKNE 247

Query: 208 VIGVNTAMIHGAQGIAFAVASNTANFVISEI 238
VIG++ + A + N NF+ I
Sbjct: 248 VIGIHWGGVPNEFNGAVFINENVRNFLKQNI 278


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mlr2419   HTHTETR  65     3e-15    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 65.0 bits (158), Expect = 3e-15
Identities = 33/108 (30%), Positives = 53/108 (49%), Gaps = 1/108 (0%)

Query: 8 RSNRDRTEATRADLIAAARKLFTEKSYAETGTPEIVTAAGVTRGALYHHFADKQALFAAV 67
R + + TR ++ A +LF+++ + T EI AAGVTRGA+Y HF DK LF+ +
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 68 VEQEAQAVAQ-EIERASPSSLEARDALIAGSDAYLDAMRAPGRTRLLL 114
E + + E+E + + L L++ R RLL+
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLM 110


78  mll2582  mll2590  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mll2582   -2      12       1.244107   metalloprotease transporter
mll2583   -1      12       1.292552   metalloprotease transporter
mll2585    0      13       1.259597   endo-1,3-1,4-beta-glycanase
mll2586    2      14       2.294084   hypothetical protein
mlr2587    1      14       2.972306   hypothetical protein
mlr2588    2      14       3.108763   hypothetical protein
mll2589    0      11       2.151571   transporter
mll2590   -1      11       1.905962   secretion protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
mll2582   RTXTOXIND  281    2e-92    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 281 bits (721), Expect = 2e-92
Identities = 85/426 (19%), Positives = 171/426 (40%), Gaps = 3/426 (0%)

Query: 17 IRRVAFAGYAATALLVGCFGYWAVSAPLSGAVITQGTISATGGNILIQHPEGGIIEALLV 76
RR Y LV F +V + G ++ +G + I+ E I++ ++V
Sbjct: 54 SRRPRLVAYFIMGFLVIAFI-LSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIV 112

Query: 77 HDGDRVQQAQDLIVLDPTAAQAELNRLTRQSIALRAAAARLEAERDGLDR--LAPITKPA 134
+G+ V++ L+ L A+A+ + + R R + ++ L + P
Sbjct: 113 KEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPD 172

Query: 135 PASLQADFEALIREQQKEFDARLARFRSEQSILAQRVAMHRQSVVGLQSQKEAVQQQAEI 194
Q E + + + +++++ + R + + ++ + + +
Sbjct: 173 EPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRV 232

Query: 195 VKKELGIKTGLLDKGLTNRTEYSQLLRSEADLVGQAGALEANLAAANTQIVEAQEQIERL 254
K L + LL K + + + V + ++ L ++I+ A+E+ + +
Sbjct: 233 EKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLV 292

Query: 255 TTQRVEEALTKLDEVRSNLADVEEQIRAAEAVLRRTTIKAPAAGIVVSSTYNTKGSVIAR 314
T E L KL + N+ + ++ E + + I+AP + V +T+G V+
Sbjct: 293 TQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTT 352

Query: 315 GEKIMEILPTAPGLIVDARLRPKDVDQVRVGQPAKLRLSALNMRLTPEVSATVAQISADR 374
E +M I+P L V A ++ KD+ + VGQ A +++ A + V I+ D
Sbjct: 353 AETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDA 412

Query: 375 LIDEATHEPYFRVKLRIAADLPPGVKPGQLYPGTPVEAFINTGDRTFFEYLVRPMLDSFA 434
+ D+ + + L G K L G V A I TG R+ YL+ P+ +S
Sbjct: 413 IEDQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVISYLLSPLEESVT 472

Query: 435 RAFTER 440
+ ER
Sbjct: 473 ESLRER 478


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
mll2585   CABNDNGRPT  56     2e-10    NodO calcium binding signature.
		>CABNDNGRPT#NodO calcium binding signature.

Length = 479

Score = 55.7 bits (134), Expect = 2e-10
Identities = 41/208 (19%), Positives = 61/208 (29%), Gaps = 19/208 (9%)

Query: 31 LYGSSGNDSFYGASG----VNVTMHGGTGDDIYYLYAAGNKVAEAAGAGIDTISTW---M 83
++ D + + G D + + N+ + +
Sbjct: 273 FNSNTDRDFYTATDSSKALIFSVWDAGGTDTFDFSGYSNNQRINLNEGSFSDVGGLKGNV 332

Query: 84 SYKLPDNVENLVVTNANNYAFGNGLDNIITAKAGHQTLDGGAGNDVLIDGGGGYDTFIVS 143
S +EN + + N+ GN DNI+ AG+ L GGAG D L GG G DTF+
Sbjct: 333 SIAHGVTIENAIGGSGNDILVGNSADNILQGGAGNDVLYGGAGADTLY-GGAGRDTFVYG 391

Query: 144 KGNGS---------DLIANFAATDTVRLNGYGFTSFDAIHSSMIQAGSNVLLNLGSGEIL 194
G S D D G SF G V+L + +
Sbjct: 392 SGQDSTVAAYDWIADFQKGIDKIDLSAFRNEGQLSFVQDQ--FTGKGQEVMLQWDAANSI 449

Query: 195 EFKDTTIDKLQPSQFELPIDKSGMQLSF 222
F + I Q
Sbjct: 450 TNLWLHEAGHSSVDFLVRIVGQAAQSDI 477


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mll2589   TCRTETB  109    8e-28    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 109 bits (273), Expect = 8e-28
Identities = 86/411 (20%), Positives = 160/411 (38%), Gaps = 27/411 (6%)

Query: 18 MCVGMFIALLDIQIVASSLQDIGGGLSAAQDQIGWVQTSYLVAEIIVIPLSGWLTRVFST 77
+C+ F ++L+ ++ SL DI + WV T++++ I + G L+
Sbjct: 19 LCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGI 78

Query: 78 RWLFTISAAGFTLASMLCGFAWSIESMIVF-RALQGLLGASMIPTVFTSSFHYFQGPRRV 136
+ L S++ S S+++ R +QG A+ V Y R
Sbjct: 79 KRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRG 138

Query: 137 YAAAVVGSIASIAPTLGPVIGGWITDTLNWHWLFYVNVIPGTIITILVAVLVKIDKADPS 196
A ++GSI ++ +GP IGG I ++W +L + +I TIIT+ L+K+ K +
Sbjct: 139 KAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMI--TIITVP--FLMKLLKKEVR 194

Query: 197 LLKGADYIGIVLMAVSLGTAEYVLEEGSRWNWFDDATIRNGAIVAGITMVLFVIRSLTYS 256
+ D GI+LM+V + F + + IV+ ++ ++FV +
Sbjct: 195 IKGHFDIKGIILMSVGIVFFML----------FTTSYSISFLIVSVLSFLIFVKHIRKVT 244

Query: 257 QPVVDFRAFGNRNFAIGCFLSFVTGIGIFATIYLTPLFLGYVRGYDALQTGLAV-FSTGV 315
P VD N F IG + + + + P + V + G + F +
Sbjct: 245 DPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTM 304

Query: 316 ASMVGVPVYIFLAKRFDTRWLMMFGLASFGASMWSFSAI---TSQWGAAELLIPQLFRGF 372
+ ++ + L R +++ G+ S + S + TS + ++ F
Sbjct: 305 SVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSF 364

Query: 373 PQVFA--VAPSVNLGLGSLPPERLKYASGLFNTMRNLGGAVGIAICGAILN 421
+ + S SL + L N L GIAI G +L+
Sbjct: 365 TKTVISTIVSS------SLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
mll2590   RTXTOXIND  102    3e-26    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 102 bits (256), Expect = 3e-26
Identities = 57/432 (13%), Positives = 129/432 (29%), Gaps = 87/432 (20%)

Query: 9 ELQPAQPSAVQIPAPAKSRRKP-IVMLGAAIAIGVAGWYGFQWWQAGRFVMSTDDAYVGG 67
E PA ++ P + R +M IA ++ Q + G
Sbjct: 40 EFLPAHLELIETPVSRRPRLVAYFIMGFLVIAFILSVLG-----QVEIVATANGKLTHSG 94

Query: 68 NVTPLAPRVAGHIDQILVEDNQHVDAGQLIIRIDDRPFKAA------------------- 108
+ P + +I+V++ + V G +++++ +A
Sbjct: 95 RSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQ 154

Query: 109 -----------------------------VERARASVQQQQSALDNLRAQVSL----QNS 135
V R + +++Q S N + Q L + +
Sbjct: 155 ILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRA 214

Query: 136 LIEQAEADLEAKSAAATFTTQDAKRYEVLASTRAGSQQ---DAQRSLAAD----GQAKAS 188
A + + + L +A ++ + + K+
Sbjct: 215 ERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQ 274

Query: 189 VSASRAGLAAARQQLDVL--------NTQISEATAAVLAAKADLDTAELDLGFTQIRAPI 240
+ + + +A+++ ++ ++ + T + +L E + IRAP+
Sbjct: 275 LEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPV 334

Query: 241 DGLVGN-RLAQVGTYVSPGSYLLTIVPQS-GLWVDANFKEDQLRRMADGQAATVYTDIAP 298
V ++ G V+ L+ IVP+ L V A + + + GQ A + + P
Sbjct: 335 SVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFP 394

Query: 299 DQ---PLKGHVSSLGPATGAIFSVIPAQNATGNFTKIVQRVPVRITIDPDQARKVALRPG 355
L G V ++ G ++ + ++ L G
Sbjct: 395 YTRYGYLVGKVKNINLDA-------IEDQRLGLVFNVIISIEENCLSTGNKNIP--LSSG 445

Query: 356 LSTVVTVDTGAH 367
++ + TG
Sbjct: 446 MAVTAEIKTGMR 457


79  mlr2803  mll2816  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
mlr2803    1      12        0.930918   3-hydroxyacyl-CoA dehydrogenase
mlr2805    2      10        0.492282   hypothetical protein
mlr2806    2       9       -0.050765   transcriptional regulator, nolR protein
mlr2808    1      11       -0.226256   hypothetical protein
mll2809    2      12       -1.085171   hypothetical protein
mll2811    0      13       -2.460624   hypothetical protein
mlr2813    0      12       -0.960859   hypothetical protein
mlr2815   -1      14       -1.401608   two-component response regulator
mll2816   -1      14       -1.892030   two-component transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr2803   DHBDHDRGNASE  82     1e-20    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 81.6 bits (201), Expect = 1e-20
Identities = 59/197 (29%), Positives = 89/197 (45%), Gaps = 15/197 (7%)

Query: 5 GQIAIVTGGGSGLGEATARALAAKGARVAIFDVGIERAAKVAADIGGISVQCDVSSAD-S 63
G+IA +TG G+GEA AR LA++GA +A D E+ KV + + + + AD
Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVR 67

Query: 64 GTAALAETASKLGEPR----ILVNCAGIAIGVKTIGKDGPHPLDQYRKVIEVNLIGTFNM 119
+AA+ E +++ ILVN AG+ G +++ VN G FN
Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVL----RPGLIHSLSDEEWEATFSVNSTGVFNA 123

Query: 120 IRLVADRAASLEPLQGGERGVIVNTASVAAYDGQIGQAAYSASKGGVVGMTLPVARDLAR 179
R V+ + G IV S A + AAY++SK V T + +LA
Sbjct: 124 SRSVSKY------MMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAE 177

Query: 180 SGIRVCTIAPGIFKTPM 196
IR ++PG +T M
Sbjct: 178 YNIRCNIVSPGSTETDM 194


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mlr2806   PF03309  29     0.003    Bvg accessory factor
		>PF03309#Bvg accessory factor

Length = 271

Score = 29.3 bits (66), Expect = 0.003
Identities = 14/37 (37%), Positives = 18/37 (48%)

Query: 42 DGEMSVGAIADKVMLSQSALSQHLAKLRALDLVETRR 78
GE GAIA V +S A + A LR ++L R
Sbjct: 144 KGEFLGGAIAPGVQVSSDAAAARSAALRRVELTRPRS 180


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
mll2809   RTXTOXIND  32     0.004    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 31.7 bits (72), Expect = 0.004
Identities = 19/139 (13%), Positives = 41/139 (29%), Gaps = 12/139 (8%)

Query: 74 LAIERTDKNHALAELSAKNEALRQREEELHRLSERLKDTERKLEKRALELEKLEQMYDDA 133
+ + + AE + + E RL D L K+A+ + +
Sbjct: 202 KYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQ---- 257

Query: 134 SFSSSSRQIELVARESELQKLASDIALLRGQRKEADRRQQEIAAESKA-ARDALKAEKKR 192
+ + V +EL+ S + + + A Q + K D L+
Sbjct: 258 -------ENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDN 310

Query: 193 VAELDKKVERLLATLADRE 211
+ L ++ +
Sbjct: 311 IGLLTLELAKNEERQQASV 329


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr2815   HTHFIS        66     4e-16    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 66.4 bits (162), Expect = 4e-16
Identities = 30/111 (27%), Positives = 51/111 (45%), Gaps = 2/111 (1%)

Query: 7 VDDSSVIRKVAKRILGGSDMVVIEAASGLDALEMCAADMPDIIVVDGALPDVQAVDLIRR 66
DD + IR V + L + V ++ AA D++V D +PD A DL+ R
Sbjct: 9 ADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLPR 68

Query: 67 VRAMESPTRPQILISLVELDIASIMRAKRAGAQGYLLKPFNRPQLLERFRN 117
++ P P +++S + + ++A GA YL KPF+ +L+
Sbjct: 69 IKKAR-PDLPVLVMS-AQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll2816   HTHFIS        79     2e-19    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.5 bits (196), Expect = 2e-19
Identities = 27/124 (21%), Positives = 58/124 (46%), Gaps = 1/124 (0%)

Query: 2 RVLLIEDDSATAQSIELMLKSESFNVYTTDLGEEGVDLGKLYDYDIILLDLNLPDMSGYE 61
+L+ +DD+A + L ++V T D D+++ D+ +PD + ++
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VLRTLRLSKVKTPILILSGMAGIEDKVRGLGFGADDYMTKPFHKDELVARI-HAIVRRSK 120
+L ++ ++ P+L++S ++ GA DY+ KPF EL+ I A+ +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 121 GHAQ 124
++
Sbjct: 125 RPSK 128


80  mll2885  mlr2892  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mll2885   -1      11       -1.266703  ABC transporter substrate-binding protein
mll2887   0       13       -0.363159  ABC transporter permease
mll2888   -1      11       0.116907   ABC transporter permease
mll2890   -1      11       0.640296   ABC transporter ATP-binding protein
mll2891   -1      11       0.748501   hypothetical protein
mlr2892   -2      12       0.135403   transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll2885   MALTOSEBP     40     7e-06    Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 40.5 bits (94), Expect = 7e-06
Identities = 42/157 (26%), Positives = 67/157 (42%), Gaps = 5/157 (3%)

Query: 8 TAMGLALLASTGLAR-AEGVLNIY---NWGNYTSPDVIKKFEDKYNIKVTITDYDSNDTA 63
+A+ + +++ LA+ EG L I+ + G +V KKFE IKVT+ D +
Sbjct: 13 SALTTMMFSASALAKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIKVTVEHPDKLEEK 72

Query: 64 LAKIRQGGTGFDIAVPSQTYVPIWIKEGLLLETDPGKMENFKNVAPEWANPEFDPGRKYS 123
++ G G DI + + + GLL E P K K W ++ G+ +
Sbjct: 73 FPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEITPDKAFQDKLYPFTWDAVRYN-GKLIA 131

Query: 124 VPWAWGTIGVVVNTDAYKGPADTWGIVFNTPDELKGK 160
P A + ++ N D P TW + ELK K
Sbjct: 132 YPIAVEALSLIYNKDLLPNPPKTWEEIPALDKELKAK 168


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll2890   PF05272       30     0.024    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.7 bits (66), Expect = 0.024
Identities = 10/30 (33%), Positives = 15/30 (50%)

Query: 43 TLLGPSGCGKTTLLRLIAGFDFPTAGEILL 72
L G G GK+TL+ + G DF + +
Sbjct: 600 VLEGTGGIGKSTLINTLVGLDFFSDTHFDI 629


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll2891   UREASE        32     0.006    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 32.0 bits (73), Expect = 0.006
Identities = 22/84 (26%), Positives = 36/84 (42%), Gaps = 8/84 (9%)

Query: 2 SATGTGHNADLIVINGRVLTMDGGNPAAEAVAVKDGAIIAVGSRA------AIEELEGTA 55
T G D ++ N +L G A + +KDG I A+G + + G
Sbjct: 60 QVTREGGAVDTVITNALILDHWGIVKAD--IGLKDGRIAAIGKAGNPDMQPGVTIIVGPG 117

Query: 56 TQVIDAKGGSVLPGFIEAHMHLFS 79
T+VI +G V G +++H+H
Sbjct: 118 TEVIAGEGKIVTAGGMDSHIHFIC 141


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr2892   HTHTETR       64     1e-14    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 63.9 bits (155), Expect = 1e-14
Identities = 26/146 (17%), Positives = 55/146 (37%), Gaps = 3/146 (2%)

Query: 18 QPRQQRSSKVVDRILDAALILTREQGTRTPTTLAIAQRAGLSVGSVYQYFPNKEAILLDL 77
+ +Q + + ILD AL L +QG + + IA+ AG++ G++Y +F +K + ++
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 78 ARRWLSSFPEVIERRIKVPRPTNREEFRREVRKLFIDTSSIYLENATLMPVI---EAISG 134
S+ E+ R + + T + + + E +
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 135 NADLRPIQDEYDEQIIALYAAWLRHV 160
A ++ Q + L+H
Sbjct: 123 MAVVQQAQRNLCLESYDRIEQTLKHC 148


81  mll2899  mlr2945  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mll2899   1       11       0.157318   flagellar biosynthesis protein FlhB
mll2901   2       12       1.143031   flagellar motor switch protein G
mll2902   3       15       1.847167   flagellar motor switch protein FliN
mll2904   1       12       1.773796   flagellar motor switch protein FliM
mll2905   0       14       1.154168   flagellar motor protein MotA
mlr2906   1       16       1.555640   hypothetical protein
mlr2907   2       15       2.117346   flagellar basal body rod protein FlgF
mlr2909   0       17       1.940764   flagellum-specific ATP synthase
mlr2910   1       20       0.656758   hypothetical protein
mlr2911   -1      21       1.209169   flagellar basal-body rod protein FlgB
mlr2912   -1      21       1.853063   flagellar basal body rod protein FlgC
mlr2913   -1      17       1.800909   flagellar hook-basal body protein FliE
mlr2915   -1      16       1.523542   flagellar basal body rod protein FlgG
mlr2917   0       15       1.209993   flagellar basal body P-ring biosynthesis protein
mlr2918   0       17       1.061303   flagellar basal body P-ring biosynthesis protein
mlr2920   3       18       -0.018130  hypothetical protein
mlr2921   4       18       0.288729   flagellar basal body L-ring protein
mlr2923   4       19       0.448773   hypothetical protein
mlr2924   3       20       1.211434   flagellar biosynthesis protein FliP
mlr2925   2       19       1.989380   flagellin FlaA
mlr2927   0       13       2.412656   flagellin FlaA
mlr2928   -1      12       2.736736   flagellar MS-ring protein
mlr2930   -1      14       2.523024   hypothetical protein
mlr2931   -1      14       1.975682   flagellar motor protein MotB
mlr2932   -2      15       1.557318   chemotaxis MOTC protein
mlr2933   -1      14       0.934282   chemotaxis protein MotD
mlr2934   -1      18       0.032247   hypothetical protein
mlr2935   0       18       0.013079   two-component response regulator
mlr2937   1       16       -0.174668  flagellar hook protein FlgE
mlr2938   3       16       -0.183734  flagellar hook-associated protein FlgK
mlr2939   3       16       0.812313   flagellar hook-associated protein FlgL
mlr2940   2       15       0.924870   flagellar biosynthesis regulatory protein FlaF
mlr2941   1       14       2.066167   flagellar biosynthesis repressor FlbT
mlr2942   1       14       2.636218   flagellar basal body rod modification protein
msr2943   1       15       2.729035   flagellar biosynthesis protein FliQ
mlr2945   1       15       2.888434   integral membrane efflux protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll2899   TYPE3IMSPROT  287    2e-97    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 287 bits (737), Expect = 2e-97
Identities = 96/352 (27%), Positives = 175/352 (49%), Gaps = 9/352 (2%)

Query: 8 DSKTEEATEKKVRDTIEQGKLPHSRETAILASFVAILVFTVFYAKDAVINLGMFLSMFLE 67
KTE+ T KK+RD ++G++ S+E A VA+ + + + + + E
Sbjct: 3 GEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPAE 62

Query: 68 K---PEAWPMNTETDVIDLYKIVMLEVGRAVVSLLVLLTVAGIGASVLQNMPQFVGERIR 124
+ P + ++ D V+LE LL + + I + V+Q GE I+
Sbjct: 63 QSYLPFSQALSYVVDN------VLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIK 116

Query: 125 PQLSRISITKGWNRMFGAQGWVEFLKSLAKVGFAIAVLTFTLSEDHSKLLAGMITNPVAF 184
P + +I+ +G R+F + VEFLKS+ KV ++ + + LL
Sbjct: 117 PDIKKINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECI 176

Query: 185 GLVIRGIAVDILVAIVFVMGLIAAIDIVWSRFHWKQDLRMTKQEVKDEFKQSEGDPIVKS 244
++ I ++V +I+ D + + + ++L+M+K E+K E+K+ EG P +KS
Sbjct: 177 TPLLGQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKS 236

Query: 245 RLRSLARDRARKRMMTAVPRATLIIANPTHFSIALKYVRDEDSAPLVVAKGQDLVALKIR 304
+ R ++ + M V R+++++ANPTH +I + Y R E PLV K D +R
Sbjct: 237 KRRQFHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVR 296

Query: 305 EIAKEHNIPIFEDVALARSMYKQVSVDNVIPSQFYQAVAELVRIVYSKKAER 356
+IA+E +PI + + LAR++Y VD+ IP++ +A AE++R + + E+
Sbjct: 297 KIAEEEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQNIEK 348


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll2901   FLGMOTORFLIG  171    1e-52    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 171 bits (434), Expect = 1e-52
Identities = 75/331 (22%), Positives = 155/331 (46%), Gaps = 13/331 (3%)

Query: 4 LTTLTRAQKAAAILVAMGKPSASRLLKFFKQEELKALIEGARLLRTIPQSDLERIVAEFE 63
++ LT QKAA +LV++G +S++ K+ QEE+++L L TI + ++ EF+
Sbjct: 12 VSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFK 71

Query: 64 AEFTEGAGLLD-SADRMDTILNESLSPEEMSAIMGNKKPEAAPEGPPPIWPDLEKLEPSR 122
+ D +L +SL ++ I+ N + + + +P+
Sbjct: 72 ELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSR----PFEFVRRADPAN 127

Query: 123 LGTFLAGEHPQTAAMVLSKLAPQAAASVLLTLTKPMRGEIIKRMVTMANVPDAAARIVEN 182
+ F+ EHPQT A++LS L PQ A+ +L +L ++ + +R+ M R VE
Sbjct: 128 ILNFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVER 187

Query: 183 RLR---ASVLAETSTKDTSAGQARVASVLNELDKPLLEEVMQDLEAAGTPDLDG-VRARL 238
L AS+ +E T + G V ++N D+ + +++ LE P+L ++ ++
Sbjct: 188 VLEKKLASLSSEDYTS--AGGVDNVVEIINMADRKTEKFIIESLEEE-DPELAEEIKKKM 244

Query: 239 FAFDDLPLLTQKARVLLFDGLSTELVTLALRGTSAALAEAVLSAIGARSRRMIEAELGQG 298
F F+D+ LL ++ + + + + AL+ + E + + R+ M++ ++ +
Sbjct: 245 FVFEDIVLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDM-EF 303

Query: 299 SEGIPLADIMTARKTIVTTTIRLSREGAFEL 329
D+ +++ IV+ +L +G +
Sbjct: 304 LGPTRRKDVEESQQKIVSLIRKLEEQGEIVI 334


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll2902   FLGMOTORFLIN  85     2e-24    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 85.0 bits (210), Expect = 2e-24
Identities = 36/94 (38%), Positives = 60/94 (63%), Gaps = 10/94 (10%)

Query: 36 DAAFRAAAG-------ANSNVIMNIPVDVQIILGSTEMAVADLMALQKGSTVALNRRIGE 88
DA F+ G + ++IM+IPV + + LG T M + +L+ L +GS VAL+ GE
Sbjct: 36 DAVFQQLGGGDVSGAMQDIDLIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGE 95

Query: 89 PVDVVVNGRKIARGEITVLESDPSRFGIRLTEII 122
P+D+++NG IA+GE+ V+ ++G+R+T+II
Sbjct: 96 PLDILINGYLIAQGEVVVVA---DKYGVRITDII 126


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll2904   FLGMOTORFLIM  48     2e-08    Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 48.0 bits (114), Expect = 2e-08
Identities = 54/267 (20%), Positives = 107/267 (40%), Gaps = 12/267 (4%)

Query: 35 MAERAAPLLQKSLTSELGVPVTVDLRAVEVSRVAE-ARSRAGDTFAMTIVASSTSSDAMT 93
M E A L SL+++L V V + +V+ E RS + I +A+
Sbjct: 58 MHETFARLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAV- 116

Query: 94 LVIDAPAIAIMVCTLFGGDPETPASPIERDLSQIEVDVSTMLFQQVAQALNGSGRRSLDL 153
L +D ++ LFGG + ++RDL+ IE V + ++ + S + +DL
Sbjct: 117 LEVDPSITFSIIDRLFGGTGQAAK--VQRDLTDIENSVMEGVIVRILANVRESWTQVIDL 174

Query: 154 RLPVPRAMSGTEAKRHVLRDGAALRIVLGISTPADSGTVTVTMPQ------RIVLASRDS 207
R + + + + + V + + L + G + +P L+S+
Sbjct: 175 RPRLGQIETNPQFAQIVPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFW 234

Query: 208 AASTAGDDHGPSWRERFSEEVMRSTVALEATMPLARLTLGDLASLEVGQVIDFDET-AQS 266
+S + +++ + + A + RL++ D+ L VG +I +T
Sbjct: 235 FSSV-RRSSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGD 293

Query: 267 RARLSARGQTLFVCEFGKLGQNYTVRI 293
LS + F+C+ G +G+ +I
Sbjct: 294 PFVLSIGNRKKFLCQPGVVGKKIAAQI 320


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr2907   FLGHOOKAP1    31     0.004    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 31.1 bits (70), Expect = 0.004
Identities = 8/32 (25%), Positives = 18/32 (56%)

Query: 3 DSLYVALSAQMALERRLDTIADNVANANTVGF 34
+ A+S A + L+T ++N+++ N G+
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGY 33


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr2909   PF05272       30     0.016    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.4 bits (68), Expect = 0.016
Identities = 21/55 (38%), Positives = 25/55 (45%), Gaps = 3/55 (5%)

Query: 158 QRVGTAFMTGVKVIDIFTPLCFGQRMGVFAG-SGVGKSTLLAMLAGADAF-DTVV 210
Q VG + G V + P C V G G+GKSTL+ L G D F DT
Sbjct: 574 QLVGKYILMGH-VARVMEPGCKFDYSVVLEGTGGIGKSTLINTLVGLDFFSDTHF 627


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr2912   FLGHOOKAP1    30     0.003    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 29.9 bits (67), Expect = 0.003
Identities = 10/38 (26%), Positives = 19/38 (50%)

Query: 97 NVNVLIEMADMTEANRSYEANLQVVKQARDLISMTIDL 134
VN+ E ++ + Y AN QV++ A + I++
Sbjct: 508 GVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545



Score = 28.4 bits (63), Expect = 0.009
Identities = 15/71 (21%), Positives = 24/71 (33%), Gaps = 11/71 (15%)

Query: 5 TAALKVAASGLGAQSERLRVVSENLANAQSTGTTPGADPYRRKTISFVSELDRASGAST- 63
++ + A SGL A L S N+++ G Y R+T
Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAG-------YTRQTTIMAQANSTLGAGGWV 53

Query: 64 ---VEVNSIDR 71
V V+ + R
Sbjct: 54 GNGVYVSGVQR 64


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr2913   FLGHOOKFLIE   32     2e-04    Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature.

Length = 103

Score = 31.6 bits (71), Expect = 2e-04
Identities = 18/73 (24%), Positives = 33/73 (45%), Gaps = 2/73 (2%)

Query: 37 TSFAEAVSQAASKTVNTLQNAEQVSLQALKG--DADTRQVVDAVMSAQQALQTAVAIRDK 94
SFA + A + +T A + + G V+ + A ++Q + +R+K
Sbjct: 31 ISFAGQLHAALDRISDTQTAARTQAEKFTLGEPGVALNDVMTDMQKASVSMQMGIQVRNK 90

Query: 95 VVSAYLEVSRMGI 107
+V+AY EV M +
Sbjct: 91 LVAAYQEVMSMQV 103


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr2915   FLGHOOKAP1    42     2e-06    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 41.9 bits (98), Expect = 2e-06
Identities = 19/80 (23%), Positives = 30/80 (37%), Gaps = 15/80 (18%)

Query: 4 LAIAATGMNAQQTNLEVIANNIANINTTGYKRARAEFSDLLYQVDRTQGVPNRSNASLVP 63
+ A +G+NA Q L +NNI++ N GY R S +
Sbjct: 4 INNAMSGLNAAQAALNTASNNISSYNVAGYTRQT---------------TIMAQANSTLG 48

Query: 64 EGVSIGLGVKTTAVRNVHTQ 83
G +G GV + V+ +
Sbjct: 49 AGGWVGNGVYVSGVQREYDA 68



Score = 39.9 bits (93), Expect = 7e-06
Identities = 10/41 (24%), Positives = 19/41 (46%)

Query: 213 TIQQGYLEASNVDPVKEITELISAQRAYEMNSKVIQAADDM 253
+ S V+ +E L Q+ Y N++V+Q A+ +
Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAI 538


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr2918   FLGPRINGFLGI  422    e-150    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 422 bits (1087), Expect = e-150
Identities = 226/354 (63%), Positives = 291/354 (82%)

Query: 9 SSLPPGQVASRIKDIAQLQSSRDNQLVGYGLVIGLAGSGDSLRNSPFTEQSIRAMLENLG 68
S+ P SRIKDIA LQ+ RDNQL+GYGLV+GL G+GDSLR+SPFTEQS+RAML+NLG
Sbjct: 20 STPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQSMRAMLQNLG 79

Query: 69 IATEGGSARAKNVAAVIVTANMPPYVQSGARIDIDVSSMGDATSLAGGTLIMTPLKAADG 128
I T+GG + AKN+AAV+VTAN+PP+ G+R+D+ VSS+GDATSL GG LIMT L ADG
Sbjct: 80 ITTQGGQSNAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNLIMTSLSGADG 139

Query: 129 EIYAVGQGAVIVSGFTAKGQAEQLTQGVPTAGRVPNGAIVERSVKAEFDDQSTLTLQLRN 188
+IYAV QGA+IV+GF+A+G A LTQGV T+ RVPNGAI+ER + ++F D L LQLRN
Sbjct: 140 QIYAVAQGALIVNGFSAQGDAATLTQGVTTSARVPNGAIIERELPSKFKDSVNLVLQLRN 199

Query: 189 PDFSTAIRIADAINDYTSQRFGMRVAGERDSRTVQIRRPKGVSAARFYAEIENLVVESDT 248
PDFSTA+R+AD +N + R+G +A RDS+ + +++P+ R AEIENL VE+DT
Sbjct: 200 PDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPRVADLTRLMAEIENLTVETDT 259

Query: 249 PARVVIDERTGTIVIGNDVKISRVAISHGTLTVRITEAPRVVQPEPFSKGETAVEPFTAI 308
PA+VVI+ERTGTIVIG DV+ISRVA+S+GTLTV++TE+P+V+QP PFS+G+TAV+P T I
Sbjct: 260 PAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQPAPFSRGQTAVQPQTDI 319

Query: 309 DATRPDARVAVLDGPDLQTLVSGLNRLGVKPDGIIAILQGIKSAGALQADLVLQ 362
A + ++VA+++GPDL+TLV+GLN +G+K DGIIAILQGIKSAGALQA+LVLQ
Sbjct: 320 MAMQEGSKVAIVEGPDLRTLVAGLNSIGLKADGIIAILQGIKSAGALQAELVLQ 373


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr2921   FLGLRINGFLGH  181    2e-59    Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 181 bits (461), Expect = 2e-59
Identities = 59/239 (24%), Positives = 97/239 (40%), Gaps = 38/239 (15%)

Query: 11 VAALSGCG-----------TNLREVGKEPSLSPVGSGIDGGNTSALYKYPEPPRAPVKKF 59
V +L+GC T+ + V P +PV +G ++++ +P +
Sbjct: 16 VLSLTGCAWIPSTPLVQGATSAQPV---PGPTPVANG-------SIFQSAQPINYGYQP- 64

Query: 60 SLWDDRQSRLFTDPRALSQGDILTVRIKINDRANFKNQNDRSRTANRKLGFDLSAQ---- 115
LF D R + GD LT+ ++ N A+ + + SR GFD +
Sbjct: 65 ---------LFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQG 115

Query: 116 WDKWSTAGKGAGALNSATDTTADGEIKRSETLELNVAAIVTDVLPNGNLMITGSQEVRVN 175
+ A A G S T + V VL NGNL + G +++ +N
Sbjct: 116 LFGNARADVEAS---GGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAIN 172

Query: 176 AELRVLTIAGIVRPADIGAENTIPYERIAEARISYGGRGRISEIQQPAYGQQVLDQVLP 234
+ +G+V P I NT+P ++A+ARI Y G G I+E Q + Q+ + P
Sbjct: 173 QGTEFIRFSGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSP 231


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr2924   FLGBIOSNFLIP  264    7e-92    Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature.

Length = 245

Score = 264 bits (677), Expect = 7e-92
Identities = 102/244 (41%), Positives = 153/244 (62%), Gaps = 3/244 (1%)

Query: 1 MKKFLLAAALIGAATSVAAAQQLD--LGGIGKADGATVGYIIQMFGLLTVLSVAPGLLIM 58
M++ L A ++ + A QL G + +Q +T L+ P +L+M
Sbjct: 1 MRRLLSVAPVLLWLITPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLM 60

Query: 59 VTSFTRFVIAFSILRAGIGLQSTPANLILISLSLFMTFYVMAPTFDQAWNTGVKPLMDNQ 118
+TSFTR +I F +LR +G S P N +L+ L+LF+TF++M+P D+ + +P + +
Sbjct: 61 MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEK 120

Query: 119 ISQTEAFEKISDPFRTFMLHNVRDKDFDLFADLARERGQVVAKETVDLRILVPAFMISEI 178
IS EA EK + P R FML R+ D LFA LA G + E V +RIL+PA++ SE+
Sbjct: 121 ISMQEALEKGAQPLREFMLRQTREADLGLFARLANT-GPLQGPEAVPMRILLPAYVTSEL 179

Query: 179 RRGFEIGFLIVLPFLVIDLIVATITMAMGMMMLPPTVVSLPFKILFFVLIDGWNLLVGSL 238
+ F+IGF I +PFL+IDL++A++ MA+GMMM+PP ++LPFK++ FVL+DGW LLVGSL
Sbjct: 180 KTAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSL 239

Query: 239 VRSF 242
+SF
Sbjct: 240 AQSF 243


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr2925   FLAGELLIN     72     5e-16    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 71.6 bits (175), Expect = 5e-16
Identities = 48/319 (15%), Positives = 105/319 (32%), Gaps = 3/319 (0%)

Query: 4 IMTNSAALTALQSLNATQNNLSTTQARISTGYRVSQASDNAAYWSIATTMRSDNQAMSTV 63
I TNS +L +LN +Q++LS+ R+S+G R++ A D+AA +IA S+ + ++
Sbjct: 4 INTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQA 63

Query: 64 SDSLGLGASKVDTAYTGMNSAITTINAIQQKLTGSY--GQTDAAKEKTQVEIAALQQQLK 121
S + G S T +N + +++ + +D+ + Q EI +++
Sbjct: 64 SRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEID 123

Query: 122 GYADSATFSGTNMLSVSTASGTAADVKIVSAFNRSSTGSVSLSTIDVNVESIKLYDSGAA 181
++ F+G +LS + + ++ ++ ++
Sbjct: 124 RVSNQTQFNGVKVLSQDNQMKIQVGANDGETI-TIDLQKIDVKSLGLDGFNVNGPKEATV 182

Query: 182 PTAKGLLDTARLGTTGAATTTAQAPTLGAAPAAGDTYSVASLAIFSGTTAASDAQISQMM 241
K T A + + DT + A
Sbjct: 183 GDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAE 242

Query: 242 TVVDAALKDMTTAATKLGAAKSSIDLQKTFTQSLMDSIDRGVGQLVDADMNKESTRLQAL 301
L T + AK+ K + + N + ++
Sbjct: 243 NNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTT 302

Query: 302 QVQQQLGVQALSIANGSSQ 320
+++ + I G++
Sbjct: 303 INGEKVTLTVADITAGAAN 321



Score = 67.0 bits (163), Expect = 2e-14
Identities = 52/328 (15%), Positives = 103/328 (31%), Gaps = 2/328 (0%)

Query: 2 ASIMTNSAALTALQSLNATQNNLSTTQARISTGYRVSQASDNAAYWSIATTMRSDNQAMS 61
A++ ++ + + + + +++G V+ + + +
Sbjct: 180 ATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTD 239

Query: 62 TVSDSLGLGASKVDTAYTGMNSAITTINAIQQKLTGSYGQTDAAKEKTQVEIAALQQQLK 121
++ + K + G A AI+ G +
Sbjct: 240 DAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKV 299

Query: 122 --GYADSATFSGTNMLSVSTASGTAADVKIVSAFNRSSTGSVSLSTIDVNVESIKLYDSG 179
++ A+ AA ++ S ES KL D
Sbjct: 300 STTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLE 359

Query: 180 AAPTAKGLLDTARLGTTGAATTTAQAPTLGAAPAAGDTYSVASLAIFSGTTAASDAQISQ 239
A KG G A TL D + + + AA+ +
Sbjct: 360 ANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTAN 419

Query: 240 MMTVVDAALKDMTTAATKLGAAKSSIDLQKTFTQSLMDSIDRGVGQLVDADMNKESTRLQ 299
+ +D+AL + + LGA ++ D T + + +++ ++ DAD E + +
Sbjct: 420 PLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMS 479

Query: 300 ALQVQQQLGVQALSIANGSSQSILSLFR 327
Q+ QQ G L+ AN Q++LSL R
Sbjct: 480 KAQILQQAGTSVLAQANQVPQNVLSLLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr2927   FLAGELLIN     74     2e-16    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 73.5 bits (180), Expect = 2e-16
Identities = 54/323 (16%), Positives = 105/323 (32%), Gaps = 19/323 (5%)

Query: 4 IMTNAAALTALQSLNATNKSLEHTQSRISTGYRVSEASDNAAYWSIATTMRSDNQALSTV 63
I TN+ +L +LN + SL R+S+G R++ A D+AA +IA S+ + L+
Sbjct: 4 INTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQA 63

Query: 64 QDALGLGASKVDTAYTGMNNVLTSIGQLKTKLL--STIGQTAAAKAKTQTEITTLQAQMK 121
G S T +N + ++ +++ + + + + Q EI ++
Sbjct: 64 SRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEID 123

Query: 122 SFADAATFSGSNYLSVTSTQ-----------VAAPNDGVQANAKIVASFNRSSSGAISLG 170
++ F+G LS + + + + + FN + ++G
Sbjct: 124 RVSNQTQFNGVKVLSQDNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEATVG 183

Query: 171 TIDINVESTKLFDNGLSTAVKNQGTLDRKTSVYATAAAQNLYDTAYAAAIAGGGTDIAAN 230
+ + ++ +D A K + ++ V T A AA TD A N
Sbjct: 184 DLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAEN 243

Query: 231 ------TAGQTAAGAVVPRIDNVSAFNLDITAAGVTDDIITQMIGKIDKVMSQLTDQATI 284
+ A +T I TI
Sbjct: 244 NTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTTI 303

Query: 285 LGAAKSSIDLQKTFTQSLMDSID 307
G + T + +D+
Sbjct: 304 NGEKVTLTVADITAGAANVDAAT 326



Score = 71.2 bits (174), Expect = 1e-15
Identities = 48/299 (16%), Positives = 93/299 (31%), Gaps = 8/299 (2%)

Query: 61 STVQDALGLGASKVDTAYTGMNNVLTSIGQLKTKLLSTIGQTAAAKAKTQTEITTLQAQM 120
V N LT+ + T + + + +
Sbjct: 212 GAVVTDTTAPTVPDKVYVNAANGQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKG 271

Query: 121 KSFADAATFSGSNYLSVTSTQVAAPNDGVQANAKIVASFNRSSSGAISLGTIDINVESTK 180
D + G + T + + + I + I+ G +++ + +
Sbjct: 272 GKEGDTFDYKGV---TFTIDTKTGNDGNGKVSTTINGEKVTLTVADITAGAANVDAATLQ 328

Query: 181 LFDNGLSTAVKNQGTLDRKT-----SVYATAAAQNLYDTAYAAAIAGGGTDIAANTAGQT 235
N ++ V Q T D KT + A + + T AA
Sbjct: 329 SSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESKITVNGAEYTANAAGDKVTL 388

Query: 236 AAGAVVPRIDNVSAFNLDITAAGVTDDIITQMIGKIDKVMSQLTDQATILGAAKSSIDLQ 295
A + L A + ID +S++ + LGA ++ D
Sbjct: 389 AGKTMFIDKTASGVSTLINEDAAAAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSA 448

Query: 296 KTFTQSLMDSIDRGVGQLVDADMNKESTRLQALQVQQQLGIQALSIANSSSQSILSLFK 354
T + + +++ ++ DAD E + + Q+ QQ G L+ AN Q++LSL +
Sbjct: 449 ITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQILQQAGTSVLAQANQVPQNVLSLLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr2928   FLGMRINGFLIF  376    e-126    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 376 bits (966), Expect = e-126
Identities = 141/568 (24%), Positives = 231/568 (40%), Gaps = 61/568 (10%)

Query: 9 ISNLRGFGVKRLAMLAGIAVLVMGVIGIASVYLNRPAYDTLYVGLDRADVNQIGLVLGEA 68
+ L L + ++ ++ P Y TL+ L D I L +
Sbjct: 13 LEWLNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQM 72

Query: 69 GIGFDVGSDGTSVLVPAGTTAQARMLLAEKGLPTSANAGYELFDNVGAMGLTSFMQQITR 128
I + + ++ VPA + R+ LA++GLP G+EL D G++ F +Q+
Sbjct: 73 NIPYRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQ-EKFGISQFSEQVNY 131

Query: 129 VRALEGEIARTIQSISGIKAARVHIVMSERANFRRDEQQPSASVVIR-YAGIDAEKS-AQ 186
RALEGE+ARTI+++ +K+ARVH+ M + + F R+++ PSASV + G ++
Sbjct: 132 QRALEGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALDEGQIS 191

Query: 187 SIRHLVAAAVPGLSADKVTVLDSNGNLLAAGDDPSNTSAARTLGVEQTVEAQIGDNIRRA 246
++ HLV++AV GL VT++D +G+LL + L VE++I I
Sbjct: 192 AVVHLVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDLNDAQLKFANDVESRIQRRIEAI 251

Query: 247 LTPYLGPDNFRASVKAEVNTDTRQTEETIFDPNSRVERSVQSVRANENSNQKQASTPASV 306
L+P +G N A V A+++ ++ E + PN ++ R S Q A P V
Sbjct: 252 LSPIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVGAGYPGGV 311

Query: 307 -------------------------EQNLPETQATSTEGPQTTSANDRKEEITNYEINSK 341
QN P+T TST + ++ E +NYE++
Sbjct: 312 PGALSNQPAPPNEAPIATPPTNQQNAQNTPQTS-TSTNSNSAGPRSTQRNETSNYEVDRT 370

Query: 342 KIATVSNGYTVTKMSIAVVVNQDRLKTILGKDATPEQIAKRVAEIQKMVTSATGLDDKRG 401
T N + ++S+AVVVN L T +Q+ + I+ + A G DKRG
Sbjct: 371 IRHTKMNVGDIERLSVAVVVNYKTLADGKPLPLTADQMKQ----IEDLTREAMGFSDKRG 426

Query: 402 DVIDVSAVEF--IDGLDGEAI--PQAGMLDSIGQHAGTLINAGAFIVVVFLVAFFGLRPM 457
D ++V F +D GE Q +D + L +VV +++ +RP
Sbjct: 427 DTLNVVNSPFSAVDNTGGELPFWQQQSFIDQLLAAGRWL----LVLVVAWILWRKAVRPQ 482

Query: 458 AAALTARATPALSGPNFDEVQRSLPTPEAAASADAGAAIGALPGSRPGTNPLDDLRQKIR 517
A A E A A+ L R R
Sbjct: 483 LTRRVEEAKAA---------------QEQAQVRQETEE--AVEVRLSKDEQLQQRRANQR 525

Query: 518 ---PAPQERLARMVDLNEERTAQILRKW 542
+R+ M D + A ++R+W
Sbjct: 526 LGAEVMSQRIREMSDNDPRVVALVIRQW 553


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr2931   FLGHOOKFLIK   34     0.001    Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 34.4 bits (78), Expect = 0.001
Identities = 29/128 (22%), Positives = 42/128 (32%), Gaps = 6/128 (4%)

Query: 198 SQQVAAPAAEASAQRPKIEGDPLKPGDKAAESQV-AKVKAVPPAPPVKDAPLEPLAES-- 254
+Q +A A + D L A+ S + A + P V DAP L
Sbjct: 101 AQTMALAAVADKNTTKDEKADDLNEDVTASLSALFAMLPGFDNTPKVTDAPSTVLPTEKP 160

Query: 255 ---GKDAAATMAKAGETKASGAKASDAKTGDAKTEDSKAVAAAKADTAAAAQETAAPEAG 311
K + + A A G A A+ + V + + AAA P
Sbjct: 161 TLFTKLTSEQLTTAQPDDAPGTPAQPLTPLVAEAQSKAEVISTPSPVTAAASPLITPHQT 220

Query: 312 EKPPTAAA 319
+ PT AA
Sbjct: 221 QPLPTVAA 228


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr2935   HTHFIS        33     8e-04    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 32.9 bits (75), Expect = 8e-04
Identities = 22/116 (18%), Positives = 43/116 (37%), Gaps = 11/116 (9%)

Query: 2 IVIVDERELVTEGYNSLFDREGVACAGFAPGEFGEWVNSAADTDLRSVRAFLIGDCR--- 58
I++ D+ + N R G + +A D DL ++ D
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDL------VVTDVVMPD 59

Query: 59 -DGAISPRQIRDR-TGAPVIALSEQHSLENTLRLFESGVDDVIRKPVHIREILARI 112
+ +I+ PV+ +S Q++ ++ E G D + KP + E++ I
Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGII 115


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr2937   FLGHOOKAP1    45     6e-07    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 44.6 bits (105), Expect = 6e-07
Identities = 14/43 (32%), Positives = 26/43 (60%)

Query: 375 LENSNVDIAEELTDMIAAQRSYTANSKVFQTGSDLMDVLVNLK 417
S V++ EE ++ Q+ Y AN++V QT + + D L+N++
Sbjct: 504 QSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546



Score = 35.3 bits (81), Expect = 4e-04
Identities = 11/32 (34%), Positives = 20/32 (62%)

Query: 9 TGVSGMNAQANRLSTTADNIANSDTTGYKRSS 40
+SG+NA L+T ++NI++ + GY R +
Sbjct: 6 NAMSGLNAAQAALNTASNNISSYNVAGYTRQT 37


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr2938   FLGHOOKAP1    66     1e-13    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 65.7 bits (160), Expect = 1e-13
Identities = 63/319 (19%), Positives = 122/319 (38%), Gaps = 27/319 (8%)

Query: 4 SSALSIAQSALMSTARQTSVVTRNVSDASNPDYARRIAVVTSTAP----------GARSV 53
SS ++ A S L + + + N+S + Y R+ ++ G
Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS 60

Query: 54 DIQRVANDLLFRQNLGALSAYSGQNALYSGMDQLDVSVNGVDNASSPSTAIANLQQALQL 113
+QR + + Q A + SG A Y M ++D ++ + SS +T + + +LQ
Sbjct: 61 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLS--TSTSSLATQMQDFFTSLQT 118

Query: 114 YATSPSNQNLGSSVIDAAKQVVRSLNEGSKAIQDFRTQTDGQIDTAVKDLNSLLGQFQDA 173
++ + ++I ++ +V + ++D Q + I +V +N+ Q
Sbjct: 119 LVSNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASL 178

Query: 174 NQAV--ISGTRSGTDVSDALDQRDALLKKIADYIPISTFTRGDNDMVITTGDGTTLFETI 231
N + ++G +G ++ LDQRD L+ ++ + + + IT +G +L
Sbjct: 179 NDQISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSL---- 234

Query: 232 PRTVSFTPAAGYAAGAPGNTIYIDNVPVSAGSGGNTTA------SGKLAGLLQLRDGVAS 285
V + A AA V G+ GN +G L G+L R
Sbjct: 235 ---VQGSTARQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLD 291

Query: 286 TMQSQLDETARGLITAFAE 304
++ L + A AF
Sbjct: 292 QTRNTLGQLALAFAEAFNT 310



Score = 39.2 bits (91), Expect = 3e-05
Identities = 14/67 (20%), Positives = 35/67 (52%), Gaps = 4/67 (5%)

Query: 415 LQSMRQQASTAADAKEALAQRSSEALSNATGVNVDQEMSLMLDLEHTYQASARMMKTVDD 474
+++ ++T + L+ + +GVN+D+E + + Y A+A++++T +
Sbjct: 482 TATLKTSSATQGNVVTQLSNQQQSI----SGVNLDEEYGNLQRFQQYYLANAQVLQTANA 537

Query: 475 MLTALLN 481
+ AL+N
Sbjct: 538 IFDALIN 544


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
msr2943   TYPE3IMQPROT  56     2e-14    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 56.3 bits (136), Expect = 2e-14
Identities = 21/77 (27%), Positives = 39/77 (50%)

Query: 5 DALDIVQYAVWTVLTASAPVVLVAMAVGIGIALIQALTQIQEITLTFVPKIVAIMLVVAL 64
D + A++ VL S +VA +G+ + L Q +TQ+QE TL F K++ + L + L
Sbjct: 3 DLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFL 62

Query: 65 TGPFIGGQISAFTNVIF 81
+ G + ++ +
Sbjct: 63 LSGWYGEVLLSYGRQVI 79


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr2945   TCRTETA       43     3e-06    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 42.9 bits (101), Expect = 3e-06
Identities = 49/261 (18%), Positives = 85/261 (32%), Gaps = 11/261 (4%)

Query: 69 MLFALVAGAIADSFDRRKVMLVAQTFMLVVSVLLTVFTYYNLLTPWTL--LAFTFLIDSG 126
A V GA++D F RR +L+VS+ Y + T L L ++
Sbjct: 57 FACAPVLGALSDRFGRR--------PVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGI 108

Query: 127 TALNSPSWQASVGDMVPRNKVPAAVALNSMGFNLTRSVGPAIGGIIVAAAGAAAAFAANA 186
T A + D+ ++ S F GP +GG++ + A FAA A
Sbjct: 109 TGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAA 168

Query: 187 VSYIGLIIVMARWKPDVPVSTLPRETLGAAMGAGLRYVAMSPNIGKVLVRGAAFGFSAGA 246
++ + + P A R+ + ++
Sbjct: 169 LNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQV 228

Query: 247 VLALLPLVARDVVKGDALTYGIMLGSFGI-GAVGGALISVRLRQLLSSETMVRCAFAGFA 305
AL + D DA T GI L +FGI ++ A+I+ + L +
Sbjct: 229 PAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADG 288

Query: 306 VCAFNAALSHHAWQTSLGLLV 326
A + W +++
Sbjct: 289 TGYILLAFATRGWMAFPIMVL 309



Score = 37.9 bits (88), Expect = 1e-04
Identities = 35/158 (22%), Positives = 63/158 (39%), Gaps = 13/158 (8%)

Query: 245 GAVLALLPLVARDVVKGD--ALTYGIMLGSFGIGAVGGALISVRLRQLLSSETMVRCAFA 302
G ++ +LP + RD+V + YGI+L + + A + L ++ + A
Sbjct: 22 GLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLA 81

Query: 303 GFAVCAFNAALSHHAWQTSLGLLVGG---ACWVIALSHF-NVTVQMATPRWVVGRVLSVY 358
G AV A + W +G +V G A +A ++ ++T R +
Sbjct: 82 GAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHF------GF 135

Query: 359 QTATFG-GIALGSWIWGVVADAHGAETALIAAAIAMLA 395
+A FG G+ G + G++ AAA+ L
Sbjct: 136 MSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLN 173



Score = 30.6 bits (69), Expect = 0.017
Identities = 36/173 (20%), Positives = 61/173 (35%), Gaps = 12/173 (6%)

Query: 9 EGVSALAPFRHGIFRAVWSASLVSNFG-GLIQGVGAA-WMMTTIATSPYQVALVQASTT- 65
E ++ LA FR V +A + F L+ V AA W++ + + S
Sbjct: 195 EALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAA 254

Query: 66 ---LPIMLFALVAGAIADSFDRRKVMLVAQTFMLVVSVLLTVFTYYNLLTPWTLLAFTFL 122
L + A++ G +A R+ +++ +LL T L
Sbjct: 255 FGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRG----WMAFPIMVLL 310

Query: 123 IDSGTALNSPSWQASVGDMVPRNKVPAAVALNSMGFNLTRSVGPAIGGIIVAA 175
G + P+ QA + V + + +LT VGP + I AA
Sbjct: 311 ASGGIGM--PALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAA 361


82  mll2997  mlr3004  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mll2997   -1      10       2.660295   short chain dehydrogenase
mlr2998   -1      10       2.437973   transcriptional regulator
mll2999   0       10       2.441184   hypothetical protein
mll3001   1       10       1.949774   response regulatory protein
mlr3002   1       11       2.004508   oligoendopeptidase F
mlr3004   3       7        2.040093   NADH dehydogenase-like protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll2997   DHBDHDRGNASE  80     6e-20    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.
Length = 261

Score = 79.7 bits (196), Expect = 6e-20
Identities = 57/234 (24%), Positives = 89/234 (38%), Gaps = 21/234 (8%)

Query: 8 NVLITGANKGIGLETARRLAAMGFNVWLGARDAERGEAAAKALRNEGLDVEWLALDVASD 67
ITGA +GIG AR LA+ G ++ + E+ E +L+ E E DV
Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDS 69

Query: 68 DSVAAAARTLTARVSSLDVLVNNAGIA-PGYVDALGPDGRYERAPSRENVADMKATFDVN 126
++ + + +D+LVN AG+ PG + +L + + +ATF VN
Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDE-------------EWEATFSVN 116

Query: 127 VFGPVRVTQAFLPLLLAAPAARIVMVSSYLGSIARAAGNSQSPNVMGYGSSKTALNAITV 186
G +++ ++ + IV V S ++ Y SSK A T
Sbjct: 117 STGVFNASRSVSKYMMDRRSGSIVTVGS-------NPAGVPRTSMAAYASSKAAAVMFTK 169

Query: 187 AFARELSPRGMMVNAAAPGYTATDLNAHRGTRTVQQAAEIIVQLAALKAGGPTG 240
EL+ + N +PG T TD+ I L K G P
Sbjct: 170 CLGLELAEYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLK 223


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
mll2999   TONBPROTEIN  33     0.003    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 32.7 bits (74), Expect = 0.003
Identities = 28/128 (21%), Positives = 49/128 (38%), Gaps = 11/128 (8%)

Query: 276 IAVVAPENAQRANVPQIADEQAPEPEKDTPETIIAALPAKEIPLPDFAPRPKADVGAQPE 335
+ +V P + + Q E EPE + P KE P+ P+PK P+
Sbjct: 47 VTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPE---PPKEAPVVIEKPKPK------PK 97

Query: 336 NVPFAMADATATTEQAVATAQAPANMPFGKADPAALAAAAAAADPAQ--VAINNIPVPTW 393
P + ++ V ++ PF PA L ++ A A ++ ++ + P
Sbjct: 98 PKPKPVKKVQEQPKRDVKPVESRPASPFENTAPARLTSSTATAATSKPVTSVASGPRALS 157

Query: 394 RPERTLPA 401
R + PA
Sbjct: 158 RNQPQYPA 165


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
mll3001   HTHFIS  488    e-172    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 488 bits (1259), Expect = e-172
Identities = 169/503 (33%), Positives = 262/503 (52%), Gaps = 25/503 (4%)

Query: 3 GSILIVDDDPVQRRLLEAAVTRFGHTAIVVDGGEAGLDALDGPGARDICVVILDLVMPGL 62
+IL+ DDD R +L A++R G+ + + A D +V+ D+VMP
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIA---AGDGDLVVTDVVMPDE 60

Query: 63 DGIGVLKAMRERDITVPVIVQTAQGGIETVVLAMRHGAFDFVVKPASPDRLQASIANALK 122
+ +L +++ +PV+V +AQ T + A GA+D++ KP L I AL
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL- 119

Query: 123 VEAVEGEVKRTSRKRGGLLTFRDMITHSPAMDRVIRLGQKAAGSSIPILIEGESGVGKEL 182
+ + ++ S AM + R+ + + + ++I GESG GKEL
Sbjct: 120 -AEPKRRPSKLEDDS---QDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKEL 175

Query: 183 VARAIQGSGDRRSKPFVTVNCGAIPDNLVESILFGHEKGSFTGATDKHTGKFVEAHSGTL 242
VARA+ G RR+ PFV +N AIP +L+ES LFGHEKG+FTGA + TG+F +A GTL
Sbjct: 176 VARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTL 235

Query: 243 FLDEIGDLPLDVQVKLLRAVQEGEVDPVGGRSTVKVDIRLISATHRNLLQQVKDGKFRED 302
FLDEIGD+P+D Q +LLR +Q+GE VGGR+ ++ D+R+++AT+++L Q + G FRED
Sbjct: 236 FLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFRED 295

Query: 303 LFYRLNVYPIFVPPLRDRRDDIPHLVTHFMEKVAPADPRRRLQGISPAALAVLEAYDWPG 362
L+YRLNV P+ +PPLRDR +DIP LV HF+++ ++ AL +++A+ WPG
Sbjct: 296 LYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQA--EKEGLDVKRFDQEALELMKAHPWPG 353

Query: 363 NIRQLENAVFRASVLCEGDVLDVDDFPQIRAQVEGTVNLETDDAAPRLSSPPELRDEGGP 422
N+R+LEN V R + L DV+ + +E A S + +E
Sbjct: 354 NVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMR 413

Query: 423 AGDDAPAAEPDRPAVLQPRFGTLRALDERGNVRALADVELEMIKLAIDHYNGQMSEVARR 482
+ + R LA++E +I A+ G + A
Sbjct: 414 QYFASFGDALPPSGLYD---------------RVLAEMEYPLILAALTATRGNQIKAADL 458

Query: 483 LGIGRSTLYRKLKEYGIDPETGR 505
LG+ R+TL +K++E G+
Sbjct: 459 LGLNRNTLRKKIRELGVSVYRSS 481


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mlr3004   PF03544  33     9e-04    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 32.6 bits (74), Expect = 9e-04
Identities = 22/125 (17%), Positives = 32/125 (25%), Gaps = 3/125 (2%)

Query: 74 AQASTGGAKAASQPPAALMSTPAAAKSARAAAKAAPAKPAAKAAPAKAATAKSAAPKPAA 133
A A +A PP + P P K K
Sbjct: 56 APADLEPPQAVQPPPEP---VVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVE 112

Query: 134 AKPATSKPAATAASSAKSSAAPKPAAAKKAAPAAAKPAAGKSDNLRRLIGIGPVNEKLLK 193
KP + +S + AP + A A +KP + R L P +
Sbjct: 113 QPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQ 172

Query: 194 AQGVT 198
A +
Sbjct: 173 ALRIE 177


83  mlr3096  mlr3103  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mlr3096   1       14       0.500374   short chain dehydrogenase/reductase
mlr3097   0       13       0.295409   nodulation protein NodN
mlr3099   0       11       0.332809   hypothetical protein
mlr3100   0       12       0.647214   oxidoreductase, short chain
mlr3102   0       12       0.594314   transcriptional regulator
mlr3103   -3      14       1.743508   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr3096   DHBDHDRGNASE  107    5e-30    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.
Length = 261

Score = 107 bits (268), Expect = 5e-30
Identities = 75/261 (28%), Positives = 114/261 (43%), Gaps = 15/261 (5%)

Query: 10 VAGKTALVTGAATGIGRMAAAALVKAGASVMIASRKGEDCIKVANAFNGLGAPGRAEGFA 69
+ GK A +TGAA GIG A L GA + E KV ++ AE F
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEAR--HAEAFP 63

Query: 70 GDVSSEAGIAALVAEVKARTGKLGILINNAGVSWGAPLESFPYSAWAKVLGVNVTAVFHL 129
DV A I + A ++ G + IL+N AGV + S W VN T VF+
Sbjct: 64 ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNA 123

Query: 130 TRELLPLLEAAASDADPARVINLGSVMGTQPLADDAYSYTASKAAVHHLTRTLALEFAAR 189
+R + + S + ++ +GS P A +Y +SKAA T+ L LE A
Sbjct: 124 SRSVSKYMMDRRSGS----IVTVGSNPAGVPRTSMA-AYASSKAAAVMFTKCLGLELAEY 178

Query: 190 RITVNAFAPGPFQSRMT-AFATGTDEQAKHVGGH-------VPIGRIGAPDDIAGATLYL 241
I N +PG ++ M + + + + G +P+ ++ P DIA A L+L
Sbjct: 179 NIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFL 238

Query: 242 CSRAGSYVTGAILPIDGGQSV 262
S ++T L +DGG ++
Sbjct: 239 VSGQAGHITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr3100   DHBDHDRGNASE  103    7e-29    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.
Length = 261

Score = 103 bits (259), Expect = 7e-29
Identities = 69/256 (26%), Positives = 115/256 (44%), Gaps = 12/256 (4%)

Query: 5 DGATVLITGAAGGLGRGAAKGFASEGARLVLSDIDEKALADLAATLPAETAI---LAGNV 61
+G ITGAA G+G A+ AS+GA + D + + L + ++L AE +V
Sbjct: 7 EGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADV 66

Query: 62 ADEKLSEDLVRLAVEKFGRLDVTVNNAGIVQSFVRLPQVPSDEARRVLEIDLLGVFYAMK 121
D +++ + G +D+ VN AG+++ + + + +E ++ GVF A +
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGL-IHSLSDEEWEATFSVNSTGVFNASR 125

Query: 122 HQIPQMERQFRATAKGGAIVNIASVAGLVGAPKLSVYAAAKHGVVGLTKSAAAEYATKGV 181
M + + G+IV + S V ++ YA++K V TK E A +
Sbjct: 126 SVSKYMMDR-----RSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNI 180

Query: 182 RINAICPAHTRTAMVDSFVRASGAPE---AEALAELTRGVPMKRVAEVDEITTAILFAAD 238
R N + P T T M S E +L G+P+K++A+ +I A+LF
Sbjct: 181 RCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVS 240

Query: 239 PANSFMTGHALAVDGG 254
+T H L VDGG
Sbjct: 241 GQAGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mlr3102   HTHTETR  72     8e-18    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 72.4 bits (177), Expect = 8e-18
Identities = 27/173 (15%), Positives = 63/173 (36%), Gaps = 9/173 (5%)

Query: 7 KRREKQKAELRSELVAAAHKLVQEEGYEGLTIRKLAKRVGYAPMSVYSYFADKQDILFAL 66
++ +++ E R ++ A +L ++G ++ ++AK G ++Y +F DK D+ +
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 67 AEDAFETLARRIEE---HPSDDPVEALQAVMTEYAAFGLGNPNEYRTVFMTEKTRPPEGQ 123
E + + E DP+ L+ ++ + + + G+
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 124 SF---QEMHEANPAMKALITR-VEACVAAGKLHG--DPRAIATMLWAVGHGTI 170
Q I + ++ C+ A L R A ++ G +
Sbjct: 123 MAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLM 175


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
mlr3103   CHANNELTSX  32     1e-04    Nucleoside-specific channel-forming protein Tsx signa...
		>CHANNELTSX#Nucleoside-specific channel-forming protein Tsx signature.
Length = 294

Score = 32.3 bits (73), Expect = 1e-04
Identities = 17/30 (56%), Positives = 20/30 (66%), Gaps = 1/30 (3%)

Query: 1 MKKTLLTLAAVLALSGSAFAASATQPVKHQ 30
MKKTLL AV+ALS + FAA A + K Q
Sbjct: 1 MKKTLLAAGAVVALS-TTFAAGAAENDKPQ 29


84  mlr3115  mll3127  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mlr3115   0       16       1.119283   short chain dehydrogenase
mlr3116   1       15       1.771135   hypothetical protein
msr3117   1       16       1.330018   30S ribosomal protein S21
mlr3120   0       13       1.521595   O-linked GlcNAc transferase
mlr3121   0       13       1.729441   hypothetical protein
mlr3122   0       14       1.937358   hypothetical protein
mlr3125   0       14       1.718810   hypothetical protein
mll3126   0       15       1.160840   two-component response regulator
mll3127   0       14       1.323932   two-component sensor KdpD
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr3115   DHBDHDRGNASE  109    7e-31    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.
Length = 261

Score = 109 bits (273), Expect = 7e-31
Identities = 74/257 (28%), Positives = 116/257 (45%), Gaps = 8/257 (3%)

Query: 6 ALANKIAIVTGASSGIGRATAKLFAEEGARVVVAARRQAELDTLVAEISDAEGTAVALAG 65
+ KIA +TGA+ GIG A A+ A +GA + +L+ +V+ + A A
Sbjct: 5 GIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPA 64

Query: 66 DVRDEAYAKALVDLAVESFGGLDVAFNNAGAVGQMGPISGLSLEGWRETLDTNLTSAFLG 125
DVRD A + G +D+ N AG V + G I LS E W T N T F
Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAG-VLRPGLIHSLSDEEWEATFSVNSTGVFNA 123

Query: 126 AKYQVPAMIERGGGSLIFTSTFVGHTIGMPGMTSYAASKAGLIGLTQVLAAEHGPQGVRV 185
++ M++R GS++ + M +YA+SKA + T+ L E +R
Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPAGVPRT-SMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 186 NALLPGGTDTPA--SITNAPDAGPEVLAFVQALH----ALKRMAQPEEIARSALYLASDA 239
N + PG T+T S+ + +V+ LK++A+P +IA + L+L S
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 240 SSFTTGTALFADGGVSI 256
+ T L DGG ++
Sbjct: 243 AGHITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr3120   SYCDCHAPRONE  48     6e-09    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.
Length = 168

Score = 47.6 bits (113), Expect = 6e-09
Identities = 23/94 (24%), Positives = 36/94 (38%)

Query: 144 YRKAGRTQDAFNDFQKAIQLDTTDARAYHNRGLIYQSQGQHKFAIEDFSTAISLAPDAAE 203
++G+ +DA FQ LD D+R + G Q+ GQ+ AI +S +
Sbjct: 46 QYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPR 105

Query: 204 PYNGRGLSYLATGDEDNAFSDFNMAIKLDGQNAE 237
L G+ A S +A +L E
Sbjct: 106 FPFHAAECLLQKGELAEAESGLFLAQELIADKTE 139



Score = 36.8 bits (85), Expect = 3e-05
Identities = 17/107 (15%), Positives = 36/107 (33%), Gaps = 3/107 (2%)

Query: 171 YHNRGLIYQSQGQHKFAIEDFSTAISLAPDAAEPYNGRGLSYLATGDEDNAFSDFNMAIK 230
++ G+++ A + F L + + G G A G D A ++
Sbjct: 39 LYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAI 98

Query: 231 LDGQNAEAWANQALIYERRGDKAKAAKSYKEAVRL---NPNYQPAKD 274
+D + + A ++G+ A+A A L ++
Sbjct: 99 MDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTEFKELST 145



Score = 30.3 bits (68), Expect = 0.005
Identities = 11/70 (15%), Positives = 23/70 (32%)

Query: 50 ENISSLSAVIQRNPQDPEGYNVRGSAYGRGGQYQAALKDFNQAIQLNPNFYQAYSNRALI 109
+ A+ + D + G+ GQY A+ ++ ++ + + A
Sbjct: 54 DAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAEC 113

Query: 110 QRFLGNQAAA 119
G A A
Sbjct: 114 LLQKGELAEA 123



Score = 28.7 bits (64), Expect = 0.019
Identities = 14/101 (13%), Positives = 31/101 (30%)

Query: 63 PQDPEGYNVRGSAYGRGGQYQAALKDFNQAIQLNPNFYQAYSNRALIQRFLGNQAAALGD 122
E + G+Y+ A K F L+ + + ++ +G A+
Sbjct: 33 SDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHS 92

Query: 123 YNKSIQINGNYDAAYIGRGNLYRKAGRTQDAFNDFQKAIQL 163
Y+ ++ + G +A + A +L
Sbjct: 93 YSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQEL 133


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mlr3125   TCRTETB  72     1e-15    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 71.8 bits (176), Expect = 1e-15
Identities = 68/450 (15%), Positives = 155/450 (34%), Gaps = 28/450 (6%)

Query: 23 FCFISLGILLHATNETMVATVMPAMVGELAGVQ-LVGWSLAIYELGAIVAGAAAGRLVSY 81
C +S NE ++ +P + + W + L + A G+L
Sbjct: 19 LCILSF---FSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ 75

Query: 82 VALRTNMVVAALLYAAGALICATSPSM-QLFLAGRLIEGLGGGALVSLAFVSVERLFSRA 140
+ ++ ++ ++ G++I S L + R I+G G A +L V V R +
Sbjct: 76 LGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKE 135

Query: 141 IWPQLFGIMSAIWGVAAFSGPLLGALMTEFLSWRWAFGVFTLGGATMALASFLVLNTPEA 200
+ FG++ +I + GP +G ++ ++ W + + + T+ FL+ +
Sbjct: 136 NRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMI---TIITVPFLMKLLKKE 192

Query: 201 KKPSTSAGRTPPFPFAALACLAVAVVLIASAGVDIALLRSSLLIVLGLAGLALFFYIDAL 260
+ F + ++V +V + S + L ++ L+ ++ +
Sbjct: 193 VR------IKGHFDIKGIILMSVGIVFF------MLFTTSYSISFLIVSVLSFLIFVKHI 240

Query: 261 RPRSRLF-PARLFSWRTPVGAGMTMVAAFSVATCSFGVYGPLLLTSLHDIPLLTTGYIIA 319
R + F L P G+ F P ++ +H + G +I
Sbjct: 241 RKVTDPFVDPGLGKNI-PFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVII 299

Query: 320 AESIAWSILSILVAN--APPQRERLIIVTGALMIAAGIAGFAYTIPLGSIPLILICALLQ 377
I+ + + ++ G ++ ++ + + + I +
Sbjct: 300 FPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASF-LLETTSWFMTIIIVFV 358

Query: 378 GGGFGIAWPFLTRVIVASAPDDEQTIASAAVPTMQRIGYAVGAALAGIVANASGFSQGL- 436
GG ++ ++ +S E + + + G A+ G + + Q L
Sbjct: 359 LGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQRLL 418

Query: 437 --NHDAAANVASWLFLAFVPLGILGCLAAL 464
D + + S L L F + ++ L L
Sbjct: 419 PMEVDQSTYLYSNLLLLFSGIIVISWLVTL 448


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
mll3126   HTHFIS  105    2e-28    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 105 bits (263), Expect = 2e-28
Identities = 38/127 (29%), Positives = 61/127 (48%)

Query: 7 RILVVDDEPPIRKLLRVGLGSQGYAISEAPNAKVAIELMEQEKPDLVLLDLGLPGMGGHE 66
ILV DD+ IR +L L GY + NA + DLV+ D+ +P +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 67 LLRKWRDEGLDIPVVILSSRTDEAGIVNALELGADDYVTKPFGMNELVARIRVALRHKFQ 126
LL + + D+PV+++S++ + A E GA DY+ KPF + EL+ I AL +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 127 QQGEKPV 133
+ +
Sbjct: 125 RPSKLED 131


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mll3127   PF06580  36     7e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 35.6 bits (82), Expect = 7e-04
Identities = 28/129 (21%), Positives = 53/129 (41%), Gaps = 26/129 (20%)

Query: 766 KIEVDIPPDLPMLKLDPVLFEQVLFNLLDNASKY----SPPGSTIRLQGWADNGSVIVQI 821
+ E I P + +++ P + Q L ++N K+ P G I L+G DNG+V +++
Sbjct: 241 QFENQINPAIMDVQV-PPMLVQTL---VENGIKHGIAQLPQGGKILLKGTKDNGTVTLEV 296

Query: 822 MDEGPGIPSHDLERVFDTFYRVRKGDQVRAGTGLGLSICRGFVESMGGTISAANRTDRPG 881
+ G + E TG GL R ++ + GT + +++ G
Sbjct: 297 ENTGSLALKNTKE-----------------STGTGLQNVRERLQMLYGTEAQIKLSEKQG 339

Query: 882 AA-FTIRMP 889
+ +P
Sbjct: 340 KVNAMVLIP 348


85  mll3372  mll3380  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mll3372   2       11       1.340964   short chain dehydrogenase
mlr3373   1       8        1.486602   transcriptional regulator, repressor protein
mll3374   1       9        1.694779   hypothetical protein
mlr3375   -1      10       1.190765   transcriptional regulator
mlr3376   -1      10       0.189870   transcriptional regulator
mlr3377   0       10       -1.342866  hypothetical protein
mll3378   1       14       -1.888003  sugar ABC transporter ATP-binding protein
mll3379   1       14       -1.559869  hypothetical protein
mll3380   1       13       -1.092771  sugar ABC transporter ATP-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll3372   DHBDHDRGNASE  37     5e-05    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.
Length = 261

Score = 37.3 bits (86), Expect = 5e-05
Identities = 26/96 (27%), Positives = 42/96 (43%), Gaps = 10/96 (10%)

Query: 3 LQGKVALVAGGTRGAGRGIAVELGAAGATVYVTGRSTRAQQSEYARPETIEETAELVTAN 62
++GK+A + G +G G +A L + GA + PE +E+ + A
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVD----------YNPEKLEKVVSSLKAE 55

Query: 63 GGSGIAVQADHLVADDVRGLIERIRKEQGRLDILVN 98
A AD + + + RI +E G +DILVN
Sbjct: 56 ARHAEAFPADVRDSAAIDEITARIEREMGPIDILVN 91


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mlr3373   HTHTETR  51     3e-10    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 50.8 bits (121), Expect = 3e-10
Identities = 17/87 (19%), Positives = 35/87 (40%)

Query: 10 KMTRPKTQPDEQVLEAALRLIHEHGPEALTFERLAKACGLSGATLVQRFGNKARLKQRTL 69
K + + + +L+ ALRL + G + + +AKA G++ + F +K+ L
Sbjct: 4 KTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIW 63

Query: 70 LHAWDGLDDKTRTLAAAVPKTPAGAIE 96
+ + + A P P +
Sbjct: 64 ELSESNIGELELEYQAKFPGDPLSVLR 90


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mll3374   TCRTETB  37     1e-04    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 37.2 bits (86), Expect = 1e-04
Identities = 29/166 (17%), Positives = 63/166 (37%), Gaps = 4/166 (2%)

Query: 35 LGVFGLVTAEFLPASLLTPMARDLGVTEGVAGQAVTATAIVGAIAAPTMAIITRRMD-RR 93
L F ++ L SL +A D TA + +I ++ ++ +R
Sbjct: 22 LSFFSVLNEMVLNVSLPD-IANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKR 80

Query: 94 LVMWMLTSFLILSNLLATFAASLPMLLLARVVLGVALGGFWAMSAAMALRLVPMRLMPRA 153
L+++ + S + + +L++AR + G F A+ + R +P +A
Sbjct: 81 LLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKA 140

Query: 154 MSIILTGVSLATVTAAPLGAYVGDI--WGWRTAFMIATIVGALALL 197
+I + V++ +G + W + + TI+ L+
Sbjct: 141 FGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLM 186


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mll3378   PF06580  32     0.002    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 31.8 bits (72), Expect = 0.002
Identities = 10/38 (26%), Positives = 19/38 (50%), Gaps = 1/38 (2%)

Query: 79 AKLREALR-AELKNLQMQLGATFLFVTHDQIEAMSMGD 115
K+ + A+L L+ Q+ F+F + I A+ + D
Sbjct: 152 WKMASMAQEAQLMALKAQINPHFMFNALNNIRALILED 189


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mll3380   PF05272  32     0.004    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 32.0 bits (72), Expect = 0.004
Identities = 17/82 (20%), Positives = 25/82 (30%), Gaps = 19/82 (23%)

Query: 31 VVCLLGPSGCGKTTTLRVIAGLESVTDGEVVIAGKVMNNLPPEKRDIAMVFQFYALYPSA 90
V L G G GK+T + + GL+ +D I +D Y
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHFDI---------GTGKDSYEQIAGIVAY--- 645

Query: 91 SVGENIAFPLYHDGISRAERTA 112
+ E RA+ A
Sbjct: 646 ELSE-------MTAFRRADAEA 660


86  mll3619  mll3627  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mll3619   3       16       0.194439   hypothetical protein
msr3620   2       12       0.243500   hypothetical protein
msr3621   0       11       0.341724   pseudo hydrolase
mll3623   -1      10       0.333580   hypothetical protein
mll3624   -1      13       0.883651   ABC-transporter ATP-binding protein system
mll3625   0       15       0.753328   ABC transporter permease
mll3626   1       12       0.282656   ABC transporter substrate-binding protein
mll3627   3       11       0.612935   dihydrolipoamide acetyltransferase homoserine
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mll3619   PF03544  41     3e-06    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 41.1 bits (96), Expect = 3e-06
Identities = 35/145 (24%), Positives = 50/145 (34%), Gaps = 14/145 (9%)

Query: 11 RNLMWGIPASLILH-VLVATLLVYGLPVAPQQPREEQPVNVAIVPPPDQPKP-------- 61
R W S+ +H +VA LL + + P QP++V +V P D P
Sbjct: 12 RRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELPAPAQPISVTMVAPADLEPPQAVQPPPE 71

Query: 62 ---KPVPPAPKPPEPKVEKPPEQKVEKQPPSEKQPKAPPVEVLKPVFQYGDKDTGPRKSL 118
+P P PEP E P + K P K VE K ++ P
Sbjct: 72 PVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKR--DVKPVESRPASPF 129

Query: 119 DGASAQDSSPSPAKDDDSKPPAVPK 143
+ + + S A SKP
Sbjct: 130 ENTAPARPTSSTATAATSKPVTSVA 154


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
mll3623   IGASERPTASE  54     8e-10    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 53.9 bits (129), Expect = 8e-10
Identities = 51/269 (18%), Positives = 84/269 (31%), Gaps = 50/269 (18%)

Query: 35 PRSEQQPDQQEQPVNVAIVPPPEKPKPKPAPKPPEPTPEKKAEKPPEQKPPPEPPKPPDD 94
P + Q N I E P P PAP P T E AE ++
Sbjct: 1000 PNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQE------------ 1047

Query: 95 HVLKRVFQYGQKDTGPEKSLDGNSAKPNTPSPAKDEAVKPPITPTPVPTRPAPVATPQQK 154
K+++ N + E K + T+ VA +
Sbjct: 1048 ----------------SKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSE 1091

Query: 155 AEPSKPDEK--PATIAPDGKP-AQGEEKQEAA--APDAKPAQNE------------EKQA 197
+ ++ E AT+ + K + E+ QE P Q + E
Sbjct: 1092 TKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDP 1151

Query: 198 TEDADKQQADKRELAP--QPAEKQAVVTPKPLAAETGDK--PAPPPSAEKAKPKPAK-TM 252
T + + Q+ A QPA++ + +P+ T + + E P + T+
Sbjct: 1152 TVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTV 1211

Query: 253 NFKSAKAFKAPSGNARRPSPTNDAAAAGS 281
N +S+ K + R P N A S
Sbjct: 1212 NSESSNKPKNRHRRSVRSVPHNVEPATTS 1240



Score = 43.5 bits (102), Expect = 2e-06
Identities = 42/261 (16%), Positives = 82/261 (31%), Gaps = 55/261 (21%)

Query: 42 DQQEQPVNVAIVPPPEKPKPKPAPKPPEPTPEKKAEKPPEQKPPPEPPKPPDDHVLKRVF 101
+++ Q V+ + P A P P+ ++ + E PP P P +
Sbjct: 986 EKRNQTVDTTNITTPNN---IQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAE 1042

Query: 102 QYGQKDTGPEKSLDGNSAKPNTPSPAKDEAVKPPITPTPVPTRPAPVATPQQKAEPSKPD 161
Q+ K+++ N + ++ A+ +K +
Sbjct: 1043 NSKQE----SKTVEKNEQDATETTAQN-----------------------REVAKEAKSN 1075

Query: 162 EKPATIAPDGKPAQGEEKQEAAAPDAKPAQNEEKQATEDADKQQADKRELAPQPAEKQAV 221
K T + + E K+ + + +E E +K + + + P + +
Sbjct: 1076 VKANTQTNEVAQSGSETKE------TQTTETKETATVEKEEKAKVETEKTQEVP-KVTSQ 1128

Query: 222 VTPKPLAAETGDKPAPPPSAEKAKPKPAKTMNFKSAKAFKAPSGNARRPSPTNDAAAAGS 281
V+PK +ET A P + T+N K + S TN A
Sbjct: 1129 VSPKQEQSETVQPQAEP------ARENDPTVNIKEPQ------------SQTNTTADTEQ 1170

Query: 282 PIYSGLPGVRKLYSQGATGNA 302
P V + ++ T N
Sbjct: 1171 PAKETSSNVEQPVTESTTVNT 1191


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
mll3624   PERTACTIN  29     0.030    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 29.3 bits (65), Expect = 0.030
Identities = 45/168 (26%), Positives = 60/168 (35%), Gaps = 18/168 (10%)

Query: 208 IAVMDHGRLAQLATPRELYHEPANEMVASFISQGILLPADVLTGEDGGHCKVRVLGTELV 267
+A MD + PA V G +P DG + V V + +
Sbjct: 243 VAAMDGAIVHLQRATIRRGDAPAGGAVPGGAVPGGAVPGGFGPLLDGWY-GVDVSDSTVD 301

Query: 268 VRCRAGEPPRAGAKICC-RSADLDVSTDGPGFDGLVKRVIYQGGGARIEFAPAAGPDLTL 326
+ E P+ GA I R A + VS G VI GGGAR PA+ +TL
Sbjct: 302 LAQSIVEAPQLGAAIRAGRGARVTVS--GGSLSAPHGNVIETGGGARRFPPPASPLSITL 359

Query: 327 --------------HFEQPDPLTLESGAQARLRIKSGWLIPAAVAVAG 360
+P LTL GAQ + I + L P A +G
Sbjct: 360 QAGARAQGRALLYRVLPEPVKLTLAGGAQGQGDIVATELPPIPGASSG 407


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
mll3627   RTXTOXIND  31     0.011    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 31.0 bits (70), Expect = 0.011
Identities = 11/43 (25%), Positives = 19/43 (44%)

Query: 14 MATGQISRWFAEEGARVKKGDVLFEIETDKAAMEIDAPASGVL 56
+ + +EG V+KGDVL ++ A + S +L
Sbjct: 102 IENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLL 144


87  mlr3639  mlr3659  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mlr3639   -1      13       1.629883   trehalose/maltose binding protein
mlr3640   -1      11       2.052020   ABC transporter permease
mlr3641   -2      10       2.083874   ABC transporter permease
mlr3643   -1      13       1.229109   hypothetical protein
mlr3644   0       14       0.642133   carbohydrate kinase
mlr3645   -1      14       0.166025   sugar ABC transporter ATP-binding protein
mll3646   -1      14       0.111062   hypothetical protein
mlr3647   0       15       0.429258   RND efflux membrane fusion protein
mlr3649   1       16       0.424100   RND efflux transporter
mll3651   -1      16       1.742200   hypothetical protein
msl3653   -1      14       1.749969   hypothetical protein
msr3652   0       12       2.127204   hypothetical protein
msr3654   0       13       1.316783   hypothetical protein
mll3656   0       13       1.125067   regulatory protein, VirG protein
mlr3657   0       15       0.628587   response regulator
mlr3659   0       15       0.282363   histidine protein kinase, FixL
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
mlr3639   MALTOSEBP  60     6e-12    Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 59.7 bits (144), Expect = 6e-12
Identities = 92/369 (24%), Positives = 159/369 (43%), Gaps = 57/369 (15%)

Query: 58 VKVNLEFVPYEGLHDKTVLAQGSGGGYDVVLFDVIWPAEYATNKVLVDVSSKITDDMKKG 117
+KV +E + L +K +G G D++ + YA + +L +++ +
Sbjct: 59 IKVTVEHP--DKLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEITPD--KAFQDK 114

Query: 118 VLPGAWTTVQYDGKYYGMPWILDTKYLFYNKDILEKAGIKTPPKTWEELGAQAKIIQDKG 177
+ P W V+Y+GK P ++ L YNKD+L PPKTWEE+ A K ++ KG
Sbjct: 115 LYPFTWDAVRYNGKLIAYPIAVEALSLIYNKDLL-----PNPPKTWEEIPALDKELKAKG 169

Query: 178 LLKTPIAWSWSQAEAAICDYTT--LVSAYGGDFLK--DGKPDFQ-----NGGGLSALKYM 228
K+ + ++ + Y T L++A GG K +GK D + N G + L ++
Sbjct: 170 --KSALMFNLQEP------YFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFL 221

Query: 229 VDSYKSGLTNPNSKEFLEEDVRKVFENGDAAFALNWTYMYNMANDPKDSKVAGKVGVVPA 288
VD K+ N ++ + E F G+ A +N + + ++ SKV GV
Sbjct: 222 VDLIKNKHMNADTDYSIAE---AAFNKGETAMTINGPWAW---SNIDTSKV--NYGVTVL 273

Query: 289 PGVTGISEVSAVNGSMGLGVTAVSKHPDEAWKYIE--YMTSQATQNQYAKLSLPIWASSY 346
P G V G + G+ A S + + A +++E +T + + A +
Sbjct: 274 PTFKGQPSKPFV-GVLSAGINAASPNKELAKEFLENYLLTDEGLE-----------AVNK 321

Query: 347 DDP----AVTKGQEEL-----IAAAKLGLAAMYPRPTTPKYQELSTALQQAIQESLLGQS 397
D P A+ +EEL IAA P P+ A++ A+ + G+
Sbjct: 322 DKPLGAVALKSYEEELAKDPRIAATMENAQKGEIMPNIPQMSAFWYAVRTAVINAASGRQ 381

Query: 398 SPEDALKTA 406
+ ++ALK A
Sbjct: 382 TVDEALKDA 390


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mlr3645   PF05272  31     0.009    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.8 bits (69), Expect = 0.009
Identities = 16/56 (28%), Positives = 22/56 (39%), Gaps = 9/56 (16%)

Query: 32 LVLLGSSGCGKSTLLNIIAGLAEATSGDVLIGGRSILGVHPKNRDIAMVFQSYALY 87
+VL G+ G GKSTL+N + GL + IG +D Y
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIG---------TGKDSYEQIAGIVAY 645


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mll3646   HTHTETR  57     2e-12    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 56.9 bits (137), Expect = 2e-12
Identities = 20/86 (23%), Positives = 38/86 (44%), Gaps = 1/86 (1%)

Query: 1 MRAQRPNSREKILAAAADVARESGPGSLSLDAVASRAGVSKGGLLYNFPTKAKLMQGLVE 60
+ + +R+ IL A + + G S SL +A AGV++G + ++F K+ L + E
Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64

Query: 61 GYLRDFEQALETAGSNDDGSNPLAVY 86
+ + +PL+V
Sbjct: 65 LSESNIGELEL-EYQAKFPGDPLSVL 89


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
mlr3647   RTXTOXIND  43     2e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 42.9 bits (101), Expect = 2e-06
Identities = 24/125 (19%), Positives = 41/125 (32%), Gaps = 9/125 (7%)

Query: 75 SSWTP---GVEAIGTVRAVRGVDLTVE--TAGIVKEIPFHANQKVAANAVLLQLDDAVER 129
S A G + G ++ IVKEI + V VLL+L
Sbjct: 75 SVLGQVEIVATANGKL-THSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAE 133

Query: 130 ADLDAQKAQ---AALDQVSLTRAIELTRRGVGSDSTLDTARAAASASASQVTKLQAVLDQ 186
AD ++ A L+Q + L + S +V +L +++ +
Sbjct: 134 ADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKE 193

Query: 187 KQLTA 191
+ T
Sbjct: 194 QFSTW 198



Score = 34.4 bits (79), Expect = 8e-04
Identities = 27/160 (16%), Positives = 56/160 (35%), Gaps = 10/160 (6%)

Query: 110 ANQKVAANAVLLQLDDAVE-RADLDAQKAQAALDQVSLTRAIELTRRGVGSD------ST 162
Q +A +AVL Q + VE +L K+Q + + A + + V
Sbjct: 245 HKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSA-KEEYQLVTQLFKNEILDK 303

Query: 163 LDTARAAASASASQVTKLQAVLDQKQLTAPFAGTVGIPKI-DIGQYMAPGTAVVTL-QDL 220
L ++ K + + AP + V K+ G + ++ + +
Sbjct: 304 LRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPED 363

Query: 221 DTMRVDFSIPEQQLPLLKIGQTVRLGLSGADMPFAGEIRG 260
DT+ V + + + + +GQ + + G + G
Sbjct: 364 DTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVG 403


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr3649   ACRIFLAVINRP  831    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 831 bits (2147), Expect = 0.0
Identities = 339/1032 (32%), Positives = 551/1032 (53%), Gaps = 29/1032 (2%)

Query: 4 SDLFIRRPVLSTVLGCLILLLGFQGIFNLSIRQYPKVDETAITITTAYPGASADLIQGFI 63
++ FIRRP+ + VL ++++ G I L + QYP + A++++ YPGA A +Q +
Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61

Query: 64 SAPIARAVASTENIDYVTSSS-RPSSSTVTVQMKLGSNPDVALTEVLSKVQGVRGTLPDA 122
+ I + + +N+ Y++S+S S T+T+ + G++PD+A +V +K+Q LP
Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121

Query: 123 SKDPVIVKGTGQQFAMMYISM--QNPNMTKEQLTEYIERVIRPRMSTVEGVADVQIFGAQ 180
+ I +M NP T++ +++Y+ ++ +S + GV DVQ+FGAQ
Sbjct: 122 VQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQ 181

Query: 181 EYSMRVWIDPIRLAARGVTAAEVLTAINNSNFLSAPGNTQNEYVVS------SISVRSTL 234
Y+MR+W+D L +T +V+ + N A G + SI ++
Sbjct: 182 -YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 235 QTPEAFAELPLR-STDGNVVRLRDVARVELGAANTDTRVSFNGKPGTFLAIFPTPAANPL 293
+ PE F ++ LR ++DG+VVRL+DVARVELG N + NGKP L I AN L
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 294 TTAAALTKLVPQIQETLPKGMTIEVVYDATGQISASIEEVFKTIGEAVAIVVVVILLFLG 353
TA A+ + ++Q P+GM + YD T + SI EV KT+ EA+ +V +V+ LFL
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 354 SFRSVMMPIITIPLSLIGVCFLLFAVGYSINLLSLLAMVLAIGLVVDDAIVVVENIHRHM 413
+ R+ ++P I +P+ L+G +L A GYSIN L++ MVLAIGL+VDDAIVVVEN+ R M
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 414 EEDHMTPMQAAFSGMREIASAIVAMTMTLAAVFAPLAFTGGLTGALFREFAVTLAGSVVL 473
ED + P +A M +I A+V + M L+AVF P+AF GG TGA++R+F++T+ ++ L
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 474 SGVIAVTITPMMSARLLKAGT------PGRFQRIVDGIFARVEHVYERAVTGSLNYRPLT 527
S ++A+ +TP + A LLK + G F + F + Y +V L
Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540

Query: 528 LIIVLALVGVTGFMFTKTSSELAPEEDQGFLLSLVTAPTYATSDYTETYVNQMLGLV--- 584
L+I +V +F + S PEEDQG L+++ P AT + T+ ++Q+
Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600

Query: 585 -RDIPETRAQFSAVAFGGTTNSAFVGF-AFKDWAERKRNSKELQADI---TARLAKVAGV 639
+ E+ + +F G +A + F + K W ER + +A I L K+
Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660

Query: 640 QAFVFAPPT--LPGSGGGLPIALVVRSTGDSAEVYKAAEQIKNKA-QASGRFIVVQNSMS 696
F P G+ G L+ ++ + +A Q+ A Q + V+ +
Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720

Query: 697 YDSPQVTVTIDRDRAAALNLPIADIGRTLTLLVGGAEVAQFDRDSNSYDIIPQVPQQFRD 756
D+ Q + +D+++A AL + ++DI +T++ +GG V F + Q +FR
Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780

Query: 757 NPERLGEYFVRSVTGEMVPLSAVVNISNNASPAAIEQFNQLNSSTISALPLPGVTTGDGL 816
PE + + +VRS GEMVP SA +E++N L S I PG ++GD +
Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840

Query: 817 KVLEDIARESLPDTFFIDYSGQSRQEKEQGNTILIAFAAAVIVIYLVLAAQFESFRDPLI 876
++E++A + LP D++G S QE+ GN A + +V++L LAA +ES+ P+
Sbjct: 841 ALMENLASK-LPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVS 899

Query: 877 IMMAVPLSIFGAIVPLNLGLGTLNIYTQVGLITLIGLITKHGILLVEFANQQREAHGMRR 936
+M+ VPL I G ++ L ++Y VGL+T IGL K+ IL+VEFA E G
Sbjct: 900 VMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGV 959

Query: 937 RDAIIASAKVRLRPILMTTAAMALGVVPLITSSGAGAAARYSMGLVIFTGILVGTMFTLF 996
+A + + ++RLRPILMT+ A LGV+PL S+GAG+ A+ ++G+ + G++ T+ +F
Sbjct: 960 VEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIF 1019

Query: 997 VVPMFYTFIASK 1008
VP+F+ I
Sbjct: 1020 FVPVFFVVIRRC 1031


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
mll3656   HTHFIS  48     1e-08    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 47.9 bits (114), Expect = 1e-08
Identities = 32/131 (24%), Positives = 51/131 (38%), Gaps = 4/131 (3%)

Query: 4 RAVIALVSVADVVATELADHLERRGHDVRQARQPWEAESLLAGGGIDVVVVGDSLSQAEG 63
A I + + T L L R G+DVR +A G D+VV +
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 64 RDLLRRYGGQGEGGRDGPDFILICRPSDLVDKVLALELGAADVVESPLNVRELAARVGGL 123
DLL R + R +++ + + + A E GA D + P ++ EL +G
Sbjct: 63 FDLLPRI----KKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 124 LSRRGRGTQEL 134
L+ R +L
Sbjct: 119 LAEPKRRPSKL 129


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
mlr3657   HTHFIS  89     2e-22    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 88.7 bits (220), Expect = 2e-22
Identities = 36/133 (27%), Positives = 62/133 (46%), Gaps = 2/133 (1%)

Query: 2 RARIVIVEDEPDLRDAVAEYLGAAGYDVATAETAAAARSLIETQAFHLAILDIAMPGEDG 61
A I++ +D+ +R + + L AGYDV AA I L + D+ MP E+
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 62 LSLGRWLRSRMP-IGIIYATAAGTALDRIVGLELGADDYIVKPYELREVLARVRSVL-RR 119
L ++ P + ++ +A T + I E GA DY+ KP++L E++ + L
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 120 VPQPTELLDRKTR 132
+P++L D
Sbjct: 123 KRRPSKLEDDSQD 135


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
mlr3659   HTHFIS  54     9e-10    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 54.5 bits (131), Expect = 9e-10
Identities = 25/113 (22%), Positives = 42/113 (37%), Gaps = 5/113 (4%)

Query: 560 HALIIDDEPDVAGSLSDILELMGIKSRIAPVWESGAATLSGHIPPDIVFSDLRMPGTSGM 619
L+ DD+ + L+ L G RI + ++ D+V +D+ MP +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAG-DGDLVVTDVVMPDENAF 63

Query: 620 AIYRELLAERPELARRFVLVTGDLIGAKAEIEALPAQQRPQILEKPFSTLDVR 672
+ + RP+L VLV I+A L KPF ++
Sbjct: 64 DLLPRIKKARPDLP---VLVMSAQNTFMTAIKASEKGAY-DYLPKPFDLTELI 112


88  mlr3825  mlr3838  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias   Product
mlr3825   0       15       1.966844  transmembrane transport protein
mll3827   0       14       1.620068  transcriptional regulator
mlr3829   -1      13       1.288154  hypothetical protein
msl3831   0       13       1.321028  hypothetical protein
mll3832   0       13       1.214907  multidrug resistance protein
mll3833   0       13       1.400610  hypothetical protein
mll3835   1       11       1.555906  hypothetical protein
mll3836   1       10       2.627482  aldehyde dehydrogenase
mll3837   3       11       3.490133  aldehyde dehydrogenase
mlr3838   3       12       3.342071  transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mlr3825   TCRTETA  30     0.013    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 30.2 bits (68), Expect = 0.013
Identities = 64/337 (18%), Positives = 114/337 (33%), Gaps = 30/337 (8%)

Query: 33 PQIPVFLTRLDIS---KFTLGLLILLFGAGAVTAMTWCGHLISKHGSRTVLRWFGLCGSL 89
P +P L L S G+L+ L+ G L + G R VL ++
Sbjct: 26 PVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAV 85

Query: 90 GLLVVALAPNLPLAAIAMFIFGGSIGGMDVAMNANAVVV---ERKMSRAIMSSSHGFWSL 146
++A AP L + I + G + VA A + ER MS+ GF
Sbjct: 86 DYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGF--- 142

Query: 147 GGFAGGGLGGFAIQHYGHLAHAAVVTVLAFAAIAMAVGYLV------AEDKP------QA 194
G AG LGG HA A + G + E +P
Sbjct: 143 GMVAGPVLGGLMGGFS---PHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNP 199

Query: 195 AEHHKFALPANPLVYLIGLMALLTMVSEGAVLDWAALYLRQELGADLAIAGLAYAAFSGV 254
++A + L+ + ++ +V + W ++ D G++ AAF +
Sbjct: 200 LASFRWARGMTVVAALMAVFFIMQLVGQVPAALWV-IFGEDRFHWDATTIGISLAAFGIL 258

Query: 255 MAIMR-FFGDGVRNRFGAVTTLRGSAVVAATG---MLVAGLSPSPWFAIAAFATCGFGIA 310
++ + V R G L + TG + A + + A+ G G+
Sbjct: 259 HSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGMP 318

Query: 311 NMVPII-FSAGGNQEGMSSGTGMSVVTTMGYSGILVA 346
+ ++ ++G G+ ++ + G L+
Sbjct: 319 ALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLF 355


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
msl3831   ENTSNTHTASED  23     0.042    Enterobactin synthetase component D signature.
		>ENTSNTHTASED#Enterobactin synthetase component D signature.

Length = 234

Score = 23.5 bits (50), Expect = 0.042
Identities = 7/17 (41%), Positives = 10/17 (58%)

Query: 11 LILVLVGAVPAWPHSRS 27
++ LV A+ PH RS
Sbjct: 207 SVITLVSAITRVPHDRS 223


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll3832   ACRIFLAVINRP  507    e-165    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 507 bits (1306), Expect = e-165
Identities = 230/1056 (21%), Positives = 441/1056 (41%), Gaps = 71/1056 (6%)

Query: 3 IVRLAINNARLTISVLVFLLIAGWVAYQSTPKEAEPDVPIPMMYVSLIYQGISPEDSERL 62
+ I + + L++AG +A P P + P + VS Y G + +
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 63 LLRPMESKLKSLKGLKEMRSAAFQGG-GYVLVEFQPQTNLATALQDTRSKVQDGKADLPQ 121
+ + +E + + L M S + G + + FQ T+ A ++K+Q LPQ
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 122 AAEEPVVTEVNISEFPVLVVTL---SGELPERVLAA-AARELRDRIEEVPGVLEGSLQGS 177
++ ++ S ++V + + ++ A ++D + + GV + L G+
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 178 RDDLVEVVIDPMKLSSYGLQLDQLIGAVGASNSLVAAGNIEGSQGK------YAVKVPSL 231
+ + + +D L+ Y L +I + N +AAG + G+ ++ +
Sbjct: 181 QY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 232 IETPEDVAALPVVAGPN-AVVQAKDIATIRSTFADATTITRLNGKPAIAIEVKKRIGANL 290
+ PE+ + + + +VV+ KD+A + + I R+NGKPA + +K GAN
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 291 IDTLTKVRAVSDAFVKTMPEGMHVTYTQDKSVFVNQLLGDLQNHVMIAVILVFIVILYAL 350
+DT ++A P+GM V Y D + FV + ++ + A++LVF+V+ L
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 351 -SGRASLLIGLAIPSSFLIGILLLAMMGYTINMIVLFSLILAVGMLVDDAIIVTEFAERR 409
+ RA+L+ +A+P L +LA GY+IN + +F ++LA+G+LVDDAI+V E ER
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 410 M-SEGMPKQEAFALAAKRMAGPVIAATMTRIAAFSPLLFWPGIIGDFMKYMPITLIVTLS 468
M + +P +EA + ++ G ++ M A F P+ F+ G G + IT++ ++
Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479

Query: 469 ASMLYALVFAPTLGAIFAK--APQHHEDGNR-DGW-----------YMAVVKQAVRFPIT 514
S+L AL+ P L A K + +HHE+ GW Y V + +
Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 515 VMVLTVMLLVGIFVGYSKYGAGVEFFPSVEPDYGLLYVHARGNLSLAEMDTATKIAENRL 574
+++ +++ G+ V + + F P + L + + +
Sbjct: 540 YLLIYALIVAGMVVLFLR--LPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYY 597

Query: 575 LGW--PGLKSVYTRVGKSQGGGQDVPEDVVGVIQYEFIDWRERKSANQ----ILNDLRGV 628
L ++SV+T G S G G+ W ER +++ +
Sbjct: 598 LKNEKANVESVFTVNGFSFSG----QAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKME 653

Query: 629 MAGIPGVDVEV----RVPEAGPPTGKPIQ-IRLSAIDPKGLDEKARAVAARIAKVPG-VI 682
+ I V + E G TG + I + + L + + A+ P ++
Sbjct: 654 LGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLV 713

Query: 683 DISDGLPPPGVDWALEVDRAKAAQYGISPTSVGTVVQLVTNGLKLSEYRPAGADKAVDIR 742
+ + LEVD+ KA G+S + + + G ++++ G + +
Sbjct: 714 SVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRG--RVKKLY 771

Query: 743 LRLPEDRR-TLSTLDELRVQTSQG-SVPISNFVVRKAKPSVGILNRIDGARTVVVQANVT 800
++ R +D+L V+++ G VP S F L R +G ++ +Q
Sbjct: 772 VQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAA 831

Query: 801 AGTQVATVQQEVTQAVADMNLGSGIRWKLAGSNEDSAEASAFLSKAFGAAIFLIFLVLLA 860
GT + + + G G W E + A + ++FL L A
Sbjct: 832 PGTSSGDAMALMENLASKLPAGIGYDWTGMSYQERLSGNQAPA--LVAISFVVVFLCLAA 889

Query: 861 QFNKFTSVWLVLSCVVMATIGVFLGLLITGETFGIVMSGIGVIALAGVVVNNNIVLID-T 919
+ ++ V+ V + +GV L + + V +G++ G+ N I++++
Sbjct: 890 LYESWSIPVSVMLVVPLGIVGVLLAATLFNQKND-VYFMVGLLTTIGLSAKNAILIVEFA 948

Query: 920 YDRLREEGWDKMDAVLQTCRERARPVVLTAVSAILGVLPIAF--GLGLEIFHHETTINAP 977
D + +EG ++A L R R RP+++T+++ ILGVLP+A G G +
Sbjct: 949 KDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQN-------- 1000

Query: 978 STQWWISLSSAIVFGLSFATVLTLVVTPSMLMVFTR 1013
++ ++ G+ AT+L + P +V R
Sbjct: 1001 ------AVGIGVMGGMVSATLLAIFFVPVFFVVIRR 1030


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
mll3833   RTXTOXIND  52     2e-09    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 51.8 bits (124), Expect = 2e-09
Identities = 28/151 (18%), Positives = 60/151 (39%), Gaps = 21/151 (13%)

Query: 142 QLDTARSNLTLAQSQLDTAQAELDRNEVKAPFDGVIDRVPVELGSSVMQGGEVATILKL- 200
+L N+ L +L + + ++AP + ++ V V+ E T++ +
Sbjct: 303 KLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE--TLMVIV 360

Query: 201 ---DPVIARGEISERDLGYLKIGDKAGVRLVS-GQT----VEGTVRYISRDASSATRT-- 250
D + + +D+G++ +G A +++ + T + G V+ I+ DA R
Sbjct: 361 PEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGL 420

Query: 251 -FRVEVAIPNADGSVP-------AGMTAEIQ 273
F V ++I S +GM +
Sbjct: 421 VFNVIISIEENCLSTGNKNIPLSSGMAVTAE 451



Score = 34.4 bits (79), Expect = 6e-04
Identities = 23/119 (19%), Positives = 47/119 (39%), Gaps = 14/119 (11%)

Query: 66 LTEADKRAVLATRVAGVIDKLPVKQGDHVKTGDLVLML-----AAEEKISMVD--NAKQL 118
LT + + + ++ ++ VK+G+ V+ GD++L L A+ + A+
Sbjct: 90 LTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLE 149

Query: 119 VAQRQAELDA----SLRLMKTGNLPKLQLDTARSNLT---LAQSQLDTAQAELDRNEVK 170
+ Q + L +K + P Q + L L + Q T Q + + E+
Sbjct: 150 QTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELN 208


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mlr3838   HTHTETR  62     6e-14    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 61.6 bits (149), Expect = 6e-14
Identities = 34/194 (17%), Positives = 65/194 (33%), Gaps = 10/194 (5%)

Query: 10 ADPKRVRILDGAMKVFLAYGFSRTTMDDIARAADMSRPALYLQFKNKTDIFRAIAMMVLS 69
A R ILD A+++F G S T++ +IA+AA ++R A+Y FK+K+D+F I + S
Sbjct: 9 AQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSES 68

Query: 70 RSVAAAKMALAGDGAFTERMMRAIDEAFI-------SMMSAVVASPHGAELLDMKSSLGD 122
A ++R I + + H E + + +
Sbjct: 69 NIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQ 128

Query: 123 LVGCWRGALVEHIAAAIHGEAARNGVDLAARGLSAQLLADMLLDGLEGMKARISDPEGQR 182
+ I + + L + A ++ + G+
Sbjct: 129 AQRNLCLESYDRIEQTLKHCIEAKMLPA---DLMTRRAAIIMRGYISGLMENWLFAPQSF 185

Query: 183 QAAGALVKVIDLAL 196
+ + L
Sbjct: 186 DLKKEARDYVAILL 199


89  mll3877  mll3889  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mll3877   -1      12       -0.424706  hypothetical protein
mll3878   -1      10       1.108897   hypothetical protein
mll3879   0       10       1.487392   phosphoglucosamine mutase
mlr3880   1       11       1.768703   hypothetical protein
mlr3881   -1      13       1.444536   hypothetical protein
mll3882   0       13       1.962423   metalloprotease (cell division protein) FtsH
mll3884   0       13       2.006328   hypothetical protein
mll3886   0       16       1.131947   hypothetical protein
mll3887   0       15       0.681945   peptidoglycan-associated lipoprotein
mll3888   0       14       0.469016   translocation protein TolB
mll3889   0       12       0.241398   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
mll3877   OMPADOMAIN  29     0.023    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 28.7 bits (64), Expect = 0.023
Identities = 33/215 (15%), Positives = 62/215 (28%), Gaps = 53/215 (24%)

Query: 6 RIVLALAGLGLLPLTPAVAADYDPPIYADQAPDYVPVEVGSGWYLRGDVSYLAQKSFKND 65
+ +A+A T A AA D Y GW D ++ ++
Sbjct: 3 KTAIAIAVALAGFATVAQAAPKDNTWYTG---------AKLGWSQYHDTGFINNNGPTHE 53

Query: 66 DFAFTPASFDEKEDPIFASIGFGYHFNDYLRADLNLGYLPGNKIGIGYDDSLSVVPPATS 125
+ +F GY N Y+ ++ GYD + P
Sbjct: 54 N-QLGAGAF------------GGYQVNPYVGFEM------------GYDWLGRM--PYKG 86

Query: 126 TVASADLKNYAYSLMLNAYVDLGTYVGITPYLGGGVGIVQSTRRLSANYFTNNGDPTDDF 185
+V + K L + + Y G + ++ D +
Sbjct: 87 SVENGAYKAQGVQLTAKLGYPITD--DLDIYTRLGGMVWRA-------------DTKSNV 131

Query: 186 VQTDDKTKYSLAYTLNAGLAYQVSKNVSVDLGYQY 220
+ T S + G+ Y ++ ++ L YQ+
Sbjct: 132 YGKNHDTGVSPVFA--GGVEYAITPEIATRLEYQW 164



Score = 28.0 bits (62), Expect = 0.035
Identities = 19/102 (18%), Positives = 32/102 (31%), Gaps = 26/102 (25%)

Query: 154 TPYLGGGVGIVQSTRRLSANYFTNNGDPTDDFVQTDDKTKYSLAYTLNAGLAYQVSKNVS 213
T Y G +G Q + NNG ++ A YQV+ V
Sbjct: 27 TWYTGAKLGWSQYH---DTGFINNNGPTHEN------------QLGAGAFGGYQVNPYVG 71

Query: 214 VDLGYQYF-----------SAPDAEYVTAASLTSFPVHKGIS 244
++GY + A A+ V + +P+ +
Sbjct: 72 FEMGYDWLGRMPYKGSVENGAYKAQGVQLTAKLGYPITDDLD 113


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
mll3878   OMPADOMAIN  38     2e-05    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 38.4 bits (89), Expect = 2e-05
Identities = 45/201 (22%), Positives = 72/201 (35%), Gaps = 35/201 (17%)

Query: 81 GGKSFDYGKLKGGF------SLGGGV--GYKINDRLRADVTADYWFKSNFNGGTSDLVST 132
G + LG G GY++N + ++ D+ + + G +
Sbjct: 35 GWSQYHDTGFINNNGPTHENQLGAGAFGGYQVNPYVGFEMGYDWLGRMPYKGSVENGAYK 94

Query: 133 STEVSKMSALLLLANAYVDIGTWHGITPYVGAGIGGARVKWDTVYDPNTAETNPGASNWR 192
+ V L A I + Y +GG + DT + + G S
Sbjct: 95 AQGVQ------LTAKLGYPITD--DLDIY--TRLGGMVWRADTKSNVYGKNHDTGVS--- 141

Query: 193 FAYALMAGASYCLTDKIILDAGYRFSHIQGGRMFEWDASSAGPGFDRGINTHEVRGGLRY 252
G Y +T +I Y++++ G DA + G D G+ + G+ Y
Sbjct: 142 --PVFAGGVEYAITPEIATRLEYQWTNNIG------DAHTIGTRPDNGM----LSLGVSY 189

Query: 253 QFGGNNGCAAPVVAYQPEPEP 273
+FG G AAPVVA P P P
Sbjct: 190 RFGQ--GEAAPVVAPAPAPAP 208


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
mll3882   HTHFIS  31     0.015    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 31.0 bits (70), Expect = 0.015
Identities = 22/82 (26%), Positives = 32/82 (39%), Gaps = 18/82 (21%)

Query: 193 VLLVGPPGTGKTLLARSV---AGEANVPFFT-----ISGSDFVEMFVGV------GASRV 238
+++ G GTGK L+AR++ N PF I G GA
Sbjct: 163 LMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTR 222

Query: 239 RD-MFDQAKKNAPCIIFIDEID 259
F+QA+ +F+DEI
Sbjct: 223 STGRFEQAEGGT---LFLDEIG 241


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll3886   SYCDCHAPRONE  38     2e-05    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 37.6 bits (87), Expect = 2e-05
Identities = 28/136 (20%), Positives = 52/136 (38%), Gaps = 15/136 (11%)

Query: 242 STNDPEELYRNSYQFILSGDYGTAEQGFRDHISRFPRDAKAADAHYWLGESLLGQQ--KY 299
S++ E+LY ++ SG Y A + F+ D++ ++LG Q +Y
Sbjct: 32 SSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSR-----FFLGLGACRQAMGQY 86

Query: 300 RDAAEVFLAASKDYPKAKKAP----DMLLKLGVSLVGLKQHDVA---CATFSEVGKRYPD 352
A + + K + P + LL+ G +A A +E K
Sbjct: 87 DLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTEF-KELST 145

Query: 353 ISSALKERVKQEKALA 368
S++ E +K +K +
Sbjct: 146 RVSSMLEAIKLKKEME 161


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
mll3887   OMPADOMAIN  120    1e-35    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 120 bits (302), Expect = 1e-35
Identities = 33/137 (24%), Positives = 60/137 (43%), Gaps = 11/137 (8%)

Query: 40 NGAGAATPGSAQDFTVNIGDRIFFDTDSSSIRADAQTTLARQAQWLNQY--KQYAIVVEG 97
A Q + + F+ + ++++ + Q L + L+ K ++VV G
Sbjct: 200 VAPAPAPAPEVQTKHFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLG 259

Query: 98 HADERGTREYNLALGARRAAAARDFLVSKGVASSRLKTISYGKERPVA--VCDD------ 149
+ D G+ YN L RRA + D+L+SKG+ + ++ G+ PV CD+
Sbjct: 260 YTDRIGSDAYNQGLSERRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAA 319

Query: 150 -ISCWSQNRRAVTTLSG 165
I C + +RR + G
Sbjct: 320 LIDCLAPDRRVEIEVKG 336


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
mll3889   IGASERPTASE  51     4e-09    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 51.2 bits (122), Expect = 4e-09
Identities = 46/228 (20%), Positives = 77/228 (33%), Gaps = 19/228 (8%)

Query: 94 EAKPKPVDMTSAPPPAPT----PKETPKTEDVPKPQEKP--KPIPATEVAPAPTPKEEVK 147
E + + VD T+ P P E++ + E P P PAT T E K
Sbjct: 986 EKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSK 1045

Query: 148 PEPVKQTEPKPTPAKPAPTPPPQDKTAAIDPTPEVKPDAVAEAIAK-DPPAEETQLPSSA 206
++++ + A Q++ A + VK + +A+ +ETQ
Sbjct: 1046 ----QESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQ----- 1096

Query: 207 PAPEARPKPQPAQAESAKAPERKDAEKPVKEASSKPKSDDKQFNANEISALLDKQKPSGG 266
E + + E AK K E P + PK +Q + A ++
Sbjct: 1097 -TTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQ--EQSETVQPQAEPARENDPTV 1153

Query: 267 GAKRSTQQASLGGDKDQGQKLSKSEQGALESQLGGCWTLPVGLEGSEN 314
K Q + D +Q K + S ++ T +E EN
Sbjct: 1154 NIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPEN 1201



Score = 46.6 bits (110), Expect = 1e-07
Identities = 41/240 (17%), Positives = 83/240 (34%), Gaps = 18/240 (7%)

Query: 63 PAP-LPTQRPDIVPASQKVGENSVDTDKPITPEAKPKPVDMTSAPPPAPTPKETPKTEDV 121
PAP P++ + V + K +V+ ++ E + A K +T +V
Sbjct: 1028 PAPATPSETTETVAENSKQESKTVEKNEQDATE--TTAQNREVAKEAKSNVKANTQTNEV 1085

Query: 122 PKPQEKPKPIPATEVAPAPTPKEEVKPEPVKQTEPKPTPAKPAPTPPPQDKTAAIDPTPE 181
+ + K TE T ++E K + V+ + + P + P Q+++ + P E
Sbjct: 1086 AQSGSETKETQTTETKETATVEKEEKAK-VETEKTQEVPKVTSQVSPKQEQSETVQPQAE 1144

Query: 182 VKPDAVAEAIAKDPPAEE-TQLPSSAPAPEARPKPQPAQAESA---------KAPERKDA 231
+ K+P ++ T + PA E + ES + PE
Sbjct: 1145 PARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTP 1204

Query: 232 EKPVKEASSKPKSDDKQFNANEISALLDKQKPSGGGAKRSTQQASLGGDKDQGQKLSKSE 291
+S+ + K + + ++ +P A S+ S D + +
Sbjct: 1205 ATTQPTVNSESSNKPKNRHRRSVRSVPHNVEP----ATTSSNDRSTVALCDLTSTNTNAV 1260



Score = 36.2 bits (83), Expect = 3e-04
Identities = 26/191 (13%), Positives = 54/191 (28%), Gaps = 22/191 (11%)

Query: 44 MEAIAQTLQGDKKAVMHEKPAPLPTQRPDIVPASQKVGENSVDTDK--PITPEAKPKPVD 101
++A QT + + ++ T+ V +K + T + +T + PK
Sbjct: 1076 VKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPK--- 1132

Query: 102 MTSAPPPAPTPKETPKTEDVPKPQEKPKPIPATEVAPAPTPKEEVKPEPVKQTEPKPTPA 161
P+ P E+ P T P + + +P +
Sbjct: 1133 --QEQSETVQPQAEPARENDP-----------TVNIKEPQSQTNTTAD---TEQPAKETS 1176

Query: 162 KPAPTPPPQDKTAAIDPTPEVKPDAVAEAIAKDPPAEETQ-LPSSAPAPEARPKPQPAQA 220
P + T + P+ A + E+ P + R P +
Sbjct: 1177 SNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEP 1236

Query: 221 ESAKAPERKDA 231
+ + +R
Sbjct: 1237 ATTSSNDRSTV 1247


90  mlr4110  mll4128  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mlr4110   0       10       -2.026563  acid phosphatase
mlr4111   -1      18       0.263278   hypothetical protein
mll4112   -2      17       -0.012755  GTP-binding protein TypA
msr4113   -1      14       -0.305874  hypothetical protein
mlr4114   -1      14       -0.184339  hypothetical protein
mll4115   -1      17       0.215685   secreted alkaline phosphatase
mll4118   1       16       0.905800   transmembrane transport protein
mlr4117   -1      14       0.554462   transcriptional regulatory protein
mlr4119   -2      15       1.086850   hypothetical protein
mll4120   -2      16       1.306341   inorganic pyrophosphatase
mll4122   -2      15       2.167317   hypothetical protein
mll4125   -1      14       2.955349   hypothetical protein
mll4127   -2      14       3.171609   hypothetical protein
mll4128   0       13       2.947337   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr4110   TYPE3OMGPROT  28     0.049    Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature.

Length = 607

Score = 27.9 bits (62), Expect = 0.049
Identities = 16/82 (19%), Positives = 26/82 (31%), Gaps = 9/82 (10%)

Query: 124 MHSFAAPNLAARLRAHGKTFAGYVEARS-PRKHNPWESFADAKGFEKPLAQFPRDYAKLP 182
+HSF L L + Y A+ P+ A + L D+
Sbjct: 5 LHSFFKRVLTGTLLL----LSSYSWAQELDWLPIPYVYVAK----GESLRDLLTDFGANY 56

Query: 183 SVSFVIPNLENDMHDGTIEAAD 204
+ V+ + ND G E +
Sbjct: 57 DATVVVSDKINDKVSGQFEHDN 78


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
mll4112   TCRTETOQM  173    2e-48    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 173 bits (439), Expect = 2e-48
Identities = 107/449 (23%), Positives = 183/449 (40%), Gaps = 92/449 (20%)

Query: 1 MKLRNIAIIAHVDHGKTTLVDQLLKQSGSFRDNQRVAE--RAMDSNDLEKERGITILAKA 58
MK+ NI ++AHVD GKTTL + LL SG+ + V + D+ LE++RGITI
Sbjct: 1 MKIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGI 60

Query: 59 TSVDWKDTRINIVDTPGHADFGGEVERILSMVDSAIVLVDAAEGPMPQTKFVVGKALKVG 118
TS W++T++NI+DTPGH DF EV R LS++D AI+L+ A +G QT+ + K+G
Sbjct: 61 TSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMG 120

Query: 119 LKPIVVINKIDR-------------------------------------PDARHVEVVNE 141
+ I INKID+ ++ + V E
Sbjct: 121 IPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIE 180

Query: 142 VFDLFAALDATDEQLD-----------------FPILYGSGRDGWVSENPEGPKDQQLAP 184
D + + L+ FP+ +GS ++ +
Sbjct: 181 GNDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNN-----------IGIDN 229

Query: 185 LFDLIIKHVPAPTVHPGPFRMIGTI--LEANPFLGRIITGRIESGTLKANQAVKVLHHDG 242
L ++I + T H G + G + +E + R+ R+ SG L +V++
Sbjct: 230 LIEVITNKFYSST-HRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEK-- 286

Query: 243 TQIETGRISKILAFRGLERQPIEEAQAGDIVAIAGLS---KGTVADTFCDLAVTEALHAQ 299
E +I+++ E I++A +G+IV + + DT +
Sbjct: 287 ---EKIKITEMYTSINGELCKIDKAYSGEIVILQNEFLKLNSVLGDTKLLPQRERIENPL 343

Query: 300 PIDPPTVTMSFLVNDSPLAGTEGDKVTSRVIRDRLLREAEGNVALKIEESPDKDSFFVSG 359
P+ TV P + + + D LL ++ + L+ +S
Sbjct: 344 PLLQTTV--------EPSKPQQREML-----LDALLEISDSDPLLRYYVDSATHEIILSF 390

Query: 360 RGELQLAVLIETMRRE-GFEIAVSRPRVV 387
G++Q+ V ++ + EI + P V+
Sbjct: 391 LGKVQMEVTCALLQEKYHVEIEIKEPTVI 419



Score = 37.9 bits (88), Expect = 1e-04
Identities = 25/101 (24%), Positives = 38/101 (37%), Gaps = 3/101 (2%)

Query: 384 PRVVMQ--KGENGELLEPVEEVVIDVDEEHAGVVVQKMSERKAEMVELRPSGGNRQRIVF 441
P V+ Q K ELLEP I +E+ + A +V+ + N +
Sbjct: 521 PIVLEQVLKKAGTELLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQ-LKNNEVILSG 579

Query: 442 HAPTRGLIGYQSELLTDTRGTAVMNRLFHAYEPYKGELPGR 482
P R + Y+S+L T G +V Y GE +
Sbjct: 580 EIPARCIQEYRSDLTFFTNGRSVCLTELKGYHVTTGEPVCQ 620


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mll4118   TCRTETB  33     0.002    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 32.9 bits (75), Expect = 0.002
Identities = 30/151 (19%), Positives = 59/151 (39%), Gaps = 7/151 (4%)

Query: 62 NSFGWRRDVISLAAGVGILLYGLTGPFAAALMERIGLRRTLIASLLVMSGSTALSLLMTK 121
S W L +G +YG L +++G++R L+ +++ + + +
Sbjct: 49 ASTNWVNTAFMLTFSIGTAVYG-------KLSDQLGIKRLLLFGIIINCFGSVIGFVGHS 101

Query: 122 PWHLFITWGVFSGIGSGAVASVLGATIVNRWFKTNRGLVMGLMSASSATGLLVFLPLLAS 181
+ L I G G+ A +++ + K NRG GL+ + A G V +
Sbjct: 102 FFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGM 161

Query: 182 LAQSGGWKPVAVAVAVATACLLPLVWLLVPE 212
+A W + + + + L+ LL E
Sbjct: 162 IAHYIHWSYLLLIPMITIITVPFLMKLLKKE 192


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mlr4117   HTHTETR  67     4e-16    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 67.3 bits (164), Expect = 4e-16
Identities = 28/93 (30%), Positives = 46/93 (49%)

Query: 24 KKILDVAYDLFYRRGIRAIGVDEIVKRAGVTKPSLYRSFPSKDELAASYLRQYDLEYWER 83
+ ILDVA LF ++G+ + + EI K AGVT+ ++Y F K +L + + E
Sbjct: 14 QHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGEL 73

Query: 84 FDEAVKAHPGDPRAQIKAFLTRIGKRTQVADYR 116
E PGDP + ++ L + + T + R
Sbjct: 74 ELEYQAKFPGDPLSVLREILIHVLESTVTEERR 106


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mll4127   cloacin  43     8e-07    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 43.2 bits (101), Expect = 8e-07
Identities = 24/55 (43%), Positives = 27/55 (49%), Gaps = 3/55 (5%)

Query: 239 GPSGSSGGSGWTTGSSGGGWSSGSSSSGWSSGSSSSGGGFSGGGGSSGGGGSSGS 293
GP+G G G S G GWSS ++ G SGS GG SG G G G S G
Sbjct: 23 GPTGLGVGGG---ASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGG 74



Score = 42.8 bits (100), Expect = 1e-06
Identities = 22/57 (38%), Positives = 30/57 (52%), Gaps = 2/57 (3%)

Query: 237 GKGPSGSSGGSGWTTGSS--GGGWSSGSSSSGWSSGSSSSGGGFSGGGGSSGGGGSS 291
G+S GSGW++ ++ GGG SG G S + G G SGGG +GG S+
Sbjct: 27 LGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSA 83



Score = 37.8 bits (87), Expect = 4e-05
Identities = 23/61 (37%), Positives = 27/61 (44%), Gaps = 8/61 (13%)

Query: 241 SGSSGGSGWTTGSSGGGWSSG--SSSSGWSS------GSSSSGGGFSGGGGSSGGGGSSG 292
+G+ SG G G G S SGWSS G S SG + GG G GGG+
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGN 70

Query: 293 S 293
S
Sbjct: 71 S 71


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mll4128   cloacin  34     5e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 34.3 bits (78), Expect = 5e-04
Identities = 16/47 (34%), Positives = 19/47 (40%), Gaps = 6/47 (12%)

Query: 233 PGRRSSSGGWSSGSS------GGGWSSGGGGFSGGGGSSGGGGSSGS 273
G S + W GS GG GGG GG SG GG+ +
Sbjct: 37 SGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSA 83



Score = 32.0 bits (72), Expect = 0.002
Identities = 18/56 (32%), Positives = 21/56 (37%), Gaps = 16/56 (28%)

Query: 234 GRRSSSGGWSSGSSGGGWSS----------------GGGGFSGGGGSSGGGGSSGS 273
G + G S G GWSS GG G GGG+ GG SG+
Sbjct: 22 GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGT 77



Score = 30.1 bits (67), Expect = 0.010
Identities = 14/40 (35%), Positives = 17/40 (42%)

Query: 234 GRRSSSGGWSSGSSGGGWSSGGGGFSGGGGSSGGGGSSGS 273
G S SG G SG G G G GG G+ G + +
Sbjct: 47 GGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 28.9 bits (64), Expect = 0.022
Identities = 13/38 (34%), Positives = 15/38 (39%), Gaps = 2/38 (5%)

Query: 239 SGGWSSGSSGGGWSSGGG--GFSGGGGSSGGGGSSGSW 274
SGG G + G S+ G G G G GG W
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGW 39


91  mlr4438  mlr4445  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mlr4438   2       22       -3.606362  hypothetical protein
mlr4439   6       12       0.136243   hypothetical protein
mll4440   6       12       0.167838   hypothetical protein
mll4441   6       11       0.549588   microcystin dependent protein MdpB
mll4442   4       13       0.814927   microcystin dependent protein MdpB
mll4443   4       13       1.361771   microcystin dependent protein MdpB
mll4444   4       15       1.630408   hypothetical protein
mlr4445   -2      14       2.794756   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mlr4438   PF03544  27     0.016    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 27.2 bits (60), Expect = 0.016
Identities = 16/65 (24%), Positives = 27/65 (41%), Gaps = 1/65 (1%)

Query: 48 VFTVAIGGR-AVSLELTAVHQIASSPRPGGGFTLLFKGPRDISLPQAIYHLAGDAITDDI 106
+ +V I G L T+VHQ+ P P ++ P D+ PQA+ + +
Sbjct: 19 LLSVCIHGAVVAGLLYTSVHQVIELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEP 78

Query: 107 FIVPV 111
P+
Sbjct: 79 EPEPI 83


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll4440   SACTRNSFRASE  42     1e-07    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 42.2 bits (99), Expect = 1e-07
Identities = 32/122 (26%), Positives = 45/122 (36%), Gaps = 31/122 (25%)

Query: 44 AHYIKHYPNADWLVIMRDGDD------------IGRLYIER-WPTQHRIIDIALLPTYRG 90
Y K Y + D V + + IGR+ I W I DIA+ YR
Sbjct: 44 KPYFKQYEDDDMDVSYVEEEGKAAFLYYLENNCIGRIKIRSNWNGYALIEDIAVAKDYRK 103

Query: 91 RGLGAALLGDLIDEA--------WLAGKSVSIHVEKNNPARQLYARLGFAVAEDKGVYDL 142
+G+G ALL I+ A L + + N A YA+ F + G D
Sbjct: 104 KGVGTALLHKAIEWAKENHFCGLMLETQDI------NISACHFYAKHHFII----GAVDT 153

Query: 143 MV 144
M+
Sbjct: 154 ML 155


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mll4444   cloacin  38     0.001    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 37.8 bits (87), Expect = 0.001
Identities = 31/106 (29%), Positives = 45/106 (42%), Gaps = 1/106 (0%)

Query: 1254 NNTGTIGIGGTITNSGTGNGVVVSGGSAAITVSADISSSATAPGTAVKVDGITGGSVTFS 1313
+NTG G I N G V G S S++ + G+ + G +G
Sbjct: 9 HNTGAHSTSGNI-NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 1314 GLITSTGTGTGVSVSNTAAGSGVGFGAVTVSGAAGNGIGISGNAGS 1359
+ G+GTG ++S AA GF A++ GA G + IS A S
Sbjct: 68 NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALS 113


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mlr4445   PF06776  126    1e-38    Invasion associated locus B
		>PF06776#Invasion associated locus B

Length = 214

Score = 126 bits (317), Expect = 1e-38
Identities = 31/132 (23%), Positives = 53/132 (40%)

Query: 63 PWAVNCSSGSTANELQCQVSQNLTEAKTGQRVLTVTVRRDNANGSFAMLLALPHGLFLPS 122
W + C + A QC + Q++ LTV + + S M + P G+ LPS
Sbjct: 82 DWQIRCDTPPGAKAEQCALIQSVVAEDRSNAGLTVIILKTADQKSKLMRVVAPLGVLLPS 141

Query: 123 GVSYQIDSGKKVTVAIQTSDQNGAYAAVPVAPELAKAMKSGTTLNIGMESVTRKPVTIPV 182
G+ ++D+ NG A V + +L +++ T + + + P+
Sbjct: 142 GLGLKLDNVDVGRAGFVRCLPNGCVAEVVMDDKLLGQLRTAKTATFIIFETPEEGIGFPL 201

Query: 183 SLKGFGAAVAKL 194
SL G G KL
Sbjct: 202 SLNGIGEGYDKL 213


92  mlr4457  mll4471  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias   Product
mlr4457   -2      10       2.596541  two-component response regulator
mlr4459   -3      10       2.242604  sensory histidine protein kinase
mlr4460   -3      10       2.626907  lipoprotein
mlr4461   -2      9        2.959888  ABC transporter ATP-binding protein
mlr4463   -2      8        2.925806  ABC transporter ATP-binding protein
mlr4464   -3      9        2.975957  ABC transporter
mlr4467   -1      14       2.294561  hypothetical protein
mll4468   0       16       2.563438  transcriptional regulator
mlr4469   0       16       2.983925  hypothetical protein
mll4470   0       16       2.637168  dihydrolipoamide dehydrogenase
mll4471   0       17       2.770564  branched-chain alpha-keto acid dehydrogenase E2
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
mlr4457   HTHFIS  77     1e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 77.2 bits (190), Expect = 1e-18
Identities = 31/150 (20%), Positives = 61/150 (40%), Gaps = 8/150 (5%)

Query: 2 RLLLVEDDQKTADYIVRGLTEAGHVCDLLRNGHDALFAATSGSYDVIVADRMIPGLDGLS 61
+L+ +DD + + L+ AG+ + N +G D++V D ++P +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 MVKAARAAGVRTPAIFLTSIGGIDDRVEGLEAGGDDYLVKPFAFSELLARI-NALGRRPA 120
++ + A P + +++ ++ E G DYL KPF +EL+ I AL
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 121 AQEQKTVLRVADLEMDLI-----MRRVTRQ 145
+ + M L+ M+ + R
Sbjct: 125 --RPSKLEDDSQDGMPLVGRSAAMQEIYRV 152


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mlr4459   PF06580  45     3e-07    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 45.2 bits (107), Expect = 3e-07
Identities = 27/102 (26%), Positives = 38/102 (37%), Gaps = 23/102 (22%)

Query: 358 LVENALRH----CPPGTTIKLSVTRQAERVVASVADNGPGIPPDEREQVFQRLYRLDHSR 413
LVEN ++H P G I L T+ V V + G + +E
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE------------- 309

Query: 414 STPGNGLGLSLVRA-IADLHG--ASIALDDCQPGLAVVVSFP 452
G GL VR + L+G A I L + Q + +V P
Sbjct: 310 ---STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mlr4463   PF05272  29     0.020    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.9 bits (64), Expect = 0.020
Identities = 11/20 (55%), Positives = 14/20 (70%)

Query: 46 LVGPSGSGKTTLLHILGGLD 65
L G G GK+TL++ L GLD
Sbjct: 601 LEGTGGIGKSTLINTLVGLD 620


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
mlr4464   LCRVANTIGEN  32     0.007    Low calcium response V antigen signature.
		>LCRVANTIGEN#Low calcium response V antigen signature.

Length = 326

Score = 32.3 bits (73), Expect = 0.007
Identities = 26/110 (23%), Positives = 40/110 (36%), Gaps = 13/110 (11%)

Query: 543 RIDAELTNGSDVTVFGTTDRP---AGAHLAALASLPEAATAEPMQHRFAYVGADLQDLYG 599
I + N D ++G TD A A L +P+ + ++D G
Sbjct: 185 NIHDKSINLMDKNLYGYTDEEIFKASAEYKILEKMPQTTIQVDGSEKKI---VSIKDFLG 241

Query: 600 IDPARIGRATGLSDAYFSGASAAGTLALLAAT------PDGVLVSEETVQ 643
+ R G L ++Y S L+ A T P LVS++T Q
Sbjct: 242 SENKRTGALGNLKNSY-SYNKDNNELSHFATTCSDKSRPLNDLVSQKTTQ 290


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
mll4468   HTHFIS  27     0.036    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 26.7 bits (59), Expect = 0.036
Identities = 12/41 (29%), Positives = 22/41 (53%)

Query: 2 LSEAEQALLSLLRSNARASTAELARRLGVSRTTVQSRIERL 42
L+E E L+ + R + + A LG++R T++ +I L
Sbjct: 433 LAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIREL 473


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mll4471   PF03544  30     0.018    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 29.9 bits (67), Expect = 0.018
Identities = 19/72 (26%), Positives = 26/72 (36%)

Query: 92 KAEAVAAEPPAKLPTPKPETAGPVAKASPKAGAPEAKPAPAVAKSTGQRSISGAPRPEGE 151
K V E P P PKP+ V + E++PA + R S
Sbjct: 88 KEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATS 147

Query: 152 RPLASPAVRLRA 163
+P+ S A RA
Sbjct: 148 KPVTSVASGPRA 159


93  mlr4593  mlr4601  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mlr4593   -1      16       2.384388   transcriptional regulator
mlr4594   0       16       2.100851   hypothetical protein
mlr4595   -1      17       1.468642   ABC-transporter ATP-binding protein system
mlr4596   0       19       1.367499   ABC transporter ATP-binding protein
mll4597   -1      17       2.041772   threonine dehydratase
mll4599   0       16       1.097083   hypothetical protein
mlr4601   0       15       0.718694   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mlr4593   HTHTETR  61     2e-13    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 60.8 bits (147), Expect = 2e-13
Identities = 29/189 (15%), Positives = 67/189 (35%), Gaps = 22/189 (11%)

Query: 7 AKSSRRETSPELTRAVLVQAALKLFGRQGFDGTSTREIAAEAQANIGSIAYHFGGKEGLR 66
A+ +++E TR ++ AL+LF +QG TS EIA A G+I +HF K L
Sbjct: 2 ARKTKQEAQE--TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLF 59

Query: 67 AAVADYIVETVQAVAGRALAAGQAPGSGQAAPAGDPEAARAQLFAAIEGIVGFVVAQPHA 126
+ + + + + A P + L + ++ V +
Sbjct: 60 SEIWELSESNIGELE-------------LEYQAKFPGDPLSVLREILIHVLESTVTEERR 106

Query: 127 GEIVQFLLRELQHPTAA--LDRIYDGVFEPTHRRLCHLWE--QATGE---PAESERTRLT 179
+++ + + + + + + ++ R+ + + R +
Sbjct: 107 RLLMEIIFHKCEFVGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAII 166

Query: 180 VFTLIGQVV 188
+ I ++
Sbjct: 167 MRGYISGLM 175


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
mlr4594   RTXTOXIND  50     6e-09    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 49.8 bits (119), Expect = 6e-09
Identities = 48/368 (13%), Positives = 105/368 (28%), Gaps = 92/368 (25%)

Query: 28 VEGDYVLLAPIEVAQVETVAVKRGDRVMPGTTVVTLESADAKIAVAQAEASLAQAQAQLA 87
G + PIE + V+ + VK G+ V G ++ L + A+ + ++SL QA+ +
Sbjct: 92 HSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQT 151

Query: 88 DLQVGKRPEE-----------------------------------------------IAA 100
Q+ R E +
Sbjct: 152 RYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDK 211

Query: 101 LKAQVDMAKAQADDAKR-------KYDRAADLFKRGTGTQADYDTASATLETANAQVGQA 153
+A+ A+ + + + D + L + + A ++
Sbjct: 212 KRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVY 271

Query: 154 QANLAV---GGLPARPETI--------------KAADNQVKQAQAALQQAQWRLSKRTLA 196
++ L L A+ E + + + L + + R +
Sbjct: 272 KSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIR 331

Query: 197 APSPGRVNDV-IRNPGDTAGPTAPVISMLPDGAVKL-SVYIPEAAFSSVRIGSLLSVHCD 254
AP +V + + G ++ ++P+ + + + +G + +
Sbjct: 332 APVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVE 391

Query: 255 GCGE----GVKARVSYVSPDPEFTPPVIYSLENRQKLVYLVEARPEGDAG-------PLQ 303
+ +V ++ D + R LV+ V E + PL
Sbjct: 392 AFPYTRYGYLVGKVKNINLDA--------IEDQRLGLVFNVIISIEENCLSTGNKNIPLS 443

Query: 304 PGQIVDVE 311
G V E
Sbjct: 444 SGMAVTAE 451


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr4596   ABC2TRNSPORT  44     3e-07    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 44.2 bits (104), Expect = 3e-07
Identities = 46/172 (26%), Positives = 73/172 (42%), Gaps = 4/172 (2%)

Query: 205 ALTRETERGTMENLLAMPSSPAEIMLGKVLPFLVVGAVQVAVVLVAAKLLFDIPFVGSLT 264
A R + T E +L +I+LG++ A+ A + V A L ++ SL
Sbjct: 90 AFGRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAGIGVVAAALGYTQWL-SLL 148

Query: 265 LLLSSVLVFVLSLVLLGYTISTMARSQMQAMQLTFFFFLPSLLLSGFMFPYRGMPGWAQA 324
L + + L+ LG ++ +A S + P L LSG +FP +P Q
Sbjct: 149 YALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQT 208

Query: 325 LGEIFPLTHFLRITRSVMLKGADFHAIATEVGWLAVF--VLVFAGTALLRFR 374
PL+H + + R +ML + VG L ++ + F TALLR R
Sbjct: 209 AARFLPLSHSIDLIRPIMLGHPVVD-VCQHVGALCIYIVIPFFLSTALLRRR 259


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mlr4601   PF02370  33     8e-04    M protein repeat
		>PF02370#M protein repeat

Length = 168

Score = 32.8 bits (74), Expect = 8e-04
Identities = 30/130 (23%), Positives = 53/130 (40%), Gaps = 10/130 (7%)

Query: 36 YRAYADGLAESFPQSAKVFEEMAE--EEDGHRDSLIELFRKRFGERIPLIRREHVRGYYE 93
Y +D E+ PQ + E + + +G IE K E R R +E
Sbjct: 36 YLDSSDSKRENDPQYRALMGENQDLRKREGQYQDKIEELEKERKE---KQERPERREKFE 92

Query: 94 RKPDWLVRPLGIEHVRRQAEAMEQQAYRFYVEAAKRTSDASTRKLLDDLA---AAEQGHE 150
R+ + +++ + +E + + + K+ SDAS + L DL AA++ E
Sbjct: 93 RQHQDKHYQEQQKKHQQEQQQLEAEKQK--LAKEKQISDASRQGLNRDLEASRAAKKELE 150

Query: 151 NSAQRLEQKH 160
Q+L +H
Sbjct: 151 PKHQKLGTEH 160


94  mlr4812  mlr4819  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mlr4812   -1      21       -0.428984  inner membrane protein translocase component
mlr4813   -1      18       0.096814   secreted protein MPB70 (and transforming growth
mlr4814   -1      18       0.516245   streptogramin A acetyl transferase
mlr4815   0       19       0.734528   ribosome biogenesis GTP-binding protein YsxC
mll4816   0       17       1.067325   TetR family transcriptional regulator
mlr4818   -1      16       1.263483   multidrug resistance efflux pump
mlr4819   -1      13       1.938185   multidrug resistance protein B (drug efflux
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
mlr4812   60KDINNERMP  320    e-104    60kDa inner membrane protein signature.
		>60KDINNERMP#60kDa inner membrane protein signature.

Length = 548

Score = 320 bits (821), Expect = e-104
Identities = 167/597 (27%), Positives = 277/597 (46%), Gaps = 80/597 (13%)

Query: 1 MENNRNFFITIALSVLILAVWQYFYVLPRSEAQREAARVEQQRVEEQKKAAEAANPGAGT 60
M++ RN + IAL + +WQ ++ + T
Sbjct: 1 MDSQRNLLV-IALLFVSFMIWQ------------------AWEQDKNPQPQAQQTTQTTT 41

Query: 61 PAPAPGTIPNAPGGDTVTVAGRDQALAASKRVKIDTPSLEGSINLTGARLDDLKLKHYTE 120
A P K + + T L+ +IN G ++ L Y +
Sbjct: 42 TAAGSAADQGVPASGQ------------GKLISVKTDVLDLTINTRGGDVEQALLPAYPK 89

Query: 121 TVDKNSPEIELLN--PQALPTGYFAEIGFVGNDKTGA-VPGAETVWNVDGNPTL----SP 173
++ P +LL PQ + Y A+ G G D G ++NV+ + +
Sbjct: 90 ELNSTQP-FQLLETSPQFI---YQAQSGLTGRDGPDNPANGPRPLYNVEKDAYVLAEGQN 145

Query: 174 STPVTLTYTNDKGLTFKRTFSVD-ANYMFTVSDTVQNSGSSAVSLFNYGRVTRYDK-PAV 231
V +TYT+ G TF +TF + +Y V+ VQN+G + + ++G++ + P
Sbjct: 146 ELQVPMTYTDAAGNTFTKTFVLKRGDYAVNVNYNVQNAGEKPLEISSFGQLKQSITLPPH 205

Query: 232 ASTYVLHEGLIGFTGTE-GLQEHKYASIE----KDKQYQPGKATDGWLGITDKYWAVTLV 286
T + L F G + KY + D + + GW+ + +Y+A +
Sbjct: 206 LDTGSSNFALHTFRGAAYSTPDEKYEKYKFDTIADNENLNISSKGGWVAMLQQYFATAWI 265

Query: 287 PTEKQPFQPRYAFF--EDGRHRYQSDFLTDAINVEAGQSATVETEVFAGAKEVAKINAYA 344
P F+ G + + + V+ GQ+ + + ++ G + K+ A A
Sbjct: 266 PHN----DGTNNFYTANLGNGIAAIGYKSQPVLVQPGQTGAMNSTLWVGPEIQDKMAAVA 321

Query: 345 EDRHIKRFDLLIDWGWFHFITKPMFWLIDTLYKFLGNFGLAILATTVIVKALFFPLANKS 404
DL +D+GW FI++P+F L+ ++ F+GN+G +I+ T IV+ + +PL
Sbjct: 322 PH-----LDLTVDYGWLWFISQPLFKLLKWIHSFVGNWGFSIIIITFIVRGIMYPLTKAQ 376

Query: 405 YASMANMKKVQPKMLEIREKYADDKMKQQQAMMELYKTEKINPLAGCWPVALQIPVFFSL 464
Y SMA M+ +QPK+ +RE+ DDK + Q MM LYK EK+NPL GC+P+ +Q+P+F +L
Sbjct: 377 YTSMAKMRMLQPKIQAMRERLGDDKQRISQEMMALYKAEKVNPLGGCFPLLIQMPIFLAL 436

Query: 465 YKVLYITIEMRHAPFFGWIQDLAAPDPTSLFNLFGLIPVTLPHMLMIGVWPLIMGVTMFL 524
Y +L ++E+R APF WI DL+A DP + P++MGVTMF
Sbjct: 437 YYMLMGSVELRQAPFALWIHDLSAQDPYYIL-------------------PILMGVTMFF 477

Query: 525 QMRMNPTP-PDPTQAAIFTWMPVIFTFMMAGFPAGLVIYWAWNNMLSILQQGVIMKR 580
+M+PT DP Q I T+MPVIFT FP+GLV+Y+ +N+++I+QQ +I +
Sbjct: 478 IQKMSPTTVTDPMQQKIMTFMPVIFTVFFLWFPSGLVLYYIVSNLVTIIQQQLIYRG 534


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mll4816   HTHTETR  77     2e-19    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 77.4 bits (190), Expect = 2e-19
Identities = 36/165 (21%), Positives = 69/165 (41%), Gaps = 9/165 (5%)

Query: 24 RPAAGQDPVKRSQIIDGARRVFIEKGFEAASMNDITREAGVSKGTIYVYFANKEELFEAL 83
R + R I+D A R+F ++G + S+ +I + AGV++G IY +F +K +LF +
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 84 IEEERGTIFKNMYDMLDRADDLRQTLVKFGKVLSMKITSARVIQAQRTVIGASDRIPDM- 142
E I + + + L ++L + S + +R ++ +
Sbjct: 63 WELSESNIGELELEYQAK--FPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120

Query: 143 GARFYERGPKRG-----HDKVVKFLNAAIERGLLKID-DVDLAAY 181
G + +R +D++ + L IE +L D AA
Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAI 165


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
mlr4818   RTXTOXIND  98     2e-24    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 98.0 bits (244), Expect = 2e-24
Identities = 44/289 (15%), Positives = 99/289 (34%), Gaps = 21/289 (7%)

Query: 109 ENQQVKAGDPLLTIDDGDYKIAVAQAEAQIATLSKTLDRIDAQTKAAQASLQQAKAQKVA 168
++V L+ ++ Q E + + A+ + + K++
Sbjct: 180 SEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDD 239

Query: 169 DQAAADNAARAQQRAAQLVKTHVGTQAQLDDAQTALDQANAALVGADAQIAAAQANI--G 226
+ A A+ + +V +L ++ L+Q + ++ A +
Sbjct: 240 FSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNE 299

Query: 227 VLEAQRAESASTLASLQLGRDKAARDLSFTVLKAPYDGIVGNRSV-EQGDLVSPGQKLAV 285
+L+ + ++ + L L K +V++AP V V +G +V+ + L V
Sbjct: 300 ILD-KLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMV 358

Query: 286 VVPMDKLYVV-ANFKETQLARLVPGEKVNISVDAIDG---HPIQGTVSSLAPASGAVFSL 341
+VP D V A + + + G+ I V+A + G V ++ +
Sbjct: 359 IVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDA------ 412

Query: 342 LPPENATGNFTKVVQRVPVRIDVPA--DALKTGKLRAGLSVVVNVDSRT 388
+V V + I+ K L +G++V + +
Sbjct: 413 -----IEDQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGM 456



Score = 69.5 bits (170), Expect = 4e-15
Identities = 28/151 (18%), Positives = 51/151 (33%), Gaps = 9/151 (5%)

Query: 94 VSPKISGYVDQVKVSENQQVKAGDPLLTIDDGDYKIAVAQAEAQIATLSKTLDRIDAQTK 153
+ P + V ++ V E + V+ GD LL + + + ++ + L L++ Q
Sbjct: 99 IKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSL--LQARLEQTRYQIL 156

Query: 154 AAQASLQQAKAQKVADQAAADNAARAQ-QRAAQLVKTHVGTQAQLDDAQTALDQANAALV 212
+ L + K+ D+ N + + R L+ + Q Q Q L
Sbjct: 157 SRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLI------KEQFSTWQNQKYQKELNLD 210

Query: 213 GADAQIAAAQANIGVLEAQRAESASTLASLQ 243
A+ A I E S L
Sbjct: 211 KKRAERLTVLARINRYENLSRVEKSRLDDFS 241


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mlr4819   TCRTETB  108    1e-27    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 108 bits (272), Expect = 1e-27
Identities = 75/401 (18%), Positives = 147/401 (36%), Gaps = 19/401 (4%)

Query: 37 FMAILDIQIVSASLAEIQAGLSASSDEIPWVQTAYLIAEVVMIPLSGFLSRMLSTRVLFT 96
F ++L+ +++ SL +I + WV TA+++ + + G LS L + L
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 97 IAAAGFTAASALAATA-TNIDQMIVYRAVQGFIGGGMIPSVFAAAFTIF-PPSKRAVVSP 154
S + + +I+ R +QG G P++ + P R
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQG-AGAAAFPALVMVVVARYIPKENRGKAFG 142

Query: 155 MIGLVATLAPTIGPTVGGYISHAFSWHWLFLVNVVPGILVATAAWSLIDFDKPNLKLFNK 214
+IG + + +GP +GG I+H W +L L+ ++ I V L+ K +++
Sbjct: 143 LIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPF----LMKLLKKEVRIKGH 198

Query: 215 FDWWGLAGMAAFLGCMEYVLEEGPNNDWLQDQGVFICAIVMTIGAVIFFWRVFTAEEPIV 274
FD G+ M+ + + IV + +IF + +P V
Sbjct: 199 FDIKGIILMSVGIVFFMLFTT---SYSISF-------LIVSVLSFLIFVKHIRKVTDPFV 248

Query: 275 DLRAFSNVNFAFGSLFSFVIGIGLYGLTYLYPVFLGRIRGYDSMMIGEA-LFVSGLAMFV 333
D N+ F G L +I + G + P + + + IG +F +++ +
Sbjct: 249 DPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVII 308

Query: 334 TAPISGFLSSKIDLRLMMMIGFFGFATGTWWMTHLTADWDFYELLIPQILRGCSMMLCMV 393
I G L + ++ IG + + + + + I + +
Sbjct: 309 FGYIGGILVDRRGPLYVLNIGVTFLSVS-FLTASFLLETTSWFMTIIIVFVLGGLSFTKT 367

Query: 394 PINNIALGTLPPDRLKNASGLFNLTRNLGGAVGLALINTVL 434
I+ I +L L N T L G+A++ +L
Sbjct: 368 VISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLL 408


95  mlr5177  mll5193  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mlr5177   -1      14       -0.554283  hypothetical protein
mll5179   -2      13       0.841150   hypothetical protein
mll5182   -1      14       0.524927   hypothetical protein
mlr5181   -2      13       0.833012   nicotinamide nucleotide transhydrogenase,
mlr5183   -2      9        0.341741   nicotinamide nucleotide transhydrogenase subunit
mlr5184   -2      8        0.283777   nicotinamide nucleotide transhydrogenase subunit
mll5186   -2      9        0.321841   hypothetical protein
msl5187   -2      10       0.214932   hypothetical protein
mll5188   -2      11       0.484068   DNA gyrase subunit B
mll5190   -1      10       0.714295   hypothetical protein
mll5191   -1      9        1.623200   hypothetical protein
mlr5192   0       9        2.267573   cytochrome P450
mll5193   0       9        2.444782   NADH dehydrogenase (ubiquinone) 1 alpha
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr5177   AUTOINDCRSYN  34     3e-04    Autoinducer synthesis protein signature.
		>AUTOINDCRSYN#Autoinducer synthesis protein signature.

Length = 216

Score = 34.1 bits (78), Expect = 3e-04
Identities = 25/119 (21%), Positives = 55/119 (46%), Gaps = 7/119 (5%)

Query: 27 MMQLLEHVDYRLITGGEDLEAIYRLRYKSYL-RSGMCGPIAAGMFEDRWDNLPNSYRFGV 85
M+++ + V++ L++ + E ++ LR +++ R GM D++DN +Y FG+
Sbjct: 1 MLEIFD-VNHTLLSETKSGE-LFTLRKETFKDRLNWAVQCTDGMEFDQYDNNNTTYLFGI 58

Query: 86 YCYGELVSTIRFHYISREHPNSPSVDAYPEILIERLARGETFIDGTRFATDPDAAPAPG 144
++ ++RF I ++PN + + E +++ +RF D A
Sbjct: 59 K-DNTVICSLRF--IETKYPNMIT-GTFFPYFKEINIPEGNYLESSRFFVDKSRAKDIL 113


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr5183   PYOCINKILLER  27     0.046    Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 26.7 bits (58), Expect = 0.046
Identities = 13/39 (33%), Positives = 17/39 (43%)

Query: 5 SLQKALDQLDQASAAVRLAVQNLANAPGGAEAAGDAAHA 43
SLQ ++ L A A++ A N A AEA A
Sbjct: 199 SLQIRMNTLTAAKASIEAAAANKAREQAAAEAKRKAEEQ 237


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr5184   ACRIFLAVINRP  30     0.022    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 30.2 bits (68), Expect = 0.022
Identities = 39/171 (22%), Positives = 65/171 (38%), Gaps = 20/171 (11%)

Query: 55 SAGRFGLIVLGLAIGGGVGAVT------ARRIAMTSMPQLVAAFHSLVGLAAVMVAAAAI 108
S + + LAIG V R + +P A S+ + +V A +
Sbjct: 389 SINTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMV 448

Query: 109 ----YAPESFGIGTAGDIHAQALIEMSLGVAIG---AITFTGSVIAFLKLDGRMSGKPIM 161
+ P +F G+ G I+ Q I + +A+ A+ T ++ A L L +
Sbjct: 449 LSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATL-LKPVSAEHHEN 507

Query: 162 LGG------RHFINAALGVALIVLIVLLVTTESKLVFWLIVAASLVLGVLL 206
GG F ++ V +L T L++ LIVA +VL + L
Sbjct: 508 KGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRL 558


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll5186   DNABINDINGHU  32     5e-04    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 32.4 bits (74), Expect = 5e-04
Identities = 12/33 (36%), Positives = 17/33 (51%)

Query: 218 KTYLAQLVAARTGLSEADAKARVDAMVAKVEDA 250
K L VA T L++ D+ A VDA+ + V
Sbjct: 4 KQDLIAKVAEATELTKKDSAAAVDAVFSAVSSY 36


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
mll5190   TYPE4SSCAGA  34     0.003    Type IV secretion system CagA exotoxin signature.
		>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature.

Length = 1147

Score = 33.9 bits (77), Expect = 0.003
Identities = 30/98 (30%), Positives = 45/98 (45%), Gaps = 4/98 (4%)

Query: 476 IKMVQGDLPTSLEHYEAARDMLQDLTASVPDEKSWLGDLAMAN---DKIGNVLATQGDVG 532
+K Q DL SL E ++ S K+ + A AN D+I ++ + +
Sbjct: 607 VKKAQKDLEKSLRKREHLEKEVEKKLESKSGNKNKMEAKAQANSQKDEIFALINKEANRD 666

Query: 533 AAAKAYQQSLS-IKRKLVDAQPNSASLLRDLTITYDEI 569
A A AY Q+L IKR+L D N L+D ++DE
Sbjct: 667 ARAIAYAQNLKGIKRELSDKLENVNKNLKDFDKSFDEF 704


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll5193   NUCEPIMERASE  36     1e-04    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 36.3 bits (84), Expect = 1e-04
Identities = 55/302 (18%), Positives = 103/302 (34%), Gaps = 78/302 (25%)

Query: 27 VVVFGGSGFVGRHVVRALAKRGYRIR----------VACRRPDLAGHLQPLGNVGQIQPV 76
+V G +GF+G HV + L + G+++ V+ ++ L+ L G Q
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQ----ARLELLAQPG-FQFH 57

Query: 77 QANVRVRWSVDR--AVQGADHVVNLVAILHETG-------RQKFSAVHEFGSRAVAEAAR 127
+ ++ R + A + V ++ + G + E R
Sbjct: 58 KIDLADREGMTDLFASGHFERVFISPH---RLAVRYSLENPHAYADSNLTGFLNILEGCR 114

Query: 128 SVGAGLTHI------SALGADLD---SESD--------YARTKALGEKAV-----LETIP 165
+ H+ S G + S D YA TK E L +P
Sbjct: 115 H--NKIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLP 172

Query: 166 DAVIFRPSINFG----PEDSFFNRFASMARYSPVLPLIGGGQTKFQPVYVGDVAEAVARS 221
A R +G P+ + F +M + + G+ K Y+ D+AEA+ R
Sbjct: 173 -ATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSI-DVYNYGKMKRDFTYIDDIAEAIIRL 230

Query: 222 VDGKIDR-----------------GQIYELGGP---NVLTFKECMEELLTVIERKRLLVP 261
D ++Y +G ++ + + +E+ L IE K+ ++P
Sbjct: 231 QDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALG-IEAKKNMLP 289

Query: 262 VP 263
+
Sbjct: 290 LQ 291


96  mll5518  mlr5526  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mll5518   0       14       1.769728   RND efflux transporter
mll5519   -1      13       1.276177   RND efflux membrane fusion protein
mll5520   -2      12       0.398271   RND efflux membrane fusion protein
mlr5521   -3      11       0.289493   4-aminobutyrate aminotransferase
mll5522   -3      9        0.040046   rubredoxin reductase
mlr5523   -2      9        -0.107861  hypothetical protein
msr5525   -1      9        0.724441   hypothetical protein
mlr5526   -1      8        1.505361   bacterioferritin
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll5518   ACRIFLAVINRP  459    e-147    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 459 bits (1183), Expect = e-147
Identities = 227/1045 (21%), Positives = 436/1045 (41%), Gaps = 64/1045 (6%)

Query: 6 LSDWALSHRSMVWYFMLVFVVAGIFSYLNLGREEDPNFTIKTMIIQANWPGASVKETLQQ 65
++++ + W ++ ++AG + L L + P + + AN+PGA +
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 66 VTDRIEKKLEELDSLDFTRSVTT-AGQTVIFVNLKDTTRARDVVPNWLQVRNMVNDIKAQ 124
VT IE+ + +D+L + S + AG I + + T D +QV+N +
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGT---DPDIAQVQVQNKLQLATPL 117

Query: 125 FPQGVQGP-FFNDRFGDVYGNIYAFTSDG--LTPRQLRDYVED-ARTKILTVPNAGKVDL 180
PQ VQ ++ Y + F SD T + DYV + + + G V L
Sbjct: 118 LPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQL 177

Query: 181 VGAQDEAIYLEFSTRQIAALGLNQQAIVASLQAQNAITPSGVIQSGPE------RISVRV 234
GAQ A+ + + L ++ L+ QN +G + P S+
Sbjct: 178 FGAQ-YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIA 236

Query: 235 GGQFTSEDSLRAINLRVND--RFFRLSDVATITRGYADPPTALFRFNGQDAIALAIGMKP 292
+F + + + LRVN RL DVA + G + + R NG+ A L I +
Sbjct: 237 QTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELG-GENYNVIARINGKPAAGLGIKLAT 295

Query: 293 NANLLQFGEALHKEMNKVLADLPVGVGVHLVADQPVIVEEAVSGFTRALFEAVAIVLAVS 352
AN L +A+ ++ ++ P G+ V D V+ ++ + LFEA+ +V V
Sbjct: 296 GANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVM 355

Query: 353 FISLG-MRAGFVVALSIPLVLAITFTVMAYLGISLQRISLGALIIALGLLVDDAMIAVEM 411
++ L MRA + +++P+VL TF ++A G S+ +++ +++A+GLLVDDA++ VE
Sbjct: 356 YLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVEN 415

Query: 412 MVARLEVGDNLRKAATYVYTS-TAFPMLTGTLVTVAGFIPIGLNSSAAGEYTFTLFVVIA 470
+ + K AT S ++ +V A FIP+ + G + I
Sbjct: 416 VERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIV 475

Query: 471 VSLLVSWIVAVLFAPLLGVTILPATMKTKHHDQPG-----------RLTSLFRRVLVGSV 519
++ +S +VA++ P L T+L + +HH+ G + + + +
Sbjct: 476 SAMALSVLVALILTPALCATLL-KPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKIL 534

Query: 520 RHHWLTIIATVLLFAASIAGFGLVQQQFFPPSDRPELIVDWNLPQNSSIAETRDQMERFE 579
++ L+ A + F + F P D+ + LP ++ T+ +++
Sbjct: 535 GSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVT 594

Query: 580 QRALVGNPDIDHFSSYIGQGAVRFVLAYDVQPANPYFGQTVIVTKSLEARNRVKPALEKL 639
L +V V + G + K E RN + + E +
Sbjct: 595 DYYLKNEKANV--------ESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAV 646

Query: 640 LREEFVG----TDAFV------KLLELGPPVGRPVQ-YRVGGPDIQTVRELAQQFAGVIS 688
+ + D FV ++ELG G + G + + Q G+ +
Sbjct: 647 IHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAA 706

Query: 689 AN-ARLAAPTFDWNEPQRVLRVDVLQDKARQLGITSSDIASALNSTVGGATITQVRDATY 747
+ A L + + E +++V Q+KA+ LG++ SDI +++ +GG + D
Sbjct: 707 QHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGR 766

Query: 748 LINVVARSRDAERGSIGTLQNLQLPTSTGEAIPLAAVANFRYELEQPTVWRRDRIPTITV 807
+ + ++ R + L + ++ GE +P +A + P + R + +P++ +
Sbjct: 767 VKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEI 826

Query: 808 RAGLVGDTLPATAVNELKPSVDAFIAKLPPRYSVETAGSVEESAKSQGPIAAVVPLMLFV 867
+ T A+ ++ +KLP + G + S A+V + V
Sbjct: 827 QGEAAPGTSSGDAMAL----MENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVV 882

Query: 868 MATILMIQLQSFQRLFLVVAVAPLGLIGVVAALVPSGAPLGFVAILGVLALIGILIRNSV 927
+ L +S+ V+ V PLG++GV+ A ++G+L IG+ +N++
Sbjct: 883 VFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAI 942

Query: 928 ILIVQIEDL-VREGKDRWAAVIEATEHRMRPIALTAAAASLALIPIA------REVFWGP 980
+++ +DL +EGK A + A R+RPI +T+ A L ++P+A
Sbjct: 943 LIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQN-A 1001

Query: 981 MAYAMMGGIIAGTAITLLFLPALYV 1005
+ +MGG+++ T + + F+P +V
Sbjct: 1002 VGIGVMGGMVSATLLAIFFVPVFFV 1026



Score = 83.7 bits (207), Expect = 2e-18
Identities = 80/526 (15%), Positives = 175/526 (33%), Gaps = 50/526 (9%)

Query: 518 SVRHHWLTIIATVLLFAASIAGFGLVQQQFFPPSDRPELIVDWNLPQNSSIAETRDQMER 577
+R + ++L A + +P P + V N P + +D + +
Sbjct: 5 FIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYP-GADAQTVQDTVTQ 63

Query: 578 FEQRALVGNPDIDHFSSY-IGQGAVRFVLAYDVQPANPYFGQTVIVTKSLEARNRVKPAL 636
++ + G ++ + SS G+V L + +P Q + K A + +
Sbjct: 64 VIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSG-TDPDIAQVQVQNKLQLATPLLPQEV 122

Query: 637 EKLLREEFVGTDAFVKLLELGPPVGRPVQYRVGGPDIQTVRELAQQFAGVISANARLAAP 696
++ + +++ + Q + V++ + GV
Sbjct: 123 QQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQL----- 177

Query: 697 TFDWNEPQRVLRVDVLQDKARQLGITSSDIASALNS--------TVGGATITQVRDATYL 748
Q +R+ + D + +T D+ + L +GG +
Sbjct: 178 ----FGAQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNAS 233

Query: 749 INVVARSRDAERGSIGTLQNLQLPTST-GEAIPLAAVANFRYELE-QPTVWRRDRIPTIT 806
I R ++ E + L ++ G + L VA E + R + P
Sbjct: 234 IIAQTRFKNPEE-----FGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAG 288

Query: 807 VRAGLVGDTLPATAVNELKPSVDAFIAKLPPRYSVETA--------GSVEESAKSQGPIA 858
+ L +K + P V S+ E K+
Sbjct: 289 LGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAI 348

Query: 859 AVVPLMLFVMATILMIQLQSFQRLFLVVAVA-PLGLIGVVAALVPSGAPLGFVAILGVLA 917
+V L++++ LQ+ R L+ +A P+ L+G A L G + + + G++
Sbjct: 349 MLVFLVMYLF-------LQNM-RATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVL 400

Query: 918 LIGILIRNSVILIVQIEDLVREGK-DRWAAVIEATEHRMRPIALTAAAASLALIPIA--- 973
IG+L+ ++++++ +E ++ E K A ++ + A S IP+A
Sbjct: 401 AIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFG 460

Query: 974 --REVFWGPMAYAMMGGIIAGTAITLLFLPALYVTWFRIKEPKQGR 1017
+ + ++ + + L+ PAL T + +
Sbjct: 461 GSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHE 506


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
mll5519   RTXTOXIND  38     4e-05    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 38.3 bits (89), Expect = 4e-05
Identities = 20/140 (14%), Positives = 47/140 (33%), Gaps = 4/140 (2%)

Query: 94 LAVQAARAELSNAEAQFANAAASEERQRQL--LASANATQAVFDAAQQARKAAEANVERA 151
L + E N + + E + TQ + + N+
Sbjct: 255 LEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLL 314

Query: 152 KASLAKSQEQLGYARLFSDFDGVVTAVGA-EVGQTVSAGQTVVTVARSDLR-EAVVDIPD 209
LAK++E+ + + + V + G V+ +T++ + D E + +
Sbjct: 315 TLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQN 374

Query: 210 QLTGDLTTGTPFQVILQSLP 229
+ G + G + +++ P
Sbjct: 375 KDIGFINVGQNAIIKVEAFP 394



Score = 36.0 bits (83), Expect = 2e-04
Identities = 20/125 (16%), Positives = 39/125 (31%), Gaps = 19/125 (15%)

Query: 72 VKVGDIVSKGTTIAALDPTALELAVQAARAELSNAEAQFANAAASEERQRQLLASA---N 128
VK G+ V KG + L A A+ ++ A + R + L S
Sbjct: 112 VKEGESVRKGDVLLKLTALG-------AEADTLKTQSSLLQARLEQTRYQILSRSIELNK 164

Query: 129 ATQAVFDAAQQARKAAEANVERAKASLA---------KSQEQLGYARLFSDFDGVVTAVG 179
+ + +E V R + + K Q++L + ++ V+ +
Sbjct: 165 LPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARIN 224

Query: 180 AEVGQ 184

Sbjct: 225 RYENL 229


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
mll5520   RTXTOXIND  56     1e-10    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 55.6 bits (134), Expect = 1e-10
Identities = 38/195 (19%), Positives = 72/195 (36%), Gaps = 10/195 (5%)

Query: 58 RVGGRIAERLVDVGQHVDQGAVLARIDPQEQQSDLRSAQADLDAARAQLTQSAAAFERQK 117
+ RL D + + A+ A+ EQ++ A +L ++QL Q + K
Sbjct: 228 NLSRVEKSRLDDFSSLLHKQAI-AKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAK 286

Query: 118 TLLAQGFTTRRDYDAADQALKVAQGSVDAAQSAFANAQQNLSFTELKAGAPGVITARQVE 177
T + D+ L+ ++ A ++ + ++A + +V
Sbjct: 287 EEYQL-VTQLFKNEILDK-LRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVH 344

Query: 178 T-GQVVQAAQTVFTVAEDGDR-DAVFNVQETLVARTPASPAVTITLLSDPQVRA---TGK 232
T G VV A+T+ + + D + VQ + I + + P R GK
Sbjct: 345 TEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGK 404

Query: 233 VREISP--VVDQASG 245
V+ I+ + DQ G
Sbjct: 405 VKNINLDAIEDQRLG 419



Score = 43.7 bits (103), Expect = 7e-07
Identities = 17/106 (16%), Positives = 35/106 (33%)

Query: 59 VGGRIAERLVDVGQHVDQGAVLARIDPQEQQSDLRSAQADLDAARAQLTQSAAAFERQKT 118
+ E +V G+ V +G VL ++ ++D Q+ L AR + T+ +
Sbjct: 103 ENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIEL 162

Query: 119 LLAQGFTTRRDYDAADQALKVAQGSVDAAQSAFANAQQNLSFTELK 164
+ + + + + F+ Q EL
Sbjct: 163 NKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELN 208


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
mlr5526   HELNAPAPROT  33     2e-04    Helicobacter neutrophil-activating protein A family ...
		>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature.

Length = 153

Score = 33.3 bits (76), Expect = 2e-04
Identities = 16/93 (17%), Positives = 33/93 (35%), Gaps = 8/93 (8%)

Query: 54 HADRLIARIIFLEGHPN--------LQSVAPLRIGQNVKEVLESDLAGEYDARTAYKRSR 105
D + R++ + G P S+ + E++++ + + K
Sbjct: 60 TVDTIAERLLAIGGQPVATVKEYTEHASITDGGNETSASEMVQALVNDYKQISSESKFVI 119

Query: 106 EICHEVGDYVTMKLFEDLLADEEGHIDFLETQL 138
+ E D T LF L+ + E + L + L
Sbjct: 120 GLAEENQDNATADLFVGLIEEVEKQVWMLSSYL 152


97  mlr5594  mll5605  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mlr5594   0       13       -1.307224  type IV prepilin peptidase, cpaA
mlr5595   0       12       -1.418951  pilus assembly protein cpaB
mlr5597   -1      12       -1.427886  exporter protein, cpaC
mlr5598   -2      11       -0.779631  pilus assembly protein cpaD
mlr5600   -1      11       -0.084681  pilus assembly protein cpaE
mlr5602   -1      13       0.226285   secretory protein kinase, cpaF
mlr5603   -2      11       0.524635   hypothetical protein
mlr5604   -1      8        0.475951   hypothetical protein
mll5605   -1      10       0.773204   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr5594   PREPILNPTASE  41     7e-07    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 41.3 bits (97), Expect = 7e-07
Identities = 41/167 (24%), Positives = 71/167 (42%), Gaps = 26/167 (15%)

Query: 1 MAPMLEALIFVVFPFCMLFAAISDMLSMTIANRVSV---LLVVVFALVAPLTGMEWAAYG 57
+AP L ++ + ++ D+ M + +++++ ++F L+ + A G
Sbjct: 128 LAPGWGTLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGDAVIG 187

Query: 58 WHFAAGALVLAVTFGLFAM----GGMGGGDAKLLAATAVWMGLNIHLVEYLVVSTLIGGL 113
AG LVL + F + GMG GD KLLAA W+G L L++S+L+G
Sbjct: 188 --AMAGYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQA-LPIVLLLSSLVGAF 244

Query: 114 LTIAILLYRKSPLAVITGRNPFLRHFAEESVGIPYGIALGLGGLLTY 160
+ I ++L R S IP+G L + G +
Sbjct: 245 MGIGLILLRNHHQ----------------SKPIPFGPYLAIAGWIAL 275


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr5597   BCTERIALGSPD  116    6e-30    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 116 bits (292), Expect = 6e-30
Identities = 65/283 (22%), Positives = 114/283 (40%), Gaps = 32/283 (11%)

Query: 155 NPDQERRVSKIVNLLQIIGDDQVTLKVTVAEVSRSVMKQLGVNM------VGNGGSNGIS 208
PD + +++ L I QV ++ +AEV + LG+ + ++G+
Sbjct: 326 APDVMNDLERVIAQLDI-RRPQVLVEAIIAEVQDADGLNLGIQWANKNAGMTQFTNSGLP 384

Query: 209 YG---ALSDNFTGLGKQLSHSG----------FNIGNSALMAYINAMEQSGVMKTLAEPT 255
A ++ + G S + A+ S LA P+
Sbjct: 385 ISTAIAGANQYNKDGTVSSSLASALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPS 444

Query: 256 LTAVSGEKATFKVGGEYNLLTGVSQNVSSDNQTGLTTYTINKIEYGIGLEFQPVVLSPGR 315
+ + +ATF VG E +LTG SQ S DN T+ + GI L+ +P +
Sbjct: 445 IVTLDNMEATFNVGQEVPVLTG-SQTTSGDN----IFNTVERKTVGIKLKVKPQINEGDS 499

Query: 316 ISLKVRTSVSEPTTEGSVALSNGVTSPGANMLSLRKRLADTTVELPSGGSMMIAGLVRDD 375
+ L++ VS + TS + R + V + SG ++++ GL+
Sbjct: 500 VLLEIEQEVSSVAD------AASSTSSDLG-ATFNTRTVNNAVLVGSGETVVVGGLLDKS 552

Query: 376 VRQAVNGLPGLTKIPVLGALFRSRDFVRNESELVIIITPYLAK 418
V + +P L IPV+GALFRS ++ L++ I P + +
Sbjct: 553 VSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIR 595


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr5602   PF03544       32     0.005    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 31.9 bits (72), Expect = 0.005
Identities = 14/59 (23%), Positives = 20/59 (33%)

Query: 20 PAPAPAVPPPAGSTDTAVLARPAAPMQPSAAVAPPARRAVEAPPIAPEARRPQREKSET 78
APA PP A + P +P A +E P P+ + +K E
Sbjct: 55 VAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQ 113



Score = 29.9 bits (67), Expect = 0.020
Identities = 16/61 (26%), Positives = 20/61 (32%), Gaps = 7/61 (11%)

Query: 22 PAPAVPPPAGSTDTAVLARPAAPMQPSAAVAPPARRAV-------EAPPIAPEARRPQRE 74
PAPA P A L P A P V P EAP + + + +
Sbjct: 44 PAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKP 103

Query: 75 K 75
K
Sbjct: 104 K 104


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr5603   BCTERIALGSPF  28     0.049    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 28.3 bits (63), Expect = 0.049
Identities = 31/150 (20%), Positives = 61/150 (40%), Gaps = 15/150 (10%)

Query: 150 FVSFRRARRVKAF-LNEFPNALDIIVRAVKSGLPLNDAVRLIANESPEP-VKAEFRRIVD 207
+S RR R+ L L +V A +PL +A+ +A +S +P + +
Sbjct: 56 GLSLRRKIRLSTSDLALLTRQLATLVAA---SMPLEEALDAVAKQSEKPHLSQLMAAVRS 112

Query: 208 SQQMGISIPDATLRMPETMPCTEASFFGIVIQI--QSQAGGNLSEALGNLSRVLRDRKKM 265
G S+ DA P SF + + + G+L L L+ R++M
Sbjct: 113 KVMEGHSLADAMKCFP-------GSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQM 165

Query: 266 KAKV-QALSMEAKASAAIIGALPFIVAFLV 294
++++ QA+ + I + +++ +V
Sbjct: 166 RSRIQQAMIYPCVLTVVAIAVVSILLSVVV 195


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll5605   SYCDCHAPRONE  32     0.001    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 32.2 bits (73), Expect = 0.001
Identities = 20/134 (14%), Positives = 48/134 (35%), Gaps = 7/134 (5%)

Query: 73 PNDKRIATNFAAALQMDGDADQSLAVMRKLAIAYPKDRDVLAAYGKALAANSQFEAALDA 132
+ + + L+ G +A++ +++ + L + + ++E A
Sbjct: 6 TDTQEYQLAMESFLKGGGT----IAMLNEIS---SDTLEQLYSLAFNQYQSGKYEDAHKV 58

Query: 133 VRRAQTPEYPDWKLVSAEAAILDQLGQKDDARQLYRKALELKPNEPSVLSNLGMSYVLEG 192
+ ++ D + A +GQ D A Y + EP + + +G
Sbjct: 59 FQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKG 118

Query: 193 DLRTAETYMRSAAQ 206
+L AE+ + A +
Sbjct: 119 ELAEAESGLFLAQE 132


98  mlr5684  mll5701  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
mlr5684        0       10   2.207459  two-component system response regulator
mll5686        1        9   2.081554  transmembrane efflux protein
mlr5687        0        9   1.811038  transcriptional regulator
mll5688        1        9   1.563372  hypothetical protein
mll5689        1       11   1.213633  hypothetical protein
mlr5690        0       11   1.248784  transcriptional regulator
mll5691        0       14   1.444236  two-component sensor histidine kinase
mlr5692        0       11   1.366152  large-conductance mechanosensitive channel
mlr5693       -1       11   1.929482  aspartate aminotransferase
mll5695        0       11   1.635365  5-aminolevulinate synthase
mlr5697        0       12   1.171650  UDP-galactose 4-epimerase
mlr5698        1       13   0.838382  D-amino acid dehydrogenase small subunit
mlr5700        3       13   0.694274  hypothetical protein
mll5701        1       16   0.101703  L-fuculokinase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr5684   HTHFIS        71     1e-16    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 71.4 bits (175), Expect = 1e-16
Identities = 33/160 (20%), Positives = 64/160 (40%), Gaps = 10/160 (6%)

Query: 9 IADDHPLFRGALREALAGIGNVAAIHEAGDFESAKALVVANEDVDLVLLDLSMPGASGLS 68
+ADD R L +AL+ G + + + + A D DLV+ D+ MP +
Sbjct: 8 VADDDAAIRTVLNQALSRAG--YDVRITSNAATLWRWI-AAGDGDLVVTDVVMPDENAFD 64

Query: 69 GLISLRGIHPAVPLIVVSAHDDPATIRRALDLGASGFISKSASMEEIRNAVQLVL----- 123
L ++ P +P++V+SA + T +A + GA ++ K + E+ + L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 124 --AGDIAAPVGVDLGVERDPEISDLIKRLQALTPQQTRVL 161
+ V R + ++ + L L ++
Sbjct: 125 RPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLM 164


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll5686   TCRTETB       107    3e-27    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 107 bits (269), Expect = 3e-27
Identities = 86/394 (21%), Positives = 157/394 (39%), Gaps = 17/394 (4%)

Query: 34 LPTLEQELHADFRQLQWVMNAYTIAVTTVLMAVGTIADRYGRKRVFLISIAAFGLTSLIC 93
LP + + + WV A+ + + G ++D+ G KR+ L I S+I
Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96

Query: 94 GLA-DDVSTLIVARFLQGLSGGAVLICQLAVLSHEFREGRERAVAWGWWGVIFGVGLGFG 152
+ S LI+ARF+QG +G A + V+ + R A+G G I +G G G
Sbjct: 97 FVGHSFFSLLIMARFIQG-AGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVG 155

Query: 153 PIIGGGIVAASSWEWVFLIHGPAAAVAFVLAWTGVHESKDPEAGKLDLAGIVTLSPSVFC 212
P IGG I W ++ LI P + V + + + G D+ GI+ +S +
Sbjct: 156 PAIGGMIAHYIHWSYLLLI--PMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVF 213

Query: 213 LVFYITQGPDLGFASPVALTILGVSIASFIAFLIAERISKRPMFDFSVFRIRPFSGAIVG 272
+ F + +++ L VS+ SF+ F+ R P D + + PF ++
Sbjct: 214 FML---------FTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLC 264

Query: 273 SAAMNLSYWPFMIYLPIWFHAGLGYDSISTG-LALLAYTLPTLVMPPLAERLSLRYQPGI 331
+ + F+ +P + G + + T+ ++ + L R P
Sbjct: 265 GGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLY 324

Query: 332 IIPAGLAVIGVGFILMKFGSAAARPDWLTMLPGCLIAGAGLGITNTPVTNTTTGSVSSDR 391
++ G+ + V F+ F W + + G GL T T ++ + S+
Sbjct: 325 VLNIGVTFLSVSFLTASFLLETTS--WFMTIIIVFVLG-GLSFTKTVISTIVSSSLKQQE 381

Query: 392 AGMASGIDMSARMVSLAVNIAVMGFILAGGVLAH 425
AG + +S IA++G +L+ +L
Sbjct: 382 AGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQ 415


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll5688   TCRTETA       80     3e-18    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 79.9 bits (197), Expect = 3e-18
Identities = 85/387 (21%), Positives = 142/387 (36%), Gaps = 19/387 (4%)

Query: 55 RPALLGLLLFFCAGFADGALMPFFP--LWASSEAGIPVGAIGLLFGCYAGGELLAAPLIG 112
RP ++ L G +MP P L + G+L YA + AP++G
Sbjct: 5 RPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLG 64

Query: 113 GIADRIGRRPVLIVSSIGVGAGFLGLFFVHGVAITAIVLLATGMCESVLHPTILTAIADV 172
++DR GRRPVL+VS G + + + + I + G+ + IAD+
Sbjct: 65 ALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATG-AVAGAYIADI 123

Query: 173 TPPSAHPRWFSLARVSSSAGQILGPACGALLALVSLRSAFLAGGTMLVLGGIVMLFALNE 232
T R F G + GP G L+ S + F A + L + F L E
Sbjct: 124 TDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPE 183

Query: 233 TIGIGRAAGDDLPDGEDDEEEGLSALLPAFRDGRLAKLLLWV-VLFEVAGNWIEAVIPLY 291
+ R E A R + L+ V + ++ G A+ ++
Sbjct: 184 SHKGER-------RPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIF 236

Query: 292 AQDAGTLTPSGVGALFAYAAALTVGLQMLVSRMVESRSALWLTVGAGL-ATIFAFALLA- 349
+D + +G A L Q +++ V +R + G+ A + LLA
Sbjct: 237 GEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAF 296

Query: 350 ASPAMVALIGAVSLCSIAQMLVGPLVPTAVNALAPPARRASYMAASSVAVDLKDSLGPSI 409
A+ +A V L S + P + ++ R+ + + L +GP +
Sbjct: 297 ATRGWMAFPIMVLLASGGIGM--PALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLL 354

Query: 410 GTALYALAPR----LPWIAGIPLVAIA 432
TA+YA + WIAG L +
Sbjct: 355 FTAIYAASITTWNGWAWIAGAALYLLC 381


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr5690   HTHTETR       66     1e-15    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 66.2 bits (161), Expect = 1e-15
Identities = 25/142 (17%), Positives = 55/142 (38%), Gaps = 5/142 (3%)

Query: 14 QPKQARATDLVAAILQAAVQVLETEGAQRFTTTRVAEKAGVSVGSLYQYFPNKAALLFRL 73
+ + A + IL A+++ +G + +A+ AGV+ G++Y +F +K+ L +
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 74 QSDEWQQTTDLLCRILQEVEKPPLERLRILVHAFVRSECEEAAVRGALDDAAPLYRDAPE 133
+L + PL LR ++ + S E R + ++
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLM---EIIFHKCEF 119

Query: 134 AQEA--RASAERTVEIFMRETL 153
E A+R + + + +
Sbjct: 120 VGEMAVVQQAQRNLCLESYDRI 141


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll5691   HTHFIS        55     1e-09    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 55.2 bits (133), Expect = 1e-09
Identities = 25/115 (21%), Positives = 45/115 (39%), Gaps = 4/115 (3%)

Query: 1049 HVLCIDNDARILEGMRLLLEGWGCKVDTVSGSRDLENA-ALHRPDIVLADYHLDGETGLD 1107
+L D+DA I + L G V S + L A D+V+ D + E D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 1108 IIARLRATHGDDLAAVLVTADRSNEVRAAAAGLDIAVINKPLKPAVLRSMMARVR 1162
++ R++ DL ++++A N A + + KP L ++ +
Sbjct: 65 LLPRIKK-ARPDLPVLVMSAQ--NTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr5692   MECHCHANNEL   140    1e-45    Bacterial mechano-sensitive ion channel signature.
		>MECHCHANNEL#Bacterial mechano-sensitive ion channel signature.

Length = 136

Score = 140 bits (354), Expect = 1e-45
Identities = 70/140 (50%), Positives = 93/140 (66%), Gaps = 10/140 (7%)

Query: 18 MLKEFQEFISKGNVMDLAVGVIIGAAFGKIVTSLVDDVIMPIFGAIFGGLDFNNYYIGLS 77
++KEF+EF +GNV+DLAVGVIIGAAFGKIV+SLV D+IMP G + GG+DF + + L
Sbjct: 3 IIKEFREFAMRGNVVDLAVGVIIGAAFGKIVSSLVADIIMPPLGLLIGGIDFKQFAVTL- 61

Query: 78 SAVNATSLAEAKKQGAVFAYGSFITAVLNFLILAFIIFLMVKAVNNLRRRLEREKPAAPA 137
A+ V YG FI V +FLI+AF IF+ +K +N L R ++E+PAA
Sbjct: 62 ------RDAQGDIPAVVMHYGVFIQNVFDFLIVAFAIFMAIKLINKLNR--KKEEPAAAP 113

Query: 138 APPPADVALLTEIRDLLAKR 157
A P + LLTEIRDLL ++
Sbjct: 114 A-PTKEEVLLTEIRDLLKEQ 132


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr5697   NUCEPIMERASE  165    1e-50    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 165 bits (420), Expect = 1e-50
Identities = 80/332 (24%), Positives = 137/332 (41%), Gaps = 41/332 (12%)

Query: 3 MTVLVTGGAGYIGSHMVWELLDAGERVVVLDRLSTGF---------EWAVAPEAKLVVGD 53
M LVTG AG+IG H+ LL+AG +VV +D L+ + E P + D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 54 VADKELVGSIIRDNHVDAIIHFAGSIVVPESVADPLAYYENNTSKTRTLIETAVREGVPH 113
+AD+E + + H + + + V S+ +P AY ++N + ++E + H
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 114 FIFSSTAAVYGGAGLEPVREDARLA-PESPYGLSKLMSEWMLRDAGLAHDIRYTALRYFN 172
+++S+++VYG P D + P S Y +K +E M + + T LR+F
Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFFT 180

Query: 173 VAGADPKGRTGQSTPGATHLIKVACETALGKRPFMQVFGTDYPTPDGTCMRDYIHVSDLA 232
V G P GR + T + GK V+ G RD+ ++ D+A
Sbjct: 181 VYG--PWGRPDMALFKFTKAML------EGKSI--DVYN------YGKMKRDFTYIDDIA 224

Query: 233 AAHRLALQRL---------RAGGTSL------VANCGYSHGYSVLEVIDSVRRAFGRDFE 277
A + G + V N G S +++ I ++ A G + +
Sbjct: 225 EAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAK 284

Query: 278 VKMGDRRPGDAAAVVANSDLARAELGWTPQRD 309
M +PGD A++ +G+TP+
Sbjct: 285 KNMLPLQPGDVLETSADTKALYEVIGFTPETT 316


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr5700   ALARACEMASE   47     1e-07    Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 46.7 bits (111), Expect = 1e-07
Identities = 69/345 (20%), Positives = 129/345 (37%), Gaps = 46/345 (13%)

Query: 5 QVAIDLGRIERNARTIVDRCALSGIKVFGVTKGMC---GMPQVARAMLRGGVAGIAESRF 61
Q ++DL +++N + R A + +V+ V K G+ ++ A+ +
Sbjct: 6 QASLDLQALKQNLSIV--RQAATHARVWSVVKANAYGHGIERIWSAIGATDGFALL--NL 61

Query: 62 ENIRRLRDSGINAPIMLLRSPPMARIEEVVRTVDISLQSELATIREISRIAE-RMGRVHD 120
E LR+ G PI++L +++ L + + + ++ + R+ D
Sbjct: 62 EEAITLRERGWKGPILMLEG--FFHAQDLEIYDQHRLTTCVHSNWQLKALQNARLKAPLD 119

Query: 121 IMLMIDLGDLREGIWPNDLIPTVEQILQFKGVRIAGIGTNLGCFGAIMPTPENLGQLVAH 180
I L ++ G R G P+ ++ +Q+ V + ++ E+
Sbjct: 120 IYLKVNSGMNRLGFQPDRVLTVWQQLRAMANVGEMTLMSHF-------AEAEHPD----G 168

Query: 181 AYKTERLSGKSLDWISGG---ASSSLTLLLEGRLPAGINNLRVGEAILQG--GVETFRDV 235
++ + + ++S+ TL A + +R G IL G +RD+
Sbjct: 169 ISGAMARIEQAAEGLECRRSLSNSAATLWHPE---AHFDWVRPG-IILYGASPSGQWRDI 224

Query: 236 PWAELEPDACRLTSDIIEVK-LKPSRPIGESG-YDAFGNQP--VFP-DEGDRL-RAIANI 289
L P L+S+II V+ LK +G G Y A Q + D R
Sbjct: 225 ANTGLRP-VMTLSSEIIGVQTLKAGERVGYGGRYTARDEQRIGIVAAGYADGYPRHAPT- 282

Query: 290 GREDVLVEG-LTPIAKGIRVLGASSDHLLLDVQDADPRPAVGDRV 333
VLV+G T + S D L +D+ P+ +G V
Sbjct: 283 -GTPVLVDGVRTMTVGTV-----SMDMLAVDL-TPCPQAGIGTPV 320


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll5701   PF03309       32     0.004    Bvg accessory factor
		>PF03309#Bvg accessory factor

Length = 271

Score = 32.0 bits (73), Expect = 0.004
Identities = 24/127 (18%), Positives = 41/127 (32%), Gaps = 30/127 (23%)

Query: 7 IAVIDIGKTNAKVALVDLATSSE--VALRRMANAPVRQAPYPHHDVESLWTFILDSLAGL 64
+ ID+ T+ V L+ + V R+ P A D +L +D L G
Sbjct: 2 LLAIDVRNTHTVVGLISGSGDHAKVVQQWRIRTEPEVTA-----DELALT---IDGLIGD 53

Query: 65 NCEQRIDAISITTHGATGALVDASGELVLPVLDYEFDGPNGLAADYDAIRPPFAETGTPR 124
+ E R+ S + +P + +E + Y P P
Sbjct: 54 DAE-RLTGASGLS--------------TVPSVLHEVR---VMLEQYWPNVPHVL--IEPG 93

Query: 125 LPLGLNL 131
+ G+ L
Sbjct: 94 VRTGIPL 100


99  mlr6334  mlr6346  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
mlr6334        3       37  -7.047699  two-component response regulator
mlr6335        4       35  -6.177850  type II secretion system protein
mlr8762        4       37  -6.331619  hypothetical protein
mll6337        3       38  -5.829170  nodulation protein NOLX
mll6338        1       40  -4.646882  nodulation protein NolW
mlr8763        1       40  -4.338349  hypothetical protein
mlr6339        1       39  -4.352931  nodulation protein NolT
mlr8764        0       42  -5.501557  nodulation protein NolU
mlr6341        0       41  -5.722071  nodulation protein NolV
mlr6342        1       42  -6.843379  ATP synthase in type III secretion system, hrcN
mlr6343        2       43  -7.717850  hypothetical protein
mlr6344        2       43  -7.930227  translocation protein in type III secretion
mlr8766        2       40  -8.155515  type III secretion system protein
msr8694        2       38  -7.398771  hypothetical protein
mlr6345        1       36  -6.714445  translocation protein in type III secretion
mlr6346        1       34  -6.161740  translocation protein in type III secretion
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6334   HTHFIS        74     2e-17    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 73.7 bits (181), Expect = 2e-17
Identities = 28/138 (20%), Positives = 55/138 (39%)

Query: 2 RTLFVDHHADLTRAVGVALGDSGFAVDVVPTLEQASSAFSCASYEILLLELVLPDGDGLD 61
L D A + + AL +G+ V + + ++++ ++V+PD + D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 WLKQLRGEGHSVPALILSDVDDLEKRIAIFNGGADDFLLKPVYTNELIARMRAVLRRSTQ 121
L +++ +P L++S + I GA D+L KP ELI + L +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 122 MTAPIIVFGNLHFDPIGR 139
+ + +GR
Sbjct: 125 RPSKLEDDSQDGMPLVGR 142


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6335   BCTERIALGSPD  130    9e-35    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 130 bits (327), Expect = 9e-35
Identities = 58/264 (21%), Positives = 115/264 (43%), Gaps = 27/264 (10%)

Query: 153 QVNLSVRVAEVSRSAMKALGVNLS-AFGQIDNFRVGLLSGGGTGSGAAQGGGTAGIGFNN 211
QV + +AEV + LG+ + + F SG + A G +
Sbjct: 346 QVLVEAIIAEVQDADGLNLGIQWANKNAGMTQFTN---SGLPISTAIAGANQYNKDGTVS 402

Query: 212 GAV-----------------NIGAVLDALAKEHIASVLAEPNLTAMSGETASFLAGGEFP 254
++ N +L AL+ +LA P++ + A+F G E P
Sbjct: 403 SSLASALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVP 462

Query: 255 IPVLQ-----ENKQVSVEFRHFGVSLEFVPTVLNNNRINIHVKPEVSELSSQGAVQINGI 309
+ +N +VE + G+ L+ P + + + + ++ EVS ++ A +
Sbjct: 463 VLTGSQTTSGDNIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSVA-DAASSTSSD 521

Query: 310 SVPAVSTRRADTVVELASGQSFAIGGLIRRNVNNNVSAFPWLGEMPILGALFRSSSFQKE 369
+TR + V + SG++ +GGL+ ++V++ P LG++P++GALFRS+S +
Sbjct: 522 LGATFNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVS 581

Query: 370 ESELIILVTPYIVKPGSSPNQMSA 393
+ L++ + P +++ Q S+
Sbjct: 582 KRNLMLFIRPTVIRDRDEYRQASS 605


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll6338   TYPE3OMGPROT  124    4e-35    Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature.

Length = 607

Score = 124 bits (312), Expect = 4e-35
Identities = 51/155 (32%), Positives = 76/155 (49%), Gaps = 3/155 (1%)

Query: 3 LLCAGFFLSAGTNGTLGVPLSLSKTPYRYTVLDQDISEALQQFGNNLNIRVNISAEVKGR 62
L LS + + L PY Y + + + L FG N + V +S ++ +
Sbjct: 13 LTGTLLLLS---SYSWAQELDWLPIPYVYVAKGESLRDLLTDFGANYDATVVVSDKINDK 69

Query: 63 IRGSMPDLPPREFLDRLANMYGLQWYYDGLVLYVSAAKESQTRMLVLTSIRFDTFKGALD 122
+ G P++FL +A++Y L WYYDG VLY+ E +R++ L K AL
Sbjct: 70 VSGQFEHDNPQDFLQHIASLYNLVWYYDGNVLYIFKNSEVASRLIRLQESEAAELKQALQ 129

Query: 123 KLEISDDRYVVRPAPGDGLVLVSGPPRFTALVEQT 157
+ I + R+ RP + LV VSGPPR+ LVEQT
Sbjct: 130 RSGIWEPRFGWRPDASNRLVYVSGPPRYLELVEQT 164


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6339   FLGMRINGFLIF  82     9e-20    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 81.9 bits (202), Expect = 9e-20
Identities = 40/165 (24%), Positives = 70/165 (42%), Gaps = 7/165 (4%)

Query: 28 LYTQLQEREANEMLALLMDNGVHAVRVAAKDGTSTVQVDEKLLAYSIDLLNGKGLPRQSF 87
L++ L +++ ++A L + +G+ ++V + L +GLP+
Sbjct: 53 LFSNLSDQDGGAIVAQLTQMNIPYR---FANGSGAIEVPADKVHELRLRLAQQGLPKGG- 108

Query: 88 KNLG-EIFQGSGLIASPTEERARYVYALSEELSRTISDIDGVFSVRVHVVLPHNDLLRAG 146
+G E+ S E+ Y AL EL+RTI + V S RVH+ +P L
Sbjct: 109 -AVGFELLDQEKFGISQFSEQVNYQRALEGELARTIETLGPVKSARVHLAMPKPSLFVRE 167

Query: 147 ATPSSASVFIRHDAKADLS-VLLPKIKMLVADSIEGLSYDKVEVV 190
SASV + + L + + LV+ ++ GL V +V
Sbjct: 168 QKSPSASVTVTLEPGRALDEGQISAVVHLVSSAVAGLPPGNVTLV 212


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6341   FLGFLIH       27     0.045    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 27.1 bits (59), Expect = 0.045
Identities = 38/179 (21%), Positives = 73/179 (40%), Gaps = 33/179 (18%)

Query: 40 ERHQQHVRSWARAAYQRELARGHTEGLNAGAEE---------------MAALISQAVAEV 84
+ H+Q ++ Q+ +G+ EGL G E+ M L+S+ +
Sbjct: 50 QAHEQGYQAGIAEGRQQGHKQGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTL 109

Query: 85 ARRKAVLEQQLPQLVLEILSELLG---AFDPGELLVMAVRHAIERQYSGAEVCLHVYPTQ 141
+V+ +L Q+ LE +++G D L+ + + + L V+P
Sbjct: 110 DALDSVIASRLMQMALEAARQVIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDD 169

Query: 142 V----DMLAREFA--GWDGQDGRPRVRIKPDPTLSPRRCVLWSEYGNVDLGLDAQMRAL 194
+ DML + GW R++ DPTL P C + ++ G++D + + + L
Sbjct: 170 LQRVDDMLGATLSLHGW---------RLRGDPTLHPGGCKVSADEGDLDASVATRWQEL 219


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6344   TYPE3OMOPROT  131    7e-38    Type III secretion system outer membrane O protein ...
		>TYPE3OMOPROT#Type III secretion system outer membrane O protein family signature.

Length = 303

Score = 131 bits (331), Expect = 7e-38
Identities = 46/182 (25%), Positives = 79/182 (43%), Gaps = 19/182 (10%)

Query: 172 FRALGELFGQLPRQPRGLLSDLPIVVAGEIGTLHVPAAILRKACAGDALLPDLAPFGRGE 231
F L EL +P+ L L V IG+ ++L + GD LL + R E
Sbjct: 131 FEHLPELPAVGGGRPKMLRWPLRFV----IGSSDTQRSLLGRIGIGDVLLIRTS---RAE 183

Query: 232 IALSLGQLWASADLEGDQLVLHGPFRPRSYSLENAHMTQLGSQLGPTE---DLDDVEIML 288
+ +L +EG +V +L+ H+ + + E L+ + + L
Sbjct: 184 VYCYAKKLGHFNRVEGGIIV---------ETLDIQHIEEENNTTETAETLPGLNQLPVKL 234

Query: 289 VFECGRWPIPLGELRSAGEGHIFELGRPIQDPVDILANGQCIGRGDIVRIGDTLGIRLRG 348
F R + L EL + G+ + L + V+I+ANG +G G++V++ DTLG+ +
Sbjct: 235 EFVLYRKNVTLAELEAMGQQQLLSLPTNAELNVEIMANGVLLGNGELVQMNDTLGVEIHE 294

Query: 349 RL 350
L
Sbjct: 295 WL 296


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr8766   TYPE3IMPPROT  233    3e-80    Type III secretion system inner membrane P protein ...
		>TYPE3IMPPROT#Type III secretion system inner membrane P protein family signature.

Length = 224

Score = 233 bits (596), Expect = 3e-80
Identities = 84/215 (39%), Positives = 130/215 (60%), Gaps = 7/215 (3%)

Query: 9 LALLAVTAGLGLLVLVVVTTTAFVKVSVVLFLVRNALGTQTIPPNIALYAVALILTMFLS 68
++L+A+ A LL ++ + T FVK S+V +VRNALG Q IP N+ L VAL+L+MF+
Sbjct: 5 ISLIALLAFSTLLPFIIASGTCFVKFSIVFVMVRNALGLQQIPSNMTLNGVALLLSMFVM 64

Query: 69 APVVEQTYDRMTDPKLHYQTFDDWVSAAKSGSEPLRDHLKKFTNEEQRQFFLSSTEKVWP 128
P++ Y D + + G + RD+L K+++ E QFF ++ K
Sbjct: 65 WPIMHDAYVYFEDEDVTFNDISSLSKHVDEGLDGYRDYLIKYSDRELVQFFENAQLKRQY 124

Query: 129 AEM-------RAKATVDDLSILVPSFLISELKRAFEIGFLLYLPFIVIDLIVTTILMAMG 181
E + + + L+P++ +SE+K AF+IGF LYLPF+V+DL+V+++L+A+G
Sbjct: 125 GEETETVKRDKDEIEKPSIFALLPAYALSEIKSAFKIGFYLYLPFVVVDLVVSSVLLALG 184

Query: 182 MSMVSPTLISVPFKLFVFVAIDGWSKLMHGLVLSY 216
M M+SP IS P KL +FVA+DGW+ L GL+L Y
Sbjct: 185 MMMMSPVTISTPIKLVLFVALDGWTLLSKGLILQY 219


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
msr8694   TYPE3IMQPROT  56     3e-14    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 55.9 bits (135), Expect = 3e-14
Identities = 20/72 (27%), Positives = 40/72 (55%)

Query: 1 MSQSLVVFMIWILPPLIASVVVGLVIGIIQAATQIQDESLPLTVKLLVVVAVIGLFAPVL 60
+++L + +I P I + ++GL++G+ Q TQ+Q+++LP +KLL V + L +
Sbjct: 8 GNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFLLSGWY 67

Query: 61 SAPLIELTDQIF 72
L+ Q+
Sbjct: 68 GEVLLSYGRQVI 79


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6345   TYPE3IMRPROT  162    2e-51    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.

Length = 261

Score = 162 bits (413), Expect = 2e-51
Identities = 39/226 (17%), Positives = 89/226 (39%), Gaps = 7/226 (3%)

Query: 24 LGAARAIGIMMILPVFTRSQTDGLIRGCLAVGFGLPCLAHVSDALQALDPETRLIEVALL 83
R + ++ P+ + ++ LA+ + + L L
Sbjct: 18 WPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPANDVPVFSFFAL----WL 73

Query: 84 GLKEVLVGALLGTFLGIPLWGLQAAGEFIDNQRGVTNPSAPTDPATNSQASAMGVFLGIT 143
++++L+G LG + ++ AGE I Q G++ + DPA++ + + +
Sbjct: 74 AVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATF-VDPASHLNMPVLARIMDML 132

Query: 144 AIAIFVASGGLETLIGALYGSYLIWPVYKFYPTLSTQGAMEVLGLLDQIMRTALLVSGPV 203
A+ +F+ G LI L ++ P+ L++ + + I L+++ P+
Sbjct: 133 ALLLFLTFNGHLWLISLLVDTFHTLPI--GGEPLNSNAFLALTKAGSLIFLNGLMLALPL 190

Query: 204 VFFMTLIDVSFMLLRRFAPQFKLTQLSPAIKNLVFPILMVTYAGYL 249
+ + ++++ LL R APQ + + + V LM +
Sbjct: 191 ITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLI 236


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6346   TYPE3IMSPROT  310    e-106    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 310 bits (795), Expect = e-106
Identities = 96/338 (28%), Positives = 160/338 (47%), Gaps = 4/338 (1%)

Query: 5 SEEKTHAATPKKLNDARKKGQLPHSSDFVRAVGTCAGLGYLWLRGSAIEDKCREALLFVD 64
S EKT TPKK+ DARKKGQ+ S + V A L + + +L
Sbjct: 2 SGEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPA 61

Query: 65 KLQNLPFDFAVRQALVVLAELTLATVGPLLGTLVAAVLLASILANGGFVFSLEPMTPNFD 124
+ LPF A+ + + PLL + A + +AS + GF+ S E + P+
Sbjct: 62 EQSYLPFSQALSYVVDNVLLEFFYLCFPLLT-VAALMAIASHVVQYGFLISGEAIKPDIK 120

Query: 125 KINPFQGLKRLASARSMVELGKTLFKVFVLGATFSFCLLGMWKTMVYLPFCGMGCLGLVV 184
KINP +G KR+ S +S+VE K++ KV +L + G T++ LP CG+ C+ ++
Sbjct: 121 KINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLL 180

Query: 185 TGA-KLLIGIGAGALLAAGLIDLLVQRALFLREMRMTKTEVTRELKDQQGAPELKSERRR 243
+ L+ I + + D + +++E++M+K E+ RE K+ +G+PE+KS+RR+
Sbjct: 181 GQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQ 240

Query: 244 IRDESADEPPL-GVHHATLIFKG-TAILIGLRYVRGETGVPVLVCRADGERASHLLSEAR 301
E V ++++ T I IG+ Y RGET +P++ + + + A
Sbjct: 241 FHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAE 300

Query: 302 ALRLEIVDNDVLAHQLIGKTQLGRPIPMQYFEPVARAL 339
+ I+ LA L + IP + E A L
Sbjct: 301 EEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVL 338


100  mll6596  mll6621  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
mll6596       -1       14  -1.172647  hypothetical protein
mll6598       -2       15  -2.558855  hypothetical protein
mll6600       -2       14  -2.287208  5-aminolevulinate synthase
mlr6601       -1       20  -3.434246  hypothetical protein
msr6604       -1       21  -3.405519  hypothetical protein
mll6606       -1       21  -3.253296  response regulator FixJ
mll6607       -1       23  -3.704136  two-component, nitrogen fixation sensor protein
mlr6608       -1       23  -3.408083  hypothetical protein
mlr6609       -1       23  -3.364752  Mg2+ transport ATPase
mll6610       -1       31  -3.676051  hypothetical protein
mll6611       -1       28  -2.801349  hypothetical protein
mll6613       -2       24  -2.979454  hypothetical protein
msl6615       -1       20  -2.377835  hypothetical protein
mlr6616       -1       21  -1.992653  hypothetical protein
mlr6617       -2       19  -1.714576  two-component system, regulatory protein
mlr6618       -2       19  -1.828623  two-component sensor protein
mll6619       -1       18  -1.580656  hypothetical protein
mll6620        0       18  -0.995601  thiamine biosynthesis lipoprotein
mll6621        0       13  -0.446194  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll6596   PYOCINKILLER  28     0.042    Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 27.8 bits (61), Expect = 0.042
Identities = 38/165 (23%), Positives = 60/165 (36%), Gaps = 27/165 (16%)

Query: 25 VVLALPRGGIPV---AAEVAKALGAPLDVLVVRKVGAPGNAELAVAAVVDGNP-----PD 76
V A RG I V AA +A+A+ + VL AP + A++ + D
Sbjct: 259 VATAAGRGLIQVAQGAASLAQAISDAIAVLGRVLASAPSVMAVGFASLTYSSRTAEQWQD 318

Query: 77 VVLNREIVEAYALDDAELA-------SLVALERPELE---RRRLAYKGSRRSLSVAGKTV 126
+ A +D A+L + VA ++ R +G+ +LSV
Sbjct: 319 QT-PDSVRYALGMDAAKLGLPPSVNLNAVAKASGTVDLPMRLTNEARGNTTTLSVVSTDG 377

Query: 127 ILVDDGA--------ATGTTMKVAIRALRHRAPREIIVAVPVSPP 163
+ V AT +V + + AP I+ P SPP
Sbjct: 378 VSVPKAVPVRMAAYNATTGLYEVTVPSTTAEAPPLILTWTPASPP 422


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll6606   HTHFIS        118    2e-33    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 118 bits (297), Expect = 2e-33
Identities = 39/144 (27%), Positives = 66/144 (45%)

Query: 9 VVDDDVDVRKSLGFLLATADFAVRLYESATAFLSTATGKLEGCIVTDVRMPGIDGIEFLR 68
V DDD +R L L+ A + VR+ +A +VTDV MP + + L
Sbjct: 8 VADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLP 67

Query: 69 QLRASGHTIPVIVMTGHADVALAVQAMKEGAADFIEKPFDDEMLIEAIRSALANRNQAHA 128
+++ + +PV+VM+ A++A ++GA D++ KPFD LI I ALA + +
Sbjct: 68 RIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRRPS 127

Query: 129 AHPQSADIRDRLSTLSERERQVLD 152
+ L S +++
Sbjct: 128 KLEDDSQDGMPLVGRSAAMQEIYR 151


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll6607   PF06580       32     0.007    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 31.8 bits (72), Expect = 0.007
Identities = 30/158 (18%), Positives = 60/158 (37%), Gaps = 28/158 (17%)

Query: 341 MLRDAVERAAEQALRAGDVIRHLRDFVARGESERQVERLPVLIEE----AASLALVGARE 396
++ + +A E +++R+ R + RQV L +E + L L +
Sbjct: 185 LILEDPTKAREMLTSLSELMRY----SLRYSNARQV----SLADELTVVDSYLQLASIQF 236

Query: 397 INLL-VSYKLDPAAELVLTDRIQIQQVLLNLMRNAVEAMQGSPRRELKVTTVARDDGMAE 455
+ L +++PA V + +Q ++ N +++ + + + LK T +D+G
Sbjct: 237 EDRLQFENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGT---KDNGTVT 293

Query: 456 VSVIDTGPGLAPEVSAQLFQPFVTTKKHGMGVGLSICR 493
+ V +TG K G GL R
Sbjct: 294 LEVENTGSLALKNT------------KESTGTGLQNVR 319


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6608   SECA          29     0.023    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 29.1 bits (65), Expect = 0.023
Identities = 28/111 (25%), Positives = 38/111 (34%), Gaps = 17/111 (15%)

Query: 93 VCSPTRFLSVSARAADLIVTGQAGDNVFRAVDVGSLTLGAGRPVLVAATNVE-------- 144
V PT + DL+ +A D+ T G+PVLV ++E
Sbjct: 410 VVVPTNRPMIRKDLPDLVYMTEAEKIQAIIEDIKERTA-KGQPVLVGTISIEKSELVSNE 468

Query: 145 ---HVLAKTVLVAWKDTREARRAMADALPFLAKASEVVIATIDTERGESIR 192
+ VL A EA +A A + V IAT RG I
Sbjct: 469 LTKAGIKHNVLNAKFHANEA-AIVAQA----GYPAAVTIATNMAGRGTDIV 514


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll6613   cloacin       32     0.001    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 31.6 bits (71), Expect = 0.001
Identities = 21/70 (30%), Positives = 33/70 (47%), Gaps = 3/70 (4%)

Query: 59 GGSAVTRRSGAGTISTFVALAAGMFAMGTWFFGGLALDHAEAAGFSSDVAEIHESLGGIT 118
G S +G + +A G A+ T GGLA+ + A S+ +A+I +L G
Sbjct: 69 GNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSIS-AGALSAAIADIMAALKG-- 125

Query: 119 AFAFLIWGLV 128
F F +WG+
Sbjct: 126 PFKFGLWGVA 135


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6617   HTHFIS        80     1e-19    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 80.3 bits (198), Expect = 1e-19
Identities = 30/129 (23%), Positives = 56/129 (43%)

Query: 2 KILLAEDEPRIAADVATVLKASGMAVDTVRDGEAAWFAGDVENYDAAILDLGLPKLDGLT 61
IL+A+D+ I + L +G V + W + D + D+ +P +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 ILKRWRANARRFPVLILTARGMWTERVDGINAGADDYLPKPFEMEELLARLRAILRRSTG 121
+L R + PVL+++A+ + + GA DYLPKPF++ EL+ + L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 122 QAAPVLKSG 130
+ + +
Sbjct: 125 RPSKLEDDS 133


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6618   PF06580       41     5e-06    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 41.4 bits (97), Expect = 5e-06
Identities = 19/130 (14%), Positives = 42/130 (32%), Gaps = 28/130 (21%)

Query: 325 GKQLSFNMDVAEDA-TAPIDEGDLSEILGNLVENAARYA------KSSVRVGASAAAGEV 377
+L F + + ++ LVEN ++ + + + G V
Sbjct: 237 EDRLQFENQINPAIMDVQV----PPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTV 292

Query: 378 VITVTDDGPGIPDADREFALSRGVQLDSKGGSSGLGLAIVSD-IVEAYGG--RLAMANAD 434
+ V + G L + S+G GL V + + YG ++ ++
Sbjct: 293 TLEVENTGSL--------------ALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQ 338

Query: 435 PGLVVTISLP 444
+ + +P
Sbjct: 339 GKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll6621   IGASERPTASE   31     0.005    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 31.2 bits (70), Expect = 0.005
Identities = 30/162 (18%), Positives = 46/162 (28%), Gaps = 26/162 (16%)

Query: 56 EALPADAAEAPTVQPKEPASAAA-----LPVLRTSPAPLDPRFRTRLPIARETTAGIAVP 110
E + E TV+ +E A +P + + +P + T P A
Sbjct: 1094 ETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAE--------- 1144

Query: 111 AERPNDPVKQTVVPVAPQPSPAPPAQP-------VTPAVTD--PASSTPSFAVTPAIYIP 161
R NDP P + + A QP V VT+ ++ S P P
Sbjct: 1145 PARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTP 1204

Query: 162 V---PQPRPAYPNAPVRVVRVGMKPAAHGYADGVYTGPTADA 200
P N P R ++ H +
Sbjct: 1205 ATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRST 1246


101  mll6686  mlr6692  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mll6686   3       13       -1.052724  short-chain dehydrogenase
mll6687   1       13       0.558030   hypothetical protein
mll6688   0       12       1.560337   hypothetical protein
mlr6689   1       13       1.546395   two-component sensor
mlr6690   1       15       2.375618   two-component response regulator
mlr6691   1       14       1.950026   two-component response regulator
mlr6692   2       14       1.430383   O-linked GlcNAc transferase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll6686   DHBDHDRGNASE  79     3e-19    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.
Length = 261

Score = 78.6 bits (193), Expect = 3e-19
Identities = 54/184 (29%), Positives = 93/184 (50%), Gaps = 2/184 (1%)

Query: 6 AVITGASGGIGAVYVDRLAERGYDLVLVARNGDKLTQVANRVRAKTGRKIDTLSADLANA 65
A ITGA+ GIG LA +G + V N +KL +V + ++A+ R + AD+ ++
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAE-ARHAEAFPADVRDS 69

Query: 66 SDLARVEA-FLRETPDVTLLVNNAGLGGALKLLDSDVDQMTSLISLNVTALTRLTYAIVP 124
+ + + A RE + +LVN AG+ + ++ + S+N T + + ++
Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSK 129

Query: 125 GFVARAAGTIINIASIVAINPESLNGVYGGSKAFVVAFSQNLRHELAGTGVRVQVVLPGA 184
+ R +G+I+ + S A P + Y SKA V F++ L ELA +R +V PG+
Sbjct: 130 YMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGS 189

Query: 185 TATD 188
T TD
Sbjct: 190 TETD 193
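
When post-processing this report, the per-alignment statistics lines (`Identities = ...`, `Positives = ...`, `Gaps = ...`) can be read back and sanity-checked. The following is a minimal sketch only; the helper names `parse_stats` and `recomputed_percent` are hypothetical and not part of PredictBias or BLAST. It assumes the reported percentages are the integer part of matches/length, which agrees with the lines shown in this report (for example 93/184 is reported as 50%) but is an observation, not a documented guarantee.

```python
import re

# Matches one "Name = n/total (pct%)" field of a BLAST-style statistics line.
STAT = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(line):
    """Return {field: (count, aligned_length, reported_percent)} for one line.

    Fields that are absent (ungapped alignments have no "Gaps") are omitted.
    """
    return {
        name: (int(count), int(total), int(pct))
        for name, count, total, pct in STAT.findall(line)
    }


def recomputed_percent(count, total):
    """Integer (truncated) percentage, the convention assumed for this report."""
    return 100 * count // total


# Statistics line of the mll6686 / DHBDHDRGNASE alignment above.
line = "Identities = 54/184 (29%), Positives = 93/184 (50%), Gaps = 2/184 (1%)"
stats = parse_stats(line)
for name, (count, total, pct) in stats.items():
    consistent = recomputed_percent(count, total) == pct
    print(f"{name}: {count}/{total} reported {pct}% consistent={consistent}")
```

A field whose recomputed value disagrees with the reported one would suggest a damaged line rather than a genuine statistic.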


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
mll6687   SECA  27     0.034    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 27.1 bits (60), Expect = 0.034
Identities = 20/65 (30%), Positives = 35/65 (53%), Gaps = 4/65 (6%)

Query: 89 RDVPLPAWSEGLRQA-KYPEHV-VRHLS-AMAELTKQGRYDRMTDTLRKLTGEAPTNMRD 145
R + WS+GL QA + E V +++ + +A +T Q Y R+ + L +TG A T +
Sbjct: 342 RTMQGRRWSDGLHQAVEAKEGVQIQNENQTLASITFQN-YFRLYEKLAGMTGTADTEAFE 400

Query: 146 FVKLH 150
F ++
Sbjct: 401 FSSIY 405


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
mlr6689   HTHFIS  60     1e-11    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 60.2 bits (146), Expect = 1e-11
Identities = 29/136 (21%), Positives = 56/136 (41%), Gaps = 13/136 (9%)

Query: 388 RLRVLICEDETDVATVIAALLDSEGFSSDVAPDIATAKALLQSRDYAALTLDIKLAEESG 447
+L+ +D+ + TV+ L G+ + + AT + + D + D+ + +E+
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 448 IKLFHDIRASPVNSDIAVIVISAVADEARRSLNGTAV-----GIVDWLEKPVDSGRLHAA 502
L I+ D+ V+V+SA TA+ G D+L KP D L
Sbjct: 63 FDLLPRIKK--ARPDLPVLVMSAQNTFM------TAIKASEKGAYDYLPKPFDLTELIGI 114

Query: 503 LAKIVASRNEQRPKIL 518
+ + +A + K+
Sbjct: 115 IGRALAEPKRRPSKLE 130



Score = 55.6 bits (134), Expect = 3e-10
Identities = 17/68 (25%), Positives = 28/68 (41%), Gaps = 1/68 (1%)

Query: 513 QRPKILHVEDDEGVLAVMSAGLGS-DVSIISAKTLQEARRAVAKRHFDLVILDIALPDGS 571
IL +DD + V++ L + R +A DLV+ D+ +PD +
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 572 GLDLLADL 579
DLL +
Sbjct: 62 AFDLLPRI 69


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
mlr6690   HTHFIS  78     4e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 78.3 bits (193), Expect = 4e-20
Identities = 37/125 (29%), Positives = 58/125 (46%), Gaps = 3/125 (2%)

Query: 6 ARILYVDDEDDIREIAQMSLELDPEFEVRSCSSGAAALTDAAAWHPDLILLDVMMPDMDG 65
A IL DD+ IR + +L ++VR S+ A AA DL++ DV+MPD +
Sbjct: 4 ATILVADDDAAIRTVLNQALS-RAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 66 PETLKRLAASPLTASIPVAFITARTQTHQVERYLAMGAVGVIAKPFDPLALAGEVRKLLS 125
+ L R+ +PV ++A+ + GA + KPFD L G + + L+
Sbjct: 63 FDLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 126 EHPGR 130
E R
Sbjct: 121 EPKRR 125


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
mlr6691   HTHFIS  90     2e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 90.3 bits (224), Expect = 2e-24
Identities = 37/126 (29%), Positives = 61/126 (48%), Gaps = 2/126 (1%)

Query: 5 KARVLICDDDPLLLELMEFRLRAKGYEVITAVDGAEALAKAEQHGPDIIVLDAMMPKADG 64
A +L+ DDD + ++ L GY+V + A D++V D +MP +
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 65 LEVLARLKGDPVLSDTPVVMLTARKAERDIVSALEKGADDYLVKPFIPEELLARLARLIA 124
++L R+K D PV++++A+ + A EKGA DYL KPF EL+ + R +A
Sbjct: 63 FDLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 125 RKNGKR 130
+
Sbjct: 121 EPKRRP 126


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6692   SYCDCHAPRONE  38     3e-05    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.
Length = 168

Score = 38.0 bits (88), Expect = 3e-05
Identities = 25/125 (20%), Positives = 54/125 (43%), Gaps = 7/125 (5%)

Query: 27 QTVDELYASAVKARQARHFDEAVDLLRRALALKPDNADALVQLG--FAELGRNDLAAARD 84
T+++LY+ A Q+ +++A + + L ++ + LG +G+ DLA
Sbjct: 34 DTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAI--H 91

Query: 85 AFSKALSLAPTYQDASFGLAEIEFRSGNLDAA---LPLAESVARAQPGNTDAAALVENIR 141
++S + F AE + G L A L LA+ + + + + V ++
Sbjct: 92 SYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTEFKELSTRVSSML 151

Query: 142 KAMQA 146
+A++
Sbjct: 152 EAIKL 156



Score = 31.8 bits (72), Expect = 0.004
Identities = 18/75 (24%), Positives = 31/75 (41%)

Query: 261 DVLASSPDNVEALDLDAKVALLEADYTRAGQSFQRVLAIDPRNAEALVGIGDVRRAQGDD 320
+ S D +E L A Y A + FQ + +D ++ +G+G R+A G
Sbjct: 27 MLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQY 86

Query: 321 DAARQAYREALAIEP 335
D A +Y ++
Sbjct: 87 DLAIHSYSYGAIMDI 101



Score = 31.4 bits (71), Expect = 0.005
Identities = 11/53 (20%), Positives = 23/53 (43%)

Query: 181 AGKLPEAEKVYRRALGLAPKNTDILVALGLIVGSSQRFDEAGHFFDRALAIKP 233
+GK +A KV++ L ++ + LG + ++D A H + +
Sbjct: 49 SGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDI 101


102  mlr6727  mll6733  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mlr6727   -1      14       -0.959017  transcriptional regulator
mll6729   0       11       -0.682118  hypothetical protein
mll6730   0       11       -0.352744  multidrug-efflux transport protein
mll6731   -1      12       -1.576289  multidrug efflux membrane fusion protein
mll6732   -1      13       -2.197747  hypothetical protein
mll6733   0       14       -2.566833  arginine deiminase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mlr6727   PF05043  32     0.004    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 32.2 bits (73), Expect = 0.004
Identities = 10/32 (31%), Positives = 14/32 (43%), Gaps = 3/32 (9%)

Query: 260 SIEQMTRELGISRSRLYRLFEASGGIVHYIQH 291
E + +E IS S LYR+ I I+
Sbjct: 102 QAESICKEFYISSSSLYRIISQ---INKVIKR 130


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll6730   ACRIFLAVINRP  909    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 909 bits (2350), Expect = 0.0
Identities = 419/1033 (40%), Positives = 624/1033 (60%), Gaps = 12/1033 (1%)

Query: 2 ISDLFITRTRLAIVLSIVISIAGAIAIFSLPVQQYPEITPPTVSVTAFYPGASAEVIADV 61
+++ FI R A VL+I++ +AGA+AI LPV QYP I PP VSV+A YPGA A+ + D
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 62 VGGPLETAINGVPNMMYMSSTSSNAGQYSLSVTFEVGTNPDIAQVNVQNRAQLAISQLPA 121
V +E +NG+ N+MYMSSTS +AG ++++TF+ GT+PDIAQV VQN+ QLA LP
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 122 AVTQQGVSVRSRSPDFVLGIAFYSPDSKLDVLQITNFTSTTIADALSRVSGVGEASVVGA 181
V QQG+SV S +++ F S + I+++ ++ + D LSR++GVG+ + G
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG- 179

Query: 182 SEYSMRIWLNPARMDALGITADDVAAAIQRQNIQASLGQAGAPPAREGTELQYTLVARGR 241
++Y+MRIWL+ ++ +T DV ++ QN Q + GQ G PA G +L +++A+ R
Sbjct: 180 AQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 242 LLSTDEFGDIVIRTGEAGAIVRLHDIARIELGARSYSSSASFAGHDTAMLQINQAPGANA 301
+ +EFG + +R G++VRL D+AR+ELG +Y+ A G A L I A GANA
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 302 IQTANAVRAELQRLSPTFPPGLQYQVVYDATRFVRTSLSLIVRILGEAFLIVLVVTYLFL 361
+ TA A++A+L L P FP G++ YD T FV+ S+ +V+ L EA ++V +V YLFL
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 362 QDWRATLVSALAMPVAMLGAIAVLLAFGYSINSISLLALVLAIGLVADDAILVVENVKHV 421
Q+ RATL+ +A+PV +LG A+L AFGYSIN++++ +VLAIGL+ DDAI+VVENV+ V
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 422 MEDDPTVETVAAARKAMGQITGPIISTTLVLLAVVIPTAFLSGISGQLYRQFAVTLSAAL 481
M +D A K+M QI G ++ +VL AV IP AF G +G +YRQF++T+ +A+
Sbjct: 420 MMEDKL-PPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAM 478

Query: 482 VISSVVGLTLSPALAAILLRRGQRGY---RRGPLGWFARFMNATRVGYGRLVGFLVRLWI 538
+S +V L L+PAL A LL+ + + G GWF + + Y VG ++
Sbjct: 479 ALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTG 538

Query: 539 IPLAALAACFLGAYLLFSGLPSTFLPDEDQGALFVDIQLPNAASLDRTRAIVGEVQKTL- 597
L A G +LF LPS+FLP+EDQG IQLP A+ +RT+ ++ +V
Sbjct: 539 RYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYL 598

Query: 598 -SATKGVENVISVAGFSIVQGTQSPNGAMVVAALDPWDQRNTPDLRLDAILARLRAQFST 656
+ VE+V +V GFS Q+ N M +L PW++RN + +A++ R + +
Sbjct: 599 KNEKANVESVFTVNGFSF--SGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGK 656

Query: 657 IPGANVAVFSPPAISGIGAVGGLDLRLQALQGQPPEEIAAVVRAFVTAINQAP-EIGGVA 715
I V F+ PAI +G G D L G + + + Q P + V
Sbjct: 657 IRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVR 716

Query: 716 TTFSADVPQIYVDVDRTRAEALGVSVSDIYSTIGASFGSRYVNDFTLHGRVFQVNLQADA 775
D Q ++VD+ +A+ALGVS+SDI TI + G YVNDF GRV ++ +QADA
Sbjct: 717 PNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADA 776

Query: 776 EHRGDAEDILNLHVRSRTGAMVPLRAVVSTSTVLAPFVISRYNLSVAAQINGQVAPGGSS 835
+ R ED+ L+VRS G MVP A ++ V + RYN + +I G+ APG SS
Sbjct: 777 KFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSS 836

Query: 836 GAAMDAIERVAAEALPAGYGYEWSGLSFQERRSAGQESVIFGLAFLFAYLFLVAQYESWM 895
G AM +E +A++ LPAG GY+W+G+S+QER S Q + ++F+ +L L A YESW
Sbjct: 837 GDAMALMENLASK-LPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWS 895

Query: 896 LPIAVILSLGAALFGATGALVLFGLQNSLYVQIAIVLLIGLASKNAILIVEFAKERRE-E 954
+P++V+L + + G A LF +N +Y + ++ IGL++KNAILIVEFAK+ E E
Sbjct: 896 IPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKE 955

Query: 955 GQSIAEAARLGAEQRFRAVMMTALSFILGIIPLAVSTGAGAGARQAVGVTIFGGMLAATT 1014
G+ + EA + R R ++MT+L+FILG++PLA+S GAG+GA+ AVG+ + GGM++AT
Sbjct: 956 GKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATL 1015

Query: 1015 IGLFIIPALFAAI 1027
+ +F +P F I
Sbjct: 1016 LAIFFVPVFFVVI 1028



Score = 97.2 bits (242), Expect = 2e-22
Identities = 78/518 (15%), Positives = 178/518 (34%), Gaps = 40/518 (7%)

Query: 6 FITRTRLAIVLSIVISIAGAIAIFSLPVQQYPEITPPTVSVTAFYP-GASAEVIADVVGG 64
+ T +++ +I + LP PE P GA+ E V+
Sbjct: 533 ILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQ 592

Query: 65 PLETAIN----------GVPNMMYMSSTSSNAGQYSLSVTFEVGTNPDIAQVNVQNRAQL 114
+ + V + + + +E + + V +RA++
Sbjct: 593 VTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKM 652

Query: 115 AISQLPAAVTQ-------QGVSVRSRSPDFVLGIAFYSPDSKLDVLQITNFTSTTIADAL 167
+ ++ + + ++ A D+ + Q N A
Sbjct: 653 ELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDA---LTQARNQLLGMAAQHP 709

Query: 168 SRVSGVGEASVVGASEYSMRIWLNPARMDALGITADDVAAAIQRQNIQASLGQAGAPPAR 227
+ + V + +++ + + + + ALG++ D+ Q I +LG
Sbjct: 710 ASLVSVRPNGLEDTAQFKLEV--DQEKAQALGVSLSDIN-----QTISTALGGTYVNDFI 762

Query: 228 EGTELQYTLVARG---RLLSTDEFGDIVIRTGEAGAIVRLHDIARIELGARSYSSSASFA 284
+ L + + ++ + +R+ G +V +
Sbjct: 763 DRGR-VKKLYVQADAKFRMLPEDVDKLYVRSAN-GEMVPFSAFTTSHWV-YGSPRLERYN 819

Query: 285 GHDTAMLQINQAPGANAIQTANAVRAELQRLSPTFPPGLQYQVVYDATRFVRTSLSLIVR 344
G + +Q APG ++ A ++ L+ P G+ Y + + +
Sbjct: 820 GLPSMEIQGEAAPGTSSGD----AMALMENLASKLPAGIGYDWTGMSYQERLSGNQAPA- 874

Query: 345 ILGEAFLIVLVVTYLFLQDWRATLVSALAMPVAMLGAIAVLLAFGYSINSISLLALVLAI 404
++ +F++V + + W + L +P+ ++G + F + ++ L+ I
Sbjct: 875 LVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTI 934

Query: 405 GLVADDAILVVENVKHVMEDDPTVETVAAARKAMGQITGPIISTTLVLLAVVIPTAFLSG 464
GL A +AIL+VE K +ME + V A A+ PI+ T+L + V+P A +G
Sbjct: 935 GLSAKNAILIVEFAKDLMEKE-GKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNG 993

Query: 465 ISGQLYRQFAVTLSAALVISSVVGLTLSPALAAILLRR 502
+ + +V ++++ + P ++ R
Sbjct: 994 AGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRRC 1031


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
mll6731   RTXTOXIND  44     9e-07    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 43.7 bits (103), Expect = 9e-07
Identities = 19/111 (17%), Positives = 35/111 (31%), Gaps = 7/111 (6%)

Query: 88 IRARVTGFLHSVDFKDGQAVKAGDTLFEIEPDQLNALVASARAQV-------ARADATRI 140
I+ + + K+G++V+ GD L ++ A ++ + R
Sbjct: 99 IKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSR 158

Query: 141 SAERSLARNRDLLARRTVSQATVDEVQAAFDVASADVQVAQAALDTAELNL 191
S E + L + +EV + Q ELNL
Sbjct: 159 SIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNL 209



Score = 34.4 bits (79), Expect = 8e-04
Identities = 20/137 (14%), Positives = 40/137 (29%), Gaps = 20/137 (14%)

Query: 120 QLNALVASARAQVARADATRISAERSLARNRDLLARRTVSQATVDEVQAAFDVASADVQV 179
+ + ++Q+ + ++ +SA+ L + + +
Sbjct: 263 EAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIG---------L 313

Query: 180 AQAALDTAELNLSYAHITAPISGSI-GRATFTTGNLVGPDSGSLARIVSLDTVRVAFA-- 236
L E + I AP+S + T G +V +L IV D A
Sbjct: 314 LTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE-TLMVIVPEDDTLEVTALV 372

Query: 237 -------VTEGLLVTIR 246
+ G I+
Sbjct: 373 QNKDIGFINVGQNAIIK 389


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll6733   ARGDEIMINASE  459    e-163    Bacterial arginine deiminase signature.
		>ARGDEIMINASE#Bacterial arginine deiminase signature.

Length = 409

Score = 459 bits (1183), Expect = e-163
Identities = 123/412 (29%), Positives = 213/412 (51%), Gaps = 12/412 (2%)

Query: 3 TKFGVHSEVGQLRKVMVCAPGRAHQRLTPSNCDALLFDDVLWVDNAKRDHFDFMTKMRDR 62
+ SE+G+L+KV++ PG + LTP LFDD+ +++ A+++H F + +++
Sbjct: 6 NPINIFSEIGRLKKVLLHRPGEELENLTPFIMKNFLFDDIPYLEVARQEHEVFASILKNN 65

Query: 63 GVEVVEMHNLLAETVAVP-EGKKWILDNQVVPNQIGL-GLVDEVRSYLEGLSNRDLAETL 120
VE+ + +L++E + + + ++ +I ++ ++ Y L+ ++ +
Sbjct: 66 LVEIEYIEDLISEVLVSSVALENKFISQFILEAEIKTDFTINLLKDYFSSLTIDNMISKM 125

Query: 121 IGGLSTHEFPESIGGEQLELIRDAAGVNEYLLPPLPNTLYTRDTTCWIYGGVTQNPLYWP 180
I G+ T E L G N +++ P+PN L+TRD I GVT N ++
Sbjct: 126 ISGVVTEELKN----YTSSLDDLVNGANLFIIDPMPNVLFTRDPFASIGNGVTINKMFTK 181

Query: 181 ARHEETILTTSIYKFHPDFAGKVNVWWGDPTKDWGLATLEGGDVMPIGKGNVLIGMSERT 240
R ETI I+K+HP + V +W + A+LEGGD + + KG ++IG+SERT
Sbjct: 182 VRQRETIFAEYIFKYHPVYKENVPIWLNRWEE----ASLEGGDELVLNKGLLVIGISERT 237

Query: 241 SRQAISQLAATLFE-KGAAERVIVAAMPKLRAAMHLDTVFTFADRDCVLLYPDIVNGIEA 299
+++ +LA +LF+ K + + ++ +PK R+ MHLDTVFT D +
Sbjct: 238 EAKSVEKLAISLFKNKTSFDTILAFQIPKNRSYMHLDTVFTQIDYSVFTSFTSDDMYFSI 297

Query: 300 FSYRPDG-KGGVELHKDKGTFVETVRDALGLKKMRVVETGGNAYMRERTQWDSGANLVCA 358
+ + + + K+K + + LG K + GG+ R QW+ GAN++
Sbjct: 298 YVLTYNPSSSKIHIKKEKARIKDVLSFYLGRKIDIIKCAGGDLIHGAREQWNDGANVLAI 357

Query: 359 SPGVVYAYDRNTYTNTLLRKEGIEVITIIGAELGRGRGGGHCMTCPIIRDAV 410
+PG + AY RN TN L + GI+V I +EL RGRGG CM+ P+IR+ +
Sbjct: 358 APGEIIAYSRNHVTNKLFEENGIKVHRIPSSELSRGRGGPRCMSMPLIREDI 409


103  mlr6817  mll6829  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias   Product
mlr6817   -1      13       2.658188  hypothetical protein
mlr6818   -1      11       2.074212  hypothetical protein
mlr6819   -1      11       1.799835  general secretion protein E
mlr6821   0       14       2.125026  general secretion protein F
mlr6822   0       13       1.328494  general secretion protein H
mlr6823   0       14       1.120296  general secretion protein I
mlr6825   -1      12       1.119758  general secretion protein J
mll6827   0       13       1.102735  general secretion protein G
mll6828   0       14       1.446988  type IV prepilin peptidase
mll6829   0       13       0.948054  general secretion protein D
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
mlr6817   TONBPROTEIN  30     0.015    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 30.0 bits (67), Expect = 0.015
Identities = 9/33 (27%), Positives = 9/33 (27%)

Query: 289 PPPVAPPPVEVAAAPVEQAPAPPPSEAPPAVAP 321
PP A P E P P P A
Sbjct: 56 EPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVV 88


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6821   BCTERIALGSPF  188    7e-58    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.
Length = 408

Score = 188 bits (479), Expect = 7e-58
Identities = 105/410 (25%), Positives = 183/410 (44%), Gaps = 13/410 (3%)

Query: 1 MPAFAYRAYLADGSTEAGVLDASTKQDAARKLAQQGRRSYHLAPVNSERPQLR-LPGRSA 59
M + Y+A A G G +A + + A + L ++G L P++ + + S
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERG-----LVPLSVDENRGDQQKSGST 55

Query: 60 LTLTRR------VDLSRLFSELSVLLNAGFTVDRALGAVISGEANRQRRQQLQSVLDLTT 113
RR DL+ L +L+ L+ A ++ AL AV Q + +V
Sbjct: 56 GLSLRRKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVM 115

Query: 114 GGRPIAEAFAALPGITPDV-AALLASGERSGKMAFICQRLADTFEATAKRRAAIIEALAY 172
G +A+A PG + A++A+GE SG + + RLAD E + R+ I +A+ Y
Sbjct: 116 EGHSLADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIY 175

Query: 173 PAFLLLVMSGALVILATVLVPALEPIFEGSSAPKPFTMTMLSAFGTVFRDYPFVFPLAAV 232
P L +V + IL +V+VP + F P + +L R + LA +
Sbjct: 176 PCVLTVVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALL 235

Query: 233 LGLLCYLLLSRSAGARRRLSHWLLRIPLIGALVRDAVIARYLETLALLLGNGVAMTEALG 292
G + + ++ R R LL +PLIG + R ARY TL++L + V + +A+
Sbjct: 236 AGFMAFRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMR 295

Query: 293 LAANVSRQSSLATSFASIEDNVANGARLHGAIVKAGIFDHATMSLVSLGEEANALPVVLD 352
++ +V + D V G LH A+ + +F +++ GE + L +L+
Sbjct: 296 ISGDVMSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLE 355

Query: 353 RAAKMLQLTLTRRIDTVLKLLTPALTISLGFLVGSLVISVMTTILSINDL 402
RAA + ++ L L P L +S+ +V +V++++ IL +N L
Sbjct: 356 RAADNQDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTL 405


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr6822   BCTERIALGSPG  35     5e-05    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.
Length = 145

Score = 34.9 bits (80), Expect = 5e-05
Identities = 11/27 (40%), Positives = 21/27 (77%)

Query: 16 GFTLVEMLVVLAIMALVAAIAAPGLVS 42
GFTL+E++VV+ I+ ++A++ P L+
Sbjct: 9 GFTLLEIMVVIVIIGVLASLVVPNLMG 35


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll6827   BCTERIALGSPG  117    1e-36    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.
Length = 145

Score = 117 bits (294), Expect = 1e-36
Identities = 40/133 (30%), Positives = 64/133 (48%), Gaps = 4/133 (3%)

Query: 21 RDDREGGFTLVELLVVLAIIALIATLAAPQVLRYLGAARTNAAKAQIRNIESALELYYVD 80
D++ GFTL+E++VV+ II ++A+L P ++ A A + I +E+AL++Y +D
Sbjct: 3 ATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLD 62

Query: 81 NAKYPTNEEGLNALVAQPA---GETRWNGPYLKGSTGLKDPWGRPYSYEVKADASGVVIR 137
N YPT +GL +LV P +N DPWG Y + +
Sbjct: 63 NHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLP-ADPWGNDYVLVNPGEHGAYDLL 121

Query: 138 SLGKDGKPDGTGE 150
S G DG+ +
Sbjct: 122 SAGPDGEMGTEDD 134


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll6828   PREPILNPTASE  69     1e-16    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.
Length = 290

Score = 68.7 bits (168), Expect = 1e-16
Identities = 41/143 (28%), Positives = 67/143 (46%), Gaps = 1/143 (0%)

Query: 4 HPSLVLAATIALAVVLGAISIADFRRQIIPDGLNLALAGIGLSYQLAADADAMPQRLLFA 63
P A + L VL A++ D + ++PD L L L GL + L ++ ++ A
Sbjct: 129 APGWGTLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGDAVIGA 188

Query: 64 AATFAAAWLLRRGHFLMTGRIGLGLGDVKMLAAASCWISPLLLPVLLFIASASALLFVGG 123
A + W L L+TG+ G+G GD K+LAA W+ LP++L ++S G
Sbjct: 189 MAGYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAFMGIG 248

Query: 124 QVVATGPAAARARVAFGPFIAIG 146
++ + FGP++AI
Sbjct: 249 LILLRN-HHQSKPIPFGPYLAIA 270


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll6829   BCTERIALGSPD  299    3e-94    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.
Length = 660

Score = 299 bits (768), Expect = 3e-94
Identities = 157/656 (23%), Positives = 284/656 (43%), Gaps = 75/656 (11%)

Query: 85 TSDGSGKFELNLVNAPIADAAKAVLGDALHLNYIVDPRVQGTVTLQTSQPVSQDALVDIL 144
+ +F + I + V + L+ I+DP V+GT+T+++ ++++
Sbjct: 23 RPAAAEEFSASFKGTDIQEFINTVSKN-LNKTVIIDPSVRGTITVRSYDMLNEEQYYQFF 81

Query: 145 QSALAVNA-AGITSRAGTYQIVPLSEIMASTPPVSVPSTSPSGPGVKVQVLQLQFIAADE 203
S L V A I G ++V + + PV+ + G V +V+ L +AA +
Sbjct: 82 LSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGIGDEVVTRVVPLTNVAARD 141

Query: 204 MKTILEPITRQ---GSVLRVDSTRNIITVAGSDSDLNAIREAVSVFDVDWMRGMSVALHP 260
+ +L + GSV+ + + N++ + G + + + V VD SV P
Sbjct: 142 LAPLLRQLNDNAGVGSVVHYEPS-NVLLMTGRAAVIKRLLTIV--ERVDNAGDRSVVTVP 198

Query: 261 LKTSKPEAVAAELDSIFGTKE---GPGAKLIQFIPNDRLNSVLVITSRPAYLARAATWIN 317
L + V + + PG+ + + ++R N+VLV P R I
Sbjct: 199 LSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGE-PNSRQRIIAMIK 257

Query: 318 KLDRLAETNESQLFVYQIQNRPAKELASVLSSVLGTTVKTEGQSGGSNVAPDQTPIAMQS 377
+LDR T + V ++ A +L VL+ + T + + A+
Sbjct: 258 QLDRQQAT-QGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAKPV--------AALDK 308

Query: 378 DGVTPAPLTGPSPSLPQQDNQAPAHATVVADVENNALLIQTTARDYQRIEQILSKVDVLP 437
+ + A + NAL++ +E++++++D+
Sbjct: 309 NI------------------------IIKAHGQTNALIVTAAPDVMNDLERVIAQLDIRR 344

Query: 438 TQVMLEAVIAEVTLNDDLKYGLRWFFENGGTK------VSVTDVAKAA------------ 479
QV++EA+IAEV D L G++W +N G + ++ A
Sbjct: 345 PQVLVEAIIAEVQDADGLNLGIQWANKNAGMTQFTNSGLPISTAIAGANQYNKDGTVSSS 404

Query: 480 ---AAATLPGFNWSYATDNIQVTLNALSKITDVNVISAPTIMALNNQKAILQVGDQVPIL 536
A ++ G + N + L ALS T ++++ P+I+ L+N +A VG +VP+L
Sbjct: 405 LASALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVL 464

Query: 537 TQQSQDTGNGSAPIINSVQMKDTGVILTVTPRINNAGRVMLDIQQEVSNVTKTDSSDIDS 596
T +G+ N+V+ K G+ L V P+IN V+L+I+QEVS+V +S S
Sbjct: 465 TGSQTTSGDNIF---NTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSVADA-ASSTSS 520

Query: 597 ---PTIQQRKVQTRVLVNDGESLALGGLIQQNNSVDRSQVPILGDIPILGNAFKQKDDTI 653
T R V VLV GE++ +GGL+ ++ S +VP+LGDIP++G F+ +
Sbjct: 521 DLGATFNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKV 580

Query: 654 RRTELIIFIRPHVVRDINEAREVTDEFRGKISLQTPIQKRRGG--TKLQQDLKRLA 707
+ L++FIRP V+RD +E R+ + + Q+ + L QDL +
Sbjct: 581 SKRNLMLFIRPTVIRDRDEYRQASSGQYTAFNDAQSKQRGKENNDAMLNQDLLEIY 636


104  mlr6964  mlr6969  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mlr6964   2       28       -6.652794  ABC transporter binding protein
mlr6965   1       30       -6.173521  ABC transporter ATP-binding protein
mlr6966   1       30       -5.938117  ABC transporter permease
mlr6967   1       29       -5.425705  ABC transporter permease
mlr6968   1       29       -5.115467  acetylpolyamine aminohydrolase
mlr6969   1       26       -4.014010  aldehyde dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
mlr6964   MALTOSEBP  46     1e-07    Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 45.9 bits (108), Expect = 1e-07
Identities = 85/364 (23%), Positives = 141/364 (38%), Gaps = 43/364 (11%)

Query: 28 LILGLSTTVALAEGNLNIY---NWGEYTSPELIDKFSKTYNIHVTQTDFDSNDTALAKVR 84
++ S + EG L I+ + G E+ KF K I VT D + +V
Sbjct: 18 MMFSASALAKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIKVTVEHPDKLEEKFPQVA 77

Query: 85 QGSSGFDVVVPSQSFIPTYIQEGLLAETNPGQMENAKNLEERWRNPAFDPGRKYSVPWLW 144
G D++ + Y Q GLLAE P + K W ++ G+ + P
Sbjct: 78 ATGDGPDIIFWAHDRFGGYAQSGLLAEITPDKAFQDKLYPFTWDAVRYN-GKLIAYPIAV 136

Query: 145 YTSGVSVNTSVFKGDINTWKVI--LDPPAELKGKINIVPEMNDIMFA----------AIK 192
+ N + TW+ I LD + KGK ++ + + F A K
Sbjct: 137 EALSLIYNKDLLPNPPKTWEEIPALDKELKAKGKSALMFNLQEPYFTWPLIAADGGYAFK 196

Query: 193 FEGGTWCTSD--------KALLTKVRDRLLEAKKSWLSIDYS-GTLKMASGDVSASLD-- 241
+E G + D KA LT + D L++ K DYS G+ + +++
Sbjct: 197 YENGKYDIKDVGVDNAGAKAGLTFLVD-LIKNKHMNADTDYSIAEAAFNKGETAMTINGP 255

Query: 242 WSGSALKRRTQNHSIAY-----GLPKEGFTYGSDNVVVLKDAPNLENAKLF-QNFIMAPE 295
W+ S + N+ + G P + F G + + +PN E AK F +N+++ E
Sbjct: 256 WAWSNIDTSKVNYGVTVLPTFKGQPSKPFV-GVLSAGINAASPNKELAKEFLENYLLTDE 314

Query: 296 NAALNSTFAKYGAAIIGAEKYYSDDMKGAPELTIPDDMKSKGELLTLCDPKITQLYSRIW 355
+ GA A K Y +++ P + + KGE++ P I Q+ S W
Sbjct: 315 GLEAVNKDKPLGAV---ALKSYEEELAKDPRIAATMENAQKGEIM----PNIPQM-SAFW 366

Query: 356 QDVQ 359
V+
Sbjct: 367 YAVR 370


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mlr6965   PF05272  31     0.011    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.8 bits (69), Expect = 0.011
Identities = 9/31 (29%), Positives = 16/31 (51%)

Query: 51 TLLGPSGCGKTTLLRLIGGFEYPTAGTILLG 81
L G G GK+TL+ + G ++ + +G
Sbjct: 600 VLEGTGGIGKSTLINTLVGLDFFSDTHFDIG 630


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mlr6967   PF06580  31     0.007    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 30.6 bits (69), Expect = 0.007
Identities = 16/98 (16%), Positives = 37/98 (37%), Gaps = 12/98 (12%)

Query: 53 WGGFSLRWFQ--SAANNQQVISASILSLKLAAISATLSTALATLAALAMSRTPRFRGWTL 110
WG ++L F S + ++ S I ++ ++ + L+ A + + +GW L
Sbjct: 20 WGVYTLTGFGFASLYGSPKLHSM-IFNIAISLMGLVLTHAYRSF--------IKRQGW-L 69

Query: 111 AYSAISVPLMVPEIVTAVALLIVTATIRGWTGYSGLGY 148
+ + L V + ++ A W + +
Sbjct: 70 KLNMGQIILRVLPACVVIGMVWFVANTSIWRLLAFINT 107


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
mlr6969   RTXTOXINA  30     0.041    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.
Length = 1024

Score = 29.5 bits (66), Expect = 0.041
Identities = 20/62 (32%), Positives = 29/62 (46%), Gaps = 1/62 (1%)

Query: 306 LLIPADRHAEALDIARRVAAATRVGDPSSEDTDMGPVISQQQFDKIQRMIGLGIEEGATL 365
LLIP D + + V A +G D G I++Q F +++IGL E G T+
Sbjct: 51 LLIPKDYKGQGSSLNDLVRTADELGIEVQYDEKNGTAITKQVFGTAEKLIGL-TERGVTI 109

Query: 366 VA 367
A
Sbjct: 110 FA 111


105  mlr7286  mlr7298  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias   Product
mlr7286   0       15       0.888137  ABC transporter protein, ATP binding component
mlr7287   0       15       0.419508  ABC transporter permease
mlr7288   0       15       0.637128  ABC transporter binding protein
mll7289   0       16       0.691566  short chain dehydrogenase
mll7290   0       13       0.492896  transcriptional regulator
mlr7293   0       15       0.379985  acyl-CoA synthetase
mlr7295   -1      14       0.358804  transcriptional regulator
mlr7297   -1      11       0.448742  component of multidrug efflux system
mlr7298   -2      12       1.111651  component of multidrug efflux system
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr7286   PHPHTRNFRASE  29     0.041    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.
Length = 572

Score = 29.4 bits (66), Expect = 0.041
Identities = 14/46 (30%), Positives = 27/46 (58%)

Query: 155 VAIARALAHQPKVLILDEPTSSLSSAEADRLFALVERLREQGVAIL 200
VAIA+A H + +++ + + S E ++L A +E+ +E+ AI
Sbjct: 14 VAIAKAFIHLEPNVDIEKTSITDVSTEIEKLTAALEKSKEELRAIK 59


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll7289   DHBDHDRGNASE  84     4e-20    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.
Length = 261

Score = 83.9 bits (207), Expect = 4e-20
Identities = 68/255 (26%), Positives = 111/255 (43%), Gaps = 10/255 (3%)

Query: 431 KPLTGQVVLITGGAGAIGAATAKLFADNGAHAVVVDLDGD---KAADTAKKAGNNSIGVA 487
K + G++ ITG A IG A A+ A GAH VD + + K + K ++
Sbjct: 4 KGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP 63

Query: 488 ADITDPAQVRAAFDKAVAVFGGVDILVSNAGAAWEGRIGELDDALLRKSFELNFFAHQSV 547
AD+ D A + + G +DILV+ AG G I L D +F +N +
Sbjct: 64 ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNA 123

Query: 548 AQNAVRIMLEQGTGGVLLFNTSKQAVNPGPKFGAYGVPKAATLFLSRQYALDYGAHGIRS 607
+++ + M+++ +G ++ S A P AY KAA + ++ L+ + IR
Sbjct: 124 SRSVSKYMMDRRSGSIVTVG-SNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 608 NAVNADRIRSGLLTDAMIASRSGARGV---SEKEYMSGNLLGQEVTAQDVAQA--FLHHA 662
N V+ + + ++ A +GA V S + + +G L + D+A A FL
Sbjct: 183 NIVSPGSTETD-MQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSG 241

Query: 663 LAERTTADVTTVDGG 677
A T VDGG
Sbjct: 242 QAGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mlr7295   HTHTETR  68     2e-16    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 67.7 bits (165), Expect = 2e-16
Identities = 28/146 (19%), Positives = 56/146 (38%), Gaps = 14/146 (9%)

Query: 3 RILDCAERLFRHYGYGKTNVADIARELGMSPANIYRFFASKVEIHQAVCGRMLGASYKMA 62
ILD A RLF G T++ +IA+ G++ IY F K ++ + ++
Sbjct: 15 HILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELE 74

Query: 63 YEIM-HLPIGAEERLRRYIHAQYKMTLETMLDEQKVHEMVIVAL-----ERDWGVIDKHV 116
E P LR + LE+ + E++ ++ + + V+ +
Sbjct: 75 LEYQAKFPGDPLSVLREILIH----VLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQ 130

Query: 117 DSI----HDLFAEVIRDGIEAGEFAE 138
++ +D + ++ IEA
Sbjct: 131 RNLCLESYDRIEQTLKHCIEAKMLPA 156


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
mlr7297   RTXTOXIND  43     1e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 43.3 bits (102), Expect = 1e-06
Identities = 28/171 (16%), Positives = 59/171 (34%), Gaps = 17/171 (9%)

Query: 107 SAQAALDAAERQVETTELTRKRAEQLFTRNFAPKSQLEQATLAHDQAVATRDSARSSLDQ 166
++ L+ E ++ + + + QLF L++ D L +
Sbjct: 270 VYKSQLEQIESEILSAKEEYQLVTQLFKN-----EILDKLRQTTDNI----GLLTLELAK 320

Query: 167 AKNQVGYTDLKADRDGIVTAVNA-DVGQVVGSGTPVVSVAVDGEK-EVLIAVPEMEIAEF 224
+ + + ++A V + G VV + ++ + + + EV V +I
Sbjct: 321 NEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFI 380

Query: 225 KPGKSVK---AGFWSDSTLALDGKVREVAGSADPQSRT---FAVRVSLPND 269
G++ F L GKV+ + A R F V +S+ +
Sbjct: 381 NVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEEN 431



Score = 42.1 bits (99), Expect = 2e-06
Identities = 18/127 (14%), Positives = 44/127 (34%), Gaps = 5/127 (3%)

Query: 72 VNGKITERLVDIGQHVVPGDVLARIDPTDYDLSVKSAQAALDAAERQVETTELTRKRAEQ 131
N + E +V G+ V GDVL ++ + Q++L A + ++ + E
Sbjct: 103 ENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIEL 162

Query: 132 LFTRNFAPKSQLEQATLAHDQAVATRDSARSSLDQAKNQ-----VGYTDLKADRDGIVTA 186
+ ++ ++ + + +NQ + +A+R ++
Sbjct: 163 NKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLAR 222

Query: 187 VNADVGQ 193
+N
Sbjct: 223 INRYENL 229


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr7298   ACRIFLAVINRP  497    e-161    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 497 bits (1280), Expect = e-161
Identities = 236/1044 (22%), Positives = 433/1044 (41%), Gaps = 57/1044 (5%)

Query: 13 LSRWAIGHPSIARFLFGLIIIAGALGLMRMGQKEDPDFTFRVMVVQAIWPGSSIQEMEDQ 72
++ + I P A L ++++AGAL ++++ + P + V A +PG+ Q ++D
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 73 VVNKIERKLQETPHLDFVRSFT-RAGSAIITVQIKGDTNAAEVADAFYQVRKKVGDISGD 131
V IE+ + +L ++ S + AGS IT+ + T+ A QV+ K+ +
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTD---PDIAQVQVQNKLQLATPL 117

Query: 132 LPQGLLGPY-FNDEFGDTFITLHSISGDGFSY--PELKKFAI-QARDMLLTTPGVEKAVI 187
LPQ + ++ +++ + D ++ + +D L GV +
Sbjct: 118 LPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQL 177

Query: 188 IGDQPEKLYIDVSSKALAERGLTLPDLQNAIKGQNNVDPAGAVDTGVN------SVRISV 241
G Q + I + + L + LT D+ N +K QN+ AG + + I
Sbjct: 178 FGAQ-YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIA 236

Query: 242 EGDVTKAADIRELRLRAG--GQVTRLGDIATVTSGLEDPFQRKYRFNGHDSVQLGVVMAK 299
+ + ++ LR G V RL D+A V G E+ + R NG + LG+ +A
Sbjct: 237 QTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGEN-YNVIARINGKPAAGLGIKLAT 295

Query: 300 GFKVTDVGKDVEATYKRFEEALPYGVSVDQISDQPGVVTDAVAEFMHALGEALLIVLVVS 359
G D K ++A + P G+ V D V ++ E + L EA+++V +V
Sbjct: 296 GANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVM 355

Query: 360 FLSIG-WRSGLVIAIAIPLVLAATFAIMYELGIDLQRISLGALIIALGLLVDDAMIVVEM 418
+L + R+ L+ IA+P+VL TFAI+ G + +++ +++A+GLLVDDA++VVE
Sbjct: 356 YLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVEN 415

Query: 419 MER-KLEEGLVKIEAASFAYTSTAFPMLSGTLITTAGFIPVGFAASTAGEYVRTLFYVVG 477
+ER +E+ L EA + + ++ ++ +A FIP+ F + G R +
Sbjct: 416 VERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIV 475

Query: 478 IALIVSWFVAVYFTPWLGNMILKQRK-HAGTHHDVFDTRFYRRLRAT-------VGWAVR 529
A+ +S VA+ TP L +LK + F F + VG +
Sbjct: 476 SAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILG 535

Query: 530 HRIIVLVMTLVTFGTSLWAFQFIPQNFFPQSSRPEILVDLWLPEGTSIKEVETQAKALEA 589
L++ + + F +P +F P+ + L + LP G + + + +
Sbjct: 536 STGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTD 595

Query: 590 KMMDDKDKRFIATYIGEGAPRFFLPLDQQLRNPNFAQMLVM---ANDEPARERLIVKLRT 646
+ K A F Q N V + E +
Sbjct: 596 YYL----KNEKANVESVFTVNGFSFSGQ---AQNAGMAFVSLKPWEERNGDENSAEAVIH 648

Query: 647 VLAEDFPSIRAKVDRLFLGPP-------TGWPVQMRVM-GPDRQEVRRIADQVKAKFRED 698
+ IR F P TG+ ++ G + + +Q+ +
Sbjct: 649 RAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQH 708

Query: 699 PL-LGAVHDDWLEPVPAMKLVIDQDRARALGVTSQRIRQMLQATMAGAPLDDFRDGEETV 757
P L +V + LE KL +DQ++A+ALGV+ I Q + + G ++DF D
Sbjct: 709 PASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVK 768

Query: 758 SIVAREPDASRSLLSSVDSVYVPTDFGGFVPLSQVAKVVPVLEQGIEWRRDRLPTISVRA 817
+ + R L VD +YV + G VP S V R + LP++ ++
Sbjct: 769 KLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQG 828

Query: 818 TLPDGVQSNDVVTKMYKDMQGLRDGLAPGYKIEIQGGAEDSAESQASIAAKAPIMLAIIV 877
G S D + M+ L L G + G + S A I ++
Sbjct: 829 EAAPGTSSGDAMAL----MENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVF 884

Query: 878 VLLMIQLQNFGKAMLVLATGPLGIIGAAAALLISGAPFGFVAILGVIALLGIIMRNSIIL 937
+ L +++ + V+ PLGI+G A + ++G++ +G+ +N+I++
Sbjct: 885 LCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILI 944

Query: 938 VDQIDQ-DIARGMERSEAIIGSAVRRFRPIVLTAMTAVLALIPISRAVFWG-----PLAY 991
V+ G EA + + R RPI++T++ +L ++P++ + G +
Sbjct: 945 VEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGI 1004

Query: 992 AMMGGIVVATVLTILVLPAGYALF 1015
+MGG+V AT+L I +P + +
Sbjct: 1005 GVMGGMVSATLLAIFFVPVFFVVI 1028
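
The summary lines printed with each alignment (Score, Expect, Identities, Positives, Gaps) follow the usual BLAST text layout, so they can be read back with a small regular expression. The following is a minimal Python sketch, not part of PredictBias: `parse_hsp_summary` and `parse_expect` are hypothetical helper names, and the sketch assumes the two summary lines are passed in exactly as they appear on this page.

```python
import re

# "Score = 497 bits (1280), Expect = e-161"
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
# "Identities = 236/1044 (22%), Positives = 433/1044 (41%), Gaps = 57/1044 (5%)"
FRACTION_RE = re.compile(r"(Identities|Positives|Gaps)\s*=\s*(\d+)/(\d+)")


def parse_expect(text):
    """Convert an E-value string to float.

    Very small E-values are printed without a leading digit ("e-161"),
    which float() rejects, so a "1" is prepended in that case.
    """
    return float("1" + text if text.startswith("e") else text)


def parse_hsp_summary(score_line, stats_line):
    """Return the numeric fields of one alignment summary as a dict."""
    match = SCORE_RE.search(score_line)
    if match is None:
        raise ValueError("not a Score/Expect line: %r" % score_line)
    summary = {
        "bits": float(match.group(1)),
        "raw_score": int(match.group(2)),
        "expect": parse_expect(match.group(3)),
    }
    # The Gaps field is absent for ungapped alignments, so only the
    # fields actually present on the line are added.
    for name, count, length in FRACTION_RE.findall(stats_line):
        summary[name.lower()] = (int(count), int(length))
    return summary


hsp = parse_hsp_summary(
    "Score = 497 bits (1280), Expect = e-161",
    "Identities = 236/1044 (22%), Positives = 433/1044 (41%), Gaps = 57/1044 (5%)",
)
```

Keeping the counts as `(matches, alignment_length)` pairs rather than the rounded percentages lets the exact fraction be recomputed later.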


106  mlr7458  mlr7466  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mlr7458   2       15       -0.440968  enoyl-ACP reductase
mlr7459   0       11       0.712055   hypothetical protein
msl7460   1       12       0.722797   hypothetical protein
mlr7461   2       13       0.612951   chorismate synthase
mlr7463   0       11       0.574734   GTP cyclohydrolase II
mlr7464   -1      10       1.279721   acetyltransferase
mll7465   0       10       1.310125   ABC transporter permease
mlr7466   0       10       0.391723   transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr7458   DHBDHDRGNASE  54     9e-11    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 54.3 bits (130), Expect = 9e-11
Identities = 63/257 (24%), Positives = 102/257 (39%), Gaps = 17/257 (6%)

Query: 8 MAGKRGLILGIANNRSIAYGIAKACVDHGAEI-ALTYQGEAFKKRVEPLAAELGAFVAGH 66
+ GK I G A + I +A+ GA I A+ Y E +K V L AE A
Sbjct: 6 IEGKIAFITGAA--QGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP 63

Query: 67 CDVTDSASLDEVFANVAKHWGKLDFLVHAIAFSDKDELTGRYVETTRDNFLRTMDISVFS 126
DV DSA++DE+ A + + G +D LV+ G + + + T ++
Sbjct: 64 ADVRDSAAIDEITARIEREMGPIDILVNVAGVLR----PGLIHSLSDEEWEATFSVNSTG 119

Query: 127 FTTIAKRAEALMT--EGGSLLTLTYYGAEKVMPHYNVMGVAKAALEASVRYLAVDLGGKK 184
++ M GS++T+ A +KAA + L ++L
Sbjct: 120 VFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYN 179

Query: 185 IRVNAISAGPIKT-----LAASGIGDFRYILKWNE---YNSPLKQTVTQEEVGDSGVYFL 236
IR N +S G +T L A G + I E PLK+ ++ D+ ++ +
Sbjct: 180 IRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLV 239

Query: 237 SDLSRGVTGEVHHVDSG 253
S + +T VD G
Sbjct: 240 SGQAGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mlr7461   PF05272  28     0.047    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.5 bits (63), Expect = 0.047
Identities = 10/49 (20%), Positives = 19/49 (38%)

Query: 125 DYRGGGRSSARETAARVAAGALARKVVPGMVVRGALVSMGEKSIDRANW 173
DY+ + + G +AR + PG ++V G I ++
Sbjct: 564 DYKPRRLRYLQLVGKYILMGHVARVMEPGCKFDYSVVLEGTGGIGKSTL 612


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mll7465   TCRTETA  34     0.001    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 34.0 bits (78), Expect = 0.001
Identities = 52/279 (18%), Positives = 95/279 (34%), Gaps = 19/279 (6%)

Query: 50 GAVSATFALANAFLAPQISRLVDRLGQTRIVVPTTIISVLAFITLVSAANQDWPVWTLFV 109
G + A +AL AP + L DR G+ +++ + + + + + +A +W L++
Sbjct: 46 GILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPF----LWVLYI 101

Query: 110 SALLAAAMPSMPAMVRARWTELFRGQPEMNTAFAFESAADELVYIAGASLSVGLSAALFP 169
++ A + V + E F F SA +AG L GL P
Sbjct: 102 GRIV-AGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLG-GLMGGFSP 159

Query: 170 EAGMLASTLF--LALGSTAFILQRSTEPQVRPVDHGSSGSAIRLRPVQIIT--------- 218
A A+ L + F+L S + + RP+ + R + +T
Sbjct: 160 HAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVF 219

Query: 219 FALIFIGATFATTEVSTVAITKELGQPGAASLVIGVYALGSFVLGIIVGALNLKAPLQRQ 278
F + +G A V + L S +I G + + +R
Sbjct: 220 FIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRA 279

Query: 279 LAIAVALVTLGTLLPLTASSVP--LLALTVFISGVAISP 315
L + + G +L A+ + + SG P
Sbjct: 280 LMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGMP 318


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mlr7466   HTHTETR  96     7e-27    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 95.8 bits (238), Expect = 7e-27
Identities = 42/207 (20%), Positives = 69/207 (33%), Gaps = 11/207 (5%)

Query: 1 MHRPRKEMIAETRAKLIAAARQAFGTIGYAEASMDDFTASAGLTRGALYHHFGDKKGLLE 60
M R K+ ETR ++ A + F G + S+ + +AG+TRGA+Y HF DK L
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 61 AVIAEIDGEMAARVNEVASRAP-TRWQHFVDECTTYIEMALEPEIQR----IIFRDGPAV 115
+ + + E ++ P + +E + E +R IIF V
Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120

Query: 116 LGDPAQWSNANACVGSMTDHLTAL----QREGVVVPGVDPETAARLINGA-SSQAAQRIA 170
D + ++ + AA ++ G S +
Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180

Query: 171 NSNDPEATSKIVVAAFKQLLEGLLRKP 197
+ K LLE L P
Sbjct: 181 APQSFDLK-KEARDYVAILLEMYLLCP 206


107  mll7563  mll7572  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mll7563   -1      23       -3.739476  hypothetical protein
mll7564   0       21       -3.126049  ABC transporter ATP-binding protein
mll7565   -1      22       -2.258199  ABC transporter permease
mlr7566   -2      23       -1.933220  glycosyltransferase
mll7567   -1      20       -2.374636  nodulation protein noeK, phosphomannomutase
mlr7568   -2      12       -2.202593  lipopolysaccharide biosynthesis protein
mlr7570   0       13       -1.620002  glycosyltransferase
mll7572   -1      9        -1.331764  epimerase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
mll7563   GPOSANCHOR  30     0.015    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 30.4 bits (68), Expect = 0.015
Identities = 22/112 (19%), Positives = 37/112 (33%), Gaps = 4/112 (3%)

Query: 265 DEDNGLLDQLDIMRTRFDDACETFSAAMATEELAVVQLRGELSTRSTETERARQENSKLA 324
+ L + + R D + AM ++ T E ++L
Sbjct: 211 AKIKTLEAEKAALAARKADLEKALEGAMNFST----ADSAKIKTLEAEKAALEARQAELE 266

Query: 325 SFLEQQSLEVRGLAAERDALVGTRGTLIAERDALVNAQASLIAERDALLRDM 376
LE +A+ L + L AE+ L + L A R +L RD+
Sbjct: 267 KALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDL 318


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mll7564   PF05272  32     0.002    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 32.0 bits (72), Expect = 0.002
Identities = 12/37 (32%), Positives = 17/37 (45%)

Query: 60 LGLVGPNGAGKTTLLKVLYGIYQPSGGTISITGKVDA 96
+ L G G GK+TL+ L G+ S I D+
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDS 635


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr7568   NUCEPIMERASE  89     2e-21    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 89.1 bits (221), Expect = 2e-21
Identities = 65/317 (20%), Positives = 107/317 (33%), Gaps = 61/317 (19%)

Query: 306 RVLVTGGAGSIGRTLVKRSLELGAGAVLVADNSEFGIFQLSQYIDEK-DHDRL------- 357
+ LVTG AG IG + KR LE G V+ DN L+ Y D RL
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQ-VVGIDN-------LNDYYDVSLKQARLELLAQPG 53

Query: 358 -KVRIVDVADRRQMTRVVTEFKPDIIFHAAALKHVPLLEENWESAIQTNVFGTLVCAEVA 416
+ +D+ADR MT + + +F + V EN + +N+ G L E
Sbjct: 54 FQFHKIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGC 113

Query: 417 AKCGVPQFLLISS---------------DKAVDPTSVLGITKRAAEQLVSALHESHAIAP 461
+ L SS D P S+ TK+A E + +
Sbjct: 114 RHNKIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLY---- 169

Query: 462 DGRRSGTKFIAVRFGNVFGSNGS---VATIFQAQIEAGGPVTI-TDRRMTRYFMTVAEAV 517
G +RF V+G G F + G + + +M R F + +
Sbjct: 170 -----GLPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIA 224

Query: 518 DLVIMAAADAQSRNGKDD------------YAIYMLDMGKPVPILEVAETMIRMAGKTPY 565
+ +I + + Y +Y + PV +++ + + G
Sbjct: 225 EAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGI--- 281

Query: 566 ADIPIRFTGIRPGEKLH 582
+ ++PG+ L
Sbjct: 282 -EAKKNMLPLQPGDVLE 297


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mll7572   NUCEPIMERASE  46     1e-07    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 45.9 bits (109), Expect = 1e-07
Identities = 46/202 (22%), Positives = 63/202 (31%), Gaps = 35/202 (17%)

Query: 1 MKVLVTGATGFIGRQVVHRLREAGAE------------LRLASRHPERLG-PGQDAMRMP 47
MK LVTGA GFIG V RL EAG + + L E L PG ++
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 48 DVDAPTAAFLALARGVTDVVHCAGLNNDEGNATEADFRAA----------NAELSARLAQ 97
D L + V R + N + +
Sbjct: 61 LADREGMTDLFASGHFERVFISPH---------RLAVRYSLENPHAYADSNLTGFLNILE 111

Query: 98 AAAEQASGRFIQLSSIRAVIGARVSATIDEDTIPD-PQCAYGRSKREAEIRVLDAYASHG 156
+ SS +V G D D P Y +K+ E + Y+
Sbjct: 112 GCRHNKIQHLLYASS-SSVYGLNRKMPFSTDDSVDHPVSLYAATKKANE-LMAHTYSHLY 169

Query: 157 RSDATVLRLPPVYGTGMQGNLA 178
AT LR VYG + ++A
Sbjct: 170 GLPATGLRFFTVYGPWGRPDMA 191


108  mlr7674  mlr7686  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mlr7674   -2      12       -0.852495  ABC transporter ATP-binding protein
mlr7676   -1      13       -1.086783  ABC transporter binding protein
mlr7677   1       15       -0.885077  ABC transporter permease
mlr7678   0       14       -0.773800  ABC transporter permease
mlr7680   0       15       -0.965838  3-oxoacyl-ACP reductase
mll7682   0       12       0.981277   hypothetical protein
msr7683   0       13       1.510820   hypothetical protein
mlr7684   0       10       1.582596   two-component system, regulatory protein
mlr7686   1       9        1.777246   two-component system sensor histidine kinase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mlr7674   PF05272  33     0.002    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 33.1 bits (75), Expect = 0.002
Identities = 13/56 (23%), Positives = 19/56 (33%), Gaps = 9/56 (16%)

Query: 38 LTLLGPSGSGKTTFLMILAGFVQPTEGKLFSDGTDITDRPAEQRAAGMVFQGYALF 93
+ L G G GK+T + L G FSD T + + G +
Sbjct: 599 VVLEGTGGIGKSTLINTLVG------LDFFSD-THFDI--GTGKDSYEQIAGIVAY 645


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr7680   DHBDHDRGNASE  95     1e-25    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 95.1 bits (236), Expect = 1e-25
Identities = 73/235 (31%), Positives = 107/235 (45%), Gaps = 24/235 (10%)

Query: 9 GRTAIVTGGARGIGRAIAHKLSLSGADVWIWDIEPVELEGTRS-----------LSVDVT 57
G+ A +TG A+GIG A+A L+ GA + D P +LE S DV
Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVR 67

Query: 58 KRDNVMQALT-IMGEVG-VDILVNNAGWLGGYKPFEEFEPAEWQRILQVNLLGTFEVTHR 115
+ + I E+G +DILVN AG L EW+ VN G F +
Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLR-PGLIHSLSDEEWEATFSVNSTGVFNASRS 126

Query: 116 VLPLMRRAGKGRIVNMGSLAGKEGLPSLAAYSAASAGVIAFTKALSREVSDTDIRVNCIA 175
V M G IV +GS S+AAY+++ A + FTK L E+++ +IR N ++
Sbjct: 127 VSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186

Query: 176 PGPIDTRLIRDL-----GNETVDAMISAS-----PLKRLGDPNEVAALVVWLCSD 220
PG +T + L G E V + PLK+L P+++A V++L S
Sbjct: 187 PGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSG 241


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
msr7683   AEROLYSIN  26     0.027    Aerolysin signature.
		>AEROLYSIN#Aerolysin signature.

Length = 493

Score = 26.2 bits (57), Expect = 0.027
Identities = 10/27 (37%), Positives = 15/27 (55%)

Query: 9 RTATLALCAAGLFASQATAMSVIAPDQ 35
+ L+L +GL +QA A + PDQ
Sbjct: 5 KLTGLSLIISGLLMAQAQAAEPVYPDQ 31


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
mlr7684   HTHFIS  80     1e-19    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 80.3 bits (198), Expect = 1e-19
Identities = 31/123 (25%), Positives = 58/123 (47%), Gaps = 1/123 (0%)

Query: 2 RVLVVEDDKDLNRQIADALVDAGYVVDRAFDGEEGHFLGDTEPYDAVVLDIGLPQIDGIS 61
+LV +DD + + AL AGY V + D VV D+ +P +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VVERWRRGGRKMPVLILTARDRWSDKVSGIDAGADDYVTKPFHIEEVLARL-RALIRRAA 120
++ R ++ +PVL+++A++ + + + GA DY+ KPF + E++ + RAL
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 121 GHA 123
+
Sbjct: 125 RPS 127


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mlr7686   PF06580  31     0.013    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 30.6 bits (69), Expect = 0.013
Identities = 21/99 (21%), Positives = 36/99 (36%), Gaps = 20/99 (20%)

Query: 377 LLENAMKWA----KSAVSVTVAPGKDDNLFEISIDDDGPGIPEDKARDALKRGRRLDETK 432
L+EN +K + + KD+ + +++ G L TK
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLA--------------LKNTK 308

Query: 433 PGTGLGLAIVAD-LVNEYGGILALE-RSGLGGLKAVVRL 469
TG GL V + L YG ++ G + A+V +
Sbjct: 309 ESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLI 347


109  mlr7692  mll7700  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias   Product
mlr7692   -2      9        0.917671  serine protease
mlr7695   -2      9        0.750302  two-component response regulator
mlr7697   -2      10       1.096476  two-component, sensor histidine kinase
mlr7698   -2      11       1.256893  bifunctional glutamine-synthetase
mll7700   -2      12       1.206800  two-component, sensor histidine kinase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
mlr7692   V8PROTEASE  65     1e-13    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 65.0 bits (158), Expect = 1e-13
Identities = 35/163 (21%), Positives = 58/163 (35%), Gaps = 25/163 (15%)

Query: 132 PRPVAQGSGFFISEDGYLVTNNHVVEEG-------TAFTVV-----TNDGKELDAKLVGT 179
P SG + L+TN HVV+ AF +G ++
Sbjct: 98 PTGTFIASGVVVG-KDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKY 156

Query: 180 DPRTDLAVLKVEG-------GGKFTYVDFADDSKVRVGDWVVAVGNPFGLGGTVTAGIVS 232
DLA++K G +++++ +V + G P A +
Sbjct: 157 SGEGDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGDKP---VATMWE 213

Query: 233 ARGRDIGAGPYDDFLQIDASVNRGNSGGPTFNLNGQVVGINTA 275
++G+ +Q D S GNSG P FN +V+GI+
Sbjct: 214 SKGKITYLKGE--AMQYDLSTTGGNSGSPVFNEKNEVIGIHWG 254


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
mlr7695   HTHFIS  91     3e-23    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 91.0 bits (226), Expect = 3e-23
Identities = 31/125 (24%), Positives = 56/125 (44%)

Query: 17 KILVIEDDREAADYLQKAFTEAGHTAHVAGDGETGFALADAGDYDVMVIDRMMPRRDGLS 76
ILV +DD L +A + AG+ + + T + AGD D++V D +MP +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 77 VIAGLRSRGNTTPVLILSALGEVDDRVTGLRAGGDDYLTKPYAFSELLARVEVLNRRASA 136
++ ++ PVL++SA + G DYL KP+ +EL+ +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 137 KEAET 141
+ ++
Sbjct: 125 RPSKL 129


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mlr7697   PF06580  33     0.003    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 32.5 bits (74), Expect = 0.003
Identities = 14/82 (17%), Positives = 29/82 (35%), Gaps = 18/82 (21%)

Query: 382 IVDNAIKYSTDSTSKPA-VRVTLERTHGEIWLCVADNGQGIPDDADRARATERFVRLEKS 440
+V+N IK+ + + + + +G + L V + G L
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGS-----------------LALK 305

Query: 441 RSQPGSGLGLSLAKAVMTFHYG 462
++ +G GL + + YG
Sbjct: 306 NTKESTGTGLQNVRERLQMLYG 327


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mll7700   PF06580  40     7e-06    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 40.2 bits (94), Expect = 7e-06
Identities = 34/165 (20%), Positives = 53/165 (32%), Gaps = 34/165 (20%)

Query: 192 FSLDQEQIDLGPLISETVRVVSLQAAQKA-----ITVETRIADALSLFADRRAIKQIVIN 246
+SL L E V S + E +I A+ D + +V
Sbjct: 206 YSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAI---MDVQVPPMLVQT 262

Query: 247 LLSNAVKF----TGQGGHISVRARNTSGALVLTIEDNGCGIPKEALGKLGRPFEQVQNQF 302
L+ N +K QGG I ++ +G + L +E+ G K
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKN---------------- 306

Query: 303 SKNHAGSGLGLA-ISRSLAELQGG--ALKIRSTEGVGTIVSVRIP 344
+G GL + L L G +K+ +G V IP
Sbjct: 307 --TKESTGTGLQNVRERLQMLYGTEAQIKLSEKQG-KVNAMVLIP 348


110  mlr7771  mlr7776  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
mlr7771   -2      8        1.004791   hypothetical protein
mlr7772   -1      10       -0.634380  hypothetical protein
mlr7773   -1      10       0.294817   extracytoplasmic sigma factor EcfR
mlr7774   -1      10       -0.290507  hypothetical protein
mlr7776   -2      10       -1.179890  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
mlr7771   TATBPROTEIN  33     0.003    Bacterial sec-independent translocation TatB protein...
		>TATBPROTEIN#Bacterial sec-independent translocation TatB protein signature.

Length = 171

Score = 32.7 bits (74), Expect = 0.003
Identities = 17/117 (14%), Positives = 40/117 (34%), Gaps = 6/117 (5%)

Query: 20 RQQSRIGLIERELGALRSLVLSGAVPPVAKPAEQMAADGKAEAAPVPAAAADIASPA--- 76
Q+ ++ + L + L+ P + +++ ++ A + AS
Sbjct: 51 TQELKLQEFQDSLKKVEKASLTNLTPELKASMDELRQAAESMKRSYVANDPEKASDEAHT 110

Query: 77 VSEPVVQASATETEAPAGEAVSGPWSTSEAPKAAEPAEPVAGANPPGKPDIETALGT 133
+ PVV+ + E A + + +P+ P P + +TA +
Sbjct: 111 IHNPVVKDNEAAHEGVTPAAAQ---TQASSPEQKPETTPEPVVKPAADAEPKTAAPS 164


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr7773   OUTRMMBRANEA  28     0.041    Outer membrane protein A signature.
		>OUTRMMBRANEA#Outer membrane protein A signature.

Length = 346

Score = 27.6 bits (61), Expect = 0.041
Identities = 12/28 (42%), Positives = 15/28 (53%), Gaps = 3/28 (10%)

Query: 7 RFALAACAAPGRRVE---SGSTRAISEP 31
R AL C AP RRVE G +++P
Sbjct: 317 RAALIDCLAPDRRVEIEVKGIKDVVTQP 344


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mlr7774   INTIMIN  30     0.010    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 30.0 bits (67), Expect = 0.010
Identities = 14/35 (40%), Positives = 20/35 (57%)

Query: 169 ANGFQLVGGRLLPAGEAKAAMLLYEDDKGERISLF 203
ANGF + LP+ A A L+YE G+ ++LF
Sbjct: 327 ANGFDIRFNGYLPSYPALGAKLMYEQYYGDNVALF 361


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr7776   SACTRNSFRASE  41     2e-06    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 41.1 bits (96), Expect = 2e-06
Identities = 23/90 (25%), Positives = 38/90 (42%), Gaps = 1/90 (1%)

Query: 65 FIERETAETLVAEVDGRVAGYAIVLFRKGSGVARLYSIAVGPFFGALGIGRQLLTAAEEA 124
++E E + ++ G I + +G A + IAV + G+G LL A E
Sbjct: 59 YVEEEGKAAFLYYLENNCIGR-IKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEW 117

Query: 125 AFEHDRMMLRLEVREDNSRAIRIYEQAGYR 154
A E+ L LE ++ N A Y + +
Sbjct: 118 AKENHFCGLMLETQDINISACHFYAKHHFI 147


111  msr8220  mlr8233  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
msr8220   2       34       -5.674144  hypothetical protein
mlr8221   0       28       -4.028583  hypothetical protein
mlr8222   0       18       -2.485240  hypothetical protein
mll8223   -1      12       0.840044   hypothetical protein
mlr8224   -1      11       1.892133   hypothetical protein
mlr8225   -1      12       2.428463   hypothetical protein
mlr8227   0       11       1.137162   iron ABC transporter ATP-binding protein
mlr8228   -1      12       0.733775   iron (III) ABC transporter substrate-binding
mlr8229   -1      9        0.280689   ABC transporter permease
mlr8230   -1      8        -0.410785  transcriptional regulator
mlr8231   1       8        -0.469252  efflux pump protein FarA
mlr8233   3       10       -1.091604  efflux pump protein FarB
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
msr8220   SALSPVBPROT  27     0.007    Salmonella virulence plasmid 65kDa B protein signature.
		>SALSPVBPROT#Salmonella virulence plasmid 65kDa B protein signature.

Length = 591

Score = 26.6 bits (58), Expect = 0.007
Identities = 9/25 (36%), Positives = 14/25 (56%)

Query: 46 AQGPGRGLVRFGWACRRMAVSRRIS 70
+ G G G GW+C M+++R S
Sbjct: 59 SSGGGNGPFGVGWSCATMSIARSTS 83


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
mlr8221   IGASERPTASE  37     3e-04    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 36.6 bits (84), Expect = 3e-04
Identities = 40/223 (17%), Positives = 72/223 (32%), Gaps = 35/223 (15%)

Query: 205 VRDQIEEKNTVRSSLIEQRDAEIKDAVDQFERQRDSFVQRIKMARDSGDSDSAR---KLE 261
V D+ E N +L + A QRD + + ++ D + + +
Sbjct: 928 VADKTGEPNHNELTLFDASKA-----------QRDHL--NVSLVGNTVDLGAWKYKLRNV 974

Query: 262 DEVAKLANPR-SKIGAKFDA-QIDPLD---QEIASLRSD------FDRLRASSP-PMTAD 309
+ L NP K D I + ++ S+ S+ D P P T
Sbjct: 975 NGRYDLYNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPS 1034

Query: 310 QRQKLTG----RRSNLEQTRDADAAS---WQRRLDEAGKRLADAQGAEANKATVAAQNQV 362
+ + + S + + DA R + + K A A ++ +
Sbjct: 1035 ETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKE 1094

Query: 363 RQDQIAKELAALEKERIPMARTDQVRRIAARWYGAKPEQVTPE 405
Q KE A +EKE T++ + + P+Q E
Sbjct: 1095 TQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSE 1137


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mlr8227   PF05272  28     0.050    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 27.7 bits (61), Expect = 0.050
Identities = 10/32 (31%), Positives = 16/32 (50%)

Query: 34 LAIIGPNGAGKTTLLRMLSGMLRPSAGEVKLG 65
+ + G G GK+TL+ L G+ S +G
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIG 630


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
mlr8228   FERRIBNDNGPP  36     2e-04    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 35.7 bits (82), Expect = 2e-04
Identities = 31/185 (16%), Positives = 61/185 (32%), Gaps = 22/185 (11%)

Query: 16 TAFAFPVTVDSCGKPLTFDAPPKRAVIHDLNMAEMAFALKLQPSIVGLTGITGW--YKVG 73
TA A + P R V + E+ AL + P G+ + +
Sbjct: 14 TAMALSPLLWQMNTAHAAAIDPNRIVALEWLPVELLLALGIVP--YGVADTINYRLWVSE 71

Query: 74 PEFKAEQGSIPELAPKY-PTLENLVAVEPDFFFAGWYYGMKPGGDVTPDTLAPHGIKTLV 132
P S+ ++ + P LE L ++P F YG P+ LA +
Sbjct: 72 PPLPD---SVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPS------PEMLAR-----IA 117

Query: 133 LTESCVHLDKNRPAASMDLLYGDVEKLGKIFGKEADAEKLVSGWKAQLADITTKVGDRKG 192
D +P A + ++ + ++ AE ++ ++ + + + R
Sbjct: 118 PGRGFNFSDGKQPLAMAR---KSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGA 174

Query: 193 TRVFL 197
+ L
Sbjct: 175 RPLLL 179


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
mlr8231   RTXTOXIND  107    4e-28    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 107 bits (270), Expect = 4e-28
Identities = 56/330 (16%), Positives = 104/330 (31%), Gaps = 29/330 (8%)

Query: 32 YIWVTGGRYQETENANLQQAKVSIASDTAGRIVQVAITDHQMVKQGDLLFTIDPEPYRIA 91
+ E Q SI + + Q V + ++L +
Sbjct: 137 LKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLT--SLIKEQ 194

Query: 92 LAQADAAVAAARLNVEQLRAAYSQSMAQEKSASSEVDYAQSQYDRAADLAQKG------- 144
+ LN+++ RA +A+ + +S+ D + L K
Sbjct: 195 FSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAV 254

Query: 145 INAKSSLDQARNDLDKAKQQVAVAQQGIISAKAALGGNP-DIETD---KHPTVMAALAA- 199
+ ++ +A N+L K Q+ + I+SAK + + K +
Sbjct: 255 LEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLL 314

Query: 200 ---RDKAAYDLARTTVKAPADGVISQASSFKVGQFVGSGTPLFSLVESDDT-WIDANFKE 255
K + ++AP + Q G V + L +V DDT + A +
Sbjct: 315 TLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQN 374

Query: 256 TQLTNMKPGQKAEIVVDTYPGKTF---EATVKAIGAGTGAEFSLLPAQNATGNWVKVTQR 312
+ + GQ A I V+ +P + VK I G V
Sbjct: 375 KDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDA-------IEDQRLGLVFNVIIS 427

Query: 313 IPVRLELTDPDAKMALRTGMSASVTVDTGT 342
I L+ + + L +GM+ + + TG
Sbjct: 428 IEEN-CLSTGNKNIPLSSGMAVTAEIKTGM 456



Score = 51.4 bits (123), Expect = 3e-09
Identities = 19/174 (10%), Positives = 55/174 (31%), Gaps = 4/174 (2%)

Query: 10 KRRTGRFFLMLALPAALVIGGGYIWVTGGRYQETENANLQQA----KVSIASDTAGRIVQ 65
+ R ++A + +I G+ + AN + I + +
Sbjct: 50 ETPVSRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKE 109

Query: 66 VAITDHQMVKQGDLLFTIDPEPYRIALAQADAAVAAARLNVEQLRAAYSQSMAQEKSASS 125
+ + + + V++GD+L + + +++ ARL + + +
Sbjct: 110 IIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELK 169

Query: 126 EVDYAQSQYDRAADLAQKGINAKSSLDQARNDLDKAKQQVAVAQQGIISAKAAL 179
D Q ++ + K +N + + + + ++ A +
Sbjct: 170 LPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARI 223


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
mlr8233   TCRTETB  145    3e-40    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 145 bits (366), Expect = 3e-40
Identities = 91/406 (22%), Positives = 182/406 (44%), Gaps = 14/406 (3%)

Query: 16 ITVALMLATVMQALDTTIANVALPTMTGDLGASPDNINWVLTSYIVAAAIMTPVTGWLAD 75
I + L + + L+ + NV+LP + D P + NWV T++++ +I T V G L+D
Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74

Query: 76 RFGRKELFLTAVVGFTIFSMLCGLAWSLETIVLF-RLMQGVFGAAIVPLSQTFLLDINPK 134
+ G K L L ++ S++ + S ++++ R +QG AA L + PK
Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134

Query: 135 ERHGQAMAIWGAGIMLGPILGPTLGGWLTENFNWRWVFFINLPVGIVAFLGMAAYLPAVA 194
E G+A + G+ + +G +GP +GG + +W ++ I + + I+ + L
Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPM-ITIITVPFLMKLLKKEV 193

Query: 195 RRVRSFDFFGFAMISLGVGALQLMLDRGGEVDWFSSVEIWIELGLAITGFWVFIIHTMTA 254
R FD G ++S+G+ L F++ L +++ F +F+ H
Sbjct: 194 RIKGHFDIKGIILMSVGIVFFML----------FTTSYSISFLIVSVLSFLIFVKHIRKV 243

Query: 255 DHPFIDPKIFLDQNFVTGLAFIFVMGVLILASMSLLPPMLSTIFGYPTITIG-MVIGPRG 313
PF+DP + + F+ G+ ++ + +S++P M+ + T IG ++I P
Sbjct: 244 TDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGT 303

Query: 314 IGTMISMLVVGRIMGKIDARILVVIGFLLTAHSLYTMASFTPQMDNWLIISSGVIQGLGM 373
+ +I + G ++ + ++ IG + S T + F + +W + V G+
Sbjct: 304 MSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTAS-FLLETTSWFMTIIIVFVLGGL 362

Query: 374 GMVFVPLSTVAFATLDARYRTDATALFSLVRNLGSSIGVSVVTVLL 419
+ST+ ++L + +L + L G+++V LL
Sbjct: 363 SFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLL 408



 
Contact Sachin Pundhir for Bugs/Comments.