PredictBias

Identification of genomic and pathogenicity islands in prokaryotic genomes
 
A) Input parameters
Genome: 1644.gbk
Threshold dinucleotide bias: 2
Threshold codon bias: 4
Threshold %GC bias: 3
E-value (RPSBlast): 0.05
Genome (non-pathogenic):
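
The three composition thresholds in the input parameters can be read as per-ORF cut-offs. The sketch below shows one possible reading, not the server's documented procedure: it assumes that an ORF counts as biased on a measure when the absolute value of its score meets or exceeds the corresponding threshold. The names `THRESHOLDS` and `biased_measures` are illustrative only.

```python
# Thresholds copied from the input parameters of this run.
THRESHOLDS = {"dinucleotide": 2, "codon": 4, "gc": 3}


def biased_measures(dn_bias, cdn_bias, gc_bias, thresholds=THRESHOLDS):
    """Return the names of the measures whose score reaches its threshold.

    Assumption (not confirmed by the page): a measure is flagged when
    abs(score) >= threshold.
    """
    scores = {"dinucleotide": dn_bias, "codon": cdn_bias, "gc": gc_bias}
    return [name for name, value in scores.items()
            if abs(value) >= thresholds[name]]


# Values as read from the MAP0067 row: DNBias 1, CDNBias 19, %GCBias -4.378584
print(biased_measures(1, 19, -4.378584))
```

Under this assumed rule a single ORF can be flagged on some measures and not on others, which is why the function returns a list rather than a single yes/no value.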
 
C) Potential GIs and PAIs in NC_002944
S.No  Start     End       Bias  Virulence  Insertion elements  Prediction
1     MAP0018c  MAP0045   Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP0018c  3       10       -0.587704  hypothetical protein
MAP0019c  3       10       -0.667297  PbpA
MAP0020c  3       13       -1.219596  RodA
MAP0021c  2       14       -0.112817  Ppp
MAP0022c  1       20       -1.722703  hypothetical protein
MAP0023c  1       22       -2.269831  hypothetical protein
MAP0024c  0       24       -2.549508  *hypothetical protein
MAP0025   1       24       -2.925134  acyl carrier protein
MAP0026   1       22       -2.490513  long-chain-fatty-acid--ACP ligase
MAP0027   2       21       -2.915819  hypothetical protein
MAP0028c  2       18       -2.094160  hypothetical protein
MAP0029c  2       17       -1.376074  hypothetical protein
MAP0030c  2       16       -1.934364  hypothetical protein
MAP0031c  2       16       -0.889721  hypothetical protein
MAP0032c  2       14       -0.865532  FadE25_1
MAP0033c  1       17       -0.006313  hypothetical protein
MAP0034   0       19       -0.570921  hypothetical protein
MAP0035   1       22       -0.276353  hypothetical protein
MAP0036   1       20       0.968767   hypothetical protein
MAP0037   3       20       0.419614   hypothetical protein
MAP0038   3       17       1.354222   hypothetical protein
MAP0039   2       14       1.529749   hypothetical protein
MAP0040   1       13       1.732129   hypothetical protein
MAP0041   0       12       2.408145   hypothetical protein
MAP0042   0       13       2.280242   hypothetical protein
MAP0043c  -1      10       1.185536   hypothetical protein
MAP0044c  1       9        0.550049   hypothetical protein
MAP0045   2       10       0.266039   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0018c  YERSSTKINASE  34     6e-04    Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 33.9 bits (77), Expect = 6e-04
Identities = 37/144 (25%), Positives = 61/144 (42%), Gaps = 19/144 (13%)

Query: 70 HPGIAAVHDYGESQLDGEGRTAYLVMELVNGEPLNSVLKRTGRLSLRHALDMLEQTGRAL 129
HP +A VH G + L+M+ V+G + L+ + ++ G
Sbjct: 190 HPNLANVHGMAVVPY-GNRKEEALLMDEVDGWRCSDTLRTLADSWKQGKINSEAYWGTIK 248

Query: 130 QVAH----------AAGLVHRDVKPGNILIT-PTGQVKITDFGIAKAVDAAPVTQTGMVM 178
+AH AG+VH D+KPGN++ +G+ + D G+ P G
Sbjct: 249 FIAHRLLDVTNHLAKAGVVHNDIKPGNVVFDRASGEPVVIDLGLHSRSGEQP---KGF-- 303

Query: 179 GTAQYIAPEQALGH-DATPASDVY 201
T + APE +G+ A+ SDV+
Sbjct: 304 -TESFKAPELGVGNLGASEKSDVF 326
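
The `Identities`, `Positives` and `Gaps` fields in each alignment header are counts over the alignment length, followed by a whole-number percentage. As a rough sketch, the helper below (the name `parse_alignment_stats` is hypothetical) extracts the counts and recomputes the percentages. The percentages printed in this report agree with rounding down rather than rounding to nearest (for example, 22/71 is shown as 30%), so the sketch uses integer division.

```python
import re

# Matches e.g. "Identities = 37/144 (25%)"; the Gaps field is optional in a line.
STAT_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_alignment_stats(line):
    """Map each field name to (count, alignment_length, recomputed_percent)."""
    stats = {}
    for name, count, length, _shown in STAT_RE.findall(line):
        count, length = int(count), int(length)
        # Integer division reproduces the truncated percentages seen in the report.
        stats[name] = (count, length, 100 * count // length)
    return stats


line = "Identities = 37/144 (25%), Positives = 61/144 (42%), Gaps = 19/144 (13%)"
print(parse_alignment_stats(line))
```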


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0021c  PF05616       31     0.014    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 30.9 bits (69), Expect = 0.014
Identities = 22/71 (30%), Positives = 28/71 (39%), Gaps = 6/71 (8%)

Query: 423 PRATSPPGAQATRSPVPETGGPASPAP-PTTSASPTPSTNATPGPASSSPAGPTTTSQTL 481
P + P AQ P+PE +PA P + +P N P P + A P T Q
Sbjct: 317 PGSAEAPNAQ----PLPEVSPAENPANNPAPNENPGTRPNPEPDPDLNPDANPDTDGQPG 372

Query: 482 TALPGPPLQPG 492
T P P P
Sbjct: 373 TR-PDSPAVPD 382


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0022c  IGASERPTASE   28     0.013    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 28.5 bits (63), Expect = 0.013
Identities = 14/55 (25%), Positives = 21/55 (38%)

Query: 88 ADDSTLVLTDDYASARHARLTQRGSEWYVEDLGSTNGTYLDRAKVTTAVRVPIGT 142
+D T +DY R + + S GTY D+ K VR+ G+
Sbjct: 154 TEDQTQKRREDYYMPRLDKFVTEVAPIEASTASSDAGTYNDQNKYPAFVRLGSGS 208


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0027   CHANLCOLICIN  35     8e-04    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 35.1 bits (80), Expect = 8e-04
Identities = 48/170 (28%), Positives = 72/170 (42%), Gaps = 33/170 (19%)

Query: 94 QLPKIGADLEGIAAALAEAQ-KAGAARIATLDAQLKNISDLVVEAVALLNDKSLSPADRD 152
QL K A+ A A AEAQ KA A R DA + + D+V E AL ++ S +P+ +
Sbjct: 61 QLKKTQAEQAARAKAAAEAQAKAKANR----DALTQRLKDIVNE--ALRHNASRTPSATE 114

Query: 153 ALHALINTC--ENDALRHTKA---ALADLQAIRNGY--ADGLQKSLNSLHAEGYDGAPLH 205
HA E++ LR KA A + +A + A+ +K + AE
Sbjct: 115 LAHANNAAMQAEDERLRLAKAEEKARKEAEAAEKAFQEAEQRRKEIEREKAE-------- 166

Query: 206 GVDADGVAPASPAQQAALDDIRRATNQAVLDQMAKVRAAQRALDKAMADV 255
+A A + AAL + +A V AQ+ L A ++V
Sbjct: 167 TERQLKLAEAEEKRLAALSEEAKA-----------VEIAQKKLSAAQSEV 205


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0041   IGASERPTASE   45     5e-07    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 45.4 bits (107), Expect = 5e-07
Identities = 49/291 (16%), Positives = 73/291 (25%), Gaps = 34/291 (11%)

Query: 71 NDELRKLKRAGMTDTARQAAEARVIEQARTRAVRRVESTIRYLHAVQSGARPPAPPRPTL 130
N E+ K + T I+ E R A PPAP P+
Sbjct: 982 NPEVEKRNQ---TVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVP---PPAPATPSE 1035

Query: 131 DAPGADLERTQVLPAV----GDAEHTAGPPAQQTPEARPRPTRRRAAEDEPAQLEETRVL 186
Q V DA T + EA+ + E +
Sbjct: 1036 TTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSN------VKANTQTNEVAQSG 1089

Query: 187 PAIRDDAAAADASVPETPASEAPVPEAPETPAPEAPAAEAPVPEPLGEPTTRGRHHAPVP 246
++ E E +T E P + V + T P
Sbjct: 1090 SETKETQTTETKETATVEKEEKAKVETEKT--QEVPKVTSQVSPKQEQSETVQPQAEPAR 1147

Query: 247 EPVP----------EHVEPDIAAPAEAADLEQTQVIP---VVRRDEPVPEPPVVAPPEPA 293
E P + D PA+ Q + V V E P P
Sbjct: 1148 ENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATT 1207

Query: 294 EEPADSGRHHAPADRDEPATESRPA---VADTETAQPRVEKPAPTTAFDNN 341
+ +S + P +R + S P A T + T+ + N
Sbjct: 1208 QPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTN 1258


2     MAP0067   MAP0072c  Y     N          N                   Genomic Island
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP0067   1       19       -4.378584  30S ribosomal protein S6
MAP0068   0       19       -3.995935  single-stranded DNA-binding protein
MAP0069   0       19       -4.109375  30S ribosomal protein S18
MAP0070   -1      20       -3.938267  50S ribosomal protein L9
MAP0071   -1      15       -4.309035  hypothetical protein
MAP0072c  -1      14       -3.908999  hypothetical protein
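
Across the islands listed in this report, the Prediction column follows the Virulence flag: every island with Virulence `Y` is labelled "Pathogenicity Island (biased-composition)" and every island with Virulence `N` is labelled "Genomic Island", while all of them have Bias `Y`. The sketch below encodes only that observed pattern. The function name `predict_label` and the handling of unbiased regions are assumptions, and the server's actual decision logic may cover further cases not seen here.

```python
def predict_label(bias, virulence):
    """Reproduce the Prediction column for the flag combinations seen in this report."""
    if bias != "Y":
        # No unbiased region appears in this report, so the outcome is unknown.
        return None
    if virulence == "Y":
        return "Pathogenicity Island (biased-composition)"
    return "Genomic Island"


print(predict_label("Y", "N"))  # flags of island 2 (MAP0067 to MAP0072c)
print(predict_label("Y", "Y"))  # flags of island 1 (MAP0018c to MAP0045)
```

The Insertion elements flag is deliberately left out of the function, since the labels shown here do not change with it.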
3     MAP0086   MAP0109   Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP0086   1       22       -4.013949  hypothetical protein
MAP0087   0       20       -4.896235  hypothetical protein
MAP0088   1       20       -4.939296  hypothetical protein
MAP0089   0       20       -4.297398  hypothetical protein
MAP0090c  2       17       -4.316318  hypothetical protein
MAP0091c  1       19       -5.061969  hypothetical protein
MAP0092   0       24       -4.870161  hypothetical protein
MAP0093   -1      30       -4.797975  hypothetical protein
MAP0094   -1      33       -5.111660  hypothetical protein
MAP0095c  -1      35       -5.887935  NADH dehydrogenase subunit I
MAP0096c  -1      39       -5.455516  hypothetical protein
MAP0097c  -1      38       -4.754057  hypothetical protein
MAP0098c  0       41       -5.061732  hypothetical protein
MAP0099   0       39       -5.713147  hypothetical protein
MAP0100   -1      35       -4.201487  hypothetical protein
MAP0101   -2      31       -4.360173  hypothetical protein
MAP0102   -2      29       -4.670359  hypothetical protein
MAP0103c  -2      23       -3.691561  hypothetical protein
MAP0104   -1      19       -3.619132  hypothetical protein
MAP0105c  0       15       -3.401264  hypothetical protein
MAP0106c  1       9        -3.798200  hypothetical protein
MAP0107   1       11       -2.294556  hypothetical protein
MAP0108   2       12       -1.827919  hypothetical protein
MAP0109   2       13       -2.543623  Mce1B
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0098c  TETREPRESSOR  82     5e-21    Tetracycline repressor protein signature.
		>TETREPRESSOR#Tetracycline repressor protein signature.

Length = 218

Score = 81.9 bits (202), Expect = 5e-21
Identities = 61/221 (27%), Positives = 89/221 (40%), Gaps = 32/221 (14%)

Query: 27 ITRAAVLASALEIIDRDGVDGLSMRRLGEAVGRDPMALYRHVPNKAAVLDGVVEMVFER- 85
+ R +V+ +ALE+++ G+DGL+ R+L + +G + LY HV NK A+LD + + R
Sbjct: 4 LNRESVIDAALELLNETGIDGLTTRKLAQKLGIEQPTLYWHVKNKRALLDALAVEILARH 63

Query: 86 --LSLDTTTPDWAAALRKLGHEF-------RDLARAHPNVVPLLVTRPLATPLGMRPPGI 136
SL W + LR F RD A+ H P
Sbjct: 64 HDYSLPAAGESWQSFLRNNAMSFRRALLRYRDGAKVHLGTRP--------DEKQY----- 110

Query: 137 LRHLEEVLTLLIGAGFTGEDALHVYRALFGFLYGHVLTELQEIVERPEETDHVLRLGLHR 196
+E L + GF+ D L+ A+ F G VL E QE + L
Sbjct: 111 -DTVETQLRFMTENGFSLRDGLYAISAVSHFTLGAVL-EQQEHTAALTDRPAAPDENLPP 168

Query: 197 LPIDQFGHLRELAPVWASYDPLAELDRGLDILLSGLAVRLT 237
L LRE + S D GL+ L+ G V+LT
Sbjct: 169 L-------LREALQIMDSDDGEQAFLHGLESLIRGFEVQLT 202


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0099   PF07132       24     0.041    Harpin protein (HrpN)
		>PF07132#Harpin protein (HrpN)

Length = 356

Score = 23.9 bits (51), Expect = 0.041
Identities = 16/45 (35%), Positives = 21/45 (46%), Gaps = 1/45 (2%)

Query: 10 KFEAAKGSAKKVFGRATGNTGMRAEGRAGQVKG-NAKQAGDKLND 53
KF A G K TGNT + A G G G +A GD++ +
Sbjct: 304 KFMKAVGMIKSAVAGDTGNTNLHARGNGGASLGIDAAMIGDRIVN 348


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0108   PF03544       34     8e-04    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 34.2 bits (78), Expect = 8e-04
Identities = 20/101 (19%), Positives = 30/101 (29%)

Query: 358 PQNYHPPTDLAPPPGTQIGPDGNLVATGPPLYNPNPSLADPNPPLPWWPWQIGPAPRVPG 417
P + PP + PPP + P+ P + P P P + +
Sbjct: 57 PADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKR 116

Query: 418 TADPDDAPPPPPPSPAPPGPPPSPAPPGAVAPAAYGGNVGP 458
P ++ P P P P S A + GP
Sbjct: 117 DVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVASGP 157



Score = 33.0 bits (75), Expect = 0.002
Identities = 21/119 (17%), Positives = 24/119 (20%), Gaps = 10/119 (8%)

Query: 364 PTDLAPPPGTQIGPDGNLVATGPPLYNPNPSLADPNPPLPWWPWQIGPAPRVPGTADPDD 423
P P T + P PP P P P P P A
Sbjct: 44 PAPAQPISVTMVAPA----DLEPPQAVQPPPEPVVEPEPE-----PEPIPEPPKEAPVVI 94

Query: 424 APPPPPPSPAPPGPPPSPAPPGAVAPAAYGGNVGPVGSQRERDQLGLITGQGRPASVAT 482
P P P P P P V P + V +
Sbjct: 95 EKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAP-ARPTSSTATAATSKPVTS 152


4     MAP0153   MAP0160   Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP0153   2       10       0.758449   hypothetical protein
MAP0154   2       13       0.090543   hypothetical protein
MAP0155   3       14       -0.414595  hypothetical protein
MAP0156   3       14       -1.155470  hypothetical protein
MAP0157   3       11       0.438608   hypothetical protein
MAP0158   0       11       -0.990496  hypothetical protein
MAP0159c  1       11       -1.543795  hypothetical protein
MAP0160   2       14       0.205109   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0153   PF06580       37     9e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 37.2 bits (86), Expect = 9e-05
Identities = 24/136 (17%), Positives = 54/136 (39%), Gaps = 31/136 (22%)

Query: 308 DALRVYPDVEVSLVPSPTVLMIGLPTGLRLVIDNAIANAVKHG-----NAGKIQLTVSSS 362
D L+ + P ++ + +P +++ + N +KHG GKI L +
Sbjct: 238 DRLQFENQIN------PAIMDVQVP---PMLVQTLVENGIKHGIAQLPQGGKILLKGTKD 288

Query: 363 GEGVEIAIDDDGSGIPESERATVFERFARGSTAARSGSGLGLALVAQQ-AELHGGTAELQ 421
V + +++ GS ++ + +G GL V ++ L+G A+++
Sbjct: 289 NGTVTLEVENTGSLALKNT---------------KESTGTGLQNVRERLQMLYGTEAQIK 333

Query: 422 -NSPLGGTRLLLRLAG 436
+ G ++ + G
Sbjct: 334 LSEKQGKVNAMVLIPG 349


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0155   HTHTETR       45     3e-08    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 45.0 bits (106), Expect = 3e-08
Identities = 22/73 (30%), Positives = 31/73 (42%), Gaps = 1/73 (1%)

Query: 2 ATATRERFLTAATGLFRRQGYSGTGLKQIVAESRAPLGSLYHFFPGGKQDLAVQAIAHTA 61
A TR+ L A LF +QG S T L +I + G++Y F K DL + +
Sbjct: 9 AQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHF-KDKSDLFSEIWELSE 67

Query: 62 ERYRELLDRVFAR 74
EL A+
Sbjct: 68 SNIGELELEYQAK 80


5     MAP0174   MAP0196c  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP0174   2       12       0.973554   hypothetical protein
MAP0175c  1       14       0.113566   hypothetical protein
MAP0176   0       13       0.274489   hypothetical protein
MAP0177   -1      12       0.734371   hypothetical protein
MAP0178c  -2      14       1.784376   DNA polymerase IV
MAP0179c  1       15       0.986824   hypothetical protein
MAP0180   1       14       0.537849   hypothetical protein
MAP0181c  3       15       2.029493   ribonuclease activity regulator protein RraA
MAP0182c  5       14       0.690314   hypothetical protein
MAP0183c  2       13       0.052835   hypothetical protein
MAP0184c  1       10       -0.130902  hypothetical protein
MAP0185c  1       10       0.130722   hypothetical protein
MAP0186c  2       10       2.210506   hypothetical protein
MAP0187c  3       12       2.997633   hypothetical protein
MAP0188c  3       12       4.685738   hypothetical protein
MAP0189   3       13       5.004112   hypothetical protein
MAP0190   2       14       4.677171   GlpQ1
MAP0191c  2       13       4.921640   hypothetical protein
MAP0192c  0       14       4.023396   hypothetical protein
MAP0193   1       16       3.877459   prephenate dehydratase
MAP0194   1       15       3.152392   hypothetical protein
MAP0195c  1       15       2.231737   hypothetical protein
MAP0196c  2       14       2.428071   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0178c  BACINVASINB   29     0.047    Salmonella/Shigella invasin protein B signature.
		>BACINVASINB#Salmonella/Shigella invasin protein B signature.

Length = 593

Score = 28.6 bits (63), Expect = 0.047
Identities = 22/77 (28%), Positives = 35/77 (45%), Gaps = 3/77 (3%)

Query: 184 VGAVTAEKLRAHGIATVADVAELSESTLGSMVGAAMGRQLYALSRNIDRRRVSTGVRRRS 243
VGA+ A I VA V + + + LG+ + MG + L N+ ++ G + +
Sbjct: 410 VGAIVAAIAMVAVIVVVAVVGKGAAAKLGNALSKMMGETIKKLVPNVLKQLAQNGSKLFT 469

Query: 244 VGAQR---ALGRAGNTM 257
G QR LG G+ M
Sbjct: 470 QGMQRITSGLGNVGSKM 486


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0179c  HTHTETR       70     7e-17    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 69.7 bits (170), Expect = 7e-17
Identities = 28/123 (22%), Positives = 45/123 (36%), Gaps = 2/123 (1%)

Query: 14 RAIRPTGDEREQAILATAERLLETRPFAGISVDDLAKGAGLSRPTFYFYFKSKEAVVLSL 73
R + E Q IL A RL + + S+ ++AK AG++R Y++FK K + +
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 74 LEPVIARADAEFDGAVQRLPTDPRRVWRNGIKAFFTAFSS--HRALARAATEALATSSEL 131
E + + P DP V R + + + R L
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 132 RAV 134
AV
Sbjct: 123 MAV 125


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0182c  IGASERPTASE   27     0.029    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 27.0 bits (59), Expect = 0.029
Identities = 17/103 (16%), Positives = 34/103 (33%), Gaps = 9/103 (8%)

Query: 4 PQDPTNSAADGAGD-----PPEKKPPAKAAKKTAKAPAKKAPAKKAPAKKAPAKKAPAK- 57
P P+N+ D PP P++ + A+ +++ + + A A +
Sbjct: 1008 PSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNRE 1067

Query: 58 --KAPAKNTPTGGGQ-RADTNGDLTAAAKDAAAQAKSTVEAAD 97
K N +G T + + +TVE +
Sbjct: 1068 VAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEE 1110


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0187c  CHANLCOLICIN  27     0.046    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 27.3 bits (60), Expect = 0.046
Identities = 21/107 (19%), Positives = 48/107 (44%), Gaps = 5/107 (4%)

Query: 39 GVNDALAKLEEARANEDHAAIF-LNEKNLAFHLGGHVNHSIWWKNLSPDGGDKPTGELAA 97
V++A L++A+ N ++ I + ++F+ + + ++ + DK G+
Sbjct: 325 RVHEAEENLKKAQNNLLNSQIKDAVDATVSFYQTLTEKYGEKYSKMAQELADKSKGKKIG 384

Query: 98 AIDDAFGSFDKFRA----QFSAAANGLQGSGWAVLGYDTVGSRLLTF 140
+++A +F+K++ +FS A + A + YD L F
Sbjct: 385 NVNEALAAFEKYKDVLNKKFSKADRDAIFNALASVKYDDWAKHLDQF 431


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0191c  PF05616       38     7e-05    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 37.8 bits (87), Expect = 7e-05
Identities = 29/76 (38%), Positives = 33/76 (43%), Gaps = 7/76 (9%)

Query: 12 PGR-QGPPRQPGPEPSPVIPRPGGPAPSPHAPTQPLHRPPP--APPARP----APPARPA 64
PG + P QP PE SP PAP+ + T+P P P P A P P RP
Sbjct: 317 PGSAEAPNAQPLPEVSPAENPANNPAPNENPGTRPNPEPDPDLNPDANPDTDGQPGTRPD 376

Query: 65 PPASAIRPARPRRKRR 80
PA RP RK R
Sbjct: 377 SPAVPDRPNGRHRKER 392


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0196c  PERTACTIN     38     7e-05    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 38.2 bits (88), Expect = 7e-05
Identities = 19/48 (39%), Positives = 20/48 (41%)

Query: 400 QAPPPPVDNGAPPPANPAPEAPPAPVPPPAAPPPPPPPAPEAPPGGPP 447
+APP P P P P P P PP PP PEAP PP
Sbjct: 566 KAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPP 613



Score = 35.5 bits (81), Expect = 5e-04
Identities = 21/51 (41%), Positives = 22/51 (43%), Gaps = 1/51 (1%)

Query: 401 APPPPVDNGAPPPANPAPEAPPAPVPPPAAPPPPPPPAPEA-PPGGPPPAG 450
A PP AP P PP P PP P PP PP + P PPAG
Sbjct: 565 AKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAG 615



Score = 33.5 bits (76), Expect = 0.002
Identities = 18/40 (45%), Positives = 18/40 (45%)

Query: 409 GAPPPANPAPEAPPAPVPPPAAPPPPPPPAPEAPPGGPPP 448
GA P P P P P P P P PP PP P PP P
Sbjct: 564 GAKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQR 603



Score = 33.2 bits (75), Expect = 0.003
Identities = 18/48 (37%), Positives = 18/48 (37%)

Query: 398 PPQAPPPPVDNGAPPPANPAPEAPPAPVPPPAAPPPPPPPAPEAPPGG 445
PP P P P P P P PP P PP P P PP G
Sbjct: 568 PPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAG 615



Score = 31.6 bits (71), Expect = 0.007
Identities = 16/46 (34%), Positives = 16/46 (34%)

Query: 389 PGQQPVVTQPPQAPPPPVDNGAPPPANPAPEAPPAPVPPPAAPPPP 434
P QP P P PP P P PP P AP PP
Sbjct: 568 PPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPP 613



Score = 31.2 bits (70), Expect = 0.010
Identities = 17/49 (34%), Positives = 17/49 (34%)

Query: 393 PVVTQPPQAPPPPVDNGAPPPANPAPEAPPAPVPPPAAPPPPPPPAPEA 441
P PQ P P PP P P PP P P P PPA
Sbjct: 569 PAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAGRE 617



Score = 28.9 bits (64), Expect = 0.047
Identities = 16/52 (30%), Positives = 18/52 (34%)

Query: 388 LPGQQPVVTQPPQAPPPPVDNGAPPPANPAPEAPPAPVPPPAAPPPPPPPAP 439
L G + P P P PP P+ P P PP P P P P
Sbjct: 562 LVGAKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPP 613



Score = 28.9 bits (64), Expect = 0.047
Identities = 21/62 (33%), Positives = 23/62 (37%), Gaps = 1/62 (1%)

Query: 366 AQSPSTQSGSAAAPEMPRNNQHLPGQQPVVTQPPQAPPPPVDNGAPPPANPAPEAPPAPV 425
A + + Q A P QPPQ P PP P P PEA PAP
Sbjct: 553 AANGNGQWSLVGAKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEA-PAPQ 611

Query: 426 PP 427
PP
Sbjct: 612 PP 613


6     MAP0205   MAP0214   Y     N          N                   Genomic Island
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP0205   2       13       2.355278   hypothetical protein
MAP0206   1       12       1.405512   hypothetical protein
MAP0207   0       11       0.318276   hypothetical protein
MAP0208   0       9        0.707851   hypothetical protein
MAP0209c  1       9        0.320331   Csp
MAP0210c  3       11       0.572577   PirG
MAP0211   3       12       -0.298855  Glf
MAP0212   4       13       0.735912   hypothetical protein
MAP0213   6       15       1.793674   hypothetical protein
MAP0214   4       16       0.782301   phosphoribose diphosphate:decaprenyl-phosphate
7     MAP0272   MAP0284c  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP0272   2       9        0.638405   hypothetical protein
MAP0273   1       7        0.719881   hypothetical protein
MAP0274   2       7        1.347205   ProW
MAP0275   0       8        1.039837   ProZ
MAP0276   1       11       0.165767   hypothetical protein
MAP0277c  0       17       -1.185859  prephenate dehydrogenase
MAP0278   2       26       -3.199714  hypothetical protein
MAP0279   4       31       -5.171497  hypothetical protein
MAP0280   3       28       -5.284607  *hypothetical protein
MAP0281   3       29       -6.113335  hypothetical protein
MAP0282c  2       32       -5.643804  hypothetical protein
MAP0283c  1       26       -4.898426  hypothetical protein
MAP0284c  1       20       -3.895303  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0284c  CHANLCOLICIN  30     0.023    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 30.0 bits (67), Expect = 0.023
Identities = 39/169 (23%), Positives = 66/169 (39%), Gaps = 42/169 (24%)

Query: 21 AINQSLQAGSPFQIERLAEAFHNAGRCTAEADHAFQDARKRFDAAWNHQNGDHPINDSDE 80
A N ++QA + RLA+A A + A+ AFQ+A +R E
Sbjct: 118 ANNAAMQA--EDERLRLAKAEEKARKEAEAAEKAFQEAEQR----------------RKE 159

Query: 81 VQRVTKSLGAQSLQLPKIAADLEGIAASLAEAQKAGAQEIATLERELMFLDRLIGAAKED 140
++R A++ + K+A E A+L+E KA EIA
Sbjct: 160 IEREK----AETERQLKLAEAEEKRLAALSEEAKA--VEIA------------------Q 195

Query: 141 LKLNLPAAERAKLERLIKDAHADAVDDVRDAVKEMNSIRNAYSETLRKS 189
KL+ +E K++ IK ++ + EM ++ +E + S
Sbjct: 196 KKLSAAQSEVVKMDGEIKTLNSRLSSSIHARDAEMKTLAGKRNELAQAS 244


8     MAP0359c  MAP0364c  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP0359c  -1      11       3.503153   MoxR2
MAP0360c  -1      10       4.364697   hypothetical protein
MAP0361c  -1      11       4.543147   hypothetical protein
MAP0362c  -1      12       4.168334   hypothetical protein
MAP0363   -1      11       3.171422   hypothetical protein
MAP0364c  0       10       3.242256   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0359c  HTHFIS        35     4e-04    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 34.8 bits (80), Expect = 4e-04
Identities = 25/89 (28%), Positives = 36/89 (40%), Gaps = 4/89 (4%)

Query: 116 LLADEINRTPPKTQAALLEAMEERQVSVEGEAQPLPEPF-IVTATQNPVEYEGTYQLPEA 174
L DEI P Q LL +++ + + G P+ IV AT ++ L
Sbjct: 235 LFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFRE 294

Query: 175 QLDRFLLKLNVTLPP---REAEIAILHRH 200
L L + + LPP R +I L RH
Sbjct: 295 DLYYRLNVVPLRLPPLRDRAEDIPDLVRH 323


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0362c  PERTACTIN     30     0.027    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 29.7 bits (66), Expect = 0.027
Identities = 20/48 (41%), Positives = 20/48 (41%)

Query: 49 PPAYGPQSQYGAQYGAHFPGRYGAHYGPPPTYPPPGYPPAPAFGPPPG 96
PPA P Q G Q G P P P PP P APA PP G
Sbjct: 568 PPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAG 615


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0364c  TCRTETOQM     48     1e-07    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 47.5 bits (113), Expect = 1e-07
Identities = 26/116 (22%), Positives = 47/116 (40%), Gaps = 15/116 (12%)

Query: 4 LATAGHVDHGKSTLLHRLT---------------GMWPDRLAEEQRRGLTIDLGFVWTEL 48
+ HVD GK+TL L D E++RG+TI G +
Sbjct: 6 IGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQW 65

Query: 49 AGRRLAFVDVPGHERFVANMLAGVGPVPAVVFVVAATEGWMPQSEEHLAALDALRV 104
++ +D PGH F+A + + + + +++A +G Q+ AL + +
Sbjct: 66 ENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGI 121


9     MAP0373   MAP0381   Y     N          N                   Genomic Island
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP0373   2       12       0.219514   hypothetical protein
MAP0374c  3       11       -0.312073  hypothetical protein
MAP0375c  1       10       -0.847016  hypothetical protein
MAP0376c  0       11       -1.588431  hypothetical protein
MAP0377c  2       12       -0.766214  hypothetical protein
MAP0378c  2       12       -0.531234  hypothetical protein
MAP0379c  3       12       -0.267934  hypothetical protein
MAP0380   3       13       1.364573   hypothetical protein
MAP0381   3       14       0.946566   hypothetical protein
10    MAP0406   MAP0417   Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP0406   2       9        0.168813   hypothetical protein
MAP0407c  1       8        0.420910   acetyl-CoA synthetase
MAP0408   1       10       1.293958   hypothetical protein
MAP0409   0       10       0.733924   hypothetical protein
MAP0410   2       13       2.517458   hypothetical protein
MAP0411   2       12       3.520414   hypothetical protein
MAP0412   2       12       4.582761   DppD_1
MAP0413   2       12       6.392283   hypothetical protein
MAP0414c  2       11       5.977021   hypothetical protein
MAP0415   3       13       7.428714   hypothetical protein
MAP0416   3       13       4.670936   hypothetical protein
MAP0417   1       14       4.051294   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0406   V8PROTEASE    30     0.007    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 30.0 bits (67), Expect = 0.007
Identities = 35/201 (17%), Positives = 60/201 (29%), Gaps = 43/201 (21%)

Query: 17 VAVTGSHPRAAAADVRIPLGGGAGIVVNGDTMCTLTTIGGDAAGDLIGFTSAHC-----G 71
+ T + A +++ G + + +G D T+ H G
Sbjct: 79 ITDTTNGHYAPVTYIQVEAPTG-------TFIASGVVVGKDTL-----LTNKHVVDATHG 126

Query: 72 GPGAQVAAEGAENA------GILGTMVAGND-NLDYAVIKFDPAKVTPVANFNGFLISGI 124
P A A A N G + D A++KF P + +
Sbjct: 127 DPHALKAFPSAINQDNYPNGGFTAEQITKYSGEGDLAIVKFSPNEQNKHIG------EVV 180

Query: 125 GPDPAFGEIACKQGRT---TGNSCGVTWG-MGQTPGTI------VMQVCG--QPGDSGAP 172
P + + TG M ++ G I MQ G+SG+P
Sbjct: 181 KPATMSNNAETQVNQNITVTGYPGDKPVATMWESKGKITYLKGEAMQYDLSTTGGNSGSP 240

Query: 173 V-TVNNQLVGMIHGAFSDNLP 192
V N+++G+ G +
Sbjct: 241 VFNEKNEVIGIHWGGVPNEFN 261


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0415   SECA          29     0.035    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 29.1 bits (65), Expect = 0.035
Identities = 27/98 (27%), Positives = 35/98 (35%), Gaps = 25/98 (25%)

Query: 163 NTPGLR--WPDLALQGGRLNWAAV-RDALPRH-RGVSVLSGT--------------RRG- 203
N P +R PDL A+ D R +G VL GT + G
Sbjct: 415 NRPMIRKDLPDLVYMTEAEKIQAIIEDIKERTAKGQPVLVGTISIEKSELVSNELTKAGI 474

Query: 204 -HEVDAGPVHA-----VVDAGRRGAVSVICDLPRRFTD 235
H V HA V AG AV++ ++ R TD
Sbjct: 475 KHNVLNAKFHANEAAIVAQAGYPAAVTIATNMAGRGTD 512


11    MAP0449   MAP0463   Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP0449   -2      11       3.279815   GTP cyclohydrolase I
MAP0450   -1      10       4.323175   hypothetical protein
MAP0451   -1      10       4.101249   FolX
MAP0452   0       10       3.693090   hypothetical protein
MAP0453   -1      9        2.989796   hypothetical protein
MAP0454   -2      7        1.615360   hypothetical protein
MAP0455   -1      7        -0.205219  hypothetical protein
MAP0456   1       9        -1.626771  pantoate--beta-alanine ligase
MAP0457   2       11       -2.095206  aspartate alpha-decarboxylase
MAP0458   2       9        -1.279896  pantothenate kinase
MAP0459   2       10       -1.015602  lysyl-tRNA synthetase
MAP0460   0       11       -1.802034  Lsr2
MAP0461   -2      10       -1.230441  ClpC
MAP0462   1       15       -0.971488  hypothetical protein
MAP0463   2       13       -0.529425  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0454   TONBPROTEIN   36     3e-04    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 35.7 bits (82), Expect = 3e-04
Identities = 30/97 (30%), Positives = 44/97 (45%)

Query: 214 PRPQQPAPLRRERPRYQEPPGPTEPRFEPRYQPPPAAAATPPPPETPPEPPREPQPEPQP 273
P P QP + P EPP +P EP +P P P PP+ P +P+P+P+P
Sbjct: 39 PAPAQPISVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKP 98

Query: 274 EPQPQPQGRDWQPVGAEGQWLPPGSPGSNWAGADAES 310
+P+P + ++ + P SP N A A S
Sbjct: 99 KPKPVKKVQEQPKRDVKPVESRPASPFENTAPARLTS 135



Score = 30.0 bits (67), Expect = 0.015
Identities = 18/101 (17%), Positives = 32/101 (31%), Gaps = 1/101 (0%)

Query: 241 EPRYQPPPAAAATPPPPETPPEPPREPQ-PEPQPEPQPQPQGRDWQPVGAEGQWLPPGSP 299
P+ PP P PE P P + P +P+P+P+ + + Q P
Sbjct: 57 PPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKP 116

Query: 300 GSNWAGADAESTGGRRRARHSGADEAWDGMPVEPVPPPPYA 340
+ + E+T R + + P +
Sbjct: 117 VESRPASPFENTAPARLTSSTATAATSKPVTSVASGPRALS 157



Score = 29.6 bits (66), Expect = 0.024
Identities = 27/126 (21%), Positives = 42/126 (33%), Gaps = 6/126 (4%)

Query: 195 PGRHDETAIIDVPEEPLMPPRPQQPAPLRRERPRYQEPPGPTEPRFEPRYQPPPAAAATP 254
P ++ V L PP+ QP P P + P P P+ P P
Sbjct: 39 PAPAQPISVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKP-----K 93

Query: 255 PPPETPPEPPREPQPEPQPEPQPQPQGRDWQPVGAEGQWLPPGSPGSNWAGADAESTGGR 314
P P+ P+P ++ Q +P+ + +P + R P S + S
Sbjct: 94 PKPKPKPKPVKKVQEQPKRDVKPV-ESRPASPFENTAPARLTSSTATAATSKPVTSVASG 152

Query: 315 RRARHS 320
RA
Sbjct: 153 PRALSR 158


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0458   PF03309       369    e-132    Bvg accessory factor
		>PF03309#Bvg accessory factor

Length = 271

Score = 369 bits (950), Expect = e-132
Identities = 236/271 (87%), Positives = 255/271 (94%)

Query: 1 MLLAIDVRNTHTVVGLISGSKEHAKVVQQWRIRTESEITADELALTIDGLIGDDSERLTG 60
MLLAIDVRNTHTVVGLISGS +HAKVVQQWRIRTE E+TADELALTIDGLIGDD+ERLTG
Sbjct: 1 MLLAIDVRNTHTVVGLISGSGDHAKVVQQWRIRTEPEVTADELALTIDGLIGDDAERLTG 60

Query: 61 AAALSTVPSVLHEVRLMLDQYWPSVPHVLIEPGVRTGIPLLVDNPKEVGADRIVNCLAAF 120
A+ LSTVPSVLHEVR+ML+QYWP+VPHVLIEPGVRTGIPLLVDNPKEVGADRIVNCLAA+
Sbjct: 61 ASGLSTVPSVLHEVRVMLEQYWPNVPHVLIEPGVRTGIPLLVDNPKEVGADRIVNCLAAY 120

Query: 121 HRFQSPAIVIDFGSSICVDVVSAKGEFLGGAIAPGLQVSSDAAAARSAALRRVELARPRS 180
H++ + AIV+DFGSSICVDVVSAKGEFLGGAIAPG+QVSSDAAAARSAALRRVEL RPRS
Sbjct: 121 HKYGTAAIVVDFGSSICVDVVSAKGEFLGGAIAPGVQVSSDAAAARSAALRRVELTRPRS 180

Query: 181 VIGKNTVECMQAGAVFGFAGLVDGLVGRIREDVPGFGGDDVAIVATGHTAPLLLPELDTV 240
VIGKNTVECMQAGAVFGFAGLVDGLV RIR+DV GF G DVA+VATGHTAPL+LP+L TV
Sbjct: 181 VIGKNTVECMQAGAVFGFAGLVDGLVNRIRDDVDGFSGADVAVVATGHTAPLVLPDLRTV 240

Query: 241 SHYDQHLTLHGLRLVFERNRDAQRGRLKTAR 271
HYD+HLTL GLRLVFERNR QRG+LK AR
Sbjct: 241 EHYDRHLTLDGLRLVFERNRANQRGKLKPAR 271


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0461   HTHFIS        32     0.014    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 31.7 bits (72), Expect = 0.014
Identities = 30/138 (21%), Positives = 53/138 (38%), Gaps = 22/138 (15%)

Query: 518 IIGQEDAVKAVSKAIRRTRAGLKDPKRPSGSFIFAGPSGVGKTELSKALANFLFGDDDAL 577
++G+ A++ + + + R + + + G SG GK +++AL ++ +
Sbjct: 139 LVGRSAAMQEIYRVLARL-------MQTDLTLMITGESGTGKELVARALHDYGKRRNGPF 191

Query: 578 IQIDMGEFHDRFTASRLFGAPPGYVGYEEGGQLTEKVRRKP--FS-----VVLFDEIEKA 630
+ I+M S LFG E G T R F + DEI
Sbjct: 192 VAINMAAIPRDLIESELFGH--------EKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDM 243

Query: 631 HQEIYNSLLQVLEDGRLT 648
+ LL+VL+ G T
Sbjct: 244 PMDAQTRLLRVLQQGEYT 261


12    MAP0472c  MAP0485c  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP0472c  2       11       1.841787   DNA integrity scanning protein DisA
MAP0473c  -1      11       1.958458   DNA repair protein RadA
MAP0474c  0       12       2.208992   hypothetical protein
MAP0475   -2      11       2.086969   hypothetical protein
MAP0476   -1      10       2.890948   2-C-methyl-D-erythritol 4-phosphate
MAP0477   0       9        2.927841   2-C-methyl-D-erythritol 2,4-cyclodiphosphate
MAP0478   0       8        3.039366   cysteinyl-tRNA synthetase
MAP0479   1       11       3.613031   hypothetical protein
MAP0480   1       12       3.497830   hypothetical protein
MAP0481c  1       12       3.909185   hypothetical protein
MAP0482   2       16       3.854823   hypothetical protein
MAP0483   2       18       3.844078   AbsR2
MAP0484c  4       17       3.821131   ArsB2
MAP0485c  2       13       3.330751   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0484c  RTXTOXINA     30     0.017    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 30.3 bits (68), Expect = 0.017
Identities = 19/86 (22%), Positives = 35/86 (40%), Gaps = 6/86 (6%)

Query: 55 LSGVVAFLGAVLVLAKLCDDEGLFEAAGAAIARGRVGSAGMLRRVFVIASAITAVLSLDA 114
+SG+++ + A +L+ D AAG + +G+ G ++IA LS A
Sbjct: 245 VSGILSAISASFILSNADADTRTKAAAGVELTTKVLGNVGKGISQYIIAQRAAQGLSTSA 304

Query: 115 AVVLLTPVVLAAVRRQRTAVRPYAYA 140
A L + A+ P ++
Sbjct: 305 AAAGLIASAVTL------AISPLSFL 324


13    MAP0580c  MAP0596c  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP0580c  3       11       1.745077   hypothetical protein
MAP0581c  3       11       2.355857   hypothetical protein
MAP0582   4       11       2.073143   BpoA
MAP0583   5       11       0.297692   hypothetical protein
MAP0584   6       13       -0.553047  hypothetical protein
MAP0585   5       14       0.145611   hypothetical protein
MAP0586c  2       13       -1.205466  hypothetical protein
MAP0587   -1      11       -1.218116  hypothetical protein
MAP0588   -2      11       -1.246023  hypothetical protein
MAP0589c  0       9        -0.511943  hypothetical protein
MAP0590   2       11       -0.212053  *hypothetical protein
MAP0591   2       14       -1.825090  PhoP
MAP0592   2       15       -1.923578  PhoR
MAP0593c  2       16       -4.292121  hypothetical protein
MAP0594c  2       15       -3.655724  hypothetical protein
MAP0595c  2       12       -3.873508  AdhB
MAP0596c  3       11       -3.546779  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0583   PF05272       28     0.026    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.5 bits (63), Expect = 0.026
Identities = 14/40 (35%), Positives = 16/40 (40%), Gaps = 7/40 (17%)

Query: 18 AATAVAKTAGAAPPPALPAP-------PPSPGIGVDHENG 50
+ TA A AG PP P P PG G D E+
Sbjct: 389 SPTAAAGGAGGGEPPKKRDPSAGAGTDPGGPGGGDDGEDP 428


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0585   VACCYTOTOXIN  33     0.002    Helicobacter pylori vacuolating cytotoxin signature.
		>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature.

Length = 1291

Score = 32.7 bits (74), Expect = 0.002
Identities = 16/55 (29%), Positives = 19/55 (34%), Gaps = 2/55 (3%)

Query: 95 QNGPNNA--PQGGQNNPPQGGPNNAPQGGQNNPPQGGPNNAPQNGPNNPPQGGQN 147
G N P+GG + P P+N Q N Q N NPP Q
Sbjct: 318 SAGLNIIAPPEGGYKDKPNDKPSNTTQNNAKNDKQESSQNNSNTQVINPPNSAQK 372



Score = 32.3 bits (73), Expect = 0.003
Identities = 20/71 (28%), Positives = 26/71 (36%), Gaps = 10/71 (14%)

Query: 118 PQGGQNNPPQGGPNNAPQNGPNNPPQGGQNEPGGQYPPNQHGQNPPQNGQN---QPPYDH 174
P+GG + P P+N QN N + Q N NPP + Q QP
Sbjct: 327 PEGGYKDKPNDKPSNTTQNNAKN-----DKQESSQNNSNTQVINPPNSAQKTEIQPT--Q 379

Query: 175 DQNPPTPGGPT 185
+ P GG
Sbjct: 380 VIDGPFAGGKN 390



Score = 31.9 bits (72), Expect = 0.004
Identities = 23/91 (25%), Positives = 31/91 (34%), Gaps = 6/91 (6%)

Query: 76 TPPQGGQNNPPPNGPNNPPQNGPNNAPQGGQNNPPQGGPNNAPQGGQNNPPQGGPNNAPQ 135
PP+GG + P + P+N QN N Q N N P Q Q P
Sbjct: 325 APPEGGYKDKPNDKPSNTTQNNAKNDKQESSQNNSNTQVINPPNSAQKTEIQ------PT 378

Query: 136 NGPNNPPQGGQNEPGGQYPPNQHGQNPPQNG 166
+ P GG+N N + + G
Sbjct: 379 QVIDGPFAGGKNTVVNINRINTNADGTIRVG 409


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0591   HTHFIS        110    3e-30    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 110 bits (277), Expect = 3e-30
Identities = 36/136 (26%), Positives = 63/136 (46%)

Query: 13 ARVLVVDDEANIVELLSVSLKFQGFEVHTATNGAQALDRAREARPDAVILDVMMPGMDGF 72
A +LV DD+A I +L+ +L G++V +N A D V+ DV+MP + F
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 73 GVLRRLRADGIDAPALFLTARDSLQDKIAGLTLGGDDYVTKPFSLEEVVARLRVILRRAG 132
+L R++ D P L ++A+++ I G DY+ KPF L E++ + L
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 133 KGGAEPRSARLTFADI 148
+ ++ +
Sbjct: 124 RRPSKLEDDSQDGMPL 139


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0592   PF06580       35     7e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 34.8 bits (80), Expect = 7e-04
Identities = 21/104 (20%), Positives = 34/104 (32%), Gaps = 24/104 (23%)

Query: 370 LIANALQH----TPESADVTVRVGTDGDDAVLEVADRGPGMNEQDASRVFERFYRTDSSR 425
L+ N ++H P+ + ++ D LEV + G +
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE------------- 309

Query: 426 ARASGGTGLGLSIV-ESLVRAHGGTVGVTTAPGQGC-CFRVTLP 467
TG GL V E L +G + + QG V +P
Sbjct: 310 -----STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348


14  MAP0616c  MAP0629c  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias      Product
MAP0616c  4       8        -0.137895    hypothetical protein
MAP0617   3       9        -0.403467    hypothetical protein
MAP0618c  3       9        -0.955433    hypothetical protein
MAP0619c  3       10       -0.601404    hypothetical protein
MAP0620   0       11       -0.200159    hypothetical protein
MAP0621   -1      11       -0.280918    FAD-binding dehydrogenase
MAP0622c  0       15       -0.615691    hypothetical protein
MAP0623   -2      14       -0.125684    hypothetical protein
MAP0624   -1      11       -0.384472    hypothetical protein
MAP0625   3       11       -1.339589    phosphoribosylformylglycinamidine synthase
MAP0626   5       12       -1.514101    phosphoribosylformylglycinamidine synthase I
MAP0627c  3       10       -0.773546    hypothetical protein
MAP0628c  2       10       -0.528047    hypothetical protein
MAP0629c  4       12       -1.521698    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0617   HTHTETR       50     7e-10    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 49.6 bits (118), Expect = 7e-10
Identities = 27/127 (21%), Positives = 46/127 (36%), Gaps = 3/127 (2%)

Query: 15 ILAEAARLVAERGAERVSLRELARCAGVSHAAPAHHFTDRRGLFTALATQGFELLTQALA 74
IL A RL +++G SL E+A+ AGV+ A HF D+ LF+ + + +
Sbjct: 16 ILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELEL 75

Query: 75 DARGDFADAALAYVQFALDHPGHY-QVMFNRSLLDASDGGLAAAEAAAGAELSRGVATLR 133
+ + F L+ ++ L H R LL E + +
Sbjct: 76 EYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEII--FHKCEFVGEMAVVQQAQRNL 133

Query: 134 DPHARAD 140
+
Sbjct: 134 CLESYDR 140


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0618c  TCRTETB       134    2e-36    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 134 bits (339), Expect = 2e-36
Identities = 85/426 (19%), Positives = 173/426 (40%), Gaps = 27/426 (6%)

Query: 51 IMAVLDSTVVAVAQRTFIAQFGVNQAIVSWTIAGYMLAFATVIPITGWAADRFGTKRLFM 110
+VL+ V+ V+ F A +W +ML F+ + G +D+ G KRL +
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 111 GSVLIFTLGSLLCAVAPNIL-LLILFRVVQGVGGGMLLPLSFVILTREAGPKRVGRLMAV 169
++I GS++ V + LLI+ R +QG G L V++ R + G+ +
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 170 GGIPILLGPIGGPILGGWLIGAYGWKWIFLINLPIGLTAFALAALLFPKDRSAPSEALDI 229
G + +G GP +GG + W +L+ +P+ + K DI
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKEVRIKGHFDI 201

Query: 230 TGALLLSPGVAIFLCGVCSIPGRHTVADRYVLVPALVGLVLIAAFILHAWYRTEHPLIDL 289
G +L+S G+ F+ T L+ +++ ++ I + P +D
Sbjct: 202 KGIILMSVGIVFFMLFT-------TSYSISFLIVSVLSFLIFVKHIR----KVTDPFVDP 250

Query: 290 RLFRN-PVVTQVNVTLLVFAAASVGVGLLVPSYFQIVGHETPMQSG-LHMLPIGVGAVLT 347
L +N P + V ++F G +VP + V + + G + + P + ++
Sbjct: 251 GLGKNIPFMIGVLCGGIIFGTV-AGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIF 309

Query: 348 MPLGGAVMDKHGPGKIVLTGLPLMAVG---LAVFTYGVARQAAYSPVLVCGLAIMGLGIG 404
+GG ++D+ GP ++ G+ ++V + + V V G G+
Sbjct: 310 GYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLG------GLS 363

Query: 405 LTTTPLSAALMQALAPHQVARGTTLISVNQQVGGSIGAALMAVILTNQF-NRNPALMAAN 463
T T +S + +L + G +L++ + G A++ +L+ ++ M +
Sbjct: 364 FTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQRLLPMEVD 423

Query: 464 EAAGMH 469
++ ++
Sbjct: 424 QSTYLY 429


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0619c  TCRTETB       124    9e-33    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 124 bits (312), Expect = 9e-33
Identities = 80/412 (19%), Positives = 170/412 (41%), Gaps = 21/412 (5%)

Query: 46 VCILATVMAILDVTVVSVAQRTFIDQFSSSQAVVAWTMTGYTLALATVIPITGWAADRFG 105
+CIL + ++L+ V++V+ + F+ A W T + L + + G +D+ G
Sbjct: 19 LCIL-SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLG 77

Query: 106 TKRLFIGSVLAFMLGSLLCALAANVLQLIVF-RVVQGIGGGMLLPLGFMILTREAGPRRL 164
KRL + ++ GS++ + + L++ R +QG G L +++ R
Sbjct: 78 IKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENR 137

Query: 165 GRLMSILSIPMLLAPIGGPILGGWLIDTSSWRWIFLINVPIGLLTVALAAVVFPRDHPAR 224
G+ ++ + + GP +GG + W +L+ +P+ + + +
Sbjct: 138 GKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKEVRI 195

Query: 225 SETFDAVGVLLLSPGLATFLFAVSSIPGRGTVADRHVLIPAAMGLTLIAGFVGHAWHRAD 284
FD G++L+S G+ F+ T LI + + + FV H +
Sbjct: 196 KGHFDIKGIILMSVGIVFFMLFT-------TSYSISFLIVSVLSFLI---FVKHI-RKVT 244

Query: 285 HPLIDLRLFRN-PVLTHANVTMLVFATAFFGAGLLLPSYFQQVLHQTPMQAG-VHMIPQG 342
P +D L +N P + ++F T G ++P + V + + G V + P
Sbjct: 245 DPFVDPGLGKNIPFMIGVLCGGIIFGT-VAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGT 303

Query: 343 LGAMLTVRLTGPLVDRQGPGKVVLVGIALITAGLGAFAFGVARQAPYLPTLLAGLAITGL 402
+ ++ + G LVDR+GP V+ +G+ ++ +F + + T+ +
Sbjct: 304 MSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASF-LLETTSWFMTI--IIVFVLG 360

Query: 403 GMGCTMMPLSVASVQALAPHQIARGTTLMSVSHQVGGSMGTALMSMILTNQF 454
G+ T +S +L + G +L++ + + G A++ +L+
Sbjct: 361 GLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPL 412


15  MAP0750c  MAP0777c  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias      Product
MAP0750c  2       9        -2.135646    hypothetical protein
MAP0751c  2       11       -1.610391    hypothetical protein
MAP0752c  1       11       -2.110570    hypothetical protein
MAP0753c  0       11       -2.477906    hypothetical protein
MAP0754c  0       11       -2.594039    hypothetical protein
MAP0755   0       12       -3.016262    AldA_2
MAP0756   0       16       -4.236380    hypothetical protein
MAP0757   1       17       -4.952282    hypothetical protein
MAP0758   0       16       -4.515269    hypothetical protein
MAP0759   1       16       -3.629945    hypothetical protein
MAP0760   0       18       -3.522532    hypothetical protein
MAP0761   0       18       -3.417772    hypothetical protein
MAP0762   0       20       -3.272965    hypothetical protein
MAP0763   0       21       -2.604833    hypothetical protein
MAP0764   -1      22       -2.805418    hypothetical protein
MAP0765   0       22       -3.426079    hypothetical protein
MAP0766c  1       22       -3.818372    hypothetical protein
MAP0767c  0       20       -3.967532    hypothetical protein
MAP0768c  -1      15       -4.151326    hypothetical protein
MAP0769   -2      13       -4.579259    hypothetical protein
MAP0770c  0       13       -4.512432    hypothetical protein
MAP0771   -1      12       -4.608603    hypothetical protein
MAP0772   -1      11       -3.576544    hypothetical protein
MAP0773   0       12       -3.392609    hypothetical protein
MAP0774c  2       11       -3.224334    hypothetical protein
MAP0775   2       12       -3.379230    hypothetical protein
MAP0776c  2       12       -3.167103    hypothetical protein
MAP0777c  2       11       -2.922848    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0751c  IGASERPTASE   34     9e-04    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 33.9 bits (77), Expect = 9e-04
Identities = 23/124 (18%), Positives = 43/124 (34%), Gaps = 5/124 (4%)

Query: 6 KPPHDSATSDEDVKALLEEAEAEAAEAEALAAAARARARAARLRREAQAQAAKAAAETSE 65
DV ++ E A EA + E A+ +K ++T E
Sbjct: 995 TNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSET--TETVAENSKQESKTVE 1052

Query: 66 ADEHAADDEAAETDDEVSDSVEATDADDTETTQAETKEAAKETEPKEEAAEETAEDAESE 125
+E A + A+ + ++ A+ T E ++ ET+ + + E E
Sbjct: 1053 KNEQDATETTAQNREVAKEAKSNVKAN---TQTNEVAQSGSETKETQTTETKETATVEKE 1109

Query: 126 SKAR 129
KA+
Sbjct: 1110 EKAK 1113


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0754c  DHBDHDRGNASE  132    8e-40    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 132 bits (332), Expect = 8e-40
Identities = 79/248 (31%), Positives = 122/248 (49%), Gaps = 19/248 (7%)

Query: 2 KTAVVTGGGSGIGLAVVERLRADGLNVASIDLRPSDAEL-------------AFTADVTD 48
K A +TG GIG AV L + G ++A++D P E AF ADV D
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 49 RSQVDAALSAIRAQLGPVTVLVNAAGLDGFKKFNNITFEDWQRVIDVNLNGVFHTTQAVL 108
+ +D + I ++GP+ +LVN AG+ ++++ E+W+ VN GVF+ +++V
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 109 PDMIEAGWGRIVNISSSSTHSGAPYMSHYVAAKSAVNGLTKSLALEYGPKGITVNAVPPG 168
M++ G IV + S+ M+ Y ++K+A TK L LE I N V PG
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 169 FIDTPMLRA------AEKNGFLGDIEETIARTPVRRMGTPQDIAAACAFLVSEEAGYITG 222
+T M + + G +E P++++ P DIA A FLVS +AG+IT
Sbjct: 189 STETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHITM 248

Query: 223 QILGVNGG 230
L V+GG
Sbjct: 249 HNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0763   PF03544       29     0.010    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 29.2 bits (65), Expect = 0.010
Identities = 17/84 (20%), Positives = 24/84 (28%), Gaps = 6/84 (7%)

Query: 126 LQKSVDPSKILYSEPRLAPGGEGPKPGPPEIPPAVSAYTGLPGDPVGPPGAEPPARIPGA 185
P ++ EP P E PK P I P P + +
Sbjct: 64 QAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPK------PKPKPKPKPVKKVEQPKRD 117

Query: 186 AMPLPPPPSTPMPPPPPPEPGVSG 209
P+ P++P P P S
Sbjct: 118 VKPVESRPASPFENTAPARPTSST 141


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0764   PRTACTNFAMLY  33     0.003    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 32.7 bits (74), Expect = 0.003
Identities = 25/80 (31%), Positives = 37/80 (46%)

Query: 95 GQTSLLGSMHVELNTPLGQQGSGRLQPGATIPLSRSSAYPSTEQTLSSLGAVVNGGGLGQ 154
GQ SL+G+ P Q G QP P + + P+ + ++ A VN GG+G
Sbjct: 562 GQWSLVGAKAPPAPKPAPQPGPQPPQPPQPQPEAPAPQPPAGRELSAAANAAVNTGGVGL 621

Query: 155 IGEIIHNFSAALSGREGAVR 174
+ + S ALS R G +R
Sbjct: 622 ASTLWYAESNALSKRLGELR 641



Score = 28.9 bits (64), Expect = 0.048
Identities = 23/62 (37%), Positives = 25/62 (40%), Gaps = 3/62 (4%)

Query: 365 GPGPRQIVGDPLPGPPPGAAPLPGPPPGAAPLPGPPPGAASLPDAGLGQTPPAATAPTEG 424
G G +VG P P P AP PGP P P P P A P + AA A
Sbjct: 560 GNGQWSLVGAKAP-PAPKPAPQPGPQPPQPPQPQPEAPAPQPPAG--RELSAAANAAVNT 616

Query: 425 GG 426
GG
Sbjct: 617 GG 618


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0765   PF03544       35     8e-04    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 34.6 bits (79), Expect = 8e-04
Identities = 12/89 (13%), Positives = 18/89 (20%), Gaps = 1/89 (1%)

Query: 460 AAVPPVPS-SGPPALAPMSRMSADLPPIAPLDVPTPTELPPPPPPPPAPAAPDQVDGAAP 518
A + P + PP + P P + P E P P P P
Sbjct: 58 ADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRD 117

Query: 519 QAAPSAFAGKASKPAPSVVVAKYDPRTGR 547
+ +
Sbjct: 118 VKPVESRPASPFENTAPARPTSSTATAAT 146



Score = 33.4 bits (76), Expect = 0.002
Identities = 24/119 (20%), Positives = 34/119 (28%), Gaps = 6/119 (5%)

Query: 435 LPPGAVPRGAPAGPRGENPPPGSVGAAVPPVPSSGPPALAPMSRMSADLPPIAPLDVPTP 494
LP A P + PP +V PP P P P + P AP+ + P
Sbjct: 43 LPAPAQPISVTMVAPADLEPPQAVQP--PPEPVVEPE---PEPEPIPEPPKEAPVVIEKP 97

Query: 495 TELPPPPPPPPAPAAPDQVDGAAPQAAPSAFAGKASKPAPSVVVAKYDPRTGRYVGPDG 553
+ P P P P P + A + + PA +
Sbjct: 98 -KPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVAS 155


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0770c  HTHTETR       59     2e-13    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 59.3 bits (143), Expect = 2e-13
Identities = 29/90 (32%), Positives = 42/90 (46%), Gaps = 3/90 (3%)

Query: 2 LDAALDLFAANGVSGTSLQMIADAVGITKAAVYHQFRTKEQIVIAVTERELGRLVPALEE 61
LD AL LF+ GVS TSL IA A G+T+ A+Y F+ K + + E + E
Sbjct: 17 LDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELE 76

Query: 62 AEAHDDG---PQARDALLVRVIEMAVRDRR 88
+A G R+ L+ + +RR
Sbjct: 77 YQAKFPGDPLSVLREILIHVLESTVTEERR 106


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0775   HTHTETR       56     1e-11    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 55.8 bits (134), Expect = 1e-11
Identities = 28/156 (17%), Positives = 60/156 (38%), Gaps = 10/156 (6%)

Query: 16 ARRIGAPDAKNRGLLLDAAERLMLEEGYAAVTSRRLASRAGLKPQLVHYYFRTMEELFLE 75
AR+ + R +LD A RL ++G ++ + +A AG+ ++++F+ +LF E
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 76 VFRRRAEEGLAVQAQALQSDQP--LWAVWRFGTDPAFTQISME-----FMALANHRKDMR 128
++ + Q+ P +V R E M + H+ +
Sbjct: 62 IWELSESN-IGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120

Query: 129 AEIAYYA--ERFRDEQQRAVAAALERYGVQNKDVPP 162
E+A +R + ++ ++ K +P
Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPA 156


16  MAP0792c  MAP0806c  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias      Product
MAP0792c  2       12       -0.306778    hypothetical protein
MAP0793c  1       13       -0.602332    hypothetical protein
MAP0794   1       11       -1.085014    hypothetical protein
MAP0795   1       10       0.712392     hypothetical protein
MAP0796c  0       10       0.772274     hypothetical protein
MAP0797   -1      10       0.979985     hypothetical protein
MAP0798   -1      8        1.024357     hypothetical protein
MAP0799c  1       9        1.592276     hypothetical protein
MAP0800c  0       9        3.290125     hypothetical protein
MAP0801   3       11       2.380542     hypothetical protein
MAP0802   2       11       2.810721     molybdenum cofactor biosynthesis protein MoaC
MAP0803   1       14       3.142758     Mog
MAP0804   1       14       2.826792     MoaE2
MAP0805c  -1      14       2.439976     hypothetical protein
MAP0806c  2       12       -0.041654    MoaD2
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0794   HTHTETR       63     1e-14    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 63.1 bits (153), Expect = 1e-14
Identities = 32/109 (29%), Positives = 52/109 (47%), Gaps = 6/109 (5%)

Query: 1 MGRPRSDTRERIQQVARELFSQRGVQRTSLQDIADRLGITKPALYYHFPSREDLVRSILV 60
+ +TR+ I VA LFSQ+GV TSL +IA G+T+ A+Y+HF + DL I
Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64

Query: 61 PLIEEGERFVAEHESREHTE----ARELLEGYFD--FHYRHRRDLVLLL 103
E++++ + RE+L + RR L+ ++
Sbjct: 65 LSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEII 113


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0803   ARGDEIMINASE  32     6e-04    Bacterial arginine deiminase signature.
		>ARGDEIMINASE#Bacterial arginine deiminase signature.

Length = 409

Score = 32.5 bits (74), Expect = 6e-04
Identities = 8/45 (17%), Positives = 21/45 (46%)

Query: 31 WLAQQGFSSAQPEVVADGSPVGEALRKAIDDDVDVILTSGGTGIA 75
++ SS++ + + + + + L + +D+I +GG I
Sbjct: 298 YVLTYNPSSSKIHIKKEKARIKDVLSFYLGRKIDIIKCAGGDLIH 342


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0805c  PF03544       37     4e-05    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 36.9 bits (85), Expect = 4e-05
Identities = 17/65 (26%), Positives = 19/65 (29%), Gaps = 1/65 (1%)

Query: 140 VPQAPPADAPPPAPVQLASFD-RPAPPDAPAPDAPPPAPADLPPAPPADATPPAPPADAP 198
AP PP A P P P P+ P AP + P P P
Sbjct: 53 TMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVE 112

Query: 199 APDGD 203
P D
Sbjct: 113 QPKRD 117



Score = 29.2 bits (65), Expect = 0.012
Identities = 20/100 (20%), Positives = 26/100 (26%), Gaps = 1/100 (1%)

Query: 126 VPAPAGLDAPGVNGVPQAPPADAPPPAPVQLASFDRPAPPDAPAPDAPPPAP-ADLPPAP 184
A P V P+ P PP + +P P P P P D+ P
Sbjct: 63 PQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVE 122

Query: 185 PADATPPAPPADAPAPDGDFHGFVPAGMPDHVSEAGYTQR 224
A+P A A + S R
Sbjct: 123 SRPASPFENTAPARPTSSTATAATSKPVTSVASGPRALSR 162


17  MAP0844  MAP0871c  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias      Product
MAP0844   0       15       -3.217381    hypothetical protein
MAP0845   0       16       -3.286681    hypothetical protein
MAP0846   0       23       -3.581618    hypothetical protein
MAP0847   0       28       -4.531975    hypothetical protein
MAP0848   1       37       -6.091460    hypothetical protein
MAP0849c  1       45       -7.446242    hypothetical protein
MAP0850c  0       51       -8.261411    hypothetical protein
MAP0851   1       59       -9.998914    hypothetical protein
MAP0852   1       59       -10.311056   hypothetical protein
MAP0853   1       58       -10.151489   hypothetical protein
MAP0854   1       57       -9.970840    hypothetical protein
MAP0855   2       56       -9.990349    hypothetical protein
MAP0856c  2       56       -10.063234   hypothetical protein
MAP0857c  4       57       -9.684693    hypothetical protein
MAP0858   3       58       -10.337549   hypothetical protein
MAP0859c  4       59       -10.288914   hypothetical protein
MAP0860c  4       60       -10.450658   hypothetical protein
MAP0861   4       59       -10.178905   hypothetical protein
MAP0862   4       50       -9.168700    hypothetical protein
MAP0863   0       39       -6.473378    hypothetical protein
MAP0864   -2      28       -4.729645    hypothetical protein
MAP0865   -2      21       -3.965843    hypothetical protein
MAP0866   -2      11       -1.977833    hypothetical protein
MAP0867c  -2      10       -1.605614*   hypothetical protein
MAP0868c  -1      8        -1.197840    hypothetical protein
MAP0869c  -2      7        -1.798809    manganese transport protein MntH
MAP0870c  0       9        -2.796003    hypothetical protein
MAP0871c  2       8        -2.219892    short chain dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0847   PF03544       37     1e-04    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 36.9 bits (85), Expect = 1e-04
Identities = 25/96 (26%), Positives = 31/96 (32%), Gaps = 2/96 (2%)

Query: 49 VVGPAPTANATPCGAPEANVD-PPAAPPAMPAPQPVVQPPTGRRPTHTNDQAPLPKLGPL 107
VV P P P EA V P P P+PV + +R + P
Sbjct: 73 VVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENT 132

Query: 108 ISSLLKPSAPGRQYSAPVHPRAEGPGP-DPSAPMYP 142
+ S S PV A GP + P YP
Sbjct: 133 APARPTSSTATAATSKPVTSVASGPRALSRNQPQYP 168



Score = 29.9 bits (67), Expect = 0.022
Identities = 21/93 (22%), Positives = 24/93 (25%), Gaps = 2/93 (2%)

Query: 122 SAPVHPRAEGPGPD--PSAPMYPQAEVLPPGPNPPTPGGAVPPNANQLRPNAAPMPQPQP 179
VH E P P S M A++ PP P P V P P P
Sbjct: 34 YTSVHQVIELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVV 93

Query: 180 QPQPAPGPPPPSIAGAPTSLVDWVTGPNGPNKT 212
+P P P P P
Sbjct: 94 IEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPA 126



Score = 29.6 bits (66), Expect = 0.025
Identities = 19/99 (19%), Positives = 29/99 (29%), Gaps = 7/99 (7%)

Query: 123 APVHPRAEGPGPDPSAPMYPQA------EVLPPGPNPPTPGGAVPPNANQLRPNAAPMPQ 176
A P P+P P+ + P P P V Q + + P+
Sbjct: 65 AVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVE-QPKRDVKPVES 123

Query: 177 PQPQPQPAPGPPPPSIAGAPTSLVDWVTGPNGPNKTLQR 215
P P P+ + A + VT + L R
Sbjct: 124 RPASPFENTAPARPTSSTATAATSKPVTSVASGPRALSR 162


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0866   cloacin       30     0.007    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 30.5 bits (68), Expect = 0.007
Identities = 21/67 (31%), Positives = 29/67 (43%), Gaps = 12/67 (17%)

Query: 152 FPTDLDDVASRLNAAARLTIAVQRARQDREARIEGELGQQPGYIEQLEYDAAHPENAVEV 211
FP D A ++ + L+ + RQD E R Q E+DA HP A E
Sbjct: 274 FPKDSGHNAVYVSVSDVLSPDQVKQRQDEENR------------RQQEWDATHPVEAAER 321

Query: 212 DGEQASA 218
+ E+A A
Sbjct: 322 NYERARA 328


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0871c  DHBDHDRGNASE  108    2e-30    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 108 bits (271), Expect = 2e-30
Identities = 73/254 (28%), Positives = 118/254 (46%), Gaps = 10/254 (3%)

Query: 8 LDDKVAVITGAGRGLGAAIAVAFAEAGADVLIASRTESQLEAVAEQVRAAGRRAHVVAAD 67
++ K+A ITGA +G+G A+A A GA + +LE V ++A R A AD
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 68 LAHPESTAELAARAVEAFGKLDIVVNNVGGTMPNTLLTTSTKDLKDAFTFNVATAHALTV 127
+ + E+ AR G +DI+VN G P + + S ++ + F+ N +
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 128 AAVPLMLEHSGGGNIIKITSTMGRLAGRGFAAYGTAKAALSHYTRLTALDLCPR-IRVNA 186
+ M++ G+I+ + S + AAY ++KAA +T+ L+L IR N
Sbjct: 126 SVSKYMMDRR-SGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184

Query: 187 IAPGSILTSA-----LDVVASNDELRAPMEK---ATPLRRLGDPVDIAAAAVYLASPAGE 238
++PGS T D + ++ +E PL++L P DIA A ++L S
Sbjct: 185 VSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAG 244

Query: 239 FLTGKTLEVDGGLT 252
+T L VDGG T
Sbjct: 245 HITMHNLCVDGGAT 258


18  MAP1065  MAP1112c  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias      Product
MAP1065   2       13       2.195046     hypothetical protein
MAP1066   2       13       2.645019     hypothetical protein
MAP1067c  3       13       2.296175     hypothetical protein
MAP1068   3       13       2.294148     hypothetical protein
MAP1069   4       14       2.506471     hypothetical protein
MAP1070c  4       14       3.330482     hypothetical protein
MAP1071c  4       15       3.024097     hypothetical protein
MAP1072   1       15       1.706357     recombination factor protein RarA
MAP1073   1       15       1.769205     hypothetical protein
MAP1074c  1       16       1.688753     hypothetical protein
MAP1075c  0       16       2.219329     hypothetical protein
MAP1076   0       15       2.719031     hypothetical protein
MAP1077   0       15       2.762774     alanyl-tRNA synthetase
MAP1078   1       13       3.624107     Holliday junction resolvase-like protein
MAP1079   0       11       3.738917     hypothetical protein
MAP1080   1       9        4.047577     shikimate 5-dehydrogenase
MAP1081   1       8        2.278389     hypothetical protein
MAP1082c  2       10       1.703507     hypothetical protein
MAP1083c  2       11       1.397823     hypothetical protein
MAP1084c  1       10       1.308873     hypothetical protein
MAP1085   0       10       1.119666     hypothetical protein
MAP1086   -1      11       1.706176     hypothetical protein
MAP1087   1       12       3.324679     hypothetical protein
MAP1088   1       10       3.333966     hypothetical protein
MAP1089   1       10       3.374540     hypothetical protein
MAP1090   0       11       3.421107     DppD_2
MAP1091   -1      14       3.419314     chorismate synthase
MAP1092   -1      14       1.960295     shikimate kinase
MAP1093   0       14       1.821533     3-dehydroquinate synthase
MAP1094   3       18       2.124914     3-dehydroquinate dehydratase
MAP1095c  2       15       0.478966     hypothetical protein
MAP1096   2       14       0.871099     PepQ
MAP1097   2       14       0.021020     elongation factor P
MAP1098   1       13       0.317512     transcription antitermination protein NusB
MAP1099   0       13       0.098891     hypothetical protein
MAP1100   -1      11       0.125682     Adi
MAP1101   0       11       1.188842     hypothetical protein
MAP1102c  0       14       0.127016     TcrA
MAP1103c  1       13       0.972956     hypothetical protein
MAP1104c  3       12       1.320736     hypothetical protein
MAP1105   3       13       1.962466     hypothetical protein
MAP1106   2       14       2.399136     hypothetical protein
MAP1107   3       14       2.468454     hypothetical protein
MAP1108   2       13       2.361446     hypothetical protein
MAP1109   1       14       2.358526     hypothetical protein
MAP1110   1       15       2.281517     hypothetical protein
MAP1111   1       14       2.153548     oxidoreductase/HEAT repeat-containing protein
MAP1112c  2       14       1.604438     hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1068   SECYTRNLCASE  30     0.008    Preprotein translocase SecY subunit signature.
		>SECYTRNLCASE#Preprotein translocase SecY subunit signature.

Length = 437

Score = 30.5 bits (69), Expect = 0.008
Identities = 12/40 (30%), Positives = 19/40 (47%), Gaps = 2/40 (5%)

Query: 134 TLDLDPPEITDEVRDYAA--PSFAPGRPLIEVLRDLASRI 171
+ +P E+ D ++ Y P GRP E L + +RI
Sbjct: 335 AISFNPEEVADNMKKYGGFIPGIRAGRPTAEYLSYVLNRI 374


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1071c  PERTACTIN     32     0.004    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 32.0 bits (72), Expect = 0.004
Identities = 18/48 (37%), Positives = 18/48 (37%)

Query: 11 PPEPPPGYGPPPGYGTPPGYGTPPPPPPGYGPPPGYGAPPPGYGPPPG 58
PP P P P P G P PP PP PP P PP G
Sbjct: 568 PPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAG 615



Score = 30.1 bits (67), Expect = 0.013
Identities = 18/47 (38%), Positives = 18/47 (38%)

Query: 8 PGTPPEPPPGYGPPPGYGTPPGYGTPPPPPPGYGPPPGYGAPPPGYG 54
P P P PG P P PP PP PP P AP P G
Sbjct: 569 PAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAG 615



Score = 29.7 bits (66), Expect = 0.018
Identities = 17/51 (33%), Positives = 20/51 (39%)

Query: 2 SQPPEYPGTPPEPPPGYGPPPGYGTPPGYGTPPPPPPGYGPPPGYGAPPPG 52
++ P P P+P P GP P P PP PP P PP G
Sbjct: 565 AKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAG 615



Score = 28.5 bits (63), Expect = 0.037
Identities = 21/69 (30%), Positives = 22/69 (31%), Gaps = 4/69 (5%)

Query: 21 PPGYGTPPGYGTPPPPPPGYGPPPGYGAPPPGYGPPPGYGPPPGFGGP----PKPAFSVG 76
G + G PP P P P P G PP PP PP P P G
Sbjct: 556 GNGQWSLVGAKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAG 615

Query: 77 EAFGWAWNA 85
A NA
Sbjct: 616 RELSAAANA 624


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1077   FLGFLGJ       30     0.038    Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 30.1 bits (67), Expect = 0.038
Identities = 31/131 (23%), Positives = 52/131 (39%), Gaps = 14/131 (10%)

Query: 526 TGAGESARAAVTDVQKIAKTLWVHRVNVESGEFVEGDTVIAAVDPQWRRGATQGHSGTHM 585
T +A VQK + + +S F+ A + + + Q H+
Sbjct: 120 TVVRYQNQALSQLVQKAVPRNYDDSLPGDSKAFL------AQLSLPAQLASQQSGVPHHL 173

Query: 586 V--HAALRQVLGPNAVQAGSLNRPGYLRF----DFNWQGPLTEEQRTQIEEVTNQAVQAD 639
+ AAL G ++ + P Y F NW+GP+TE T+ E + V+A
Sbjct: 174 ILAQAALESGWGQRQIRRENGE-PSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAK 232

Query: 640 FEVH-TFTEQL 649
F V+ ++ E L
Sbjct: 233 FRVYSSYLEAL 243


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1081   PREPILNPTASE  44     2e-08    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 44.4 bits (105), Expect = 2e-08
Identities = 37/130 (28%), Positives = 57/130 (43%), Gaps = 9/130 (6%)

Query: 12 WLAVLSCYDIRQHRLPNALTLTGAAAILAAAGLAG--RGPSALAGAAA----LAAIYLLV 65
L L+ D+ + LP+ LTL L L G A+ GA A L ++Y
Sbjct: 143 VLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGDAVIGAMAGYLVLWSLYWAF 202

Query: 66 HAV-APGGMGAGDVKLALGVGALTGCGGV-GVWFLAALAAPLLTVLVGVLARVRRAGPTV 123
+ GMG GD KL +GA G + V L++L + + + +L ++ P +
Sbjct: 203 KLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAFMGIGLILLRNHHQSKP-I 261

Query: 124 PHGPSLCLAA 133
P GP L +A
Sbjct: 262 PFGPYLAIAG 271


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1101   PF06580       39     2e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 39.5 bits (92), Expect = 2e-05
Identities = 18/105 (17%), Positives = 36/105 (34%), Gaps = 28/105 (26%)

Query: 363 LVDNAVRHAVS------CVAIEVGSRDGTAVLTVGDDGPGIPPAQRSRVFERFVRLDTDR 416
LV+N ++H ++ + ++ +GT L V + G
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALK----------------- 305

Query: 417 ARSGGGAGLGLAIVAE-VVAAHGG--TVTIGDRPGGGTTLTVALP 458
+ G GL V E + +G + + ++ G V +P
Sbjct: 306 -NTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNA-MVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1102c  HTHFIS        88     3e-22    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 87.6 bits (217), Expect = 3e-22
Identities = 29/129 (22%), Positives = 58/129 (44%), Gaps = 1/129 (0%)

Query: 2 KVLLVEDEPRLAATVARGLKAEGFVVVTVGNGVDGLAEATENPFDIVILDIMLPGRSGYE 61
+L+ +D+ + + + L G+ V N D+V+ D+++P + ++
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VLRRMRSNNVWTPVLMLTAKDGEYDETDAFDLGADDYLTKPFSFRVLVARL-RALVRRGA 120
+L R++ PVL+++A++ A + GA DYL KPF L+ + RAL
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 121 PERPVVLTA 129
+ +
Sbjct: 125 RPSKLEDDS 133


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1110   PF05272       34     4e-04    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 34.3 bits (78), Expect = 4e-04
Identities = 18/66 (27%), Positives = 23/66 (34%), Gaps = 10/66 (15%)

Query: 27 VVRPG----EILVLTGPSGCGKSTVLRALAGLLTPDGGRVLADGVPVTGTSGDRAMVFQD 82
V+ PG +VL G G GKST++ L GL +D GT D
Sbjct: 588 VMEPGCKFDYSVVLEGTGGIGKSTLINTLVGL------DFFSDTHFDIGTGKDSYEQIAG 641

Query: 83 NALLPW 88

Sbjct: 642 IVAYEL 647


19  MAP1121c  MAP1138c  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias      Product
MAP1121c  2       15       0.634790     hypothetical protein
MAP1122   0       13       -0.401962    MIHF
MAP1123   0       13       0.539261     guanylate kinase
MAP1124   -1      12       0.585048     DNA-directed RNA polymerase subunit omega
MAP1125   -1      12       2.710069     bifunctional phosphopantothenoylcysteine
MAP1126   -1      13       2.612578     S-adenosylmethionine synthetase
MAP1127c  0       11       3.189865     hypothetical protein
MAP1128c  0       12       4.937391     hypothetical protein
MAP1129   1       13       5.448294     hypothetical protein
MAP1130   0       12       5.445830     primosome assembly protein PriA
MAP1131   -1      9        4.022357     hypothetical protein
MAP1132c  0       10       3.051821     hypothetical protein
MAP1133   0       11       2.355098     methionyl-tRNA formyltransferase
MAP1134   1       12       1.934097     Fmu
MAP1135   1       13       -0.097379    ribulose-phosphate 3-epimerase
MAP1136   0       10       -0.058097    RibG
MAP1137c  1       11       -0.944373    hypothetical protein
MAP1138c  2       12       -1.491150    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1137c  TCRTETB       77     3e-17    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 77.2 bits (190), Expect = 3e-17
Identities = 71/342 (20%), Positives = 131/342 (38%), Gaps = 29/342 (8%)

Query: 32 VLLGALDAYVVVTIMRDIMTDVHIPINQLQRITWIITMYLLGYIAAMPLLGRASDRFGRK 91
L+ V+ + DI D + P W+ T ++L + + G+ SD+ G K
Sbjct: 23 SFFSVLNEMVLNVSLPDIANDFNKPPAST---NWVNTAFMLTFSIGTAVYGKLSDQLGIK 79

Query: 92 LVLQVSLALFMVGSVVTALAGHWGDFHLLIGGRTIQGVASGALLPVTLALGADLWAQRNR 151
+L + + GSV+ GH F LLI R IQG + A + + + A + NR
Sbjct: 80 RLLLFGIIINCFGSVI-GFVGH-SFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENR 137

Query: 152 AGVLGGIGAAQELGSVLGPLYGIFIVFLFHDWRYVFWINVPLTLIAMVMIQFSLPSHEKV 211
G IG+ +G +GP G I H W Y+ +P+ I V L E
Sbjct: 138 GKAFGLIGSIVAMGEGVGPAIGGMIAHYIH-WSYLL--LIPMITIITVPFLMKLLKKEV- 193

Query: 212 EQPEKVDLVGGVLLAVALGLAVIGLYNPEPDGKQILPSYGLPLVLGAVVVGILFLLWERF 271
D+ G +L++V + ++ SY + ++ +V+ ++F+ R
Sbjct: 194 RIKGHFDIKGIILMSVGIVFFMLF-----------TTSYSISFLIVSVLSFLIFVKHIRK 242

Query: 272 ARTRLIEPAGVHFRPFLAALGASLFAGAALMVTLVDVELFGQGVLG---QDQTQAAGLLL 328
++P PF+ + G + T+ ++ Q T G ++
Sbjct: 243 VTDPFVDPGLGKNIPFMIG----VLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVI 298

Query: 329 WFLIALPIG--AVLGGWIATRVGDRAMTFVGLLIAAYGYWLI 368
F + + +GG + R G + +G+ + +
Sbjct: 299 IFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTA 340


20  MAP1179  MAP1189  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias     Product
MAP1179   3       12       1.846475    protoheme IX farnesyltransferase
MAP1180c  6       13       2.585809    Qor
MAP1181   8       12       2.574862    hypothetical protein
MAP1182c  6       12       1.058124    hypothetical protein
MAP1183c  3       10       0.532299    hypothetical protein
MAP1184c  2       11       0.168924    hypothetical protein
MAP1185c  1       12       -0.036650   hypothetical protein
MAP1186   1       12       -1.102105   hypothetical protein
MAP1187   3       12       -1.458830   hypothetical protein
MAP1188   2       10       -0.144165   hypothetical protein
MAP1189   2       12       -0.035111   hypothetical protein
21  MAP1213  MAP1236c  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
MAP1213   2       22       -4.134764   hypothetical protein
MAP1214   2       20       -4.125469   hypothetical protein
MAP1215   5       31       -4.326018   hypothetical protein
MAP1216c  1       24       -3.303170   hypothetical protein
MAP1217c  1       16       0.678278    hypothetical protein
MAP1218c  2       14       2.913545    hypothetical protein
MAP1219c  3       13       3.680286    hypothetical protein
MAP1220c  1       10       3.412807    hypothetical protein
MAP1221   -2      9        1.793783    hypothetical protein
MAP1222   -1      9        1.679059    hypothetical protein
MAP1223c  -1      9        0.994093    hypothetical protein
MAP1224c  -2      8        0.429361    hypothetical protein
MAP1225   -2      8        -0.230341   MutA
MAP1226   -2      9        -2.007833   methylmalonyl-CoA mutase
MAP1227   -1      21       -3.510254   arginine/ornithine transport system ATPase
MAP1228   2       29       -5.960069   hypothetical protein
MAP1229   3       38       -8.494213   hypothetical protein
MAP1230   4       49       -10.772389  hypothetical protein
MAP1231   4       54       -12.267287  GmdA
MAP1232   5       53       -11.875269  EpiA
MAP1233   5       44       -10.158665  hypothetical protein
MAP1234   1       27       -7.949225   hypothetical protein
MAP1235   1       19       -6.892062   hypothetical protein
MAP1236c  1       14       -5.970770   DrrC
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1217c  PRPHPHLPASEC  32     0.003    Prokaryotic zinc-dependent phospholipase C signature.
		>PRPHPHLPASEC#Prokaryotic zinc-dependent phospholipase C signature.

Length = 398

Score = 31.5 bits (71), Expect = 0.003
Identities = 15/69 (21%), Positives = 24/69 (34%), Gaps = 3/69 (4%)

Query: 35 YEAFTETWKDSLSIGWQGNGAEALRSRTYADKVKVSDMVDQLHEAAKVARAGATDLSAAR 94
+E F E K+ I G YAD +K D E A+ + +
Sbjct: 179 FETFAEERKEQYKINTAGCKTN---EDFYADILKNKDFNAWSKEYARGFAKTGKSIYYSH 235

Query: 95 SRMRNAVVD 103
+ M ++ D
Sbjct: 236 ASMSHSWDD 244


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1220c  PERTACTIN     33     1e-04    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 33.2 bits (75), Expect = 1e-04
Identities = 18/45 (40%), Positives = 19/45 (42%)

Query: 65 PPGPPPMWAAPPMPPPPLHGPPGPPHGFPPPPPGEAFPTLPPPSP 109
PP P P P P P PP PP PP P + P P P P
Sbjct: 568 PPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQP 612



Score = 30.1 bits (67), Expect = 0.001
Identities = 20/56 (35%), Positives = 20/56 (35%)

Query: 42 GAVGTVAAIAFAWIISVGPPGPPPPGPPPMWAAPPMPPPPLHGPPGPPHGFPPPPP 97
G V A A PGP P PP PP PP P P P P PP
Sbjct: 558 GQWSLVGAKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPP 613



Score = 26.6 bits (58), Expect = 0.023
Identities = 18/51 (35%), Positives = 19/51 (37%), Gaps = 5/51 (9%)

Query: 60 PPGPPPPGPPPMWAAPPMPPPPLHGPPGPPHGFPPPPPGEAFPTLPPPSPR 110
P P PGP P PP PP PP PP P PP+ R
Sbjct: 571 PKPAPQPGPQ-----PGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAGR 616



Score = 26.2 bits (57), Expect = 0.036
Identities = 14/37 (37%), Positives = 14/37 (37%)

Query: 74 APPMPPPPLHGPPGPPHGFPPPPPGEAFPTLPPPSPR 110
APP P P P P P PP P P P R
Sbjct: 567 APPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQR 603


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1221   HTHFIS        81     6e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 80.6 bits (199), Expect = 6e-20
Identities = 36/125 (28%), Positives = 66/125 (52%), Gaps = 1/125 (0%)

Query: 2 LVVEDSETIREMVSEALTEVGYHTEARRDGERLEELLDGIRPDLVVLDVMLPGRDGFALI 61
LV +D IR ++++AL+ GY + L + DLVV DV++P + F L+
Sbjct: 7 LVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLL 66

Query: 62 DVIRDWG-DIGIVLITARDGLPDRLRGLDGGADDYVIKPFELAELVSRVGAVLRRRGRLP 120
I+ D+ +++++A++ ++ + GA DY+ KPF+L EL+ +G L R P
Sbjct: 67 PRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRRP 126

Query: 121 QVIQV 125
++
Sbjct: 127 SKLED 131


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1222   TONBPROTEIN   39     2e-05    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 38.8 bits (90), Expect = 2e-05
Identities = 14/36 (38%), Positives = 15/36 (41%)

Query: 97 AEGPVEPPPPYPPPLPPPAYPPPYPPPYPPPPGPPP 132
+EPP PP P P P P P P PP P
Sbjct: 51 TPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAP 86



Score = 37.3 bits (86), Expect = 8e-05
Identities = 10/39 (25%), Positives = 11/39 (28%)

Query: 94 PDLAEGPVEPPPPYPPPLPPPAYPPPYPPPYPPPPGPPP 132
+ P P P P P P P P P P
Sbjct: 59 QAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPK 97



Score = 36.1 bits (83), Expect = 2e-04
Identities = 15/51 (29%), Positives = 15/51 (29%), Gaps = 1/51 (1%)

Query: 94 PDLAEGPVEPPPPYPPPLPPPAYPPPYPPPYP-PPPGPPPDTTATAVVHPL 143
P A P P P P P P PP P P P P V
Sbjct: 57 PPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQ 107



Score = 36.1 bits (83), Expect = 2e-04
Identities = 16/49 (32%), Positives = 17/49 (34%), Gaps = 1/49 (2%)

Query: 98 EGPVEPPPPYPPPLPPPAYPPPYPPPYP-PPPGPPPDTTATAVVHPLPD 145
PV P P P P+P P P P P P P P P D
Sbjct: 65 PEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRD 113



Score = 36.1 bits (83), Expect = 2e-04
Identities = 12/41 (29%), Positives = 15/41 (36%)

Query: 92 ISPDLAEGPVEPPPPYPPPLPPPAYPPPYPPPYPPPPGPPP 132
++P E P PP P + P P P P P P
Sbjct: 50 VTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIE 90



Score = 32.3 bits (73), Expect = 0.004
Identities = 14/54 (25%), Positives = 17/54 (31%)

Query: 78 LVVTADGQAYGDRAISPDLAEGPVEPPPPYPPPLPPPAYPPPYPPPYPPPPGPP 131
+V AD + P+ P P P P P P P P P P
Sbjct: 49 MVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKP 102



Score = 31.9 bits (72), Expect = 0.004
Identities = 12/46 (26%), Positives = 13/46 (28%)

Query: 98 EGPVEPPPPYPPPLPPPAYPPPYPPPYPPPPGPPPDTTATAVVHPL 143
E EP P P P P P P P V P+
Sbjct: 72 EPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPV 117


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1223c  V8PROTEASE    41     3e-06    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 40.8 bits (95), Expect = 3e-06
Identities = 31/194 (15%), Positives = 62/194 (31%), Gaps = 17/194 (8%)

Query: 36 TPAQQPTPPPVLAPIDLPAA---SAIGPGAGI-YVDYTDGSGGMGCTAGFLVHTSSGQAG 91
P +Q V+ P + + G A + Y+ +G + G +V G+
Sbjct: 59 KPLEQREHANVILPNNDRHQITDTTNGHYAPVTYIQVEAPTGTFIAS-GVVV----GKDT 113

Query: 92 ILTAGHCNRP--GEPSKVTMNLGGVLPYATLGTFSQTISEGVHDEQHDIGLIILDGDNVP 149
+LT H G+P + + + + D+ ++ +
Sbjct: 114 LLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGEGDLAIVKFSPNEQN 173

Query: 150 QSPAIAASVPVSGVAANLQVGQQLCKFGMGSGADAC------GQIVEITGSKVKFLAGGQ 203
+ A QV Q + G G+I + G +++
Sbjct: 174 KHIGEVVKPATMSNNAETQVNQNITVTGYPGDKPVATMWESKGKITYLKGEAMQYDLSTT 233

Query: 204 CGDSGGPVYRYEND 217
G+SG PV+ +N+
Sbjct: 234 GGNSGSPVFNEKNE 247


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1231   NUCEPIMERASE  97     7e-25    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 96.8 bits (241), Expect = 7e-25
Identities = 73/333 (21%), Positives = 122/333 (36%), Gaps = 48/333 (14%)

Query: 3 RALITGITGQDGSYLAELLLSKGYEVHGLVRRASTFNTSRIDHLYVDPHQPGARL----- 57
+ L+TG G G ++++ LL G++V G+ N Y D ARL
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGI----DNLND------YYDVSLKQARLELLAQ 51

Query: 58 ---FLHYADLTDGTRLVTLLSSIDPDEVYNLAAQSHVRVSFDEPVHTGDTTGMGSIRLLE 114
H DL D + L +S + V+ + VR S + P D+ G + +LE
Sbjct: 52 PGFQFHKIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILE 111

Query: 115 AVRLSRVDCRFYQASSSEMFGASP--PPQNESTPFYPRSPYGAAKVFSYWTTRNYREAYG 172
R +++ ASSS ++G + P + + +P S Y A K + Y YG
Sbjct: 112 GCRHNKIQ-HLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYG 170

Query: 173 LFAVNGILFNHESPRRGETFVTRKITRAVARIRAGVQSEVYMGNLDAIRDWGYAPEYVEG 232
L A F P K T+A + G +VY RD+ Y + E
Sbjct: 171 LPATGLRFFTVYGPWGRPDMALFKFTKA---MLEGKSIDVY-NYGKMKRDFTYIDDIAEA 226

Query: 233 MWRMLQAPEPDD-------------------YVLATGRGYTVREFAQAAFDHVGLDWQKH 273
+ R+ D Y + + ++ QA D +G++ +K+
Sbjct: 227 IIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKN 286

Query: 274 VKFDDRYLRPTEVDSLVGDADRAAQSLGWKASV 306
L+P +V D + +G+
Sbjct: 287 --MLP--LQPGDVLETSADTKALYEVIGFTPET 315


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1232   NUCEPIMERASE  73     1e-16    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 72.9 bits (179), Expect = 1e-16
Identities = 60/287 (20%), Positives = 103/287 (35%), Gaps = 35/287 (12%)

Query: 46 EIDLTDRAATFDFVSETRPQVIIDAAARVGGIMANNTYPADFLSENLRIQTNLLDAAVAV 105
+IDL DR D + + + + R + + P + NL N+L+
Sbjct: 58 KIDLADREGMTDLFASGHFERVFISPHR-LAVRYSLENPHAYADSNLTGFLNILEGCRHN 116

Query: 106 RVPRLLFLGSSCIYPKYAPQPIHESALLTGPLEPTNDAYAIAKIAGILQVQAVRRQYGLA 165
++ LL+ SS +Y P + P+ YA K A L YGL
Sbjct: 117 KIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSL----YAATKKANELMAHTYSHLYGLP 172

Query: 166 WISAMPTNLYGP-GDNFSPSGSHLLPALIRRYEEAKAGGAEEVTNWGTGTPRRELLHVDD 224
+YGP G P + L + E K+ + + G +R+ ++DD
Sbjct: 173 ATGLRFFTVYGPWGR---PDMA--LFKFTKAMLEGKS-----IDVYNYGKMKRDFTYIDD 222

Query: 225 LASACLFLLEHFDGPNH------------------VNVGTGVDHSISEIADMVATAVGYI 266
+A A + L + + N+G + + + A+G
Sbjct: 223 IAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIE 282

Query: 267 GETRWDPTKPDGTPRKLLDVSALRE-LGWRPRIALKEGIDATVSWYR 312
+ P +P D AL E +G+ P +K+G+ V+WYR
Sbjct: 283 AKKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYR 329


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1236c  ABC2TRNSPORT  40     3e-06    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 40.3 bits (94), Expect = 3e-06
Identities = 46/214 (21%), Positives = 81/214 (37%), Gaps = 11/214 (5%)

Query: 21 RLLLRWRRDQ-AVLMGSLLLPIFLLFAYQIVLGERVHQVTGIGSVYGIAPMCAVISALFG 79
R + W++ A L+G L P+ LF LG V +V G+ +A SA+
Sbjct: 22 RNYIAWKKAALASLLGHLAEPLIYLFGLGAGLGVMVGRVGGVSYTAFLAAGMVATSAMTA 81

Query: 80 SLGNSVGITIDR--ESRVLSRMWVLPIHRASAVTGWVIAEVVRALIGTILITAIALAMGL 137
+ ++ R R M + V G + +A + I +A A+G
Sbjct: 82 ATFETIYAAFGRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAGIGVVAAALG- 140

Query: 138 RFTNGWAAALLFLLIPSITVTGFTALVMAMAVRKKGRTAMTWLLSATFALA---FVNPGA 194
+T + L +P I +TG + M V + ++ T + F++
Sbjct: 141 -YTQWLS---LLYALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAV 196

Query: 195 TPIKLFPDWARPLIRMQPISPPIEAMRSLAHGGP 228
P+ P + R P+S I+ +R + G P
Sbjct: 197 FPVDQLPIVFQTAARFLPLSHSIDLIRPIMLGHP 230


22  MAP1268c  MAP1280c  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
MAP1268c  3       7        -0.692518   GlgZ
MAP1269c  3       9        -0.047755   GlgY
MAP1270c  3       9        0.434966    GlgX_1
MAP1271c  4       11       2.716174    hypothetical protein
MAP1272c  4       12       4.025815    hypothetical protein
MAP1273c  3       11       3.546481    hypothetical protein
MAP1274   1       9        2.409355    adenosylmethionine--8-amino-7-oxononanoate
MAP1275   1       11       2.400969    8-amino-7-oxononanoate synthase
MAP1276   0       9        1.380222    dithiobiotin synthetase
MAP1277   0       9        0.314485    hypothetical protein
MAP1278c  0       9        0.108226    hypothetical protein
MAP1279c  0       9        0.194986    hypothetical protein
MAP1280c  2       12       1.297495    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1272c  IGASERPTASE   29     0.031    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 28.9 bits (64), Expect = 0.031
Identities = 20/127 (15%), Positives = 42/127 (33%), Gaps = 2/127 (1%)

Query: 186 QVSPVRTAGMAPYMVRVLGTTAPT--QQVPQQAPLQQTPAQQAPLQQTPGQQAPLQQTPG 243
+++ V A + P T T + Q++ + Q A ++ +
Sbjct: 1016 EIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSN 1075

Query: 244 QQLPTQQAPLQQVPGQQVPGQQLPTQQAPQQAPLQLAPTQQAPLQQLPTQQSPLQQLPVQ 303
+ TQ + Q + Q T++ + A + Q++P S + Q
Sbjct: 1076 VKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQ 1135

Query: 304 QSPLQPA 310
+QP
Sbjct: 1136 SETVQPQ 1142


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1275   PF00577       31     0.007    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 31.4 bits (71), Expect = 0.007
Identities = 14/72 (19%), Positives = 26/72 (36%), Gaps = 1/72 (1%)

Query: 90 LADYVGAASGLLFSSGY-AANLGAVVGLSGRGALVVSDAYSHASLVDACRLSRARVVVTP 148
L G + Y A N G + GAL V ++++L D + V
Sbjct: 404 LPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMTQANSTLPDDSQHDGQSVRFLY 463

Query: 149 HRDVDAVRAALQ 160
++ ++ +Q
Sbjct: 464 NKSLNESGTNIQ 475


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1280c  SALSPVBPROT   30     0.028    Salmonella virulence plasmid 65kDa B protein signature.
		>SALSPVBPROT#Salmonella virulence plasmid 65kDa B protein signature.

Length = 591

Score = 29.7 bits (66), Expect = 0.028
Identities = 20/80 (25%), Positives = 30/80 (37%), Gaps = 11/80 (13%)

Query: 144 PERLARPRLSYPKRWGYRPAALGLFAFVWMELASPNPAAPSWVTGWLLIY----TVLMAA 199
+R A LS + PAA ++W + A W+ + Y
Sbjct: 225 RDRSAMRYLSKVQYGNATPAA---DLYLW----TSATPAVQWLFTLVFDYGERGVDPQVP 277

Query: 200 GAWLCGQRWLARADPFGVYS 219
A+ WLAR DPF +Y+
Sbjct: 278 PAFTAQNSWLARQDPFSLYN 297


23  MAP1387c  MAP1398  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
MAP1387c  2       12       0.863233    hypothetical protein
MAP1388   3       14       0.576097    hypothetical protein
MAP1389   3       13       0.430441    acyl-CoA synthetase
MAP1390   3       13       0.339774    hypothetical protein
MAP1391c  1       12       0.704574    hypothetical protein
MAP1392c  1       13       0.672647    hypothetical protein
MAP1393c  -1      12       2.202299    hypothetical protein
MAP1394c  -1      13       3.511747    Amt_1
MAP1395   0       14       4.358252    3-methyladenine DNA glycosylase
MAP1396   0       13       4.175782    tyrosyl-tRNA synthetase
MAP1397   -1      11       4.158219    hypothetical protein
MAP1398   1       12       4.338229    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1391c  HTHTETR       63     1e-14    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 63.5 bits (154), Expect = 1e-14
Identities = 22/106 (20%), Positives = 44/106 (41%), Gaps = 10/106 (9%)

Query: 7 PRRRRGRPAGSSGSRERILASARELFARNGIRNTSIRAVAAAAGVDSALVHHYFGTKEKL 66
R+ + + +R+ IL A LF++ G+ +TS+ +A AAGV ++ +F K L
Sbjct: 2 ARKTKQE---AQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDL 58

Query: 67 FAAAVQIPIDPMQVIGPLREVPVDDLGYALPSMLLPLWDSEVGAAF 112
F+ + + E+ ++ P L + +
Sbjct: 59 FSEIWE------LSESNIGELELEYQA-KFPGDPLSVLREILIHVL 97


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1392c  ABC2TRNSPORT  46     2e-08    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 46.5 bits (110), Expect = 2e-08
Identities = 37/174 (21%), Positives = 73/174 (41%), Gaps = 6/174 (3%)

Query: 53 TMQRERASGTLERILTTPLRRLDLLIA--YGTAFSIAAAAQAILACTVSFWLLGFDTAGS 110
R T E +L T LR D+++ A A A I + LG+ S
Sbjct: 90 AFGRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAGIGVVAAA---LGYTQWLS 146

Query: 111 PVWVFVIAIVNAVLGVGLGLLCSAFARTEFQAVQFIPLVMVPQLLLAGIIVPRALMPHWL 170
++ + + + LG++ +A A + + + LV+ P L L+G + P +P
Sbjct: 147 LLYALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVF 206

Query: 171 QWISNVLPASYALEALQQVGTHPELTFIAVRDIVVVLGFALAALCLAAATLRRR 224
Q + LP S++++ ++ + + + + + + + L+ A LRRR
Sbjct: 207 QTAARFLPLSHSIDLIRPIMLGHPVVDVCQH-VGALCIYIVIPFFLSTALLRRR 259


24  MAP1434  MAP1441  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
MAP1434   0       13       -3.135690   hypothetical protein
MAP1435   0       15       -3.049472   hypothetical protein
MAP1436c  1       15       -3.103398   hypothetical protein
MAP1437c  1       15       -3.183522   hypothetical protein
MAP1438c  3       14       -2.123096   hypothetical protein
MAP1439c  3       15       -2.632815   hypothetical protein
MAP1440c  4       16       -2.185465   hypothetical protein
MAP1441   3       16       -1.207405   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1435   DHBDHDRGNASE  33     8e-04    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 33.1 bits (75), Expect = 8e-04
Identities = 28/122 (22%), Positives = 48/122 (39%), Gaps = 7/122 (5%)

Query: 20 VDHRDDAAVSDLFDRVRRESGRLDLLVNNAATISDNLVSSKPF--WEKPLDLADVLDVGL 77
D RD AA+ ++ R+ RE G +D+LVN A + L+ S WE + + V
Sbjct: 64 ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSV-NSTGVFN 122

Query: 78 RSSYVASWYAAPLLVAGGRGLIAFTSSPGSVCYMHGPAYGAQKAGVDKMAADMAVDFRGT 137
S V+ + ++ S+P V AY + KA + ++
Sbjct: 123 ASRSVSKYMMD----RRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEY 178

Query: 138 GV 139
+
Sbjct: 179 NI 180


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1436c  DHBDHDRGNASE  103    7e-29    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 103 bits (259), Expect = 7e-29
Identities = 73/268 (27%), Positives = 110/268 (41%), Gaps = 33/268 (12%)

Query: 6 GRRAIVTGAGSGIGAATAARLLDEGATVVAYDISAEGLARTRAAADDAGTGKRLTTAVLD 65
G+ A +TGA GIG A A L +GA + A D + E L + ++ + D
Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSS--LKAEARHAEAFPAD 65

Query: 66 ISVEGDVIAAVDGAVADLGGLEVLVNVAAIQTCSHTHQTTLADWNRTLAVNLTGTFLMTR 125
+ + ++G +++LVNVA + H + +W T +VN TG F +R
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 126 QALPALLDSGRGVVVNFTSTAASFAHPYMAAYAASKGGILSFTHSLALEYAKQGLRAVNI 185
++D G +V S A MAAYA+SK + FT L LE A+ +R +
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 186 QPGGVSTALANSTLDKMPDGYDVGLWAKQTPLLHGKDSEILGD---------------PS 230
PG T + S LWA + +G + I G PS
Sbjct: 186 SPGSTETDMQWS------------LWADE----NGAEQVIKGSLETFKTGIPLKKLAKPS 229

Query: 231 AVASVIAMVASDDGAFITGTEIRVDGGA 258
+A + + S IT + VDGGA
Sbjct: 230 DIADAVLFLVSGQAGHITMHNLCVDGGA 257


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1439c  DHBDHDRGNASE  58     5e-12    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 57.8 bits (139), Expect = 5e-12
Identities = 45/165 (27%), Positives = 72/165 (43%), Gaps = 6/165 (3%)

Query: 3 VVLADIDGDAVAALRDELAAGGGAAHDAACDVRDPAAVQDLADR-AYDIGPVRLLVNNAG 61
+ D + + + + L A A DVRD AA+ ++ R ++GP+ +LVN AG
Sbjct: 35 IAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSAAIDEITARIEREMGPIDILVNVAG 94

Query: 62 IEQFGYLWDTPVVNWQHVMDVNVSGVFYGVRAFLPKMMAAGQQAWVWNIASVGAVVAMPL 121
+ + G + W+ VN +GVF R+ MM + V + S A V
Sbjct: 95 VLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSKYMMDRRSGSIV-TVGSNPAGVPRTS 153

Query: 122 QAPYIVSKHAVLALTECLHLEVQATGHDDHVHVQAVLPGPVRSNI 166
A Y SK A + T+CL LE+ + V PG +++
Sbjct: 154 MAAYASSKAAAVMFTKCLGLELAEYN----IRCNIVSPGSTETDM 194


25  MAP1485c  MAP1491  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
MAP1485c  2       10       -0.028706   hypothetical protein
MAP1486c  2       8        -0.093145   hypothetical protein
MAP1487c  3       9        0.575961    hypothetical protein
MAP1488c  3       11       2.185032    hypothetical protein
MAP1489c  3       13       1.748348    hypothetical protein
MAP1490   3       14       1.603328    hypothetical protein
MAP1491   4       16       1.794710    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1489c  DHBDHDRGNASE  102    5e-28    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 102 bits (256), Expect = 5e-28
Identities = 90/285 (31%), Positives = 129/285 (45%), Gaps = 33/285 (11%)

Query: 4 VTGKVAVISGAARGQGRSHARMLAAEGADIIAVDLCADIETNEYPLARPEDLDETARLVE 63
+ GK+A I+GAA+G G + AR LA++GA I AVD PE L++ ++
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVD------------YNPEKLEKVVSSLK 53

Query: 64 KEGQRAITAVADVRDRVALSAAIDAGVAEFGHLDIVVANAGIC--PLTAGLPPQAFADAV 121
E + A ADVRD A+ E G +DI+V AG+ L L + +
Sbjct: 54 AEARHAEAFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATF 113

Query: 122 DVDLGGVLNLVHASLKHLRA--GASIIVIGSNAAFMSSLNTSGAGSGIGGPGGAGYAFAK 179
V+ GV N + K++ SI+ +GSN A G+ A YA +K
Sbjct: 114 SVNSTGVFNASRSVSKYMMDRRSGSIVTVGSNPA------------GVPRTSMAAYASSK 161

Query: 180 LAAAHYVNDFALALAPFSIRMNAVHPTNVDTDMLHSPPMYRAFRPDLPAPTREDAEPVFP 239
AA + L LA ++IR N V P + +TDM S + + E + P
Sbjct: 162 AAAVMFTKCLGLELAEYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIP 221

Query: 240 LVQAMPVPYVEPEDISEAVLFLASDAARYITGQQLRVDAGGFLKV 284
L + +P DI++AVLFL S A +IT L VD G L V
Sbjct: 222 LKK-----LAKPSDIADAVLFLVSGQAGHITMHNLCVDGGATLGV 261


26  MAP1503c  MAP1519  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
MAP1503c  4       33       -2.399296   hypothetical protein
MAP1504   4       31       -1.218161   hypothetical protein
MAP1505   4       24       -0.278274   hypothetical protein
MAP1506   2       15       0.823615    hypothetical protein
MAP1507   1       11       1.329096    hypothetical protein
MAP1508   1       10       0.693873    hypothetical protein
MAP1509   1       10       1.022058    hypothetical protein
MAP1510   -1      9        1.212288    hypothetical protein
MAP1511   -1      9        1.258463    hypothetical protein
MAP1512   0       9        0.586821    hypothetical protein
MAP1513   1       11       1.114670    hypothetical protein
MAP1514   3       12       1.848629    hypothetical protein
MAP1515   3       11       1.605278    hypothetical protein
MAP1516   3       11       2.305716    hypothetical protein
MAP1517   3       10       2.722183    hypothetical protein
MAP1518   3       9        2.773485    hypothetical protein
MAP1519   2       9        2.295245    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1511   SUBTILISIN    109    2e-28    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 109 bits (275), Expect = 2e-28
Identities = 48/272 (17%), Positives = 91/272 (33%), Gaps = 73/272 (26%)

Query: 285 FSGVAPDVDIIAIRQSSQAFGLKDAYTGDEDPQTQAKIDNVQTMARAIVHAANMGASVIN 344
GVAP+ D++ I+ ++ + + I +A +I+
Sbjct: 103 VVGVAPEADLLIIKVLNKQGS-----------------GQYDWIIQGIYYAIEQKVDIIS 145

Query: 345 ISDVTCMSARNIIDQRALGAAVRYAAVDKNAVIVAAAGDTSKKDCKQNPPHDPLQPNDPR 404
MS D L AV+ A V +++ AAG N+
Sbjct: 146 ------MSLGGPEDVPELHEAVKKA-VASQILVMCAAG------------------NEGD 180

Query: 405 NWNSVTTVVTPSWFSDYVLTVGAVDTEGHPLSQGNQGQASTSVAGPWVGIAAPGTDVVGL 464
+ + P + + V++VGA++ + H + S + V + APG D++
Sbjct: 181 GDDRTDELGYPGCY-NEVISVGAINFDRHA--------SEFSNSNNEVDLVAPGEDILS- 230

Query: 465 SPRDDGLINAIDGPDNTLLVPSGTSFSAAIVSGVAALVRAKYPQ-----LSAYQVINRLT 519
P SGTS + V+G AL++ L+ ++ +L
Sbjct: 231 -----------TVPGGKYATFSGTSMATPHVAGALALIKQLANASFERDLTEPELYAQLI 279

Query: 520 RTARAPARGVDNQVGHGIVDPVAA----LTWD 547
+ G+G++ A +D
Sbjct: 280 KRTIPLG-NSPKMEGNGLLYLTAVEELSRIFD 310



Score = 69.9 bits (171), Expect = 5e-15
Identities = 31/94 (32%), Positives = 43/94 (45%), Gaps = 7/94 (7%)

Query: 77 TDFRLQPKYMDMLNLQEAWQFGRGAGQKVAVIDTGVTP-HPRFP-HLIPGGDYIMSGDG- 133
P+ ++M+ W RG G KVAV+DTG HP +I G ++ +G
Sbjct: 17 QQVNEIPRGVEMIQAPAVWNQTRGRGVKVAVLDTGCDADHPDLKARIIGGRNFTDDDEGD 76

Query: 134 ---LSDCDAHGTLVASMIGAASADGAGVPPAAPR 164
D + HGT VA I AA+ + GV AP
Sbjct: 77 PEIFKDYNGHGTHVAGTI-AATENENGVVGVAPE 109


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1513   HTHFIS        29     0.041    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.4 bits (66), Expect = 0.041
Identities = 28/143 (19%), Positives = 45/143 (31%), Gaps = 37/143 (25%)

Query: 239 AVELLRRVYSRDPKFTPAREALDNPTYRLVLT---DPET-IEAR--------TDPWDPDS 286
A +LL R+ P LV++ T I+A P+D
Sbjct: 62 AFDLLPRIKKARPD-----------LPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTE 110

Query: 287 APTRAEAEAARHAEEAAKYLAEGDAELNAMLGMERAKREIKLIKSTTKVNLARAKMGLPV 346
A + L + + ++G A +EI + M +
Sbjct: 111 L-IGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEI--------YRVLARLMQTDL 161

Query: 347 PVTSRHTLLLGPPGTGKTSVARA 369
+ ++ G GTGK VARA
Sbjct: 162 TL-----MITGESGTGKELVARA 179


27  MAP1588c  MAP1606c  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
MAP1588c  2       9        -0.760824   AhpD
MAP1589c  2       10       -0.162915   hypothetical protein
MAP1590   1       9        -0.359262   OxyR
MAP1591   -2      9        -1.345620   hypothetical protein
MAP1592   -2      9        -1.381861   hypothetical protein
MAP1593   0       11       -0.930037   hypothetical protein
MAP1594c  1       10       -0.479977   hypothetical protein
MAP1595   1       10       -0.439107   BfrA
MAP1596   2       11       -0.805825   hypothetical protein
MAP1597   2       13       -1.250828   hypothetical protein
MAP1598   2       11       -0.504606   hypothetical protein
MAP1599   1       9        -1.182773   GlnA3
MAP1600   2       9        -1.907573   hypothetical protein
MAP1601   1       9        -2.530894   hypothetical protein
MAP1602c  1       11       -2.124736   hypothetical protein
MAP1603c  2       10       -0.966916   hypothetical protein
MAP1604c  2       10       -2.479714   hypothetical protein
MAP1605c  2       14       -2.884131   short chain dehydrogenase
MAP1606c  2       16       -1.875269   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1596   TCRTETB       146    6e-40    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 146 bits (369), Expect = 6e-40
Identities = 94/426 (22%), Positives = 188/426 (44%), Gaps = 18/426 (4%)

Query: 24 SPQRRNLIFVAIVLGMLLAALDQTIVATALPTIVANLGDAGHQ-SWVVTSYLLASTIVTA 82
S R N I + + + + L++ ++ +LP I + +WV T+++L +I TA
Sbjct: 8 SNLRHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTA 67

Query: 83 LVGKLGDLYGRKRVFQAAVLFFVAGSVLCGLAQSMA-MLVGARALQGIGGGGITVTASAL 141
+ GKL D G KR+ ++ GSV+ + S +L+ AR +QG G +
Sbjct: 68 VYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVV 127

Query: 142 IGEVVPLRERGRYQGILGAVFGVTTVIGPLLGGYFTDYLSWRWAFWVNVPVSVIVIFVAA 201
+ +P RG+ G++G++ + +GP +GG Y+ W++ + +P+ I+
Sbjct: 128 VARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIH--WSYLLLIPMITIITVPFL 185

Query: 202 AAIPALAASAKPVIDYAGIVFVGLGAAGLTLATSWGGSRYPWGSPTITGLFAAAAVALGV 261
+ K D GI+ + +G L T + Y ++ L +
Sbjct: 186 MKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFT----TSYSISFLIVSVLS------FLI 235

Query: 262 FVVVERRAAEPILPVRLFASPVFTVCCVLSFVVGFAMLGAMTFLPTYMQYVDGVSATTSG 321
FV R+ +P + L + F + + ++ + G ++ +P M+ V +S G
Sbjct: 236 FVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIG 295

Query: 322 LRTL-PMVVGMLFTSTGSGTIVGRTGRYKIFPVAGTALMALAFLLMSRMQPSTPAVIQSL 380
+ P + ++ G +V R G + + G ++++FL S + +T + +
Sbjct: 296 SVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNI-GVTFLSVSFLTASFLLETTSWFMTII 354

Query: 381 YLFILGAGIGLSMQVLILIVQNTSDFEDLGVATSGVRFFRTIGSSFGAAIFGSLF-VNFL 439
+F+LG G+ + V+ IV ++ ++ G S + F + G AI G L + L
Sbjct: 355 IVFVLG-GLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLL 413

Query: 440 NRRIGP 445
++R+ P
Sbjct: 414 DQRLLP 419


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1605c  DHBDHDRGNASE  67     5e-15    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 66.6 bits (162), Expect = 5e-15
Identities = 52/190 (27%), Positives = 82/190 (43%), Gaps = 8/190 (4%)

Query: 2 KSIFITGAGSGMGREGAKLFHAKGWRVGAVDRNDDGLATLQQELGDDRLWTRA--VDVTD 59
K FITGA G+G A+ ++G + AVD N + L + L + A DV D
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 60 KAALDGALADFCAGNTGGGLDMMWNNAGIGESGWFEDVPYDAAMRVVDVNYKAVLTGAYG 119
AA+D A G +D++ N AG+ G + + VN V +
Sbjct: 69 SAAIDEITARIER--EMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126

Query: 120 ALPYLKKSAGSLMFSTSSSSATYGMPR--LAVYSSTKHAVKGLTEALSVEWQRHGVRVAD 177
Y+ + + S+ A G+PR +A Y+S+K A T+ L +E + +R
Sbjct: 127 VSKYMMDRRSGSIVTVGSNPA--GVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184

Query: 178 VLPGLIDTAI 187
V PG +T +
Sbjct: 185 VSPGSTETDM 194


28  MAP1627  MAP1644  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
MAP1627   0       18       -3.104097   hypothetical protein
MAP1628   0       25       -3.830551   hypothetical protein
MAP1629c  1       32       -4.024865   Aao
MAP1630c  2       39       -6.072395   hypothetical protein
MAP1631c  4       43       -7.447431   hypothetical protein
MAP1632c  2       40       -7.447562   hypothetical protein
MAP1633c  3       38       -6.055665   hypothetical protein
MAP1634   1       34       -5.858039   hypothetical protein
MAP1635c  1       29       -5.251753   hypothetical protein
MAP1636c  0       25       -3.847290   hypothetical protein
MAP1637c  0       20       -2.582034   hypothetical protein
MAP1638c  1       12       -0.551923   hypothetical protein
MAP1639c  1       15       -0.167357   hypothetical protein
MAP1640c  -1      14       -0.151088   hypothetical protein
MAP1641c  -1      14       0.278700    hypothetical protein
MAP1642   -1      10       0.444992    hypothetical protein
MAP1643   -1      10       0.511029    isocitrate lyase
MAP1644   2       14       1.230931    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1630c  PF05616       33     6e-04    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 33.2 bits (75), Expect = 6e-04
Identities = 16/39 (41%), Positives = 22/39 (56%)

Query: 31 PDETHGPAPGPAATPSPAPSTSPSPAASPSPSASPAPAP 69
PD T G A P A P P S + +PA +P+P+ +P P
Sbjct: 313 PDLTPGSAEAPNAQPLPEVSPAENPANNPAPNENPGTRP 351



Score = 28.6 bits (63), Expect = 0.019
Identities = 13/48 (27%), Positives = 18/48 (37%)

Query: 25 VVAGADPDETHGPAPGPAATPSPAPSTSPSPAASPSPSASPAPAPAAP 72
V +P P P P+P P +P A+P P P +P
Sbjct: 331 VSPAENPANNPAPNENPGTRPNPEPDPDLNPDANPDTDGQPGTRPDSP 378


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1631c  HTHTETR       62     4e-14    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 62.3 bits (151), Expect = 4e-14
Identities = 28/169 (16%), Positives = 63/169 (37%), Gaps = 7/169 (4%)

Query: 1 MADEQADSRERLISGTRELLWDRGYVGTSPTAILQQSGVGQGSLYHHFRGKHDLVLAAEQ 60
E ++R+ ++ L +G TS I + +GV +G++Y HF+ K DL +
Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64

Query: 61 QAAADMQRSIKEAFAG-NRSAHDKIADYLTRQREVL-----RGCSVGRLTADPVIVGDDQ 114
+ +++ E A + + L E R + + VG+
Sbjct: 65 LSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMA 124

Query: 115 LRAPVAQTFEV-LHTCLTRTIREGQRSGEISVELEPHKVAAAISATIQG 162
+ + + + + +T++ + + +L + A + I G
Sbjct: 125 VVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISG 173


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1632c  TCRTETB       151    4e-43    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 151 bits (384), Expect = 4e-43
Identities = 100/414 (24%), Positives = 170/414 (41%), Gaps = 24/414 (5%)

Query: 10 LCLGTALIIMEANVLNVAIPSIRQALHASPAQSLWIIDAYTLVLAALLLSAGRLGDRIGA 69
LC+ + ++ VLNV++P I + PA + W+ A+ L + G+L D++G
Sbjct: 19 LCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGI 78

Query: 70 RRCYLLGLAVFSIASVLCALAASSAE-LIAARTIQGVGAAVLIPAPLGLISAMFSDLTAR 128
+R L G+ + SV+ + S LI AR IQG GAA PA + ++ A + R
Sbjct: 79 KRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAA-FPALVMVVVARYIPKENR 137

Query: 129 AKAVAVWVTIGGVGFAAGPLIGGLLVSTFGWRSIFLINIPAAAIIAV-MVRLTVAEASRS 187
KA + +I +G GP IGG++ W +L+ IP II V + + + R
Sbjct: 138 GKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKEVRI 195

Query: 188 PLPFDYVGQALAIVGLSAVVFACVESSALAWMSPFVLLPAVAAALILGLFVIDQRHRGRA 247
FD G L VG+ + S F+++ ++ +FV R
Sbjct: 196 KGHFDIKGIILMSVGIVFFMLFTTSYSIS-----FLIVSVLS----FLIFVKHIRKVTDP 246

Query: 248 GAWVLLPVELLNNRPVNAGLMSGFVYNFTLYGLVLVYSYVFQSARGYSPVQTGLAFA-PL 306
+ L N P G++ G + T+ G V + Y+ + S + G P
Sbjct: 247 ----FVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPG 302

Query: 307 TVAALVTSLPAGRFVAAHGARRGIMIGMALSAIGLCALAFDAQRMPFVVLSIAFGIFAT- 365
T++ ++ G V G + IG+ ++ +F + + + I
Sbjct: 303 TMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWF---MTIIIVFVL 359

Query: 366 -GLSLSATGQTMAVMANASDQYKNTASSMLNTARQTGGVIGVAALGAITSRDLL 418
GLS + T + V ++ Q S+LN G+A +G + S LL
Sbjct: 360 GGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLL 413


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1644   cloacin       40     2e-06    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 39.7 bits (92), Expect = 2e-06
Identities = 30/76 (39%), Positives = 32/76 (42%), Gaps = 6/76 (7%)

Query: 68 GGPGGGGGSIPGGPTGGGGPGGGGGSIPGGPTGGGGPGGGGGSIPGGPTGGGGPGGGGGS 127
G G+I GGPTG G GGG G + GGG GS G G GGG
Sbjct: 11 TGAHSTSGNINGGPTGL-GVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGN- 68

Query: 128 IPGVGGGGGGPGGGGG 143
G GGG G GG
Sbjct: 69 ----GNSGGGSGTGGN 80



Score = 33.9 bits (77), Expect = 2e-04
Identities = 27/70 (38%), Positives = 30/70 (42%), Gaps = 4/70 (5%)

Query: 85 GGPGGGGGSIPGGPTGGGGPGGGGGSIPGGPTGGGGPGGGGGSIP---GVGGGGGGPGGG 141
G G+I GGPTG G GGG G + GGG GS G G G G G G
Sbjct: 11 TGAHSTSGNINGGPTGLG-VGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69

Query: 142 GGCVGNVCGG 151
G+ GG
Sbjct: 70 NSGGGSGTGG 79



Score = 28.9 bits (64), Expect = 0.009
Identities = 30/79 (37%), Positives = 31/79 (39%), Gaps = 13/79 (16%)

Query: 82 TGGGGPG-GGGGSIPGGPTGGGGPGGGGGS------------IPGGPTGGGGPGGGGGSI 128
+GG G G G G GG G G G P G G G GGGS
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 129 PGVGGGGGGPGGGGGCVGN 147
G GGG G GGG G GN
Sbjct: 62 HGNGGGNGNSGGGSGTGGN 80


29  MAP1702c  MAP1738  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MAP1702c    2      21       -1.750723  hypothetical protein
MAP1703c    1      22       -2.805385  hypothetical protein
MAP1704c    2      17       -3.111851  hypothetical protein
MAP1705c    1      17       -2.834434  hypothetical protein
MAP1706     2      17       -3.437626  hypothetical protein
MAP1707     3      16       -2.711806  short chain dehydrogenase
MAP1708     2      14       -3.493480  hypothetical protein
MAP1709c    1      13       -3.063612  FadD11_2
MAP1710     1      10       -3.133743  hypothetical protein
MAP1711c    2      10       -2.986380  hypothetical protein
MAP1712     2      10       -2.883409  hypothetical protein
MAP1713     1      10       -3.342969  FadE20_1
MAP1714     1      13       -2.749132  acetyl-CoA acetyltransferase
MAP1715     2      17       -2.723637  FadB_2
MAP1716     0      25       -2.703894  short chain dehydrogenase
MAP1717     0      25       -2.943174  hypothetical protein
MAP1718c   -1      28       -3.308961  hypothetical protein
MAP1719c   -1      26       -3.133687  hypothetical protein
MAP1720     0      23       -2.521804  hypothetical protein
MAP1721c    0      25       -2.965243  hypothetical protein
MAP1722     0      21       -2.523138  hypothetical protein
MAP1723    -2      21       -3.523654  hypothetical protein
MAP1724c   -1      21       -3.418439  hypothetical protein
MAP1725c   -1      23       -3.320106  hypothetical protein
MAP1726c   -2      26       -3.978307  hypothetical protein
MAP1727    -1      23       -3.941360  hypothetical protein
MAP1728c   -1      25       -3.957490  YfnB
MAP1729c   -1      23       -2.623132  hypothetical protein
MAP1730c    0      20       -1.940136  hypothetical protein
MAP1731c    2      17       -2.024181  hypothetical protein
MAP1732c    2      16       -2.498792  hypothetical protein
MAP1733     1      11       -4.523705  hypothetical protein
MAP1734     2       9       -4.604644  hypothetical protein
MAP1735     1      10       -4.170884  hypothetical protein
MAP1736     1      10       -4.191875  hypothetical protein
MAP1737     1      11       -3.661417  hypothetical protein
MAP1738     1      11       -3.261656  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP1707  DHBDHDRGNASE  113  1e-32  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 113 bits (283), Expect = 1e-32
Identities = 78/254 (30%), Positives = 112/254 (44%), Gaps = 26/254 (10%)

Query: 4 RRVLITGASRGIGRAVADRLAEGGHEPIGLARSAPKD-----------FPGEFYEVDLAD 52
+ ITGA++GIG AVA LA G + + K E + D+ D
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 53 PYATAATLDKIVGHAA-VHAVVNNVGFARFGRLGSIELDHLFDTYNLNVRAAVQVVQAAL 111
A +I + +VN G R G + S+ + T+++N ++
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 112 PGMLDAEWGRIVNVTSLTTLGTPERT--PYAAAKAALEACTRIWAGELASAGITVNAVAP 169
M+D G IV V S G P + YA++KAA T+ ELA I N V+P
Sbjct: 129 KYMMDRRSGSIVTVGS-NPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187

Query: 170 GPTETDMYR---------ERSPVGSAREARFLQSIPLHRVARPREIAHAICFLLDEDAGY 220
G TETDM E+ GS F IPL ++A+P +IA A+ FL+ AG+
Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGSL--ETFKTGIPLKKLAKPSDIADAVLFLVSGQAGH 245

Query: 221 ITGQILRVDGGGSI 234
IT L VDGG ++
Sbjct: 246 ITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP1711c  HTHTETR  50  7e-10  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 50.0 bits (119), Expect = 7e-10
Identities = 25/98 (25%), Positives = 42/98 (42%), Gaps = 1/98 (1%)

Query: 1 MTNPRATNEDLTAKARIRNAALDLYAQYGEDRISLRDIASAAGVTLGLVQHHFKTKAGVR 60
M T + I + AL L++Q G SL +IA AAGVT G + HFK K+ +
Sbjct: 1 MARKTKQEAQET-RQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLF 59

Query: 61 DAVDQLVVDYFAHALAQVPAEGSARHVAAARDEAVARM 98
+ +L + A+ ++ R+ + +
Sbjct: 60 SEIWELSESNIGELELEYQAKFPGDPLSVLREILIHVL 97


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP1716  DHBDHDRGNASE  95  2e-25  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 95.1 bits (236), Expect = 2e-25
Identities = 56/186 (30%), Positives = 83/186 (44%), Gaps = 1/186 (0%)

Query: 6 KVVAITGGARGIGLATAKAFLAAGAKVALGDLDTELAEKQAVELGGDPAVV-GLSLDVSD 64
K+ ITG A+GIG A A+ + GA +A D + E EK L + DV D
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 65 PASFVAFLDDVEARLGRLDVLVSNAGIMPTGPFVDEPPTMSRRMIDVNVYGVLNGSRLAA 124
A+ +E +G +D+LV+ AG++ G VN GV N SR +
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 125 ARFVPRGAGHIVNIASLAGVTGEPGMATYCGTKHFVVGFTESLHRELRPHRVGVSLVLPG 184
+ R +G IV + S MA Y +K V FT+ L EL + + ++V PG
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 185 IINTEL 190
T++
Sbjct: 189 STETDM 194


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP1718c  PF07675  29  0.007  Cleaved Adhesin
		>PF07675#Cleaved Adhesin

Length = 1358

Score = 29.3 bits (65), Expect = 0.007
Identities = 33/126 (26%), Positives = 50/126 (39%), Gaps = 13/126 (10%)

Query: 10 TTAATAALAAAGLLAAAPAFADP----QVLQFGQMAEISSNGGTIDYTVSNLQPSGHNDG 65
T A++ L +A PA ADP Q + E+ GG DY ++N +P+ +
Sbjct: 427 TGTASSNLYSANFEYLTPANADPVVTTQNIIVTGQGEVVIPGGVYDYCITNPEPA--SGK 484

Query: 66 VWYSDVTAKGVSGNAVPNIADFNARAVNSSTFAVMKGNQTDGLPEGPLPLGTPVTGRLYF 125
+W G GN DF A TF + + DG + + +P Y
Sbjct: 485 MW-----IAGDGGNQPARYDDFAFEAGKKYTFTMRRAGMGDGT-DMEVEDDSP-ASYTYT 537

Query: 126 DVRNGT 131
R+GT
Sbjct: 538 VYRDGT 543


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP1719c  HTHTETR  64  7e-15  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 63.9 bits (155), Expect = 7e-15
Identities = 31/165 (18%), Positives = 63/165 (38%), Gaps = 12/165 (7%)

Query: 4 VAQPVRSDAARNREALIEVATRLFAAAAGGDEPSLRLIAREAGVGVGTLFRHFPTREALV 63
+A+ + +A R+ +++VA RLF+ G SL IA+ AGV G ++ HF + L
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQ-GVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLF 59

Query: 64 EAVYQDQVRRLTEGADQLLANHP--PAQAMRRWMDLFTDWLATKHGMLGTLRAMINNEQL 121
+++ + E + A P P +R + + T+ + + + +
Sbjct: 60 SEIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEF 119

Query: 122 GSGHTRI---------ELLAAIDKILAAGRAAGDIGDHISSEDVA 157
+ E I++ L A + + + A
Sbjct: 120 VGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAA 164


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP1720  NUCEPIMERASE  39  1e-05  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 39.4 bits (92), Expect = 1e-05
Identities = 22/81 (27%), Positives = 30/81 (37%), Gaps = 11/81 (13%)

Query: 35 VFVTGGSGLTGPAVVSELLSAGHRVTGLARSAASAD------RLARLGAEPFT---GSLD 85
VTG +G G V LL AGH+V G+ D RL L F L
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDLA 62

Query: 86 DLDRLREGAAAA--DGVIHMA 104
D + + + A+ + V
Sbjct: 63 DREGMTDLFASGHFERVFISP 83


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP1721c  HTHTETR  70  5e-17  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 70.4 bits (172), Expect = 5e-17
Identities = 37/183 (20%), Positives = 63/183 (34%), Gaps = 7/183 (3%)

Query: 19 LPRISREQKERNRGRILAAAGEGFKARGIDGVGIDELMKAAGMSHGGFYNHFPSKEDLAL 78
+ R ++++ + R IL A F +G+ + E+ KAAG++ G Y HF K DL
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 79 EVLHQGFTDSLDTVAAVIDTHAHSGRAALHAIIDTYLSTEHRDHPEHGCASAALAADAGR 138
E+ ++ + + L I+ L + + +
Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRL-LMEIIFHKCEF 119

Query: 139 HGVKA--QEAYRRGLQGYIGAFADLLRVSARQRG---TKLDARRAREQAIGLFSQMVGAQ 193
G A Q+A R L+ + L RRA G S ++
Sbjct: 120 VGEMAVVQQAQRNLCLESYDRIEQTLK-HCIEAKMLPADLMTRRAAIIMRGYISGLMENW 178

Query: 194 LIA 196
L A
Sbjct: 179 LFA 181


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP1726c  HTHTETR  74  1e-18  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 73.9 bits (181), Expect = 1e-18
Identities = 33/175 (18%), Positives = 66/175 (37%), Gaps = 8/175 (4%)

Query: 1 MTRTQQRAAENRRTVIDAAREIIATQGVEALTLEAVAEKADVVVQTIYNRVGGRSALLTA 60
+T+Q A E R+ ++D A + + QGV + +L +A+ A V IY +S L +
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 61 VAEQALEQSRVYM-DPAYEADGTVEERMMLAANAYARFARERPHEFRILVEPPNEPEAVA 119
+ E + + + G + ++ ++ E V
Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVG 121

Query: 120 RIAELTRAQN-------ARLTAVLREGMAAGLIRADLDPDDVTTALWATFNGLLA 167
+A + +AQ R+ L+ + A ++ ADL + +GL+
Sbjct: 122 EMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLME 176


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP1732c  HTHTETR  75  6e-19  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 75.4 bits (185), Expect = 6e-19
Identities = 36/143 (25%), Positives = 70/143 (48%), Gaps = 3/143 (2%)

Query: 14 RRKRADGEMSRERILDAATEIAAERGYEATSIGLVSAKCGLPASSIYWHFKNKDDLIAAV 73
R+ + + + +R+ ILD A + +++G +TS+G ++ G+ +IYWHFK+K DL + +
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 74 IERSFADWRKAWQVPDEGAPRDRLAGLAMQIAKVLMDSP--DFIRLGLMLALERRPVEPR 131
E S ++ + P D L+ L +I +++S + R LM + +
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLR-EILIHVLESTVTEERRRLLMEIIFHKCEFVG 121

Query: 132 ARAMFIQARAQAYDELADIVREL 154
A+ QA+ E D + +
Sbjct: 122 EMAVVQQAQRNLCLESYDRIEQT 144


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP1735  ALARACEMASE  29  0.029  Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 28.6 bits (64), Expect = 0.029
Identities = 17/59 (28%), Positives = 22/59 (37%), Gaps = 5/59 (8%)

Query: 235 GLAPAWIGVGTLDLFYPECLEYARRLREAGVPAQEEIVPGAFHAFDQIVDKAPISAKFF 293
G+ W +G D F LE A LRE G ++ G FHA I +
Sbjct: 42 GIERIWSAIGATDGFALLNLEEAITLRERGWKGPILMLEGFFHA-----QDLEIYDQHR 95


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP1736  HTHTETR  54  3e-11  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 53.9 bits (129), Expect = 3e-11
Identities = 34/175 (19%), Positives = 61/175 (34%), Gaps = 9/175 (5%)

Query: 1 MANPVGLRERRRRQTSADIRDAAVRLTLERGFDKVTVDEICAEAGISTRTFFNYFPNKES 60
MA + RQ I D A+RL ++G ++ EI AG++ + +F +K
Sbjct: 1 MARKTKQEAQETRQH---ILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSD 57

Query: 61 ---AIAYGPSDIPPELVADFVAAGPAPYSVVLAELITLAAHHLRDVPPRREHAANMLELA 117
I EL ++ A P VL E++ + RR ++
Sbjct: 58 LFSEIWELSESNIGELELEYQAKFPGDPLSVLREIL-IHVLESTVTEERRRLLMEIIFHK 116

Query: 118 KTSPAVLAAFLADLERFQNQLTDIIVR--RQGMQPDDEMAPLISALALTAVRSGI 170
+A + D I + + ++ A L++ A +R I
Sbjct: 117 CEFVGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYI 171


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP1738  ACRIFLAVINRP  45  1e-06  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 45.2 bits (107), Expect = 1e-06
Identities = 46/253 (18%), Positives = 94/253 (37%), Gaps = 32/253 (12%)

Query: 139 AQSNDGKASYVQVYLAGNQGEALANESVESVQNIVKSVQA--PNGVK---AYVTGP---A 190
A+ N A+ + + L A A ++ ++++ + +Q P G+K Y T P
Sbjct: 279 ARINGKPAAGLGIKL---ATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQL 335

Query: 191 ALSADQHTAGDRSLQLITAATFTVIIGMLLLVYRSVITVLLTLVMVVLELSAARGMVAFL 250
++ T L A ++ + L +++ L+ + V + L ++A
Sbjct: 336 SIHEVVKT-------LFEAIMLVFLV--MYLFLQNMRATLIPTIAVPVVLLGTFAILAAF 386

Query: 251 GYYKIIGLSTFATNLLVTLAIAAATDYAIFLIGRYQEARAVGES--REDAYYTMYKGTAH 308
GY I L+ F + LAI D AI ++ + + +E +M +
Sbjct: 387 GY-SINTLTMFG----MVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGA 441

Query: 309 VVAGSGMTIAGATFCLHFTNL--PYFQTLGIPLAIGMVVVVAAALTLGPAVISVASRFRQ 366
+V + + A F ++ I + M + V AL L PA + + +
Sbjct: 442 LVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPA---LCATLLK 498

Query: 367 TLEPRRTQRIRGW 379
+ + G+
Sbjct: 499 PVSAEHHENKGGF 511



Score = 39.8 bits (93), Expect = 6e-05
Identities = 38/203 (18%), Positives = 75/203 (36%), Gaps = 18/203 (8%)

Query: 740 KAAYEALKGTPLEGSKIYLAGTASIYKDLSDGNNYDLLIAGISSLCLIFIIMLIITRGVV 799
KA L+ +G K+ + + LS ++ ++ L+F++M + + +
Sbjct: 307 KAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHE---VVKTLFEAIMLVFLVMYLFLQNMR 363

Query: 800 ASAVIVGTVLLSLGASFGLSVLIWQHLIGIELHWMVL-AMSVIILLAVGADYNLLLV--- 855
A+ + V + L +F + G ++ + + M + I L V D +++V
Sbjct: 364 ATLIPTIAVPVVLLGTFAI-----LAAFGYSINTLTMFGMVLAIGLLV--DDAIVVVENV 416

Query: 856 ARFKEEIHAGLNTGIIRSMGGTGSVVTSAGLVFAFT---MMTMAVSELTVIGQVGTTIGL 912
R E +SM + +V + M S + Q TI
Sbjct: 417 ERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVS 476

Query: 913 GLLFDTLIVRSLMTPSIAALLGK 935
+ L+ L TP++ A L K
Sbjct: 477 AMALSVLVALIL-TPALCATLLK 498


30  MAP1768c  MAP1773c  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MAP1768c    2       9       -1.110938  hypothetical protein
MAP1769c    3      10       -0.121649  hypothetical protein
MAP1770c    4      12        0.646354  hypothetical protein
MAP1771c    4      13        0.495244  hypothetical protein
MAP1772     3      10        1.801689  hypothetical protein
MAP1773c    3      11        2.270030  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP1769c  MALTOSEBP  64  3e-13  Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 64.4 bits (156), Expect = 3e-13
Identities = 69/261 (26%), Positives = 109/261 (41%), Gaps = 36/261 (13%)

Query: 88 GRCPDVLMAWELSYAELADRGVLLDLGPLLARDKAFAQQLQADSIPALYETFTFNGKQYA 147
G PD++ + A G+L ++ P DKAF +L P ++ +NGK A
Sbjct: 80 GDGPDIIFWAHDRFGGYAQSGLLAEITP----DKAFQDKL----YPFTWDAVRYNGKLIA 131

Query: 148 LPEQWSGNYLFYNKRLFDEAGVPSPPAAWEHPWGFSEFLNAARALTKRDASGRAAQYGFV 207
P L YNK L +P+PP WE + L A G++A
Sbjct: 132 YPIAVEALSLIYNKDL-----LPNPPKTWEEIPALDKELKA---------KGKSAL--MF 175

Query: 208 NTWGSYYSAGLFAMNNG--VPWSDPRLNPTHFNFDNAAFQEAVQFYADL-ANKYRVAPNG 264
N Y++ L A + G + + + + DNA + + F DL NK+ A
Sbjct: 176 NLQEPYFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNA--- 232

Query: 265 SETQSMSTPNLFAVGRAAMALGGHWRYQTYLRAEGLDFDVAPLPVGPAVGKGQPACSDIG 324
+T F G AM + G W + ++ +++ V LP KGQP+ +G
Sbjct: 233 -DTDYSIAEAAFNKGETAMTINGPWAWSNIDTSK-VNYGVTVLPTF----KGQPSKPFVG 286

Query: 325 ATGLAISSSSPRKEQAWEFVK 345
I+++SP KE A EF++
Sbjct: 287 VLSAGINAASPNKELAKEFLE 307


31  MAP1808c  MAP1820  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MAP1808c    0       8        3.209407  hypothetical protein
MAP1809c    1       9        3.072942  hypothetical protein
MAP1810     1       9        3.582963  CobG
MAP1811     2       9        3.081069  precorrin-8X methylmutase
MAP1812     0       9        2.775624  CobI
MAP1813c    1      10        2.642646  hypothetical protein
MAP1814     2      11        2.087440  hypothetical protein
MAP1815c    1      13        1.409792  cobalt-precorrin-6x reductase
MAP1816c    1      17       -1.351057  hypothetical protein
MAP1817c    2      26       -1.614225  CobL
MAP1818c    3      27       -1.840442  hypothetical protein
MAP1819c    2      29       -2.586666  hypothetical protein
MAP1820     2      27       -2.405142  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP1810  cloacin  34  8e-04  Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 34.3 bits (78), Expect = 8e-04
Identities = 22/72 (30%), Positives = 34/72 (47%), Gaps = 11/72 (15%)

Query: 212 AECGPIAY--PAVTKPPVGWITQDDGRVTLGAAVPLGLLSARVAEYLAALQAPLVITPWR 269
A P+A+ PA++ P G L ++ G LSA +A+ +AAL+ P W
Sbjct: 83 AVAAPVAFGFPALSTPGAG---------GLAVSISAGALSAAIADIMAALKGPFKFGLWG 133

Query: 270 SVLVGDLREEVA 281
L G L ++A
Sbjct: 134 VALYGVLPSQIA 145


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP1819c  DHBDHDRGNASE  37  2e-05  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 37.0 bits (85), Expect = 2e-05
Identities = 37/171 (21%), Positives = 55/171 (32%)

Query: 9 VLILGGRSEIGVELARRLAPGTTVVLAARNADRVNDQVDALKAAGASAVHTREFDADDLA 68
I G IG +AR LA + A ++V + A A D D A
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSA 70

Query: 69 SHGPLVASVVADHGPIGTAVLAFGILGDQARAETDAEHAVAIVHTDYVAQVSMLTHLAIA 128
+ + A + + GPI V G+L E A + + ++
Sbjct: 71 AIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSKY 130

Query: 129 MRAAGRGQLVVFSSIAGARVRRANYVYGSAKAGLDGFASGLADALHGTGVR 179
M G +V S R + Y S+KA F L L +R
Sbjct: 131 MMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIR 181


32  MAP1969c  MAP1983c  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MAP1969c    2       9        0.317997  hypothetical protein
MAP1970     1      11        0.546454  3-methyl-2-oxobutanoate
MAP1971     2      13        0.293362  hypothetical protein
MAP1972c    3      15        0.660963  hypothetical protein
MAP1973     1      13        1.024685  hypothetical protein
MAP1974    -1       8        0.495321  hypothetical protein
MAP1975     0      10        0.274727  hypothetical protein
MAP1976     0      10        0.491706  hypothetical protein
MAP1977c    0      11        1.701409  hypothetical protein
MAP1978     0      11        2.686647  hypothetical protein
MAP1979     0      11        2.642423  hypothetical protein
MAP1980c   -1      13        3.447373  bifunctional RNase H/acid phosphatase
MAP1981c    1      13        3.441032  hypothetical protein
MAP1982c    1      14        4.704430  hypothetical protein
MAP1983c    1      13        3.738402  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP1981c  GPOSANCHOR  32  0.002  Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 32.0 bits (72), Expect = 0.002
Identities = 29/171 (16%), Positives = 52/171 (30%)

Query: 2 KADVAQQRSLLELANVDAELSRLAHRAEHLPEQQACERMQQEYDAAGDRLGAVRIALEDI 61
+A A+ LE A + + + R A + I
Sbjct: 189 EARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKI 248

Query: 62 DAHVLRLEAEVDAVRQREDRDRSLLQSGAIDAKQLADLQHELETLQRRQTSLEDSLLEVM 121
A + E + D+ ++ L+ E L+ + LE +
Sbjct: 249 KTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLN 308

Query: 122 ERREELQAQLDGEQQALKELEAEMATARRDLDAARGEISESRALHSSRRDA 172
R+ L+ LD ++A K+LEAE + R + R+A
Sbjct: 309 ANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREA 359


33  MAP2021c  MAP2026  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MAP2021c   -1      19       -3.006340  hypothetical protein
MAP2022    -1      23       -4.078925  hypothetical protein
MAP2023c    0      31       -5.917369  hypothetical protein
MAP2024c   -1      32       -6.187263  hypothetical protein
MAP2025c    0      28       -6.326530  hypothetical protein
MAP2026     1      22       -4.265579  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP2023c  HTHTETR  43  2e-07  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 42.7 bits (100), Expect = 2e-07
Identities = 31/178 (17%), Positives = 63/178 (35%), Gaps = 13/178 (7%)

Query: 16 LEVGYTVLAEEGVRALKVERLCQQAGVTRGSFYWHFEDIDN-YRAALVESWNKFLERDRQ 74
L+V + +++GV + + + + AGVTRG+ YWHF+D + + S + E + +
Sbjct: 17 LDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELE 76

Query: 75 ALSELDSLPPRQRLSAMMGTLVSPQYWMLERAMR-----------EWARLDPVAAENIRA 123
++ P ++ L S R + E A +
Sbjct: 77 YQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCLE 136

Query: 124 ADRHLLRTVTKAYRDYGF-SPEDAKLRAELTFAAGIGLLHLTGSAEQAQSLAQRERFL 180
+ + +T+ + + A + GL+ A Q+ L + R
Sbjct: 137 SYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKKEARDY 194


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP2026  YERSSTKINASE  30  0.036  Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 29.7 bits (66), Expect = 0.036
Identities = 31/138 (22%), Positives = 54/138 (39%), Gaps = 12/138 (8%)

Query: 95 QTLHEMLKTGSLEPRRATDIIRQVASAL----DAAHAAGLIHRDVKPQNIIVT-PDDFAY 149
+TL + K G + I+ +A L + AG++H D+KP N++
Sbjct: 227 RTLADSWKQGKINSEAYWGTIKFIAHRLLDVTNHLAKAGVVHNDIKPGNVVFDRASGEPV 286

Query: 150 LVDFGIAEARGDTHLTMAGHTVGTFDYMAPER-FGDEETTSAVDVYALACVLYEALTGAK 208
++D G+ G+ T + APE G+ + DV+ + L + G +
Sbjct: 287 VIDLGLHSRSGEQPKGF------TESFKAPELGVGNLGASEKSDVFLVVSTLLHCIEGFE 340

Query: 209 PFPVHSAEQAIRAHLSSP 226
P Q +R S P
Sbjct: 341 KNPEIKPNQGLRFITSEP 358


34  MAP2068c  MAP2079  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MAP2068c    2      13       -1.557080  hypothetical protein
MAP2069c    2      16       -0.892056  heat shock protein 90
MAP2070     0      18        0.562722  hypothetical protein
MAP2071c    0      17       -0.292394  hypothetical protein
MAP2072c   -1      16       -0.479006  hypothetical protein
MAP2073c   -1      14       -0.251076  hypothetical protein
MAP2074c    2      22       -1.709161  hypothetical protein
MAP2075c    2      22       -2.036969  hypothetical protein
MAP2076c    2      21       -1.945522  hypothetical protein
MAP2077c    2      21       -1.678086  hypothetical protein
MAP2078c    2      20       -1.657878  hypothetical protein
MAP2079     3      20       -2.034558  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP2073c  RTXTOXINA  28  0.044  Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 28.0 bits (62), Expect = 0.044
Identities = 24/106 (22%), Positives = 46/106 (43%), Gaps = 2/106 (1%)

Query: 24 AGVLGANDGIVSTAGIVVGVAAATALRAPILTAG-SAGLVAGAVSMALGEYVSVSTQRDT 82
AGV + + + A + T+ +AGL+A AV++A+ +S
Sbjct: 271 AGVELTTKVLGNVGKGISQYIIAQRAAQGLSTSAAAAGLIASAVTLAISPLSFLSIADKF 330

Query: 83 EKALLIQEHQELRDDPAAELDE-LAALYEAKGLTAATARTVAEELT 127
++A I+E+ + + D LAA ++ G A+ T++ L
Sbjct: 331 KRANKIEEYSQRFKKLGYDGDSLLAAFHKETGAIDASLTTISTVLA 376


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP2078c  PF06580  40  3e-05  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 40.2 bits (94), Expect = 3e-05
Identities = 19/86 (22%), Positives = 29/86 (33%), Gaps = 14/86 (16%)

Query: 811 VANAIEHGHRHSPQ-GTIRLGATALGDQVRLTITDTGTWKVPQATSYPHRGRGIPL---- 865
V N I+HG PQ G I L T V L + +TG+ + G L
Sbjct: 264 VENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNT----KESTGTGLQNVR 319

Query: 866 --MKSLMND---VDIRSDTGGTTVQL 886
++ L + + G +
Sbjct: 320 ERLQMLYGTEAQIKLSEKQGKVNAMV 345


35  MAP2147c  MAP2155  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MAP2147c    3      23       -3.695143  hypothetical protein
MAP2148     3      37       -7.658012  hypothetical protein
MAP2149c    3      42       -8.468684  hypothetical protein
MAP2150     3      41       -9.602922  hypothetical protein
MAP2151     2      40       -9.550814  hypothetical protein
MAP2152c    3      36       -7.630825  hypothetical protein
MAP2153     3      34       -7.490311  hypothetical protein
MAP2154c    1      26       -5.421910  hypothetical protein
MAP2155    -1      17       -4.163858  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP2147c  TONBPROTEIN  36  2e-04  Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 36.1 bits (83), Expect = 2e-04
Identities = 26/98 (26%), Positives = 33/98 (33%), Gaps = 6/98 (6%)

Query: 71 PPSNAPTRPPEVPVPAYEPPPIP-VPVQSPLVLPPPNAGQGAVPVLEINPPGPDAPRDAP 129
PP P V P EP PIP P ++P+V+ P P P
Sbjct: 57 PPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKP 116

Query: 130 VVAPPRGLDAPSLAIAAAKAAPVTRVDPANPPKPPTQV 167
V + P +P A A+ T KP T V
Sbjct: 117 VESRP---ASPFENTAPARLTSSTATAA--TSKPVTSV 149



Score = 31.9 bits (72), Expect = 0.004
Identities = 21/93 (22%), Positives = 28/93 (30%), Gaps = 5/93 (5%)

Query: 75 APTRPPEVPVPAYEPPPIPVPVQSPLVLPPPNAGQGAVPVLEINPPGPDAPRDAPVVAPP 134
A PP+ P EP P P P+ PP PV+ P P+ PV
Sbjct: 53 ADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKE-----APVVIEKPKPKPKPKPKPVKKVQ 107

Query: 135 RGLDAPSLAIAAAKAAPVTRVDPANPPKPPTQV 167
+ + A+P PA
Sbjct: 108 EQPKRDVKPVESRPASPFENTAPARLTSSTATA 140



Score = 30.3 bits (68), Expect = 0.012
Identities = 17/100 (17%), Positives = 28/100 (28%), Gaps = 5/100 (5%)

Query: 68 VQGPPSNAPTRPPEVPVPAYEPPPIPVPVQSPLVLPPPNAGQGAVPVLEINPPGPDAPRD 127
P P PE P A P P P P + ++ P +P +
Sbjct: 67 PVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESRPASPFE 126

Query: 128 ----APVVAPPRGLDAPSLAIAAAKA-APVTRVDPANPPK 162
A + + + A ++R P P +
Sbjct: 127 NTAPARLTSSTATAATSKPVTSVASGPRALSRNQPQYPAR 166


36  MAP2180c  MAP2201  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MAP2180c    1      11       -3.027917  hypothetical protein
MAP2181c    1      12       -2.861847  hypothetical protein
MAP2182c    1      12       -2.813768  hypothetical protein
MAP2183c    0      12       -2.464559  hypothetical protein
MAP2184c    0      15       -2.447468  hypothetical protein
MAP2185c    1      15       -3.108536  hypothetical protein
MAP2186c    1      16       -2.986380  hypothetical protein
MAP2187c    0      14       -3.068516  hypothetical protein
MAP2188c    0      16       -3.271745  hypothetical protein
MAP2189    -1      18       -3.621945  hypothetical protein
MAP2190    -1      21       -3.745773  hypothetical protein
MAP2191    -1      25       -4.054581  hypothetical protein
MAP2192    -2      25       -3.526023  hypothetical protein
MAP2193    -2      26       -3.223734  hypothetical protein
MAP2194    -2      22       -3.522779  hypothetical protein
MAP2195    -1      21       -3.430621  hypothetical protein
MAP2196    -1      17       -3.330192  hypothetical protein
MAP2197    -1      14       -3.586209  hypothetical protein
MAP2198     0      16       -3.746585  hypothetical protein
MAP2199    -1      16       -4.532422  hypothetical protein
MAP2200    -1      13       -3.334592  hypothetical protein
MAP2201     1      11       -3.471655  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP2181c  HTHTETR  79  4e-20  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 78.9 bits (194), Expect = 4e-20
Identities = 39/203 (19%), Positives = 81/203 (39%), Gaps = 5/203 (2%)

Query: 24 QRSRERIANQVRLMLDAALRLIREKG-DSFTTQELVKEAGVALQTFYRYFATKDELLLAV 82
+++++ + +LD ALRL ++G S + E+ K AGV Y +F K +L +
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 83 IADAMTDACARWRDAARDLP-DPVARLRFYVTAVIEVLDNEQGDGGTAKFVVSTHWRLHR 141
+ ++ + P DP++ LR + V+E E+ + + +
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 142 VFPDELAEAEKPF--VDLLLGEINAGIEAG-LLAPADPKWAAWFIAELVRSVYHYYAYAP 198
+ + A+ D + + IEA L A + AA + + + + +AP
Sbjct: 123 MAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAP 182

Query: 199 REVDVKEQLWQFCLTALGGTARK 221
+ D+K++ + L
Sbjct: 183 QSFDLKKEARDYVAILLEMYLLC 205


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP2184c  DHBDHDRGNASE  104  4e-29  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 104 bits (261), Expect = 4e-29
Identities = 67/255 (26%), Positives = 118/255 (46%), Gaps = 6/255 (2%)

Query: 5 LAGKSAIVTGAGSGVGRVSALRFAEEGARVVAADIDLDHAKETVCQIESAGGTAIAIGTD 64
+ GK A +TGA G+G A A +GA + A D + + ++ V +++ A A D
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 65 VSDEQQVQAMIAAAVDQYGRLDILFNNVGIPTPRLGMIFEDHTLEDFNRLVAVNLGGVFL 124
V D + + A + G +DIL N G+ R G+I + E++ +VN GVF
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGV--LRPGLIHS-LSDEEWEATFSVNSTGVFN 122

Query: 125 GCKYAVLRFKEQGAGGVILNTASVAGLVGWGGSVYGATKGGVIQLTRAVAIEAAPFGIRV 184
+ ++ +G ++ ++ AG+ + Y ++K + T+ + +E A + IR
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 185 NAICPAAMPLTGFMAAGGLEVDAEQQ--AAIAESVGGQHPLGRAITAEDCAEAALYLVSD 242
N + P + T + + + +Q E+ PL + D A+A L+LVS
Sbjct: 183 NIVSPGSTE-TDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSG 241

Query: 243 AARNVTGVALPVDGG 257
A ++T L VDGG
Sbjct: 242 QAGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP2193  PYOCINKILLER  31  0.011  Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 30.5 bits (68), Expect = 0.011
Identities = 17/73 (23%), Positives = 23/73 (31%), Gaps = 5/73 (6%)

Query: 56 QRRQDAGQARSRRR-----RGLGETRCGGARKRGGRRRTDQPAGFDARGAQSPAGPTAAG 110
Q R + A E A+++ + Q A A PA +
Sbjct: 201 QIRMNTLTAAKASIEAAAANKAREQAAAEAKRKAEEQARQQAAIRAANTYAMPANGSVVA 260

Query: 111 TAAARGHHPAGQG 123
TAA RG QG
Sbjct: 261 TAAGRGLIQVAQG 273


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP2198  DHBDHDRGNASE  62  2e-13  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 62.0 bits (150), Expect = 2e-13
Identities = 53/212 (25%), Positives = 87/212 (41%), Gaps = 10/212 (4%)

Query: 9 SKTSRGALAVVTGAGSGIGAAFALELGKRGGTVVCSDIDQAAAQRTADAITQHGAKALAT 68
+K G +A +TGA GIG A A L +G + D + ++ ++ A A
Sbjct: 3 AKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAF 62

Query: 69 RCDVSQFGDVQALAEQSQSCFG--APPTLVINNAGVGAGGAAIGDAPLDDWQWTLGINLW 126
DV D A+ E + P +++N AGV G I ++W+ T +N
Sbjct: 63 PADVR---DSAAIDEITARIEREMGPIDILVNVAGVLRPGL-IHSLSDEEWEATFSVNST 118

Query: 127 GPIHGCHVFTPILRDAAPSAAPRGIINVASAAAFGAAPGMAAYNVSKAGVLSLSETLAAE 186
G + + + D + I+ V S A MAAY SKA + ++ L E
Sbjct: 119 GVFNASRSVSKYMMDRRSGS----IVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLE 174

Query: 187 LSGTPVRVTVLCPTFVKTNILESGRISEESGE 218
L+ +R ++ P +T++ S E E
Sbjct: 175 LAEYNIRCNIVSPGSTETDMQWSLWADENGAE 206


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP2199  FbpA_PF05833  29  0.023  Fibronectin-binding protein
		>FbpA_PF05833#Fibronectin-binding protein

Length = 577

Score = 29.1 bits (65), Expect = 0.023
Identities = 20/67 (29%), Positives = 33/67 (49%), Gaps = 10/67 (14%)

Query: 161 FEKINNDESRHLVVDFEVLDMIGHAKVRRLLIDFVGHHA-------TPGLIIGAITG-AP 212
+IN D R +V+DFE D +G + L+I+ +G H+ +I+ +I P
Sbjct: 91 IHQINQD--RIVVIDFESTDELGFNSIYSLIIEIMGRHSNMTLIRKRDNIIMDSIKHITP 148

Query: 213 LINRIRN 219
IN R+
Sbjct: 149 DINTYRS 155


37  MAP2341c  MAP2346c  Y  N  N  Genomic Island
LocusTag   DNBias  CDNBias  %GCBias    Product
MAP2341c    2       9       -0.968317  hypothetical protein
MAP2342c    2       9       -0.700895  hypothetical protein
MAP2343c    3       9       -0.468666  hypothetical protein
MAP2344     6      10       -0.190172  hypothetical protein
MAP2345c    4      11        0.742543  hypothetical protein
MAP2346c    3      10        0.729836  hypothetical protein
38  MAP2441c  MAP2453c  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MAP2441c    3      14       -0.099642  hypothetical protein
MAP2442     2      11       -0.435360  hypothetical protein
MAP2443     1      11       -0.466273  hypothetical protein
MAP2444c    0      13       -1.303102  hypothetical protein
MAP2445     4      12       -1.237746  Ogt
MAP2446c    2      14       -1.728087  hypothetical protein
MAP2447c    1      13       -1.962989  UDP-N-acetylglucosamine
MAP2448     5      14       -2.947911  hypothetical protein
MAP2449c    5      12       -2.632845  hypothetical protein
MAP2450c    5      12       -2.837127  F0F1 ATP synthase subunit epsilon
MAP2451c    3      12       -2.800538  F0F1 ATP synthase subunit beta
MAP2452c    3      10       -3.242369  F0F1 ATP synthase subunit gamma
MAP2453c    3      10       -3.125352  F0F1 ATP synthase subunit alpha
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP2441c  TCRTETB  42  4e-06  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 41.8 bits (98), Expect = 4e-06
Identities = 67/421 (15%), Positives = 134/421 (31%), Gaps = 58/421 (13%)

Query: 48 LPALSAHYRIGPATAALTVSLTTGALALSIIPASVLSERYGRIRVMLISGVASSVIGLLL 107
LP ++ + PA+ + ++ LS++ G R++L + + ++
Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96

Query: 108 PFSPS-LGVLLFGRAAQGVALAGIPAVAMALLAEEVDASSLGSAMGRYIAGTTIGGLAGR 166
S +L+ R QG A PA+ M ++A + + G A G + +G G
Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156

Query: 167 IVPSVVVQVGTWRVALLACSLITLAGTAVFAVLVPRSR------------FFTPKPASVR 214
+ ++ W LL + + + +L R +
Sbjct: 157 AIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFML 216

Query: 215 AALRNLAGHL---------------------------RNPVLAKLFAVGFVLMGGFVTVY 247
L +N G ++ G
Sbjct: 217 FTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFV 276

Query: 248 NYLGYRLAARPFGLAPSVVGLLFLL--YLVGTGTSVVAGRLADRRGRPLVLGAALPIAVA 305
+ + Y + L+ + +G + + + + G L DRRG VL +
Sbjct: 277 SMVPYMM-KDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSV 335

Query: 306 GLL---LTVPATLAAIVAGVGVFTGGFFAAHTVASGWV-GAVAQRDRAEASALYLFSYYL 361
L + T + + GG TV S V ++ Q++ +L F+ +L
Sbjct: 336 SFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFL 395

Query: 362 GGSVAGAFGGVLYGVG-----------GWSATVCFVVVLLMAGAALVALLVRDNGFRIGR 410
A G L + S + ++LL +G +++ LV N ++ +
Sbjct: 396 SEGTGIAIVGGLLSIPLLDQRLLPMEVDQSTYLYSNLLLLFSGIIVISWLVTLNVYKHSQ 455

Query: 411 R 411
R
Sbjct: 456 R 456


39  MAP2520c  MAP2527c  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP2520c  -1      12       -3.177867  hypothetical protein
MAP2521c  -1      15       -3.127159  DeaD
MAP2522   -1      21       -4.073744  hypothetical protein
MAP2523c  0       23       -4.744543  hypothetical protein
MAP2524c  0       24       -4.483852  hypothetical protein
MAP2525c  -1      23       -4.832658  hypothetical protein
MAP2526c  0       23       -4.639867  hypothetical protein
MAP2527c  1       23       -5.473579  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP2526c  DHBDHDRGNASE  127  4e-38  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 127 bits (321), Expect = 4e-38
Identities = 79/251 (31%), Positives = 123/251 (49%), Gaps = 6/251 (2%)

Query: 3 RVAVVTGGGSGIGRAIVERLAHDRHRVAVLDVNEEAAEKVAARVAADGAHAIAVPTDVAE 62
++A +TG GIG A+ LA +A +D N E EKV + + A+ HA A P DV +
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 63 SASVAAAFESVRRALGPVQVLVTSAAITGFKPFGEITIEDWNRHLAVNLTGTFLCLQAAL 122
SA++ + R +GP+ +LV A + ++ E+W +VN TG F ++
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 123 PDMVEAGWGRVVTISSTAAQTGSPRQGHYSASKGGVIALTRTIALEYAVHGITANTVPPF 182
M++ G +VT+ S A Y++SK + T+ + LE A + I N V P
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 183 SVDTPMLRA--AQEAGNLPPVKYLAK----ASPVGRLGTGEDIAAACAFLCSDEAGYITG 236
S +T M + A E G +K + P+ +L DIA A FL S +AG+IT
Sbjct: 189 STETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHITM 248

Query: 237 QIIGVNGGAVI 247
+ V+GGA +
Sbjct: 249 HNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP2527c  HTHTETR  62  4e-14  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 62.3 bits (151), Expect = 4e-14
Identities = 20/88 (22%), Positives = 37/88 (42%)

Query: 7 PRRVGAETSQTRDALLEAVAQMMLEEGYASVTYRALAAKAGVTPSLVQYYFPSLDDIFVA 66
R+ E +TR +L+ ++ ++G +S + +A AGVT + ++F D+F
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 67 AIRRYSERNLQWLTEELQRRADDPLHAL 94
+ E + DPL L
Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVL 89


40  MAP2568c  MAP2575c  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP2568c  1       12       3.393950   hypothetical protein
MAP2569c  0       12       3.414180   glucosyl-3-phosphoglycerate synthase
MAP2570c  0       11       3.491297   FolP2
MAP2571c  1       11       3.971347   long-chain-acyl-CoA synthetase
MAP2572c  2       13       3.653247   hypothetical protein
MAP2573   1       12       3.512839   hypothetical protein
MAP2574c  2       10       2.117215   succinyl-diaminopimelate desuccinylase
MAP2575c  2       10       2.403014   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP2570c  PF07472  28  0.027  Fucose-binding lectin II
		>PF07472#Fucose-binding lectin II

Length = 245

Score = 28.5 bits (63), Expect = 0.027
Identities = 18/45 (40%), Positives = 23/45 (51%), Gaps = 2/45 (4%)

Query: 133 TWGGVDPALPEVAAEFGAGLVCSHTGNALPRTRPFRVSYGTSTRG 177
TW G LP AA+FG G V ++ A P+ P GT+T G
Sbjct: 87 TWAGAPGVLPGAAAKFGVGAVVNYFSKATPQPEP--TQPGTTTGG 129


41  MAP2671c  MAP2679c  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP2671c  2       11       -1.334291  glucose-6-phosphate 1-dehydrogenase
MAP2672   3       16       -1.182838  hypothetical protein
MAP2673   2       19       -0.993025  hypothetical protein
MAP2674c  4       18       -1.049443  hypothetical protein
MAP2675c  5       24       -2.560123  hypothetical protein
MAP2676c  5       27       -6.789753  hypothetical protein
MAP2677c  3       20       -6.351080  hypothetical protein
MAP2678c  3       16       -3.611815  hypothetical protein
MAP2679c  2       12       -3.145110  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP2675c  TONBPROTEIN  37  2e-05  Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 36.5 bits (84), Expect = 2e-05
Identities = 14/46 (30%), Positives = 16/46 (34%)

Query: 122 APPPPDAPPPPPDAPAPPPDAPPAPPEAPAVLIDAPAPVPPPPPGP 167
PP PPP P P+ P P + P P P P P
Sbjct: 55 LEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKP 100



Score = 31.9 bits (72), Expect = 8e-04
Identities = 18/61 (29%), Positives = 22/61 (36%), Gaps = 5/61 (8%)

Query: 107 FTHHMFLPMPPEDGGAPPPPDAPPPPPDAPAPPPDAPPAPPEAPAVLIDAPAPVPPPPPG 166
T + P PPP P P+ P AP V+I+ P P P P P
Sbjct: 47 VTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAP-----VVIEKPKPKPKPKPK 101

Query: 167 P 167
P
Sbjct: 102 P 102



Score = 26.9 bits (59), Expect = 0.038
Identities = 9/39 (23%), Positives = 9/39 (23%)

Query: 114 PMPPEDGGAPPPPDAPPPPPDAPAPPPDAPPAPPEAPAV 152
E P P PP P P P P
Sbjct: 66 EPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVK 104


42  MAP2753  MAP2770  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP2753   2       19       -1.966425  hypothetical protein
MAP2754   2       18       -2.281399  hypothetical protein
MAP2755   2       19       -2.269213  hypothetical protein
MAP2756c  2       20       -2.057718  hypothetical protein
MAP2757   2       35       -5.185849  hypothetical protein
MAP2758   2       35       -4.187582  hypothetical protein
MAP2759   1       36       -3.021321  hypothetical protein
MAP2760c  1       35       -3.467422  hypothetical protein
MAP2761c  2       37       -3.878988  hypothetical protein
MAP2762c  2       37       -4.799246  hypothetical protein
MAP2763c  2       38       -4.778763  hypothetical protein
MAP2764c  2       38       -4.878340  hypothetical protein
MAP2765c  2       33       -4.707592  hypothetical protein
MAP2766c  1       22       -3.271281  hypothetical protein
MAP2767c  0       16       -2.036885  hypothetical protein
MAP2768c  0       12       -1.140936  hypothetical protein
MAP2769c  1       10       0.203040   hypothetical protein
MAP2770   2       12       1.243649   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP2753  PF03544  31  0.005  Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 30.7 bits (69), Expect = 0.005
Identities = 18/75 (24%), Positives = 24/75 (32%), Gaps = 8/75 (10%)

Query: 100 PTAARPVTIEPA--AALPTVTAVQAPAAA------PPATVTVTPPPAPPATVTVQAAPAP 151
P A+P+++ A L AVQ P P + P AP + P P
Sbjct: 44 PAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKP 103

Query: 152 APTVVKPAPALPPPG 166
P VK
Sbjct: 104 KPKPVKKVEQPKRDV 118


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP2756c  CHANLCOLICIN  38  4e-04  Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 37.7 bits (87), Expect = 4e-04
Identities = 42/185 (22%), Positives = 66/185 (35%), Gaps = 6/185 (3%)

Query: 40 GLGGSLSKAFAAVDGTA----ARRDLLALQQEWRRAADVEADAAARMIRDQRRLAEATVK 95
G GGS S++ AA+ TA A+ +Q R A EA A A+ RD L +
Sbjct: 39 GKGGSKSESSAAIHATAKWSTAQLKKTQAEQAARAKAAAEAQAKAKANRDA--LTQRLKD 96

Query: 96 YGDDSSRTAAAQAMLARSQRDHIDAMIAAEAAHGRLAKAGNETADSVSRMQKLGANPIFN 155
+++ R A++ A +A + AE RLAKA + +K
Sbjct: 97 IVNEALRHNASRTPSATELAHANNAAMQAEDERLRLAKAEEKARKEAEAAEKAFQEAEQR 156

Query: 156 AAGIGSVAAMGIGLVSATDAAGNFQQSLQRLHTVAEESPANLKAISDGVLKLSSVVGYAP 215
I A + +A +L E + L A V+K+ +
Sbjct: 157 RKEIEREKAETERQLKLAEAEEKRLAALSEEAKAVEIAQKKLSAAQSEVVKMDGEIKTLN 216

Query: 216 QKLMD 220
+L
Sbjct: 217 SRLSS 221


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP2764c  PERTACTIN  27  0.030  Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 27.4 bits (60), Expect = 0.030
Identities = 17/59 (28%), Positives = 22/59 (37%)

Query: 17 PTRQTPAARRRSGPAPHPTARPTQKATPHPPQPERHTMTTDNPTPSDDQALAALYATAL 75
P PA + P P P P P PPQP + P P + L+A A+
Sbjct: 568 PPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAGRELSAAANAAV 626


43  MAP2839c  MAP2853c  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP2839c  1       16       4.781981   hypothetical protein
MAP2840c  1       14       5.346953   diaminopimelate epimerase
MAP2841c  1       13       4.560789   tRNA delta(2)-isopentenylpyrophosphate
MAP2842c  2       15       4.334492   hypothetical protein
MAP2843c  0       12       2.180841   hypothetical protein
MAP2844   1       13       2.683464   hypothetical protein
MAP2845c  0       12       1.491544   hypothetical protein
MAP2846c  -1      13       2.753631   (dimethylallyl)adenosine tRNA
MAP2847c  0       11       2.335549   recombination regulator RecX
MAP2848c  -1      7        1.681745   recombinase A
MAP2849   0       11       4.243514   hypothetical protein
MAP2850c  1       11       2.836601   hypothetical protein
MAP2851c  1       12       3.087917   hypothetical protein
MAP2852   3       13       0.548531   hypothetical protein
MAP2853c  3       12       0.113733   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP2844  PERTACTIN  29  0.035  Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 29.3 bits (65), Expect = 0.035
Identities = 16/37 (43%), Positives = 19/37 (51%)

Query: 35 ARPAPRPGPRPGPRPVSAGRPAAHPVVVPPPPSDPHR 71
A PAP+P P+PGP+P P P PP P R
Sbjct: 567 APPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQR 603


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP2846c  ACRIFLAVINRP  30  0.035  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 29.8 bits (67), Expect = 0.035
Identities = 27/106 (25%), Positives = 39/106 (36%), Gaps = 23/106 (21%)

Query: 150 AQVEIAEALQQFPSSLPSARESAYAAWVSISVGCNNSCTFCIVPSLRGKEVDRSPDDILA 209
AQV++ LQ LP + ISV +S ++ +V + DDI
Sbjct: 103 AQVQVQNKLQLATPLLPQEVQQ-----QGISVE-KSSSSYLMVAGFVSDNPGTTQDDISD 156

Query: 210 EVRSLVAD------GVLEVTLLG-----------QNVNAYGVSFAD 238
V S V D GV +V L G +N Y ++ D
Sbjct: 157 YVASNVKDTLSRLNGVGDVQLFGAQYAMRIWLDADLLNKYKLTPVD 202


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP2849  2FE2SRDCTASE  33  4e-04  Ferric iron reductase signature.
		>2FE2SRDCTASE#Ferric iron reductase signature.

Length = 262

Score = 33.5 bits (76), Expect = 4e-04
Identities = 12/22 (54%), Positives = 13/22 (59%)

Query: 196 RASCCLFYRLPGGSVCGDCVLD 217
R +CC YRLP CGDC L
Sbjct: 241 RRTCCQRYRLPDVQQCGDCTLK 262


44  MAP2944c  MAP2957  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP2944c  2       9        -0.358348  hypothetical protein
MAP2945c  1       11       -0.418515  ribosome recycling factor
MAP2946c  0       10       -0.263891  uridylate kinase
MAP2947   1       11       -0.076925  hypothetical protein
MAP2948   1       10       0.126756   hypothetical protein
MAP2949c  0       11       0.944130   hypothetical protein
MAP2950c  2       12       0.840945   hypothetical protein
MAP2951   4       11       0.170185   hypothetical protein
MAP2952c  4       13       1.542605   hypothetical protein
MAP2953   2       13       2.433343   hypothetical protein
MAP2954c  0       15       2.486521   amidase
MAP2955c  1       16       2.596242   elongation factor Ts
MAP2956c  0       14       4.345054   30S ribosomal protein S2
MAP2957   2       15       5.473088   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP2947  IGASERPTASE  29  0.006  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 29.3 bits (65), Expect = 0.006
Identities = 16/84 (19%), Positives = 26/84 (30%), Gaps = 4/84 (4%)

Query: 69 RRNGSMAEENKSGPAEAVKGVVEDVKGKAKEAVGAVAGRDDLTREGQAQQDKAEAQRDAA 128
++ E+N+ E + AKEA V Q+ + E Q
Sbjct: 1045 KQESKTVEKNEQDATETTA----QNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTET 1100

Query: 129 KKEAEAEAARGGAEAAEERQRANQ 152
K+ A E E+ Q +
Sbjct: 1101 KETATVEKEEKAKVETEKTQEVPK 1124


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP2951  DHBDHDRGNASE  120  9e-35  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 120 bits (301), Expect = 9e-35
Identities = 76/251 (30%), Positives = 118/251 (47%), Gaps = 13/251 (5%)

Query: 36 GKKAIITGGDSGIGRAVAIAYAREGADVLIAYLNEDDDARDVARHVTDAGRKCVLVPGDL 95
GK A ITG GIG AVA A +GA + N + + V + R P D+
Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEK-VVSSLKAEARHAEAFPADV 66

Query: 96 SDPAHCRAVVDRAVRELGGVDILVNNAAYQMMHKNLDEISDEEWDYTFRLNVGAYFYLTK 155
D A + R RE+G +DILVN A + +SDEEW+ TF +N F ++
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGL-IHSLSDEEWEATFSVNSTGVFNASR 125

Query: 156 AALPHLRA--GSSIIGSSSVNSDTPNPTLAPYAATKAAIANFSASLAQLLGDKGIRVNSV 213
+ ++ SI+ S + P ++A YA++KAA F+ L L + IR N V
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 214 APGPIWTPLIPSTMPPD---------SVESFGDNVPLGRAGQPAELAPIYVLLASDEASY 264
+PG T + S + S+E+F +PL + +P+++A + L S +A +
Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGH 245

Query: 265 ISGARVAVTGG 275
I+ + V GG
Sbjct: 246 ITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP2957  OMADHESIN  29  0.009  Yersinia outer membrane adhesin signature.
		>OMADHESIN#Yersinia outer membrane adhesin signature.

Length = 455

Score = 29.1 bits (64), Expect = 0.009
Identities = 19/55 (34%), Positives = 24/55 (43%), Gaps = 1/55 (1%)

Query: 17 PAAAGDLHLEWPLRPPPVVVRAFEAPAQDWHPGHRGVDLAGTAGQPVYAAGAGTV 71
P A L LE+P+RPP A A+ H G G V A GAG++
Sbjct: 41 PNADPALGLEYPVRPPVPGAGGLNASAKGIHSIAIGATAEAAKGAAV-AVGAGSI 94


45  MAP2985  MAP2990c  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP2985   2       9        2.455981   hypothetical protein
MAP2986c  2       9        2.325205   PII uridylyl-transferase
MAP2987c  3       10       2.129591   GlnB
MAP2988c  2       11       2.400066   Amt_2
MAP2989c  3       12       3.166426   hypothetical protein
MAP2990c  3       11       2.859501   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP2988c  RTXTOXINA  32  0.006  Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 32.2 bits (73), Expect = 0.006
Identities = 39/167 (23%), Positives = 67/167 (40%), Gaps = 24/167 (14%)

Query: 248 NAGSATSSNGAAGSTFMTTTIATAT-AMLAWMLTERIRDGKATTLGAASGIVAGLVAITP 306
NA + T + AAG T + + +++ +R G +T+ AA+G++A V +
Sbjct: 260 NADADTRTKAAAGVELTTKVLGNVGKGISQYIIAQRAAQGLSTS-AAAAGLIASAVTL-- 316

Query: 307 SCSSVNVLGALVVGLVAGVVCALAVGLKFKLGFD-DSL-------------DVVGVHLV- 351
+ S ++ L A + + K KLG+D DSL + + V
Sbjct: 317 AISPLSFLSIADKFKRANKIEEYSQRFK-KLGYDGDSLLAAFHKETGAIDASLTTISTVL 375

Query: 352 ----GGLAGTLLVGLLAAPESPAISGVTGVSKGLFYGGGWAQLERQA 394
G++ L+ AP S + VTG+ G+ A E A
Sbjct: 376 ASVSSGISAAATTSLVGAPVSALVGAVTGIISGILEASKQAMFEHVA 422


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP2989c  IGASERPTASE  37  2e-04  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 36.6 bits (84), Expect = 2e-04
Identities = 32/194 (16%), Positives = 71/194 (36%), Gaps = 20/194 (10%)

Query: 95 TISEVELPEPETPAAPAAEAPAPEAPESPEAPAPEAPAPE-IEEIAPTEGRLERLRGRLA 153
T + ++ P P+ A EAP P APA + E + E + E + +
Sbjct: 999 TPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQ-- 1056

Query: 154 RSQNALGRSLLGLIGGGDLDEDAWQDVEDTLLVADLGPVVTESVIAQLRGRLASSDVRTE 213
+ ++ ++ ++A +V+ ++ +E+ Q ++ V E
Sbjct: 1057 DATETTAQN-------REVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKE 1109

Query: 214 ADAKAVLRDV-----LINELRPDLDRSIRALPHAD-----HPSVLLIVGVNGTGKTTTVG 263
AK + +++ P ++S P A+ P+V + + T T
Sbjct: 1110 EKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTE 1169

Query: 264 KLARVLVADGRRVV 277
+ A+ ++ + V
Sbjct: 1170 QPAKETSSNVEQPV 1183


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP2990c  GPOSANCHOR  43  6e-06  Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 43.1 bits (101), Expect = 6e-06
Identities = 37/231 (16%), Positives = 81/231 (35%), Gaps = 1/231 (0%)

Query: 306 SDELAAHEKALGELSGRAESVQQTWFALSALAERVAATVRIASERAQHLDLEPVTTGDTD 365
D + L + ++ ++ + + A + E+A + T
Sbjct: 84 KDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEAR-KADLEKALEGAMNFSTADSAK 142

Query: 366 PDALEAEAERVAAAEQQLLAELATARSRLETARAELAEREREAAEADRAHMAAVRAEADR 425
LEAE +AA + L L A + A++ E E A + +A
Sbjct: 143 IKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGA 202

Query: 426 REGLARLAGQVETMRARVESIDDSVARLSERIEAAAARAQQAKAEFETVQGRVGELDQGE 485
+ +++T+ A ++ A L + +E A + A+ +T++ L+ +
Sbjct: 203 MNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQ 262

Query: 486 VGLDEHHERTVAALRLADERVAELQAAERDAERKVASLRARIDALAVGLER 536
L++ E + ++ L+A + E + A L + L +
Sbjct: 263 AELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQS 313



Score = 36.2 bits (83), Expect = 8e-04
Identities = 60/320 (18%), Positives = 112/320 (35%), Gaps = 19/320 (5%)

Query: 210 HRKRKEKALRKLDAMSANLARLTDLTTELRRQLKPLGRQAEVARRAQTIQADLRDARLRL 269
K+ + A A L +L + L+ A A+ + A L
Sbjct: 132 AMNFSTADSAKIKTLEAEKAALAARKADLEKALEGA-MNFSTADSAKIKTLEAEKAALEA 190

Query: 270 AADDLVNRRGEREAIFEAEAAMRREHDEASARLAVASDELAAHEKALGELSGRAESVQQT 329
+L A++A + + A LA +L + S + +T
Sbjct: 191 RQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKT 250

Query: 330 WFALSALAERVAATVRIASERAQHLDLEPVTTGDT---DPDALEAEAERVAAAEQQLLAE 386
A A E A + A E A + T + ALEAE + Q L A
Sbjct: 251 LEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNAN 310

Query: 387 LATARSRLETARAELAEREREAAEADRAHMAAVRAEADRREGL-------ARLAGQVETM 439
+ R L+ +R + E E + + + + + R L +L + + +
Sbjct: 311 RQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKL 370

Query: 440 RARVESIDDSVARLSERIEAAAARAQQAKAEFETVQGRVGELDQGEVGLDEHHERTVAAL 499
+ + + S L ++A+ +Q + E ++ L++ L+E +
Sbjct: 371 EEQNKISEASRQSLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEE-------SK 423

Query: 500 RLADERVAELQAAERDAERK 519
+L ++ AELQ A+ +AE K
Sbjct: 424 KLTEKEKAELQ-AKLEAEAK 442



Score = 35.8 bits (82), Expect = 0.001
Identities = 50/381 (13%), Positives = 102/381 (26%), Gaps = 54/381 (14%)

Query: 702 KAGAELAAAEAQVAQLSAALSGALAEQAARQDSAEQALAALNESDSAISGMYEQLGRLGQ 761
+ + + ++ E + + E+L +
Sbjct: 40 VSAVATRSQTDTLEKVQERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKE 99

Query: 762 EARTSEDEWSRLLRQREELEAGRTQTVAEVTELENRLRNAQETPQEPAAEPVNRQQIAAA 821
+ R ++ S + +ELEA + + N + AE A
Sbjct: 100 KLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKAD 159

Query: 822 TDAARSAEVEARLAVRTAEERANAVRGRADSLRRAAAAEREARVRAQQAREARLRAAAVA 881
+ A + A + A + ++ + E + A A+++
Sbjct: 160 LEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEA- 218

Query: 882 AAVADSGRLLATRLNAVVAAASRIRDALAAERQQRATAMAAVRDEVNALSARVAALTDSL 941
L A A + + + + + E AL AR A L +L
Sbjct: 219 ---------EKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKAL 269

Query: 942 HSDEVANAQAALRIEQLEQMVLEQFGMAPADLIAEYGPHIALPPSELEMAEYEQAKERGE 1001
+ + +I+ LE AE +
Sbjct: 270 EGAMNFSTADSAKIKTLE-------------------------------AEKAALEAEKA 298

Query: 1002 QVFAPAPIPFDRPTQERRAKRAERELAELGRVNPLALEEFAALEERYNFLSTQLEDVKAA 1061
+ + + A R+ L R + E LE + L Q + +A+
Sbjct: 299 DL-----------EHQSQVLNANRQ--SLRRDLDASREAKKQLEAEHQKLEEQNKISEAS 345

Query: 1062 RKDLLGVVDEVDARILQVFSE 1082
R+ L +D Q+ +E
Sbjct: 346 RQSLRRDLDASREAKKQLEAE 366



Score = 32.3 bits (73), Expect = 0.014
Identities = 32/170 (18%), Positives = 63/170 (37%), Gaps = 10/170 (5%)

Query: 695 EVTSEIDKAGAELAAAEAQVAQLSAALSGALAEQAARQDSAEQALAALNESDSAISGMYE 754
+ A++ EA+ A L+A + + + A + ++ + +
Sbjct: 201 GAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEA 260

Query: 755 QLGRLGQEARTSEDEWSRLLRQREELEAGRTQTVAEVTELENRLRNAQETPQEPAAEPVN 814
+ L + + + + + + LEA + AE +LE+ + N
Sbjct: 261 RQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEH----------QSQVLNAN 310

Query: 815 RQQIAAATDAARSAEVEARLAVRTAEERANAVRGRADSLRRAAAAEREAR 864
RQ + DA+R A+ + + EE+ SLRR A REA+
Sbjct: 311 RQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAK 360


46  MAP3085c  MAP3101c  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP3085c  2       15       2.800898   hypothetical protein
MAP3086c  0       13       0.559387   hypothetical protein
MAP3087c  0       12       1.350784   enoyl-CoA hydratase
MAP3088c  1       13       0.399957   hypothetical protein
MAP3089c  1       13       -0.037361  hypothetical protein
MAP3090c  1       13       -1.388102  SerB2
MAP3091c  0       12       -1.897169  CtaD
MAP3092   2       14       -0.237689  hypothetical protein
MAP3093   2       13       -2.270390  AdhC
MAP3094c  1       15       -2.203808  hypothetical protein
MAP3095c  3       14       -4.233493  ribonucleotide-diphosphate reductase subunit
MAP3096   2       13       -4.041127  hypothetical protein
MAP3097   2       13       -4.530240  hypothetical protein
MAP3098c  2       13       -5.384462  hypothetical protein
MAP3099c  3       15       -5.553918  hypothetical protein
MAP3100c  3       12       -4.870139  ribonucleotide-diphosphate reductase subunit
MAP3101c  2       13       -1.307276  ribonucleotide reductase stimulatory protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP3092  FERRIBNDNGPP  54  2e-10  Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 54.2 bits (130), Expect = 2e-10
Identities = 61/274 (22%), Positives = 93/274 (33%), Gaps = 36/274 (13%)

Query: 91 SADPQRIVVLAGDQLDALCALGLQSRVVGAALPDGASGQPAYLGGAV-----RGVPGVGS 145
+ DP RIV L ++ L ALG+ P G + Y V VG
Sbjct: 32 AIDPNRIVALEWLPVELLLALGIV--------PYGVADTINYRLWVSEPPLPDSVIDVGL 83

Query: 146 RSHPDVKAIAAAHPDLILGSQGLTPALYPQLAAIAPT-VFTAA----PGAAWRDNLRAVG 200
R+ P+++ + P ++ S G P+ LA IAP F + P A R +L +
Sbjct: 84 RTEPNLELLTEMKPSFMVWSAGYGPS-PEMLARIAPGRGFNFSDGKQPLAMARKSLTEMA 142

Query: 201 AATARAGAVDGLLS---GFSQRAGDVGARHDASHFQASIVQLTTG-SIRVFGANNFPASV 256
A + L+ F + + A + L + VFG N+ +
Sbjct: 143 DLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPL--LLTTLIDPRHMLVFGPNSLFQEI 200

Query: 257 LGAVGVDRPAAQRFTDKPYLEIGATDADLAKNPDLSVADADVVYLSCATPAAADRAATVL 316
L G+ A Q T+ + D LA D DV+ D
Sbjct: 201 LDEYGI-PNAWQGETNFWGSTAVSID-RLAA-----YKDVDVLCFDHDNSKDMDALMA-- 251

Query: 317 DSGPWRKLSANRDNRVYVVNDEIWQTGQGLIAAR 350
+ W+ + R R V +W G L A
Sbjct: 252 -TPLWQAMPFVRAGRFQRVPA-VWFYGATLSAMH 283


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP3094c  PF06057  28  0.014  Type IV secretory pathway VirJ component
		>PF06057#Type IV secretory pathway VirJ component

Length = 243

Score = 27.9 bits (62), Expect = 0.014
Identities = 11/41 (26%), Positives = 17/41 (41%), Gaps = 5/41 (12%)

Query: 71 NRDEIAEFISGMTHYDAGPENIIR-VAARLAAAGWPLAGID 110
+ + F+SG D G + + V L GWP+ G
Sbjct: 49 TKPPLVIFLSG----DGGWATLDKAVGGILQQQGWPVVGWS 85


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP3097  HTHTETR  48  2e-09  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 48.1 bits (114), Expect = 2e-09
Identities = 31/192 (16%), Positives = 57/192 (29%), Gaps = 18/192 (9%)

Query: 7 RQRRRELLDALIAEFAAGGIGDRSLRRVAEAVGTSHRMLLHHFGSREGLLLAIVEEVERR 66
++ R+ +LD + F+ G+ SL +A+A G + + HF + L I E E
Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESN 69

Query: 67 QMRVLTELPRAPAEGFAAMWAD-----LRRPELREFERLFFECYSR---AAQGEKPFARM 118
+ E ++ + L E RL E +
Sbjct: 70 IGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQA 129

Query: 119 LPDAVDDWLR---------QAETHSGAPFDPAMAR-LGLAVIRGLLLDLVATGDEAGVDA 168
+ + A A + I GL+ + + +
Sbjct: 130 QRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKK 189

Query: 169 AARAFVNLLNAG 180
AR +V +L
Sbjct: 190 EARDYVAILLEM 201


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP3099c  HTHTETR  57  5e-12  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 56.9 bits (137), Expect = 5e-12
Identities = 41/195 (21%), Positives = 62/195 (31%), Gaps = 7/195 (3%)

Query: 23 RWREHRKKVRNEIVDAAFRAIDRLGPE-LSVREIAEEAGTAKPKIYRHFHDKSDLFQAIG 81
+ ++ ++ R I+D A R + G S+ EIA+ AG + IY HF DKSDLF I
Sbjct: 4 KTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIW 63

Query: 82 ERLRDMLWTAIFPSIDLKTDSAREVIRRSVEEYVTLVDKHPNVLRVF-IQGRSTGTPQST 140
E + V+R + + + I
Sbjct: 64 ELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEM 123

Query: 141 VTILNEGREITLAMADLFDNELREM----ELD-HAAVELAAHAAFGSAASATEWWLGPEP 195
+ R + L D + L+ L AA G + E WL
Sbjct: 124 AVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQ 183

Query: 196 DSPRLMSRAQFVAHL 210
+VA L
Sbjct: 184 SFDLKKEARDYVAIL 198


47  MAP3162c  MAP3170c  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP3162c  0       23       -3.468773  hypothetical protein
MAP3163   1       19       -2.737793  hypothetical protein
MAP3164   0       17       -2.942054  hypothetical protein
MAP3165   -1      19       -3.871728  hypothetical protein
MAP3166c  -2      15       -4.658936  hypothetical protein
MAP3167c  -1      13       -4.673242  hypothetical protein
MAP3168c  -1      11       -2.915509  hypothetical protein
MAP3169c  0       12       -3.260541  hypothetical protein
MAP3170c  -1      9        -3.941186  SsrA-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP3166c  PF03895  30  0.002  Serum resistance protein DsrA.
		>PF03895#Serum resistance protein DsrA.

Length = 79

Score = 29.8 bits (67), Expect = 0.002
Identities = 9/35 (25%), Positives = 17/35 (48%)

Query: 32 LARQTAEGISGDMLEIGTYQGKSAILMGYGLRDDE 66
L + G + +G Y+ K+A+ +G G R +
Sbjct: 18 LVQPNGVGKTSVSAAVGGYRDKTALAIGVGSRITD 52


48  MAP3245  MAP3250  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP3245   0       9        3.230551   hypothetical protein
MAP3246   0       11       3.758534   hypothetical protein
MAP3247c  1       9        4.441558   hypothetical protein
MAP3248   0       9        4.204150   hypothetical protein
MAP3249   1       11       4.582507   hypothetical protein
MAP3250   0       9        3.502082   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP3248  NUCEPIMERASE  162  3e-49  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 162 bits (411), Expect = 3e-49
Identities = 74/335 (22%), Positives = 123/335 (36%), Gaps = 38/335 (11%)

Query: 31 RVLITGGAGFLGAHLCARLLDDGVEVVSVDDLSTS-GPAVRFG-----DRPGYRFVQRDV 84
+ L+TG AGF+G H+ RLL+ G +VV +D+L+ +++ +PG++F + D+
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 85 CEPGLIDEV--GSGFDAVFHLASAASPVDYQ-RRPIQTLCTGSAGTATALEIAERAG-AR 140
+ + ++ F+ VF + V Y P + G LE
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLA-VRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 141 FVLASTSEVYGDPESHPQRESYWGNVNPAGPRSVYDEAKRFAEALTFAYHRLGRADVGAA 200
+ AS+S VYG P + P S+Y K+ E + Y L
Sbjct: 121 LLYASSSSVYGLNRKMPFSTDD----SVDHPVSLYAATKKANELMAHTYSHLYGLPATGL 176

Query: 201 RIFNTYGPGMRADDGRMVP-TFCLQALRGDPLTVSGTGLQTRSLCYVDDTITGLIALAHS 259
R F YGP R D M F L G + V G R Y+DD +I L
Sbjct: 177 RFFTVYGPWGRPD---MALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDV 233

Query: 260 DFAGP-------------------VNIGNPTELTVLSAAELIRELAGSTSTIQFTPPAAD 300
NIGN + + ++ + + + G + P
Sbjct: 234 IPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQPG 293

Query: 301 DPQRRCPDIRLARKRLGWRPRVDYRTGLSTTLAWF 335
D D + + +G+ P + G+ + W+
Sbjct: 294 DVLETSADTKALYEVIGFTPETTVKDGVKNFVNWY 328


49  MAP3261c  MAP3273c  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP3261c  2       10       -0.320303  hypothetical protein
MAP3262c  3       11       -0.555825  GlgX_2
MAP3263c  0       9        0.275663   hypothetical protein
MAP3264c  0       10       -0.328334  hypothetical protein
MAP3265   0       10       0.566820   hypothetical protein
MAP3266c  2       11       0.423220   hypothetical protein
MAP3267c  3       9        0.052423   hypothetical protein
MAP3268   3       8        0.510552   Hsp18_3
MAP3269   2       7        0.634094   hypothetical protein
MAP3270c  1       9        1.306549   hypothetical protein
MAP3271c  1       10       0.919421   hypothetical protein
MAP3272   1       10       0.833706   hypothetical protein
MAP3273c  2       12       1.633564   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP3261c  DHBDHDRGNASE  51  1e-09  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 50.8 bits (121), Expect = 1e-09
Identities = 41/178 (23%), Positives = 69/178 (38%), Gaps = 12/178 (6%)

Query: 18 VVGASSGLGRCIGVGLAQRGDRVA---LLARRRQRIEAAAKDAGPGAVAIECDVTDQASC 74
+ GA+ G+G + LA +G +A + +++ ++ K A A DV D A+
Sbjct: 13 ITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSAAI 72

Query: 75 ASAIGEAADALGGIDNIVYTPAV---GPLVRMVDTDADTWRRIFDTNVIGA-SLVTAAAV 130
+G ID +V V G + + D + W F N G + + +
Sbjct: 73 DEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEE---WEATFSVNSTGVFNASRSVSK 129

Query: 131 PHLSASAGKAVYLSSDAGAFGPPWPGLGAYGVSKAALERLVEAWRAEHPDIGFTCLIV 188
+ +G V + S+ G P + AY SKAA + E + C IV
Sbjct: 130 YMMDRRSGSIVTVGSNPA--GVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP3270c  PF06580  34  0.002  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 33.7 bits (77), Expect = 0.002
Identities = 15/79 (18%), Positives = 28/79 (35%), Gaps = 11/79 (13%)

Query: 473 VSNAVRH-----SGASRLTVQI-GVADMFTLDVIDNGRGIASGNTRRS--GLANMTRRAE 524
V N ++H ++ ++ TL+V + G + GL N+ R +
Sbjct: 264 VENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQ 323

Query: 525 QLGG---SCEISSPPGGGT 540
L G ++S G
Sbjct: 324 MLYGTEAQIKLSEKQGKVN 342


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP3271c  HTHFIS  71  2e-16  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 70.6 bits (173), Expect = 2e-16
Identities = 25/102 (24%), Positives = 42/102 (41%), Gaps = 2/102 (1%)

Query: 3 KVFLVDDHEVVRRGLCDLLSSDPDLQIVGEAGTVAEAKARIPAARPDVAVLDVRLPDGNG 62
+ + DD +R L LS + A I A D+ V DV +PD N
Sbjct: 5 TILVADDDAAIRTVLNQALS-RAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 63 IELCRDLLSEHPDLRCLMLTSFTSDEAMLEAILAGASGFVIK 104
+L + PDL L++++ + ++A GA ++ K
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPK 104


50  MAP3296c  MAP3311c  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP3296c  0       8        4.160995   hypothetical protein
MAP3297c  0       8        3.921425   hypothetical protein
MAP3298   0       8        4.030621   hypothetical protein
MAP3299c  0       8        4.262671   hypothetical protein
MAP3300c  1       8        4.751507   hypothetical protein
MAP3301c  0       10       4.091771   hypothetical protein
MAP3302   -2      11       1.411923   hypothetical protein
MAP3303   -1      11       1.294407   hypothetical protein
MAP3304   0       11       0.916501   hypothetical protein
MAP3305c  0       10       0.902509   hypothetical protein
MAP3306c  2       11       0.599978   molybdopterin biosynthesis-like protein MoeZ
MAP3307c  0       12       0.938815   hypothetical protein
MAP3308   2       12       2.001600   hypothetical protein
MAP3309c  4       12       2.379855   hypothetical protein
MAP3310   4       11       2.326575   hypothetical protein
MAP3311c  2       10       2.450029   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP3308  HTHTETR  78  7e-20  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 78.1 bits (192), Expect = 7e-20
Identities = 33/171 (19%), Positives = 61/171 (35%), Gaps = 15/171 (8%)

Query: 31 RGNRLPRDERRGQLLVVASDVFVDRGYHAAGMDEIADRAGVSKPVLYQHFTSKLELYLAV 90
R + E R +L VA +F +G + + EIA AGV++ +Y HF K +L+ +
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 91 LARHVENLVSGVQQALS-TTKDNRRRLHAAVQAFFDFIEHD--SQGYRLIFENDYVTEPE 147
N+ + + D L + + + + I + E
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 148 VAAQLRVATESCIDAVYALISE-----------DSGLDPHRARMIAVGLVG 187
+A + C++ Y I + + L RA +I G +
Sbjct: 123 MAVVQQAQRNLCLE-SYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYIS 172


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP3309c  ACRIFLAVINRP  27     0.009    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 27.1 bits (60), Expect = 0.009
Identities = 16/67 (23%), Positives = 27/67 (40%), Gaps = 13/67 (19%)

Query: 14 LVLTSTQTPDEVEELVSAALSDGSDLLRLSDERGRRFLVHTSKIAYVEIGIADVRRVGFG 73
+ T + P+E ++ SDGS ++RL D +A VE+G + +
Sbjct: 235 IAQTRFKNPEEFGKVTLRVNSDGS-VVRLKD------------VARVELGGENYNVIARI 281

Query: 74 VGAEVVG 80
G G
Sbjct: 282 NGKPAAG 288


51  MAP3374  MAP3379c  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP3374   1       14       3.503295   LPPG:FO 2-phospho-L-lactate transferase
MAP3375   1       16       4.435063   F420-0--gamma-glutamyl ligase
MAP3376   1       15       3.333322   hypothetical protein
MAP3377   2       14       3.197142   hypothetical protein
MAP3378c  1       14       3.094828   RmlA2
MAP3379c  2       15       2.717636   WbbL
52  MAP3410c  MAP3426c  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP3410c  -2      12       3.265516   L-lysine aminotransferase
MAP3411c  -2      12       3.327484   hypothetical protein
MAP3412   -2      12       3.497947   hypothetical protein
MAP3413   -1      11       3.202181   AldB
MAP3414   -1      11       3.100779   hypothetical protein
MAP3415   -1      11       3.142616   hypothetical protein
MAP3416   -1      11       1.403965   hypothetical protein
MAP3417c  0       10       1.439117   hypothetical protein
MAP3418   1       11       0.952540   hypothetical protein
MAP3419c  1       10       1.445304   hypothetical protein
MAP3420c  1       12       1.472218   hypothetical protein
MAP3421c  2       13       1.898397   hypothetical protein
MAP3422c  2       15       2.015776   hypothetical protein
MAP3423c  2       14       2.063679   GlpD2
MAP3424c  2       15       3.113056   flavoprotein disulfide reductase
MAP3425   1       13       3.562738   hypothetical protein
MAP3426c  0       12       3.625982   AmiA
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP3414   HTHTETR       58     9e-13    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 58.5 bits (141), Expect = 9e-13
Identities = 18/103 (17%), Positives = 40/103 (38%), Gaps = 3/103 (2%)

Query: 5 RRRLSPEDRRAELLALGAEVFGKRPYDEVRIDEIAERAGVSRALMYHYFPDKRAFFAAVV 64
+ + ++ R +L + +F ++ + EIA+ AGV+R +Y +F DK F+ +
Sbjct: 4 KTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIW 63

Query: 65 KDEADRL--YENTNMDDVTGLTMYEEIRVGVLAYMAYHEQNPE 105
+ + E G +R ++ +
Sbjct: 64 ELSESNIGELELEYQAKFPG-DPLSVLREILIHVLESTVTEER 105


53  MAP3486  MAP3502  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP3486   2       11       2.597576   hypothetical protein
MAP3487c  1       8        2.429437   hypothetical protein
MAP3488c  0       7        1.502877   hypothetical protein
MAP3489c  -1      8        1.937219   GMP synthase
MAP3490   0       9        1.860972   hypothetical protein
MAP3491   1       11       1.546736   hypothetical protein
MAP3492   0       10       1.936885   hypothetical protein
MAP3493   0       9        4.019749   hypothetical protein
MAP3494   0       9        4.116770   hypothetical protein
MAP3495c  0       9        3.673456   hypothetical protein
MAP3496   -1      10       3.969324   hypothetical protein
MAP3497   -1      9        3.460431   acyl-CoA synthetase
MAP3498c  0       9        3.139406   CtpI
MAP3499c  0       10       -0.479954  hypothetical protein
MAP3500c  0       9        0.136225   hypothetical protein
MAP3501   1       7        0.764009   hypothetical protein
MAP3502   2       6        0.759932   AdhD
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP3490   PF03544       36     2e-04    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 36.5 bits (84), Expect = 2e-04
Identities = 19/130 (14%), Positives = 29/130 (22%)

Query: 328 QPPLPAPVAEPATPSAAAPTGLPATAGATPIAASAAASGPAPAPTPAPTAATVSSPAPPA 387
QP VA A P P AP P P
Sbjct: 48 QPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKP 107

Query: 388 PPAPGAAPFAPPYAVPPPGAGFGSKARASVDTRAKSKSPQPDSNAVGAGAAVREAAHARR 447
P + F + A A + + + +V +G +
Sbjct: 108 VKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRNQPQY 167

Query: 448 RQRSRRRGDE 457
R++ E
Sbjct: 168 PARAQALRIE 177


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP3496   DHBDHDRGNASE  90     3e-23    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 90.1 bits (223), Expect = 3e-23
Identities = 60/193 (31%), Positives = 94/193 (48%), Gaps = 3/193 (1%)

Query: 46 AVGGRTVLVTGASYGIGEATARRLAAAGATVLVVARSEERLGELTAAINAGGGRAVAYPT 105
+ G+ +TGA+ GIGEA AR LA+ GA + V + E+L ++ +++ A A A+P
Sbjct: 5 GIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPA 64

Query: 106 DLTDESAVSALTKQITEEHGPLDVVVSNAGKSLRRSLHHQYDRPHDFQRTIDVNYLGPVR 165
D+ D +A+ +T +I E GP+D++V+ AG LR L H +++ T VN G
Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAG-VLRPGLIHSLSD-EEWEATFSVNSTGVFN 122

Query: 166 LLLGLLPAMRDNGRGHVVNVSSVGVRVVPGPQWGAYQASKGAFDRWLRSVAPELHADGVH 225
+ M D G +V V S VP AY +SK A + + + EL +
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAG-VPRTSMAAYASSKAAAVMFTKCLGLELAEYNIR 181

Query: 226 VSTVYFALVRTRM 238
+ V T M
Sbjct: 182 CNIVSPGSTETDM 194


54  MAP3810c  MAP3822  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP3810c  0       25       -3.503699  hypothetical protein
MAP3811   -1      25       -3.973686  hypothetical protein
MAP3812c  0       29       -5.852142  hypothetical protein
MAP3813   2       36       -7.396997  hypothetical protein
MAP3814c  2       33       -6.543403  hypothetical protein
MAP3815   1       33       -7.518834  hypothetical protein
MAP3816   1       25       -2.413762  hypothetical protein
MAP3817c  1       24       -0.946431  hypothetical protein
MAP3818   1       20       0.427192   hypothetical protein
MAP3819   0       13       3.107391   *hypothetical protein
MAP3820   0       11       3.032653   deoxycytidine triphosphate deaminase
MAP3821   0       11       3.653917   hypothetical protein
MAP3822   0       11       3.085210   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP3810c  SECBCHAPRONE  28     0.011    Bacterial protein-transport SecB chaperone protein ...
		>SECBCHAPRONE#Bacterial protein-transport SecB chaperone protein signature.

Length = 170

Score = 27.9 bits (62), Expect = 0.011
Identities = 7/34 (20%), Positives = 17/34 (50%)

Query: 52 LTATVTAGPARADQAAFLNDLHNAGIHAVNGGDD 85
L +V + AF+ ++ AG+ ++G ++
Sbjct: 69 LNISVETTMESSGDVAFICEVKQAGVFTISGLEE 102


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP3821   TONBPROTEIN   45     5e-07    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 44.6 bits (105), Expect = 5e-07
Identities = 29/121 (23%), Positives = 42/121 (34%), Gaps = 3/121 (2%)

Query: 371 ISVPEPVAAPKPLSLPVAAPLPAAPPPAAPPLPEAPPIPAAPPVVPVPVVVPPVPVPVPV 430
V E A +P+S+ + P PP A P PE P P P+ PP PV +
Sbjct: 33 HQVIELPAPAQPISVTMVTPADLEPPQAVQPPPEPVVEPEPEPE---PIPEPPKEAPVVI 89

Query: 431 RIPVPDPVSPPQLLAPPRLSVPQPVQPPVRVPQPPSPPQVGGTVPQSPPQHQTPPGEGTP 490
P P P P+ + + + V+P P P + S T +
Sbjct: 90 EKPKPKPKPKPKPVKKVQEQPKRDVKPVESRPASPFENTAPARLTSSTATAATSKPVTSV 149

Query: 491 P 491

Sbjct: 150 A 150



Score = 41.1 bits (96), Expect = 6e-06
Identities = 30/124 (24%), Positives = 39/124 (31%), Gaps = 7/124 (5%)

Query: 368 QPKISVPEPVAAPKPLSLPVAAPLPAAPPPAAPPLPEAPPIPAAPPVVPVPVVVPPVPVP 427
P + P V P L + P P P PE PIP P PV + P P P
Sbjct: 38 LPAPAQPISVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVV-IEKPKPKP 96

Query: 428 VPVRIPVPDPVSPPQLLAPP---RLSVPQPVQPPVRVPQPP---SPPQVGGTVPQSPPQH 481
P PV P+ P R + P P R+ + + +V P
Sbjct: 97 KPKPKPVKKVQEQPKRDVKPVESRPASPFENTAPARLTSSTATAATSKPVTSVASGPRAL 156

Query: 482 QTPP 485

Sbjct: 157 SRNQ 160



Score = 36.9 bits (85), Expect = 2e-04
Identities = 29/129 (22%), Positives = 35/129 (27%), Gaps = 8/129 (6%)

Query: 339 VALQPTPNQHLIVPTEQAPAPPPVQASAPQPKISVPEPVAAPKPLSLPVAAPLPAAPPPA 398
V P P Q + V P QA P P+ V P P P
Sbjct: 35 VIELPAPAQPISVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPI-------PEPPKEAPV 87

Query: 399 APPLPEAPPIPAAPPVVPVPVVVPPVPVPVPVRIPVPDPVSPPQLLAPPRLSVPQPVQPP 458
P+ P P PV V PV R P + P L + +P
Sbjct: 88 VIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESRPASPFENTAPARLTSSTATAA-TSKPV 146

Query: 459 VRVPQPPSP 467
V P
Sbjct: 147 TSVASGPRA 155



Score = 32.3 bits (73), Expect = 0.005
Identities = 20/103 (19%), Positives = 29/103 (28%), Gaps = 8/103 (7%)

Query: 351 VPTEQAPAPPPVQASAPQPKISVPEPVAAPKPLSLPVAAPL--------PAAPPPAAPPL 402
E P P P+ + + + +P PKP PV P PA+P
Sbjct: 67 PVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESRPASPFE 126

Query: 403 PEAPPIPAAPPVVPVPVVVPPVPVPVPVRIPVPDPVSPPQLLA 445
AP + P + P P + A
Sbjct: 127 NTAPARLTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQA 169



Score = 30.3 bits (68), Expect = 0.019
Identities = 21/80 (26%), Positives = 33/80 (41%), Gaps = 2/80 (2%)

Query: 419 VVVPPVPVPVPVRIPVPDPVSPPQLLAPPRLSVPQPVQPPVRVPQPPSPPQV--GGTVPQ 476
+ +P P+ V + P + PPQ + PP V +P P +P+PP V P+
Sbjct: 36 IELPAPAQPISVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPK 95

Query: 477 SPPQHQTPPGEGTPPKPDSP 496
P+ + PK D
Sbjct: 96 PKPKPKPVKKVQEQPKRDVK 115


55  MAP3890  MAP3905  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP3890   2       9        -1.110333  hypothetical protein
MAP3891   2       10       1.278741   hypothetical protein
MAP3892   3       12       2.081587   hypothetical protein
MAP3893c  3       13       2.376941   hypothetical protein
MAP3894c  3       10       3.126796   GlnH
MAP3895c  3       9        3.436227   hypothetical protein
MAP3896   1       10       4.521568   hypothetical protein
MAP3897c  0       9        3.463607   thiamine-phosphate pyrophosphorylase
MAP3898   0       10       1.842646   hypothetical protein
MAP3899   1       12       0.126460   sulfur carrier protein ThiS
MAP3900   2       11       0.244296   thiazole synthase
MAP3901   3       12       -0.171566  hypothetical protein
MAP3902c  6       11       -0.626498  hypothetical protein
MAP3903c  4       12       -0.940525  hypothetical protein
MAP3904   2       10       -1.140073  hypothetical protein
MAP3905   2       10       -0.791867  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP3890   ACRIFLAVINRP  51     2e-08    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 51.0 bits (122), Expect = 2e-08
Identities = 49/288 (17%), Positives = 105/288 (36%), Gaps = 46/288 (15%)

Query: 151 SYESVAAVRKIVDSTPA--PPGVKAYVAGNTVLNADTSIVGHKSMATMALVSIVVIFVML 208
+ ++ A++ + P G+K +T SI + +I+++F+++
Sbjct: 299 ALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSI---HEVVKTLFEAIMLVFLVM 355

Query: 209 LVVYRSIVTTVLSLVIIGIELFAAQGITATAG-NLNIIGLTPYAVSMITMLSIAAGTDYV 267
+ +++ T++ + + + L I A G ++N + + L+I D
Sbjct: 356 YLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMV------LAIGLLVDDA 409

Query: 268 IFLLGRYHEARSFGQDREEAFYTAYHGVSHVILGSGLTIAGACLCLTAARLP-------- 319
I ++ R +D+ +S + + G + L+A +P
Sbjct: 410 IVVVENVE--RVMMEDKLPPKEATEKSMSQI----QGALVGIAMVLSAVFIPMAFFGGST 463

Query: 320 --YFQTMGLPCAIAMVVIVLAALTLAPAILAV-------------GSRFGLFDPKRAIDV 364
++ + AM + VL AL L PA+ A G FG F+ V
Sbjct: 464 GAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSV 523

Query: 365 RGWRKVGTAVVRWPKPIIVVTAAIAVIGFISLLTYVPNYDDQKFTPKD 412
+ ++ +++ A I V G + L +P F P++
Sbjct: 524 NHYTNSVGKILGSTGRYLLIYALI-VAGMVVLFLRLP----SSFLPEE 566



Score = 38.3 bits (89), Expect = 2e-04
Identities = 51/261 (19%), Positives = 93/261 (35%), Gaps = 27/261 (10%)

Query: 684 FKNPDFKRGLKMFVSPDGTAVRF---------------IITHQGDPASVEGIKHVAGVKD 728
FKNP+ + + V+ DG+ VR I G PA+ GIK G
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 729 -AVADAIKGTPLESSKVYLAG----TASMYSDMQEGVIIDLLVAGISCLILIFTIMLIIT 783
A AIK E + G + + I +++ ++L+F +M +
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 784 RSVVAALVIVGTVAASLGTACGLSVLMWQDLIGLGVQWIVLPLSIVILLAVGSDYNLLLV 843
+++ A L+ V L + + L + +VL + +++ A+ N V
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVEN---V 416

Query: 844 SRLKEEIPAGLNTGIIRGMGASGRVVTAAGLVFAFT---MASMIVSQLRVIGELGTTIAL 900
R+ E + M + +V + MA S + + TI
Sbjct: 417 ERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVS 476

Query: 901 GLLVDTLIVRSFMTPSIAAAL 921
+ + L+ TP++ A L
Sbjct: 477 AMALSVLVALIL-TPALCATL 496


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP3891   HTHTETR       69     1e-16    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 69.3 bits (169), Expect = 1e-16
Identities = 28/168 (16%), Positives = 58/168 (34%), Gaps = 8/168 (4%)

Query: 16 QRTEGRLDRSRDPAILDAALAALAEHGYDATNMNDIAARAGVGKAAIYRRWSSKAALMTD 75
++T+ +R ILD AL ++ G +T++ +IA AGV + AIY + K+ L ++
Sbjct: 3 RKTKQEAQETRQ-HILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 76 AL----IYWRPELLNDDAPDTGSLAGDLDAIVKRAKRNDNALISNDLVLRV---ALEAAH 128
L A G L I+ + L++ + E
Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVG 121

Query: 129 DPELATALNDLILFKGRRVLSAVLAQAADRGEIDPNRDWSLVADVLTA 176
+ + + + + L + + + A ++
Sbjct: 122 EMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRG 169


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP3893c  YERSSTKINASE  32     0.010    Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 32.0 bits (72), Expect = 0.010
Identities = 25/80 (31%), Positives = 41/80 (51%), Gaps = 5/80 (6%)

Query: 273 ILPALGYLHSIGLVYNDLKPENIMLTEEQLK--LIDLGAVSRINSFGYLYGTPGFQAPE- 329
+L +L G+V+ND+KP N++ + +IDLG SR + T F+APE
Sbjct: 254 LLDVTNHLAKAGVVHNDIKPGNVVFDRASGEPVVIDLGLHSRSGEQPKGF-TESFKAPEL 312

Query: 330 -IVRTGPTVATDIYTVGRTL 348
+ G + +D++ V TL
Sbjct: 313 GVGNLGASEKSDVFLVVSTL 332


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP3903c  PF07824       28     0.013    Type III secretion chaperone
		>PF07824#Type III secretion chaperone

Length = 120

Score = 28.0 bits (62), Expect = 0.013
Identities = 13/36 (36%), Positives = 18/36 (50%)

Query: 37 PNPSDINTLAASLSKGYGLNICTAGNIDRGRQLAVL 72
P +IN L +LS Y IC A + + G +A L
Sbjct: 49 ALPENINDLIYALSLNYSEKICLATDDEGGSLIARL 84


56  MAP3917c  MAP3938c  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP3917c  2       12       1.266324   hypothetical protein
MAP3918c  1       12       0.370191   peptide deformylase
MAP3919   -2      12       0.500370   hypothetical protein
MAP3920   -2      11       0.982863   hypothetical protein
MAP3921   -2      9        1.878801   SodC
MAP3922   -1      10       2.360469   carboxylate-amine ligase
MAP3923   -1      9        3.743273   hypothetical protein
MAP3924   -1      11       3.552389   hypothetical protein
MAP3925   0       12       4.284820   EchA8_2
MAP3926c  1       14       4.762639   hypothetical protein
MAP3927   1       14       4.795320   hypothetical protein
MAP3928c  2       13       4.324544   hypothetical protein
MAP3929c  1       14       3.698501   hypothetical protein
MAP3930c  -1      11       3.590600   phosphatidylserine decarboxylase
MAP3931   1       14       1.985642   hypothetical protein
MAP3932c  2       13       1.122267   MoaA3
MAP3933c  3       16       0.231046   short chain dehydrogenase
MAP3934c  3       15       0.007250   hypothetical protein
MAP3935   2       13       0.424629   hypothetical protein
MAP3936   1       14       0.285824   molecular chaperone GroEL
MAP3937   1       12       0.787205   hypothetical protein
MAP3938c  2       12       1.714474   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP3927   HTHTETR       52     2e-10    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 51.6 bits (123), Expect = 2e-10
Identities = 21/97 (21%), Positives = 35/97 (36%)

Query: 12 ILDAARALVLDGGPRAASVAAIAKASGAPAGTLYHRFGNRDGILTAAWLRALERFQARAL 71
ILD A L G + S+ IAKA+G G +Y F ++ + + W + L
Sbjct: 16 ILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELEL 75

Query: 72 AADGDTPEDTAVAMAVAAVGFARALPDDARLLLTIRP 108
P D + + + + R L +
Sbjct: 76 EYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEI 112


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP3933c  DHBDHDRGNASE  58     7e-12    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 57.8 bits (139), Expect = 7e-12
Identities = 55/209 (26%), Positives = 89/209 (42%), Gaps = 20/209 (9%)

Query: 21 GRVVVITGANTGIGYETAAVLAHRGAHVVLAVRDLEKGNAALSRIVAASPNADVTLQQLD 80
G++ ITGA GIG A LA +GAH+ + EK +S + A + +A+ D
Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAF--PAD 65

Query: 81 LASLASVRSAAEALRAAYPRIDLLINNAGV--MWTPKQVTEDGFELQFGTNHLGHFALTG 138
+ A++ + ID+L+N AGV ++++ +E F N G F +
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 139 LLLDHLLGVRDSRVVTVSSLGHRLRAAIHFDDLHWERRYDRVAAYGQSKLANLLFTYELQ 198
+ +++ R +VTV S + R +AAY SK A ++FT L
Sbjct: 126 SVSKYMMDRRSGSIVTVGS-----------NPAGVPRT--SMAAYASSKAAAVMFTKCLG 172

Query: 199 RRLAAAPDAKTIAVAAHPGGSNTELARHL 227
LA I PG + T++ L
Sbjct: 173 LELAEYNIRCNI---VSPGSTETDMQWSL 198


57  MAP3958c  MAP3976  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP3958c  0       11       -3.257935  hypothetical protein
MAP3959c  0       13       -3.970093  hypothetical protein
MAP3960   1       13       -4.378363  hypothetical protein
MAP3961   1       12       -3.439097  isocitrate lyase
MAP3962   1       10       -4.028047  3-hydroxybutyryl-CoA dehydrogenase
MAP3963   3       10       -3.651384  UmaA1
MAP3964c  2       12       -2.322086  UmaA2
MAP3965c  2       12       -0.601643  hypothetical protein
MAP3966   2       15       1.133470   hypothetical protein
MAP3967   2       14       0.341115   hypothetical protein
MAP3968   -1      13       1.551961   hypothetical protein
MAP3969   0       11       1.256526   hypothetical protein
MAP3970   0       8        1.877433   hypothetical protein
MAP3971   1       10       1.218637   deoxyribose-phosphate aldolase
MAP3972c  1       8        0.519582   hypothetical protein
MAP3973c  1       11       2.486512   hypothetical protein
MAP3974c  2       12       2.700450   hypothetical protein
MAP3975   2       12       2.350743   UDP-N-acetylenolpyruvoylglucosamine reductase
MAP3976   3       12       2.261775   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP3965c  HTHTETR       47     2e-08    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 46.5 bits (110), Expect = 2e-08
Identities = 28/176 (15%), Positives = 63/176 (35%), Gaps = 10/176 (5%)

Query: 17 RRWHQHKVDRRNELVDGTIDAIRRLG-GALSMDEIAAEIGVSKTVLYRYFVDKNDLTTAV 75
R+ Q + R ++D + + G + S+ EIA GV++ +Y +F DK+DL + +
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 76 MMRFTQTTLIPNMAAALTSGLDGFDLTREVIRVYVETVANEPEPYRFVMSNSSASKS--- 132
+ D + RE++ +E+ E + +
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 133 -KVIADSERIIARMLAVLMRRRMQHAGMDTGGVEPW-----AYLIVGGVQLATHSW 182
V+ ++R + + + ++H A ++ G + +W
Sbjct: 123 MAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENW 178


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP3972c  PERTACTIN     32     0.003    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 32.4 bits (73), Expect = 0.003
Identities = 21/57 (36%), Positives = 24/57 (42%), Gaps = 2/57 (3%)

Query: 41 PGQAPGQVPPPPPAPAPPPPGQTERFGAPRPQMQQPGYPPPPTPPAGPTERLATAPN 97
P P P P P P PP P Q + P Q+ P P PPAG L+ A N
Sbjct: 569 PAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAG--RELSAAAN 623



Score = 30.8 bits (69), Expect = 0.008
Identities = 24/64 (37%), Positives = 27/64 (42%), Gaps = 4/64 (6%)

Query: 35 RINTGGPGQAPGQVPPPPPAPAPPPPGQTERFGAPRPQMQQPGYPPPPTPPAGPTERLAT 94
R+ G GQ PPAP P P + P PQ QP PP P P P +R
Sbjct: 551 RLAANGNGQWSLVGAKAPPAPKPAPQPGPQ----PGPQPPQPPQPPQPPQPPQPPQRQPE 606

Query: 95 APNP 98
AP P
Sbjct: 607 APAP 610


58  MAP4015  MAP4056c  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP4015   0       12       3.098276   hypothetical protein
MAP4016c  0       13       3.684430   hypothetical protein
MAP4017c  -1      13       3.453771   hypothetical protein
MAP4018c  1       18       3.602199   hypothetical protein
MAP4019   2       19       3.360136   hypothetical protein
MAP4020   3       19       2.788030   glutamate-1-semialdehyde aminotransferase
MAP4021   5       18       2.667928   hypothetical protein
MAP4022   4       17       2.643170   hypothetical protein
MAP4023   3       17       2.571932   CcsA
MAP4024   3       15       2.732095   hypothetical protein
MAP4025   2       15       2.506044   CcsB
MAP4026   2       16       3.403731   hypothetical protein
MAP4027   3       16       3.362826   hypothetical protein
MAP4028c  2       15       3.880249   3-oxoacyl-ACP synthase III
MAP4029c  3       13       4.447070   1,4-dihydroxy-2-naphthoate
MAP4030   2       11       4.457659   5'-methylthioadenosine phosphorylase
MAP4031   3       13       5.457828   hypothetical protein
MAP4032   2       14       5.113879   hypothetical protein
MAP4033c  1       14       5.447689   hypothetical protein
MAP4034   2       12       5.139053   hypothetical protein
MAP4035   1       14       5.057246   hypothetical protein
MAP4036   1       12       3.524504   hypothetical protein
MAP4037c  2       10       2.646238   hypothetical protein
MAP4038c  1       10       1.969206   O-succinylbenzoic acid--CoA ligase
MAP4039c  1       15       0.266249   hypothetical protein
MAP4040c  1       15       0.501110   hypothetical protein
MAP4041c  1       13       0.821893   hypothetical protein
MAP4042c  2       14       1.161202   hypothetical protein
MAP4043c  1       12       0.506984   short chain dehydrogenase
MAP4044c  1       12       0.289829   naphthoate synthase
MAP4045c  2       11       1.391729   hypothetical protein
MAP4046c  1       12       1.106923   hypothetical protein
MAP4047   -1      11       1.957928   hypothetical protein
MAP4048c  -1      11       1.836549   acyl-CoA synthetase
MAP4049   -1      10       2.652386   hypothetical protein
MAP4050   0       10       3.278763   O-succinylbenzoate synthase
MAP4051   0       10       2.562753   BpoC_2
MAP4052   1       11       3.446368   2-succinyl-5-enolpyruvyl-6-hydroxy-3-
MAP4053   0       13       2.419564   hypothetical protein
MAP4054   1       14       2.338732   hypothetical protein
MAP4055   2       15       2.019725   ubiquinone/menaquinone biosynthesis
MAP4056c  3       14       1.501118   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP4020   FLGMRINGFLIF  30     0.016    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 30.3 bits (68), Expect = 0.016
Identities = 17/98 (17%), Positives = 32/98 (32%), Gaps = 6/98 (6%)

Query: 287 AAFGGRAEVMERLAPLGPVYQAGTLSGNPVAMAAGLATLRHADAAAYAALDANADRL-AR 345
A+ + + +E L L + + A+A +A + A Y L +N
Sbjct: 4 ASTATQPKPLEWLNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGG 63

Query: 346 LLSDALTNAGVPHQIPRAGNMLSVFFTDTPVTDFASAR 383
+ LT +P++ + V P R
Sbjct: 64 AIVAQLTQMNIPYRFANGSGAIEV-----PADKVHELR 96


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP4026   IGASERPTASE   30     0.032    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 29.6 bits (66), Expect = 0.032
Identities = 18/74 (24%), Positives = 27/74 (36%), Gaps = 4/74 (5%)

Query: 16 TTQIPPAAAAGDEKKDAAEPQPEPGGDAPTKAFAGFRTERRVPAPEREPAPPTAPRPGGM 75
T+Q+ P + + AEP E PT +++ A +PA T
Sbjct: 1126 TSQVSPKQEQSETVQPQAEPAREN---DPTVNIKEPQSQTNTTADTEQPAKET-SSNVEQ 1181

Query: 76 PPWDSTPVTGIPRV 89
P +ST V V
Sbjct: 1182 PVTESTTVNTGNSV 1195


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP4031   NUCEPIMERASE  91     6e-25    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 91.4 bits (227), Expect = 6e-25
Identities = 38/132 (28%), Positives = 55/132 (41%), Gaps = 12/132 (9%)

Query: 1 MRVLVTGAAGFIGSRVAAALRAAGHDVVAVDALLAAAHGPN---------PLPPNGCHRV 51
M+ LVTGAAGFIG V+ L AGH VV +D L + + P H++
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDN-LNDYYDVSLKQARLELLAQPGFQFHKI 59

Query: 52 DVRDADALAPLLA--GVDVVCHQAAMVGAGVDAADAPAYGGHNDLATTVLLAQMFAAGVR 109
D+ D + + L A + V + + AY N +L ++
Sbjct: 60 DLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQ 119

Query: 110 RLVLASSMVVYG 121
L+ ASS VYG
Sbjct: 120 HLLYASSSSVYG 131


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP4032   NUCEPIMERASE  81     4e-21    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 81.0 bits (200), Expect = 4e-21
Identities = 38/157 (24%), Positives = 58/157 (36%), Gaps = 23/157 (14%)

Query: 1 MVALRYHNVYGPGMPRDTPYSGVAAIFRSALEKGEPPRVFEDGGQMRDFVHVDDVAAANL 60
LR+ VYGP D F A+ +G+ V+ G RDF ++DD+A A +
Sbjct: 173 ATGLRFFTVYGPWGRPDMALF----KFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAII 228

Query: 61 AAL------------------ACRDGFTAVNVCSGQPISILQVATALCDARGGAVAPVVT 102
A + N+ + P+ ++ AL DA G A
Sbjct: 229 RLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIE-AKKNM 287

Query: 103 GQYRSGDVRHIVADPSRAARLLGFRAAVQPGDGLREF 139
+ GDV AD ++GF DG++ F
Sbjct: 288 LPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNF 324


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP4034   PF05616       35     0.001    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 34.7 bits (79), Expect = 0.001
Identities = 34/122 (27%), Positives = 41/122 (33%), Gaps = 28/122 (22%)

Query: 358 QNGTPGKFMPILPSQQQAPLPPPPADAPNAGFQGGVVQAPSNAPAPQ--PAVPAPANPAP 415
+NG P + + Q P G +AP+ P P+ PA NPAP
Sbjct: 284 RNGNPVQVVATFGRDSQGNTTVDVQVIPRPDLTPGSAEAPNAQPLPEVSPAENPANNPAP 343

Query: 416 EAPSNPGVVPNPNPGWAPNPNPLVPVPIPIPVPIPGWNPPYNP-----PYTPPYNPTTPY 470
NPG PNP P P NP NP P T P +P P
Sbjct: 344 N--ENPGTRPNPEPD-------------------PDLNPDANPDTDGQPGTRPDSPAVPD 382

Query: 471 TP 472
P
Sbjct: 383 RP 384


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP4043c  DHBDHDRGNASE  75     8e-18    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 74.7 bits (183), Expect = 8e-18
Identities = 49/178 (27%), Positives = 83/178 (46%), Gaps = 3/178 (1%)

Query: 38 LAGKRVLLTGASSGIGEAAAEQFAREGARVVVVARRKDLLDALAERITRAGGEAIAMPCD 97
+ GK +TGA+ GIGEA A A +GA + V + L+ + + A A P D
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 98 ISDLDAADALVADVQQRLGGVDILINNAGRSIRRPLAESLERWHDVERTMVLNYYAPLRL 157
+ D A D + A +++ +G +DIL+N AG + RP + E T +N
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAG--VLRPGLIHSLSDEEWEATFSVNSTGVFNA 123

Query: 158 IRGIAPGMIERGDGHIINVSTWGVLSEASPLFAVYNASKAALSTVSRVVETEWGDKGV 215
R ++ M++R G I+ V + + + A Y +SKAA ++ + E + +
Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPAGVPRTSM-AAYASSKAAAVMFTKCLGLELAEYNI 180


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP4047   HTHTETR       61     1e-13    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 60.8 bits (147), Expect = 1e-13
Identities = 32/169 (18%), Positives = 64/169 (37%), Gaps = 16/169 (9%)

Query: 33 GRSDARRNRQRLLEAATAAFTAHG-ASVSLESIARDAGVGIGTLYRHFPNREALVEAVYR 91
+ +A+ RQ +L+ A F+ G +S SL IA+ AGV G +Y HF ++ L ++
Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64

Query: 92 AELAEVAAAAAQLLQRHP--PKTALRRWMDRYANFVATKRG-------------MAESLQ 136
+ + + + P P + LR + T+ +
Sbjct: 65 LSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMA 124

Query: 137 AIFESGALVPSQTRDSIVGAVETLLRAGADDASLRADVQADDVVSSLIG 185
+ ++ + ++ D I ++ + A A L A + + G
Sbjct: 125 VVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISG 173


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP4049   UREASE        34     0.002    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 33.9 bits (78), Expect = 0.002
Identities = 14/30 (46%), Positives = 18/30 (60%)

Query: 527 DDVIGSLEVGKYADLVVLSADPRAVPPERI 556
IGSLEVGK ADLV+ + V P+ +
Sbjct: 420 SHEIGSLEVGKRADLVLWNPAFFGVKPDMV 449


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP4055   BCTERIALGSPD  28     0.025    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 28.3 bits (63), Expect = 0.025
Identities = 26/110 (23%), Positives = 39/110 (35%), Gaps = 10/110 (9%)

Query: 80 DFSVGMLAAGAARRVPKVAGDATRLPFADDVFDAVTISFGLRNVVDTQAALREMARVTRP 139
F+V + G + V +P A D + R V T A R++A + R
Sbjct: 89 GFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGIGDEVVTRVVPLTNVAARDLAPLLRQ 148

Query: 140 ------GGRLVVCEFSTPSNALFATVYKEYLMRALPRVARAVSSNPDAYV 183
G +V E PSN L T ++ L + V + D V
Sbjct: 149 LNDNAGVGSVVHYE---PSNVLLMTGRAAV-IKRLLTIVERVDNAGDRSV 194


59  MAP4256  MAP4269c  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP4256   2       14       2.470847   hypothetical protein
MAP4257   3       13       1.265221   GadB
MAP4258   3       13       2.844341   alanine racemase
MAP4259   1       12       2.330593   hypothetical protein
MAP4260   0       14       2.208138   hypothetical protein
MAP4261   -1      10       -0.401566  hypothetical protein
MAP4262   1       15       -1.818112  hypothetical protein
MAP4263   2       17       -2.267236  DNA-binding/iron metalloprotein/AP endonuclease
MAP4264   2       23       -4.821040  molecular chaperone GroES
MAP4265   3       25       -5.245233  molecular chaperone GroEL
MAP4266   1       31       -6.699967  hypothetical protein
MAP4267   1       23       -5.021186  hypothetical protein
MAP4268c  1       18       -4.317540  hypothetical protein
MAP4269c  3       14       -1.939509  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP4258   ALARACEMASE   388    e-136    Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 388 bits (998), Expect = e-136
Identities = 110/373 (29%), Positives = 173/373 (46%), Gaps = 28/373 (7%)

Query: 17 AEALVDLGAIEHNVRLLCEQARGAQVMAVVKADGYGHGAVQTARAALAAGAAELGVATVD 76
+A +DL A++ N+ ++ + A A+V +VVKA+ YGHG + A A + ++
Sbjct: 5 IQASLDLQALKQNLSIVRQAATHARVWSVVKANAYGHGIERIWSAIGATDG--FALLNLE 62

Query: 77 EALALRAAGISAPVL---AWLHPPGIDFRPALLAGVQIGLSSQRQLDELLTAVRDTGRTA 133
EA+ LR G P+L + H ++ + + S QL L A
Sbjct: 63 EAITLRERGWKGPILMLEGFFHAQDLEIYDQ--HRLTTCVHSNWQLKALQNA--RLKAPL 118

Query: 134 TVTVKVDTRLNRNGVPPAQYPSMLTALRRAVAEEAIVPRGLMSHMVYADQPANPVNDVQA 193
+ +KV++ +NR G P + +LT ++ A + LMSH A+ P
Sbjct: 119 DIYLKVNSGMNRLGFQPDR---VLTVWQQLRAMANVGEMTLMSHFAEAEHPDGISG--AM 173

Query: 194 QRFTDMLAQAREQGVRFEVAHLSNSSATMSRPDLAFDMVRPGIAVYGLSPVPELGDM--- 250
R + +G+ LSNS+AT+ P+ FD VRPGI +YG SP + D+
Sbjct: 174 ARI-----EQAAEGLECRR-SLSNSAATLWHPEAHFDWVRPGIILYGASPSGQWRDIANT 227

Query: 251 GLVPAMTVKCTVALVKSIRAGESVSYGHTWTAQRDTNLALLPVGYADGIFRSLGGRLQVS 310
GL P MT+ + V++++AGE V YG +TA+ + + ++ GYADG R V
Sbjct: 228 GLRPVMTLSSEIIGVQTLKAGERVGYGGRYTARDEQRIGIVAAGYADGYPRHAPTGTPVL 287

Query: 311 INGRRRPGVGRICMDQFVVDLGPGRPDVAEGDEAILFGPGSNGEPTAQDWADLLGTIHYE 370
++G R VG + MD VDL P P G L+G E D A GT+ YE
Sbjct: 288 VDGVRTMTVGTVSMDMLAVDLTPC-PQAGIGTPVELWGK----EIKIDDVAAAAGTVGYE 342

Query: 371 VVTSPRGRITRTY 383
++ + R+
Sbjct: 343 LMCALALRVPVVT 355


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP4262   SACTRNSFRASE  48     2e-09    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 47.6 bits (113), Expect = 2e-09
Identities = 18/92 (19%), Positives = 31/92 (33%), Gaps = 6/92 (6%)

Query: 53 GARVGDTLVGYAGISRLGRVPPYEYEIHTIGVDPAYQGRGIGRLMLDRLLEFA---DGGV 109
+ + +G I I I V Y+ +G+G +L + +E+A
Sbjct: 69 LYYLENNCIGRIKIRSNWNGYAL---IEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCG 125

Query: 110 VYLEVRTDNEPAIGLYRSVGFEQIGLRRRYYR 141
+ LE + N A Y F + Y
Sbjct: 126 LMLETQDINISACHFYAKHHFIIGAVDTMLYS 157


60  MAP4322c  MAP4329c  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP4322c  6       22       0.164282   hypothetical protein
MAP4323c  6       26       -0.525639  hypothetical protein
MAP4324c  8       31       -0.741899  hypothetical protein
MAP4325c  5       29       -1.846516  hypothetical protein
MAP4326c  2       31       -2.558819  hypothetical protein
MAP4327c  0       22       -2.893166  hypothetical protein
MAP4328c  -1      23       -4.025623  hypothetical protein
MAP4329c  0       17       -3.046712  hypothetical protein
61  MAP0017c  MAP0022c  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP0017c  1       8        -0.294261  hypothetical protein
MAP0018c  3       10       -0.587704  hypothetical protein
MAP0019c  3       10       -0.667297  PbpA
MAP0020c  3       13       -1.219596  RodA
MAP0021c  2       14       -0.112817  Ppp
MAP0022c  1       20       -1.722703  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0017c  TETREPRESSOR  27     0.026    Tetracycline repressor protein signature.
		>TETREPRESSOR#Tetracycline repressor protein signature.

Length = 218

Score = 27.2 bits (60), Expect = 0.026
Identities = 12/67 (17%), Positives = 21/67 (31%), Gaps = 3/67 (4%)

Query: 67 GHRPPPARRTFSSGQRALLWAAGVLGALAIIIAVLI---VINSRADQQQQPPTVTDTGTP 123
G RP + Q + G + + + + +QQ+ +TD
Sbjct: 102 GTRPDEKQYDTVETQLRFMTENGFSLRDGLYAISAVSHFTLGAVLEQQEHTAALTDRPAA 161

Query: 124 PASAPPP 130
P PP
Sbjct: 162 PDENLPP 168


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0018c  YERSSTKINASE  34     6e-04    Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 33.9 bits (77), Expect = 6e-04
Identities = 37/144 (25%), Positives = 61/144 (42%), Gaps = 19/144 (13%)

Query: 70 HPGIAAVHDYGESQLDGEGRTAYLVMELVNGEPLNSVLKRTGRLSLRHALDMLEQTGRAL 129
HP +A VH G + L+M+ V+G + L+ + ++ G
Sbjct: 190 HPNLANVHGMAVVPY-GNRKEEALLMDEVDGWRCSDTLRTLADSWKQGKINSEAYWGTIK 248

Query: 130 QVAH----------AAGLVHRDVKPGNILIT-PTGQVKITDFGIAKAVDAAPVTQTGMVM 178
+AH AG+VH D+KPGN++ +G+ + D G+ P G
Sbjct: 249 FIAHRLLDVTNHLAKAGVVHNDIKPGNVVFDRASGEPVVIDLGLHSRSGEQP---KGF-- 303

Query: 179 GTAQYIAPEQALGH-DATPASDVY 201
T + APE +G+ A+ SDV+
Sbjct: 304 -TESFKAPELGVGNLGASEKSDVF 326


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP0021c  PF05616  31     0.014    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 30.9 bits (69), Expect = 0.014
Identities = 22/71 (30%), Positives = 28/71 (39%), Gaps = 6/71 (8%)

Query: 423 PRATSPPGAQATRSPVPETGGPASPAP-PTTSASPTPSTNATPGPASSSPAGPTTTSQTL 481
P + P AQ P+PE +PA P + +P N P P + A P T Q
Sbjct: 317 PGSAEAPNAQ----PLPEVSPAENPANNPAPNENPGTRPNPEPDPDLNPDANPDTDGQPG 372

Query: 482 TALPGPPLQPG 492
T P P P
Sbjct: 373 TR-PDSPAVPD 382


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
MAP0022c  IGASERPTASE  28     0.013    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 28.5 bits (63), Expect = 0.013
Identities = 14/55 (25%), Positives = 21/55 (38%)

Query: 88 ADDSTLVLTDDYASARHARLTQRGSEWYVEDLGSTNGTYLDRAKVTTAVRVPIGT 142
+D T +DY R + + S GTY D+ K VR+ G+
Sbjct: 154 TEDQTQKRREDYYMPRLDKFVTEVAPIEASTASSDAGTYNDQNKYPAFVRLGSGS 208


62  MAP0151c  MAP0155  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP0151c  -2      11       -0.861546  hypothetical protein
MAP0152c  -1      11       -0.144217  hypothetical protein
MAP0153   2       10       0.758449   hypothetical protein
MAP0154   2       13       0.090543   hypothetical protein
MAP0155   3       14       -0.414595  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
MAP0151c  CHANNELTSX  28     0.019    Nucleoside-specific channel-forming protein Tsx signa...
		>CHANNELTSX#Nucleoside-specific channel-forming protein Tsx signature.

Length = 294

Score = 27.7 bits (61), Expect = 0.019
Identities = 17/56 (30%), Positives = 25/56 (44%), Gaps = 6/56 (10%)

Query: 88 AAVAIGSLSYAATLQALTGRLPGNTDEDRYFEAWVNQTVSVLAQYTNRRKPLNDND 143
AA A+ +LS A D+ +Y W +Q+V+V+ Y R P ND
Sbjct: 7 AAGAVVALSTTFAAGA------AENDKPQYLSDWWHQSVNVVGSYHTRFGPQIRND 56


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP0152c  HTHTETR  53     2e-12    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 53.1 bits (127), Expect = 2e-12
Identities = 19/51 (37%), Positives = 31/51 (60%), Gaps = 2/51 (3%)

Query: 14 RDRLLSAALRLFAAKGYAATSVADIQRESGLAPGSGALYKHFGSKRELLEA 64
R +L ALRLF+ +G ++TS+ +I + +G+ GA+Y HF K +L
Sbjct: 13 RQHILDVALRLFSQQGVSSTSLGEIAKAAGVTR--GAIYWHFKDKSDLFSE 61


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP0153   PF06580  37     9e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 37.2 bits (86), Expect = 9e-05
Identities = 24/136 (17%), Positives = 54/136 (39%), Gaps = 31/136 (22%)

Query: 308 DALRVYPDVEVSLVPSPTVLMIGLPTGLRLVIDNAIANAVKHG-----NAGKIQLTVSSS 362
D L+ + P ++ + +P +++ + N +KHG GKI L +
Sbjct: 238 DRLQFENQIN------PAIMDVQVP---PMLVQTLVENGIKHGIAQLPQGGKILLKGTKD 288

Query: 363 GEGVEIAIDDDGSGIPESERATVFERFARGSTAARSGSGLGLALVAQQ-AELHGGTAELQ 421
V + +++ GS ++ + +G GL V ++ L+G A+++
Sbjct: 289 NGTVTLEVENTGSLALKNT---------------KESTGTGLQNVRERLQMLYGTEAQIK 333

Query: 422 -NSPLGGTRLLLRLAG 436
+ G ++ + G
Sbjct: 334 LSEKQGKVNAMVLIPG 349


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP0155   HTHTETR  45     3e-08    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 45.0 bits (106), Expect = 3e-08
Identities = 22/73 (30%), Positives = 31/73 (42%), Gaps = 1/73 (1%)

Query: 2 ATATRERFLTAATGLFRRQGYSGTGLKQIVAESRAPLGSLYHFFPGGKQDLAVQAIAHTA 61
A TR+ L A LF +QG S T L +I + G++Y F K DL + +
Sbjct: 9 AQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHF-KDKSDLFSEIWELSE 67

Query: 62 ERYRELLDRVFAR 74
EL A+
Sbjct: 68 SNIGELELEYQAK 80


63  MAP0165  MAP0172  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP0165   -1      13       1.979915   hypothetical protein
MAP0166   -2      12       2.219180   hypothetical protein
MAP0167   -2      10       0.524553   hypothetical protein
MAP0168c  -1      10       0.608286   hypothetical protein
MAP0169c  -1      10       0.305121   hypothetical protein
MAP0170   -2      11       0.121058   RNA polymerase sigma factor SigI
MAP0171c  -2      11       -0.341428  hypothetical protein
MAP0172   -1      11       -1.105797  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
MAP0165   SUBTILISIN  123    3e-33    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 123 bits (309), Expect = 3e-33
Identities = 56/259 (21%), Positives = 99/259 (38%), Gaps = 63/259 (24%)

Query: 239 GVVGVAPHATIISIRQSSRAFEPVNPPPGDPNSDEKVKAGTLNSVARAVVHAANMGAKVI 298
GVVGVAP A ++ I+ + K +G + + + + +A +I
Sbjct: 102 GVVGVAPEADLLIIKVLN-----------------KQGSGQYDWIIQGIYYAIEQKVDII 144

Query: 299 NISVTACLPAAAPADQRALGAALWYAATVKDAVVVAAAGNDGEAGCNNNPMFDLLDPSDP 358
++S+ P D L A+ A +V+ AAGN+G+ +
Sbjct: 145 SMSL------GGPEDVPELHEAVKKA-VASQILVMCAAGNEGDGDDRTDE---------- 187

Query: 359 RDWHQVKVVSAPSWFSDYVLSVGAVDASGAALDKSMSGPWVGVAAPGTHIMGLSPQGGGP 418
+ P + + V+SVGA++ A + S S V + APG I+ P G
Sbjct: 188 --------LGYPGCY-NEVISVGAINFDRHASEFSNSNNEVDLVAPGEDILSTVPGG--- 235

Query: 419 VNAYPPSRPGEKNMPFWGTSFSAAYVSGVAALVRAKFP-----ELSAHQVINRIVQSAHN 473
K F GTS + +V+G AL++ +L+ ++ ++++
Sbjct: 236 -----------KYATFSGTSMATPHVAGALALIKQLANASFERDLTEPELYAQLIKRT-I 283

Query: 474 PPAGVDNKVGYGLVDPVAA 492
P G GL+ A
Sbjct: 284 PLGNSPKMEGNGLLYLTAV 302



Score = 64.1 bits (156), Expect = 3e-13
Identities = 25/101 (24%), Positives = 42/101 (41%), Gaps = 8/101 (7%)

Query: 52 MRRANSCSAPITVRN-PDVAQLAPGFNLLNIAKAWQYSTGNGVPVAVIDTGVSPN-PRLP 109
M R ++ V ++ G ++ W + G GV VAV+DTG + P L
Sbjct: 1 MERKVHIIPYQVIKQEQQVNEIPRGVEMIQAPAVWNQTRGRGVKVAVLDTGCDADHPDLK 60

Query: 110 --VVPGGDYIMGEDG----LSDCDAHGTIVSSIIGAAPQGI 144
++ G ++ ++G D + HGT V+ I A
Sbjct: 61 ARIIGGRNFTDDDEGDPEIFKDYNGHGTHVAGTIAATENEN 101


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0169c  NUCEPIMERASE  30     0.008    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 30.1 bits (68), Expect = 0.008
Identities = 12/30 (40%), Positives = 16/30 (53%)

Query: 1 MTIVVTGATGNVGRPLVTALLAAGAPVRAV 30
M +VTGA G +G + LL AG V +
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGI 30


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP0171c  PF03544  44     3e-07    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 44.2 bits (104), Expect = 3e-07
Identities = 28/139 (20%), Positives = 39/139 (28%), Gaps = 6/139 (4%)

Query: 13 QPERPSARPAPEAPPPPVAPRPAPPPRPPTPAPTPPTRPPAPPTFRPHPPAAAPAPAAPP 72
P P P PE P P P+ AP P P +P
Sbjct: 68 PPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPAS 127

Query: 73 TPRPQPSATPQLPDRGWRRALRLATFGLIDLGPSPAQRREAQF-EQAIQARLYGNYKVG- 130
A P A + GP R + Q+ +A R+ G KV
Sbjct: 128 PFENTAPARPTSSTAT---AATSKPVTSVASGPRALSRNQPQYPARAQALRIEGQVKVKF 184

Query: 131 -VLGKGGVGKTSVAASVGS 148
V G V + ++ +
Sbjct: 185 DVTPDGRVDNVQILSAKPA 203



Score = 36.1 bits (83), Expect = 2e-04
Identities = 22/72 (30%), Positives = 27/72 (37%), Gaps = 4/72 (5%)

Query: 18 SARPAPEAPPPPVAPRPA--PPPRPPTPAPTPPTRPPAPPTFRPHPPAAAP--APAAPPT 73
PAP P PA PP+ P P P P P P PP AP P
Sbjct: 41 IELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPK 100

Query: 74 PRPQPSATPQLP 85
P+P+P ++
Sbjct: 101 PKPKPKPVKKVE 112



Score = 35.3 bits (81), Expect = 2e-04
Identities = 15/86 (17%), Positives = 21/86 (24%)

Query: 6 EFLRDRLQPERPSARPAPEAPPPPVAPRPAPPPRPPTPAPTPPTRPPAPPTFRPHPPAAA 65
E +P+ R P +P T P P + +
Sbjct: 89 EAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSK 148

Query: 66 PAPAAPPTPRPQPSATPQLPDRGWRR 91
P + PR PQ P R
Sbjct: 149 PVTSVASGPRALSRNQPQYPARAQAL 174


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0172   ARGREPRESSOR  30     0.048    Bacterial arginine repressor signature.
		>ARGREPRESSOR#Bacterial arginine repressor signature.

Length = 149

Score = 29.8 bits (67), Expect = 0.048
Identities = 17/81 (20%), Positives = 30/81 (37%), Gaps = 4/81 (4%)

Query: 228 RFSTNTFPSWPLAHPFRRIAHNGE---INTVTGNENWM-RAREALIKTDVFGTEADVEKL 283
RF+ + L F +I + T+ GN + + L ++ GT + +
Sbjct: 69 RFNPLSKLKRSLMDAFVKIDSASHLIVLKTMPGNAQAIGALMDNLDWEEIMGTICGDDTI 128

Query: 284 FPICTPGASDTARFDEVLELL 304
IC ++LELL
Sbjct: 129 LIICRTHDDTKVVQKKILELL 149


64  MAP0191c  MAP0202  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP0191c  2       13       4.921640   hypothetical protein
MAP0192c  0       14       4.023396   hypothetical protein
MAP0193   1       16       3.877459   prephenate dehydratase
MAP0194   1       15       3.152392   hypothetical protein
MAP0195c  1       15       2.231737   hypothetical protein
MAP0196c  2       14       2.428071   hypothetical protein
MAP0197   -1      12       1.314695   seryl-tRNA synthetase
MAP0198c  -1      10       0.775950   hypothetical protein
MAP0199   -1      10       -0.927328  hypothetical protein
MAP0200c  -1      10       -1.017876  hypothetical protein
MAP0201   -1      11       -0.303367  hypothetical protein
MAP0202   0       11       -0.510190  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP0191c  PF05616  38     7e-05    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 37.8 bits (87), Expect = 7e-05
Identities = 29/76 (38%), Positives = 33/76 (43%), Gaps = 7/76 (9%)

Query: 12 PGR-QGPPRQPGPEPSPVIPRPGGPAPSPHAPTQPLHRPPP--APPARP----APPARPA 64
PG + P QP PE SP PAP+ + T+P P P P A P P RP
Sbjct: 317 PGSAEAPNAQPLPEVSPAENPANNPAPNENPGTRPNPEPDPDLNPDANPDTDGQPGTRPD 376

Query: 65 PPASAIRPARPRRKRR 80
PA RP RK R
Sbjct: 377 SPAVPDRPNGRHRKER 392


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
MAP0196c  PERTACTIN  38     7e-05    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 38.2 bits (88), Expect = 7e-05
Identities = 19/48 (39%), Positives = 20/48 (41%)

Query: 400 QAPPPPVDNGAPPPANPAPEAPPAPVPPPAAPPPPPPPAPEAPPGGPP 447
+APP P P P P P P PP PP PEAP PP
Sbjct: 566 KAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPP 613



Score = 35.5 bits (81), Expect = 5e-04
Identities = 21/51 (41%), Positives = 22/51 (43%), Gaps = 1/51 (1%)

Query: 401 APPPPVDNGAPPPANPAPEAPPAPVPPPAAPPPPPPPAPEA-PPGGPPPAG 450
A PP AP P PP P PP P PP PP + P PPAG
Sbjct: 565 AKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAG 615



Score = 33.5 bits (76), Expect = 0.002
Identities = 18/40 (45%), Positives = 18/40 (45%)

Query: 409 GAPPPANPAPEAPPAPVPPPAAPPPPPPPAPEAPPGGPPP 448
GA P P P P P P P P PP PP P PP P
Sbjct: 564 GAKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQR 603



Score = 33.2 bits (75), Expect = 0.003
Identities = 18/48 (37%), Positives = 18/48 (37%)

Query: 398 PPQAPPPPVDNGAPPPANPAPEAPPAPVPPPAAPPPPPPPAPEAPPGG 445
PP P P P P P P PP P PP P P PP G
Sbjct: 568 PPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAG 615



Score = 31.6 bits (71), Expect = 0.007
Identities = 16/46 (34%), Positives = 16/46 (34%)

Query: 389 PGQQPVVTQPPQAPPPPVDNGAPPPANPAPEAPPAPVPPPAAPPPP 434
P QP P P PP P P PP P AP PP
Sbjct: 568 PPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPP 613



Score = 31.2 bits (70), Expect = 0.010
Identities = 17/49 (34%), Positives = 17/49 (34%)

Query: 393 PVVTQPPQAPPPPVDNGAPPPANPAPEAPPAPVPPPAAPPPPPPPAPEA 441
P PQ P P PP P P PP P P P PPA
Sbjct: 569 PAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAGRE 617



Score = 28.9 bits (64), Expect = 0.047
Identities = 16/52 (30%), Positives = 18/52 (34%)

Query: 388 LPGQQPVVTQPPQAPPPPVDNGAPPPANPAPEAPPAPVPPPAAPPPPPPPAP 439
L G + P P P PP P+ P P PP P P P P
Sbjct: 562 LVGAKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPP 613



Score = 28.9 bits (64), Expect = 0.047
Identities = 21/62 (33%), Positives = 23/62 (37%), Gaps = 1/62 (1%)

Query: 366 AQSPSTQSGSAAAPEMPRNNQHLPGQQPVVTQPPQAPPPPVDNGAPPPANPAPEAPPAPV 425
A + + Q A P QPPQ P PP P P PEA PAP
Sbjct: 553 AANGNGQWSLVGAKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEA-PAPQ 611

Query: 426 PP 427
PP
Sbjct: 612 PP 613


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0197   CHANLCOLICIN  30     0.026    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 29.7 bits (66), Expect = 0.026
Identities = 29/103 (28%), Positives = 40/103 (38%), Gaps = 13/103 (12%)

Query: 25 PSLVDALLAADAARRAAISTADTLRAEQKSASKSVGAAS-----PEQRPALLARAK---- 75
PS + A +AA +A +AE+K A K AA EQR + R K
Sbjct: 110 PSATELAHANNAAMQAEDERLRLAKAEEK-ARKEAEAAEKAFQEAEQRRKEIEREKAETE 168

Query: 76 ---ELAEQVKAAETAQAEAEAAFTAAHLAISNVVIDGVPAGGE 115
+LAE + A +E A A +S + V GE
Sbjct: 169 RQLKLAEAEEKRLAALSEEAKAVEIAQKKLSAAQSEVVKMDGE 211


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP0198c  HTHTETR  30     0.005    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 30.4 bits (68), Expect = 0.005
Identities = 11/67 (16%), Positives = 22/67 (32%), Gaps = 2/67 (2%)

Query: 149 LRAACALLADNLRKPLTLRQIGERIGVGQRTLSRLF--RDELAMTFPQWRTQVRLQHALV 206
L A L + +L +I + GV + + F + +L + + L
Sbjct: 17 LDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELE 76

Query: 207 LLAERRD 213
A+
Sbjct: 77 YQAKFPG 83


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP0201   HTHTETR  58     1e-12    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 57.7 bits (139), Expect = 1e-12
Identities = 40/188 (21%), Positives = 68/188 (36%), Gaps = 14/188 (7%)

Query: 1 MVRPAQTARSERTREALRQAALVRFLAQGVEDTSAEQIAADAGVSLRTFYRHFRSKHDLL 60
M R + ++ TR+ + AL F QGV TS +IA AGV+ Y HF+ K DL
Sbjct: 1 MARKTK-QEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLF 59

Query: 61 FADYTGLH-----WFRAALDARPAD-EPIIDSVQAAIFSFPYDVDAVTKIAA-LRHEELD 113
+ P D ++ + + + + + H+
Sbjct: 60 SEIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEF 119

Query: 114 PG---RIVRHIRDVQADFAEAIAAQLQRRGRGAGAPTADQRVRAAVTARCIAAAVFGAME 170
G + + R++ + + I L + A AD R A A + + G ME
Sbjct: 120 VGEMAVVQQAQRNLCLESYDRIEQTL-KHCIEAKMLPADLMTRRA--AIIMRGYISGLME 176

Query: 171 VWMVGDER 178
W+ +
Sbjct: 177 NWLFAPQS 184


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP0202   PF07212  30     0.019    Hyaluronoglucosaminidase
		>PF07212#Hyaluronoglucosaminidase

Length = 336

Score = 30.4 bits (68), Expect = 0.019
Identities = 15/42 (35%), Positives = 22/42 (52%)

Query: 196 AVNTLYRGPATPGSAAALAFGLGVPDGDTMQMKKLRGGIGVL 237
AVN R P TP ++AL G +G MQ++ + +G L
Sbjct: 180 AVNIAMRQPTTPNFSSALNITSGNENGSAMQIRGVEKALGTL 221


65  MAP0259  MAP0266c  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP0259   -1      9        0.085071   hypothetical protein
MAP0260   -2      8        0.000691   hypothetical protein
MAP0261c  -2      9        -0.837455  hypothetical protein
MAP0262   -2      9        -0.431636  hypothetical protein
MAP0263c  0       10       0.263388   hypothetical protein
MAP0264c  0       11       -0.080908  hypothetical protein
MAP0265c  0       9        -0.236105  FadE1_1
MAP0266c  0       10       -0.364441  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
MAP0259   HTHFIS  102    2e-27    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 102 bits (257), Expect = 2e-27
Identities = 34/117 (29%), Positives = 64/117 (54%)

Query: 23 VLVVDDEAVLAEMVSMALRYEGWNIATASDGASAIAAARNQRPDVVVLDVMLPDMSGLDV 82
+LV DD+A + +++ AL G+++ S+ A+ D+VV DV++PD + D+
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 83 LHKLREENPQLPVLLLTAKDAVEDRIAGLTAGGDDYVTKPFSIEEVVLRLRALLRRT 139
L ++++ P LPVL+++A++ I G DY+ KPF + E++ + L
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
MAP0261c  FIMBRILLIN  27     0.026    Porphyromonas gingivalis: fimbrillin protein signature.
		>FIMBRILLIN#Porphyromonas gingivalis: fimbrillin protein signature.

Length = 348

Score = 27.3 bits (60), Expect = 0.026
Identities = 24/78 (30%), Positives = 34/78 (43%), Gaps = 3/78 (3%)

Query: 82 AATGIAAVLSDGNPPQVKSVGLGNVNGVTLGYTSGTGQGNASAEKNGNSYKITGTATGVD 141
AA +++D NP V L N N Y +G N E+N + Y I T TG
Sbjct: 260 AAFNAGWIVADNNPTTYYPV-LVNFNSNNYTYDNGYTPKN-KIERN-HKYDIKLTITGPG 316

Query: 142 MANPLQPVNKPFEIDVTC 159
NP P+ + ++V C
Sbjct: 317 TNNPENPITESAHLNVQC 334


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP0264c  HTHTETR  52     1e-10    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 52.3 bits (125), Expect = 1e-10
Identities = 25/161 (15%), Positives = 48/161 (29%), Gaps = 12/161 (7%)

Query: 20 RQREATEEVERILAAAVRVMERVAPEPPRVSDIVAEAGSSNKAFYRYFAGKDELILAVME 79
++EA E + IL A+R+ + + +I AG + A Y +F K +L + E
Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64

Query: 80 RGVAIVVSYLEHQMAKEATPRGKIARWIEGTLAQVADPHLISMTRA----------AAGQ 129
+ + AK P ++ E + + R G+
Sbjct: 65 LSESNIGELELEYQAK--FPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 130 MSAGTSWRAADQEMMRPLRELLVEPVAALGSSDVDRDVEAV 170
M+ + E ++ D
Sbjct: 123 MAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRA 163


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0266c  DHBDHDRGNASE  75     4e-18    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 75.1 bits (184), Expect = 4e-18
Identities = 65/251 (25%), Positives = 104/251 (41%), Gaps = 14/251 (5%)

Query: 13 RVVLITGGNRGLGREMAFGAARCGADVVIASRNLDNCVATAQQVEHETGRRAMAYQVHVG 72
++ ITG +G+G +A A GA + N + ++ E R A A+ V
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAE-ARHAEAFPADVR 67

Query: 73 RWDQLDGLVEASYDRFGKIDTLINNAGMSPLYDKLTD-VTEKLFDAVVNLNLKGPFRLSA 131
+D + G ID L+N AG+ L L ++++ ++A ++N G F S
Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGV--LRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 132 LVGERMVAAGRGSIINVSTAGSLRPTPDIVPYAASKAGLNAMTEALAKAFGP-AVRVNTL 190
V + M+ GSI+ V + + P + YA+SKA T+ L +R N +
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 191 MAGPFLTDVSRA-WNLEAVQENPFRHLA--------LQRAGDPREIVGAALFLASDASSF 241
G TD+ + W E E + L++ P +I A LFL S +
Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGH 245

Query: 242 TTGSILRADGG 252
T L DGG
Sbjct: 246 ITMHNLCVDGG 256


66  MAP0348c  MAP0356c  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP0348c  0       7        -0.818359  hypothetical protein
MAP0349   0       7        -0.332286  hypothetical protein
MAP0350   0       8        0.265166   hypothetical protein
MAP0351   0       8        0.615107   hypothetical protein
MAP0352c  -1      9        0.555854   hypothetical protein
MAP0353   0       10       1.858680   glycerol kinase
MAP0354c  -1      10       2.793419   hypothetical protein
MAP0355   -1      11       2.984817   hypothetical protein
MAP0356c  -1      11       2.971995   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0348c  DHBDHDRGNASE  86     7e-22    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 86.3 bits (213), Expect = 7e-22
Identities = 67/280 (23%), Positives = 111/280 (39%), Gaps = 35/280 (12%)

Query: 16 LEGKVAIVTGTSRGVGVGIAHELLRAGATVVG--CARSPLDTIPGIEPDWTERAFQRVCD 73
+EGK+A +TG ++G+G +A L GA + L+ + A D
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 74 QGDYRGIDAFVTDVVATHGRLDILVNNAGGTVPAPHAESIPELVQRIQGSPAADDDYART 133
D ID + G +DILVN AG P E +
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEAT------------- 112

Query: 134 ALFHAFAVQMNLIGPLWFAIRTYRQMQTQDRMGCIINISSGAGHPAGSP--TLVSYGAAK 191
F+V N G + + M + R G I+ + S +PAG P ++ +Y ++K
Sbjct: 113 -----FSV--NSTGVFNASRSVSKYMMDR-RSGSIVTVGS---NPAGVPRTSMAAYASSK 161

Query: 192 SGLNHLTRSLAQEWGP-KVRVNCVALGPTMTENFRSFVLPKDDP------TGEKYFAAVP 244
+ T+ L E +R N V+ G T T+ S ++ + E + +P
Sbjct: 162 AAAVMFTKCLGLELAEYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIP 221

Query: 245 LRRAGEPAEVGRICVFLAGGQADFVNGTTIECDGGMLPGV 284
L++ +P+++ +FL GQA + + DGG GV
Sbjct: 222 LKKLAKPSDIADAVLFLVSGQAGHITMHNLCVDGGATLGV 261


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0350   DHBDHDRGNASE  92     3e-24    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 91.7 bits (227), Expect = 3e-24
Identities = 71/254 (27%), Positives = 114/254 (44%), Gaps = 13/254 (5%)

Query: 9 MRGQVAIVTGAAQGVGKGIAAALLERGAAVLLVDIQQETLEATATELRALGRV-ERLVTD 67
+ G++A +TGAAQG+G+ +A L +GA + VD E LE + L+A R E D
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 68 LRDPDSAPRIAAAAVDAFGSVHGLVNNAIATNEPKAFVDITTDDLALGYEVGPRATFLLM 127
+RD + I A G + LVN A P ++ ++ + V F
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVA-GVLRPGLIHSLSDEEWEATFSVNSTGVFNAS 124

Query: 128 QAVHPLLVKEGGGAIVNLGSGTGTGGEPRWGGYAAAKEGIRGLSKVAALEWGRDNIRVNV 187
++V ++ G+IV +GS YA++K +K LE NIR N+
Sbjct: 125 RSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184

Query: 188 VCPFAESDGVKLWKQFAPNDYAKAVGR---------VPMKRIGDVRTDVGALVAFLLSTD 238
V P ++ W +A + A+ V + +P+K++ +D+ V FL+S
Sbjct: 185 VSP-GSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAK-PSDIADAVLFLVSGQ 242

Query: 239 ATFITGQTIHVDGG 252
A IT + VDGG
Sbjct: 243 AGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP0351   HTHTETR  58     1e-12    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 57.7 bits (139), Expect = 1e-12
Identities = 21/80 (26%), Positives = 32/80 (40%)

Query: 9 RQRAQIRADIRRAAFRLFVERGYDAVTTEEIATAAGVSPRTFFRHVPAKEELLLAPVRYG 68
++ + R I A RLF ++G + + EIA AAGV+ + H K +L
Sbjct: 7 QEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELS 66

Query: 69 GAAIVHLLEGRPAGESPDVA 88
+ I L A D
Sbjct: 67 ESNIGELELEYQAKFPGDPL 86


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP0354c  cloacin  28     0.030    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 28.1 bits (62), Expect = 0.030
Identities = 17/51 (33%), Positives = 19/51 (37%), Gaps = 7/51 (13%)

Query: 56 GPLGFGPGFGPGFGPGF-------GPGFGFGPGGPRGAWRRGGPGRGKRGD 99
GP G G G G G G+ G G G G G+ G G G G
Sbjct: 23 GPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGG 73


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
MAP0356c  PERTACTIN  31     0.006    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 30.8 bits (69), Expect = 0.006
Identities = 26/80 (32%), Positives = 31/80 (38%), Gaps = 10/80 (12%)

Query: 144 GTVVVNERGPRLGPPPAMPPSLAWWASSLQLSGLSSGQAEVARQFLSRAAQLDPGLRLQM 203
G V+ G R PPPA P S+ L +G R L R L L
Sbjct: 336 GNVIETGGGARRFPPPASPLSIT----------LQAGARAQGRALLYRVLPEPVKLTLAG 385

Query: 204 AYRIAGDVVARIAPPPPGAP 223
+ GD+VA PP PGA
Sbjct: 386 GAQGQGDIVATELPPIPGAS 405



Score = 28.5 bits (63), Expect = 0.039
Identities = 16/43 (37%), Positives = 17/43 (39%)

Query: 245 PPAPWPAPGYPPAWPGSGPAPQWPAPGPANPGPPEGFSAGFTP 287
PPAP PAP P P P P P P PP+ P
Sbjct: 568 PPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAP 610


67  MAP0489c  MAP0494  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP0489c  0       7        2.893271   hypothetical protein
MAP0490   0       9        2.016869   hypothetical protein
MAP0491c  0       10       1.489999   hypothetical protein
MAP0492   0       10       1.511463   FadE34
MAP0493c  -1      8        0.175524   hypothetical protein
MAP0494   -1      10       -0.266752  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0489c  ADHESNFAMILY  106    9e-29    Adhesin family signature.
		>ADHESNFAMILY#Adhesin family signature.

Length = 309

Score = 106 bits (265), Expect = 9e-29
Identities = 54/267 (20%), Positives = 102/267 (38%), Gaps = 24/267 (8%)

Query: 21 AVLAVTGWAALTGCTGPAHPHATA---VVASTDVWGSVARAVAGGHVAVASILSGSDQDP 77
VL ++ + +G + VVA+ + + + +AG + + SI+ QDP
Sbjct: 8 LVLFLSAIILVACASGKKDTTSGQKLKVVATNSIIADITKNIAGDKIDLHSIV-PIGQDP 66

Query: 78 HSYEASPSDAAAIADAGLVVFNG----GGYDGWVDDVLAHHPGVARVDAYALL----PDD 129
H YE P D ++A L+ +NG G + W ++ + D +A+
Sbjct: 67 HEYEPLPEDVKKTSEADLIFYNGINLETGGNAWFTKLVENAKKTENKDYFAVSDGVDVIY 126

Query: 130 GRPRNE------HVFYHLGVAKAVAAAVADRLAAIDPGNAADYRRNAAAFGRDADAI-AG 182
+NE H + +L A +A +L+A DP N Y +N + D +
Sbjct: 127 LEGQNEKGKEDPHAWLNLENGIIFAKNIAKQLSAKDPNNKEFYEKNLKEYTDKLDKLDKE 186

Query: 183 IEHTIAAAHPGGSVVATEPVAF-YLLEASGLVNRTPPALEAAVENETDPAPADLARALDL 241
+ ++ T AF Y +A G+ P A + E + P + ++
Sbjct: 187 SKDKFNKIPAEKKLIVTSEGAFKYFSKAYGV----PSAYIWEINTEEEGTPEQIKTLVEK 242

Query: 242 LDRHQVSALVVNPQTSASAVNGLREAA 268
L + +V +L V + + +
Sbjct: 243 LRQTKVPSLFVESSVDDRPMKTVSQDT 269


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP0491c  HTHTETR  67     2e-15    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 66.6 bits (162), Expect = 2e-15
Identities = 32/198 (16%), Positives = 63/198 (31%), Gaps = 17/198 (8%)

Query: 61 GSEAQRERRKRILDATMAIASKGGYEAVQMRAVADRADVAVGTLYRYFPSKVHLLVSALG 120
+ +E R+ ILD + + S+ G + + +A A V G +Y +F K L
Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64

Query: 121 REFSRI-DAKTDRAAMTGGTPFQRLNFMVGKLNRAMQRNPLLTEAMTRAYVFADASAASE 179
S I + + + A G P L ++ + + + +F E
Sbjct: 65 LSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEER--RRLLMEIIFHKCEFVGE 122

Query: 180 VDQVEKLIDS-----------MFARAMADGE--PTEDQYHIARVISDVWLSNLLAWLTRR 226
+ V++ + + A ++ + WL
Sbjct: 123 MAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWL-FA 181

Query: 227 ASATDVSKRLDLAVRLLL 244
+ D+ K V +LL
Sbjct: 182 PQSFDLKKEARDYVAILL 199


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP0492   PF04183  31     0.026    IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 30.6 bits (69), Expect = 0.026
Identities = 21/79 (26%), Positives = 32/79 (40%), Gaps = 7/79 (8%)

Query: 158 WLVVDTASDGVHIEPLAATDFSRPLARVVLDSAPATVLAQTPERVEELAATVLAAEAAGI 217
WL +D + EP+ A L + VL + ATV E +++L AT+L
Sbjct: 57 WLWIDAQTLRCADEPVLAQTLLMQLKQ-VLSMSDATV----AEHMQDLYATLLGDLQLLK 111

Query: 218 TR--WSLQTAVDYAKVREQ 234
R S ++ R Q
Sbjct: 112 ARRGLSASDLINLNADRLQ 130


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP0493c  HTHTETR  57     3e-12    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 56.6 bits (136), Expect = 3e-12
Identities = 36/188 (19%), Positives = 68/188 (36%), Gaps = 12/188 (6%)

Query: 7 DARARLVAAALDLFNERGYDQTTVAEIAERAGLTKSTFFRHFPDKRDVLAA----GQDAI 62
+ R ++ AL LF+++G T++ EIA+ AG+T+ + HF DK D+ + + I
Sbjct: 11 ETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNI 70

Query: 63 AALLREGIAAAPADATPLALVCSGLKSAAAAFTPFNKELAPRLRAAIAASAELQERNALK 122
L E A P D + + + L + E+ +
Sbjct: 71 GELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQ 130

Query: 123 QIG-------LALAVSEALQARGVPEPA-AALAAELGALAVKTAYARWAEPDESGDLGDM 174
+ + + ++A+ +P AA + + W +S DL
Sbjct: 131 RNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKKE 190

Query: 175 ACQALREL 182
A + L
Sbjct: 191 ARDYVAIL 198


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0494   NUCEPIMERASE  52     1e-09    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 51.7 bits (124), Expect = 1e-09
Identities = 24/82 (29%), Positives = 39/82 (47%), Gaps = 11/82 (13%)

Query: 26 RVLVTGASGGIGSAVVKELLAAGHHVIGL---------ARSEASAATVSGLGAEPLRGDI 76
+ LVTGA+G IG V K LL AGH V+G+ + +A ++ G + + D+
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 77 ADLDVLQK--AAVDTDGVAYLA 96
AD + + A+ + V
Sbjct: 62 ADREGMTDLFASGHFERVFISP 83


68  MAP0598c  MAP0603  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP0598c  1       9        -2.397115  hypothetical protein
MAP0599c  1       8        -1.624781  short chain dehydrogenase
MAP0600c  1       9        -1.952216  hypothetical protein
MAP0601c  0       6        -0.292593  hypothetical protein
MAP0602   1       5        -0.252612  AldA_1
MAP0603   1       7        0.252486   short chain dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
MAP0598c  TCRTETOQM  30     0.022    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 30.2 bits (68), Expect = 0.022
Identities = 13/40 (32%), Positives = 19/40 (47%), Gaps = 2/40 (5%)

Query: 218 PKDKADRDML-DVLVSIKDEDGKPRFSADEITG-MFISLM 255
P R+ML D L+ I D D R+ D T + +S +
Sbjct: 352 PSKPQQREMLLDALLEISDSDPLLRYYVDSATHEIILSFL 391


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0599c  DHBDHDRGNASE  116    2e-33    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 116 bits (291), Expect = 2e-33
Identities = 66/200 (33%), Positives = 105/200 (52%)

Query: 9 ERRPAIVAGASSGIGEATAIELAAHGFPVALGARRVEKLNDIVGKINADGGEAVGFHLDV 68
E + A + GA+ GIGEA A LA+ G +A EKL +V + A+ A F DV
Sbjct: 7 EGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADV 66

Query: 69 TDPNSVKSFVAQAVDALGDIEVLVAGAGDTYFGKLAEIAGDEFESQLQIHLVGAFRLASA 128
D ++ A+ +G I++LV AG G + ++ +E+E+ ++ G F + +
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126

Query: 129 VLPGMLERQRGDLIFVGSDVALRQRPHMGAYGAAKAALVAMVNNFQMELEGTGVRASVVH 188
V M++R+ G ++ VGS+ A R M AY ++KAA V +EL +R ++V
Sbjct: 127 VSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186

Query: 189 PGPTKTAMGWSLPAEKIGPA 208
PG T+T M WSL A++ G
Sbjct: 187 PGSTETDMQWSLWADENGAE 206


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP0601c  HTHTETR  55     2e-11    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 54.6 bits (131), Expect = 2e-11
Identities = 26/198 (13%), Positives = 60/198 (30%), Gaps = 17/198 (8%)

Query: 21 PRNRRQEETFRKVLAAGIETLREKSYSDLTVRAVAARAKVAPATAYTYFSSKNHLIAEVY 80
+ +ET + +L + ++ S ++ +A A V Y +F K+ L +E++
Sbjct: 4 KTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIW 63

Query: 81 LDLVRQV-PYFTDVNDPMPTRVEQVLR-------HLALVVADEPEVSAACTTALLSGGAD 132
+ + P VLR + + G
Sbjct: 64 ELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEM 123

Query: 133 PAVRAARDRIGVEIHRRITSAMG---------PDADPTTVSALEMSFFGALVQAGSGEFS 183
V+ A+ + +E + RI + D + + + L++
Sbjct: 124 AVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQ 183

Query: 184 YREIADRLAYVVRLILTG 201
++ V ++L
Sbjct: 184 SFDLKKEARDYVAILLEM 201


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0603   DHBDHDRGNASE  126    2e-37    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 126 bits (317), Expect = 2e-37
Identities = 78/253 (30%), Positives = 114/253 (45%), Gaps = 15/253 (5%)

Query: 4 ENKVGIVTGSGGGIGQAYAEALAREGAAVVVADINAEAAEAVAKQIVADGGTAISVAVDV 63
E K+ +TG+ GIG+A A LA +GA + D N E E V + A+ A + DV
Sbjct: 7 EGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADV 66

Query: 64 SDPESAKAMADRTLAEFGGIDYLVNNAAIFGGMKLDFLLTIDPEYYKKFMSVNLDGALWC 123
D + + R E G ID LVN A + ++ + ++ E ++ SVN G
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGV---LRPGLIHSLSDEEWEATFSVNSTGVFNA 123

Query: 124 TRAVYKKMTKRGGGAIVNQSSTAA---WLYSNYYGLAKVGINGLTQQLSRELGGRNIRIN 180
+R+V K M R G+IV S A Y +K T+ L EL NIR N
Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183

Query: 181 AIAPGPIDTEANRTTTPKEMVDDIV---------KGLPLSRMGTPDDLVGMCLFLLSDEA 231
++PG +T+ + E + V G+PL ++ P D+ LFL+S +A
Sbjct: 184 IVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQA 243

Query: 232 SWITGQIFNVDGG 244
IT VDGG
Sbjct: 244 GHITMHNLCVDGG 256


69  MAP0702  MAP0712c  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP0702   -1      10       -1.400604  hypothetical protein
MAP0703   -1      9        -0.906218  hypothetical protein
MAP0704   -1      9        -0.765821  hypothetical protein
MAP0705   -1      10       -0.269061  hypothetical protein
MAP0706   -1      10       0.210533   hypothetical protein
MAP0707   -2      10       0.060135   hypothetical protein
MAP0708   -2      9        -0.696640  hypothetical protein
MAP0709   0       10       -1.281957  hypothetical protein
MAP0710c  0       10       -0.969536  short chain dehydrogenase
MAP0711c  0       10       -1.007016  hypothetical protein
MAP0712c  0       11       -0.524913  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0702   DHBDHDRGNASE  113    2e-32    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 113 bits (284), Expect = 2e-32
Identities = 77/263 (29%), Positives = 127/263 (48%), Gaps = 16/263 (6%)

Query: 5 LAGKVAIVTGGASGIGRGIVERFVAEGARVVIADIETERGERLAAELGGEAVFR---RTD 61
+ GK+A +TG A GIG + ++GA + D E+ E++ + L EA D
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 62 VSDIEQVGALVAAAVEKFGGLHVMVNNAGISSPLRRLLDDDLA--DFHRVMGVNVLGVMA 119
V D + + A + G + ++VN AG+ LR L L+ ++ VN GV
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGV---LRPGLIHSLSDEEWEATFSVNSTGVFN 122

Query: 120 GTRDAARHMADNGGGTIINLTSIGGIQAGGGVMTYRASKAAVIQFTKAAAIELARYDIRV 179
+R +++M D G+I+ + S + Y +SKAA + FTK +ELA Y+IR
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 180 NAIAPGNIPTPILGKSAGDMDPEQRERFEARIR---EGMREDRPLKREGTPDDVAEAALY 236
N ++PG+ T + D + E I+ E + PLK+ P D+A+A L+
Sbjct: 183 NIVSPGSTETDMQWSLWADENGA-----EQVIKGSLETFKTGIPLKKLAKPSDIADAVLF 237

Query: 237 FATDRSRYVTGTVLPVDGGTSAG 259
+ ++ ++T L VDGG + G
Sbjct: 238 LVSGQAGHITMHNLCVDGGATLG 260


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP0703   HTHTETR  53     9e-11    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 53.1 bits (127), Expect = 9e-11
Identities = 22/99 (22%), Positives = 44/99 (44%), Gaps = 1/99 (1%)

Query: 11 ERASSTQEAILVAAERLYAEHGMFAVSNRQVSEAAGQGNNAAVGYHFGTKADLVRAIEHK 70
+ A T++ IL A RL+++ G+ + S ++++AAG A+ +HF K+DL I
Sbjct: 7 QEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGV-TRGAIYWHFKDKSDLFSEIWEL 65

Query: 71 HRGPVEQLREQMVAELLESGAGGGRDAELRSWVACSVRP 109
+ +L + A+ R+ + +
Sbjct: 66 SESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEE 104


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0706   CHANLCOLICIN  29     0.035    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 28.9 bits (64), Expect = 0.035
Identities = 18/84 (21%), Positives = 32/84 (38%), Gaps = 3/84 (3%)

Query: 86 VYLEFDEGVAAFARTVADLIPANAVIAVDECTGAMSRAAATLFPRGAPVDAAAIVGAAKA 145
+ ++ E + A+ +AD + V+E A + L + + D AI A +
Sbjct: 359 LTEKYGEKYSKMAQELADKSKGKKIGNVNEALAAFEKYKDVLNKKFSKADRDAIFNALAS 418

Query: 146 VKTPD---ELSCIRTAIRITDEAM 166
VK D L ++IT
Sbjct: 419 VKYDDWAKHLDQFAKYLKITGHVS 442


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0709   DHBDHDRGNASE  109    8e-31    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 109 bits (273), Expect = 8e-31
Identities = 77/252 (30%), Positives = 121/252 (48%), Gaps = 19/252 (7%)

Query: 13 RVAVVTGAGAGIGRGIAAGLAAFGARVAIWERDAQTCTRAAESIGGLG-----IVTDVRD 67
++A +TGA GIG +A LA+ GA +A + + + + S+ DVRD
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 68 SGQVDAALQRTITELGTPAILVNNAGGVFSSPLLETSENGWDALYRANLRHVLLCTQRIA 127
S +D R E+G ILVN AG + + S+ W+A + N V ++ ++
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 128 RQLVSVGAGGSIISLTSIEGVRAAPGYAAYAAAKAGVINYTKTAALELAPHGIRVNAIAP 187
+ ++ GSI+++ S AAYA++KA + +TK LELA + IR N ++P
Sbjct: 129 KYMMD-RRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187

Query: 188 DITLTE----------GLEQ-LGGEAATTAMGNIVPLGRPGHVDEIASAAVFLASDMSGY 236
T T+ G EQ + G T G +PL + +IA A +FL S +G+
Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGSLETFKTG--IPLKKLAKPSDIADAVLFLVSGQAGH 245

Query: 237 LTGQTLHVDGGT 248
+T L VDGG
Sbjct: 246 ITMHNLCVDGGA 257


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0710c  DHBDHDRGNASE  84     3e-21    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 83.9 bits (207), Expect = 3e-21
Identities = 46/191 (24%), Positives = 86/191 (45%), Gaps = 1/191 (0%)

Query: 7 GFDGRAAVVTGGASGIGLATATEFARRGARLVLSDVDQPALEQAVNGLRGQGFDAHGVVC 66
G +G+ A +TG A GIG A A A +GA + D + LE+ V+ L+ + A
Sbjct: 5 GIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPA 64

Query: 67 DVRHLDEMVRLADEAFRLLGGVDVVFSNAGIVVAGPLAQMNHDDWRWVIDIDLWGSIHAV 126
DVR + + R +G +D++ + AG++ G + ++ ++W ++ G +A
Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNAS 124

Query: 127 EAFLPRLLEQGTGGHIAFTASFAGLVPNAGLGTYGVAKYGVVGLAETLAREVKPNGIGVS 186
+ ++++ G I S VP + Y +K V + L E+ I +
Sbjct: 125 RSVSKYMMDR-RSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183

Query: 187 VLCPMVVETKL 197
++ P ET +
Sbjct: 184 IVSPGSTETDM 194


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0711c  DHBDHDRGNASE  103    2e-28    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 103 bits (258), Expect = 2e-28
Identities = 81/285 (28%), Positives = 126/285 (44%), Gaps = 39/285 (13%)

Query: 5 VEGKVAFVTGAARGQGRSHAVRLAQEGADIIAVDICKPIRAGVVDTAIPASTPEDLAETA 64
+EGK+AF+TGAA+G G + A LA +GA I AVD P + V +++ A A A
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVD-YNPEKLEKVVSSLKA-----EARHA 59

Query: 65 DLVKGHNRRIVTAEVDVRDYDALKAAVDSGVEQLGRLDIIVANAGIGNGGDTLDKTSEED 124
+ DVRD A+ ++G +DI+V AG+ G + S+E+
Sbjct: 60 EAFP----------ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPG-LIHSLSDEE 108

Query: 125 WTEMIDINLAGVWKTVKAGVPHMIAGGRGGSIILTSSVGGLKAYPHTGHYVAAKHGVVGL 184
W +N GV+ ++ +M+ R GSI+ S Y ++K V
Sbjct: 109 WEATFSVNSTGVFNASRSVSKYMMD-RRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMF 167

Query: 185 MRAFGVDLGQHMIRVNSVHPTHVKTPM-----LHNEGTFKMFRPDLENPGPDDMAPICQM 239
+ G++L ++ IR N V P +T M G ++ + LE
Sbjct: 168 TKCLGLELAEYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLET------------ 215

Query: 240 FHTLPIPW---VEPIDISNAVLFFASDEARYITGVTLPIDAGSCL 281
IP +P DI++AVLF S +A +IT L +D G+ L
Sbjct: 216 -FKTGIPLKKLAKPSDIADAVLFLVSGQAGHITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP0712c  HTHTETR  55     2e-11    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 54.6 bits (131), Expect = 2e-11
Identities = 28/164 (17%), Positives = 61/164 (37%), Gaps = 5/164 (3%)

Query: 20 KTAKLRAAQRVQRFLDAAQAIIIEKGSTDFTVQEVVDRSRQSLRSFYLQFDGKHELLLAL 79
+ K A + Q LD A + ++G + ++ E+ + + + Y F K +L +
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 80 FEDALSRSADQIRAATES-HTDPLERLQVAVQLLYEASRPDPTAKRPLFTDFAPRLLVTH 138
+E + S + DPL L+ + + E++ + + + F V
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 139 PAEV----KVAHAPLLALLTELMEAAGEAGKLRTTINPKRVAAM 178
A V + + + ++ EA L + +R A +
Sbjct: 123 MAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAII 166


70  MAP0763  MAP0770c  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP0763   0       21       -2.604833  hypothetical protein
MAP0764   -1      22       -2.805418  hypothetical protein
MAP0765   0       22       -3.426079  hypothetical protein
MAP0766c  1       22       -3.818372  hypothetical protein
MAP0767c  0       20       -3.967532  hypothetical protein
MAP0768c  -1      15       -4.151326  hypothetical protein
MAP0769   -2      13       -4.579259  hypothetical protein
MAP0770c  0       13       -4.512432  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP0763   PF03544  29     0.010    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 29.2 bits (65), Expect = 0.010
Identities = 17/84 (20%), Positives = 24/84 (28%), Gaps = 6/84 (7%)

Query: 126 LQKSVDPSKILYSEPRLAPGGEGPKPGPPEIPPAVSAYTGLPGDPVGPPGAEPPARIPGA 185
P ++ EP P E PK P I P P + +
Sbjct: 64 QAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPK------PKPKPKPKPVKKVEQPKRD 117

Query: 186 AMPLPPPPSTPMPPPPPPEPGVSG 209
P+ P++P P P S
Sbjct: 118 VKPVESRPASPFENTAPARPTSST 141


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP0764   PRTACTNFAMLY  33     0.003    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 32.7 bits (74), Expect = 0.003
Identities = 25/80 (31%), Positives = 37/80 (46%)

Query: 95 GQTSLLGSMHVELNTPLGQQGSGRLQPGATIPLSRSSAYPSTEQTLSSLGAVVNGGGLGQ 154
GQ SL+G+ P Q G QP P + + P+ + ++ A VN GG+G
Sbjct: 562 GQWSLVGAKAPPAPKPAPQPGPQPPQPPQPQPEAPAPQPPAGRELSAAANAAVNTGGVGL 621

Query: 155 IGEIIHNFSAALSGREGAVR 174
+ + S ALS R G +R
Sbjct: 622 ASTLWYAESNALSKRLGELR 641



Score = 28.9 bits (64), Expect = 0.048
Identities = 23/62 (37%), Positives = 25/62 (40%), Gaps = 3/62 (4%)

Query: 365 GPGPRQIVGDPLPGPPPGAAPLPGPPPGAAPLPGPPPGAASLPDAGLGQTPPAATAPTEG 424
G G +VG P P P AP PGP P P P P A P + AA A
Sbjct: 560 GNGQWSLVGAKAP-PAPKPAPQPGPQPPQPPQPQPEAPAPQPPAG--RELSAAANAAVNT 616

Query: 425 GG 426
GG
Sbjct: 617 GG 618


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP0765   PF03544  35     8e-04    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 34.6 bits (79), Expect = 8e-04
Identities = 12/89 (13%), Positives = 18/89 (20%), Gaps = 1/89 (1%)

Query: 460 AAVPPVPS-SGPPALAPMSRMSADLPPIAPLDVPTPTELPPPPPPPPAPAAPDQVDGAAP 518
A + P + PP + P P + P E P P P P
Sbjct: 58 ADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRD 117

Query: 519 QAAPSAFAGKASKPAPSVVVAKYDPRTGR 547
+ +
Sbjct: 118 VKPVESRPASPFENTAPARPTSSTATAAT 146



Score = 33.4 bits (76), Expect = 0.002
Identities = 24/119 (20%), Positives = 34/119 (28%), Gaps = 6/119 (5%)

Query: 435 LPPGAVPRGAPAGPRGENPPPGSVGAAVPPVPSSGPPALAPMSRMSADLPPIAPLDVPTP 494
LP A P + PP +V PP P P P + P AP+ + P
Sbjct: 43 LPAPAQPISVTMVAPADLEPPQAVQP--PPEPVVEPE---PEPEPIPEPPKEAPVVIEKP 97

Query: 495 TELPPPPPPPPAPAAPDQVDGAAPQAAPSAFAGKASKPAPSVVVAKYDPRTGRYVGPDG 553
+ P P P P P + A + + PA +
Sbjct: 98 -KPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVAS 155


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP0770c  HTHTETR  59     2e-13    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 59.3 bits (143), Expect = 2e-13
Identities = 29/90 (32%), Positives = 42/90 (46%), Gaps = 3/90 (3%)

Query: 2 LDAALDLFAANGVSGTSLQMIADAVGITKAAVYHQFRTKEQIVIAVTERELGRLVPALEE 61
LD AL LF+ GVS TSL IA A G+T+ A+Y F+ K + + E + E
Sbjct: 17 LDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELE 76

Query: 62 AEAHDDG---PQARDALLVRVIEMAVRDRR 88
+A G R+ L+ + +RR
Sbjct: 77 YQAKFPGDPLSVLREILIHVLESTVTEERR 106


71  MAP1039  MAP1050c  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP1039   0       14       0.754456   hypothetical protein
MAP1040c  0       17       0.841706   FadD9
MAP1041c  -1      16       1.852788   4-aminobutyrate aminotransferase
MAP1042   1       15       1.216032   preprotein translocase subunit YajC
MAP1043   1       12       0.505464   preprotein translocase subunit SecD
MAP1044   -1      10       0.446543   preprotein translocase subunit SecF
MAP1045   0       9        0.086211   hypothetical protein
MAP1046   0       8        -0.693315  adenine phosphoribosyltransferase
MAP1047   -1      7        -0.801421  RelA
MAP1048c  1       10       -0.000423  hypothetical protein
MAP1049c  0       9        1.172938   hypothetical protein
MAP1050c  2       9        0.695962   PpiB
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP1039   PF05616  29     0.022    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 29.3 bits (65), Expect = 0.022
Identities = 17/47 (36%), Positives = 25/47 (53%), Gaps = 8/47 (17%)

Query: 248 DVQPLASDAPAA---PAVEPYRDPAYDPADVSSGALTKPRTSADPEP 291
D+ P +++AP A P V P +PA +PA + P T +PEP
Sbjct: 314 DLTPGSAEAPNAQPLPEVSPAENPANNPAPNEN-----PGTRPNPEP 355


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1040c  NUCEPIMERASE  34     0.003    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 34.0 bits (78), Expect = 0.003
Identities = 47/243 (19%), Positives = 77/243 (31%), Gaps = 59/243 (24%)

Query: 776 TVLLTGATGFLGRYLALEWLERMDMV-------DGKVIALVRARSDEEARARLDKTFDSG 828
L+TGA GF+G +++ LE V D ++L +AR E A+ F
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARL--ELLAQPGFQFHKI 59

Query: 829 D----PKLLAHYQQLAADHLEVIAGDKGEANLGLGQDVWQRLADTVDVIVDPAALVNHVL 884
D + + + + + R + + +P A +
Sbjct: 60 DLADREGMTDLFASGHFERVFISPHRLA-----------VRYS-----LENPHAYAD--- 100

Query: 885 PYSELFGPNALGTAELIRLALTSKQKPYTYVSTIGVGDQIEPGKFVENADIRQMSATRAI 944
N G ++ +K + Y S+ V F + +
Sbjct: 101 -------SNLTGFLNILEGCRHNKIQHLLYASSSSVYGLNRKMPFSTDDSVDHP------ 147

Query: 945 NDSYANGYGNSKWAGEVLLREAHDLCGLPVAVFRCDMILADTTYAGQLNLPDM----FTR 1000
+ Y +K A E++ L GLP R T Y G PDM FT+
Sbjct: 148 ----VSLYAATKKANELMAHTYSHLYGLPATGLR-----FFTVY-GPWGRPDMALFKFTK 197

Query: 1001 LML 1003
ML
Sbjct: 198 AML 200


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1043   SECFTRNLCASE  63     7e-13    Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 62.9 bits (153), Expect = 7e-13
Identities = 40/215 (18%), Positives = 78/215 (36%), Gaps = 22/215 (10%)

Query: 367 RTQISGGDPPFTAATAKQLAN----VLKYGSLPLSFQSSEAQTVSATLGLTSLRAGLIAG 422
+ Q G A ++L N L L S E +V + + + +
Sbjct: 102 QMQEDGQGAEGQGAQGQELVNKVETALTAVDPALKITSFE--SVGPKVSGELVWTAVWSL 159

Query: 423 AIGLLLVLLYSLLYYRVLGLLTALSLAASGAMVFAILILLGRYI--NYTLDLAGIAGLII 480
++++ Y + + +L A A+V +L+ +G + DL +A L+
Sbjct: 160 LAATVVIMFYIWVRFEWQ-----FALGAVVALVHDVLLTVGLFAVLQLKFDLTTVAALLT 214

Query: 481 GIGTTADSFVVFFERIKDEIR--EGRTFRSAVPRGWTRARKTIVSGNAVTFLAAAVLYFL 538
G + + VV F+R+++ + + R + V T LA +
Sbjct: 215 ITGYSINDTVVVFDRLRENLIKYKTMPLRDVMNLSVNETLSRTVMTGMTTLLALVPMLIW 274

Query: 539 AIGQVKGFAFTLGLTTILDLVVVFLVTWPLVYLAS 573
++GF F + + VF T+ VY+A
Sbjct: 275 GGDVIRGFVFAM-------VWGVFTGTYSSVYVAK 302



Score = 36.0 bits (83), Expect = 4e-04
Identities = 18/99 (18%), Positives = 35/99 (35%), Gaps = 7/99 (7%)

Query: 14 LSVFLLLLVGVYLLVFLTGDKRAAPKLGIDLQGGTRVTLTARTPDGSAPSREALAQAQQI 73
+++++ +L + G GID +GGT + + T R AL + ++
Sbjct: 24 FGAAIVMMIASVILPLVIG-----LNFGIDFKGGTTIRTESTTAIDVGVYRAAL-EPLEL 77

Query: 74 ISARVNGLGVSGSEVVVDGDNLIITVPGNDGNEARNLGQ 112
++ + S +I DG A G
Sbjct: 78 GDVIISEVR-DPSFREDQHVAMIRIQMQEDGQGAEGQGA 115


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1044   SECFTRNLCASE  248    1e-81    Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 248 bits (636), Expect = 1e-81
Identities = 66/310 (21%), Positives = 139/310 (44%), Gaps = 16/310 (5%)

Query: 52 FEVVGRRRLWFGISGAIVAVAILSIALRGFTFGIDFNGGTTVSMPAAGT------HGTVH 105
F+ + FG + ++ +++ + G FGIDF GGTT+ + +
Sbjct: 14 FDFFRWQWATFGAAIVMMIASVILPLVIGLNFGIDFKGGTTIRTESTTAIDVGVYRAALE 73

Query: 106 TAQVQEVFRKTLGTDPESVVVVGNGAAATVQIRSETLTNDQTTKLRNALFDAFGPKGADG 165
++ +V + S + A +Q++ + + L +
Sbjct: 74 PLELGDVIISEVRD--PSFREDQHVAMIRIQMQEDGQGAEGQGAQGQELVNKVETALTAV 131

Query: 166 KPSKQAISDSQVSETWGDQITDKALIALVVFLALVGLYITVRYERYMTLAAIAAMFFDLT 225
P+ + S V ++ A+ +L+ ++ YI VR+E L A+ A+ D+
Sbjct: 132 DPALKITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVVALVHDVL 191

Query: 226 VTAGVYSLVGFEVTPATVIGLLTILGFSLYDTVIVFDKVEENTRDFQHTNRRTFAEQANL 285
+T G+++++ + TV LLTI G+S+ DTV+VFD++ EN ++ + NL
Sbjct: 192 LTVGLFAVLQLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTM---PLRDVMNL 248

Query: 286 AINQTFMRSINTSLIGVLPVLALMVVAVWLLGVGTLKDLALVQLVGILVGTYSSIFVATP 345
++N+T R++ T + +L ++ +++ G ++ + G+ GTYSS++VA
Sbjct: 249 SVNETLSRTVMTGMTTLLALVPMLI-----WGGDVIRGFVFAMVWGVFTGTYSSVYVAKN 303

Query: 346 LLVTLRERTE 355
+++ +
Sbjct: 304 IVLFIGLDRN 313


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1049c  YERSSTKINASE  34     0.001    Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 34.3 bits (78), Expect = 0.001
Identities = 23/82 (28%), Positives = 37/82 (45%), Gaps = 8/82 (9%)

Query: 128 RAGLIHRDVKPMNVLVTTARDFVYLIDFGLARAQADTALTQTGATMGTVAYMAPERFTGT 187
+AG++H D+KP NV+ A +ID GL + T ++ APE G
Sbjct: 263 KAGVVHNDIKPGNVVFDRASGEPVVIDLGLHSRSGEQ------PKGFTESFKAPELGVGN 316

Query: 188 --TDHRADVYSLACVLHECLTG 207
++DV+ + L C+ G
Sbjct: 317 LGASEKSDVFLVVSTLLHCIEG 338


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1050c  SURFACELAYER  28     0.046    Lactobacillus surface layer protein signature.
		>SURFACELAYER#Lactobacillus surface layer protein signature.

Length = 439

Score = 28.1 bits (62), Expect = 0.046
Identities = 20/80 (25%), Positives = 32/80 (40%), Gaps = 10/80 (12%)

Query: 45 RTRRILLIAAGSIVAVAVIAAVVVTVLNNRNEHKASTAAPSTSTSAPETTTPGQPSQVPP 104
+ RI+ AA +++AVA IAA + V + A+T ++ +A V
Sbjct: 3 KNLRIVSAAAAALLAVAPIAATAMPV------NAATTINADSAINANTNA----KYDVDV 52

Query: 105 LPPFKPSADVGANCQYPPSP 124
P A V + P P
Sbjct: 53 TPSISAIAAVAKSDTMPAIP 72


72  MAP1198  MAP1212c  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP1198   -1      13       -0.496354  hypothetical protein
MAP1199   -1      13       -0.912865  hypothetical protein
MAP1200c  -1      13       -0.907321  hypothetical protein
MAP1201c  -2      12       -0.565451  aconitate hydratase
MAP1202   0       10       0.467763   hypothetical protein
MAP1203   0       9        0.567420   hypothetical protein
MAP1204   0       9        0.326706   hypothetical protein
MAP1205   1       9        -0.099670  MoxR
MAP1206   0       8        0.324632   hypothetical protein
MAP1207   0       9        -0.001582  hypothetical protein
MAP1208   -1      11       0.388361   hypothetical protein
MAP1209   -2      8        0.304118   FabG1
MAP1210   -2      10       0.378699   enoyl-ACP reductase
MAP1211   0       12       -0.115680  ferrochelatase
MAP1212c  0       18       -2.584774  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1198   BCTERIALGSPC  30     0.027    Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C signature.

Length = 272

Score = 29.5 bits (66), Expect = 0.027
Identities = 15/53 (28%), Positives = 25/53 (47%), Gaps = 2/53 (3%)

Query: 82 ARDRVLSARGLDVLLTDLEKQQALMAEVADDDARDRAIRRYGQLEERFVALGG 134
D ++ GLD L D E+ + M +AD + R GQ ++ ++ GG
Sbjct: 220 DNDMAVALNGLD--LRDAEQAKKAMERMADVHNFTLTVERDGQRQDIYMEFGG 270


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP1200c  HTHTETR  67     4e-16    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 67.0 bits (163), Expect = 4e-16
Identities = 33/177 (18%), Positives = 62/177 (35%), Gaps = 14/177 (7%)

Query: 1 MPKVSEDHLAARRRQILDGARRCFAEFGYDKATVRRLEQAIGMSRGAIFHHFRDKDALFF 60
M + ++ R+ ILD A R F++ G ++ + +A G++RGAI+ HF+DK LF
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 61 ALAHEDAERMAEVAS--------------RAGLIQVMRDMLAAPDQFDWLATRLEIARKL 106
+ + E+ R LI V+ + + + +
Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120

Query: 107 RNDPAFSRGWAERSAELAAATTDRLRRQTEAHRVRDDVPGDVLQCYLELVLDGLVAR 163
+ E L+ EA + D+ + + GL+
Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMEN 177


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
MAP1203   GPOSANCHOR  39     3e-05    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 39.3 bits (91), Expect = 3e-05
Identities = 36/214 (16%), Positives = 66/214 (30%), Gaps = 24/214 (11%)

Query: 45 NLAALIANVAKANQRLQNLSAEIQTEQESVNKALVDVE----TARDNAAAAQHDLEASQQ 100
+ + L+ A ++ Q + KAL + + A
Sbjct: 166 GAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAA 225

Query: 101 AVKDANAAIAGAQRRFDTFAAAAYMNGPSNSYLTARNPD----------DIIATEAAAKT 150
D A+ GA +A + L AR + A A KT
Sbjct: 226 RKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKT 285

Query: 151 LTASSQTVMAKLQQARTE----QVNRESAAR-LAKQNADKAAADAKASQ-----DAAVAA 200
L A + A+ + NR+S R L K +A+ + + A+
Sbjct: 286 LEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEAS 345

Query: 201 LTNSRQKFDEQREQMIRLATERDQAQAKLEAAKR 234
+ R+ D RE +L E + + + + ++
Sbjct: 346 RQSLRRDLDASREAKKQLEAEHQKLEEQNKISEA 379



Score = 38.9 bits (90), Expect = 3e-05
Identities = 34/205 (16%), Positives = 65/205 (31%), Gaps = 37/205 (18%)

Query: 37 AQADPVADNLAALIANVAKANQRLQNLSAEIQTEQESVNKALVDVETARDNAAAAQHDLE 96
+ + + L A A R L ++ ++T AA + +
Sbjct: 239 NFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKA 298

Query: 97 ASQQAVKDANAAIAGAQRRFDTFAAAAYMNGPSNSYLTARNPDDIIATEAAAKTLTASSQ 156
+ + NA +R D A+ A K L A Q
Sbjct: 299 DLEHQSQVLNANRQSLRRDLD-------------------------ASREAKKQLEAEHQ 333

Query: 157 TVMAKLQQARTEQVNRESAARLAKQNADKAAADAKASQDAAVAALTNSRQKFDEQREQMI 216
+ + + + A+R + + A+ +AK +A L + + R+ +
Sbjct: 334 KLEEQNKISE--------ASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLR 385

Query: 217 R-LATERD---QAQAKLEAAKRQLS 237
R L R+ Q + LE A +L+
Sbjct: 386 RDLDASREAKKQVEKALEEANSKLA 410


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
MAP1205   HTHFIS  34     6e-04    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 34.4 bits (79), Expect = 6e-04
Identities = 32/162 (19%), Positives = 60/162 (37%), Gaps = 21/162 (12%)

Query: 41 AEVQTLERAIFEVKRIIVGQD----QLVERMLVGLLSKGHVLLEGVPGVAKTL---AVET 93
LE + ++ G+ ++ + + + +++ G G K L A+
Sbjct: 124 RRPSKLEDDSQDGMPLV-GRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHD 182

Query: 94 FAKVVGGTFARIQ---FTPDLVPTDIIG------TRIYRQGKEEFDTELGPVVVNFLLAD 144
+ K G F I DL+ +++ G T + F+ G L D
Sbjct: 183 YGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGT----LFLD 238

Query: 145 EINRAPAKVQSALLEVMQERHVSIGGKTFPLPNPFLVMATQN 186
EI P Q+ LL V+Q+ + G P+ + ++A N
Sbjct: 239 EIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATN 280


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1209   DHBDHDRGNASE  112    6e-32    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 112 bits (280), Expect = 6e-32
Identities = 65/252 (25%), Positives = 122/252 (48%), Gaps = 21/252 (8%)

Query: 24 RSVLVTGGNRGIGLAIAQRLAADGHKVAVTHRGSGAPDGLFGVE-----------CDVTD 72
+ +TG +GIG A+A+ LA+ G +A + + DV D
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 73 NDAVDRAFTEVEEHQGPVEVLVSNAGISKDAFLIRMTEERFTEVINANLTGAFRVTQRAA 132
+ A+D +E GP+++LV+ AG+ + + +++E + + N TG F ++ +
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 133 RSMQKKRFGRIIYIGSVSGMWGIGNQSNYAAAKAGLIGMARSISRELSKAGVTANVVAPG 192
+ M +R G I+ +GS + + YA++KA + + + EL++ + N+V+PG
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 193 YIDTEMTRAL-------DERIQAGALEF---IPAKRVGTAAEVAGAVSFLASEDASYIAG 242
+T+M +L ++ I+ F IP K++ +++A AV FL S A +I
Sbjct: 189 STETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHITM 248

Query: 243 AVIPVDGGMGMG 254
+ VDGG +G
Sbjct: 249 HNLCVDGGATLG 260


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1210   DHBDHDRGNASE  57     9e-12    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 57.0 bits (137), Expect = 9e-12
Identities = 53/274 (19%), Positives = 102/274 (37%), Gaps = 34/274 (12%)

Query: 5 LDGKRILVTGIITDSSIAFHIAKVAQEAGAQLVLTGFDRLRLIQRIVD-------RLPEK 57
++GK +TG I +A+ GA + D V R E
Sbjct: 6 IEGKIAFITG--AAQGIGEAVARTLASQGAHI--AAVDYNPEKLEKVVSSLKAEARHAEA 61

Query: 58 APLIELDVQNEEHLNTLAQRVTAEIGEGNRLDGVVHSIGFMPQTGMGINPFFDAPYEDVS 117
P DV++ ++ + R+ E+G +D +V+ G + E+
Sbjct: 62 FPA---DVRDSAAIDEITARIEREMG---PIDILVNVAGVLR-----PGLIHSLSDEEWE 110

Query: 118 KGIHISAYSYASLAKALLPIM--NPGGSIVGMDFD----PSRAMPAYNWMTVAKSALESV 171
+++ + ++++ M GSIV + + P +M AY +K+A
Sbjct: 111 ATFSVNSTGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYA---SSKAAAVMF 167

Query: 172 NRFVAREAGPHGVRSNLVAAGPIRTLAMAGIVGGVLGDQAAEQIRLLEEGWDQRAPIGWN 231
+ + E + +R N+V+ G T + + A + I+ E + P+
Sbjct: 168 TKCLGLELAEYNIRCNIVSPGSTETDMQWSL--WADENGAEQVIKGSLETFKTGIPLK-K 224

Query: 232 MKDPTPVAKTVCALLSDWLPATTGTIIYADGGAS 265
+ P+ +A V L+S T + DGGA+
Sbjct: 225 LAKPSDIADAVLFLVSGQAGHITMHNLCVDGGAT 258


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
MAP1212c  HTHFIS  28     0.044    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 28.3 bits (63), Expect = 0.044
Identities = 33/142 (23%), Positives = 50/142 (35%), Gaps = 15/142 (10%)

Query: 185 LGDAEYTLRSAVRSAAETLGAIGLGS---AASDVDDPRGLVEQLLESARQHRIPDHAPSR 241
L A Y +R +AA I G +DV P LL ++ R PD
Sbjct: 23 LSRAGYDVR-ITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLPRIKKAR-PD----- 75

Query: 242 ALRVLENAAHVDAIIAVSAGLSRSDS--PDRFAAPIAAGLE--PLGTQSSSEARIAGDAL 297
L VL +A + A+ A + P F G+ L +++ D+
Sbjct: 76 -LPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRRPSKLEDDSQ 134

Query: 298 RPLTAVVRSARMAAVTAILHSA 319
+ V RSA M + +L
Sbjct: 135 DGMPLVGRSAAMQEIYRVLARL 156


73  MAP1217c  MAP1223c  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias   Product
MAP1217c  1       16       0.678278  hypothetical protein
MAP1218c  2       14       2.913545  hypothetical protein
MAP1219c  3       13       3.680286  hypothetical protein
MAP1220c  1       10       3.412807  hypothetical protein
MAP1221   -2      9        1.793783  hypothetical protein
MAP1222   -1      9        1.679059  hypothetical protein
MAP1223c  -1      9        0.994093  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1217c  PRPHPHLPASEC  32     0.003    Prokaryotic zinc-dependent phospholipase C signature.
		>PRPHPHLPASEC#Prokaryotic zinc-dependent phospholipase C signature.

Length = 398

Score = 31.5 bits (71), Expect = 0.003
Identities = 15/69 (21%), Positives = 24/69 (34%), Gaps = 3/69 (4%)

Query: 35 YEAFTETWKDSLSIGWQGNGAEALRSRTYADKVKVSDMVDQLHEAAKVARAGATDLSAAR 94
+E F E K+ I G YAD +K D E A+ + +
Sbjct: 179 FETFAEERKEQYKINTAGCKTN---EDFYADILKNKDFNAWSKEYARGFAKTGKSIYYSH 235

Query: 95 SRMRNAVVD 103
+ M ++ D
Sbjct: 236 ASMSHSWDD 244


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
MAP1220c  PERTACTIN  33     1e-04    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 33.2 bits (75), Expect = 1e-04
Identities = 18/45 (40%), Positives = 19/45 (42%)

Query: 65 PPGPPPMWAAPPMPPPPLHGPPGPPHGFPPPPPGEAFPTLPPPSP 109
PP P P P P P PP PP PP P + P P P P
Sbjct: 568 PPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQP 612



Score = 30.1 bits (67), Expect = 0.001
Identities = 20/56 (35%), Positives = 20/56 (35%)

Query: 42 GAVGTVAAIAFAWIISVGPPGPPPPGPPPMWAAPPMPPPPLHGPPGPPHGFPPPPP 97
G V A A PGP P PP PP PP P P P P PP
Sbjct: 558 GQWSLVGAKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPP 613



Score = 26.6 bits (58), Expect = 0.023
Identities = 18/51 (35%), Positives = 19/51 (37%), Gaps = 5/51 (9%)

Query: 60 PPGPPPPGPPPMWAAPPMPPPPLHGPPGPPHGFPPPPPGEAFPTLPPPSPR 110
P P PGP P PP PP PP PP P PP+ R
Sbjct: 571 PKPAPQPGPQ-----PGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAGR 616



Score = 26.2 bits (57), Expect = 0.036
Identities = 14/37 (37%), Positives = 14/37 (37%)

Query: 74 APPMPPPPLHGPPGPPHGFPPPPPGEAFPTLPPPSPR 110
APP P P P P P PP P P P R
Sbjct: 567 APPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQR 603


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
MAP1221   HTHFIS  81     6e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 80.6 bits (199), Expect = 6e-20
Identities = 36/125 (28%), Positives = 66/125 (52%), Gaps = 1/125 (0%)

Query: 2 LVVEDSETIREMVSEALTEVGYHTEARRDGERLEELLDGIRPDLVVLDVMLPGRDGFALI 61
LV +D IR ++++AL+ GY + L + DLVV DV++P + F L+
Sbjct: 7 LVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLL 66

Query: 62 DVIRDWG-DIGIVLITARDGLPDRLRGLDGGADDYVIKPFELAELVSRVGAVLRRRGRLP 120
I+ D+ +++++A++ ++ + GA DY+ KPF+L EL+ +G L R P
Sbjct: 67 PRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRRP 126

Query: 121 QVIQV 125
++
Sbjct: 127 SKLED 131


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
MAP1222   TONBPROTEIN  39     2e-05    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 38.8 bits (90), Expect = 2e-05
Identities = 14/36 (38%), Positives = 15/36 (41%)

Query: 97 AEGPVEPPPPYPPPLPPPAYPPPYPPPYPPPPGPPP 132
+EPP PP P P P P P P PP P
Sbjct: 51 TPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAP 86



Score = 37.3 bits (86), Expect = 8e-05
Identities = 10/39 (25%), Positives = 11/39 (28%)

Query: 94 PDLAEGPVEPPPPYPPPLPPPAYPPPYPPPYPPPPGPPP 132
+ P P P P P P P P P P
Sbjct: 59 QAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPK 97



Score = 36.1 bits (83), Expect = 2e-04
Identities = 15/51 (29%), Positives = 15/51 (29%), Gaps = 1/51 (1%)

Query: 94 PDLAEGPVEPPPPYPPPLPPPAYPPPYPPPYP-PPPGPPPDTTATAVVHPL 143
P A P P P P P P PP P P P P V
Sbjct: 57 PPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQ 107



Score = 36.1 bits (83), Expect = 2e-04
Identities = 16/49 (32%), Positives = 17/49 (34%), Gaps = 1/49 (2%)

Query: 98 EGPVEPPPPYPPPLPPPAYPPPYPPPYP-PPPGPPPDTTATAVVHPLPD 145
PV P P P P+P P P P P P P P P D
Sbjct: 65 PEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRD 113



Score = 36.1 bits (83), Expect = 2e-04
Identities = 12/41 (29%), Positives = 15/41 (36%)

Query: 92 ISPDLAEGPVEPPPPYPPPLPPPAYPPPYPPPYPPPPGPPP 132
++P E P PP P + P P P P P P
Sbjct: 50 VTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIE 90



Score = 32.3 bits (73), Expect = 0.004
Identities = 14/54 (25%), Positives = 17/54 (31%)

Query: 78 LVVTADGQAYGDRAISPDLAEGPVEPPPPYPPPLPPPAYPPPYPPPYPPPPGPP 131
+V AD + P+ P P P P P P P P P P
Sbjct: 49 MVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKP 102



Score = 31.9 bits (72), Expect = 0.004
Identities = 12/46 (26%), Positives = 13/46 (28%)

Query: 98 EGPVEPPPPYPPPLPPPAYPPPYPPPYPPPPGPPPDTTATAVVHPL 143
E EP P P P P P P P V P+
Sbjct: 72 EPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPV 117


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
MAP1223c  V8PROTEASE  41     3e-06    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 40.8 bits (95), Expect = 3e-06
Identities = 31/194 (15%), Positives = 62/194 (31%), Gaps = 17/194 (8%)

Query: 36 TPAQQPTPPPVLAPIDLPAA---SAIGPGAGI-YVDYTDGSGGMGCTAGFLVHTSSGQAG 91
P +Q V+ P + + G A + Y+ +G + G +V G+
Sbjct: 59 KPLEQREHANVILPNNDRHQITDTTNGHYAPVTYIQVEAPTGTFIAS-GVVV----GKDT 113

Query: 92 ILTAGHCNRP--GEPSKVTMNLGGVLPYATLGTFSQTISEGVHDEQHDIGLIILDGDNVP 149
+LT H G+P + + + + D+ ++ +
Sbjct: 114 LLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGEGDLAIVKFSPNEQN 173

Query: 150 QSPAIAASVPVSGVAANLQVGQQLCKFGMGSGADAC------GQIVEITGSKVKFLAGGQ 203
+ A QV Q + G G+I + G +++
Sbjct: 174 KHIGEVVKPATMSNNAETQVNQNITVTGYPGDKPVATMWESKGKITYLKGEAMQYDLSTT 233

Query: 204 CGDSGGPVYRYEND 217
G+SG PV+ +N+
Sbjct: 234 GGNSGSPVFNEKNE 247


74  MAP1331c  MAP1336  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP1331c  -2      9        0.147813   hypothetical protein
MAP1332   -1      8        -0.256495  hypothetical protein
MAP1333   -1      9        -0.985805  hypothetical protein
MAP1334   1       8        -0.700580  hypothetical protein
MAP1335   0       8        -0.471461  excinuclease ABC subunit B
MAP1336   -1      7        -0.468718  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP1331c  PF03544  33     0.005    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 32.6 bits (74), Expect = 0.005
Identities = 19/121 (15%), Positives = 33/121 (27%), Gaps = 10/121 (8%)

Query: 110 HPGRPRPSHDSGPPTLTWTAITEQTNPTRPTRRPVPPRRPARPP---STPPEERTVIVPG 166
P P+ P ++T A + P P P P P PP+E V++
Sbjct: 40 VIELPAPAQ---PISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEK 96

Query: 167 SPARSRVAPASAPAGPPRTSPTPATEFTVIDAKTADVSNLATRFVKLLSPRASSAAGTAG 226
+ + P E + N A + A+++
Sbjct: 97 PKPKPKPKPKPVKKVEQPKRDVKPVE----SRPASPFENTAPARPTSSTATAATSKPVTS 152

Query: 227 A 227

Sbjct: 153 V 153



Score = 30.3 bits (68), Expect = 0.026
Identities = 25/140 (17%), Positives = 40/140 (28%), Gaps = 5/140 (3%)

Query: 104 QQGHAGHPGRPRPSHDSGPPTLTWTAITEQTNPTRPTRRPVPPRRPARPPSTPPEERTVI 163
+ A P P P + P ++ +P P +P +R
Sbjct: 61 EPPQAVQP-PPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKR--- 116

Query: 164 VPGSPARSRVAPASAPAGPPRTSPTPATEFTVIDAKTADVSNLATRFVKLLSPRASSAAG 223
P SR A P R + + AT T + A + P + A
Sbjct: 117 -DVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALR 175

Query: 224 TAGAMTIGRAADNDIVVADV 243
G + + D V +V
Sbjct: 176 IEGQVKVKFDVTPDGRVDNV 195


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1332   YERSSTKINASE  32     0.010    Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 31.6 bits (71), Expect = 0.010
Identities = 48/213 (22%), Positives = 87/213 (40%), Gaps = 46/213 (21%)

Query: 69 HPHIVGVH--------DRGEFNGHLWISMDYVEG---TDASRLVKESYPDGMPLDE---- 113
HP++ VH +R E + MD V+G +D R + +S+ G E
Sbjct: 190 HPNLANVHGMAVVPYGNRKEEA----LLMDEVDGWRCSDTLRTLADSWKQGKINSEAYWG 245

Query: 114 -VSAIVQAVAGALDYAHARGLLHRDVKPANILLTHPEAGERRILLADFGVARHLGD-ISG 171
+ I + ++ G++H D+KP N++ +GE ++ D G+ G+ G
Sbjct: 246 TIKFIAHRLLDVTNHLAKAGVVHNDIKPGNVVFDRA-SGEPVVI--DLGLHSRSGEQPKG 302

Query: 172 ITETNVAVGTVAYAAPEQLTGS-PIDGRADQYALAATAFHLLTGAPPFQHSNPIAVIGQH 230
TE+ + APE G+ ++D + + +T H + G NP Q
Sbjct: 303 FTES--------FKAPELGVGNLGASEKSDVFLVVSTLLHCIEGF----EKNPEIKPNQG 350

Query: 231 LH---EDPPRLSD------FRPELAGLDEVFCQ 254
L +P + D RP +AG++ + +
Sbjct: 351 LRFITSEPAHVMDENGYPIHRPGIAGVETAYTR 383


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1335   FLGMOTORFLIM  33     0.003    Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 33.3 bits (76), Expect = 0.003
Identities = 22/100 (22%), Positives = 36/100 (36%), Gaps = 8/100 (8%)

Query: 206 VQYTRNDLS-FTRGSFRVRGDTVEIIPSYEELAVRIEFFGDEIEALYYLHPLTGDVIRQV 264
+ T LS R V +V+ + YEE I A+ + PL G+ + +V
Sbjct: 63 ARLTTTSLSAQLRSMVHVHVASVDQLT-YEEFIRSIPTPS--TLAVITMDPLKGNAVLEV 119

Query: 265 DSLRIFPATHYVAGPERMAHAI----STIEQELAERLAEL 300
D F + G A + + IE + E +
Sbjct: 120 DPSITFSIIDRLFGGTGQAAKVQRDLTDIENSVMEGVIVR 159


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP1336   TCRTETB  36     2e-04    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 36.0 bits (83), Expect = 2e-04
Identities = 48/311 (15%), Positives = 108/311 (34%), Gaps = 21/311 (6%)

Query: 1 MQGVAGGLLAGLGYAVINSALPRWLWTRGSALVSAMWGVATVVGPATGGLFAQLGIWRWA 60
+QG L V+ +P+ + L+ ++ + VGPA GG+ A W +
Sbjct: 112 IQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYL 171

Query: 61 FVVM--AVLTALMALLVPVALARVDPAPAIPRMKVPVWSLLIIGVAALAVSVAQIPHNTA 118
++ ++T + + R+ I + L+ +G+ + T+
Sbjct: 172 LLIPMITIITVPFLMKLLKKEVRIKGHFDIKGI-----ILMSVGIVFFMLFT------TS 220

Query: 119 ATFGLLAAGIMLVGLFVIVDWRMHAAILPPSVFSPGP-LKWIYLTMGVLMAAAMVNTYVP 177
+ L ++ +FV ++ + P + P + + + A + VP
Sbjct: 221 YSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVP 280

Query: 178 LFGQRLAHLTPIAAGFLGAALALGWTVSEIVSASLENPRTVGRVVMVAPLVAASGLALGA 237
+ + L+ G+ + T+S I+ + V R + V G+ +
Sbjct: 281 YMMKDVHQLSTAEI---GSVIIFPGTMSVIIFGYIGG-ILVDRRGPL--YVLNIGVTFLS 334

Query: 238 VARHGDGSAWTAALWAVALLVAGTGIGMAWPHLSARAMASVN-DPAEGGAASAAINTVQL 296
V+ W + +++ G+++ + S + E GA + +N
Sbjct: 335 VSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSF 394

Query: 297 TSAAIGAGLAG 307
S G + G
Sbjct: 395 LSEGTGIAIVG 405


75  MAP1377  MAP1383c  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP1377   -1      17       -3.405428  hypothetical protein
MAP1378   -1      14       -1.910523  hypothetical protein
MAP1379   -1      13       0.024569   hypothetical protein
MAP1380   -1      12       0.688051   hypothetical protein
MAP1381   -2      12       1.379699   hypothetical protein
MAP1382c  -1      12       2.034717   hypothetical protein
MAP1383c  -1      12       2.030028   short chain dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1377   DHBDHDRGNASE  70     2e-16    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 70.5 bits (172), Expect = 2e-16
Identities = 63/263 (23%), Positives = 114/263 (43%), Gaps = 12/263 (4%)

Query: 4 LSGKKAVVTGAGGDGLGQAIANRLGGLGADIALIGRTLEKVQRRGREVEERWGVKTVAIS 63
+ GK A +TGA G+G+A+A L GA IA + EK + + A
Sbjct: 6 IEGKIAFITGAA-QGIGEAVARTLASQGAHIAAVDYNPEK-LEKVVSSLKAEARHAEAFP 63

Query: 64 ADMSDWDQVHNAVREAHWQLGGLDIMVNNPVMVAGGLFETQTKEQIDFTVLGSLSMMMYG 123
AD+ D + ++G +DI+VN ++ GL + + E+ + T S+ G
Sbjct: 64 ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEAT----FSVNSTG 119

Query: 124 AHAALQ----FLLPQGSGKIINIGSVGGRIQQRGLVVYNACKAGVIGFTRNLAHEVALRG 179
A + +++ + SG I+ +GS + + + Y + KA + FT+ L E+A
Sbjct: 120 VFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYN 179

Query: 180 VNVLGVAPGIMLNPQLKQYVLDPQDDQERAGRAAIIEAITQQVQLGRASLPEEAANMVAF 239
+ V+PG Q+ L ++ +E + L + + P + A+ V F
Sbjct: 180 IRCNIVSPGSTETDM--QWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLF 237

Query: 240 LATEAADYLCGQTIDVAGGQWMG 262
L + A ++ + V GG +G
Sbjct: 238 LVSGQAGHITMHNLCVDGGATLG 260


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1380   DHBDHDRGNASE  110    3e-31    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 110 bits (277), Expect = 3e-31
Identities = 80/264 (30%), Positives = 115/264 (43%), Gaps = 14/264 (5%)

Query: 3 DNKVALITGAARGQGRAHAVRLSAGGADIIAVDIAGRLPESVPYESPTRDDLAETARLVE 62
+ K+A ITGAA+G G A A L++ GA I AVD E V ++
Sbjct: 7 EGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKV-------------VSSLK 53

Query: 63 ANGRRAITAAIDVRDAEKLSAAVGHAVAELGRLDIIVANAGICCPAPWDQITGQAFRDTI 122
A R A DVRD+ + E+G +DI+V AG+ P ++ + + T
Sbjct: 54 AEARHAEAFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATF 113

Query: 123 DTNVIGTWNTVMAGAHHIIAGRRGGSIILIGSAAGVKMQAFMVHYTASKHAITGMARAFA 182
N G +N + RR GSI+ +GS + M Y +SK A +
Sbjct: 114 SVNSTGVFN-ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLG 172

Query: 183 AELGRYNIRVNSLHPGAVDTPMGTGRMRDALESAAATYPHLEGLHKPLLPDGIAQPEDIA 242
EL YNIR N + PG+ +T M D + LE + +A+P DIA
Sbjct: 173 LELAEYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIA 232

Query: 243 DAVAWLASDQSRFVTASGISVDLG 266
DAV +L S Q+ +T + VD G
Sbjct: 233 DAVLFLVSGQAGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1381   DHBDHDRGNASE  92     2e-24    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 92.4 bits (229), Expect = 2e-24
Identities = 74/266 (27%), Positives = 114/266 (42%), Gaps = 27/266 (10%)

Query: 2 RGLQGKTFIVAGGSTGIGAATAERLASEGAAVTVGDINIEGANATVGRITQSGGRAIAVE 61
+G++GK + G + GIG A A LAS+GA + D N E V + A A
Sbjct: 4 KGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP 63

Query: 62 FDLADEASVRNLVQKTIGEFGALHGLHNVGSDLSAENLGRDTTLLDTDFDVWRRTLDVNL 121
D+ D A++ + + E G + L NV L G +L D + W T VN
Sbjct: 64 ADVRDSAAIDEITARIEREMGPIDILVNVAGVL---RPGLIHSLSDEE---WEATFSVNS 117

Query: 122 LGYVRTSRAVLAHLLQQGSGSIVNTSSGGSLGTDPMHV------AYNAAKAAVNQLTRHI 175
G SR+V +++ + SGSIV ++G++P V AY ++KAA T+ +
Sbjct: 118 TGVFNASRSVSKYMMDRRSGSIV------TVGSNPAGVPRTSMAAYASSKAAAVMFTKCL 171

Query: 176 ANNWGAQGVRCNGVMPGLVMGETQKQ---QNDIQLQ------QMFLQAAKVTRLGEPRDI 226
+RCN V PG + Q + Q + F + +L +P DI
Sbjct: 172 GLELAEYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDI 231

Query: 227 AAITAFLLSDEAEWINGQVWYIGGAS 252
A FL+S +A I + G +
Sbjct: 232 ADAVLFLVSGQAGHITMHNLCVDGGA 257


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1383c  DHBDHDRGNASE  73     4e-17    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 73.2 bits (179), Expect = 4e-17
Identities = 54/176 (30%), Positives = 78/176 (44%)

Query: 6 QTVVITGASAGIGRATAKEFGRRGANVALLARGAAGLQGAARDVEAGGGKALALPTDVAD 65
+ ITGA+ GIG A A+ +GA++A + L+ ++A A A P DV D
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 66 HAAVVAAADETEAAFGPIDVWVNVAFTSVFAPFSEISAEEFKRVTEVTYLGYVHGTMAAL 125
AA+ E GPID+ VNVA +S EE++ V G + + +
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 126 AKMRPRDRGTIVQVGSALSQRSIPLQSAYCGAKHAVNGFTESVRCELLHEGSRVRI 181
M R G+IV VGS + +AY +K A FT+ + EL R I
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184


76  MAP1435  MAP1442  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP1435   0       15       -3.049472  hypothetical protein
MAP1436c  1       15       -3.103398  hypothetical protein
MAP1437c  1       15       -3.183522  hypothetical protein
MAP1438c  3       14       -2.123096  hypothetical protein
MAP1439c  3       15       -2.632815  hypothetical protein
MAP1440c  4       16       -2.185465  hypothetical protein
MAP1441   3       16       -1.207405  hypothetical protein
MAP1442   1       10       0.247596   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1435   DHBDHDRGNASE  33     8e-04    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 33.1 bits (75), Expect = 8e-04
Identities = 28/122 (22%), Positives = 48/122 (39%), Gaps = 7/122 (5%)

Query: 20 VDHRDDAAVSDLFDRVRRESGRLDLLVNNAATISDNLVSSKPF--WEKPLDLADVLDVGL 77
D RD AA+ ++ R+ RE G +D+LVN A + L+ S WE + + V
Sbjct: 64 ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSV-NSTGVFN 122

Query: 78 RSSYVASWYAAPLLVAGGRGLIAFTSSPGSVCYMHGPAYGAQKAGVDKMAADMAVDFRGT 137
S V+ + ++ S+P V AY + KA + ++
Sbjct: 123 ASRSVSKYMMD----RRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEY 178

Query: 138 GV 139
+
Sbjct: 179 NI 180


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1436c  DHBDHDRGNASE  103    7e-29    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 103 bits (259), Expect = 7e-29
Identities = 73/268 (27%), Positives = 110/268 (41%), Gaps = 33/268 (12%)

Query: 6 GRRAIVTGAGSGIGAATAARLLDEGATVVAYDISAEGLARTRAAADDAGTGKRLTTAVLD 65
G+ A +TGA GIG A A L +GA + A D + E L + ++ + D
Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSS--LKAEARHAEAFPAD 65

Query: 66 ISVEGDVIAAVDGAVADLGGLEVLVNVAAIQTCSHTHQTTLADWNRTLAVNLTGTFLMTR 125
+ + ++G +++LVNVA + H + +W T +VN TG F +R
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 126 QALPALLDSGRGVVVNFTSTAASFAHPYMAAYAASKGGILSFTHSLALEYAKQGLRAVNI 185
++D G +V S A MAAYA+SK + FT L LE A+ +R +
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 186 QPGGVSTALANSTLDKMPDGYDVGLWAKQTPLLHGKDSEILGD---------------PS 230
PG T + S LWA + +G + I G PS
Sbjct: 186 SPGSTETDMQWS------------LWADE----NGAEQVIKGSLETFKTGIPLKKLAKPS 229

Query: 231 AVASVIAMVASDDGAFITGTEIRVDGGA 258
+A + + S IT + VDGGA
Sbjct: 230 DIADAVLFLVSGQAGHITMHNLCVDGGA 257


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1439c  DHBDHDRGNASE  58     5e-12    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 57.8 bits (139), Expect = 5e-12
Identities = 45/165 (27%), Positives = 72/165 (43%), Gaps = 6/165 (3%)

Query: 3 VVLADIDGDAVAALRDELAAGGGAAHDAACDVRDPAAVQDLADR-AYDIGPVRLLVNNAG 61
+ D + + + + L A A DVRD AA+ ++ R ++GP+ +LVN AG
Sbjct: 35 IAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSAAIDEITARIEREMGPIDILVNVAG 94

Query: 62 IEQFGYLWDTPVVNWQHVMDVNVSGVFYGVRAFLPKMMAAGQQAWVWNIASVGAVVAMPL 121
+ + G + W+ VN +GVF R+ MM + V + S A V
Sbjct: 95 VLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSKYMMDRRSGSIV-TVGSNPAGVPRTS 153

Query: 122 QAPYIVSKHAVLALTECLHLEVQATGHDDHVHVQAVLPGPVRSNI 166
A Y SK A + T+CL LE+ + V PG +++
Sbjct: 154 MAAYASSKAAAVMFTKCLGLELAEYN----IRCNIVSPGSTETDM 194


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1442   DHBDHDRGNASE  76     2e-18    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 76.2 bits (187), Expect = 2e-18
Identities = 61/214 (28%), Positives = 91/214 (42%), Gaps = 36/214 (16%)

Query: 5 LSGKTALVTGSSRGIGRAVAQRLAAAGATVAVTARSHS------SSLSTRAGTATALPGT 58
+ GK A +TG+++GIG AVA+ LA+ GA +A + SSL A A A P
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP-- 63

Query: 59 IGETIELIEAAGGSAFGIAADLEDADQRDGLVDAVLDRTGRIDILVNNAGFADYSLVEDM 118
AD+ D+ D + + G IDILVN AG L+ +
Sbjct: 64 -------------------ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSL 104

Query: 119 SLETFDRTVEHYLRVPFVLTKCAVPHMRKQGAGWIVNIGSVTGVAPVRPYREYNKASGDV 178
S E ++ T F ++ +M + +G IV +GS P +
Sbjct: 105 SDEEWEATFSVNSTGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVP---------RTSMA 155

Query: 179 VYAAMKAALHRFTQGVAAELLDASIAVNCVGPST 212
YA+ KAA FT+ + EL + +I N V P +
Sbjct: 156 AYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGS 189


77  MAP1449c  MAP1455c  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP1449c  0       10       -0.812467  hypothetical protein
MAP1450c  0       10       -1.224609  hypothetical protein
MAP1451   -1      12       -0.861614  hypothetical protein
MAP1452c  -1      7        -0.091263  hypothetical protein
MAP1453c  -1      8        0.010445   hypothetical protein
MAP1454   0       8        0.440661   hypothetical protein
MAP1455c  0       8        0.456034   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1449c  DHBDHDRGNASE  123    4e-36    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 123 bits (309), Expect = 4e-36
Identities = 78/263 (29%), Positives = 121/263 (46%), Gaps = 13/263 (4%)

Query: 3 GLSGRVVLVTGAGRGIGRSHCQRFAEEGADVIAVDVPAAAPDLAQTAAAVQQRGVRAATA 62
G+ G++ +TGA +GIG + + A +GA + AVD + ++ + R A A
Sbjct: 5 GIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPA 64

Query: 63 LADVSDFAALAAAVDEAVGRLGRLDVLVGNAGI-HPAAAPAWEITPQNWQQTLDVNLTGV 121
DV D AA+ +G +D+LV AG+ P ++ + W+ T VN TGV
Sbjct: 65 --DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPG--LIHSLSDEEWEATFSVNSTGV 120

Query: 122 WHTVKAGVPHMSRTARGGSIVIISSTSGIRGTPGAAPYSASKHAVVGLARTLANELGPQG 181
++ + V R GSIV + S A Y++SK A V + L EL
Sbjct: 121 FNASR-SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYN 179

Query: 182 IRVNTVHPGAVATAMVLNEATFRRLRPDLDNATADDAAEALSARHLLPVPWV-EPVDVSN 240
IR N V PG+ T M L D + A + + +P+ + +P D+++
Sbjct: 180 IRCNIVSPGSTETDMQ------WSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIAD 233

Query: 241 AVVFLASDQARYITGTQIVVDAG 263
AV+FL S QA +IT + VD G
Sbjct: 234 AVLFLVSGQAGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1451   DHBDHDRGNASE  113    3e-32    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 113 bits (284), Expect = 3e-32
Identities = 80/277 (28%), Positives = 121/277 (43%), Gaps = 24/277 (8%)

Query: 7 AGRLAGKVALITGAARGIGRAQAVRFAQEGADIVALDLCGPVDTVMVPPSTPDDLDHTAS 66
A + GK+A ITGAA+GIG A A A +GA I A+D P V S + H +
Sbjct: 3 AKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVD-YNPEKLEKVVSSLKAEARHAEA 61

Query: 67 LVGEVGGRMHAELVDVRDLDGVQAATERGARRFGGLDVVCATAGITSRAMTVEMDESVWR 126
DVRD + T R R G +D++ AG+ + + + W
Sbjct: 62 FP-----------ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWE 110

Query: 127 TMLDVNLTGVWHTCRAAAPHLIARGAGSMILTNSIAGLRGLVGVAHYTAAKHGVVGLMQS 186
VN TGV++ R+ + +++ R +GS++ S +A Y ++K V +
Sbjct: 111 ATFSVNSTGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKC 170

Query: 187 LAHELAPHRVRVNCVHPTNVDTPLIQNDTVRSAFRPDLDRPPTRAEFAEAARAMNLLQVP 246
L ELA + +R N V P + +T D S + + E + +P
Sbjct: 171 LGLELAEYNIRCNIVSPGSTET-----DMQWSLWADENGAEQVIKGSLETFK----TGIP 221

Query: 247 W---VDPVDVANAALFLASDEARYITAVTLPVDAGAT 280
P D+A+A LFL S +A +IT L VD GAT
Sbjct: 222 LKKLAKPSDIADAVLFLVSGQAGHITMHNLCVDGGAT 258


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
MAP1454   IGASERPTASE  29     0.022    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 29.3 bits (65), Expect = 0.022
Identities = 9/38 (23%), Positives = 15/38 (39%)

Query: 261 QAQQDKAEAQREAAQKEAEAESARKAAEINEARQKAEQ 298
Q+ + E Q ++ A E KA E Q+ +
Sbjct: 1087 QSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPK 1124


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1455c  DHBDHDRGNASE  88     2e-23    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 87.8 bits (217), Expect = 2e-23
Identities = 54/183 (29%), Positives = 86/183 (46%), Gaps = 11/183 (6%)

Query: 1 MLDRYGRIDVLVNNVGHWLRHPGGFADTDPQLWDELYRINLHHVFLVTHAFLPTMIDRGA 60
+ G ID+LVN G + PG + W+ + +N VF + + M+DR +
Sbjct: 79 IEREMGPIDILVNVAG--VLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSKYMMDRRS 136

Query: 61 GAIVNVSSVEGLRGYPEDPVYAAFKAAVIGFTRSLAVQVGNHGVRVNAVAPDVTESLQVP 120
G+IV V S YA+ KAA + FT+ L +++ + +R N V+P TE+ +
Sbjct: 137 GSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGSTET-DMQ 195

Query: 121 YSQWLS--AEEQ------SQWPRWVPVGRMGLPEDQARVILFLASDCSSFITGHTIPTDG 172
+S W EQ + +P+ ++ P D A +LFL S + IT H + DG
Sbjct: 196 WSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHITMHNLCVDG 255

Query: 173 GTT 175
G T
Sbjct: 256 GAT 258


78  MAP1468c  MAP1481c  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP1468c  -1      8        -0.521432  short chain dehydrogenase
MAP1469c  0       8        -0.862197  hypothetical protein
MAP1470c  -1      8        0.352011   hypothetical protein
MAP1471   -2      10       -0.066858  hypothetical protein
MAP1472c  -1      9        -0.001662  hypothetical protein
MAP1473c  0       8        -0.017346  hypothetical protein
MAP1474c  1       9        0.151733   RNA polymerase sigma factor SigF
MAP1475   0       10       1.201703   hypothetical protein
MAP1476c  0       10       1.084083   hypothetical protein
MAP1477c  0       10       0.254360   hypothetical protein
MAP1478   -1      8        0.356786   hypothetical protein
MAP1479c  -1      7        0.142811   hypothetical protein
MAP1480   -2      8        -0.135057  hypothetical protein
MAP1481c  -2      7        -1.066318  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1468c  DHBDHDRGNASE  69     8e-16    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 69.3 bits (169), Expect = 8e-16
Identities = 53/205 (25%), Positives = 92/205 (44%), Gaps = 17/205 (8%)

Query: 13 LRGKFAVVTGANSGLGFGLAKRLAAAGAEVVLAVRDPAKGDQAVAAIRREVPQAKLTIRQ 72
+ GK A +TGA G+G +A+ LA+ GA + +P K ++ V++++ E A+
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAF--P 63

Query: 73 LDLSSLRSVAALGEQLTAEGRPIDILINNAGVMAP-PRRQQTSDGFELQFGTNHLGHFAL 131
D+ ++ + ++ E PIDIL+N AGV+ P + + +E F N G F
Sbjct: 64 ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNA 123

Query: 132 TGRLLALLRAADSARVVTVSSIAATQRKLDFADVNAEHGYQPMYSYGVAKLAQLMFAVEL 191
+ + + S +VTV S A + M +Y +K A +MF L
Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPAGVPRTS------------MAAYASSKAAAVMFTKCL 171

Query: 192 DRRSRLGGWGLMSNAAHPGLAKTNL 216
L + + N PG +T++
Sbjct: 172 GL--ELAEYNIRCNIVSPGSTETDM 194


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP1470c  HTHTETR  47     1e-08    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 47.3 bits (112), Expect = 1e-08
Identities = 22/83 (26%), Positives = 36/83 (43%), Gaps = 1/83 (1%)

Query: 8 RSDRSTGTREAILSAAEVLFAERGMYAVSNRQISEAAGQGNNAAACYHFGTRVDLLRAIE 67
+ TR+ IL A LF+++G+ + S +I++AAG A +HF + DL I
Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGV-TRGAIYWHFKDKSDLFSEIW 63

Query: 68 GKHREPIEKLRAQMLAAVGDSTE 90
I +L + A
Sbjct: 64 ELSESNIGELELEYQAKFPGDPL 86


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1471   BINARYTOXINA  32     7e-04    Clostridial binary toxin A signature.
		>BINARYTOXINA#Clostridial binary toxin A signature.

Length = 454

Score = 31.9 bits (72), Expect = 7e-04
Identities = 14/50 (28%), Positives = 25/50 (50%), Gaps = 1/50 (2%)

Query: 34 VRITVSGRKTGLARSATVQYVPFRDGLLLVGSNWGRRRHPSWSANLKAAQ 83
VRI + G++ A ++ V + F+D + G WG+ + WS L +
Sbjct: 232 VRIVIEGKQYIKAEASIVNSLDFKDD-VSKGDLWGKENYSDWSNKLTPNE 280


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1475   DHBDHDRGNASE  84     5e-21    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 83.9 bits (207), Expect = 5e-21
Identities = 65/253 (25%), Positives = 113/253 (44%), Gaps = 5/253 (1%)

Query: 11 LSGRGAVVSGGSRGIGRAVAELLAGLGAGVVVNGRDPQAVQETVAAITAAGGRATAVVGA 70
+ G+ A ++G ++GIG AVA LA GA + +P+ +++ V+++ A A A
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 71 ADDERIARSLVDECIGAFGRLDALINCAGIAEPAGSSILNITADEFDHLIGAHLGTAFHT 130
D + G +D L+N AG+ P I +++ +E++ + F+
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPG--LIHSLSDEEWEATFSVNSTGVFNA 123

Query: 131 CRAAAPVMVEQRHGSIVNT-SSVAFLGDYGGTGYPAGKGAVNALTMAIAAELKAYGVRAN 189
R+ + M+++R GSIV S+ A + Y + K A T + EL Y +R N
Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183

Query: 190 VVCPGA-RTRLSTGADYERHIEDLHRRGLLDDMTRQASLDS-APPVFVAPVYGYLVSDLA 247
+V PG+ T + + + + +G L+ L A P +A +LVS A
Sbjct: 184 IVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQA 243

Query: 248 RDVTGQILVAAGG 260
+T L GG
Sbjct: 244 GHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP1477c  HTHTETR  50     7e-10    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 50.4 bits (120), Expect = 7e-10
Identities = 26/135 (19%), Positives = 51/135 (37%), Gaps = 9/135 (6%)

Query: 7 AGNTPASDAEAIERILDAADRIIAERG-SALRIADVARALGVTRQTVYRYFPGTQALLVA 65
A T E + ILD A R+ +++G S+ + ++A+A GVTR +Y +F L
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 66 SAMRSADGFLDRMAAHLDGVTDPVVAITEGMAFAVEELACDHQVEFVLNQRHRGGQKVSI 125
+ +++ + A G +V H +E + + R I
Sbjct: 62 --------IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEII 113

Query: 126 ISDTALAFGRSMLHR 140
+++ +
Sbjct: 114 FHKCEFVGEMAVVQQ 128


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
MAP1478   UREASE  55     7e-10    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 54.7 bits (132), Expect = 7e-10
Identities = 35/97 (36%), Positives = 47/97 (48%), Gaps = 16/97 (16%)

Query: 13 DTVITNGRWFDGTGGPSAMRDIGVRDGRVVTIA-AGPLDT---------AGATVIDASGQ 62
DTVITN D G A DIG++DGR+ I AG D G VI G+
Sbjct: 69 DTVITNALILDHWGIVKA--DIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGK 126

Query: 63 WVIPGIIDIHTHYDVEILCEPELSESLRHGVTTVLLG 99
V G +D H H+ +C ++ E+L G+T +L G
Sbjct: 127 IVTAGGMDSHIHF----ICPQQIEEALMSGLTCMLGG 159


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP1479c  HTHTETR  52     1e-10    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 51.9 bits (124), Expect = 1e-10
Identities = 15/69 (21%), Positives = 35/69 (50%)

Query: 16 ARRSDRRPGQTIRKVLDAGLQELRESSYAGLTMRAVATRAGVSPASAYTYFPSKSALVAA 75
AR++ + +T + +LD L+ + + ++ +A AGV+ + Y +F KS L +
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 76 VYLRFLRDL 84
++ ++
Sbjct: 62 IWELSESNI 70


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1481c  PRTACTNFAMLY  39     3e-05    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 39.3 bits (91), Expect = 3e-05
Identities = 43/150 (28%), Positives = 60/150 (40%), Gaps = 14/150 (9%)

Query: 336 AGSARVLVATDIAARGVHVDEV--------ELVVHIDPPSEHKSYLHRSGRTARAGSAGD 387
AG +VL +A G+ V +LVV D +H+ ++ SG + SA
Sbjct: 467 AGRFKVLTVNTLAGSGLFRMNVFADLGLSDKLVVMQDASGQHRLWVRNSG--SEPASANT 524

Query: 388 VVTVVLPEQREHTRALMRKAG-IDVAPQR--VTAGSQAVHALVGPIAPPKP-PAAAGVPS 443
++ V P T L K G +D+ R + A +LVG APP P PA P
Sbjct: 525 LLLVQTPLGSAATFTLANKDGKVDIGTYRYRLAANGNGQWSLVGAKAPPAPKPAPQPGPQ 584

Query: 444 HPAGPHRPAAAGQRRRRSGRSARTTAAHAV 473
P P A + +GR A AV
Sbjct: 585 PPQPPQPQPEAPAPQPPAGRELSAAANAAV 614


79  MAP1527c  MAP1534  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP1527c  0       9        -0.936186  hypothetical protein
MAP1528   -1      11       -0.546166  hypothetical protein
MAP1529   -2      9        -0.312515  hypothetical protein
MAP1530   -1      10       -0.087289  hypothetical protein
MAP1531c  -1      10       0.454048   hypothetical protein
MAP1532   0       11       1.450132   hypothetical protein
MAP1533   -1      10       0.859774   hypothetical protein
MAP1534   0       9        0.173437   preprotein translocase subunit SecA
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
MAP1527c  TCRTETOQM  28     0.033    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 27.5 bits (61), Expect = 0.033
Identities = 13/41 (31%), Positives = 15/41 (36%), Gaps = 6/41 (14%)

Query: 138 WQVKDYVAGLT------PDSSDEQFRAAAPTIPVFALTKAG 172
W V D P S+ FR AP + L KAG
Sbjct: 492 WNVTDCKICFKYGLYYSPVSTPADFRMLAPIVLEQVLKKAG 532


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP1528   HTHTETR  55     2e-11    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 55.0 bits (132), Expect = 2e-11
Identities = 18/159 (11%), Positives = 51/159 (32%), Gaps = 5/159 (3%)

Query: 8 GKRQQSREQIEARIIELGRRQLVDRGAAGLSVRAIARDLGMVSSAVYRYVSSRDELLTLL 67
K +Q ++ I+++ R +G + S+ IA+ G+ A+Y + + +L + +
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 68 LVDAYSDL----ADTVDRARETAGEQWSDDVIAIARATRRWAVEHPACWALLYGSPVPGY 123
+ S++ + + + +I + +T + + G
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 124 HAPPERTVAAG-TRVVAALFDAVAAGITTGDIRLTNDPA 161
A ++ + + I +
Sbjct: 123 MAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTR 161


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP1531c  PF05272  31     0.015    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 31.2 bits (70), Expect = 0.015
Identities = 11/52 (21%), Positives = 24/52 (46%)

Query: 456 LVITGRSGAGKTTLLRSLAELWPYASGTLCRPDGDNATMFLSQLPYVPLGTL 507
+V+ G G GK+TL+ +L L ++ G ++ ++ + L +
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIVAYELSEM 650


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MAP1534   SECA  856    0.0      SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 856 bits (2212), Expect = 0.0
Identities = 281/816 (34%), Positives = 395/816 (48%), Gaps = 119/816 (14%)

Query: 18 RLLGASTEKNRSRSLTLVTDSSEYDDEAAGLTDEQLR-KAAGLLNLEDLAESED--IPQF 74
++ G+ ++ R +V + + E L+DE+L+ K A + E + IP+
Sbjct: 8 KVFGSRNDRTLRRMRKVVNIINAMEPEMEKLSDEELKGKTAEFRARLEKGEVLENLIPEA 67

Query: 75 LAIAREAAERATGLRPFDVQLLGALRMLAGDVIEMATGEGKTLAGAIAAAGYVLAGRHVH 134
A+ REA++R G+R FDVQLLG + + + EM TGEGKTL + A L G+ VH
Sbjct: 68 FAVVREASKRVFGMRHFDVQLLGGMVLNERCIAEMRTGEGKTLTATLPAYLNALTGKGVH 127

Query: 135 VVTINDYLARRDAEWMGPLIEAMGLTVGWITAESSSEERRAAYGCDVTYASVNEIGFDVL 194
VVT+NDYLA+RDAE PL E +GLTVG + +R AY D+TY + NE GFD L
Sbjct: 128 VVTVNDYLAQRDAENNRPLFEFLGLTVGINLPGMPAPAKREAYAADITYGTNNEYGFDYL 187

Query: 195 RDQLVTDVADLVSPNPDVALIDEADSVLVDEALVPLVLAGTTHRETPRLEII-KLVGEL- 252
RD + + V AL+DE DS+L+DEA PL+++G + + + K++ L
Sbjct: 188 RDNMAFSPEERVQRKLHYALVDEVDSILIDEARTPLIISGPAEDSSEMYKRVNKIIPHLI 247

Query: 253 ----------EAGTDYDTDADSRNVHLTDVGARKVEKAL-------GGIDLYSEEHVGTT 295
+ + D SR V+LT+ G +E+ L G LYS ++
Sbjct: 248 RQEKEDSETFQGEGHFSVDEKSRQVNLTERGLVLIEELLVKEGIMDEGESLYSPANI-ML 306

Query: 296 LTEVNVALHAHVLLQRDVHYIVRDDAVHLINASRGRIAQLQRWPDGLQAAVEAKEGIETT 355
+ V AL AH L RDV YIV+D V +++ GR Q +RW DGL AVEAKEG++
Sbjct: 307 MHHVTAALRAHALFTRDVDYIVKDGEVIIVDEHTGRTMQGRRWSDGLHQAVEAKEGVQIQ 366

Query: 356 ETGEVLDTITVQALINRYATVCGMTGTALAAGEQLRQFYKLGVSPIPPNKPNIREDEADR 415
+ L +IT Q Y + GMTGTA + YKL +P N+P IR+D D
Sbjct: 367 NENQTLASITFQNYFRLYEKLAGMTGTADTEAFEFSSIYKLDTVVVPTNRPMIRKDLPDL 426

Query: 416 VYITAAAKNDAIVEHIIEVHETGQPVLVGTRDVAESEELHERLLRRGVPAVVLNAKNDAE 475
VY+T A K AI+E I E GQPVLVGT + +SE + L + G+ VLNAK A
Sbjct: 427 VYMTEAEKIQAIIEDIKERTAKGQPVLVGTISIEKSELVSNELTKAGIKHNVLNAKFHAN 486

Query: 476 EAQVIAEAGKFGVVTVSTQMAGRGTDIRLGGSDEA----------------------DHD 513
EA ++A+AG VT++T MAGRGTDI LGGS +A HD
Sbjct: 487 EAAIVAQAGYPAAVTIATNMAGRGTDIVLGGSWQAEVAALENPTAEQIEKIKADWQVRHD 546

Query: 514 RVAELGGLHVVGTGRHHTERLDNQLRGRAGRQGDPGSSVFFSSWEDDVVAA-NLDR---- 568
V E GGLH++GT RH + R+DNQLRGR+GRQGD GSS F+ S ED ++ DR
Sbjct: 547 AVLEAGGLHIIGTERHESRRIDNQLRGRSGRQGDAGSSRFYLSMEDALMRIFASDRVSGM 606

Query: 569 -NKLPMETDPETGDGRIVSPKAAGLLDHAQRVAEGRMLDVHANTWRYNQLIAQQRAIIVD 627
KL M+ I P + +AQR E R D+ Y+ + QR I
Sbjct: 607 MRKLGMKPGE-----AIEHPWVTKAIANAQRKVESRNFDIRKQLLEYDDVANDQRRAIYS 661

Query: 628 RRNTLLRTATAREEL-------------AELAPKRYRELADEIP---------------- 658
+RN LL + E + A + P+ E+ D
Sbjct: 662 QRNELLDVSDVSETINSIREDVFKATIDAYIPPQSLEEMWDIPGLQERLKNDFDLDLPIA 721

Query: 659 -------------------------EERLETIC---------RHIMLYHLDRGWADHLAY 684
+R E + + +ML LD W +HLA
Sbjct: 722 EWLDKEPELHEETLRERILAQSIEVYQRKEEVVGAEMMRHFEKGVMLQTLDSLWKEHLAA 781

Query: 685 LADIRESIHLRALGRQNPLDEFHRLAVDAFASLAAD 720
+ +R+ IHLR +++P E+ R + FA++
Sbjct: 782 MDYLRQGIHLRGYAQKDPKQEYKRESFSMFAAMLES 817


80  MAP1569  MAP1574c  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias   Product
MAP1569   -1      9        1.008046  ModD
MAP1570   1       11       0.602136  hypothetical protein
MAP1571   0       11       1.422413  AdhA_1
MAP1572c  -1      10       1.309966  hypothetical protein
MAP1573c  -1      9        1.917879  phosphoketolase
MAP1574c  -1      7        2.226785  short chain dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP1569   PF05616  39     3e-05    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 39.0 bits (90), Expect = 3e-05
Identities = 29/88 (32%), Positives = 33/88 (37%), Gaps = 4/88 (4%)

Query: 37 SHADPEVPTPVPPSTATAPPAAPAPNGQPAPNAQPAPGAPAPNGQPAPAAPAPNDPNAAP 96
S + V V P P +A APN QP P PA P PAP PN P
Sbjct: 299 SQGNTTVDVQVIPRPDLTPGSAEAPNAQPLPEVSPAEN---PANNPAPNENPGTRPNPEP 355

Query: 97 PPVGAPPNGAPPPPVDPNAPPPPPADPN 124
P P+ P P P PA P+
Sbjct: 356 DP-DLNPDANPDTDGQPGTRPDSPAVPD 382



Score = 34.3 bits (78), Expect = 7e-04
Identities = 28/78 (35%), Positives = 33/78 (42%), Gaps = 7/78 (8%)

Query: 43 VPTP-VPPSTATAPPAAPAPNGQPAPNAQPAPGAPAPNGQPAPAAPAPNDPNAAP---PP 98
+P P + P +A AP A P P PA N PA PAPN P DP+ P P
Sbjct: 310 IPRPDLTPGSAEAPNAQPLPEVSPAEN--PA-NNPAPNENPGTRPNPEPDPDLNPDANPD 366

Query: 99 VGAPPNGAPPPPVDPNAP 116
P P P P+ P
Sbjct: 367 TDGQPGTRPDSPAVPDRP 384


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1570   SECYTRNLCASE  26     0.034    Preprotein translocase SecY subunit signature.
		>SECYTRNLCASE#Preprotein translocase SecY subunit signature.

Length = 437

Score = 25.9 bits (57), Expect = 0.034
Identities = 12/54 (22%), Positives = 20/54 (37%)

Query: 40 RGSGAGILMDIVIGIVGALIGGFILSFFVNTAGGGLIFTFFTALLGSVILLWIV 93
RG G G+ + + I I T GG I +G +++ +V
Sbjct: 184 RGIGNGMSILMFISIAATFPSALWAIKKQGTLAGGWIEFGTVIAVGLIMVALVV 237


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1572c  ACRIFLAVINRP  29     0.030    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 28.7 bits (64), Expect = 0.030
Identities = 12/43 (27%), Positives = 20/43 (46%)

Query: 27 IAVVVVVLVLTNLVAHFTTPWASIGTVPAAAVGLVILMRYRGL 69
I+ VVV L L L ++ P + + VP VG+++
Sbjct: 878 ISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQ 920



Score = 27.9 bits (62), Expect = 0.047
Identities = 38/157 (24%), Positives = 61/157 (38%), Gaps = 23/157 (14%)

Query: 24 HLDIAVVVVVLVLTNLVAHFTTPWASIGTVPAAAVGLVILMRYRGLGWADLGLGRDHWKS 83
L A+++V LV+ + + VP +G ++ G L +
Sbjct: 343 TLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMF------ 396

Query: 84 GVGYALAAVAVVAAVIAIGVLLPATRPMFMNN---RYAT------ISGAMIASMVVIPVQ 134
G LA +V AI V+ R M + + AT I GA++ +V+
Sbjct: 397 --GMVLAIGLLVDD--AIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSA- 451

Query: 135 TVIPEELAFRGVLHGALNRAWGFR-GVALAGSLLFGL 170
IP +AF G GA+ R + A+A S+L L
Sbjct: 452 VFIP--MAFFGGSTGAIYRQFSITIVSAMALSVLVAL 486


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1574c  DHBDHDRGNASE  92     3e-24    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 92.0 bits (228), Expect = 3e-24
Identities = 68/236 (28%), Positives = 104/236 (44%), Gaps = 25/236 (10%)

Query: 11 VRDKVVVITGGARGIGLATATALHKLGAKVAIGD-----IDEVRVKESGAALDLDVYGKL 65
+ K+ ITG A+GIG A A L GA +A D +++V A + +
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAF-PA 64

Query: 66 DVTDPHSFSDFLDEVERQLGPIDVLVNNAGIMPLGRVVDESDAVTRRILDINVYGVILGS 125
DV D + + +ER++GPID+LVN AG++ G + SD +N GV S
Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNAS 124

Query: 126 KLALARMIPRGRGHVINVASLAGETYLAGAATYCASKHAVVGFTDAARIEYRRSGVTFSV 185
+ M+ R G ++ V S A Y +SK A V FT +E + ++
Sbjct: 125 RSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184

Query: 186 VKPTFVNTEL--------------IAGTS-----GAKGVRNAEPSDIADAIVKLVA 222
V P T++ I G+ G + A+PSDIADA++ LV+
Sbjct: 185 VSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVS 240


81  MAP1626c  MAP1632c  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP1626c  -2      11       -2.646652  NanT
MAP1627   0       18       -3.104097  hypothetical protein
MAP1628   0       25       -3.830551  hypothetical protein
MAP1629c  1       32       -4.024865  Aao
MAP1630c  2       39       -6.072395  hypothetical protein
MAP1631c  4       43       -7.447431  hypothetical protein
MAP1632c  2       40       -7.447562  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP1626c  TCRTETA  44     9e-07    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 43.7 bits (103), Expect = 9e-07
Identities = 34/143 (23%), Positives = 57/143 (39%), Gaps = 4/143 (2%)

Query: 16 NSFIAALLGWTMDAFDYFIVVLVYADIAKTFHHSKAEVA---FVTTATLIMRPVGALLFG 72
I L +DA +++ V + + HS A + +M+ A + G
Sbjct: 5 RPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLG 64

Query: 73 LWADRVGRRLPLMVDVMFYSVVGFLCAFAPNFTVLVILRLLYGIGMGGEWGLGAALAMEK 132
+DR GRR L+V + +V + A AP VL I R++ GI G + A +
Sbjct: 65 ALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGI-TGATGAVAGAYIADI 123

Query: 133 VPVERRGFFSGLLQEGYAFGYLL 155
+ R G + + FG +
Sbjct: 124 TDGDERARHFGFMSACFGFGMVA 146


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP1630c  PF05616  33     6e-04    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 33.2 bits (75), Expect = 6e-04
Identities = 16/39 (41%), Positives = 22/39 (56%)

Query: 31 PDETHGPAPGPAATPSPAPSTSPSPAASPSPSASPAPAP 69
PD T G A P A P P S + +PA +P+P+ +P P
Sbjct: 313 PDLTPGSAEAPNAQPLPEVSPAENPANNPAPNENPGTRP 351



Score = 28.6 bits (63), Expect = 0.019
Identities = 13/48 (27%), Positives = 18/48 (37%)

Query: 25 VVAGADPDETHGPAPGPAATPSPAPSTSPSPAASPSPSASPAPAPAAP 72
V +P P P P+P P +P A+P P P +P
Sbjct: 331 VSPAENPANNPAPNENPGTRPNPEPDPDLNPDANPDTDGQPGTRPDSP 378


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP1631c  HTHTETR  62     4e-14    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 62.3 bits (151), Expect = 4e-14
Identities = 28/169 (16%), Positives = 63/169 (37%), Gaps = 7/169 (4%)

Query: 1 MADEQADSRERLISGTRELLWDRGYVGTSPTAILQQSGVGQGSLYHHFRGKHDLVLAAEQ 60
E ++R+ ++ L +G TS I + +GV +G++Y HF+ K DL +
Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64

Query: 61 QAAADMQRSIKEAFAG-NRSAHDKIADYLTRQREVL-----RGCSVGRLTADPVIVGDDQ 114
+ +++ E A + + L E R + + VG+
Sbjct: 65 LSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMA 124

Query: 115 LRAPVAQTFEV-LHTCLTRTIREGQRSGEISVELEPHKVAAAISATIQG 162
+ + + + + +T++ + + +L + A + I G
Sbjct: 125 VVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISG 173


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP1632c  TCRTETB  151    4e-43    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 151 bits (384), Expect = 4e-43
Identities = 100/414 (24%), Positives = 170/414 (41%), Gaps = 24/414 (5%)

Query: 10 LCLGTALIIMEANVLNVAIPSIRQALHASPAQSLWIIDAYTLVLAALLLSAGRLGDRIGA 69
LC+ + ++ VLNV++P I + PA + W+ A+ L + G+L D++G
Sbjct: 19 LCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGI 78

Query: 70 RRCYLLGLAVFSIASVLCALAASSAE-LIAARTIQGVGAAVLIPAPLGLISAMFSDLTAR 128
+R L G+ + SV+ + S LI AR IQG GAA PA + ++ A + R
Sbjct: 79 KRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAA-FPALVMVVVARYIPKENR 137

Query: 129 AKAVAVWVTIGGVGFAAGPLIGGLLVSTFGWRSIFLINIPAAAIIAV-MVRLTVAEASRS 187
KA + +I +G GP IGG++ W +L+ IP II V + + + R
Sbjct: 138 GKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKEVRI 195

Query: 188 PLPFDYVGQALAIVGLSAVVFACVESSALAWMSPFVLLPAVAAALILGLFVIDQRHRGRA 247
FD G L VG+ + S F+++ ++ +FV R
Sbjct: 196 KGHFDIKGIILMSVGIVFFMLFTTSYSIS-----FLIVSVLS----FLIFVKHIRKVTDP 246

Query: 248 GAWVLLPVELLNNRPVNAGLMSGFVYNFTLYGLVLVYSYVFQSARGYSPVQTGLAFA-PL 306
+ L N P G++ G + T+ G V + Y+ + S + G P
Sbjct: 247 ----FVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPG 302

Query: 307 TVAALVTSLPAGRFVAAHGARRGIMIGMALSAIGLCALAFDAQRMPFVVLSIAFGIFAT- 365
T++ ++ G V G + IG+ ++ +F + + + I
Sbjct: 303 TMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWF---MTIIIVFVL 359

Query: 366 -GLSLSATGQTMAVMANASDQYKNTASSMLNTARQTGGVIGVAALGAITSRDLL 418
GLS + T + V ++ Q S+LN G+A +G + S LL
Sbjct: 360 GGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLL 413


82  MAP1716  MAP1726c  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP1716   0       25       -2.703894  short chain dehydrogenase
MAP1717   0       25       -2.943174  hypothetical protein
MAP1718c  -1      28       -3.308961  hypothetical protein
MAP1719c  -1      26       -3.133687  hypothetical protein
MAP1720   0       23       -2.521804  hypothetical protein
MAP1721c  0       25       -2.965243  hypothetical protein
MAP1722   0       21       -2.523138  hypothetical protein
MAP1723   -2      21       -3.523654  hypothetical protein
MAP1724c  -1      21       -3.418439  hypothetical protein
MAP1725c  -1      23       -3.320106  hypothetical protein
MAP1726c  -2      26       -3.978307  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1716   DHBDHDRGNASE  95     2e-25    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 95.1 bits (236), Expect = 2e-25
Identities = 56/186 (30%), Positives = 83/186 (44%), Gaps = 1/186 (0%)

Query: 6 KVVAITGGARGIGLATAKAFLAAGAKVALGDLDTELAEKQAVELGGDPAVV-GLSLDVSD 64
K+ ITG A+GIG A A+ + GA +A D + E EK L + DV D
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 65 PASFVAFLDDVEARLGRLDVLVSNAGIMPTGPFVDEPPTMSRRMIDVNVYGVLNGSRLAA 124
A+ +E +G +D+LV+ AG++ G VN GV N SR +
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 125 ARFVPRGAGHIVNIASLAGVTGEPGMATYCGTKHFVVGFTESLHRELRPHRVGVSLVLPG 184
+ R +G IV + S MA Y +K V FT+ L EL + + ++V PG
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 185 IINTEL 190
T++
Sbjct: 189 STETDM 194


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP1718c  PF07675  29     0.007    Cleaved Adhesin
		>PF07675#Cleaved Adhesin

Length = 1358

Score = 29.3 bits (65), Expect = 0.007
Identities = 33/126 (26%), Positives = 50/126 (39%), Gaps = 13/126 (10%)

Query: 10 TTAATAALAAAGLLAAAPAFADP----QVLQFGQMAEISSNGGTIDYTVSNLQPSGHNDG 65
T A++ L +A PA ADP Q + E+ GG DY ++N +P+ +
Sbjct: 427 TGTASSNLYSANFEYLTPANADPVVTTQNIIVTGQGEVVIPGGVYDYCITNPEPA--SGK 484

Query: 66 VWYSDVTAKGVSGNAVPNIADFNARAVNSSTFAVMKGNQTDGLPEGPLPLGTPVTGRLYF 125
+W G GN DF A TF + + DG + + +P Y
Sbjct: 485 MW-----IAGDGGNQPARYDDFAFEAGKKYTFTMRRAGMGDGT-DMEVEDDSP-ASYTYT 537

Query: 126 DVRNGT 131
R+GT
Sbjct: 538 VYRDGT 543


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP1719c  HTHTETR  64     7e-15    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 63.9 bits (155), Expect = 7e-15
Identities = 31/165 (18%), Positives = 63/165 (38%), Gaps = 12/165 (7%)

Query: 4 VAQPVRSDAARNREALIEVATRLFAAAAGGDEPSLRLIAREAGVGVGTLFRHFPTREALV 63
+A+ + +A R+ +++VA RLF+ G SL IA+ AGV G ++ HF + L
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQ-GVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLF 59

Query: 64 EAVYQDQVRRLTEGADQLLANHP--PAQAMRRWMDLFTDWLATKHGMLGTLRAMINNEQL 121
+++ + E + A P P +R + + T+ + + + +
Sbjct: 60 SEIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEF 119

Query: 122 GSGHTRI---------ELLAAIDKILAAGRAAGDIGDHISSEDVA 157
+ E I++ L A + + + A
Sbjct: 120 VGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAA 164


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1720   NUCEPIMERASE  39     1e-05    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 39.4 bits (92), Expect = 1e-05
Identities = 22/81 (27%), Positives = 30/81 (37%), Gaps = 11/81 (13%)

Query: 35 VFVTGGSGLTGPAVVSELLSAGHRVTGLARSAASAD------RLARLGAEPFT---GSLD 85
VTG +G G V LL AGH+V G+ D RL L F L
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDLA 62

Query: 86 DLDRLREGAAAA--DGVIHMA 104
D + + + A+ + V
Sbjct: 63 DREGMTDLFASGHFERVFISP 83


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP1721c  HTHTETR  70     5e-17    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 70.4 bits (172), Expect = 5e-17
Identities = 37/183 (20%), Positives = 63/183 (34%), Gaps = 7/183 (3%)

Query: 19 LPRISREQKERNRGRILAAAGEGFKARGIDGVGIDELMKAAGMSHGGFYNHFPSKEDLAL 78
+ R ++++ + R IL A F +G+ + E+ KAAG++ G Y HF K DL
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 79 EVLHQGFTDSLDTVAAVIDTHAHSGRAALHAIIDTYLSTEHRDHPEHGCASAALAADAGR 138
E+ ++ + + L I+ L + + +
Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRL-LMEIIFHKCEF 119

Query: 139 HGVKA--QEAYRRGLQGYIGAFADLLRVSARQRG---TKLDARRAREQAIGLFSQMVGAQ 193
G A Q+A R L+ + L RRA G S ++
Sbjct: 120 VGEMAVVQQAQRNLCLESYDRIEQTLK-HCIEAKMLPADLMTRRAAIIMRGYISGLMENW 178

Query: 194 LIA 196
L A
Sbjct: 179 LFA 181


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP1726c  HTHTETR  74     1e-18    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 73.9 bits (181), Expect = 1e-18
Identities = 33/175 (18%), Positives = 66/175 (37%), Gaps = 8/175 (4%)

Query: 1 MTRTQQRAAENRRTVIDAAREIIATQGVEALTLEAVAEKADVVVQTIYNRVGGRSALLTA 60
+T+Q A E R+ ++D A + + QGV + +L +A+ A V IY +S L +
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 61 VAEQALEQSRVYM-DPAYEADGTVEERMMLAANAYARFARERPHEFRILVEPPNEPEAVA 119
+ E + + + G + ++ ++ E V
Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVG 121

Query: 120 RIAELTRAQN-------ARLTAVLREGMAAGLIRADLDPDDVTTALWATFNGLLA 167
+A + +AQ R+ L+ + A ++ ADL + +GL+
Sbjct: 122 EMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLME 176


83  MAP1732c  MAP1740c  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP1732c  2       16       -2.498792  hypothetical protein
MAP1733   1       11       -4.523705  hypothetical protein
MAP1734   2       9        -4.604644  hypothetical protein
MAP1735   1       10       -4.170884  hypothetical protein
MAP1736   1       10       -4.191875  hypothetical protein
MAP1737   1       11       -3.661417  hypothetical protein
MAP1738   1       11       -3.261656  hypothetical protein
MAP1739c  1       15       -1.006539  FabG3_1
MAP1740c  1       17       -0.656736  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP1732c  HTHTETR  75     6e-19    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 75.4 bits (185), Expect = 6e-19
Identities = 36/143 (25%), Positives = 70/143 (48%), Gaps = 3/143 (2%)

Query: 14 RRKRADGEMSRERILDAATEIAAERGYEATSIGLVSAKCGLPASSIYWHFKNKDDLIAAV 73
R+ + + + +R+ ILD A + +++G +TS+G ++ G+ +IYWHFK+K DL + +
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 74 IERSFADWRKAWQVPDEGAPRDRLAGLAMQIAKVLMDSP--DFIRLGLMLALERRPVEPR 131
E S ++ + P D L+ L +I +++S + R LM + +
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLR-EILIHVLESTVTEERRRLLMEIIFHKCEFVG 121

Query: 132 ARAMFIQARAQAYDELADIVREL 154
A+ QA+ E D + +
Sbjct: 122 EMAVVQQAQRNLCLESYDRIEQT 144


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
MAP1735   ALARACEMASE  29     0.029    Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 28.6 bits (64), Expect = 0.029
Identities = 17/59 (28%), Positives = 22/59 (37%), Gaps = 5/59 (8%)

Query: 235 GLAPAWIGVGTLDLFYPECLEYARRLREAGVPAQEEIVPGAFHAFDQIVDKAPISAKFF 293
G+ W +G D F LE A LRE G ++ G FHA I +
Sbjct: 42 GIERIWSAIGATDGFALLNLEEAITLRERGWKGPILMLEGFFHA-----QDLEIYDQHR 95


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP1736   HTHTETR  54     3e-11    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 53.9 bits (129), Expect = 3e-11
Identities = 34/175 (19%), Positives = 61/175 (34%), Gaps = 9/175 (5%)

Query: 1 MANPVGLRERRRRQTSADIRDAAVRLTLERGFDKVTVDEICAEAGISTRTFFNYFPNKES 60
MA + RQ I D A+RL ++G ++ EI AG++ + +F +K
Sbjct: 1 MARKTKQEAQETRQH---ILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSD 57

Query: 61 ---AIAYGPSDIPPELVADFVAAGPAPYSVVLAELITLAAHHLRDVPPRREHAANMLELA 117
I EL ++ A P VL E++ + RR ++
Sbjct: 58 LFSEIWELSESNIGELELEYQAKFPGDPLSVLREIL-IHVLESTVTEERRRLLMEIIFHK 116

Query: 118 KTSPAVLAAFLADLERFQNQLTDIIVR--RQGMQPDDEMAPLISALALTAVRSGI 170
+A + D I + + ++ A L++ A +R I
Sbjct: 117 CEFVGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYI 171


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1738   ACRIFLAVINRP  45     1e-06    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 45.2 bits (107), Expect = 1e-06
Identities = 46/253 (18%), Positives = 94/253 (37%), Gaps = 32/253 (12%)

Query: 139 AQSNDGKASYVQVYLAGNQGEALANESVESVQNIVKSVQA--PNGVK---AYVTGP---A 190
A+ N A+ + + L A A ++ ++++ + +Q P G+K Y T P
Sbjct: 279 ARINGKPAAGLGIKL---ATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQL 335

Query: 191 ALSADQHTAGDRSLQLITAATFTVIIGMLLLVYRSVITVLLTLVMVVLELSAARGMVAFL 250
++ T L A ++ + L +++ L+ + V + L ++A
Sbjct: 336 SIHEVVKT-------LFEAIMLVFLV--MYLFLQNMRATLIPTIAVPVVLLGTFAILAAF 386

Query: 251 GYYKIIGLSTFATNLLVTLAIAAATDYAIFLIGRYQEARAVGES--REDAYYTMYKGTAH 308
GY I L+ F + LAI D AI ++ + + +E +M +
Sbjct: 387 GY-SINTLTMFG----MVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGA 441

Query: 309 VVAGSGMTIAGATFCLHFTNL--PYFQTLGIPLAIGMVVVVAAALTLGPAVISVASRFRQ 366
+V + + A F ++ I + M + V AL L PA + + +
Sbjct: 442 LVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPA---LCATLLK 498

Query: 367 TLEPRRTQRIRGW 379
+ + G+
Sbjct: 499 PVSAEHHENKGGF 511



Score = 39.8 bits (93), Expect = 6e-05
Identities = 38/203 (18%), Positives = 75/203 (36%), Gaps = 18/203 (8%)

Query: 740 KAAYEALKGTPLEGSKIYLAGTASIYKDLSDGNNYDLLIAGISSLCLIFIIMLIITRGVV 799
KA L+ +G K+ + + LS ++ ++ L+F++M + + +
Sbjct: 307 KAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHE---VVKTLFEAIMLVFLVMYLFLQNMR 363

Query: 800 ASAVIVGTVLLSLGASFGLSVLIWQHLIGIELHWMVL-AMSVIILLAVGADYNLLLV--- 855
A+ + V + L +F + G ++ + + M + I L V D +++V
Sbjct: 364 ATLIPTIAVPVVLLGTFAI-----LAAFGYSINTLTMFGMVLAIGLLV--DDAIVVVENV 416

Query: 856 ARFKEEIHAGLNTGIIRSMGGTGSVVTSAGLVFAFT---MMTMAVSELTVIGQVGTTIGL 912
R E +SM + +V + M S + Q TI
Sbjct: 417 ERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVS 476

Query: 913 GLLFDTLIVRSLMTPSIAALLGK 935
+ L+ L TP++ A L K
Sbjct: 477 AMALSVLVALIL-TPALCATLLK 498


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP1739c  DHBDHDRGNASE  137    8e-42    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 137 bits (347), Expect = 8e-42
Identities = 74/259 (28%), Positives = 123/259 (47%), Gaps = 18/259 (6%)

Query: 2 AERLAGKVALVSGGARGMGASHVRSLVAEGAKVVFGDILDDEGKAVAAEVGEATRY---L 58
A+ + GK+A ++G A+G+G + R+L ++GA + D ++ + V + + R+
Sbjct: 3 AKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAF 62

Query: 59 HLDVTKPEDWDAAVATALAEFGRIDVLVNNAGIINIGTLEDYALSEWQRILDINLTGVFL 118
DV D A E G ID+LVN AG++ G + + EW+ +N TGVF
Sbjct: 63 PADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122

Query: 119 GIRAVVKPMKEAGRGSIINISSIEGMAGTIACHGYTATKFAVRGLTKSAALELGPSGIRV 178
R+V K M + GSI+ + S + Y ++K A TK LEL IR
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 179 NSIHPGLIKTPM--TEWVPEDIFQSA-------------LGRAAEPKEVSNLVVYLASDE 223
N + PG +T M + W E+ + L + A+P ++++ V++L S +
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 224 SSYSTGSEFVVDGGTTAGL 242
+ + T VDGG T G+
Sbjct: 243 AGHITMHNLCVDGGATLGV 261


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP1740c  PF06580  34     0.001    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 33.7 bits (77), Expect = 0.001
Identities = 13/77 (16%), Positives = 28/77 (36%), Gaps = 12/77 (15%)

Query: 439 NAVRHAAATKLT------IAVEVADEVSIKVIDNGKGLPDDVSEA---GLKTLRRRAERV 489
N ++H A + V+++V + G + E+ GL+ +R R + +
Sbjct: 266 NGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQML 325

Query: 490 GG---TLTVGAAAGGGT 503
G + + G
Sbjct: 326 YGTEAQIKLSEKQGKVN 342


84  MAP2167c  MAP2177c  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP2167c  0       11       1.316695   hypothetical protein
MAP2168c  -2      8        0.693932   hypothetical protein
MAP2169c  -1      7        1.123209   hypothetical protein
MAP2170c  -2      8        1.143036   hypothetical protein
MAP2171c  -1      7        1.328675   hypothetical protein
MAP2172c  -1      7        1.163817   hypothetical protein
MAP2173c  -2      7        1.101732   hypothetical protein
MAP2174c  -1      8        1.586879   hypothetical protein
MAP2175c  0       10       0.866281   hypothetical protein
MAP2176c  0       10       0.464834   hypothetical protein
MAP2177c  0       10       0.202414   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP2167c  TYPE3IMSPROT  28     0.025    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 27.8 bits (62), Expect = 0.025
Identities = 8/31 (25%), Positives = 15/31 (48%)

Query: 100 DKKLQKAAKNGDLPLSFDVTNIQPTASGAAT 130
KK++ A K G + S +V + + +A
Sbjct: 11 PKKIRDARKKGQVAKSKEVVSTALIVALSAM 41


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
MAP2168c  SALSPVBPROT  29     0.017    Salmonella virulence plasmid 65kDa B protein signature.
		>SALSPVBPROT#Salmonella virulence plasmid 65kDa B protein signature.

Length = 591

Score = 28.6 bits (63), Expect = 0.017
Identities = 9/14 (64%), Positives = 10/14 (71%)

Query: 43 APLPQDPPPPPPPP 56
AP+ PPPPPPP
Sbjct: 360 APVNNMMPPPPPPP 373


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP2171c  ISCHRISMTASE  32     0.018    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 31.9 bits (72), Expect = 0.018
Identities = 24/111 (21%), Positives = 43/111 (38%), Gaps = 9/111 (8%)

Query: 933 RIIMVDEIPLTPNGKLDETALAAVDNAEAVDGAAPPQTGTESALAELIAELLGQP--RVD 990
+M D + LD+ A D + T + + IAELL + +
Sbjct: 199 FTVMTDSL-------LDQLQNAPADVQKTSANTGKKNVFTCENIRKQIAELLQETPEDIT 251

Query: 991 VTADFLALGLDSIMALSVVQAARARGIALRARLVLDCTSIRELAEAIDAES 1041
D L GLDS+ +++V+ R G + + + +I E + + S
Sbjct: 252 DQEDLLDRGLDSVRIMTLVEQWRREGAEVTFVELAERPTIEEWQKLLTTRS 302


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP2172c  ISCHRISMTASE  37     8e-05    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 37.3 bits (86), Expect = 8e-05
Identities = 20/80 (25%), Positives = 33/80 (41%), Gaps = 2/80 (2%)

Query: 290 PRPVVSTGGRTEPTRTDTERALANVFAELL--STPEVGRFDDFFALGGDSILSVQLASRA 347
P V T T T + AELL + ++ +D G DS+ + L +
Sbjct: 214 PADVQKTSANTGKKNVFTCENIRKQIAELLQETPEDITDQEDLLDRGLDSVRIMTLVEQW 273

Query: 348 RAAGLPVSPRMIFENPTVQQ 367
R G V+ + E PT+++
Sbjct: 274 RREGAEVTFVELAERPTIEE 293


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP2174c  NUCEPIMERASE  30     0.045    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 30.1 bits (68), Expect = 0.045
Identities = 17/90 (18%), Positives = 36/90 (40%), Gaps = 5/90 (5%)

Query: 657 VVITGGSGAIGLHYARYCLQRGTRNLTLLSRNGIEAAVLRELTGSHDAR--VSAPRCDIT 714
++TG +G IG H ++ L+ G + + + + N L++ A+ + D+
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDLA 62

Query: 715 DRAAVTQAAARYAGSGATLLIHTAGIAQAR 744
DR +T +A + + R
Sbjct: 63 DREGMTDL---FASGHFERVFISPHRLAVR 89


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP2177c  ISCHRISMTASE  75     2e-16    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 75.1 bits (184), Expect = 2e-16
Identities = 34/75 (45%), Positives = 47/75 (62%)

Query: 5 PARSEDIREEVAELLGVDVDAVQPGSNLIGQGLDSIRIMTLAGRWRRRGIAVDFATLAET 64
E+IR+++AELL + + +L+ +GLDS+RIMTL +WRR G V F LAE
Sbjct: 229 VFTCENIRKQIAELLQETPEDITDQEDLLDRGLDSVRIMTLVEQWRREGAEVTFVELAER 288

Query: 65 PTIEAWAQLVTAGRQ 79
PTIE W +L+T Q
Sbjct: 289 PTIEEWQKLLTTRSQ 303


85  MAP2225c  MAP2232  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP2225c  0       11       -0.025558  hypothetical protein
MAP2226c  0       12       -0.613457  hypothetical protein
MAP2227   0       12       -1.210191  FadE17
MAP2228   0       13       -1.177412  hypothetical protein
MAP2229   1       14       -1.338837  hypothetical protein
MAP2230c  1       14       -1.620932  hypothetical protein
MAP2231   1       12       -3.251532  PapA3_1
MAP2232   0       10       -2.657130  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP2225c  PF06057  31     0.006    Type IV secretory pathway VirJ component
		>PF06057#Type IV secretory pathway VirJ component

Length = 243

Score = 30.6 bits (69), Expect = 0.006
Identities = 23/114 (20%), Positives = 37/114 (32%), Gaps = 11/114 (9%)

Query: 22 VAEVLAEARKRAGA--------SGDADV---PINRMRAGDVSTYELAELLSPSLFADERI 70
++ + + G S A+V +N M A A LLSPS +D I
Sbjct: 103 TLAIIDKYQAEFGTQKVILIGYSFGAEVIPFVLNEMPARYRKNVLGAVLLSPSQSSDFEI 162

Query: 71 VVLEAAGEAGKDAAAVILSAAAEMPAGVVLVVVHSGGGRAKALATELQSLGAVV 124
V E + A + L + +L + L E++ V
Sbjct: 163 HVSEMVTSDNQSARYLTLPEVNKQTTVPMLCLYGKEDDAPLHLCPEVKQPNVTV 216


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
MAP2229   LIPOLPP20  28     0.010    LPP20 lipoprotein precursor signature.
		>LIPOLPP20#LPP20 lipoprotein precursor signature.

Length = 175

Score = 28.2 bits (62), Expect = 0.010
Identities = 22/80 (27%), Positives = 44/80 (55%), Gaps = 8/80 (10%)

Query: 38 VVDDGELAFNTGRGTAKARA-IARDSRVVICVDDPHPPYSFVQVQGVAAVS----EDPAE 92
++ + ++ ++T + TAKARA +A + + + D + V G ++S E ++
Sbjct: 72 LITNNDVDYSTNQATAKARANLAANLKSTLQKDLENEKTRTVDASGKRSISGTDTEKISQ 131

Query: 93 VLD---IATRTGARYMGADR 109
++D IA++ ARY+G DR
Sbjct: 132 LVDKELIASKMLARYVGKDR 151


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP2230c  DHBDHDRGNASE  47     5e-07    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 47.4 bits (112), Expect = 5e-07
Identities = 34/161 (21%), Positives = 63/161 (39%), Gaps = 8/161 (4%)

Query: 1185 LVTGGLGSIGLEIAGHLAAQGARHLVLTGRRAPGEAAQRRIDALSQQHGCEVRVIAADVA 1244
+TG IG +A LA+QGA + E ++ + +L + ADV
Sbjct: 12 FITGAAQGIGEAVARTLASQGAH---IAAVDYNPEKLEKVVSSLKAE-ARHAEAFPADVR 67

Query: 1245 DAHHVARLLGAVRAELPPVAGIVHAAGEIGTTPLSDLEDAEIDRVFAGKVWGAWHLSEAA 1304
D+ + + + E+ P+ +V+ AG + + L D E + F+ G ++ S +
Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 1305 A----DLRLDFFVSTSSIASVWGGLGQTAYGAANAFLDGLA 1341
+ D R V+ S + AY ++ A
Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFT 168



Score = 41.2 bits (96), Expect = 4e-05
Identities = 31/113 (27%), Positives = 48/113 (42%), Gaps = 4/113 (3%)

Query: 3297 LITGGLGAIGLHLAAYLAQLGAGEIVLTSRRAPDAAARRAIDDITERYRCRVHTFAADVG 3356
ITG IG +A LA GA I + R+ F ADV
Sbjct: 12 FITGAAQGIGEAVARTLASQGA-HIAAVDYNPEKLEKVVSSLKAEARHA---EAFPADVR 67

Query: 3357 DAAQVEQLLARIRAELPPLAGVAHLAGVLDDALLSQQSPERFRVALAPKAFGA 3409
D+A ++++ ARI E+ P+ + ++AGVL L+ S E + + + G
Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGV 120


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP2232   ACRIFLAVINRP  61     2e-11    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 61.0 bits (148), Expect = 2e-11
Identities = 36/215 (16%), Positives = 80/215 (37%), Gaps = 30/215 (13%)

Query: 185 AVLVLVVLLVVYRSAVTMLLPLVTILLSLVIAQAAVA--GYSQLTGSGVSNQSIVFLSAI 242
+LV +V+ + ++ L+P + + + L+ A +A GYS N +F +
Sbjct: 348 IMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSI-------NTLTMFGMVL 400

Query: 243 MAGAGTDYAVFLISRYHDYLR-RGDDFDQAVRKALISIGKVITASASTVGITFLLIGFAR 301
G D A+ ++ + +A K++ I + A + F+ + F
Sbjct: 401 AIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAF-- 458

Query: 302 MG-----VFKTVGISSAIGIGVAFLAAVTLLPAIMVL-------------AGPRGWIRPR 343
G +++ I+ + ++ L A+ L PA+ G GW
Sbjct: 459 FGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTT 518

Query: 344 RELTTGLWRRSGIRIVRRPRTHLVASVLVLIILAS 378
+ + + S +I+ +L+ L++ +
Sbjct: 519 FDHSVNHYTNSVGKILGSTGRYLLIYALIVAGMVV 553


86  MAP2312c  MAP2324c  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
MAP2312c  -1      6        0.027247    FadE19
MAP2313c  -1      7        0.643983    AccA1
MAP2314c  -1      7        -0.728978   AccD1
MAP2315   -2      8        -0.654774   hypothetical protein
MAP2316   -2      9        -1.012696   hypothetical protein
MAP2317c  -1      8        -1.722477   hypothetical protein
MAP2318   1       8        -1.635522   hypothetical protein
MAP2319c  0       6        -1.378068   hypothetical protein
MAP2320   2       8        -0.717385   oligoribonuclease
MAP2321   2       9        -0.453933*  hypothetical protein
MAP2322c  1       9        -0.408326   hypothetical protein
MAP2323c  1       10       0.136769    hypothetical protein
MAP2324c  -1      9        -0.642507   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP2312c  CLENTEROTOXN  33     0.002    Clostridium enterotoxin signature.
		>CLENTEROTOXN#Clostridium enterotoxin signature.

Length = 319

Score = 32.7 bits (74), Expect = 0.002
Identities = 25/100 (25%), Positives = 43/100 (43%), Gaps = 11/100 (11%)

Query: 142 TRTTARLEDGEWVINGSKQFITNSGTDITSLVTVTAVTGTVG---ADKKEISTIIVPSGT 198
T + + L DG +VI+ +I + ++S + TGT KE+S I + +
Sbjct: 31 TNSNSNLSDGLYVIDKGDGWILGEPSVVSSQILNPNETGTFSQSLTKSKEVS--INVNFS 88

Query: 199 PGFTVEPVYSKV----GWNASDTHPL--SFSDARVPEENL 232
GFT E + + V G + + + S S P E +
Sbjct: 89 VGFTSEFIQASVEYGFGITIGEQNTIERSVSTTAGPNEYV 128


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP2315   HTHTETR  77     2e-19    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 76.6 bits (188), Expect = 2e-19
Identities = 34/176 (19%), Positives = 64/176 (36%), Gaps = 9/176 (5%)

Query: 16 PNRRSQLKSDRRLQLLSAAERLFAERGFLAVRLEDIGASAGVSGPAIYRHFPNKESLLVE 75
+ Q + R +L A RLF+++G + L +I +AGV+ AIY HF +K L E
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 76 LLVGISTRLLAGARQV-RARSADAAAALDGLIDFHLDFALNEPDLIRIQDRDLAYLPKPA 134
+ + + + D + L ++ L+ + E + +
Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVG 121

Query: 135 ERQ-VRKAQRQYVEVWVGVLREL------NPELAEA-DARLTAHAVFGLLNSTPHS 182
E V++AQR + + L R A + G ++ +
Sbjct: 122 EMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMEN 177


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP2317c  TCRTETA  31     0.009    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 31.3 bits (71), Expect = 0.009
Identities = 69/364 (18%), Positives = 119/364 (32%), Gaps = 42/364 (11%)

Query: 53 IGGSTTPASWLGRAAAIAGLTVALLAPAVGVWVESPHRRRVALSVLTALAVTLTGSMFFI 112
+ S + G A+ L AP +G + RR V L L + ++
Sbjct: 35 LVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSL--AGAAVDYAIMAT 92

Query: 113 RDRPGYLWAGLVLLGATAACGDLASVPYNAMLRQLSTPRTAGRISGFGWAAGYVGSVLLL 172
L+ G ++ G T A G +A A + ++ R FG+ + G
Sbjct: 93 APFLWVLYIGRIVAGITGATGAVAG----AYIADITDGDE--RARHFGFMSACFG----- 141

Query: 173 LVIYTGFIAGSGSGPDATRGLLRVPLRDGLYVREAMLVAAAWLALFALPLLFVAHRLTES 232
G +AG G GL+ G + A AAA L +
Sbjct: 142 ----FGMVAGPVLG-----GLM------GGFSPHAPFFAAAALNGLNFLTGCFLLPESHK 186

Query: 233 AEGYRPTSMLGGYRKLWTEVREEWRRDRNLVYFLFASALFRDGLAAIFAFGAVLGVNVYG 292
E RP L+ F L AA++ + G + +
Sbjct: 187 GER-RPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALW---VIFGEDRFH 242

Query: 293 ISQADVLVFGVAASVVAAVGA----VLGGFVDHRVGSKPVIVASLLA-IVVLGLTLMALS 347
+ G++ + + + ++ G V R+G + ++ ++A L A
Sbjct: 243 WDATTI---GISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATR 299

Query: 348 GPVAFWACGLLLCMFIGPSQSSSRALLLQMAKHGREGVAFGLYTMTGRAVSFVAPWLFSV 407
G +AF ++L G + +A+L + R+G G S V P LF+
Sbjct: 300 GWMAF--PIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTA 357

Query: 408 FVDA 411
A
Sbjct: 358 IYAA 361


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP2318   DHBDHDRGNASE  83     7e-21    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 82.8 bits (204), Expect = 7e-21
Identities = 54/200 (27%), Positives = 94/200 (47%), Gaps = 9/200 (4%)

Query: 12 AVVTGASQNIGEALATELAARGHHLIVTARREDLLKDLAARLTEKYRVTVEVRPADLADA 71
A +TGA+Q IGEA+A LA++G H+ + L+ + + L + R E PAD+ D+
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEAR-HAEAFPADVRDS 69

Query: 72 GERATLCDELAART--ISVLCANAGTATFGPVATLDPAGEKAQVQLNVLGVHDLTLAVLP 129
+ + I +L AG G + +L +A +N GV + + +V
Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSK 129

Query: 130 GMVERRAGGILISGSAAGNSPIPYNATYAATKAFVNTFSESLRGELRGSGVHVTLLAPGP 189
M++RR+G I+ GS P A YA++KA F++ L EL + +++PG
Sbjct: 130 YMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGS 189

Query: 190 VRTDL------PDDAEASIV 203
TD+ ++ ++
Sbjct: 190 TETDMQWSLWADENGAEQVI 209


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP2319c  PF03544  29     0.035    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 29.2 bits (65), Expect = 0.035
Identities = 11/53 (20%), Positives = 15/53 (28%), Gaps = 3/53 (5%)

Query: 437 QTVQQPAQPPPGPQ---APTGSRSPQAQSDYPPVPPNGPVPPMPEPAEPKGPG 486
Q VQ P +P P+ P +A P P P +
Sbjct: 64 QAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKR 116


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
MAP2324c  ACRIFLAVINRP  53  5e-09  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 52.9 bits (127), Expect = 5e-09
Identities = 42/227 (18%), Positives = 87/227 (38%), Gaps = 15/227 (6%)

Query: 187 IAIPLSFAVLVWVLGGVVAATLPVVLGALAIVGTMSVLRLISFATDVSTYALDLSIAMGL 246
AI L F V+ L + A +P + + ++GT ++L ++ + T + +A+GL
Sbjct: 346 EAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFG-MVLAIGL 404

Query: 247 ALAIDYNLLIITRYREELARTEDRDR-ALYRTMATAGRTVLFSATT---VGLSMAVMAVF 302
+D ++++ + + + A ++M+ ++ A V + MA
Sbjct: 405 L--VDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGS 462

Query: 303 PMYFLKSSAYTIVATAIIVAIAAVVITPA-AIVLLGPRLDTMDARRVMHRILHGQRSFRD 361
+ + TIV+ + + A+++TPA LL P H G + +
Sbjct: 463 TGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPV------SAEHHENKGGFFGWFN 516

Query: 362 PAHKPLVEQFWYRSTKYVLRRALPVGLSVVALLLLLGVPFLGVKWGF 408
V + S +L L ++ + V FL + F
Sbjct: 517 TTFDHSVN-HYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSF 562


87  MAP2526c  MAP2541c  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP2526c   0      23       -4.639867  hypothetical protein
MAP2527c   1      23       -5.473579  hypothetical protein
MAP2528   -2      11       -1.373477  hypothetical protein
MAP2529   -2       9       -0.215850  hypothetical protein
MAP2530   -2       8        0.435858  hypothetical protein
MAP2531   -2       9       -0.178798  hypothetical protein
MAP2532   -2      10        0.147896  hypothetical protein
MAP2533   -1      11        0.580750  hypothetical protein
MAP2534c  -2      13        0.229125  hypothetical protein
MAP2535   -1      14       -0.135106  hypothetical protein
MAP2536   -1      14       -0.466658  alpha-ketoglutarate decarboxylase
MAP2537   -1      12        0.090543  hypothetical protein
MAP2538   -1       8       -0.022643  hypothetical protein
MAP2539c   1       7        0.273465  hypothetical protein
MAP2540c   1       8       -1.490654  hypothetical protein
MAP2541c   1       9       -2.525816  malate dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP2526c  DHBDHDRGNASE  127    4e-38    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 127 bits (321), Expect = 4e-38
Identities = 79/251 (31%), Positives = 123/251 (49%), Gaps = 6/251 (2%)

Query: 3 RVAVVTGGGSGIGRAIVERLAHDRHRVAVLDVNEEAAEKVAARVAADGAHAIAVPTDVAE 62
++A +TG GIG A+ LA +A +D N E EKV + + A+ HA A P DV +
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 63 SASVAAAFESVRRALGPVQVLVTSAAITGFKPFGEITIEDWNRHLAVNLTGTFLCLQAAL 122
SA++ + R +GP+ +LV A + ++ E+W +VN TG F ++
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 123 PDMVEAGWGRVVTISSTAAQTGSPRQGHYSASKGGVIALTRTIALEYAVHGITANTVPPF 182
M++ G +VT+ S A Y++SK + T+ + LE A + I N V P
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 183 SVDTPMLRA--AQEAGNLPPVKYLAK----ASPVGRLGTGEDIAAACAFLCSDEAGYITG 236
S +T M + A E G +K + P+ +L DIA A FL S +AG+IT
Sbjct: 189 STETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHITM 248

Query: 237 QIIGVNGGAVI 247
+ V+GGA +
Sbjct: 249 HNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP2527c  HTHTETR  62     4e-14    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 62.3 bits (151), Expect = 4e-14
Identities = 20/88 (22%), Positives = 37/88 (42%)

Query: 7 PRRVGAETSQTRDALLEAVAQMMLEEGYASVTYRALAAKAGVTPSLVQYYFPSLDDIFVA 66
R+ E +TR +L+ ++ ++G +S + +A AGVT + ++F D+F
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 67 AIRRYSERNLQWLTEELQRRADDPLHAL 94
+ E + DPL L
Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVL 89


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP2528   DHBDHDRGNASE  116    2e-33    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 116 bits (292), Expect = 2e-33
Identities = 90/279 (32%), Positives = 131/279 (46%), Gaps = 25/279 (8%)

Query: 6 AGKVEGKVAFITGAARGQGRSHAITLAREGADIIAIDVCKQLDGVKLPMSTPDDLAETVR 65
A +EGK+AFITGAA+G G + A TLA +GA I A+D P+ L + V
Sbjct: 3 AKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVD------------YNPEKLEKVVS 50

Query: 66 QVEALGRRIIASQVDVRDFDAMQAAVDDGVTQLGRLDIVLANAALASEGTRLNRMGPKTW 125
++A R A DVRD A+ ++G +DI++ A + G ++ + + W
Sbjct: 51 SLKAEARHAEAFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGL-IHSLSDEEW 109

Query: 126 RDMIDVNLNGAWITARVAIPHIMAGKRGGSIVFTSSIGGLRGAENIGNYIASKHGLHGLM 185
VN G + +R ++M +R GSIV S ++ Y +SK
Sbjct: 110 EATFSVNSTGVFNASRSVSKYMMD-RRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFT 168

Query: 186 RTMALELGPRNIRVNIVCPSSVATPMLLNEPTYRMFRPDLENPTVEDFKVASRQMHVLPI 245
+ + LEL NIR NIV P S T M + + ++E FK I
Sbjct: 169 KCLGLELAEYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFK--------TGI 220

Query: 246 PY---VEPADISNAILFLVSDDARYITGVALPVDGGALL 281
P +P+DI++A+LFLVS A +IT L VDGGA L
Sbjct: 221 PLKKLAKPSDIADAVLFLVSGQAGHITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP2530   DHBDHDRGNASE  109    6e-31    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 109 bits (273), Expect = 6e-31
Identities = 75/250 (30%), Positives = 113/250 (45%), Gaps = 11/250 (4%)

Query: 10 RVAVITGAGTGIGRASALVLAEHGADIVLAGRRPDPLQATAKEVEALGRRALIVPTDVTE 69
++A ITGA GIG A A LA GA I P+ L+ ++A R A P DV +
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 70 THQCQDLVDATLADFGRLDILVNNAGGGETKGITRWTEEEWHDVVDLNLGSVWFLSRCAV 129
+ ++ + G +DILVN AG I ++EEW +N V+ SR
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 130 KPMMTQGSGAIVNISSGASLIAMPQAAIYAAAKAGVNNLTGSMAAAWGRKGIRVNCIACG 189
K MM + SG+IV + S + + A YA++KA T + IR N ++ G
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 190 AIRTD---GLLAD------AAKGGFDVDQLGAMNAMGRIAEPDEIGYGVLFFASDASSYC 240
+ TD L AD KG + + G + ++A+P +I VLF S + +
Sbjct: 189 STETDMQWSLWADENGAEQVIKGSLETFKTGI--PLKKLAKPSDIADAVLFLVSGQAGHI 246

Query: 241 SGQTLYIHGG 250
+ L + GG
Sbjct: 247 TMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP2534c  TCRTETB  156    1e-43    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 156 bits (397), Expect = 1e-43
Identities = 92/401 (22%), Positives = 175/401 (43%), Gaps = 19/401 (4%)

Query: 43 LWAMMIGFFMIMVDSTIVAIANPTIMADLHIGYDTVVWVTSAYLLGYAVVLLVAGRLGDR 102
+W ++ FF ++ + ++ ++ P I D + + WV +A++L +++ V G+L D+
Sbjct: 17 IWLCILSFFSVL-NEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ 75

Query: 103 FGTKNLYLIGLAVFTVASVWCGLAGS-AAMLIAARVVQGVGAGVLTPQTLSTITRIFPPE 161
G K L L G+ + SV + S ++LI AR +QG GA + + R P E
Sbjct: 76 LGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKE 135

Query: 162 RRGVAVSVWSATAGAASLVGPLAGGVLVDGLGWQWIFFVNVPIGVLGLALAYWLVPVLPT 221
RG A + + VGP GG++ + W ++ + + + L L +
Sbjct: 136 NRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRI 195

Query: 222 QSHRFDLVGVGLSGVGMFLIVFGLQQGQAAHWQPWIWALIVAGVGFVTVFVFWQSVNVRE 281
+ H FD+ G+ L VG+ + + + ++ V +FV V +
Sbjct: 196 KGH-FDIKGIILMSVGIVFFMLFTTS--------YSISFLIVSVLSFLIFVKHIR-KVTD 245

Query: 282 PLIPLVIFADRDFSLCNI--GVAIISFAATAMMLPLTFYAQAVCGLSP-TRSALLIAPMA 338
P + + + F + + G+ + A M+P + V LS +++I P
Sbjct: 246 PFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMM--KDVHQLSTAEIGSVIIFPGT 303

Query: 339 IANGVFAPFVGKIVDRYHPRPVLGFGFSLLAIALTWLTFEMSPATPIWRLVLPFFAMGVG 398
++ +F G +VDR P VL G + L+++ +LT T W + + + G
Sbjct: 304 MSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVS--FLTASFLLETTSWFMTIIIVFVLGG 361

Query: 399 MAFVWSPLTATATRNLSAQLAGAGSAVYNSVRQLGAVLGSA 439
++F + ++ + +L Q AGAG ++ N L G A
Sbjct: 362 LSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIA 402


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP2536   PF03544  40     5e-05    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 39.6 bits (92), Expect = 5e-05
Identities = 19/105 (18%), Positives = 25/105 (23%), Gaps = 8/105 (7%)

Query: 55 STDGPSAPAPAAPQTAQPALPAQTAPPAQTAQPARPAPQP-------AAAPGNGASTRPA 107
PAPA P + PA PP P P +P P
Sbjct: 37 VHQVIELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEK 96

Query: 108 KPATPPPAEGDELQTLRGAAAAVVK-NMSASLEVPTATSVRAIPA 151
P P + + + AS TA +
Sbjct: 97 PKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSST 141



Score = 38.4 bits (89), Expect = 9e-05
Identities = 22/129 (17%), Positives = 35/129 (27%), Gaps = 15/129 (11%)

Query: 42 PEPTGDSVLAAPAST-DGPSAPAPAAPQTAQPALPAQTAPPAQTAQPARPAPQPAAAPGN 100
P P + P P P+ A + P +P + QP
Sbjct: 61 EPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPK----- 115

Query: 101 GASTRPAKPATPPPAEGDELQTLRGAAAAVVKNMSASLEVPTATSVRAIPAKLLIDNRIV 160
R KP PA A A + +A+ + A + L N+
Sbjct: 116 ----RDVKPVESRPA-----SPFENTAPARPTSSTATAATSKPVTSVASGPRALSRNQPQ 166

Query: 161 INNQLKRTR 169
+ + R
Sbjct: 167 YPARAQALR 175


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP2537   DHBDHDRGNASE  77     1e-18    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 76.6 bits (188), Expect = 1e-18
Identities = 58/201 (28%), Positives = 91/201 (45%), Gaps = 2/201 (0%)

Query: 2 QGFAGKVAVVTGAGSGIGQALAVELARSGAKVAISDVDLEGLAHTEEQLKAIGAQYKADR 61
+G GK+A +TGA GIG+A+A LA GA +A D + E L LKA +A
Sbjct: 4 KGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP 63

Query: 62 LDVTEREAFLAYADAVKEHFGKVNQIYNNAGIAFTGDVEVSQFKDIERVMDVDFWGVVNG 121
DV + A ++ G ++ + N AG+ G + ++ E V+ GV N
Sbjct: 64 ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNA 123

Query: 122 TKAFLPHLIASGDGHVVNVSSVFGLFSVPGQAAYNSAKFAVRGFTEALRQEMAAAGHPVA 181
+++ +++ G +V V S AAY S+K A FT+ L E+A +
Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYN--IR 181

Query: 182 VTTVHPGGIKTAIARNATAAE 202
V PG +T + + A E
Sbjct: 182 CNIVSPGSTETDMQWSLWADE 202


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP2538   DHBDHDRGNASE  76     3e-18    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 75.9 bits (186), Expect = 3e-18
Identities = 53/195 (27%), Positives = 86/195 (44%), Gaps = 2/195 (1%)

Query: 2 QGFAGKVAVVTGAGSGIGQALAVELGRAGAKLAISDVDTAGLAQTAEQLAAIGAPVKADR 61
+G GK+A +TGA GIG+A+A L GA +A D + L + L A +A
Sbjct: 4 KGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP 63

Query: 62 LDVTEREAFLAYADAVNEHYGRVNQIYNNAGITFIGSIEDSRFKDIERVVDVDFWGVVNG 121
DV + A + G ++ + N AG+ G I ++ E V+ GV N
Sbjct: 64 ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNA 123

Query: 122 TKAFLPHLIASGDGHVINISSALGLFSAPGQAAYVSAKFAVRGFTEALHQEMLRAGHPVR 181
+++ +++ G ++ + S AAY S+K A FT+ L E+ A + +R
Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLEL--AEYNIR 181

Query: 182 VTTVHPGGIKTAFAR 196
V PG +T
Sbjct: 182 CNIVSPGSTETDMQW 196


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
MAP2541c  OMPTIN  29     0.026    Omptin serine protease signature.
		>OMPTIN#Omptin serine protease signature.

Length = 317

Score = 28.8 bits (64), Expect = 0.026
Identities = 18/62 (29%), Positives = 27/62 (43%), Gaps = 11/62 (17%)

Query: 202 KGKNAAEVVGDQNWIENDFIPTVAKRGAAIIDARGASSAASAASATTDAARDWLLGTPAG 261
K NAA + G NW D +P ++ I A G ++ S D +DW+ + G
Sbjct: 65 KFNNAAIIKGAINW---DLMPQIS------IGAAGWTTLGSRGGNMVD--QDWMDSSNPG 113

Query: 262 DW 263
W
Sbjct: 114 TW 115


88  MAP2545c  MAP2559  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP2545c  -2       9       -1.040855  SugC
MAP2546c  -1       9       -0.220654  SugB
MAP2547c   0       8        1.874047  SugA
MAP2548c   0       8        2.682662  hypothetical protein
MAP2549c   1       9        2.808259  hypothetical protein
MAP2550    0      11        2.707004  hypothetical protein
MAP2551   -1      12        2.903623  hypothetical protein
MAP2552   -1      12        2.750885  hypothetical protein
MAP2553    0      12        2.008584  hypothetical protein
MAP2554c   0      11        0.833064  sec-independent translocase
MAP2555c   1      11        0.001987  hypothetical protein
MAP2556c   0       8        0.200325  hypothetical protein
MAP2557c  -1       9       -0.519306  RNA polymerase sigma factor SigE
MAP2558   -2       9       -0.158576  hypothetical protein
MAP2559   -3       9       -0.959949  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP2545c  PF05272  32     0.004    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 32.4 bits (73), Expect = 0.004
Identities = 14/56 (25%), Positives = 21/56 (37%), Gaps = 9/56 (16%)

Query: 33 LILVGPSGCGKTTTLNMIAGLEDISSGELRIGGERVNEKAPKDRDIAMVFQSYALY 88
++L G G GK+T +N + GL+ S IG +D Y
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIG---------TGKDSYEQIAGIVAY 645


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
MAP2548c  MALTOSEBP  54     5e-10    Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 54.0 bits (129), Expect = 5e-10
Identities = 42/166 (25%), Positives = 67/166 (40%), Gaps = 8/166 (4%)

Query: 133 PGPLSTARWHDRLFAAPVTTNTQLLWYRPDLVLQPPRTWDAVVTEAARLHAAGRPSWIAV 192
P R++ +L A P+ L Y DL+ PP+TW+ + L A G+ A+
Sbjct: 117 PFTWDAVRYNGKLIAYPIAVEALSLIYNKDLLPNPPKTWEEIPALDKELKAKGKS---AL 173

Query: 193 QANEGEGLVVWFNTLLASGGGRVLSEDGRRVTLTDTPAHRAATVNALRILKSVATAPGAD 252
N E W L+A+ GG + + + D A L L + +
Sbjct: 174 MFNLQEPYFTW--PLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMN 231

Query: 253 PSITRTDEGTARLAVEQGRAALAVNWPYALASMLDNAVKGGVPFLP 298
TD A A +G A+ +N P+A +++ + V GV LP
Sbjct: 232 AD---TDYSIAEAAFNKGETAMTINGPWAWSNIDTSKVNYGVTVLP 274


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP2550   FLGMOTORFLIG  31     0.010    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 30.9 bits (70), Expect = 0.010
Identities = 30/171 (17%), Positives = 59/171 (34%), Gaps = 27/171 (15%)

Query: 185 KPVDVADAIRGLPPKRRYEVLKALNDDRLADILQELPELDQAEVLSQLGTERSADVLEEM 244
P ++ + I+ P+ +L L+ + + IL LP Q V ++ ++
Sbjct: 124 DPANILNFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIAL------MDRT 177

Query: 245 DPDDAADLLGVLNPTDAEM-LLKRMDPGDSASVRRLLTHSPDTAGGLMTSNPVVLTPDTA 303
P+ ++ VL A + G +V ++ N +
Sbjct: 178 SPEVVREVERVLEKKLASLSSEDYTSAGGVDNVVEII-------------NMADRKTEKF 224

Query: 304 VAEALARARDPDLTAALSSMVFVVRPPTATPTGRYLGCVPLQRLLREAPAE 354
+ E+L DP+L + +FV +QR+LRE +
Sbjct: 225 IIESL-EEEDPELAEEIKKKMFVFEDIVLLDDR------SIQRVLREIDGQ 268


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
MAP2552   TONBPROTEIN  38     6e-05    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 37.7 bits (87), Expect = 6e-05
Identities = 21/87 (24%), Positives = 24/87 (27%), Gaps = 3/87 (3%)

Query: 369 PPAGPPPFGAPPPFAPPPSGPPPFAPPPSGPPPFGPPPPDAGPVPAPPAGVAPQAAPSPP 428
PP P P P PP P P P P PV P
Sbjct: 60 AVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVES 119

Query: 429 AGPAP---QAPSPAAAGPAPNAPKKPG 452
+P AP+ + A A KP
Sbjct: 120 RPASPFENTAPARLTSSTATAATSKPV 146



Score = 30.3 bits (68), Expect = 0.013
Identities = 19/85 (22%), Positives = 22/85 (25%)

Query: 370 PAGPPPFGAPPPFAPPPSGPPPFAPPPSGPPPFGPPPPDAGPVPAPPAGVAPQAAPSPPA 429
PA P P P P P P P P P AP P+ P P
Sbjct: 41 PAQPISVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKP 100

Query: 430 GPAPQAPSPAAAGPAPNAPKKPGPV 454
P + P + P
Sbjct: 101 KPVKKVQEQPKRDVKPVESRPASPF 125



Score = 29.2 bits (65), Expect = 0.027
Identities = 18/109 (16%), Positives = 24/109 (22%)

Query: 346 PTPQQAPTQVPGCTVICISSQGSPPAGPPPFGAPPPFAPPPSGPPPFAPPPSGPPPFGPP 405
Q P V+ + P PP P P P P
Sbjct: 54 DLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRD 113

Query: 406 PPDAGPVPAPPAGVAPQAAPSPPAGPAPQAPSPAAAGPAPNAPKKPGPV 454
PA P A + A + + P A + P
Sbjct: 114 VKPVESRPASPFENTAPARLTSSTATAATSKPVTSVASGPRALSRNQPQ 162


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
MAP2554c  TATBPROTEIN  78     2e-21    Bacterial sec-independent translocation TatB protein...
		>TATBPROTEIN#Bacterial sec-independent translocation TatB protein signature.

Length = 171

Score = 78.1 bits (192), Expect = 2e-21
Identities = 24/75 (32%), Positives = 42/75 (56%), Gaps = 2/75 (2%)

Query: 5 LSWEHMLVLVVVGLVVLGPERLPGAIRWTSNALRQARDYLSGVTTQLREDLG-PEFDDLR 63
+ + +L++ ++GLVVLGP+RLP A++ + +R R + V +L ++L EF D
Sbjct: 4 IGFSELLLVFIIGLVVLGPQRLPVAVKTVAGWIRALRSLATTVQNELTQELKLQEFQDSL 63

Query: 64 VPLSELQKLRGMTPR 78
+ E L +TP
Sbjct: 64 KKV-EKASLTNLTPE 77


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
MAP2555c  V8PROTEASE  75     1e-16    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 74.7 bits (183), Expect = 1e-16
Identities = 42/201 (20%), Positives = 73/201 (36%), Gaps = 29/201 (14%)

Query: 197 NGEAPAGRFAKVAAATAG---AVVTIESKSDQEGMQGSGVVIDGRGYIVTNNHVISEAAN 253
N P ++ T G V I+ ++ SGVV+ G+ ++TN HV+
Sbjct: 68 NVILPNNDRHQITDTTNGHYAPVTYIQVEAPTGTFIASGVVV-GKDTLLTNKHVVDATHG 126

Query: 254 NPSQFKTTVVFNDGKEVP------ANLVGRDPKTDLAVLKVDNV-------DNLSVARLG 300
+P K + P + + DLA++K + + A +
Sbjct: 127 DPHALKAFPSAINQDNYPNGGFTAEQITKYSGEGDLAIVKFSPNEQNKHIGEVVKPATMS 186

Query: 301 DSDKVRVGDEVLAAGAPLGLRSTVTHGIVSALHRPVPLSGEGSDTDTVIDAVQTDASINH 360
++ + +V + G P +G T +A+Q D S
Sbjct: 187 NNAETQVNQNITVTGYPGDKPVATMW------------ESKGKITYLKGEAMQYDLSTTG 234

Query: 361 GNSGGPLIDMNSQVIGIDTAG 381
GNSG P+ + ++VIGI G
Sbjct: 235 GNSGSPVFNEKNEVIGIHWGG 255


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP2559   HTHTETR  66     5e-15    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 65.8 bits (160), Expect = 5e-15
Identities = 37/193 (19%), Positives = 59/193 (30%), Gaps = 18/193 (9%)

Query: 1 MRSADLTAAARIRDAAIEQFGEHGF-GVGLRRIAEAAGVSAALVIHHFGSKEGLRKACDD 59
+ I D A+ F + G L IA+AAGV+ + HF K L +
Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64

Query: 60 HIAEQIRESKTEALQTNDPAVW----------FGQLAEIEEFAPLIAYVLRSMQTGGDLA 109
I E + E E L+ + + G++A
Sbjct: 65 LSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMA 124

Query: 110 KM------LWRQMIENAEGYLEEGVRAGTIKPSRDPRARAKYLGITGGGGLLLYLQMHDN 163
+ L + + E L+ + A + R RA + GL+
Sbjct: 125 VVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTR-RAAIIMRGYISGLMENWLFAPQ 183

Query: 164 PTDLRAVLRDYSR 176
DL+ RDY
Sbjct: 184 SFDLKKEARDYVA 196


89  MAP2610c  MAP2616c  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP2610c  -2      10        1.759982  hypothetical protein
MAP2611c  -1       8        1.408083  FO synthase
MAP2612c   0       9        1.206029  hypothetical protein
MAP2613c   0      11        1.374587  hypothetical protein
MAP2614    0      10       -0.128119  hypothetical protein
MAP2615c   1       9       -0.247682  hypothetical protein
MAP2616c   1      11       -0.374153  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP2610c  PF05844  28     0.043    YopD protein
		>PF05844#YopD protein

Length = 295

Score = 27.7 bits (61), Expect = 0.043
Identities = 14/32 (43%), Positives = 15/32 (46%), Gaps = 8/32 (25%)

Query: 115 PGGAGRAVDDHAQRARGRPAAAGEGGQATATR 146
PG AGR+V G P AA E Q A R
Sbjct: 22 PGAAGRSV--------GTPQAAAELPQVPAAR 45


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
MAP2612c  RTXTOXINA  32     0.001    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 31.9 bits (72), Expect = 0.001
Identities = 15/54 (27%), Positives = 30/54 (55%)

Query: 37 DSGATDPAIRFVVLALLAVDGVLSALAGALLLPLYIGTVPFPISGLISGLLNAA 90
++GA D ++ + L +V +SA A L+ + + ++G+ISG+L A+
Sbjct: 360 ETGAIDASLTTISTVLASVSSGISAAATTSLVGAPVSALVGAVTGIISGILEAS 413


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP2614   HTHTETR  49     3e-09    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 48.9 bits (116), Expect = 3e-09
Identities = 26/193 (13%), Positives = 60/193 (31%), Gaps = 10/193 (5%)

Query: 38 SRSRRRGEVLERALYEATLAELTEVGYGGLTMEGIAARAHTGKAALYRRWDTKCELVHAA 97
++++ + + + + L ++ G ++ IA A + A+Y + K +L
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 98 LVFALPPVPELRSGRSA----------RENLLAMFTAQRDLLAGKTAFPGIEVIQQLLHE 147
+ + EL A RE L+ + + + I + + E
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 148 PEMRAIFADAVVRPRLKIVESILQSAVQDGDLDPKSITPLTARIGSALINQHFLLNGSPP 207
+ + +E L+ ++ L +T A I I+ P
Sbjct: 123 MAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAP 182

Query: 208 NRRELALIVDTVI 220
+L +
Sbjct: 183 QSFDLKKEARDYV 195


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
MAP2616c  TCRTETOQM  172    6e-48    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 172 bits (437), Expect = 6e-48
Identities = 102/442 (23%), Positives = 171/442 (38%), Gaps = 61/442 (13%)

Query: 53 RNVAIVAHVDHGKTTLVDAMLRQSGALHHRGD-DTQERILDSGDLEKEKGITILAKNTAV 111
N+ ++AHVD GKTTL +++L SGA+ G D D+ LE+++GITI T+
Sbjct: 4 INIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSF 63

Query: 112 HRHHPDGSVTVINVIDTPGHADFGGEVERGLSMVDGVLLLVDASEGPLPQTRFVLRKALA 171
+ T +N+IDTPGH DF EV R LS++DG +LL+ A +G QTR +
Sbjct: 64 QWEN-----TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRK 118

Query: 172 AHLPVILVVNKTDRPDARIAEVVEASHDLLLDVA----------------SDLDDEAAKA 215
+P I +NK D+ ++ V + + L ++
Sbjct: 119 MGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTV 178

Query: 216 AEHALGLPTLYASG------------RAGVAS-TEQPA-DGEVPAGENLDPLFDVLLEHI 261
E L Y SG + + P G +D L +V+
Sbjct: 179 IEGNDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITNKF 238

Query: 262 PAPSGDPEAPLQALVTNLDASTFLGRLALIRIYNGKLRKGQQVAWMREVDGLPVITSAKI 321
+ + ++ L V ++ S RLA IR+Y+G L V + KI
Sbjct: 239 YSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVR-------ISEKEKIKI 291

Query: 322 TELLATEGVERSPTDEAIAGDIVAVAGLP---EIMIGDTLADPDHAHALPRITVDEPAIS 378
TE+ + E D+A +G+IV + ++GDT P I P +
Sbjct: 292 TEMYTSINGELCKIDKAYSGEIVILQNEFLKLNSVLGDTKLLPQRER----IENPLPLLQ 347

Query: 379 VTIGTNTSPLAGKVPGHKLTARLVRNRLDQELVGNVSIRVVDIGRPDAWEVQGRGELALA 438
T+ + L L + + +R + G++ +
Sbjct: 348 TTVEPSKPQQRE-----MLLDAL--LEISDS---DPLLRYYVDSATHEIILSFLGKVQME 397

Query: 439 VLVEQMRRE-GFELTVGKPQVV 459
V ++ + E+ + +P V+
Sbjct: 398 VTCALLQEKYHVEIEIKEPTVI 419



Score = 42.9 bits (101), Expect = 4e-06
Identities = 18/84 (21%), Positives = 33/84 (39%), Gaps = 1/84 (1%)

Query: 466 QLHEPFEAMTIDCPEEFVGAITQLMAARKGRMEEMTNHAAGWVRMDFIVPSRGLIGFRTD 525
+L EP+ + I P+E++ + + T V + +P+R + +R+D
Sbjct: 534 ELLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVD-TQLKNNEVILSGEIPARCIQEYRSD 592

Query: 526 FLTITRGTGIANAVFDGYRPWAGE 549
T G + GY GE
Sbjct: 593 LTFFTNGRSVCLTELKGYHVTTGE 616


90  MAP2656  MAP2664  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP2656    1      10        0.388214  hypothetical protein
MAP2657    0      10       -0.753397  hypothetical protein
MAP2658   -1      10       -0.861055  hypothetical protein
MAP2659   -2      10        0.136180  hypothetical protein
MAP2660   -2       8        0.544295  hypothetical protein
MAP2661   -2       9        1.206648  5-methyltetrahydropteroyltriglutamate--
MAP2662c  -1      10        1.453048  hypothetical protein
MAP2663c   0      10        2.600212  hypothetical protein
MAP2664   -2      10        2.454034  pyruvate phosphate dikinase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP2656   SACTRNSFRASE  30     0.008    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 30.3 bits (68), Expect = 0.008
Identities = 11/43 (25%), Positives = 22/43 (51%)

Query: 208 RLATASADDVAAVVDDVLIAACADALGAARAVMDLAVEYAKTR 250
R+ S + A+++D+ +A G A++ A+E+AK
Sbjct: 79 RIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKEN 121


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP2658   HTHTETR  66     1e-14    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 66.2 bits (161), Expect = 1e-14
Identities = 41/203 (20%), Positives = 70/203 (34%), Gaps = 20/203 (9%)

Query: 18 RRAEILATAASLIASSGLR-TSLQEIADAAGILPGSLYHHFESKEAILVELTRRYQEDLE 76
R IL A L + G+ TSL EIA AAG+ G++Y HF+ K + E+ ++
Sbjct: 12 TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI---WELSES 68

Query: 77 RIGRAAQARLDEPDSRPVPEQIIELGSAIANCAVE-HRAALQMSFY---EGPGTDPELTK 132
IG + P+ L + + E R L + E G + +
Sbjct: 69 NIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQ 128

Query: 133 LTRQRPVAIQEAMLQTLRAGRWSGYIKPDIDLPTLADRICQTMLQVGLDVMRHTASADP- 191
R + + + QTL+ + + D+ A + + + P
Sbjct: 129 AQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISG-----LMENWLFAPQ 183

Query: 192 ------VAELLCRIILQGLASRP 208
A I+L+ P
Sbjct: 184 SFDLKKEARDYVAILLEMYLLCP 206



Score = 46.2 bits (109), Expect = 6e-08
Identities = 19/145 (13%), Positives = 40/145 (27%), Gaps = 9/145 (6%)

Query: 240 DKAALVRAVARAEFGRRGYEVTTIRDIAAAAGLGTGTVYRVIGSKDELLDSIM-RSFGKK 298
+ ++ A F ++G T++ +IA AAG+ G +Y K +L I S
Sbjct: 12 TRQHILDV-ALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNI 70

Query: 299 VEAGWVSVLRSDATPTEKLDALSWVNVNALDQFSDEFRIQLAWMRLS------PPTANPG 352
E + P L + + + +
Sbjct: 71 GELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQ 130

Query: 353 WSYATRLRQ-MKSLLSEGLRTGEIA 376
+ ++ L + +
Sbjct: 131 RNLCLESYDRIEQTLKHCIEAKMLP 155


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP2659   DHBDHDRGNASE  59     2e-12    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 59.3 bits (143), Expect = 2e-12
Identities = 62/286 (21%), Positives = 102/286 (35%), Gaps = 71/286 (24%)

Query: 10 DGKRALVVGGATGMGAAAAKSAAELGAEVIVLDYAPVTYDV-----------AKSIQVDL 58
+GK A + G A G+G A A++ A GA + +DY P + A++ D+
Sbjct: 7 EGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADV 66

Query: 59 RDPASIDAALEQLD---GPVHAVFSAAGIAE-GTTDLM---------AINFLGHRYLIER 105
RD A+ID +++ GP+ + + AG+ G + ++N G
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126

Query: 106 LLERNQLPSGSAICFISSVAGMGWENDLDLLNEFLATPDFATAQEWVKAHEPE-GIIHYG 164
+ + +I + S A P + Y
Sbjct: 127 VSKYMMDRRSGSIVTVGSNP----------------------------AGVPRTSMAAYA 158

Query: 165 FSKKVVNAYVATQGYPLLKKGIRINAICPGPTDTPLAQANADLWLT----------FAQD 214
SK + G L + IR N + PG T+T + + LW +
Sbjct: 159 SSKAAAVMFTKCLGLELAEYNIRCNIVSPGSTETDMQWS---LWADENGAEQVIKGSLET 215

Query: 215 YRDETG---SKVHTPEQMGDVMAFLNSAAAFGINGITLLVDYGHTM 257
++ TG K+ P + D + FL S A I L VD G T+
Sbjct: 216 FK--TGIPLKKLAKPSDIADAVLFLVSGQAGHITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP2660   NUCEPIMERASE  32     9e-04    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 32.4 bits (74), Expect = 9e-04
Identities = 24/164 (14%), Positives = 50/164 (30%), Gaps = 31/164 (18%)

Query: 3 RVVIVGGHGKVALQLSAILTQRGDAVTSL---------FRNPDHADDVAATGAKPVVADI 53
+ ++ G G + +S L + G V + + +A G + D+
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 54 ERLDTDALA------GHDAVVFSAGAGG-----GNPARTYAVDRDAAIRVVDAAARSGVK 102
D + + + V S NP + + +++ + ++
Sbjct: 62 A--DREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQ 119

Query: 103 RFVMVS---YFGAGP----DHGVPQDDPFFPYAESKAAAD--AH 137
+ S +G D P YA +K A + AH
Sbjct: 120 HLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAH 163


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP2662c  TCRTETB  30     0.036    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 29.8 bits (67), Expect = 0.036
Identities = 38/234 (16%), Positives = 81/234 (34%), Gaps = 32/234 (13%)

Query: 159 LAALFADGTGPVPELGTTIGLLPAWKIVLILLVLAVLG--LRDKVIFLAAR--------G 208
+ ++ A G G P +G I W +L++ ++ ++ K++ R G
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKG 203

Query: 209 EVYASMAVTFLFGAPDMIVGSKVVFLVIWMGAATSKLNKHFPFVISTMMSNNPLVRPRWL 268
+ S+ + F + FL++ + + FV +P V P
Sbjct: 204 IILMSVGIVFFMLFTTS---YSISFLIVSVLS-------FLIFVKHIRKVTDPFVDPGLG 253

Query: 269 KRRFFEKFPDDLRPGRLSRWIAHLSTA-IEMLVPLPLFFCHGGWPVTVAAVVMVCFHLGI 327
K F G L I + A +VP + H + +V++ + +
Sbjct: 254 KNIPFMI-------GVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSV 306

Query: 328 LVSIPMG---VPLEWNVFMIF-GVLSLFVAHTGIGLGDLRHPVVVAVLFVVIAG 377
++ +G V ++++ GV L V+ + ++ V + G
Sbjct: 307 IIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLG 360


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP2664   PHPHTRNFRASE  72     3e-15    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 71.7 bits (176), Expect = 3e-15
Identities = 26/83 (31%), Positives = 40/83 (48%), Gaps = 2/83 (2%)

Query: 401 EALHAAERGEPVILVRDHTRPEDVSGMLA--ARGIVTEVGGAASHAAVVSRELGRVAVVG 458
E A E +++ + P D + + +G T++GG SH+A++SR L AVVG
Sbjct: 146 ETGSLATIAEETVIIAEDLTPSDTAQLNKQFVKGFATDIGGRTSHSAIMSRSLEIPAVVG 205

Query: 459 CGAGVAAMLAGRRITVDGAEGEV 481
+ G + VDG EG V
Sbjct: 206 TKEVTEKIQHGDMVIVDGIEGIV 228


91  MAP2774c  MAP2780  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP2774c  -2      10       -0.023831  hypothetical protein
MAP2775   -1       9       -0.280566  hypothetical protein
MAP2776c  -1      10       -1.221674  hypothetical protein
MAP2777c  -1      12       -0.444007  hypothetical protein
MAP2778c  -2      11       -0.699607  hypothetical protein
MAP2779   -3      14        0.348796  short chain dehydrogenase
MAP2780   -2      16        0.339082  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP2774c  DHBDHDRGNASE  66     1e-14    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 66.2 bits (161), Expect = 1e-14
Identities = 53/206 (25%), Positives = 90/206 (43%), Gaps = 12/206 (5%)

Query: 17 LQGKVAVITGGAGGIGRALGRRLGHEGMKVVLADVLADPLQEATRALADEGIEAAGVVTD 76
++GK+A ITG A GIG A+ R L +G + D + L++ +L E A D
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 77 VTDYSSVEALAKEALRRFGAVDVVCNNAGTGAVSEGYLWEHDLADWRWGIDVNVLGVIHG 136
V D ++++ + R G +D++ N AG + G + +W VN GV +
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGV--LRPGLIHSLSDEEWEATFSVNSTGVFNA 123

Query: 137 LKAFVPILLERGEGHVVNTCSGNGGFAPIARGAMGGPATAVYPMTKAAVLCLTESLYTHL 196
++ +++R G +V S G + A Y +KAA + T+ L L
Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAA--------YASSKAAAVMFTKCL--GL 173

Query: 197 EMTGTRVRAHVLFPGGFLNTGIWESW 222
E+ +R +++ PG W W
Sbjct: 174 ELAEYNIRCNIVSPGSTETDMQWSLW 199


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP2778c  HTHTETR  48     4e-09    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 48.1 bits (114), Expect = 4e-09
Identities = 23/119 (19%), Positives = 45/119 (37%), Gaps = 13/119 (10%)

Query: 19 ILGIVVDMLDTGGYEAVQLREVARRARVSMATIYKRYRTRDELIVAALEGWMDANRYARL 78
IL + + + G + L E+A+ A V+ IY ++ + +L E + +
Sbjct: 16 ILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWEL-----SESNI 70

Query: 79 PSLIDELPGESMYSDLMHVMRTIFE-------PWERHPLMLRSYFQARSGPGGKRLIRR 130
L E + D + V+R I ER L++ F G ++++
Sbjct: 71 GELELEYQAK-FPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQ 128


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP2779   DHBDHDRGNASE  88     1e-22    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 87.8 bits (217), Expect = 1e-22
Identities = 75/275 (27%), Positives = 123/275 (44%), Gaps = 31/275 (11%)

Query: 4 LDGKVAFITGVARGQGRSHAVRLAREGANIIGIDICADIAANGYPMACRAELDQTVALVE 63
++GK+AFITG A+G G + A LA +GA+I +D E + V
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVD-------------YNPEKLEKVVSSL 52

Query: 64 EAGGKMLGTV-ADVRDFGQVKAALDAGVEQFGRLDIVLANAGIA-PLAFRQLSIEEELAQ 121
+A + ADVRD + + G +DI++ AG+ P LS E +
Sbjct: 53 KAEARHAEAFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDE----E 108

Query: 122 WRAVTGVNLDGAYHTAWAAIPHLLAGNRGGVIIFTSSTAGIKGFGGLQGGGLGYAASKHG 181
W A VN G ++ + + +++ G ++ S+ AG+ + YA+SK
Sbjct: 109 WEATFSVNSTGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVP-----RTSMAAYASSKAA 163

Query: 182 IVGLMRTLADALAPLNIRVNTVHPTAVNTMMA----TNDDMIEFLQKNPGAGPHLQNPMP 237
V + L LA NIR N V P + T M +++ E Q G+ + +P
Sbjct: 164 AVMFTKCLGLELAEYNIRCNIVSPGSTETDMQWSLWADENGAE--QVIKGSLETFKTGIP 221

Query: 238 VGML-EPEDVSAAIAYLVSDEARYVTGVTFPVDAG 271
+ L +P D++ A+ +LVS +A ++T VD G
Sbjct: 222 LKKLAKPSDIADAVLFLVSGQAGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP2780   DHBDHDRGNASE  103    1e-28    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 103 bits (259), Expect = 1e-28
Identities = 82/272 (30%), Positives = 123/272 (45%), Gaps = 27/272 (9%)

Query: 10 VAGKRVLITGAARGMGRSHAVRLAEQGADCILVDICCTPTGLDYPLATEEDLNETVRLVE 69
+ GK ITGAA+G+G + A LA QGA +DY E + +++
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHI---------AAVDYNPEKLEKVVSSLKAEA 56

Query: 70 KHGSRAVPKIVDVRDEAAMKAAVDAAVDELGGLDGAVANAGVLTVGTWDTTTAEQWRLVL 129
+H + A P DVRD AA+ E+G +D V AGVL G + + E+W
Sbjct: 57 RH-AEAFP--ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATF 113

Query: 130 DVNLIGAWNTCAAALPHLIGGGGGSLVNISSSAGIKGTPL--HLPYTASKHGIVGMTLAL 187
VN G +N + +++ GS+V + S+ G P Y +SK V T L
Sbjct: 114 SVNSTGVFNASRSVSKYMMDRRSGSIVTVGSNPA--GVPRTSMAAYASSKAAAVMFTKCL 171

Query: 188 ANELAAQNIRVNTVHPTGVATGMAPPGMHALIAEQ-------RPDLVPIFLNALPAPLIE 240
ELA NIR N V P T M +L A++ + L L +
Sbjct: 172 GLELAEYNIRCNIVSPGSTETDM----QWSLWADENGAEQVIKGSLETFKTGIPLKKLAK 227

Query: 241 ASDVSNAVLYLISDESRYVTGLELKVDAGVTI 272
SD+++AVL+L+S ++ ++T L VD G T+
Sbjct: 228 PSDIADAVLFLVSGQAGHITMHNLCVDGGATL 259


92  MAP2973  MAP2981c  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP2973   0       12       0.841653   hypothetical protein
MAP2974c  0       11       0.622961   tRNA (guanine-N(1)-)-methyltransferase
MAP2975c  0       12       0.167199   16S rRNA-processing protein RimM
MAP2976c  -1      11       -0.389842  hypothetical protein
MAP2977c  0       9        0.169549   30S ribosomal protein S16
MAP2978c  0       10       0.361031   hypothetical protein
MAP2979   0       11       0.960405   DacB
MAP2980c  1       10       1.382425   hypothetical protein
MAP2981c  0       10       1.919597   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
MAP2973   PERTACTIN  33     0.002    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 33.2 bits (75), Expect = 0.002
Identities = 27/77 (35%), Positives = 32/77 (41%), Gaps = 1/77 (1%)

Query: 22 GSEALVQAKVYHLPVAAPVRVLTQPPPPQSAALMLEAATPPQPPQALPNAGFAQLPA-RI 80
G +LV AK P AP P P + PPQPPQ P A Q PA R
Sbjct: 558 GQWSLVGAKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAGRE 617

Query: 81 QQAADQAAASGAALSVA 97
AA AA + + +A
Sbjct: 618 LSAAANAAVNTGGVGLA 634


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
MAP2977c  IGASERPTASE  27     0.041    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 27.3 bits (60), Expect = 0.041
Identities = 12/79 (15%), Positives = 23/79 (29%), Gaps = 3/79 (3%)

Query: 94 KVAPPKPSKLELFNAALAEAEGGPTT---EAAKPKKKAATSGAKKAAKAAEPEAAASEAA 150
+V+P + + A E PT E A + ++ E +E+
Sbjct: 1128 QVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTEST 1187

Query: 151 EPEAAAAPAEGGEQAESSA 169
+ E E +
Sbjct: 1188 TVNTGNSVVENPENTTPAT 1206


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
MAP2979   BLACTAMASEA  42     8e-07    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 42.5 bits (100), Expect = 8e-07
Identities = 30/99 (30%), Positives = 41/99 (41%), Gaps = 7/99 (7%)

Query: 47 DLDSGQVLAGRDPNVAHPPASTIKTLLAQVVLDEV-----SLDATVVADAADTQVECNCV 101
DL SG+ L + P ST K +L VL V L+ + D V+ + V
Sbjct: 46 DLASGRTLTAWRADERFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQQDL-VDYSPV 104

Query: 102 GIK-PGRSYTARQLLDALLLVSGNDAANTLAHMLGGPEA 139
K T +L A + +S N AAN L +GGP
Sbjct: 105 SEKHLADGMTVGELCAAAITMSDNSAANLLLATVGGPAG 143


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP2980c  HTHTETR  53     7e-11    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 52.7 bits (126), Expect = 7e-11
Identities = 26/104 (25%), Positives = 44/104 (42%), Gaps = 2/104 (1%)

Query: 1 MARTQQQRREETVARLLDASIATIIEVGYARASAAVITKRAGVSVGALFRHFDTMGDFMA 60
MAR +Q +ET +LD ++ + G + S I K AGV+ GA++ HF D +
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 61 ATASEVLRRQLESFTKRVADIPAD--QPVLEVVLGILRDLTSGP 102
E + A P D + E+++ +L +
Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEE 104


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
MAP2981c  UREASE  35     0.001    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 34.7 bits (80), Expect = 0.001
Identities = 26/106 (24%), Positives = 45/106 (42%), Gaps = 24/106 (22%)

Query: 18 DAMPYDVIIRDGLWFDGTGGAALTRTLGIRDGVLVDVAESLDEAGCP------------- 64
+ D +I + L D G + +G++DG + ++ +AG P
Sbjct: 64 EGGAVDTVITNALILDHWG--IVKADIGLKDGRIA----AIGKAGNPDMQPGVTIIVGPG 117

Query: 65 -EVIDAAGKWVLPGFIDVHTHYDAEVLLDPGLRESVRHGVTTVLLG 109
EVI GK V G +D H H+ ++ E++ G+T +L G
Sbjct: 118 TEVIAGEGKIVTAGGMDSHIHFICPQQIE----EALMSGLTCMLGG 159


93  MAP3049c  MAP3053c  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP3049c  -2      27       -0.958547  hypothetical protein
MAP3050c  0       31       2.477811   hypothetical protein
MAP3051   0       27       2.776388   hypothetical protein
MAP3052c  1       25       2.651100   hypothetical protein
MAP3053c  1       24       2.715003   cystathionine gamma-lyase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP3049c  ACRIFLAVINRP  56     5e-10    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 56.4 bits (136), Expect = 5e-10
Identities = 53/276 (19%), Positives = 101/276 (36%), Gaps = 39/276 (14%)

Query: 144 QGDALANESVDAIRNIVEHTPP--PPGVKAYVT-GAAPLVTDQFEVGSKGIFKVTVITVL 200
A A ++ AI+ + P P G+K P V K +F+ ++ L
Sbjct: 294 ATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFL 353

Query: 201 VILAMLLWVYRSVTAVFVLVTVIVEMAAARGIVAFLGSVGLIGLSTYATNLLTL--LVIA 258
V+ L +++ A + + V LG+ ++ Y+ N LT+ +V+A
Sbjct: 354 VMYLFL----QNMRATLIPTIAVP--------VVLLGTFAILAAFGYSINTLTMFGMVLA 401

Query: 259 AGT--DYAIFFVGRYHEARHEGQD--RETAYYTMYRGTTHVVLGSGLTVAGAVLCLRF-- 312
G D AI V E + +E +M ++G + ++ + + F
Sbjct: 402 IGLLVDDAIVVVENVERVMMEDKLPPKEATEKSM-SQIQGALVGIAMVLSAVFIPMAFFG 460

Query: 313 -TRLNYFQSLGIPAAIGIGVALAAALSLTPAVITV-------------GSLFGLFDPKRQ 358
+ ++ I + +++ AL LTPA+ G FG F+
Sbjct: 461 GSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFD 520

Query: 359 MATRGWRRIGTAIVRWPGPILVVATG-VALVGLLAL 393
+ + I+ G L++ VA + +L L
Sbjct: 521 HSVNHYTNSVGKILGSTGRYLLIYALIVAGMVVLFL 556



Score = 41.7 bits (98), Expect = 2e-05
Identities = 27/167 (16%), Positives = 62/167 (37%), Gaps = 17/167 (10%)

Query: 760 DLLIAGIAALSLILLIMVLITRSLVAAIVIVGTVALSLGASFGLSVLVWQDILGIKLYWI 819
+++ A+ L+ L+M L +++ A ++ V + L +F + G + +
Sbjct: 339 EVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAA-----FGYSINTL 393

Query: 820 CLALSVIILLAVGSDYNLLL--ISRFREEIHAGLNTGIIRSMAGSGAVVTSAGLVFAFTM 877
+ V+ + + D +++ + R E +SM+ + +V
Sbjct: 394 TMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMV---LS 450

Query: 878 ASFVSASLL------VLGQIGTTIALGLLFDTLIVRSFMTPSVAALL 918
A F+ + + Q TI + L+ TP++ A L
Sbjct: 451 AVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALIL-TPALCATL 496


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP3050c  TYPE3IMQPROT  27     0.009    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 27.0 bits (60), Expect = 0.009
Identities = 10/59 (16%), Positives = 22/59 (37%), Gaps = 7/59 (11%)

Query: 5 SIASLVKRG-WMVLVVVVVVGVAGFCVYRLHGIFGSHNNTSAAGGISDQTEPFSPKHIT 62
+ + ++VL++ + + L G+F + +QT PF K +
Sbjct: 3 DLVFAGNKALYLVLILSGWPTIVATIIGLLVGLF------QTVTQLQEQTLPFGIKLLG 55


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP3052c  HTHTETR  68     3e-16    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 67.7 bits (165), Expect = 3e-16
Identities = 29/163 (17%), Positives = 56/163 (34%), Gaps = 11/163 (6%)

Query: 18 WSPREAEILAVTLRLLQEHGYDQLTVDAVAGAAHASKATVYRRWPSKAELVLAAFIEGVR 77
IL V LRL + G ++ +A AA ++ +Y + K++L +
Sbjct: 9 AQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSES 68

Query: 78 QV--AVPPNTGTLRGDLLALGETVCEQVGHHASTI--RAVMFEVSRHPAL-------NDA 126
+ GD L++ + V T R ++ E+ H
Sbjct: 69 NIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQ 128

Query: 127 LQHQFLDQRRALIEHVLHQAVDRGEISADAISDELWDLLPGYL 169
Q + IE L ++ + AD ++ ++ GY+
Sbjct: 129 AQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYI 171


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP3053c  CHLAMIDIAOM6  32     0.005    Chlamydia cysteine-rich outer membrane protein 6 si...
		>CHLAMIDIAOM6#Chlamydia cysteine-rich outer membrane protein 6 signature.

Length = 547

Score = 31.6 bits (71), Expect = 0.005
Identities = 15/39 (38%), Positives = 19/39 (48%)

Query: 91 PPGSTVVVPADGYYQVRRHAAEDLAPAGVTVIEASSAQI 129
P + V G +R ED GVTV+EA+ AQI
Sbjct: 332 PVEYVISVSNPGDLVLRDVVVEDTLSPGVTVLEAAGAQI 370


94  MAP3092  MAP3099c  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP3092   2       14       -0.237689  hypothetical protein
MAP3093   2       13       -2.270390  AdhC
MAP3094c  1       15       -2.203808  hypothetical protein
MAP3095c  3       14       -4.233493  ribonucleotide-diphosphate reductase subunit
MAP3096   2       13       -4.041127  hypothetical protein
MAP3097   2       13       -4.530240  hypothetical protein
MAP3098c  2       13       -5.384462  hypothetical protein
MAP3099c  3       15       -5.553918  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP3092   FERRIBNDNGPP  54     2e-10    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 54.2 bits (130), Expect = 2e-10
Identities = 61/274 (22%), Positives = 93/274 (33%), Gaps = 36/274 (13%)

Query: 91 SADPQRIVVLAGDQLDALCALGLQSRVVGAALPDGASGQPAYLGGAV-----RGVPGVGS 145
+ DP RIV L ++ L ALG+ P G + Y V VG
Sbjct: 32 AIDPNRIVALEWLPVELLLALGIV--------PYGVADTINYRLWVSEPPLPDSVIDVGL 83

Query: 146 RSHPDVKAIAAAHPDLILGSQGLTPALYPQLAAIAPT-VFTAA----PGAAWRDNLRAVG 200
R+ P+++ + P ++ S G P+ LA IAP F + P A R +L +
Sbjct: 84 RTEPNLELLTEMKPSFMVWSAGYGPS-PEMLARIAPGRGFNFSDGKQPLAMARKSLTEMA 142

Query: 201 AATARAGAVDGLLS---GFSQRAGDVGARHDASHFQASIVQLTTG-SIRVFGANNFPASV 256
A + L+ F + + A + L + VFG N+ +
Sbjct: 143 DLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPL--LLTTLIDPRHMLVFGPNSLFQEI 200

Query: 257 LGAVGVDRPAAQRFTDKPYLEIGATDADLAKNPDLSVADADVVYLSCATPAAADRAATVL 316
L G+ A Q T+ + D LA D DV+ D
Sbjct: 201 LDEYGI-PNAWQGETNFWGSTAVSID-RLAA-----YKDVDVLCFDHDNSKDMDALMA-- 251

Query: 317 DSGPWRKLSANRDNRVYVVNDEIWQTGQGLIAAR 350
+ W+ + R R V +W G L A
Sbjct: 252 -TPLWQAMPFVRAGRFQRVPA-VWFYGATLSAMH 283


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP3094c  PF06057  28     0.014    Type IV secretory pathway VirJ component
		>PF06057#Type IV secretory pathway VirJ component

Length = 243

Score = 27.9 bits (62), Expect = 0.014
Identities = 11/41 (26%), Positives = 17/41 (41%), Gaps = 5/41 (12%)

Query: 71 NRDEIAEFISGMTHYDAGPENIIR-VAARLAAAGWPLAGID 110
+ + F+SG D G + + V L GWP+ G
Sbjct: 49 TKPPLVIFLSG----DGGWATLDKAVGGILQQQGWPVVGWS 85


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP3097   HTHTETR  48     2e-09    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 48.1 bits (114), Expect = 2e-09
Identities = 31/192 (16%), Positives = 57/192 (29%), Gaps = 18/192 (9%)

Query: 7 RQRRRELLDALIAEFAAGGIGDRSLRRVAEAVGTSHRMLLHHFGSREGLLLAIVEEVERR 66
++ R+ +LD + F+ G+ SL +A+A G + + HF + L I E E
Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESN 69

Query: 67 QMRVLTELPRAPAEGFAAMWAD-----LRRPELREFERLFFECYSR---AAQGEKPFARM 118
+ E ++ + L E RL E +
Sbjct: 70 IGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQA 129

Query: 119 LPDAVDDWLR---------QAETHSGAPFDPAMAR-LGLAVIRGLLLDLVATGDEAGVDA 168
+ + A A + I GL+ + + +
Sbjct: 130 QRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKK 189

Query: 169 AARAFVNLLNAG 180
AR +V +L
Sbjct: 190 EARDYVAILLEM 201


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP3099c  HTHTETR  57     5e-12    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 56.9 bits (137), Expect = 5e-12
Identities = 41/195 (21%), Positives = 62/195 (31%), Gaps = 7/195 (3%)

Query: 23 RWREHRKKVRNEIVDAAFRAIDRLGPE-LSVREIAEEAGTAKPKIYRHFHDKSDLFQAIG 81
+ ++ ++ R I+D A R + G S+ EIA+ AG + IY HF DKSDLF I
Sbjct: 4 KTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIW 63

Query: 82 ERLRDMLWTAIFPSIDLKTDSAREVIRRSVEEYVTLVDKHPNVLRVF-IQGRSTGTPQST 140
E + V+R + + + I
Sbjct: 64 ELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEM 123

Query: 141 VTILNEGREITLAMADLFDNELREM----ELD-HAAVELAAHAAFGSAASATEWWLGPEP 195
+ R + L D + L+ L AA G + E WL
Sbjct: 124 AVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQ 183

Query: 196 DSPRLMSRAQFVAHL 210
+VA L
Sbjct: 184 SFDLKKEARDYVAIL 198


95  MAP3191  MAP3197  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP3191   -2      11       -0.594112  hypothetical protein
MAP3192   -2      11       0.031728   hypothetical protein
MAP3193   -1      12       0.627146   hypothetical protein
MAP3194   0       12       -0.295698  hypothetical protein
MAP3195   0       12       -1.013846  hypothetical protein
MAP3196   0       12       -1.307723  FadE12_3
MAP3197   2       12       -1.422353  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP3191   PF06872  31     0.008    EspG protein
		>PF06872#EspG protein

Length = 398

Score = 30.8 bits (69), Expect = 0.008
Identities = 14/50 (28%), Positives = 26/50 (52%)

Query: 163 PDAETTLLMIEATDEFRVPPPGPLGRHFPFDPSQATIPEPQALDDDPAHD 212
P+ E L+++EA ++ R+ P+ RH P+ T+P+ + D H
Sbjct: 122 PNNEKFLVLLEANEQNRLLQSLPINRHMPYIQVHHTLPQEELTDLLSMHK 171


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP3192   PF06057  33     0.002    Type IV secretory pathway VirJ component
		>PF06057#Type IV secretory pathway VirJ component

Length = 243

Score = 32.5 bits (74), Expect = 0.002
Identities = 19/84 (22%), Positives = 34/84 (40%), Gaps = 3/84 (3%)

Query: 40 PKNKPSDTVLVFMHPIGGGAYLP--MINALARAGHHVIYCNSRFRGTDSALLMEKVVEDL 97
+ +++F+ GG A L + L + G V+ +S + + V +D
Sbjct: 45 ASSHTKPPLVIFLSGDGGWATLDKAVGGILQQQGWPVVGWSS-LKYYWKQKDPKDVTQDT 103

Query: 98 GECIKDAKKRLGYTKVVLAGWSGG 121
I + G KV+L G+S G
Sbjct: 104 LAIIDKYQAEFGTQKVILIGYSFG 127


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP3195   HTHTETR  75     5e-19    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 75.4 bits (185), Expect = 5e-19
Identities = 33/164 (20%), Positives = 56/164 (34%), Gaps = 9/164 (5%)

Query: 14 AKGRQTRQAIEQAARKLFAERGFHGTTLADITSAAGKSPAVFYRYFADKEDLLAALAE-S 72
+ ++TRQ I A +LF+++G T+L +I AAG + Y +F DK DL + + E S
Sbjct: 7 QEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELS 66

Query: 73 FLHEVVTPSGLSVHLPDSPDD------DAFFTAVVTGYWNMFKQNIGIMIAVAQLAATQQ 126
+ P P + VT + + I+ +
Sbjct: 67 ESNIGELELEYQAKFPGDPLSVLREILIHVLESTVT--EERRRLLMEIIFHKCEFVGEMA 124

Query: 127 RFAAVQNEFRRFGIDLVAASVRRAQEQGYGAELHPQHTAAAIAL 170
Q D + +++ E AA I
Sbjct: 125 VVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMR 168


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
MAP3197   RTXTOXIND  29     0.028    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 28.6 bits (64), Expect = 0.028
Identities = 15/111 (13%), Positives = 36/111 (32%), Gaps = 16/111 (14%)

Query: 6 QAQAAAAAAGAPVPHILVADNSPAALGNPFLICDEIKGETIARRIQRRLDAADGHAARAH 65
+++ + V I+V + G+ L + E + Q L A R
Sbjct: 95 RSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQ 154

Query: 66 LLGQCAHAVAAIHRAEIDRIQDPGLRR-------------QDQLTEWRQRL 103
+L + ++ E+ +P + ++Q + W+ +
Sbjct: 155 IL---SRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQK 202


96  MAP3215c  MAP3223c  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias   Product
MAP3215c  -1      13       0.431416  hypothetical protein
MAP3216   1       12       0.844843  Omt
MAP3217c  1       11       1.741632  hypothetical protein
MAP3218c  1       12       1.221999  MoxR3
MAP3219c  1       12       0.807219  hypothetical protein
MAP3220c  0       10       0.726291  hypothetical protein
MAP3221   1       8        0.324277  hypothetical protein
MAP3222c  0       10       0.443861  hypothetical protein
MAP3223c  0       9        1.132353  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP3215c  HTHTETR  57     3e-12    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 57.3 bits (138), Expect = 3e-12
Identities = 26/140 (18%), Positives = 47/140 (33%), Gaps = 2/140 (1%)

Query: 12 RARIRHAALREFGEKGYEGATIRSIAAAAGVSSGLLRHHFGSKQELRQACDDYLVKTMRD 71
R I ALR F ++G ++ IA AAGV+ G + HF K +L + + +
Sbjct: 13 RQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGE 72

Query: 72 LNAQVQDNAKRGDVHYVSARIPLGQHQDYITRALVEGGAGELFDALVSMTEEWLASADEH 131
L + Q + + + + + +T +F E
Sbjct: 73 LELEYQAKFPGDPLSVLREIL-IHVLESTVTEERRRLLMEIIF-HKCEFVGEMAVVQQAQ 130

Query: 132 RAEPADVDAKSRATLITAMA 151
R + + TL +
Sbjct: 131 RNLCLESYDRIEQTLKHCIE 150


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
MAP3218c  HTHFIS  36     2e-04    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 35.6 bits (82), Expect = 2e-04
Identities = 29/138 (21%), Positives = 51/138 (36%), Gaps = 16/138 (11%)

Query: 27 RNALRLILTAVLARGHILIEDLPGLGKTLIARS---FAAALGLQFTRVQ---FTPDLLPA 80
+ R++ + ++I G GK L+AR+ + F + DL+ +
Sbjct: 147 QEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIES 206

Query: 81 DLLG------STIYDMQSGRFAFRAGPIFTNLLLADEINRTPPKTQAALLEAMAEGQVSI 134
+L G + +GRF G L DEI P Q LL + +G+ +
Sbjct: 207 ELFGHEKGAFTGAQTRSTGRFEQAEG----GTLFLDEIGDMPMDAQTRLLRVLQQGEYTT 262

Query: 135 DGETHRLPKPFIVLATDN 152
G + ++A N
Sbjct: 263 VGGRTPIRSDVRIVAATN 280


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
MAP3220c  PERTACTIN  29     0.030    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 28.9 bits (64), Expect = 0.030
Identities = 18/54 (33%), Positives = 20/54 (37%)

Query: 113 LARLFASHGLSPAPPPGTGASPPPHPAAPPAPPPQHPQRPRDGSQDTLGILLAG 166
L A PAP PG P P P PPQ PQ P+ + AG
Sbjct: 562 LVGAKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAG 615


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP3222c  DHBDHDRGNASE  46     5e-08    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 45.8 bits (108), Expect = 5e-08
Identities = 21/94 (22%), Positives = 38/94 (40%), Gaps = 9/94 (9%)

Query: 114 STPLHAYRRHFDIAVFAAYELMQRVCPDMIGAGGGAIINITSVASRLPGDGPYADRSGGV 173
S + F + + + V M+ G+I+ + S + +P
Sbjct: 103 SLSDEEWEATFSVNSTGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTS--------- 153

Query: 174 LPGYGGSKAALEHLTQCVAYDLADHRIAVNALSP 207
+ Y SKAA T+C+ +LA++ I N +SP
Sbjct: 154 MAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP3223c  HTHTETR  74     3e-18    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 73.5 bits (180), Expect = 3e-18
Identities = 30/183 (16%), Positives = 59/183 (32%), Gaps = 14/183 (7%)

Query: 26 PGAGRPRDPRIDSAILSATAELLVQTGYSNISLAAVAERAGTTKSALYRRWSSKAELVHE 85
+ IL L Q G S+ SL +A+ AG T+ A+Y + K++L E
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 86 AAFPTAPTALEAPAGDIAADMRMMIEATRDV----FTTPVVRAALPGLV------ADMTA 135
+ E A + R++ + V L+ +
Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVG 121

Query: 136 DPALNARVMSRFA-DLFTAVRVRLREAVDRGEAHRDVDPDRLIELIGGA---TMLRMLLR 191
+ A+ + + + + L+ ++ D+ R ++ G M L
Sbjct: 122 EMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFA 181

Query: 192 PEE 194
P+
Sbjct: 182 PQS 184


97  MAP3284c  MAP3291c  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP3284c  -2      7        -0.869060  FadD29
MAP3285c  -1      11       -0.881832  hypothetical protein
MAP3286   1       10       -2.474151  IS1547_2 transposase
MAP3287   0       10       -2.002943  hypothetical protein
MAP3288   -1      10       -1.722096  hypothetical protein
MAP3289c  -2      9        -1.286698  Mce1_1
MAP3290c  -2      9        -0.948801  Mpt64
MAP3291c  -1      10       -0.852234  *hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP3284c  NUCEPIMERASE  42     1e-05    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 41.7 bits (98), Expect = 1e-05
Identities = 43/189 (22%), Positives = 69/189 (36%), Gaps = 38/189 (20%)

Query: 744 ILLTGATGFLGPFLLSSLLARTPYTVHALVRATDPGHGLDRIVA----SLRKAQLWTPAL 799
L+TGA GF+G + LL V G+D + SL++A+ L
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVV-----------GIDNLNDYYDVSLKQAR-----L 46

Query: 800 EAEVRARVRVICGDLAEPALGIGEPAFARLARD--VDAVVHNGAL--VNY-VRTYDALRP 854
E + + DLA+ L + V + V Y + A
Sbjct: 47 ELLAQPGFQFHKIDLAD------REGMTDLFASGHFERVFISPHRLAVRYSLENPHAYAD 100

Query: 855 TNVEGTRELLRLAMTDHAKTFHLV--SSTFIYGWSTQPVVGEWDANEKMAGLDFGYSQTK 912
+N+ G +L H K HL+ SS+ +YG + + D+ + L Y+ TK
Sbjct: 101 SNLTGFLNILEGC--RHNKIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSL---YAATK 155

Query: 913 WVAEQLALA 921
E +A
Sbjct: 156 KANELMAHT 164


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP3287   DHBDHDRGNASE  55     4e-11    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 55.4 bits (133), Expect = 4e-11
Identities = 54/204 (26%), Positives = 88/204 (43%), Gaps = 26/204 (12%)

Query: 14 GRTVIVTGANAGLGEVTARELARVGGHVILAVRNTDKGRAAADRMAGVATGRVEVRELDL 73
G+ +TGA G+GE AR LA G H+ N +K + A E D+
Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEAR-HAEAFPADV 66

Query: 74 QDLASVR----RFADGIDTVDVLVNNAGI--MATKHAVTVDGFEGQIGTNHLGHFALTNL 127
+D A++ R + +D+LVN AG+ H+++ + +E N G F +
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126

Query: 128 LLPKLTDR----VVTVSSLMHHFGYISLKDLNFRSRPYSAWLAYSQSKLANLLFTSELQR 183
+ + DR +VTV S N P ++ AY+ SK A ++FT L
Sbjct: 127 VSKYMMDRRSGSIVTVGS-------------NPAGVPRTSMAAYASSKAAAVMFTKCLG- 172

Query: 184 RLDAVPSSLRALAAHPGWSHTNLQ 207
L+ ++R PG + T++Q
Sbjct: 173 -LELAEYNIRCNIVSPGSTETDMQ 195


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP3289c  FLGMOTORFLIG  31     0.009    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 30.9 bits (70), Expect = 0.009
Identities = 24/112 (21%), Positives = 45/112 (40%), Gaps = 21/112 (18%)

Query: 161 QTITSIAEKVDPVKLNLTLSAAAQSLSGLGEKFGQSVVNANALLDDVNPRMPQA-----R 215
QTI I +DP K A+ LS L + +V AL+D +P + +
Sbjct: 138 QTIALILSYLDPQK-------ASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVLE 190

Query: 216 KDIQGLAALGDTYADASPDLFDFLNN-------AVITSRTINAQQKDLDQAL 260
K + L++ T A ++ + +N +I S + + +L + +
Sbjct: 191 KKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIES--LEEEDPELAEEI 240


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP3291c  PF05616  34     0.004    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 33.6 bits (76), Expect = 0.004
Identities = 26/66 (39%), Positives = 30/66 (45%), Gaps = 10/66 (15%)

Query: 895 PGAGAAATNIQPTEGGAPAASPPANAPAPAVTPGSAP-PVAAP-------PVPDGSVTLS 946
PG+ A A N QP +PA +P AN PAP PG+ P P P P DG
Sbjct: 317 PGS-AEAPNAQPLPEVSPAENP-ANNPAPNENPGTRPNPEPDPDLNPDANPDTDGQPGTR 374

Query: 947 PAKAAV 952
P AV
Sbjct: 375 PDSPAV 380


98  MAP3325  MAP3331  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP3325   1       12       -1.623697  short chain dehydrogenase
MAP3326c  2       11       -1.167868  hypothetical protein
MAP3327c  -1      12       -1.033518  hypothetical protein
MAP3328c  0       14       -0.595076  hypothetical protein
MAP3329c  -1      10       0.567420   hypothetical protein
MAP3330   -1      9        1.702712   hypothetical protein
MAP3331   -2      8        1.605732   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP3325   DHBDHDRGNASE  72     4e-17    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 72.4 bits (177), Expect = 4e-17
Identities = 50/190 (26%), Positives = 86/190 (45%), Gaps = 8/190 (4%)

Query: 3 LNGKTMFISGASRGIGLAIAKRAAQDGANIALIAKTAEPHPKLPGTVYTAAKELEEAGGQ 62
+ GK FI+GA++GIG A+A+ A GA+IA + E K+ ++ A+ E
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE----- 60

Query: 63 ALPIVGDVRDPESVEAAVAKTIEQFGGIDICVNNASAINLGSITEVPMKRFDLMNGIQVR 122
A P DVRD +++ A+ + G IDI VN A + G I + + ++ +
Sbjct: 61 AFPA--DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNST 118

Query: 123 GTYAVSQACIPHLKGRENPHILTLSPPVLLGKEWLEPTAYMMAKFGMTLCALGIAEEMRE 182
G + S++ ++ R + I+T+ G AY +K + + E+ E
Sbjct: 119 GVFNASRSVSKYMMDRRSGSIVTVGSNP-AGVPRTSMAAYASSKAAAVMFTKCLGLELAE 177

Query: 183 AGIASNTLWP 192
I N + P
Sbjct: 178 YNIRCNIVSP 187


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP3328c  DHBDHDRGNASE  84     2e-21    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 83.9 bits (207), Expect = 2e-21
Identities = 66/253 (26%), Positives = 93/253 (36%), Gaps = 27/253 (10%)

Query: 13 VLVTGGTSGIGNAIATAFADSGAAVTVTGTRAAATGYPDIDLGAFSYRQ----CHIQDPE 68
+TG GIG A+A A GA + L A + ++D
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSA 70

Query: 69 SVDALANSLSD----LDILVNNAGGPYPAG-DEYDPDGYVASVTQNMFGPMRLTMRCHDL 123
++D + + +DILVN AG P + + A+ + N G + +
Sbjct: 71 AIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNAS---RSV 127

Query: 124 LKSSRAAGGASVVNVVSMSAFRSAVFVPGYASSKMGLVALTMNLSRRWAGDGIRVNAIAP 183
K S+V V S A + YASSK V T L A IR N ++P
Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187

Query: 184 GLIDTRMTHP-----------AMGIPEVMDVEIGFHTPLGRPGTPADCAGAALFLCTEAA 232
G +T M G E I PL + P+D A A LFL + A
Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGSLETFKTGI----PLKKLAKPSDIADAVLFLVSGQA 243

Query: 233 SYITGSTIAVDGG 245
+IT + VDGG
Sbjct: 244 GHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP3330   DHBDHDRGNASE  102    4e-28    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 102 bits (254), Expect = 4e-28
Identities = 71/266 (26%), Positives = 116/266 (43%), Gaps = 18/266 (6%)

Query: 12 MGLRGLADKVAVVVGGATGIGAATAARLAGEGCRVVIGDVAVDAARQTADRIAAAGGTAT 71
M +G+ K+A + G A GIG A A LA +G + D + + + A A
Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE 60

Query: 72 QVAFDLADPASVATLIDCAATTYGGVDLLFNVGADMSTIRADTDVVDIDFDVWDRVMTVS 131
D+ D A++ + G +D+L NV + + + W+ +V+
Sbjct: 61 AFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIH----SLSDEEWEATFSVN 116

Query: 132 LRGYVAAMKYAIPRMLDRGGGAIVNMSSAAAFQGEPARPAYATAKAGIGALTRHVASRWG 191
G A + M+DR G+IV + S A + AYA++KA T+ +
Sbjct: 117 STGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELA 176

Query: 192 KDNIRCNAVAPGFTATETIRSVPQWPELEAAALKRIRG-----------PRVGDPADVAT 240
+ NIRCN V+PG T T+ S+ W + E A + I+G ++ P+D+A
Sbjct: 177 EYNIRCNIVSPGSTETDMQWSL--WAD-ENGAEQVIKGSLETFKTGIPLKKLAKPSDIAD 233

Query: 241 LVAFLLSAEGDWINGQVINIDGGTVL 266
V FL+S + I + +DGG L
Sbjct: 234 AVLFLVSGQAGHITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP3331   HTHTETR  45     3e-08    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 45.4 bits (107), Expect = 3e-08
Identities = 18/143 (12%), Positives = 52/143 (36%), Gaps = 4/143 (2%)

Query: 6 RSRRRDGDERRRQLCDAAIRVLAEHGSRGLTHGQVDRYAGVPEGTTSYYYRTRAALLQGV 65
R +++ E R+ + D A+R+ ++ G + G++ + AGV G ++++ ++ L
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS-- 60

Query: 66 GKRVAEIDVANLQSVIDEPLDPLSPFAHLARLTMMQASGPGLMLNRARHELLLGAARDPG 125
+ E+ +N+ + E ++ + R L+
Sbjct: 61 --EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCE 118

Query: 126 LAETSQIFAGRINSMARDAIAHL 148
+ ++ ++ +
Sbjct: 119 FVGEMAVVQQAQRNLCLESYDRI 141


99  MAP3503c  MAP3510  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP3503c  0       7        -0.038922  hypothetical protein
MAP3504c  0       7        -0.723769  hypothetical protein
MAP3505c  1       8        -0.635011  hypothetical protein
MAP3506c  0       8        -0.721991  hypothetical protein
MAP3507   0       9        -1.220710  hypothetical protein
MAP3508   0       10       -1.272162  hypothetical protein
MAP3509   -1      10       -0.013496  hypothetical protein
MAP3510   -2      10       0.562129   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP3503c  DHBDHDRGNASE  96     8e-26    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 95.9 bits (238), Expect = 8e-26
Identities = 67/248 (27%), Positives = 106/248 (42%), Gaps = 11/248 (4%)

Query: 5 QVAIVTGASSGIGLGCATRLAGTGMAVLGTGRDPKRLAELETAIGDPDRVA-TVAVDLTD 63
++A +TGA+ GIG A LA G + +P++L ++ +++ R A D+ D
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 64 DDAPRRIVDRALQRWGHIDFLINNAGVGSPKPLHETDDDTLDYFLNLMLRAPFRLAREVL 123
A I R + G ID L+N AGV P +H D+ + ++ F +R V
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 124 PHLPP--GSAIINVTSTFAVVGGLRGGAYSAAKGGLTALTTHIACQYGASGIRCNAVAPG 181
++ +I+ V S A V AY+++K T + + IRCN V+PG
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 182 VTVTPM-----VEKRLQDPRFRKINTEM---TPHQRLGSVDDIAATVAFLCSPGGSFING 233
T T M ++ + + P ++L DIA V FL S I
Sbjct: 189 STETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHITM 248

Query: 234 QTIVVDGG 241
+ VDGG
Sbjct: 249 HNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP3506c  ENTEROTOXINA  29     0.039    Heat-labile enterotoxin A chain signature.
		>ENTEROTOXINA#Heat-labile enterotoxin A chain signature.

Length = 258

Score = 29.2 bits (65), Expect = 0.039
Identities = 19/60 (31%), Positives = 25/60 (41%), Gaps = 6/60 (10%)

Query: 334 HTVGMLFSPGYPGFLRGVAASGPDEFVVTTSGGQISRYRPEASESEVLADG---FDQLYG 390
H G GY + V A+ P+ F V + Y P E EV A G + Q+YG
Sbjct: 88 HLAGQSILSGYSTYYIYVIATAPNMFNVN---DVLGVYSPHPYEQEVSALGGIPYSQIYG 144


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP3507   DHBDHDRGNASE  118    4e-34    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 118 bits (297), Expect = 4e-34
Identities = 78/259 (30%), Positives = 122/259 (47%), Gaps = 15/259 (5%)

Query: 8 LEGRVVVVSGAGGGGIGTTVTAMAARAGATVIAVSRSKENLDEHIAPLAARGLAVLPVAA 67
+EG++ ++GA G IG V A GA + AV + E L++ ++ L A A
Sbjct: 6 IEGKIAFITGAAQG-IGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPA 64

Query: 68 DASTDEGIAAVIDQARRADGRLYGLVNVAGGAEPSTWMPSTRVSRTDWRKIFADNLETAF 127
D I + + R G + LVNVAG P +S +W F+ N F
Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPG---LIHSLSDEEWEATFSVNSTGVF 121

Query: 128 FMSQAVAAELLARRLPGSIVSISSISGMNTAPFHIAYGTAKSAIAAMTRTMALELAQSAI 187
S++V+ ++ RR GSIV++ S AY ++K+A T+ + LELA+ I
Sbjct: 122 NASRSVSKYMMDRR-SGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNI 180

Query: 188 RVNAVAPGVTETAASRTYVADDPDRDRR----------AIAMGRRGRPEEQAGAILFLLS 237
R N V+PG TET + AD+ ++ I + + +P + A A+LFL+S
Sbjct: 181 RCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVS 240

Query: 238 ELSSYVTGQTLLVDGGLDL 256
+ ++T L VDGG L
Sbjct: 241 GQAGHITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP3510   DHBDHDRGNASE  81     8e-20    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 80.9 bits (199), Expect = 8e-20
Identities = 54/192 (28%), Positives = 83/192 (43%), Gaps = 13/192 (6%)

Query: 14 RVAVVTGAGRGLGRAYAHLLAARGAKVVVNDVGGALDGAGVDTGPAAQ--VVDEITAAGG 71
++A +TGA +G+G A A LA++GA + A VD P VV + A
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHI-----------AAVDYNPEKLEKVVSSLKAEAR 57

Query: 72 DAVACTESVATPEGGRAIIETALARYGRLDVLVHNAGNVRRASLKQMSYEDFDAVLDVHL 131
A A V I G +D+LV+ AG +R + +S E+++A V+
Sbjct: 58 HAEAFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNS 117

Query: 132 RGAFHVLRPAFPVMCRAGYGRIVLTSSIGGLYGNQGVANYAAAKAGVIGLSNVAALEGAA 191
G F+ R M G IV S +A YA++KA + + LE A
Sbjct: 118 TGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAE 177

Query: 192 EGVRCNVIVPAA 203
+RCN++ P +
Sbjct: 178 YNIRCNIVSPGS 189


100  MAP3520c  MAP3527  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP3520c  0       7        1.294506   hypothetical protein
MAP3521   -1      7        1.682373   hypothetical protein
MAP3522   0       10       2.112982   OxyS_1
MAP3523c  0       9        1.235465   oxalyl-CoA decarboxylase
MAP3524   0       10       1.128259   acyl-CoA synthetase
MAP3525c  0       12       -0.055392  elongation factor G
MAP3526c  1       13       -0.768323  hypothetical protein
MAP3527   0       11       -1.337075  PepA
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
MAP3520c  UREASE  29     0.021    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 28.9 bits (65), Expect = 0.021
Identities = 19/60 (31%), Positives = 25/60 (41%), Gaps = 4/60 (6%)

Query: 17 MALVAPVGRGGASVPLPQPVPGIASILPANGAVVGVAHPIVVTFTAPAADRAAVERSIHV 76
AP+G AS+P PQPV P GA VTF + A+ A + + V
Sbjct: 454 TIAAAPMGDPNASIPTPQPV----HYRPMFGAYGRSRTNSSVTFVSQASLDAGLAGRLGV 509


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP3521   ISCHRISMTASE  43     4e-07    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 43.5 bits (102), Expect = 4e-07
Identities = 39/182 (21%), Positives = 67/182 (36%), Gaps = 31/182 (17%)

Query: 44 RPCADSAALVLIDVQR---DFYADDAPMRVEGTSAALGAMAELARPFRRRELPIVHVVRL 100
P + A L++ D+Q D + A E + +L + +P+V
Sbjct: 25 VPDPNRAVLLIHDMQNYFVDAFTAGASPVTE----LSANIRKLKNQCVQLGIPVV----- 75

Query: 101 YRADGSNADPVRRRFIEDGARVAVPGSPGSQ-IAPELLPKAVELDHQLLLSGGFQQIGPA 159
Y A + +P R + D + P + I EL P+
Sbjct: 76 YTAQPGSQNPDDRALLTDFWGPGLNSGPYEEKIITELAPE------------------DD 117

Query: 160 EHVMYKPRWGAFYGTKLVQHLRESGTDTLVFAGCNFPNCPRTSIYEASERDFRIVLVADA 219
+ V+ K R+ AF T L++ +R+ G D L+ G + EA D + V DA
Sbjct: 118 DLVLTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDA 177

Query: 220 IS 221
++
Sbjct: 178 VA 179


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
MAP3525c  TCRTETOQM  352    e-114    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 352 bits (904), Expect = e-114
Identities = 146/684 (21%), Positives = 268/684 (39%), Gaps = 74/684 (10%)

Query: 23 IRNVVLVGPSGGGKTTLVEALLVAAGVLNRPGSVADGSTVCDYDEAEIRQQRSVGVAVAS 82
I N+ ++ GKTTL E+LL +G + GSV G+T D E ++ ++ + S
Sbjct: 3 IINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITS 62

Query: 83 LSHDGVKVNLVDTPGYADFVGELRAGLRAADCALFVIAANEGVDEPTKLLWQECNQVGMP 142
+ KVN++DTPG+ DF+ E+ L D A+ +I+A +GV T++L+ ++G+P
Sbjct: 63 FQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIP 122

Query: 143 RAVVITKLDHARANYAEALAAAQNAFGDKVLPLYLPTGLPACTGLIGLLSQQRYAYADGK 202
I K+D + + + +++ Q+ Y +
Sbjct: 123 TIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIK-----------------QKVELYPN-- 163

Query: 203 RAVRPPDPADAAQIEAARGTLIEGIIEESEDESLMERYLGGESIDETVLIADLERAVARG 262
+ + ++ Q + T+IEG ++ L+E+Y+ G+S++ L +
Sbjct: 164 --MCVTNFTESEQWD----TVIEG------NDDLLEKYMSGKSLEALELEQEESIRFHNC 211

Query: 263 SFFPVIPVCSSTGVGTLELLEVATRGFPSPMEHPLPEVFTQVGAPRAGLACDPDGPLLAE 322
S FPV + +G L+EV T F S L +
Sbjct: 212 SLFPVYHGSAKNNIGIDNLIEVITNKFYSSTH-------------------RGQSELCGK 252

Query: 323 VVKTTSDPYVGRVSLVRVFSGTIRPDATVHVSGHFASFFGTGNGNGHAHPDHDEDERIGV 382
V K R++ +R++SG + +V +S E +I
Sbjct: 253 VFKIEYSEKRQRLAYIRLYSGVLHLRDSVRIS-------------------EKEKIKITE 293

Query: 383 LSFPLGKQQRPASAVVAGDICAIGKLSRAETGDTLSDKSEPLVLKPWTMPEPLLPVAIAA 442
+ + + +G+I I + + L D + P PLL +
Sbjct: 294 MYTSINGELCKIDKAYSGEI-VILQNEFLKLNSVLGDTKLLPQRERIENPLPLLQTTVEP 352

Query: 443 HAKTDEDKLSVGLGRLAAEDPTLRIEQNQETHQIVLWCMGESHAGVVLDALANRYGVTVD 502
+ L L ++ DP LR + TH+I+L +G+ V L +Y V ++
Sbjct: 353 SKPQQREMLLDALLEISDSDPLLRYYVDSATHEIILSFLGKVQMEVTCALLQEKYHVEIE 412

Query: 503 TVELRLPLRETFAGKAKGHGRHVKQSGGHGQYAVCDIEVEPLPEGSGFEFVDKVVGGAVP 562
E + E KA+ + + +A + V PLP GSG ++ V G +
Sbjct: 413 IKEPTVIYMERPLKKAE--YTIHIEVPPNPFWASIGLSVSPLPLGSGMQYESSVSLGYLN 470

Query: 563 RQFIPSVEKGVRAQMEKGVHAGYPVVDIRVTLLDGKAHSVDSSDFAFQMAGALALREAAA 622
+ F +V +G+R E+G++ G+ V D ++ G +S S+ F+M + L +
Sbjct: 471 QSFQNAVMEGIRYGCEQGLY-GWNVTDCKICFKYGLYYSPVSTPADFRMLAPIVLEQVLK 529

Query: 623 ATKVVLLEPIDEISVLVPDDFVGAVMGDLSGRRGRVLGTDTAGHERTVVKAEVPQVELTR 682
LLEP + P +++ D ++ T +E ++ E+P +
Sbjct: 530 KAGTELLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKNNE-VILSGEIPARCIQE 588

Query: 683 YAIDLRSLAHGAASFTRSFARYEP 706
Y DL +G + Y
Sbjct: 589 YRSDLTFFTNGRSVCLTELKGYHV 612


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
MAP3527   V8PROTEASE  47     5e-08    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 46.9 bits (111), Expect = 5e-08
Identities = 30/167 (17%), Positives = 56/167 (33%), Gaps = 28/167 (16%)

Query: 86 GTGIVIDPNGVVLTNNHVISGATE----ISAF------DVGNGQTYAV-DVVGYDRTQDI 134
+G+V+ + +LTN HV+ + AF D + + Y D+
Sbjct: 104 ASGVVVGKD-TLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGEGDL 162

Query: 135 AVLQLRGAAGLPTATIGGEATVGEPIVALG---NVGGQGGTPNAVAGKVVALNQSV--SA 189
A+++ + +GE + N Q V G + +
Sbjct: 163 AIVKF--------SPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGDKPVATMWES 214

Query: 190 TDTLTGAQENLGGLIQADAPIKPGDSGGPMVNSAGQVIGVDTAATDS 236
+T + +Q D G+SG P+ N +VIG+ +
Sbjct: 215 KGKITYLKGEA---MQYDLSTTGGNSGSPVFNEKNEVIGIHWGGVPN 258


101  MAP3560  MAP3567  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP3560   -2      13       0.202645   hypothetical protein
MAP3561   -2      13       0.244365   hypothetical protein
MAP3562   -1      15       -0.036942  hypothetical protein
MAP3563   -1      17       0.661964   hypothetical protein
MAP3564   -1      15       0.422511   hypothetical protein
MAP3565   -2      12       0.106231   hypothetical protein
MAP3566   -2      12       -0.715998  hypothetical protein
MAP3567   0       11       0.498874   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
MAP3560   HELNAPAPROT  123    2e-38    Helicobacter neutrophil-activating protein A family ...
		>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature.

Length = 153

Score = 123 bits (309), Expect = 2e-38
Identities = 37/145 (25%), Positives = 61/145 (42%), Gaps = 2/145 (1%)

Query: 13 QAARLTELLQKQLSTYNDLHLTLKHIHWNVVGPNFIGVHEMIDPQVEAVRGFADDVAERI 72
+ L QLS + L+ L HW V GP+F +HE + + D +AER+
Sbjct: 9 NQTLVENSLNTQLSNWFLLYSKLHRFHWYVKGPHFFTLHEKFEELYDHAAETVDTIAERL 68

Query: 73 AALGASPQGTPGAIIKDRSWDDYSVGRDTVQAHLAALDLVYNGVIEDIRQYIDETDE-LD 131
A+G P T + S D + A ++ Y + + + I +E D
Sbjct: 69 LAIGGQPVATVKEYTEHASITDGGNETSASEMVQALVND-YKQISSESKFVIGLAEENQD 127

Query: 132 QVTQDLLIGQAAQLEKFQWFVRAHL 156
T DL +G ++EK W + ++L
Sbjct: 128 NATADLFVGLIEEVEKQVWMLSSYL 152


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP3562   HTHTETR  90     1e-24    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 90.5 bits (224), Expect = 1e-24
Identities = 31/172 (18%), Positives = 65/172 (37%), Gaps = 4/172 (2%)

Query: 13 RPPAAKADETRQRIIQAARLVFSERGYDGATFQAIAARADLTRPAINHYFASKRALYQEV 72
R +A ETRQ I+ A +FS++G + IA A +TR AI +F K L+ E+
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 73 MDETNEFVIG-VGIKEADRETTLVGRLTAFISAAVKANAENPAGSAFIFGGVLESQRHPE 131
+ + + +A + L + +++ + + + E
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 132 W---NTAENDSVRIAREFLIRVVNDAIEHGEVAADIDASALVETLLVVMCGV 180
A+ + + + + + + IE + AD+ + + G+
Sbjct: 123 MAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGL 174


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP3565   PF03544  29     0.034    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 29.2 bits (65), Expect = 0.034
Identities = 19/102 (18%), Positives = 31/102 (30%), Gaps = 2/102 (1%)

Query: 364 DAPAALDAAVPGSQPPPAPAEPAVLDVVNATTHDGLAAALEEALAGRGFTRGSATTAPTQ 423
+ P + +P P P V V + + T + T+ T
Sbjct: 85 EPPKEAPVVIEKPKPKPKPKPKPVKKVEQP--KRDVKPVESRPASPFENTAPARPTSSTA 142

Query: 424 AEDSSIEYGPGAEAAARLLADQLHLPATAQSAVAPGTVRLTV 465
+S A L +Q PA AQ+ G V++
Sbjct: 143 TAATSKPVTSVASGPRALSRNQPQYPARAQALRIEGQVKVKF 184


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP3567   DHBDHDRGNASE  83     8e-21    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 83.2 bits (205), Expect = 8e-21
Identities = 52/192 (27%), Positives = 84/192 (43%), Gaps = 9/192 (4%)

Query: 3 GVQDRVIVVTGAGGGLGREYALTLAREGASVVVNDLGGARDGTGAGHNMADQVVKEIKDA 62
G++ ++ +TGA G+G A TLA +GA + D + ++VV +K
Sbjct: 5 GIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEK---------LEKVVSSLKAE 55

Query: 63 GGRAVANYDSVAEPAGAENIIKTALDEFGAVHGVVSNAGILRDGTFHKMLFENWDAVLKV 122
A A V + A + I E G + +V+ AG+LR G H + E W+A V
Sbjct: 56 ARHAEAFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSV 115

Query: 123 HLYGGYNVIRAAWPHFREQSYGRVVVATSTSGLFGNFGQTNYGAAKLGLVGLINSLALEG 182
+ G +N R+ + ++ G +V S Y ++K V L LE
Sbjct: 116 NSTGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLEL 175

Query: 183 AKYNIHANAIAP 194
A+YNI N ++P
Sbjct: 176 AEYNIRCNIVSP 187


102  MAP3604  MAP3608  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP3604   1       13       -1.734709  Mce1_2
MAP3605   2       13       -1.347469  hypothetical protein
MAP3606   2       12       -0.718807  hypothetical protein
MAP3607   1       11       -1.338959  hypothetical protein
MAP3608   0       9        -1.301983  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP3604   FLGMOTORFLIG  32     0.005    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 31.7 bits (72), Expect = 0.005
Identities = 24/112 (21%), Positives = 45/112 (40%), Gaps = 21/112 (18%)

Query: 154 QTITSIAEKVDPVKLNLTLSAAAQSLSGLGEKFGQSVVNANALLDDVNPRMPQA-----R 208
QTI I +DP K A+ LS L + +V AL+D +P + +
Sbjct: 138 QTIALILSYLDPQK-------ASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVLE 190

Query: 209 KDIQGLAALGDTYADASPDLFDFLNN-------AVITSRTINAQQKDLDQAL 253
K + L++ T A ++ + +N +I S + + +L + +
Sbjct: 191 KKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIES--LEEEDPELAEEI 240


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP3605   FLGMOTORFLIN  29     0.018    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 28.7 bits (64), Expect = 0.018
Identities = 20/59 (33%), Positives = 29/59 (49%), Gaps = 8/59 (13%)

Query: 152 ALDPQKVNTIATALVTVFQGQGG--------TINDILDQTAQLTSQLGERDQAIGEVIK 202
AL+ QK T +A VFQ GG I+ I+D +LT +LG I E+++
Sbjct: 22 ALNEQKATTTKSAADAVFQQLGGGDVSGAMQDIDLIMDIPVKLTVELGRTRMTIKELLR 80


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP3606   PF03544  31     0.006    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 31.5 bits (71), Expect = 0.006
Identities = 25/127 (19%), Positives = 34/127 (26%), Gaps = 9/127 (7%)

Query: 392 LPRPDNPLPCAGAVTGPFGGPGFPAPVDVMTSPPNPAGLPPTPGIPIAGRPGDAPPDVPG 451
LP P P ++ P P + PP P P PI P +AP +
Sbjct: 43 LPAPAQP------ISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEK 96

Query: 452 ---TPVPLPTQAPPGARTENLAPAGPVPPPSTFAPGAAAGSAGTPRAGQPVAGAVHQPRR 508
P P P + + P S F A A +
Sbjct: 97 PKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVASG 156

Query: 509 DRRQRRS 515
R R+
Sbjct: 157 PRALSRN 163


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP3607   PF03544  31     0.010    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 31.1 bits (70), Expect = 0.010
Identities = 15/41 (36%), Positives = 17/41 (41%)

Query: 492 APVPIPPPPPGPGVAPGPVAPTPAPVSAPAPNAGGPAAPAD 532
APV I P P P P PV P P PA+P +
Sbjct: 90 APVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFE 130


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP3608   PF07328  29     0.015    T-DNA border endonuclease VirD1
		>PF07328#T-DNA border endonuclease VirD1

Length = 144

Score = 29.3 bits (65), Expect = 0.015
Identities = 11/62 (17%), Positives = 21/62 (33%)

Query: 241 ADTKQLLINAVDSVGRLSQAADQYLSEARGPLHTDLQALQCPLKELGKASPYLIGALKLI 300
A T +LL + ++ ++ +Q A + K LG L L +
Sbjct: 65 AKTVELLRDMSRAIAGVATNINQIAKAANRTHDPAYHSFMAERKVLGLELSKLSAVLAPL 124

Query: 301 LT 302
+
Sbjct: 125 ME 126


103  MAP3633  MAP3644  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias   Product
MAP3633   0       10       1.450706  hypothetical protein
MAP3634   -2      9        1.281709  hypothetical protein
MAP3635   0       10       0.933251  hypothetical protein
MAP3636   0       11       0.828669  hypothetical protein
MAP3637c  1       12       0.944487  hypothetical protein
MAP3638   1       11       0.905959  hypothetical protein
MAP3639c  0       10       1.504253  hypothetical protein
MAP3640   -1      11       2.131440  hypothetical protein
MAP3641c  -1      12       1.555470  hypothetical protein
MAP3642c  0       11       1.830486  hypothetical protein
MAP3643c  0       11       1.485551  tRNA (guanine-N(7)-)-methyltransferase
MAP3644   1       10       0.562739  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP3633   TCRTETB  39     2e-05    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 39.1 bits (91), Expect = 2e-05
Identities = 25/163 (15%), Positives = 62/163 (38%), Gaps = 1/163 (0%)

Query: 44 ALPAIARNLQVSLVLVGTLLSWYALVAALTTIPLVRWTAHLPRRRVLVASLTCLTASQLI 103
+LP IA + + + + L ++ T + + L +R+L+ + +I
Sbjct: 36 SLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVI 95

Query: 104 SALAPNFA-VLAAGRVLCAITHGLLWSVIAPIATRLVPPSHAGRATMSIYVGTSLALVVG 162
+ +F +L R + +++ + R +P + G+A I ++ VG
Sbjct: 96 GFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVG 155

Query: 163 SPLTAALSLMWGWRLAVVCVTVAAAVVTVAARLMLPEMVLTEH 205
+ ++ W ++ + V +L+ E+ + H
Sbjct: 156 PAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGH 198


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
MAP3634   TONBPROTEIN  40     1e-05    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 39.6 bits (92), Expect = 1e-05
Identities = 20/101 (19%), Positives = 29/101 (28%), Gaps = 12/101 (11%)

Query: 37 PALADPDPAPADPGAVAAPPGPPAPPDPLAPPPPPDPLAPPPPAAPPAPW---------- 86
PA +P A P P P P P P P + P P P P
Sbjct: 52 PADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPK 111

Query: 87 --LPPAAQPAAAPAAGQDPTPFTGTPPFGPPTFVPKTGSTV 125
+ P A+P P T + + + ++
Sbjct: 112 RDVKPVESRPASPFENTAPARLTSSTATAATSKPVTSVASG 152



Score = 31.9 bits (72), Expect = 0.003
Identities = 22/92 (23%), Positives = 24/92 (26%), Gaps = 5/92 (5%)

Query: 44 PAPADPGAVA--APPGPPAPPDPLAPPPPPDPLAPPPPAAPPAPWLPPAAQPAAAPAAGQ 101
PAPA P +V P P PP P P P P P P P
Sbjct: 39 PAPAQPISVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKP 98

Query: 102 DPTPFTG---TPPFGPPTFVPKTGSTVGVAQP 130
P P P + S P
Sbjct: 99 KPKPVKKVQEQPKRDVKPVESRPASPFENTAP 130


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP3635   HTHTETR  38     1e-05    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 37.7 bits (87), Expect = 1e-05
Identities = 20/143 (13%), Positives = 55/143 (38%), Gaps = 3/143 (2%)

Query: 2 RSPREKMVVSAALLIRERGAHATAISDVLEHSGAPRGSAYHYFPGGRTQLLCEAVDFAGE 61
+ R+ ++ A L ++G +T++ ++ + +G RG+ Y +F ++ L E + +
Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHF-KDKSDLFSEIWELSES 68

Query: 62 YVAAVIAG--AESGSRLLDTLIDTYREQLRDSDFRAGCPVVAVAVEAGEQSDAERPVIER 119
+ + A+ L L + L + ++ + + E V+++
Sbjct: 69 NIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQ 128

Query: 120 AAAAFDRWTDLIAQRFVADGIRR 142
A + ++ + I
Sbjct: 129 AQRNLCLESYDRIEQTLKHCIEA 151


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP3637c  ACRIFLAVINRP  68     1e-13    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 68.3 bits (167), Expect = 1e-13
Identities = 42/227 (18%), Positives = 88/227 (38%), Gaps = 22/227 (9%)

Query: 186 VILIVLLAVFGSLAAAAIPLALGICTVVVTMGLVYLLSAYTTMSVFVTSTVSMFGIALA- 244
++ +V+ ++ A IP V V + + + A S+ +T++MFG+ LA
Sbjct: 350 LVFLVMYLFLQNMRATLIPTI----AVPVVLLGTFAILAAFGYSI---NTLTMFGMVLAI 402

Query: 245 ---VDYSLFILMRFREELRSGR-QPREAVDAAMATSGLAVVLSGMTVIASLTGIYVINTP 300
VD ++ ++ + + P+EA + +M+ A+V M + A +
Sbjct: 403 GLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGS 462

Query: 301 A---LKSMATGAILAVAVAMLTSTTLTPAALATFGRAAAK-----RSALLHWSRRPESTQ 352
+ + + A+A+++L + LTPA AT + + + W
Sbjct: 463 TGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHS 522

Query: 353 SKFWNRWIGWVMRRPWMSALAASLVLLVMAAPAASMVLGNSLLRQFD 399
+ +G ++ L L+V + L +S L + D
Sbjct: 523 VNHYTNSVGKILGSTGRYLLIY--ALIVAGMVVLFLRLPSSFLPEED 567


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
MAP3638   OMADHESIN  29     0.010    Yersinia outer membrane adhesin signature.
		>OMADHESIN#Yersinia outer membrane adhesin signature.

Length = 455

Score = 29.1 bits (64), Expect = 0.010
Identities = 25/58 (43%), Positives = 30/58 (51%), Gaps = 8/58 (13%)

Query: 2 TSASAGRRRRIFAGLIAAALPGAAVAVLAGPPATGANDPCAASEVARTIGSVSKSMGD 59
+ASA I G A A GAAVAV AG ATG N + IG +SK++GD
Sbjct: 63 LNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVN--------SVAIGPLSKALGD 112


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP3641c  ACRIFLAVINRP  58     2e-10    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 57.9 bits (140), Expect = 2e-10
Identities = 35/233 (15%), Positives = 83/233 (35%), Gaps = 34/233 (14%)

Query: 193 AVPMVAVVLFLVFGGAVAAGLPAIVGGLSIAGSLGILRLVAVFGPVHYFAQPVVSLIGL- 251
A+ +V +V++L A +P I + + G+ IL FG ++ +++ G+
Sbjct: 347 AIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAA---FG----YSINTLTMFGMV 399

Query: 252 ---GIAIDYGLFVVSRF-REEIAEGYDTEAAVRRTVMTAGRTVTFSAVLIIASSASLLVL 307
G+ +D + VV R + + + A +++ + A+++ A +
Sbjct: 400 LAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFF 459

Query: 308 P--QG-FVHSLTYAIFAAVGLAALLSITFLPACLGILGRHVDALGVRTAFRVPFLRNWKY 364
G + I +A+ L+ L+++ PA L L +A +
Sbjct: 460 GGSTGAIYRQFSITIVSAMALSVLVALILTPALCATL------LKPVSAEHHENKGGF-- 511

Query: 365 SRAYLNWLADRLQKTKTREEVEAGFWGKLVNWVMRKPLAFAIPIAVGMILLVI 417
W + + V ++ + + A+ + +V+
Sbjct: 512 ----FGWFNTTFDHSVNH-------YTNSVGKILGSTGRYLLIYALIVAGMVV 553



Score = 50.6 bits (121), Expect = 3e-08
Identities = 39/209 (18%), Positives = 78/209 (37%), Gaps = 23/209 (11%)

Query: 534 AKKIAELRAVNPPKGLNLYVGGTPALEQDSIHSLFDKAPLMLVLLLGATLLLMFLAFGSV 593
A + E A P G+ G E+ S ++AP ++ + L + + S
Sbjct: 839 AMALMENLASKLPAGIGYDWTGMSYQERLS----GNQAPALVAISFVVVFLCLAALYESW 894

Query: 594 VLPIKAVLMSALTLGSTMGILTWIFVDGHLSGVLNFTPTPLMVVVIALVVAVGFGLATDY 653
+P+ +M + LG +L + MV ++ + GL+
Sbjct: 895 SIPV--SVMLVVPLGIVGVLLAATLFNQKND-------VYFMVGLLTTI-----GLSAKN 940

Query: 654 EVFLVSRMVEARER-GMSTAEAIRIGTATTGR--LITAAAMVLAVVASSFVFSD-LVLMK 709
+ +V + E+ G EA + R L+T+ A +L V+ +
Sbjct: 941 AILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQN 1000

Query: 710 YLAFGLMAALLLDATVVRMFLVPSVMKLL 738
+ G+M ++ AT++ +F VP ++
Sbjct: 1001 AVGIGVMGGMVS-ATLLAIFFVPVFFVVI 1028


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP3644   PF06872  30     0.018    EspG protein
		>PF06872#EspG protein

Length = 398

Score = 29.7 bits (66), Expect = 0.018
Identities = 16/56 (28%), Positives = 27/56 (48%), Gaps = 2/56 (3%)

Query: 185 GVDVPAAARERLLDTLDLFGT--ALAIAAIRRGAGPAQLRTLLRRISGVDAVLAEI 238
GVD+PA A++ L +TL L T + + IR G +++ S + A +
Sbjct: 219 GVDIPADAQKLLRNTLGLKDTNSSPDLNVIRNGIPRHYAEQIVKESSSTNEQKAAV 274


104  MAP3735c  MAP3740  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP3735c  -1      28       -3.458264  hypothetical protein
MAP3736c  -2      27       -3.290147  hypothetical protein
MAP3737   -1      25       -2.975547  hypothetical protein
MAP3738c  0       25       -3.242021  hypothetical protein
MAP3739c  0       25       -3.232363  hypothetical protein
MAP3740   1       24       -3.152133  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP3735c  TYPE3IMPPROT  30     0.017    Type III secretion system inner membrane P protein ...
		>TYPE3IMPPROT#Type III secretion system inner membrane P protein family signature.

Length = 224

Score = 29.8 bits (67), Expect = 0.017
Identities = 23/149 (15%), Positives = 50/149 (33%), Gaps = 13/149 (8%)

Query: 9 VLIGLVMYQRAMASADTKYPEFMQWLARLNSAAVEFVNGIGVVKAFGTPGVASRRFQE-- 66
L G+ + S +P V F + + K R +
Sbjct: 51 TLNGVALL----LSMFVMWPIMHDAYVYFEDEDVTFNDISSLSKHVDEGLDGYRDYLIKY 106

Query: 67 VSRAFAHFFLD------WAKSTSVAAVIAETLLSPPSILVVVAGAGGALTANGKLPLDSF 120
R FF + + + T + + P ++ A A + + K+ +
Sbjct: 107 SDRELVQFFENAQLKRQYGEETETVKRDKDEIEKPSIFALLPAYALSEIKSAFKIGFYLY 166

Query: 121 IAFLVFGTVITAGLMTV-MMSIHPLVTAL 148
+ F+V V+++ L+ + MM + P+ +
Sbjct: 167 LPFVVVDLVVSSVLLALGMMMMSPVTIST 195


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP3737   PF03544  30     0.017    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 29.9 bits (67), Expect = 0.017
Identities = 18/65 (27%), Positives = 24/65 (36%), Gaps = 5/65 (7%)

Query: 288 APAPEPVPATVADVPAPTSPPNQALPVAGIPSASTPASAPASAPATSATAPASAPAPAPA 347
P P+P P V V P ++P A A TS+TA A+ P +
Sbjct: 98 KPKPKPKPKPVKKVEQPKRDVKPVE-----SRPASPFENTAPARPTSSTATAATSKPVTS 152

Query: 348 SAGSP 352
A P
Sbjct: 153 VASGP 157



Score = 29.2 bits (65), Expect = 0.029
Identities = 15/116 (12%), Positives = 26/116 (22%), Gaps = 10/116 (8%)

Query: 289 PAPEPVPATVADVPAPTSPPNQALPVAGIPSASTPASAPASAPATSATAPASAPAPAPAS 348
P P P + PP +A V P PAS
Sbjct: 70 PEPVVEPEPEPEPIP--EPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPAS 127

Query: 349 AGSPVFGYLVGGGGGGESGPTLTGRSNATAPAGLAAASAAAAKAPTRDQTRSRRRR 404
+ + +A+ A ++ + R++ R
Sbjct: 128 PFENTAPARPTSSTATAAT--------SKPVTSVASGPRALSRNQPQYPARAQALR 175



Score = 28.8 bits (64), Expect = 0.043
Identities = 11/67 (16%), Positives = 17/67 (25%), Gaps = 2/67 (2%)

Query: 288 APAPEPVPATVADVPAPTSPPNQALPVAGIPSASTPASAPASAPATSATAPA--SAPAPA 345
P PEP + P P P + S + P +
Sbjct: 81 EPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSS 140

Query: 346 PASAGSP 352
A+A +
Sbjct: 141 TATAATS 147


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP3739c  TCRTETB  124    4e-33    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 124 bits (314), Expect = 4e-33
Identities = 87/408 (21%), Positives = 159/408 (38%), Gaps = 27/408 (6%)

Query: 27 FLMTLDITVVNVALPSIQKDLGASLEGLQWVVNAYVLAFAALLLTVGSVSDRLGRKRLFL 86
F L+ V+NV+LP I D WV A++L F+ G +SD+LG KRL L
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 87 TGVAVFTVASALCVASRTESP-LIAARALQGIGGALVFGTCLALIADAYTDAEEEQRRKA 145
G+ + S + + LI AR +QG G A + ++A +E R KA
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVA---RYIPKENRGKA 140

Query: 146 VGLAMAAGAAAATLGPLIGGGLVEIGTWQWIFAINVPVGVALAICTALKVREPHAPHAAD 205
GL + A +GP IGG + W + + +P+ + + +K+ +
Sbjct: 141 FGLIGSIVAMGEGVGPAIGGMIAH--YIHWSYLLLIPMITIITVPFLMKLLKKEVRI--- 195

Query: 206 NSRVDSVGAVVAIVVLFALNYGLLTGAAKGWGRGDVLAALAIGLAGGVGFVLHQLRRGSE 265
D G ++ V + T + + +++ L+ + FV H R+ ++
Sbjct: 196 KGHFDIKGIILMSVGIVFFMLF-TTSYSISF---LIVSVLSFLI-----FVKHI-RKVTD 245

Query: 266 ATLDLTLFRIPTFLAAIVLGFTVRALSFGVFPFLILWLAGAHGRSAFDIGLILSALALPL 325
+D L + F+ ++ G + G + + H S +IG + + P
Sbjct: 246 PFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSV---IIFPG 302

Query: 326 MVCAVLSTSVARAV----GVRATMSIAMVITAAGLFLATLIRGDGSWTTILPALAVLGVG 381
+ ++ + + G ++I + + A+ + SW + + VLG G
Sbjct: 303 TMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLG-G 361

Query: 382 NGVAMPHLMNLAVDVVPSNKAGMATGAANTAFPLGTATGVAAFGVVLS 429
+ + + +AG N L TG+A G +LS
Sbjct: 362 LSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP3740   ISCHRISMTASE  43     8e-06    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 43.5 bits (102), Expect = 8e-06
Identities = 17/66 (25%), Positives = 30/66 (45%)

Query: 615 LAQHLAAMLGVEPYELAPDADLTTLGLTSMMTAQIVEWSSSQSRRLDFADLYAEPTLRSW 674
+ + +A +L P ++ DL GL S+ +VE + + F +L PT+ W
Sbjct: 235 IRKQIAELLQETPEDITDQEDLLDRGLDSVRIMTLVEQWRREGAEVTFVELAERPTIEEW 294

Query: 675 QRLFDA 680
Q+L
Sbjct: 295 QKLLTT 300


105  MAP3776c  MAP3782  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP3776c  -1      9        -0.397179  hypothetical protein
MAP3777   -2      10       0.710571   hypothetical protein
MAP3778   -2      10       0.605619   hypothetical protein
MAP3779   -2      10       1.020632   hypothetical protein
MAP3780   -2      11       0.904348   hypothetical protein
MAP3781   2       11       2.778351   hypothetical protein
MAP3782   0       8        3.307808   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP3776c  ADHESNFAMILY  97     6e-25    Adhesin family signature.
		>ADHESNFAMILY#Adhesin family signature.

Length = 309

Score = 96.9 bits (241), Expect = 6e-25
Identities = 52/266 (19%), Positives = 96/266 (36%), Gaps = 30/266 (11%)

Query: 90 AIGRTAAPPCPTAPLAVVVSVDQWGDIVSELGGACANVKTVLASSSVDPHDYEPSPADAA 149
A L VV + DI + G ++ + DPH+YEP P D
Sbjct: 19 ACASGKKDTTSGQKLKVVATNSIIADITKNIAGDKIDLH-SIVPIGQDPHEYEPLPEDVK 77

Query: 150 DFMNAKLIVVNG----AGYDSWASKLAGSSASGA---PLVSAAAVTTTPDGA-------N 195
A LI NG G ++W +KL ++ + V +
Sbjct: 78 KTSEADLIFYNGINLETGGNAWFTKLVENAKKTENKDYFAVSDGVDVIYLEGQNEKGKED 137

Query: 196 PHLWYLPSAVTAVADAVTQELSRMEPPAAGYFSQRRAQFTSAT----RLYVNLIAKIKAE 251
PH W A + ++LS +P ++ + ++T + + KI AE
Sbjct: 138 PHAWLNLENGIIFAKNIAKQLSAKDPNNKEFYEKNLKEYTDKLDKLDKESKDKFNKIPAE 197

Query: 252 AAGKSYGATETVFDYQAQAAGLVNKTPAGYRRASANESEPSPGDVDAFLTALAGRHIDLL 311
K +E F Y ++A G+ A + E E +P + + L + L
Sbjct: 198 --KKLIVTSEGAFKYFSKAYGV---PSAYIWEINT-EEEGTPEQIKTLVEKLRQTKVPSL 251

Query: 312 IYNTQTEGSIPEE-IRSAAEQSSVPV 336
E S+ + +++ ++ +++P+
Sbjct: 252 F----VESSVDDRPMKTVSQDTNIPI 273


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
MAP3778   HTHFIS  30     0.038    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.8 bits (67), Expect = 0.038
Identities = 26/179 (14%), Positives = 56/179 (31%), Gaps = 32/179 (17%)

Query: 218 ALSYLEEPEGPVAVAAVDGALAKALVLRAHVD---------EDSASEVLQDLYAAHPENE 268
L+ G + A + D +++A ++L + A P+
Sbjct: 18 VLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLPRIKKARPDLP 77

Query: 269 QVEQALTDTSFGIVTTTAARIEAR-----TDPWDPETEPSAEDFVDPAAHERKAVLLHEA 323
L ++ T E P+D L E
Sbjct: 78 ----VLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTE-----------LIGIIGRALAEP 122

Query: 324 ERQLAEFIGLEEVKNQVSRLKSSVAMELVRKQRGLAVAQRAHHLVFAGPPGTGKTTIAR 382
+R+ ++ ++ ++ + + S AM+ + + + ++ G GTGK +AR
Sbjct: 123 KRRPSK--LEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMIT-GESGTGKELVAR 178


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP3780   FLGBIOSNFLIP  35     0.002    Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature.

Length = 245

Score = 34.8 bits (80), Expect = 0.002
Identities = 20/67 (29%), Positives = 36/67 (53%), Gaps = 11/67 (16%)

Query: 24 IEAPPELP-RVIPPSF----LRRAMPYVLVI----LIVGMIVA--LFATGMRLISPQTLF 72
++ P +P R++ P++ L+ A I LI+ +++A L A GM ++ P T+
Sbjct: 159 LQGPEAVPMRILLPAYVTSELKTAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIA 218

Query: 73 FPFVLLL 79
PF L+L
Sbjct: 219 LPFKLML 225


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP3782   PF03544  29     0.048    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 28.8 bits (64), Expect = 0.048
Identities = 24/122 (19%), Positives = 34/122 (27%), Gaps = 8/122 (6%)

Query: 279 AAPAEIAPVAAPAAITPVLASNKPPTLPAAVALAPSVATPAGAPASTVAAGSGASAAPAP 338
A P + VA PA + P A PP P A V P P
Sbjct: 47 AQPISVTMVA-PADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKP 105

Query: 339 ------AAAAGNFAYLVAFGGDPDTGVGPTLSGRGGAKAPAATIPAAGAAAPARAEARAR 392
+ + + P P A A + P A+ RA +R +
Sbjct: 106 KPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSK-PVTSVASGPRALSRNQ 164

Query: 393 RR 394
+
Sbjct: 165 PQ 166


106  MAP3821  MAP3825  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP3821   0       11       3.653917   hypothetical protein
MAP3822   0       11       3.085210   hypothetical protein
MAP3823   -2      8        1.679615   hypothetical protein
MAP3824   -3      9        1.563620   hypothetical protein
MAP3825   0       10       -0.076861  UdgA
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
MAP3821   TONBPROTEIN  45     5e-07    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 44.6 bits (105), Expect = 5e-07
Identities = 29/121 (23%), Positives = 42/121 (34%), Gaps = 3/121 (2%)

Query: 371 ISVPEPVAAPKPLSLPVAAPLPAAPPPAAPPLPEAPPIPAAPPVVPVPVVVPPVPVPVPV 430
V E A +P+S+ + P PP A P PE P P P+ PP PV +
Sbjct: 33 HQVIELPAPAQPISVTMVTPADLEPPQAVQPPPEPVVEPEPEPE---PIPEPPKEAPVVI 89

Query: 431 RIPVPDPVSPPQLLAPPRLSVPQPVQPPVRVPQPPSPPQVGGTVPQSPPQHQTPPGEGTP 490
P P P P+ + + + V+P P P + S T +
Sbjct: 90 EKPKPKPKPKPKPVKKVQEQPKRDVKPVESRPASPFENTAPARLTSSTATAATSKPVTSV 149

Query: 491 P 491

Sbjct: 150 A 150



Score = 41.1 bits (96), Expect = 6e-06
Identities = 30/124 (24%), Positives = 39/124 (31%), Gaps = 7/124 (5%)

Query: 368 QPKISVPEPVAAPKPLSLPVAAPLPAAPPPAAPPLPEAPPIPAAPPVVPVPVVVPPVPVP 427
P + P V P L + P P P PE PIP P PV + P P P
Sbjct: 38 LPAPAQPISVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVV-IEKPKPKP 96

Query: 428 VPVRIPVPDPVSPPQLLAPP---RLSVPQPVQPPVRVPQPP---SPPQVGGTVPQSPPQH 481
P PV P+ P R + P P R+ + + +V P
Sbjct: 97 KPKPKPVKKVQEQPKRDVKPVESRPASPFENTAPARLTSSTATAATSKPVTSVASGPRAL 156

Query: 482 QTPP 485

Sbjct: 157 SRNQ 160



Score = 36.9 bits (85), Expect = 2e-04
Identities = 29/129 (22%), Positives = 35/129 (27%), Gaps = 8/129 (6%)

Query: 339 VALQPTPNQHLIVPTEQAPAPPPVQASAPQPKISVPEPVAAPKPLSLPVAAPLPAAPPPA 398
V P P Q + V P QA P P+ V P P P
Sbjct: 35 VIELPAPAQPISVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPI-------PEPPKEAPV 87

Query: 399 APPLPEAPPIPAAPPVVPVPVVVPPVPVPVPVRIPVPDPVSPPQLLAPPRLSVPQPVQPP 458
P+ P P PV V PV R P + P L + +P
Sbjct: 88 VIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESRPASPFENTAPARLTSSTATAA-TSKPV 146

Query: 459 VRVPQPPSP 467
V P
Sbjct: 147 TSVASGPRA 155



Score = 32.3 bits (73), Expect = 0.005
Identities = 20/103 (19%), Positives = 29/103 (28%), Gaps = 8/103 (7%)

Query: 351 VPTEQAPAPPPVQASAPQPKISVPEPVAAPKPLSLPVAAPL--------PAAPPPAAPPL 402
E P P P+ + + + +P PKP PV P PA+P
Sbjct: 67 PVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESRPASPFE 126

Query: 403 PEAPPIPAAPPVVPVPVVVPPVPVPVPVRIPVPDPVSPPQLLA 445
AP + P + P P + A
Sbjct: 127 NTAPARLTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQA 169



Score = 30.3 bits (68), Expect = 0.019
Identities = 21/80 (26%), Positives = 33/80 (41%), Gaps = 2/80 (2%)

Query: 419 VVVPPVPVPVPVRIPVPDPVSPPQLLAPPRLSVPQPVQPPVRVPQPPSPPQV--GGTVPQ 476
+ +P P+ V + P + PPQ + PP V +P P +P+PP V P+
Sbjct: 36 IELPAPAQPISVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPK 95

Query: 477 SPPQHQTPPGEGTPPKPDSP 496
P+ + PK D
Sbjct: 96 PKPKPKPVKKVQEQPKRDVK 115


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
MAP3823   PERTACTIN  30     0.029    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 29.7 bits (66), Expect = 0.029
Identities = 17/51 (33%), Positives = 20/51 (39%)

Query: 340 ADQRPEVGTQVVAPDNPPPPSAAPSPPPAAPVPAAPPPQAPVSPAPSPHSG 390
A P P P P PP P P PP + P +PAP P +G
Sbjct: 565 AKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAG 615


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
MAP3824   TONBPROTEIN  40     1e-05    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 39.6 bits (92), Expect = 1e-05
Identities = 21/61 (34%), Positives = 23/61 (37%), Gaps = 1/61 (1%)

Query: 335 PSPNQNVVVAPTRPAPAAPAPAPVQVAPVPVPEAPAPAPAAPPPPPPAVAPIPVPIPLPI 394
P+P Q + V PA P A V P P P P PP A I P P P
Sbjct: 39 PAPAQPISVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPI-PEPPKEAPVVIEKPKPKPK 97

Query: 395 P 395
P
Sbjct: 98 P 98



Score = 33.0 bits (75), Expect = 0.002
Identities = 13/51 (25%), Positives = 15/51 (29%), Gaps = 1/51 (1%)

Query: 339 QNVVVAPTRPAPAAPAPAPVQVAPVPVPEAPAPAPAAPPPPPPAVAPIPVP 389
V P P P P + P EAP P P P P+
Sbjct: 57 PPQAVQPPPEPVVEPEPEPEPIPEPP-KEAPVVIEKPKPKPKPKPKPVKKV 106



Score = 31.5 bits (71), Expect = 0.005
Identities = 16/60 (26%), Positives = 20/60 (33%), Gaps = 1/60 (1%)

Query: 348 PAPAAPAPAPVQVAPVPVPEAPAPAPAAPPPPPPAVAPIPVPIPLPIPGLGGPGFGGPPG 407
PAPA P + V P + A P P P P P+P P + P
Sbjct: 39 PAPAQPI-SVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPK 97


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP3825   NUCEPIMERASE  29     0.041    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 29.0 bits (65), Expect = 0.041
Identities = 24/87 (27%), Positives = 37/87 (42%), Gaps = 17/87 (19%)

Query: 1 MRCTVFGT-GYLGATHAVGMAELGHDVLGVDIDPGKVAKLAGGDIPFYEPGLRKLLNENL 59
M+ V G G++G + + E GH V+G+D L +Y+ L++ E L
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGID-------NLN----DYYDVSLKQARLELL 49

Query: 60 AAGRLRFTT----DYD-MAAGFADVHF 81
A +F D + M FA HF
Sbjct: 50 AQPGFQFHKIDLADREGMTDLFASGHF 76


107  MAP3849  MAP3856  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias   Product
MAP3849   -2      7        0.177780  hypothetical protein
MAP3850c  -1      8        0.270949  hypothetical protein
MAP3851c  0       8        0.121784  hypothetical protein
MAP3852c  1       10       0.090776  hypothetical protein
MAP3853   1       11       0.409083  ClpB
MAP3854   1       10       0.430386  hypothetical protein
MAP3855   2       12       2.346953  hypothetical protein
MAP3856   0       11       0.901587  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP3849   PF03544  39     6e-05    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 38.8 bits (90), Expect = 6e-05
Identities = 24/140 (17%), Positives = 30/140 (21%), Gaps = 32/140 (22%)

Query: 778 AATPSPAAPFTPPAAPFTPPFMPPAAAPAPMPPAPADYTQQLFGYLQAWRQYLEQMAGAS 837
A P A PP P P P P PP A
Sbjct: 58 ADLEPPQAVQPPPEPVVEP---EPEPEPIPEPPKEA------------------------ 90

Query: 838 SGSAQQPTAPPTAPPTMPPTMPPPPPPPSTPSTGGQAFGAAPPGQPGTSGDPTPGGSTPV 897
P P P A P + PT +T
Sbjct: 91 -----PVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAA 145

Query: 898 SATAKGSTLTWPPPLLGLEP 917
++ S + P L +P
Sbjct: 146 TSKPVTSVASGPRALSRNQP 165


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
MAP3853   HTHFIS  41     2e-05    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 41.0 bits (96), Expect = 2e-05
Identities = 41/184 (22%), Positives = 67/184 (36%), Gaps = 30/184 (16%)

Query: 550 AGRMLEGETAKLLRMEDEL--GKRVVGQKRAVQAVSDAVRRARAGVADPNRPTGSFMFLG 607
GR L + ++ED+ G +VG+ A+Q + + R + + M G
Sbjct: 115 IGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLAR----LMQTDLTL---MITG 167

Query: 608 PTGVGKTELAKALADFLFDDKRAMVRIDMSEYGEKHSVARLVGAPPGYIGYDQGGQLTEA 667
+G GK +A+AL D+ V I+M+ + L G + G T A
Sbjct: 168 ESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGH--------EKGAFTGA 219

Query: 668 VRRRPYTV-------ILFDEIEKAHPDVFDVLLQVLDEG---RLTDGQGRTVDFRNTILI 717
R + DEI D LL+VL +G + D R ++
Sbjct: 220 QTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVR---IV 276

Query: 718 LTSN 721
+N
Sbjct: 277 AATN 280



Score = 30.2 bits (68), Expect = 0.034
Identities = 18/101 (17%), Positives = 39/101 (38%), Gaps = 14/101 (13%)

Query: 125 LLTGHGASPQALREAFVKVRGSARVTSPDP-------EATYQALEKYSTDLTARAREG-K 176
+++ A++ A P P +AL + + +
Sbjct: 80 VMSAQNTFMTAIKA----SEKGAYDYLPKPFDLTELIGIIGRALAEPKRRPSKLEDDSQD 135

Query: 177 LDPVIGRDNEIRRVVQVLSRRTKNN-PVLI-GEPGVGKTAI 215
P++GR ++ + +VL+R + + ++I GE G GK +
Sbjct: 136 GMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELV 176


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP3855   PF05616  32     0.003    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 31.6 bits (71), Expect = 0.003
Identities = 26/73 (35%), Positives = 32/73 (43%), Gaps = 11/73 (15%)

Query: 201 PLPEETPQEAPA---APRNSSPSRPLAPAGRAELPPRRAQQDVAGLLGPDVQPGRRAAEP 257
PLPE +P E PA AP + +RP P +L P A D G QPG R P
Sbjct: 327 PLPEVSPAENPANNPAPNENPGTRP-NPEPDPDLNP-DANPDTDG------QPGTRPDSP 378

Query: 258 VRRDQPRPEARPD 270
D+P R +
Sbjct: 379 AVPDRPNGRHRKE 391


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP3856   DHBDHDRGNASE  70     2e-16    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 70.5 bits (172), Expect = 2e-16
Identities = 50/184 (27%), Positives = 86/184 (46%), Gaps = 4/184 (2%)

Query: 5 VALITGPTSGIGAGYARRYASDGYDLVLVARDVDRLTALAGELRDRAGNIEILPADLGDA 64
+A ITG GIG AR AS G + V + ++L + L+ A + E PAD+ D+
Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDS 69

Query: 65 ADRQRVAERLA---AGVRVLVNNAGFATSGDFWHTDPALLQSQLDVNVTAVMHLTRAALP 121
A + R+ + +LVN AG G ++ VN T V + +R+
Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSK 129

Query: 122 AMLDAGAGTVINIASIAGLLPGRG-STYSASKAWVISFSEGLSVGLQGTGVSVHAVCPGY 180
M+D +G+++ + S +P + Y++SKA + F++ L + L + + V PG
Sbjct: 130 YMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGS 189

Query: 181 VRTE 184
T+
Sbjct: 190 TETD 193


108  MAP3886  MAP3893c  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP3886   0       9        -1.597491  acetate kinase
MAP3887c  0       8        -1.251963  hypothetical protein
MAP3888c  -1      7        -1.192763  hypothetical protein
MAP3889   1       9        -1.658457  hypothetical protein
MAP3890   2       9        -1.110333  hypothetical protein
MAP3891   2       10       1.278741   hypothetical protein
MAP3892   3       12       2.081587   hypothetical protein
MAP3893c  3       13       2.376941   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP3886   ACETATEKNASE  479    e-172    Acetate kinase family signature.
		>ACETATEKNASE#Acetate kinase family signature.

Length = 400

Score = 479 bits (1234), Expect = e-172
Identities = 179/397 (45%), Positives = 250/397 (62%), Gaps = 20/397 (5%)

Query: 9 RVLVINSGSSSLKFQLVDPEFGVAASTGIVERIGEESS---------------PVPDHDA 53
++LVIN GSSSLK+QL++ + G + G+ ERIG S + DH
Sbjct: 2 KILVINCGSSSLKYQLIESKDGNVLAKGLAERIGINDSLLTHNANGEKIKIKKDMKDHKD 61

Query: 54 ALRRAFDMLAGD--GVDLNTAGLVAVGHRVVHGGNTFYRPTVLDDAVIARLHELSELAPL 111
A++ D L GV + + + AVGHRVVHGG F ++ D V+ + + ELAPL
Sbjct: 62 AIKLVLDALVNSDYGVIKDMSEIDAVGHRVVHGGEYFTSSVLITDDVLKAITDCIELAPL 121

Query: 112 HNPPALQGIEVARRLLPDIAHVAVFDTGFFHDLPPAAATYAIDRELADRWQIRRYGFHGT 171
HNP ++GI+ +++PD+ VAVFDT F +P A Y I E +++IR+YGFHGT
Sbjct: 122 HNPANIEGIKACTQIMPDVPMVAVFDTAFHQTMPDYAYLYPIPYEYYTKYKIRKYGFHGT 181

Query: 172 SHRYVSEQAAAFLDRPLRGLKQIVLHLGNGCSASAIAGTRPLDTSMGLTPLEGLVMGTRS 231
SH+YVS++AA L++P+ LK I HLGNG S +A+ + +DTSMG TPLEGL MGTRS
Sbjct: 182 SHKYVSQRAAEILNKPIESLKIITCHLGNGSSIAAVKNGKSIDTSMGFTPLEGLAMGTRS 241

Query: 232 GDIDPSIVSYLCHTAGMGVDDVESMLNHRSGVVGLSGV-RDFRRLREL-IESGDGAAQLA 289
G IDPSI+SYL + ++V ++LN +SGV G+SG+ DFR L + ++GD AQLA
Sbjct: 242 GSIDPSIISYLMEKENISAEEVVNILNKKSGVYGISGISSDFRDLEDAAFKNGDKRAQLA 301

Query: 290 YSVFTHRLRKYIGAYLAVLGHTDVISFTAGIGENDAAVRRDAVSGMEELGIVLDERRNLA 349
+VF +R++K IG+Y A +G DVI FTAGIGEN +R + G+E LG LD+ +N
Sbjct: 302 LNVFAYRVKKTIGSYAAAMGGVDVIVFTAGIGENGPEIREFILDGLEFLGFKLDKEKNKV 361

Query: 350 GGKGARQISADDSPITVLVVPTNEELAIARDCVRVLG 386
G+ A IS DS + V+VVPTNEE IA+D +++
Sbjct: 362 RGEEAI-ISTADSKVNVMVVPTNEEYMIAKDTEKIVE 397


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP3890   ACRIFLAVINRP  51     2e-08    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 51.0 bits (122), Expect = 2e-08
Identities = 49/288 (17%), Positives = 105/288 (36%), Gaps = 46/288 (15%)

Query: 151 SYESVAAVRKIVDSTPA--PPGVKAYVAGNTVLNADTSIVGHKSMATMALVSIVVIFVML 208
+ ++ A++ + P G+K +T SI + +I+++F+++
Sbjct: 299 ALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSI---HEVVKTLFEAIMLVFLVM 355

Query: 209 LVVYRSIVTTVLSLVIIGIELFAAQGITATAG-NLNIIGLTPYAVSMITMLSIAAGTDYV 267
+ +++ T++ + + + L I A G ++N + + L+I D
Sbjct: 356 YLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMV------LAIGLLVDDA 409

Query: 268 IFLLGRYHEARSFGQDREEAFYTAYHGVSHVILGSGLTIAGACLCLTAARLP-------- 319
I ++ R +D+ +S + + G + L+A +P
Sbjct: 410 IVVVENVE--RVMMEDKLPPKEATEKSMSQI----QGALVGIAMVLSAVFIPMAFFGGST 463

Query: 320 --YFQTMGLPCAIAMVVIVLAALTLAPAILAV-------------GSRFGLFDPKRAIDV 364
++ + AM + VL AL L PA+ A G FG F+ V
Sbjct: 464 GAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSV 523

Query: 365 RGWRKVGTAVVRWPKPIIVVTAAIAVIGFISLLTYVPNYDDQKFTPKD 412
+ ++ +++ A I V G + L +P F P++
Sbjct: 524 NHYTNSVGKILGSTGRYLLIYALI-VAGMVVLFLRLP----SSFLPEE 566



Score = 38.3 bits (89), Expect = 2e-04
Identities = 51/261 (19%), Positives = 93/261 (35%), Gaps = 27/261 (10%)

Query: 684 FKNPDFKRGLKMFVSPDGTAVRF---------------IITHQGDPASVEGIKHVAGVKD 728
FKNP+ + + V+ DG+ VR I G PA+ GIK G
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 729 -AVADAIKGTPLESSKVYLAG----TASMYSDMQEGVIIDLLVAGISCLILIFTIMLIIT 783
A AIK E + G + + I +++ ++L+F +M +
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 784 RSVVAALVIVGTVAASLGTACGLSVLMWQDLIGLGVQWIVLPLSIVILLAVGSDYNLLLV 843
+++ A L+ V L + + L + +VL + +++ A+ N V
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVEN---V 416

Query: 844 SRLKEEIPAGLNTGIIRGMGASGRVVTAAGLVFAFT---MASMIVSQLRVIGELGTTIAL 900
R+ E + M + +V + MA S + + TI
Sbjct: 417 ERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVS 476

Query: 901 GLLVDTLIVRSFMTPSIAAAL 921
+ + L+ TP++ A L
Sbjct: 477 AMALSVLVALIL-TPALCATL 496


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
MAP3891   HTHTETR  69     1e-16    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 69.3 bits (169), Expect = 1e-16
Identities = 28/168 (16%), Positives = 58/168 (34%), Gaps = 8/168 (4%)

Query: 16 QRTEGRLDRSRDPAILDAALAALAEHGYDATNMNDIAARAGVGKAAIYRRWSSKAALMTD 75
++T+ +R ILD AL ++ G +T++ +IA AGV + AIY + K+ L ++
Sbjct: 3 RKTKQEAQETRQ-HILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 76 AL----IYWRPELLNDDAPDTGSLAGDLDAIVKRAKRNDNALISNDLVLRV---ALEAAH 128
L A G L I+ + L++ + E
Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVG 121

Query: 129 DPELATALNDLILFKGRRVLSAVLAQAADRGEIDPNRDWSLVADVLTA 176
+ + + + + L + + + A ++
Sbjct: 122 EMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRG 169


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP3893c  YERSSTKINASE  32     0.010    Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 32.0 bits (72), Expect = 0.010
Identities = 25/80 (31%), Positives = 41/80 (51%), Gaps = 5/80 (6%)

Query: 273 ILPALGYLHSIGLVYNDLKPENIMLTEEQLK--LIDLGAVSRINSFGYLYGTPGFQAPE- 329
+L +L G+V+ND+KP N++ + +IDLG SR + T F+APE
Sbjct: 254 LLDVTNHLAKAGVVHNDIKPGNVVFDRASGEPVVIDLGLHSRSGEQPKGF-TESFKAPEL 312

Query: 330 -IVRTGPTVATDIYTVGRTL 348
+ G + +D++ V TL
Sbjct: 313 GVGNLGASEKSDVFLVVSTL 332


109  MAP4139  MAP4146  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
MAP4139   5       21       -2.634137  hypothetical protein
MAP4140   4       19       -3.093332  30S ribosomal protein S12
MAP4141   4       18       -2.557550  30S ribosomal protein S7
MAP4142   4       16       -1.749085  elongation factor G
MAP4143   2       11       -0.536734  elongation factor Tu
MAP4144   1       10       -0.342103  hypothetical protein
MAP4145   0       11       0.651929   hypothetical protein
MAP4146   0       10       0.760605   3-ketoacyl-ACP reductase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP4139   TETREPRESSOR  48     4e-09    Tetracycline repressor protein signature.
		>TETREPRESSOR#Tetracycline repressor protein signature.

Length = 218

Score = 48.0 bits (114), Expect = 4e-09
Identities = 22/44 (50%), Positives = 29/44 (65%)

Query: 24 AKLSREGIIDGALTFLDREGWDALTINALATQLGTKGPSLYNHV 67
A+L+RE +ID AL L+ G D LT LA +LG + P+LY HV
Sbjct: 2 ARLNRESVIDAALELLNETGIDGLTTRKLAQKLGIEQPTLYWHV 45


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
MAP4142   TCRTETOQM  589    0.0      Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 589 bits (1519), Expect = 0.0
Identities = 161/676 (23%), Positives = 311/676 (46%), Gaps = 71/676 (10%)

Query: 12 KVRNIGIMAHIDAGKTTTTERILYYTGISYKIGEVHDGAATMDWMEQEQERGITITSAAT 71
K+ NIG++AH+DAGKTT TE +LY +G ++G V G D E++RGITI + T
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61

Query: 72 TCFWNDNQINIIDTPGHVDFTVEVERSLRVLDGAVAVFDGKEGVEPQSEQVWRQADKYDV 131
+ W + ++NIIDTPGH+DF EV RSL VLDGA+ + K+GV+ Q+ ++ K +
Sbjct: 62 SFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGI 121

Query: 132 PRICFVNKMDKIGADFYFSVRTMEERLGANVIPIQIPVGSEGDFEGVVDLVEMKAKVWSA 191
P I F+NK+D+ G D + ++E+L A ++ Q
Sbjct: 122 PTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQ------------------------- 156

Query: 192 EAKLGEKYDVVDIPADLQEKAEEYRTKLLEAVAETDEALLDKYLGGEELTIEEIKGAIRK 251
+ +L V + ++E++ + V E ++ LL+KY+ G+ L E++
Sbjct: 157 KVELYPNMCVTNFT-----ESEQW-----DTVIEGNDDLLEKYMSGKSLEALELEQEESI 206

Query: 252 LTISSEAYPVLCGSAFKNKGVQPMLDAVIDYLPSPLDVPPAEGHVPGKEEELITRKPSTD 311
+ +PV GSA N G+ +++ + + S
Sbjct: 207 RFHNCSLFPVYHGSAKNNIGIDNLIEVITNKFYSSTH--------------------RGQ 246

Query: 312 EPFSALAFKVATHPFFGKLTYVRVYSGKVDSGSQVINSTKGKKERLGKLFQMHSNKENPV 371
FK+ +L Y+R+YSG + V S K K ++ +++ + + +
Sbjct: 247 SELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKI-KITEMYTSINGELCKI 305

Query: 372 ETASAGHIYAVIG----LKDTTTGDTLSDPNHQIVLESMTFPDPVIEVAIEPKTKSDQEK 427
+ A +G I + L GDT P E + P P+++ +EP +E
Sbjct: 306 DKAYSGEIVILQNEFLKLNS-VLGDTKLLPQR----ERIENPLPLLQTTVEPSKPQQREM 360

Query: 428 LSLSIQKLAEEDPTFKVHLDQETGQTVIGGMGELHLDILVDRMRREFKVEANVGKPQVAY 487
L ++ ++++ DP + ++D T + ++ +G++ +++ ++ ++ VE + +P V Y
Sbjct: 361 LLDALLEISDSDPLLRYYVDSATHEIILSFLGKVQMEVTCALLQEKYHVEIEIKEPTVIY 420

Query: 488 KETIRRKVENVEYTHKKQTGGSGQFAKVIINLEPFTGEDGATYEFENKVTGGRIPREYIP 547
E R ++ EYT + + +A + +++ P G+ ++E+ V+ G + + +
Sbjct: 421 ME---RPLKKAEYTIHIEVPPNPFWASIGLSVSP--LPLGSGMQYESSVSLGYLNQSFQN 475

Query: 548 SVDAGAQDAMQYGVLAGYPLVNLKVTLLDGAFHEVDSSEMAFKIAGSQVLKKAAAQAQPV 607
+V G + + G L G+ + + K+ G ++ S+ F++ VL++ +A
Sbjct: 476 AVMEGIRYGCEQG-LYGWNVTDCKICFKYGLYYSPVSTPADFRMLAPIVLEQVLKKAGTE 534

Query: 608 ILEPIMAVEVTTPEDYMGDVIGDLNSRRGQIQAMEERSGARVVKAHVPLSEMFGYVGDLR 667
+LEP ++ ++ P++Y+ D I + ++ ++ +P + Y DL
Sbjct: 535 LLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKNNEVILSGEIPARCIQEYRSDLT 594

Query: 668 SKTQGRANYSMVFDSY 683
T GR+ Y
Sbjct: 595 FFTNGRSVCLTELKGY 610


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
MAP4143   TCRTETOQM  80     4e-18    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 79.5 bits (196), Expect = 4e-18
Identities = 54/160 (33%), Positives = 80/160 (50%), Gaps = 23/160 (14%)

Query: 13 VNIGTIGHVDHGKTTLTAAITKVLHDKYPDLNESRAFDQI----------DNAPEERQRG 62
+NIG + HVD GKTTLT ++ L S A ++ DN ERQRG
Sbjct: 4 INIGVLAHVDAGKTTLTESL----------LYNSGAITELGSVDKGTTRTDNTLLERQRG 53

Query: 63 ITINISHVEYQTDKRHYAHVDAPGHADYIKNMITGAAQMDGAILVVAATDGPMPQTREHV 122
ITI +Q + +D PGH D++ + + +DGAIL+++A DG QTR
Sbjct: 54 ITIQTGITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILF 113

Query: 123 LLARQVGVPYILVALNKADMVDDEELLELVEMEVRELLAA 162
R++G+P I +NK D + L V +++E L+A
Sbjct: 114 HALRKMGIPTI-FFINKIDQNGID--LSTVYQDIKEKLSA 150


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
MAP4146   DHBDHDRGNASE  118    3e-34    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 118 bits (297), Expect = 3e-34
Identities = 84/273 (30%), Positives = 127/273 (46%), Gaps = 23/273 (8%)

Query: 5 AGSLQGRVAFITGAARGQGRSHAVRLAAEGADIIACDICAPVSASVTYAPASPEDLDETA 64
A ++G++AFITGAA+G G + A LA++GA I A D +PE L++
Sbjct: 3 AKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVD-------------YNPEKLEKVV 49

Query: 65 RLVEDQGRKALTRVLDVRDDAALRELVADGMEQFGRLDVVVANAGVLSWGRVWELTDEQW 124
++ + R A DVRD AA+ E+ A + G +D++V AGVL G + L+DE+W
Sbjct: 50 SSLKAEARHAEAFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEW 109

Query: 125 DTVIGVNLTGTWRTLRATVPAMIEAGNGGSIVVVSSSAGLKATPGNGHYSASKHGLTALT 184
+ VN TG + R+ M GSIV V S+ Y++SK T
Sbjct: 110 EATFSVNSTGVFNASRSVSKYM-MDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFT 168

Query: 185 NTLAIELGEYGIRVNSIHPYSVETPM-----IEPEAMMEIFARHPSFVHSFPPMPVQPNG 239
L +EL EY IR N + P S ET M + ++ + +F +
Sbjct: 169 KCLGLELAEYNIRCNIVSPGSTETDMQWSLWADENGAEQVIK---GSLETF-KTGIPLKK 224

Query: 240 FMTADEVADVVAWLAGDGSGTLTGTQIPVDKGA 272
++AD V +L +G +T + VD GA
Sbjct: 225 LAKPSDIADAVLFLVSGQAGHITMHNLCVDGGA 257



 
Contact Sachin Pundhir for Bugs/Comments.