PredictBias

Identification of genomic and pathogenicity islands in prokaryotic genomes
 
A) Input parameters
Genome: 1667.gbk
Threshold dinucleotide bias: 2
Threshold codon bias: 4
Threshold %GC bias: 3
E-value (RPSBlast): 0.05
Genome (non-pathogenic):
 
 
C) Potential GIs and PAIs in NC_010612
S.No  Start      End        Bias  Virulence  Insertion elements  Prediction
1     MMAR_0020  MMAR_0025  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_0020  2       15       -1.123415  serine/threonine phosphatase PstP
MMAR_0021  4       19       -2.504449  hypothetical protein
MMAR_0022  3       19       -3.076127  hypothetical protein
MMAR_0023  2       25       -5.101625  *killer suppression protein
MMAR_0024  2       23       -4.830235  plasmid maintenance system antidote protein
MMAR_0025  2       21       -3.972296  hypothetical protein
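
As a rough illustration of how the input thresholds in section A relate to the per-ORF values in the table above, the following Python sketch reports which of the three composition criteria an ORF meets. The comparison rule (absolute value at or above the threshold) and the names `Thresholds`, `OrfBias` and `criteria_met` are assumptions made for illustration; this page does not state how PredictBias applies its thresholds.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass(frozen=True)
class Thresholds:
    """Threshold values copied from the 'Input parameters' section."""
    dinucleotide: float = 2.0
    codon: float = 4.0
    gc: float = 3.0


@dataclass(frozen=True)
class OrfBias:
    """One row of the per-ORF bias table (hypothetical container)."""
    locus_tag: str
    dn_bias: float
    cdn_bias: float
    gc_bias: float


def criteria_met(orf: OrfBias, t: Thresholds = Thresholds()) -> Dict[str, bool]:
    # ASSUMPTION: a criterion counts as met when the magnitude of the value
    # is at or above the threshold. The real tool may use a different rule.
    return {
        "dinucleotide": abs(orf.dn_bias) >= t.dinucleotide,
        "codon": abs(orf.cdn_bias) >= t.codon,
        "gc": abs(orf.gc_bias) >= t.gc,
    }


if __name__ == "__main__":
    # Values read from the first island's table.
    rows = [
        OrfBias("MMAR_0020", 2, 15, -1.123415),
        OrfBias("MMAR_0023", 2, 25, -5.101625),
    ]
    for row in rows:
        print(row.locus_tag, criteria_met(row))
```

Keeping the thresholds in a single object makes it easy to rerun the same comparison with different cutoffs.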
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0020  PF03544  32     0.004    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 31.9 bits (72), Expect = 0.004
Identities = 20/91 (21%), Positives = 25/91 (27%)

Query: 419 PPCPAPRATSPPESSAPSTASETPGQPSVTSSPASTTPPTPSASTTPTTTSGSTAATSTP 478
PP P PE P P + ++P
Sbjct: 69 PPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASP 128

Query: 479 PTGTSPAAPTSPTPTLTTSSPNVTALPPPPP 509
T+PA PTS T T TS P + P
Sbjct: 129 FENTAPARPTSSTATAATSKPVTSVASGPRA 159
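
The statistics lines that head each alignment in this report (Score, Expect, Identities) follow the usual BLAST text layout, so they can be pulled out with a short script. The Python sketch below is one hedged way to do that and to re-apply the E-value cutoff of 0.05 listed in the input parameters. The names `Hsp`, `parse_hsps` and `passes_cutoff` are hypothetical helpers, not part of PredictBias or BLAST.

```python
import re
from dataclasses import dataclass
from typing import List

# Patterns for the two statistics lines printed above each alignment.
SCORE_RE = re.compile(r"Score = ([\d.]+) bits \((\d+)\), Expect = ([\d.eE+-]+)")
IDENT_RE = re.compile(r"Identities = (\d+)/(\d+)")


@dataclass
class Hsp:
    bits: float
    raw_score: int
    evalue: float
    identical: int
    aligned: int

    @property
    def percent_identity(self) -> float:
        return 100.0 * self.identical / self.aligned


def parse_hsps(text: str) -> List[Hsp]:
    """Collect one Hsp per 'Score' line that is followed by an 'Identities' line."""
    hsps = []
    pending = None  # most recent Score match still waiting for its Identities line
    for line in text.splitlines():
        score = SCORE_RE.search(line)
        if score:
            pending = score
            continue
        ident = IDENT_RE.search(line)
        if ident and pending:
            hsps.append(Hsp(
                bits=float(pending.group(1)),
                raw_score=int(pending.group(2)),
                evalue=float(pending.group(3)),
                identical=int(ident.group(1)),
                aligned=int(ident.group(2)),
            ))
            pending = None
    return hsps


def passes_cutoff(hsp: Hsp, max_evalue: float = 0.05) -> bool:
    # 0.05 is the 'E-value (RPSBlast)' value shown in the input parameters.
    return hsp.evalue <= max_evalue


if __name__ == "__main__":
    sample = (
        "Score = 31.9 bits (72), Expect = 0.004\n"
        "Identities = 20/91 (21%), Positives = 25/91 (27%)\n"
    )
    for hsp in parse_hsps(sample):
        print(hsp, passes_cutoff(hsp))
```

Only the statistics lines are parsed here; the alignment rows themselves are ignored.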


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_0021  IGASERPTASE  28     0.018    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 28.1 bits (62), Expect = 0.018
Identities = 14/55 (25%), Positives = 21/55 (38%)

Query: 88 ADDSTLVLTDDYASTRHARLSQRGSEWYVEDLGSTNGTYLDRAKVTTAVRVPMGT 142
+D T +DY R + + S GTY D+ K VR+ G+
Sbjct: 154 TEDQTQKRREDYYMPRLDKFVTEVAPIEASTASSDAGTYNDQNKYPAFVRLGSGS 208


S.No  Start      End        Bias  Virulence  Insertion elements  Prediction
2     MMAR_0072  MMAR_0094  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_0072  2       17       -0.664547  30S ribosomal protein S6
MMAR_0073  2       15       -0.983268  single-stranded DNA-binding protein
MMAR_0074  4       15       -1.456608  30S ribosomal protein S18
MMAR_0075  3       11       -0.360731  50S ribosomal protein L9
MMAR_0076  2       10       -0.338482  replicative DNA helicase DnaB
MMAR_0077  1       14       -0.030335  hypothetical protein
MMAR_0078  2       14       -0.030308  hypothetical protein
MMAR_0079  2       13       0.142377   hypothetical protein
MMAR_0080  3       13       0.210831   carbon monoxyde dehydrogenase large chain
MMAR_0081  2       12       0.072845   carbon monoxyde dehydrogenase medium chain
MMAR_0082  2       13       -0.091722  carbon monoxyde dehydrogenase small chain
MMAR_0083  2       12       -0.042706  amidohydrolase
MMAR_0084  1       13       -0.081042  short chain dehydrogenase
MMAR_0085  0       12       0.131385   transcriptional regulatory protein
MMAR_0086  -2      12       -0.129891  hydrolase
MMAR_0087  -2      12       -0.281171  hypothetical protein
MMAR_0088  -1      16       -1.795058  transcriptional regulator
MMAR_0089  0       17       -4.852710  hypothetical protein
MMAR_0090  0       18       -5.698250  transposase, ISMyma01_aa2
MMAR_0091  -1      22       -6.465393  transposase, ISMyma01_aa1
MMAR_0092  0       23       -6.874660  hypothetical protein
MMAR_0093  0       23       -5.322183  integral membrane efflux protein ErmB
MMAR_0094  0       25       -3.450315  transmembrane transport protein MmpL
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0073  cloacin  29     0.008    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 29.3 bits (65), Expect = 0.008
Identities = 15/30 (50%), Positives = 15/30 (50%)

Query: 123 GGGGGGFGGGGGGGGGGGARQAPAQASSPA 152
GGG G GGG G GG A AP PA
Sbjct: 65 GGGNGNSGGGSGTGGNLSAVAAPVAFGFPA 94



Score = 28.5 bits (63), Expect = 0.013
Identities = 15/46 (32%), Positives = 19/46 (41%)

Query: 123 GGGGGGFGGGGGGGGGGGARQAPAQASSPAGGDDPWGSAPASGSFG 168
G G G GG G GGG G + ++P P S P +G
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 28.5 bits (63), Expect = 0.016
Identities = 16/44 (36%), Positives = 19/44 (43%), Gaps = 7/44 (15%)

Query: 118 NKASRGGGGGGFGGGGGGGGGGGARQA-------PAQASSPAGG 154
GGG G GGG G GG + A PA ++ AGG
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_0084  DHBDHDRGNASE  41     9e-07    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 41.2 bits (96), Expect = 9e-07
Identities = 37/177 (20%), Positives = 69/177 (38%), Gaps = 25/177 (14%)

Query: 3 AVVVGSG-AVGAAVARTLRAHDHEVVSV-----------------GRTSGDYLADLTDIG 44
A + G+ +G AVARTL + + +V R + + AD+ D
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSA 70

Query: 45 SLR----RLFEAIGEFAAVACAAGDVFPAQLEMSTDKQWTDSIAAKGRGQIDLVRAALPH 100
++ R+ +G + AG + P + +D++W + + G + R+ +
Sbjct: 71 AIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSKY 130

Query: 101 IADRGSFTLISGVLGEEITAACTIGA--TVNAMVEGFVVAAATEL-PRGIRINCVSP 154
+ DR S ++++ ++ A + A F EL IR N VSP
Sbjct: 131 MMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_0087  SACTRNSFRASE  35     3e-04    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 35.3 bits (81), Expect = 3e-04
Identities = 16/58 (27%), Positives = 27/58 (46%)

Query: 520 RLQIEGLRVAKAERAQGLGTALVEWAHNYGRAHGAQLAQVTTDEARERARAFYRRLGY 577
IE + VAK R +G+GTAL+ A + + + + T + A FY + +
Sbjct: 89 YALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0088  HTHTETR  67     7e-16    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 67.0 bits (163), Expect = 7e-16
Identities = 29/152 (19%), Positives = 54/152 (35%), Gaps = 5/152 (3%)

Query: 17 STERKGQRTRRRILDAARAVFAEVGYERATIRGIAAAAGVDKSSIIKYFGTKQALFHEAV 76
T+++ Q TR+ ILD A +F++ G ++ IA AAGV + +I +F K LF E
Sbjct: 4 KTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIW 63

Query: 77 HWD----IPVAELTTDDAGQTTENYARAMLTAWAADPNSPMAVLLRTSMTSEDAADILRR 132
+ + R +L + L + +
Sbjct: 64 ELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEM 123

Query: 133 HITAQGVDAVA-ATIDASDARLRAAVAGAILM 163
+ Q + + D + L+ + +L
Sbjct: 124 AVVQQAQRNLCLESYDRIEQTLKHCIEAKMLP 155


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0093  TCRTETB  131    3e-35    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 131 bits (331), Expect = 3e-35
Identities = 89/413 (21%), Positives = 173/413 (41%), Gaps = 23/413 (5%)

Query: 47 ICVFASVAVNLANTAVSVAQRSLIVTFGSNQAVVAWTVTAYTLTEAAAIPLSGWAADRIG 106
+C+ + +V L ++V+ + F A W TA+ LT + + G +D++G
Sbjct: 19 LCILSFFSV-LNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLG 77

Query: 107 TKRLFMISVLGFTLGSVLCAVAPNIACLIIF-RAVQGGGGGILMPLVITILAREAGPNRL 165
KRL + ++ GSV+ V + L+I R +QG G LV+ ++AR
Sbjct: 78 IKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENR 137

Query: 166 ARLMSVMGIPLLLGPMAGPILGGWLIDDYGWQWIFWINVPIGLITVALAAIAFPGDHTAP 225
+ ++G + +G GP +GG + W ++ +P+ I +
Sbjct: 138 GKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLL--LIPMITIITVPFLMKLLKKEVRI 195

Query: 226 SETLDIIGMLLLSPGLATFLYGLSTVPARGTVADRHVLIPATAGLVLMGAFVFHALYRAD 285
DI G++L+S G+ F+ T LI + ++ FV H +
Sbjct: 196 KGHFDIKGIILMSVGIVFFMLFT-------TSYSISFLIVSVLSFLI---FVKHIR-KVT 244

Query: 286 RPLIDLRLFRNR--VVTVANATIVFVAAGFSGAVLLVPSYFQQLLRETPLQVG-IHMIPL 342
P +D L +N ++ V I+F +G V +VP + + + + ++G + + P
Sbjct: 245 DPFVDPGLGKNIPFMIGVLCGGIIFGT--VAGFVSMVPYMMKDVHQLSTAEIGSVIIFPG 302

Query: 343 GLGAAVTIPTSSVLMDRHGAGKVVLGGVTLISVGMGTLAFGAAEHAAYVPTLLIGLTIVG 402
+ + +L+DR G V+ GVT +SV T +F + ++ I + V
Sbjct: 303 TMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFM---TIIIVFVL 359

Query: 403 MGIGSIMLQLTTVAVQTLAPHQIARGSTLVSVNQQLSASASTALMSVILTSQF 455
G+ ++T+ +L + G +L++ LS A++ +L+
Sbjct: 360 GGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPL 412


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_0094  ACRIFLAVINRP  49     7e-08    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 49.5 bits (118), Expect = 7e-08
Identities = 51/302 (16%), Positives = 98/302 (32%), Gaps = 51/302 (16%)

Query: 153 IEAVRQILARTPP--PPGIKVYVTGPSALTADMSRTGDKSL--VIVTMI--SVLVIFTML 206
+A++ LA P P G+KV D + S+ V+ T+ +LV M
Sbjct: 303 AKAIKAKLAELQPFFPQGMKVLYP------YDTTPFVQLSIHEVVKTLFEAIMLVFLVMY 356

Query: 207 LLVYRSIVTVTLLLITVGIELTAARGVVAFLAWHGVIGLSTYAINLLT----TMAIAAGT 262
L +++ + I V + L ++A G S IN LT +AI
Sbjct: 357 L-FLQNMRATLIPTIAVPVVLLGTFAILAAF------GYS---INTLTMFGMVLAIGLLV 406

Query: 263 DYSIFIIGRYQEARQ-AGEDAETAFYTMYRGVAHVILGSGLTIAGA---MYCLTFTRMPY 318
D +I ++ + + A + ++G + ++ M +
Sbjct: 407 DDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAI 466

Query: 319 FQSMGIPCAAGMLVAVVAALTMGPAVLAL-------------GSRFGLFDPKRKIKTRGW 365
++ I + M ++V+ AL + PA+ A G FG F+ +
Sbjct: 467 YRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHY 526

Query: 366 RRIGTAVVRWPAPILTATCAVSLVGLIALPGYQTNYDDQTYVPENIPANAGYAAANRHFP 425
++ L ++A +++PE + G P
Sbjct: 527 TNSVGKILGSTGRYLLI-----YALIVAGMVVLFLRLPSSFLPEE---DQGVFLTMIQLP 578

Query: 426 PS 427

Sbjct: 579 AG 580



Score = 48.7 bits (116), Expect = 1e-07
Identities = 33/162 (20%), Positives = 66/162 (40%), Gaps = 7/162 (4%)

Query: 761 DLIIAGLSSLCLIFIIMLLITRGFVAALVIVGTVALSLGVSFGLSVLLWQHLLRIELHWL 820
+++ ++ L+F++M L + A L+ V + L +F + + + + +
Sbjct: 339 EVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGM 398

Query: 821 VLAMSVIVLLAVGSDYNLLLVSRLKEEVGAGIKTGIIRAMGGTGKVVTSAGLVFA---LT 877
VLA+ ++V A+ N V R+ E K ++M + +V + +
Sbjct: 399 VLAIGLLVDDAIVVVEN---VERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIP 455

Query: 878 MASMAVSDLIVIGQIGTTIGLGLLFDTLIVRSLMTPSIAALL 919
MA S + Q TI + L+ L TP++ A L
Sbjct: 456 MAFFGGSTGAIYRQFSITIVSAMALSVLVALIL-TPALCATL 496



Score = 40.2 bits (94), Expect = 4e-05
Identities = 40/213 (18%), Positives = 76/213 (35%), Gaps = 15/213 (7%)

Query: 145 GGPLANQSIEAVRQILAR--TPPPPGIKVYVTGPSALTADMSRTGDKSLVIVTMISVLVI 202
G S ++ + P GI TG ++ +G + IS +V+
Sbjct: 828 GEAAPGTSSGDAMALMENLASKLPAGIGYDWTG---MSYQERLSG-NQAPALVAISFVVV 883

Query: 203 FTMLLLVYRSIVTVTLLLITVGIELTAARGVVAFLAWHGVIGLSTYAINLLTTMAIAAGT 262
F L +Y S +++ V + + GV+ + + LLTT+ ++A
Sbjct: 884 FLCLAALYESWSIPVSVMLVVPLGIV---GVLLAATLFNQKNDVYFMVGLLTTIGLSAKN 940

Query: 263 DYSIFIIGRYQEARQA-GEDAETAFYTMYRGVAHVILGSGLTIAGAMYCLTFTRMP---Y 318
+I I+ ++ + G+ A R IL + L + L +
Sbjct: 941 --AILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGA 998

Query: 319 FQSMGIPCAAGMLVAVVAALTMGPAVLALGSRF 351
++GI GM+ A + A+ P + R
Sbjct: 999 QNAVGIGVMGGMVSATLLAIFFVPVFFVVIRRC 1031


S.No  Start      End        Bias  Virulence  Insertion elements  Prediction
3     MMAR_0157  MMAR_0193  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_0157  0       11       -3.625189  adenylate cyclase CyaA
MMAR_0158  0       12       -4.770247  isocitrate dehydrogenase [NADP] Icd2
MMAR_0159  -1      18       -4.474039  hypothetical protein
MMAR_0160  -1      15       -4.129459  hypothetical protein
MMAR_0161  -1      15       -3.423881  hypothetical protein
MMAR_0162  -1      14       -2.654884  hypothetical protein
MMAR_0163  -1      14       -2.045888  hypothetical protein
MMAR_0164  -2      15       -2.188066  nucleoside-diphosphate-sugar epimerase
MMAR_0165  -1      14       -2.154436  hypothetical protein
MMAR_0166  -1      14       -2.281318  4-aminobutyrate aminotransferase
MMAR_0167  0       14       -3.301328  glutamate decarboxylase
MMAR_0168  1       15       -4.590805  hypothetical protein
MMAR_0169  -1      13       -3.237082  hypothetical protein
MMAR_0170  -1      13       -2.418465  hypothetical protein
MMAR_0171  0       12       -2.694019  ABC transporter ATP-binding protein
MMAR_0172  -1      12       -2.508535  hypothetical protein
MMAR_0173  -1      13       -2.392331  acyl carrier protein
MMAR_0174  -1      14       -2.667626  long-chain-fatty-acid--CoA ligase
MMAR_0175  -1      15       -3.602790  integral membrane protein YrbE6A
MMAR_0176  -2      15       -3.798822  integral membrane protein YrbE6B
MMAR_0177  -2      16       -3.808265  MCE-family protein Mce6A
MMAR_0178  0       15       -0.757972  MCE-family protein Mce6B
MMAR_0179  0       14       -0.405892  MCE-family protein Mce6C
MMAR_0180  0       13       0.436738   MCE-family protein Mce6D
MMAR_0181  1       13       1.685010   MCE family lipoprotein Mce6E
MMAR_0182  1       12       2.362117   MCE-family protein Mce6F
MMAR_0183  1       13       3.277078   PE-PGRS family protein
MMAR_0184  0       13       -0.025729  hypothetical protein
MMAR_0185  3       16       0.875265   PE family protein
MMAR_0186  2       19       -0.375125  PPE family protein
MMAR_0187  2       18       -2.787598  10 kDa culture filtrate antigen EsxB
MMAR_0188  3       18       -2.427139  6 kDa culture filtrate antigen EsxA
MMAR_0189  2       19       -2.671728  EsxA-like protein, EsxA_3
MMAR_0190  2       19       -2.583545  hypothetical protein
MMAR_0191  2       16       -2.734064  PPE family protein
MMAR_0193  2       17       -2.898713  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_0164  NUCEPIMERASE  58     1e-11    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 57.9 bits (140), Expect = 1e-11
Identities = 31/167 (18%), Positives = 68/167 (40%), Gaps = 20/167 (11%)

Query: 4 RILVTGATGYLGSTILEALVRAGERAT----------ILVQPGDPHVMSPELRSNVDVVR 53
+ LVTGA G++G + + L+ AG + + ++ +++ + +
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLA---QPGFQFHK 58

Query: 54 GDITDAQSVDEAMR--GIARVYHLAGIASPNSRLAN--QIWRTNVLGAYHVAQSAWRHGV 109
D+ D + + + RV+ + L N +N+ G ++ + + +
Sbjct: 59 IDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKI 118

Query: 110 QRLVHVSSTAAIGYPPNGVIADEDFDPRDSVLDNVYSATKRAGEQLV 156
Q L++ SS++ G + +D L Y+ATK+A E +
Sbjct: 119 QHLLYASSSSVYGLNRKMPFSTDDSVDHPVSL---YAATKKANELMA 162


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_0177  TONBPROTEIN  30     0.013    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 30.3 bits (68), Expect = 0.013
Identities = 25/104 (24%), Positives = 36/104 (34%), Gaps = 8/104 (7%)

Query: 370 EVLLPQNYQPPPNLAPPPGTVVGPDGNLVAVGPPLINPTPNLEDPNPPLPAGVTPAPPVP 429
++ P + +PP + PPP VV P+ + P +E P P P V
Sbjct: 48 TMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQ 107

Query: 430 GTANPDLLPAAPSSFGGNVGPVGSVQERAALSLITGEQATVATQ 473
D+ P S E A + +T AT AT
Sbjct: 108 EQPKRDVKPVES--------RPASPFENTAPARLTSSTATAATS 143


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_0178  BACINVASINC  28     0.045    Salmonella/Shigella invasin protein C signature.
		>BACINVASINC#Salmonella/Shigella invasin protein C signature.

Length = 409

Score = 28.3 bits (62), Expect = 0.045
Identities = 17/85 (20%), Positives = 37/85 (43%), Gaps = 5/85 (5%)

Query: 185 QTSTLTATFAGNDDALGNVIGSLDRVAGSLASQSAEFEHTIAQTRQMVSQFNSRRSELV- 243
T +T A G++I G +A S ++ T ++ Q +SQ N+R +
Sbjct: 309 NTMDMTRIDARKMQMTGDLIMKNSVTVGGIAGASRQYAATQERSEQQISQVNNRVASTAS 368

Query: 244 ----ESTGNMAAVVRQLGGILADVN 264
ES+ ++++++ + +N
Sbjct: 369 DEARESSRKSTSLIQEMLKTMESIN 393


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0183  cloacin  38     2e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 37.8 bits (87), Expect = 2e-04
Identities = 28/89 (31%), Positives = 34/89 (38%), Gaps = 1/89 (1%)

Query: 712 GIGGAGGNGGLLFSGGGV-GGAGGLGPSGGSGGTGGHGGWFGAAGTGGSGGFGFSIAGGA 770
G G G N G + G + GG GLG GG+ G G G G + G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 771 GGAGGDSTSGVGGGGGAGGDSTVTGGAGG 799
G GG+ SG G G G + A G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 37.0 bits (85), Expect = 3e-04
Identities = 37/111 (33%), Positives = 44/111 (39%), Gaps = 7/111 (6%)

Query: 128 NGVAGAPGTGQNGGAGGWLIGNGGAGGSG-----TPPGLGSAGGTGGTGGAAGLIGNGGA 182
N A + NGG G +G G + GSG P G GS G GG+ GNGG
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH--GNGGG 67

Query: 183 AGAGGASTSGVGGAGGAGGAGGWLFGAGGTGGAGGLTVGTTGGQGGAGGAG 233
G G + G + F A T GAGGL V + G A A
Sbjct: 68 NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 36.6 bits (84), Expect = 4e-04
Identities = 34/110 (30%), Positives = 41/110 (37%), Gaps = 7/110 (6%)

Query: 361 GSGGSGGSGGTSGFLGTGSGGAGGDGGNASLFGFGGAGGV-GGISGNDTGGVGGMGGTAG 419
G G + G+ TSG + G G G GG + G+ GG SG+ GG G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH--- 62

Query: 420 LLAGNGGMGGAGGEGFLTGGAGGKGGNAMFFGTAGNGGNGGYGPTFGIGG 469
GNGG G G G TGG + FG G G I
Sbjct: 63 ---GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109



Score = 35.5 bits (81), Expect = 9e-04
Identities = 34/104 (32%), Positives = 39/104 (37%), Gaps = 14/104 (13%)

Query: 681 GGQGGQGGDGGVIFGYGGAGGTGGVTPGGTGGIGGAGGNGGLLFSGGGVGGAGGLGPSGG 740
G G G I G G GG G+G GG SG GG G G GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 741 SGGTGGHGG--------------WFGAAGTGGSGGFGFSIAGGA 770
+G +GG G F A T G+GG SI+ GA
Sbjct: 68 NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 35.5 bits (81), Expect = 0.001
Identities = 27/102 (26%), Positives = 30/102 (29%)

Query: 607 GAGGAGGSGGTALLGTGGTGGAGGAGGILGGTGGQGGIGGFGETGAGDGGSGGIAGLFGS 666
G G G + G GG G G G + G G G G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 667 GNAGGAGGYANTSDGGQGGQGGDGGVIFGYGGAGGTGGVTPG 708
GN GG G S G V FG+ G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 35.1 bits (80), Expect = 0.001
Identities = 29/80 (36%), Positives = 33/80 (41%), Gaps = 7/80 (8%)

Query: 759 SGGFGFSIAGGAGGAGGDSTSGVGGGGGAGGDSTVTG-------GAGGDGGGALLLGNGG 811
SGG G GA G+ G G G GG S +G GG G G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 812 NGGNGGAVVTGGTPGGGGNG 831
+G GG +GG G GGN
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 34.3 bits (78), Expect = 0.002
Identities = 31/89 (34%), Positives = 33/89 (37%), Gaps = 3/89 (3%)

Query: 172 GAAGLIGNGGAAGAGGASTSGVGGAGGAGGAG---GWLFGAGGTGGAGGLTVGTTGGQGG 228
G G N GA G G G G GGA GW GG G + GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 229 AGGAGGLFGPGGTGGAGGTSFVDAGGAGG 257
G G GG+G G S V A A G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 33.5 bits (76), Expect = 0.003
Identities = 35/109 (32%), Positives = 40/109 (36%), Gaps = 7/109 (6%)

Query: 151 GAGGSGTPPGLGSAGGTGGTGGAAGLIGNGGAAGAGGASTSGVGGAGGAGGAGGWLFGAG 210
G G G G S G GG GL GGA+ G S+ GG+G W G+G
Sbjct: 3 GGDGRGHNTGAHSTSGNI-NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 211 GTGGAGGLTVGTTGGQGGAGGAGGLFGPGGTGGAGGTSFVDAGGAGGAG 259
G G G GG G GG A G + GAGG
Sbjct: 62 HGNGGGN------GNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 33.5 bits (76), Expect = 0.003
Identities = 34/104 (32%), Positives = 40/104 (38%), Gaps = 9/104 (8%)

Query: 702 TGGVTPGGTGGIGGAGGNGGLLFSGGGVGGAGGLGPSGGS------GGTGGHGGWFGAAG 755
+GG G G GN +G GVGG G S GG+G W G +G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 756 TGGSGGFGFSIAGGAGGAGGDSTSGVGGGGGAGGDSTVTGGAGG 799
G GG G G G G + S V G + T GAGG
Sbjct: 62 HGNGGGNG---NSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 33.5 bits (76), Expect = 0.003
Identities = 36/109 (33%), Positives = 41/109 (37%), Gaps = 7/109 (6%)

Query: 521 GGAGGSGEAGQAGGSGGAAGLLFGVGGAGGVGGSAGLVGDAGAGGSGGSGGLLWGNGGAG 580
GG G G SG G G+G GG +G + G G G+ WG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 581 GAGGLSIANTNGTGGVGGAGGDSGLLGAGGAGGSGGTALLGTGGTGGAG 629
G GG G G GG G G L A A + G L T G GG
Sbjct: 63 GNGG-------GNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 33.1 bits (75), Expect = 0.006
Identities = 31/99 (31%), Positives = 37/99 (37%)

Query: 346 GDGGAGGAGGTAGLLGSGGSGGSGGTSGFLGTGSGGAGGDGGNASLFGFGGAGGVGGISG 405
G G GA T+G + G +G G G+G G S G GG G +G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 406 NDTGGVGGMGGTAGLLAGNGGMGGAGGEGFLTGGAGGKG 444
G GG GT G L+ G T GAGG
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 31.6 bits (71), Expect = 0.016
Identities = 31/99 (31%), Positives = 40/99 (40%), Gaps = 4/99 (4%)

Query: 499 NGANAATGSGQDGGAGGWLLGDGGAGGSGEAGQAGGSGGAAGLLFGVGGAGGVGGSAGLV 558
N +T +GG G +G G + GSG + + GG +G GG G G G
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG-- 67

Query: 559 GDAGAGGSGGSGGLLWGNGGAGGAGGLSIANTNGTGGVG 597
GGSG G L A A G +T G GG+
Sbjct: 68 NGNSGGGSGTGGNL--SAVAAPVAFGFPALSTPGAGGLA 104



Score = 31.2 bits (70), Expect = 0.019
Identities = 23/74 (31%), Positives = 28/74 (37%)

Query: 300 GRGGDGGAGGLSNSVDAVGGAGGAGGDGSRLVGSGGAGGTGGTSLAGDGGAGGAGGTAGL 359
GRG + GA S +++ G GG S G G GG G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 360 LGSGGSGGSGGTSG 373
G+G SGG GT G
Sbjct: 66 GGNGNSGGGSGTGG 79



Score = 30.5 bits (68), Expect = 0.030
Identities = 32/116 (27%), Positives = 41/116 (35%), Gaps = 2/116 (1%)

Query: 519 GDGGAGGSGEAGQAGGSGGAAGLLFGVGGAGGVGGSAGLVGDAGAGGSGGSGGLLWGNGG 578
G G G + A G+ GVGG G + GGSG G+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 579 AGGAGGLSIANTNGTGGVGGAGGDSGLLG--AGGAGGSGGTALLGTGGTGGAGGAG 632
G G + +GTGG A G A G+GG A+ + G A A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 30.1 bits (67), Expect = 0.042
Identities = 29/99 (29%), Positives = 37/99 (37%)

Query: 272 GGGHGGLGGAGVTAGGDGGAGGDGGPLIGRGGDGGAGGLSNSVDAVGGAGGAGGDGSRLV 331
G G G GA T+G G G G G +N G+G G GS
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 332 GSGGAGGTGGTSLAGDGGAGGAGGTAGLLGSGGSGGSGG 370
GG G +GG S G + A A + + G+GG
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102


S.No  Start      End        Bias  Virulence  Insertion elements  Prediction
4     MMAR_0426  MMAR_0431  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias   Product
MMAR_0426  4       16       5.574944  RNA polymerase factor sigma-70
MMAR_0427  4       14       5.328520  lysophospholipase
MMAR_0428  5       14       5.510395  hypothetical protein
MMAR_0429  3       14       5.210244  hypothetical protein
MMAR_0430  3       14       4.994728  beta-glucosidase BglS
MMAR_0431  4       17       5.415143  PE-PGRS family protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0431  cloacin  39     1e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.5 bits (89), Expect = 1e-04
Identities = 34/85 (40%), Positives = 42/85 (49%), Gaps = 4/85 (4%)

Query: 432 GQGGSGGNSGAGGTNGSGGNGGLGGAGGDGGSGTADTGGGAGNGGSGGKGGTGGVSGVAG 491
G G G N+GA T+G+ GG G G G A G G + + GG+G G
Sbjct: 3 GGDGRGHNTGAHSTSGNIN----GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 492 GGGAGGTGGTGGTGGAGGTGGNGAA 516
G G G GG G +GG GTGGN +A
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSA 83



Score = 37.8 bits (87), Expect = 3e-04
Identities = 33/91 (36%), Positives = 37/91 (40%), Gaps = 5/91 (5%)

Query: 703 GAGGTGGTGGNSGTGGTSGNGGTGGTGGGGGYGGPGDYNENGYAGGSGGTGGTGGDPGTG 762
G G G G T G G TG GGG G G +EN G GG+G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWG-----GGSGSGIHWG 57

Query: 763 GTAAVGGDGGTGGIGGGGGYGGTGFDAAGAM 793
G + G GG G GGG G GG A +
Sbjct: 58 GGSGHGNGGGNGNSGGGSGTGGNLSAVAAPV 88



Score = 37.0 bits (85), Expect = 5e-04
Identities = 26/84 (30%), Positives = 36/84 (42%)

Query: 407 NGGTGGAGANAVAGTGDNGSDGAAGGQGGSGGNSGAGGTNGSGGNGGLGGAGGDGGSGTA 466
+GG G T N + G G G G + G+G ++ + GG G+G G G+
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 467 DTGGGAGNGGSGGKGGTGGVSGVA 490
GG GG G G +S VA
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVA 85



Score = 35.8 bits (82), Expect = 0.001
Identities = 30/108 (27%), Positives = 37/108 (34%)

Query: 1013 GGTGGIANGTGNGGTGGKGGTGGDGGTGGTATTAGQTGGDGGDGGKGGGGGTGGKANGGD 1072
GG G N + +G G G GG A+ + G G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1073 GGDGGDGGDGGAGGAGGNGDGAVGGIGGGGGTGGTGGTGTPPGGANGG 1120
G GG+G GG G GGN + G T G G + G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 34.7 bits (79), Expect = 0.002
Identities = 31/101 (30%), Positives = 40/101 (39%)

Query: 384 GGAAGVGGFGGSAGSYGNGGGGGNGGTGGAGANAVAGTGDNGSDGAAGGQGGSGGNSGAG 443
G G S NGG G G GGA + + +N G +G GG SG G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 444 GTNGSGGNGGLGGAGGDGGSGTADTGGGAGNGGSGGKGGTG 484
G+G +GG G GG+ + A G + G GG
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 34.7 bits (79), Expect = 0.002
Identities = 28/78 (35%), Positives = 36/78 (46%), Gaps = 1/78 (1%)

Query: 234 GDGGAGGAGGRAGLLGYGGAGGAGGLGGSGGAGLPNQQSGNGGGGGHGGAGGAAGWFGHA 293
G G GA +G + G G G G S G+G ++ + GGG G G G G+
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 294 GVGGDGGTG-GAGGNGQA 310
G G+ G G G GGN A
Sbjct: 66 GGNGNSGGGSGTGGNLSA 83



Score = 34.3 bits (78), Expect = 0.003
Identities = 25/80 (31%), Positives = 28/80 (35%)

Query: 1005 GTGGAGGFGGTGGIANGTGNGGTGGKGGTGGDGGTGGTATTAGQTGGDGGDGGKGGGGGT 1064
G G G T G NG G G G + G G + G +G GG G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 1065 GGKANGGDGGDGGDGGDGGA 1084
GG N G G G A
Sbjct: 66 GGNGNSGGGSGTGGNLSAVA 85



Score = 34.3 bits (78), Expect = 0.003
Identities = 38/121 (31%), Positives = 45/121 (37%), Gaps = 5/121 (4%)

Query: 300 GTGGAGGNGQAGQLSNDVGGDGGRGGAGGAAGAGGDAGLLGLNGAGGHGGGGGMGGTGGT 359
G G G N A S ++ G G GG A G GG G G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 360 GAAAAAGINAAAGGTGGDGGAAGAGGAAGVGGFGGSAGSYGNGGGGGNGGTGGAGANAVA 419
G G + GTGG+ A A A FG A S GG + GA + A+A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVA-----FGFPALSTPGAGGLAVSISAGALSAAIA 117

Query: 420 G 420

Sbjct: 118 D 118



Score = 33.9 bits (77), Expect = 0.004
Identities = 27/79 (34%), Positives = 28/79 (35%)

Query: 1051 GDGGDGGKGGGGGTGGKANGGDGGDGGDGGDGGAGGAGGNGDGAVGGIGGGGGTGGTGGT 1110
G G G G T G NGG G G GG G + GG G G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1111 GTPPGGANGGPGKTGADGL 1129
G G N G G L
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 33.9 bits (77), Expect = 0.004
Identities = 32/81 (39%), Positives = 37/81 (45%), Gaps = 1/81 (1%)

Query: 155 GGAGGSAGLIGNGGIGGQGGAGGIGGAGGSAAFFGNGGAGGHGGAGGAGGIGGNGGFFGN 214
GA ++G I G G G G G+G S+ GG G G G G GNGG GN
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGN 70

Query: 215 GGAG-GAGGNGSTGGVGLAGG 234
G G G GGN S +A G
Sbjct: 71 SGGGSGTGGNLSAVAAPVAFG 91



Score = 33.9 bits (77), Expect = 0.004
Identities = 25/77 (32%), Positives = 29/77 (37%)

Query: 772 GTGGIGGGGGYGGTGFDAAGAMGGIGGTGGTGGNPGPGGIGGNGGDGGTGGPGGYGDTGF 831
G G G G T + G G+G GG G G G G G +G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 832 GTAGYAGGTGGTGGTGG 848
G G G +GG GTGG
Sbjct: 63 GNGGGNGNSGGGSGTGG 79



Score = 33.5 bits (76), Expect = 0.005
Identities = 35/109 (32%), Positives = 39/109 (35%)

Query: 794 GGIGGTGGTGGNPGPGGIGGNGGDGGTGGPGGYGDTGFGTAGYAGGTGGTGGTGGAPGPG 853
GG G TG + G I G G GG G GG G+G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 854 GSAAGNGGGGGIGGIGGTGGSGATTAGFAGGDGGKGGTGGTGGSAGAGA 902
G+ GNG GG G GG + A F G GG S AGA
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 33.1 bits (75), Expect = 0.007
Identities = 29/87 (33%), Positives = 34/87 (39%), Gaps = 3/87 (3%)

Query: 454 LGGAGGDGGSGTADTGGGAGNGGSGGKGGTGGVSGVAGGGGAGGTGGTGGTGGAGGTGGN 513
+ G G G + A + G NGG G G GG S G G G G G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGAS---DGSGWSSENNPWGGGSGSGIHWG 57

Query: 514 GAAGKSDTGDIGGSGGYGGDGGNGGNS 540
G +G + G G SGG G GGN
Sbjct: 58 GGSGHGNGGGNGNSGGGSGTGGNLSAV 84



Score = 32.8 bits (74), Expect = 0.008
Identities = 30/89 (33%), Positives = 37/89 (41%), Gaps = 7/89 (7%)

Query: 966 GEGGFGGTGGNSYYAGNAGPGGQGGQGGAGANGAPGLA-------GGTGGAGGFGGTGGI 1018
G G G G +GN G G G GA+ G + GG+G +GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1019 ANGTGNGGTGGKGGTGGDGGTGGTATTAG 1047
NG GNG +GG GTGG+ G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 32.8 bits (74), Expect = 0.009
Identities = 26/77 (33%), Positives = 30/77 (38%), Gaps = 1/77 (1%)

Query: 839 GTGGTGGTGGAPGPGGSAAGNGGGGGIGGIGGTG-GSGATTAGFAGGDGGKGGTGGTGGS 897
G G G GA G+ G G G+GG G G + + GG G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 898 AGAGAANGSGGTGGQGG 914
G SGG G GG
Sbjct: 63 GNGGGNGNSGGGSGTGG 79



Score = 32.8 bits (74), Expect = 0.009
Identities = 33/83 (39%), Positives = 38/83 (45%), Gaps = 4/83 (4%)

Query: 653 GEGGTGGNSAAGGTSGDGTQVAWGRGGDGGDGGQGGYGGAGSFEQPGGTGGAGGTGGTGG 712
G G G N+ A TSG+ + G G G GG G S P G GG+G GG
Sbjct: 3 GGDGRGHNTGAHSTSGN---INGGPTGLGVGGGASDGSGWSSENNPWG-GGSGSGIHWGG 58

Query: 713 NSGTGGTSGNGGTGGTGGGGGYG 735
SG G GNG +GG G GG
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNL 81



Score = 32.8 bits (74), Expect = 0.009
Identities = 32/106 (30%), Positives = 39/106 (36%), Gaps = 6/106 (5%)

Query: 470 GGAGNGGSGGKGGTGGVSGVAGGGGAGGTGGTGGTGGAGGTGGNGAAGKSDTGDIGGSGG 529
GG G G + G T G + GG G GG G + N G S +G G G
Sbjct: 3 GGDGRGHNTGAHSTSG--NINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 530 YGGDGGNGGNSLPGGTNGAGGTGGTGGEGGGGGTGTPDTGSGGGDG 575
G+GG GNS G+G G G P + G G
Sbjct: 61 GHGNGGGNGNS----GGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 32.8 bits (74), Expect = 0.010
Identities = 27/79 (34%), Positives = 35/79 (44%)

Query: 501 TGGTGGAGGTGGNGAAGKSDTGDIGGSGGYGGDGGNGGNSLPGGTNGAGGTGGTGGEGGG 560
+GG G TG + +G + G G G G G+G +S G G+G G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 561 GGTGTPDTGSGGGDGGDGG 579
G G + SGGG G G
Sbjct: 62 HGNGGGNGNSGGGSGTGGN 80



Score = 32.8 bits (74), Expect = 0.010
Identities = 24/67 (35%), Positives = 31/67 (46%)

Query: 119 GANGTTVNGVGTPGGDGGILYGNGGNGGTSTNAATAGGAGGSAGLIGNGGIGGQGGAGGI 178
GA+ T+ N G P G G + G+G +S N GG+G G G G GG G
Sbjct: 12 GAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNS 71

Query: 179 GGAGGSA 185
GG G+
Sbjct: 72 GGGSGTG 78



Score = 32.4 bits (73), Expect = 0.011
Identities = 30/87 (34%), Positives = 36/87 (41%), Gaps = 2/87 (2%)

Query: 527 SGGYGGDGGNGGNSLPGGTNGAGGTGG-TGGEGGGGGTGTPDTGSGGGDGGDGGYGGYGG 585
SGG G G +S G NG G GG G G + + GGG G +GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 586 SGGLPGHGDGADGVAGTGGKGGAGGAA 612
G G + G +GTGG A A
Sbjct: 62 HGN-GGGNGNSGGGSGTGGNLSAVAAP 87



Score = 32.4 bits (73), Expect = 0.011
Identities = 25/82 (30%), Positives = 32/82 (39%)

Query: 582 GYGGSGGLPGHGDGADGVAGTGGKGGAGGAAGTGANATAGTDRYGGYGGDGGGGGGGGYG 641
G G G G + + G G GG A G+ ++ + +GG G G GGG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 642 GNSIRGTGGVGGEGGTGGNSAA 663
GN GG G G SA
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAV 84



Score = 32.4 bits (73), Expect = 0.012
Identities = 27/89 (30%), Positives = 33/89 (37%)

Query: 576 GDGGYGGYGGSGGLPGHGDGADGVAGTGGKGGAGGAAGTGANATAGTDRYGGYGGDGGGG 635
G G G G+ G+ +G G GG G + N G G + G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 636 GGGGYGGNSIRGTGGVGGEGGTGGNSAAG 664
G GG GNS G+G G A G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 32.4 bits (73), Expect = 0.012
Identities = 32/100 (32%), Positives = 40/100 (40%), Gaps = 1/100 (1%)

Query: 682 GDGGQGGYGGAGSFEQPGGTGGAGGTGGTGGNSGTG-GTSGNGGTGGTGGGGGYGGPGDY 740
G G+G GA S G G G G + G+G + N GG+G G +GG +
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 741 NENGYAGGSGGTGGTGGDPGTGGTAAVGGDGGTGGIGGGG 780
G G SGG GTGG+ G G GG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 32.4 bits (73), Expect = 0.012
Identities = 28/101 (27%), Positives = 35/101 (34%)

Query: 472 AGNGGSGGKGGTGGVSGVAGGGGAGGTGGTGGTGGAGGTGGNGAAGKSDTGDIGGSGGYG 531
+G G G G SG GG G G G + G+G + N G I GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 532 GDGGNGGNSLPGGTNGAGGTGGTGGEGGGGGTGTPDTGSGG 572
G G + GG+ G G G+GG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 31.6 bits (71), Expect = 0.018
Identities = 30/100 (30%), Positives = 38/100 (38%), Gaps = 1/100 (1%)

Query: 634 GGGGGGYGGNSIRGTGGV-GGEGGTGGNSAAGGTSGDGTQVAWGRGGDGGDGGQGGYGGA 692
GG G G+ + +G + GG G G A SG ++ GG G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 693 GSFEQPGGTGGAGGTGGTGGNSGTGGTSGNGGTGGTGGGG 732
G+ G +GG GTGG G G GG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 31.6 bits (71), Expect = 0.018
Identities = 33/104 (31%), Positives = 42/104 (40%), Gaps = 4/104 (3%)

Query: 205 IGGNGGFFGNGGAGGAGGNGSTGGVGLAGGDGGAGGAGGRAGLLGYGGAGGAG----GLG 260
+ G G N GA GN + G GL G G + G+G + +GG G+G G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 261 GSGGAGLPNQQSGNGGGGGHGGAGGAAGWFGHAGVGGDGGTGGA 304
G G G G G GG+ A A FG + G G A
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 30.8 bits (69), Expect = 0.032
Identities = 27/89 (30%), Positives = 32/89 (35%)

Query: 925 TNGTYEGMPGGVGGTGGAGGEGGLPGTGAGTAGSIGSAGNGGEGGFGGTGGNSYYAGNAG 984
T+G G P G+G GGA G G GS + G G G GG + +G
Sbjct: 16 TSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGS 75

Query: 985 PGGQGGQGGAGANGAPGLAGGTGGAGGFG 1013
G A A T GAGG
Sbjct: 76 GTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 30.8 bits (69), Expect = 0.033
Identities = 32/106 (30%), Positives = 38/106 (35%), Gaps = 4/106 (3%)

Query: 276 GGGGHGGAGGAAGWFGHAGVGGDGGTGGAGGNGQAGQLSNDVGGDGGRGGAGGAAGAGGD 335
GG G G GA G+ G G G G + +G S + GG G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 336 AGLLGLNGAGGHGGGGGMGGTGGTGAAAAAGINAAAGGTGGDGGAA 381
G +GG G GG + AA A T G GG A
Sbjct: 63 GNGGGNGNSGGGSGTGG----NLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 30.5 bits (68), Expect = 0.041
Identities = 25/85 (29%), Positives = 32/85 (37%), Gaps = 4/85 (4%)

Query: 350 GGGMGGTGGTGAAAAAGINAAAGGTGGDGGAAGAGGAAGV----GGFGGSAGSYGNGGGG 405
GG G + + IN G G GGA+ G + GG GS +G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 406 GNGGTGGAGANAVAGTGDNGSDGAA 430
GNGG G G+ + A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAP 87


5    MMAR_0473    MMAR_0480    Y    Y    N    Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_0473  -2      12       4.708992   methyltransferase
MMAR_0474  -2      12       4.584855   glycosyltransferase
MMAR_0475  -1      11       4.668531   transmembrane protein
MMAR_0476  -1      14       3.607735   hypothetical protein
MMAR_0477  3       17       3.587965   integral membrane acyltransferase
MMAR_0478  5       23       3.545814   PE-PGRS family protein
MMAR_0479  4       33       -2.594964  hypothetical protein
MMAR_0480  5       26       0.781690   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0478  cloacin  42     9e-06    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 41.6 bits (97), Expect = 9e-06
Identities = 29/81 (35%), Positives = 34/81 (41%)

Query: 379 GAGGHGGSGGSGGGTGGSGGSAGTLLGNGGAGGGGGSGSTTAGGGGSGGDAGALFGNGGA 438
G G G + G+ +G G L GGA G G S GG G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 439 GGAGGSGAATGGSGGTGGSSA 459
G GG+G + GGSG G SA
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSA 83



Score = 41.2 bits (96), Expect = 1e-05
Identities = 30/78 (38%), Positives = 31/78 (39%)

Query: 324 GAGGDGGNGGVFFGGGGDGGIGGHGITGSGGSGGSGGSGGFLPGGAGVAGLIGDGGAGGH 383
G G G N G G G G G S GSG S P G G I GG GH
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 384 GGSGGSGGGTGGSGGSAG 401
G GG+G GGSG
Sbjct: 63 GNGGGNGNSGGGSGTGGN 80



Score = 35.5 bits (81), Expect = 7e-04
Identities = 27/82 (32%), Positives = 33/82 (40%)

Query: 437 GAGGAGGSGAATGGSGGTGGSSALLFGTGGVGGVGGSGGDSGGTGGAGGQGGAMFGNGGA 496
G G G + A SG G L GG G ++ GG G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 497 GGGGGDGINVGGKGGAGGAAAT 518
G GGG+G + GG G G +A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAV 84



Score = 35.5 bits (81), Expect = 7e-04
Identities = 38/114 (33%), Positives = 48/114 (42%), Gaps = 3/114 (2%)

Query: 406 NGGAGGGGGSGSTTAGGGGSGGDAGALFGNGGAGGAGGSGAAT--GGSGGTGGSSALLFG 463
+GG G G +G+ + G +GG G G G + G+G S GG G+G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 464 TGGVGGVGGSGGDSGGTGGAGGQGGAM-FGNGGAGGGGGDGINVGGKGGAGGAA 516
G GG G SGG SG G + FG G G+ V GA AA
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 35.1 bits (80), Expect = 0.001
Identities = 39/123 (31%), Positives = 43/123 (34%), Gaps = 6/123 (4%)

Query: 319 GQVSGGAGGDGGNGGVFFGGGGDGGIGGHGITGSGGSGGSGGSGGFLPGGAGVAGLIGDG 378
G G G G GG G+GG GSG S + GG G G G G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 379 GAGGHGGSGGSGGGTGGSGGSAGTL------LGNGGAGGGGGSGSTTAGGGGSGGDAGAL 432
GG+G SGG G G A + L GAGG S S A AL
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMAAL 123

Query: 433 FGN 435
G
Sbjct: 124 KGP 126



Score = 34.7 bits (79), Expect = 0.001
Identities = 33/109 (30%), Positives = 43/109 (39%), Gaps = 6/109 (5%)

Query: 129 NGGDGGILYGNGGAGGSGAAGGNGGAGGAAGLVGNGGAGGAGGNNAAGGSGGAGGLLYGN 188
+GGDG + GG G G G + G+G + NN GG G+G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGG--ASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 189 GGAGGIGGTSAGSGGAGGAGGLLFGTGGAGGTGGFGGGLMTSGGAGGTG 237
G G GG GG+G G L FG +++ GAGG
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNL----SAVAAPVAFGFPALSTPGAGGLA 104



Score = 34.7 bits (79), Expect = 0.001
Identities = 27/80 (33%), Positives = 31/80 (38%)

Query: 498 GGGGDGINVGGKGGAGGAAATLFGSGGTGGDGGNAGAGVTFGGAGGEGGAAALLGGNGGS 557
GG G G N G +G G G GG +G GG G+ GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 558 GGHGGIGNVGGKGGAGAAGA 577
G GG GN GG G G +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLS 82



Score = 33.5 bits (76), Expect = 0.003
Identities = 33/109 (30%), Positives = 42/109 (38%), Gaps = 1/109 (0%)

Query: 248 GSGGAGGGSNAFTAGDGGAGGAGGLLFGTGGSGGDGGLAPGAMFGNGGHGGTGGAGGTLS 307
G G G + A + GG GL G G S G G + +G G G GG+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 308 GIGGHGGAGGLGQVSGGAGGDGGNGGVF-FGGGGDGGIGGHGITGSGGS 355
G GG G G G +GG F F G GG ++ S G+
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 33.1 bits (75), Expect = 0.004
Identities = 28/90 (31%), Positives = 34/90 (37%)

Query: 275 GTGGSGGDGGLAPGAMFGNGGHGGTGGAGGTLSGIGGHGGAGGLGQVSGGAGGDGGNGGV 334
G G G + G + NGG G G GG G G G SG GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 335 FFGGGGDGGIGGHGITGSGGSGGSGGSGGF 364
GGG GG G G+ + + + GF
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGF 92



Score = 32.8 bits (74), Expect = 0.005
Identities = 31/108 (28%), Positives = 43/108 (39%), Gaps = 7/108 (6%)

Query: 126 SGANGGDGGILYGNGGAGGSGAAGGNGGAGGAAGLVGNGGAGGAGGNNAAGGSGGAGGLL 185
+GA+ G I G G G G G + G+G ++ GG G+G + G G GG
Sbjct: 11 TGAHSTSGNINGGPTGLGVGG--GASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGG-- 66

Query: 186 YGNGGAGGIGGTSAGSGGAGGAGGLLFGTGGAGGTGGFGGGLMTSGGA 233
G GG+ G + A + FG G G + S GA
Sbjct: 67 ---GNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 32.4 bits (73), Expect = 0.006
Identities = 33/105 (31%), Positives = 41/105 (39%), Gaps = 10/105 (9%)

Query: 358 SGGSGGFLPGGAGVAGLIGDGGAGGHGGSGGSGGGTGGS------GGSAGTLLGNGGAGG 411
SGG G GA +GG G G GG+ G+G S GG +G+ + GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 412 GGGSGSTTAGGGGSGGDAGALFGNGGAGGAGGSGAATGGSGGTGG 456
G G G G SGG +G G + G GG
Sbjct: 62 HGNGG----GNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 32.4 bits (73), Expect = 0.007
Identities = 38/127 (29%), Positives = 47/127 (37%), Gaps = 11/127 (8%)

Query: 144 GSGAAGGNGGAGGAAGLVGNGGAGGAGGNNAAGGSGGAGGLLYGNGGAGGIGGTSAGSGG 203
G G N GA +G + G G G A+ GSG + N GG G+ GG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSE----NNPWGGGSGSGIHWGG 58

Query: 204 AGGAGGLLFGTGGAGGTGGFGGGLMTSGGAGGTGGAGGLLFGTGGSGGAGGGSNAFTAGD 263
G G GG G GGG T G F + GAGG + + +AG
Sbjct: 59 GSGHGN-------GGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111

Query: 264 GGAGGAG 270
A A
Sbjct: 112 LSAAIAD 118



Score = 32.0 bits (72), Expect = 0.009
Identities = 28/108 (25%), Positives = 33/108 (30%)

Query: 187 GNGGAGGIGGTSAGSGGAGGAGGLLFGTGGAGGTGGFGGGLMTSGGAGGTGGAGGLLFGT 246
G G G G + SG G L GGA G+ GG G+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 247 GGSGGAGGGSNAFTAGDGGAGGAGGLLFGTGGSGGDGGLAPGAMFGNG 294
G GG G G + A + FG G G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 31.2 bits (70), Expect = 0.013
Identities = 38/112 (33%), Positives = 43/112 (38%), Gaps = 12/112 (10%)

Query: 216 GAGGTGGFGGGLMTSGGAGGTGGAGGLLFGTGGSGGAGGGSNAFTAGDGGAGGAGGLLFG 275
G G G G TSG G G+ G G S G+G S G G G+G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGV--GGGASDGSGWSSENNPWGGG--SGSGIHWGG 58

Query: 276 TGGSGGDGGLAPGAMFGNGGHGGTGGAGGTLSGIGGHGGAGGLGQVSGGAGG 327
G G G GNG GG G GG LS + G + GAGG
Sbjct: 59 GSGHGNGG--------GNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 30.5 bits (68), Expect = 0.024
Identities = 28/77 (36%), Positives = 33/77 (42%), Gaps = 2/77 (2%)

Query: 589 GGNSLASLAGAGGAAGN--AGLIGSGGTGGAGGQSFSGTGGKGGDGGSALLIGDGGNGGN 646
GG+ GA +GN G G G GGA S + GGS I GG G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 647 GGTGATAGVGGTAGTGG 663
G G GG +GTGG
Sbjct: 63 GNGGGNGNSGGGSGTGG 79


6    MMAR_0584    MMAR_0617    Y    Y    Y    Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_0584  2       31       -4.457018  hypothetical protein
MMAR_0585  2       37       -5.909249  hypothetical protein
MMAR_0586  4       43       -8.583676  hypothetical protein
MMAR_0587  5       43       -9.498746  short-chain type dehydrogenase/reductase
MMAR_0588  3       37       -8.102317  hypothetical protein
MMAR_0589  1       25       -5.518717  site-specific recombinase PinR
MMAR_0590  1       22       -5.276607  hypothetical protein
MMAR_0593  0       17       -4.074059  hypothetical protein
MMAR_0594  -2      13       -0.666387  hypothetical protein
MMAR_0595  -1      13       0.388493   *transcriptional regulatory protein
MMAR_0596  -1      11       1.918410   integral membrane transport protein
MMAR_0597  -1      11       3.081584   hypothetical protein
MMAR_0598  -1      11       2.247662   pyrrolidone-carboxylate peptidase
MMAR_0599  -1      11       2.256198   hypothetical protein
MMAR_0600  -1      10       2.654398   deoxycytidine triphosphate deaminase
MMAR_0601  -1      8        2.210368   hypothetical protein
MMAR_0602  1       9        2.672406   hypothetical protein
MMAR_0603  5       13       3.834040   UDP-glucose dehydrogenase UdgA
MMAR_0604  10      16       7.088760   hypothetical protein
MMAR_0605  7       17       6.113603   hypothetical protein
MMAR_0606  5       15       5.024159   alpha-D-glucose-1-phosphate thymidylyl-
MMAR_0607  3       12       5.146849   PE-PGRS family protein
MMAR_0608  2       11       4.436240   PE-PGRS family protein
MMAR_0609  1       11       4.143337   PE-PGRS family protein
MMAR_0610  0       11       1.723664   aminotransferase AlaT
MMAR_0611  0       11       2.064040   iron-sulfur-binding reductase
MMAR_0612  -1      12       2.363823   transcriptional regulatory protein
MMAR_0613  2       14       2.253946   hypothetical protein
MMAR_0614  2       14       2.515169   membrane protein, IniB
MMAR_0615  0       11       0.954801   hypothetical protein
MMAR_0616  2       12       1.700957   isoniazid inductible protein IniC
MMAR_0617  3       13       1.529451   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_0587  DHBDHDRGNASE  112    4e-32    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 112 bits (280), Expect = 4e-32
Identities = 77/252 (30%), Positives = 118/252 (46%), Gaps = 10/252 (3%)

Query: 4 LEGKTALVTGGNSGIGLAAAQRLAAEGAHVFLTGRN---QATIDAAVASIGSRAHGIRAD 60
+EGK A +TG GIG A A+ LA++GAH+ N + +++ + A AD
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 61 VSNIEDLARVADAIAAAGRGLDVLFANAGGGEFVLLGDITIEHFTNGFMTNVAGTLFTVQ 120
V + + + I +D+L AG L+ ++ E + F N G +
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 121 TMLPLLNS--GASIVLTGSTAAYNGSPAFSVYAATKAAIRSFGRTWAAELVSRNIRVNTV 178
++ + SIV GS A + + YA++KAA F + EL NIR N V
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 179 VPGPVETPGLKGL-APEGQEQQLLEGEAAK----VPMGRLGKPAEIAAAVLFLASDQSSF 233
PG ET L A E +Q+++G +P+ +L KP++IA AVLFL S Q+
Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGH 245

Query: 234 MTGSEMFVDGGT 245
+T + VDGG
Sbjct: 246 ITMHNLCVDGGA 257


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0601  cloacin  49     4e-08    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 48.6 bits (115), Expect = 4e-08
Identities = 34/80 (42%), Positives = 35/80 (43%), Gaps = 5/80 (6%)

Query: 477 RGHFGGGAPGRSGPSGGGHGGQFGGGGHGGRFGGGGHGGGFGGGRGGGGHGGGFGGFGGG 536
RGH GA SG GG G GGG G +GGG G G H GG G G
Sbjct: 7 RGH-NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG----GSG 61

Query: 537 HGGGGHGGGFGGGHGGGFGG 556
HG GG G GGG G G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 45.1 bits (106), Expect = 5e-07
Identities = 33/91 (36%), Positives = 39/91 (42%), Gaps = 4/91 (4%)

Query: 464 GRGPAGGSLGPGRRGHFGGGAPGRSGPSGGGHGGQFGGGGHGGRFGGGGHGGGFGGGRGG 523
GRG G+ G+ GG P G GG G + GG G G +GGG G
Sbjct: 6 GRGHNTGAHSTS--GNINGG-PTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGG-SG 61

Query: 524 GGHGGGFGGFGGGHGGGGHGGGFGGGHGGGF 554
G+GGG G GGG G GG+ GF
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGF 92



Score = 44.7 bits (105), Expect = 7e-07
Identities = 30/79 (37%), Positives = 35/79 (44%), Gaps = 8/79 (10%)

Query: 456 GLGRIPNPGRGPAGGSL--GPGRRGHFGGGAPG-----RSGPSGGGHGGQFGGGGHGGRF 508
G GR N G G++ GP G GG + G + P GGG G GG G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH- 62

Query: 509 GGGGHGGGFGGGRGGGGHG 527
G GG G GGG G GG+
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 44.7 bits (105), Expect = 7e-07
Identities = 27/76 (35%), Positives = 29/76 (38%), Gaps = 3/76 (3%)

Query: 488 SGPSGGGHGG---QFGGGGHGGRFGGGGHGGGFGGGRGGGGHGGGFGGFGGGHGGGGHGG 544
SG G GH G +GG G G GG G + GG G G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 545 GFGGGHGGGFGGGHGG 560
GG G GGG G
Sbjct: 62 HGNGGGNGNSGGGSGT 77


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0602  PF03544  47     7e-08    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 46.9 bits (111), Expect = 7e-08
Identities = 29/146 (19%), Positives = 38/146 (26%), Gaps = 18/146 (12%)

Query: 313 GALAVSLAVSIRSEPGTRPDPGQSVVTHLPAPAHAAPAPQPQAPAPQAPAPQAQAPAPQA 372
GA+ L + + P P Q + + APA P Q P
Sbjct: 26 GAVVAGLLYTSVHQVIELPAPAQPISVTMVAPADLEPPQAVQPP---------------- 69

Query: 373 QVPAPAPAPQAPAPRAPAPQAPAPQAQVPAPKAPAPVPVPVAQAPVPVPQAPAPVPVPIP 432
P P P+ P P AP P P P PV + P P
Sbjct: 70 --PEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPAS 127

Query: 433 QAPAPAPEAPVPAPVPIPVPIQIPLP 458
AP P + +
Sbjct: 128 PFENTAPARPTSSTATAATSKPVTSV 153



Score = 39.6 bits (92), Expect = 1e-05
Identities = 24/107 (22%), Positives = 31/107 (28%), Gaps = 4/107 (3%)

Query: 380 APQAPAPRAP---APQAPAPQAQVPAPKAPAPVPVPVAQAPVPVPQAPAPVPVPIPQ-AP 435
+ PAP P APA A + P V P P+P+ P PV I + P
Sbjct: 40 VIELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKP 99

Query: 436 APAPEAPVPAPVPIPVPIQIPLPQIFGPGGGGGFPGGDDDRGGRGGG 482
P P+ V P P+ P
Sbjct: 100 KPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAAT 146



Score = 36.9 bits (85), Expect = 1e-04
Identities = 26/121 (21%), Positives = 30/121 (24%), Gaps = 5/121 (4%)

Query: 362 APQAQAPAPQAQVPAPAPAPQAPAPRAPAPQAPAPQAQVPAPKAPAPVPV---PVAQAPV 418
+ APA V APA P P P P + P P P PV
Sbjct: 40 VIELPAPAQPISVTMVAPADLEPPQAVQPP--PEPVVEPEPEPEPIPEPPKEAPVVIEKP 97

Query: 419 PVPQAPAPVPVPIPQAPAPAPEAPVPAPVPIPVPIQIPLPQIFGPGGGGGFPGGDDDRGG 478
P P PV + P + P P P G
Sbjct: 98 KPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVASGP 157

Query: 479 R 479
R
Sbjct: 158 R 158


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_0603  NUCEPIMERASE  30     0.025    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 29.8 bits (67), Expect = 0.025
Identities = 23/87 (26%), Positives = 37/87 (42%), Gaps = 17/87 (19%)

Query: 1 MRCTVFGT-GYLGATHAVGMAQLGHEVVGVDIDPGKVAKLAGGDIPFYEPGLRKLLRDNL 59
M+ V G G++G + + + GH+VVG+D L +Y+ L++ + L
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGID-------NLN----DYYDVSLKQARLELL 49

Query: 60 AAGRLHFTT----DYD-MAAEFADVHF 81
A F D + M FA HF
Sbjct: 50 AQPGFQFHKIDLADREGMTDLFASGHF 76


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0607  cloacin  39     4e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.9 bits (90), Expect = 4e-05
Identities = 35/114 (30%), Positives = 46/114 (40%), Gaps = 5/114 (4%)

Query: 365 GGNGGQGGTGGTLFGNGGGGGTGGAGFVGPSSAGDGGNGGGGGRAGLIGNGGAGGAGGAP 424
G N G T G + G G G GG G + + GGG +G+ GG+G
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGI---HWGGGSGHGN 64

Query: 425 GPNGGFSGGNGGNGGDAVLIGNGGNSGDVGLS--GAGTPGLPGNGGLLIGTIGN 476
G G SGG G GG+ + G LS GAG + + G L I +
Sbjct: 65 GGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 35.8 bits (82), Expect = 3e-04
Identities = 33/111 (29%), Positives = 42/111 (37%), Gaps = 4/111 (3%)

Query: 138 GDGGAGGSGGLEQQGGT--GGAAGLFGNGGAGGAGGASTVGTGAAGGAGGAGGLLWGQGG 195
G G G + G G GG GL GGA G S+ GG+G G+ WG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGS--GIHWGGGS 60

Query: 196 IGGTGGSGIDGGAGGAGGAGGALFGIGGAGGQGGIASTGLTGGGAGGDGGA 246
G GG + G G G + A G +++ G G GA
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 35.5 bits (81), Expect = 4e-04
Identities = 28/81 (34%), Positives = 35/81 (43%), Gaps = 8/81 (9%)

Query: 245 GAGGLFGNGGIGGTGGLASVGPGGVGGNGGAA--------GTLLGNGGGGGVGGFGATQG 296
G G N G T G + GP G+G GGA+ G G G G+ G +
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 297 GDGGDGGATGIFGGTGGAGGA 317
G+GG G +G GTGG A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSA 83



Score = 33.5 bits (76), Expect = 0.002
Identities = 34/101 (33%), Positives = 38/101 (37%), Gaps = 1/101 (0%)

Query: 336 GGDGRLFGNGGAGGVGGAAYTSNILMNATGGNGGQGGTGGTLFGNGGGGGTGGAGFVGPS 395
GGDGR N GA G + GG G GGG G+G G
Sbjct: 3 GGDGRGH-NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 396 SAGDGGNGGGGGRAGLIGNGGAGGAGGAPGPNGGFSGGNGG 436
GGNG GG +G GN A A A G + G GG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 30.8 bits (69), Expect = 0.013
Identities = 35/107 (32%), Positives = 42/107 (39%), Gaps = 14/107 (13%)

Query: 220 GIGGAGGQGGIASTGLTGGGAGGDGGAGGLFGNGGIGGT----GGLASVGPGGVGGNGGA 275
G G G +++G GG G G GG G GG + G GG+G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH- 62

Query: 276 AGTLLGNGGGGGVGGFGATQGGDGGDGGATGIFG----GTGGAGGAG 318
GNGGG G G G+ GG+ A FG T GAGG
Sbjct: 63 -----GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 30.5 bits (68), Expect = 0.015
Identities = 25/71 (35%), Positives = 31/71 (43%)

Query: 282 NGGGGGVGGFGATQGGDGGDGGATGIFGGTGGAGGAGGQSTNALADSVGGNGGQGGDGRL 341
+GG G GA +GG TG+ G G + G+G S N G+G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 342 FGNGGAGGVGG 352
GNGG G G
Sbjct: 62 HGNGGGNGNSG 72



Score = 29.7 bits (66), Expect = 0.032
Identities = 21/72 (29%), Positives = 29/72 (40%)

Query: 118 NGANGAAGTGANGGAAGWLLGDGGAGGSGGLEQQGGTGGAAGLFGNGGAGGAGGASTVGT 177
N + NGG G +G G + GSG + GG +G + G G G
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69

Query: 178 GAAGGAGGAGGL 189
+ GG+G G L
Sbjct: 70 NSGGGSGTGGNL 81



Score = 29.3 bits (65), Expect = 0.034
Identities = 37/132 (28%), Positives = 46/132 (34%), Gaps = 12/132 (9%)

Query: 186 AGGLLWGQGGIGGTGGSGIDGGAGGAGGAGGALFGIGGAGGQGGIASTGLTGGGAGGDGG 245
+GG G + I+GG G G GGA + G G + GGG+G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGA------SDGSGWSSENNPWGGGSGSGIH 55

Query: 246 AGGLFGNGGIGGTGGLASVGPGGVGGNGGAAGTLLGNG----GGGGVGGFGATQGGDGGD 301
GG G+G GG G S G G GGN A + G G GG +
Sbjct: 56 WGGGSGHGNGGGNGN--SGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALS 113

Query: 302 GGATGIFGGTGG 313
I G
Sbjct: 114 AAIADIMAALKG 125


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0608  cloacin  35     4e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 35.5 bits (81), Expect = 4e-04
Identities = 26/78 (33%), Positives = 31/78 (39%)

Query: 231 GAGGDGGVGGFGTFAGNGGDGGTGLFAAGGDGGAGGSGLTQGGDGGIGGAALGLFGAGGA 290
G G G G + +GN G TGL GG G GG G+ + G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 291 GGTGGEGGLFGGAGGMGG 308
G GG G GG+G G
Sbjct: 63 GNGGGNGNSGGGSGTGGN 80



Score = 35.5 bits (81), Expect = 5e-04
Identities = 34/111 (30%), Positives = 42/111 (37%), Gaps = 10/111 (9%)

Query: 364 GAGGAGGNAGFLYGAGGAGGAGGDSVGGADGGDGGNGGKAGLVGQGGDGGAGGNNNSGFG 423
G G G N GA G+ GG G G G G + GG + SG
Sbjct: 3 GGDGRGHN-------TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIH 55

Query: 424 TGGSGGKGGDAVLIGNGGNGGNAGSGGVGPGIAGAAGIGGLLIGEDGMAGL 474
GG G G GNG +GG +G+GG +A G + G GL
Sbjct: 56 WGGGSGHGNGG---GNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGL 103



Score = 34.7 bits (79), Expect = 8e-04
Identities = 34/115 (29%), Positives = 38/115 (33%), Gaps = 5/115 (4%)

Query: 258 AGGDGGAGGSGLTQGGDGGIGGAALGLFGAGGAGGTGGEGGLFGGAGGMGGSAGMLFGNG 317
+GGDG +G G I G GL GGA G GG GS G
Sbjct: 2 SGGDGRGHNTGAHSTS-GNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 318 GDGGAGAAATIGNAAGNGGAGGNAGMLIGAG----GAGGNGGFGFSASDGGAGGA 368
G G G G +G GG + G G GG S S G A
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 34.7 bits (79), Expect = 8e-04
Identities = 36/114 (31%), Positives = 45/114 (39%), Gaps = 5/114 (4%)

Query: 346 GAGGAGGNGGFGFSASDGGAGGAGGNAGFLYGAGGAGGAGGDSVGGADGGDGGNGGKAGL 405
G G G N G A GG G G G + G+G S GG G+G G
Sbjct: 3 GGDGRGHNTG----AHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 406 VGQGGDGGAGGNNNSGFGTGGSGGKGGDAVLIGNGGNGGNAGSGGVGPGIAGAA 459
G+GG GN+ G GTGG+ V G G+GG+ I+ A
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPAL-STPGAGGLAVSISAGA 111



Score = 33.5 bits (76), Expect = 0.002
Identities = 31/91 (34%), Positives = 33/91 (36%), Gaps = 5/91 (5%)

Query: 202 GRGGAGGAGGFGNNTTGGIGGLGGAGGLFGAGGDGGVGGFGTFAGNGGDGGTGLFAAGGD 261
GRG GA N GG GLG GG G G GG G+G+ GG
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGG-----ASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 262 GGAGGSGLTQGGDGGIGGAALGLFGAGGAGG 292
G G G G G G L A A G
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 33.5 bits (76), Expect = 0.002
Identities = 28/89 (31%), Positives = 36/89 (40%), Gaps = 8/89 (8%)

Query: 312 MLFGNGGDGGAGAAATIGNAAGNGGAGGNAGMLIGAGGAGGNGGFGFSASDGGAGGAGGN 371
M G+G GA +T GN G G G G + G G+S+ + GG G+
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLG--------VGGGASDGSGWSSENNPWGGGSGS 52

Query: 372 AGFLYGAGGAGGAGGDSVGGADGGDGGNG 400
G G G GG+ G G GGN
Sbjct: 53 GIHWGGGSGHGNGGGNGNSGGGSGTGGNL 81



Score = 30.8 bits (69), Expect = 0.012
Identities = 25/81 (30%), Positives = 34/81 (41%), Gaps = 1/81 (1%)

Query: 384 AGGDSVGGADGGDGGNGGKAGLVGQGGDGGAGGNNNSGFGTGGSGGKGGDAVLIGNGGNG 443
+GGD G G +G G G GG G ++ SG+ + + GG I GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGG-GASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 444 GNAGSGGVGPGIAGAAGIGGL 464
G+ GG G G+ G L
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNL 81



Score = 30.8 bits (69), Expect = 0.014
Identities = 28/81 (34%), Positives = 35/81 (43%), Gaps = 4/81 (4%)

Query: 140 GAGGSGTPGTASVAGGNGGNGGAGGLLFGTGGAGGAG-GTASSLVGAIPGGNGGYGGDGG 198
G G G A GN NGG GL G G + G+G + ++ G G +GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNI-NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 199 LLFGRGGAGGAGGFGNNTTGG 219
G GG G G G+ T G
Sbjct: 62 --HGNGGGNGNSGGGSGTGGN 80



Score = 30.5 bits (68), Expect = 0.015
Identities = 27/103 (26%), Positives = 34/103 (33%), Gaps = 2/103 (1%)

Query: 289 GAGGTGGEGGLFGGAGGMGGSAGMLFGNGGDGGAGAAATIGNAAGNGGAGGNAGMLIGAG 348
G G G G +G + G L GG ++ N G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 349 GAGGNGGFGFSASDGGAGGAGGNAGFLYG--AGGAGGAGGDSV 389
G GG G S G + A +G A GAGG +V
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAV 105



Score = 30.5 bits (68), Expect = 0.018
Identities = 26/82 (31%), Positives = 35/82 (42%), Gaps = 1/82 (1%)

Query: 246 GNGGDGGTGLFAAGGDGGAGGSGLTQGGDGGIG-GAALGLFGAGGAGGTGGEGGLFGGAG 304
G G + G + +GG G G+ G G G + +G G G GG G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 305 GMGGSAGMLFGNGGDGGAGAAA 326
G G++G G GG+ A AA
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAP 87



Score = 29.7 bits (66), Expect = 0.027
Identities = 32/109 (29%), Positives = 37/109 (33%), Gaps = 10/109 (9%)

Query: 116 GNGANGAPGTGADGGAAGWLIGNGGAGGSGTPGTASVAGGNGGNGGAGGLLFGTGGAGGA 175
G G N + + G G G S G +S GG G+G G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 176 GGTASSLVGAIPGGNGGYGGDGGLLFGRGGAGGAGGFGNNTTGGIGGLG 224
GG GN G G G A A GF +T G GGL
Sbjct: 66 GG----------NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 29.3 bits (65), Expect = 0.041
Identities = 33/109 (30%), Positives = 38/109 (34%), Gaps = 8/109 (7%)

Query: 170 GGAGGAGGTASSLVGAIPGGNGGYGGDGGLLFGRGGAGGAGGFGNNTTGGIGGLGGAGGL 229
G G A S G I GG G G GG A G+ + GG G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGG-------ASDGSGWSSENNPWGGGSGSGIHW 56

Query: 230 FGAGGDGGVGGFGTFAGNGGDGGTGLFAAGGDGGAGGSGLTQGGDGGIG 278
G G G GG G +G G G L A G L+ G GG+
Sbjct: 57 GGGSGHGNGGGNGN-SGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 28.9 bits (64), Expect = 0.050
Identities = 27/84 (32%), Positives = 31/84 (36%), Gaps = 7/84 (8%)

Query: 110 TGRPLIGNGANGAPGTGADGGAA----GWLIGNGGAGGSGTPGTASVAGGNGGNGGAGGL 165
TG NG P GG A GW N GG G G GNGG G
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG- 69

Query: 166 LFGTGGAGGAGGTASSLVGAIPGG 189
+GG G GG S++ + G
Sbjct: 70 --NSGGGSGTGGNLSAVAAPVAFG 91


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0609  cloacin  39     1e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.5 bits (89), Expect = 1e-04
Identities = 30/86 (34%), Positives = 37/86 (43%), Gaps = 3/86 (3%)

Query: 403 GAGAVAGASGAAGTIIAGNGGNGGAGGAGYAADGPAGPAIGNGGDGGRGGAGGFYGNGGA 462
G G GA +G I NGG G G G A+DG + N GG G + G G
Sbjct: 6 GRGHNTGAHSTSGNI---NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 463 GGAGGNSAPGGGNGGNGGTGGDSGAM 488
G GGN GGG+G G + +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPV 88



Score = 36.2 bits (83), Expect = 7e-04
Identities = 32/107 (29%), Positives = 43/107 (40%), Gaps = 4/107 (3%)

Query: 462 AGGAGGNSAPGGGNGGNGGTGGDSGAMGSSGGRGGDGGVGTNGGAGGGGGNATSYGTANA 521
+GG G G + GG +G G G G N GGG G+ +G +
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 522 TGGAGGDGGAGSRGTGGTGGAGGGGGAAQILNGASAATATGGAGGAG 568
G GG+G +G GG+G G A + A +T GAGG
Sbjct: 62 HGNGGGNGNSG----GGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 34.7 bits (79), Expect = 0.002
Identities = 32/108 (29%), Positives = 37/108 (34%), Gaps = 5/108 (4%)

Query: 321 SGGGGAGGNGGAGGAGGHGSALFGAAGANGNGGAGGAGGNPGAPGNGGIGGVGPDAATSG 380
SGG G G N GA G+ + G G G + P GG G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNP-WGGGSGSGIHWGGGS 60

Query: 381 GMGGTGGDPGAVGGGGNGGAAGGAGAVAGASGAAGTIIAGNGGNGGAG 428
G G GG+ G G G GG + A A G G GG
Sbjct: 61 GHGNGGGN----GNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 34.7 bits (79), Expect = 0.002
Identities = 28/87 (32%), Positives = 32/87 (36%)

Query: 222 MFGNGGAGGMGGAGADGAVGAAGTAGTSTSAGGVGGVGGDGGNAGNGGAGGNGGLFVGVG 281
M G G G GA + G G G G G N GG G+G + G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 282 GAGGQGGAGGAGGTGGAGGAGWDATAA 308
G G GG G +GG G GG A
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAP 87



Score = 34.3 bits (78), Expect = 0.003
Identities = 33/102 (32%), Positives = 41/102 (40%), Gaps = 2/102 (1%)

Query: 426 GAGGAGYAADGPAGPAIGNGGDGGRGGAGGFYGNGGAGGAGGNSAPGGGNGGNGGTGGDS 485
G G G+ + NGG G G GG + G+G + N+ GGG+G GG S
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGG--ASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 486 GAMGSSGGRGGDGGVGTNGGAGGGGGNATSYGTANATGGAGG 527
G G GG GT G A +T GAGG
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 34.3 bits (78), Expect = 0.003
Identities = 37/117 (31%), Positives = 43/117 (36%), Gaps = 2/117 (1%)

Query: 144 GNGGNGGSGAAGQAGGA--GGAAGLIGTGGAGGMGGAGGGAGGMGGSGGWLLGNGGAGGA 201
G G G + A G GG GL GGA G GG G + GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 202 GGVGGAGVSGGVGGTGGNAVMFGNGGAGGMGGAGADGAVGAAGTAGTSTSAGGVGGV 258
G GG G SGG GTGGN A G GA G A + + + +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADI 119



Score = 33.9 bits (77), Expect = 0.004
Identities = 33/105 (31%), Positives = 41/105 (39%), Gaps = 1/105 (0%)

Query: 864 ATGGAGGDGGSGGTGRGGTGGVGGVGINNGSGEAIGGAPGAGGTGAVGGDGGQGGAAYSY 923
+G G G G G + G G NN G G GG G GG G +
Sbjct: 16 TSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGS 75

Query: 924 GTGDATGSAGAAGTAGTTGVGGTGGAGGAAYTLNGASTATATGGI 968
GTG SA AA A T GAGG A +++ + + A I
Sbjct: 76 GTG-GNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADI 119



Score = 33.9 bits (77), Expect = 0.004
Identities = 23/83 (27%), Positives = 29/83 (34%)

Query: 259 GGDGGNAGNGGAGGNGGLFVGVGGAGGQGGAGGAGGTGGAGGAGWDATAAGVLAATGGDG 318
GGDG G +G + G G G GGA G + +G+ G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 319 GDSGGGGAGGNGGAGGAGGHGSA 341
G+ GG G G G G A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVA 85



Score = 32.8 bits (74), Expect = 0.007
Identities = 36/117 (30%), Positives = 42/117 (35%), Gaps = 8/117 (6%)

Query: 265 AGNGGAGGNGGLFVGVGGAGGQGGAGGAGGTGGAGGAGWDATAAGVLAATGGDGGDSGGG 324
+G G G N G G G G GG G + G+GW + GG SG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGG-GASDGSGWSSE-------NNPWGGGSGSG 53

Query: 325 GAGGNGGAGGAGGHGSALFGAAGANGNGGAGGAGGNPGAPGNGGIGGVGPDAATSGG 381
G G G GG G +G GN A A G P G G + S G
Sbjct: 54 IHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 32.8 bits (74), Expect = 0.009
Identities = 28/101 (27%), Positives = 35/101 (34%), Gaps = 1/101 (0%)

Query: 671 GNGGTGHGGGGGSGGTAINYGAGDAFGGAAGKGGTGVVGGNGGSGGAAYNYGTGNATGAD 730
G G GH G S IN G G G+G N GG + G G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGS-GSGIHWGGGSG 61

Query: 731 GAAGTDGTTGAGGSGGSGGAASVLNSASIATATSGSGGAGG 771
G GGSG G ++V + + GAGG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 32.0 bits (72), Expect = 0.014
Identities = 27/83 (32%), Positives = 33/83 (39%), Gaps = 2/83 (2%)

Query: 420 GNGGNGGAGGAGYAADGPAGPAIGNGGDGGRGGAGGFYGNGGAGGAGGNSAPGGGNGGNG 479
G G N GA +G GP G G G+G N GG G+ GG G+G
Sbjct: 6 GRGHNTGAHSTSGNING--GPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 480 GTGGDSGAMGSSGGRGGDGGVGT 502
GG+ + G SG G V
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAA 86



Score = 31.6 bits (71), Expect = 0.019
Identities = 36/117 (30%), Positives = 46/117 (39%), Gaps = 1/117 (0%)

Query: 644 AGGDGGRTTIDGAGSRATATGGTGGDGGNGGTGHGGGGGSGGTAINYGAGDAFGGAAGKG 703
+GGDG + GG G G GG G G S G+G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIH-WGGGS 60

Query: 704 GTGVVGGNGGSGGAAYNYGTGNATGADGAAGTDGTTGAGGSGGSGGAASVLNSASIA 760
G G GGNG SGG + G +A A A G + G G + ++ SA+IA
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIA 117



Score = 31.6 bits (71), Expect = 0.021
Identities = 26/83 (31%), Positives = 36/83 (43%)

Query: 713 GSGGAAYNYGTGNATGADGAAGTDGTTGAGGSGGSGGAASVLNSASIATATSGSGGAGGD 772
G G +N G + +G T G G S GSG ++ + + GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 773 GTDGGNGGSGGFAFTFGTGNIIA 795
G GGNG SGG + T G + +A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVA 85


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_0611  IGASERPTASE  58     2e-10    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 57.8 bits (139), Expect = 2e-10
Identities = 42/250 (16%), Positives = 68/250 (27%), Gaps = 13/250 (5%)

Query: 726 LDREKATLPEKGTAAKEAEKRAKAAPKAAAPAAPAPAPAEA-PAKAAEAPAAATAASPAA 784
+D T P A + + A AP P PA A P++ E A +
Sbjct: 992 VDTTNITTPNNIQADVPSV-PSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKT 1050

Query: 785 PAKGLGMAGGAKRPGAKKAAPAPAAETAAAEAPAAPAKGLGMAAGAKKPGAKKAAAPTGE 844
K + A A+ A + A +G++ +
Sbjct: 1051 VEKN------EQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETA 1104

Query: 845 TKPAEAAAPAAPAAPVKGLGMAS--GAKRPGAKKAAPPAAAAPEAAATASAPEAAA---A 899
T E A + + S K+ ++ P A A E T + E +
Sbjct: 1105 TVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNT 1164

Query: 900 PAEPAAPAAPVKGLGIATGAKRPGAKKAPARAEAPAAAAPAQPEPEATPEPEPASKQDGE 959
A+ PA + + E P PA +P E K
Sbjct: 1165 TADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHR 1224

Query: 960 PTPPAAPAAP 969
+ + P
Sbjct: 1225 RSVRSVPHNV 1234



Score = 40.8 bits (95), Expect = 3e-05
Identities = 34/209 (16%), Positives = 63/209 (30%), Gaps = 22/209 (10%)

Query: 698 TDGVNDRQEEAGRSGVEV----LDVAQVLLGSLDREKATLPEKGTAAKEAEKRAKAAPKA 753
T N + +S V+ +VAQ GS +E T K TA E E++AK +
Sbjct: 1061 TTAQNREVAKEAKSNVKANTQTNEVAQS--GSETKETQTTETKETATVEKEEKAKV--ET 1116

Query: 754 AAPAAPAPAPAEAPAKAAEAPAAATAASPAAPAKGLGMAGGAKRPGAKKAAPAPAAETAA 813
++ K ++ A PA + A A+ +
Sbjct: 1117 EKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETS 1176

Query: 814 AEAPAAPAKGLGMAAGAKKPGAKKAAAPTGETKPAEAAAPAAPAAPVKGLGMASGAKRPG 873
+ + + G + P T+P + +S +
Sbjct: 1177 SNVEQPVTESTTVNTGNSVVENPENTTPA-TTQPTVNSE-------------SSNKPKNR 1222

Query: 874 AKKAAPPAAAAPEAAATASAPEAAAAPAE 902
+++ E A T+S + A +
Sbjct: 1223 HRRSVRSVPHNVEPATTSSNDRSTVALCD 1251


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
MMAR_0614  FLAGELLIN  42     1e-05    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 41.6 bits (97), Expect = 1e-05
Identities = 26/286 (9%), Positives = 58/286 (20%), Gaps = 2/286 (0%)

Query: 566 IASQAGLAGQAGLAGQAGIASQAGIASQSGLAAGGSAGIASQAGLGIGGQAALGGQAGAA 625
+++Q G L+ + Q G + GL
Sbjct: 125 VSNQTQFNGVKVLSQDNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEATVGD 184

Query: 626 VGGGLAGVGNVSGLTGIGGNASLGATGQAGLIASEGAALNGAATPHVSGPLGGVGVGGQA 685
+ V + A + + + + +
Sbjct: 185 LKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAENN 244

Query: 686 GAAGGAGLGLGAGSRGGILSGDSTTLGGHPNPQPAALGAAGGTGIGAHSGAGGGMAAGLG 745
A + GG G + G ++ +
Sbjct: 245 TAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTTIN 304

Query: 746 GSAAGGAGVGL-GGSAAGGAGAEAAGGFGGGTHIGGQAGLGGSAAGGAGTELGGTAGSPG 804
G + G+A A + + + GQ + L +
Sbjct: 305 GEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAK-LSDLEANNA 363

Query: 805 AGMGAGVGGGTHAGGQVGLGGGSTAGGQAGLGGGSSAGGSVGHGDI 850
+ + G T G+ +++G S +
Sbjct: 364 VKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINED 409


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_0617  SHAPEPROTEIN  35     0.001    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 34.7 bits (80), Expect = 0.001
Identities = 20/86 (23%), Positives = 34/86 (39%), Gaps = 9/86 (10%)

Query: 260 IRFSRNEFEQLITQPLDRFIGSVEDMLQRSGVPRPSLAA------VAAVGGGAAIPLIGN 313
+ NE + + +PL + +V L++ P LA+ + GGGA + +
Sbjct: 249 FTLNSNEILEALQEPLTGIVSAVMVALEQC---PPELASDISERGMVLTGGGALLRNLDR 305

Query: 314 RLSERLQVPVFTTAQPIFSAAIGAAM 339
L E +PV P+ A G
Sbjct: 306 LLMEETGIPVVVAEDPLTCVARGGGK 331


7  MMAR_0627  MMAR_0651  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_0627  2       16       0.059796   L-asparagine permease AnsP1
MMAR_0628  1       18       0.914541   monooxygenase
MMAR_0629  1       17       1.673539   dehydrogenase
MMAR_0630  1       17       1.301791   transcriptional regulatory protein
MMAR_0631  2       16       1.101610   potassium-transporting ATPase subunit A
MMAR_0632  4       17       1.111892   potassium-transporting ATPase subunit B
MMAR_0633  4       14       1.157146   potassium-transporting ATPase subunit KdpC
MMAR_0634  3       14       1.046986   two-component system response phosphate sensor
MMAR_0635  3       15       0.070496   transcriptional regulatory protein KdpE
MMAR_0636  3       20       -1.954899  hypothetical protein
MMAR_0637  6       23       -2.985597  molecular chaperone DnaK
MMAR_0638  6       21       -2.990288  GrpE protein (Hsp-70 cofactor)
MMAR_0639  6       21       -3.004720  chaperone protein DnaJ
MMAR_0640  5       21       -2.821326  heat shock protein transcriptional repressor
MMAR_0641  5       21       -2.725905  PPE family protein
MMAR_0642  3       18       -2.244559  PPE family protein
MMAR_0643  0       11       0.492189   hypothetical protein
MMAR_0644  1       11       0.457309   monooxygenase
MMAR_0645  1       13       0.777365   endopeptidase ATP binding protein (chain B)
MMAR_0646  0       12       0.987622   enoyl-CoA hydratase, EchA8
MMAR_0647  -1      13       0.718480   hypothetical protein
MMAR_0648  0       13       0.665408   ketoacyl reductase
MMAR_0649  1       16       1.788799   orotate phosphoribosyltransferase
MMAR_0650  2       16       2.035782   hypothetical protein
MMAR_0651  2       15       1.896901   RNA methyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_0628  PHPHTRNFRASE  29     0.020    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 29.0 bits (65), Expect = 0.020
Identities = 14/59 (23%), Positives = 23/59 (38%), Gaps = 4/59 (6%)

Query: 196 EPLEEFRRKNDLVR-RHADAAQRDEAKIERAMAWTTPADADAYHR---EGVSLLTTEIQ 250
+ ++K + + + +D A +E A TP D D EG+ L TE
Sbjct: 243 KRAAFEKQKQEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTEFL 301


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_0629  DHBDHDRGNASE  113    1e-32    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 113 bits (284), Expect = 1e-32
Identities = 75/252 (29%), Positives = 111/252 (44%), Gaps = 12/252 (4%)

Query: 1 MAGLTALITGATSGIGRATAHGLAELGATVLVSGRDEARGREVVDEVTARGGRGIFLAAE 60
+ G A ITGA GIG A A LA GA + + + +VV + A A+
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 61 LRDVTGVRNLATAAADAGAGKVDILINSAGAFPFGPTADTSPDDFDAVFALNVRAPYFLV 120
+RD + + TA + G +DIL+N AG G S ++++A F++N +
Sbjct: 66 VRDSAAIDEI-TARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNAS 124

Query: 121 GALAPTMAKRGRGSIVNVTTMVAEFGAAGTGLYGASKAAIALLTKSWAAEYGPSGVRVNA 180
+++ M R GSIV V + A Y +SKAA + TK E +R N
Sbjct: 125 RSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184

Query: 181 VSPGPTRTE-----GTVEMGEA------LDQLAAAAPAGRPADPTEIASTIVYLASDAAS 229
VSPG T T+ E G L+ P + A P++IA +++L S A
Sbjct: 185 VSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAG 244

Query: 230 FIHGAVVPVDGG 241
I + VDGG
Sbjct: 245 HITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0630  HTHTETR  64     6e-15    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 64.3 bits (156), Expect = 6e-15
Identities = 28/158 (17%), Positives = 49/158 (31%), Gaps = 1/158 (0%)

Query: 12 VLSAARDEFRSHGYAATSVDSLAAATGLNRSSLYGSFGDKHRLFLRALDGYCEATLHDVR 71
+L A F G ++TS+ +A A G+ R ++Y F DK LF +
Sbjct: 16 ILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELEL 75

Query: 72 EVLRERGVSARQRLINHVHAIVNGIVADTDRRGC-MMSRSSAELAGADPDVSGIVERSLE 130
E + L + ++ V + RR + E G V
Sbjct: 76 EYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCL 135

Query: 131 AWRRELADCIAEAQLEGAVAGDGSPQALATVMLSLMQG 168
+ + + D + A +M + G
Sbjct: 136 ESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISG 173


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
MMAR_0635  HTHFIS  106    1e-28    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 106 bits (265), Expect = 1e-28
Identities = 36/118 (30%), Positives = 60/118 (50%), Gaps = 1/118 (0%)

Query: 2 TRVLVIDDEPQILRALRINFSVRGYDVVTAATGAAALRAAAEQRPDVVILDLGLPDMSGI 61
+LV DD+ I L S GYDV + A R A D+V+ D+ +PD +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 EVLAGLRGWFS-APVIVLSARSDSSDKVQALDAGADDYVTKPFGMDELLARLRAAVRR 118
++L ++ PV+V+SA++ ++A + GA DY+ KPF + EL+ + A+
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_0637  SHAPEPROTEIN  134    9e-37    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 134 bits (338), Expect = 9e-37
Identities = 74/368 (20%), Positives = 141/368 (38%), Gaps = 66/368 (17%)

Query: 2 ARAVGIDLGTTNSVVAVLEGGDP-----VVVANSEGSRTTPSVVAFARNGEVLVGQPAKN 56
+ + IDLGT N+++ V G VV + + + SV A VG AK
Sbjct: 10 SNDLSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGSPKSVAA--------VGHDAKQ 61

Query: 57 QAVTNVD--RTIRSVKRHMGGDWSIEIDDKKYTAPEISARVLMKLKRDAEAYLGEDIADA 114
IR +K + D+ + ++ + ++ ++
Sbjct: 62 MLGRTPGNIAAIRPMKDGVIADF--------FVTEKMLQHFIKQVHSNS---FMRPSPRV 110

Query: 115 VITVPAYFNDAQRQATKDAGQIAGLNVLRIVNEPTAAALAYGLDKGEKEQTILVFDLGGG 174
++ VP +R+A +++ Q AG + ++ EP AAA+ GL E + +V D+GGG
Sbjct: 111 LVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGS-MVVDIGGG 169

Query: 175 TFDVSLLEIGEGVVEVRATSGDNHLGGDDWDDRVVEWLVDKFKGTSGIDLTKDKMAMQRL 234
T +V+++ + V S +GGD +D+ ++ ++ + G
Sbjct: 170 TTEVAVISLNGVV-----YSSSVRIGGDRFDEAIINYVRRNYGSLIG------------- 211

Query: 235 REAAEKAKIELSSS----QSTSINLPYITVDAD--KNPLFLDEQLTRAEFQRITQDL--- 285
AE+ K E+ S+ + I + + + ++ A + +T +
Sbjct: 212 EATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVPRGFTLNSNEILEALQEPLTGIVSAV 271

Query: 286 ---LDRTRKPFQSVIADTGISVSDIDHVVLVGGSTRMPAVTELVKELTGGKEPNKGVNPD 342
L++ S I++ G+ VL GG + + L+ E T G +P
Sbjct: 272 MVALEQCPPELASDISERGM--------VLTGGGALLRNLDRLLMEET-GIPVVVAEDPL 322

Query: 343 EVVAVGAA 350
VA G
Sbjct: 323 TCVARGGG 330


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0641  cloacin  39     3e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.5 bits (89), Expect = 3e-04
Identities = 29/108 (26%), Positives = 43/108 (39%), Gaps = 6/108 (5%)

Query: 242 SGNIGNGNNGDGNLGGGNLGSYNLGWGNLGGANQGFGNAGSNNQGFANTGSNNQGFANTG 301
SG G G+N + GN+ G G GGA+ G G + NN +GS +G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 302 SFNVGFGNTGSNNIGIGLSGDG-----KIGFGSLNS-GSGNIGLFNSG 343
N G G G + GF +L++ G+G + + S
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109



Score = 33.1 bits (75), Expect = 0.016
Identities = 27/103 (26%), Positives = 35/103 (33%), Gaps = 11/103 (10%)

Query: 229 NVGTSNLGFGNIGSGNIGNGNNGDGNLGGGNLGSYNLGWGNLGGANQG-FGNAGSNNQGF 287
N G + GNI G G G G + G G S N WG G+ G +G N G
Sbjct: 10 NTGAHSTS-GNINGGPTGLGVGGGASDGSG-WSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 288 ANTGSNNQGFANTGSFN---VGFG-----NTGSNNIGIGLSGD 322
G S V FG G+ + + +S
Sbjct: 68 NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 33.1 bits (75), Expect = 0.017
Identities = 23/81 (28%), Positives = 29/81 (35%), Gaps = 2/81 (2%)

Query: 2024 NTGDGNIGFGNTGDGNIGIGLNGDGLRGFEALNSGTDNVGLFNSGTGNVGIGNSGTGNWG 2083
NTG + GN G G+G+ G G + G SG G G G
Sbjct: 10 NTGAHSTS-GNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGN 68

Query: 2084 IGNSGNYNTGVGNTGAANTGM 2104
GNSG + GN A +
Sbjct: 69 -GNSGGGSGTGGNLSAVAAPV 88



Score = 32.8 bits (74), Expect = 0.020
Identities = 27/104 (25%), Positives = 35/104 (33%), Gaps = 1/104 (0%)

Query: 729 GFGNTGNGNIGFGNTGNGNIGIGLNGDGLQGFGGWNSGSGNIGLFNSGTDNVGIGNSGTG 788
G G+ + GN G G+G+ G G G + + G SG G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 789 NSGIGNTGSYNTGIGNVGVANTGLFNIGNLNT-GIGNPGNYNSG 831
+ G TG VA F L+T G G S
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109



Score = 32.4 bits (73), Expect = 0.026
Identities = 24/87 (27%), Positives = 34/87 (39%), Gaps = 4/87 (4%)

Query: 754 GDGLQGFGGWNSGSGNIGLFNSGTDNVGIGNSGTGNSGIGNTGSYNTGIGNVGVANTGLF 813
GDG G +S SGNI N G +G+G + SG + + G G+ G
Sbjct: 4 GDGRGHNTGAHSTSGNI---NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 814 NIGNLNTGIGNPGNYNSGAHNVGSTNT 840
GN GN G + N+ +
Sbjct: 61 GHGNGGGN-GNSGGGSGTGGNLSAVAA 86



Score = 32.4 bits (73), Expect = 0.028
Identities = 25/79 (31%), Positives = 33/79 (41%), Gaps = 5/79 (6%)

Query: 1213 GDGLQGFGGWNSGSGNIGVFNSGTDNVGIGNSGTGNSGIGNSGSYNTGIGNSGLANTGLF 1272
GDG G +S SGNI N G +G+G + SG + + G SG+ G
Sbjct: 4 GDGRGHNTGAHSTSGNI---NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG-- 58

Query: 1273 NSGSFNTGIGNVGSYNTGT 1291
SG N G +GT
Sbjct: 59 GSGHGNGGGNGNSGGGSGT 77



Score = 32.0 bits (72), Expect = 0.031
Identities = 25/76 (32%), Positives = 33/76 (43%), Gaps = 3/76 (3%)

Query: 1181 NTGSNNIGIGNTGDGNIGFGNTGNDNTGIGLNGDGLQGFGGWNSGSGNIGVFNSGTDNVG 1240
NTG+++ GN G G G G + G G + + GG SG G SG N G
Sbjct: 10 NTGAHSTS-GNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG--GSGHGNGG 66

Query: 1241 IGNSGTGNSGIGNSGS 1256
+ G SG G + S
Sbjct: 67 GNGNSGGGSGTGGNLS 82



Score = 32.0 bits (72), Expect = 0.033
Identities = 22/77 (28%), Positives = 29/77 (37%), Gaps = 1/77 (1%)

Query: 1191 NTGDGNIGFGNTGNDNTGIGLNGDGLQGFGGWNSGSGNIGVFNSGTDNVGIGNSGTGNSG 1250
NTG + GN TG+G+ G G G + + G SG G G G
Sbjct: 10 NTGAHSTS-GNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGN 68

Query: 1251 IGNSGSYNTGIGNSGLA 1267
+ G TG S +A
Sbjct: 69 GNSGGGSGTGGNLSAVA 85



Score = 32.0 bits (72), Expect = 0.037
Identities = 25/76 (32%), Positives = 31/76 (40%), Gaps = 3/76 (3%)

Query: 722 NTGDNNIGFGNTGNGNIGFGNTGNGNIGIGLNGDGLQGFGGWNSGSGNIGLFNSGTDNVG 781
NTG ++ GN G G G G + G G + + GG SG G SG N G
Sbjct: 10 NTGAHSTS-GNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG--GSGHGNGG 66

Query: 782 IGNSGTGNSGIGNTGS 797
+ G SG G S
Sbjct: 67 GNGNSGGGSGTGGNLS 82


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0642  cloacin  34     0.009    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 34.3 bits (78), Expect = 0.009
Identities = 24/78 (30%), Positives = 30/78 (38%), Gaps = 2/78 (2%)

Query: 251 NTGNNNIGFGNTGDNNRGIGLTGTGQFGLGGLNSGSGNIGLFNSGTGNFGIGNSGTGNWG 310
NTG ++ GN G+G+ G G G + + G SG G G G
Sbjct: 10 NTGAHSTS-GNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGN 68

Query: 311 IGNSGNSYNTGIGNSGDA 328
GNSG TG S A
Sbjct: 69 -GNSGGGSGTGGNLSAVA 85



Score = 32.4 bits (73), Expect = 0.036
Identities = 29/89 (32%), Positives = 36/89 (40%), Gaps = 3/89 (3%)

Query: 250 GNTGNNNIGFGNTGDNNRGIGLTGTGQFGLGGLNSGSGNIGLFNSGTGNFGIGNSGTGNW 309
G+ +N G +T N G TG GG + GSG N G G G G
Sbjct: 4 GDGRGHNTGAHSTSGNING---GPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 310 GIGNSGNSYNTGIGNSGDANTGFFNAGVA 338
G GN G + N+G G+ N A VA
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVA 89


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
MMAR_0645  HTHFIS  43     4e-06    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 42.9 bits (101), Expect = 4e-06
Identities = 39/184 (21%), Positives = 67/184 (36%), Gaps = 30/184 (16%)

Query: 550 AGRMLEGETAKLLRMEDEL--GHRVIGQKKAVQAVSDAVRRSRAGVADPNRPTGSFMFLG 607
GR L + ++ED+ G ++G+ A+Q + + R + + M G
Sbjct: 115 IGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLAR----LMQTDLTL---MITG 167

Query: 608 PTGVGKTELAKALAEFLFDDERAMVRIDMSEYGEKHSVARLVGAPPGYIGYDQGGQLTEA 667
+G GK +A+AL ++ V I+M+ + L G + G T A
Sbjct: 168 ESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGH--------EKGAFTGA 219

Query: 668 VRRRPYTV-------ILFDEIEKAHPDVFDVLLQVLDEG---RLTDGQGRTVDFRNTILI 717
R + DEI D LL+VL +G + D R ++
Sbjct: 220 QTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVR---IV 276

Query: 718 LTSN 721
+N
Sbjct: 277 AATN 280


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0647  PF05616  30     0.008    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 30.5 bits (68), Expect = 0.008
Identities = 19/68 (27%), Positives = 27/68 (39%), Gaps = 1/68 (1%)

Query: 201 PLPQESPQEAEESEPAQSGNRSLTPSRRPELPPRRAQVDPAAGLLPDASRRTPEPMRREE 260
PLP+ SP E + PA + N P+ P+ P +P P +P R
Sbjct: 327 PLPEVSPAENPANNPAPNENPGTRPNPEPD-PDLNPDANPDTDGQPGTRPDSPAVPDRPN 385

Query: 261 GRSEGSRR 268
GR R+
Sbjct: 386 GRHRKERK 393


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_0648  DHBDHDRGNASE  69     5e-16    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 68.9 bits (168), Expect = 5e-16
Identities = 52/184 (28%), Positives = 89/184 (48%), Gaps = 4/184 (2%)

Query: 5 VALITGPTSGIGAGYARRYAQDGYDLILVARDVDRLKQLAVELEDDAGNVEILPADLADA 64
+A ITG GIG AR A G + V + ++L+++ L+ +A + E PAD+ D+
Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDS 69

Query: 65 AGRDKVAERLSR---GVRVLVNNAGFATSGEFWETEPAALQAQLDVNVTAVMQLTRAALP 121
A D++ R+ R + +LVN AG G +A VN T V +R+
Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSK 129

Query: 122 PMLAAGAGTVINIAS-VAGLLSGRGSTYSASKAWVISFSEGLSTGLEGTGVGVHAVCPGY 180
M+ +G+++ + S AG+ + Y++SKA + F++ L L + + V PG
Sbjct: 130 YMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGS 189

Query: 181 VHTE 184
T+
Sbjct: 190 TETD 193


8  MMAR_0750  MMAR_0806  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_0750  1       12       5.817382   MFS transporter
MMAR_0751  0       13       6.666199   hypothetical protein
MMAR_0752  0       15       6.598360   ATPase
MMAR_0753  2       18       6.861417   CDP-diacylglycerol--serine
MMAR_0754  0       23       6.156808   phosphatidylserine decarboxylase
MMAR_0755  0       23       6.138113   PE-PGRS family protein
MMAR_0756  0       20       1.569532   molybdopterin biosynthesis protein MoeA2
MMAR_0757  -1      20       0.950609   short chain dehydrogenase
MMAR_0758  0       20       1.086900   hypothetical protein
MMAR_0759  0       18       1.348979   chaperonin GroEL
MMAR_0760  0       13       1.949083   hypothetical protein
MMAR_0761  1       13       1.686199   PPE family protein
MMAR_0762  2       13       1.615807   D-amino acid aminohydrolase
MMAR_0763  3       14       1.631398   hypothetical protein
MMAR_0764  1       14       0.937220   hypothetical protein
MMAR_0765  2       15       0.814148   hypothetical protein
MMAR_0766  0       10       -1.573101  RNA polymerase sigma factor SigK
MMAR_0767  -1      10       -2.193445  transmembrane protein
MMAR_0768  -2      9        -2.377628  cyclopropane-fatty-acyl-phospholipid synthase
MMAR_0769  -2      11       -3.294516  hypothetical protein
MMAR_0770  -1      10       -4.111100  oxidoreductase
MMAR_0771  -1      12       -4.702163  transmembrane transport protein MmpL
MMAR_0772  -2      13       -4.731225  membrane protein MmpS4_2
MMAR_0773  -1      12       -3.932148  putative regulatory protein
MMAR_0774  0       10       -3.431456  hypothetical protein
MMAR_0775  -1      9        -3.092711  MmpL family transport protein
MMAR_0776  -1      10       -1.379807  hypothetical protein
MMAR_0777  -1      10       -1.680880  hypothetical protein
MMAR_0778  -1      9        -1.501768  enoyl-CoA hydratase
MMAR_0779  -1      8        -2.893861  peptidase
MMAR_0780  -1      9        -3.632566  aldehyde dehydrogenase, PutA
MMAR_0781  -1      11       -4.007416  hypothetical protein
MMAR_0782  -1      11       -4.563084  hypothetical protein
MMAR_0783  0       12       -4.592822  transmembrane protein
MMAR_0784  0       12       -4.162393  8-amino-7-oxononanoate synthase
MMAR_0785  0       8        -2.385831  dihydrolipoamide dehydrogenase
MMAR_0786  -1      11       -2.388706  hypothetical protein
MMAR_5564  0       12       -1.841766  hypothetical protein
MMAR_0787  0       11       -2.256838  PPE family protein
MMAR_0789  2       15       -2.821558  hypothetical protein
MMAR_0790  1       15       -2.808329  transcriptional regulatory protein
MMAR_0791  1       17       -3.411775  hypothetical protein
MMAR_0792  1       17       -3.116645  isocitrate lyase Icl
MMAR_0793  0       12       -3.190681  3-hydroxybutyryl-CoA dehydrogenase
MMAR_0794  1       12       -3.328205  mycolic acid synthase UmaA
MMAR_0795  1       16       -1.492275  transposase for ISMyma04
MMAR_0796  2       16       -1.834494  mycolic acid synthase PcaA
MMAR_0797  1       16       -0.492737  TetR family transcriptional regulator
MMAR_0798  1       14       0.617661   transmembrane protein
MMAR_0799  3       15       0.051592   transcriptional regulatory protein
MMAR_0800  0       11       0.844351   iron-regulated heparin binding hemagglutinin
MMAR_0801  10      16       7.927086   transmembrane protein
MMAR_0802  9       15       7.629894   hypothetical protein
MMAR_0803  9       15       7.320547   deoxyribose-phosphate aldolase
MMAR_0804  9       14       6.476559   hypothetical protein
MMAR_0805  10      13       6.928055   amidohydrolase
MMAR_0806  9       13       7.114527   PE-PGRS family protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0750  TCRTETA  32     0.004    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 32.1 bits (73), Expect = 0.004
Identities = 39/166 (23%), Positives = 59/166 (35%), Gaps = 16/166 (9%)

Query: 242 FAWLPVIVTSAGGSEALGGAMVATFSGVGFLATLVTPRLCARAANPFPIVAASALCFLVG 301
F W + G S A G + + + V RL R A ++A L+
Sbjct: 241 FHWDATTI---GISLAAFGILHSLAQA--MITGPVAARLGERRALMLGMIADGTGYILLA 295

Query: 302 FAGLLWAPLTAPIVWAIFLGLGPSTFPAAITLINLRSRTESGSAALSGFTQGMGYLLASP 361
FA W PI+ + L G PA +++ + E L G + L +
Sbjct: 296 FATRGWMAF--PIM--VLLASGGIGMPALQAMLSRQVDEER-QGQLQGSLAALTSLTSIV 350

Query: 362 GPLIFGILYNATGR------WELSFGFLLVPLIALLAGGYHACKPR 401
GPL+F +Y A+ W L+ L AL G + R
Sbjct: 351 GPLLFTAIYAASITTWNGWAWIAGAALYLLCLPALRRGLWSGAGQR 396


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0755  cloacin  37     4e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 37.0 bits (85), Expect = 4e-04
Identities = 33/95 (34%), Positives = 40/95 (42%), Gaps = 1/95 (1%)

Query: 431 TGGHGAAGAVAGGDGGRGGTGGGLAGSGGTGGNGAVGTVSGVGGDGGNAAGLFGDGGTGG 490
TG H +G + GG G G GG GSG + N G SG G G +G G+GG G
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG-HGNGGGNG 69

Query: 491 NGGLATAGGAGGDGGAGGKAALIGSGGNGGAGGSG 525
N G + G A A + GAGG
Sbjct: 70 NSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 36.2 bits (83), Expect = 7e-04
Identities = 37/136 (27%), Positives = 50/136 (36%)

Query: 802 GGAGGTGGGAATLFGAGGAGGAGAIGLDTGGAGGAGGSAGALSGTGGAGGAGGIGVGGGG 861
GG G A GG +G+ G + G+G S+ GG+G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 862 GGGGVGGAGGNAGVVYGDGGAGGAGGGGTVAAGAAGGSGGNAAMLFGNGGAGGTGAVGAA 921
G GG G G G+ A A A + G+GG A + + + AA
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMAA 122

Query: 922 VGGNGGTGGDGGGLSG 937
+ G G G L G
Sbjct: 123 LKGPFKFGLWGVALYG 138



Score = 35.5 bits (81), Expect = 0.001
Identities = 40/128 (31%), Positives = 51/128 (39%), Gaps = 6/128 (4%)

Query: 642 LSGGAGVGGTGGIGAVLGGAGGTGGMAGLFGAGGAGGEGGGGQSGGAGGAGGVGLFGAGG 701
+SGG G G G + G G G+ G G + G G ++ GG G G+ GG
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGV-GGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 702 NGGTGGFSLVTGGAGGAGGASLLSGNGGAGGAGGIGGTGAGGAGGDGGAAGAFSGNGGAG 761
+G G GG G +GG S GN A A G A G GG A + S +
Sbjct: 60 SGHGNG-----GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSA 114

Query: 762 GAGGIAPA 769
I A
Sbjct: 115 AIADIMAA 122



Score = 34.3 bits (78), Expect = 0.003
Identities = 32/119 (26%), Positives = 41/119 (34%), Gaps = 2/119 (1%)

Query: 724 LSGNGGAGGAGGIGGTGAGGAGGDGGAAGAFSGNGGAGGAGGIAPAGTEGGAGGAGGNAG 783
+SG G G G T GG G + G+G + P G GG+G G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWG--GGSGSGIHWGG 58

Query: 784 VFSGTGGAGGAGGAGQTVGGAGGTGGGAATLFGAGGAGGAGAIGLDTGGAGGAGGSAGA 842
G G G + G + A FG GA GL + GA +A A
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIA 117



Score = 34.3 bits (78), Expect = 0.003
Identities = 35/119 (29%), Positives = 44/119 (36%), Gaps = 2/119 (1%)

Query: 737 GGTGAGGAGGDGGAAGAFSGNGGAGGAGGIAPAGTEGGAGGAGGNAGVFSGTGGAGGAGG 796
GG G G G +G +G G GG A G+ G G SG+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGS--GWSSENNPWGGGSGSGIHWGGGS 60

Query: 797 AGQTVGGAGGTGGGAATLFGAGGAGGAGAIGLDTGGAGGAGGSAGALSGTGGAGGAGGI 855
GG G +GGG+ T A G GAGG A ++S + I
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADI 119



Score = 34.3 bits (78), Expect = 0.003
Identities = 39/109 (35%), Positives = 49/109 (44%), Gaps = 3/109 (2%)

Query: 115 GNGANGAAGTGASGGDGGILIGNGGAGGSGATGLTGGAGGNGGAAGLLAGTAGAGGSGGL 174
G G N A + + +GG G G S +G + GG +G +G GGSG
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSG--SGIHWGGGSGHG 63

Query: 175 GAAGAGGAGGQGGTGGLFSAGGAGGTGGVGASGGTGGAGGLGLFGAGGA 223
G G +GG GTGG SA A G A T GAGGL + + GA
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPAL-STPGAGGLAVSISAGA 111



Score = 34.3 bits (78), Expect = 0.003
Identities = 36/108 (33%), Positives = 44/108 (40%), Gaps = 2/108 (1%)

Query: 774 GAGGAGGNAGVFSGTGGA-GGAGGAGQTVGGAGGTGGGAA-TLFGAGGAGGAGAIGLDTG 831
G G G N G S +G GG G G G + G+G + +G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 832 GAGGAGGSAGALSGTGGAGGAGGIGVGGGGGGGGVGGAGGNAGVVYGD 879
G GG G++G SGTGG A V G GAGG A +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 33.9 bits (77), Expect = 0.004
Identities = 37/115 (32%), Positives = 50/115 (43%), Gaps = 10/115 (8%)

Query: 676 AGGEGGGGQSGGAGGAGGVGLFGAGGNGGTGGFSLVTGGAGGAGGASLLSGNGGAGGAG- 734
+GG+G G +G +G + NGG G + G + G+G +S + GG G+G
Sbjct: 2 SGGDGRGHNTGAHSTSGNI-------NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGI 54

Query: 735 GIGGTGAGGAGGDGGAAGAFSGNGGAGG--AGGIAPAGTEGGAGGAGGNAGVFSG 787
GG G GG G +G SG GG A +A GAGG A S
Sbjct: 55 HWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109



Score = 33.5 bits (76), Expect = 0.004
Identities = 35/110 (31%), Positives = 45/110 (40%), Gaps = 4/110 (3%)

Query: 905 MLFGNGGAGGTGAVGAAVGGNGGTGGDGGGLSGSGGAGGNGAGGGTGGTGGDGGRARGLL 964
M G+G TGA + NGG G G G S G+G + GG G G G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 965 GDGGTGGDGGFGGITSGDGGNGGTGALIGDG----GNGGAGGIGLGAAPG 1010
G G GG+G GG + G A + G GAGG+ + + G
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 32.8 bits (74), Expect = 0.007
Identities = 24/75 (32%), Positives = 31/75 (41%)

Query: 196 GAGGTGGVGASGGTGGAGGLGLFGAGGAGGAGGMANATVGGTGGAGGASLLFGNGGAGGL 255
G G G ++ G G GL GGA G ++ GG+G G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 256 GGGGATAGGAGGQGG 270
GG G + GG+G G
Sbjct: 66 GGNGNSGGGSGTGGN 80



Score = 32.4 bits (73), Expect = 0.010
Identities = 31/97 (31%), Positives = 36/97 (37%), Gaps = 1/97 (1%)

Query: 940 GAGGNGAGGGTGGTGGDGGRARGLLGDGGTGGDGGFGGITSGDGGNGGTGALIGDGGNGG 999
G G G G T G+ LG GG DG + G GG+G+ I GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWG-GGSGSGIHWGGGSG 61

Query: 1000 AGGIGLGAAPGGDGGKGGDAQLVGTGGNGGILGLGLP 1036
G G GG G GG+ V G L P
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTP 98



Score = 32.4 bits (73), Expect = 0.010
Identities = 29/84 (34%), Positives = 35/84 (41%)

Query: 212 AGGLGLFGAGGAGGAGGMANATVGGTGGAGGASLLFGNGGAGGLGGGGATAGGAGGQGGD 271
+GG G GA G N G G GGAS G GGG+ +G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 272 AGTFYGDGGVGGAGGAGGNIPGSS 295
G G+G GG G GGN+ +
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVA 85



Score = 32.4 bits (73), Expect = 0.012
Identities = 36/106 (33%), Positives = 42/106 (39%), Gaps = 8/106 (7%)

Query: 127 SGGDG----GILIGNGGAGGSGATGLTGGAGGNGGAAGLLAGTAGAGGSG-GLGAAGAGG 181
SGGDG G G TGL G G + G+ GGSG G+ G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 182 AGGQGGTGGLFSAGGAGGTGGVGASGGTGGAGGLGLFGAGGAGGAG 227
G GG G +GG GTGG ++ A G GAGG
Sbjct: 62 HGNGGGNGN---SGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.4 bits (73), Expect = 0.012
Identities = 33/106 (31%), Positives = 38/106 (35%), Gaps = 4/106 (3%)

Query: 865 GVGGAGGNAGVVYGDGGAGGAGGGGTVAAGAAGGSGGNAAMLFGNGGAGGTGAVGAAVGG 924
G G G N G G G G V GA+ GSG ++ N GG G GG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSE----NNPWGGGSGSGIHWGG 58

Query: 925 NGGTGGDGGGLSGSGGAGGNGAGGGTGGTGGDGGRARGLLGDGGTG 970
G G GG + GG+G G G A G GG
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.0 bits (72), Expect = 0.014
Identities = 32/105 (30%), Positives = 37/105 (35%)

Query: 397 GDGGAGGAGGIGGAAAGGKGGDGGDAATLFGSGGTGGHGAAGAVAGGDGGRGGTGGGLAG 456
G G GA G GG G G GSG + + G +G GG G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 457 SGGTGGNGAVGTVSGVGGDGGNAAGLFGDGGTGGNGGLATAGGAG 501
G G GT + A F T G GGLA + AG
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 31.6 bits (71), Expect = 0.021
Identities = 30/83 (36%), Positives = 33/83 (39%), Gaps = 4/83 (4%)

Query: 310 GDGGAGGTGGVATSAGGAGGAGGNAADLVGTGGVGGAGGTSFDAGGAGGIGGSAGALFGA 369
GDG TG +TS GG G L GG G S + GG GS G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTG----LGVGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 370 GGAGGAGGFGQVSGGAGGAGGNS 392
G G GG G GG+G G S
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLS 82



Score = 31.2 bits (70), Expect = 0.024
Identities = 34/104 (32%), Positives = 40/104 (38%), Gaps = 3/104 (2%)

Query: 618 AGGSGAVNGLGTGQAGGN--GGAAGLLSGGAGVGGTGGIGAVLGGAGGTGGMAGLFGAGG 675
+GG G + G GN GG GL GG G+G GG+G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 676 AGGEGGGGQSGGAGGAGGVGLFGAGGNGGTGGFSLVTGGAGGAG 719
G GGG + G G G L G +L T GAGG
Sbjct: 62 HGN-GGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 31.2 bits (70), Expect = 0.026
Identities = 32/127 (25%), Positives = 44/127 (34%), Gaps = 3/127 (2%)

Query: 287 AGGNIPGSSGGAGGAGGNAGLFHGDGGAGGTGGVATSAGGAGGAGGNAADLVGTGGVGGA 346
+GG+ G + GA GN +G G GG A+ G G+G G
Sbjct: 2 SGGDGRGHNTGAHSTSGN---INGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 347 GGTSFDAGGAGGIGGSAGALFGAGGAGGAGGFGQVSGGAGGAGGNSGMVYGDGGAGGAGG 406
G + GG G GG +G FG + GAGG + + +
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118

Query: 407 IGGAAAG 413
I A G
Sbjct: 119 IMAALKG 125



Score = 30.8 bits (69), Expect = 0.034
Identities = 35/119 (29%), Positives = 41/119 (34%), Gaps = 8/119 (6%)

Query: 457 SGGTGGNGAVGTVSGVGGDGGNAAGLFGDGGTGGNGGLATAGGAGGDGGAGGKAALIGSG 516
SGG G G S G G GL GG G ++ G G G GSG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 517 GNGGAGGSGTVDPGGNGGRGGDAQLFGTGGNGGNPGLGVPAGTAGEAGAPGLATSNQAL 575
G G + GG G GG+ G P L P AG ++ S AL
Sbjct: 62 HGNGGGNGNS---GGGSGTGGNLSAVAAPVAFGFPALSTPG-----AGGLAVSISAGAL 112



Score = 30.8 bits (69), Expect = 0.035
Identities = 34/104 (32%), Positives = 44/104 (42%), Gaps = 1/104 (0%)

Query: 355 GAGGIGGSAGALFGAGGA-GGAGGFGQVSGGAGGAGGNSGMVYGDGGAGGAGGIGGAAAG 413
G G G + GA +G GG G G G + G+G +S GG+G GG +
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 414 GKGGDGGDAATLFGSGGTGGHGAAGAVAGGDGGRGGTGGGLAGS 457
G GG G++ G+GG AA G GGLA S
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVS 106



Score = 30.5 bits (68), Expect = 0.039
Identities = 31/85 (36%), Positives = 37/85 (43%), Gaps = 7/85 (8%)

Query: 885 AGGGGTVAAGAAGGSGGNAAMLFGNGGAGGTGAVGAAVGGNG--GTGGDGGGLSGSGGAG 942
+GG G A + GN NGG G G G A G+G GG SGSG
Sbjct: 2 SGGDGRGHNTGAHSTSGNI-----NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHW 56

Query: 943 GNGAGGGTGGTGGDGGRARGLLGDG 967
G G+G G GG G+ G G G+
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGTGGNL 81


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_0757  DHBDHDRGNASE  57     1e-11    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 57.4 bits (138), Expect = 1e-11
Identities = 52/210 (24%), Positives = 88/210 (41%), Gaps = 22/210 (10%)

Query: 21 GRVVVVTGANTGLGYHTAEALAGRGAHVVLAVRNPEKGNAAVAQIVAAKPQADVTLQALD 80
G++ +TGA G+G A LA +GAH+ NPEK V+ + A A+ D
Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAF--PAD 65

Query: 81 LSSLDSVRSAADALRSAYPRIDLLINNAGV--MWTPKQVTKDGFEMQFGTNHLGHFALTG 138
+ ++ + ID+L+N AGV ++ + +E F N G F +
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 139 LLLDHLLPVPGSRVITV-SSLGHRIRAAIHFDDLQWERSYNRVAAYGQSKLANLLFTYEL 197
+ +++ ++TV S+ R ++ AAY SK A ++FT L
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSM--------------AAYASSKAAAVMFTKCL 171

Query: 198 QRRLAADSQAATIAVAAHPGGSNTELARNL 227
LA + I PG + T++ +L
Sbjct: 172 GLELAEYNIRCNI---VSPGSTETDMQWSL 198


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0759  PF06917  31     0.017    Periplasmic pectate lyase
		>PF06917#Periplasmic pectate lyase

Length = 555

Score = 30.7 bits (69), Expect = 0.017
Identities = 12/50 (24%), Positives = 25/50 (50%)

Query: 193 FDKGYISGYFVTDAERQEAVLEDPYILLVSSKVSTVKDLLPLLEKVIQGG 242
F + Y G FV A+ + +++P L + + ++ +D L + + I G
Sbjct: 477 FKRHYHRGLFVESAQHRYFRIDNPIALALLTLIAAKQDKLAAIPQFITNG 526


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0761  cloacin  34     0.001    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 34.3 bits (78), Expect = 0.001
Identities = 23/94 (24%), Positives = 35/94 (37%), Gaps = 7/94 (7%)

Query: 231 GSGNTGNSNVGLGNLGSGNVGFGNTGNGDFGFGLTGDHQFGFGGFNSGSGNVGIGNSGTG 290
G G+ ++ GN+ G G G G G G + ++ GG SG G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG-- 63

Query: 291 NVGFFNSGNGNMGIGNSGSLNSGLGNSGSMSTGF 324
N G G SG+ + + ++ GF
Sbjct: 64 -----NGGGNGNSGGGSGTGGNLSAVAAPVAFGF 92



Score = 30.5 bits (68), Expect = 0.016
Identities = 34/114 (29%), Positives = 54/114 (47%), Gaps = 7/114 (6%)

Query: 266 GDHQFGFGGFNSGSGNVGIGNSGTGNVGFFNSGNG--NMGIGNSGSLNSGLGNSGSMSTG 323
GD + G +S SGN+ G +G G G + G+G + G SG+ G G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 324 FGTASMSSGMWQSMHGSDMASSTSLA----SSATYATGGTA-TLSSGILSSALA 372
G + +SG G+ A + +A + +T GG A ++S+G LS+A+A
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIA 117



Score = 30.1 bits (67), Expect = 0.023
Identities = 24/83 (28%), Positives = 34/83 (40%)

Query: 197 TGNVGNNNVGNNNWGSGNTGSSNVGTGNTGSSNIGSGNTGNSNVGLGNLGSGNVGFGNTG 256
+G G + + SGN G G G ++ GSG + +N G GSG G +G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 257 NGDFGFGLTGDHQFGFGGFNSGS 279
+G+ G G GG S
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAV 84



Score = 29.7 bits (66), Expect = 0.028
Identities = 26/95 (27%), Positives = 36/95 (37%), Gaps = 3/95 (3%)

Query: 195 SGTGNVGNNNVGNNNWGSGNTGSSNVGTGNTGSSNIGSGNTGNSNVGLGNLGSGNVGFGN 254
SG N G +G S +G S+ G S G G S G GN G G
Sbjct: 17 SGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS--GHGNGGGNGNSGGG 74

Query: 255 TGNGDFGFGLTGDHQFGFGGFNS-GSGNVGIGNSG 288
+G G + FGF ++ G+G + + S
Sbjct: 75 SGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
MMAR_0762  UREASE  53     3e-09    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 52.8 bits (127), Expect = 3e-09
Identities = 30/100 (30%), Positives = 44/100 (44%), Gaps = 13/100 (13%)

Query: 4 DLLIRNGTIVDGLGGEPYVGDVAVRDGIIVAVGP---PD--DSVN--GDAAGRVIDASGL 56
D +I N I+D G D+ ++DG I A+G PD V VI G
Sbjct: 69 DTVITNALILDHWG--IVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGK 126

Query: 57 LVTPGFVDLHTHYDGQSIWSDRLTPSSAHGVTTVLMGNCG 96
+VT G +D H H+ I ++ + G+T +L G G
Sbjct: 127 IVTAGGMDSHIHF----ICPQQIEEALMSGLTCMLGGGTG 162


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_0771  ACRIFLAVINRP  46     8e-07    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 46.0 bits (109), Expect = 8e-07
Identities = 41/209 (19%), Positives = 78/209 (37%), Gaps = 13/209 (6%)

Query: 724 AQGIASIDQIRTAAEESLKGTPLEDAKIYVAGTASVFKDIS-EGADWDLLIAGISSLCLI 782
A + + I+ A L+ + K+ + F +S L A + L+
Sbjct: 297 ANALDTAKAIK-AKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIM----LV 351

Query: 783 FIIMLILTRAFVAAAVIVGTVALSLGASFGMSVLLWQHILNIDLHYMVLAMSVIVLLAVG 842
F++M + + A + V + L +F + I + + MVLA+ ++V A+
Sbjct: 352 FLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIV 411

Query: 843 SDYNLLLVSRFKEEIPAGLKTGIIRAMGGTGKVVTNAGLVFAFT---MASMAVSDLRVIG 899
N V R E K ++M + +V + MA S +
Sbjct: 412 VVEN---VERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYR 468

Query: 900 QVGTTIGLGLLFDTLIVRAFMTPSIAALL 928
Q TI + +++V +TP++ A L
Sbjct: 469 QFSITIVSAMAL-SVLVALILTPALCATL 496



Score = 44.4 bits (105), Expect = 2e-06
Identities = 35/225 (15%), Positives = 76/225 (33%), Gaps = 21/225 (9%)

Query: 208 VAVIFIMLLLVYRSVITVVVLLLTVGVELTAARGVVALLGHSGAIGLSTFAVSLLTSLAI 267
+ ++F+++ L +++ ++ + V V L ++A G+S L+ F + L AI
Sbjct: 348 IMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINT-LTMFGMVL----AI 402

Query: 268 AAGTDYGIFIFGRYQEARQAGEDKETAFYTMYRGTAHV---ILGSGLTIAGA---TFCLK 321
D I + + EDK + + + ++G + ++
Sbjct: 403 GLLVDDAIVVVENVERVMM--EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFG 460

Query: 322 FARMPYFETLGIPCAVGMLVAVMVALTLGPA----VLTVGSRFGLFDPKRLIKV--RGWR 375
+ + I M ++V+VAL L PA +L S + +
Sbjct: 461 GSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFD 520

Query: 376 RVGTVVVRWPLPVLAATLA--VALVGLLALPGYRTNYNDRDYLPN 418
+L +T + ++A +LP
Sbjct: 521 HSVNHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPE 565



Score = 39.4 bits (92), Expect = 9e-05
Identities = 46/222 (20%), Positives = 81/222 (36%), Gaps = 13/222 (5%)

Query: 144 YVQLNLAGNQGEPLANESVEAVRKIVK--ETPAPPGVTTYVTGAAALVSDMHSSGDKSMI 201
Y L QGE S +++ + P G+ TG + SG+++
Sbjct: 818 YNGLPSMEIQGEAAPGTSSGDAMALMENLASKLPAGIGYDWTG---MSYQERLSGNQAPA 874

Query: 202 KITVTTVAVIFIMLLLVYRSVITVVVLLLTVGVELTAARGVVALLGHSGAIGLSTFAVSL 261
+ ++ V+F+ L +Y S V ++L V L ++A + + F V L
Sbjct: 875 LVAIS-FVVVFLCLAALYESWSIPVSVMLV--VPLGIVGVLLAATLFNQKNDV-YFMVGL 930

Query: 262 LTSLAIAAGTDYGIFIFGRYQEARQAGEDKETAFYTMYRGTAHVILGSGLTIAGATFCLK 321
LT++ ++A I F + + G+ A R IL + L L
Sbjct: 931 LTTIGLSAKNAILIVEFAK-DLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLA 989

Query: 322 FARMPYFET---LGIPCAVGMLVAVMVALTLGPAVLTVGSRF 360
+ +GI GM+ A ++A+ P V R
Sbjct: 990 ISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRRC 1031


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0773  HTHTETR  65     7e-15    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 65.0 bits (158), Expect = 7e-15
Identities = 23/168 (13%), Positives = 57/168 (33%), Gaps = 4/168 (2%)

Query: 33 RTEENKRQRAAALVEAARSMASETGVASVTLTAVASRAGIHYSAVRRYFTSHKEVLLHLA 92
+T++ ++ +++ A + S+ GV+S +L +A AG+ A+ +F ++ +
Sbjct: 4 KTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIW 63

Query: 93 AEGWQRWSNTVCGELSQPGPMTESRVAAALADGLAAD---PLFCDLLANLHLHLEHEVEV 149
++ S + L L + L+ + E E+
Sbjct: 64 ELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEM 123

Query: 150 DRVVEVKRTSTAAVIAL-ADAIENALPALGRAGAFDVLLAAYSLGATL 196
V + +R +++ + A AA + +
Sbjct: 124 AVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYI 171


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_0775  ACRIFLAVINRP  56     6e-10    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 56.0 bits (135), Expect = 6e-10
Identities = 45/283 (15%), Positives = 93/283 (32%), Gaps = 54/283 (19%)

Query: 175 QGEAKANESVDAVRELVNDTPA--PPGVKAYVTGPAALIADQSTAGDASIQRV------- 225
A A ++ A++ + + P G+K D + SI V
Sbjct: 294 ATGANALDTAKAIKAKLAELQPFFPQGMKVLYP------YDTTPFVQLSIHEVVKTLFEA 347

Query: 226 TFITIGVIFVMLLSVYRSLITVISVLVMVGIELMAARGVVAFLADNNVIGLSTFAVNLLV 285
+ V+++ L ++ +LI I+V V++ ++ +++N L
Sbjct: 348 IMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFA-------------ILAAFGYSINTLT 394

Query: 286 LMAIAAGT----DYAIFVLGRYQEARGEGESREKAFYTMFHGTAHV---VLGSGLTIAGA 338
+ + D AI V+ + E + K + + ++G + ++
Sbjct: 395 MFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKE--ATEKSMSQIQGALVGIAMVLSAV 452

Query: 339 ---MYCLSFTRLPYFQTLGAPCAVGMLVAVLAALTLGPAVLVV-------------GSFF 382
M + ++ M ++VL AL L PA+ G FF
Sbjct: 453 FIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFF 512

Query: 383 KLFDPKRKMRTRGWRRVGTAIVRWPGPILAVSIAIALIGLLAL 425
F+ + I+ G L + I + G++ L
Sbjct: 513 GWFNTTFDHSVNHYTNSVGKILGSTGRYLLIYALI-VAGMVVL 554



Score = 45.2 bits (107), Expect = 2e-06
Identities = 32/159 (20%), Positives = 68/159 (42%), Gaps = 17/159 (10%)

Query: 800 AVSLILIIMLIITRSLVAAVVIVGTVLLSLGASFGLSVLVWQDIFGVELHWMVLAMSVIL 859
A+ L+ ++M + +++ A ++ V + L +F + FG ++ + + +
Sbjct: 347 AIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAA-----FGYSINTLTMFG---M 398

Query: 860 LLAVGS--DYNLLLV---SRLKEEIGAGLKTGIIRAMAGTGGVVTTAGLVFAAT---MAS 911
+LA+G D +++V R+ E K ++M+ G + +V +A MA
Sbjct: 399 VLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAF 458

Query: 912 FIFSDLRVIGQVGTTIGLGLLFDTLIVRSFMTPSIAALM 950
F S + Q TI + L+ TP++ A +
Sbjct: 459 FGGSTGAIYRQFSITIVSAMALSVLVALIL-TPALCATL 496



Score = 33.3 bits (76), Expect = 0.005
Identities = 47/216 (21%), Positives = 85/216 (39%), Gaps = 16/216 (7%)

Query: 175 QGEAKANESVDAVRELVNDTPA--PPGVKAYVTGPAALIADQSTAGDASIQRVTFITIGV 232
QGEA S L+ + + P G+ TG + ++ + A + I+ V
Sbjct: 827 QGEAAPGTSSGDAMALMENLASKLPAGIGYDWTGMSY--QERLSGNQAPA--LVAISFVV 882

Query: 233 IFVMLLSVYRSLITVISVLVMVGIELMAARGVVAFLADNNVIGLSTFAVNLLVLMAIAAG 292
+F+ L ++Y S I V VM+ + L ++A N + F V LL + ++A
Sbjct: 883 VFLCLAALYESWS--IPVSVMLVVPLGIVGVLLAATLFNQKNDV-YFMVGLLTTIGLSAK 939

Query: 293 TDYAIFVLGRYQEA-RGEGESREKAFYTMFHGTAHVVLGSGLTIAGAMYCLSFTRLP--- 348
AI ++ ++ EG+ +A +L + L + L+ +
Sbjct: 940 N--AILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSG 997

Query: 349 YFQTLGAPCAVGMLVAVLAALTLGPAV-LVVGSFFK 383
+G GM+ A L A+ P +V+ FK
Sbjct: 998 AQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRRCFK 1033


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
MMAR_0783  OMPADOMAIN  29     0.015    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 28.7 bits (64), Expect = 0.015
Identities = 11/20 (55%), Positives = 16/20 (80%)

Query: 163 TALSLALMSAPFASVAAAAP 182
TA+++A+ A FA+VA AAP
Sbjct: 4 TAIAIAVALAGFATVAQAAP 23


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_0787  TYPE3IMRPROT  29     0.024    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.

Length = 261

Score = 29.3 bits (66), Expect = 0.024
Identities = 19/105 (18%), Positives = 35/105 (33%)

Query: 197 LDIIEIILAILINLIAITFFLVFAIIAYTIIFAILLIPILIALAFSVVVFAFYIAIAIII 256
L + +NL V A+I+ I + +P + L ++++
Sbjct: 2 LQVTSEQWLSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPAN 61

Query: 257 VTPPLLAALLPVALVSSLIGIPIALATTLPIALPVGIGQYLADQT 301
P L +A+ LIGI + A G+ + Q
Sbjct: 62 DVPVFSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQM 106


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0797  HTHTETR  48     5e-09    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 48.1 bits (114), Expect = 5e-09
Identities = 34/183 (18%), Positives = 64/183 (34%), Gaps = 16/183 (8%)

Query: 17 RRWHQHKVERRNELVDGTIVAIRRHGRF-LSMDEIAAEIGVSKTVLYRYFVDKNDLTTAV 75
R+ Q E R ++D + + G S+ EIA GV++ +Y +F DK+DL + +
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 76 M---MRFAQTTLIPNMAAALSSNLDGFDLAREIIRVYVETVAAEPEPYRFVMANSSASKS 132
+ A L REI+ +E+ E + +
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVL---REILIHVLESTVTEERRRLLMEIIFHKCEF 119

Query: 133 ----KVIADSERIIA---RMLAVMLRRRMAEAGMDTGGVEP--WAYLIVGGVQLATHSWM 183
V+ ++R + + EA M + A ++ G + +W+
Sbjct: 120 VGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWL 179

Query: 184 SDP 186
P
Sbjct: 180 FAP 182


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0806  cloacin  40     1e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 39.7 bits (92), Expect = 1e-04
Identities = 32/80 (40%), Positives = 39/80 (48%), Gaps = 1/80 (1%)

Query: 1723 NGGDG-GHGNGSNIAGGNGGSGGDGGTGATGGAGGRGGAGGNGFNNGGDGGNGGNGGSGG 1781
+GGDG GH G++ GN G G G + G G + N GG G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1782 NGVVRGNGGSGGNAGNGGNG 1801
+G GNG SGG +G GGN
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 39.3 bits (91), Expect = 2e-04
Identities = 36/111 (32%), Positives = 45/111 (40%), Gaps = 1/111 (0%)

Query: 1562 LGGAGGAGGNGGVTAASNIVTGNGGNGGNGGNGGPGNSSTGGNSGNGGGGGLG-GYGATG 1620
+ G G G N G + S + G G GG G+ + N+ GGG G G +G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 1621 GTGGKGGNGGSGGIGGNGGNGGFGGGGTIRGDGGNGGNGGNGIGAGMDGGA 1671
G G GGNG SGG G GGN G G G+ + GA
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 38.2 bits (88), Expect = 4e-04
Identities = 36/103 (34%), Positives = 44/103 (42%), Gaps = 4/103 (3%)

Query: 1857 NGGNGNTGGTGGRGGNGGANGAVHGGDGGNGGKGGNGVIAGAGGDGGTGGAGGGFQATGG 1916
+GG+G TG +G NG G G G G+G + GG G+G + G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1917 SGGDGGAGGSGTTGGTGGRGGAGGAADGF----RATGGAGGTG 1955
G GG G SG GTGG A A F +T GAGG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 37.4 bits (86), Expect = 7e-04
Identities = 30/89 (33%), Positives = 38/89 (42%)

Query: 333 GGGGAGGTGGAGGAAGLIGHGGAGGGGGAGDGGSTGGAGQTGGDGGRAGDGGVGGRGGWL 392
GG G G GA +G I G G G G G +G + + GG +G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 393 AGAGGDGGAGGAGGVGGAGGGGADGLVLG 421
GG+G +GG G GG A + G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 35.5 bits (81), Expect = 0.003
Identities = 24/71 (33%), Positives = 29/71 (40%)

Query: 1545 GGAGGAGGNGGNGRNGGLGGAGGAGGNGGVTAASNIVTGNGGNGGNGGNGGPGNSSTGGN 1604
GA GN G G G G + G+G + + G+G GG G GN GN
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGN 70

Query: 1605 SGNGGGGGLGG 1615
SG G G G
Sbjct: 71 SGGGSGTGGNL 81



Score = 35.5 bits (81), Expect = 0.003
Identities = 29/79 (36%), Positives = 32/79 (40%)

Query: 1612 GLGGYGATGGTGGKGGNGGSGGIGGNGGNGGFGGGGTIRGDGGNGGNGGNGIGAGMDGGA 1671
G G G G GN G G G G G G + GG G+GI G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1672 GGKGGEGLTGGNGGNGGNG 1690
G GG G +GG G GGN
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 35.1 bits (80), Expect = 0.003
Identities = 25/83 (30%), Positives = 33/83 (39%)

Query: 1104 SAGGAGGYGGNGGALAGDGGDGGDGGNGGSGGDGGSGANGGAGDNGTTTSPNGGRGGDGG 1163
S G G+ + +G+ G G G G GSG + G + GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1164 HGGHGGDGGAGGHGGVAGKAQAA 1186
HG GG+G +GG G G A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAV 84



Score = 35.1 bits (80), Expect = 0.003
Identities = 34/102 (33%), Positives = 44/102 (43%), Gaps = 2/102 (1%)

Query: 1287 GAGGHGGDGGAFGDGGN--GGDGGTGGTGTAATTAGSPGAVGGHGGAGGNGGDGGYLNGN 1344
G G G + GA GN GG G G G A+ +G GG G+G G +G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1345 AGNGGSGGAGGTGGAGATGTSQAAATGLGGGRGGDGGAGGAG 1386
GG+G +GG G G ++ AA G GAGG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 35.1 bits (80), Expect = 0.003
Identities = 26/85 (30%), Positives = 31/85 (36%)

Query: 1670 GAGGKGGEGLTGGNGGNGGNGDNGVNGSNGGNGGKGGNAGNGTVVTGLGGNGGNGGDGGH 1729
G G+G GN G G+ G + G G ++ N G G GG GH
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1730 GNGSNIAGGNGGSGGDGGTGATGGA 1754
GNG GGSG G A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAP 87



Score = 35.1 bits (80), Expect = 0.004
Identities = 27/84 (32%), Positives = 35/84 (41%)

Query: 1347 NGGSGGAGGTGGAGATGTSQAAATGLGGGRGGDGGAGGAGGDGGNGGKAHATGFHNGTGG 1406
+GG G TG +G TGLG G G G+G + + GG + + G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1407 QGGDGGQGGKAGNGGNGADGQAAA 1430
G GG G G G G + A A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVA 85



Score = 35.1 bits (80), Expect = 0.004
Identities = 32/89 (35%), Positives = 40/89 (44%), Gaps = 1/89 (1%)

Query: 1700 GNGGKGGNAGNGTVVTGL-GGNGGNGGDGGHGNGSNIAGGNGGSGGDGGTGATGGAGGRG 1758
G G+G N G + + GG G G GG +GS + N GG G+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1759 GAGGNGFNNGGDGGNGGNGGSGGNGVVRG 1787
G GG N+GG G GGN + V G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 34.7 bits (79), Expect = 0.004
Identities = 38/111 (34%), Positives = 44/111 (39%), Gaps = 2/111 (1%)

Query: 877 GFAGGGGGAGGAGGAGGTQAGAGGAGGDGGAGGTGGFGGS-GANGGAGDHGTAANPNGGR 935
G G G G +G G G G GGA G+ GG G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 936 GGDGGSGAAGGAGGDGGNGGAGGQAQAAGY-ADGTRGAGGNGGVGGAGGLA 985
G GG+G +GG G GGN A A G+ A T GAGG AG L+
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALS 113



Score = 34.7 bits (79), Expect = 0.005
Identities = 33/94 (35%), Positives = 40/94 (42%), Gaps = 7/94 (7%)

Query: 431 GTGGAGGTGGAGGAGGLISFFGGQGAGGAGGAGGAGGLAGDGGVGATGTFAGGGSGTGGA 490
G G G GA G I+ GG G G GG + G + GGGSG+G
Sbjct: 3 GGDGRGHNTGAHSTSGNIN-------GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIH 55

Query: 491 GGDGGTPGVGGAGGAGGAGSIAGAHGSEGARPLS 524
G G G GG G G GS G + S A P++
Sbjct: 56 WGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVA 89



Score = 34.3 bits (78), Expect = 0.006
Identities = 21/71 (29%), Positives = 27/71 (38%)

Query: 1535 GGSGGTGDVTGGAGGAGGNGGNGRNGGLGGAGGAGGNGGVTAASNIVTGNGGNGGNGGNG 1594
G +G++ GG G G GG G G G + GNGG GN
Sbjct: 12 GAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNS 71

Query: 1595 GPGNSSTGGNS 1605
G G+ + G S
Sbjct: 72 GGGSGTGGNLS 82



Score = 33.9 bits (77), Expect = 0.008
Identities = 28/83 (33%), Positives = 36/83 (43%), Gaps = 3/83 (3%)

Query: 295 AGGQGMGLDGGAGGGGGQGGLIYGGGGDGGAGGVGGEVGGGGAGGTGGAGGAAGLIGHGG 354
+GG G G + GA G I GG G GG + G + GG+ I GG
Sbjct: 2 SGGDGRGHNTGAHSTSGN---INGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 355 AGGGGGAGDGGSTGGAGQTGGDG 377
G G G G++GG TGG+
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNL 81



Score = 33.9 bits (77), Expect = 0.008
Identities = 33/90 (36%), Positives = 43/90 (47%), Gaps = 4/90 (4%)

Query: 542 IAGDGGVGGNGGVFGNGGTGGAGGTGIAGQRGVSADTPGGSGTAGEDGGVGGNGGAGGLG 601
++G G G N G G G TG+ GV GSG + E+ GG G+G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGL----GVGGGASDGSGWSSENNPWGGGSGSGIHW 56

Query: 602 GALAGHGGDGGAGGTGGDGGAGGSGAAGTA 631
G +GHG GG G +GG G GG+ +A A
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 33.5 bits (76), Expect = 0.010
Identities = 32/102 (31%), Positives = 35/102 (34%)

Query: 260 GGGGDGGAGGVGALSGGVGGAAGRAWLWGAGGAGGAGGQGMGLDGGAGGGGGQGGLIYGG 319
GG G G G + SG + G + G G GG G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 320 GGDGGAGGVGGEVGGGGAGGTGGAGGAAGLIGHGGAGGGGGA 361
G GG G GG G GG A A G G GG A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 33.5 bits (76), Expect = 0.011
Identities = 26/81 (32%), Positives = 31/81 (38%), Gaps = 2/81 (2%)

Query: 1983 VAGSGGNGGNGGIGATGG--NAGRGGDGGSSTGFGGGGGMGGNGGTGGLGNAGYGGYGGD 2040
++G G G N G +T G N G G G G G N GG +G GG
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 2041 GGKGGDGGDTGGGVGRSGGTG 2061
G G G GG +GG
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNL 81



Score = 32.8 bits (74), Expect = 0.015
Identities = 27/80 (33%), Positives = 29/80 (36%)

Query: 706 TGGGAGGDGGHGGDTGTGGAGGSGGAGSSNGLSGLSGHSPTSGGDGGAGGDGGDSPATGG 765
+GG G T GG G G G S SG S + GG G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 766 RGGAGGNGGKYGNGGAGGAG 785
G GGNG G G GG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 32.8 bits (74), Expect = 0.018
Identities = 31/81 (38%), Positives = 37/81 (45%), Gaps = 2/81 (2%)

Query: 1067 GAYGNGGDGGDGGDGVNGTRGLTAITPGGSGTDGSAGSAGGA--GGYGGNGGALAGDGGD 1124
G G G + G N G T + GG +DGS S+ GG G+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1125 GGDGGNGGSGGDGGSGANGGA 1145
G GGNG SGG G+G N A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSA 83



Score = 32.4 bits (73), Expect = 0.021
Identities = 29/85 (34%), Positives = 32/85 (37%), Gaps = 2/85 (2%)

Query: 1970 NGGKGGGATDLESVAGSGGNGGNGGIGATGGNAGRGGDGGSSTGFGGGGGMGGNGGTGGL 2029
+GG G G NGG G+G GG G G SS GGG G GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGG--ASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 2030 GNAGYGGYGGDGGKGGDGGDTGGGV 2054
G GG G+ G G G V
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAV 84



Score = 32.4 bits (73), Expect = 0.022
Identities = 29/84 (34%), Positives = 34/84 (40%), Gaps = 5/84 (5%)

Query: 1956 GGGGNGTSTGGVGGNGGKGGGATDLESVAGSGGNGGNGGIGATGGNAGRGGDGGSSTGFG 2015
GG G G +TG +G GG T G G + G G + N GG GS +G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPT-----GLGVGGGASDGSGWSSENNPWGGGSGSGIHWG 57

Query: 2016 GGGGMGGNGGTGGLGNAGYGGYGG 2039
GG G G GG G G G
Sbjct: 58 GGSGHGNGGGNGNSGGGSGTGGNL 81



Score = 32.4 bits (73), Expect = 0.023
Identities = 32/108 (29%), Positives = 41/108 (37%), Gaps = 6/108 (5%)

Query: 1810 DGGDGGNGGNGGDGAIRGDGGAGGNGGNGVGTLNTVNPNGGDGGNGGNGGNGNTGGTGGR 1869
+ G GN G G G + G+G + N NP GG G+G + G G+ G GG
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSEN--NPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 1870 GGNGGANGAVHGGDGGNGGKGGNGVIAGAGGDGGTGGAGGGFQATGGS 1917
GN G G GGN V G G G + G+
Sbjct: 68 NGNSGG----GSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 32.4 bits (73), Expect = 0.024
Identities = 23/75 (30%), Positives = 29/75 (38%)

Query: 1946 RATGGAGGTGGGGGNGTSTGGVGGNGGKGGGATDLESVAGSGGNGGNGGIGATGGNAGRG 2005
R + G NG TG G G G E+ GG+G G G+ G
Sbjct: 7 RGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGG 66

Query: 2006 GDGGSSTGFGGGGGM 2020
G+G S G G GG +
Sbjct: 67 GNGNSGGGSGTGGNL 81



Score = 32.4 bits (73), Expect = 0.024
Identities = 30/85 (35%), Positives = 38/85 (44%), Gaps = 6/85 (7%)

Query: 1621 GTGGKGGNGGSGGIGGNGGNGGFGGGGTIRGDGGNGGNGGNGIGAGMDGGAGGKGGEGLT 1680
G G+G N G+ GN NGG G G G G + G+G + + GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNI-NGGPTGLG-----VGGGASDGSGWSSENNPWGGGSGSGIHW 56

Query: 1681 GGNGGNGGNGDNGVNGSNGGNGGKG 1705
GG G+G G NG +G G GG
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGTGGNL 81



Score = 32.4 bits (73), Expect = 0.024
Identities = 29/85 (34%), Positives = 34/85 (40%), Gaps = 3/85 (3%)

Query: 289 AGGAGGAGGQGMGLDGGAGGGGGQGGLIYGGGGDGGAGGVGGEVGGGGAGGTGGAGGAAG 348
+GG G G G GG G + GG DG GGG+G GG +G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 349 LIGHGGAGGGGGAGDGGSTGGAGQT 373
HG GG G +G G TGG
Sbjct: 62 ---HGNGGGNGNSGGGSGTGGNLSA 83



Score = 32.4 bits (73), Expect = 0.025
Identities = 35/110 (31%), Positives = 42/110 (38%), Gaps = 1/110 (0%)

Query: 835 GAGGDGGAGGAYGDGGDGGAGGAGGDGRNGVDATTAGASGTQGFAGGGGGAGGAGGAGGT 894
G G G GA+ G+ GG G G G + +G S GGG G+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNIN-GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 895 QAGAGGAGGDGGAGGTGGFGGSGANGGAGDHGTAANPNGGRGGDGGSGAA 944
GG G GG GTGG + A A + P G S A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 32.0 bits (72), Expect = 0.029
Identities = 27/83 (32%), Positives = 37/83 (44%), Gaps = 3/83 (3%)

Query: 947 AGGDGGNGGAGGQAQAAGYADGTRGAGGNGGVGGAGGLAGDGGRGGDGA---VGFGGAGG 1003
+GGDG G + + G G G GG G + + G G+ + +GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1004 DGGHGGNTGAGGAGGTGGVGSST 1026
G GGN +GG GTGG S+
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAV 84



Score = 32.0 bits (72), Expect = 0.032
Identities = 31/114 (27%), Positives = 41/114 (35%), Gaps = 4/114 (3%)

Query: 656 AGGAGGNGGAGGQALAAGYTDGVRGAGGSGGAGGAGGLAGAGGDGGDAYTTGGGAGGDGG 715
+GG G G + + G G G GGA G + G +G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 716 HGGDTGTGGAGGSGGAGSSNGLSGLSGHSPTSGGDGGAGGDGGDSPATGGRGGA 769
HG GG G+ G GS G + + +P + G G A GA
Sbjct: 62 HGN----GGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 31.6 bits (71), Expect = 0.037
Identities = 31/113 (27%), Positives = 39/113 (34%), Gaps = 5/113 (4%)

Query: 994 GAVGFGGAGGDGGHGGNTGAGGAGGTGGVGSSTGLTGLGGHSPTSGGDGGNGGNGGRGAV 1053
G G G G GN G G G G+S G ++P GG G GG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS-- 60

Query: 1054 DIAGGAGGTGGNGGAYGNGGDGGDGGDGVNGTRGLTAITPGGSGTDGSAGSAG 1106
G G G G+G G G A++ G+G + SAG
Sbjct: 61 ---GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 31.6 bits (71), Expect = 0.038
Identities = 26/80 (32%), Positives = 31/80 (38%)

Query: 1791 SGGNAGNGGNGVNGLFTNRDGGDGGNGGNGGDGAIRGDGGAGGNGGNGVGTLNTVNPNGG 1850
SGG+ G + N +GG G G GG G G G G+ G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1851 DGGNGGNGGNGNTGGTGGRG 1870
G GGNG +G GTGG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 31.6 bits (71), Expect = 0.043
Identities = 20/81 (24%), Positives = 26/81 (32%)

Query: 1897 GAGGDGGTGGAGGGFQATGGSGGDGGAGGSGTTGGTGGRGGAGGAADGFRATGGAGGTGG 1956
G G + G G G GG G+ + GG+ G GG+G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 1957 GGGNGTSTGGVGGNGGKGGGA 1977
GG + G G A
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAA 86



Score = 31.2 bits (70), Expect = 0.050
Identities = 33/100 (33%), Positives = 37/100 (37%), Gaps = 10/100 (10%)

Query: 1301 GGNGGDGGTGGTGTAATTAGSPGAVGGHGGAGGNGGDGGYLNGNAGNGGSGGAGGTGGAG 1360
GG+G TG T+ G P +G GGA G N G GSG G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1361 ATGTSQAAATGLGGGRGGDGGAGGAGGDGGNGGKAHATGF 1400
G GGG G GG G GG+ A GF
Sbjct: 63 ----------GNGGGNGNSGGGSGTGGNLSAVAAPVAFGF 92


9  MMAR_0827  MMAR_0851  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
MMAR_0827112-3.635356hypothetical protein
MMAR_082819-2.714191hypothetical protein
MMAR_082908-2.352411UDP-glucose 4-epimerase GalE2
MMAR_0830-110-2.384414hypothetical protein
MMAR_0831-19-1.907298cyclopropane-fatty-acyl-phospholipid synthase 2
MMAR_08320122.282299hypothetical protein
MMAR_08331121.769650phosphoserine phosphatase SerB1
MMAR_08343141.422933carbon monoxyde dehydrogenase small chain CoxS
MMAR_08353161.204189carbon monoxyde dehydrogenase medium chain CoxM
MMAR_0836212-0.994601carbon monoxyde dehydrogenase large chain CoxL
MMAR_0837215-0.982239PE-PGRS family protein
MMAR_0838011-3.546659hypothetical protein
MMAR_5582-111-2.623065macrophage infection protein, MimJ
MMAR_0839-210-1.214927MmpS-family membrane protein
MMAR_0840-210-0.541744transmembrane transport protein MmpL
MMAR_0841-1122.080530hypothetical protein
MMAR_0842-1122.214990glutamyl-tRNA reductase
MMAR_084312330.506390porphobilinogen deaminase
MMAR_084412330.296073uroporphyrin-III C-methyltransferase HemD
MMAR_084512340.083732delta-aminolevulinic acid dehydratase
MMAR_08461234-0.102079transmembrane protein
MMAR_08471233-0.051733transmembrane protein
MMAR_08511234-0.091569non-ribosomal peptide synthetase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_0829  NUCEPIMERASE  133    3e-38    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 133 bits (337), Expect = 3e-38
Identities = 33/151 (21%), Positives = 63/151 (41%), Gaps = 15/151 (9%)

Query: 27 VLVTGACRFLGGYLTARLAQNPLISSVIAVDAIAPSKDMLRRMGRAE--------FVRAD 78
LVTGA F+G +++ RL + V+ +D + D+ + R E F + D
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAG--HQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 79 IRN-PFIAKVIRNGDVDTVVHAAAASYAPRS-GGSAALKELNVMGAMQLFAACQKAPSVR 136
+ + + + +G + V + S A + N+ G + + C+ ++
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHN-KIQ 119

Query: 137 RVVLKSTSEVYGSSPHDPVVFTEDSSSRRPF 167
++ S+S VYG + P F+ D S P
Sbjct: 120 HLLYASSSSVYGLNRKMP--FSTDDSVDHPV 148


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0837  cloacin  39     4e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 39.3 bits (91), Expect = 4e-05
Identities = 35/102 (34%), Positives = 39/102 (38%), Gaps = 3/102 (2%)

Query: 468 GAGGAGANGGLLIGHGGAGGGGGTGGNGHGRPIGSGGAGGD---GGGGGTGGWLYGNGGH 524
G G G N G G GG G G G GSG + + GGG G+G G GH
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 525 GGTGATGGSGRHSGASGDGGDGGDAQAIGDGGAGGSGGAGGA 566
G G G SG SG G+ A G G G A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 37.0 bits (85), Expect = 2e-04
Identities = 33/102 (32%), Positives = 39/102 (38%)

Query: 161 GAAGLIGNGGAGGTGYSPTTGSGAVGGNGGAGGTGGWLYGSGGSGGIGGVGGSGTIGAPS 220
G G N GA T + G +G GGA GW + GG G G G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 221 GHGGSGGLGGGTGLFGQGGAGGNGGQGGGENFASTAGAGGPA 262
G+GG G GG G + G ST GAGG A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 36.6 bits (84), Expect = 3e-04
Identities = 24/85 (28%), Positives = 31/85 (36%)

Query: 180 TGSGAVGGNGGAGGTGGWLYGSGGSGGIGGVGGSGTIGAPSGHGGSGGLGGGTGLFGQGG 239
+G G N GA T G + G G+GG G+ + + GG G G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 240 AGGNGGQGGGENFASTAGAGGPAGT 264
G GG G + T G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 35.5 bits (81), Expect = 5e-04
Identities = 34/112 (30%), Positives = 42/112 (37%), Gaps = 10/112 (8%)

Query: 362 GGQGGGGHLGGAFAGGGGDGGAGAVGGTGGWLLGDGGAGGNGGDGGDGGAASGFAGDGGR 421
GG G G + G G +GG +G GG G G + N GG G+ + G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 422 GGQGAVGGASGWLLGNGGAGGDGGAGGEGGDSSGGAASAGGDGGAGGAGGAG 473
G G GNG +GG G GG + A GAGG
Sbjct: 63 GNGG----------GNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 34.7 bits (79), Expect = 0.001
Identities = 36/105 (34%), Positives = 41/105 (39%), Gaps = 5/105 (4%)

Query: 297 GVGGSGGSGGATGLLGNGGAGGTGGQGGDGGRGFDGSLGAGGAGGTGAVGGSGGWLVGDG 356
G G G + GA GN G TG G G S G+G + GG G + G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGA-----SDGSGWSSENNPWGGGSGSGIHWG 57

Query: 357 GTGGDGGQGGGGHLGGAFAGGGGDGGAGAVGGTGGWLLGDGGAGG 401
G G G GG G+ GG GG A G L GAGG
Sbjct: 58 GGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 34.7 bits (79), Expect = 0.001
Identities = 29/80 (36%), Positives = 36/80 (45%)

Query: 416 AGDGGRGGQGAVGGASGWLLGNGGAGGDGGAGGEGGDSSGGAASAGGDGGAGGAGGAGAN 475
+G GRG SG + G G GG +G S GG G+G G G+
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 476 GGLLIGHGGAGGGGGTGGNG 495
G G+G +GGG GTGGN
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 34.7 bits (79), Expect = 0.001
Identities = 23/71 (32%), Positives = 27/71 (38%)

Query: 501 GSGGAGGDGGGGGTGGWLYGNGGHGGTGATGGSGRHSGASGDGGDGGDAQAIGDGGAGGS 560
G G G + G T G + G G G G + + GG I GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 561 GGAGGAGGSGG 571
G GG G SGG
Sbjct: 63 GNGGGNGNSGG 73



Score = 34.3 bits (78), Expect = 0.001
Identities = 32/91 (35%), Positives = 40/91 (43%), Gaps = 6/91 (6%)

Query: 440 AGGDGGAGGEGGDSSGGAASAGGDGGAGGAGGAGANGGLLIGHGGAGGGGGTGGNGHGRP 499
+GGDG G S+ G + GG G G GGA G + GGG G+G + G
Sbjct: 2 SGGDGRGHNTGAHSTSGNIN-GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG-- 58

Query: 500 IGSGGAGGDGGGGGTGGWLYGNGGHGGTGAT 530
G G+GGG G G G GG+ A
Sbjct: 59 ---GSGHGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 33.1 bits (75), Expect = 0.003
Identities = 27/81 (33%), Positives = 30/81 (37%), Gaps = 2/81 (2%)

Query: 395 GDGGAGGNGGDGGDGGAASGFAGDGGRGGQGAVGGASGWLLGNGGAGGDGGAGGEGGDSS 454
G G G N G G +G G G G G SGW N GG G+G G S
Sbjct: 3 GGDGRGHNTGAHSTSGNING--GPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 455 GGAASAGGDGGAGGAGGAGAN 475
G G GG+G G
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNL 81



Score = 32.8 bits (74), Expect = 0.005
Identities = 24/80 (30%), Positives = 35/80 (43%)

Query: 491 TGGNGHGRPIGSGGAGGDGGGGGTGGWLYGNGGHGGTGATGGSGRHSGASGDGGDGGDAQ 550
+GG+G G G+ G+ GG TG + G G ++ + G+ GG +
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 551 AIGDGGAGGSGGAGGAGGSG 570
GG G SGG G GG+
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 32.4 bits (73), Expect = 0.006
Identities = 23/78 (29%), Positives = 28/78 (35%)

Query: 436 GNGGAGGDGGAGGEGGDSSGGAASAGGDGGAGGAGGAGANGGLLIGHGGAGGGGGTGGNG 495
G G G + GA G+ +GG G GGA G + G G+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 496 HGRPIGSGGAGGDGGGGG 513
GG G GG
Sbjct: 63 GNGGGNGNSGGGSGTGGN 80



Score = 31.6 bits (71), Expect = 0.008
Identities = 29/89 (32%), Positives = 35/89 (39%), Gaps = 3/89 (3%)

Query: 142 GNGGAGGTGSAANPDGGNGGAAGLIGNGGAGGTGYSPTTGSGAVGGNGGAGGTGGWLYGS 201
GN G TG +G N GG+G G G+ GNGG G G GS
Sbjct: 18 GNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSG--GGS 75

Query: 202 GGSGGIGGVGGSGTIGAPS-GHGGSGGLG 229
G G + V G P+ G+GGL
Sbjct: 76 GTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 31.6 bits (71), Expect = 0.009
Identities = 22/75 (29%), Positives = 29/75 (38%)

Query: 273 GNGGAGGIGGAGGVVGSSEGGVDGGVGGSGGSGGATGLLGNGGAGGTGGQGGDGGRGFDG 332
G G G G + G+ G G S GSG ++ GG G+G G G +G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 333 SLGAGGAGGTGAVGG 347
GG+G G
Sbjct: 66 GGNGNSGGGSGTGGN 80



Score = 31.6 bits (71), Expect = 0.009
Identities = 38/122 (31%), Positives = 42/122 (34%), Gaps = 5/122 (4%)

Query: 255 TAGAGGPAGTGGHGGWLYGNGGAGGIGGAGGVVGSSEGGVDGGVGGSGGSGGATGLLGNG 314
+ G G TG H NGG G+G GG S + G GGSG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWG-GGSGSGIHWGGGS 60

Query: 315 GAGGTGGQGGDGGRGFDGSLGAGGAGGTGAVGGSGGWLVGDGGTGGDGGQGGGGHLGGAF 374
G G GG G GG GS G A G + G GG G L A
Sbjct: 61 GHGNGGGNGNSGG----GSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAI 116

Query: 375 AG 376
A
Sbjct: 117 AD 118



Score = 30.1 bits (67), Expect = 0.025
Identities = 35/105 (33%), Positives = 44/105 (41%), Gaps = 3/105 (2%)

Query: 240 AGGNG-GQGGGENFASTAGAGGPAGTGGHGGWLYGNGGAGGIGGAGGVVGSSEGGVDGGV 298
+GG+G G G + S GGP G G GG G+G + GG GS G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 299 GGSGGSGGATGLLGNGGAGGTGGQGGDGGRGFDGSLGAGGAGGTG 343
G+GG G +G G+G G GF +L GAGG
Sbjct: 62 HGNGGGNGNSG-GGSGTGGNLSAVAAPVAFGFP-ALSTPGAGGLA 104



Score = 29.7 bits (66), Expect = 0.037
Identities = 25/78 (32%), Positives = 32/78 (41%), Gaps = 1/78 (1%)

Query: 217 GAPSGHGGSGGLGGGTGLFGQGGAGGNGGQGGGENFASTAGA-GGPAGTGGHGGWLYGNG 275
G GH G G G G GG G ++S GG +G+G H G G+G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 276 GAGGIGGAGGVVGSSEGG 293
GG G +GG G+
Sbjct: 64 NGGGNGNSGGGSGTGGNL 81



Score = 29.7 bits (66), Expect = 0.042
Identities = 29/83 (34%), Positives = 32/83 (38%), Gaps = 1/83 (1%)

Query: 201 SGGSGGIGGVGGSGTIGAPSGHGGSGGLGGGTGLFGQGGAGGNGGQGGGENFASTAGAGG 260
SGG G G T G +G G+GGG G G + N GGG G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGG-ASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 261 PAGTGGHGGWLYGNGGAGGIGGA 283
G GG G G G GG A
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSA 83


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_0840  ACRIFLAVINRP  52     1e-08    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 51.8 bits (124), Expect = 1e-08
Identities = 48/292 (16%), Positives = 100/292 (34%), Gaps = 32/292 (10%)

Query: 140 VVVYIVGKNETEAYASVHAVRHIVD--NTPAPPGLKAYVTGPSALNADQAEAGDKSIAKV 197
+ I A + A++ + P G+K D SI +V
Sbjct: 287 AGLGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYP------YDTTPFVQLSIHEV 340

Query: 198 --TAITSVVIAVMLLFI-YRSVVTAFLVLIMVGIDLGAIRGTIAFLANHNIFNLSTFATN 254
T ++++ +++++ +++ + I V + + GT A LA ++++T
Sbjct: 341 VKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPV---VLLGTFAILAAFG-YSINTLTMF 396

Query: 255 LLVLLAIAASTDYAIFMLGRYHEARYAGEDRETAFYTMFHGTAHV---ILGSGLTIAGA- 310
+VL AI D AI ++ R ED+ + + ++G + ++
Sbjct: 397 GMVL-AIGLLVDDAIVVVENVE--RVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVF 453

Query: 311 --MYCLSFARLPYFQTLAAPCAIGMLVAVFAALTLGPAVLAV----GSAFKLFDPKRRVN 364
M + ++ + M ++V AL L PA+ A SA +
Sbjct: 454 IPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFG 513

Query: 365 --TRRWRRVGTAIVRWPGPVLAATC--LVASIGLLALPSYKTTYDLRKFMPS 412
+ G +L +T L+ ++A F+P
Sbjct: 514 WFNTTFDHSVNHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPE 565



Score = 46.0 bits (109), Expect = 8e-07
Identities = 33/154 (21%), Positives = 64/154 (41%), Gaps = 7/154 (4%)

Query: 772 AISLIVIIMMLITRSVVAAAVIVGTVLLSMGSSFGLSVLVWEDILGIELYWMVLAMSVIL 831
AI L+ ++M L +++ A + V + + +F + I + ++ MVLA+ +++
Sbjct: 347 AIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLV 406

Query: 832 LLAVGSDYNLLLISRLKEEIGAGLNTGIIRAMAGTGGVVTAAGMVF-AVTMSLFVFSDL- 889
A+ N + R+ E ++M+ G + MV AV + + F
Sbjct: 407 DDAIVVVEN---VERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGST 463

Query: 890 -RIIGQIGTTIGLGLLFDTLIVRSFMTPSIAALL 922
I Q TI + L+ TP++ A L
Sbjct: 464 GAIYRQFSITIVSAMALSVLVALIL-TPALCATL 496


10  MMAR_0922  MMAR_0943  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_0922  3       14       -0.517166  ABC transporter ATP-binding protein
MMAR_0923  1       15       -0.830439  transcriptional regulator
MMAR_0924  0       15       -1.001552  carveol-like dehydrogenase
MMAR_0925  0       13       -1.949834  ion antiporter, NhaP
MMAR_0926  1       13       -3.308182  PPE family protein
MMAR_0927  1       22       -5.538418  hypothetical protein
MMAR_0928  0       23       -5.251698  cytochrome P450 189A6 Cyp189A6
MMAR_0929  2       25       -5.328720  TetR family transcriptional regulator
MMAR_0930  2       26       -4.939825  transcriptional regulatory protein EmbR_2
MMAR_0931  2       20       -3.691621  hypothetical protein
MMAR_0932  1       18       -2.801909  PPE family protein
MMAR_0933  -1      10       0.201901   hypothetical protein
MMAR_0934  -1      10       -0.184135  methyltransferase/methylase
MMAR_0935  -1      10       0.049542   adenylate cyclase
MMAR_0936  0       10       0.579412   arylsulfatase AtsA
MMAR_0937  1       11       1.058992   peptide amidase, GatA
MMAR_0938  5       14       5.998887   cytochrome P450 135B4 Cyp135B4
MMAR_0939  7       14       6.187166   hypothetical protein
MMAR_0940  4       13       6.310756   galactose-1-phosphate uridylyltransferase
MMAR_0941  3       14       6.567465   galactokinase
MMAR_0942  2       13       5.779782   transcriptional regulator
MMAR_0943  2       15       5.837607   PE-PGRS family protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0923  HTHTETR  67     7e-16    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 66.6 bits (162), Expect = 7e-16
Identities = 22/185 (11%), Positives = 63/185 (34%), Gaps = 12/185 (6%)

Query: 5 AERGAQTRAALMAAAVAVIAERGWGAATTRMVAERAGLPPGLVHYHFASLNDLLIDAALQ 64
+ +TR ++ A+ + +++G + + +A+ AG+ G +++HF +DL +
Sbjct: 6 KQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLF-SEIWE 64

Query: 65 AAREEAAQVLDGLAGDSPSQGIDRLIDAVSSYDVDDRNQNPAILVFGEMLLAATRYERLR 124
+ ++ P + L + + ++ + E++ +
Sbjct: 65 LSESNIGELELEYQAKFPGDPLSVLREILIHV-LESTVTEERRRLLMEIIFHKCEFVGEM 123

Query: 125 MGLAEILGDYRSALRQWLADQGGA----------IDPEATAALMFAAIDGLVLHRVIDPR 174
+ + + + + A +M I GL+ + + P+
Sbjct: 124 AVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQ 183

Query: 175 LRTLA 179
L
Sbjct: 184 SFDLK 188


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_0924  DHBDHDRGNASE  111    2e-31    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 111 bits (278), Expect = 2e-31
Identities = 70/251 (27%), Positives = 119/251 (47%), Gaps = 17/251 (6%)

Query: 3 ALDGRVALITGGARGQGRAHALALAGQGADIALADAPGPMAELTYPLGSEEDLLATAELV 62
++G++A ITG A+G G A A LA QGA IA D + E L +
Sbjct: 5 GIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDY------------NPEKLEKVVSSL 52

Query: 63 GQLGRRCLPMVVDVRDAAQVNTAVERTVRELGSLDIVLANAGIVSTGRLEEVSDQVWQQL 122
R DVRD+A ++ R RE+G +DI++ AG++ G + +SD+ W+
Sbjct: 53 KAEARHAEAFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEAT 112

Query: 123 MDTNLTGVFHTLRAAIPVMRQQRFGRIVATSSMGGRMGIPELAAYNATKWGIIGLIKSVA 182
N TGVF+ R+ M +R G IV S + +AAY ++K + K +
Sbjct: 113 FSVNSTGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLG 172

Query: 183 LEVAKEGITANVICPTTTQTPMVQPAGIGDDQEVPDDLVRRMMKANPIPQPW---LQPED 239
LE+A+ I N++ P +T+T M ++ + +++ ++ P +P D
Sbjct: 173 LELAEYNIRCNIVSPGSTETDMQWSLWADENGA--EQVIKGSLETFKTGIPLKKLAKPSD 230

Query: 240 VSRGVVYLVTD 250
++ V++LV+
Sbjct: 231 IADAVLFLVSG 241


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_0925  ACRIFLAVINRP  29     0.039    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 29.0 bits (65), Expect = 0.039
Identities = 17/71 (23%), Positives = 32/71 (45%), Gaps = 11/71 (15%)

Query: 54 PIVVALADVALFTVLFTDGQRANVRELRETWTLSGRALGVGMPLTMIGIAVPAHFLTGLN 113
P +VA++ V +F L L E+W++ + + +PL ++G + A L
Sbjct: 873 PALVAISFVVVFLCLAA---------LYESWSIPVSVM-LVVPLGIVG-VLLAATLFNQK 921

Query: 114 WPTAFLVGAIL 124
F+VG +
Sbjct: 922 NDVYFMVGLLT 932


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0926  cloacin  38     1e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.2 bits (88), Expect = 1e-04
Identities = 27/84 (32%), Positives = 38/84 (45%), Gaps = 13/84 (15%)

Query: 564 NSGNTNTGLWNAGNVNTGFGGIGTYSGNSGFFNSGTGNSGFFNSSDDNSGFGNSSSGGHN 623
N+G +T GN+N G G+G G + G SS++N G S SG H
Sbjct: 10 NTGAHSTS----GNINGGPTGLGV---------GGGASDGSGWSSENNPWGGGSGSGIHW 56

Query: 624 SGAANSGSGGYNAGFGNSNTGGGS 647
G + G+GG N G + GG+
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGTGGN 80


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0929  HTHTETR  52     1e-10    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 52.3 bits (125), Expect = 1e-10
Identities = 27/183 (14%), Positives = 57/183 (31%), Gaps = 22/183 (12%)

Query: 19 AVRRDDRILDIVVHLLQTEGYDAVQLREVARRARTSLATIYKRYANRDELILAALEFWMD 78
A ILD+ + L +G + L E+A+ A + IY + ++ +L E +
Sbjct: 9 AQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE--LS 66

Query: 79 EHHYAGLAEQTPAPGESLYAGMMRVLRTIFQPWETHPDIVKAYFRARAAPGGQRLVHRGL 138
E + L + A P +I+ + +RL
Sbjct: 67 ESNIGELELEYQAKFPG-------------DPLSVLREILIHVLESTVTEERRRL----- 108

Query: 139 DMVVPAAMEVLAGVDENFIHDLDTVISSLVYGLLGRFTAGEIAITEILPSID-RTVFWLI 197
++ + + + + Y + + I + + R ++
Sbjct: 109 -LMEIIFHKCEFVGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIM 167

Query: 198 RGY 200
RGY
Sbjct: 168 RGY 170


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
MMAR_0932  CABNDNGRPT  40     5e-05    NodO calcium binding signature.
		>CABNDNGRPT#NodO calcium binding signature.

Length = 479

Score = 40.3 bits (94), Expect = 5e-05
Identities = 49/229 (21%), Positives = 74/229 (32%), Gaps = 13/229 (5%)

Query: 237 NFGSGNLGSSNFGWAN---LGSNNIGVANAGGGNQGFGNIGNVNTGFGNTGIGNFGLANT 293
N GS G F LG + G NAG G+ + + + + + +G T
Sbjct: 175 NPGSEEYGRQTFTHEIGHALGLAHPGEYNAGEGDPSYNDAVYAEDSYQFSIMSYWGENET 234

Query: 294 GNNNIGIALTGDNQIGIGGLNSGVGNFGLFNSGTGNVGFF-NSGNGNFGIGNTGDFNTGV 352
G + G I + G +G GF N+ + ++
Sbjct: 235 GADYNGHYGGAPMIDDIAAIQRLYGANMTTRTGDSVYGFNSNTDRDFYTATDSSKALIFS 294

Query: 353 WNSGSGNSGFFNPGMFNTGVLDVGNANTGYLNTGSYNMGSFNPGASNTGAFNIGDGNTGW 412
G F G N +++ + + N+ S G + A G GN
Sbjct: 295 VWDAGGTDTFDFSGYSNNQRINLNEGSFSDVGGLKGNV-SIAHGVTIENAIG-GSGNDIL 352

Query: 413 F-NNGD--LNTGALN---FGDMNNGLLNTGDLNNGFFYRGVGQGSLHFA 455
N+ D L GA N +G L G + F Y G GQ S A
Sbjct: 353 VGNSADNILQGGAGNDVLYGGAGADTLYGGAGRDTFVY-GSGQDSTVAA 400



Score = 31.1 bits (70), Expect = 0.028
Identities = 26/156 (16%), Positives = 42/156 (26%), Gaps = 21/156 (13%)

Query: 202 QLIGVNLGLANVGSGNVGNANNGLGNIGN----------GNLGNGNFGSGNLGSSNFGWA 251
IG LGLA+ G N G + + G +G + ++G A
Sbjct: 188 HEIGHALGLAHPGEYNAGEGDPSYNDAVYAEDSYQFSIMSYWGENE--TGADYNGHYGGA 245

Query: 252 NLGSNNIGVANAGGGNQGFGNIGNVNTGFGNTGIGNFGLANTGNNNIGIALTGD------ 305
+ + + G N +V NT + ++ I
Sbjct: 246 PMIDDIAAIQRLYGANMTTRTGDSVYGFNSNTDRDFYTATDSSKALIFSVWDAGGTDTFD 305

Query: 306 --NQIGIGGLNSGVGNFGLFNSGTGNVGFFNSGNGN 339
+N G+F GNV G
Sbjct: 306 FSGYSNNQRINLNEGSFSDVGGLKGNV-SIAHGVTI 340


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0943  cloacin  44     6e-06    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 43.5 bits (102), Expect = 6e-06
Identities = 36/84 (42%), Positives = 42/84 (50%)

Query: 839 GQGGAGGDGGAGSTTGNQGGGGAGGNGGGGGAGGNGGSGANGADGSISGLGGQAGGNGGD 898
G G G + GA ST+GN GG G GGG + G+G S N G SG G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 899 GGDAGVGGSGGDGGAGLSVGAHGA 922
G G G SGG G G ++ A A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 42.8 bits (100), Expect = 1e-05
Identities = 38/102 (37%), Positives = 43/102 (42%)

Query: 1016 GNGGAGGTGKASSATGGIGGGGGNGGAAGQVGDGGGGGTGGNGGNGGAGRDGAMGGTGGI 1075
G G G A S +G I GG G G DG G + N GG+G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1076 GGDAGMVGNGGGGGTGGNGGNGGNGNAGGGPGIGGNGAEGGA 1117
G G +GGG GTGGN A G P + GA G A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 38.5 bits (89), Expect = 2e-04
Identities = 29/79 (36%), Positives = 35/79 (44%)

Query: 974 GNGGAGGSSPQGSGGGGGNGGAGGTGGDAGLQEGGGGTGGNGGNGGAGGTGKASSATGGI 1033
G G G ++ S G NGG G G G +G G + N GG G+G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1034 GGGGGNGGAAGQVGDGGGG 1052
G GGGNG + G G GG
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 37.0 bits (85), Expect = 6e-04
Identities = 32/84 (38%), Positives = 37/84 (44%), Gaps = 1/84 (1%)

Query: 385 VSGGDGSSGGNGGNPGVGGAGGAGGTGAGGARAADGATGNSPTSGGNGGNGGAGADAIGS 444
+SGGDG G N G G G TG G A +G S + GG G+G G
Sbjct: 1 MSGGDGR-GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 445 GQSGGAGGAGGNGGRVGNGGNGGA 468
G GG G +GG G GGN A
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSA 83



Score = 36.6 bits (84), Expect = 7e-04
Identities = 35/115 (30%), Positives = 43/115 (37%)

Query: 449 GAGGAGGNGGRVGNGGNGGAGGNGFLGAPAFSSNPGGNGGNGGAGGAGGNGGIQAGNGGD 508
G G G N G GN G G S G + N GG G+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 509 GGAGGHGGAGATGGSGTNGFDGAFSGADGSPGGNGGKGGTGGAGGNGGAAGLALA 563
G GG+G +G G+G N A A G P + G + GA A+A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIA 117



Score = 36.6 bits (84), Expect = 8e-04
Identities = 30/82 (36%), Positives = 34/82 (41%), Gaps = 2/82 (2%)

Query: 813 TGADGATVGTGVDGQDGGNGGAGGTGGQGGAGGDGGAGSTTGNQGGGGAGGN--GGGGGA 870
+G DG TG G G G GG DG S+ N GGG+G GGG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 871 GGNGGSGANGADGSISGLGGQA 892
GNGG N GS +G A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSA 83



Score = 35.5 bits (81), Expect = 0.001
Identities = 33/89 (37%), Positives = 41/89 (46%), Gaps = 4/89 (4%)

Query: 361 GGGGTGGSGGAGGAAGDGGDGATGVSGGDGSSGGNG----GNPGVGGAGGAGGTGAGGAR 416
GG G G + GA +G+ G TG+ G G+S G+G NP GG+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 417 AADGATGNSPTSGGNGGNGGAGADAIGSG 445
G GNS G GGN A A + G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 35.5 bits (81), Expect = 0.002
Identities = 27/79 (34%), Positives = 34/79 (43%), Gaps = 3/79 (3%)

Query: 1057 NGGNGGAGRDGAMGGTGGIGGDAGMVGNGGGGGTGGNGGNGGNGNAGG---GPGIGGNGA 1113
+GG+G GA +G I G +G GGG G + N GG G GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1114 EGGAGGSGGSSSGNGGAGG 1132
G GG+G S G+G G
Sbjct: 62 HGNGGGNGNSGGGSGTGGN 80



Score = 35.5 bits (81), Expect = 0.002
Identities = 32/85 (37%), Positives = 38/85 (44%), Gaps = 1/85 (1%)

Query: 663 GNGGRGGNGGLVGNGGDGGTGGAGGTGLGGLTNSFPGSSGTGGDPGGTGGNGGAGGSGGF 722
G GRG N G G+ GG G G+GG + G S GG G+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGN-INGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 723 FAGDGGNGGAGGVGGTGGDGGNGAT 747
GGNG +GG GTGG+ A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 34.7 bits (79), Expect = 0.003
Identities = 32/103 (31%), Positives = 42/103 (40%), Gaps = 1/103 (0%)

Query: 800 GAGGNAGNGGSGATGADGATVGTGVDGQDGGNGGAGGTGGQGGAGGDGGAGSTTGNQGGG 859
G G N G+ +T + TG+ G + G+G + GG G+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 860 GAGGNGGGGGAGGNGGSGANGADGSISGLGGQAGGNGGDGGDA 902
G GG G G GG+G G A + G A G GG A
Sbjct: 63 GNGGGNGNSG-GGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 34.3 bits (78), Expect = 0.004
Identities = 33/100 (33%), Positives = 38/100 (38%), Gaps = 1/100 (1%)

Query: 316 TGADAANPTTTGQAGGDGGNGGAGGAGGDGGGGGKSGWLGLAGALGGGGTGGSGGAGGAA 375
+G D T + NGG G G GG SGW GGG+G GG +
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGW-SSENNPWGGGSGSGIHWGGGS 60

Query: 376 GDGGDGATGVSGGDGSSGGNGGNPGVGGAGGAGGTGAGGA 415
G G G G SGG +GGN A G GA
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGA 100



Score = 34.3 bits (78), Expect = 0.004
Identities = 27/87 (31%), Positives = 32/87 (36%)

Query: 738 TGGDGGNGATGGTGATAGANGQDGGNGGTGGTGGQGGAGGAGGATFTGKAGTQGAGGAGG 797
+GGDG TG + NG G G GG G G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 798 DGGAGGNAGNGGSGATGADGATVGTGV 824
G GGN +GG TG + + V V
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPV 88



Score = 34.3 bits (78), Expect = 0.004
Identities = 31/87 (35%), Positives = 35/87 (40%), Gaps = 7/87 (8%)

Query: 1159 NGGDGRGGAGGTGGTGGTGGDGGVGGNGGAGGKAISPTGQDGAQGAGGNGGAGGTGGNGG 1218
+GGDGRG G T G G G G G S + GG+G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1219 FGGSGGSGADGTLFSSLAGTGGTGGNG 1245
G GG+G G G GTGGN
Sbjct: 62 HGNGGGNGNSG-------GGSGTGGNL 81



Score = 33.9 bits (77), Expect = 0.006
Identities = 34/94 (36%), Positives = 40/94 (42%), Gaps = 6/94 (6%)

Query: 1278 AGTGGFGGNGGLGSAFGP--GGTGGNGGSGGSSSTSAGGGGGTGGAGGTGFDGSAGGAGG 1335
+G G G N G S G GG G G GG+S S GG+G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1336 NGGQGGDGLGTAGDGGNGGTGGKGSNAPIGFSFG 1369
+G GG+G GG GTGG S +FG
Sbjct: 62 HGNGGGNG----NSGGGSGTGGNLSAVAAPVAFG 91



Score = 33.5 bits (76), Expect = 0.006
Identities = 32/79 (40%), Positives = 36/79 (45%), Gaps = 2/79 (2%)

Query: 1086 GGGGTGGNGG-NGGNGNAGGGPGIGGNGAEGGAGGSGGSSSGNGGAGGKGGSGGTGGAGQ 1144
GG G G N G + +GN GGP G G + GSG SS N GG G GG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPT-GLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1145 FGNTGGNGANGTFDNGGDG 1163
GN GGNG +G G
Sbjct: 62 HGNGGGNGNSGGGSGTGGN 80



Score = 33.5 bits (76), Expect = 0.006
Identities = 26/84 (30%), Positives = 30/84 (35%)

Query: 550 GAGGNGGAAGLALAATGHDGGHGTGGAGGNGGAGGSAGNGGNGAKGVSGVNNGAGGNGGD 609
G G G G + +GG G GG G + N G SG GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 610 GGTPGIGGTGGAGGDSGGGSHGAT 633
G G G +GG G G S A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 33.1 bits (75), Expect = 0.008
Identities = 37/112 (33%), Positives = 43/112 (38%), Gaps = 9/112 (8%)

Query: 685 AGGTGLGGLTNSFPGSSGTGGDPGGTGGNGGAGGSGGFFAGDGGNGGAGGVGGTGGDGGN 744
+GG G G T + S G P G G GGA G+ + + GG G G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 745 GATGGTGATAGANGQDGGNGGTGGTGGQGGAGGAGGATFTGKAGTQGAGGAG 796
GG G +GG GTGG A A A T GAGG
Sbjct: 62 HGNGGGN---------GNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 33.1 bits (75), Expect = 0.008
Identities = 34/105 (32%), Positives = 39/105 (37%), Gaps = 6/105 (5%)

Query: 601 NGAGGNGGDGGTPG-IGGTGGAGGDSGGGSHGATGSDGATPHSGGNGGKGGDGADATGFG 659
+G G N G T G I G G GG S G+ S P GG+G G G
Sbjct: 5 DGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG-----G 59

Query: 660 QTGGNGGRGGNGGLVGNGGDGGTGGAGGTGLGGLTNSFPGSSGTG 704
GNGG GN G G + A G S PG+ G
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 33.1 bits (75), Expect = 0.009
Identities = 31/123 (25%), Positives = 40/123 (32%)

Query: 1115 GGAGGSGGSSSGNGGAGGKGGSGGTGGAGQFGNTGGNGANGTFDNGGDGRGGAGGTGGTG 1174
GG G + + + GG G G G + G + GG G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1175 GTGGDGGVGGNGGAGGKAISPTGQDGAQGAGGNGGAGGTGGNGGFGGSGGSGADGTLFSS 1234
G GG G G G G +S A G G G S A + ++
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMAA 122

Query: 1235 LAG 1237
L G
Sbjct: 123 LKG 125



Score = 33.1 bits (75), Expect = 0.010
Identities = 27/73 (36%), Positives = 31/73 (42%)

Query: 1071 GTGGIGGDAGMVGNGGGGGTGGNGGNGGNGNAGGGPGIGGNGAEGGAGGSGGSSSGNGGA 1130
G G G GN GG TG G G + +G G G+G G SG+G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 1131 GGKGGSGGTGGAG 1143
GG G SGG G G
Sbjct: 66 GGNGNSGGGSGTG 78



Score = 32.8 bits (74), Expect = 0.010
Identities = 24/81 (29%), Positives = 32/81 (39%)

Query: 969 SGGHGGNGGAGGSSPQGSGGGGGNGGAGGTGGDAGLQEGGGGTGGNGGNGGAGGTGKASS 1028
+G H +G G GGG + G+G + + G G GG G G G +
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGN 70

Query: 1029 ATGGIGGGGGNGGAAGQVGDG 1049
+ GG G GG A V G
Sbjct: 71 SGGGSGTGGNLSAVAAPVAFG 91



Score = 32.0 bits (72), Expect = 0.018
Identities = 26/80 (32%), Positives = 30/80 (37%), Gaps = 4/80 (5%)

Query: 642 SGGNGGKGGDGADATGFGQTGGNGGRGGNGGLVGNGGDGGTGGAGGTGLGGLTNSFPGSS 701
SGG+G GA +T GG G G GG DG + GG + S
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGG----ASDGSGWSSENNPWGGGSGSGIHWG 57

Query: 702 GTGGDPGGTGGNGGAGGSGG 721
G G G G GGSG
Sbjct: 58 GGSGHGNGGGNGNSGGGSGT 77



Score = 32.0 bits (72), Expect = 0.020
Identities = 30/85 (35%), Positives = 37/85 (43%), Gaps = 7/85 (8%)

Query: 857 GGGGAGGNGGGGGAGGNGGSGANGADGSISGLGGQAGGNGGDGGDAGVGGSGGDGGAGLS 916
GG G G N G GN +G +GLG G + G G + GG G+G+
Sbjct: 3 GGDGRGHNTGAHSTSGN-------INGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIH 55

Query: 917 VGAHGATGNVVANGGNGGQGGNGGN 941
G GN NG +GG G GGN
Sbjct: 56 WGGGSGHGNGGGNGNSGGGSGTGGN 80



Score = 31.6 bits (71), Expect = 0.024
Identities = 35/107 (32%), Positives = 45/107 (42%), Gaps = 6/107 (5%)

Query: 929 NGGNGGQGGNGGNALIGTSDGGAGGAGGAGGKAWLMGDGGSGGHGGNGGAGGSSPQGSGG 988
+GG+G G ++ G +GG G G GG + G G S + GG GS GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGAS--DGSGWSSENNPWGGGSGSGI-HWGG 58

Query: 989 GGGNGGAGGTGGDAGLQEGGGGTGGNGGNGGAGGTGKASSATGGIGG 1035
G G+G GG G G G G G G + +T G GG
Sbjct: 59 GSGHGNGGGNGNSGG---GSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 31.6 bits (71), Expect = 0.027
Identities = 31/89 (34%), Positives = 36/89 (40%), Gaps = 8/89 (8%)

Query: 308 GGVGGAGATGADAANPTTTGQAGGDGGNGGAGGAGG--------DGGGGGKSGWLGLAGA 359
GG G TGA + + G G G GGA G GG G W G +G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 360 LGGGGTGGSGGAGGAAGDGGDGATGVSGG 388
GGG G SGG G G+ A V+ G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 30.8 bits (69), Expect = 0.044
Identities = 33/114 (28%), Positives = 45/114 (39%), Gaps = 4/114 (3%)

Query: 588 NGGNGAKGVSGVNNGAGGNGGDGGTPGIGGTGGAGGDSGGGSHGATGSDGATPHSGGNGG 647
+GG+G +G ++ +G G G+GG G ++ G G+ H GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 648 KGGDGADATGFGQTGGNGGRGGNGGLVGNGGDGGTGGAGGTGLGGLTNSFPGSS 701
G G G G +GG G GGN V G G GGL S +
Sbjct: 62 HGNGG----GNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 30.8 bits (69), Expect = 0.048
Identities = 28/104 (26%), Positives = 36/104 (34%)

Query: 269 VSGTDGGAGGIGGDGGAGGAGGLLIGNGGAGGAGGLGGLGGVGGAGATGADAANPTTTGQ 328
+SG DG G +G G G G GGA G G+ + G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 329 AGGDGGNGGAGGAGGDGGGGGKSGWLGLAGALGGGGTGGSGGAG 372
G+GG G G G GG + +A T G+GG
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104


11  MMAR_1097  MMAR_1108  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_1097  -1      8        3.036279   cutinase
MMAR_1098  1       9        3.412187   cutinase
MMAR_1099  2       10       3.388042   hypothetical protein
MMAR_1100  2       10       2.918872   membrane-anchored serine protease (mycosin),
MMAR_1101  1       11       2.184792   hypothetical protein
MMAR_1102  -1      9        1.640441   hypothetical protein
MMAR_1103  1       12       1.630459   hypothetical protein
MMAR_1104  -1      14       0.091683   EsaT-6 like protein, EsxU
MMAR_1105  1       11       2.003488   EsaT-6 like protein, EsxT
MMAR_1106  2       9        1.732200   50S ribosomal protein L13
MMAR_1107  1       10       2.557299   30S ribosomal protein S9
MMAR_1108  2       12       2.571440   phosphoglucosamine mutase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_1098  PF06057  30     0.007    Type IV secretory pathway VirJ component
		>PF06057#Type IV secretory pathway VirJ component

Length = 243

Score = 30.2 bits (68), Expect = 0.007
Identities = 16/46 (34%), Positives = 23/46 (50%), Gaps = 10/46 (21%)

Query: 112 KIVLGGYSQGADVIDIVAGVPLAGISFGNELPAQYADNIAAVAVFG 157
K++L GYS GA+VI V NE+PA+Y N+ +
Sbjct: 118 KVILIGYSFGAEVIPFVL----------NEMPARYRKNVLGAVLLS 153


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
MMAR_1100  SUBTILISIN  168    8e-51    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 168 bits (428), Expect = 8e-51
Identities = 87/321 (27%), Positives = 137/321 (42%), Gaps = 54/321 (16%)

Query: 72 LAGLDLPEAWRLTRGSGQRVAVIDTGV-AHHRRLA-HLVAGGDYVFTGDG----TQDCDA 125
+ + P W TRG G +VAV+DTG A H L ++ G ++ +G +D +
Sbjct: 26 VEMIQAPAVWNQTRGRGVKVAVLDTGCDADHPDLKARIIGGRNFTDDDEGDPEIFKDYNG 85

Query: 126 HGTIVAGIIAAAPDSATDQFSGVAPDATVIGIRQSSAKFSPVGNRSSTGVGDVDTMARAV 185
HGT VAG IAA + GVAP+A ++ I+ + + G G D + + +
Sbjct: 86 HGTHVAGTIAATENENG--VVGVAPEADLLIIKVLNKQ----------GSGQYDWIIQGI 133

Query: 186 RTAADLGASVINISTIACVAAESPPDDRALGAALAYAVDVKNAVIVAAAGNTGGAAQCPS 245
A + +I++S P D L A+ AV +++ AAGN G
Sbjct: 134 YYAIEQKVDIISMS------LGGPEDVPELHEAVKKAVA-SQILVMCAAGNEGDGD---- 182

Query: 246 QRPETSRDTATVVVSPAWYDDYVLTVGSVNANGEPSAFTLPSPWVDVAAGGENVTSLNPV 305
D + P Y+ V++VG++N + S F+ + VD+ A GE++ S P
Sbjct: 183 -------DRTDELGYPGCYN-EVISVGAINFDRHASEFSNSNNEVDLVAPGEDILSTVP- 233

Query: 306 GDGTVNGLDDHGGFRPLSGTSYAAPVVSGLAALIRARFPALTAR-----QVMARIISTAH 360
G + SGTS A P V+G ALI+ A R ++ A++I
Sbjct: 234 ----------GGKYATFSGTSMATPHVAGALALIKQLANASFERDLTEPELYAQLIKRT- 282

Query: 361 HPPHGWDPFVGNGTVDVLAAV 381
P GNG + + A
Sbjct: 283 IPLGNSPKMEGNGLLYLTAVE 303


12  MMAR_1119  MMAR_1137  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_1119  0       14       3.368836   alanine racemase
MMAR_1120  -1      15       3.319233   hydrolase
MMAR_1121  0       18       2.886864   hypothetical protein
MMAR_1122  -1      15       2.238786   hypothetical protein
MMAR_1123  -3      13       1.966034   ribosomal-protein-alanine acetyltransferase,
MMAR_1124  0       13       0.670048   putative DNA-binding/iron metalloprotein/AP
MMAR_1125  1       13       -0.048969  co-chaperonin GroES
MMAR_1126  4       14       -0.343000  chaperonin GroEL
MMAR_1127  4       12       -1.364830  hypothetical protein
MMAR_1128  4       13       -1.489297  hypothetical protein
MMAR_1129  3       13       -1.146909  PPE family protein
MMAR_5550  2       10       -0.692364  hypothetical protein
MMAR_1130  2       10       -0.034765  PPE family protein
MMAR_1131  0       12       0.486696   metal-dependent hydrolase
MMAR_1132  -1      9        2.154219   WhiB-like regulatory protein, WhiB3
MMAR_1133  0       9        2.463920   hypothetical protein
MMAR_1134  1       9        0.621795   RNA polymerase sigma factor SigD
MMAR_1135  1       11       0.630228   hypothetical protein
MMAR_1136  2       13       0.065504   hypothetical protein
MMAR_1137  2       13       0.253703   inosine 5'-monophosphate dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_1119  ALARACEMASE  398    e-140    Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 398 bits (1024), Expect = e-140
Identities = 113/373 (30%), Positives = 178/373 (47%), Gaps = 28/373 (7%)

Query: 14 AEAVVDLGAIAHNVRLLRERAGSAQVMAVVKADGYGHGATAVARTALAAGAVELGVASVD 73
+A +DL A+ N+ ++R+ A A+V +VVKA+ YGHG + A + +++
Sbjct: 5 IQASLDLQALKQNLSIVRQAATHARVWSVVKANAYGHGIERIWSAIGATDG--FALLNLE 62

Query: 74 EALTLRADGITAPVL---AWLHAPGMDFGPALAADVQIAISSIRQLDEVLDAARRTGTTA 130
EA+TLR G P+L + HA ++ + + S QL + +A R
Sbjct: 63 EAITLRERGWKGPILMLEGFFHAQDLEIYDQ--HRLTTCVHSNWQLKALQNA--RLKAPL 118

Query: 131 TVTVKIDTGLNRNGVAPALYPEMVTRLRQAVAEDAIRLRGLMTHMVHADAPEKPINDIQS 190
+ +K+++G+NR G P ++T +Q A + LM+H A+ P+ +
Sbjct: 119 DIYLKVNSGMNRLGFQP---DRVLTVWQQLRAMANVGEMTLMSHFAEAEHPDGISGAMA- 174

Query: 191 QRFKQMFDHARDQGVRFEVAHLSNSSATMARPDLTLDLVRPGIAVYGLSPVPRLGDM--- 247
R +Q +G+ LSNS+AT+ P+ D VRPGI +YG SP + D+
Sbjct: 175 -RIEQAA-----EGLECRR-SLSNSAATLWHPEAHFDWVRPGIILYGASPSGQWRDIANT 227

Query: 248 GLVPAMTVKCAVALVKSVSAGEGVSYGHTWIAPHDTNVALLPIGYADGVFRSLGGRLEVL 307
GL P MT+ + V+++ AGE V YG + A + + ++ GYADG R VL
Sbjct: 228 GLRPVMTLSSEIIGVQTLKAGERVGYGGRYTARDEQRIGIVAAGYADGYPRHAPTGTPVL 287

Query: 308 INGKRRPGVGRVCMDQFLVDLGPGPLDVAEGDEAILFGPGTRGEPTAQDWADLVGTIHYE 367
++G R VG V MD VDL P P G L+G E D A GT+ YE
Sbjct: 288 VDGVRTMTVGTVSMDMLAVDLTPCP-QAGIGTPVELWGK----EIKIDDVAAAAGTVGYE 342

Query: 368 VVTSPRGRITRVY 380
++ + R+ V
Sbjct: 343 LMCALALRVPVVT 355


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_1120  DHBDHDRGNASE  28     0.046    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 28.1 bits (62), Expect = 0.046
Identities = 22/102 (21%), Positives = 40/102 (39%), Gaps = 14/102 (13%)

Query: 20 LTAVATIVGASARRSLAERGS-VCDDPYAGEDFERLDSDRARVVTTPDGVPLAVREAGPV 78
+T A +G + R+LA +G+ + Y E E++ S + P VR++ +
Sbjct: 13 ITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSAAI 72

Query: 79 D----------APLTMVFAHGFCLRMGAFHFQRMRLGEQWGS 110
D P+ ++ LR G H E+W +
Sbjct: 73 DEITARIEREMGPIDILVNVAGVLRPGLIHSLSD---EEWEA 111


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_1123  SACTRNSFRASE  47     3e-09    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 46.9 bits (111), Expect = 3e-09
Identities = 20/93 (21%), Positives = 32/93 (34%), Gaps = 8/93 (8%)

Query: 53 GARCADNLVGYAGV-SRLGRVAPFEYEIHTIGVDPAYQGRGIGRRLLDELLAFA---DGG 108
+N +G + S A I I V Y+ +G+G LL + + +A
Sbjct: 69 LYYLENNCIGRIKIRSNWNGYA----LIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFC 124

Query: 109 VVFLEVRTDNEPAIALYRSVGFEQVGLRRRYYR 141
+ LE + N A Y F + Y
Sbjct: 125 GLMLETQDINISACHFYAKHHFIIGAVDTMLYS 157


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_1129  cloacin  35     8e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 35.1 bits (80), Expect = 8e-04
Identities = 28/100 (28%), Positives = 38/100 (38%), Gaps = 6/100 (6%)

Query: 187 NANLGSGNTGIGNIGVGNSGEGNSALVPPQSGNYNIGGGNNGNNNLGAGNIGNFNFGFGN 246
+ N+ G TG+G G + G G S+ P G G G + G GN G G
Sbjct: 17 SGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS--GHGNGGGNGNSGGG 74

Query: 247 NGTGNFGFGNAGPADLSNPNLFTFHVTPGENNIGIGNTGN 286
+GTG A P P L TPG + + +
Sbjct: 75 SGTGGNLSAVAAPVAFGFPAL----STPGAGGLAVSISAG 110



Score = 29.3 bits (65), Expect = 0.050
Identities = 35/131 (26%), Positives = 49/131 (37%), Gaps = 27/131 (20%)

Query: 223 GGGNNGNNNLGAGNIGNFNFGFGNNGTGNFGFGNAGPADLSNPNLFTFHVTPGENNIGIG 282
G G+N + +GNI G G G + G G + ENN G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSS-----------------ENNPWGG 48

Query: 283 NTGNGNFGLGNTGDGNIGGGNTGIGNIGFGLNGNNLVGVGGAYYDTAAGQFHFDGLNT-G 341
+G+G G +G GN GG G G G N + + A F F L+T G
Sbjct: 49 GSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAV---------AAPVAFGFPALSTPG 99

Query: 342 SGNIGIGNSGS 352
+G + + S
Sbjct: 100 AGGLAVSISAG 110


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_1130  cloacin  30     0.022    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 30.5 bits (68), Expect = 0.022
Identities = 29/94 (30%), Positives = 39/94 (41%), Gaps = 5/94 (5%)

Query: 386 GNSGIGNFGVGNAGAGNFGAGNSGLLNTGVGNAGSIDTGAFNGNNLNTGFLNSGKTNTGF 445
G G G+ ++ +GN G +GL GVG S +G + NN G SG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGL---GVGGGASDGSGWSSENNPWGGGSGSGIHW--G 57

Query: 446 GNSGHENTGFWNSGDVNTGVGATTDSGLATSGFG 479
G SGH N G + +G G + A FG
Sbjct: 58 GGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
MMAR_1131  UREASE  35     8e-04    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 35.1 bits (81), Expect = 8e-04
Identities = 24/76 (31%), Positives = 35/76 (46%), Gaps = 9/76 (11%)

Query: 6 DAIYTNGDIVTVDDEQPIAEA-VAVKDGRIVAVGAHD-----DVVREHLGPHTRRVDLAG 59
D + TN I+ D I +A + +KDGRI A+G V +GP T + G
Sbjct: 69 DTVITNALIL---DHWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEG 125

Query: 60 NTLLPGFIDPHSHYIN 75
+ G +D H H+I
Sbjct: 126 KIVTAGGMDSHIHFIC 141



Score = 31.2 bits (71), Expect = 0.012
Identities = 13/30 (43%), Positives = 17/30 (56%)

Query: 487 ITINAAYQYSEEQSKGSITVGKLADLVIVD 516
TIN A + GS+ VGK ADLV+ +
Sbjct: 409 YTINPAIAHGLSHEIGSLEVGKRADLVLWN 438


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_1135  PF03544  30     0.013    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 29.6 bits (66), Expect = 0.013
Identities = 17/97 (17%), Positives = 23/97 (23%), Gaps = 2/97 (2%)

Query: 212 VLAPQVPPGNSLTPLVPETVSPVPPNESG--APESAPAPSASPTTTGAKPSASLPPAGAT 269
A Q PP + P P PP E+ + P P P
Sbjct: 63 PQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVE 122

Query: 270 ATSPAPTSVPTPPVSAVVPGETPADTSVVAPGSPAAA 306
+ +P P V + S A
Sbjct: 123 SRPASPFENTAPARPTSSTATAATSKPVTSVASGPRA 159


13  MMAR_1157  MMAR_1167  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_1157  0       14       3.690076   hypothetical protein
MMAR_1158  1       16       4.321246   error-prone DNA polymerase
MMAR_1159  4       24       6.123198   hypothetical protein
MMAR_1160  1       17       5.292393   nitroreductase
MMAR_1161  2       17       5.740943   PE-PGRS family protein
MMAR_1162  -1      14       4.476826   PE-PGRS family protein
MMAR_1163  -1      10       2.101864   tRNA/rRNA methyltransferase SpoU
MMAR_1164  -1      10       1.739186   methyltransferase
MMAR_1165  0       10       0.367145   putative regulatory protein
MMAR_1166  1       14       -4.384764  hypothetical protein
MMAR_1167  1       16       -4.463192  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_1161  cloacin  37     2e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 37.4 bits (86), Expect = 2e-04
Identities = 34/96 (35%), Positives = 40/96 (41%), Gaps = 8/96 (8%)

Query: 160 GNGGNAGLFGSGGA--GGSGGAGAAGGAGGSGGWLYGSGGNGGSGGNAVIAGGNGGAGGA 217
G G N G + G GG G G GGA GW + GG G+ + GG G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 218 GGAAGLWGAGGNGGQGGSGLAGSNGVNPAAVTDPAL 253
G G G +GG G+G S P A PAL
Sbjct: 66 G------GNGNSGGGSGTGGNLSAVAAPVAFGFPAL 95



Score = 37.0 bits (85), Expect = 2e-04
Identities = 28/85 (32%), Positives = 39/85 (45%), Gaps = 1/85 (1%)

Query: 445 AGGDGGWLVGNGGTGGTGGIGGQGGIGGQGGVGGNGAVGGNAHSPILDAHGGDGGAGGDA 504
+GGDG + GG G+G GG +G+ + ++P G GG +
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGG-ASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 505 GHGGTGGDGGDGGLSGAGGRGGLLA 529
GHG GG+G GG SG GG +A
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVA 85



Score = 36.2 bits (83), Expect = 3e-04
Identities = 27/77 (35%), Positives = 33/77 (42%)

Query: 374 GDGGNGGAGGDGATGGSGATGGNGGAGGITTADQNGYSAVGTQAVGGDGGNGGNGGSGGS 433
G G G G +T G+ G G G +D +G+S+ GG G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 434 GGTGGAAGSGGAGGDGG 450
G GG SGG G GG
Sbjct: 63 GNGGGNGNSGGGSGTGG 79



Score = 35.8 bits (82), Expect = 5e-04
Identities = 37/113 (32%), Positives = 49/113 (43%), Gaps = 12/113 (10%)

Query: 308 SGGSGATGGDGGANGGTGGAGGQAASYGYSSGGNGGTGGAGGTGADGGAGGDGGHGGAGG 367
SGG G G + GA+ +G G G G + G+G + GG G G H G
Sbjct: 2 SGGDG-RGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWG--- 57

Query: 368 RGGWLLGDGGNGGAGGDGATGGSGATGGNGGAGGITTADQNGYSAVGTQAVGG 420
G G+G GG+G +GG TGGN A + G+ A+ T GG
Sbjct: 58 ------GGSGHGNGGGNGNSGGGSGTGGNLSA--VAAPVAFGFPALSTPGAGG 102



Score = 35.5 bits (81), Expect = 7e-04
Identities = 37/116 (31%), Positives = 47/116 (40%), Gaps = 8/116 (6%)

Query: 143 GNGGAGFNNGATAGAAGGNGGNAGLFGSGGAGGSGGAGA-----AGGAGGSGGWLYGSGG 197
G G G N GA + + NGG GL GGA G + GG+G W GSG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 198 NGGSGGNAVIAGGNGGAGGAGGAAGL---WGAGGNGGQGGSGLAGSNGVNPAAVTD 250
G G G G + AA + + A G GG ++ S G AA+ D
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 33.9 bits (77), Expect = 0.002
Identities = 24/67 (35%), Positives = 30/67 (44%)

Query: 422 GGNGGNGGSGGSGGTGGAAGSGGAGGDGGWLVGNGGTGGTGGIGGQGGIGGQGGVGGNGA 481
GN G +G G G + GSG + + W G+G GG G G GG G GG
Sbjct: 17 SGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSG 76

Query: 482 VGGNAHS 488
GGN +
Sbjct: 77 TGGNLSA 83



Score = 33.1 bits (75), Expect = 0.003
Identities = 29/92 (31%), Positives = 40/92 (43%), Gaps = 13/92 (14%)

Query: 356 AGGDG-GHGGAGGRGGWLLGDGGNGGAGGDGATGGSGATGGNGGAGGITTADQNGYSAVG 414
+GGDG GH + G G G GA+ GSG + N GG + +
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGI------- 54

Query: 415 TQAVGGDGGNGGNGGSGGSGGTGGAAGSGGAG 446
GG G+G GG+G +GG +G+GG
Sbjct: 55 -----HWGGGSGHGNGGGNGNSGGGSGTGGNL 81



Score = 32.0 bits (72), Expect = 0.006
Identities = 31/91 (34%), Positives = 33/91 (36%)

Query: 410 YSAVGTQAVGGDGGNGGNGGSGGSGGTGGAAGSGGAGGDGGWLVGNGGTGGTGGIGGQGG 469
+S G G G G G S GSG + GG G G G G G GG G GG
Sbjct: 14 HSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGG 73

Query: 470 IGGQGGVGGNGAVGGNAHSPILDAHGGDGGA 500
G GG A P L G G A
Sbjct: 74 GSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.0 bits (72), Expect = 0.008
Identities = 25/79 (31%), Positives = 30/79 (37%), Gaps = 3/79 (3%)

Query: 280 GTGVNGGNGGAGGDANDDVANSSGGLAGSGGSGATGGDGGANGGTGGAGGQAASYGYSSG 339
G G N G G+ N +G G G S +G N GG+G G S
Sbjct: 6 GRGHNTGAHSTSGNIN---GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 340 GNGGTGGAGGTGADGGAGG 358
GNGG G G G+ G
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 29.7 bits (66), Expect = 0.035
Identities = 31/111 (27%), Positives = 35/111 (31%), Gaps = 21/111 (18%)

Query: 333 SYGYSSGGNGGTGGAGGTGADGGAGGDGGHGGAGGRGGWLLGDGGNGGAGGDGATGGSGA 392
S G G N G G +GG G G GGA GW + GG G G G G+
Sbjct: 2 SGGDGRGHNTGAHSTSGN-INGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 393 TGGNGGAGGITTADQNGYSAVGTQAVGGDGGNGGNGGSGGSGGTGGAAGSG 443
GNGG G GG+G G G
Sbjct: 61 GHGNGGGNG--------------------NSGGGSGTGGNLSAVAAPVAFG 91


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
MMAR_1162   cloacin       38     1e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 37.8 bits (87), Expect = 1e-04
Identities = 39/102 (38%), Positives = 45/102 (44%), Gaps = 6/102 (5%)

Query: 143 GNGGAGFNNGATAGAAGGNGGNAGLIGNGGAGGNGGAGA-----AGGTGGNGGWLYGSG- 196
G G G N GA + + NGG GL GGA G + GG+G W GSG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 197 GAGGAGGNALVAGGTGGNGGAGGAAGLWGNGGAGGHGGSGLA 238
G GG GN+ GTGGN A A +G G GLA
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 36.2 bits (83), Expect = 4e-04
Identities = 29/89 (32%), Positives = 36/89 (40%)

Query: 423 TGGDGGAGGTGGHGGAGGAGGAGGVGGDGGWLIGDGGAGGQGGIGGMGGTGGAGGSGVAG 482
T G+ G TG G G + G+G + W G G GG G G GG G SG
Sbjct: 16 TSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGS 75

Query: 483 AHGGNATSSIAAFGGDGGAGGDAGHGGMG 511
GGN ++ A A G GG+
Sbjct: 76 GTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 36.2 bits (83), Expect = 4e-04
Identities = 32/91 (35%), Positives = 39/91 (42%), Gaps = 2/91 (2%)

Query: 125 GAAGTATNPNGGAGGLLYGNGGAGFNNGATAGA--AGGNGGNAGLIGNGGAGGNGGAGAA 182
GA T+ N NGG GL G G + + ++ GG+G G G G GG G +
Sbjct: 12 GAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNS 71

Query: 183 GGTGGNGGWLYGSGGAGGAGGNALVAGGTGG 213
GG G GG L G AL G GG
Sbjct: 72 GGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 35.1 bits (80), Expect = 9e-04
Identities = 25/76 (32%), Positives = 28/76 (36%)

Query: 376 GNGGAGGAGGAGGAGGNGAIGGDGGAAGFTGSNNDSAASTLYGAHGGTGGDGGAGGTGGH 435
G G GA G G G G GS S + G G GG G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 436 GGAGGAGGAGGVGGDG 451
GG G +GG G GG+
Sbjct: 66 GGNGNSGGGSGTGGNL 81



Score = 35.1 bits (80), Expect = 0.001
Identities = 28/77 (36%), Positives = 32/77 (41%)

Query: 376 GNGGAGGAGGAGGAGGNGAIGGDGGAAGFTGSNNDSAASTLYGAHGGTGGDGGAGGTGGH 435
G G G GA GN G G G S+ +S GG+G GG GH
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 436 GGAGGAGGAGGVGGDGG 452
G GG G +GG G GG
Sbjct: 63 GNGGGNGNSGGGSGTGG 79



Score = 34.7 bits (79), Expect = 0.001
Identities = 28/84 (33%), Positives = 34/84 (40%), Gaps = 6/84 (7%)

Query: 341 GGNGGNGGTGGTGTDGLAGSGGGSGGAGGRG------GWLIGNGGAGGAGGAGGAGGNGA 394
G N G T G G G G G G + G G W G+G GG G G G
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 395 IGGDGGAAGFTGSNNDSAASTLYG 418
G GG +G G+ + AA +G
Sbjct: 68 NGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 32.4 bits (73), Expect = 0.005
Identities = 26/84 (30%), Positives = 33/84 (39%), Gaps = 3/84 (3%)

Query: 319 GGADGGTGGVAGVAASYNGVAIGGNGGNGGTGGTGTDGLAGSGGGSGGAGGRGGWLIGNG 378
G G G + + NG G G G + G+G GG G+G G G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWG---GGS 60

Query: 379 GAGGAGGAGGAGGNGAIGGDGGAA 402
G G GG G +GG GG+ A
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAV 84



Score = 32.0 bits (72), Expect = 0.006
Identities = 29/88 (32%), Positives = 37/88 (42%), Gaps = 8/88 (9%)

Query: 160 GNGGNAGLIGNGGA--GGNGGAGAAGGTGGNGGWLYGSGGAGGAGGNALVAGGTGGNGGA 217
G G N G G GG G G GG GW + GG G+ + GG G+G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 218 GGAAGLWGNGGAGGHGGSGLAGNSGVNP 245
G GNG +GG G+G ++ P
Sbjct: 66 G------GNGNSGGGSGTGGNLSAVAAP 87



Score = 30.5 bits (68), Expect = 0.023
Identities = 35/114 (30%), Positives = 43/114 (37%), Gaps = 9/114 (7%)

Query: 309 DGGNGAQGGQGGADGGTGGVAGVAASYNGVAIGGNGGNGGTGGTGTDGLAGSGGGSGGAG 368
+GG G GGA G+G +S N GG+G GG G G G SGG
Sbjct: 21 NGGPTGLGVGGGASDGSGW-----SSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGS 75

Query: 369 GRGGWLIGNGGAGGAGGAGGAGGNGAIGGDGGAAGFTGSNNDSAASTLYGAHGG 422
G G GN A A A G G G A + +A + + A G
Sbjct: 76 GTG----GNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMAALKG 125



Score = 30.1 bits (67), Expect = 0.029
Identities = 37/131 (28%), Positives = 44/131 (33%), Gaps = 17/131 (12%)

Query: 423 TGGDGGAGGTGGHGGAGGAGGAGGVGGDGGWLIGDGGAGGQGGIGGMGGTGGAGGSGVAG 482
+GGDG TG H +G +GG G G G G + G+G S
Sbjct: 2 SGGDGRGHNTGAHSTSGNI---------------NGGPTGLGVGG--GASDGSGWSSENN 44

Query: 483 AHGGNATSSIAAFGGDGGAGGDAGHGGMGGDGGNGGQGASGGRGGLLSGAQGVTGTAGDG 542
GG + S I GG G G GG G G A A G G
Sbjct: 45 PWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104

Query: 543 GTGGAGGLHGA 553
+ AG L A
Sbjct: 105 VSISAGALSAA 115



Score = 30.1 bits (67), Expect = 0.031
Identities = 27/93 (29%), Positives = 33/93 (35%), Gaps = 11/93 (11%)

Query: 352 TGTDGLAGSGGGSGGAGGRGGWLIGNGGAGGAGGAGGAGGNGAIGGDGGAAGFTGSNNDS 411
+G DG + G +G G G G GGA G G G +G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIH------ 55

Query: 412 AASTLYGAHGGTGGDGGAGGTGGHGGAGGAGGA 444
+G G G GG G +GG G GG A
Sbjct: 56 -----WGGGSGHGNGGGNGNSGGGSGTGGNLSA 83


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
MMAR_1164   NUCEPIMERASE  34     2e-04    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 34.4 bits (79), Expect = 2e-04
Identities = 20/71 (28%), Positives = 32/71 (45%), Gaps = 12/71 (16%)

Query: 60 GRLGRHLAAA----GHRVVGVDV-----DPALIEA--AEQDYPGPQWLVGDLAELDLPAR 108
G +G H++ GH+VVG+D D +L +A PG Q+ DLA+ +
Sbjct: 10 GFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDLADREGMTD 69

Query: 109 GIAE-PFDVIV 118
A F+ +
Sbjct: 70 LFASGHFERVF 80


14  MMAR_1190  MMAR_1208  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
MMAR_1190   -1      14       5.267163   N-acetylglucosamine-6-phosphate deacetylase
MMAR_1191   -1      14       5.224051   sugar-transport integral membrane protein SugI
MMAR_1192   -1      14       5.892726   penicillin-binding protein DacC
MMAR_1193   -1      14       6.153252   transcriptional regulator
MMAR_1194   5       12       7.471793   hypothetical protein
MMAR_1195   7       15       7.341497   PE-PGRS family protein
MMAR_1196   4       15       5.103059   RNA polymerase sigma factor SigJ
MMAR_1197   4       17       4.820835   hypothetical protein
MMAR_1198   5       16       4.775062   hypothetical protein
MMAR_1199   5       17       4.796938   PE-PGRS family protein
MMAR_1200   0       15       0.039443   succinate dehydrogenase iron-sulfur subunit
MMAR_1201   -1      13       -0.287118  succinate dehydrogenase flavoprotein subunit
MMAR_1202   3       15       4.676178   succinate dehydrogenase (hydrophobic membrane
MMAR_1203   8       19       7.203561   succinate dehydrogenase (cytochrome B-556
MMAR_1204   6       18       6.889627   cytidine deaminase
MMAR_1205   5       17       6.422216   thymidine phosphorylase
MMAR_1206   6       17       6.288978   adenosine deaminase
MMAR_1207   6       16       5.692608   PE-PGRS family protein
MMAR_1208   1       10       3.066744   PE-PGRS family protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
MMAR_1192   BLACTAMASEA   32     0.005    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 31.7 bits (72), Expect = 0.005
Identities = 27/119 (22%), Positives = 45/119 (37%), Gaps = 13/119 (10%)

Query: 111 DLDSGAIIAARDPHARHRPASIIKVLVAM-----VSIKELNQNKAVPGTNDD--SAAEGT 163
DL SG + A R S KV++ V + + + D + +
Sbjct: 46 DLASGRTLTAWRADERFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSPVS 105

Query: 164 KVGVNAGGMYTVNQLLHGLLMHSGNDAAHALAIQLGGMQTALEKINMLAAKLGGRDTRV 222
+ + A GM TV +L + S N AA+ L +GG + ++G TR+
Sbjct: 106 EKHL-ADGM-TVGELCAAAITMSDNSAANLLLATVGG----PAGLTAFLRQIGDNVTRL 158


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
MMAR_1195   cloacin       36     8e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 35.8 bits (82), Expect = 8e-04
Identities = 36/107 (33%), Positives = 43/107 (40%), Gaps = 6/107 (5%)

Query: 722 SGGIGGPGGFAVAASGNGGQGGHAGLLWSSGGAGGAGAFSVSGPAGAGGHGGDAGWLGNG 781
SGG G ++ GG GL G + G+G S + P G GG G W G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWG-GGSGSGIHWGGGS 60

Query: 782 GAGGTGGISRNGNGGDGGAGGSGGELLGVGGAGGQGGQANAGGGTGT 828
G G G GNG GG G+GG L V G A + G G
Sbjct: 61 GHGNGG-----GNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 35.5 bits (81), Expect = 0.001
Identities = 35/104 (33%), Positives = 41/104 (39%), Gaps = 1/104 (0%)

Query: 200 LMGTGGAGGAGGAGGFSGTGDGGSGGAGGAGGLLGGGGVGGAGGFSNGGTG-GVGGTGGA 258
+ G G G GA SG +GG G G GG G G GG+G G+ GG+
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 259 GGVLSGVVGAGGGHGGTGGYGAFDGGAGGVGGHAGLLGGPGGAG 302
G G G GG GTGG + G A G GG
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 35.1 bits (80), Expect = 0.001
Identities = 39/109 (35%), Positives = 45/109 (41%), Gaps = 7/109 (6%)

Query: 537 VAGSDGAGGSGGAAGLWGTGGTGGAGARSTTGNGGAGGAGGAGGWLSGDGGTGGTGGAAT 596
++G DG G + GA T G TG G GGA GW S + GG G+
Sbjct: 1 MSGGDGRGHNTGAHS------TSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGI 54

Query: 597 FNGATGGNGGDAGNGGLFGAGGNGGAGGAGLGNPGFG-GAGGNGGAGGL 644
G G+G GNG G G GG A FG A GAGGL
Sbjct: 55 HWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGL 103



Score = 34.7 bits (79), Expect = 0.002
Identities = 31/106 (29%), Positives = 36/106 (33%), Gaps = 2/106 (1%)

Query: 262 LSGVVGAGGGHGGTGGYGAFDGGAGGVGGHAGLLGGPGGAGGDGGDTFGGGSGGAGGAGG 321
+SG G G G G +GG G+G G G G + +GGGSG GG
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGW--SSENNPWGGGSGSGIHWGG 58

Query: 322 DGGWAFGSGGTGGTGGYGGAGGAGGSGGDAGLLFSNGGAGGTGGFG 367
G G G GG G G F G GG
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 33.9 bits (77), Expect = 0.003
Identities = 30/80 (37%), Positives = 36/80 (45%), Gaps = 1/80 (1%)

Query: 754 AGGAGAFSVSGPAGAGGH-GGDAGWLGNGGAGGTGGISRNGNGGDGGAGGSGGELLGVGG 812
+GG G +G G+ G LG GG G + N GG GSG G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 813 AGGQGGQANAGGGTGTGGEG 832
G GG N+GGG+GTGG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 33.9 bits (77), Expect = 0.003
Identities = 31/116 (26%), Positives = 37/116 (31%), Gaps = 8/116 (6%)

Query: 344 AGGSGGDAGLLFSNGGAGGTGGFGETGGEGGAGGGAGWLGDGGAGGTGGAANIGSGGDGG 403
+GG G + GG G GGA G+GW + G G + I GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 404 GGGTGGTLLGNGGAGGAGGHGPTDGGAGGTGGNAVVVGNGGNGGIGGTGPTTGTTG 459
G NGG G G G GG V G G G +
Sbjct: 62 HG--------NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109



Score = 33.9 bits (77), Expect = 0.003
Identities = 32/106 (30%), Positives = 41/106 (38%), Gaps = 6/106 (5%)

Query: 377 GGAGWLGDGGAGGTGGAANIGSGGDG-GGGGTGGTLLGNGGAGGAGGHGPTDGGAGGTGG 435
GG G + GA T G N G G G GGG + G+ + GG G GG+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 436 NAVVVGNGGNGGIGGTGPTTGTTGAGGIGGLLLGLDGYNAPTSTSP 481
GNGG G G G TG + + G + P +
Sbjct: 63 -----GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGL 103



Score = 33.9 bits (77), Expect = 0.003
Identities = 34/107 (31%), Positives = 42/107 (39%), Gaps = 2/107 (1%)

Query: 217 GTGDGGSGGAGGAGGLLGGG--GVGGAGGFSNGGTGGVGGTGGAGGVLSGVVGAGGGHGG 274
G G G + GA G + GG G+G GG S+G GG SG+ GG G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 275 TGGYGAFDGGAGGVGGHAGLLGGPGGAGGDGGDTFGGGSGGAGGAGG 321
GG GG G GG+ + P G T G G + G
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 33.9 bits (77), Expect = 0.003
Identities = 24/77 (31%), Positives = 32/77 (41%)

Query: 792 NGNGGDGGAGGSGGELLGVGGAGGQGGQANAGGGTGTGGEGGDGGDAGLIGDGGNGGNAG 851
+G G + GA + G + G G GG A+ G G + GG I GG G+
Sbjct: 5 DGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGN 64

Query: 852 TDTDGTPTGDPGTGGTG 868
+G G GTGG
Sbjct: 65 GGGNGNSGGGSGTGGNL 81



Score = 33.5 bits (76), Expect = 0.004
Identities = 34/115 (29%), Positives = 47/115 (40%), Gaps = 8/115 (6%)

Query: 582 LSGDGGTGGTGGAATFNGATGGNGGDAGNGGLFGAGGNGGAGGAGLGNPGFGGAGGN--- 638
+SG G G GA + +G G G GG G + G+G + NP GG+G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGG----GASDGSGWSSENNPWGGGSGSGIHW 56

Query: 639 -GGAGGLLSGLVGAGGGHGGAGGLGVSPSTPAALAGGGIGGAGGDAGLLGGAGGA 692
GG+G G G GG G GG + + P A + G + + GA
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 33.1 bits (75), Expect = 0.004
Identities = 27/80 (33%), Positives = 31/80 (38%)

Query: 329 SGGTGGTGGYGGAGGAGGSGGDAGLLFSNGGAGGTGGFGETGGEGGAGGGAGWLGDGGAG 388
SGG G G +G G L GGA G+ G G G+G GG+G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 389 GTGGAANIGSGGDGGGGGTG 408
G N SGG G GG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 33.1 bits (75), Expect = 0.005
Identities = 31/85 (36%), Positives = 35/85 (41%), Gaps = 7/85 (8%)

Query: 775 AGWLGNGGAGGTGGISRNGNGGDGGAGGSGGELLGVG-------GAGGQGGQANAGGGTG 827
+G G G G S N NGG G G GG G G GG G + GGG+G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 828 TGGEGGDGGDAGLIGDGGNGGNAGT 852
G GG+G G G GGN
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 33.1 bits (75), Expect = 0.005
Identities = 33/106 (31%), Positives = 39/106 (36%), Gaps = 1/106 (0%)

Query: 136 TGANGAAGGWLLGDGGAGGSGAPGMAGGNGGAAGLWGTGGAGGAGGRTFNGDGGNGGAGG 195
TGA+ +G G G G G G WG GG+G GNGG G
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWG-GGSGSGIHWGGGSGHGNGGGNG 69

Query: 196 AGGWLMGTGGAGGAGGAGGFSGTGDGGSGGAGGAGGLLGGGGVGGA 241
G GTGG A A G + GAGG + G + A
Sbjct: 70 NSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 32.8 bits (74), Expect = 0.007
Identities = 34/123 (27%), Positives = 41/123 (33%), Gaps = 7/123 (5%)

Query: 528 GDGGAGGSAVAGSDGAGGSGGAAGLWGTGGTGGAGARSTTGNGGAGGAGGAGGWLSGDGG 587
G G G + A S +GG GL GG S+ N GG+G W G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 588 TGGTGGAATFNGATGGNGGDAGNGGLFGAGGNGGAGGAGLGNPGFGGAGGNGGAGGLLSG 647
G G GG G L G L PG GG + AG L +
Sbjct: 63 GNGGGNG-------NSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115

Query: 648 LVG 650
+
Sbjct: 116 IAD 118



Score = 32.0 bits (72), Expect = 0.010
Identities = 35/109 (32%), Positives = 41/109 (37%), Gaps = 8/109 (7%)

Query: 148 GDGGAGGSGAPGMAGGNGGAAGLWGTGGAGGAGGRTFNGDGGNGGAGGAGGWLMGTGGAG 207
GDG +GA +G G G GG G + + GG G+G G G G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 208 GAGGAGGFSGTGDGGSGGAGGAGGLLGGGGVGGAGGFSNGGTGGVGGTG 256
G G+G SGG G GG L A GF T G GG
Sbjct: 64 NGG--------GNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.0 bits (72), Expect = 0.013
Identities = 27/82 (32%), Positives = 31/82 (37%)

Query: 313 SGGAGGAGGDGGWAFGSGGTGGTGGYGGAGGAGGSGGDAGLLFSNGGAGGTGGFGETGGE 372
SGG G G + GG G G GGA G + GG G+G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 373 GGAGGGAGWLGDGGAGGTGGAA 394
G GGG G G G G +A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSA 83



Score = 31.6 bits (71), Expect = 0.014
Identities = 32/101 (31%), Positives = 41/101 (40%), Gaps = 6/101 (5%)

Query: 300 GAGGDGGDTFGGGSGGAGGAGGDGGWAFGSGGTGGTGGYGGAGGAGGSGGDAGLLFSNGG 359
G T G +GG G G GG + GSG + +GG G+G G G
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWG------GGSG 61

Query: 360 AGGTGGFGETGGEGGAGGGAGWLGDGGAGGTGGAANIGSGG 400
G GG G +GG G GG + A G + G+GG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 30.8 bits (69), Expect = 0.027
Identities = 21/80 (26%), Positives = 27/80 (33%)

Query: 301 AGGDGGDTFGGGSGGAGGAGGDGGWAFGSGGTGGTGGYGGAGGAGGSGGDAGLLFSNGGA 360
+GGDG G +G G GG G+ G G +G+ + G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 361 GGTGGFGETGGEGGAGGGAG 380
G GG G G GG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 30.8 bits (69), Expect = 0.028
Identities = 24/68 (35%), Positives = 29/68 (42%), Gaps = 3/68 (4%)

Query: 398 SGGDGGGGGTGGTLLGNGGAGGAGGHGPTDGGAGGTGGNAVVVGNGGNGGIGGTGPTTGT 457
SGGDG G TG GG G G G + G+G ++ N GG G+G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSE---NNPWGGGSGSGIHWGG 58

Query: 458 TGAGGIGG 465
G GG
Sbjct: 59 GSGHGNGG 66


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
MMAR_1199   cloacin       39     2e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.5 bits (89), Expect = 2e-04
Identities = 39/107 (36%), Positives = 45/107 (42%), Gaps = 6/107 (5%)

Query: 624 AGGDGMAGTSGADATVAGATGGAGGAGGGGGAGGDGGSGGTHAGNGGDGGIGGKGGAGGL 683
+GGDG +GA +T GG G G GGGA G + GG G G GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSG--IHWGGG 59

Query: 684 GGHGATGGAGKQAGANGDDGGDGGVGGAGGSG----GAGGAGGGAVA 726
GHG GG G G +G G V G GAGG AV+
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVS 106



Score = 35.1 bits (80), Expect = 0.003
Identities = 42/124 (33%), Positives = 48/124 (38%), Gaps = 4/124 (3%)

Query: 514 GGDGALGGAAGEGGAGGNGAAGPTGGDGGAGGAGGDPGVGGNGGIGGDSGNGTHAASGAT 573
GGDG G G GN GPTG G G + G N GG SG+G H G+
Sbjct: 3 GGDGR-GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 574 AGDDGTGTRGISGNGGAGGRGAAATVAG---GAGGAGGAGGDGGRIGSGGAGGAGGDGMA 630
G+ G G+G G A A A GAGG I +G A D MA
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMA 121

Query: 631 GTSG 634
G
Sbjct: 122 ALKG 125



Score = 34.3 bits (78), Expect = 0.004
Identities = 27/90 (30%), Positives = 34/90 (37%)

Query: 850 AGGDGMAGTSGADAAVAGGSGGDGSAGGSGGAGGDGGKGGAGAGNGGDGGVGGNGAAGGD 909
+GGDG +GA + +GG G GGA G GG G G + G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 910 GGDGADGKLMSVHNGAGGNGGDSGAGGAGG 939
G+G +G GGN A A G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 34.3 bits (78), Expect = 0.005
Identities = 26/81 (32%), Positives = 32/81 (39%), Gaps = 1/81 (1%)

Query: 1064 GDGGRLGNGGTGGAGGAGRVGAEGSGGLAAGDNGGAGQAGGTGGSGGAGGKGGSEFGNGG 1123
G GR N G G G G+ G + G+G + GG G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNIN-GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1124 NGGAGGAGGDGGVGGTGAGGA 1144
+G GG G GG GTG +
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLS 82



Score = 33.5 bits (76), Expect = 0.008
Identities = 28/100 (28%), Positives = 35/100 (35%)

Query: 713 GSGGAGGAGGGAVAAGYVSGGHGAGGVGGDGALGGAAGDGGAGAAGSAAVNDGAGGAGGA 772
G G G G +G ++GG GVGG + G G + GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 773 GGDPGAGGKGGIGGDSGNGTHAASGATAGDDGTGARGISG 812
G G G GG G GN + A+ G G G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 33.1 bits (75), Expect = 0.009
Identities = 39/117 (33%), Positives = 45/117 (38%), Gaps = 6/117 (5%)

Query: 983 IGGGDGSGGGTYGVDPTGGAGGSGGDSGYNGAGGAGGTGSGISFSGSTTLTAGSGNGGNG 1042
+ GGDG G T G SG +G G GG S S S G G+G
Sbjct: 1 MSGGDGRGHNT------GAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGI 54

Query: 1043 GAGADSGWSASSPNGGDGGVGGDGGRLGNGGTGGAGGAGRVGAEGSGGLAAGDNGGA 1099
G SG NG GG G GG L A G + G+GGLA + GA
Sbjct: 55 HWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 32.8 bits (74), Expect = 0.013
Identities = 31/86 (36%), Positives = 41/86 (47%), Gaps = 6/86 (6%)

Query: 1032 LTAGSGNGGNGGAGADSGWSASSPNGGDGGVGGDGGRLGNGGTGGAGGAGRVGAEGSGGL 1091
++ G G G N GA + SG + NGG G+G GG + G+G + G G+
Sbjct: 1 MSGGDGRGHNTGAHSTSG----NINGGPTGLGVGGG--ASDGSGWSSENNPWGGGSGSGI 54

Query: 1092 AAGDNGGAGQAGGTGGSGGAGGKGGS 1117
G G G GG G SGG G GG+
Sbjct: 55 HWGGGSGHGNGGGNGNSGGGSGTGGN 80



Score = 32.8 bits (74), Expect = 0.014
Identities = 25/80 (31%), Positives = 32/80 (40%)

Query: 827 GGAGGAGGAGGDGGRIGSGGAGGAGGDGMAGTSGADAAVAGGSGGDGSAGGSGGAGGDGG 886
G GA G+ +G G G +G S + GGSG GG G G GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 887 KGGAGAGNGGDGGVGGNGAA 906
G +G G+G G + A
Sbjct: 68 NGNSGGGSGTGGNLSAVAAP 87



Score = 32.4 bits (73), Expect = 0.016
Identities = 29/90 (32%), Positives = 33/90 (36%), Gaps = 5/90 (5%)

Query: 1416 GDGGNGGNGANGQTGGYIHGASGGSGGASGAGGAGGSGGADSNGTTSASGGTGAAGTWGN 1475
G G G N T G I+G G G GA G S+ GG+G+ WG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGAS----DGSGWSSENNPWGGGSGSGIHWGG 58

Query: 1476 GGNGGNGADGAGGEAGGQGGRGGDSLYGTA 1505
G GNG G GG G G S
Sbjct: 59 GSGHGNGGGN-GNSGGGSGTGGNLSAVAAP 87



Score = 32.4 bits (73), Expect = 0.016
Identities = 33/100 (33%), Positives = 40/100 (40%), Gaps = 2/100 (2%)

Query: 417 GAGGSGGSGGSGGNGGWLLGVGGAGGVGGLG--GAGGAGAAGGVGGVGGDAITPGGAGGT 474
G G G + G+ G + G GVGG G+G + GG G I GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 475 GGIGGAGGAGGDGGAGGAGGRGGARGLFGLHGSAGAGGIG 514
G GG G +GG G GG A FG + G G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 32.4 bits (73), Expect = 0.019
Identities = 36/111 (32%), Positives = 45/111 (40%), Gaps = 2/111 (1%)

Query: 782 GGIGGDSGNGTHAASGATAGDDGTGARGISGNGGAGGRGADATVAGGAGGAGGAGGDGGR 841
GG G G H+ SG G G + G+G + GG+G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 842 IGSGGAGGAGGDGMAGTSGADAAVAGGSGGDGSAGGSGGAGGDGGKGGAGA 892
GG G +GG +GT G +AVA A + GAGG AGA
Sbjct: 63 GNGGGNGNSGGG--SGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 32.4 bits (73), Expect = 0.019
Identities = 28/100 (28%), Positives = 34/100 (34%)

Query: 1378 GAGGTGGVGGAGGAAGAAPAGNAGSLGAGGVGGTGGQGGDGGNGGNGANGQTGGYIHGAS 1437
G G G GA +G G G GG G + G G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1438 GGSGGASGAGGAGGSGGADSNGTTSASGGTGAAGTWGNGG 1477
G GG +GG G+GG S + G A T G GG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 32.0 bits (72), Expect = 0.021
Identities = 35/112 (31%), Positives = 42/112 (37%), Gaps = 11/112 (9%)

Query: 1077 AGGAGRVGAEGSGGLAAGDNGGAGQAGGTGGSGGAGGKGGSEFGNGGNGGAGGAGGDGGV 1136
+GG GR +G + N G G G G + G G S N GG+G GG
Sbjct: 2 SGGDGR--GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 1137 GGTGAGGAVGNALSNDIGGGTGGDGGTGGAGGSGGAGGKGGTAAAGNAGEQG 1188
G G GG G +GG GTGG + A G A G G
Sbjct: 60 SGHGNGGG---------NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 31.6 bits (71), Expect = 0.028
Identities = 35/119 (29%), Positives = 50/119 (42%), Gaps = 7/119 (5%)

Query: 936 GAGGKGGNGGTAAAGTQGHGGNGGNGGLAGIAGDGGDGATGTYDSRLIGGGDGSGGGTYG 995
G G+G N G + +GG G G+ G DG+ + ++ GGG GSG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTG----LGVGGGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 996 VDPTGGAGGSGGDSGYNGAGGAGGTGSGISFSGSTTLTAGSGNGGNGGAGADSGWSASS 1054
G G GG+ G G GG S ++ + A S G G A + S + S+
Sbjct: 59 G---SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSA 114



Score = 31.6 bits (71), Expect = 0.031
Identities = 26/90 (28%), Positives = 33/90 (36%)

Query: 1382 TGGVGGAGGAAGAAPAGNAGSLGAGGVGGTGGQGGDGGNGGNGANGQTGGYIHGASGGSG 1441
+GG G + +GN G G G G G + N G G GGSG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1442 GASGAGGAGGSGGADSNGTTSASGGTGAAG 1471
+G G GG+ + G SA A G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 31.2 bits (70), Expect = 0.035
Identities = 26/81 (32%), Positives = 32/81 (39%), Gaps = 2/81 (2%)

Query: 831 GAGGAGGDGGRIGSGGAGGAGGDGMAGTSGADAAVAGGSGGDGSAGGSGGAGGDGGKGGA 890
G G G + G + G G G+ GA S + GGSG GG G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG--GS 60

Query: 891 GAGNGGDGGVGGNGAAGGDGG 911
G GNGG G G G+ G
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNL 81



Score = 31.2 bits (70), Expect = 0.037
Identities = 29/109 (26%), Positives = 38/109 (34%)

Query: 555 NGGIGGDSGNGTHAASGATAGDDGTGTRGISGNGGAGGRGAAATVAGGAGGAGGAGGDGG 614
+GG G G H+ SG G G + G+G GG+G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 615 RIGSGGAGGAGGDGMAGTSGADATVAGATGGAGGAGGGGGAGGDGGSGG 663
GG G +GG G + + A G + G G S G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 31.2 bits (70), Expect = 0.037
Identities = 29/80 (36%), Positives = 33/80 (41%), Gaps = 3/80 (3%)

Query: 828 GAGGAGGAGGDGGRIGSGGAGGAGGDGMAGTSGADAAVAGGSGGDGSAGGSGGAGGDGGK 887
G G GA G I G G G G + G+ + G GS G GG G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGG--ASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 888 GGAGAGNGGDG-GVGGNGAA 906
G G GN G G G GGN +A
Sbjct: 64 NGGGNGNSGGGSGTGGNLSA 83



Score = 30.8 bits (69), Expect = 0.048
Identities = 30/110 (27%), Positives = 33/110 (30%)

Query: 1348 TGGSGGTGGVGGRGTDGVETGSTGLFGTKGGAGGTGGVGGAGGAAGAAPAGNAGSLGAGG 1407
+GG G G T G G G GGA G G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1408 VGGTGGQGGDGGNGGNGANGQTGGYIHGASGGSGGASGAGGAGGSGGADS 1457
G GG G GG G G N + GAGG S A +
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 30.8 bits (69), Expect = 0.048
Identities = 31/118 (26%), Positives = 43/118 (36%), Gaps = 8/118 (6%)

Query: 584 ISGNGGAGGRGAAATVAGGAGGAGGAGGDGGRIGSGGAGGAGGDGMAGTSGADATVAGAT 643
+SG G G A + +G G G GG G + G G + + +G+
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGG-------GASDGSGWSSENNPWGGGSGSG 53

Query: 644 GGAGGAGGGGGAGGDGGSGGTHAGNGGDGGIGGKGGAGGLGGHGATGGAGKQAGANGD 701
GG G G GG+G SGG +G GG+ A G G G +
Sbjct: 54 IHWGGGSGHGNGGGNGNSGG-GSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
MMAR_1207   cloacin       37     3e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 37.4 bits (86), Expect = 3e-04
Identities = 26/72 (36%), Positives = 32/72 (44%)

Query: 170 GIGGAGGSGGAGGTGGWLYGNGGAGGAGGAGAPGPVSFNGNNGGNGGNGGAAGWWGTGGA 229
G G G + GA T G + G G GG + G + NN GG+G W G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 230 GGAGGEGGAAGG 241
G GG G + GG
Sbjct: 63 GNGGGNGNSGGG 74



Score = 37.4 bits (86), Expect = 3e-04
Identities = 34/102 (33%), Positives = 39/102 (38%), Gaps = 3/102 (2%)

Query: 643 GRGGIGGQGGTGGEAGGGTGIHAGNGGTGGNGGGGGWLSGDAGAGGQGGASVASNYIAGG 702
GRG G T G GG G G GG G GW S + GG G+ + +G
Sbjct: 6 GRGHNTGAHSTSGNINGGPT---GLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 703 GGAGGNGGAAGLFGAGGAGGTGGTGGNGNAPAGGDAGHGGQG 744
G GGNG + G G GG PA G GG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 35.5 bits (81), Expect = 0.001
Identities = 35/89 (39%), Positives = 41/89 (46%), Gaps = 6/89 (6%)

Query: 607 GDGGTGGTGGDAVAGLPGVNGGNGGNGGAGGAAGWWGRGGIGGQGGTGGEAGGGTGIHAG 666
GDG TG + +G +NGG G G GGA+ G G G G+GIH G
Sbjct: 4 GDGRGHNTGAHSTSG--NINGGPTGLGVGGGASDGSGWSSENNPWG----GGSGSGIHWG 57

Query: 667 NGGTGGNGGGGGWLSGDAGAGGQGGASVA 695
G GNGGG G G +G GG A A
Sbjct: 58 GGSGHGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 35.1 bits (80), Expect = 0.001
Identities = 29/92 (31%), Positives = 38/92 (41%), Gaps = 3/92 (3%)

Query: 822 GNGGAGGQGGVAGTPDGVDGLLGGTGGAGGTGGTAGWLYGSGGSGGAGGAGTASFYPGST 881
G G G G T ++G G G GG +GW + GG G+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 882 GGNGGNGGNGGAAQVIGSGGSGGAAGTGGAGG 913
G GGNG +GG + G+GG+ A A G
Sbjct: 63 GNGGGNGNSGGGS---GTGGNLSAVAAPVAFG 91



Score = 35.1 bits (80), Expect = 0.002
Identities = 35/107 (32%), Positives = 40/107 (37%), Gaps = 16/107 (14%)

Query: 680 LSGDAGAGGQGGASVASNYIAGGGGAGGNGGAAGLFGAGGAGGTGGTGGNGNAPAGGDAG 739
+SG G G GA S I GG G GG G + N P GG +G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGL---------GVGGGASDGSGWSSENNPWGGGSG 51

Query: 740 HGGQGGYGGWLAGSGGAGGTGGVGGLGDSASGTGGSGGGGGAARLFG 786
G G G GG G +GG SGTGG+ A FG
Sbjct: 52 SGIHWGGGSGHGNGGGNGNSGG-------GSGTGGNLSAVAAPVAFG 91



Score = 34.7 bits (79), Expect = 0.002
Identities = 41/117 (35%), Positives = 47/117 (40%), Gaps = 10/117 (8%)

Query: 436 GFGGSGGNGGTGGIGGLIGNGGAGGVGGAGNTNQVSGYSG-NGGNGGNGGAAQLIGAGGG 494
G G G N G G I NGG G+G G + SG+S N GG G+ G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNI-NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 495 GGEGGVGGTGAAGVTGSAGASGVGGRLYSGNGIPDIFGRPLIGDGADGAPGTGQAGG 551
G GG G S G SG GG L S P FG P + G + G
Sbjct: 62 HGNGGGNG-------NSGGGSGTGGNL-SAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 34.3 bits (78), Expect = 0.003
Identities = 30/78 (38%), Positives = 33/78 (42%), Gaps = 1/78 (1%)

Query: 382 GDGAAGGAGGAGG-ASQTFTGDGGAGGTGGAGGWLYGNGGAGGSGGAGGAGIGTGGFGGS 440
G G GA G + TG G GG GW N GG G+G G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 441 GGNGGTGGIGGLIGNGGA 458
GGNG +GG G GN A
Sbjct: 66 GGNGNSGGGSGTGGNLSA 83



Score = 33.1 bits (75), Expect = 0.005
Identities = 29/79 (36%), Positives = 35/79 (44%)

Query: 154 GGAGGAAGLIGNGGAGGIGGAGGSGGAGGTGGWLYGNGGAGGAGGAGAPGPVSFNGNNGG 213
G GA GN G G G G + G+G N GG+G G S +GN GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 214 NGGNGGAAGWWGTGGAGGA 232
NG +GG +G G A A
Sbjct: 68 NGNSGGGSGTGGNLSAVAA 86



Score = 33.1 bits (75), Expect = 0.005
Identities = 38/115 (33%), Positives = 43/115 (37%), Gaps = 3/115 (2%)

Query: 403 GGAGGTGGAGGWLYGNGGAGGSGGAGGAGIGTGGFGGSGGNGGTGGIGGL-IGNGGAGGV 461
GG G G GG G G G + G G S N GG G I GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 462 GGAGNTNQVSGYSGNGGNGGNGGAAQLIG--AGGGGGEGGVGGTGAAGVTGSAGA 514
G G G SG GGN A G A G GG+ + +AG +A A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIA 117



Score = 33.1 bits (75), Expect = 0.005
Identities = 23/64 (35%), Positives = 27/64 (42%)

Query: 122 ADGTAPGQAGGAGGLLYGNGGNGAAGTNPGVAGGAGGAAGLIGNGGAGGIGGAGGSGGAG 181
G G G G + G+G + N GG+G G G G GG G SGG
Sbjct: 16 TSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGS 75

Query: 182 GTGG 185
GTGG
Sbjct: 76 GTGG 79



Score = 33.1 bits (75), Expect = 0.006
Identities = 31/116 (26%), Positives = 37/116 (31%)

Query: 719 GAGGTGGTGGNGNAPAGGDAGHGGQGGYGGWLAGSGGAGGTGGVGGLGDSASGTGGSGGG 778
G G G G + + G G G GG GSG + GG S GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 779 GGAARLFGDGGSGGNGGVGGPGTVVFAGGGGSGGTGGAAGWLIGNGGAGGQGGVAG 834
G GG G GG A G + T GA G + +A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 32.0 bits (72), Expect = 0.014
Identities = 29/79 (36%), Positives = 35/79 (44%), Gaps = 3/79 (3%)

Query: 384 GAAGGAGGAGGASQTFTGDGGAGGTGGAGGWLYGNGGAGGS---GGAGGAGIGTGGFGGS 440
G G G S + +GG G G GG G+G + + GG G+GI GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 441 GGNGGTGGIGGLIGNGGAG 459
G GG G GG G GG
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 30.5 bits (68), Expect = 0.033
Identities = 27/84 (32%), Positives = 33/84 (39%), Gaps = 2/84 (2%)

Query: 140 NGGNGAAGTNPGVAGGAGGAAGLIGNGGAGG--IGGAGGSGGAGGTGGWLYGNGGAGGAG 197
NGG G G + G+G ++ GG G I GGSG G G G G G
Sbjct: 21 NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGN 80

Query: 198 GAGAPGPVSFNGNNGGNGGNGGAA 221
+ PV+F G GG A
Sbjct: 81 LSAVAAPVAFGFPALSTPGAGGLA 104


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_1208  cloacin  39     7e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 39.3 bits (91), Expect = 7e-05
Identities = 33/82 (40%), Positives = 37/82 (45%), Gaps = 1/82 (1%)

Query: 440 GGNGGTGGIGGLIGNGGAGGAGGAGSASGPSGYSGGNGGNGGNGGAAQLIGAGGGGGVAG 499
G N G G I NGG G G G AS SG+S N GG G+ G G G G G
Sbjct: 8 GHNTGAHSTSGNI-NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGG 66

Query: 500 VGGAGGTGAAAGVTGSAGASGV 521
G G G+ G SA A+ V
Sbjct: 67 GNGNSGGGSGTGGNLSAVAAPV 88



Score = 39.3 bits (91), Expect = 8e-05
Identities = 43/132 (32%), Positives = 48/132 (36%), Gaps = 9/132 (6%)

Query: 767 GGSGGAGGTGGTGGSGDIYAGAGGTGGSGGAARLFGTGGTGGAGGVGGASDFQFAGSGGS 826
GG G TG SG+I G G G GGA+ G G G S + G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 827 GGTGGAAGWLIGNGGSGGQGGLAGTADPVTGLFGGTGGAGGAGGAAGWLYGAGGSGGAGG 886
G GG GGSG G L+ A PV F G G A S GA
Sbjct: 63 GNGGGNGN---SGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSI------SAGALS 113

Query: 887 AGTASVYPGLSG 898
A A + L G
Sbjct: 114 AAIADIMAALKG 125



Score = 37.8 bits (87), Expect = 2e-04
Identities = 36/124 (29%), Positives = 48/124 (38%), Gaps = 11/124 (8%)

Query: 400 GAGGAGGTGGAGGWLYGNGGAGGAGGVGGAGIGTGGLGSNGGNGGTGGIGGLIGNGGAGG 459
G G G G + NGG G G GGA G+G N G G G+G
Sbjct: 6 GRGHNTGAHSTSGNI--NGGPTGLGVGGGASDGSGWSSENNPWG---------GGSGSGI 54

Query: 460 AGGAGSASGPSGYSGGNGGNGGNGGAAQLIGAGGGGGVAGVGGAGGTGAAAGVTGSAGAS 519
G GS G G +G +GG G GG + A G + G G A ++ A ++
Sbjct: 55 HWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSA 114

Query: 520 GVGG 523
+
Sbjct: 115 AIAD 118



Score = 36.6 bits (84), Expect = 5e-04
Identities = 30/101 (29%), Positives = 38/101 (37%)

Query: 713 GGSTGGSGGAGGNGGTAGLFGAGGAGGTGGTAGAGIAVGSNGGHGGQGGYGGWLGGSGGA 772
G G + GA G G G G + G+G + +N GG G W GGSG
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 773 GGTGGTGGSGDIYAGAGGTGGSGGAARLFGTGGTGGAGGVG 813
G G G G + + A F T GAGG+
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 35.1 bits (80), Expect = 0.002
Identities = 26/85 (30%), Positives = 33/85 (38%)

Query: 838 GNGGSGGQGGLAGTADPVTGLFGGTGGAGGAGGAAGWLYGAGGSGGAGGAGTASVYPGLS 897
G G G G T+ + G G G GGA +GW GG G+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 898 GGNGGNGGNGGAAQVIGNGGSGGTA 922
G GGNG +GG + GN +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAP 87



Score = 34.7 bits (79), Expect = 0.002
Identities = 35/106 (33%), Positives = 39/106 (36%), Gaps = 9/106 (8%)

Query: 568 GSGAAGQAGGAGGAAGLIGNGGAGGTGGSGAAGGNGGAGGWLFGNGGNGGTGGDAVAGLP 627
G G GA +G I G G G GA+ G+G W N GG G +
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSG----WSSENNPWGGGSGSGIHWGG 58

Query: 628 GLNGGNGGNGGAGGAAGWWGHGGIGGQGGTGGAAGGFANTAPSTNG 673
G GNGG G G G G GG A F A ST G
Sbjct: 59 GSGHGNGGGNGNSG-----GGSGTGGNLSAVAAPVAFGFPALSTPG 99



Score = 33.9 bits (77), Expect = 0.004
Identities = 31/81 (38%), Positives = 34/81 (41%), Gaps = 1/81 (1%)

Query: 381 GNGAAGGAGGAGG-ASQTFTGAGGAGGTGGAGGWLYGNGGAGGAGGVGGAGIGTGGLGSN 439
G G GA G + TG G GG GW N GG G G G G G+
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 440 GGNGGTGGIGGLIGNGGAGGA 460
GGNG +GG G GN A A
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAA 86



Score = 33.5 bits (76), Expect = 0.005
Identities = 38/116 (32%), Positives = 43/116 (37%), Gaps = 6/116 (5%)

Query: 213 GNGGNGGAAGWWGT--GGAGGAGGEGGAAGGDPAGIAYGSTGGVGGNGGSGGNGGWFAGN 270
G G N GA G GG G G GGA+ G GG G+G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG---- 61

Query: 271 AGAGGNGGVGGDGNAADSGLVGGAGGVGGAGGAAGLFGDGGAGGQGGDGAVSDALA 326
G GG G G G+ L A V A G GG GA+S A+A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIA 117



Score = 33.1 bits (75), Expect = 0.005
Identities = 23/64 (35%), Positives = 27/64 (42%)

Query: 122 ADGTAPGQAGGAGGLLYGNGGNGAAGTNPGVAGGAGGAAGLIGNGGAGGIGGAGGSGGAG 181
G G G G + G+G + N GG+G G G G GG G SGG
Sbjct: 16 TSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGS 75

Query: 182 GTGG 185
GTGG
Sbjct: 76 GTGG 79



Score = 32.4 bits (73), Expect = 0.009
Identities = 35/110 (31%), Positives = 42/110 (38%), Gaps = 3/110 (2%)

Query: 308 GDGGAGGQGGDGAVSDALAASHGGTGGDGGRS---GWLSGNAGAGGAGGAGGSTNGIGNF 364
G G G G + S + G G GG S GW S N GG G+G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 365 AGNGGAGGNGGAAGLFGNGAAGGAGGAGGASQTFTGAGGAGGTGGAGGWL 414
GG G +GG +G GN +A A A G T G + G L
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGAL 112



Score = 31.6 bits (71), Expect = 0.016
Identities = 37/110 (33%), Positives = 43/110 (39%), Gaps = 8/110 (7%)

Query: 721 GAGGNGGTAGLFGAGGAGGTGGTAGAGIAVGSNGGHGGQGGYGGWLGGSGGAGGTGGTGG 780
G G G G G GG G G+ G++ G G W GGSG GG G
Sbjct: 3 GGDGRGHNTGAHSTSG-NINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 781 SGDIYAGAGGTGGSGGAARLFGTGGTGGAGGVGGASDFQFAGSGGSGGTG 830
G GG G SGG + GTGG A A F + G+GG
Sbjct: 62 HG----NGGGNGNSGGGS---GTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 31.6 bits (71), Expect = 0.017
Identities = 33/107 (30%), Positives = 38/107 (35%), Gaps = 4/107 (3%)

Query: 354 AGGSTNGIGNFAGNGGAGGNGGAAGLFGNGAAGGAGGAGGASQTFTGAGGAGGTGGAGGW 413
+GG G A + NGG GL G A G + + G G+G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 414 LYGNGGAGGAGGVGGAGIGTGGLGSNGGNGGTGGIGGLIGNGGAGGA 460
G GG G G G GTGG S G L G G A
Sbjct: 62 ----HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 31.2 bits (70), Expect = 0.022
Identities = 30/87 (34%), Positives = 38/87 (43%), Gaps = 3/87 (3%)

Query: 425 GVGGAGIGTGGLGSNGG-NGGTGGIGGLIGNGGAGGAGGAGSASGPSGYSGGNGGNGGNG 483
G G G TG ++G NGG G+G G G + G+G + + G SG GG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGV--GGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 484 GAAQLIGAGGGGGVAGVGGAGGTGAAA 510
G G G GG +G GG AA
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAP 87



Score = 31.2 bits (70), Expect = 0.024
Identities = 28/91 (30%), Positives = 35/91 (38%), Gaps = 2/91 (2%)

Query: 554 GGDGGWLYGNGGNGGSGAAGQAGGAGGAAGLIGNGGAGGTGGSGAAGGNGGAGGWLFGNG 613
GG G G G + GSG + + GG +G GG G G GGNG +GG G
Sbjct: 22 GGPTGLGVGGGASDGSGWSSENNPWGGGSG--SGIHWGGGSGHGNGGGNGNSGGGSGTGG 79

Query: 614 GNGGTGGDAVAGLPGLNGGNGGNGGAGGAAG 644
G P L+ G +AG
Sbjct: 80 NLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 31.2 bits (70), Expect = 0.025
Identities = 36/125 (28%), Positives = 43/125 (34%), Gaps = 1/125 (0%)

Query: 736 GAGGTGGTAGAGIAVGS-NGGHGGQGGYGGWLGGSGGAGGTGGTGGSGDIYAGAGGTGGS 794
G G G GA G+ NGG G G GG GSG + GG GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 795 GGAARLFGTGGTGGAGGVGGASDFQFAGSGGSGGTGGAAGWLIGNGGSGGQGGLAGTADP 854
G +GG G GG A A + T GA G + +A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMAA 122

Query: 855 VTGLF 859
+ G F
Sbjct: 123 LKGPF 127


15  MMAR_1246  MMAR_1258  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_1246  2       9        1.107564   hypothetical protein
MMAR_1247  2       9        1.247111   anti-sigma factor RsbW
MMAR_1248  2       9        1.379420   RNA polymerase sigma factor SigF
MMAR_1249  2       10       1.568257   transcriptional regulatory protein
MMAR_1250  1       10       1.608308   peptide synthetase Nrp (peptide synthase)
MMAR_1251  2       25       1.585303   bifunctional protein acetyl-/propionyl-coenzyme
MMAR_1252  1       18       1.597348   Fe-S metabolism associated protein, SufE
MMAR_1253  2       17       0.919668   thiosulfate sulfurtransferase SseA
MMAR_1254  2       16       0.178732   Maf-like protein
MMAR_1255  2       15       0.707068   hypothetical protein
MMAR_1256  2       16       0.758750   propionyl-CoA carboxylase beta chain 5 AccD5
MMAR_1257  1       15       0.713608   bifunctional protein BirA
MMAR_1258  2       15       0.798449   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_1249  HTHTETR  62     5e-14    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 61.6 bits (149), Expect = 5e-14
Identities = 28/139 (20%), Positives = 53/139 (38%), Gaps = 1/139 (0%)

Query: 1 MSERSQYHHGGLRDAILTEAATLVAERGVGAVSVRELARRAGVSHNAYGHHFTDRRGLFT 60
M+ +++ R IL A L +++GV + S+ E+A+ AGV+ A HF D+ LF+
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 61 ALAAQGFMLLAEALREARGNFLDGARAYVRFAIGHPGHYAV-MFDWSLIDRTDSDLAAAH 119
+ + E E + F + +R + H V L+
Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120

Query: 120 AAASMELARGASLLQDPKA 138
++ +L +
Sbjct: 121 GEMAVVQQAQRNLCLESYD 139


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_1250  ISCHRISMTASE  35     0.004    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 35.0 bits (80), Expect = 0.004
Identities = 20/79 (25%), Positives = 35/79 (44%), Gaps = 2/79 (2%)

Query: 2518 ATPSETTDSTRNLAASTVDQIRQVFAEVLGLP--SVGADDNFFDIGGDSMHAVQLSAKAR 2575
A +T+ +T T + IR+ AE+L + ++ D G DS+ + L + R
Sbjct: 215 ADVQKTSANTGKKNVFTCENIRKQIAELLQETPEDITDQEDLLDRGLDSVRIMTLVEQWR 274

Query: 2576 ALGLAVEVQDLFQCQTPEQ 2594
G V +L + T E+
Sbjct: 275 REGAEVTFVELAERPTIEE 293


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_1255  BCTERIALGSPG  27     0.012    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 26.8 bits (59), Expect = 0.012
Identities = 11/43 (25%), Positives = 14/43 (32%), Gaps = 8/43 (18%)

Query: 51 TEQELAALVAVLGSLRSTAPAAAPEPSRWGLPVDRLRYPVFSW 93
T Q L +LV AP P + + R P W
Sbjct: 69 TNQGLESLV--------EAPTLPPLAANYNKEGYIKRLPADPW 103


16  MMAR_1381  MMAR_1391  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_1381  0       12       -3.295867  citrate synthase
MMAR_1382  1       12       -1.133079  hypothetical protein
MMAR_1383  2       11       -0.949254  hypothetical protein
MMAR_1384  1       9        -0.058044  hypothetical protein
MMAR_1385  1       11       0.536533   hypothetical protein
MMAR_1386  3       13       4.788494   hypothetical protein
MMAR_1387  3       13       4.442226   monoamine oxidase
MMAR_1388  2       12       3.719113   non-heme haloperoxidase Hpx
MMAR_1389  2       13       3.844966   flavin-containing monoamine oxidase AofH
MMAR_1390  3       13       4.372112   oxidoreductase
MMAR_1391  3       12       4.591059   PE-PGRS family protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
MMAR_1391  FLAGELLIN  40     3e-05    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 40.4 bits (94), Expect = 3e-05
Identities = 34/272 (12%), Positives = 51/272 (18%), Gaps = 5/272 (1%)

Query: 202 NAGAGGAGGPGLFGFNGGAGGAGGAGGLLGAG-GLGGAGGYGPGGVGGTGGAGGAGGLLA 260
+ GL GFN G L + + G Y G +
Sbjct: 158 DLQKIDVKSLGLDGFNVNGPKEATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTD 217

Query: 261 GLVGAGGGHGGTGGFGAGGTGGDGGAGGNAGLFGGPGGAGGTGGVGTGGDGGNGGAGGNA 320
T D LF GT GG G+
Sbjct: 218 TTAPTVPDKVYVNAANGQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDT 277

Query: 321 GALFGTGGAGGAGGSGVAGAGGVGGAGGNAGLLFSAGGVGGAGGYGSSDGGAGGAGGNGG 380
G +G G G V + + + +
Sbjct: 278 FDYKGVTFTIDT-KTGNDGNGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTS 336

Query: 381 LLYSNGGVGGTGGYGAAAAGGAGGAGGRAGLAIGGGGAGGAGGEGATTGGDGGAGGTGVL 440
G + + A GD +
Sbjct: 337 ---VVNGQFTFDDKTKNESAKLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTM 393

Query: 441 IGNGGNAGVGGTGPAAGATGVGGTSGLLLGLD 472
+ +GV A T+ L +D
Sbjct: 394 FIDKTASGVSTLINEDAAAAKKSTANPLASID 425


17  MMAR_1419  MMAR_1442  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_1419  2       17       -1.751150  hypothetical protein
MMAR_1420  1       16       -2.780970  carbonic anhydrase, CynT
MMAR_1421  1       21       -3.011698  transposase for insertion sequence
MMAR_1422  1       20       -2.589484  hypothetical protein
MMAR_1423  -1      21       -2.370561  anchored-membrane serine/threonine-protein
MMAR_1424  2       27       -2.872882  hypothetical protein
MMAR_1425  2       23       -1.957043  transposase for insertion sequence
MMAR_1426  -2      14       0.303496   hypothetical protein
MMAR_1427  -1      13       0.249762   hypothetical protein
MMAR_1428  -2      14       -0.082148  hypothetical protein
MMAR_1429  -3      11       0.235541   hypothetical protein
MMAR_1430  -1      12       0.084943   hypothetical protein
MMAR_1431  -1      11       0.023092   cation transport ATPase, ZntA
MMAR_1432  -1      14       -1.362730  hypothetical protein
MMAR_1433  3       15       4.939333   site-specific integrase
MMAR_1438  2       15       4.516143   hypothetical protein
MMAR_1439  4       16       4.882192   hypothetical protein
MMAR_1440  4       14       4.921725   ISL3 family transposase
MMAR_1441  5       16       5.217767   hypothetical protein
MMAR_1442  2       14       4.635142   PE-PGRS family protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
MMAR_1423  PERTACTIN  32     0.009    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 32.0 bits (72), Expect = 0.009
Identities = 18/72 (25%), Positives = 25/72 (34%)

Query: 280 WGVGATAAGPAPQPTYAPNPQETQVAFNGYAPVRPQYPPPPTPPPAGKRPGRRPVVVVAA 339
W + A PAP+P P PQ P +P PP P + P +
Sbjct: 560 WSLVGAKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAGRELS 619

Query: 340 VVATALLIGSGI 351
A A + G+
Sbjct: 620 AAANAAVNTGGV 631


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_1441  60KDINNERMP  28     0.016    60kDa inner membrane protein signature.
		>60KDINNERMP#60kDa inner membrane protein signature.

Length = 548

Score = 27.6 bits (61), Expect = 0.016
Identities = 8/31 (25%), Positives = 13/31 (41%)

Query: 33 NGGAVETSSDVWELYPFFDTSDKKRLKRTCN 63
G A T + +E Y F +D + L +
Sbjct: 219 RGAAYSTPDEKYEKYKFDTIADNENLNISSK 249


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_1442  cloacin  36     0.001    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 36.2 bits (83), Expect = 0.001
Identities = 29/77 (37%), Positives = 33/77 (42%), Gaps = 1/77 (1%)

Query: 1344 LSGANSAGARGGAGGAGGAGITGGAGGAGGAGSNGDGTSNQVEGQPGGDGGSGGIGGTGT 1403
+SG + G GA G I GG G G G DG+ E P G G GI G
Sbjct: 1 MSGGDGRGHNTGAHSTSG-NINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 1404 AGAGGTGGAGGDGGAGG 1420
+G G GG G GG G
Sbjct: 60 SGHGNGGGNGNSGGGSG 76



Score = 36.2 bits (83), Expect = 0.001
Identities = 29/100 (29%), Positives = 33/100 (33%)

Query: 573 GASGGGGQAGGSGGSGGAGGAGGALAGTGGAGGEGGTGGDGGTGGNGAGGAPGAAAGAAG 632
G G G G SG G L GGA G + G G+G G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 633 GNGGNGGVGGSGGIGGNGGAAGVALAGSGHDGAQGAGGAG 672
GNGG G G G G +A A G G G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 34.7 bits (79), Expect = 0.003
Identities = 35/109 (32%), Positives = 42/109 (38%)

Query: 426 GAGGVGGTGGLISFLGGHGTGGAGGEGGSGGIAGDGGKGAAGTFGGGDGVGGAGGRGGDP 485
G G G G S G G G G G G G +GGG G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 486 GLGGAGGAGGTGSTIGAHGADGARPNSGGNGGAGGQGADALGPAFTSGA 534
G GG G G GS G + + A P + G GA L + ++GA
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 34.7 bits (79), Expect = 0.003
Identities = 34/103 (33%), Positives = 42/103 (40%), Gaps = 1/103 (0%)

Query: 1308 GGAGDGSSGGAGGRGGDGGAGITGAGGQGGAGGDGGLSGANSAGARGGAGGAGGAGITGG 1367
GG G G + GA G+ G TG G GGA G S N+ GG+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWG-GGSGSGIHWGGGSG 61

Query: 1368 AGGAGGAGSNGDGTSNQVEGQPGGDGGSGGIGGTGTAGAGGTG 1410
G GG G++G G+ + G T GAGG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 34.3 bits (78), Expect = 0.004
Identities = 35/106 (33%), Positives = 41/106 (38%), Gaps = 4/106 (3%)

Query: 286 GGAGEGSGNGGVGGEGGRGGQWFGHGGGGGAGGAGGADAADGGHGGAGGAARLWGTGGHG 345
GG G G G G G G G GGGA G + + GG G+ WG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 346 GSGGAGGVGALGGAGESGGAGGAAGDGGAGGRGGWLIGTGGAGGLG 391
G+GG G G SG G + G + T GAGGL
Sbjct: 63 GNGGGNG----NSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 34.3 bits (78), Expect = 0.004
Identities = 29/82 (35%), Positives = 33/82 (40%)

Query: 1195 GDGGAGGNGGDAIGFGSGNGGLGGGGGAGGTGANGGTGGHGGVGGFGDIGGKGGSGGTGG 1254
G G G N G G+ NGG G G GG G G G G GG+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1255 TGLTGAGGAGGAGGTGGQADSV 1276
G G +GG GTGG +V
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAV 84



Score = 33.9 bits (77), Expect = 0.005
Identities = 30/98 (30%), Positives = 34/98 (34%), Gaps = 3/98 (3%)

Query: 536 GGTGGDGGAGGLVGDGGNGGAGGRGATGGVGASATAPGASGGGGQAGGSGGSGGAGGAGG 595
G G G + G G G GA+ G G S+ GG G GG G G GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 596 ALAGTGGAGGEGGTGGDGGTGGNGAGGAPGAAAGAAGG 633
GG G G A G P + AGG
Sbjct: 68 ---NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 33.9 bits (77), Expect = 0.005
Identities = 30/96 (31%), Positives = 37/96 (38%), Gaps = 7/96 (7%)

Query: 705 GAGGRGGDPGLGGAGGAGGAGSTTGAPGADGTRPTTGGNGGEGGRGADAVGAGGSGAAGG 764
G GRG + G G G T G G + G G + GGSG+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGG-------GASDGSGWSSENNPWGGGSGSGIH 55

Query: 765 AGGDGGLVGDGGHGGDGGHGATGAAGASAVAPGASG 800
GG G GG+G GG TG ++ AP A G
Sbjct: 56 WGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 33.9 bits (77), Expect = 0.005
Identities = 24/83 (28%), Positives = 28/83 (33%)

Query: 549 GDGGNGGAGGRGATGGVGASATAPGASGGGGQAGGSGGSGGAGGAGGALAGTGGAGGEGG 608
GDG G +G + T G GG G G G G G G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 609 TGGDGGTGGNGAGGAPGAAAGAA 631
GG G G G+G +A AA
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAA 86



Score = 33.5 bits (76), Expect = 0.007
Identities = 29/92 (31%), Positives = 33/92 (35%), Gaps = 8/92 (8%)

Query: 899 GNGGRGEVGGLPGNGGDGGNGALGGGAGGNGGNGGNPGDSGTGGAGGTGSTTGMNGVSNS 958
G GRG G G+ G G G GG +G GG+GS G S
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 959 RIVVGGLWGNGGHGGTGGTGSAAGGPGGSGGA 990
GNGG G G GS GG + A
Sbjct: 63 --------GNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 33.1 bits (75), Expect = 0.009
Identities = 26/66 (39%), Positives = 32/66 (48%)

Query: 1188 TGSTPDGGDGGAGGNGGDAIGFGSGNGGLGGGGGAGGTGANGGTGGHGGVGGFGDIGGKG 1247
T +GG G G GG + G G + GGG+G GG GHG GG G+ GG
Sbjct: 16 TSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGS 75

Query: 1248 GSGGTG 1253
G+GG
Sbjct: 76 GTGGNL 81



Score = 32.8 bits (74), Expect = 0.013
Identities = 33/101 (32%), Positives = 39/101 (38%), Gaps = 2/101 (1%)

Query: 1033 AGGDGGAGGVGGTGGQGGTQAGNGGVGGAGGAG-GKGADGGNGANGD-SGNGVGSDGFAG 1090
+GGDG G G G G+G GGA G G N G SG+G+ G +G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1091 GNGGAGGSGGTGGDGGAGGLALADTGQDGAQGAGGDGGAGG 1131
G G GG G G L+ A GAGG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 32.4 bits (73), Expect = 0.015
Identities = 27/81 (33%), Positives = 32/81 (39%)

Query: 755 GAGGSGAAGGAGGDGGLVGDGGHGGDGGHGATGAAGASAVAPGASGGNGQTGGSGGAGGA 814
G G G GA G + G G G GA+ +G S+ GG+G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 815 GGAGGTLAGHGGDGGAGGNGA 835
G GG GG G G A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSA 83



Score = 32.4 bits (73), Expect = 0.015
Identities = 38/116 (32%), Positives = 47/116 (40%), Gaps = 9/116 (7%)

Query: 615 TGGNGAGGAPGAAAGAAGGNGGNGGVGGSGGIG-GNGGAAGVALAGSGHDGAQGAGGAGG 673
+GG+G G GA + + NGG G+G GG G+G ++ G G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 674 SGGMGGVAGDGGKGAAGAFAGGGGGGNDGVGGAGGRGGDPGLGGAGGAGGAGSTTG 729
G GG GG G G GGN A G P L G G A S +
Sbjct: 62 HGNGGGNGNSGG--------GSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109



Score = 32.4 bits (73), Expect = 0.015
Identities = 22/61 (36%), Positives = 24/61 (39%)

Query: 968 NGGHGGTGGTGSAAGGPGGSGGAGGTGGAGGHGGLWGNGGDGGTGGQGADGGAGISASAQ 1027
NGG G G G A+ G G S GG G G WG G G GG + G G
Sbjct: 21 NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGN 80

Query: 1028 G 1028

Sbjct: 81 L 81



Score = 32.4 bits (73), Expect = 0.015
Identities = 31/102 (30%), Positives = 38/102 (37%)

Query: 677 MGGVAGDGGKGAAGAFAGGGGGGNDGVGGAGGRGGDPGLGGAGGAGGAGSTTGAPGADGT 736
M G G G A + +G GG G+G GG G G GS +G G+
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 737 RPTTGGNGGEGGRGADAVGAGGSGAAGGAGGDGGLVGDGGHG 778
GG G G G+ G + AA A G L G G
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 32.4 bits (73), Expect = 0.016
Identities = 37/103 (35%), Positives = 45/103 (43%), Gaps = 4/103 (3%)

Query: 799 SGGNGQTGGSGGAGGAGGAGGTLAGHGGDGGAGGNGANGGIGANGAHGTLGIAAGADGST 858
SGG+G+ +G +G G G G GGA + +G N G + G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGA--SDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 859 GGNGGVGGNGGVGGNGGNGGNGG--AAGVALGSGQDGAEGAGG 899
G+G GGNG GG G GGN AA VA G GAGG
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 32.0 bits (72), Expect = 0.020
Identities = 34/106 (32%), Positives = 38/106 (35%), Gaps = 4/106 (3%)

Query: 602 GAGGEGGTGGDGGTGGNGAGGAPGAAAGAAGGNGGNGGVGGSGGIGGNGGAAGVALAGSG 661
G G G G T GN GG G G G + G G S GG +G + G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGP----TGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 662 HDGAQGAGGAGGSGGMGGVAGDGGKGAAGAFAGGGGGGNDGVGGAG 707
G GG G SGG G G+ AA G G GG
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 31.6 bits (71), Expect = 0.027
Identities = 24/76 (31%), Positives = 28/76 (36%)

Query: 1245 GKGGSGGTGGTGLTGAGGAGGAGGTGGQADSVLLGDSGGGEGGSGGFGGTGLTTGGEGGR 1304
G+G + G T GG G G GG +D GG G G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 1305 GGIGGAGDGSSGGAGG 1320
GG G +G GS G
Sbjct: 66 GGNGNSGGGSGTGGNL 81



Score = 30.8 bits (69), Expect = 0.041
Identities = 28/81 (34%), Positives = 34/81 (41%)

Query: 386 GAGGLGGVGGVGGSGGFGANAVTPGGAGGQGGAGGDGGAGGAGGVGGTGGLISFLGGHGT 445
G G G G + G T G GG G + GG+G I + GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 446 GGAGGEGGSGGIAGDGGKGAA 466
G GG G SGG +G GG +A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSA 83



Score = 30.8 bits (69), Expect = 0.043
Identities = 36/115 (31%), Positives = 46/115 (40%), Gaps = 6/115 (5%)

Query: 517 GAGGQGADALGPAFTSGATGGTGGDGGAGGLVGDGGNGGAGGRGATGGVGASATAPGASG 576
G G+G + + + GG G G GG G+G + GG S G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGG--ASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 577 GGGQAGGSGGSGGAGGAGGALAGTGGAGGEG----GTGGDGGTGGNGAGGAPGAA 627
G G GG+G SGG G GG L+ G T G GG + + GA AA
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 30.8 bits (69), Expect = 0.048
Identities = 25/84 (29%), Positives = 28/84 (33%)

Query: 1255 TGLTGAGGAGGAGGTGGQADSVLLGDSGGGEGGSGGFGGTGLTTGGEGGRGGIGGAGDGS 1314
+G G G GA T G + G GG G + G G GI G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1315 SGGAGGRGGDGGAGITGAGGQGGA 1338
G GG G GG TG A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVA 85


18  MMAR_1453  MMAR_1462  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_1453  0       21       -3.516772  transposase, ISMyma01_aa1
MMAR_1454  -1      21       -2.836114  hypothetical protein
MMAR_1455  2       21       -1.694337  transposase, ISMyma03_aa2
MMAR_1456  3       23       -0.856047  transposase, ISMyma03_aa1
MMAR_1457  2       18       -0.976339  hypothetical protein
MMAR_1458  2       21       -1.178141  hypothetical protein
MMAR_1459  2       20       -1.482307  hypothetical protein
MMAR_1460  1       19       -1.420436  PPE family protein
MMAR_1461  0       17       -0.506854  PPE family protein
MMAR_1462  2       13       -1.699390  oxidoreductase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_1462  PHPHTRNFRASE  29     0.048    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 28.6 bits (64), Expect = 0.048
Identities = 19/114 (16%), Positives = 38/114 (33%), Gaps = 19/114 (16%)

Query: 126 FVHDPAL-------IDAHGVNPDADVVDAYSYLALTID--TDWYLAWLAHEAEVAGVTAV 176
+ DP L I+ +N + + + + + Y+ A + V
Sbjct: 80 VLDDPELVDGIKGKIENEQMNAEYALKEVSDMFVSMFESMDNEYMK-----ERAADIRDV 134

Query: 177 QRRIRGPLLEQEQRLRAEYGAEVIVNCAGLGARELADDSTLDLHRCALLRIVND 230
+R+ G L+ E A E + + A +L T L++ + D
Sbjct: 135 SKRVLGHLIGVETGSLATIAEETV-----IIAEDLTPSDTAQLNKQFVKGFATD 183


19  MMAR_1588  MMAR_1607  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_1588  6       40       8.742233   hypothetical protein
MMAR_1589  7       41       8.943312   camphor resistance protein CrcB
MMAR_1590  6       40       8.626175   camphor resistance protein CrcB
MMAR_1591  5       39       8.345757   phosphoglucomutase
MMAR_1592  6       43       9.075115   *hypothetical protein
MMAR_1594  5       42       8.690023   PE-PGRS family protein
MMAR_1595  0       17       -2.513516  O-methyltransferase
MMAR_1596  0       18       -3.159698  hypothetical protein
MMAR_1597  -1      15       -0.604160  transcriptional regulatory protein
MMAR_1598  0       13       -0.555686  transposase
MMAR_1599  0       12       -0.391361  transposase
MMAR_1600  1       12       0.485716   hypothetical protein
MMAR_1601  1       11       0.565229   PE-PGRS family protein
MMAR_1602  2       11       0.797955   oxidoreductase GMC-type
MMAR_1603  0       11       -0.164141  hypothetical protein
MMAR_1604  -1      11       -0.054348  hypothetical protein
MMAR_1605  0       11       -0.056324  hypothetical protein
MMAR_1606  1       13       -0.471422  fatty-acid-CoA ligase
MMAR_1607  2       14       -0.770279  succinate-semialdehyde dehydrogenase [NADP+]
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_1592  BACYPHPHTASE  27     0.012    Salmonella/Yersinia modular tyrosine phosphatase si...
		>BACYPHPHTASE#Salmonella/Yersinia modular tyrosine phosphatase signature.

Length = 468

Score = 26.7 bits (58), Expect = 0.012
Identities = 24/81 (29%), Positives = 34/81 (41%), Gaps = 10/81 (12%)

Query: 4 ALHSQATRVPRGVGSE-GRVAPPFPNTAGKVSDGYPPNFLLTLAMAMALATATAWSAMSP 62
ALH+ T V G+ S PP P + G+ A ATA S +SP
Sbjct: 138 ALHAPGTPVREGLRSHLDPRTPPLPPRERPHTSGH---------HGAGEARATAPSTVSP 188

Query: 63 FGPSLKPLSSQQLAVMRECVA 83
+GP + S +L +R +A
Sbjct: 189 YGPEARAELSSRLTTLRNTLA 209


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_1594  cloacin  40     1e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 40.5 bits (94), Expect = 1e-04
Identities = 37/114 (32%), Positives = 48/114 (42%)

Query: 786 TGGAGGNAGTGAGAAGNNGDGGAGGTGGNSGAQAGDGGAGAANTTIGGTGGAGGSGGNIG 845
+GG G TGA + N +GG G G GA G G + N GG+G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 846 AGGAGGQGGAAGTTSGVGGASGTSGTSGQSIGTGGDGGAGGDGGTGTTGADAAA 899
G GG G + G + G S + GAGG + + GA +AA
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 35.5 bits (81), Expect = 0.005
Identities = 33/95 (34%), Positives = 39/95 (41%), Gaps = 5/95 (5%)

Query: 272 AGGDNGGFAGGDGGHGGLLCGAGGDGGLGGGDGGNAGWLLGFGGHGGAGGDGGFAGGEGG 331
+GGD G G G + G G+GGG +GW GG G G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 332 NGGWLLGFGGHGGTGGIGGGTGGAGGAGGLVMGFG 366
+G GG G G G GTGG A + FG
Sbjct: 62 HGN-----GGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 34.7 bits (79), Expect = 0.007
Identities = 28/84 (33%), Positives = 38/84 (45%)

Query: 884 AGGDGGTGTTGADAAAGSGLGGDIGGTGGAGGAGGTGGSGGTDSGVGGSGGVGGVGGTGG 943
+GGDG TGA + +G+ GG G G G + G+G S + GGSG GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 944 QGGAGSDNSANAGVAGGAGGQGGA 967
G G + ++ G G A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVA 85



Score = 34.3 bits (78), Expect = 0.010
Identities = 25/81 (30%), Positives = 32/81 (39%)

Query: 1251 GTGGAGGQGGTGGAAGTGTGGIQGAGGQGGTGGTGGTGGSGGDGTDNSTTPGAAGGAGGQ 1310
G G G T G G G+ GG G G G+ + G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 1311 GGTGGTGGAAGSGGTGSTMGA 1331
GG G +GG +G+GG S + A
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAA 86



Score = 33.9 bits (77), Expect = 0.015
Identities = 27/78 (34%), Positives = 34/78 (43%)

Query: 575 GGDGGTGGTGGEGTDAAAGSGLTGGTGYAGGTGGTGGAGGNSGLGGTNGSGGHGGTGGTG 634
GGDG TG T G TG G + G+G + N+ GG +GSG H G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 635 GTGGTGGSGADNTTGIGG 652
G GG G+ + G
Sbjct: 63 GNGGGNGNSGGGSGTGGN 80



Score = 33.5 bits (76), Expect = 0.016
Identities = 25/86 (29%), Positives = 33/86 (38%)

Query: 1944 GTGGAGGTGGTGGAAGTGTGGIQGAGGQGGTGGTGGQGGTGGDGTDNSTTPGAAGGAGGQ 2003
G G G T G G G+ GG G + G G+ + G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 2004 GGTGGTGGAAGQGGTGSTSGSAGSSG 2029
GG G +GG +G GG S + + G
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 33.5 bits (76), Expect = 0.017
Identities = 26/84 (30%), Positives = 33/84 (39%)

Query: 1364 GQGASGGAGGTGGTGGAAGTGTGGIQGAGGQGGTGGTGGQGGTGGDGTDNSTTPGAAGGA 1423
G G G T G G G+ GG G + G G+ + G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1424 GGQGGTGGTGGAAGTGGTGSTMGA 1447
G GG G +GG +GTGG S + A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 33.5 bits (76), Expect = 0.019
Identities = 25/79 (31%), Positives = 36/79 (45%)

Query: 910 TGGAGGAGGTGGSGGTDSGVGGSGGVGGVGGTGGQGGAGSDNSANAGVAGGAGGQGGAGG 969
+GG G TG + + GG G+G GG G S+N+ G +G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 970 TGGAAGSGGTGSVAGADGS 988
G G+G +G +G G+
Sbjct: 62 HGNGGGNGNSGGGSGTGGN 80



Score = 33.1 bits (75), Expect = 0.021
Identities = 30/79 (37%), Positives = 38/79 (48%), Gaps = 1/79 (1%)

Query: 726 GAGGAGGNSGLGGTNGSGGHGGTGGTGGTGGTGGSGADALDGSGASGHDGGDSGAGGDG- 784
G G G N+G T+G+ G TG G G + GSG + + G G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 785 GTGGAGGNAGTGAGAAGNN 803
G GG GN+G G+G GN
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 33.1 bits (75), Expect = 0.025
Identities = 27/77 (35%), Positives = 33/77 (42%)

Query: 693 GGDGGTGGTGGQGTDAAAGTGLTGGTGYAGGTGGAGGAGGNSGLGGTNGSGGHGGTGGTG 752
GGDG TG T G TG G + G+G + N+ GG +GSG H G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 753 GTGGTGGSGADALDGSG 769
G GG G+ G
Sbjct: 63 GNGGGNGNSGGGSGTGG 79



Score = 33.1 bits (75), Expect = 0.026
Identities = 25/84 (29%), Positives = 32/84 (38%)

Query: 2634 GQGASGGAGGTGGTGGAAGTGTGGIQGAGGQGGTGGAGGSGGTGGDGTDNSTTPGAAGGA 2693
G G G T G G G+ GG G G G+ + G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 2694 GGQGGTGGTGGAAGSGGTGSTMGA 2717
G GG G +GG +G+GG S + A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 32.8 bits (74), Expect = 0.032
Identities = 31/82 (37%), Positives = 33/82 (40%), Gaps = 3/82 (3%)

Query: 153 AGGSAGLWGNGGSGGSGGIGGGTGGNGGSGGWLLGRGGIGGAGGTGGGSGGSGGGAWLFG 212
+GG G SG I GG G G GG G G + GGSG G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGG---ASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 213 DGGAGGMGGTGGSGGGFGGTGG 234
G G GG G SGGG G G
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGN 80



Score = 32.4 bits (73), Expect = 0.037
Identities = 25/84 (29%), Positives = 32/84 (38%)

Query: 2402 GQGASGGAGGTGGTGGAAGTGTGGTQGAGGQGGTGGTGGQGGTGGDGTDNSTTPGAAGGA 2461
G G G T G G G GG G + G G+ + G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 2462 GGQGGTGGTGGAAGSGGTGSTMGA 2485
G GG G +GG +G+GG S + A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 32.4 bits (73), Expect = 0.037
Identities = 25/84 (29%), Positives = 32/84 (38%)

Query: 2518 GQGASGGAGGTGGTGGAAGTGTGGTQGAGGQGGTGGTGGQGGTGGDGTDNSTTPGAAGGA 2577
G G G T G G G GG G + G G+ + G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 2578 GGQGGTGGTGGAAGSGGTGSTMGA 2601
G GG G +GG +G+GG S + A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 32.4 bits (73), Expect = 0.038
Identities = 33/104 (31%), Positives = 41/104 (39%), Gaps = 1/104 (0%)

Query: 148 GQAGGAGGSAGLWGNGGSGGSGGIGGGTGGNGGSGGWLLGRGGIGGAGGTGGGSGGSGGG 207
G G A +GG G+G G G + GSG W GG G+G GG G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSG-WSSENNPWGGGSGSGIHWGGGSGH 62

Query: 208 AWLFGDGGAGGMGGTGGSGGGFGGTGGHGGLLYGNGGAGGAGAT 251
G+G +GG GTGG+ G GAGG +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVS 106



Score = 32.4 bits (73), Expect = 0.041
Identities = 27/86 (31%), Positives = 35/86 (40%)

Query: 462 AGGDGGAGGAGADGSNGVGDGGNGATGGGGGVGGDGGAGGQAALLFGCGGTGGHAGAGGA 521
+GGDG GA ++G +GG G GGG G + G G+G H G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 522 GGGGGTGADSLTGIGGAGGDGGIGGA 547
G GG +S G G G +
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAP 87



Score = 32.4 bits (73), Expect = 0.042
Identities = 33/108 (30%), Positives = 40/108 (37%), Gaps = 4/108 (3%)

Query: 744 GHGGTGGTGGTGGTGGSGADALDGSGASGHDGGDSGAGGDGGTGGAGGNAGTGAGAAGNN 803
G G G G T G+ G G G SG + G G +G G +
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 804 GDGGAGGTGGNSGAQAGDGGAGAANTTIG----GTGGAGGSGGNIGAG 847
G+GG G G G+ A AA G T GAGG +I AG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 32.4 bits (73), Expect = 0.045
Identities = 33/106 (31%), Positives = 39/106 (36%), Gaps = 4/106 (3%)

Query: 2588 GAAGSGGTGSTMGAEGSSGDGGTGGLGGAGGTGGQGTTAVNPGDTGGQGASGGAGGTGGT 2647
G G G G+ G TG G G + G G ++ N GG G+ GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 2648 GGAAGTGTGGIQGAGGQGGTGGAGGSGGTGGDGTDNSTTPGAAGGA 2693
G G G G GG G G G +TPGA G A
Sbjct: 63 GNGGGNGNSG----GGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.0 bits (72), Expect = 0.047
Identities = 28/78 (35%), Positives = 31/78 (39%)

Query: 212 GDGGAGGMGGTGGSGGGFGGTGGHGGLLYGNGGAGGAGATGQTGGSGGWAALWGRGGAGG 271
GDG G SG GG G G + G+G + GG G WG G G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 272 AGGDNGGFAGGDGGHGGL 289
GG NG GG G G L
Sbjct: 64 NGGGNGNSGGGSGTGGNL 81


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_1597  HTHTETR  54     3e-11    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 54.2 bits (130), Expect = 3e-11
Identities = 26/111 (23%), Positives = 45/111 (40%), Gaps = 5/111 (4%)

Query: 19 RRLRYRDRRGEILDAVMAHLLEHGISGMSFRTLAAAAGVSHITLRHHFGTKDELLVEIFG 78
+ ++ R ILD + + G+S S +A AAGV+ + HF K +L EI+
Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64

Query: 79 -----VIGARVQIPDHFGADDVESLVRKMWQRWTEPQSDRRSRLVFEAYAH 124
+ ++ F D + L + ++ R RL+ E H
Sbjct: 65 LSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFH 115


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
MMAR_1601  RTXTOXINA  34     3e-04    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 33.8 bits (77), Expect = 3e-04
Identities = 26/146 (17%), Positives = 52/146 (35%), Gaps = 19/146 (13%)

Query: 9 ECLNQAAGQLENIGTALGAAN-AAAAPPTTG-IAAAAGDEVSAAVASL-FAEHGQGFQHL 65
E + G + + A AA T+ A V+ A++ L F F+
Sbjct: 274 ELTTKVLGNVGKGISQYIIAQRAAQGLSTSAAAAGLIASAVTLAISPLSFLSIADKFKR- 332

Query: 66 CGEAAAFHSRFVQALG------------GAGSAFAASEATNAAL--MSASAAAASAAAIP 111
+ + RF + LG G+ A+ + L +S+ +AA+ ++
Sbjct: 333 ANKIEEYSQRF-KKLGYDGDSLLAAFHKETGAIDASLTTISTVLASVSSGISAAATTSLV 391

Query: 112 VPPLPPLPPIINELINWAFDVVNWAI 137
P+ L + +I+ + A+
Sbjct: 392 GAPVSALVGAVTGIISGILEASKQAM 417


20  MMAR_1634  MMAR_1643  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_1634  5       11       -3.397683  cytochrome P450 136A2 Cyp136A2
MMAR_1635  5       12       -3.499688  TetR family transcriptional regulator
MMAR_1636  6       11       -3.580152  short chain dehydrogenase
MMAR_1637  5       12       -4.297505  TetR family transcriptional regulator
MMAR_1638  5       12       -4.402910  hypothetical protein
MMAR_1639  4       11       -4.376174  PPE family protein
MMAR_1640  3       14       -3.560281  glutaredoxin electron transport component of
MMAR_1641  2       13       -3.398123  ribonucleotide reductase stimulatory protein
MMAR_1642  3       15       -3.546443  ribonucleotide-diphosphate reductase subunit
MMAR_1643  2       12       -1.982781  AsnC family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_1635  HTHTETR  69     1e-16    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 69.3 bits (169), Expect = 1e-16
Identities = 21/147 (14%), Positives = 53/147 (36%), Gaps = 5/147 (3%)

Query: 17 RRRGDKQRQAILQAVRELLEERPFAELSVATISNRAGVARSGFYFYFDSKYSVLAQLMAE 76
++ + RQ IL L ++ + S+ I+ AGV R Y++F K + +++
Sbjct: 6 KQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWEL 65

Query: 77 AVEELEERTQYFAPRQPGESPQEFAKRM--VGSAAIVYTHNDPVMMACN--AARHTDIEI 132
+ + E + + PG+ + + V + + +M ++ +
Sbjct: 66 SESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAV 125

Query: 133 RDILDQQFDVVLRE-IVGVIDAEMRAG 158
+ + + I + + A
Sbjct: 126 VQQAQRNLCLESYDRIEQTLKHCIEAK 152


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_1636  DHBDHDRGNASE  94     1e-24    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 93.6 bits (232), Expect = 1e-24
Identities = 59/189 (31%), Positives = 92/189 (48%), Gaps = 2/189 (1%)

Query: 9 GKRCLVTGAASGIGRATALRLAEQGAELYLTDRDGDGLAQTVSAARALGAQVPEHRVLDI 68
GK +TGAA GIG A A LA QGA + D + + L + VS+ +A A+ E D+
Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAE-ARHAEAFPADV 66

Query: 69 SDYDEVAAFAADIHASHPSMDVVLNIAGISAWGTVDRLSHEHWSKMVAVNLMGPIHVIET 128
D + A I +D+++N+AG+ G + LS E W +VN G + +
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126

Query: 129 FVPPMVAAGRGGHLVNVSSAAGLVALPWHAAYSASKYGLRGLSEVLRFDLARHRIGVSVV 188
M+ R G +V V S V AAY++SK ++ L +LA + I ++V
Sbjct: 127 VSKYMMDR-RSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 189 VPGAVDTPL 197
PG+ +T +
Sbjct: 186 SPGSTETDM 194


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_1637  HTHTETR  72     8e-18    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 72.0 bits (176), Expect = 8e-18
Identities = 28/130 (21%), Positives = 51/130 (39%), Gaps = 4/130 (3%)

Query: 14 ARQERGDAARNRELLLQAARRLVAKRGAEAVTTDDIAAEAGVGKGTLFRRFGSRAGLMMV 73
AR+ + +A R+ +L A RL +++G + + +IA AGV +G ++ F ++ L
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 74 LLDEDERASQQAF--LFGPPPLGPEAAPLDRLIAFGRERICFVHAHHELLSEANRNPLTR 131
+ + E + P P + + LI LL E +
Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLES--TVTEERRRLLMEIIFHKCEF 119

Query: 132 YGAAASVHRR 141
G A V +
Sbjct: 120 VGEMAVVQQA 129


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_1643  HTHTETR  48     5e-09    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 48.5 bits (115), Expect = 5e-09
Identities = 25/108 (23%), Positives = 41/108 (37%), Gaps = 1/108 (0%)

Query: 23 RWREHRKKVRNEIVEAAFRAIDRLGPE-LSVREIAEEAGTAKPKIYRHFTDKSDLFVAIG 81
+ ++ ++ R I++ A R + G S+ EIA+ AG + IY HF DKSDLF I
Sbjct: 4 KTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIW 63

Query: 82 ERLRDMLWTSIFPSINLATDSAREVIRRSVEEYVSLVDKHPNVLRVFI 129
E + V+R + + +
Sbjct: 64 ELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLME 111


21  MMAR_1824  MMAR_1829  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias   Product
MMAR_1824  9       19       4.716266  hypothetical protein
MMAR_1825  8       18       4.696451  transcriptional regulatory protein
MMAR_1826  9       18       4.879927  uridylate kinase
MMAR_1827  10      17       5.169699  ribosome recycling factor
MMAR_1828  10      18       5.510132  integral membrane phosphatidate
MMAR_1829  10      18       6.000608  PE-PGRS family protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_1829  cloacin  45     3e-06    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 45.1 bits (106), Expect = 3e-06
Identities = 37/111 (33%), Positives = 45/111 (40%), Gaps = 5/111 (4%)

Query: 1540 GSGGSGGNGGAGGSGGTTGTGGAGGGGGNGGNGGTGFSTNNVASGGMGGGGGKGGGGGDG 1599
G G G N GA + G G G G G G + G+G+S+ N GG G G GGG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1600 LIGGAGGGGGDGGNGAAGLTSGGNGGAGG-----AGGAGGAGRTTSGDGGS 1645
GG G G G L++ A G GAGG + S S
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALS 113



Score = 36.6 bits (84), Expect = 0.001
Identities = 34/109 (31%), Positives = 42/109 (38%)

Query: 333 GGGGVGGTGGAGGAAGLIGRGGDGGGGGAGDGGSTGGAGQTGGDGGRAGDGGVGGRGGWL 392
GG G G GA +G I G G G G G +G + + GG +G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 393 AGAGGDGGAGGVGGVGGAGGGGADGLVLGGDGGDGGDGGTGGAGGSGGA 441
GG+G +GG G GG A + G G S GA
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 36.2 bits (83), Expect = 0.001
Identities = 31/81 (38%), Positives = 40/81 (49%), Gaps = 3/81 (3%)

Query: 1485 GAGGNGLDGTAKAAGGN--GGNGGVGGAGGVAGGAG-GAGGAGGAGGNGPGGLFGPDGGS 1541
G G G + A + GN GG G+G GG + G+G + GG+G G +G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1542 GGSGGNGGAGGSGGTTGTGGA 1562
G GGNG +GG GT G A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSA 83



Score = 35.1 bits (80), Expect = 0.003
Identities = 31/92 (33%), Positives = 37/92 (40%), Gaps = 4/92 (4%)

Query: 1142 GDGGRGGNGAATASGNGGDGGNGGDPGVGGAGGKGGIGETIGLSGLGGYNPTSGGNGGDG 1201
GDG GA + SGN G G G G + G G E G G GG G G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 1202 GDGGSGSDGDFNHLAGSDGGNGGHGGSGGAFG 1233
GG+G+ G + GGN + AFG
Sbjct: 64 NGGGNGNSGGGS----GTGGNLSAVAAPVAFG 91



Score = 34.7 bits (79), Expect = 0.004
Identities = 33/94 (35%), Positives = 41/94 (43%), Gaps = 7/94 (7%)

Query: 431 GTGGAGGSGGAGGAGGLISLFGGQGAGGAGGAGGAGGLAGDGGVGAAGTFAGGGSGTGGA 490
G G G + GA G I+ GG G G GG + G + GGGSG+G
Sbjct: 3 GGDGRGHNTGAHSTSGNIN-------GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIH 55

Query: 491 GGDGGTPGVGGAGGAGGAGSVAGAHGSEGARPLS 524
G G G GG G G GS G + S A P++
Sbjct: 56 WGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVA 89



Score = 34.7 bits (79), Expect = 0.004
Identities = 34/90 (37%), Positives = 45/90 (50%), Gaps = 4/90 (4%)

Query: 542 IAGDGGVGGNGGVFGNGGSGGAGGTGVAGQRGVSADTAGGSGTAGEDGGVGGDGGAGGLG 601
++G G G N G G+ G TG+ GV + GSG + E+ GG G+G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGL----GVGGGASDGSGWSSENNPWGGGSGSGIHW 56

Query: 602 GALAGHGGDGGAGGIGGAGGDGGSGAAGAA 631
G +GHG GG G GG G GG+ +A AA
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 34.7 bits (79), Expect = 0.004
Identities = 34/101 (33%), Positives = 40/101 (39%), Gaps = 1/101 (0%)

Query: 996 GNGGAYGNGGAGGDGGDGVNGTRGLTAIAPGGSGTDGGAGSAGGAGGYGGNGGALAGDGG 1055
G G N GA G+ +NG + G S G + GG G+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGN-INGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1056 DGGDGGNGGSGGDGGSGANGGAGTNGTTTGIPDGGRGGDGG 1096
G GGNG SGG G+G N A G P G GG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 34.3 bits (78), Expect = 0.005
Identities = 34/109 (31%), Positives = 42/109 (38%), Gaps = 1/109 (0%)

Query: 375 GDGGRAGDGGVGGRGGWLAGAGGDGGAGGVGGVGGAGGGGADGLVLGGDGGDGGDGGTGG 434
G GR + G G + G G GG G G+G + GG G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGG-GASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 435 AGGSGGAGGAGGLISLFGGQGAGGAGGAGGAGGLAGDGGVGAAGTFAGG 483
G GG G +GG G A A A G L+ G G A + + G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 34.3 bits (78), Expect = 0.005
Identities = 33/104 (31%), Positives = 41/104 (39%)

Query: 1585 GMGGGGGKGGGGGDGLIGGAGGGGGDGGNGAAGLTSGGNGGAGGAGGAGGAGRTTSGDGG 1644
G G G +G G G GGG + GG G+G G G GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 1645 SGGNGGTGGEGGGDILVAPVVAGTGGAGGEGGAGGYSNTVGGVA 1688
+G +GG G GG VA VA A GAGG + ++ A
Sbjct: 68 NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 34.3 bits (78), Expect = 0.006
Identities = 33/88 (37%), Positives = 37/88 (42%), Gaps = 3/88 (3%)

Query: 1065 SGGDGGSGANGGAGTNGTTTGIPDGGRGGDGGHGGHGGDGGAGGHGGVAGKAQAAGYTDG 1124
SGGDG G T+G G P G G G G G GG +G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWG---G 58

Query: 1125 THGAGGVGGNGGAGGLAGDGGRGGNGAA 1152
G G GGNG +GG +G GG AA
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 33.5 bits (76), Expect = 0.008
Identities = 28/85 (32%), Positives = 33/85 (38%), Gaps = 4/85 (4%)

Query: 781 GGDGGAGGDGVTGSQGLTATTPGGSGTNGGAGSAGGAGGS----GGLGGTLAGHGGDGGA 836
GGDG G + G P G G GGA G GG G+ GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 837 GGAGGDGGFGGSGATGSSGDQGSAA 861
G GG+G GG TG + +A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAP 87



Score = 33.5 bits (76), Expect = 0.010
Identities = 27/83 (32%), Positives = 35/83 (42%), Gaps = 3/83 (3%)

Query: 295 AGGQGVGLDGGAGGGGGQGGLVYGGGGDGGAGGVGGEVGGGGVGGTGGAGGAAGLIGRGG 354
+GG G G + GA G + GG G GG + G GG+ I GG
Sbjct: 2 SGGDGRGHNTGAHSTSGN---INGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 355 DGGGGGAGDGGSTGGAGQTGGDG 377
G G G G++GG TGG+
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNL 81



Score = 33.5 bits (76), Expect = 0.010
Identities = 26/79 (32%), Positives = 29/79 (36%)

Query: 1263 GGAGGAGGDGGYLAGNGGTGGTGGAGGAGGTGAAGEPQFENGQNGGAGGIGRAGGAGGDG 1322
GG G G + GG G G GG EN GG G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1323 GNGGHAHAAGFQNGAGGAG 1341
GNGG +G +G GG
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 33.1 bits (75), Expect = 0.010
Identities = 36/110 (32%), Positives = 39/110 (35%), Gaps = 3/110 (2%)

Query: 260 GGGGDGGTGGVGAVSGGVGGGAGRAWLWGAGGAGGAGGQGVGLDGGAGGGGGQGGLVYGG 319
GG G G G + SG + GG G GG G + GGG G G GG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGL---GVGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 320 GGDGGAGGVGGEVGGGGVGGTGGAGGAAGLIGRGGDGGGGGAGDGGSTGG 369
G G GG G GG G GG A A G G G S
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109



Score = 33.1 bits (75), Expect = 0.013
Identities = 30/102 (29%), Positives = 36/102 (35%)

Query: 1317 GAGGDGGNGGHAHAAGFQNGAGGAGGDGGQGGTGGGGGFGGDGKAGYNQRAGGDGGAGGD 1376
G G G N G +G NG G GG G G + G + GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1377 GGFGGTGGTGGLNGDGTTRATSGANGIRGDGGSGGTGGNGLG 1418
G GG G +GG +G G + A G G GL
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.8 bits (74), Expect = 0.013
Identities = 25/83 (30%), Positives = 32/83 (38%)

Query: 813 SAGGAGGSGGLGGTLAGHGGDGGAGGAGGDGGFGGSGATGSSGDQGSAANPNGGRGGDGG 872
S G G + +G+ G G G G GSG + + G + GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 873 SGADGGAGGTGGDGGAGGQAQAA 895
G GG G +GG G GG A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAV 84



Score = 32.8 bits (74), Expect = 0.014
Identities = 33/111 (29%), Positives = 42/111 (37%), Gaps = 1/111 (0%)

Query: 1032 GGAGSAGGAGGYGGNGGALAGDGGDGGDGGNGGSGGDGGSGANGGAGTNGTTTGIPDGGR 1091
GG G G + +G G G G GG G G G+ G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1092 GGDGGHGGHGGDGGAGGHGGVAGKAQAAGYTD-GTHGAGGVGGNGGAGGLA 1141
G GG+G GG G GG+ A G+ T GAGG+ + AG L+
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALS 113



Score = 32.8 bits (74), Expect = 0.015
Identities = 27/89 (30%), Positives = 33/89 (37%)

Query: 1201 GGDGGSGSDGDFNHLAGSDGGNGGHGGSGGAFGNGGNGGDGGTGGDGSRAFQPEIAGGAA 1260
GGDG + G + +GG G G GGA G + G GS + G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1261 GHGGAGGAGGDGGYLAGNGGTGGTGGAGG 1289
G+GG G G G GN A G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 32.8 bits (74), Expect = 0.017
Identities = 39/121 (32%), Positives = 47/121 (38%), Gaps = 12/121 (9%)

Query: 128 GADGVAGTGQAGGAGGLLWGNGGSGGSGGVGQSGGAGGSAGLIGRGGAGGAGGFSGGSGA 187
G DG A G + G G GG G S GG+G + GGSG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 188 GGAGGVGGNGGWLWGAGGDGGAGGVGAVSGGVGGGAGRAWLWGAGGAGGAGGQGVGLDGG 247
G GG G +GG G G G + AV+ V G + A GAGG V + G
Sbjct: 63 GNGGGNGNSGG------GSGTGGNLSAVAAPVAFG------FPALSTPGAGGLAVSISAG 110

Query: 248 A 248
A
Sbjct: 111 A 111



Score = 32.4 bits (73), Expect = 0.022
Identities = 34/109 (31%), Positives = 40/109 (36%), Gaps = 4/109 (3%)

Query: 202 GAGGDGGAGGVGAVSGGVGGGAGRAWLWGAGGAGGAGGQGVGLDGGAGGGGGQGGLVYGG 261
G G G G + SG + GG + G G GG G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 262 GGDGGTGGVGAVSGGVGGG----AGRAWLWGAGGAGGAGGQGVGLDGGA 306
G GG G G SG G A A+ + A GAGG V + GA
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 32.0 bits (72), Expect = 0.025
Identities = 28/103 (27%), Positives = 35/103 (33%)

Query: 1620 SGGNGGAGGAGGAGGAGRTTSGDGGSGGNGGTGGEGGGDILVAPVVAGTGGAGGEGGAGG 1679
SGG+G G +G G G G GG G P G+G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1680 YSNTVGGVAGNGGAGGSGGTGGTLAAHKVVYTSSGRGGVGGTG 1722
+ N G GG+G G A + + G GG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.0 bits (72), Expect = 0.028
Identities = 36/102 (35%), Positives = 41/102 (40%), Gaps = 4/102 (3%)

Query: 1427 GHGGKGGVGGYGSVGGV--GGSGGHGGNGGQSGTPRFGG-HGGDGGAGGAGVVTGGTGGT 1483
G G+G G S G GG G G GG S + + GG G+G+ GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1484 GGAGGNGLDGTAKAAGGNGGNGGVGGAGGV-AGGAGGAGGAG 1524
G GGNG G GGN A G A GAGG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.0 bits (72), Expect = 0.028
Identities = 32/105 (30%), Positives = 40/105 (38%), Gaps = 3/105 (2%)

Query: 359 GGAGDGGSTGGAGQTGGDGGRAGDGGVGGRGGWLAGAGGDGGAGGVGGVGGAGGGGADGL 418
GG G G +TG +G G GVGG +G + G G G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG- 61

Query: 419 VLGGDGGDGGDGGTGGAGGSGGAGGAGGLISLFGGQGAGGAGGAG 463
G+GG G+ G G G + A + F GAGG
Sbjct: 62 --HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 31.6 bits (71), Expect = 0.031
Identities = 22/79 (27%), Positives = 29/79 (36%)

Query: 846 GGSGATGSSGDQGSAANPNGGRGGDGGSGADGGAGGTGGDGGAGGQAQAAGYADGTRGAG 905
GG G ++G ++ N NGG G G G G + G +G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 906 GNGGAGGSGGLAGDGGRGG 924
GNGG G+ G G
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 31.6 bits (71), Expect = 0.032
Identities = 37/121 (30%), Positives = 41/121 (33%), Gaps = 5/121 (4%)

Query: 1407 GGSGGTGGNGLGGRTENGDGGHGGKGGVGGYGSVGGVGGSGGHGGNGGQSGTPRFGGHGG 1466
GG G G + N +GG G G GG G G G SG GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1467 DGGAGGAGVVTGGTGGTGGAGGNGLDGTAKAAGGNGGNGGVGGAGGVAGGAGGAGGAGGA 1526
G G G +GG G GGN A A G G G + GA A A
Sbjct: 63 GNGGGN-----GNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIA 117

Query: 1527 G 1527

Sbjct: 118 D 118



Score = 31.6 bits (71), Expect = 0.032
Identities = 26/82 (31%), Positives = 34/82 (41%), Gaps = 5/82 (6%)

Query: 658 GSGGNGGAGGQAQAAGYADGVRGAGGSGGAGGAGGLAGAGGAGGDAYTTGGGAGGDGGHG 717
G G G G +G +G G G G GG + G + GGG+G G
Sbjct: 3 GGDGRGHNTGAHSTSGNING-----GPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWG 57

Query: 718 GDTGAGGAGGSGGVGSSTGLTG 739
G +G G GG+G G +G G
Sbjct: 58 GGSGHGNGGGNGNSGGGSGTGG 79



Score = 31.6 bits (71), Expect = 0.033
Identities = 24/79 (30%), Positives = 31/79 (39%)

Query: 656 AGGSGGNGGAGGQAQAAGYADGVRGAGGSGGAGGAGGLAGAGGAGGDAYTTGGGAGGDGG 715
+GG G G + + G G G GGA G + G +G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 716 HGGDTGAGGAGGSGGVGSS 734
HG G G +GG G G +
Sbjct: 62 HGNGGGNGNSGGGSGTGGN 80



Score = 31.6 bits (71), Expect = 0.037
Identities = 31/100 (31%), Positives = 38/100 (38%)

Query: 1602 GGAGGGGGDGGNGAAGLTSGGNGGAGGAGGAGGAGRTTSGDGGSGGNGGTGGEGGGDILV 1661
GG G G G + +G +GG G G GGA +S + GG G+G GG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1662 APVVAGTGGAGGEGGAGGYSNTVGGVAGNGGAGGSGGTGG 1701
GG G G S VA A + G GG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 31.2 bits (70), Expect = 0.048
Identities = 26/84 (30%), Positives = 30/84 (35%), Gaps = 1/84 (1%)

Query: 1415 NGLGGRTENGDGGHGGKGGVGGYGSVGGVGGSGGHGGNGGQSGTPRFGGHGGDGGAGGAG 1474
+G GR N G H G + G + GVGG G P GG G GG
Sbjct: 2 SGGDGRGHNT-GAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 1475 VVTGGTGGTGGAGGNGLDGTAKAA 1498
G G GG+G G A
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAV 84


22  MMAR_1844  MMAR_1850  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias   Product
MMAR_1844  2       17       0.884486  amidotransferase
MMAR_1845  6       15       4.142971  PPE family protein
MMAR_1846  9       15       6.448018  hypothetical protein
MMAR_1847  6       15       4.756531  PPE family protein
MMAR_1848  7       16       4.751305  hypothetical protein
MMAR_1849  6       14       3.679033  PPE family protein
MMAR_1850  4       13       3.513044  PE-PGRS family protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_1846  PF04183  27     0.048    IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 27.2 bits (60), Expect = 0.048
Identities = 14/64 (21%), Positives = 23/64 (35%), Gaps = 7/64 (10%)

Query: 42 SPRRWQRAVVVVVVAVGLFA------AVIAGWALRSGLSAAALLQPAAWSQAIPDIGHLQ 95
+P RW + V+ L + + RSGL A L + + + HL
Sbjct: 354 NPCRWLKPDESPVLMATLMECDENNQPLAGAYIDRSGLDAETWLT-QLFRVVVVPLYHLL 412

Query: 96 LAHG 99
+G
Sbjct: 413 CRYG 416


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_1850  cloacin  39     9e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.9 bits (90), Expect = 9e-05
Identities = 35/111 (31%), Positives = 45/111 (40%)

Query: 256 LAGLFGSGGGNGGNGGFGSVNGGAGGDGGDGGALAGPGGSGGSGAFGGTLGGDGGAGGAA 315
++G G G G + G++NGG G G GGA G G S + +GG G GG +
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 316 GLLFGPGGSGGAGGATFSGTGGVGGAGGHAGLFGDGGAGAVGGFSSVDGGA 366
G G G GG+ G A G GA G S+ GA
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 37.8 bits (87), Expect = 2e-04
Identities = 32/113 (28%), Positives = 44/113 (38%)

Query: 373 AGWFGNGGIGGAGGAGYFSSGGVGGGGGTGGVLFGNGGAGGNGGTASTGGAGGGGGAGVL 432
+G G G GA +GG G G GG G+G + N G+G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 433 IGNGGNAGIGGAGVTVGGTGVGGTSGLLLGLDGFNAPASASPIHALQQQALTA 485
GNGG G G G GG + + G + P + ++ AL+A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSA 114



Score = 37.0 bits (85), Expect = 3e-04
Identities = 37/114 (32%), Positives = 46/114 (40%), Gaps = 10/114 (8%)

Query: 670 LGGAGGAGGTGGYSDDAPSSNGGAGGNAGLLFGNGGHGGSGGASRGFAGTGGAGGNAGLL 729
+ G G G G + + NGG G G G GSG +S GG+G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGV---GGGASDGSGWSSENNPWGGGSGSGIHWG 57

Query: 730 FGSGGFGGFGGFGSGGGSGGNGGNAGL-------FGSGGDGGAGGFGVFIDGGS 776
GSG G G SGGGSG G + + F + GAGG V I G+
Sbjct: 58 GGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 35.1 bits (80), Expect = 0.001
Identities = 32/101 (31%), Positives = 37/101 (36%), Gaps = 2/101 (1%)

Query: 551 TGGAGGTSTGGLGGAGGE--GGAGGFLAGTGGAGGAGGFSPATGTGAGGVGGAGGAGGLF 608
+GG G G G GG G G G + G+G S G G G GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 609 GGGGGGGVGGGSLAGPGGGGGAGGAGGPLSGLVGAGGGAGG 649
G GGG G +G GG A A + GAGG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 33.9 bits (77), Expect = 0.003
Identities = 34/98 (34%), Positives = 38/98 (38%), Gaps = 8/98 (8%)

Query: 547 GTGGTGGAGGTSTGGLGGAGGEGGAGGFLAGTGGAG----GAGGFSPATGTGAGGVGGAG 602
G G GA TS GG G G GG G+G + GG G G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 603 GAGGLFGGGGGGGVGGGSLAGPGGGG----GAGGAGGP 636
G G GGG G G ++A P G GAGG
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGL 103



Score = 33.5 bits (76), Expect = 0.004
Identities = 29/85 (34%), Positives = 32/85 (37%), Gaps = 5/85 (5%)

Query: 593 TGAGGVGGAGGAGGLFGGGGGGGVGGGSLAGPGGGGGAGGAGGPLSGLVGAGGGAGGHGG 652
+G G G GA G GG G G G G G P G G+G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 653 PGDTDGGDGGAGGNGGLLGGAGGAG 677
G+GG GN G G GG
Sbjct: 62 -----HGNGGGNGNSGGGSGTGGNL 81



Score = 32.4 bits (73), Expect = 0.008
Identities = 32/92 (34%), Positives = 38/92 (41%), Gaps = 6/92 (6%)

Query: 743 SGGGSGGNGGNAGLFGSGGDGGAGGFGVFIDGGSGGSGGSGGPLLGTGGSGGAGGECDLG 802
SGG G+ A +GG G GV G G S GSG GG+G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGV----GGGASDGSGWSSENNPWGGGSGSGIHWG 57

Query: 803 GFGDGGDGGDGGGAMLIGNGGNGGNGAAGATA 834
G G+GG G + G G GGN +A A
Sbjct: 58 GGSGHGNGGGNGNSG--GGSGTGGNLSAVAAP 87



Score = 32.4 bits (73), Expect = 0.008
Identities = 42/106 (39%), Positives = 47/106 (44%), Gaps = 9/106 (8%)

Query: 324 SGGAGGATFSGTGGVGGA--GGHAGLFGDGGAGAVGGFSSVDG-GAGGAGGDAGWFGNGG 380
SGG G +G G GG GL GGA G+SS + GG+G W G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 381 IGGAGGAGYFSSGGVGGGGGTGGVLFGNGGAGGNGGTA-STGGAGG 425
G GG +G GGG GTGG L G A ST GAGG
Sbjct: 62 HGNGGG-----NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 32.4 bits (73), Expect = 0.009
Identities = 29/85 (34%), Positives = 32/85 (37%), Gaps = 5/85 (5%)

Query: 575 LAGTGGAGGAGGFSPATGTGAGGVGGAGGAGGLFGGGGGGGVGGGSLAGPGGGGGAGGAG 634
++G G G G +G GG G G GGG G G S P GGG G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGV-----GGGASDGSGWSSENNPWGGGSGSGIH 55

Query: 635 GPLSGLVGAGGGAGGHGGPGDTDGG 659
G GGG G GG T G
Sbjct: 56 WGGGSGHGNGGGNGNSGGGSGTGGN 80



Score = 32.0 bits (72), Expect = 0.012
Identities = 32/110 (29%), Positives = 38/110 (34%), Gaps = 2/110 (1%)

Query: 228 GAGGLGGTGGASGMDTGGDGGAGGAGGLLAGLFGSGGGNGGNGGFGSVNGGAGGDGGDGG 287
G G G GA +GG G G GSG + N G G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 288 ALAGPGGSGGSGAFGGTLGGDGGAGGAAGLLFGPGGSGGAGGATFSGTGG 337
G G+ G G+ GT G F + GAGG S + G
Sbjct: 63 GNGGGNGNSGGGS--GTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 31.6 bits (71), Expect = 0.013
Identities = 31/119 (26%), Positives = 41/119 (34%), Gaps = 1/119 (0%)

Query: 201 TGGAGGYSNFLTASGGVGGTGGTGGLFGAGGLGGTGGASGMDTGGDGGAGGAGGLLAGLF 260
+GG G N S GG GL GG G S + GG+G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 261 GSGGGNGGNGGFGSVNGGAGGDGGDGGALAGPG-GSGGSGAFGGTLGGDGGAGGAAGLL 318
GG GN G GS GG A P + G+G ++ + A ++
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIM 120



Score = 31.6 bits (71), Expect = 0.014
Identities = 45/133 (33%), Positives = 52/133 (39%), Gaps = 11/133 (8%)

Query: 148 SGAAGSGQNGGAGGAAGLF--GSGGAGGTGGSS-----STGNGGAGGAGGAGGLLLGNAG 200
SG G G N GA +G G G G GG+S S+ N GG G+G G +G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 201 TGGAGGYSNFLTASGGVGGTGGTGGLFGAGGLGGTGGASGMDTGGDGGAGGAGGLLAGLF 260
G GG N SGG GTGG A G S GG + AG L A +
Sbjct: 62 HGNGGGNGN----SGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIA 117

Query: 261 GSGGGNGGNGGFG 273
G FG
Sbjct: 118 DIMAALKGPFKFG 130



Score = 31.2 bits (70), Expect = 0.018
Identities = 28/83 (33%), Positives = 34/83 (40%), Gaps = 5/83 (6%)

Query: 349 GDGGAGAVGGFSSVDGGAGGAGGDAGWFGNGGIGGAGGAGYFSSGGV-GGGGGTGGVLFG 407
G G G G S G G G G G + G+G+ S GGG G+G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLG----VGGGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 408 NGGAGGNGGTASTGGAGGGGGAG 430
G G GG ++GG G GG
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNL 81



Score = 31.2 bits (70), Expect = 0.019
Identities = 26/79 (32%), Positives = 31/79 (39%), Gaps = 1/79 (1%)

Query: 717 AGTGGAGGNAGLLFGSGGF-GGFGGFGSGGGSGGNGGNAGLFGSGGDGGAGGFGVFIDGG 775
+G G G N G SG GG G G GGG+ G + G G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 776 SGGSGGSGGPLLGTGGSGG 794
G GG+G G+G G
Sbjct: 62 HGNGGGNGNSGGGSGTGGN 80



Score = 31.2 bits (70), Expect = 0.021
Identities = 35/110 (31%), Positives = 41/110 (37%), Gaps = 10/110 (9%)

Query: 621 LAGPGGGGGAGGAGGPLSGLVGAGGGAGGHGGPGDTDGGDGGAGGNGGLLGGAGGAGGTG 680
++G G G GA + G G G GG D G GG G GG
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 681 GYSDDAPSSNGGAGGNAGLLFGNGGHGGSGGASRGFA----GTGGAGGNA 726
G NGG GN+G G GG+ + A F T GAGG A
Sbjct: 61 G------HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 30.8 bits (69), Expect = 0.029
Identities = 22/75 (29%), Positives = 25/75 (33%)

Query: 114 TGRPLIGNGANGTPGTGADGAAGGWLLGNGGAGGSGAAGSGQNGGAGGAAGLFGSGGAGG 173
TG NG P G G GSG GG +G GG G
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGN 70

Query: 174 TGGSSSTGNGGAGGA 188
+GG S TG + A
Sbjct: 71 SGGGSGTGGNLSAVA 85



Score = 30.1 bits (67), Expect = 0.045
Identities = 33/120 (27%), Positives = 40/120 (33%)

Query: 522 DGGAGGSAPLFTAQDAGAGGAAGLWGTGGTGGAGGTSTGGLGGAGGEGGAGGFLAGTGGA 581
+GG G A D + GG+G G G GG G G +GTGG
Sbjct: 21 NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGN 80

Query: 582 GGAGGFSPATGTGAGGVGGAGGAGGLFGGGGGGGVGGGSLAGPGGGGGAGGAGGPLSGLV 641
A A G A GAGG G +A G G G L G++
Sbjct: 81 LSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMAALKGPFKFGLWGVALYGVL 140


23  MMAR_2026  MMAR_2033  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_2026  2       18       -2.058561  antibiotic-transport ATP-binding protein ABC
MMAR_2027  2       18       -1.296322  antibiotic ABC transporter integral membrane
MMAR_2028  4       25       4.216973   antibiotic ABC transporter integral membrane
MMAR_2029  3       24       4.398040   arsenic-transport integral membrane protein
MMAR_2030  2       23       5.214783   arsenic-transport integral membrane protein
MMAR_2031  4       22       5.799392   hypothetical protein
MMAR_2032  3       21       5.713623   1-deoxy-D-xylulose-5-phosphate synthase
MMAR_2033  3       22       6.668836   PE-PGRS family protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2033  cloacin  39     2e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.5 bits (89), Expect = 2e-04
Identities = 30/81 (37%), Positives = 36/81 (44%)

Query: 274 GAGGTGGAGGAGGTGGDGLAATTAGGTGGNAGDGGGGGTGGNAGDGGAGGQGGLFGNAGT 333
G G G GA T G+ T G GG A DG G + N GG+G G +G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 334 TGAGGDGGNGGNGGLAGNGGA 354
GG+G +GG G GN A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSA 83



Score = 37.0 bits (85), Expect = 5e-04
Identities = 35/99 (35%), Positives = 39/99 (39%), Gaps = 6/99 (6%)

Query: 893 GGDGRNGSTGAIGGNGGTGGTGGIGGTAGTGGTGGSAGSAGTGGTGGNGGAGDSGASGGT 952
GGDGR +TGA +G I G G GG A + N G SG+
Sbjct: 3 GGDGRGHNTGA------HSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHW 56

Query: 953 GGTGGEGGAGIGGKDGGTGGTGGTGGTGGAGVVSGLSAF 991
GG G G G G GG GTGG A V G A
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPAL 95



Score = 36.2 bits (83), Expect = 9e-04
Identities = 32/102 (31%), Positives = 37/102 (36%)

Query: 308 GGGGTGGNAGDGGAGGQGGLFGNAGTTGAGGDGGNGGNGGLAGNGGAGGNGDASNPNGGT 367
GG G G N G G G G G+G + GG G+G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 368 GGNGANPGAGGAGGAGGNGSRTGAPGTTGNTPTTAAGHGGKG 409
G G N +GG G GGN S AP G + G GG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 35.5 bits (81), Expect = 0.001
Identities = 29/80 (36%), Positives = 35/80 (43%)

Query: 859 AGGTGGNGGTGGNSGPGGTNGTGGHGGNGGTGGDGGDGRNGSTGAIGGNGGTGGTGGIGG 918
+GG G TG +S G NG G GG DG + + GG+G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 919 TAGTGGTGGSAGSAGTGGTG 938
GG G S G +GTGG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 35.1 bits (80), Expect = 0.002
Identities = 30/97 (30%), Positives = 36/97 (37%)

Query: 507 GAGGNAGAGGLSGTGDSTGNAGTGGNGGAGGNGGTGGDGSDGGPGGAGGKGGNSGPGGTN 566
G G G T + TG G G + G+G + GG G G + G G +
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 567 GTGGHGGNGGTGGDGGDGASGVGLGKAGGTGATGGNG 603
G GG GN G G G S V A G A G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPG 99



Score = 34.7 bits (79), Expect = 0.003
Identities = 29/81 (35%), Positives = 36/81 (44%), Gaps = 1/81 (1%)

Query: 756 AGGTGGAGGTGGNSGPGGTNGTGGHGGNGGTGGDGGDGASGVGQGKAGGTGATGGSGGNA 815
+GG G TG +S G NG G GG G G G S GG+G+ GG +
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGG-GASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 816 GSGGAAGTGGNGGTNGNAGTG 836
G G G G +GG +G G
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNL 81



Score = 33.9 bits (77), Expect = 0.005
Identities = 32/102 (31%), Positives = 39/102 (38%)

Query: 819 GAAGTGGNGGTNGNAGTGGTGGTGGNGGDGDKGAVGDSGFAGGTGGNGGTGGNSGPGGTN 878
G G G N G + +G G TG G G G S GG G+G + G G +
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 879 GTGGHGGNGGTGGDGGDGRNGSTGAIGGNGGTGGTGGIGGTA 920
G GG GN G G G + + T G GG A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 33.5 bits (76), Expect = 0.005
Identities = 32/97 (32%), Positives = 38/97 (39%)

Query: 610 GAAGTGGTGGTNGSAGTGGTGGTGGNGGAGDKGAVGDSGFAGGTGGAGGTGGNSGPGGTN 669
G G G G + ++G G TG G G G S GG G+G + G G +
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 670 GTGGHGGNGGTGGDGGDGASGVGQGKAGGTGATGGNG 706
G GG GN G G G S V A G A G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPG 99



Score = 33.5 bits (76), Expect = 0.006
Identities = 25/73 (34%), Positives = 32/73 (43%)

Query: 1110 GNAGNAGDGGNGGQGGTGGQGAAAPSAAKAAGNGGAGGVGGDGGNGADASGGGNGGDGGK 1169
A + NGG G G G A+ + ++ N GG G G + SG GNGG G
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGN 70

Query: 1170 SGGGGTGGTGGKS 1182
SGGG G +
Sbjct: 71 SGGGSGTGGNLSA 83



Score = 33.5 bits (76), Expect = 0.006
Identities = 33/105 (31%), Positives = 37/105 (35%), Gaps = 5/105 (4%)

Query: 381 GAGGNGSRTGAPGTTGNTPTTAAGHGGKGGDGFSPATSGQDGGSGGKGGDAGRFGNGGNG 440
G G G TGA T+GN G G GG S ++ GG G +G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 441 GNGGNGAAGDAAGSGHTGGNGGAGGGGGNAGQFGEPGTGGSGGNG 485
GNGG SG G GG FG P G G
Sbjct: 63 GNGGGN-----GNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 33.1 bits (75), Expect = 0.007
Identities = 28/80 (35%), Positives = 35/80 (43%), Gaps = 1/80 (1%)

Query: 548 GGPGGAGGKGGNSGPGGTNGTGGHGGNGGTGGDGGDGASGVGLGKAGGTGATGGNGGNAG 607
GG G G +S G NG G GG G G G S GG+G+ GG +G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGG-GASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 608 SGGAAGTGGTGGTNGSAGTG 627
G G G +GG +G+ G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 32.8 bits (74), Expect = 0.009
Identities = 26/86 (30%), Positives = 36/86 (41%)

Query: 527 AGTGGNGGAGGNGGTGGDGSDGGPGGAGGKGGNSGPGGTNGTGGHGGNGGTGGDGGDGAS 586
+G G G G T G+ + G G G G + G G ++ GG G+G G G+
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 587 GVGLGKAGGTGATGGNGGNAGSGGAA 612
G G +G G GGN + A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAP 87



Score = 32.8 bits (74), Expect = 0.010
Identities = 29/80 (36%), Positives = 35/80 (43%), Gaps = 1/80 (1%)

Query: 650 AGGTGGAGGTGGNSGPGGTNGTGGHGGNGGTGGDGGDGASGVGQGKAGGTGATGGNGGNA 709
+GG G TG +S G NG G GG G G G S GG+G+ GG +
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGG-GASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 710 GSGGAAGTGGDASGGGTNGS 729
G G G G G GT G+
Sbjct: 61 GHGNGGGNGNSGGGSGTGGN 80



Score = 32.8 bits (74), Expect = 0.010
Identities = 26/80 (32%), Positives = 35/80 (43%)

Query: 403 AGHGGKGGDGFSPATSGQDGGSGGKGGDAGRFGNGGNGGNGGNGAAGDAAGSGHTGGNGG 462
+G G+G + + +TSG G G G +G + N G + H GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 463 AGGGGGNAGQFGEPGTGGSG 482
G GGGN G GTGG+
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 32.8 bits (74), Expect = 0.012
Identities = 23/81 (28%), Positives = 32/81 (39%)

Query: 480 GSGGNGGKGGDGAAGGLAQAGGNGGEGGAGGNAGAGGLSGTGDSTGNAGTGGNGGAGGNG 539
G G G G + G G G G G + G+G S G +G+G + G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 540 GTGGDGSDGGPGGAGGKGGNS 560
G GG + G G G ++
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSA 83



Score = 32.4 bits (73), Expect = 0.012
Identities = 24/79 (30%), Positives = 29/79 (36%)

Query: 757 GGTGGAGGTGGNSGPGGTNGTGGHGGNGGTGGDGGDGASGVGQGKAGGTGATGGSGGNAG 816
G GA T GN G T G G + G+G + G G G G G G G
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 817 SGGAAGTGGNGGTNGNAGT 835
+G + G G GG
Sbjct: 68 NGNSGGGSGTGGNLSAVAA 86



Score = 32.4 bits (73), Expect = 0.014
Identities = 26/83 (31%), Positives = 36/83 (43%)

Query: 995 IGGAGGAGGQGGQATGGGNAGDGGTGGQGGTGGQGATSLFEASSGGTGGAGGAGGLGGQA 1054
+ G G G G + GN G TG G G + ++ GG+G GG +
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 1055 ADGGNAGDGGTGGQGGTGGQGAA 1077
G G+G +GG GTGG +A
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSA 83



Score = 32.4 bits (73), Expect = 0.014
Identities = 32/84 (38%), Positives = 36/84 (42%), Gaps = 2/84 (2%)

Query: 878 NGTGGHGGNGGTGGDGGDGRNGSTGAIGGNGGTGGTGGIGGTAGTGGTGGSAGSAGTGGT 937
+G G G N G G+ G TG G G + G+G GG GS G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 938 GGNGGAGDSGASGGTGGTGGEGGA 961
GNGG G SGG GTGG A
Sbjct: 62 HGNGGGN--GNSGGGSGTGGNLSA 83



Score = 32.0 bits (72), Expect = 0.016
Identities = 33/99 (33%), Positives = 39/99 (39%), Gaps = 1/99 (1%)

Query: 951 GTGGTGGEGGAGIGGKDGGTGGTGGTGGTGGAGVVSGLSAFPGGIGGAGGAGGQGGQATG 1010
G G G G TG G G + G+G S + + GG G GG G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 1011 GGNAGDGGTGGQGGTGGQGATSL-FEASSGGTGGAGGAG 1048
GGN GG G GG A + F + T GAGG
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.0 bits (72), Expect = 0.016
Identities = 31/98 (31%), Positives = 38/98 (38%)

Query: 715 AGTGGDASGGGTNGSAGTGGTGGTGGNGGDGDKGAVGDSGFAGGTGGAGGTGGNSGPGGT 774
+G G G + ++G G TG G G G S GG G+G + G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 775 NGTGGHGGNGGTGGDGGDGASGVGQGKAGGTGATGGSG 812
+G GG GN G G G S V A G A G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPG 99



Score = 32.0 bits (72), Expect = 0.018
Identities = 28/77 (36%), Positives = 33/77 (42%)

Query: 692 GQGKAGGTGATGGNGGNAGSGGAAGTGGDASGGGTNGSAGTGGTGGTGGNGGDGDKGAVG 751
G G+ TGA +G G G GG AS G S GG+G G G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 752 DSGFAGGTGGAGGTGGN 768
+ G G +GG GTGGN
Sbjct: 64 NGGGNGNSGGGSGTGGN 80



Score = 31.6 bits (71), Expect = 0.021
Identities = 25/84 (29%), Positives = 30/84 (35%)

Query: 470 AGQFGEPGTGGSGGNGGKGGDGAAGGLAQAGGNGGEGGAGGNAGAGGLSGTGDSTGNAGT 529
+G G G+ G G G G + G G + N GG SG+G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 530 GGNGGAGGNGGTGGDGSDGGPGGA 553
GNGG GN G G A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVA 85



Score = 31.6 bits (71), Expect = 0.023
Identities = 25/85 (29%), Positives = 33/85 (38%)

Query: 1175 TGGTGGKSLGNLPSGDGGVGGNAGTGGNGGNATDGGAGGKGGDGAAGGTGGNAGEFQSLL 1234
+GG G S G + G G GG A+DG + GG+G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1235 GIGKGGDGGTGGDGGNGGTGTPVGA 1259
GG+G +GG G GG + V A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 31.6 bits (71), Expect = 0.025
Identities = 33/102 (32%), Positives = 40/102 (39%), Gaps = 2/102 (1%)

Query: 667 GTNGTGGHGGNGGTGGD--GGDGASGVGQGKAGGTGATGGNGGNAGSGGAAGTGGDASGG 724
G +G G + G T G+ GG GVG G + G+G + N G G+ G SG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 725 GTNGSAGTGGTGGTGGNGGDGDKGAVGDSGFAGGTGGAGGTG 766
G G G G G G V A T GAGG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 31.6 bits (71), Expect = 0.025
Identities = 28/96 (29%), Positives = 35/96 (36%), Gaps = 8/96 (8%)

Query: 1041 TGGAGGAGGLGGQAADGGNAGDGGTGGQGGTGGQGAAGPTVNDTAGDGGAGGDGGVGGAG 1100
+GG G G + G G G GG G+ + N+ G G G GG+G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1101 GQGGQATDGGNAGNAGDGGNGGQGGTGGQGAAAPSA 1136
GN G G+ G G G AAP A
Sbjct: 62 --------HGNGGGNGNSGGGSGTGGNLSAVAAPVA 89



Score = 31.2 bits (70), Expect = 0.028
Identities = 31/102 (30%), Positives = 37/102 (36%)

Query: 220 HGGDGGTGGTGASGVAGANGGAGQTGLAGTDGGTGGVGGAGGKGGLLFGAGGEGGAGGTG 279
+ G T G G G G G + +G GG G G G G G GG G
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69

Query: 280 GAGGAGGTGGDGLAATTAGGTGGNAGDGGGGGTGGNAGDGGA 321
+GG GTGG+ A G A G G + GA
Sbjct: 70 NSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 31.2 bits (70), Expect = 0.033
Identities = 28/100 (28%), Positives = 35/100 (35%), Gaps = 1/100 (1%)

Query: 438 GNGGNGGNGAAGDAAGSGHTGGNGGAGGGGGNAGQ-FGEPGTGGSGGNGGKGGDGAAGGL 496
G G G N A +G+ + G G GGG + G + GG+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 497 AQAGGNGGEGGAGGNAGAGGLSGTGDSTGNAGTGGNGGAG 536
GGNG GG G G + G G G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 31.2 bits (70), Expect = 0.034
Identities = 33/123 (26%), Positives = 43/123 (34%), Gaps = 6/123 (4%)

Query: 934 TGGTGGNGGAGDSGASGGTGGTGGEGGAGIGGKDGGTGGTGGTGGTGGAGVVSGLSAFPG 993
+GG G G SG G G G G DG + GG+G +
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSG------SGIH 55

Query: 994 GIGGAGGAGGQGGQATGGGNAGDGGTGGQGGTGGQGATSLFEASSGGTGGAGGAGGLGGQ 1053
GG+G G G +GGG+ G G +L +GG + AG L
Sbjct: 56 WGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115

Query: 1054 AAD 1056
AD
Sbjct: 116 IAD 118



Score = 30.8 bits (69), Expect = 0.038
Identities = 24/81 (29%), Positives = 31/81 (38%)

Query: 1147 GVGGDGGNGADASGGGNGGDGGKSGGGGTGGTGGKSLGNLPSGDGGVGGNAGTGGNGGNA 1206
G G G N S GN G G G G + G + + GG G+ G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1207 TDGGAGGKGGDGAAGGTGGNA 1227
+GG G G G+ G +A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSA 83



Score = 30.8 bits (69), Expect = 0.042
Identities = 28/86 (32%), Positives = 38/86 (44%), Gaps = 2/86 (2%)

Query: 632 TGGNGGAGDKGAVGDSGFAGG--TGGAGGTGGNSGPGGTNGTGGHGGNGGTGGDGGDGAS 689
+GG+G + GA SG G TG G G + G G ++ GG G+G G G+
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 690 GVGQGKAGGTGATGGNGGNAGSGGAA 715
G G +G G GGN + A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAP 87



Score = 30.8 bits (69), Expect = 0.043
Identities = 24/79 (30%), Positives = 28/79 (35%)

Query: 595 GTGATGGNGGNAGSGGAAGTGGTGGTNGSAGTGGTGGTGGNGGAGDKGAVGDSGFAGGTG 654
G G N G + G G TG G + G+G + N G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 655 GAGGTGGNSGPGGTNGTGG 673
G GG GNSG G G
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 30.8 bits (69), Expect = 0.043
Identities = 24/85 (28%), Positives = 29/85 (34%)

Query: 790 GGDGASGVGQGKAGGTGATGGSGGNAGSGGAAGTGGNGGTNGNAGTGGTGGTGGNGGDGD 849
GGDG + GG G GGA+ G N G G G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 850 KGAVGDSGFAGGTGGNGGTGGNSGP 874
G+ GG+G G + P
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAP 87


24  MMAR_2048  MMAR_2053  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_2048  0       18       5.733994   ATP-dependent protease ATP-binding subunit
MMAR_2049  0       17       5.223322   hypothetical protein
MMAR_2050  1       16       4.660080   hypothetical protein
MMAR_2051  1       16       4.518513   hypothetical protein
MMAR_2052  1       16       4.121509   amino acid transporter
MMAR_2053  2       18       4.481974   ****PE-PGRS family protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2053  cloacin  39     5e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 39.3 bits (91), Expect = 5e-05
Identities = 34/90 (37%), Positives = 39/90 (43%), Gaps = 5/90 (5%)

Query: 624 GAGGQGGDGAQGGDGGAGVDTTTGAGGDGGAGAGGGTSGFLYGNGGAGGAGGHGGGGPGM 683
G G+G + G TG G GGA G G S GG G+G H GGG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 684 GGDGGDGGDGGRAQLIGNGGNGGAGGTAAP 713
G GG+G GG G+G G AAP
Sbjct: 63 GNGGGNGNSGG-----GSGTGGNLSAVAAP 87



Score = 37.0 bits (85), Expect = 3e-04
Identities = 29/81 (35%), Positives = 33/81 (40%), Gaps = 2/81 (2%)

Query: 246 GSGGQGGNGGSAIETGN--GGAAGAGGTGGAGGHGGWLVGNGGIGGDGGSGGVGGDAGAY 303
G G+G N G+ +GN GG G G GGA GW N GG GSG G +
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 304 GPPANGGAGGHGGMGGVGGDA 324
G G G G G A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSA 83



Score = 35.8 bits (82), Expect = 6e-04
Identities = 37/125 (29%), Positives = 52/125 (41%), Gaps = 2/125 (1%)

Query: 297 GGDAGAYGPPANGGAGG-HGGMGGVGGDAGLLFGSGGAGGVGGNGAIGGTGVMSGAGGAG 355
GGD + A+ +G +GG G+G G GSG + G G+G+ G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 356 GDGGNGGYAQLIGDGGGGGAGGAGASGAGDGDPGTAGGDGLLLSSASAAPLYAVEQQVLG 415
G+GG G + GG + A G T G GL + S SA L A ++
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAV-SISAGALSAAIADIMA 121

Query: 416 AINAP 420
A+ P
Sbjct: 122 ALKGP 126



Score = 35.1 bits (80), Expect = 0.001
Identities = 34/109 (31%), Positives = 42/109 (38%), Gaps = 9/109 (8%)

Query: 225 GAGGSGGSGGAARLFGN------GGAGGSGGQGGNGGSAIETGNGGAAGAG-GTGGAGGH 277
G G G + GA GN G G G G+G S+ GG +G+G GG GH
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 278 GGWLVGNGGIGGDGGSGGVGGDAG--AYGPPANGGAGGHGGMGGVGGDA 324
G GG G G + A A+G PA G G + A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 35.1 bits (80), Expect = 0.001
Identities = 24/79 (30%), Positives = 29/79 (36%)

Query: 612 GSGGVGGSAGLWGAGGQGGDGAQGGDGGAGVDTTTGAGGDGGAGAGGGTSGFLYGNGGAG 671
G G G + G G G G G G +G + GG SG +G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 672 GAGGHGGGGPGMGGDGGDG 690
G GG G G G GG+
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 33.9 bits (77), Expect = 0.002
Identities = 29/77 (37%), Positives = 36/77 (46%), Gaps = 2/77 (2%)

Query: 200 GNGGAGGIGGTGGSGLDGPDSGAAGGAGGSGGSGGAARLFGNGGAGGSG-GQGGNGGSAI 258
G G G T G+ ++G +G G G S GSG ++ GG GSG GG G
Sbjct: 6 GRGHNTGAHSTSGN-INGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGN 64

Query: 259 ETGNGGAAGAGGTGGAG 275
GNG + G GTGG
Sbjct: 65 GGGNGNSGGGSGTGGNL 81



Score = 32.4 bits (73), Expect = 0.007
Identities = 27/80 (33%), Positives = 30/80 (37%), Gaps = 1/80 (1%)

Query: 648 AGGDGGAGAGGGTSGFLYGNGGAGGAGGHGGGGPGMG-GDGGDGGDGGRAQLIGNGGNGG 706
+GGDG G S NGG G G GG G G + GG I GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 707 AGGTAAPGGTDGTSGAAGKG 726
G G + G SG G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 32.4 bits (73), Expect = 0.008
Identities = 30/109 (27%), Positives = 33/109 (30%), Gaps = 6/109 (5%)

Query: 478 GAGGAGYSASGLPPAGIGAPGGAGGAGGSGGWLVGNGGAGGIGGIGASGSFGAPSGQGGN 537
G +S SG G G GGA GW N GG G G G+ G GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 538 GGIGGAAGLFGQGGAGGNGGQGGGDDFALSAGAGGAGGAGGHGGQLYGD 586
G G G G A A GAGG +
Sbjct: 68 NGNSGG------GSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 32.0 bits (72), Expect = 0.010
Identities = 30/83 (36%), Positives = 35/83 (42%), Gaps = 5/83 (6%)

Query: 186 GGSGGVGGTGGWLYGNGGAGGIGGTGGSGLDGPDSGAAGGAGGSGGSGGAARLFGNGGAG 245
G + G T G + NGG G+G GG DG + G GGSG G G G
Sbjct: 8 GHNTGAHSTSGNI--NGGPTGLGVGGG-ASDGSGWSSENNPWG-GGSGSGIHWGGGSGHG 63

Query: 246 GSGGQGGNGGSAIETGNGGAAGA 268
GG GN G TG +A A
Sbjct: 64 NGGG-NGNSGGGSGTGGNLSAVA 85



Score = 30.8 bits (69), Expect = 0.023
Identities = 33/98 (33%), Positives = 41/98 (41%), Gaps = 6/98 (6%)

Query: 566 LSAGAGGAGGAGGHGGQLYGDGGGGGAGGVGGSVDIAESGVTGGRGGSGGVGGSAGLWGA 625
+S G G G H +GG G G GG+ D SG + GG GS WG
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASD--GSGWSSENNPWGGGSGSGIHWGG 58

Query: 626 GGQGGDGAQGGDGGAGVDTTTGAGGDGGAGAGGGTSGF 663
G G+G G+ G G +G GG+ A A GF
Sbjct: 59 GSGHGNGGGNGNSGGG----SGTGGNLSAVAAPVAFGF 92



Score = 30.5 bits (68), Expect = 0.030
Identities = 30/87 (34%), Positives = 37/87 (42%), Gaps = 4/87 (4%)

Query: 513 NGGAGGIGGIGASGSFGAPSGQGGNGGIGGAAGLFGQGGAGGNGGQGGGDDFALSAGAGG 572
+GG G GA + G +G G+GG A G G + N GGG + G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGAS-DGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 573 AGGAGGHGGQLYGDGGGGGAGGVGGSV 599
G GG G GGG G GG +V
Sbjct: 61 GHGNGGGNG---NSGGGSGTGGNLSAV 84



Score = 30.1 bits (67), Expect = 0.034
Identities = 39/120 (32%), Positives = 47/120 (39%), Gaps = 15/120 (12%)

Query: 452 GGAGGTGTAAHPTGGS--GGSAGLIGNGGAG-GAGYSASGLPPAGIGAPGGAGGAGGSGG 508
G G T AH T G+ GG GL GGA G+G+S+ P GG G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNP------------WGGGSG 51

Query: 509 WLVGNGGAGGIGGIGASGSFGAPSGQGGNGGIGGAAGLFGQGGAGGNGGQGGGDDFALSA 568
+ GG G G G +G+ G SG GGN A FG G G + A
Sbjct: 52 SGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 30.1 bits (67), Expect = 0.035
Identities = 29/98 (29%), Positives = 38/98 (38%), Gaps = 4/98 (4%)

Query: 154 HPTGGSGGSAGLIGTGGAGGAGYGPAAQTGATGGSGGVGGTGGWLYGNGGAGGIGGTGGS 213
H TG S + G G G G + +G + + GG G GG G G GG+
Sbjct: 9 HNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGN 68

Query: 214 GLDGPDSGAAGGAGGSGGSGGAARLFGNGGAGGSGGQG 251
G +SG G GG+ + A FG G G
Sbjct: 69 G----NSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 30.1 bits (67), Expect = 0.040
Identities = 28/105 (26%), Positives = 36/105 (34%)

Query: 134 GGDGGLLWGNGGAGGVGTAAHPTGGSGGSAGLIGTGGAGGAGYGPAAQTGATGGSGGVGG 193
G + G +G G T GG+ +G G G G G G G GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 194 TGGWLYGNGGAGGIGGTGGSGLDGPDSGAAGGAGGSGGSGGAARL 238
G G+G G + G + + GAGG S A L
Sbjct: 68 NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGAL 112


25  MMAR_2092  MMAR_2113  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_2092  1       12       3.651795   alpha-mannosyltransferase PimA
MMAR_2093  2       13       3.724528   hypothetical protein
MMAR_2094  1       12       2.754917   pyridoxal biosynthesis lyase PdxS
MMAR_2095  5       14       5.963971   acyl-CoA thioesterase II TesB2
MMAR_2096  6       16       6.148456   glutamine amidotransferase subunit PdxT
MMAR_2097  9       18       7.463829   PE-PGRS family protein
MMAR_2098  7       16       5.406568   hypothetical protein
MMAR_2099  7       14       5.467316   transcriptional regulator
MMAR_2100  7       15       5.866894   PE-PGRS family protein
MMAR_2101  6       13       3.676258   hypothetical protein
MMAR_2102  5       14       4.032909   PE-PGRS family protein
MMAR_2103  -1      11       -0.324396  PE-PGRS family protein
MMAR_2104  1       14       1.119778   spermidine synthase
MMAR_2105  -1      11       0.496541   hypothetical protein
MMAR_2106  -1      14       0.858367   hypothetical protein
MMAR_2107  0       15       4.081045   hypothetical protein
MMAR_2108  1       19       5.290459   hypothetical protein
MMAR_2109  2       17       5.264113   Holliday junction resolvase
MMAR_2110  1       18       5.289597   Holliday junction DNA helicase RuvA
MMAR_2111  6       20       7.029559   Holliday junction DNA helicase RuvB
MMAR_2112  2       20       5.218766   PE-PGRS family protein
MMAR_2113  0       18       3.996408   PE-PGRS family protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2097  cloacin  39     7e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.5 bits (89), Expect = 7e-05
Identities = 34/114 (29%), Positives = 46/114 (40%), Gaps = 8/114 (7%)

Query: 245 GAGGHGGTGGTSASGTGATGGSGGAGGLLFSPGGAGGDGGAGFSGADGGAGGNGGAGGLL 304
G G G G ++ GG G G G G G+G+S + GG G+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGV------GGGASDGSGWSSENNPWGGGSGSGIHW 56

Query: 305 FGTGGGGGEGGATSPSSSTGSGGDGGIGGTSGLFG--TGGTGGAGGAAANATGG 356
G G G GG + +G+GG+ FG T GAGG A + + G
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 35.5 bits (81), Expect = 7e-04
Identities = 35/112 (31%), Positives = 42/112 (37%), Gaps = 6/112 (5%)

Query: 339 GTGGTGGAGGAAANATGGNGGAGGGGLWFGNGGAGGIGGFDAHGNGGDGGAGGNAGIYGG 398
G G G GA + + NGG G G GGA G+ + N GG+G GG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLG---VGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 399 NGGAGGTGGVGVGGNLFTGGQGGAGGNA---GLLAGNGGAGGNGGVRFSGNA 447
+G G G GG TGG A G A + G V S A
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 35.1 bits (80), Expect = 8e-04
Identities = 28/85 (32%), Positives = 32/85 (37%), Gaps = 3/85 (3%)

Query: 461 MFGNGGAGGAGGDRVAGSQGNGGDGGDGGHGGTYFGSGGAGGH---GGYDDSGSGGQGGH 517
M G G G G NGG G G GG GSG + + GG SG GG
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 518 GGDGGAAGTIGNGGDGGTGGDALVS 542
G G GG G G + V+
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVA 85



Score = 34.7 bits (79), Expect = 0.001
Identities = 27/90 (30%), Positives = 35/90 (38%)

Query: 279 AGGDGGAGFSGADGGAGGNGGAGGLLFGTGGGGGEGGATSPSSSTGSGGDGGIGGTSGLF 338
+GGDG +GA +G G L GG G +S ++ G G GI G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 339 GTGGTGGAGGAAANATGGNGGAGGGGLWFG 368
G G + TGGN A + FG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 33.9 bits (77), Expect = 0.002
Identities = 28/85 (32%), Positives = 36/85 (42%), Gaps = 4/85 (4%)

Query: 309 GGGGEGGATSPSSSTGSGGDGGIGGTSGLFGTGGTGGAGGAAANATGGNGGAGGGGLWFG 368
GG G G T S++G+ GG +GL GG G ++ GG+G G W G
Sbjct: 3 GGDGRGHNTGAHSTSGNIN----GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 369 NGGAGGIGGFDAHGNGGDGGAGGNA 393
G G GG G G G +A
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSA 83



Score = 32.8 bits (74), Expect = 0.004
Identities = 30/82 (36%), Positives = 35/82 (42%), Gaps = 2/82 (2%)

Query: 397 GGNGGAGGTGGVGVGGNLFTGGQGGAGGNAGLLAGNG--GAGGNGGVRFSGNAGAGGAGG 454
G N GA T G GG G GGA +G + N G G G+ + G +G G GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 455 TGGDAGMFGNGGAGGAGGDRVA 476
G G G GG A VA
Sbjct: 68 NGNSGGGSGTGGNLSAVAAPVA 89



Score = 31.6 bits (71), Expect = 0.009
Identities = 28/81 (34%), Positives = 34/81 (41%), Gaps = 8/81 (9%)

Query: 509 SGSGGQGGHGGDGGAAGTIGNGGDGGTGGDALVSGGTGGD-------GGDGGDAREIGNG 561
SG G+G + G +G I NGG G G S G+G GG G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNI-NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 562 GNGGNAGAGATAGNEGTGGTG 582
G+G G G + G GTGG
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNL 81



Score = 31.2 bits (70), Expect = 0.013
Identities = 31/85 (36%), Positives = 38/85 (44%), Gaps = 3/85 (3%)

Query: 217 IGGAGGEGGNSATTAGVGGA-GGAGGLLVGAGGHGGTGGTSASGTGATGGSGGAGGLLFS 275
+ G G G N+ + G GG GL VG G G+G +S + GG G+G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSEN--NPWGGGSGSGIHWGG 58

Query: 276 PGGAGGDGGAGFSGADGGAGGNGGA 300
G G GG G SG G GGN A
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSA 83



Score = 30.8 bits (69), Expect = 0.016
Identities = 23/86 (26%), Positives = 31/86 (36%)

Query: 445 GNAGAGGAGGTGGDAGMFGNGGAGGAGGDRVAGSQGNGGDGGDGGHGGTYFGSGGAGGHG 504
G GA T G+ G G G +G G G G ++G G G+G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 505 GYDDSGSGGQGGHGGDGGAAGTIGNG 530
G + + GG G G A + G
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 30.5 bits (68), Expect = 0.020
Identities = 33/107 (30%), Positives = 38/107 (35%), Gaps = 4/107 (3%)

Query: 320 SSSTGSGGDGGIGGTSGLFGTGGTGGAGGAAANATGGNGGAGGGGLWFGNGGAGGIGGFD 379
S G G + G TSG G TG G A+ G +G G GI
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNP---WGGGSGSGIHWGG 58

Query: 380 AHGNGGDGGAGGNAGIYGGNGGAGGTGGVGVGGNLFTGGQGGAGGNA 426
G+G GG GN+G G GG V GAGG A
Sbjct: 59 GSGHGNGGG-NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 30.1 bits (67), Expect = 0.029
Identities = 30/85 (35%), Positives = 36/85 (42%), Gaps = 5/85 (5%)

Query: 125 GADGTAPGQAGGDGGLLYGNGGAGGPGGAGGNAGLIGNGGAGGSGAALGLFGGTGGNGGL 184
GA T+ GG GL G G + G G + N N GGSG+ + GG+G G
Sbjct: 12 GAHSTSGNINGGPTGLGVGGGASDGSGWSSEN-----NPWGGGSGSGIHWGGGSGHGNGG 66

Query: 185 LFGNGGTGGAAGDLASGVGLPGGAG 209
GN G G G S V P G
Sbjct: 67 GNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 29.7 bits (66), Expect = 0.039
Identities = 30/106 (28%), Positives = 37/106 (34%), Gaps = 6/106 (5%)

Query: 397 GGNGGAGGTGGVGVGGNLFTGGQGGAGGNAGLLAGNGGAGGNGGVRFSGNAGAGGAGGTG 456
GG+G TG GN+ GG G G G G+G + N +G GG+G
Sbjct: 3 GGDGRGHNTGAHSTSGNI-NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 457 GDAGMFGNGGAGGAGGDRVAGSQGNGGDGGDGGHGGTYFGSGGAGG 502
GNGG G G G + GAGG
Sbjct: 62 H-----GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 29.3 bits (65), Expect = 0.048
Identities = 23/61 (37%), Positives = 27/61 (44%)

Query: 209 GGHAGLFGIGGAGGEGGNSATTAGVGGAGGAGGLLVGAGGHGGTGGTSASGTGATGGSGG 268
GG GL GGA G S+ GG G+G G GHG GG SG G+ G
Sbjct: 22 GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNL 81

Query: 269 A 269
+
Sbjct: 82 S 82


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2100  cloacin  37     8e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 36.6 bits (84), Expect = 8e-04
Identities = 31/101 (30%), Positives = 38/101 (37%)

Query: 516 SGGDGGAGGAGGAGGDGGLVAGNGGVGGAGGIGGVGGTGGDGSVGVDAAGAGQDGGVGGA 575
SGGDG G G + G G+G GG G + + +G+G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 576 GGAGGAGGAGGAGGEGGAGGHALAAGYADGSQGAGGAGGAG 616
G GG G G G G A+AA A G G G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 35.1 bits (80), Expect = 0.002
Identities = 34/88 (38%), Positives = 39/88 (44%), Gaps = 7/88 (7%)

Query: 144 GSGGVGQAGGAGGAAGLIGSGGAGGAGGAGGTGGAGGAGGWLYGNGGAGGVGGAGAVGGA 203
G G G GA +G I GG G G GGA GW N GG G+G G
Sbjct: 3 GGDGRGHNTGAHSTSGNIN----GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 204 GGNTWLWGNGGAGGSGGVGSGSGGAGGS 231
G GNGG G+ G GSG+GG +
Sbjct: 59 GSGH---GNGGGNGNSGGGSGTGGNLSA 83



Score = 33.9 bits (77), Expect = 0.006
Identities = 32/120 (26%), Positives = 46/120 (38%)

Query: 425 GAGGAGGTGGSGAGGSRAATGATGSTPSSGGNGGAGGAGADSITTGGAGAAGGTGGDGGL 484
G G G G+ + G TG G + G+G + ++ GG+G+ GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 485 VGDGGAGGDGGAGLGGAPGTSVIFPGGQPGSSGGDGGAGGAGGAGGDGGLVAGNGGVGGA 544
GG G GG G ++V P + GAGG + G L A + A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMAA 122



Score = 33.5 bits (76), Expect = 0.006
Identities = 26/87 (29%), Positives = 31/87 (35%)

Query: 551 GGTGGDGSVGVDAAGAGQDGGVGGAGGAGGAGGAGGAGGEGGAGGHALAAGYADGSQGAG 610
GG G + G + +GG G G GGA G E G +G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 611 GAGGAGGIGGDGAAGGKGAEGAAAAGA 637
G GG G G G+ G AA A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVA 89



Score = 32.8 bits (74), Expect = 0.010
Identities = 23/86 (26%), Positives = 31/86 (36%)

Query: 1026 GDGGAGGAGGAGGDGGAVAGDGGRGGAGGDGAMGGNGGNGFDGLHGTTPGANGQYGGDGG 1085
G G GA G+ G GG DG+ + N + G G+ G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 1086 EGGRGGVGGAGGAGGAAAAGQAGSQG 1111
G GG+G G +A + G
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 32.8 bits (74), Expect = 0.012
Identities = 37/115 (32%), Positives = 45/115 (39%), Gaps = 4/115 (3%)

Query: 1250 GHGGDGGDAGDSGSSAFGVGSPGGGGGQGGFGVAGGGDGGDGGNGGAGGFGQNGGPGGRG 1309
G G G + G +S G P G G GG G + GG G G + G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1310 GGGGHSLVGPGGDGGLGGTGGNGGNGSQPPFGSTPAGSGGDGGNGGAGGSSGFVS 1364
G GG + G GG GTGGN + P PA S G S+G +S
Sbjct: 63 GNGGGN----GNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALS 113



Score = 32.0 bits (72), Expect = 0.018
Identities = 27/81 (33%), Positives = 34/81 (41%), Gaps = 2/81 (2%)

Query: 1094 GAGGAGGAAAAGQAGSQGDGGNGGDGGDGGTPGNGGSGADGANSAIGVSAGDGGYGGAGG 1153
G G G A +GG G G GG + GSG N+ G +G G + G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGG--ASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 1154 NAGAGGLGGEGGAGSTSGASG 1174
G GG G G GS +G +
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNL 81



Score = 32.0 bits (72), Expect = 0.019
Identities = 25/79 (31%), Positives = 31/79 (39%)

Query: 413 GTGGNGGNGGDPGAGGAGGTGGSGAGGSRAATGATGSTPSSGGNGGAGGAGADSITTGGA 472
G G G N G G G +G G A+ +G + + GG G+G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 473 GAAGGTGGDGGLVGDGGAG 491
G GG G GG G GG
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 31.6 bits (71), Expect = 0.023
Identities = 29/69 (42%), Positives = 32/69 (46%), Gaps = 1/69 (1%)

Query: 216 GGSGGVGSGSGGAGGSGGWLYGNGGAGGTGGVADGVGEGGGHGGAGGNARLLGTGGAGGD 275
GG G+G G G + GS GW N GG G G G GHG GGN G G GG+
Sbjct: 22 GGPTGLGVGGGASDGS-GWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGN 80

Query: 276 GGAGLAGAA 284
A A A
Sbjct: 81 LSAVAAPVA 89



Score = 31.6 bits (71), Expect = 0.024
Identities = 31/117 (26%), Positives = 44/117 (37%), Gaps = 1/117 (0%)

Query: 872 GAGGAGGAGGLAQATGYLDGSHGSGGSGGAGGQAGNAGDGGDGADATVAGGKGGAGGNGG 931
G G G G +G ++G G GG G G+ + +G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGG-GASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 932 DAGVGGSGGLGGDSGNGTHAANGASAGAYGTGGNGGAGGDGADATAAGQAGGAGGAG 988
GG+G GG SG G + + A+ A+G G G + + A A A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 31.6 bits (71), Expect = 0.025
Identities = 27/89 (30%), Positives = 31/89 (34%)

Query: 1147 GYGGAGGNAGAGGLGGEGGAGSTSGASGLDGSQAAGGDGGNGGFGGTGGLFEAAGSGGAG 1206
G G G N GA G G T G S +G N +GG G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1207 GVGGDGSQGGDGGDGGAGGSSPGATGGWG 1235
G GG G G G S+ A +G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 31.6 bits (71), Expect = 0.027
Identities = 35/112 (31%), Positives = 41/112 (36%), Gaps = 4/112 (3%)

Query: 1284 GGGDGGDGGNGGAGGFGQNGGPGGRGGGGGHSLVGPGGDGGLGGTGGNGGNGSQPPFGST 1343
G G G + G G NGGP G G GGG S G GG+GS +G
Sbjct: 4 GDGRGHNTGAHSTSG-NINGGPTGLGVGGGAS---DGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 1344 PAGSGGDGGNGGAGGSSGFVSGQSGTDGQDGGDPSGQFGGTGGAGGSGGAGA 1395
G G GGS + + G P+ G GG S AGA
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 31.2 bits (70), Expect = 0.036
Identities = 30/106 (28%), Positives = 36/106 (33%), Gaps = 2/106 (1%)

Query: 1183 GDGGNGGFGGTGGLFEAAGSGGAGGVGGDGSQGGDGGDGGAGGSSPGATGGWGGQGGLGG 1242
G G N G T G G G G GG G + G G+ WGG G G
Sbjct: 6 GRGHNTGAHSTSGNING-GPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGN 64

Query: 1243 VGTAGVGGHGGDGGDAGDSGSSAFGVGSPG-GGGGQGGFGVAGGGD 1287
G G G G G + ++ G P G GG V+
Sbjct: 65 GGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 30.8 bits (69), Expect = 0.049
Identities = 31/101 (30%), Positives = 35/101 (34%), Gaps = 1/101 (0%)

Query: 1088 GRGGVGGAGGAGGAAAAGQAGSQGDGGNGGDGGDGGTPGNGGSGADGANSAIGVSAGDGG 1147
G G G GA + G G G GG G G+ S I G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIH-WGGGSG 61

Query: 1148 YGGAGGNAGAGGLGGEGGAGSTSGASGLDGSQAAGGDGGNG 1188
+G GGN +GG G GG S A G A G G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
MMAR_2101  PERTACTIN  27     0.036    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 27.4 bits (60), Expect = 0.036
Identities = 16/49 (32%), Positives = 22/49 (44%)

Query: 56 HIATLIGYTRGDGGFQWENAMGDLAIGVVGIMAYWFRGHFWLATIVVLS 104
H+ L GYTRGD GF + ++ V G Y F+L + S
Sbjct: 703 HLGGLAGYTRGDRGFTGDGGGHTDSVHVGGYATYIANSGFYLDATLRAS 751


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2102  cloacin  37     5e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 37.0 bits (85), Expect = 5e-04
Identities = 29/91 (31%), Positives = 37/91 (40%)

Query: 332 VSGAAGSGGHGGTGGAAGLWGVGGHGGDGAHGGAGASGGAGDAGSGGGDAGDGGAGGRGG 391
+SG G G + G +G G G G + SG + + GG +G G G G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 392 WLVGGGGAGGSAGSGGGGGAGGSGANAVTLG 422
GGG G S G G GG + A V G
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 37.0 bits (85), Expect = 5e-04
Identities = 39/109 (35%), Positives = 45/109 (41%), Gaps = 6/109 (5%)

Query: 999 GNGGKGGNGGAGG-----NGAAGSNASGAGATGGTGLMGGTGGSGGAGGEGGALAGNGGQ 1053
G G+G N GA NG G GA+ G+G GG G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1054 GGSGGSGGIGGTGGQGGGGSAGGAGVA-GVQDGAGGAGGGGGLGGSGGA 1101
G GG+G GG G GG SA A VA G + GG + S GA
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 36.2 bits (83), Expect = 0.001
Identities = 29/86 (33%), Positives = 36/86 (41%)

Query: 1169 GGGGTGGAGGAGGTSGDGVTVAGAGPTGGTGAGGAGGNGGAGGNADGGGIVNPEYGDGGA 1228
GG G G GA TSG+ GG + G+G + G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1229 GGNGGNGSSGGDGGSGGTGGRSGAGI 1254
G GGNG+SGG G+GG A +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPV 88



Score = 35.8 bits (82), Expect = 0.001
Identities = 33/99 (33%), Positives = 36/99 (36%)

Query: 1033 GTGGSGGAGGEGGALAGNGGQGGSGGSGGIGGTGGQGGGGSAGGAGVAGVQDGAGGAGGG 1092
G G + GA G + G G GG G GG+G G G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 1093 GGLGGSGGAGGAGGVGGIGGQAHAGGAFHDGDAGAGGLG 1131
GG G SGG G GG A G GAGGL
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 35.1 bits (80), Expect = 0.002
Identities = 32/92 (34%), Positives = 37/92 (40%), Gaps = 5/92 (5%)

Query: 432 GDGGAGGVGGAGGRGGWISFLSGQGAGGAGGDGGAGGGAGN---GGDGAVGTFFGGTGAG 488
G G G GA G I+ G G G+G + N GG G +GG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 489 GNGGHGGDPGSGGAGGAGGAGSSAGAAGAGGL 520
GNGG G SGG G GG S+ A A G
Sbjct: 63 GNGGGNG--NSGGGSGTGGNLSAVAAPVAFGF 92



Score = 34.7 bits (79), Expect = 0.003
Identities = 36/103 (34%), Positives = 43/103 (41%), Gaps = 4/103 (3%)

Query: 806 TGGDGSTGGTGGTGGAGGSGGAMAGKGGDGGAG---GMGGGGGVGGNGSNGDHGVSGGNV 862
+GGDG TG +G G G G GGA G G GS GG+
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 863 NGGTGGDGGKGGSGGQGGNGGAAGKALAASY-ADGAEGAGGAG 904
+G GG+G GG G GGN A +A + A GAGG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 33.9 bits (77), Expect = 0.005
Identities = 29/81 (35%), Positives = 35/81 (43%), Gaps = 1/81 (1%)

Query: 704 SGGNGIGGTGGDGGDNGAGGAGGTGGSGSTTGSDGASGTTTTSGGNGGNGGRGADSVMIG 763
SGG+G G G +G G TG SDG SG ++ + GG G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDG-SGWSSENNPWGGGSGSGIHWGGGS 60

Query: 764 GKGAAGGDGGDGGLYGNGGKG 784
G G GG+G GG G GG
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNL 81



Score = 33.9 bits (77), Expect = 0.005
Identities = 26/99 (26%), Positives = 34/99 (34%)

Query: 1251 GAGIYNKGGTGGVGGDGGNGTNGAGGKGGSGGNGGRGADASLFADAGDGGTGGDGGDGGT 1310
G G + G G+ G G G GG+ G ++ + + G GG G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 1311 GTTGAGRGGTGGAGGGGGTGTGTPPPFGTGAPGGSGGTG 1349
G G G G GG + P FG A G G
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 33.5 bits (76), Expect = 0.006
Identities = 32/113 (28%), Positives = 41/113 (36%), Gaps = 3/113 (2%)

Query: 589 GIGGSGGEGGAGGMGGAMAGHGGDGGAGGHGGQGGSGGSGSRGADGVTGPNPNGGGGGDG 648
G G G GA G + G G GG G S + G +G + GGG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 649 GAGGAGGQGGNG---GLAGHAQAAGYSDGVQGVGGAGGKGGAGGLAGDGGTGA 698
G GG G G G G A AA + G + G G A ++ + A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 33.5 bits (76), Expect = 0.006
Identities = 33/124 (26%), Positives = 38/124 (30%)

Query: 1071 GGSAGGAGVAGVQDGAGGAGGGGGLGGSGGAGGAGGVGGIGGQAHAGGAFHDGDAGAGGL 1130
GG G GG GLG GGA G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1131 GGKGGTGGTGGKGGTGGGGADATVFEPFAGNGGHGGAGGGGGTGGAGGAGGTSGDGVTVA 1190
G GG G +GG GTGG + F GG + GA + + A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMAA 122

Query: 1191 GAGP 1194
GP
Sbjct: 123 LKGP 126



Score = 33.5 bits (76), Expect = 0.006
Identities = 33/108 (30%), Positives = 42/108 (38%)

Query: 598 GAGGMGGAMAGHGGDGGAGGHGGQGGSGGSGSRGADGVTGPNPNGGGGGDGGAGGAGGQG 657
G G G H G G G GG S G+ + NP GGG G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 658 GNGGLAGHAQAAGYSDGVQGVGGAGGKGGAGGLAGDGGTGAAGTFASG 705
GNGG G++ + G A G L+ G G A + ++G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 33.5 bits (76), Expect = 0.006
Identities = 37/113 (32%), Positives = 44/113 (38%), Gaps = 6/113 (5%)

Query: 144 GNGGSGGSGAAGQAGGA--GGAAGLIGNGGAGGAGGQGMFN----GGSGGAGGWAGLIGA 197
G G G + A G GG GL GGA G N GGSG W G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 198 GGAGGVGGTGVALDGGAGGAGGNAGVLFGPGGIGGSGGQGMASGGAGGAGGAS 250
G GG G +G G + A V FG + G G+A + GA A+
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 33.1 bits (75), Expect = 0.008
Identities = 39/112 (34%), Positives = 47/112 (41%), Gaps = 9/112 (8%)

Query: 483 GGTGAGGNGGHGGDPGSGGAGGAGGAGSSAGAAGAGGLSPTTGGNGGNGGRGADGYGTGI 542
GG G G N G G+ GG G G GA+ G S GG G G+GI
Sbjct: 3 GGDGRGHNTGAHSTSGNIN-GGPTGLGVGGGASDGSGWSSENNPWGG-------GSGSGI 54

Query: 543 SGASGGAGGDGGRYGNGGDG-GAGGDGMGGASGFSIVFPPGQDGGGGGIGGS 593
G G+GG GN G G G GG+ A+ + FP G GG+ S
Sbjct: 55 HWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVS 106



Score = 33.1 bits (75), Expect = 0.008
Identities = 33/107 (30%), Positives = 40/107 (37%), Gaps = 5/107 (4%)

Query: 1103 GAGGVGGIGGQAHAGGAFHDGDAGAGGLGGKGGTGGTGGKGGTGGGGADATVFEPFAGNG 1162
G G G G G + G G G GG G + GGG+ + + G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGI-----HWG 57

Query: 1163 GHGGAGGGGGTGGAGGAGGTSGDGVTVAGAGPTGGTGAGGAGGNGGA 1209
G G G GGG G +GG GT G+ VA G G G A
Sbjct: 58 GGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.8 bits (74), Expect = 0.010
Identities = 27/103 (26%), Positives = 36/103 (34%)

Query: 939 NGAGGAGGKGGTGLTTGADGATGSRLTAGGNGGDGGDGGSAATAGAKGGAGGVGGDGGLY 998
+G G G G T+G + L GG DG S G G+ GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 999 GNGGKGGNGGAGGNGAAGSNASGAGATGGTGLMGGTGGSGGAG 1041
G G GG+G G+ ++ A T G+GG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.4 bits (73), Expect = 0.013
Identities = 31/108 (28%), Positives = 41/108 (37%), Gaps = 1/108 (0%)

Query: 946 GKGGTGLTTGADGATGS-RLTAGGNGGDGGDGGSAATAGAKGGAGGVGGDGGLYGNGGKG 1004
G G G TGA +G+ G G GG + + GG G G +G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1005 GNGGAGGNGAAGSNASGAGATGGTGLMGGTGGSGGAGGEGGALAGNGG 1052
GNGG GN GS G + + G G G A++ + G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 32.4 bits (73), Expect = 0.013
Identities = 34/100 (34%), Positives = 42/100 (42%), Gaps = 12/100 (12%)

Query: 1195 TGGTGAGGAGGNGGAGGNADGGGIVNPEYGDGGAGGNGGNGSSGGDGGSGGTGGRSGAGI 1254
+GG G G G GN +GG G G G + SG + GG SG+GI
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGP-------TGLGVGGGASDGSGWSSENNPWGGGSGSGI 54

Query: 1255 YNKGGTGGVGGDGGNGTNGAGGKGGSGGNGGRGADASLFA 1294
+ GG+G GNG GGSG G A A+ A
Sbjct: 55 HWGGGSGH-----GNGGGNGNSGGGSGTGGNLSAVAAPVA 89



Score = 32.4 bits (73), Expect = 0.015
Identities = 27/74 (36%), Positives = 32/74 (43%)

Query: 1190 AGAGPTGGTGAGGAGGNGGAGGNADGGGIVNPEYGDGGAGGNGGNGSSGGDGGSGGTGGR 1249
GA T G GG G G GG +DG G + GG G+G + G G+GG G
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGN 70

Query: 1250 SGAGIYNKGGTGGV 1263
SG G G V
Sbjct: 71 SGGGSGTGGNLSAV 84



Score = 32.0 bits (72), Expect = 0.017
Identities = 27/90 (30%), Positives = 34/90 (37%)

Query: 582 GQDGGGGGIGGSGGEGGAGGMGGAMAGHGGDGGAGGHGGQGGSGGSGSRGADGVTGPNPN 641
G DG G G G G + GG G + G GS G + +
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 642 GGGGGDGGAGGAGGQGGNGGLAGHAQAAGY 671
G GGG+G +GG G GGN A G+
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGF 92



Score = 31.6 bits (71), Expect = 0.027
Identities = 29/120 (24%), Positives = 34/120 (28%)

Query: 1040 AGGEGGALAGNGGQGGSGGSGGIGGTGGQGGGGSAGGAGVAGVQDGAGGAGGGGGLGGSG 1099
+GG+G +GG G G GG G G G G GGSG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1100 GAGGAGGVGGIGGQAHAGGAFHDGDAGAGGLGGKGGTGGTGGKGGTGGGGADATVFEPFA 1159
G G GG G A G G G G A + + A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMA 121



Score = 31.6 bits (71), Expect = 0.028
Identities = 25/81 (30%), Positives = 31/81 (38%)

Query: 296 SGSGGAGGDGGLGGLVYGNGGGGGAGGVGGAGGAGIVSGAAGSGGHGGTGGAAGLWGVGG 355
SG G G + G GG GVGG G + + GG+G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 356 HGGDGAHGGAGASGGAGDAGS 376
HG G +G +G G G S
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLS 82



Score = 31.6 bits (71), Expect = 0.028
Identities = 41/120 (34%), Positives = 47/120 (39%), Gaps = 12/120 (10%)

Query: 784 GGDGGTGGTGQGGISFLAPGGQTGGDGSTGGTGGTG------------GAGGSGGAMAGK 831
GGDG TG S GG TG G + G+G G+G G +G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 832 GGDGGAGGMGGGGGVGGNGSNGDHGVSGGNVNGGTGGDGGKGGSGGQGGNGGAAGKALAA 891
G GG G GGG G GGN S V+ G T G GG S G A +AA
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMAA 122



Score = 30.8 bits (69), Expect = 0.039
Identities = 24/72 (33%), Positives = 27/72 (37%)

Query: 376 SGGGDAGDGGAGGRGGWLVGGGGAGGSAGSGGGGGAGGSGANAVTLGSAGGNGGNGGDGG 435
SGG G + GG G G G G+G S N G +G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 436 AGGVGGAGGRGG 447
G GG G GG
Sbjct: 62 HGNGGGNGNSGG 73



Score = 30.8 bits (69), Expect = 0.046
Identities = 25/79 (31%), Positives = 35/79 (44%), Gaps = 3/79 (3%)

Query: 915 NGGDGADAAAGSAGTGGNGGHGGNNGAGGAGGKGGTGLTTGAD---GATGSRLTAGGNGG 971
+GGDG G+ T GN G G G G+G ++ + G +GS + GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 972 DGGDGGSAATAGAKGGAGG 990
G GG+ + G G G
Sbjct: 62 HGNGGGNGNSGGGSGTGGN 80



Score = 30.8 bits (69), Expect = 0.046
Identities = 27/79 (34%), Positives = 35/79 (44%), Gaps = 1/79 (1%)

Query: 1139 TGGKGGTGGGGADATVFEPFAGNGGHGGAGGGGGTGGAGGAGGTSGDGVTVAGAGPTGGT 1198
+GG G GA +T G G G GGG G + G + +G GG+
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTG-LGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 1199 GAGGAGGNGGAGGNADGGG 1217
G G GGNG +GG + GG
Sbjct: 61 GHGNGGGNGNSGGGSGTGG 79



Score = 30.8 bits (69), Expect = 0.047
Identities = 38/120 (31%), Positives = 44/120 (36%), Gaps = 7/120 (5%)

Query: 213 GAGGAGGNAGVLFGPGGI-GGSGGQGMASGGAGGAGGASGLVGNGAVGGAGGIGTTDGGA 271
G G G N G G I GG G G+ G + G+G +S G G+G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 272 GGQGGNARLFGTGGVGGHGGTGAGSGSGGAGGDGGLGGLVYGNGGGGGAGGVGGAGGAGI 331
G GGN G GG GTG + A G L GG GA A I
Sbjct: 63 GNGGGN------GNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAI 116


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2103  cloacin  45     6e-07    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 45.5 bits (107), Expect = 6e-07
Identities = 38/109 (34%), Positives = 42/109 (38%), Gaps = 12/109 (11%)

Query: 127 NGAPGTGQAGGA----GGILWGNGGAGGSGAPGQQGGSGGNAGLIGNGGVGGVGGIGGGV 182
+G G G GA G I NGG G G G G S G+ N GG G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNI---NGGPTGLGVGG--GASDGSGWSSENNPWGGGSGSGIHW 56

Query: 183 GGVGGTGGWLLGNGGTGGTGGVGTGNIAGGAGGFGGSALSLLGNPGATG 231
GG G G GG+G G + FG ALS PGA G
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALS---TPGAGG 102



Score = 41.2 bits (96), Expect = 1e-05
Identities = 32/109 (29%), Positives = 43/109 (39%), Gaps = 15/109 (13%)

Query: 148 AGGSGAPGQQGGSGGNAGLIGNGGVGGVGGIGGGV------------GGVGGTG-GWLLG 194
+GG G G G+ +G I NGG G+G GG GG G+G W G
Sbjct: 2 SGGDGR-GHNTGAHSTSGNI-NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 195 NGGTGGTGGVGTGNIAGGAGGFGGSALSLLGNPGATGTPGGHADVLYLS 243
+G G G +G +G G A + A TPG + +S
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSIS 108



Score = 40.9 bits (95), Expect = 2e-05
Identities = 35/111 (31%), Positives = 42/111 (37%), Gaps = 13/111 (11%)

Query: 123 GEGANGAPGTGQA---GGAGGILWGNGGAGGSGAPGQQ----GGSGGNAGLIGNGGVGGV 175
G G N + GG G+ G G + GSG + GGSG G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGN- 64

Query: 176 GGIGGGVGGVGGTGGWLLGNGGTGGTGGVGTGNIAGGAGGFGGSALSLLGN 226
GG G GG GTGG + V G A G GG A+S+
Sbjct: 65 GGGNGNSGGGSGTGG-----NLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 37.4 bits (86), Expect = 2e-04
Identities = 32/94 (34%), Positives = 37/94 (39%), Gaps = 10/94 (10%)

Query: 127 NGAPGTGQAGGAG--GILWG-----NGGAGGSGAPGQQGGSGGNAGLIGNGGVGGVGGIG 179
NG P GG G W GG GSG G GN G GNG GG G G
Sbjct: 21 NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGG--GNGNSGGGSGTG 78

Query: 180 GGVGGVGGTGGWLLGNGGTGGTGGVGTGNIAGGA 213
G + V + T G GG+ +I+ GA
Sbjct: 79 GNLSAVAAPVAFGFPALSTPGAGGLAV-SISAGA 111


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2112  cloacin  35     9e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 35.1 bits (80), Expect = 9e-04
Identities = 45/121 (37%), Positives = 52/121 (42%), Gaps = 22/121 (18%)

Query: 153 SGGLQNGGSGGSAGLIGNGGNGGNGFLGGTGGAGGSGGWLAGSGGNGGAGGSVSGIGEIA 212
SGG G + G+ GN NGG LG GGA GW S N GGS SGI
Sbjct: 2 SGGDGRGHNTGAHSTSGNI-NGGPTGLGVGGGASDGSGW--SSENNPWGGGSGSGIHWGG 58

Query: 213 GAGGAGGNAPLLGWGGNGGVGGNAPQGTGGIGGAGGAGGALSAVGGTG----GTGGSGGV 268
G GNGG GN +GG G GG A++A G T G+GG+
Sbjct: 59 G-----------SGHGNGGGNGN----SGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGL 103

Query: 269 A 269
A
Sbjct: 104 A 104



Score = 32.8 bits (74), Expect = 0.005
Identities = 33/115 (28%), Positives = 46/115 (40%), Gaps = 1/115 (0%)

Query: 326 GIGGFANDTGGLGGQGGDATALLGVGVGGAGSIG-GAGNAAASAGGAGGAGAALVGVGVG 384
G G ++TG G G+GVGG S G G + GG G+G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 385 GIGGIGGFANGTSGAGGAGGSGAAVMGLGVGGAGSIGGAANSTAGAGGDGGEGVA 439
G GG G + G SG GG + AA + G + G + + + G +A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIA 117



Score = 32.4 bits (73), Expect = 0.005
Identities = 27/85 (31%), Positives = 36/85 (42%), Gaps = 6/85 (7%)

Query: 143 GNGGNGYSFTSGGLQNGGSGGSAGLIGNGGNGGNGFLGGT----GGAGGSGGWLAGSGGN 198
G G N + ++ G NGG G G G + G+G+ GG+G W GSG
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGV--GGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 199 GGAGGSVSGIGEIAGAGGAGGNAPL 223
G G SG G G + AP+
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPV 88



Score = 32.0 bits (72), Expect = 0.007
Identities = 26/70 (37%), Positives = 31/70 (44%), Gaps = 1/70 (1%)

Query: 461 AGGQGGQGAVLIGAGFGGAGGDGGSATVNAVGNGGDGGNAGALFGIGAGGHGGNAGSGVG 520
G G +G G G + G G S+ N G GG G G G G GGN SG G
Sbjct: 16 TSGNINGGPTGLGVGGGASDGSGWSSENNPWG-GGSGSGIHWGGGSGHGNGGGNGNSGGG 74

Query: 521 AANGGNGGSV 530
+ GGN +V
Sbjct: 75 SGTGGNLSAV 84



Score = 31.6 bits (71), Expect = 0.010
Identities = 27/104 (25%), Positives = 36/104 (34%), Gaps = 12/104 (11%)

Query: 474 AGFGGAGGDGGSATVNAVGNGGDGGNAGALFGIGAGGHGGNAGSGVGAANGGNGGSVGVI 533
+G G G + G+ + + NGG G G G + GSG + N GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTG--------LGVGGGASDGSGWSSENNPWGG----G 49

Query: 534 SDGSFTPTPVGYGGNGGNGVNGGTGGTGGTGGTLIGTDGTNGSP 577
S GNGG N G G G + + G P
Sbjct: 50 SGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFP 93



Score = 30.8 bits (69), Expect = 0.015
Identities = 28/80 (35%), Positives = 30/80 (37%), Gaps = 6/80 (7%)

Query: 225 GWGGNGGVGGNAPQGTGGIGGAGGAGGALSAVGGT------GGTGGSGGVAGGDGGAGGA 278
G G N G + GG G G GGA G + GG GSG GG G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 279 GRGLFYGLGGAGGMGGSATA 298
G G G G SA A
Sbjct: 66 GGNGNSGGGSGTGGNLSAVA 85



Score = 30.5 bits (68), Expect = 0.021
Identities = 34/120 (28%), Positives = 43/120 (35%), Gaps = 5/120 (4%)

Query: 258 GTGGTGGSGGVAGGDGGAGGAGRGLFYGLGGAGGMGGSATAVTPHTGGTGGVGGEGGAVF 317
G G G + G G G GL G G + G G S+ GG G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSE-----NNPWGGGSGSGIHWG 57

Query: 318 GYAQGGTGGIGGFANDTGGLGGQGGDATALLGVGVGGAGSIGGAGNAAASAGGAGGAGAA 377
G + G GG G + G GG A + G + G G A + + GA A A
Sbjct: 58 GGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIA 117



Score = 30.1 bits (67), Expect = 0.026
Identities = 35/99 (35%), Positives = 45/99 (45%), Gaps = 8/99 (8%)

Query: 125 GANGTATSPNGGAGGILYGNGGN-GYSFTSGGLQNGGSGGSAGLIGNGGNGGNGFLGGTG 183
GA+ T+ + NGG G+ G G + G ++S GG GS G G GNG G
Sbjct: 12 GAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNS 71

Query: 184 GAGGSGGWLAGSGGNGGAGGSVSGIGEIA-GAGGAGGNA 221
G G +G+GGN A + G A GAGG A
Sbjct: 72 GGG------SGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 30.1 bits (67), Expect = 0.026
Identities = 27/90 (30%), Positives = 33/90 (36%), Gaps = 6/90 (6%)

Query: 123 GNGANGTATSPNGGAGGILYGNGGNGYSFTSGGLQNGGSGGSAGLIGNGGNGGNGFLGGT 182
G G N A S +G NGG GG +G S GG+G GG
Sbjct: 6 GRGHNTGAHSTSGNI------NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 183 GGAGGSGGWLAGSGGNGGAGGSVSGIGEIA 212
G G GG GG+G G + +A
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVA 89


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2113  cloacin  40     2e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 40.5 bits (94), Expect = 2e-05
Identities = 40/119 (33%), Positives = 49/119 (41%), Gaps = 10/119 (8%)

Query: 153 SGGTQSGGTGGSAGLIGNGGNGGNGFLGGAGGAAGSGGWLAGSGGNGGAGGSVTGVGEVG 212
SGG G G+ GN NGG LG GGA+ GW S N GGS +G+ G
Sbjct: 2 SGGDGRGHNTGAHSTSGNI-NGGPTGLGVGGGASDGSGW--SSENNPWGGGSGSGIHWGG 58

Query: 213 GAGGAGGSAPLLGWGGNGGAGGDSTQGAGGMGGAGGAGGALASIGGAGGAGGTGTTSGG 271
G+G G GGNG +GG S G A ++ G G + S G
Sbjct: 59 GSGHGNG-------GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 36.6 bits (84), Expect = 2e-04
Identities = 39/138 (28%), Positives = 49/138 (35%), Gaps = 7/138 (5%)

Query: 225 GWGGNGGAGGDSTQGAGGMGGAGGAGGALASIGGAGGAGGTGTTSGGDGGVGGEGSGRLF 284
G G N GA S GG G G GGA G + G G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGA-----SDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 285 GLGGAGGAGGTGITSGGVGGDGGAGGGLLFGLGG--SGGAGGVATDATGIGGTGGAGGES 342
G G GG G +G SG G + FG + GAGG+A + +
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIM 120

Query: 343 GVIIGYAQSGAGGIGGYG 360
+ G + G G+ YG
Sbjct: 121 AALKGPFKFGLWGVALYG 138



Score = 35.5 bits (81), Expect = 5e-04
Identities = 24/78 (30%), Positives = 36/78 (46%), Gaps = 1/78 (1%)

Query: 459 NGGDG-GNGGGLFNIGRGGDGGNGGNAGATGGNGGNGGNIGVVANGTFTQTLFGDGGNGG 517
+GGDG G+ G + +GG G G + G+G + G + + GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 518 NGGTGGTPGTGGTGGSGG 535
+G GG +GG G+GG
Sbjct: 62 HGNGGGNGNSGGGSGTGG 79



Score = 33.5 bits (76), Expect = 0.002
Identities = 27/79 (34%), Positives = 34/79 (43%), Gaps = 1/79 (1%)

Query: 455 GAGGNGGDGGNGGGLFNIGRGGDGGNGGNAGATGGNGGNGGNIGVVANGTFTQTLFGDGG 514
G G N G G + N G G G GG + +G + N G +G G G
Sbjct: 6 GRGHNTGAHSTSGNI-NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGN 64

Query: 515 NGGNGGTGGTPGTGGTGGS 533
GGNG +GG GTGG +
Sbjct: 65 GGGNGNSGGGSGTGGNLSA 83



Score = 33.5 bits (76), Expect = 0.002
Identities = 25/71 (35%), Positives = 29/71 (40%), Gaps = 7/71 (9%)

Query: 421 GTATGGDGGAGGQGAALWGAGFGGDGAVGGNSFVGAGGNGGDGGNGGGLFNIGRGGDGGN 480
G GG G G G A G+G+ + G G+G GG G G GG GN
Sbjct: 18 GNINGGPTGLGVGGGASDGSGWSSENNPWGG---GSGSGIHWGGGSGH----GNGGGNGN 70

Query: 481 GGNAGATGGNG 491
G TGGN
Sbjct: 71 SGGGSGTGGNL 81



Score = 33.1 bits (75), Expect = 0.003
Identities = 29/99 (29%), Positives = 36/99 (36%), Gaps = 2/99 (2%)

Query: 421 GTATGGDGGAGGQGAALWGAGFGGDGAVGGNSFVGAGG--NGGDGGNGGGLFNIGRGGDG 478
G G + GA + G G G + G N GG+G G+ G G G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 479 GNGGNAGATGGNGGNGGNIGVVANGTFTQTLFGDGGNGG 517
GGN + GG+G G V A F G GG
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 33.1 bits (75), Expect = 0.003
Identities = 31/113 (27%), Positives = 46/113 (40%), Gaps = 7/113 (6%)

Query: 192 LAGSGGNGGAGGSVTGVGEVGGAGGAGGSAPLLGWGGNGGAGGDSTQGA-GGMGGAGGAG 250
++G G G G+ + G + G G +G G + G+G S GG G+G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLG----VGGGASDGSGWSSENNPWGGGSGSGIHW 56

Query: 251 GALASIGGAGGAGGTGTTSGGDGGVGGEGSGRLFGLGG--AGGAGGTGITSGG 301
G + G GG G +G SG G + + FG GAGG ++
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109



Score = 32.8 bits (74), Expect = 0.004
Identities = 32/118 (27%), Positives = 46/118 (38%), Gaps = 2/118 (1%)

Query: 261 GAGGTGTTSGGDGGVGGEGSGRLFGLGGAGGAGGTGITSGGVGGDGGAGGGLLFGLGGSG 320
G G G +G G G G G + G+G +S GG+G G+ +G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 321 GAGGVATDATGIGGTGGAGGESG--VIIGYAQSGAGGIGGYGGDIGGTGGAGGVAGVL 376
G GG ++ G GTGG V G+ G GG I + +A ++
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIM 120



Score = 32.0 bits (72), Expect = 0.006
Identities = 36/120 (30%), Positives = 43/120 (35%), Gaps = 12/120 (10%)

Query: 240 AGGMGGAGGAGGALASIGGAGGAGGTGTTSGGDGGVGGEGSGRLFGLGGAGGAGGTGITS 299
+GG G G S GG G G G G G +G G G G +
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 300 GGVGGDGGAGGGLLFGLGGSGGAGGVATDATGIG------GTGGAGGESGVIIGYAQSGA 353
G GG G G GGSG G ++ A + T GAGG + I A S A
Sbjct: 62 HGNGGGNGNSG------GGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 31.2 bits (70), Expect = 0.010
Identities = 21/55 (38%), Positives = 23/55 (41%), Gaps = 1/55 (1%)

Query: 415 GGVGGFGTATGGDGGAG-GQGAALWGAGFGGDGAVGGNSFVGAGGNGGDGGNGGG 468
GG G G G G+G WG G G GG S G GG G+ G G G
Sbjct: 22 GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSG 76



Score = 30.5 bits (68), Expect = 0.022
Identities = 32/121 (26%), Positives = 43/121 (35%), Gaps = 10/121 (8%)

Query: 285 GLGGAGGAGGTGITSGGVGGDGGAGGGLLFGLGGSGGAGGVATDATGIGGTGGAGGESGV 344
G G G G TSG + G G G G S G+G + + GG+G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGV---GGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 345 IIGYAQSGAGGIGGYGGDIGGTGGAGGVAGVLVGAGVGGFGGMGGAGTTGGAGGVGGQGV 404
G GG G+ GG G GG + GF + G G A + +
Sbjct: 60 -------SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGAL 112

Query: 405 T 405
+
Sbjct: 113 S 113



Score = 29.7 bits (66), Expect = 0.032
Identities = 28/99 (28%), Positives = 30/99 (30%)

Query: 123 GDGANGTATSPNGGAGGFLYGNGGNGYSFTSGGTQSGGTGGSAGLIGNGGNGGNGFLGGA 182
G G N A S +G G G G G + G S G GG G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 183 GGAAGSGGWLAGSGGNGGAGGSVTGVGEVGGAGGAGGSA 221
GG SGG G V GAGG A
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104


26  MMAR_2126  MMAR_2131  Y  N  Y  Genomic Island
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_2126  3       15       1.258794   Zn-dependent glyoxylase
MMAR_2127  3       16       1.611324   histidyl-tRNA synthetase
MMAR_2128  4       16       1.666845   13e12 repeat-containing protein
MMAR_2129  4       18       1.488555   site-specific integrase
MMAR_2130  3       20       1.746614   replication initiator protein
MMAR_2131  2       17       1.430891   DNA segregation ATPase FtsK/SpoIIIE-like
27  MMAR_2306  MMAR_2334  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_2306  -1      13       -3.201450  hypothetical protein
MMAR_2307  -2      11       -2.366271  hypothetical protein
MMAR_2308  0       14       -3.742114  hypothetical protein
MMAR_2309  0       15       -4.408179  UDP-glucose 6-dehydrogenase, UdgL
MMAR_2310  1       16       -4.573261  UDP-glucose 4-epimerase
MMAR_2311  1       19       -3.663894  glycosyl transferase family protein
MMAR_2312  2       19       -3.771943  nucleoside-diphosphate-sugar epimerase
MMAR_2313  1       22       -6.071712  glycosyltransferase, LosA
MMAR_2314  1       22       -5.266682  hypothetical protein
MMAR_2315  1       19       -4.785817  methyltransferase
MMAR_2316  0       17       -5.027111  transcriptional regulator
MMAR_2317  1       20       -5.791556  O-methyltransferase
MMAR_2318  1       19       -6.177278  hypothetical protein
MMAR_2319  2       17       -5.724731  hypothetical protein
MMAR_2320  2       17       -6.371567  TDP-4-oxo-6-deoxy-D-glucose transaminase
MMAR_2321  1       21       -6.779678  acyltransferase
MMAR_2322  0       19       -5.154169  hypothetical protein
MMAR_2323  -1      17       -3.678235  hypothetical protein
MMAR_2324  -1      16       -3.135767  hypothetical protein
MMAR_2325  0       15       -2.805633  hypothetical protein
MMAR_2326  0       11       -2.283685  hypothetical protein
MMAR_2327  -1      10       -1.328301  hypothetical protein
MMAR_2328  1       12       -1.671385  putative sugar kinase
MMAR_2329  1       13       -2.824376  3-dehydroquinate synthase
MMAR_2330  2       11       -2.542306  short chain dehydrogenase
MMAR_2331  0       11       -2.239130  hypothetical protein
MMAR_2332  0       9        -2.011638  acetolactate synthase large subunit IlvB
MMAR_2333  -1      11       -2.857217  glycosyltransferase, WcaA
MMAR_2334  -2      10       -3.050255  nucleotidyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_2310  NUCEPIMERASE  134    6e-39    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 134 bits (340), Expect = 6e-39
Identities = 82/353 (23%), Positives = 132/353 (37%), Gaps = 50/353 (14%)

Query: 1 MKVLVTGSAGFINGYVVQELLQAGHEVVGIDNYSKYGKVTKSYD-----DHPNYHFVEGD 55
MK LVTG+AGFI +V + LL+AGH+VVGIDN + Y V+ P + F + D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 56 VKDVDLMFELVE--GCEQMVASAARIGGITYFHEYAYDLLAENERIAAAHFDTAIYAYRK 113
+ D + M +L E++ S R + Y E + N F + R
Sbjct: 61 LADREGMTDLFASGHFERVFISPHR-LAVRYSLENPHAYADSNL----TGFLNILEGCRH 115

Query: 114 GWLKKINVISSSMVFENASIFPTPEKHITECPPPTSTYGFQKLACEYFAHGAYEQYGLPY 173
++ + SSS V+ P + P S Y K A E AH YGLP
Sbjct: 116 NKIQHLLYASSSSVYGLNRKMPFSTDDSVD--HPVSLYAATKKANELMAHTYSHLYGLPA 173

Query: 174 TIIRPFNCVGTGEQRALGGREIPSGNVKLAMSHVVPDLVQKVVKGQDPLHILGDGSQVRH 233
T +R F G P G +A + + +++G+ + + G R
Sbjct: 174 TGLRFFTVYG------------PWGRPDMA----LFKFTKAMLEGK-SIDVYNYGKMKRD 216

Query: 234 YTYGGDLARGIRTCMEHPAALNGD-----------------FNLSTPEATTVLELAEVIW 276
+TY D+A I + + +N+ +++ + +
Sbjct: 217 FTYIDDIAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALE 276

Query: 277 RKMRPDTPFRYESDPPFEHDVQLRSPDVHKATQVLGFEATTTLDAMLDEVIPW 329
+ + P DV S D +V+GF TT+ + + W
Sbjct: 277 DAL--GIEAKKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNW 327


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_2314  ACETATEKNASE  29     0.028    Acetate kinase family signature.
		>ACETATEKNASE#Acetate kinase family signature.

Length = 400

Score = 28.6 bits (64), Expect = 0.028
Identities = 12/25 (48%), Positives = 13/25 (52%), Gaps = 3/25 (12%)

Query: 108 ILPSFPSVATPEVFQNAFHQDFPRV 132
I+P P VA VF AFHQ P
Sbjct: 136 IMPDVPMVA---VFDTAFHQTMPDY 157


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2316  HTHTETR  57     4e-12    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 56.6 bits (136), Expect = 4e-12
Identities = 24/164 (14%), Positives = 46/164 (28%), Gaps = 15/164 (9%)

Query: 9 KARIRNAAVARFARDGFQKVNLRAIATSAGVSEALIFHHFGSKDGL-RAACDEYVLNVLV 67
+ I + A+ F++ G +L IA +AGV+ I+ HF K L + N+
Sbjct: 13 RQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGE 72

Query: 68 GRARTAGRPTAM-----ADLLGVYLSNPEEYR---------LQVQYMARAIEDDAPAAGT 113
+ ++L L + + A
Sbjct: 73 LELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRN 132

Query: 114 FVDTMVEESEAIFRAGAADGSMRPSSDPRALAVLNLLVALGLLT 157
+ E + + R A++ GL+
Sbjct: 133 LCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLME 176


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_2330  DHBDHDRGNASE  119    7e-35    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 119 bits (298), Expect = 7e-35
Identities = 69/246 (28%), Positives = 115/246 (46%), Gaps = 33/246 (13%)

Query: 13 ALVTGAARGIGRSVADTLESKGITVL----RPGRQE-----------------LDLSIPE 51
A +TGAA+GIG +VA TL S+G + P + E D+
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSA 70

Query: 52 SVADYLTGLDQVV---DILVLNAGINNPEPLQTLSADNWSSTQQVNVTANLLLLQGLLPR 108
++ + +++ + DILV AG+ P + +LS + W +T VN T + +
Sbjct: 71 AIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSKY 130

Query: 109 MAAAGFGRVVAVSSVYAHRARTGRVAYSASKAAIEEVVRSVAVEYGPYGVLANCVAPGFV 168
M G +V V S A RT AY++SKAA + + +E Y + N V+PG
Sbjct: 131 MMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGST 190

Query: 169 LTDL-----TYQNNDAKQLQALAER----VPVGRLAEPEEIAVFISWLVSAENSYITGQS 219
TD+ +N + ++ E +P+ +LA+P +IA + +LVS + +IT +
Sbjct: 191 ETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHITMHN 250

Query: 220 ITIDGG 225
+ +DGG
Sbjct: 251 LCVDGG 256


28  MMAR_2346  MMAR_2353  Y  N  N  Genomic Island
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_2346  3       16       -3.166322  dioxygenase
MMAR_2347  2       19       -3.767222  hypothetical protein
MMAR_2348  2       18       -3.836734  hypothetical protein
MMAR_2349  2       18       -4.080659  rhamnosyl transferase WbbL2
MMAR_2350  1       14       -3.673062  methylase
MMAR_2351  0       11       -3.391242  glycosyl transferase family protein
MMAR_2352  -1      10       -2.611749  hypothetical protein
MMAR_2353  -1      11       -3.027032  UDP-glycosyltransferase
29  MMAR_2456  MMAR_2461  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_2456  8       28       -0.749192  phenylalanyl-tRNA synthetase subunit alpha
MMAR_2457  8       28       -0.563262  phenylalanyl-tRNA synthetase subunit beta
MMAR_2458  9       29       -0.990181  PE-PGRS family protein
MMAR_2459  10      30       -1.201598  PE-PGRS family protein
MMAR_2460  10      29       -1.056965  PE-PGRS family protein
MMAR_2461  11      30       -0.825848  PE-PGRS family protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2458  PF00577  38     9e-05    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 38.3 bits (89), Expect = 9e-05
Identities = 37/198 (18%), Positives = 55/198 (27%), Gaps = 16/198 (8%)

Query: 89 SVEATAQQDL--SMAEQGLVNAVNAPAQALLGHPIIGTGGSGSQSATTGTSTSGVTSGSV 146
++ + ++ +Q L VN P L S S + +G +
Sbjct: 576 TLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLA 635

Query: 147 -ASGSA--AGSVSSGATAGTVSTGATTGSVSSGATA------GTVSAGATDGAVTSGTTA 197
G+ ++S G G + AT G + G +
Sbjct: 636 GVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANIGYSHSDDIKQLYY 695

Query: 198 SSSDGTTVSAGGVAATDPGSDGGVAVSDPGSDGGVAVSGSGSGS----GVAVGGTDTPGT 253
S G A GV P +D V V PG+ V G AV T
Sbjct: 696 GVSGGVLAHANGVTLGQPLNDTVVLVKAPGA-KDAKVENQTGVRTDWRGYAVLPYATEYR 754

Query: 254 SAPAPVTPNTFGDVVAAP 271
+ NT D V
Sbjct: 755 ENRVALDTNTLADNVDLD 772


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2459  cloacin  38     1e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 37.8 bits (87), Expect = 1e-04
Identities = 39/131 (29%), Positives = 50/131 (38%), Gaps = 13/131 (9%)

Query: 125 GANATVAGGDGRPGGILMGKGGDGAPG--GPGQPGGNGGAAGLIGTGGAGGKEAPSMPGG 182
GA++T +G P G+ +G G G P G G +G+ GG+G GG
Sbjct: 12 GAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG-----NGG 66

Query: 183 LGGRGGLLWGDGGKGGDGGAPVYDAWGYLLAEAGTGGAGGLPGMFGGLPGVAGAWGGEVP 242
G G G GG APV A+G T GAGGL G A ++
Sbjct: 67 GNGNSGGGSGTGGNLSAVAAPV--AFG--FPALSTPGAGGLAVSISA--GALSAAIADIM 120

Query: 243 TALAAPMALSA 253
AL P
Sbjct: 121 AALKGPFKFGL 131


30  MMAR_2586  MMAR_2593  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_2586  5       12       -1.025692  hypothetical protein
MMAR_2587  5       14       -0.971734  NADH dehydrogenase Ndh1
MMAR_2588  5       13       -1.223439  hypothetical protein
MMAR_2589  4       14       -1.161365  Z-decaprenyl diphosphate synthase
MMAR_2590  5       13       -1.003495  hypothetical protein
MMAR_2591  6       13       0.523360   PPE family protein
MMAR_2592  1       12       1.555598   short chain dehydrogenase
MMAR_2593  2       12       0.984140   divalent cation-transport integral membrane
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2586  PF05616  32     0.004    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 32.4 bits (73), Expect = 0.004
Identities = 31/95 (32%), Positives = 38/95 (40%), Gaps = 9/95 (9%)

Query: 86 LNPGARRSAPVQQQAAVPAPAPPNPANQAPNATQIAPDAAPIPAPAPPPPPD-GSDAGGA 144
L PG+ AP Q +PA NPAN AP+ P P P P PD DA
Sbjct: 315 LTPGSAE-APNAQPLPEVSPAE-NPANNP------APNENPGTRPNPEPDPDLNPDANPD 366

Query: 145 LGGATTSLAEWVTGPDSPNKTLERFGISGTDLGIL 179
G + + PD PN + G D G+L
Sbjct: 367 TDGQPGTRPDSPAVPDRPNGRHRKERKEGEDGGLL 401


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_2592  DHBDHDRGNASE  110    4e-31    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 110 bits (276), Expect = 4e-31
Identities = 71/252 (28%), Positives = 117/252 (46%), Gaps = 14/252 (5%)

Query: 14 RVAVVTGGAGGIGAATSRLFAQHGAQVVIADIDAELAHRTVDEIGGAAWV---VGTDVRD 70
++A +TG A GIG A +R A GA + D + E + V + A DVRD
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 71 ADQVSALAQRVLDRYGRVDILVNNVGHWLRHPGNFVDTDPQLWDELYRVNLHHVLLATHA 130
+ + + R+ G +DILVN G + PG + W+ + VN V A+ +
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAG--VLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126

Query: 131 FLPAMIEQHGGAIVNVSSVEGLRGYPEDPVYAAFKAAVIHFTRSLAVQVGNHGVRINAIA 190
M+++ G+IV V S YA+ KAA + FT+ L +++ + +R N ++
Sbjct: 127 VSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186

Query: 191 PDVTESLQVPYSQWLSD--AEQT------QWPGWVPVGRMGVPEDQARVILFLACELSAF 242
P TE+ + +S W + AEQ + +P+ ++ P D A +LFL +
Sbjct: 187 PGSTET-DMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGH 245

Query: 243 VTGHTIPTDGGT 254
+T H + DGG
Sbjct: 246 ITMHNLCVDGGA 257


31  MMAR_2649  MMAR_2656  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias   Product
MMAR_2649  5       21       3.139613  hypothetical protein
MMAR_2650  5       22       3.613634  PPE family protein
MMAR_2651  5       22       4.201315  transcriptional regulatory protein
MMAR_2654  4       19       4.091240  cytochrome P450 144A4 Cyp144A4
MMAR_2655  4       20       4.243367  transcriptional regulatory protein
MMAR_2656  5       20       4.507352  PE-PGRS family protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2651  HTHTETR  45     3e-08    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 45.0 bits (106), Expect = 3e-08
Identities = 21/188 (11%), Positives = 60/188 (31%), Gaps = 20/188 (10%)

Query: 11 DRRRAAADRIYDAATDLIAHEGINQLDIDRLATLVHCSRATVYRYVGGKNDIRNVVVKRA 70
+ I D A L + +G++ + +A +R +Y + K+D+ + + + +
Sbjct: 7 QEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELS 66

Query: 71 AARIADSVRSAVENLSG------RERVVAAI-----------ILSVQRIRADPLGQLMIS 113
+ I + G RE ++ + ++ + + + +G++ +
Sbjct: 67 ESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVV 126

Query: 114 SIHGGTQEVAWLADSPLLAGVASDLTGL-AGGDPHAAKWVVRIVLSLMY--WPAESEDVE 170
+ + L A A ++R +S + W + +
Sbjct: 127 QQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFD 186

Query: 171 RLMVEKFV 178
+
Sbjct: 187 LKKEARDY 194


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2655  HTHTETR  47     7e-09    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 47.3 bits (112), Expect = 7e-09
Identities = 20/131 (15%), Positives = 42/131 (32%), Gaps = 3/131 (2%)

Query: 10 DRSAVAAELIYDAAAELIASDGLSAFDIDKLAARVHCSRATIYRYAGGKAKIRDVVIARA 69
+ + I D A L + G+S+ + ++A +R IY + K+ + + +
Sbjct: 7 QEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELS 66

Query: 70 AARIVESVRAQAESLTG--AERVVASVEFALAGVRSDPLGRHLVGSFPKSANGA-EWFVG 126
+ I E G + + L ++ R L+ E V
Sbjct: 67 ESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVV 126

Query: 127 SKLVANFAADL 137
+ N +
Sbjct: 127 QQAQRNLCLES 137


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2656  cloacin  39     2e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.9 bits (90), Expect = 2e-04
Identities = 28/80 (35%), Positives = 35/80 (43%)

Query: 1202 GNGGNGGTGGTGSTGTAGSSDVMGANGGAGGSGWAGGDGGAGGMGGTLAGHGGDGGDGGS 1261
G G N G T G + + G + GSGW+ + GG G+ GG G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 1262 GGTGGTGGRGGNGFNGSTKA 1281
GG G +GG G G N S A
Sbjct: 66 GGNGNSGGGSGTGGNLSAVA 85



Score = 35.1 bits (80), Expect = 0.003
Identities = 28/102 (27%), Positives = 34/102 (33%)

Query: 1404 VAGGAGGAGGDGGLYGDGGDGGSGGNGGAGKAGAAGVSAGSNGEAGGQAGAGGVGGAGGN 1463
++GG G G G G G G + G S G G+ GG+
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 1464 GGANAGNGGTGGNGGDGGVGGTGGAGKVGTTGPAGGAGGEGG 1505
G N G G G G G + A V PA G GG
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 34.3 bits (78), Expect = 0.005
Identities = 33/99 (33%), Positives = 37/99 (37%)

Query: 171 GNGGAGGAGGISAAGNGGSGGVGGRGGLVYGSGGAGGAGGQGALSGGAGGAGGGAWLWGA 230
G G GA S NGG G+G GG GSG + G SG GGG+
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 231 GGAGGSGGEGLASAGGVGGAGGNAGLIGTGGLGGAGGVG 269
GG G SGG A A GAGG+
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 34.3 bits (78), Expect = 0.006
Identities = 27/101 (26%), Positives = 34/101 (33%)

Query: 1444 SNGEAGGQAGAGGVGGAGGNGGANAGNGGTGGNGGDGGVGGTGGAGKVGTTGPAGGAGGE 1503
S G+ G NGG G G + G G G +G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1504 GGDGGKGGTGGRGGNGGAGGTAQAAGYSDGSQGVGGDGGAG 1544
G+GG G G G G +A AA + G + G G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 34.3 bits (78), Expect = 0.007
Identities = 28/82 (34%), Positives = 35/82 (42%)

Query: 1511 GTGGRGGNGGAGGTAQAAGYSDGSQGVGGDGGAGGTGGTAGNGGKGGAGTWAVNNGIGGK 1570
G GRG N GA T+ GVGG G + N GG+G+ G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1571 GGNGGNAGTGGTGGSFGTGSQI 1592
G GGN +GG G+ G S +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAV 84



Score = 33.5 bits (76), Expect = 0.010
Identities = 28/78 (35%), Positives = 33/78 (42%)

Query: 286 GRGGTGGVGGASDGGNGGAGGDGGVGGGLFGSGGAGGSGGAGGVLGTGGDGGSGGAAAGL 345
GRG G S NGG G G GG GSG + + GG G+G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 346 WGAGGSGGAGGNGADGIS 363
G G SGG G G + +
Sbjct: 66 GGNGNSGGGSGTGGNLSA 83



Score = 33.1 bits (75), Expect = 0.013
Identities = 26/81 (32%), Positives = 34/81 (41%)

Query: 1170 NGGDGGNGGSGTTGTTGSKGGAGGAGGDGGRYGNGGNGGTGGTGSTGTAGSSDVMGANGG 1229
+GGDG +G T+G+ G G GG +G + G +GS G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1230 AGGSGWAGGDGGAGGMGGTLA 1250
G G G GG G GG L+
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLS 82



Score = 33.1 bits (75), Expect = 0.014
Identities = 32/117 (27%), Positives = 42/117 (35%)

Query: 1236 AGGDGGAGGMGGTLAGHGGDGGDGGSGGTGGTGGRGGNGFNGSTKAGLNGGDAGDGGAGG 1295
+GGDG G +GG G G GG G + G +G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1296 VGGAGGNGGAAGLAQAAGFSDGIQGAGGAGGDGGAGGGAGDGGDGANAAAGSGAVGG 1352
G GGNG + G + G + G + GAG +A A S A+
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 32.8 bits (74), Expect = 0.016
Identities = 34/113 (30%), Positives = 45/113 (39%), Gaps = 5/113 (4%)

Query: 1298 GAGGNGGAAGLAQAAGFSDGIQGAGGAGGDGGAGGGAGD----GGDGANAAAGSGAVGGN 1353
G G G G +G +G G GG G G G G+ + G G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1354 GGDGGDPGLGGGGGAGGTGATTGAHGADGLSP-TTGGNGGKGGNGGSGAIGVA 1405
G GG+ GGG G GG + A A G +T G GG + +GA+ A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 32.4 bits (73), Expect = 0.021
Identities = 39/104 (37%), Positives = 43/104 (41%), Gaps = 9/104 (8%)

Query: 270 GQNGGAGGDGGNAPLLGRGGTGGVGGASDGG--------NGGAGGDGGVGGGLFGSGGAG 321
G N GA GN G G G GGASDG GG G G GG G G G
Sbjct: 8 GHNTGAHSTSGNING-GPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGG 66

Query: 322 GSGGAGGVLGTGGDGGSGGAAAGLWGAGGSGGAGGNGADGISGG 365
G+G +GG GTGG+ + A S G A IS G
Sbjct: 67 GNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 32.4 bits (73), Expect = 0.026
Identities = 26/76 (34%), Positives = 33/76 (43%), Gaps = 2/76 (2%)

Query: 1673 GAGGAGGNGGTSRGDGGAGGAGGTGGVGGSGGDGA--DGTSGLFGGADGTAGGAGGDGGD 1730
G G G N G G G GVGG DG+ + +GG G+ GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1731 GGAGGAGGAGGKAVSG 1746
G GG G +GG + +G
Sbjct: 63 GNGGGNGNSGGGSGTG 78



Score = 31.6 bits (71), Expect = 0.036
Identities = 30/84 (35%), Positives = 32/84 (38%)

Query: 1870 AGGAGGAGGTGSTQGSAGTTGAWRAGGDGGSGGDGGDGFGLWNPGEGGRGGSGGDGGTGG 1929
+GG G TG+ S G G GG DG NP GG G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1930 DGGDGGNGRVEIWEGRGGNGGNGA 1953
G GGNG G GGN A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVA 85



Score = 31.6 bits (71), Expect = 0.040
Identities = 30/101 (29%), Positives = 35/101 (34%), Gaps = 5/101 (4%)

Query: 1750 NGSQGAGGNGGAAGDGGDGGNGGNGHDGNNGSVPSGGTDRDGGDGQGGGDGGAGGAGGAG 1809
+G G G N GA G+ G G V G +D G + GG G+G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTG-----LGVGGGASDGSGWSSENNPWGGGSGSGIHW 56

Query: 1810 GNGGAAGAGGGGTRGAGGDGGDGGNGGFAGDGGLGMDGLDA 1850
G G G GGG GG G G A G L
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALST 97


32  MMAR_2680  MMAR_2702  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_2680  3       15       -0.301253  hypothetical protein
MMAR_2681  5       18       0.516085   PPE family protein
MMAR_2682  3       16       0.362636   PPE family protein
MMAR_2683  1       14       0.377909   PPE family protein
MMAR_2684  -1      13       0.140078   PPE family protein
MMAR_2685  0       11       -0.275367  PPE family protein
MMAR_2686  0       11       -1.869975  hypothetical protein
MMAR_2687  1       10       -1.206694  Mg2+ transport p-type ATPase C MgtC
MMAR_2688  -1      13       -1.818261  NADH dehydrogenase
MMAR_2689  -1      13       -2.344265  hypothetical protein
MMAR_2690  4       8        2.441212   hypothetical protein
MMAR_2691  2       8        0.898789   membrane-bound C-5 sterol desaturase Erg3
MMAR_2692  2       8        1.932962   hypothetical protein
MMAR_2693  0       7        1.353771   hypothetical protein
MMAR_2694  0       7        1.567894   hypothetical protein
MMAR_2695  0       8        1.930650   PE-PGRS family protein
MMAR_2696  -1      9        -0.881396  drug-transport transmembrane ABC transporter
MMAR_2697  -1      8        0.204154   hypothetical protein
MMAR_2698  0       10       -0.651631  preprotein translocase subunit SecA
MMAR_2699  0       11       -0.708376  CDP-diacylglycerol--glycerol-3-phosphate
MMAR_2700  2       14       -0.500049  hypothetical protein
MMAR_2701  3       16       -1.569435  hypothetical protein
MMAR_2702  3       16       -1.738392  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_2681  CHANLCOLICIN  29     0.037    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 29.3 bits (65), Expect = 0.037
Identities = 30/99 (30%), Positives = 39/99 (39%), Gaps = 10/99 (10%)

Query: 54 GDGWLGPASESMEAAVFPYLAW--MNITGMQAEQTARQAAAAAGAFEAAFAMTVPPAQVA 111
G G G SES AA+ W + QAEQ AR AAA +A A
Sbjct: 37 GGGKGGSKSES-SAAIHATAKWSTAQLKKTQAEQAARAKAAAEAQAKAK-------ANRD 88

Query: 112 ANRTQLQTLVATNLLGQNTPAIAATEAAYGEMWAQDAAA 150
A +L+ +V L + +ATE A+ A A
Sbjct: 89 ALTQRLKDIVNEALRHNASRTPSATELAHANNAAMQAED 127


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2695  cloacin  37     2e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 37.4 bits (86), Expect = 2e-04
Identities = 33/92 (35%), Positives = 40/92 (43%), Gaps = 3/92 (3%)

Query: 171 GAGGAGGNSPLGNTANGGNGGTGGAGGLLFGPGGVGGAGGASFLTGGTGGDGGAGGLFGA 230
G G G N+ +T+ NGG G G G G G+G +S GG G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGV---GGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 231 GGLGGVGGSGNVGGNGGAGGAGGLLAGLVGAG 262
G G GG+GN GG G GG +A V G
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 36.6 bits (84), Expect = 4e-04
Identities = 34/99 (34%), Positives = 43/99 (43%), Gaps = 3/99 (3%)

Query: 376 GGGGAGGAGGFGTISDGGAGGRGGVGGQLLGNGGAGGAGGQGGNDGGAGGLGGNGVLIGN 435
GG G G G + S GG G+G G G + G+G N+ GG G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGV---GGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 436 GGNGGVGGVGETPGGDGGGGISGLLLGADGFNAPASSSP 474
G+G GG G + GG G GG + F PA S+P
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTP 98



Score = 35.8 bits (82), Expect = 9e-04
Identities = 29/91 (31%), Positives = 39/91 (42%), Gaps = 1/91 (1%)

Query: 733 IGGAGGRGGNAGLLFSDAGV-GGFGGFGGTGGGTGGTGGNAGWLGSGGSGGAGGGSNGDG 791
+ G GRG N G + + GG G G GG + G+G ++ GG G+G G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 792 GAGGTGGSGGQIVGDGGAGGAGGQGDLLAAG 822
G G GG+G G G G +A G
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 35.5 bits (81), Expect = 0.001
Identities = 35/110 (31%), Positives = 42/110 (38%), Gaps = 7/110 (6%)

Query: 760 GTGGGTGGTGGNAGWLGSGGSGGAGGGSNGDGGA------GGTGGSGGQIVGDGGAGGAG 813
G G T GN G G G GG S+G G + GG GSG G G G G
Sbjct: 8 GHNTGAHSTSGNING-GPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGG 66

Query: 814 GQGDLLAAGGSGGDGGQGGDAVLIGTGGNGGNGASGVLAGIGGDAGSGGL 863
G G+ G+GG+ V G GA G+ I A S +
Sbjct: 67 GNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAI 116



Score = 33.9 bits (77), Expect = 0.003
Identities = 38/111 (34%), Positives = 44/111 (39%), Gaps = 5/111 (4%)

Query: 229 GAGGLGGVGGSGNVGGNGGAGGAGGLLAGLVGAGGGDGGSGGLGAAGGDGGNGGRAGLFG 288
G G G G+ + GN G G + G GA G G S GG G+G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGG--GASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 289 GAGGAGGMGATGGHSGGDGGAGGDAGLL---FGTGGAGGAGGHAFDGDGGA 336
G G GG G +GG SG G A + F GAGG A GA
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 33.9 bits (77), Expect = 0.003
Identities = 34/107 (31%), Positives = 41/107 (38%), Gaps = 5/107 (4%)

Query: 321 GAGGAGGHAFDGDGGAGGAGGNAGLMFSSGGSGGVGGAGSIDGGAGGAGGDAGWLGGGGA 380
G G G + GG GL G S G G + + GG+G W GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 381 GGAGGFGTISDGGAGGRGGVGGQLLGNGGAGGAGGQGGNDGGAGGLG 427
G GG +G +GG G GG L G + GAGGL
Sbjct: 63 GNGGG-----NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 33.1 bits (75), Expect = 0.005
Identities = 33/112 (29%), Positives = 43/112 (38%), Gaps = 3/112 (2%)

Query: 523 GGAGGSAAATPGAAGGAGGAAGLVGAGGAGGAGANTGISNPGFGGAGGDGGAGGFLLGDG 582
G G +G G +G GG G+ N +GG G G G G G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 583 GAGGAGGAGAGAGGGVGGAGGAGGL---LGAGGTGGGGGVAPSLVDGGTGGA 631
GG G +G G+G G + A + A T G GG+A S+ G A
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 33.1 bits (75), Expect = 0.005
Identities = 35/98 (35%), Positives = 43/98 (43%), Gaps = 4/98 (4%)

Query: 547 GAGGAGGAGANTGISNPGFGGAGGDGGAG---GFLLGDGGAGGAGGAGAGAGGGVGGAGG 603
G G GA + +G N G G G GGA G+ + GG G+G GGG G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 604 AGGLLGAGGTGGGGGVAPSLVDGGTG-GAGGAGGAGGL 640
G GG+G GG ++ G A GAGGL
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGL 103



Score = 33.1 bits (75), Expect = 0.006
Identities = 35/104 (33%), Positives = 44/104 (42%), Gaps = 8/104 (7%)

Query: 300 GGHSGGDGGAGGDAGLLFGTGGAGGAGGHAFDGDGGAG-----GAGGNAGLMFSSGGSGG 354
G G + GA +G + G G GG A DG G + G G +G+ + G G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 355 VGGAGSIDGGAGGAGGDAGWLGGGGAGGAGGFGTISDGGAGGRG 398
GG G +GG G G L A A GF +S GAGG
Sbjct: 64 NGGGN---GNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.8 bits (74), Expect = 0.007
Identities = 35/100 (35%), Positives = 38/100 (38%), Gaps = 3/100 (3%)

Query: 144 GNGGAGGSGAAGQAGGD--GGAAGLIGAGGAGGAGGNSPLGNTANGGNG-GTGGAGGLLF 200
G G G + A G+ GG GL GGA G S N GG+G G GG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 201 GPGGVGGAGGASFLTGGTGGDGGAGGLFGAGGLGGVGGSG 240
G GG G G TGG A FG L G G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 32.4 bits (73), Expect = 0.009
Identities = 27/81 (33%), Positives = 34/81 (41%), Gaps = 1/81 (1%)

Query: 349 SGGSGGVGGAGSIDGGAGGAGGDAGWLGGGGAGGAGGFGTISDGGAGGRGGVGGQLLGNG 408
SGG G G+ GG G GGGA G+ + ++ GG G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGS-GIHWGGGS 60

Query: 409 GAGGAGGQGGNDGGAGGLGGN 429
G G GG G + GG+G G
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNL 81



Score = 31.6 bits (71), Expect = 0.014
Identities = 32/108 (29%), Positives = 41/108 (37%), Gaps = 5/108 (4%)

Query: 587 AGGAGAGAGGGVGGAGG-AGGLLGAGGTGGGGGVAPSLVDGGTGGAGGAGGAGGLFGGIF 645
+GG G G G G G G GGG GG+G GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGG-- 59

Query: 646 GAGGGDGGAGGFTGGIDGAGGVGGAGGNAGLLGGPG-GSGGSGGDGIA 692
+G G+GG G +GG G GG A G P + G+GG ++
Sbjct: 60 -SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVS 106



Score = 31.2 bits (70), Expect = 0.020
Identities = 28/82 (34%), Positives = 35/82 (42%), Gaps = 2/82 (2%)

Query: 263 GGDGGSGGLGAAGGDGG-NGGRAGLFGGAGGAGGMGATGGHSGGDGGAGGDAGLLFGTGG 321
GGDG GA G NGG GL G G + G G + ++ GG+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHW-GGGSG 61

Query: 322 AGGAGGHAFDGDGGAGGAGGNA 343
G GG+ G G G +A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSA 83



Score = 30.1 bits (67), Expect = 0.044
Identities = 31/92 (33%), Positives = 34/92 (36%), Gaps = 15/92 (16%)

Query: 681 GGSGGSGGDGIAKLGLNADGGAGGAGGNGGFLFGSG-GTGGFGGVGGVGTSTGIGGAGGR 739
GG G G N +GG G G GG GSG + GG G+ GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 740 GGNAGLLFSDAGVGGFGGFGGTGGGTGGTGGN 771
G GG G GG GTGGN
Sbjct: 63 GN--------------GGGNGNSGGGSGTGGN 80


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2696  PF05272  34     0.003    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 33.5 bits (76), Expect = 0.003
Identities = 12/52 (23%), Positives = 23/52 (44%)

Query: 456 LVITGRSGSGKTTLLRSLAELWPFASGTLSRPDGANDTMFLSQLPYVPLGSL 507
+V+ G G GK+TL+ +L L F+ G + ++ + L +
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIVAYELSEM 650


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits  Score  E-value  Comments
MMAR_2698  SECA  839    0.0      SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 839 bits (2170), Expect = 0.0
Identities = 282/813 (34%), Positives = 398/813 (48%), Gaps = 112/813 (13%)

Query: 46 RLLGATTEKNQNRSLAQVTASADFDKEAADLNDEKLR-KAAGLLNLEDLADSAD--IPQF 102
++ G+ ++ R V + E L+DE+L+ K A + + + IP+
Sbjct: 8 KVFGSRNDRTLRRMRKVVNIINAMEPEMEKLSDEELKGKTAEFRARLEKGEVLENLIPEA 67

Query: 103 LAIAREAGERATGLRPFDVQLLGALRMLAGDVIEMATGEGKTLAGAIAAAGYALGGRHVH 162
A+ REA +R G+R FDVQLLG + + + EM TGEGKTL + A AL G+ VH
Sbjct: 68 FAVVREASKRVFGMRHFDVQLLGGMVLNERCIAEMRTGEGKTLTATLPAYLNALTGKGVH 127

Query: 163 VVTINDYLARRDAEWMAPLLEAMDLTVGWITAESTGADRRAAYECDVTYASVNEIGFDVL 222
VVT+NDYLA+RDAE PL E + LTVG +R AY D+TY + NE GFD L
Sbjct: 128 VVTVNDYLAQRDAENNRPLFEFLGLTVGINLPGMPAPAKREAYAADITYGTNNEYGFDYL 187

Query: 223 RDQLVTDVADLVSPNPDVALIDEADSVLVDEALVPLVLAGTTHRETPRLEII-KLVGQLV 281
RD + + V AL+DE DS+L+DEA PL+++G + + + K++ L+
Sbjct: 188 RDNMAFSPEERVQRKLHYALVDEVDSILIDEARTPLIISGPAEDSSEMYKRVNKIIPHLI 247

Query: 282 KDKD-------ADEYFATDADSRNVHLTEAGARKVEKAL-------GGIDLYSEEHVGTT 327
+ + + +F+ D SR V+LTE G +E+ L G LYS ++
Sbjct: 248 RQEKEDSETFQGEGHFSVDEKSRQVNLTERGLVLIEELLVKEGIMDEGESLYSPANI-ML 306

Query: 328 LTEVNVALHAHVLLQRDVHYIVRDDAVHLINASRGRIAQLQRWPDGLQAAVEAKEGIETT 387
+ V AL AH L RDV YIV+D V +++ GR Q +RW DGL AVEAKEG++
Sbjct: 307 MHHVTAALRAHALFTRDVDYIVKDGEVIIVDEHTGRTMQGRRWSDGLHQAVEAKEGVQIQ 366

Query: 388 ETGEVLDTITVQALINRYVTVCGMTGTALAAGEQLRQFYKLGVSPIPPNTPNIREDESDR 447
+ L +IT Q Y + GMTGTA + YKL +P N P IR+D D
Sbjct: 367 NENQTLASITFQNYFRLYEKLAGMTGTADTEAFEFSSIYKLDTVVVPTNRPMIRKDLPDL 426

Query: 448 VYITAAAKNDAIVEHIAEVHDTGQPVLVGTRDVAESEDLHERLLRRDIPAVVLNAKNDAE 507
VY+T A K AI+E I E GQPVLVGT + +SE + L + I VLNAK A
Sbjct: 427 VYMTEAEKIQAIIEDIKERTAKGQPVLVGTISIEKSELVSNELTKAGIKHNVLNAKFHAN 486

Query: 508 EAAVIAEAGTLSRVTVSTQMAGRGTDIRLGGSDEA----------------------DHD 545
EAA++A+AG + VT++T MAGRGTDI LGGS +A HD
Sbjct: 487 EAAIVAQAGYPAAVTIATNMAGRGTDIVLGGSWQAEVAALENPTAEQIEKIKADWQVRHD 546

Query: 546 QVAELGGLHVVGTGRHHTQRLDNQLRGRAGRQGDPGSSVFFSSWEDDVV----AANLDG- 600
V E GGLH++GT RH ++R+DNQLRGR+GRQGD GSS F+ S ED ++ + + G
Sbjct: 547 AVLEAGGLHIIGTERHESRRIDNQLRGRSGRQGDAGSSRFYLSMEDALMRIFASDRVSGM 606

Query: 601 -NKLPMETDEDGQIVSAKAAGLLDHAQRVAEGRMLDVHANTWRYNQLIAQQRAIIVDRRN 659
KL M+ E I + +AQR E R D+ Y+ + QR I +RN
Sbjct: 607 MRKLGMKPGE--AIEHPWVTKAIANAQRKVESRNFDIRKQLLEYDDVANDQRRAIYSQRN 664

Query: 660 TLLRTATAREELAD-------------LAPKRYKEL------------------------ 682
LL + E + + P+ +E+
Sbjct: 665 ELLDVSDVSETINSIREDVFKATIDAYIPPQSLEEMWDIPGLQERLKNDFDLDLPIAEWL 724

Query: 683 ---SETVSEDRLEKIC-----------------------RMIMLYHLDRGWADHLAYLAD 716
E E E+I + +ML LD W +HLA +
Sbjct: 725 DKEPELHEETLRERILAQSIEVYQRKEEVVGAEMMRHFEKGVMLQTLDSLWKEHLAAMDY 784

Query: 717 IRESIHLRALGRQNPLDEFHRMAVDAFASLAAD 749
+R+ IHLR +++P E+ R + FA++
Sbjct: 785 LRQGIHLRGYAQKDPKQEYKRESFSMFAAMLES 817


33  MMAR_2802  MMAR_2829  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_2802  2       26       -2.395889  hypothetical protein
MMAR_2803  1       23       -3.518841  hypothetical protein
MMAR_2804  0       19       -3.947282  hypothetical protein
MMAR_2805  -1      19       -3.560566  hypothetical protein
MMAR_2806  -2      16       -3.553174  PPE family protein
MMAR_2809  -1      14       -3.227679  hypothetical protein
MMAR_2810  -1      13       -2.678984  hypothetical protein
MMAR_2811  -2      14       -1.910911  hypothetical protein
MMAR_2812  0       15       -1.474023  TetR family transcriptional regulator
MMAR_2813  0       14       -1.024015  dehydrogenase
MMAR_2814  1       13       0.452170   oxidoreductase FadB5
MMAR_2815  0       15       0.017509   hypothetical protein
MMAR_2816  2       11       -1.440450  ArsR-type repressor
MMAR_2817  1       18       -3.904512  hypothetical protein
MMAR_2818  1       18       -3.565077  hypothetical protein
MMAR_2819  0       15       -3.659752  hypothetical protein
MMAR_2820  1       17       -3.966686  hypothetical protein
MMAR_2821  0       16       -3.762584  isocitrate lyase AceAb
MMAR_2822  1       19       -3.890269  PPE family protein
MMAR_2823  -1      12       -1.336254  PE-PGRS family protein
MMAR_2824  0       12       -1.849631  monooxygenase
MMAR_2825  -1      11       -1.801613  hypothetical protein
MMAR_2826  -1      12       -1.993865  hypothetical protein
MMAR_2827  1       14       -2.092409  lipoprotein
MMAR_2828  2       16       -2.156125  lipase LipD
MMAR_2829  2       16       -2.065946  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2812  HTHTETR  61     7e-14    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 61.2 bits (148), Expect = 7e-14
Identities = 26/202 (12%), Positives = 68/202 (33%), Gaps = 10/202 (4%)

Query: 6 NRHELRRRSTHEALRRAALKSFACKGFAQVTVTELAREAGVTERTFFRHFPTKEAVLFQD 65
+ + + T + + AL+ F+ +G + ++ E+A+ AGVT + HF K + +
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 66 YENQLEWLAQALAQRPVSEP------LFDAVLASVAAFPHDLEVVRQAATARSELISADR 119
+E + + + P L + ++ + + + +
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 120 IANHLRVVQSSFAQVLTAFVRDRYADVANVDLVA----EVAGATIAAALVVAVENWGRNG 175
+A + ++ + + + L A A + + +ENW
Sbjct: 123 MAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAP 182

Query: 176 CAVDLGELVAASLDLVRSGLAP 197
+ DL + + ++
Sbjct: 183 QSFDLKKEARDYVAILLEMYLL 204


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2822  cloacin  36     6e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 36.2 bits (83), Expect = 6e-04
Identities = 25/80 (31%), Positives = 33/80 (41%)

Query: 215 GNVGNGNNGFGNFGSGNLGSGNFGSGNFGSSNIGASNLGSNNFGFGNLGSFNNGFANIGA 274
G G G+N + SGN+ G G G G ++ G+ NN G GS + G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 275 GNFGFGNNGNNNIGIGLNGN 294
GN G N G G N +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLS 82


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2823  cloacin  38     5e-06    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.2 bits (88), Expect = 5e-06
Identities = 28/73 (38%), Positives = 32/73 (43%), Gaps = 2/73 (2%)

Query: 63 GPSGTGGPEGGGGGVPGGPTGSGGPGGGGGSIPGGPTGGGGPGGGGGTIPGVGGGGG-GP 121
G G G + GGPTG G GGG G + GGG G+ GGG G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTG-LGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGN 64

Query: 122 GGGGGCIGNICGS 134
GGG G G G+
Sbjct: 65 GGGNGNSGGGSGT 77



Score = 34.3 bits (78), Expect = 1e-04
Identities = 24/73 (32%), Positives = 27/73 (36%), Gaps = 5/73 (6%)

Query: 54 ATESSPAPGGPSGTGGPEGGGGGVPGGPTGSGGPGGGGGSIPGGPTGGGGPGGGGGTIPG 113
A +S G G GG G + + GGG GS G G GGG
Sbjct: 13 AHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGN---- 68

Query: 114 VGGGGGGPGGGGG 126
G GGG G GG
Sbjct: 69 -GNSGGGSGTGGN 80



Score = 29.7 bits (66), Expect = 0.004
Identities = 23/69 (33%), Positives = 25/69 (36%), Gaps = 2/69 (2%)

Query: 50 SSSPATESSPAPGGPSGTGGPEGGGGGVPGGPTGSGGPGGGGGSIPGGPTGGGGPGGGGG 109
S+S P G G G +G G P G GG G G G G GG G G
Sbjct: 15 STSGNINGGPTGLGVGG-GASDGSGWSSENNPWG-GGSGSGIHWGGGSGHGNGGGNGNSG 72

Query: 110 TIPGVGGGG 118
G GG
Sbjct: 73 GGSGTGGNL 81



Score = 27.0 bits (59), Expect = 0.025
Identities = 32/99 (32%), Positives = 35/99 (35%), Gaps = 6/99 (6%)

Query: 25 GGGENKGPSSTTPTTTPTTT---VPSSPSSSPATESSPAP-GGPSGTGGPEGGGGGVPGG 80
G G N G ST+ T V S S P GG SG+G GGG G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG--HG 63

Query: 81 PTGSGGPGGGGGSIPGGPTGGGGPGGGGGTIPGVGGGGG 119
G G GGG G + P G G GG
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_2827  BLACTAMASEA  34     0.001    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 33.6 bits (77), Expect = 0.001
Identities = 21/83 (25%), Positives = 36/83 (43%), Gaps = 12/83 (14%)

Query: 136 GVLGIADLATNKKVTK---DTVFDIGSVSKQFTATAVLLLINEGRLTLDDPLAHYVPDLP 192
G++ + DLA+ + +T D F + S K AVL ++ G L+ + + DL
Sbjct: 41 GMIEM-DLASGRTLTAWRADERFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQQDLV 99

Query: 193 DWS--------SAVTVAQLMHHT 207
D+S +TV +L
Sbjct: 100 DYSPVSEKHLADGMTVGELCAAA 122


34  MMAR_2842  MMAR_2855  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_2842  0       12       -3.339597  hypothetical protein
MMAR_2843  0       13       -3.013957  hypothetical protein
MMAR_2844  0       13       -2.747718  hydroxydechloroatrazine ethylaminohydrolase
MMAR_2845  1       13       -3.488802  peroxiredoxin
MMAR_2846  1       13       -2.639485  alpha-L-fucosidase
MMAR_2847  2       15       -2.363840  hypothetical protein
MMAR_2848  1       15       -2.171796  hypothetical protein
MMAR_2849  0       17       -2.476742  short chain dehydrogenase
MMAR_2850  -2      16       -1.698454  hypothetical protein
MMAR_2851  -1      16       -2.110022  hypothetical protein
MMAR_2852  -1      17       -2.355534  hypothetical protein
MMAR_2853  -2      17       -2.591521  hypothetical protein
MMAR_2854  -1      17       -3.053495  hypothetical protein
MMAR_2855  -1      14       -3.246802  transcriptional regulatory protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
MMAR_2844  UREASE  34     9e-04    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 34.3 bits (79), Expect = 9e-04
Identities = 14/29 (48%), Positives = 18/29 (62%)

Query: 371 TVGGARCLGRDQDLGSLEVGKLADIALWQ 399
T+ A G ++GSLEVGK AD+ LW
Sbjct: 410 TINPAIAHGLSHEIGSLEVGKRADLVLWN 438



Score = 30.1 bits (68), Expect = 0.022
Identities = 20/74 (27%), Positives = 31/74 (41%), Gaps = 19/74 (25%)

Query: 17 LVIDRVSIATVDPKAAEFSEGHIIVEDDLIVAVGDGPAPEV---------PGATVIDGRG 67
L++D I D I ++D I A+G P++ PG VI G G
Sbjct: 76 LILDHWGIVKAD----------IGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEG 125

Query: 68 CLATPGLVNTHEHL 81
+ T G +++H H
Sbjct: 126 KIVTAGGMDSHIHF 139


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2848  PF05272  30     0.014    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.7 bits (66), Expect = 0.014
Identities = 16/43 (37%), Positives = 19/43 (44%)

Query: 22 PVTPPPLPRPVTFDQRWSDLTFVHWPVLPDSVAHMYPPGTRPD 64
P P P PRPV + W + V +P S H P G PD
Sbjct: 118 PPRPEPPPRPVVEKECWETIQPVPEHAVPPSFWHPAPKGREPD 160


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_2849  DHBDHDRGNASE  143    7e-44    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 143 bits (361), Expect = 7e-44
Identities = 74/256 (28%), Positives = 125/256 (48%), Gaps = 14/256 (5%)

Query: 9 LSGKRALITGASTGIGKKVALAYAEAGAQVAVAARHSDALQVVADEIAGVGGKALPIRCD 68
+ GK A ITGA+ GIG+ VA A GA +A + + L+ V + A D
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 69 VTQPDQVRGMLDQMTGELGGIDIAVCNAGIVSVQAMLDMPLEEFQRIQDTNVTGVFLTAQ 128
V + + ++ E+G IDI V AG++ + + EE++ N TGVF ++
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 129 AAARAMVDQGLGGTIITTASMSGHIINIPQQVSHYCTSKAAVVHLTKAMAVELAPHQIRV 188
+ ++ M+D+ G+I+T S + ++ Y +SKAA V TK + +ELA + IR
Sbjct: 126 SVSKYMMDRR-SGSIVTVGSNPAGVPRT--SMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 189 NSVSPGYIRTELVEPL-----------ADYHALWEPKIPLGRMGRPEELTGLYLYLASAA 237
N VSPG T++ L ++ IPL ++ +P ++ L+L S
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 238 SSYMTGSDIVIDGGYT 253
+ ++T ++ +DGG T
Sbjct: 243 AGHITMHNLCVDGGAT 258


35  MMAR_2943  MMAR_2969  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_2943  -1      10       3.933136   hypothetical protein
MMAR_2944  -1      11       4.037320   PPE family protein
MMAR_2945  -1      12       4.241153   1-aminocyclopropane-1-carboxylate deaminase
MMAR_2946  -1      11       4.179602   hypothetical protein
MMAR_2947  -1      9        3.393894   HrpA-like helicase
MMAR_2948  3       11       3.366147   PE-PGRS family protein
MMAR_2949  -1      13       -0.904304  hypothetical protein
MMAR_2950  -1      14       -1.337814  oxidoreductase
MMAR_2951  0       18       -2.258142  chitinase/cellulase
MMAR_2952  4       16       -4.882314  hypothetical protein
MMAR_2953  3       16       -4.612813  hypothetical protein
MMAR_2954  3       16       -4.775196  dehydrogenase fad flavoprotein GMC
MMAR_2955  5       19       -5.677472  hypothetical protein
MMAR_2956  4       13       -2.795473  PemK-like protein
MMAR_2958  3       15       -2.865812  macrophage infection protein, MimD
MMAR_2959  -1      9        0.698997   Zn-dependent alcohol dehydrogenase
MMAR_2960  -1      9        0.924883   transcriptional regulatory protein
MMAR_2961  -2      8        0.635444   hypothetical protein
MMAR_2962  -1      8        0.477710   transcriptional regulatory protein
MMAR_2963  -2      9        -0.000591  oxidoreductase
MMAR_2964  -1      8        2.131537   transmembrane transport protein
MMAR_2965  3       13       2.827766   ATPase/kinase, NadR
MMAR_2966  3       12       2.834270   integral membrane drug efflux protein
MMAR_2967  3       12       3.116744   hypothetical protein
MMAR_2968  3       11       2.782523   hypothetical protein
MMAR_2969  3       10       3.005418   PE-PGRS family protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2944  PF03544  35     7e-04    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 34.6 bits (79), Expect = 7e-04
Identities = 24/114 (21%), Positives = 32/114 (28%), Gaps = 15/114 (13%)

Query: 365 VPPPALPVAAPAAPPVALTPNVPPTPPAPAPVDASTLTLTQAPPPPPSTAPPPVSGAGLG 424
P P P PV + P P P PV P A P + A
Sbjct: 77 EPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTA--- 133

Query: 425 VGMENFGYMVGGLGADAKSAARASARKKAPRPDGAEVPVVAPAPSEPARSQRRR 478
A A ++ ++ PR + P PAR+Q R
Sbjct: 134 ------PARPTSSTATAATSKPVTSVASGPR------ALSRNQPQYPARAQALR 175


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2948  cloacin  39     1e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.5 bits (89), Expect = 1e-04
Identities = 38/115 (33%), Positives = 43/115 (37%), Gaps = 1/115 (0%)

Query: 140 GDGGAGGSGATGQVGG-NGGAAGLLGSGGAGGAGGGSTVGNGGAGGVGGTGGWLSGSGGV 198
GDG +GA G NGG GL GGA G S+ N GG G W GSG
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 199 GGAGGATSDVGAAGGAGGDGGAGGLLGAGGTGGAGGAGRLGSGATGGAGGAGGAG 253
G G S G+ G A + GAG L + GA A A
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 37.0 bits (85), Expect = 4e-04
Identities = 38/103 (36%), Positives = 40/103 (38%), Gaps = 1/103 (0%)

Query: 535 SGAAGSGIAGGDGGAAGLIGTGGTGGAGAGSASDDGGGGGSGGAGGWLSGAG-GVGGVGG 593
SG G G G +G I G TG G ASD G G SG+G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 594 FSLTGGTGGSGGAGGAGGLLGAAGLGGAGGAGVNGDGGGGGSG 636
GG G SGG G GG L A A G G GG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 36.2 bits (83), Expect = 5e-04
Identities = 36/109 (33%), Positives = 48/109 (44%), Gaps = 12/109 (11%)

Query: 650 GGDGGAGGASESANGGAGGAGGHAGAFGGPGGAGGSGGFGDVAGGIGGTGGNAGTLFGSG 709
G + GA S + NGG G G GA G G + + +G +G GG +G G+G
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH--GNG 65

Query: 710 GAGGDGGFGLSVVGGHGGDGGNAGLLFSSAGSGGFGGSSTKTAGDGGMG 758
G G+ GG G GGN L + A FG + T G GG+
Sbjct: 66 GGNGNS-------GGGSGTGGN---LSAVAAPVAFGFPALSTPGAGGLA 104



Score = 36.2 bits (83), Expect = 6e-04
Identities = 40/125 (32%), Positives = 50/125 (40%), Gaps = 7/125 (5%)

Query: 388 GGVGGSSTGGVGGQGGT--GGRAGLLIGNAGAGGAGGEGTAAGGDGGNGGNGVLIGNGGN 445
GG G G G GG GL +G + G+G GG+G G G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 446 AGTGGAGPSNGGNGVGGTGGVLLGADGFNAPASSSP-----LHSLQQQALTAVNAPIQAA 500
GG G S GG+G GG + F PA S+P S+ AL+A A I AA
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMAA 122

Query: 501 TGRPL 505
P
Sbjct: 123 LKGPF 127



Score = 35.8 bits (82), Expect = 7e-04
Identities = 36/114 (31%), Positives = 46/114 (40%), Gaps = 11/114 (9%)

Query: 618 LGGAGGAGVNGDGGGGGSGGAGGLLGGLVGAGGGDGGAGGASESANGGAGGAGGHAGAFG 677
+ G G G N GG G VG G DG + + GG G+G H G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 678 GPGGAGGSGGFGDVAGGIGGTGGNAGTL-------FGSGGAGGDGGFGLSVVGG 724
G G GG+G +GG GTGGN + F + G GG +S+ G
Sbjct: 61 GHGNGGGNGN----SGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 34.7 bits (79), Expect = 0.002
Identities = 29/82 (35%), Positives = 38/82 (46%), Gaps = 1/82 (1%)

Query: 771 GAGGFGGVVGAGDGGSGGNGGAGGQLLGIGGAGGAGGQSLSSGVGGDGGTGGNAVLIGNG 830
G G G GA NGG G +G G + G+G S ++ GG G+G +
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 831 GNGGNGGNGGNVGTGTPGAGGS 852
GNGG GN G G+GT G +
Sbjct: 63 GNGGGNGNSGG-GSGTGGNLSA 83



Score = 34.3 bits (78), Expect = 0.002
Identities = 37/107 (34%), Positives = 46/107 (42%), Gaps = 1/107 (0%)

Query: 120 NGANGAPGTGANGGDGGWLLGDGGAGGSGATGQVGGNGGAAGLLGSGGAGGAGGGSTVGN 179
N + NGG G +G G + GSG + + GG +G G GG+G G+ GN
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGS-GIHWGGGSGHGNGGGN 68

Query: 180 GGAGGVGGTGGWLSGSGGVGGAGGATSDVGAAGGAGGDGGAGGLLGA 226
G +GG GTGG LS G AGG AG L A
Sbjct: 69 GNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 34.3 bits (78), Expect = 0.002
Identities = 25/90 (27%), Positives = 29/90 (32%)

Query: 728 DGGNAGLLFSSAGSGGFGGSSTKTAGDGGMGGAAGWLGFGGAGGAGGFGGVVGAGDGGSG 787
+GG GL S G G SS GG G W G G G GG G G G
Sbjct: 21 NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGN 80

Query: 788 GNGGAGGQLLGIGGAGGAGGQSLSSGVGGD 817
+ A G G L+ +
Sbjct: 81 LSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 34.3 bits (78), Expect = 0.003
Identities = 37/105 (35%), Positives = 40/105 (38%), Gaps = 4/105 (3%)

Query: 235 AGRLGSGATGGAGGAGGA--GGPWAGLVGAGGGDGGIGGMGQDNGGAGGTGGSAGVLGGP 292
+G G G GA G GGP VG G DG G +N GG GS GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGS--GWSSENNPWGGGSGSGIHWGGG 59

Query: 293 GGAGGTGGYGGVTGGSGGSGGDAGGMFGVRGLFGTGGAGGTGGFG 337
G G GG G GGSG G + V F G GG
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 33.5 bits (76), Expect = 0.004
Identities = 39/107 (36%), Positives = 47/107 (43%), Gaps = 5/107 (4%)

Query: 528 GDGGAGGSGAAGSGIAGGDGGAAGLIGTGGTGGAGAGSASD-DGGGGGSGGAGGWLSGAG 586
G G G+ + I GG G G G + G+G S ++ GGG GSG G SG G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGV--GGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 587 GVGGVGGFSLTGGTGGSGGAGGAGGLLGAAGLG--GAGGAGVNGDGG 631
GG G GTGG+ A A G L GAGG V+ G
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 33.5 bits (76), Expect = 0.004
Identities = 34/118 (28%), Positives = 40/118 (33%), Gaps = 2/118 (1%)

Query: 582 LSGAGGVGGVGGFSLTGGT--GGSGGAGGAGGLLGAAGLGGAGGAGVNGDGGGGGSGGAG 639
+SG G G G T G GG G G GG +G G G G GG
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 640 GLLGGLVGAGGGDGGAGGASESANGGAGGAGGHAGAFGGPGGAGGSGGFGDVAGGIGG 697
G G G G G + SA G A + G GG S G ++ I
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 33.5 bits (76), Expect = 0.004
Identities = 32/109 (29%), Positives = 44/109 (40%), Gaps = 3/109 (2%)

Query: 508 NGAPGATGSGASGSPGGWLLGDGGAGGSGAAGSGIAGGDGGAAGLIGTGGTGGAGAGSAS 567
N +T +G P G +G G + GSG + G G +G+ GG+G G
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG-- 67

Query: 568 DDGGGGGSGGAGGWLSGAGGVGGVGGFSLTGGTGGSGGAGGAGGLLGAA 616
+G GG G GG LS G +L+ G + G L AA
Sbjct: 68 -NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 33.1 bits (75), Expect = 0.005
Identities = 36/126 (28%), Positives = 43/126 (34%), Gaps = 5/126 (3%)

Query: 554 GTGGTGGAGAGSASDDGGGGGSGGAGGWLSGAGGVGGVGGFSLTGGTGGSGGAGGAGGLL 613
G G GA + S + +GG G G GG G+G GGSG GG
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGW-----SSENNPWGGGSGSGIHWGGGS 60

Query: 614 GAAGLGGAGGAGVNGDGGGGGSGGAGGLLGGLVGAGGGDGGAGGASESANGGAGGAGGHA 673
G GG G +G GG S A + G G S SA +
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIM 120

Query: 674 GAFGGP 679
A GP
Sbjct: 121 AALKGP 126



Score = 32.4 bits (73), Expect = 0.010
Identities = 36/117 (30%), Positives = 42/117 (35%), Gaps = 18/117 (15%)

Query: 261 GAGGGDGGIGGMGQDNGGAGGTGGSAGVLGGPGGAGGTGGYGGVTGGSGGSGGDAGGMFG 320
G G G G NGG G G G G G + +GG +G GG
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG------- 58

Query: 321 VRGLFGTGGAGGTGGFGSTSGAAGGTGGDGGLFFSSGGAGGEGGAGATAGPGGGGGA 377
G G GG SG GTGG+ S+ A G A + PG GG A
Sbjct: 59 -------GSGHGNGGGNGNSGGGSGTGGNL----SAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.0 bits (72), Expect = 0.010
Identities = 33/102 (32%), Positives = 38/102 (37%), Gaps = 1/102 (0%)

Query: 154 GGNGGAAGLLGSGGAGGAGGGSTVGNGGAGGVGGTGGWLSGSGGVGGAGGATSDVGAAGG 213
GG+G +G GG T G G G+ GW S + GG G+ G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGS-GWSSENNPWGGGSGSGIHWGGGSG 61

Query: 214 AGGDGGAGGLLGAGGTGGAGGAGRLGSGATGGAGGAGGAGGP 255
G GG G G GTGG A A GAGG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGL 103



Score = 31.2 bits (70), Expect = 0.019
Identities = 28/83 (33%), Positives = 32/83 (38%), Gaps = 2/83 (2%)

Query: 326 GTGGAGGTGGFGSTSGAAGGTGGDGGLFFSSGGAGGEGGAGATAGPGGGGGAGGLLFSDG 385
G G G G STSG GG GL G + G G + GGG G+G
Sbjct: 3 GGDGRGHNTGAHSTSGNI--NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 386 GVGGVGGSSTGGVGGQGGTGGRA 408
G G GG+ G G G A
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSA 83



Score = 31.2 bits (70), Expect = 0.021
Identities = 32/110 (29%), Positives = 37/110 (33%), Gaps = 2/110 (1%)

Query: 599 GTGGSGGAGGAGGLLGAAGLGGAGGAGVNGDGGGGGSGGAGGLLGGLVGAGGGDGGAGGA 658
G G G GA G G G G G G GG G+G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 659 SESANGGAGGAGGHAGAFGGPGGAGGSGGFGDVAGGIGGTGGNAGTLFGS 708
GG G +GG +G G FG A G GG A ++
Sbjct: 63 GNG--GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 30.5 bits (68), Expect = 0.034
Identities = 39/141 (27%), Positives = 46/141 (32%), Gaps = 9/141 (6%)

Query: 192 LSGSGGVGGAGGATSDVGAAGGAGGDGGAGGLLGAGGTGGAGGAGRLGSGATGGAGGAGG 251
+SG G G GA S G G G GG G + G+G + G G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGG----GASDGSGWSSENNPWGGGSGSGIHW 56

Query: 252 AGGPWAGLVGAGGGDGGIGGMGQDNGGAGGTGGSAGVLGGPG-GAGGTGGYGGVTGGSGG 310
GG G GG G G G GG + G P G GG
Sbjct: 57 GGGSGHG----NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGAL 112

Query: 311 SGGDAGGMFGVRGLFGTGGAG 331
S A M ++G F G G
Sbjct: 113 SAAIADIMAALKGPFKFGLWG 133



Score = 30.5 bits (68), Expect = 0.034
Identities = 37/105 (35%), Positives = 44/105 (41%), Gaps = 12/105 (11%)

Query: 164 GSGGAGGAGGGSTVGNGGAGGVGGTGGWLSGSGGVGGAGGATSDVGAAGGAGGDGGAGGL 223
G G GA S NGG G+G GG G+G ++ + GG+G GG
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGG------ASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 224 LGAGGTGGAGGAGRLGSGATGGAGGAGGA----GGPWAGLVGAGG 264
G G GG G +G G TGG A A G P GAGG
Sbjct: 60 SGHGNGGGNGNSG--GGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_2949  INFPOTNTIATR  50     6e-10    Macrophage infectivity potentiator signature.
		>INFPOTNTIATR#Macrophage infectivity potentiator signature.

Length = 233

Score = 50.0 bits (119), Expect = 6e-10
Identities = 33/101 (32%), Positives = 51/101 (50%), Gaps = 1/101 (0%)

Query: 82 QVHTLQAGDGPVVPGTARVSVCYMGVNGRDGSVFDSSYERGAPVVFPLNGVVPGFQKAIA 141
Q + AG G + V+V Y G DG+VFDS+ + G P F ++ V+PG+ +A+
Sbjct: 129 QYKIIDAGTGAKPGKSDTVTVEYTG-TLIDGTVFDSTEKAGKPATFQVSQVIPGWTEALQ 187

Query: 142 GQKVGSTVAVAMTSADGYPDGQPSAGIRPGDTLVFAIKVLS 182
GST V + + Y I P +TL+F I ++S
Sbjct: 188 LMPAGSTWEVFVPADLAYGPRSVGGPIGPNETLIFKIHLIS 228
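
Each hit above is reported with a "Score = ... bits (...), Expect = ..." line followed by an "Identities = ..." line. The following is a minimal sketch, assuming those two lines keep exactly this layout, of how they could be read back into numbers and checked against the 0.05 RPSBlast E-value threshold listed under the input parameters. The names parse_hsp, SCORE_RE, IDENT_RE and E_VALUE_CUTOFF are hypothetical helpers for illustration and are not part of PredictBias.

```python
import re

# Assumed layout, taken from the report lines above:
#   Score = 50.0 bits (119), Expect = 6e-10
#   Identities = 33/101 (32%), Positives = 51/101 (50%), Gaps = 1/101 (0%)
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
IDENT_RE = re.compile(r"Identities\s*=\s*(\d+)/(\d+)")

# E-value threshold shown in the report's input parameters (RPSBlast).
E_VALUE_CUTOFF = 0.05


def parse_hsp(score_line, ident_line):
    """Extract bit score, raw score, E-value and identity fraction."""
    score = SCORE_RE.search(score_line)
    ident = IDENT_RE.search(ident_line)
    if score is None or ident is None:
        raise ValueError("line does not match the assumed report layout")
    matches, length = int(ident.group(1)), int(ident.group(2))
    return {
        "bits": float(score.group(1)),
        "raw": int(score.group(2)),
        "evalue": float(score.group(3)),
        "identity": matches / length,
    }


# Values copied from the MMAR_2949 / INFPOTNTIATR hit.
hsp = parse_hsp(
    "Score = 50.0 bits (119), Expect = 6e-10",
    "Identities = 33/101 (32%), Positives = 51/101 (50%), Gaps = 1/101 (0%)",
)
passes_cutoff = hsp["evalue"] < E_VALUE_CUTOFF
```

For the MMAR_2949 hit this yields an identity fraction of 33/101 and an E-value well below the threshold; the same function can be applied to every Score/Identities pair in the report.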


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2960  HTHTETR  50     6e-10    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 50.4 bits (120), Expect = 6e-10
Identities = 29/195 (14%), Positives = 67/195 (34%), Gaps = 16/195 (8%)

Query: 7 RQRMVAGAAEMISRRGLNATSVRELAKHTQAPLGSTYHYFPGGKYDLATEAVRWADDLTV 66
RQ ++ A + S++G+++TS+ E+AK G+ Y +F K DL +E ++
Sbjct: 13 RQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHF-KDKSDLFSEIWELSESNIG 71

Query: 67 GVLARELAAGPQAGLSAFLAMWRKIVIDSNFHAGCPVLAVSVEDLPEEHHA---PRRAAA 123
+ A P LS + ++ + +L + E ++A
Sbjct: 72 ELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQR 131

Query: 124 TAFQRWTSMLADSLRDAGAAEQ-----DAQQVATLIVASVEGTVAMCRAQQSIAPLDLVT 178
+ +L+ A+ ++ A ++ + G + +
Sbjct: 132 NLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLME-----NWLFAPQSFD 186

Query: 179 A--QLGRAIDAVLPG 191
+ + +L
Sbjct: 187 LKKEARDYVAILLEM 201


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2964  TCRTETB  136    4e-36    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 136 bits (344), Expect = 4e-36
Identities = 84/414 (20%), Positives = 169/414 (40%), Gaps = 20/414 (4%)

Query: 44 VLLVAAFGAFLAFLDSTIVNVAFPDIQRYFHSGISDLSWVLNAYNIVFAAFLVAAGKLAD 103
+L+ +F + L+ ++NV+ PDI F+ + +WV A+ + F+ GKL+D
Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74

Query: 104 LLGRKRLFVYGVVLFTIASGLCAAADS-VEQLVAFRVLQGIGAAVLVPASLGLVVESFPA 162
LG KRL ++G+++ S + S L+ R +QG GAA + +V P
Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134

Query: 163 ERRAHGVNLWGAAGAIAAGLGPPIGGALVEALNWRWVFLVNLPLGIVAVLAARRALVESR 222
E R L G+ A+ G+GP IGG + ++W ++ L+ + + I+ V + L +
Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPM-ITIITVPFLMKLLKKE- 192

Query: 223 ACGRRRVP-DVRGAAMLATALGLLTLGLIKGPDWGWSSLPAIGSLVAAALAMIGFVMSSR 281
R + D++G +++ + L ++ +I L+ + L+ + FV R
Sbjct: 193 --VRIKGHFDIKGIILMSVGIVFFMLF---------TTSYSISFLIVSVLSFLIFVKHIR 241

Query: 282 NHPTPLVEPALLRIRSFVAGSALTAIASAGFYAYLLTHVLFLNYVWGYTLLQAGLAVC-P 340
P V+P L + F+ G I ++ + V + + G + P
Sbjct: 242 KVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFP 301

Query: 341 AAIIAAVTAGLLGRVADRHGYRVIIGVGALIWAGSLLWYLTCVGTTPNFLGEWLPGQILQ 400
+ + + G + DR G ++ +G + S L ++ I+
Sbjct: 302 GTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFL----TASFLLETTSWFMTIIIVF 357

Query: 401 GIGVGAAFPLLGSAALAGLASGSSYATASAVTGTIRQVGAVIGVALLVILVGTP 454
+G + + S ++ ++ + G+A++ L+ P
Sbjct: 358 VLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIP 411


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2966  TCRTETB  149    2e-41    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 149 bits (377), Expect = 2e-41
Identities = 79/413 (19%), Positives = 166/413 (40%), Gaps = 19/413 (4%)

Query: 39 VCVLGSIMTMVDTSVVTVAQRTFVDTFGSTQAVVAWTITGYTLALAAVVPLAGWAADRFG 98
+C+L S ++++ V+ V+ + F A W T + L + + G +D+ G
Sbjct: 19 LCIL-SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLG 77

Query: 99 TKRMFMGSILVFTLSSLLCAIAPNIA-LLIASRVVQGLGGGMLAPLALTIVNREAGPKRV 157
KR+ + I++ S++ + + LLI +R +QG G L + +V R +
Sbjct: 78 IKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENR 137

Query: 158 GRVMAVLGIPGVLAPAFGPALGGWLIDSYSWQWIFWVNLPVGVVAVGLAAVVFPRDTPAP 217
G+ ++G + GPA+GG + W ++ +P+ + + +
Sbjct: 138 GKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLL--LIPMITIITVPFLMKLLKKEVRI 195

Query: 218 SETFDVVGMLLLSPGLPAFLYGMSEIPIYGTVADRHVWVPAGIGIALIVGFMFHALYRAD 277
FD+ G++L+S G+ F+ + + + + F+ H +
Sbjct: 196 KGHFDIKGIILMSVGIVFFMLF----------TTSYSISFLIVSVLSFLIFVKHIR-KVT 244

Query: 278 KPLIDLRLLTNRALTLANVAMFLYIVSTFGAGVLFPSYFQQLLDHTPLQAGMS-LLPRGI 336
P +D L N + + + + G + P + + + + G + P +
Sbjct: 245 DPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTM 304

Query: 337 GAALAVPLAGALVDRRGARGVLVIGVTLIATGMGVFAFGVATQRDYLPMLLIGLTILGMG 396
+ + G LVDRRG VL IGVT ++ +F + T + I + + G
Sbjct: 305 SVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETT---SWFMTIIIVFVLGG 361

Query: 397 MGCTRMPLVAVAMQSLAPNQIARGSTLIKVNQQMAAAVGTALMSVILTSQLNN 449
+ T+ + + SL + G +L+ ++ G A++ +L+ L +
Sbjct: 362 LSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLD 414


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2969  cloacin  38     1e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 37.8 bits (87), Expect = 1e-04
Identities = 38/120 (31%), Positives = 40/120 (33%), Gaps = 11/120 (9%)

Query: 168 GSGGAGGAGGIDGGGGAGTGGTGGRGGLIFGDAGAGGQGGLGFAPNPNGGGGGAGGTGGA 227
G G G G G GG G G GA G NP GGG G+G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGV----GGGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 228 GGLFGAGGPGGNGAPSVSGGGSGDGGDGGRGGVFGPGGRGGDGAPNPGGGSPGSGGNGGA 287
G G GG GN G G G G V P G PG G + GA
Sbjct: 59 GSGHGNGGGNGN-------SGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 37.0 bits (85), Expect = 3e-04
Identities = 27/81 (33%), Positives = 33/81 (40%)

Query: 237 GGNGAPSVSGGGSGDGGDGGRGGVFGPGGRGGDGAPNPGGGSPGSGGNGGAGGLFGAGGA 296
GG+G +G S G G G GG DG+ +P GG+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 297 GGNGAPDVGGGSPGSGGNGGA 317
G G GG G+GGN A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSA 83



Score = 35.8 bits (82), Expect = 6e-04
Identities = 38/109 (34%), Positives = 44/109 (40%), Gaps = 10/109 (9%)

Query: 198 GDAGAGGQGGLGFAPNPNGGGGGAGGTGGAGGLFGAGGPGGNGAPSVSGGGSGDGGDGGR 257
GD G + N NGG G G GGA + G G + + GGGSG G G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGA-----SDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 258 GGVFGPGGRGGDGAPNPGGGSPGSGGNGGAGGLFGAGGAGGNGAPDVGG 306
G G GG G+ GG G+GGN A A G P GG
Sbjct: 59 GSGHGNGGGNGNS-----GGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 35.1 bits (80), Expect = 9e-04
Identities = 27/83 (32%), Positives = 32/83 (38%)

Query: 267 GGDGAPNPGGGSPGSGGNGGAGGLFGAGGAGGNGAPDVGGGSPGSGGNGGAGGLFFGDGG 326
GGDG + G SG G G GG +G+ +P GG+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 327 AGGNGAPNVGGGSPGAGGNGGDA 349
G G N GGGS G A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVA 85



Score = 34.7 bits (79), Expect = 0.001
Identities = 25/72 (34%), Positives = 30/72 (41%)

Query: 124 NGANGAPGTGANGEAGGILFGSGGSGGSGGVGQNGGNGGDAGLFGSGGAGGAGGIDGGGG 183
N + NG G+ G G S GSG +N GG +G G G G GG G
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69

Query: 184 AGTGGTGGRGGL 195
GG+G G L
Sbjct: 70 NSGGGSGTGGNL 81



Score = 32.4 bits (73), Expect = 0.007
Identities = 34/120 (28%), Positives = 47/120 (39%), Gaps = 7/120 (5%)

Query: 479 GVGGAGGAAALSGAGNGGTGGAGGLFFGVGGAGGAAPLFGGGTGGTGGAGGLLFGLGGAG 538
G G GA + SG NGG G +G GGA+ G + GG G+ G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTG-------LGVGGGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 539 GNAPVFGGGSGGTGGRAGLIGIGGAGGSSSVFAGGDGGAGGAGGTFIGFGGAGGDGGVSG 598
G+ GGG+G +GG +G G A + F GAGG + ++
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 32.4 bits (73), Expect = 0.007
Identities = 33/115 (28%), Positives = 40/115 (34%), Gaps = 1/115 (0%)

Query: 430 SAGRSVGTIGSVGGAGGNGGLFGTGGAGGSGGQDGYNYGGNGGAGGLLFGVGGAGGAAAL 489
S G G GN TG G G DG + G G G G +
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 490 SGAGNGGTGGAGGLFFGVGGAGGAAPL-FGGGTGGTGGAGGLLFGLGGAGGNAPV 543
G G G GG G + AAP+ FG T GAGGL + +A +
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAI 116



Score = 32.0 bits (72), Expect = 0.009
Identities = 35/113 (30%), Positives = 39/113 (34%), Gaps = 6/113 (5%)

Query: 397 GGDNQNTNTGPGGVGGAGGDAGLFSGAIGGAGGSAGRSVGTIGSVGGAGGNGGLFGTGGA 456
GGD + NTG G G GGA +G S GG+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 457 GGSGGQDGYNYGGNGGAGGLLFGVGGAGGAAALSGAGNGGTGGAGGLFFGVGG 509
G GG GN G G G A A G T GAGGL +
Sbjct: 63 GNGGG------NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109



Score = 31.2 bits (70), Expect = 0.013
Identities = 32/109 (29%), Positives = 39/109 (35%), Gaps = 5/109 (4%)

Query: 559 GIGGAGGSSSVFAGGDGGAGGAGGTFIGFGGAGGDGGVSGNGGAGGKAGLIGVGGNGGNG 618
G G G+ S +GG G G G+G + GG G G G GNG
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 619 GNGGNGGAGGDAQLIGIGGNGGNGGDGQLGGPGTGGTGGTGGTLLGLNG 667
G GN G G G GGN G T G GG + ++
Sbjct: 66 GGNGNSGGGS-----GTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109



Score = 31.2 bits (70), Expect = 0.013
Identities = 33/103 (32%), Positives = 36/103 (34%), Gaps = 9/103 (8%)

Query: 217 GGGGAGGTGGAGGLFGA--GGPGGNGAPSVSGGGSGDGGDGGRGGVFGPGGRGGDGAPNP 274
GG G G GA G GGP G G V GG S G +G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLG---VGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 275 GGGSPGSGGNGGAGGLFGAGGAGGNGAPDVGG----GSPGSGG 313
G G G GG G AP G +PG+GG
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 30.8 bits (69), Expect = 0.018
Identities = 30/103 (29%), Positives = 40/103 (38%), Gaps = 3/103 (2%)

Query: 296 AGGNGAPDVGGGSPGSGG-NGGAGGLFFGDGGAGGNGAPNVGGGSPGAGGNGGDAGLFGA 354
+GG+G G SG NGG GL G G + G+G + +P GG+G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSEN--NPWGGGSGSGIHWGGG 59

Query: 355 GGAGGRGGNNLANPATDGGAGGAGGNGGAGGLFAGAGGPGGQG 397
G G GGN + + G + F PG G
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 30.1 bits (67), Expect = 0.030
Identities = 32/108 (29%), Positives = 35/108 (32%), Gaps = 11/108 (10%)

Query: 283 GNGGAGGLFGAGGAGGNGAPDVGGGSPGSGGNGGAG----GLFFGDGGAGGNGAPNVGGG 338
G G G GA GN G G G + G+G +G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 339 SPGAGGNGGDAGLFGAGGAGGRGGNNLANPATDGGAGGAGGNGGAGGL 386
G G GG G GGN A A A GAGGL
Sbjct: 63 GNGGGNGNS-------GGGSGTGGNLSAVAAPVAFGFPALSTPGAGGL 103



Score = 29.7 bits (66), Expect = 0.039
Identities = 27/87 (31%), Positives = 32/87 (36%), Gaps = 8/87 (9%)

Query: 571 AGGDGGAGGAGGTFIGFGGAGGDGGVSGNGGAGGKAGLIGVGGNGGNGGNGGNGGAGGDA 630
+GGDG G GG G+ GGA G+G + N GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGAS--------DGSGWSSENNPWGGGSGSG 53

Query: 631 QLIGIGGNGGNGGDGQLGGPGTGGTGG 657
G G GNGG G G+G G
Sbjct: 54 IHWGGGSGHGNGGGNGNSGGGSGTGGN 80


36  MMAR_2993  MMAR_3002  Y  N  N  Genomic Island
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_2993  2       15       -1.891734  hypothetical protein
MMAR_2994  2       15       -2.054016  ferredoxin FdxA_2
MMAR_2995  2       16       -1.970878  hypothetical protein
MMAR_2996  4       16       -2.552958  hypothetical protein
MMAR_2997  3       16       -1.720499  alternative RNA polymerase sigma factor
MMAR_2998  4       16       -1.705906  hypothetical protein
MMAR_2999  4       18       -2.137784  hypothetical protein
MMAR_5570  6       26       -1.707792  hypothetical protein
MMAR_3000  5       33       -1.182923  EsaT-6 family protein
MMAR_3001  2       29       -0.739926  EsaT-6 like protein EsxG
MMAR_3002  2       22       1.051341   PPE family protein
37  MMAR_3079  MMAR_3102  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_3079  2       13       -0.586071  sec-independent protein translocase
MMAR_3080  2       11       -0.509928  twin arginine translocase protein A
MMAR_3081  2       11       -0.486693  hypothetical protein
MMAR_3082  1       11       -0.855943  hypothetical protein
MMAR_3083  1       10       -2.136819  hypothetical protein
MMAR_3084  -1      11       -2.081089  proteasome PrcA
MMAR_3085  0       11       -1.989247  proteasome PrcB
MMAR_3086  2       10       -2.398276  hypothetical protein
MMAR_3087  2       10       -1.914572  hypothetical protein
MMAR_3088  2       11       -2.223455  hypothetical protein
MMAR_3089  3       10       -1.439677  integral membrane protein
MMAR_3090  3       11       -1.268325  hypothetical protein
MMAR_3091  2       10       -0.829396  ATPase
MMAR_3092  12      66       -3.392729  lipoprotein LppK
MMAR_3093  9       59       -2.757859  hypothetical protein
MMAR_3094  7       52       -2.200692  RNA methyltransferase
MMAR_3095  7       51       -2.274059  hypothetical protein
MMAR_3096  7       50       -2.278734  hypothetical protein
MMAR_3097  7       50       -2.323703  non-ribosomal peptide synthetase
MMAR_3098  2       27       -0.729596  polyketide synthase
MMAR_3099  1       20       -0.767305  polyketide synthase and peptide synthetase
MMAR_3100  0       11       1.941633   integral membrane drug efflux protein, ErmB
MMAR_3101  0       13       2.665198   mercuric reductase
MMAR_3102  0       12       3.020787   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_3080  TATBPROTEIN  30     5e-04    Bacterial sec-independent translocation TatB protein...
		>TATBPROTEIN#Bacterial sec-independent translocation TatB protein signature.

Length = 171

Score = 30.4 bits (68), Expect = 5e-04
Identities = 16/92 (17%), Positives = 35/92 (38%), Gaps = 10/92 (10%)

Query: 7 WHWAILAVVVIVLFGAKKLPDAARSLGKSMRIFKSEMREMQSETKAEPSAIE-------- 58
++ ++ +V+ G ++LP A +++ +R +S +Q+E E E
Sbjct: 7 SELLLVFIIGLVVLGPQRLPVAVKTVAGWIRALRSLATTVQNELTQELKLQEFQDSLKKV 66

Query: 59 --TNTANPTPVQSQRIDPAAATGQDQTEARPA 88
+ N TP +D + + A
Sbjct: 67 EKASLTNLTPELKASMDELRQAAESMKRSYVA 98


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3098  DHBDHDRGNASE  52     1e-08    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 51.6 bits (123), Expect = 1e-08
Identities = 38/163 (23%), Positives = 60/163 (36%), Gaps = 9/163 (5%)

Query: 1740 VLITGGTGMVAAALARHLVSSHGVRHLVLVSRRGDAAAGASKLVDELTAAGATVRVVACD 1799
ITG + A+AR L S H+ V + K+V L A D
Sbjct: 11 AFITGAAQGIGEAVARTLASQGA--HIAAVDYNPEKL---EKVVSSLKAEARHAEAFPAD 65

Query: 1800 VADPAAVSRLMNQLPEQCPPLSAVIHAAGTLDDALITSLTPQRVDAVLRAKVDGAWNLHE 1859
V D AA+ + ++ + P+ +++ AG L LI SL+ + +A G +N
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 1860 AT----RDLGLSAFVLCSSIAATLGAPGQANYAAGNAFLDALA 1898
+ D + V S A + A YA+ A
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFT 168


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3099  NUCEPIMERASE  35     0.005    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 34.8 bits (80), Expect = 0.005
Identities = 29/148 (19%), Positives = 45/148 (30%), Gaps = 35/148 (23%)

Query: 2393 TVLITGGTGMVASVLARHLVSSYGVKHVVLASRRADATAGVAEL------------VADL 2440
L+TG G + +++ L+ G+ L + L
Sbjct: 2 KYLVTGAAGFIGFHVSKRLL------------EAGHQVVGIDNLNDYYDVSLKQARLELL 49

Query: 2441 AAAGAAVAVVACDVADRAAVTRLLDHVSTCHPPLTGVIHAAGTLDDAVIASLTPDRVDAV 2500
A G D+ADR +T L V + AV SL A
Sbjct: 50 AQPG--FQFHKIDLADREGMTDLFASGHFER-----VFISPH--RLAVRYSLENPH--AY 98

Query: 2501 LRAKVDGAWNLHEATRHLGLSMFVLCSS 2528
+ + G N+ E RH + + SS
Sbjct: 99 ADSNLTGFLNILEGCRHNKIQHLLYASS 126


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_3100  TCRTETB  119    2e-31    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 119 bits (301), Expect = 2e-31
Identities = 78/408 (19%), Positives = 157/408 (38%), Gaps = 18/408 (4%)

Query: 39 IAAIMANLDISIVTVAQRTFTVAFHSTQATVAWTVAGYMLGMATATPMTGWAADRLGAKR 98
I + + L+ ++ V+ F+ A+ W +ML + T + G +D+LG KR
Sbjct: 21 ILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKR 80

Query: 99 LFMGAVATFTLGSMLCA-SAPNIGLLITFRVVQGIGGGVLGPLVLAIVTHQAGPRRLGRL 157
L + + GS++ LLI R +QG G LV+ +V G+
Sbjct: 81 LLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKA 140

Query: 158 LAVGAIPMLTAPMLGPILGGWLIDSYGWQWIFLINVPAGLLAFGLAAILVPEDPPKPSER 217
+ + +GP +GG + W +L+ +P + + + + +
Sbjct: 141 FGLIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKEVRIKGH 198

Query: 218 FDFIGMLLLLPGIAMLLLGVSAIPGSGTVTDHRVWVPAISGAVLITAFALHAWYRTDHPL 277
FD G++L+ GI +L ++ I + F H + P
Sbjct: 199 FDIKGIILMSVGIVFFMLFTTS----------YSISFLIVSVLSFLIFVKHI-RKVTDPF 247

Query: 278 IDLRLFTDRVVRLANLALLLYVAGAAGASLLLPSYFQQLLHQTPMRSG-LMMVPIGFGAM 336
+D L + + L + AG ++P + + + G +++ P +
Sbjct: 248 VDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVI 307

Query: 337 LTMPLTGAFMDSRGPRKVVLIGLTLIAAGTGTFVFGVANEADYLPTLLAGLTIAGMGLGC 396
+ + G +D RGP V+ IG+T ++ T F + + T++ + GL
Sbjct: 308 IFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASF-LLETTSWFMTIIIVFVL--GGLSF 364

Query: 397 TGLLLAASVMRVLAPHQIARGSALISVNQQISGSIGAALMSMILTNQF 444
T +++ V L + G +L++ +S G A++ +L+
Sbjct: 365 TKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPL 412


38  MMAR_3116  MMAR_3186  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_3116  2       17       -0.175225  undecaprenyl pyrophosphate phosphatase
MMAR_3117  1       16       0.418571   hypothetical protein
MMAR_3118  0       14       0.362628   lipoprotein LppL
MMAR_3119  -1      15       -0.047205  hypothetical protein
MMAR_3120  -1      14       -0.407743  dihydroorotate dehydrogenase 2
MMAR_3121  0       15       -0.782987  hypothetical protein
MMAR_3122  0       16       -1.213519  hypothetical protein
MMAR_3123  0       15       -1.139629  hypothetical protein
MMAR_3124  2       16       -1.418911  *integrase
MMAR_3125  3       15       -0.896512  hypothetical protein
MMAR_3126  -1      12       -2.125303  FtsK/SpoIIIE family protein
MMAR_3127  0       15       -3.425983  putative regulatory protein
MMAR_3128  1       15       -3.532653  hypothetical protein
MMAR_3129  1       19       -4.360447  hypothetical protein
MMAR_3130  1       21       -4.736515  hydrolase
MMAR_3132  1       22       -5.009701  acyl-CoA dehydrogenase
MMAR_3133  2       25       -5.773517  long-chain-fatty-acid--CoA ligase
MMAR_3134  2       27       -6.563029  hypothetical protein
MMAR_3135  2       28       -7.065231  cytochrome P450 136B2 Cyp136B2
MMAR_3136  2       30       -7.424261  monooxygenase
MMAR_3137  2       30       -7.309830  esterase/lipase
MMAR_3138  3       28       -7.777870  monooxygenase
MMAR_3139  3       25       -6.891191  monooxygenase
MMAR_3140  3       23       -6.126747  hypothetical protein
MMAR_3141  3       22       -5.278447  short-chain membrane-associated dehydrogenase
MMAR_3142  4       25       -4.874267  TetR family transcriptional regulator
MMAR_3143  2       21       -4.121243  hypothetical protein
MMAR_3144  2       21       -3.444946  AcrR family transcriptional regulator
MMAR_3145  2       18       -2.698245  hypothetical protein
MMAR_3146  0       16       -2.246231  hypothetical protein
MMAR_3147  0       17       -2.492228  hypothetical protein
MMAR_3148  0       13       -1.118814  transposase, ISMyma01_aa2
MMAR_3149  1       14       -2.159107  transposase, ISMyma01_aa1
MMAR_3150  1       14       -2.223756  zinc-containing alcohol dehydrogenase NAD-
MMAR_3151  3       14       -2.419478  long-chain-fatty-acid--CoA ligase
MMAR_3152  5       15       -2.609424  hypothetical protein
MMAR_3153  6       16       -2.176029  ferredoxin reductase
MMAR_3154  5       18       -3.623501  cytochrome P450 153A16 Cyp153A16
MMAR_3155  7       23       -3.234496  ferredoxin
MMAR_3156  8       23       -3.101630  AraC/XylS family transcriptional regulator
MMAR_3158  5       22       -2.577872  TetR family transcriptional regulator
MMAR_3159  3       23       -3.040495  hydrolase
MMAR_3160  2       25       -3.312075  hypothetical protein
MMAR_3161  2       24       -2.236705  transcriptional regulatory protein
MMAR_3162  -1      25       -1.771649  hypothetical protein
MMAR_3163  -1      21       -2.141648  Zn-dependent alcohol dehydrogenase
MMAR_3164  2       25       -2.364990  hypothetical protein
MMAR_3165  4       22       -1.793107  hypothetical protein
MMAR_3166  1       12       -2.750577  hypothetical protein
MMAR_5567  1       14       -3.340351  hypothetical protein
MMAR_3170  1       25       -3.341481  methyltransferase
MMAR_5571  1       25       -3.351117  hypothetical protein
MMAR_3173  0       25       -3.163101  protein MbtH
MMAR_3174  0       24       -3.064132  transmembrane transport protein MmpL
MMAR_3175  1       26       -2.995007  membrane protein MmpS4
MMAR_3176  1       26       -2.868219  non-ribosomal peptide synthetase
MMAR_3177  2       15       -0.570382  aspartate alpha-decarboxylase
MMAR_3182  3       15       -0.418216  hypothetical protein
MMAR_3183  2       15       -1.374753  hypothetical protein
MMAR_3184  0       13       -0.534343  hypothetical protein
MMAR_3185  0       12       -0.433212  secreted antigen Wag31
MMAR_3186  2       13       -0.010704  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3141  DHBDHDRGNASE  70     2e-16    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 70.5 bits (172), Expect = 2e-16
Identities = 52/201 (25%), Positives = 83/201 (41%), Gaps = 8/201 (3%)

Query: 18 AVVTGAGSGIGRAFAVELARRGGRVVCADKDPITAKESAELVRQAGGEGFDVVCDVTDLE 77
A +TGA GIG A A LA +G + D +P ++ ++ DV D
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSA 70

Query: 78 QVRNLADASEDWFGKAASLVINNAGIGAGGNRIGAT---SVEDWNAAISVNLWGVIYGCE 134
+ + E G LV N AG+ R G S E+W A SVN GV
Sbjct: 71 AIDEITARIEREMGPIDILV-NVAGV----LRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 135 TFVPRLRSNGRGGVINVASAASFGSAPRMGAYNVSKAGVLALSETLAAELSGTNVNVTVL 194
+ + G ++ V S + M AY SKA + ++ L EL+ N+ ++
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 195 CPTFVKTNIAKNPQIEESAAK 215
P +T++ + +E+ A+
Sbjct: 186 SPGSTETDMQWSLWADENGAE 206


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_3142  HTHTETR  56     7e-12    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 55.8 bits (134), Expect = 7e-12
Identities = 27/180 (15%), Positives = 63/180 (35%), Gaps = 15/180 (8%)

Query: 14 RTGAERRAERRQQLIEAATEIWSESGWAAVTMRGVCARTGLNDRYFYEDFKTREDLLVAA 73
R + E RQ +++ A ++S+ G ++ ++ + G+ Y FK + DL
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 74 WDGVRNDMLGEVSALFDERVDRPPIETITAAIAIVVDRIARDPGRAHIL-----LAQHVG 128
W+ +GE+ + + P+ + + V++ + R ++ + VG
Sbjct: 63 WEL-SESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVG 121

Query: 129 SSPLQDRRAVALQEAT-----QLVVEA-SRPHLREDADETALRMDTLVAVGGFVEVITAW 182
+ + L + Q + L D + + G ++ W
Sbjct: 122 EMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAI---IMRGYISGLMENW 178


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_3144  HTHTETR  52     3e-10    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 51.6 bits (123), Expect = 3e-10
Identities = 26/177 (14%), Positives = 65/177 (36%), Gaps = 13/177 (7%)

Query: 14 QRDAQRRALLIDAAVALMGKQGAAACTVTAVCTESGVTSRYFYQQFRDRDALLRAMFTKI 73
Q + R ++D A+ L +QG ++ ++ + +GVT Y F+D+ L ++
Sbjct: 7 QEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELS 66

Query: 74 STTFQAVITKAIPDDTVAPQELAYAPIKALVQMIENDPSMARILFV------ESGAEPLL 127
+ + + P + + +++ + ++ + G ++
Sbjct: 67 ESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVV 126

Query: 128 RQLRSELMSEFAELVLREARLHLDIPSEVIQVADLAATYGVGGLFEILRRWIDGQLN 184
+Q + L E + + + + I+ L A I+R +I G +
Sbjct: 127 QQAQRNLCLESYDRIEQTLK-------HCIEAKMLPADLMTRRAAIIMRGYISGLME 176


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_3158  HTHTETR  60     1e-13    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 60.4 bits (146), Expect = 1e-13
Identities = 20/114 (17%), Positives = 38/114 (33%)

Query: 11 RRHIREQVLRATRELTIEKGWEQVRVSEVAELVGVSRPTLYKEFGDKQGLGDALVVAEGQ 70
+ R+ +L L ++G + E+A+ GV+R +Y F DK L +
Sbjct: 9 AQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSES 68

Query: 71 RFLEGIHAILAEHTGDVQGGITAAVRFTLREAEASPLLKSVLTSSHSGDDRAGA 124
E A+ GD + + L + ++ + G
Sbjct: 69 NIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_3161  HTHTETR  69     1e-16    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 68.9 bits (168), Expect = 1e-16
Identities = 31/169 (18%), Positives = 65/169 (38%), Gaps = 7/169 (4%)

Query: 10 DRDSARDRLLDAAERCLESCGVVGTTMEDIGRTAGVSRATVYRYFPNREAVMSGVIIRAA 69
+ R +LD A R GV T++ +I + AGV+R +Y +F ++ + S + +
Sbjct: 8 EAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSE 67

Query: 70 ERYLDRINPRIAE-----HTDLGSALVDFVEYTVEAARREEIIGLLFGSDEELAGVGLAA 124
+ A+ + L L+ +E TV RR ++ ++F E + + +
Sbjct: 68 SNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQ 127

Query: 125 GTSTSLFELVTEFLRPIFRRHWS--YVEPGVSVDDAAEWVLRTILSLLT 171
+L + + + + + AA + I L+
Sbjct: 128 QAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLME 176


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3174  ACRIFLAVINRP  51     2e-08    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 51.0 bits (122), Expect = 2e-08
Identities = 37/260 (14%), Positives = 83/260 (31%), Gaps = 30/260 (11%)

Query: 175 PPGVKVYVTGPAALQADLI-HSAERTVRTIKIATFTVIIVLMLFFFRSVPTVLVLLGVVG 233
P G+KV + S V+T+ A V +V M F +++ L+ V
Sbjct: 318 PQGMKVLYPYD---TTPFVQLSIHEVVKTLFEAIMLVFLV-MYLFLQNMRATLIPTIAVP 373

Query: 234 IQLSAATGVAAVIGYLGVVELSTFAVNLV--VAMAIATGT--DYAIFLIGRYQEARA-AG 288
+ ++G ++ +++N + M +A G D AI ++ +
Sbjct: 374 V---------VLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDK 424

Query: 289 VDRESAYYEMWHGTAHVVLASGLTIAGAALCMSL---TRMPYLQSLGVPTAVAMLVALSV 345
+ + A + ++ + ++ + M+ + + + AM +++ V
Sbjct: 425 LPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLV 484

Query: 346 ALTLGPAAVTVASRFGLLEPKRAIR------IRFWRRIGTAVVRWPAPILLASLAA--AL 397
AL L PA + E + IL ++
Sbjct: 485 ALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLLIY 544

Query: 398 VGLIALPFYQPSYNDRRYIP 417
++A ++P
Sbjct: 545 ALIVAGMVVLFLRLPSSFLP 564



Score = 34.0 bits (78), Expect = 0.003
Identities = 40/242 (16%), Positives = 83/242 (34%), Gaps = 36/242 (14%)

Query: 707 DGKSLRMIISHRGDPASAEGISR---------VDPIKLAAIEALKGTPLENAKVSLGGTA 757
G++ +I G PA+ GI IK A + L+ + KV
Sbjct: 271 GGENYNVIARINGKPAAGLGIKLATGANALDTAKAIK-AKLAELQPFFPQGMKVLYPYDT 329

Query: 758 SVYHDLS-EGTRYDLVIAVIATVCLIFAIMLLITRSLVAAMVIVGTVLLSLGASVGLSVL 816
+ + LS L A++ L+F +M L +++ A ++ V + L + +
Sbjct: 330 TPFVQLSIHEVVKTLFEAIM----LVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAA 385

Query: 817 LWQYIVGLPLQWMVMSMAVIILLAVGADYNLMMV---ARFKEEMPAGLKTGIIRAMGGAG 873
G + + M +++ + + D +++V R E K ++M
Sbjct: 386 F-----GYSINTLTM-FGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQ 439

Query: 874 SVVTAAGLVFAFTMTSMVVSDLRTIGQIGTF-------IGLGLLFDTLIVRSFMVPSIAA 926
+ A ++++ + G G I + L+ P++ A
Sbjct: 440 GALVGI----AMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALIL-TPALCA 494

Query: 927 LL 928
L
Sbjct: 495 TL 496


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_3185  IGASERPTASE  30     0.007    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 30.4 bits (68), Expect = 0.007
Identities = 29/180 (16%), Positives = 59/180 (32%), Gaps = 19/180 (10%)

Query: 36 ENELTRLIEENSDLRQRIAELDQELAAGAGGGAAVTAQPTQAMPVYEPEPEPAKPAAPVA 95
N L + R + + + A V + P+ + + P P AP
Sbjct: 974 VNGRYDLYNPEVEKRNQTVDTTN-ITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPAT 1032

Query: 96 SAATNEEQAMKAARVLSLAQDTADRLTSTAKAESDKMLSDARANADQILSEARHT--AET 153
+ T E A + + S ++++ ++ A ++ EA+ A T
Sbjct: 1033 PSETTETVAENSKQE------------SKTVEKNEQDATETTAQNREVAKEAKSNVKANT 1080

Query: 154 TVTE-ARQRADGMLADAQARSESQLRQAQEKADAL---QADAERKHSEIMGTINQQRTVL 209
E A+ ++ E+ + +EKA + + S++ Q TV
Sbjct: 1081 QTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQ 1140


39  MMAR_3259  MMAR_3326  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
MMAR_3259-312-3.505592short chain dehydrogenase
MMAR_3260-117-4.613550dihydrolipoamide acetyltransferase
MMAR_3261026-7.046392hypothetical protein
MMAR_3262031-8.109382integral membrane protein ABC transporter
MMAR_32631131-1.112381ABC transporter ATP-binding protein
MMAR_32641132-0.972507MmpL family transport protein
MMAR_32651437-0.227204MbtH-like protein
MMAR_32661335-0.274312MmpL family transport protein
MMAR_326714380.072673transmembrane protein
MMAR_326814380.138956non-ribosomal peptide synthetase
MMAR_32701233-0.426855non-ribosomal peptide synthetase
MMAR_32711233-0.316502non-ribosomal peptide synthetase
MMAR_3272024-3.903014long-chain-fatty-acid--CoA ligase
MMAR_3273120-4.492456hypothetical protein
MMAR_3274015-2.593655hypothetical protein
MMAR_3275013-2.546512hypothetical protein
MMAR_3276211-2.448580alternative RNA polymerase sigma factor
MMAR_3277210-2.316927hypothetical protein
MMAR_3278210-1.060176multimeric flavodoxin WrbA
MMAR_32792100.273976hypothetical protein
MMAR_3280290.470334hypothetical protein
MMAR_32811100.861329acetolactate synthase large subunit IlvB
MMAR_3282-2121.628161hypothetical protein
MMAR_3283-2121.827242drug-transport transmembrane ABC transporter
MMAR_3284-2171.045899drug-transport transmembrane ABC transporter
MMAR_32852183.485593lipoate-protein ligase B
MMAR_32862182.989249lipoyl synthase
MMAR_32871173.292044transmembrane protein
MMAR_3288-1143.147044hypothetical protein
MMAR_32890142.485840glutamine synthetase GlnA1
MMAR_3290-1123.009993PE-PGRS family protein
MMAR_3291-2110.033316hypothetical protein
MMAR_3292-1120.276018hypothetical protein
MMAR_3293-1120.024647bifunctional glutamine-synthetase
MMAR_3294-212-0.750276glutamine synthetase
MMAR_3295-110-0.530047exported protease
MMAR_3296-2110.122492cytochrome C oxidase polypeptide I CtaD
MMAR_3297-1111.005321exported protease
MMAR_32980124.283206hypothetical protein
MMAR_32990124.5371873-methyl-2-oxobutanoate
MMAR_33000104.291705hypothetical protein
MMAR_33010104.353886hypothetical protein
MMAR_33021114.548627hypothetical protein
MMAR_33030125.231527PE-PGRS family protein
MMAR_3304-1102.057957bifunctional RNase H/acid phosphatase
MMAR_3305-1102.209579hypothetical protein
MMAR_3306-1143.776899hypothetical protein
MMAR_3307-1153.298867hypothetical protein
MMAR_3308-2152.681125hypothetical protein
MMAR_3309-2151.604111phosphotyrosine protein phosphatase PtpA
MMAR_3310-2132.146063transmembrane protein
MMAR_33113145.592290cobalamin biosynthesis protein
MMAR_33122155.076670hypothetical protein
MMAR_33133155.388848hypothetical protein
MMAR_33143165.265931hydrolase
MMAR_33152155.658329transposase for ISMyma07
MMAR_33161155.548388PE-PGRS family protein
MMAR_33171142.547009transposase for ISMyma07
MMAR_33181162.420646hypothetical protein
MMAR_33191172.286751dehydrogenase
MMAR_33201192.231479AraC/XylS family transcriptional regulator
MMAR_33210235.566961hypothetical protein
MMAR_33220245.828735pyruvate dehydrogenase E1 component (beta
MMAR_33230235.485757pyruvate dehydrogenase E1 component (alpha
MMAR_33240195.045554nucleoside-diphosphate-sugar epimerase
MMAR_33250175.039752hypothetical protein
MMAR_33260144.967511PE-PGRS family protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3259  DHBDHDRGNASE  100    2e-25    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 99.7 bits (248), Expect = 2e-25
Identities = 57/185 (30%), Positives = 90/185 (48%), Gaps = 1/185 (0%)

Query: 328 VAVTGAGSGIGRETALAFAREGAEVVLSDIDEATVKDTAAEIAARGGVAHPYVLDVSDTE 387
+TGA GIG A A +GA + D + ++ + + A A + DV D+
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSA 70

Query: 388 AVEAFADQVSATHGLPDIVVNNAGVGQAGRFLDTPAEQFDRVLDVNLGGVVNGCRAFGQR 447
A++ ++ G DI+VN AGV + G E+++ VN GV N R+ +
Sbjct: 71 AIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSKY 130

Query: 448 LVERGTGGHIVNVSSMAAYAPLQSLSAYCTSKAATFMFSDCLRAELDAADVGLTTICPGV 507
+++R G IV V S A P S++AY +SKAA MF+ CL EL ++ + PG
Sbjct: 131 MMDR-RSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGS 189

Query: 508 INTNI 512
T++
Sbjct: 190 TETDM 194


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_3260  IGASERPTASE  42     9e-06    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 41.6 bits (97), Expect = 9e-06
Identities = 42/310 (13%), Positives = 82/310 (26%), Gaps = 34/310 (10%)

Query: 93 PEAE----PAAAAQPEPEAEPEPQPEAKPQSGGSSAAGGDATPVLMPELGESVAEGTVTR 148
PE E + S A D PV P A T +
Sbjct: 983 PEVEKRNQTVDTTNITTPNNIQAD-VPSVPSNNEEIARVDEAPVPPP------APATPSE 1035

Query: 149 WLKKVGDSVQVDEALVEVSTDKVDTEIPSPVAGVLLSITAEEDDVVQVGGELARIGSGSA 208
+ V ++ + + VE + + + E+A+ GS +
Sbjct: 1036 TTETVAENSKQESKTVEKNEQDATE--TTAQNREVAKEAKSNVKANTQTNEVAQSGSETK 1093

Query: 209 AAAPPESKPAPAPEAAPETKA-------APEPKAAPEPKPAPEPKAAPEPKPAPAATPQP 261
E+K E + K P+ + PK P+ +PA P
Sbjct: 1094 ETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTV 1153

Query: 262 AAAPAPSAGDGTPYVTPLVRKLAEENNIDLDSVTGTGVGGRI------------RKQDVL 309
S + T ++ + + T G + +
Sbjct: 1154 NIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNS 1213

Query: 310 AAAEKKKERPEAKPAAAQASAPASPSKAAAPAAAAALAHLRGTKQKASRIRQITAIKTRE 369
++ K K R + + + ++ + AL L + + + A
Sbjct: 1214 ESSNKPKNR-HRRSVRSVPHNVEPATTSSNDRSTVALCDL-TSTNTNAVLSDARAKAQFV 1271

Query: 370 SLQATAQLTQ 379
+L ++Q
Sbjct: 1272 ALNVGKAVSQ 1281



Score = 29.6 bits (66), Expect = 0.047
Identities = 29/162 (17%), Positives = 50/162 (30%), Gaps = 8/162 (4%)

Query: 15 TEGTVTRWLKQEGDTVEIDEPLVEVSTDKVDTEIPSPAAGVLTKIVAKEDDTVEVGGELA 74
T K+ V+ + EV+ +T+ T V KE+ E
Sbjct: 1061 TTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKV---ETE 1117

Query: 75 IIGDAAESGGGDAPSQPEPEAEPAAAAQPEPEAEPEPQPEAK-PQS-GGSSAAGGDATPV 132
+ + +P Q + E A EP E +P K PQS ++A
Sbjct: 1118 KTQEVPKVTSQVSPKQEQSETVQPQA---EPARENDPTVNIKEPQSQTNTTADTEQPAKE 1174

Query: 133 LMPELGESVAEGTVTRWLKKVGDSVQVDEALVEVSTDKVDTE 174
+ + V E T V ++ + T ++
Sbjct: 1175 TSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESS 1216


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3261  NUCEPIMERASE  37     5e-05    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 37.5 bits (87), Expect = 5e-05
Identities = 31/150 (20%), Positives = 50/150 (33%), Gaps = 32/150 (21%)

Query: 6 VAIAGSSGLIGSALAAALRAADHRVLRI--------VRRTPANSEELHWNPESGEF---- 53
+ G++G IG ++ L A H+V+ I V A E L + G
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELL---AQPGFQFHKI 59

Query: 54 ---DPDALTD------VDVVINLC---GVGIGRRRWSGAFKQSLRDSRITPTEVLSSAVA 101
D + +TD + V V R+S + DS +T +
Sbjct: 60 DLADREGMTDLFASGHFERVFISPHRLAV-----RYSLENPHAYADSNLTGFLNILEGCR 114

Query: 102 DAGVPTLINASAVGYYGDTRDRVVDENDPA 131
+ L+ AS+ YG R +D
Sbjct: 115 HNKIQHLLYASSSSVYGLNRKMPFSTDDSV 144


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_3263  PF05272  31     0.008    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.8 bits (69), Expect = 0.008
Identities = 21/143 (14%), Positives = 42/143 (29%), Gaps = 38/143 (26%)

Query: 3 RSQSAVEVIDLVKRRGSVTAVDGISFAVPPG----GVLGLLGPNGAGKTTTVRMLATLTR 58
+ ++ G + ++ + PG + L G G GK+T + L
Sbjct: 562 PDDYKPRRLRYLQLVGKYILMGHVARVMEPGCKFDYSVVLEGTGGIGKSTLINTLV---- 617

Query: 59 PTSGAAWVAGH--DVCAAPESVRREIGLTCQEATLDGLLTARENINMIGSLRGIRRKELA 116
G + + D+ +S + G+ E + + RR +
Sbjct: 618 ---GLDFFSDTHFDIGTGKDSYEQIAGIVAYE---------------LSEMTAFRRADAE 659

Query: 117 SLTDRLLDQFSIAEFADRRVDTY 139
++ F R D Y
Sbjct: 660 ----------AVKAFFSSRKDRY 672


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3264  ACRIFLAVINRP  46     7e-07    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 46.0 bits (109), Expect = 7e-07
Identities = 47/289 (16%), Positives = 104/289 (35%), Gaps = 42/289 (14%)

Query: 212 IGVIVVMLLVIYGSVTTALVVLVMVVLQLAAARGVVALLGYHGLVGLSIFSTNVLVTLVI 271
I ++ +++ + ++ L+ + V + L ++A GY ++ + + +V+
Sbjct: 348 IMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGY----SINTLT---MFGMVL 400

Query: 272 AAGT--DYAIFLVGRYQEARSAGQD--RESAFFTMFGGTAHVVLGSGLTIAGAMLCLSF- 326
A G D AI +V + + +E+ +M ++G + ++ + ++F
Sbjct: 401 AIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSM-SQIQGALVGIAMVLSAVFIPMAFF 459

Query: 327 --TRLPYLQTLGVPLAVGMTVGVLAALTLGPALIAV------------TSRFGKVLEPRR 372
+ + + + M + VL AL L PAL A F
Sbjct: 460 GGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTF 519

Query: 373 QLRVRRWRKLGAAIARWPGPILITASVLALGGLLVLPGYRASFDDRNYLPKDVPANIGYA 432
V + I G L+ L + G++VL ++LP++ + G
Sbjct: 520 DHSVNHYTNSVGKILGSTGRYLL-IYALIVAGMVVL----FLRLPSSFLPEE---DQGVF 571

Query: 433 AAERGFGAARMNPDVLLVESNHDLRNSADLLVIDKIA--KAIFAVEGIS 479
++ + L D + ++ A +++F V G S
Sbjct: 572 -----LTMIQLPAGATQERTQKVLDQVTDYYLKNEKANVESVFTVNGFS 615



Score = 37.5 bits (87), Expect = 3e-04
Identities = 40/237 (16%), Positives = 80/237 (33%), Gaps = 39/237 (16%)

Query: 148 YVQVNLMGLQGHARSIRSVKAVQRIVDSTPA--PDGVTTFVTGSAALMVDQQTVGSRSMR 205
Y + M +QG A S ++++ + P G+ TG M Q+ +
Sbjct: 818 YNGLPSMEIQGEAAPGTSSGDAMALMENLASKLPAGIGYDWTG----MSYQERLSGNQAP 873

Query: 206 LVELVTIGVIVVMLLVIYGSVTTALVVLVMVVLQLAAARGVVALLGYHGLVGLSIFSTNV 265
+ ++ V+ + L +Y S + + V+++V L + L ++
Sbjct: 874 ALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQK----NDVYFMVG 929

Query: 266 LVTLVIAAGTDYAIFLVGRYQEARSA-GQDRESA---------------FFTMFGGTAHV 309
L+T + + + AI +V ++ G+ A G +
Sbjct: 930 LLTTIGLSAKN-AILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPL 988

Query: 310 VLGSGLTIAGAMLCLSFTRLPYLQTLGVPLAVGMTVGVLAALTLGPALIAVTSRFGK 366
+ +G AG+ +G+ + GM L A+ P V R K
Sbjct: 989 AISNG---AGSGA---------QNAVGIGVMGGMVSATLLAIFFVPVFFVVIRRCFK 1033



Score = 34.0 bits (78), Expect = 0.004
Identities = 33/171 (19%), Positives = 62/171 (36%), Gaps = 7/171 (4%)

Query: 781 IAALSLIFLIMLNITRSAIAALVIVGSVAASLGASVGLSVLLWQHLIGIELHWLVLSMSV 840
A+ L+FL+M ++ A L+ +V L + + + + + +VL++ +
Sbjct: 345 FEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGL 404

Query: 841 IVLLAVGADYNLLLVSRLKEELHAGINTAIIRTVGATGSVATSAGLVFAFTMISMAV--- 897
+V A+ N V R+ E A +++ +V + I MA
Sbjct: 405 LVDDAIVVVEN---VERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGG 461

Query: 898 SDLTVIAQIGTTIGMGLLFDTFVVRALMTPSIAVLLGRWFWWPHHVRPRPI 948
S + Q TI + V L TP++ L + HH
Sbjct: 462 STGAIYRQFSITIVSAMALSVLVALIL-TPALCATLLKPVSAEHHENKGGF 511


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3266  ACRIFLAVINRP  44     2e-06    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 44.4 bits (105), Expect = 2e-06
Identities = 47/293 (16%), Positives = 103/293 (35%), Gaps = 42/293 (14%)

Query: 208 LLVTFAVIVVMLLVIYGSVTTALAVLVMVVLQLAAARGVVALLGYHGLVGLSIFSTNVLV 267
L ++ +++ + ++ L + V + L ++A GY ++ + +
Sbjct: 344 LFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGY----SINTLT---MF 396

Query: 268 TLVIAAGT--DYAIFLVGRYQEARSAGQD--RESAFFTMFGGTAHVVLGSGLTIAGAMLC 323
+V+A G D AI +V + + +E+ +M ++G + ++ +
Sbjct: 397 GMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSM-SQIQGALVGIAMVLSAVFIP 455

Query: 324 LSF---TRLPYLQTLGVPLAVGMTVGVLAALTLGPALIAV------------TSRFGKVL 368
++F + + + + M + VL AL L PAL A F
Sbjct: 456 MAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWF 515

Query: 369 EPRRQLRVRRWRKLGAAIARWPGPILITASVLALGGLLVLPGYRASFDDRNYLPKDVPAN 428
V + I G L+ L + G++VL ++LP++ +
Sbjct: 516 NTTFDHSVNHYTNSVGKILGSTGRYLL-IYALIVAGMVVL----FLRLPSSFLPEE---D 567

Query: 429 IGYAAAERGFGAARMNPDVLLVESNHDLRNSADLLVIDKIA--KAIFAVEGIS 479
G ++ + L D + ++ A +++F V G S
Sbjct: 568 QGVF-----LTMIQLPAGATQERTQKVLDQVTDYYLKNEKANVESVFTVNGFS 615



Score = 37.9 bits (88), Expect = 2e-04
Identities = 35/219 (15%), Positives = 77/219 (35%), Gaps = 39/219 (17%)

Query: 164 ESVAAVQGTLRDMPGPDGVNAFLTGPAVVLADQQIAGDRSMRLILLVTFAVIVVMLLVIY 223
+++A ++ +P G+ TG ++ Q+ ++ ++F V+ + L +Y
Sbjct: 838 DAMALMENLASKLP--AGIGYDWTG----MSYQERLSGNQAPALVAISFVVVFLCLAALY 891

Query: 224 GSVTTALAVLVMVVLQLAAARGVVALLGYHGLVGLSIFSTNVLVTLVIAAGTDYAIFLVG 283
S + ++V+++V L + L ++ L+T + + + AI +V
Sbjct: 892 ESWSIPVSVMLVVPLGIVGVLLAATLFNQK----NDVYFMVGLLTTIGLSAKN-AILIVE 946

Query: 284 RYQEARSA-GQDRESA---------------FFTMFGGTAHVVLGSGLTIAGAMLCLSFT 327
++ G+ A G + + +G AG+
Sbjct: 947 FAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNG---AGSGA----- 998

Query: 328 RLPYLQTLGVPLAVGMTVGVLAALTLGPALIAVTSRFGK 366
+G+ + GM L A+ P V R K
Sbjct: 999 ----QNAVGIGVMGGMVSATLLAIFFVPVFFVVIRRCFK 1033



Score = 34.0 bits (78), Expect = 0.003
Identities = 33/171 (19%), Positives = 62/171 (36%), Gaps = 7/171 (4%)

Query: 781 IAALSLIFLIMLNITRSAIAALVIVGSVAASLGASVGLSVLLWQHLIGIELHWLVLSMSV 840
A+ L+FL+M ++ A L+ +V L + + + + + +VL++ +
Sbjct: 345 FEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGL 404

Query: 841 IVLLAVGADYNLLLVSRLKEELHAGINTAIIRTVGATGSVATSAGLVFAFTMISMAV--- 897
+V A+ N V R+ E A +++ +V + I MA
Sbjct: 405 LVDDAIVVVEN---VERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGG 461

Query: 898 SDLTVIAQIGTTIGMGLLFDTFVVRALMTPSIAVLLGRWFWWPHHVRPRPI 948
S + Q TI + V L TP++ L + HH
Sbjct: 462 STGAIYRQFSITIVSAMALSVLVALIL-TPALCATLLKPVSAEHHENKGGF 511


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3273  ISCHRISMTASE  27     0.014    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 26.9 bits (59), Expect = 0.014
Identities = 14/52 (26%), Positives = 22/52 (42%), Gaps = 1/52 (1%)

Query: 37 LNMSPGSIDVDLPLPECGIDSAMSLSLCADLQREHGIEADATIVWDYPTIRA 88
L +P I L + G+DS ++L +RE E + + PTI
Sbjct: 243 LQETPEDITDQEDLLDRGLDSVRIMTLVEQWRREGA-EVTFVELAERPTIEE 293


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3274  NUCEPIMERASE  27     0.006    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 27.1 bits (60), Expect = 0.006
Identities = 9/26 (34%), Positives = 16/26 (61%)

Query: 8 ITGSSGLIGSALAAALRVADHRALRI 33
+TG++G IG ++ L A H+ + I
Sbjct: 5 VTGAAGFIGFHVSKRLLEAGHQVVGI 30


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3287  PREPILNPTASE  29     0.014    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 29.0 bits (65), Expect = 0.014
Identities = 10/46 (21%), Positives = 17/46 (36%), Gaps = 2/46 (4%)

Query: 47 LPYMIGAFVLIVGISVAVGVWAGGLTMITMIPFG--LLLGGLVAFI 90
LP ++ L+ + IPFG L + G +A +
Sbjct: 231 LPIVLLLSSLVGAFMGIGLILLRNHHQSKPIPFGPYLAIAGWIALL 276


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_3290  cloacin  39     7e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.9 bits (90), Expect = 7e-05
Identities = 37/123 (30%), Positives = 49/123 (39%), Gaps = 3/123 (2%)

Query: 361 GAGGAAGNAGLISGTGGVGGQGGVGGFGEGGAGGSGGGAGLIGNGGSGGTGGNAVGNSGV 420
G G N G S +G + G G G G + GSG + GG G+G + G SG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 421 GGHGGAGGQGGRLYGNGGVGGNGGFSGPITAGGAGGTGGTGGSAGLFGDGGAGGAGGASG 480
G GG G GG G+G G + P+ G + G + GA A A
Sbjct: 63 GNGGGNGNSGG---GSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADI 119

Query: 481 FAS 483
A+
Sbjct: 120 MAA 122



Score = 35.1 bits (80), Expect = 0.001
Identities = 33/85 (38%), Positives = 38/85 (44%), Gaps = 9/85 (10%)

Query: 468 GDGGAGGAGGASGFASGGTGGIGGTGGLLFGAAGDGGNGGFGAGGRGGSGGAGGDAWLFG 527
G G GA SG +GG G+G GG A DG G+ + GG+G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGG-----ASDGS--GWSSENNPWGGGSGSGIHWGG 58

Query: 528 SGGSGGGGGAGAINDGGSGGAGGNG 552
G G GGG G N GG G GGN
Sbjct: 59 GSGHGNGGGNG--NSGGGSGTGGNL 81



Score = 34.7 bits (79), Expect = 0.001
Identities = 34/107 (31%), Positives = 42/107 (39%), Gaps = 5/107 (4%)

Query: 132 GGDGGILFGSGGAGGSGAGGQDGGAGGRAGLFGNGGAGGAGGTGQTQGGAGGAGGLFFGN 191
GGDG G S +G +GG G G G GG G+G + G
Sbjct: 3 GGDGR---GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 192 GGAGGPGGSGGLNGGAGGAGGVGGLLFGAGGAGGAGGTGTSGIGGLG 238
G G GG+G GG+G G + + A A G T G GGL
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAV--AAPVAFGFPALSTPGAGGLA 104



Score = 33.5 bits (76), Expect = 0.003
Identities = 23/77 (29%), Positives = 31/77 (40%)

Query: 582 GGAGGNGGSAGLVGNGGAGGAGGQGGLIVSADGNNGGAGGDGGGAGLLLGAGGAGGQGGL 641
G G ++G + G G G G S + G G G+G+ G G G GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 642 GGNTGDGGNGGNGGNAA 658
GN+G G G +A
Sbjct: 68 NGNSGGGSGTGGNLSAV 84



Score = 33.5 bits (76), Expect = 0.003
Identities = 34/100 (34%), Positives = 40/100 (40%), Gaps = 7/100 (7%)

Query: 164 GNGGAGGAGGTGQTQGGA-GGAGGLFFGNGGAGGPGGS------GGLNGGAGGAGGVGGL 216
G G G G T G GG GL G G + G G S GG +G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 217 LFGAGGAGGAGGTGTSGIGGLGGDGGSAGALSISAGGAGG 256
G G GG+GT G + G ++S GAGG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 32.4 bits (73), Expect = 0.007
Identities = 31/109 (28%), Positives = 37/109 (33%)

Query: 144 AGGSGAGGQDGGAGGRAGLFGNGGAGGAGGTGQTQGGAGGAGGLFFGNGGAGGPGGSGGL 203
+GG G G G + G G GG G + G G+G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 204 NGGAGGAGGVGGLLFGAGGAGGAGGTGTSGIGGLGGDGGSAGALSISAG 252
+G GG G GG G G L G A+SISAG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 32.0 bits (72), Expect = 0.009
Identities = 33/102 (32%), Positives = 39/102 (38%), Gaps = 10/102 (9%)

Query: 618 GAGGDGGGAGLLLGAGGAGGQGGLGGNTGDGGNGGNGGNAALIGDGGSGGAGGDAGSGDA 677
G G + G G G+GG DG + N G G GG +G G+
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 678 GDGGDGGDARLVGSGGNGGNGGFSATPAAGG-----TGGAGG 714
G G+ G G G GGN A P A G T GAGG
Sbjct: 66 GGNGNSG-----GGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 31.6 bits (71), Expect = 0.014
Identities = 24/75 (32%), Positives = 26/75 (34%)

Query: 279 GHGGGGGAGGTGTGMGVDNDGIGGAGGAGGSGGWLIGTGGTGGTGGFGDGPLGGQGGDGG 338
G G GA T + G+G GGA GW GG G G GG G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 339 NAGLFGVGGDGGLGG 353
GG G G
Sbjct: 66 GGNGNSGGGSGTGGN 80



Score = 30.8 bits (69), Expect = 0.019
Identities = 31/104 (29%), Positives = 34/104 (32%)

Query: 206 GAGGAGGVGGLLFGAGGAGGAGGTGTSGIGGLGGDGGSAGALSISAGGAGGNGGTGLSGF 265
G G G G +G G G G G G S+ G G G SG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 266 GGAGGAGGNAGLYGHGGGGGAGGTGTGMGVDNDGIGGAGGAGGS 309
G GG G + G G GG A G GAGG S
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVS 106



Score = 30.8 bits (69), Expect = 0.019
Identities = 30/81 (37%), Positives = 35/81 (43%), Gaps = 6/81 (7%)

Query: 352 GGTGFFGVGGAGGAAGNAGLISGTGGVGGQGGVG-GFGE-----GGAGGSGGGAGLIGNG 405
GG G GA +GN GVGG G G+ GG GSG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 406 GSGGTGGNAVGNSGVGGHGGA 426
G+GG GN+ G SG GG+ A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSA 83



Score = 30.8 bits (69), Expect = 0.020
Identities = 29/86 (33%), Positives = 37/86 (43%), Gaps = 2/86 (2%)

Query: 330 LGGQGGDGGNAGLFGVGGDGGLGGTGFFGVGGAGGAAGNAGLISGTGGVGGQGGVGGFGE 389
+ G G G N G G+ G TG GGA +G + + GG G G +G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGI--HWGG 58

Query: 390 GGAGGSGGGAGLIGNGGSGGTGGNAV 415
G G+GGG G G G G +AV
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSAV 84



Score = 30.5 bits (68), Expect = 0.029
Identities = 33/106 (31%), Positives = 40/106 (37%), Gaps = 5/106 (4%)

Query: 322 TGGFGDGPLGGQGGDGGNAGLFGVGGDGGLGGTGFFGVGGAGGAAGNAGLISGTGGVGGQ 381
+GG G G G GN GG GLG G G+G ++ N G+G
Sbjct: 2 SGGDGRGHNTGAHSTSGNIN----GGPTGLGVGGG-ASDGSGWSSENNPWGGGSGSGIHW 56

Query: 382 GGVGGFGEGGAGGSGGGAGLIGNGGSGGTGGNAVGNSGVGGHGGAG 427
GG G G GG G+ GG G S A G + G G
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 30.1 bits (67), Expect = 0.036
Identities = 33/113 (29%), Positives = 41/113 (36%), Gaps = 3/113 (2%)

Query: 372 ISGTGGVGGQGGVGGFGEGGAGGSGGGAGLIGNGGSGGTGGNAVGNSGVGGHGGAGGQGG 431
+SG G G G GG G G G S G+G ++ N GG G GG
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGV--GGGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 432 RLYGNGGVGGNGGFSGPITAGGAGGTGGTGGSAGLFGDGGAGGAGGASGFASG 484
G+G GGNG G GG + G G G A ++G
Sbjct: 59 G-SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3291  VACCYTOTOXIN  31     0.008    Helicobacter pylori vacuolating cytotoxin signature.
		>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature.

Length = 1291

Score = 30.8 bits (69), Expect = 0.008
Identities = 17/45 (37%), Positives = 22/45 (48%), Gaps = 3/45 (6%)

Query: 67 VLVGALAQVT---RRLKFVTTVYIPAMRNPYSAAKAIGTAALLAS 108
LVGAL +T F TTV IPA+ + A+GT + L
Sbjct: 18 ALVGALVSITPQQSHAAFFTTVIIPAIVGGIATGAAVGTVSGLLG 62


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_3303  cloacin  39     6e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 39.3 bits (91), Expect = 6e-05
Identities = 31/102 (30%), Positives = 39/102 (38%)

Query: 372 GGVGTSTNGGAAGGTGGRGGLSGGVGGTDGAGGDGGQGGSGGAVGTGATAGAGGAGGAGA 431
GG G N GA +G G G+G GA G G G+ +G GG+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 432 AATSGGPGYGGGGGGGGGGARVLPTSGGITSGTATGGEGGVG 473
G GGG G GG + V +T G GG+
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 37.8 bits (87), Expect = 2e-04
Identities = 33/102 (32%), Positives = 39/102 (38%)

Query: 413 GAVGTGATAGAGGAGGAGAAATSGGPGYGGGGGGGGGGARVLPTSGGITSGTATGGEGGV 472
G G G GA G +G GG G G + P GG SG GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 473 GGAGTTVAAGGVGGHGGNATLAASDAGSSADGTATGGAGGAG 514
G G +GG G GGN + A+ +T GAGG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 35.8 bits (82), Expect = 7e-04
Identities = 35/113 (30%), Positives = 44/113 (38%), Gaps = 10/113 (8%)

Query: 329 TGGNGGDGGSGGVAGAGGSGGFLGSAGATGGAGDGGRGGTGGLGGVGTSTNGGAAGGTGG 388
+GG+G +G + +G G G GGA DG G S+ GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGS----------GWSSENNPWGGGSG 51

Query: 389 RGGLSGGVGGTDGAGGDGGQGGSGGAVGTGATAGAGGAGGAGAAATSGGPGYG 441
G GG G GG+G GG G G + A A G A +T G G
Sbjct: 52 SGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 34.7 bits (79), Expect = 0.002
Identities = 31/113 (27%), Positives = 39/113 (34%)

Query: 398 GTDGAGGDGGQGGSGGAVGTGATAGAGGAGGAGAAATSGGPGYGGGGGGGGGGARVLPTS 457
G DG G + G + G + G T G G + + S GGG G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 458 GGITSGTATGGEGGVGGAGTTVAAGGVGGHGGNATLAASDAGSSADGTATGGA 510
G +GG G GG + VAA G +T A S A A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 34.3 bits (78), Expect = 0.002
Identities = 21/82 (25%), Positives = 31/82 (37%)

Query: 567 NGAGSSASGSAIGGYGGAGTTGGLGGVGGYAQVSASNGGTATGTVSGGYGGAGTTGGHGG 626
N S SG+ GG G G GG G++ + GG + + G G GG G
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69

Query: 627 GGGNAFLRANGAGSSASGISIG 648
G + A+ ++ G
Sbjct: 70 NSGGGSGTGGNLSAVAAPVAFG 91



Score = 33.9 bits (77), Expect = 0.003
Identities = 30/106 (28%), Positives = 36/106 (33%), Gaps = 2/106 (1%)

Query: 279 GHGGPGGVGGVGGRVVGLFGSGGGGGAGGDGGVGGTGAQGAPGTQSFSAGTGGNGGDGGS 338
G G G G + G G GG DG + G GG G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 339 GGVAGAGGSGGFLGSAGATGGAGDGGRGG--TGGLGGVGTSTNGGA 382
GG +GG G G+ A G T G GG+ S + GA
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 33.9 bits (77), Expect = 0.003
Identities = 31/94 (32%), Positives = 39/94 (41%), Gaps = 4/94 (4%)

Query: 501 SADGTATGGAGGAGGTNHASGGRGGDATISTSGGGTITGGTATGGVGGAGTTGGSGGGGG 560
S G GG G G AS G G + + GGG+ +G GG G G GGG
Sbjct: 15 STSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGG 74

Query: 561 SGALFANGAGSSASGSAIGGYGGAGTTGGLGGVG 594
SG G SA + + A +T G GG+
Sbjct: 75 SGT----GGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 33.9 bits (77), Expect = 0.003
Identities = 27/84 (32%), Positives = 35/84 (41%)

Query: 594 GGYAQVSASNGGTATGTVSGGYGGAGTTGGHGGGGGNAFLRANGAGSSASGISIGGAGGA 653
GG + + + +G ++GG G G GG G G + G S SGI GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 654 GTAGGYGGSGNYASIRGYSGATVT 677
G GG G SG + G A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 33.5 bits (76), Expect = 0.004
Identities = 26/82 (31%), Positives = 31/82 (37%)

Query: 532 SGGGTITGGTATGGVGGAGTTGGSGGGGGSGALFANGAGSSASGSAIGGYGGAGTTGGLG 591
+ G T G GG G G GG+ G G + G S SG GG G G GG G
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69

Query: 592 GVGGYAQVSASNGGTATGTVSG 613
GG + + A G
Sbjct: 70 NSGGGSGTGGNLSAVAAPVAFG 91



Score = 33.1 bits (75), Expect = 0.005
Identities = 30/93 (32%), Positives = 38/93 (40%), Gaps = 5/93 (5%)

Query: 140 GGPGGWLWGDGGDGGSGTPGTATSPAGGAGGAGGSAFLFGSGGHGGSGGTAYAGSGAVGG 199
GGP G G G S G ++ GG+G G GHG GG +G G+ G
Sbjct: 22 GGPTG---LGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTG 78

Query: 200 TGGNGGAGGLIFG--GGGAGGVGGLGAAGSATA 230
+ A + FG G GGL + SA A
Sbjct: 79 GNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 33.1 bits (75), Expect = 0.005
Identities = 32/108 (29%), Positives = 43/108 (39%), Gaps = 5/108 (4%)

Query: 512 GAGGTNHASGGRGGDATISTSGGGTITGGTATGGVGGAGTTGGSGGGGGSGALFANGAGS 571
G G H +G I+ G GG A+ G G + GGG GSG + G+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 572 SASGSAIGGYGGAGTTGGLGGVG-----GYAQVSASNGGTATGTVSGG 614
G GG+GT G L V G+ +S G ++S G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 31.2 bits (70), Expect = 0.019
Identities = 31/113 (27%), Positives = 38/113 (33%)

Query: 233 GMGGHGGAGGTSYQLGGAAGAGGAGGQGGLGYDAAADPMAAAGASGGHGGPGGVGGVGGR 292
G G + GA TS + G G GG G +++ G SG GG G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 293 VVGLFGSGGGGGAGGDGGVGGTGAQGAPGTQSFSAGTGGNGGDGGSGGVAGAG 345
GG G G V A G P + AG G+ A A
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 31.2 bits (70), Expect = 0.021
Identities = 30/89 (33%), Positives = 34/89 (38%), Gaps = 8/89 (8%)

Query: 305 AGGDGGVGGTGAQGAPGTQSFSAGTGGNGGDGGSGGVAGAGGSGGFLGSAGATGGAGDGG 364
+GGDG TGA G NGG G G GA G+ GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNI--------NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSG 53

Query: 365 RGGTGGLGGVGTSTNGGAAGGTGGRGGLS 393
GG G NG + GG+G G LS
Sbjct: 54 IHWGGGSGHGNGGGNGNSGGGSGTGGNLS 82



Score = 30.8 bits (69), Expect = 0.026
Identities = 33/109 (30%), Positives = 41/109 (37%), Gaps = 2/109 (1%)

Query: 168 AGGAGGSAFLFGSGGHGGSGGTAYAGSGAVGGTGGNGGAGGLIFGGGGAGGVGGLGAAGS 227
G S + G G GG A GSG GG G GG G G G G+
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGN 70

Query: 228 ATAAPGMGGHGGAGGTSYQLGGAAGAGGAGGQGGLGYDAAADPMAAAGA 276
+ G GG+ A G A + G GGL +A ++AA A
Sbjct: 71 SGGGSGTGGNLSAVAAPVAFGFPALS--TPGAGGLAVSISAGALSAAIA 117



Score = 30.5 bits (68), Expect = 0.029
Identities = 28/85 (32%), Positives = 33/85 (38%), Gaps = 1/85 (1%)

Query: 148 GDGGDGGSGTPGTATSPAGGAGGAGGSAFLFGSGGHGGSGGTAYAGSGAVGGTGGNGGAG 207
GDG +G T+ + GG G G G GSG+ GG G G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 208 GLIFGGGGAGGVGGLGAAGSATAAP 232
G G +GG G G SA AAP
Sbjct: 64 NG-GGNGNSGGGSGTGGNLSAVAAP 87



Score = 30.1 bits (67), Expect = 0.045
Identities = 28/83 (33%), Positives = 31/83 (37%), Gaps = 1/83 (1%)

Query: 301 GGGGAGGDGGVGGTGAQGAPGTQSFSAGTGGNGGDGGSGGVAGAGGSGGFLGSAGATGGA 360
GG G G + G T G G G + G G S GG G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 361 GDGGRGGTGGLGGVGTSTNGGAA 383
G+GG G G GG GT N A
Sbjct: 63 GNGGGNGNSG-GGSGTGGNLSAV 84


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
MMAR_3305  RTXTOXIND  31     0.005    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 30.6 bits (69), Expect = 0.005
Identities = 26/156 (16%), Positives = 50/156 (32%), Gaps = 21/156 (13%)

Query: 35 SAFEQVRVQHEAVSDRLAAVRIALEDLDAQVSRLEDEIDAVRKREDRDRSLLTSGAVDAK 94
F + Q L R + A+++R E+ + R D SLL A+
Sbjct: 193 EQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIA-- 250

Query: 95 QLADLQHELETLERRQASLEDSLLEVMERREELQAQQNTEIAALEVLQAELTAAQQSVDA 154
+H + E + E + + LE +++E+ +A++
Sbjct: 251 -----KHAVLEQENKYV--------------EAVNELRVYKSQLEQIESEILSAKEEYQL 291

Query: 155 ALAELDQSRQEHSSRRDTLAASLNPDLAALYERLRA 190
+ + L +LA ER +A
Sbjct: 292 VTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQA 327



Score = 29.0 bits (65), Expect = 0.017
Identities = 19/132 (14%), Positives = 40/132 (30%), Gaps = 5/132 (3%)

Query: 62 DAQVSRLEDEIDAVRKREDRDRSLLTSGAVDAKQLADLQHE--LETLERRQASLEDSLLE 119
+A + + + R + R + L S ++ L E + + + SL++
Sbjct: 133 EADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIK 192

Query: 120 VMERREELQAQQNTEIAALEVLQAELTAAQQSVDAALAELDQSRQEHSSRRDTLAASLNP 179
E+ Q Q+ + L+ +AE ++ + L
Sbjct: 193 --EQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIA 250

Query: 180 DLAAL-YERLRA 190
A L E
Sbjct: 251 KHAVLEQENKYV 262


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
MMAR_3314  HTHFIS  29     0.025    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.4 bits (66), Expect = 0.025
Identities = 12/47 (25%), Positives = 18/47 (38%)

Query: 87 HQDAFPLLITHGWPGSVVEFHKVIEPLTNPTAHGGRAEDAFHVVCPS 133
Q+A L+ H WPG+V E ++ LT + S
Sbjct: 339 DQEALELMKAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRS 385


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_3316  cloacin  37     5e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 36.6 bits (84), Expect = 5e-04
Identities = 30/83 (36%), Positives = 35/83 (42%)

Query: 442 GSGGNGGNGGISGLIGNGGAGGAGGNGSAAGYNGYSGNGGNGGNGGAAQLIGAGGGGGVA 501
G G N G SG I G G G G++ G S N GG G+ G G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 502 GIGGAGGTGAAAGVTGSAGASGV 524
G G G G+ G SA A+ V
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPV 88



Score = 35.5 bits (81), Expect = 0.001
Identities = 35/108 (32%), Positives = 42/108 (38%), Gaps = 8/108 (7%)

Query: 421 NGGAGGAGGASGAGIGGDSLGGSGGNGGNGGISGLIG----NGGAGGAGGNGSAAGYNGY 476
+GG G G+ GG G G GG S G N GG G+G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 477 SGNGGNGGNGGAAQLIGAGGGGGVAGIGGAGGTGAAAGVTGSAGASGV 524
GNGG GN G G+G GG ++ + G A T AG V
Sbjct: 62 HGNGGGNGNSGG----GSGTGGNLSAVAAPVAFGFPALSTPGAGGLAV 105



Score = 34.7 bits (79), Expect = 0.002
Identities = 26/76 (34%), Positives = 33/76 (43%)

Query: 170 GIGGAGGSGGAGGTGGWLYGNGGAGGAGGAGAPGLVSFNGSNGGNGGNGGAAGWWGTGGA 229
G G G + GA T G + G G GG + G + +N GG+G W G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 230 GGAGGEGGAAGGGPAG 245
G GG G + GG G
Sbjct: 63 GNGGGNGNSGGGSGTG 78



Score = 34.3 bits (78), Expect = 0.002
Identities = 37/111 (33%), Positives = 45/111 (40%), Gaps = 4/111 (3%)

Query: 308 GDGGAGGQGGDGGGVTGLVGAGHGGAGGDGGRAGWLSGNAGAGGGGGAGGAVDNHGSGGD 367
G G G G + G G G G +GW S N GGG G+G + G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDG-SGWSSENNPWGGGSGSG---IHWGGGSG 61

Query: 368 YAGGGGAGGSGGAAGLFGNGAAGGAGGAGGASQTFTGAGGAGGTGGAGGWL 418
+ GGG G SGG +G GN +A A A G T G + G L
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGAL 112



Score = 33.9 bits (77), Expect = 0.004
Identities = 31/81 (38%), Positives = 34/81 (41%), Gaps = 1/81 (1%)

Query: 385 GNGAAGGAGGAGG-ASQTFTGAGGAGGTGGAGGWLYGNGGAGGAGGASGAGIGGDSLGGS 443
G G GA G + TG G GG GW N GG G+ GG G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 444 GGNGGNGGISGLIGNGGAGGA 464
GGNG +GG SG GN A A
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAA 86



Score = 33.1 bits (75), Expect = 0.005
Identities = 23/64 (35%), Positives = 27/64 (42%)

Query: 122 ADGTAPGQAGGAGGLLYGNGGNGAAGTNPGVAGGAGGAAGLIGNGGAGGIGGAGGSGGAG 181
G G G G + G+G + N GG+G G G G GG G SGG
Sbjct: 16 TSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGS 75

Query: 182 GTGG 185
GTGG
Sbjct: 76 GTGG 79



Score = 33.1 bits (75), Expect = 0.006
Identities = 34/102 (33%), Positives = 39/102 (38%), Gaps = 3/102 (2%)

Query: 571 GSGAAGQAGGAGGAAGLIGNGGAGGTGGSGAAGGSGAAGGN---GGAGGWLFGNGGTGGT 627
G G GA +G I G G G GA+ GSG + N GG G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 628 GGDAVAGLPGLNGGNGGNGGAGGAAGWWGHGGIGGQGGTGGA 669
G G G G GGN A A +G + G G A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 33.1 bits (75), Expect = 0.006
Identities = 29/86 (33%), Positives = 34/86 (39%), Gaps = 1/86 (1%)

Query: 351 GGGGAGGAVDNHGSGGDYAGG-GGAGGSGGAAGLFGNGAAGGAGGAGGASQTFTGAGGAG 409
GG G G H + G+ GG G G GGA+ G + G G S G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 410 GTGGAGGWLYGNGGAGGAGGASGAGI 435
G GG G G G GG A A +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPV 88



Score = 33.1 bits (75), Expect = 0.006
Identities = 30/100 (30%), Positives = 35/100 (35%), Gaps = 1/100 (1%)

Query: 604 GSGAAGGNGGAGGWLFGNGGTGGTGGDAVAGLPGLNGGNGGNGGAGGAAGWWGHGGIGGQ 663
G G G G + G G GG A G G + N GG G+ WG G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDG-SGWSSENNPWGGGSGSGIHWGGGSGHGN 64

Query: 664 GGTGGAAGGNPGIYAGNAGTGGDGGAGGWLFGDAGAGGQG 703
GG G +GG G + G GAGG
Sbjct: 65 GGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.4 bits (73), Expect = 0.009
Identities = 28/108 (25%), Positives = 34/108 (31%)

Query: 546 GADGAPGTGQAGGDGGWLYGNGGAGGSGAAGQAGGAGGAAGLIGNGGAGGTGGSGAAGGS 605
G DG A G + G G G G + GG+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 606 GAAGGNGGAGGWLFGNGGTGGTGGDAVAGLPGLNGGNGGNGGAGGAAG 653
G GGNG +GG G G P L+ G +AG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 32.0 bits (72), Expect = 0.012
Identities = 26/99 (26%), Positives = 30/99 (30%)

Query: 331 GGAGGDGGRAGWLSGNAGAGGGGGAGGAVDNHGSGGDYAGGGGAGGSGGAAGLFGNGAAG 390
G G A SGN G G G + GSG GGSG G G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 391 GAGGAGGASQTFTGAGGAGGTGGAGGWLYGNGGAGGAGG 429
GG G + G + + GAGG
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 31.2 bits (70), Expect = 0.024
Identities = 28/113 (24%), Positives = 35/113 (30%), Gaps = 8/113 (7%)

Query: 211 NGGNGGNGGAAGWWGTGGAGGAGGEGGAAGGGPAGIAYGQTGGVGGNGGNGGNGGWFAGN 270
+GG+G +G G G GG G + G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 271 AGAGGNGGVGGDGNAADNGLVGGTGGVGGAGGSAGLFGDGGAGGQGGDGGGVT 323
G GG G G G+ GTGG A + FG G G V+
Sbjct: 62 HGNGGGNGNSGGGS--------GTGGNLSAVAAPVAFGFPALSTPGAGGLAVS 106



Score = 30.8 bits (69), Expect = 0.028
Identities = 29/104 (27%), Positives = 38/104 (36%), Gaps = 3/104 (2%)

Query: 280 GGDGNAADNGLVGGTGGVGGAGGSAGLFGDGGAGGQGGDGGGVTGLVGAGHGGAGGDGGR 339
GGDG + G +G + G G+ GG G G G G GG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGV---GGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 340 AGWLSGNAGAGGGGGAGGAVDNHGSGGDYAGGGGAGGSGGAAGL 383
+G +G GGG+G + A G A + GA GL
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGL 103



Score = 30.8 bits (69), Expect = 0.029
Identities = 37/105 (35%), Positives = 45/105 (42%), Gaps = 8/105 (7%)

Query: 641 GNGGNGGAGGAAGWWGHGGIGGQGGTGGAAGGNPGIYAGNAGTGGDGGAGGWLFGDAGAG 700
G G N GA +G +GG G G GGA+ G G + N GG G+G G +G G
Sbjct: 6 GRGHNTGAHSTSGNI-NGGPTGLGVGGGASDG-SGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 701 GQGGTGGAANDPLVTSSTGGSGGAGGNGGAAGL--FGAGGAGGTG 743
GG G + S TGG+ A A G GAGG
Sbjct: 64 NGGGNGNSGG----GSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3319  NUCEPIMERASE  93     9e-24    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 93.3 bits (232), Expect = 9e-24
Identities = 61/359 (16%), Positives = 111/359 (30%), Gaps = 73/359 (20%)

Query: 17 TIFITGANGFIGRAMAARFRALGAVVRGVD----------------LAADPAGDVVAGDI 60
+TGA GFIG ++ R G V G+D L A P D+
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 61 TRPETWAAGPTGLEGADTVIHTAALLGAAFPLKQAW---HVNVLGTSRVLRAAIDAGVRR 117
E + V + L + L+ N+ G +L ++
Sbjct: 62 ADRE-GMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 118 FVHFSSVAAYGFEFPDGADETYPVHVNGDVYTDTKVNSEAVVLAAHGAGEIDVTVIRPGD 177
++ SS + YG V +Y TK +E + + T +R
Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFFT 180

Query: 178 VWGPGSVWVRSPIA------EMRKRTGFPLPNGGNGIFSPVYIDNFVDGMVLAV------ 225
V+GP W R +A M + + N G YID+ + ++
Sbjct: 181 VYGP---WGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHA 237

Query: 226 -----------SSDDAVGQIFNITDGKGVRCADFFGRMASMSDGTIHTLPIRVAAPAAEL 274
++ A +++NI + V D+ I L
Sbjct: 238 DTQWTVETGTPAASIAPYRVYNIGNSSPVELMDY-----------IQALE---------- 276

Query: 275 LGSLLRRLGQKTDLSAGTMWLLNRPGTY-SIEKAQKMLGYQPRVSIEEGMARVHEWARA 332
LG + + + + T + +++G+ P ++++G+ W R
Sbjct: 277 -----DALGIEAKKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRD 330


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3324  NUCEPIMERASE  100    3e-26    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 100 bits (250), Expect = 3e-26
Identities = 71/362 (19%), Positives = 124/362 (34%), Gaps = 82/362 (22%)

Query: 17 KIFITGANGFIGANLAARLRQLGARVTGVD----------------LVADPANGIIAGST 60
K +TGA GFIG +++ RL + G +V G+D L+A P
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 61 ADPAAWASALD--GVDAVVHLAALVSTVVAVETAW---DVNVLGTKKVIDAAVDAGVRRF 115
AD + V ++ ++E D N+ G +++ ++
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121

Query: 116 VHLSSIAAYGWDFPDHVTEDYPTRVTGGLSTYVDTKTNSELVALA-NANRGMEMVVVRPA 174
++ SS + YG + + D +S Y TK +EL+A + G+ +R
Sbjct: 122 LYASSSSVYGLNRKMPFSTDDSVDHP--VSLYAATKKANELMAHTYSHLYGLPATGLRFF 179

Query: 175 DVYGPGSVWIREPIAMAKANQLILPERGSGVF-------DVIYIDNFVDAMVLVL----- 222
VYGP W R +A+ K + +L + V+ D YID+ +A++ +
Sbjct: 180 TVYGP---WGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPH 236

Query: 223 ------------ATEGIAGEVFNLGEELAVSCQEYFGEVASWTG--AKVRSVPIRIG-AP 267
A V+N+G V +Y + G AK +P++ G
Sbjct: 237 ADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQPGDVL 296

Query: 268 ALGAIGRIQRRLGMSSELGPALLHMLNRRYVVSNDKARDRLGFKPVVSYHEGMARSKEWA 327
A + +GF P + +G+ W
Sbjct: 297 ETSA----------------------------DTKALYEVIGFTPETTVKDGVKNFVNWY 328

Query: 328 RH 329
R
Sbjct: 329 RD 330


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_3326  cloacin  36     5e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 36.2 bits (83), Expect = 5e-04
Identities = 28/92 (30%), Positives = 34/92 (36%)

Query: 206 AGGAGGYTTSGTGGAGGAGGTGGLLGGGGVGGAGGMAYSAGTTGGAGGAGGAGGVLSGLV 265
+GG G +G G G G G G + G +S+ GG+G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 266 GAGGGHGGTGGAGSGTGGAGGAGGPAGLLGGP 297
GG G G GSGTGG A G P
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFP 93



Score = 35.5 bits (81), Expect = 0.001
Identities = 33/81 (40%), Positives = 39/81 (48%), Gaps = 4/81 (4%)

Query: 268 GGGHGGTGGAGSGTGGAGGAGGPAGLLGGPGGAGGDGGYGDTGGAGGAGGAGGWLFGNGG 327
G G G GA S +G G GP GL G G + G G + GG G+G G G
Sbjct: 4 GDGRGHNTGAHSTSGNING--GPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 328 AGGTGGTSIGSTGGAGGTGGN 348
G GG G++GG GTGGN
Sbjct: 62 HGNGGGN--GNSGGGSGTGGN 80



Score = 35.1 bits (80), Expect = 0.001
Identities = 30/82 (36%), Positives = 34/82 (41%)

Query: 559 GGRGGTGGDGGAGGAGGWFSGDGGVGGIGGGSTAAGATGGTGGTGGAGGLFGAGGNGGAG 618
GG G G GGA GW S + GG G G G G GG G G G GG
Sbjct: 22 GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNL 81

Query: 619 GAGTYFISAFGGAGGTGGSGGL 640
A ++ A T G+GGL
Sbjct: 82 SAVAAPVAFGFPALSTPGAGGL 103



Score = 34.7 bits (79), Expect = 0.002
Identities = 29/104 (27%), Positives = 39/104 (37%)

Query: 261 LSGLVGAGGGHGGTGGAGSGTGGAGGAGGPAGLLGGPGGAGGDGGYGDTGGAGGAGGAGG 320
+SG G G G +G+ GG G G G G G + + +G G+G G G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 321 WLFGNGGAGGTGGTSIGSTGGAGGTGGNAGWLGNGGTGGTGGFS 364
GG G +GG S + A T G GG +
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 34.3 bits (78), Expect = 0.002
Identities = 33/102 (32%), Positives = 41/102 (40%), Gaps = 5/102 (4%)

Query: 661 NSGAGGAGG-AGGDAGLLGGPGGAGGTGGPGDFGTTHGGAGGAGGNAGLLFGSGGTGGTG 719
N+GA G G LG GGA G GG G+G + G G G GG G
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69

Query: 720 GYGASAGDGGHGGSAG----LLFSGAGAGGAGGVGLSGPGGA 757
G +G GG+ + F GAGG+ +S GA
Sbjct: 70 NSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 33.9 bits (77), Expect = 0.003
Identities = 26/82 (31%), Positives = 30/82 (36%), Gaps = 5/82 (6%)

Query: 331 TGGTSIGSTGGAGGTGGNAGWLGNGGTGGTGGFSRYGLGGD-----GGTGGNAGWLGNGG 385
+GG G GA T GN G G G G + GG+G W G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 386 TGGTGGFSGSGLGGDGGYGGTA 407
G GG SG G G +A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSA 83



Score = 33.9 bits (77), Expect = 0.003
Identities = 36/108 (33%), Positives = 42/108 (38%), Gaps = 7/108 (6%)

Query: 148 GDGGAGGSGAPGKAGGAGGAAGLWGSGGAGGAGGSTSSGVAGAGGAGGAGGWLLGTGGAG 207
GDG +GA +G G G GG G SS GG G+G G G G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 208 GAGGYTTSGTGGAGGAGGTGGLLGGGGVGGAGGMAYSAGTTGGAGGAG 255
G G G +GG GTGG V + A +T GAGG
Sbjct: 64 NGG-----GNGNSGGGSGTGG--NLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 33.9 bits (77), Expect = 0.003
Identities = 32/100 (32%), Positives = 37/100 (37%), Gaps = 1/100 (1%)

Query: 202 GTGGAGGAGGYTTSGTGGAGGAGGTGGLLGGGGVGGAGGMAYSAGTTGGAGGAGGAGGVL 261
G G GA + + GG G G GG G G + G+ G GG+G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENN-PWGGGSGSGIHWGGGSGHGN 64

Query: 262 SGLVGAGGGHGGTGGAGSGTGGAGGAGGPAGLLGGPGGAG 301
G G GG GTGG S G PA G GG
Sbjct: 65 GGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 33.9 bits (77), Expect = 0.003
Identities = 29/92 (31%), Positives = 37/92 (40%), Gaps = 5/92 (5%)

Query: 724 SAGDGGHGGSAGLLFSGAGAGGAGGVGLSGPGGAGGRGGDAGWLGDGGAGGTGGSSHYGD 783
S GDG + SG GG G+G+ G G +GW + G G S
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDG-----SGWSSENNPWGGGSGSGIHW 56

Query: 784 GGDGGAGGAAGQLSGGGGAGGQGGQGNIAGAV 815
GG G G G + GGG+G G +A V
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPV 88



Score = 33.9 bits (77), Expect = 0.003
Identities = 35/124 (28%), Positives = 46/124 (37%), Gaps = 3/124 (2%)

Query: 560 GRGGTGGDGGAGGAGGWFSGDGGVGGIGGGSTAAGATGGTGGTGGAGGLFGAGGNGGAGG 619
G G G + GA G +G G+GGG++ G G G GG+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 620 AGTYFISAFGGAGGTGGSGGLLSGLVGAGG---GHGGAGGTAISNSGAGGAGGAGGDAGL 676
GG GTGG+ ++ V G GAGG A+S S +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMAA 122

Query: 677 LGGP 680
L GP
Sbjct: 123 LKGP 126



Score = 33.1 bits (75), Expect = 0.005
Identities = 23/72 (31%), Positives = 30/72 (41%), Gaps = 3/72 (4%)

Query: 507 NGTPGATGSGADGTPGGWLLGDGGAGGSGLAGRD---GGAGGAAGLWGTGGTGGAGGRGG 563
N +T +G P G +G G + GSG + + GG G+ WG G G GG G
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69

Query: 564 TGGDGGAGGAGG 575
G G G
Sbjct: 70 NSGGGSGTGGNL 81



Score = 32.0 bits (72), Expect = 0.011
Identities = 33/102 (32%), Positives = 37/102 (36%), Gaps = 2/102 (1%)

Query: 298 GGAGGDGGYGDTGGAGGAGGAGGWLFGNGGAGGTGGTSIGSTGGAGGTGGNAGWLGNGGT 357
GG G G +G G L GGA G S + GG+G W G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 358 GGTGGFSRYGLGGDGGTGGNAGWLGNGGTGGTGGFSGSGLGG 399
G GG GG GTGGN + G S G GG
Sbjct: 63 GNGGGNG--NSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 31.6 bits (71), Expect = 0.014
Identities = 29/93 (31%), Positives = 38/93 (40%), Gaps = 3/93 (3%)

Query: 764 AGWLGDGGAGGTGGSSHYGDGGDGGAGGAAGQLSGGGGAGGQGGQGNIAGAVTRTGGDGG 823
+G G G G +S +GG G G G G G + G +G+ GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 824 AGGDAVLIGNGGNGGNGGTDGTGAPTGSPGAGG 856
G GNG +GG GT G + +P A G
Sbjct: 62 HGNGG---GNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 31.6 bits (71), Expect = 0.016
Identities = 33/113 (29%), Positives = 43/113 (38%), Gaps = 5/113 (4%)

Query: 231 GGGGVGGAGGMAYSAGT-TGGAGGAGGAGGVLSGLVGAGGGHGGTGGAGSGTGGAGGAGG 289
GG G G G ++G GG G G GG G + + GG+GSG GG+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 290 PAGLLGGPGGAGGDGGYGDTGGAGGAGGAGGWLFGNGGAGGTGGTSIGSTGGA 342
G G GG G G + F G GG ++ + GA
Sbjct: 63 G----NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 30.8 bits (69), Expect = 0.027
Identities = 24/77 (31%), Positives = 31/77 (40%), Gaps = 1/77 (1%)

Query: 782 GDGGDGGAGGAAGQLSGGGGAGGQGGQGNIAGAVTRTGGDGGAGGDAVLIGNGGNGGNGG 841
G G + GA +G ++GG G GG + + G GG I GG G+G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWG-GGSGSGIHWGGGSGHGN 64

Query: 842 TDGTGAPTGSPGAGGTG 858
G G G G GG
Sbjct: 65 GGGNGNSGGGSGTGGNL 81



Score = 30.5 bits (68), Expect = 0.037
Identities = 30/87 (34%), Positives = 35/87 (40%), Gaps = 6/87 (6%)

Query: 678 GGPGGAGGTGGPGDFGTTHGGAGGAGGNAGLLFGSGGTGGTGGYGASAGDGGHGGSAGLL 737
GG G TG G +GG G G G GSG + +G +G G H G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWG----- 57

Query: 738 FSGAGAGGAGGVGLSGPGGAGGRGGDA 764
G+G G GG G SG G G A
Sbjct: 58 -GGSGHGNGGGNGNSGGGSGTGGNLSA 83


40  MMAR_3374  MMAR_3381  Y  N  Y  Genomic Island
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_3374  2       21       -3.112645  hypothetical protein
MMAR_3375  3       22       -3.646631  hypothetical protein
MMAR_3376  1       25       -4.722349  6-pyruvoyl tetrahydrobiopterin synthase
MMAR_3377  2       22       -4.534137  hypothetical protein
MMAR_3378  1       20       -4.071433  PPE family protein
MMAR_3379  -1      13       -2.795236  PPE family protein
MMAR_3380  0       11       -2.818393  transposase ISMyma01_aa1-like protein
MMAR_3381  -1      12       -3.596170  hypothetical protein
41  MMAR_3442  MMAR_3447  Y  N  N  Genomic Island
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_3442  0       13       -3.245872  hypothetical protein
MMAR_3443  -1      12       -3.367387  PPE family protein
MMAR_3444  -1      12       -4.067843  PPE family protein
MMAR_3445  -1      15       -4.703448  CDP-diacylglycerol pyrophosphatase
MMAR_3446  -2      12       -4.056064  hypothetical protein
MMAR_3447  -2      14       -3.283081  hypothetical protein
42  MMAR_3461  MMAR_3477  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_3461  1       21       -3.108748  hypothetical protein
MMAR_3464  2       20       -2.588246  transposase
MMAR_3465  2       19       -2.623831  PPE family protein
MMAR_3466  2       18       -2.939513  PPE family protein
MMAR_3467  1       19       -2.556457  PE family protein
MMAR_3468  0       18       -1.238078  hypothetical protein
MMAR_3470  -1      13       -0.624628  short chain membrane-associated dehydrogenase
MMAR_3471  -1      11       -0.883868  hypothetical protein
MMAR_3472  -1      12       -0.822596  haloalkane dehalogenase
MMAR_3473  1       13       -0.645317  ketoacyl-acyl carrier protein synthase III
MMAR_3474  2       10       -0.814127  ketoacyl-acyl carrier protein synthase III
MMAR_3475  1       11       -0.741036  hypothetical protein
MMAR_3476  1       10       -0.403966  hypothetical protein
MMAR_3477  2       12       0.052303   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3470  DHBDHDRGNASE  100    1e-27    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.
Length = 261

Score = 100 bits (251), Expect = 1e-27
Identities = 70/253 (27%), Positives = 108/253 (42%), Gaps = 21/253 (8%)

Query: 12 VTGASSGIGLATVGRLLEQGCAVVGAD--------VVTPPHDLGPRFTFVTADVTDEDAV 63
+TGA+ GIG A L QG + D VV+ ADV D A+
Sbjct: 13 ITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSAAI 72

Query: 64 AGV---FDAVPDRLDGVVHSAGVAGGGPVHLLARAEWDRVIGVNLTGTFLVAKAALARMI 120
+ + +D +V+ AGV G +H L+ EW+ VN TG F +++ M+
Sbjct: 73 DEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSKYMM 132

Query: 121 EQPRVDGERGSIVTLASIEGLEGTAGGSSYNAAKGGVVLLTKNIALDYGPSGIRANAICP 180
++ GSIVT+ S ++Y ++K V+ TK + L+ IR N + P
Sbjct: 133 DR-----RSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187

Query: 181 GFIATPLAEGVFGMPGMEGPRASITNEH-----ALQRLGKPEEIAAMAAFLLSPDASFVT 235
G T + ++ + E L++L KP +IA FL+S A +T
Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHIT 247

Query: 236 GQAIAVDGGYTAG 248
+ VDGG T G
Sbjct: 248 MHNLCVDGGATLG 260


43  MMAR_3557  MMAR_3606  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_3557  2       14       -0.619062  Lrp/AsnC family transcriptional regulator
MMAR_3558  2       13       0.030269   hypothetical protein
MMAR_3559  1       14       -0.160115  hypothetical protein
MMAR_3560  1       15       1.145929   transcriptional regulatory protein
MMAR_3561  -1      16       -0.483110  short chain dehydrogenase
MMAR_3562  -2      15       -0.183369  hypothetical protein
MMAR_3563  -2      16       -0.338624  rifampin ADP-ribosyl transferase
MMAR_3564  0       14       1.535140   quinone reductase, Qor
MMAR_3565  1       13       1.821121   amidase
MMAR_3566  2       15       0.605621   membrane-associated phospholipase C2 PlcB
MMAR_3568  2       20       0.927274   hypothetical protein
MMAR_3569  1       19       0.473035   hypothetical protein
MMAR_3570  1       18       0.973146   PE-PGRS family protein
MMAR_3571  2       18       -1.473061  hypothetical protein
MMAR_3572  2       20       -1.959962  hypothetical protein
MMAR_3573  1       22       -1.143151  hypothetical protein
MMAR_3574  1       21       -0.881665  hypothetical protein
MMAR_3575  0       23       -1.124796  hypothetical protein
MMAR_3576  3       27       0.320056   hypothetical protein
MMAR_3577  4       32       -0.246262  hypothetical protein
MMAR_3578  5       32       0.308573   PPE family protein
MMAR_3579  6       30       0.244095   EsaT-6 like protein EsxK
MMAR_3580  7       30       0.616311   EsaT-6 like protein EsxN_6
MMAR_3581  7       27       -0.454238  PE-PGRS family protein
MMAR_3582  4       30       -6.042441  hypothetical protein
MMAR_3583  3       32       -7.092050  hypothetical protein
MMAR_3584  2       35       -8.043479  phage-related integrase
MMAR_3585  3       34       -8.905509  phage-related integrase
MMAR_3586  4       37       -9.275904  hypothetical protein
MMAR_3587  5       32       -6.671511  hypothetical protein
MMAR_3588  6       26       -4.129662  hypothetical protein
MMAR_3591  4       24       -2.250723  hypothetical protein
MMAR_3592  4       24       -5.568362  hypothetical protein
MMAR_3593  4       25       -6.941621  hypothetical protein
MMAR_3594  9       27       -5.534394  hypothetical protein
MMAR_3595  10      27       -5.627215  hypothetical protein
MMAR_3596  10      27       -5.215641  hypothetical protein
MMAR_3597  11      26       -5.198850  ATP-dependent OLD family endonuclease
MMAR_3598  11      26       -3.501256  helicase
MMAR_3599  12      27       -2.669041  chromosome partition ATPase protein
MMAR_3600  4       18       -1.006617  DNA repair exonuclease
MMAR_3601  3       17       -0.777101  hypothetical protein
MMAR_3602  1       19       -0.800324  hypothetical protein
MMAR_3603  1       21       -1.682015  killer suppression protein
MMAR_3604  0       20       -1.678899  plasmid maintenance system antidote protein
MMAR_3605  -1      20       -0.594578  ATP-dependent exoDNAse
MMAR_3606  0       18       -3.133948  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_3560  HTHTETR  38     5e-06    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 38.5 bits (89), Expect = 5e-06
Identities = 16/90 (17%), Positives = 35/90 (38%)

Query: 1 MTARERLIESAIELLRRNGVAGTGLADLLEHSGTARRSVYVNFPGGKSELMTEATRTAGR 60
R+ +++ A+ L + GV+ T L ++ + +G R ++Y +F +
Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESN 69

Query: 61 LMDSTLASIAAGGDQPLVAFADSWKQTLRA 90
+ + L A PL + L +
Sbjct: 70 IGELELEYQAKFPGDPLSVLREILIHVLES 99


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3561  DHBDHDRGNASE  109    9e-31    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.
Length = 261

Score = 109 bits (274), Expect = 9e-31
Identities = 60/193 (31%), Positives = 91/193 (47%), Gaps = 8/193 (4%)

Query: 7 VALITGVSSGIGAAIAVRLASAGFRVVGTSRAPQRLAPIPGV--------ETLALDVTDD 58
+A ITG + GIG A+A LAS G + P++L + E DV D
Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDS 69

Query: 59 TAVRSVVSEVIDRTGRIDVLVNNAGLGIAGAAEESSIDQARSLFDTNFFGLIRLTNEVLP 118
A+ + + + G ID+LVN AG+ G S ++ + F N G+ + V
Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSK 129

Query: 119 HMRRRGSGRIINISSVLGFLPAPYAALYAASKHAVEGYTESLDHELREYGVRALLVEPSY 178
+M R SG I+ + S +P A YA+SK A +T+ L EL EY +R +V P
Sbjct: 130 YMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGS 189

Query: 179 TRTDFESNMWEAD 191
T TD + ++W +
Sbjct: 190 TETDMQWSLWADE 202


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
MMAR_3568  PERTACTIN  38     7e-05    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 37.8 bits (87), Expect = 7e-05
Identities = 22/44 (50%), Positives = 23/44 (52%)

Query: 293 PPRAPAPTPQPQLRPPRPQAPTPPPPQDVPPPQQLEPPVVAPQP 336
P PAP P PQ P PQ P PP P P P Q +P APQP
Sbjct: 569 PAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQP 612



Score = 35.8 bits (82), Expect = 2e-04
Identities = 18/43 (41%), Positives = 20/43 (46%)

Query: 294 PRAPAPTPQPQLRPPRPQAPTPPPPQDVPPPQQLEPPVVAPQP 336
P AP P PQP +P P PPQ PPQ + AP P
Sbjct: 568 PPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAP 610



Score = 33.9 bits (77), Expect = 9e-04
Identities = 24/58 (41%), Positives = 27/58 (46%), Gaps = 4/58 (6%)

Query: 280 LPAQPPPPAHILIPPRAPAPTPQPQLRPPRPQAPTPPPPQDVPPPQQLEPPVVAPQPG 337
L PPA P AP P PQP +PP+P P PP PP +Q E P P G
Sbjct: 562 LVGAKAPPA----PKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAG 615



Score = 33.2 bits (75), Expect = 0.002
Identities = 22/65 (33%), Positives = 26/65 (40%), Gaps = 1/65 (1%)

Query: 258 ADPAPLVKAVPAGETVAPMP-PQLPAQPPPPAHILIPPRAPAPTPQPQLRPPRPQAPTPP 316
A+ V A AP P PQ QP P P P PQP R P AP PP
Sbjct: 554 ANGNGQWSLVGAKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPP 613

Query: 317 PPQDV 321
+++
Sbjct: 614 AGREL 618



Score = 29.7 bits (66), Expect = 0.019
Identities = 17/53 (32%), Positives = 20/53 (37%), Gaps = 1/53 (1%)

Query: 274 APMPPQLPAQPPPPAHILIPPRAPAPTPQPQLRPPRPQAPTPPPPQDVPPPQQ 326
A PP P P P P PQP +PP+P P P PP +
Sbjct: 565 AKAPPAPKPAPQPGPQPGPQPPQPPQPPQPP-QPPQPPQRQPEAPAPQPPAGR 616


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_3570  cloacin  49     7e-08    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 48.9 bits (116), Expect = 7e-08
Identities = 41/119 (34%), Positives = 50/119 (42%), Gaps = 5/119 (4%)

Query: 417 SGTGGSGGTGGTGGSSFIGSGGTGGAGGAGGTATGSGASGAGGAGGSGGYNGFIGAGGTG 476
SG G G G +S +GG G G GG + GSG S G GG I GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWG-GGSGSGIHWGGGS 60

Query: 477 GAGGDGGGGNQGAAAGDGGSGGTAAGL----FGAGGTGGTGGTGGTTTANGGTGGAAGV 531
G G GG GN G +G GG+ A F A T G GG + +A + A +
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADI 119



Score = 43.9 bits (103), Expect = 2e-06
Identities = 30/110 (27%), Positives = 40/110 (36%)

Query: 348 GNGGNGGVGGNASLVGNGGAGGAGGAGGTGSSLSYGGHGGNGGNGGTDGLLFGKGGAGGD 407
G G N G + + G G G G + S + GG G+ G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 408 GGAGGGTTSSGTGGSGGTGGTGGSSFIGSGGTGGAGGAGGTATGSGASGA 457
GG G SGTGG+ + + T GAGG + + S A
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 43.5 bits (102), Expect = 3e-06
Identities = 32/102 (31%), Positives = 39/102 (38%)

Query: 363 GNGGAGGAGGAGGTGSSLSYGGHGGNGGNGGTDGLLFGKGGAGGDGGAGGGTTSSGTGGS 422
G G G GA T +++ G G G G +DG + GG+G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 423 GGTGGTGGSSFIGSGGTGGAGGAGGTATGSGASGAGGAGGSG 464
G GG G S G + A A G A GAGG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 41.2 bits (96), Expect = 2e-05
Identities = 30/93 (32%), Positives = 35/93 (37%), Gaps = 9/93 (9%)

Query: 162 GNGGAGGNGGDYLVGG---GGAGGAGGNGGVLYGNGGTGGAGGSGTTSYGAAGVGGNALL 218
G G G N G + G GG G G G G + G+G S + G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGG------GASDGSGWSSENNPWGGGSGSGIHW 56

Query: 219 FGNGGTGGSGLGGGGNGGNAVFGNGGAGGAGVA 251
G G G G G GG+ GN A A VA
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVA 89



Score = 40.1 bits (93), Expect = 3e-05
Identities = 35/116 (30%), Positives = 50/116 (43%), Gaps = 14/116 (12%)

Query: 452 SGASGAGGAGGSGGYNGFIGAGGTGGAGGDGGGGNQGAAAGD---GGSGGTAAGLFGAGG 508
SG G G G+ +G I G TG G G G ++ + GG G+ G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 509 TGGTGGTGGTTTANGGTGGAAGVGGSGGDASSLIALG----GTGGVGGVGGAATGG 560
G GG NG +GG +G GG+ ++ +A G T G GG+ + + G
Sbjct: 62 HGNGGG-------NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 38.9 bits (90), Expect = 9e-05
Identities = 33/117 (28%), Positives = 47/117 (40%)

Query: 137 GNGGNGYSQTDGGLAGGKGGNAGLIGNGGAGGNGGDYLVGGGGAGGAGGNGGVLYGNGGT 196
G G G++ +G G +G GG +G + GG G+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 197 GGAGGSGTTSYGAAGVGGNALLFGNGGTGGSGLGGGGNGGNAVFGNGGAGGAGVAGI 253
G GG+G + G+ G + + G L G GG AV + GA A +A I
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADI 119



Score = 38.9 bits (90), Expect = 9e-05
Identities = 36/120 (30%), Positives = 41/120 (34%), Gaps = 21/120 (17%)

Query: 478 AGGDGGGGNQGAAAGDGGSGGTAAGLFGAGGTGGTGGTGGTTTANGGTGGAAGVGGSGGD 537
+GGDG G N GA + G G GL GG G GG G+
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGS--------- 52

Query: 538 ASSLIALGGTGGVGGVGGAATGGGATIGADGGTGGNGGNATGLLNFG----GSGGAGGAG 593
G GG G GG G GTGGN + FG + GAGG
Sbjct: 53 --------GIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 37.8 bits (87), Expect = 2e-04
Identities = 36/118 (30%), Positives = 42/118 (35%), Gaps = 8/118 (6%)

Query: 384 GHGGNGGNGGTDGLLFGKGGAGGDGGAGGGTTSSGTGGSGGTGGTGGSSFIGSGGTGGAG 443
G G N G T G + GG G G GG + SG G G S I GG G G
Sbjct: 6 GRGHNTGAHSTSGNI--NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 444 GAGGTATGSGASGAGGAGGSGGYNGFIGAGGTGGAGGDGGGGNQGAAAGDGGSGGTAA 501
GG +GG G+GG + A G G G A +AA
Sbjct: 64 NGGG------NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 37.4 bits (86), Expect = 3e-04
Identities = 36/126 (28%), Positives = 43/126 (34%), Gaps = 7/126 (5%)

Query: 435 GSGGTGGAGGAGGTATGSGASGAGGAGGSGGYNGFIGAGGTGGAGGDGGGGNQGAAAGDG 494
G G GA G G G G G GG + G G G G G G
Sbjct: 6 GRGHNTGAHSTSGNING----GPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 495 GSGGTAAGLFGAGGTGGTGGTGGTTTANGGTG-GAAGVGGSGGDASSLIALGGTGGVGGV 553
G G +GG GTGG A G A G+GG A S+ A + + +
Sbjct: 62 HGNGGGNG--NSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADI 119

Query: 554 GGAATG 559
A G
Sbjct: 120 MAALKG 125



Score = 37.0 bits (85), Expect = 4e-04
Identities = 37/135 (27%), Positives = 50/135 (37%), Gaps = 8/135 (5%)

Query: 506 AGGTGGTGGTGGTTTANGGTGGAAGVGGSGGDASSLIALGGTGGVGGVGGAATGGGATIG 565
+GG G TG +T+ GG G+G GG + G G +G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDG----SGWSSENNPWGGGSGSGIHWG 57

Query: 566 ADGGTGGNGGNATGLLNFGGSGGAGGAGGAGATTGVQGAGGMGGTPGGQPGAAAVSALLP 625
G G GGN N GG G GG A A G + G + + L
Sbjct: 58 GGSGHGNGGGNG----NSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALS 113

Query: 626 GGLASSTAAVGGAYQ 640
+A AA+ G ++
Sbjct: 114 AAIADIMAALKGPFK 128



Score = 36.6 bits (84), Expect = 4e-04
Identities = 32/107 (29%), Positives = 39/107 (36%), Gaps = 11/107 (10%)

Query: 119 GIDGTATNPNGGDGGLLYGNGGN---GYSQTDGGLAGGKGGNAGLIGNGGAGGNGGDYLV 175
G T+ N NGG GL G G + G+S + GG G G G G G
Sbjct: 12 GAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGG----- 66

Query: 176 GGGGAGGAGGNGGVLYGNGGTGGAGGSGTTSYGAAGVGGNALLFGNG 222
G G +GG G GG G + G GG A+ G
Sbjct: 67 GNGNSGGGSGTGG---NLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 35.1 bits (80), Expect = 0.001
Identities = 28/82 (34%), Positives = 33/82 (40%)

Query: 208 GAAGVGGNALLFGNGGTGGSGLGGGGNGGNAVFGNGGAGGAGVAGIGGNGGNTFIGTGGA 267
G G G N G G G G GG A G+G + G G G + G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 268 GGAGGGTILGGGSHAGGDGGSV 289
G GG GGGS GG+ +V
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAV 84



Score = 33.5 bits (76), Expect = 0.004
Identities = 28/101 (27%), Positives = 39/101 (38%), Gaps = 4/101 (3%)

Query: 544 LGGTGGVGGVGGAATGGGATIGADGGTGGNGGNATGLLNFGGSGGAGGAGGAGATTGVQG 603
+ G G G GA + G G G G GG + G G G G+ +G+
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDG----SGWSSENNPWGGGSGSGIHW 56

Query: 604 AGGMGGTPGGQPGAAAVSALLPGGLASSTAAVGGAYQDLFT 644
GG G GG G + + G L++ A V + L T
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALST 97



Score = 33.1 bits (75), Expect = 0.005
Identities = 29/92 (31%), Positives = 33/92 (35%), Gaps = 10/92 (10%)

Query: 242 NGGAGGAGVAGIGGNGGNTFIGTGGAGGAGGGTILGGGSHAGGDGGSVGLWGSGGVGGAG 301
+GG G G GN G G G GG S G WG GG+G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVG------GGASDGSGWSSENNPWG----GGSG 51

Query: 302 TTPDADSGYAGGDGGNGGNGGLLYGDGGAGGA 333
+ G G+GG GN G G GG A
Sbjct: 52 SGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSA 83



Score = 31.6 bits (71), Expect = 0.017
Identities = 26/75 (34%), Positives = 32/75 (42%), Gaps = 2/75 (2%)

Query: 309 GYAGGDGGNGGNGGLLYGDGGAGGAGGLGSDYSSLFYNAGNGGNGGV--GGNASLVGNGG 366
G+ G GN G GG GS +SS G G G+ GG + GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 367 AGGAGGAGGTGSSLS 381
G +GG GTG +LS
Sbjct: 68 NGNSGGGSGTGGNLS 82



Score = 31.6 bits (71), Expect = 0.018
Identities = 26/89 (29%), Positives = 33/89 (37%), Gaps = 6/89 (6%)

Query: 293 GSGGVGGAGTTPDADSGYAGGDGGNGGNGGLLYGDGGAGGAGGLGSDYSSLFYNAGNGGN 352
G G G GG G G GG G G + G S + G G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 353 GGVGGNASLVGNGGAGGAGGAGGTGSSLS 381
G G GNG +GG G GG S+++
Sbjct: 63 GNGG------GNGNSGGGSGTGGNLSAVA 85


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_3581  cloacin  41     2e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 40.9 bits (95), Expect = 2e-05
Identities = 37/94 (39%), Positives = 46/94 (48%), Gaps = 13/94 (13%)

Query: 434 MNGGDGGNGGTGLGVGGAGGAGGTGNINGGDGGNGGNGGKALIIGAAGGAGGTGDINGGG 493
M+GGDG TG +GNINGG G G GG A+ G+G + + N G
Sbjct: 1 MSGGDGRGHNTG-------AHSTSGNINGGPTGLGVGGG------ASDGSGWSSENNPWG 47

Query: 494 GGNGGNSLGVGGAGGAGGEGSTEDGGAGGTGGNG 527
GG+G GG+G G G+ GG GTGGN
Sbjct: 48 GGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNL 81



Score = 39.7 bits (92), Expect = 4e-05
Identities = 34/97 (35%), Positives = 40/97 (41%), Gaps = 19/97 (19%)

Query: 509 AGGEGSTEDGGAGGTGGNGVVTGGAGGAGGNGDINGGVGGNGGTGGFAAGTGGAGGNGDN 568
+GG+G + GA T GN INGG G G GG + G+G + N
Sbjct: 2 SGGDGRGHNTGAHSTSGN---------------INGGPTGLGVGGGASDGSGWSSENNPW 46

Query: 569 IGGDGGNGGDALMIGGAGGAGGNGDFNGSGGHGGTGG 605
GG G GG G G G SGG GTGG
Sbjct: 47 GGGSGSGIHW----GGGSGHGNGGGNGNSGGGSGTGG 79



Score = 37.8 bits (87), Expect = 1e-04
Identities = 34/108 (31%), Positives = 38/108 (35%), Gaps = 3/108 (2%)

Query: 334 GNGGHGGNGGDGGTTGIARGDNGGDGGNGGNGGWFGTAGNGGIGGDGIGGGYGGDGGGGL 393
G G G N G T+G G G G GG G + G G G G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 394 TIGGAGGAGGEGSMGDGGIGGDG---GFGGFAAGSGGAGGDGAMNGGD 438
GG G G GS G + FG A + GAGG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 37.8 bits (87), Expect = 2e-04
Identities = 39/110 (35%), Positives = 47/110 (42%), Gaps = 16/110 (14%)

Query: 480 AGGAGGTGDINGGGGGNGGNSLGVGGAGGAGGEGSTEDGGAGGTGGNGVVTGGAGGAGGN 539
G +G+INGG G LGVGG G S+E+ GG G+G+ GG G
Sbjct: 11 TGAHSTSGNINGGPTG-----LGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG---- 61

Query: 540 GDINGGVGGNGGTGGFAAGTGGAGGNGDNIGGDGGNGGDALMIGGAGGAG 589
GNGG G + G G GGN + G AL GAGG
Sbjct: 62 -------HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 37.4 bits (86), Expect = 2e-04
Identities = 35/102 (34%), Positives = 42/102 (41%), Gaps = 3/102 (2%)

Query: 357 GDGGNGGNGGWFGTAGNGGIGGDGIGGGYGGDGGGGLTIGGAGGAGGEGS---MGDGGIG 413
G G G N G T+GN G G+G G G G G + GG GS G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 414 GDGGFGGFAAGSGGAGGDGAMNGGDGGNGGTGLGVGGAGGAG 455
G+GG G + G G GG+ + G L GAGG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 36.6 bits (84), Expect = 3e-04
Identities = 32/80 (40%), Positives = 39/80 (48%), Gaps = 4/80 (5%)

Query: 315 SGWFGNGGNGGDGGTSGWF--GNGGHGGNGGDGGTTGIARGDNGGDGGNGGNGGWFGTAG 372
SG G G N G TSG G G G GG +G + +N GG+G W G +G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 373 NGGIGGDGIGGGYGGDGGGG 392
+G GG+G GG G G GG
Sbjct: 62 HGNGGGNGNSGG--GSGTGG 79



Score = 35.1 bits (80), Expect = 0.001
Identities = 28/84 (33%), Positives = 31/84 (36%), Gaps = 5/84 (5%)

Query: 279 GNGGGAGDTGLFGHGGNGGNARGLFGHGGNGGDGGTSGWFGNGGNGGDGGTSGWFGNGGH 338
G G +TG GN G GG DG N GG G W G GH
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 339 GGNGGDGGTTGIARGDNGGDGGNG 362
G GG+G + G G GGN
Sbjct: 63 GNGGGNGNS-----GGGSGTGGNL 81



Score = 35.1 bits (80), Expect = 0.001
Identities = 31/115 (26%), Positives = 43/115 (37%)

Query: 395 IGGAGGAGGEGSMGDGGIGGDGGFGGFAAGSGGAGGDGAMNGGDGGNGGTGLGVGGAGGA 454
+ G G G +GG G G G + G G + + GG+G G+ GG+
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 455 GGTGNINGGDGGNGGNGGKALIIGAAGGAGGTGDINGGGGGNGGNSLGVGGAGGA 509
G G+ G G G L AA A G ++ G G S+ G A
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 34.3 bits (78), Expect = 0.002
Identities = 31/103 (30%), Positives = 42/103 (40%)

Query: 422 AAGSGGAGGDGAMNGGDGGNGGTGLGVGGAGGAGGTGNINGGDGGNGGNGGKALIIGAAG 481
+ G G GA + NGG G G + G+G + + GG+G G +G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 482 GAGGTGDINGGGGGNGGNSLGVGGAGGAGGEGSTEDGGAGGTG 524
G G+ N GGG G +L A A G + GAGG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 33.5 bits (76), Expect = 0.003
Identities = 31/83 (37%), Positives = 32/83 (38%), Gaps = 2/83 (2%)

Query: 161 GAGAVGGAGGAAGLFGNGGNGGTGGWSGHGGAGGRGGWLIGNGGTGGNGGLSGNGAAGGT 220
G G GA GN NGG G GGA GW N GG G SG GG+
Sbjct: 3 GGDGRGHNTGAHSTSGNI-NGGPTGLGVGGGASDGSGWSSENNPWGGGSG-SGIHWGGGS 60

Query: 221 GGWLAGNGGNGGTGVLGGNGGDA 243
G G GN G G G A
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSA 83



Score = 33.5 bits (76), Expect = 0.003
Identities = 37/108 (34%), Positives = 40/108 (37%), Gaps = 6/108 (5%)

Query: 377 GGDGIGGGYGGDGGGGLTIGGAGGAGGEGSMGDGGIGGDGGFGGFAAGSGGAGGDGAMNG 436
GGDG G G G GG G G G DG G+ GG G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGS-----GWSSENNPWGGGSGSGIHWG 57

Query: 437 GDGGNGGTGLGVGGAGGAGGTGNINGGDGGNGGNGGKALIIGAAGGAG 484
G G+G G G G +GG GTG G AL AGG
Sbjct: 58 GGSGHGNGG-GNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.0 bits (72), Expect = 0.008
Identities = 29/81 (35%), Positives = 34/81 (41%), Gaps = 1/81 (1%)

Query: 176 GNGGNGGTGGWSGHGGAGGRGGWLIGNGGTGGNGGLSGNGAAGGTGGWLAGNGGNGGTGV 235
G G N G SG+ GG G +G G + G+G S N GG G GG G G
Sbjct: 6 GRGHNTGAHSTSGNIN-GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGN 64

Query: 236 LGGNGGDARGLFGHGGNAGGA 256
GGNG G G + A
Sbjct: 65 GGGNGNSGGGSGTGGNLSAVA 85



Score = 30.8 bits (69), Expect = 0.018
Identities = 28/96 (29%), Positives = 39/96 (40%), Gaps = 6/96 (6%)

Query: 239 NGGDARGLFGHGGNAGGANDYRLFDTGGNGGHSGRLYGNGGNGGGAGDTGLFGHGGNGGN 298
+GGD RG + G + GG G + G+G + + G G+G +
Sbjct: 2 SGGDGRGHNTGAHSTSGNIN------GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIH 55

Query: 299 ARGLFGHGGNGGDGGTSGWFGNGGNGGDGGTSGWFG 334
G GHG GG+G + G G GGN FG
Sbjct: 56 WGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 30.5 bits (68), Expect = 0.024
Identities = 25/76 (32%), Positives = 30/76 (39%), Gaps = 2/76 (2%)

Query: 147 GNGGNGAAGAIEQHGAGAVGGAGGAAGLFGNGGNGGTGGWSGHGGAGGRGGWLIGNGGTG 206
G G G GG G G G + G+G S + GG G I GG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGV--GGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 207 GNGGLSGNGAAGGTGG 222
G+G GNG +GG G
Sbjct: 61 GHGNGGGNGNSGGGSG 76


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_3584  MICOLLPTASE  30     0.006    Microbial collagenase metalloprotease (M9) signature.
		>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature.

Length = 1104

Score = 29.7 bits (66), Expect = 0.006
Identities = 12/40 (30%), Positives = 17/40 (42%), Gaps = 1/40 (2%)

Query: 74 GLKRRPTGATALSADGLHDLLVRLRADYYCQ-RNDLVDPI 112
GL+ TA G+ L+ LRA YY N + +
Sbjct: 144 GLEDSGRTYTADDDKGIPTLVEFLRAGYYLGFYNKQLSYL 183


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
MMAR_3599  RTXTOXIND  39     1e-04    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 38.7 bits (90), Expect = 1e-04
Identities = 33/181 (18%), Positives = 57/181 (31%), Gaps = 10/181 (5%)

Query: 270 VLEELDKRAVGLDRDERKLQRSDERASAAIERAEAAGESEATLARAQTVLSRLLSDLPEL 329
VL +L A+G + D K Q S +A R + S + L E
Sbjct: 123 VLLKLT--ALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLP------DEP 174

Query: 330 AAEKTRLEEERQALELEAKQLVSDEESLQLLRAAALEKERALKTIEKEVLAASNALQKAS 389
+ EE + L +Q + + +K T+ + N +
Sbjct: 175 YFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEK 234

Query: 390 ERLEEYRRLALEERTAAKTLAALEVEQQGAEKNLAKARAAEHQARSEAETAAAALAHIQR 449
RL+++ L + A A LE E + E + E+E +A +
Sbjct: 235 SRLDDFSSLL--HKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLV 292

Query: 450 K 450

Sbjct: 293 T 293


44  MMAR_3630  MMAR_3635  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_3630   0      12       -3.067478  membrane protein
MMAR_3631   0      13       -3.229837  acyl-CoA dehydrogenase
MMAR_3632   0      16       -3.716744  tryptophan halogenase
MMAR_5573  -1      16       -3.078799  hypothetical protein
MMAR_3633  -1      15       -3.066359  peptide synthetase Nrp (peptide synthase)
MMAR_3634  -1      13       -3.007362  oxidase
MMAR_3635  -1      12       -3.013865  integral membrane ion antiporter
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3630  SECYTRNLCASE  29     0.017    Preprotein translocase SecY subunit signature.
		>SECYTRNLCASE#Preprotein translocase SecY subunit signature.

Length = 437

Score = 29.0 bits (65), Expect = 0.017
Identities = 11/40 (27%), Positives = 18/40 (45%), Gaps = 1/40 (2%)

Query: 175 AVMPLIPYLLGFG-SLSAGLIFGGAGLLIAGGVTARFTRK 213
++ L+P + G S FGG +LI GV ++
Sbjct: 383 GLIALVPTMALVGFGASQNFPFGGTSILIIVGVGLETVKQ 422


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_3631  PF05704  31     0.012    Capsular polysaccharide synthesis protein
		>PF05704#Capsular polysaccharide synthesis protein

Length = 307

Score = 31.0 bits (70), Expect = 0.012
Identities = 11/49 (22%), Positives = 16/49 (32%), Gaps = 5/49 (10%)

Query: 237 VDNGRITFDHVR-IPRVNLLNRYGDVATDGT----YSSPIDNPNRRFFT 280
G++ I R+ LL +YG + D T P F
Sbjct: 124 WQEGKMLDAWFSDILRLFLLCKYGGLWIDATVYMFDKVPNYIVESNRFM 172


45  MMAR_3653  MMAR_3677  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_3653  -1      12       -3.697927  EsaT-6 like protein EsxN_3
MMAR_3654   0      14       -3.802029  EsaT-6 like protein EsxP_2
MMAR_3655   0      16       -3.968592  hypothetical protein
MMAR_3656   2      25       -2.931689  membrane-associated phospholipase C2 PlcB
MMAR_3657   5      33       -2.648731  membrane-associated phospholipase C2 PlcB
MMAR_3658   5      47       -2.071758  hypothetical protein
MMAR_3659   7      67       -0.667883  EsaT-6 like protein EsxN_4
MMAR_3660   7      53       -1.986563  EsaT-6 like protein EsxP_3
MMAR_3661   7      44       -2.075608  PPE family protein
MMAR_3662   4      31       -3.106276  EsaT-6 like protein EsxN_5
MMAR_3663   4      27       -2.893213  EsaT-6 like protein EsxP_4
MMAR_3664   4      25       -2.683990  PPE family protein
MMAR_3665   4      16       -3.374991  PPE family protein
MMAR_3666   2      11       -2.595755  PPE family protein
MMAR_3667   0      12       -1.648092  glycyl-tRNA synthetase
MMAR_3668  -3      12        0.523491  ArsR family transcriptional regulator
MMAR_3669  -2      11       -0.237726  Zinc uptake regulation protein Zur
MMAR_3670  -3      11       -0.143618  hypothetical protein
MMAR_3671  -3      10       -0.507064  undecaprenyl pyrophosphate synthase
MMAR_3672  -1      10       -0.410453  DNA repair protein RecO
MMAR_3673   0      10       -0.597488  amidase
MMAR_3674   2      12       -1.408100  GTP-binding protein Era
MMAR_3675   2      11        0.106440  hypothetical protein
MMAR_3676   2      10       -0.022196  hypothetical protein
MMAR_3677   2      11        1.258615  putative metalloprotease
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_3665  cloacin  33     0.004    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 32.8 bits (74), Expect = 0.004
Identities = 27/79 (34%), Positives = 32/79 (40%)

Query: 307 GVGNIGNTNIGSGNIGDTNFGSGNIGSLNFGSGNSGSNNIGFGNFGSSNFGFGNTGNNNI 366
G G+ + SGNI G G G + GSG S NN G GS G +G+ N
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 367 GFGLTGDGQFGIGGLNSGS 385
G G G GG S
Sbjct: 66 GGNGNSGGGSGTGGNLSAV 84



Score = 32.0 bits (72), Expect = 0.009
Identities = 23/92 (25%), Positives = 38/92 (41%), Gaps = 11/92 (11%)

Query: 472 NSGDVNT-GFLNAGNINSGFGNAGNVNTGWANAGDINTGGFNGGVLNTGFFSATTHAGPN 530
N+G +T G +N G G G + +GW++ + GG G+ H G
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGI----------HWGGG 59

Query: 531 SGYFNVGTGNSGFGHNDPSGSGNSGLQNSGFG 562
SG+ N G + G + G+ ++ FG
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 30.1 bits (67), Expect = 0.030
Identities = 28/105 (26%), Positives = 36/105 (34%), Gaps = 6/105 (5%)

Query: 297 GSGNIGSINFGVGNIGNTNIGSGNIGDTNFGSGNIGSLNFGSGNSGSNNIGFGNFGSSNF 356
G G+ + GNI G G G + GSG N G SGS G G N
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 357 GFGNTGNNNIGFGLTGDG-----QFGIGGLNS-GSGNIGLFNSGD 395
G G G FG L++ G+G + + S
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_3666  cloacin  33     0.003    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 33.1 bits (75), Expect = 0.003
Identities = 26/77 (33%), Positives = 33/77 (42%)

Query: 281 TGNTGSGNAGDYNFGSGNFGSGNLGSGNIGSLNLGSGNFGTLNLFGGNNGDLGLGSGNTG 340
+G G G+ + SGN G G G G + GSG N +GG +G G +G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 341 DVNVGSGNNGGGNIGFG 357
N G N GG G G
Sbjct: 62 HGNGGGNGNSGGGSGTG 78



Score = 31.6 bits (71), Expect = 0.011
Identities = 29/100 (29%), Positives = 38/100 (38%)

Query: 235 GIGNVGTFNLGSGNLGNTNLGSGNVGNQNVGSGNYGSGNWGGGNTGTGNTGSGNAGDYNF 294
G G+ + SGN+ G G G + GSG N GG +G+G G +G N
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 295 GSGNFGSGNLGSGNIGSLNLGSGNFGTLNLFGGNNGDLGL 334
G G G+G S FG L G L +
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAV 105



Score = 31.6 bits (71), Expect = 0.012
Identities = 30/86 (34%), Positives = 35/86 (40%), Gaps = 7/86 (8%)

Query: 256 SGNVGNQNVGSGNYGSGNWGGGNTGTGNTGSGNAGDYNFGSGNFGSGNLGSGNIGSLNLG 315
SG G + + SGN GG TG G G + G N G GSG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 316 SGNFGTLNLFGGNNGDLGLGSGNTGD 341
GN GG NG+ G GSG G+
Sbjct: 62 HGN-------GGGNGNSGGGSGTGGN 80



Score = 31.6 bits (71), Expect = 0.012
Identities = 26/69 (37%), Positives = 31/69 (44%), Gaps = 1/69 (1%)

Query: 327 GNNGDLGLGSGNTGDVNVGSGNNGGGNIGFGNLGDNNIGFGNKGSGNIGFGLTGDGQFGI 386
G+N SGN G G GG + G G +NN G GSG G +G G G
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 387 GGFNSGTGN 395
G NSG G+
Sbjct: 68 NG-NSGGGS 75


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3669  ARGREPRESSOR  26     0.037    Bacterial arginine repressor signature.
		>ARGREPRESSOR#Bacterial arginine repressor signature.

Length = 149

Score = 26.0 bits (57), Expect = 0.037
Identities = 15/67 (22%), Positives = 30/67 (44%), Gaps = 5/67 (7%)

Query: 6 VRSTRQRAAISALLETVDDFRSAQELHDELRRRGENIGLTTVYRTLQSMASAGTVDTLRT 65
+ ++ I ++ T ++ + EL D L++ G N+ TV R ++ + V T
Sbjct: 1 MNKGQRHIKIREII-TANEIETQDELVDILKKDGYNVTQATVSRDIKEL---HLVKV-PT 55

Query: 66 DTGESVY 72
+ G Y
Sbjct: 56 NNGSYKY 62


46  MMAR_3758  MMAR_3800  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_3758   6      17        5.201651  PE-PGRS family protein
MMAR_3759   3      16        3.222174  NAD synthetase
MMAR_3760   4      17        3.862130  Sir2-like regulatory protein
MMAR_3761   5      18        4.292575  cytochrome P450 268A2 Cyp268A2
MMAR_3762   6      19        5.021026  AcrR family transcriptional regulator
MMAR_3763   4      17        4.246973  PE-PGRS family protein
MMAR_3764   4      12        0.439337  gamma-glutamyl kinase
MMAR_3765   5      13        0.404270  GTPase ObgE
MMAR_3766   4      12       -0.069510  50S ribosomal protein L27
MMAR_3767   3      10       -0.541744  50S ribosomal protein L21
MMAR_3768   2      10       -0.186564  ribonuclease E Rne
MMAR_3769  -1      10       -0.659144  nucleoside diphosphate kinase
MMAR_3770  -1       9       -0.179819  hypothetical protein
MMAR_3771  -2       8        0.495587  folylpolyglutamate synthase protein FolC
MMAR_3772  -2       8        0.056775  valyl-tRNA synthetase
MMAR_3773   1      28       -4.421241  hypothetical protein
MMAR_3774   1      33       -5.583651  hypothetical protein
MMAR_3775   0      32       -5.475829  hypothetical protein
MMAR_3776   0      32       -5.863312  resuscitation-promoting factor RpfE
MMAR_3778   1      34       -6.537611  flavin-dependent oxidoreductase
MMAR_3779   0      32       -6.465480  non-ribosomal peptide synthetase
MMAR_3780   0      21       -5.278962  monooxygenase
MMAR_3782   0      16       -3.331799  transposase
MMAR_3784   0      18       -3.152985  transposase, ISMyma01_aa2-like protein
MMAR_3785  -1      17       -3.441611  transposase, ISMyma01_aa1-like protein
MMAR_3786   0      21       -4.217304  transposase
MMAR_3787   0      22       -4.027541  hypothetical protein
MMAR_3790  -1      30       -4.437547  isochorismatase family protein
MMAR_3792   0      29       -3.336884  transposase
MMAR_3793   2      30       -4.209630  transposase
MMAR_3794   2      31       -4.542491  transcriptional regulatory protein
MMAR_3795   2      31       -4.726829  adenylate cyclase
MMAR_3796   2      31       -4.660141  type I modular polyketide synthase
MMAR_3797   2      29       -4.598737  type I modular polyketide synthase
MMAR_3798   2      28       -4.874447  type I modular polyketide synthase
MMAR_3799   1      22       -4.601153  type I modular polyketide synthase
MMAR_3800   0      10       -3.566917  beta-ketoacyl synthase-like protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_3758  cloacin  39     1e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.5 bits (89), Expect = 1e-04
Identities = 35/105 (33%), Positives = 38/105 (36%), Gaps = 3/105 (2%)

Query: 484 GGGGRGGTGGAGGIGGTTTGGGDAGAGGGGGAGGTGGSGGEGGAGGFGGDGGAGGTGGKA 543
GG GRG GA G GG GGG + G+G S GG G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 544 GAGGDGGNATDGGNAGAGGDGGTGGAGGTGGGIVTGINGGAGGAG 588
G GG GN G G + A G GAGG
Sbjct: 63 GNGGGNGN---SGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 38.5 bits (89), Expect = 1e-04
Identities = 33/101 (32%), Positives = 40/101 (39%)

Query: 440 TGGDGGKGGGGGNATDGGHGGNGGTGGTAGDGGNGQSGDLNANGGGGGRGGTGGAGGIGG 499
+GGDG G ++T G G G G +G N GGG G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 500 TTTGGGDAGAGGGGGAGGTGGSGGEGGAGGFGGDGGAGGTG 540
GGG+ +GGG G GG + A GF G G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 37.8 bits (87), Expect = 2e-04
Identities = 39/116 (33%), Positives = 43/116 (37%), Gaps = 4/116 (3%)

Query: 558 AGAGGDGGTGGAGGTGGGIVTGINGGAGGAGG-DGGDSGNGGNATGGGNGGNGGTAGTGG 616
+G G G GA T G I G G G G DG + N GGG+G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 617 AGGLGGGGFRVGHGGGGGNGGNGGVGGTATAGGDAGSGGTGGVGGDGSVGGDASDA 672
G GG G GGG G GGN A G G G S+ A A
Sbjct: 62 HGNGGGNG---NSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSA 114



Score = 37.0 bits (85), Expect = 3e-04
Identities = 29/90 (32%), Positives = 34/90 (37%)

Query: 531 GGDGGAGGTGGKAGAGGDGGNATDGGNAGAGGDGGTGGAGGTGGGIVTGINGGAGGAGGD 590
GGDG TG + +G G T G G DG + G +G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 591 GGDSGNGGNATGGGNGGNGGTAGTGGAGGL 620
G GNG + G G GGN A G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGF 92



Score = 36.6 bits (84), Expect = 4e-04
Identities = 28/80 (35%), Positives = 36/80 (45%)

Query: 539 TGGKAGAGGDGGNATDGGNAGAGGDGGTGGAGGTGGGIVTGINGGAGGAGGDGGDSGNGG 598
+GG G ++T G G G GG G G + N GG+G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 599 NATGGGNGGNGGTAGTGGAG 618
+ GGGNG +GG +GTGG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 36.6 bits (84), Expect = 4e-04
Identities = 28/85 (32%), Positives = 34/85 (40%)

Query: 461 NGGTGGTAGDGGNGQSGDLNANGGGGGRGGTGGAGGIGGTTTGGGDAGAGGGGGAGGTGG 520
+GG G G + SG++N G G GG G + G+G G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 521 SGGEGGAGGFGGDGGAGGTGGKAGA 545
G GG G GG G GG A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 34.7 bits (79), Expect = 0.002
Identities = 25/72 (34%), Positives = 31/72 (43%)

Query: 723 GHGGGAGGAGGDAGDGGKGGDALDGGTAGAGGAGGKAGNGGGAGSGGKGTFDGGAGGAGG 782
GH GA G+ G G G + G+G + GGG+GSG G G GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 783 DGGKAGNGGAGG 794
+G G G GG
Sbjct: 68 NGNSGGGSGTGG 79



Score = 34.3 bits (78), Expect = 0.002
Identities = 26/80 (32%), Positives = 33/80 (41%)

Query: 371 AGNTGTGGTGGTGGTGGNGDLHTNGGNGGTGGSGGAGIAGVGGGNGGTGGDGGKGGTGAN 430
+G G G G T GN + G G G S G+G + GG G G G G+
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 431 GAPLGASGGTGGDGGKGGGG 450
G +G +GG G GG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 33.9 bits (77), Expect = 0.003
Identities = 27/79 (34%), Positives = 30/79 (37%), Gaps = 1/79 (1%)

Query: 631 GGGGNGGNGGVGGTA-TAGGDAGSGGTGGVGGDGSVGGDASDAVAGGDGGTGGRGGSGGA 689
GG G G N G T+ G G GG DGS ++ GG G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 690 GGIATGGGSAGHGGVGGGG 708
G G S G G GG
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 33.9 bits (77), Expect = 0.003
Identities = 31/85 (36%), Positives = 38/85 (44%), Gaps = 1/85 (1%)

Query: 353 AGRGGAAGAGGPLVTPGHAGNTGTGGTGGTGGTGGNGDLHTNGGNGGTGGSGGAGIAGVG 412
+G G G T G+ TG G G + G+G N GG GSG G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 413 GGNGGTGGDGGKG-GTGANGAPLGA 436
GNGG G+ G G GTG N + + A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 33.5 bits (76), Expect = 0.004
Identities = 24/80 (30%), Positives = 30/80 (37%)

Query: 526 GAGGFGGDGGAGGTGGKAGAGGDGGNATDGGNAGAGGDGGTGGAGGTGGGIVTGINGGAG 585
G G G + GA T G G G G + G+G GG G + G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 586 GAGGDGGDSGNGGNATGGGN 605
G GG G+SG G G +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLS 82



Score = 33.1 bits (75), Expect = 0.005
Identities = 29/110 (26%), Positives = 37/110 (33%)

Query: 415 NGGTGGDGGKGGTGANGAPLGASGGTGGDGGKGGGGGNATDGGHGGNGGTGGTAGDGGNG 474
+GG G G +G G G G GG G G +++ G G G GG+G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 475 QSGDLNANGGGGGRGGTGGAGGIGGTTTGGGDAGAGGGGGAGGTGGSGGE 524
GGG G G + G A + G G S G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 33.1 bits (75), Expect = 0.005
Identities = 32/105 (30%), Positives = 42/105 (40%), Gaps = 2/105 (1%)

Query: 265 AGSGGSGGAGGDGAPGDTAGAGGTGGTGSAGGAGGTGGAGGANQFFGHAGDGGHGGTGGT 324
+G G G G + G TG G + G+G + N + G +G G H G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 325 GGTGGAGGSGVGSGLAGGAGGDGGAGGAAGRGGAA--GAGGPLVT 367
G GG G+ G GG A A G + GAGG V+
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVS 106



Score = 31.6 bits (71), Expect = 0.017
Identities = 28/82 (34%), Positives = 32/82 (39%), Gaps = 6/82 (7%)

Query: 395 GGNGGTGGSGGAGIAGVGGGNGGTGGDGGKGGTGANGAPLGASGGTGGDGGKGGGGGNAT 454
G N G + G G G G G G G + N G SG GG G G
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGN--- 64

Query: 455 DGGHGGNGGTGGTAGDGGNGQS 476
GG GN GG +G GGN +
Sbjct: 65 -GGGNGNS--GGGSGTGGNLSA 83



Score = 30.8 bits (69), Expect = 0.025
Identities = 27/89 (30%), Positives = 34/89 (38%), Gaps = 1/89 (1%)

Query: 321 TGGTGGTGGAGGSGVGSGLAGGAGGDGGAGGAAGRGGAAGAGGPLVTPGHAGNTGTGGTG 380
+GG G G + GG G G GGA+ G + P G GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPW-GGGSGSGIHWGGGS 60

Query: 381 GTGGTGGNGDLHTNGGNGGTGGSGGAGIA 409
G G GGNG+ G GG + A +A
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVA 89



Score = 30.8 bits (69), Expect = 0.026
Identities = 24/79 (30%), Positives = 29/79 (36%)

Query: 146 GSGAPGQRGGAGGAGGLLLGNGGAGGAGGVGALGQASGTGGNGGAGGLLFGKGGAGGAGG 205
G G GA G + G G GG + G + N GG G GG+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 206 VGGLGQAGGTGGNGGAGGL 224
G G GG+G G L
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 30.8 bits (69), Expect = 0.028
Identities = 30/95 (31%), Positives = 34/95 (35%), Gaps = 2/95 (2%)

Query: 213 GGTGGNGGAGGLLFGNGGAGGVGGAGGVGGVDSAGGTGGTGGTGGANGLFGAAGSGGSGG 272
G G G + G GVGG G S+ GG+G G +G G GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 273 AGGDGAPGDTAGAGGTGGTGSAGG--AGGTGGAGG 305
G G T G A G A T GAGG
Sbjct: 68 NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 30.5 bits (68), Expect = 0.031
Identities = 33/103 (32%), Positives = 37/103 (35%), Gaps = 1/103 (0%)

Query: 171 GAGGVGALGQASGTGGNGGAGGLLFGKGGAGGAGGVGGLGQAGGTGGNGGAGGLLFGNGG 230
G G G A T GN G G GG G + G G + GG G+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGG-GASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 231 AGGVGGAGGVGGVDSAGGTGGTGGTGGANGLFGAAGSGGSGGA 273
G GG G GG GG A G + G G A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 30.5 bits (68), Expect = 0.032
Identities = 28/82 (34%), Positives = 33/82 (40%), Gaps = 2/82 (2%)

Query: 651 AGSGGTGGVGGDGSVGGDASDAVAGGDGGTGGRGGSGGAGGIATGGGSAGHGGVGGGGGN 710
+G G G G S G+ + G G G GSG + GG G G GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGG--GSGSGIHWGGG 59

Query: 711 GGHGTDGTAGVAGHGGGAGGAG 732
GHG G G +G G G GG
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNL 81


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_3762  HTHTETR  55     2e-11    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 54.6 bits (131), Expect = 2e-11
Identities = 26/167 (15%), Positives = 57/167 (34%), Gaps = 11/167 (6%)

Query: 6 RNAQANRRQRREQMECRLLEATERLMNNGASFTELSVDRLATEAGISRASFYIYFDDKGH 65
R + ++ R+ +L+ RL + + S+ +A AG++R + Y +F DK
Sbjct: 3 RKTKQEAQETRQ----HILDVALRLFSQQ-GVSSTSLGEIAKAAGVTRGAIYWHFKDKSD 57

Query: 66 LLRRLAGQVFDDLATGAQHWWDVAWRHDPDDVRAAMRAII------ARYRRHQPILIALN 119
L + ++ + +R + ++ R R I+
Sbjct: 58 LFSEIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKC 117

Query: 120 EMAGYDPQTAQTYRDILTAISARLARVIEDGQADGSIRPELSATTTA 166
E G Q R++ R+ + ++ + +L A
Sbjct: 118 EFVGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAA 164


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_3763  cloacin  37     5e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 37.4 bits (86), Expect = 5e-04
Identities = 35/118 (29%), Positives = 46/118 (38%), Gaps = 4/118 (3%)

Query: 649 IGGAGGDGGAGGKAQAAGFADGTEGVGGAGGEGGAGGVAGD----GGKGADAAAFSGAAG 704
+ G G G G +G +G G GG G G G+ + G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 705 GNGGHGGDNGAGGAGGTGGAGSTVGAHGADGFSPITGGNGGDGGNGASGPAASAGVAG 762
G+G GG+ +GG GTGG S V A A GF ++ G S A SA +A
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 37.0 bits (85), Expect = 7e-04
Identities = 38/119 (31%), Positives = 47/119 (39%), Gaps = 2/119 (1%)

Query: 423 NGGNGGVGGAGGVGGSGGQGGFLRFLGGQGDGGAGGAGGAGGVAGDGGKGADAAAFSGAA 482
+GG+G G SG G LG G G + GG G+ G+
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 483 GGNGGHGGDNGAGGAGGTGGAGSTVGAHGADGFSPITGGNGGDGGNGASGPAASAGVAG 541
GNGG G +GG GTGG S V A A GF ++ G S A SA +A
Sbjct: 62 HGNGGGNG--NSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 37.0 bits (85), Expect = 8e-04
Identities = 34/81 (41%), Positives = 38/81 (46%), Gaps = 3/81 (3%)

Query: 234 AGGQGLGLNGGAGGAGGN--GGLFGIGGTGGAGGDSAGSGSAGAGGDG-GHGGLWGRGGA 290
+GG G G N GA GN GG G+G GGA S S G G G G WG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 291 GGIGGVNSDSGDGGAGGGGGA 311
G GG N +SG G GG +
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLS 82



Score = 35.8 bits (82), Expect = 0.002
Identities = 30/89 (33%), Positives = 33/89 (37%)

Query: 1618 GNGGSGGNGGIGGNATGFNAPGTGGAGGNGGNGAFGGSGGTGGRGGSSSVGTGGAGGDGG 1677
G G G N G + N TG G G + G S GG S G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1678 DGGMGFGGIGGGGGTGGQGGSGVDAQGIG 1706
G G G GGG GTGG + G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 34.7 bits (79), Expect = 0.004
Identities = 36/103 (34%), Positives = 41/103 (39%), Gaps = 1/103 (0%)

Query: 798 AGASGENGGDGGAGGMGGAGGMGGVLGGHGGAGGIGGVGATGGSGGLGATGAEGVTGGNL 857
+G G G G G LG GGA G + G G+ GG+
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 858 HGGDGGNGGKGGIGGAGGDGGAGGKAQAAGF-ADGTEGAGGAG 899
HG GGNG GG G GG+ A A GF A T GAGG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 34.3 bits (78), Expect = 0.005
Identities = 34/118 (28%), Positives = 40/118 (33%)

Query: 1018 NGGDGHTGLTGGDGGAGGKGGALAGHGGDGGTGGVGGTGGTGGSGGTGTSGIFSSANGGD 1077
+GGDG TG +G G G G GG G G G+ G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1078 GGNGGDGGTGGTGGVGGLGGQAQAAGFADGTQGVGGAGGVGGTGGNAGNGGHGANADV 1135
GNGG G G G G A AA A G + G G + A AD+
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADI 119



Score = 33.9 bits (77), Expect = 0.007
Identities = 35/116 (30%), Positives = 43/116 (37%), Gaps = 5/116 (4%)

Query: 1240 GTGGTGGQGGAGGSGGALAGHGGDGGSGGDGGTGGTGGTGRNGANGITGADI----DGGS 1295
G G G GA + G + G G GG G + N G +G+ I G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1296 GGKGGNGGTGGAGGAGGQGGQAQAAGYSDGTQGVGGAGGDGGTAGIGGDGGDGANA 1351
G GGNG +GG G GG A AA + G + G G I A A
Sbjct: 63 GNGGGNGNSGGGSGTGG-NLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIA 117



Score = 33.9 bits (77), Expect = 0.007
Identities = 27/81 (33%), Positives = 35/81 (43%)

Query: 993 GNGGTGGTGGTGGIGFNGSTTIGGGNGGDGHTGLTGGDGGAGGKGGALAGHGGDGGTGGV 1052
G G G T G G T +G G G +G + + GG G+ GG G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 1053 GGTGGTGGSGGTGTSGIFSSA 1073
GG G +GG GTG + +A
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAA 86



Score = 33.1 bits (75), Expect = 0.010
Identities = 29/92 (31%), Positives = 37/92 (40%), Gaps = 4/92 (4%)

Query: 870 IGGAGGDGGAGGKAQAAGFADGTEGAGGAGGEGGAGGVAGD----GGKGADAAAFSGAAG 925
+ G G G G +G +G G GG G G G+ + G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 926 GNGGHGGDNGAGGAGGTGGAGSTVGAHGADGF 957
G+G GG+ +GG GTGG S V A A GF
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGF 92



Score = 33.1 bits (75), Expect = 0.010
Identities = 35/103 (33%), Positives = 40/103 (38%), Gaps = 1/103 (0%)

Query: 577 AGASGENGGDGGAGGMGGAGGMGGVLGGHGGAGGIGGVGATGGSGGLGATGAEGVTGGNL 636
+G G G G G LG GGA G + G G+ GG+
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 637 HGGDGGNGGKGGIGGAGGDGGAGGKAQAAGF-ADGTEGVGGAG 678
HG GGNG GG G GG+ A A GF A T G GG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.8 bits (74), Expect = 0.014
Identities = 28/83 (33%), Positives = 36/83 (43%)

Query: 1584 GGDGGGGGPGGLGGQGGLSGDGSSTGAAGELGTFGNGGSGGNGGIGGNATGFNAPGTGGA 1643
GGDG G G G ++G + G G S N GG+ +G + G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1644 GGNGGNGAFGGSGGTGGRGGSSS 1666
G GGNG GG GTGG + +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVA 85



Score = 32.4 bits (73), Expect = 0.019
Identities = 31/84 (36%), Positives = 39/84 (46%), Gaps = 1/84 (1%)

Query: 1201 SGGDGGLYGNGGDGGDGG-NGGKGQVGSTGPVAGSAGGTGGTGGTGGQGGAGGSGGALAG 1259
SGGDG + G G NGG +G G + +G + GG G+G G +G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1260 HGGDGGSGGDGGTGGTGGTGRNGA 1283
HG GG+G GG GTGG A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVA 85



Score = 32.0 bits (72), Expect = 0.025
Identities = 34/117 (29%), Positives = 47/117 (40%), Gaps = 1/117 (0%)

Query: 1058 TGGSGGTGTSGIFSSANGGDGGNGGDGGTGGTGGVGGLGGQAQAAGFADGTQGVGGAGGV 1117
+GG G +G S++ +GG G G GG G + G G+ G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1118 GGTGGNAGNGGHGANADVGSGKAGGNGGDGGDPGVGGIGGQGGAGSIAGVEGAAGVA 1174
G GG GN G G + G+ A G P + G G A SI+ +A +A
Sbjct: 62 HGNGGGNGNSG-GGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIA 117



Score = 32.0 bits (72), Expect = 0.025
Identities = 33/100 (33%), Positives = 40/100 (40%), Gaps = 3/100 (3%)

Query: 1415 GAGGAGGDGGLYGNGGN--GGDGGTGGKGTVGD-SGATLSADGDRGGAGATGGVGGAGGD 1471
G G G + G + GN GG G G G D SG + + GG+G+ GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1472 GGAKGGNGGLGGSGGTGGMGGTGGDGAHGADMASGSGANG 1511
G G GGSG G + A G S GA G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 32.0 bits (72), Expect = 0.029
Identities = 31/99 (31%), Positives = 37/99 (37%), Gaps = 3/99 (3%)

Query: 261 GGAGGDSAGSGSAGAGGDGGHGGLWGRGGAG---GIGGVNSDSGDGGAGGGGGAGGRLFG 317
G G + G+ S +GG GL GGA G N+ G G G GG G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 318 SGGAGGAGGTGAVAGSGGDGGAGGAAVGLWGLGGHGGAG 356
+GG G G G+ G A A G L G G
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 31.6 bits (71), Expect = 0.032
Identities = 22/83 (26%), Positives = 30/83 (36%)

Query: 1116 GVGGTGGNAGNGGHGANADVGSGKAGGNGGDGGDPGVGGIGGQGGAGSIAGVEGAAGVAP 1175
G G G N G N + G G GG G G GS +G+ G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1176 TSGGNGGDGGNGASGATGVNGGA 1198
+GG G+ G G+ ++ A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVA 85



Score = 31.6 bits (71), Expect = 0.033
Identities = 29/80 (36%), Positives = 32/80 (40%), Gaps = 2/80 (2%)

Query: 941 GTGGAGSTVGAHGADGF--SPTTGGNGGDGGSGGSGFQGVKGGAGGVGGDGGLYGNGGTG 998
G G G GAH G TG G G S GSG+ GG G G +G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 999 GTGGTGGIGFNGSTTIGGGN 1018
G GG G GS T G +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLS 82



Score = 31.6 bits (71), Expect = 0.034
Identities = 23/73 (31%), Positives = 29/73 (39%)

Query: 1193 GVNGGAGGSGGDGGLYGNGGDGGDGGNGGKGQVGSTGPVAGSAGGTGGTGGTGGQGGAGG 1252
G N GA + G+ G G G + G G P G +G GG G G GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 1253 SGGALAGHGGDGG 1265
+G + G G G
Sbjct: 68 NGNSGGGSGTGGN 80


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3764  CARBMTKINASE  39     2e-05    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 38.7 bits (90), Expect = 2e-05
Identities = 25/104 (24%), Positives = 44/104 (42%), Gaps = 7/104 (6%)

Query: 156 DNDRLSALVAHLVGADALVLLSDIDGLYDADPGKFQNARFIPEVSGPADLDGVVAGQGSH 215
D D +A V AD ++L+D++G G + +++ EV +L + H
Sbjct: 214 DKDLAGEKLAEEVNADIFMILTDVNGAA-LYYGT-EKEQWLREVK-VEELRKYY--EEGH 268

Query: 216 LGTGGMASKMSSALLAADA-GVPVLLAPAADAAAALTDASVGTV 258
G M K+ +A+ + G ++A A AL + GT
Sbjct: 269 FKAGSMGPKVLAAIRFIEWGGERAIIAHLEKAVEAL-EGKTGTQ 311


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits  Score  E-value  Comments
MMAR_3765  SECA  31     0.009    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 31.4 bits (71), Expect = 0.009
Identities = 21/101 (20%), Positives = 40/101 (39%), Gaps = 29/101 (28%)

Query: 387 IGQTNFDNDEAVGYLADRLARLGVEEELL---------RLGAKPGC--AVTIGEMTFDWE 435
+G + + E +++ L + G++ +L + A+ G AVTI
Sbjct: 454 VGTISIEKSE---LVSNELTKAGIKHNVLNAKFHANEAAIVAQAGYPAAVTIA------- 503

Query: 436 PQTPAGGHVAMSGRGTDVRLERSDRVGAAERKAARRQRRER 476
M+GRGTD+ L S + A + ++ E+
Sbjct: 504 --------TNMAGRGTDIVLGGSWQAEVAALENPTAEQIEK 536


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_3776  TONBPROTEIN  34     3e-04    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 34.2 bits (78), Expect = 3e-04
Identities = 24/115 (20%), Positives = 32/115 (27%)

Query: 49 PAAPGAFEPPAPQDAPPAPELADADPDLPPPAPDAMIAPVDMPEAPAPEEAPPAPQDAPP 108
PA P + P D P + + P P+ P EAP E P P
Sbjct: 41 PAQPISVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKP 100

Query: 109 APEAMIAPVDMPDGPAPEDAPPAPDAFAPPAPEDVPPAPEPVAFEPPAPDVAPPP 163
P + D E P +P PA A + + P
Sbjct: 101 KPVKKVQEQPKRDVKPVESRPASPFENTAPARLTSSTATAATSKPVTSVASGPRA 155



Score = 27.6 bits (61), Expect = 0.042
Identities = 28/111 (25%), Positives = 35/111 (31%), Gaps = 12/111 (10%)

Query: 74 PDLPPPAPDAMIAPVDMPEAPAPEEAPPAPQDAPPAPEAMIAPVDMPDGPAPEDAPPAPD 133
P P M+ P D+ A + P + P PE P PE AP
Sbjct: 39 PAPAQPISVTMVTPADLEPPQAVQPPPEPVVEPEPEPE-----------PIPEPPKEAPV 87

Query: 134 AFAPPAPEDVP-PAPEPVAFEPPAPDVAPPPAPKVYTVNWDAIAQCESGGN 183
P P+ P P P E P DV P + A A+ S
Sbjct: 88 VIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESRPASPFENTAPARLTSSTA 138


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3790  ISCHRISMTASE  35     1e-04    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 35.4 bits (81), Expect = 1e-04
Identities = 21/84 (25%), Positives = 33/84 (39%)

Query: 138 LTANSWGAAILDGLVVDDIDIHVNKHRMSGFWDTELDSILRNLGVRHLFLCGVNVDQCVY 197
L + + I+ L +D D+ + K R S F T L ++R G L + G+
Sbjct: 99 LNSGPYEEKIITELAPEDDDLVLTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCL 158

Query: 198 ATLIDAACAGYDCLLITDASATTS 221
T +A + DA A S
Sbjct: 159 VTACEAFMEDIKAFFVGDAVADFS 182


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3796  NUCEPIMERASE  33     0.011    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 32.8 bits (75), Expect = 0.011
Identities = 22/149 (14%), Positives = 43/149 (28%), Gaps = 38/149 (25%)

Query: 1381 TVLITGGTGMAGGWLARHVVDHYGVRHVLLASRSGDRAGGSAEIA-----------AELA 1429
L+TG G G +++ +++ G + G + EL
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEA------------GHQVVGIDNLNDYYDVSLKQARLELL 49

Query: 1430 ARGVQVEVVACDVADRDAVTALLARLPQQYPLTGVIH---AAGVLDDAVITSLTPDRVDT 1486
A+ + D+ADR+ + L V V + +
Sbjct: 50 AQP-GFQFHKIDLADRE----GMTDLFASGHFERVFISPHRLAV-------RYSLENPHA 97

Query: 1487 VLRAKVDAAWNLHELTRDLGVSAFVLFSS 1515
+ + N+ E R + + SS
Sbjct: 98 YADSNLTGFLNILEGCRHNKIQHLLYASS 126


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3797  DHBDHDRGNASE  32     0.016    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 32.3 bits (73), Expect = 0.016
Identities = 33/164 (20%), Positives = 55/164 (33%), Gaps = 10/164 (6%)

Query: 1432 VLITGGTGVLGMALARHLATHHHCEHLLLVSRRGVAADGAQELRAELAGHGCQVEFAACD 1491
ITG +G A+AR LA ++ + +++ + L E D
Sbjct: 11 AFITGAAQGIGEAVARTLA-----SQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 1492 TADSDQLSTLLQSIPVE-HPLGAVIHAAGVLSDGVIEGLGREQVEQVLRPKLDAALLLHE 1550
DS + + I E P+ +++ AGVL G+I L E+ E
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 1551 LT----QDLDLSAFVLFSSAAGVLGSPGQANYAAANAFLDALAQ 1590
D + V S + A YA++ A +
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTK 169


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3798  DHBDHDRGNASE  50     1e-07    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 49.7 bits (118), Expect = 1e-07
Identities = 49/230 (21%), Positives = 85/230 (36%), Gaps = 27/230 (11%)

Query: 3276 ALVTGVTGHLGQHIARWLAQAGASHLVLLSRTAAEHPQVAELEKELNSAGITTTSISVDV 3335
A +TG +G+ +AR LA GA ++ ++ ++ L + + DV
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAH----IAAVDYNPEKLEKVVSSLKAEARHAEAFPADV 66

Query: 3336 TDRDALAAVVAETRIEHGPIHTVVHAAAHIGLVTTAETTIDEFIKSFAAKALGAENLI-- 3393
D A+ + A E GPI +V+ A + + +E+ +F+ + G N
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126

Query: 3394 --AVLEDQPPQTFIMFSSAAATWGGTRQGAYAAANAYIEALVT----RLRGRG--CRAIA 3445
+ D+ + + S A T AYA++ A L C ++
Sbjct: 127 VSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186

Query: 3446 PA-------WGAWTDDRTTTQEVVGYFSR----IGLNQIS--PDIAFAAL 3482
P W W D+ Q + G I L +++ DIA A L
Sbjct: 187 PGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVL 236


47  MMAR_3870  MMAR_3943  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
MMAR_3870   3      14       -1.279983   hypothetical protein
MMAR_3871   2      17        0.729486   hypothetical protein
MMAR_3872   1      17        0.011908   lipoprotein LppS
MMAR_5557   0      21        0.504815   hypothetical protein
MMAR_3873   1      23        1.769027*  hypothetical protein
MMAR_3874   1      28        1.088943   hypothetical protein
MMAR_3875   0      28        0.666989   PE-PGRS family protein
MMAR_3876   2      27       -0.767075   hypothetical protein
MMAR_3877   0      16       -2.292551   hypothetical protein
MMAR_3878  -1      16       -2.232737   hypothetical protein
MMAR_3879  -1      16       -2.857685   hypothetical protein
MMAR_3880  -1      15       -3.734423   hypothetical protein
MMAR_3881  -2      14       -3.038042   hypothetical protein
MMAR_3882  -1      14       -2.851282   bacteriophage membrane protein
MMAR_3883  -1      14       -2.351048   bacteriophage membrane protein
MMAR_3884   0      17       -2.609041   hypothetical protein
MMAR_3885   0      18       -2.753382   hypothetical protein
MMAR_3886   0      21       -2.823067   bacteriophage-like membrane protein
MMAR_3887   1      19       -4.323264   hypothetical protein
MMAR_3888   1      25       -4.484535   hypothetical protein
MMAR_3889   0      29       -4.779785   hypothetical protein
MMAR_3890   0      28       -3.483015   hypothetical protein
MMAR_3891   1      23       -3.098135   phage-like protein
MMAR_3892   4      25       -1.295734   phage-like protein
MMAR_3893   3      24       -1.138760   hypothetical protein
MMAR_3894   2      22       -1.093577   hypothetical protein
MMAR_3895   1      24       -1.232976   phage-like protein
MMAR_3896   2      25       -1.870081   phage structural protein
MMAR_3897   2      26       -1.813468   phage-like protein
MMAR_3898   1      24       -2.552701   hypothetical protein
MMAR_3899   1      24       -2.588326   hypothetical protein
MMAR_3900   0      23       -2.978937   hypothetical protein
MMAR_3901  -1      19       -4.034381   phage terminase-like large subunit protein
MMAR_3902   2      29       -5.118990   hypothetical protein
MMAR_3903   2      32       -4.862834   hypothetical protein
MMAR_3904   1      30       -5.166138   hypothetical protein
MMAR_3905   0      28       -5.701691   hypothetical protein
MMAR_3906   0      34       -5.914965   hypothetical protein
MMAR_3908  -1      37       -4.099059   hypothetical protein
MMAR_3909   0      46       -3.564437   hypothetical protein
MMAR_3910   4      51       -4.327832   hypothetical protein
MMAR_3911   5      49       -4.028507   hypothetical protein
MMAR_3912   5      43       -3.282731   transcriptional regulatory protein
MMAR_3913   2      39       -2.612145   hypothetical protein
MMAR_3914   2      35       -2.516534   hypothetical protein
MMAR_3915   0      31       -2.637170   hypothetical protein
MMAR_3916   0      32       -2.168242   hypothetical protein
MMAR_3917   1      33       -1.650735   hypothetical protein
MMAR_3919   3      32       -3.014711   hypothetical protein
MMAR_3921   2      27       -3.254250   phage DNA methylase
MMAR_3923   1      25       -4.030589   hypothetical protein
MMAR_3924   1      25       -3.948237   hypothetical protein
MMAR_3925   2      24       -4.390576   hypothetical protein
MMAR_3926   1      25       -4.295153   DNA polymerase III (beta chain) DnaN
MMAR_3927   1      24       -3.030946   RecT-family phage protein
MMAR_3928   2      27       -2.854864   hypothetical protein
MMAR_3929   2      30       -2.567618   hypothetical protein
MMAR_3930   3      30       -2.322522   putative regulatory protein
MMAR_3931   2      24       -1.786356   hypothetical protein
MMAR_3932   3      18       -0.773771   hypothetical protein
MMAR_3933   3      19       -1.336379   hypothetical protein
MMAR_3934   3      22       -1.828784   hypothetical protein
MMAR_3935   4      25       -0.880276   hypothetical protein
MMAR_3936   4      26       -1.046554   hypothetical protein
MMAR_3937   5      28       -1.036238   hypothetical protein
MMAR_3938   3      21       -2.360133   hypothetical protein
MMAR_3939   3      22       -2.429076   hypothetical protein
MMAR_3940   3      23       -2.543160   hypothetical protein
MMAR_3941   2      25       -3.334867   hypothetical protein
MMAR_3942   1      22       -3.112645   hypothetical protein
MMAR_3943   1      22       -3.146483   phage antirepressor protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_3870  FLGHOOKFLIK  29     0.012    Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 29.4 bits (65), Expect = 0.012
Identities = 28/107 (26%), Positives = 43/107 (40%), Gaps = 18/107 (16%)

Query: 88 AMPLLLAGALALSAFLGWQQWQQ---HQVKLAGQQAQQAA--------IAYAQVLTSIDS 136
PL A LSA LG +WQQ + L +Q QQ+A + Q+ +D
Sbjct: 220 TQPLPTVAAPVLSAPLGSHEWQQSLSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDD 279

Query: 137 NNVD-------QNFRQVLDGATGEFKDMYTQSSVQLRQLLIDNKASA 176
N Q+ R L+ A + +S +QL Q I ++ +
Sbjct: 280 NQAQIQMVSPHQHVRAALEAALPVLRTQLAESGIQLGQSNISGESFS 326


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_3871  IGASERPTASE  31     0.005    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 31.2 bits (70), Expect = 0.005
Identities = 19/101 (18%), Positives = 27/101 (26%), Gaps = 7/101 (6%)

Query: 155 TSNPLTAIPPLGQPYIQAALDAAAPPHDPGTYPFVADRWTPDKVYSGGYRAWA----LTP 210
N Q L A D G+ FV DR ++ G Y WA +
Sbjct: 260 FGNSKEEHSDPKGILSQDPLTNYAVLGDSGSPLFVYDREKGKWLFLGSYDFWAGYNKKSW 319

Query: 211 DELVLYLPDYPVG---HDEPIDFTPGAAQWSMDGGAVVAHI 248
E +Y + D +S + I
Sbjct: 320 QEWNIYKSQFTKDVLNKDSAGSLIGSKTDYSWSSNGKTSTI 360


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_3875  cloacin  34     5e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 34.3 bits (78), Expect = 5e-04
Identities = 28/83 (33%), Positives = 34/83 (40%), Gaps = 6/83 (7%)

Query: 163 MIGNGGAGGNGGPAGLILGNLYRGGAGGMGGAGGWLIGDGGAGGTGGFAGIDNGRGGWGG 222
M G G G N G A GN+ GG G+G GG G G + + G WGG
Sbjct: 1 MSGGDGRGHNTG-AHSTSGNI-NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 223 WGGPATMFGSGGAGGNGGDANAG 245
G G+GG GN G +
Sbjct: 59 GSGH----GNGGGNGNSGGGSGT 77



Score = 31.6 bits (71), Expect = 0.003
Identities = 31/91 (34%), Positives = 33/91 (36%), Gaps = 10/91 (10%)

Query: 156 GAGGMGGMIGNGGAGGN--GGPAGLILGNLYRGGAGGMGGAGGWLIGDGGAGGTGGFAGI 213
G G G G GN GGP GL G GG GW + GG G +GI
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGL-------GVGGGASDGSGWSSENNPWGGGSG-SGI 54

Query: 214 DNGRGGWGGWGGPATMFGSGGAGGNGGDANA 244
G G G GG G G G A A
Sbjct: 55 HWGGGSGHGNGGGNGNSGGGSGTGGNLSAVA 85



Score = 28.1 bits (62), Expect = 0.044
Identities = 25/77 (32%), Positives = 31/77 (40%), Gaps = 7/77 (9%)

Query: 140 GNGGRGGDSTAVGIAGGAGGMGGMIGNGGAGGNGG-------PAGLILGNLYRGGAGGMG 192
G GRG ++ A +G G +G GG +G P G G+ G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 193 GAGGWLIGDGGAGGTGG 209
G GG GG GTGG
Sbjct: 63 GNGGGNGNSGGGSGTGG 79


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_3888  IGASERPTASE  31     0.006    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 30.8 bits (69), Expect = 0.006
Identities = 18/73 (24%), Positives = 27/73 (36%)

Query: 176 DIDFNQTKTTLGSTPTPNPLPTVNNLTVGTIAATTVDLSWTDIPIADSFKVQQSPTGAGT 235
+D L S N + N LTV +++ TD+ KV + + G
Sbjct: 864 QLDLANGHIHLNSADNSNNVTKYNTLTVNSLSGNGSFYYLTDLSNKQGDKVVVTKSATGN 923

Query: 236 WTDVTALNGGEPT 248
+T A GEP
Sbjct: 924 FTLQVADKTGEPN 936


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_3892  adhesinmafb  29     0.005    Neisseria meningitidis: adhesin MafB signature.
		>adhesinmafb#Neisseria meningitidis: adhesin MafB signature.

Length = 467

Score = 29.3 bits (65), Expect = 0.005
Identities = 18/64 (28%), Positives = 25/64 (39%), Gaps = 6/64 (9%)

Query: 17 RLRDNVKDVKPDYERLVDTQMRLA------AVEGEAYMKEHAPWLDSTGNRKDRVPGAAR 70
++ N D + +R+ D L A E M EH LD GN + + G A
Sbjct: 174 SIKLNPTDTRSIRQRISDNYSNLGSNFSDRADEANRKMFEHNAKLDRWGNSMEFINGVAA 233

Query: 71 SGLN 74
LN
Sbjct: 234 GALN 237


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
MMAR_3901  CHANNELTSX  29     0.037    Nucleoside-specific channel-forming protein Tsx signa...
		>CHANNELTSX#Nucleoside-specific channel-forming protein Tsx

signature.
Length = 294

Score = 29.2 bits (65), Expect = 0.037
Identities = 19/65 (29%), Positives = 30/65 (46%), Gaps = 5/65 (7%)

Query: 80 KGFAKTELMALIAYAELHPESPVRFDGFNRDGGL-KQGRPVFDPYIPMLANAKLQVNELA 138
+ FAK + Y + +PV F G + G+ +G P+F P + KL +L+
Sbjct: 62 EAFAKKDWFDFYGYID----APVFFGGNSTAKGIWNKGSPLFMEIEPRFSIDKLTNTDLS 117

Query: 139 FGALK 143
FG K
Sbjct: 118 FGPFK 122


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
MMAR_3927  OMADHESIN  29     0.044    Yersinia outer membrane adhesin signature.
		>OMADHESIN#Yersinia outer membrane adhesin signature.

Length = 455

Score = 28.7 bits (63), Expect = 0.044
Identities = 29/91 (31%), Positives = 36/91 (39%), Gaps = 1/91 (1%)

Query: 160 VTHYSEYVQTTKVDGVAQPNSMWSKMPRNQLAKCAEALALQRAYPDELSGIVLEDAAQVI 219
+TH + + T VAQ K N + AE LA AY D S VL A
Sbjct: 196 LTHLAAGTKDTDAVNVAQLKKEIEKTQENTNKRSAELLANANAYADNKSSSVLGIANNYT 255

Query: 220 DSDGQIINETQRPPARARGAAALRDRAKAEA 250
DS E R A A+ L + AKA +
Sbjct: 256 DSKSAETLENARKEAFAQSKDVL-NMAKAHS 285


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3929  FERRIBNDNGPP  29     0.031    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 28.8 bits (64), Expect = 0.031
Identities = 18/58 (31%), Positives = 22/58 (37%), Gaps = 19/58 (32%)

Query: 71 EWLAA-------ITPSKVAAILGVSRFESPYRLWHRMKGLVDPEPPKDIFDVGHDFEP 121
EWL I P VA + YRLW + +P P + DVG EP
Sbjct: 42 EWLPVELLLALGIVPYGVADTIN-------YRLW-----VSEPPLPDSVIDVGLRTEP 87


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3941  ICENUCLEATIN  31     5e-04    Ice nucleation protein signature.
		>ICENUCLEATIN#Ice nucleation protein signature.

Length = 1258

Score = 31.3 bits (70), Expect = 5e-04
Identities = 13/29 (44%), Positives = 18/29 (62%)

Query: 72 YGSRSTAQKRADLLESYGATAIVERSSRI 100
YGS TAQ+R+DL YG+T+ S +
Sbjct: 804 YGSTQTAQERSDLTTGYGSTSTAGADSSL 832



Score = 29.0 bits (64), Expect = 0.004
Identities = 11/21 (52%), Positives = 15/21 (71%)

Query: 72 YGSRSTAQKRADLLESYGATA 92
YGS TAQ+ +DL YG+T+
Sbjct: 900 YGSTQTAQENSDLTTGYGSTS 920



Score = 28.2 bits (62), Expect = 0.006
Identities = 12/30 (40%), Positives = 17/30 (56%)

Query: 72 YGSRSTAQKRADLLESYGATAIVERSSRIS 101
YGS TA ++ L YG+T E SS ++
Sbjct: 980 YGSTQTAGYQSTLTAGYGSTQTAEHSSTLT 1009



Score = 28.2 bits (62), Expect = 0.008
Identities = 11/21 (52%), Positives = 15/21 (71%)

Query: 72 YGSRSTAQKRADLLESYGATA 92
YGS TAQ +DL+ YG+T+
Sbjct: 516 YGSTQTAQNESDLITGYGSTS 536



Score = 28.2 bits (62), Expect = 0.008
Identities = 12/29 (41%), Positives = 17/29 (58%)

Query: 72 YGSRSTAQKRADLLESYGATAIVERSSRI 100
YGS TAQ+ +DL YG+T+ S +
Sbjct: 852 YGSTQTAQENSDLTTGYGSTSTAGYDSSL 880



Score = 27.8 bits (61), Expect = 0.009
Identities = 9/21 (42%), Positives = 15/21 (71%)

Query: 72 YGSRSTAQKRADLLESYGATA 92
YGS TA++++ L YG+T+
Sbjct: 756 YGSTQTAREQSVLTTGYGSTS 776



Score = 27.8 bits (61), Expect = 0.010
Identities = 12/20 (60%), Positives = 14/20 (70%)

Query: 72 YGSRSTAQKRADLLESYGAT 91
YGS TAQK +DL YG+T
Sbjct: 372 YGSTQTAQKGSDLTAGYGST 391



Score = 27.8 bits (61), Expect = 0.011
Identities = 12/21 (57%), Positives = 14/21 (66%)

Query: 72 YGSRSTAQKRADLLESYGATA 92
YGS TAQK +DL YG+T
Sbjct: 276 YGSTQTAQKGSDLTAGYGSTG 296



Score = 27.8 bits (61), Expect = 0.011
Identities = 9/21 (42%), Positives = 15/21 (71%)

Query: 72 YGSRSTAQKRADLLESYGATA 92
YGS TA++++ L YG+T+
Sbjct: 612 YGSTQTAREQSVLTTGYGSTS 632



Score = 27.4 bits (60), Expect = 0.011
Identities = 12/20 (60%), Positives = 14/20 (70%)

Query: 72 YGSRSTAQKRADLLESYGAT 91
YGS TAQK +DL YG+T
Sbjct: 420 YGSTQTAQKGSDLTAGYGST 439



Score = 27.4 bits (60), Expect = 0.012
Identities = 12/20 (60%), Positives = 14/20 (70%)

Query: 72 YGSRSTAQKRADLLESYGAT 91
YGS TAQK +DL YG+T
Sbjct: 324 YGSTQTAQKGSDLTAGYGST 343



Score = 27.4 bits (60), Expect = 0.014
Identities = 12/21 (57%), Positives = 15/21 (71%)

Query: 72 YGSRSTAQKRADLLESYGATA 92
YGS TAQK +DL YG+T+
Sbjct: 468 YGSTQTAQKGSDLTAGYGSTS 488



Score = 27.0 bits (59), Expect = 0.017
Identities = 11/21 (52%), Positives = 15/21 (71%)

Query: 72 YGSRSTAQKRADLLESYGATA 92
YGS TAQ+ +DL YG+T+
Sbjct: 660 YGSTQTAQEGSDLTAGYGSTS 680


48  MMAR_3967  MMAR_3972  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_3967  -1      18       -4.129971  hypothetical protein
MMAR_3968  -1      18       -4.481675  hypothetical protein
MMAR_3969  -1      16       -3.730118  cytochrome P450 269A1 Cyp269A1
MMAR_3970  -2      17       -3.834944  hypothetical protein
MMAR_3971  -1      18       -4.186645  oxidoreductase
MMAR_3972   0      21       -4.369638  non-ribosomal peptide synthetase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_3970  TCRTETB  133    4e-36    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 133 bits (337), Expect = 4e-36
Identities = 93/421 (22%), Positives = 188/421 (44%), Gaps = 18/421 (4%)

Query: 27 RRNFIFLALVLGILLSSLDQTIVAIALPTIVADLGEAGRQ-SWVVTSYLLASTIATALVG 85
R N I + L + S L++ ++ ++LP I D + +WV T+++L +I TA+ G
Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70

Query: 86 KLGDMFGRKRVFQVAALLFVAGSVSCGLTQSM-TMLVASRALQGVGGGAITVTAIALIGE 144
KL D G KR+ ++ GSV + S ++L+ +R +QG G A + ++
Sbjct: 71 KLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVAR 130

Query: 145 VVPLRDRGRYQGILGAVIGIATIGGPLLGGYFTDCLSWRWAFWINLPVSAVVICVATAAI 204
+P +RG+ G++G+++ + GP +GG + W++ + +P+ ++ +
Sbjct: 131 YIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIH--WSYLLLIPMITIITVPFLMKL 188

Query: 205 PALAATTRPVIDYAGIMFIGLSLAALTLATSLGGSVYAWGSAPIIGLFTAAGVTLAVFVW 264
+ D GI+ + + + L T + Y+ + L +FV
Sbjct: 189 LKKEVRIKGHFDIKGIILMSVGIVFFMLFT----TSYSISFLIVSVLS------FLIFVK 238

Query: 265 VETIAAQPILPIRLFAAPVFSVCCVLAFVVGFAMLGALIFVPTFMQYVNGVS-ATASGLR 323
P + L F + + ++ + G + VP M+ V+ +S A +
Sbjct: 239 HIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVI 298

Query: 324 ILPMVIGMLITSIGSGSMVGRTGRYKIFPVLGTALMTLAFLLMSRMDESTSAAVQSVYLL 383
I P + ++I G +V R G + +G ++++FL S + E+TS + + +
Sbjct: 299 IFPGTMSVIIFGYIGGILVDRRGPLYVL-NIGVTFLSVSFLTASFLLETTSWFMTIIIVF 357

Query: 384 ILGSAIGMSSQVLVIIVQNTSEFEDLGVATSGVSLFRTIGGSFGAAIFGSLF-VNFLNSR 442
+LG + + V+ IV ++ + ++ G S ++ + G AI G L + L+ R
Sbjct: 358 VLGG-LSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQR 416

Query: 443 L 443
L
Sbjct: 417 L 417


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3972  NUCEPIMERASE  62     3e-12    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 62.5 bits (152), Expect = 3e-12
Identities = 44/209 (21%), Positives = 79/209 (37%), Gaps = 44/209 (21%)

Query: 662 LVTGATGFLGLYLASRLVELLPE---LDVFCLIRAQSEEQARERLRQSCLRYGMSVALLD 718
LVTGA GF+G +++ RL+E + +D + + ++ R L
Sbjct: 4 LVTGAAGFIGFHVSKRLLEAGHQVVGIDNL----NDYYDVSLKQARLELLAQ-------P 52

Query: 719 RVSVVAGDIEDSALALTDDVYSTLAGRVDTVYHCAADISYVKPYSVMRGP------NVTG 772
D+ D D++++ G + V+ ++ YS + P N+TG
Sbjct: 53 GFQFHKIDLAD--REGMTDLFAS--GHFERVFISPHRLAV--RYS-LENPHAYADSNLTG 105

Query: 773 TQNLLKFAVEGHAKSFHYVSTAAVFGATGTFLGINSVDEGFNIDQSLELMSVENGYTQSK 832
N+L+ + Y S+++V+G ++D + L Y +K
Sbjct: 106 FLNILEGCRHNKIQHLLYASSSSVYGLNRKM----PFSTDDSVDHPVSL------YAATK 155

Query: 833 WVAETMVQAASH------RGLR-VSIYRP 854
E M SH GLR ++Y P
Sbjct: 156 KANELMAHTYSHLYGLPATGLRFFTVYGP 184


49  MMAR_4083  MMAR_4089  Y  N  N  Genomic Island
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_4083   2      16       -2.493518  UDP-N-acetylglucosamine
MMAR_4084   4      17       -2.508001  cobalamin adenosyltransferase
MMAR_4085   4      16       -2.052899  hypothetical protein
MMAR_4086   4      17       -2.248210  F0F1 ATP synthase subunit epsilon
MMAR_4087   3      18       -2.140377  F0F1 ATP synthase subunit beta
MMAR_4088   4      15       -2.566836  F0F1 ATP synthase subunit gamma
MMAR_4089   4      15       -2.508321  F0F1 ATP synthase subunit alpha
50  MMAR_4149  MMAR_4163  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_4149   2      18        2.688961  PE-PGRS family protein
MMAR_4150   2      23       -0.752143  hypothetical protein
MMAR_4151   1      17       -0.368597  hypothetical protein
MMAR_4152   0      13       -0.818385  lipoprotein LprA
MMAR_4153  -3      13       -0.895802  hypothetical protein
MMAR_4154  -3      13       -1.596389  hypothetical protein
MMAR_4155  -3      14       -2.255212  transcriptional regulatory protein EmbR
MMAR_4156  -4      17       -2.400589  Ser/Thr protein kinase
MMAR_4157  -1      16       -3.126283  LuxR family transcriptional regulator
MMAR_4158  -2      19       -3.063744  citrate (pro-3S)-lyase subunit beta
MMAR_4159  -2      17       -2.602554  dehydratase
MMAR_4160  -2      16       -2.060920  acetyl-CoA hydrolase/transferase
MMAR_4161   0      14       -1.246711  acyl-CoA dehydrogenase FadE3
MMAR_4162   2      16       -1.450285  glycerolphosphodiesterase GdpD
MMAR_4163   4      16       -1.693555  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_4149  cloacin  38     1e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 37.8 bits (87), Expect = 1e-04
Identities = 27/81 (33%), Positives = 32/81 (39%)

Query: 499 GGDGSSSGTGAGGSGGDATDGGTGGDGGSGGGFGTGGAGGTGGWLIGHSGTNGIGGTGGA 558
GGDG TGA + G+ G TG G G G+G + W G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 559 GGAGAVGGDGGVGGDPGVGTT 579
G G G GG G G +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSA 83



Score = 37.4 bits (86), Expect = 2e-04
Identities = 29/74 (39%), Positives = 33/74 (44%), Gaps = 2/74 (2%)

Query: 470 GNGGDGGAGGDGGV--GGTGGIGGAGGNGGVGGDGSSSGTGAGGSGGDATDGGTGGDGGS 527
G G + GA G GG G+G GG G S + GGSG GG G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 528 GGGFGTGGAGGTGG 541
GG +GG GTGG
Sbjct: 66 GGNGNSGGGSGTGG 79



Score = 35.8 bits (82), Expect = 5e-04
Identities = 30/80 (37%), Positives = 34/80 (42%), Gaps = 2/80 (2%)

Query: 339 SGGEG-GTGGVAIASGGNAYGGSGGTGGNGATGASGGTGGTGGTGGDG-GHGGLLIGNGG 396
SGG+G G A ++ GN GG G G G G G G G G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 397 IGGIGGTGGVGGIGGTGGDG 416
G GG G GG GTGG+
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 35.5 bits (81), Expect = 7e-04
Identities = 28/84 (33%), Positives = 34/84 (40%), Gaps = 2/84 (2%)

Query: 291 GNGGSGADGGAGVTGGT--GGVGGFANNEGSGDANGGFGGTGGNGGAPGASGGEGGTGGV 348
G G G + GA T G GG G G+ D +G GG G+ GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 349 AIASGGNAYGGSGGTGGNGATGAS 372
G GG GTGGN + A+
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 35.1 bits (80), Expect = 9e-04
Identities = 32/91 (35%), Positives = 38/91 (41%), Gaps = 6/91 (6%)

Query: 471 NGGDGGAGGDGGVGGTGGIGGAGGNGGVGGDGSSSGTGAGGSGGDATDGGTGGDGGSGGG 530
+GGDG G +G I G GVGG S GSG + + GG GSG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASD------GSGWSSENNPWGGGSGSGIH 55

Query: 531 FGTGGAGGTGGWLIGHSGTNGIGGTGGAGGA 561
+G G G GG G +G GG A A
Sbjct: 56 WGGGSGHGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 35.1 bits (80), Expect = 9e-04
Identities = 29/78 (37%), Positives = 35/78 (44%), Gaps = 1/78 (1%)

Query: 438 GGDGGDAGTGGDGGTGGV-GGVGGSGGAGGLLFGNGGDGGAGGDGGVGGTGGIGGAGGNG 496
GGDG TG +G + GG G G GG G+G GG G+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 497 GVGGDGSSSGTGAGGSGG 514
G GG +SG G+G G
Sbjct: 63 GNGGGNGNSGGGSGTGGN 80



Score = 33.9 bits (77), Expect = 0.002
Identities = 27/84 (32%), Positives = 28/84 (33%)

Query: 456 GGVGGSGGAGGLLFGNGGDGGAGGDGGVGGTGGIGGAGGNGGVGGDGSSSGTGAGGSGGD 515
GG G G +GG G G GG G G GS SG GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 516 ATDGGTGGDGGSGGGFGTGGAGGT 539
GG G GG G G A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 33.9 bits (77), Expect = 0.002
Identities = 32/83 (38%), Positives = 37/83 (44%), Gaps = 8/83 (9%)

Query: 303 VTGGTGGVGGFANNEGSGDANGGFGGTGGNGGAPGASGGE------GGTGGVAIASGGNA 356
++GG G + SG+ NGG G G GGA SG GG G I GG
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGG- 59

Query: 357 YGGSGGTGGNGATGASGGTGGTG 379
G G GGNG +G GTGG
Sbjct: 60 -SGHGNGGGNGNSGGGSGTGGNL 81



Score = 31.2 bits (70), Expect = 0.012
Identities = 30/88 (34%), Positives = 36/88 (40%), Gaps = 4/88 (4%)

Query: 420 GVGGSALNTGSGVADANIGGDGGDAGTGGDGGTGGVGGVGGSGGAGGLLFGNGGDGGAGG 479
G G NTG+ NI +GG G G GG G G G+G GG
Sbjct: 3 GGDGRGHNTGAHSTSGNI--NGGPTGLGVGGGASDGSGWSSENNPWG--GGSGSGIHWGG 58

Query: 480 DGGVGGTGGIGGAGGNGGVGGDGSSSGT 507
G G GG G +GG G GG+ S+
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 30.8 bits (69), Expect = 0.016
Identities = 28/82 (34%), Positives = 32/82 (39%), Gaps = 2/82 (2%)

Query: 155 GTGGSAGLIGAGGT--GGAGGLGAAGGAGGNGGWLFGQGGAGGIGGSGAAGGAGGNGGWL 212
G G + G G GG GLG GGA GW GG GSG G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 213 YGDGGAGGQGGAAAESINDGSP 234
G+G +GG G +P
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAP 87



Score = 30.5 bits (68), Expect = 0.020
Identities = 31/112 (27%), Positives = 41/112 (36%), Gaps = 2/112 (1%)

Query: 131 SGGDGGILYGNGGDGGSGASGQIGGTGGSAGLIGAGGTGGAGGLGAAGGAGGNGGWLFGQ 190
SGGDG +G G G G + G+G + GG G+G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGG--ASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 191 GGAGGIGGSGAAGGAGGNGGWLYGDGGAGGQGGAAAESINDGSPGQGGNGGS 242
G G GG+G +GG G GG L G A + G + G+
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 29.7 bits (66), Expect = 0.037
Identities = 24/74 (32%), Positives = 32/74 (43%), Gaps = 1/74 (1%)

Query: 317 EGSGDANGGFGGTGGNGGAPGASGGEGG-TGGVAIASGGNAYGGSGGTGGNGATGASGGT 375
+G G G +G G P G GG + G +S N +GG G+G + G+ G
Sbjct: 5 DGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGN 64

Query: 376 GGTGGTGGDGGHGG 389
GG G G G G
Sbjct: 65 GGGNGNSGGGSGTG 78



Score = 29.7 bits (66), Expect = 0.039
Identities = 33/110 (30%), Positives = 42/110 (38%), Gaps = 6/110 (5%)

Query: 259 LAGADGVNYSNGTAGPGAWGNNVTTSIGDANGGNGGSG------ADGGAGVTGGTGGVGG 312
++G DG ++ G N T +G G + GSG GG +G G G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 313 FANNEGSGDANGGFGGTGGNGGAPGASGGEGGTGGVAIASGGNAYGGSGG 362
N G +GG GTGGN A A G +GG A S G
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4156  YERSSTKINASE  40     3e-05    Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 40.1 bits (93), Expect = 3e-05
Identities = 45/197 (22%), Positives = 86/197 (43%), Gaps = 35/197 (17%)

Query: 134 GVMHRDVKPPNILITR-DDFAYLVDFGIASATGDEKLTQLGTAVGTWKYMAPER-FANEE 191
GV+H D+KP N++ R ++D G+ S +G++ T + APE N
Sbjct: 265 GVVHNDIKPGNVVFDRASGEPVVIDLGLHSRSGEQP------KGFTESFKAPELGVGNLG 318

Query: 192 VTYRADIYALACVLFECLTG---SPPYRSDSAGTLVT---AHLMD----PV--PQVSTVR 239
+ ++D++ + L C+ G +P + + +T AH+MD P+ P ++ V
Sbjct: 319 ASEKSDVFLVVSTLLHCIEGFEKNPEIKPNQGLRFITSEPAHVMDENGYPIHRPGIAGVE 378

Query: 240 SGIPKAFDAVIARGMAKKPEERYASAGDLARAAHDALSNP--DQDHAADILRRSRESTLP 297
+ + ++ +P+ A H+ LS+ D++ A IL+ TL
Sbjct: 379 TAYTRFITDILGVSADSRPDSNEAR-------LHEFLSDGTIDEESAKQILK----DTL- 426

Query: 298 GTAAITPQPPTMPAVTP 314
T ++P + +TP
Sbjct: 427 -TGEMSPLSTDVRRITP 442


51  MMAR_4219  MMAR_4224  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias   Product
MMAR_4219  3       15       4.291804  transcriptional regulator
MMAR_4220  2       13       3.892246  antibiotic ABC transporter ATP-binding protein
MMAR_4221  4       13       3.866589  tetronasin-transport integral membrane protein
MMAR_4222  5       15       3.778594  integral membrane protein
MMAR_4223  6       15       4.272218  PE-PGRS family protein
MMAR_4224  2       14       2.293420  PE-PGRS family protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_4219  HTHTETR  56     8e-12    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 55.8 bits (134), Expect = 8e-12
Identities = 36/155 (23%), Positives = 54/155 (34%), Gaps = 19/155 (12%)

Query: 10 ARIRDAAIEQFGQHGF-GVSLRAIAEGAGVSAALVIHHFGSKEGLRKACDNYVAEEIRSE 68
I D A+ F Q G SL IA+ AGV+ + HF K L I E
Sbjct: 14 QHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIG-E 72

Query: 69 KLTAMQSNDPATWLGQLAQV-----------ESYAPLMAYLVRSMQSGGELAMM------ 111
Q+ P L L ++ E LM + + GE+A++
Sbjct: 73 LELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRN 132

Query: 112 LWQQMIDNAEEYLAVGVRAGTIKASRDPKARAKFL 146
L + D E+ L + A + A + A +
Sbjct: 133 LCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIM 167


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_4223  cloacin  39     7e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.5 bits (89), Expect = 7e-05
Identities = 25/81 (30%), Positives = 33/81 (40%)

Query: 483 GGGGTGGDGGNNIIGANVGGDGGAGGAGGLGGNGTDGGWLSGNGGDGGAGGQGGDGGHGG 542
GG G G + G + N+ G G GG +G+ + G G G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 543 SPGGFDGKSGVGGDGGDGGNA 563
GG +G SG G G +A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSA 83



Score = 35.5 bits (81), Expect = 6e-04
Identities = 30/87 (34%), Positives = 34/87 (39%), Gaps = 3/87 (3%)

Query: 127 DGAPGQAGGDGGLLYGNGGNGGTSTTAGVAGGDGGAAGLIGNGGAGGGGGAGALGGNGGA 186
+G P G GG + G+G +S GG G G G G GGG G GG G
Sbjct: 21 NGGPTGLGVGGG---ASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGT 77

Query: 187 GGWLFGQGGAGGNGGTATLAGGAGGAG 213
GG L G A GAGG
Sbjct: 78 GGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 34.7 bits (79), Expect = 0.001
Identities = 27/86 (31%), Positives = 33/86 (38%), Gaps = 8/86 (9%)

Query: 247 GGGDGGRGGWLYGNGGVGGTGGTGGIGLQGASGGDGGAGGGTGLWGTGGVGGNGGTGGIG 306
G G G G +G + G G+G GGA G+G G G GI
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVG--------GGASDGSGWSSENNPWGGGSGSGIH 55

Query: 307 LDGVAGHIGGGNAGNAGNGGTGGSGG 332
G +GH GG GN+G G G
Sbjct: 56 WGGGSGHGNGGGNGNSGGGSGTGGNL 81



Score = 34.7 bits (79), Expect = 0.001
Identities = 32/86 (37%), Positives = 40/86 (46%), Gaps = 7/86 (8%)

Query: 406 GNAGVAGNGGAGGSAAMLFGNGGAGGNGGSGGDGGHGGNSTVSVPGGIGGDAGAGGTGGS 465
G G N GA ++ + NGG G G GG G S+ + P G G +G GGS
Sbjct: 3 GGDGRGHNTGAHSTSGNI--NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 466 AGKSGLLFGAGGAGGQGGGGGTGGDG 491
+G GG G GGG GTGG+
Sbjct: 61 GHGNG-----GGNGNSGGGSGTGGNL 81



Score = 33.9 bits (77), Expect = 0.002
Identities = 31/86 (36%), Positives = 37/86 (43%), Gaps = 4/86 (4%)

Query: 337 NGGDG-GHGGTGGGGGRGINGADGGHGGDGGTGGAAGSAGLLFGDGGTGGHGGAGFGGGN 395
+GGDG GH ING G G G GGA+ +G + GG G+G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNING---GPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 396 GSDSPQGGQGGNAGVAGNGGAGGSAA 421
GS GG GN+G G SA
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSAV 84



Score = 33.5 bits (76), Expect = 0.003
Identities = 37/105 (35%), Positives = 45/105 (42%), Gaps = 7/105 (6%)

Query: 142 GNGGNGGTSTTAGVAGGDGGAAGLIGNGGAGGGGGAGALGG-NGGAGGWLFGQGGAGGNG 200
G G N G +T+G +GG GL GGA G G + GG G GG G+G
Sbjct: 6 GRGHNTGAHSTSGNI--NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 201 GTATLAGGAGGAGGVGGSAGLWGTGGAGGNGGFGALNLAGDGGAG 245
GG G +GG G+ G A GF AL+ G GG
Sbjct: 64 N----GGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.8 bits (74), Expect = 0.005
Identities = 29/82 (35%), Positives = 34/82 (41%), Gaps = 4/82 (4%)

Query: 297 GGNGGTGGIGLDGVAGHIGGGNAGNAGNGGTGGSGGLLLGN----GGDGGHGGTGGGGGR 352
GG+G G +G+I GG G GG G N GG G GGG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 353 GINGADGGHGGDGGTGGAAGSA 374
G G +G GG GTGG +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAV 84



Score = 32.4 bits (73), Expect = 0.006
Identities = 25/76 (32%), Positives = 29/76 (38%), Gaps = 1/76 (1%)

Query: 275 QGASGGDGGAGGGTGLWGTGGVGGNGGTGGIGLDGVAGHIGGGNAGNAGNGGTGGSGGLL 334
+G + G G TG G G + G G GGG+ GG G G
Sbjct: 7 RGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGN-G 65

Query: 335 LGNGGDGGHGGTGGGG 350
GNG GG GTGG
Sbjct: 66 GGNGNSGGGSGTGGNL 81



Score = 32.0 bits (72), Expect = 0.007
Identities = 35/125 (28%), Positives = 46/125 (36%), Gaps = 7/125 (5%)

Query: 348 GGGGRGINGADGGHGGDGGTGGAAGSAGLLFGDGGTGGHGGAGFGGGNGSDSPQGGQGGN 407
GG GRG N G+ G G DG +GGG+GS GG G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 408 AGVAGNGGAGGSAAMLFGNGGAGGNGGSGGDGGHGGNSTVSVPGGIGGDAGAGGTGGSAG 467
GNG +GG + G GGN + G +S PG G SA
Sbjct: 63 GNGGGNGNSGGGS-------GTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115

Query: 468 KSGLL 472
+ ++
Sbjct: 116 IADIM 120



Score = 31.6 bits (71), Expect = 0.010
Identities = 28/82 (34%), Positives = 35/82 (42%), Gaps = 5/82 (6%)

Query: 192 GQGGAGGNGGTATLAGGAGGAGGVGGSAGLWGTGGAGGNGGFGALNLAGDGGAGAGGGDG 251
G G G N G + +G G G GL GGA G+ + N GG+G+G G
Sbjct: 3 GGDGRGHNTGAHSTSGNING-----GPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWG 57

Query: 252 GRGGWLYGNGGVGGTGGTGGIG 273
G G G G GG+G G
Sbjct: 58 GGSGHGNGGGNGNSGGGSGTGG 79



Score = 31.6 bits (71), Expect = 0.011
Identities = 30/105 (28%), Positives = 36/105 (34%), Gaps = 3/105 (2%)

Query: 453 IGGDAGAGGTGGSAGKSGLLFGAGGAGGQGGGGGTGGDGGNNIIGANVGGDGGAGGAGGL 512
+ G G G G+ SG + G G GGG G + N G G G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSE---NNPWGGGSGSGIHWG 57

Query: 513 GGNGTDGGWLSGNGGDGGAGGQGGDGGHGGSPGGFDGKSGVGGDG 557
GG+G G +GN G G G GF S G G
Sbjct: 58 GGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_4224  cloacin  37     3e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 36.6 bits (84), Expect = 3e-04
Identities = 30/78 (38%), Positives = 35/78 (44%), Gaps = 4/78 (5%)

Query: 154 GVAGGAGGSAGLIGNGGAGGGGGAGAVGGNGGAGGWLFGNGGAGGAGGATPGIGGGGGAG 213
G GA ++G I G G G G GA G+G W N GG G+ GGG G G
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSG----WSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 214 GAGGIGGAAGLFGNGGAG 231
GG G + G G GG
Sbjct: 64 NGGGNGNSGGGSGTGGNL 81



Score = 34.7 bits (79), Expect = 0.001
Identities = 30/83 (36%), Positives = 34/83 (40%), Gaps = 4/83 (4%)

Query: 304 GTGGTGLNGNAPFADGQHPVILDGGHGGTGGNGGAAGNGGLLFGNGGNGGLGGMGGGGGN 363
G G G N A G ++GG G G GGA+ G N GG G G G
Sbjct: 3 GGDGRGHNTGAHSTSGN----INGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 364 GLPSTGVGGDGGDGGNGGTAGNG 386
G GG+G GG GT GN
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNL 81



Score = 34.7 bits (79), Expect = 0.001
Identities = 33/98 (33%), Positives = 41/98 (41%), Gaps = 6/98 (6%)

Query: 127 DGAPGQAGGDGGLLYGNGGNGGTSTTAGVAGGAGGSAGLIGNGGAGGGGGAGAVGGNGGA 186
+G P G GG + G+G +S GG+G G G G GGG G GG G
Sbjct: 21 NGGPTGLGVGGG---ASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGT 77

Query: 187 GGWLFGNGGA---GGAGGATPGIGGGGGAGGAGGIGGA 221
GG L G +TPG GG + AG + A
Sbjct: 78 GGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 34.3 bits (78), Expect = 0.001
Identities = 36/115 (31%), Positives = 45/115 (39%), Gaps = 7/115 (6%)

Query: 328 GHGGTGGNGGAAGNGGLLFGNGGNGGLGGMGGGG-GNGLPSTGVGGDGGDGGNGGTAGNG 386
G G G N GA G + NGG GLG GG G+G S GG G G
Sbjct: 3 GGDGRGHNTGAHSTSGNI--NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 387 GWLIGNGGTGGQGGAGFAGGTGADNVSAGRPGGAGGTGGIGAGGGNAGLIGTGGS 441
G +G GG G +G GTG + + P G G G + + G+
Sbjct: 61 G----HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 34.3 bits (78), Expect = 0.001
Identities = 32/105 (30%), Positives = 41/105 (39%), Gaps = 5/105 (4%)

Query: 271 GDGGRGLPGGDGGSGGAGGGTGLWGSGGAGGQGGTGGTGLNGNAPFADGQHPVILDGGHG 330
G GRG G + G G G G G+G + + N P+ G I GG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWS--SENNPWGGGSGSGIHWGGGS 60

Query: 331 GTGGNGGAAGNGGLLFGNGGNGGLGGMGGGGGNGLPSTGVGGDGG 375
G G GG +GG G+G G L + G P+ G GG
Sbjct: 61 GHGNGGGNGNSGG---GSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 34.3 bits (78), Expect = 0.001
Identities = 34/114 (29%), Positives = 46/114 (40%), Gaps = 1/114 (0%)

Query: 484 GSGSRGGDGGDGGNGGEGRGGFSVPQHGVGGQGGTGGNGSDGGWLYGDGGAGGAGGNGGF 543
G RG + G G GG + G G G+G + + W G G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 544 GGSGVQNGAGGDAGSGGDSRLIGDGGAAGAGGTGAP-PGADGHSGADGLLSAAL 596
G G +GG +G+GG+ + A G P G S + G LSAA+
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAI 116



Score = 33.9 bits (77), Expect = 0.002
Identities = 32/87 (36%), Positives = 37/87 (42%), Gaps = 8/87 (9%)

Query: 371 GGDGGDGGNGGTAGNGGWLIGNGGTGGQGGAGFAGGTGADNVSAGRPGGAGGTGGIGAGG 430
GGDG G + +G G G G GGA G ++N P G G GI GG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSEN----NPWGGGSGSGIHWGG 58

Query: 431 GNAGLIGTGGSGGNGGMGGHGGDSGYG 457
G+ G G GGNG GG G G
Sbjct: 59 GS----GHGNGGGNGNSGGGSGTGGNL 81



Score = 33.5 bits (76), Expect = 0.002
Identities = 27/76 (35%), Positives = 30/76 (39%), Gaps = 1/76 (1%)

Query: 452 GDSGYGDQTGGQGGNGGM-GGAGGAAGNGGLLLGSGSRGGDGGDGGNGGEGRGGFSVPQH 510
G G G TG +G + GG G GG GSG + GG G G H
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 511 GVGGQGGTGGNGSDGG 526
G GG G G GS G
Sbjct: 63 GNGGGNGNSGGGSGTG 78



Score = 33.5 bits (76), Expect = 0.003
Identities = 32/98 (32%), Positives = 37/98 (37%), Gaps = 3/98 (3%)

Query: 134 GGDGGLLYGNGGNGGTSTTAGVAGGAGGSAGLIGNGGAGGGGGAGAVGGNGGAGGWLFGN 193
G + G +G G T GV GGA +G GGG + GG+G GN
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH---GN 64

Query: 194 GGAGGAGGATPGIGGGGGAGGAGGIGGAAGLFGNGGAG 231
GG G G G GG A A G L G G
Sbjct: 65 GGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 31.2 bits (70), Expect = 0.013
Identities = 27/80 (33%), Positives = 30/80 (37%), Gaps = 4/80 (5%)

Query: 419 GAGGTGGIGAGGGNAGLIGTGGSGGNGGMGGHGGDSGYGDQTGGQGGNGGMGGAGGAAGN 478
G G G + GN TG G G G G S GG G GG G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 479 GGLLLGSGSRGGDGGDGGNG 498
G G+G+ GG G GGN
Sbjct: 66 G----GNGNSGGGSGTGGNL 81



Score = 30.8 bits (69), Expect = 0.018
Identities = 28/81 (34%), Positives = 33/81 (40%)

Query: 234 GGDGGLNYYGDGGVAGAGGDGGRGGWLHGDGGDGGAGGDGGRGLPGGDGGSGGAGGGTGL 293
GGDG + G +G G G + G DG GG G GGG+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 294 WGSGGAGGQGGTGGTGLNGNA 314
GG G GG GTG N +A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSA 83



Score = 29.7 bits (66), Expect = 0.044
Identities = 25/76 (32%), Positives = 30/76 (39%), Gaps = 3/76 (3%)

Query: 415 GRPGGAGGTGGIGAGGGNAGLIGTGGSGGNGGMGGHGGDSGYGDQTGGQGGNGGMGGAGG 474
G GA T G GG +G G S G+G + G GG G G GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 475 AAGNGGLLLGSGSRGG 490
+GG GSG+ G
Sbjct: 68 NGNSGG---GSGTGGN 80


52  MMAR_4255  MMAR_4266  Y  N  N  Genomic Island
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_4255  -1      15       -4.314337  hypothetical protein
MMAR_4256  0       14       -5.612077  hypothetical protein
MMAR_5553  0       15       -4.910744  hypothetical protein
MMAR_4257  -1      14       -5.113798  nitrite extrusion protein, NarK3_3
MMAR_4258  0       18       -6.845213  monophosphatase CysQ-like protein
MMAR_4259  0       17       -6.445978  bifunctional enzyme CysN/CysC-like: sulfate
MMAR_4260  -1      15       -5.366922  glycolipid sulfotransferase
MMAR_4261  0       18       -4.604890  hypothetical protein
MMAR_4262  0       20       -4.956266  hypothetical protein
MMAR_4263  1       19       -3.688106  sulfate adenylyltransferase
MMAR_4264  3       20       -1.589948  hypothetical protein
MMAR_4265  4       16       1.627741   integral membrane nitrite extrusion protein
MMAR_4266  3       16       1.771696   PPE family protein
53  MMAR_4311  MMAR_4320  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_4311  10      26       4.756520   integral membrane protein
MMAR_4312  10      25       4.770877   hypothetical protein
MMAR_4313  10      25       4.455238   chalcone synthase
MMAR_4314  11      22       3.705246   hypothetical protein
MMAR_4315  11      19       2.160438   oxidoreductase
MMAR_4316  10      19       1.319214   PE-PGRS family protein
MMAR_4317  4       12       -3.177810  enoyl-CoA hydratase, EchA1
MMAR_4318  4       13       -3.374459  acetyl-CoA acetyltransferase
MMAR_4319  5       14       -3.437899  PPE family protein
MMAR_4320  3       12       -2.917713  PPE family protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4312  NUCEPIMERASE  63     4e-13    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 62.9 bits (153), Expect = 4e-13
Identities = 31/125 (24%), Positives = 51/125 (40%), Gaps = 18/125 (14%)

Query: 1 MRILVTGATGYVGSRLVTALLADGHEVLA---------ATRNMARLSRLAWFDDVTPVIL 51
M+ LVTGA G++G + LL GH+V+ + ARL LA +
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLA-QPGFQFHKI 59

Query: 52 DATDRASAQAAMNAAGQIDVVYYLVH------GIGQPD-FRDRDKTAAANLAVAARDTGV 104
D DR + A+G + V+ H + P + D + T N+ R +
Sbjct: 60 DLADRE-GMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKI 118

Query: 105 RRIVY 109
+ ++Y
Sbjct: 119 QHLLY 123


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits  Score  E-value  Comments
MMAR_4315  SECA  29     0.039    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 28.7 bits (64), Expect = 0.039
Identities = 23/86 (26%), Positives = 32/86 (37%), Gaps = 15/86 (17%)

Query: 243 VAGRVLLVGDAAGYEDALTGEGISLAVKQAAA-------AVRAIADND-PASYEAAWHRV 294
+ G VL A + TGEG +L A V + ND A +A
Sbjct: 89 LGGMVLNERCIA---EMRTGEGKTLTATLPAYLNALTGKGVHVVTVNDYLAQRDAEN--N 143

Query: 295 TRSYRWL--TRGLVLASAPRPARRAI 318
+ +L T G+ L P PA+R
Sbjct: 144 RPLFEFLGLTVGINLPGMPAPAKREA 169


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_4316  cloacin  40     1e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 40.1 bits (93), Expect = 1e-04
Identities = 33/104 (31%), Positives = 41/104 (39%), Gaps = 1/104 (0%)

Query: 1212 IGGAGGQGGSGGAAGTGGDVGGSPGEVGGAGGGGGFGGAGGAGGVGGGAGGSG-GVGGSG 1270
+ G G+G + GA T G++ G P +G GG G GG GSG GG
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 1271 GNGGLGVATGGAGGVGGQGGAAGAGGAAGAGATAAGAGGVGGLG 1314
G+G G GG G G + G A G GGL
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 39.3 bits (91), Expect = 2e-04
Identities = 29/88 (32%), Positives = 35/88 (39%)

Query: 1057 TGGAGGGAGSGAGGLGAGGDGGTGGAGGAGGVGSSAGWGSGLAGQVGGAGGVGGAGGDSG 1116
+GG G G +GA +GG G G GG +GW S GG+G GG SG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1117 GLGGTNGDGGAGGLGGRGGAGGSSTTVA 1144
G GG G G + VA
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVA 89



Score = 38.2 bits (88), Expect = 3e-04
Identities = 37/112 (33%), Positives = 45/112 (40%), Gaps = 2/112 (1%)

Query: 1193 GRGGDGGAGGIGGG--GGLSGIGGAGGQGGSGGAAGTGGDVGGSPGEVGGAGGGGGFGGA 1250
GRG + GA G GG +G+G GG G + GG G GGG G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 1251 GGAGGVGGGAGGSGGVGGSGGNGGLGVATGGAGGVGGQGGAAGAGGAAGAGA 1302
GG G GGG+G G + G G GG + AG + A A
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIA 117



Score = 37.4 bits (86), Expect = 7e-04
Identities = 39/110 (35%), Positives = 50/110 (45%), Gaps = 2/110 (1%)

Query: 509 AGGDGGAGGVGAEGAAGAGVVGGGAGGDGGAGGAAGAGGSGGGGIGGGKAGTG-GDGGIG 567
+GGDG GA +G + GG G G G + G+G S GG +G+G GG
Sbjct: 2 SGGDGRGHNTGAHSTSG-NINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 568 GAGGTGGGGGNGDTYSTGGAGGDGGAGGAAGSAGGAGAGSGGAAGSAGSG 617
G G GG G +G TGG A A G + G+GG A S +G
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 37.0 bits (85), Expect = 8e-04
Identities = 31/101 (30%), Positives = 44/101 (43%)

Query: 545 AGGSGGGGIGGGKAGTGGDGGIGGAGGTGGGGGNGDTYSTGGAGGDGGAGGAAGSAGGAG 604
+GG G G G + +G G G GGG +G +S+ GG+G GG+G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 605 AGSGGAAGSAGSGGSGGDGGAGGASGRELGSNLGYAGGVGG 645
G+GG G++G G G + A+ G G GG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 35.5 bits (81), Expect = 0.002
Identities = 32/118 (27%), Positives = 42/118 (35%), Gaps = 6/118 (5%)

Query: 1616 NGGRGGAGGAGGFAGDGEGSGGTAGSGGNGGKGGNAGAGGNGVPAAGAAAGNGGLGGSGG 1675
+GG G G + G +GG G G GG +G P G + GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1676 AGGSGAIGAAAGGAGGAGGNGGTGGNAGIGVMRGASTPALAGDGGVGGAGGLGGVARS 1733
G G G GG+G G + + PAL+ G G A + A S
Sbjct: 62 HGNGG------GNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALS 113



Score = 34.7 bits (79), Expect = 0.004
Identities = 36/103 (34%), Positives = 42/103 (40%), Gaps = 2/103 (1%)

Query: 970 GSGGGVGNAGVGGAGGVGGAGGAGGAADGPGLFGYDG--GAGGAGGIGGAAGVGGSNGAG 1027
G G + GG G G GGA+DG G + G G GI G G NG G
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 1028 GTGGAGGVGGVGADSGLASRPGGAGGAGGTGGAGGGAGSGAGG 1070
GG G G S +A+ A T GAGG A S + G
Sbjct: 68 NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 34.7 bits (79), Expect = 0.005
Identities = 27/80 (33%), Positives = 37/80 (46%)

Query: 485 AGGAGGAGGAGGAAAAGGTAGVGGAGGDGGAGGVGAEGAAGAGVVGGGAGGDGGAGGAAG 544
+GG G G + +G G G GG G+ ++ GGG+G GG +G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 545 AGGSGGGGIGGGKAGTGGDG 564
G GG G GG +GTGG+
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 34.3 bits (78), Expect = 0.005
Identities = 25/77 (32%), Positives = 27/77 (35%)

Query: 1104 GAGGVGGAGGDSGGLGGTNGDGGAGGLGGRGGAGGSSTTVAGAGGGGGRGGDGGSAGGGV 1163
G G GA SG + G G GG G S G G G G GGS G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 1164 GGGGVGGAAGSGGAGGA 1180
GG G G G +
Sbjct: 66 GGNGNSGGGSGTGGNLS 82



Score = 34.3 bits (78), Expect = 0.006
Identities = 35/108 (32%), Positives = 41/108 (37%), Gaps = 1/108 (0%)

Query: 1451 GAGGAGGQGGAANGGVAGDGGVGGNGGVGGVGGRGGDGANGAPGGGIDGTG-RPGGAGGQ 1509
G G G GA + +GG G G GG G + P GG G+G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1510 GGSGGRAGFGGAAGAGEGGEYGAAGVGGNGGDGGAGGRGGYGTTGSGG 1557
G GG GG +G G AA V G GG + S G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 33.9 bits (77), Expect = 0.006
Identities = 27/85 (31%), Positives = 35/85 (41%)

Query: 440 AAGQGRGGTVGAAGVGGTGGVGGDGGAGDSGAAASAPGGAGGTGWAGGAGGAGGAGGAAA 499
+ G GRG GA G G G GA+ + + W GG+G GG +
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 500 AGGTAGVGGAGGDGGAGGVGAEGAA 524
G G G +GG G GG + AA
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 33.5 bits (76), Expect = 0.009
Identities = 34/104 (32%), Positives = 43/104 (41%), Gaps = 2/104 (1%)

Query: 660 LAGAAGSGGNGGAGGAGGASAVALVGGAGGAGGAGGQGGTAGDGPGGVGGHGGSGGSGGI 719
++G G G N GA G G G G + G G ++ + P G GG G GG
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWG-GGSGSGIHWGGG 59

Query: 720 GGTGGDGYQSGDVGGQGGEGGAGGAAGAGGEAGAQGLAGAGGTG 763
G G G +G+ GG G GG A A G L+ G G
Sbjct: 60 SGHGNGG-GNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 33.1 bits (75), Expect = 0.011
Identities = 31/89 (34%), Positives = 37/89 (41%), Gaps = 2/89 (2%)

Query: 1496 GIDGTGRPGGAGGQGGS--GGRAGFGGAAGAGEGGEYGAAGVGGNGGDGGAGGRGGYGTT 1553
G DG G GA G+ GG G G GA +G + + GG G GG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1554 GSGGLFGGSGGHGGVGGIGGNGGSAAAGG 1582
G+GG G SGG G GG + A G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 32.8 bits (74), Expect = 0.017
Identities = 40/115 (34%), Positives = 45/115 (39%), Gaps = 10/115 (8%)

Query: 346 GGAGGAGGVGGAGTSGGDAVVPGGVGGVGGSGGAGG--------AGGSGGGAGWLGTAGD 397
GG G G TSG P G+G GG+ G GGSG G W G +G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 398 GGVGGVGGGGGGGGVGASGVGHQLAGGAGGAGGAGGAAGAGGAAGQGRGGTVGAA 452
G G G G GGG G G +A A GAGG A G + AA
Sbjct: 63 GNGG--GNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 32.4 bits (73), Expect = 0.019
Identities = 32/103 (31%), Positives = 37/103 (35%), Gaps = 5/103 (4%)

Query: 600 AGGAGAGSGGAAGSAGSGGSGGDGGAGGASGRELGSNLGYAGGVGGDGGQGGQGGAAVGG 659
+GG G G A S +GG G G G GS G G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 660 LAGAAGSGGNGGAGGAGG-----ASAVALVGGAGGAGGAGGQG 697
G+G +GG G GG A+ VA A GAGG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.4 bits (73), Expect = 0.019
Identities = 35/100 (35%), Positives = 42/100 (42%), Gaps = 4/100 (4%)

Query: 1148 GGGGRGGDGG--SAGGGVGGGGVGGAAGSGGAGGAGGRGIDSYAAAGGRGG--DGGAGGI 1203
GG GRG + G S G + GG G G G + G+G ++ G G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1204 GGGGGLSGIGGAGGQGGSGGAAGTGGDVGGSPGEVGGAGG 1243
G GGG GG G GG+ A G GAGG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 32.4 bits (73), Expect = 0.020
Identities = 28/81 (34%), Positives = 32/81 (39%)

Query: 1145 GAGGGGGRGGDGGSAGGGVGGGGVGGAAGSGGAGGAGGRGIDSYAAAGGRGGDGGAGGIG 1204
G G G G+ GG G GVGG A G + + +G G G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 1205 GGGGLSGIGGAGGQGGSGGAA 1225
GG G SG G G S AA
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAA 86



Score = 32.4 bits (73), Expect = 0.021
Identities = 30/84 (35%), Positives = 36/84 (42%), Gaps = 2/84 (2%)

Query: 1869 GAGGVGGFGGVGGTGASGLGGSGGIGGDGGA--GGVGGDCSVPLSPGNGGDGGAGGDGGD 1926
G G G G T + GG G+G GGA G + P G+G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1927 GGDGGNGQPGGPGGAGGGAASGGA 1950
G GGNG GG G GG ++ A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 32.4 bits (73), Expect = 0.021
Identities = 32/101 (31%), Positives = 40/101 (39%), Gaps = 2/101 (1%)

Query: 1567 GVGGIGGNGGSAAAGGVNGNGGNGGIGGNAGDAGNGANGSLLHHAGDGGNGGRG-GAGGA 1625
G G G N G+ + G N NGG G+G G + S + G G G G G
Sbjct: 3 GGDGRGHNTGAHSTSG-NINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1626 GGFAGDGEGSGGTAGSGGNGGKGGNAGAGGNGVPAAGAAAG 1666
G G SGG +G+GGN A G + A G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 32.4 bits (73), Expect = 0.022
Identities = 35/107 (32%), Positives = 43/107 (40%), Gaps = 3/107 (2%)

Query: 1249 GAGGAGGVGGGAGGSGGVGGSGGNGGLGVATGGAGGVGGQGGAAGAGGAAGAGATAAGAG 1308
G G G G SG + +GG GLGV G + G G GG +G+G G
Sbjct: 3 GGDGRGHNTGAHSTSGNI--NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 1309 GVGGLGGDGGNGGNGVRGAAGVAGGDGAVGGGGGAGGAGGQGGAGVT 1355
G G GG GN G G ++ V G A G GG V+
Sbjct: 61 GHGN-GGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVS 106



Score = 32.4 bits (73), Expect = 0.022
Identities = 30/82 (36%), Positives = 36/82 (43%), Gaps = 2/82 (2%)

Query: 1771 GAGGVGGNGGFAALGTGGAAGSGGGGGTGGAGGVSDSPTTSRSVGGAGGVGGVGGNGGIG 1830
G G G N G + G G G GGA S + + GG G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1831 GNGQIGGDGGSGGAAGAGGAGA 1852
GNG GG+G SGG +G GG +
Sbjct: 63 GNG--GGNGNSGGGSGTGGNLS 82



Score = 32.0 bits (72), Expect = 0.025
Identities = 39/135 (28%), Positives = 46/135 (34%)

Query: 756 LAGAGGTGGTGGQGGTGGIGAQGSNGHGVGGRPGTAGAVGGAGGAGGQGGAAGLDGTAGD 815
++G G G G T G G G GVGG G G +G+ G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 816 GGVGGTGGRGGAGGDGAGGVGHQLAGGAGGDGGAGGAAGVGGAAGAGSGGVVGAAGTGGT 875
G G G GG G GG +A A G GG A + S G + AA
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIM 120

Query: 876 GGAGGNGGAGDTGVA 890
G G GVA
Sbjct: 121 AALKGPFKFGLWGVA 135



Score = 32.0 bits (72), Expect = 0.029
Identities = 22/81 (27%), Positives = 30/81 (37%)

Query: 539 AGGAAGAGGSGGGGIGGGKAGTGGDGGIGGAGGTGGGGGNGDTYSTGGAGGDGGAGGAAG 598
+GG +G G G G+GG G G + + GG+G GG +G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 599 SAGGAGAGSGGAAGSAGSGGS 619
G G G+ G G S
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLS 82



Score = 32.0 bits (72), Expect = 0.032
Identities = 32/114 (28%), Positives = 38/114 (33%), Gaps = 6/114 (5%)

Query: 1425 GGNGGAGDAGVAGADGGGAGGSGWAGGAGGAGGQGGAANGGVAGDGGVGGNGGVGGVGGR 1484
GG+G + G G GG G GGA G ++ GG G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1485 GGDGANGAPGGGIDGTGRPGGAGGQGGSGGRAGFGGAAGAGEGGEYGAAGVGGN 1538
G G NG GGG G G FG A + G A +
Sbjct: 63 GNGGGNGNSGGG------SGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 31.6 bits (71), Expect = 0.034
Identities = 31/80 (38%), Positives = 35/80 (43%), Gaps = 1/80 (1%)

Query: 224 SGGAGGAGDVGVAGGAGGV-GGRGGWVFGDGGSGGVGGSGGVGVVGGVGGVGGGTGVFGG 282
SGG G + G +G + GG G G G S G G S GG G G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 283 GGAGGAGGVGGGTGGSGGNG 302
G GG G GG G+GGN
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 31.6 bits (71), Expect = 0.034
Identities = 32/108 (29%), Positives = 40/108 (37%), Gaps = 6/108 (5%)

Query: 929 GDGGTGGAGGAGAGGDRTDGGRGGVGGAGGDAGAGGVTGAGGSGGGVGNAGVGGAGGVGG 988
G G G GA + +GG G+G GG + G + GG +G+ GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 989 AGGAGGAADGPGLFGYDGGAGGAGGIGGAAGVGGSNGAGGTGGAGGVG 1036
G G G G G AA V A T GAGG+
Sbjct: 63 GNGGGNGNSG------GGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 31.6 bits (71), Expect = 0.039
Identities = 34/122 (27%), Positives = 46/122 (37%), Gaps = 5/122 (4%)

Query: 1596 AGDAGNGANGSLLHHAGDGGNGGRGGAGGAGGFAGDGEGSGGTAGSGGNGGKGGNAGAGG 1655
+G G G N +G+ G G G G G G S GG+G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1656 NGVPAAGAAAGNGGLGGSGGAGGSGAIGAAAGGAGGAGGNGGTGGNAGIGVMRGASTPAL 1715
+G GNG GG G GG+ + AA G + G + + GA + A+
Sbjct: 62 HG-----NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAI 116

Query: 1716 AG 1717
A
Sbjct: 117 AD 118



Score = 31.2 bits (70), Expect = 0.047
Identities = 34/112 (30%), Positives = 44/112 (39%), Gaps = 1/112 (0%)

Query: 571 GTGGGGGNGDTYSTGGAGGDGGAGGAAGSAGGAGAGSGGAAGSAGSGGSGGDGGAGGASG 630
G G G N +ST G +GG G G + + + GGSG GG SG
Sbjct: 3 GGDGRGHNTGAHSTSG-NINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 631 RELGSNLGYAGGVGGDGGQGGQGGAAVGGLAGAAGSGGNGGAGGAGGASAVA 682
G G +GG G GG A V A + G GG + A A++
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALS 113


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_4319  cloacin  34     0.002    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 33.9 bits (77), Expect = 0.002
Identities = 23/81 (28%), Positives = 30/81 (37%), Gaps = 6/81 (7%)

Query: 458 MGFGNGGGGNTGFY------NSGTYNTGFSNAGETNTGWENSGNVNTGGYNSGGLNTGIG 511
M G+G G NTG + N G G +GW + N GG SG G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 512 SPDTQAGPNSGFGHSGSGNSG 532
G + G SG+G +
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNL 81



Score = 30.8 bits (69), Expect = 0.019
Identities = 23/89 (25%), Positives = 30/89 (33%), Gaps = 11/89 (12%)

Query: 225 GSGNTGSANLGGGNIGNGNLGSGNTGNVNLGNGNNGFFNFGNGNLGDTNFGSGNSGNLNL 284
G G+ A+ GNI G G G G + G+G N G +
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSG-----------WSSENNPWGGGSGSGI 54

Query: 285 GSGNRFGSGNIGFGNRFGDGNFGSGNAGS 313
G G GN G G G+ GN +
Sbjct: 55 HWGGGSGHGNGGGNGNSGGGSGTGGNLSA 83



Score = 30.1 bits (67), Expect = 0.032
Identities = 27/90 (30%), Positives = 38/90 (42%), Gaps = 10/90 (11%)

Query: 378 MGFGNAGDNNVGFFNSGSNNIGFFNSGDGNFGFANAGSTNTGFWNSGGTNTGFGNGGSLN 437
M G+ +N G ++ N N G G S +G W+S G G+G ++
Sbjct: 1 MSGGDGRGHNTGAHSTSGN----INGGPTGLGVGGGASDGSG-WSSENNPWGGGSGSGIH 55

Query: 438 FGFGNGGVENMGHGNAGSFNMGFGNGGGGN 467
+G G G N G N G G+G GGN
Sbjct: 56 WG-GGSGHGNGGGNG----NSGGGSGTGGN 80


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_4320  cloacin  36     0.001    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 35.8 bits (82), Expect = 0.001
Identities = 29/98 (29%), Positives = 38/98 (38%), Gaps = 3/98 (3%)

Query: 757 GSGNHGDANLGFGNFGNGNIGSGNHGAGNFGSGNTGSRNLGSGNAGSTNFGSGNHGNSNV 816
G G++ A+ GN G G G G + GSG + N G +GS G G+ N
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 817 GLGNFGNNNLGLGNNGSNN---IGFGLTGDNLVGIGAL 851
G G G N S + FG + G G L
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGL 103



Score = 34.3 bits (78), Expect = 0.003
Identities = 24/77 (31%), Positives = 30/77 (38%)

Query: 717 GFGNIGQANLGSGNAGNTNLGSGNTGSTNFGSGNIGALNLGSGNHGDANLGFGNFGNGNI 776
G G+ A+ SGN G G G + GSG N G G G G+GN
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 777 GSGNHGAGNFGSGNTGS 793
G + G G+G S
Sbjct: 66 GGNGNSGGGSGTGGNLS 82



Score = 34.3 bits (78), Expect = 0.003
Identities = 27/109 (24%), Positives = 40/109 (36%), Gaps = 2/109 (1%)

Query: 737 GSGNTGSTNFGSGNIGALNLGSGNHGDANLGFGNFGNGNIGSGNHGAGNFGSGNTGSRNL 796
G G+ + SGNI G G G A+ G G N G G+G G +G N
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 797 GSGNAGSTNFGSGNHGNSNVGLGNFGNNNLGLGNNGSNNIGFGLTGDNL 845
G G+G + ++ FG L G+ + ++ L
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFP--ALSTPGAGGLAVSISAGAL 112



Score = 33.5 bits (76), Expect = 0.005
Identities = 28/109 (25%), Positives = 41/109 (37%), Gaps = 3/109 (2%)

Query: 942 NSGSYNT-GSFNSGTLNTGDFNGGDHNTGWGNSGNTNTGGINSGDLNTGFGSSADQAVTN 1000
N+G+++T G+ N G G G +GW + N GG SG G S
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGI--HWGGGSGHGNGGG 67

Query: 1001 SGFGNNGSGNSGFNNTGDTNSGFHNANTSALFSGHSGLLNAGGSQSVGI 1049
+G GSG G + F S +G + + G+ S I
Sbjct: 68 NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAI 116



Score = 32.4 bits (73), Expect = 0.013
Identities = 23/78 (29%), Positives = 29/78 (37%)

Query: 727 GSGNAGNTNLGSGNTGSTNFGSGNIGALNLGSGNHGDANLGFGNFGNGNIGSGNHGAGNF 786
G G+ + SGN G G G + GSG + N G G+G G G GN
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 787 GSGNTGSRNLGSGNAGST 804
G G+G S
Sbjct: 66 GGNGNSGGGSGTGGNLSA 83



Score = 30.5 bits (68), Expect = 0.040
Identities = 27/82 (32%), Positives = 34/82 (41%), Gaps = 4/82 (4%)

Query: 707 GSGNTGDANFGFGNIGQANLGSGNAGNTNLGSGNTGSTNFGSGNIGALNLGSGNHGDANL 766
G G+ A+ GNI G G G + GSG + N G G+ G G N
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 767 GFGNFGNGNIGSGNHGAGNFGS 788
G GNGN G G+ GN +
Sbjct: 66 G----GNGNSGGGSGTGGNLSA 83


54  MMAR_4393  MMAR_4405  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias   Product
MMAR_4393  0       11       3.160677  acetyl-CoA acetyltransferase
MMAR_4394  4       14       6.073982  hypothetical protein
MMAR_4395  5       14       6.585957  3-hydroxyisobutyryl-CoA hydrolase
MMAR_4396  5       13       7.020839  enoyl-CoA hydratase
MMAR_4397  6       13       7.566173  hypothetical protein
MMAR_4398  7       15       8.956126  PE-PGRS family protein
MMAR_4399  3       15       7.473852  PE-PGRS family protein
MMAR_4400  1       15       3.647617  hypothetical protein
MMAR_4401  0       13       3.041000  hypothetical protein
MMAR_4402  2       11       6.418006  lipoprotein LpqV
MMAR_4403  2       11       6.086716  hypothetical protein
MMAR_4404  3       11       5.230076  hypothetical protein
MMAR_4405  2       11       2.996844  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_4398  cloacin  38     2e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 37.8 bits (87), Expect = 2e-04
Identities = 37/121 (30%), Positives = 46/121 (38%), Gaps = 5/121 (4%)

Query: 560 TGGTGGAGGSGGTDSISGIAGGDGGAGGAGGWLSGTGGAG-----GSGGAGGDGGVLGSG 614
+GG G +G + I GG G G GG G+G + G G G GSG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 615 DGGAGGAGGSGGAGGLLGAGGTGGTGAIGGFSGLLASGDGGAGGAGGAGGAGGLLGGLVG 674
G GG G SGG G G GF L G GG + AG + ++
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMA 121

Query: 675 A 675
A
Sbjct: 122 A 122



Score = 37.0 bits (85), Expect = 3e-04
Identities = 35/122 (28%), Positives = 45/122 (36%), Gaps = 10/122 (8%)

Query: 399 GSGGNGGAGGAGGVFFGNGGAGGAGGTGGDG----------GGIGGAGGAAGNGVLIGNG 448
G G N GA G G G GG DG GG G+G G G GNG
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 449 GNGGIGGIGLTPGADGIGGTSGLVLGLDGFNAPASTSPLHTLQQQALNAINAPVEAATGR 508
G G G G G + + + G + P + ++ AL+A A + AA
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMAALKG 125

Query: 509 PL 510
P
Sbjct: 126 PF 127



Score = 35.8 bits (82), Expect = 8e-04
Identities = 37/104 (35%), Positives = 43/104 (41%), Gaps = 10/104 (9%)

Query: 264 GGGHGGTGGYGGGAGGAGGNAGLLAGTGGSGGTGGYGGGAGGAGGNAGLLFGNGGAGGAG 323
G H +G GG G G G G+G S +GGG+G G G G G
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHW----GGGSGHGNGG 66

Query: 324 ATGGGNTGGIGGDGGNAGMLFSNGGAGGAGGASELADGGAGGAG 367
G GN+GG G GGN S A A G L+ GAGG
Sbjct: 67 --GNGNSGGGSGTGGN----LSAVAAPVAFGFPALSTPGAGGLA 104



Score = 35.1 bits (80), Expect = 0.001
Identities = 29/86 (33%), Positives = 36/86 (41%), Gaps = 7/86 (8%)

Query: 352 AGGASELADGGAGGAGGNGGWLLSNGGVGGAGGAGADAVGPPFGIPAGSGGNGGAGGAGG 411
+GG + GA GN NGG G G G + G G + + GG G+G
Sbjct: 2 SGGDGRGHNTGAHSTSGN-----INGGPTGLGVGGGASDGS--GWSSENNPWGGGSGSGI 54

Query: 412 VFFGNGGAGGAGGTGGDGGGIGGAGG 437
+ G G G GG G GGG G G
Sbjct: 55 HWGGGSGHGNGGGNGNSGGGSGTGGN 80



Score = 34.7 bits (79), Expect = 0.002
Identities = 38/107 (35%), Positives = 41/107 (38%), Gaps = 15/107 (14%)

Query: 293 SGGTG-GYGGGAGGAGGNAGLLFGNGGAGGAGATGGGNTGGIGGDGGNAGMLFSNGGAGG 351
SGG G G+ GA GN NGG G G GG + G S G
Sbjct: 2 SGGDGRGHNTGAHSTSGNI-----NGGPTGLGVGGGASDGSGWS---------SENNPWG 47

Query: 352 AGGASELADGGAGGAGGNGGWLLSNGGVGGAGGAGADAVGPPFGIPA 398
G S + GG G G GG S GG G G A A FG PA
Sbjct: 48 GGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPA 94



Score = 34.3 bits (78), Expect = 0.003
Identities = 34/107 (31%), Positives = 43/107 (40%), Gaps = 5/107 (4%)

Query: 272 GYGGGAGGAGGNAGLLAGTGGSGGTGGYGGGAGGAGGNAGLLFGNGGAGGAGATGGGNTG 331
G+ GA GN GG G G GG + G+G ++ GG+G GGG+
Sbjct: 8 GHNTGAHSTSGNI-----NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 332 GIGGDGGNAGMLFSNGGAGGAGGASELADGGAGGAGGNGGWLLSNGG 378
G GG GN+G GG A A A G GG +S
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109



Score = 33.1 bits (75), Expect = 0.005
Identities = 29/97 (29%), Positives = 34/97 (35%)

Query: 230 GAGGLGGAGGFGGSAGGAGGQGGAGGLLSGLVGAGGGHGGTGGYGGGAGGAGGNAGLLAG 289
G G GA G+ G G GG S G + GG G GG +G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 290 TGGSGGTGGYGGGAGGAGGNAGLLFGNGGAGGAGATG 326
G GG G G + A + FG GA G
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 32.0 bits (72), Expect = 0.011
Identities = 32/99 (32%), Positives = 39/99 (39%), Gaps = 6/99 (6%)

Query: 215 GKGGAGGAGGSGGLFGAGGLGGAGGFGGSAGGAGGQ------GGAGGLLSGLVGAGGGHG 268
G+G GA + G G G G G S G GG+G + G+G G+G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 269 GTGGYGGGAGGAGGNAGLLAGTGGSGGTGGYGGGAGGAG 307
G G GG G GGN +A G GAGG
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.0 bits (72), Expect = 0.011
Identities = 28/85 (32%), Positives = 33/85 (38%), Gaps = 2/85 (2%)

Query: 123 NGAPGTGANGGAGGWLIGNGGAGGSGAPGLNGGAGGAAGLIGTGGAGGAGGSSSTVNGGV 182
+G G G N GA GG G+ GGA +G G G S GG
Sbjct: 2 SGGDGRGHNTGAHS--TSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 183 GGTGGAGGWLLGNGGAGGAGGASAI 207
G G GG GG+G G SA+
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAV 84



Score = 32.0 bits (72), Expect = 0.012
Identities = 27/81 (33%), Positives = 32/81 (39%)

Query: 533 GDGGAGGSSTAGSGLDGGTGGAAGLWGTGGTGGAGGSGGTDSISGIAGGDGGAGGAGGWL 592
G G G ++ A S GG GL GG G ++ G G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 593 SGTGGAGGSGGAGGDGGVLGS 613
GG G SGG G GG L +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSA 83



Score = 31.6 bits (71), Expect = 0.017
Identities = 23/78 (29%), Positives = 29/78 (37%)

Query: 701 GTGGAGGDAGLLGGPGGAGGTGGTGGPNIDAGGVLGAPGNSGAGGAGGNAGTLFGSGGVG 760
G G G + G G G G A G + G G +G +G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 761 GDGGSGHGNGGDGGGGGN 778
G+GG +GG G GGN
Sbjct: 63 GNGGGNGNSGGGSGTGGN 80



Score = 31.2 bits (70), Expect = 0.022
Identities = 35/113 (30%), Positives = 38/113 (33%), Gaps = 11/113 (9%)

Query: 597 GAGGSGGAGGDGGVLGSGDGGAGGAGGSGGAGGLLGAGGTGGTGAIGGFSGLLASGDGGA 656
G G G G G+ +GG G G GGA G G SG+ G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 657 GGAGGAGGAGGLLGGLVGAGGGDGGAGGTGGFHGDAVTGTAGAGGTGGAGGDA 709
G GG G GG G GG V A T GAGG A
Sbjct: 63 GNGGGNGN-----------SGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 30.8 bits (69), Expect = 0.025
Identities = 39/120 (32%), Positives = 50/120 (41%), Gaps = 16/120 (13%)

Query: 717 GAGGTGGTGGPNIDAGGVLGAPGNSGAGGAGGNAGTLFGSGGVGGDGGSGHGNGGDGGGG 776
G G G G + +G + G P G GG G + G+ + S GGSG G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGG-GASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 777 GNAGLLFSDAGGGGFGGFGLAGGGGTGGSGGDAGWLGSGG-----AGGAGGISVNGDAGA 831
G G G GG GTGG+ + G GAGG++V+ AGA
Sbjct: 62 H----------GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 30.8 bits (69), Expect = 0.028
Identities = 28/81 (34%), Positives = 38/81 (46%), Gaps = 1/81 (1%)

Query: 649 LASGDGGAGGAGGAGGAGGLLGGLVGAGGGDGGAGGTG-GFHGDAVTGTAGAGGTGGAGG 707
++ GDG G +G + GG G G G G + G+G + G +G+G G G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 708 DAGLLGGPGGAGGTGGTGGPN 728
G GG G +GG GTGG
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNL 81



Score = 30.1 bits (67), Expect = 0.048
Identities = 31/113 (27%), Positives = 38/113 (33%), Gaps = 12/113 (10%)

Query: 760 GGDGGSGHGNGGDGGGGGNAGLLFSDAGGGGFGGFGLAGGGGTGGSGGDAGWLGSGGAGG 819
G SG+ NGG G G G G+ GGSG W G G G
Sbjct: 12 GAHSTSGNINGGPTGLGVGGG-------ASDGSGWSSENNPWGGGSGSGIHWGGGSGHGN 64

Query: 820 AGGISVNGDAGAGGAGGTSGQLLGDGGAGGAGGEAELAAGGAGGSGGVGGDAV 872
GG + +GG GT G L G A G G + + A+
Sbjct: 65 GGG-----NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGAL 112


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_4399  cloacin  38     2e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.2 bits (88), Expect = 2e-04
Identities = 36/106 (33%), Positives = 48/106 (45%), Gaps = 2/106 (1%)

Query: 410 GGTGGLLLGNGGAGGAGGTSGNG--GGTGGAGGAAGSGVLIGNGGNGGIGGAGPTPGGNG 467
GG GL +G G + G+G +S N GG G+G G G GNGG G G G GGN
Sbjct: 22 GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNL 81

Query: 468 IGGTSGLLLGLDGFNAPTSTSPIHTLQQQALNAINAPIQAATGRPL 513
+ + G + P + ++ AL+A A I AA P
Sbjct: 82 SAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMAALKGPF 127



Score = 37.8 bits (87), Expect = 2e-04
Identities = 30/82 (36%), Positives = 34/82 (41%), Gaps = 6/82 (7%)

Query: 164 GSGGAGGAGGSSTSTNGGAGGAGGAGG------WLSGNAGVGGAGGASTVANGGAGGAGG 217
G G GA +S + NGG G G GG W S N GG G+ GG+G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 218 AGGLLGGGGLGGAGGASTSATA 239
G GGG G G S A
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAP 87



Score = 37.8 bits (87), Expect = 2e-04
Identities = 36/110 (32%), Positives = 37/110 (33%), Gaps = 8/110 (7%)

Query: 639 GAGGTGGPAGLGVFVGVPTFGDGGGGGAGGAGGAGGLLSGLVGAGGGDGGAGGQGGIGFG 698
G G G G G G G G GGA G S GGG G GG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 699 GSGGSGGGGGVGGNAGLLGGPGGAGGSGGAAGPALFGIGVDGTAGVGGAG 748
G+GG G G GG G G A P FG T G GG
Sbjct: 63 GNGGGNGNSG--------GGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 37.8 bits (87), Expect = 2e-04
Identities = 28/76 (36%), Positives = 32/76 (42%), Gaps = 1/76 (1%)

Query: 536 GDGGAGGSGGTGLDGG-DGGAAGLMGTGGTGGAGGWGGTDEITGGTGGAGGTGGGGGWLS 594
GDG +G G +GG GL GG GW + GG G+G GGG
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 595 GSGGIGGGGGAGGYGG 610
GG G GG G GG
Sbjct: 64 NGGGNGNSGGGSGTGG 79



Score = 36.6 bits (84), Expect = 4e-04
Identities = 35/110 (31%), Positives = 41/110 (37%), Gaps = 1/110 (0%)

Query: 258 GADGGHGGAGGHGAAGGDGGAGGDGGPLAGAGGSGGTGGSGNAGGGAGGAGGNAGLLFGD 317
G DG G H +G G G GA G N GG G+G + G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 318 GGVGGTGGTSGQGVGGIGGDGGSAGLLFSNGGAGGAGGAGTALISGNGGA 367
G GG G SG G G G A + A GAG +S + GA
Sbjct: 63 GNGGG-NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 36.6 bits (84), Expect = 4e-04
Identities = 38/121 (31%), Positives = 48/121 (39%), Gaps = 5/121 (4%)

Query: 698 GGSGGSGGGGGVGGNAGLLGGPGGAGGSGGAAGPALFGIGVDGTAGVGGAGGNAGLLFGS 757
GG G G + + GGP G G GGA+ G G G G +G+ +G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASD----GSGWSSENNPWGGGSGSGIHWGG 58

Query: 758 GGAGGDGGFGPGAGGTGGRGGNAGLLFSSAGAGGFGGYGTTGGGTGGAGGDAGWLGCGGA 817
G G+GG +GG G GGN ++ A GF T G G AG L A
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGN-LSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIA 117

Query: 818 G 818

Sbjct: 118 D 118



Score = 36.2 bits (83), Expect = 7e-04
Identities = 32/111 (28%), Positives = 38/111 (34%)

Query: 593 LSGSGGIGGGGGAGGYGGAIIPGNGGAGGSAGAGGAGGLFGGGGTGGAGGTGGPAGLGVF 652
+SG G G GA G I G G G GA G G G G G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 653 VGVPTFGDGGGGGAGGAGGAGGLLSGLVGAGGGDGGAGGQGGIGFGGSGGS 703
G+G GG G GG ++ V G G GG+ S G+
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 35.8 bits (82), Expect = 7e-04
Identities = 36/118 (30%), Positives = 43/118 (36%), Gaps = 10/118 (8%)

Query: 564 TGGAGGWGGTDEITGGTGGAGGTGGGGGWLSGSGGIGGGGGAGGYGGAIIPGNGGAGGSA 623
TG G + G G GG G GW S + GGG G+G GG +
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGI----------HWGGGS 60

Query: 624 GAGGAGGLFGGGGTGGAGGTGGPAGLGVFVGVPTFGDGGGGGAGGAGGAGGLLSGLVG 681
G G GG GG G GG V G P G GG + AG L + +
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 34.3 bits (78), Expect = 0.002
Identities = 36/119 (30%), Positives = 42/119 (35%), Gaps = 16/119 (13%)

Query: 615 GNGGAGGSAGAGGAGGLFGGGGTGGAGGTGGPAGLGVFVGVPTFGDGGGGGAGGAGGAGG 674
G G G + GA G GG TG G G G G +G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSG--------- 53

Query: 675 LLSGLVGAGGGDGGAGGQGGIGFGGSGGSGGGGGVGGNAGLLGGPGGAGGSGGAAGPAL 733
+ GGG G G G GG G+GG G P A + GA G A+
Sbjct: 54 -----IHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFP--ALSTPGAGGLAV 105



Score = 33.9 bits (77), Expect = 0.003
Identities = 32/90 (35%), Positives = 37/90 (41%), Gaps = 8/90 (8%)

Query: 223 GGGGLGGAGGA-STSATASGGAGGQGGAAGLLSGFVGADGGHGGAGGHGAAGGDGGAGGD 281
GG G G GA STS +GG G G G A G G + + GG G+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGG-------ASDGSGWSSENNPWGGGSGSGIH 55

Query: 282 GGPLAGAGGSGGTGGSGNAGGGAGGAGGNA 311
G +G G GG G SG G G A
Sbjct: 56 WGGGSGHGNGGGNGNSGGGSGTGGNLSAVA 85



Score = 33.5 bits (76), Expect = 0.004
Identities = 36/114 (31%), Positives = 47/114 (41%), Gaps = 10/114 (8%)

Query: 192 LSGNAGVGGAGGASTVANGGAGGAGGAGGLLGGGGLGGAGGASTSATASGGAGGQGGAAG 251
+SG G G GA + + GG G G GGG G+G +S + GG+G G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGV--GGGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 252 LLSGFVGADGGHGGAGGHGAAGGDGGAGGDGGPLAGAGGSGGTGGSGNAGGGAG 305
GHG GG+G +GG G GG+ +A G S GG
Sbjct: 59 --------GSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.8 bits (74), Expect = 0.006
Identities = 34/105 (32%), Positives = 45/105 (42%), Gaps = 3/105 (2%)

Query: 303 GAGGAGGNAGLLFGDGGVGGTGGTSGQGVGGIGGDGGSAGLLFSNGGAGGAGGAGTALIS 362
G G G N G G + GG +G GVGG DG + G G G S
Sbjct: 3 GGDGRGHNTGAHSTSGNI--NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 363 GNGGAGGNGGNGGWLFSNGGVGGAGGAGAPAMPSLGISAGSGGVG 407
G+G GGNG +GG + G + A P+L + G+GG+
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALS-TPGAGGLA 104



Score = 32.4 bits (73), Expect = 0.009
Identities = 27/74 (36%), Positives = 34/74 (45%), Gaps = 1/74 (1%)

Query: 827 GTGGTGGAGGTGGQLLGA-GGAGGAGGQTDPLGGSTGGSGGVGGNAVLIGTGGNGGNGGA 885
G G GA T G + G G G GG +D G S+ + GG+ I GG G+G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 886 GTAKGTPGTGGTGG 899
G + G GTGG
Sbjct: 66 GGNGNSGGGSGTGG 79



Score = 31.2 bits (70), Expect = 0.024
Identities = 27/81 (33%), Positives = 33/81 (40%), Gaps = 3/81 (3%)

Query: 772 GTGGRGGNAGLLFSSAGAGGFGGYGTTGGGTGGAGGDAGWLGCGGAGGAGGITIGGTGGT 831
G GRG N G + + +G G T G GGA +GW G G + GG
Sbjct: 3 GGDGRGHNTG---AHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 832 GGAGGTGGQLLGAGGAGGAGG 852
G G GG GG+G G
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGN 80



Score = 30.8 bits (69), Expect = 0.024
Identities = 34/108 (31%), Positives = 39/108 (36%), Gaps = 6/108 (5%)

Query: 760 AGGDG-GFGPGAGGTGGR--GGNAGLLFSSAGAGGFGGYGTTGGGTGGAGGDAGWLGCGG 816
+GGDG G GA T G GG GL + G G GG+G W G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 817 AGGAGGITIGGTGGTGGAGGTGGQLLGAGGAGGAGGQTDPLGGSTGGS 864
G GG G G G G + A A G + P G S
Sbjct: 62 HGNGGG---NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVS 106



Score = 30.8 bits (69), Expect = 0.029
Identities = 27/93 (29%), Positives = 36/93 (38%), Gaps = 4/93 (4%)

Query: 272 AGGDGGAGGDGGPLAGAGGSGGTGGSGNAGGGAGGAG-GNAGLLFGDGGVGGTGGTSGQG 330
+GGDG G +GG G G GG + G+G + +G G G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 331 VGGIGGDGGSAGLLFSNGGAGGAGGAGTALISG 363
G GG+G S G +G G + G
Sbjct: 62 HGNGGGNGNSGG---GSGTGGNLSAVAAPVAFG 91



Score = 30.5 bits (68), Expect = 0.035
Identities = 28/97 (28%), Positives = 38/97 (39%), Gaps = 2/97 (2%)

Query: 120 NGANGTPGTGADGAPGGWLLGDGGAGGSGAPGLNGGAGGAAGLLGSGGAGGAGGSSTSTN 179
N + +G P G +G G + GSG N GG +G G G G+
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69

Query: 180 GGAGGAGGAGGWLSGNAGVGGAGGASTVANGGAGGAG 216
GG+G G + A V A G ++ GAGG
Sbjct: 70 NSGGGSGTGGNLSAVAAPV--AFGFPALSTPGAGGLA 104



Score = 30.5 bits (68), Expect = 0.035
Identities = 22/75 (29%), Positives = 27/75 (36%), Gaps = 3/75 (4%)

Query: 516 NGTPGDAGSGTDGTPGGWLLGDGGAGGSGGTGLDGGDGGAAGLMGTGGTGGAGGWGGTDE 575
N +G P G G G S G+G + G G+G G G G
Sbjct: 10 NTGAHSTSGNINGGPTG---LGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGG 66

Query: 576 ITGGTGGAGGTGGGG 590
G +GG GTGG
Sbjct: 67 GNGNSGGGSGTGGNL 81


55  MMAR_4424  MMAR_4455  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
MMAR_4424  -1      12       -3.339577   hypothetical protein
MMAR_4425  0       16       -4.184389   methylase
MMAR_4426  0       15       -3.561150   transcriptional regulatory protein
MMAR_4427  -1      14       -3.770425   dehydrogenase fad flavoprotein GMC
MMAR_4428  0       17       -4.174467   haloalkane dehalogenase
MMAR_4429  0       19       -4.771086   transmembrane transport protein MmpL
MMAR_4430  0       21       -3.637355   cytochrome P450 138B1 Cyp138B1
MMAR_4431  1       17       -1.742044   hypothetical protein
MMAR_4432  -1      16       -2.103039   hypothetical protein
MMAR_4433  -1      17       -2.156194   methylase
MMAR_4434  -1      18       -3.329177   hypothetical protein
MMAR_4435  2       24       -5.007978   hypothetical protein
MMAR_4436  2       23       -4.737394   hypothetical protein
MMAR_4437  2       22       -4.450366   hypothetical protein
MMAR_4438  3       21       -4.576762   hypothetical protein
MMAR_4439  6       24       -4.143897   hypothetical protein
MMAR_4440  6       23       -3.326890   hypothetical protein
MMAR_4441  4       18       -1.526476   hypothetical protein
MMAR_4442  3       21       -1.535187   prophage integrase
MMAR_4443  3       21       -3.154260   site-specific recombinase
MMAR_4444  3       24       -3.533455   hypothetical protein
MMAR_4445  5       23       -5.569279   hypothetical protein
MMAR_4446  3       19       -3.929125   hypothetical protein
MMAR_4447  1       16       -2.230703   hypothetical protein
MMAR_4448  2       15       -2.193972   hypothetical protein
MMAR_4449  2       16       -1.044769   hypothetical protein
MMAR_4450  0       16       -1.574666   hypothetical protein
MMAR_4451  -2      13       0.059361*   PE family protein
MMAR_4452  -1      12       -0.033161   PPE family protein
MMAR_4453  1       13       -0.580319   EsaT-6 like protein EsxP
MMAR_4454  1       12       -0.064249   EsaT-6 like protein EsxN
MMAR_4455  2       13       0.256680    two-component transcriptional regulator TrcR
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4429  ACRIFLAVINRP  46     5e-07    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 46.4 bits (110), Expect = 5e-07
Identities = 45/231 (19%), Positives = 93/231 (40%), Gaps = 23/231 (9%)

Query: 127 TAAGAQSDDGKAVTVQLSLGGNRGESLANESVEAVRKIVAETPA--PPGIKTYVTGPSAL 184
A + ++L+ G N A ++ +A++ +AE P G+K +
Sbjct: 277 VIARINGKPAAGLGIKLATGAN-----ALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTP 331

Query: 185 MVDMQRSGDKSMLKITLTTVTVIFFMLLLVYRSVSTVIALLSMVGVALTASRGIVALIGH 244
V + ++K + ++F ++ L +++ + V V L + I+A G+
Sbjct: 332 FV---QLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGY 388

Query: 245 SGGIGLTTFAVTLLTSLTIAAGTDYGIFVFGRYHEARLIGEDEETAFYTMYRGTTHV--- 301
S LT F + L I D I V R++ ED+ + + +
Sbjct: 389 SINT-LTMFGMVL----AIGLLVDDAIVVVENVE--RVMMEDKLPPKEATEKSMSQIQGA 441

Query: 302 ILGTGLTIAAATLCLLF---ARLPSFQTLAIPCAVGTLVTVAVALTLTPAV 349
++G + ++A + + F + ++ +I ++V VAL LTPA+
Sbjct: 442 LVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPAL 492



Score = 36.4 bits (84), Expect = 7e-04
Identities = 46/212 (21%), Positives = 78/212 (36%), Gaps = 13/212 (6%)

Query: 149 RGESLANESVEAVRKIV--AETPAPPGIKTYVTGPSALMVDMQRSGDKSMLKITLTTVTV 206
+GE+ S ++ + P GI TG S + SG+++ + ++ V
Sbjct: 827 QGEAAPGTSSGDAMALMENLASKLPAGIGYDWTGMS---YQERLSGNQAPALVAIS-FVV 882

Query: 207 IFFMLLLVYRSVSTVIALLSMVGVALTASRGIVALIGHSGGIGLTTFAVTLLTSLTIAAG 266
+F L +Y S S I + M+ V L ++A + + F V LLT++ ++A
Sbjct: 883 VFLCLAALYESWS--IPVSVMLVVPLGIVGVLLAATLFNQKNDV-YFMVGLLTTIGLSAK 939

Query: 267 TDYGIFVFGRYHEARLIGEDEETAFYTMYRGTTHVILGTGLTIAAATLCLLFARLP---S 323
I F + G+ A R IL T L L L + +
Sbjct: 940 NAILIVEFAK-DLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGA 998

Query: 324 FQTLAIPCAVGTLVTVAVALTLTPAVLVVGSR 355
+ I G + +A+ P VV R
Sbjct: 999 QNAVGIGVMGGMVSATLLAIFFVPVFFVVIRR 1030


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
MMAR_4455  HTHFIS  93     1e-23    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 92.6 bits (230), Expect = 1e-23
Identities = 33/131 (25%), Positives = 62/131 (47%)

Query: 27 SPIRVLLVDDEPALTNLVKMALHYEGWDVEIAHNGREAISKFDKISPDVLVLDIMLPDVD 86
+ +L+ DD+ A+ ++ AL G+DV I N D++V D+++PD +
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 87 GLQILQRVRDSDAYTPTLFLTARDSVMDRVTGLTAGADDYMTKPFSLEELVARLRGLLRR 146
+L R++ + P L ++A+++ M + GA DY+ KPF L EL+ + L
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 147 SSHLAPPADES 157
++
Sbjct: 122 PKRRPSKLEDD 132


56  MMAR_4554  MMAR_4561  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_4554  3       12       -2.193058  PPE family protein
MMAR_4555  0       10       -0.386198  hypothetical protein
MMAR_4556  3       10       1.459633   secreted antigen 85-C FbpC_2
MMAR_4557  5       12       2.882806   glucose-6-phosphate isomerase
MMAR_4558  8       13       3.687757   short chain dehydrogenase
MMAR_4559  7       12       3.490230   formamidopyrimidine-DNA glycosylase
MMAR_4560  6       12       3.519985   PE-PGRS family protein
MMAR_4561  3       13       2.886049   PE-PGRS family protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_4554  cloacin  35     0.001    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 34.7 bits (79), Expect = 0.001
Identities = 25/90 (27%), Positives = 34/90 (37%), Gaps = 9/90 (10%)

Query: 214 GGNIGNFNFGSGNRGGNVNFGNGNNGFFNLGGGNIGSNNFGSGNRGNGNIGFGNYQSTGG 273
GG+ N G+ + GN+N G LG G S+ G + N G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTG-----LGVGGGASDGSGWSSENNPWGGGSGSGIHWG 57

Query: 274 ANIGGGNSGSGNKGFGNTGNYNIGSGNFGS 303
G GN G GN+G + GN +
Sbjct: 58 GGSGHGNGGGN----GNSGGGSGTGGNLSA 83



Score = 33.9 bits (77), Expect = 0.002
Identities = 26/86 (30%), Positives = 38/86 (44%)

Query: 189 AGNLGFGNTGIANLGNGNTGNLNFGGGNIGNFNFGSGNRGGNVNFGNGNNGFFNLGGGNI 248
+G G G+ A+ +GN G G G + GSG N +G G+ + GGG+
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 249 GSNNFGSGNRGNGNIGFGNYQSTGGA 274
N G+GN G G+ GN +
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAP 87



Score = 31.6 bits (71), Expect = 0.012
Identities = 22/72 (30%), Positives = 25/72 (34%)

Query: 272 GGANIGGGNSGSGNKGFGNTGNYNIGSGNFGSFNFGDGNRGSNNFGFGNTNSGNVGFGNL 331
GA+ GN G G G G + GSG N G GS G + GN G
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGN 70

Query: 332 GANNVGFGNLGS 343
G G S
Sbjct: 71 SGGGSGTGGNLS 82



Score = 30.8 bits (69), Expect = 0.019
Identities = 22/83 (26%), Positives = 32/83 (38%), Gaps = 5/83 (6%)

Query: 495 SGSDNTGFLNSGSVNTGFLNSGSTNTGAGNSGEVNTGFGIATD--SGATNSG---FGNTG 549
SG D G +G +N G T G G +G+ + G + SG G +G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 550 SGNSGFNNDGNDNSGFQNTGTSS 572
GN G N + SG ++
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAV 84



Score = 30.1 bits (67), Expect = 0.036
Identities = 28/83 (33%), Positives = 35/83 (42%), Gaps = 8/83 (9%)

Query: 203 GNGNTGNLNFGGGNIGNFNFGSGNRGGNVNFGNGNNGFFNLGGGNIGSNNFGSGNRGNGN 262
G G+ + GNI G G GG + G+G + N GG GS G G+GN
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGG-ASDGSGWSSENNPWGGGSGSGIHWGGGSGHGN 64

Query: 263 IGFGNYQSTGGANIGGGNSGSGN 285
G G N GGG+ GN
Sbjct: 65 GG-------GNGNSGGGSGTGGN 80



Score = 29.7 bits (66), Expect = 0.045
Identities = 25/86 (29%), Positives = 32/86 (37%)

Query: 461 GLGNAGSFNMGFGNAGSGNVGYENAGGANVGFGNSGSDNTGFLNSGSVNTGFLNSGSTNT 520
G G+ + GN G G GGA+ G G S +N SGS SG N
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 521 GAGNSGEVNTGFGIATDSGATNSGFG 546
G + +G G + A FG
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 29.7 bits (66), Expect = 0.046
Identities = 24/80 (30%), Positives = 27/80 (33%)

Query: 244 GGGNIGSNNFGSGNRGNGNIGFGNYQSTGGANIGGGNSGSGNKGFGNTGNYNIGSGNFGS 303
GG G N GN N G GGA+ G G S N G +G+ G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 304 FNFGDGNRGSNNFGFGNTNS 323
N G G G S
Sbjct: 63 GNGGGNGNSGGGSGTGGNLS 82



Score = 29.7 bits (66), Expect = 0.047
Identities = 22/81 (27%), Positives = 30/81 (37%)

Query: 283 SGNKGFGNTGNYNIGSGNFGSFNFGDGNRGSNNFGFGNTNSGNVGFGNLGANNVGFGNLG 342
SG G G+ + SGN G G G + G G ++ N G G+ G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 343 SGNVGFGNTGNNNFGIGLSGN 363
GN G G G + +
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLS 82


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4558  DHBDHDRGNASE  64     2e-14    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 64.3 bits (156), Expect = 2e-14
Identities = 53/190 (27%), Positives = 87/190 (45%), Gaps = 9/190 (4%)

Query: 6 ILITGASSGLGAGMARAFAARGRDLALCARRTDRLEELKSELAQ--KHPEITIAIAELDV 63
ITGA+ G+G +AR A++G +A ++LE++ S L +H E DV
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAF----PADV 66

Query: 64 NDHDQVPKVFAELRDELGGIDRVIVNAGIGKGAPLGSGKLWANKATIETNLVAALVQIET 123
D + ++ A + E+G ID ++ AG+ + + S +AT N +
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126

Query: 124 ALEMFHKSGSGHLVLISSVLASKGVPGVK-AAYAASKAGLSSLGESLRAEYDKGPITVSV 182
+ SG +V + S A GVP AAYA+SKA + L E + I ++
Sbjct: 127 VSKYMMDRRSGSIVTVGSNPA--GVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184

Query: 183 MEPGYIESEM 192
+ PG E++M
Sbjct: 185 VSPGSTETDM 194


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_4560  cloacin  46     2e-07    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 45.9 bits (108), Expect = 2e-07
Identities = 35/101 (34%), Positives = 43/101 (42%), Gaps = 4/101 (3%)

Query: 210 GLGGNGGTVGTGQSTNGGAGGDGGSGGSAGLFGGGGAGALGGDGGNGVGSDGSGGGAGSG 269
G G N G T + NGG G G GG++ G G G G G G G+G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 270 GDGGNGGFFYGDGGNGADAGSPGAGQSSFGSLGIAGEGDGG 310
G GN G G GGN + +P A FG ++ G GG
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVA----FGFPALSTPGAGG 102



Score = 37.4 bits (86), Expect = 8e-05
Identities = 31/83 (37%), Positives = 37/83 (44%), Gaps = 3/83 (3%)

Query: 188 GNGGNGGNAGLLQGVAG-NGAAGGLGGNGG-TVGTGQSTNGGAGGDGGSGGSAGLFGGGG 245
G G G N G NG GLG GG + G+G S+ G GGSG GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWG-GGSGSGIHWGGGSG 61

Query: 246 AGALGGDGGNGVGSDGSGGGAGS 268
G GG+G +G GS G +
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAV 84



Score = 35.8 bits (82), Expect = 2e-04
Identities = 35/113 (30%), Positives = 40/113 (35%), Gaps = 16/113 (14%)

Query: 228 AGGDG-GSGGSAGLFGGGGAGALGGDGGNGVGSDGSGGGAGSGGDGGNGGFFYGDGGNGA 286
+GGDG G A G G G G G SDGSG + + GG G GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 287 DAGSPGAGQSSFGSLGIAGEGDGGDGGNAFLIGNGGNGGAAAAFGFPGFGGNG 339
G G+G GG + GN A AFGFP G
Sbjct: 62 HGN---------------GGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPG 99



Score = 33.5 bits (76), Expect = 0.001
Identities = 36/129 (27%), Positives = 43/129 (33%), Gaps = 13/129 (10%)

Query: 125 AAGTGQNGGDGGWLIGSGGRGGSGGVGQKGG-NGGSAGLWGNGGNGGLGGEGVQGGPGHP 183
+ G G+ G GG G+G GG + GS N GG G G+ G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 184 GQAGGNGGNGGNAGLLQGVAGNGAAGGLGGNGGTVGTGQSTNGGAGGDGGSGGSAGLFGG 243
GG GN G G GGN V + A G+GG A
Sbjct: 62 HGNGGGNGNSGG------------GSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109

Query: 244 GGAGALGGD 252
G A D
Sbjct: 110 GALSAAIAD 118


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_4561  cloacin  39     4e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 39.3 bits (91), Expect = 4e-05
Identities = 37/92 (40%), Positives = 41/92 (44%), Gaps = 3/92 (3%)

Query: 356 GGDGAQGGNGGKAGFFYGNGGNGGFGGNGANGGDNSGTSSAVNGAGGMGGWGGAGGQAGL 415
GGDG G + NGG G G G D SG SS N GG G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGG-ASDGSGWSSENNPWGGGSGSGIHWGGGS- 60

Query: 416 IGNGGTGGAGGSGGAGGSGGTENGDAGPGGFG 447
G+G GG G SGG G+GG + A P FG
Sbjct: 61 -GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 38.2 bits (88), Expect = 9e-05
Identities = 42/137 (30%), Positives = 53/137 (38%), Gaps = 11/137 (8%)

Query: 185 LTGGNGGIGGAGGFLYGLGGNGGIGGHGGDGGAAIGTGTDGGNGGNGGLAGAGGLLFGNG 244
++GG+G G NGG G G GGA+ G+G N GG +G+G G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 245 GVGGQGGDGGDATGGTATFSGGLAGSGGNGGNGGQSGWLYGNGGDGGNSGSGGTFESAGG 304
G G GG+G SGG GSG G + + G+GG S
Sbjct: 61 GHGNGGGNGN---------SGG--GSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109

Query: 305 SVLSGAQGGGFAASAGN 321
LS A AA G
Sbjct: 110 GALSAAIADIMAALKGP 126



Score = 37.4 bits (86), Expect = 1e-04
Identities = 30/97 (30%), Positives = 36/97 (37%)

Query: 154 GGNGGSAGLLGNGGAGGAGGAGASGAAGDSGLTGGNGGIGGAGGFLYGLGGNGGIGGHGG 213
G N G+ GN G G GA+ SG + N GG G GG G G GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 214 DGGAAIGTGTDGGNGGNGGLAGAGGLLFGNGGVGGQG 250
+G + G+GT G G G GG
Sbjct: 68 NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 37.4 bits (86), Expect = 2e-04
Identities = 30/86 (34%), Positives = 38/86 (44%), Gaps = 5/86 (5%)

Query: 397 VNGAGGMGGWGGAGGQAGLIGNGGTGGAGGSGGAGGSG-GTENGDAGPGGFGGHGGDALL 455
++G G G GA +G I G TG G G + GSG +EN G GG G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWG----GGSGSGIHW 56

Query: 456 FGNGGNGANGGNTGAPGTLSGGGTGT 481
G G+G GGN + G GG +
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGTGGNLS 82



Score = 37.0 bits (85), Expect = 2e-04
Identities = 33/97 (34%), Positives = 43/97 (44%), Gaps = 3/97 (3%)

Query: 226 GNGGNGGLAGAGGLLFGNGGVGGQGGDGGDATGGTATFSGGLAGSGGNGGNGGQSGWLYG 285
G G N G G + G G GG D +G ++ + GSG GG SG +G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG--HG 63

Query: 286 NGGDGGNSGSGGTFESAGGSVLSGAQGGGFAASAGNG 322
NGG GNSG GG+ S ++ GF A + G
Sbjct: 64 NGGGNGNSG-GGSGTGGNLSAVAAPVAFGFPALSTPG 99



Score = 35.5 bits (81), Expect = 7e-04
Identities = 30/83 (36%), Positives = 37/83 (44%), Gaps = 6/83 (7%)

Query: 269 GSGGNGGNGGQSGWLYGNGGDGGNSGSGGTFESAGGSVLSGAQGGGFAASAGNGGNSGLF 328
G G N G SG + NGG G GG + +G S + GGG + GG SG
Sbjct: 6 GRGHNTGAHSTSGNI--NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG-- 61

Query: 329 GNGGSGGNGGNGGLGQAASGDDS 351
G+GG GN G G G+ S
Sbjct: 62 --HGNGGGNGNSGGGSGTGGNLS 82



Score = 33.1 bits (75), Expect = 0.003
Identities = 34/114 (29%), Positives = 42/114 (36%), Gaps = 3/114 (2%)

Query: 320 GNGGNSGLFGNGGSGGNGGNGGLGQAASGDDSQGGIGGDGAQGGNGGKAGFFYGNGG--N 377
G G N+G G+ NGG GLG D G + GG G + G G N
Sbjct: 6 GRGHNTGAHSTSGNI-NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGN 64

Query: 378 GGFGGNGANGGDNSGTSSAVNGAGGMGGWGGAGGQAGLIGNGGTGGAGGSGGAG 431
GG GN G G SAV G + AG + + GA + A
Sbjct: 65 GGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 30.8 bits (69), Expect = 0.014
Identities = 36/113 (31%), Positives = 49/113 (43%), Gaps = 7/113 (6%)

Query: 379 GFGGNGANGGDNSGTSSAVNGAGGMGGWGGAGGQAGLIGNGGTGGAGGSGGAGGSGGTEN 438
G G G N G +S + + G G+G GGA G+G + GG GSG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASD-----GSGWSSENNPWGGGSGSGIHWG 57

Query: 439 GDAGPGGFGGHGGDALLFGNGGNGANGGNTGAPG--TLSGGGTGTVYLTSNGG 489
G +G G GG+G G GGN + A G LS G G + ++ + G
Sbjct: 58 GGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 30.5 bits (68), Expect = 0.024
Identities = 22/61 (36%), Positives = 31/61 (50%), Gaps = 1/61 (1%)

Query: 131 SGGDGGWLLGNGGNGGSGAAGQAGGNGGSAGL-LGNGGAGGAGGAGASGAAGDSGLTGGN 189
+GG G +G G + GSG + + GG +G + GG G G G +G +G TGGN
Sbjct: 21 NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGN 80

Query: 190 G 190

Sbjct: 81 L 81


57  MMAR_4586  MMAR_4605  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
MMAR_4586  2       20       -3.411524   hypothetical protein
MMAR_4587  3       25       -4.583979   oxidoreductase
MMAR_4588  2       31       -5.315374*  hypothetical protein
MMAR_4589  1       33       -6.735482   hypothetical protein
MMAR_4590  1       30       -6.812818   hypothetical protein
MMAR_4591  3       29       -6.482896   hypothetical protein
MMAR_4592  1       21       -4.481405   transposase
MMAR_4595  1       22       -5.025792   transposase
MMAR_4596  0       19       -3.849638   hypothetical protein
MMAR_4597  6       18       6.041296    hypothetical protein
MMAR_4598  5       18       6.371134    transposase, ISMyma01_aa1
MMAR_4599  6       19       5.827444    transposase, ISMyma01_aa2
MMAR_4600  6       20       5.487039    hypothetical protein
MMAR_4601  5       19       3.994861    hypothetical protein
MMAR_4602  5       21       3.267847    PE-PGRS family protein
MMAR_4603  0       29       -4.400163   hypothetical protein
MMAR_4604  -1      28       -4.473189   transposase
MMAR_4605  -1      22       -3.897958   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_4589  PF07520  32     0.007    Virulence protein SrfB
		>PF07520#Virulence protein SrfB

Length = 1041

Score = 31.9 bits (72), Expect = 0.007
Identities = 19/76 (25%), Positives = 24/76 (31%), Gaps = 2/76 (2%)

Query: 46 MSAALRLVKSSTGNHDGQLPPIATSYRCAAAAWASRTIKHTA--APKTRQRLPHRIHAAV 103
+S AL LVK G DG W + + Q+ RI +
Sbjct: 518 VSGALTLVKEMLGTKDGTSTIAVEGKPELLVDWDEASCTQLVYLYSELTQKFDGRIDTFL 577

Query: 104 DHLATPRRRAPGSPSP 119
D PR G SP
Sbjct: 578 DLKGQPRPDPAGGESP 593


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
MMAR_4602  FLAGELLIN  43     8e-06    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 42.7 bits (100), Expect = 8e-06
Identities = 35/284 (12%), Positives = 50/284 (17%), Gaps = 2/284 (0%)

Query: 755 GSGGAGSNGGNNTGSTGGVGADGGKGGTGGTAGAAGVGVDGGAAGSIGTGGVGGQGGDGG 814
G N G T + + G G G G + + G D
Sbjct: 139 QDNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEATVGDLKSSFKNVTGYDTY 198

Query: 815 SGGTGALGADGTNPTGGGRGGQGGQGGQGGDGGSGIGGVGGGAGGHGGVGGSGGTGGTGG 874
+ G D +G + G
Sbjct: 199 AVGANKYRVDVN--SGAVVTDTTAPTVPDKVYVNAANGQLTTDDAENNTAVDLFKTTKST 256

Query: 875 SNTASTGGVGADGGKGGTGGTAGAAGVGVDGGAAGSIGTGGVGGTGGEGGSGGTGVAGAS 934
+ TA + G G T GV G T G VA +
Sbjct: 257 AGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTTINGEKVTLTVADIT 316

Query: 935 GITPAVGGPGGQGGAGGTGGDGGAGIDGVGGGAGGQGGQGGAGGNGGMGGTYTGTGPGSS 994
V Q N + G T G+
Sbjct: 317 AGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESKITVNGAE 376

Query: 995 ASGGTGGAGGAGGDGGAPGAGSTTGGVGGNGGKGGNGGFGAGNS 1038
+ G + +G N
Sbjct: 377 YTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANP 420



Score = 42.7 bits (100), Expect = 9e-06
Identities = 28/272 (10%), Positives = 46/272 (16%), Gaps = 3/272 (1%)

Query: 644 GSGGAGSNGGNNTGSTGGVGADGGKGGTGGTAGAAGVGVDGGAAGSIGTGGVGGTGGNGG 703
G N G T + + + G G V+G ++G G
Sbjct: 139 QDNQMKIQVGANDGETITIDL---QKIDVKSLGLDGFNVNGPKEATVGDLKSSFKNVTGY 195

Query: 704 AGADNDVNGTSGTAPTAGGQGGQGGQGGQGGDGGSGIGGVGGGAGGHGGVGGSGGAGSNG 763
N + + G +
Sbjct: 196 DTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAENNTAVDLFKTTKS 255

Query: 764 GNNTGSTGGVGADGGKGGTGGTAGAAGVGVDGGAAGSIGTGGVGGQGGDGGSGGTGALGA 823
T + G G T GV G +G
Sbjct: 256 TAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTTINGEKVTLTVADI 315

Query: 824 DGTNPTGGGRGGQGGQGGQGGDGGSGIGGVGGGAGGHGGVGGSGGTGGTGGSNTASTGGV 883
Q + + G + + G
Sbjct: 316 TAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESKITVNGA 375

Query: 884 GADGGKGGTGGTAGAAGVGVDGGAAGSIGTGG 915
G T + +D A+G
Sbjct: 376 EYTANAAGDKVTLAGKTMFIDKTASGVSTLIN 407



Score = 31.2 bits (70), Expect = 0.034
Identities = 32/243 (13%), Positives = 42/243 (17%), Gaps = 3/243 (1%)

Query: 939 AVGGPGGQGGAGGTGGDGGAGIDGVGGGAGGQGGQGGAGGNGGMGGTYTGTGPGSSASGG 998
G G + G D GA +G T +
Sbjct: 174 VNGPKEATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAAN 233

Query: 999 TGGAGGAGGDGGAPGAGSTTGGVGGNGGKGGNGGFGAGNSSGSS---AGFGFGGGGGGGG 1055
+ A TT G G G G + G F G
Sbjct: 234 GQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGN 293

Query: 1056 AGGHGGNGGGGGASGGQGGAGGDGGLGGTAVSPTTGSAAGGGFGGGGGGGGNATGAAGGT 1115
G + G A G + S G +
Sbjct: 294 DGNGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESA 353

Query: 1116 GGGGGGGGAGAAAAVAAAGGGGGGGGAAGGPGGTAGNGGVGGNAFGGGSSGTGGTAGAGA 1175
G A G T + + G S A A
Sbjct: 354 KLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAA 413

Query: 1176 PGG 1178

Sbjct: 414 KKS 416


58  MMAR_4616  MMAR_4621  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_4616  13      36       -5.234024  hypothetical protein
MMAR_4617  10      31       -5.148220  hypothetical protein
MMAR_4618  10      30       -5.259254  hypothetical protein
MMAR_4619  10      31       -5.109267  hypothetical protein
MMAR_4620  9       31       -4.954511  hypothetical protein
MMAR_4621  8       29       -4.165592  PPE family protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4618  ENTEROTOXINB  31     0.002    Heat labile enterotoxin B chain signature.
		>ENTEROTOXINB#Heat labile enterotoxin B chain signature.

Length = 124

Score = 31.2 bits (70), Expect = 0.002
Identities = 13/34 (38%), Positives = 20/34 (58%)

Query: 157 SQLVSAQAKAIERVRDVLIVERIALIGTFALCLW 190
SQ + +Q KAIER++D L + + LC+W
Sbjct: 76 SQHIDSQKKAIERMKDTLRIAYLTEAKVEKLCVW 109


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_4621  cloacin  40     2e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 39.7 bits (92), Expect = 2e-04
Identities = 29/82 (35%), Positives = 34/82 (41%), Gaps = 6/82 (7%)

Query: 683 GGNVGGGNVGVANVGDGNVGGANVGGANVGGANTGDG----NWGWGNTGGGNIGWGNTGV 738
GG+ G N G + GN+ G G GGA+ G G N WG G I WG
Sbjct: 3 GGDGRGHNTGAHSTS-GNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 739 GNFGFGNQGSGNIGIGLSGDHQ 760
G GN SG G G G+
Sbjct: 62 HGNGGGNGNSGG-GSGTGGNLS 82



Score = 38.5 bits (89), Expect = 4e-04
Identities = 28/81 (34%), Positives = 33/81 (40%), Gaps = 5/81 (6%)

Query: 1621 GNVGDGNVGVANVGDGNVGGANVGGANVGGANTGDG----NWGWGNTGGGNIGWGNTGVG 1676
G G G+ A+ GN+ G G GGA+ G G N WG G I WG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1677 NFGFGNQGSGNIGIGLSGDHQ 1697
G GN SG G G G+
Sbjct: 63 GNGGGNGNSGG-GSGTGGNLS 82



Score = 38.5 bits (89), Expect = 4e-04
Identities = 28/81 (34%), Positives = 33/81 (40%), Gaps = 5/81 (6%)

Query: 2552 GNVGDGNVGVANVGDGNVGGANVGGANVGGANTGDG----NWGWGNTGGGNIGWGNTGVG 2607
G G G+ A+ GN+ G G GGA+ G G N WG G I WG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 2608 NFGFGNQGSGNIGIGLSGDHQ 2628
G GN SG G G G+
Sbjct: 63 GNGGGNGNSGG-GSGTGGNLS 82



Score = 35.5 bits (81), Expect = 0.004
Identities = 32/109 (29%), Positives = 44/109 (40%), Gaps = 13/109 (11%)

Query: 663 SANVGDANLGSANVGSANVGGGNVGGGNVGVANVGDGNVGGANV-GGANVGGANTGDGNW 721
S G + A+ S N+ GG G G G A+ G G N GG + G + G G+
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 722 GWGNTGGGNIGWGNTGVGN---------FGF---GNQGSGNIGIGLSGD 758
G GN G G+ GN FGF G+G + + +S
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 33.1 bits (75), Expect = 0.019
Identities = 34/109 (31%), Positives = 39/109 (35%), Gaps = 4/109 (3%)

Query: 239 GSGNWGGFNLGSGNIGSYNFGPGNLGSYNIGFGNAGDYNVGFGNSGLGNIGFGNSGSNNL 298
G G+ G + SGNI G G G + G G + + N G SG G G SG N
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 299 GIGLTGSGQVGFGGWNSGSGNVGLFNSGVGNVGLFNSGTGNWGVGNSGE 347
G G G GG S F G L G G V S
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAF----GFPALSTPGAGGLAVSISAG 110



Score = 33.1 bits (75), Expect = 0.019
Identities = 34/109 (31%), Positives = 39/109 (35%), Gaps = 4/109 (3%)

Query: 1183 GSGNWGGFNLGSGNIGSYNFGPGNLGSYNIGFGNAGDYNVGFGNSGLGNIGFGNSGSNNL 1242
G G+ G + SGNI G G G + G G + + N G SG G G SG N
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 1243 GIGLTGSGQVGFGGWNSGSGNVGLFNSGVGNVGLFNSGTGNWGVGNSGE 1291
G G G GG S F G L G G V S
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAF----GFPALSTPGAGGLAVSISAG 110



Score = 33.1 bits (75), Expect = 0.019
Identities = 34/109 (31%), Positives = 39/109 (35%), Gaps = 4/109 (3%)

Query: 2111 GSGNWGGFNLGSGNIGSYNFGPGNLGSYNIGFGNAGDYNVGFGNSGLGNIGFGNSGSNNL 2170
G G+ G + SGNI G G G + G G + + N G SG G G SG N
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 2171 GIGLTGSGQVGFGGWNSGSGNVGLFNSGDGNVGLFNSGTGNWGVGNSGE 2219
G G G GG S F G L G G V S
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAF----GFPALSTPGAGGLAVSISAG 110



Score = 32.4 bits (73), Expect = 0.032
Identities = 25/81 (30%), Positives = 31/81 (38%), Gaps = 2/81 (2%)

Query: 226 GNVGFGNVGGLNFGSGNWGGFNLGSGNIGSYNFGPGNLGSYNIGFGNAGDYNVG-FGNSG 284
G G G+ G + SGN G G G G + G G S N +G + G SG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSG-WSSENNPWGGGSGSGIHWGGGSG 61

Query: 285 LGNIGFGNSGSNNLGIGLTGS 305
GN G + G G S
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLS 82



Score = 32.4 bits (73), Expect = 0.032
Identities = 25/81 (30%), Positives = 31/81 (38%), Gaps = 2/81 (2%)

Query: 1170 GNVGFGNVGGLNFGSGNWGGFNLGSGNIGSYNFGPGNLGSYNIGFGNAGDYNVG-FGNSG 1228
G G G+ G + SGN G G G G + G G S N +G + G SG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSG-WSSENNPWGGGSGSGIHWGGGSG 61

Query: 1229 LGNIGFGNSGSNNLGIGLTGS 1249
GN G + G G S
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLS 82



Score = 32.4 bits (73), Expect = 0.032
Identities = 25/81 (30%), Positives = 31/81 (38%), Gaps = 2/81 (2%)

Query: 2098 GNVGFGNVGGLNFGSGNWGGFNLGSGNIGSYNFGPGNLGSYNIGFGNAGDYNVG-FGNSG 2156
G G G+ G + SGN G G G G + G G S N +G + G SG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSG-WSSENNPWGGGSGSGIHWGGGSG 61

Query: 2157 LGNIGFGNSGSNNLGIGLTGS 2177
GN G + G G S
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLS 82



Score = 32.4 bits (73), Expect = 0.036
Identities = 26/82 (31%), Positives = 33/82 (40%), Gaps = 2/82 (2%)

Query: 719 GNWGWGNTGGGNIGWGNTGVGNFGFGNQGSGNIGIGLSGDHQVGFGGWNSGSGNVGLFNS 778
G G G+ G + GN G G G G + G G S ++ GG SG G S
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG--GS 60

Query: 779 GDGNIGFFNSGSGNFGIANSGS 800
G GN G + G G + S
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLS 82



Score = 32.4 bits (73), Expect = 0.036
Identities = 26/82 (31%), Positives = 33/82 (40%), Gaps = 2/82 (2%)

Query: 1656 GNWGWGNTGGGNIGWGNTGVGNFGFGNQGSGNIGIGLSGDHQVGFGGWNSGSGNVGLFNS 1715
G G G+ G + GN G G G G + G G S ++ GG SG G S
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG--GS 60

Query: 1716 GDGNIGFFNSGSGNFGIANSGS 1737
G GN G + G G + S
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLS 82



Score = 32.4 bits (73), Expect = 0.036
Identities = 26/82 (31%), Positives = 33/82 (40%), Gaps = 2/82 (2%)

Query: 2587 GNWGWGNTGGGNIGWGNTGVGNFGFGNQGSGNIGIGLSGDHQVGFGGWNSGSGNVGLFNS 2646
G G G+ G + GN G G G G + G G S ++ GG SG G S
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG--GS 60

Query: 2647 GDGNIGFFNSGSGNFGIANSGS 2668
G GN G + G G + S
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLS 82



Score = 32.0 bits (72), Expect = 0.046
Identities = 29/109 (26%), Positives = 39/109 (35%), Gaps = 13/109 (11%)

Query: 1600 SANVGDANLGSANVGSANVGGGNVGDGNVGVANVGDG-------NVGGANVG---GANVG 1649
S G + A+ S N+ GG G G G A+ G G GG+ G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1650 GANTGDGNWGWGNTGGGNIGWGNTGVGNFGF---GNQGSGNIGIGLSGD 1695
N G G +G G FGF G+G + + +S
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 32.0 bits (72), Expect = 0.046
Identities = 29/109 (26%), Positives = 39/109 (35%), Gaps = 13/109 (11%)

Query: 2531 SANVGDANLGSANVGSANVGGGNVGDGNVGVANVGDG-------NVGGANVG---GANVG 2580
S G + A+ S N+ GG G G G A+ G G GG+ G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 2581 GANTGDGNWGWGNTGGGNIGWGNTGVGNFGF---GNQGSGNIGIGLSGD 2626
N G G +G G FGF G+G + + +S
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110


59  MMAR_4660  MMAR_4665  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_4660  2       10       2.183778   acyl-CoA dehydrogenase FadE10
MMAR_4661  1       12       4.354069   cold shock-like protein B CspB
MMAR_4662  0       13       3.934841   hypothetical protein
MMAR_4663  -1      9        4.176917   molybdenum cofactor biosynthesis protein A
MMAR_4664  -1      13       3.895237   molybdenum cofactor biosynthesis protein D 2
MMAR_4665  -1      12       4.215960   resuscitation-promoting factor RpfA
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4665  PF03544       36     2e-04    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 35.7 bits (82), Expect = 2e-04
Identities = 27/124 (21%), Positives = 31/124 (25%), Gaps = 3/124 (2%)

Query: 216 PPAPEDVAAPAPADLPPAPEDVAPPVELVGNDVPAPVDLPPAPEDLPPAPEDLAPPAPAD 275
P P V APADL P PP +V + P E + P P
Sbjct: 46 PAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKP 105

Query: 276 LPPAPEDLAPPAPADLPPAPEDLAPPAPADLPPAPEDLAPPAPADLPPAPEDLPPAPEDL 335
P D+ P A P P P A P P
Sbjct: 106 KPVKKV---EQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRALSR 162

Query: 336 PPPA 339
P
Sbjct: 163 NQPQ 166



Score = 33.0 bits (75), Expect = 0.002
Identities = 21/87 (24%), Positives = 29/87 (33%)

Query: 262 PPAPEDLAPPAPADLPPAPEDLAPPAPADLPPAPEDLAPPAPADLPPAPEDLAPPAPADL 321
P P + APADL P PP P P + P P + P E P
Sbjct: 46 PAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKP 105

Query: 322 PPAPEDLPPAPEDLPPPADADNPPVDV 348
P + P + P + +P +
Sbjct: 106 KPVKKVEQPKRDVKPVESRPASPFENT 132



Score = 31.5 bits (71), Expect = 0.004
Identities = 26/131 (19%), Positives = 31/131 (23%), Gaps = 3/131 (2%)

Query: 172 DVPAPADLPPAPEDLPPAPEDLAPPAPADLPPAPEDLPPAPQDLPPAPEDVAAPAPADLP 231
V AP DL PP PP P P + P P AP +
Sbjct: 39 QVIELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPP---KEAPVVIE 95

Query: 232 PAPEDVAPPVELVGNDVPAPVDLPPAPEDLPPAPEDLAPPAPADLPPAPEDLAPPAPADL 291
P + V D+ P E+ AP P P
Sbjct: 96 KPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVAS 155

Query: 292 PPAPEDLAPPA 302
P P
Sbjct: 156 GPRALSRNQPQ 166



Score = 31.1 bits (70), Expect = 0.006
Identities = 18/90 (20%), Positives = 23/90 (25%)

Query: 253 DLPPAPEDLPPAPEDLAPPAPADLPPAPEDLAPPAPADLPPAPEDLAPPAPADLPPAPED 312
AP DL P PP P P + P P + P E P P +
Sbjct: 52 VTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKV 111

Query: 313 LAPPAPADLPPAPEDLPPAPEDLPPPADAD 342
P + P P +
Sbjct: 112 EQPKRDVKPVESRPASPFENTAPARPTSST 141



Score = 29.6 bits (66), Expect = 0.016
Identities = 21/117 (17%), Positives = 30/117 (25%), Gaps = 3/117 (2%)

Query: 125 DVPAPAALDAPLDAPGINGEPAPLAPPPGDDVPPPADPAPPVELAANDVPAPADLPPAPE 184
+ APA L+ P + P P+ P + P P P + P P P +
Sbjct: 53 TMVAPADLEPPQA---VQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVK 109

Query: 185 DLPPAPEDLAPPAPADLPPAPEDLPPAPQDLPPAPEDVAAPAPADLPPAPEDVAPPV 241
+ D+ P P P P P P
Sbjct: 110 KVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRNQPQ 166



Score = 29.6 bits (66), Expect = 0.019
Identities = 21/112 (18%), Positives = 27/112 (24%), Gaps = 1/112 (0%)

Query: 130 AALDAPLDAPGINGEPAPLAPPPGDDVPPPADPAPPVELAANDVPAPADLPPAPEDLPPA 189
L AP + PP PPP P P + P E P
Sbjct: 41 IELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPK 100

Query: 190 PEDLAPPAPADLPPAPEDLPPAPQDLPPAPEDVAAPAPADLPPAPEDVAPPV 241
P+ P + D+ P E+ A P P
Sbjct: 101 PKP-KPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVT 151


60  MMAR_4682  MMAR_4693  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_4682  2       10       -0.160489  hypothetical protein
MMAR_4683  1       10       -0.459095  pyruvate or indole-3-pyruvate decarboxylase Pdc
MMAR_4684  3       12       -1.287439  hypothetical protein
MMAR_4685  3       11       -1.393295  fatty-acid-CoA ligase FadD16
MMAR_4686  3       11       -1.306263  hypothetical protein
MMAR_4687  2       10       -1.494667  hypothetical protein
MMAR_4688  1       11       -1.998541  long-chain-fatty-acid--CoA ligase
MMAR_4689  0       9        -2.047294  hypothetical protein
MMAR_4690  -1      10       -2.166830  lipoprotein
MMAR_4691  -1      12       -3.000241  enoyl-CoA hydratase, EchA8_2
MMAR_4692  0       12       -3.481905  hypothetical protein
MMAR_4693  -1      12       -3.205238  transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4686  PF04335       33     5e-04    VirB8 type IV secretion protein
		>PF04335#VirB8 type IV secretion protein

Length = 227

Score = 32.9 bits (75), Expect = 5e-04
Identities = 22/160 (13%), Positives = 45/160 (28%), Gaps = 19/160 (11%)

Query: 23 ERRRAWLHKLTPGKVVVAVVGLLVAAALGLLAALFVFSYRPDRE------VDAQAAGAAV 76
++ AW+ + +A G++ AAL L + + DR A +
Sbjct: 31 SKKLAWV--VAGVAGALATAGVVAVAALTPLKTVEPYVITVDRNTGEASIAAKLHGDATI 88

Query: 77 SA--------ASDGAIAILSYSPDTLDRDFSSARSHLTGEFLSYYDQF--TQQIVAPAAK 126
+ + + + F + + +F T +P
Sbjct: 89 TYDEAVRKYFLATYVRYREGWIAAAREEYFDAVMVMSARPEQDRWSRFYKTDNPQSPQNI 148

Query: 127 RKSVRTSAVVLRAAISELHPDSAVVLLFVNQTTQSADRPE 166
+ V ++ +S L + A V T S
Sbjct: 149 LANRTDVFVEIK-RVSFLGGNVAQVYFTKESVTGSNSTKT 187


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4687  IGASERPTASE   30     0.018    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 29.6 bits (66), Expect = 0.018
Identities = 27/153 (17%), Positives = 50/153 (32%), Gaps = 13/153 (8%)

Query: 20 AADSRSAPVDEDEIGGTAAEMRALAEEAEAEAAEAEALAAAAGARARALQLRRRAERAEA 79
A AP E T AE + E E E +A A R A + + +
Sbjct: 1023 APVPPPAPATPSETTETVAE-NSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQ 1081

Query: 80 AAEVD-----------AACEATHEGAADGQATASHNGAEEPSANSADVLESDDVAETEEL 128
EV + T + +A +E ++ V + +ET +
Sbjct: 1082 TNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQP 1141

Query: 129 SADDD-EPAAAGSAEADESEADELADESERSRR 160
A+ E + + +S+ + AD + ++
Sbjct: 1142 QAEPARENDPTVNIKEPQSQTNTTADTEQPAKE 1174


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4693  HTHTETR       53     8e-11    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 52.7 bits (126), Expect = 8e-11
Identities = 25/125 (20%), Positives = 48/125 (38%), Gaps = 4/125 (3%)

Query: 4 ARRIGAPDAKNRGVLLDTAEELMIEEGYAAVTSRRVASEAGLKPQLVHYYFRTMEELFLE 63
AR+ + R +LD A L ++G ++ + +A AG+ ++++F+ +LF E
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 64 LFRRRAEEGLRAQAQALQSPKPLWALWRLGSDPAFARISMEFMALANHRKALRAEIAHYA 123
++ + Q+ P L L +E R+ L I H
Sbjct: 62 IWELSESN-IGELELEYQAKFPGDPLSVLR---EILIHVLESTVTEERRRLLMEIIFHKC 117

Query: 124 ERFRD 128
E +
Sbjct: 118 EFVGE 122


61  MMAR_4817  MMAR_4842  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_4817  1       14       4.286484   hypothetical protein
MMAR_4818  1       16       4.289516   hypothetical protein
MMAR_4819  0       12       3.614848   hypothetical protein
MMAR_4820  1       11       3.814816   hypothetical protein
MMAR_4821  0       19       2.711961   hypothetical protein
MMAR_4822  0       17       2.007420   PE-PGRS family protein
MMAR_4823  2       17       -0.268965  cold shock protein a, CspA
MMAR_4824  1       14       -0.167365  ATP-dependent RNA helicase, RhlE1
MMAR_5579  2       14       -0.595799  hypothetical protein
MMAR_4825  1       13       -0.862899  B12-dependent methionine synthase
MMAR_4826  -1      25       -3.175980  hypothetical protein
MMAR_4827  -1      26       -3.233663  hypothetical protein
MMAR_4828  -1      22       -3.126728  hypothetical protein
MMAR_4829  0       19       -3.645797  hypothetical protein
MMAR_4830  1       18       -3.270249  hypothetical protein
MMAR_4831  -1      17       -2.767990  hypothetical protein
MMAR_4832  -2      19       -2.383300  TetR family transcriptional regulator
MMAR_4833  -2      18       -2.469247  cytochrome P450 123B1 Cyp123B1
MMAR_4834  -3      19       -2.300349  TetR family transcriptional regulator
MMAR_4835  -2      19       -3.105408  hypothetical protein
MMAR_4836  -2      21       -3.511505  MmpL family transport protein
MMAR_4837  3       29       -4.653520  hydrolase
MMAR_4838  3       31       -5.419394  hypothetical protein
MMAR_4839  3       28       -5.191563  TetR family transcriptional regulator
MMAR_4840  2       22       -4.170846  putative regulatory protein
MMAR_4841  2       18       -3.131446  hypothetical protein
MMAR_4842  2       19       -2.177671  regulatory protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4818  DHBDHDRGNASE  59     4e-12    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 58.9 bits (142), Expect = 4e-12
Identities = 50/208 (24%), Positives = 88/208 (42%), Gaps = 16/208 (7%)

Query: 8 LQGKVAVVTGGAGGIGRALGKRFGMEGMKVVLADVLAEPLDRATRALTDEGIEAVGVVTD 67
++GK+A +TG A GIG A+ + +G + D E L++ +L E A D
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 68 VTDYSSVEALAKETLHRFGAVHVVCNNAGTGGVSEGYMWEHDLADWHWGIDVNVVGVIHG 127
V D ++++ + G + ++ N AG + G + +W VN GV +
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGV--LRPGLIHSLSDEEWEATFSVNSTGVFNA 123

Query: 128 IKAFVPILLERGEGHVVNTCSGNGGFAPIARGAMGGP--AMAVYPMTKAAVLCLTESLYT 185
++ +++R G +V + G P +MA Y +KAA + T+ L
Sbjct: 124 SRSVSKYMMDRRSGSIVT----------VGSNPAGVPRTSMAAYASSKAAAVMFTKCL-- 171

Query: 186 HLEMTGTRVRAHALFPGGFLNTGIWESW 213
LE+ +R + + PG W W
Sbjct: 172 GLELAEYNIRCNIVSPGSTETDMQWSLW 199


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4822  cloacin       37     2e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 37.0 bits (85), Expect = 2e-04
Identities = 37/114 (32%), Positives = 43/114 (37%), Gaps = 7/114 (6%)

Query: 220 GDGGPGSPGAASFDPTVAGGAGGPGGDARGIGDGGRGGDGGPGATGAPGGRGSDGGPGGK 279
GDG + GA S + GG G G GG G + P G GS G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVG------GGASDGSGWSSENNPWGGGSGSGIH-W 56

Query: 280 GGNAGDYGNGGTGGTGGIGGAGGPGSPGGTPGAQGFRAGDAGNGGVGGIGGDGG 333
GG +G GG G +GG G GG S P A GF A G + G
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 36.6 bits (84), Expect = 3e-04
Identities = 31/88 (35%), Positives = 38/88 (43%), Gaps = 4/88 (4%)

Query: 275 GPGGKGGNAGDYGNGGTGGTGGIGGAGGPGSPGGTPGAQGFRAGDAGNGGVGGIGGDGGH 334
G G+G N G + G GG G G GG G+ + + GG G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNIN----GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 335 IKGHGGAGGIGGQGGAGGIGGDGQAGSA 362
GHG GG G GG G GG+ A +A
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 33.1 bits (75), Expect = 0.003
Identities = 22/68 (32%), Positives = 28/68 (41%)

Query: 449 NGGQGGLSGDGATRAAHGVQGTMGDGGDGGDGGNGSTTSDQPDIDGGNGGYGGWGFNGGN 508
+GG G GA + + G G GG +GS S + + GG G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 509 GGNGGDGG 516
GNGG G
Sbjct: 62 HGNGGGNG 69



Score = 33.1 bits (75), Expect = 0.003
Identities = 26/84 (30%), Positives = 31/84 (36%), Gaps = 2/84 (2%)

Query: 475 GDGGDGGNGSTTSDQPDIDGGNGGYGGWGFNGGNGGNGGDGGTRPKGTVFFRSGGDGGFG 534
G G G N S +I+GG G G G G + G+G P G GG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGG--GASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 535 GWGGNGYGGVGGDGGNGGWGGNGV 558
G G G G G G G + V
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAV 84



Score = 32.8 bits (74), Expect = 0.004
Identities = 23/81 (28%), Positives = 28/81 (34%)

Query: 372 SGGDGARGGNGGDGGAGGAGGQALAEGFHDGAAGTGGVGGNGGDGGNGADGGDGHSGDPS 431
SGGDG G +G G G GA+ G G G+ G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 432 WRSGGDGGNGGNGAYGGNGGQ 452
+GG GN G G+ G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLS 82



Score = 31.2 bits (70), Expect = 0.013
Identities = 23/80 (28%), Positives = 28/80 (35%), Gaps = 6/80 (7%)

Query: 354 GGDGQAGSAGEFPGDRGGSGGDGARGGNGGDGGAGGAG------GQALAEGFHDGAAGTG 407
GGDG+ + G +GG G GG G G G H G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 408 GVGGNGGDGGNGADGGDGHS 427
G GG G+ G G+ G S
Sbjct: 63 GNGGGNGNSGGGSGTGGNLS 82



Score = 30.5 bits (68), Expect = 0.021
Identities = 33/90 (36%), Positives = 37/90 (41%), Gaps = 8/90 (8%)

Query: 322 NGGVGGIGGDGGHIKGHGGAGGIGGQGGAGGIGGDGQAGSAGEFPGDRGGSGGDGARGGN 381
NGG G+G GG G G + GG G G GS G GG+G GG
Sbjct: 21 NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH------GNGGGNGNSGGG 74

Query: 382 GGDGGAGGAGGQALAEGFHDGAAGTGGVGG 411
G GG A +A GF A T G GG
Sbjct: 75 SGTGGNLSAVAAPVAFGF--PALSTPGAGG 102



Score = 30.5 bits (68), Expect = 0.024
Identities = 24/79 (30%), Positives = 31/79 (39%), Gaps = 2/79 (2%)

Query: 543 GVGGDGGNGGWGGNGVNRYGVLNPGWGGDGGNGGAGGTAYTS--GGVYAGGPVQPGTEGT 600
G G G N G N G G G + G+G ++ + GG G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 601 GPSGDHGGFGGSGGIGGHG 619
G G +G GG G GG+
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4832  HTHTETR       69     7e-17    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 69.3 bits (169), Expect = 7e-17
Identities = 29/189 (15%), Positives = 66/189 (34%), Gaps = 7/189 (3%)

Query: 12 LPAAAELFAERGLNDTKIEDVAATTGIAKATLYYYFAGKEEILAFLLEDVLQHVAD-EVT 70
L A LF+++G++ T + ++A G+ + +Y++F K ++ + + E ++ + E+
Sbjct: 17 LDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELE 76

Query: 71 AIVEADGTAAQRLHTVINAQLRVMAQRPAVCRALI---GELGRAARMPAIADMITTAYFE 127
+ G L ++ L + + M + E
Sbjct: 77 YQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCLE 136

Query: 128 PVET---LLRAGAADGSLVALDKPRAAAIALFGAVTISALTYLITDDALNEELIARTIHD 184
+ L+ L A R AAI + G ++ +L + + + AR
Sbjct: 137 SYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKKEARDYVA 196

Query: 185 VAFIGLRPR 193
+
Sbjct: 197 ILLEMYLLC 205


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4834  HTHTETR       63     3e-14    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 62.7 bits (152), Expect = 3e-14
Identities = 26/155 (16%), Positives = 48/155 (30%), Gaps = 12/155 (7%)

Query: 18 PRRLRSRTRLLDAATKLLSAGGIEAVTIDAVTKASKVARTTLYRHFSSSTQLLAATFERL 77
+R +LD A +L S G+ + ++ + KA+ V R +Y HF + L + +E
Sbjct: 7 QEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELS 66

Query: 78 LPQVHPPPAT------GSMRDQLIELLSRQATLFQEAPLHVTTLAWVALGPTPDGTQETQ 131
+ G L E+L L + + E
Sbjct: 67 ESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEER--RRLLMEIIFHKCEFVGEMA 124

Query: 132 DRHALRARIIDQYRQPFVALL----QSPEARADLD 162
+ + + L ++ ADL
Sbjct: 125 VVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLM 159


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4836  ACRIFLAVINRP  58     2e-10    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 57.9 bits (140), Expect = 2e-10
Identities = 34/229 (14%), Positives = 85/229 (37%), Gaps = 23/229 (10%)

Query: 212 IASAEEDLVVISIATAGLIAMILLVVYRSVFTALLPLLVIGVSLAVGRGVLSALGESGMP 271
+ + ++V L+ +++ + +++ L+P + + V L +L+A G S
Sbjct: 333 VQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYS--- 389

Query: 272 VSQFTIAFMTVILLGAGTDYSVFLISRYHEQRR-QNVPPDLSVINATATIGRVILASAAT 330
++ T+ M V+ +G D ++ ++ +PP + + + I ++ A
Sbjct: 390 INTLTMFGM-VLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMV 448

Query: 331 VAFAFLAMVFAKLS---VFAALGPACAIAVFVGFAATVTLFPPVLALAAKRGIGEPKADR 387
++ F+ M F S ++ A+ + + L P + A K E ++
Sbjct: 449 LSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENK 508

Query: 388 TRRYWNWIAV--------------AVVRRPVPLLVASLALVLGLAAVAL 422
++ W ++ L+ +V G+ + L
Sbjct: 509 -GGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLLIYALIVAGMVVLFL 556



Score = 39.4 bits (92), Expect = 8e-05
Identities = 27/174 (15%), Positives = 61/174 (35%), Gaps = 11/174 (6%)

Query: 210 DQIASAEEDLVVISIATAGLIAMILLVVYRSVFTALLPLLVIGVSLAVGRGVLSALGESG 269
Q + + + ++ + L +Y S + +LV+ + + GVL A
Sbjct: 863 YQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIV---GVLLAATLFN 919

Query: 270 MPVSQFT-IAFMTVILLGAGTDYSVFLISRYHE-QRRQNVPPDLSVINATATIGRVILAS 327
+ + +T +G ++ ++ + ++ + + A R IL +
Sbjct: 920 QKNDVYFMVGLLT--TIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMT 977

Query: 328 AATVAFAFLAMVFAKLSVFAALGPACAIAVFVG-FAATVT--LFPPVLALAAKR 378
+ L + + + A I V G +AT+ F PV + +R
Sbjct: 978 SLAFILGVLPLAIS-NGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRR 1030



Score = 34.8 bits (80), Expect = 0.002
Identities = 31/183 (16%), Positives = 65/183 (35%), Gaps = 27/183 (14%)

Query: 841 IQRLLSADFHQLAFATLVIVGLILVVL--LRA-----LVAPLYLLGTVVLNYGAALGLGT 893
+Q + L A +++ ++ + L +RA + P+ LLGT + + T
Sbjct: 333 VQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINT 392

Query: 894 LVFQYGLGKEIAWPVPLLAFIILVAVGADYNMLL---ISRLREESAHNIRVGVLRTVANT 950
L + ++ + + D +++ + R+ E + ++++
Sbjct: 393 LT--------------MFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQI 438

Query: 951 GSVITSAGLIFAASM--FGLIAGSIA-IMIQAGFIIGCGLLLDTFVVRTLTVPAIATLLR 1007
+ ++ +A GS I Q I + L V LT ATLL+
Sbjct: 439 QGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLK 498

Query: 1008 EAS 1010
S
Sbjct: 499 PVS 501


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4839  HTHTETR       45     5e-08    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 45.0 bits (106), Expect = 5e-08
Identities = 17/86 (19%), Positives = 33/86 (38%), Gaps = 2/86 (2%)

Query: 19 AAVLDATRAVATLGGFKAVHFKSVAKQAGVTVGSVYDHFTSKTHLLVTLLAREFVRLDE- 77
+LD + + G + +AK AGVT G++Y HF K+ L + + E
Sbjct: 14 QHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGEL 73

Query: 78 -ERDWSTCAASPIRRVESLTRRLHDE 102
+ P+ + + + +
Sbjct: 74 ELEYQAKFPGDPLSVLREILIHVLES 99


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4840  HTHTETR       53     1e-10    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 52.7 bits (126), Expect = 1e-10
Identities = 14/97 (14%), Positives = 33/97 (34%), Gaps = 3/97 (3%)

Query: 51 ANTGSLRDRRRAELLSQIQGTAHQLFAERGFAAVTTEDIAAASGISISTYFRYAPTKEDL 110
A + + I A +LF+++G ++ + +IA A+G++ + + K DL
Sbjct: 2 ARKTKQEAQETRQ---HILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDL 58

Query: 111 LIAPLRQTVAEIVAAYGTQPSDQSAADALIALFAETA 147
+ + I + +
Sbjct: 59 FSEIWELSESNIGELELEYQAKFPGDPLSVLREILIH 95


62  MMAR_4928  MMAR_4944  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_4928  2       11       -1.196205  NAD-dependent aldehyde dehydrogenase AldA
MMAR_4929  4       12       -1.744135  hypothetical protein
MMAR_4930  3       13       -2.202448  cytochrome P450 123A3 Cyp123A3
MMAR_4931  1       14       -1.678542  short chain dehydrogenase
MMAR_4932  1       15       -2.365727  cytochrome P450 51B1 Cyp51B1
MMAR_4933  1       16       -1.022245  ferredoxin
MMAR_4934  1       13       -0.444615  hypothetical protein
MMAR_4935  2       13       -0.650810  zinc-containing alcohol dehydrogenase NAD-
MMAR_4936  0       11       0.442663   hypothetical protein
MMAR_4937  -1      11       -0.146632  hypothetical protein
MMAR_4938  -1      11       0.916898   hypothetical protein
MMAR_4939  0       13       1.000475   PE-PGRS family protein
MMAR_4940  0       15       2.319764   hypothetical protein
MMAR_4941  0       16       2.882643   two-component system response phosphate sensor
MMAR_4942  0       18       2.647336   two-component system response phosphate regulon
MMAR_4943  0       18       3.215479   hypothetical protein
MMAR_4944  2       18       1.911457   *oxidoreductase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4929  HTHTETR       49     3e-09    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 48.9 bits (116), Expect = 3e-09
Identities = 22/154 (14%), Positives = 51/154 (33%), Gaps = 8/154 (5%)

Query: 20 RRRNRRQEETFRRVLAAGMDTLRASSYPDLTVRMVAARAGVSPATAYTYFSSKNHLIAEV 79
R+ + +ET + +L + ++ +A AGV+ Y +F K+ L +E+
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 80 YLDLVRKV-PFFTDVNVAMRDRVVQALRHLALVVADEPEVGAACTAAL-------LGGGA 131
+ + + + LR + + V + + G
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 132 DPAVRAVRDQIGAEIHRRIASAIGPGAQPGTIAA 165
V+ + + E + RI + + + A
Sbjct: 123 MAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPA 156


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4931  DHBDHDRGNASE  113    3e-32    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 113 bits (283), Expect = 3e-32
Identities = 69/229 (30%), Positives = 113/229 (49%), Gaps = 4/229 (1%)

Query: 13 AIVAGASSGIGAATAVELAAHGFPVALGARRVQKCEEIVEKIRADGGDAVALALDVTDAD 72
A + GA+ GIG A A LA+ G +A +K E++V ++A+ A A DV D+
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSA 70

Query: 73 SVKDFVHQATERLGDIEVLVAGAGDTYFGRLYEIDTETFESQVQIHLIGANRLATAVLPG 132
++ + + +G I++LV AG G ++ + E +E+ ++ G + +V
Sbjct: 71 AIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSKY 130

Query: 133 MLERQRGDLIFVGSDVALRQRPHMGAYGAAKAALVAMVTNLQMELEGTGLRASIVHPGPT 192
M++R+ G ++ VGS+ A R M AY ++KAA V L +EL +R +IV PG T
Sbjct: 131 MMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGST 190

Query: 193 KTAMGWSLPVESIGPALEDWAKWGQARHDYFL----RASDIARAITFVA 237
+T M WSL + G + L + SDIA A+ F+
Sbjct: 191 ETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLV 239


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4941  PF06580       31     0.008    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 31.4 bits (71), Expect = 0.008
Identities = 21/114 (18%), Positives = 36/114 (31%), Gaps = 28/114 (24%)

Query: 361 PRLRQVLSNLVGNALQH----TPDSADVTVRVGTAGQNAVLEVADKGPGMPAEDAARVFE 416
P + ++ LV N ++H P + ++ LEV + G
Sbjct: 256 PPM--LVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNT------ 307

Query: 417 RFYRTDSSRARASGGTGLGLSIVHS-LVKAHGGD--VTLTTAPGEGCCFRVTLP 467
TG GL V L +G + + L+ G+ V +P
Sbjct: 308 ------------KESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNA-MVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4942  HTHFIS        109    9e-30    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 109 bits (273), Expect = 9e-30
Identities = 36/136 (26%), Positives = 63/136 (46%)

Query: 14 ARILVVDDEDNIVELLSVSLKFQGFEVHTATNGAQALDRARETRPDAVILDVMMPGMDGF 73
A ILV DD+ I +L+ +L G++V +N A D V+ DV+MP + F
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 74 GVLRRLRADGIDAPALFLTARDSLQDKIAGLTLGGDDYVTKPFSLEEVVARLRVILRRAG 133
+L R++ D P L ++A+++ I G DY+ KPF L E++ + L
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 134 KGSAEPRNSRLTFADI 149
+ ++ + +
Sbjct: 124 RRPSKLEDDSQDGMPL 139


63  MMAR_4990  MMAR_4999  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_4990  3       20       6.562509   3-ketoacyl-ACP reductase
MMAR_4991  2       20       6.777084   ferredoxin FdxD
MMAR_4992  2       18       6.550036   acyl-CoA dehydrogenase FadE26
MMAR_4993  3       16       6.607761   acyl-CoA dehydrogenase FadE27
MMAR_4994  7       22       8.880183   acyl-CoA synthetase
MMAR_4995  8       23       9.703937   PE-PGRS family protein
MMAR_4996  2       18       6.257140   hypothetical protein
MMAR_4997  1       18       5.892333   hypothetical protein
MMAR_4998  1       20       5.074781   hypothetical protein
MMAR_4999  1       20       5.189805   PE-PGRS family protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4990  DHBDHDRGNASE  77     1e-18    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 77.4 bits (190), Expect = 1e-18
Identities = 55/201 (27%), Positives = 87/201 (43%), Gaps = 10/201 (4%)

Query: 6 NVTDLSGRVAVVTGAAAGLGRAEAVGLARLGATVVVNDIGSALDASDVIDEISAVGAKAI 65
N + G++A +TGAA G+G A A LA GA + D V+ + A A
Sbjct: 2 NAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPE-KLEKVVSSLKAEARHAE 60

Query: 66 AVAGDISQRATADELIAT-ADGLGGLDVVVNNAGITRDRMLFNMTDEDWDQVIGVHLRGH 124
A D+ A DE+ A +G +D++VN AG+ R ++ +++DE+W+ V+ G
Sbjct: 61 AFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGV 120

Query: 125 FLLTRNAATYWRQKAKAAGGSIFGRLVNTSSEAGLVGPVGQANYGAAKAGITALTLSAAR 184
F +R+ + Y + G +V S V A Y ++KA T
Sbjct: 121 FNASRSVSKYMMDRRS-------GSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGL 173

Query: 185 ALGRYGVCANAICP-RARTAM 204
L Y + N + P T M
Sbjct: 174 ELAEYNIRCNIVSPGSTETDM 194


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4995  cloacin       38     2e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.2 bits (88), Expect = 2e-04
Identities = 34/121 (28%), Positives = 46/121 (38%)

Query: 1192 TGGTGGAGNDGTINAGTGGNGGAGGRAGSGGANGGNGGTGGTGGDGGKGGGGAVGGSGGD 1251
+GG G N G + NGG G GGA+ G+G + GG G G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1252 GGDGGINNGAGANGNGGGGGTGGTGSIGAAGGNGGQGGSGGAAGSGGGTAGNGGAGGVGG 1311
G+GG N +G GG + + G+GG A S A + +
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMA 121

Query: 1312 A 1312
A
Sbjct: 122 A 122



Score = 37.0 bits (85), Expect = 6e-04
Identities = 37/114 (32%), Positives = 48/114 (42%), Gaps = 1/114 (0%)

Query: 1247 GSGGDGGDGGINNGAGANGNGGGGGTGGTGSIGAAGGNGGQGGSGGAAGSGGGTAGNGGA 1306
G G G + G ++ +G G G G G+ +G + GG +GSG G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1307 GGVGGAGGTGGVGGGGGDGGAGALAGTGGTGGSGGTGGASGAGGSGGGGGLSGS 1360
G GG G +GG G GG+ A A G T GA G S G LS +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPAL-STPGAGGLAVSISAGALSAA 115



Score = 37.0 bits (85), Expect = 6e-04
Identities = 33/102 (32%), Positives = 37/102 (36%)

Query: 1292 GAAGSGGGTAGNGGAGGVGGAGGTGGVGGGGGDGGAGALAGTGGTGGSGGTGGASGAGGS 1351
G G G T + +G + G GVGGG DG + GGSG G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1352 GGGGGLSGSGQSGANGGTGGASQEGQDGQASTDSDGGAGGAG 1393
G GGG SG GG A S GAGG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 35.8 bits (82), Expect = 0.001
Identities = 29/101 (28%), Positives = 39/101 (38%)

Query: 661 GKGGDGGAGGAAGANGGTGGGGGIGGGGGVGGDGGNGGSGNTGTSSAPAGGTGGIGGTGA 720
G+G + GA +G G G G+GGG G + + G S + GG G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 721 DGGAGGNGGAAGGAGGTGGQGGTGGGAGKGGTGGAGGAAVT 761
G GG+ G + G T GAGG AV+
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVS 106



Score = 35.1 bits (80), Expect = 0.002
Identities = 26/83 (31%), Positives = 32/83 (38%)

Query: 1283 GNGGQGGSGGAAGSGGGTAGNGGAGGVGGAGGTGGVGGGGGDGGAGALAGTGGTGGSGGT 1342
G G+G + GA + G G GVGG G + G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1343 GGASGAGGSGGGGGLSGSGQSGA 1365
G G G SGGG G G+ + A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVA 85



Score = 34.7 bits (79), Expect = 0.003
Identities = 26/79 (32%), Positives = 34/79 (43%)

Query: 207 GGMGGNGGSGSWLIGGGGAGGAGGIGGTGGGTGGVGGTGGDAGWLVGAGGSGGSGGIGAT 266
GG G +G+ G GG G+G GG + G G + + W G+G GG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 267 GTGSGGAAGNGGNGGAGGL 285
G G G GG+G G L
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 34.7 bits (79), Expect = 0.003
Identities = 30/97 (30%), Positives = 34/97 (35%)

Query: 1226 GNGGTGGTGGDGGKGGGGAVGGSGGDGGDGGINNGAGANGNGGGGGTGGTGSIGAAGGNG 1285
G G G G GG G G G G + N GGG G+G G+ GNG
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 1286 GQGGSGGAAGSGGGTAGNGGAGGVGGAGGTGGVGGGG 1322
G G+ G GG A G G GG
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 34.3 bits (78), Expect = 0.004
Identities = 28/86 (32%), Positives = 35/86 (40%)

Query: 774 TGGTGGHGGTGGHGGTGGAAGNGGAGGAGGQAGDGGSGGEGGDGGTGAQGTTVSLPGSGG 833
+GG G TG H +G G G GG A DG + G G+ + G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 834 NAGNGGNAGGGGVGGTGGAGATGAPA 859
+ GGN GG GTGG + A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAP 87



Score = 34.3 bits (78), Expect = 0.004
Identities = 26/80 (32%), Positives = 30/80 (37%)

Query: 727 NGGAAGGAGGTGGQGGTGGGAGKGGTGGAGGAAVTPGLGIGDEPAGGTGGTGGHGGTGGH 786
+GG G G G G GGA+ G + P GG G+G H G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 787 GGTGGAAGNGGAGGAGGQAG 806
G GG GN G G G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 33.9 bits (77), Expect = 0.005
Identities = 26/81 (32%), Positives = 33/81 (40%)

Query: 1164 SPAGGNGSAGGDGGDGGDVASGATGTAGTGGTGGAGNDGTINAGTGGNGGAGGRAGSGGA 1223
S G G G G++ G TG GG + N GG G+G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1224 NGGNGGTGGTGGDGGKGGGGA 1244
+G GG G +GG G GG +
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLS 82



Score = 33.5 bits (76), Expect = 0.006
Identities = 34/97 (35%), Positives = 40/97 (41%), Gaps = 1/97 (1%)

Query: 294 AGGQGGDGAAGAAAAAAGGTGGDGGLGGNGGAGGAGGGGPVLFGHGGAGGLG-GQGGTGG 352
+GG G GA + + GG GLG GGA G GG G G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 353 IGGTGGAGTTSIAAGTGGNGGAGGAAGGGGTAGAAGP 389
G GG G + +GTGGN A A G + P
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTP 98



Score = 33.1 bits (75), Expect = 0.008
Identities = 33/108 (30%), Positives = 39/108 (36%), Gaps = 6/108 (5%)

Query: 631 GGHGGAGGNGVDGADANVFTGDNGGDGTAGGKGGDGGAGGAAGANGGTGGGGGIGGGGGV 690
GG G G N+ G G G G G + GG+G G GGG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 691 GGDGGNGGSGNTGTSSAPAGGTGGIGGTGADGGAGGNGGAAGGAGGTG 738
G GGNG SG +G G + A G + GAGG
Sbjct: 63 GNGGGNGNSGGG------SGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.8 bits (74), Expect = 0.011
Identities = 24/80 (30%), Positives = 32/80 (40%)

Query: 497 GGAAGSGGGGQTGSLGIGGTGGHGGAGGDGTPGGNGAAGPTAGTGDPGVDGGAGGNGGAG 556
G G G + S I G G GG + G ++ G G GG G G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 557 GQGGSNGTGGDGGTGGSGAT 576
GG+ +GG GTGG+ +
Sbjct: 64 NGGGNGNSGGGSGTGGNLSA 83



Score = 32.8 bits (74), Expect = 0.012
Identities = 23/76 (30%), Positives = 27/76 (35%)

Query: 433 GGNGGNGGAGGAALGGGAAGDGGQGGHGGAGGMGGAGANGITGGGVTGAGGNGGDGGQGG 492
G N G G GG G G G+G G G GG G G GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 493 EPGTGGAAGSGGGGQT 508
+GG +G+GG
Sbjct: 68 NGNSGGGSGTGGNLSA 83



Score = 32.8 bits (74), Expect = 0.013
Identities = 30/101 (29%), Positives = 37/101 (36%)

Query: 451 AGDGGQGGHGGAGGMGGAGANGITGGGVTGAGGNGGDGGQGGEPGTGGAAGSGGGGQTGS 510
+G G+G + GA G G TG GV G +G P GG+ G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 511 LGIGGTGGHGGAGGDGTPGGNGAAGPTAGTGDPGVDGGAGG 551
G GG G+ G G + A P A GAGG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 32.4 bits (73), Expect = 0.013
Identities = 33/79 (41%), Positives = 40/79 (50%), Gaps = 5/79 (6%)

Query: 1176 GGDGGDVASGATGTAGT--GGTGGAGNDGTINAGTG---GNGGAGGRAGSGGANGGNGGT 1230
GGDG +GA T+G GG G G G + G+G N GG +GSG GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1231 GGTGGDGGKGGGGAVGGSG 1249
G GG+G GGG GG+
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 32.0 bits (72), Expect = 0.018
Identities = 28/78 (35%), Positives = 32/78 (41%)

Query: 166 GAGGAGGTGGARGGNGGSGGLLFGGGGIGGTGGTGAAGAGMGGMGGNGGSGSWLIGGGGA 225
G G G GA +G G G G GG + GG GSG GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 226 GGAGGIGGTGGGTGGVGG 243
G GG G +GGG+G G
Sbjct: 63 GNGGGNGNSGGGSGTGGN 80



Score = 32.0 bits (72), Expect = 0.020
Identities = 31/97 (31%), Positives = 35/97 (36%)

Query: 427 GTNGYIGGNGGNGGAGGAALGGGAAGDGGQGGHGGAGGMGGAGANGITGGGVTGAGGNGG 486
G N GN G LG G G G GG +GI GG +G G GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 487 DGGQGGEPGTGGAAGSGGGGQTGSLGIGGTGGHGGAG 523
+G GG GTGG + T G GG
Sbjct: 68 NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.0 bits (72), Expect = 0.021
Identities = 33/103 (32%), Positives = 38/103 (36%), Gaps = 2/103 (1%)

Query: 712 TGGIGGTGADGGAGGNGGAAGGAGGTGGQGGTGGGAGKGGTGGAGGAAVTPGLGIGDEPA 771
+GG G G +G GG G G GG G+G G G GI
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGG--SGSGIHWGGG 59

Query: 772 GGTGGTGGHGGTGGHGGTGGAAGNGGAGGAGGQAGDGGSGGEG 814
G G GG+G +GG GTGG A A G G G
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 31.6 bits (71), Expect = 0.026
Identities = 34/102 (33%), Positives = 38/102 (37%), Gaps = 5/102 (4%)

Query: 139 GSGGNGAAGAAGQAGGS--GGSAGLLGNGGAG---GAGGTGGARGGNGGSGGLLFGGGGI 193
G G G A G+ GG GL GGA G GG GSG GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 194 GGTGGTGAAGAGMGGMGGNGGSGSWLIGGGGAGGAGGIGGTG 235
G GG G +G G G G + + G A G GG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 30.8 bits (69), Expect = 0.042
Identities = 24/80 (30%), Positives = 28/80 (35%)

Query: 1149 NGGNGGNANNGTIEGSPAGGNGSAGGDGGDGGDVASGATGTAGTGGTGGAGNDGTINAGT 1208
+GG+G N G S G G G G SG + G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1209 GGNGGAGGRAGSGGANGGNG 1228
GNGG G +G G GGN
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 30.8 bits (69), Expect = 0.045
Identities = 35/104 (33%), Positives = 40/104 (38%), Gaps = 3/104 (2%)

Query: 1297 GGGTAGNGGAGGVGGA--GGTGGVGGGGGDGGAGALAGTGGTGGSGGTGGASGAGGSGGG 1354
G G N GA G GG G+G GGG G+ + GG+G GG G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGG-ASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1355 GGLSGSGQSGANGGTGGASQEGQDGQASTDSDGGAGGAGGTGGS 1398
G G+G SG GTGG A GAGG S
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVS 106



Score = 30.8 bits (69), Expect = 0.049
Identities = 31/99 (31%), Positives = 42/99 (42%), Gaps = 11/99 (11%)

Query: 551 GNGGAGGQGGSNGTGGDGGTGGSGATGGDGGTGGAGTATDAGGTGGQGGVGGAGGAPGAA 610
G G G G++ T G+ G +G G G + G+G +++ GG G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSG--------- 53

Query: 611 GLAGTGGQAGNEGVGGNGGQGGHGGAGGNGVDGADANVF 649
GG +G+ GGNG GG G GGN A F
Sbjct: 54 --IHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAF 90


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score   E-value   Comments
MMAR_4996   BLACTAMASEA   30      0.008     Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 29.8 bits (67), Expect = 0.008
Identities = 16/42 (38%), Positives = 23/42 (54%), Gaps = 4/42 (9%)

Query: 73 AITESDNAAAEQLWSQLGDPLDAAQQVQAVIGTAGDECTRVE 114
AIT SDN+AA L + +G P + A + GD TR++
Sbjct: 122 AITMSDNSAANLLLATVGGP----AGLTAFLRQIGDNVTRLD 159


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits      Score   E-value   Comments
MMAR_4999   cloacin   39      2e-04     Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.9 bits (90), Expect = 2e-04
Identities = 32/80 (40%), Positives = 35/80 (43%), Gaps = 1/80 (1%)

Query: 1443 GAAGTGGIGGAAGAGGNGNGGAGGTGGVGGGGGDGGTGGSSGKGGDGGTGGTGAVGGMGG 1502
G G G GA GN NGG G GVGGG DG S GG+G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTG-LGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1503 AGGSGTGTGTGGTGGDGGDG 1522
G G +GG G GG+
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 37.4 bits (86), Expect = 4e-04
Identities = 36/120 (30%), Positives = 46/120 (38%)

Query: 1116 GSGGDGGAGGIGGVAGAGGQGGNAGAGGNGTGGDGGNGGIGGTGGVGGDAEPGAGGNGGA 1175
G G + GA G G G G G + G G G G G G+G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 1176 GGHGGTGGAAGAGGSGSGSGADGSSGMGGTGGQGGDGGAGSTGANAANGSGATGKAGFAG 1235
GG+G +GG +G GG+ S A + G G G A S A A + + A A G
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMAALKG 125



Score = 37.4 bits (86), Expect = 5e-04
Identities = 30/100 (30%), Positives = 34/100 (34%)

Query: 1411 GAAGAGGTGGNGGKGGNARLNGNGDGGMGGQGGAAGTGGIGGAAGAGGNGNGGAGGTGGV 1470
G G G G GN G G GG +G G G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1471 GGGGGDGGTGGSSGKGGDGGTGGTGAVGGMGGAGGSGTGT 1510
G GGG+G +GG SG GG+ G G G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 37.0 bits (85), Expect = 6e-04
Identities = 25/75 (33%), Positives = 27/75 (36%)

Query: 1337 GAGGAGGAGGAGGDGIGSTGGGTGGVGGNAGDGGDGGNGSNGGNGNSDGGTGGTGGAAGA 1396
G G GA G+ G G G G + G G N GG S GG G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 1397 GGTGGVGGSGGGDGG 1411
GG G GG G G
Sbjct: 66 GGNGNSGGGSGTGGN 80



Score = 36.2 bits (83), Expect = 0.001
Identities = 34/87 (39%), Positives = 38/87 (43%), Gaps = 1/87 (1%)

Query: 1202 MGGTGGQGGDGGAGSTGANAANGSGATGKAGFAGGKGGGGGDGGAGIGGVGGGDGGNGGS 1261
M G G+G + GA ST N G G G A G + GG G G GGS
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 1262 GGLGGDGGNGGSGGTNQTGGKGGAGGT 1288
G G GGNG SGG + TGG A
Sbjct: 61 GHGNG-GGNGNSGGGSGTGGNLSAVAA 86



Score = 36.2 bits (83), Expect = 0.001
Identities = 33/79 (41%), Positives = 38/79 (48%), Gaps = 2/79 (2%)

Query: 1241 GGDGGAGIGGVGGGDGG-NGGSGGLGGDGG-NGGSGGTNQTGGKGGAGGTGGDGGAAGAG 1298
GGDG G G NGG GLG GG + GSG +++ GG G+G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1299 GAGGGANGSGGTGGTGGTG 1317
G GGG SGG GTGG
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 35.8 bits (82), Expect = 0.001
Identities = 27/81 (33%), Positives = 35/81 (43%)

Query: 1129 VAGAGGQGGNAGAGGNGTGGDGGNGGIGGTGGVGGDAEPGAGGNGGAGGHGGTGGAAGAG 1188
++G G+G N GA +GG G+G GG + + N GG G G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 1189 GSGSGSGADGSSGMGGTGGQG 1209
G G+G G S G GTGG
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNL 81



Score = 35.1 bits (80), Expect = 0.002
Identities = 24/79 (30%), Positives = 31/79 (39%)

Query: 1232 GFAGGKGGGGGDGGAGIGGVGGGDGGNGGSGGLGGDGGNGGSGGTNQTGGKGGAGGTGGD 1291
G G G+ G G+G G G + GSG + GG G+ G G G GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 1292 GGAAGAGGAGGGANGSGGT 1310
G +G G GG +
Sbjct: 68 NGNSGGGSGTGGNLSAVAA 86



Score = 35.1 bits (80), Expect = 0.002
Identities = 28/80 (35%), Positives = 35/80 (43%)

Query: 1355 TGGGTGGVGGNAGDGGDGGNGSNGGNGNSDGGTGGTGGAAGAGGTGGVGGSGGGDGGAAG 1414
+GG G A NG G G G + G+G ++ GG GSG GG +G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1415 AGGTGGNGGKGGNARLNGNG 1434
G GGNG GG + GN
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 34.3 bits (78), Expect = 0.004
Identities = 25/58 (43%), Positives = 27/58 (46%)

Query: 438 NGGTGGAGAGGTDGGGSGGAGGAGGNGGAGGRAPLLFGRGGTGGYGGSGGSGGHGGTG 495
NGG G G GG GSG + GG G G G G GG+G SGG GTG
Sbjct: 21 NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTG 78



Score = 33.9 bits (77), Expect = 0.005
Identities = 27/82 (32%), Positives = 31/82 (37%)

Query: 976 GDGGTGGTAGTGGTGSTMGATGTGGTGGVGGSGGTGGHGGHGPLNSDPGGAGGKGGDGGT 1035
G G G G T + TG G G S G+G + P G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1036 GGTGGAGIGTSGGGTGGDGGAA 1057
G GG G G GTGG+ A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAV 84



Score = 33.9 bits (77), Expect = 0.005
Identities = 31/86 (36%), Positives = 37/86 (43%), Gaps = 2/86 (2%)

Query: 1264 LGGDGGNGGSGGTNQTGG--KGGAGGTGGDGGAAGAGGAGGGANGSGGTGGTGGTGGDGG 1321
+ G G G + G + T G GG G G GGA+ G N GG G+G G G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 1322 KGGTVTGNGKAGTTGGAGGAGGAGGA 1347
G GNG +G G GG A A
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 33.9 bits (77), Expect = 0.006
Identities = 35/103 (33%), Positives = 39/103 (37%), Gaps = 5/103 (4%)

Query: 1316 TGGDGGKGGTVTGNGKAGTTGGAGGAGGAGGAG-GDGIGST----GGGTGGVGGNAGDGG 1370
+GGDG T + GG G G GGA G G S GGG+G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1371 DGGNGSNGGNGNSDGGTGGTGGAAGAGGTGGVGGSGGGDGGAA 1413
G G NG +G G G A G S G GG A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 33.5 bits (76), Expect = 0.007
Identities = 30/102 (29%), Positives = 36/102 (35%)

Query: 1370 GDGGNGSNGGNGNSDGGTGGTGGAAGAGGTGGVGGSGGGDGGAAGAGGTGGNGGKGGNAR 1429
G G G N G ++ G G G GG G + G G G GG+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1430 LNGNGDGGMGGQGGAAGTGGIGGAAGAGGNGNGGAGGTGGVG 1471
NG G+G GG G G A A G G GG+
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.8 bits (74), Expect = 0.011
Identities = 26/81 (32%), Positives = 28/81 (34%)

Query: 1403 GGSGGGDGGAAGAGGTGGNGGKGGNARLNGNGDGGMGGQGGAAGTGGIGGAAGAGGNGNG 1462
GG G G A + NGG G G DG GG G GG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1463 GAGGTGGVGGGGGDGGTGGSS 1483
G GG G GGG G S+
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSA 83



Score = 32.8 bits (74), Expect = 0.013
Identities = 33/116 (28%), Positives = 42/116 (36%), Gaps = 1/116 (0%)

Query: 1303 GANGSGGTGGTGGTGGDGGKGGTVTGNGKAGTTGGAGGAGGAGGAGGDGIGSTGGGTGGV 1362
G +G G G T G+ G T G G + G + GG G G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1363 GGNAGDGGDGGNGSNGGNGNSDGGTGGTG-GAAGAGGTGGVGGSGGGDGGAAGAGG 1417
G G+G GG GGN ++ G A G GG+ S +A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 32.8 bits (74), Expect = 0.014
Identities = 30/82 (36%), Positives = 35/82 (42%), Gaps = 2/82 (2%)

Query: 153 AGGSGGLLGNGGNGGAGGIGGGAGGVGGNGGWLYGRGGVGGAGGMGGGTGGAGGHAWLFG 212
+GG G G + +G I GG G+G GG G G GGG+G G W G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSG--SGIHWGGG 59

Query: 213 HGGTGGLGGGGGIGAGGAGGNG 234
G G G G G G GGN
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNL 81



Score = 32.4 bits (73), Expect = 0.015
Identities = 34/118 (28%), Positives = 39/118 (33%), Gaps = 4/118 (3%)

Query: 431 MHGGAGGNGGTGGAGAGGTDGGGSGGAGGAGGNGGAGG----RAPLLFGRGGTGGYGGSG 486
M GG G TG G GG G G GG G P G G +GG
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 487 GSGGHGGTGLDGNLAGLGGDGGGGGTGGDGGAPGTGGAGGARHLFSHNGRSGSTGIGG 544
G G GG G G +G GG+ G P G S + + S I
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 32.4 bits (73), Expect = 0.016
Identities = 32/107 (29%), Positives = 40/107 (37%), Gaps = 3/107 (2%)

Query: 1024 GGAGGKGGDGGTGGTGGAGIGTSGGGTGGDGGAAGDGGTGGDGGEVGGTGGVGGAAGTGG 1083
G G G G G+G GG + G G ++ + GG G GG G G
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH---GN 64

Query: 1084 DGGDGGTGGGDGGKGGTGGTGGTGGIGDPRVGGSGGDGGAGGIGGVA 1130
GG+G +GGG G G G P + G G A I A
Sbjct: 65 GGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 32.0 bits (72), Expect = 0.022
Identities = 30/94 (31%), Positives = 36/94 (38%)

Query: 137 GNGGNGAAGAAGQAGGAGGSGGLLGNGGNGGAGGIGGGAGGVGGNGGWLYGRGGVGGAGG 196
GN G G G + GSG N GG G G GG G+G GG+G
Sbjct: 18 GNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGT 77

Query: 197 MGGGTGGAGGHAWLFGHGGTGGLGGGGGIGAGGA 230
G + A A+ F T G GG + GA
Sbjct: 78 GGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 32.0 bits (72), Expect = 0.022
Identities = 24/83 (28%), Positives = 30/83 (36%), Gaps = 1/83 (1%)

Query: 1179 GGTGGAAGAGGSGSGSGADGSSGMGGTGGQGGDGGAGSTGANAANGSGATGKAGFAGGKG 1238
GG G G + +G G GG G G+G + N G G+ + GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGG-GASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1239 GGGGDGGAGIGGVGGGDGGNGGS 1261
G G G GG G G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAV 84



Score = 31.6 bits (71), Expect = 0.027
Identities = 31/108 (28%), Positives = 35/108 (32%)

Query: 1083 GDGGDGGTGGGDGGKGGTGGTGGTGGIGDPRVGGSGGDGGAGGIGGVAGAGGQGGNAGAG 1142
G G G G G G G+G GSG GG +G+G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1143 GNGTGGDGGNGGIGGTGGVGGDAEPGAGGNGGAGGHGGTGGAAGAGGS 1190
GNG G GG G G + A P A G G G A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 31.2 bits (70), Expect = 0.040
Identities = 31/93 (33%), Positives = 35/93 (37%), Gaps = 2/93 (2%)

Query: 1483 SGKGGDGGTGGTGAVGGMGGAGGSGTGTGTGGTGGDGGDGGDGGDGGGGDGGAIPGTGGG 1542
SG G G G + G G +G G G G + G G + GGG G G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1543 DGGSGDTGNLPVGVTGGTGGTGGTAGTAGAGGL 1575
G G GN G GTGG A G
Sbjct: 62 HGNGGGNGN--SGGGSGTGGNLSAVAAPVAFGF 92



Score = 31.2 bits (70), Expect = 0.040
Identities = 24/64 (37%), Positives = 30/64 (46%)

Query: 397 GTGGLGGAGGAAASTGTAGGVGGDGGAGGRGGLFMHGGAGGNGGTGGAGAGGTDGGGSGG 456
G GLG GGA+ +G + GG G G + G GNGG G GG+ GG+
Sbjct: 23 GPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLS 82

Query: 457 AGGA 460
A A
Sbjct: 83 AVAA 86



Score = 30.8 bits (69), Expect = 0.044
Identities = 27/83 (32%), Positives = 33/83 (39%)

Query: 220 GGGGGIGAGGAGGNGGLLYGHGGAGAAGGAGQAGGDGGSAGLWGRGGAGGAAGVGGSTGG 279
G G G G +G + G G G GGA G WG G G GGS G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 280 SGGNGGLLIGAGASGGAGTSGGA 302
+GG G G +GG ++ A
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAA 86


64   MMAR_5033   MMAR_5044   Y   Y   N   Pathogenicity Island (biased-composition)
LocusTag    DNBias   CDNBias   %GCBias     Product
MMAR_5033   2        9         2.107021    acetyl-CoA acetyltransferase
MMAR_5034   4        10        2.175600    hypothetical protein
MMAR_5035   3        11        2.126262    hypothetical protein
MMAR_5036   3        10        2.068808    short chain dehydrogenase
MMAR_5037   2        11        1.833498    short chain dehydrogenase
MMAR_5038   1        9         0.672591    PE-PGRS family protein
MMAR_5039   8        20        1.253445    enoyl-CoA hydratase
MMAR_5040   7        19        1.021166    CoA-transferase subunit alpha
MMAR_5041   7        19        0.812075    CoA-transferase subunit beta
MMAR_5042   7        17        0.507910    2-nitropropane dioxygenase
MMAR_5043   8        18        0.486744    electron transfer protein FdxB
MMAR_5044   8        20        1.149371    PE-PGRS family protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
MMAR_5036   DHBDHDRGNASE   76      2e-18     2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 76.2 bits (187), Expect = 2e-18
Identities = 63/267 (23%), Positives = 114/267 (42%), Gaps = 30/267 (11%)

Query: 4 LDGRVVIVTGAGGGIGRAHALAFAAEGARVVVNDIGVGLDGSPAGGGSAAHGVVDEITAA 63
++G++ +TGA GIG A A A++GA + +D +P VV + A
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIA------AVDYNP----EKLEKVVSSLKAE 55

Query: 64 GGEAVANGSDVSNWQQAADLISTAVDTFGGLDVLVNNAGIVRDRMMANTSEEEFDAVIAV 123
A A +DV + ++ + G +D+LVN AG++R ++ + S+EE++A +V
Sbjct: 56 ARHAEAFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSV 115

Query: 124 HLKGHFATMRHAASYWRGLSKAGKAVDARIINTSSGAGLQGSVGQANYSAAKAGIAALTL 183
+ G F R + Y I+ S A Y+++KA T
Sbjct: 116 NSTGVFNASRSVSKYMMDRRSGS------IVTVGSNPAGVPRTSMAAYASSKAAAVMFTK 169

Query: 184 VGAAEMGRYGVTVNAIAPAA-RTRMTETVFAEMMAKPD--EGFDAM-----------APE 229
E+ Y + N ++P + T M +++A+ +G P
Sbjct: 170 CLGLELAEYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPS 229

Query: 230 NVSPLVVWLASAEAGAVTGKVFEVEGG 256
+++ V++L S +AG +T V+GG
Sbjct: 230 DIADAVLFLVSGQAGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
MMAR_5037   DHBDHDRGNASE   98      1e-26     2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 97.8 bits (243), Expect = 1e-26
Identities = 70/254 (27%), Positives = 106/254 (41%), Gaps = 23/254 (9%)

Query: 13 GLAGRVVLVTGGVRGVGAGISSVFAEQGATV------------ITCARRAVENSPHEFHC 60
G+ G++ +TG +G+G ++ A QGA + + + +A F
Sbjct: 5 GIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPA 64

Query: 61 CDIRDEDSVKRLMGSIAERHGRLDVLVNNAGGSPYALAAEATPRFHSKIVELNLLAPLLV 120
D+RD ++ + I G +D+LVN AG L + +N
Sbjct: 65 -DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNA 123

Query: 121 SQHAYGLMQEQPTGGSIVNVCSVSGRRPTPGTAAYGAAKAGFESLTSTLAVEWAP-KIRV 179
S+ M ++ GSIV V S P AAY ++KA T L +E A IR
Sbjct: 124 SRSVSKYMMDR-RSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 180 NALVVGMVATE-QSELF---YGDAESIARVAAT----VPLGRLAQPADIGWAAAFLASDL 231
N + G T+ Q L+ G + I T +PL +LA+P+DI A FL S
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 232 ASYISGATLEVHGG 245
A +I+ L V GG
Sbjct: 243 AGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits        Score   E-value   Comments
MMAR_5038   RTXTOXINA   43      5e-06     Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 42.6 bits (100), Expect = 5e-06
Identities = 43/132 (32%), Positives = 54/132 (40%), Gaps = 17/132 (12%)

Query: 469 GQGVAGIDG-DGGDGGNGGNGSTTDHGVIPQGWGGNGGNGGRGFNGGDGGDGGNGGSGPT 527
G I+G DG D G G+ T G GNG GGDG D G +G
Sbjct: 743 ADGDDLIEGNDGNDRLYGDKGNDTLSG----------GNGDDQLYGGDGNDKLIGVAGNN 792

Query: 528 TLRLFGRVGGDGGDG-GRGGDAYGGNAGWGGDGGNGGRGAYGSGLIPPARGGDGGNGGDG 586
L GGDG D G++ N +GG G + G+ G+ L+ G D GG G
Sbjct: 793 YL-----NGGDGDDEFQVQGNSLAKNVLFGGKGNDKLYGSEGADLLDGGEGDDLLKGGYG 847

Query: 587 GDLYINGVMYQH 598
D+Y Y H
Sbjct: 848 NDIYRYLSGYGH 859



Score = 35.3 bits (81), Expect = 8e-04
Identities = 39/121 (32%), Positives = 49/121 (40%), Gaps = 15/121 (12%)

Query: 511 FNGGDGGDGGNGGSGPTTLRLFGRVGGDGGDGGRGGDAYGGNAGWGGDGGNGGRGAYGSG 570
F+G DG D G G RL+G G D GG G D +GGDG + G G+
Sbjct: 740 FHGADGDDLIEGNDGND--RLYGDKGNDTLSGGNGDDQL-----YGGDGNDKLIGVAGNN 792

Query: 571 LIPPARGGDGGNGGDGGDLYINGVMYQ--HGGAGFDGLPGEPPDIPPIGGLGGKGGSGGL 628
+ +GG+G D + N + GG G D L G GG G GG
Sbjct: 793 YL------NGGDGDDEFQVQGNSLAKNVLFGGKGNDKLYGSEGADLLDGGEGDDLLKGGY 846

Query: 629 G 629
G
Sbjct: 847 G 847



Score = 34.9 bits (80), Expect = 0.001
Identities = 35/122 (28%), Positives = 43/122 (35%), Gaps = 6/122 (4%)

Query: 440 GGNGGDGGNGGYGNLGGYGGLSGDGSTRAGQGVAGIDG-DGGDGGNGGNGSTTDHGVIPQ 498
G +G D G GN YG G+ + G G + G DG D G G+ +G
Sbjct: 742 GADGDDLIEGNDGNDRLYGD-KGNDTLSGGNGDDQLYGGDGNDKLIGVAGNNYLNGGDGD 800

Query: 499 G--WGGNGGNGGRGFNGGDGGDGGNGGSGPTTLRLFGRVGGDGGDGGRGGDAYGGNAGWG 556
GG G D G G L G G D GG G D Y +G+G
Sbjct: 801 DEFQVQGNSLAKNVLFGGKGNDKLYGSEGADLLD--GGEGDDLLKGGYGNDIYRYLSGYG 858

Query: 557 GD 558

Sbjct: 859 HH 860



Score = 31.9 bits (72), Expect = 0.011
Identities = 37/117 (31%), Positives = 39/117 (33%), Gaps = 1/117 (0%)

Query: 245 GGDARGIGDGGRGGDGGPGATGAPGGRGSDGGPGGKGGNAGDYGNGGTGGTGGIGGAGGP 304
G I G G D G G G G GGN D GG G IG AG
Sbjct: 733 GSKFTDIFHGADGDDLIEGNDGNDRLYGDKGNDTLSGGNGDDQLYGGDGNDKLIGVAGNN 792

Query: 305 GSPGQSWGDPGSQGGLGGRGGAGGIGGDGGQLSGSGGAGGI-GGEGGHGGAGGDGKD 360
G D G G +L GS GA + GGEG GG G D
Sbjct: 793 YLNGGDGDDEFQVQGNSLAKNVLFGGKGNDKLYGSEGADLLDGGEGDDLLKGGYGND 849


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits      Score   E-value   Comments
MMAR_5044   cloacin   37      5e-04     Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 37.0 bits (85), Expect = 5e-04
Identities = 35/99 (35%), Positives = 38/99 (38%)

Query: 509 GFGGAGGFGGASGFGTGGAGGVGGAGGALFGVGGVGGNGAWGAGGIGGDGGVGGAGSALG 568
G G G SG GG G+G GGA G G N WG G G GG+G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 569 FGLGGAGGAGGASGLGDGGAGGTGGAGGLLGGNGGGGGA 607
G G +GG G G A L G GG A
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 37.0 bits (85), Expect = 5e-04
Identities = 35/99 (35%), Positives = 38/99 (38%)

Query: 1191 GFGGAGGFGGASGFGTGGAGGVGGAGGALFGVGGVGGNGAWGAGGIGGDGGVGGAGSALG 1250
G G G SG GG G+G GGA G G N WG G G GG+G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 1251 FGLGGAGGAGGASGLGDGGAGGTGGAGGLLGGNGGGGGA 1289
G G +GG G G A L G GG A
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 36.6 bits (84), Expect = 8e-04
Identities = 29/86 (33%), Positives = 36/86 (41%), Gaps = 8/86 (9%)

Query: 1356 GSGGAGGIGGAVGTGGTVAGAGGDGGGAGWVGNAGNGGAGGDGW-TNGNGGDGGDGGDVG 1414
G G G GA T G + G G G G + G GW + N GG G +
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGG-------GASDGSGWSSENNPWGGGSGSGIH 55

Query: 1415 LFGAGGNGGNGGTGVTAGTGGTGGSG 1440
G G+G GG G + G GTGG+
Sbjct: 56 WGGGSGHGNGGGNGNSGGGSGTGGNL 81



Score = 35.5 bits (81), Expect = 0.002
Identities = 35/112 (31%), Positives = 41/112 (36%), Gaps = 3/112 (2%)

Query: 167 GAGGAGGNTDLFGNNGAVGGAGGAGGWLIGSGGAGGTGGIGAFNGGAGGAGGSAWLFGTG 226
G G G NT +G + G G GGA G + N GG GS +G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLG---VGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 227 GVGGTGGIGTLGVGGAGGAGGGSGVLSFADGGAGGFGGSGANAGGVGGSGGA 278
G GG GG+G G S V + G GA V S GA
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 35.5 bits (81), Expect = 0.002
Identities = 35/112 (31%), Positives = 41/112 (36%), Gaps = 3/112 (2%)

Query: 849 GAGGAGGNTDLFGNNGAVGGAGGAGGWLIGSGGAGGTGGIGAFNGGAGGAGGSAWLFGTG 908
G G G NT +G + G G GGA G + N GG GS +G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLG---VGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 909 GVGGTGGIGTLGVGGAGGAGGGSGVLSFADGGAGGFGGSGANAGGVGGSGGA 960
G GG GG+G G S V + G GA V S GA
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 34.3 bits (78), Expect = 0.004
Identities = 36/118 (30%), Positives = 47/118 (39%), Gaps = 5/118 (4%)

Query: 568 GFGLGGAGGAGGASGLGDGGAGGTGGAGGLLGGNGGGGGAGGAGAVGFLDNTADGAGGAG 627
G G G GA SG +GG G G GG + G G + G + GG
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGG---ASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 628 GAGGSGGLLFGDGGVGGAGGAGGISTSSSVNNGGAGGAGGNAGGLFGSGGAGGIGGAV 685
G G GG G+ G G G + ++ V G + AGGL S AG + A+
Sbjct: 61 GHGNGGG--NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAI 116



Score = 34.3 bits (78), Expect = 0.004
Identities = 36/118 (30%), Positives = 47/118 (39%), Gaps = 5/118 (4%)

Query: 1250 GFGLGGAGGAGGASGLGDGGAGGTGGAGGLLGGNGGGGGAGGAGAVGFLDNTADGAGGAG 1309
G G G GA SG +GG G G GG + G G + G + GG
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGG---ASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 1310 GAGGSGGLLFGDGGVGGAGGAGGISTSSSVNNGGAGGAGGNAGGLFGSGGAGGIGGAV 1367
G G GG G+ G G G + ++ V G + AGGL S AG + A+
Sbjct: 61 GHGNGGG--NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAI 116



Score = 34.3 bits (78), Expect = 0.004
Identities = 28/77 (36%), Positives = 37/77 (48%), Gaps = 4/77 (5%)

Query: 1340 NNGGAGGAGGNAGGLFGSGGAGGIGGAVGTGGTVAG---AGGDGGGAGWVGNAGNGGAGG 1396
+N GA GN G G GG G + G+G + GG G G W G +G+G GG
Sbjct: 9 HNTGAHSTSGNINGGPTGLGVGG-GASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 1397 DGWTNGNGGDGGDGGDV 1413
+G + G G GG+ V
Sbjct: 68 NGNSGGGSGTGGNLSAV 84



Score = 33.9 bits (77), Expect = 0.005
Identities = 35/99 (35%), Positives = 40/99 (40%), Gaps = 1/99 (1%)

Query: 350 GVGGIGGTGGTGFSGVGGNAGAGGDGGLLWGSGGAGARGGQGGAGAGGIG-GAGGGGGLF 408
G G G T + GG G G GG GSG + GG GI G G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 409 GGDGGAGGVGGTGAGGPGGAGGVGGDAGLFGVGGAGGVG 447
GG+G +GG GTG A V GAGG+
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 33.9 bits (77), Expect = 0.005
Identities = 35/99 (35%), Positives = 40/99 (40%), Gaps = 1/99 (1%)

Query: 1032 GVGGIGGTGGTGFSGVGGNAGAGGDGGLLWGSGGAGARGGQGGAGAGGIG-GAGGGGGLF 1090
G G G T + GG G G GG GSG + GG GI G G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 1091 GGDGGAGGVGGTGAGGPGGAGGVGGDAGLFGVGGAGGVG 1129
GG+G +GG GTG A V GAGG+
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 33.9 bits (77), Expect = 0.005
Identities = 27/74 (36%), Positives = 36/74 (48%), Gaps = 4/74 (5%)

Query: 658 NNGGAGGAGGNAGGLFGSGGAGGIGGAVGTGGTVAG---AGGDGGGAGWVGNAGNGGAGG 714
+N GA GN G G GG G + G+G + GG G G W G +G+G GG
Sbjct: 9 HNTGAHSTSGNINGGPTGLGVGG-GASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 715 DGWTNGNGGDGGDG 728
+G + G G GG+
Sbjct: 68 NGNSGGGSGTGGNL 81



Score = 33.5 bits (76), Expect = 0.007
Identities = 29/84 (34%), Positives = 36/84 (42%), Gaps = 8/84 (9%)

Query: 674 GSGGAGGIGGAVGTGGTVAGAGGDGGGAGWVGNAGNGGAGGDGWTNGNGGDGGDGGDAV- 732
G G G GA T G + G G G G + G GW++ N GG G +
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGG-------GASDGSGWSSENNPWGGGSGSGIH 55

Query: 733 LVGNGGNGGNGGTGLVAGSTGTGG 756
G G+G GG G G +GTGG
Sbjct: 56 WGGGSGHGNGGGNGNSGGGSGTGG 79



Score = 33.5 bits (76), Expect = 0.007
Identities = 36/104 (34%), Positives = 41/104 (39%), Gaps = 7/104 (6%)

Query: 476 GLGGNGGTGGFSGPLSAGAGGGDGGAGGGVGLIGFGGAGGFGGASGFGTGGAGGVGGAGG 535
G G N G SG ++ G G G G G +GG SG G GG G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH--- 62

Query: 536 ALFGVGGVGGNGAWGAGGIGGDGGVGGAGSALGFGLGGAGGAGG 579
G GG GN G+G GG+ A A GF GAGG
Sbjct: 63 ---GNGGGNGNSGGGSGT-GGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 33.5 bits (76), Expect = 0.007
Identities = 36/104 (34%), Positives = 41/104 (39%), Gaps = 7/104 (6%)

Query: 1158 GLGGNGGTGGFSGPLSAGAGGGDGGAGGGVGLIGFGGAGGFGGASGFGTGGAGGVGGAGG 1217
G G N G SG ++ G G G G G +GG SG G GG G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH--- 62

Query: 1218 ALFGVGGVGGNGAWGAGGIGGDGGVGGAGSALGFGLGGAGGAGG 1261
G GG GN G+G GG+ A A GF GAGG
Sbjct: 63 ---GNGGGNGNSGGGSGT-GGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 32.8 bits (74), Expect = 0.013
Identities = 29/82 (35%), Positives = 34/82 (41%), Gaps = 1/82 (1%)

Query: 621 DGAGGAGGAGGSGGLLFG-DGGVGGAGGAGGISTSSSVNNGGAGGAGGNAGGLFGSGGAG 679
DG G GA + G + G G+G GGA S SS NN GG+G GSG
Sbjct: 5 DGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGN 64

Query: 680 GIGGAVGTGGTVAGAGGDGGGA 701
G G GG+ G A
Sbjct: 65 GGGNGNSGGGSGTGGNLSAVAA 86



Score = 32.8 bits (74), Expect = 0.013
Identities = 29/82 (35%), Positives = 34/82 (41%), Gaps = 1/82 (1%)

Query: 1303 DGAGGAGGAGGSGGLLFG-DGGVGGAGGAGGISTSSSVNNGGAGGAGGNAGGLFGSGGAG 1361
DG G GA + G + G G+G GGA S SS NN GG+G GSG
Sbjct: 5 DGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGN 64

Query: 1362 GIGGAVGTGGTVAGAGGDGGGA 1383
G G GG+ G A
Sbjct: 65 GGGNGNSGGGSGTGGNLSAVAA 86



Score = 32.4 bits (73), Expect = 0.016
Identities = 34/100 (34%), Positives = 38/100 (38%), Gaps = 3/100 (3%)

Query: 120 NGANGAPGTGQNGGDGGLLFGNGGAGGSGADGQNGGAGGNAGFFGSGGAGGAGGNTDLFG 179
N + NGG GL G G + GSG +N GG +G G G GN G
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG---G 66

Query: 180 NNGAVGGAGGAGGWLIGSGGAGGTGGIGAFNGGAGGAGGS 219
NG GG G GG L G GAGG S
Sbjct: 67 GNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVS 106



Score = 32.0 bits (72), Expect = 0.021
Identities = 30/80 (37%), Positives = 34/80 (42%), Gaps = 1/80 (1%)

Query: 215 GAGGSAWLFGTGGVGGTGGIGTLGVGGAGGAGGGSGVLSFADGGAGGFGGSGANAGGVGG 274
G G G G G G+G GGA GSG S + GG GSG + GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSG-WSSENNPWGGGSGSGIHWGGGSG 61

Query: 275 SGGAAGIFGTGGAGGTGGAG 294
G G +GG GTGG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 32.0 bits (72), Expect = 0.021
Identities = 30/80 (37%), Positives = 34/80 (42%), Gaps = 1/80 (1%)

Query: 897 GAGGSAWLFGTGGVGGTGGIGTLGVGGAGGAGGGSGVLSFADGGAGGFGGSGANAGGVGG 956
G G G G G G+G GGA GSG S + GG GSG + GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSG-WSSENNPWGGGSGSGIHWGGGSG 61

Query: 957 SGGAAGIFGTGGAGGTGGAG 976
G G +GG GTGG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 30.8 bits (69), Expect = 0.048
Identities = 31/100 (31%), Positives = 37/100 (37%), Gaps = 3/100 (3%)

Query: 802 NGTPGAAGSGADGTAGGWLLGNGGAGGSGADGQNGGAGGNAGFFGSGGAGGAGGNTDLFG 861
N + +G G +G G + GSG +N GG +G G G GN G
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG---G 66

Query: 862 NNGAVGGAGGAGGWLIGSGGAGGTGGIGAFNGGAGGAGGS 901
NG GG G GG L G GAGG S
Sbjct: 67 GNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVS 106


65   MMAR_5130   MMAR_5147   Y   Y   Y   Pathogenicity Island (biased-composition)
LocusTag    DNBias   CDNBias   %GCBias     Product
MMAR_5130   1        15        3.226718    hypothetical protein
MMAR_5131   1        14        3.091769    glycosyl transferase family protein
MMAR_5132   0        13        3.177483    hypothetical protein
MMAR_5133   -2       11        2.564300    UDP-glucose 4-epimerase GalE1
MMAR_5134   -1       10        2.786010    hypothetical protein
MMAR_5135   -1       9         3.102312    PE-PGRS family protein
MMAR_5136   -1       8         0.900976    *DNA polymerase III subunit delta'
MMAR_5137   -1       8         0.309567    adenylate cyclase
MMAR_5138   0        9         0.232932    DNA topoisomerase I
MMAR_5139   -3       13        0.294574    hypothetical protein
MMAR_5140   -3       14        -0.306731   cold-shock protein
MMAR_5141   -2       15        0.569975    DEAD/DEAH box helicase
MMAR_5142   0        18        1.426713    hypothetical protein
MMAR_5143   4        19        4.034217    hypothetical protein
MMAR_5577   2        15        4.032695    hypothetical protein
MMAR_5144   0        13        3.203785    hypothetical protein
MMAR_5145   1        13        3.121923    hypothetical protein
MMAR_5146   1        12        3.594241    hypothetical protein
MMAR_5147   -1       11        3.176093    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_5133  NUCEPIMERASE  244    2e-81    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 244 bits (624), Expect = 2e-81
Identities = 98/333 (29%), Positives = 156/333 (46%), Gaps = 28/333 (8%)

Query: 1 MRALVTGAAGFIGSTLVDRLLADGHTVVGLDNFATGRATNLEH----LVDNLAHVFVEAD 56
M+ LVTGAAGFIG + RLL GH VVG+DN +L+ L+ F + D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 57 IVDAD-LQAIFEQHRPEVIFHLAAQIDVRHSVADPQFDASVNVIGTLRLAEAARLTGVRK 115
+ D + + +F E +F ++ VR+S+ +P A N+ G L + E R ++
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 116 VVHTSSGGSIYGTPPQYPTSERVPTD-PASPYAAGKVAGEIYLNTFRHLYGLECSHIAPA 174
+++ SS S+YG + P S D P S YAA K A E+ +T+ HLYGL + +
Sbjct: 121 LLYASS-SSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFF 179

Query: 175 NVYGPRQDPHGEAGVVAIFAQALLSGKPTKVFGDGTNTRDYVFVDDVVDAFVRA------ 228
VYGP P + F +A+L GK V+ G RD+ ++DD+ +A +R
Sbjct: 180 TVYGPWGRPD---MALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPH 236

Query: 229 ------------GSDVGGGQRFNIGTGVETSDRQLHSAVAAAVGGPDDPEFHPPRLGDLK 276
+ + + +NIG A+ A+G P + GD+
Sbjct: 237 ADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQPGDVL 296

Query: 277 RSCLDISRAEEVLGWRPQVELADGVRRTVDYFR 309
+ D EV+G+ P+ + DGV+ V+++R
Sbjct: 297 ETSADTKALYEVIGFTPETTVKDGVKNFVNWYR 329


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_5135  cloacin  37     2e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 37.4 bits (86), Expect = 2e-04
Identities = 25/81 (30%), Positives = 36/81 (44%)

Query: 424 AGVGGIGGDGGDATAASSATGGHGGDGGDSDPSVGTGGNGGTGGNGGNGGAASLLFGNGG 483
+G G G + G + + + GG G G S G+G + GG G+ G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 484 SGGDGGVGGTGGTGGAGGSGA 504
G GG G +GG G GG+ +
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLS 82



Score = 35.8 bits (82), Expect = 6e-04
Identities = 35/119 (29%), Positives = 49/119 (41%), Gaps = 9/119 (7%)

Query: 267 LTGGNGDTGDTGGTGETGGTGGAGGSATASSGSATGGKGGAGGDPGGGANGGAGGAGGDA 326
++GG+G +TG +G G G++ G + +P GG +G GG
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG-- 58

Query: 327 ESFSGQAHGGAGGDGSQGGIPGGSGGAGGDGGKATGLASGIGGDGGDGGRGFATDVSGG 385
G HG GG+G+ GG GSG G A +A G G G A +S G
Sbjct: 59 ----GSGHGNGGGNGNSGG---GSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 35.1 bits (80), Expect = 9e-04
Identities = 28/111 (25%), Positives = 32/111 (28%)

Query: 404 GSGGNGGTGGTGGTGGTGAAAGVGGIGGDGGDATAASSATGGHGGDGGDSDPSVGTGGNG 463
G G G G T G G G G + S+ G G S G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 464 GTGGNGGNGGAASLLFGNGGSGGDGGVGGTGGTGGAGGSGAGGGVGGDGSS 514
G GG GN G S GN + G G G + S
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALS 113



Score = 35.1 bits (80), Expect = 0.001
Identities = 32/93 (34%), Positives = 37/93 (39%), Gaps = 3/93 (3%)

Query: 499 AGGSGAGGGVGGDGSSRGGIGGNGGSGANSMAIGGEGGVGGRGGDAGTSGLLIGNGGDGG 558
+GG G G G +S GG G G A G G G SG I GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 559 SGGAGGTGGTGGAGGAGAAGGAGGNGSTQIAFG 591
G GG G +GG G GG + +AFG
Sbjct: 62 HGNGGGNGNSGGGSG---TGGNLSAVAAPVAFG 91



Score = 33.9 bits (77), Expect = 0.002
Identities = 30/95 (31%), Positives = 36/95 (37%)

Query: 553 NGGDGGSGGAGGTGGTGGAGGAGAAGGAGGNGSTQIAFGGDGGNGGDGGDGAQGGEGGAT 612
N G + G G TG G GA+ G+G + GG G GG G GG
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69

Query: 613 GSVGGIGSQGGILFSHAGADGSTGAPGAGGAGGAG 647
S GG G+ G + A A GAGG
Sbjct: 70 NSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.8 bits (74), Expect = 0.005
Identities = 24/76 (31%), Positives = 30/76 (39%)

Query: 249 TGTAGTGAAGGNVSGNGPLTGGNGDTGDTGGTGETGGTGGAGGSATASSGSATGGKGGAG 308
+G G G G S +G + GG G GG + G SGS GG+G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 309 GDPGGGANGGAGGAGG 324
GGG GG+G
Sbjct: 62 HGNGGGNGNSGGGSGT 77



Score = 32.4 bits (73), Expect = 0.006
Identities = 27/102 (26%), Positives = 38/102 (37%)

Query: 238 LAGADGVNPTPTGTAGTGAAGGNVSGNGPLTGGNGDTGDTGGTGETGGTGGAGGSATASS 297
++G DG + +G G +G G G + +G + GG G+G S
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 298 GSATGGKGGAGGDPGGGANGGAGGAGGDAESFSGQAHGGAGG 339
G GG G G G + A A F + GAGG
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 32.4 bits (73), Expect = 0.007
Identities = 34/115 (29%), Positives = 40/115 (34%)

Query: 226 GDGGTGGQGGTGLAGADGVNPTPTGTAGTGAAGGNVSGNGPLTGGNGDTGDTGGTGETGG 285
GDG G +G PT G G + G S GG +G G G G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 286 TGGAGGSATASSGSATGGKGGAGGDPGGGANGGAGGAGGDAESFSGQAHGGAGGD 340
GG G++ SG+ A G GAGG A S S A A D
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 32.0 bits (72), Expect = 0.009
Identities = 34/96 (35%), Positives = 42/96 (43%), Gaps = 7/96 (7%)

Query: 349 GSGGAGGDGGKATGLASGIGGDGGDGGRGFATDVSGGTAGGGGTGGDGGRGGLLIGSGGN 408
G G G + G + + GG G G G A+D SG ++ GG G G I GG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSG---IHWGGG 59

Query: 409 GGTGGTGGTGGTGAAAGVGGIGGDGGDATAASSATG 444
G G GG G +G +G GG A AA A G
Sbjct: 60 SGHGNGGGNGNSGGGSGTGG----NLSAVAAPVAFG 91



Score = 31.6 bits (71), Expect = 0.012
Identities = 34/102 (33%), Positives = 39/102 (38%), Gaps = 1/102 (0%)

Query: 480 GNGGSGGDGGVGGTGGTGGAGGSGAGGGVGG-DGSSRGGIGGNGGSGANSMAIGGEGGVG 538
G G G + G T G G +G G G G DGS G G+ S G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 539 GRGGDAGTSGLLIGNGGDGGSGGAGGTGGTGGAGGAGAAGGA 580
G GG G SG G GG+ + A G GA G A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 31.2 bits (70), Expect = 0.017
Identities = 36/99 (36%), Positives = 42/99 (42%), Gaps = 9/99 (9%)

Query: 481 NGGSGGDGGVGGTGGTGGAGGSGAGGGVGGDGSSRGGIGGNGGSGANSMAIGGEGGVGGR 540
+GG G G +G G G GVGG S G+G S N+ GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASD-----GSGWSSENNPWGGGSGSGIHW 56

Query: 541 GGDAGTSGLLIGNGGDGGSGGAGGTGGTGGAGGAGAAGG 579
GG +G GG+G SGG GTGG A A A G
Sbjct: 57 GGGSGHG----NGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 31.2 bits (70), Expect = 0.017
Identities = 29/78 (37%), Positives = 34/78 (43%), Gaps = 6/78 (7%)

Query: 150 GSGAPGQNGGAGGNAGLLGSGGTGGVGGAGATGGTG-GTGGLLWGNG-----GIGGQGGT 203
G G N GA +G + G TG G GA+ G+G + WG G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 204 AMAGVNGGSPGHGGNGGN 221
G NG S G G GGN
Sbjct: 63 GNGGGNGNSGGGSGTGGN 80



Score = 30.8 bits (69), Expect = 0.020
Identities = 27/92 (29%), Positives = 31/92 (33%), Gaps = 2/92 (2%)

Query: 441 SATGGHGGDGGDSDPSVGTGGNGGTGGNGGNGGAASLLFGNGGSGGDGGVGGTGGTGGAG 500
S G G + G S G G GG S G G G GG+G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 501 GSGAGGGVGGDGSSRGGIGGNGGSGANSMAIG 532
GG G S G GGN + A +A G
Sbjct: 62 HGNGGGNGNSGGGS--GTGGNLSAVAAPVAFG 91



Score = 30.8 bits (69), Expect = 0.020
Identities = 33/114 (28%), Positives = 45/114 (39%), Gaps = 2/114 (1%)

Query: 538 GGRGGDAGTSGLLIGNGGDGGSGGAGGTGGTGGAGGAGAAGGAGGNGS-TQIAFGGDGGN 596
GG G T +GG G G GG G + G GS + I +GG G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 597 GGDGGDGAQGGEGGATGSVGGIGSQGGILFSHAGADGSTGAPGAGGAGGAGGAA 650
G GG+G GG G G++ + + F G+ G + GA AA
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGA-GGLAVSISAGALSAA 115



Score = 30.8 bits (69), Expect = 0.022
Identities = 31/103 (30%), Positives = 39/103 (37%), Gaps = 1/103 (0%)

Query: 577 AGGAGGNGSTQIAFGGDGGNGGDGGDGAQGGEGGATGSVGGIGSQGGILFSHAGADGSTG 636
+GG G +T NGG G G GG +G GG S G +G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 637 APGAGGAGGAGGAAGGFGAGGSGATAGADGKAGTAGTAGATGP 679
GG G +GG +G G + A A G + T GA G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALS-TPGAGGL 103



Score = 30.1 bits (67), Expect = 0.038
Identities = 29/85 (34%), Positives = 33/85 (38%), Gaps = 5/85 (5%)

Query: 315 ANGGAGGAGGDAESFSGQAHGGAGGDGSQGGIPGGSGGAGGDGGKATGLASGIGGDGGDG 374
+ G G A S SG +GG G G GG GSG + + G SGI GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 375 GRGFATDVSGGTAGGGGTGGDGGRG 399
GG GG G GG
Sbjct: 62 HGN-----GGGNGNSGGGSGTGGNL 81



Score = 29.7 bits (66), Expect = 0.045
Identities = 23/84 (27%), Positives = 28/84 (33%)

Query: 123 GNGADGAPGSGANGADGGILIGNGGTGGSGAPGQNGGAGGNAGLLGSGGTGGVGGAGATG 182
G G + S + +GG G G S G + G GSG G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 183 GTGGTGGLLWGNGGIGGQGGTAMA 206
G G G G GG +A
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVA 89
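
The Score/Expect and Identities/Positives/Gaps lines that head each alignment above follow the usual BLAST text layout. A minimal sketch for reading them into numbers is given here; `parse_hsp_stats` and both regular expressions are hypothetical helpers written for illustration and are not part of PredictBias or RPS-BLAST.

```python
import re

# Hypothetical helper (an assumption, not a PredictBias function): parses the
# two statistic lines that precede each alignment block in this report.
SCORE_RE = re.compile(r"Score = ([\d.]+) bits \((\d+)\), Expect = (\S+)")
COUNT_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+)")


def parse_hsp_stats(score_line, counts_line):
    """Return bit score, raw score, E-value and match counts for one HSP."""
    m = SCORE_RE.search(score_line)
    if m is None:
        raise ValueError("not a Score line: %r" % score_line)
    evalue_text = m.group(3)
    # Some BLAST versions print very small E-values as "e-100"; float() needs "1e-100".
    if evalue_text.startswith("e"):
        evalue_text = "1" + evalue_text
    stats = {
        "bits": float(m.group(1)),
        "raw_score": int(m.group(2)),
        "evalue": float(evalue_text),
    }
    # Each count is stored as (matches, alignment_length).
    for name, num, den in COUNT_RE.findall(counts_line):
        stats[name.lower()] = (int(num), int(den))
    # The Gaps field is omitted from the line when the alignment has no gaps.
    stats.setdefault("gaps", (0, stats["identities"][1]))
    return stats


stats = parse_hsp_stats(
    "Score = 32.0 bits (72), Expect = 0.021",
    "Identities = 30/80 (37%), Positives = 34/80 (42%), Gaps = 1/80 (1%)",
)
print(stats["bits"], stats["evalue"], stats["identities"], stats["gaps"])
```

Keeping the counts as integer pairs rather than the rounded percentages printed in the report lets a caller recompute percent identity at whatever precision is needed.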


66  MMAR_5180  MMAR_5207  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_5180  2       10       -1.541271  hypothetical protein
MMAR_5181  3       9        -0.875934  anti-anti-sigma regulatory factor
MMAR_5182  2       11       0.706456   anti-anti-sigma regulatory factor
MMAR_5183  1       10       0.835643   hypothetical protein
MMAR_5184  0       9        1.472810   putative regulatory protein
MMAR_5185  -1      10       2.368653   putative regulatory protein
MMAR_5186  -1      11       3.370833   sensor-component of a two-component regulator
MMAR_5187  -1      12       4.646968   13e12 repeat-containing protein
MMAR_5188  -1      11       4.023874   hypothetical protein
MMAR_5189  -1      12       4.206357   thiamine-monophosphate kinase
MMAR_5190  -1      15       3.303432   *selenocysteine synthase
MMAR_5191  -2      13       2.566098   selenocysteine-specific translation elongation
MMAR_5192  -1      14       2.415536   integral membrane transport protein
MMAR_5193  0       15       1.265623   hypothetical protein
MMAR_5194  0       17       2.011089   formate dehydrogenase subunit alpha,
MMAR_5195  0       16       2.026182   formate dehydrogenase subunit beta,
MMAR_5196  0       10       2.839294   Fe-S-cluster-containing hydrogenase, HybA
MMAR_5197  0       12       3.132657   formate-dependent nitrite reductase, membrane
MMAR_5198  -1      11       2.652914   hypothetical protein
MMAR_5199  -1      12       2.686207   hypothetical protein
MMAR_5200  0       13       2.610755   hypothetical protein
MMAR_5201  -1      13       2.923565   hypothetical protein
MMAR_5202  1       15       6.271961   methanol dehydrogenase transcriptional
MMAR_5203  1       16       5.519139   hypothetical protein
MMAR_5204  2       15       4.294762   hypothetical protein
MMAR_5205  3       16       4.407853   hypothetical protein
MMAR_5206  2       15       4.415386   PadR-like transcriptional regulatory protein
MMAR_5207  1       15       3.877780   PE-PGRS family protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
MMAR_5191  TCRTETOQM  44     1e-06    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 44.5 bits (105), Expect = 1e-06
Identities = 28/119 (23%), Positives = 48/119 (40%), Gaps = 18/119 (15%)

Query: 4 IATAGHVDHGKSTLVHRLT---------------DMWPDRLAEEQRRGLTIDLGFAWTEL 48
I HVD GK+TL L D E++RG+TI G +
Sbjct: 6 IGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQW 65

Query: 49 DGRQLAFVDVPGHERFVTNMLAGSGAMPPDSPVLFVVAATEGWMPQSEEHLAALDALRV 107
+ ++ +D PGH F+ + + D +L +++A +G Q+ AL + +
Sbjct: 66 ENTKVNIIDTPGHMDFLAEVYRSLSVL--DGAIL-LISAKDGVQAQTRILFHALRKMGI 121


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_5192  TCRTETB  155    3e-43    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 155 bits (392), Expect = 3e-43
Identities = 85/422 (20%), Positives = 176/422 (41%), Gaps = 25/422 (5%)

Query: 19 HQGLVLAVTCLALGTVVAAMASLNVALPDIARQTHADQTQQAWIVDAYSLVFASLLLPAG 78
H +++ + L+ +V+ M LNV+LPDIA + W+ A+ L F+ G
Sbjct: 12 HNQILIWLCILSFFSVLNEMV-LNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70

Query: 79 AVGDRYGRRLALLVGLVVFGAGSGVATLT-TDPTALAGLRALLGVGAALIMPATLSTITS 137
+ D+ G + LL G+++ GS + + + + L R + G GAA PA + + +
Sbjct: 71 KLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAF-PALVMVVVA 129

Query: 138 SF-PAQRRAAAVSIWTAVAGASGVVGLLASGLLLQWWSWQSIFWLNVLLAAVALTGTLLF 196
+ P + R A + ++ VG G++ + W + L + + + L+
Sbjct: 130 RYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMK 187

Query: 197 VPESAERDPAPLDVVGAVLAAAGIAVLVYAVIEAPAYGWTDPVTVGGIAVGLLILAGFVV 256
+ + R D+ G +L + GI + +Y + + + +L+ +
Sbjct: 188 LLKKEVRIKGHFDIKGIILMSVGIVFFMLFT---TSYSISFLI--------VSVLSFLIF 236

Query: 257 IEMGRR--YPLLDPRLFTNRRFAAGSLSITLQFFALFGFLFVIMQYLQSVRHYSALTAAL 314
++ R+ P +DP L N F G L + F + GF+ ++ ++ V S TA +
Sbjct: 237 VKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLS--TAEI 294

Query: 315 GLLAMPIGM---IPSSRLSPYLTERFGIRLPWVAGLLIVAEGLMVLAHLDSADPYWHIAS 371
G + + G I + L +R G G+ ++ + + L ++ +
Sbjct: 295 GSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWF-MTI 353

Query: 372 GLIPLGAGLGLAMPPATTAITGALPSRLQNVGSAVNDLARELGGALGIAVLGSLLTATYR 431
++ + GL +T ++ +L + G ++ + L GIA++G LL+
Sbjct: 354 IIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLL 413

Query: 432 NH 433
+
Sbjct: 414 DQ 415


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_5193  HTHTETR  69     2e-16    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 68.9 bits (168), Expect = 2e-16
Identities = 39/177 (22%), Positives = 64/177 (36%), Gaps = 16/177 (9%)

Query: 6 ARGRRAGKPDTRAKILEVARRRFLEGGYQGVKLRSVAAEAGVDLALISYYFGSKRGLVGE 65
AR + +TR IL+VA R F + G L +A AGV I ++F K L E
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 66 ALALSANPADVLDQAVAKGDPATFPQRVLAGLLALWEDPASGASLRALV------AGAAH 119
LS + L+ P + L+ + E + R L+
Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVG 121

Query: 120 DPALANRVKEMVERELIDKIAAQL----------SGTDARKRASMFCSQIAGLIVTR 166
+ A+ + + + E D+I L + R+ A + I+GL+
Sbjct: 122 EMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENW 178


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_5199  BACINVASINB  39     4e-05    Salmonella/Shigella invasin protein B signature.
		>BACINVASINB#Salmonella/Shigella invasin protein B signature.

Length = 593

Score = 38.6 bits (89), Expect = 4e-05
Identities = 35/140 (25%), Positives = 65/140 (46%), Gaps = 21/140 (15%)

Query: 237 LAGLVVVILVGVAAAANGATAALLGFPLVLLVGLLVAYLYTVLMFA-----PVL-IVLER 290
L L+ ++ V A GA+ AL L ++V + T + F P++ VL+
Sbjct: 321 LGALLTIVSVVAAVFTGGASLALAAVGLAVMVADEIVKAATGVSFIQQALNPIMEHVLK- 379

Query: 291 LPLVDAITRSFALVTGGFWRVLGIRLLTAIVVGLVGGAISAPFGIVGQILLGATASEGST 350
PL++ I ++ G LG+ TA + G + GAI A +V I++ A +G+
Sbjct: 380 -PLMELIGKAITKALEG----LGVDKKTAEMAGSIVGAIVAAIAMVAVIVVVAVVGKGAA 434

Query: 351 GMFLVGMTLSSIGSAISQII 370
+ +G+A+S+++
Sbjct: 435 ---------AKLGNALSKMM 445


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
MMAR_5202  HTHFIS  36     2e-04    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 36.0 bits (83), Expect = 2e-04
Identities = 15/47 (31%), Positives = 23/47 (48%), Gaps = 1/47 (2%)

Query: 117 DEINRTPPKTQAALLEAMEERQVSVEGQAKPLP-DPFIVAATQNPIE 162
DEI P Q LL +++ + + G P+ D IVAAT ++
Sbjct: 238 DEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLK 284


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
MMAR_5205  PERTACTIN  31     0.005    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 31.2 bits (70), Expect = 0.005
Identities = 20/51 (39%), Positives = 22/51 (43%), Gaps = 1/51 (1%)

Query: 242 ARLRPAGAGAPPGWPPQTPPAPVWWPGQPAPQPQIQPQFAPD-PAPSPPQG 291
A+ PA AP P P P PQP PQ P+ PAP PP G
Sbjct: 565 AKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAG 615



Score = 30.1 bits (67), Expect = 0.014
Identities = 17/45 (37%), Positives = 18/45 (40%)

Query: 248 GAGAPPGWPPQTPPAPVWWPGQPAPQPQIQPQFAPDPAPSPPQGP 292
GA APP P P P P P P QP P P P+ P
Sbjct: 564 GAKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAP 608


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_5206  cloacin  29     0.023    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 28.5 bits (63), Expect = 0.023
Identities = 15/37 (40%), Positives = 16/37 (43%)

Query: 55 PLGFGGGFGPGFGPGLGFGFGPGGARGGGRRGGPGRG 91
P G G G G +G G G G G G GG G G
Sbjct: 45 PWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNL 81


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_5207  cloacin  39     9e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.9 bits (90), Expect = 9e-05
Identities = 41/108 (37%), Positives = 43/108 (39%), Gaps = 2/108 (1%)

Query: 571 GGGGSGGGGGASGGTGG-TGGAGGLLSAGGAGGVGGAGFNDGGDGGAGGSGGLLGGLVGA 629
GG G G GA +G GG GL GGA G + GG GSG GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 630 GGGAGGNGGAGFGGTPGNGGAGGDAGLLGGPG-GTGGAGGYNTSGPGG 676
G G G G GT GN A G P T GAGG S G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 37.8 bits (87), Expect = 2e-04
Identities = 35/110 (31%), Positives = 47/110 (42%), Gaps = 6/110 (5%)

Query: 657 LGGPGGTGGAGGYNTSGPGGNGGSGGNAGTLFGSGGGGGNGGSGYSGIGGTGGTGGSAGL 716
+ G G G G +++ NGG G GGG + GSG+S G G +G+
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTG------LGVGGGASDGSGWSSENNPWGGGSGSGI 54

Query: 717 VFSDAGAGGFGGFGSTAGGTGGTGGNAVLLGGGGAGGAGGISFTGAGGQG 766
+ G GG +GG GTGGN + A G +S GAGG
Sbjct: 55 HWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 37.4 bits (86), Expect = 2e-04
Identities = 35/115 (30%), Positives = 42/115 (36%)

Query: 513 AGGSGAANTGASGGAGGAAGLLGTGGTGGAGARLAGGAGGTGGAGGAGGWLLGDGGGGGG 572
+GG G + + G TG G GA G G G GGG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 573 GGSGGGGGASGGTGGTGGAGGLLSAGGAGGVGGAGFNDGGDGGAGGSGGLLGGLV 627
G+GGG G SGG GTGG ++A A G G S G L +
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAI 116



Score = 37.4 bits (86), Expect = 3e-04
Identities = 36/109 (33%), Positives = 43/109 (39%), Gaps = 2/109 (1%)

Query: 125 NGGAGGSGAAGSAGGAGGAAGLIGAGGAGGAGGSSTGGAGGTGGAGGAGGWLFGPGGVGG 184
N GA + + G G G + G+G + ++ G G G GG G GG G
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69

Query: 185 AGGSSSSAGGAGGVGGAGGLFGGGGLG--GAGGAGVSASGGAGGAGGAG 231
G S GG A FG L GAGG VS S GA A A
Sbjct: 70 NSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 37.4 bits (86), Expect = 3e-04
Identities = 43/124 (34%), Positives = 49/124 (39%), Gaps = 3/124 (2%)

Query: 152 AGGAGGSSTGGAGGTGGAGGAGGWLFGPGGVGGAGGSSSSAGGAGGVGGAGGLFGGGGLG 211
+GG G GA T G G G GG G SS G G G+ GGG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 212 GAGGAGVSASGGAGGAGGAGGALAGFLGAGG---GDGGAGGSGVNHEGGAGGAGGAGGLI 268
G G SGG G GG A+A + G GAGG V+ GA A A +
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMA 121

Query: 269 AGTG 272
A G
Sbjct: 122 ALKG 125



Score = 37.0 bits (85), Expect = 3e-04
Identities = 27/80 (33%), Positives = 31/80 (38%)

Query: 747 GGGGAGGAGGISFTGAGGQGGAGGTGGQLSGNGGSGGTGGEGDYVGGADSGGAGGTGGNA 806
GG G G G T GG G G + GSG + + GG+ SG G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 807 GLTGDGGNGGNGGSGGTPGS 826
G G GN G G G S
Sbjct: 63 GNGGGNGNSGGGSGTGGNLS 82



Score = 37.0 bits (85), Expect = 4e-04
Identities = 24/71 (33%), Positives = 33/71 (46%)

Query: 614 GGAGGSGGLLGGLVGAGGGAGGNGGAGFGGTPGNGGAGGDAGLLGGPGGTGGAGGYNTSG 673
G SG + GG G G G G + G+G+ G G +G+ G G G GG N +
Sbjct: 12 GAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNS 71

Query: 674 PGGNGGSGGNA 684
GG+G G +
Sbjct: 72 GGGSGTGGNLS 82



Score = 35.8 bits (82), Expect = 7e-04
Identities = 28/76 (36%), Positives = 32/76 (42%)

Query: 761 GAGGQGGAGGTGGQLSGNGGSGGTGGEGDYVGGADSGGAGGTGGNAGLTGDGGNGGNGGS 820
G G GA T G ++G G GG G S GG+ GG G+G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 821 GGTPGSPGGGGTGGAL 836
GG S GG GTGG L
Sbjct: 66 GGNGNSGGGSGTGGNL 81



Score = 35.1 bits (80), Expect = 0.001
Identities = 41/125 (32%), Positives = 45/125 (36%), Gaps = 13/125 (10%)

Query: 630 GGGAGGNGGAGFGGTPGNGGAGGDAGLLGGPGGTGGAGGYNTSGPGGNGGSGGNAGTLFG 689
G G G N GA NGG G G GG G +S GG G+ G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTG-----LGVGGGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 690 SGGGGGNGGSGYSGIGGTGGTGGSAGLVFSDAGAGGFGGFGSTAGGTGGTGGNAVLLGGG 749
G G GG+G SG G G SA FG A T G GG AV + G
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSA--------VAAPVAFGFPALSTPGAGGLAVSISAG 110

Query: 750 GAGGA 754
A
Sbjct: 111 ALSAA 115



Score = 34.3 bits (78), Expect = 0.002
Identities = 33/106 (31%), Positives = 38/106 (35%), Gaps = 4/106 (3%)

Query: 549 GAGGTGGAGGAGGWLLGDGGGGGGGGSGGGGGASGGT----GGTGGAGGLLSAGGAGGVG 604
G G GA G + G G G GG G GG G+G G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 605 GAGFNDGGDGGAGGSGGLLGGLVGAGGGAGGNGGAGFGGTPGNGGA 650
G N GG G GG+ + V G A GAG + GA
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 33.9 bits (77), Expect = 0.003
Identities = 32/104 (30%), Positives = 37/104 (35%), Gaps = 2/104 (1%)

Query: 578 GGGASGGTGGTGGAGGLLSAGGAGGVGGAGFNDGGDGGAGGSGGLLGGLVGAGGGAGGNG 637
GG G G G ++ G G G G +DG G GG G+G GG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGS--GWSSENNPWGGGSGSGIHWGGGS 60

Query: 638 GAGFGGTPGNGGAGGDAGLLGGPGGTGGAGGYNTSGPGGNGGSG 681
G G GG GN G G G A G+ G GG
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.8 bits (74), Expect = 0.007
Identities = 29/112 (25%), Positives = 42/112 (37%), Gaps = 3/112 (2%)

Query: 242 GGDGGAGGSGVNHEGGAGGAGGAGGLIAGTGGNGGAGGTDAYSRGGAGGAGGDAGLLFGS 301
GGDG +G + G G G + G +G ++ GG G+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWG---GG 59

Query: 302 GGAGGTGGTGGTDMSDSGGTGGAGGNAGLLFGSGGAGGAGGAAVALNDVGGA 353
G G GG G + G + A + FG G +A++ GA
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 32.4 bits (73), Expect = 0.008
Identities = 30/86 (34%), Positives = 36/86 (41%), Gaps = 1/86 (1%)

Query: 206 GGGGLGGAGGAGVSASGGAGGAGGAGGALAGFLGAGGGDGGAGGSGVN-HEGGAGGAGGA 264
G G GA + +GG G G GGA G + + GGSG H GG G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 265 GGLIAGTGGNGGAGGTDAYSRGGAGG 290
GG GG+G G A + A G
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 30.8 bits (69), Expect = 0.026
Identities = 35/115 (30%), Positives = 41/115 (35%), Gaps = 6/115 (5%)

Query: 310 TGGTDMSDSGGTGGAGGNAGLLFGSGGAGGAGGAAVALNDVGGAGGAGGNAGLFGNGGVG 369
+GG + G GN G G G GGA +D G G G+
Sbjct: 2 SGGDGRGHNTGAHSTSGNIN--GGPTGLGVGGGA----SDGSGWSSENNPWGGGSGSGIH 55

Query: 370 GVGGVGAGDGGAGGRAGLVIGNGGAGGAGGESFGFGAGVGGAGGNGGNGVLIGNG 424
GG G G+GG G +G G GG A FG G GG V I G
Sbjct: 56 WGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 30.1 bits (67), Expect = 0.047
Identities = 23/73 (31%), Positives = 33/73 (45%), Gaps = 3/73 (4%)

Query: 489 NGTPGAAGSGTDGTPGGWLLGDGGAGGSGAANTGASGGAGGAAGLLGTGGTGGAGARLAG 548
N + +G P G +G G + GSG ++ G G +G+ GG+G G
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG---G 66

Query: 549 GAGGTGGAGGAGG 561
G G +GG G GG
Sbjct: 67 GNGNSGGGSGTGG 79


67  MMAR_5252  MMAR_5258  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_5252  2       12       1.937391   oxidoreductase
MMAR_5253  2       11       2.267177   hypothetical protein
MMAR_5254  2       10       2.095924   membrane-anchored adenylyl cyclase
MMAR_5255  2       11       1.759206   hypothetical protein
MMAR_5256  1       10       1.308240   hypothetical protein
MMAR_5257  2       10       0.899550   membrane-anchored adenylyl cyclase
MMAR_5258  2       16       -2.648919  PE family protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_5252  NUCEPIMERASE  54     3e-10    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 53.6 bits (129), Expect = 3e-10
Identities = 28/126 (22%), Positives = 50/126 (39%), Gaps = 9/126 (7%)

Query: 6 MHVLVTGGTGFVGGWTAKAIADAGHSVRFL-----VRNPAKLESSVAKLGVDVSDFIIGD 60
M LVTG GF+G +K + +AGH V + + + ++ + L F D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 61 IKDRDLVRE--ALTGCDAVVHSAAL--VATDQRQTQDMLSTNMEGARNVLGQAVALGLDP 116
+ DR+ + + A + V S V +N+ G N+L +
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 117 IVHVSS 122
+++ SS
Sbjct: 121 LLYASS 126


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_5254  PF05272  32     0.016    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 32.0 bits (72), Expect = 0.016
Identities = 16/46 (34%), Positives = 25/46 (54%), Gaps = 1/46 (2%)

Query: 232 LVGRGWEVAAVSTMLDRAIHGRGSVVGVTGPVGIGKSRLVREAIGL 277
LVG+ + V+ +++ SVV + G GIGKS L+ +GL
Sbjct: 575 LVGKYILMGHVARVMEPGCKFDYSVV-LEGTGGIGKSTLINTLVGL 619


68  MMAR_5289  MMAR_5326  Y  N  Y  Genomic Island
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_5289  12      32       4.993233   hydrolase
MMAR_5290  14      35       5.688525   hypothetical protein
MMAR_5291  16      38       6.321868   hypothetical protein
MMAR_5292  15      36       6.310209   hypothetical protein
MMAR_5293  14      36       6.091059   hypothetical protein
MMAR_5294  14      34       6.060030   PE-PGRS family protein
MMAR_5295  -1      14       0.011568   *cytidine/deoxycytidylate deaminase
MMAR_5296  -1      12       -0.735317  hypothetical protein
MMAR_5297  -1      10       -1.000981  prephenate dehydrogenase
MMAR_5298  -1      14       -1.569644  hypothetical protein
MMAR_5299  -1      15       -1.063353  osmoprotectant (glycine
MMAR_5300  0       15       -0.772014  osmoprotectant (glycine
MMAR_5301  2       12       4.121104   osmoprotectant transport ATP-binding protein ABC
MMAR_5302  7       18       7.140177   osmoprotectant (glycine
MMAR_5303  11      23       9.169739   hypothetical protein
MMAR_5304  10      23       8.842359   acyl-CoA dehydrogenase FadE36
MMAR_5305  11      24       9.037032   hypothetical protein
MMAR_5306  10      23       8.319962   PE-PGRS family protein
MMAR_5307  8       23       7.222306   PE-PGRS family protein
MMAR_5308  3       19       4.949839   PE-PGRS family protein
MMAR_5309  -1      11       -0.184135  hypothetical protein
MMAR_5310  -1      12       -0.507465  short-chain type dehydrogenase/reductase
MMAR_5311  0       11       -0.123662  acyl-CoA dehydrogenase FadE1
MMAR_5312  1       10       -0.295822  hypothetical protein
MMAR_5313  1       11       -0.476042  quinone oxidoreductase
MMAR_5314  0       8        -1.743094  hydrolase
MMAR_5315  -1      14       -2.123477  19 kDa lipoprotein antigen precursor LpqH
MMAR_5316  4       13       1.799062   two-component sensor kinase TcrY
MMAR_5317  7       15       3.136168   two-component transcriptional regulatory protein
MMAR_5318  7       14       3.626319   hypothetical protein
MMAR_5319  7       14       3.638878   hypothetical protein
MMAR_5320  4       15       4.083419   hypothetical protein
MMAR_5321  2       15       4.335583   PE-PGRS family protein
MMAR_5322  0       17       2.101126   PE-PGRS family protein
MMAR_5323  0       18       0.792536   O-methyltransferase
MMAR_5324  -1      17       0.660072   hypothetical protein
MMAR_5325  -2      16       0.571935   hypothetical protein
MMAR_5326  2       13       -2.022400  **putative aminotransferase
69  MMAR_5336  MMAR_5345  Y  N  N  Genomic Island
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_5336  5       18       5.503698   O-antigen/lipopolysaccharide transport ATP-
MMAR_5337  5       17       5.870525   L-rhamnosyltransferase
MMAR_5338  4       17       6.146614   O-antigen/lipopolysaccharide transport integral
MMAR_5339  9       20       8.434196   PE-PGRS family protein
MMAR_5340  9       19       7.566979   hypothetical protein
MMAR_5341  11      21       6.408028   PE-PGRS family protein
MMAR_5342  7       15       2.948122   hypothetical protein
MMAR_5343  7       16       3.201896   transcriptional regulator
MMAR_5344  6       14       3.051617   PE-PGRS family protein
MMAR_5345  3       14       -1.654512  hypothetical protein
70  MMAR_5357  MMAR_5368  Y  N  N  Genomic Island
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_5357  4       21       5.984556   integral membrane indolylacetylinositol
MMAR_5358  4       22       6.443277   hypothetical protein
MMAR_5359  0       22       5.632701   hypothetical protein
MMAR_5360  -1      23       5.216790   hypothetical protein
MMAR_5361  -1      22       5.175667   acyl-CoA dehydrogenase FadE35
MMAR_5362  -1      23       5.156860   PE-PGRS family protein
MMAR_5363  0       21       1.330666   propionyl-CoA carboxylase beta chain 4 AccD4
MMAR_5364  -1      19       1.508262   polyketide synthase Pks13
MMAR_5365  2       17       0.956928   long-chain-fatty-acid--CoA ligase
MMAR_5366  4       18       1.589632   hypothetical protein
MMAR_5367  3       18       1.049774   secreted Mpt51/Mpb51 antigen protein FbpD
MMAR_5368  2       17       0.324285   secreted antigen 85-A FbpA
71  MMAR_5414  MMAR_5439  Y  N  N  Genomic Island
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_5414  4       19       0.527324   hypothetical protein
MMAR_5415  4       19       0.591059   hypothetical protein
MMAR_5416  4       20       0.264741   hypothetical protein
MMAR_5417  6       20       0.622865   PPE family protein
MMAR_5418  6       22       0.387807   hypothetical protein
MMAR_5419  5       22       -0.165023  hypothetical protein
MMAR_5420  4       21       -0.038742  hypothetical protein
MMAR_5421  4       21       -0.609685  hypothetical protein
MMAR_5422  4       21       -0.167319  hypothetical protein
MMAR_5423  3       24       -0.775516  hypothetical protein
MMAR_5424  4       25       -0.955429  hypothetical protein
MMAR_5425  4       26       -1.021260  hypothetical protein
MMAR_5426  5       26       -1.153615  hypothetical protein
MMAR_5427  5       26       -1.004861  hypothetical protein
MMAR_5428  5       25       -1.334867  hypothetical protein
MMAR_5429  5       27       -1.598876  hypothetical protein
MMAR_5430  5       25       -1.049282  hypothetical protein
MMAR_5431  5       19       -0.298621  hypothetical protein
MMAR_5432  4       18       0.204585   hypothetical protein
MMAR_5433  2       17       0.612063   hypothetical protein
MMAR_5434  4       17       -0.250604  hypothetical protein
MMAR_5435  3       18       -0.448406  hypothetical protein
MMAR_5436  2       18       -0.995322  hypothetical protein
MMAR_5437  2       17       -2.196161  hypothetical protein
MMAR_5438  1       14       -2.812396  putative regulatory protein
MMAR_5439  1       13       -3.206822  hypothetical protein
72  MMAR_0087  MMAR_0094  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_0087  -2      12       -0.281171  hypothetical protein
MMAR_0088  -1      16       -1.795058  transcriptional regulator
MMAR_0089  0       17       -4.852710  hypothetical protein
MMAR_0090  0       18       -5.698250  transposase, ISMyma01_aa2
MMAR_0091  -1      22       -6.465393  transposase, ISMyma01_aa1
MMAR_0092  0       23       -6.874660  hypothetical protein
MMAR_0093  0       23       -5.322183  integral membrane efflux protein ErmB
MMAR_0094  0       25       -3.450315  transmembrane transport protein MmpL
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_0087  SACTRNSFRASE  35     3e-04    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 35.3 bits (81), Expect = 3e-04
Identities = 16/58 (27%), Positives = 27/58 (46%)

Query: 520 RLQIEGLRVAKAERAQGLGTALVEWAHNYGRAHGAQLAQVTTDEARERARAFYRRLGY 577
IE + VAK R +G+GTAL+ A + + + + T + A FY + +
Sbjct: 89 YALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0088  HTHTETR  67     7e-16    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 67.0 bits (163), Expect = 7e-16
Identities = 29/152 (19%), Positives = 54/152 (35%), Gaps = 5/152 (3%)

Query: 17 STERKGQRTRRRILDAARAVFAEVGYERATIRGIAAAAGVDKSSIIKYFGTKQALFHEAV 76
T+++ Q TR+ ILD A +F++ G ++ IA AAGV + +I +F K LF E
Sbjct: 4 KTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIW 63

Query: 77 HWD----IPVAELTTDDAGQTTENYARAMLTAWAADPNSPMAVLLRTSMTSEDAADILRR 132
+ + R +L + L + +
Sbjct: 64 ELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEM 123

Query: 133 HITAQGVDAVA-ATIDASDARLRAAVAGAILM 163
+ Q + + D + L+ + +L
Sbjct: 124 AVVQQAQRNLCLESYDRIEQTLKHCIEAKMLP 155


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0093  TCRTETB  131    3e-35    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 131 bits (331), Expect = 3e-35
Identities = 89/413 (21%), Positives = 173/413 (41%), Gaps = 23/413 (5%)

Query: 47 ICVFASVAVNLANTAVSVAQRSLIVTFGSNQAVVAWTVTAYTLTEAAAIPLSGWAADRIG 106
+C+ + +V L ++V+ + F A W TA+ LT + + G +D++G
Sbjct: 19 LCILSFFSV-LNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLG 77

Query: 107 TKRLFMISVLGFTLGSVLCAVAPNIACLIIF-RAVQGGGGGILMPLVITILAREAGPNRL 165
KRL + ++ GSV+ V + L+I R +QG G LV+ ++AR
Sbjct: 78 IKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENR 137

Query: 166 ARLMSVMGIPLLLGPMAGPILGGWLIDDYGWQWIFWINVPIGLITVALAAIAFPGDHTAP 225
+ ++G + +G GP +GG + W ++ +P+ I +
Sbjct: 138 GKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLL--LIPMITIITVPFLMKLLKKEVRI 195

Query: 226 SETLDIIGMLLLSPGLATFLYGLSTVPARGTVADRHVLIPATAGLVLMGAFVFHALYRAD 285
DI G++L+S G+ F+ T LI + ++ FV H +
Sbjct: 196 KGHFDIKGIILMSVGIVFFMLFT-------TSYSISFLIVSVLSFLI---FVKHIR-KVT 244

Query: 286 RPLIDLRLFRNR--VVTVANATIVFVAAGFSGAVLLVPSYFQQLLRETPLQVG-IHMIPL 342
P +D L +N ++ V I+F +G V +VP + + + + ++G + + P
Sbjct: 245 DPFVDPGLGKNIPFMIGVLCGGIIFGT--VAGFVSMVPYMMKDVHQLSTAEIGSVIIFPG 302

Query: 343 GLGAAVTIPTSSVLMDRHGAGKVVLGGVTLISVGMGTLAFGAAEHAAYVPTLLIGLTIVG 402
+ + +L+DR G V+ GVT +SV T +F + ++ I + V
Sbjct: 303 TMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFM---TIIIVFVL 359

Query: 403 MGIGSIMLQLTTVAVQTLAPHQIARGSTLVSVNQQLSASASTALMSVILTSQF 455
G+ ++T+ +L + G +L++ LS A++ +L+
Sbjct: 360 GGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPL 412


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_0094  ACRIFLAVINRP  49     7e-08    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 49.5 bits (118), Expect = 7e-08
Identities = 51/302 (16%), Positives = 98/302 (32%), Gaps = 51/302 (16%)

Query: 153 IEAVRQILARTPP--PPGIKVYVTGPSALTADMSRTGDKSL--VIVTMI--SVLVIFTML 206
+A++ LA P P G+KV D + S+ V+ T+ +LV M
Sbjct: 303 AKAIKAKLAELQPFFPQGMKVLYP------YDTTPFVQLSIHEVVKTLFEAIMLVFLVMY 356

Query: 207 LLVYRSIVTVTLLLITVGIELTAARGVVAFLAWHGVIGLSTYAINLLT----TMAIAAGT 262
L +++ + I V + L ++A G S IN LT +AI
Sbjct: 357 L-FLQNMRATLIPTIAVPVVLLGTFAILAAF------GYS---INTLTMFGMVLAIGLLV 406

Query: 263 DYSIFIIGRYQEARQ-AGEDAETAFYTMYRGVAHVILGSGLTIAGA---MYCLTFTRMPY 318
D +I ++ + + A + ++G + ++ M +
Sbjct: 407 DDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAI 466

Query: 319 FQSMGIPCAAGMLVAVVAALTMGPAVLAL-------------GSRFGLFDPKRKIKTRGW 365
++ I + M ++V+ AL + PA+ A G FG F+ +
Sbjct: 467 YRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHY 526

Query: 366 RRIGTAVVRWPAPILTATCAVSLVGLIALPGYQTNYDDQTYVPENIPANAGYAAANRHFP 425
++ L ++A +++PE + G P
Sbjct: 527 TNSVGKILGSTGRYLLI-----YALIVAGMVVLFLRLPSSFLPEE---DQGVFLTMIQLP 578

Query: 426 PS 427

Sbjct: 579 AG 580



Score = 48.7 bits (116), Expect = 1e-07
Identities = 33/162 (20%), Positives = 66/162 (40%), Gaps = 7/162 (4%)

Query: 761 DLIIAGLSSLCLIFIIMLLITRGFVAALVIVGTVALSLGVSFGLSVLLWQHLLRIELHWL 820
+++ ++ L+F++M L + A L+ V + L +F + + + + +
Sbjct: 339 EVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGM 398

Query: 821 VLAMSVIVLLAVGSDYNLLLVSRLKEEVGAGIKTGIIRAMGGTGKVVTSAGLVFA---LT 877
VLA+ ++V A+ N V R+ E K ++M + +V + +
Sbjct: 399 VLAIGLLVDDAIVVVEN---VERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIP 455

Query: 878 MASMAVSDLIVIGQIGTTIGLGLLFDTLIVRSLMTPSIAALL 919
MA S + Q TI + L+ L TP++ A L
Sbjct: 456 MAFFGGSTGAIYRQFSITIVSAMALSVLVALIL-TPALCATL 496



Score = 40.2 bits (94), Expect = 4e-05
Identities = 40/213 (18%), Positives = 76/213 (35%), Gaps = 15/213 (7%)

Query: 145 GGPLANQSIEAVRQILAR--TPPPPGIKVYVTGPSALTADMSRTGDKSLVIVTMISVLVI 202
G S ++ + P GI TG ++ +G + IS +V+
Sbjct: 828 GEAAPGTSSGDAMALMENLASKLPAGIGYDWTG---MSYQERLSG-NQAPALVAISFVVV 883

Query: 203 FTMLLLVYRSIVTVTLLLITVGIELTAARGVVAFLAWHGVIGLSTYAINLLTTMAIAAGT 262
F L +Y S +++ V + + GV+ + + LLTT+ ++A
Sbjct: 884 FLCLAALYESWSIPVSVMLVVPLGIV---GVLLAATLFNQKNDVYFMVGLLTTIGLSAKN 940

Query: 263 DYSIFIIGRYQEARQA-GEDAETAFYTMYRGVAHVILGSGLTIAGAMYCLTFTRMP---Y 318
+I I+ ++ + G+ A R IL + L + L +
Sbjct: 941 --AILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGA 998

Query: 319 FQSMGIPCAAGMLVAVVAALTMGPAVLALGSRF 351
++GI GM+ A + A+ P + R
Sbjct: 999 QNAVGIGVMGGMVSATLLAIFFVPVFFVVIRRC 1031


73  MMAR_0098  MMAR_0108  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_0098   0      29       -1.894984  polyketide synthase
MMAR_0099   0      29       -1.748551  non-ribosomal peptide synthetase
MMAR_0100   0      29       -1.481420  4-hydroxybenzoate synthetase
MMAR_0101   0      26       -1.086339  polyketide synthase
MMAR_0102   0      21       -0.459362  polyketide synthase
MMAR_0103   0       9        0.259933  hypothetical protein
MMAR_0104   0      11       -0.237671  hypothetical protein
MMAR_0105  -1      11       -0.029266  hypothetical protein
MMAR_0106  -2      10       -0.087236  hypothetical protein
MMAR_0107  -2      10       -0.516794  cellobiohydrolase a (1,4-beta-cellobiosidase a)
MMAR_0108  -3      10       -1.300984  transcriptional regulatory protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_0098  ISCHRISMTASE  38     1e-04    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 38.1 bits (88), Expect = 1e-04
Identities = 19/75 (25%), Positives = 30/75 (40%), Gaps = 2/75 (2%)

Query: 933 CTEIERTVAAAIGQVLGIAQVSRDDGFLALGGDSVSAMRLAARVRAEGLPLTPELLFEHA 992
C I + +A + ++ + L G DSV M L + R EG +T L E
Sbjct: 232 CENIRKQIAELLQ--ETPEDITDQEDLLDRGLDSVRIMTLVEQWRREGAEVTFVELAERP 289

Query: 993 TVRQIAAALQEAANQ 1007
T+ + L + Q
Sbjct: 290 TIEEWQKLLTTRSQQ 304


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_0101  DHBDHDRGNASE  39     1e-04    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 39.3 bits (91), Expect = 1e-04
Identities = 34/164 (20%), Positives = 62/164 (37%), Gaps = 11/164 (6%)

Query: 2009 VLITGGTGMAGGVLARHVVGAGGVRHVVLASRQGEQAPGVAELVSELSAAGAEVLVLACD 2068
ITG G +AR + G H+ E+ + ++VS L A D
Sbjct: 11 AFITGAAQGIGEAVARTLASQGA--HIAAVDYNPEK---LEKVVSSLKAEARHAEAFPAD 65

Query: 2069 VADRDAVAQMMEQVRRRCPPLTGVIHAAGVLDDTVIASLTPDRMNPVLRAKVDGAWHLHQ 2128
V D A+ ++ ++ R P+ +++ AGVL +I SL+ + G ++ +
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 2129 -----LTRGLGLSMFVLCSSIAGVMGSPGQGNYAAANTFLDALA 2167
+ S+ + S+ AGV YA++
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGV-PRTSMAAYASSKAAAVMFT 168


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_0102  DHBDHDRGNASE  42     1e-05    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 42.3 bits (99), Expect = 1e-05
Identities = 34/164 (20%), Positives = 65/164 (39%), Gaps = 11/164 (6%)

Query: 1738 VLITGGTGLVGSVLARHLVSAYGVRNLVLVSRMGEQGAGVAELVDELSDAGARVLVAACD 1797
ITG +G +AR L S G ++ + + ++V L D
Sbjct: 11 AFITGAAQGIGEAVARTLASQ-GAH----IAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 1798 VADQSAVEKLIAGWGREYPALTGVIHAAGVLDDAVITSMTPDQVDSVLRAKVDGAWNLHH 1857
V D +A++++ A RE + +++ AGVL +I S++ ++ ++ G +N
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 1858 AT-----RGLGLSMFVLCSSIAGVVGAPGQGNYAAANAFLDALV 1896
+ S+ + S+ AGV YA++ A
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRT-SMAAYASSKAAAVMFT 168


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0105  PF06580  28     0.003    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 28.3 bits (63), Expect = 0.003
Identities = 12/63 (19%), Positives = 23/63 (36%), Gaps = 5/63 (7%)

Query: 10 LARIH--AKRSFWWHLGAYILGNAALVAIWFFTSGGYFWPIWPALGWGIGLVFHGLGVFL 67
+A H A + +W+ +G F + Y P ++ + I + GL +
Sbjct: 1 MASTHRQANKYYWY---CQGIGWGVYTLTGFGFASLYGSPKLHSMIFNIAISLMGLVLTH 57

Query: 68 GMR 70
R
Sbjct: 58 AYR 60


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0108  HTHTETR  47     6e-09    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 47.3 bits (112), Expect = 6e-09
Identities = 21/118 (17%), Positives = 37/118 (31%), Gaps = 10/118 (8%)

Query: 11 ADGRQLRYQHRRGEIFDAVMAHVLEHGITGLSFRTLAAAVGVSHVTLRHHFGTKDQLLVE 70
A + Q R I D + + G++ S +A A GV+ + HF K L E
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 71 ILGAIGTRIV-IPEQLGADDVEALWRRWNEPGAQRRSQLLFEAYAQAVRHPDEYRGFL 127
I + I + + A E + ++ + R +
Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLRE---------ILIHVLESTVTEERRRLLM 110


74  MMAR_0237  MMAR_0261  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_0237   1       8       -0.237998  TetR family transcriptional regulator
MMAR_0238   1       8       -0.127443  TetR family transcriptional regulator
MMAR_0239   0       7       -0.014299  monooxygenase
MMAR_0240   0      10        0.591059  monoooxygenase
MMAR_0241  -2       9        0.709142  zinc-type alcohol dehydrogenase AdhD
MMAR_0242  -2      10        1.073881  PE-PGRS family protein
MMAR_0243   0      11        0.392383  short chain dehydrogenase
MMAR_0244  -1      15        0.199984  methyltransferase/methylase
MMAR_0245   0      15       -0.021365  enoyl-CoA hydratase, EchA8_7
MMAR_0246  -1      16       -0.382824  hypothetical protein
MMAR_0247   0      15        0.279532  hypothetical protein
MMAR_0248  -1      15        0.659834  short-chain type dehydrogenase/reductase
MMAR_0249  -2      12       -0.640622  TetR family transcriptional regulator
MMAR_0250  -2       9       -0.285872  hypothetical protein
MMAR_0251  -1      10       -0.381252  bifunctional Mta/Sah nucleosidase Mtn
MMAR_0253  -2       8       -0.321386  hypothetical protein
MMAR_0254  -2       8       -0.720290  ketoacyl reductase
MMAR_0255  -1       9       -1.363526  transmembrane transport protein MmpL
MMAR_0256  -2      11       -0.515952  non-ribosomal peptide synthetase
MMAR_0257   0      11       -1.317338  hypothetical protein
MMAR_0258  -2       9       -0.834159  acyl-CoA synthetase
MMAR_0259  -3      10        0.410878  hypothetical protein
MMAR_0260  -3       9        0.816284  oxidoreductase
MMAR_0261  -2      10        1.524637  PPE family protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0237  HTHTETR  78     5e-20    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 78.1 bits (192), Expect = 5e-20
Identities = 39/210 (18%), Positives = 67/210 (31%), Gaps = 11/210 (5%)

Query: 1 MGARGEQTRQRIITAAMRCVAEAGPAQASIREIAKAADMTSGSLYHYFPNKSELLNATAT 60
++TRQ I+ A+R ++ G + S+ EIAKAA +T G++Y +F +KS+L +
Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64

Query: 61 EIEEIVLPRLHA-AAASSDDIVDRLDAVLDESKRLMRDYPYLAGFLRAVRAENADQTG-- 117
E + A D + L +L + + +
Sbjct: 65 LSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMA 124

Query: 118 ---GGRRTYPGSKALRDVITEIVDDARGRGVLSAGIAPGAAVDAICALTRGLTEQAANLG 174
+R D I + + +L A + A + GL E
Sbjct: 125 VVQQAQRNLCL--ESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF-- 180

Query: 175 PQSYDVTLDSAKQLLRGSLFAGAKPTATHR 204
L + L T R
Sbjct: 181 -APQSFDLKKEARDYVAILLEMYLLCPTLR 209


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0238  HTHTETR  74     1e-18    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 74.3 bits (182), Expect = 1e-18
Identities = 34/177 (19%), Positives = 61/177 (34%), Gaps = 2/177 (1%)

Query: 6 LGRPVGASGEETRQRIIVATMQCVSKVGYARATIREIARTANVTSASLYNYFPNKSELIK 65
+ R +ETRQ I+ ++ S+ G + ++ EIA+ A VT ++Y +F +KS+L
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 66 ETIAARADAAMPRLR-RAAEGGGNIVDRIEAVLDECGELIREYPQLAAFEFAIRAEDGIA 124
E A+ G+ + + +L E + I +
Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKC-EF 119

Query: 125 LDAQEGSGQLGEVGFTAFREIIRGLVEDARRRGELAEQADTAGAIEAIYALIYGLTE 181
+ Q + I ++ L T A + I GL E
Sbjct: 120 VGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLME 176


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_0240  TYPE3OMOPROT  31     0.011    Type III secretion system outer membrane O protein ...
		>TYPE3OMOPROT#Type III secretion system outer membrane O protein family signature.

Length = 303

Score = 31.1 bits (70), Expect = 0.011
Identities = 26/64 (40%), Positives = 32/64 (50%), Gaps = 9/64 (14%)

Query: 465 TQCQSYFRSPSGRIVT-QWPYTELEYARRTWR--IKPRDWLHHKGVSPAAQRGGAGSAAS 521
T+CQ + R + T Q + L A + W IKP DWL H VSPA AG+A S
Sbjct: 20 TECQRHGREATLEYPTRQGMWVRLSDAEKRWSAWIKPGDWLEH--VSPAL----AGAAVS 73

Query: 522 AQTE 525
A E
Sbjct: 74 AGAE 77


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0242  cloacin  45     6e-07    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 45.1 bits (106), Expect = 6e-07
Identities = 27/82 (32%), Positives = 33/82 (40%)

Query: 144 GNGGDGGSGAAGQVGGNGGNAGLIGTGGAGGQGGSGVSTTSAQAGGRGGSGGLLFGNGGL 203
G G G + A GN G G GSG S+ + GG GSG G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 204 GGQGGSGGTSGTGGPGGPGGSA 225
G GG+G + G G GG +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAV 84



Score = 42.8 bits (100), Expect = 3e-06
Identities = 30/80 (37%), Positives = 35/80 (43%), Gaps = 2/80 (2%)

Query: 178 SGVSTTSAQAGGRGGSGGLLFGNGGLGGQGGSGGTSGTGGPGGP--GGSAQGIGDGGNGG 235
SG G SG + G GLG GG+ SG P GGS GI GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 236 SGGSGVDPGAGGAAGNGGRL 255
G G + +GG +G GG L
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 34.7 bits (79), Expect = 0.001
Identities = 23/61 (37%), Positives = 29/61 (47%), Gaps = 1/61 (1%)

Query: 135 DGGPGGLLYGNGGDGGSGAAGQVGGNGGNAGL-IGTGGAGGQGGSGVSTTSAQAGGRGGS 193
+GGP GL G G GSG + + GG +G I GG G G G + S G GG+
Sbjct: 21 NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGN 80

Query: 194 G 194

Sbjct: 81 L 81



Score = 31.2 bits (70), Expect = 0.011
Identities = 21/71 (29%), Positives = 29/71 (40%)

Query: 117 TGRQLIGDGADGAPGTGRDGGPGGLLYGNGGDGGSGAAGQVGGNGGNAGLIGTGGAGGQG 176
T + G G G G G N GGSG+ GG G+ G G +GG
Sbjct: 16 TSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGS 75

Query: 177 GSGVSTTSAQA 187
G+G + ++ A
Sbjct: 76 GTGGNLSAVAA 86


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_0243  DHBDHDRGNASE  47     3e-08    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 46.6 bits (110), Expect = 3e-08
Identities = 62/278 (22%), Positives = 103/278 (37%), Gaps = 49/278 (17%)

Query: 3 RDVLTVIGVGGMGQAIARRLGS-GKTVLLADNNADTLASVSETLAAEGHDVKSRGVDVCA 61
+ G+G+A+AR L S G + D N + L V +L AE ++ DV
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 62 AESVHDL-AQYAATLGAVTQLAHTAGL-------SPAQASAQAILAVDLLGVALVLQEFG 113
+ ++ ++ A+ +G + L + AG+ S + +A +V+ GV +
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 114 AVIAP--GGAGVIIASMAGHLLPPPSAEQERELAHRPPGQLLELDFVGSIVEPAFAYPFA 171
+ G+ V + S + P S AY +
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGV-PRTSMA---------------------------AYASS 160

Query: 172 KQANSIRVRAASRQWGQREARVNSISPGIISTPMGQQELASPVGDGMRAMIAMSGT---- 227
K A + + + + R N +SPG T M A +G +I S
Sbjct: 161 KAAAVMFTKCLGLELAEYNIRCNIVSPGSTETDMQWSLWADE--NGAEQVIKGSLETFKT 218

Query: 228 ----GRIGTPDDIAAAAAFLLGPEATFITGADLLVDGG 261
++ P DIA A FL+ +A IT +L VDGG
Sbjct: 219 GIPLKKLAKPSDIADAVLFLVSGQAGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_0248  DHBDHDRGNASE  58     5e-12    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 58.1 bits (140), Expect = 5e-12
Identities = 61/279 (21%), Positives = 102/279 (36%), Gaps = 66/279 (23%)

Query: 6 ITGSASGMGNATASRLREAGHRVIGVDLDGADVVADLSTQQGRLRAAS----DV------ 55
ITG+A G+G A A L G + VD + + +S+ + R A DV
Sbjct: 13 ITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSAAI 72

Query: 56 ------IAACDGRLDGAVLAAGLGPSPGPGRLHRIAQ--------VNYLGVVELLQAWRP 101
I G +D V AG+ PG +H ++ VN GV ++
Sbjct: 73 DEITARIEREMGPIDILVNVAGV---LRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSK 129

Query: 102 ALAAAERAKAVVIASNSTTTVPMVPRRSVRALLDHDADKAVRAVRLFGKAAPSLMYAASK 161
+ V + SN PR S+ A YA+SK
Sbjct: 130 YMMDRRSGSIVTVGSNPAGV----PRTSMAA------------------------YASSK 161

Query: 162 IAVSHWVRRQAVLPEWAGSGVRLNALAPGAIMTPLLAEQL--SDPTQAKAVRSFP----- 214
A + + + E A +R N ++PG+ T + L + + ++
Sbjct: 162 AAAVMFTK--CLGLELAEYNIRCNIVSPGSTETD-MQWSLWADENGAEQVIKGSLETFKT 218

Query: 215 -IPIGGFGEVTHMADWICFMLSDSADFLCGSVVFVDGGS 252
IP+ + + +AD + F++S A + + VDGG+
Sbjct: 219 GIPLKKLAKPSDIADAVLFLVSGQAGHITMHNLCVDGGA 257


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0249  HTHTETR  68     3e-16    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 67.7 bits (165), Expect = 3e-16
Identities = 38/203 (18%), Positives = 73/203 (35%), Gaps = 9/203 (4%)

Query: 16 KVREAQRLRTRARVFDAAVAEIGRRGLAGADVAAIAAAAGVARGTFYFHFPTKEHVLVEL 75
+ + + TR + D A+ ++G++ + IA AAGV RG Y+HF K + E+
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 76 -ERAEELAIVAKLRDPTADPTDLVSVLSSLVHQVVAV------ERRLGPVVFRDMLGLHF 128
E +E +L P D +SVL ++ V+ R L ++F +
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 129 APTRPVEDQLGEHPLAEFVIETIARAQRANRVPPDADAGELGVIFLTGLFALLATGATTP 188
+ + + +T+ A +P D +I + L+ P
Sbjct: 123 MAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAP 182

Query: 189 DARTAL--LNRFVTTVVHGMEAR 209
+ +V ++
Sbjct: 183 QSFDLKKEARDYVAILLEMYLLC 205


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_0254  DHBDHDRGNASE  75     4e-18    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 75.1 bits (184), Expect = 4e-18
Identities = 49/184 (26%), Positives = 81/184 (44%), Gaps = 3/184 (1%)

Query: 10 RAAIVTGASSGIGEEFARILSQRGYQVVLVARSADRLEALAGRL---GSDTHPLPADLSV 66
+ A +TGA+ GIGE AR L+ +G + V + ++LE + L PAD+
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 67 RSDRAGLVDRVAALGLVPDILINNAGLSTLGPVAKSVPEQELNLVEVDVAAVVDLCSRFL 126
+ + R+ DIL+N AG+ G + E+ V+ V +
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 127 PAMVERGRGAVLNVASVAGFAPLPGQAAYGAAKAFVLSYTHSLRGELHGSGVSVTALSPG 186
M++R G+++ V S P AAY ++KA + +T L EL + +SPG
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 187 PVDT 190
+T
Sbjct: 189 STET 192


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_0255  ACRIFLAVINRP  44     4e-06    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 43.7 bits (103), Expect = 4e-06
Identities = 50/269 (18%), Positives = 91/269 (33%), Gaps = 32/269 (11%)

Query: 208 TVAVILIMLLFVYRSPVTVFLLLVTVGLELTAARGAVALLGHSGLIGLSTFAVSLLTSLA 267
+ V L+M LF ++ + + V + L +A G+S L+ F + L A
Sbjct: 348 IMLVFLVMYLF-LQNMRATLIPTIAVPVVLLGTFAILAAFGYSINT-LTMFGMVL----A 401

Query: 268 IAAGTDYGIFIVGRYQEARQAGEDRESAFYTMYRGTAHV---ILGSGLTISAA---TFCL 321
I D I +V + ED+ + + + ++G + +SA
Sbjct: 402 IGLLVDDAIVVVENVERVMM--EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFF 459

Query: 322 SFTRMPYFQTLGIPCSVGMLVALLVALTLAPAVLVV-------------GGRFGAFDPKR 368
+ ++ I M +++LVAL L PA+ GG FG F+
Sbjct: 460 GGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTF 519

Query: 369 VLQVRGWRRVGTAIVRWPLPILAVTCAVALVGLITLPGYRPS----YNDRAYLPSFIPAN 424
V + I+ L + A+ + G++ L PS D+ + I
Sbjct: 520 DHSVNHYTNSVGKILGSTGRYLLIY-ALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLP 578

Query: 425 QGLAVADRHFSQARINPEILMIESDHDMR 453
G ++ L E +
Sbjct: 579 AGATQERTQKVLDQVTDYYLKNEKANVES 607



Score = 40.2 bits (94), Expect = 4e-05
Identities = 47/210 (22%), Positives = 82/210 (39%), Gaps = 15/210 (7%)

Query: 154 QGELLANESVEAVRKIVD--ATPAPPGVKAYVTGGPAMAADLHKSGDRSMAKITLTTVAV 211
QGE S +++ A+ P G+ TG M+ SG+++ A + ++ V
Sbjct: 827 QGEAAPGTSSGDAMALMENLASKLPAGIGYDWTG---MSYQERLSGNQAPALVAIS-FVV 882

Query: 212 ILIMLLFVYRSPVTVFLLLVTVGLELTAARGAVALLGHSGLIGLSTFAVSLLTSLAIAAG 271
+ + L +Y S +++ V L + A L + F V LLT++ ++A
Sbjct: 883 VFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDV---YFMVGLLTTIGLSAK 939

Query: 272 TDYGIFIVGRYQEARQA-GEDRESAFYTMYRGTAHVILGSGLTISAATFCLSFTRMP--- 327
I IV ++ + G+ A R IL + L L+ +
Sbjct: 940 N--AILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSG 997

Query: 328 YFQTLGIPCSVGMLVALLVALTLAPAVLVV 357
+GI GM+ A L+A+ P VV
Sbjct: 998 AQNAVGIGVMGGMVSATLLAIFFVPVFFVV 1027



Score = 39.4 bits (92), Expect = 7e-05
Identities = 34/160 (21%), Positives = 59/160 (36%), Gaps = 10/160 (6%)

Query: 773 LIAGIASLCLIFVIMLILTRALVAAGVIVGTVAISLGASFGLSVLLWQHIIGMPLHWLVI 832
L I L+F++M + + + A + V + L +F + I + + +V+
Sbjct: 344 LFEAIM---LVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVL 400

Query: 833 AMSVIVLLAVGSDYNLLLVSRFKQEIPAGINTGIIRSMGGTGKVVTNAGLVFAFT---MA 889
A+ ++V A+ N V R E +SM + +V + MA
Sbjct: 401 AIGLLVDDAIVVVEN---VERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMA 457

Query: 890 SMVVSDVRMIGQVGTTIGLGLLFDTLVVRAFMTPAIAALL 929
S + Q TI + LV TPA+ A L
Sbjct: 458 FFGGSTGAIYRQFSITIVSAMALSVLVALIL-TPALCATL 496


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_0257  ISCHRISMTASE  31     4e-04    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 30.8 bits (69), Expect = 4e-04
Identities = 20/57 (35%), Positives = 28/57 (49%)

Query: 19 IDETDLIDGDATDLRDLGLDSVRFVLLMKRLGVDRESELPSRLAENLSIEGWVSELS 75
+ ET D DL D GLDSVR + L+++ + LAE +IE W L+
Sbjct: 243 LQETPEDITDQEDLLDRGLDSVRIMTLVEQWRREGAEVTFVELAERPTIEEWQKLLT 299


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0261  PF03544  33     0.001    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 33.4 bits (76), Expect = 0.001
Identities = 21/109 (19%), Positives = 34/109 (31%), Gaps = 8/109 (7%)

Query: 271 TPITAPVAAPA--------ERSVADLAGPQPVVGLTPAASAFAPPSPQATSPAPSAPSPA 322
PI+ + APA + + P+P P AP + P P
Sbjct: 48 QPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKP 107

Query: 323 PTGSPPSSPAISYAVLGPAPPGVSSGPRATTGAAATATDTTRAAAPAAL 371
+ PA P ++ P T + ATA + + A+
Sbjct: 108 VKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVASG 156


75  MMAR_0381  MMAR_0390  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_0381   2      10       -1.308477  TetR family transcriptional regulator
MMAR_0382   1      10       -1.111599  PE-PGRS family protein
MMAR_0383   0      11        0.280456  PE family protein
MMAR_0384  -1      10        0.656226  PE family protein
MMAR_0385   0      14        1.492457  hypothetical protein
MMAR_0386   0      14        1.352433  hypothetical protein
MMAR_0387   0      13        1.419096  20-beta-hydroxysteroid dehydrogenase FabG3
MMAR_0388  -1      13        1.838034  oxidoreductase
MMAR_0389  -2      12        1.036919  putative regulatory protein
MMAR_0390  -1      10        1.640927  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0381  HTHTETR  118    1e-35    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 118 bits (298), Expect = 1e-35
Identities = 36/213 (16%), Positives = 78/213 (36%), Gaps = 22/213 (10%)

Query: 3 SETGLSRREELLAVATKLFAARGYHGTRMDDVADVIGLNKATVYHYYASKSLILFDIY-- 60
+ R+ +L VA +LF+ +G T + ++A G+ + +Y ++ KS + +I+
Sbjct: 6 KQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWEL 65

Query: 61 -----RQAAEGTLGALHAEPSWTAREALYQYTVRLLTGIADNPER---AAVYFQEQPYIA 112
+ +P RE L + +L R + F + ++
Sbjct: 66 SESNIGELELEYQAKFPGDPLSVLREIL----IHVLESTVTEERRRLLMEIIFHKCEFVG 121

Query: 113 EWFTPEQVAEVREKEAQVHEHVHGLIDRGIASGEFYE-CDSHVVA---LGYIGMTLGSYR 168
E +Q R + ++ + + I + + A GYI + +
Sbjct: 122 EMAVVQQAQ--RNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMEN-- 177

Query: 169 WLRPHGRRTAREIAAEFSTALLRGLIRDESIRN 201
WL ++ A ++ LL + ++RN
Sbjct: 178 WLFAPQSFDLKKEARDYVAILLEMYLLCPTLRN 210


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
MMAR_0384  RTXTOXINA  31     0.015    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 30.7 bits (69), Expect = 0.015
Identities = 25/118 (21%), Positives = 41/118 (34%), Gaps = 22/118 (18%)

Query: 20 GIAATLQVANSAAAAPTSGLLAAAQDEVSAAIA-------------KVFSAYGQEYQAAI 66
A L + +AA S + A +IA + F G + + +
Sbjct: 295 RAAQGLSTSAAAAGLIASAVTLAISPLSFLSIADKFKRANKIEEYSQRFKKLGYDGDSLL 354

Query: 67 AQ-------ASAFHTEFTRALAAAGAAYAQAEAANASLITAGVSDALTAITTPIQSLL 117
A A T + LA+ + + A SL+ A VS + A+T I +L
Sbjct: 355 AAFHKETGAIDASLTTISTVLASVSSGISAAATT--SLVGAPVSALVGAVTGIISGIL 410


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_0386  DPTHRIATOXIN  30     0.010    Diphtheria toxin signature.
		>DPTHRIATOXIN#Diphtheria toxin signature.

Length = 567

Score = 30.5 bits (68), Expect = 0.010
Identities = 23/80 (28%), Positives = 35/80 (43%), Gaps = 3/80 (3%)

Query: 54 TGIAAINDLDAVLATAPECVVYCAMGDTRLPDAMADVMRILAAGSNV---VGSSPGLLQY 110
+ + IN+ + A + E + R DAM + M AG+ V VGSS +
Sbjct: 177 SSVEYINNWEQAKALSVELEINFETRGKRGQDAMYEYMAQACAGNRVRRSVGSSLSCINL 236

Query: 111 PWGVIPDKYIARVQRAAEQG 130
W VI DK +++ E G
Sbjct: 237 DWDVIRDKTKTKIESLKEHG 256


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_0387  DHBDHDRGNASE  127    6e-38    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 127 bits (320), Expect = 6e-38
Identities = 76/251 (30%), Positives = 114/251 (45%), Gaps = 18/251 (7%)

Query: 4 VDGKVALISGGARGMGASHARLLVQEGAKVVIGDILDEEGKALAEEIGDAARYVH---LD 60
++GK+A I+G A+G+G + AR L +GA + D E+ + + + AR+ D
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 61 VTQPDQWEAAVATAVDEFGKLDVLVNNAGIVALGQLKKFDLGKWQKVIDVNLTGTFLGMR 120
V + A E G +D+LVN AG++ G + +W+ VN TG F R
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 121 AAVEPMTAAGSGSIINVSSIEGLRGAPAVHPYVASKWAVRGLTKSAALELAPLNIRVNSI 180
+ + M SGSI+ V S ++ Y +SK A TK LELA NIR N +
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 181 HPGFIRTPMTANLPDD---------------MVTIPLGRPAESREVSTFVVFLASDDASY 225
PG T M +L D IPL + A+ +++ V+FL S A +
Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGH 245

Query: 226 ATGSEFVMDGG 236
T +DGG
Sbjct: 246 ITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0388  PF00577  30     0.029    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 29.8 bits (67), Expect = 0.029
Identities = 28/159 (17%), Positives = 46/159 (28%), Gaps = 26/159 (16%)

Query: 30 GRYRGRASALVRPASADQVAEVLRVCRDAGAHVTVQGGRTSLVAGTVP-EHDDVLLSTER 88
G+ LV A + +V G +G +++ + V L T
Sbjct: 711 GQPLNDTVVLV---KAPGAKDA-KVENQTGVRTDWRG--YAVLPYATEYRENRVALDTNT 764

Query: 89 ICDVADVDTLERRVAVGAGA---------TLAAVQRAATAAG--LVFGVDLSARESATVG 137
+ D D+D V GA + T L FG +++ S + G
Sbjct: 765 LADNVDLDNAVANVVPTRGAIVRAEFKARVGIKLLMTLTHNNKPLPFGAMVTSESSQSSG 824

Query: 138 -----GMA---STNAGGLRTVRYGNMGEQVVGLDVALPD 168
G G V++G + LP
Sbjct: 825 IVADNGQVYLSGMPLAGKVQVKWGEEENAHCVANYQLPP 863


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0389  HTHTETR  57     2e-12    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 56.9 bits (137), Expect = 2e-12
Identities = 24/96 (25%), Positives = 39/96 (40%), Gaps = 2/96 (2%)

Query: 17 TRQRILAATAEVLARSGKTKLSLSEVAAQAGVSRPTLYRWFASKEELL-ATFSRYERQVF 75
TRQ IL + ++ G + SL E+A AGV+R +Y F K +L + E +
Sbjct: 12 TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIG 71

Query: 76 ESGLSKATAGLKG-VDKLDAALRFIVDYQYSYSGVR 110
E L + L L +++ + R
Sbjct: 72 ELELEYQAKFPGDPLSVLREILIHVLESTVTEERRR 107


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_0390  56KDTSANTIGN  28     0.041    Rickettsia 56kDa type-specific antigen protein sign...
		>56KDTSANTIGN#Rickettsia 56kDa type-specific antigen protein signature.

Length = 533

Score = 28.0 bits (62), Expect = 0.041
Identities = 25/111 (22%), Positives = 43/111 (38%), Gaps = 14/111 (12%)

Query: 58 LSRVVVDLIGAVPADGDLWVSSRIERGGKQIELVSAELAAPGPDGRPRAVARASGWRLAQ 117
+ + VD G + AD + I + K L P P P ++A R
Sbjct: 105 VGKGEVDSKGEIKADSGGGTDAPIRKPFK--------LTPPQPTMSPISIAD----RDFG 152

Query: 118 LDTQELRHAAEQPPRPLAEARNRKLAPMSWDRNYVHSLDWRWLTEPMTPGP 168
+D + A Q +P + R A ++W +N +D+ + +P PG
Sbjct: 153 IDIPNIPQAQRQAAQPPLNDQKRAAARIAWLKNCA-GIDYM-VKDPNNPGH 201


76  MMAR_0414  MMAR_0423  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_0414   2      16        0.237007  MCE-family protein Mce1C
MMAR_0415   3      15       -0.360891  MCE-family protein Mce1D
MMAR_0416   3      15        0.145242  MCE family lipoprotein LprK
MMAR_0417   2      13        0.417176  MCE-family protein Mce1F
MMAR_0418   0      14        0.196156  Mce associated membrane protein
MMAR_0419  -1      13       -0.032161  Mce associated transmembrane protein
MMAR_0420  -1      12       -0.152252  Mce associated protein
MMAR_0421   0      12        0.146818  Mce associated membrane protein
MMAR_0422   0      12        0.056381  lipoprotein LprO
MMAR_0423   0      12        0.347156  transmembrane protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0414  PF03544  36     2e-04    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 36.5 bits (84), Expect = 2e-04
Identities = 18/77 (23%), Positives = 19/77 (24%), Gaps = 1/77 (1%)

Query: 423 SSPPNPNGLPPTPGIPIAGRPGEPAPDAPGTPVPLPPDAPPGARTEPVGPAGPTPPPSTF 482
PP P P PI P E P P P P + P S
Sbjct: 67 QPPPEPVVEPEPEPEPIPEPPKEAPVVIE-KPKPKPKPKPKPVKKVEQPKRDVKPVESRP 125

Query: 483 APGLPPGPPAPPGPGPQ 499
A PA P
Sbjct: 126 ASPFENTAPARPTSSTA 142



Score = 33.8 bits (77), Expect = 0.001
Identities = 20/104 (19%), Positives = 28/104 (26%), Gaps = 2/104 (1%)

Query: 415 PAPLDGVESSPPNPNGLPPTPGI-PIAGRPGEPAPDAPGTPVPLPPDAPPGARTEPVGPA 473
PAP + + P L P + P EP P+ P P P +AP P
Sbjct: 44 PAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEP-PKEAPVVIEKPKPKPK 102

Query: 474 GPTPPPSTFAPGLPPGPPAPPGPGPQLPDPYITPGGTGGSGATG 517
P P P + + + A
Sbjct: 103 PKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAAT 146



Score = 30.3 bits (68), Expect = 0.014
Identities = 18/94 (19%), Positives = 25/94 (26%), Gaps = 10/94 (10%)

Query: 433 PTPGIPIAGRPGEPAPDAPGTPVPLPPDAPPGARTEPVGPAGPTP----PPSTFAPGLPP 488
P P PI+ PA P PP EP P P P
Sbjct: 44 PAPAQPISVTMVAPA----DLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKP 99

Query: 489 GPPAPPGPGPQLP--DPYITPGGTGGSGATGGSQ 520
P P P ++ + P + + +
Sbjct: 100 KPKPKPKPVKKVEQPKRDVKPVESRPASPFENTA 133


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_0415  TONBPROTEIN  33     0.002    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 33.0 bits (75), Expect = 0.002
Identities = 15/44 (34%), Positives = 15/44 (34%)

Query: 485 LPPIGLQAPVPIQPPPPGPEVIPGPVAPTPAPGPAPAPVGAPLP 528
L P P P P PE P P P AP P P P
Sbjct: 55 LEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKP 98



Score = 29.2 bits (65), Expect = 0.031
Identities = 20/80 (25%), Positives = 28/80 (35%), Gaps = 8/80 (10%)

Query: 460 PDIEPVQSTLQTPPGPPNAYDEYPVLPPIGLQAPVPIQPPPPGPEVIPGPVA-----PTP 514
+ VQ + P + P P +APV I+ P P P+ P PV P
Sbjct: 56 EPPQAVQPPPEPVVEPEPEPEPIPEPPK---EAPVVIEKPKPKPKPKPKPVKKVQEQPKR 112

Query: 515 APGPAPAPVGAPLPAEAGAG 534
P + +P A A
Sbjct: 113 DVKPVESRPASPFENTAPAR 132



Score = 29.2 bits (65), Expect = 0.034
Identities = 15/52 (28%), Positives = 15/52 (28%)

Query: 477 NAYDEYPVLPPIGLQAPVPIQPPPPGPEVIPGPVAPTPAPGPAPAPVGAPLP 528
D P PV P P P P AP P P P P P
Sbjct: 51 TPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKP 102


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0419  PF04335  28     0.034    VirB8 type IV secretion protein
		>PF04335#VirB8 type IV secretion protein

Length = 227

Score = 28.3 bits (63), Expect = 0.034
Identities = 10/91 (10%), Positives = 28/91 (30%), Gaps = 5/91 (5%)

Query: 219 TYDPSTLQEDFARARSLTTDKYREQLAVQQETVAKGHPV-----LNEYWVTDSSIQAATP 273
+ + +E F ++ +++ + +T P + +V +
Sbjct: 108 GWIAAAREEYFDAVMVMSARPEQDRWSRFYKTDNPQSPQNILANRTDVFVEIKRVSFLGG 167

Query: 274 DHATMLMFMQGRRGSPPEIRYISATVRVSFA 304
+ A + + GS AT++
Sbjct: 168 NVAQVYFTKESVTGSNSTKTDAVATIKYKVD 198


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0420  PF04335  27     0.030    VirB8 type IV secretion protein
		>PF04335#VirB8 type IV secretion protein

Length = 227

Score = 27.5 bits (61), Expect = 0.030
Identities = 13/102 (12%), Positives = 27/102 (26%), Gaps = 19/102 (18%)

Query: 24 KRQWGLPLAATAAAVLMAAAITVCSLM---------------FASHVTRYHAARKDHEVV 68
K W + A A A A+ + + AS + H
Sbjct: 33 KLAWVVAGVAGALATAGVVAVAALTPLKTVEPYVITVDRNTGEASIAAKLHGDATITYDE 92

Query: 69 NSVKSFMSEFT----SVDPYHANEYIEGILAHATGDFAKQYR 106
K F++ + EY + ++ + ++
Sbjct: 93 AVRKYFLATYVRYREGWIAAAREEYFDAVMVMSARPEQDRWS 134


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MMAR_0421  PF05616  33  0.001  Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 33.2 bits (75), Expect = 0.001
Identities = 20/66 (30%), Positives = 30/66 (45%), Gaps = 5/66 (7%)

Query: 40 PAAEAEPAEGAADDEATTDSVEDTGEASADSTDPDTDADADADAEADGAKGKAAARRARP 99
P E PAE A++ A ++ + + +PD D + DA+ + DG G A P
Sbjct: 327 PLPEVSPAENPANNPAPNEN-----PGTRPNPEPDPDLNPDANPDTDGQPGTRPDSPAVP 381

Query: 100 KRLTGR 105
R GR
Sbjct: 382 DRPNGR 387


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MMAR_0423  BINARYTOXINB  32  0.008  Binary toxin B family signature.
		>BINARYTOXINB#Binary toxin B family signature.

Length = 764

Score = 31.6 bits (71), Expect = 0.008
Identities = 21/113 (18%), Positives = 36/113 (31%), Gaps = 5/113 (4%)

Query: 102 DRAQVYGSAVIPPTFSSQLRDFAVSAVQPGSTDRPVITISTNPRAGTLGASIAGQTLTQA 161
D +V G I S + R V+A D I +S N T + QT T +
Sbjct: 264 DFEKVTGR--IDKNVSPEARHPLVAAYPIVHVDMENIILSKNEDQST--QNTDSQTRTIS 319

Query: 162 MAAANSK-VGERVTAEVAAQTGGAPLSGASALGLASPIDVKTTVYNPLPNGTG 213
+ S+ V + G+ + G ++ + + L
Sbjct: 320 KNTSTSRTHTSEVHGNAEVHASFFDIGGSVSAGFSNSNSSTVAIDHSLSLAGE 372


77  MMAR_0431  MMAR_0436  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
MMAR_0431  4  17  5.415143  PE-PGRS family protein
MMAR_0432  -1  10  0.351757  dihydroxy-acid dehydratase
MMAR_0433  -3  10  0.212885  hypothetical protein
MMAR_0434  -2  10  0.307590  integral membrane protein
MMAR_0435  -2  11  0.146244  secretory protein
MMAR_0436  -2  10  0.039478  transcriptional regulatory protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MMAR_0431  cloacin  39  1e-04  Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.5 bits (89), Expect = 1e-04
Identities = 34/85 (40%), Positives = 42/85 (49%), Gaps = 4/85 (4%)

Query: 432 GQGGSGGNSGAGGTNGSGGNGGLGGAGGDGGSGTADTGGGAGNGGSGGKGGTGGVSGVAG 491
G G G N+GA T+G+ GG G G G A G G + + GG+G G
Sbjct: 3 GGDGRGHNTGAHSTSGNIN----GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 492 GGGAGGTGGTGGTGGAGGTGGNGAA 516
G G G GG G +GG GTGGN +A
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSA 83



Score = 37.8 bits (87), Expect = 3e-04
Identities = 33/91 (36%), Positives = 37/91 (40%), Gaps = 5/91 (5%)

Query: 703 GAGGTGGTGGNSGTGGTSGNGGTGGTGGGGGYGGPGDYNENGYAGGSGGTGGTGGDPGTG 762
G G G G T G G TG GGG G G +EN G GG+G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWG-----GGSGSGIHWG 57

Query: 763 GTAAVGGDGGTGGIGGGGGYGGTGFDAAGAM 793
G + G GG G GGG G GG A +
Sbjct: 58 GGSGHGNGGGNGNSGGGSGTGGNLSAVAAPV 88



Score = 37.0 bits (85), Expect = 5e-04
Identities = 26/84 (30%), Positives = 36/84 (42%)

Query: 407 NGGTGGAGANAVAGTGDNGSDGAAGGQGGSGGNSGAGGTNGSGGNGGLGGAGGDGGSGTA 466
+GG G T N + G G G G + G+G ++ + GG G+G G G+
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 467 DTGGGAGNGGSGGKGGTGGVSGVA 490
GG GG G G +S VA
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVA 85



Score = 35.8 bits (82), Expect = 0.001
Identities = 30/108 (27%), Positives = 37/108 (34%)

Query: 1013 GGTGGIANGTGNGGTGGKGGTGGDGGTGGTATTAGQTGGDGGDGGKGGGGGTGGKANGGD 1072
GG G N + +G G G GG A+ + G G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1073 GGDGGDGGDGGAGGAGGNGDGAVGGIGGGGGTGGTGGTGTPPGGANGG 1120
G GG+G GG G GGN + G T G G + G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 34.7 bits (79), Expect = 0.002
Identities = 31/101 (30%), Positives = 40/101 (39%)

Query: 384 GGAAGVGGFGGSAGSYGNGGGGGNGGTGGAGANAVAGTGDNGSDGAAGGQGGSGGNSGAG 443
G G S NGG G G GGA + + +N G +G GG SG G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 444 GTNGSGGNGGLGGAGGDGGSGTADTGGGAGNGGSGGKGGTG 484
G+G +GG G GG+ + A G + G GG
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 34.7 bits (79), Expect = 0.002
Identities = 28/78 (35%), Positives = 36/78 (46%), Gaps = 1/78 (1%)

Query: 234 GDGGAGGAGGRAGLLGYGGAGGAGGLGGSGGAGLPNQQSGNGGGGGHGGAGGAAGWFGHA 293
G G GA +G + G G G G S G+G ++ + GGG G G G G+
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 294 GVGGDGGTG-GAGGNGQA 310
G G+ G G G GGN A
Sbjct: 66 GGNGNSGGGSGTGGNLSA 83



Score = 34.3 bits (78), Expect = 0.003
Identities = 25/80 (31%), Positives = 28/80 (35%)

Query: 1005 GTGGAGGFGGTGGIANGTGNGGTGGKGGTGGDGGTGGTATTAGQTGGDGGDGGKGGGGGT 1064
G G G T G NG G G G + G G + G +G GG G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 1065 GGKANGGDGGDGGDGGDGGA 1084
GG N G G G A
Sbjct: 66 GGNGNSGGGSGTGGNLSAVA 85



Score = 34.3 bits (78), Expect = 0.003
Identities = 38/121 (31%), Positives = 45/121 (37%), Gaps = 5/121 (4%)

Query: 300 GTGGAGGNGQAGQLSNDVGGDGGRGGAGGAAGAGGDAGLLGLNGAGGHGGGGGMGGTGGT 359
G G G N A S ++ G G GG A G GG G G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 360 GAAAAAGINAAAGGTGGDGGAAGAGGAAGVGGFGGSAGSYGNGGGGGNGGTGGAGANAVA 419
G G + GTGG+ A A A FG A S GG + GA + A+A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVA-----FGFPALSTPGAGGLAVSISAGALSAAIA 117

Query: 420 G 420

Sbjct: 118 D 118



Score = 33.9 bits (77), Expect = 0.004
Identities = 27/79 (34%), Positives = 28/79 (35%)

Query: 1051 GDGGDGGKGGGGGTGGKANGGDGGDGGDGGDGGAGGAGGNGDGAVGGIGGGGGTGGTGGT 1110
G G G G T G NGG G G GG G + GG G G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1111 GTPPGGANGGPGKTGADGL 1129
G G N G G L
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 33.9 bits (77), Expect = 0.004
Identities = 32/81 (39%), Positives = 37/81 (45%), Gaps = 1/81 (1%)

Query: 155 GGAGGSAGLIGNGGIGGQGGAGGIGGAGGSAAFFGNGGAGGHGGAGGAGGIGGNGGFFGN 214
GA ++G I G G G G G+G S+ GG G G G G GNGG GN
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGN 70

Query: 215 GGAG-GAGGNGSTGGVGLAGG 234
G G G GGN S +A G
Sbjct: 71 SGGGSGTGGNLSAVAAPVAFG 91



Score = 33.9 bits (77), Expect = 0.004
Identities = 25/77 (32%), Positives = 29/77 (37%)

Query: 772 GTGGIGGGGGYGGTGFDAAGAMGGIGGTGGTGGNPGPGGIGGNGGDGGTGGPGGYGDTGF 831
G G G G T + G G+G GG G G G G G +G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 832 GTAGYAGGTGGTGGTGG 848
G G G +GG GTGG
Sbjct: 63 GNGGGNGNSGGGSGTGG 79



Score = 33.5 bits (76), Expect = 0.005
Identities = 35/109 (32%), Positives = 39/109 (35%)

Query: 794 GGIGGTGGTGGNPGPGGIGGNGGDGGTGGPGGYGDTGFGTAGYAGGTGGTGGTGGAPGPG 853
GG G TG + G I G G GG G GG G+G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 854 GSAAGNGGGGGIGGIGGTGGSGATTAGFAGGDGGKGGTGGTGGSAGAGA 902
G+ GNG GG G GG + A F G GG S AGA
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 33.1 bits (75), Expect = 0.007
Identities = 29/87 (33%), Positives = 34/87 (39%), Gaps = 3/87 (3%)

Query: 454 LGGAGGDGGSGTADTGGGAGNGGSGGKGGTGGVSGVAGGGGAGGTGGTGGTGGAGGTGGN 513
+ G G G + A + G NGG G G GG S G G G G G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGAS---DGSGWSSENNPWGGGSGSGIHWG 57

Query: 514 GAAGKSDTGDIGGSGGYGGDGGNGGNS 540
G +G + G G SGG G GGN
Sbjct: 58 GGSGHGNGGGNGNSGGGSGTGGNLSAV 84



Score = 32.8 bits (74), Expect = 0.008
Identities = 30/89 (33%), Positives = 37/89 (41%), Gaps = 7/89 (7%)

Query: 966 GEGGFGGTGGNSYYAGNAGPGGQGGQGGAGANGAPGLA-------GGTGGAGGFGGTGGI 1018
G G G G +GN G G G GA+ G + GG+G +GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1019 ANGTGNGGTGGKGGTGGDGGTGGTATTAG 1047
NG GNG +GG GTGG+ G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 32.8 bits (74), Expect = 0.009
Identities = 26/77 (33%), Positives = 30/77 (38%), Gaps = 1/77 (1%)

Query: 839 GTGGTGGTGGAPGPGGSAAGNGGGGGIGGIGGTG-GSGATTAGFAGGDGGKGGTGGTGGS 897
G G G GA G+ G G G+GG G G + + GG G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 898 AGAGAANGSGGTGGQGG 914
G SGG G GG
Sbjct: 63 GNGGGNGNSGGGSGTGG 79



Score = 32.8 bits (74), Expect = 0.009
Identities = 33/83 (39%), Positives = 38/83 (45%), Gaps = 4/83 (4%)

Query: 653 GEGGTGGNSAAGGTSGDGTQVAWGRGGDGGDGGQGGYGGAGSFEQPGGTGGAGGTGGTGG 712
G G G N+ A TSG+ + G G G GG G S P G GG+G GG
Sbjct: 3 GGDGRGHNTGAHSTSGN---INGGPTGLGVGGGASDGSGWSSENNPWG-GGSGSGIHWGG 58

Query: 713 NSGTGGTSGNGGTGGTGGGGGYG 735
SG G GNG +GG G GG
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNL 81



Score = 32.8 bits (74), Expect = 0.009
Identities = 32/106 (30%), Positives = 39/106 (36%), Gaps = 6/106 (5%)

Query: 470 GGAGNGGSGGKGGTGGVSGVAGGGGAGGTGGTGGTGGAGGTGGNGAAGKSDTGDIGGSGG 529
GG G G + G T G + GG G GG G + N G S +G G G
Sbjct: 3 GGDGRGHNTGAHSTSG--NINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 530 YGGDGGNGGNSLPGGTNGAGGTGGTGGEGGGGGTGTPDTGSGGGDG 575
G+GG GNS G+G G G P + G G
Sbjct: 61 GHGNGGGNGNS----GGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 32.8 bits (74), Expect = 0.010
Identities = 27/79 (34%), Positives = 35/79 (44%)

Query: 501 TGGTGGAGGTGGNGAAGKSDTGDIGGSGGYGGDGGNGGNSLPGGTNGAGGTGGTGGEGGG 560
+GG G TG + +G + G G G G G+G +S G G+G G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 561 GGTGTPDTGSGGGDGGDGG 579
G G + SGGG G G
Sbjct: 62 HGNGGGNGNSGGGSGTGGN 80



Score = 32.8 bits (74), Expect = 0.010
Identities = 24/67 (35%), Positives = 31/67 (46%)

Query: 119 GANGTTVNGVGTPGGDGGILYGNGGNGGTSTNAATAGGAGGSAGLIGNGGIGGQGGAGGI 178
GA+ T+ N G P G G + G+G +S N GG+G G G G GG G
Sbjct: 12 GAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNS 71

Query: 179 GGAGGSA 185
GG G+
Sbjct: 72 GGGSGTG 78



Score = 32.4 bits (73), Expect = 0.011
Identities = 30/87 (34%), Positives = 36/87 (41%), Gaps = 2/87 (2%)

Query: 527 SGGYGGDGGNGGNSLPGGTNGAGGTGG-TGGEGGGGGTGTPDTGSGGGDGGDGGYGGYGG 585
SGG G G +S G NG G GG G G + + GGG G +GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 586 SGGLPGHGDGADGVAGTGGKGGAGGAA 612
G G + G +GTGG A A
Sbjct: 62 HGN-GGGNGNSGGGSGTGGNLSAVAAP 87



Score = 32.4 bits (73), Expect = 0.011
Identities = 25/82 (30%), Positives = 32/82 (39%)

Query: 582 GYGGSGGLPGHGDGADGVAGTGGKGGAGGAAGTGANATAGTDRYGGYGGDGGGGGGGGYG 641
G G G G + + G G GG A G+ ++ + +GG G G GGG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 642 GNSIRGTGGVGGEGGTGGNSAA 663
GN GG G G SA
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAV 84



Score = 32.4 bits (73), Expect = 0.012
Identities = 27/89 (30%), Positives = 33/89 (37%)

Query: 576 GDGGYGGYGGSGGLPGHGDGADGVAGTGGKGGAGGAAGTGANATAGTDRYGGYGGDGGGG 635
G G G G+ G+ +G G GG G + N G G + G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 636 GGGGYGGNSIRGTGGVGGEGGTGGNSAAG 664
G GG GNS G+G G A G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 32.4 bits (73), Expect = 0.012
Identities = 32/100 (32%), Positives = 40/100 (40%), Gaps = 1/100 (1%)

Query: 682 GDGGQGGYGGAGSFEQPGGTGGAGGTGGTGGNSGTG-GTSGNGGTGGTGGGGGYGGPGDY 740
G G+G GA S G G G G + G+G + N GG+G G +GG +
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 741 NENGYAGGSGGTGGTGGDPGTGGTAAVGGDGGTGGIGGGG 780
G G SGG GTGG+ G G GG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 32.4 bits (73), Expect = 0.012
Identities = 28/101 (27%), Positives = 35/101 (34%)

Query: 472 AGNGGSGGKGGTGGVSGVAGGGGAGGTGGTGGTGGAGGTGGNGAAGKSDTGDIGGSGGYG 531
+G G G G SG GG G G G + G+G + N G I GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 532 GDGGNGGNSLPGGTNGAGGTGGTGGEGGGGGTGTPDTGSGG 572
G G + GG+ G G G+GG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 31.6 bits (71), Expect = 0.018
Identities = 30/100 (30%), Positives = 38/100 (38%), Gaps = 1/100 (1%)

Query: 634 GGGGGGYGGNSIRGTGGV-GGEGGTGGNSAAGGTSGDGTQVAWGRGGDGGDGGQGGYGGA 692
GG G G+ + +G + GG G G A SG ++ GG G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 693 GSFEQPGGTGGAGGTGGTGGNSGTGGTSGNGGTGGTGGGG 732
G+ G +GG GTGG G G GG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 31.6 bits (71), Expect = 0.018
Identities = 33/104 (31%), Positives = 42/104 (40%), Gaps = 4/104 (3%)

Query: 205 IGGNGGFFGNGGAGGAGGNGSTGGVGLAGGDGGAGGAGGRAGLLGYGGAGGAG----GLG 260
+ G G N GA GN + G GL G G + G+G + +GG G+G G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 261 GSGGAGLPNQQSGNGGGGGHGGAGGAAGWFGHAGVGGDGGTGGA 304
G G G G G GG+ A A FG + G G A
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 30.8 bits (69), Expect = 0.032
Identities = 27/89 (30%), Positives = 32/89 (35%)

Query: 925 TNGTYEGMPGGVGGTGGAGGEGGLPGTGAGTAGSIGSAGNGGEGGFGGTGGNSYYAGNAG 984
T+G G P G+G GGA G G GS + G G G GG + +G
Sbjct: 16 TSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGS 75

Query: 985 PGGQGGQGGAGANGAPGLAGGTGGAGGFG 1013
G A A T GAGG
Sbjct: 76 GTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 30.8 bits (69), Expect = 0.033
Identities = 32/106 (30%), Positives = 38/106 (35%), Gaps = 4/106 (3%)

Query: 276 GGGGHGGAGGAAGWFGHAGVGGDGGTGGAGGNGQAGQLSNDVGGDGGRGGAGGAAGAGGD 335
GG G G GA G+ G G G G + +G S + GG G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 336 AGLLGLNGAGGHGGGGGMGGTGGTGAAAAAGINAAAGGTGGDGGAA 381
G +GG G GG + AA A T G GG A
Sbjct: 63 GNGGGNGNSGGGSGTGG----NLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 30.5 bits (68), Expect = 0.041
Identities = 25/85 (29%), Positives = 32/85 (37%), Gaps = 4/85 (4%)

Query: 350 GGGMGGTGGTGAAAAAGINAAAGGTGGDGGAAGAGGAAGV----GGFGGSAGSYGNGGGG 405
GG G + + IN G G GGA+ G + GG GS +G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 406 GNGGTGGAGANAVAGTGDNGSDGAA 430
GNGG G G+ + A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAP 87


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MMAR_0434  TCRTETA  43  1e-06  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 42.9 bits (101), Expect = 1e-06
Identities = 71/334 (21%), Positives = 118/334 (35%), Gaps = 27/334 (8%)

Query: 59 GTLLSWYALVAALTTVPLVRWTAHWPRRHALMVSLVCLTISQLISALAPNFAVLAAGRAL 118
G LL+ YAL+ L + + RR L+VSL + I A AP VL GR +
Sbjct: 46 GILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIV 105

Query: 119 CAITHG---LLWSVIAPIATRLVPPSHAGRATTSIYIGTSLALVIGSPLTAALSLMWGWR 175
IT + + IA I H G + G V+G L S +
Sbjct: 106 AGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGG-LMGGFSPHAPFF 164

Query: 176 LAAVCVTVAAAVVTVAARLLLPEMVLSAD---QLQYVGPRSRHHRNRALIIVSLITMVGV 232
AA + LLPE + + + P + R + +V+ + V
Sbjct: 165 AAAALNGLNFLTGC----FLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFF 220

Query: 233 TGHFVSY---TYIVVIIRQVVGVRGPSLAWLLAAYGVAGVVAVALVARPLDRRPKGTIIF 289
V V+ ++ LAA+G+ +A A++ P+ R G
Sbjct: 221 IMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAAR-LGERRA 279

Query: 290 CVAGLTFAFVLLTALAFGGH--LAPMTALVVGTGAIVLWGAAATAVSPMLQSAAMRSGAD 347
+ G+ LAF +A +++ +G I + P LQ+ R +
Sbjct: 280 LMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGM---------PALQAMLSRQVDE 330

Query: 348 DPDGASGLYVTAFQ-VGIMAGSLIGGLLYERSVA 380
+ G + A + + G L+ +Y S+
Sbjct: 331 ERQGQLQGSLAALTSLTSIVGPLLFTAIYAASIT 364


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MMAR_0435  PF03544  32  0.002  Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 32.3 bits (73), Expect = 0.002
Identities = 20/88 (22%), Positives = 26/88 (29%), Gaps = 2/88 (2%)

Query: 42 GDPPVIAPAEPAPDPLAPPPGPLALPPMPDPLAPPPFPDPLAPPPLVPVAAGPVAGQDP- 100
PP EP P+P P P P + + P P P P + +
Sbjct: 66 VQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRP 125

Query: 101 -TPFFGPPPFRPPSFNPVDGAMVGVAKP 127
+PF P RP S V
Sbjct: 126 ASPFENTAPARPTSSTATAATSKPVTSV 153



Score = 31.5 bits (71), Expect = 0.004
Identities = 11/81 (13%), Positives = 15/81 (18%)

Query: 32 PAVADPDDAPGDPPVIAPAEPAPDPLAPPPGPLALPPMPDPLAPPPFPDPLAPPPLVPVA 91
P+ P VI +P P P P + P P
Sbjct: 79 EPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPT 138

Query: 92 AGPVAGQDPTPFFGPPPFRPP 112
+ P
Sbjct: 139 SSTATAATSKPVTSVASGPRA 159


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MMAR_0436  HTHTETR  38  8e-06  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 38.1 bits (88), Expect = 8e-06
Identities = 18/152 (11%), Positives = 52/152 (34%), Gaps = 14/152 (9%)

Query: 5 RERMVISAVLLIRERGARATAISDVLQHSGAPRGSAYHYFPGGRTQLLCEAVDYAGAHVA 64
R+ ++ A+ L ++G +T++ ++ + +G RG+ Y +F ++ L E + + +++
Sbjct: 13 RQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHF-KDKSDLFSEIWELSESNIG 71

Query: 65 KIIGSANRSLDLLDTLIDQYREQLRATDFRAGCPVAAVSVEAGEPSDRERMAPVVEHAAA 124
++ + RE L + R + ++ H
Sbjct: 72 ELE--LEYQAKFPGDPLSVLREILIHV-LES----------TVTEERRRLLMEIIFHKCE 118

Query: 125 VFDRWSDLIAQRFVSDGIPLDSAHELAVTAMS 156
+ + + D + +
Sbjct: 119 FVGEMAVVQQAQRNLCLESYDRIEQTLKHCIE 150


78  MMAR_0484  MMAR_0492  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
MMAR_0484  -1  15  1.090902  hypothetical protein
MMAR_5555  -2  13  1.222154  PE-PGRS family protein
MMAR_0487  -1  13  2.698954  phosphotriesterase PHP
MMAR_0488  0  11  2.389498  acyl-CoA dehydrogenase FadE4
MMAR_0489  1  8  2.951276  TetR family transcriptional regulator
MMAR_0490  2  8  2.729617  succinate-semialdehyde dehydrogenase
MMAR_0491  2  7  3.446586  PPE family protein
MMAR_0492  1  7  3.684809  PE-PGRS family protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MMAR_0484  PERTACTIN  32  0.002  Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 31.6 bits (71), Expect = 0.002
Identities = 33/131 (25%), Positives = 40/131 (30%), Gaps = 8/131 (6%)

Query: 60 ADLAIVSTKADELRQASKIARDGAGTIGIAQRRVLHAVEDAHNAGFTVGEDFSVTDIRTS 119
+D +V A Q R+ +L A FT+ DI T
Sbjct: 491 SDKLVVMRDASG--QHRLWVRNSGSEPASGNTMLLVQTPRGSAATFTLANKDGKVDIGTY 548

Query: 120 RSSAEQAARQAQAQAQALATDIRQRAVEPLPPPAPGRLPPLTPGEMATPRLPAPPPQPPI 179
R + A A P P P PG P P + P P PPQPP
Sbjct: 549 RYRLAANGNGQWSLVGAKAPP------APKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQ 602

Query: 180 RGGADLAPAEP 190
R AP P
Sbjct: 603 RQPEAPAPQPP 613


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MMAR_5555  cloacin  36  2e-04  Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 36.2 bits (83), Expect = 2e-04
Identities = 36/107 (33%), Positives = 45/107 (42%), Gaps = 7/107 (6%)

Query: 135 GNGGNGGSGAPGQRGGDGGDAGLLGKGGYGGNGGAGQT------GGTGGSVGLFGNGGGG 188
G+G +GA G G LG GG G + G+G + GG GS +G G G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGG-GASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 189 GGGGAGGGAGGLGGDAGLVFGVGGPGGDGGDGGGTGGAGGAGGTFWG 235
G GG G +GG G G + V P G T GAGG +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109



Score = 36.2 bits (83), Expect = 2e-04
Identities = 37/110 (33%), Positives = 41/110 (37%), Gaps = 10/110 (9%)

Query: 159 GKGGYGGNGGAGQTGGTGGSVGLFGNGGGGGGGGAGGGAGGLGGDAGLVFGVGGPGGDGG 218
G G G N GA T G GG G G GGGA G + GG G G
Sbjct: 3 GGDGRGHNTGAHSTSGN--------INGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGI 54

Query: 219 DGGGTGGAGGAGGTFWGPGGAGGSGGHGGAGIAGNGGAGGPGGAVFGTGG 268
GG G G GG G G G G + +A G P + G GG
Sbjct: 55 HWGGGSGHGNGGGN--GNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 34.3 bits (78), Expect = 6e-04
Identities = 30/108 (27%), Positives = 36/108 (33%)

Query: 122 GTGQDGGDGGILWGNGGNGGSGAPGQRGGDGGDAGLLGKGGYGGNGGAGQTGGTGGSVGL 181
G G+ G NGG G GG +G + G G GGS
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 182 FGNGGGGGGGGAGGGAGGLGGDAGLVFGVGGPGGDGGDGGGTGGAGGA 229
G G G GGG+G G A + FG G G + GA
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MMAR_0489  HTHTETR  63  3e-14  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 62.7 bits (152), Expect = 3e-14
Identities = 28/157 (17%), Positives = 56/157 (35%), Gaps = 2/157 (1%)

Query: 13 RRAAIVEAAEAEFGAHGFSQGSLNVIARRARVAKGSLFQYFADKRDLYAYIADIASQRVR 72
R I++ A F G S SL IA+ A V +G+++ +F DK DL++ I +++ +
Sbjct: 12 TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIG 71

Query: 73 THIEGLIREL--DSSRPFFEFLTDLLDGWVAYFAEHPRERALHAAATLEVDTDARISVRS 130
+ D E L +L+ V + + +
Sbjct: 72 ELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQR 131

Query: 131 VIHRHYLEVLRPLVRDALARGDLRADSDTDALLSLLL 167
+ + + ++ + L AD T ++
Sbjct: 132 NLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMR 168


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MMAR_0492  cloacin  36  7e-04  Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 35.8 bits (82), Expect = 7e-04
Identities = 34/87 (39%), Positives = 44/87 (50%), Gaps = 5/87 (5%)

Query: 439 GAGTGGAGGAGAHGFGLLSGTAGAGGAGGAASAGTGGAGGL----GGTGFGLIAAGGDGG 494
G G G GA + G ++G G GG AS G+G + GG+G G+ GG G
Sbjct: 4 GDGRGHNTGAHSTS-GNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 495 GAGTGVGSNGGDGGGGGGAHAVLAAIA 521
G G G G++GG G GG AV A +A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVA 89



Score = 34.7 bits (79), Expect = 0.001
Identities = 33/113 (29%), Positives = 38/113 (33%), Gaps = 1/113 (0%)

Query: 133 GDAGGPGGLLFGNGGNGGSGSAGMAGGAGGSAGLIGNGGAGGAGGADAAGGPGGAGGWLW 192
GD G GN G G+ G G S G G G + G GG
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDG-SGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 193 GGGGAGGLGGAASGAGNGGAGGAGGAGGAFIAIGGVGGDGGAASSGTGGVGGA 245
G GG G G SG G + A F A+ G G A S G + A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 33.9 bits (77), Expect = 0.003
Identities = 32/118 (27%), Positives = 38/118 (32%)

Query: 633 TGTAGSGGNGGFVENFDFFGFGVAHGGDGGSGGAASGAGGIGGAGGNGGSGTTPLFGAYS 692
+G G G N G G G GG SG G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 693 GGAGGDGGAATGTGGAGGNGGAGGAASGLGVGIGGAGGHGGAAPTGNGGSGGAGAGGL 750
G GG G + G G GGN A A G G GG A + + G+ A +
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADI 119



Score = 33.5 bits (76), Expect = 0.003
Identities = 25/79 (31%), Positives = 29/79 (36%)

Query: 691 YSGGAGGDGGAATGTGGAGGNGGAGGAASGLGVGIGGAGGHGGAAPTGNGGSGGAGAGGL 750
++ GA G G G GG SG GG G+ GGSG GG
Sbjct: 9 HNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGN 68

Query: 751 GLIGAGGNGGDAGAGVAAA 769
G G G G + VAA
Sbjct: 69 GNSGGGSGTGGNLSAVAAP 87



Score = 33.1 bits (75), Expect = 0.005
Identities = 31/110 (28%), Positives = 43/110 (39%), Gaps = 1/110 (0%)

Query: 324 GQGGAGGIGGAASTGMAGAGGSGGSCVAFDFVGFAAAHGGAGGTGGAATGVGATAGAAGS 383
G+G G + G G G A D G+++ + GG G+ G +G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 384 GGLGVAIVGSGVGGVGGVGGAATGAGATA-AAGGAGGLGLAAVGSGTGGA 432
GG G + GSG GG A G A + GAGGL ++ A
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 33.1 bits (75), Expect = 0.005
Identities = 38/114 (33%), Positives = 45/114 (39%), Gaps = 5/114 (4%)

Query: 489 AGGDGGGAGTGVGSNGGDGGGGGGAHAVLAAIAGSGGQGQAGTSGFGGFGGSGGSAESLF 548
+GGDG G TG S G+ GG V G G +S +GG GS
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGV----GGGASDGSGWSSENNPWGGGSGSGIHWG 57

Query: 549 FSIGGAGGAGGDASTGGGGLGGNGGVAVAHSPIGID-IGIGGAGGHGGSGTSGA 601
G G G S GG G GGN A G + GAGG S ++GA
Sbjct: 58 GGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 33.1 bits (75), Expect = 0.005
Identities = 30/83 (36%), Positives = 35/83 (42%), Gaps = 3/83 (3%)

Query: 670 AGGIGGAGGNGGSGTTPLFGAYSGGAGGDGGAATGTGGAGGNGGAGGAASGLGVGIGGAG 729
+GG G G T+ G G GGA+ G+G + N GG G G GI G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGG---GSGSGIHWGG 58

Query: 730 GHGGAAPTGNGGSGGAGAGGLGL 752
G G GNG SGG G L
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNL 81



Score = 33.1 bits (75), Expect = 0.005
Identities = 33/112 (29%), Positives = 39/112 (34%), Gaps = 2/112 (1%)

Query: 456 LSGTAGAGGAGGAASAGTGGAGGLGGTGFGLIAAGGDGGGAGTGVGSNGGDGGGGGGAHA 515
+SG G G GA S GG G G G DG G + GG G G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVG--GGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 516 VLAAIAGSGGQGQAGTSGFGGFGGSGGSAESLFFSIGGAGGAGGDASTGGGG 567
G G G SG GG + + + F GAGG A + G
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 32.8 bits (74), Expect = 0.006
Identities = 26/92 (28%), Positives = 33/92 (35%), Gaps = 1/92 (1%)

Query: 728 AGGHGGAAPTGNGGSGGAGAGGLGLIGAGGNGGD-AGAGVAAADGGDGGNAGLVINGTYE 786
+GG G TG + G GG +G GG D +G G G +G+ G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 787 ASPYGNGGNGGNGVNGGSGGKGGSAGQVGGTP 818
G GN G G G +A G P
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFP 93



Score = 32.8 bits (74), Expect = 0.007
Identities = 37/119 (31%), Positives = 46/119 (38%), Gaps = 3/119 (2%)

Query: 652 GFGVAHGGDGGSGGAASGAGGIGGAGG-NGGSGTTPLFGAYSGGAGGDGGAATGTGGAGG 710
G G G SG G G+G GG + GSG + + GG+G G G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGS--GIHWGGGSGHG 63

Query: 711 NGGAGGAASGLGVGIGGAGGHGGAAPTGNGGSGGAGAGGLGLIGAGGNGGDAGAGVAAA 769
NGG G + G G G GAGGL + + G A A + AA
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMAA 122



Score = 32.0 bits (72), Expect = 0.010
Identities = 28/90 (31%), Positives = 39/90 (43%), Gaps = 5/90 (5%)

Query: 391 VGSGVGGVGGVGGAATGAGATAAAGGAGGLGLAAVGSGTGGAGGAGGVGAGTGGAGGAGA 450
+ G G+G GGA+ G+G ++ GG + + G G G GG +GG G G
Sbjct: 20 INGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGG 79

Query: 451 HGFGLLSGTAGAGGAGGAASAGTGGAGGLG 480
+ A A G + T GAGGL
Sbjct: 80 NL-----SAVAAPVAFGFPALSTPGAGGLA 104



Score = 31.6 bits (71), Expect = 0.014
Identities = 26/79 (32%), Positives = 35/79 (44%), Gaps = 1/79 (1%)

Query: 697 GDGGAATGTGGAGGNGGAGGAASGLGVGIGGAGGHG-GAAPTGNGGSGGAGAGGLGLIGA 755
G G TG +G G +GLGVG G + G G + GG G+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 756 GGNGGDAGAGVAAADGGDG 774
G GG+ +G + GG+
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 30.8 bits (69), Expect = 0.026
Identities = 37/122 (30%), Positives = 49/122 (40%), Gaps = 5/122 (4%)

Query: 172 AGGAGGADAAGGPGGAGGWLWGGGGAGGLGGAASGAG----NGGAGGAGGAGGAFIAIGG 227
+GG G G +G G G G GGA+ G+G N GG G+G + G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 228 VGGDGGAASSGTGGVGGAGGNADGLIVSLG-GAGGHGGDATTGIGGAGGAGGMATARIPA 286
G GG +SG G G +A V+ G A G + + GA A A I A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMA 121

Query: 287 GI 288
+
Sbjct: 122 AL 123



Score = 30.5 bits (68), Expect = 0.028
Identities = 38/118 (32%), Positives = 46/118 (38%), Gaps = 12/118 (10%)

Query: 193 GGGGAGGLGGAASGAGN--GGAGGAGGAGGAFIAIGGVGGDGGAASSGTGGVGGAGGNAD 250
GG G G GA S +GN GG G G GG DG SS G GG+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGG--------ASDGSGWSSENNPWG--GGSGS 52

Query: 251 GLIVSLGGAGGHGGDATTGIGGAGGAGGMATARIPAGINFNVGGAGGHGGAGATGGAG 308
G+ G G+GG GG+G G ++ P F G GG + AG
Sbjct: 53 GIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 30.5 bits (68), Expect = 0.034
Identities = 35/120 (29%), Positives = 44/120 (36%), Gaps = 4/120 (3%)

Query: 263 GGDATTGIGGAGGAGGMATARIPAGINFNVGGAGGHGGAGATGGAGGGGGSAYSGYVGIA 322
GGD GA G P G+ G + G G + GGG GS G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGG-PTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 323 FGQGGAGGIGGAASTGMAGAGGSGGSCVAFDFVGFAAAHGGAGGTGGAATGVGATAGAAG 382
G GG G G S G G + + VAF F + GAGG + + +A A
Sbjct: 62 HGNGGGNGNSGGGS-GTGGNLSAVAAPVAFGFPALSTP--GAGGLAVSISAGALSAAIAD 118



Score = 30.1 bits (67), Expect = 0.048
Identities = 38/131 (29%), Positives = 48/131 (36%), Gaps = 6/131 (4%)

Query: 156 MAGGAGGSAGLIGNGGAGGAGGADAAGGPGGAGGWLWGGGGAGGLGGAASGAGNGGAGGA 215
M+GG G + +G G G GG G G SG+G GG+
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 216 GGAGGAFIAIGGVGGDGGAASSGTGGVGGAGGNADGL-IVSLGGAGGHGGDATTGIGGAG 274
G G GG G GG + +G A A G +S GAGG + G A
Sbjct: 61 GHGNG-----GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115

Query: 275 GAGGMATARIP 285
A MA + P
Sbjct: 116 IADIMAALKGP 126


79  MMAR_0562  MMAR_0572  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
MMAR_0562  0  11  1.634702  hypothetical protein
MMAR_0563  0  10  2.393920  hypothetical protein
MMAR_0564  0  11  1.172116  hypothetical protein
MMAR_0565  0  11  2.375912  hypothetical protein
MMAR_0566  -2  12  2.372893  hypothetical protein
MMAR_0567  1  11  4.453428  beta-1,3-glucanase
MMAR_0568  -1  9  4.190371  muconolactone isomerase
MMAR_0569  -2  10  3.529293  hypothetical protein
MMAR_0570  -3  13  3.712621  carbohydrate kinase
MMAR_0571  -2  16  2.670678  mannitol dehydrogenase
MMAR_0572  -2  15  3.241602  PE-PGRS family protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MMAR_0562  PF03544  41  6e-06  Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 41.1 bits (96), Expect = 6e-06
Identities = 28/125 (22%), Positives = 40/125 (32%), Gaps = 3/125 (2%)

Query: 470 VVPSVGPLPAPSRSIAASSAPPKALEP--SFVPPPASAAR-APSAAPAPTSVAPPPPPPR 526
V V LPAP++ I+ + P LEP + PPP P P P P
Sbjct: 36 SVHQVIELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIE 95

Query: 527 PVATATTTVAPPVTTTKTTVPPTTAATTTPTTTPPPTTTAPPTTTAPPSTTSTVKMTTEW 586
PV + + P + T A PT++ + TS +
Sbjct: 96 KPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVAS 155

Query: 587 LHVPL 591
L
Sbjct: 156 GPRAL 160



Score = 29.6 bits (66), Expect = 0.030
Identities = 21/102 (20%), Positives = 29/102 (28%), Gaps = 3/102 (2%)

Query: 467 QSPVVPSVGPLPAPSRSIAASSAPPKALEPSFVPPPASAARAPSAAPAPTSVAPPPPPPR 526
Q P P V P P P P +E P P + P
Sbjct: 67 QPPPEPVVEPEPEPEPIPEPPKEAPVVIEK---PKPKPKPKPKPVKKVEQPKRDVKPVES 123

Query: 527 PVATATTTVAPPVTTTKTTVPPTTAATTTPTTTPPPTTTAPP 568
A+ AP T+ T T+ T+ + P + P
Sbjct: 124 RPASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRNQP 165


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MMAR_0563  IGASERPTASE  40  3e-05  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 39.7 bits (92), Expect = 3e-05
Identities = 35/241 (14%), Positives = 52/241 (21%), Gaps = 34/241 (14%)

Query: 262 AGALAYSALADDGE-----DAVEGRRRPLVLAGSAMLGIAAFAAGLMVVTLTSDVRPAAA 316
GA Y +G VE R + + A V
Sbjct: 964 LGAWKYKLRNVNGRYDLYNPEVEKRNQTVDTTNITTP--NNIQAD------VPSVPSNNE 1015

Query: 317 TQPSPREGVLTPASPAAPPKTAAPVPAKAPAQVP--------APGPAPAASPVPVAAPPS 368
E + P +PA P +T V + + A V A +
Sbjct: 1016 EIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSN 1075

Query: 369 VVQPPPVVRAPR---------PTVVKPPP----QYIAPATQPRRTPAPQAPALTPVQAPA 415
V + T K + A + P+ + +
Sbjct: 1076 VKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQ 1135

Query: 416 PEVRPPVAVPEPAPAPPVPTQAPMTMYLHLPFVSIPIPINPPPPPAPPPEPAPVEPPPPE 475
E P A P P V + P + P P E V
Sbjct: 1136 SETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSV 1195

Query: 476 P 476

Sbjct: 1196 V 1196



Score = 35.8 bits (82), Expect = 4e-04
Identities = 25/121 (20%), Positives = 35/121 (28%), Gaps = 7/121 (5%)

Query: 313 PAAATQPSPREGVLTPASPAAPPKTAAPVPAKAPAQVPAPGPAPAASPVPVAAPPSVVQP 372
P +Q SP++ P A P P + + A + P S V+
Sbjct: 1123 PKVTSQVSPKQEQSETVQPQAEPARE-NDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQ 1181

Query: 373 PPVVRAPR---PTVVKPPPQYIAPATQPRRTPAPQAPALTPVQAPAPEVRPPVAVPEPAP 429
P +VV+ P TQP + P VR EPA
Sbjct: 1182 PVTESTTVNTGNSVVENPENTTPATTQPTVN---SESSNKPKNRHRRSVRSVPHNVEPAT 1238

Query: 430 A 430

Sbjct: 1239 T 1239


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
MMAR_0565  PF05272  28  0.035  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.1 bits (62), Expect = 0.035
Identities = 9/40 (22%), Positives = 13/40 (32%), Gaps = 5/40 (12%)

Query: 53 YQPPPGSVPPSYGYQPAFPGATP-----RSSSGNRAFWII 87
Y+ G + Q T +GNR FW +
Sbjct: 672 YRGAYGRYVQDHPRQVVIWCTTNKRQYLFDITGNRRFWPV 711


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score    E-value    Comments
MMAR_0566    RTXTOXIND    29       0.033      Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 29.0 bits (65), Expect = 0.033
Identities = 8/35 (22%), Positives = 18/35 (51%)

Query: 13 RRPSGLVQPGDTMLVTMPAPEPADGEALLRTTYVG 47
G+V +T++V +P + + AL++ +G
Sbjct: 344 HTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIG 378


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits            Score    E-value    Comments
MMAR_0570    SHAPEPROTEIN    35       6e-04      Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 34.7 bits (80), Expect = 6e-04
Identities = 25/95 (26%), Positives = 39/95 (41%), Gaps = 4/95 (4%)

Query: 328 PNRPKATGALHGLRLANATPAHLARAAVEGLLCALADGLAQLT---EHGVVARRVLLIGG 384
R A G G L + + + G++ A+ L Q + R ++L GG
Sbjct: 237 RGRNLAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGG 296

Query: 385 GARSQALREIAPLIFGVPVLV-PDPAEYVALGAAR 418
GA + L + G+PV+V DP VA G +
Sbjct: 297 GALLRNLDRLLMEETGIPVVVAEDPLTCVARGGGK 331


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score    E-value    Comments
MMAR_0572    cloacin    40       3e-05      Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 39.7 bits (92), Expect = 3e-05
Identities = 41/117 (35%), Positives = 49/117 (41%), Gaps = 7/117 (5%)

Query: 145 GANPGVAGGAGGAAGLIGNGGAGGSGGAGAAASGSGDGGDGGAGGAGGSGGWLYGTGGAG 204
G G GA +G I NGG G G G A+ GSG + G G G +G G
Sbjct: 4 GDGRGHNTGAHSTSGNI-NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 205 GSGGAGGVSGMNPGNHGGLGGNGGAGGAAGIVG--EGGAGGAGGMGGSGFTSAVTTA 259
G+GG G N G G GGN A A G GAGG+ S A++ A
Sbjct: 63 GNGGGNG----NSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 37.8 bits (87), Expect = 9e-05
Identities = 35/111 (31%), Positives = 43/111 (38%), Gaps = 5/111 (4%)

Query: 222 GLGGNGGAGGAAGIVGEGGAGGAGGMGGSGFTSAVTTAGGNGGNGGTGGGGGLLAGNAGA 281
G G N GA +G + G G G G S + + GG G+G G +G+
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 282 GGAGGDGGGPVGDLGGNGGIGGDGGAAGLFGNGGAGGAGGGGGAAPVSGAA 332
GG G GGG G GG A FG G GG A +S A
Sbjct: 66 GGNGNSGGG-----SGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 37.8 bits (87), Expect = 9e-05
Identities = 39/117 (33%), Positives = 49/117 (41%), Gaps = 8/117 (6%)

Query: 165 GAGGSGGAGAAASGSGDGGDGGAGGAGGSGGWLYGTGGAGGSGGAGGVSGMNPGNHGGLG 224
G G G A S SG+ +GG G G GG G+G + + GG SG GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNI-NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 225 GNGGAGGAAGIVGEGGAGGAGGMGGSGFTSAVTTAGGNGGNGGTGGGGGLLAGNAGA 281
G G G +GG G GG+ A A G G GG ++ +AGA
Sbjct: 62 HGNGGGN-------GNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 33.9 bits (77), Expect = 0.002
Identities = 32/83 (38%), Positives = 39/83 (46%), Gaps = 4/83 (4%)

Query: 295 LGGNGGIGGDGGAAGLFGN--GGAGGAGGGGGAAPVSGAAAAGGAGGGGGAGGLLYGDGG 352
+ G G G + GA GN GG G G GGGA+ SG ++ GGG G+ GG
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGI--HWGG 58

Query: 353 LGGAGGAGGEALDGGVGGSGGAG 375
G G GG GG G+GG
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNL 81



Score = 33.5 bits (76), Expect = 0.002
Identities = 34/106 (32%), Positives = 40/106 (37%), Gaps = 4/106 (3%)

Query: 241 AGGAGGMGGSGFTSAVTTAGGNGGNGGTGGGGGLLAGNAGAGGAGGDGGGPVGDLGGNGG 300
+GG G +G S G G GGG +G + G G G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 301 IGGDGGAAGLFGNGGAGGAGGGGGAAPVSGAAAAGGAGGGGGAGGL 346
G GG GN G G GG +A + A A GAGGL
Sbjct: 62 HGNGGGN----GNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGL 103



Score = 32.8 bits (74), Expect = 0.004
Identities = 33/93 (35%), Positives = 40/93 (43%), Gaps = 4/93 (4%)

Query: 119 GADGTAPGQAGGAGGLLYGNGGNGAPG---ANPGVAGGAGGAAGLIGNGGAGGSGGAGAA 175
GA T+ GG GL G G + G N GG+G G G G GG G +
Sbjct: 12 GAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNS 71

Query: 176 ASGSGDGGDGGAGGAGGSGGW-LYGTGGAGGSG 207
GSG GG+ A A + G+ T GAGG
Sbjct: 72 GGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 30.5 bits (68), Expect = 0.018
Identities = 24/74 (32%), Positives = 26/74 (35%)

Query: 406 RGGTGGTGGNGGVGGLLYGNGGRGGAGGDGANLAGGGNGGNGGDTRLIGNGGDGGHGGSG 465
RG G G G GG DG+ + N GG I GG GHG G
Sbjct: 7 RGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGG 66

Query: 466 ALVGGAGGSGGIGG 479
GGSG G
Sbjct: 67 GNGNSGGGSGTGGN 80



Score = 29.3 bits (65), Expect = 0.041
Identities = 26/99 (26%), Positives = 34/99 (34%)

Query: 126 GQAGGAGGLLYGNGGNGAPGANPGVAGGAGGAAGLIGNGGAGGSGGAGAAASGSGDGGDG 185
G+ G NG P G + G+ N GG G+G G G+G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 186 GAGGAGGSGGWLYGTGGAGGSGGAGGVSGMNPGNHGGLG 224
G G G G G A + A G ++ GGL
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104


80    MMAR_0601    MMAR_0620    N    Y    N    Pathogenicity Island (unbiased-composition)
LocusTag     DNBias    CDNBias    %GCBias     Product
MMAR_0601    -1        8          2.210368    hypothetical protein
MMAR_0602    1         9          2.672406    hypothetical protein
MMAR_0603    5         13         3.834040    UDP-glucose dehydrogenase UdgA
MMAR_0604    10        16         7.088760    hypothetical protein
MMAR_0605    7         17         6.113603    hypothetical protein
MMAR_0606    5         15         5.024159    alpha-D-glucose-1-phosphate thymidylyl-
MMAR_0607    3         12         5.146849    PE-PGRS family protein
MMAR_0608    2         11         4.436240    PE-PGRS family protein
MMAR_0609    1         11         4.143337    PE-PGRS family protein
MMAR_0610    0         11         1.723664    aminotransferase AlaT
MMAR_0611    0         11         2.064040    iron-sulfur-binding reductase
MMAR_0612    -1        12         2.363823    transcriptional regulatory protein
MMAR_0613    2         14         2.253946    hypothetical protein
MMAR_0614    2         14         2.515169    membrane protein, IniB
MMAR_0615    0         11         0.954801    hypothetical protein
MMAR_0616    2         12         1.700957    isoniazid inductible protein IniC
MMAR_0617    3         13         1.529451    hypothetical protein
MMAR_0618    0         9          0.424207    hypothetical protein
MMAR_0619    -1        9          0.683703    hypothetical protein
MMAR_0620    -2        10         0.282952    lipoprotein LpqJ
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score    E-value    Comments
MMAR_0601    cloacin    49       4e-08      Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 48.6 bits (115), Expect = 4e-08
Identities = 34/80 (42%), Positives = 35/80 (43%), Gaps = 5/80 (6%)

Query: 477 RGHFGGGAPGRSGPSGGGHGGQFGGGGHGGRFGGGGHGGGFGGGRGGGGHGGGFGGFGGG 536
RGH GA SG GG G GGG G +GGG G G H GG G G
Sbjct: 7 RGH-NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG----GSG 61

Query: 537 HGGGGHGGGFGGGHGGGFGG 556
HG GG G GGG G G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 45.1 bits (106), Expect = 5e-07
Identities = 33/91 (36%), Positives = 39/91 (42%), Gaps = 4/91 (4%)

Query: 464 GRGPAGGSLGPGRRGHFGGGAPGRSGPSGGGHGGQFGGGGHGGRFGGGGHGGGFGGGRGG 523
GRG G+ G+ GG P G GG G + GG G G +GGG G
Sbjct: 6 GRGHNTGAHSTS--GNINGG-PTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGG-SG 61

Query: 524 GGHGGGFGGFGGGHGGGGHGGGFGGGHGGGF 554
G+GGG G GGG G GG+ GF
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGF 92



Score = 44.7 bits (105), Expect = 7e-07
Identities = 30/79 (37%), Positives = 35/79 (44%), Gaps = 8/79 (10%)

Query: 456 GLGRIPNPGRGPAGGSL--GPGRRGHFGGGAPG-----RSGPSGGGHGGQFGGGGHGGRF 508
G GR N G G++ GP G GG + G + P GGG G GG G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH- 62

Query: 509 GGGGHGGGFGGGRGGGGHG 527
G GG G GGG G GG+
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 44.7 bits (105), Expect = 7e-07
Identities = 27/76 (35%), Positives = 29/76 (38%), Gaps = 3/76 (3%)

Query: 488 SGPSGGGHGG---QFGGGGHGGRFGGGGHGGGFGGGRGGGGHGGGFGGFGGGHGGGGHGG 544
SG G GH G +GG G G GG G + GG G G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 545 GFGGGHGGGFGGGHGG 560
GG G GGG G
Sbjct: 62 HGNGGGNGNSGGGSGT 77


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score    E-value    Comments
MMAR_0602    PF03544    47       7e-08      Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 46.9 bits (111), Expect = 7e-08
Identities = 29/146 (19%), Positives = 38/146 (26%), Gaps = 18/146 (12%)

Query: 313 GALAVSLAVSIRSEPGTRPDPGQSVVTHLPAPAHAAPAPQPQAPAPQAPAPQAQAPAPQA 372
GA+ L + + P P Q + + APA P Q P
Sbjct: 26 GAVVAGLLYTSVHQVIELPAPAQPISVTMVAPADLEPPQAVQPP---------------- 69

Query: 373 QVPAPAPAPQAPAPRAPAPQAPAPQAQVPAPKAPAPVPVPVAQAPVPVPQAPAPVPVPIP 432
P P P+ P P AP P P P PV + P P
Sbjct: 70 --PEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPAS 127

Query: 433 QAPAPAPEAPVPAPVPIPVPIQIPLP 458
AP P + +
Sbjct: 128 PFENTAPARPTSSTATAATSKPVTSV 153



Score = 39.6 bits (92), Expect = 1e-05
Identities = 24/107 (22%), Positives = 31/107 (28%), Gaps = 4/107 (3%)

Query: 380 APQAPAPRAP---APQAPAPQAQVPAPKAPAPVPVPVAQAPVPVPQAPAPVPVPIPQ-AP 435
+ PAP P APA A + P V P P+P+ P PV I + P
Sbjct: 40 VIELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKP 99

Query: 436 APAPEAPVPAPVPIPVPIQIPLPQIFGPGGGGGFPGGDDDRGGRGGG 482
P P+ V P P+ P
Sbjct: 100 KPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAAT 146



Score = 36.9 bits (85), Expect = 1e-04
Identities = 26/121 (21%), Positives = 30/121 (24%), Gaps = 5/121 (4%)

Query: 362 APQAQAPAPQAQVPAPAPAPQAPAPRAPAPQAPAPQAQVPAPKAPAPVPV---PVAQAPV 418
+ APA V APA P P P P + P P P PV
Sbjct: 40 VIELPAPAQPISVTMVAPADLEPPQAVQPP--PEPVVEPEPEPEPIPEPPKEAPVVIEKP 97

Query: 419 PVPQAPAPVPVPIPQAPAPAPEAPVPAPVPIPVPIQIPLPQIFGPGGGGGFPGGDDDRGG 478
P P PV + P + P P P G
Sbjct: 98 KPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVASGP 157

Query: 479 R 479
R
Sbjct: 158 R 158


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits            Score    E-value    Comments
MMAR_0603    NUCEPIMERASE    30       0.025      Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 29.8 bits (67), Expect = 0.025
Identities = 23/87 (26%), Positives = 37/87 (42%), Gaps = 17/87 (19%)

Query: 1 MRCTVFGT-GYLGATHAVGMAQLGHEVVGVDIDPGKVAKLAGGDIPFYEPGLRKLLRDNL 59
M+ V G G++G + + + GH+VVG+D L +Y+ L++ + L
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGID-------NLN----DYYDVSLKQARLELL 49

Query: 60 AAGRLHFTT----DYD-MAAEFADVHF 81
A F D + M FA HF
Sbjct: 50 AQPGFQFHKIDLADREGMTDLFASGHF 76


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score    E-value    Comments
MMAR_0607    cloacin    39       4e-05      Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.9 bits (90), Expect = 4e-05
Identities = 35/114 (30%), Positives = 46/114 (40%), Gaps = 5/114 (4%)

Query: 365 GGNGGQGGTGGTLFGNGGGGGTGGAGFVGPSSAGDGGNGGGGGRAGLIGNGGAGGAGGAP 424
G N G T G + G G G GG G + + GGG +G+ GG+G
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGI---HWGGGSGHGN 64

Query: 425 GPNGGFSGGNGGNGGDAVLIGNGGNSGDVGLS--GAGTPGLPGNGGLLIGTIGN 476
G G SGG G GG+ + G LS GAG + + G L I +
Sbjct: 65 GGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 35.8 bits (82), Expect = 3e-04
Identities = 33/111 (29%), Positives = 42/111 (37%), Gaps = 4/111 (3%)

Query: 138 GDGGAGGSGGLEQQGGT--GGAAGLFGNGGAGGAGGASTVGTGAAGGAGGAGGLLWGQGG 195
G G G + G G GG GL GGA G S+ GG+G G+ WG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGS--GIHWGGGS 60

Query: 196 IGGTGGSGIDGGAGGAGGAGGALFGIGGAGGQGGIASTGLTGGGAGGDGGA 246
G GG + G G G + A G +++ G G GA
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 35.5 bits (81), Expect = 4e-04
Identities = 28/81 (34%), Positives = 35/81 (43%), Gaps = 8/81 (9%)

Query: 245 GAGGLFGNGGIGGTGGLASVGPGGVGGNGGAA--------GTLLGNGGGGGVGGFGATQG 296
G G N G T G + GP G+G GGA+ G G G G+ G +
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 297 GDGGDGGATGIFGGTGGAGGA 317
G+GG G +G GTGG A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSA 83



Score = 33.5 bits (76), Expect = 0.002
Identities = 34/101 (33%), Positives = 38/101 (37%), Gaps = 1/101 (0%)

Query: 336 GGDGRLFGNGGAGGVGGAAYTSNILMNATGGNGGQGGTGGTLFGNGGGGGTGGAGFVGPS 395
GGDGR N GA G + GG G GGG G+G G
Sbjct: 3 GGDGRGH-NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 396 SAGDGGNGGGGGRAGLIGNGGAGGAGGAPGPNGGFSGGNGG 436
GGNG GG +G GN A A A G + G GG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 30.8 bits (69), Expect = 0.013
Identities = 35/107 (32%), Positives = 42/107 (39%), Gaps = 14/107 (13%)

Query: 220 GIGGAGGQGGIASTGLTGGGAGGDGGAGGLFGNGGIGGT----GGLASVGPGGVGGNGGA 275
G G G +++G GG G G GG G GG + G GG+G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH- 62

Query: 276 AGTLLGNGGGGGVGGFGATQGGDGGDGGATGIFG----GTGGAGGAG 318
GNGGG G G G+ GG+ A FG T GAGG
Sbjct: 63 -----GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 30.5 bits (68), Expect = 0.015
Identities = 25/71 (35%), Positives = 31/71 (43%)

Query: 282 NGGGGGVGGFGATQGGDGGDGGATGIFGGTGGAGGAGGQSTNALADSVGGNGGQGGDGRL 341
+GG G GA +GG TG+ G G + G+G S N G+G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 342 FGNGGAGGVGG 352
GNGG G G
Sbjct: 62 HGNGGGNGNSG 72



Score = 29.7 bits (66), Expect = 0.032
Identities = 21/72 (29%), Positives = 29/72 (40%)

Query: 118 NGANGAAGTGANGGAAGWLLGDGGAGGSGGLEQQGGTGGAAGLFGNGGAGGAGGASTVGT 177
N + NGG G +G G + GSG + GG +G + G G G
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69

Query: 178 GAAGGAGGAGGL 189
+ GG+G G L
Sbjct: 70 NSGGGSGTGGNL 81



Score = 29.3 bits (65), Expect = 0.034
Identities = 37/132 (28%), Positives = 46/132 (34%), Gaps = 12/132 (9%)

Query: 186 AGGLLWGQGGIGGTGGSGIDGGAGGAGGAGGALFGIGGAGGQGGIASTGLTGGGAGGDGG 245
+GG G + I+GG G G GGA + G G + GGG+G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGA------SDGSGWSSENNPWGGGSGSGIH 55

Query: 246 AGGLFGNGGIGGTGGLASVGPGGVGGNGGAAGTLLGNG----GGGGVGGFGATQGGDGGD 301
GG G+G GG G S G G GGN A + G G GG +
Sbjct: 56 WGGGSGHGNGGGNGN--SGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALS 113

Query: 302 GGATGIFGGTGG 313
I G
Sbjct: 114 AAIADIMAALKG 125


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score    E-value    Comments
MMAR_0608    cloacin    35       4e-04      Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 35.5 bits (81), Expect = 4e-04
Identities = 26/78 (33%), Positives = 31/78 (39%)

Query: 231 GAGGDGGVGGFGTFAGNGGDGGTGLFAAGGDGGAGGSGLTQGGDGGIGGAALGLFGAGGA 290
G G G G + +GN G TGL GG G GG G+ + G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 291 GGTGGEGGLFGGAGGMGG 308
G GG G GG+G G
Sbjct: 63 GNGGGNGNSGGGSGTGGN 80



Score = 35.5 bits (81), Expect = 5e-04
Identities = 34/111 (30%), Positives = 42/111 (37%), Gaps = 10/111 (9%)

Query: 364 GAGGAGGNAGFLYGAGGAGGAGGDSVGGADGGDGGNGGKAGLVGQGGDGGAGGNNNSGFG 423
G G G N GA G+ GG G G G G + GG + SG
Sbjct: 3 GGDGRGHN-------TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIH 55

Query: 424 TGGSGGKGGDAVLIGNGGNGGNAGSGGVGPGIAGAAGIGGLLIGEDGMAGL 474
GG G G GNG +GG +G+GG +A G + G GL
Sbjct: 56 WGGGSGHGNGG---GNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGL 103



Score = 34.7 bits (79), Expect = 8e-04
Identities = 34/115 (29%), Positives = 38/115 (33%), Gaps = 5/115 (4%)

Query: 258 AGGDGGAGGSGLTQGGDGGIGGAALGLFGAGGAGGTGGEGGLFGGAGGMGGSAGMLFGNG 317
+GGDG +G G I G GL GGA G GG GS G
Sbjct: 2 SGGDGRGHNTGAHSTS-GNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 318 GDGGAGAAATIGNAAGNGGAGGNAGMLIGAG----GAGGNGGFGFSASDGGAGGA 368
G G G G +G GG + G G GG S S G A
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 34.7 bits (79), Expect = 8e-04
Identities = 36/114 (31%), Positives = 45/114 (39%), Gaps = 5/114 (4%)

Query: 346 GAGGAGGNGGFGFSASDGGAGGAGGNAGFLYGAGGAGGAGGDSVGGADGGDGGNGGKAGL 405
G G G N G A GG G G G + G+G S GG G+G G
Sbjct: 3 GGDGRGHNTG----AHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 406 VGQGGDGGAGGNNNSGFGTGGSGGKGGDAVLIGNGGNGGNAGSGGVGPGIAGAA 459
G+GG GN+ G GTGG+ V G G+GG+ I+ A
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPAL-STPGAGGLAVSISAGA 111



Score = 33.5 bits (76), Expect = 0.002
Identities = 31/91 (34%), Positives = 33/91 (36%), Gaps = 5/91 (5%)

Query: 202 GRGGAGGAGGFGNNTTGGIGGLGGAGGLFGAGGDGGVGGFGTFAGNGGDGGTGLFAAGGD 261
GRG GA N GG GLG GG G G GG G+G+ GG
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGG-----ASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 262 GGAGGSGLTQGGDGGIGGAALGLFGAGGAGG 292
G G G G G G L A A G
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 33.5 bits (76), Expect = 0.002
Identities = 28/89 (31%), Positives = 36/89 (40%), Gaps = 8/89 (8%)

Query: 312 MLFGNGGDGGAGAAATIGNAAGNGGAGGNAGMLIGAGGAGGNGGFGFSASDGGAGGAGGN 371
M G+G GA +T GN G G G G + G G+S+ + GG G+
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLG--------VGGGASDGSGWSSENNPWGGGSGS 52

Query: 372 AGFLYGAGGAGGAGGDSVGGADGGDGGNG 400
G G G GG+ G G GGN
Sbjct: 53 GIHWGGGSGHGNGGGNGNSGGGSGTGGNL 81



Score = 30.8 bits (69), Expect = 0.012
Identities = 25/81 (30%), Positives = 34/81 (41%), Gaps = 1/81 (1%)

Query: 384 AGGDSVGGADGGDGGNGGKAGLVGQGGDGGAGGNNNSGFGTGGSGGKGGDAVLIGNGGNG 443
+GGD G G +G G G GG G ++ SG+ + + GG I GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGG-GASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 444 GNAGSGGVGPGIAGAAGIGGL 464
G+ GG G G+ G L
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNL 81



Score = 30.8 bits (69), Expect = 0.014
Identities = 28/81 (34%), Positives = 35/81 (43%), Gaps = 4/81 (4%)

Query: 140 GAGGSGTPGTASVAGGNGGNGGAGGLLFGTGGAGGAG-GTASSLVGAIPGGNGGYGGDGG 198
G G G A GN NGG GL G G + G+G + ++ G G +GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNI-NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 199 LLFGRGGAGGAGGFGNNTTGG 219
G GG G G G+ T G
Sbjct: 62 --HGNGGGNGNSGGGSGTGGN 80



Score = 30.5 bits (68), Expect = 0.015
Identities = 27/103 (26%), Positives = 34/103 (33%), Gaps = 2/103 (1%)

Query: 289 GAGGTGGEGGLFGGAGGMGGSAGMLFGNGGDGGAGAAATIGNAAGNGGAGGNAGMLIGAG 348
G G G G +G + G L GG ++ N G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 349 GAGGNGGFGFSASDGGAGGAGGNAGFLYG--AGGAGGAGGDSV 389
G GG G S G + A +G A GAGG +V
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAV 105



Score = 30.5 bits (68), Expect = 0.018
Identities = 26/82 (31%), Positives = 35/82 (42%), Gaps = 1/82 (1%)

Query: 246 GNGGDGGTGLFAAGGDGGAGGSGLTQGGDGGIG-GAALGLFGAGGAGGTGGEGGLFGGAG 304
G G + G + +GG G G+ G G G + +G G G GG G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 305 GMGGSAGMLFGNGGDGGAGAAA 326
G G++G G GG+ A AA
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAP 87



Score = 29.7 bits (66), Expect = 0.027
Identities = 32/109 (29%), Positives = 37/109 (33%), Gaps = 10/109 (9%)

Query: 116 GNGANGAPGTGADGGAAGWLIGNGGAGGSGTPGTASVAGGNGGNGGAGGLLFGTGGAGGA 175
G G N + + G G G S G +S GG G+G G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 176 GGTASSLVGAIPGGNGGYGGDGGLLFGRGGAGGAGGFGNNTTGGIGGLG 224
GG GN G G G A A GF +T G GGL
Sbjct: 66 GG----------NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 29.3 bits (65), Expect = 0.041
Identities = 33/109 (30%), Positives = 38/109 (34%), Gaps = 8/109 (7%)

Query: 170 GGAGGAGGTASSLVGAIPGGNGGYGGDGGLLFGRGGAGGAGGFGNNTTGGIGGLGGAGGL 229
G G A S G I GG G G GG A G+ + GG G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGG-------ASDGSGWSSENNPWGGGSGSGIHW 56

Query: 230 FGAGGDGGVGGFGTFAGNGGDGGTGLFAAGGDGGAGGSGLTQGGDGGIG 278
G G G GG G +G G G L A G L+ G GG+
Sbjct: 57 GGGSGHGNGGGNGN-SGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 28.9 bits (64), Expect = 0.050
Identities = 27/84 (32%), Positives = 31/84 (36%), Gaps = 7/84 (8%)

Query: 110 TGRPLIGNGANGAPGTGADGGAA----GWLIGNGGAGGSGTPGTASVAGGNGGNGGAGGL 165
TG NG P GG A GW N GG G G GNGG G
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG- 69

Query: 166 LFGTGGAGGAGGTASSLVGAIPGG 189
+GG G GG S++ + G
Sbjct: 70 --NSGGGSGTGGNLSAVAAPVAFG 91


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score    E-value    Comments
MMAR_0609    cloacin    39       1e-04      Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.5 bits (89), Expect = 1e-04
Identities = 30/86 (34%), Positives = 37/86 (43%), Gaps = 3/86 (3%)

Query: 403 GAGAVAGASGAAGTIIAGNGGNGGAGGAGYAADGPAGPAIGNGGDGGRGGAGGFYGNGGA 462
G G GA +G I NGG G G G A+DG + N GG G + G G
Sbjct: 6 GRGHNTGAHSTSGNI---NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 463 GGAGGNSAPGGGNGGNGGTGGDSGAM 488
G GGN GGG+G G + +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPV 88



Score = 36.2 bits (83), Expect = 7e-04
Identities = 32/107 (29%), Positives = 43/107 (40%), Gaps = 4/107 (3%)

Query: 462 AGGAGGNSAPGGGNGGNGGTGGDSGAMGSSGGRGGDGGVGTNGGAGGGGGNATSYGTANA 521
+GG G G + GG +G G G G N GGG G+ +G +
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 522 TGGAGGDGGAGSRGTGGTGGAGGGGGAAQILNGASAATATGGAGGAG 568
G GG+G +G GG+G G A + A +T GAGG
Sbjct: 62 HGNGGGNGNSG----GGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 34.7 bits (79), Expect = 0.002
Identities = 32/108 (29%), Positives = 37/108 (34%), Gaps = 5/108 (4%)

Query: 321 SGGGGAGGNGGAGGAGGHGSALFGAAGANGNGGAGGAGGNPGAPGNGGIGGVGPDAATSG 380
SGG G G N GA G+ + G G G + P GG G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNP-WGGGSGSGIHWGGGS 60

Query: 381 GMGGTGGDPGAVGGGGNGGAAGGAGAVAGASGAAGTIIAGNGGNGGAG 428
G G GG+ G G G GG + A A G G GG
Sbjct: 61 GHGNGGGN----GNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 34.7 bits (79), Expect = 0.002
Identities = 28/87 (32%), Positives = 32/87 (36%)

Query: 222 MFGNGGAGGMGGAGADGAVGAAGTAGTSTSAGGVGGVGGDGGNAGNGGAGGNGGLFVGVG 281
M G G G GA + G G G G G N GG G+G + G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 282 GAGGQGGAGGAGGTGGAGGAGWDATAA 308
G G GG G +GG G GG A
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAP 87



Score = 34.3 bits (78), Expect = 0.003
Identities = 33/102 (32%), Positives = 41/102 (40%), Gaps = 2/102 (1%)

Query: 426 GAGGAGYAADGPAGPAIGNGGDGGRGGAGGFYGNGGAGGAGGNSAPGGGNGGNGGTGGDS 485
G G G+ + NGG G G GG + G+G + N+ GGG+G GG S
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGG--ASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 486 GAMGSSGGRGGDGGVGTNGGAGGGGGNATSYGTANATGGAGG 527
G G GG GT G A +T GAGG
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 34.3 bits (78), Expect = 0.003
Identities = 37/117 (31%), Positives = 43/117 (36%), Gaps = 2/117 (1%)

Query: 144 GNGGNGGSGAAGQAGGA--GGAAGLIGTGGAGGMGGAGGGAGGMGGSGGWLLGNGGAGGA 201
G G G + A G GG GL GGA G GG G + GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 202 GGVGGAGVSGGVGGTGGNAVMFGNGGAGGMGGAGADGAVGAAGTAGTSTSAGGVGGV 258
G GG G SGG GTGGN A G GA G A + + + +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADI 119



Score = 33.9 bits (77), Expect = 0.004
Identities = 33/105 (31%), Positives = 41/105 (39%), Gaps = 1/105 (0%)

Query: 864 ATGGAGGDGGSGGTGRGGTGGVGGVGINNGSGEAIGGAPGAGGTGAVGGDGGQGGAAYSY 923
+G G G G G + G G NN G G GG G GG G +
Sbjct: 16 TSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGS 75

Query: 924 GTGDATGSAGAAGTAGTTGVGGTGGAGGAAYTLNGASTATATGGI 968
GTG SA AA A T GAGG A +++ + + A I
Sbjct: 76 GTG-GNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADI 119



Score = 33.9 bits (77), Expect = 0.004
Identities = 23/83 (27%), Positives = 29/83 (34%)

Query: 259 GGDGGNAGNGGAGGNGGLFVGVGGAGGQGGAGGAGGTGGAGGAGWDATAAGVLAATGGDG 318
GGDG G +G + G G G GGA G + +G+ G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 319 GDSGGGGAGGNGGAGGAGGHGSA 341
G+ GG G G G G A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVA 85



Score = 32.8 bits (74), Expect = 0.007
Identities = 36/117 (30%), Positives = 42/117 (35%), Gaps = 8/117 (6%)

Query: 265 AGNGGAGGNGGLFVGVGGAGGQGGAGGAGGTGGAGGAGWDATAAGVLAATGGDGGDSGGG 324
+G G G N G G G G GG G + G+GW + GG SG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGG-GASDGSGWSSE-------NNPWGGGSGSG 53

Query: 325 GAGGNGGAGGAGGHGSALFGAAGANGNGGAGGAGGNPGAPGNGGIGGVGPDAATSGG 381
G G G GG G +G GN A A G P G G + S G
Sbjct: 54 IHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 32.8 bits (74), Expect = 0.009
Identities = 28/101 (27%), Positives = 35/101 (34%), Gaps = 1/101 (0%)

Query: 671 GNGGTGHGGGGGSGGTAINYGAGDAFGGAAGKGGTGVVGGNGGSGGAAYNYGTGNATGAD 730
G G GH G S IN G G G+G N GG + G G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGS-GSGIHWGGGSG 61

Query: 731 GAAGTDGTTGAGGSGGSGGAASVLNSASIATATSGSGGAGG 771
G GGSG G ++V + + GAGG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 32.0 bits (72), Expect = 0.014
Identities = 27/83 (32%), Positives = 33/83 (39%), Gaps = 2/83 (2%)

Query: 420 GNGGNGGAGGAGYAADGPAGPAIGNGGDGGRGGAGGFYGNGGAGGAGGNSAPGGGNGGNG 479
G G N GA +G GP G G G+G N GG G+ GG G+G
Sbjct: 6 GRGHNTGAHSTSGNING--GPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 480 GTGGDSGAMGSSGGRGGDGGVGT 502
GG+ + G SG G V
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAA 86



Score = 31.6 bits (71), Expect = 0.019
Identities = 36/117 (30%), Positives = 46/117 (39%), Gaps = 1/117 (0%)

Query: 644 AGGDGGRTTIDGAGSRATATGGTGGDGGNGGTGHGGGGGSGGTAINYGAGDAFGGAAGKG 703
+GGDG + GG G G GG G G S G+G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIH-WGGGS 60

Query: 704 GTGVVGGNGGSGGAAYNYGTGNATGADGAAGTDGTTGAGGSGGSGGAASVLNSASIA 760
G G GGNG SGG + G +A A A G + G G + ++ SA+IA
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIA 117



Score = 31.6 bits (71), Expect = 0.021
Identities = 26/83 (31%), Positives = 36/83 (43%)

Query: 713 GSGGAAYNYGTGNATGADGAAGTDGTTGAGGSGGSGGAASVLNSASIATATSGSGGAGGD 772
G G +N G + +G T G G S GSG ++ + + GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 773 GTDGGNGGSGGFAFTFGTGNIIA 795
G GGNG SGG + T G + +A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVA 85


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits           Score    E-value    Comments
MMAR_0611    IGASERPTASE    58       2e-10      IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 57.8 bits (139), Expect = 2e-10
Identities = 42/250 (16%), Positives = 68/250 (27%), Gaps = 13/250 (5%)

Query: 726 LDREKATLPEKGTAAKEAEKRAKAAPKAAAPAAPAPAPAEA-PAKAAEAPAAATAASPAA 784
+D T P A + + A AP P PA A P++ E A +
Sbjct: 992 VDTTNITTPNNIQADVPSV-PSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKT 1050

Query: 785 PAKGLGMAGGAKRPGAKKAAPAPAAETAAAEAPAAPAKGLGMAAGAKKPGAKKAAAPTGE 844
K + A A+ A + A +G++ +
Sbjct: 1051 VEKN------EQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETA 1104

Query: 845 TKPAEAAAPAAPAAPVKGLGMAS--GAKRPGAKKAAPPAAAAPEAAATASAPEAAA---A 899
T E A + + S K+ ++ P A A E T + E +
Sbjct: 1105 TVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNT 1164

Query: 900 PAEPAAPAAPVKGLGIATGAKRPGAKKAPARAEAPAAAAPAQPEPEATPEPEPASKQDGE 959
A+ PA + + E P PA +P E K
Sbjct: 1165 TADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHR 1224

Query: 960 PTPPAAPAAP 969
+ + P
Sbjct: 1225 RSVRSVPHNV 1234



Score = 40.8 bits (95), Expect = 3e-05
Identities = 34/209 (16%), Positives = 63/209 (30%), Gaps = 22/209 (10%)

Query: 698 TDGVNDRQEEAGRSGVEV----LDVAQVLLGSLDREKATLPEKGTAAKEAEKRAKAAPKA 753
T N + +S V+ +VAQ GS +E T K TA E E++AK +
Sbjct: 1061 TTAQNREVAKEAKSNVKANTQTNEVAQS--GSETKETQTTETKETATVEKEEKAKV--ET 1116

Query: 754 AAPAAPAPAPAEAPAKAAEAPAAATAASPAAPAKGLGMAGGAKRPGAKKAAPAPAAETAA 813
++ K ++ A PA + A A+ +
Sbjct: 1117 EKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETS 1176

Query: 814 AEAPAAPAKGLGMAAGAKKPGAKKAAAPTGETKPAEAAAPAAPAAPVKGLGMASGAKRPG 873
+ + + G + P T+P + +S +
Sbjct: 1177 SNVEQPVTESTTVNTGNSVVENPENTTPA-TTQPTVNSE-------------SSNKPKNR 1222

Query: 874 AKKAAPPAAAAPEAAATASAPEAAAAPAE 902
+++ E A T+S + A +
Sbjct: 1223 HRRSVRSVPHNVEPATTSSNDRSTVALCD 1251


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score    E-value    Comments
MMAR_0614    FLAGELLIN    42       1e-05      Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 41.6 bits (97), Expect = 1e-05
Identities = 26/286 (9%), Positives = 58/286 (20%), Gaps = 2/286 (0%)

Query: 566 IASQAGLAGQAGLAGQAGIASQAGIASQSGLAAGGSAGIASQAGLGIGGQAALGGQAGAA 625
+++Q G L+ + Q G + GL
Sbjct: 125 VSNQTQFNGVKVLSQDNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEATVGD 184

Query: 626 VGGGLAGVGNVSGLTGIGGNASLGATGQAGLIASEGAALNGAATPHVSGPLGGVGVGGQA 685
+ V + A + + + + +
Sbjct: 185 LKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAENN 244

Query: 686 GAAGGAGLGLGAGSRGGILSGDSTTLGGHPNPQPAALGAAGGTGIGAHSGAGGGMAAGLG 745
A + GG G + G ++ +
Sbjct: 245 TAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTTIN 304

Query: 746 GSAAGGAGVGL-GGSAAGGAGAEAAGGFGGGTHIGGQAGLGGSAAGGAGTELGGTAGSPG 804
G + G+A A + + + GQ + L +
Sbjct: 305 GEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAK-LSDLEANNA 363

Query: 805 AGMGAGVGGGTHAGGQVGLGGGSTAGGQAGLGGGSSAGGSVGHGDI 850
+ + G T G+ +++G S +
Sbjct: 364 VKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINED 409


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits            Score    E-value    Comments
MMAR_0617    SHAPEPROTEIN    35       0.001      Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 34.7 bits (80), Expect = 0.001
Identities = 20/86 (23%), Positives = 34/86 (39%), Gaps = 9/86 (10%)

Query: 260 IRFSRNEFEQLITQPLDRFIGSVEDMLQRSGVPRPSLAA------VAAVGGGAAIPLIGN 313
+ NE + + +PL + +V L++ P LA+ + GGGA + +
Sbjct: 249 FTLNSNEILEALQEPLTGIVSAVMVALEQC---PPELASDISERGMVLTGGGALLRNLDR 305

Query: 314 RLSERLQVPVFTTAQPIFSAAIGAAM 339
L E +PV P+ A G
Sbjct: 306 LLMEETGIPVVVAEDPLTCVARGGGK 331


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits            Score    E-value    Comments
MMAR_0618    PYOCINKILLER    27       0.033      Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 27.5 bits (60), Expect = 0.033
Identities = 18/82 (21%), Positives = 25/82 (30%)

Query: 97 APTKTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTPTTTTTTTTAPTTTTTTTTNPMS 156
A TT T +TT T T + P++TT P T
Sbjct: 390 AYNATTGLYEVTVPSTTAEAPPLILTWTPASPPGNQNPSSTTPVVPKPVPVYEGATLTPV 449

Query: 157 PGAMPTFPSQLTPSIPTVINLP 178
T+P +T +I P
Sbjct: 450 KATPETYPGVITLPEDLIIGFP 471


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score    E-value    Comments
MMAR_0619    GPOSANCHOR    36       8e-05      Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 35.8 bits (82), Expect = 8e-05
Identities = 15/73 (20%), Positives = 23/73 (31%), Gaps = 10/73 (13%)

Query: 27 PAGPPPSRPAGPQSGPPPQAPPGPDSLSPSEQFASAEGYDQEKPESEPKPWYRNPVT--- 83
A + + Q+ P A PG ++ Q A + + + P T
Sbjct: 455 LAKLRAGKASDSQT---PDAKPGNKAVPGKGQAPQAGTKPNQNKAPMKETKRQLPSTGET 511

Query: 84 ----LTGWALLVM 92
T AL VM
Sbjct: 512 ANPFFTAAALTVM 524


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score    E-value    Comments
MMAR_0620    PF05616    28       0.020      Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 28.2 bits (62), Expect = 0.020
Identities = 22/94 (23%), Positives = 34/94 (36%), Gaps = 6/94 (6%)

Query: 32 PIATPGAGPTEPSFPTRRPTTSP---PPPTSTSPSQPTSPASPTSPAGAIPLPPDDNGYV 88
P +P P P P T P P P + P + P + + +P NG
Sbjct: 329 PEVSPAENPANNPAPNENPGTRPNPEPDPDLNPDANPDTDGQPGTRPDSPAVPDRPNGRH 388

Query: 89 FIETKSGQT---RCQINHDSVGCEAPFTNSPIKD 119
E K G+ C+ D + C+ +P +D
Sbjct: 389 RKERKEGEDGGLLCKFFPDILACDRLPEPNPAED 422


81    MMAR_0628    MMAR_0648    N    Y    N    Pathogenicity Island (unbiased-composition)
LocusTag     DNBias    CDNBias    %GCBias      Product
MMAR_0628    1         18         0.914541     monooxygenase
MMAR_0629    1         17         1.673539     dehydrogenase
MMAR_0630    1         17         1.301791     transcriptional regulatory protein
MMAR_0631    2         16         1.101610     potassium-transporting ATPase subunit A
MMAR_0632    4         17         1.111892     potassium-transporting ATPase subunit B
MMAR_0633    4         14         1.157146     potassium-transporting ATPase subunit KdpC
MMAR_0634    3         14         1.046986     two-component system response phosphate sensor
MMAR_0635    3         15         0.070496     transcriptional regulatory protein KdpE
MMAR_0636    3         20         -1.954899    hypothetical protein
MMAR_0637    6         23         -2.985597    molecular chaperone DnaK
MMAR_0638    6         21         -2.990288    GrpE protein (Hsp-70 cofactor)
MMAR_0639    6         21         -3.004720    chaperone protein DnaJ
MMAR_0640    5         21         -2.821326    heat shock protein transcriptional repressor
MMAR_0641    5         21         -2.725905    PPE family protein
MMAR_0642    3         18         -2.244559    PPE family protein
MMAR_0643    0         11         0.492189     hypothetical protein
MMAR_0644    1         11         0.457309     monooxygenase
MMAR_0645    1         13         0.777365     endopeptidase ATP binding protein (chain B)
MMAR_0646    0         12         0.987622     enoyl-CoA hydratase, EchA8
MMAR_0647    -1        13         0.718480     hypothetical protein
MMAR_0648    0         13         0.665408     ketoacyl reductase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits            Score    E-value    Comments
MMAR_0628    PHPHTRNFRASE    29       0.020      Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 29.0 bits (65), Expect = 0.020
Identities = 14/59 (23%), Positives = 23/59 (38%), Gaps = 4/59 (6%)

Query: 196 EPLEEFRRKNDLVR-RHADAAQRDEAKIERAMAWTTPADADAYHR---EGVSLLTTEIQ 250
+ ++K + + + +D A +E A TP D D EG+ L TE
Sbjct: 243 KRAAFEKQKQEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTEFL 301


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_0629  DHBDHDRGNASE  113    1e-32    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 113 bits (284), Expect = 1e-32
Identities = 75/252 (29%), Positives = 111/252 (44%), Gaps = 12/252 (4%)

Query: 1 MAGLTALITGATSGIGRATAHGLAELGATVLVSGRDEARGREVVDEVTARGGRGIFLAAE 60
+ G A ITGA GIG A A LA GA + + + +VV + A A+
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 61 LRDVTGVRNLATAAADAGAGKVDILINSAGAFPFGPTADTSPDDFDAVFALNVRAPYFLV 120
+RD + + TA + G +DIL+N AG G S ++++A F++N +
Sbjct: 66 VRDSAAIDEI-TARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNAS 124

Query: 121 GALAPTMAKRGRGSIVNVTTMVAEFGAAGTGLYGASKAAIALLTKSWAAEYGPSGVRVNA 180
+++ M R GSIV V + A Y +SKAA + TK E +R N
Sbjct: 125 RSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184

Query: 181 VSPGPTRTE-----GTVEMGEA------LDQLAAAAPAGRPADPTEIASTIVYLASDAAS 229
VSPG T T+ E G L+ P + A P++IA +++L S A
Sbjct: 185 VSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAG 244

Query: 230 FIHGAVVPVDGG 241
I + VDGG
Sbjct: 245 HITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0630  HTHTETR  64     6e-15    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 64.3 bits (156), Expect = 6e-15
Identities = 28/158 (17%), Positives = 49/158 (31%), Gaps = 1/158 (0%)

Query: 12 VLSAARDEFRSHGYAATSVDSLAAATGLNRSSLYGSFGDKHRLFLRALDGYCEATLHDVR 71
+L A F G ++TS+ +A A G+ R ++Y F DK LF +
Sbjct: 16 ILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELEL 75

Query: 72 EVLRERGVSARQRLINHVHAIVNGIVADTDRRGC-MMSRSSAELAGADPDVSGIVERSLE 130
E + L + ++ V + RR + E G V
Sbjct: 76 EYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCL 135

Query: 131 AWRRELADCIAEAQLEGAVAGDGSPQALATVMLSLMQG 168
+ + + D + A +M + G
Sbjct: 136 ESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISG 173


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
MMAR_0635  HTHFIS  106    1e-28    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 106 bits (265), Expect = 1e-28
Identities = 36/118 (30%), Positives = 60/118 (50%), Gaps = 1/118 (0%)

Query: 2 TRVLVIDDEPQILRALRINFSVRGYDVVTAATGAAALRAAAEQRPDVVILDLGLPDMSGI 61
+LV DD+ I L S GYDV + A R A D+V+ D+ +PD +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 EVLAGLRGWFS-APVIVLSARSDSSDKVQALDAGADDYVTKPFGMDELLARLRAAVRR 118
++L ++ PV+V+SA++ ++A + GA DY+ KPF + EL+ + A+
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_0637  SHAPEPROTEIN  134    9e-37    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 134 bits (338), Expect = 9e-37
Identities = 74/368 (20%), Positives = 141/368 (38%), Gaps = 66/368 (17%)

Query: 2 ARAVGIDLGTTNSVVAVLEGGDP-----VVVANSEGSRTTPSVVAFARNGEVLVGQPAKN 56
+ + IDLGT N+++ V G VV + + + SV A VG AK
Sbjct: 10 SNDLSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGSPKSVAA--------VGHDAKQ 61

Query: 57 QAVTNVD--RTIRSVKRHMGGDWSIEIDDKKYTAPEISARVLMKLKRDAEAYLGEDIADA 114
IR +K + D+ + ++ + ++ ++
Sbjct: 62 MLGRTPGNIAAIRPMKDGVIADF--------FVTEKMLQHFIKQVHSNS---FMRPSPRV 110

Query: 115 VITVPAYFNDAQRQATKDAGQIAGLNVLRIVNEPTAAALAYGLDKGEKEQTILVFDLGGG 174
++ VP +R+A +++ Q AG + ++ EP AAA+ GL E + +V D+GGG
Sbjct: 111 LVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGS-MVVDIGGG 169

Query: 175 TFDVSLLEIGEGVVEVRATSGDNHLGGDDWDDRVVEWLVDKFKGTSGIDLTKDKMAMQRL 234
T +V+++ + V S +GGD +D+ ++ ++ + G
Sbjct: 170 TTEVAVISLNGVV-----YSSSVRIGGDRFDEAIINYVRRNYGSLIG------------- 211

Query: 235 REAAEKAKIELSSS----QSTSINLPYITVDAD--KNPLFLDEQLTRAEFQRITQDL--- 285
AE+ K E+ S+ + I + + + ++ A + +T +
Sbjct: 212 EATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVPRGFTLNSNEILEALQEPLTGIVSAV 271

Query: 286 ---LDRTRKPFQSVIADTGISVSDIDHVVLVGGSTRMPAVTELVKELTGGKEPNKGVNPD 342
L++ S I++ G+ VL GG + + L+ E T G +P
Sbjct: 272 MVALEQCPPELASDISERGM--------VLTGGGALLRNLDRLLMEET-GIPVVVAEDPL 322

Query: 343 EVVAVGAA 350
VA G
Sbjct: 323 TCVARGGG 330


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0641  cloacin  39     3e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.5 bits (89), Expect = 3e-04
Identities = 29/108 (26%), Positives = 43/108 (39%), Gaps = 6/108 (5%)

Query: 242 SGNIGNGNNGDGNLGGGNLGSYNLGWGNLGGANQGFGNAGSNNQGFANTGSNNQGFANTG 301
SG G G+N + GN+ G G GGA+ G G + NN +GS +G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 302 SFNVGFGNTGSNNIGIGLSGDG-----KIGFGSLNS-GSGNIGLFNSG 343
N G G G + GF +L++ G+G + + S
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109



Score = 33.1 bits (75), Expect = 0.016
Identities = 27/103 (26%), Positives = 35/103 (33%), Gaps = 11/103 (10%)

Query: 229 NVGTSNLGFGNIGSGNIGNGNNGDGNLGGGNLGSYNLGWGNLGGANQG-FGNAGSNNQGF 287
N G + GNI G G G G + G G S N WG G+ G +G N G
Sbjct: 10 NTGAHSTS-GNINGGPTGLGVGGGASDGSG-WSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 288 ANTGSNNQGFANTGSFN---VGFG-----NTGSNNIGIGLSGD 322
G S V FG G+ + + +S
Sbjct: 68 NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 33.1 bits (75), Expect = 0.017
Identities = 23/81 (28%), Positives = 29/81 (35%), Gaps = 2/81 (2%)

Query: 2024 NTGDGNIGFGNTGDGNIGIGLNGDGLRGFEALNSGTDNVGLFNSGTGNVGIGNSGTGNWG 2083
NTG + GN G G+G+ G G + G SG G G G
Sbjct: 10 NTGAHSTS-GNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGN 68

Query: 2084 IGNSGNYNTGVGNTGAANTGM 2104
GNSG + GN A +
Sbjct: 69 -GNSGGGSGTGGNLSAVAAPV 88



Score = 32.8 bits (74), Expect = 0.020
Identities = 27/104 (25%), Positives = 35/104 (33%), Gaps = 1/104 (0%)

Query: 729 GFGNTGNGNIGFGNTGNGNIGIGLNGDGLQGFGGWNSGSGNIGLFNSGTDNVGIGNSGTG 788
G G+ + GN G G+G+ G G G + + G SG G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 789 NSGIGNTGSYNTGIGNVGVANTGLFNIGNLNT-GIGNPGNYNSG 831
+ G TG VA F L+T G G S
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109



Score = 32.4 bits (73), Expect = 0.026
Identities = 24/87 (27%), Positives = 34/87 (39%), Gaps = 4/87 (4%)

Query: 754 GDGLQGFGGWNSGSGNIGLFNSGTDNVGIGNSGTGNSGIGNTGSYNTGIGNVGVANTGLF 813
GDG G +S SGNI N G +G+G + SG + + G G+ G
Sbjct: 4 GDGRGHNTGAHSTSGNI---NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 814 NIGNLNTGIGNPGNYNSGAHNVGSTNT 840
GN GN G + N+ +
Sbjct: 61 GHGNGGGN-GNSGGGSGTGGNLSAVAA 86



Score = 32.4 bits (73), Expect = 0.028
Identities = 25/79 (31%), Positives = 33/79 (41%), Gaps = 5/79 (6%)

Query: 1213 GDGLQGFGGWNSGSGNIGVFNSGTDNVGIGNSGTGNSGIGNSGSYNTGIGNSGLANTGLF 1272
GDG G +S SGNI N G +G+G + SG + + G SG+ G
Sbjct: 4 GDGRGHNTGAHSTSGNI---NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG-- 58

Query: 1273 NSGSFNTGIGNVGSYNTGT 1291
SG N G +GT
Sbjct: 59 GSGHGNGGGNGNSGGGSGT 77



Score = 32.0 bits (72), Expect = 0.031
Identities = 25/76 (32%), Positives = 33/76 (43%), Gaps = 3/76 (3%)

Query: 1181 NTGSNNIGIGNTGDGNIGFGNTGNDNTGIGLNGDGLQGFGGWNSGSGNIGVFNSGTDNVG 1240
NTG+++ GN G G G G + G G + + GG SG G SG N G
Sbjct: 10 NTGAHSTS-GNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG--GSGHGNGG 66

Query: 1241 IGNSGTGNSGIGNSGS 1256
+ G SG G + S
Sbjct: 67 GNGNSGGGSGTGGNLS 82



Score = 32.0 bits (72), Expect = 0.033
Identities = 22/77 (28%), Positives = 29/77 (37%), Gaps = 1/77 (1%)

Query: 1191 NTGDGNIGFGNTGNDNTGIGLNGDGLQGFGGWNSGSGNIGVFNSGTDNVGIGNSGTGNSG 1250
NTG + GN TG+G+ G G G + + G SG G G G
Sbjct: 10 NTGAHSTS-GNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGN 68

Query: 1251 IGNSGSYNTGIGNSGLA 1267
+ G TG S +A
Sbjct: 69 GNSGGGSGTGGNLSAVA 85



Score = 32.0 bits (72), Expect = 0.037
Identities = 25/76 (32%), Positives = 31/76 (40%), Gaps = 3/76 (3%)

Query: 722 NTGDNNIGFGNTGNGNIGFGNTGNGNIGIGLNGDGLQGFGGWNSGSGNIGLFNSGTDNVG 781
NTG ++ GN G G G G + G G + + GG SG G SG N G
Sbjct: 10 NTGAHSTS-GNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG--GSGHGNGG 66

Query: 782 IGNSGTGNSGIGNTGS 797
+ G SG G S
Sbjct: 67 GNGNSGGGSGTGGNLS 82


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0642  cloacin  34     0.009    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 34.3 bits (78), Expect = 0.009
Identities = 24/78 (30%), Positives = 30/78 (38%), Gaps = 2/78 (2%)

Query: 251 NTGNNNIGFGNTGDNNRGIGLTGTGQFGLGGLNSGSGNIGLFNSGTGNFGIGNSGTGNWG 310
NTG ++ GN G+G+ G G G + + G SG G G G
Sbjct: 10 NTGAHSTS-GNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGN 68

Query: 311 IGNSGNSYNTGIGNSGDA 328
GNSG TG S A
Sbjct: 69 -GNSGGGSGTGGNLSAVA 85



Score = 32.4 bits (73), Expect = 0.036
Identities = 29/89 (32%), Positives = 36/89 (40%), Gaps = 3/89 (3%)

Query: 250 GNTGNNNIGFGNTGDNNRGIGLTGTGQFGLGGLNSGSGNIGLFNSGTGNFGIGNSGTGNW 309
G+ +N G +T N G TG GG + GSG N G G G G
Sbjct: 4 GDGRGHNTGAHSTSGNING---GPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 310 GIGNSGNSYNTGIGNSGDANTGFFNAGVA 338
G GN G + N+G G+ N A VA
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVA 89


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
MMAR_0645  HTHFIS  43     4e-06    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 42.9 bits (101), Expect = 4e-06
Identities = 39/184 (21%), Positives = 67/184 (36%), Gaps = 30/184 (16%)

Query: 550 AGRMLEGETAKLLRMEDEL--GHRVIGQKKAVQAVSDAVRRSRAGVADPNRPTGSFMFLG 607
GR L + ++ED+ G ++G+ A+Q + + R + + M G
Sbjct: 115 IGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLAR----LMQTDLTL---MITG 167

Query: 608 PTGVGKTELAKALAEFLFDDERAMVRIDMSEYGEKHSVARLVGAPPGYIGYDQGGQLTEA 667
+G GK +A+AL ++ V I+M+ + L G + G T A
Sbjct: 168 ESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGH--------EKGAFTGA 219

Query: 668 VRRRPYTV-------ILFDEIEKAHPDVFDVLLQVLDEG---RLTDGQGRTVDFRNTILI 717
R + DEI D LL+VL +G + D R ++
Sbjct: 220 QTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVR---IV 276

Query: 718 LTSN 721
+N
Sbjct: 277 AATN 280


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0647  PF05616  30     0.008    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 30.5 bits (68), Expect = 0.008
Identities = 19/68 (27%), Positives = 27/68 (39%), Gaps = 1/68 (1%)

Query: 201 PLPQESPQEAEESEPAQSGNRSLTPSRRPELPPRRAQVDPAAGLLPDASRRTPEPMRREE 260
PLP+ SP E + PA + N P+ P+ P +P P +P R
Sbjct: 327 PLPEVSPAENPANNPAPNENPGTRPNPEPD-PDLNPDANPDTDGQPGTRPDSPAVPDRPN 385

Query: 261 GRSEGSRR 268
GR R+
Sbjct: 386 GRHRKERK 393


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_0648  DHBDHDRGNASE  69     5e-16    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 68.9 bits (168), Expect = 5e-16
Identities = 52/184 (28%), Positives = 89/184 (48%), Gaps = 4/184 (2%)

Query: 5 VALITGPTSGIGAGYARRYAQDGYDLILVARDVDRLKQLAVELEDDAGNVEILPADLADA 64
+A ITG GIG AR A G + V + ++L+++ L+ +A + E PAD+ D+
Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDS 69

Query: 65 AGRDKVAERLSR---GVRVLVNNAGFATSGEFWETEPAALQAQLDVNVTAVMQLTRAALP 121
A D++ R+ R + +LVN AG G +A VN T V +R+
Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSK 129

Query: 122 PMLAAGAGTVINIAS-VAGLLSGRGSTYSASKAWVISFSEGLSTGLEGTGVGVHAVCPGY 180
M+ +G+++ + S AG+ + Y++SKA + F++ L L + + V PG
Sbjct: 130 YMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGS 189

Query: 181 VHTE 184
T+
Sbjct: 190 TETD 193


82  MMAR_0755  MMAR_0762  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias   Product
MMAR_0755   0      23       6.138113  PE-PGRS family protein
MMAR_0756   0      20       1.569532  molybdopterin biosynthesis protein MoeA2
MMAR_0757  -1      20       0.950609  short chain dehydrogenase
MMAR_0758   0      20       1.086900  hypothetical protein
MMAR_0759   0      18       1.348979  chaperonin GroEL
MMAR_0760   0      13       1.949083  hypothetical protein
MMAR_0761   1      13       1.686199  PPE family protein
MMAR_0762   2      13       1.615807  D-amino acid aminohydrolase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0755  cloacin  37     4e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 37.0 bits (85), Expect = 4e-04
Identities = 33/95 (34%), Positives = 40/95 (42%), Gaps = 1/95 (1%)

Query: 431 TGGHGAAGAVAGGDGGRGGTGGGLAGSGGTGGNGAVGTVSGVGGDGGNAAGLFGDGGTGG 490
TG H +G + GG G G GG GSG + N G SG G G +G G+GG G
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG-HGNGGGNG 69

Query: 491 NGGLATAGGAGGDGGAGGKAALIGSGGNGGAGGSG 525
N G + G A A + GAGG
Sbjct: 70 NSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 36.2 bits (83), Expect = 7e-04
Identities = 37/136 (27%), Positives = 50/136 (36%)

Query: 802 GGAGGTGGGAATLFGAGGAGGAGAIGLDTGGAGGAGGSAGALSGTGGAGGAGGIGVGGGG 861
GG G A GG +G+ G + G+G S+ GG+G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 862 GGGGVGGAGGNAGVVYGDGGAGGAGGGGTVAAGAAGGSGGNAAMLFGNGGAGGTGAVGAA 921
G GG G G G+ A A A + G+GG A + + + AA
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMAA 122

Query: 922 VGGNGGTGGDGGGLSG 937
+ G G G L G
Sbjct: 123 LKGPFKFGLWGVALYG 138



Score = 35.5 bits (81), Expect = 0.001
Identities = 40/128 (31%), Positives = 51/128 (39%), Gaps = 6/128 (4%)

Query: 642 LSGGAGVGGTGGIGAVLGGAGGTGGMAGLFGAGGAGGEGGGGQSGGAGGAGGVGLFGAGG 701
+SGG G G G + G G G+ G G + G G ++ GG G G+ GG
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGV-GGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 702 NGGTGGFSLVTGGAGGAGGASLLSGNGGAGGAGGIGGTGAGGAGGDGGAAGAFSGNGGAG 761
+G G GG G +GG S GN A A G A G GG A + S +
Sbjct: 60 SGHGNG-----GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSA 114

Query: 762 GAGGIAPA 769
I A
Sbjct: 115 AIADIMAA 122



Score = 34.3 bits (78), Expect = 0.003
Identities = 32/119 (26%), Positives = 41/119 (34%), Gaps = 2/119 (1%)

Query: 724 LSGNGGAGGAGGIGGTGAGGAGGDGGAAGAFSGNGGAGGAGGIAPAGTEGGAGGAGGNAG 783
+SG G G G T GG G + G+G + P G GG+G G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWG--GGSGSGIHWGG 58

Query: 784 VFSGTGGAGGAGGAGQTVGGAGGTGGGAATLFGAGGAGGAGAIGLDTGGAGGAGGSAGA 842
G G G + G + A FG GA GL + GA +A A
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIA 117



Score = 34.3 bits (78), Expect = 0.003
Identities = 35/119 (29%), Positives = 44/119 (36%), Gaps = 2/119 (1%)

Query: 737 GGTGAGGAGGDGGAAGAFSGNGGAGGAGGIAPAGTEGGAGGAGGNAGVFSGTGGAGGAGG 796
GG G G G +G +G G GG A G+ G G SG+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGS--GWSSENNPWGGGSGSGIHWGGGS 60

Query: 797 AGQTVGGAGGTGGGAATLFGAGGAGGAGAIGLDTGGAGGAGGSAGALSGTGGAGGAGGI 855
GG G +GGG+ T A G GAGG A ++S + I
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADI 119



Score = 34.3 bits (78), Expect = 0.003
Identities = 39/109 (35%), Positives = 49/109 (44%), Gaps = 3/109 (2%)

Query: 115 GNGANGAAGTGASGGDGGILIGNGGAGGSGATGLTGGAGGNGGAAGLLAGTAGAGGSGGL 174
G G N A + + +GG G G S +G + GG +G +G GGSG
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSG--SGIHWGGGSGHG 63

Query: 175 GAAGAGGAGGQGGTGGLFSAGGAGGTGGVGASGGTGGAGGLGLFGAGGA 223
G G +GG GTGG SA A G A T GAGGL + + GA
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPAL-STPGAGGLAVSISAGA 111



Score = 34.3 bits (78), Expect = 0.003
Identities = 36/108 (33%), Positives = 44/108 (40%), Gaps = 2/108 (1%)

Query: 774 GAGGAGGNAGVFSGTGGA-GGAGGAGQTVGGAGGTGGGAA-TLFGAGGAGGAGAIGLDTG 831
G G G N G S +G GG G G G + G+G + +G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 832 GAGGAGGSAGALSGTGGAGGAGGIGVGGGGGGGGVGGAGGNAGVVYGD 879
G GG G++G SGTGG A V G GAGG A +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 33.9 bits (77), Expect = 0.004
Identities = 37/115 (32%), Positives = 50/115 (43%), Gaps = 10/115 (8%)

Query: 676 AGGEGGGGQSGGAGGAGGVGLFGAGGNGGTGGFSLVTGGAGGAGGASLLSGNGGAGGAG- 734
+GG+G G +G +G + NGG G + G + G+G +S + GG G+G
Sbjct: 2 SGGDGRGHNTGAHSTSGNI-------NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGI 54

Query: 735 GIGGTGAGGAGGDGGAAGAFSGNGGAGG--AGGIAPAGTEGGAGGAGGNAGVFSG 787
GG G GG G +G SG GG A +A GAGG A S
Sbjct: 55 HWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109



Score = 33.5 bits (76), Expect = 0.004
Identities = 35/110 (31%), Positives = 45/110 (40%), Gaps = 4/110 (3%)

Query: 905 MLFGNGGAGGTGAVGAAVGGNGGTGGDGGGLSGSGGAGGNGAGGGTGGTGGDGGRARGLL 964
M G+G TGA + NGG G G G S G+G + GG G G G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 965 GDGGTGGDGGFGGITSGDGGNGGTGALIGDG----GNGGAGGIGLGAAPG 1010
G G GG+G GG + G A + G GAGG+ + + G
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 32.8 bits (74), Expect = 0.007
Identities = 24/75 (32%), Positives = 31/75 (41%)

Query: 196 GAGGTGGVGASGGTGGAGGLGLFGAGGAGGAGGMANATVGGTGGAGGASLLFGNGGAGGL 255
G G G ++ G G GL GGA G ++ GG+G G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 256 GGGGATAGGAGGQGG 270
GG G + GG+G G
Sbjct: 66 GGNGNSGGGSGTGGN 80



Score = 32.4 bits (73), Expect = 0.010
Identities = 31/97 (31%), Positives = 36/97 (37%), Gaps = 1/97 (1%)

Query: 940 GAGGNGAGGGTGGTGGDGGRARGLLGDGGTGGDGGFGGITSGDGGNGGTGALIGDGGNGG 999
G G G G T G+ LG GG DG + G GG+G+ I GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWG-GGSGSGIHWGGGSG 61

Query: 1000 AGGIGLGAAPGGDGGKGGDAQLVGTGGNGGILGLGLP 1036
G G GG G GG+ V G L P
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTP 98



Score = 32.4 bits (73), Expect = 0.010
Identities = 29/84 (34%), Positives = 35/84 (41%)

Query: 212 AGGLGLFGAGGAGGAGGMANATVGGTGGAGGASLLFGNGGAGGLGGGGATAGGAGGQGGD 271
+GG G GA G N G G GGAS G GGG+ +G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 272 AGTFYGDGGVGGAGGAGGNIPGSS 295
G G+G GG G GGN+ +
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVA 85



Score = 32.4 bits (73), Expect = 0.012
Identities = 36/106 (33%), Positives = 42/106 (39%), Gaps = 8/106 (7%)

Query: 127 SGGDG----GILIGNGGAGGSGATGLTGGAGGNGGAAGLLAGTAGAGGSG-GLGAAGAGG 181
SGGDG G G TGL G G + G+ GGSG G+ G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 182 AGGQGGTGGLFSAGGAGGTGGVGASGGTGGAGGLGLFGAGGAGGAG 227
G GG G +GG GTGG ++ A G GAGG
Sbjct: 62 HGNGGGNGN---SGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.4 bits (73), Expect = 0.012
Identities = 33/106 (31%), Positives = 38/106 (35%), Gaps = 4/106 (3%)

Query: 865 GVGGAGGNAGVVYGDGGAGGAGGGGTVAAGAAGGSGGNAAMLFGNGGAGGTGAVGAAVGG 924
G G G N G G G G V GA+ GSG ++ N GG G GG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSE----NNPWGGGSGSGIHWGG 58

Query: 925 NGGTGGDGGGLSGSGGAGGNGAGGGTGGTGGDGGRARGLLGDGGTG 970
G G GG + GG+G G G A G GG
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.0 bits (72), Expect = 0.014
Identities = 32/105 (30%), Positives = 37/105 (35%)

Query: 397 GDGGAGGAGGIGGAAAGGKGGDGGDAATLFGSGGTGGHGAAGAVAGGDGGRGGTGGGLAG 456
G G GA G GG G G GSG + + G +G GG G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 457 SGGTGGNGAVGTVSGVGGDGGNAAGLFGDGGTGGNGGLATAGGAG 501
G G GT + A F T G GGLA + AG
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 31.6 bits (71), Expect = 0.021
Identities = 30/83 (36%), Positives = 33/83 (39%), Gaps = 4/83 (4%)

Query: 310 GDGGAGGTGGVATSAGGAGGAGGNAADLVGTGGVGGAGGTSFDAGGAGGIGGSAGALFGA 369
GDG TG +TS GG G L GG G S + GG GS G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTG----LGVGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 370 GGAGGAGGFGQVSGGAGGAGGNS 392
G G GG G GG+G G S
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLS 82



Score = 31.2 bits (70), Expect = 0.024
Identities = 34/104 (32%), Positives = 40/104 (38%), Gaps = 3/104 (2%)

Query: 618 AGGSGAVNGLGTGQAGGN--GGAAGLLSGGAGVGGTGGIGAVLGGAGGTGGMAGLFGAGG 675
+GG G + G GN GG GL GG G+G GG+G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 676 AGGEGGGGQSGGAGGAGGVGLFGAGGNGGTGGFSLVTGGAGGAG 719
G GGG + G G G L G +L T GAGG
Sbjct: 62 HGN-GGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 31.2 bits (70), Expect = 0.026
Identities = 32/127 (25%), Positives = 44/127 (34%), Gaps = 3/127 (2%)

Query: 287 AGGNIPGSSGGAGGAGGNAGLFHGDGGAGGTGGVATSAGGAGGAGGNAADLVGTGGVGGA 346
+GG+ G + GA GN +G G GG A+ G G+G G
Sbjct: 2 SGGDGRGHNTGAHSTSGN---INGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 347 GGTSFDAGGAGGIGGSAGALFGAGGAGGAGGFGQVSGGAGGAGGNSGMVYGDGGAGGAGG 406
G + GG G GG +G FG + GAGG + + +
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118

Query: 407 IGGAAAG 413
I A G
Sbjct: 119 IMAALKG 125



Score = 30.8 bits (69), Expect = 0.034
Identities = 35/119 (29%), Positives = 41/119 (34%), Gaps = 8/119 (6%)

Query: 457 SGGTGGNGAVGTVSGVGGDGGNAAGLFGDGGTGGNGGLATAGGAGGDGGAGGKAALIGSG 516
SGG G G S G G GL GG G ++ G G G GSG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 517 GNGGAGGSGTVDPGGNGGRGGDAQLFGTGGNGGNPGLGVPAGTAGEAGAPGLATSNQAL 575
G G + GG G GG+ G P L P AG ++ S AL
Sbjct: 62 HGNGGGNGNS---GGGSGTGGNLSAVAAPVAFGFPALSTPG-----AGGLAVSISAGAL 112



Score = 30.8 bits (69), Expect = 0.035
Identities = 34/104 (32%), Positives = 44/104 (42%), Gaps = 1/104 (0%)

Query: 355 GAGGIGGSAGALFGAGGA-GGAGGFGQVSGGAGGAGGNSGMVYGDGGAGGAGGIGGAAAG 413
G G G + GA +G GG G G G + G+G +S GG+G GG +
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 414 GKGGDGGDAATLFGSGGTGGHGAAGAVAGGDGGRGGTGGGLAGS 457
G GG G++ G+GG AA G GGLA S
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVS 106



Score = 30.5 bits (68), Expect = 0.039
Identities = 31/85 (36%), Positives = 37/85 (43%), Gaps = 7/85 (8%)

Query: 885 AGGGGTVAAGAAGGSGGNAAMLFGNGGAGGTGAVGAAVGGNG--GTGGDGGGLSGSGGAG 942
+GG G A + GN NGG G G G A G+G GG SGSG
Sbjct: 2 SGGDGRGHNTGAHSTSGNI-----NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHW 56

Query: 943 GNGAGGGTGGTGGDGGRARGLLGDG 967
G G+G G GG G+ G G G+
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGTGGNL 81


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_0757  DHBDHDRGNASE  57     1e-11    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 57.4 bits (138), Expect = 1e-11
Identities = 52/210 (24%), Positives = 88/210 (41%), Gaps = 22/210 (10%)

Query: 21 GRVVVVTGANTGLGYHTAEALAGRGAHVVLAVRNPEKGNAAVAQIVAAKPQADVTLQALD 80
G++ +TGA G+G A LA +GAH+ NPEK V+ + A A+ D
Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAF--PAD 65

Query: 81 LSSLDSVRSAADALRSAYPRIDLLINNAGV--MWTPKQVTKDGFEMQFGTNHLGHFALTG 138
+ ++ + ID+L+N AGV ++ + +E F N G F +
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 139 LLLDHLLPVPGSRVITV-SSLGHRIRAAIHFDDLQWERSYNRVAAYGQSKLANLLFTYEL 197
+ +++ ++TV S+ R ++ AAY SK A ++FT L
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSM--------------AAYASSKAAAVMFTKCL 171

Query: 198 QRRLAADSQAATIAVAAHPGGSNTELARNL 227
LA + I PG + T++ +L
Sbjct: 172 GLELAEYNIRCNI---VSPGSTETDMQWSL 198


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0759  PF06917  31     0.017    Periplasmic pectate lyase
		>PF06917#Periplasmic pectate lyase

Length = 555

Score = 30.7 bits (69), Expect = 0.017
Identities = 12/50 (24%), Positives = 25/50 (50%)

Query: 193 FDKGYISGYFVTDAERQEAVLEDPYILLVSSKVSTVKDLLPLLEKVIQGG 242
F + Y G FV A+ + +++P L + + ++ +D L + + I G
Sbjct: 477 FKRHYHRGLFVESAQHRYFRIDNPIALALLTLIAAKQDKLAAIPQFITNG 526


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0761  cloacin  34     0.001    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 34.3 bits (78), Expect = 0.001
Identities = 23/94 (24%), Positives = 35/94 (37%), Gaps = 7/94 (7%)

Query: 231 GSGNTGNSNVGLGNLGSGNVGFGNTGNGDFGFGLTGDHQFGFGGFNSGSGNVGIGNSGTG 290
G G+ ++ GN+ G G G G G G + ++ GG SG G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG-- 63

Query: 291 NVGFFNSGNGNMGIGNSGSLNSGLGNSGSMSTGF 324
N G G SG+ + + ++ GF
Sbjct: 64 -----NGGGNGNSGGGSGTGGNLSAVAAPVAFGF 92



Score = 30.5 bits (68), Expect = 0.016
Identities = 34/114 (29%), Positives = 54/114 (47%), Gaps = 7/114 (6%)

Query: 266 GDHQFGFGGFNSGSGNVGIGNSGTGNVGFFNSGNG--NMGIGNSGSLNSGLGNSGSMSTG 323
GD + G +S SGN+ G +G G G + G+G + G SG+ G G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 324 FGTASMSSGMWQSMHGSDMASSTSLA----SSATYATGGTA-TLSSGILSSALA 372
G + +SG G+ A + +A + +T GG A ++S+G LS+A+A
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIA 117



Score = 30.1 bits (67), Expect = 0.023
Identities = 24/83 (28%), Positives = 34/83 (40%)

Query: 197 TGNVGNNNVGNNNWGSGNTGSSNVGTGNTGSSNIGSGNTGNSNVGLGNLGSGNVGFGNTG 256
+G G + + SGN G G G ++ GSG + +N G GSG G +G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 257 NGDFGFGLTGDHQFGFGGFNSGS 279
+G+ G G GG S
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAV 84



Score = 29.7 bits (66), Expect = 0.028
Identities = 26/95 (27%), Positives = 36/95 (37%), Gaps = 3/95 (3%)

Query: 195 SGTGNVGNNNVGNNNWGSGNTGSSNVGTGNTGSSNIGSGNTGNSNVGLGNLGSGNVGFGN 254
SG N G +G S +G S+ G S G G S G GN G G
Sbjct: 17 SGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS--GHGNGGGNGNSGGG 74

Query: 255 TGNGDFGFGLTGDHQFGFGGFNS-GSGNVGIGNSG 288
+G G + FGF ++ G+G + + S
Sbjct: 75 SGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
MMAR_0762  UREASE  53     3e-09    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 52.8 bits (127), Expect = 3e-09
Identities = 30/100 (30%), Positives = 44/100 (44%), Gaps = 13/100 (13%)

Query: 4 DLLIRNGTIVDGLGGEPYVGDVAVRDGIIVAVGP---PD--DSVN--GDAAGRVIDASGL 56
D +I N I+D G D+ ++DG I A+G PD V VI G
Sbjct: 69 DTVITNALILDHWG--IVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGK 126

Query: 57 LVTPGFVDLHTHYDGQSIWSDRLTPSSAHGVTTVLMGNCG 96
+VT G +D H H+ I ++ + G+T +L G G
Sbjct: 127 IVTAGGMDSHIHF----ICPQQIEEALMSGLTCMLGGGTG 162


83  MMAR_0923  MMAR_0932  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_0923  1       15       -0.830439  transcriptional regulator
MMAR_0924  0       15       -1.001552  carveol-like dehydrogenase
MMAR_0925  0       13       -1.949834  ion antiporter, NhaP
MMAR_0926  1       13       -3.308182  PPE family protein
MMAR_0927  1       22       -5.538418  hypothetical protein
MMAR_0928  0       23       -5.251698  cytochrome P450 189A6 Cyp189A6
MMAR_0929  2       25       -5.328720  TetR family transcriptional regulator
MMAR_0930  2       26       -4.939825  transcriptional regulatory protein EmbR_2
MMAR_0931  2       20       -3.691621  hypothetical protein
MMAR_0932  1       18       -2.801909  PPE family protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0923  HTHTETR  67     7e-16    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 66.6 bits (162), Expect = 7e-16
Identities = 22/185 (11%), Positives = 63/185 (34%), Gaps = 12/185 (6%)

Query: 5 AERGAQTRAALMAAAVAVIAERGWGAATTRMVAERAGLPPGLVHYHFASLNDLLIDAALQ 64
+ +TR ++ A+ + +++G + + +A+ AG+ G +++HF +DL +
Sbjct: 6 KQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLF-SEIWE 64

Query: 65 AAREEAAQVLDGLAGDSPSQGIDRLIDAVSSYDVDDRNQNPAILVFGEMLLAATRYERLR 124
+ ++ P + L + + ++ + E++ +
Sbjct: 65 LSESNIGELELEYQAKFPGDPLSVLREILIHV-LESTVTEERRRLLMEIIFHKCEFVGEM 123

Query: 125 MGLAEILGDYRSALRQWLADQGGA----------IDPEATAALMFAAIDGLVLHRVIDPR 174
+ + + + + A +M I GL+ + + P+
Sbjct: 124 AVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQ 183

Query: 175 LRTLA 179
L
Sbjct: 184 SFDLK 188


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_0924  DHBDHDRGNASE  111    2e-31    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 111 bits (278), Expect = 2e-31
Identities = 70/251 (27%), Positives = 119/251 (47%), Gaps = 17/251 (6%)

Query: 3 ALDGRVALITGGARGQGRAHALALAGQGADIALADAPGPMAELTYPLGSEEDLLATAELV 62
++G++A ITG A+G G A A LA QGA IA D + E L +
Sbjct: 5 GIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDY------------NPEKLEKVVSSL 52

Query: 63 GQLGRRCLPMVVDVRDAAQVNTAVERTVRELGSLDIVLANAGIVSTGRLEEVSDQVWQQL 122
R DVRD+A ++ R RE+G +DI++ AG++ G + +SD+ W+
Sbjct: 53 KAEARHAEAFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEAT 112

Query: 123 MDTNLTGVFHTLRAAIPVMRQQRFGRIVATSSMGGRMGIPELAAYNATKWGIIGLIKSVA 182
N TGVF+ R+ M +R G IV S + +AAY ++K + K +
Sbjct: 113 FSVNSTGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLG 172

Query: 183 LEVAKEGITANVICPTTTQTPMVQPAGIGDDQEVPDDLVRRMMKANPIPQPW---LQPED 239
LE+A+ I N++ P +T+T M ++ + +++ ++ P +P D
Sbjct: 173 LELAEYNIRCNIVSPGSTETDMQWSLWADENGA--EQVIKGSLETFKTGIPLKKLAKPSD 230

Query: 240 VSRGVVYLVTD 250
++ V++LV+
Sbjct: 231 IADAVLFLVSG 241


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_0925  ACRIFLAVINRP  29     0.039    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 29.0 bits (65), Expect = 0.039
Identities = 17/71 (23%), Positives = 32/71 (45%), Gaps = 11/71 (15%)

Query: 54 PIVVALADVALFTVLFTDGQRANVRELRETWTLSGRALGVGMPLTMIGIAVPAHFLTGLN 113
P +VA++ V +F L L E+W++ + + +PL ++G + A L
Sbjct: 873 PALVAISFVVVFLCLAA---------LYESWSIPVSVM-LVVPLGIVG-VLLAATLFNQK 921

Query: 114 WPTAFLVGAIL 124
F+VG +
Sbjct: 922 NDVYFMVGLLT 932


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0926  cloacin  38     1e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.2 bits (88), Expect = 1e-04
Identities = 27/84 (32%), Positives = 38/84 (45%), Gaps = 13/84 (15%)

Query: 564 NSGNTNTGLWNAGNVNTGFGGIGTYSGNSGFFNSGTGNSGFFNSSDDNSGFGNSSSGGHN 623
N+G +T GN+N G G+G G + G SS++N G S SG H
Sbjct: 10 NTGAHSTS----GNINGGPTGLGV---------GGGASDGSGWSSENNPWGGGSGSGIHW 56

Query: 624 SGAANSGSGGYNAGFGNSNTGGGS 647
G + G+GG N G + GG+
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGTGGN 80


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0929  HTHTETR  52     1e-10    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 52.3 bits (125), Expect = 1e-10
Identities = 27/183 (14%), Positives = 57/183 (31%), Gaps = 22/183 (12%)

Query: 19 AVRRDDRILDIVVHLLQTEGYDAVQLREVARRARTSLATIYKRYANRDELILAALEFWMD 78
A ILD+ + L +G + L E+A+ A + IY + ++ +L E +
Sbjct: 9 AQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE--LS 66

Query: 79 EHHYAGLAEQTPAPGESLYAGMMRVLRTIFQPWETHPDIVKAYFRARAAPGGQRLVHRGL 138
E + L + A P +I+ + +RL
Sbjct: 67 ESNIGELELEYQAKFPG-------------DPLSVLREILIHVLESTVTEERRRL----- 108

Query: 139 DMVVPAAMEVLAGVDENFIHDLDTVISSLVYGLLGRFTAGEIAITEILPSID-RTVFWLI 197
++ + + + + Y + + I + + R ++
Sbjct: 109 -LMEIIFHKCEFVGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIM 167

Query: 198 RGY 200
RGY
Sbjct: 168 RGY 170


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
MMAR_0932  CABNDNGRPT  40     5e-05    NodO calcium binding signature.
		>CABNDNGRPT#NodO calcium binding signature.

Length = 479

Score = 40.3 bits (94), Expect = 5e-05
Identities = 49/229 (21%), Positives = 74/229 (32%), Gaps = 13/229 (5%)

Query: 237 NFGSGNLGSSNFGWAN---LGSNNIGVANAGGGNQGFGNIGNVNTGFGNTGIGNFGLANT 293
N GS G F LG + G NAG G+ + + + + + +G T
Sbjct: 175 NPGSEEYGRQTFTHEIGHALGLAHPGEYNAGEGDPSYNDAVYAEDSYQFSIMSYWGENET 234

Query: 294 GNNNIGIALTGDNQIGIGGLNSGVGNFGLFNSGTGNVGFF-NSGNGNFGIGNTGDFNTGV 352
G + G I + G +G GF N+ + ++
Sbjct: 235 GADYNGHYGGAPMIDDIAAIQRLYGANMTTRTGDSVYGFNSNTDRDFYTATDSSKALIFS 294

Query: 353 WNSGSGNSGFFNPGMFNTGVLDVGNANTGYLNTGSYNMGSFNPGASNTGAFNIGDGNTGW 412
G F G N +++ + + N+ S G + A G GN
Sbjct: 295 VWDAGGTDTFDFSGYSNNQRINLNEGSFSDVGGLKGNV-SIAHGVTIENAIG-GSGNDIL 352

Query: 413 F-NNGD--LNTGALN---FGDMNNGLLNTGDLNNGFFYRGVGQGSLHFA 455
N+ D L GA N +G L G + F Y G GQ S A
Sbjct: 353 VGNSADNILQGGAGNDVLYGGAGADTLYGGAGRDTFVY-GSGQDSTVAA 400



Score = 31.1 bits (70), Expect = 0.028
Identities = 26/156 (16%), Positives = 42/156 (26%), Gaps = 21/156 (13%)

Query: 202 QLIGVNLGLANVGSGNVGNANNGLGNIGN----------GNLGNGNFGSGNLGSSNFGWA 251
IG LGLA+ G N G + + G +G + ++G A
Sbjct: 188 HEIGHALGLAHPGEYNAGEGDPSYNDAVYAEDSYQFSIMSYWGENE--TGADYNGHYGGA 245

Query: 252 NLGSNNIGVANAGGGNQGFGNIGNVNTGFGNTGIGNFGLANTGNNNIGIALTGD------ 305
+ + + G N +V NT + ++ I
Sbjct: 246 PMIDDIAAIQRLYGANMTTRTGDSVYGFNSNTDRDFYTATDSSKALIFSVWDAGGTDTFD 305

Query: 306 --NQIGIGGLNSGVGNFGLFNSGTGNVGFFNSGNGN 339
+N G+F GNV G
Sbjct: 306 FSGYSNNQRINLNEGSFSDVGGLKGNV-SIAHGVTI 340


84  MMAR_0989  MMAR_0996  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_0989  1       16       3.390247   PE-PGRS family protein
MMAR_0990  3       21       -0.631121  50S ribosomal protein L10
MMAR_0991  5       25       -0.831980  50S ribosomal protein L7/L12
MMAR_0992  4       24       -0.839871  transcriptional regulatory protein
MMAR_0993  3       22       -0.650869  dioxygenase
MMAR_0994  3       22       -0.798559  ribonucleotide-transport ATP-binding protein ABC
MMAR_0995  3       21       -0.568207  DNA-directed RNA polymerase subunit beta
MMAR_0996  0       15       -0.030132  DNA-directed RNA polymerase subunit beta'
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0989  cloacin  39     8e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 39.3 bits (91), Expect = 8e-05
Identities = 31/100 (31%), Positives = 34/100 (34%)

Query: 706 GAGGRGGDGGAYGNGGVGGNGGAGGAGSPGAHGGTAGEDGFNAGDGGHGGAGGTGGGGGA 765
G GRG + GA+ G G G GA G+ N GG G GGG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 766 KGGVGGAGGSGGVGGQGGTGGDGATGLSGKAGTGTDGESG 805
G G GG G G A G T G G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 36.2 bits (83), Expect = 7e-04
Identities = 41/116 (35%), Positives = 48/116 (41%), Gaps = 11/116 (9%)

Query: 133 GSGQAGGNGGLLWGNGGNGGSGGVGQTGGAGGSAGLLGHGGAGGAGGVSGVSAVGATGGA 192
G G+ G NGG G+G GGA +G G G SG+ G +G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH- 62

Query: 193 GGNGGWLYGNGGAGGLGGQGVLTGGNGGAGGAARFFG--AGGTGGAGGLGLGDTGG 246
GNGG G G G TGGN A A FG A T GAGGL + + G
Sbjct: 63 --------GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 35.1 bits (80), Expect = 0.001
Identities = 29/88 (32%), Positives = 36/88 (40%)

Query: 779 GGQGGTGGDGATGLSGKAGTGTDGESGGRGGHAGNGGNGGDGGAGGVSHAPGFLDGADGA 838
GG G GA SG G G G G G+G + + GG S + G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 839 GGSGGAGGNGGNGANGGNGGSAPNPIGF 866
G GG G +GG GGN + P+ F
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAF 90



Score = 34.7 bits (79), Expect = 0.002
Identities = 31/92 (33%), Positives = 37/92 (40%), Gaps = 3/92 (3%)

Query: 881 GAAGGVGGTGGISGDGSTHAAPGDDGIMGNGGHGGHGGDGSRDNPDGGGHPGDGGYGGNG 940
G G TG S G+ + P G+ GG G S +NP GGG +GG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPT--GLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 941 GSGFNAGNGGNGGRGGSAHASLVGKAGNGGFG 972
G G N G GN G G +L A FG
Sbjct: 61 GHG-NGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 34.7 bits (79), Expect = 0.002
Identities = 28/91 (30%), Positives = 33/91 (36%)

Query: 741 AGEDGFNAGDGGHGGAGGTGGGGGAKGGVGGAGGSGGVGGQGGTGGDGATGLSGKAGTGT 800
+G DG G H +G GG G GGA G + G G+ G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 801 DGESGGRGGHAGNGGNGGDGGAGGVSHAPGF 831
G GG G G G GG+ A A GF
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGF 92



Score = 33.5 bits (76), Expect = 0.005
Identities = 28/92 (30%), Positives = 36/92 (39%), Gaps = 11/92 (11%)

Query: 798 TGTDGESGGRGGHAGNGGNGGDGGAGGVSHAPGFLDGADGAGGSGGAGGNGGNGANGGNG 857
+G DG G H+ +G G GV G G S G+G + N GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGV-----------GGGASDGSGWSSENNPWGGGS 50

Query: 858 GSAPNPIGFENGHNGGNGGNGGYGAAGGVGGT 889
GS + G NGG GN G G+ G +
Sbjct: 51 GSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLS 82



Score = 32.8 bits (74), Expect = 0.007
Identities = 31/101 (30%), Positives = 37/101 (36%)

Query: 368 TGGDGGVGGTGGVGGTGGEGGFLGQRGAGGAGGAGGAGGLAGDGGRGAAGTFAGGTGIGG 427
+GGDG TG +G G G GG G + G +G+ G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 428 AGGDGGNAGVGGAGGAGGAGSVTGAHGAEGARPIGGNGGAG 468
G GGN GG G GG S A A G + G G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 32.8 bits (74), Expect = 0.008
Identities = 27/87 (31%), Positives = 36/87 (41%), Gaps = 2/87 (2%)

Query: 557 IGGNGGAGTTGLAGHIGIDMNGGAGGVGGAGGNGGAGGTGGDAGHAQAGGYSDGLQGAGG 616
+ G G G A ++NGG G+G GG + G+G + + GG S GG
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGG--ASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 617 DGGHGGSGGVAGDGGRGADAAAGSGLA 643
GHG GG GG S +A
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSAVA 85



Score = 32.8 bits (74), Expect = 0.008
Identities = 24/79 (30%), Positives = 31/79 (39%)

Query: 281 GDGGAGGAAGLYGLGGAGGAGGDGGAGLSGGAGDAGGAGGHAGNGGAGGRGGWLVGNGGV 340
G G G G + G G G G + +G + + GG G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 341 GGLGGVGGTGGAGGAGGDG 359
G GG G +GG G GG+
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 32.4 bits (73), Expect = 0.010
Identities = 26/78 (33%), Positives = 30/78 (38%)

Query: 229 GAGGTGGAGGLGLGDTGGIGGIGGNAGALFGPGGAGGAGGAGGPGGANGTNGGDGGAGGA 288
G G GA GG G+G GA G G + GG G+ GG G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 289 AGLYGLGGAGGAGGDGGA 306
G GG G GG+ A
Sbjct: 66 GGNGNSGGGSGTGGNLSA 83



Score = 32.4 bits (73), Expect = 0.010
Identities = 31/115 (26%), Positives = 41/115 (35%), Gaps = 6/115 (5%)

Query: 466 GAGGRGADAIAAGKSGGAGGAGGNGGLFGHGGDGGDGGVGKAGEAGASGLLPGEAGFAGE 525
G GRG + A SG G G+ G DG G SG G +G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 526 AGGAGGNGGAGGSGGALAGDGGDGGAGGAGGIGGNGGAGTTGLAGHIGIDMNGGA 580
G G GGSG G + A + A +T AG + + ++ GA
Sbjct: 63 GNGGGNGNSGGGSG------TGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 31.6 bits (71), Expect = 0.016
Identities = 31/120 (25%), Positives = 40/120 (33%)

Query: 494 GHGGDGGDGGVGKAGEAGASGLLPGEAGFAGEAGGAGGNGGAGGSGGALAGDGGDGGAGG 553
G G + G G +GL G G + N GGSG + GG G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 554 AGGIGGNGGAGTTGLAGHIGIDMNGGAGGVGGAGGNGGAGGTGGDAGHAQAGGYSDGLQG 613
G GG+GT G + + G + G G A A A L+G
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMAALKG 125



Score = 31.6 bits (71), Expect = 0.020
Identities = 37/106 (34%), Positives = 48/106 (45%), Gaps = 3/106 (2%)

Query: 655 AGGEGGAAGGGSVAGTAGLDGIGPTSGGNGGNGGHGGSGAVGVEGGAGSAGGAGGRGGDG 714
+GG+G G+ + + ++G GPT G GG G GSG G G+G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNING-GPTGLGVGG-GASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 715 GAYGNGGVGGNGGAGGAGSPGAHGGTAGEDGFNAGDGGHGGAGGTG 760
+GNGG GN G GG+G+ G A F GAGG
Sbjct: 60 SGHGNGGGNGNSG-GGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 31.2 bits (70), Expect = 0.022
Identities = 33/97 (34%), Positives = 41/97 (42%), Gaps = 1/97 (1%)

Query: 280 GGDGGAGGAAGLYGLGGAGGAGGDGGAGLSGGAGDAGGAGGHAGNGGAGGRGGWLVGNGG 339
G + GA +G G G G G + SG + + GG +G+G G G GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 340 VGGLGGVGGTGGAGGAGGDGVVLG-SAGGTGGDGGVG 375
G GG GTGG A V G A T G GG+
Sbjct: 68 NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 31.2 bits (70), Expect = 0.024
Identities = 21/68 (30%), Positives = 24/68 (35%)

Query: 870 HNGGNGGNGGYGAAGGVGGTGGISGDGSTHAAPGDDGIMGNGGHGGHGGDGSRDNPDGGG 929
H+ NGG G GG SG S + G G GG G N + GG
Sbjct: 14 HSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGG 73

Query: 930 HPGDGGYG 937
G GG
Sbjct: 74 GSGTGGNL 81



Score = 31.2 bits (70), Expect = 0.026
Identities = 28/82 (34%), Positives = 37/82 (45%), Gaps = 2/82 (2%)

Query: 840 GSGGAGGNGGNGANGGNGGSAPNPIGFENGHNGGNGGNGGYGAAGGVGGTGGISGDGSTH 899
G G G N G + GN P +G G + G+G + GG G+G G GS H
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 900 AAPGDDGIMGNGGHGGHGGDGS 921
G +G +GG G GG+ S
Sbjct: 63 GNGGGNG--NSGGGSGTGGNLS 82



Score = 31.2 bits (70), Expect = 0.027
Identities = 24/78 (30%), Positives = 31/78 (39%)

Query: 595 TGGDAGHAQAGGYSDGLQGAGGDGGHGGSGGVAGDGGRGADAAAGSGLAGGDGGRGGDPG 654
+GGD G +S GG G G GG + G ++ G +G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 655 AGGEGGAAGGGSVAGTAG 672
G GG G +GT G
Sbjct: 62 HGNGGGNGNSGGGSGTGG 79



Score = 30.8 bits (69), Expect = 0.030
Identities = 22/79 (27%), Positives = 25/79 (31%)

Query: 236 AGGLGLGDTGGIGGIGGNAGALFGPGGAGGAGGAGGPGGANGTNGGDGGAGGAAGLYGLG 295
+GG G G G GN G GG G + G G G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 296 GAGGAGGDGGAGLSGGAGD 314
G G G SG G+
Sbjct: 62 HGNGGGNGNSGGGSGTGGN 80



Score = 30.8 bits (69), Expect = 0.030
Identities = 26/81 (32%), Positives = 31/81 (38%)

Query: 685 GNGGHGGSGAVGVEGGAGSAGGAGGRGGDGGAYGNGGVGGNGGAGGAGSPGAHGGTAGED 744
G G G + G + G G G G + G+G N GG G H G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 745 GFNAGDGGHGGAGGTGGGGGA 765
G G+G GG GTGG A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSA 83



Score = 30.8 bits (69), Expect = 0.030
Identities = 35/106 (33%), Positives = 47/106 (44%), Gaps = 6/106 (5%)

Query: 792 LSGKAGTGTDGESGGRGGHAGNGGNGGDGGAGGVSHAPGFLDGADGAGGSGGAGGNGGNG 851
+SG G G + + G+ NGG G G GG S G+ + GG G+G + G G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNI-NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 852 ANGGNGGSAPNPIGFENGHNGGNGGNGGYGAAGGVGGTGGISGDGS 897
+ GNGG N +G G GGN AA G +S G+
Sbjct: 60 SGHGNGGGNGN-----SGGGSGTGGNLSAVAAPVAFGFPALSTPGA 100



Score = 30.8 bits (69), Expect = 0.033
Identities = 31/115 (26%), Positives = 39/115 (33%), Gaps = 15/115 (13%)

Query: 762 GGGAKGGVGGAGGSGGVGGQGGTGGDGATGLSGKAGTGTDGESGGRGGHAGNGGNGGDGG 821
GG +G GA + G G TG G S +G ++ G G +G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 822 AGGVSHAPGFLDGADGAGGSGGAGGNGGNGANGGNGGSAPNPIGFENGHNGGNGG 876
G GG G G G + +AP GF G GG
Sbjct: 63 ---------------GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 30.5 bits (68), Expect = 0.037
Identities = 26/93 (27%), Positives = 29/93 (31%)

Query: 908 MGNGGHGGHGGDGSRDNPDGGGHPGDGGYGGNGGSGFNAGNGGNGGRGGSAHASLVGKAG 967
M G GH + + G P G GG G + N GGS G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 968 NGGFGGTGGNSFGGVSGNGGNGGDGGHALHGQP 1000
G GG GNS GG G G P
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFP 93


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0992  HTHTETR  55     3e-11    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 54.6 bits (131), Expect = 3e-11
Identities = 21/106 (19%), Positives = 39/106 (36%), Gaps = 3/106 (2%)

Query: 7 SPTQRSGVRDEMLHAAVALLDAHGPDALQTRKVAGAAGTSTMAVYTHFGGMPELIAEVAE 66
+ + R +L A+ L G + ++A AAG + A+Y HF +L +E+ E
Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64

Query: 67 EG---LRQFDTALAVPPSDDPVADLVATGAAYRRYAIERPHMYRLM 109
+ + + DP++ L + LM
Sbjct: 65 LSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLM 110


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_0994  PF05272  28     0.049    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.5 bits (63), Expect = 0.049
Identities = 10/22 (45%), Positives = 13/22 (59%)

Query: 32 VSVLLGPSGTGKSVFLKSLIGL 53
VL G G GKS + +L+GL
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGL 619


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
MMAR_0996  GPOSANCHOR  33     0.007    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 33.5 bits (76), Expect = 0.007
Identities = 23/153 (15%), Positives = 47/153 (30%), Gaps = 2/153 (1%)

Query: 147 ELSTLEAEMMVERKAVEDQRDA--DLEARAQKLEADLAELEAEGAKADARRKVRDSGERE 204
E + L A KA+E + A+ + LEA+ A LEA A+ + + +
Sbjct: 149 EKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTA 208

Query: 205 MRQLRDRAQRELDRLEDIWSTFTKLAPKQLIVDENLYRELQDRYGEYFTGAMGAESIQKL 264
+ E L + K + +++ E ++K
Sbjct: 209 DSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKA 268

Query: 265 IETFDIDAEAEILRDVIRNGKGQKKLRALKRLK 297
+E + A+ + + L+
Sbjct: 269 LEGAMNFSTADSAKIKTLEAEKAALEAEKADLE 301


85  MMAR_1010  MMAR_1019  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_1010  5       26       -2.140596  TetR family transcriptional regulator
MMAR_1011  6       24       -2.315013  30S ribosomal protein S12
MMAR_1012  5       20       -1.756571  30S ribosomal protein S7
MMAR_1013  4       17       -1.468090  elongation factor G
MMAR_1014  2       11       -0.736684  elongation factor Tu
MMAR_1015  -1      8        -0.399653  hypothetical protein
MMAR_1016  -1      9        0.188065   3-ketoacyl-ACP reductase
MMAR_1017  -2      11       0.141095   ferredoxin reductase
MMAR_1018  -2      13       0.019067   hypothetical protein
MMAR_1019  -2      15       0.019208   transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_1010  TETREPRESSOR  48     3e-09    Tetracycline repressor protein signature.
		>TETREPRESSOR#Tetracycline repressor protein signature.

Length = 218

Score = 48.4 bits (115), Expect = 3e-09
Identities = 20/44 (45%), Positives = 30/44 (68%)

Query: 24 AKLSRDAIVDGALTFLDREGWDSLTINALATQLGTKGPSLYNHV 67
A+L+R++++D AL L+ G D LT LA +LG + P+LY HV
Sbjct: 2 ARLNRESVIDAALELLNETGIDGLTTRKLAQKLGIEQPTLYWHV 45


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
MMAR_1013  TCRTETOQM  587    0.0      Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 587 bits (1516), Expect = 0.0
Identities = 163/676 (24%), Positives = 303/676 (44%), Gaps = 71/676 (10%)

Query: 12 KVRNIGIMAHIDAGKTTTTERILYYTGISYKIGEVHDGAATMDWMEQEQERGITITSAAT 71
K+ NIG++AH+DAGKTT TE +LY +G ++G V G D E++RGITI + T
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61

Query: 72 TCFWNDNQINIIDTPGHVDFTVEVERSLRVLDGAVAVFDGKEGVEPQSEQVWRQADKYDV 131
+ W + ++NIIDTPGH+DF EV RSL VLDGA+ + K+GV+ Q+ ++ K +
Sbjct: 62 SFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGI 121

Query: 132 PRICFVNKMDKIGADFYFSVRTMEERLGANVIPIQLPVGSEGDFEGVVDLVEMKAKVWSA 191
P I F+NK+D+ G D + ++E+L A ++ Q
Sbjct: 122 PTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQ------------------------- 156

Query: 192 DAKLGEKYDVVDIPADLQEKADEYRTKLLEAVAETDEALLEKYLGGEELTEAEIKGAIRK 251
+L V + Q + V E ++ LLEKY+ G+ L E++
Sbjct: 157 KVELYPNMCVTNFTESEQ----------WDTVIEGNDDLLEKYMSGKSLEALELEQEESI 206

Query: 252 LTITSEAYPVLCGSAFKNKGVQPMLDAVIDYLPSPLDVPAAIGHVPGKEDEEVVRKPSTD 311
+PV GSA N G+ +++ + + S
Sbjct: 207 RFHNCSLFPVYHGSAKNNIGIDNLIEVITNKFYSSTH--------------------RGQ 246

Query: 312 EPFSALAFKVATHPFFGKLTYVRVYSGKVDSGSQVINSTKGKKERLGKLFQMHSNKENPV 371
FK+ +L Y+R+YSG + V S K K ++ +++ + + +
Sbjct: 247 SELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKI-KITEMYTSINGELCKI 305

Query: 372 ETASAGHIYAVIG----LKDTTTGDTLSDPNNQIVLESMTFPDPVIEVAIEPKTKSDQEK 427
+ A +G I + L GDT P E + P P+++ +EP +E
Sbjct: 306 DKAYSGEIVILQNEFLKLNS-VLGDTKLLPQR----ERIENPLPLLQTTVEPSKPQQREM 360

Query: 428 LSLSIQKLAEEDPTFKVHLDQETGQTVIGGMGELHLDILVDRMRREFKVEANVGKPQVAY 487
L ++ ++++ DP + ++D T + ++ +G++ +++ ++ ++ VE + +P V Y
Sbjct: 361 LLDALLEISDSDPLLRYYVDSATHEIILSFLGKVQMEVTCALLQEKYHVEIEIKEPTVIY 420

Query: 488 KETIKRLVEKVEFTHKKQTGGSGQFAKVLISIEPFTGEDGATYEFESKVTGGRIPREYIP 547
E + K E+T + + +A + +S+ P G+ ++ES V+ G + + +
Sbjct: 421 MERPLK---KAEYTIHIEVPPNPFWASIGLSVSP--LPLGSGMQYESSVSLGYLNQSFQN 475

Query: 548 SVDAGAQDAMQYGVLAGYPLVNLKVTLLDGAFHEVDSSEMAFKIAGSQVLKKAAAAAHPV 607
+V G + + G L G+ + + K+ G ++ S+ F++ VL++ A
Sbjct: 476 AVMEGIRYGCEQG-LYGWNVTDCKICFKYGLYYSPVSTPADFRMLAPIVLEQVLKKAGTE 534

Query: 608 ILEPIMAVEVTTPEDYMGDVIGDLNSRRGQIQAMEERSGARVVKAHVPLSEMFGYVGDLR 667
+LEP ++ ++ P++Y+ D I + ++ ++ +P + Y DL
Sbjct: 535 LLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKNNEVILSGEIPARCIQEYRSDLT 594

Query: 668 SKTQGRANYSMVFDSY 683
T GR+ Y
Sbjct: 595 FFTNGRSVCLTELKGY 610


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
MMAR_1014  TCRTETOQM  80     2e-18    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 79.9 bits (197), Expect = 2e-18
Identities = 53/155 (34%), Positives = 82/155 (52%), Gaps = 13/155 (8%)

Query: 13 VNIGTIGHVDHGKTTLTAAITKVLH-----DKYPELNESRAFDQIDNAPEERQRGITINI 67
+NIG + HVD GKTTLT ++ L+ + +++ + DN ERQRGITI
Sbjct: 4 INIGVLAHVDAGKTTLTESL---LYNSGAITELGSVDKGTT--RTDNTLLERQRGITIQT 58

Query: 68 SHVEYQTEKRHYAHVDAPGHADYIKNMITGAAQMDGAILVVAATDGPMPQTREHVLLARQ 127
+Q E +D PGH D++ + + +DGAIL+++A DG QTR R+
Sbjct: 59 GITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRK 118

Query: 128 VGVPYILVALNKSDAVDDEELLELVEMEVRELLAA 162
+G+P I +NK D + L V +++E L+A
Sbjct: 119 MGIPTI-FFINKIDQNGID--LSTVYQDIKEKLSA 150


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_1016  DHBDHDRGNASE  111    1e-31    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 111 bits (279), Expect = 1e-31
Identities = 88/271 (32%), Positives = 131/271 (48%), Gaps = 25/271 (9%)

Query: 8 LDGRVAFVTGAARAQGRSHAVRLAREGADIIAMDICGPVSETITYAPATAADLSETIRAV 67
++G++AF+TGAA+ G + A LA +GA I A+D Y P + +++A
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVD----------YNPEKLEKVVSSLKA- 54

Query: 68 ESEGRKVLARQADVRDSAALQQLVADGVEEFGRLDVVVANAGVLGWGRLWELTDEQWDTV 127
E R A ADVRDSAA+ ++ A E G +D++V AGVL G + L+DE+W+
Sbjct: 55 --EARHAEAFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEAT 112

Query: 128 IGVNLSGTWRTLRAAVPAMIEAGNGGSIIVVSSSAGLKATPGNGHYAASKHGLVALTNTL 187
VN +G + R+ M GSI+ V S+ YA+SK V T L
Sbjct: 113 FSVNSTGVFNASRSVSKYM-MDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCL 171

Query: 188 AIELGEYDIRVNSIHPYSVETPM-----IEPDVMMQVF-GEHPRFLHSFPPMPLQYKGLM 241
+EL EY+IR N + P S ET M + + QV G F P K L
Sbjct: 172 GLELAEYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIP-----LKKLA 226

Query: 242 TSEEVSDVVVWLAGDGSGTLSGAQIPVDKGA 272
+++D V++L +G ++ + VD GA
Sbjct: 227 KPSDIADAVLFLVSGQAGHITMHNLCVDGGA 257


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_1019  HTHTETR  94     5e-26    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 93.5 bits (232), Expect = 5e-26
Identities = 34/203 (16%), Positives = 76/203 (37%), Gaps = 15/203 (7%)

Query: 4 ESRAGRRPSTTKRHIADVAIDLFAARTFAEVSVDDVAQAAGIARRTLFRYYASKNAIPWG 63
+ + T++HI DVA+ LF+ + + S+ ++A+AAG+ R ++ ++ K+ +
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 64 DFDTHLAQLQDLLERIDGHVR--LGKALREALLAFNTYDESETIRHRQRMRIILQTAELQ 121
++ + + +L LRE L+ +E R+ + I+
Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTE--ERRRLLMEIIFHKCEF 119

Query: 122 AYSMTMYAGWRAVIAGFV----------ARRLSVKPTDLVPQTVAWTMLGVALSAYEHW- 170
M + + + + P DL+ + A M G E+W
Sbjct: 120 VGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWL 179

Query: 171 LSDESVSLPEALGNAFDVVGAGL 193
+ +S L + + ++
Sbjct: 180 FAPQSFDLKKEARDYVAILLEMY 202


86  MMAR_1129  MMAR_1135  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_1129  3       13       -1.146909  PPE family protein
MMAR_5550  2       10       -0.692364  hypothetical protein
MMAR_1130  2       10       -0.034765  PPE family protein
MMAR_1131  0       12       0.486696   metal-dependent hydrolase
MMAR_1132  -1      9        2.154219   WhiB-like regulatory protein, WhiB3
MMAR_1133  0       9        2.463920   hypothetical protein
MMAR_1134  1       9        0.621795   RNA polymerase sigma factor SigD
MMAR_1135  1       11       0.630228   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_1129  cloacin  35     8e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 35.1 bits (80), Expect = 8e-04
Identities = 28/100 (28%), Positives = 38/100 (38%), Gaps = 6/100 (6%)

Query: 187 NANLGSGNTGIGNIGVGNSGEGNSALVPPQSGNYNIGGGNNGNNNLGAGNIGNFNFGFGN 246
+ N+ G TG+G G + G G S+ P G G G + G GN G G
Sbjct: 17 SGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS--GHGNGGGNGNSGGG 74

Query: 247 NGTGNFGFGNAGPADLSNPNLFTFHVTPGENNIGIGNTGN 286
+GTG A P P L TPG + + +
Sbjct: 75 SGTGGNLSAVAAPVAFGFPAL----STPGAGGLAVSISAG 110



Score = 29.3 bits (65), Expect = 0.050
Identities = 35/131 (26%), Positives = 49/131 (37%), Gaps = 27/131 (20%)

Query: 223 GGGNNGNNNLGAGNIGNFNFGFGNNGTGNFGFGNAGPADLSNPNLFTFHVTPGENNIGIG 282
G G+N + +GNI G G G + G G + ENN G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSS-----------------ENNPWGG 48

Query: 283 NTGNGNFGLGNTGDGNIGGGNTGIGNIGFGLNGNNLVGVGGAYYDTAAGQFHFDGLNT-G 341
+G+G G +G GN GG G G G N + + A F F L+T G
Sbjct: 49 GSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAV---------AAPVAFGFPALSTPG 99

Query: 342 SGNIGIGNSGS 352
+G + + S
Sbjct: 100 AGGLAVSISAG 110


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_1130  cloacin  30     0.022    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 30.5 bits (68), Expect = 0.022
Identities = 29/94 (30%), Positives = 39/94 (41%), Gaps = 5/94 (5%)

Query: 386 GNSGIGNFGVGNAGAGNFGAGNSGLLNTGVGNAGSIDTGAFNGNNLNTGFLNSGKTNTGF 445
G G G+ ++ +GN G +GL GVG S +G + NN G SG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGL---GVGGGASDGSGWSSENNPWGGGSGSGIHW--G 57

Query: 446 GNSGHENTGFWNSGDVNTGVGATTDSGLATSGFG 479
G SGH N G + +G G + A FG
Sbjct: 58 GGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
MMAR_1131  UREASE  35     8e-04    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 35.1 bits (81), Expect = 8e-04
Identities = 24/76 (31%), Positives = 35/76 (46%), Gaps = 9/76 (11%)

Query: 6 DAIYTNGDIVTVDDEQPIAEA-VAVKDGRIVAVGAHD-----DVVREHLGPHTRRVDLAG 59
D + TN I+ D I +A + +KDGRI A+G V +GP T + G
Sbjct: 69 DTVITNALIL---DHWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEG 125

Query: 60 NTLLPGFIDPHSHYIN 75
+ G +D H H+I
Sbjct: 126 KIVTAGGMDSHIHFIC 141



Score = 31.2 bits (71), Expect = 0.012
Identities = 13/30 (43%), Positives = 17/30 (56%)

Query: 487 ITINAAYQYSEEQSKGSITVGKLADLVIVD 516
TIN A + GS+ VGK ADLV+ +
Sbjct: 409 YTINPAIAHGLSHEIGSLEVGKRADLVLWN 438


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_1135  PF03544  30     0.013    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 29.6 bits (66), Expect = 0.013
Identities = 17/97 (17%), Positives = 23/97 (23%), Gaps = 2/97 (2%)

Query: 212 VLAPQVPPGNSLTPLVPETVSPVPPNESG--APESAPAPSASPTTTGAKPSASLPPAGAT 269
A Q PP + P P PP E+ + P P P
Sbjct: 63 PQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVE 122

Query: 270 ATSPAPTSVPTPPVSAVVPGETPADTSVVAPGSPAAA 306
+ +P P V + S A
Sbjct: 123 SRPASPFENTAPARPTSSTATAATSKPVTSVASGPRA 159


87  MMAR_1263  MMAR_1268  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_1263  0       9        -0.342004  hypothetical protein
MMAR_1264  -2      10       -0.680721  two-component system membrane associated sensor
MMAR_1265  -2      12       -0.529631  two-component transcriptional regulator
MMAR_1266  -2      11       0.269884   transmembrane carbonic anhydrase, SulP_2
MMAR_1267  -3      13       1.257725   acyl-CoA transferase
MMAR_1268  -2      12       1.552135   putative regulatory protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
MMAR_1263  HTHFIS  74     1e-16    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 74.5 bits (183), Expect = 1e-16
Identities = 38/146 (26%), Positives = 65/146 (44%), Gaps = 4/146 (2%)

Query: 27 SLLLVEDDRADAMLVEELIADAAVDIQVVWARSMAHAERELSAARPDCVLLDLNLPDASG 86
++L+ +DD A ++ + ++ A D V + A R ++A D V+ D+ +PD +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYD--VRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 87 IDALDRIANRDATVPVVVLTGLNDEYFGATAVAAGAQDYLVKGRVDPEM--LRRAMLYAI 144
D L RI +PV+V++ N A GA DYL K E+ + L
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 145 ERKRAELIAADLHATQLRARENALLE 170
+R+ ++L L R A+ E
Sbjct: 123 KRRPSKLEDDSQDGMPLVGRSAAMQE 148


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_1264  BCTERIALGSPH  28     0.049    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 28.4 bits (63), Expect = 0.049
Identities = 19/61 (31%), Positives = 27/61 (44%), Gaps = 8/61 (13%)

Query: 26 VLSIMGVMVLAGTVAGAVLLNRTDDVSRELSDNIEPARVAAFQLQ-----SALRDQESGI 80
+L +M +++L G AG VLL SR+ S AR A QL+ Q G+
Sbjct: 8 LLEMMLILLLMGVSAGMVLL--AFPASRDDSAAQTLARFEA-QLRFVQQRGLQTGQFFGV 64

Query: 81 R 81

Sbjct: 65 S 65


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
MMAR_1265  HTHFIS  60     1e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 60.2 bits (146), Expect = 1e-13
Identities = 23/123 (18%), Positives = 52/123 (42%), Gaps = 11/123 (8%)

Query: 10 ILLIEDDPGDELITREAFEHNKVNNRLHVAHDGEEGLDYLYQRGKYQQARRPDLILLDLN 69
IL+ +DD + +A + + + + ++ A DL++ D+
Sbjct: 6 ILVADDDAAIRTVLNQALS--RAGYDVRITSNAATLWRWI-------AAGDGDLVVTDVV 56

Query: 70 LPKYDGRQLLEKIKSDSELCRIPVVVLTTSSAEEDILRSYNLHANAYVTKPVDLDQFMTA 129
+P + LL +IK +PV+V++ + +++ A Y+ KP DL + +
Sbjct: 57 MPDENAFDLLPRIKKARP--DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGI 114

Query: 130 VRQ 132
+ +
Sbjct: 115 IGR 117


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_1268  HTHTETR  47     2e-08    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 46.5 bits (110), Expect = 2e-08
Identities = 21/107 (19%), Positives = 38/107 (35%), Gaps = 3/107 (2%)

Query: 20 GAASRAQTRHLLLTAAAEEFARVGYVASTVSRIAEGAGVTVQTLYLAWGSKRALLRGYLE 79
+TR +L A F++ G ++++ IA+ AGVT +Y + K L E
Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64

Query: 80 ---STLAPDAAPSGQHFAAQLQPDSPAGTLAQVSALVCDAARRSAIA 123
S + F + + + V + RR +
Sbjct: 65 LSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLME 111


88  MMAR_1329  MMAR_1337  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_1329  1       10       0.755668   hypothetical protein
MMAR_1330  -1      12       0.405873   hypothetical protein
MMAR_1331  -1      13       0.294950   hypothetical protein
MMAR_1332  0       12       -1.386963  hypothetical protein
MMAR_1333  1       12       -1.745870  short chain dehydrogenase
MMAR_1334  0       12       -1.054416  RNA polymerase sigma factor RpoE
MMAR_1335  0       13       -0.007581  anti-sigma factor
MMAR_1336  -1      13       0.019927   putative acetyl-CoA carboxylase biotin carboxyl
MMAR_1337  -1      13       0.633279   sensor kinase from two-component regulatory
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_1329  PF03544  30     0.020    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 29.9 bits (67), Expect = 0.020
Identities = 20/109 (18%), Positives = 31/109 (28%), Gaps = 3/109 (2%)

Query: 289 PVQPTKPPKANEVKIDPPAQAKPPEQIVVPPGPDPVPAP---ADDWPVDEALPNPTDMPV 345
P QP ++PP +PP + VV P P+P P P + V E
Sbjct: 46 PAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKP 105

Query: 346 VPFAGSPQLPGNTLADSFAGRGGGTGLSAGAPKLKPASFGGAGAASMRP 394
P Q + + P A+ + +
Sbjct: 106 KPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVA 154


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_1331  TONBPROTEIN  38     1e-04    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 37.7 bits (87), Expect = 1e-04
Identities = 27/123 (21%), Positives = 34/123 (27%), Gaps = 6/123 (4%)

Query: 184 PAAPSQRRDVQTGQPIEQEPAPPAPAQPVPVTPVIVPEAEQSTPGAAEPGWIPPAPAAMS 243
PA + VQ EP P P P V + +P P
Sbjct: 52 PADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKP------KPKPKPKPKPVKK 105

Query: 244 PVGQPMMPSSPAHPVPGSVAGPASPAPAAADGGAQGPVLRAAAARSAPGAGGRGNPMAPA 303
QP P P S +PA + + S P A R P PA
Sbjct: 106 VQEQPKRDVKPVESRPASPFENTAPARLTSSTATAATSKPVTSVASGPRALSRNQPQYPA 165

Query: 304 PSQ 306
+Q
Sbjct: 166 RAQ 168


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_1333  DHBDHDRGNASE  70     2e-16    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 70.5 bits (172), Expect = 2e-16
Identities = 50/191 (26%), Positives = 88/191 (46%), Gaps = 10/191 (5%)

Query: 3 LNGKTMFISGASRGIGLAIAKRAAQDGANIALIAKTAEPHPKLPGTVYTAAKELEEAGGQ 62
+ GK FI+GA++GIG A+A+ A GA+IA + E K+ ++ A+ E
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE----- 60

Query: 63 ALPIVGDVRDPDSVSAAVAKTVEQFGGIDICVNNASAINLGSITEVPMKRFDLMNGIQVR 122
A P DVRD ++ A+ + G IDI VN A + G I + + ++ +
Sbjct: 61 AFPA--DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNST 118

Query: 123 GTYAVSQACIPHLKGRENPHILTL-SPPVQLDKKWLKPTAYMMAKFGMTLCALGIAEEMR 181
G + S++ ++ R + I+T+ S P + + AY +K + + E+
Sbjct: 119 GVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPR--TSMAAYASSKAAAVMFTKCLGLELA 176

Query: 182 DEGIASNTLWP 192
+ I N + P
Sbjct: 177 EYNIRCNIVSP 187


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits      Score  E-value  Comments
MMAR_1334  adhesinb  29     0.011    Adhesin B signature.
		>adhesinb#Adhesin B signature.

Length = 310

Score = 29.4 bits (66), Expect = 0.011
Identities = 18/50 (36%), Positives = 22/50 (44%), Gaps = 8/50 (16%)

Query: 169 VEALEALPDTEIKEALQALPEEFRMAV-------YYADVEGFPYKEIAEI 211
VE L AL D E KE +P E +M V Y++ P I EI
Sbjct: 178 VEKLSAL-DKEAKEKFNNIPGEKKMIVTSEGCFKYFSKAYNVPSAYIWEI 226


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_1337  PF06580  45     5e-07    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 44.9 bits (106), Expect = 5e-07
Identities = 37/225 (16%), Positives = 88/225 (39%), Gaps = 40/225 (17%)

Query: 282 EVKRRDRALISKDATIREIHHRVK-----NNLQTVAALLRLQARRTTNAEGREALIESVR 336
E+ + A ++++A + + ++ N L + AL+ + ++ S+
Sbjct: 148 EIDQWKMASMAQEAQLMALKAQINPHFMFNALNNIRALIL------EDPTKAREMLTSL- 200

Query: 337 RVSSIALVHDALSMSVDEQVNLDEVIDRILPIMNDVASVDRPIRIN--RVGD-LGV---L 390
L+ +L S QV+L + ++ VD +++ + D L +
Sbjct: 201 ----SELMRYSLRYSNARQVSLAD----------ELTVVDSYLQLASIQFEDRLQFENQI 246

Query: 391 DSDRATALI--MVITELVQNAIEHAFDPAAQGA-VTIRAERSARWLDVVVHDDGRGLPSG 447
+ + M++ LV+N I+H QG + ++ + + + V + G
Sbjct: 247 NPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALK- 305

Query: 448 FSLEKSDSLGLQIVRTLVSAEL--DGSLGMREAPGRGTDVVLRVP 490
+ ++S GLQ VR + + + + E G+ ++ +P
Sbjct: 306 -NTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVN-AMVLIP 348


89  MMAR_1441  MMAR_1449  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_1441  5       16       5.217767   hypothetical protein
MMAR_1442  2       14       4.635142   PE-PGRS family protein
MMAR_1443  -2      14       -0.194979  hypothetical protein
MMAR_1444  -1      14       0.195552   TetR family transcriptional regulator
MMAR_1445  -2      14       0.952122   transposase for insertion sequence ISMyma02
MMAR_1447  -2      13       0.511957   hypothetical protein
MMAR_1448  -2      13       0.622744   zinc cation transport ATPase
MMAR_1449  0       16       0.194528   PPE family protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_1441  60KDINNERMP  28     0.016    60kDa inner membrane protein signature.
		>60KDINNERMP#60kDa inner membrane protein signature.

Length = 548

Score = 27.6 bits (61), Expect = 0.016
Identities = 8/31 (25%), Positives = 13/31 (41%)

Query: 33 NGGAVETSSDVWELYPFFDTSDKKRLKRTCN 63
G A T + +E Y F +D + L +
Sbjct: 219 RGAAYSTPDEKYEKYKFDTIADNENLNISSK 249


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_1442  cloacin  36     0.001    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 36.2 bits (83), Expect = 0.001
Identities = 29/77 (37%), Positives = 33/77 (42%), Gaps = 1/77 (1%)

Query: 1344 LSGANSAGARGGAGGAGGAGITGGAGGAGGAGSNGDGTSNQVEGQPGGDGGSGGIGGTGT 1403
+SG + G GA G I GG G G G DG+ E P G G GI G
Sbjct: 1 MSGGDGRGHNTGAHSTSG-NINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 1404 AGAGGTGGAGGDGGAGG 1420
+G G GG G GG G
Sbjct: 60 SGHGNGGGNGNSGGGSG 76



Score = 36.2 bits (83), Expect = 0.001
Identities = 29/100 (29%), Positives = 33/100 (33%)

Query: 573 GASGGGGQAGGSGGSGGAGGAGGALAGTGGAGGEGGTGGDGGTGGNGAGGAPGAAAGAAG 632
G G G G SG G L GGA G + G G+G G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 633 GNGGNGGVGGSGGIGGNGGAAGVALAGSGHDGAQGAGGAG 672
GNGG G G G G +A A G G G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 34.7 bits (79), Expect = 0.003
Identities = 35/109 (32%), Positives = 42/109 (38%)

Query: 426 GAGGVGGTGGLISFLGGHGTGGAGGEGGSGGIAGDGGKGAAGTFGGGDGVGGAGGRGGDP 485
G G G G S G G G G G G G +GGG G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 486 GLGGAGGAGGTGSTIGAHGADGARPNSGGNGGAGGQGADALGPAFTSGA 534
G GG G G GS G + + A P + G GA L + ++GA
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 34.7 bits (79), Expect = 0.003
Identities = 34/103 (33%), Positives = 42/103 (40%), Gaps = 1/103 (0%)

Query: 1308 GGAGDGSSGGAGGRGGDGGAGITGAGGQGGAGGDGGLSGANSAGARGGAGGAGGAGITGG 1367
GG G G + GA G+ G TG G GGA G S N+ GG+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWG-GGSGSGIHWGGGSG 61

Query: 1368 AGGAGGAGSNGDGTSNQVEGQPGGDGGSGGIGGTGTAGAGGTG 1410
G GG G++G G+ + G T GAGG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 34.3 bits (78), Expect = 0.004
Identities = 35/106 (33%), Positives = 41/106 (38%), Gaps = 4/106 (3%)

Query: 286 GGAGEGSGNGGVGGEGGRGGQWFGHGGGGGAGGAGGADAADGGHGGAGGAARLWGTGGHG 345
GG G G G G G G G GGGA G + + GG G+ WG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 346 GSGGAGGVGALGGAGESGGAGGAAGDGGAGGRGGWLIGTGGAGGLG 391
G+GG G G SG G + G + T GAGGL
Sbjct: 63 GNGGGNG----NSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 34.3 bits (78), Expect = 0.004
Identities = 29/82 (35%), Positives = 33/82 (40%)

Query: 1195 GDGGAGGNGGDAIGFGSGNGGLGGGGGAGGTGANGGTGGHGGVGGFGDIGGKGGSGGTGG 1254
G G G N G G+ NGG G G GG G G G G GG+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1255 TGLTGAGGAGGAGGTGGQADSV 1276
G G +GG GTGG +V
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAV 84



Score = 33.9 bits (77), Expect = 0.005
Identities = 30/98 (30%), Positives = 34/98 (34%), Gaps = 3/98 (3%)

Query: 536 GGTGGDGGAGGLVGDGGNGGAGGRGATGGVGASATAPGASGGGGQAGGSGGSGGAGGAGG 595
G G G + G G G GA+ G G S+ GG G GG G G GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 596 ALAGTGGAGGEGGTGGDGGTGGNGAGGAPGAAAGAAGG 633
GG G G A G P + AGG
Sbjct: 68 ---NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 33.9 bits (77), Expect = 0.005
Identities = 30/96 (31%), Positives = 37/96 (38%), Gaps = 7/96 (7%)

Query: 705 GAGGRGGDPGLGGAGGAGGAGSTTGAPGADGTRPTTGGNGGEGGRGADAVGAGGSGAAGG 764
G GRG + G G G T G G + G G + GGSG+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGG-------GASDGSGWSSENNPWGGGSGSGIH 55

Query: 765 AGGDGGLVGDGGHGGDGGHGATGAAGASAVAPGASG 800
GG G GG+G GG TG ++ AP A G
Sbjct: 56 WGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 33.9 bits (77), Expect = 0.005
Identities = 24/83 (28%), Positives = 28/83 (33%)

Query: 549 GDGGNGGAGGRGATGGVGASATAPGASGGGGQAGGSGGSGGAGGAGGALAGTGGAGGEGG 608
GDG G +G + T G GG G G G G G G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 609 TGGDGGTGGNGAGGAPGAAAGAA 631
GG G G G+G +A AA
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAA 86



Score = 33.5 bits (76), Expect = 0.007
Identities = 29/92 (31%), Positives = 33/92 (35%), Gaps = 8/92 (8%)

Query: 899 GNGGRGEVGGLPGNGGDGGNGALGGGAGGNGGNGGNPGDSGTGGAGGTGSTTGMNGVSNS 958
G GRG G G+ G G G GG +G GG+GS G S
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 959 RIVVGGLWGNGGHGGTGGTGSAAGGPGGSGGA 990
GNGG G G GS GG + A
Sbjct: 63 --------GNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 33.1 bits (75), Expect = 0.009
Identities = 26/66 (39%), Positives = 32/66 (48%)

Query: 1188 TGSTPDGGDGGAGGNGGDAIGFGSGNGGLGGGGGAGGTGANGGTGGHGGVGGFGDIGGKG 1247
T +GG G G GG + G G + GGG+G GG GHG GG G+ GG
Sbjct: 16 TSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGS 75

Query: 1248 GSGGTG 1253
G+GG
Sbjct: 76 GTGGNL 81



Score = 32.8 bits (74), Expect = 0.013
Identities = 33/101 (32%), Positives = 39/101 (38%), Gaps = 2/101 (1%)

Query: 1033 AGGDGGAGGVGGTGGQGGTQAGNGGVGGAGGAG-GKGADGGNGANGD-SGNGVGSDGFAG 1090
+GGDG G G G G+G GGA G G N G SG+G+ G +G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1091 GNGGAGGSGGTGGDGGAGGLALADTGQDGAQGAGGDGGAGG 1131
G G GG G G L+ A GAGG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 32.4 bits (73), Expect = 0.015
Identities = 27/81 (33%), Positives = 32/81 (39%)

Query: 755 GAGGSGAAGGAGGDGGLVGDGGHGGDGGHGATGAAGASAVAPGASGGNGQTGGSGGAGGA 814
G G G GA G + G G G GA+ +G S+ GG+G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 815 GGAGGTLAGHGGDGGAGGNGA 835
G GG GG G G A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSA 83



Score = 32.4 bits (73), Expect = 0.015
Identities = 38/116 (32%), Positives = 47/116 (40%), Gaps = 9/116 (7%)

Query: 615 TGGNGAGGAPGAAAGAAGGNGGNGGVGGSGGIG-GNGGAAGVALAGSGHDGAQGAGGAGG 673
+GG+G G GA + + NGG G+G GG G+G ++ G G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 674 SGGMGGVAGDGGKGAAGAFAGGGGGGNDGVGGAGGRGGDPGLGGAGGAGGAGSTTG 729
G GG GG G G GGN A G P L G G A S +
Sbjct: 62 HGNGGGNGNSGG--------GSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109



Score = 32.4 bits (73), Expect = 0.015
Identities = 22/61 (36%), Positives = 24/61 (39%)

Query: 968 NGGHGGTGGTGSAAGGPGGSGGAGGTGGAGGHGGLWGNGGDGGTGGQGADGGAGISASAQ 1027
NGG G G G A+ G G S GG G G WG G G GG + G G
Sbjct: 21 NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGN 80

Query: 1028 G 1028

Sbjct: 81 L 81



Score = 32.4 bits (73), Expect = 0.015
Identities = 31/102 (30%), Positives = 38/102 (37%)

Query: 677 MGGVAGDGGKGAAGAFAGGGGGGNDGVGGAGGRGGDPGLGGAGGAGGAGSTTGAPGADGT 736
M G G G A + +G GG G+G GG G G GS +G G+
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 737 RPTTGGNGGEGGRGADAVGAGGSGAAGGAGGDGGLVGDGGHG 778
GG G G G+ G + AA A G L G G
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 32.4 bits (73), Expect = 0.016
Identities = 37/103 (35%), Positives = 45/103 (43%), Gaps = 4/103 (3%)

Query: 799 SGGNGQTGGSGGAGGAGGAGGTLAGHGGDGGAGGNGANGGIGANGAHGTLGIAAGADGST 858
SGG+G+ +G +G G G G GGA + +G N G + G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGA--SDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 859 GGNGGVGGNGGVGGNGGNGGNGG--AAGVALGSGQDGAEGAGG 899
G+G GGNG GG G GGN AA VA G GAGG
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 32.0 bits (72), Expect = 0.020
Identities = 34/106 (32%), Positives = 38/106 (35%), Gaps = 4/106 (3%)

Query: 602 GAGGEGGTGGDGGTGGNGAGGAPGAAAGAAGGNGGNGGVGGSGGIGGNGGAAGVALAGSG 661
G G G G T GN GG G G G + G G S GG +G + G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGP----TGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 662 HDGAQGAGGAGGSGGMGGVAGDGGKGAAGAFAGGGGGGNDGVGGAG 707
G GG G SGG G G+ AA G G GG
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 31.6 bits (71), Expect = 0.027
Identities = 24/76 (31%), Positives = 28/76 (36%)

Query: 1245 GKGGSGGTGGTGLTGAGGAGGAGGTGGQADSVLLGDSGGGEGGSGGFGGTGLTTGGEGGR 1304
G+G + G T GG G G GG +D GG G G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 1305 GGIGGAGDGSSGGAGG 1320
GG G +G GS G
Sbjct: 66 GGNGNSGGGSGTGGNL 81



Score = 30.8 bits (69), Expect = 0.041
Identities = 28/81 (34%), Positives = 34/81 (41%)

Query: 386 GAGGLGGVGGVGGSGGFGANAVTPGGAGGQGGAGGDGGAGGAGGVGGTGGLISFLGGHGT 445
G G G G + G T G GG G + GG+G I + GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 446 GGAGGEGGSGGIAGDGGKGAA 466
G GG G SGG +G GG +A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSA 83



Score = 30.8 bits (69), Expect = 0.043
Identities = 36/115 (31%), Positives = 46/115 (40%), Gaps = 6/115 (5%)

Query: 517 GAGGQGADALGPAFTSGATGGTGGDGGAGGLVGDGGNGGAGGRGATGGVGASATAPGASG 576
G G+G + + + GG G G GG G+G + GG S G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGG--ASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 577 GGGQAGGSGGSGGAGGAGGALAGTGGAGGEG----GTGGDGGTGGNGAGGAPGAA 627
G G GG+G SGG G GG L+ G T G GG + + GA AA
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 30.8 bits (69), Expect = 0.048
Identities = 25/84 (29%), Positives = 28/84 (33%)

Query: 1255 TGLTGAGGAGGAGGTGGQADSVLLGDSGGGEGGSGGFGGTGLTTGGEGGRGGIGGAGDGS 1314
+G G G GA T G + G GG G + G G GI G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1315 SGGAGGRGGDGGAGITGAGGQGGA 1338
G GG G GG TG A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVA 85
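
The MMAR_1442 summary row lists a score of 36 and an E-value of 0.001, which matches the strongest of the cloacin alignments listed above (36.2 bits, Expect = 0.001). Assuming the summary row reports the best-scoring alignment (an inference from this output, not a documented rule), selecting that alignment from a list of (bit score, E-value) pairs could be sketched as follows. The helper name `best_hsp` is hypothetical.

```python
def best_hsp(hsps):
    """Return the (bits, evalue) pair with the lowest E-value.

    Ties on E-value are broken by the higher bit score. Treating this
    as the value shown in the summary row is an assumption.
    """
    if not hsps:
        raise ValueError("no alignments given")
    return min(hsps, key=lambda h: (h[1], -h[0]))


# A few of the (bits, Expect) pairs reported for MMAR_1442 against cloacin.
cloacin_hsps = [(34.7, 0.003), (36.2, 0.001), (33.9, 0.005), (30.8, 0.048)]
top = best_hsp(cloacin_hsps)
```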


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_1444  HTHTETR  60     3e-13    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 59.6 bits (144), Expect = 3e-13
Identities = 21/73 (28%), Positives = 34/73 (46%), Gaps = 2/73 (2%)

Query: 2 RAASTRERLVTEAMRLFGERGYHATSVAQIEAAAGLASGSGALYHHFDSKESLLEAGIDR 61
A TR+ ++ A+RLF ++G +TS+ +I AAG+ GA+Y HF K L +
Sbjct: 8 EAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVT--RGAIYWHFKDKSDLFSEIWEL 65

Query: 62 QLDRRRAMGDLRA 74
+
Sbjct: 66 SESNIGELELEYQ 78


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_1449  IGASERPTASE  29     0.039    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 29.3 bits (65), Expect = 0.039
Identities = 23/122 (18%), Positives = 37/122 (30%), Gaps = 12/122 (9%)

Query: 83 SAAAAQAQQTATQARAAA-AAFEEAFAATVPPPLIAANRNQLVSLAAANTLGQNSPAIEA 141
+ QA + + A +E A VPPP A + A Q S +E
Sbjct: 999 TPNNIQADVPSVPSNNEEIARVDE---APVPPPAPATPSET--TETVAENSKQESKTVEK 1053

Query: 142 AQAEYAEMWAQDAAAMYSYAGASEAASMLTPFNQPSQTVDPAGPLTQAAAAAATNAQTAL 201
+ + E AQ+ A EA S + Q ++ + T
Sbjct: 1054 NEQDATETTAQNREV------AKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVE 1107

Query: 202 AQ 203
+
Sbjct: 1108 KE 1109


90  MMAR_1611  MMAR_1616  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_1611  -1      13       -0.184887  transcriptional regulatory protein
MMAR_1612  -1      11       -1.188743  multidrug-transport integral membrane protein
MMAR_1613  -2      11       -1.070375  hypothetical protein
MMAR_1614  -1      8        -1.200065  short chain dehydrogenase
MMAR_1615  -2      8        -1.556467  chaperone protein DnaK1
MMAR_1616  -1      7        -1.423508  carbon starvation protein, CstA
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_1611  HTHTETR  45     2e-08    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 45.4 bits (107), Expect = 2e-08
Identities = 21/132 (15%), Positives = 51/132 (38%), Gaps = 5/132 (3%)

Query: 1 MTREVERRPRDPAGRRQTIIEAAGRLIARHGLGDLTHRRVAAEADVPVGSTTYYFSDLGE 60
M R+ ++ ++ RQ I++ A RL ++ G+ + +A A V G+ ++F D +
Sbjct: 1 MARKTKQEAQE---TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSD 57

Query: 61 LREAALAHVATSATDWLEH-WERDLDESTDIP-ATLARLTADYLTDPDRHRTLNELYVAA 118
L ++ + + + + L + +T+ R + ++
Sbjct: 58 LFSEIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKC 117

Query: 119 SHQPELQSLAQL 130
E+ + Q
Sbjct: 118 EFVGEMAVVQQA 129


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_1614  DHBDHDRGNASE  94     9e-25    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 93.6 bits (232), Expect = 9e-25
Identities = 51/185 (27%), Positives = 75/185 (40%), Gaps = 9/185 (4%)

Query: 5 LITGCSTGLGRALAEAVIDAGHHTVATARSVGGVADLVQ------RSPERVLPLALDITE 58
ITG + G+G A+A + G H A + + +V R E D+ +
Sbjct: 12 FITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEA---FPADVRD 68

Query: 59 PDQITAAVQAAQQRFGGIDVLVNNAGYGYRAAVEEGDDAEVRDLFETHFFGTVALIKAVL 118
I ++ G ID+LVN AG + D E F + G ++V
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 119 PDMRARRSGAIVNISSIAVALTPVGSGYYAAAKAAMEGMSGALHGELAPLGISVTVVEPG 178
M RRSG+IV + S + YA++KAA + L ELA I +V PG
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 179 AFRTD 183
+ TD
Sbjct: 189 STETD 193


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_1615  SHAPEPROTEIN  67     2e-14    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 67.5 bits (165), Expect = 2e-14
Identities = 83/376 (22%), Positives = 141/376 (37%), Gaps = 68/376 (18%)

Query: 3 VGIDFGTTHTVAAVVDRGNYPVVSFDGVDAWPSAIAANAAGE------LRFGLDATA-VR 55
+ ID GT +T+ V +G V PS +A G DA +
Sbjct: 13 LSIDLGTANTLIYVKGQGI--------VLNEPSVVAIRQDRAGSPKSVAAVGHDAKQMLG 64

Query: 56 RDPGWSVLRSFKRLLNDAGPHTEVSLAGRSYRLTELLARFLEQLKDDLQHRSNAGLTPGE 115
R PG R + D G + + ++L F++Q+ SN+ + P
Sbjct: 65 RTPGNIAA---IRPMKD-GVIADFFVT------EKMLQHFIKQVH------SNSFMRPSP 108

Query: 116 PVEAAISVPANASSAQRFLTLDAFVAAGFQVVALLNEPSAASLEYAHRYRSTITAKREYV 175
V + VP A+ +R ++ AG + V L+ EP AA++ ++ +
Sbjct: 109 RV--LVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLP----VSEATGSM 162

Query: 176 VIYDLGGGTFDASLLKMTGHVNDVVRSEGIQRLGGDDFDEAILQLVAARLP-EIAELAAT 234
V+ D+GGGT + +++ + G VV S + R+GGD FDEAI+ V I E A
Sbjct: 163 VV-DIGGGTTEVAVISLNG----VVYSSSV-RIGGDRFDEAIINYVRRNYGSLIGEATAE 216

Query: 235 DVTGYDVLREECAARKEAVG---PQTRRFLMDLTGI----GGDRPPFSCDIDDVYSACAP 287
+ K +G P +++ G G R F+ + +++ A
Sbjct: 217 RI-------------KHEIGSAYPGDEVREIEVRGRNLAEGVPR-GFTLNSNEILEALQE 262

Query: 288 LVDDTIGVLSRVLRDPAPGGDGVAWSEVAGIYLAGGAGSFPLISRMLRATFGDKRVKRSP 347
+ + + L P + + G+ L GG + R+L G V +
Sbjct: 263 PLTGIVSAVMVALEQCPP--ELASDISERGMVLTGGGALLRNLDRLLMEETG-IPVVVAE 319

Query: 348 HAFAATAIGLAVFLDH 363
A G L+
Sbjct: 320 DPLTCVARGGGKALEM 335


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_1616  TCRTETA  30     0.036    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 30.2 bits (68), Expect = 0.036
Identities = 44/182 (24%), Positives = 66/182 (36%), Gaps = 20/182 (10%)

Query: 114 DYVPTDRRVVFGHHFAAIAGAGPLVGPVLATQMGYLPGTIWIVAGAVFAGCVHDYLVLWI 173
D D R +A G G + GPVL MG A A G +
Sbjct: 122 DITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLL 181

Query: 174 --STRRRGRSLGQMVRDEL------GATAGVAALIGVPVIITIIIAV-LALVVVRALSQS 224
S + R L + + L VAAL+ V I+ ++ V AL V+ +
Sbjct: 182 PESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRF 241

Query: 225 PWGVFSIAMTIPI---------ALFMGCYLRFLRPGRVAEVSVI--GIGLLLLAVVSGGW 273
W +I +++ A+ G L R + +I G G +LLA + GW
Sbjct: 242 HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGW 301

Query: 274 VA 275
+A
Sbjct: 302 MA 303


91  MMAR_1791  MMAR_1797  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias   Product
MMAR_1791  0       10       1.605429  hypothetical protein
MMAR_1792  -1      10       1.545282  signal recognition particle protein Ffh
MMAR_1793  -2      12       1.426290  hypothetical protein
MMAR_1794  -2      11       1.076090  Ser/Thr protein kinase
MMAR_1795  1       15       0.274245  D-amino acid aminohydrolase
MMAR_1796  -1      14       0.400582  TetR family transcriptional regulator
MMAR_1797  0       13       0.405873  D-alanyl-D-alanine carboxypeptidase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_1791  AUTOINDCRSYN  27     0.006    Autoinducer synthesis protein signature.
		>AUTOINDCRSYN#Autoinducer synthesis protein signature.

Length = 216

Score = 26.7 bits (59), Expect = 0.006
Identities = 6/21 (28%), Positives = 12/21 (57%)

Query: 36 SSRLGWRLEVNDGGQWAFFDD 56
RL W ++ DG ++ +D+
Sbjct: 29 KDRLNWAVQCTDGMEFDQYDN 49


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_1794  YERSSTKINASE  34     0.001    Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 34.3 bits (78), Expect = 0.001
Identities = 25/95 (26%), Positives = 44/95 (46%), Gaps = 2/95 (2%)

Query: 112 GEVLEIVAPVADALDYAHQRGLVHGDVKPADIVMTNAGEGQPRILLKGFGIAAPHGAPGD 171
G + I + D ++ + G+VH D+KP ++V A G+P ++ G + G
Sbjct: 245 GTIKFIAHRLLDVTNHLAKAGVVHNDIKPGNVVFDRA-SGEPVVIDLGLHSRSGEQPKGF 303

Query: 172 ATGFVAPEQLTG-AEADGRSDQYALAATAMILLTG 205
F APE G A +SD + + +T + + G
Sbjct: 304 TESFKAPELGVGNLGASEKSDVFLVVSTLLHCIEG 338


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
MMAR_1795  UREASE  33     0.005    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 32.8 bits (75), Expect = 0.005
Identities = 23/88 (26%), Positives = 35/88 (39%), Gaps = 15/88 (17%)

Query: 4 DVIIRDGLWFDGTGSAPQTRTLGIRDGVVATVSAGPLDETGCAQVIDAAGKWVMPGFIDV 63
D+ ++DG G A GV V G +VI GK V G +D
Sbjct: 87 DIGLKDGRIA-AIGKAGNPDMQ---PGVTIIVGPG-------TEVIAGEGKIVTAGGMDS 135

Query: 64 HTHYDAEVLLDPGLRESVRHGVTTVLLG 91
H H+ ++ E++ G+T +L G
Sbjct: 136 HIHFICPQQIE----EALMSGLTCMLGG 159


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_1796  HTHTETR  47     6e-09    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 47.3 bits (112), Expect = 6e-09
Identities = 24/93 (25%), Positives = 40/93 (43%), Gaps = 1/93 (1%)

Query: 1 MARTQQQRREETVARLLQASIDTIVEVGYARASAAIITKRAGVSVGALFRHFETMGDFMA 60
MAR +Q +ET +L ++ + G + S I K AGV+ GA++ HF+ D +
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 61 ATAYEVLRRQLDIFTKQVAEIPAD-RPALEAAL 92
++ + A+ P D L L
Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREIL 93


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_1797  BLACTAMASEA  36     1e-04    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 35.9 bits (83), Expect = 1e-04
Identities = 27/99 (27%), Positives = 40/99 (40%), Gaps = 7/99 (7%)

Query: 48 DLDTGQVLAGRDQNVTHPPASTIKVLLALVALDELD-----LNSTVVADEADTHVECNCV 102
DL +G+ L + P ST KV+L L +D L + + D V+ + V
Sbjct: 46 DLASGRTLTAWRADERFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQQDL-VDYSPV 104

Query: 103 GVK-AGHTYTARQLLDGLLLVSGNDAANTLAQMLGGQDA 140
K T +L + +S N AAN L +GG
Sbjct: 105 SEKHLADGMTVGELCAAAITMSDNSAANLLLATVGGPAG 143


92  MMAR_1876  MMAR_1883  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_1876  -3      10       -2.750844  long-chain acyl-CoA synthetase
MMAR_1877  -2      12       -3.274042  transmembrane transport protein MmpL
MMAR_1878  -1      10       -0.199455  hypothetical protein
MMAR_1879  0       10       0.591059   drug-transport integral membrane protein
MMAR_1880  0       8        1.206985   hypothetical protein
MMAR_1881  0       9        0.662467   malate:quinone oxidoreductase
MMAR_1882  -1      10       1.167468   acyltransferase ElaA
MMAR_1883  -1      7        1.079677   magnesium chelatase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_1876  ISCHRISMTASE  33     0.005    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 33.1 bits (75), Expect = 0.005
Identities = 19/86 (22%), Positives = 35/86 (40%), Gaps = 1/86 (1%)

Query: 612 DELGTAGPVVTSAGAWPAAPGDPLLASVRDQVAAVFGIRAARVDIDQPLSEQGLDSVGFV 671
D+L A V A ++R Q+A + + + L ++GLDSV +
Sbjct: 208 DQLQNAPADVQKTSANTGKKNVFTCENIRKQIAELLQETPEDITDQEDLLDRGLDSVRIM 267

Query: 672 KLASRLSQAFDRDVPPVDVFNHPTVR 697
L + + +V V++ PT+
Sbjct: 268 TLVEQWRRE-GAEVTFVELAERPTIE 292


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_1877  ACRIFLAVINRP  42     2e-05    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 41.7 bits (98), Expect = 2e-05
Identities = 39/216 (18%), Positives = 74/216 (34%), Gaps = 35/216 (16%)

Query: 204 AAVIFIMLLLVYRSPLTVVLLLLTVGAEFTVARGVVALLGQVGAIALSTFAVSLLT---- 259
++F+++ L ++ ++ + V V LLG +A ++++ LT
Sbjct: 348 IMLVFLVMYLFLQNMRATLIPTIAV---------PVVLLGTFAILAAFGYSINTLTMFGM 398

Query: 260 SLAIAAGTDYGIFIIGRYQEARQAGEDRETAFYTMYRGVAHVIAGSGLTIAGATSCLSLA 319
LAI D I ++ + ED+ + ++ + + G LS
Sbjct: 399 VLAIGLLVDDAIVVVENVERVMM--EDKLPPKEATEKSMSQI----QGALVGIAMVLSAV 452

Query: 320 RLP----------YFRTLGIPCAVGMLLAVVVALTLGPAVLAIGSRFGVFDPKRMIK--- 366
+P +R I M L+V+VAL L PA+ A + +
Sbjct: 453 FIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFF 512

Query: 367 ---IRGWRRIGTVVVRWPGPVLAATIAIALIGLLAL 399
+ G +L +T LI L +
Sbjct: 513 GWFNTTFDHSVNHYTNSVGKILGSTGRYLLIYALIV 548



Score = 38.3 bits (89), Expect = 2e-04
Identities = 30/162 (18%), Positives = 62/162 (38%), Gaps = 14/162 (8%)

Query: 768 LIAGIASLCLIFIIMLILTRALIAATVIVGTVALSLGASFGLSVLVWQHIFGIKLHWLVL 827
L I L+F++M + + + A + V + L +F + FG ++ L +
Sbjct: 344 LFEAIM---LVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAA-----FGYSINTLTM 395

Query: 828 PMSVIVLLAVGSDYNLLL--VSRFKQEIGAGLNTGIIRAIGGTGKVVTNAGLVFAIT--- 882
V+ + + D +++ V R E +++ + +V +
Sbjct: 396 FGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIP 455

Query: 883 MASMIVSDLRIIGQVGTTISLGLLFDTLIVRAFMTPSIAALL 924
MA S I Q TI + +++V +TP++ A L
Sbjct: 456 MAFFGGSTGAIYRQFSITIVSAMAL-SVLVALILTPALCATL 496



Score = 34.0 bits (78), Expect = 0.003
Identities = 48/235 (20%), Positives = 85/235 (36%), Gaps = 39/235 (16%)

Query: 140 YVQLNLAGNQGEPLGNESVEAVRSIVDGA--KPPPGIAVYVTGTAALVADMQHSGDRSLA 197
Y L QGE S ++++ K P GI TG + + SG+++ A
Sbjct: 818 YNGLPSMEIQGEAAPGTSSGDAMALMENLASKLPAGIGYDWTGMSY---QERLSGNQAPA 874

Query: 198 RITVTTAAVIFIMLLLVYRSPLTVVLLLLTVGAEFTVARGVVALLGQVGAIALSTFAVSL 257
+ ++ V+F+ L +Y S V ++L V L Q + F V L
Sbjct: 875 LVAIS-FVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVY---FMVGL 930

Query: 258 LTSLAIAAGTDYGIFIIGRYQEARQA-GEDRETAFY---------------TMYRGVAHV 301
LT++ ++A I I+ ++ + G+ A GV +
Sbjct: 931 LTTIGLSAKN--AILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPL 988

Query: 302 IAGSGLTIAGATSCLSLARLPYFRTLGIPCAVGMLLAVVVALTLGPAVLAIGSRF 356
+G AG+ + +GI GM+ A ++A+ P + R
Sbjct: 989 AISNG---AGSGA---------QNAVGIGVMGGMVSATLLAIFFVPVFFVVIRRC 1031


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_1879  TCRTETB  150    3e-42    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 150 bits (379), Expect = 3e-42
Identities = 89/412 (21%), Positives = 173/412 (41%), Gaps = 18/412 (4%)

Query: 17 HRDTLWAIAIGIFMTSVDDTVVYVANPSIMAGLNTSYHMVIWVTSGYVLAYTVPSLVAGR 76
H L + I F + +++ V+ V+ P I N WV + ++L +++ + V G+
Sbjct: 12 HNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGK 71

Query: 77 LGDRFGVKNLYLAGLAVFTVSSLWCGVSGT-IEILIAARVAQGIGAALLYTQTFTIVTRA 135
L D+ G+K L L G+ + S+ V + +LI AR QG GAA +V R
Sbjct: 72 LSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARY 131

Query: 136 FPPERRGAAATVWAAAAGFGNLVGPLLGGALVDTLGWQWIFFVNIPVGIVGLVLAVRFVP 195
P E RG A + + G VGP +GG + + W ++ + + + I+ + ++ +
Sbjct: 132 IPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPM-ITIITVPFLMKLLK 190

Query: 196 ALPTHARRFDLIGVGLSGLGLLLIVFGLQQGHAAGWSPWIWAMIAVGVGFVTVFVYWQSV 255
FD+ G+ L +G++ + + + + V V +FV
Sbjct: 191 KEVRIKGHFDIKGIILMSVGIVFFMLFTTS--------YSISFLIVSVLSFLIFVKHIR- 241

Query: 256 NTSEPLIPLRIFGDRNFALCSF--GVAVIAFVAAAMMLPGVFYMQTVRGLSP-TRTALLM 312
++P + + + F + G+ M+P + M+ V LS ++++
Sbjct: 242 KVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVP--YMMKDVHQLSTAEIGSVII 299

Query: 313 APLPIVVGLLTPFIGKILDRAHPRAVAGFGFSVVAISLIWFSIEMAPATPVWRLAVPIAF 372
P + V + G ++DR P V G + +++S + + T W + + I F
Sbjct: 300 FPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVS--FLTASFLLETTSWFMTIIIVF 357

Query: 373 LGVGMVFAWPVLTVTATRDLPAELVGASSGVYNAARALGATLGSAGMAALMT 424
+ G+ F V++ + L + GA + N L G A + L++
Sbjct: 358 VLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
MMAR_1883  HTHFIS  33     0.004    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 32.9 bits (75), Expect = 0.004
Identities = 30/145 (20%), Positives = 48/145 (33%), Gaps = 16/145 (11%)

Query: 31 VLIRGEKGTAKSTAVRGLAALLSAATGSSGPGLVEMPLGATEDRVVGSLDLQRVLR---D 87
++I GE GT K R L G V + + A ++ S +L +
Sbjct: 163 LMITGESGTGKELVARALHDYGKRRNGP----FVAINMAAIPRDLIES-ELFGHEKGAFT 217

Query: 88 GEHAFAPGLLARAHGGVLYVDEVNLLHDHLVDILLDAAAMGRVHVERDGISHSHEARFVL 147
G + G +A GG L++DE+ + LL G G + +
Sbjct: 218 GAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGE--YTTVGGRTPIRSDVRI 275

Query: 148 IGTMNP------EEGELRPQLLDRF 166
+ N +G R L R
Sbjct: 276 VAATNKDLKQSINQGLFREDLYYRL 300


93  MMAR_1965  MMAR_1971  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_1965  0       12       -0.026941  3-ketoacyl-ACP reductase
MMAR_1966  0       11       -0.513592  hypothetical protein
MMAR_1967  -1      12       -0.062190  cell division transmembrane protein FtsK
MMAR_1968  1       12       -1.039843  N-acetylglutamate synthase
MMAR_1969  1       14       1.316222   PGP synthase PgsA3
MMAR_1970  2       16       1.809925   transcriptional regulatory protein
MMAR_1971  2       14       1.645456   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_1965  DHBDHDRGNASE  89     3e-23    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 89.3 bits (221), Expect = 3e-23
Identities = 84/269 (31%), Positives = 124/269 (46%), Gaps = 23/269 (8%)

Query: 9 LSGRVAFITGAARGQGRAHALRLARDGADVIAVDLCDQIASVPYPLGTAEELATTVKLVE 68
+ G++AFITGAA+G G A A LA GA + AVD E+L V ++
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDY------------NPEKLEKVVSSLK 53

Query: 69 DTGARIVASQADVRDREALAAALQAGIDELGQVDIVVANAGIAPM----QSGDDGWRDVI 124
A ADVRD A+ E+G +DI+V AG+ D+ W
Sbjct: 54 AEARHAEAFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATF 113

Query: 125 DVNLSGAYYTVEVAIPTMIEQGRGGSIVLISSAAGLVGISSADAGAIGYAASKHALVGLM 184
VN +G + M+++ R GSIV + S V +S A YA+SK A V
Sbjct: 114 SVNSTGVFNASRSVSKYMMDR-RSGSIVTVGSNPAGVPRTSMAA----YASSKAAAVMFT 168

Query: 185 RVYANLLAPHSIRVNSLHPSGVDTPMINNEFIRRWLADLVAETGSGPGAGNALPV-QILQ 243
+ LA ++IR N + P +T M + + A+ V + GS +P+ ++ +
Sbjct: 169 KCLGLELAEYNIRCNIVSPGSTETDMQWSLWADENGAEQVIK-GSLETFKTGIPLKKLAK 227

Query: 244 ADDIAGALAWLVSDEARYITGVALPVDAG 272
DIA A+ +LVS +A +IT L VD G
Sbjct: 228 PSDIADAVLFLVSGQAGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_1967  PREPILNPTASE  33     0.005    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 32.9 bits (75), Expect = 0.005
Identities = 29/100 (29%), Positives = 42/100 (42%), Gaps = 8/100 (8%)

Query: 152 PIVLAGTAI----VLMRTEPNPDTRPRLILGSSLIALSFLGLRHLWAGSPESPELRQRAA 207
P+V TA+ V M P T L+L L+AL+F+ L + P+ L
Sbjct: 111 PLVELLTALLSVAVAMTLAPGWGTLAALLLTWVLVALTFIDLDKMLL--PDQLTLPLLWG 168

Query: 208 GFIGFTIGGPLSDGLTVWIAAP--LLFIGALFGLLLLTGT 245
G + +GG +S G V A L+ + LLTG
Sbjct: 169 GLLFNLLGGFVSLGDAVIGAMAGYLVLWSLYWAFKLLTGK 208


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_1968  SACTRNSFRASE  39     3e-06    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 38.8 bits (90), Expect = 3e-06
Identities = 17/76 (22%), Positives = 32/76 (42%), Gaps = 5/76 (6%)

Query: 58 IQGNVIGCGALHVLWSDLGEVRTVAVDPAMTGHGIGHAIVDRLLEVARELQLERLFVLTF 117
++ N IG + W+ + +AV G+G A++ + +E A+E L + T
Sbjct: 72 LENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQ 131

Query: 118 ET-----EFFTAHGFT 128
+ F+ H F
Sbjct: 132 DINISACHFYAKHHFI 147


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_1971  IGASERPTASE  32     0.002    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 32.3 bits (73), Expect = 0.002
Identities = 36/230 (15%), Positives = 81/230 (35%), Gaps = 11/230 (4%)

Query: 24 ADPKVQIQQAIEEAQRTHQALTQQA--AQVIGNQRQLEMRLNRQLADIEKLQVNVRQALT 81
+ Q ++ +EA+ +A TQ AQ ++ + ++ A +EK + + +
Sbjct: 1059 TETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEE---KAKVE 1115

Query: 82 LADQATAAGDAAKAVEYNNAAEAFAAQLVTAEQSVEDLKALHDQAL-NAAAQAKRAVEQN 140
++ +E Q A ++ + Q+ N A ++ ++
Sbjct: 1116 TEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKET 1175

Query: 141 SMVLQQKIAERTKLLSQLEQAKMQEQVSSSLRSMSELAAPGNVPSLDEVRDKIERRYATA 200
S ++Q + E T + + + E + + + + N P + R + R +
Sbjct: 1176 SSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKP---KNRHRRSVR--SV 1230

Query: 201 LGQAELAQSSVQGRMLEVQQAGVQMAGHSRLEQIRASMRGESLPAGGAAT 250
E A +S R ++ L RA + +L G A +
Sbjct: 1231 PHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVALNVGKAVS 1280


94  MMAR_1975  MMAR_1982  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_1975  -1      11       -0.216250  hypothetical protein
MMAR_1976  0       11       0.539777   hypothetical protein
MMAR_1977  0       10       0.764219   recombinase A
MMAR_1978  -1      10       1.651598   recombination regulator RecX
MMAR_1979  -1      11       1.640058   (dimethylallyl)adenosine tRNA
MMAR_1980  0       12       1.836722   hypothetical protein
MMAR_1981  -2      11       2.328886   hypothetical protein
MMAR_1982  -1      10       2.252885   Ser/Thr protein kinase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_1975  PF06580  24     0.039    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 24.4 bits (53), Expect = 0.039
Identities = 5/23 (21%), Positives = 9/23 (39%)

Query: 4 GVRLTEFHERITLRFGAAYGASV 26
G L ER+ + +G +
Sbjct: 312 GTGLQNVRERLQMLYGTEAQIKL 334


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_1976  2FE2SRDCTASE  39     9e-06    Ferric iron reductase signature.
		>2FE2SRDCTASE#Ferric iron reductase signature.

Length = 262

Score = 38.9 bits (90), Expect = 9e-06
Identities = 12/21 (57%), Positives = 14/21 (66%)

Query: 221 RNSCCLYYRLPGAGKCGDCPL 241
R +CC YRLP +CGDC L
Sbjct: 241 RRTCCQRYRLPDVQQCGDCTL 261


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_1981  AUTOINDCRSYN  30     0.016    Autoinducer synthesis protein signature.
		>AUTOINDCRSYN#Autoinducer synthesis protein signature.

Length = 216

Score = 29.8 bits (67), Expect = 0.016
Identities = 15/88 (17%), Positives = 30/88 (34%), Gaps = 11/88 (12%)

Query: 48 SDPHRFGRVDDDGTVWLITTAGERIVGSWQAGDA------EAAFAHFGRRFDDLNTEITL 101
+D F + D++ T +L ++ S + + F + F ++N
Sbjct: 39 TDGMEFDQYDNNNTTYLFGIKDNTVICSLRFIETKYPNMITGTFFPY---FKEINIPEGN 95

Query: 102 MEE--RLAAGTGDARKIRANAAALAETL 127
E R A+ I N ++ L
Sbjct: 96 YLESSRFFVDKSRAKDILGNEYPISSML 123


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
MMAR_1982  PERTACTIN  37     2e-04    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 37.0 bits (85), Expect = 2e-04
Identities = 32/89 (35%), Positives = 37/89 (41%), Gaps = 13/89 (14%)

Query: 352 LFDASTSSWGDLTAAPPPQAPPAPPTPPTPPVPPTPPQPPKQTSDTSPSGPGPHSLAARL 411
L A P PQ P PP PP PP PP PPQPP++ P P P A R
Sbjct: 562 LVGAKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQ----PEAPAPQPPAGR- 616

Query: 412 RNPWMLLGAAALVALI---VFAAQGIWLS 437
L AAA A+ V A +W +
Sbjct: 617 -----ELSAAANAAVNTGGVGLASTLWYA 640


95  MMAR_2097  MMAR_2103  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_2097  9       18       7.463829   PE-PGRS family protein
MMAR_2098  7       16       5.406568   hypothetical protein
MMAR_2099  7       14       5.467316   transcriptional regulator
MMAR_2100  7       15       5.866894   PE-PGRS family protein
MMAR_2101  6       13       3.676258   hypothetical protein
MMAR_2102  5       14       4.032909   PE-PGRS family protein
MMAR_2103  -1      11       -0.324396  PE-PGRS family protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2097  cloacin  39     7e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.5 bits (89), Expect = 7e-05
Identities = 34/114 (29%), Positives = 46/114 (40%), Gaps = 8/114 (7%)

Query: 245 GAGGHGGTGGTSASGTGATGGSGGAGGLLFSPGGAGGDGGAGFSGADGGAGGNGGAGGLL 304
G G G G ++ GG G G G G G+G+S + GG G+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGV------GGGASDGSGWSSENNPWGGGSGSGIHW 56

Query: 305 FGTGGGGGEGGATSPSSSTGSGGDGGIGGTSGLFG--TGGTGGAGGAAANATGG 356
G G G GG + +G+GG+ FG T GAGG A + + G
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 35.5 bits (81), Expect = 7e-04
Identities = 35/112 (31%), Positives = 42/112 (37%), Gaps = 6/112 (5%)

Query: 339 GTGGTGGAGGAAANATGGNGGAGGGGLWFGNGGAGGIGGFDAHGNGGDGGAGGNAGIYGG 398
G G G GA + + NGG G G GGA G+ + N GG+G GG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLG---VGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 399 NGGAGGTGGVGVGGNLFTGGQGGAGGNA---GLLAGNGGAGGNGGVRFSGNA 447
+G G G GG TGG A G A + G V S A
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 35.1 bits (80), Expect = 8e-04
Identities = 28/85 (32%), Positives = 32/85 (37%), Gaps = 3/85 (3%)

Query: 461 MFGNGGAGGAGGDRVAGSQGNGGDGGDGGHGGTYFGSGGAGGH---GGYDDSGSGGQGGH 517
M G G G G NGG G G GG GSG + + GG SG GG
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 518 GGDGGAAGTIGNGGDGGTGGDALVS 542
G G GG G G + V+
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVA 85



Score = 34.7 bits (79), Expect = 0.001
Identities = 27/90 (30%), Positives = 35/90 (38%)

Query: 279 AGGDGGAGFSGADGGAGGNGGAGGLLFGTGGGGGEGGATSPSSSTGSGGDGGIGGTSGLF 338
+GGDG +GA +G G L GG G +S ++ G G GI G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 339 GTGGTGGAGGAAANATGGNGGAGGGGLWFG 368
G G + TGGN A + FG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 33.9 bits (77), Expect = 0.002
Identities = 28/85 (32%), Positives = 36/85 (42%), Gaps = 4/85 (4%)

Query: 309 GGGGEGGATSPSSSTGSGGDGGIGGTSGLFGTGGTGGAGGAAANATGGNGGAGGGGLWFG 368
GG G G T S++G+ GG +GL GG G ++ GG+G G W G
Sbjct: 3 GGDGRGHNTGAHSTSGNIN----GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 369 NGGAGGIGGFDAHGNGGDGGAGGNA 393
G G GG G G G +A
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSA 83



Score = 32.8 bits (74), Expect = 0.004
Identities = 30/82 (36%), Positives = 35/82 (42%), Gaps = 2/82 (2%)

Query: 397 GGNGGAGGTGGVGVGGNLFTGGQGGAGGNAGLLAGNG--GAGGNGGVRFSGNAGAGGAGG 454
G N GA T G GG G GGA +G + N G G G+ + G +G G GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 455 TGGDAGMFGNGGAGGAGGDRVA 476
G G G GG A VA
Sbjct: 68 NGNSGGGSGTGGNLSAVAAPVA 89



Score = 31.6 bits (71), Expect = 0.009
Identities = 28/81 (34%), Positives = 34/81 (41%), Gaps = 8/81 (9%)

Query: 509 SGSGGQGGHGGDGGAAGTIGNGGDGGTGGDALVSGGTGGD-------GGDGGDAREIGNG 561
SG G+G + G +G I NGG G G S G+G GG G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNI-NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 562 GNGGNAGAGATAGNEGTGGTG 582
G+G G G + G GTGG
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNL 81



Score = 31.2 bits (70), Expect = 0.013
Identities = 31/85 (36%), Positives = 38/85 (44%), Gaps = 3/85 (3%)

Query: 217 IGGAGGEGGNSATTAGVGGA-GGAGGLLVGAGGHGGTGGTSASGTGATGGSGGAGGLLFS 275
+ G G G N+ + G GG GL VG G G+G +S + GG G+G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSEN--NPWGGGSGSGIHWGG 58

Query: 276 PGGAGGDGGAGFSGADGGAGGNGGA 300
G G GG G SG G GGN A
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSA 83



Score = 30.8 bits (69), Expect = 0.016
Identities = 23/86 (26%), Positives = 31/86 (36%)

Query: 445 GNAGAGGAGGTGGDAGMFGNGGAGGAGGDRVAGSQGNGGDGGDGGHGGTYFGSGGAGGHG 504
G GA T G+ G G G +G G G G ++G G G+G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 505 GYDDSGSGGQGGHGGDGGAAGTIGNG 530
G + + GG G G A + G
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 30.5 bits (68), Expect = 0.020
Identities = 33/107 (30%), Positives = 38/107 (35%), Gaps = 4/107 (3%)

Query: 320 SSSTGSGGDGGIGGTSGLFGTGGTGGAGGAAANATGGNGGAGGGGLWFGNGGAGGIGGFD 379
S G G + G TSG G TG G A+ G +G G GI
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNP---WGGGSGSGIHWGG 58

Query: 380 AHGNGGDGGAGGNAGIYGGNGGAGGTGGVGVGGNLFTGGQGGAGGNA 426
G+G GG GN+G G GG V GAGG A
Sbjct: 59 GSGHGNGGG-NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 30.1 bits (67), Expect = 0.029
Identities = 30/85 (35%), Positives = 36/85 (42%), Gaps = 5/85 (5%)

Query: 125 GADGTAPGQAGGDGGLLYGNGGAGGPGGAGGNAGLIGNGGAGGSGAALGLFGGTGGNGGL 184
GA T+ GG GL G G + G G + N N GGSG+ + GG+G G
Sbjct: 12 GAHSTSGNINGGPTGLGVGGGASDGSGWSSEN-----NPWGGGSGSGIHWGGGSGHGNGG 66

Query: 185 LFGNGGTGGAAGDLASGVGLPGGAG 209
GN G G G S V P G
Sbjct: 67 GNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 29.7 bits (66), Expect = 0.039
Identities = 30/106 (28%), Positives = 37/106 (34%), Gaps = 6/106 (5%)

Query: 397 GGNGGAGGTGGVGVGGNLFTGGQGGAGGNAGLLAGNGGAGGNGGVRFSGNAGAGGAGGTG 456
GG+G TG GN+ GG G G G G+G + N +G GG+G
Sbjct: 3 GGDGRGHNTGAHSTSGNI-NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 457 GDAGMFGNGGAGGAGGDRVAGSQGNGGDGGDGGHGGTYFGSGGAGG 502
GNGG G G G + GAGG
Sbjct: 62 H-----GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 29.3 bits (65), Expect = 0.048
Identities = 23/61 (37%), Positives = 27/61 (44%)

Query: 209 GGHAGLFGIGGAGGEGGNSATTAGVGGAGGAGGLLVGAGGHGGTGGTSASGTGATGGSGG 268
GG GL GGA G S+ GG G+G G GHG GG SG G+ G
Sbjct: 22 GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNL 81

Query: 269 A 269
+
Sbjct: 82 S 82


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2100  cloacin  37     8e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 36.6 bits (84), Expect = 8e-04
Identities = 31/101 (30%), Positives = 38/101 (37%)

Query: 516 SGGDGGAGGAGGAGGDGGLVAGNGGVGGAGGIGGVGGTGGDGSVGVDAAGAGQDGGVGGA 575
SGGDG G G + G G+G GG G + + +G+G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 576 GGAGGAGGAGGAGGEGGAGGHALAAGYADGSQGAGGAGGAG 616
G GG G G G G A+AA A G G G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 35.1 bits (80), Expect = 0.002
Identities = 34/88 (38%), Positives = 39/88 (44%), Gaps = 7/88 (7%)

Query: 144 GSGGVGQAGGAGGAAGLIGSGGAGGAGGAGGTGGAGGAGGWLYGNGGAGGVGGAGAVGGA 203
G G G GA +G I GG G G GGA GW N GG G+G G
Sbjct: 3 GGDGRGHNTGAHSTSGNIN----GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 204 GGNTWLWGNGGAGGSGGVGSGSGGAGGS 231
G GNGG G+ G GSG+GG +
Sbjct: 59 GSGH---GNGGGNGNSGGGSGTGGNLSA 83



Score = 33.9 bits (77), Expect = 0.006
Identities = 32/120 (26%), Positives = 46/120 (38%)

Query: 425 GAGGAGGTGGSGAGGSRAATGATGSTPSSGGNGGAGGAGADSITTGGAGAAGGTGGDGGL 484
G G G G+ + G TG G + G+G + ++ GG+G+ GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 485 VGDGGAGGDGGAGLGGAPGTSVIFPGGQPGSSGGDGGAGGAGGAGGDGGLVAGNGGVGGA 544
GG G GG G ++V P + GAGG + G L A + A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMAA 122



Score = 33.5 bits (76), Expect = 0.006
Identities = 26/87 (29%), Positives = 31/87 (35%)

Query: 551 GGTGGDGSVGVDAAGAGQDGGVGGAGGAGGAGGAGGAGGEGGAGGHALAAGYADGSQGAG 610
GG G + G + +GG G G GGA G E G +G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 611 GAGGAGGIGGDGAAGGKGAEGAAAAGA 637
G GG G G G+ G AA A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVA 89



Score = 32.8 bits (74), Expect = 0.010
Identities = 23/86 (26%), Positives = 31/86 (36%)

Query: 1026 GDGGAGGAGGAGGDGGAVAGDGGRGGAGGDGAMGGNGGNGFDGLHGTTPGANGQYGGDGG 1085
G G GA G+ G GG DG+ + N + G G+ G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 1086 EGGRGGVGGAGGAGGAAAAGQAGSQG 1111
G GG+G G +A + G
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 32.8 bits (74), Expect = 0.012
Identities = 37/115 (32%), Positives = 45/115 (39%), Gaps = 4/115 (3%)

Query: 1250 GHGGDGGDAGDSGSSAFGVGSPGGGGGQGGFGVAGGGDGGDGGNGGAGGFGQNGGPGGRG 1309
G G G + G +S G P G G GG G + GG G G + G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1310 GGGGHSLVGPGGDGGLGGTGGNGGNGSQPPFGSTPAGSGGDGGNGGAGGSSGFVS 1364
G GG + G GG GTGGN + P PA S G S+G +S
Sbjct: 63 GNGGGN----GNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALS 113



Score = 32.0 bits (72), Expect = 0.018
Identities = 27/81 (33%), Positives = 34/81 (41%), Gaps = 2/81 (2%)

Query: 1094 GAGGAGGAAAAGQAGSQGDGGNGGDGGDGGTPGNGGSGADGANSAIGVSAGDGGYGGAGG 1153
G G G A +GG G G GG + GSG N+ G +G G + G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGG--ASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 1154 NAGAGGLGGEGGAGSTSGASG 1174
G GG G G GS +G +
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNL 81



Score = 32.0 bits (72), Expect = 0.019
Identities = 25/79 (31%), Positives = 31/79 (39%)

Query: 413 GTGGNGGNGGDPGAGGAGGTGGSGAGGSRAATGATGSTPSSGGNGGAGGAGADSITTGGA 472
G G G N G G G +G G A+ +G + + GG G+G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 473 GAAGGTGGDGGLVGDGGAG 491
G GG G GG G GG
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 31.6 bits (71), Expect = 0.023
Identities = 29/69 (42%), Positives = 32/69 (46%), Gaps = 1/69 (1%)

Query: 216 GGSGGVGSGSGGAGGSGGWLYGNGGAGGTGGVADGVGEGGGHGGAGGNARLLGTGGAGGD 275
GG G+G G G + GS GW N GG G G G GHG GGN G G GG+
Sbjct: 22 GGPTGLGVGGGASDGS-GWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGN 80

Query: 276 GGAGLAGAA 284
A A A
Sbjct: 81 LSAVAAPVA 89



Score = 31.6 bits (71), Expect = 0.024
Identities = 31/117 (26%), Positives = 44/117 (37%), Gaps = 1/117 (0%)

Query: 872 GAGGAGGAGGLAQATGYLDGSHGSGGSGGAGGQAGNAGDGGDGADATVAGGKGGAGGNGG 931
G G G G +G ++G G GG G G+ + +G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGG-GASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 932 DAGVGGSGGLGGDSGNGTHAANGASAGAYGTGGNGGAGGDGADATAAGQAGGAGGAG 988
GG+G GG SG G + + A+ A+G G G + + A A A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 31.6 bits (71), Expect = 0.025
Identities = 27/89 (30%), Positives = 31/89 (34%)

Query: 1147 GYGGAGGNAGAGGLGGEGGAGSTSGASGLDGSQAAGGDGGNGGFGGTGGLFEAAGSGGAG 1206
G G G N GA G G T G S +G N +GG G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1207 GVGGDGSQGGDGGDGGAGGSSPGATGGWG 1235
G GG G G G S+ A +G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 31.6 bits (71), Expect = 0.027
Identities = 35/112 (31%), Positives = 41/112 (36%), Gaps = 4/112 (3%)

Query: 1284 GGGDGGDGGNGGAGGFGQNGGPGGRGGGGGHSLVGPGGDGGLGGTGGNGGNGSQPPFGST 1343
G G G + G G NGGP G G GGG S G GG+GS +G
Sbjct: 4 GDGRGHNTGAHSTSG-NINGGPTGLGVGGGAS---DGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 1344 PAGSGGDGGNGGAGGSSGFVSGQSGTDGQDGGDPSGQFGGTGGAGGSGGAGA 1395
G G GGS + + G P+ G GG S AGA
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 31.2 bits (70), Expect = 0.036
Identities = 30/106 (28%), Positives = 36/106 (33%), Gaps = 2/106 (1%)

Query: 1183 GDGGNGGFGGTGGLFEAAGSGGAGGVGGDGSQGGDGGDGGAGGSSPGATGGWGGQGGLGG 1242
G G N G T G G G G GG G + G G+ WGG G G
Sbjct: 6 GRGHNTGAHSTSGNING-GPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGN 64

Query: 1243 VGTAGVGGHGGDGGDAGDSGSSAFGVGSPG-GGGGQGGFGVAGGGD 1287
G G G G G + ++ G P G GG V+
Sbjct: 65 GGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 30.8 bits (69), Expect = 0.049
Identities = 31/101 (30%), Positives = 35/101 (34%), Gaps = 1/101 (0%)

Query: 1088 GRGGVGGAGGAGGAAAAGQAGSQGDGGNGGDGGDGGTPGNGGSGADGANSAIGVSAGDGG 1147
G G G GA + G G G GG G G+ S I G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIH-WGGGSG 61

Query: 1148 YGGAGGNAGAGGLGGEGGAGSTSGASGLDGSQAAGGDGGNG 1188
+G GGN +GG G GG S A G A G G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
MMAR_2101  PERTACTIN  27     0.036    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 27.4 bits (60), Expect = 0.036
Identities = 16/49 (32%), Positives = 22/49 (44%)

Query: 56 HIATLIGYTRGDGGFQWENAMGDLAIGVVGIMAYWFRGHFWLATIVVLS 104
H+ L GYTRGD GF + ++ V G Y F+L + S
Sbjct: 703 HLGGLAGYTRGDRGFTGDGGGHTDSVHVGGYATYIANSGFYLDATLRAS 751


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2102  cloacin  37     5e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 37.0 bits (85), Expect = 5e-04
Identities = 29/91 (31%), Positives = 37/91 (40%)

Query: 332 VSGAAGSGGHGGTGGAAGLWGVGGHGGDGAHGGAGASGGAGDAGSGGGDAGDGGAGGRGG 391
+SG G G + G +G G G G + SG + + GG +G G G G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 392 WLVGGGGAGGSAGSGGGGGAGGSGANAVTLG 422
GGG G S G G GG + A V G
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 37.0 bits (85), Expect = 5e-04
Identities = 39/109 (35%), Positives = 45/109 (41%), Gaps = 6/109 (5%)

Query: 999 GNGGKGGNGGAGG-----NGAAGSNASGAGATGGTGLMGGTGGSGGAGGEGGALAGNGGQ 1053
G G+G N GA NG G GA+ G+G GG G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1054 GGSGGSGGIGGTGGQGGGGSAGGAGVA-GVQDGAGGAGGGGGLGGSGGA 1101
G GG+G GG G GG SA A VA G + GG + S GA
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 36.2 bits (83), Expect = 0.001
Identities = 29/86 (33%), Positives = 36/86 (41%)

Query: 1169 GGGGTGGAGGAGGTSGDGVTVAGAGPTGGTGAGGAGGNGGAGGNADGGGIVNPEYGDGGA 1228
GG G G GA TSG+ GG + G+G + G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1229 GGNGGNGSSGGDGGSGGTGGRSGAGI 1254
G GGNG+SGG G+GG A +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPV 88



Score = 35.8 bits (82), Expect = 0.001
Identities = 33/99 (33%), Positives = 36/99 (36%)

Query: 1033 GTGGSGGAGGEGGALAGNGGQGGSGGSGGIGGTGGQGGGGSAGGAGVAGVQDGAGGAGGG 1092
G G + GA G + G G GG G GG+G G G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 1093 GGLGGSGGAGGAGGVGGIGGQAHAGGAFHDGDAGAGGLG 1131
GG G SGG G GG A G GAGGL
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 35.1 bits (80), Expect = 0.002
Identities = 32/92 (34%), Positives = 37/92 (40%), Gaps = 5/92 (5%)

Query: 432 GDGGAGGVGGAGGRGGWISFLSGQGAGGAGGDGGAGGGAGN---GGDGAVGTFFGGTGAG 488
G G G GA G I+ G G G+G + N GG G +GG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 489 GNGGHGGDPGSGGAGGAGGAGSSAGAAGAGGL 520
GNGG G SGG G GG S+ A A G
Sbjct: 63 GNGGGNG--NSGGGSGTGGNLSAVAAPVAFGF 92



Score = 34.7 bits (79), Expect = 0.003
Identities = 36/103 (34%), Positives = 43/103 (41%), Gaps = 4/103 (3%)

Query: 806 TGGDGSTGGTGGTGGAGGSGGAMAGKGGDGGAG---GMGGGGGVGGNGSNGDHGVSGGNV 862
+GGDG TG +G G G G GGA G G GS GG+
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 863 NGGTGGDGGKGGSGGQGGNGGAAGKALAASY-ADGAEGAGGAG 904
+G GG+G GG G GGN A +A + A GAGG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 33.9 bits (77), Expect = 0.005
Identities = 29/81 (35%), Positives = 35/81 (43%), Gaps = 1/81 (1%)

Query: 704 SGGNGIGGTGGDGGDNGAGGAGGTGGSGSTTGSDGASGTTTTSGGNGGNGGRGADSVMIG 763
SGG+G G G +G G TG SDG SG ++ + GG G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDG-SGWSSENNPWGGGSGSGIHWGGGS 60

Query: 764 GKGAAGGDGGDGGLYGNGGKG 784
G G GG+G GG G GG
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNL 81



Score = 33.9 bits (77), Expect = 0.005
Identities = 26/99 (26%), Positives = 34/99 (34%)

Query: 1251 GAGIYNKGGTGGVGGDGGNGTNGAGGKGGSGGNGGRGADASLFADAGDGGTGGDGGDGGT 1310
G G + G G+ G G G GG+ G ++ + + G GG G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 1311 GTTGAGRGGTGGAGGGGGTGTGTPPPFGTGAPGGSGGTG 1349
G G G G GG + P FG A G G
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 33.5 bits (76), Expect = 0.006
Identities = 32/113 (28%), Positives = 41/113 (36%), Gaps = 3/113 (2%)

Query: 589 GIGGSGGEGGAGGMGGAMAGHGGDGGAGGHGGQGGSGGSGSRGADGVTGPNPNGGGGGDG 648
G G G GA G + G G GG G S + G +G + GGG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 649 GAGGAGGQGGNG---GLAGHAQAAGYSDGVQGVGGAGGKGGAGGLAGDGGTGA 698
G GG G G G G A AA + G + G G A ++ + A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 33.5 bits (76), Expect = 0.006
Identities = 33/124 (26%), Positives = 38/124 (30%)

Query: 1071 GGSAGGAGVAGVQDGAGGAGGGGGLGGSGGAGGAGGVGGIGGQAHAGGAFHDGDAGAGGL 1130
GG G GG GLG GGA G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1131 GGKGGTGGTGGKGGTGGGGADATVFEPFAGNGGHGGAGGGGGTGGAGGAGGTSGDGVTVA 1190
G GG G +GG GTGG + F GG + GA + + A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMAA 122

Query: 1191 GAGP 1194
GP
Sbjct: 123 LKGP 126



Score = 33.5 bits (76), Expect = 0.006
Identities = 33/108 (30%), Positives = 42/108 (38%)

Query: 598 GAGGMGGAMAGHGGDGGAGGHGGQGGSGGSGSRGADGVTGPNPNGGGGGDGGAGGAGGQG 657
G G G H G G G GG S G+ + NP GGG G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 658 GNGGLAGHAQAAGYSDGVQGVGGAGGKGGAGGLAGDGGTGAAGTFASG 705
GNGG G++ + G A G L+ G G A + ++G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 33.5 bits (76), Expect = 0.006
Identities = 37/113 (32%), Positives = 44/113 (38%), Gaps = 6/113 (5%)

Query: 144 GNGGSGGSGAAGQAGGA--GGAAGLIGNGGAGGAGGQGMFN----GGSGGAGGWAGLIGA 197
G G G + A G GG GL GGA G N GGSG W G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 198 GGAGGVGGTGVALDGGAGGAGGNAGVLFGPGGIGGSGGQGMASGGAGGAGGAS 250
G GG G +G G + A V FG + G G+A + GA A+
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 33.1 bits (75), Expect = 0.008
Identities = 39/112 (34%), Positives = 47/112 (41%), Gaps = 9/112 (8%)

Query: 483 GGTGAGGNGGHGGDPGSGGAGGAGGAGSSAGAAGAGGLSPTTGGNGGNGGRGADGYGTGI 542
GG G G N G G+ GG G G GA+ G S GG G G+GI
Sbjct: 3 GGDGRGHNTGAHSTSGNIN-GGPTGLGVGGGASDGSGWSSENNPWGG-------GSGSGI 54

Query: 543 SGASGGAGGDGGRYGNGGDG-GAGGDGMGGASGFSIVFPPGQDGGGGGIGGS 593
G G+GG GN G G G GG+ A+ + FP G GG+ S
Sbjct: 55 HWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVS 106



Score = 33.1 bits (75), Expect = 0.008
Identities = 33/107 (30%), Positives = 40/107 (37%), Gaps = 5/107 (4%)

Query: 1103 GAGGVGGIGGQAHAGGAFHDGDAGAGGLGGKGGTGGTGGKGGTGGGGADATVFEPFAGNG 1162
G G G G G + G G G GG G + GGG+ + + G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGI-----HWG 57

Query: 1163 GHGGAGGGGGTGGAGGAGGTSGDGVTVAGAGPTGGTGAGGAGGNGGA 1209
G G G GGG G +GG GT G+ VA G G G A
Sbjct: 58 GGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.8 bits (74), Expect = 0.010
Identities = 27/103 (26%), Positives = 36/103 (34%)

Query: 939 NGAGGAGGKGGTGLTTGADGATGSRLTAGGNGGDGGDGGSAATAGAKGGAGGVGGDGGLY 998
+G G G G T+G + L GG DG S G G+ GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 999 GNGGKGGNGGAGGNGAAGSNASGAGATGGTGLMGGTGGSGGAG 1041
G G GG+G G+ ++ A T G+GG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.4 bits (73), Expect = 0.013
Identities = 31/108 (28%), Positives = 41/108 (37%), Gaps = 1/108 (0%)

Query: 946 GKGGTGLTTGADGATGS-RLTAGGNGGDGGDGGSAATAGAKGGAGGVGGDGGLYGNGGKG 1004
G G G TGA +G+ G G GG + + GG G G +G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1005 GNGGAGGNGAAGSNASGAGATGGTGLMGGTGGSGGAGGEGGALAGNGG 1052
GNGG GN GS G + + G G G A++ + G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 32.4 bits (73), Expect = 0.013
Identities = 34/100 (34%), Positives = 42/100 (42%), Gaps = 12/100 (12%)

Query: 1195 TGGTGAGGAGGNGGAGGNADGGGIVNPEYGDGGAGGNGGNGSSGGDGGSGGTGGRSGAGI 1254
+GG G G G GN +GG G G G + SG + GG SG+GI
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGP-------TGLGVGGGASDGSGWSSENNPWGGGSGSGI 54

Query: 1255 YNKGGTGGVGGDGGNGTNGAGGKGGSGGNGGRGADASLFA 1294
+ GG+G GNG GGSG G A A+ A
Sbjct: 55 HWGGGSGH-----GNGGGNGNSGGGSGTGGNLSAVAAPVA 89



Score = 32.4 bits (73), Expect = 0.015
Identities = 27/74 (36%), Positives = 32/74 (43%)

Query: 1190 AGAGPTGGTGAGGAGGNGGAGGNADGGGIVNPEYGDGGAGGNGGNGSSGGDGGSGGTGGR 1249
GA T G GG G G GG +DG G + GG G+G + G G+GG G
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGN 70

Query: 1250 SGAGIYNKGGTGGV 1263
SG G G V
Sbjct: 71 SGGGSGTGGNLSAV 84



Score = 32.0 bits (72), Expect = 0.017
Identities = 27/90 (30%), Positives = 34/90 (37%)

Query: 582 GQDGGGGGIGGSGGEGGAGGMGGAMAGHGGDGGAGGHGGQGGSGGSGSRGADGVTGPNPN 641
G DG G G G G + GG G + G GS G + +
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 642 GGGGGDGGAGGAGGQGGNGGLAGHAQAAGY 671
G GGG+G +GG G GGN A G+
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGF 92



Score = 31.6 bits (71), Expect = 0.027
Identities = 29/120 (24%), Positives = 34/120 (28%)

Query: 1040 AGGEGGALAGNGGQGGSGGSGGIGGTGGQGGGGSAGGAGVAGVQDGAGGAGGGGGLGGSG 1099
+GG+G +GG G G GG G G G G GGSG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1100 GAGGAGGVGGIGGQAHAGGAFHDGDAGAGGLGGKGGTGGTGGKGGTGGGGADATVFEPFA 1159
G G GG G A G G G G A + + A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMA 121



Score = 31.6 bits (71), Expect = 0.028
Identities = 25/81 (30%), Positives = 31/81 (38%)

Query: 296 SGSGGAGGDGGLGGLVYGNGGGGGAGGVGGAGGAGIVSGAAGSGGHGGTGGAAGLWGVGG 355
SG G G + G GG GVGG G + + GG+G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 356 HGGDGAHGGAGASGGAGDAGS 376
HG G +G +G G G S
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLS 82



Score = 31.6 bits (71), Expect = 0.028
Identities = 41/120 (34%), Positives = 47/120 (39%), Gaps = 12/120 (10%)

Query: 784 GGDGGTGGTGQGGISFLAPGGQTGGDGSTGGTGGTG------------GAGGSGGAMAGK 831
GGDG TG S GG TG G + G+G G+G G +G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 832 GGDGGAGGMGGGGGVGGNGSNGDHGVSGGNVNGGTGGDGGKGGSGGQGGNGGAAGKALAA 891
G GG G GGG G GGN S V+ G T G GG S G A +AA
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMAA 122



Score = 30.8 bits (69), Expect = 0.039
Identities = 24/72 (33%), Positives = 27/72 (37%)

Query: 376 SGGGDAGDGGAGGRGGWLVGGGGAGGSAGSGGGGGAGGSGANAVTLGSAGGNGGNGGDGG 435
SGG G + GG G G G G+G S N G +G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 436 AGGVGGAGGRGG 447
G GG G GG
Sbjct: 62 HGNGGGNGNSGG 73



Score = 30.8 bits (69), Expect = 0.046
Identities = 25/79 (31%), Positives = 35/79 (44%), Gaps = 3/79 (3%)

Query: 915 NGGDGADAAAGSAGTGGNGGHGGNNGAGGAGGKGGTGLTTGAD---GATGSRLTAGGNGG 971
+GGDG G+ T GN G G G G+G ++ + G +GS + GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 972 DGGDGGSAATAGAKGGAGG 990
G GG+ + G G G
Sbjct: 62 HGNGGGNGNSGGGSGTGGN 80



Score = 30.8 bits (69), Expect = 0.046
Identities = 27/79 (34%), Positives = 35/79 (44%), Gaps = 1/79 (1%)

Query: 1139 TGGKGGTGGGGADATVFEPFAGNGGHGGAGGGGGTGGAGGAGGTSGDGVTVAGAGPTGGT 1198
+GG G GA +T G G G GGG G + G + +G GG+
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTG-LGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 1199 GAGGAGGNGGAGGNADGGG 1217
G G GGNG +GG + GG
Sbjct: 61 GHGNGGGNGNSGGGSGTGG 79



Score = 30.8 bits (69), Expect = 0.047
Identities = 38/120 (31%), Positives = 44/120 (36%), Gaps = 7/120 (5%)

Query: 213 GAGGAGGNAGVLFGPGGI-GGSGGQGMASGGAGGAGGASGLVGNGAVGGAGGIGTTDGGA 271
G G G N G G I GG G G+ G + G+G +S G G+G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 272 GGQGGNARLFGTGGVGGHGGTGAGSGSGGAGGDGGLGGLVYGNGGGGGAGGVGGAGGAGI 331
G GGN G GG GTG + A G L GG GA A I
Sbjct: 63 GNGGGN------GNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAI 116


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2103  cloacin  45     6e-07    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 45.5 bits (107), Expect = 6e-07
Identities = 38/109 (34%), Positives = 42/109 (38%), Gaps = 12/109 (11%)

Query: 127 NGAPGTGQAGGA----GGILWGNGGAGGSGAPGQQGGSGGNAGLIGNGGVGGVGGIGGGV 182
+G G G GA G I NGG G G G G S G+ N GG G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNI---NGGPTGLGVGG--GASDGSGWSSENNPWGGGSGSGIHW 56

Query: 183 GGVGGTGGWLLGNGGTGGTGGVGTGNIAGGAGGFGGSALSLLGNPGATG 231
GG G G GG+G G + FG ALS PGA G
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALS---TPGAGG 102



Score = 41.2 bits (96), Expect = 1e-05
Identities = 32/109 (29%), Positives = 43/109 (39%), Gaps = 15/109 (13%)

Query: 148 AGGSGAPGQQGGSGGNAGLIGNGGVGGVGGIGGGV------------GGVGGTG-GWLLG 194
+GG G G G+ +G I NGG G+G GG GG G+G W G
Sbjct: 2 SGGDGR-GHNTGAHSTSGNI-NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 195 NGGTGGTGGVGTGNIAGGAGGFGGSALSLLGNPGATGTPGGHADVLYLS 243
+G G G +G +G G A + A TPG + +S
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSIS 108



Score = 40.9 bits (95), Expect = 2e-05
Identities = 35/111 (31%), Positives = 42/111 (37%), Gaps = 13/111 (11%)

Query: 123 GEGANGAPGTGQA---GGAGGILWGNGGAGGSGAPGQQ----GGSGGNAGLIGNGGVGGV 175
G G N + GG G+ G G + GSG + GGSG G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGN- 64

Query: 176 GGIGGGVGGVGGTGGWLLGNGGTGGTGGVGTGNIAGGAGGFGGSALSLLGN 226
GG G GG GTGG + V G A G GG A+S+
Sbjct: 65 GGGNGNSGGGSGTGG-----NLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 37.4 bits (86), Expect = 2e-04
Identities = 32/94 (34%), Positives = 37/94 (39%), Gaps = 10/94 (10%)

Query: 127 NGAPGTGQAGGAG--GILWG-----NGGAGGSGAPGQQGGSGGNAGLIGNGGVGGVGGIG 179
NG P GG G W GG GSG G GN G GNG GG G G
Sbjct: 21 NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGG--GNGNSGGGSGTG 78

Query: 180 GGVGGVGGTGGWLLGNGGTGGTGGVGTGNIAGGA 213
G + V + T G GG+ +I+ GA
Sbjct: 79 GNLSAVAAPVAFGFPALSTPGAGGLAV-SISAGA 111


96  MMAR_2112  MMAR_2121  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_2112  2       20       5.218766   PE-PGRS family protein
MMAR_2113  0       18       3.996408   PE-PGRS family protein
MMAR_2114  -2      16       2.787947   hypothetical protein
MMAR_2115  -1      14       2.442911   hypothetical protein
MMAR_2116  -2      15       2.053547   PE-PGRS family protein
MMAR_2117  -1      11       -0.309040  fatty-acid-CoA ligase FadD9
MMAR_2118  -3      10       -0.459362  4-aminobutyrate aminotransferase
MMAR_2119  -2      12       -0.670732  preprotein translocase subunit YajC
MMAR_2120  -2      13       -0.407908  preprotein translocase subunit SecD
MMAR_2121  -1      14       -0.669557  preprotein translocase subunit SecF
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2112  cloacin  35     9e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 35.1 bits (80), Expect = 9e-04
Identities = 45/121 (37%), Positives = 52/121 (42%), Gaps = 22/121 (18%)

Query: 153 SGGLQNGGSGGSAGLIGNGGNGGNGFLGGTGGAGGSGGWLAGSGGNGGAGGSVSGIGEIA 212
SGG G + G+ GN NGG LG GGA GW S N GGS SGI
Sbjct: 2 SGGDGRGHNTGAHSTSGNI-NGGPTGLGVGGGASDGSGW--SSENNPWGGGSGSGIHWGG 58

Query: 213 GAGGAGGNAPLLGWGGNGGVGGNAPQGTGGIGGAGGAGGALSAVGGTG----GTGGSGGV 268
G GNGG GN +GG G GG A++A G T G+GG+
Sbjct: 59 G-----------SGHGNGGGNGN----SGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGL 103

Query: 269 A 269
A
Sbjct: 104 A 104



Score = 32.8 bits (74), Expect = 0.005
Identities = 33/115 (28%), Positives = 46/115 (40%), Gaps = 1/115 (0%)

Query: 326 GIGGFANDTGGLGGQGGDATALLGVGVGGAGSIG-GAGNAAASAGGAGGAGAALVGVGVG 384
G G ++TG G G+GVGG S G G + GG G+G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 385 GIGGIGGFANGTSGAGGAGGSGAAVMGLGVGGAGSIGGAANSTAGAGGDGGEGVA 439
G GG G + G SG GG + AA + G + G + + + G +A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIA 117



Score = 32.4 bits (73), Expect = 0.005
Identities = 27/85 (31%), Positives = 36/85 (42%), Gaps = 6/85 (7%)

Query: 143 GNGGNGYSFTSGGLQNGGSGGSAGLIGNGGNGGNGFLGGT----GGAGGSGGWLAGSGGN 198
G G N + ++ G NGG G G G + G+G+ GG+G W GSG
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGV--GGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 199 GGAGGSVSGIGEIAGAGGAGGNAPL 223
G G SG G G + AP+
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPV 88



Score = 32.0 bits (72), Expect = 0.007
Identities = 26/70 (37%), Positives = 31/70 (44%), Gaps = 1/70 (1%)

Query: 461 AGGQGGQGAVLIGAGFGGAGGDGGSATVNAVGNGGDGGNAGALFGIGAGGHGGNAGSGVG 520
G G +G G G + G G S+ N G GG G G G G GGN SG G
Sbjct: 16 TSGNINGGPTGLGVGGGASDGSGWSSENNPWG-GGSGSGIHWGGGSGHGNGGGNGNSGGG 74

Query: 521 AANGGNGGSV 530
+ GGN +V
Sbjct: 75 SGTGGNLSAV 84



Score = 31.6 bits (71), Expect = 0.010
Identities = 27/104 (25%), Positives = 36/104 (34%), Gaps = 12/104 (11%)

Query: 474 AGFGGAGGDGGSATVNAVGNGGDGGNAGALFGIGAGGHGGNAGSGVGAANGGNGGSVGVI 533
+G G G + G+ + + NGG G G G + GSG + N GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTG--------LGVGGGASDGSGWSSENNPWGG----G 49

Query: 534 SDGSFTPTPVGYGGNGGNGVNGGTGGTGGTGGTLIGTDGTNGSP 577
S GNGG N G G G + + G P
Sbjct: 50 SGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFP 93



Score = 30.8 bits (69), Expect = 0.015
Identities = 28/80 (35%), Positives = 30/80 (37%), Gaps = 6/80 (7%)

Query: 225 GWGGNGGVGGNAPQGTGGIGGAGGAGGALSAVGGT------GGTGGSGGVAGGDGGAGGA 278
G G N G + GG G G GGA G + GG GSG GG G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 279 GRGLFYGLGGAGGMGGSATA 298
G G G G SA A
Sbjct: 66 GGNGNSGGGSGTGGNLSAVA 85



Score = 30.5 bits (68), Expect = 0.021
Identities = 34/120 (28%), Positives = 43/120 (35%), Gaps = 5/120 (4%)

Query: 258 GTGGTGGSGGVAGGDGGAGGAGRGLFYGLGGAGGMGGSATAVTPHTGGTGGVGGEGGAVF 317
G G G + G G G GL G G + G G S+ GG G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSE-----NNPWGGGSGSGIHWG 57

Query: 318 GYAQGGTGGIGGFANDTGGLGGQGGDATALLGVGVGGAGSIGGAGNAAASAGGAGGAGAA 377
G + G GG G + G GG A + G + G G A + + GA A A
Sbjct: 58 GGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIA 117



Score = 30.1 bits (67), Expect = 0.026
Identities = 35/99 (35%), Positives = 45/99 (45%), Gaps = 8/99 (8%)

Query: 125 GANGTATSPNGGAGGILYGNGGN-GYSFTSGGLQNGGSGGSAGLIGNGGNGGNGFLGGTG 183
GA+ T+ + NGG G+ G G + G ++S GG GS G G GNG G
Sbjct: 12 GAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNS 71

Query: 184 GAGGSGGWLAGSGGNGGAGGSVSGIGEIA-GAGGAGGNA 221
G G +G+GGN A + G A GAGG A
Sbjct: 72 GGG------SGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 30.1 bits (67), Expect = 0.026
Identities = 27/90 (30%), Positives = 33/90 (36%), Gaps = 6/90 (6%)

Query: 123 GNGANGTATSPNGGAGGILYGNGGNGYSFTSGGLQNGGSGGSAGLIGNGGNGGNGFLGGT 182
G G N A S +G NGG GG +G S GG+G GG
Sbjct: 6 GRGHNTGAHSTSGNI------NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 183 GGAGGSGGWLAGSGGNGGAGGSVSGIGEIA 212
G G GG GG+G G + +A
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVA 89


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2113  cloacin  40     2e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 40.5 bits (94), Expect = 2e-05
Identities = 40/119 (33%), Positives = 49/119 (41%), Gaps = 10/119 (8%)

Query: 153 SGGTQSGGTGGSAGLIGNGGNGGNGFLGGAGGAAGSGGWLAGSGGNGGAGGSVTGVGEVG 212
SGG G G+ GN NGG LG GGA+ GW S N GGS +G+ G
Sbjct: 2 SGGDGRGHNTGAHSTSGNI-NGGPTGLGVGGGASDGSGW--SSENNPWGGGSGSGIHWGG 58

Query: 213 GAGGAGGSAPLLGWGGNGGAGGDSTQGAGGMGGAGGAGGALASIGGAGGAGGTGTTSGG 271
G+G G GGNG +GG S G A ++ G G + S G
Sbjct: 59 GSGHGNG-------GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 36.6 bits (84), Expect = 2e-04
Identities = 39/138 (28%), Positives = 49/138 (35%), Gaps = 7/138 (5%)

Query: 225 GWGGNGGAGGDSTQGAGGMGGAGGAGGALASIGGAGGAGGTGTTSGGDGGVGGEGSGRLF 284
G G N GA S GG G G GGA G + G G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGA-----SDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 285 GLGGAGGAGGTGITSGGVGGDGGAGGGLLFGLGG--SGGAGGVATDATGIGGTGGAGGES 342
G G GG G +G SG G + FG + GAGG+A + +
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIM 120

Query: 343 GVIIGYAQSGAGGIGGYG 360
+ G + G G+ YG
Sbjct: 121 AALKGPFKFGLWGVALYG 138



Score = 35.5 bits (81), Expect = 5e-04
Identities = 24/78 (30%), Positives = 36/78 (46%), Gaps = 1/78 (1%)

Query: 459 NGGDG-GNGGGLFNIGRGGDGGNGGNAGATGGNGGNGGNIGVVANGTFTQTLFGDGGNGG 517
+GGDG G+ G + +GG G G + G+G + G + + GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 518 NGGTGGTPGTGGTGGSGG 535
+G GG +GG G+GG
Sbjct: 62 HGNGGGNGNSGGGSGTGG 79



Score = 33.5 bits (76), Expect = 0.002
Identities = 27/79 (34%), Positives = 34/79 (43%), Gaps = 1/79 (1%)

Query: 455 GAGGNGGDGGNGGGLFNIGRGGDGGNGGNAGATGGNGGNGGNIGVVANGTFTQTLFGDGG 514
G G N G G + N G G G GG + +G + N G +G G G
Sbjct: 6 GRGHNTGAHSTSGNI-NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGN 64

Query: 515 NGGNGGTGGTPGTGGTGGS 533
GGNG +GG GTGG +
Sbjct: 65 GGGNGNSGGGSGTGGNLSA 83



Score = 33.5 bits (76), Expect = 0.002
Identities = 25/71 (35%), Positives = 29/71 (40%), Gaps = 7/71 (9%)

Query: 421 GTATGGDGGAGGQGAALWGAGFGGDGAVGGNSFVGAGGNGGDGGNGGGLFNIGRGGDGGN 480
G GG G G G A G+G+ + G G+G GG G G GG GN
Sbjct: 18 GNINGGPTGLGVGGGASDGSGWSSENNPWGG---GSGSGIHWGGGSGH----GNGGGNGN 70

Query: 481 GGNAGATGGNG 491
G TGGN
Sbjct: 71 SGGGSGTGGNL 81



Score = 33.1 bits (75), Expect = 0.003
Identities = 29/99 (29%), Positives = 36/99 (36%), Gaps = 2/99 (2%)

Query: 421 GTATGGDGGAGGQGAALWGAGFGGDGAVGGNSFVGAGG--NGGDGGNGGGLFNIGRGGDG 478
G G + GA + G G G + G N GG+G G+ G G G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 479 GNGGNAGATGGNGGNGGNIGVVANGTFTQTLFGDGGNGG 517
GGN + GG+G G V A F G GG
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 33.1 bits (75), Expect = 0.003
Identities = 31/113 (27%), Positives = 46/113 (40%), Gaps = 7/113 (6%)

Query: 192 LAGSGGNGGAGGSVTGVGEVGGAGGAGGSAPLLGWGGNGGAGGDSTQGA-GGMGGAGGAG 250
++G G G G+ + G + G G +G G + G+G S GG G+G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLG----VGGGASDGSGWSSENNPWGGGSGSGIHW 56

Query: 251 GALASIGGAGGAGGTGTTSGGDGGVGGEGSGRLFGLGG--AGGAGGTGITSGG 301
G + G GG G +G SG G + + FG GAGG ++
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109



Score = 32.8 bits (74), Expect = 0.004
Identities = 32/118 (27%), Positives = 46/118 (38%), Gaps = 2/118 (1%)

Query: 261 GAGGTGTTSGGDGGVGGEGSGRLFGLGGAGGAGGTGITSGGVGGDGGAGGGLLFGLGGSG 320
G G G +G G G G G + G+G +S GG+G G+ +G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 321 GAGGVATDATGIGGTGGAGGESG--VIIGYAQSGAGGIGGYGGDIGGTGGAGGVAGVL 376
G GG ++ G GTGG V G+ G GG I + +A ++
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIM 120



Score = 32.0 bits (72), Expect = 0.006
Identities = 36/120 (30%), Positives = 43/120 (35%), Gaps = 12/120 (10%)

Query: 240 AGGMGGAGGAGGALASIGGAGGAGGTGTTSGGDGGVGGEGSGRLFGLGGAGGAGGTGITS 299
+GG G G S GG G G G G G +G G G G +
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 300 GGVGGDGGAGGGLLFGLGGSGGAGGVATDATGIG------GTGGAGGESGVIIGYAQSGA 353
G GG G G GGSG G ++ A + T GAGG + I A S A
Sbjct: 62 HGNGGGNGNSG------GGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 31.2 bits (70), Expect = 0.010
Identities = 21/55 (38%), Positives = 23/55 (41%), Gaps = 1/55 (1%)

Query: 415 GGVGGFGTATGGDGGAG-GQGAALWGAGFGGDGAVGGNSFVGAGGNGGDGGNGGG 468
GG G G G G+G WG G G GG S G GG G+ G G G
Sbjct: 22 GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSG 76



Score = 30.5 bits (68), Expect = 0.022
Identities = 32/121 (26%), Positives = 43/121 (35%), Gaps = 10/121 (8%)

Query: 285 GLGGAGGAGGTGITSGGVGGDGGAGGGLLFGLGGSGGAGGVATDATGIGGTGGAGGESGV 344
G G G G TSG + G G G G S G+G + + GG+G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGV---GGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 345 IIGYAQSGAGGIGGYGGDIGGTGGAGGVAGVLVGAGVGGFGGMGGAGTTGGAGGVGGQGV 404
G GG G+ GG G GG + GF + G G A + +
Sbjct: 60 -------SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGAL 112

Query: 405 T 405
+
Sbjct: 113 S 113



Score = 29.7 bits (66), Expect = 0.032
Identities = 28/99 (28%), Positives = 30/99 (30%)

Query: 123 GDGANGTATSPNGGAGGFLYGNGGNGYSFTSGGTQSGGTGGSAGLIGNGGNGGNGFLGGA 182
G G N A S +G G G G G + G S G GG G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 183 GGAAGSGGWLAGSGGNGGAGGSVTGVGEVGGAGGAGGSA 221
GG SGG G V GAGG A
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_2115  IGASERPTASE  36     1e-04    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 35.8 bits (82), Expect = 1e-04
Identities = 19/71 (26%), Positives = 31/71 (43%), Gaps = 2/71 (2%)

Query: 144 AVLARLVARLARDVQPVPAPAYAAYAPEADQTAEEEPQDEPESKDPKSKDEATEEEAPKE 203
+V + D PVP PA A + + AE Q+ + K++ +ATE A
Sbjct: 1009 SVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVE--KNEQDATETTAQNR 1066

Query: 204 PEASEAEAETE 214
A EA++ +
Sbjct: 1067 EVAKEAKSNVK 1077


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2116  cloacin  44     2e-06    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 43.9 bits (103), Expect = 2e-06
Identities = 36/103 (34%), Positives = 49/103 (47%), Gaps = 2/103 (1%)

Query: 535 TGGTASSSNTGPTPSAGTGGGGTGGGGGGGASASGAVGGTGGG--GGGGGAGAAAGTGAV 592
+GG NTG ++G GG G G GG ++ G+ + GGG G+G G G+
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 593 GGGGGGGGAAGGAGGTAGNGGAGGNAIAFAGSTSFSSASGGAG 635
G GGG G +GG GT GN A +AF + +GG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 42.0 bits (98), Expect = 8e-06
Identities = 35/115 (30%), Positives = 44/115 (38%)

Query: 506 GGGGAGGHGGNGGGNGALGGQGGTGGTGGTGGTASSSNTGPTPSAGTGGGGTGGGGGGGA 565
GG G G + G +G + G G GG S ++ P G G G GGG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 566 SASGAVGGTGGGGGGGGAGAAAGTGAVGGGGGGGGAAGGAGGTAGNGGAGGNAIA 620
G G +GGG G GG +A G G + + GA AIA
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIA 117



Score = 40.9 bits (95), Expect = 2e-05
Identities = 36/113 (31%), Positives = 46/113 (40%)

Query: 455 GNGGAGGNGGAPGAANAIGGQGGQGGIGGNGGNGGNNTSGIDTAVGYAGGGGGGGAGGHG 514
G G G N GA + I G G+GG +G +S + G +G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 515 GNGGGNGALGGQGGTGGTGGTGGTASSSNTGPTPSAGTGGGGTGGGGGGGASA 567
GNGGGNG GG GTGG + + G GG G ++A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 39.7 bits (92), Expect = 4e-05
Identities = 37/116 (31%), Positives = 46/116 (39%), Gaps = 1/116 (0%)

Query: 509 GAGGHGGNGGGNGALGG-QGGTGGTGGTGGTASSSNTGPTPSAGTGGGGTGGGGGGGASA 567
G G G N G + G GG G G GG + S + GG G+G GGG+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 568 SGAVGGTGGGGGGGGAGAAAGTGAVGGGGGGGGAAGGAGGTAGNGGAGGNAIAFAG 623
G GGG G G + A G + GAGG A + AG + A A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 39.7 bits (92), Expect = 4e-05
Identities = 35/101 (34%), Positives = 45/101 (44%), Gaps = 3/101 (2%)

Query: 552 TGGGGTGGGGGGGASASGAVGGTGGGGGGGGAGAAAGTGAVG---GGGGGGGAAGGAGGT 608
+GG G G G +++ GG G G GGGA +G + GGG G G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 609 AGNGGAGGNAIAFAGSTSFSSASGGAGGTGAPGGTGGGGGG 649
GNGG GN+ +G+ SA G P + G GG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 38.5 bits (89), Expect = 1e-04
Identities = 31/89 (34%), Positives = 35/89 (39%)

Query: 363 GGIGTGGNGGLGGNGGAGSIGTTGTDGTAPTTGGPGGAGGAGGDGGAGGAGIGGVGGGSG 422
GG G G N G G + G TG + G G + GG G+GI GG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 423 GNGGLGGNGGAGGHGGINTGTGGTASASG 451
GNGG GN G G G N A G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 35.5 bits (81), Expect = 7e-04
Identities = 30/108 (27%), Positives = 39/108 (36%), Gaps = 5/108 (4%)

Query: 500 GYAGGGGGGGAGGHGGNGGGNGALGGQGGTGGTGGTGGTASSSNTGPTPSAGTGGGGTGG 559
G+ G +GG G G G+G + S +G G+G G GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 560 GGGGGASASGAVGGTGGGGGGGGAGAAAGTGAVGGGGGGGGAAGGAGG 607
G G + GTGG A A G A+ G GG A + G
Sbjct: 68 NGNSGGGS-----GTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 34.7 bits (79), Expect = 0.002
Identities = 39/128 (30%), Positives = 48/128 (37%), Gaps = 17/128 (13%)

Query: 427 LGGNGGAGGHGGINTGTGGTASASGGTGGNGGAGGNGGAPGAANAIGGQGGQGGIGGNGG 486
+ G G G + G ++ +G G G GGA G N GG G G G G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 487 NGGNNTSGIDTAVGYAGGGGGGGAGGHGGNGGGNGALGGQGGTG----GTGGTGGTASSS 542
GN GGG G +GG G GG A+ G T G GG A S
Sbjct: 61 GHGN-------------GGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSI 107

Query: 543 NTGPTPSA 550
+ G +A
Sbjct: 108 SAGALSAA 115



Score = 34.3 bits (78), Expect = 0.002
Identities = 24/79 (30%), Positives = 29/79 (36%), Gaps = 1/79 (1%)

Query: 399 GAGGAGGDGGAGGAGIGGVGGGSGGNGGLGGNGGAGGHGGINTGTGGTASASGGTGGNGG 458
G G G + GA G + GG G G GG G N GG + + GG G
Sbjct: 3 GGDGRGHNTGAHSTS-GNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 459 AGGNGGAPGAANAIGGQGG 477
G GG + G G
Sbjct: 62 HGNGGGNGNSGGGSGTGGN 80



Score = 33.9 bits (77), Expect = 0.002
Identities = 30/100 (30%), Positives = 36/100 (36%), Gaps = 1/100 (1%)

Query: 132 GTGAAGGDGGWLVGNGGNGGSGAPGQAGGAGGSAGLWGAGGAGGAGGSATTPGGAGGAGG 191
G G G NGG G GGA +G W + GGS + GG+G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSG-WSSENNPWGGGSGSGIHWGGGSGH 62

Query: 192 TGGANGLIGGGNGGVGGAGGAGAAGGAGAVGSTAQAGGAG 231
G GG G GG A AA A + + G G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 33.5 bits (76), Expect = 0.003
Identities = 27/76 (35%), Positives = 35/76 (46%)

Query: 419 GGSGGNGGLGGNGGAGGHGGINTGTGGTASASGGTGGNGGAGGNGGAPGAANAIGGQGGQ 478
GG G G + +G G TG G AS G+G + GG G+ GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 479 GGIGGNGGNGGNNTSG 494
G GGNG +GG + +G
Sbjct: 63 GNGGGNGNSGGGSGTG 78



Score = 32.8 bits (74), Expect = 0.006
Identities = 25/71 (35%), Positives = 31/71 (43%)

Query: 624 STSFSSASGGAGGTGAPGGTGGGGGGGPLASTLSIDAGNGADGVNGGTGGTGTTSGGSGG 683
+T S SG G G GGG G S+ + G G+ GG+G +GG G
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69

Query: 684 TGGGAGGQGGN 694
GG G GGN
Sbjct: 70 NSGGGSGTGGN 80



Score = 32.4 bits (73), Expect = 0.007
Identities = 34/108 (31%), Positives = 40/108 (37%), Gaps = 6/108 (5%)

Query: 479 GGIGGNGGNGGNNTSGIDTAVGYAGGGGGGGAGGHGGNGGGNGALGGQGGTGGTGGTGGT 538
GG G G ++TSG G GGG + G G + N GG G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG- 61

Query: 539 ASSSNTGPTPSAGTGGGGTGGGGGGGASASGAVGGTGGGGGGGGAGAA 586
G G GGG+G GG A A+ G G G A
Sbjct: 62 -----HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 30.5 bits (68), Expect = 0.024
Identities = 35/110 (31%), Positives = 42/110 (38%), Gaps = 9/110 (8%)

Query: 258 AGADGAAGTLGSAGVTGQIGGFGGAGGTGGAGGADQSLIGGSSGGGGAGGLGGAGGLGGN 317
+G DG G+ +G I G G GG G S G G + GG G+
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGG---------GASDGSGWSSENNPWGGGSGS 52

Query: 318 GGDATGFGMTGGDGAMGGAGGAAGAAGAAGAVTVPVNFLAHAGSDGGIGT 367
G G G G G +GG +G G AV PV F A S G G
Sbjct: 53 GIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 30.1 bits (67), Expect = 0.031
Identities = 32/113 (28%), Positives = 40/113 (35%), Gaps = 14/113 (12%)

Query: 595 GGGGGGAAGGAGGTAGNGGAGGNAIAFAGSTSFSSASGGAGGTGAPGGTGGGGGGGPLAS 654
GG G G GA T+GN G +G G GA G+G P
Sbjct: 3 GGDGRGHNTGAHSTSGNINGG--------------PTGLGVGGGASDGSGWSSENNPWGG 48

Query: 655 TLSIDAGNGADGVNGGTGGTGTTSGGSGGTGGGAGGQGGNSFQFVPSAAGGAG 707
G +G GG G + GGSG G + +F F + GAG
Sbjct: 49 GSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAG 101


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_2117  NUCEPIMERASE  37     3e-04    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 37.5 bits (87), Expect = 3e-04
Identities = 51/252 (20%), Positives = 77/252 (30%), Gaps = 77/252 (30%)

Query: 780 TVLLTGATGFLGRYLALEWLERMDLV-------DGKLICLVRA-------------KSDT 819
L+TGA GF+G +++ LE V D + L +A K D
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 820 EARARLDKTFDSGDPELLAHYRALAGDHLEVLAGDKGEADLGLDRQTWQRLADTVDLIVD 879
R + F SG E + R + + +
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLA-----------------VRYSLE----------N 94

Query: 880 PAALVNHVLPYSQLFGPNALGTAELLRLALTSKIKPYSYTSTIGVADQIPPSAFTEDADI 939
P A + N G +L +KI+ Y S+ V F+ D +
Sbjct: 95 PHAYAD----------SNLTGFLNILEGCRHNKIQHLLYASSSSVYGLNRKMPFSTDDSV 144

Query: 940 RVISATRAVDDSYANGYSNSKWAGEVLLREAHDLCGLPVAVFRCDMILADTTWAGQLNVP 999
D + Y+ +K A E++ L GLP R T G P
Sbjct: 145 ----------DHPVSLYAATKKANELMAHTYSHLYGLPATGLRF------FTVYGPWGRP 188

Query: 1000 DM----FTRMIL 1007
DM FT+ +L
Sbjct: 189 DMALFKFTKAML 200


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_2120  SECFTRNLCASE  57     1e-10    Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 56.8 bits (137), Expect = 1e-10
Identities = 32/197 (16%), Positives = 74/197 (37%), Gaps = 24/197 (12%)

Query: 386 QLANVLKYGSLPLSFESSEAQTVSATLGLTSLRAGLIAGAIGLALVLLY-SLLYYRVLGL 444
++ L L S E +V + + + + +++ Y + + L
Sbjct: 123 KVETALTAVDPALKITSFE--SVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFAL 180

Query: 445 LTALSLIASGAMVFAILVLLGRYINYTLDLAGIAGLIIGIGTTADSFVVFFERIKDEIRE 504
++L+ + + +L + L +A L+ G + + VV F+R+++ + +
Sbjct: 181 GAVVALVHDVLLTVGLFAVLQLKFD----LTTVAALLTITGYSINDTVVVFDRLRENLIK 236

Query: 505 GRSFR------SAVPRGWARARKTIVSGNAVTFLAAAVLYFLAIGQVKGFAFTLGLTTIL 558
++ +V +R T ++ T LA + ++GF F +
Sbjct: 237 YKTMPLRDVMNLSVNETLSRTVMTGMT----TLLALVPMLIWGGDVIRGFVFAM------ 286

Query: 559 DIVVVFLVTWPLVYLAS 575
+ VF T+ VY+A
Sbjct: 287 -VWGVFTGTYSSVYVAK 302



Score = 36.4 bits (84), Expect = 2e-04
Identities = 22/126 (17%), Positives = 44/126 (34%), Gaps = 12/126 (9%)

Query: 14 LSVFLVLLIGVYLLVF-LTGDKKAAPKLGIDLQGGTRVTLTARTPDGSAPSREALAQAQQ 72
++ +++ + LV L GID +GGT + + T R AL + +
Sbjct: 26 AAIVMMIASVILPLVIGLN--------FGIDFKGGTTIRTESTTAIDVGVYRAAL-EPLE 76

Query: 73 IISARVNGLGVSGSEVVVDGDNLVITVPGNDGNEARNLGQTARLYIRPVMNSM-PAQPAA 131
+ ++ + S ++ DG A G + + V ++ PA
Sbjct: 77 LGDVIISEVR-DPSFREDQHVAMIRIQMQEDGQGAEGQGAQGQELVNKVETALTAVDPAL 135

Query: 132 QEPQQE 137
+ E
Sbjct: 136 KITSFE 141


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_2121  SECFTRNLCASE  258    5e-85    Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 258 bits (660), Expect = 5e-85
Identities = 75/310 (24%), Positives = 146/310 (47%), Gaps = 20/310 (6%)

Query: 60 FEVVGRRKLWYGISGAIMAIAILSIIVRGFTFGIDFKGGTTVSFP----------RGDSQ 109
F+ + +G + +M +++ +V G FGIDFKGGTT+ R +
Sbjct: 14 FDFFRWQWATFGAAIVMMIASVILPLVIGLNFGIDFKGGTTIRTESTTAIDVGVYRAALE 73

Query: 110 VTQVEEVFHNVVGSDPESVVTVGSGASATVQIRSETLSNEQTEKLRDALFDAFHPKGADG 169
++ +V + V S A +Q++ + E L +
Sbjct: 74 PLELGDVIISEVRD--PSFREDQHVAMIRIQMQEDGQGAEGQGAQGQELVNKVETALTAV 131

Query: 170 KPSKKAISDAAVSETWGGQITKKAVIALVVFLVLVAIYITVRYERYMTISAIAAMIFDLT 229
P+ K S +V G++ AV +L+ V++ YI VR+E + A+ A++ D+
Sbjct: 132 DPALKITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVVALVHDVL 191

Query: 230 VTAGVYSLVGFEVTPATVIGLLTILGFSLYDTVIVFDKVEENTHGFQHTTRRTFAEQANL 289
+T G+++++ + TV LLTI G+S+ DTV+VFD++ EN ++ R + NL
Sbjct: 192 LTVGLFAVLQLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLR---DVMNL 248

Query: 290 AVNQTFMRSINTSLISVLPVLSLMVVAVWLLGVGTLKDLALVQLIGIIVGTYSSIFFATP 349
+VN+T R++ T + ++L ++ +++ G ++ + G+ GTYSS++ A
Sbjct: 249 SVNETLSRTVMTGMTTLLALVPMLI-----WGGDVIRGFVFAMVWGVFTGTYSSVYVAKN 303

Query: 350 LLVTLRERTE 359
+++ +
Sbjct: 304 IVLFIGLDRN 313


97  MMAR_2279  MMAR_2290  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_2279  -1      14       -0.583931  ABC transporter ATP-binding protein
MMAR_2280  -1      14       -1.117934  transcriptional regulatory protein
MMAR_2281  -1      13       -0.842483  transcriptional regulatory protein
MMAR_2282  -2      12       -0.517413  aconitate hydratase
MMAR_2283  -1      12       0.124254   hypothetical protein
MMAR_2284  0       12       0.257725   invasion and intracellular persistence protein,
MMAR_2285  0       13       -0.149682  invasion and intracellular persistence protein,
MMAR_2286  0       13       0.010277   transcriptional regulatory protein, MoxR1
MMAR_2287  1       10       -0.146484  hypothetical protein
MMAR_2288  0       10       -0.798657  hypothetical protein
MMAR_2289  -1      10       -1.010022  3-oxoacyl-ACP reductase
MMAR_2290  -1      13       -1.147890  enoyl-(acyl carrier protein) reductase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_2279  BCTERIALGSPC  30     0.020    Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C signature.

Length = 272

Score = 29.9 bits (67), Expect = 0.020
Identities = 15/53 (28%), Positives = 25/53 (47%), Gaps = 2/53 (3%)

Query: 82 ARDRVLSARGLDVLLTDLEKQQALMAEVADDAARDRAIRRYGQLEERFVALGG 134
D ++ GLD L D E+ + M +AD + R GQ ++ ++ GG
Sbjct: 220 DNDMAVALNGLD--LRDAEQAKKAMERMADVHNFTLTVERDGQRQDIYMEFGG 270


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2281  HTHTETR  68     2e-16    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 68.1 bits (166), Expect = 2e-16
Identities = 32/177 (18%), Positives = 63/177 (35%), Gaps = 14/177 (7%)

Query: 1 MPKVSEDHLAARRRQILDGARRCFAEYGYDKATVRRLEQAIGMSRGAIFHHFRDKDALFF 60
M + ++ R+ ILD A R F++ G ++ + +A G++RGAI+ HF+DK LF
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 61 ALAHEDAERMADVAS--------------REGLIQVMRDMLAAPDQFDWLATRLEIARKL 106
+ + ++ RE LI V+ + + + +
Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120

Query: 107 RNDPEFSRGWAERSAELAAATTDRLRRQKQANRVRDDVPNEVLQCYLDLVLDGLVAR 163
+ E L+ +A + D+ + + GL+
Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMEN 177


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
MMAR_2284  GPOSANCHOR  54     7e-10    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 53.9 bits (129), Expect = 7e-10
Identities = 36/195 (18%), Positives = 65/195 (33%), Gaps = 8/195 (4%)

Query: 44 DSIAALIADVARANQRLDDLSAAVELEQEGVNKAMVAVETARDEAAAAEHELEASQQAVK 103
AAL A A + L+ + + A E LE +
Sbjct: 218 AEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFST 277

Query: 104 DANAAIAAAQHRFD----TFAAATYMNGPSDSYLTATSPDEIIAAATAAKTLAASAQTVM 159
+A I + A + + ++ + D + A+ A K L A Q +
Sbjct: 278 ADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRD-LDASREAKKQLEAEHQKLE 336

Query: 160 ANL---ERARTRQVNKESASRLAKQKADKAAEEAKTSQDAAVTALTDTQRKFDQQREEVN 216
E +R ASR AK++ + ++ + + + +R D RE
Sbjct: 337 EQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKK 396

Query: 217 RLAAERDEAEAKLQA 231
++ +EA +KL A
Sbjct: 397 QVEKALEEANSKLAA 411



Score = 48.9 bits (116), Expect = 3e-08
Identities = 33/194 (17%), Positives = 59/194 (30%), Gaps = 8/194 (4%)

Query: 46 IAALIADVARANQRLDDLSAAVELEQEGVNKAMVAVETARDE-------AAAAEHELEAS 98
AAL A A + L+ + + A + A
Sbjct: 185 KAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTAD 244

Query: 99 QQAVKDANAAIAAAQHRFDTFAAATYMNGPSDSYLTATSPDEIIAAATAAKTLAASAQTV 158
+K A AA + R A + +A + A A + A +
Sbjct: 245 SAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKI-KTLEAEKAALEAEKADLEHQ 303

Query: 159 MANLERARTRQVNKESASRLAKQKADKAAEEAKTSQDAAVTALTDTQRKFDQQREEVNRL 218
L R ASR AK++ + ++ + + + +R D RE +L
Sbjct: 304 SQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQL 363

Query: 219 AAERDEAEAKLQAA 232
AE + E + + +
Sbjct: 364 EAEHQKLEEQNKIS 377



Score = 42.7 bits (100), Expect = 2e-06
Identities = 55/257 (21%), Positives = 85/257 (33%), Gaps = 31/257 (12%)

Query: 44 DSIAALIADVARANQRLDDLSAAVELEQEGVNKAMVAVETAR-------DEAAAAEHELE 96
I L A+ A R +L A+E ++T E A EH+ +
Sbjct: 246 AKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQ 305

Query: 97 ASQQAVKDANAAIAAAQHRFDTFAA----ATYMNGPSDSYLTATSPDEIIAAATAAKTLA 152
+ + A++ A N S++ + D + A+ AK
Sbjct: 306 VLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRD--LDASREAKKQL 363

Query: 153 ASAQTVMANL----ERARTRQVNKESASRLAKQKADKAAEEAKTSQDAAVTALTDTQRKF 208
+ + E +R ASR AK++ +KA EEA + A + +
Sbjct: 364 EAEHQKLEEQNKISEASRQSLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEESK 423

Query: 209 DQQREEVNRLAAERDEAEAKL-------QAARLVAWSSAGAEGPPPGAMWDPGARPGN-G 260
+E L A+ EAEAK QA L + A P A+PGN
Sbjct: 424 KLTEKEKAELQAKL-EAEAKALKEKLAKQAEELAKLRAGKASDSQT-----PDAKPGNKA 477

Query: 261 RRWDGWDPTLPQIPSAN 277
G P P+ N
Sbjct: 478 VPGKGQAPQAGTKPNQN 494



Score = 37.7 bits (87), Expect = 9e-05
Identities = 33/196 (16%), Positives = 62/196 (31%), Gaps = 4/196 (2%)

Query: 38 NADSRTDSIAALIADVARANQRLDDLSAAVELEQEGVNKAMVAVETARDEAAAAEHELEA 97
A + + + +A I + L+ A +E E AM + E E A
Sbjct: 166 GAMNFSTADSAKIKTLEAEKAALEARQAELEKALE---GAMNFSTADSAKIKTLEAEKAA 222

Query: 98 SQQAVKDANAAIAAAQHRFDTFAAATYMNGPSDSYLTATSPDEIIAAATAAKTLAASAQT 157
D A+ A + +A + L A E+ A A + +
Sbjct: 223 LAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQA-ELEKALEGAMNFSTADSA 281

Query: 158 VMANLERARTRQVNKESASRLAKQKADKAAEEAKTSQDAAVTALTDTQRKFDQQREEVNR 217
+ LE + +++ Q + + + DA+ A + + + E+
Sbjct: 282 KIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKI 341

Query: 218 LAAERDEAEAKLQAAR 233
A R L A+R
Sbjct: 342 SEASRQSLRRDLDASR 357


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
MMAR_2286  HTHFIS  32     0.003    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 32.5 bits (74), Expect = 0.003
Identities = 32/153 (20%), Positives = 58/153 (37%), Gaps = 23/153 (15%)

Query: 48 IVGQD----QLVERMLVGLLAKGHVLLEGVPGVAKTL---AVETFARVVGGSFARIQ--- 97
+VG+ ++ + + +++ G G K L A+ + + G F I
Sbjct: 139 LVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAA 198

Query: 98 FTPDLVPTDIVGTRIYRQGKEEFDTELGPVVANF-------LLADEINRAPAKVQSALLE 150
DL+ +++ G K F F L DEI P Q+ LL
Sbjct: 199 IPRDLIESELFGHE-----KGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLR 253

Query: 151 VMAERHVS-IGGKTFPMPNPFLVMATQNPIEQE 182
V+ + + +GG+T + +V AT ++Q
Sbjct: 254 VLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQS 286


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_2289  DHBDHDRGNASE  111    9e-32    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 111 bits (279), Expect = 9e-32
Identities = 69/252 (27%), Positives = 122/252 (48%), Gaps = 21/252 (8%)

Query: 24 RSVLVTGGNRGIGLAIAQRLAADGHRVAVTHRGSGAPEGLFGVE-----------CDVTD 72
+ +TG +GIG A+A+ LA+ G +A E + DV D
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 73 NDAVDRAFKEVEEHQGPVEVLVSNAGLSADAFLIRMTEERFEKVIDANLTGAFRVAQRAS 132
+ A+D +E GP+++LV+ AG+ + +++E +E N TG F ++ S
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 133 RSMQRKKFGRLIFIGSVSGSWGIGNQANYAASKAGVIGMARSIARELSKVNVTANVVAPG 192
+ M ++ G ++ +GS + A YA+SKA + + + EL++ N+ N+V+PG
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 193 YIDTDMTRAL-------DERIQEGALQF---IPAKRVGTAAEVAGVVSFLASEDASYISG 242
+TDM +L ++ I+ F IP K++ +++A V FL S A +I+
Sbjct: 189 STETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHITM 248

Query: 243 AVIPVDGGMGMG 254
+ VDGG +G
Sbjct: 249 HNLCVDGGATLG 260


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_2290  DHBDHDRGNASE  46     6e-08    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 45.8 bits (108), Expect = 6e-08
Identities = 50/275 (18%), Positives = 101/275 (36%), Gaps = 38/275 (13%)

Query: 5 LEGKRILVSGIITDSSIAFHIARVAQEQGAQLVLTGFD----RMRLIQRIVDRLPQKAPL 60
+EGK ++G I +AR QGA + D ++ + + + A
Sbjct: 6 IEGKIAFITG--AAQGIGEAVARTLASQGAHI--AAVDYNPEKLEKVVSSLKAEARHAEA 61

Query: 61 IELDVQNEEHLASLAGRVTEVIGEGNNLDGVVHSIGFMPQSGMGINPFFDAPYEDVSKGI 120
DV++ + + R+ +G +D +V+ G + E+
Sbjct: 62 FPADVRDSAAIDEITARIEREMG---PIDILVNVAGVLR-----PGLIHSLSDEEWEATF 113

Query: 121 HISAYSYASLAKALLPIM--NPGGSIVGMDFD----PTRAMPAYNWMTVAKSALESVNRF 174
+++ + ++++ M GSIV + + P +M AY +K+A +
Sbjct: 114 SVNSTGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYA---SSKAAAVMFTKC 170

Query: 175 VAREAGKYGVRSNLVAAGPIRTLAMSAIVGGALGEE-----AGAQIQLLEDGWDQRAPVG 229
+ E +Y +R N+V+ G T ++ G E + + P+
Sbjct: 171 LGLELAEYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKT-------GIPLK 223

Query: 230 WNMKDPTPVAKTVCAVLSEWLPATTGDIIFADGGA 264
+ P+ +A V ++S T + DGGA
Sbjct: 224 -KLAKPSDIADAVLFLVSGQAGHITMHNLCVDGGA 257


98  MMAR_2294  MMAR_2300  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias   Product
MMAR_2294  -2      11       1.270071  hypothetical protein
MMAR_2295  -1      12       1.787640  hypothetical protein
MMAR_2296  0       12       2.062512  hypothetical protein
MMAR_2297  0       12       2.266178  transcriptional regulator
MMAR_2298  -1      8        1.326894  two-component regulator receiver
MMAR_2299  -1      9        1.267938  two-component regulator - sensor kinase
MMAR_2300  -2      10       0.700989  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_2294  IGASERPTASE  29     0.045    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 28.9 bits (64), Expect = 0.045
Identities = 33/211 (15%), Positives = 66/211 (31%), Gaps = 24/211 (11%)

Query: 158 LRVARVELRSIDPPPS---IQASMEKQMKADREKRAMILTAEGTREAAIKQAEGQKQSQI 214
VA+ + + + A++EK+ KA E Q + SQ+
Sbjct: 1083 NEVAQSGSETKETQTTETKETATVEKEEKAKVETEK-------------TQEVPKVTSQV 1129

Query: 215 LAAEGAKQAAILAAEADRQSRMLRAQGERAAAYLRAQGEAKAIQKTFAAIKAGRPTPEML 274
+ + AE R++ E + + ++T + ++ +
Sbjct: 1130 SPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVE-----QPVT 1184

Query: 275 AYQYLQTLPEMARGDANKVWVVPSDFNAALQGFTRMLGKPGEDGVFRFEASPVEDLPKHA 334
+ T + N P+ + + K R VE +
Sbjct: 1185 ESTTVNTGNSVVE---NPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSS 1241

Query: 335 TDGDDDEVSDWFSTETDPAIAQAVAKAEAIA 365
D + D ST T+ ++ A AKA+ +A
Sbjct: 1242 NDRSTVALCDLTSTNTNAVLSDARAKAQFVA 1272


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2297  HTHTETR  57     2e-12    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 56.9 bits (137), Expect = 2e-12
Identities = 18/53 (33%), Positives = 30/53 (56%)

Query: 8 RTARARIRDEALRLFAERGPDAVTMRDIATAAGVSPALLIRHYGSKDGLVEAV 60
+ R I D ALRLF+++G + ++ +IA AAGV+ + H+ K L +
Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
MMAR_2298  HTHFIS  86     2e-21    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 86.0 bits (213), Expect = 2e-21
Identities = 39/129 (30%), Positives = 70/129 (54%), Gaps = 1/129 (0%)

Query: 26 APRVLVVEDSETIREMVNEALADVGYHTDTRSDGEGLERVLQGLRPDLVVLDVMLPGRDG 85
+LV +D IR ++N+AL+ GY S+ L R + DLVV DV++P +
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 86 FALIDVIREWG-DIGIVLITARDGLPDRLRGLDGGADDYVVKPFELAELVSRVGAVLRRR 144
F L+ I++ D+ +++++A++ ++ + GA DY+ KPF+L EL+ +G L
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 145 GRLPRVVQV 153
R P ++
Sbjct: 123 KRRPSKLED 131


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_2299  PRTACTNFAMLY  35     5e-04    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 35.4 bits (81), Expect = 5e-04
Identities = 19/58 (32%), Positives = 22/58 (37%)

Query: 104 SPDTTAGPAVPALPGPPPLIPPGGHQGPHGPPPPPPPPPGGPPPGPPPDATATAAVHT 161
+ + P P P G Q P P P P P PP G A A AAV+T
Sbjct: 559 NGNGQWSLVGAKAPPAPKPAPQPGPQPPQPPQPQPEAPAPQPPAGRELSAAANAAVNT 616


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
MMAR_2300  V8PROTEASE  48     1e-08    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 48.5 bits (115), Expect = 1e-08
Identities = 27/170 (15%), Positives = 59/170 (34%), Gaps = 31/170 (18%)

Query: 125 DGSGGMGCTAGFLVRTNAGRTGILTAGHCNKE--GEASKVSINYSA------GGGYVNIG 176
+G + G +V G+ +LT H G+ + SA G
Sbjct: 97 APTGTFIAS-GVVV----GKDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAE 151

Query: 177 TFSQSVSEGLNGEAHDIGLITLDSGKIPQSPAIKAAVPVTGIAT--DLKVGQLLCKFGMK 234
++ EG D+ ++ Q+ I V ++ + +V Q + G
Sbjct: 152 QITKYSGEG------DLAIVKF--SPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYP 203

Query: 235 TGRAEC------GQVTDISASKVAFLAASECGDSGGPVYRLDDDGTAVAV 278
+ G++T + + + ++ G+SG PV+ ++ + +
Sbjct: 204 GDKPVATMWESKGKITYLKGEAMQYDLSTTGGNSGSPVF--NEKNEVIGI 251


99  MMAR_2481  MMAR_2485  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_2481  -1      15       1.494129   PE-PGRS family protein
MMAR_2482  0       15       -1.916690  hypothetical protein
MMAR_2483  -1      11       -0.712877  antibiotic resistance ABC transporter efflux
MMAR_2484  -2      11       -1.448322  hypothetical protein
MMAR_2485  -2      13       -1.532912  antibiotic resistance ABC transporter efflux
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2481  cloacin  39     6e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.9 bits (90), Expect = 6e-05
Identities = 31/93 (33%), Positives = 36/93 (38%), Gaps = 4/93 (4%)

Query: 398 GNGGAGHDGHIGGGAGGTGGAGGNGTSADGVGHGGNGGIGGNGNSASNGGGDGGNGGAGG 457
G G GH+ GA T G G + GVG G + G G + + GGG G GG
Sbjct: 3 GGDGRGHNT----GAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 458 TGGHGGLLIGNGGAGGIGGTGGSGGAGAPGGIG 490
GHG GG G G AP G
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 37.0 bits (85), Expect = 2e-04
Identities = 36/99 (36%), Positives = 44/99 (44%), Gaps = 4/99 (4%)

Query: 546 GFGGNGGASGTGG--AGGAGGAGGAGGAGGAGGTSSVSGNIGAVGGNGGVGGDGGDGGDG 603
G G N GA T G GG G G GGA G SS + G G+G G G G+G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 604 GDGGAAGAAGGAGGQGGFLGSAGSAGGSGVG--GAGGAA 640
G G +G G GG + + + G + GAGG A
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 36.6 bits (84), Expect = 3e-04
Identities = 44/140 (31%), Positives = 55/140 (39%), Gaps = 15/140 (10%)

Query: 444 SNGGGDGGNGGAGGTGGHGGLLIGNGGAGGIGGTGGSGGAGAPGGIGGGGGGGGIGTATN 503
S G G G N GA T G+ NGG G+G GG A G G G +
Sbjct: 2 SGGDGRGHNTGAHSTSGNI-----NGGPTGLGVGGG-----ASDGSGWSSENNPWGGGSG 51

Query: 504 LGIAAEGGLGGDGGGGGDTTAVGGNGGTGGVGGNGGNASAVFGFGGNGGASGTGGAGGAG 563
GI GG G GGG G +GG G GGN +A FG ++ G
Sbjct: 52 SGIHWGGGSGHGNGGGN-----GNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVS 106

Query: 564 GAGGAGGAGGAGGTSSVSGN 583
+ GA A A +++ G
Sbjct: 107 ISAGALSAAIADIMAALKGP 126



Score = 35.5 bits (81), Expect = 8e-04
Identities = 33/104 (31%), Positives = 41/104 (39%), Gaps = 2/104 (1%)

Query: 373 NGGDG-GRGGDALSDYSTVTGPSATGGNGGAGHDGHIGGGAGGTGGAGGNGTSADGVGHG 431
+GGDG G A S + G G GG DG G + GG+G+ G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGS-GWSSENNPWGGGSGSGIHWGGGS 60

Query: 432 GNGGIGGNGNSASNGGGDGGNGGAGGTGGHGGLLIGNGGAGGIG 475
G+G GGNGNS G G G + GAGG+
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 35.5 bits (81), Expect = 8e-04
Identities = 32/94 (34%), Positives = 34/94 (36%), Gaps = 5/94 (5%)

Query: 567 GAGGAGGAGGTSSVSGNIGAVGGNGGVGGDGGDGGDGGDGGAAGAAGGAGGQGGFLGSAG 626
G G G G S SGNI GG G+G GG G GG G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNIN--GGPTGLGVGGGASDGSGWSSENNPWGGGSGSG---IHWG 57

Query: 627 SAGGSGVGGAGGAAGNGGSAGAGGDGGVAAGTFG 660
G G GG G +G G G A FG
Sbjct: 58 GGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 34.3 bits (78), Expect = 0.002
Identities = 32/109 (29%), Positives = 42/109 (38%), Gaps = 4/109 (3%)

Query: 390 VTGPSATGGNGGAGHDGHIGGGAGGTGGAGGNGTSADGVGHGGNGGIGGNGNSASNGGGD 449
++G G N GA G GG G G G ++DG G G G+ + G
Sbjct: 1 MSGGDGRGHNTGAHSTS--GNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGS--GIHW 56

Query: 450 GGNGGAGGTGGHGGLLIGNGGAGGIGGTGGSGGAGAPGGIGGGGGGGGI 498
GG G G GG+G G+G G + G P G GG +
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAV 105



Score = 34.3 bits (78), Expect = 0.002
Identities = 28/86 (32%), Positives = 35/86 (40%)

Query: 489 IGGGGGGGGIGTATNLGIAAEGGLGGDGGGGGDTTAVGGNGGTGGVGGNGGNASAVFGFG 548
+ GG G G A + GG G G GGG + G + GG G+ G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 549 GNGGASGTGGAGGAGGAGGAGGAGGA 574
G+G G G +GG G GG A A
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 33.5 bits (76), Expect = 0.003
Identities = 36/113 (31%), Positives = 42/113 (37%), Gaps = 7/113 (6%)

Query: 527 GNGGTGGVGGNGGNASAVFGFGGNGGASGTGGAGGAGGAGGAGGAGGAGGTSSVSGNIGA 586
G G G GN + GG G GGA G G G S + G+
Sbjct: 6 GRGHNTGAHSTSGNIN-----GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 587 VGGNGGVGGDGGDGGDGGDGGAAGAAGGAGGQGGFLGSAGSAGGSGVGGAGGA 639
GNGG G+ G G G +A AA A G S AGG V + GA
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPAL--STPGAGGLAVSISAGA 111



Score = 33.1 bits (75), Expect = 0.003
Identities = 24/73 (32%), Positives = 30/73 (41%)

Query: 140 GNGGNGGSGATGQVGGSGGAAGLIGTGGTGGAGGTGAAGGNGGAGGWLFGDGGIGGTGGA 199
G G N G+ +T G +G G + G+G + GG G GG G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 200 GATGGSGGAGGVG 212
G G SGG G G
Sbjct: 66 GGNGNSGGGSGTG 78



Score = 32.8 bits (74), Expect = 0.005
Identities = 22/72 (30%), Positives = 26/72 (36%)

Query: 356 GGNGGDGGAAGLFAIGGNGGDGGRGGDALSDYSTVTGPSATGGNGGAGHDGHIGGGAGGT 415
G N G +G G G G G S +S+ P G G G G G GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 416 GGAGGNGTSADG 427
G G G+ G
Sbjct: 68 NGNSGGGSGTGG 79



Score = 32.4 bits (73), Expect = 0.007
Identities = 34/102 (33%), Positives = 44/102 (43%), Gaps = 9/102 (8%)

Query: 189 GDGGIGGTGGAGATGGS--GGAGGVGFGSGGTGGFGGAGAA---GGTGGDGALLWGNGGA 243
G G G GA +T G+ GG G+G G G + G G + GG G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 244 GGQGGTGMTGINGGSAGHGGSGGNAVGL----LGSGGAGGQG 281
G GG G +G G+ G+ + V L + GAGG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 30.8 bits (69), Expect = 0.017
Identities = 34/121 (28%), Positives = 48/121 (39%), Gaps = 7/121 (5%)

Query: 194 GGTGGAGATGGSGGAGGVGFGSGGTGGFGGAGAAGGTGGDGALLWGNGGAGGQGGTGMTG 253
GG G TG +G + G G G GGA G + WG G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNP-WGGGSGSGIHWGG--- 58

Query: 254 INGGSAGHGGSGGNAVGLLGSGGAGGQGGTGLA-GIDGVSSHGSGTASTGAAGTNVEHST 312
G G+GG GN+ G G+GG +A G +S+ G+G + + + +
Sbjct: 59 --GSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAI 116

Query: 313 A 313
A
Sbjct: 117 A 117



Score = 30.1 bits (67), Expect = 0.039
Identities = 31/106 (29%), Positives = 39/106 (36%), Gaps = 11/106 (10%)

Query: 120 NGTDGAAGTGANGGDGGLLWGNGGNGGSGATGQVGGSGGAAGLIGTGGTGGAGGTGAAGG 179
N + NGG GL G G + GSG + + GG +G +G G G GG
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSG---SGIHWGGGSGHGNGG 66

Query: 180 NGGAGGWLFGDGGIGGTGGAGATGGSGGAGGVGFGSGGTGGFGGAG 225
G G GG+G G GF + T G GG
Sbjct: 67 GNGNSG--------GGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2482  HTHTETR  55     2e-11    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 55.0 bits (132), Expect = 2e-11
Identities = 19/56 (33%), Positives = 31/56 (55%)

Query: 18 SNTREHILTCARELFALNGLDRTSVRSVAAAAGVDASLVHHYYGTKQQLFAAAIQI 73
TR+HIL A LF+ G+ TS+ +A AAGV ++ ++ K LF+ ++
Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWEL 65


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2484  HTHTETR  55     2e-11    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 54.6 bits (131), Expect = 2e-11
Identities = 19/56 (33%), Positives = 31/56 (55%)

Query: 18 SNTREHILTCARELFALNGLDRTSVRSVAAAAGVDASLVHHYYGTKQQLFAAAIQI 73
TR+HIL A LF+ G+ TS+ +A AAGV ++ ++ K LF+ ++
Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWEL 65


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_2485  ABC2TRNSPORT  47     1e-08    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 46.9 bits (111), Expect = 1e-08
Identities = 38/172 (22%), Positives = 76/172 (44%), Gaps = 2/172 (1%)

Query: 53 TMQRERASGTLERVLTTPLRRLDMLAGYGTAFSLAAAAQATVACIVSFWLLGFDTAGSPV 112
R T E +L T LR D++ G A++ AA A V LG+ S +
Sbjct: 90 AFGRMEGQRTWEAMLYTQLRLGDIVLGE-MAWAATKAALAGAGIGVVAAALGYTQWLSLL 148

Query: 113 WVFVIAVVNAILGVGLGLLFSAFARTEFQAVQFIPLVMVPQLLLAGIIVPRAVMPTWLEW 172
+ + + + LG++ +A A + + + LV+ P L L+G + P +P +
Sbjct: 149 YALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQT 208

Query: 173 VSNAMPASYALEALQQVGAHPELTYIALRDIVVVVVFAVASLCLAAATLRRR 224
+ +P S++++ ++ + + + + + ++ V L+ A LRRR
Sbjct: 209 AARFLPLSHSIDLIRPIMLGHPVVDVCQH-VGALCIYIVIPFFLSTALLRRR 259


100  MMAR_2598  MMAR_2607  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_2598  -2      11       -1.187631  hypothetical protein
MMAR_2599  -2      9        -0.054212  hypothetical protein
MMAR_2600  -2      8        0.486143   short-chain dehydrogenase
MMAR_2601  -2      10       0.839198   hypothetical protein
MMAR_2602  -2      11       0.480211   hypothetical protein
MMAR_2603  -1      10       -0.640658  hypothetical protein
MMAR_2604  -2      9        -1.329471  hypothetical protein
MMAR_2605  -2      10       -2.183955  anaerobic dehydrogenase
MMAR_2606  -3      10       -2.305358  Ser/Thr-protein kinase
MMAR_2607  -2      10       -2.157310  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_2598  CHANLCOLICIN  30     0.018    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 30.4 bits (68), Expect = 0.018
Identities = 22/77 (28%), Positives = 37/77 (48%), Gaps = 7/77 (9%)

Query: 230 ASPTLLNHTAKSAARAYANMELPLAEVKAVAKATDTSINDVVMTIVDDALHHYLDEHRAP 289
++ L A+ AARA A AE +A AKA ++ + IV++AL H + R P
Sbjct: 58 STAQLKKTQAEQAARAKA-----AAEAQAKAKANRDALTQRLKDIVNEALRH--NASRTP 110

Query: 290 ADRPLVALMPMSMRSQA 306
+ L +M+++
Sbjct: 111 SATELAHANNAAMQAED 127


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_2600  DHBDHDRGNASE  81     3e-20    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 81.3 bits (200), Expect = 3e-20
Identities = 50/194 (25%), Positives = 87/194 (44%), Gaps = 1/194 (0%)

Query: 9 NTSDLAGRVVAITGAGSGIGRELALLCAQRGADLALCDINDTAVADTAQTARGFGHDVIT 68
N + G++ ITGA GIG +A A +GA +A D N + + +
Sbjct: 2 NAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEA 61

Query: 69 RRVDVSDPEQMTAFADATLGHFGGVDLLVNNAGVGLIGGFLDTSRKDWDWLVSINVMGVV 128
DV D + G +D+LVN AGV G S ++W+ S+N GV
Sbjct: 62 FPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVF 121

Query: 129 HGCEAFLPAMIESGRGGHVVNLSSAAGLLANSALSAYSATKFAVLGLSEALRIELEPHRI 188
+ + M++ R G +V + S + ++++AY+++K A + ++ L +EL + I
Sbjct: 122 NASRSVSKYMMDR-RSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNI 180

Query: 189 GVTAICPGVINTAI 202
+ PG T +
Sbjct: 181 RCNIVSPGSTETDM 194


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2604  TCRTETA  52     2e-09    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 52.1 bits (125), Expect = 2e-09
Identities = 42/184 (22%), Positives = 72/184 (39%), Gaps = 4/184 (2%)

Query: 2 RVIVLLALVVGLEGASNGTIGALAVALKQAFGITNLQV---GLLVTASTAIGIVVMLVSG 58
R ++++ V L+ G I + L + +N G+L+ + V G
Sbjct: 5 RPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLG 64

Query: 59 TLADRVNRTRVLWITVLIWSVAMALGGISAGYGWLLASRVALGVVVAVGGPVVASLMGDF 118
L+DR R VL +++ +V A+ + L R+ G+ A G V + + D
Sbjct: 65 ALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATG-AVAGAYIADI 123

Query: 119 FAQHERGRIYGFVLAGEGICTALGVLVSGWLAAITWRLSFLWLAVAGLLLTLALARTVPE 178
ER R +GF+ A G G ++ G + + F A L L +PE
Sbjct: 124 TDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPE 183

Query: 179 PARG 182
+G
Sbjct: 184 SHKG 187


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
MMAR_2605  RTXTOXINA  33     0.006    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 32.6 bits (74), Expect = 0.006
Identities = 17/60 (28%), Positives = 25/60 (41%), Gaps = 6/60 (10%)

Query: 170 TDLLVIMGANPAASQGSLLAAP------DVMGLIDAIRQRGKVIVIDPVRTVTAARADEW 223
T L + AA+ SL+ AP V G+I I + K + + V + A EW
Sbjct: 373 TVLASVSSGISAAATTSLVGAPVSALVGAVTGIISGILEASKQAMFEHVASKMADVIAEW 432


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_2607  SALSPVBPROT  27     0.019    Salmonella virulence plasmid 65kDa B protein signature.
		>SALSPVBPROT#Salmonella virulence plasmid 65kDa B protein signature.

Length = 591

Score = 27.0 bits (59), Expect = 0.019
Identities = 9/14 (64%), Positives = 10/14 (71%)

Query: 83 APLPEDYPPPPPPP 96
AP+ PPPPPPP
Sbjct: 360 APVNNMMPPPPPPP 373


101  MMAR_2651  MMAR_2664  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_2651  5       22       4.201315   transcriptional regulatory protein
MMAR_2654  4       19       4.091240   cytochrome P450 144A4 Cyp144A4
MMAR_2655  4       20       4.243367   transcriptional regulatory protein
MMAR_2656  5       20       4.507352   PE-PGRS family protein
MMAR_2657  0       12       -0.229454  isochorismatase family protein
MMAR_2659  -1      14       -0.247053  hypothetical protein
MMAR_2660  -1      10       -0.701963  hypothetical protein
MMAR_2661  -1      11       -1.538543  hypothetical protein
MMAR_2662  -1      11       -1.389557  hypothetical protein
MMAR_2663  -1      11       -1.594386  4-alpha-glucanotransferase
MMAR_2664  -2      13       -2.146575  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2651  HTHTETR  45     3e-08    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 45.0 bits (106), Expect = 3e-08
Identities = 21/188 (11%), Positives = 60/188 (31%), Gaps = 20/188 (10%)

Query: 11 DRRRAAADRIYDAATDLIAHEGINQLDIDRLATLVHCSRATVYRYVGGKNDIRNVVVKRA 70
+ I D A L + +G++ + +A +R +Y + K+D+ + + + +
Sbjct: 7 QEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELS 66

Query: 71 AARIADSVRSAVENLSG------RERVVAAI-----------ILSVQRIRADPLGQLMIS 113
+ I + G RE ++ + ++ + + + +G++ +
Sbjct: 67 ESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVV 126

Query: 114 SIHGGTQEVAWLADSPLLAGVASDLTGL-AGGDPHAAKWVVRIVLSLMY--WPAESEDVE 170
+ + L A A ++R +S + W + +
Sbjct: 127 QQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFD 186

Query: 171 RLMVEKFV 178
+
Sbjct: 187 LKKEARDY 194


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2655  HTHTETR  47     7e-09    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 47.3 bits (112), Expect = 7e-09
Identities = 20/131 (15%), Positives = 42/131 (32%), Gaps = 3/131 (2%)

Query: 10 DRSAVAAELIYDAAAELIASDGLSAFDIDKLAARVHCSRATIYRYAGGKAKIRDVVIARA 69
+ + I D A L + G+S+ + ++A +R IY + K+ + + +
Sbjct: 7 QEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELS 66

Query: 70 AARIVESVRAQAESLTG--AERVVASVEFALAGVRSDPLGRHLVGSFPKSANGA-EWFVG 126
+ I E G + + L ++ R L+ E V
Sbjct: 67 ESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVV 126

Query: 127 SKLVANFAADL 137
+ N +
Sbjct: 127 QQAQRNLCLES 137


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2656  cloacin  39     2e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.9 bits (90), Expect = 2e-04
Identities = 28/80 (35%), Positives = 35/80 (43%)

Query: 1202 GNGGNGGTGGTGSTGTAGSSDVMGANGGAGGSGWAGGDGGAGGMGGTLAGHGGDGGDGGS 1261
G G N G T G + + G + GSGW+ + GG G+ GG G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 1262 GGTGGTGGRGGNGFNGSTKA 1281
GG G +GG G G N S A
Sbjct: 66 GGNGNSGGGSGTGGNLSAVA 85



Score = 35.1 bits (80), Expect = 0.003
Identities = 28/102 (27%), Positives = 34/102 (33%)

Query: 1404 VAGGAGGAGGDGGLYGDGGDGGSGGNGGAGKAGAAGVSAGSNGEAGGQAGAGGVGGAGGN 1463
++GG G G G G G G + G S G G+ GG+
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 1464 GGANAGNGGTGGNGGDGGVGGTGGAGKVGTTGPAGGAGGEGG 1505
G N G G G G G + A V PA G GG
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 34.3 bits (78), Expect = 0.005
Identities = 33/99 (33%), Positives = 37/99 (37%)

Query: 171 GNGGAGGAGGISAAGNGGSGGVGGRGGLVYGSGGAGGAGGQGALSGGAGGAGGGAWLWGA 230
G G GA S NGG G+G GG GSG + G SG GGG+
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 231 GGAGGSGGEGLASAGGVGGAGGNAGLIGTGGLGGAGGVG 269
GG G SGG A A GAGG+
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 34.3 bits (78), Expect = 0.006
Identities = 27/101 (26%), Positives = 34/101 (33%)

Query: 1444 SNGEAGGQAGAGGVGGAGGNGGANAGNGGTGGNGGDGGVGGTGGAGKVGTTGPAGGAGGE 1503
S G+ G NGG G G + G G G +G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1504 GGDGGKGGTGGRGGNGGAGGTAQAAGYSDGSQGVGGDGGAG 1544
G+GG G G G G +A AA + G + G G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 34.3 bits (78), Expect = 0.007
Identities = 28/82 (34%), Positives = 35/82 (42%)

Query: 1511 GTGGRGGNGGAGGTAQAAGYSDGSQGVGGDGGAGGTGGTAGNGGKGGAGTWAVNNGIGGK 1570
G GRG N GA T+ GVGG G + N GG+G+ G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1571 GGNGGNAGTGGTGGSFGTGSQI 1592
G GGN +GG G+ G S +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAV 84



Score = 33.5 bits (76), Expect = 0.010
Identities = 28/78 (35%), Positives = 33/78 (42%)

Query: 286 GRGGTGGVGGASDGGNGGAGGDGGVGGGLFGSGGAGGSGGAGGVLGTGGDGGSGGAAAGL 345
GRG G S NGG G G GG GSG + + GG G+G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 346 WGAGGSGGAGGNGADGIS 363
G G SGG G G + +
Sbjct: 66 GGNGNSGGGSGTGGNLSA 83



Score = 33.1 bits (75), Expect = 0.013
Identities = 26/81 (32%), Positives = 34/81 (41%)

Query: 1170 NGGDGGNGGSGTTGTTGSKGGAGGAGGDGGRYGNGGNGGTGGTGSTGTAGSSDVMGANGG 1229
+GGDG +G T+G+ G G GG +G + G +GS G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1230 AGGSGWAGGDGGAGGMGGTLA 1250
G G G GG G GG L+
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLS 82



Score = 33.1 bits (75), Expect = 0.014
Identities = 32/117 (27%), Positives = 42/117 (35%)

Query: 1236 AGGDGGAGGMGGTLAGHGGDGGDGGSGGTGGTGGRGGNGFNGSTKAGLNGGDAGDGGAGG 1295
+GGDG G +GG G G GG G + G +G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1296 VGGAGGNGGAAGLAQAAGFSDGIQGAGGAGGDGGAGGGAGDGGDGANAAAGSGAVGG 1352
G GGNG + G + G + G + GAG +A A S A+
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 32.8 bits (74), Expect = 0.016
Identities = 34/113 (30%), Positives = 45/113 (39%), Gaps = 5/113 (4%)

Query: 1298 GAGGNGGAAGLAQAAGFSDGIQGAGGAGGDGGAGGGAGD----GGDGANAAAGSGAVGGN 1353
G G G G +G +G G GG G G G G+ + G G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1354 GGDGGDPGLGGGGGAGGTGATTGAHGADGLSP-TTGGNGGKGGNGGSGAIGVA 1405
G GG+ GGG G GG + A A G +T G GG + +GA+ A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 32.4 bits (73), Expect = 0.021
Identities = 39/104 (37%), Positives = 43/104 (41%), Gaps = 9/104 (8%)

Query: 270 GQNGGAGGDGGNAPLLGRGGTGGVGGASDGG--------NGGAGGDGGVGGGLFGSGGAG 321
G N GA GN G G G GGASDG GG G G GG G G G
Sbjct: 8 GHNTGAHSTSGNING-GPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGG 66

Query: 322 GSGGAGGVLGTGGDGGSGGAAAGLWGAGGSGGAGGNGADGISGG 365
G+G +GG GTGG+ + A S G A IS G
Sbjct: 67 GNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 32.4 bits (73), Expect = 0.026
Identities = 26/76 (34%), Positives = 33/76 (43%), Gaps = 2/76 (2%)

Query: 1673 GAGGAGGNGGTSRGDGGAGGAGGTGGVGGSGGDGA--DGTSGLFGGADGTAGGAGGDGGD 1730
G G G N G G G GVGG DG+ + +GG G+ GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1731 GGAGGAGGAGGKAVSG 1746
G GG G +GG + +G
Sbjct: 63 GNGGGNGNSGGGSGTG 78



Score = 31.6 bits (71), Expect = 0.036
Identities = 30/84 (35%), Positives = 32/84 (38%)

Query: 1870 AGGAGGAGGTGSTQGSAGTTGAWRAGGDGGSGGDGGDGFGLWNPGEGGRGGSGGDGGTGG 1929
+GG G TG+ S G G GG DG NP GG G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1930 DGGDGGNGRVEIWEGRGGNGGNGA 1953
G GGNG G GGN A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVA 85



Score = 31.6 bits (71), Expect = 0.040
Identities = 30/101 (29%), Positives = 35/101 (34%), Gaps = 5/101 (4%)

Query: 1750 NGSQGAGGNGGAAGDGGDGGNGGNGHDGNNGSVPSGGTDRDGGDGQGGGDGGAGGAGGAG 1809
+G G G N GA G+ G G V G +D G + GG G+G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTG-----LGVGGGASDGSGWSSENNPWGGGSGSGIHW 56

Query: 1810 GNGGAAGAGGGGTRGAGGDGGDGGNGGFAGDGGLGMDGLDA 1850
G G G GGG GG G G A G L
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALST 97


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_2657  ISCHRISMTASE  34     2e-04    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 34.2 bits (78), Expect = 2e-04
Identities = 23/100 (23%), Positives = 37/100 (37%)

Query: 68 LTRGEPGWEIIPEMEPLPGEMVVDKLGKGSFYATDLELILTTRRITHLIFTGIATDVCVH 127
L G +II E+ P ++V+ K +F T+L ++ LI TGI +
Sbjct: 99 LNSGPYEEKIITELAPEDDDLVLTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCL 158

Query: 128 TTMREANDRGYECLLLSDCTGATDYANHLAALKMITMQGG 167
T EA + + D H AL+ +
Sbjct: 159 VTACEAFMEDIKAFFVGDAVADFSLEKHQMALEYAAGRCA 198


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_2660  IGASERPTASE  43     3e-06    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 43.1 bits (101), Expect = 3e-06
Identities = 41/232 (17%), Positives = 69/232 (29%), Gaps = 21/232 (9%)

Query: 309 DRHAAARAQRERAELDADTAIAVKRAEVRQAAEIMWAEHQLNQTRMAIEAQAEIDRAQQR 368
+ AR A A AE Q ++T E A AQ R
Sbjct: 1013 NNEEIARVDEA---PVPPPAPATPSETTETVAE---NSKQESKTVEKNEQDATETTAQNR 1066

Query: 369 RRVIEA-LELPVHASSQRTDEPVEEDMYLPIAAEAEAAASRAVAELPA-GAAKADTDTTH 426
EA + + + + E E + ++ A + AK +T+ T
Sbjct: 1067 EVAKEAKSNVKANTQTNEVAQSGSE------TKETQTTETKETATVEKEEKAKVETEKTQ 1120

Query: 427 HLPA-QVESSPTVERHEQDRAAPLIPSIPDATKAAARWIRPLVPPFVARVIDNTTQPIRS 485
+P + SP E+ E + D T +T QP +
Sbjct: 1121 EVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTT------ADTEQPAKE 1174

Query: 486 ARQVFEEVEEIAFSFKRTRKVTVNTESSDDHREQPAPQSAGADAPAPVNRIA 537
E+ + + V N E++ QP S ++ P +R +
Sbjct: 1175 TSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRS 1226



Score = 39.3 bits (91), Expect = 5e-05
Identities = 36/200 (18%), Positives = 68/200 (34%), Gaps = 11/200 (5%)

Query: 364 RAQQRRRVIEALELPVHASSQRTDEPVEEDMYLPIAAEAEAAASRAVAELPAGAAKADTD 423
++R + ++ + + Q D P P E A A PA A ++T
Sbjct: 984 EVEKRNQTVDTTNITTPNNIQ-ADVPSV-----PSNNEEIARVDEAPVPPPAPATPSETT 1037

Query: 424 TTHHLPAQVESSPTVERHEQDRAAPLIPSIPDATKAAARWIRPLVPPFVARVIDNTTQPI 483
T ++ ES TVE++EQD + A +A + VA+ T +
Sbjct: 1038 ETVAENSKQESK-TVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQ 1096

Query: 484 RSARQVFEEVEEIAFSFKRTRKVTVNTESSDDHREQPAPQSAGADAPAPVNRIASSRGDA 543
+ + VE+ ++ + T T+ Q +P+ ++ P A
Sbjct: 1097 TTETKETATVEKE----EKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPT 1152

Query: 544 ESGSWAEDGEGLETHQGQPS 563
+ + QP+
Sbjct: 1153 VNIKEPQSQTNTTADTEQPA 1172


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_2664  SOPEPROTEIN  32     0.004    Salmonella type III secretion SopE effector protein ...
		>SOPEPROTEIN#Salmonella type III secretion SopE effector protein signature.

Length = 239

Score = 32.0 bits (72), Expect = 0.004
Identities = 19/61 (31%), Positives = 28/61 (45%), Gaps = 2/61 (3%)

Query: 293 QQYSLVLTDGVQTLPPL--VAQILQNAGRPGNTKPVTVQPSSLAKMPVVNRLDLSAYPDD 350
Q +L+++ G+ P L + + +NAG PG TK PS P + L SA
Sbjct: 124 QCCNLLISKGINIAPFLQEIGEAAKNAGLPGTTKNDVFTPSGAGANPFITPLISSANSKY 183

Query: 351 P 351
P
Sbjct: 184 P 184


102  MMAR_2737  MMAR_2744  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_2737  -1      13       0.119284   alanine and proline rich secreted protein Apa
MMAR_2738  -1      10       -0.658941  hypothetical protein
MMAR_2739  -1      10       -0.561126  alcohol dehydrogenase AdhA
MMAR_2740  -3      10       -0.111517  hypothetical protein
MMAR_2741  -3      11       0.186586   hypothetical protein
MMAR_2742  -3      10       0.387807   putative phosphoketolase
MMAR_2743  -2      10       0.337380   short chain dehydrogenase
MMAR_2744  -2      9        0.455171   acetyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
MMAR_2737  PERTACTIN  37     7e-05    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 37.4 bits (86), Expect = 7e-05
Identities = 22/57 (38%), Positives = 23/57 (40%)

Query: 55 ADPNAAPPPADPNAPPPPPADPNAPPPPPADPNAPPPPPADPNAPPPPVVDPNAPEP 111
A+ N APP P P P P P PP PP P P PP P AP P
Sbjct: 554 ANGNGQWSLVGAKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAP 610



Score = 33.9 bits (77), Expect = 0.001
Identities = 22/48 (45%), Positives = 22/48 (45%), Gaps = 1/48 (2%)

Query: 37 ASADPAPAPAPSTTAAPPADPNAAPPPADPNAPPPPPA-DPNAPPPPP 83
A A PAP PAP P P P P P PP PP P AP P P
Sbjct: 565 AKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQP 612



Score = 30.1 bits (67), Expect = 0.014
Identities = 17/51 (33%), Positives = 19/51 (37%)

Query: 31 IALPATASADPAPAPAPSTTAAPPADPNAAPPPADPNAPPPPPADPNAPPP 81
+ A + PAP P P PP P PP P P P P PP
Sbjct: 563 VGAKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPP 613


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_2738  SECYTRNLCASE  26     0.030    Preprotein translocase SecY subunit signature.
		>SECYTRNLCASE#Preprotein translocase SecY subunit signature.

Length = 437

Score = 25.9 bits (57), Expect = 0.030
Identities = 11/54 (20%), Positives = 22/54 (40%)

Query: 40 KGGGSGILMNIVIGVVGALIGGFLLSFFVDTAAGGWWFTLFTAILGSVILLWIV 93
+G G+G+ + + I + T AGGW +G +++ +V
Sbjct: 184 RGIGNGMSILMFISIAATFPSALWAIKKQGTLAGGWIEFGTVIAVGLIMVALVV 237


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_2743  DHBDHDRGNASE  98     3e-26    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 97.8 bits (243), Expect = 3e-26
Identities = 66/236 (27%), Positives = 108/236 (45%), Gaps = 23/236 (9%)

Query: 11 VRDKVIVITGGARGIGLATATALHKLGAKVAIGDVDEPAVKEAGADLGLEVYG----KLD 66
+ K+ ITG A+GIG A A L GA +A D + +++ + L E D
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 67 VTDPNSFSDFLDQVERQLGPLDVLVNNAGIMPVGRIVDEPDSVTRRILDINVYGVMVGSK 126
V D + + ++ER++GP+D+LVN AG++ G I D +N GV S+
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 127 LAAQRMVPRGRGHVINVASLAGEIYVVGLATYCASKHAVIAFTDAARIEYRSTGVKFSMV 186
++ M+ R G ++ V S + +A Y +SK A + FT +E ++ ++V
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 187 LPTFVNTELA--------SGTPGMKGF-----------KNAEPSDIADAIVALVAN 223
P T++ +KG K A+PSDIADA++ LV+
Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSG 241


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_2744  SACTRNSFRASE  38     5e-06    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 38.4 bits (89), Expect = 5e-06
Identities = 14/56 (25%), Positives = 21/56 (37%)

Query: 104 IYVDPEHVCTGVGRLLMTAARERLRRVGVTAAVLWVLDGNARARRFYERDGWNFDG 159
I V ++ GVG L+ A E + +L D N A FY + +
Sbjct: 95 IAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGA 150


103  MMAR_2827  MMAR_2836  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_2827  1       14       -2.092409  lipoprotein
MMAR_2828  2       16       -2.156125  lipase LipD
MMAR_2829  2       16       -2.065946  hypothetical protein
MMAR_2830  1       14       -2.078574  hypothetical protein
MMAR_2831  -1      10       -0.990400  transcriptional regulator
MMAR_2832  -1      10       -0.976933  hypothetical protein
MMAR_2833  -2      11       -0.472232  TetR family transcriptional regulator
MMAR_2834  -1      11       -0.637103  hypothetical protein
MMAR_2835  -2      9        -0.983232  putative fatty-acid--CoA ligase
MMAR_2836  -3      10       -1.036813  phosphotransferase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_2827  BLACTAMASEA  34     0.001    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 33.6 bits (77), Expect = 0.001
Identities = 21/83 (25%), Positives = 36/83 (43%), Gaps = 12/83 (14%)

Query: 136 GVLGIADLATNKKVTK---DTVFDIGSVSKQFTATAVLLLINEGRLTLDDPLAHYVPDLP 192
G++ + DLA+ + +T D F + S K AVL ++ G L+ + + DL
Sbjct: 41 GMIEM-DLASGRTLTAWRADERFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQQDLV 99

Query: 193 DWS--------SAVTVAQLMHHT 207
D+S +TV +L
Sbjct: 100 DYSPVSEKHLADGMTVGELCAAA 122


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2830  HTHTETR  50     8e-09    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 50.0 bits (119), Expect = 8e-09
Identities = 29/188 (15%), Positives = 55/188 (29%), Gaps = 17/188 (9%)

Query: 4 ERLLRAAADHLS--GRPNATLDEIGAAAGVSHSTLYRHFDGRTALLEALDHAAIEQMRDA 61
+ +L A S G + +L EI AAGV+ +Y HF ++ L + + + +
Sbjct: 14 QHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGEL 73

Query: 62 L-----KTSHWQEYSPIDALRILVAACEPVAGYLTLRYVQGQS---FETRKSVAEWR--- 110
K + L ++ + L + V + +
Sbjct: 74 ELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNL 133

Query: 111 --EINSEIEEIFLRGQRAGEFRTDVTADWLTEAFFSLVSG--AGWSVQHGRVARREFTHM 166
E IE+ A D+ +SG W ++
Sbjct: 134 CLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKKEARD 193

Query: 167 ITALFLDG 174
A+ L+
Sbjct: 194 YVAILLEM 201


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2831  HTHTETR  47     6e-09    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 46.9 bits (111), Expect = 6e-09
Identities = 31/192 (16%), Positives = 53/192 (27%), Gaps = 17/192 (8%)

Query: 4 DRDRILREAAECLGKRPT--ATQDEIAAAVGVSRATLHRHFAKRGALLEALDRLAISQLG 61
R IL A ++ + EIA A GV+R ++ HF + L + L+ S +G
Sbjct: 12 TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIG 71

Query: 62 EAM--TISRCQEGTAAEALQRLVAACRPVSGYLRLLYIRAQDFESDQLTEGWAEIDAQLR 119
E ++ + + L+ R + F + A + R
Sbjct: 72 ELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQR 131

Query: 120 QLFL-----------RGQRSGEFRRDLPTLWLNQAFFSLVAG--AGRSANTGRIARSDFT 166
L L + DL T ++G
Sbjct: 132 NLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKKEA 191

Query: 167 GMVTELLLRGAR 178
+LL
Sbjct: 192 RDYVAILLEMYL 203


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2833  HTHTETR  57     4e-12    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 56.6 bits (136), Expect = 4e-12
Identities = 28/188 (14%), Positives = 55/188 (29%), Gaps = 17/188 (9%)

Query: 4 ERLLRAAADHLSTRP--NATLDEIGAAAGVSHSTLYRHFDGRTALLEALDHAAIEQMRDA 61
+ +L A S + + +L EI AAGV+ +Y HF ++ L + + + +
Sbjct: 14 QHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGEL 73

Query: 62 L-----KTSHWQEYSPIDALRILVAACEPVAGYLTLRYVQGQS---FETRKSVAEWR--- 110
K + L ++ + L + V + +
Sbjct: 74 ELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNL 133

Query: 111 --EINSEIEEIFLRGQRAGEFRTDVTADWLTEAFFSLVSG--AGWSVQHGRVARREFTHM 166
E IE+ A D+ +SG W ++
Sbjct: 134 CLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKKEARD 193

Query: 167 ITALFLDG 174
A+ L+
Sbjct: 194 YVAILLEM 201


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_2836  SACTRNSFRASE  35     6e-04    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 34.5 bits (79), Expect = 6e-04
Identities = 15/54 (27%), Positives = 24/54 (44%), Gaps = 2/54 (3%)

Query: 251 VRPQFQGRGVGSALMRRVEATLFERHAA-IRL-TTDGSSRAAGFYRKLGWSAAG 302
V ++ +GVG+AL+ + E H + L T D + A FY K +
Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGA 150


104  MMAR_2964  MMAR_2970  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias   Product
MMAR_2964  -1      8        2.131537  transmembrane transport protein
MMAR_2965  3       13       2.827766  ATPase/kinase, NadR
MMAR_2966  3       12       2.834270  integral membrane drug efflux protein
MMAR_2967  3       12       3.116744  hypothetical protein
MMAR_2968  3       11       2.782523  hypothetical protein
MMAR_2969  3       10       3.005418  PE-PGRS family protein
MMAR_2970  0       10       1.358603  PE-PGRS family protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2964  TCRTETB  136    4e-36    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 136 bits (344), Expect = 4e-36
Identities = 84/414 (20%), Positives = 169/414 (40%), Gaps = 20/414 (4%)

Query: 44 VLLVAAFGAFLAFLDSTIVNVAFPDIQRYFHSGISDLSWVLNAYNIVFAAFLVAAGKLAD 103
+L+ +F + L+ ++NV+ PDI F+ + +WV A+ + F+ GKL+D
Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74

Query: 104 LLGRKRLFVYGVVLFTIASGLCAAADS-VEQLVAFRVLQGIGAAVLVPASLGLVVESFPA 162
LG KRL ++G+++ S + S L+ R +QG GAA + +V P
Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134

Query: 163 ERRAHGVNLWGAAGAIAAGLGPPIGGALVEALNWRWVFLVNLPLGIVAVLAARRALVESR 222
E R L G+ A+ G+GP IGG + ++W ++ L+ + + I+ V + L +
Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPM-ITIITVPFLMKLLKKE- 192

Query: 223 ACGRRRVP-DVRGAAMLATALGLLTLGLIKGPDWGWSSLPAIGSLVAAALAMIGFVMSSR 281
R + D++G +++ + L ++ +I L+ + L+ + FV R
Sbjct: 193 --VRIKGHFDIKGIILMSVGIVFFMLF---------TTSYSISFLIVSVLSFLIFVKHIR 241

Query: 282 NHPTPLVEPALLRIRSFVAGSALTAIASAGFYAYLLTHVLFLNYVWGYTLLQAGLAVC-P 340
P V+P L + F+ G I ++ + V + + G + P
Sbjct: 242 KVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFP 301

Query: 341 AAIIAAVTAGLLGRVADRHGYRVIIGVGALIWAGSLLWYLTCVGTTPNFLGEWLPGQILQ 400
+ + + G + DR G ++ +G + S L ++ I+
Sbjct: 302 GTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFL----TASFLLETTSWFMTIIIVF 357

Query: 401 GIGVGAAFPLLGSAALAGLASGSSYATASAVTGTIRQVGAVIGVALLVILVGTP 454
+G + + S ++ ++ + G+A++ L+ P
Sbjct: 358 VLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIP 411


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2966  TCRTETB  149    2e-41    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 149 bits (377), Expect = 2e-41
Identities = 79/413 (19%), Positives = 166/413 (40%), Gaps = 19/413 (4%)

Query: 39 VCVLGSIMTMVDTSVVTVAQRTFVDTFGSTQAVVAWTITGYTLALAAVVPLAGWAADRFG 98
+C+L S ++++ V+ V+ + F A W T + L + + G +D+ G
Sbjct: 19 LCIL-SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLG 77

Query: 99 TKRMFMGSILVFTLSSLLCAIAPNIA-LLIASRVVQGLGGGMLAPLALTIVNREAGPKRV 157
KR+ + I++ S++ + + LLI +R +QG G L + +V R +
Sbjct: 78 IKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENR 137

Query: 158 GRVMAVLGIPGVLAPAFGPALGGWLIDSYSWQWIFWVNLPVGVVAVGLAAVVFPRDTPAP 217
G+ ++G + GPA+GG + W ++ +P+ + + +
Sbjct: 138 GKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLL--LIPMITIITVPFLMKLLKKEVRI 195

Query: 218 SETFDVVGMLLLSPGLPAFLYGMSEIPIYGTVADRHVWVPAGIGIALIVGFMFHALYRAD 277
FD+ G++L+S G+ F+ + + + + F+ H +
Sbjct: 196 KGHFDIKGIILMSVGIVFFMLF----------TTSYSISFLIVSVLSFLIFVKHIR-KVT 244

Query: 278 KPLIDLRLLTNRALTLANVAMFLYIVSTFGAGVLFPSYFQQLLDHTPLQAGMS-LLPRGI 336
P +D L N + + + + G + P + + + + G + P +
Sbjct: 245 DPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTM 304

Query: 337 GAALAVPLAGALVDRRGARGVLVIGVTLIATGMGVFAFGVATQRDYLPMLLIGLTILGMG 396
+ + G LVDRRG VL IGVT ++ +F + T + I + + G
Sbjct: 305 SVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETT---SWFMTIIIVFVLGG 361

Query: 397 MGCTRMPLVAVAMQSLAPNQIARGSTLIKVNQQMAAAVGTALMSVILTSQLNN 449
+ T+ + + SL + G +L+ ++ G A++ +L+ L +
Sbjct: 362 LSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLD 414


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2969  cloacin  38     1e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 37.8 bits (87), Expect = 1e-04
Identities = 38/120 (31%), Positives = 40/120 (33%), Gaps = 11/120 (9%)

Query: 168 GSGGAGGAGGIDGGGGAGTGGTGGRGGLIFGDAGAGGQGGLGFAPNPNGGGGGAGGTGGA 227
G G G G G GG G G GA G NP GGG G+G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGV----GGGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 228 GGLFGAGGPGGNGAPSVSGGGSGDGGDGGRGGVFGPGGRGGDGAPNPGGGSPGSGGNGGA 287
G G GG GN G G G G V P G PG G + GA
Sbjct: 59 GSGHGNGGGNGN-------SGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 37.0 bits (85), Expect = 3e-04
Identities = 27/81 (33%), Positives = 33/81 (40%)

Query: 237 GGNGAPSVSGGGSGDGGDGGRGGVFGPGGRGGDGAPNPGGGSPGSGGNGGAGGLFGAGGA 296
GG+G +G S G G G GG DG+ +P GG+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 297 GGNGAPDVGGGSPGSGGNGGA 317
G G GG G+GGN A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSA 83



Score = 35.8 bits (82), Expect = 6e-04
Identities = 38/109 (34%), Positives = 44/109 (40%), Gaps = 10/109 (9%)

Query: 198 GDAGAGGQGGLGFAPNPNGGGGGAGGTGGAGGLFGAGGPGGNGAPSVSGGGSGDGGDGGR 257
GD G + N NGG G G GGA + G G + + GGGSG G G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGA-----SDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 258 GGVFGPGGRGGDGAPNPGGGSPGSGGNGGAGGLFGAGGAGGNGAPDVGG 306
G G GG G+ GG G+GGN A A G P GG
Sbjct: 59 GSGHGNGGGNGNS-----GGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 35.1 bits (80), Expect = 9e-04
Identities = 27/83 (32%), Positives = 32/83 (38%)

Query: 267 GGDGAPNPGGGSPGSGGNGGAGGLFGAGGAGGNGAPDVGGGSPGSGGNGGAGGLFFGDGG 326
GGDG + G SG G G GG +G+ +P GG+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 327 AGGNGAPNVGGGSPGAGGNGGDA 349
G G N GGGS G A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVA 85



Score = 34.7 bits (79), Expect = 0.001
Identities = 25/72 (34%), Positives = 30/72 (41%)

Query: 124 NGANGAPGTGANGEAGGILFGSGGSGGSGGVGQNGGNGGDAGLFGSGGAGGAGGIDGGGG 183
N + NG G+ G G S GSG +N GG +G G G G GG G
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69

Query: 184 AGTGGTGGRGGL 195
GG+G G L
Sbjct: 70 NSGGGSGTGGNL 81



Score = 32.4 bits (73), Expect = 0.007
Identities = 34/120 (28%), Positives = 47/120 (39%), Gaps = 7/120 (5%)

Query: 479 GVGGAGGAAALSGAGNGGTGGAGGLFFGVGGAGGAAPLFGGGTGGTGGAGGLLFGLGGAG 538
G G GA + SG NGG G +G GGA+ G + GG G+ G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTG-------LGVGGGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 539 GNAPVFGGGSGGTGGRAGLIGIGGAGGSSSVFAGGDGGAGGAGGTFIGFGGAGGDGGVSG 598
G+ GGG+G +GG +G G A + F GAGG + ++
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 32.4 bits (73), Expect = 0.007
Identities = 33/115 (28%), Positives = 40/115 (34%), Gaps = 1/115 (0%)

Query: 430 SAGRSVGTIGSVGGAGGNGGLFGTGGAGGSGGQDGYNYGGNGGAGGLLFGVGGAGGAAAL 489
S G G GN TG G G DG + G G G G +
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 490 SGAGNGGTGGAGGLFFGVGGAGGAAPL-FGGGTGGTGGAGGLLFGLGGAGGNAPV 543
G G G GG G + AAP+ FG T GAGGL + +A +
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAI 116



Score = 32.0 bits (72), Expect = 0.009
Identities = 35/113 (30%), Positives = 39/113 (34%), Gaps = 6/113 (5%)

Query: 397 GGDNQNTNTGPGGVGGAGGDAGLFSGAIGGAGGSAGRSVGTIGSVGGAGGNGGLFGTGGA 456
GGD + NTG G G GGA +G S GG+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 457 GGSGGQDGYNYGGNGGAGGLLFGVGGAGGAAALSGAGNGGTGGAGGLFFGVGG 509
G GG GN G G G A A G T GAGGL +
Sbjct: 63 GNGGG------NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109



Score = 31.2 bits (70), Expect = 0.013
Identities = 32/109 (29%), Positives = 39/109 (35%), Gaps = 5/109 (4%)

Query: 559 GIGGAGGSSSVFAGGDGGAGGAGGTFIGFGGAGGDGGVSGNGGAGGKAGLIGVGGNGGNG 618
G G G+ S +GG G G G+G + GG G G G GNG
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 619 GNGGNGGAGGDAQLIGIGGNGGNGGDGQLGGPGTGGTGGTGGTLLGLNG 667
G GN G G G GGN G T G GG + ++
Sbjct: 66 GGNGNSGGGS-----GTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109



Score = 31.2 bits (70), Expect = 0.013
Identities = 33/103 (32%), Positives = 36/103 (34%), Gaps = 9/103 (8%)

Query: 217 GGGGAGGTGGAGGLFGA--GGPGGNGAPSVSGGGSGDGGDGGRGGVFGPGGRGGDGAPNP 274
GG G G GA G GGP G G V GG S G +G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLG---VGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 275 GGGSPGSGGNGGAGGLFGAGGAGGNGAPDVGG----GSPGSGG 313
G G G GG G AP G +PG+GG
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 30.8 bits (69), Expect = 0.018
Identities = 30/103 (29%), Positives = 40/103 (38%), Gaps = 3/103 (2%)

Query: 296 AGGNGAPDVGGGSPGSGG-NGGAGGLFFGDGGAGGNGAPNVGGGSPGAGGNGGDAGLFGA 354
+GG+G G SG NGG GL G G + G+G + +P GG+G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSEN--NPWGGGSGSGIHWGGG 59

Query: 355 GGAGGRGGNNLANPATDGGAGGAGGNGGAGGLFAGAGGPGGQG 397
G G GGN + + G + F PG G
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 30.1 bits (67), Expect = 0.030
Identities = 32/108 (29%), Positives = 35/108 (32%), Gaps = 11/108 (10%)

Query: 283 GNGGAGGLFGAGGAGGNGAPDVGGGSPGSGGNGGAG----GLFFGDGGAGGNGAPNVGGG 338
G G G GA GN G G G + G+G +G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 339 SPGAGGNGGDAGLFGAGGAGGRGGNNLANPATDGGAGGAGGNGGAGGL 386
G G GG G GGN A A A GAGGL
Sbjct: 63 GNGGGNGNS-------GGGSGTGGNLSAVAAPVAFGFPALSTPGAGGL 103



Score = 29.7 bits (66), Expect = 0.039
Identities = 27/87 (31%), Positives = 32/87 (36%), Gaps = 8/87 (9%)

Query: 571 AGGDGGAGGAGGTFIGFGGAGGDGGVSGNGGAGGKAGLIGVGGNGGNGGNGGNGGAGGDA 630
+GGDG G GG G+ GGA G+G + N GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGAS--------DGSGWSSENNPWGGGSGSG 53

Query: 631 QLIGIGGNGGNGGDGQLGGPGTGGTGG 657
G G GNGG G G+G G
Sbjct: 54 IHWGGGSGHGNGGGNGNSGGGSGTGGN 80


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2970  cloacin  37     2e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 37.0 bits (85), Expect = 2e-04
Identities = 38/103 (36%), Positives = 42/103 (40%), Gaps = 7/103 (6%)

Query: 443 GRGGNGG---TGGWLFGNGGVGGAGGTGADGGGMSTTGGHGGTGGTGGSARLIGAGGAGG 499
GRG N G T G + G G GG +DG G S+ G GGS I GG G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWG----GGSGSGIHWGGGSG 61

Query: 500 EGGAGGIGIDVGESGGGGGLGGTGGTGGVLFGAGGDGGSGGAG 542
G GG G G SG GG L F A G+GG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 33.9 bits (77), Expect = 0.002
Identities = 37/100 (37%), Positives = 45/100 (45%), Gaps = 6/100 (6%)

Query: 349 GTGGAGGNGGNGVEAAAMVGGTGGTGGAGGVG---GWLYGN---GGAGGAGGHGGTHAGL 402
G G G N G + + GG G G GG GW N GG G+G H G +G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 403 GSTGGIGGAGGAAGLIGAGGAGGAGGAGGPSYLTADGAAG 442
G+ GG G +GG +G G A A A G L+ GA G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 33.1 bits (75), Expect = 0.003
Identities = 30/94 (31%), Positives = 36/94 (38%), Gaps = 6/94 (6%)

Query: 515 GGGGLGGTGGTGGVLFGAGGDGGSGGAGGTGVLDADGGNGGGGGNGGTAIVIGNGGSGGA 574
G G G T G + G G GG G + N GGG+G G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 575 GGTGGSGLADGTGGAGGNGGSGGLIGSDGASGTP 608
GG G SG G G GG+ + + A G P
Sbjct: 66 GGNGNSG------GGSGTGGNLSAVAAPVAFGFP 93



Score = 32.8 bits (74), Expect = 0.004
Identities = 39/121 (32%), Positives = 46/121 (38%), Gaps = 10/121 (8%)

Query: 473 MSTTGGHGGTGGTGGSARLIGAGGAGGEGGAGGIGIDVGESGGGGGLGGTGGTGGVLFGA 532
MS G G G ++ I GG G G GG G S GG G+G G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNIN-GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 533 GGDGGSGGAGGTGVLDADGGNGGGGGNGGTAIVIGNG----GSGGAGGTGGSGLADGTGG 588
G G GG G +G GG+G GG A + G + GAGG S A
Sbjct: 60 SGHGNGGGNGNSG-----GGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSA 114

Query: 589 A 589
A
Sbjct: 115 A 115



Score = 32.8 bits (74), Expect = 0.004
Identities = 34/102 (33%), Positives = 37/102 (36%), Gaps = 14/102 (13%)

Query: 254 GDGGAGGTGGDSIGGRANGGDGGAAGLIGVGGTGGTGGDALGTFGQAAGNGGAGGHGGLL 313
GDG TG S G NGG G G GG D G + GG G G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTG-------LGVGGGASDGSGWSSENNPWGGGSGSGIHW 56

Query: 314 IGAGGDGGTGGVGGTNSPGGGPGGDGGSGGNAGLIGTGGAGG 355
G G G GG G GG G+GGN + A G
Sbjct: 57 GGGSGHGNGGG-------NGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 32.8 bits (74), Expect = 0.004
Identities = 34/95 (35%), Positives = 39/95 (41%), Gaps = 5/95 (5%)

Query: 497 AGGEGGAGGIGIDVGESGGGGGLGGTGGTGGVLFGAGGDGGSGGAGGTGVLDADGGNGGG 556
+GG+G G GG G G GG G+G + GG G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 557 GGNGGTAIVIGNGGSGGAGGTGGSGLADGTGGAGG 591
GNGG GNG SGG GTGG+ A A G
Sbjct: 62 HGNGG-----GNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 32.4 bits (73), Expect = 0.006
Identities = 28/74 (37%), Positives = 32/74 (43%), Gaps = 3/74 (4%)

Query: 289 TGGDALGTFGQAAGNGGA--GGHGGLLIGAGGDGGTGGVGGTNSPGGGPGGDGGSGGNAG 346
+GGD G A G GG GL +G G G+G N+P GG G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGW-SSENNPWGGGSGSGIHWGGGS 60

Query: 347 LIGTGGAGGNGGNG 360
G GG GN G G
Sbjct: 61 GHGNGGGNGNSGGG 74



Score = 31.6 bits (71), Expect = 0.009
Identities = 36/114 (31%), Positives = 42/114 (36%), Gaps = 3/114 (2%)

Query: 316 AGGDGGTGGVGGTNSPG---GGPGGDGGSGGNAGLIGTGGAGGNGGNGVEAAAMVGGTGG 372
+GGDG G ++ G GGP G G GG + G G G + GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 373 TGGAGGVGGWLYGNGGAGGAGGHGGTHAGLGSTGGIGGAGGAAGLIGAGGAGGA 426
G GG G G+G G A GAGG A I AG A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 31.6 bits (71), Expect = 0.011
Identities = 25/68 (36%), Positives = 31/68 (45%), Gaps = 1/68 (1%)

Query: 125 GANGTAADPNGEAGGLLYGNGG-DGYSFTSGTTSEAGGAGGAAGLIGNGGAGGAGYLGGI 183
GA+ T+ + NG GL G G DG ++S GG+G G G G G G
Sbjct: 12 GAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNS 71

Query: 184 GGAGGNGG 191
GG G GG
Sbjct: 72 GGGSGTGG 79



Score = 30.8 bits (69), Expect = 0.017
Identities = 30/105 (28%), Positives = 38/105 (36%), Gaps = 5/105 (4%)

Query: 406 GGIGGAGGAAGLIGAGGAGGAGGAGGPSYLTADGAAGGRGGNGGTGGWLFGNGGVGGAGG 465
G GA +G I G G G G GG G+G G G G GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 466 TGADGGGMSTTGGHGGTGGTGGSARLIGAGGAGGEGGAGGIGIDV 510
G GGG T G + +A + A GAGG+ + +
Sbjct: 68 NGNSGGGSGTGGN-----LSAVAAPVAFGFPALSTPGAGGLAVSI 107



Score = 30.8 bits (69), Expect = 0.019
Identities = 32/106 (30%), Positives = 39/106 (36%), Gaps = 1/106 (0%)

Query: 419 GAGGAGGAGGAGGPSYLTADGAAGGRGGNGGTGGWLFGNGGVGGAGGTGADGGGMSTTGG 478
G G GA G G G G + G+G W N GG G+G GG S G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSG-WSSENNPWGGGSGSGIHWGGGSGHGN 64

Query: 479 HGGTGGTGGSARLIGAGGAGGEGGAGGIGIDVGESGGGGGLGGTGG 524
GG G +GG + G A A G GG + + G
Sbjct: 65 GGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 30.5 bits (68), Expect = 0.020
Identities = 33/125 (26%), Positives = 41/125 (32%), Gaps = 6/125 (4%)

Query: 213 GMGGNAGLIGAGGA--GGAGGAGSSLGGAGGTGGAGGRGGWLYGDGGAGGTGGDSIGGRA 270
G G N G G GG G G G + G+G + W G G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPW--GGGSGSGIHWGGGSGHG 63

Query: 271 NGGDGGAAGLIGVGGTGGTGGDALGTFGQAAGNGGAGGHGGLLIGAGGDGGTGGVGGTNS 330
NGG G +G G GTGG G GGL + + + +
Sbjct: 64 NGGGNGNSG--GGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMA 121

Query: 331 PGGGP 335
GP
Sbjct: 122 ALKGP 126



Score = 30.5 bits (68), Expect = 0.023
Identities = 33/104 (31%), Positives = 37/104 (35%), Gaps = 1/104 (0%)

Query: 493 GAGGAGGEGGAGGIGIDV-GESGGGGGLGGTGGTGGVLFGAGGDGGSGGAGGTGVLDADG 551
G G G GA ++ G G G GG G GG G+G +
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 552 GNGGGGGNGGTAIVIGNGGSGGAGGTGGSGLADGTGGAGGNGGS 595
GNGGG GN G G S A A T GAGG S
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVS 106



Score = 30.1 bits (67), Expect = 0.028
Identities = 31/101 (30%), Positives = 38/101 (37%), Gaps = 22/101 (21%)

Query: 144 NGGDGYSFTSGTTSEAGGAGGAAGLIGNGGAGGAGYLGGIGGAGGNGGWLYGNGGAGGAG 203
+GGDG +G S +G G G LG GGA GW
Sbjct: 2 SGGDGRGHNTGAHSTSGNING-----------GPTGLGVGGGASDGSGW----------- 39

Query: 204 GADGNAATGGMGGNAGLIGAGGAGGAGGAGSSLGGAGGTGG 244
++ N GG G G G G GG G+S GG+G G
Sbjct: 40 SSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGN 80


105  MMAR_2981  MMAR_2992  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_2981  -3      10       -1.152531  20-beta-hydroxysteroid dehydrogenase
MMAR_2982  -3      11       -1.344816  hypothetical protein
MMAR_2983  -1      12       -1.876133  hypothetical protein
MMAR_2984  -1      13       -1.850871  TetR family transcriptional regulator
MMAR_2985  -1      13       -2.158731  2,3-dihydroxybiphenyl-1,2-dioxygenase BphC
MMAR_2986  0       13       -2.477489  2-hydroxyhepta-2,4-diene-1,7-dioate isomerase
MMAR_2987  -1      15       -1.862609  3-(3-hydroxyphenyl)propionate hydroxylase
MMAR_2988  0       15       -1.677372  acetyltransferase
MMAR_2989  1       14       -1.740633  hypothetical protein
MMAR_2990  1       14       -1.573443  heat shock protein transcriptional repressor
MMAR_2991  0       14       -2.327011  molecular chaperone
MMAR_2992  1       14       -2.036992  GTP cyclohydrolase I FolE
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_2981  DHBDHDRGNASE  130    4e-39    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 130 bits (329), Expect = 4e-39
Identities = 76/259 (29%), Positives = 118/259 (45%), Gaps = 18/259 (6%)

Query: 2 AGRLTGKVALVSGGARGMGASHVRALVAEGAHVVLGDILDDEGRAVAAELGDAARYVH-- 59
A + GK+A ++G A+G+G + R L ++GAH+ D ++ V + L AR+
Sbjct: 3 AKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAF 62

Query: 60 -LDVTQPEQWTAAVDTAVNEFGGLHVLVNNAGILNIGTIEDYALSEWQRILDINVTGVFL 118
DV E G + +LVN AG+L G I + EW+ +N TGVF
Sbjct: 63 PADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122

Query: 119 GIRAAVKPMKEAGRGSIINISSIEGLAGTIASHGYTVSKFAVRGLTKSTALELGPSGIRV 178
R+ K M + GSI+ + S + Y SK A TK LEL IR
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 179 NSIHPGLVKTPM--TEWVPED-----------LFQTA--LGRAAEPMEVSNLVVYLASDE 223
N + PG +T M + W E+ F+T L + A+P ++++ V++L S +
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 224 SSYSTGAEFVVDGGTVAGL 242
+ + T VDGG G+
Sbjct: 243 AGHITMHNLCVDGGATLGV 261


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2984  HTHTETR  53     1e-10    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 52.7 bits (126), Expect = 1e-10
Identities = 41/199 (20%), Positives = 60/199 (30%), Gaps = 17/199 (8%)

Query: 12 ANRSERRKRRTRTALLRAAQRLIAE-GKLNVPVLEITQAADVGMGSFYNHFDSKEQLFEA 70
A ++++ + TR +L A RL ++ G + + EI +AA V G+ Y HF K LF
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 71 AVADVLDAHGALLDRCTASI-EDPAETFAASFRLTG---------RLFRRRPQESEILLA 120
G L A DP RL +
Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVG 121

Query: 121 NGAALLSSDRGLAPRALRDIKAASDAGRFRVE-----DPELALAMAGGALLGL-GTLLRA 174
A + + R L + I+ A + G + GL L A
Sbjct: 122 EMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFA 181

Query: 175 DPQRDGCAAADAVTEDLLR 193
D A LL
Sbjct: 182 PQSFDLKKEARDYVAILLE 200


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_2987  TYPE3OMOPROT  32     0.005    Type III secretion system outer membrane O protein ...
		>TYPE3OMOPROT#Type III secretion system outer membrane O protein family signature.

Length = 303

Score = 31.9 bits (72), Expect = 0.005
Identities = 19/76 (25%), Positives = 35/76 (46%), Gaps = 9/76 (11%)

Query: 193 QQRWLVVDVATECDLAQWDGVYQVCDPFRAATYMRIGQSRYRWEFQLRPGERTEDFN--- 249
++ WL+ ATEC + + P R ++R+ + RW ++PG+ E +
Sbjct: 10 RREWLLAQTATECQRHGREATLEY--PTRQGMWVRLSDAEKRWSAWIKPGDWLEHVSPAL 67

Query: 250 ---SISA-LRPLIAPW 261
++SA L+ PW
Sbjct: 68 AGAAVSAGAEHLVVPW 83


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_2988  SACTRNSFRASE  35     6e-05    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 34.5 bits (79), Expect = 6e-05
Identities = 21/95 (22%), Positives = 37/95 (38%), Gaps = 10/95 (10%)

Query: 57 ALTVAVAENLPVGFSGIADG-----KLEMLFIDQQFRGRGAGSALLRAALA-----AIPD 106
A + EN +G I +E + + + +R +G G+ALL A+
Sbjct: 66 AAFLYYLENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCG 125

Query: 107 LLVDVNEQNPQAVGFYHRHGFVTFGRSETDGDGRP 141
L+++ + N A FY +H F+ P
Sbjct: 126 LMLETQDINISACHFYAKHHFIIGAVDTMLYSNFP 160


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_2989  IGASERPTASE  26     0.020    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 25.8 bits (56), Expect = 0.020
Identities = 11/60 (18%), Positives = 20/60 (33%)

Query: 18 AKGKAKEVFGAVTGRDDVKREGQAQQDKADAQRDAAKKEAEAEAARRGADVAEERQKANQ 77
+ AKE V Q+ + + Q K+ A E + E+ Q+ +
Sbjct: 1065 NREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPK 1124


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_2992  PF05272  29     0.018    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.3 bits (65), Expect = 0.018
Identities = 10/36 (27%), Positives = 15/36 (41%)

Query: 64 CAVQQLLDALGVDDGEHTAGTPARVARAWREMLWGY 99
+ L+ ALG D G+ + +V E W Y
Sbjct: 798 VTIADLVQALGADPGKSSPMLEGQVRDWLNENGWEY 833


106  MMAR_3098  MMAR_3105  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_3098  2       27       -0.729596  polyketide synthase
MMAR_3099  1       20       -0.767305  polyketide synthase and peptide synthetase
MMAR_3100  0       11       1.941633   integral membrane drug efflux protein, ErmB
MMAR_3101  0       13       2.665198   mercuric reductase
MMAR_3102  0       12       3.020787   hypothetical protein
MMAR_3103  -1      11       2.927241   ATP phosphoribosyltransferase
MMAR_3104  -1      11       2.906071   phosphoribosyl-ATP pyrophosphatase
MMAR_3105  -1      10       2.867481   PE-PGRS family protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3098  DHBDHDRGNASE  52     1e-08    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 51.6 bits (123), Expect = 1e-08
Identities = 38/163 (23%), Positives = 60/163 (36%), Gaps = 9/163 (5%)

Query: 1740 VLITGGTGMVAAALARHLVSSHGVRHLVLVSRRGDAAAGASKLVDELTAAGATVRVVACD 1799
ITG + A+AR L S H+ V + K+V L A D
Sbjct: 11 AFITGAAQGIGEAVARTLASQGA--HIAAVDYNPEKL---EKVVSSLKAEARHAEAFPAD 65

Query: 1800 VADPAAVSRLMNQLPEQCPPLSAVIHAAGTLDDALITSLTPQRVDAVLRAKVDGAWNLHE 1859
V D AA+ + ++ + P+ +++ AG L LI SL+ + +A G +N
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 1860 AT----RDLGLSAFVLCSSIAATLGAPGQANYAAGNAFLDALA 1898
+ D + V S A + A YA+ A
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFT 168


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3099  NUCEPIMERASE  35     0.005    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 34.8 bits (80), Expect = 0.005
Identities = 29/148 (19%), Positives = 45/148 (30%), Gaps = 35/148 (23%)

Query: 2393 TVLITGGTGMVASVLARHLVSSYGVKHVVLASRRADATAGVAEL------------VADL 2440
L+TG G + +++ L+ G+ L + L
Sbjct: 2 KYLVTGAAGFIGFHVSKRLL------------EAGHQVVGIDNLNDYYDVSLKQARLELL 49

Query: 2441 AAAGAAVAVVACDVADRAAVTRLLDHVSTCHPPLTGVIHAAGTLDDAVIASLTPDRVDAV 2500
A G D+ADR +T L V + AV SL A
Sbjct: 50 AQPG--FQFHKIDLADREGMTDLFASGHFER-----VFISPH--RLAVRYSLENPH--AY 98

Query: 2501 LRAKVDGAWNLHEATRHLGLSMFVLCSS 2528
+ + G N+ E RH + + SS
Sbjct: 99 ADSNLTGFLNILEGCRHNKIQHLLYASS 126


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_3100  TCRTETB  119    2e-31    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 119 bits (301), Expect = 2e-31
Identities = 78/408 (19%), Positives = 157/408 (38%), Gaps = 18/408 (4%)

Query: 39 IAAIMANLDISIVTVAQRTFTVAFHSTQATVAWTVAGYMLGMATATPMTGWAADRLGAKR 98
I + + L+ ++ V+ F+ A+ W +ML + T + G +D+LG KR
Sbjct: 21 ILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKR 80

Query: 99 LFMGAVATFTLGSMLCA-SAPNIGLLITFRVVQGIGGGVLGPLVLAIVTHQAGPRRLGRL 157
L + + GS++ LLI R +QG G LV+ +V G+
Sbjct: 81 LLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKA 140

Query: 158 LAVGAIPMLTAPMLGPILGGWLIDSYGWQWIFLINVPAGLLAFGLAAILVPEDPPKPSER 217
+ + +GP +GG + W +L+ +P + + + + +
Sbjct: 141 FGLIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKEVRIKGH 198

Query: 218 FDFIGMLLLLPGIAMLLLGVSAIPGSGTVTDHRVWVPAISGAVLITAFALHAWYRTDHPL 277
FD G++L+ GI +L ++ I + F H + P
Sbjct: 199 FDIKGIILMSVGIVFFMLFTTS----------YSISFLIVSVLSFLIFVKHI-RKVTDPF 247

Query: 278 IDLRLFTDRVVRLANLALLLYVAGAAGASLLLPSYFQQLLHQTPMRSG-LMMVPIGFGAM 336
+D L + + L + AG ++P + + + G +++ P +
Sbjct: 248 VDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVI 307

Query: 337 LTMPLTGAFMDSRGPRKVVLIGLTLIAAGTGTFVFGVANEADYLPTLLAGLTIAGMGLGC 396
+ + G +D RGP V+ IG+T ++ T F + + T++ + GL
Sbjct: 308 IFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASF-LLETTSWFMTIIIVFVL--GGLSF 364

Query: 397 TGLLLAASVMRVLAPHQIARGSALISVNQQISGSIGAALMSMILTNQF 444
T +++ V L + G +L++ +S G A++ +L+
Sbjct: 365 TKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPL 412


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_3105  cloacin  40     3e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 40.1 bits (93), Expect = 3e-05
Identities = 30/86 (34%), Positives = 36/86 (41%), Gaps = 2/86 (2%)

Query: 635 LGGGGGVGGNGGQGGAGGQGITGGSGGAGGSGGNGGTGGDDTENAEPSAGGAGGEGGMGG 694
+ GG G G N G G G +G G G + G+G N P GG+G GG
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENN--PWGGGSGSGIHWGG 58

Query: 695 GGGDGVVGTGGMGGGGGQGGAGGDAI 720
G G G G G GGG G A+
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSAV 84



Score = 36.2 bits (83), Expect = 5e-04
Identities = 30/85 (35%), Positives = 36/85 (42%), Gaps = 1/85 (1%)

Query: 460 GDGKAGGGGGAGGRGGDGGDGDTGGAAGNGGDGGEGGNAVANSGRGGGTGGAGGVGGDGG 519
GDG+ G GA G+ G TG G G G G ++ N GG G GG G
Sbjct: 4 GDGR-GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 520 NGTNGVGGAGGHGGMGGNAGSSDAA 544
G G +GG G GGN + A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAP 87



Score = 36.2 bits (83), Expect = 5e-04
Identities = 39/106 (36%), Positives = 43/106 (40%), Gaps = 10/106 (9%)

Query: 592 GNGGDGGAGGDGGIGTLGPGGDGGAGG-NGGSGGDGNTQPVGHSLGGGGGVGGNGGQGGA 650
G G + GA G GP G G GG + GSG P G G G GG G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGN- 64

Query: 651 GGQGITGGSGGAGGSGGNGGTGGDDTENAEPSAGGAGGEGGMGGGG 696
GG G SGG GTGG+ + A P A G G GG
Sbjct: 65 --------GGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 35.8 bits (82), Expect = 6e-04
Identities = 27/82 (32%), Positives = 34/82 (41%), Gaps = 1/82 (1%)

Query: 469 GAGGRGGDGGDGDTGGAAGNGGDGGEGGNAVANSGRGGGTGGAGGVGGDGGNGTNGVGGA 528
G GRG + G T G NGG G G A+ G G + GG G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNI-NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 529 GGHGGMGGNAGSSDAANGAVGS 550
G+GG GN+G G + +
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSA 83



Score = 35.5 bits (81), Expect = 8e-04
Identities = 35/98 (35%), Positives = 37/98 (37%)

Query: 194 SGGAGGLGAVGGQGGGAGTLSLVGSGGDGGHGGEGTPGGSGGTGGGGGSAGLFGRGGFGG 253
+GG GLG GG G+G S G G G GGSG GGG G G G
Sbjct: 21 NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGN 80

Query: 254 SGGTGGDGVVGSPAALDGGAGGTGGSGGRGGLFAVGGD 291
G PA GAGG S G L A D
Sbjct: 81 LSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 34.3 bits (78), Expect = 0.002
Identities = 24/78 (30%), Positives = 29/78 (37%), Gaps = 2/78 (2%)

Query: 551 RGGDGGAGGKGGQGTTGGDGGNGGSAGKGGTGGSAVQTNDAGNGGDGGAGGDGGIGTLGP 610
RG + GA G G G G G+G S+ N+ GG G GG G
Sbjct: 7 RGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSS--ENNPWGGGSGSGIHWGGGSGHGN 64

Query: 611 GGDGGAGGNGGSGGDGNT 628
GG G G G G +
Sbjct: 65 GGGNGNSGGGSGTGGNLS 82



Score = 33.9 bits (77), Expect = 0.003
Identities = 33/97 (34%), Positives = 39/97 (40%), Gaps = 1/97 (1%)

Query: 139 GNGGNGYDNSASAGVAGGAGGSAGLIG-NGGIGGAGGALAAGGAGGTGGLLFGNGGSGGA 197
G G N +S S + GG G G + G G + GG G+G G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 198 GGLGAVGGQGGGAGTLSLVGSGGDGGHGGEGTPGGSG 234
GG G GG G G LS V + G TPG G
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 33.5 bits (76), Expect = 0.004
Identities = 26/86 (30%), Positives = 31/86 (36%)

Query: 401 AGGHGARGADGLLDGEAGSKGGAGGGGGAGGAGGAGSQAGSDGVGGAGGDGGDGGHGAYG 460
+GG G G GG G G GGA + + G G G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 461 DGKAGGGGGAGGRGGDGGDGDTGGAA 486
G GG G +GG G GG+ A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAP 87



Score = 33.1 bits (75), Expect = 0.004
Identities = 26/76 (34%), Positives = 28/76 (36%)

Query: 706 MGGGGGQGGAGGDAIDSVGGIGGKGGGGADGGGGGASSTSTGGAPGGVGGDGDDGDPPTG 765
M GG G+G G S GG G G GG S S+ P G G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 766 QTGGGGGTGGGGGAPG 781
G GGG G GG G
Sbjct: 61 GHGNGGGNGNSGGGSG 76



Score = 32.4 bits (73), Expect = 0.008
Identities = 22/77 (28%), Positives = 28/77 (36%)

Query: 702 GTGGMGGGGGQGGAGGDAIDSVGGIGGKGGGGADGGGGGASSTSTGGAPGGVGGDGDDGD 761
G G G G G+ G+G GG G ++ GG+ G+ G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 762 PPTGQTGGGGGTGGGGG 778
G G GG G GG
Sbjct: 63 GNGGGNGNSGGGSGTGG 79



Score = 32.0 bits (72), Expect = 0.009
Identities = 27/86 (31%), Positives = 34/86 (39%), Gaps = 1/86 (1%)

Query: 316 AGGNGGAGGAGGVGGTGILSGSDGVGGGGGNGGRGGDGIRGADGVLDGESGYLGGSGGAG 375
+GG+G G +G ++G G G G G G G + G SG GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPT-GLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 376 GAGGAGGGGSQVGSDGVGGAGGTGGA 401
G G GG G+ G G GG A
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 32.0 bits (72), Expect = 0.011
Identities = 32/103 (31%), Positives = 41/103 (39%), Gaps = 4/103 (3%)

Query: 490 GDGGEGGNAVANSGRGGGTGGAGGVGGDGGNGTNGVGGAGGHGGMGGNAGSSDAANGAVG 549
G G G N A+S G GG G+G GG ++G G + + GG +GS G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGG-ASDGSGWSSENNPWGGGSGSGI---HWGG 58

Query: 550 SRGGDGGAGGKGGQGTTGGDGGNGGSAGKGGTGGSAVQTNDAG 592
G G G G +G G A G A+ T AG
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAG 101



Score = 30.8 bits (69), Expect = 0.026
Identities = 22/78 (28%), Positives = 26/78 (33%)

Query: 527 GAGGHGGMGGNAGSSDAANGAVGSRGGDGGAGGKGGQGTTGGDGGNGGSAGKGGTGGSAV 586
G G G G +S NG G GGA G + G G +G GGS
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 587 QTNDAGNGGDGGAGGDGG 604
GG+G G
Sbjct: 63 GNGGGNGNSGGGSGTGGN 80



Score = 30.1 bits (67), Expect = 0.036
Identities = 31/108 (28%), Positives = 37/108 (34%), Gaps = 6/108 (5%)

Query: 510 GAGGVGGDGGNGTNGVGGAGGHGGMGGNAGSSDAANGAVGSRGGDGGAGGKGGQGTTGGD 569
G G G + G + GG G+G G+SD GS GG G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASD------GSGWSSENNPWGGGSGSGIHW 56

Query: 570 GGNGGSAGKGGTGGSAVQTNDAGNGGDGGAGGDGGIGTLGPGGDGGAG 617
GG G GG G S + GN A G L G GG
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 29.7 bits (66), Expect = 0.047
Identities = 25/79 (31%), Positives = 32/79 (40%), Gaps = 3/79 (3%)

Query: 275 GTGGSGGRGGLFAVGGD--GGHGGIGGNGARGADGVGDGESGTAGGNGGAGGAGGVGGTG 332
G G G G + G+ GG G+G G +DG G G G G GG+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGG-ASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 333 ILSGSDGVGGGGGNGGRGG 351
+G GGG+G G
Sbjct: 62 HGNGGGNGNSGGGSGTGGN 80


107  MMAR_3194  MMAR_3201  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_3194  1       14       2.642172   FtsW-like protein FtsW
MMAR_3195  0       12       2.494625   UDP-N-acetylmuramoyl-L-alanyl-D-glutamate
MMAR_3196  1       12       2.842446   phospho-N-acetylmuramoyl-pentapeptide-
MMAR_3197  0       12       3.206278   UDP-N-acetylmuramoylalanyl-D-glutamyl-2,6-
MMAR_3198  1       12       2.747491   UDP-N-acetylmuramoylalanyl-D-glutamate--2,
MMAR_3199  2       13       2.520958   PE-PGRS family protein
MMAR_3200  0       11       1.269945   penicillin-binding membrane protein PbpB
MMAR_3201  0       10       1.471147   proline rich membrane protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_3194  PF03544  33     0.002    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 33.4 bits (76), Expect = 0.002
Identities = 27/149 (18%), Positives = 50/149 (33%), Gaps = 7/149 (4%)

Query: 387 FINIGYVIGLLPVTGLQLPLISAGGTSTATTLAMIGIIANAARHEPEAVAALRAGRDDRV 446
I+ V GLL + Q+ + A + T+ +A A P+AV +
Sbjct: 23 CIHGAVVAGLLYTSVHQVIELPAPAQPISVTM-----VAPADLEPPQAVQPPPEPVVEPE 77

Query: 447 NRMLRLPLPKPYAPTRLEVFRDRKRVQPPAARPPAKQAAARKAPKAATRLAEEPLRPALP 506
+P P AP +E + + + +P + + K ++ E PA P
Sbjct: 78 PEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARP 137

Query: 507 RRPDRPGARSGQQGAGQRYAGQRHSGRVR 535
+ + +G R R +
Sbjct: 138 --TSSTATAATSKPVTSVASGPRALSRNQ 164


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_3199  cloacin  39     4e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 39.3 bits (91), Expect = 4e-05
Identities = 32/80 (40%), Positives = 35/80 (43%), Gaps = 1/80 (1%)

Query: 356 SGIPGVGGNGGAGGTSGLIGSGGAGGAGGTGGANHASTNSIGGVGGAGGTGGAARLFGSG 415
SG G G N GA TSG I +GG G G GGA+ S S GG+G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNI-NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 416 GSGGTGGVGGIGNTIGGTGG 435
G G GG G G G G
Sbjct: 61 GHGNGGGNGNSGGGSGTGGN 80



Score = 39.3 bits (91), Expect = 4e-05
Identities = 28/88 (31%), Positives = 36/88 (40%), Gaps = 7/88 (7%)

Query: 416 GSGGTGGVGGIGNTIGGTGGVGGGGGAAGLIGDGGAGGAGGNGGDGTGAGGLGGNGASAD 475
G G G G +T G G G G G G + G+G GG+G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLG-------VGGGASDGSGWSSENNPWGGGSGSGIH 55

Query: 476 WIGNGGSGGAGGSGGLGAHAGAGGNGGS 503
W G G G GG+G G +G GGN +
Sbjct: 56 WGGGSGHGNGGGNGNSGGGSGTGGNLSA 83



Score = 37.0 bits (85), Expect = 2e-04
Identities = 33/100 (33%), Positives = 42/100 (42%), Gaps = 6/100 (6%)

Query: 487 GSGGLGAHAGAGGNGGSLYGNGGSGGVGGTGASGAG------GNGGGGGAAGLIGDGGAG 540
G G G + GA G++ G GVGG + G+G GGG G+ G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 541 GDGGDGSGAGGLGGDGGNADWMGNGGAGGSGGAGVPPATG 580
G+GG +GG G GGN + A G P A G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 36.2 bits (83), Expect = 4e-04
Identities = 39/122 (31%), Positives = 44/122 (36%), Gaps = 15/122 (12%)

Query: 378 GAGGAGGTGGANHASTNSIGGVGGAGGTGGAARLFGSGGSGGTGGVGGIGNTIGGTGGVG 437
G G G GA+ S N GG G G GGA+ G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWG--------------G 48

Query: 438 GGGGAAGLIGDGGAGGAGGNGGDGTGAGGLGGNGASADWIGNGGSG-GAGGSGGLGAHAG 496
G G G G G GGNG G G+G G A A + G G+GGL
Sbjct: 49 GSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSIS 108

Query: 497 AG 498
AG
Sbjct: 109 AG 110



Score = 33.9 bits (77), Expect = 0.002
Identities = 30/88 (34%), Positives = 38/88 (43%), Gaps = 4/88 (4%)

Query: 462 TGAGGLGGNGASADWIGNGGSGGAGGSGGLGAHAGAGGNGGSLYGNGGSGGVGGTGASGA 521
+G G G N + GN G G G GA G+G + + GGSG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 522 GGNGGGGGAAGLIGDGGAGGDGGDGSGA 549
GNGGG G +G GG+G G + A
Sbjct: 62 HGNGGGNGNSG----GGSGTGGNLSAVA 85



Score = 33.1 bits (75), Expect = 0.003
Identities = 33/104 (31%), Positives = 41/104 (39%), Gaps = 3/104 (2%)

Query: 437 GGGGGAAGLIGDGGAGGAGGNGGDGTGAGGLGGNGASADWIGNGGSGGAGGSGGLGAHAG 496
G G G GG G G G G G+G S++ GG G+G G G+ G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 497 AGGNGGSLYGNGGSGGVGGTGASGAGGNGGGGGAAGLIGDGGAG 540
GG G +GG G GG ++ A G A G GG
Sbjct: 64 NGGGNG---NSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 33.1 bits (75), Expect = 0.003
Identities = 33/101 (32%), Positives = 41/101 (40%)

Query: 300 GNGGAGGHGGFGGTATAPVGDGGVGGNGGAGGNGGLFYGDGGAGGAGGTGGGLLQGSGIP 359
G+G G + G G+G GGA G + GG G+G GSG
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 360 GVGGNGGAGGTSGLIGSGGAGGAGGTGGANHASTNSIGGVG 400
GGNG +GG SG G+ A A G ST GG+
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 31.2 bits (70), Expect = 0.013
Identities = 28/93 (30%), Positives = 36/93 (38%), Gaps = 9/93 (9%)

Query: 268 GGGDAGLNNQIFGAKGTPNGAAGGYGGNAGLW-GNGGAGGHGGFGGTATAPVGDGGVGGN 326
GG G N G NG G G G G+G + + +GG + + + GG G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 327 GGAGGNGGLFYGDGGAGGAGGTGGGLLQGSGIP 359
G GGNG G G GG L P
Sbjct: 63 GNGGGNGN--------SGGGSGTGGNLSAVAAP 87



Score = 30.5 bits (68), Expect = 0.021
Identities = 28/87 (32%), Positives = 31/87 (35%), Gaps = 3/87 (3%)

Query: 319 GDGGVGGNGGAGGNGGLFYGDGGAGGAGGTGGGLLQGSGIPGVGGNGGAGGTSGLIGSGG 378
G G G N GA G G G GGG GSG G G SG+ GG
Sbjct: 3 GGDGRGHNTGAHSTSG---NINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 379 AGGAGGTGGANHASTNSIGGVGGAGGT 405
+G G G N + GG A
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAA 86


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3200  IGASERPTASE   30     0.036    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 30.0 bits (67), Expect = 0.036
Identities = 18/91 (19%), Positives = 38/91 (41%), Gaps = 1/91 (1%)

Query: 2 SRGESRQARPSHSSRSRRVSGKAHSAHEPRQPRSSGKTRADRSPKQVREPKRLPQAKQAK 61
S E+++ + + + + V + + E + + K + SPKQ + PQA+ A+
Sbjct: 1088 SGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPAR 1147

Query: 62 KTRPAARTEVAPPGRSARERRTRQAVEVASR 92
+ P + P ++ T Q + S
Sbjct: 1148 ENDPTVNIK-EPQSQTNTTADTEQPAKETSS 1177


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3201  PERTACTIN     30     0.015    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 30.5 bits (68), Expect = 0.015
Identities = 20/69 (28%), Positives = 27/69 (39%), Gaps = 2/69 (2%)

Query: 201 LVQAPDGNWVVVGTPKPADGVPPPPLNTKLPEEGPPAPPKPAALPPEVPVRVMPGPDDPA 260
L +G W +VG P P P + + P P P PP+ P P+ PA
Sbjct: 552 LAANGNGQWSLVGAKAPPAPKPAPQPGPQPGPQPPQPPQPPQ--PPQPPQPPQRQPEAPA 609

Query: 261 LLPRTGPQL 269
P G +L
Sbjct: 610 PQPPAGREL 618


108  MMAR_3254  MMAR_3266  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
MMAR_3254-190.664238hypothetical protein
MMAR_3255-1110.968907branched-chain amino acid aminotransferase
MMAR_3256-1101.195361glycine cleavage system aminomethyltransferase
MMAR_3257-29-0.076868adenylate cyclase
MMAR_3258-29-0.840276leucyl aminopeptidase
MMAR_3259-312-3.505592short chain dehydrogenase
MMAR_3260-117-4.613550dihydrolipoamide acetyltransferase
MMAR_3261026-7.046392hypothetical protein
MMAR_3262031-8.109382integral membrane protein ABC transporter
MMAR_32631131-1.112381ABC transporter ATP-binding protein
MMAR_32641132-0.972507MmpL family transport protein
MMAR_32651437-0.227204MbtH-like protein
MMAR_32661335-0.274312MmpL family transport protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3254  RTXTOXINA     33     0.004    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 32.6 bits (74), Expect = 0.004
Identities = 25/81 (30%), Positives = 36/81 (44%), Gaps = 9/81 (11%)

Query: 87 ISASAATLILANAAVPWTGALVAAVFFVTATASGVVAGVSGVAYTDMISNKLSVLRRGEL 146
+SA +A+ IL+NA T AA + + V+ V IS + R +
Sbjct: 249 LSAISASFILSNADAD-TRTKAAAGVEL---TTKVLGNV-----GKGISQYIIAQRAAQG 299

Query: 147 LLTQGAVGSLLATGVTLVIVP 167
L T A L+A+ VTL I P
Sbjct: 300 LSTSAAAAGLIASAVTLAISP 320


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3259  DHBDHDRGNASE  100    2e-25    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 99.7 bits (248), Expect = 2e-25
Identities = 57/185 (30%), Positives = 90/185 (48%), Gaps = 1/185 (0%)

Query: 328 VAVTGAGSGIGRETALAFAREGAEVVLSDIDEATVKDTAAEIAARGGVAHPYVLDVSDTE 387
+TGA GIG A A +GA + D + ++ + + A A + DV D+
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSA 70

Query: 388 AVEAFADQVSATHGLPDIVVNNAGVGQAGRFLDTPAEQFDRVLDVNLGGVVNGCRAFGQR 447
A++ ++ G DI+VN AGV + G E+++ VN GV N R+ +
Sbjct: 71 AIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSKY 130

Query: 448 LVERGTGGHIVNVSSMAAYAPLQSLSAYCTSKAATFMFSDCLRAELDAADVGLTTICPGV 507
+++R G IV V S A P S++AY +SKAA MF+ CL EL ++ + PG
Sbjct: 131 MMDR-RSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGS 189

Query: 508 INTNI 512
T++
Sbjct: 190 TETDM 194


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3260  IGASERPTASE   42     9e-06    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 41.6 bits (97), Expect = 9e-06
Identities = 42/310 (13%), Positives = 82/310 (26%), Gaps = 34/310 (10%)

Query: 93 PEAE----PAAAAQPEPEAEPEPQPEAKPQSGGSSAAGGDATPVLMPELGESVAEGTVTR 148
PE E + S A D PV P A T +
Sbjct: 983 PEVEKRNQTVDTTNITTPNNIQAD-VPSVPSNNEEIARVDEAPVPPP------APATPSE 1035

Query: 149 WLKKVGDSVQVDEALVEVSTDKVDTEIPSPVAGVLLSITAEEDDVVQVGGELARIGSGSA 208
+ V ++ + + VE + + + E+A+ GS +
Sbjct: 1036 TTETVAENSKQESKTVEKNEQDATE--TTAQNREVAKEAKSNVKANTQTNEVAQSGSETK 1093

Query: 209 AAAPPESKPAPAPEAAPETKA-------APEPKAAPEPKPAPEPKAAPEPKPAPAATPQP 261
E+K E + K P+ + PK P+ +PA P
Sbjct: 1094 ETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTV 1153

Query: 262 AAAPAPSAGDGTPYVTPLVRKLAEENNIDLDSVTGTGVGGRI------------RKQDVL 309
S + T ++ + + T G + +
Sbjct: 1154 NIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNS 1213

Query: 310 AAAEKKKERPEAKPAAAQASAPASPSKAAAPAAAAALAHLRGTKQKASRIRQITAIKTRE 369
++ K K R + + + ++ + AL L + + + A
Sbjct: 1214 ESSNKPKNR-HRRSVRSVPHNVEPATTSSNDRSTVALCDL-TSTNTNAVLSDARAKAQFV 1271

Query: 370 SLQATAQLTQ 379
+L ++Q
Sbjct: 1272 ALNVGKAVSQ 1281



Score = 29.6 bits (66), Expect = 0.047
Identities = 29/162 (17%), Positives = 50/162 (30%), Gaps = 8/162 (4%)

Query: 15 TEGTVTRWLKQEGDTVEIDEPLVEVSTDKVDTEIPSPAAGVLTKIVAKEDDTVEVGGELA 74
T K+ V+ + EV+ +T+ T V KE+ E
Sbjct: 1061 TTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKV---ETE 1117

Query: 75 IIGDAAESGGGDAPSQPEPEAEPAAAAQPEPEAEPEPQPEAK-PQS-GGSSAAGGDATPV 132
+ + +P Q + E A EP E +P K PQS ++A
Sbjct: 1118 KTQEVPKVTSQVSPKQEQSETVQPQA---EPARENDPTVNIKEPQSQTNTTADTEQPAKE 1174

Query: 133 LMPELGESVAEGTVTRWLKKVGDSVQVDEALVEVSTDKVDTE 174
+ + V E T V ++ + T ++
Sbjct: 1175 TSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESS 1216


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3261  NUCEPIMERASE  37     5e-05    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 37.5 bits (87), Expect = 5e-05
Identities = 31/150 (20%), Positives = 50/150 (33%), Gaps = 32/150 (21%)

Query: 6 VAIAGSSGLIGSALAAALRAADHRVLRI--------VRRTPANSEELHWNPESGEF---- 53
+ G++G IG ++ L A H+V+ I V A E L + G
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELL---AQPGFQFHKI 59

Query: 54 ---DPDALTD------VDVVINLC---GVGIGRRRWSGAFKQSLRDSRITPTEVLSSAVA 101
D + +TD + V V R+S + DS +T +
Sbjct: 60 DLADREGMTDLFASGHFERVFISPHRLAV-----RYSLENPHAYADSNLTGFLNILEGCR 114

Query: 102 DAGVPTLINASAVGYYGDTRDRVVDENDPA 131
+ L+ AS+ YG R +D
Sbjct: 115 HNKIQHLLYASSSSVYGLNRKMPFSTDDSV 144


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3263  PF05272       31     0.008    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.8 bits (69), Expect = 0.008
Identities = 21/143 (14%), Positives = 42/143 (29%), Gaps = 38/143 (26%)

Query: 3 RSQSAVEVIDLVKRRGSVTAVDGISFAVPPG----GVLGLLGPNGAGKTTTVRMLATLTR 58
+ ++ G + ++ + PG + L G G GK+T + L
Sbjct: 562 PDDYKPRRLRYLQLVGKYILMGHVARVMEPGCKFDYSVVLEGTGGIGKSTLINTLV---- 617

Query: 59 PTSGAAWVAGH--DVCAAPESVRREIGLTCQEATLDGLLTARENINMIGSLRGIRRKELA 116
G + + D+ +S + G+ E + + RR +
Sbjct: 618 ---GLDFFSDTHFDIGTGKDSYEQIAGIVAYE---------------LSEMTAFRRADAE 659

Query: 117 SLTDRLLDQFSIAEFADRRVDTY 139
++ F R D Y
Sbjct: 660 ----------AVKAFFSSRKDRY 672


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3264  ACRIFLAVINRP  46     7e-07    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 46.0 bits (109), Expect = 7e-07
Identities = 47/289 (16%), Positives = 104/289 (35%), Gaps = 42/289 (14%)

Query: 212 IGVIVVMLLVIYGSVTTALVVLVMVVLQLAAARGVVALLGYHGLVGLSIFSTNVLVTLVI 271
I ++ +++ + ++ L+ + V + L ++A GY ++ + + +V+
Sbjct: 348 IMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGY----SINTLT---MFGMVL 400

Query: 272 AAGT--DYAIFLVGRYQEARSAGQD--RESAFFTMFGGTAHVVLGSGLTIAGAMLCLSF- 326
A G D AI +V + + +E+ +M ++G + ++ + ++F
Sbjct: 401 AIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSM-SQIQGALVGIAMVLSAVFIPMAFF 459

Query: 327 --TRLPYLQTLGVPLAVGMTVGVLAALTLGPALIAV------------TSRFGKVLEPRR 372
+ + + + M + VL AL L PAL A F
Sbjct: 460 GGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTF 519

Query: 373 QLRVRRWRKLGAAIARWPGPILITASVLALGGLLVLPGYRASFDDRNYLPKDVPANIGYA 432
V + I G L+ L + G++VL ++LP++ + G
Sbjct: 520 DHSVNHYTNSVGKILGSTGRYLL-IYALIVAGMVVL----FLRLPSSFLPEE---DQGVF 571

Query: 433 AAERGFGAARMNPDVLLVESNHDLRNSADLLVIDKIA--KAIFAVEGIS 479
++ + L D + ++ A +++F V G S
Sbjct: 572 -----LTMIQLPAGATQERTQKVLDQVTDYYLKNEKANVESVFTVNGFS 615



Score = 37.5 bits (87), Expect = 3e-04
Identities = 40/237 (16%), Positives = 80/237 (33%), Gaps = 39/237 (16%)

Query: 148 YVQVNLMGLQGHARSIRSVKAVQRIVDSTPA--PDGVTTFVTGSAALMVDQQTVGSRSMR 205
Y + M +QG A S ++++ + P G+ TG M Q+ +
Sbjct: 818 YNGLPSMEIQGEAAPGTSSGDAMALMENLASKLPAGIGYDWTG----MSYQERLSGNQAP 873

Query: 206 LVELVTIGVIVVMLLVIYGSVTTALVVLVMVVLQLAAARGVVALLGYHGLVGLSIFSTNV 265
+ ++ V+ + L +Y S + + V+++V L + L ++
Sbjct: 874 ALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQK----NDVYFMVG 929

Query: 266 LVTLVIAAGTDYAIFLVGRYQEARSA-GQDRESA---------------FFTMFGGTAHV 309
L+T + + + AI +V ++ G+ A G +
Sbjct: 930 LLTTIGLSAKN-AILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPL 988

Query: 310 VLGSGLTIAGAMLCLSFTRLPYLQTLGVPLAVGMTVGVLAALTLGPALIAVTSRFGK 366
+ +G AG+ +G+ + GM L A+ P V R K
Sbjct: 989 AISNG---AGSGA---------QNAVGIGVMGGMVSATLLAIFFVPVFFVVIRRCFK 1033



Score = 34.0 bits (78), Expect = 0.004
Identities = 33/171 (19%), Positives = 62/171 (36%), Gaps = 7/171 (4%)

Query: 781 IAALSLIFLIMLNITRSAIAALVIVGSVAASLGASVGLSVLLWQHLIGIELHWLVLSMSV 840
A+ L+FL+M ++ A L+ +V L + + + + + +VL++ +
Sbjct: 345 FEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGL 404

Query: 841 IVLLAVGADYNLLLVSRLKEELHAGINTAIIRTVGATGSVATSAGLVFAFTMISMAV--- 897
+V A+ N V R+ E A +++ +V + I MA
Sbjct: 405 LVDDAIVVVEN---VERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGG 461

Query: 898 SDLTVIAQIGTTIGMGLLFDTFVVRALMTPSIAVLLGRWFWWPHHVRPRPI 948
S + Q TI + V L TP++ L + HH
Sbjct: 462 STGAIYRQFSITIVSAMALSVLVALIL-TPALCATLLKPVSAEHHENKGGF 511


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3266  ACRIFLAVINRP  44     2e-06    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 44.4 bits (105), Expect = 2e-06
Identities = 47/293 (16%), Positives = 103/293 (35%), Gaps = 42/293 (14%)

Query: 208 LLVTFAVIVVMLLVIYGSVTTALAVLVMVVLQLAAARGVVALLGYHGLVGLSIFSTNVLV 267
L ++ +++ + ++ L + V + L ++A GY ++ + +
Sbjct: 344 LFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGY----SINTLT---MF 396

Query: 268 TLVIAAGT--DYAIFLVGRYQEARSAGQD--RESAFFTMFGGTAHVVLGSGLTIAGAMLC 323
+V+A G D AI +V + + +E+ +M ++G + ++ +
Sbjct: 397 GMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSM-SQIQGALVGIAMVLSAVFIP 455

Query: 324 LSF---TRLPYLQTLGVPLAVGMTVGVLAALTLGPALIAV------------TSRFGKVL 368
++F + + + + M + VL AL L PAL A F
Sbjct: 456 MAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWF 515

Query: 369 EPRRQLRVRRWRKLGAAIARWPGPILITASVLALGGLLVLPGYRASFDDRNYLPKDVPAN 428
V + I G L+ L + G++VL ++LP++ +
Sbjct: 516 NTTFDHSVNHYTNSVGKILGSTGRYLL-IYALIVAGMVVL----FLRLPSSFLPEE---D 567

Query: 429 IGYAAAERGFGAARMNPDVLLVESNHDLRNSADLLVIDKIA--KAIFAVEGIS 479
G ++ + L D + ++ A +++F V G S
Sbjct: 568 QGVF-----LTMIQLPAGATQERTQKVLDQVTDYYLKNEKANVESVFTVNGFS 615



Score = 37.9 bits (88), Expect = 2e-04
Identities = 35/219 (15%), Positives = 77/219 (35%), Gaps = 39/219 (17%)

Query: 164 ESVAAVQGTLRDMPGPDGVNAFLTGPAVVLADQQIAGDRSMRLILLVTFAVIVVMLLVIY 223
+++A ++ +P G+ TG ++ Q+ ++ ++F V+ + L +Y
Sbjct: 838 DAMALMENLASKLP--AGIGYDWTG----MSYQERLSGNQAPALVAISFVVVFLCLAALY 891

Query: 224 GSVTTALAVLVMVVLQLAAARGVVALLGYHGLVGLSIFSTNVLVTLVIAAGTDYAIFLVG 283
S + ++V+++V L + L ++ L+T + + + AI +V
Sbjct: 892 ESWSIPVSVMLVVPLGIVGVLLAATLFNQK----NDVYFMVGLLTTIGLSAKN-AILIVE 946

Query: 284 RYQEARSA-GQDRESA---------------FFTMFGGTAHVVLGSGLTIAGAMLCLSFT 327
++ G+ A G + + +G AG+
Sbjct: 947 FAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNG---AGSGA----- 998

Query: 328 RLPYLQTLGVPLAVGMTVGVLAALTLGPALIAVTSRFGK 366
+G+ + GM L A+ P V R K
Sbjct: 999 ----QNAVGIGVMGGMVSATLLAIFFVPVFFVVIRRCFK 1033



Score = 34.0 bits (78), Expect = 0.003
Identities = 33/171 (19%), Positives = 62/171 (36%), Gaps = 7/171 (4%)

Query: 781 IAALSLIFLIMLNITRSAIAALVIVGSVAASLGASVGLSVLLWQHLIGIELHWLVLSMSV 840
A+ L+FL+M ++ A L+ +V L + + + + + +VL++ +
Sbjct: 345 FEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGL 404

Query: 841 IVLLAVGADYNLLLVSRLKEELHAGINTAIIRTVGATGSVATSAGLVFAFTMISMAV--- 897
+V A+ N V R+ E A +++ +V + I MA
Sbjct: 405 LVDDAIVVVEN---VERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGG 461

Query: 898 SDLTVIAQIGTTIGMGLLFDTFVVRALMTPSIAVLLGRWFWWPHHVRPRPI 948
S + Q TI + V L TP++ L + HH
Sbjct: 462 STGAIYRQFSITIVSAMALSVLVALIL-TPALCATLLKPVSAEHHENKGGF 511


109  MMAR_3509  MMAR_3514  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_3509  0       9        1.594847   short-chain alcohol dehydrogenase
MMAR_3510  -1      9        -0.139086  enoyl-CoA hydratase, EchA8_5
MMAR_3511  -1      8        -0.806565  acyl-CoA dehydrogenase
MMAR_3512  0       9        -1.422255  acyl-CoA dehydrogenase
MMAR_3513  0       11       -1.873200  oxidoreductase
MMAR_3514  0       11       -2.716978  oxidoreductase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3509  DHBDHDRGNASE  60     9e-13    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 59.7 bits (144), Expect = 9e-13
Identities = 39/191 (20%), Positives = 68/191 (35%), Gaps = 9/191 (4%)

Query: 21 GVHVRTIAVDLVDPEAVALICSETADLEVGLLIYNAGANTCREQFLDA-ELTELGRVIDL 79
H D+ D A+ I + + I A R + + E +
Sbjct: 56 ARHAEAFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSV 115

Query: 80 NVTRMLELVQHFGRPMRARRRGGIVLVGSTAATYGAMRQSAYAGAKAFSRLFAESLWLEL 139
N T + + + M RR G IV VGS A +AYA +KA + +F + L LEL
Sbjct: 116 NSTGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLEL 175

Query: 140 RETGVDVLELVLGVTRTPAMQRAGLNFDAPGISVSDPADVAREGL--------DNLANGP 191
E + + G T T + + + + + G+ ++A+
Sbjct: 176 AEYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAV 235

Query: 192 VVIAGDNAAHV 202
+ + A H+
Sbjct: 236 LFLVSGQAGHI 246


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3511  TYPE3OMGPROT  32     0.004    Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature.

Length = 607

Score = 32.2 bits (73), Expect = 0.004
Identities = 19/60 (31%), Positives = 27/60 (45%), Gaps = 3/60 (5%)

Query: 248 GWTQVTSELSFERSGPERFLSTFVLLAATAESMAEQRFTRDQQLGRLVARIAGLHQMSAA 307
GW S SGP R+L L+ TA ++ +Q R ++ G L I L SA+
Sbjct: 139 GWRPDASNRLVYVSGPPRYLE---LVEQTAAALEQQTQIRSEKTGALAIEIFPLKYASAS 195


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3513  DHBDHDRGNASE  38     3e-05    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 37.7 bits (87), Expect = 3e-05
Identities = 39/179 (21%), Positives = 70/179 (39%), Gaps = 17/179 (9%)

Query: 4 EALVAVVTGASRGAGRGIAAALAARGWRVYA----------TSRSVADAPPGGVAVRVDH 53
E +A +TGA++G G +A LA++G + A S+ A D
Sbjct: 7 EGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADV 66

Query: 54 RDDTATAALFERVQQESGRLDLLVNNAAAISDDLVD--PKPFWEKPLALADVLDVGLRSS 111
RD A + R+++E G +D+LVN A + L+ WE ++ + V S
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSV-NSTGVFNASR 125

Query: 112 YVASWYAAPLLVAGGRGLIAFTSSPGSVCYMHGPAYGAQKAGIDKMAADMAVDFHDTGV 170
V+ + ++ S+P V AY + KA + ++ + +
Sbjct: 126 SVSKYMMD----RRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNI 180


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3514  DHBDHDRGNASE  56     2e-11    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 55.8 bits (134), Expect = 2e-11
Identities = 42/175 (24%), Positives = 66/175 (37%), Gaps = 6/175 (3%)

Query: 6 VIGASSGLGRCIGVGLARRGDHVA---LLARRRERTEAAAKEAGPGTVAVECDVTDEASC 62
+ GA+ G+G + LA +G H+A + E+ ++ K A DV D A+
Sbjct: 13 ITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSAAI 72

Query: 63 RSALAAAADALGGIDNLVYAPAISPLVRLAETDADTWRRVFDTNVIGASLATAAALPHLR 122
A +G ID LV + + + W F N G A+ + ++
Sbjct: 73 DEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSKYMM 132

Query: 123 AA-AGKAVYLSSDAGSHAPPWPGLGAYGVSKAALERLVEAWRAEHPEIGFTCVIV 176
+G V + S+ P + AY SKAA + E E C IV
Sbjct: 133 DRRSGSIVTVGSNPAG--VPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185


110  MMAR_3547  MMAR_3551  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_3547  -1      12       -0.758924  serine/threonine-protein kinase transcriptional
MMAR_3548  -2      13       -0.067340  hypothetical protein
MMAR_3549  -2      12       -0.127791  hypothetical protein
MMAR_3550  -1      14       0.667371   PE-PGRS family protein
MMAR_3551  -1      11       -0.163156  proline, glycine, valine-rich secreted protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3547  YERSSTKINASE  38     2e-04    Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 38.2 bits (88), Expect = 2e-04
Identities = 36/112 (32%), Positives = 50/112 (44%), Gaps = 11/112 (9%)

Query: 176 RVGVVHRDVKPANVLLTD-YGEPALCDFGIARMEAGFETGPGMFVGSPAYTAPELLAGE- 233
+ GVVH D+KP NV+ GEP + D G+ P F S + APEL G
Sbjct: 263 KAGVVHNDIKPGNVVFDRASGEPVVIDLGLHSRSG---EQPKGFTES--FKAPELGVGNL 317

Query: 234 SPNAASDVYALGASLFAGLTGHAAFERRNGEQVVAQFLRIANESIPDLRDNN 285
+ SDV+ + ++L + G FE +N E Q LR + D N
Sbjct: 318 GASEKSDVFLVVSTLLHCIEG---FE-KNPEIKPNQGLRFITSEPAHVMDEN 365


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3548  HTHTETR       64     2e-14    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 63.9 bits (155), Expect = 2e-14
Identities = 19/63 (30%), Positives = 31/63 (49%)

Query: 73 PGRPRGNSSDTRQRILASARELFASNGIDRTSIRAVAAAADVDAALVHHYFGTKQQLFAA 132
+ + + +TRQ IL A LF+ G+ TS+ +A AA V ++ +F K LF+
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 133 AIH 135

Sbjct: 62 IWE 64


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3550  cloacin       38     8e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 37.8 bits (87), Expect = 8e-05
Identities = 33/121 (27%), Positives = 46/121 (38%), Gaps = 4/121 (3%)

Query: 207 GAARAGGAANSAGGTDAAASSSGAGGGGSAGQGVAST----GGDHGAGATSSASGHSPGH 262
G GA +++G + + G GGG S G G +S GG G+G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 263 GVGLGSGLGRSLDGYGTAALAGGPLGAVGSGAPGAVGGVGELSAPTATSALPTAASATAE 322
G SG G G +A A G PGA G +SA ++A+ +A
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMAALKG 125

Query: 323 P 323
P
Sbjct: 126 P 126



Score = 29.7 bits (66), Expect = 0.028
Identities = 33/119 (27%), Positives = 45/119 (37%), Gaps = 11/119 (9%)

Query: 172 AAGNGDAASVGTGVQHGGAGGGSGDGMASSHVSDGGAARAGGAANSAGGTDAAASSSGAG 231
+GN + G GV G G G G +S + GG + +G G G G
Sbjct: 16 TSGNINGGPTGLGV---GGGASDGSGWSSENNPWGGGSGSGIHWGGGSG-------HGNG 65

Query: 232 GGGSAGQGVASTGGDHGAGATSSASGHSPGHGVGLGSGLGRSLDGYGTAALAGGPLGAV 290
GG G + TGG+ A A A G G G GL S+ +A + A+
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAG-GLAVSISAGALSAAIADIMAAL 123


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3551  PRTACTNFAMLY  30     0.007    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 30.0 bits (67), Expect = 0.007
Identities = 20/53 (37%), Positives = 23/53 (43%)

Query: 94 PAPVPAGAPVPLPAGAPVPVPVAVGAPVPAAAPAPAAAPLLLQAGGKGEPTAI 146
PAP PA P P P P P P A PA AAA + GG G + +
Sbjct: 573 PAPKPAPQPGPQPPQPPQPQPEAPAPQPPAGRELSAAANAAVNTGGVGLASTL 625


111  MMAR_3758  MMAR_3765  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_3758  6       17       5.201651   PE-PGRS family protein
MMAR_3759  3       16       3.222174   NAD synthetase
MMAR_3760  4       17       3.862130   Sir2-like regulatory protein
MMAR_3761  5       18       4.292575   cytochrome P450 268A2 Cyp268A2
MMAR_3762  6       19       5.021026   AcrR family transcriptional regulator
MMAR_3763  4       17       4.246973   PE-PGRS family protein
MMAR_3764  4       12       0.439337   gamma-glutamyl kinase
MMAR_3765  5       13       0.404270   GTPase ObgE
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3758  cloacin       39     1e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.5 bits (89), Expect = 1e-04
Identities = 35/105 (33%), Positives = 38/105 (36%), Gaps = 3/105 (2%)

Query: 484 GGGGRGGTGGAGGIGGTTTGGGDAGAGGGGGAGGTGGSGGEGGAGGFGGDGGAGGTGGKA 543
GG GRG GA G GG GGG + G+G S GG G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 544 GAGGDGGNATDGGNAGAGGDGGTGGAGGTGGGIVTGINGGAGGAG 588
G GG GN G G + A G GAGG
Sbjct: 63 GNGGGNGN---SGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 38.5 bits (89), Expect = 1e-04
Identities = 33/101 (32%), Positives = 40/101 (39%)

Query: 440 TGGDGGKGGGGGNATDGGHGGNGGTGGTAGDGGNGQSGDLNANGGGGGRGGTGGAGGIGG 499
+GGDG G ++T G G G G +G N GGG G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 500 TTTGGGDAGAGGGGGAGGTGGSGGEGGAGGFGGDGGAGGTG 540
GGG+ +GGG G GG + A GF G G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 37.8 bits (87), Expect = 2e-04
Identities = 39/116 (33%), Positives = 43/116 (37%), Gaps = 4/116 (3%)

Query: 558 AGAGGDGGTGGAGGTGGGIVTGINGGAGGAGG-DGGDSGNGGNATGGGNGGNGGTAGTGG 616
+G G G GA T G I G G G G DG + N GGG+G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 617 AGGLGGGGFRVGHGGGGGNGGNGGVGGTATAGGDAGSGGTGGVGGDGSVGGDASDA 672
G GG G GGG G GGN A G G G S+ A A
Sbjct: 62 HGNGGGNG---NSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSA 114



Score = 37.0 bits (85), Expect = 3e-04
Identities = 29/90 (32%), Positives = 34/90 (37%)

Query: 531 GGDGGAGGTGGKAGAGGDGGNATDGGNAGAGGDGGTGGAGGTGGGIVTGINGGAGGAGGD 590
GGDG TG + +G G T G G DG + G +G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 591 GGDSGNGGNATGGGNGGNGGTAGTGGAGGL 620
G GNG + G G GGN A G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGF 92



Score = 36.6 bits (84), Expect = 4e-04
Identities = 28/80 (35%), Positives = 36/80 (45%)

Query: 539 TGGKAGAGGDGGNATDGGNAGAGGDGGTGGAGGTGGGIVTGINGGAGGAGGDGGDSGNGG 598
+GG G ++T G G G GG G G + N GG+G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 599 NATGGGNGGNGGTAGTGGAG 618
+ GGGNG +GG +GTGG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 36.6 bits (84), Expect = 4e-04
Identities = 28/85 (32%), Positives = 34/85 (40%)

Query: 461 NGGTGGTAGDGGNGQSGDLNANGGGGGRGGTGGAGGIGGTTTGGGDAGAGGGGGAGGTGG 520
+GG G G + SG++N G G GG G + G+G G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 521 SGGEGGAGGFGGDGGAGGTGGKAGA 545
G GG G GG G GG A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 34.7 bits (79), Expect = 0.002
Identities = 25/72 (34%), Positives = 31/72 (43%)

Query: 723 GHGGGAGGAGGDAGDGGKGGDALDGGTAGAGGAGGKAGNGGGAGSGGKGTFDGGAGGAGG 782
GH GA G+ G G G + G+G + GGG+GSG G G GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 783 DGGKAGNGGAGG 794
+G G G GG
Sbjct: 68 NGNSGGGSGTGG 79



Score = 34.3 bits (78), Expect = 0.002
Identities = 26/80 (32%), Positives = 33/80 (41%)

Query: 371 AGNTGTGGTGGTGGTGGNGDLHTNGGNGGTGGSGGAGIAGVGGGNGGTGGDGGKGGTGAN 430
+G G G G T GN + G G G S G+G + GG G G G G+
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 431 GAPLGASGGTGGDGGKGGGG 450
G +G +GG G GG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 33.9 bits (77), Expect = 0.003
Identities = 27/79 (34%), Positives = 30/79 (37%), Gaps = 1/79 (1%)

Query: 631 GGGGNGGNGGVGGTA-TAGGDAGSGGTGGVGGDGSVGGDASDAVAGGDGGTGGRGGSGGA 689
GG G G N G T+ G G GG DGS ++ GG G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 690 GGIATGGGSAGHGGVGGGG 708
G G S G G GG
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 33.9 bits (77), Expect = 0.003
Identities = 31/85 (36%), Positives = 38/85 (44%), Gaps = 1/85 (1%)

Query: 353 AGRGGAAGAGGPLVTPGHAGNTGTGGTGGTGGTGGNGDLHTNGGNGGTGGSGGAGIAGVG 412
+G G G T G+ TG G G + G+G N GG GSG G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 413 GGNGGTGGDGGKG-GTGANGAPLGA 436
GNGG G+ G G GTG N + + A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 33.5 bits (76), Expect = 0.004
Identities = 24/80 (30%), Positives = 30/80 (37%)

Query: 526 GAGGFGGDGGAGGTGGKAGAGGDGGNATDGGNAGAGGDGGTGGAGGTGGGIVTGINGGAG 585
G G G + GA T G G G G + G+G GG G + G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 586 GAGGDGGDSGNGGNATGGGN 605
G GG G+SG G G +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLS 82



Score = 33.1 bits (75), Expect = 0.005
Identities = 29/110 (26%), Positives = 37/110 (33%)

Query: 415 NGGTGGDGGKGGTGANGAPLGASGGTGGDGGKGGGGGNATDGGHGGNGGTGGTAGDGGNG 474
+GG G G +G G G G GG G G +++ G G G GG+G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 475 QSGDLNANGGGGGRGGTGGAGGIGGTTTGGGDAGAGGGGGAGGTGGSGGE 524
GGG G G + G A + G G S G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 33.1 bits (75), Expect = 0.005
Identities = 32/105 (30%), Positives = 42/105 (40%), Gaps = 2/105 (1%)

Query: 265 AGSGGSGGAGGDGAPGDTAGAGGTGGTGSAGGAGGTGGAGGANQFFGHAGDGGHGGTGGT 324
+G G G G + G TG G + G+G + N + G +G G H G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 325 GGTGGAGGSGVGSGLAGGAGGDGGAGGAAGRGGAA--GAGGPLVT 367
G GG G+ G GG A A G + GAGG V+
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVS 106



Score = 31.6 bits (71), Expect = 0.017
Identities = 28/82 (34%), Positives = 32/82 (39%), Gaps = 6/82 (7%)

Query: 395 GGNGGTGGSGGAGIAGVGGGNGGTGGDGGKGGTGANGAPLGASGGTGGDGGKGGGGGNAT 454
G N G + G G G G G G G + N G SG GG G G
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGN--- 64

Query: 455 DGGHGGNGGTGGTAGDGGNGQS 476
GG GN GG +G GGN +
Sbjct: 65 -GGGNGNS--GGGSGTGGNLSA 83



Score = 30.8 bits (69), Expect = 0.025
Identities = 27/89 (30%), Positives = 34/89 (38%), Gaps = 1/89 (1%)

Query: 321 TGGTGGTGGAGGSGVGSGLAGGAGGDGGAGGAAGRGGAAGAGGPLVTPGHAGNTGTGGTG 380
+GG G G + GG G G GGA+ G + P G GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPW-GGGSGSGIHWGGGS 60

Query: 381 GTGGTGGNGDLHTNGGNGGTGGSGGAGIA 409
G G GGNG+ G GG + A +A
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVA 89



Score = 30.8 bits (69), Expect = 0.026
Identities = 24/79 (30%), Positives = 29/79 (36%)

Query: 146 GSGAPGQRGGAGGAGGLLLGNGGAGGAGGVGALGQASGTGGNGGAGGLLFGKGGAGGAGG 205
G G GA G + G G GG + G + N GG G GG+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 206 VGGLGQAGGTGGNGGAGGL 224
G G GG+G G L
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 30.8 bits (69), Expect = 0.028
Identities = 30/95 (31%), Positives = 34/95 (35%), Gaps = 2/95 (2%)

Query: 213 GGTGGNGGAGGLLFGNGGAGGVGGAGGVGGVDSAGGTGGTGGTGGANGLFGAAGSGGSGG 272
G G G + G GVGG G S+ GG+G G +G G GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 273 AGGDGAPGDTAGAGGTGGTGSAGG--AGGTGGAGG 305
G G T G A G A T GAGG
Sbjct: 68 NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 30.5 bits (68), Expect = 0.031
Identities = 33/103 (32%), Positives = 37/103 (35%), Gaps = 1/103 (0%)

Query: 171 GAGGVGALGQASGTGGNGGAGGLLFGKGGAGGAGGVGGLGQAGGTGGNGGAGGLLFGNGG 230
G G G A T GN G G GG G + G G + GG G+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGG-GASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 231 AGGVGGAGGVGGVDSAGGTGGTGGTGGANGLFGAAGSGGSGGA 273
G GG G GG GG A G + G G A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 30.5 bits (68), Expect = 0.032
Identities = 28/82 (34%), Positives = 33/82 (40%), Gaps = 2/82 (2%)

Query: 651 AGSGGTGGVGGDGSVGGDASDAVAGGDGGTGGRGGSGGAGGIATGGGSAGHGGVGGGGGN 710
+G G G G S G+ + G G G GSG + GG G G GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGG--GSGSGIHWGGG 59

Query: 711 GGHGTDGTAGVAGHGGGAGGAG 732
GHG G G +G G G GG
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNL 81


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3762  HTHTETR       55     2e-11    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 54.6 bits (131), Expect = 2e-11
Identities = 26/167 (15%), Positives = 57/167 (34%), Gaps = 11/167 (6%)

Query: 6 RNAQANRRQRREQMECRLLEATERLMNNGASFTELSVDRLATEAGISRASFYIYFDDKGH 65
R + ++ R+ +L+ RL + + S+ +A AG++R + Y +F DK
Sbjct: 3 RKTKQEAQETRQ----HILDVALRLFSQQ-GVSSTSLGEIAKAAGVTRGAIYWHFKDKSD 57

Query: 66 LLRRLAGQVFDDLATGAQHWWDVAWRHDPDDVRAAMRAII------ARYRRHQPILIALN 119
L + ++ + +R + ++ R R I+
Sbjct: 58 LFSEIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKC 117

Query: 120 EMAGYDPQTAQTYRDILTAISARLARVIEDGQADGSIRPELSATTTA 166
E G Q R++ R+ + ++ + +L A
Sbjct: 118 EFVGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAA 164


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3763  cloacin       37     5e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 37.4 bits (86), Expect = 5e-04
Identities = 35/118 (29%), Positives = 46/118 (38%), Gaps = 4/118 (3%)

Query: 649 IGGAGGDGGAGGKAQAAGFADGTEGVGGAGGEGGAGGVAGD----GGKGADAAAFSGAAG 704
+ G G G G +G +G G GG G G G+ + G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 705 GNGGHGGDNGAGGAGGTGGAGSTVGAHGADGFSPITGGNGGDGGNGASGPAASAGVAG 762
G+G GG+ +GG GTGG S V A A GF ++ G S A SA +A
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 37.0 bits (85), Expect = 7e-04
Identities = 38/119 (31%), Positives = 47/119 (39%), Gaps = 2/119 (1%)

Query: 423 NGGNGGVGGAGGVGGSGGQGGFLRFLGGQGDGGAGGAGGAGGVAGDGGKGADAAAFSGAA 482
+GG+G G SG G LG G G + GG G+ G+
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 483 GGNGGHGGDNGAGGAGGTGGAGSTVGAHGADGFSPITGGNGGDGGNGASGPAASAGVAG 541
GNGG G +GG GTGG S V A A GF ++ G S A SA +A
Sbjct: 62 HGNGGGNG--NSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 37.0 bits (85), Expect = 8e-04
Identities = 34/81 (41%), Positives = 38/81 (46%), Gaps = 3/81 (3%)

Query: 234 AGGQGLGLNGGAGGAGGN--GGLFGIGGTGGAGGDSAGSGSAGAGGDG-GHGGLWGRGGA 290
+GG G G N GA GN GG G+G GGA S S G G G G WG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 291 GGIGGVNSDSGDGGAGGGGGA 311
G GG N +SG G GG +
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLS 82



Score = 35.8 bits (82), Expect = 0.002
Identities = 30/89 (33%), Positives = 33/89 (37%)

Query: 1618 GNGGSGGNGGIGGNATGFNAPGTGGAGGNGGNGAFGGSGGTGGRGGSSSVGTGGAGGDGG 1677
G G G N G + N TG G G + G S GG S G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1678 DGGMGFGGIGGGGGTGGQGGSGVDAQGIG 1706
G G G GGG GTGG + G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 34.7 bits (79), Expect = 0.004
Identities = 36/103 (34%), Positives = 41/103 (39%), Gaps = 1/103 (0%)

Query: 798 AGASGENGGDGGAGGMGGAGGMGGVLGGHGGAGGIGGVGATGGSGGLGATGAEGVTGGNL 857
+G G G G G LG GGA G + G G+ GG+
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 858 HGGDGGNGGKGGIGGAGGDGGAGGKAQAAGF-ADGTEGAGGAG 899
HG GGNG GG G GG+ A A GF A T GAGG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 34.3 bits (78), Expect = 0.005
Identities = 34/118 (28%), Positives = 40/118 (33%)

Query: 1018 NGGDGHTGLTGGDGGAGGKGGALAGHGGDGGTGGVGGTGGTGGSGGTGTSGIFSSANGGD 1077
+GGDG TG +G G G G GG G G G+ G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1078 GGNGGDGGTGGTGGVGGLGGQAQAAGFADGTQGVGGAGGVGGTGGNAGNGGHGANADV 1135
GNGG G G G G A AA A G + G G + A AD+
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADI 119



Score = 33.9 bits (77), Expect = 0.007
Identities = 35/116 (30%), Positives = 43/116 (37%), Gaps = 5/116 (4%)

Query: 1240 GTGGTGGQGGAGGSGGALAGHGGDGGSGGDGGTGGTGGTGRNGANGITGADI----DGGS 1295
G G G GA + G + G G GG G + N G +G+ I G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1296 GGKGGNGGTGGAGGAGGQGGQAQAAGYSDGTQGVGGAGGDGGTAGIGGDGGDGANA 1351
G GGNG +GG G GG A AA + G + G G I A A
Sbjct: 63 GNGGGNGNSGGGSGTGG-NLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIA 117



Score = 33.9 bits (77), Expect = 0.007
Identities = 27/81 (33%), Positives = 35/81 (43%)

Query: 993 GNGGTGGTGGTGGIGFNGSTTIGGGNGGDGHTGLTGGDGGAGGKGGALAGHGGDGGTGGV 1052
G G G T G G T +G G G +G + + GG G+ GG G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 1053 GGTGGTGGSGGTGTSGIFSSA 1073
GG G +GG GTG + +A
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAA 86



Score = 33.1 bits (75), Expect = 0.010
Identities = 29/92 (31%), Positives = 37/92 (40%), Gaps = 4/92 (4%)

Query: 870 IGGAGGDGGAGGKAQAAGFADGTEGAGGAGGEGGAGGVAGD----GGKGADAAAFSGAAG 925
+ G G G G +G +G G GG G G G+ + G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 926 GNGGHGGDNGAGGAGGTGGAGSTVGAHGADGF 957
G+G GG+ +GG GTGG S V A A GF
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGF 92



Score = 33.1 bits (75), Expect = 0.010
Identities = 35/103 (33%), Positives = 40/103 (38%), Gaps = 1/103 (0%)

Query: 577 AGASGENGGDGGAGGMGGAGGMGGVLGGHGGAGGIGGVGATGGSGGLGATGAEGVTGGNL 636
+G G G G G LG GGA G + G G+ GG+
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 637 HGGDGGNGGKGGIGGAGGDGGAGGKAQAAGF-ADGTEGVGGAG 678
HG GGNG GG G GG+ A A GF A T G GG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.8 bits (74), Expect = 0.014
Identities = 28/83 (33%), Positives = 36/83 (43%)

Query: 1584 GGDGGGGGPGGLGGQGGLSGDGSSTGAAGELGTFGNGGSGGNGGIGGNATGFNAPGTGGA 1643
GGDG G G G ++G + G G S N GG+ +G + G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1644 GGNGGNGAFGGSGGTGGRGGSSS 1666
G GGNG GG GTGG + +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVA 85



Score = 32.4 bits (73), Expect = 0.019
Identities = 31/84 (36%), Positives = 39/84 (46%), Gaps = 1/84 (1%)

Query: 1201 SGGDGGLYGNGGDGGDGG-NGGKGQVGSTGPVAGSAGGTGGTGGTGGQGGAGGSGGALAG 1259
SGGDG + G G NGG +G G + +G + GG G+G G +G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1260 HGGDGGSGGDGGTGGTGGTGRNGA 1283
HG GG+G GG GTGG A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVA 85



Score = 32.0 bits (72), Expect = 0.025
Identities = 34/117 (29%), Positives = 47/117 (40%), Gaps = 1/117 (0%)

Query: 1058 TGGSGGTGTSGIFSSANGGDGGNGGDGGTGGTGGVGGLGGQAQAAGFADGTQGVGGAGGV 1117
+GG G +G S++ +GG G G GG G + G G+ G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1118 GGTGGNAGNGGHGANADVGSGKAGGNGGDGGDPGVGGIGGQGGAGSIAGVEGAAGVA 1174
G GG GN G G + G+ A G P + G G A SI+ +A +A
Sbjct: 62 HGNGGGNGNSG-GGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIA 117



Score = 32.0 bits (72), Expect = 0.025
Identities = 33/100 (33%), Positives = 40/100 (40%), Gaps = 3/100 (3%)

Query: 1415 GAGGAGGDGGLYGNGGN--GGDGGTGGKGTVGD-SGATLSADGDRGGAGATGGVGGAGGD 1471
G G G + G + GN GG G G G D SG + + GG+G+ GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1472 GGAKGGNGGLGGSGGTGGMGGTGGDGAHGADMASGSGANG 1511
G G GGSG G + A G S GA G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 32.0 bits (72), Expect = 0.029
Identities = 31/99 (31%), Positives = 37/99 (37%), Gaps = 3/99 (3%)

Query: 261 GGAGGDSAGSGSAGAGGDGGHGGLWGRGGAG---GIGGVNSDSGDGGAGGGGGAGGRLFG 317
G G + G+ S +GG GL GGA G N+ G G G GG G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 318 SGGAGGAGGTGAVAGSGGDGGAGGAAVGLWGLGGHGGAG 356
+GG G G G+ G A A G L G G
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 31.6 bits (71), Expect = 0.032
Identities = 22/83 (26%), Positives = 30/83 (36%)

Query: 1116 GVGGTGGNAGNGGHGANADVGSGKAGGNGGDGGDPGVGGIGGQGGAGSIAGVEGAAGVAP 1175
G G G N G N + G G GG G G GS +G+ G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1176 TSGGNGGDGGNGASGATGVNGGA 1198
+GG G+ G G+ ++ A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVA 85



Score = 31.6 bits (71), Expect = 0.033
Identities = 29/80 (36%), Positives = 32/80 (40%), Gaps = 2/80 (2%)

Query: 941 GTGGAGSTVGAHGADGF--SPTTGGNGGDGGSGGSGFQGVKGGAGGVGGDGGLYGNGGTG 998
G G G GAH G TG G G S GSG+ GG G G +G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 999 GTGGTGGIGFNGSTTIGGGN 1018
G GG G GS T G +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLS 82



Score = 31.6 bits (71), Expect = 0.034
Identities = 23/73 (31%), Positives = 29/73 (39%)

Query: 1193 GVNGGAGGSGGDGGLYGNGGDGGDGGNGGKGQVGSTGPVAGSAGGTGGTGGTGGQGGAGG 1252
G N GA + G+ G G G + G G P G +G GG G G GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 1253 SGGALAGHGGDGG 1265
+G + G G G
Sbjct: 68 NGNSGGGSGTGGN 80


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3764  CARBMTKINASE  39     2e-05    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 38.7 bits (90), Expect = 2e-05
Identities = 25/104 (24%), Positives = 44/104 (42%), Gaps = 7/104 (6%)

Query: 156 DNDRLSALVAHLVGADALVLLSDIDGLYDADPGKFQNARFIPEVSGPADLDGVVAGQGSH 215
D D +A V AD ++L+D++G G + +++ EV +L + H
Sbjct: 214 DKDLAGEKLAEEVNADIFMILTDVNGAA-LYYGT-EKEQWLREVK-VEELRKYY--EEGH 268

Query: 216 LGTGGMASKMSSALLAADA-GVPVLLAPAADAAAALTDASVGTV 258
G M K+ +A+ + G ++A A AL + GT
Sbjct: 269 FKAGSMGPKVLAAIRFIEWGGERAIIAHLEKAVEAL-EGKTGTQ 311


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits  Score  E-value  Comments
MMAR_3765  SECA  31     0.009    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 31.4 bits (71), Expect = 0.009
Identities = 21/101 (20%), Positives = 40/101 (39%), Gaps = 29/101 (28%)

Query: 387 IGQTNFDNDEAVGYLADRLARLGVEEELL---------RLGAKPGC--AVTIGEMTFDWE 435
+G + + E +++ L + G++ +L + A+ G AVTI
Sbjct: 454 VGTISIEKSE---LVSNELTKAGIKHNVLNAKFHANEAAIVAQAGYPAAVTIA------- 503

Query: 436 PQTPAGGHVAMSGRGTDVRLERSDRVGAAERKAARRQRRER 476
M+GRGTD+ L S + A + ++ E+
Sbjct: 504 --------TNMAGRGTDIVLGGSWQAEVAALENPTAEQIEK 536


112  MMAR_3790  MMAR_3798  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_3790  -1      30       -4.437547  isochorismatase family protein
MMAR_3792  0       29       -3.336884  transposase
MMAR_3793  2       30       -4.209630  transposase
MMAR_3794  2       31       -4.542491  transcriptional regulatory protein
MMAR_3795  2       31       -4.726829  adenylate cyclase
MMAR_3796  2       31       -4.660141  type I modular polyketide synthase
MMAR_3797  2       29       -4.598737  type I modular polyketide synthase
MMAR_3798  2       28       -4.874447  type I modular polyketide synthase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3790  ISCHRISMTASE  35     1e-04    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 35.4 bits (81), Expect = 1e-04
Identities = 21/84 (25%), Positives = 33/84 (39%)

Query: 138 LTANSWGAAILDGLVVDDIDIHVNKHRMSGFWDTELDSILRNLGVRHLFLCGVNVDQCVY 197
L + + I+ L +D D+ + K R S F T L ++R G L + G+
Sbjct: 99 LNSGPYEEKIITELAPEDDDLVLTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCL 158

Query: 198 ATLIDAACAGYDCLLITDASATTS 221
T +A + DA A S
Sbjct: 159 VTACEAFMEDIKAFFVGDAVADFS 182


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3796  NUCEPIMERASE  33     0.011    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 32.8 bits (75), Expect = 0.011
Identities = 22/149 (14%), Positives = 43/149 (28%), Gaps = 38/149 (25%)

Query: 1381 TVLITGGTGMAGGWLARHVVDHYGVRHVLLASRSGDRAGGSAEIA-----------AELA 1429
L+TG G G +++ +++ G + G + EL
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEA------------GHQVVGIDNLNDYYDVSLKQARLELL 49

Query: 1430 ARGVQVEVVACDVADRDAVTALLARLPQQYPLTGVIH---AAGVLDDAVITSLTPDRVDT 1486
A+ + D+ADR+ + L V V + +
Sbjct: 50 AQP-GFQFHKIDLADRE----GMTDLFASGHFERVFISPHRLAV-------RYSLENPHA 97

Query: 1487 VLRAKVDAAWNLHELTRDLGVSAFVLFSS 1515
+ + N+ E R + + SS
Sbjct: 98 YADSNLTGFLNILEGCRHNKIQHLLYASS 126


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3797  DHBDHDRGNASE  32     0.016    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 32.3 bits (73), Expect = 0.016
Identities = 33/164 (20%), Positives = 55/164 (33%), Gaps = 10/164 (6%)

Query: 1432 VLITGGTGVLGMALARHLATHHHCEHLLLVSRRGVAADGAQELRAELAGHGCQVEFAACD 1491
ITG +G A+AR LA ++ + +++ + L E D
Sbjct: 11 AFITGAAQGIGEAVARTLA-----SQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 1492 TADSDQLSTLLQSIPVE-HPLGAVIHAAGVLSDGVIEGLGREQVEQVLRPKLDAALLLHE 1550
DS + + I E P+ +++ AGVL G+I L E+ E
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 1551 LT----QDLDLSAFVLFSSAAGVLGSPGQANYAAANAFLDALAQ 1590
D + V S + A YA++ A +
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTK 169


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3798  DHBDHDRGNASE  50     1e-07    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 49.7 bits (118), Expect = 1e-07
Identities = 49/230 (21%), Positives = 85/230 (36%), Gaps = 27/230 (11%)

Query: 3276 ALVTGVTGHLGQHIARWLAQAGASHLVLLSRTAAEHPQVAELEKELNSAGITTTSISVDV 3335
A +TG +G+ +AR LA GA ++ ++ ++ L + + DV
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAH----IAAVDYNPEKLEKVVSSLKAEARHAEAFPADV 66

Query: 3336 TDRDALAAVVAETRIEHGPIHTVVHAAAHIGLVTTAETTIDEFIKSFAAKALGAENLI-- 3393
D A+ + A E GPI +V+ A + + +E+ +F+ + G N
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126

Query: 3394 --AVLEDQPPQTFIMFSSAAATWGGTRQGAYAAANAYIEALVT----RLRGRG--CRAIA 3445
+ D+ + + S A T AYA++ A L C ++
Sbjct: 127 VSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186

Query: 3446 PA-------WGAWTDDRTTTQEVVGYFSR----IGLNQIS--PDIAFAAL 3482
P W W D+ Q + G I L +++ DIA A L
Sbjct: 187 PGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVL 236


113  MMAR_3848  MMAR_3871  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_3848  -1      11       -0.757689  acetyl-/propionyl-coenzyme A carboxylase alpha
MMAR_3849  0       10       -1.343465  acetyl-/propionyl-CoA carboxylase subunit beta
MMAR_3850  -1      9        -1.134269  succinyl-CoA:3-ketoacid-coenzyme A transferase
MMAR_3851  -3      7        -0.601217  succinyl-CoA:3-ketoacid-coenzyme A transferase
MMAR_3852  -3      7        -0.379815  AMP-binding protein
MMAR_3853  -1      8        0.086501   putative regulatory protein
MMAR_3854  -3      10       -0.172424  proline rich membrane protein
MMAR_3855  -2      11       -0.595885  integral membrane leucine and alanine rich
MMAR_3856  -1      12       -1.014018  short-chain type dehydrogenase/reductase
MMAR_3857  -2      12       -1.272066  transposase for ISMyma04
MMAR_3858  0       11       -1.742839  hypothetical protein
MMAR_3859  0       11       -1.547162  oligoribonuclease
MMAR_3860  -2      9        -1.817726  *hypothetical protein
MMAR_3861  -2      9        -1.958941  putative regulatory protein
MMAR_3862  -1      8        -1.903168  YrbE family protein, YrbE5A
MMAR_3863  -2      8        -1.692648  integral ABC-type transport protein
MMAR_3864  -2      8        -1.320828  Mce protein, Mce5A
MMAR_3865  -1      10       -2.065606  Mce family protein, Mce5B
MMAR_3866  -1      9        -1.564747  Mce family protein, Mce5C
MMAR_3867  0       11       -1.467342  MCE-family protein Mce5D
MMAR_3868  0       11       -1.469822  Mce family protein, Mce5E
MMAR_3869  1       10       -1.372381  Mce family protein Mce5F
MMAR_3870  3       14       -1.279983  hypothetical protein
MMAR_3871  2       17       0.729486   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
MMAR_3848  RTXTOXIND  29     0.049    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 29.4 bits (66), Expect = 0.049
Identities = 11/60 (18%), Positives = 26/60 (43%), Gaps = 5/60 (8%)

Query: 606 AVSEGDVVVIIEAMKMEHPLVAPISGRV-EVLVAVGDQVKVEQVLARLIPDTEQAQSKDS 664
A + G + + +++ + V E++V G+ V+ VL +L +A + +
Sbjct: 84 ATANGKLTHSGRSKEIKPIE----NSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKT 139


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3851  TYPE3OMGPROT  29     0.017    Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature.

Length = 607

Score = 29.1 bits (65), Expect = 0.017
Identities = 20/69 (28%), Positives = 25/69 (36%), Gaps = 14/69 (20%)

Query: 124 GTQIADGGLPWRYDASGAVAVVSPPKETREF--DGATYVLE-----RGIRTD------FA 170
+ I + WR DAS + VS P E A LE R +T F
Sbjct: 130 RSGIWEPRFGWRPDASNRLVYVSGPPRYLELVEQTAA-ALEQQTQIRSEKTGALAIEIFP 188

Query: 171 LVHAWKGDR 179
L +A DR
Sbjct: 189 LKYASASDR 197


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_3853  HTHTETR  75     1e-18    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 74.7 bits (183), Expect = 1e-18
Identities = 37/183 (20%), Positives = 65/183 (35%), Gaps = 9/183 (4%)

Query: 13 PNRRSQQKSDRRLQLLSAAERLFAERGFLAVRLEDIGAAAGISGPAIYRHFPNKESLLVE 72
+ Q+ + R +L A RLF+++G + L +I AAG++ AIY HF +K L E
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 73 LLVGISSRLLAGARQVTTN-SNDAAAALDGLIDFHLDFALGEPDLIRIQDRDLGHLPAAA 131
+ S + + D + L ++ L+ + E + +
Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVG 121

Query: 132 ERQ-VRKAQRQYVEIWVGVLRQL------DPGLAEA-DARLMAHAAFGLLNSTPHSMKSA 183
E V++AQR + Q L R A G ++ + A
Sbjct: 122 EMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFA 181

Query: 184 DTK 186

Sbjct: 182 PQS 184


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3854  SURFACELAYER  30     0.008    Lactobacillus surface layer protein signature.
		>SURFACELAYER#Lactobacillus surface layer protein signature.

Length = 439

Score = 30.4 bits (68), Expect = 0.008
Identities = 20/119 (16%), Positives = 38/119 (31%), Gaps = 5/119 (4%)

Query: 128 ALEPLPPIAGPSSTIAPPTTRTSPTPSSSPAPTTTSGSATPTTTPTAGAMQTVVYTVTGE 187
AL + PIA + + TT + + ++ TP+ + A ++
Sbjct: 14 ALLAVAPIAATAMPVNAATTINADSAINANTNAKYDVDVTPSISAIAAVAKSDTMPAIPG 73

Query: 188 GRAISVTYMDTGDVIQTEFNVALPWSKEVSLSRSANHPASVTIVNIGHNVTCSVTVSGV 246
S++ G S +++ S N+ + T VTV V
Sbjct: 74 SLTGSISASYNGKSYTANLP---KDSGNATITDSNNNTVKPAELEADKAYT--VTVPDV 127


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_3856  DHBDHDRGNASE  80     8e-20    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 79.7 bits (196), Expect = 8e-20
Identities = 57/185 (30%), Positives = 92/185 (49%), Gaps = 3/185 (1%)

Query: 12 AVVTGASQNIGEALATELAARGHNLIVTARRESLLNELAARLTDKYRVTVEVRPADLADP 71
A +TGA+Q IGEA+A LA++G ++ L ++ + L + R E PAD+ D
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEAR-HAEAFPADVRDS 69

Query: 72 QERTKLTDELAAR--PISILCANAGTATFGPVASLDPAGEKAQVQLNAVAVHDLTLAVLP 129
++T + PI IL AG G + SL +A +N+ V + + +V
Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSK 129

Query: 130 GMIERRAGGILISGSAAGNSPIPYNATYAATKAFANTFSESLRGELRGSGVHVTLLAPGP 189
M++RR+G I+ GS P A YA++KA A F++ L EL + +++PG
Sbjct: 130 YMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGS 189

Query: 190 VRTDL 194
TD+
Sbjct: 190 TETDM 194


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_3858  TONBPROTEIN  30     0.029    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 29.6 bits (66), Expect = 0.029
Identities = 22/104 (21%), Positives = 33/104 (31%), Gaps = 6/104 (5%)

Query: 409 RMRAPRSLMGAIGSEAIRAVAQASPLQAVYGQTIDRESAHEMLSAKYAPATEAAEPPATA 468
R P L I + + S Q + + + M++ EPP
Sbjct: 8 RFPWPTLLSVCIHGAVVAGLLYTSVHQVIELPAPAQPISVTMVTP------ADLEPPQAV 61

Query: 469 PQPPRGKYDPLPWPQDFDLPPMPTPVEPQGPPLWEEVLKNPTVK 512
PP +P P P+ PP PV + P + P K
Sbjct: 62 QPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKK 105


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_3861  HTHTETR  51     3e-10    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 51.2 bits (122), Expect = 3e-10
Identities = 27/190 (14%), Positives = 59/190 (31%), Gaps = 13/190 (6%)

Query: 28 IQQTAHRLFAERGFDAVTTEDIAAAAGVSISTYFRHAPTKEGLLVDPVRQAITEIVSSYR 87
I A RLF+++G + + +IA AAGV+ + H K L + + + I
Sbjct: 16 ILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELEL 75

Query: 88 ---------SWPADESAVEALIALFVSYARDAGDLKLDTWRRAIATAPYLLSKSALVSED 138
+ ++ V+ R +++ + ++ ++
Sbjct: 76 EYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCL 135

Query: 139 DQHRFIEHVASRMGV--DARTDIRPALLVHTSLATVKFVFDRWLSTDPPSSPIFHVQMDQ 196
+ + IE D+ + + + WL P S +
Sbjct: 136 ESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF--APQSFDLKKEARD 193

Query: 197 ALRIALAGFR 206
+ I L +
Sbjct: 194 YVAILLEMYL 203


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_3864  PF05616  30     0.023    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 30.1 bits (67), Expect = 0.023
Identities = 25/73 (34%), Positives = 30/73 (41%), Gaps = 9/73 (12%)

Query: 383 PPKDLAPPPGTAVGPDGN-LVALGP---PLINPSPNL---TDPNPPLPAWLTPSPRVPGT 435
P DL P G+A P+ L + P P NP+PN T PNP L P
Sbjct: 311 PRPDLTP--GSAEAPNAQPLPEVSPAENPANNPAPNENPGTRPNPEPDPDLNPDANPDTD 368

Query: 436 GDPDDAPPAPPAP 448
G P P +P P
Sbjct: 369 GQPGTRPDSPAVP 381


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_3865  PF07675  29     0.040    Cleaved Adhesin
		>PF07675#Cleaved Adhesin

Length = 1358

Score = 28.9 bits (64), Expect = 0.040
Identities = 23/57 (40%), Positives = 28/57 (49%), Gaps = 4/57 (7%)

Query: 115 SPDQLAPGSVIPVQRTEPSFDVTALLNGYEPLFSLLNPRDADNL--TKGIIESLQGD 169
D GSVIP T P F TA N Y F L P +AD + T+ II + QG+
Sbjct: 409 DADHNTFGSVIPA--TGPLFTGTASSNLYSANFEYLTPANADPVVTTQNIIVTGQGE 463


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_3870  FLGHOOKFLIK  29     0.012    Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 29.4 bits (65), Expect = 0.012
Identities = 28/107 (26%), Positives = 43/107 (40%), Gaps = 18/107 (16%)

Query: 88 AMPLLLAGALALSAFLGWQQWQQ---HQVKLAGQQAQQAA--------IAYAQVLTSIDS 136
PL A LSA LG +WQQ + L +Q QQ+A + Q+ +D
Sbjct: 220 TQPLPTVAAPVLSAPLGSHEWQQSLSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDD 279

Query: 137 NNVD-------QNFRQVLDGATGEFKDMYTQSSVQLRQLLIDNKASA 176
N Q+ R L+ A + +S +QL Q I ++ +
Sbjct: 280 NQAQIQMVSPHQHVRAALEAALPVLRTQLAESGIQLGQSNISGESFS 326


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_3871  IGASERPTASE  31     0.005    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 31.2 bits (70), Expect = 0.005
Identities = 19/101 (18%), Positives = 27/101 (26%), Gaps = 7/101 (6%)

Query: 155 TSNPLTAIPPLGQPYIQAALDAAAPPHDPGTYPFVADRWTPDKVYSGGYRAWA----LTP 210
N Q L A D G+ FV DR ++ G Y WA +
Sbjct: 260 FGNSKEEHSDPKGILSQDPLTNYAVLGDSGSPLFVYDREKGKWLFLGSYDFWAGYNKKSW 319

Query: 211 DELVLYLPDYPVG---HDEPIDFTPGAAQWSMDGGAVVAHI 248
E +Y + D +S + I
Sbjct: 320 QEWNIYKSQFTKDVLNKDSAGSLIGSKTDYSWSSNGKTSTI 360


114  MMAR_4051  MMAR_4054  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_4051  -1      11       -1.303027  MCE-family protein Mce3A
MMAR_4052  0       11       -1.590293  MCE-family protein Mce3B
MMAR_4053  -1      12       -0.674019  MCE-family protein, Mce3C
MMAR_4054  0       10       -1.051218  MCE-family protein Mce3D
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
MMAR_4051  PERTACTIN  34     0.001    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 34.3 bits (78), Expect = 0.001
Identities = 23/62 (37%), Positives = 27/62 (43%), Gaps = 6/62 (9%)

Query: 429 APDGTPLW----PGLPPAPPPGAPRESGPTPGSEPFVVPAPAQAQPTPLPPAPLPQEVAP 484
A +G W PPAP P + GP PG +P P P Q P PP P+ AP
Sbjct: 553 AANGNGQWSLVGAKAPPAPKPAP--QPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAP 610

Query: 485 SP 486
P
Sbjct: 611 QP 612


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_4052  BACINVASINC  31     0.007    Salmonella/Shigella invasin protein C signature.
		>BACINVASINC#Salmonella/Shigella invasin protein C signature.

Length = 409

Score = 31.0 bits (69), Expect = 0.007
Identities = 24/136 (17%), Positives = 55/136 (40%), Gaps = 8/136 (5%)

Query: 117 GSPKQLKPGDTIPMTHTSPALDLDALIGGFRPLLKALDPEQVNALSGQLIRALQGEGATI 176
G+ D ++ + ++L G + K + PE LS +L +++ +
Sbjct: 252 GTDATKNLNDATLKSNAGTSA-TESL--GIKNSNKQISPEHQAILSKRLE-SVESDIRLE 307

Query: 177 NSFLAQTAALTTTLADRDQLIGDVIINLNVVLGSLGDQNKQFAKAVDALAELMEGLQARK 236
T +T A + Q+ GD+I+ +V +G + ++Q+A + + + + R
Sbjct: 308 Q----NTMDMTRIDARKMQMTGDLIMKNSVTVGGIAGASRQYAATQERSEQQISQVNNRV 363

Query: 237 EDITKGVAYTNAAASS 252
A ++ S+
Sbjct: 364 ASTASDEARESSRKST 379


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4053  PRTACTNFAMLY  32     0.005    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 32.3 bits (73), Expect = 0.005
Identities = 15/49 (30%), Positives = 15/49 (30%)

Query: 388 PLPAPPPGGPPPGPPAPAPPELASIPQPTPSSVLVPAPGEVSAPQTAGA 436
P P P P P P P P A PQP L A G
Sbjct: 573 PAPKPAPQPGPQPPQPPQPQPEAPAPQPPAGRELSAAANAAVNTGGVGL 621


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_4054  TONBPROTEIN  31     0.008    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 30.7 bits (69), Expect = 0.008
Identities = 12/44 (27%), Positives = 16/44 (36%)

Query: 390 PPQAIPPQAIPPQQAPPAAAPPAQAPPPAVGPPLPAEAPATPDP 433
PPQA+ P P + P P + P A + P P
Sbjct: 57 PPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKP 100


115  MMAR_4182  MMAR_4187  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_4182  1       9        1.884097   hypothetical protein
MMAR_4183  1       11       2.049493   oxidoreductase
MMAR_4184  2       11       1.532868   cytochrome P450 130A4 Cyp130A4
MMAR_4185  2       11       0.653364   transcriptional regulatory protein
MMAR_4186  2       11       0.420118   PE-PGRS family protein
MMAR_5552  -3      9        -0.934582  hypothetical protein
MMAR_4187  -2      9        -0.489966  PPE family protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_4182  TCRTETA  39     2e-05    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 39.4 bits (92), Expect = 2e-05
Identities = 91/411 (22%), Positives = 146/411 (35%), Gaps = 45/411 (10%)

Query: 8 PAFLILFATLAACAGNGISIVAFPWLVLEREGSAGQAS---IVAGAITLPLLFSTLVAGT 64
P +IL G G+ + P L+ + S + I+ L V G
Sbjct: 6 PLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGA 65

Query: 65 AVDYFGRRPVSMVSDALSGSAVAIVPLIAWAFGDDAVSVAVLAAL--GACSATFD-PAGI 121
D FGRRPV +VS L+G+AV + A + V + + G AT
Sbjct: 66 LSDRFGRRPVLLVS--LAGAAVDYAIM---ATAP-FLWVLYIGRIVAGITGATGAVAGAY 119

Query: 122 TARQSMLPEAAARAGWSLDRINSSYEAILNLAFVVGPGIGGLMIATVGGITTM--WITAG 179
A + E A G+ A V GP +GGLM GG + + A
Sbjct: 120 IADITDGDERARHFGF--------MSACFGFGMVAGPVLGGLM----GGFSPHAPFFAAA 167

Query: 180 AFGLSLLAIGALRLEGAGRPHHETRPDGLV-SGVTQGLRFVWNLRVLRALALIDLTVTAL 238
A G L + + E RP R+ + V+ AL + + L
Sbjct: 168 ALNGLNFLTGCFLLPESHKG--ERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQ-L 224

Query: 239 YLPMESVLFPTYFSDRQQ--PAQLGSVLVALS-LGSLVGALGYGMLSNYVSRRTIMLTAV 295
+ + L+ + DR +G L A L SL A+ G ++ + R ++
Sbjct: 225 VGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALM--- 281

Query: 296 LTLGAATTGIALLPPLP----VILVLCALIGVVYGPIQPIYNYVMQTRAPHYLRGRVVGV 351
L + A TG LL ++ L G P ++ + +G++ G
Sbjct: 282 LGMIADGTGYILLAFATRGWMAFPIMVLLASG--GIGMPALQAMLSRQVDEERQGQLQGS 339

Query: 352 MTSLAYAAGPLGLLLAGPVADAAGLKVAFFALALPIAITGLMCIRLPSLRE 402
+ +L +G LL + A+ + + IA L + LP+LR
Sbjct: 340 LAALTSLTSIVGPLLFTAIYAAS---ITTWNGWAWIAGAALYLLCLPALRR 387


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_4185  HTHTETR  62     5e-14    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 61.6 bits (149), Expect = 5e-14
Identities = 28/163 (17%), Positives = 50/163 (30%), Gaps = 10/163 (6%)

Query: 11 RSEVAADRILDAAERLYTEHDPASIGMNEIARAAGCSRATLYRYFESREALRTAYVHRET 70
++ ILD A RL+++ +S + EIA+AAG +R +Y +F+ + L +
Sbjct: 8 EAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSE 67

Query: 71 LRLGREIMQQIGDI-DDPRDQLITSITATLRMVRQRPALAAWFASTRPPIGGELAGQSEV 129
+G ++ DP L + L E G+ V
Sbjct: 68 SNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHK--CEFVGEMAV 125

Query: 130 IAGLAAAFLQ-------SLGPEAPAAVERRARWTVRVIVSLLM 165
+ A A R ++
Sbjct: 126 VQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMR 168


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_4186  cloacin  40     2e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 40.5 bits (94), Expect = 2e-05
Identities = 41/112 (36%), Positives = 51/112 (45%), Gaps = 12/112 (10%)

Query: 202 GGDGTGNGFGSLSGGGNGGSGGGAGLWGVGGAGGNGGAGGSPTVPGHAGGNGGSGGIGGA 261
GGDG G+ G+ S GN GG G G GG+ G + N GG G+
Sbjct: 3 GGDGRGHNTGAHSTSGNIN----------GGPTGLGVGGGASDGSGWSSENNPWGGGSGS 52

Query: 262 GGVFGNGGAGGNGGIGGTGGTGGNGGIGGNGAAGGAGGLWGDGGVGGNGAVG 313
G +G G GNG GG G +GG G GGN +A A +G + GA G
Sbjct: 53 GIHWGGGSGHGNG--GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 39.7 bits (92), Expect = 3e-05
Identities = 24/77 (31%), Positives = 29/77 (37%)

Query: 284 GNGGIGGNGAAGGAGGLWGDGGVGGNGAVGGNSGGGFGVMNDGGSGGHGGDARLFGNGGN 343
G G G N A G G G G + G G+ N+ GG G G G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 344 GGAGAVGGAGGNGADGG 360
G G G +GG GG
Sbjct: 63 GNGGGNGNSGGGSGTGG 79



Score = 37.8 bits (87), Expect = 1e-04
Identities = 32/81 (39%), Positives = 36/81 (44%), Gaps = 1/81 (1%)

Query: 593 NGGVGGAGGVGAFGSGGTGGNGGAGGAVGNGGAGGDAGSSGNLSPAGGGKGGNAKLVGNG 652
+GG G GA + G NGG G GGA +G S +P GGG G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNI-NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 653 GDGGAGVFGGLGGDGGTGGQL 673
G G G G GG GTGG L
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNL 81



Score = 37.8 bits (87), Expect = 1e-04
Identities = 33/98 (33%), Positives = 44/98 (44%), Gaps = 5/98 (5%)

Query: 125 GANGTAANPNGGDGGLLYGNGGNGFTQTGNNNVAGGNGGNAGLIGNGGAGGGGGTAFAGG 184
GA+ T+ N NGG GL G G + + + N G G +G+ GG+G G GG
Sbjct: 12 GAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGN-----GG 66

Query: 185 NGGHGGLLYGNGGAGAIGGDGTGNGFGSLSGGGNGGSG 222
G+ G G GG + GF +LS G GG
Sbjct: 67 GNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 37.0 bits (85), Expect = 2e-04
Identities = 32/88 (36%), Positives = 38/88 (43%), Gaps = 2/88 (2%)

Query: 160 GNGGNAGLIGNGGAGGGGGTAFAGGNGGHGGLLYGNGGAGAIGGDGTGNGFGSLSGGGNG 219
G G N G G GG T G G G + + GG G+G +G SG GNG
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 220 GSGGGAGLWGVGGAGGNGGAGGSPTVPG 247
G G +G G G GGN A +P G
Sbjct: 66 GGNGNSG--GGSGTGGNLSAVAAPVAFG 91



Score = 36.6 bits (84), Expect = 3e-04
Identities = 26/78 (33%), Positives = 33/78 (42%)

Query: 251 GNGGSGGIGGAGGVFGNGGAGGNGGIGGTGGTGGNGGIGGNGAAGGAGGLWGDGGVGGNG 310
G G + G G G G G G + G+G + G G+G WG G GNG
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 311 AVGGNSGGGFGVMNDGGS 328
GNSGGG G + +
Sbjct: 66 GGNGNSGGGSGTGGNLSA 83



Score = 35.8 bits (82), Expect = 5e-04
Identities = 32/94 (34%), Positives = 39/94 (41%), Gaps = 4/94 (4%)

Query: 173 AGGGGGTAFAGGNGGHGGLLYGNGGAGAIGGDGTGNGFGSLSGGGNGGSGGGAGLWGVGG 232
+GG G G + G + G G G GG G+G+ S + GGSG G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 233 AGGNGGAGGSPTVPGHAGGNGGSGGIGGAGGVFG 266
G GG G S G G GG+ A FG
Sbjct: 62 HGNGGGNGNS----GGGSGTGGNLSAVAAPVAFG 91



Score = 33.1 bits (75), Expect = 0.004
Identities = 32/103 (31%), Positives = 40/103 (38%), Gaps = 5/103 (4%)

Query: 562 GDGGLGGDGGIGLAGTGGNGGNGGDAVGVIGNGGVGGAGGVGAFGSG-----GTGGNGGA 616
G G G + G NGG G VG + G G + +G G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 617 GGAVGNGGAGGDAGSSGNLSPAGGGKGGNAKLVGNGGDGGAGV 659
G GNG +GG +G+ GNLS + G GG V
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAV 105



Score = 32.8 bits (74), Expect = 0.005
Identities = 35/130 (26%), Positives = 43/130 (33%), Gaps = 19/130 (14%)

Query: 323 MNDGGSGGHGGDARLFGNGGNGGAGAVGGAGGNGADGGIGGQFFGNGGDGGAGGIGTAGL 382
M+ G GH A NGG +G GG G + GG G+G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSG------- 53

Query: 383 AGSGGTGGSAVGLVGNGGTGGAGGLGPIGGAGGNGGGGGLIGNGGNGGAGGAASATVGTP 442
+ G G G GG G GG+G GG L G A +T G
Sbjct: 54 ----------IHWGGGSGHGNGGGNG--NSGGGSGTGGNLSAVAAPVAFGFPALSTPGAG 101

Query: 443 APGVGGNGGA 452
V + GA
Sbjct: 102 GLAVSISAGA 111



Score = 32.0 bits (72), Expect = 0.009
Identities = 30/100 (30%), Positives = 33/100 (33%)

Query: 231 GGAGGNGGAGGSPTVPGHAGGNGGSGGIGGAGGVFGNGGAGGNGGIGGTGGTGGNGGIGG 290
GG G G T GG G G GGA G G G G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 291 NGAAGGAGGLWGDGGVGGNGAVGGNSGGGFGVMNDGGSGG 330
G G G G AV GF ++ G+GG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 30.8 bits (69), Expect = 0.020
Identities = 30/87 (34%), Positives = 33/87 (37%), Gaps = 7/87 (8%)

Query: 269 GAGGNGGIGGTGGTGGNGGIGGNGAAGGAGGLWGDGGVGGNGAVGGNSGGGFGVMNDGGS 328
G G G G T GN G G G G G G N GG G G G+ GGS
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGG--GSGSGIHWGGGS 60

Query: 329 GGHGGDARLFGNGGNGGAGAVGGAGGN 355
G G G GN G G+ G +
Sbjct: 61 GHGNG-----GGNGNSGGGSGTGGNLS 82



Score = 30.5 bits (68), Expect = 0.023
Identities = 33/112 (29%), Positives = 43/112 (38%), Gaps = 12/112 (10%)

Query: 424 GNGGNGGAGGAASATVGTPAPGVGGNGGAAGLFGDGGNGGAGAPGLSGLGGAGGRGGYLI 483
G G N GA + G P G+G GGA+ +G + + GG G G +
Sbjct: 6 GRGHNTGAHSTSGNINGGPT-GLGVGGGAS-------DGSGWSSENNPWGGGSGSGIHWG 57

Query: 484 GSGGNGGAGAGGGDGGYLSGNGGNGGDGVIVGNG----SAGGAGGNALGLFG 531
G G+G G G GG G V G S GAGG A+ +
Sbjct: 58 GGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109



Score = 30.1 bits (67), Expect = 0.030
Identities = 21/79 (26%), Positives = 28/79 (35%)

Query: 143 GNGGNGFTQTGNNNVAGGNGGNAGLIGNGGAGGGGGTAFAGGNGGHGGLLYGNGGAGAIG 202
G G N + + N+ GG G G G G G G+ +G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 203 GDGTGNGFGSLSGGGNGGS 221
G +G GS +GG
Sbjct: 66 GGNGNSGGGSGTGGNLSAV 84



Score = 30.1 bits (67), Expect = 0.034
Identities = 26/80 (32%), Positives = 29/80 (36%), Gaps = 4/80 (5%)

Query: 464 AGAPGLSGLGGAGGRGGYLIGSGGNGGAGAGGGDGGYLSG----NGGNGGDGVIVGNGSA 519
+G G GA G + G G G G DG S GG G G+ G GS
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 520 GGAGGNALGLFGHGGAGGAG 539
G GG G G GG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_4187  PF05616  43     5e-07    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 43.2 bits (101), Expect = 5e-07
Identities = 20/64 (31%), Positives = 28/64 (43%)

Query: 186 SSGGSPSPTPNPNPDPSPTPDPTPTPTPDPTPTPTPDPTPAPTPEPTPAPAPEPAPEPTP 245
+ G + +P P P+ SP +P P P+ P P+P P P P P + P P
Sbjct: 316 TPGSAEAPNAQPLPEVSPAENPANNPAPNENPGTRPNPEPDPDLNPDANPDTDGQPGTRP 375

Query: 246 VEPA 249
PA
Sbjct: 376 DSPA 379



Score = 40.9 bits (95), Expect = 3e-06
Identities = 18/51 (35%), Positives = 22/51 (43%)

Query: 195 PNPNPDPSPTPDPTPTPTPDPTPTPTPDPTPAPTPEPTPAPAPEPAPEPTP 245
P P+ P P P P+ +P P PAP P P PEP P+ P
Sbjct: 311 PRPDLTPGSAEAPNAQPLPEVSPAENPANNPAPNENPGTRPNPEPDPDLNP 361



Score = 39.0 bits (90), Expect = 1e-05
Identities = 21/72 (29%), Positives = 27/72 (37%), Gaps = 2/72 (2%)

Query: 177 LLPGCKAPSSSGGSPSPTPNPNPDPSPTPDPTPTPTPDPTPTPTPDPTPAPTPEPTPAPA 236
L PG + + P P +P +P+ P P P P P P PD P P+ P
Sbjct: 315 LTPG--SAEAPNAQPLPEVSPAENPANNPAPNENPGTRPNPEPDPDLNPDANPDTDGQPG 372

Query: 237 PEPAPEPTPVEP 248
P P P
Sbjct: 373 TRPDSPAVPDRP 384



Score = 38.6 bits (89), Expect = 2e-05
Identities = 17/55 (30%), Positives = 21/55 (38%)

Query: 191 PSPTPNPNPDPSPTPDPTPTPTPDPTPTPTPDPTPAPTPEPTPAPAPEPAPEPTP 245
P P P +P P P +P P P P P P P P P+ P+ P
Sbjct: 311 PRPDLTPGSAEAPNAQPLPEVSPAENPANNPAPNENPGTRPNPEPDPDLNPDANP 365



Score = 33.2 bits (75), Expect = 0.001
Identities = 17/62 (27%), Positives = 26/62 (41%)

Query: 166 SQWPPLQQLLRLLPGCKAPSSSGGSPSPTPNPNPDPSPTPDPTPTPTPDPTPTPTPDPTP 225
++ P Q L + P ++ + +P PNP+P P +P P D P PD
Sbjct: 320 AEAPNAQPLPEVSPAENPANNPAPNENPGTRPNPEPDPDLNPDANPDTDGQPGTRPDSPA 379

Query: 226 AP 227
P
Sbjct: 380 VP 381



Score = 32.8 bits (74), Expect = 0.001
Identities = 21/50 (42%), Positives = 22/50 (44%), Gaps = 3/50 (6%)

Query: 204 TPDPTPTPTPDPTPTPTPDPTPAPTPEPTPA--PAPEPAPEPTP-VEPAP 250
T D P PD TP P P PE +PA PA PAP P P P
Sbjct: 304 TVDVQVIPRPDLTPGSAEAPNAQPLPEVSPAENPANNPAPNENPGTRPNP 353



Score = 28.9 bits (64), Expect = 0.022
Identities = 17/46 (36%), Positives = 18/46 (39%), Gaps = 1/46 (2%)

Query: 206 DPTPTPTPDPTPTPTPDPTPAPTPEPTPAPAPEPAPEPTPV-EPAP 250
D T D P PD TP P P PE +P P PAP
Sbjct: 298 DSQGNTTVDVQVIPRPDLTPGSAEAPNAQPLPEVSPAENPANNPAP 343


116  MMAR_4206  MMAR_4224  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_4206  -1      10       1.993487   sugar-binding lipoprotein LpqY
MMAR_4207  1       13       2.152717   hypothetical protein
MMAR_4208  0       13       2.384366   proline and glycine rich transmembrane protein
MMAR_4209  0       10       1.614240   transport transmembrane protein
MMAR_4210  -1      12       1.786873   hypothetical protein
MMAR_4211  -1      9        1.485673   membrane protein
MMAR_4212  0       9        0.591059   Mrp-related protein Mrp
MMAR_4213  0       11       -0.609077  sec-independent translocase
MMAR_4214  -1      11       -1.008554  serine protease HtrA (DegP protein)
MMAR_4215  -1      11       -1.578988  hypothetical protein
MMAR_4216  -1      9        -0.676978  RNA polymerase sigma factor SigE
MMAR_4217  -1      10       -0.975135  methyltransferase
MMAR_4218  1       13       1.122570   PPE family protein
MMAR_4219  3       15       4.291804   transcriptional regulator
MMAR_4220  2       13       3.892246   antibiotic ABC transporter ATP-binding protein
MMAR_4221  4       13       3.866589   tetronasin-transport integral membrane protein
MMAR_4222  5       15       3.778594   integral membrane protein
MMAR_4223  6       15       4.272218   PE-PGRS family protein
MMAR_4224  2       14       2.293420   PE-PGRS family protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_4206  MALTOSEBP    52     3e-09    Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 51.7 bits (123), Expect = 3e-09
Identities = 42/157 (26%), Positives = 69/157 (43%), Gaps = 12/157 (7%)

Query: 152 RLYAAPVTTNTQLLWYRPDLVAQPPETWNAVVAEAGRLRAAGQPTWIAVQANQGEGLMVW 211
+L A P+ L Y DL+ PP+TW + A L+A G+ A+ N E W
Sbjct: 128 KLIAYPIAVEALSLIYNKDLLPNPPKTWEEIPALDKELKAKGKS---ALMFNLQEPYFTW 184

Query: 212 FNTVLSSVGGQVLSDDGTRVTLTDTPAHRAATVAALRVLKSVATAP--GADPSITRAEAG 269
++++ GG + + + D A A L L + AD + AEA
Sbjct: 185 --PLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDYSIAEA- 241

Query: 270 TARLAFEQGKAALELNWPYVFASMLENAVKGGVPFLP 306
AF +G+ A+ +N P+ ++++ + V GV LP
Sbjct: 242 ----AFNKGETAMTINGPWAWSNIDTSKVNYGVTVLP 274


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_4208  PF03544      30     0.006    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 29.9 bits (67), Expect = 0.006
Identities = 23/109 (21%), Positives = 25/109 (22%), Gaps = 1/109 (0%)

Query: 19 PPPGEQPSEQPFSPPPDAPWAAPEAASPADDYPAPSYPPPAYPPEPVGPGGYPPDYATGY 78
PP QP +P P P PE A P P P+PV P
Sbjct: 62 PPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPV 121

Query: 79 PPPPGYPPPGYPPYGAAAGEYGGTPYPPPPPPPAPMAAPYGAPPPNYPP 127
P P P A P YP
Sbjct: 122 ESRPASPFEN-TAPARPTSSTATAATSKPVTSVASGPRALSRNQPQYPA 169


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_4211  PF03544      40     1e-05    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 39.6 bits (92), Expect = 1e-05
Identities = 26/113 (23%), Positives = 37/113 (32%), Gaps = 3/113 (2%)

Query: 332 PQMPAPPPPMFPWMQP---TPQPLAPQPGCTLICVTNPPEAVPPPAPMPFGLMPPPPAPA 388
++PAP P+ M P A QP + P P P ++ P P
Sbjct: 41 IELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPK 100

Query: 389 APPPPDPLAPLGAPPVGGPPAAPAPAPAGPAPAPAGPAPAPGAPVPAGPAPTP 441
P P P+ + P P PA APA P + + P +
Sbjct: 101 PKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSV 153



Score = 30.7 bits (69), Expect = 0.008
Identities = 25/130 (19%), Positives = 34/130 (26%), Gaps = 6/130 (4%)

Query: 319 DPLSHMPLIDFAPPQMPAPPPPMFPWMQPTPQPLA-PQPGCTLICVTNPPEAVPPPAPMP 377
P + + AP + P QP P+P+ P+P I V P P
Sbjct: 45 APAQPISVTMVAPADLEPPQAV-----QPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKP 99

Query: 378 FGLMPPPPAPAAPPPPDPLAPLGAPPVGGPPAAPAPAPAGPAPAPAGPAPAPGAPVPAGP 437
P P P + P+ + P P A P
Sbjct: 100 KPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRA 159

Query: 438 APTPTPSGPA 447
P PA
Sbjct: 160 LSRNQPQYPA 169



Score = 29.2 bits (65), Expect = 0.031
Identities = 17/79 (21%), Positives = 19/79 (24%), Gaps = 9/79 (11%)

Query: 382 PPPPAPAAPPPPDPLAPLGA---------PPVGGPPAAPAPAPAGPAPAPAGPAPAPGAP 432
PP A PP P PV P P P P
Sbjct: 61 EPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKP 120

Query: 433 VPAGPAPTPTPSGPAPGEP 451
V + PA + PA
Sbjct: 121 VESRPASPFENTAPARPTS 139


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_4213  TATBPROTEIN  80     4e-22    Bacterial sec-independent translocation TatB protein...
		>TATBPROTEIN#Bacterial sec-independent translocation TatB protein signature.

Length = 171

Score = 79.7 bits (196), Expect = 4e-22
Identities = 24/70 (34%), Positives = 42/70 (60%), Gaps = 2/70 (2%)

Query: 1 MLVLVVVGLVVLGPERLPGAIRWSSGALRQARDYLSGVTSQLRDDMG-PEFDDLRGQLGE 59
+L++ ++GLVVLGP+RLP A++ +G +R R + V ++L ++ EF D ++ E
Sbjct: 9 LLLVFIIGLVVLGPQRLPVAVKTVAGWIRALRSLATTVQNELTQELKLQEFQDSLKKV-E 67

Query: 60 LQKLRGMTPR 69
L +TP
Sbjct: 68 KASLTNLTPE 77


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_4214  V8PROTEASE   75     1e-16    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 74.7 bits (183), Expect = 1e-16
Identities = 41/191 (21%), Positives = 73/191 (38%), Gaps = 26/191 (13%)

Query: 204 RFTKVAAAVADSVVTIETKSDQEGMQGSGVIVDGRGYIVTNNHVISEAANNPSQFKTTVV 263
+ T V I+ ++ SGV+V G+ ++TN HV+ +P K
Sbjct: 78 QITDTTNGHYAPVTYIQVEAPTGTFIASGVVV-GKDTLLTNKHVVDATHGDPHALKAFPS 136

Query: 264 FNDGKEVP------ASLVGRDPKTDLAVLKVDNV-------DNLTVARLGNSDKVRVGDE 310
+ P + + DLA++K + + A + N+ + +V
Sbjct: 137 AINQDNYPNGGFTAEQITKYSGEGDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQN 196

Query: 311 VLAVGAPLGLRSTVTEGIVSALHRPVPLSGEGSDTDTVIDAIQTDASINHGNSGGPLIDM 370
+ G P V+ + +G T +A+Q D S GNSG P+ +
Sbjct: 197 ITVTGYPGDKP-------VATMWE-----SKGKITYLKGEAMQYDLSTTGGNSGSPVFNE 244

Query: 371 DSQVIGINTAG 381
++VIGI+ G
Sbjct: 245 KNEVIGIHWGG 255


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_4218  cloacin      34     0.002    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 34.3 bits (78), Expect = 0.002
Identities = 26/83 (31%), Positives = 33/83 (39%), Gaps = 1/83 (1%)

Query: 571 NTGLWNT-GNVNTGVGGTGTHSGNSGFGNSGTGNSGFFNSGSYNSGLINSSAGGYSSGVG 629
NTG +T GN+N G G G G S + N+ + S G G G
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69

Query: 630 NSGGGGYNVGFFNSSAGGTDTGF 652
NSGGG G ++ A GF
Sbjct: 70 NSGGGSGTGGNLSAVAAPVAFGF 92



Score = 33.5 bits (76), Expect = 0.003
Identities = 24/77 (31%), Positives = 33/77 (42%), Gaps = 1/77 (1%)

Query: 415 SGSDNWGLANTGSTNWGAVNSGSLNTGIGNTGSTNTGWWNAGSVNDGLFNAGNANLGLAN 474
SG D G + G +N G G+G S +GW + + G +G G +
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 475 SGNGNLGGFNSGGGTSA 491
GNG G NSGGG+
Sbjct: 62 HGNGGGNG-NSGGGSGT 77



Score = 30.1 bits (67), Expect = 0.033
Identities = 29/100 (29%), Positives = 37/100 (37%), Gaps = 7/100 (7%)

Query: 205 GGNSGPSSTSGGIGNIGFGNYGNSNVGAGNGNAALGVPGSSYPGASSYNVGAGNIGQFNI 264
G N+G STSG NI G G G + + + + G S + G
Sbjct: 8 GHNTGAHSTSG---NINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGN 64

Query: 265 GLGNFGSGNIGFGNGNLVTAVA----GNPSASNVGAANIG 300
G GN SG GNL A G P+ S GA +
Sbjct: 65 GGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 29.7 bits (66), Expect = 0.046
Identities = 22/66 (33%), Positives = 26/66 (39%), Gaps = 3/66 (4%)

Query: 184 GIGNIGNFNVGSGNFGSYNFGGGNSGPSSTSGGIGNIGFGNYGNSNVGAGNGNAALGVPG 243
G + N G G + GG G GG GN G G+ N+ A A G P
Sbjct: 38 GWSSENNPWGGGSGSGIHWGGGSGHGNG---GGNGNSGGGSGTGGNLSAVAAPVAFGFPA 94

Query: 244 SSYPGA 249
S PGA
Sbjct: 95 LSTPGA 100


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_4219  HTHTETR      56     8e-12    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 55.8 bits (134), Expect = 8e-12
Identities = 36/155 (23%), Positives = 54/155 (34%), Gaps = 19/155 (12%)

Query: 10 ARIRDAAIEQFGQHGF-GVSLRAIAEGAGVSAALVIHHFGSKEGLRKACDNYVAEEIRSE 68
I D A+ F Q G SL IA+ AGV+ + HF K L I E
Sbjct: 14 QHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIG-E 72

Query: 69 KLTAMQSNDPATWLGQLAQV-----------ESYAPLMAYLVRSMQSGGELAMM------ 111
Q+ P L L ++ E LM + + GE+A++
Sbjct: 73 LELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRN 132

Query: 112 LWQQMIDNAEEYLAVGVRAGTIKASRDPKARAKFL 146
L + D E+ L + A + A + A +
Sbjct: 133 LCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIM 167


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_4223  cloacin      39     7e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.5 bits (89), Expect = 7e-05
Identities = 25/81 (30%), Positives = 33/81 (40%)

Query: 483 GGGGTGGDGGNNIIGANVGGDGGAGGAGGLGGNGTDGGWLSGNGGDGGAGGQGGDGGHGG 542
GG G G + G + N+ G G GG +G+ + G G G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 543 SPGGFDGKSGVGGDGGDGGNA 563
GG +G SG G G +A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSA 83



Score = 35.5 bits (81), Expect = 6e-04
Identities = 30/87 (34%), Positives = 34/87 (39%), Gaps = 3/87 (3%)

Query: 127 DGAPGQAGGDGGLLYGNGGNGGTSTTAGVAGGDGGAAGLIGNGGAGGGGGAGALGGNGGA 186
+G P G GG + G+G +S GG G G G G GGG G GG G
Sbjct: 21 NGGPTGLGVGGG---ASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGT 77

Query: 187 GGWLFGQGGAGGNGGTATLAGGAGGAG 213
GG L G A GAGG
Sbjct: 78 GGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 34.7 bits (79), Expect = 0.001
Identities = 27/86 (31%), Positives = 33/86 (38%), Gaps = 8/86 (9%)

Query: 247 GGGDGGRGGWLYGNGGVGGTGGTGGIGLQGASGGDGGAGGGTGLWGTGGVGGNGGTGGIG 306
G G G G +G + G G+G GGA G+G G G GI
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVG--------GGASDGSGWSSENNPWGGGSGSGIH 55

Query: 307 LDGVAGHIGGGNAGNAGNGGTGGSGG 332
G +GH GG GN+G G G
Sbjct: 56 WGGGSGHGNGGGNGNSGGGSGTGGNL 81



Score = 34.7 bits (79), Expect = 0.001
Identities = 32/86 (37%), Positives = 40/86 (46%), Gaps = 7/86 (8%)

Query: 406 GNAGVAGNGGAGGSAAMLFGNGGAGGNGGSGGDGGHGGNSTVSVPGGIGGDAGAGGTGGS 465
G G N GA ++ + NGG G G GG G S+ + P G G +G GGS
Sbjct: 3 GGDGRGHNTGAHSTSGNI--NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 466 AGKSGLLFGAGGAGGQGGGGGTGGDG 491
+G GG G GGG GTGG+
Sbjct: 61 GHGNG-----GGNGNSGGGSGTGGNL 81



Score = 33.9 bits (77), Expect = 0.002
Identities = 31/86 (36%), Positives = 37/86 (43%), Gaps = 4/86 (4%)

Query: 337 NGGDG-GHGGTGGGGGRGINGADGGHGGDGGTGGAAGSAGLLFGDGGTGGHGGAGFGGGN 395
+GGDG GH ING G G G GGA+ +G + GG G+G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNING---GPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 396 GSDSPQGGQGGNAGVAGNGGAGGSAA 421
GS GG GN+G G SA
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSAV 84



Score = 33.5 bits (76), Expect = 0.003
Identities = 37/105 (35%), Positives = 45/105 (42%), Gaps = 7/105 (6%)

Query: 142 GNGGNGGTSTTAGVAGGDGGAAGLIGNGGAGGGGGAGALGG-NGGAGGWLFGQGGAGGNG 200
G G N G +T+G +GG GL GGA G G + GG G GG G+G
Sbjct: 6 GRGHNTGAHSTSGNI--NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 201 GTATLAGGAGGAGGVGGSAGLWGTGGAGGNGGFGALNLAGDGGAG 245
GG G +GG G+ G A GF AL+ G GG
Sbjct: 64 N----GGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.8 bits (74), Expect = 0.005
Identities = 29/82 (35%), Positives = 34/82 (41%), Gaps = 4/82 (4%)

Query: 297 GGNGGTGGIGLDGVAGHIGGGNAGNAGNGGTGGSGGLLLGN----GGDGGHGGTGGGGGR 352
GG+G G +G+I GG G GG G N GG G GGG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 353 GINGADGGHGGDGGTGGAAGSA 374
G G +G GG GTGG +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAV 84



Score = 32.4 bits (73), Expect = 0.006
Identities = 25/76 (32%), Positives = 29/76 (38%), Gaps = 1/76 (1%)

Query: 275 QGASGGDGGAGGGTGLWGTGGVGGNGGTGGIGLDGVAGHIGGGNAGNAGNGGTGGSGGLL 334
+G + G G TG G G + G G GGG+ GG G G
Sbjct: 7 RGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGN-G 65

Query: 335 LGNGGDGGHGGTGGGG 350
GNG GG GTGG
Sbjct: 66 GGNGNSGGGSGTGGNL 81



Score = 32.0 bits (72), Expect = 0.007
Identities = 35/125 (28%), Positives = 46/125 (36%), Gaps = 7/125 (5%)

Query: 348 GGGGRGINGADGGHGGDGGTGGAAGSAGLLFGDGGTGGHGGAGFGGGNGSDSPQGGQGGN 407
GG GRG N G+ G G DG +GGG+GS GG G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 408 AGVAGNGGAGGSAAMLFGNGGAGGNGGSGGDGGHGGNSTVSVPGGIGGDAGAGGTGGSAG 467
GNG +GG + G GGN + G +S PG G SA
Sbjct: 63 GNGGGNGNSGGGS-------GTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115

Query: 468 KSGLL 472
+ ++
Sbjct: 116 IADIM 120



Score = 31.6 bits (71), Expect = 0.010
Identities = 28/82 (34%), Positives = 35/82 (42%), Gaps = 5/82 (6%)

Query: 192 GQGGAGGNGGTATLAGGAGGAGGVGGSAGLWGTGGAGGNGGFGALNLAGDGGAGAGGGDG 251
G G G N G + +G G G GL GGA G+ + N GG+G+G G
Sbjct: 3 GGDGRGHNTGAHSTSGNING-----GPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWG 57

Query: 252 GRGGWLYGNGGVGGTGGTGGIG 273
G G G G GG+G G
Sbjct: 58 GGSGHGNGGGNGNSGGGSGTGG 79



Score = 31.6 bits (71), Expect = 0.011
Identities = 30/105 (28%), Positives = 36/105 (34%), Gaps = 3/105 (2%)

Query: 453 IGGDAGAGGTGGSAGKSGLLFGAGGAGGQGGGGGTGGDGGNNIIGANVGGDGGAGGAGGL 512
+ G G G G+ SG + G G GGG G + N G G G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSE---NNPWGGGSGSGIHWG 57

Query: 513 GGNGTDGGWLSGNGGDGGAGGQGGDGGHGGSPGGFDGKSGVGGDG 557
GG+G G +GN G G G GF S G G
Sbjct: 58 GGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_4224  cloacin      37     3e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 36.6 bits (84), Expect = 3e-04
Identities = 30/78 (38%), Positives = 35/78 (44%), Gaps = 4/78 (5%)

Query: 154 GVAGGAGGSAGLIGNGGAGGGGGAGAVGGNGGAGGWLFGNGGAGGAGGATPGIGGGGGAG 213
G GA ++G I G G G G GA G+G W N GG G+ GGG G G
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSG----WSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 214 GAGGIGGAAGLFGNGGAG 231
GG G + G G GG
Sbjct: 64 NGGGNGNSGGGSGTGGNL 81



Score = 34.7 bits (79), Expect = 0.001
Identities = 30/83 (36%), Positives = 34/83 (40%), Gaps = 4/83 (4%)

Query: 304 GTGGTGLNGNAPFADGQHPVILDGGHGGTGGNGGAAGNGGLLFGNGGNGGLGGMGGGGGN 363
G G G N A G ++GG G G GGA+ G N GG G G G
Sbjct: 3 GGDGRGHNTGAHSTSGN----INGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 364 GLPSTGVGGDGGDGGNGGTAGNG 386
G GG+G GG GT GN
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNL 81



Score = 34.7 bits (79), Expect = 0.001
Identities = 33/98 (33%), Positives = 41/98 (41%), Gaps = 6/98 (6%)

Query: 127 DGAPGQAGGDGGLLYGNGGNGGTSTTAGVAGGAGGSAGLIGNGGAGGGGGAGAVGGNGGA 186
+G P G GG + G+G +S GG+G G G G GGG G GG G
Sbjct: 21 NGGPTGLGVGGG---ASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGT 77

Query: 187 GGWLFGNGGA---GGAGGATPGIGGGGGAGGAGGIGGA 221
GG L G +TPG GG + AG + A
Sbjct: 78 GGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 34.3 bits (78), Expect = 0.001
Identities = 36/115 (31%), Positives = 45/115 (39%), Gaps = 7/115 (6%)

Query: 328 GHGGTGGNGGAAGNGGLLFGNGGNGGLGGMGGGG-GNGLPSTGVGGDGGDGGNGGTAGNG 386
G G G N GA G + NGG GLG GG G+G S GG G G
Sbjct: 3 GGDGRGHNTGAHSTSGNI--NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 387 GWLIGNGGTGGQGGAGFAGGTGADNVSAGRPGGAGGTGGIGAGGGNAGLIGTGGS 441
G +G GG G +G GTG + + P G G G + + G+
Sbjct: 61 G----HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 34.3 bits (78), Expect = 0.001
Identities = 32/105 (30%), Positives = 41/105 (39%), Gaps = 5/105 (4%)

Query: 271 GDGGRGLPGGDGGSGGAGGGTGLWGSGGAGGQGGTGGTGLNGNAPFADGQHPVILDGGHG 330
G GRG G + G G G G G+G + + N P+ G I GG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWS--SENNPWGGGSGSGIHWGGGS 60

Query: 331 GTGGNGGAAGNGGLLFGNGGNGGLGGMGGGGGNGLPSTGVGGDGG 375
G G GG +GG G+G G L + G P+ G GG
Sbjct: 61 GHGNGGGNGNSGG---GSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 34.3 bits (78), Expect = 0.001
Identities = 34/114 (29%), Positives = 46/114 (40%), Gaps = 1/114 (0%)

Query: 484 GSGSRGGDGGDGGNGGEGRGGFSVPQHGVGGQGGTGGNGSDGGWLYGDGGAGGAGGNGGF 543
G RG + G G GG + G G G+G + + W G G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 544 GGSGVQNGAGGDAGSGGDSRLIGDGGAAGAGGTGAP-PGADGHSGADGLLSAAL 596
G G +GG +G+GG+ + A G P G S + G LSAA+
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAI 116



Score = 33.9 bits (77), Expect = 0.002
Identities = 32/87 (36%), Positives = 37/87 (42%), Gaps = 8/87 (9%)

Query: 371 GGDGGDGGNGGTAGNGGWLIGNGGTGGQGGAGFAGGTGADNVSAGRPGGAGGTGGIGAGG 430
GGDG G + +G G G G GGA G ++N P G G GI GG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSEN----NPWGGGSGSGIHWGG 58

Query: 431 GNAGLIGTGGSGGNGGMGGHGGDSGYG 457
G+ G G GGNG GG G G
Sbjct: 59 GS----GHGNGGGNGNSGGGSGTGGNL 81



Score = 33.5 bits (76), Expect = 0.002
Identities = 27/76 (35%), Positives = 30/76 (39%), Gaps = 1/76 (1%)

Query: 452 GDSGYGDQTGGQGGNGGM-GGAGGAAGNGGLLLGSGSRGGDGGDGGNGGEGRGGFSVPQH 510
G G G TG +G + GG G GG GSG + GG G G H
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 511 GVGGQGGTGGNGSDGG 526
G GG G G GS G
Sbjct: 63 GNGGGNGNSGGGSGTG 78



Score = 33.5 bits (76), Expect = 0.003
Identities = 32/98 (32%), Positives = 37/98 (37%), Gaps = 3/98 (3%)

Query: 134 GGDGGLLYGNGGNGGTSTTAGVAGGAGGSAGLIGNGGAGGGGGAGAVGGNGGAGGWLFGN 193
G + G +G G T GV GGA +G GGG + GG+G GN
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH---GN 64

Query: 194 GGAGGAGGATPGIGGGGGAGGAGGIGGAAGLFGNGGAG 231
GG G G G GG A A G L G G
Sbjct: 65 GGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 31.2 bits (70), Expect = 0.013
Identities = 27/80 (33%), Positives = 30/80 (37%), Gaps = 4/80 (5%)

Query: 419 GAGGTGGIGAGGGNAGLIGTGGSGGNGGMGGHGGDSGYGDQTGGQGGNGGMGGAGGAAGN 478
G G G + GN TG G G G G S GG G GG G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 479 GGLLLGSGSRGGDGGDGGNG 498
G G+G+ GG G GGN
Sbjct: 66 G----GNGNSGGGSGTGGNL 81



Score = 30.8 bits (69), Expect = 0.018
Identities = 28/81 (34%), Positives = 33/81 (40%)

Query: 234 GGDGGLNYYGDGGVAGAGGDGGRGGWLHGDGGDGGAGGDGGRGLPGGDGGSGGAGGGTGL 293
GGDG + G +G G G + G DG GG G GGG+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 294 WGSGGAGGQGGTGGTGLNGNA 314
GG G GG GTG N +A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSA 83



Score = 29.7 bits (66), Expect = 0.044
Identities = 25/76 (32%), Positives = 30/76 (39%), Gaps = 3/76 (3%)

Query: 415 GRPGGAGGTGGIGAGGGNAGLIGTGGSGGNGGMGGHGGDSGYGDQTGGQGGNGGMGGAGG 474
G GA T G GG +G G S G+G + G GG G G GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 475 AAGNGGLLLGSGSRGG 490
+GG GSG+ G
Sbjct: 68 NGNSGG---GSGTGGN 80


117  MMAR_4278  MMAR_4294  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_4278  0       12       -0.442823  PE-PGRS family protein
MMAR_4279  1       11       0.232493   FO synthase
MMAR_4280  2       11       0.576496   PE family protein
MMAR_4281  1       11       0.819282   hypothetical protein
MMAR_4283  0       12       0.886698   N-acetyl-1-D-myo-inosityl-2-amino-2-deoxy-alpha-
MMAR_4284  1       11       0.484732   transcriptional regulatory protein
MMAR_4287  1       12       0.517610   PE-PGRS family protein
MMAR_4288  -3      14       1.728456   lipoprotein LpqW
MMAR_4289  -1      13       2.935605   GTP-binding translation elongation factor TypA
MMAR_4290  0       15       3.485415   mutator protein MutT2
MMAR_4291  1       15       2.719748   pterin-4-alpha-carbinolamine dehydratase
MMAR_4292  1       16       3.134065   mannosyltransferase
MMAR_4293  3       18       3.258981   hypothetical protein
MMAR_4294  1       15       1.078864   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_4278  cloacin      48     1e-07    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 48.2 bits (114), Expect = 1e-07
Identities = 30/79 (37%), Positives = 34/79 (43%)

Query: 329 GGSGDGSGTGTGGTGSGSGGTGDTGGTGSGTGDTGGTGSGGTGDTGGTGSGTGDTGGTGT 388
GG G G TG T G G G G D G S GG+GSG GG+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 389 GTGSGTGDTGDTGDTGGTG 407
G G G G++G TGG
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 45.1 bits (106), Expect = 1e-06
Identities = 31/82 (37%), Positives = 39/82 (47%), Gaps = 4/82 (4%)

Query: 332 GDGSGTGTGGTGSGSGGTGDTGGTGSGTGDTGGTGSGGTGDTGGTGSGTGDTGGTGTGTG 391
GDG G TG + G G G G G + G+G + G GSG+ G G G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGS----GIHWGGG 59

Query: 392 SGTGDTGDTGDTGGTGGTGGTG 413
SG G+ G G++GG GTGG
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNL 81



Score = 44.7 bits (105), Expect = 1e-06
Identities = 29/80 (36%), Positives = 38/80 (47%), Gaps = 6/80 (7%)

Query: 306 QGAQTGSYSEPTNLIVNGVSVGGGGSGDGSGTGTGGTGSGSGGTGDTGGTGSGTGDTGGT 365
+G TG++S N +NG G G G S G+G S GG+GSG GG+
Sbjct: 7 RGHNTGAHSTSGN--INGGPTGLGVGGGASD----GSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 366 GSGGTGDTGGTGSGTGDTGG 385
G G G G +G G+G G
Sbjct: 61 GHGNGGGNGNSGGGSGTGGN 80



Score = 42.0 bits (98), Expect = 9e-06
Identities = 34/117 (29%), Positives = 47/117 (40%), Gaps = 4/117 (3%)

Query: 105 MGGGGAQGDS--AAQSGGGGNAGQAAEGASSGQAAASGGAVSPNSIAAGATNTTAASAVM 162
M GG +G + A + G N G G G + SG + N G+ +
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 163 SGGTGNGGAISGGGAAGSGGASGATTAAATGGGELG--GSGGVAALSSAAATTATVS 217
G G G SGGG+ G S A G L G+GG+A SA A +A ++
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIA 117



Score = 38.9 bits (90), Expect = 8e-05
Identities = 24/84 (28%), Positives = 32/84 (38%), Gaps = 1/84 (1%)

Query: 350 GDTGGTGSGTGDTGGTGSGGTGDTGGTGSGTGDTGGTGTGTGSGTGDTGDTGDTGGTGGT 409
GD G +G T G +GG G G + +G + G G GG+G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 410 GGTGGTGDTGDTGDTGGTGDGTGT 433
G GG G++G TGG
Sbjct: 64 NG-GGNGNSGGGSGTGGNLSAVAA 86



Score = 38.5 bits (89), Expect = 1e-04
Identities = 22/71 (30%), Positives = 32/71 (45%)

Query: 383 TGGTGTGTGSGTGDTGDTGDTGGTGGTGGTGGTGDTGDTGDTGGTGDGTGTGTGDGGDTG 442
+GG G G +G T + G TG G G + +G + + G G+G+G GG +G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 443 GSGTGGTVGDG 453
GG G
Sbjct: 62 HGNGGGNGNSG 72



Score = 32.0 bits (72), Expect = 0.011
Identities = 26/79 (32%), Positives = 27/79 (34%)

Query: 307 GAQTGSYSEPTNLIVNGVSVGGGGSGDGSGTGTGGTGSGSGGTGDTGGTGSGTGDTGGTG 366
GA GS N G S G G GSG G GG SGG TGG S G
Sbjct: 32 GASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91

Query: 367 SGGTGDTGGTGSGTGDTGG 385
G G + G
Sbjct: 92 FPALSTPGAGGLAVSISAG 110


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_4280  RTXTOXINA    30     0.020    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 29.5 bits (66), Expect = 0.020
Identities = 44/195 (22%), Positives = 69/195 (35%), Gaps = 49/195 (25%)

Query: 11 LTAAASDVAGIGTTISSANAAASAPTTGVLAAAGDQVSAQVAALFSSHGEIY---QRLSS 67
L + ++ I + +NA A T AAAG +++ +V Y QR +
Sbjct: 242 LDTVSGILSAISASFILSNADAD---TRTKAAAGVELTTKVLGNVGKGISQYIIAQRAAQ 298

Query: 68 QLSTFHDQFAAALNTSANSYA---------SAEANAAKTL------LSAVNSPAEKLLGQ 112
LST AA L SA + A + + A + + + LL
Sbjct: 299 GLST--SAAAAGLIASAVTLAISPLSFLSIADKFKRANKIEEYSQRFKKLGYDGDSLLAA 356

Query: 113 PLMGQGGIVANAVSQVQSVFAGAGSNALGANASMLALAPTGGAATAAASGSLLGPIASAA 172
G I A +++ + +V LA +AAA+ SL+G
Sbjct: 357 FHKETGAIDA-SLTTISTV-----------------LASVSSGISAAATTSLVG------ 392

Query: 173 AAPAAILPVSVATAI 187
AP + L V T I
Sbjct: 393 -APVSAL-VGAVTGI 405


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_4284  HTHTETR      57     3e-12    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 56.9 bits (137), Expect = 3e-12
Identities = 25/192 (13%), Positives = 56/192 (29%), Gaps = 10/192 (5%)

Query: 13 RSRRRGEVLERALYSATLAELIAVGYGRLTMEGIAARAQTGKAALYRRWASKHDLVQAAL 72
++++ + + + L G ++ IA A + A+Y + K DL
Sbjct: 4 KTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIW 63

Query: 73 QYAVPPPPE------PRPGRSARENLLTVFTAHRDVLAGKTEFPGLVAI----GQLIHEP 122
+ + E + L + + + L+ I + + E
Sbjct: 64 ELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEM 123

Query: 123 ELRAIFANSVVHPRLKIIDSVLRAAIREGDLDPDTVTPFTARIGPALMNQHFILTGTPPN 182
+ ++ I+ L+ I L D +T A I ++ P
Sbjct: 124 AVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQ 183

Query: 183 RRELALIVDTVI 194
+L +
Sbjct: 184 SFDLKKEARDYV 195


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_4287  PF07132      32     0.006    Harpin protein (HrpN)
		>PF07132#Harpin protein (HrpN)

Length = 356

Score = 32.4 bits (73), Expect = 0.006
Identities = 58/217 (26%), Positives = 88/217 (40%), Gaps = 6/217 (2%)

Query: 189 LLGGQVPGAQIGAGLSGVVQTGQSLAGSFNAALSGMGTQLSAALSGSLSAGLPDLSSLGV 248
L GG I AG +G + QS + + S G Q S ++ LS + + +G
Sbjct: 5 LGGGASLQITIKAGGNGGLFPSQSSQNGGSPSQSAFGGQRSN-IAEQLSDIMTTMMFMGS 63

Query: 249 QLGAGLSGDLSA---GLPSVSGLVQTGQVLAGSFNDAVGGLGSRVAGVLSGAVGGELPDL 305
+G GL G L L + G + G + G + GLGS + G L GA+G + +
Sbjct: 64 MMGGGLGGGLGGLGSSLGGLGGGLLGGGLGGGLGSSLGSGLGSALGGGLGGALGAGMNAM 123

Query: 306 SGLVQAGAVLMGGVGTALSGLGAQLSAALSGSLGTAVPGLSALIQ-TGESLAGGFNAALG 364
+ G++L + L G +Q L G+ + P +SA Q ++L+ L
Sbjct: 124 NPSAMMGSLLFSALEDLLGGGMSQQQGGLFGNKQPSSPEISAYTQGVNDALSAILGNGLS 183

Query: 365 -TLGTQVSGMLSGGLGGGLPGLAALIQTGESLAGGFG 400
T G L GL G A Q G +L G
Sbjct: 184 QTKGQTSPLQLGNNGLQGLSGAGAFNQLGSTLGMSVG 220



Score = 30.8 bits (69), Expect = 0.020
Identities = 36/139 (25%), Positives = 54/139 (38%), Gaps = 1/139 (0%)

Query: 283 VGGLGSRVAGVLSGAVGGELPDLSGLVQAGAVLMGGVGTALSGLGAQLSAALSGSLGTAV 342
+G + G G +G L L G + G + G + SGLG+ L L G+LG +
Sbjct: 61 MGSMMGGGLGGGLGGLGSSLGGLGGGLLGGGLGGGLGSSLGSGLGSALGGGLGGALGAGM 120

Query: 343 PGLSALIQTGESLAGGFNAALGTLGTQVSGMLSGGLGGGLPGLAALIQ-TGESLAGGFGT 401
++ G L LG +Q G L G P ++A Q ++L+ G
Sbjct: 121 NAMNPSAMMGSLLFSALEDLLGGGMSQQQGGLFGNKQPSSPEISAYTQGVNDALSAILGN 180

Query: 402 GLTGFMGGVDVLGNLGGEL 420
GL+ G L L
Sbjct: 181 GLSQTKGQTSPLQLGNNGL 199


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_4288  RTXTOXINA    31     0.020    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 30.7 bits (69), Expect = 0.020
Identities = 32/129 (24%), Positives = 50/129 (38%), Gaps = 21/129 (16%)

Query: 416 GGISKDGRQLTLVIGVAANDPTSVAVANTAADQLRNVGIAASV--LALDPVTLYGDALND 473
G + K Q + A TS A A G+ AS LA+ P++ A
Sbjct: 281 GNVGKGISQYIIAQRAAQGLSTSAAAA----------GLIASAVTLAISPLSFLSIADKF 330

Query: 474 NRVDAIVGWHQA----GGNLATLLASRY---GC--PALQTTEVWEPTIPANTPAATTGSM 524
R + I + Q G + +LLA+ + G +L T ++ + AA T S+
Sbjct: 331 KRANKIEEYSQRFKKLGYDGDSLLAAFHKETGAIDASLTTISTVLASVSSGISAAATTSL 390

Query: 525 PSAVPSAVT 533
A SA+
Sbjct: 391 VGAPVSALV 399


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_4289  TCRTETOQM    190    1e-54    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 190 bits (485), Expect = 1e-54
Identities = 106/447 (23%), Positives = 174/447 (38%), Gaps = 62/447 (13%)

Query: 1 MLFRNVAIVAHVDHGKTTLVDAMLRQSGALTERGEVQE--RVMDSGDLEREKGITILAKN 58
M N+ ++AHVD GKTTL +++L SGA+TE G V + D+ LER++GITI
Sbjct: 1 MKIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGI 60

Query: 59 TAVHRHHPDGTVTVINVIDTPGHADFGGEVERGLSMVDGVLLLVDASEGPLPQTRFVLRK 118
T+ + T +N+IDTPGH DF EV R LS++DG +LL+ A +G QTR +
Sbjct: 61 TSFQWEN-----TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHA 115

Query: 119 ALSAHLPVILVVNKTDRPDARIKEVVEASHDLLLDVA----------------SDLDDEA 162
+P I +NK D+ + V + + L ++
Sbjct: 116 LRKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQW 175

Query: 163 AAAAEHALGLPTLYASG------------RAGIAS-TIEPA-DGQAPDGTNLDPLFDVLL 208
E L Y SG + ++ P G A + +D L +V+
Sbjct: 176 DTVIEGNDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVIT 235

Query: 209 EHVPPPQGDSEAPLQALVTNLDASAFLGRLALVRIYNGKLRKGQQVAWLREVDGVPVVTS 268
++ L V ++ S RLA +R+Y+G L V +
Sbjct: 236 NKFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVR-------ISEKEK 288

Query: 269 AKITELLATEGVERSTTDEAVAGDIVAVAGLP---EIMIGDTLADPDHAHALPRITVDEP 325
KITE+ + E D+A +G+IV + ++GDT P I P
Sbjct: 289 IKITEMYTSINGELCKIDKAYSGEIVILQNEFLKLNSVLGDTKLLPQRER----IENPLP 344

Query: 326 AISVTVGTNTSPLAGKVSGHKLTARMVRGRLDTELIGNVSIRVVDIGRPDAWEVQGRGEL 385
+ TV + + L + +R + G++
Sbjct: 345 LLQTTVEPSKPQQREM----------LLDALLEISDSDPLLRYYVDSATHEIILSFLGKV 394

Query: 386 ALAVLVETMRRE-GFELTVGKPQVVTK 411
+ V ++ + E+ + +P V+
Sbjct: 395 QMEVTCALLQEKYHVEIEIKEPTVIYM 421



Score = 42.9 bits (101), Expect = 3e-06
Identities = 18/84 (21%), Positives = 32/84 (38%), Gaps = 1/84 (1%)

Query: 416 QLHEPFEAMTIDCPDEFVGAITQLMAGRKGRMEEMTNHAAGWVRMDFIVPSRGLIGFRTD 475
+L EP+ + I P E++ + + T V + +P+R + +R+D
Sbjct: 534 ELLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVD-TQLKNNEVILSGEIPARCIQEYRSD 592

Query: 476 FLTLTRGTGIANAVFDGYRPWAGE 499
T G + GY GE
Sbjct: 593 LTFFTNGRSVCLTELKGYHVTTGE 616


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_4292  PF06580      29     0.034    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.1 bits (65), Expect = 0.034
Identities = 16/87 (18%), Positives = 28/87 (32%), Gaps = 18/87 (20%)

Query: 343 LVPLMIWLLSGPLRDRLGARILGW---------GWLALTVIGVPWLLSFAQPTIWQI--- 390
+ LM +L+ R + + A VIG+ W A +IW++
Sbjct: 47 AISLMGLVLTHAYRSFIKRQGWLKLNMGQIILRVLPACVVIGMVWF--VANTSIWRLLAF 104

Query: 391 ----GRPWYLAWAGLVYVVATLATLGW 413
+ L A + + T W
Sbjct: 105 INTKPVAFTLPLALSIIFNVVVVTFMW 131


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_4294  PF07201      33     0.001    Hypersensitivity response secretion protein HrpJ
		>PF07201#Hypersensitivity response secretion protein HrpJ

Length = 293

Score = 33.3 bits (76), Expect = 0.001
Identities = 10/56 (17%), Positives = 21/56 (37%)

Query: 46 LPALAQLSPIIQQAAGNPEQATQLLMAAAQAFAHNPAAPTESRNVASSVNQFVQEP 101
+L+QL ++ + P + ++L A P S V ++ +E
Sbjct: 115 NISLSQLKAYLEGKSEEPSEQFKMLCGLRDALKGRPELAHLSHLVEQALVSMAEEQ 170


118  MMAR_4304  MMAR_4331  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_4304  0       11       0.147524   TetR family transcriptional regulator
MMAR_4305  -1      10       0.621570   transmembrane transport protein MmpL13
MMAR_4306  0       9        0.935746   hypothetical protein
MMAR_4307  0       9        0.861101   short-chain type dehydrogenase/reductase
MMAR_4308  -1      10       0.574963   alpha-methylacyl-CoA racemase Mcr
MMAR_4309  -1      12       0.039948   enoyl-CoA hydratase
MMAR_4310  0       12       0.522542   oxidoreductase
MMAR_4311  10      26       4.756520   integral membrane protein
MMAR_4312  10      25       4.770877   hypothetical protein
MMAR_4313  10      25       4.455238   chalcone synthase
MMAR_4314  11      22       3.705246   hypothetical protein
MMAR_4315  11      19       2.160438   oxidoreductase
MMAR_4316  10      19       1.319214   PE-PGRS family protein
MMAR_4317  4       12       -3.177810  enoyl-CoA hydratase, EchA1
MMAR_4318  4       13       -3.374459  acetyl-CoA acetyltransferase
MMAR_4319  5       14       -3.437899  PPE family protein
MMAR_4320  3       12       -2.917713  PPE family protein
MMAR_4321  -1      16       -2.752585  PE family protein
MMAR_4322  0       13       -2.171090  NAD dependent aldehyde dehydrogenase
MMAR_4323  1       16       -1.858165  transposase
MMAR_5543  0       10       -1.062472  hypothetical protein
MMAR_4324  0       10       -1.580944  PPE family protein
MMAR_4325  0       8        -2.037278  TetR family transcriptional regulator
MMAR_4326  -1      8        -0.967181  short-chain type dehydrogenase/reductase
MMAR_4327  -1      8        -0.989863  hypothetical protein
MMAR_4328  -2      8        -0.613162  5-
MMAR_4329  -1      9        -0.353611  hypothetical protein
MMAR_4330  -2      10       0.355679   hypothetical protein
MMAR_4331  -2      9        0.299620   pyruvate phosphate dikinase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4304  HTHTETR       68     6e-16    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 67.7 bits (165), Expect = 6e-16
Identities = 35/164 (21%), Positives = 61/164 (37%), Gaps = 11/164 (6%)

Query: 9 ARAPRGSGDLLRHEILDAATELLLQTRQARAVSIRSVAERVGVTPPSIYLHFQDKDALLD 68
AR + R ILD A L Q + + S+ +A+ GVT +IY HF+DK L
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQ-QGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 69 AVCARYLARLDE-EMERAAMGHTCVVEVLRAQGLAYVRFALQTPELYRLATM-------- 119
+ + + E E+E A + VLR + + + L +
Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120

Query: 120 GEWRSGSNVDSALDSSAFRHMCASVQAMMDEGIYRAD-DPTTIA 162
GE L ++ + +++ ++ + AD A
Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAA 164


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4305  ACRIFLAVINRP  53     4e-09    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 53.3 bits (128), Expect = 4e-09
Identities = 36/233 (15%), Positives = 85/233 (36%), Gaps = 29/233 (12%)

Query: 186 LIAIPLSFLVLVWVFGGLLAAALPMALGALAVVGSMSVLRLVTFTTDVSIFALNLSTALG 245
AI L FLV+ + A +P + ++G+ ++L ++ +N T G
Sbjct: 345 FEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYS-------INTLTMFG 397

Query: 246 LALAI-----DYTLLIISRYRDELAEGSSREEALVRTMATSGRTVLFSAVT---VALSMS 297
+ LAI D +++ + R + + +EA ++M+ ++ A+ V + M+
Sbjct: 398 MVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMA 457

Query: 298 ATVAFPMYFLKSFAYAGVATVAFVATASIVVTPAAIVLLGPRLDALNVRRLARRMLGRPE 357
+ F+ V+ +A ++++TPA L ++ ++
Sbjct: 458 FFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATL--------LKPVSAEHHENKG 509

Query: 358 PQHKPVDQLF------WYRSTKFVMRRALPVGLAVVAVLVILGLPFFSVKWGF 404
+ F + S ++ L ++ + + F + F
Sbjct: 510 GFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSF 562



Score = 40.6 bits (95), Expect = 3e-05
Identities = 42/233 (18%), Positives = 81/233 (34%), Gaps = 27/233 (11%)

Query: 116 SAPDLVSKDGKSGL-IVVNIKGGES--NAQKNAQTLADEIVHDRDGVTVRAGGSAMEYAQ 172
+P L +G + I G S +A + LA ++ G+ G + +
Sbjct: 811 GSPRLERYNGLPSMEIQGEAAPGTSSGDAMALMENLASKLP---AGIGYDWTGMSYQERL 867

Query: 173 INKQNQDDLLVMELIAIPLSFLVLVWVFGGLLAAALPMALGALAVVGSMSVLRLVTFTTD 232
Q + I+ + FL L ++ M + L +VG + L D
Sbjct: 868 SGNQ----APALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKND 923

Query: 233 VSIFALNLSTALGLALAIDYTLLIISRYRDEL-AEGSSREEALVRTMATSGRTVLFSAVT 291
V F + L T +G L+ +LI+ +D + EG EA + + R +L +++
Sbjct: 924 V-YFMVGLLTTIG--LSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLA 980

Query: 292 VALSMSATVAFPMYF--------LKSFAYAGVATVAFVATASIVVTPAAIVLL 336
L + P+ + + + +I P V++
Sbjct: 981 FILGV-----LPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVI 1028


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4307  DHBDHDRGNASE  73     2e-17    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 72.8 bits (178), Expect = 2e-17
Identities = 58/205 (28%), Positives = 87/205 (42%), Gaps = 19/205 (9%)

Query: 3 IRDAVAVVTGGASGLGLATTKRLLDAGAQVVVLDIRGE---DVVADLGDRARFA---AAD 56
I +A +TG A G+G A + L GA + +D E VV+ L AR A AD
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 57 VTDEAAVASALD-LAETMGTLRIVVNCAGTGNAIRVLSRDGVFPLAAFRKIVDINLVGSF 115
V D AA+ + MG + I+VN AG +R + + +N G F
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGV---LRPGLIHSL-SDEEWEATFSVNSTGVF 121

Query: 116 NVLRLAAERIAKTEPVGPNAEERGVIINTASVAAFDGQIGQAAYSASKGGVVGMTLPIAR 175
N R ++ + G I+ S A + AAY++SK V T +
Sbjct: 122 NASRSVSKYM--------MDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGL 173

Query: 176 DLASHRIRVMTIAPGLFDTPLLASL 200
+LA + IR ++PG +T + SL
Sbjct: 174 ELAEYNIRCNIVSPGSTETDMQWSL 198


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4310  NUCEPIMERASE  60     4e-12    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 60.2 bits (146), Expect = 4e-12
Identities = 29/123 (23%), Positives = 54/123 (43%), Gaps = 18/123 (14%)

Query: 9 LVTGATGYIGARLVPRLLDEGHRVRAL---------ARDPGKLADVPWRDRAEVVRGDLG 59
LVTGA G+IG + RLL+ GH+V + + +L + + + + DL
Sbjct: 4 LVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLA-QPGFQFHKIDLA 62

Query: 60 DTDSLEAAFA--GMDVVYYLVH------SMGSAKHFADEEARAAHNVVLAARRSGVRRVV 111
D + + FA + V+ H S+ + +AD N++ R + ++ ++
Sbjct: 63 DREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHLL 122

Query: 112 YLS 114
Y S
Sbjct: 123 YAS 125


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4312  NUCEPIMERASE  63     4e-13    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 62.9 bits (153), Expect = 4e-13
Identities = 31/125 (24%), Positives = 51/125 (40%), Gaps = 18/125 (14%)

Query: 1 MRILVTGATGYVGSRLVTALLADGHEVLA---------ATRNMARLSRLAWFDDVTPVIL 51
M+ LVTGA G++G + LL GH+V+ + ARL LA +
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLA-QPGFQFHKI 59

Query: 52 DATDRASAQAAMNAAGQIDVVYYLVH------GIGQPD-FRDRDKTAAANLAVAARDTGV 104
D DR + A+G + V+ H + P + D + T N+ R +
Sbjct: 60 DLADRE-GMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKI 118

Query: 105 RRIVY 109
+ ++Y
Sbjct: 119 QHLLY 123


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4315  SECA          29     0.039    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 28.7 bits (64), Expect = 0.039
Identities = 23/86 (26%), Positives = 32/86 (37%), Gaps = 15/86 (17%)

Query: 243 VAGRVLLVGDAAGYEDALTGEGISLAVKQAAA-------AVRAIADND-PASYEAAWHRV 294
+ G VL A + TGEG +L A V + ND A +A
Sbjct: 89 LGGMVLNERCIA---EMRTGEGKTLTATLPAYLNALTGKGVHVVTVNDYLAQRDAEN--N 143

Query: 295 TRSYRWL--TRGLVLASAPRPARRAI 318
+ +L T G+ L P PA+R
Sbjct: 144 RPLFEFLGLTVGINLPGMPAPAKREA 169


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4316  cloacin       40     1e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 40.1 bits (93), Expect = 1e-04
Identities = 33/104 (31%), Positives = 41/104 (39%), Gaps = 1/104 (0%)

Query: 1212 IGGAGGQGGSGGAAGTGGDVGGSPGEVGGAGGGGGFGGAGGAGGVGGGAGGSG-GVGGSG 1270
+ G G+G + GA T G++ G P +G GG G GG GSG GG
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 1271 GNGGLGVATGGAGGVGGQGGAAGAGGAAGAGATAAGAGGVGGLG 1314
G+G G GG G G + G A G GGL
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 39.3 bits (91), Expect = 2e-04
Identities = 29/88 (32%), Positives = 35/88 (39%)

Query: 1057 TGGAGGGAGSGAGGLGAGGDGGTGGAGGAGGVGSSAGWGSGLAGQVGGAGGVGGAGGDSG 1116
+GG G G +GA +GG G G GG +GW S GG+G GG SG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1117 GLGGTNGDGGAGGLGGRGGAGGSSTTVA 1144
G GG G G + VA
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVA 89



Score = 38.2 bits (88), Expect = 3e-04
Identities = 37/112 (33%), Positives = 45/112 (40%), Gaps = 2/112 (1%)

Query: 1193 GRGGDGGAGGIGGG--GGLSGIGGAGGQGGSGGAAGTGGDVGGSPGEVGGAGGGGGFGGA 1250
GRG + GA G GG +G+G GG G + GG G GGG G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 1251 GGAGGVGGGAGGSGGVGGSGGNGGLGVATGGAGGVGGQGGAAGAGGAAGAGA 1302
GG G GGG+G G + G G GG + AG + A A
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIA 117



Score = 37.4 bits (86), Expect = 7e-04
Identities = 39/110 (35%), Positives = 50/110 (45%), Gaps = 2/110 (1%)

Query: 509 AGGDGGAGGVGAEGAAGAGVVGGGAGGDGGAGGAAGAGGSGGGGIGGGKAGTG-GDGGIG 567
+GGDG GA +G + GG G G G + G+G S GG +G+G GG
Sbjct: 2 SGGDGRGHNTGAHSTSG-NINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 568 GAGGTGGGGGNGDTYSTGGAGGDGGAGGAAGSAGGAGAGSGGAAGSAGSG 617
G G GG G +G TGG A A G + G+GG A S +G
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 37.0 bits (85), Expect = 8e-04
Identities = 31/101 (30%), Positives = 44/101 (43%)

Query: 545 AGGSGGGGIGGGKAGTGGDGGIGGAGGTGGGGGNGDTYSTGGAGGDGGAGGAAGSAGGAG 604
+GG G G G + +G G G GGG +G +S+ GG+G GG+G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 605 AGSGGAAGSAGSGGSGGDGGAGGASGRELGSNLGYAGGVGG 645
G+GG G++G G G + A+ G G GG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 35.5 bits (81), Expect = 0.002
Identities = 32/118 (27%), Positives = 42/118 (35%), Gaps = 6/118 (5%)

Query: 1616 NGGRGGAGGAGGFAGDGEGSGGTAGSGGNGGKGGNAGAGGNGVPAAGAAAGNGGLGGSGG 1675
+GG G G + G +GG G G GG +G P G + GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1676 AGGSGAIGAAAGGAGGAGGNGGTGGNAGIGVMRGASTPALAGDGGVGGAGGLGGVARS 1733
G G G GG+G G + + PAL+ G G A + A S
Sbjct: 62 HGNGG------GNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALS 113



Score = 34.7 bits (79), Expect = 0.004
Identities = 36/103 (34%), Positives = 42/103 (40%), Gaps = 2/103 (1%)

Query: 970 GSGGGVGNAGVGGAGGVGGAGGAGGAADGPGLFGYDG--GAGGAGGIGGAAGVGGSNGAG 1027
G G + GG G G GGA+DG G + G G GI G G NG G
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 1028 GTGGAGGVGGVGADSGLASRPGGAGGAGGTGGAGGGAGSGAGG 1070
GG G G S +A+ A T GAGG A S + G
Sbjct: 68 NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 34.7 bits (79), Expect = 0.005
Identities = 27/80 (33%), Positives = 37/80 (46%)

Query: 485 AGGAGGAGGAGGAAAAGGTAGVGGAGGDGGAGGVGAEGAAGAGVVGGGAGGDGGAGGAAG 544
+GG G G + +G G G GG G+ ++ GGG+G GG +G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 545 AGGSGGGGIGGGKAGTGGDG 564
G GG G GG +GTGG+
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 34.3 bits (78), Expect = 0.005
Identities = 25/77 (32%), Positives = 27/77 (35%)

Query: 1104 GAGGVGGAGGDSGGLGGTNGDGGAGGLGGRGGAGGSSTTVAGAGGGGGRGGDGGSAGGGV 1163
G G GA SG + G G GG G S G G G G GGS G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 1164 GGGGVGGAAGSGGAGGA 1180
GG G G G +
Sbjct: 66 GGNGNSGGGSGTGGNLS 82



Score = 34.3 bits (78), Expect = 0.006
Identities = 35/108 (32%), Positives = 41/108 (37%), Gaps = 1/108 (0%)

Query: 1451 GAGGAGGQGGAANGGVAGDGGVGGNGGVGGVGGRGGDGANGAPGGGIDGTG-RPGGAGGQ 1509
G G G GA + +GG G G GG G + P GG G+G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1510 GGSGGRAGFGGAAGAGEGGEYGAAGVGGNGGDGGAGGRGGYGTTGSGG 1557
G GG GG +G G AA V G GG + S G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 33.9 bits (77), Expect = 0.006
Identities = 27/85 (31%), Positives = 35/85 (41%)

Query: 440 AAGQGRGGTVGAAGVGGTGGVGGDGGAGDSGAAASAPGGAGGTGWAGGAGGAGGAGGAAA 499
+ G GRG GA G G G GA+ + + W GG+G GG +
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 500 AGGTAGVGGAGGDGGAGGVGAEGAA 524
G G G +GG G GG + AA
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 33.5 bits (76), Expect = 0.009
Identities = 34/104 (32%), Positives = 43/104 (41%), Gaps = 2/104 (1%)

Query: 660 LAGAAGSGGNGGAGGAGGASAVALVGGAGGAGGAGGQGGTAGDGPGGVGGHGGSGGSGGI 719
++G G G N GA G G G G + G G ++ + P G GG G GG
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWG-GGSGSGIHWGGG 59

Query: 720 GGTGGDGYQSGDVGGQGGEGGAGGAAGAGGEAGAQGLAGAGGTG 763
G G G +G+ GG G GG A A G L+ G G
Sbjct: 60 SGHGNGG-GNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 33.1 bits (75), Expect = 0.011
Identities = 31/89 (34%), Positives = 37/89 (41%), Gaps = 2/89 (2%)

Query: 1496 GIDGTGRPGGAGGQGGS--GGRAGFGGAAGAGEGGEYGAAGVGGNGGDGGAGGRGGYGTT 1553
G DG G GA G+ GG G G GA +G + + GG G GG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1554 GSGGLFGGSGGHGGVGGIGGNGGSAAAGG 1582
G+GG G SGG G GG + A G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 32.8 bits (74), Expect = 0.017
Identities = 40/115 (34%), Positives = 45/115 (39%), Gaps = 10/115 (8%)

Query: 346 GGAGGAGGVGGAGTSGGDAVVPGGVGGVGGSGGAGG--------AGGSGGGAGWLGTAGD 397
GG G G TSG P G+G GG+ G GGSG G W G +G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 398 GGVGGVGGGGGGGGVGASGVGHQLAGGAGGAGGAGGAAGAGGAAGQGRGGTVGAA 452
G G G G GGG G G +A A GAGG A G + AA
Sbjct: 63 GNGG--GNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 32.4 bits (73), Expect = 0.019
Identities = 32/103 (31%), Positives = 37/103 (35%), Gaps = 5/103 (4%)

Query: 600 AGGAGAGSGGAAGSAGSGGSGGDGGAGGASGRELGSNLGYAGGVGGDGGQGGQGGAAVGG 659
+GG G G A S +GG G G G GS G G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 660 LAGAAGSGGNGGAGGAGG-----ASAVALVGGAGGAGGAGGQG 697
G+G +GG G GG A+ VA A GAGG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.4 bits (73), Expect = 0.019
Identities = 35/100 (35%), Positives = 42/100 (42%), Gaps = 4/100 (4%)

Query: 1148 GGGGRGGDGG--SAGGGVGGGGVGGAAGSGGAGGAGGRGIDSYAAAGGRGG--DGGAGGI 1203
GG GRG + G S G + GG G G G + G+G ++ G G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1204 GGGGGLSGIGGAGGQGGSGGAAGTGGDVGGSPGEVGGAGG 1243
G GGG GG G GG+ A G GAGG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 32.4 bits (73), Expect = 0.020
Identities = 28/81 (34%), Positives = 32/81 (39%)

Query: 1145 GAGGGGGRGGDGGSAGGGVGGGGVGGAAGSGGAGGAGGRGIDSYAAAGGRGGDGGAGGIG 1204
G G G G+ GG G GVGG A G + + +G G G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 1205 GGGGLSGIGGAGGQGGSGGAA 1225
GG G SG G G S AA
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAA 86



Score = 32.4 bits (73), Expect = 0.021
Identities = 30/84 (35%), Positives = 36/84 (42%), Gaps = 2/84 (2%)

Query: 1869 GAGGVGGFGGVGGTGASGLGGSGGIGGDGGA--GGVGGDCSVPLSPGNGGDGGAGGDGGD 1926
G G G G T + GG G+G GGA G + P G+G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1927 GGDGGNGQPGGPGGAGGGAASGGA 1950
G GGNG GG G GG ++ A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 32.4 bits (73), Expect = 0.021
Identities = 32/101 (31%), Positives = 40/101 (39%), Gaps = 2/101 (1%)

Query: 1567 GVGGIGGNGGSAAAGGVNGNGGNGGIGGNAGDAGNGANGSLLHHAGDGGNGGRG-GAGGA 1625
G G G N G+ + G N NGG G+G G + S + G G G G G
Sbjct: 3 GGDGRGHNTGAHSTSG-NINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1626 GGFAGDGEGSGGTAGSGGNGGKGGNAGAGGNGVPAAGAAAG 1666
G G SGG +G+GGN A G + A G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 32.4 bits (73), Expect = 0.022
Identities = 35/107 (32%), Positives = 43/107 (40%), Gaps = 3/107 (2%)

Query: 1249 GAGGAGGVGGGAGGSGGVGGSGGNGGLGVATGGAGGVGGQGGAAGAGGAAGAGATAAGAG 1308
G G G G SG + +GG GLGV G + G G GG +G+G G
Sbjct: 3 GGDGRGHNTGAHSTSGNI--NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 1309 GVGGLGGDGGNGGNGVRGAAGVAGGDGAVGGGGGAGGAGGQGGAGVT 1355
G G GG GN G G ++ V G A G GG V+
Sbjct: 61 GHGN-GGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVS 106



Score = 32.4 bits (73), Expect = 0.022
Identities = 30/82 (36%), Positives = 36/82 (43%), Gaps = 2/82 (2%)

Query: 1771 GAGGVGGNGGFAALGTGGAAGSGGGGGTGGAGGVSDSPTTSRSVGGAGGVGGVGGNGGIG 1830
G G G N G + G G G GGA S + + GG G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1831 GNGQIGGDGGSGGAAGAGGAGA 1852
GNG GG+G SGG +G GG +
Sbjct: 63 GNG--GGNGNSGGGSGTGGNLS 82



Score = 32.0 bits (72), Expect = 0.025
Identities = 39/135 (28%), Positives = 46/135 (34%)

Query: 756 LAGAGGTGGTGGQGGTGGIGAQGSNGHGVGGRPGTAGAVGGAGGAGGQGGAAGLDGTAGD 815
++G G G G T G G G GVGG G G +G+ G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 816 GGVGGTGGRGGAGGDGAGGVGHQLAGGAGGDGGAGGAAGVGGAAGAGSGGVVGAAGTGGT 875
G G G GG G GG +A A G GG A + S G + AA
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIM 120

Query: 876 GGAGGNGGAGDTGVA 890
G G GVA
Sbjct: 121 AALKGPFKFGLWGVA 135



Score = 32.0 bits (72), Expect = 0.029
Identities = 22/81 (27%), Positives = 30/81 (37%)

Query: 539 AGGAAGAGGSGGGGIGGGKAGTGGDGGIGGAGGTGGGGGNGDTYSTGGAGGDGGAGGAAG 598
+GG +G G G G+GG G G + + GG+G GG +G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 599 SAGGAGAGSGGAAGSAGSGGS 619
G G G+ G G S
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLS 82



Score = 32.0 bits (72), Expect = 0.032
Identities = 32/114 (28%), Positives = 38/114 (33%), Gaps = 6/114 (5%)

Query: 1425 GGNGGAGDAGVAGADGGGAGGSGWAGGAGGAGGQGGAANGGVAGDGGVGGNGGVGGVGGR 1484
GG+G + G G GG G GGA G ++ GG G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1485 GGDGANGAPGGGIDGTGRPGGAGGQGGSGGRAGFGGAAGAGEGGEYGAAGVGGN 1538
G G NG GGG G G FG A + G A +
Sbjct: 63 GNGGGNGNSGGG------SGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 31.6 bits (71), Expect = 0.034
Identities = 31/80 (38%), Positives = 35/80 (43%), Gaps = 1/80 (1%)

Query: 224 SGGAGGAGDVGVAGGAGGV-GGRGGWVFGDGGSGGVGGSGGVGVVGGVGGVGGGTGVFGG 282
SGG G + G +G + GG G G G S G G S GG G G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 283 GGAGGAGGVGGGTGGSGGNG 302
G GG G GG G+GGN
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 31.6 bits (71), Expect = 0.034
Identities = 32/108 (29%), Positives = 40/108 (37%), Gaps = 6/108 (5%)

Query: 929 GDGGTGGAGGAGAGGDRTDGGRGGVGGAGGDAGAGGVTGAGGSGGGVGNAGVGGAGGVGG 988
G G G GA + +GG G+G GG + G + GG +G+ GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 989 AGGAGGAADGPGLFGYDGGAGGAGGIGGAAGVGGSNGAGGTGGAGGVG 1036
G G G G G AA V A T GAGG+
Sbjct: 63 GNGGGNGNSG------GGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 31.6 bits (71), Expect = 0.039
Identities = 34/122 (27%), Positives = 46/122 (37%), Gaps = 5/122 (4%)

Query: 1596 AGDAGNGANGSLLHHAGDGGNGGRGGAGGAGGFAGDGEGSGGTAGSGGNGGKGGNAGAGG 1655
+G G G N +G+ G G G G G G S GG+G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1656 NGVPAAGAAAGNGGLGGSGGAGGSGAIGAAAGGAGGAGGNGGTGGNAGIGVMRGASTPAL 1715
+G GNG GG G GG+ + AA G + G + + GA + A+
Sbjct: 62 HG-----NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAI 116

Query: 1716 AG 1717
A
Sbjct: 117 AD 118



Score = 31.2 bits (70), Expect = 0.047
Identities = 34/112 (30%), Positives = 44/112 (39%), Gaps = 1/112 (0%)

Query: 571 GTGGGGGNGDTYSTGGAGGDGGAGGAAGSAGGAGAGSGGAAGSAGSGGSGGDGGAGGASG 630
G G G N +ST G +GG G G + + + GGSG GG SG
Sbjct: 3 GGDGRGHNTGAHSTSG-NINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 631 RELGSNLGYAGGVGGDGGQGGQGGAAVGGLAGAAGSGGNGGAGGAGGASAVA 682
G G +GG G GG A V A + G GG + A A++
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALS 113


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4319  cloacin       34     0.002    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 33.9 bits (77), Expect = 0.002
Identities = 23/81 (28%), Positives = 30/81 (37%), Gaps = 6/81 (7%)

Query: 458 MGFGNGGGGNTGFY------NSGTYNTGFSNAGETNTGWENSGNVNTGGYNSGGLNTGIG 511
M G+G G NTG + N G G +GW + N GG SG G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 512 SPDTQAGPNSGFGHSGSGNSG 532
G + G SG+G +
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNL 81



Score = 30.8 bits (69), Expect = 0.019
Identities = 23/89 (25%), Positives = 30/89 (33%), Gaps = 11/89 (12%)

Query: 225 GSGNTGSANLGGGNIGNGNLGSGNTGNVNLGNGNNGFFNFGNGNLGDTNFGSGNSGNLNL 284
G G+ A+ GNI G G G G + G+G N G +
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSG-----------WSSENNPWGGGSGSGI 54

Query: 285 GSGNRFGSGNIGFGNRFGDGNFGSGNAGS 313
G G GN G G G+ GN +
Sbjct: 55 HWGGGSGHGNGGGNGNSGGGSGTGGNLSA 83



Score = 30.1 bits (67), Expect = 0.032
Identities = 27/90 (30%), Positives = 38/90 (42%), Gaps = 10/90 (11%)

Query: 378 MGFGNAGDNNVGFFNSGSNNIGFFNSGDGNFGFANAGSTNTGFWNSGGTNTGFGNGGSLN 437
M G+ +N G ++ N N G G S +G W+S G G+G ++
Sbjct: 1 MSGGDGRGHNTGAHSTSGN----INGGPTGLGVGGGASDGSG-WSSENNPWGGGSGSGIH 55

Query: 438 FGFGNGGVENMGHGNAGSFNMGFGNGGGGN 467
+G G G N G N G G+G GGN
Sbjct: 56 WG-GGSGHGNGGGNG----NSGGGSGTGGN 80


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4320  cloacin       36     0.001    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 35.8 bits (82), Expect = 0.001
Identities = 29/98 (29%), Positives = 38/98 (38%), Gaps = 3/98 (3%)

Query: 757 GSGNHGDANLGFGNFGNGNIGSGNHGAGNFGSGNTGSRNLGSGNAGSTNFGSGNHGNSNV 816
G G++ A+ GN G G G G + GSG + N G +GS G G+ N
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 817 GLGNFGNNNLGLGNNGSNN---IGFGLTGDNLVGIGAL 851
G G G N S + FG + G G L
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGL 103



Score = 34.3 bits (78), Expect = 0.003
Identities = 24/77 (31%), Positives = 30/77 (38%)

Query: 717 GFGNIGQANLGSGNAGNTNLGSGNTGSTNFGSGNIGALNLGSGNHGDANLGFGNFGNGNI 776
G G+ A+ SGN G G G + GSG N G G G G+GN
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 777 GSGNHGAGNFGSGNTGS 793
G + G G+G S
Sbjct: 66 GGNGNSGGGSGTGGNLS 82



Score = 34.3 bits (78), Expect = 0.003
Identities = 27/109 (24%), Positives = 40/109 (36%), Gaps = 2/109 (1%)

Query: 737 GSGNTGSTNFGSGNIGALNLGSGNHGDANLGFGNFGNGNIGSGNHGAGNFGSGNTGSRNL 796
G G+ + SGNI G G G A+ G G N G G+G G +G N
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 797 GSGNAGSTNFGSGNHGNSNVGLGNFGNNNLGLGNNGSNNIGFGLTGDNL 845
G G+G + ++ FG L G+ + ++ L
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFP--ALSTPGAGGLAVSISAGAL 112



Score = 33.5 bits (76), Expect = 0.005
Identities = 28/109 (25%), Positives = 41/109 (37%), Gaps = 3/109 (2%)

Query: 942 NSGSYNT-GSFNSGTLNTGDFNGGDHNTGWGNSGNTNTGGINSGDLNTGFGSSADQAVTN 1000
N+G+++T G+ N G G G +GW + N GG SG G S
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGI--HWGGGSGHGNGGG 67

Query: 1001 SGFGNNGSGNSGFNNTGDTNSGFHNANTSALFSGHSGLLNAGGSQSVGI 1049
+G GSG G + F S +G + + G+ S I
Sbjct: 68 NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAI 116



Score = 32.4 bits (73), Expect = 0.013
Identities = 23/78 (29%), Positives = 29/78 (37%)

Query: 727 GSGNAGNTNLGSGNTGSTNFGSGNIGALNLGSGNHGDANLGFGNFGNGNIGSGNHGAGNF 786
G G+ + SGN G G G + GSG + N G G+G G G GN
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 787 GSGNTGSRNLGSGNAGST 804
G G+G S
Sbjct: 66 GGNGNSGGGSGTGGNLSA 83



Score = 30.5 bits (68), Expect = 0.040
Identities = 27/82 (32%), Positives = 34/82 (41%), Gaps = 4/82 (4%)

Query: 707 GSGNTGDANFGFGNIGQANLGSGNAGNTNLGSGNTGSTNFGSGNIGALNLGSGNHGDANL 766
G G+ A+ GNI G G G + GSG + N G G+ G G N
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 767 GFGNFGNGNIGSGNHGAGNFGS 788
G GNGN G G+ GN +
Sbjct: 66 G----GNGNSGGGSGTGGNLSA 83


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4322  GPOSANCHOR    30     0.021    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 30.0 bits (67), Expect = 0.021
Identities = 15/55 (27%), Positives = 29/55 (52%), Gaps = 6/55 (10%)

Query: 43 EVADASKTDMHRAIDAARRAFDETDWSTNRALRKRCLEQLQEAIEAEREELREEL 97
+V +A++ + R +DA+R A + + + LE+ + EA R+ LR +L
Sbjct: 305 QVLNANRQSLRRDLDASREAKKQLE------AEHQKLEEQNKISEASRQSLRRDL 353


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4325  HTHTETR       63     1e-13    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 63.5 bits (154), Expect = 1e-13
Identities = 33/191 (17%), Positives = 66/191 (34%), Gaps = 4/191 (2%)

Query: 16 RRTEILQTAAALIASSGLR-TSLQEIADAAGILPGSLYHHFESKEAILIELIRRYQDDLH 74
R IL A L + G+ TSL EIA AAG+ G++Y HF+ K + E+ + ++
Sbjct: 12 TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIG 71

Query: 75 Q-IGQSWQAKLDQPDSRTVAEKITQLGAAIANCAVAHRAALQMSFYEGPSADPELMKLTS 133
+ + P S I L + + + E + +
Sbjct: 72 ELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQR 131

Query: 134 QRPLAIQEAMLQTLRAGRWSGCIRTEIDLPTLADRI--CQTMLQVGLDMMRRNASADQVS 191
L + + QTL+ + + ++ A + + L ++ + +
Sbjct: 132 NLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKKEA 191

Query: 192 GLMCRTILQGL 202
+L+
Sbjct: 192 RDYVAILLEMY 202



Score = 46.2 bits (109), Expect = 6e-08
Identities = 22/149 (14%), Positives = 46/149 (30%), Gaps = 7/149 (4%)

Query: 234 ADPSDKAAHVRAVARIEFGRKGYEVTTIRDIASASGLATGTVYRVIGSKDKLLASIM-RS 292
+ + H+ VA F ++G T++ +IA A+G+ G +Y K L + I S
Sbjct: 7 QEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELS 66

Query: 293 FGQKVEAGWVAVLRSNATPIEKLDALSWVNINALDQFSDEFRIQLAWMRQSPPDTPNPGW 352
E + P+ L + + + + +
Sbjct: 67 ESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVV 126

Query: 353 LYAARLRQ------MKSLLSEGLRSAEIQ 375
A R ++ L + + +
Sbjct: 127 QQAQRNLCLESYDRIEQTLKHCIEAKMLP 155


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4326  DHBDHDRGNASE  58     4e-12    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 58.1 bits (140), Expect = 4e-12
Identities = 68/277 (24%), Positives = 103/277 (37%), Gaps = 54/277 (19%)

Query: 10 DGKRALIVGGATGMGAAAAKSAAELGAEIIVMDYAPVGYDA-----------AQTLSVDL 58
+GK A I G A G+G A A++ A GA I +DY P + A+ D+
Sbjct: 7 EGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADV 66

Query: 59 RDPASIDSAVERLG---GPVHAVFSAAGVADGPDLMKINFIGHRHLIDRLLANDQLPSGS 115
RD A+ID R+ GP+ + + AGV R L
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVL------------------RPGLIHSLSD-E 107

Query: 116 AVCFISSVAGMGWENDLPRLTEFLATPDYGAAQDWVS--AHEAE-GIIHYGFSKKAINAY 172
SV G N +++++ G+ S A + Y SK A +
Sbjct: 108 EWEATFSVNSTGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMF 167

Query: 173 VATRAYPLLKRGIRINAICPGPTDTPLAQANADLWLT----------FAQDYRDETG--- 219
L + IR N + PG T+T + + LW + ++ TG
Sbjct: 168 TKCLGLELAEYNIRCNIVSPGSTETDMQWS---LWADENGAEQVIKGSLETFK--TGIPL 222

Query: 220 SKVHTPEQMGDVMVFLNSAAAFGISGITLLVDYGHTM 256
K+ P + D ++FL S A I+ L VD G T+
Sbjct: 223 KKLAKPSDIADAVLFLVSGQAGHITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4327  NUCEPIMERASE  36     5e-05    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 36.3 bits (84), Expect = 5e-05
Identities = 34/185 (18%), Positives = 59/185 (31%), Gaps = 35/185 (18%)

Query: 3 RVVVFGGHGKVALLLGHILADRGDQVSSV-----FRNP---DHRDDIAAT-GATPVQADI 53
+ +V G G + + L + G QV + + + R ++ A G + D+
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 54 EGLDTAALAGLLTGH--DAVVFSAGAGG-----GNPARTYAVDRDAAIRVIDAATRAGVQ 106
D + L + V S NP + + +++ +Q
Sbjct: 62 A--DREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQ 119

Query: 107 RFVMVS---YFGAGPNHGVSVDDS----FFPYAQAKAAAD--AHLRASNLD--------W 149
+ S +G S DDS YA K A + AH + +
Sbjct: 120 HLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFF 179

Query: 150 TVLGP 154
TV GP
Sbjct: 180 TVYGP 184


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4331  PHPHTRNFRASE  76     9e-17    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 76.4 bits (188), Expect = 9e-17
Identities = 29/99 (29%), Positives = 46/99 (46%), Gaps = 5/99 (5%)

Query: 389 LAYTDVDEALDAADRGEQVILVRDHTRPEDVSGMLA--AQGIVTEIGGAASHAAVVSREL 446
L + E A E+ +++ + P D + + +G T+IGG SH+A++SR L
Sbjct: 139 LGHLIGVETGSLATIAEETVIIAEDLTPSDTAQLNKQFVKGFATDIGGRTSHSAIMSRSL 198

Query: 447 GRVAVVGCGDGVATSLAGKRITVDGYTGEVREGILAPSA 485
AVVG + G + VDG G V I+ P+
Sbjct: 199 EIPAVVGTKEVTEKIQHGDMVIVDGIEGIV---IVNPTE 234


119  MMAR_4554  MMAR_4562  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_4554  3       12       -2.193058  PPE family protein
MMAR_4555  0       10       -0.386198  hypothetical protein
MMAR_4556  3       10       1.459633   secreted antigen 85-C FbpC_2
MMAR_4557  5       12       2.882806   glucose-6-phosphate isomerase
MMAR_4558  8       13       3.687757   short chain dehydrogenase
MMAR_4559  7       12       3.490230   formamidopyrimidine-DNA glycosylase
MMAR_4560  6       12       3.519985   PE-PGRS family protein
MMAR_4561  3       13       2.886049   PE-PGRS family protein
MMAR_4562  1       13       0.892359   PE-PGRS family protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4554  cloacin       35     0.001    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 34.7 bits (79), Expect = 0.001
Identities = 25/90 (27%), Positives = 34/90 (37%), Gaps = 9/90 (10%)

Query: 214 GGNIGNFNFGSGNRGGNVNFGNGNNGFFNLGGGNIGSNNFGSGNRGNGNIGFGNYQSTGG 273
GG+ N G+ + GN+N G LG G S+ G + N G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTG-----LGVGGGASDGSGWSSENNPWGGGSGSGIHWG 57

Query: 274 ANIGGGNSGSGNKGFGNTGNYNIGSGNFGS 303
G GN G GN+G + GN +
Sbjct: 58 GGSGHGNGGGN----GNSGGGSGTGGNLSA 83



Score = 33.9 bits (77), Expect = 0.002
Identities = 26/86 (30%), Positives = 38/86 (44%)

Query: 189 AGNLGFGNTGIANLGNGNTGNLNFGGGNIGNFNFGSGNRGGNVNFGNGNNGFFNLGGGNI 248
+G G G+ A+ +GN G G G + GSG N +G G+ + GGG+
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 249 GSNNFGSGNRGNGNIGFGNYQSTGGA 274
N G+GN G G+ GN +
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAP 87



Score = 31.6 bits (71), Expect = 0.012
Identities = 22/72 (30%), Positives = 25/72 (34%)

Query: 272 GGANIGGGNSGSGNKGFGNTGNYNIGSGNFGSFNFGDGNRGSNNFGFGNTNSGNVGFGNL 331
GA+ GN G G G G + GSG N G GS G + GN G
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGN 70

Query: 332 GANNVGFGNLGS 343
G G S
Sbjct: 71 SGGGSGTGGNLS 82



Score = 30.8 bits (69), Expect = 0.019
Identities = 22/83 (26%), Positives = 32/83 (38%), Gaps = 5/83 (6%)

Query: 495 SGSDNTGFLNSGSVNTGFLNSGSTNTGAGNSGEVNTGFGIATD--SGATNSG---FGNTG 549
SG D G +G +N G T G G +G+ + G + SG G +G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 550 SGNSGFNNDGNDNSGFQNTGTSS 572
GN G N + SG ++
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAV 84



Score = 30.1 bits (67), Expect = 0.036
Identities = 28/83 (33%), Positives = 35/83 (42%), Gaps = 8/83 (9%)

Query: 203 GNGNTGNLNFGGGNIGNFNFGSGNRGGNVNFGNGNNGFFNLGGGNIGSNNFGSGNRGNGN 262
G G+ + GNI G G GG + G+G + N GG GS G G+GN
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGG-ASDGSGWSSENNPWGGGSGSGIHWGGGSGHGN 64

Query: 263 IGFGNYQSTGGANIGGGNSGSGN 285
G G N GGG+ GN
Sbjct: 65 GG-------GNGNSGGGSGTGGN 80



Score = 29.7 bits (66), Expect = 0.045
Identities = 25/86 (29%), Positives = 32/86 (37%)

Query: 461 GLGNAGSFNMGFGNAGSGNVGYENAGGANVGFGNSGSDNTGFLNSGSVNTGFLNSGSTNT 520
G G+ + GN G G GGA+ G G S +N SGS SG N
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 521 GAGNSGEVNTGFGIATDSGATNSGFG 546
G + +G G + A FG
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 29.7 bits (66), Expect = 0.046
Identities = 24/80 (30%), Positives = 27/80 (33%)

Query: 244 GGGNIGSNNFGSGNRGNGNIGFGNYQSTGGANIGGGNSGSGNKGFGNTGNYNIGSGNFGS 303
GG G N GN N G GGA+ G G S N G +G+ G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 304 FNFGDGNRGSNNFGFGNTNS 323
N G G G S
Sbjct: 63 GNGGGNGNSGGGSGTGGNLS 82



Score = 29.7 bits (66), Expect = 0.047
Identities = 22/81 (27%), Positives = 30/81 (37%)

Query: 283 SGNKGFGNTGNYNIGSGNFGSFNFGDGNRGSNNFGFGNTNSGNVGFGNLGANNVGFGNLG 342
SG G G+ + SGN G G G + G G ++ N G G+ G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 343 SGNVGFGNTGNNNFGIGLSGN 363
GN G G G + +
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLS 82


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4558  DHBDHDRGNASE  64     2e-14    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 64.3 bits (156), Expect = 2e-14
Identities = 53/190 (27%), Positives = 87/190 (45%), Gaps = 9/190 (4%)

Query: 6 ILITGASSGLGAGMARAFAARGRDLALCARRTDRLEELKSELAQ--KHPEITIAIAELDV 63
ITGA+ G+G +AR A++G +A ++LE++ S L +H E DV
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAF----PADV 66

Query: 64 NDHDQVPKVFAELRDELGGIDRVIVNAGIGKGAPLGSGKLWANKATIETNLVAALVQIET 123
D + ++ A + E+G ID ++ AG+ + + S +AT N +
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126

Query: 124 ALEMFHKSGSGHLVLISSVLASKGVPGVK-AAYAASKAGLSSLGESLRAEYDKGPITVSV 182
+ SG +V + S A GVP AAYA+SKA + L E + I ++
Sbjct: 127 VSKYMMDRRSGSIVTVGSNPA--GVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184

Query: 183 MEPGYIESEM 192
+ PG E++M
Sbjct: 185 VSPGSTETDM 194


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_4560  cloacin  46     2e-07    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 45.9 bits (108), Expect = 2e-07
Identities = 35/101 (34%), Positives = 43/101 (42%), Gaps = 4/101 (3%)

Query: 210 GLGGNGGTVGTGQSTNGGAGGDGGSGGSAGLFGGGGAGALGGDGGNGVGSDGSGGGAGSG 269
G G N G T + NGG G G GG++ G G G G G G G+G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 270 GDGGNGGFFYGDGGNGADAGSPGAGQSSFGSLGIAGEGDGG 310
G GN G G GGN + +P A FG ++ G GG
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVA----FGFPALSTPGAGG 102



Score = 37.4 bits (86), Expect = 8e-05
Identities = 31/83 (37%), Positives = 37/83 (44%), Gaps = 3/83 (3%)

Query: 188 GNGGNGGNAGLLQGVAG-NGAAGGLGGNGG-TVGTGQSTNGGAGGDGGSGGSAGLFGGGG 245
G G G N G NG GLG GG + G+G S+ G GGSG GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWG-GGSGSGIHWGGGSG 61

Query: 246 AGALGGDGGNGVGSDGSGGGAGS 268
G GG+G +G GS G +
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAV 84



Score = 35.8 bits (82), Expect = 2e-04
Identities = 35/113 (30%), Positives = 40/113 (35%), Gaps = 16/113 (14%)

Query: 228 AGGDG-GSGGSAGLFGGGGAGALGGDGGNGVGSDGSGGGAGSGGDGGNGGFFYGDGGNGA 286
+GGDG G A G G G G G SDGSG + + GG G GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 287 DAGSPGAGQSSFGSLGIAGEGDGGDGGNAFLIGNGGNGGAAAAFGFPGFGGNG 339
G G+G GG + GN A AFGFP G
Sbjct: 62 HGN---------------GGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPG 99



Score = 33.5 bits (76), Expect = 0.001
Identities = 36/129 (27%), Positives = 43/129 (33%), Gaps = 13/129 (10%)

Query: 125 AAGTGQNGGDGGWLIGSGGRGGSGGVGQKGG-NGGSAGLWGNGGNGGLGGEGVQGGPGHP 183
+ G G+ G GG G+G GG + GS N GG G G+ G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 184 GQAGGNGGNGGNAGLLQGVAGNGAAGGLGGNGGTVGTGQSTNGGAGGDGGSGGSAGLFGG 243
GG GN G G GGN V + A G+GG A
Sbjct: 62 HGNGGGNGNSGG------------GSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109

Query: 244 GGAGALGGD 252
G A D
Sbjct: 110 GALSAAIAD 118


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_4561  cloacin  39     4e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 39.3 bits (91), Expect = 4e-05
Identities = 37/92 (40%), Positives = 41/92 (44%), Gaps = 3/92 (3%)

Query: 356 GGDGAQGGNGGKAGFFYGNGGNGGFGGNGANGGDNSGTSSAVNGAGGMGGWGGAGGQAGL 415
GGDG G + NGG G G G D SG SS N GG G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGG-ASDGSGWSSENNPWGGGSGSGIHWGGGS- 60

Query: 416 IGNGGTGGAGGSGGAGGSGGTENGDAGPGGFG 447
G+G GG G SGG G+GG + A P FG
Sbjct: 61 -GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 38.2 bits (88), Expect = 9e-05
Identities = 42/137 (30%), Positives = 53/137 (38%), Gaps = 11/137 (8%)

Query: 185 LTGGNGGIGGAGGFLYGLGGNGGIGGHGGDGGAAIGTGTDGGNGGNGGLAGAGGLLFGNG 244
++GG+G G NGG G G GGA+ G+G N GG +G+G G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 245 GVGGQGGDGGDATGGTATFSGGLAGSGGNGGNGGQSGWLYGNGGDGGNSGSGGTFESAGG 304
G G GG+G SGG GSG G + + G+GG S
Sbjct: 61 GHGNGGGNGN---------SGG--GSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109

Query: 305 SVLSGAQGGGFAASAGN 321
LS A AA G
Sbjct: 110 GALSAAIADIMAALKGP 126



Score = 37.4 bits (86), Expect = 1e-04
Identities = 30/97 (30%), Positives = 36/97 (37%)

Query: 154 GGNGGSAGLLGNGGAGGAGGAGASGAAGDSGLTGGNGGIGGAGGFLYGLGGNGGIGGHGG 213
G N G+ GN G G GA+ SG + N GG G GG G G GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 214 DGGAAIGTGTDGGNGGNGGLAGAGGLLFGNGGVGGQG 250
+G + G+GT G G G GG
Sbjct: 68 NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 37.4 bits (86), Expect = 2e-04
Identities = 30/86 (34%), Positives = 38/86 (44%), Gaps = 5/86 (5%)

Query: 397 VNGAGGMGGWGGAGGQAGLIGNGGTGGAGGSGGAGGSG-GTENGDAGPGGFGGHGGDALL 455
++G G G GA +G I G TG G G + GSG +EN G GG G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWG----GGSGSGIHW 56

Query: 456 FGNGGNGANGGNTGAPGTLSGGGTGT 481
G G+G GGN + G GG +
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGTGGNLS 82



Score = 37.0 bits (85), Expect = 2e-04
Identities = 33/97 (34%), Positives = 43/97 (44%), Gaps = 3/97 (3%)

Query: 226 GNGGNGGLAGAGGLLFGNGGVGGQGGDGGDATGGTATFSGGLAGSGGNGGNGGQSGWLYG 285
G G N G G + G G GG D +G ++ + GSG GG SG +G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG--HG 63

Query: 286 NGGDGGNSGSGGTFESAGGSVLSGAQGGGFAASAGNG 322
NGG GNSG GG+ S ++ GF A + G
Sbjct: 64 NGGGNGNSG-GGSGTGGNLSAVAAPVAFGFPALSTPG 99



Score = 35.5 bits (81), Expect = 7e-04
Identities = 30/83 (36%), Positives = 37/83 (44%), Gaps = 6/83 (7%)

Query: 269 GSGGNGGNGGQSGWLYGNGGDGGNSGSGGTFESAGGSVLSGAQGGGFAASAGNGGNSGLF 328
G G N G SG + NGG G GG + +G S + GGG + GG SG
Sbjct: 6 GRGHNTGAHSTSGNI--NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG-- 61

Query: 329 GNGGSGGNGGNGGLGQAASGDDS 351
G+GG GN G G G+ S
Sbjct: 62 --HGNGGGNGNSGGGSGTGGNLS 82



Score = 33.1 bits (75), Expect = 0.003
Identities = 34/114 (29%), Positives = 42/114 (36%), Gaps = 3/114 (2%)

Query: 320 GNGGNSGLFGNGGSGGNGGNGGLGQAASGDDSQGGIGGDGAQGGNGGKAGFFYGNGG--N 377
G G N+G G+ NGG GLG D G + GG G + G G N
Sbjct: 6 GRGHNTGAHSTSGNI-NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGN 64

Query: 378 GGFGGNGANGGDNSGTSSAVNGAGGMGGWGGAGGQAGLIGNGGTGGAGGSGGAG 431
GG GN G G SAV G + AG + + GA + A
Sbjct: 65 GGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 30.8 bits (69), Expect = 0.014
Identities = 36/113 (31%), Positives = 49/113 (43%), Gaps = 7/113 (6%)

Query: 379 GFGGNGANGGDNSGTSSAVNGAGGMGGWGGAGGQAGLIGNGGTGGAGGSGGAGGSGGTEN 438
G G G N G +S + + G G+G GGA G+G + GG GSG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASD-----GSGWSSENNPWGGGSGSGIHWG 57

Query: 439 GDAGPGGFGGHGGDALLFGNGGNGANGGNTGAPG--TLSGGGTGTVYLTSNGG 489
G +G G GG+G G GGN + A G LS G G + ++ + G
Sbjct: 58 GGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 30.5 bits (68), Expect = 0.024
Identities = 22/61 (36%), Positives = 31/61 (50%), Gaps = 1/61 (1%)

Query: 131 SGGDGGWLLGNGGNGGSGAAGQAGGNGGSAGL-LGNGGAGGAGGAGASGAAGDSGLTGGN 189
+GG G +G G + GSG + + GG +G + GG G G G +G +G TGGN
Sbjct: 21 NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGN 80

Query: 190 G 190

Sbjct: 81 L 81


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_4562  cloacin  34     0.002    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 33.5 bits (76), Expect = 0.002
Identities = 26/73 (35%), Positives = 31/73 (42%)

Query: 154 GGNGGSAGLLGNGGNGGAGGAGAAGASGVSGQAGGSGGMGGSGGLLYGLGGTGGLGGLGG 213
G N G+ GN G G GAS SG + + GG G GG G G GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 214 NGGAATTAGTNGG 226
NG + +GT G
Sbjct: 68 NGNSGGGSGTGGN 80



Score = 33.5 bits (76), Expect = 0.002
Identities = 31/104 (29%), Positives = 40/104 (38%), Gaps = 15/104 (14%)

Query: 207 GLGGLGGNGGAATTAGTNGGAGGDGGLGGRGGFLFGDGGLGGQGGDGGDATGSTNAAGTG 266
G G G N GA +T+G G G+GG G G G + + G+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGG-----------GASDGSGWSSENNPWGGGSG 51

Query: 267 GTAGSAGNGGDGGHSGWLYGNGGNGGNTGNGGTFQASADTILSG 310
G G G G NG +GG +G GG A A + G
Sbjct: 52 SGIHWGGGSGHGNGGG----NGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 33.5 bits (76), Expect = 0.002
Identities = 27/80 (33%), Positives = 32/80 (40%), Gaps = 2/80 (2%)

Query: 406 GFGGTGGQGGLFGNGGN--GGSGGNGGDGGGGARNGADAGPGALGGQGGSALLFGNGGAG 463
G G G G GN GG G G GG +G + GG GS + +G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 464 GDGGATGTPGTYSDGTGTGS 483
G+GG G G S G S
Sbjct: 63 GNGGGNGNSGGGSGTGGNLS 82



Score = 32.8 bits (74), Expect = 0.004
Identities = 27/102 (26%), Positives = 32/102 (31%)

Query: 354 AGAGGDGAFGGNGGNGGFIWGNGGVGGDAGDGAAGGAGTNNFQSYAGAGGFGGFGGTGGQ 413
+G G G G G I G G G + G ++ + G G G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 414 GGLFGNGGNGGSGGNGGDGGGGARNGADAGPGALGGQGGSAL 455
G G GN G G G G AL G L
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGL 103



Score = 32.4 bits (73), Expect = 0.005
Identities = 27/90 (30%), Positives = 33/90 (36%)

Query: 186 AGGSGGMGGSGGLLYGLGGTGGLGGLGGNGGAATTAGTNGGAGGDGGLGGRGGFLFGDGG 245
+GG G +G GG GLG GGA+ +G + GG G G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 246 LGGQGGDGGDATGSTNAAGTGGTAGSAGNG 275
G GG+G GS A G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 32.4 bits (73), Expect = 0.006
Identities = 37/107 (34%), Positives = 44/107 (41%), Gaps = 22/107 (20%)

Query: 250 GGDG-GDATGSTNAAGT--GGTAGSAGNGGDGGHSGWLYGNGGNGGNTGNGGTFQASADT 306
GGDG G TG+ + +G GG G GG SGW N GG +G+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSG--------- 53

Query: 307 ILSGAGGGNFAVTGGNGGDSGLFGMGGAGGNGGNGGAGQAAAGDVTF 353
GGG+ GG G+S GG G GG A A V F
Sbjct: 54 --IHWGGGSGHGNGGGNGNS--------GGGSGTGGNLSAVAAPVAF 90



Score = 31.6 bits (71), Expect = 0.010
Identities = 32/93 (34%), Positives = 39/93 (41%), Gaps = 6/93 (6%)

Query: 274 NGGDG-GHSGWLYGNGGN--GGNTGNGGTFQASADTILSGAGGGNFAVTGGNGGDSGLFG 330
+GGDG GH+ + GN GG TG G AS SG N GG+G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDG---SGWSSENNPWGGGSGSGIHWGG 58

Query: 331 MGGAGGNGGNGGAGQAAAGDVTFAGAGGDGAFG 363
G G GGNG +G + + AFG
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 31.2 bits (70), Expect = 0.011
Identities = 22/78 (28%), Positives = 27/78 (34%)

Query: 120 NGANGAAGTGAAGGDGGWLYGNGGNGGSGAAGQAGGNGGSAGLLGNGGNGGAGGAGAAGA 179
+G +G A G + G G G G S GG+G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 180 SGVSGQAGGSGGMGGSGG 197
G G G SGG G+GG
Sbjct: 62 HGNGGGNGNSGGGSGTGG 79



Score = 31.2 bits (70), Expect = 0.013
Identities = 23/76 (30%), Positives = 31/76 (40%), Gaps = 2/76 (2%)

Query: 120 NGANGAAGTGAAGGDGGWLYGNGGNGGSGAAGQAGGNGGSAGLLGNGGNGGAGGAGAAGA 179
N + GG G G G + GSG + + GG +G GG G G G
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSG--SGIHWGGGSGHGNGGG 67

Query: 180 SGVSGQAGGSGGMGGS 195
+G SG G+GG +
Sbjct: 68 NGNSGGGSGTGGNLSA 83



Score = 31.2 bits (70), Expect = 0.014
Identities = 32/108 (29%), Positives = 39/108 (36%), Gaps = 6/108 (5%)

Query: 422 NGGSGGNGGDGGGGARNGADAGPGALGGQGGSALLFGNGGAGGDGGATGTPGTYSDGTGT 481
+GG G G + GP LG GG + G+G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGG-----ASDGSGWSSENNPWGGGSGSGIHW 56

Query: 482 GSVYQTSDGGGGGDGGFFFGNGGKGGA-AGSIPAGFMAPSENGIGGTG 528
G +GGG G+ G G GG A A + GF A S G GG
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 30.5 bits (68), Expect = 0.021
Identities = 27/90 (30%), Positives = 35/90 (38%), Gaps = 2/90 (2%)

Query: 120 NGANGAAGTGAAGGDG-GWLYGNGGNGG-SGAAGQAGGNGGSAGLLGNGGNGGAGGAGAA 177
NG G G DG GW N GG SG+ GG G GNG +GG G G
Sbjct: 21 NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGN 80

Query: 178 GASGVSGQAGGSGGMGGSGGLLYGLGGTGG 207
++ + A G + G + + G
Sbjct: 81 LSAVAAPVAFGFPALSTPGAGGLAVSISAG 110


120  MMAR_4626  MMAR_4640  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_4626  -1      16       -2.092120  acetyl-CoA carboxylase carboxyl transferase
MMAR_4627  -2      17       -2.894311  hypothetical protein
MMAR_4628  -1      23       -2.759911  8-amino-7-oxononanoate synthase
MMAR_4629  -2      19       -1.045918  cysteine synthase B
MMAR_4630  -1      15       -0.894173  membrane-bound C-5 sterol desaturase
MMAR_4631  1       14       -0.003153  transcriptional regulatory protein
MMAR_4632  2       11       -0.443916  putative FAD-binding dehydrogenase
MMAR_4633  2       10       -0.624095  two-component response transcriptional
MMAR_4634  0       9        0.195880   two-component sensor histidine kinase PrrB
MMAR_4635  1       12       -0.227795  exported or membrane protein
MMAR_4636  -2      9        -1.344536  hypothetical protein
MMAR_4637  -3      9        -1.085187  outer membrane protein OmpA
MMAR_4638  -1      10       -0.348350  hypothetical protein
MMAR_4639  -3      8        -0.777198  oxidoreductase
MMAR_4640  -2      10       -1.619981  type II citrate synthase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4626  ARGDEIMINASE  30     0.019    Bacterial arginine deiminase signature.
		>ARGDEIMINASE#Bacterial arginine deiminase signature.

Length = 409

Score = 30.2 bits (68), Expect = 0.019
Identities = 10/35 (28%), Positives = 16/35 (45%), Gaps = 1/35 (2%)

Query: 149 GVFASWGSLGHVTVAEPGALIGFLGP-RVYELLYD 182
+F+ G L V + PG + L P + L+D
Sbjct: 9 NIFSEIGRLKKVLLHRPGEELENLTPFIMKNFLFD 43


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_4627  TCRTETB  88     8e-21    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 88.0 bits (218), Expect = 8e-21
Identities = 81/408 (19%), Positives = 151/408 (37%), Gaps = 33/408 (8%)

Query: 26 FIVYLDTTVLLVAFGAISESFPEASSSARSWVLDAYFIVFAALMVPGGRWADQFGSRNVF 85
F L+ VL V+ I+ F ++ +WV A+ + F+ G+ +DQ G + +
Sbjct: 24 FFSVLNEMVLNVSLPDIANDF-NKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLL 82

Query: 86 AIGVSTFILSSAGCAVAPTLGA-LVAARAAQAVGAALMGPASLALILPYFGRGSRATAVS 144
G+ S V + + L+ AR Q GAA + ++ Y + +R A
Sbjct: 83 LFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFG 142

Query: 145 LWGTSAALAAALGPPLGGFLADTVGWRGIFLINVPIGLAV-LAGLRNVDNRGDAVAGQLV 203
L G+ A+ +GP +GG +A + W +L+ +P+ + + L + + + G
Sbjct: 143 LIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKEVRIKGHFD 200

Query: 204 NTSAIVLIASGVGALTAGILEGPSWGWGRRRTLLLLIAGAILLAAAMVRVARHHRRAE-P 262
I++ + +L S+ I+ + + +H R+ P
Sbjct: 201 IKGIILMSV----GIVFFMLFTTSYSISF----------LIVSVLSFLIFVKHIRKVTDP 246

Query: 263 IHDFD--KGRFFAANAATA--IFGAGFYGLLLAVVFFLTSHWHYSTFEAG-LAMMPIFVA 317
D K F IFG G + V + + ST E G + + P ++
Sbjct: 247 FVDPGLGKNIPFMIGVLCGGIIFGT-VAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMS 305

Query: 318 AALAAIPAGRIADARGHRWAVIPGCWVFTLGVFLFWLLTTSRADYASRWLP--GSILCGI 375
+ G + D RG + + G ++ LT S + W +
Sbjct: 306 VIIFGYIGGILVDRRGPLYVLNIGVTFLSVS-----FLTASFLLETTSWFMTIIIVFVLG 360

Query: 376 GIGCVMPVLASAAIDAMPGQLLGTANALNSMLRQFGAALGTAAVGTLL 423
G+ V+++ ++ Q G +L + G A VG LL
Sbjct: 361 GLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLL 408


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_4631  HTHTETR  58     7e-13    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 58.5 bits (141), Expect = 7e-13
Identities = 40/180 (22%), Positives = 67/180 (37%), Gaps = 7/180 (3%)

Query: 11 ARVTKRRAE-TRARLVDAAFRVFADKGFGHVRIEDVCAAAGYTRGAFYSQFDSLEELFFT 69
AR TK+ A+ TR ++D A R+F+ +G + ++ AAG TRGA Y F +LF
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 70 LYDQRATLISEQVGTAMASV-DDPTDVPGTVDRIASTLLLDRDWLLIKTDFLMHAARHPD 128
+++ + I E A DP V + + + + + + H
Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVG 121

Query: 129 LAQRLAAHRAQLRAAVEDRLAGSDVELPAAIGSVAD-----AARAVVAAYDGVSIQLLLD 183
+ + L DR+ + A AD AA + G+ L
Sbjct: 122 EMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFA 181


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
MMAR_4633  HTHFIS  102    2e-27    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 102 bits (257), Expect = 2e-27
Identities = 35/119 (29%), Positives = 58/119 (48%)

Query: 11 PRVLVVDDDSDVLASLERGLRLSGFEVSTAVDGAEALRSATETRPDAIVLDINMPVLDGV 70
+LV DDD+ + L + L +G++V + A R D +V D+ MP +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 71 SVVTALRAMDNDVPVCVLSARSSVDDRVAGLEAGADDYLVKPFVLAELVARVKALLRRR 129
++ ++ D+PV V+SA+++ + E GA DYL KPF L EL+ + L
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_4634  PF06580  36     3e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 35.6 bits (82), Expect = 3e-04
Identities = 19/101 (18%), Positives = 39/101 (38%), Gaps = 22/101 (21%)

Query: 354 IANAVKHGGSTR-----VQLSAVSSRAGVEIAVDDNGSGVPEAERQMVFERFSRGSTASH 408
+ N +KHG + + L V + V++ GS + ++
Sbjct: 264 VENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE-------------- 309

Query: 409 SGSGLGLALVAQQ-AHLHGGTASLQ-NSPLGGARLLLKLPG 447
+G GL V ++ L+G A ++ + G ++ +PG
Sbjct: 310 -STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIPG 349


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_4636  IGASERPTASE  29     0.001    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 29.3 bits (65), Expect = 0.001
Identities = 13/54 (24%), Positives = 21/54 (38%)

Query: 24 IVTVSIKPVATTAEAVEAEQTEQPEQTEQTEQTEQTEQTEEPDPAESHQTEVAQ 77
I V PV A A +E TE + + E + ++ + EVA+
Sbjct: 1017 IARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAK 1070



Score = 25.0 bits (54), Expect = 0.039
Identities = 8/45 (17%), Positives = 16/45 (35%)

Query: 33 ATTAEAVEAEQTEQPEQTEQTEQTEQTEQTEEPDPAESHQTEVAQ 77
T Q + P E+ + ++ P PA + +E +
Sbjct: 994 TTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTE 1038


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
MMAR_4637  OMPADOMAIN  105    4e-28    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 105 bits (263), Expect = 4e-28
Identities = 46/146 (31%), Positives = 69/146 (47%), Gaps = 16/146 (10%)

Query: 197 GQPAGSTPPTGPAATGACADLQAAVTALTGGAIAFGNDGVSLTPDSNKVLTQVVDKLRAC 256
G+ A P A ++Q L + F + +L P+ L Q+ +L
Sbjct: 194 GEAAPVVAPAPAPA----PEVQTKHFTLKS-DVLFNFNKATLKPEGQAALDQLYSQLSNL 248

Query: 257 --PDAKVTVNGYTDNSGSEGLNIPLSAQRAQTVADFLVAHGVPTDHITAKGLGSANPIAS 314
D V V GYTD GS+ N LS +RAQ+V D+L++ G+P D I+A+G+G +NP+
Sbjct: 249 DPKDGSVVVLGYTDRIGSDAYNQGLSERRAQSVVDYLISKGIPADKISARGMGESNPVTG 308

Query: 315 NDTAEGR---------IKNRRVEIVV 331
N + +RRVEI V
Sbjct: 309 NTCDNVKQRAALIDCLAPDRRVEIEV 334


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4640  56KDTSANTIGN  30     0.028    Rickettsia 56kDa type-specific antigen protein sign...
		>56KDTSANTIGN#Rickettsia 56kDa type-specific antigen protein signature.

Length = 533

Score = 29.5 bits (66), Expect = 0.028
Identities = 13/35 (37%), Positives = 19/35 (54%)

Query: 97 LPNTDQLAQFTGRIQRHTMLHEDLKRFFDGFPRNA 131
LPN+ + Q +IQ E+L+ FDG+ NA
Sbjct: 291 LPNSASIEQIQSKIQELGDTLEELRDSFDGYINNA 325


121  MMAR_4752  MMAR_4756  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_4752  -2      10       -0.690993  transcriptional regulatory protein
MMAR_4753  -1      10       -1.387889  cytochrome P450 189A7 Cyp189A7
MMAR_4754  -2      12       -0.360511  short-chain type dehydrogenase/reductase
MMAR_4755  -1      11       0.065806   short chain dehydrogenase
MMAR_4756  -1      10       -0.632065  oxidoreductase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_4752  HTHTETR  62     6e-14    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 62.0 bits (150), Expect = 6e-14
Identities = 31/164 (18%), Positives = 62/164 (37%), Gaps = 5/164 (3%)

Query: 20 KTAKLRAAQRVQRFLDAAQAIIIEKGSTDFTVQEVVDRSRQSLRSFYLQFDGKHELLLAL 79
+ K A + Q LD A + ++G + ++ E+ + + + Y F K +L +
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 80 FEDALSRSADQIRAATE-THADPLERLKVAVELLYEASRPDPTAKRPLFTDFAPRLLVSH 138
+E + S + DPL L+ + + E++ + + + F V
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 139 PAEV----KVAHAPLVALLTELMEAAAEAGELREDLNPRRIAAM 178
A V + + + ++ EA L DL RR A +
Sbjct: 123 MAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAII 166


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4754  DHBDHDRGNASE  100    2e-27    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 100 bits (250), Expect = 2e-27
Identities = 82/282 (29%), Positives = 121/282 (42%), Gaps = 36/282 (12%)

Query: 9 VAGKVAFITGAARGQGRSHAVRLAQEGADIIAVDVCKPIVENTTIPASTPEDLAETADLV 68
+ GK+AFITGAA+G G + A LA +GA I AVD P E S+ + A A+
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVD-YNP--EKLEKVVSSLKAEARHAEAF 62

Query: 69 KGHNRRIFTAEADVRDYDALKAAVDAGVDELGRLDIIVANAGIGNGGATLDKTSEHDWQE 128
ADVRD A+ E+G +DI+V AG+ G S+ +W+
Sbjct: 63 P----------ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLI-HSLSDEEWEA 111

Query: 129 MIDVNLSGVWKSVKAAVPHLIAGGNGGSIVLTSSVGGMKAYPHCGNYVAAKHGVVGLMRS 188
VN +GV+ + ++ +++ GSIV S Y ++K V +
Sbjct: 112 TFSVNSTGVFNASRSVSKYMMD-RRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKC 170

Query: 189 FAVELGQHMIRVNSVHPTHVRTPM-----LHNEGTFKMFRPDLENPGPDDMAPICQLFHT 243
+EL ++ IR N V P T M G ++ + LE
Sbjct: 171 LGLELAEYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLET-------------FK 217

Query: 244 LPIPW---VEAEDISNAVLFLASDESRYITGVTLPVDAGGCL 282
IP + DI++AVLFL S ++ +IT L VD G L
Sbjct: 218 TGIPLKKLAKPSDIADAVLFLVSGQAGHITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4755  DHBDHDRGNASE  85     1e-21    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 85.1 bits (210), Expect = 1e-21
Identities = 49/188 (26%), Positives = 84/188 (44%), Gaps = 1/188 (0%)

Query: 3 GFAGRGAVITGGASGIGLATATEFARRGAKVVLADIDKPGLERSVEHLRGKGFDAHGVMC 62
G G+ A ITG A GIG A A A +GA + D + LE+ V L+ + A
Sbjct: 5 GIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPA 64

Query: 63 DVRHLSEVNHLAAESARLLGQIDVVFSNAGIVVAGPIADMTHDDWRWVIDIDLWGSIHTV 122
DVR + ++ + A R +G ID++ + AG++ G I ++ ++W ++ G +
Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNAS 124

Query: 123 EAFLPKLL-EHGGHIAFTASFAGLVPNAGLGAYGVAKYGVVGLAETLSREMKDRGVGVTV 181
+ ++ G I S VP + AY +K V + L E+ + + +
Sbjct: 125 RSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184

Query: 182 LCPMVVET 189
+ P ET
Sbjct: 185 VSPGSTET 192


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4756  DHBDHDRGNASE  106    1e-29    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 106 bits (266), Expect = 1e-29
Identities = 74/255 (29%), Positives = 119/255 (46%), Gaps = 19/255 (7%)

Query: 10 LNGRVAVVTGAGSGIGRGVAAGLAAFGASVAIWERDPQTCAQAAAELGGLG-----IATD 64
+ G++A +TGA GIG VA LA+ GA +A + +P+ + + L D
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 65 VRSGDQVDAALQRTQAELGEVTILVNNVGGVFWSPLLQTSEKGWDALYRSNLGHVLLCTQ 124
VR +D R + E+G + ILVN G + + S++ W+A + N V ++
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 125 RVARRLVAAELPGSIMAITSIEGVRAAPGYAAYAAAKAGVVNYTKTAALELAHHGIRVNA 184
V++ ++ GSI+ + S AAYA++KA V +TK LELA + IR N
Sbjct: 126 SVSKYMMDRR-SGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184

Query: 185 IAPDITMTEGLAQL-----------GGAAALERIGSLVPLGRPGDIDEIASVAVFLASDM 233
++P T T+ L G+ + G +PL + +IA +FL S
Sbjct: 185 VSPGSTETDMQWSLWADENGAEQVIKGSLETFKTG--IPLKKLAKPSDIADAVLFLVSGQ 242

Query: 234 ARYVTGQTIHVDGGT 248
A ++T + VDGG
Sbjct: 243 AGHITMHNLCVDGGA 257


122  MMAR_4764  MMAR_4767  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_4764  -1      12       -0.342975  hypothetical protein
MMAR_4765  -1      10       -0.599418  short chain dehydrogenase
MMAR_4766  -1      10       -0.596513  hypothetical protein
MMAR_4767  -2      10       -1.116335  integral membrane transport protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_4764  HTHTETR  53     1e-10    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 52.7 bits (126), Expect = 1e-10
Identities = 26/108 (24%), Positives = 46/108 (42%), Gaps = 17/108 (15%)

Query: 11 ERASSTQEAILVAAERLFAEHGVFAVSNRQVSEAAGQGNNAAVGYHFGTKTDLVRAI--- 67
+ A T++ IL A RLF++ GV + S ++++AAG A+ +HF K+DL I
Sbjct: 7 QEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGV-TRGAIYWHFKDKSDLFSEIWEL 65

Query: 68 -------------EQKHRVPVERLREQMVAAAAAKGVAATMRDWVACL 102
+ P+ LRE ++ + R + +
Sbjct: 66 SESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEII 113


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4765  DHBDHDRGNASE  115    3e-33    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 115 bits (289), Expect = 3e-33
Identities = 80/259 (30%), Positives = 123/259 (47%), Gaps = 7/259 (2%)

Query: 5 LAGKIAIVTGGASGIGRATVARFIAEGARVVIADVEEERGESLAAALGADAMFC---RTD 61
+ GKIA +TG A GIG A ++GA + D E+ E + ++L A+A D
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 62 VSQPEQVAAVVAAAVENFGGLHVMVNNAGVSGVMHRRFLDDDLADFHRVMAVNVLGVMAG 121
V + + A G + ++VN AGV L D+ ++ +VN GV
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDE--EWEATFSVNSTGVFNA 123

Query: 122 TRDAARHMAAHGGGSIVNLTSIGGIQAGGGVMTYRASKAAVIQFTKSAAIELAHYEIRVN 181
+R +++M GSIV + S + Y +SKAA + FTK +ELA Y IR N
Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183

Query: 182 AIAPGNIPTPLLASSAAGMDQEQVERFTAQIRQTMREDRPLKREGTPEDIAEAALYFAGE 241
++PG+ T + S A D+ E+ +T + PLK+ P DIA+A L+
Sbjct: 184 IVSPGSTETDMQWSLWA--DENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSG 241

Query: 242 RSRYVTGTVLPVDGGTVAG 260
++ ++T L VDGG G
Sbjct: 242 QAGHITMHNLCVDGGATLG 260


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4766  NUCEPIMERASE  31     0.003    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 31.3 bits (71), Expect = 0.003
Identities = 13/29 (44%), Positives = 15/29 (51%)

Query: 1 MKILVIGGSGLIGSQVVAQLTGLAHQAVA 29
MK LV G +G IG V +L HQ V
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVG 29


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_4767  TCRTETA  56     9e-11    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 56.0 bits (135), Expect = 9e-11
Identities = 64/294 (21%), Positives = 108/294 (36%), Gaps = 9/294 (3%)

Query: 11 NRPSRVLMINQFGINVGFYMLMPYLADYLA--GPLGLAAWAVGLVMGVRNFSQQGMFFVG 68
NRP V++ VG ++MP L L G+++ + Q V
Sbjct: 4 NRPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVL 63

Query: 69 GTLADRFGYKPLIVAGCLIRTGGFALLVVAQSLPSVLIASAATGFAGALFNPAVRAYVAA 128
G L+DRFG +P+++ +A++ A L + I G GA AV A
Sbjct: 64 GALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATG--AVAGAYIA 121

Query: 129 DS--GDRKLEAFAMFNIFYQAGILLGPLVGLALLTLDFRMTVLGASAVFAVLTAAQLMAL 186
D GD + F + + G++ GP++G + A+A+ + L
Sbjct: 122 DITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLL 181

Query: 187 PQ-HLADPGTKNESILQGWKAIVRNRSFLGFAAAMTGAYVLSF--QVYLALPMQASLLAP 243
P+ H + L + R AA M +++ QV AL +
Sbjct: 182 PESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRF 241

Query: 244 GHQSVLVAAMFAVSGLIAIAGQLRITRWFAAHWRVSRSLVVGAAILAIAFVPLA 297
+ + A G++ Q IT AA R+L++G ++ LA
Sbjct: 242 HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLA 295


123  MMAR_4832  MMAR_4840  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
MMAR_4832  -2      19       -2.383300  TetR family transcriptional regulator
MMAR_4833  -2      18       -2.469247  cytochrome P450 123B1 Cyp123B1
MMAR_4834  -3      19       -2.300349  TetR family transcriptional regulator
MMAR_4835  -2      19       -3.105408  hypothetical protein
MMAR_4836  -2      21       -3.511505  MmpL family transport protein
MMAR_4837  3       29       -4.653520  hydrolase
MMAR_4838  3       31       -5.419394  hypothetical protein
MMAR_4839  3       28       -5.191563  TetR family transcriptional regulator
MMAR_4840  2       22       -4.170846  putative regulatory protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_4832  HTHTETR  69     7e-17    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 69.3 bits (169), Expect = 7e-17
Identities = 29/189 (15%), Positives = 66/189 (34%), Gaps = 7/189 (3%)

Query: 12 LPAAAELFAERGLNDTKIEDVAATTGIAKATLYYYFAGKEEILAFLLEDVLQHVAD-EVT 70
L A LF+++G++ T + ++A G+ + +Y++F K ++ + + E ++ + E+
Sbjct: 17 LDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELE 76

Query: 71 AIVEADGTAAQRLHTVINAQLRVMAQRPAVCRALI---GELGRAARMPAIADMITTAYFE 127
+ G L ++ L + + M + E
Sbjct: 77 YQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCLE 136

Query: 128 PVET---LLRAGAADGSLVALDKPRAAAIALFGAVTISALTYLITDDALNEELIARTIHD 184
+ L+ L A R AAI + G ++ +L + + + AR
Sbjct: 137 SYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKKEARDYVA 196

Query: 185 VAFIGLRPR 193
+
Sbjct: 197 ILLEMYLLC 205


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_4834  HTHTETR  63     3e-14    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 62.7 bits (152), Expect = 3e-14
Identities = 26/155 (16%), Positives = 48/155 (30%), Gaps = 12/155 (7%)

Query: 18 PRRLRSRTRLLDAATKLLSAGGIEAVTIDAVTKASKVARTTLYRHFSSSTQLLAATFERL 77
+R +LD A +L S G+ + ++ + KA+ V R +Y HF + L + +E
Sbjct: 7 QEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELS 66

Query: 78 LPQVHPPPAT------GSMRDQLIELLSRQATLFQEAPLHVTTLAWVALGPTPDGTQETQ 131
+ G L E+L L + + E
Sbjct: 67 ESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEER--RRLLMEIIFHKCEFVGEMA 124

Query: 132 DRHALRARIIDQYRQPFVALL----QSPEARADLD 162
+ + + L ++ ADL
Sbjct: 125 VVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLM 159


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
MMAR_4836  ACRIFLAVINRP  58     2e-10    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 57.9 bits (140), Expect = 2e-10
Identities = 34/229 (14%), Positives = 85/229 (37%), Gaps = 23/229 (10%)

Query: 212 IASAEEDLVVISIATAGLIAMILLVVYRSVFTALLPLLVIGVSLAVGRGVLSALGESGMP 271
+ + ++V L+ +++ + +++ L+P + + V L +L+A G S
Sbjct: 333 VQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYS--- 389

Query: 272 VSQFTIAFMTVILLGAGTDYSVFLISRYHEQRR-QNVPPDLSVINATATIGRVILASAAT 330
++ T+ M V+ +G D ++ ++ +PP + + + I ++ A
Sbjct: 390 INTLTMFGM-VLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMV 448

Query: 331 VAFAFLAMVFAKLS---VFAALGPACAIAVFVGFAATVTLFPPVLALAAKRGIGEPKADR 387
++ F+ M F S ++ A+ + + L P + A K E ++
Sbjct: 449 LSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENK 508

Query: 388 TRRYWNWIAV--------------AVVRRPVPLLVASLALVLGLAAVAL 422
++ W ++ L+ +V G+ + L
Sbjct: 509 -GGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLLIYALIVAGMVVLFL 556



Score = 39.4 bits (92), Expect = 8e-05
Identities = 27/174 (15%), Positives = 61/174 (35%), Gaps = 11/174 (6%)

Query: 210 DQIASAEEDLVVISIATAGLIAMILLVVYRSVFTALLPLLVIGVSLAVGRGVLSALGESG 269
Q + + + ++ + L +Y S + +LV+ + + GVL A
Sbjct: 863 YQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIV---GVLLAATLFN 919

Query: 270 MPVSQFT-IAFMTVILLGAGTDYSVFLISRYHE-QRRQNVPPDLSVINATATIGRVILAS 327
+ + +T +G ++ ++ + ++ + + A R IL +
Sbjct: 920 QKNDVYFMVGLLT--TIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMT 977

Query: 328 AATVAFAFLAMVFAKLSVFAALGPACAIAVFVG-FAATVT--LFPPVLALAAKR 378
+ L + + + A I V G +AT+ F PV + +R
Sbjct: 978 SLAFILGVLPLAIS-NGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRR 1030



Score = 34.8 bits (80), Expect = 0.002
Identities = 31/183 (16%), Positives = 65/183 (35%), Gaps = 27/183 (14%)

Query: 841 IQRLLSADFHQLAFATLVIVGLILVVL--LRA-----LVAPLYLLGTVVLNYGAALGLGT 893
+Q + L A +++ ++ + L +RA + P+ LLGT + + T
Sbjct: 333 VQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINT 392

Query: 894 LVFQYGLGKEIAWPVPLLAFIILVAVGADYNMLL---ISRLREESAHNIRVGVLRTVANT 950
L + ++ + + D +++ + R+ E + ++++
Sbjct: 393 LT--------------MFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQI 438

Query: 951 GSVITSAGLIFAASM--FGLIAGSIA-IMIQAGFIIGCGLLLDTFVVRTLTVPAIATLLR 1007
+ ++ +A GS I Q I + L V LT ATLL+
Sbjct: 439 QGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLK 498

Query: 1008 EAS 1010
S
Sbjct: 499 PVS 501


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_4839  HTHTETR  45     5e-08    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 45.0 bits (106), Expect = 5e-08
Identities = 17/86 (19%), Positives = 33/86 (38%), Gaps = 2/86 (2%)

Query: 19 AAVLDATRAVATLGGFKAVHFKSVAKQAGVTVGSVYDHFTSKTHLLVTLLAREFVRLDE- 77
+LD + + G + +AK AGVT G++Y HF K+ L + + E
Sbjct: 14 QHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGEL 73

Query: 78 -ERDWSTCAASPIRRVESLTRRLHDE 102
+ P+ + + + +
Sbjct: 74 ELEYQAKFPGDPLSVLREILIHVLES 99


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_4840  HTHTETR  53     1e-10    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 52.7 bits (126), Expect = 1e-10
Identities = 14/97 (14%), Positives = 33/97 (34%), Gaps = 3/97 (3%)

Query: 51 ANTGSLRDRRRAELLSQIQGTAHQLFAERGFAAVTTEDIAAASGISISTYFRYAPTKEDL 110
A + + I A +LF+++G ++ + +IA A+G++ + + K DL
Sbjct: 2 ARKTKQEAQETRQ---HILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDL 58

Query: 111 LIAPLRQTVAEIVAAYGTQPSDQSAADALIALFAETA 147
+ + I + +
Sbjct: 59 FSEIWELSESNIGELELEYQAKFPGDPLSVLREILIH 95


124  MMAR_5199  MMAR_5207  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias   Product
MMAR_5199  -1      12       2.686207  hypothetical protein
MMAR_5200  0       13       2.610755  hypothetical protein
MMAR_5201  -1      13       2.923565  hypothetical protein
MMAR_5202  1       15       6.271961  methanol dehydrogenase transcriptional
MMAR_5203  1       16       5.519139  hypothetical protein
MMAR_5204  2       15       4.294762  hypothetical protein
MMAR_5205  3       16       4.407853  hypothetical protein
MMAR_5206  2       15       4.415386  PadR-like transcriptional regulatory protein
MMAR_5207  1       15       3.877780  PE-PGRS family protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
MMAR_5199  BACINVASINB  39     4e-05    Salmonella/Shigella invasin protein B signature.
		>BACINVASINB#Salmonella/Shigella invasin protein B signature.

Length = 593

Score = 38.6 bits (89), Expect = 4e-05
Identities = 35/140 (25%), Positives = 65/140 (46%), Gaps = 21/140 (15%)

Query: 237 LAGLVVVILVGVAAAANGATAALLGFPLVLLVGLLVAYLYTVLMFA-----PVL-IVLER 290
L L+ ++ V A GA+ AL L ++V + T + F P++ VL+
Sbjct: 321 LGALLTIVSVVAAVFTGGASLALAAVGLAVMVADEIVKAATGVSFIQQALNPIMEHVLK- 379

Query: 291 LPLVDAITRSFALVTGGFWRVLGIRLLTAIVVGLVGGAISAPFGIVGQILLGATASEGST 350
PL++ I ++ G LG+ TA + G + GAI A +V I++ A +G+
Sbjct: 380 -PLMELIGKAITKALEG----LGVDKKTAEMAGSIVGAIVAAIAMVAVIVVVAVVGKGAA 434

Query: 351 GMFLVGMTLSSIGSAISQII 370
+ +G+A+S+++
Sbjct: 435 ---------AKLGNALSKMM 445


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
MMAR_5202  HTHFIS  36     2e-04    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 36.0 bits (83), Expect = 2e-04
Identities = 15/47 (31%), Positives = 23/47 (48%), Gaps = 1/47 (2%)

Query: 117 DEINRTPPKTQAALLEAMEERQVSVEGQAKPLP-DPFIVAATQNPIE 162
DEI P Q LL +++ + + G P+ D IVAAT ++
Sbjct: 238 DEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLK 284


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
MMAR_5205  PERTACTIN  31     0.005    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 31.2 bits (70), Expect = 0.005
Identities = 20/51 (39%), Positives = 22/51 (43%), Gaps = 1/51 (1%)

Query: 242 ARLRPAGAGAPPGWPPQTPPAPVWWPGQPAPQPQIQPQFAPD-PAPSPPQG 291
A+ PA AP P P P PQP PQ P+ PAP PP G
Sbjct: 565 AKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAG 615



Score = 30.1 bits (67), Expect = 0.014
Identities = 17/45 (37%), Positives = 18/45 (40%)

Query: 248 GAGAPPGWPPQTPPAPVWWPGQPAPQPQIQPQFAPDPAPSPPQGP 292
GA APP P P P P P P QP P P P+ P
Sbjct: 564 GAKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAP 608


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_5206  cloacin  29     0.023    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 28.5 bits (63), Expect = 0.023
Identities = 15/37 (40%), Positives = 16/37 (43%)

Query: 55 PLGFGGGFGPGFGPGLGFGFGPGGARGGGRRGGPGRG 91
P G G G G +G G G G G G GG G G
Sbjct: 45 PWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNL 81


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
MMAR_5207  cloacin  39     9e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.9 bits (90), Expect = 9e-05
Identities = 41/108 (37%), Positives = 43/108 (39%), Gaps = 2/108 (1%)

Query: 571 GGGGSGGGGGASGGTGG-TGGAGGLLSAGGAGGVGGAGFNDGGDGGAGGSGGLLGGLVGA 629
GG G G GA +G GG GL GGA G + GG GSG GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 630 GGGAGGNGGAGFGGTPGNGGAGGDAGLLGGPG-GTGGAGGYNTSGPGG 676
G G G G GT GN A G P T GAGG S G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 37.8 bits (87), Expect = 2e-04
Identities = 35/110 (31%), Positives = 47/110 (42%), Gaps = 6/110 (5%)

Query: 657 LGGPGGTGGAGGYNTSGPGGNGGSGGNAGTLFGSGGGGGNGGSGYSGIGGTGGTGGSAGL 716
+ G G G G +++ NGG G GGG + GSG+S G G +G+
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTG------LGVGGGASDGSGWSSENNPWGGGSGSGI 54

Query: 717 VFSDAGAGGFGGFGSTAGGTGGTGGNAVLLGGGGAGGAGGISFTGAGGQG 766
+ G GG +GG GTGGN + A G +S GAGG
Sbjct: 55 HWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 37.4 bits (86), Expect = 2e-04
Identities = 35/115 (30%), Positives = 42/115 (36%)

Query: 513 AGGSGAANTGASGGAGGAAGLLGTGGTGGAGARLAGGAGGTGGAGGAGGWLLGDGGGGGG 572
+GG G + + G TG G GA G G G GGG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 573 GGSGGGGGASGGTGGTGGAGGLLSAGGAGGVGGAGFNDGGDGGAGGSGGLLGGLV 627
G+GGG G SGG GTGG ++A A G G S G L +
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAI 116



Score = 37.4 bits (86), Expect = 3e-04
Identities = 36/109 (33%), Positives = 43/109 (39%), Gaps = 2/109 (1%)

Query: 125 NGGAGGSGAAGSAGGAGGAAGLIGAGGAGGAGGSSTGGAGGTGGAGGAGGWLFGPGGVGG 184
N GA + + G G G + G+G + ++ G G G GG G GG G
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69

Query: 185 AGGSSSSAGGAGGVGGAGGLFGGGGLG--GAGGAGVSASGGAGGAGGAG 231
G S GG A FG L GAGG VS S GA A A
Sbjct: 70 NSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 37.4 bits (86), Expect = 3e-04
Identities = 43/124 (34%), Positives = 49/124 (39%), Gaps = 3/124 (2%)

Query: 152 AGGAGGSSTGGAGGTGGAGGAGGWLFGPGGVGGAGGSSSSAGGAGGVGGAGGLFGGGGLG 211
+GG G GA T G G G GG G SS G G G+ GGG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 212 GAGGAGVSASGGAGGAGGAGGALAGFLGAGG---GDGGAGGSGVNHEGGAGGAGGAGGLI 268
G G SGG G GG A+A + G GAGG V+ GA A A +
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMA 121

Query: 269 AGTG 272
A G
Sbjct: 122 ALKG 125



Score = 37.0 bits (85), Expect = 3e-04
Identities = 27/80 (33%), Positives = 31/80 (38%)

Query: 747 GGGGAGGAGGISFTGAGGQGGAGGTGGQLSGNGGSGGTGGEGDYVGGADSGGAGGTGGNA 806
GG G G G T GG G G + GSG + + GG+ SG G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 807 GLTGDGGNGGNGGSGGTPGS 826
G G GN G G G S
Sbjct: 63 GNGGGNGNSGGGSGTGGNLS 82



Score = 37.0 bits (85), Expect = 4e-04
Identities = 24/71 (33%), Positives = 33/71 (46%)

Query: 614 GGAGGSGGLLGGLVGAGGGAGGNGGAGFGGTPGNGGAGGDAGLLGGPGGTGGAGGYNTSG 673
G SG + GG G G G G + G+G+ G G +G+ G G G GG N +
Sbjct: 12 GAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNS 71

Query: 674 PGGNGGSGGNA 684
GG+G G +
Sbjct: 72 GGGSGTGGNLS 82



Score = 35.8 bits (82), Expect = 7e-04
Identities = 28/76 (36%), Positives = 32/76 (42%)

Query: 761 GAGGQGGAGGTGGQLSGNGGSGGTGGEGDYVGGADSGGAGGTGGNAGLTGDGGNGGNGGS 820
G G GA T G ++G G GG G S GG+ GG G+G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 821 GGTPGSPGGGGTGGAL 836
GG S GG GTGG L
Sbjct: 66 GGNGNSGGGSGTGGNL 81



Score = 35.1 bits (80), Expect = 0.001
Identities = 41/125 (32%), Positives = 45/125 (36%), Gaps = 13/125 (10%)

Query: 630 GGGAGGNGGAGFGGTPGNGGAGGDAGLLGGPGGTGGAGGYNTSGPGGNGGSGGNAGTLFG 689
G G G N GA NGG G G GG G +S GG G+ G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTG-----LGVGGGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 690 SGGGGGNGGSGYSGIGGTGGTGGSAGLVFSDAGAGGFGGFGSTAGGTGGTGGNAVLLGGG 749
G G GG+G SG G G SA FG A T G GG AV + G
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSA--------VAAPVAFGFPALSTPGAGGLAVSISAG 110

Query: 750 GAGGA 754
A
Sbjct: 111 ALSAA 115



Score = 34.3 bits (78), Expect = 0.002
Identities = 33/106 (31%), Positives = 38/106 (35%), Gaps = 4/106 (3%)

Query: 549 GAGGTGGAGGAGGWLLGDGGGGGGGGSGGGGGASGGT----GGTGGAGGLLSAGGAGGVG 604
G G GA G + G G G GG G GG G+G G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 605 GAGFNDGGDGGAGGSGGLLGGLVGAGGGAGGNGGAGFGGTPGNGGA 650
G N GG G GG+ + V G A GAG + GA
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 33.9 bits (77), Expect = 0.003
Identities = 32/104 (30%), Positives = 37/104 (35%), Gaps = 2/104 (1%)

Query: 578 GGGASGGTGGTGGAGGLLSAGGAGGVGGAGFNDGGDGGAGGSGGLLGGLVGAGGGAGGNG 637
GG G G G ++ G G G G +DG G GG G+G GG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGS--GWSSENNPWGGGSGSGIHWGGGS 60

Query: 638 GAGFGGTPGNGGAGGDAGLLGGPGGTGGAGGYNTSGPGGNGGSG 681
G G GG GN G G G A G+ G GG
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.8 bits (74), Expect = 0.007
Identities = 29/112 (25%), Positives = 42/112 (37%), Gaps = 3/112 (2%)

Query: 242 GGDGGAGGSGVNHEGGAGGAGGAGGLIAGTGGNGGAGGTDAYSRGGAGGAGGDAGLLFGS 301
GGDG +G + G G G + G +G ++ GG G+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWG---GG 59

Query: 302 GGAGGTGGTGGTDMSDSGGTGGAGGNAGLLFGSGGAGGAGGAAVALNDVGGA 353
G G GG G + G + A + FG G +A++ GA
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 32.4 bits (73), Expect = 0.008
Identities = 30/86 (34%), Positives = 36/86 (41%), Gaps = 1/86 (1%)

Query: 206 GGGGLGGAGGAGVSASGGAGGAGGAGGALAGFLGAGGGDGGAGGSGVN-HEGGAGGAGGA 264
G G GA + +GG G G GGA G + + GGSG H GG G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 265 GGLIAGTGGNGGAGGTDAYSRGGAGG 290
GG GG+G G A + A G
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 30.8 bits (69), Expect = 0.026
Identities = 35/115 (30%), Positives = 41/115 (35%), Gaps = 6/115 (5%)

Query: 310 TGGTDMSDSGGTGGAGGNAGLLFGSGGAGGAGGAAVALNDVGGAGGAGGNAGLFGNGGVG 369
+GG + G GN G G G GGA +D G G G+
Sbjct: 2 SGGDGRGHNTGAHSTSGNIN--GGPTGLGVGGGA----SDGSGWSSENNPWGGGSGSGIH 55

Query: 370 GVGGVGAGDGGAGGRAGLVIGNGGAGGAGGESFGFGAGVGGAGGNGGNGVLIGNG 424
GG G G+GG G +G G GG A FG G GG V I G
Sbjct: 56 WGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 30.1 bits (67), Expect = 0.047
Identities = 23/73 (31%), Positives = 33/73 (45%), Gaps = 3/73 (4%)

Query: 489 NGTPGAAGSGTDGTPGGWLLGDGGAGGSGAANTGASGGAGGAAGLLGTGGTGGAGARLAG 548
N + +G P G +G G + GSG ++ G G +G+ GG+G G
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG---G 66

Query: 549 GAGGTGGAGGAGG 561
G G +GG G GG
Sbjct: 67 GNGNSGGGSGTGG 79



 
Contact Sachin Pundhir for Bugs/Comments.