PredictBias

Identification of genomic and pathogenicity islands in prokaryotic genomes
A) Input parameters
Genome: sequence.gb
Threshold dinucleotide bias: 2
Threshold codon bias: 4
Threshold %GC bias: 3
E-value (RPSBlast): 0.05
Genome (non-pathogenic):
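
The E-value cutoff above can be illustrated with a minimal sketch. It assumes a hit is kept when its E-value is at or below the cutoff; the report does not state the exact comparison the tool uses. The `Hit` class and `passes_cutoff` function are hypothetical names, the first four hits are top hits copied from this report, and the last entry is an invented example added only to show a rejection.

```python
from dataclasses import dataclass

E_VALUE_CUTOFF = 0.05  # "E-value (RPSBlast)" input parameter from this report


@dataclass(frozen=True)
class Hit:
    locus_tag: str
    signature: str
    e_value: float


def passes_cutoff(hit: Hit, cutoff: float = E_VALUE_CUTOFF) -> bool:
    # Assumption: hits at or below the cutoff are retained.
    return hit.e_value <= cutoff


hits = [
    Hit("Rv0018c", "IGASERPTASE", 5e-04),
    Hit("Rv0286", "PF03544", 0.043),
    Hit("Rv0291", "SUBTILISIN", 4e-46),
    Hit("Rv0294", "BCTLIPOCALIN", 0.044),
    Hit("example", "EXAMPLE", 0.2),  # hypothetical hit, not from this report
]

kept = [h.locus_tag for h in hits if passes_cutoff(h)]
print(kept)
```

Under this assumption, every top hit listed in the report falls below the cutoff, and only the invented example is dropped.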
 
B) Compare a potential GI or PAI in related non-pathogenic species (phylogenetic tree)
Inputs: potential GI or PAI start and end; organism to compare against
 
C) Potential GIs and PAIs in AL123456
S.No  Start    End      Bias  Virulence  Insertion elements  Prediction
1     Rv0018c  Rv0026   Y     Y          Y                   Pathogenicity Island (biased-composition)

LocusTag  DNBias  CDNBias  %GCBias     Product
Rv0018c   2       12       0.734782    Phosphoserine/threonine phosphatase PstP
Rv0019c   1       11       0.848605    Conserved protein with FHA domain, FhaB
Rv0020c   2       12       1.092559    Conserved protein with FHA domain, FhaA
Rv0021c   0       13       0.769523*   Conserved hypothetical protein
Rv0022c   2       14       0.621921    Probable transcriptional regulatory protein
Rv0023    2       14       0.212702    Possible transcriptional regulatory protein
Rv0024    1       18       -0.001866   Putative secreted protein P60-related protein
Rv0025    3       19       -0.745629   Conserved hypothetical protein
Rv0026    2       18       -0.670123   Conserved hypothetical protein
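
As a rough illustration of how the per-gene scores in this table relate to the input thresholds (dinucleotide bias 2, codon bias 4, %GC bias 3), the sketch below counts the genes of this first region that reach each threshold. Two things are assumptions rather than facts from the report: the comparison rule (absolute value at or above the threshold) and the reading of the score columns as DNBias, CDNBias, %GCBias in that order. `THRESHOLDS`, `ROWS` and `reaches` are hypothetical names.

```python
# Thresholds taken from the "Input parameters" section of this report.
THRESHOLDS = {"dn": 2, "cdn": 4, "gc": 3}

# (LocusTag, DNBias, CDNBias, %GCBias) for region 1, Rv0018c to Rv0026.
ROWS = [
    ("Rv0018c", 2, 12, 0.734782),
    ("Rv0019c", 1, 11, 0.848605),
    ("Rv0020c", 2, 12, 1.092559),
    ("Rv0021c", 0, 13, 0.769523),
    ("Rv0022c", 2, 14, 0.621921),
    ("Rv0023", 2, 14, 0.212702),
    ("Rv0024", 1, 18, -0.001866),
    ("Rv0025", 3, 19, -0.745629),
    ("Rv0026", 2, 18, -0.670123),
]


def reaches(row, thresholds=THRESHOLDS):
    """Return, per measure, whether the gene reaches the threshold.

    Assumption: a score counts when its absolute value is >= the threshold.
    """
    _, dn, cdn, gc = row
    return {
        "dn": abs(dn) >= thresholds["dn"],
        "cdn": abs(cdn) >= thresholds["cdn"],
        "gc": abs(gc) >= thresholds["gc"],
    }


counts = {key: sum(reaches(row)[key] for row in ROWS) for key in THRESHOLDS}
print(counts)
```

Under these assumptions the region is carried by codon and dinucleotide bias; none of its genes reaches the %GC threshold.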
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv0018c   IGASERPTASE   36     5e-04    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 35.8 bits (82), Expect = 5e-04
Identities = 20/83 (24%), Positives = 25/83 (30%), Gaps = 7/83 (8%)

Query: 426 APRATS--PPGRPAPPTTSETTEPNVTSSPASPSPTTSAPAPTGTTPAIPTSASPAAPAS 483
P+ TS P + T EP PT + P T + PA S
Sbjct: 1122 VPKVTSQVSPKQEQSETVQPQAEPA-----RENDPTVNIKEPQSQTNTTADTEQPAKETS 1176

Query: 484 PPTPWPVTSSPTMAALPPPPPQP 506
PVT S T+ P
Sbjct: 1177 SNVEQPVTESTTVNTGNSVVENP 1199
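
The summary line of each alignment (Identities, Positives, Gaps) can be read programmatically. The sketch below parses such a line and checks the printed percentages against the fractions. In the lines checked from this report, the percentages match integer truncation rather than rounding (for example 29/195 is shown as 14%); that is an observation from this output, not a documented guarantee. `parse_summary` and `truncated_percent` are hypothetical helper names.

```python
import re

# Matches e.g. "Identities = 20/83 (24%)"
SUMMARY = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_summary(line: str) -> dict:
    """Map each statistic name to (count, alignment_length, printed_percent)."""
    return {
        name: (int(num), int(den), int(pct))
        for name, num, den, pct in SUMMARY.findall(line)
    }


def truncated_percent(num: int, den: int) -> int:
    # Integer division drops the fractional part instead of rounding.
    return (100 * num) // den


line = "Identities = 20/83 (24%), Positives = 25/83 (30%), Gaps = 7/83 (8%)"
stats = parse_summary(line)
for name, (num, den, pct) in stats.items():
    print(name, f"{num}/{den}", pct, truncated_percent(num, den) == pct)
```

Alignments without gaps omit the Gaps field; the parser then simply returns two entries.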


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv0019c   IGASERPTASE   28     0.016    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 28.1 bits (62), Expect = 0.016
Identities = 14/55 (25%), Positives = 21/55 (38%)

Query: 88 ADDSTLVLTDDYASTRHARLSMRGSEWYVEDLGSTNGTYLDRAKVTTAVRVPIGT 142
+D T +DY R + + S GTY D+ K VR+ G+
Sbjct: 154 TEDQTQKRREDYYMPRLDKFVTEVAPIEASTASSDAGTYNDQNKYPAFVRLGSGS 208


S.No  Start    End      Bias  Virulence  Insertion elements  Prediction
2     Rv0274   Rv0305c  Y     Y          N                   Pathogenicity Island (biased-composition)

LocusTag  DNBias  CDNBias  %GCBias     Product
Rv0274    5       17       5.346379    Conserved protein
Rv0275c   10      22       8.112180    Possible transcriptional regulatory protein
Rv0276    11      19       7.740593    Conserved hypothetical protein
Rv0277c   11      21       7.750325    Possible toxin VapC25. Contains PIN domain.
Rv0278c   6       15       6.101805    PE-PGRS family protein PE_PGRS3
Rv0279c   0       10       2.922633    PE-PGRS family protein PE_PGRS4
Rv0280    -4      9        0.904985    PPE family protein PPE3
Rv0281    -3      10       0.662373    Possible S-adenosylmethionine-dependent
Rv0282    -3      11       0.603737    ESX conserved component EccA3. ESX-3 type VII
Rv0283    -1      13       0.856522    ESX conserved component EccB3. ESX-3 type VII
Rv0284    -1      13       0.509853    ESX conserved component EccC3. ESX-3 type VII
Rv0285    4       20       1.591787    PE family protein PE5
Rv0286    3       16       1.920725    PPE family protein PPE4
Rv0287    2       15       2.077828    ESAT-6 like protein EsxG (conserved protein
Rv0288    1       13       1.400284    Low molecular weight protein antigen 7 EsxH (10
Rv0289    -1      13       1.443377    ESX-3 secretion-associated protein EspG3
Rv0290    1       13       1.421631    ESX conserved component EccD3. ESX-3 type VII
Rv0291    0       11       0.693454    Probable membrane-anchored mycosin MycP3 (serine
Rv0292    2       14       2.650896    ESX conserved component EccE3. ESX-3 type VII
Rv0293c   2       13       2.432154    Conserved protein
Rv0294    3       14       3.413526    Probable trans-aconitate methyltransferase Tam
Rv0295c   4       16       3.686454    Conserved protein
Rv0296c   5       15       3.664078    Probable sulfatase
Rv0297    4       18       4.894383    PE-PGRS family protein PE_PGRS5
Rv0298    1       17       -0.333013   Hypothetical protein
Rv0299    3       20       -3.589283   Hypothetical protein
Rv0300    4       18       -3.735906   Possible antitoxin VapB2
Rv0301    4       17       -3.536067   Possible toxin VapC2
Rv0302    4       18       -3.442211   Probable transcriptional regulatory protein
Rv0303    4       17       -3.665699   Probable dehydrogenase/reductase
Rv0304c   4       18       -4.016120   PPE family protein PPE5
Rv0305c   2       15       -2.984917   PPE family protein PPE6
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv0275c   HTHTETR       44     1e-07    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 43.8 bits (103), Expect = 1e-07
Identities = 29/195 (14%), Positives = 60/195 (30%), Gaps = 13/195 (6%)

Query: 14 AERLATRRRQSLSAGLDLLGSDQHDIAELTIRTICRRAGLSVRYFYESFTDKDEFVGRVF 73
+ R+ L L L Q ++ ++ I + AG++ Y F DK + ++
Sbjct: 6 KQEAQETRQHILDVALRLF--SQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIW 63

Query: 74 DWVVAELVATTQAAVTAVPA--REQTRAGMANIVRTITADARVGRLL-FSTQLANAVITR 130
+ + + P R + +++ + + R L+ V
Sbjct: 64 ELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEM 123

Query: 131 -KRAESSALFAMLSGQHAVDTLHA-------PANDHVKAVAHFAVGGVGQTISAWLAGDV 182
++ + S TL PA+ + A G + + WL
Sbjct: 124 AVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQ 183

Query: 183 RLDPDQLVDQLAALL 197
D + A+L
Sbjct: 184 SFDLKKEARDYVAIL 198


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv0278c   cloacin       44     2e-06    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 44.3 bits (104), Expect = 2e-06
Identities = 42/115 (36%), Positives = 53/115 (46%), Gaps = 3/115 (2%)

Query: 117 NGANGAPGTGANGGDGGWLIGNGGAGGSGAAGVNGGAGGNGGAGGNGGAGGLIGNGGAGG 176
N + NGG G +G G + GSG + N GG G+G + G G GNGG G
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69

Query: 177 AGGVASSGIGGSGGAGGNAMLFG--AGGAGGAGGGVVALTGGAGGAGGAGGNAGL 229
G SG GG+ A + FG A GAGG V+++ GA A A A L
Sbjct: 70 NSG-GGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMAAL 123



Score = 38.2 bits (88), Expect = 2e-04
Identities = 35/114 (30%), Positives = 41/114 (35%), Gaps = 6/114 (5%)

Query: 143 GSGAAGVNGGAGGNGGAGGNGGAGGLIGNGGAGGAGGVASSGIGGSGGAGGNAMLFGAGG 202
G G N GA G NGG GL GGA G +S GG+G G G
Sbjct: 3 GGDGRGHNTGAHSTSG-NINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 203 AGGAGGGVVALTGGAGGAGGAGGNAGLLFG-----AAGVGGAGGFTNGSALGGA 251
G GG + G G + A + FG G GG + AL A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 36.2 bits (83), Expect = 7e-04
Identities = 38/104 (36%), Positives = 43/104 (41%), Gaps = 9/104 (8%)

Query: 722 GTGGAGTNFGAGGNGGN--GGLFGAGGTGGAAGSGGSGITTGGGGHGGNAGLLSLGASGG 779
G G G N GA GN GG G G GGA+ G G G +G+ G SG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 780 AGGSGGASSLAGGAGGTGGNG-----ALLFGFRGAGGAGGHGGA 818
G G +S GG GTGGN + FGF G G A
Sbjct: 63 GNGGGNGNS--GGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 35.8 bits (82), Expect = 7e-04
Identities = 31/110 (28%), Positives = 41/110 (37%), Gaps = 3/110 (2%)

Query: 687 AGGNGGLFANGGAGGAGGFNAAGGNGGNGGLFGTGGTGGAGTNFGAGGNGGNGGLFGAGG 746
+GG+G G +G N G GG G + N GG+G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 747 TGGAAGSGGSGITTGGGGHGGNAGL---LSLGASGGAGGSGGASSLAGGA 793
G G+G SG +G GG+ A G G A S++ GA
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 34.7 bits (79), Expect = 0.002
Identities = 34/111 (30%), Positives = 43/111 (38%), Gaps = 5/111 (4%)

Query: 248 LGGAGGAGGAGGLFATGGV--GGSGGAGSSGGAGGAGGAGGL---FGAGGTGGHGGFADS 302
+ G G G G +T G GG G G GGA G +G G G S
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 303 SFGGVGGAGGAGGLFGAGGEGGSGGHSLVAGGDGGAGGNAGMLALGAAGGA 353
G GG G +GG G GG + + G + AG LA+ + GA
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 34.3 bits (78), Expect = 0.002
Identities = 21/71 (29%), Positives = 25/71 (35%)

Query: 671 TGGHGAAGGVPAGVGGAGGNGGLFANGGAGGAGGFNAAGGNGGNGGLFGTGGTGGAGTNF 730
TG H +G + G G G GG G G G G+G G G
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGN 70

Query: 731 GAGGNGGNGGL 741
GG+G G L
Sbjct: 71 SGGGSGTGGNL 81



Score = 34.3 bits (78), Expect = 0.002
Identities = 31/87 (35%), Positives = 36/87 (41%), Gaps = 3/87 (3%)

Query: 368 IGGAGGAGGNAGLLFGSGG-SGGAGGFGFADGGQGGPGGNAGTVFGSGGAGGNGGVGQGF 426
+ G G G N G SG +GG G G G G G ++ GG+G G G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 427 AGGIGGAGGTPGLIGNGGNGGNGGASA 453
G GG G G G G GGN A A
Sbjct: 61 GHGNGGGNGNSG--GGSGTGGNLSAVA 85



Score = 33.9 bits (77), Expect = 0.004
Identities = 34/110 (30%), Positives = 42/110 (38%), Gaps = 7/110 (6%)

Query: 215 GGAGGAGGAGGNAGLLFGAAGVGGAGGFTNGSALGG-----AGGAGGAGGLFATGGVGGS 269
G GA GN G G+G GG ++GS GG+G G G
Sbjct: 8 GHNTGAHSTSGNIN--GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 270 GGAGSSGGAGGAGGAGGLFGAGGTGGHGGFADSSFGGVGGAGGAGGLFGA 319
GG G+SGG G GG A G + GG+ + AG L A
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 33.9 bits (77), Expect = 0.004
Identities = 37/134 (27%), Positives = 51/134 (38%)

Query: 263 TGGVGGSGGAGSSGGAGGAGGAGGLFGAGGTGGHGGFADSSFGGVGGAGGAGGLFGAGGE 322
+GG G G+ +G G G GG G S GG G+G +G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 323 GGSGGHSLVAGGDGGAGGNAGMLALGAAGGAGGIGGDGGTLTAGGIGGAGGAGGNAGLLF 382
G+GG + +GG G GGN +A A G + G A I + A ++
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMA 121

Query: 383 GSGGSGGAGGFGFA 396
G G +G A
Sbjct: 122 ALKGPFKFGLWGVA 135



Score = 32.0 bits (72), Expect = 0.012
Identities = 37/113 (32%), Positives = 41/113 (36%), Gaps = 24/113 (21%)

Query: 357 GGDGGTLTAGGIGGAGGAGGNAGLLFGSGGSGGAGGFGFADGGQGGPGGNAGTVFGSGGA 416
GGDG G GA GN +GG G G G GSG +
Sbjct: 3 GGDG----RGHNTGAHSTSGN------------------INGGPTGLGVGGGASDGSGWS 40

Query: 417 GGNGGVGQGFAGGIGGAGGTPGLIGNGGNGGNGGASAVTGGNGGIGGTGVLIG 469
N G G GI GG+ GNGG GN G + TGGN V G
Sbjct: 41 SENNPWGGGSGSGIHWGGGSGH--GNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 31.6 bits (71), Expect = 0.017
Identities = 33/103 (32%), Positives = 38/103 (36%), Gaps = 2/103 (1%)

Query: 751 AGSGGSGITTGGGGHGGN--AGLLSLGASGGAGGSGGASSLAGGAGGTGGNGALLFGFRG 808
+G G G TG GN G LG GGA G SS GG G+G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 809 AGGAGGHGGAALTSIQQGGAGGAGGNGGLLFGSAGAGGAGGSG 851
G GG+G + S G F + GAGG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 31.6 bits (71), Expect = 0.019
Identities = 36/116 (31%), Positives = 43/116 (37%), Gaps = 8/116 (6%)

Query: 235 GVGGAGGFTNGSALGGAGGAGGAGGLFATGGVGGSGGAGSSGGAGGAGGAGGLFGAGGTG 294
G T+G+ GG G G GG A+ G G S GG G+G G G G
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGG--ASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 295 GHGGFADSSFGGVGGAGGAGGLFGAGGEGGSGGHSLVAGGDGGAGGNAGMLALGAA 350
G G GG+G G L G +L G GG + AL AA
Sbjct: 66 GGNG------NSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 30.8 bits (69), Expect = 0.027
Identities = 35/111 (31%), Positives = 43/111 (38%), Gaps = 4/111 (3%)

Query: 290 AGGTG-GHGGFADSSFGGVGGAGGAGGLFGAGGEGGSGGHSLVAGGDGGAGGNAGMLALG 348
+GG G GH A S+ G + G G+ G G GSG S GG+G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGV-GGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 349 AAGGAGGIGGDGGTLTAGGIGGAGGAGGNAGLLFGSGGSGGAGGFGFADGG 399
G GG G GG GG A A G F + + GAGG +
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFG--FPALSTPGAGGLAVSISA 109



Score = 30.8 bits (69), Expect = 0.030
Identities = 33/104 (31%), Positives = 42/104 (40%), Gaps = 5/104 (4%)

Query: 575 GNGGNGGHGATNTAATATGGAGGAGGILFGTGGNGGTGGIATGAGGIGGAGGAGGVSLLI 634
G+G GA +T+ GG G G G G + G+G + GG G+G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGV---GGGASDGSGWSSENNPW-GGGSGSGIHWGGG 59

Query: 635 GSGGTGGNGGNSIGVAGIGGAGGRGGDAGLLFGAAGTGGHGAAG 678
G GG GNS G +G GG A + FG GA G
Sbjct: 60 SGHGNGGGNGNSGGGSGTGG-NLSAVAAPVAFGFPALSTPGAGG 102



Score = 30.8 bits (69), Expect = 0.030
Identities = 28/93 (30%), Positives = 35/93 (37%), Gaps = 2/93 (2%)

Query: 338 AGGNAGMLALGAAGGAGGIGGDGGTLTAGGIGGAGGAGGNAGLLFGSGGSGGAGGFGFAD 397
+GG+ GA +G I G L GG G + +G G G G +
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 398 GGQGGPGGNAGTVFGSGGAGGNGGVGQGFAGGI 430
G GG GN+G GSG G V A G
Sbjct: 62 HGNGGGNGNSGG--GSGTGGNLSAVAAPVAFGF 92



Score = 30.5 bits (68), Expect = 0.039
Identities = 29/95 (30%), Positives = 38/95 (40%), Gaps = 9/95 (9%)

Query: 779 GAGGSGGASSLAGGAGGTGGNGALLFGFRGAGGAGGHGGAALTSIQQGGAGGAGGNGGLL 838
G G + GA S +G G G G GG + S + GG G+G
Sbjct: 6 GRGHNTGAHSTSGNINGGPT---------GLGVGGGASDGSGWSSENNPWGGGSGSGIHW 56

Query: 839 FGSAGAGGAGGSGANALGAGTGGTGGDGGHAGVFG 873
G +G G GG+G + G+GTGG FG
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv0279c   cloacin       38     1e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.2 bits (88), Expect = 1e-04
Identities = 31/102 (30%), Positives = 41/102 (40%), Gaps = 4/102 (3%)

Query: 117 NGTNGAPGTGANGGDGGWLIGNGGAGGSGAAGVN----GGAGGNGGAGGLIGNGGAGGAG 172
N + NGG G +G G + GSG + N GG+G GG G+G GG G
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69

Query: 173 GRASTGTGGAGGAGGAAGMLFGAAGVGGPGGFAAAFGATGGA 214
G + AA + FG + PG A + GA
Sbjct: 70 NSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 36.6 bits (84), Expect = 4e-04
Identities = 32/107 (29%), Positives = 45/107 (42%), Gaps = 4/107 (3%)

Query: 628 GTGGLFANGGAGGAGGFGANGGTGGNGLLFGAGGTGGAGTLGADGGAGGHGGLFGAGGTG 687
G G N GA G G TG + G+G + GG G G +G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 688 GAGGSSGGTFGGNGGSGGNAGLLALGASG----GAGGSGGSALNVGG 730
G GG +G + GG+G G + + A A G G+GG A+++
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109



Score = 35.8 bits (82), Expect = 6e-04
Identities = 30/82 (36%), Positives = 38/82 (46%)

Query: 283 AGGTGGSGGTSIINVGGNGGAGGDAGMLSLGAAGGAGGSGGSNPDGGGGAGGIGGDGGTL 342
+GG G T + GN G + GA+ G+G S +NP GGG GI GG+
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 343 FGSGGAGGVCGLGFDAGGAGGA 364
G+GG G G G GG A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSA 83



Score = 34.7 bits (79), Expect = 0.002
Identities = 26/78 (33%), Positives = 33/78 (42%)

Query: 693 SGGTFGGNGGSGGNAGLLALGASGGAGGSGGSALNVGGTGGVGGNGGSGGSLFGFGGAGG 752
SGG G+ + G G G GG++ G + GG GS +GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 753 TGGSSGIGSSGGTGGDGG 770
G G G+SGG G GG
Sbjct: 62 HGNGGGNGNSGGGSGTGG 79



Score = 33.9 bits (77), Expect = 0.003
Identities = 29/110 (26%), Positives = 40/110 (36%), Gaps = 2/110 (1%)

Query: 211 TGGAGGAGGNGGLFADGGVGGAGGATDAGTGGAGGSG--GNGGLFGAGGTGGPGGFGIFG 268
+GG G G G + G G G + GSG +G G G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 269 GGAGGDGGSGGLFGAGGTGGSGGTSIINVGGNGGAGGDAGMLSLGAAGGA 318
G GG G+ G G S + + G + AG L++ + GA
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 33.5 bits (76), Expect = 0.003
Identities = 32/95 (33%), Positives = 37/95 (38%), Gaps = 9/95 (9%)

Query: 682 GAGGTGGAGGSSGGTFGGNGGSGGNAGLLALGASGGAGGSGGSALNVGGTGGVGGNGGSG 741
G G GA +SG GG G G G AS G+G S + GG+G GG
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGG-----ASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 742 GSLFGFGGAGGTGGSSGIGSSGGTGGDGGTAGVFG 776
G G GG G S G +GG FG
Sbjct: 61 GH----GNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 32.8 bits (74), Expect = 0.006
Identities = 33/105 (31%), Positives = 41/105 (39%), Gaps = 1/105 (0%)

Query: 659 AGGTGGAGTLGADGGAGGHGGLFGAGGTGGAGGSSGGTFGGNGGSGGNAGLLALGASGGA 718
+GG G GA +G G G GG G N GG +G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 719 GGSGGSALNVGGTGGVGGNGGSGGSLFGFG-GAGGTGGSSGIGSS 762
G+GG N GG G GGN + + FG A T G+ G+ S
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVS 106



Score = 32.8 bits (74), Expect = 0.007
Identities = 31/93 (33%), Positives = 37/93 (39%), Gaps = 2/93 (2%)

Query: 377 AGGAGGGSFAGAGGTGGA--GGAPGLVGNAGNGGNGGASANGAGAAGGAGGSGVLIGNGG 434
+GG G G GA T G GG GL G G S+ GG+G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 435 NGGSGGTGAPAGTAGAGGLGGQLLGRDGFNAPA 467
+G GG G G +G GG + F PA
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPA 94



Score = 32.4 bits (73), Expect = 0.008
Identities = 32/120 (26%), Positives = 38/120 (31%), Gaps = 12/120 (10%)

Query: 219 GNGGLFADGGVGGAGGATDAGTGGAGGSGGNGGLFGAGGTGGPGGFGIFGGGAGGDGGSG 278
G G + G G + G G G GG G P G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWG--------GGSGSGI 54

Query: 279 GLFGAGGTGGSGGTSIINVGGNGGAGGDAGMLSLGAAGGAGGSGGSNPDGGGGAGGIGGD 338
G G G GG N GG+G + ++ A G S P GG A I
Sbjct: 55 HWGGGSGHGNGGG----NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 32.4 bits (73), Expect = 0.009
Identities = 32/101 (31%), Positives = 38/101 (37%)

Query: 575 SGGTGGSGGFGLDTGGAGGRGGDAGLFLGAAGTGGQAALSQNFIGAGGTAGAGGTGGLFA 634
SGG G G + GG GL +G + G S+N GG+ GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 635 NGGAGGAGGFGANGGTGGNGLLFGAGGTGGAGTLGADGGAG 675
+G GG G G GTGGN A G L G G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 32.0 bits (72), Expect = 0.010
Identities = 37/132 (28%), Positives = 46/132 (34%), Gaps = 16/132 (12%)

Query: 242 GAGGSGGNGGLFGAGGT--GGPGGFGIFGGGAGGDGGSGGLFGAGGTGGSGGTSIINVGG 299
G G G N G G GGP G G+ GG + G G S GG GSG I+ GG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSG----IHWGG 58

Query: 300 NGGAGGDAGMLSLGAAGGAGGSGGSNPDGGGGAGGIGGDGGTLFGSGGAGGVCGLGFDAG 359
G G GG G+ G GG + F + G GL
Sbjct: 59 GSGHGN----------GGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSIS 108

Query: 360 GAGGAGGKAGLL 371
+ A ++
Sbjct: 109 AGALSAAIADIM 120



Score = 32.0 bits (72), Expect = 0.011
Identities = 39/135 (28%), Positives = 47/135 (34%), Gaps = 10/135 (7%)

Query: 139 GGAGGSGAAGVNGGAGGNGGAGGLIGNGGAGGAGGRASTGTGGAGGAGGAAGMLFGAAGV 198
G G + NGG GL GGA G +S GG+G G+
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 199 GGPGGFAAAFGATGGAGGAGGNGGLFADGGVGGAGGATDAGTGGAGGSGGNGGLFGA--- 255
G G G +GG G GGN A G + G GG S G L A
Sbjct: 64 NGGGN-----GNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118

Query: 256 --GGTGGPGGFGIFG 268
GP FG++G
Sbjct: 119 IMAALKGPFKFGLWG 133



Score = 32.0 bits (72), Expect = 0.013
Identities = 31/114 (27%), Positives = 39/114 (34%), Gaps = 5/114 (4%)

Query: 171 AGGRASTGTGGAGGAGGAAGMLFGAAGVGGPGGFAAAFGATGGAGGAGGNGGLFADGGVG 230
+GG GA G GVGG + + + G G G+ GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 231 GAGGATDAGTGGAGGSGGNGGLFGAGGTGGPGGFGIFGGGAGGDGGSGGLFGAG 284
G + +GG G+GGN A P FG G GG AG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAA-----PVAFGFPALSTPGAGGLAVSISAG 110



Score = 31.2 bits (70), Expect = 0.017
Identities = 35/110 (31%), Positives = 44/110 (40%), Gaps = 14/110 (12%)

Query: 619 GAGGTAGAGGTGGLFANGGAGGAGGFGANGGTG----------GNGLLFGAGGTGGAGTL 668
G G GA T G G G G GA+ G+G G+G GG G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 669 GADGGAGGHGGLFGAGGTGGAGGSSGGTFGGNGGSGGNAGLLALGASGGA 718
G +G +GG G G + ++ FG S AG LA+ S GA
Sbjct: 66 GGNGNSGGGSG----TGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 31.2 bits (70), Expect = 0.018
Identities = 25/79 (31%), Positives = 33/79 (41%), Gaps = 3/79 (3%)

Query: 353 GLGFDAGGAGGAGGKAGLLIGAGGAGGAGGGSFAGAGGT---GGAGGAPGLVGNAGNGGN 409
G G + G +G G G G GGA GS + GG+G G +G+G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 410 GGASANGAGAAGGAGGSGV 428
GG +G G+ G S V
Sbjct: 66 GGNGNSGGGSGTGGNLSAV 84



Score = 30.5 bits (68), Expect = 0.036
Identities = 34/121 (28%), Positives = 41/121 (33%), Gaps = 5/121 (4%)

Query: 640 GAGGFGANGGTGGNGLLFGAGGTGGAGTLGADGGAG-GHGGLFGAGGTGGAGGSSGGTFG 698
G G G N G G TG GA G+G GG+G GG+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 699 GNGGSGGNAGLLALGASGGAGGSGGSALNVGGTGGVGGNGGSGGSLFGFGGAGGTGGSSG 758
GNGG GN+G G SG G A V G+GG + +
Sbjct: 63 GNGGGNGNSG----GGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118

Query: 759 I 759
I
Sbjct: 119 I 119



Score = 30.1 bits (67), Expect = 0.041
Identities = 33/113 (29%), Positives = 40/113 (35%), Gaps = 2/113 (1%)

Query: 531 GGDGGSGGAGGILSGIGGTGGSGGIGTTGQGGTGGTGGAALLIGSGGTG-GSGGFGLDTG 589
GGDG G + GG G+G G G + GG+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 590 GAGGRGGDAGLFLGAAGTGGQAALSQNF-IGAGGTAGAGGTGGLFANGGAGGA 641
G GG G++G G G A F A T GAGG + G A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv0284    TYPE3IMPPROT  34     0.003    Type III secretion system inner membrane P protein ...
		>TYPE3IMPPROT#Type III secretion system inner membrane P protein family signature.

Length = 224

Score = 33.6 bits (77), Expect = 0.003
Identities = 18/64 (28%), Positives = 33/64 (51%), Gaps = 13/64 (20%)

Query: 27 PPELPRVIPP---SLLRRALPYLIGILI------VGMIVA--LVATGMRVISPQTLFFPF 75
P + ++P S ++ A + IG + V ++V+ L+A GM ++SP T+ P
Sbjct: 140 KPSIFALLPAYALSEIKSA--FKIGFYLYLPFVVVDLVVSSVLLALGMMMMSPVTISTPI 197

Query: 76 VLLL 79
L+L
Sbjct: 198 KLVL 201


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv0286    PF03544       29     0.043    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 28.8 bits (64), Expect = 0.043
Identities = 19/126 (15%), Positives = 27/126 (21%), Gaps = 13/126 (10%)

Query: 295 IVLAPAVIPPASTPLAAAAVAAGSVWPAVSMAVTGAGTAGAATPAAGAAPSAGAAPAPAA 354
++APA + P P A V P A P P P
Sbjct: 53 TMVAPADLEP---PQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVK 109

Query: 355 PATASFAYAVGGSGDWGPSLGPTVGGRGGIKAPAATVPAAAAAAATRGQ----------S 404
T R A + A+ + +
Sbjct: 110 KVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRNQPQYPA 169

Query: 405 RARRRR 410
RA+ R
Sbjct: 170 RAQALR 175


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv0291    SUBTILISIN    156    4e-46    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 156 bits (397), Expect = 4e-46
Identities = 83/334 (24%), Positives = 123/334 (36%), Gaps = 70/334 (20%)

Query: 74 MLNLPAAWQFSRGEGQLVAIIDTGVQPG-PRLP-NVDAGGDFVESTDG----LTDCDGHG 127
M+ PA W +RG G VA++DTG P L + G +F + +G D +GHG
Sbjct: 28 MIQAPAVWNQTRGRGVKVAVLDTGCDADHPDLKARIIGGRNFTDDDEGDPEIFKDYNGHG 87

Query: 128 TLVAGIVAGQPGNDGFSGVAPAARLLSIRAMSTKFSPRTSGGDPQLAQATLDVAVLAGAI 187
T VAG +A +G GVAP A LL I+ ++ + S + + I
Sbjct: 88 THVAGTIAATENENGVVGVAPEADLLIIKVLNKQGSGQYDW--------------IIQGI 133

Query: 188 VHAADLGAKVINVSTITCLPADRMVDQAALGAAIRYAAVDKDAVIVAAAGNTGASGSVSA 247
+A + +I++S D L A++ AV +++ AAGN G
Sbjct: 134 YYAIEQKVDIISMS------LGGPEDVPELHEAVKK-AVASQILVMCAAGNEGDGDD-RT 185

Query: 248 SCDSNPLTDLSRPDDPRNWAGVTSVSIPSWWQPYVLSVASLTSAGQPSKFSMPGPWVGIA 307
P V+SV ++ S+FS V +
Sbjct: 186 DELGYP-----------------------GCYNEVISVGAINFDRHASEFSNSNNEVDLV 222

Query: 308 APGENIASVSNSGDGALANGLPDAHQKLVALSGTSYAAGYVSGVAALVRSRYPG-----L 362
APGE+I S G K SGTS A +V+G AL++ L
Sbjct: 223 APGEDILSTVPGG-------------KYATFSGTSMATPHVAGALALIKQLANASFERDL 269

Query: 363 NATEVVRRLTATAHRGARESSNIVGAGNLDAVAA 396
E+ +L S + G G L A
Sbjct: 270 TEPELYAQLIKRTIPLGN-SPKMEGNGLLYLTAV 302


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv0294    BCTLIPOCALIN  27     0.044    Bacterial lipocalin signature.
		>BCTLIPOCALIN#Bacterial lipocalin signature.

Length = 171

Score = 27.3 bits (60), Expect = 0.044
Identities = 16/59 (27%), Positives = 21/59 (35%), Gaps = 3/59 (5%)

Query: 17 PFYELVSRVGLERARRVVDLGCGPGHLTRYLARRWPGAVIEALDSSPEMVAAAAERGID 75
PFY L+R GP +L R P LD + + + ERG D
Sbjct: 106 PFYGSYVVFELDRENYSYAFVSGPNTEYLWLLSRTPTVERGILD---KFIEMSKERGFD 161


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv0297    cloacin       37     1e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 37.4 bits (86), Expect = 1e-04
Identities = 36/97 (37%), Positives = 39/97 (40%), Gaps = 2/97 (2%)

Query: 149 GQAGGAGGAAGFFGNGGNGGDGGAGANGGAGGTAGWFFGFGGNGGAGGIGVAGINGGLGG 208
G G A NGG G G GGA +GW GG G G+ GG G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIH--WGGGSG 61

Query: 209 AGGDGGNAGFFGNGGNGGMGGAGAAGVNAVNPGLATP 245
G GGN G G GG A AA V P L+TP
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTP 98



Score = 35.5 bits (81), Expect = 6e-04
Identities = 26/77 (33%), Positives = 31/77 (40%)

Query: 282 GAGGDGGNASTSGGIGIAQTGGAGGAGGAGGDGAPGGNGGNGGSVEHTGATGSSASGGNG 341
G G + G STSG I TG G G + G G N GG G + GNG
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 342 ATGGNGGVGAPGGAGGN 358
GN G G+ G +
Sbjct: 66 GGNGNSGGGSGTGGNLS 82



Score = 35.1 bits (80), Expect = 8e-04
Identities = 28/90 (31%), Positives = 33/90 (36%)

Query: 488 GGDGGGRGIFGQFGAGGAGGAGGVGGAGGAGGTGGGGGNGGAIFNAGTPGAAGTGGDGGV 547
GGDG G +G G G GG G G + + G+ GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 548 GGTGAAGGKGGAGGSGGVNGATGADGAKGL 577
G G G GG G+GG A A A G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGF 92



Score = 33.1 bits (75), Expect = 0.003
Identities = 32/115 (27%), Positives = 41/115 (35%), Gaps = 4/115 (3%)

Query: 263 GTAGGGADGANGSAIGQAGGAGGDGGNASTSGGIGIAQTGGAGGAGGAGGDGAPGGNGGN 322
G G GA+ ++ GG G G S G G + G G G GG+G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 323 GGSVEHTGATGSSASGGNGATGGNGGVGAPG----GAGGNGGHVSGGSVNTAGAG 373
G GS G A G P GAGG +S G+++ A A
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 33.1 bits (75), Expect = 0.003
Identities = 32/101 (31%), Positives = 36/101 (35%), Gaps = 9/101 (8%)

Query: 453 GAGAAGAIGGHGGDGGSVNTPIGGSEAGDGGKGGLG--------GDGGGRGIFGQFGAGG 504
G G G G++N G G G G G G G G GI G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGI-HWGGGSG 61

Query: 505 AGGAGGVGGAGGAGGTGGGGGNGGAIFNAGTPGAAGTGGDG 545
G GG G +GG GTGG A G P + G G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 32.4 bits (73), Expect = 0.005
Identities = 27/83 (32%), Positives = 33/83 (39%)

Query: 377 GNGGTGGAGGPGGHGGSVLSGPVGDSGNGGAGGDGGAGVSATDIAGTGGRGGNGGHGGLW 436
G G G G G++ GP G GGA G G G G + G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 437 IGNGGDGGAGGVGGVGGAGAAGA 459
GG+G +GG G GG +A A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVA 85



Score = 32.4 bits (73), Expect = 0.006
Identities = 33/107 (30%), Positives = 41/107 (38%), Gaps = 6/107 (5%)

Query: 355 AGGNGGHVSGGSVNTAGAGGKGGNGGTGGAGGPGGHGGSVLSGPVGDSGNGGAGGDGGAG 414
+GG+G + G+ +T+G G G G G G G S + P G G GG+G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 415 VSATDIAGTGGRGGNGGHGGLWIGNGGDGGAGGVGGVGGAGAAGAIG 461
G GG GN G G GN A G GA G
Sbjct: 62 ------HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 31.2 bits (70), Expect = 0.012
Identities = 25/77 (32%), Positives = 34/77 (44%)

Query: 321 GNGGSVEHTGATGSSASGGNGATGGNGGVGAPGGAGGNGGHVSGGSVNTAGAGGKGGNGG 380
G G +TGA +S + G TG G GA G+G + + G + +G GG+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 381 TGGAGGPGGHGGSVLSG 397
G G GGS G
Sbjct: 63 GNGGGNGNSGGGSGTGG 79



Score = 30.1 bits (67), Expect = 0.029
Identities = 21/78 (26%), Positives = 29/78 (37%)

Query: 309 GAGGDGAPGGNGGNGGSVEHTGATGSSASGGNGATGGNGGVGAPGGAGGNGGHVSGGSVN 368
G G G G G++ G + +G + GG G+G H GGS +
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 369 TAGAGGKGGNGGTGGAGG 386
G G GG+G G
Sbjct: 63 GNGGGNGNSGGGSGTGGN 80



Score = 30.1 bits (67), Expect = 0.031
Identities = 30/105 (28%), Positives = 38/105 (36%), Gaps = 3/105 (2%)

Query: 206 LGGAGGDGGNAGFFGNGGN--GGMGGAGAAGVNAVNPGLATPVTPAANGGNGLNLVGVPG 263
+ G G G N G GN GG G G G + G ++ P GG+G + G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWG-GGSGSGIHWGGG 59

Query: 264 TAGGGADGANGSAIGQAGGAGGDGGNASTSGGIGIAQTGGAGGAG 308
+ G G S G G A + G T GAGG
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 30.1 bits (67), Expect = 0.033
Identities = 32/98 (32%), Positives = 35/98 (35%), Gaps = 3/98 (3%)

Query: 118 NGANGAPGTGQAGGDG-GLLFGNGGNGGSGAPGQAGGAGGAAGFFGNGGNGGDGGAGANG 176
NG G G DG G N GG G G G G G GN G G
Sbjct: 21 NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGN 80

Query: 177 GAGGTAGWFFGFGG--NGGAGGIGVAGINGGLGGAGGD 212
+ A FGF GAGG+ V+ G L A D
Sbjct: 81 LSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv0302    HTHTETR       84     5e-22    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 83.5 bits (206), Expect = 5e-22
Identities = 43/207 (20%), Positives = 78/207 (37%), Gaps = 11/207 (5%)

Query: 5 AKKKQQQGERSRESILDATERLMATKGYAATSISDIRDACGLAPSSIYWHFGSKEGVLAA 64
A+K +Q+ + +R+ ILD RL + +G ++TS+ +I A G+ +IYWHF K + +
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 65 MMERGAQRFFAAIP-TWDEAHGPVEQRSERQLTELVSLQSQHPDFLRLFYLLSMERSQDP 123
+ E + G L ++ + + RL + + +
Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLES-TVTEERRRLLMEIIFHKCEFV 120

Query: 124 AVAAVVRRVRNTAIARFRDSITHLLPSDIPPG--KADLVVAELTAFAVALSDGVYFAGHL 181
AVV++ + D I L I ADL+ G+
Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180

Query: 182 EPDTTDVERMYRRLRQALEALIPVLLE 208
P + D L++ + +LLE
Sbjct: 181 APQSFD-------LKKEARDYVAILLE 200


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv0303    DHBDHDRGNASE  40     4e-06    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 40.4 bits (94), Expect = 4e-06
Identities = 50/209 (23%), Positives = 76/209 (36%), Gaps = 35/209 (16%)

Query: 6 AVITGASSGLGLQCARALLRRDASWHVVLAVRDPARGRAAMEELGEPNRCS-VLEVDLAS 64
A ITGA+ G+G AR L + A H+ +P + + L R + D+
Sbjct: 11 AFITGAAQGIGEAVARTLASQGA--HIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 65 VRSVRSFVETVRTTPLPPIRALVCNAGLQVVSGI-AFTDDGVEMTFGVNHLGHFALVTGI 123
++ + + PI LV AG+ I + +D+ E TF VN G F +
Sbjct: 69 SAAIDEITARI-EREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 124 LDWLA--RPARIVVVSSGTHDPSKHTGMPDPRYTCAADLAHPPTDQNTPAEGRRRYTTSK 181
++ R IV V S +P+ P Y +SK
Sbjct: 128 SKYMMDRRSGSIVTVGS---NPA-----------------------GVPRTSMAAYASSK 161

Query: 182 LCNVLFTYELDRRLDHGEQGVMVNAFDPG 210
V+FT L L+ E + N PG
Sbjct: 162 AAAVMFTKCLG--LELAEYNIRCNIVSPG 188


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv0305c   cloacin       32     0.009    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 32.4 bits (73), Expect = 0.009
Identities = 25/84 (29%), Positives = 31/84 (36%), Gaps = 8/84 (9%)

Query: 247 GHNIGFANTGSNNVGFGNTGSNNVGIGLTGNGQIGFGSFNSGSHNIGLFNSGSGNVGLFN 306
GHN G +T GN G+G+ G G G + + G SG G
Sbjct: 8 GHNTGAHSTS------GNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG--G 59

Query: 307 SGTGNFGIGNSGTGNFGLGNTGST 330
SG GN G + G G G S
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSA 83


S.No  Start    End      Bias  Virulence  Insertion elements  Prediction
3     Rv0345   Rv0355c  Y     Y          N                   Pathogenicity Island (biased-composition)

LocusTag  DNBias  CDNBias  %GCBias     Product
Rv0345    2       10       -2.913992   Conserved hypothetical protein
Rv0346c   3       11       -2.930682   Possible L-asparagine permease AnsP2
Rv0347    3       12       -1.946527   Probable conserved membrane protein
Rv0348    3       12       -1.607512   Possible transcriptional regulatory protein
Rv0349    4       13       -1.386554   Hypothetical protein
Rv0350    6       21       -3.248704   Probable chaperone protein DnaK (heat shock
Rv0351    7       22       -3.147163   Probable GrpE protein (HSP-70 cofactor)
Rv0352    6       22       -3.066628   Probable chaperone protein DnaJ1
Rv0353    6       22       -3.064487   Probable heat shock protein transcriptional
Rv0354c   6       21       -2.828755   PPE family protein PPE7
Rv0355c   6       20       -2.628713   PPE family protein PPE8
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv0350    SHAPEPROTEIN  130    2e-35    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 130 bits (329), Expect = 2e-35
Identities = 71/368 (19%), Positives = 141/368 (38%), Gaps = 66/368 (17%)

Query: 2 ARAVGIDLGTTNSVVSVLEGGDP-----VVVANSEGSRTTPSIVAFARNGEVLVGQPAKN 56
+ + IDLGT N+++ V G VV + + + S+ A VG AK
Sbjct: 10 SNDLSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGSPKSVAA--------VGHDAKQ 61

Query: 57 QAVTNVD--RTVRSVKRHMGSDWSIEIDGKKYTAPEISARILMKLKRDAEAYLGEDITDA 114
+R +K + +D+ + ++ + ++ ++
Sbjct: 62 MLGRTPGNIAAIRPMKDGVIADF--------FVTEKMLQHFIKQVHSNS---FMRPSPRV 110

Query: 115 VITTPAYFNDAQRQATKDAGQIAGLNVLRIVNEPTAAALAYGLDKGEKEQRILVFDLGGG 174
++ P +R+A +++ Q AG + ++ EP AAA+ GL E +V D+GGG
Sbjct: 111 LVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATG-SMVVDIGGG 169

Query: 175 TFDVSLLEIGEGVVEVRATSGDNHLGGDDWDQRVVDWLVDKFKGTSGIDLTKDKMAMQRL 234
T +V+++ + V S +GGD +D+ +++++ + G
Sbjct: 170 TTEVAVISLNGVV-----YSSSVRIGGDRFDEAIINYVRRNYGSLIG------------- 211

Query: 235 REAAEKAKIELSSS----QSTSINLPYITVDAD--KNPLFLDEQLTRAEFQRITQDL--- 285
AE+ K E+ S+ + I + + + ++ A + +T +
Sbjct: 212 EATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVPRGFTLNSNEILEALQEPLTGIVSAV 271

Query: 286 ---LDRTRKPFQSVIADTGISVSEIDHVVLVGGSTRMPAVTDLVKELTGGKEPNKGVNPD 342
L++ S I++ G+ VL GG + + L+ E T G +P
Sbjct: 272 MVALEQCPPELASDISERGM--------VLTGGGALLRNLDRLLMEET-GIPVVVAEDPL 322

Query: 343 EVVAVGAA 350
VA G
Sbjct: 323 TCVARGGG 330


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv0355c   cloacin  35     0.004    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 35.5 bits (81), Expect = 0.004
Identities = 25/78 (32%), Positives = 31/78 (39%), Gaps = 2/78 (2%)

Query: 267 NTGSNNIGFGNTGDGNRGIGLTGSGLLGFGGLNSGTGNIGLFNSGTGNVGIGNSGTGNWG 326
NTG+++ GN G G+G+ G G G + G SG G G G
Sbjct: 10 NTGAHSTS-GNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGN 68

Query: 327 IGNSGNSYNTGFGNSGDA 344
GNSG TG S A
Sbjct: 69 -GNSGGGSGTGGNLSAVA 85



Score = 32.4 bits (73), Expect = 0.035
Identities = 23/78 (29%), Positives = 33/78 (42%), Gaps = 4/78 (5%)

Query: 301 GTGNIGLFNSGTGNVGIGNSGTGNWGIGNSGNSYNTGFGNSGDANTGFFNSGIANTGVGN 360
G G+ +S +GN+ G +G G G S +G+ + + G SGI G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLG----VGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 361 AGNYNTGSYNPGNSNTGG 378
GN + G S TGG
Sbjct: 62 HGNGGGNGNSGGGSGTGG 79



Score = 32.0 bits (72), Expect = 0.050
Identities = 24/81 (29%), Positives = 31/81 (38%), Gaps = 2/81 (2%)

Query: 3094 GNIGFGNTGNGNIGIGNTGTGNIGFGNTGNGNIGIGLTGDTMTGFGGWNSGTGNIGLFNS 3153
G G G+ + GN G G G G + G G + + GG SG G S
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG--GS 60

Query: 3154 GTGNIGFGNSGTGNWGIGNSG 3174
G GN G + G G G +
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNL 81


4  Rv0371c  Rv0378  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias   Product
Rv0371c   2       14       1.956561  Conserved hypothetical protein
Rv0372c   2       13       1.768614  Conserved hypothetical protein
Rv0373c   2       12       1.774134  Probable carbon monoxyde dehydrogenase (large
Rv0374c   3       15       2.647095  Probable carbon monoxyde dehydrogenase (small
Rv0375c   3       14       3.042250  Probable carbon monoxyde dehydrogenase (medium
Rv0376c   5       13       2.740581  Conserved hypothetical protein
Rv0377    2       12       2.355426  Probable transcriptional regulatory protein
Rv0378    3       13       2.560658  Conserved hypothetical glycine rich protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv0371c   ACRIFLAVINRP  29     0.016    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 28.7 bits (64), Expect = 0.016
Identities = 16/48 (33%), Positives = 24/48 (50%), Gaps = 3/48 (6%)

Query: 33 VLGATLDVARQAGFDQLILTLGGAASAVRAAMALDGTDVVVVEDVERG 80
VL T + G+ LT+ G A+ + +D +VVVE+VER
Sbjct: 375 VLLGTFAILAAFGYSINTLTMFGMVLAI--GLLVDDA-IVVVENVERV 419


5  Rv0399c  Rv0405  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv0399c   -2      15       -4.539874  Possible conserved lipoprotein LpqK
Rv0400c   -1      22       -5.858374  Acyl-CoA dehydrogenase FadE7
Rv0401    -1      25       -6.331013  Probable conserved transmembrane protein
Rv0402c   -1      23       -6.318652  Probable conserved transmembrane transport
Rv0403c   0       24       -5.658286  Probable conserved membrane protein MmpS1
Rv0404    0       23       -5.206176  Fatty-acid-AMP ligase FadD30 (fatty-acid-AMP
Rv0405    -2      14       -3.118354  Probable membrane bound polyketide synthase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv0402c   ACRIFLAVINRP  51     2e-08    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 51.0 bits (122), Expect = 2e-08
Identities = 48/281 (17%), Positives = 102/281 (36%), Gaps = 44/281 (15%)

Query: 146 QGGSQANESVAAVQRIVDSVPP--PPGIKAYVTGPGPLGADRVVYGDRSLHTIT---GIS 200
G+ A ++ A++ + + P P G+K D + S+H + +
Sbjct: 294 ATGANALDTAKAIKAKLAELQPFFPQGMKVLYP------YDTTPFVQLSIHEVVKTLFEA 347

Query: 201 IAVIAIMLFIAYRSLSAALIMLLTVGLELLAVRGIISTFAVNDLMGLSTFTVNVL----V 256
I ++ +++++ +++ A LI + V + LL TFA+ G +++N L +
Sbjct: 348 IMLVFLVMYLFLQNMRATLIPTIAVPVVLLG------TFAILAAFG---YSINTLTMFGM 398

Query: 257 ALTIAASTDYIIFLVGRYQEARATG--QNREAAYYTMFGGTAHVVLASGLTVAGA---MY 311
L I D I +V + +EA +M ++ + ++ M
Sbjct: 399 VLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSM-SQIQGALVGIAMVLSAVFIPMA 457

Query: 312 CLGFTRLPYFNTLASPCAIGLVTVMLASLTLAPAIIAVASR-------------FGLFDP 358
G + + + + +L +L L PA+ A + FG F+
Sbjct: 458 FFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNT 517

Query: 359 KRATTKRRWRRIGTVVVRWPGPVLAATLLIALIGLLALPKY 399
+ + ++ G L LI + G++ L
Sbjct: 518 TFDHSVNHYTNSVGKILGSTGRYLLIYALI-VAGMVVLFLR 557



Score = 37.9 bits (88), Expect = 2e-04
Identities = 31/167 (18%), Positives = 63/167 (37%), Gaps = 11/167 (6%)

Query: 758 EGTLYDVMIAVVASLCLIFIIMLGITRSVVASAVIVGTVALSLGSAFGLSVLIWQHILHM 817
+ + A++ L+F++M +++ A+ + V + L F + I +
Sbjct: 338 HEVVKTLFEAIM----LVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTL 393

Query: 818 PLHWLVLPMAIIVMLAVGSDYNLLLIARFQEEIGAGLKTGMIRAMAGTGRVVTIAGLVFA 877
+ +VL + ++V A+ N + R E K ++M+ + +V +
Sbjct: 394 TMFGMVLAIGLLVDDAIVVVEN---VERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLS 450

Query: 878 FTMGSMVA---SDLRVVGQIGTTIMIGLLFDTLVVRSYMTPALATLL 921
M S + Q TI+ + LV +TPAL L
Sbjct: 451 AVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALI-LTPALCATL 496



Score = 35.2 bits (81), Expect = 0.002
Identities = 36/215 (16%), Positives = 78/215 (36%), Gaps = 19/215 (8%)

Query: 145 DQGGSQANESVAAVQRIVDSVPPPPGIKAYVTGPGPLGADRVVYGDRSLHTITGISIAVI 204
G+ + +++A ++ + +P G TG + + + IS V+
Sbjct: 830 AAPGTSSGDAMALMENLASKLPAGIGYD--WTG----MSYQERLSGNQAPALVAISFVVV 883

Query: 205 AIMLFIAYRSLSAALIMLLTVGLELLAVRGIISTFAVNDLMGLSTFTVNVLVALTIAAST 264
+ L Y S S + ++L V L ++ +++ N + F V +L + ++A
Sbjct: 884 FLCLAALYESWSIPVSVMLVVPLGIVG--VLLAATLFNQKNDV-YFMVGLLTTIGLSAKN 940

Query: 265 DYIIFLVGRYQEA-RATGQNREAAYYTMFGGTAHV--VLASGLTVAGAMYCLGFTRLP-- 319
I +V ++ G+ A T+ + +L + L + L +
Sbjct: 941 A--ILIVEFAKDLMEKEGKGVVEA--TLMAVRMRLRPILMTSLAFILGVLPLAISNGAGS 996

Query: 320 -YFNTLASPCAIGLVTVMLASLTLAPAIIAVASRF 353
N + G+V+ L ++ P V R
Sbjct: 997 GAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRRC 1031



Score = 31.7 bits (72), Expect = 0.019
Identities = 55/319 (17%), Positives = 105/319 (32%), Gaps = 55/319 (17%)

Query: 635 TVKDLAQTLTSAFSGL-VTQMEDMTRNATVMGRTFDAANNDDSFYLPPEAFQN-PDFQRG 692
++ D+ QT+++A G V D R + + D F + PE
Sbjct: 741 SLSDINQTISTALGGTYVNDFIDRGRVKKLYVQA------DAKFRMLPEDVDKLYVRSAN 794

Query: 693 -----LKLFLSPDGTCARFVITHR-GDPAS------AEGISHIDPIMQAADEAVKGTPLQ 740
F + + G P+ A G S M + P
Sbjct: 795 GEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTS-SGDAMALMENLASKLP-- 851

Query: 741 AASIYLAGTSSTYKDIHEGTLYDVMIAVVASLCLIFIIMLGITRSVVASAVIVGTVALSL 800
A I T +Y++ G V S ++F+ + + S ++ V L +
Sbjct: 852 -AGIGYDWTGMSYQERLSGN--QAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGI 908

Query: 801 GSAFGLSVLIWQHILHMPLHWLVLPMAIIVMLAVG-SDYNLLLIARFQEEIGAGLKTGMI 859
+ VL+ + + + + ++ +G S N +LI F +++ G++
Sbjct: 909 -----VGVLLAATLFNQKND---VYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960

Query: 860 RAMAGTGRV----------VTIAG---LVFAFTMGSMVASDLRVVGQIGTTIMIGLLFDT 906
A R+ I G L + GS + +G +M G++ T
Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQN------AVGIGVMGGMVSAT 1014

Query: 907 LVVRSYMTPALATLLGRWF 925
L+ + P ++ R F
Sbjct: 1015 LLAI-FFVPVFFVVIRRCF 1032


6  Rv0459  Rv0474  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv0459    2       10       -2.043777  Conserved hypothetical protein
Rv0460    1       9        -2.105590  Conserved hydrophobic protein
Rv0461    1       10       -2.106451  Probable transmembrane protein
Rv0462    0       14       -1.761245  Dihydrolipoamide dehydrogenase LpdC (lipoamide
Rv0463    1       15       -1.159799  Probable conserved membrane protein
Rv0464c   0       11       -2.936096  Conserved protein
Rv0465c   1       13       -3.300722  Probable transcriptional regulatory protein
Rv0466    1       13       -3.634470  Conserved protein
Rv0467    0       12       -3.165518  Isocitrate lyase Icl (isocitrase)
Rv0468    -1      11       -4.205602  3-hydroxybutyryl-CoA dehydrogenase FadB2
Rv0469    -1      11       -3.669285  Possible mycolic acid synthase UmaA
Rv0470c   0       11       -1.769494  Mycolic acid synthase PcaA (cyclopropane
Rv0470A   2       15       -0.670041  Hypothetical protein
Rv0471c   2       16       -1.073676  Hypothetical protein
Rv0472c   3       15       -1.408955  Probable transcriptional regulatory protein
Rv0473    2       17       -0.129948  Possible conserved transmembrane protein
Rv0474    3       13       -0.735107  Probable transcriptional regulatory protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
Rv0461    OMPADOMAIN  27     0.034    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 27.2 bits (60), Expect = 0.034
Identities = 11/26 (42%), Positives = 16/26 (61%)

Query: 135 TALSLGVMSGPFASVAAAAPLYGYYY 160
TA+++ V FA+VA AAP +Y
Sbjct: 4 TAIAIAVALAGFATVAQAAPKDNTWY 29


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv0472c   HTHTETR  52     3e-10    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 51.9 bits (124), Expect = 3e-10
Identities = 34/182 (18%), Positives = 66/182 (36%), Gaps = 14/182 (7%)

Query: 17 RRWHQHKVERRNELVDGTIEAIRRHGRF-LSMDEIAAEIGVSKTVLYRYFVDKNDLTTAV 75
R+ Q E R ++D + + G S+ EIA GV++ +Y +F DK+DL + +
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 76 MMRFTQTTLIPNMIAALSADMDG--FELTREIIRVYVETVAAQPEPYRFVMANSSASKS- 132
I + A G + REI+ +E+ + + +
Sbjct: 63 WELSESN--IGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120

Query: 133 ---KVIADSERIIA---RMLAVMLRRRMQEAGMDTGGVEP--WAYLIVGGVQLATHSWMS 184
V+ ++R + + EA M + A ++ G + +W+
Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180

Query: 185 DP 186
P
Sbjct: 181 AP 182


7  Rv0514  Rv0538  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv0514    2       13       0.481006   Possible transmembrane protein
Rv0515    2       12       -0.049525  Conserved 13E12 repeat family protein
Rv0516c   2       9        -0.887253  Possible anti-anti-sigma factor
Rv0517    1       9        -1.385650  Possible membrane acyltransferase
Rv0518    0       11       -1.252211  Possible exported protein
Rv0519c   -2      10       0.124361   Possible conserved membrane protein
Rv0520    -2      11       -0.011010  Possible methyltransferase/methylase (fragment)
Rv0521    -3      9        0.554746   Possible methyltransferase/methylase (fragment)
Rv0522    -2      11       1.237359   Probable GABA permease GabP (4-amino butyrate
Rv0523c   0       17       3.262597   Conserved protein
Rv0524    1       18       3.305392   Probable glutamate-1-semialdehyde
Rv0525    2       18       3.242475   Conserved protein
Rv0526    3       18       3.057469   Possible thioredoxin protein (thiol-disulfide
Rv0527    2       20       3.149059   Possible cytochrome C-type biogenesis protein
Rv0528    4       16       5.299946   Probable conserved transmembrane protein
Rv0529    3       13       4.684634   Possible cytochrome C-type biogenesis protein
Rv0530    4       14       4.996711   Conserved protein
Rv0530A   6       15       4.650663   Conserved protein
Rv0531    6       15       4.820329   Possible conserved membrane protein
Rv0532    4       14       4.140309   PE-PGRS family protein PE_PGRS6
Rv0533c   0       13       2.709558   3-oxoacyl-[acyl-carrier-protein] synthase III
Rv0534c   1       14       3.336018   1,4-dihydroxy-2-naphthoate octaprenyltransferase
Rv0535    0       13       3.280387   Probable 5'-methylthioadenosine phosphorylase
Rv0536    -1      15       3.458648   Probable UDP-glucose 4-epimerase GalE3
Rv0537c   -1      14       3.263654   Probable integral membrane protein
Rv0538    -1      14       3.619127   Possible conserved membrane protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv0532    cloacin  42     7e-06    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 41.6 bits (97), Expect = 7e-06
Identities = 29/85 (34%), Positives = 35/85 (41%)

Query: 375 LSSGDGGAGGAGGGGGWLFGNGGDGGAGGGGGGRFGSGSGAGGDGAVGGAGGAGAWFGNG 434
+S GDG G NGG G G GGG GSG + + GG+G W G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 435 GAGGVGGGGGRGTTAIGGDGGAGGA 459
G G GG G G + G + A
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVA 85



Score = 37.0 bits (85), Expect = 2e-04
Identities = 29/97 (29%), Positives = 33/97 (34%)

Query: 344 GGAAGAGGAGGWLYGDGGAGGVGGVGGAVFSLSSGDGGAGGAGGGGGWLFGNGGDGGAGG 403
G GA G + G GVGG SS + GG G G G G G GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 404 GGGGRFGSGSGAGGDGAVGGAGGAGAWFGNGGAGGVG 440
G GSG+G GAGG+
Sbjct: 68 NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 37.0 bits (85), Expect = 2e-04
Identities = 24/77 (31%), Positives = 36/77 (46%)

Query: 138 GNGGNGGSGAAGQAGGAGGAAGLIGHGGTGGAVTGVSTTGGPGGHGGDAGLYGFGGAGGA 197
G+G +GA +G G +G GG +G S+ P G G +G++ GG+G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 198 GGFGQSGAAGGAGGAGG 214
G G + GG+G G
Sbjct: 64 NGGGNGNSGGGSGTGGN 80



Score = 36.2 bits (83), Expect = 4e-04
Identities = 33/100 (33%), Positives = 37/100 (37%), Gaps = 1/100 (1%)

Query: 358 GDGGAGGVGGVGGAVFSLSSGDGGAGGAGGGGGWLFGNGGDGGAGGGGGGRFGSGSGAGG 417
G G G G + +G G GGA G GW N GG G G G GSG G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG-GSGHGN 64

Query: 418 DGAVGGAGGAGAWFGNGGAGGVGGGGGRGTTAIGGDGGAG 457
G G +GG GN A G + G GG
Sbjct: 65 GGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 35.8 bits (82), Expect = 5e-04
Identities = 24/80 (30%), Positives = 28/80 (35%)

Query: 413 SGAGGDGAVGGAGGAGAWFGNGGAGGVGGGGGRGTTAIGGDGGAGGAGGAGGWLYGDGGA 472
SG G G GA G G GGG + + G G G +G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 473 GGAGGGGGRGGTGNDGGDGG 492
G GGG G G G+ G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 35.8 bits (82), Expect = 5e-04
Identities = 34/103 (33%), Positives = 45/103 (43%), Gaps = 3/103 (2%)

Query: 204 GAAGGAGGAGGWLYGDGGDGGAGDNGGNESGTGVSAVGGVGGAGGAGGLLFGNGGDGGVG 263
G GA G + +GG G G GG G+G S+ G G G+ +G G G G
Sbjct: 8 GHNTGAHSTSGNI--NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 264 GDGGDGSSTQDSGGDGGAGGAGGAGGW-LLGNGGAGGAGGAAS 305
G G+ +GG+ A A A G+ L GAGG + S
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSIS 108



Score = 34.7 bits (79), Expect = 0.001
Identities = 25/77 (32%), Positives = 28/77 (36%)

Query: 410 GSGSGAGGDGAVGGAGGAGAWFGNGGAGGVGGGGGRGTTAIGGDGGAGGAGGAGGWLYGD 469
G G G G G G GG G G GG G+G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 470 GGAGGAGGGGGRGGTGN 486
GG G +GGG G GG +
Sbjct: 66 GGNGNSGGGSGTGGNLS 82



Score = 34.7 bits (79), Expect = 0.001
Identities = 33/107 (30%), Positives = 40/107 (37%)

Query: 435 GAGGVGGGGGRGTTAIGGDGGAGGAGGAGGWLYGDGGAGGAGGGGGRGGTGNDGGDGGDG 494
G G G G +T+ +GG G G GG G G + GG G+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 495 GRGGDAQLLGNGGDGGAGGAGGPAGLALPPGPARPAGAAVPAVRCSA 541
G GG G G G + A +A GA AV SA
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109



Score = 32.4 bits (73), Expect = 0.005
Identities = 29/108 (26%), Positives = 33/108 (30%), Gaps = 1/108 (0%)

Query: 153 GAGGAAGLIGHGGTGGAVTGVSTTGGPGGHGGDAGLYGFGGAGGAGGFGQSGAAGGAGGA 212
G G G T G + G T G GG D + GG G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 213 GGWLYGDGGDGGAGDNGGNESGTGVSAVGGVG-GAGGAGGLLFGNGGD 259
G GG+G G + A G GAGGL
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 32.0 bits (72), Expect = 0.007
Identities = 32/94 (34%), Positives = 35/94 (37%), Gaps = 6/94 (6%)

Query: 235 TGVSAVGGVGGAGGAGGLLFGNGGDGGVGGDGGDG---SSTQDSGGDGGAGGAGGAGGWL 291
+G G GA G + G GVGG DG SS + G G G GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 292 LGNGGAGGAGGAASIKVATGGLGGDGGDAGLFGF 325
GNGG G G S TGG FGF
Sbjct: 62 HGNGGGNGNSGGGS---GTGGNLSAVAAPVAFGF 92



Score = 31.2 bits (70), Expect = 0.014
Identities = 29/103 (28%), Positives = 33/103 (32%), Gaps = 3/103 (2%)

Query: 326 GGDGGWGGRGVDARFGAAGGAAGAGGAGGWLYGDGGAGGVGGVGGAVFSLSSGDGGAGGA 385
GGDG G + G G G GG G G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 386 GGGGGWLFGNGGDGGAGGGGGGRFGSGSGAGGDGAVGGAGGAG 428
G GGG GG+G GG + A G A+ G G
Sbjct: 63 GNGGG---NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 30.1 bits (67), Expect = 0.027
Identities = 35/102 (34%), Positives = 40/102 (39%), Gaps = 4/102 (3%)

Query: 256 NGGDGGVGGDGGDGSSTQDSGGDGGAGGAGGAG---GWLLGNGGAGGAGGAASIKVATGG 312
+GGDG G +S +GG G G GGA GW N GG G+ I G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGS-GIHWGGGS 60

Query: 313 LGGDGGDAGLFGFGGDGGWGGRGVDARFGAAGGAAGAGGAGG 354
G+GG G G G G V A A GAGG
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 29.7 bits (66), Expect = 0.039
Identities = 40/124 (32%), Positives = 47/124 (37%), Gaps = 5/124 (4%)

Query: 194 AGGAGGFGQSGAAGGAGGAGGWLYGDGGDGGAGDNGG----NESGTGVSAVGGVGGAGGA 249
+GG G +GA +G G G G GGA D G N G S G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 250 GGLLFGNGGDGGVGGDGGDGSSTQDSGGDG-GAGGAGGAGGWLLGNGGAGGAGGAASIKV 308
G GNG GG G GG+ S+ G A GAGG + + A I
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMA 121

Query: 309 ATGG 312
A G
Sbjct: 122 ALKG 125


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv0536    NUCEPIMERASE  171    9e-53    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 171 bits (434), Expect = 9e-53
Identities = 91/369 (24%), Positives = 132/369 (35%), Gaps = 74/369 (20%)

Query: 1 MRVLLTGAAGFIGSRVDAALRAAGHDVVGVDALLPAAHGPNPVLPPGCQ----------- 49
M+ L+TGAAGFIG V L AGH VVG+D L + L
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLND---YYDVSLKQARLELLAQPGFQFH 57

Query: 50 RVDVRDASALAPLLA--GVDLVCHQAAMVGAGVNAADAPAYGGHNDFATTVLLAQMFAAG 107
++D+ D + L A + V + + + AY N +L
Sbjct: 58 KIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNK 117

Query: 108 VRRLVLASSMVVYGQGRYDCPQHGPVDPLPRRRADLDNGVFEHRCPGCGEPVIWQLVDED 167
++ L+ ASS VYG R +P F +D
Sbjct: 118 IQHLLYASSSSVYGLNR----------KMP----------FST---------------DD 142

Query: 168 APLRPRSLYAASKTAQEHYALAWSEASGGSVVALRYHNVYGPGMPRDTPYSGVAAIFRSA 227
+ P SLYAA+K A E A +S G LR+ VYGP D F A
Sbjct: 143 SVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFFTVYGPWGRPDMALF----KFTKA 198

Query: 228 VEKGKPPKVFEDGGQMRDFVHVDDVAAANLAAVH---LGEADRDGFTA-----------V 273
+ +GK V+ G RDF ++DD+A A + + T
Sbjct: 199 MLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVY 258

Query: 274 NVCSGRPISILQVATAICDARGGSMSPAITGH--YRSGDVRHIVADPARAARVLGFRAAV 331
N+ + P+ ++ A+ DA G A + GDV AD V+GF
Sbjct: 259 NIGNSSPVELMDYIQALEDALG---IEAKKNMLPLQPGDVLETSADTKALYEVIGFTPET 315

Query: 332 DPGEGLREF 340
+G++ F
Sbjct: 316 TVKDGVKNF 324


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
Rv0538    IGASERPTASE  39     8e-05    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 38.5 bits (89), Expect = 8e-05
Identities = 25/113 (22%), Positives = 34/113 (30%), Gaps = 6/113 (5%)

Query: 402 PGWQPGMPTIPTAPPTTPVTTSATTPPTTPPTTPVTTPPTTPPTTPVTTPPTTPPTTPVT 461
P +P PT P + + TT T P T PVT T V
Sbjct: 1141 PQAEPARENDPTVNIKEPQSQTNTTADTEQPAK--ETSSNVEQ--PVTESTTVNTGNSVV 1196

Query: 462 TPPTTVAPTTVAPTTVAPTTVAPTTVAPAT--ATPTTVAPQPTQQPTQQPTQQ 512
P P T PT + ++ P + + P V P T +
Sbjct: 1197 ENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVAL 1249



Score = 33.1 bits (75), Expect = 0.003
Identities = 19/89 (21%), Positives = 25/89 (28%), Gaps = 2/89 (2%)

Query: 439 PPTTPPTTPVTTPPTTPPTTPVTTPPTTVAPTTVAPTTVAPTTVAPTTVAPATATPTTVA 498
P T +P T P P PT + T T PA T + V
Sbjct: 1123 PKVTSQVSPKQEQSET--VQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVE 1180

Query: 499 PQPTQQPTQQPTQQMPTQQQTVAPQTVAP 527
T+ T + + P T P
Sbjct: 1181 QPVTESTTVNTGNSVVENPENTTPATTQP 1209


8  Rv0573c  Rv0586  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv0573c   3       17       5.124063   Nicotinic acid phosphoribosyltransferase PncB2
Rv0574c   3       16       5.383552   Conserved hypothetical protein
Rv0575c   3       18       5.846628   Possible oxidoreductase
Rv0576    5       17       6.559067   Probable transcriptional regulatory protein
Rv0577    6       20       6.983439   Conserved protein TB27.3
Rv0578c   5       20       7.016630   PE-PGRS family protein PE_PGRS7
Rv0579    1       11       -0.924088  Conserved hypothetical protein
Rv0580c   -1      12       -1.047327  Conserved protein
Rv0581    -2      11       -1.228777  Possible antitoxin VapB26
Rv0582    -2      11       -1.416914  Possible toxin VapC26. Contains PIN domain.
Rv0583c   -2      10       -1.509364  Probable conserved lipoprotein LpqN
Rv0584    -2      10       -2.006327  Possible conserved exported protein
Rv0585c   -2      12       -2.417612  Probable conserved integral membrane protein
Rv0586    0       10       -3.245774  Probable transcriptional regulatory protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
Rv0578c   FLAGELLIN  42     1e-05    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 42.3 bits (99), Expect = 1e-05
Identities = 43/296 (14%), Positives = 62/296 (20%), Gaps = 14/296 (4%)

Query: 904 NGDHALSGNGAAGGNGGNGGNGSLRGSGGAGGHGGNGGNASRGMGGDGGTGGAGGNAGQI 963
NG LS + G ++ S G+ G G G +
Sbjct: 132 NGVKVLSQDNQMKIQVGANDGETITID------LQKIDVKSLGLDGFNVNGPKEATVGDL 185

Query: 964 GNGGAGGNGGDGGTGSDGNPGAITGSGGRGGDGGVGGQGGSVAGDGADGGRGGAGGTGGT 1023
+ G D SG D V + A+G T
Sbjct: 186 KSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAENNT 245

Query: 1024 GLRGTTGATGATGTFDAGADGHGGNGGTGGVGGT---GGAGGGGGNGGAGGKALSPTGNN 1080
+ GT +A A GG G G G +S T N
Sbjct: 246 AVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTTING 305

Query: 1081 GSQGAGGDGGAGGAGGTGGTGGDGGRGAHGTLFSSLAGTGGTGGNGGTGGTGGTGGAGGA 1140
GA + + + G + T A
Sbjct: 306 EKVTLTVADITAGAANVDAATLQSSKNVYTS-----VVNGQFTFDDKTKNESAKLSDLEA 360

Query: 1141 GGTGSTLGATGATGAAGRAGNGGVGGSGGLGSAFGPGGTGGMGGAGGTSTVSAGGD 1196
GA A G + + F G+ +A
Sbjct: 361 NNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKS 416


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv0581    SHAPEPROTEIN  24     0.040    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 24.3 bits (53), Expect = 0.040
Identities = 10/32 (31%), Positives = 18/32 (56%)

Query: 13 KAAVKRAARQRGVSEAQVIRESIRAAVGGAKP 44
+ A++ +A+ G E +I E + AA+G P
Sbjct: 123 RRAIRESAQGAGAREVFLIEEPMAAAIGAGLP 154


9  Rv0737  Rv0755c  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv0737    2       16       1.637740   Possible transcriptional regulatory protein
Rv0738    3       15       0.982612   Conserved protein
Rv0739    1       14       -0.093875  Conserved hypothetical protein
Rv0740    0       18       0.870737   Conserved hypothetical protein
Rv0741    7       15       7.202271   Probable transposase (fragment)
Rv0742    12      18       9.564406   PE-PGRS family protein PE_PGRS8
Rv0743c   10      18       9.257948   Hypothetical protein
Rv0744c   9       18       9.596683   Possible transcriptional regulatory protein
Rv0745    11      19       10.129693  Conserved hypothetical protein
Rv0746    12      23       10.359504  PE-PGRS family protein PE_PGRS9
Rv0747    9       22       7.417849   PE-PGRS family protein PE_PGRS10
Rv0748    1       18       0.769523   Possible antitoxin VapB31
Rv0749    0       16       0.860349   Possible toxin VapC31. Contains PIN domain.
Rv0749A   1       13       1.589767   Hypothetical protein (fragment)
Rv0750    4       11       -0.558499  Conserved hypothetical protein
Rv0751c   4       11       -0.743195  Probable 3-hydroxyisobutyrate dehydrogenase MmsB
Rv0752c   4       9        -0.789772  Probable acyl-CoA dehydrogenase FadE9
Rv0753c   3       9        -1.351954  Probable methylmalonate-semialdehyde
Rv0754    3       12       -1.345947  PE-PGRS family protein PE_PGRS11
Rv0755c   3       14       -2.758797  PPE family protein PPE12
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv0746    cloacin  36     6e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 35.8 bits (82), Expect = 6e-04
Identities = 26/79 (32%), Positives = 30/79 (37%)

Query: 514 GGAGGAAGLWGTGGAGGAGGSSAGGGGAGGAGGAGGWLLGDGGAGGIGGASTVLGGTGGG 573
GG G +G G G G GGA GW + GG G+ GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 574 GGVGGLWGAGGAGGAGGTG 592
G GG +GG G GG
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 35.1 bits (80), Expect = 0.001
Identities = 29/103 (28%), Positives = 40/103 (38%), Gaps = 2/103 (1%)

Query: 272 TGGRGFLNNGGVGGAGGN--AGLLFGAGGTGGSGGAGLGGDGGAGGAGGNTGVLFGNAGS 329
+GG G +N G GN G G G S G+G + G G +G+ +G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 330 GGTGGFGDTDGGAGGAGGDAGWLGSGGVGGAGGFGETGDGGVG 372
G GG GG G GG+ + + G G GG+
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 34.3 bits (78), Expect = 0.002
Identities = 24/91 (26%), Positives = 36/91 (39%), Gaps = 9/91 (9%)

Query: 232 GAGGHGGAGGLGAVTGGVGGTGGAGGLLAGLLAGPGGAGGTGGRGFLNNGGVGGAGGNAG 291
G G G G + +G + G G G G + G G+ + G G +G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLG---------VGGGASDGSGWSSENNPWGGGSGSG 53

Query: 292 LLFGAGGTGGSGGAGLGGDGGAGGAGGNTGV 322
+ +G G G+GG GG+G G + V
Sbjct: 54 IHWGGGSGHGNGGGNGNSGGGSGTGGNLSAV 84



Score = 33.9 bits (77), Expect = 0.003
Identities = 35/134 (26%), Positives = 49/134 (36%), Gaps = 3/134 (2%)

Query: 588 AGGTGLVGGDGGAGGAGGTGGLLAGLIGAGGGHGGTGGLSTNGDGGVGGAGGNAGMLAGP 647
+GG G G +G G GL GG G+G S N G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 648 GGAGGAGGDGENLDTGGDGGAGGSAGLLFG--SGGAGGAGGFGF-LGGDGGAGGNAGLLL 704
G GG G+ G + +A + FG + GAGG + + A ++
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMA 121

Query: 705 SSGGAGGFGGFGTA 718
+ G FG +G A
Sbjct: 122 ALKGPFKFGLWGVA 135



Score = 33.5 bits (76), Expect = 0.004
Identities = 29/95 (30%), Positives = 38/95 (40%), Gaps = 2/95 (2%)

Query: 478 NGTPGAVGSGATGAPGGWLLGDGGAGGSGAAGSGAPGGAGGAAGLWGTGGAGGAGGSSAG 537
N + G P G +G G + GSG + P G G +G+ GG+G G G
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG--GG 67

Query: 538 GGGAGGAGGAGGWLLGDGGAGGIGGASTVLGGTGG 572
G +GG G GG L G + G GG
Sbjct: 68 NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 32.8 bits (74), Expect = 0.005
Identities = 38/122 (31%), Positives = 42/122 (34%), Gaps = 20/122 (16%)

Query: 302 SGGAGLGGDGGAGGAGGNTGVLFGNAGSGGTGGFGDTDGGAGGAGGDAGWLGSGGVGGAG 361
SGG G G + GA GN N G G G GGA +GW G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNI-----NGGPTGLGV-------GGGASDGSGWSSENNPWGGG 49

Query: 362 GFGETGDGGVGGAGGKAGLLIGNGGAGGAGGQGAVTGGTGGAGGDGVLIGNGGNAGIGGT 421
GG G G NGG G G G+ TGG A V G + G
Sbjct: 50 SGSGIHWGGGSGHG--------NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAG 101

Query: 422 GP 423
G
Sbjct: 102 GL 103



Score = 32.4 bits (73), Expect = 0.009
Identities = 30/106 (28%), Positives = 39/106 (36%)

Query: 128 NGANGAPGTGANGAPGGWLLGNGGAGGSAAAGSGLPGGAGGAAGLFGTGGAGGAGGSSTV 187
N + NG P G +G G + GS + P G G +G+ GG+G G
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69

Query: 188 GDGEAGGAGGSGGWLLGTGGVGGVGGLGAGAGGAGGVGGAGGLLGA 233
G G GG+ + G GAGG AG L A
Sbjct: 70 NSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 32.4 bits (73), Expect = 0.009
Identities = 24/68 (35%), Positives = 25/68 (36%)

Query: 174 GTGGAGGAGGSSTVGDGEAGGAGGSGGWLLGTGGVGGVGGLGAGAGGAGGVGGAGGLLGA 233
G G G G GGA GW GG G G GG G G GG +
Sbjct: 12 GAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNS 71

Query: 234 GGHGGAGG 241
GG G GG
Sbjct: 72 GGGSGTGG 79



Score = 31.6 bits (71), Expect = 0.015
Identities = 36/120 (30%), Positives = 45/120 (37%), Gaps = 7/120 (5%)

Query: 148 GNGGAGGSAAAGSGLPGGAGGAAGLFGTGGAGGAGGSST---VGDGEAGGAGGSGGWLLG 204
G G G+ + + GG G G G + G+G SS G G G GG G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGV--GGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 205 TGGVGGVGGLGAGAGGAGGVGGAGGLLG--AGGHGGAGGLGAVTGGVGGTGGAGGLLAGL 262
GG G G G+G GG A G A GAGGL + ++A L
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMAAL 123



Score = 31.2 bits (70), Expect = 0.017
Identities = 30/91 (32%), Positives = 35/91 (38%), Gaps = 6/91 (6%)

Query: 567 LGGTGGGGGVGGLWGAGGAGGAGGTGLVGGDGGAGGAGGT------GGLLAGLIGAGGGH 620
+ G G G G G G TGL G G + G+G + GG I GGG
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 621 GGTGGLSTNGDGGVGGAGGNAGMLAGPGGAG 651
G G GG G GGN +A P G
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 31.2 bits (70), Expect = 0.018
Identities = 31/108 (28%), Positives = 38/108 (35%), Gaps = 7/108 (6%)

Query: 678 SGGAGGAGGFGFLGGDGGAGGNAGLLLSSGGAGGFGGFGTAGGVGGAGGNAGWLGFGGAG 737
SGG G G G G L GGA G+ + G G +G GG+G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 738 GVGGSAGLIGTGGNGGNGGTGANAGSPGTG-------GAGGLLLGQNG 778
G GG+G G A A G GAGGL + +
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109



Score = 30.8 bits (69), Expect = 0.025
Identities = 34/106 (32%), Positives = 41/106 (38%), Gaps = 5/106 (4%)

Query: 498 GDGGAGGSGAAGSGAPGGAGGAAGLWGTGGAGGAGGSSAGGGGAGGAGGAGGWLLGDGGA 557
GDG +GA + G G G + G+G SS GG+G W G G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 558 GGIGGASTVLGGTGGGGGVGGLWGAGGAGGAGGTGLVGGDGGAGGA 603
G G G +GGG G GG A A A G + G G A
Sbjct: 64 NGGGN-----GNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 30.5 bits (68), Expect = 0.032
Identities = 32/110 (29%), Positives = 39/110 (35%), Gaps = 1/110 (0%)

Query: 205 TGGVGGVGGLGAGAGGAGGVGGAGGLLGAGGHGGAGGLGAVTGGVGGTGGAG-GLLAGLL 263
+GG G GA + GG GL GG G + GG G+G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 264 AGPGGAGGTGGRGFLNNGGVGGAGGNAGLLFGAGGTGGSGGAGLGGDGGA 313
G GG G G G G + F A T G+GG + GA
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 30.1 bits (67), Expect = 0.035
Identities = 30/96 (31%), Positives = 40/96 (41%), Gaps = 2/96 (2%)

Query: 358 GGAGGFGETGDGGVGGA--GGKAGLLIGNGGAGGAGGQGAVTGGTGGAGGDGVLIGNGGN 415
GG G TG G GG GL +G G + G+G GG+G G G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 416 AGIGGTGPTAGDTGAGGISGLLLGADGFNTPASASP 451
GG G + G +G GG + F PA ++P
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTP 98



Score = 30.1 bits (67), Expect = 0.040
Identities = 25/81 (30%), Positives = 29/81 (35%)

Query: 690 LGGDGGAGGNAGLLLSSGGAGGFGGFGTAGGVGGAGGNAGWLGFGGAGGVGGSAGLIGTG 749
+ G G G N G +SG G GG G GG G G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 750 GNGGNGGTGANAGSPGTGGAG 770
G+G GG G + G GTGG
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNL 81



Score = 30.1 bits (67), Expect = 0.042
Identities = 32/110 (29%), Positives = 42/110 (38%), Gaps = 7/110 (6%)

Query: 353 GSGGVGGAGGFGETGDGGVGGAGGKAGLLIGNGGAGGAGGQGAVTGGTGGAGGDGVLIGN 412
G+ G G TG G GGA +G N GG G G GG G G N
Sbjct: 12 GAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG-------N 64

Query: 413 GGNAGIGGTGPTAGDTGAGGISGLLLGADGFNTPASASPLHTLKQQALAA 462
GG G G G G + + + G +TP + ++ AL+A
Sbjct: 65 GGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSA 114


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv0747    cloacin  38     1e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.2 bits (88), Expect = 1e-04
Identities = 38/120 (31%), Positives = 48/120 (40%)

Query: 531 GNGGAGGNGGLFANGGAGGPGGFGSPAGAGGIGGAGGNGGLFGAGGTGGAGGGSTLAGGA 590
G G G N G + G G G G G G+G + GG G+G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 591 GGAGGNGGLFGAGGTGGAGSHSTAAGVSGGAGGAGGDAGLLSLGASGGAGGSGGSSLTAA 650
G GGNG G GTGG S A G + AG L++ S GA + + + AA
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMAA 122



Score = 34.7 bits (79), Expect = 0.002
Identities = 33/91 (36%), Positives = 42/91 (46%), Gaps = 6/91 (6%)

Query: 638 GAGGSGGSSLTAAGVVGGIGGAGGLLFGSGGAGGSGGFS--NSGNGGAGGAGGDAGLLVG 695
G G + G+ T+ + GG G G GGA G+S N+ GG G+G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGV----GGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 696 SGGAGGAGASATGAATGGDGGAGGKSGAFGL 726
G GG G S G+ TGG+ A AFG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGF 92



Score = 34.3 bits (78), Expect = 0.002
Identities = 31/106 (29%), Positives = 45/106 (42%), Gaps = 5/106 (4%)

Query: 544 NGGAGGPGGFGSPAGAGGIGGAGGNGGLFGAGGTGGAGGGSTLAGGAGGAGGNGGLFGAG 603
+GG G G+ + +G I G G+ G G + G+G S GG+G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGV-GGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 604 GTGGAGSHSTAAGVSGGAGGAGGDAGLLSLG----ASGGAGGSGGS 645
G G G + + G SG G A ++ G ++ GAGG S
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVS 106



Score = 33.9 bits (77), Expect = 0.003
Identities = 36/113 (31%), Positives = 44/113 (38%), Gaps = 4/113 (3%)

Query: 197 GGVGGFSNGGA--TGGAGGAGGAGGLFGAGRERGSGGSGNLTGGAGGAGGNAGTLATGDG 254
GG G N GA T G G G G G GSG S GG+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 255 GAGGTGGASRSGGFGGAGGAGGDAGMFFG--SGGSGGAGGISKSVGDSAAGGA 305
G GG G S G G + A + FG + + GAGG++ S+ A A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 33.5 bits (76), Expect = 0.003
Identities = 33/93 (35%), Positives = 39/93 (41%), Gaps = 8/93 (8%)

Query: 430 GNGGNGG-QGTIGGVNGGAGGAGGAGGILFGTGGTGGSGGPGATGLGGIGGAGGAALLFG 488
G G N G T G +NGG G G GG G+G + + G GG+G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWG-------GGSGSGIHWGG 58

Query: 489 SGGAGGSGGAGAVGGNGGAGGNAGALLGAAGAG 521
G G GG G GG G GGN A+ G
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 33.5 bits (76), Expect = 0.003
Identities = 39/120 (32%), Positives = 48/120 (40%), Gaps = 11/120 (9%)

Query: 459 GTGGTGGSGGPGATGLGGIGGAGGAALLFGSGGAGGSGGAGAVGGNGGAGGNAGALLGAA 518
G G G + G +T GG G G G S G+G N GG +G+ +
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTG-----LGVGGGASDGSGWSSENNPWGGGSGSGIHWG 57

Query: 519 GAGGAGGAGAVGGNGGAGGNGGLFANGGAGGPGGFGSPA----GAGGIGGAGGNGGLFGA 574
G G G G G +GG G GG P FG PA GAGG+ + G L A
Sbjct: 58 GGSGHGNGGGNGNSGGGSGTGG--NLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 33.1 bits (75), Expect = 0.004
Identities = 27/101 (26%), Positives = 37/101 (36%)

Query: 283 GSGGSGGAGGISKSVGDSAAGGAGGAPGLIGNGGNGGNGGASTGGGDGGPGGAGGTGVLI 342
G G G G + G+ G G G + G+G + + GG G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 343 GNGGNGGSGGTGATLGKAGIGGTGGVLLGLDGFTAPASTSP 383
GNGG G+ G G+ G V G + P +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGL 103



Score = 32.4 bits (73), Expect = 0.009
Identities = 28/95 (29%), Positives = 32/95 (33%)

Query: 117 NGANGAPGTGANGGPGGWLIGNGGAGGSGAPGAGAGGNGGAGGLFGSGGAGGASTDVAGG 176
N + NGGP G +G G + GSG GG+G GG G G
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69

Query: 177 AGGAGGAGGNAGMLFGAAGVGGVGGFSNGGATGGA 211
G G G A G S GA G A
Sbjct: 70 NSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.0 bits (72), Expect = 0.009
Identities = 28/85 (32%), Positives = 31/85 (36%), Gaps = 2/85 (2%)

Query: 503 GNGGAGGNAGALLGAAGAGGAGGAGAVGGNGGAGGNGGLFANGGAGGPGGFGSPAGAGGI 562
G G G N GA + G VGG G N GG G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 563 GGAGGNGGLFGAGGTGGAGGGSTLA 587
G GGNG GG+G G S +A
Sbjct: 63 GNGGGNGN--SGGGSGTGGNLSAVA 85



Score = 31.6 bits (71), Expect = 0.013
Identities = 30/84 (35%), Positives = 34/84 (40%), Gaps = 6/84 (7%)

Query: 229 SGGSGNLTGGAGGAGGNAGTLATGDGGAGGTGGASRSGGFGGA----GGAGGDAGMFFGS 284
SGG G G GA +G + G G G GGAS G+ GG G + G
Sbjct: 2 SGGDGR--GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 285 GGSGGAGGISKSVGDSAAGGAGGA 308
G G GG S G S GG A
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSA 83



Score = 31.2 bits (70), Expect = 0.018
Identities = 33/100 (33%), Positives = 38/100 (38%), Gaps = 5/100 (5%)

Query: 488 GSGGAGGSGGAGAVGGN-----GGAGGNAGALLGAAGAGGAGGAGAVGGNGGAGGNGGLF 542
G G G + GA + GN G G GA G+ + G G+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 543 ANGGAGGPGGFGSPAGAGGIGGAGGNGGLFGAGGTGGAGG 582
NGG G G GS G A F A T GAGG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 31.2 bits (70), Expect = 0.018
Identities = 26/71 (36%), Positives = 34/71 (47%), Gaps = 2/71 (2%)

Query: 669 AGGSGGFSNSGNGGAGGA--GGDAGLLVGSGGAGGAGASATGAATGGDGGAGGKSGAFGL 726
+GG G N+G G GG GL VG G + G+G S+ GG G+G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 727 GGDGGAGGATG 737
G+GG G +G
Sbjct: 62 HGNGGGNGNSG 72



Score = 31.2 bits (70), Expect = 0.021
Identities = 35/112 (31%), Positives = 47/112 (41%), Gaps = 13/112 (11%)

Query: 601 GAGGTGGAGSHSTAAGVSGGAGGAGGDAGLLSLGASGGAGGSGGSSLTAAGVVGGIGGAG 660
G G G+HST+ ++GG G G G G S GS ++ G G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGV-----------GGGASDGSGWSSENNPWGGGSGS 52

Query: 661 GLLFGSGGAGGSGGFSNSGNGGAGGAGGDAGLLVGSGGAGGAGASATGAATG 712
G+ +G G G+GG +GN G G G V + A G A +T A G
Sbjct: 53 GIHWGGGSGHGNGG--GNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 30.8 bits (69), Expect = 0.024
Identities = 25/86 (29%), Positives = 33/86 (38%)

Query: 228 GSGGSGNLTGGAGGAGGNAGTLATGDGGAGGTGGASRSGGFGGAGGAGGDAGMFFGSGGS 287
G G + +G G L G G + G+G +S + +GG G+G G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 288 GGAGGISKSVGDSAAGGAGGAPGLIG 313
GG G G A AP G
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 30.5 bits (68), Expect = 0.030
Identities = 25/80 (31%), Positives = 36/80 (45%), Gaps = 1/80 (1%)

Query: 156 GAGGLFGSGGAGGASTDVAGGAGGAGGAGG-NAGMLFGAAGVGGVGGFSNGGATGGAGGA 214
G G + GA S ++ GG G G GG + G + + GG +G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 215 GGAGGLFGAGRERGSGGSGN 234
G GG +G G+GG+ +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLS 82



Score = 30.5 bits (68), Expect = 0.035
Identities = 26/95 (27%), Positives = 34/95 (35%), Gaps = 3/95 (3%)

Query: 562 IGGAGGNGGLFGAGGTGGAGGGSTLAGGAGGAGGNGGLFGAGGTGGAGSHSTAAGVSGGA 621
+ G G G GA T G G G GG +G + + G + GG+
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 622 GGAGGDAGLLSLGASGGAGGSGGSSLTAAGVVGGI 656
G G + GG+G G S AA V G
Sbjct: 61 GHGNGGGN---GNSGGGSGTGGNLSAVAAPVAFGF 92



Score = 30.1 bits (67), Expect = 0.036
Identities = 32/107 (29%), Positives = 38/107 (35%), Gaps = 6/107 (5%)

Query: 137 GNGGAGGSGAPGAGAGGNGGAGGLFGSGGAGGASTDVAGGAGGAGGAGGNAGMLFGAAGV 196
G+G +GA NGG GL GGA S + GG+G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWG------ 57

Query: 197 GGVGGFSNGGATGGAGGAGGAGGLFGAGRERGSGGSGNLTGGAGGAG 243
GG G + GG GG+G G L G T GAGG
Sbjct: 58 GGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 30.1 bits (67), Expect = 0.042
Identities = 28/107 (26%), Positives = 43/107 (40%), Gaps = 6/107 (5%)

Query: 577 TGGAGGGSTLAGGAGGAGGNGGLFGAGGTGGAGSHSTAAGVSGGAGGAGGDAGLLSLGAS 636
TG + GG G G GG + G+G + ++ G SG GG +G +
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGG--ASDGSGWSSENNPWGGGSGSGIHWGGGSG----HGN 64

Query: 637 GGAGGSGGSSLTAAGVVGGIGGAGGLLFGSGGAGGSGGFSNSGNGGA 683
GG G+ G G + + F + G+GG + S + GA
Sbjct: 65 GGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 29.7 bits (66), Expect = 0.048
Identities = 20/56 (35%), Positives = 27/56 (48%)

Query: 115 IGNGANGAPGTGANGGPGGWLIGNGGAGGSGAPGAGAGGNGGAGGLFGSGGAGGAS 170
+G GA+ G + P G G+G G G+ GGNG +GG G+GG A
Sbjct: 29 VGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAV 84


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv0748    PF06057  26     0.028    Type IV secretory pathway VirJ component
		>PF06057#Type IV secretory pathway VirJ component

Length = 243

Score = 25.6 bits (56), Expect = 0.028
Identities = 12/61 (19%), Positives = 20/61 (32%)

Query: 11 ILAAAKRRARERGQSLGAVIEDALRREFAAAHVGGARPTVPVFDGGTGPRRGIDLTSNRA 70
+ + A A E +LG + A +P + +F G G +D
Sbjct: 14 LCSTANAFADEFADNLGLTLLPVEPSTQVNAASSHTKPPLVIFLSGDGGWATLDKAVGGI 73

Query: 71 L 71
L
Sbjct: 74 L 74
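
The percentages in the Identities, Positives and Gaps fields of each alignment header appear to be truncated rather than rounded (in the Rv0748 hit above, 12/61 is shown as 19%, not 20%). A minimal Python sketch, assuming the two-line header layout used throughout this report, parses one header and recomputes the percentages with integer division. The function name `parse_hsp_header` and the regular expressions are illustrative and are not part of PredictBias or BLAST.

```python
import re

# Patterns for the two header lines that precede each alignment block.
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*([\d.eE+-]+)"
)
FRACTION_RE = re.compile(r"(Identities|Positives|Gaps)\s*=\s*(\d+)/(\d+)\s*\((\d+)%\)")


def parse_hsp_header(score_line, stats_line):
    """Parse the 'Score = ...' and 'Identities = ...' lines of one alignment."""
    match = SCORE_RE.search(score_line)
    if match is None:
        raise ValueError("not a Score/Expect line: %r" % score_line)
    hsp = {
        "bits": float(match.group(1)),
        "raw_score": int(match.group(2)),
        "evalue": float(match.group(3)),
    }
    for name, num, den, shown in FRACTION_RE.findall(stats_line):
        num, den = int(num), int(den)
        hsp[name.lower()] = {
            "count": num,
            "length": den,
            "shown_pct": int(shown),
            # Assumption: the report truncates the percentage (integer division).
            "floor_pct": 100 * num // den,
        }
    return hsp


hsp = parse_hsp_header(
    "Score = 25.6 bits (56), Expect = 0.028",
    "Identities = 12/61 (19%), Positives = 20/61 (32%)",
)
```

For this header, the recomputed `floor_pct` values agree with the percentages printed in the report; a header without a Gaps field simply yields no `gaps` entry.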


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv0754    cloacin  42     6e-06    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 42.0 bits (98), Expect = 6e-06
Identities = 34/110 (30%), Positives = 42/110 (38%), Gaps = 6/110 (5%)

Query: 175 GDGGAGYTGGNGGSAGLIGNGGTGGAGFAGGVGGMGGTGGWLMGNGGMGGAG-GVGGNGG 233
G G G+ G ++G I NGG G G GG G GG G+G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNI-NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 234 AGGQALLFGNGGLGGAGGAGGVDGAIGRGGWFIGTGGMATIGGGGNGQSI 283
G G G G G G + + G ++T G GG SI
Sbjct: 62 HGNG----GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSI 107



Score = 41.6 bits (97), Expect = 8e-06
Identities = 36/98 (36%), Positives = 40/98 (40%), Gaps = 8/98 (8%)

Query: 157 GVTGTAAAPNGGPGGLLFGDGGAGYTGGNGGSAGLIGNGGTG-GAGFAGGVGGMGGTGGW 215
G T+ NGGP GL G GG G S GG+G G + GG G G
Sbjct: 12 GAHSTSGNINGGPTGL--GVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGG--- 66

Query: 216 LMGNGGMGGAGGVGGNGGAGGQALLFGNGGLGGAGGAG 253
GNG GG G GGN A + FG L G G
Sbjct: 67 --GNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 37.4 bits (86), Expect = 1e-04
Identities = 39/137 (28%), Positives = 56/137 (40%), Gaps = 13/137 (9%)

Query: 197 TGGAGFAGGVGGMGGTGGWLMGNGGMGGAGGVGGNGGAGGQALLFGNGGLGGAGGAGGVD 256
+GG G G G T G + G G GG G + G+G + GG G+G G
Sbjct: 2 SGGDG-RGHNTGAHSTSGNINGGPTGLGVGG-GASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 257 GAIGRGGWFIGTGGMATIGGGGNGQSIVIDF-VRHGQTPGNAAMLIDTAVPGPGLTALGQ 315
G GG +GG + GG + + + F TPG G ++
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAG---------GLAVSISAG 110

Query: 316 QQAQAIANALAA-KGPY 331
+ AIA+ +AA KGP+
Sbjct: 111 ALSAAIADIMAALKGPF 127


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv0755c   cloacin  35     0.001    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 35.1 bits (80), Expect = 0.001
Identities = 28/93 (30%), Positives = 32/93 (34%), Gaps = 2/93 (2%)

Query: 253 SGNIGNANVGGGNSGDNNFGFGNFGNANIGIGNAGPNMSSPAVPTPGNGNVGIGNGGNGN 312
SG G + G +S N G G G + G SS P G GI GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS- 60

Query: 313 FGGGNTGNANIGLGNVGDGNVGFGNSGSYNFGF 345
G GN G G G G + FGF
Sbjct: 61 -GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGF 92



Score = 32.8 bits (74), Expect = 0.004
Identities = 27/75 (36%), Positives = 36/75 (48%), Gaps = 1/75 (1%)

Query: 310 NGNFGGGNTGNANIGLGNVGDGNVGFGNSGSYNFGFGNTGNNNIGIGLTGSNQIGFGGLN 369
+G G G+ A+ GN+ G G G G + G G + NN G +GS GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 370 SGSGNIGFGNSGTGN 384
G+G G GNSG G+
Sbjct: 62 HGNGG-GNGNSGGGS 75



Score = 30.1 bits (67), Expect = 0.033
Identities = 27/92 (29%), Positives = 37/92 (40%), Gaps = 15/92 (16%)

Query: 226 GSGNIGNFNLGSGNGNVGIGPSSFNVGSGNIGNANVGGGNSGDNNFGFGNFGNANIGIGN 285
G G N S +GN+ GP+ VG GG + G G+ + N G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVG---------GGASDGS---GWSSENNPWGGGSG 51

Query: 286 AGPNMSSPAVPTPGNGNVGIGNGGNGNFGGGN 317
+G + + G GN GN G G+ GGN
Sbjct: 52 SGIHWGGGSGHGNGGGN---GNSGGGSGTGGN 80
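
Each summary row in this report seems to carry the best alignment of the ORF: the lowest Expect value, with its bit score rounded to a whole number (for Rv0755c, 35.1 bits and 0.001 become 35 and 0.001). The Python sketch below reproduces that row from the three Rv0755c alignments listed above. It is an inference from the report, not the tool's documented behaviour: the helper name `summarize_hits`, the use of the 0.05 E-value setting as an inclusive cutoff, and the half-up rounding are all assumptions.

```python
import math


def summarize_hits(hsps, evalue_cutoff=0.05):
    """Return (rounded bit score, E-value) of the best alignment, or None.

    hsps is a list of (bit_score, evalue) pairs for one ORF.
    """
    # Assumption: alignments at or below the cutoff are retained.
    kept = [h for h in hsps if h[1] <= evalue_cutoff]
    if not kept:
        return None
    # Best alignment: lowest E-value, ties broken by the higher bit score.
    best_bits, best_evalue = min(kept, key=lambda h: (h[1], -h[0]))
    # Round half up; Python's built-in round() would turn 38.5 into 38.
    return math.floor(best_bits + 0.5), best_evalue


# (bit score, Expect) pairs copied from the Rv0755c alignments above.
rv0755c = [(35.1, 0.001), (32.8, 0.004), (30.1, 0.033)]
summary = summarize_hits(rv0755c)
```

Half-up rounding is used because a 38.5-bit alignment elsewhere in the report is summarized as 39, which the built-in `round` would not give for the literal value 38.5.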


10  Rv0828c  Rv0834c  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias       Product
Rv0828c   7       20       5.769523      Possible deaminase
Rv0829    9       20       8.167830      Possible transposase (fragment)
Rv0830    8       20       7.449210      Possible S-adenosylmethionine-dependent
Rv0831c   8       20       8.241251      Conserved protein
Rv0832    6       19       8.504112****  PE-PGRS family protein PE_PGRS12
Rv0833    5       18       8.008832      PE-PGRS family protein PE_PGRS13
Rv0834c   0       15       4.488468      PE-PGRS family protein PE_PGRS14
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
Rv0832    RTXTOXINA  27     0.027    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 27.2 bits (60), Expect = 0.027
Identities = 26/107 (24%), Positives = 39/107 (36%), Gaps = 13/107 (12%)

Query: 20 RIGSALSLASAVAAAQTSAVQAAAADEVSAAIAALFSAHGRDFQALSARA---------- 69
R LS ++A A SAV A + +IA F + S R
Sbjct: 295 RAAQGLSTSAAAAGLIASAVTLAISPLSFLSIADKFK-RANKIEEYSQRFKKLGYDGDSL 353

Query: 70 -AAFHHEFVQALAAGAG-SYAVAEIAAASPLQSLIDVFNAPIQAATG 114
AAFH E A+ S +A +++ + + AP+ A G
Sbjct: 354 LAAFHKETGAIDASLTTISTVLASVSSGISAAATTSLVGAPVSALVG 400


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv0833    cloacin  39     5e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 39.3 bits (91), Expect = 5e-05
Identities = 36/126 (28%), Positives = 47/126 (37%)

Query: 273 AGGSGGAGGFANGSTGGAGGAGGGAGLIGNGGNGGSGGTSVATGGAGNGGAGGAGGGAGL 332
+GG G ST G G +G G + GSG +S G G+G GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 333 IGNGGNGGSGGMGDAPGGTGVGGIGGLLLGLDGANAPASTNPLHTAQQQALAAVNAPIQA 392
GNGG G+ G G GG + G + P + + AL+A A I A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMA 121

Query: 393 VTGRPL 398
P
Sbjct: 122 ALKGPF 127



Score = 37.4 bits (86), Expect = 2e-04
Identities = 35/107 (32%), Positives = 43/107 (40%), Gaps = 5/107 (4%)

Query: 512 GAGGAGGSGVSGGAGGEGGAGGAGGLFAGGGAGGAGGSGNNVGGAGGAGGVGGLFGAGGA 571
G G G + + G G G GG + G+G S N GG+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 572 GGSGGGGSVAGDSGAGGNAGLLAPGLAGGAGGGGGQGFDTGGAGGPG 618
G GG G+ G SG GGN +A +A G T GAGG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGF-----PALSTPGAGGLA 104



Score = 35.8 bits (82), Expect = 6e-04
Identities = 34/119 (28%), Positives = 45/119 (37%), Gaps = 1/119 (0%)

Query: 477 GAGGAGGAGGSSGIGGFAAGGAGGPGGAGGLFNGGGAGGAGGSGVSGGAGGEGGAGGAGG 536
G G G G+ G GG G G GG +G G G G GG+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 537 LFAGGGAGGAGGSGNNVGGAGGAGGVGGLFGAGGAGGSGGGG-SVAGDSGAGGNAGLLA 594
GG GGSG + A V F A G+GG S++ + + A ++A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMA 121



Score = 35.5 bits (81), Expect = 9e-04
Identities = 41/120 (34%), Positives = 53/120 (44%), Gaps = 7/120 (5%)

Query: 1 MIGNGGAGGSGAPGAIGGA--GGPAGLIGVGGAGGAGGDSAVAGVIGGAGGAGGAALLFG 58
M G G G + + G GGP GL GGA G S+ GG G+G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSG-----IH 55

Query: 59 AGGAGGAGGSGGSGAAGGAGGAGGAGGLFASGGSGGFGGFASTGTGGAGGTGGAGGLFAS 118
GG G G GG+G +GG G GG A+ + GF ++ G GG + AG L A+
Sbjct: 56 WGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 34.3 bits (78), Expect = 0.002
Identities = 27/82 (32%), Positives = 34/82 (41%), Gaps = 1/82 (1%)

Query: 61 GAGGAGGSGGSGAAGGAGGAGGAGGLFASGGSGGFGGFASTGTGGAGGTGGAGGLFASGG 120
G G G + G+ + G GG GL GG+ G++S GG+G G
Sbjct: 3 GGDGRGHNTGAHSTSGNIN-GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 121 VGGTGGGAGSGGTGGVGGTGGA 142
G GG SGG G GG A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSA 83



Score = 34.3 bits (78), Expect = 0.002
Identities = 39/107 (36%), Positives = 45/107 (42%), Gaps = 7/107 (6%)

Query: 497 GAGGPGGAGGLFNGGGA--GGAGGSGVSGGAGGEGGAGGAGGLFAGGGAGGAGGSGNNVG 554
G G G G + G GG G GV GGA + G+G GG GSG + G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGA-----SDGSGWSSENNPWGGGSGSGIHWG 57

Query: 555 GAGGAGGVGGLFGAGGAGGSGGGGSVAGDSGAGGNAGLLAPGLAGGA 601
G G G GG +GG G+GG S A G L PG G A
Sbjct: 58 GGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 33.9 bits (77), Expect = 0.002
Identities = 36/114 (31%), Positives = 46/114 (40%), Gaps = 4/114 (3%)

Query: 109 TGGAGGLFASGGVGGTGGGAGSGGTGGVGGTGGAGGLFASGGAGGAGGSGGTGGAGGTGG 168
+GG G +G +G G GVGG G ++S GGSG GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 169 AGGLFGAGGAGGLGGQGNHTGGHGGAGGSAGLLALGDGGAGGAGGAATTGTGGA 222
G GG G G G+ TGG+ A + GAGG A + + GA
Sbjct: 62 HGN----GGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 33.5 bits (76), Expect = 0.003
Identities = 38/121 (31%), Positives = 48/121 (39%), Gaps = 9/121 (7%)

Query: 120 GVGGTGGGAGSGGTGGVGGTGGAGGLFASGGAGGAGGSGGTGGAGGTGGAGGLFGAGGAG 179
G G G G+ T G G G G + G+G S GG G+G +G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 180 GLGGQGNHTGGHGGAGGSA---------GLLALGDGGAGGAGGAATTGTGGAGGAGGKAG 230
G GG ++GG G GG+ G AL GAGG + + G A A A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMAA 122

Query: 231 L 231
L
Sbjct: 123 L 123



Score = 33.1 bits (75), Expect = 0.004
Identities = 26/82 (31%), Positives = 33/82 (40%)

Query: 656 SGGAGGAGGSGRTDLGGAGGAGGKAGLIGNGGNGGAGGAGGNGGGDGGPGGAAFGLGNGG 715
SGG G +G G G +G G + G+G + N GG G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 716 NGGNGGTGTSAGSPGAGGAGGS 737
+G GG G S G G GG +
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSA 83



Score = 32.8 bits (74), Expect = 0.006
Identities = 35/115 (30%), Positives = 46/115 (40%), Gaps = 1/115 (0%)

Query: 627 SGGVGGAGGFGLTTGGPGAAGGDAGLLFGSGGAGGAGGSGRTDLGGAGGAGGKAGLIGNG 686
SGG G G + GG GL G G + G+G S + G GG+G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWG-GGSGSGIHWGGGS 60

Query: 687 GNGGAGGAGGNGGGDGGPGGAAFGLGNGGNGGNGGTGTSAGSPGAGGAGGSLIGA 741
G+G GG G +GGG G G + G + AG + G+L A
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 32.4 bits (73), Expect = 0.007
Identities = 37/114 (32%), Positives = 44/114 (38%), Gaps = 7/114 (6%)

Query: 425 TGGSGVSGGAGGDGGAGGILFGAGGAGGAGGAVTGTG----ATGGSGGAGGGALLFGAGG 480
+GG G G +G I G G G GGA G+G GG+G G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 481 AGGAGGSSGIGGFAAGGAGGPGGAGGL---FNGGGAGGAGGSGVSGGAGGEGGA 531
G GG+ GG + G A + F GAGG VS AG A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 32.4 bits (73), Expect = 0.008
Identities = 26/82 (31%), Positives = 38/82 (46%)

Query: 245 AGTFGDTGNSGGAGGAGGKAGLLFGSGGAGGSGGAGGFANGSTGGAGGAGGGAGLIGNGG 304
+G G N+G +G G G G GG+ G+++ + GG+G G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 305 NGGSGGTSVATGGAGNGGAGGA 326
+G GG + GG+G GG A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSA 83



Score = 32.4 bits (73), Expect = 0.008
Identities = 31/116 (26%), Positives = 44/116 (37%), Gaps = 4/116 (3%)

Query: 60 GGAGGAGGSGGSGAAGGAGGAGGAGGLFASGGSGGFGGFASTGTGGAGGTGGAGGLFASG 119
GA G+ G G G G + G S + +GG + +G GG+G G G
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG----G 66

Query: 120 GVGGTGGGAGSGGTGGVGGTGGAGGLFASGGAGGAGGSGGTGGAGGTGGAGGLFGA 175
G G +GGG+G+GG A G A G G + + + A
Sbjct: 67 GNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMAA 122



Score = 31.6 bits (71), Expect = 0.014
Identities = 34/111 (30%), Positives = 50/111 (45%), Gaps = 2/111 (1%)

Query: 180 GLGGQGNHTGGHGGAGGSAGLLALGDGGAGGAGGAATTGTGGAGGAGGKAGLLFGSGGAG 239
G G+G++TG H +G G G G + G+ + G G +G+ +G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 240 GSGGAAGTFGDTGNSGGAGGAGGKAGLLFGSGGAGGSGGAGGFANGSTGGA 290
G+GG G G G+ G + A + FG A + GAGG A + GA
Sbjct: 63 GNGGGNGNSGG-GSGTGGNLSAVAAPVAFGF-PALSTPGAGGLAVSISAGA 111



Score = 31.2 bits (70), Expect = 0.018
Identities = 36/120 (30%), Positives = 47/120 (39%), Gaps = 3/120 (2%)

Query: 451 GGAGGAVTGTGATGGSGGAGGGALLFGAGGAGGAGGSSGIGGFAAGGAGGPGGAGGLFNG 510
G G TG +T G+ G L G G + G+G SS + G G GG +G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 511 GGAGGAGGSGVSGGAGGEGGAGGAGGLFA--GGGAGGAGGSGNNVGGAGGAGGVGGLFGA 568
G GG G SG G GG A A F GAGG ++ + + + A
Sbjct: 64 NG-GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMAA 122



Score = 30.5 bits (68), Expect = 0.029
Identities = 38/119 (31%), Positives = 44/119 (36%), Gaps = 5/119 (4%)

Query: 80 AGGAGGLFASGGSGGFGGFASTGTGGAGGTGGAGGLFASGGVGGTGGGAGSGGTGGVGGT 139
+GG G +G G TG G G + G S GGG+GSG G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 140 GGAGGLFASGGAGGAGGSGGTGGAGGTGGAGGLFGAGGAGGLGGQGNHTGGHGGAGGSA 198
G GG G G +GG GTGG A FG G G GA +A
Sbjct: 62 HGNGG-----GNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 30.5 bits (68), Expect = 0.030
Identities = 30/102 (29%), Positives = 36/102 (35%), Gaps = 7/102 (6%)

Query: 401 NGANGAPGSGAPGGHGGWLFGGGGTGGSGVSGGAGGDGGAGGILFGAGGAGGAGGAVTGT 460
N + GG G GGG + GSG S GG G GG G G
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69

Query: 461 GATGGSGGAGGGALLFGAGGAGGAGGSSGIGGFAAGGAGGPG 502
+ GGSG G + A A + G + GAGG
Sbjct: 70 NSGGGSGTGGNLS-------AVAAPVAFGFPALSTPGAGGLA 104



Score = 30.1 bits (67), Expect = 0.036
Identities = 30/108 (27%), Positives = 39/108 (36%), Gaps = 5/108 (4%)

Query: 414 GHGGWLFGGGGTGGSGVSGGAGGDGGAGGILFGAGGAGGAGGAVTGTGATGGSGGAGGGA 473
GH G G +G G G + G + + GG+ +G GGSG GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 474 LLFGAGGAGGAGGSSGIG-----GFAAGGAGGPGGAGGLFNGGGAGGA 516
GG+G G S + GF A G GG + G A
Sbjct: 68 NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv0834c   cloacin  39     1e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.5 bits (89), Expect = 1e-04
Identities = 37/98 (37%), Positives = 42/98 (42%), Gaps = 1/98 (1%)

Query: 498 GGNGGAGGSGTGLLGGVGGAGGHGGGASVGTGGSGGAGGDGFGFVGAGGNGGNAGTGVGV 557
G N GA + + GG G G GGGAS G+G S G G GG +G G G
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGV-GGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGG 66

Query: 558 NGANGGNGGSATGALAAVGGAGAAGGDATSGTGGFGGA 595
N G G G L+AV A G A S G G A
Sbjct: 67 GNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 38.2 bits (88), Expect = 1e-04
Identities = 35/114 (30%), Positives = 40/114 (35%), Gaps = 4/114 (3%)

Query: 256 AGGAATTGTGGAGGAGSNALGLFLGLGGSGGQGGDSAMGSGGAGGAGGSGGAASPFGIDI 315
+GG GA N G GLG GG S S GGSG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGG-- 59

Query: 316 GIGGAGGHGGAGTNGGAGGAGGAGGSSGTVFALDLSWGGAGGNGGAATTGTGGA 369
G G GG G +GG G GG + A G GG A + + GA
Sbjct: 60 --SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 36.6 bits (84), Expect = 4e-04
Identities = 44/122 (36%), Positives = 50/122 (40%), Gaps = 17/122 (13%)

Query: 150 GSGAPGQTGGAGGAAGLLGHGGTGGAGGTGASGGKGGTGGWLWGSGGAGGAGGSGGGSGG 209
G G GA +G + G TG G GAS G G W S GGSG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSG------WSSENNPWGGGSGSGIHW 56

Query: 210 AGGNALMFGIGGNGGAGGAASGVGNGGVGGAGGAGGALVAIG----GAGGAGGAATTGTG 265
GG+ G G G SG G+G GG A A VA G GAGG A + +
Sbjct: 57 GGGS------GHGNGGGNGNSGGGSGT-GGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109

Query: 266 GA 267
GA
Sbjct: 110 GA 111



Score = 36.2 bits (83), Expect = 7e-04
Identities = 37/125 (29%), Positives = 49/125 (39%), Gaps = 16/125 (12%)

Query: 174 GAGGTGASGGKGGTGGWLWGSGGAGGAGGSGGGSGGAGGNALMFGIGGNGGAGGAASGVG 233
G G G + G T G + +GG G G GG S G+G + N GG+ SG+
Sbjct: 3 GGDGRGHNTGAHSTSGNI--NGGPTGLGVGGGASDGSG-----WSSENNPWGGGSGSGIH 55

Query: 234 NGGVGGAGGAGGALVAIGGAGGAGGAATTGTGGAGGAGSNALGLFLGLGGSGGQGGDSAM 293
GG G G GG +GTGG A + + + G GG +
Sbjct: 56 WGGGSGHGNGGG---------NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVS 106

Query: 294 GSGGA 298
S GA
Sbjct: 107 ISAGA 111



Score = 34.7 bits (79), Expect = 0.002
Identities = 35/104 (33%), Positives = 40/104 (38%), Gaps = 7/104 (6%)

Query: 353 GGAGGNGGAATTGTGGAGGTGGFAVAPDFIGFGAAYGGAGGLGGAATGAGGTGGTGGVGA 412
GA G G G G GG A D G+ + GG G+ GG G G G
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGG---ASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 413 GGFAALGVGVGGAGGAGGAATETG----GIGGAGGLGVGLLGGA 452
G + G G GG A A G GAGGL V + GA
Sbjct: 68 NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 34.7 bits (79), Expect = 0.002
Identities = 32/120 (26%), Positives = 44/120 (36%), Gaps = 2/120 (1%)

Query: 295 SGGAGGAGGSGGAASPFGIDIGIGGAGGHGGAGTNGGAGGAGG--AGGSSGTVFALDLSW 352
SGG G +G ++ I+ G G G GGA G GGS + S
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 353 GGAGGNGGAATTGTGGAGGTGGFAVAPDFIGFGAAYGGAGGLGGAATGAGGTGGTGGVGA 412
G GG G + G+G G A F + GAGGL + + + + A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMA 121



Score = 33.1 bits (75), Expect = 0.005
Identities = 32/112 (28%), Positives = 38/112 (33%), Gaps = 2/112 (1%)

Query: 131 GTGEAGGPGGWLLGNGGNGGSGAPGQTGGAGGAAGLLGHGGTGGAGGTGASGGKGGTGGW 190
G G G NGG G GGA +G G G GG+G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 191 LWGSGGAGGAGGSGGGSGGAGGNALMFGIGG--NGGAGGAASGVGNGGVGGA 240
G G G G GG+ A + FG GAGG A + G + A
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 32.8 bits (74), Expect = 0.007
Identities = 38/123 (30%), Positives = 46/123 (37%), Gaps = 8/123 (6%)

Query: 216 MFGIGGNGGAGGAASGVGNGGVGGAGGAGGALVAIGGAGGAGGAATTGTGGAGGAGSNAL 275
M G G G GA S GN G G G GGA G ++ GG+GS
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVG-----GGASDGSGWSSENNPWGGGSGS--- 52

Query: 276 GLFLGLGGSGGQGGDSAMGSGGAGGAGGSGGAASPFGIDIGIGGAGGHGGAGTNGGAGGA 335
G+ G G G GG + GG+G G A+P G GG + AG
Sbjct: 53 GIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGAL 112

Query: 336 GGA 338
A
Sbjct: 113 SAA 115



Score = 32.8 bits (74), Expect = 0.007
Identities = 25/85 (29%), Positives = 34/85 (40%), Gaps = 5/85 (5%)

Query: 449 LGGAGGAGGPGGAASAGSGGHGGTGGDALGLIGAGIGGVGGVGGAATDTGGNGGAGGSGT 508
+ G G G GA S +GG G +G G G G G ++ + GG+G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTG-----LGVGGGASDGSGWSSENNPWGGGSGSGIH 55

Query: 509 GLLGGVGGAGGHGGGASVGTGGSGG 533
G G GG G + G+G G
Sbjct: 56 WGGGSGHGNGGGNGNSGGGSGTGGN 80



Score = 32.4 bits (73), Expect = 0.009
Identities = 30/95 (31%), Positives = 35/95 (36%), Gaps = 1/95 (1%)

Query: 489 GVGGAATDTGGNGGAGGSGTGLLGGVGGAGGHGGGASVGTGGSG-GAGGDGFGFVGAGGN 547
G A T GN G +G G+ GG G + GGSG G G G GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 548 GGNAGTGVGVNGANGGNGGSATGALAAVGGAGAAG 582
GN+G G G G A+ GA G
Sbjct: 68 NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 31.6 bits (71), Expect = 0.015
Identities = 33/114 (28%), Positives = 45/114 (39%), Gaps = 6/114 (5%)

Query: 654 SGATGGAGGDGVFEGIAVLGLGFGGAAGAGGAATGDG-ATGGAGGFGGAGAGIANFLGFS 712
SG G G + G G GGA+ G G ++ GG+G+GI G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGG-- 59

Query: 713 VLHGGAGGAGGTATGTGGNGGAGGGGGLSSPVILGIGIGGAGGDGGGALGVLGG 766
G G GG GG+G G +++PV G G GG A+ + G
Sbjct: 60 ---SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 30.5 bits (68), Expect = 0.035
Identities = 36/120 (30%), Positives = 45/120 (37%), Gaps = 11/120 (9%)

Query: 503 AGGSGTGLLGGVGGAGGHGGGASVGTGGSGGAGGDGFGFVGAGGNGGNAGTGVGVNGANG 562
+GG G G G G+ G G G GGA GG +G+G+ G +G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 563 GNGGSATGALAAVGGAGAAGGDATSGTGGFGGAGGSARGLIFALGGAGAAGGDASTGVGG 622
G GG G +GG SGTGG A + F AGG A + G
Sbjct: 62 HGNG---------GGNGNSGG--GSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 30.5 bits (68), Expect = 0.037
Identities = 36/122 (29%), Positives = 45/122 (36%), Gaps = 10/122 (8%)

Query: 543 GAGGNGGNAGTGVGVNGANGGNGGSATGALAAVGGAGAAGGDATSGTGGFGGAGGSARGL 602
G G N G T +NG G G VGG + G +S +GG GS
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLG---------VGGGASDGSGWSSENNPWGGGSGSGIHW 56

Query: 603 IFALGGAGAAGGDASTGVGGPGGPGGTGTASSPFGI-AIAIGGAGAQGGAGTSGATGGAG 661
G G S G G GG A FG A++ GAG + ++GA A
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAI 116

Query: 662 GD 663
D
Sbjct: 117 AD 118



Score = 30.1 bits (67), Expect = 0.040
Identities = 24/70 (34%), Positives = 30/70 (42%)

Query: 402 GGTGGTGGVGAGGFAALGVGVGGAGGAGGAATETGGIGGAGGLGVGLLGGAGGAGGPGGA 461
G T G GG LGVG G + G+G ++ GG+G G G GG G
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGN 70

Query: 462 ASAGSGGHGG 471
+ GSG G
Sbjct: 71 SGGGSGTGGN 80



Score = 30.1 bits (67), Expect = 0.043
Identities = 27/90 (30%), Positives = 35/90 (38%), Gaps = 1/90 (1%)

Query: 194 SGGAGGAGGSGGGSGGAGGNALMFGIGGNGGAG-GAASGVGNGGVGGAGGAGGALVAIGG 252
SGG G +G S N G+G GGA G+ N GG G+G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 253 AGGAGGAATTGTGGAGGAGSNALGLFLGLG 282
G GG +G G G +A+ + G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 30.1 bits (67), Expect = 0.049
Identities = 31/103 (30%), Positives = 39/103 (37%), Gaps = 4/103 (3%)

Query: 753 AGGDGGGALGVLGGMGGDGGDGGEAVAVGIAVGGAGGAGGAAPTGNGGAGGNGGDALGLV 812
+GGDG G G G G +G+ G + G+G ++ G G G G
Sbjct: 2 SGGDGRGHNT--GAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 813 GVGGNGGNAGTGFGANTGGNGGDT--TIVVNGMLAPSTLGYGG 853
GNGG G G + G V G A ST G GG
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102


11  Rv0864  Rv0878c  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv0864    -2      16       3.245142   Probable molybdenum cofactor biosynthesis
Rv0865    -2      16       3.034645   Probable molybdopterin biosynthesis Mog protein
Rv0866    -1      15       3.210468   Probable molybdenum cofactor biosynthesis
Rv0867c   2       16       5.457323   Possible resuscitation-promoting factor RpfA
Rv0868c   2       11       2.387318   Probable molybdenum cofactor biosynthesis
Rv0869c   0       8        2.867527   Probable molybdenum cofactor biosynthesis
Rv0870c   1       8        2.392795   Possible conserved integral membrane protein
Rv0871    0       8        2.200195   Probable cold shock-like protein B CspB
Rv0872c   0       7        2.181736   PE-PGRS family protein PE_PGRS15
Rv0873    -1      9        -0.207278  Probable acyl-CoA dehydrogenase FadE10
Rv0874c   -1      11       0.804647   Conserved hypothetical protein
Rv0875c   2       12       -0.056258  Possible conserved exported protein
Rv0876c   0       14       0.114460   Possible conserved transmembrane protein
Rv0877    1       16       0.468091   Conserved hypothetical protein
Rv0878c   2       15       0.668359   PPE family protein PPE13
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv0865    ARGDEIMINASE  28     0.019    Bacterial arginine deiminase signature.
		>ARGDEIMINASE#Bacterial arginine deiminase signature.

Length = 409

Score = 27.9 bits (62), Expect = 0.019
Identities = 8/44 (18%), Positives = 20/44 (45%)

Query: 31 WLEQHGFSSVQPQVVADGNPVGEALHDAVNAGVDVIITSGGTGI 74
++ + SS + + + + + L + +D+I +GG I
Sbjct: 298 YVLTYNPSSSKIHIKKEKARIKDVLSFYLGRKIDIIKCAGGDLI 341


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv0867c   PF03544  33     0.001    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 33.0 bits (75), Expect = 0.001
Identities = 22/100 (22%), Positives = 28/100 (28%)

Query: 221 DPAPPADLAPPAPADLAPPAPADLAPPAPADLAPPVELAVNDLPAPLGEPLPAAPAELAP 280
PA P + APADL PP P + P E P
Sbjct: 45 APAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPK 104

Query: 281 PADLAPASADLAPPAPADLAPPAPAELAPPAPADLAPPAA 320
P + P + P +P E PA + A
Sbjct: 105 PKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATA 144



Score = 31.9 bits (72), Expect = 0.003
Identities = 24/129 (18%), Positives = 31/129 (24%), Gaps = 15/129 (11%)

Query: 232 APADLAPPAPADLAPPAPADLAPPVELAVNDLPAPLGEPLPAAPAELAPPADLAPASADL 291
AP P + APADL PP +P P E P + P
Sbjct: 40 VIELPAPAQPISVTMVAPADLEPP----------QAVQPPPEPVVEPEPEPEPIPE---- 85

Query: 292 APPAPADLAPPAPAELAPPAPADLAPPAAVNEQTAPGDQPATAPGGPVGLATDLELPEPD 351
PP A + P P P + P + +P A
Sbjct: 86 -PPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATA 144

Query: 352 PQPADAPPP 360

Sbjct: 145 ATSKPVTSV 153



Score = 31.9 bits (72), Expect = 0.004
Identities = 31/133 (23%), Positives = 44/133 (33%), Gaps = 6/133 (4%)

Query: 181 DPAPPADLAPPAPADVAPPVELAVNDLPAPLGEPLPAAPADPAPPADLAPPAPADLAPPA 240
PA P + APAD+ PP AV P P+ EP P P PP AP + P
Sbjct: 45 APAQPISVTMVAPADLEPP--QAVQPPPEPVVEPEPEPEPIPEPP----KEAPVVIEKPK 98

Query: 241 PADLAPPAPADLAPPVELAVNDLPAPLGEPLPAAPAELAPPADLAPASADLAPPAPADLA 300
P P P + V + + P + A++ +
Sbjct: 99 PKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVASGPR 158

Query: 301 PPAPAELAPPAPA 313
+ + PA A
Sbjct: 159 ALSRNQPQYPARA 171



Score = 31.9 bits (72), Expect = 0.004
Identities = 29/132 (21%), Positives = 39/132 (29%), Gaps = 2/132 (1%)

Query: 147 APLAPPPADPAPPVELAANDLPAPL-GEPLPAAPADPAPPADLAPPAPADVAPPVELAVN 205
P PA P +A DL P +P P +P P + P P + AP V
Sbjct: 40 VIELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKE-APVVIEKPK 98

Query: 206 DLPAPLGEPLPAAPADPAPPADLAPPAPADLAPPAPADLAPPAPADLAPPVELAVNDLPA 265
P P +P+ + + APA +V P
Sbjct: 99 PKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVASGPR 158

Query: 266 PLGEPLPAAPAE 277
L P PA
Sbjct: 159 ALSRNQPQYPAR 170



Score = 28.8 bits (64), Expect = 0.037
Identities = 22/130 (16%), Positives = 34/130 (26%), Gaps = 2/130 (1%)

Query: 200 VELAVNDLPAPLGEP--LPAAPADPAPPADLAPPAPADLAPPAPADLAPPAPADLAPPVE 257
V +LPAP APAD PP + PP + P + P P + +E
Sbjct: 36 SVHQVIELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIE 95

Query: 258 LAVNDLPAPLGEPLPAAPAELAPPADLAPASADLAPPAPADLAPPAPAELAPPAPADLAP 317
+ + ++ APA +A
Sbjct: 96 KPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVAS 155

Query: 318 PAAVNEQTAP 327
+ P
Sbjct: 156 GPRALSRNQP 165



Score = 28.4 bits (63), Expect = 0.042
Identities = 26/121 (21%), Positives = 32/121 (26%), Gaps = 7/121 (5%)

Query: 145 EPAPLAPPPADPAPPVELAANDLPAPLGEPLPAAPADPAPPADLAPPAPADVAPPVELAV 204
PA L PP A PP E P P P P A P P
Sbjct: 56 APADLEPPQAVQPPP-EPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVK----- 109

Query: 205 NDLPAPLGEPLPAAPADPAPPADLAPPAPADLAPPAPADLAPPAPADLAPPVELAVNDLP 264
+ P + P +P + AP P A + A + P
Sbjct: 110 -KVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRNQPQYP 168

Query: 265 A 265
A
Sbjct: 169 A 169


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv0872c   cloacin  37     1e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 37.4 bits (86), Expect = 1e-04
Identities = 30/85 (35%), Positives = 40/85 (47%)

Query: 427 AGGDGGAANTDSAGSSRKAFGGDGGVGGDGASALGTGGEGGIGGQGGNGGAGGLLIGNGG 486
+GGDG NT + +S GG G+G G ++ G+G GG G+G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 487 AGGVGGTAGAGGTGGSGGAGGAGGA 511
G GG +GG G+GG A A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 36.6 bits (84), Expect = 3e-04
Identities = 30/88 (34%), Positives = 34/88 (38%), Gaps = 4/88 (4%)

Query: 486 GAGGVGGTAGAGGTGGSGGAGGAGGAGGGGTNSGPGAAFGGNGNTGGNGGNGGAPGALGG 545
G G G GA T G+ G G GGG + G G + N GG+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 546 KGGSGGLIGRAGSDGGVGAGGAGGAGGA 573
G G S GG G GG A A
Sbjct: 63 GNGGGN----GNSGGGSGTGGNLSAVAA 86



Score = 36.2 bits (83), Expect = 4e-04
Identities = 32/86 (37%), Positives = 37/86 (43%), Gaps = 2/86 (2%)

Query: 461 GTGGEGGIGGQGGN--GGAGGLLIGNGGAGGVGGTAGAGGTGGSGGAGGAGGAGGGGTNS 518
G G G GN GG GL +G G + G G ++ GG G+G G G G N
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 519 GPGAAFGGNGNTGGNGGNGGAPGALG 544
G GG TGGN AP A G
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 33.5 bits (76), Expect = 0.003
Identities = 26/67 (38%), Positives = 28/67 (41%)

Query: 368 NGGSGGNGFDSFASGGTGGAGGTGGAGGRGGLLIGDGGAGGAGGVGGTGGSGAPGGGGGA 427
NGG G G AS G+G + GG G I GG G G GG G SG G GG
Sbjct: 21 NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGN 80

Query: 428 GGDGGAA 434
A
Sbjct: 81 LSAVAAP 87



Score = 33.1 bits (75), Expect = 0.003
Identities = 29/96 (30%), Positives = 36/96 (37%), Gaps = 9/96 (9%)

Query: 484 NGGAGGVGGTAGAGGTGGSGGAGGAGGAG--------GGGTNSGPGAAFGGNGNTGGNGG 535
N GA G G TG G G + G+G GGG+ SG GG+G+ G G
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHW-GGGSGHGNGGGN 68

Query: 536 NGGAPGALGGKGGSGGLIGRAGSDGGVGAGGAGGAG 571
G+ G S A + GAGG
Sbjct: 69 GNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.4 bits (73), Expect = 0.006
Identities = 29/94 (30%), Positives = 35/94 (37%), Gaps = 1/94 (1%)

Query: 510 GAGGGGTNSGPGAAFGG-NGNTGGNGGNGGAPGALGGKGGSGGLIGRAGSDGGVGAGGAG 568
G G G N+G + G NG G G GGA G + G +GS G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 569 GAGGAGGTGGEGGTGGDGKTTDGNPGMGGSPGSA 602
G GG G G G G + P G P +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALS 96



Score = 31.6 bits (71), Expect = 0.009
Identities = 25/76 (32%), Positives = 26/76 (34%)

Query: 404 GGAGGAGGVGGTGGSGAPGGGGGAGGDGGAANTDSAGSSRKAFGGDGGVGGDGASALGTG 463
GG G G SG GG G GG A+ S SS G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 464 GEGGIGGQGGNGGAGG 479
G GG G G G G
Sbjct: 63 GNGGGNGNSGGGSGTG 78



Score = 31.6 bits (71), Expect = 0.011
Identities = 37/119 (31%), Positives = 47/119 (39%), Gaps = 9/119 (7%)

Query: 146 GSGGVGQAGGAGGSAGLIGIGGTGGAGGAGAVGGVGGNGGWLYGNGGAGGLGGTGVAGVN 205
G G G GA ++G I GG G G GG GW N GG G+G+
Sbjct: 3 GGDGRGHNTGAHSTSGNI----NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 206 GGMGAAGGAGGNAYLFGSGGAGGQGGMGAAGADGVNPTPTGTADAGSTGTDQTLGGNAI 264
G GG GN SGG G GG +A A V + G+ G ++ A+
Sbjct: 59 GSGHGNGGGNGN-----SGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGAL 112



Score = 30.8 bits (69), Expect = 0.019
Identities = 27/91 (29%), Positives = 36/91 (39%), Gaps = 9/91 (9%)

Query: 250 AGSTGTDQTLGGNAIGGNGGPGDAGDAMTSGGAGGSGGNAVSTVNGDAVGGEGGKGGEGA 309
+G G G ++ GN G G + G + GSG ++ + G G GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 310 YGGAGGAGGSAASIGNAAIGGNGGAGGNAQA 340
+G GG G S GG G GGN A
Sbjct: 62 HGNGGGNGNS---------GGGSGTGGNLSA 83


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
Rv0876c   RTXTOXINA  30     0.034    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 29.9 bits (67), Expect = 0.034
Identities = 17/48 (35%), Positives = 25/48 (52%), Gaps = 4/48 (8%)

Query: 416 TVLVTVLA-IAAAVAGSLAATAIATL---ITAGSSAIAKASLDASLQH 459
TVL +V + I+AA SL ++ L +T S I +AS A +H
Sbjct: 373 TVLASVSSGISAAATTSLVGAPVSALVGAVTGIISGILEASKQAMFEH 420


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv0878c   cloacin  35     4e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 35.5 bits (81), Expect = 4e-04
Identities = 29/103 (28%), Positives = 39/103 (37%), Gaps = 25/103 (24%)

Query: 263 GNGNDGNTNFGSGNAGFLNIGSGNEGSGNLGFGNAGDDNTGWGNSGDTNTGGFNSGDLNT 322
G G++ + SGN N G LG G D +GW + + GG SG
Sbjct: 6 GRGHNTGAHSTSGNI--------NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGI--- 54

Query: 323 GIGSPVTQGVANSGFGNTGTGHSGFFNSGNSGSGFQNLGNGSS 365
G +G G+ G +GNSG G GN S+
Sbjct: 55 ------------HWGGGSGHGNGG--GNGNSGGGSGTGGNLSA 83



Score = 30.1 bits (67), Expect = 0.022
Identities = 27/75 (36%), Positives = 33/75 (44%), Gaps = 1/75 (1%)

Query: 243 GNGNIGNANLGSGNAGFFNFGNGNDGNTNFGSGNAGFLNIGSGNEGSGNLGFGNAGDDNT 302
G G+ A+ SGN G G G + GSG + N G GSG G +G N
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 303 GW-GNSGDTNTGGFN 316
G GNSG + G N
Sbjct: 66 GGNGNSGGGSGTGGN 80
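
Each alignment block above opens with a `Score = ... Expect = ...` line and an `Identities = ...` line. As a minimal sketch (not part of PredictBias itself), the two lines can be parsed and checked against the E-value threshold listed under the input parameters (0.05). The function names, the dictionary keys, and the choice of `<=` for the cutoff are assumptions made for illustration.

```python
import re

# Hypothetical helpers for reading the two summary lines that precede
# each alignment block in this report. The patterns match the layout
# seen here, e.g. "Score = 35.5 bits (81), Expect = 4e-04".
HSP_SCORE = re.compile(r"Score = ([\d.]+) bits \((\d+)\), Expect = ([\d.eE+-]+)")
HSP_IDENT = re.compile(r"Identities = (\d+)/(\d+) \((\d+)%\)")


def parse_hsp(score_line, ident_line):
    """Return bit score, raw score, E-value and identity counts for one block."""
    s = HSP_SCORE.search(score_line)
    i = HSP_IDENT.search(ident_line)
    if s is None or i is None:
        raise ValueError("lines do not look like an alignment summary")
    matches, length = int(i.group(1)), int(i.group(2))
    return {
        "bits": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "evalue": float(s.group(3)),
        "identities": matches,
        "align_len": length,
        "identity_frac": matches / length,  # fraction of identical positions
    }


def passes_cutoff(hsp, max_evalue=0.05):
    """Assumed rule: keep a hit when its E-value does not exceed the threshold."""
    return hsp["evalue"] <= max_evalue


if __name__ == "__main__":
    hsp = parse_hsp(
        "Score = 35.5 bits (81), Expect = 4e-04",
        "Identities = 29/103 (28%), Positives = 39/103 (37%), Gaps = 25/103 (24%)",
    )
    print(hsp, passes_cutoff(hsp))
```

The identity fraction is recomputed from the raw counts rather than read from the printed percentage, because the report rounds that percentage to a whole number.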


12  Rv0973c  Rv0991c  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
Rv0973c0193.614333Probable acetyl-/propionyl-coenzyme A
Rv0974c2203.965706Probable acetyl-/propionyl-CoA carboxylase (beta
Rv0975c4194.729919Probable acyl-CoA dehydrogenase FadE13
Rv0976c5195.301077Conserved hypothetical protein
Rv09776164.980558PE-PGRS family protein PE_PGRS16
Rv0978c2132.246956PE-PGRS family protein PE_PGRS17
Rv0979c091.350047Hypothetical protein
Rv0979A091.26152850S ribosomal protein L32 RpmF
Rv0980c0100.752781PE-PGRS family protein PE_PGRS18
Rv0981-212-2.279486Mycobacterial persistence regulator MRPA (two
Rv0982124-5.844563Two component sensor kinase MprB
Rv0983333-8.076687Probable serine protease PepD (serine
Rv0984135-8.957863Possible pterin-4-alpha-carbinolamine
Rv0985c235-8.565029Possible large-conductance ion mechanosensitive
Rv0986335-8.479391Probable adhesion component transport
Rv0987230-6.685572Probable adhesion component transport
Rv0988-117-2.481985Possible conserved exported protein
Rv0989c2101.231319Probable polyprenyl-diphosphate synthase GrcC2
Rv0990c2131.906403Hypothetical protein
Rv0991c2131.682765Conserved serine rich protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
Rv0973c   RTXTOXIND  32     0.009    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 31.7 bits (72), Expect = 0.009
Identities = 7/34 (20%), Positives = 15/34 (44%)

Query: 625 TIAAPADGVLTHVSVNTGQQVEVGAILARVEAPQ 658
I + ++ + V G+ V G +L ++ A
Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTALG 131


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv0977    cloacin  44     2e-06    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 44.3 bits (104), Expect = 2e-06
Identities = 35/101 (34%), Positives = 43/101 (42%), Gaps = 1/101 (0%)

Query: 468 AGGDGGQGDIGFDGGRGG-DGGPGGGGGAGGDGSGTFNAQANNGGDGGAGGVGGAGGTGG 526
+GGDG + G G +GGP G G GG G+ + NN GG+G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 527 TGGVGADGGRGGDSGRGGDGGNAGHGGAAQFSGRGAYGGEG 567
G G +G GG SG GG+ A F G G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 43.9 bits (103), Expect = 3e-06
Identities = 37/108 (34%), Positives = 46/108 (42%), Gaps = 7/108 (6%)

Query: 357 GGAGAGGDGGAGGIGGTGGNGSIGGAAGNGGNGGRGGA------GGMATAGSDGGNGGGG 410
GG G G + GA G G G G G + G G + GG + +G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 411 GNGGVGVGSAGGAGGTGGDGGAAGAGGAPGHGYFQQPAPQGLPIGTGG 458
GNGG G++GG GTGG+ A A A G P GL +
Sbjct: 63 GNGGGN-GNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109



Score = 42.0 bits (98), Expect = 9e-06
Identities = 30/82 (36%), Positives = 36/82 (43%), Gaps = 2/82 (2%)

Query: 304 GIGEQGGQGGDGGA--GGAGGIGGSAGGIGGSQGAGGHGGDGGQGGAGGSGGVGGGGAGA 361
G G G G GG G+G G GS + + GG G+G G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 362 GGDGGAGGIGGTGGNGSIGGAA 383
GG+G +GG GTGGN S A
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAP 87



Score = 41.2 bits (96), Expect = 2e-05
Identities = 33/99 (33%), Positives = 42/99 (42%), Gaps = 1/99 (1%)

Query: 158 GRGGDAGLFGHGGHGGVGGPGIAGAAGTAGLPGGNGANGGSGGIGGAGGAGGNGGLLFGN 217
GRG + G G+ G G+ G + G + N GG G+G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 218 GGAGGQGGSGGLGGSGGTGGAGMAAG-PAGGTGGIGGIG 255
GG G GG G GG+ A +A G PA T G GG+
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 41.2 bits (96), Expect = 2e-05
Identities = 32/79 (40%), Positives = 36/79 (45%), Gaps = 1/79 (1%)

Query: 312 GGDGGAGGAGGIGGSAGGIGGSQGAGGHGGDGGQGGAGGSGGVGGGGAGAG-GDGGAGGI 370
GGDG G S GG G G GG G GGG+G+G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 371 GGTGGNGSIGGAAGNGGNG 389
G GGNG+ GG +G GGN
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 40.5 bits (94), Expect = 3e-05
Identities = 30/104 (28%), Positives = 33/104 (31%)

Query: 330 IGGSQGAGGHGGDGGQGGAGGSGGVGGGGAGAGGDGGAGGIGGTGGNGSIGGAAGNGGNG 389
+ G G G + G G G G G G DG G G GG
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 390 GRGGAGGMATAGSDGGNGGGGGNGGVGVGSAGGAGGTGGDGGAA 433
G G GG +G G GG V A T G GG A
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 40.1 bits (93), Expect = 4e-05
Identities = 27/80 (33%), Positives = 34/80 (42%)

Query: 538 GDSGRGGDGGNAGHGGAAQFSGRGAYGGEGGSGGAGGNAGGAGTGGTAGSGGAGGFGGNG 597
G GRG + G G G G G S G+G ++ GG +GSG G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 598 ADGGNGGNGGNGGFGGINGT 617
+GG GN G G G N +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLS 82



Score = 39.3 bits (91), Expect = 7e-05
Identities = 36/104 (34%), Positives = 45/104 (43%), Gaps = 3/104 (2%)

Query: 257 IGGAGGVGGHGSALFGHGGINGDGGTGGMGGQGGAGGNGWAAEGITVGIGEQGGQGGDGG 316
+ G G G + A G ING G TG G G + G+GW++E G G G GG
Sbjct: 1 MSGGDGRGHNTGAHSTSGNING-GPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 317 AGGAGGIGGSAGGIGGSQGAGGHGGDGGQGGAGGSGGVGGGGAG 360
+G GG G GG G GG+ A G + GAG
Sbjct: 60 SGHGN--GGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAG 101



Score = 37.4 bits (86), Expect = 3e-04
Identities = 27/90 (30%), Positives = 35/90 (38%)

Query: 558 SGRGAYGGEGGSGGAGGNAGGAGTGGTAGSGGAGGFGGNGADGGNGGNGGNGGFGGINGT 617
SG G G+ GN G TG G G + G G + + GG G+G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 618 FGTNGAGGTGGLGTLLGGHNGNIGLNGATG 647
G G G G G+ GG+ + A G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 36.6 bits (84), Expect = 5e-04
Identities = 35/113 (30%), Positives = 45/113 (39%), Gaps = 14/113 (12%)

Query: 492 GGGAGGDGSGTFNAQANNGGDGGAGGVGGAGGTGGTGGVGADGGRGGDSGRG-GDGGNAG 550
GG G +G + N G GVGG G + G+G + GG SG G GG +G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGG-GASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 551 HGGAAQFSGRGAYGGEGGSGGAGGNAGGAGTGGTAGSGGAGGFGGNGADGGNG 603
HG GG+G +GG +G G + A GF G G
Sbjct: 62 HGNG------------GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 35.8 bits (82), Expect = 9e-04
Identities = 28/83 (33%), Positives = 34/83 (40%), Gaps = 1/83 (1%)

Query: 195 NGGSGGIGGAGGAGGNGGLLFGNGGAGGQGGSG-GLGGSGGTGGAGMAAGPAGGTGGIGG 253
NGG G+G GGA G N GG GSG GG G G G GG+G G
Sbjct: 21 NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGN 80

Query: 254 IGGIGGAGGVGGHGSALFGHGGI 276
+ + G + G GG+
Sbjct: 81 LSAVAAPVAFGFPALSTPGAGGL 103



Score = 35.5 bits (81), Expect = 0.001
Identities = 31/80 (38%), Positives = 35/80 (43%), Gaps = 1/80 (1%)

Query: 183 AGTAGLPGGNGANGGSGGI-GGAGGAGGNGGLLFGNGGAGGQGGSGGLGGSGGTGGAGMA 241
+G G GA+ SG I GG G G GG G+G + GG GSG G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 242 AGPAGGTGGIGGIGGIGGAG 261
G GG G GG G GG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 35.5 bits (81), Expect = 0.001
Identities = 29/81 (35%), Positives = 33/81 (40%), Gaps = 6/81 (7%)

Query: 422 GAGGTGGDGGAAGAGGAPGHGYFQQPAPQGLPIGTGGTGGEGGAGGAGGDGGQGDIGFDG 481
G G G + GA G G P GL +G G + G G + GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGG------PTGLGVGGGASDGSGWSSENNPWGGGSGSGIHW 56

Query: 482 GRGGDGGPGGGGGAGGDGSGT 502
G G G GGG G G GSGT
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGT 77



Score = 33.5 bits (76), Expect = 0.004
Identities = 23/77 (29%), Positives = 27/77 (35%)

Query: 455 GTGGTGGEGGAGGAGGDGGQGDIGFDGGRGGDGGPGGGGGAGGDGSGTFNAQANNGGDGG 514
G G G GA G+ G G G G G G G G+ + GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 515 AGGVGGAGGTGGTGGVG 531
G G GG+G G
Sbjct: 63 GNGGGNGNSGGGSGTGG 79



Score = 32.4 bits (73), Expect = 0.009
Identities = 26/81 (32%), Positives = 30/81 (37%), Gaps = 1/81 (1%)

Query: 201 IGGAGGAGGNGGLLFGNGGA-GGQGGSGGLGGSGGTGGAGMAAGPAGGTGGIGGIGGIGG 259
+ G G G N G +G GG G G GG+ G P GG G G G G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 260 AGGVGGHGSALFGHGGINGDG 280
G GG G G G+
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNL 81



Score = 30.8 bits (69), Expect = 0.025
Identities = 27/74 (36%), Positives = 32/74 (43%), Gaps = 11/74 (14%)

Query: 134 NGGAGGILWGNGGNGGSGAPGQPGGRGGDAGLFGHGGHGGVGGPGIAGAAGTAGLPGGNG 193
NGG G+ G G + GSG + GG +G H G G G G GGNG
Sbjct: 21 NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG-----------GGNG 69

Query: 194 ANGGSGGIGGAGGA 207
+GG G GG A
Sbjct: 70 NSGGGSGTGGNLSA 83



Score = 30.5 bits (68), Expect = 0.037
Identities = 24/71 (33%), Positives = 28/71 (39%), Gaps = 7/71 (9%)

Query: 379 IGGAAGNGGNGGRGGAGGMATAGSDGGNGGGGGNGGVGVGSA----GGAGGTG---GDGG 431
+ G G G N G G G G GGG + G G S GG G+G G G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 432 AAGAGGAPGHG 442
G GG G+
Sbjct: 61 GHGNGGGNGNS 71


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv0978c   cloacin  37     8e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 37.0 bits (85), Expect = 8e-05
Identities = 28/90 (31%), Positives = 35/90 (38%), Gaps = 5/90 (5%)

Query: 122 NGIDGTGQAGGNGGWLWGNGGNGGSGAPGQAGGAGGAAGLIGNGGAGGTGGAVSLARAGT 181
+G DG G G + +G P G GGA+ G G S +
Sbjct: 2 SGGDGRGHNTGAH-----STSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHW 56

Query: 182 AGGAGRGPVGGIGGAGGVGGAGGAAGAVTT 211
GG+G G GG G +GG G GG AV
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 32.0 bits (72), Expect = 0.003
Identities = 24/78 (30%), Positives = 29/78 (37%), Gaps = 6/78 (7%)

Query: 117 IGDGANGIDGTGQA------GGNGGWLWGNGGNGGSGAPGQAGGAGGAAGLIGNGGAGGT 170
+G G DG+G + GG G GG G G G G +GG +G GN A
Sbjct: 27 LGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAA 86

Query: 171 GGAVSLARAGTAGGAGRG 188
A T G G
Sbjct: 87 PVAFGFPALSTPGAGGLA 104



Score = 31.6 bits (71), Expect = 0.005
Identities = 32/103 (31%), Positives = 36/103 (34%), Gaps = 4/103 (3%)

Query: 117 IGDGANGIDGTGQAGGNGGWLWGNGGNGGSGAPGQAGGAGGAAGLIGNGGAGG----TGG 172
I G G+ G A GW N GG G G G G G G G TGG
Sbjct: 20 INGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGG 79

Query: 173 AVSLARAGTAGGAGRGPVGGIGGAGGVGGAGGAAGAVTTITHA 215
+S A A G G GG AG + A+ I A
Sbjct: 80 NLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMAA 122


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv0980c   cloacin  42     4e-06    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 42.0 bits (98), Expect = 4e-06
Identities = 40/121 (33%), Positives = 49/121 (40%), Gaps = 1/121 (0%)

Query: 170 AGGQGLPFEAGANGGAGGAGGWLFGNGGAGGVGGAGGAGTTFGVAGGDGGTGGVGGHGGL 229
+GG G GA+ +G G G G GG G + GG G+G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 230 IGVGGHGGDGGTG-GTGGAVSLARAGTAGGAGGGPAGGIGGAGGVGGAGGAAGAVTTITH 288
G GG G+ G G GTGG +S A A G G GG AG + A+ I
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMA 121

Query: 289 A 289
A
Sbjct: 122 A 122



Score = 39.3 bits (91), Expect = 3e-05
Identities = 26/78 (33%), Positives = 33/78 (42%), Gaps = 1/78 (1%)

Query: 139 GNGGNGGSGAPGQAGGAGGAAGLIGNGGAGGAGGQGLPFEAGANGGAGGAGGWLFGNGGA 198
G+G +GA +G G +G GG G + G G E GG G+G G G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGG-GASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 199 GGVGGAGGAGTTFGVAGG 216
G GG G +G G G
Sbjct: 63 GNGGGNGNSGGGSGTGGN 80



Score = 33.9 bits (77), Expect = 0.001
Identities = 30/117 (25%), Positives = 35/117 (29%), Gaps = 11/117 (9%)

Query: 118 GDGANGIDGTGQAGGNGGWLWGNGGNGGSGAPGQAGGAGGAAGLIGNGGAGGAGGQGLPF 177
GDG G GN NGG G GGA +G G G
Sbjct: 4 GDGRGHNTGAHSTSGNI--------NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIH 55

Query: 178 EAGANGGAGGAGGWLFGNGGAGGVGGAGGAGTTFGVAGGDGGTGGVGGHGGLIGVGG 234
G +G G G GN G G G + VA G G G + +
Sbjct: 56 WGGGSGHGNGGGN---GNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109



Score = 33.1 bits (75), Expect = 0.003
Identities = 29/86 (33%), Positives = 35/86 (40%), Gaps = 13/86 (15%)

Query: 213 VAGGDGGTGGVGGH-------GGLIGVGGHGGDGGTGGTG------GAVSLARAGTAGGA 259
++GGDG G H GG G+G GG G G S + GG+
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 260 GGGPAGGIGGAGGVGGAGGAAGAVTT 285
G G GG G +GG G GG AV
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 29.3 bits (65), Expect = 0.033
Identities = 26/78 (33%), Positives = 33/78 (42%), Gaps = 5/78 (6%)

Query: 117 IGDGANGIDGTGQAGGNGGWLWGNGGN---GGSGAPGQAGGAGGAAGLIGNGGAGGAGGQ 173
+G G DG+G + N W G+G GG G GG G + G G GG A
Sbjct: 27 LGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAA 86

Query: 174 GLPF--EAGANGGAGGAG 189
+ F A + GAGG
Sbjct: 87 PVAFGFPALSTPGAGGLA 104


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
Rv0981    HTHFIS  103    9e-28    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 103 bits (258), Expect = 9e-28
Identities = 45/135 (33%), Positives = 71/135 (52%), Gaps = 1/135 (0%)

Query: 2 RILVVDDDRAVRESLRRSLSFNGYSVELAHDGVEALDMIASDRPDALVLDVMMPRLDGLE 61
ILV DDD A+R L ++LS GY V + + IA+ D +V DV+MP + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VCRQLRGTGDDLPILVLTARDSVSERVAGLDAGADDYLPKPFALEELLARM-RALLRRTK 120
+ +++ DLP+LV++A+++ + + GA DYLPKPF L EL+ + RAL +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 121 PEDAAESMAMRFSDL 135
E + L
Sbjct: 125 RPSKLEDDSQDGMPL 139


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
Rv0983    V8PROTEASE  57     4e-11    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 57.3 bits (138), Expect = 4e-11
Identities = 36/182 (19%), Positives = 72/182 (39%), Gaps = 27/182 (14%)

Query: 165 PSVVMLETDLGRQSEEGSGIILSAEGLILTNNHVIAAAAKPPLG---SPPPKTTVTFSDG 221
V ++ + + SG+++ + +LTN HV+ A P P + +G
Sbjct: 88 APVTYIQVEAPTGTFIASGVVV-GKDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNG 146

Query: 222 RTAPFTVVGADPTSDIAVVRV----QGVS---GLTPISLGSSSDLRVGQPVLAIGSPLGL 274
+ D+A+V+ Q + P ++ ++++ +V Q + G P
Sbjct: 147 GFTAEQITKYSGEGDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGDK 206

Query: 275 EGTVTTGIVSALNRPVSTTGEAGNQNTVLD--AIQTDAAINPGNSGGALVNMNAQLVGVN 332
PV+T E+ + T L A+Q D + GNSG + N +++G++
Sbjct: 207 --------------PVATMWESKGKITYLKGEAMQYDLSTTGGNSGSPVFNEKNEVIGIH 252

Query: 333 SA 334

Sbjct: 253 WG 254


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
Rv0985c   MECHCHANNEL  162    1e-54    Bacterial mechano-sensitive ion channel signature.
		>MECHCHANNEL#Bacterial mechano-sensitive ion channel signature.

Length = 136

Score = 162 bits (412), Expect = 1e-54
Identities = 47/134 (35%), Positives = 66/134 (49%), Gaps = 6/134 (4%)

Query: 1 MLKGFKEFLARGNIVDLAVAVVIGTAFTALVTKFTDSIITPLINR----IGVNAQSDVGI 56
++K F+EF RGN+VDLAV V+IG AF +V+ II P + I +
Sbjct: 3 IIKEFREFAMRGNVVDLAVGVIIGAAFGKIVSSLVADIIMPPLGLLIGGIDFKQFAVTLR 62

Query: 57 LRIGIGGGQTIDLNVLLSAAINFFLIAFAVYFLVVLPYNTLRKKGE-VEQPGDT-QVVLL 114
G + V + +F ++AFA++ + L RKK E P T + VLL
Sbjct: 63 DAQGDIPAVVMHYGVFIQNVFDFLIVAFAIFMAIKLINKLNRKKEEPAAAPAPTKEEVLL 122

Query: 115 TEIRDLLAQTNGDS 128
TEIRDLL + N S
Sbjct: 123 TEIRDLLKEQNNRS 136


13  Rv1034c  Rv1067c  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
Rv1034c2141.557172Probable transposase (fragment)
Rv1035c3141.821571Probable transposase (fragment)
Rv1036c3160.637562Probable IS1560 transposase (fragment)
Rv1037c2160.847283Putative ESAT-6 like protein EsxI (ESAT-6 like
Rv1038c0140.530340ESAT-6 like protein EsxJ (ESAT-6 like protein
Rv1039c0170.505617PPE family protein PPE15
Rv1040c021-0.914415PE family protein PE8
Rv1041c122-1.817680Probable is like-2 transposase
Rv1042c218-1.331050Probable is like-2 transposase
Rv1043c221-1.868367Conserved hypothetical protein
Rv1044220-1.440308Conserved hypothetical protein
Rv1045117-0.947886Hypothetical protein
Rv1046c116-0.350926Hypothetical protein
Rv1047116-0.567101Probable transposase
Rv1048c019-1.466406Hypothetical protein
Rv1049119-0.492613Probable transcriptional repressor protein
Rv1050220-1.354307Probable oxidoreductase
Rv1051c318-2.377993Conserved hypothetical protein
Rv1052516-1.646409Hypothetical protein
Rv1053c312-1.692731Hypothetical protein
Rv1054111-1.232833Probable integrase (fragment)
Rv1055011-1.191262Possible integrase (fragment)
Rv1056010-1.269929*Conserved protein
Rv1057010-0.744881Conserved hypothetical protein
Rv1058010-0.435297Probable medium chain fatty-acid-CoA ligase
Rv10592110.706550Conserved protein
Rv10600111.238346Unknown protein
Rv10610112.011165Conserved protein
Rv10624186.681685Conserved hypothetical protein
Rv1063c8237.968431Conserved hypothetical protein
Rv1064c6206.378571Possible lipoprotein LpqV
Rv10655185.670204Conserved hypothetical protein
Rv10665195.139772Conserved hypothetical protein
Rv1067c5174.150237PE-PGRS family protein PE_PGRS19
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
Rv1043c   V8PROTEASE  47     3e-08    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 47.3 bits (112), Expect = 3e-08
Identities = 39/164 (23%), Positives = 65/164 (39%), Gaps = 29/164 (17%)

Query: 145 GTGLVVDHNHVITNKHVVTGLAGTSAGLSVYPSSNHAEA---------ELVNFSGTAHPH 195
+G+VV + ++TNKHVV G L +PS+ + + ++ +SG
Sbjct: 104 ASGVVVGKDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGEG--- 160

Query: 196 PTLDVAVIKFEMPE-----GKYIPRLGGMAFRDPDWADEVYVFGYPRVPMTAEMAITVQR 250
D+A++KF E G+ + + + V GYP A M +
Sbjct: 161 ---DLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGDKPVATMW--ESK 215

Query: 251 GEVVNPAATTIPGRQKIFLYSAIARPGNSGGPIVAQDGRVIGLV 294
G++ T + G Y GNSG P+ + VIG+
Sbjct: 216 GKI-----TYLKGE--AMQYDLSTTGGNSGSPVFNEKNEVIGIH 252


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv1050    DHBDHDRGNASE  84     4e-21    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 83.9 bits (207), Expect = 4e-21
Identities = 53/187 (28%), Positives = 84/187 (44%), Gaps = 1/187 (0%)

Query: 9 QVVLITGASSGIGEATAKAFAREGAVVALAARREGALRRVAREIEAAGGRAMVAPLDVSS 68
++ ITGA+ GIGEA A+ A +GA +A L +V ++A A P DV
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 69 SESVRAMVADVVGEFGRIDVVFNNAGVSLVGPVDAETFLDDTREMLEIDYLGTVRVVREV 128
S ++ + A + E G ID++ N AGV G + ++ ++ G R V
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIH-SLSDEEWEATFSVNSTGVFNASRSV 127

Query: 129 LPIMKQQRSGRIMNMSSVVGRKAFARFAGYSSAMHAIAGFSDALRQELRGSGIAVSVIHP 188
M +RSG I+ + S A Y+S+ A F+ L EL I +++ P
Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187

Query: 189 ALTQTPL 195
T+T +
Sbjct: 188 GSTETDM 194


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv1067c   cloacin  37     2e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 37.4 bits (86), Expect = 2e-04
Identities = 36/102 (35%), Positives = 45/102 (44%), Gaps = 1/102 (0%)

Query: 303 GAGGRGGDGGSAGWLSG-NGGDAGNGGGGGTAGGAGNGGQFGGDGGTGGTGGTAGAGGNG 361
G GRG + G+ NGG G G GGG + G+G + GG G+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 362 GRGAVLFGHGGNAGHGGAGGNGAAAGAGGEHVVATAGKGGTG 403
G G GG +G GG AA A G ++T G GG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 37.4 bits (86), Expect = 2e-04
Identities = 33/82 (40%), Positives = 41/82 (50%), Gaps = 4/82 (4%)

Query: 512 TGNAGNGGNGG--SAARLFGGGGAGGAGGTGSTAGSGGSGGTNPPTGLQAAGGNGGSGHA 569
+G G G N G S + GG G G G++ GSG S NP G +G + G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 570 GGHGGNGGGAGLLGGGGTGGNG 591
G+GG G +G GG GTGGN
Sbjct: 62 HGNGGGNGNSG--GGSGTGGNL 81



Score = 37.0 bits (85), Expect = 3e-04
Identities = 30/89 (33%), Positives = 37/89 (41%), Gaps = 1/89 (1%)

Query: 475 GAGGSGGSAGAGGAGGKGGDTPNGLAINPGIG-GNGGDTGNAGNGGNGGSAARLFGGGGA 533
G G G + GA G P GL + G G+G + N GG GS GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 534 GGAGGTGSTAGSGGSGGTNPPTGLQAAGG 562
G GG G++ G G+GG A G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 36.6 bits (84), Expect = 3e-04
Identities = 37/101 (36%), Positives = 44/101 (43%), Gaps = 8/101 (7%)

Query: 119 GADGTAANPNGGAGGLLYGNGGNGFSQTTAGLTGGTGGSAGLIGNGGNGGAGGAGANGGA 178
GA T+ N NGG GL G G + S ++ GGS I GG G G G NG +
Sbjct: 12 GAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNS 71

Query: 179 GGNGGWLYGSGGNGGAGGAGPAGAIGAPGVAGGAGGAGGTA 219
GG GSG G A G P ++ GAGG A
Sbjct: 72 GG------GSGTGGNLSAVAAPVAFGFPALS--TPGAGGLA 104



Score = 35.1 bits (80), Expect = 0.001
Identities = 34/100 (34%), Positives = 40/100 (40%), Gaps = 3/100 (3%)

Query: 299 GGAGGAGGRGGDGGSAGWLSGNGGDAGNGGGGGTAGGAGNGGQFGGDGGTGGTGGTAGAG 358
GG G G GG +GW S N GGG G+ G G G GG G +GG +G G
Sbjct: 22 GGPTGLGVGGGASDGSGWSSENNP---WGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTG 78

Query: 359 GNGGRGAVLFGHGGNAGHGGAGGNGAAAGAGGEHVVATAG 398
GN A G A G A + + G A A
Sbjct: 79 GNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 34.3 bits (78), Expect = 0.002
Identities = 34/117 (29%), Positives = 39/117 (33%), Gaps = 8/117 (6%)

Query: 344 GDGGTGGTGGTAGAGGNGGRGAVLFGHGGNAGHGGAGGNGAAAGAGGEHVVATAGKGGTG 403
G G G G GN +GG G G GG +G E+ G G
Sbjct: 3 GGDGRGHNTGAHSTSGNI--------NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGI 54

Query: 404 GVGGDGGGGGAGGGGGLLYGNGGAGGAGNSGGDGGTGLNAALGGNGGGGGVGGNAGA 460
GG G G GG G G+G G G A GG V +AGA
Sbjct: 55 HWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 34.3 bits (78), Expect = 0.002
Identities = 31/77 (40%), Positives = 36/77 (46%)

Query: 529 GGGGAGGAGGTGSTAGSGGSGGTNPPTGLQAAGGNGGSGHAGGHGGNGGGAGLLGGGGTG 588
GG G G G ST+G+ G T G A+ G+G S GG G GGG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 589 GNGGGGGQGGLGAAAGG 605
GNGGG G G G+ GG
Sbjct: 63 GNGGGNGNSGGGSGTGG 79



Score = 33.9 bits (77), Expect = 0.002
Identities = 37/111 (33%), Positives = 45/111 (40%), Gaps = 11/111 (9%)

Query: 374 AGHGGAGGNGAAAGAGGEHVVATAGKGGTGGVGGDGGGGGA-----GGGGGLLYGNGGAG 428
+G G G N A G ++ G GG DG G + GGG G GG
Sbjct: 2 SGGDGRGHNTGAHSTSG-NINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 429 GAGNSGGDGGTGLNAALGGNGGGGGVGGNAGAGGTGGSAGWLSGNGGAGGS 479
G GN GG+G +G GG+G GG + A G A G GG S
Sbjct: 61 GHGNGGGNGNSG-----GGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVS 106



Score = 33.1 bits (75), Expect = 0.003
Identities = 26/91 (28%), Positives = 36/91 (39%)

Query: 555 TGLQAAGGNGGSGHAGGHGGNGGGAGLLGGGGTGGNGGGGGQGGLGAAAGGVDGNGGNGG 614
+G G N G+ G+ G +GGG + G+G G +G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 615 NGGKGGDAQLVGDGGNGGNGGKGGAGLIAGL 645
+G GG+ G G GGN A + G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGF 92



Score = 31.6 bits (71), Expect = 0.010
Identities = 26/74 (35%), Positives = 30/74 (40%), Gaps = 1/74 (1%)

Query: 153 GTGGSAGLIGNGGNGGAGGAGANGGAGGNGGWLYGSGGNGGAGGAGPA-GAIGAPGVAGG 211
G A NGG G G GGA GW + GG G+G G G GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 212 AGGAGGTAGLFGNG 225
G +GG +G GN
Sbjct: 68 NGNSGGGSGTGGNL 81



Score = 31.6 bits (71), Expect = 0.012
Identities = 35/120 (29%), Positives = 42/120 (35%), Gaps = 16/120 (13%)

Query: 426 GAGGAGNSGGDGGTGLNAALGGNGGGGGVGGNAGAGGTGGSAGWLSGNGGAGGSGGSAGA 485
G G G++ G T N NGG G+G GA G + + GG GSG G
Sbjct: 3 GGDGRGHNTGAHSTSGNI----NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 486 GGAGGKGGDTPNGLAINPGIGGNGGDTGNAGNGGNGGSAARLFGGGGAGGAGGTGSTAGS 545
G G GG G G+ G AA + G A G G A S
Sbjct: 59 GSGHGNGG------------GNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVS 106



Score = 31.2 bits (70), Expect = 0.014
Identities = 27/78 (34%), Positives = 32/78 (41%), Gaps = 1/78 (1%)

Query: 576 GGGAGLLGGGGTGGNGGGGGQGGLGAAAGGVDGNG-GNGGNGGKGGDAQLVGDGGNGGNG 634
G G G G + GG GLG G DG+G + N GG + GG G+G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 635 GKGGAGLIAGLDGAGGAG 652
GG G G G GG
Sbjct: 64 NGGGNGNSGGGSGTGGNL 81



Score = 30.8 bits (69), Expect = 0.020
Identities = 32/96 (33%), Positives = 39/96 (40%), Gaps = 10/96 (10%)

Query: 406 GGDGGGGGAGGGGGLLYGNGGAGGAG-NSGGDGGTGLNAALGGNGGGGGVGGNAGAGGTG 464
GGDG G G NGG G G G G+G ++ GGG G G + G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS-- 60

Query: 465 GSAGWLSGNGGAGGSGGSAGAGGAGGKGGDTPNGLA 500
G+G GG+G S G G GG +A
Sbjct: 61 -------GHGNGGGNGNSGGGSGTGGNLSAVAAPVA 89



Score = 30.8 bits (69), Expect = 0.022
Identities = 29/91 (31%), Positives = 35/91 (38%), Gaps = 3/91 (3%)

Query: 168 GAGGAGANGGAGGNGGWLYG-SGGNGGAGGAGPAGAIGAPGVAGGAGGAGGTAGLFGNGG 226
G G G N GA G + G G G GGA + G GG+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWG-GGSGSGIHWGGGSG 61

Query: 227 VGGVGGDGGQGGNGAGAGASGTKGGDAGAGG 257
G GG G G G+G G + + A G
Sbjct: 62 HGN-GGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 30.8 bits (69), Expect = 0.022
Identities = 31/97 (31%), Positives = 36/97 (37%), Gaps = 5/97 (5%)

Query: 210 GGAGGAGGTAGLFGNGGVGGVGGDGGQGGNGAGAGAS---GTKGGDAGAGGAGGAGGWIH 266
G GA T+G G G G G G+G + + G G GG G G
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 267 GHGGAGGDGGAGGAGGQASPGAPGPP--SQPGGAGGA 301
GG G G A+P A G P S PG G A
Sbjct: 68 NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 30.5 bits (68), Expect = 0.024
Identities = 31/113 (27%), Positives = 41/113 (36%), Gaps = 5/113 (4%)

Query: 228 GGVGGDGGQGGNGAGAGASGTKGGDAGAGGAGGAGGWIHGHGGAGGDGGAGGAGGQASPG 287
GG G G + +G G GGA GW + GG G+G G S
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 288 APGPPSQPGGAGGAGGAGGRGGDGGSAGWLSGNGGDAGNGGGGGTAGGAGNGG 340
G GG G +GG G GG+ + G A + G G + + G
Sbjct: 63 GNG-----GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 30.1 bits (67), Expect = 0.029
Identities = 23/84 (27%), Positives = 28/84 (33%)

Query: 253 AGAGGAGGAGGWIHGHGGAGGDGGAGGAGGQASPGAPGPPSQPGGAGGAGGAGGRGGDGG 312
+G G G G G G G GG AS G+ GG+G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 313 SAGWLSGNGGDAGNGGGGGTAGGA 336
G+G GG + A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVA 85


14  Rv1082  Rv1091  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
Rv10823155.513002Mycothiol conjugate amidase Mca (mycothiol
Rv10834135.884572Conserved hypothetical protein
Rv10844135.905886Conserved protein
Rv1085c7175.383973Possible hemolysin-like protein
Rv10868196.050968Short (C15) chain Z-isoprenyl diphosphate
Rv10878196.216310PE-PGRS family protein PE_PGRS21
Rv1087A13217.110871Conserved hypothetical protein
Rv108811185.728370PE family protein PE9
Rv10897185.098299PE family protein PE10
Rv1089A3174.448019Probable cellulase CelA2a
Rv10901153.877557Probable cellulase CelA2b
Rv10911143.697104PE-PGRS family protein PE_PGRS22
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv1087    cloacin  37     3e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 36.6 bits (84), Expect = 3e-04
Identities = 31/102 (30%), Positives = 39/102 (38%), Gaps = 1/102 (0%)

Query: 446 GDSGEGGFGGPGLAGGLFGNPGNGGVGGIGGDAAAGGAGGAGGNGGAGGNGGWLFGNGGA 505
GD G +G + G P GVGG D + + GG+G W G G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHW-GGGSGH 62

Query: 506 GGSGGDGGAAGRGGAGNLGSAGGINAPAGNPGSGSVGIGGAG 547
G GG+G + G G G SA G P + G GG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 36.6 bits (84), Expect = 4e-04
Identities = 30/80 (37%), Positives = 34/80 (42%), Gaps = 4/80 (5%)

Query: 587 GAMGGAGGVGGNARLLGTGGAGGVGGGGGAGGDGGRGGVATPGGQGGDAGDGGAGGAGGN 646
G GA GN GG G+G GGGA G P G G +G GG+G
Sbjct: 8 GHNTGAHSTSGNI----NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 647 GGGASGAGGWLLGTGGAGGA 666
GG +G G GTGG A
Sbjct: 64 NGGGNGNSGGGSGTGGNLSA 83



Score = 36.2 bits (83), Expect = 5e-04
Identities = 32/83 (38%), Positives = 36/83 (43%), Gaps = 4/83 (4%)

Query: 352 LSGGDGGA--GGAGGAGGA--GGTGGWLYGGGGAAGSGGDGGTGGQGGAGGAGVFSLFGS 407
+SGGDG GA G GG G GGG + GSG GG G+G+ GS
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 408 GGGPGGNGGVGGVGGVGGAGGRA 430
G G GG G G G G A
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSA 83



Score = 35.8 bits (82), Expect = 6e-04
Identities = 38/116 (32%), Positives = 44/116 (37%), Gaps = 3/116 (2%)

Query: 538 SGSVGIGGAGGAGGTAGLFGDGGAGGAGGAGAAGGFGGISAATPSAGSEGAMGGAGGVGG 597
SG G G GA T+G G G G GA+ G G S P G G+ GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 598 NARLLGTGGAGGVGGGGGAGGDGGRGGVATPGGQGGDAGDGGAGGAGGNGGGASGA 653
+ GG G GGG G GG+ G + G G A GA A
Sbjct: 62 HG---NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSA 114



Score = 35.8 bits (82), Expect = 6e-04
Identities = 35/103 (33%), Positives = 44/103 (42%), Gaps = 4/103 (3%)

Query: 482 GAGGAGGNGGAGGNGGWLFGNGGAGGSGGDGGAAGRGGAGNLGSAGGINAPAGNPGSGSV 541
G G G N GA G + NGG G G GGA+ G + + G + +G G
Sbjct: 3 GGDGRGHNTGAHSTSGNI--NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 542 GIGGAGGAGGTAGLFGDGGAGGAGGAGAAGGFGGISAATPSAG 584
G G GG G + G G G G A FG + +TP AG
Sbjct: 61 GHGNGGGNGNSGG--GSGTGGNLSAVAAPVAFGFPALSTPGAG 101



Score = 35.1 bits (80), Expect = 0.001
Identities = 29/88 (32%), Positives = 32/88 (36%)

Query: 258 GVGGAGGVGGAGGAGGWLYGDAGAGGDGGVGGAGGTGGLGNRGGAGGAGGAGGVGGAGGA 317
G G G GA G + G G GG G N GG+G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 318 AGLWGGGGAGGVGGTGGGAGLGAQSVTF 345
G G +GG GTGG A V F
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAF 90



Score = 34.7 bits (79), Expect = 0.001
Identities = 34/108 (31%), Positives = 43/108 (39%), Gaps = 5/108 (4%)

Query: 616 AGGDGGRGGVATPGGQGGDAGDGGAGGAGGNGGGASGAGGWLLGTGGAGGAGGNGGNGGK 675
+GGDG G T +GG G G GG + G+G GG+G GG
Sbjct: 2 SGGDGR--GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 676 AGFSPGPTNFGLNGAGGGGGVGGNGATGPWLFGDGGPTPGSTGAGAAG 723
+G G N +GGG G GGN + G P + GAG
Sbjct: 60 SGHGNGGGN---GNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 34.3 bits (78), Expect = 0.002
Identities = 38/112 (33%), Positives = 42/112 (37%), Gaps = 12/112 (10%)

Query: 304 GAGGAGGVGGAGGAAGLWGGGGAGGVGGTGGGAGLGAQSVTFSSSLSGLSGGDGGAGGAG 363
G G G GA +G GG G G G G SG S + GG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDG------------SGWSSENNPWGGGS 50

Query: 364 GAGGAGGTGGWLYGGGGAAGSGGDGGTGGQGGAGGAGVFSLFGSGGGPGGNG 415
G+G G G GGG SGG GTGG A A V F + PG G
Sbjct: 51 GSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 34.3 bits (78), Expect = 0.002
Identities = 29/95 (30%), Positives = 40/95 (42%), Gaps = 3/95 (3%)

Query: 669 NGGNGGKAGFSPGPTNFGLNGAGGGGGVGGNGATGPWLFGDGGPTPGSTGAGAAGGHGGD 728
+GG+G T+ +NG G GVGG + G + P G +G+G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 729 AQLIGNGGHGGAGGTGVPNGSGGAGGLSGLLFGEP 763
GNGG G G G G + + + FG P
Sbjct: 62 H---GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFP 93



Score = 33.9 bits (77), Expect = 0.002
Identities = 29/83 (34%), Positives = 34/83 (40%)

Query: 236 AGGAGVAGAGGFEGTIGAGGAGGVGGAGGVGGAGGAGGWLYGDAGAGGDGGVGGAGGTGG 295
+GG G G T G G G G G + G+G + GG G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 296 LGNRGGAGGAGGAGGVGGAGGAA 318
GN GG G +GG G GG A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAV 84



Score = 33.9 bits (77), Expect = 0.003
Identities = 40/112 (35%), Positives = 45/112 (40%), Gaps = 4/112 (3%)

Query: 157 GGNGGAAGLIGNGGFGGGGGPGAAGGNGGAGGWLFGNGGAGGAGGLGVAPGVPGGAGGAG 216
G N GA GN G G G + G+G W N GG G G+ G G G G
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSG-WSSENNPWGGGSGSGIHWGGGSGHGNGG 66

Query: 217 GAGGVGGPAGLWGHGGAGGAGGAGVAGAGGFEGTIGAGGAGGVGGAGGVGGA 268
G G GG + G GG A A VA T GAGG AG + A
Sbjct: 67 GNGNSGGGS---GTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 33.1 bits (75), Expect = 0.005
Identities = 38/105 (36%), Positives = 42/105 (40%), Gaps = 11/105 (10%)

Query: 123 GDGANGGPGQ-----DGGPGGLLYGNGGNGGTSTTVGMAGGNGGAAGLIGNGGFGGGGGP 177
G G N G +GGP GL G GG S G + N G G+G GGG
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGL----GVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 178 GAAGGNGGAGGWLFGNGGAGGAGGLGVAPGVPGGAGGAGGAGGVG 222
GG G G G GG A VA G P A GAGG+
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFP--ALSTPGAGGLA 104


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv1091    cloacin  39     8e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.9 bits (90), Expect = 8e-05
Identities = 35/99 (35%), Positives = 43/99 (43%), Gaps = 5/99 (5%)

Query: 665 GAGGNGGPGGSGGAADIGGNGGAGNGGGTDGNG---GNGGSGGGAGSGGDGGGAGGNGAW 721
G G N G + G + G G GG +DG+G N GGG+GSG GG G+G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 722 LFGNGGAGGGGGKGGNGAGGGLGGGSFGLPGLNGSGGDG 760
G G GGG G +FG P L+ G G
Sbjct: 66 --GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 37.4 bits (86), Expect = 2e-04
Identities = 29/82 (35%), Positives = 36/82 (43%), Gaps = 3/82 (3%)

Query: 309 NGGSGGTGGAGGSTAGAGGNGGAGGGGGTGGLLFGNGGAGGHGAAAGNGLAAGNGVSSSG 368
+GG G G + NGG G G GG + G+G G +G+G+ G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGG---ASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 369 GGGAGGTGGAGGDGGAGGAGGN 390
G G G GG G GG G GGN
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGN 80



Score = 36.6 bits (84), Expect = 4e-04
Identities = 29/76 (38%), Positives = 35/76 (46%)

Query: 480 AGAGGRGGAGGSGGSGGDGGGGAAGPAGWLFGDGGAGGNGGAAAAGGAGGQAGGGGGNGG 539
GA G G +G GGGA+ +GW + GG G+ G G G GGGNG
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGN 70

Query: 540 NGGNGGNGGNGGNGAT 555
+GG G GGN A
Sbjct: 71 SGGGSGTGGNLSAVAA 86



Score = 36.2 bits (83), Expect = 6e-04
Identities = 38/113 (33%), Positives = 47/113 (41%), Gaps = 6/113 (5%)

Query: 631 GAGGTGFISSDGGAGGDGGDGGNGGAGGTGGLLFGAGGNGGPGGSGGAADIGGNGGAGNG 690
G G S G G G G G + G + + N GGSG GG G GNG
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 691 GGTDGNGGNGGSGGGAGSGGDGGGAGGNGAWLFGNGGAGGGGGKGGNGAGGGL 743
GGNG SGGG+G+GG+ A+ F G GG + + G L
Sbjct: 66 ------GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGAL 112



Score = 36.2 bits (83), Expect = 6e-04
Identities = 28/85 (32%), Positives = 30/85 (35%)

Query: 708 SGGDGGGAGGNGAWLFGNGGAGGGGGKGGNGAGGGLGGGSFGLPGLNGSGGDGGDGGNGA 767
SGGDG G GN G G G GA G G S P GSG GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 768 PGGVLYGNGGAGGQGSSGGIGGPGA 792
G GG G+ G + A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 35.8 bits (82), Expect = 8e-04
Identities = 37/102 (36%), Positives = 39/102 (38%), Gaps = 1/102 (0%)

Query: 302 GAGGAGGNGGSGGTGG-AGGSTAGAGGNGGAGGGGGTGGLLFGNGGAGGHGAAAGNGLAA 360
G G G N G+ T G G G G GGA G G GG G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 361 GNGVSSSGGGGAGGTGGAGGDGGAGGAGGNARLWGVGGAGGA 402
GNG + GG GTGG A A G L G G A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 35.5 bits (81), Expect = 0.001
Identities = 39/130 (30%), Positives = 45/130 (34%), Gaps = 9/130 (6%)

Query: 353 AAGNGLAAGNGVSSSGGGGAGGTGGAGGDGGAGGAGGNARLWGVGGAGGAGGDGGAGGAG 412
+ G+G G S+ G GG G G GGA G W GG G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSG----WSSENNPWGGGSGSGIHWG 57

Query: 413 GKGGSGLSGNANGGAGGDSGRGGTGGAGGEGGAAGLLVGTGGHGGDGGAGGAAVKGGDGG 472
G G G NGG G+SG G G AA + G G G A
Sbjct: 58 GGSGHG-----NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGAL 112

Query: 473 AAAGTGIAGA 482
+AA I A
Sbjct: 113 SAAIADIMAA 122



Score = 34.7 bits (79), Expect = 0.002
Identities = 32/94 (34%), Positives = 41/94 (43%), Gaps = 10/94 (10%)

Query: 533 GGGGNGGNGGNGGNGGNGGNGATGGWLYGNGGAGGQGATAGAGGAGANGVSSTNGGGTGG 592
GG G G N G GN G TG G GG + G+G + ++ GGG+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGL-----GVGGG-----ASDGSGWSSENNPWGGGSGS 52

Query: 593 NGGIGGTGGSGGAGGNAGLLGVGGAGGHGASGGA 626
GG G G GGN G G GG+ ++ A
Sbjct: 53 GIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 34.3 bits (78), Expect = 0.002
Identities = 33/114 (28%), Positives = 45/114 (39%), Gaps = 11/114 (9%)

Query: 479 IAGAGGRGGAGGSGGSGGDGGGGAAGPAGWLFGDGGAGGNGGAAAAGGAGGQAG-GGGGN 537
++G GRG G+ + G+ GG G G GGA+ G + GGG+
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGP----------TGLGVGGGASDGSGWSSENNPWGGGS 50

Query: 538 GGNGGNGGNGGNGGNGATGGWLYGNGGAGGQGATAGAGGAGANGVSSTNGGGTG 591
G GG G+G G G G+G G A A G +S+ GG
Sbjct: 51 GSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 33.9 bits (77), Expect = 0.003
Identities = 39/118 (33%), Positives = 45/118 (38%), Gaps = 18/118 (15%)

Query: 511 GDGGAGGNGGAAAAGG--AGGQAGGGGGNGGNGGNGGNGGNGGNGATGGWLYGNGGAGGQ 568
G G G N GA + G GG G G G G + G+G + N G G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 569 GATAGAGGAGANGVSSTNGGGTGGNGGIGGTGGSGGAGGNAGLLGVGGAGGHGASGGA 626
G NGGG G +GG GTGG+ A G GA G A
Sbjct: 63 G----------------NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 33.9 bits (77), Expect = 0.003
Identities = 23/78 (29%), Positives = 33/78 (42%)

Query: 343 GNGGAGGHGAAAGNGLAAGNGVSSSGGGGAGGTGGAGGDGGAGGAGGNARLWGVGGAGGA 402
G G G + +GN G+ GG G + + GG+G G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 403 GGDGGAGGAGGKGGSGLS 420
GG+G +GG G GG+ +
Sbjct: 66 GGNGNSGGGSGTGGNLSA 83



Score = 33.5 bits (76), Expect = 0.003
Identities = 25/80 (31%), Positives = 33/80 (41%)

Query: 246 GGNGGAGGAAGLFGDAGAGGNGGKGGAGGAAFSINFTAGDGGAGGAGGSGGHALLWGAGG 305
G N GA +G G G G + G+ +S GG+G GG + GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 306 AGGNGGSGGTGGAGGSTAGA 325
G +GG GTGG + A
Sbjct: 68 NGNSGGGSGTGGNLSAVAAP 87



Score = 33.1 bits (75), Expect = 0.005
Identities = 38/135 (28%), Positives = 49/135 (36%), Gaps = 11/135 (8%)

Query: 261 AGAGGNGGKGGAGGAAFSINFTAGDGGAGGAGGSGGHALLWGAGGAGGNGGSGGTGGAGG 320
+G G G GA + +IN G GG G W + GGSG GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSG---WSSENNPWGGGSGSGIHWGG 58

Query: 321 STAGAGGNGGAGGGGGTGGLLFGNGGAGGHGAAAGNGLAAGNGVSSSGGGGAGGTGGAGG 380
+ G G GGG+G GG+ +A +A G S+ G G + G
Sbjct: 59 GSGHGNGGGNGNSGGGSGT--------GGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110

Query: 381 DGGAGGAGGNARLWG 395
A A A L G
Sbjct: 111 ALSAAIADIMAALKG 125



Score = 33.1 bits (75), Expect = 0.005
Identities = 35/112 (31%), Positives = 41/112 (36%), Gaps = 11/112 (9%)

Query: 725 NGGAGGGGGKGGNGAGGGLGGGSFGLPGLNGSGGDGGDGGNGAPGGVLYGNGGAGGQGSS 784
+GG G G G + G + GG +G G G + G N GG GS
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGP--------TGLGVGGGASDGSGWSSENNPWGGGSGSG 53

Query: 785 GGIGGPGATGGAGGKGGDGGDAQLIGDGGNGGNGGAGGTGGTPGPGGPGGSG 836
GG G GG G GG + G GGN A G P PG G
Sbjct: 54 IHWGGGSGHGNGGGNGNSGGGS---GTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 33.1 bits (75), Expect = 0.005
Identities = 25/62 (40%), Positives = 30/62 (48%), Gaps = 3/62 (4%)

Query: 127 NGGPGQDGGPGGLLYGNGGNGGTSTTAGVAGGNGGAAGLIGNGGAGGGGGAGAAGGNGGA 186
NGGP G GG + G+G +S GG+G G G G GGG G +GG G
Sbjct: 21 NGGPTGLGVGGG---ASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGT 77

Query: 187 GG 188
GG
Sbjct: 78 GG 79



Score = 31.2 bits (70), Expect = 0.020
Identities = 30/87 (34%), Positives = 37/87 (42%), Gaps = 6/87 (6%)

Query: 601 GSGGAGGNAGLLGVGGAGGHGASGGAGDRGGAGGTGFISSDGGAGGDGGDGGNGGAGGTG 660
G G G N G G G +G G + G+G+ S + GG G G + G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS-- 60

Query: 661 GLLFGAGGNGGPGGSGGAADIGGNGGA 687
G G GG G SGG + GGN A
Sbjct: 61 ----GHGNGGGNGNSGGGSGTGGNLSA 83



Score = 30.8 bits (69), Expect = 0.024
Identities = 37/121 (30%), Positives = 43/121 (35%), Gaps = 13/121 (10%)

Query: 564 GAGGQGATAGAGGAGANGVSSTNGGGTGGNGGIGGTGGSGGAGGNAGLLGVGGAGGHGAS 623
G G+G GA N NGG TG G G + GSG + N G G+G H
Sbjct: 3 GGDGRGHNTGAHSTSGN----INGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIH--W 56

Query: 624 GGAGDRGGAGGTGFISSDGGAGGDGGDGGNGGAGGTGGLLFGAGGNGGPGGSGGAADIGG 683
GG G GG G G G G + + FG PG G A I
Sbjct: 57 GGGSGHGNGGG-------NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109

Query: 684 N 684

Sbjct: 110 G 110



Score = 30.5 bits (68), Expect = 0.031
Identities = 34/89 (38%), Positives = 37/89 (41%), Gaps = 6/89 (6%)

Query: 429 GDSGRGGTGGAGG-----EGGAAGLLVGTGGHGGDG-GAGGAAVKGGDGGAAAGTGIAGA 482
G GRG GA GG GL VG G G G + GG G G +G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 483 GGRGGAGGSGGSGGDGGGGAAGPAGWLFG 511
G GG G SGG G GG +A A FG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91


15  Rv1234  Rv1242  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv1234    -3      13       -3.219081  Probable transmembrane protein
Rv1235    -1      12       -2.528035  Probable sugar-binding lipoprotein LpqY
Rv1236    0       12       -3.167634  Probable sugar-transport integral membrane
Rv1237    1       11       -3.050139  Probable sugar-transport integral membrane
Rv1238    0       12       0.804890   Probable sugar-transport ATP-binding protein ABC
Rv1239c   0       13       2.794270   Possible magnesium and cobalt transport
Rv1240    2       14       3.613125   Probable malate dehydrogenase Mdh
Rv1241    3       14       3.608351   Possible antitoxin VapB33
Rv1242    3       14       3.488280   Possible toxin VapC33. Contains PIN domain.
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
Rv1235    MALTOSEBP  49     2e-08    Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 49.3 bits (117), Expect = 2e-08
Identities = 43/162 (26%), Positives = 70/162 (43%), Gaps = 16/162 (9%)

Query: 139 WNHKLYAAPVTTNTQLLWYRPDLVNSPPTDWNAMIAEAARLHAAGEPSWIAVQANQGEGL 198
+N KL A P+ L Y DL+ +PP W + A L A G+ A+ N E
Sbjct: 125 YNGKLIAYPIAVEALSLIYNKDLLPNPPKTWEEIPALDKELKAKGKS---ALMFNLQEPY 181

Query: 199 VVWFNTLLVSAGGSVLS-EDGRHVTL---TDTPAHRAATVSALQILKSVATTPGADPSIT 254
W L+ + GG E+G++ D +A + ++K+ D SI
Sbjct: 182 FTW--PLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDYSI- 238

Query: 255 RTEEGSARLAFEQGKAALEVNWPFVFASMLENAVKGGVPFLP 296
A AF +G+ A+ +N P+ ++++ + V GV LP
Sbjct: 239 ------AEAAFNKGETAMTINGPWAWSNIDTSKVNYGVTVLP 274


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv1238    PF05272  35     4e-04    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 35.4 bits (81), Expect = 4e-04
Identities = 24/122 (19%), Positives = 39/122 (31%), Gaps = 35/122 (28%)

Query: 33 LILVGPSGCGKTTTLNMIAGLEDISSGELRIAGERVNEKAPKDRDIAMVFQSYALYPHMT 92
++L G G GK+T +N + GL+ S I +D Y
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDI---------GTGKDSYEQIAGIVAY---- 645

Query: 93 VRQNIAFPLTLAKMRKADIAQKVSETAKILDLTNLLDRKPSQLSGGQRQRVAMGRAIVRH 152
++ + R+AD + + D R R A GR + H
Sbjct: 646 ---ELS---EMTAFRRADAEAVKAFFSSRKD----------------RYRGAYGRYVQDH 683

Query: 153 PK 154
P+
Sbjct: 684 PR 685


16  Rv1301  Rv1308  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv1301    -1      14       -3.003087  Conserved protein
Rv1302    0       14       -3.158500  Probable undecapaprenyl-phosphate
Rv1303    3       13       -3.697625  Conserved hypothetical transmembrane protein
Rv1304    3       12       -3.507763  Probable ATP synthase a chain AtpB (protein 6)
Rv1305    3       12       -2.704059  Probable ATP synthase C chain AtpE
Rv1306    4       11       -2.621392  Probable ATP synthase B chain AtpF
Rv1307    4       12       -2.401998  Probable ATP synthase delta chain AtpH
Rv1308    2       12       -1.805793  Probable ATP synthase alpha chain AtpA
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
Rv1307    IGASERPTASE  32     0.006    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 32.0 bits (72), Expect = 0.006
Identities = 23/117 (19%), Positives = 46/117 (39%), Gaps = 10/117 (8%)

Query: 31 SARQDTVRQQLADAAAAADRLAEASQ----AHTKALEDAKSEAHRVVEEARTDAERIAEQ 86
S + Q + A +A+ ++ A+T+ E A+S + +E +T +
Sbjct: 1048 SKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSE--TKETQTTETKETAT 1105

Query: 87 LEAQADVEAERIKMQGARQVDLIRAQLT-RQLRLELGHESVRQARELVRNHVADQAQ 142
+E + + E K Q +V +Q++ +Q + E ARE + Q
Sbjct: 1106 VEKEEKAKVETEKTQEVPKV---TSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQ 1159


17  Rv1350  Rv1355c  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv1350    -1      23       -4.364834  Probable 3-oxoacyl-[acyl-carrier protein]
Rv1351    0       25       -4.722541  Hypothetical protein
Rv1352    -1      23       -4.466915  Conserved protein
Rv1353c   0       21       -4.129498  Probable transcriptional regulatory protein
Rv1354c   0       22       -4.165777  Conserved hypothetical protein
Rv1355c   0       21       -3.923460  Possible molybdopterin biosynthesis protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv1350    DHBDHDRGNASE  114    5e-33    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 114 bits (287), Expect = 5e-33
Identities = 69/258 (26%), Positives = 120/258 (46%), Gaps = 19/258 (7%)

Query: 2 ASLLNARTAVITGGAQGLGLAIGQRFVAEGARVVLGDVNLEATEVAAKRLGGDD-VALAV 60
A + + A ITG AQG+G A+ + ++GA + D N E E L + A A
Sbjct: 3 AKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAF 62

Query: 61 RCDVTQADDVDILIRTAVERFGGLDVMVNNAGITRDATMRTMTEEQFDQVIAVHLKGTWN 120
DV + +D + G +D++VN AG+ R + ++++E+++ +V+ G +N
Sbjct: 63 PADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122

Query: 121 GTRLAAAIMRERKRGAIVNMSSVSGKVGMVGQTNYSAAKAGIVGMTKAAAKELAHLGIRV 180
+R + M +R+ G+IV + S V Y+++KA V TK ELA IR
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 181 NAIAPGLIRSAMTEAMPQRIWDQKLAE--------------VPMGRAGEPSEVASVAVFL 226
N ++PG + M +W + +P+ + +PS++A +FL
Sbjct: 183 NIVSPGSTETDMQ----WSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFL 238

Query: 227 ASDLSSYMTGTVLDVTGG 244
S + ++T L V GG
Sbjct: 239 VSGQAGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv1353c   TETREPRESSOR  85     3e-22    Tetracycline repressor protein signature.
		>TETREPRESSOR#Tetracycline repressor protein signature.

Length = 218

Score = 85.4 bits (211), Expect = 3e-22
Identities = 51/207 (24%), Positives = 84/207 (40%), Gaps = 15/207 (7%)

Query: 16 INPEDIISGAFELAQQVSIDNLSMPLLGKHLGVGVTSIYWYFRKKDDLLNAMTDRALSKY 75
+N E +I A EL + ID L+ L + LG+ ++YW+ + K LL+A+ L+++
Sbjct: 4 LNRESVIDAALELLNETGIDGLTTRKLAQKLGIEQPTLYWHVKNKRALLDALAVEILARH 63

Query: 76 VFATPYIEAGDWRETLRNHARSMRKTFADNPVLCDLILIRAALSPKTARLGAQEMEKAIA 135
+ W+ LRN+A S R+ D + P + +E +
Sbjct: 64 HDYSLPAAGESWQSFLRNNAMSFRRALL---RYRDGAKVHLGTRPDEKQYDT--VETQLR 118

Query: 136 NLVTAGLSLEDAFDIYSAVSVHVRGSVVLDRLSRKSQSAGSGPSAIEHPVAIDPATTPLL 195
+ G SL D SAVS G+V+ + + + + P A D PLL
Sbjct: 119 FMTENGFSLRDGLYAISAVSHFTLGAVLEQQEHTAALT--------DRPAAPDENLPPLL 170

Query: 196 AHATGRGHRIGAPDETNFEYGLECILD 222
A E F +GLE ++
Sbjct: 171 REALQIMDSDDG--EQAFLHGLESLIR 195


18  Rv1441c  Rv1455  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv1441c   0       15       3.028606   PE-PGRS family protein PE_PGRS26
Rv1442    -2      12       0.660131   Probable biotin sulfoxide reductase BisC (BDS
Rv1443c   1       13       -0.150018  Unknown protein
Rv1444c   0       14       0.476997   Unknown protein
Rv1445c   2       18       5.099400   Probable 6-phosphogluconolactonase DevB (6PGL)
Rv1446c   2       17       4.892553   Putative OXPP cycle protein OpcA
Rv1447c   4       17       6.402025   Probable glucose-6-phosphate 1-dehydrogenase
Rv1448c   5       17       6.356595   Probable transaldolase Tal
Rv1449c   5       18       6.709170   Transketolase Tkt (TK)
Rv1450c   7       18       7.212018   PE-PGRS family protein PE_PGRS27
Rv1451    2       11       3.865455   Probable cytochrome C oxidase assembly factor
Rv1452c   2       13       3.941429   PE-PGRS family protein PE_PGRS28
Rv1453    -1      11       0.734909   Possible transcriptional activator protein
Rv1454c   1       11       1.342042   Probable quinone reductase Qor (NADPH:quinone
Rv1455    2       11       1.686353   Conserved protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv1441c   cloacin  37     2e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 36.6 bits (84), Expect = 2e-04
Identities = 36/103 (34%), Positives = 41/103 (39%), Gaps = 1/103 (0%)

Query: 300 GTAGQGGDGGTALGAGGIGGDGGTGGAGGTGGTAGIGGSSAGAGGAGGDGGAGGTGGGSS 359
G G+G + G +G I G G TG G G + G G SS GG G GGGS
Sbjct: 3 GGDGRGHNTGAHSTSGNING-GPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 360 MIGGKGGTGGNGGVGGTGGASALTIGNGSSAGAGGAGGAGGTG 402
G G GG G G SA+ A GAGG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 36.2 bits (83), Expect = 3e-04
Identities = 34/116 (29%), Positives = 38/116 (32%), Gaps = 2/116 (1%)

Query: 233 GGGDAGGAAGMFGTGGA--GGTGGDGGAGGAGDSPNSGANGARGGDGGNGAAGGAGGRLF 290
GG G G T G GG G G GGA D + G G GG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 291 GNGGAGGNGGTAGQGGDGGTALGAGGIGGDGGTGGAGGTGGTAGIGGSSAGAGGAG 346
GNGG GN G G +A+ A G G G I + A A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 35.1 bits (80), Expect = 6e-04
Identities = 28/81 (34%), Positives = 33/81 (40%), Gaps = 2/81 (2%)

Query: 291 GNGGAGGNGGTAGQGGDGGTALGAGGIGGDGGTGGAGGTGGTAGIGGSSAGAGGAGGDGG 350
G G G N G G+ G+GG G + GGS +G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 351 AGGTGGGSSMIGGKGGTGGNG 371
G G G+S GG GTGGN
Sbjct: 63 GNGGGNGNS--GGGSGTGGNL 81



Score = 35.1 bits (80), Expect = 7e-04
Identities = 33/101 (32%), Positives = 43/101 (42%), Gaps = 1/101 (0%)

Query: 369 GNGGVGGTGGASALTIGNGSSAGAGGAGGAGGTGGTGGYIESLDGKGQAGNGGNGGNGAA 428
G G G GA + T GN + G G G + G+G E+ G +G+G + G G+
Sbjct: 3 GGDGRGHNTGAHS-TSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 429 GGAGGGGTGAGGNGGAGGNGGDGGPSQGGGNPGFGGDGGTG 469
G GGG +GG G GGN G P G G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 34.7 bits (79), Expect = 7e-04
Identities = 29/116 (25%), Positives = 38/116 (32%)

Query: 248 GAGGTGGDGGAGGAGDSPNSGANGARGGDGGNGAAGGAGGRLFGNGGAGGNGGTAGQGGD 307
G G G + GA + N G G G G + +G + GG+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 308 GGTALGAGGIGGDGGTGGAGGTGGTAGIGGSSAGAGGAGGDGGAGGTGGGSSMIGG 363
G GG G G G + GAGG + G S+ I
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 33.9 bits (77), Expect = 0.002
Identities = 27/92 (29%), Positives = 35/92 (38%), Gaps = 4/92 (4%)

Query: 390 AGAGGAGGAGGTGGTGGYIESLDGKGQAGNGGNGGNGAAG----GAGGGGTGAGGNGGAG 445
+G G G G T G I G G + G+G + GG G+G GG+G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 446 GNGGDGGPSQGGGNPGFGGDGGTGGPGGVGVP 477
G G + GGG+ G P G P
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFP 93



Score = 33.5 bits (76), Expect = 0.002
Identities = 28/85 (32%), Positives = 34/85 (40%), Gaps = 3/85 (3%)

Query: 326 AGGTGGTAGIGGSSAGAGGAGGDGGAGGTGGGSSMIGGKGGTGGNGGVGGTGGASALTIG 385
+GG G G S GG G G GG S G G + N GG G+ G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASD---GSGWSSENNPWGGGSGSGIHWGG 58

Query: 386 NGSSAGAGGAGGAGGTGGTGGYIES 410
GG G +GG GTGG + +
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSA 83



Score = 33.5 bits (76), Expect = 0.002
Identities = 31/109 (28%), Positives = 37/109 (33%)

Query: 258 AGGAGDSPNSGANGARGGDGGNGAAGGAGGRLFGNGGAGGNGGTAGQGGDGGTALGAGGI 317
+GG G N+GA+ G G G GG G G G G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 318 GGDGGTGGAGGTGGTAGIGGSSAGAGGAGGDGGAGGTGGGSSMIGGKGG 366
G+GG G G G G S+ A A G G G + G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 33.1 bits (75), Expect = 0.002
Identities = 31/105 (29%), Positives = 37/105 (35%), Gaps = 2/105 (1%)

Query: 363 GKGGTGGNGGVGGT--GGASALTIGNGSSAGAGGAGGAGGTGGTGGYIESLDGKGQAGNG 420
G+G G G GG + L +G G+S G+G + GG G G GNG
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 421 GNGGNGAAGGAGGGGTGAGGNGGAGGNGGDGGPSQGGGNPGFGGD 465
G GN G GG A A G P GG
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 33.1 bits (75), Expect = 0.003
Identities = 26/88 (29%), Positives = 34/88 (38%), Gaps = 5/88 (5%)

Query: 229 GGNGGGGDAGGAAGMFGTGGAGGTGGDGGAG-GAGDSPNSGANGARGGDGGNGAAGGAGG 287
G N G G TG G G G+G + ++P G +G+ GG G GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 288 RLFGNGGAGGNGGTAGQGGDGGTALGAG 315
NG +GG GT G + G
Sbjct: 68 ----NGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 32.4 bits (73), Expect = 0.005
Identities = 26/79 (32%), Positives = 30/79 (37%), Gaps = 4/79 (5%)

Query: 412 DGKGQAGNGGNGGNGAAGGAGGGGTGAGGNGGAG----GNGGDGGPSQGGGNPGFGGDGG 467
DG+G + GG G G G G + G+G N GG G G G G
Sbjct: 5 DGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGN 64

Query: 468 TGGPGGVGVPDGIGGANGA 486
GG G G G GG A
Sbjct: 65 GGGNGNSGGGSGTGGNLSA 83



Score = 31.6 bits (71), Expect = 0.007
Identities = 27/88 (30%), Positives = 37/88 (42%), Gaps = 2/88 (2%)

Query: 198 LSGNGGAGGNGGTGASGADGGGGLPPVPASPGGNGGGGDAGGAAGMFGTGGAGGTGGDGG 257
+SG G G N G ++ + GG P GG G + GG+G GG
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGG--PTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 258 AGGAGDSPNSGANGARGGDGGNGAAGGA 285
G G+ +G +G G GGN +A A
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 30.5 bits (68), Expect = 0.016
Identities = 27/78 (34%), Positives = 32/78 (41%), Gaps = 3/78 (3%)

Query: 142 GNGGNGGAGDAAHPNGGNGGDAGMFGNGGAG-GAGYSPAAGTGAAGGAGGAGGAGGWLSG 200
G G N GA + NGG G+ GGA G+G+S G G GG G
Sbjct: 6 GRGHNTGAHSTS--GNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 201 NGGAGGNGGTGASGADGG 218
NGG GN G G+
Sbjct: 64 NGGGNGNSGGGSGTGGNL 81



Score = 30.1 bits (67), Expect = 0.021
Identities = 34/111 (30%), Positives = 37/111 (33%), Gaps = 11/111 (9%)

Query: 161 GDAGMFGNGGAGGAGYSPAAGTGAAGGAGGAGGAGGWLSGNGGAGGNGGTGASGADGGGG 220
G G N GA + G G GGA GW S N GG G+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 221 LPPVPASPGGNGGGGDAGGAAGMFGTGGAGGTGGDGGAGGAGDSPNSGANG 271
G GG G + G GTGG A G GA G
Sbjct: 63 -----------GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 28.9 bits (64), Expect = 0.047
Identities = 28/73 (38%), Positives = 32/73 (43%), Gaps = 4/73 (5%)

Query: 422 NGGNGAAGGAGGGGTGAGGNGGAGGNGGDGGPSQGGG----NPGFGGDGGTGGPGGVGVP 477
+GG+G G T NGG G G GG S G G N +GG G+G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 478 DGIGGANGAQGKH 490
G GG NG G
Sbjct: 62 HGNGGGNGNSGGG 74


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv1450c   cloacin  41     2e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 41.2 bits (96), Expect = 2e-05
Identities = 35/100 (35%), Positives = 39/100 (39%), Gaps = 2/100 (2%)

Query: 839 GDGGNGGNGGSAGTGGN--GGRGGDGAFGGMSANATNPGENGPNGNPGGNGGAGGAGGAG 896
G G G N G+ T GN GG G G GG S + EN P G G+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 897 LNGGNGGAGGNGGLGGFGGNGAAGANGVAVGAPGQPGGAG 936
NGG G G G G + A A PG G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 40.1 bits (93), Expect = 6e-05
Identities = 32/103 (31%), Positives = 36/103 (34%)

Query: 408 SSGGKAGGNGGAGGAGGLVGNGGAGGAGGNGAPGAPPSGGDPNGGGGGAGGAGGKGGDGG 467
S G G N GA G + G G G GA + N GGG+G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 468 AQAGDGGAGGAGGKGGNGGNGATGATGLNGLGAGADGTDGGKG 510
G G GG G G A A G A + GG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 38.5 bits (89), Expect = 2e-04
Identities = 38/111 (34%), Positives = 46/111 (41%), Gaps = 3/111 (2%)

Query: 394 SGGNGGAGGNGAHFSSGGKAGGNGGAGGAGGLVGNGGAGGAGGNGAPGAPPSGGDPNGGG 453
SGG+G GAH +SG GG G G GG + G+G + N P SG + GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGG--ASDGSGWSSENN-PWGGGSGSGIHWGG 58

Query: 454 GGAGGAGGKGGDGGAQAGDGGAGGAGGKGGNGGNGATGATGLNGLGAGADG 504
G G GG G+ G +G GG A G A G GL
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109



Score = 38.5 bits (89), Expect = 2e-04
Identities = 36/102 (35%), Positives = 45/102 (44%), Gaps = 3/102 (2%)

Query: 780 GNGGHGGNGGNPGAGGQ--GGSGGAGSTPGAKGAHGFTPTSGGDGGDGGNGGNSQVVGGN 837
G G G N G G GG G G GA G++ + GG G+G + G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 838 GGDGGNGGNGGSAGTGGNGGRGGDGAFGGMSANATNPGENGP 879
G GGNG +GG +GTGGN G A +T PG G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALST-PGAGGL 103



Score = 37.4 bits (86), Expect = 4e-04
Identities = 31/79 (39%), Positives = 37/79 (46%), Gaps = 2/79 (2%)

Query: 991 GGGGRGGDGGNAGNAGA--GGPGGTGSTAGKAGPAGSILHDGGNGGHGGHGAASGGNGGP 1048
GG GRG + G +G GGP G G G + +G + GG G G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1049 GGHGGNGGNGGTGANGGNG 1067
G GGNG +GG GGN
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 37.4 bits (86), Expect = 4e-04
Identities = 26/87 (29%), Positives = 31/87 (35%)

Query: 934 GAGGHGGAGGNGGAGGNGGQGVVSDGAGGAGGAGGDGGAPGDGANGGNGQGAGAFAGGGG 993
G G G G GN G G GG G + + GG+G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 994 GRGGDGGNAGNAGAGGPGGTGSTAGKA 1020
G GG GN+G G + A A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVA 89



Score = 37.4 bits (86), Expect = 4e-04
Identities = 34/92 (36%), Positives = 39/92 (42%), Gaps = 9/92 (9%)

Query: 971 GAPGDGANGGNGQGAGAFAGGGGGRGGDGGNAGNAGAGGPGGTGSTAGKAGPAGSILHDG 1030
G G G N G +G GG G G GG + G G + G +GS +H G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGG-----ASDGSGWSSENNPWGGGSGSGIHWG 57

Query: 1031 GNGGHGGHGAASGGNGGPGGHGGNGGNGGTGA 1062
G GHG GGNG GG G GGN A
Sbjct: 58 GGSGHGN----GGGNGNSGGGSGTGGNLSAVA 85



Score = 37.4 bits (86), Expect = 4e-04
Identities = 36/108 (33%), Positives = 41/108 (37%), Gaps = 5/108 (4%)

Query: 1176 TGGTGGTGGTGGQGANGGLTGGRGGTGGNGGNGNTGGTGGAGGTGGTGHNGSQPGMGGNG 1235
+GG G TG +G + GG G G GG + G G G GS GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGS-GSGIHWGGGS 60

Query: 1236 GAGGFGGNGFAGVGGRGGMGGSGGTGGTGDAGPFGTGTGGTGGHGGQG 1283
G G GGNG +G GG G G FG T G GG
Sbjct: 61 GHGNGGGNGNSG----GGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 36.6 bits (84), Expect = 8e-04
Identities = 30/87 (34%), Positives = 37/87 (42%)

Query: 728 GNGGQGGPGGLAGNLFGQNGIQGVGGSGGKGGAGGLAGDGGNGANGNFAFGDGNGGHGGN 787
G G G +GN+ G GVGG G + G +G+ G GHG
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 788 GGNPGAGGQGGSGGAGSTPGAKGAHGF 814
GGN +GG G+GG S A A GF
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGF 92



Score = 36.2 bits (83), Expect = 8e-04
Identities = 36/102 (35%), Positives = 43/102 (42%), Gaps = 7/102 (6%)

Query: 361 GDNGGDPGAGGA--GGTGGAGSTIGAHGAAGASPTSGGNGGAGGNGAHFSSGGKAGGNGG 418
G N G G GG G G GA +G S + GG G+G H+ G G GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 419 AGGAGGLVGNGGAGGAGGNGAPGAPPSGGDPNGGGGGAGGAG 460
G +G GG+G G A AP + G P GAGG
Sbjct: 68 NGNSG-----GGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 35.5 bits (81), Expect = 0.001
Identities = 31/83 (37%), Positives = 39/83 (46%), Gaps = 3/83 (3%)

Query: 1145 GGGGRGGDAGRGGDAGLGGSSGPGGTPGDWGTGGTGGTGGTGGQGANGGLTGGRGGTGGN 1204
GG GRG + G G+ G T G G + G+G + GG +G GG
Sbjct: 3 GGDGRGHN---TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 1205 GGNGNTGGTGGAGGTGGTGHNGS 1227
G+GN GG G +GG GTG N S
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLS 82



Score = 35.5 bits (81), Expect = 0.002
Identities = 30/87 (34%), Positives = 36/87 (41%), Gaps = 1/87 (1%)

Query: 445 SGGDPNGGGGGAGGAGGKGGDGGAQAGDGGAGGAGGKGGNGGNGATGATGLNGLGAGADG 504
SGGD G GA G +GG G G + G G + N G +G+ G
Sbjct: 2 SGGDGRGHNTGAHSTSGN-INGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 505 TDGGKGGNGGAGGGGGAGGQGGKALAA 531
G GGNG +GGG G GG A
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAP 87



Score = 35.1 bits (80), Expect = 0.002
Identities = 32/110 (29%), Positives = 40/110 (36%)

Query: 521 AGGQGGKALAATHQDGSMGAGGAGGNGGAGGMGGDGGNGAKGTFDNGGDGVGGNGGNGGS 580
+GG G H GG G G GG G ++ GG G G + G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 581 RGIGGAGGIGGAGSTAGADGARGATPTSGGNGGTGGNGANATVAGGAGGA 630
G GG G G GS G + + A P + G GA + GA
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 35.1 bits (80), Expect = 0.002
Identities = 32/93 (34%), Positives = 37/93 (39%), Gaps = 8/93 (8%)

Query: 1042 SGGNGGPGGHGGNGGNGGTGANGGNGGIGGTGGAGSTGAKGVLGTNEGDGGDGGRGGNGG 1101
SGG+G GH + NGG G+G GGA G G G GG
Sbjct: 2 SGGDGR--GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 1102 RGGNGGQGLTGAGGNGGTGGTPGNGGNGGNGAS 1134
G G GGNG +GG G GGN A+
Sbjct: 60 SGHGNG------GGNGNSGGGSGTGGNLSAVAA 86



Score = 34.7 bits (79), Expect = 0.003
Identities = 29/81 (35%), Positives = 38/81 (46%), Gaps = 4/81 (4%)

Query: 484 NGGNGATGATGLNGLGAGADGTDGGKGGNGGAGGGGGAGGQ----GGKALAATHQDGSMG 539
+GG+G TG + +G G G GGA G G + GG + + H G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 540 AGGAGGNGGAGGMGGDGGNGA 560
G GGNG +GG G GGN +
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLS 82



Score = 33.9 bits (77), Expect = 0.005
Identities = 30/89 (33%), Positives = 38/89 (42%), Gaps = 1/89 (1%)

Query: 566 NGGDGVGGNGGNGGSRGIGGAGGIGGAGSTAGADGARGATPTSGGNGGTGGNGANATVAG 625
+GGDG G N G + G GG G G GA G + + GG G+G +
Sbjct: 2 SGGDGRGHNTGAHSTSG-NINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 626 GAGGAGGKGGNGGLVGNGGAGGKGGDGMA 654
G G GG G +GG G GG +A
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVA 89



Score = 33.9 bits (77), Expect = 0.005
Identities = 30/89 (33%), Positives = 39/89 (43%), Gaps = 2/89 (2%)

Query: 629 GAGGKGGNGGLVGNGG--AGGKGGDGMAGVAGSSPTTAGESGTSGQNGGAGGAGGAGGRG 686
G G+G N G G GG G G+ G A + E+ G G+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 687 GDFGGDGGTGGAGGNGANGANATTPGAKG 715
G+ GG+G +GG G G N + P A G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 32.8 bits (74), Expect = 0.010
Identities = 26/71 (36%), Positives = 29/71 (40%)

Query: 884 GGNGGAGGAGGAGLNGGNGGAGGNGGLGGFGGNGAAGANGVAVGAPGQPGGAGGHGGAGG 943
G N GA G G G G G G G + G G+ GG GHG GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 944 NGGAGGNGGQG 954
NG +GG G G
Sbjct: 68 NGNSGGGSGTG 78



Score = 32.4 bits (73), Expect = 0.012
Identities = 35/115 (30%), Positives = 45/115 (39%), Gaps = 1/115 (0%)

Query: 1029 DGGNGGHGGHGAASGGNGGPGGHGGNGGNGGTGANGGNGGIGGTGGAGSTGAKGVLGTNE 1088
DG G H + NGGP G G GG G+ + GG+GS G +
Sbjct: 5 DGRGHNTGAHSTSGNINGGPTGLGVGGG-ASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 1089 GDGGDGGRGGNGGRGGNGGQGLTGAGGNGGTGGTPGNGGNGGNGASGDLVTSPGD 1143
GG+G GG G GGN TPG GG + ++G L + D
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 32.4 bits (73), Expect = 0.014
Identities = 29/79 (36%), Positives = 33/79 (41%), Gaps = 2/79 (2%)

Query: 168 GTGGAGGAGGAGAAGGA--GGSGGWLLGNGGVGGAGGQSLLGGATGGAGGNAGLFGVGGT 225
G G G GA + G GG G +G G G+G S GG+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 226 GGPGGPGGPGGVGGTGGAG 244
G GG G GG GTGG
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 32.0 bits (72), Expect = 0.021
Identities = 29/97 (29%), Positives = 34/97 (35%), Gaps = 2/97 (2%)

Query: 1142 GDGGGGGRGGDAGRGGDAGLGGSSGPGGTPGDWGTGGTGGTGGTGGQGANGGLTGGRGGT 1201
G G GG GLG G G W + GG+G G + G G G
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSG-WSSENNPWGGGSG-SGIHWGGGSGHGNG 65

Query: 1202 GGNGGNGNTGGTGGAGGTGGTGHNGSQPGMGGNGGAG 1238
GGNG +G GTGG P + G G
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 31.2 bits (70), Expect = 0.029
Identities = 29/104 (27%), Positives = 35/104 (33%), Gaps = 15/104 (14%)

Query: 859 GGDGAFGGMSANATNPGENGPNGNPGGNGGAGGAGGAGLNGGNGGAGGNGGLGGFGGNGA 918
GGDG A++T+ NG G GGA G G G G+ GG+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG- 61

Query: 919 AGANGVAVGAPGQPGGAGGHGGAGGNGGAGGNGGQGVVSDGAGG 962
G GG G G G G V + A G
Sbjct: 62 --------------HGNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 31.2 bits (70), Expect = 0.032
Identities = 31/83 (37%), Positives = 36/83 (43%), Gaps = 4/83 (4%)

Query: 1098 GNGGRGGNGGQGLTGAGGNGGTGGTPGNGGNGGNGASGDLVTSPGDGGGGGRGGDAGRGG 1157
G GRG N G T NGG G G G + SG +S + GGG G GG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTG--LGVGGGASDGSG--WSSENNPWGGGSGSGIHWGG 58

Query: 1158 DAGLGGSSGPGGTPGDWGTGGTG 1180
+G G G G + G GTGG
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNL 81



Score = 30.8 bits (69), Expect = 0.040
Identities = 31/99 (31%), Positives = 35/99 (35%), Gaps = 2/99 (2%)

Query: 1070 GGTGGAGSTGAKGVLGTNEGDGGDGGRGGNGGRGGNGGQGLTGAGGNGGTGGTPGNGGNG 1129
GG G +TGA G +GG G G GG G GG+G GG
Sbjct: 3 GGDGRGHNTGAHSTSGNI--NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 1130 GNGASGDLVTSPGDGGGGGRGGDAGRGGDAGLGGSSGPG 1168
G+G G S G G GG G S PG
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPG 99


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv1452c   cloacin       38     2e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 37.8 bits (87), Expect = 2e-04
Identities = 31/81 (38%), Positives = 39/81 (48%), Gaps = 1/81 (1%)

Query: 463 SGGNGGAGGNGANATTAGTNGANGGPGGHGGLV-GNGGAGGNGANGAAGTNASDSGAVGG 521
SGG+G GA++T+ NG G G GG G+G + N G + G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 522 KGNSGGNGGQGGAGGDGGTLA 542
GN GGNG GG G GG L+
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLS 82



Score = 33.9 bits (77), Expect = 0.002
Identities = 29/79 (36%), Positives = 33/79 (41%), Gaps = 2/79 (2%)

Query: 168 GTGGAGGAGGAGAAGGA--GGSGGWLLGNGGVGGAGGQSLLGGATGGAGGNAGLFGVGGT 225
G G G GA + G GG G +G G G+G S GG+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 226 GGPGGPGGPGGVGGTGGAG 244
G GG G GG GTGG
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 32.4 bits (73), Expect = 0.008
Identities = 25/86 (29%), Positives = 34/86 (39%), Gaps = 2/86 (2%)

Query: 433 GGDPGAGGAGGTGGAGSITGAQGAIGATPTSGGNGGAGGNGANATTAGTNGANGGPGGHG 492
GGD G +G+I G +G G + G+G + N G +G+ GG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLG--VGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 493 GLVGNGGAGGNGANGAAGTNASDSGA 518
G GG G +G G N S A
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 32.0 bits (72), Expect = 0.010
Identities = 26/80 (32%), Positives = 30/80 (37%), Gaps = 4/80 (5%)

Query: 527 GNGGQGGAGGDGGTLAGNGGAGGTGGRGADGGLGGSGAEGANATTAGERGQDGGKGGNGG 586
G G GA G + G G GG +DG SG N G G GG G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDG----SGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 587 VGGTGGNAVAPGANGGHGGN 606
G GGN + G +G G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 30.8 bits (69), Expect = 0.022
Identities = 30/106 (28%), Positives = 36/106 (33%), Gaps = 5/106 (4%)

Query: 127 AGAPGTGQAGGAGGILWGNGGAGGSGAPGQVGGAGGAAGLFGTGGAGGAGGAGAAGGAGG 186
+G G G GA + +G P +G GGA+ G G G+ G
Sbjct: 2 SGGDGRGHNTGAH-----STSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHW 56

Query: 187 SGGWLLGNGGVGGAGGQSLLGGATGGAGGNAGLFGVGGTGGPGGPG 232
GG GNGG G G G A FG PG G
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 30.5 bits (68), Expect = 0.029
Identities = 30/94 (31%), Positives = 34/94 (36%), Gaps = 3/94 (3%)

Query: 372 MGGAGGAGGPGGAGGLISLLGGQGAGGAGGTGGAGGVGGDRGAGGPGNQAFNAGAGGAGG 431
M G G G GA + G G G G GGA G P +G GG
Sbjct: 1 MSGGDGRGHNTGAHSTSGNING-GPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 432 HGGDPGAGGAGGTGGAGSITGAQGAIGATPTSGG 465
G GG G G GS TG + A P + G
Sbjct: 60 SGHGN--GGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 30.1 bits (67), Expect = 0.039
Identities = 28/102 (27%), Positives = 35/102 (34%), Gaps = 5/102 (4%)

Query: 302 GEDLSPHGTSGGVGGDAGDGGTGGRGGWLAGAGGAGGAGGVGGTGGAGGAGFSRALIVAG 361
G + H TSG + G G GG + G+G + GG G+G
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGG-----GASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 362 DNGGDGGNGGMGGAGGAGGPGGAGGLISLLGGQGAGGAGGTG 403
NGG GN G G G A + GAGG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104


19  Rv1497  Rv1521  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
Rv1497    2       21       -5.453310   Probable esterase LipL
Rv1498c   4       29       -8.700460   Probable methyltransferase
Rv1498A   4       32       -8.637430   Conserved protein
Rv1499    3       30       -8.418661   Hypothetical protein
Rv1500    3       31       -9.589654   Probable glycosyltransferase
Rv1501    3       32       -9.483078   Conserved hypothetical protein
Rv1502    4       32       -9.864424   Hypothetical protein
Rv1503c   4       33       -9.981331   Conserved hypothetical protein
Rv1504c   1       28       -7.726387   Conserved hypothetical protein
Rv1505c   2       31       -8.016349   Conserved hypothetical protein
Rv1506c   1       30       -7.721838   Hypothetical protein
Rv1507c   1       25       -4.930333   Conserved protein
Rv1507A   0       18       -3.403531   Hypothetical protein
Rv1508c   -1      13       -2.071386   Probable membrane protein
Rv1508A   0       14       -1.944980   Conserved hypothetical protein
Rv1509    0       13       -1.864829   Hypothetical protein
Rv1510    1       13       -1.525100   Conserved probable membrane protein
Rv1511    2       11       -2.163516   GDP-D-mannose dehydratase GmdA (GDP-mannose 4,6
Rv1512    1       13       -2.000185   Probable nucleotide-sugar epimerase EpiA
Rv1513    0       13       -2.819922   Conserved protein
Rv1514c   -1      13       -2.747101   Conserved hypothetical protein
Rv1515c   -1      13       -2.588083   Conserved hypothetical protein
Rv1516c   -1      13       -2.972502   Probable sugar transferase
Rv1517    -1      11       -3.896901   Conserved hypothetical transmembrane protein
Rv1518    0       11       -3.873167   Conserved hypothetical protein
Rv1519    0       12       -3.517979   Conserved hypothetical protein
Rv1520    0       12       -3.538137   Probable sugar transferase
Rv1521    0       13       -3.251841   Probable fatty-acid-AMP ligase FadD25
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv1511    NUCEPIMERASE  94     4e-24    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 94.5 bits (235), Expect = 4e-24
Identities = 70/337 (20%), Positives = 122/337 (36%), Gaps = 48/337 (14%)

Query: 3 RALITGITGQDGSYLAELLLAKGYEVHGLIRRASTFNTSRIDHLYVDPHQPGARL----- 57
+ L+TG G G ++++ LL G++V G+ N Y D ARL
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGI----DNLND------YYDVSLKQARLELLAQ 51

Query: 58 ---FLHYGDLIDGTRLVTLLSTIEPDEVYNLAAQSHVRVSFDEPVHTGDTTGMGSMRLLE 114
H DL D + L ++ + V+ + VR S + P D+ G + +LE
Sbjct: 52 PGFQFHKIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILE 111

Query: 115 AVRLSRVHCRFYQASSSEMFGASP--PPQNELTPFYPRSPYGAAKVYSYWATRNYREAYG 172
R +++ ASSS ++G + P + + +P S Y A K + Y YG
Sbjct: 112 GCRHNKIQ-HLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYG 170

Query: 173 LFAVNGILFNHESPRRGETFVTRKITRAVARIKAGIQSEVYMGNLDAVRDWGYAPEYVEG 232
L A F P K T+A + G +VY RD+ Y + E
Sbjct: 171 LPATGLRFFTVYGPWGRPDMALFKFTKA---MLEGKSIDVY-NYGKMKRDFTYIDDIAEA 226

Query: 233 MWRMLQTDEPDD-------------------FVLATGRGFTVREFARAAFEHAGLDWQQY 273
+ R+ D + + + ++ +A + G++
Sbjct: 227 IIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIE---- 282

Query: 274 VKFDQRYLRPTEVDSLIGDATKAAELLGWRASVHTDE 310
K + L+P +V D E++G+ +
Sbjct: 283 AKKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKD 319


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv1512    NUCEPIMERASE  76     1e-17    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 75.6 bits (186), Expect = 1e-17
Identities = 60/288 (20%), Positives = 104/288 (36%), Gaps = 35/288 (12%)

Query: 48 ELDLTDRAATFDFVLESRPQVVIDAAARVGGILANDTYPADFLSENLQIQVNLLDAAVAA 107
++DL DR D + V + R + + P + NL +N+L+
Sbjct: 58 KIDLADREGMTDLFASGHFERVFISPHR-LAVRYSLENPHAYADSNLTGFLNILEGCRHN 116

Query: 108 RVPRLLFLGSSCIYPKLAPQPIPESALLTGPLEPTNDAYAIAKIAGILAVQAVRRQHGLP 167
++ LL+ SS +Y P + P+ YA K A L +GLP
Sbjct: 117 KIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSL----YAATKKANELMAHTYSHLYGLP 172

Query: 168 WISAMPTNLYGP-GDNFSPSGSHLLPALIRRYDEAKASGAPNVTNWGTGTPRRELLHVDD 226
+YGP G P + L + E K+ +V + G +R+ ++DD
Sbjct: 173 ATGLRFFTVYGPWGR---PDMA--LFKFTKAMLEGKS---IDV--YNYGKMKRDFTYIDD 222

Query: 227 LASACLYLLEHFDGPTH------------------VNVGTGIDHTIGEIAEMVASAVGYS 268
+A A + L + N+G + + + + A+G
Sbjct: 223 IAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIE 282

Query: 269 GETRWDPSKPDGTPRKLLDVSVLREA-GWRPSIALRDGIEATVAWYRE 315
+ P +P D L E G+ P ++DG++ V WYR+
Sbjct: 283 AKKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRD 330


20  Rv1566c  Rv1587c  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
Rv1566c   2       14       4.344027    Possible Inv protein
Rv1567c   4       15       4.319819    Probable hypothetical membrane protein
Rv1568    3       16       5.097595    Adenosylmethionine-8-amino-7-oxononanoate
Rv1569    1       14       4.668894    Probable 8-amino-7-oxononanoate synthase BioF1
Rv1570    1       16       2.515554    Dethiobiotin synthetase BioD
Rv1571    5       17       0.922850    Conserved protein
Rv1572c   8       21       -0.272144   Conserved hypothetical protein
Rv1573    9       21       0.411396    Probable PhiRv1 phage protein
Rv1574    8       22       -0.362339   Probable PhiRv1 phage related protein
Rv1575    7       20       -0.174683   Probable PhiRv1 phage protein
Rv1576c   5       18       0.120555    Probable PhiRv1 phage protein
Rv1577c   2       19       -1.653645   Probable PhiRv1 phage protein
Rv1578c   0       18       -0.545462   Probable PhiRv1 phage protein
Rv1579c   1       20       -0.918660   Probable PhiRv1 phage protein
Rv1580c   0       19       -0.596089   Probable PhiRv1 phage protein
Rv1581c   -1      22       0.127174    Probable PhiRv1 phage protein
Rv1582c   -1      23       0.386844    Probable PhiRv1 phage protein
Rv1583c   0       24       1.646716    Probable PhiRv1 phage protein
Rv1584c   1       23       1.139208    Possible PhiRv1 phage protein
Rv1585c   2       22       1.137847    Possible phage PhiRv1 protein
Rv1586c   2       19       1.245997    Probable PhiRv1 integrase
Rv1587c   2       14       0.668879    Partial REP13E12 repeat protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv1573    PF07201       27     0.024    Hypersensitivity response secretion protein HrpJ
		>PF07201#Hypersensitivity response secretion protein HrpJ

Length = 293

Score = 27.1 bits (60), Expect = 0.024
Identities = 16/72 (22%), Positives = 22/72 (30%), Gaps = 12/72 (16%)

Query: 44 EVREALVRLQAALNRHEHTGELEAFLRISVEHADAAGGDECGPAILAG-----------R 92
E E L + + EL L VE A + +E G I+ G +
Sbjct: 131 EPSEQFKMLCGLRDALKGRPELAH-LSHLVEQALVSMAEEQGETIVLGARITPEAYRESQ 189

Query: 93 SGPEQAAINRQL 104
SG R
Sbjct: 190 SGVNPLQPLRDT 201


21  Rv1646  Rv1667c  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
Rv1646    2       11       2.683243    PE family protein PE17
Rv1647    1       10       3.148189    Adenylate cyclase (ATP pyrophosphate-lyase)
Rv1648    0       10       3.632890    Probable transmembrane protein
Rv1649    0       10       3.753983    Probable phenylalanyl-tRNA synthetase, alpha
Rv1650    0       12       4.367663    Probable phenylalanyl-tRNA synthetase, beta
Rv1651c   1       15       4.602736    PE-PGRS family protein PE_PGRS30
Rv1652    1       22       4.477058    Probable N-acetyl-gamma-glutamyl-phosphate
Rv1653    2       25       4.087697    Probable glutamate N-acetyltransferase ArgJ
Rv1654    1       26       4.094706    Probable acetylglutamate kinase ArgB
Rv1655    1       23       3.747976    Probable acetylornithine aminotransferase ArgD
Rv1656    0       19       2.257773    Probable ornithine carbamoyltransferase,
Rv1657    3       25       1.179279    Probable arginine repressor ArgR (AHRC)
Rv1658    3       26       0.616814    Probable argininosuccinate synthase ArgG
Rv1659    3       28       0.072792    Probable argininosuccinate lyase ArgH
Rv1660    3       29       -0.366015   Chalcone synthase Pks10
Rv1661    3       28       -0.406753   Probable polyketide synthase Pks7
Rv1662    1       25       -1.338546   Probable polyketide synthase Pks8
Rv1663    -2      15       -1.002425   Probable polyketide synthase Pks17
Rv1664    -2      14       -0.820935   Probable polyketide synthase Pks9
Rv1665    0       13       -0.284414   Chalcone synthase Pks11
Rv1666c   1       14       -0.549422   Probable cytochrome P450 139 Cyp139
Rv1667c   2       14       0.056764    Probable second part of macrolide-transport
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv1649    PF07201       33     0.001    Hypersensitivity response secretion protein HrpJ
		>PF07201#Hypersensitivity response secretion protein HrpJ

Length = 293

Score = 32.9 bits (75), Expect = 0.001
Identities = 22/82 (26%), Positives = 31/82 (37%), Gaps = 9/82 (10%)

Query: 12 DAAQQAIALADTLDVLA-RVKTEHLGDRSPLALARQALAVLPKEQ--RAEAGKRVNAARN 68
+ ++Q L D L R + HL L QAL + +EQ G R+
Sbjct: 131 EPSEQFKMLCGLRDALKGRPELAHL-----SHLVEQALVSMAEEQGETIVLGARITPEAY 185

Query: 69 AAQRSYDERLATLR-AERDAAV 89
+S L LR RDA +
Sbjct: 186 RESQSGVNPLQPLRDTYRDAVM 207


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv1651c   cloacin       45     1e-06    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 45.1 bits (106), Expect = 1e-06
Identities = 38/108 (35%), Positives = 43/108 (39%), Gaps = 5/108 (4%)

Query: 605 SGGNGIGGKGGASGNGGNAGQVFGDGGTGGTGGAGGAGSGTKAGGTGSDGGHGGNATLIG 664
SGG+G G GA GN + G G GG GSG + GG G G
Sbjct: 2 SGGDGRGHNTGAHSTSGN---INGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 665 NGGDGGAGGAGGAGSPAGAPGNGGTGGTGGVLFGQSGSSGPPGAAALA 712
G G GG G +G +G GN V FG S PGA LA
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSAVAA-PVAFGFPALS-TPGAGGLA 104



Score = 42.4 bits (99), Expect = 8e-06
Identities = 35/99 (35%), Positives = 40/99 (40%), Gaps = 6/99 (6%)

Query: 119 GNGADGVAGTGSNAGGNGGPGGILYGNGGNGGAGGNGGAAGLIGNGGAGGAGGAGGAGGA 178
GN G G G G + G G N GG+G I GG G G GG G +
Sbjct: 18 GNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSG------IHWGGGSGHGNGGGNGNS 71

Query: 179 GGAGGTGGLLYGNGGAGGNGGSAAAAGGAGGNALLFGNG 217
GG GTGG L G A + GAGG A+ G
Sbjct: 72 GGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 42.4 bits (99), Expect = 8e-06
Identities = 41/116 (35%), Positives = 45/116 (38%), Gaps = 10/116 (8%)

Query: 124 GVAGTGSNAGGNGGPGGILYGNGGNGGAGGNGGAAGLIG-------NGGAGGAGGAGGAG 176
G G G N G + G I NGG G G GGA+ G GG G+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNI---NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 177 GAGGAGGTGGLLYGNGGAGGNGGSAAAAGGAGGNALLFGNGGNGGSGASGGAAGHA 232
G GG G G G GGN + AA G AL G S GA A
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 40.9 bits (95), Expect = 3e-05
Identities = 35/99 (35%), Positives = 43/99 (43%), Gaps = 4/99 (4%)

Query: 215 GNGGNGGSGASGGAAGHAGTIFGNGGNAGAGSGLAGAD----GGLFGNGGDGGSSTSKAG 270
G G N G+ ++ G T G GG A GSG + + GG GG S G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 271 GAGGNALFGNGGDGGSSTVAAGGAGGNTLVGNGGAGGAG 309
G GN+ G+G G S VAA A G + GAGG
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 39.7 bits (92), Expect = 5e-05
Identities = 34/103 (33%), Positives = 40/103 (38%), Gaps = 6/103 (5%)

Query: 147 GNGGAGGNGGAAGLIGN--GGAGGAGGAGGAGGAGGAGGTGGLLYGNGGAGGNGGSAAAA 204
G G G N GA GN GG G G GGA G G G+G + G +
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 205 GGAGGNALLFGNGGNGGSGASGGAAGHAGTIFGNGGNAGAGSG 247
G GGN GN G G +A A FG + G+G
Sbjct: 63 GNGGGN----GNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAG 101



Score = 39.7 bits (92), Expect = 6e-05
Identities = 38/115 (33%), Positives = 46/115 (40%), Gaps = 10/115 (8%)

Query: 497 SGVGGAGGSGGTALLLGS--GGAGGNGGTGGANSGSLFASPGGTGGAGGHGGAGGLIWGN 554
SG G G + G G+ GG G G GGA+ GS ++S G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 555 GGAGGNGGNGGTTADGALEGGTGGIGGTGGSAIAFGNGGQG--GAGGTGGDHSGG 607
G GG GN G + GTGG + +AFG GAGG S G
Sbjct: 62 HGNGGGNGNSGGGS------GTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 39.3 bits (91), Expect = 7e-05
Identities = 25/85 (29%), Positives = 33/85 (38%)

Query: 554 NGGAGGNGGNGGTTADGALEGGTGGIGGTGGSAIAFGNGGQGGAGGTGGDHSGGNGIGGK 613
+GG G G + G + GG G+G GG++ G + G G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 614 GGASGNGGNAGQVFGDGGTGGTGGA 638
G G GN+G G GG A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 37.8 bits (87), Expect = 2e-04
Identities = 34/95 (35%), Positives = 39/95 (41%), Gaps = 10/95 (10%)

Query: 591 NGGQGGAGGTGGDHSGGNGIGGKGGASGNGGNAGQVFGDG-GTGGTGGAGGAGSGTKAGG 649
+GG G TG + GN GG G GG + G G + GG+GSG GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASD---GSGWSSENNPWGGGSGSGIHWGG 58

Query: 650 TGSDGGHGGNATLIGNGGDGGAGGAGGAGSPAGAP 684
G G GNG GG G GG S AP
Sbjct: 59 GSGHGNGG------GNGNSGGGSGTGGNLSAVAAP 87



Score = 37.4 bits (86), Expect = 3e-04
Identities = 32/105 (30%), Positives = 41/105 (39%), Gaps = 3/105 (2%)

Query: 258 NGGDGGSSTSKAGGAGGNALFGNGGDGGSSTVAAGGAGGNTLVGNGGAGGAGGTSGLTGS 317
+GGDG + A GN G G G + G + GG G+G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 318 GVAGGAGGSVGLWGSGGAGGDGGAATSLLGVGMNA-GAGGAGGNA 361
GG G+ G G G GG+ A + + G A GAGG A
Sbjct: 62 HGNGGGNGNSG--GGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 36.2 bits (83), Expect = 7e-04
Identities = 35/122 (28%), Positives = 49/122 (40%), Gaps = 14/122 (11%)

Query: 171 GAGGAGGAGGAGGTGGLLYGNGGAGGNGGSAAAAGGAGGNALLFGNGGNGGSGASGGAAG 230
G G G GA T G + NGG G G A+ G+G ++ GG GSG
Sbjct: 3 GGDGRGHNTGAHSTSGNI--NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGI------ 54

Query: 231 HAGTIFGNGGNAGAGSGLAGADGGLFGNGGDGGSSTSKAGGAGGNALFGNGGDGGSSTVA 290
GG +G G+G + G G S+ + G AL G G + +++
Sbjct: 55 ------HWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSIS 108

Query: 291 AG 292
AG
Sbjct: 109 AG 110



Score = 35.8 bits (82), Expect = 8e-04
Identities = 34/108 (31%), Positives = 38/108 (35%), Gaps = 4/108 (3%)

Query: 537 GTGGAGGHGGAGGLIWGNGGAGGNGGNGGTTADGALEGGTGGIGGTGGSAIAFGNGGQGG 596
G G G G I NGG G G GG + GG GS I GG G
Sbjct: 6 GRGHNTGAHSTSGNI--NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGI--HWGGGSG 61

Query: 597 AGGTGGDHSGGNGIGGKGGASGNGGNAGQVFGDGGTGGTGGAGGAGSG 644
G GG+ + G G G G S F T G GG + S
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109



Score = 35.5 bits (81), Expect = 0.001
Identities = 25/112 (22%), Positives = 33/112 (29%), Gaps = 2/112 (1%)

Query: 224 ASGGAAGHAGTIFGNGGNAGAGSGLAGADGGLFGNGGDGGSSTSKAGGAGGNALFGNGGD 283
+ G GH GN G G GG + G G SS + G G + GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGG--ASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 284 GGSSTVAAGGAGGNTLVGNGGAGGAGGTSGLTGSGVAGGAGGSVGLWGSGGA 335
G G G G ++ G + + S GA
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 34.7 bits (79), Expect = 0.002
Identities = 34/115 (29%), Positives = 44/115 (38%), Gaps = 3/115 (2%)

Query: 165 GAGGAGGAGGAGGAGGA--GGTGGLLYGNGGAGGNGGSAAAAGGAGGNALLFGNGGNGGS 222
G G G GA G GG GL G G + G+G S+ GG+ GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 223 GASGGAAGHAGTIFGNGGNAGAGSGLAGADGGLFGNGGDGGSSTSKAGGAGGNAL 277
G GG G G + + +A L G GG + S + GA A+
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPAL-STPGAGGLAVSISAGALSAAI 116



Score = 34.3 bits (78), Expect = 0.002
Identities = 33/109 (30%), Positives = 45/109 (41%), Gaps = 8/109 (7%)

Query: 331 GSGGAGGDGGAATSLLGVGMNAGAGGAGGNAGLLYGNGGAGGAGGNGGDTTVPLFDSGVG 390
G G G + GA ++ +N G G G G G+G + GG + + G
Sbjct: 3 GGDGRGHNTGAHST--SGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 391 GAGGAGGNASLFGNGGTGGVGGKGGTSSDLASATSGAGGAGGAGGVGGL 439
G G G GNG +GG G GG S +A+ + A G GGL
Sbjct: 61 GHGNGG------GNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGL 103



Score = 34.3 bits (78), Expect = 0.003
Identities = 31/106 (29%), Positives = 39/106 (36%), Gaps = 3/106 (2%)

Query: 294 AGGNTLVGNGGAGGAGGTSGLTGSGVAGGAGGSVGLWGSGGAGGDGGAATSLLGVGMNAG 353
+GG+ N GA G +G+ G G S G S GG + S + G +G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 354 AGGAGGNAGLLYGNGGAGGAGGNGGDTTVPLFDSGVGGAGGAGGNA 399
G GGN GG+G G F GAGG A
Sbjct: 62 HGNGGGNGN---SGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 33.1 bits (75), Expect = 0.007
Identities = 29/81 (35%), Positives = 36/81 (44%), Gaps = 4/81 (4%)

Query: 575 GTGGIGGTGGSAIAFGN--GGQGGAGGTGGDHSGGNGIGGKGGASGNGGNAGQVFGDGGT 632
G G G G+ GN GG G G GG S G+G + G G +G +G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGA-SDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 633 GGTGGAGG-AGSGTKAGGTGS 652
G GG G +G G+ GG S
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLS 82



Score = 32.8 bits (74), Expect = 0.007
Identities = 37/125 (29%), Positives = 45/125 (36%), Gaps = 11/125 (8%)

Query: 390 GGAGGAGGNASLFGNGGTGGVGGKGGTSSDLA-SATSGAGGAGGAGGVGGLLYGNGGNGG 448
GA GN NGG G+G GG S S+ + G G G+ GNGG
Sbjct: 11 TGAHSTSGNI----NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGG 66

Query: 449 AGGIGGAAINILANAGAGGAGGAAGSSFIGNGGNGGAGGAGGAAALFSSGVGGAGGSGGT 508
G G +G GG A + GAGG A S+G A +
Sbjct: 67 GNGNSG------GGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIM 120

Query: 509 ALLLG 513
A L G
Sbjct: 121 AALKG 125



Score = 32.4 bits (73), Expect = 0.010
Identities = 28/103 (27%), Positives = 33/103 (32%)

Query: 367 NGGAGGAGGNGGDTTVPLFDSGVGGAGGAGGNASLFGNGGTGGVGGKGGTSSDLASATSG 426
+GG G G +T + G G G GG + G G G S SG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 427 AGGAGGAGGVGGLLYGNGGNGGAGGIGGAAINILANAGAGGAG 469
G GG G GG G L+ GAGG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 30.8 bits (69), Expect = 0.034
Identities = 31/86 (36%), Positives = 39/86 (45%), Gaps = 10/86 (11%)

Query: 464 GAGGAGGAAGSSFIGNGGNGGAGGAGGAA--ALFSSGVGGAGGSGGTALLLGSGGAGGNG 521
G G GA +S NGG G G GGA+ + +SS GG G+ + G G GNG
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 522 GTGGANSGSLFASPGGTGGAGGHGGA 547
G G + GG G GG+ A
Sbjct: 66 GGNG--------NSGGGSGTGGNLSA 83



Score = 30.5 bits (68), Expect = 0.037
Identities = 34/104 (32%), Positives = 39/104 (37%), Gaps = 9/104 (8%)

Query: 442 GNGGNGGAGGIGGAAINILANAGAGGAGGAAGSSFIGNGGNGGAGGAGGAAALFSSGVGG 501
G G N GA G G G GGA+ S + N GG+G G
Sbjct: 6 GRGHNTGAHSTSGNING--GPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 502 AGGSGGTALLLGSGGAGGNGGTGGANSGSL-FASPG-GTGGAGG 543
GG G SGG G GG A + + F P T GAGG
Sbjct: 64 NGGGNG-----NSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv1654    CARBMTKINASE  50     4e-09    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 49.8 bits (119), Expect = 4e-09
Identities = 32/126 (25%), Positives = 51/126 (40%), Gaps = 12/126 (9%)

Query: 172 GRIPVVSTLAPDADGVVHNINADTAAAAVAEALGAEKLLMLTDIDGLYTRW--PDRDSLV 229
G +PV+ + GV I+ D A +AE + A+ ++LTD++G + L
Sbjct: 195 GGVPVILE-DGEIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVNGAALYYGTEKEQWL- 252

Query: 230 SEIDTGTLAQLLPTL---ESGMVPKVEACLRAVIGGVPSAHIIDGRVTHCVLVELFTDAG 286
E+ L + M PKV A +R + G A I H +
Sbjct: 253 REVKVEELRKYYEEGHFKAGSMGPKVLAAIRFIEWGGERAII-----AHLEKAVEALEGK 307

Query: 287 TGTKVV 292
TGT+V+
Sbjct: 308 TGTQVL 313



Score = 36.0 bits (83), Expect = 1e-04
Identities = 18/62 (29%), Positives = 28/62 (45%), Gaps = 10/62 (16%)

Query: 28 GKVVVVKYGGNAMTDDTLRRAFAADMAFLRNC----------GIHPVVVHGGGPQITAML 77
GK VV+ GGNA+ + ++ M +R G V+ HG GPQ+ ++L
Sbjct: 2 GKRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSLL 61

Query: 78 RR 79

Sbjct: 62 LH 63


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv1657    ARGREPRESSOR  168    2e-56    Bacterial arginine repressor signature.
		>ARGREPRESSOR#Bacterial arginine repressor signature.

Length = 149

Score = 168 bits (427), Expect = 2e-56
Identities = 47/148 (31%), Positives = 80/148 (54%), Gaps = 6/148 (4%)

Query: 17 NRAGRQARIVAILSSAQVRSQNELAALLAAEGIEVTQATLSRDLEELGAVKLRGADGGTG 76
N+ R +I I+++ ++ +Q+EL +L +G VTQAT+SRD++EL VK+ + G+
Sbjct: 2 NKGQRHIKIREIITANEIETQDELVDILKKDGYNVTQATVSRDIKELHLVKVPT-NNGSY 60

Query: 77 IYVVPEDGSPVRGVSGGTDRMARLLGELLVSTDDSGNLAVLRTPPGAAHYLASAIDRAAL 136
Y +P D ++ R L + V D + +L VL+T PG A + + +D
Sbjct: 61 KYSLPADQR-----FNPLSKLKRSLMDAFVKIDSASHLIVLKTMPGNAQAIGALMDNLDW 115

Query: 137 PQVVGTIAGDDTILVVAREPTTGAQLAG 164
+++GTI GDDTIL++ R +
Sbjct: 116 EEIMGTICGDDTILIICRTHDDTKVVQK 143


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv1663    NUCEPIMERASE  45     3e-07    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 45.2 bits (107), Expect = 3e-07
Identities = 32/136 (23%), Positives = 52/136 (38%), Gaps = 15/136 (11%)

Query: 65 TVLITGGTGMAGSAVARHLVSRYGVRQVVLASRAGEHTESVA-ALVDELGSAGARVQVVS 123
L+TG G G V++ L+ G + V + + + S+ A ++ L G Q
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEA-GHQVVGIDNLNDYYDVSLKQARLELLAQPG--FQFHK 58

Query: 124 CDVADRDAVAGLVASQPDLTAVFH---AAGVLDDAVITGLTPERVDKVLRAKVDGAWNLH 180
D+ADR+ + L AS VF V + E + + G N+
Sbjct: 59 IDLADREGMTDLFASGH-FERVFISPHRLAVRY-------SLENPHAYADSNLTGFLNIL 110

Query: 181 ELTRHLDVSAFVLFSS 196
E RH + + SS
Sbjct: 111 EGCRHNKIQHLLYASS 126


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv1667c   YERSINIAYOPE  30     0.005    Yersinia virulence determinant YopE protein signature.
		>YERSINIAYOPE#Yersinia virulence determinant YopE protein signature.

Length = 219

Score = 30.1 bits (67), Expect = 0.005
Identities = 18/83 (21%), Positives = 34/83 (40%), Gaps = 1/83 (1%)

Query: 132 APAPAERPAPPAMSGAQRRATEKELAAVDRQLARL-ADRVAAKHTELAEHDQSDHVGITR 190
AP PA+ P+P + S + ++ + L +QL L A+ + H + A IT+
Sbjct: 90 APTPAQMPSPTSFSDSIKQLAAETLPKYMQQLNSLDAEMLQKNHDQFATGSGPLRGSITQ 149

Query: 191 LTQQLRVLQDHVAAMENRWLELS 213
++ + A + L
Sbjct: 150 CQGLMQFCGGELQAEASAILNTP 172


22  Rv1749c  Rv1773c  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
Rv1749c   2       13       -3.176656   Possible integral membrane protein
Rv1750c   2       15       -3.403990   Possible fatty-acid-CoA ligase FadD1
Rv1751    2       15       -3.304682   Probable oxidoreductase
Rv1752    2       15       -3.653477   Conserved hypothetical protein
Rv1753c   3       14       -3.675806   PPE family protein PPE24
Rv1754c   5       15       2.413012    Conserved protein
Rv1755c   5       14       3.162833    Probable phospholipase C 4 (fragment) PlcD
Rv1756c   4       15       4.073521    Putative transposase
Rv1757c   5       13       4.050300    Putative transposase for insertion sequence
Rv1758    5       13       4.050300    Probable cutinase Cut1
Rv1759c   2       15       3.617765    PE-PGRS family protein Wag22
Rv1760    -1      14       -1.527297   Possible triacylglycerol synthase
Rv1761c   0       17       -2.229688   Possible exported protein
Rv1762c   0       17       -2.241055   Unknown protein
Rv1763    0       18       -1.778053   Putative transposase for insertion sequence
Rv1764    1       14       1.918228    Putative transposase
Rv1765c   3       16       1.958583    Conserved hypothetical protein
Rv1765A   1       12       2.603334    Putative transposase (fragment)
Rv1766    0       12       1.958866    Conserved protein
Rv1767    0       13       1.903738    Conserved protein
Rv1768    1       11       1.912804    PE-PGRS family protein PE_PGRS31
Rv1769    2       13       -0.436459   Conserved protein
Rv1770    3       13       -0.197161   Conserved protein
Rv1771    3       13       -0.907999   L-gulono-1,4-lactone dehydrogenase
Rv1772    2       11       -0.803732   Hypothetical protein
Rv1773c   2       12       -0.704726   Probable transcriptional regulatory protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv1753c   CHANLCOLICIN  35     0.002    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 34.7 bits (79), Expect = 0.002
Identities = 30/102 (29%), Positives = 42/102 (41%), Gaps = 10/102 (9%)

Query: 50 SGLVGGAWQGASSSAMAAAAAPYAAWLAA--AAVQAEQT--AAQAAAMIAEFEAVKTAVV 105
SG GG +G S S +AA A W A QAEQ A AA A+ +A + A+
Sbjct: 32 SGSGGGGGKGGSKSESSAAIHATAKWSTAQLKKTQAEQAARAKAAAEAQAKAKANRDALT 91

Query: 106 QPMLVAANRADLVSLVMSNLFGQNAPAIAAIEATYEQMWAAD 147
Q + D+V+ + + + A A M A D
Sbjct: 92 QRL------KDIVNEALRHNASRTPSATELAHANNAAMQAED 127


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv1754c   PF05616       42     6e-06    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 41.7 bits (97), Expect = 6e-06
Identities = 37/102 (36%), Positives = 41/102 (40%), Gaps = 20/102 (19%)

Query: 154 LNPG---ARNAAPLQQQALVPRANPGPNPAPNP-PATGPQPPNATQLTPNPAPAPDPAPA 209
L PG A NA PL + + P NP NPAPN P T PNP P PD P
Sbjct: 315 LTPGSAEAPNAQPLPE--VSPAENPANNPAPNENPGT----------RPNPEPDPDLNPD 362

Query: 210 AAPDPGATLAGATTSLAEWVTGPDSPNKTLERFGISGTDLGI 251
A PD S A PD PN + G D G+
Sbjct: 363 ANPDTDGQPGTRPDSPAV----PDRPNGRHRKERKEGEDGGL 400


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv1759c   cloacin       37     2e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 37.4 bits (86), Expect = 2e-04
Identities = 36/110 (32%), Positives = 47/110 (42%), Gaps = 1/110 (0%)

Query: 243 AGGAGGAGGAGGLFTTGGV-GGAGGQGHTGGAGGAGGAGGLFGAGGMGGAGGFGDHGTLG 301
+GG G G T+G + GG G G GGA G G G G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 302 TGGAGGDGGGGGLFGAGGDGGAGGSGLTTGGAAGNGGNAGTLSLGAAGGA 351
G GG+G GG G GG+ A + + G A + AG L++ + GA
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 37.0 bits (85), Expect = 3e-04
Identities = 39/112 (34%), Positives = 52/112 (46%), Gaps = 7/112 (6%)

Query: 673 NGGAGGAGGTGSTAGGAGGAGGAGGLYAHGGTGGPGGNGGSTGAGGTGGAGGPGGLYGAG 732
+GG G TG+ + GG GL G G G+G S+ GG G G +G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGL--GVGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 733 GSGGAGGHGGMAGGGGGVGGNAGSLTLNASGG-----AGGSGGSSLSGKAGA 779
G GG G +GGG G GGN ++ + G G+GG ++S AGA
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 37.0 bits (85), Expect = 4e-04
Identities = 30/101 (29%), Positives = 43/101 (42%)

Query: 331 GGAAGNGGNAGTLSLGAAGGAGGTGGAGGTVFGGGKGGAGGAGGNAGMLFGSGGGGGTGG 390
GA GN G G G + G+G + GG G+G + G G G GGG G
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGN 70

Query: 391 FGFAAGGQGGVGGSAGMLSGSGGSGGAGGSGGPAGTAAGGA 431
G +G G + A ++ + G+GG A + + GA
Sbjct: 71 SGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 36.2 bits (83), Expect = 7e-04
Identities = 34/113 (30%), Positives = 42/113 (37%), Gaps = 6/113 (5%)

Query: 263 GAGGQGHTGGAGGA-----GGAGGLFGAGGMGGAGGFGDHGTLGTGGAGGDGGGGGLFGA 317
G G+GH GA GG GL GG G+ GG+G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 318 GGDGGAGGSGLTTGGAAGNGGNAGTLSLG-AAGGAGGTGGAGGTVFGGGKGGA 369
G GG G SG +G A ++ G A G GG ++ G A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 35.8 bits (82), Expect = 7e-04
Identities = 33/103 (32%), Positives = 41/103 (39%)

Query: 683 GSTAGGAGGAGGAGGLYAHGGTGGPGGNGGSTGAGGTGGAGGPGGLYGAGGSGGAGGHGG 742
G G GA G G TG G G S G+G + GG G+G G G G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 743 MAGGGGGVGGNAGSLTLNASGGAGGSGGSSLSGKAGAGGAGGS 785
GG G GG +G+ ++ A + G GAGG S
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVS 106



Score = 35.5 bits (81), Expect = 0.001
Identities = 24/74 (32%), Positives = 32/74 (43%)

Query: 534 NGDSGTPGTGDDGGAGGWLFGNGGNGGAGAAGTNGSAGGAGGAGGILFGTGGAGGAGGVG 593
N + + +GG G G G + G+G + N GG G+G G G G GG G
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69

Query: 594 TAGAGGAGGAGGSA 607
+G G G SA
Sbjct: 70 NSGGGSGTGGNLSA 83



Score = 35.1 bits (80), Expect = 0.001
Identities = 24/81 (29%), Positives = 30/81 (37%)

Query: 644 LGGCGGGAFTAGVTTGGAGGTGGAAGLFANGGAGGAGGTGSTAGGAGGAGGAGGLYAHGG 703
+ G G G + GG GL GGA G S GG G+G + G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 704 TGGPGGNGGSTGAGGTGGAGG 724
G GG G++G G G
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNL 81



Score = 35.1 bits (80), Expect = 0.001
Identities = 34/101 (33%), Positives = 42/101 (41%), Gaps = 5/101 (4%)

Query: 117 NGANGAPGTGANGGDAGWLIGNGGAGGSGAKGAN---GGAGGPGGAAGLFGNGGAGGAGG 173
N + NGG G +G G + GSG N GG G G G G GG G
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69

Query: 174 TATANNGIGGAGGAGGSAMLFG--AGGAGGAGGAATSLVGG 212
+ +G GG A + + FG A GAGG A S+ G
Sbjct: 70 NSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 35.1 bits (80), Expect = 0.001
Identities = 36/103 (34%), Positives = 40/103 (38%), Gaps = 14/103 (13%)

Query: 368 GAGGAGGNAGMLFGSGGGGGTGGFGFAAGGQGGVGGSAGMLSGSGGSGGAGGSGGPAGTA 427
G G G N G SG GG G+G G GSG S GG +G+
Sbjct: 3 GGDGRGHNTGAHSTSGN---------INGGPTGLGVGGGASDGSGWSSENNPWGGGSGSG 53

Query: 428 AGGAGGAGGAPGLIGNGGNGGNGGESGGTGGVGGAGGNAVLIG 470
GG+G GNGG GN G GTGG A V G
Sbjct: 54 IHWGGGSGH-----GNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 34.7 bits (79), Expect = 0.002
Identities = 35/117 (29%), Positives = 44/117 (37%), Gaps = 2/117 (1%)

Query: 137 GNGGAGGSGAKGANGGAGGPGGAAGLFGNGGAGGAGGTATANNGIGGAGGAGGSAMLFGA 196
G+G +GA +G G G GL GGA G ++ NN GG G+G
Sbjct: 4 GDGRGHNTGAHSTSGNING--GPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 197 GGAGGAGGAATSLVGGIGGTGGTGGNAGMLAGAAGAGGAGGFSFSTAGGAGGAGGAG 253
G GG G + G G A GAGG + S + GA A A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 33.5 bits (76), Expect = 0.004
Identities = 40/129 (31%), Positives = 49/129 (37%), Gaps = 12/129 (9%)

Query: 733 GSGGAGGHGGMAGGGGGVGGNAGSLTLNASGGAGGSGGSSLSGKAGAGGAGGSAGLFYGS 792
G G G + G G + G L G G S GS S + G G +G+ +G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGL----GVGGGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 793 GGAGGNGGYSLNGTGGDGGTGGAGQITGLRSGFG-------GAGGAGGASDTGAGGNGGA 845
G GNGG + +GG GTGG FG GAGG + GA A
Sbjct: 59 GSGHGNGGGN-GNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIA 117

Query: 846 GGKAGLYGN 854
A L G
Sbjct: 118 DIMAALKGP 126



Score = 33.1 bits (75), Expect = 0.005
Identities = 32/125 (25%), Positives = 47/125 (37%), Gaps = 2/125 (1%)

Query: 703 GTGGPGGNGGSTGAGGTGGAGGPGGLYGAGGSGGAGGHGGMAGGGGGVGGNAGSLTLNAS 762
G G G N G+ G G G G G S G+G GGG G +
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 763 GGAGGSGGSSLSGKAGAGGAGGSAGLFYG--SGGAGGNGGYSLNGTGGDGGTGGAGQITG 820
G GG+G S G + +A + +G + G GG +++ + G A +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMAA 122

Query: 821 LRSGF 825
L+ F
Sbjct: 123 LKGPF 127



Score = 32.8 bits (74), Expect = 0.007
Identities = 31/108 (28%), Positives = 36/108 (33%)

Query: 289 GGAGGFGDHGTLGTGGAGGDGGGGGLFGAGGDGGAGGSGLTTGGAAGNGGNAGTLSLGAA 348
GG G + G T G G G G G G+G S G+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 349 GGAGGTGGAGGTVFGGGKGGAGGAGGNAGMLFGSGGGGGTGGFGFAAG 396
G GG G +GG GG A A G S G G +AG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 31.6 bits (71), Expect = 0.017
Identities = 33/123 (26%), Positives = 43/123 (34%), Gaps = 8/123 (6%)

Query: 226 LAGAAGAGGAGGFSFSTAGGAGGAGGAGGLFTTGGVGGAGGQGHTGGAGGAGGAGGLFGA 285
++G G G G + ST+G G G+ G + GG+G G
Sbjct: 1 MSGGDGRGHNTG-AHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 286 GGMGGAGGFGDHGTLGTGGAGGDGGGGGLFGAGGDGGAGGSGLTTGGAAGNGGNAGTLSL 345
G G GG GG G GG L G L+T GA G + +L
Sbjct: 60 SGHGNGGG-------NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGAL 112

Query: 346 GAA 348
AA
Sbjct: 113 SAA 115



Score = 30.8 bits (69), Expect = 0.026
Identities = 29/109 (26%), Positives = 39/109 (35%), Gaps = 2/109 (1%)

Query: 803 LNGTGGDGGTGGAGQITGLRSGFGGAGGAGGASDTGAGGNGGAGGKAGLYGNGGDGGAGG 862
++G G G GA +G +G G GG + G+G + G G+G G G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 863 DGATSGKGGAGGNAVVIGNGGNGGNAGKAGG--TAGAGGAGGLVLGRDG 909
G G G G + A A G GAGGL +
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109



Score = 30.5 bits (68), Expect = 0.036
Identities = 23/81 (28%), Positives = 32/81 (39%)

Query: 555 NGGNGGAGAAGTNGSAGGAGGAGGILFGTGGAGGAGGVGTAGAGGAGGAGGSAFLIGSGG 614
+GG+G G + ++G G L GGA G + GG+G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 615 TGGVGGAATTTGGVGGAGGNA 635
G GG + GG G G +
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLS 82



Score = 30.5 bits (68), Expect = 0.037
Identities = 26/86 (30%), Positives = 33/86 (38%), Gaps = 8/86 (9%)

Query: 410 GSGGSGGAGGSGGPAGTAAGGAGGAGGAPGLIG--------NGGNGGNGGESGGTGGVGG 461
G G + GA + G G G GGA G GG+G GG+G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 462 AGGNAVLIGNGGEGGIGALAGKSGFG 487
G G+G G + A+A FG
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFG 91


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv1768    cloacin       36     3e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 36.2 bits (83), Expect = 3e-04
Identities = 48/164 (29%), Positives = 59/164 (35%), Gaps = 28/164 (17%)

Query: 141 GNGGNGGSGGVNQAGGNGGNAGLWGNGGSGGAGGNATTAGRNGFNGGAGGSGGLLWGNGG 200
G G G + G + GN N G G G GGA + + N GG GSG G G
Sbjct: 3 GGDGRGHNTGAHSTSGNI-NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 201 AGGAGGNGGPAPLVGGVGTTGGAGGNGGGAGLFYGFGGAGGNGGMGGVAPSTGPSMGILP 260
G GGNG +G GG+G G + VA L
Sbjct: 62 HGNGGGNGN----------------SG---------GGSGTGGNLSAVAAPVAFGFPALS 96

Query: 261 AGGVGGPGGSGGASALAFGSGGVGGAGGLGGPTDGTVQGVGGFG 304
G GG S A AL+ + A L GP + GV +G
Sbjct: 97 TPGAGGLAVSISAGALSAAIADIMAA--LKGPFKFGLWGVALYG 138



Score = 35.5 bits (81), Expect = 7e-04
Identities = 32/106 (30%), Positives = 39/106 (36%), Gaps = 2/106 (1%)

Query: 297 VQGVGGFGGQGGNGGQSGLLFGNAGAGGAGAAGGAGTGDTESFGGHGGAGGDGGAVGLIG 356
+ G G G G SG + N G G G GGA G S + GG G + G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNI--NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 357 NGGAGGTGSPGAVVGGNGGVGGLGGAGSPGGLLYGTGGAGGNGGPG 402
G G G G GG+G G L +P + G GG
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 35.5 bits (81), Expect = 7e-04
Identities = 31/87 (35%), Positives = 35/87 (40%), Gaps = 5/87 (5%)

Query: 471 GVGGMGGNGGATSVGGTLYAAGGNGGDGGLVWGNGGTGGSGGAGGAGSVGNGGAGGNAAL 530
G G G N GA S G + NGG GL G G + GSG + G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNI-----NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWG 57

Query: 531 LFGNGGAGGAGGAGGIGAGGAGGFGAV 557
G GG G G G+G G AV
Sbjct: 58 GGSGHGNGGGNGNSGGGSGTGGNLSAV 84



Score = 33.1 bits (75), Expect = 0.003
Identities = 28/87 (32%), Positives = 37/87 (42%), Gaps = 7/87 (8%)

Query: 471 GVGGMGGNGGATSVGG--TLYAAGGNGGDGGLVWGNGGTGGSGGAGGAGSVGNGGAGGNA 528
G G+G GGA+ G + G G G+ WG G G+GG G G+G G +
Sbjct: 23 GPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLS 82

Query: 529 ALLFGNGGAGGAGGAGGIGAGGAGGFG 555
A+ A A G + GAGG
Sbjct: 83 AV-----AAPVAFGFPALSTPGAGGLA 104



Score = 32.8 bits (74), Expect = 0.005
Identities = 31/103 (30%), Positives = 35/103 (33%), Gaps = 5/103 (4%)

Query: 266 GPGGSGGASALAFGSGGVGGAGGLGGPTDGTVQGVGGFGGQGGNGGQSGLLFGNAGAGGA 325
G G G + SG + G G G G G GG SG G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 326 GAAGGAGTGDTESFGGHGGAGGDGGAVGLIGNGGAGGTGSPGA 368
G GG G GG G GG+ AV G +PGA
Sbjct: 63 GNGGGNGNS-----GGGSGTGGNLSAVAAPVAFGFPALSTPGA 100



Score = 32.4 bits (73), Expect = 0.006
Identities = 34/108 (31%), Positives = 37/108 (34%), Gaps = 10/108 (9%)

Query: 510 SGGAGGAGSVGNGGAGGNAALLFGNGGAGGAGGAGGIGAGGAGGFGAVLFGNGGAGGSGA 569
SGG G + G GN NGG G G GG G +G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNI-----NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHW 56

Query: 570 PGGIGAGGNGGNALLVGNGGNGGAGTGGAAGGAGGSGGLLFGQNGMPG 617
GG G G GGN GG+GTGG F PG
Sbjct: 57 GGGSGHGNGGGN-----GNSGGGSGTGGNLSAVAAPVAFGFPALSTPG 99



Score = 32.4 bits (73), Expect = 0.007
Identities = 28/99 (28%), Positives = 33/99 (33%)

Query: 279 GSGGVGGAGGLGGPTDGTVQGVGGFGGQGGNGGQSGLLFGNAGAGGAGAAGGAGTGDTES 338
G G GA G +G G+G GG G S G G+G G G+G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 339 FGGHGGAGGDGGAVGLIGNGGAGGTGSPGAVVGGNGGVG 377
G GG G L G P G GG+
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 31.6 bits (71), Expect = 0.010
Identities = 25/99 (25%), Positives = 30/99 (30%), Gaps = 7/99 (7%)

Query: 129 TGQNGGDGGILYGNGGNGGSGGVNQAGGNGGNAGLWGNGGSGGAGGNATTAGRNGFNGGA 188
TG + G I G G G GG + G WG G G + NG G
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGN 70

Query: 189 GGSGGLLWGNGGAGGAGGNGGPAPLVGGVGTTGGAGGNG 227
G G G + AP+ G G G
Sbjct: 71 SG-------GGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 30.8 bits (69), Expect = 0.017
Identities = 31/93 (33%), Positives = 41/93 (44%), Gaps = 3/93 (3%)

Query: 132 NGGDGGILYGNGGNGGSGGVNQAGGNGGNAGLWGNGGSGGAGGNATTAGRNGFNGGAGGS 191
NGG G+ G G + GSG ++ GG +G + G G GN G NG +GG G+
Sbjct: 21 NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG---GGNGNSGGGSGT 77

Query: 192 GGLLWGNGGAGGAGGNGGPAPLVGGVGTTGGAG 224
GG L G P GG+ + AG
Sbjct: 78 GGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 30.8 bits (69), Expect = 0.019
Identities = 36/118 (30%), Positives = 46/118 (38%), Gaps = 6/118 (5%)

Query: 332 GTGDTESFGGHGGAGG-DGGAVGLIGNGGAGGTGSPGAVVGGNGGVGGLGGAGSPGGLLY 390
G G + G H +G +GG GL GGA + GG G G G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 391 GTGGAGGNGGPGGDGGTGATVGFAGSGGF-----GGAGGIAQLFGTGGMGGSGGGIGA 443
GG G +GG G GG + V + GF GAGG+A G + + I A
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMA 121


23  Rv1798  Rv1814  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
Rv1798    2       12       0.151591    ESX conserved component EccA5. ESX-5 type VII
Rv1799    4       14       0.514962    Probable lipoprotein LppT
Rv1800    5       16       0.229880    PPE family protein PPE28
Rv1801    7       17       0.859467    PPE family protein PPE29
Rv1802    5       15       1.316270    PPE family protein PPE30
Rv1803c   6       17       1.633372    PE-PGRS family protein PE_PGRS32
Rv1804c   4       17       -0.040951   Conserved protein
Rv1805c   4       16       0.129291    Hypothetical protein
Rv1806    4       14       0.384833    PE family protein PE20
Rv1807    3       12       0.736747    PPE family protein PPE31
Rv1808    3       13       -0.036627   PPE family protein PPE32
Rv1809    2       12       -1.327541   PPE family protein PPE33
Rv1810    1       13       -1.480829   Conserved protein
Rv1811    -1      11       -1.397794   Possible Mg2+ transport P-type ATPase C MgtC
Rv1812c   -1      8        -1.501090   Probable dehydrogenase
Rv1813c   4       10       1.634178    Conserved hypothetical protein
Rv1814    2       9        0.489998    Membrane-bound C-5 sterol desaturase Erg3
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv1803c   cloacin       42     8e-06    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 41.6 bits (97), Expect = 8e-06
Identities = 31/76 (40%), Positives = 36/76 (47%), Gaps = 2/76 (2%)

Query: 523 GAGGSGGLLFGSGGAGGIGGAGGVGGSGNDGGNGG--DGGQGGASGLGIGNGGPGGSGGT 580
G G + G SG G GVGG +DG + GG SG GI GG G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 581 GGAGGTGGSAGTGGAG 596
GG G +GG +GTGG
Sbjct: 66 GGNGNSGGGSGTGGNL 81



Score = 36.6 bits (84), Expect = 3e-04
Identities = 26/85 (30%), Positives = 36/85 (42%), Gaps = 2/85 (2%)

Query: 273 AGSGGLGGNGGNGAQGGWLIGNGGQGGDSGAGGGTDSTQTGVMNGASGGSAGIAGNGGDA 332
+G G G N G + G + NGG G GG +D + N GG +G + G
Sbjct: 2 SGGDGRGHNTGAHSTSGNI--NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 333 GLVGNGGAGGNGGNGAAGSALGTTI 357
GNGG GN G G+ + +
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAV 84



Score = 35.8 bits (82), Expect = 5e-04
Identities = 31/92 (33%), Positives = 36/92 (39%), Gaps = 8/92 (8%)

Query: 133 GTGQNGGDGGWLYGNGGNGGSGGTGQNGGNGGSAGLWGSGGNGGQGGAGANGAAGQPGKA 192
G G+ G NGG G G GG +G W S N GG+G+ G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSG-WSSENNPWGGGSGSGIHWGGGSGH 62

Query: 193 GGSGGNGGAGGWIYGHGGHGGAGGNGGNATAP 224
G GGNG + GG G GGN AP
Sbjct: 63 GNGGGNGNS-------GGGSGTGGNLSAVAAP 87



Score = 35.8 bits (82), Expect = 6e-04
Identities = 25/79 (31%), Positives = 32/79 (40%)

Query: 152 GSGGTGQNGGNGGSAGLWGSGGNGGQGGAGANGAAGQPGKAGGSGGNGGAGGWIYGHGGH 211
G G G N G ++G G G G GA+ +G + GG G+G G GH
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 212 GGAGGNGGNATAPGGASAG 230
G GGNG + G
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 33.9 bits (77), Expect = 0.002
Identities = 26/71 (36%), Positives = 28/71 (39%), Gaps = 1/71 (1%)

Query: 339 GAGGNGGNGAAGSALGTTIFGGSGGVGGSGGDGGNGGWLFGSGASGGNGGQGGDAGTNGF 398
G G G N A S G I GG G+G GG GW + GG G G G
Sbjct: 3 GGDGRGHNTGAHSTSGN-INGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 399 AGFGGSAGGGG 409
G GG G G
Sbjct: 62 HGNGGGNGNSG 72



Score = 33.9 bits (77), Expect = 0.002
Identities = 23/85 (27%), Positives = 36/85 (42%)

Query: 268 TAGDSAGSGGLGGNGGNGAQGGWLIGNGGQGGDSGAGGGTDSTQTGVMNGASGGSAGIAG 327
+ GD G + GG G G G+G +++ G +G+ G +G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 328 NGGDAGLVGNGGAGGNGGNGAAGSA 352
+G G +GG G GGN +A +A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 33.1 bits (75), Expect = 0.004
Identities = 27/80 (33%), Positives = 33/80 (41%), Gaps = 4/80 (5%)

Query: 210 GHGGAGGNGGNATAPGGASAGFDGGAGGNGGSGGRGGLLFGNGGNGSVGGMGGQGTNDTA 269
G G G N G + G + G G G G S G G + N GG G G +
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSG----WSSENNPWGGGSGSGIHWGG 58

Query: 270 GDSAGSGGLGGNGGNGAQGG 289
G G+GG GN G G+ G
Sbjct: 59 GSGHGNGGGNGNSGGGSGTG 78



Score = 31.6 bits (71), Expect = 0.011
Identities = 27/94 (28%), Positives = 29/94 (30%), Gaps = 17/94 (18%)

Query: 170 GSGGNGGQGGAGANGAAGQPGKAGGSGGNGGAGGWIYGHGGHGGAGGNGGNATAPGGASA 229
G G N G N G P G GG GW + P G +
Sbjct: 6 GRGHNTGAHSTSGN-INGGPTGLGVGGGASDGSGW--------------SSENNPWGGGS 50

Query: 230 GFDGGAGGNGGSGGRGGLLFGNGGNGSVGGMGGQ 263
G GG G G GG GN G GS G
Sbjct: 51 GSGIHWGGGSGHGNGGG--NGNSGGGSGTGGNLS 82



Score = 31.6 bits (71), Expect = 0.011
Identities = 29/84 (34%), Positives = 34/84 (40%), Gaps = 5/84 (5%)

Query: 553 GGNGGDGGQGGASGLGIGNGGPGGSGGTGGAGGTGGSAGTGGAGGDGGNAALLIGTGGDG 612
GG+G G S G NGGP G G GGA G + G G + + G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 613 GDGVPPAPGGQGGKGGLIGLPGQN 636
G+G GG G GG G G
Sbjct: 63 GNG-----GGNGNSGGGSGTGGNL 81



Score = 31.2 bits (70), Expect = 0.015
Identities = 36/91 (39%), Positives = 40/91 (43%), Gaps = 6/91 (6%)

Query: 500 NGATGGTGVGNIIQEAGGDGSDGGAGGSGGLLFGSG-GAGGIGGAGGVGGSGNDGGNGGD 558
NG G GVG + G S+ G G GSG GG G G GG+GN GG G
Sbjct: 21 NGGPTGLGVGGGASDGSGWSSENNPWGGGS---GSGIHWGGGSGHGNGGGNGNSGGGSGT 77

Query: 559 GGQGGASGLGIGNGGPGGSGGTGGAGGTGGS 589
GG A + G P S T GAGG S
Sbjct: 78 GGNLSAVAAPVAFGFPALS--TPGAGGLAVS 106



Score = 30.8 bits (69), Expect = 0.016
Identities = 28/80 (35%), Positives = 31/80 (38%), Gaps = 1/80 (1%)

Query: 493 GNGGAGGNGATGGTGVGNIIQEAGGDGSDGGAGGSGGLLFGSGGAGGIGGAGGVGGSGND 552
G G G N T GNI G G GGA G + GG G+G G G+
Sbjct: 3 GGDGRGHNTGAHSTS-GNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 553 GGNGGDGGQGGASGLGIGNG 572
GNGG G G GN
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 30.1 bits (67), Expect = 0.031
Identities = 28/70 (40%), Positives = 34/70 (48%), Gaps = 4/70 (5%)

Query: 264 GTNDTAGD-SAGSGGLGGNGGNGAQGGWLIGNGGQGGDSGAG---GGTDSTQTGVMNGAS 319
G + T+G+ + G GLG GG GW N GG SG+G GG G NG S
Sbjct: 12 GAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNS 71

Query: 320 GGSAGIAGNG 329
GG +G GN
Sbjct: 72 GGGSGTGGNL 81


24  Rv1912c  Rv1918c  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
Rv1912c   2       17       -3.683964   Possible oxidoreductase FadB5
Rv1913    3       18       -4.440546   Conserved hypothetical protein
Rv1914c   4       18       -4.535668   Unknown protein
Rv1915    4       16       -4.388123   Probable isocitrate lyase AceAa [first part]
Rv1916    3       16       -4.615507   Probable isocitrate lyase AceAb [second part]
Rv1917c   4       16       -4.663982   PPE family protein PPE34
Rv1918c   1       13       -3.543616   PPE family protein PPE35
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv1917c   cloacin       36     0.001    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 35.8 bits (82), Expect = 0.001
Identities = 24/77 (31%), Positives = 33/77 (42%), Gaps = 3/77 (3%)

Query: 240 NVGSNNVGLGNTGNGNIGFGNTGNGNIGFGLTGDNQQGFGGWNSGTGNIGLFNSGTGNIG 299
N G+++ GN G G G G + G G + +N GG SG G SG GN G
Sbjct: 10 NTGAHSTS-GNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG--GSGHGNGG 66

Query: 300 IGNTGTGNFGIGNSGTS 316
G G G + ++
Sbjct: 67 GNGNSGGGSGTGGNLSA 83



Score = 33.1 bits (75), Expect = 0.008
Identities = 26/76 (34%), Positives = 32/76 (42%), Gaps = 1/76 (1%)

Query: 882 GFGNIGGNNYGFANIGNGNIGFGNTGTGNIGIGLTGDNQVGFGALNSGSGNIGFFNSGNG 941
G G+ G + NI G G G G + G G + +N G SG G GNG
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 942 NIGFFNSGNGNVGIGN 957
G NSG G+ GN
Sbjct: 66 G-GNGNSGGGSGTGGN 80



Score = 31.2 bits (70), Expect = 0.038
Identities = 23/81 (28%), Positives = 28/81 (34%), Gaps = 1/81 (1%)

Query: 247 GLGNTGNGNIGFGNTGNGNIGFGLTGDNQQGFGGWNSGTGNIGLFNSGTGNIGIGNTGTG 306
G G+ + GN G G G+ G G G + G SG G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 307 NFGIGNSGTSYNTGIGNTGQA 327
GNSG TG + A
Sbjct: 66 GGN-GNSGGGSGTGGNLSAVA 85



Score = 30.8 bits (69), Expect = 0.045
Identities = 36/116 (31%), Positives = 47/116 (40%), Gaps = 12/116 (10%)

Query: 826 GSGNVGSYNFGSGNIGNGSFGFGNIGSNNFGFGNVGSNNLGFANTGPGLTEALHNIGFGN 885
G G+ + SGNI G G G G + G G NN +G G+ G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHW------GGG 59

Query: 886 IGGNNYGFANIGNGNIGFGNTGTGNIGIGLTGDNQVGFGALNS-GSGNIGFFNSGN 940
G N G GNGN G G +GTG + GF AL++ G+G + S
Sbjct: 60 SGHGNGG----GNGNSG-GGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv1918c   cloacin       36     0.001    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 35.8 bits (82), Expect = 0.001
Identities = 61/264 (23%), Positives = 89/264 (33%), Gaps = 41/264 (15%)

Query: 378 NTGSFNVGHYNFGAFNPGPSNTGTFNTGGANTGWFNTGSINTGAFNIGDM---NNGLFNT 434
NTG+ + G N GP+ G +GW + + G G +G N
Sbjct: 10 NTGAHSTS----GNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 435 GDMNNGVFYRGVGQGSLQFAITSPDLTLPSLEIPGISVPAFSLPAITLPSL---TIPAVT 491
G N G G G+L P+L PG A S+ A L + + A+
Sbjct: 66 GGNGNSGGGSGTG-GNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMAALK 124

Query: 492 TPANVTVGAFDLPGLTVPSLTI---PAAMTPANITVGAFDLPGLTVPSLTIPATTTPANI 548
P + L G+ +PS P M+ ++ A D+ V SL + T N+
Sbjct: 125 GPFKFGLWGVALYGV-LPSQIAKDDPNMMSKIVTSLPADDITESPVSSLPLDKATVNVNV 183

Query: 549 TV-----------GAFNLPQLSIPSVTVPPITIPAGTALGAFNLPTLSI------PSVTV 591
V + +S+P V P P P L+I P+V
Sbjct: 184 RVVDDVKDERQNISVVSGVPMSVPVVDAKPTERPGVFTASIPGAPVLNISVNNSTPAVQT 243

Query: 592 PPITI---------PAGTTVGGFT 606
+ PAG T GG T
Sbjct: 244 LSPGVTNNTDKDVRPAGFTQGGNT 267


25  Rv1929c  Rv1959c  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
Rv1929c   2       13       2.090277    Conserved hypothetical protein
Rv1930c   1       14       2.619188    Conserved hypothetical protein
Rv1931c   1       15       1.898868    Probable transcriptional regulatory protein
Rv1932    0       15       1.188046    Probable thiol peroxidase Tpx
Rv1933c   0       16       1.208119    Probable acyl-CoA dehydrogenase FadE18
Rv1934c   1       17       0.958610    Probable acyl-CoA dehydrogenase FadE17
Rv1935c   0       14       0.769523    Possible enoyl-CoA hydratase EchA13 (enoyl
Rv1936    1       14       0.797891    Possible monooxygenase
Rv1937    2       13       0.913063    Possible oxygenase
Rv1938    4       11       1.084445    Probable epoxide hydrolase EphB (epoxide
Rv1939    6       14       1.043645    Probable oxidoreductase
Rv1940    2       13       0.369256    Probable riboflavin biosynthesis protein RibA1
Rv1941    1       15       -0.799407   Probable short-chain type
Rv1942c   0       16       -1.077593   Possible toxin MazF5
Rv1943c   -1      18       -2.479065   Possible antitoxin MazE5
Rv1944c   -1      20       -2.554107   Conserved protein
Rv1945    -1      20       -3.098326   Conserved hypothetical protein
Rv1946c   -2      24       -3.561927   Possible lipoprotein
Rv1947    0       23       -3.324770   Hypothetical protein
Rv1948c   1       27       -4.213295   Hypothetical protein
Rv1949c   3       28       -2.311710   Conserved hypothetical protein
Rv1950c   5       26       -1.673474   Conserved hypothetical protein
Rv1951c   3       23       -2.090438   Conserved hypothetical protein
Rv1952    3       27       -3.127772   Possible antitoxin VapB14
Rv1953    2       27       -3.611202   Possible toxin VapC14
Rv1954c   4       29       -3.352723   Hypothetical protein
Rv1954A   3       26       -3.672347   Hypothetical protein
Rv1955    4       25       -3.564599   Possible toxin HigB
Rv1956    1       25       -2.243660   Possible antitoxin HigA
Rv1957    3       25       -2.253822   Hypothetical protein
Rv1958c   4       25       -1.794580   Hypothetical protein
Rv1959c   5       16       -0.656139   Possible toxin ParE1
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv1941    DHBDHDRGNASE  120    4e-35    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 120 bits (302), Expect = 4e-35
Identities = 73/257 (28%), Positives = 114/257 (44%), Gaps = 6/257 (2%)

Query: 1 MNHPDLAGKVAIVTGAGAGIGLAVARRLADEGCHVLCADIDGD---AADAAATKIGCGAA 57
MN + GK+A +TGA GIG AVAR LA +G H+ D + + ++ A
Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE 60

Query: 58 ACRVDVSDEQQIIAMVDACVAAFGGVDKLVANAGVVHLASLIDTTVEDFDRVIAINLRGA 117
A DV D I + G +D LV AGV+ + + E+++ ++N G
Sbjct: 61 AFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGV 120

Query: 118 WLCTKHAAPRMIERGGGAIVNLSSLAGQVAVGGTGAYGMSKAGIIQLSRITAAELRSSGI 177
+ ++ + M++R G+IV + S V AY SKA + ++ EL I
Sbjct: 121 FNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNI 180

Query: 178 RSNTLLPAFVDTPMQQTAMAMFDGA---LGAGGARSMIARLQGRMAAPEEMAGIVVFLLS 234
R N + P +T MQ + A +GA + ++A P ++A V+FL+S
Sbjct: 181 RCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVS 240

Query: 235 DDASMITGTTQIADGGT 251
A IT DGG
Sbjct: 241 GQAGHITMHNLCVDGGA 257


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv1944c   SECA          43     5e-07    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 42.6 bits (100), Expect = 5e-07
Identities = 17/55 (30%), Positives = 19/55 (34%), Gaps = 8/55 (14%)

Query: 138 QEPDSPEARAEYAAYLTAHGDHDVMAWP--------PGRNQQCWCGSGHKYKKCC 184
Q E A+ D A GRN C CGSG KYK+C
Sbjct: 843 QRRMEAERLAQMQQLSHQDDDSAAAAALAAQTGERKVGRNDPCPCGSGKKYKQCH 897


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv1946c   BLACTAMASEA   27     0.032    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 27.1 bits (60), Expect = 0.032
Identities = 11/57 (19%), Positives = 21/57 (36%)

Query: 38 LINLSGIQCFARIEHVAHAQAHPFVVLVGKPAQHGARIGAVAGAILTGDVIVSHDGE 94
I L I A + HA P + +Q R+G + + +G + + +
Sbjct: 3 YIRLCIISLLATLPLAVHASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRAD 59


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv1950c   CHLAMIDIAOMP  27     0.005    Chlamydia major outer membrane protein signature.
		>CHLAMIDIAOMP#Chlamydia major outer membrane protein signature.

Length = 393

Score = 26.9 bits (59), Expect = 0.005
Identities = 9/12 (75%), Positives = 9/12 (75%)

Query: 44 IAWEGAGGDGCD 55
I WEG GGD CD
Sbjct: 38 ILWEGFGGDPCD 49


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv1954A   HTHFIS        26     0.019    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 26.3 bits (58), Expect = 0.019
Identities = 8/24 (33%), Positives = 14/24 (58%)

Query: 22 RATAGGMPVLVVIESGTGGDQMAR 45
R + +++ ESGTG + +AR
Sbjct: 155 RLMQTDLTLMITGESGTGKELVAR 178


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv1957    SECBCHAPRONE  27     0.030    Bacterial protein-transport SecB chaperone protein ...
		>SECBCHAPRONE#Bacterial protein-transport SecB chaperone protein signature.

Length = 170

Score = 27.2 bits (60), Expect = 0.030
Identities = 11/24 (45%), Positives = 12/24 (50%)

Query: 137 LYPYIREYVYDLTGRLALPPLTLE 160
L+PY RE V L R P L L
Sbjct: 117 LFPYARELVSSLVNRGTFPALNLS 140


26  Rv1975  Rv1980c  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias     Product
Rv1975    -2      13       -3.384896   Conserved hypothetical protein
Rv1976c   -1      14       -4.224928   Conserved hypothetical protein
Rv1977    0       13       -4.250780   Conserved protein
Rv1978    0       14       -5.016315   Conserved protein
Rv1979c   -1      15       -4.047327   Possible conserved permease
Rv1980c   -2      17       -3.365010   Immunogenic protein Mpt64 (antigen Mpt64/MPB64)
27  Rv2006  Rv2024c  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias     Product
Rv2006    2       15       -2.276992   Probable trehalose-6-phosphate phosphatase OtsB1
Rv2007c   5       22       -2.718132   Ferredoxin FdxA
Rv2008c   3       19       -1.719366   Conserved hypothetical protein
Rv2009    3       21       0.277098    Antitoxin VapB15
Rv2010    1       16       0.468483    Toxin VapC15
Rv2011c   1       16       -0.117974   Conserved hypothetical protein, probable
Rv2012    0       15       -0.469216   Conserved hypothetical protein
Rv2013    0       14       -0.195108   Transposase
Rv2014    0       16       -1.034041   Transposase
Rv2015c   0       15       -1.828482   Conserved hypothetical protein
Rv2016    2       19       -2.534049   Hypothetical protein
Rv2017    1       18       -2.475315   Transcriptional regulatory protein
Rv2018    2       24       -1.777543   Conserved protein
Rv2019    1       24       -5.317079   Conserved protein
Rv2020c   -1      14       -3.170125   Conserved hypothetical protein
Rv2021c   -2      10       -2.479659   Transcriptional regulatory protein
Rv2022c   -2      12       -2.545281   Conserved protein
Rv2023c   0       10       -1.808746   Hypothetical protein
Rv2023A   1       9        -1.647958   Conserved hypothetical protein
Rv2024c   2       8        -0.399568   Conserved membrane protein
28  Rv2060  Rv2082  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
Rv2060    -1      17       3.526906    Possible conserved integral membrane protein
Rv2061c   -1      17       3.932884    Conserved protein
Rv2062c   0       18       4.394456    Cobalamin biosynthesis protein CobN
Rv2063    1       13       2.587704    Antitoxin MazE7
Rv2063A   1       13       2.251004    Possible toxin MazF7
Rv2064    1       14       2.063205    Precorrin-3B synthase CobG
Rv2065    1       13       0.930381    Precorrin-8X methylmutase CobH (aka precorrin
Rv2066    1       12       0.402646    Probable bifunctional protein, CobI-COBJ fusion
Rv2067c   0       15       -0.440906   Conserved protein
Rv2068c   2       12       1.892889    Class a beta-lactamase BlaC
Rv2069    2       13       2.183111    RNA polymerase sigma factor, ECF subfamily,
Rv2070c   1       9        2.281238    Precorrin-6X reductase CobK
Rv2071c   1       10       2.204332    Precorrin-3 methylase CobM (precorrin-4
Rv2072c   1       9        1.684946    Precorrin-6Y C(5,15)-methyltransferase
Rv2073c   1       11       1.299388    Probable shortchain dehydrogenase
Rv2074    2       14       0.096937    Possible pyridoxamine 5'-phosphate oxidase
Rv2075c   -1      13       -0.595665   Possible hypothetical exported or envelope
Rv2076c   -2      17       -2.111136   Conserved hypothetical protein
Rv2077c   -1      18       -1.860326   Possible conserved transmembrane protein
Rv2077A   2       18       -0.811162   Conserved hypothetical protein
Rv2078    2       16       0.050997    Conserved hypothetical protein
Rv2079    2       17       0.188127    Conserved hypothetical protein
Rv2080    2       21       0.733545    Lipoprotein LppJ
Rv2081c   2       22       1.198018    Conserved transmembrane protein
Rv2082    2       22       1.159061    Conserved hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2060    SURFACELAYER  30     0.003    Lactobacillus surface layer protein signature.
		>SURFACELAYER#Lactobacillus surface layer protein signature.

Length = 439

Score = 30.0 bits (67), Expect = 0.003
Identities = 14/25 (56%), Positives = 17/25 (68%)

Query: 66 SLLITPAAAAARVVVAPVAAIATSV 90
+L I AAAAA + VAP+AA A V
Sbjct: 4 NLRIVSAAAAALLAVAPIAATAMPV 28


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2068c   BLACTAMASEA   320    e-112    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 320 bits (821), Expect = e-112
Identities = 94/273 (34%), Positives = 136/273 (49%), Gaps = 8/273 (2%)

Query: 33 PASTTLPAGADLADRFAELERRYDARLGVYVPATGTTAAIE-YRADERFAFCSTFKAPLV 91
+ A ++ E + R+G+ + + +RADERF STFK L
Sbjct: 14 TLPLAVHASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADERFPMMSTFKVVLC 73

Query: 92 AAVLHQNPLT--HLDKLITYTSDDIRSISPVAQQHVQTGMTIGQLCDAAIRYSDGTAANL 149
AVL + L++ I Y D+ SPV+++H+ GMT+G+LC AAI SD +AANL
Sbjct: 74 GAVLARVDAGDEQLERKIHYRQQDLVDYSPVSEKHLADGMTVGELCAAAITMSDNSAANL 133

Query: 150 LLADLGGPGGGTAAFTGYLRSLGDTVSRLDAEEPELNRDPPGDERDTTTPHAIALVLQQL 209
LLA +GGP A T +LR +GD V+RLD E ELN PGD RDTTTP ++A L++L
Sbjct: 134 LLATVGGP----AGLTAFLRQIGDNVTRLDRWETELNEALPGDARDTTTPASMAATLRKL 189

Query: 210 VLGNALPPDKRALLTDWMARNTTGAKRIRAGFPADWKVIDKTGTGDYGRANDIAVVWSPT 269
+ L + L WM + IR+ PA W + DKTG G+ G +A++
Sbjct: 190 LTSQRLSARSQRQLLQWMVDDRVAGPLIRSVLPAGWFIADKTGAGERGARGIVALLGPNN 249

Query: 270 GVPYVVAVMSDRAGGGYDAEPREALLAEAATCV 302
+V + R AE + + A +
Sbjct: 250 KAERIVVIYL-RDTPASMAERNQQIAGIGAALI 281


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2073c   DHBDHDRGNASE  42     1e-06    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 41.6 bits (97), Expect = 1e-06
Identities = 41/178 (23%), Positives = 59/178 (33%)

Query: 9 VVIFGGRSQIGGELARRLAAGATMVLAARNADQLADQAAALRAAGAIAVHTREFDADDLA 68
I G IG +AR LA+ + A + ++ + A A D D A
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSA 70

Query: 69 AHGPLVASLVAEHGPIGTAVLAFGILGDQARAETDAAHAVAIVHTDYVAQVSLLTHLAAA 128
A + A + E GPI V G+L A + + ++
Sbjct: 71 AIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSKY 130

Query: 129 MRTAGRGSLVVFSSVAGIRVRRANYVYGSAKAGLDGFASGLADALHGTGVRLLIARPG 186
M GS+V S R + Y S+KA F L L +R I PG
Sbjct: 131 MMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2082    PF03544       31     0.018    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 30.7 bits (69), Expect = 0.018
Identities = 16/90 (17%), Positives = 24/90 (26%)

Query: 379 APTPSPAPIAPPTTDNASAMTPIAPMVANGPPASPAPPAAAPAGPLPAYGADLRPPVTTP 438
A P P P+ P + P P P PV +
Sbjct: 65 AVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESR 124

Query: 439 PATPPTPTGPISGAAVTPSSPAAGGSLMSP 468
PA+P T P + T ++ +
Sbjct: 125 PASPFENTAPARPTSSTATAATSKPVTSVA 154


29  Rv2095c  Rv2106  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv2095c   -2      11       3.020726   Proteasome accessory factor C PafC
Rv2096c   -2      13       3.207404   Proteasome accessory factor B PafB
Rv2097c   -1      15       3.399826   Proteasome accessory factor a PafA
Rv2098c   -1      13       4.236626   Conserved hypothetical protein
Rv2099c   -1      12       2.824972   Probable helicase HelZ
Rv2100    0       12       2.651619   Conserved hypothetical protein
Rv2101    1       12       1.827999   Possible toxin VapC37. Contains PIN domain.
Rv2102    1       14       -0.854069  Possible antitoxin VapB37
Rv2104c   1       16       -3.505515  Putative transposase for insertion sequence
Rv2106    1       14       -3.612324  Probable transposase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv2100    PF05616  30     0.018    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 30.5 bits (68), Expect = 0.018
Identities = 16/40 (40%), Positives = 20/40 (50%), Gaps = 2/40 (5%)

Query: 308 ASALDAQPDPHLSGDEPPSRPLTPETTLFEALTPDPEPDP 347
A A +AQP P +S E P+ P P+PEPDP
Sbjct: 320 AEAPNAQPLPEVSPAENPANNPAPNEN--PGTRPNPEPDP 357


30  Rv2142c  Rv2167c  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv2142c   3       17       -1.677567  *Possible toxin ParE2
Rv2142A   4       15       -0.949592  Possible antitoxin ParD2
Rv2143    1       15       -0.983545  Conserved hypothetical protein
Rv2144c   -1      13       -0.966306  Probable transmembrane protein
Rv2145c   0       12       -0.662245  Diviva family protein Wag31
Rv2146c   2       11       0.013910   Possible conserved transmembrane protein
Rv2147c   1       9        0.786687   Conserved hypothetical protein
Rv2148c   1       10       2.427505   Conserved protein
Rv2149c   -1      11       2.760828   Conserved protein YfiH
Rv2150c   -2      11       3.116761   Cell division protein FtsZ
Rv2151c   -2      12       3.019169   Possible cell division protein FtsQ
Rv2152c   1       13       3.590127   Probable UDP-N-acetylmuramate-alanine ligase
Rv2153c   1       13       4.597556   Probable
Rv2154c   1       13       4.387468   FtsW-like protein FtsW
Rv2155c   2       13       4.985489   Probable UDP-N-acetylmuramoylalanine-D-glutamate
Rv2156c   1       14       5.116651   Probable
Rv2157c   2       16       6.383208   Probable
Rv2158c   1       17       7.751616   Probable
Rv2159c   0       13       5.463494   Conserved protein
Rv2160A   0       13       5.303338   Conserved hypothetical protein
Rv2160c   0       10       4.383646   Conserved hypothetical protein
Rv2161c   0       9        3.844756   Conserved protein
Rv2162c   0       9        2.714417   PE-PGRS family protein PE_PGRS38
Rv2163c   0       12       0.427057   Probable penicillin-binding membrane protein
Rv2164c   1       17       0.258071   Probable conserved proline rich membrane
Rv2165c   1       17       -1.274383  Conserved protein
Rv2166c   1       16       -1.400616  Conserved protein
Rv2167c   2       15       -1.418772  Probable transposase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
Rv2145c   RTXTOXIND  29     0.020    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 29.0 bits (65), Expect = 0.020
Identities = 12/93 (12%), Positives = 28/93 (30%), Gaps = 10/93 (10%)

Query: 124 ESDKMLADARANAEQILGEARHTADATVAEARQRADAMLADAQSRSEAQLRQAQEKADAL 183
E + +A + + A++ + ++ +LRQ + L
Sbjct: 256 EQENKYVEAVNELRVYKSQLEQIESE-ILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLL 314

Query: 184 Q-----ADAERKHSEIM----GTINQQRAVLEG 207
+ ++ S I + Q + EG
Sbjct: 315 TLELAKNEERQQASVIRAPVSVKVQQLKVHTEG 347


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2151c   TYPE3IMSPROT  29     0.029    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein

family signature.
Length = 354

Score = 28.6 bits (64), Expect = 0.029
Identities = 11/41 (26%), Positives = 18/41 (43%), Gaps = 5/41 (12%)

Query: 30 ESKDEPAE---HPEFEGPRRRARRERAERRAAQA--RATAI 65
E K E E PE + RR+ +E R + R++ +
Sbjct: 220 EIKREYKEMEGSPEIKSKRRQFHQEIQSRNMRENVKRSSVV 260


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv2154c   PF03544  29     0.038    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 29.2 bits (65), Expect = 0.038
Identities = 25/149 (16%), Positives = 45/149 (30%), Gaps = 7/149 (4%)

Query: 368 FINIGYVIGLLPVTGLQLPLISAGGTSTAATLSLIGIIANAARHEPEAVAALRAGRDDKV 427
I+ V GLL + Q+ + A + T+ +A A P+AV
Sbjct: 23 CIHGAVVAGLLYTSVHQVIELPAPAQPISVTM-----VAPADLEPPQAVQPPPE--PVVE 75

Query: 428 NRLLRLPLPEPYLPPRLEAFRDRKRANPQPAQTQPARKTPRTAPGQPARQMGLPPRPGSP 487
P+PEP + + + + P+P + + R +R
Sbjct: 76 PEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPA 135

Query: 488 RTADPPVRRSVHHGAGQRYAGQRRTRRVR 516
R + +G R R +
Sbjct: 136 RPTSSTATAATSKPVTSVASGPRALSRNQ 164


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv2160A   HTHTETR  60     2e-13    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 60.4 bits (146), Expect = 2e-13
Identities = 21/100 (21%), Positives = 43/100 (43%), Gaps = 1/100 (1%)

Query: 2 PSADVGRQTRAQILRAAMDIASVKGLSGLSIGELAGRLGMSKSGLFRHFGAKEQLQLATV 61
+ ++TR IL A+ + S +G+S S+GE+A G+++ ++ HF K L
Sbjct: 4 KTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIW 63

Query: 62 EAAVSVFEAEVVAPAMAAPPG-VDRVRALMHAWVGYLERD 100
E + S + P + +R ++ + +
Sbjct: 64 ELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTE 103


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv2162c   cloacin  43     2e-06    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 43.2 bits (101), Expect = 2e-06
Identities = 38/119 (31%), Positives = 47/119 (39%), Gaps = 2/119 (1%)

Query: 255 AGGNGSNGVTGVHGGNGGAGGAAGLIGNGGAGGDGGNGGLSNTGASGGAGGAGGAALIGN 314
+GG+G TG H +G G +G GG DG N GG+G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 315 GGDGGHGGNGGHGNSGGAGGAGGAGGAGGAGGHVGLIGNGGNGGAGGNGGNDNSSTLAD 373
G+GG GN G G+ G GG A A A G L G G A S+ +AD
Sbjct: 62 HGNGGGNGNSGGGS--GTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 35.5 bits (81), Expect = 5e-04
Identities = 36/88 (40%), Positives = 40/88 (45%), Gaps = 5/88 (5%)

Query: 127 NGGPGGLLYGNGGNGGAGDTANPNGGNGGSAGLIGNGGAGGAGAATGAGGAGGNGGWLYG 186
NGGP GL G G + G+G ++ N GGS I GG G G GG GN G G
Sbjct: 21 NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH----GNGGGNGNSGGGSG 76

Query: 187 NGGPGGAAGLGTAGGVSPAGGAGGAAGL 214
GG A A G PA GA GL
Sbjct: 77 TGGNLSAVAAPVAFGF-PALSTPGAGGL 103



Score = 35.1 bits (80), Expect = 7e-04
Identities = 34/93 (36%), Positives = 36/93 (38%), Gaps = 11/93 (11%)

Query: 352 GNGGNGGAGGNGGNDNSSTLADAGSGGAGAAGGNGGLFYGNGGVGGRGGNGGFSSAGTSG 411
G G N GA GN N GG G GG G+G GG S +G
Sbjct: 6 GRGHNTGAHSTSGNIN---------GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHW 56

Query: 412 GDGGIGGAGGIGGLIGSGGGGGDGGNGGQAPTP 444
G G G GG G SGGG G GGN P
Sbjct: 57 GGGSGHGNGGGNG--NSGGGSGTGGNLSAVAAP 87



Score = 35.1 bits (80), Expect = 8e-04
Identities = 37/120 (30%), Positives = 46/120 (38%), Gaps = 8/120 (6%)

Query: 289 GGNGGLSNTGASGGAGGAGGAALIGNGGDGGHGGNGGHGNSGGAGGAGGA-GGAGGAGGH 347
GG+G NTGA +G NGG G G GG + G GG G+G H
Sbjct: 3 GGDGRGHNTGAHSTSGNI-------NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIH 55

Query: 348 VGLIGNGGNGGAGGNGGNDNSSTLADAGSGGAGAAGGNGGLFYGNGGVGGRGGNGGFSSA 407
G GNGG GN G + + + A G G GG+ G S+A
Sbjct: 56 WGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 34.3 bits (78), Expect = 0.001
Identities = 35/92 (38%), Positives = 38/92 (41%), Gaps = 4/92 (4%)

Query: 144 GDTANPNGGNGGSAGLIGNGGAGGAGAATGAGGAGGNGGWLYGNGGPGGAAGLGTAGGVS 203
GD N G ++G I NGG G G GGA GW N GG +G G G
Sbjct: 4 GDGRGHNTGAHSTSGNI-NGGPTGLG---VGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 204 PAGGAGGAAGLWGHGGAGGAGGSASGAPGAGG 235
G GG G G G G SA AP A G
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 34.3 bits (78), Expect = 0.001
Identities = 27/90 (30%), Positives = 36/90 (40%), Gaps = 2/90 (2%)

Query: 228 SGAPGAGGAGGDGGRGGLLYGDGGAGGAGGNGSNGV--TGVHGGNGGAGGAAGLIGNGGA 285
SG G G G G + G G GG S+G + + GG G+ G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 286 GGDGGNGGLSNTGASGGAGGAGGAALIGNG 315
G+GG G S G+ G + AA + G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 33.5 bits (76), Expect = 0.002
Identities = 24/80 (30%), Positives = 30/80 (37%)

Query: 331 GAGGAGGAGGAGGAGGHVGLIGNGGNGGAGGNGGNDNSSTLADAGSGGAGAAGGNGGLFY 390
G G G GA G++ G G G + G+ SS G G GG +
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 391 GNGGVGGRGGNGGFSSAGTS 410
GNGG G G G + S
Sbjct: 63 GNGGGNGNSGGGSGTGGNLS 82



Score = 32.0 bits (72), Expect = 0.006
Identities = 37/121 (30%), Positives = 43/121 (35%), Gaps = 15/121 (12%)

Query: 394 GVGGRGGNGGFSSAGTSGGDGGIGGAGGIGGLIGSGGGGGDGGNGGQAPTPGNAGDGGAG 453
G GRG N G S G I G G GG +G + N GG+G
Sbjct: 3 GGDGRGHNTGAHSTS-----------GNINGGPTGLGVGGGASDGSGWSSENNPWGGGSG 51

Query: 454 GNARLIGDGGRGGNGGEGGDGPPGVKGDGGNGGNGGNAVVIGNGG--NGGAGGFGIPVGS 511
G G G GG G G G G GGN V G GAGG + + +
Sbjct: 52 SGIHWGGGSGHGNGGGNGNSG--GGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109

Query: 512 G 512
G
Sbjct: 110 G 110



Score = 32.0 bits (72), Expect = 0.006
Identities = 31/114 (27%), Positives = 37/114 (32%), Gaps = 13/114 (11%)

Query: 210 GAAGLWGHGGAGGAGGSASGAPGAGGAGGDGGRGGLLYGDGGAGGAGGNGSNGVTGVHGG 269
G G + GA G+ +G P G GG G + G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 270 NGGAGGAAGLIGNGGAGGDGGNGGLSNTGASGGAG-------GAGGAALIGNGG 316
G G GG G G LS A G GAGG A+ + G
Sbjct: 63 GNGGG------NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 32.0 bits (72), Expect = 0.006
Identities = 35/100 (35%), Positives = 45/100 (45%), Gaps = 9/100 (9%)

Query: 249 DGGAGGAGGNGSNGVTGVHGGNGGAGGAAGLIGNGGAGGDGGNGGLSNTGASGGAGGAGG 308
+ GA GN + G TG+ G G + G+ N GG G+G G+ G GG
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGG--- 66

Query: 309 AALIGNGGDGGHGGNGGHGNSGGAGGAGG--AGGAGGAGG 346
GNG GG G GG+ ++ A A G A GAGG
Sbjct: 67 ----GNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 31.6 bits (71), Expect = 0.008
Identities = 36/104 (34%), Positives = 42/104 (40%), Gaps = 7/104 (6%)

Query: 136 GNGGNGGAGDTA-NPNGGNGGSAGLIG-NGGAGGAGAATGAGGAGGNGGWLYGNGGPGGA 193
G G N GA T+ N NGG G G + G+G + GG G+G G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 194 AGLGTAGGVSPAGGAGGAAGLWGHGGAGGAGGSASGAPGAGGAG 237
G G +GG S GG A G A PGAGG
Sbjct: 66 GGNGNSGGGSGTGGNLSAV-----AAPVAFGFPALSTPGAGGLA 104



Score = 30.5 bits (68), Expect = 0.018
Identities = 36/106 (33%), Positives = 37/106 (34%), Gaps = 17/106 (16%)

Query: 164 GAGGAGAATGAGGAGGNGGWLYGNGGPGGAAGLGTAGGVSPAGGAGGAAGLWGHGGAGGA 223
G G G TGA GN NGGP G LG GG S G WG G G
Sbjct: 3 GGDGRGHNTGAHSTSGNI-----NGGPTG---LGVGGGASDGSGWSSENNPWGGGSGSGI 54

Query: 224 GGSASGAPGAGGAGGDGGRGGLLYGDGGAGGAGGNGSNGVTGVHGG 269
G GG G+ G G G GGN S V G
Sbjct: 55 HWGGGSGHGNGGGNGNSG---------GGSGTGGNLSAVAAPVAFG 91



Score = 30.5 bits (68), Expect = 0.021
Identities = 30/98 (30%), Positives = 33/98 (33%), Gaps = 9/98 (9%)

Query: 430 GGGGDGGNGGQAPTPGN--AGDGGAGGNARLIGDGGRGGNGGEGGDGPPGVKGDGGNGGN 487
GG G G N G T GN G G G G G G GG G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 488 GGNAVVIGNGGNGGAGGFGIPVGSGGAGGSRGVLFGTP 525
GNGG G G G G + + V FG P
Sbjct: 63 -------GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFP 93


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
Rv2164c   PERTACTIN  34     0.001    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 33.5 bits (76), Expect = 0.001
Identities = 20/68 (29%), Positives = 28/68 (41%), Gaps = 1/68 (1%)

Query: 201 LVQDPDGNWVVVGTPKPADGVPPPPLNTKLPEDPPPPPKPAAVPLEVPVRVTPGPDDPAP 260
L + +G W +VG P P P + P PP P P + P P+ PAP
Sbjct: 552 LAANGNGQWSLVGAKAPPAPKPAPQPGPQ-PGPQPPQPPQPPQPPQPPQPPQRQPEAPAP 610

Query: 261 PARSGPEV 268
+G E+
Sbjct: 611 QPPAGREL 618


31  Rv2266  Rv2274A  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv2266    -2      17       -4.107567  Probable cytochrome P450 124 Cyp124
Rv2267c   -2      20       -3.910103  Conserved hypothetical protein
Rv2268c   -1      20       -1.662910  Probable cytochrome P450 128 Cyp128
Rv2269c   1       25       -2.214867  Hypothetical protein
Rv2270    1       26       -2.048051  Probable lipoprotein LppN
Rv2271    0       27       -2.842957  Conserved hypothetical protein
Rv2272    2       21       -2.867387  Probable conserved transmembrane protein
Rv2273    1       21       -3.630063  Probable conserved transmembrane protein
Rv2274c   2       21       -3.711182  Possible toxin MazF8
Rv2274A   1       18       -3.248137  Possible antitoxin MazE8
32  Rv2294  Rv2308  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv2294    2       10       -1.610265  Probable aminotransferase
Rv2295    2       11       -1.626097  Conserved hypothetical protein
Rv2296    1       13       -1.468524  Probable haloalkane dehalogenase
Rv2297    0       13       -1.712341  Unknown protein
Rv2298    0       14       -1.998037  Conserved protein
Rv2299c   1       15       -2.343475  Probable chaperone protein HtpG (heat shock
Rv2300c   0       16       -0.488339  Conserved protein
Rv2301    0       15       0.010797   Probable cutinase Cut2
Rv2302    3       14       0.309566   Conserved protein
Rv2303c   2       12       0.583433   Probable antibiotic-resistance protein
Rv2304c   1       14       1.133975   Hypothetical protein
Rv2305    -1      16       -0.233380  Unknown protein
Rv2306A   1       25       -1.132043  Possible conserved membrane protein
Rv2306B   1       22       -2.296431  Possible conserved membrane protein
Rv2307c   1       23       -2.981362  Conserved hypothetical protein
Rv2307A   1       33       -5.403317  Hypothetical glycine rich protein
Rv2307B   1       27       -3.732749  Hypothetical glycine rich protein
Rv2307D   2       20       -1.497780  Hypothetical protein
Rv2308    3       21       -1.921750  Conserved hypothetical protein
33  Rv2324  Rv2356c  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv2324    2       13       -0.066471  Probable transcriptional regulatory protein
Rv2325c   2       14       -0.326573  Conserved hypothetical protein
Rv2326c   2       13       -0.601887  Possible transmembrane ATP-binding protein ABC
Rv2327    2       20       -2.088906  Conserved protein
Rv2328    0       16       -2.856772  PE family protein PE23
Rv2329c   0       12       -2.545668  Probable nitrite extrusion protein 1 NarK1
Rv2330c   1       12       -1.745162  Probable lipoprotein LppP
Rv2331    2       13       -0.941707  Hypothetical protein
Rv2331A   1       13       -2.202054  Hypothetical protein
Rv2332    0       13       -2.099095  Probable [NAD] dependent malate oxidoreductase
Rv2333c   1       18       -3.305613  Integral membrane drug efflux protein Stp
Rv2334    1       23       -5.489575  Cysteine synthase a CysK1 (O-acetylserine
Rv2335    1       24       -4.430071  Probable serine acetyltransferase CysE (sat)
Rv2336    1       26       -4.970340  Hypothetical protein
Rv2337c   1       24       -4.125176  Hypothetical protein
Rv2338c   0       16       -3.172710  Possible molybdopterin biosynthesis protein
Rv2339    -1      11       -1.598079  Probable conserved transmembrane transport
Rv2340c   -1      9        1.794948   PE-PGRS family protein PE_PGRS39
Rv2341    0       10       0.948036   *Probable conserved lipoprotein LppQ
Rv2342    0       10       0.918554   Conserved hypothetical protein
Rv2343c   -1      9        0.998730   Probable DNA primase DnaG
Rv2344c   -3      9        -0.350337  Probable deoxyguanosine triphosphate
Rv2345    -1      8        -1.395633  Possible conserved transmembrane protein
Rv2346c   2       14       -3.038965  Putative ESAT-6 like protein EsxO (ESAT-6 like
Rv2347c   3       19       -1.853428  Putative ESAT-6 like protein EsxP (ESAT-6 like
Rv2348c   5       17       -2.813358  Hypothetical protein
Rv2349c   5       18       -2.897005  Probable phospholipase C 3 PlcC
Rv2350c   3       14       -2.486922  Membrane-associated phospholipase C 2 PlcB
Rv2351c   7       18       -2.802935  Membrane-associated phospholipase C 1 PlcA
Rv2352c   5       22       -3.145120  PPE family protein PPE38
Rv2353c   4       12       -4.208078  PPE family protein PPE39
Rv2354    0       14       -3.264091  Probable transposase for insertion sequence
Rv2355    0       13       -3.316972  Probable transposase
Rv2356c   1       14       -3.278230  PPE family protein PPE40
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv2333c   TCRTETB  150    3e-42    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 150 bits (381), Expect = 3e-42
Identities = 84/398 (21%), Positives = 176/398 (44%), Gaps = 14/398 (3%)

Query: 17 FMIFLDALIVNVALPDIQRSFAVGEDGLQWVVASYSLGMAVFIMSAATLADLDGRRRWYL 76
F L+ +++NV+LPDI F WV ++ L ++ L+D G +R L
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 77 IGVSLFTLGSIACGLAPSIA-VLTTARGAQGLGAAAVSVTSLALVSAAFPEAKEKARAIG 135
G+ + GS+ + S +L AR QG GAAA + +V A + + + +A G
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVV-ARYIPKENRGKAFG 142

Query: 136 IWTAIASIGTTTGPTLGGLLVDQWGWRSIFYVNLPMGALVLFLTLCYVEESCNERARRFD 195
+ +I ++G GP +GG++ W + + +PM ++ L + + FD
Sbjct: 143 LIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKEVRIKGHFD 200

Query: 196 LSGQLLFIVAVGALVYAVIEGPQIGWTSVQTIVMLWTAAVGCALFVWLERRSSNPMMDLT 255
+ G +L V + + + + +++ + +FV R+ ++P +D
Sbjct: 201 IKGIILMSVGIVFFMLFTT-------SYSISFLIVSVLSFL--IFVKHIRKVTDPFVDPG 251

Query: 256 LFRDTSYALAIATICTVFFAVYGMLLLTTQFLQNVRGYTPSVTG-LMILPFSAAVAIVSP 314
L ++ + + + +F V G + + +++V + + G ++I P + +V I
Sbjct: 252 LGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGY 311

Query: 315 LVGHLVGRIGARVPILAGLCMLMLGLLMLIFSEHRSSALVLVGLGLCGSGVALCLTPITT 374
+ G LV R G + G+ L + L F +S + + + G++ T I+T
Sbjct: 312 IGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVIST 371

Query: 375 VAMTAVPAERAGMASGIMSAQRAIGSTIGFAVLGSVLA 412
+ +++ + AG +++ + G A++G +L+
Sbjct: 372 IVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2339    ACRIFLAVINRP  44     2e-06    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 44.4 bits (105), Expect = 2e-06
Identities = 39/211 (18%), Positives = 82/211 (38%), Gaps = 30/211 (14%)

Query: 159 ESVEAVRKIVANSTP--PEGIRTYVTGPAALFADQIAAGDRSMKLITGLTFAVITVLLLL 216
++ +A++ +A P P+G++ F Q++ + L + + + L L
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFV-QLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 217 VYRSIATTLLILPMVFIGLGATRGTIAFLGYHGMVGLSTFVVNILT----ALAIAAGTDY 272
+++ TL+ V + L T +A G + +N LT LAI D
Sbjct: 360 --QNMRATLIPTIAVPVVLLGTFAILAAFG---------YSINTLTMFGMVLAIGLLVDD 408

Query: 273 AIFLVGRYQEARHIGQNREASFYTMYRGTANV---ILGSGLTIAGATYCLSFARLT---- 325
AI +V + R + +++ + + + ++G + ++ + A
Sbjct: 409 AIVVVENVE--RVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVF--IPMAFFGGSTG 464

Query: 326 -LFHTMGPPLAIGMLVSVAAALTLAPAIIAI 355
++ + M +SV AL L PA+ A
Sbjct: 465 AIYRQFSITIVSAMALSVLVALILTPALCAT 495



Score = 43.7 bits (103), Expect = 4e-06
Identities = 36/161 (22%), Positives = 63/161 (39%), Gaps = 15/161 (9%)

Query: 774 GIAAVCLVFIVMLMITQSLIASLVIVGTVLLSLGTAFGLSVLIWQHFVGLQVH-WTIVAM 832
A+ LVF+VM + Q++ A+L+ V + L F + G ++ T+ M
Sbjct: 344 LFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAI-----LAAFGYSINTLTMFGM 398

Query: 833 SVIVLLAVGSDYNLLLV---SRFKEEVGAGLKTGIIRAMAGTGAVVTSAGLVFAFT---M 886
+ + L V D +++V R E K ++M+ + +V + M
Sbjct: 399 VLAIGLLV--DDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPM 456

Query: 887 ASMAVSELRVIGQVGTTIGLGLLFDTLVVRSFMTPSIAALL 927
A S + Q TI + LV TP++ A L
Sbjct: 457 AFFGGSTGAIYRQFSITIVSAMALSVLVALIL-TPALCATL 496


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv2340c   cloacin  34     0.001    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 33.9 bits (77), Expect = 0.001
Identities = 25/83 (30%), Positives = 37/83 (44%)

Query: 179 HGSGATGAAGVADPGGSGAGVGSAAGNGTGAGSADAVGGAGTGRDIVGSVRGDGGVGMAS 238
H +GA +G + G +G GVG A +G+G S + G G+G I G G +
Sbjct: 9 HNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGN 68

Query: 239 GDGGLSTGAAGASAEGGLMPGFG 261
G+ G +G G + FG
Sbjct: 69 GNSGGGSGTGGNLSAVAAPVAFG 91



Score = 30.8 bits (69), Expect = 0.011
Identities = 26/100 (26%), Positives = 35/100 (35%), Gaps = 3/100 (3%)

Query: 113 TGGTGTGQSRGAGGFGGVGQAGGKGWDGGPIGNGQ---VGEQHGAGQLGSTDGNPGVAGA 169
T G G G G GG G + P G G + G+G G+
Sbjct: 16 TSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGS 75

Query: 170 AHGSGVSASHGSGATGAAGVADPGGSGAGVGSAAGNGTGA 209
G +SA A G ++ PG G V +AG + A
Sbjct: 76 GTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 29.7 bits (66), Expect = 0.024
Identities = 37/123 (30%), Positives = 45/123 (36%), Gaps = 3/123 (2%)

Query: 208 GAGSADAVGGAGTGRDIVGSVRGDGGVGMASGDGGLSTGAAGASAEGGLMPGFGGAPWVG 267
G G G T +I G G G G AS G S+ G +GG G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 268 GHWGLGGEG-HSGAIGGVGEQVAPAVATAPAVSP--ATTSAVAAESGSTPATKAQAMHAT 324
G G G SG G + AP PA+S A AV+ +G+ A A M A
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMAAL 123

Query: 325 TNP 327
P
Sbjct: 124 KGP 126


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
Rv2345    GPOSANCHOR  41     1e-05    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 40.8 bits (95), Expect = 1e-05
Identities = 52/344 (15%), Positives = 109/344 (31%), Gaps = 39/344 (11%)

Query: 224 VVDVDNAVRTSTNELALAIEEFGERRTAPFTQAVNNAKAALSQAFTVRQQLDDNTPETPA 283
+N ++ ++L+ + + + ++NAK L + + E A
Sbjct: 62 FEIENNTLKLKNSDLSFNNKALKDHNDELTEE-LSNAKEKLRKNDKSLSEKASKIQELEA 120

Query: 284 QRRELLTRVIVSAAHADRELASQTEAFEKLRDLVINAPARLDLLTQQYVELTTRIGPTQQ 343
++ +L + + A + +++ + E + + A L+ + + +T +
Sbjct: 121 RKADL-EKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIK 179

Query: 344 RLAELHTEFDAAAMTSIAGNVTTATERLAFADRNISAARDLADQAVSGRQAGLVDAVRAA 403
L +A + L A +A + + A L
Sbjct: 180 TLEAEKAALEARQ--------AELEKALEGAMNFSTADSAKIKTLEAEKAA-LAARKADL 230

Query: 404 ESALGQARALLDAVDSAATDIRHAVASLPAVVADIQTGIKRANQHLQQAQQPQTGRTGDL 463
E AL A A + + A+L A A+++ ++ A +
Sbjct: 231 EKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAE- 289

Query: 464 IAARDAAARALDRARGAADPLTAFDQLTKVDADLD---RLLATLAEEQATADRLNRSLEQ 520
AA +A L+ + + DLD L E + N+ E
Sbjct: 290 KAALEAEKADLEHQSQVLNA-----NRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEA 344

Query: 521 ALFTAESRVRAVSEYIDTRRGSIGPEARTRLAEAKRQLEAAHDR 564
+ +++ +D R EAK+QLEA H +
Sbjct: 345 SR-------QSLRRDLDASR------------EAKKQLEAEHQK 369


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv2353c   cloacin  36     2e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 35.8 bits (82), Expect = 2e-04
Identities = 27/79 (34%), Positives = 34/79 (43%)

Query: 24 GSGNIGSTNVGSGNIGDTNFGNGNNGNFNFGSGNTGSNNIGFGNTGSGNFGFGNTGNNNI 83
G G+ + SGNI G G G + GSG + NN G +GSG G +G+ N
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 84 GIGLTGDGQIGIGGLNSGS 102
G G G GG S
Sbjct: 66 GGNGNSGGGSGTGGNLSAV 84



Score = 31.2 bits (70), Expect = 0.007
Identities = 27/88 (30%), Positives = 35/88 (39%)

Query: 89 GDGQIGIGGLNSGSGNIGFGNSGTGNVGLFNSGTGNVGFGNSGTANTGFGNAGNVNTGFW 148
GDG+ G +S SGNI G +G G G + G+G N +G G +G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 149 NGGSTNTGLANAGAGNTGFFDAGNYNFG 176
NGG +G G A FG
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFG 91


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv2356c   cloacin  36     5e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 35.8 bits (82), Expect = 5e-04
Identities = 29/91 (31%), Positives = 37/91 (40%)

Query: 209 GVGNIGSLNLGSGNIGGTNVGSGNVGGTNLGSGNYGSLNWGSGNTGTGNAGSGNTGDYNP 268
G G+ + SGNI G G G GG + GSG N G +G+G G +G N
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 269 GSGNFGSGNFGSGNIGSLNVGSGNFGTLNLA 299
G G G+G S FG L+
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALS 96



Score = 33.9 bits (77), Expect = 0.002
Identities = 29/84 (34%), Positives = 36/84 (42%), Gaps = 9/84 (10%)

Query: 229 GSGNVGGTNLGSGNYGSLNWGSGNTGTGNAGSGNTGDYNPGSGNFGSGNFGSGNIGSLNV 288
G G+ G + SGN G G G + GSG + + NP G GSG G G N
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 289 GSGNFGTLNLANGNNGDVNFGGGN 312
G NGN+G + GGN
Sbjct: 66 GG---------NGNSGGGSGTGGN 80



Score = 33.1 bits (75), Expect = 0.004
Identities = 23/82 (28%), Positives = 32/82 (39%)

Query: 465 AGDSNTGFANAGNVNTGFFNGGDINTGGFNGGNVNTGFGSALTQAGANSGFGNLGTGNSG 524
+G G + +G NGG G G + +G+ S G SG G G SG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 525 WGNSDPSGTGNSGFFNTGNGNS 546
GN +G G GN ++
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSA 83



Score = 30.8 bits (69), Expect = 0.016
Identities = 27/83 (32%), Positives = 34/83 (40%), Gaps = 4/83 (4%)

Query: 255 TGNAGSGNTGDYNPGSGNFGSGNFGSGNIGSLNVGSGNFGTLNLANGNNGDVNFGGGNTG 314
+G G G+ + SGN G G G G + GSG N G +G GG +G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 315 DFNFGGGNNGTLNFGFGNTGSGN 337
N GG N G G+ GN
Sbjct: 62 HGNGGGNGNS----GGGSGTGGN 80


34  Rv2421c  Rv2441c  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv2421c   2       12       -0.196936  Probable nicotinate-nucleotide
Rv2422    1       13       0.345002   Hypothetical protein
Rv2423    1       12       0.334716   Hypothetical protein
Rv2424c   2       14       0.317167   Probable transposase
Rv2425c   2       13       -0.203713  Conserved hypothetical protein
Rv2426c   2       13       -1.269729  Conserved hypothetical protein
Rv2427c   0       12       -2.190887  Probable gamma-glutamyl phosphate reductase
Rv2427A   1       22       -3.852851  Alkyl hydroperoxide reductase C protein AhpC
Rv2428    1       23       -4.361522  Alkyl hydroperoxide reductase D protein AhpD
Rv2429    2       21       -3.491761  PPE family protein PPE41
Rv2430c   0       19       -4.626574  PE family protein PE25
Rv2431c   -1      18       -3.244156  Hypothetical protein
Rv2432c   -1      17       -3.018356  Hypothetical protein
Rv2433c   -2      13       -2.858130  Probable conserved transmembrane protein
Rv2435c   -1      12       -1.681036  Probable cyclase (adenylyl-or
Rv2436    0       10       0.865908   Ribokinase RbsK
Rv2437    1       10       0.518221   Conserved transmembrane protein
Rv2438c   1       12       0.348740   Glutamine-dependent NAD(+) synthetase NadE
Rv2438A   0       12       0.504325   Conserved hypothetical protein
Rv2439c   2       9        0.382081   Probable glutamate 5-kinase protein ProB
Rv2440c   3       9        -0.457381  Probable GTP1/Obg-family GTP-binding protein
Rv2441c   2       9        -0.968234  50S ribosomal protein L27 RpmA
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2421c   LPSBIOSNTHSS  34     3e-04    Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein

signature.
Length = 166

Score = 33.6 bits (77), Expect = 3e-04
Identities = 18/63 (28%), Positives = 26/63 (41%), Gaps = 4/63 (6%)

Query: 3 GTFDPIHYGHLVAASEVADLFDLDEVVFVPSGQPWQKGRQVSAAEHRYLMTVIATASNPR 62
G+FDPI +GHL LF D+V P ++ + + + R A A P
Sbjct: 7 GSFDPITFGHLDIIERGCRLF--DQVYVAVLRNPNKQP--MFSVQERLEQIAKAIAHLPN 62

Query: 63 FSV 65
V
Sbjct: 63 AQV 65


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2439c   CARBMTKINASE  40     8e-06    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 40.2 bits (94), Expect = 8e-06
Identities = 25/97 (25%), Positives = 38/97 (39%), Gaps = 6/97 (6%)

Query: 156 DNDRLSALVAHLVGADALVLLSDIDGLYDCDPRKTADATFIPEVSGPADLDGVVAGRSSH 215
D D +A V AD ++L+D++G T ++ EV +L H
Sbjct: 214 DKDLAGEKLAEEVNADIFMILTDVNGA--ALYYGTEKEQWLREVK-VEELRKYYE--EGH 268

Query: 216 LGTGGMASKVAAALLAADA-GVPVLLAPAADAATALA 251
G M KV AA+ + G ++A A AL
Sbjct: 269 FKAGSMGPKVLAAIRFIEWGGERAIIAHLEKAVEALE 305


35  Rv2484c  Rv2492  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv2484c   -1      9        3.175021   Possible triacylglycerol synthase
Rv2485c   4       12       6.628014   Probable carboxylesterase LipQ
Rv2486    5       14       6.468792   *Probable enoyl-CoA hydratase EchA14 (enoyl
Rv2487c   6       14       5.440959   PE-PGRS family protein PE_PGRS42
Rv2488c   4       14       4.771077   Probable transcriptional regulatory protein
Rv2489c   8       20       5.572478   Hypothetical alanine rich protein
Rv2490c   7       18       4.920927   PE-PGRS family protein PE_PGRS43
Rv2491    -1      17       -4.143500  Conserved hypothetical protein
Rv2492    -1      16       -3.222873  Hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv2487c   cloacin  39     9e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.5 bits (89), Expect = 9e-05
Identities = 30/86 (34%), Positives = 36/86 (41%), Gaps = 4/86 (4%)

Query: 145 GFGQTGGSGGAAGLIGNGGNGGAGGTGAAGGAGGNGGWLWGNGGNGGVGGTSVAAGIGGA 204
G G G+ +G I NGG G G GGA GW N GG G+ + G G
Sbjct: 6 GRGHNTGAHSTSGNI----NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 205 GGNGGNAGLFGHGGAGGTGGAGLAGA 230
GNGG G G G G + +A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAP 87



Score = 38.2 bits (88), Expect = 1e-04
Identities = 28/98 (28%), Positives = 35/98 (35%)

Query: 566 NGGSGGVGGAGGVGGAGGDGGNGGSGGNASTFGDENSIGGAGGTGGNGGNGANGGNGGAG 625
NGG G+G GG G G S G G G GG GN G G
Sbjct: 21 NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGN 80

Query: 626 GIAGGAGGSGGFLSGAAGVSGADGIGGAGGAGGAGGAG 663
A A + GF + + +G + + GA A A
Sbjct: 81 LSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 36.6 bits (84), Expect = 3e-04
Identities = 26/70 (37%), Positives = 28/70 (40%)

Query: 529 GGDGGSGGSSLGVGGVGGAGGVGGKGGASGMLIGNGGNGGSGGVGGAGGVGGAGGDGGNG 588
GG G G G G + GG SG I GG G G GG G GG G GGN
Sbjct: 22 GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNL 81

Query: 589 GSGGNASTFG 598
+ FG
Sbjct: 82 SAVAAPVAFG 91



Score = 35.5 bits (81), Expect = 8e-04
Identities = 35/106 (33%), Positives = 42/106 (39%), Gaps = 1/106 (0%)

Query: 527 GVGGDGGSGGSSLGV-GGVGGAGGVGGKGGASGMLIGNGGNGGSGGVGGAGGVGGAGGDG 585
G G + G+ +S + GG G G GG SG N GG G G G G G+G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 586 GNGGSGGNASTFGDENSIGGAGGTGGNGGNGANGGNGGAGGIAGGA 631
G G+ G S G S A G G G A I+ GA
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 35.1 bits (80), Expect = 0.001
Identities = 30/103 (29%), Positives = 45/103 (43%), Gaps = 1/103 (0%)

Query: 227 LAGANGVNPTPGPAASTGDSPADVSGIGDQTGGDGGTGGHGTAGTPTGGTGGDGATATAG 286
++G +G G +++G+ +G+G G G+G + P GG G G G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSG-WSSENNPWGGGSGSGIHWGGG 59

Query: 287 SGKATGGAGGDGGTAAAGGGGGNGGDGGVAQGDIASAFGGDGG 329
SG GG G+ G + GG + VA G A + G GG
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 34.7 bits (79), Expect = 0.001
Identities = 39/122 (31%), Positives = 48/122 (39%), Gaps = 8/122 (6%)

Query: 257 TGGDGGTGGHGTAGTPTGGTGGDGATATAGSGKATGGAGGDGGTAAAGGGGGNGGDGGVA 316
+GGDG GH T T G G T G A+ G+G GGG G+G G
Sbjct: 2 SGGDGR--GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 317 QGDIASAFGGDGGNGSDGVAAGSGGGSGGAGGGAFVHIATATSTGGSGGFGGNGAASAAS 376
G G+ G GS GG+ A A ST G+GG + +A A S
Sbjct: 60 SGHGNGGGNGNSGGGS------GTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALS 113

Query: 377 GA 378
A
Sbjct: 114 AA 115



Score = 34.3 bits (78), Expect = 0.002
Identities = 30/87 (34%), Positives = 37/87 (42%), Gaps = 1/87 (1%)

Query: 604 GGAGGTGGNGGNGANGGNGGAGGIAGGAGGSGGFLSGAAGVSGADGIGGAGGAGGAGGAG 663
GA T GN NG G G GG + G+G S G GG G G GG G
Sbjct: 11 TGAHSTSGNI-NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69

Query: 664 GSGGEAGAGGLTNGPGSPGVSGTEGMA 690
SGG +G GG + +P G ++
Sbjct: 70 NSGGGSGTGGNLSAVAAPVAFGFPALS 96



Score = 33.5 bits (76), Expect = 0.003
Identities = 30/91 (32%), Positives = 36/91 (39%), Gaps = 7/91 (7%)

Query: 339 SGGGSGGAGGGAFVHIATATSTGGSGGFGGNGAASAASGADGGAGGAGGNGGAGGLLFGD 398
SGG G GA H + GG G G G AS DG + N GG G
Sbjct: 2 SGGDGRGHNTGA--HSTSGNINGGPTGLGVGGGAS-----DGSGWSSENNPWGGGSGSGI 54

Query: 399 GGNGGAGGAGGIGGDGATGGPGGSGGNAGIA 429
GG+G G G + GG G G + +A
Sbjct: 55 HWGGGSGHGNGGGNGNSGGGSGTGGNLSAVA 85



Score = 33.5 bits (76), Expect = 0.003
Identities = 27/101 (26%), Positives = 36/101 (35%)

Query: 447 GGDGGKGGSGLGVGGAGGTGGAGGNGGAGGLLFGNGGNGGNAGAGGDGGAGVAGGVGGNG 506
GG G G G G+G + GG G GG G+ GG+G +G G GGN
Sbjct: 22 GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNL 81

Query: 507 GGGGTATFHEDPVAGVWAVGGVGGDGGSGGSSLGVGGVGGA 547
P GG+ +G S + + A
Sbjct: 82 SAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMAA 122



Score = 33.1 bits (75), Expect = 0.004
Identities = 27/78 (34%), Positives = 31/78 (39%)

Query: 397 GDGGNGGAGGAGGIGGDGATGGPGGSGGNAGIARFDSPDPEAEPDVVGGKGGDGGKGGSG 456
G G N GA G G TG G G + G +P G G G G+G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 457 LGVGGAGGTGGAGGNGGA 474
G G +GG G GGN A
Sbjct: 66 GGNGNSGGGSGTGGNLSA 83



Score = 32.4 bits (73), Expect = 0.007
Identities = 23/61 (37%), Positives = 30/61 (49%)

Query: 449 DGGKGGSGLGVGGAGGTGGAGGNGGAGGLLFGNGGNGGNAGAGGDGGAGVAGGVGGNGGG 508
+GG G G+G G + G+G + N GG GG +G G GG G +GG G GG
Sbjct: 21 NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGN 80

Query: 509 G 509

Sbjct: 81 L 81



Score = 30.1 bits (67), Expect = 0.035
Identities = 33/100 (33%), Positives = 38/100 (38%), Gaps = 1/100 (1%)

Query: 325 GGDGGNGSDGVAAGSGGGSGGAGGGAFVHIATATSTGGS-GGFGGNGAASAASGADGGAG 383
GGDG + G + SG +GG G A+ S S G G+ S G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 384 GAGGNGGAGGLLFGDGGNGGAGGAGGIGGDGATGGPGGSG 423
G GG G G G GGN A A G A PG G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2490c   cloacin       38     4e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 37.8 bits (87), Expect = 4e-04
Identities = 37/105 (35%), Positives = 45/105 (42%), Gaps = 5/105 (4%)

Query: 1320 GGGGVGGHGGAGGDAGMNGGGGTGGQGGNGAAGGAGWSPDSDLKGFDGFDGGSGGAGGDG 1379
GG G G + GA +G GG TG G GA+ G+GWS +++ G GGSG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWG-----GGSGSGIHWG 57

Query: 1380 GAGGAGGTQTGDGGDGGAGGLGGAGGVGGNGVDGFDINETTGRDG 1424
G G G GG+G G V GF T G G
Sbjct: 58 GGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 34.7 bits (79), Expect = 0.004
Identities = 29/99 (29%), Positives = 40/99 (40%), Gaps = 3/99 (3%)

Query: 427 GGAGGASGGAGARAGANGLAAGNDGPVSGGNGGKGGNGAHAPVAGGHGGN---GGAGGNG 483
G G + GA + +G G G + G G + + P GG G GG G+G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 484 GLVGDGGAGGHGGDGAAGAGYADMTAIFLGSSGTPGEDG 522
G+G +GG G G + A A + TPG G
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 34.3 bits (78), Expect = 0.005
Identities = 31/97 (31%), Positives = 41/97 (42%), Gaps = 1/97 (1%)

Query: 1024 GAGGAGGAGGAGGSVSG-DGGAGGNGGAGGNGGVGASGGAGARGANGIDSIGGTGGAGGG 1082
G G GA G+++G G G GGA G + G+ GG G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 1083 GGDGGAGGVGGHGGDGGVGGAAPSGTVGSHGTGGVGG 1119
GG+G +GG G GG+ A + + T G GG
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 34.3 bits (78), Expect = 0.005
Identities = 29/108 (26%), Positives = 34/108 (31%)

Query: 749 GGVGGGGAGGAGGDGGAGSSALGSGGNGGRGDAGQAGGAGGAGGAGGAGGSVSGDGGPGG 808
GG G G GA G + G GG G + GG+G + GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 809 KGGAGGAGGAGASGGGGGKGASGADSAEAVGGAGGKGGDGGVGGVGGD 856
G G G SG GG A A A G G +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 33.9 bits (77), Expect = 0.005
Identities = 36/113 (31%), Positives = 40/113 (35%)

Query: 729 GADGTLSGQPGEGSEANGGQGGVGGGGAGGAGGDGGAGSSALGSGGNGGRGDAGQAGGAG 788
G DG + N G G G GGA G S GG G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 789 GAGGAGGAGGSVSGDGGPGGKGGAGGAGGAGASGGGGGKGASGADSAEAVGGA 841
G GG G G SG GG A A G A G G + + SA A+ A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 33.9 bits (77), Expect = 0.006
Identities = 29/85 (34%), Positives = 32/85 (37%)

Query: 1508 GQGGFGSTSGAHGKAGFGAPGGDGGDGGNGGHGGDGNGSFADAGDGGPGGNGGNGGLGGA 1567
G G G +GAH +G G G G G G G S + GG G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1568 GRDGGAPGGDGGDGGTGGSGGFGAP 1592
G GG GG G G AP
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAP 87



Score = 33.9 bits (77), Expect = 0.006
Identities = 33/101 (32%), Positives = 39/101 (38%)

Query: 784 AGGAGGAGGAGGAGGSVSGDGGPGGKGGAGGAGGAGASGGGGGKGASGADSAEAVGGAGG 843
+GG G G S + +GGP G G GGA G+ S GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 844 KGGDGGVGGVGGDGGPGGDGGAGGAAPAGQVGSHGVGGVGG 884
G GG G GG G GG+ A A A + G GG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 33.5 bits (76), Expect = 0.008
Identities = 33/105 (31%), Positives = 43/105 (40%), Gaps = 6/105 (5%)

Query: 911 GDPGAGGLGGLGGDSGNGTRAASGVDASDHGPGSGGNGGNGGNGAQASVAGGAGGNGGDG 970
G G G G SGN +G+ G G G + G+G + GG+G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGL-----GVGGGASDGSGWSSENNPWGGGSGSGIHWG 57

Query: 971 GNAGRVGDGGAGGNGGDGAAGANGANSGAPGSDAL-ALGQPGGNG 1014
G +G GG G +GG G N + AP + AL PG G
Sbjct: 58 GGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 33.1 bits (75), Expect = 0.010
Identities = 40/140 (28%), Positives = 52/140 (37%), Gaps = 7/140 (5%)

Query: 1053 NGGVGASGGAGARGANGIDSIGGTGGAGGGGGDGGAGGVGGHGGDGGVGGAAPSGTVGSH 1112
+GG G GA +G + G TG GGG G+G + GG G+
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGI------- 54

Query: 1113 GTGGVGGDGGLGGAGGVGGAGGNGGIGITVGGAGGAGGNGGDPGAGGRGGLGGDSGNGTS 1172
GG G G GG G GG G GG V G G + +G ++
Sbjct: 55 HWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSA 114

Query: 1173 AANGVDASKHGPLTGGDGGV 1192
A + A+ GP G GV
Sbjct: 115 AIADIMAALKGPFKFGLWGV 134



Score = 32.4 bits (73), Expect = 0.017
Identities = 35/105 (33%), Positives = 40/105 (38%), Gaps = 5/105 (4%)

Query: 164 IGGAGGAGGAGAPGGTGGTGGWLAGGGGVGGMGGAGGGAGGAGGNAGLFGNGGAGGAGGA 223
+ G G G T G G GVGG G G G G GG+
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 224 GGGAGGAGGNAGWFGHGGAGGVGGVGAAGANG----ATPGQDGAA 264
G G GG GN+G G G G + V A A G +TPG G A
Sbjct: 61 GHGNGGGNGNSGG-GSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.0 bits (72), Expect = 0.023
Identities = 25/77 (32%), Positives = 29/77 (37%)

Query: 1399 GLGGAGGVGGNGVDGFDINETTGRDGGDGGDGGYGGWGGAGGNGGAGGSAPAGEVGNRGV 1458
G G G G +IN G GG GW G G + G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1459 GGDGGDGGSGGDAGNGG 1475
G GG+G SGG +G GG
Sbjct: 63 GNGGGNGNSGGGSGTGG 79



Score = 32.0 bits (72), Expect = 0.024
Identities = 28/78 (35%), Positives = 29/78 (37%), Gaps = 4/78 (5%)

Query: 1280 GNGGAGGAGGAGGAGGAFLGDGGNGGAGGQGGAGRGGS----PGGGGGVGGHGGAGGDAG 1335
G G G GA G G G GG G G S P GGG G GG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1336 MNGGGGTGGQGGNGAAGG 1353
NGGG GG+G G
Sbjct: 63 GNGGGNGNSGGGSGTGGN 80



Score = 32.0 bits (72), Expect = 0.026
Identities = 30/90 (33%), Positives = 36/90 (40%), Gaps = 1/90 (1%)

Query: 578 GAGGRGGDGGAGGAGGDAPAGRAGSQGVGGDGGAGGAGGAPGNGGSGGRGDMAFKDGDGG 637
G GRG + GA G+ G G GVGG G + N GG G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGL-GVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 638 AGGDGGDPGAGGKGGAGGAGATEGVTGATG 667
G GG+ +GG G GG + A G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 31.6 bits (71), Expect = 0.031
Identities = 26/84 (30%), Positives = 29/84 (34%), Gaps = 1/84 (1%)

Query: 1375 AGGDGGAGGAGGTQTGDGGDGGAGGLGGAGGVGGNGVDGFDINETTGRDGGDGGDGGYGG 1434
+GGDG G T +GG GLG GG +G N G G G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGG-ASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 1435 WGGAGGNGGAGGSAPAGEVGNRGV 1458
G GG G G V
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAV 84



Score = 31.6 bits (71), Expect = 0.031
Identities = 30/103 (29%), Positives = 36/103 (34%), Gaps = 3/103 (2%)

Query: 283 GGDGGAGGVGGNGGRGGWLLGNGGAGGVGGVGGAGGAGAAGGAGGAGATGINGPAGISAA 342
GGDG G + G NGG G+G GGA G
Sbjct: 3 GGDGRGHNTGAHSTSGNI---NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 343 GGDGGAGGNGGAGGNGGVGGAGGAGGSAGLLGYVGRAGDGGAG 385
G G GGNG +GG G GG A + G+ + G G
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 31.2 bits (70), Expect = 0.036
Identities = 24/76 (31%), Positives = 29/76 (38%)

Query: 522 GGNGGAGGAGGAGGAHAGDGGAGGAGGNGGAGGAGGNGAHGFNAVLVSDGGNGGDGGAGG 581
G N GA G G GG +G + N G + + GG G G GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 582 RGGDGGAGGAGGDAPA 597
G GG G GG+ A
Sbjct: 68 NGNSGGGSGTGGNLSA 83



Score = 31.2 bits (70), Expect = 0.037
Identities = 27/82 (32%), Positives = 33/82 (40%)

Query: 1236 LGGDGGAGGAGGKGGDAGDIGDGGDGGKGGDGAHGALGGLTVAGGNGGAGGAGGAGGAGG 1295
+ G G G G +G+I G G G GA G + GG G+G G G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 1296 AFLGDGGNGGAGGQGGAGRGGS 1317
GGNG +GG G G S
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLS 82



Score = 31.2 bits (70), Expect = 0.039
Identities = 27/84 (32%), Positives = 37/84 (44%)

Query: 981 AGGNGGDGAAGANGANSGAPGSDALALGQPGGNGGQGDAGQAGGAGGAGGAGGAGGSVSG 1040
+GG+G GA+ + G G + G G + + GG G+G G SG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1041 DGGAGGNGGAGGNGGVGASGGAGA 1064
G GGNG +GG G G + A A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVA 85



Score = 31.2 bits (70), Expect = 0.040
Identities = 32/101 (31%), Positives = 35/101 (34%), Gaps = 2/101 (1%)

Query: 830 SGADSAEAVGGAGGKGGD--GGVGGVGGDGGPGGDGGAGGAAPAGQVGSHGVGGVGGDGG 887
SG D GA G+ GG G+G GG G GS GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 888 LGGAGGNGGDGGHGSDGGDGGDGGDPGAGGLGGLGGDSGNG 928
G GGNG GG GG+ P A G L G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 31.2 bits (70), Expect = 0.044
Identities = 25/75 (33%), Positives = 33/75 (44%), Gaps = 1/75 (1%)

Query: 472 GHGGNGGAGGNGGLVGDG-GAGGHGGDGAAGAGYADMTAIFLGSSGTPGEDGGNGGAGGA 530
G G N GA G + G G GG + G+G++ + G SG+ GG G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 531 GGAGGAHAGDGGAGG 545
GG G + G G G
Sbjct: 66 GGNGNSGGGSGTGGN 80


36  Rv2535c  Rv2547  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv2535c   -1      21       3.484989   Probable cytoplasmic peptidase PepQ
Rv2536    0       21       4.910452   Probable conserved transmembrane protein
Rv2537c   2       18       4.655003   3-dehydroquinate dehydratase AroD (AROQ)
Rv2538c   1       16       3.805167   3-dehydroquinate synthase AroB
Rv2539c   0       16       2.585762   Shikimate kinase AroK (SK)
Rv2540c   1       14       1.627698   Probable chorismate synthase AroF
Rv2541    1       16       -0.633118  Hypothetical alanine rich protein
Rv2542    0       16       -2.156467  Conserved hypothetical protein
Rv2543    0       23       -3.717657  Probable conserved lipoprotein LppA
Rv2544    1       23       -3.243997  Probable conserved lipoprotein LppB
Rv2545    3       19       -2.278096  Possible antitoxin VapB18
Rv2546    5       19       -2.278373  Possible toxin VapC18
Rv2547    3       15       -0.436459  Possible antitoxin VapB19
37  Rv2629  Rv2677c  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv2629    3       15       2.170083   Conserved protein
Rv2630    5       18       2.851651   Hypothetical protein
Rv2631    5       17       2.553560   Conserved hypothetical protein
Rv2632c   4       17       3.203860   Conserved protein
Rv2633c   4       17       3.616057   Hypothetical protein
Rv2634c   6       16       4.262600   PE-PGRS family protein PE_PGRS46
Rv2635    3       16       0.328022   Hypothetical protein
Rv2636    1       16       0.121943   Conserved hypothetical protein
Rv2637    0       15       1.034976   Possible transmembrane protein DedA
Rv2638    3       15       0.481918   Conserved hypothetical protein
Rv2639c   2       15       -0.066798  Probable conserved integral membrane protein
Rv2640c   0       17       0.682490   Possible transcriptional regulatory protein
Rv2641    -1      18       0.769523   Cadmium inducible protein CadI
Rv2642    -1      19       0.794548   Possible transcriptional regulatory protein
Rv2643    0       19       0.617316   Probable arsenic-transport integral membrane
Rv2644c   1       24       0.157635   Hypothetical protein
Rv2645    1       17       0.835399   ****Hypothetical protein
Rv2646    2       15       0.254059   Probable integrase
Rv2647    3       15       -0.000785  Hypothetical protein
Rv2648    2       15       0.179761   Probable transposase for insertion sequence
Rv2649    3       16       0.889773   Probable transposase for insertion sequence
Rv2650c   4       15       1.465326   Possible PhiRv2 prophage protein
Rv2651c   2       16       1.867496   Possible PhiRv2 prophage protease
Rv2652c   1       16       2.202886   Probable PhiRv2 prophage protein
Rv2653c   3       18       1.499208   Possible PhiRv2 prophage protein
Rv2654c   0       19       0.926632   Possible PhiRv2 prophage protein
Rv2655c   -1      21       -0.203905  Possible PhiRv2 prophage protein
Rv2656c   -1      22       -0.461024  Possible PhiRv2 prophage protein
Rv2657c   -1      25       -1.538461  Probable PhiRv2 prophage protein
Rv2658c   -1      27       -2.250355  Possible prophage protein
Rv2659c   0       27       -1.263970  Probable PhiRv2 prophage integrase
Rv2660c   3       24       -0.433847  Hypothetical protein
Rv2661c   4       21       0.680157   Hypothetical protein
Rv2662    2       22       0.807881   Hypothetical protein
Rv2663    0       22       1.364761   Hypothetical protein
Rv2664    -1      20       1.480806   Hypothetical protein
Rv2665    0       14       0.794852   Hypothetical arginine rich protein
Rv2666    0       16       0.634479   Probable transposase for insertion sequence
Rv2667    0       14       0.999144   Possible ATP-dependent protease ATP-binding
Rv2668    1       12       0.093496   Possible exported alanine and valine rich
Rv2669    2       11       -0.326271  GCN5-related N-acetyltransferase
Rv2670c   1       10       -0.072228  Conserved hypothetical protein
Rv2671    1       11       0.154249   Possible bifunctional enzyme riboflavin
Rv2672    0       12       1.408030   Possible secreted protease
Rv2673    -1      15       1.645305   Possible arabinofuranosyltransferase AftC
Rv2674    -1      13       2.818703   Probable peptide methionine sulfoxide reductase
Rv2675c   0       12       3.276070   Conserved hypothetical protein
Rv2676c   0       12       3.426037   Conserved protein
Rv2677c   -1      11       3.167503   Probable protoporphyrinogen oxidase HemY
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2634c   cloacin       42     6e-06    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 42.4 bits (99), Expect = 6e-06
Identities = 42/111 (37%), Positives = 47/111 (42%), Gaps = 2/111 (1%)

Query: 570 AGGVGGAGGEGLTDGAGTAEGGTGGLGGLGGVGGTGGMGGSGGVGGNGGAAGSLIGLGGG 629
+GG G G +G GG GLG GG + G G S GG +GS I GGG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGG--ASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 630 GGAGGVGGTGGIGGIGGAGGNGGAGGAGTTTGGGATIGGGGGTGGVGGAGG 680
G G GG G GG G GGN A A G A G G V + G
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 39.3 bits (91), Expect = 5e-05
Identities = 39/116 (33%), Positives = 47/116 (40%), Gaps = 12/116 (10%)

Query: 607 MGGSGGVGGNGGAAGSLIGLGGGGGAGGVGGTGGIGGIGGAGGNGGAGGAGTTTGGGATI 666
M G G G N GA + + GG GVGG G + G+G + G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGG----------GASDGSGWSSENNPWGGGS 50

Query: 667 GGGGGTGGVGGAGGTGGTGGAGGTTGGSGGAGGLIGWAGAAGGTGAGGTGGQGGLG 722
G G GG G G GG G +GG +G G + A A G A T G GGL
Sbjct: 51 GSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVA--APVAFGFPALSTPGAGGLA 104



Score = 37.0 bits (85), Expect = 3e-04
Identities = 29/82 (35%), Positives = 37/82 (45%), Gaps = 2/82 (2%)

Query: 647 AGGNGGAGGAGTTTGGGATIGGGGGTGGVGGAGGTGGTGGAGGTTGGSGGAGGLIGWAGA 706
+GG+G G + G GG G G G G + G+G + GG+G I W G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLG--VGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 707 AGGTGAGGTGGQGGLGGQGGNG 728
+G GG G GG G GGN
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNL 81



Score = 36.6 bits (84), Expect = 4e-04
Identities = 28/81 (34%), Positives = 32/81 (39%)

Query: 496 GKGGQGHNTGVGDAFGGDGGIGGDGNGALGAAGGNGGTGGAGGNGGRGGMLIGNGGAGGA 555
G G+GHNTG G G GA+ G+G + GG G I GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 556 GGTGGTGGGGAAGFAGGVGGA 576
G GG G G GG A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSA 83



Score = 36.2 bits (83), Expect = 5e-04
Identities = 28/76 (36%), Positives = 33/76 (43%)

Query: 548 GNGGAGGAGGTGGTGGGGAAGFAGGVGGAGGEGLTDGAGTAEGGTGGLGGLGGVGGTGGM 607
G G GA T G GG G G G + G G + GG+G GG G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 608 GGSGGVGGNGGAAGSL 623
GG+G GG G G+L
Sbjct: 66 GGNGNSGGGSGTGGNL 81



Score = 33.9 bits (77), Expect = 0.002
Identities = 29/84 (34%), Positives = 35/84 (41%), Gaps = 1/84 (1%)

Query: 348 MPGAGGNGGNANWFGSGGAGGQGGTGLAGTNGVNPGSIANPNTGANGTDNSGNGNQTGGN 407
M G G G N + G G TGL G + GS + G SG+G GG
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWG-GGSGSGIHWGGG 59

Query: 408 GGPGPAGGVGEAGGVGGQGGLGES 431
G G GG G +GG G GG +
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSA 83



Score = 33.9 bits (77), Expect = 0.003
Identities = 30/83 (36%), Positives = 38/83 (45%), Gaps = 2/83 (2%)

Query: 432 LDGNDGTGGKGGAGGTAGTDGGAGGAGGAGGIGETDGSAGGVATGGEGGDGATGGVDGGV 491
+ G DG G GA T+G G G GG G +DGS G + G G+ G+ G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGG-GASDGS-GWSSENNPWGGGSGSGIHWGG 58

Query: 492 GGAGGKGGQGHNTGVGDAFGGDG 514
G G GG N+G G GG+
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNL 81



Score = 33.5 bits (76), Expect = 0.004
Identities = 34/108 (31%), Positives = 42/108 (38%), Gaps = 2/108 (1%)

Query: 470 AGGVATGGEGGDGATGG-VDGGVGGAGGKGGQGHNTGVGDAFGGDGGIGGDGNGALGAAG 528
+GG G G +T G ++GG G G GG +G GG G G G +G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 529 -GNGGTGGAGGNGGRGGMLIGNGGAGGAGGTGGTGGGGAAGFAGGVGG 575
GNGG G G G G + A A G GA G A +
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109



Score = 33.1 bits (75), Expect = 0.005
Identities = 26/90 (28%), Positives = 34/90 (37%), Gaps = 3/90 (3%)

Query: 390 TGANGTDNSGNGNQTGGNGGPGPAGGVGEAGGVGGQGGLGESLDGNDGTG---GKGGAGG 446
+G +G ++ + T GN GP G G G G E+ G+G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 447 TAGTDGGAGGAGGAGGIGETDGSAGGVATG 476
G GG+G G A VA G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 32.8 bits (74), Expect = 0.005
Identities = 30/82 (36%), Positives = 33/82 (40%), Gaps = 3/82 (3%)

Query: 549 NGGAGGAGGT--GGTGGGGAAGFAGGVGGAGGEGLTDGAGTAEG-GTGGLGGLGGVGGTG 605
N GA G GG G G G A G E G G+ G GG G G GG G
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69

Query: 606 GMGGSGGVGGNGGAAGSLIGLG 627
GG G GGN A + + G
Sbjct: 70 NSGGGSGTGGNLSAVAAPVAFG 91



Score = 32.0 bits (72), Expect = 0.009
Identities = 35/104 (33%), Positives = 44/104 (42%), Gaps = 10/104 (9%)

Query: 255 GNGGFGGAGGLGAAGGVGGAASYFGTGGGG----GVGGDGAPGGDG-GAGPLLIGNGGVG 309
G+G G +G + G + G GGG G + P G G G+G G G G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 310 GLGGAGAAGGNGGAGGMLLGDGGAGGQGGPAVAGVLGGMPGAGG 353
GG G +GG G GG L G PA++ PGAGG
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALS-----TPGAGG 102



Score = 31.6 bits (71), Expect = 0.014
Identities = 25/71 (35%), Positives = 31/71 (43%)

Query: 389 NTGANGTDNSGNGNQTGGNGGPGPAGGVGEAGGVGGQGGLGESLDGNDGTGGKGGAGGTA 448
NTGA+ T + NG TG G G + G G + GG S G G G GG
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69

Query: 449 GTDGGAGGAGG 459
+ GG+G G
Sbjct: 70 NSGGGSGTGGN 80



Score = 30.8 bits (69), Expect = 0.024
Identities = 20/78 (25%), Positives = 25/78 (32%)

Query: 464 GETDGSAGGVATGGEGGDGATGGVDGGVGGAGGKGGQGHNTGVGDAFGGDGGIGGDGNGA 523
G+ G G + +G G+ G G + G G N G G GG
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 524 LGAAGGNGGTGGAGGNGG 541
G GN G G G
Sbjct: 64 NGGGNGNSGGGSGTGGNL 81



Score = 30.1 bits (67), Expect = 0.036
Identities = 31/102 (30%), Positives = 36/102 (35%)

Query: 197 GTGGAGGAAGATLVGGTGGVGGATGLIGSGGFGGAGGAAAGVGTTGGVGGSGGVGGVFGN 256
G G G GA G G +G G G+G ++ GG G GG G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 257 GGFGGAGGLGAAGGVGGAASYFGTGGGGGVGGDGAPGGDGGA 298
G GG G G G GG S G PG G A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 30.1 bits (67), Expect = 0.036
Identities = 31/96 (32%), Positives = 36/96 (37%), Gaps = 6/96 (6%)

Query: 677 GAGGTGGTGGAGGTTGG-SGGAGGLIGWAGAAGGTGAGGTGGQGGLGGQGGNGGNGGTGA 735
G G G GA T+G +GG GL GA+ G+G G GG G GG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWG-GGSGSGIHWGGGSG 61

Query: 736 TGGQGGDFALGGNGGAGGAGGSPGGSSGIQGNMGPP 771
G GG GN G G G + G P
Sbjct: 62 HGNGGG----NGNSGGGSGTGGNLSAVAAPVAFGFP 93



Score = 30.1 bits (67), Expect = 0.039
Identities = 25/72 (34%), Positives = 32/72 (44%), Gaps = 2/72 (2%)

Query: 120 NGANGADGTGAPGGPGGLLLGNGGNGGSG--APGQPGGAGGDAGLIGNGGTGGKGGDGLV 177
N + GGP GL +G G + GSG + P G G +G+ GG+G G G
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69

Query: 178 GSGAAGGVGGRG 189
SG G GG
Sbjct: 70 NSGGGSGTGGNL 81



Score = 30.1 bits (67), Expect = 0.045
Identities = 30/109 (27%), Positives = 41/109 (37%)

Query: 167 GTGGKGGDGLVGSGAAGGVGGRGGWLLGNGGTGGAGGAAGATLVGGTGGVGGATGLIGSG 226
G G+G + S + GG G +G G + G+G ++ GG G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 227 GFGGAGGAAAGVGTTGGVGGSGGVGGVFGNGGFGGAGGLGAAGGVGGAA 275
G GG G + G TGG + FG G G A + A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2668    FIMBRILLIN    30     0.003    Porphyromonas gingivalis: fimbrillin protein signature.
		>FIMBRILLIN#Porphyromonas gingivalis: fimbrillin protein signature.

Length = 348

Score = 30.4 bits (68), Expect = 0.003
Identities = 18/55 (32%), Positives = 20/55 (36%), Gaps = 5/55 (9%)

Query: 97 NSFILATNFSFTGV-TPFADAYKPRPCDASDWL----DAALGNAPQGSIVRGGVY 146
S + + TG T F AY P DWL NAPQG V Y
Sbjct: 174 ASLANSDDAYLTGSLTNFNGAYSPANYTHVDWLGRDYTEPSNNAPQGFYVLESTY 228


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2669    SACTRNSFRASE  51     5e-11    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 51.5 bits (123), Expect = 5e-11
Identities = 20/97 (20%), Positives = 34/97 (35%)

Query: 43 EYLTDPRRAILTARHDGRIVGYAMLIRGDDRDVELSKLYLLPGYHGTGAAAALMHKVLAT 102
Y+ + +A + +G + + + + + Y G AL+HK +
Sbjct: 58 SYVEEEGKAAFLYYLENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEW 117

Query: 103 AADWGALRVWLGVNQKNQRAQRFYAKTGFKINGTRTF 139
A + + L N A FYAK F I T
Sbjct: 118 AKENHFCGLMLETQDINISACHFYAKHHFIIGAVDTM 154


38  Rv2731  Rv2741  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv2731    2       10       -0.267892  Conserved alanine and arginine rich protein
Rv2732c   0       12       -1.752216  Probable conserved transmembrane protein
Rv2733c   1       12       -2.048251  Conserved hypothetical alanine, arginine-rich
Rv2734    1       18       -3.716532  Conserved hypothetical protein
Rv2735c   1       15       -0.682950  Conserved hypothetical protein
Rv2736c   2       16       1.075895   Regulatory protein RecX
Rv2737c   0       12       2.870716   RecA protein (recombinase A) [contains:
Rv2737A   3       12       5.463400   Conserved hypothetical cysteine rich protein
Rv2738c   3       11       5.225851   Conserved hypothetical protein
Rv2739c   2       11       4.235482   Possible alanine rich transferase
Rv2740    4       12       3.255539   Epoxide hydrolase
Rv2741    4       13       2.706810   PE-PGRS family protein PE_PGRS47
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2731    PF05616       35     7e-04    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 34.7 bits (79), Expect = 7e-04
Identities = 18/46 (39%), Positives = 21/46 (45%), Gaps = 1/46 (2%)

Query: 6 PRSDDSSGSAPQPAATPVPRPGPRPGPRPVPRPTSYPVGAHPPSDP 51
PR D + GSA P A P+P P P P P P G P +P
Sbjct: 311 PRPDLTPGSAEAPNAQPLPEVSPAENPANNPAPNENP-GTRPNPEP 355


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2737A   2FE2SRDCTASE  38     4e-07    Ferric iron reductase signature.
		>2FE2SRDCTASE#Ferric iron reductase signature.

Length = 262

Score = 37.7 bits (87), Expect = 4e-07
Identities = 13/26 (50%), Positives = 16/26 (61%)

Query: 32 DLTFRRRSCCLFYRVPAGGKCGDCPL 57
D RR+CC YR+P +CGDC L
Sbjct: 236 DGLLVRRTCCQRYRLPDVQQCGDCTL 261


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2741    cloacin       39     3e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 39.3 bits (91), Expect = 3e-05
Identities = 37/113 (32%), Positives = 48/113 (42%), Gaps = 5/113 (4%)

Query: 233 GVGGAALGGGAQAAGGNGGAGGVGGLFGAGGAGGAGGFSDTGGTGGAGGAGGLFGPGGGS 292
G G GA + GN G G G G + G+G S+ GG G+G +G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 293 GGVGGFGDTGGTGGDGGSGGLFGVGGAGGHGGFGSAAGGDGGAGGAGGTVFGS 345
G GG G++GG G GG+ A FG A GAGG ++
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVA-----FGFPALSTPGAGGLAVSISAG 110



Score = 35.5 bits (81), Expect = 6e-04
Identities = 29/78 (37%), Positives = 32/78 (41%), Gaps = 1/78 (1%)

Query: 289 GGGSGGVGGFGDTGGTGGDGGSGGLFGVGGAGGHGGFGSAAGGDGGAGGAGGTVFGSGGA 348
G G G G T G +GG GL GGA G+ S GG G+G G G
Sbjct: 4 GDGRGHNTGAHSTSGNI-NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 349 GGAGGVATVAGHGGHGGN 366
G GG G G GGN
Sbjct: 63 GNGGGNGNSGGGSGTGGN 80



Score = 34.3 bits (78), Expect = 0.001
Identities = 30/89 (33%), Positives = 36/89 (40%), Gaps = 1/89 (1%)

Query: 372 GTGGAGGAGGFGGFGGDGGDGGIGGLVGSGGAGGSGGTGTLSGGRGGAGGNAGTFYGSG- 430
G G G G G+ G G VG G + GSG + + GG+G GSG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 431 GAGGAGGESDNGDGGNGGVGGKAGLVGEG 459
G GG G S G G G + A V G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 33.9 bits (77), Expect = 0.002
Identities = 38/115 (33%), Positives = 43/115 (37%), Gaps = 5/115 (4%)

Query: 400 SGGAGGSGGTGTLSGGRGGAGGNAGTFYGSGGAGGAG-GESDNGDGGNGGVGGKAGLVGE 458
SGG G TG S GG G G G + G+G +N GG G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 459 GGNGGDGGATIAGKGGSGGNGGNAWLTGQGGNGGNAAFGKAGTGSVGVGGAGGLL 513
GNGG G GG G GGN G A G G + V + G L
Sbjct: 62 HGNGGGNG----NSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGAL 112



Score = 33.5 bits (76), Expect = 0.002
Identities = 28/102 (27%), Positives = 37/102 (36%), Gaps = 2/102 (1%)

Query: 260 GAGGAGGAGGFSDTGGTGGAGGAGGLFGPGGGSGGVGGFGDTGGTGGDGGSGGLFGVGGA 319
G G G G T G G G G GGG+ G+ G G G+ GG+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGL--GVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 320 GGHGGFGSAAGGDGGAGGAGGTVFGSGGAGGAGGVATVAGHG 361
G G G+ G G G + + A G ++T G
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 33.5 bits (76), Expect = 0.002
Identities = 33/113 (29%), Positives = 41/113 (36%), Gaps = 5/113 (4%)

Query: 160 GAAGLFGNGGAGGAGASNQAGNGGAGGNGGAG-GLIWGTAGTGGNGGFTTFLDAAGGAGG 218
G G N GA + G G G GGA G W + GG + + GG+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 219 AGGAGGLFGAGGAGGVGGAALGGGAQAAG----GNGGAGGVGGLFGAGGAGGA 267
G G GG+G G + A G GAGG+ AG A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 33.5 bits (76), Expect = 0.002
Identities = 21/72 (29%), Positives = 28/72 (38%)

Query: 120 NGANGKPGTGQDGGAGGLLYGSGGNGGSGLAGSGQKGGNGGAAGLFGNGGAGGAGASNQA 179
N +GG GL G G + GSG + G G +G+ GG+G
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69

Query: 180 GNGGAGGNGGAG 191
+GG G GG
Sbjct: 70 NSGGGSGTGGNL 81



Score = 32.8 bits (74), Expect = 0.004
Identities = 35/105 (33%), Positives = 42/105 (40%), Gaps = 5/105 (4%)

Query: 169 GAGGAGASNQAGNGGAGGNGGAGGLIWGTAGTGGNGGFTTFLDAAGGAGGAGGAGGLFGA 228
G G G + A + NGG GL G + G+G + GG+G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 229 GGAGGVGGAALGGGAQAAGGNGGAGGVGGLFG--AGGAGGAGGFS 271
G GG G GG GGN A FG A GAGG +
Sbjct: 63 GNGGGNGN---SGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.0 bits (72), Expect = 0.006
Identities = 30/103 (29%), Positives = 37/103 (35%), Gaps = 1/103 (0%)

Query: 198 AGTGGNGGFTTFLDAAGGAGGAGGAGGLFGAGGAGGVGGAALGGGAQAAGGNGGAGGVGG 257
+G G G T +G G G+ G G + +G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 258 LFGAGGAGGAGGFSDTGGTGGAGGAGGLFG-PGGGSGGVGGFG 299
GG G +GG S TGG A A FG P + G GG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.0 bits (72), Expect = 0.006
Identities = 33/108 (30%), Positives = 41/108 (37%), Gaps = 5/108 (4%)

Query: 326 GSAAGGDGGAGGAGGTVFGSGGAGGAGGVATVAGHGGHGGNAGLLYGTGGAGGAGGFGGF 385
G G + GA G + G G GG A+ G G N G+G GG G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGAS-DGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 386 GGDGGDGGIGGLVGSGGAGGSGGTGTLSG----GRGGAGGNAGTFYGS 429
G GG+G GG G+GG + G GAGG A +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 32.0 bits (72), Expect = 0.007
Identities = 32/112 (28%), Positives = 38/112 (33%), Gaps = 7/112 (6%)

Query: 140 GSGGNGGSGLAGSGQKGGNGGAAGLFGNGGAGGAGASNQAGNGGAGGNGGAGGLIWGTAG 199
G G N G+ GG G G G + N GG+G GG G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH--G 63

Query: 200 TGGNGGFTTFLDAAGGAGGAGGAGGLFG-----AGGAGGVGGAALGGGAQAA 246
GG G + GG A A FG GAGG+ + G AA
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 31.6 bits (71), Expect = 0.009
Identities = 29/103 (28%), Positives = 36/103 (34%), Gaps = 1/103 (0%)

Query: 318 GAGGHGGFGSAAGGDGGAGGAGGTVFGSGGAGGAGGVATVAGHGGHGGNAGLLYGTGGAG 377
G G G A G G + GGA G ++ G G +G+ +G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 378 GAGGFGGFGGDGGDGGIGGLVGSGGAGGSGGTGTLSGGRGGAG 420
G GG G G GG G G L G + G GG
Sbjct: 63 GNGGGNGNSG-GGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104


39  Rv2755c  Rv2769c  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv2755c   1       12       -3.047036  Possible type I restriction/modification system
Rv2756c   2       12       -2.549061  Possible type I restriction/modification system
Rv2757c   2       15       -0.661542  Possible toxin VapC21
Rv2758c   2       15       0.188922   Possible antitoxin VapB21
Rv2759c   2       13       -0.910176  Possible toxin VapC42. Contains PIN domain.
Rv2760c   0       12       -0.837855  Possible antitoxin VapB42
Rv2761c   0       12       -0.878574  Possible type I restriction/modification system
Rv2762c   -1      14       -0.856494  Conserved hypothetical protein
Rv2763c   -1      14       -1.031031  Dihydrofolate reductase DfrA (DHFR)
Rv2764c   0       14       -0.619366  Probable thymidylate synthase ThyA (ts) (TSASE)
Rv2765    2       14       0.491579   Probable alanine rich hydrolase
Rv2766c   2       16       0.159253   Probable short-chain type
Rv2767c   3       18       0.409406   Possible membrane protein
Rv2768c   2       14       0.541638   PPE family protein PPE43
Rv2769c   2       14       0.448297   PE family protein PE27
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2760c   adhesinb      28     0.006    Adhesin B signature.
		>adhesinb#Adhesin B signature.

Length = 310

Score = 27.5 bits (61), Expect = 0.006
Identities = 12/36 (33%), Positives = 17/36 (47%)

Query: 31 DAVARRLSELDREDRARAEARRAAAEQTLRDLDKLL 66
+A+RLSE D ++ E A + L LDK
Sbjct: 153 QNIAKRLSEKDPANKETYEKNLKAYVEKLSALDKEA 188


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2766c   DHBDHDRGNASE  113    2e-32    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 113 bits (284), Expect = 2e-32
Identities = 78/261 (29%), Positives = 111/261 (42%), Gaps = 15/261 (5%)

Query: 1 MTSLDLTGRTAIITGASRGIGLAIAQQLAAAGAHVV---LTARRQEAADEAAAQVGDRAL 57
M + + G+ A ITGA++GIG A+A+ LA+ GAH+ + E + A
Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE 60

Query: 58 GVGAHAVDEDAARRCVDLTLERFGSVDILINNAGTNPAYGPLLEQDHARFAKIFDVNLWA 117
A D A G +DIL+N AG G + + F VN
Sbjct: 61 AFPADVRDSAAIDEITARIEREMGPIDILVNVAGV-LRPGLIHSLSDEEWEATFSVNSTG 119

Query: 118 PLMWTSLVVTAWMGEHGGAVVNTASIGGMHQSPAMGMYNATKAALIHVTKQLALELSPR- 176
+ V M G++V S +M Y ++KAA + TK L LEL+
Sbjct: 120 VFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYN 179

Query: 177 IRVNAICPGVVRTRLAEALWKDHE----------DPLAATIALGRIGEPADIASAVAFLV 226
IR N + PG T + +LW D + I L ++ +P+DIA AV FLV
Sbjct: 180 IRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLV 239

Query: 227 SDAASWITGETMIIDGGLLLG 247
S A IT + +DGG LG
Sbjct: 240 SGQAGHITMHNLCVDGGATLG 260


40  Rv2795c  Rv2801A  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv2795c   3       19       2.603284   Conserved hypothetical protein
Rv2796c   2       18       2.667165   Probable conserved lipoprotein LppV
Rv2797c   1       17       2.694175   Conserved hypothetical protein
Rv2798c   1       17       3.270112   Conserved hypothetical protein
Rv2799    2       16       3.029797   Probable membrane protein
Rv2800    1       13       3.395093   Possible hydrolase
Rv2801c   3       18       2.428215   Toxin MazF9
Rv2801A   2       17       2.722385   Possible antitoxin MazE9
41  Rv2813  Rv2828c  Y  N  Y  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv2813    4       20       -3.567047  Conserved hypothetical protein
Rv2814c   5       21       -5.089063  Probable transposase
Rv2815c   6       22       -5.306011  Probable transposase
Rv2816c   7       22       -5.528042  Conserved hypothetical protein
Rv2817c   7       20       -5.318585  Conserved hypothetical protein
Rv2818c   7       21       -5.034715  Hypothetical protein
Rv2819c   7       20       -4.509025  Hypothetical protein
Rv2820c   7       18       -3.135429  Hypothetical protein
Rv2821c   7       14       -2.600514  Conserved hypothetical protein
Rv2822c   4       13       -1.580178  Hypothetical protein
Rv2823c   4       13       -0.554981  Conserved protein
Rv2824c   2       13       2.299891   Hypothetical protein
Rv2825c   3       16       3.634536   Conserved hypothetical protein
Rv2826c   3       13       2.989660   Hypothetical protein
Rv2827c   2       16       4.001648   Hypothetical protein
Rv2828c   2       21       3.685422   Conserved hypothetical protein
42  Rv2884  Rv2894c  Y  N  Y  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv2884    3       14       -0.232587  Probable transcriptional regulatory protein
Rv2885c   2       11       -0.179899  Probable transposase
Rv2886c   3       13       0.730763   Probable resolvase
Rv2887    4       10       1.369632   Probable transcriptional regulatory protein
Rv2888c   3       9        1.760112   Probable amidase AmiC (aminohydrolase)
Rv2889c   2       9        2.450651   Probable elongation factor Tsf (EF-ts)
Rv2890c   0       9        2.529580   30S ribosomal protein S2 RpsB
Rv2891    0       9        3.867922   Conserved hypothetical protein
Rv2892c   -1      11       3.497069   PPE family protein PPE45
Rv2893    -2      11       3.685327   Possible oxidoreductase
Rv2894c   -2      13       3.035410   Probable integrase/recombinase XerC
43  Rv2947c  Rv2959c  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv2947c   0       25       -4.611643  Probable polyketide synthase Pks15
Rv2948c   1       23       -4.901506  P-hydroxybenzoyl-AMP ligase FadD22
Rv2949c   0       20       -4.853720  Chorismate pyruvate lyase
Rv2950c   0       16       -3.145962  Fatty-acid-AMP ligase FadD29 (fatty-acid-AMP
Rv2951c   2       16       -3.414874  Possible oxidoreductase
Rv2952    2       13       -3.533371  Possible methyltransferase (methylase)
Rv2953    2       14       -2.939932  Enoyl reductase
Rv2954c   3       17       -4.688124  Hypothetical protein
Rv2955c   4       17       -4.751311  Conserved protein
Rv2956    3       20       -5.835591  Conserved protein
Rv2957    1       15       -3.299044  Possible glycosyl transferase
Rv2958c   0       13       -3.146428  Possible glycosyl transferase
Rv2959c   -2      14       -3.778726  Possible methyltransferase (methylase)
44  Rv3010c  Rv3020c  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv3010c   1       14       3.146122   Probable 6-phosphofructokinase PfkA
Rv3011c   1       13       3.171340   Probable glutamyl-tRNA(GLN) amidotransferase
Rv3012c   1       11       2.674285   Probable glutamyl-tRNA(GLN) amidotransferase
Rv3013    -2      14       1.679366   Conserved protein
Rv3014c   -1      16       1.335733   DNA ligase [NAD dependent] LigA
Rv3015c   3       32       -0.178987  Conserved hypothetical protein
Rv3016    7       39       -2.092430  Probable lipoprotein LpqA
Rv3017c   11      48       -2.332614  ESAT-6 like protein EsxQ (TB12.9) (ESAT-6 like
Rv3018c   12      52       -2.111885  PPE family protein PPE46
Rv3018A   10      52       -1.224409  PE family protein PE27A
Rv3019c   4       26       -0.672231  Secreted ESAT-6 like protein EsxR (TB10.3)
Rv3020c   2       22       -0.164838  ESAT-6 like protein EsxS
45  Rv3046c  Rv3051c  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv3046c   2       16       -4.020797  Conserved protein
Rv3047c   2       15       -4.434208  Hypothetical protein
Rv3048c   2       16       -4.489297  Ribonucleoside-diphosphate reductase (beta
Rv3049c   0       15       -3.853219  Probable monooxygenase
Rv3050c   1       13       -3.839952  Probable transcriptional regulatory protein
Rv3051c   0       12       -4.104124  Ribonucleoside-diphosphate reductase (alpha
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv3046c   PF06057       26     0.048    Type IV secretory pathway VirJ component
		>PF06057#Type IV secretory pathway VirJ component

Length = 243

Score = 26.0 bits (57), Expect = 0.048
Identities = 12/40 (30%), Positives = 18/40 (45%), Gaps = 5/40 (12%)

Query: 70 RDEIAEFISEVTHHDAGPENIQR-VAGILAAAGWPLAGVD 108
+ + F+S D G + + V GIL GWP+ G
Sbjct: 50 KPPLVIFLS----GDGGWATLDKAVGGILQQQGWPVVGWS 85


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv3050c   HTHTETR       53     1e-10    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 52.7 bits (126), Expect = 1e-10
Identities = 27/145 (18%), Positives = 47/145 (32%), Gaps = 2/145 (1%)

Query: 23 RWREHRKKVRNEIVDAAFRAIDRLGPE-LSVRQIAEEAGTAKPKIYRHFTDKSDLLEAIG 81
+ ++ ++ R I+D A R + G S+ +IA+ AG + IY HF DKSDL I
Sbjct: 4 KTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIW 63

Query: 82 MRLRDMLWAAIFPSLDLATDSAREVIRRSVEEYVNLVDQHPNVLRVF-IQGRSAKQSEAT 140
+ V+R + + + I +
Sbjct: 64 ELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEM 123

Query: 141 VRTLNEGREITLAMAEMFNNELREM 165
R + L + L+
Sbjct: 124 AVVQQAQRNLCLESYDRIEQTLKHC 148


46  Rv3106  Rv3124  Y  N  Y  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv3106    0       20       -3.445891  NADPH:adrenodoxin oxidoreductase FprA
Rv3107c   0       29       -5.137650  Possible alkyldihydroxyacetonephosphate synthase
Rv3108    1       39       -8.899233  Hypothetical protein
Rv3109    1       37       -8.936958  Probable molybdenum cofactor biosynthesis
Rv3110    2       24       -5.049264  Probable pterin-4-alpha-carbinolamine
Rv3111    1       22       -5.322195  Probable molybdenum cofactor biosynthesis
Rv3112    0       18       -4.863981  Probable molybdenum cofactor biosynthesis
Rv3113    0       16       -4.004400  Possible phosphatase
Rv3114    -1      15       -3.822089  Conserved hypothetical protein
Rv3115    -1      15       -3.076631  Probable transposase
Rv3116    -2      19       -4.935985  Probable molybdenum cofactor biosynthesis
Rv3117    -1      17       -3.507305  Probable thiosulfate sulfurtransferase CysA3
Rv3118    -3      23       -2.441617  Conserved hypothetical protein SseC1
Rv3119    -1      28       -4.345504  Probable molybdenum cofactor biosynthesis
Rv3120    -2      30       -2.862690  Conserved hypothetical protein
Rv3121    -2      32       -3.490963  Probable cytochrome P450 141 Cyp141
Rv3122    -2      29       -2.600491  Hypothetical protein
Rv3123    -2      24       -3.226400  Hypothetical protein
Rv3124    -2      23       -3.700222  Transcriptional regulatory protein MoaR1
47  Rv3171c  Rv3190c  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv3171c   2       15       0.808441   Possible non-heme haloperoxidase Hpx
Rv3172c   2       16       -0.112830  Hypothetical protein
Rv3173c   2       14       0.247120   Probable transcriptional regulatory protein
Rv3174    3       13       -0.431042  Probable short-chain dehydrogenase/reductase
Rv3175    2       14       -0.046652  Possible amidase (aminohydrolase)
Rv3176c   3       14       -1.112938  Probable epoxide hydrolase MesT (epoxide
Rv3177    2       13       -1.847999  Possible peroxidase (non-haem peroxidase)
Rv3178    3       14       -0.880026  Conserved hypothetical protein
Rv3179    4       13       -0.519786  Conserved protein
Rv3180c   3       21       -0.134196  Hypothetical alanine rich protein
Rv3181c   2       19       -0.711156  Conserved protein
Rv3182    4       24       -0.986457  Conserved hypothetical protein
Rv3183    5       24       -0.591639  Possible transcriptional regulatory protein
Rv3184    4       26       -1.065340  Probable transposase for insertion sequence
Rv3185    3       23       -2.316897  Probable transposase
Rv3186    3       22       -2.768652  Probable transposase for insertion sequence
Rv3187    3       20       -2.834686  Probable transposase
Rv3188    2       18       -2.477560  Conserved hypothetical protein
Rv3189    0       12       -3.368618  Conserved hypothetical protein
Rv3190c   0       10       -3.316499  Hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv3173c   HTHTETR       89     2e-24    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 89.3 bits (221), Expect = 2e-24
Identities = 42/169 (24%), Positives = 67/169 (39%), Gaps = 12/169 (7%)

Query: 10 PPRRGGRGARQRILKAAAELFYCEGINATGVELIANKASVSKRTLYQHFPSKSALVEEYL 69
++ + RQ IL A LF +G+++T + IA A V++ +Y HF KS L E
Sbjct: 4 KTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIW 63

Query: 70 RGLRQAAGEA-----DKMPKASNATPRERLLALFDRPNRGDGR--MRGCPFHNAAVEAAG 122
GE K P + RE L+ + + + R + FH E G
Sbjct: 64 ELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKC--EFVG 121

Query: 123 EMPGV---ERIVHSHKRDYIKGLARLAREAGAAHPRSLGNQLAVLFEGA 168
EM V +R + D I+ + EA + + A++ G
Sbjct: 122 EMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGY 170


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv3174    DHBDHDRGNASE  63     5e-14    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 63.1 bits (153), Expect = 5e-14
Identities = 51/189 (26%), Positives = 83/189 (43%), Gaps = 15/189 (7%)

Query: 7 RTVLVTGANRGMGREYVAQLLGRKVAKVYAATRNPLAIDVSDPRVIP-------LQLDVT 59
+ +TGA +G+G E VA+ L + A + A NP ++ + DV
Sbjct: 9 KIAFITGAAQGIG-EAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVR 67

Query: 60 DAVSVAE-AADLATDVG---ILINNAGISRASSVLDKDTSALRGELETNLFGPLALASAF 115
D+ ++ E A + ++G IL+N AG+ R + N G + +
Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 116 ADRI-AERSGAIVNVSSVLAWLP-LGMS-YGVSKAAMWSATESMRIELAPRGVQVVGVYV 172
+ + RSG+IV V S A +P M+ Y SKAA T+ + +ELA ++ V
Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187

Query: 173 GLVDTDMGR 181
G +TDM
Sbjct: 188 GSTETDMQW 196


48  Rv3338  Rv3388  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv3338    5       14       -2.545656  Conserved hypothetical protein
Rv3339c   6       14       -0.966511  Probable isocitrate dehydrogenase [NADP] Icd1
Rv3340    10      19       2.779484   Probable O-acetylhomoserine sulfhydrylase MetC
Rv3341    11      20       2.899610   Probable homoserine O-acetyltransferase MetA
Rv3342    10      17       0.893963   Possible methyltransferase (methylase)
Rv3343c   10      17       0.794661   PPE family protein PPE54
Rv3344c   10      19       3.199528   PE-PGRS family protein PE_PGRS49
Rv3345c   10      15       0.945335   PE-PGRS family protein PE_PGRS50
Rv3346c   8       14       -1.238247  Conserved transmembrane protein
Rv3347c   8       14       -1.180532  PPE family protein PPE55
Rv3348    8       14       -0.497492  Probable transposase
Rv3349c   8       14       -0.448566  Probable transposase
Rv3350c   8       15       -0.647955  PPE family protein PPE56
Rv3351c   0       19       1.039976   Conserved hypothetical protein
Rv3352c   0       20       1.507530   Possible oxidoreductase
Rv3353c   -1      15       0.382426   Conserved hypothetical protein
Rv3354    0       13       0.830967   Conserved hypothetical protein
Rv3355c   2       15       0.429387   Probable integral membrane protein
Rv3356c   1       14       0.196785   Probable bifunctional protein FolD:
Rv3357    1       14       -0.934319  Antitoxin RelJ
Rv3358    0       13       -0.766159  Toxin RelK
Rv3359    0       13       -0.127468  Possible oxidoreductase
Rv3360    0       10       0.871647   Conserved hypothetical protein
Rv3361c   -2      10       1.230260   Conserved protein
Rv3362c   1       11       3.492255   Probable ATP/GTP-binding protein
Rv3363c   1       12       3.815775   Conserved hypothetical protein
Rv3364c   0       14       3.910305   Conserved protein
Rv3365c   -1      15       4.341340   Conserved protein
Rv3366    0       16       4.228284   Probable tRNA/rRNA methylase SpoU (tRNA/rRNA
Rv3367    0       15       4.151671   PE-PGRS family protein PE_PGRS51
Rv3368c   -1      15       2.922217   Possible oxidoreductase
Rv3369    -2      14       2.929624   Conserved protein
Rv3370c   -1      13       2.577348   Probable DNA polymerase III (alpha chain) DnaE2
Rv3371    1       16       0.569158   Possible triacylglycerol synthase
Rv3372    0       21       -2.554969  Trehalose 6-phosphate phosphatase OtsB2
Rv3373    4       29       -6.195096  Probable enoyl-CoA hydratase EchA18 (enoyl
Rv3374    1       28       -5.840353  Probable enoyl-CoA hydratase (fragment) EchA18.1
Rv3375    1       25       -5.476062  Probable amidase AmiD (acylamidase) (acylase)
Rv3376    1       29       -6.609995  Conserved hypothetical protein
Rv3377c   0       28       -6.423126  Halimadienyl diphosphate synthase
Rv3378c   -2      26       -4.760431  Diterpene synthase
Rv3379c   -1      21       -2.121729  Probable 1-deoxy-D-xylulose 5-phosphate synthase
Rv3380c   1       22       -1.792685  Probable transposase
Rv3381c   2       28       -1.427989  Probable transposase for insertion sequence
Rv3382c   1       21       -1.241136  Probable LYTB-related protein LytB1
Rv3383c   4       17       4.819016   Possible polyprenyl synthetase IdsB (polyprenyl
Rv3384c   4       20       6.376818   Possible toxin VapC46. Contains PIN domain.
Rv3385c   5       18       5.411336   Possible antitoxin VapB46
Rv3386    4       15       4.690265   Possible transposase
Rv3387    3       17       3.903543   Possible transposase
Rv3388    3       16       3.919443   PE-PGRS family protein PE_PGRS52
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv3343c   cloacin       32     0.036    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 32.0 bits (72), Expect = 0.036
Identities = 28/79 (35%), Positives = 34/79 (43%), Gaps = 7/79 (8%)

Query: 1971 GAGNVGGFNVGGGNIGGNNVGLGNVGWGNFGLG-NSGLTP--GLMGLGNIGFGNAGSYNF 2027
G G+ G + GNI G GLG G + G G +S P G G G G +G N
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 2028 GLANMGVGNIGFANTGSGN 2046
G G GN G + GN
Sbjct: 66 G----GNGNSGGGSGTGGN 80



Score = 32.0 bits (72), Expect = 0.036
Identities = 29/106 (27%), Positives = 40/106 (37%), Gaps = 4/106 (3%)

Query: 193 NVGLFNAGSGNVGSYNVGAGNVGSYNVGGGNIGGNNVGLGNVGWGNFGLGNSGLTPGLMG 252
N G + SGN+ G G G + G G NN G G G G SG G
Sbjct: 10 NTGAHST-SGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGN 68

Query: 253 LGNIGFGNAGSYNFG--LANMGVGNIGFANTGSGNFGIGLTGDNLT 296
GN G G+ N A + G + G+G + ++ L+
Sbjct: 69 -GNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALS 113



Score = 31.6 bits (71), Expect = 0.043
Identities = 26/94 (27%), Positives = 37/94 (39%), Gaps = 3/94 (3%)

Query: 1842 GNTTGAPSSGFFNSGAGGVSGFGNVGAMVSGGWNQAPSALLGGGSGVFNAGTLHSGVLNF 1901
GN G P+ GA SG+ + GG GGGSG N G +
Sbjct: 18 GNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHW--GGGSGHGNGGGNGNSGGGS 75

Query: 1902 GSGMSGLFNTSVLGLGAPALVS-GLGSVGQQLSG 1934
G+G + + + G PAL + G G + +S
Sbjct: 76 GTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv3344c   cloacin       34     0.001    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 34.3 bits (78), Expect = 0.001
Identities = 30/99 (30%), Positives = 37/99 (37%), Gaps = 1/99 (1%)

Query: 174 GAGGSGGNGGAGGNATGSGGKGGAGGNGGDGS-FGATSGPASIGVTGAPGGNGGKGGAGG 232
G G + G GN G G GG DGS + + + P G GG G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 233 SNPNGSGGDGGKGGNGGAGGNGGSIGANSGIVGGSGGAG 271
SGG G GGN A + G + G+GG
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 33.9 bits (77), Expect = 0.001
Identities = 31/80 (38%), Positives = 36/80 (45%), Gaps = 2/80 (2%)

Query: 121 TGGSGSGIGGGAGGNGGN--GGAGGTGVVLGGKGGDGGNGDHGGPATNPGSGSRGGAGGS 178
+GG G G GA GN GG G GV G G G + ++ GSG G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 179 GGNGGAGGNATGSGGKGGAG 198
GNGG GN+ G G GG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 33.9 bits (77), Expect = 0.001
Identities = 36/121 (29%), Positives = 41/121 (33%), Gaps = 13/121 (10%)

Query: 13 GAGGNGGAGSAGNGGAGGAGGNGGAGGNGGGGDAGNAGSGGNGGKGGDGVGPGSTGGAGG 72
G G G A + GG G G GG D S N GG G G GG+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 73 KGGAGANGGSSNGNARGGNAGNGGHGGAGGSGDTGGAGGAGGQGGFGGTGGSGSGIGGGA 132
G GGN +GG G GG+ A A G G G + A
Sbjct: 63 GNG-------------GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109

Query: 133 G 133
G
Sbjct: 110 G 110



Score = 33.9 bits (77), Expect = 0.001
Identities = 27/79 (34%), Positives = 35/79 (44%), Gaps = 1/79 (1%)

Query: 69 GAGGKGGAGANGGSSNGNARGGNAGNGGHGGAG-GSGDTGGAGGAGGQGGFGGTGGSGSG 127
G G GA + G+ NG G G G G+G S + GG+G +GG G G+G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 128 IGGGAGGNGGNGGAGGTGV 146
G G G G G + V
Sbjct: 66 GGNGNSGGGSGTGGNLSAV 84



Score = 33.1 bits (75), Expect = 0.003
Identities = 28/90 (31%), Positives = 31/90 (34%), Gaps = 4/90 (4%)

Query: 240 GDGGKGGNGGAGGNGGSIGANSGIVGGSGGAGGAGGAGG----NGSLSSGEGGKGGDGGH 295
G G+G N GA G+I +G GGA G G S GG GH
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 296 GGDGVGGNSSVTQGGSGGGGGAGGAGGSGF 325
G G GNS G G GF
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGF 92



Score = 32.8 bits (74), Expect = 0.003
Identities = 27/84 (32%), Positives = 33/84 (39%)

Query: 193 GKGGAGGNGGDGSFGATSGPASIGVTGAPGGNGGKGGAGGSNPNGSGGDGGKGGNGGAGG 252
G G G N G S G+ G + G G + +NP G G G GG+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 253 NGGSIGANSGIVGGSGGAGGAGGA 276
G NSG G+GG A A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 31.2 bits (70), Expect = 0.009
Identities = 22/69 (31%), Positives = 27/69 (39%)

Query: 4 SPAAHGGSGGAGGNGGAGSAGNGGAGGAGGNGGAGGNGGGGDAGNAGSGGNGGKGGDGVG 63
P G GGA G S N GG+G GG G G+ G G+ G G G +
Sbjct: 23 GPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLS 82

Query: 64 PGSTGGAGG 72
+ A G
Sbjct: 83 AVAAPVAFG 91



Score = 30.8 bits (69), Expect = 0.012
Identities = 30/84 (35%), Positives = 36/84 (42%), Gaps = 5/84 (5%)

Query: 223 GNGGKGGAGGSNPNGSGGDGGKGGNGGAGGNGGSIGANSGIVGGSGGAGGAGGAGGNGSL 282
G G GA ++ N +GG G G GGA G N+ GGSG GG G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG---- 61

Query: 283 SSGEGGKGGDGGHGGDGVGGNSSV 306
G GG G+ G G G S+V
Sbjct: 62 -HGNGGGNGNSGGGSGTGGNLSAV 84



Score = 30.8 bits (69), Expect = 0.012
Identities = 31/80 (38%), Positives = 34/80 (42%), Gaps = 7/80 (8%)

Query: 83 SNGNARGGNAGNGGHGGAGGSGDTGGAGGAGGQGGFGGT-------GGSGSGIGGGAGGN 135
S G+ RG N G G G TG G G G G + GGSGSGI G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 136 GGNGGAGGTGVVLGGKGGDG 155
GNGG G G GG+
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 30.8 bits (69), Expect = 0.015
Identities = 30/83 (36%), Positives = 39/83 (46%), Gaps = 1/83 (1%)

Query: 290 GGDG-GHGGDGVGGNSSVTQGGSGGGGGAGGAGGSGFFGGKGGFGGDGGQGGPNGGGTVG 348
GGDG GH + ++ G +G G G G + GSG+ +GG G G GGG+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 349 TVAGGGGNGGVGGRGGDGVFAGA 371
GG GN G G G + A A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVA 85



Score = 30.5 bits (68), Expect = 0.019
Identities = 32/84 (38%), Positives = 33/84 (39%), Gaps = 5/84 (5%)

Query: 333 GGDGGQGGPNGGGTVGTVAGGGGNGGVGGRGGDGVFAGAGGQGGLGGQGGNGGGSTGGNG 392
GGDG T G + GG GVGG DG G G G GS G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDG-----SGWSSENNPWGGGSGSGIHWG 57

Query: 393 GLGGAGGGGGNAPDGGFGGNGGKG 416
G G G GGGN GG G GG
Sbjct: 58 GGSGHGNGGGNGNSGGGSGTGGNL 81



Score = 30.1 bits (67), Expect = 0.021
Identities = 25/86 (29%), Positives = 31/86 (36%)

Query: 369 AGAGGQGGLGGQGGNGGGSTGGNGGLGGAGGGGGNAPDGGFGGNGGKGGQGGIGGGTQSA 428
+G G+G G G GG GLG GG + G G GI G S
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 429 TGLGGDGGDGGDGGNGGNSGAKAGGA 454
G GG G+ G G G + +
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAP 87



Score = 30.1 bits (67), Expect = 0.023
Identities = 33/109 (30%), Positives = 40/109 (36%), Gaps = 9/109 (8%)

Query: 108 GAGGAGGQGGFGGTGGSGSGIGGGAGGNGGNGGAGGTGVVLGGKGGDGGNGDHGGPATNP 167
G G G G T G+ +G G G GG G GG G+G H G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG---- 58

Query: 168 GSGSRGGAGGSGGNGGAGGNATGSGGKGGAGGNGGDGSFGATSGPASIG 216
G G+GG G G +G+GG A F A S P + G
Sbjct: 59 -----GSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv3345c   cloacin       40     8e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 40.1 bits (93), Expect = 8e-05
Identities = 31/104 (29%), Positives = 37/104 (35%), Gaps = 2/104 (1%)

Query: 1174 LGGNGGAGGNGGVSTTGGD--GGAGGKGGNGGDGGNVGLGGDAGSGGAGGNGGIGTDAGG 1231
+ G G G N G +T G+ GG G G GG G + G G GI G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 1232 AGGAGGAGGNGGSSKSTTTGNAGSGGAGGNGGTGLNGAGGAGGA 1275
G GG GN G T + G L+ G G A
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 39.7 bits (92), Expect = 8e-05
Identities = 41/121 (33%), Positives = 50/121 (41%), Gaps = 5/121 (4%)

Query: 1311 AGGKGGNGSSGAASGSGVVNVTAGHGGNGGNGGNGGNGSAGAGGQGGAGGSAGNGGHGGG 1370
+GG G ++GA S SG +N G GG +G S+ GG GS + G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1371 ATGGDGGNGGNGGNSGNSTGVAGLAGGAAGAGGNGGGTSSAAGHGGSGGSGGSGTTGGAG 1430
GNGG GNSG +G G A G S G GG S +G A
Sbjct: 62 -----HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAI 116

Query: 1431 A 1431
A
Sbjct: 117 A 117



Score = 38.5 bits (89), Expect = 2e-04
Identities = 36/100 (36%), Positives = 46/100 (46%), Gaps = 2/100 (2%)

Query: 962 GNGGNGGNGGKGGTAGNGSGAAGGNGGNGG-SGLNGGDAGNGGNGGGALNQAGFFGTGGK 1020
G G G N G T+GN +G G G GG S +G + N GGG+ + + G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1021 GGNGGNGGAGMINGGLGGFGGAGGGGAVDVAA-TTGGAGG 1059
G GGNG +G +G G A A +T GAGG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 37.8 bits (87), Expect = 4e-04
Identities = 38/104 (36%), Positives = 43/104 (41%), Gaps = 3/104 (2%)

Query: 562 SGGNGGNGGAGATPTVAGENGGAGGNGGHGGSVGNGGAGGAGGNGVAGTGLALNGGNGGN 621
SGG+G GA T NGG G G GG + G+G + N G G GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGG--ASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 622 GGIGGNGGSAAGTGGDGGKGGNGGAGANGQDFSASANGANGGQG 665
G GNGG +GG G GGN A A F A G G
Sbjct: 60 SG-HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 37.0 bits (85), Expect = 6e-04
Identities = 26/85 (30%), Positives = 34/85 (40%)

Query: 599 AGGAGGNGVAGTGLALNGGNGGNGGIGGNGGSAAGTGGDGGKGGNGGAGANGQDFSASAN 658
+GG G G NGG G+G GG++ G+G GG +G + +
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 659 GANGGQGGNGGNGGIGGKGGDAFAT 683
NGG GN G G G A A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 36.6 bits (84), Expect = 9e-04
Identities = 34/109 (31%), Positives = 44/109 (40%), Gaps = 3/109 (2%)

Query: 1167 GPEGNGGLGGNGGAGGNGGVSTTGGDGGAGGKGGNGGDGGNVGLGGDAGSG---GAGGNG 1223
G +G G G GN TG G G G+G N GG +GSG G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1224 GIGTDAGGAGGAGGAGGNGGSSKSTTTGNAGSGGAGGNGGTGLNGAGGA 1272
G G G +GG G GGN + + + G GG ++ + GA
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 36.2 bits (83), Expect = 0.001
Identities = 35/116 (30%), Positives = 40/116 (34%), Gaps = 8/116 (6%)

Query: 1337 GNGGNGGNGGNGSAGAGGQGGAGGSAGNGGHGGGATGGDGGNGGNGGNSGNSTGVAGLAG 1396
G G G N G S GG G GG G +G N GG SG+ G +G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDG-SGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1397 GAAGAGGNGGGTSSAAGHGGSGGSGGSGTTGGAGAAGGNGGAGAGGGSLSTGQSGG 1452
G G G GGSG G A G + G G L+ S G
Sbjct: 62 HGNGGGNGNSG-------GGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 35.8 bits (82), Expect = 0.001
Identities = 29/80 (36%), Positives = 34/80 (42%), Gaps = 3/80 (3%)

Query: 1291 GGDGGNGGNGGHGGDGTTGGAGGKGGNGSSGAASGSG--VVNVTAGHGGNGGNGGNGGNG 1348
GGDG G H G G G GA+ GSG N G G G GG+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLG-VGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1349 SAGAGGQGGAGGSAGNGGHG 1368
GG G +GG +G GG+
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 35.8 bits (82), Expect = 0.002
Identities = 24/76 (31%), Positives = 32/76 (42%)

Query: 747 GNGGDGGKGGSGGNVGNGGNGGAGGNGAAGQAGTPGPTSGDSGTSGTDGGAGGNGGAGGA 806
G G + G + GN+ G G G GA+ +G + G SG+ GG G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 807 GGTLAGHGGNGGKGGN 822
GG GG+G G
Sbjct: 66 GGNGNSGGGSGTGGNL 81



Score = 35.1 bits (80), Expect = 0.003
Identities = 28/97 (28%), Positives = 33/97 (34%), Gaps = 1/97 (1%)

Query: 796 GAGGNGGAGGAGGTLAGHGGNGGKGGNGGQG-GIGGAGERGADGAGPNANGANGENGGSG 854
G G N GA G + G G GG G G G+G + G G+G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 855 GNGGDGGAGGNGGAGGKAQAAGYTDGATGTGGDGGNG 891
G G+ G G G A AA G G G
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 35.1 bits (80), Expect = 0.003
Identities = 27/83 (32%), Positives = 34/83 (40%)

Query: 1350 AGAGGQGGAGGSAGNGGHGGGATGGDGGNGGNGGNSGNSTGVAGLAGGAAGAGGNGGGTS 1409
+G G+G G+ G+ G G G GG SG S+ GG+ GGG+
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1410 SAAGHGGSGGSGGSGTTGGAGAA 1432
G G GGSGT G A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAV 84



Score = 34.7 bits (79), Expect = 0.003
Identities = 29/104 (27%), Positives = 31/104 (29%)

Query: 1136 LGGNGGLGGNGGVSETGFGGAGGNGGYGGPGGPEGNGGLGGNGGAGGNGGVSTTGGDGGA 1195
+ G G G N G T GG G G GG G G G S GG+
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 1196 GGKGGNGGDGGNVGLGGDAGSGGAGGNGGIGTDAGGAGGAGGAG 1239
G G G G G G A GAGG
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 34.7 bits (79), Expect = 0.003
Identities = 36/102 (35%), Positives = 40/102 (39%), Gaps = 3/102 (2%)

Query: 1056 GAGGNGGAGGFASTGLGGPGGAGGPGGAGDFASGVGGVGGAGGDGGAGGVGGFGGQGGIG 1115
G G N GA + GGP G G GGA D SG G G G+ GG G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASD-GSGWSSENNPWGGGSGSGIHWGGGSGHGN 64

Query: 1116 GEGRTGGNGGSGGDGGGGISLGGNGGLGGNGGVSETGFGGAG 1157
G G GN G G GG +S G +S G GG
Sbjct: 65 GGG--NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 34.3 bits (78), Expect = 0.004
Identities = 31/110 (28%), Positives = 37/110 (33%), Gaps = 10/110 (9%)

Query: 379 GGIGATANSPLQAGGAGGNGGHGGLVGNGGTGGAGGAGHAGSTGATGTALQPTGGNGTNG 438
GG G N+ + NGG GL GG G + G+ G G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 439 GAGGHGGNGGNGGAQHGDGGVGGKGGAGGSGGAGGNGFDAATLGSPGADG 488
G GG GN G G G S A F L +PGA G
Sbjct: 63 GNGGGNGNSGGGSG----------TGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 34.3 bits (78), Expect = 0.004
Identities = 28/101 (27%), Positives = 34/101 (33%), Gaps = 1/101 (0%)

Query: 826 GGIGGAGERGADGAGPNANGANGENGGSGGNGGDGGAGGNGGAGGKAQAAGYTDGATGTG 885
GG G GA N NG G GG G G +G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 886 GDGGNGGDGGKAGDGGAGENGLNSGAMLPGGGTVGNPGTGG 926
G+GG G+ G G G G + + G + PG GG
Sbjct: 63 GNGGGNGNSG-GGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 34.3 bits (78), Expect = 0.004
Identities = 37/126 (29%), Positives = 48/126 (38%), Gaps = 10/126 (7%)

Query: 1284 VSFGNAVGGDGGNGGNGGHGGDGTTGGAGGKGGNGSS---------GAASGSGVVNVTAG 1334
+S G+ G + G G+ G TG G G + S G SGSG+
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 1335 HGGNGGNGGNGGNGSAGAGGQGGAGGSAGNGGHGGGATGGDGGNGGNGGNSGNSTGVAGL 1394
GNGG GN G GS G GG A + G +T G GG + S +A +
Sbjct: 61 GHGNGGGNGNSGGGS-GTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADI 119

Query: 1395 AGGAAG 1400
G
Sbjct: 120 MAALKG 125



Score = 33.9 bits (77), Expect = 0.005
Identities = 25/78 (32%), Positives = 31/78 (39%)

Query: 1211 GGDAGSGGAGGNGGIGTDAGGAGGAGGAGGNGGSSKSTTTGNAGSGGAGGNGGTGLNGAG 1270
GGD G + G GG G G GG S ++ N GG+G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1271 GAGGAGGNAGVAGVSFGN 1288
G GG GN+G + GN
Sbjct: 63 GNGGGNGNSGGGSGTGGN 80



Score = 33.9 bits (77), Expect = 0.006
Identities = 24/86 (27%), Positives = 30/86 (34%)

Query: 1347 NGSAGAGGQGGAGGSAGNGGHGGGATGGDGGNGGNGGNSGNSTGVAGLAGGAAGAGGNGG 1406
+G G G GA ++GN G G GG G S + G +G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1407 GTSSAAGHGGSGGSGGSGTTGGAGAA 1432
+ GGSG G A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAP 87



Score = 33.5 bits (76), Expect = 0.008
Identities = 25/71 (35%), Positives = 34/71 (47%)

Query: 876 GYTDGATGTGGDGGNGGDGGKAGDGGAGENGLNSGAMLPGGGTVGNPGTGGNGGNGGNAG 935
G+ GA T G+ G G G G + +G +S GGG+ GG G+G G
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 936 VGGTGGKAGTG 946
G +GG +GTG
Sbjct: 68 NGNSGGGSGTG 78



Score = 33.1 bits (75), Expect = 0.008
Identities = 41/126 (32%), Positives = 44/126 (34%), Gaps = 9/126 (7%)

Query: 1096 AGGDGGAGGVGGFGGQGGIGGEGRTGGNGGSGGDGGGGISLGGNGGLGGNGGVSETGFGG 1155
+GGDG G G I G G TG G G G G S N GG+G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNING-GPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 1156 AGGNGGYGGPGGPEGNGGLGGNGGAGGNGGVSTTGGDGGAGGKGGNGGDGGNVGLGGDAG 1215
GNGG GNG GG G GGN G G G V + A
Sbjct: 61 GHGNGG--------GNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGAL 112

Query: 1216 SGGAGG 1221
S
Sbjct: 113 SAAIAD 118



Score = 33.1 bits (75), Expect = 0.010
Identities = 32/102 (31%), Positives = 38/102 (37%)

Query: 917 GTVGNPGTGGNGGNGGNAGVGGTGGKAGTGSLTGLDGTDGITPNGGNGGNGGNGGKGGTA 976
G G G GN G TG G G+ G + P GG G+G + G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 977 GNGSGAAGGNGGNGGSGLNGGDAGNGGNGGGALNQAGFFGTG 1018
GNG G GG+G G A G AL+ G G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 33.1 bits (75), Expect = 0.010
Identities = 35/102 (34%), Positives = 44/102 (43%), Gaps = 1/102 (0%)

Query: 175 GSGGAGAAGGVGGSGGWLNGNGGAGGAGGTGANGGAGGNAWLFGAGGSGGAGTNGGVGGS 234
G G G G + G +NG G GG GA+ G+G ++ GG G+G + G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGG-GASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 235 GGFVYGNGGAGGIGGIGGIGGNGGDAGLFGNGGAGGAGAAGL 276
G GNG +GG G GG FG GA GL
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGL 103



Score = 32.0 bits (72), Expect = 0.020
Identities = 39/114 (34%), Positives = 47/114 (41%), Gaps = 4/114 (3%)

Query: 285 GDGSDGGNGGTGGN--GGRGGLLVGNGGAGGAG-GVGGDGGKGGAGDPSFAVNNGAGGNG 341
G G + G T GN GG GL VG G + G+G + GG+G GNG
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 342 GHGGNPGVGGAGGAGGLLAGAHGAAGATPTSGGNGGDGGIGATANSPLQAGGAG 395
G GN G GG+G G L A A A P G G + + L A A
Sbjct: 66 GGNGNSG-GGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 32.0 bits (72), Expect = 0.021
Identities = 27/86 (31%), Positives = 32/86 (37%), Gaps = 7/86 (8%)

Query: 901 GAGENGLNSGAMLPGGGTVGNPGTGGNGGNGGNAGVGGTGGKAGTGSLTGLDGTDGITPN 960
G G N+GA G G P G GG G + G + G G +
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGG-------GASDGSGWSSENNPWGGGSGSGIH 55

Query: 961 GGNGGNGGNGGKGGTAGNGSGAAGGN 986
G G GNGG G +G GSG G
Sbjct: 56 WGGGSGHGNGGGNGNSGGGSGTGGNL 81



Score = 31.6 bits (71), Expect = 0.025
Identities = 39/121 (32%), Positives = 47/121 (38%), Gaps = 8/121 (6%)

Query: 1115 GGEGRTGGNGGSGGDGGGGISLGGNGGLGGNGGVSETGFGGAGGNGGYGGPGGPEGNGGL 1174
GG+GR G+ G I+ GG GLG GG S+ + N GG G GG
Sbjct: 3 GGDGR--GHNTGAHSTSGNIN-GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 1175 GGNGGAGGNGGVSTTGGDGGAGGKGGNGGDGGNVGLGGDAGSGGAGGNGGIGTDAGGAGG 1234
G+G GGNG GG+G G V G A S G + AG
Sbjct: 60 SGHGNGGGNGN-----SGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSA 114

Query: 1235 A 1235
A
Sbjct: 115 A 115



Score = 31.6 bits (71), Expect = 0.028
Identities = 29/102 (28%), Positives = 35/102 (34%), Gaps = 2/102 (1%)

Query: 1197 GKGGNGGDGGNVGLGGDAGSGGAGGNGGIGTDAGGAGGAGGAGGNGGSSKSTTTGNAGSG 1256
G G G + G G+ G G G G G + GGS G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1257 GAGGNGGTGLNGAGGAGGAGGNAGVAGVSFGNAVGGDGGNGG 1298
G GG G G+G G +A A V+FG G GG
Sbjct: 63 GNGGGNGNSGGGSGTGG--NLSAVAAPVAFGFPALSTPGAGG 102



Score = 31.6 bits (71), Expect = 0.032
Identities = 21/71 (29%), Positives = 27/71 (38%)

Query: 442 GHGGNGGNGGAQHGDGGVGGKGGAGGSGGAGGNGFDAATLGSPGADGGMGGNGGKGGDGG 501
G G G N GA G + G G GG +G ++ +P G G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 502 KAGDGGAGAAG 512
G G + G
Sbjct: 63 GNGGGNGNSGG 73



Score = 31.2 bits (70), Expect = 0.033
Identities = 25/78 (32%), Positives = 32/78 (41%), Gaps = 1/78 (1%)

Query: 1016 GTGGKGGNGG-NGGAGMINGGLGGFGGAGGGGAVDVAATTGGAGGNGGAGGFASTGLGGP 1074
G G+G N G + +G INGG G G GG ++ G G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1075 GGAGGPGGAGDFASGVGG 1092
G GG G +G + G
Sbjct: 63 GNGGGNGNSGGGSGTGGN 80



Score = 31.2 bits (70), Expect = 0.034
Identities = 26/79 (32%), Positives = 32/79 (40%)

Query: 221 GSGGAGTNGGVGGSGGFVYGNGGAGGIGGIGGIGGNGGDAGLFGNGGAGGAGAAGLPGAA 280
G G G N G + G + G G+GG G GG+G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 281 GLNGGDGSDGGNGGTGGNG 299
G GG+G+ GG GTGGN
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 31.2 bits (70), Expect = 0.041
Identities = 32/104 (30%), Positives = 35/104 (33%)

Query: 250 IGGIGGNGGDAGLFGNGGAGGAGAAGLPGAAGLNGGDGSDGGNGGTGGNGGRGGLLVGNG 309
+ G G G + G G G GL G + G G N GG G G G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 310 GAGGAGGVGGDGGKGGAGDPSFAVNNGAGGNGGHGGNPGVGGAG 353
G G GG G GG G G AV PG GG
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 30.8 bits (69), Expect = 0.043
Identities = 35/111 (31%), Positives = 40/111 (36%), Gaps = 2/111 (1%)

Query: 538 GGAGGVSANPALNGSAGANGTAPTSGGNGGNGGAGATPTVAGENGGAGGNGGHGGSVGNG 597
GG G A + S NG G GG + GG G+G H G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWG--GGS 60

Query: 598 GAGGAGGNGVAGTGLALNGGNGGNGGIGGNGGSAAGTGGDGGKGGNGGAGA 648
G G GGNG +G G G G A T G GG + AGA
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv3347c   cloacin       32     0.034    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 32.4 bits (73), Expect = 0.034
Identities = 24/91 (26%), Positives = 33/91 (36%), Gaps = 8/91 (8%)

Query: 363 GFNTGVANVGSYNTGSFNAGNTNTGGFNPGNVNTGWLNTGNTNTGIANSGNVNTGAFISG 422
G NTG + G+ N G T G + +GW + N G + SG G G
Sbjct: 8 GHNTGAHSTS----GNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 423 NFSNGVLWRGDYEGLWGLSGGSTIPAIPIGL 453
N G+ G G G + A P+
Sbjct: 64 NGGGN----GNSGGGSGTGGNLSAVAAPVAF 90


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv3349c   INTIMIN       31     0.005    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 30.8 bits (69), Expect = 0.005
Identities = 19/67 (28%), Positives = 33/67 (49%), Gaps = 4/67 (5%)

Query: 4 DPAAAYASAIRTPGLLPNAKLVVDHFHVTTLANDALTAVRRRVTWAFHDRRGRKIDPQWA 63
+P AA TP +P + +D+ H T ND L +++ + F ++I+PQ+
Sbjct: 369 NPGAATVGVNYTP--IPLVTMGIDYRHGTGNENDLLYSMQ--FRYQFDKPWSQQIEPQYV 424

Query: 64 NRRRLLT 70
N R L+
Sbjct: 425 NELRTLS 431


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv3350c   PYOCINKILLER  37     0.002    Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 36.7 bits (84), Expect = 0.002
Identities = 46/199 (23%), Positives = 68/199 (34%), Gaps = 17/199 (8%)

Query: 3442 ISVPSIHLGLDPAVHVGSITVNPITVRTPPVLVSYSQGAVTSTSGPTSEIWVKPSFFPGI 3501
+ + + LGL P+V++ ++ TV P L + ++G T+ S +++ P P
Sbjct: 328 LGMDAAKLGLPPSVNLNAVAKASGTVDLPMRLTNEARGNTTTLSVVSTDGVSVPKAVPVR 387

Query: 3502 RIAPSSGGGATSTQGAYFVGPISIPSGTVTFPGFTIPLDPIDIGLPVSLT--IP-GFTI- 3557
A +T G Y V S + P P P S T +P +
Sbjct: 388 MAAY------NATTGLYEVTVPSTTAEAPPLILTWTPASPPGNQNPSSTTPVVPKPVPVY 441

Query: 3558 PGGTLIPTLPLGLALSNGIPPVDIPAIVLDRILLDLHADTTIGPINVPIAGFGGAPGFGN 3617
G TL P I + I AD+ I PI V PG
Sbjct: 442 EGATLTPVKATPETYPGVITLPEDLIIGFP-------ADSGIKPIYVMFRDPRDVPGAAT 494

Query: 3618 STTLPSSGFFNTGAGGGSG 3636
P SG + A G G
Sbjct: 495 GKGQPVSGNWLGAASQGEG 513


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv3357    PF04605  26     0.022    Virulence-associated protein D (VapD)
		>PF04605#Virulence-associated protein D (VapD)

Length = 125

Score = 25.6 bits (56), Expect = 0.022
Identities = 9/38 (23%), Positives = 16/38 (42%)

Query: 47 QETVYLLRSPENARRLMEAVARDKAGHSAFTKSVDELR 84
Q + Y + P N RR++ V + + + V E
Sbjct: 44 QYSGYTSKEPINERRVIRIVNKLTKKFTWLGECVKEFD 81


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv3367    cloacin  40     2e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 40.5 bits (94), Expect = 2e-05
Identities = 31/84 (36%), Positives = 40/84 (47%), Gaps = 1/84 (1%)

Query: 445 GNGGNGGNGGTGGSGGVGGNGGIGGDGAGGGNATSTSSIPF-DAHGGNGGAGGDAGHGGT 503
G G N G T G+ G G G GA G+ S+ + P+ G GG +GHG
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 504 GGDGGDGGHAGTGGRGGLLAGQHA 527
GG+G GG +GTGG +A A
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVA 89



Score = 38.2 bits (88), Expect = 8e-05
Identities = 29/76 (38%), Positives = 32/76 (42%), Gaps = 2/76 (2%)

Query: 154 GNGGNAGLIGNGGT--GGSGGAGAAGGAGGSGGWLYGNGGNGGIGGNAIVAGGAGGNGGA 211
G G N G G GG G G GGA GW N GG G+ I GG G+G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 212 GGAAGLWGSGGSGGQG 227
GG G G+GG
Sbjct: 66 GGNGNSGGGSGTGGNL 81



Score = 37.0 bits (85), Expect = 2e-04
Identities = 25/70 (35%), Positives = 33/70 (47%)

Query: 413 GGDGGNGGTGGTGGRGGDGGSGGAGGASGWLMGNGGNGGNGGTGGSGGVGGNGGIGGDGA 472
G+ G TG G G GSG + + W G+G GG G G GGNG GG
Sbjct: 17 SGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSG 76

Query: 473 GGGNATSTSS 482
GGN ++ ++
Sbjct: 77 TGGNLSAVAA 86



Score = 36.6 bits (84), Expect = 2e-04
Identities = 32/91 (35%), Positives = 37/91 (40%), Gaps = 4/91 (4%)

Query: 307 GAAGGDGGANGGAGGAGGQAASAGSSVGGDGGNGGAGGTGTNGHAGGAGGAGGAGGRGGW 366
GA G NGG G G ++ S N GG+G+ H GG G G GG G
Sbjct: 12 GAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNS 71

Query: 367 LVGNGGNGGNGAAGGNGAIG----GTGGAGG 393
G+G G A A G T GAGG
Sbjct: 72 GGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 35.8 bits (82), Expect = 4e-04
Identities = 33/104 (31%), Positives = 40/104 (38%), Gaps = 4/104 (3%)

Query: 309 AGGDG-GANGGAGGAGGQAASAGSSVGGDGGNGGAGGTGTNGHAGGAGGAGGAGGRGGWL 367
+GGDG G N GA G + +G GG G + + G G G GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 368 VGNGGNGGNGAAGGNGAIGGTGGAGGVPANQGGNSALGTQPVGG 411
GNGG GN G G+ G + G AL T GG
Sbjct: 62 HGNGGGNGN---SGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 35.1 bits (80), Expect = 7e-04
Identities = 36/106 (33%), Positives = 43/106 (40%), Gaps = 9/106 (8%)

Query: 369 GNGGNGGNGAAGGNGAIGGTGGAGGVPANQGGNSALGTQPVGGDGGDGGNGGTGGTGGRG 428
G G N G + GN G TG G A+ G + P GG G G GG G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGG--GSGSGIHWGGGSGHG 63

Query: 429 GDGGSGGAGGASGWLMGNGGNGGNGGTGGSGGVGGNGGIGGDGAGG 474
GG+G +GG S G GGN + G + GAGG
Sbjct: 64 NGGGNGNSGGGS-------GTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 33.5 bits (76), Expect = 0.002
Identities = 27/77 (35%), Positives = 33/77 (42%)

Query: 390 GAGGVPANQGGNSALGTQPVGGDGGDGGNGGTGGTGGRGGDGGSGGAGGASGWLMGNGGN 449
G G N G +S G G G G G + G+G + GG G+ G G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 450 GGNGGTGGSGGVGGNGG 466
G GG G SGG G GG
Sbjct: 63 GNGGGNGNSGGGSGTGG 79



Score = 33.1 bits (75), Expect = 0.003
Identities = 33/116 (28%), Positives = 41/116 (35%), Gaps = 12/116 (10%)

Query: 443 LMGNGGNGGNGGTGGSGGVGGNGGIGGDGAGGGNATSTSSIPFDAHGGNGGAGGDAGHGG 502
+ G G G N G + G G G GG + S S + GG G+G G G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 503 TGGDGGDGGHAGTGGRGGLLAGQHANSGNGGGGGTGGAGGTHGTPGSGNAGGTGTG 558
G+GG G++G G SG GG A G P G G
Sbjct: 61 GHGNGGGNGNSGGG------------SGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.4 bits (73), Expect = 0.006
Identities = 18/64 (28%), Positives = 25/64 (39%)

Query: 270 DGTPGGAGVNGGNGGAGGDANGNPANTSIANAGAGGNGAAGGDGGANGGAGGAGGQAASA 329
+G P G GV GG G ++ N + +G G +G G G G G
Sbjct: 21 NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGN 80

Query: 330 GSSV 333
S+V
Sbjct: 81 LSAV 84



Score = 31.6 bits (71), Expect = 0.010
Identities = 26/82 (31%), Positives = 34/82 (41%), Gaps = 7/82 (8%)

Query: 285 AGGDANGNPANTSIANAGAGGNGAAGGDGGANGGAGGAGGQA-----ASAGSSVGGDGGN 339
+GGD G+ NT + NG G G G + G+G + S GG
Sbjct: 2 SGGDGRGH--NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 340 GGAGGTGTNGHAGGAGGAGGAG 361
G G G NG++GG G GG
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNL 81



Score = 30.8 bits (69), Expect = 0.016
Identities = 25/91 (27%), Positives = 28/91 (30%)

Query: 137 GNGGAGYNSAATPGMAGGNGGNAGLIGNGGTGGSGGAGAAGGAGGSGGWLYGNGGNGGIG 196
G N P G GG + G G G+ G GG +GNGG G
Sbjct: 12 GAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNS 71

Query: 197 GNAIVAGGAGGNGGAGGAAGLWGSGGSGGQG 227
G GG A A G G G
Sbjct: 72 GGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 30.8 bits (69), Expect = 0.016
Identities = 26/92 (28%), Positives = 32/92 (34%), Gaps = 2/92 (2%)

Query: 354 AGGAGGAGGRGGWLVGNGGNGGNGAAGGNGAIGGTGGAGGVPANQGGNSALGTQPVGGDG 413
+GG G G NGG G G G + G+G N G+ G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGG--GASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 414 GDGGNGGTGGTGGRGGDGGSGGAGGASGWLMG 445
GNGG G G G G + A+ G
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 30.5 bits (68), Expect = 0.025
Identities = 32/106 (30%), Positives = 43/106 (40%), Gaps = 16/106 (15%)

Query: 337 GGNGGAGGTGTNGHAGGAGGAGGAGGRGGWLVGNGGNGGNGAAGGNGAIGGTGGAGGVPA 396
GG+G TG + +G G G GG G + G+G + N GG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGG-----GASDGSGWSSENNPWGG--------- 48

Query: 397 NQGGNSALGTQPVGGDGGDGGNGGTGGTGGRGGDGGSGGAGGASGW 442
G S + G G GGNG +GG G GG+ + A A G+
Sbjct: 49 --GSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGF 92



Score = 29.7 bits (66), Expect = 0.037
Identities = 23/71 (32%), Positives = 30/71 (42%), Gaps = 7/71 (9%)

Query: 523 AGQHANSGNGGGGGTGGAGGTHGTPGSG-------NAGGTGTGNADSTNGGPGSDGLGGD 575
G H+ SGN GG TG G + GSG GG+G+G G G+ G G+
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGN 70

Query: 576 AFNGSRGTDGN 586
+ GS
Sbjct: 71 SGGGSGTGGNL 81



Score = 29.7 bits (66), Expect = 0.041
Identities = 30/86 (34%), Positives = 31/86 (36%), Gaps = 12/86 (13%)

Query: 335 GDGGNGGAGGTG--TNGHAGGAGGAGGAGGRGGWLVGNGGNGGNGAAGGNGAIGGTGGAG 392
G G N GA T NG G G GGA GW N GG G I GG+G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGG----GSGSGIHWGGGSG 61

Query: 393 GVPANQGGNSALGTQPVGGDGGDGGN 418
GNS GG G G
Sbjct: 62 HGNGGGNGNSG------GGSGTGGNL 81



Score = 29.3 bits (65), Expect = 0.044
Identities = 27/88 (30%), Positives = 32/88 (36%)

Query: 490 GNGGAGGDAGHGGTGGDGGDGGHAGTGGRGGLLAGQHANSGNGGGGGTGGAGGTHGTPGS 549
G G G + G T G+ G G G ++ N GGG+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 550 GNAGGTGTGNADSTNGGPGSDGLGGDAF 577
GN GG G S GG S AF
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAF 90


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv3388    cloacin  39     7e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.9 bits (90), Expect = 7e-05
Identities = 38/104 (36%), Positives = 47/104 (45%), Gaps = 14/104 (13%)

Query: 583 TGSGGIGGNGGAGGTGGNAGVALSVGSTGGLGGNGGSGGLGGGGGSLFGNGGAGGVGATG 642
+G G G N GA T GN NGG GLG GGG+ G+G + G
Sbjct: 2 SGGDGRGHNTGAHSTSGNI--------------NGGPTGLGVGGGASDGSGWSSENNPWG 47

Query: 643 GNGGSGIGPASVGGNGGKGGVGAAGGLAGQIGNGGSGGSGGAGG 686
G GSGI G+G GG G +GG +G GN + + A G
Sbjct: 48 GGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 37.0 bits (85), Expect = 3e-04
Identities = 25/75 (33%), Positives = 34/75 (45%)

Query: 656 GNGGKGGVGAAGGLAGQIGNGGSGGSGGAGGNGGTGDTAGNGGNGGAGAVGGNAQLIGNG 715
G G+G A +G I G +G G G + G+G ++ N GG G +
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 716 GNGGGGGNGGTGADG 730
GNGGG GN G G+
Sbjct: 63 GNGGGNGNSGGGSGT 77



Score = 36.6 bits (84), Expect = 3e-04
Identities = 31/101 (30%), Positives = 41/101 (40%), Gaps = 2/101 (1%)

Query: 332 GTGGTGGAGGAAGWLYGSGGAGGAGGAGGLNNAGGATGGTGGTGGAGGSGAWLYGNGGAA 391
G G GA +G + +GG G G GG ++ G + GG GSG G G
Sbjct: 6 GRGHNTGAHSTSGNI--NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 392 GAGGNGGNNTSAGTGGVGASGGTGGNAGLIGAGGHGGAGGA 432
GGNG + +GTGG ++ G G G A
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 36.6 bits (84), Expect = 4e-04
Identities = 33/87 (37%), Positives = 42/87 (48%), Gaps = 1/87 (1%)

Query: 569 GNGGNGGAGGTAGFTGSGGIGGNGGAGGTGGNAGVALSVGSTGGLGGNGGSGGLGGGGGS 628
G G N GA T+G +GG G G GG +G + GG G+G G G G G+
Sbjct: 6 GRGHNTGAHSTSGNI-NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGN 64

Query: 629 LFGNGGAGGVGATGGNGGSGIGPASVG 655
GNG +GG TGGN + P + G
Sbjct: 65 GGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 35.1 bits (80), Expect = 0.001
Identities = 30/92 (32%), Positives = 36/92 (39%), Gaps = 10/92 (10%)

Query: 423 AGGHGGAGGAGGNQTGGVGNGGAGGNGGAGGAGGQLYGNGGDGGNGGAGGANIAGGNGSD 482
+GG G G + T G NGG G G GGA + + GG G+ I G GS
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 483 GGAAGHGGAGGSARLIGAGGHGGDGGAGGNTA 514
G G G G GG G G +A
Sbjct: 62 HGNGGGNGNSG----------GGSGTGGNLSA 83



Score = 35.1 bits (80), Expect = 0.001
Identities = 27/87 (31%), Positives = 32/87 (36%), Gaps = 1/87 (1%)

Query: 368 TGGTGGTGGAGGSGAWLYGNGGAAGAGGNGGNNTSAGTGGVGASGGTGGNAGLIGAGGHG 427
+GG G G NGG G G GG + +G G G +G I GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSG-IHWGGGS 60

Query: 428 GAGGAGGNQTGGVGNGGAGGNGGAGGA 454
G G GGN G G+G G
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAP 87



Score = 33.1 bits (75), Expect = 0.004
Identities = 29/103 (28%), Positives = 39/103 (37%), Gaps = 2/103 (1%)

Query: 293 NGGAGGAGGGGGYLVGDGGAGGTGGAGGKNSSGGATLTGGTGGTGGAGGAAGWLYGSGGA 352
+GG G G + GG G G G + +G + GG +G GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGV--GGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 353 GGAGGAGGLNNAGGATGGTGGTGGAGGSGAWLYGNGGAAGAGG 395
G G GG N+GG +G G A+ + GAGG
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 32.8 bits (74), Expect = 0.005
Identities = 25/92 (27%), Positives = 35/92 (38%), Gaps = 6/92 (6%)

Query: 523 GTGGDGGNGGNGGLLSGNAGAGGHGGAGGSSTATTTTGTPPTGATGGNGGNGGAGGTAGF 582
G G + G G ++G G GG + ++ P G +G GG G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG---- 61

Query: 583 TGSGGIGGNGGAGGTGGNAGVALSVGSTGGLG 614
G GGNG +GG G G +V + G
Sbjct: 62 --HGNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 32.8 bits (74), Expect = 0.006
Identities = 31/93 (33%), Positives = 39/93 (41%), Gaps = 3/93 (3%)

Query: 119 GANGTAENPDGQNGGLLFGNGGN---GFTQTTAGVAGGNGGSAGLIGNGGAGGGGGAGAA 175
GA+ T+ N +G GL G G + G++ GG+G G G G GGG G +
Sbjct: 12 GAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNS 71

Query: 176 GGLGGNGGWLYGNGGAGGIGGAGTGTGGHGGAG 208
GG G GG L G T G GG
Sbjct: 72 GGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.8 bits (74), Expect = 0.006
Identities = 27/95 (28%), Positives = 33/95 (34%), Gaps = 1/95 (1%)

Query: 188 NGGAGGIGGAGTGTGGHGGAGGAGGRAWLWGTGGAGGAGGDG-GWLFGDGGAGGTGGNGG 246
N GA G G G GG W + GG G G +G G G GG G
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69

Query: 247 SGFNSLTSSVGGAGGAGGHAGLFGAGGTGGTGGIG 281
+ + + A A F A T G GG+
Sbjct: 70 NSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.8 bits (74), Expect = 0.006
Identities = 36/117 (30%), Positives = 45/117 (38%), Gaps = 1/117 (0%)

Query: 243 GNGGSGFNSLTSSVGGAGGAGGHAGLFGAGGTGGTGGIGGQNTETGPAASNGGAGGAGGG 302
G G G N+ S G GG GL GG G +N G + +G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNIN-GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 303 GGYLVGDGGAGGTGGAGGKNSSGGATLTGGTGGTGGAGGAAGWLYGSGGAGGAGGAG 359
G G+G +GG G GG S+ A + G G + S GA A A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 32.4 bits (73), Expect = 0.006
Identities = 31/92 (33%), Positives = 37/92 (40%), Gaps = 5/92 (5%)

Query: 389 GAAGAGGNGGNNTSAGT--GGVGASGGTGGNAGLIGAGGHGGAGGAGGNQTGGVGNGGAG 446
G G G N G ++++G GG G GG + G G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 447 GNGGAGGAGGQLYGNGGDGGNGGAGGANIAGG 478
GNGG G G G G GGN A A +A G
Sbjct: 63 GNGGGNGNSG---GGSGTGGNLSAVAAPVAFG 91



Score = 32.4 bits (73), Expect = 0.008
Identities = 28/115 (24%), Positives = 42/115 (36%), Gaps = 5/115 (4%)

Query: 315 TGGAGGKNSSGGATLTGGTGGTGGAGGAAGWLYGSGGAGGAGGAGGLNNAGGATGGTGGT 374
+GG G +++G + +G G G G G + G+G + N GG +G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGG-----GASDGSGWSSENNPWGGGSGSGIHW 56

Query: 375 GGAGGSGAWLYGNGGAAGAGGNGGNNTSAGTGGVGASGGTGGNAGLIGAGGHGGA 429
GG G G G+G G + A G + AG + GA
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 32.0 bits (72), Expect = 0.009
Identities = 28/75 (37%), Positives = 30/75 (40%), Gaps = 1/75 (1%)

Query: 152 GGNGGSAGLIGNGGAGGGGGAGAAGGLGGNGGWLYGNGGAGGIGGAGTGTGGHGGAGGAG 211
G N G+ GN GG G G GG GW N GG G+G GG G G G
Sbjct: 8 GHNTGAHSTSGNIN-GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGG 66

Query: 212 GRAWLWGTGGAGGAG 226
G G G GG
Sbjct: 67 GNGNSGGGSGTGGNL 81



Score = 32.0 bits (72), Expect = 0.009
Identities = 34/113 (30%), Positives = 42/113 (37%), Gaps = 2/113 (1%)

Query: 413 GTGGNAGLIGAGGH--GGAGGAGGNQTGGVGNGGAGGNGGAGGAGGQLYGNGGDGGNGGA 470
G G N G G+ GG G G G+G + N GG G GG G+G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 471 GGANIAGGNGSDGGAAGHGGAGGSARLIGAGGHGGDGGAGGNTAGRRADAIAG 523
GG +GG GG A + G G A +AG + AIA
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 31.6 bits (71), Expect = 0.013
Identities = 30/86 (34%), Positives = 37/86 (43%), Gaps = 2/86 (2%)

Query: 218 GTGGAGGAGGDGGWLFG--DGGAGGTGGNGGSGFNSLTSSVGGAGGAGGHAGLFGAGGTG 275
G G GA G + G G G G + GSG++S + GG G+G H G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 276 GTGGIGGQNTETGPAASNGGAGGAGG 301
G G G + TG S A A G
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 31.2 bits (70), Expect = 0.016
Identities = 29/88 (32%), Positives = 37/88 (42%), Gaps = 8/88 (9%)

Query: 639 GATGGNGGSGIGPASVGGNGGKGGVGAAGGLAGQIGNGGSGGSGGAGGNGGTGDTAGNGG 698
G G +G S NGG G+G G G S GSG + N G +G+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGG--------GASDGSGWSSENNPWGGGSGSGI 54

Query: 699 NGGAGAVGGNAQLIGNGGNGGGGGNGGT 726
+ G G+ GN GN G G G G +
Sbjct: 55 HWGGGSGHGNGGGNGNSGGGSGTGGNLS 82



Score = 30.8 bits (69), Expect = 0.020
Identities = 38/131 (29%), Positives = 43/131 (32%), Gaps = 31/131 (23%)

Query: 441 GNGGAGGNGGAGGAGGQLYGNGGDGGNGGAGGANIAGGNGSDGGAAGHGGAGGSARLIGA 500
G G G N GA G + NGG G G GGA+ G S+ G
Sbjct: 3 GGDGRGHNTGAHSTSGNI--NGGPTGLGVGGGASDGSGWSSENNPWG------------- 47

Query: 501 GGHGGDGGAGGNTAGRRADAIAGTGGDGGNGGNGGLLSGNAGAGGHGGAGGSSTATTTTG 560
GG G GG + G GG GN G G G GG + A G
Sbjct: 48 GGSGSGIHWGGGSGH-------GNGGGNGNSGGG---------SGTGGNLSAVAAPVAFG 91

Query: 561 TPPTGATGGNG 571
P G G
Sbjct: 92 FPALSTPGAGG 102



Score = 29.7 bits (66), Expect = 0.046
Identities = 23/78 (29%), Positives = 31/78 (39%)

Query: 137 GNGGNGFTQTTAGVAGGNGGSAGLIGNGGAGGGGGAGAAGGLGGNGGWLYGNGGAGGIGG 196
G G N +T+G G G+ G G G + GG+G ++ GG+G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 197 AGTGTGGHGGAGGAGGRA 214
G G G G G A
Sbjct: 66 GGNGNSGGGSGTGGNLSA 83


49  Rv3502c  Rv3514  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv3502c   6       21       6.127840   Probable short-chain type
Rv3503c   12      28       9.942314   Probable ferredoxin FdxD
Rv3504    11      25       9.349016   Probable acyl-CoA dehydrogenase FadE26
Rv3505    11      26       9.486230   Probable acyl-CoA dehydrogenase FadE27
Rv3506    13      28       10.189949  Fatty-acid-CoA synthetase FadD17 (fatty-acid-CoA
Rv3507    17      32       11.772814  PE-PGRS family protein PE_PGRS53
Rv3508    14      29       10.986745  PE-PGRS family protein PE_PGRS54
Rv3509c   14      30       10.655543  Probable acetohydroxyacid synthase IlvX
Rv3510c   12      31       10.087774  Conserved protein
Rv3511    12      32       10.290023  PE-PGRS family protein PE_PGRS55
Rv3512    9       31       9.493548   PE-PGRS family protein PE_PGRS56
Rv3513c   4       25       6.507313   Probable fatty-acid-CoA ligase FadD18 (fragment)
Rv3514    4       25       6.485584   PE-PGRS family protein PE_PGRS57
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv3502c   DHBDHDRGNASE  81     8e-20    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 80.9 bits (199), Expect = 8e-20
Identities = 73/267 (27%), Positives = 114/267 (42%), Gaps = 28/267 (10%)

Query: 14 NTTDLSGKVAVVTGAAAGLGRAEALGLARLGATVVVNDVASALDASDVVDEIGAAAADAG 73
N + GK+A +TGAA G+G A A LA GA + D + ++++ ++
Sbjct: 2 NAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNP-----EKLEKVVSSLKAEA 56

Query: 74 AKAVAVAGDISQRATADELLAS-AVGLGGLDIVVNNAGITRDRMLFNMSDEEWDAVIAVH 132
A A D+ A DE+ A +G +DI+VN AG+ R ++ ++SDEEW+A +V+
Sbjct: 57 RHAEAFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVN 116

Query: 133 LRGHFLLTRNAAAYWRDKAKDAEGGSVFGRLVNTSSEAGLVGPVGQANYAAAKAGITALT 192
G F +R+ + Y D+ G +V S V A YA++KA T
Sbjct: 117 STGVFNASRSVSKYMMDRRS--------GSIVTVGSNPAGVPRTSMAAYASSKAAAVMFT 168

Query: 193 LSAARALGRYGVCANVICP-RARTAMTADVFG---AAPDVEAGQID------PLS----P 238
L Y + N++ P T M ++ A V G ++ PL P
Sbjct: 169 KCLGLELAEYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKP 228

Query: 239 QHVVSLVQFLASPAAAEVNGQVFIVYG 265
+ V FL S A + V G
Sbjct: 229 SDIADAVLFLVSGQAGHITMHNLCVDG 255


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv3507    cloacin  39     1e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.9 bits (90), Expect = 1e-04
Identities = 35/103 (33%), Positives = 43/103 (41%)

Query: 1172 NGGNGGNGNDGAVNAGTGGSGGNGGNAGGGGANGGDGGAGGAGGAGGRGGKGIDGGFGGD 1231
+GG+G N GA + +GG G GGGA+ G G + GG G GI G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1232 GGNGGSNNGTGAGGNGGNGGTGGVGSVGAAGGDGGNGGTGGFA 1274
GNGG N +G G G + V G GG A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 37.8 bits (87), Expect = 3e-04
Identities = 30/79 (37%), Positives = 35/79 (44%)

Query: 1154 GKGGDGGNIAAGATGTAGNGGNGGNGNDGAVNAGTGGSGGNGGNAGGGGANGGDGGAGGA 1213
G G G N A +T NGG G G G + G+G S N GG G+ GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1214 GGAGGRGGKGIDGGFGGDG 1232
G GG G G G GG+
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 37.0 bits (85), Expect = 5e-04
Identities = 34/99 (34%), Positives = 37/99 (37%), Gaps = 1/99 (1%)

Query: 837 GNAGDGGNGGNAGAGGNGGSGDFGGNTTSGAS-GSGGNGGNAGTAGSGGAGGTGGTGLSG 895
G G G N G GN G G GAS GSG + N G G+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 896 GNGGNGGNGGNGGDGGNGAHGTVGAQFVPATSLPTPNGG 934
GNGG GN G G G +L TP G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAG 101



Score = 37.0 bits (85), Expect = 5e-04
Identities = 35/109 (32%), Positives = 47/109 (43%), Gaps = 1/109 (0%)

Query: 518 TGGNGGNGTGGVNGADNTLNPDTPGGAGEPGGAGGAGGAGGAAGGPGGTGGTGGNGGNGG 577
+GG+G G + +N G G + G+G + GG+G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 578 NGGNGGNGGNGGNGGNAGNNSTNA-PVGGEGGAGGDGGAGGAGGAANGG 625
+G GGNG +GG G GN S A PV A GAGG + + G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 36.2 bits (83), Expect = 0.001
Identities = 28/76 (36%), Positives = 33/76 (43%)

Query: 1148 GNGSAGGKGGDGGNIAAGATGTAGNGGNGGNGNDGAVNAGTGGSGGNGGNAGGGGANGGD 1207
G G G GNI G TG GG + N GG G+G + GGG +G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 1208 GGAGGAGGAGGRGGKG 1223
GG G +GG G GG
Sbjct: 66 GGNGNSGGGSGTGGNL 81



Score = 36.2 bits (83), Expect = 0.001
Identities = 32/81 (39%), Positives = 37/81 (45%), Gaps = 2/81 (2%)

Query: 436 GKGGNGGIGGAAVTGGVAGDGGTGGKGGTGGAGGAGNDAGSTGNPGGKGGDGGIGGAGGA 495
G G G GA T G G TG G G + G+G S NP G G GI GG+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSG--WSSENNPWGGGSGSGIHWGGGS 60

Query: 496 GGAAGTGNGGHAGNTGDGGDG 516
G G GNG G +G GG+
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNL 81



Score = 35.8 bits (82), Expect = 0.001
Identities = 29/81 (35%), Positives = 38/81 (46%), Gaps = 1/81 (1%)

Query: 1167 TGTAGNGGNGGNGN-DGAVNAGTGGSGGNGGNAGGGGANGGDGGAGGAGGAGGRGGKGID 1225
+G G G N G + G +N G G G GG + G G + + GG G+G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1226 GGFGGDGGNGGSNNGTGAGGN 1246
G GG GN G +GTG +
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLS 82



Score = 35.1 bits (80), Expect = 0.002
Identities = 32/96 (33%), Positives = 41/96 (42%), Gaps = 3/96 (3%)

Query: 1061 GGEGGVGSILGGPGGNGGTGGNASATGTNGVANAGNGGKGGDGGQFGAGGNGGAGGSVTD 1120
G G+I GGP G G GG + +G + N GG G G G+G GG +
Sbjct: 12 GAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG---N 68

Query: 1121 GSAGSTAGNGGNGGNATNGTIAGQPAGGNGSAGGKG 1156
G++G +G GGN G PA AGG
Sbjct: 69 GNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 35.1 bits (80), Expect = 0.002
Identities = 25/84 (29%), Positives = 33/84 (39%)

Query: 1263 GDGGNGGTGGFAGFGGTAGNGGSGGTGGAGGDGGTGGDGGNGVIAGGGGTGGNGGASGAG 1322
G G G G G G +G G G G+G N GG G+G + G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1323 GAGGTGGFAGNGNAGGNGGTGGAS 1346
G GG G +G G+ G + A+
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 34.3 bits (78), Expect = 0.004
Identities = 35/105 (33%), Positives = 44/105 (41%), Gaps = 5/105 (4%)

Query: 1233 GNGGSNNGTGAGGNGGNGGTGGVGSVGAAGGDGGNGGTGGFAGFGGTAGNGGSGGTGGAG 1292
G G + TGA GN G G G G+G + +GG +G+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1293 GDGGTGGDGGNGVIAGGGGTGGNGGASGAGGAGGTGGFAGNGNAG 1337
G+GG G+ G G GTGGN A A A G + G G
Sbjct: 63 GNGGGNGNSGG-----GSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 34.3 bits (78), Expect = 0.004
Identities = 30/85 (35%), Positives = 38/85 (44%), Gaps = 4/85 (4%)

Query: 1184 VNAGTGGSGGNGGNAGGGGANGGDGGAGGAGGAGGRGGKGIDGGFGGDGGNGGSNNGTGA 1243
++ G G G ++ G NGG G G GGA G + G GGS +G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWG----GGSGSGIHW 56

Query: 1244 GGNGGNGGTGGVGSVGAAGGDGGNG 1268
GG G+G GG G+ G G GGN
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGTGGNL 81



Score = 33.9 bits (77), Expect = 0.004
Identities = 32/102 (31%), Positives = 36/102 (35%), Gaps = 1/102 (0%)

Query: 580 GNGGNGGNGGNGGNAGNNSTNAPVGGEGGAGGDGGAGGAGGAANGGTAGSQ-GTGGVGGD 638
G G G N G +GN + G GG DG + GG +GS GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 639 GGAGGNGGGGKAGTGNSGNFGVDGEAGFSGGAGGNGGVGGAA 680
G GGNG G V F A G GG A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 33.9 bits (77), Expect = 0.005
Identities = 32/103 (31%), Positives = 41/103 (39%)

Query: 641 AGGNGGGGKAGTGNSGNFGVDGEAGFSGGAGGNGGVGGAAGANGGTGGSGGNGGDGGAGG 700
+GG+G G G ++ G G G G + G G ++ N GGSG GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 701 IGGAGGNGIPGTGTEPAGGTGAKGGDGGDGGAGGAGGNAGGAG 743
G GGNG G G+ G A G + AGG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 33.5 bits (76), Expect = 0.006
Identities = 27/78 (34%), Positives = 33/78 (42%)

Query: 206 GGTGGNGGNGALLIGGGGLGGAGGMGGTGGGTGGTGGNGGNGALLIGAGGVGGAGGIGGQ 265
GG G GA G GG G+G GG + G+G + N G+G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 266 GTGAGGAAGAGGTGGNGG 283
G G G GG+G G
Sbjct: 63 GNGGGNGNSGGGSGTGGN 80



Score = 33.5 bits (76), Expect = 0.006
Identities = 27/86 (31%), Positives = 36/86 (41%), Gaps = 5/86 (5%)

Query: 542 GGAGEPGGAGGAGGAGGAAGGPGGTGGTGGNGGNGGNGGNGGNGGNGGNGGNAGNNSTNA 601
GG G G +G GGP G G GG + G+G + N GG +G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGG-----ASDGSGWSSENNPWGGGSGSGIHWG 57

Query: 602 PVGGEGGAGGDGGAGGAGGAANGGTA 627
G G GG+G +GG G +A
Sbjct: 58 GGSGHGNGGGNGNSGGGSGTGGNLSA 83



Score = 33.1 bits (75), Expect = 0.008
Identities = 32/110 (29%), Positives = 40/110 (36%)

Query: 164 NGGAGGAGGSGGAAGGNGGNGGWLFGAGGTGGIGGTGAPGAMGGTGGNGGNGALLIGGGG 223
+GG G +G + NGG G G G+G GG G+G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 224 LGGAGGMGGTGGGTGGTGGNGGNGALLIGAGGVGGAGGIGGQGTGAGGAA 273
G GG G +GGG+G G A + G GG A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 32.8 bits (74), Expect = 0.011
Identities = 28/85 (32%), Positives = 35/85 (41%), Gaps = 2/85 (2%)

Query: 1291 AGGDGGTGGDGGNGVIAGGGGTGGNGGASGAGGAGGTGGFAGNGNAGGNGGTGGASEDGD 1350
+GGDG G G GG G GGA G++ N G G G G
Sbjct: 2 SGGDGR--GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 1351 NGNAGSGATGGTGGNGGTGGDGGAA 1375
+G+ G G +GG GTGG+ A
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAV 84



Score = 32.8 bits (74), Expect = 0.012
Identities = 28/83 (33%), Positives = 31/83 (37%)

Query: 1299 GDGGNGVIAGGGGTGGNGGASGAGGAGGTGGFAGNGNAGGNGGTGGASEDGDNGNAGSGA 1358
G G G G T GN G G G G+G + N GG S G + GSG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1359 TGGTGGNGGTGGDGGAAGLGGVA 1381
G G GG G L VA
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVA 85



Score = 32.4 bits (73), Expect = 0.013
Identities = 31/111 (27%), Positives = 46/111 (41%), Gaps = 7/111 (6%)

Query: 784 GKGGQGGSGGTGGSGAPIGGGAGGTGGSGGHAGKGGAGGIGAQGTTITVPGNGGNAGDGG 843
G G+G + G + I GG G G GG G+ ++ P GG+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGG-------ASDGSGWSSENNPWGGGSGSGIH 55

Query: 844 NGGNAGAGGNGGSGDFGGNTTSGASGSGGNGGNAGTAGSGGAGGTGGTGLS 894
GG +G G GG+G+ GG + +G + S A + G GG +S
Sbjct: 56 WGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVS 106



Score = 32.4 bits (73), Expect = 0.015
Identities = 28/86 (32%), Positives = 35/86 (40%), Gaps = 6/86 (6%)

Query: 1133 GGNATNGTIAGQPAGGNGSAGGKGGDGGNIAAGATGTAGNGGNGGNGNDGAVNAGTGGSG 1192
G ++T+G I G P G G G G + G G G G G+G
Sbjct: 12 GAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG------HGNG 65

Query: 1193 GNGGNAGGGGANGGDGGAGGAGGAGG 1218
G GN+GGG GG+ A A A G
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 32.4 bits (73), Expect = 0.016
Identities = 30/95 (31%), Positives = 36/95 (37%), Gaps = 1/95 (1%)

Query: 1287 GTGGAGGDGGTGGDGGNGVIAGGGGTGGNGGASGAGGAGGTGGFAGNGNAGGNGGTGGAS 1346
G G G + G GN + G G G GGAS G G G+ G GG+
Sbjct: 3 GGDGRGHNTGAHSTSGN-INGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1347 EDGDNGNAGSGATGGTGGNGGTGGDGGAAGLGGVA 1381
GN SG GTGGN A G ++
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALS 96



Score = 32.0 bits (72), Expect = 0.019
Identities = 27/83 (32%), Positives = 33/83 (39%)

Query: 551 GGAGGAGGAAGGPGGTGGTGGNGGNGGNGGNGGNGGNGGNGGNAGNNSTNAPVGGEGGAG 610
GG G G G G+G + N GG G+G + G G GN G N + G GG
Sbjct: 22 GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNL 81

Query: 611 GDGGAGGAGGAANGGTAGSQGTG 633
A A G T G+ G
Sbjct: 82 SAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.0 bits (72), Expect = 0.020
Identities = 32/96 (33%), Positives = 36/96 (37%), Gaps = 2/96 (2%)

Query: 733 GGAGGNAGGAGGQGGNAGQGGAGGAGGNAVIPGDGVGKAPHGDAGGSGGDGGKGGQGGSG 792
G G+ GA GN G G G G G + GGSG GG G G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 793 GTGGSGAPIGGGAGGTGGSGGHAGKGGAGGIGAQGT 828
GG+G GG GTGG+ A G A T
Sbjct: 64 NGGGNGN--SGGGSGTGGNLSAVAAPVAFGFPALST 97



Score = 32.0 bits (72), Expect = 0.020
Identities = 28/79 (35%), Positives = 38/79 (48%)

Query: 1223 GIDGGFGGDGGNGGSNNGTGAGGNGGNGGTGGVGSVGAAGGDGGNGGTGGFAGFGGTAGN 1282
G DG G + S N G G GG GS ++ + GG+G +GG +G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1283 GGSGGTGGAGGDGGTGGDG 1301
G GG G +GG GTGG+
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 32.0 bits (72), Expect = 0.021
Identities = 31/106 (29%), Positives = 37/106 (34%), Gaps = 5/106 (4%)

Query: 618 AGGAANGGTAGSQGTGGV--GGDGGAGGNGGGGKAGTGNSGNFGVDGEAGFSGGAGGNGG 675
+GG G G+ T G GG G G GG +S N + G SG GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSEN---NPWGGGSGSGIHWGG 58

Query: 676 VGGAAGANGGTGGSGGNGGDGGAGGIGGAGGNGIPGTGTEPAGGTG 721
G G GG+G G + G P T AGG
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.0 bits (72), Expect = 0.021
Identities = 30/86 (34%), Positives = 40/86 (46%), Gaps = 6/86 (6%)

Query: 121 GANATTPGGNGGDGGWLFGSGGNGAPGAAGQSGGNGGSAGLWGNGGAGGAGGSGGAAGGN 180
GA++T+ NGG G G G + G + ++ GG +G G GGSG GG
Sbjct: 12 GAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSG----SGIHWGGGSGHGNGGG 67

Query: 181 GGNGGWLFGAGGTGGIGGTGAPGAMG 206
GN G G+G G + AP A G
Sbjct: 68 NGNSGG--GSGTGGNLSAVAAPVAFG 91



Score = 31.6 bits (71), Expect = 0.023
Identities = 29/110 (26%), Positives = 36/110 (32%)

Query: 667 SGGAGGNGGVGGAAGANGGTGGSGGNGGDGGAGGIGGAGGNGIPGTGTEPAGGTGAKGGD 726
SGG G G + + GG G G GGA G P G +G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 727 GGDGGAGGAGGNAGGAGGQGGNAGQGGAGGAGGNAVIPGDGVGKAPHGDA 776
G+GG G G G GG A G + G+ + A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 31.2 bits (70), Expect = 0.036
Identities = 28/105 (26%), Positives = 36/105 (34%), Gaps = 6/105 (5%)

Query: 677 GGAAGANGGTGGSGGNGGDGGAGGIGGAGGNGIPGTGTEPAGGTGAKGGDGGDGGAGGAG 736
G G N G + GN G G G G + G +E GG G G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSE------NNPWGGGSGSGIHWG 57

Query: 737 GNAGGAGGQGGNAGQGGAGGAGGNAVIPGDGVGKAPHGDAGGSGG 781
G +G G G GG+G G + + P G+GG
Sbjct: 58 GGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 31.2 bits (70), Expect = 0.037
Identities = 31/77 (40%), Positives = 36/77 (46%), Gaps = 7/77 (9%)

Query: 565 GTGGTGGNGGNGG-----NGGNGGNGGNGGNGGNAGNNSTNAPVGGEGGAGGD--GGAGG 617
G G G N G NGG G G GG +G +S N P GG G+G GG+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 618 AGGAANGGTAGSQGTGG 634
G NG + G GTGG
Sbjct: 63 GNGGGNGNSGGGSGTGG 79



Score = 30.8 bits (69), Expect = 0.046
Identities = 32/102 (31%), Positives = 41/102 (40%), Gaps = 2/102 (1%)

Query: 314 GTGGKGGQGGDGGTGGAGGAGPVLFGHGGAGGMGGQGGTGGMGGAGGDGTTVIAAGTGGE 373
G G+G G T G GP G GG G + GG G+ + G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 374 GGTGGAAGAGGAAGARGALTS--GGLAGGVGAGGTGGTGGTG 413
G GG +GG +G G L++ +A G A T G GG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv3508    cloacin  38     3e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.2 bits (88), Expect = 3e-04
Identities = 36/107 (33%), Positives = 41/107 (38%), Gaps = 4/107 (3%)

Query: 1782 GGAGGLGGGGGTGGTNGNGGLGGGGGNGGAGGAGGTPTGSGTEGTGGDGGDAGAGGNGGS 1841
G G G + N NGG G G GGA G + + G G G GG+G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 1842 ATGVGNGGNGGDGGNGGDGGNGAPGGFGGGA----GAGGLGGSGAGG 1884
G GG G G AP FG A GAGGL S + G
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 37.8 bits (87), Expect = 5e-04
Identities = 33/79 (41%), Positives = 39/79 (49%), Gaps = 3/79 (3%)

Query: 923 GQGGQGGLGGASTTSINANGGAGGNGGTGGKGGAGGAGTLGV---GGSGGTGGDGGDAGS 979
G G+G GA +TS N NGG G G GG G + GGSG GG +G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 980 GGGGGFGGAAGKAGGGGNG 998
G GGG G + G +G GGN
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 37.8 bits (87), Expect = 5e-04
Identities = 33/79 (41%), Positives = 39/79 (49%), Gaps = 3/79 (3%)

Query: 1124 GQGGQGGLGGASTTSINANGGAGGNGGTGGKGGAGGAGTLGV---GGSGGTGGDGGDAGS 1180
G G+G GA +TS N NGG G G GG G + GGSG GG +G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1181 GGGGGFGGAAGKAGGGGNG 1199
G GGG G + G +G GGN
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 37.4 bits (86), Expect = 7e-04
Identities = 31/79 (39%), Positives = 40/79 (50%), Gaps = 2/79 (2%)

Query: 538 GAGGAGGNTGVGGTNGS--GGQGGTGGAGGAGGAGGVGADNPTGIGGTGGTGGKGGAGGA 595
G G G NTG T+G+ GG G G GGA G ++N GG+G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 596 GGQGGSSGAGGTNGSGGAG 614
G GG+ +GG +G+GG
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 36.6 bits (84), Expect = 9e-04
Identities = 38/119 (31%), Positives = 46/119 (38%), Gaps = 5/119 (4%)

Query: 1094 GAAGNGGNGGAGGAGGAGDNNFNGGQGGAGGQGGQGGLGGASTTSINANGGAGGNGGTGG 1153
G G G N GA G N NGG G G GG G S+ + GG+G GG
Sbjct: 3 GGDGRGHNTGAHSTSG----NINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 1154 KGGAGGAGTLGVGGSGGTGGDGGDAGSGGGGGFGGAAGKAGGGGNGGVGGDGGEGASGL 1212
G G G G G GG+G G + FG A G G V G ++ +
Sbjct: 59 GSGHGNGGGNGNSG-GGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAI 116



Score = 35.1 bits (80), Expect = 0.003
Identities = 36/104 (34%), Positives = 41/104 (39%), Gaps = 5/104 (4%)

Query: 893 GAAGNGGNGGAGGAGGAGDNNFNGGQGGAGGQGGQGGLGGASTTSINANGGAGGNGGTGG 952
G G G N GA G N NGG G G GG G S+ + GG+G GG
Sbjct: 3 GGDGRGHNTGAHSTSG----NINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 953 KGGAGGAGTLGVGGSGGTGGDGGDAGSGGGGGFGGAAGKAGGGG 996
G G G G G GG+G G + FG A G G
Sbjct: 59 GSGHGNGGGNGNSG-GGSGTGGNLSAVAAPVAFGFPALSTPGAG 101



Score = 34.7 bits (79), Expect = 0.004
Identities = 32/108 (29%), Positives = 40/108 (37%), Gaps = 1/108 (0%)

Query: 709 GAGGEGGAGGNSGVGGTNGSGGAGGAGGKGGTGGAGGSGADNPTGAGFAGGAGGTGGAAG 768
G G G G G G G G G + G+G S +NP G G G GG+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 769 AGGAGGATGTGGTGGVVGATGSAGIGGAGGRGGDGGDGASGLGLGLSG 816
G G GG+G G + A G GA GL + +S
Sbjct: 63 GNGGGNGNSGGGSGT-GGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109



Score = 34.7 bits (79), Expect = 0.004
Identities = 30/83 (36%), Positives = 35/83 (42%)

Query: 1410 GGAGGRGGDGGAGDSGALSQDGNGFAGGQGGQGGVGGNAGAGGINGAGGTGGTGGAGGDG 1469
GG G G GGA D S + N + GG G GG +G G G G +GG G GG+
Sbjct: 22 GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNL 81

Query: 1470 QNGTTGVASEGGAGGQGGDGGQG 1492
VA A G GG
Sbjct: 82 SAVAAPVAFGFPALSTPGAGGLA 104



Score = 34.7 bits (79), Expect = 0.004
Identities = 30/83 (36%), Positives = 35/83 (42%)

Query: 1615 GGAGGRGGDGGAGDSGALSQDGNGFAGGQGGQGGVGGNAGAGGINGAGGTGGTGGAGGDG 1674
GG G G GGA D S + N + GG G GG +G G G G +GG G GG+
Sbjct: 22 GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNL 81

Query: 1675 QNGTTGVASEGGAGGQGGDGGQG 1697
VA A G GG
Sbjct: 82 SAVAAPVAFGFPALSTPGAGGLA 104



Score = 34.7 bits (79), Expect = 0.004
Identities = 31/97 (31%), Positives = 37/97 (38%)

Query: 126 GGAGGAGGLLFGNGGAGAAGAPGQAGGPGGPAGLWGNGGPGGAGGSGGGTGGAGGAGGWL 185
GG G G + G+G + GG G WG G G GG G +GG G GG L
Sbjct: 22 GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNL 81

Query: 186 FGVGGAGGVGGAGGGTGGAGGPGGLIWGGGGAGGVGG 222
V G T GAGG I G + +
Sbjct: 82 SAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 33.9 bits (77), Expect = 0.006
Identities = 34/104 (32%), Positives = 38/104 (36%), Gaps = 1/104 (0%)

Query: 1171 TGGDGGDAGSGGGGGFGGAAGKAGGGGNGGVGGDGGEGASGLGLGLSGFDGGQGGQGGAG 1230
+GGDG +G G G G G GG G G G S G G GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGG-GASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 1231 GSAGAGGINGAGGAGGTGGAGGDGAPATLIGGPDGGDGGQGGIG 1274
G GG +GG GTGG A G P G GG+
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 33.9 bits (77), Expect = 0.006
Identities = 35/102 (34%), Positives = 39/102 (38%)

Query: 759 GAGGTGGAAGAGGAGGATGTGGTGGVVGATGSAGIGGAGGRGGDGGDGASGLGLGLSGFD 818
G G G GA G G TG VG S G G + GG SG+ G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 819 GGQGGQGGAGGSAGAGGINGAGGAGGNGGDGGDGATGAAGLG 860
G GG G +GG +G GG A A G GA GL
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 33.5 bits (76), Expect = 0.008
Identities = 33/101 (32%), Positives = 39/101 (38%)

Query: 1435 AGGQGGQGGVGGNAGAGGINGAGGTGGTGGAGGDGQNGTTGVASEGGAGGQGGDGGQGGI 1494
+GG G G ++ +G ING G GG DG ++ GG G G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1495 GGAGGNAGFGAGVPGDGGIGGTGGAGGAGGAGADGDPSIDG 1535
G GG G G G GG A A G A P G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 33.5 bits (76), Expect = 0.008
Identities = 33/101 (32%), Positives = 39/101 (38%)

Query: 1640 AGGQGGQGGVGGNAGAGGINGAGGTGGTGGAGGDGQNGTTGVASEGGAGGQGGDGGQGGI 1699
+GG G G ++ +G ING G GG DG ++ GG G G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1700 GGAGGNAGFGAGVPGDGGIGGTGGAGGAGGAGADGDPSIDG 1740
G GG G G G GG A A G A P G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 33.5 bits (76), Expect = 0.009
Identities = 35/113 (30%), Positives = 50/113 (44%), Gaps = 2/113 (1%)

Query: 594 GAGGQGGSSGAGGTNGS--GGAGGTGGQGGAGGAGGAGADNPTGIGGAGGTGGTGGAAGA 651
G G+G ++GA T+G+ GG G G GGA G ++N GG+G GG +G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 652 GGAGGAIGTGGTGGAVGSVGNAGIGGTGGTGGVGGAGGAGAAAAAGSSATGGA 704
G GG +GG G G++ G + G G A + + A A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 33.1 bits (75), Expect = 0.012
Identities = 25/85 (29%), Positives = 31/85 (36%)

Query: 1730 AGADGDPSIDGGQGGAGGHGGQGGKGGLNSTGLASAASGDGGNGGAGGAGGNGGAGGLGG 1789
+G DG G +G G G+ + N GG+G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1790 GGGTGGTNGNGGLGGGGGNGGAGGA 1814
G GG +GG G GGN A A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 33.1 bits (75), Expect = 0.012
Identities = 29/88 (32%), Positives = 34/88 (38%), Gaps = 4/88 (4%)

Query: 656 GAIGTGGTGGAVGSVGNAGIGGTGGTGGVGGAGGAGAAAAAGSSATGGAGFAGGAGGEGG 715
G G G GA + GN GG G+G GGA + S G +G GG
Sbjct: 3 GGDGRGHNTGAHSTSGNI----NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 716 AGGNSGVGGTNGSGGAGGAGGKGGTGGA 743
G+ GG SGG G GG A
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 33.1 bits (75), Expect = 0.013
Identities = 32/82 (39%), Positives = 38/82 (46%), Gaps = 2/82 (2%)

Query: 1821 SGTEGTGGDGGDAGAGGN-GGSATGVGNGGNGGDGGNGGDGGNGAPGGFGGGAGAGGLGG 1879
SG +G G + G GN G TG+G GG DG N GG G G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1880 SGAGGGTDGDDGNGGSPGTDGS 1901
G GGG +G+ G G G + S
Sbjct: 62 HGNGGG-NGNSGGGSGTGGNLS 82



Score = 32.8 bits (74), Expect = 0.015
Identities = 27/80 (33%), Positives = 32/80 (40%)

Query: 1807 GNGGAGGAGGTPTGSGTEGTGGDGGDAGAGGNGGSATGVGNGGNGGDGGNGGDGGNGAPG 1866
G G G G + SG G G G G + GS N GG G+G G G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1867 GFGGGAGAGGLGGSGAGGGT 1886
G GGG G G G G +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLS 82



Score = 32.8 bits (74), Expect = 0.016
Identities = 37/106 (34%), Positives = 43/106 (40%), Gaps = 4/106 (3%)

Query: 569 AGGVGADNPTGIGGTGGTGGKGGAGGAGGQGGSSGAG--GTNGSGGAGGTGGQGGAGGAG 626
+GG G + TG T G G G G G S G+G N G G G GG+G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 627 GAGADNPTGIGGAGGTGGTGGAAGAGGAGG--AIGTGGTGGAVGSV 670
GG GTGG A A A G A+ T G GG S+
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSI 107



Score = 32.4 bits (73), Expect = 0.018
Identities = 25/86 (29%), Positives = 33/86 (38%)

Query: 1738 IDGGQGGAGGHGGQGGKGGLNSTGLASAASGDGGNGGAGGAGGNGGAGGLGGGGGTGGTN 1797
+ GG G G G +N G +G + N GG G G GG +
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 1798 GNGGLGGGGGNGGAGGAGGTPTGSGT 1823
G+G GG G +GG G GG +
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 32.0 bits (72), Expect = 0.024
Identities = 36/117 (30%), Positives = 46/117 (39%), Gaps = 2/117 (1%)

Query: 556 GQGGTGGAGGAGGAGGVGADNPTGIGGTGG-TGGKGGAGGAGGQGGSSGAGGTNGSGGAG 614
G G G GA G PTG+G GG + G G + GG SG+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 615 GTGGQGGAGGAGGAGADNPTGIGGAGGTGGTGGAA-GAGGAGGAIGTGGTGGAVGSV 670
G GG G G G N + + G + GAGG +I G A+ +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADI 119



Score = 32.0 bits (72), Expect = 0.025
Identities = 36/116 (31%), Positives = 46/116 (39%), Gaps = 3/116 (2%)

Query: 1416 GGDGGAGDSGALSQDGNGFAGGQGGQGGVGGNAGAGGINGAGGTGGTGGAGGDGQNGTTG 1475
GGDG ++GA S GN G G G G + G+G + GG G+G G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1476 VASEGGAGGQGGDGGQGGIGGAGGNAGFG---AGVPGDGGIGGTGGAGGAGGAGAD 1528
G GG G G + FG PG GG+ + AG A AD
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 32.0 bits (72), Expect = 0.025
Identities = 36/116 (31%), Positives = 46/116 (39%), Gaps = 3/116 (2%)

Query: 1621 GGDGGAGDSGALSQDGNGFAGGQGGQGGVGGNAGAGGINGAGGTGGTGGAGGDGQNGTTG 1680
GGDG ++GA S GN G G G G + G+G + GG G+G G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1681 VASEGGAGGQGGDGGQGGIGGAGGNAGFG---AGVPGDGGIGGTGGAGGAGGAGAD 1733
G GG G G + FG PG GG+ + AG A AD
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 32.0 bits (72), Expect = 0.029
Identities = 36/107 (33%), Positives = 45/107 (42%), Gaps = 4/107 (3%)

Query: 970 TGGDGGDAGSGGGGGFGGAAGKAGGGGNGGRGGDGGDGASGLGLGLSGFDGGQGGQGGAG 1029
+GGDG +G G G G G GG DG SG + + GG G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDG----SGWSSENNPWGGGSGSGIHWG 57

Query: 1030 GSAGAGGINGAGGAGGNGGDGGDGATGAAGLGDNGGVGGDGGAGGAA 1076
G +G G G G +GG G GG+ + AA + GAGG A
Sbjct: 58 GGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.0 bits (72), Expect = 0.030
Identities = 29/87 (33%), Positives = 34/87 (39%), Gaps = 3/87 (3%)

Query: 483 GAAGTGGTGGVVGAAGKAGIGGTGGQGGAGGAGSAGTDATATGATGGTGFSGGAGGAGGA 542
G G G G +G G TG G G + +G + GG+G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 543 GGNTGVGGTNGSGGQGGTGGAGGAGGA 569
G GG SGG GTGG A A
Sbjct: 63 GNG---GGNGNSGGGSGTGGNLSAVAA 86



Score = 31.6 bits (71), Expect = 0.039
Identities = 42/117 (35%), Positives = 50/117 (42%), Gaps = 5/117 (4%)

Query: 873 GAAGNGGNAGVGLTAKAGDGGAAGNGGNGGAG-GAGGAGDNNFNGGQGGAGGQGGQGGLG 931
G G G N G T+ +GG G G GGA G+G + +NN GG G+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 932 GASTTSINANGGAGGNGGTGGKGGAGGAGTLGVGGSGGTGGDGGDAGSGGGGGFGGA 988
G NG +GG GTGG A A + T G GG A S G A
Sbjct: 63 GNG----GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 31.6 bits (71), Expect = 0.039
Identities = 42/117 (35%), Positives = 50/117 (42%), Gaps = 5/117 (4%)

Query: 1074 GAAGNGGNAGVGLTAKAGDGGAAGNGGNGGAG-GAGGAGDNNFNGGQGGAGGQGGQGGLG 1132
G G G N G T+ +GG G G GGA G+G + +NN GG G+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1133 GASTTSINANGGAGGNGGTGGKGGAGGAGTLGVGGSGGTGGDGGDAGSGGGGGFGGA 1189
G NG +GG GTGG A A + T G GG A S G A
Sbjct: 63 GNG----GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Rv3511 | cloacin | 40 | 2e-05 | Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 40.1 bits (93), Expect = 2e-05
Identities = 31/89 (34%), Positives = 33/89 (37%)

Query: 270 GAGGTGGNAAWLLGGGGTGGAGGIGGGNGGHGGNGGWLLGNGGNGGLGGDGDGGTGGGHG 329
G G G N G G G GG GW N GG G G GG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 330 GNGGNPGWLLGTAGGGGNGGAGSTGTAGG 358
GNGG G G +G GGN A + A G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 37.0 bits (85), Expect = 2e-04
Identities = 31/79 (39%), Positives = 36/79 (45%), Gaps = 1/79 (1%)

Query: 237 GAGGIGGGSGGAGGNGGWLLGNGGIGGAGGTGGGAGGTGGNAAWLLGGG-GTGGAGGIGG 295
G G G +G +G G G+G GG G+G + N W G G G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 296 GNGGHGGNGGWLLGNGGNG 314
GNGG GN G G GGN
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 37.0 bits (85), Expect = 3e-04
Identities = 35/115 (30%), Positives = 46/115 (40%), Gaps = 2/115 (1%)

Query: 583 AGAGGTGFTQGADGNAGNGGDGGVGGNGGNGADNTTTAAAGTTGGAGGAGGAGGTGGAAG 642
+G G G GA +GN G G G GA + + ++ GG+G GG +G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 643 TGTGGQQGNGGNGGNGGTGGKGGTGGAGMNSLDPLLAAQDGGQGGTGGTGGNAGA 697
G GG GN G G GTGG A + P L+ G + G A
Sbjct: 62 HGNGGGNGNSGGG--SGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSA 114



Score = 36.6 bits (84), Expect = 3e-04
Identities = 28/73 (38%), Positives = 33/73 (45%), Gaps = 5/73 (6%)

Query: 233 GGHGGAGGIGGGSGGAGGNGGWLLGNGGIGGAGGTGGGAGGTGGNAAWLLGGGGTGGAGG 292
G H +G I GG G G GG + G G + GG+G W GGG G GG
Sbjct: 12 GAHSTSGNINGGPTGLGVGGG---ASDGSGWSSENNPWGGGSGSGIHW--GGGSGHGNGG 66

Query: 293 IGGGNGGHGGNGG 305
G +GG G GG
Sbjct: 67 GNGNSGGGSGTGG 79



Score = 36.2 bits (83), Expect = 4e-04
Identities = 28/84 (33%), Positives = 31/84 (36%), Gaps = 5/84 (5%)

Query: 244 GSGGAGGNGGWLLGNGGIGGAGGTGGGAGGTGGNAAWLLGGGGTGGAGGIGGGNGGHGGN 303
G G G N G +G I G G GG + W GG G G GG G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 304 GGWLLGNGGNGGLGGDGDGGTGGG 327
GNGG G G G G G
Sbjct: 63 -----GNGGGNGNSGGGSGTGGNL 81



Score = 35.8 bits (82), Expect = 6e-04
Identities = 35/106 (33%), Positives = 43/106 (40%), Gaps = 6/106 (5%)

Query: 378 GAGAGGHGGTGGAGGAGVNGGGAGGAGGAGGNGGAGGQAALLFGRGGTGGAGGYGGDGGG 437
GA + GG G GV GG + G+G + N GG G G GG G G G
Sbjct: 12 GAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGG------GSGSGIHWGGGSGHGNG 65

Query: 438 GGDGFDGTMAGLGGTGGSGGTGGDGGAPGNGGAGGAGQLLSHSGVA 483
GG+G G +G GG + G P G G +S S A
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 35.5 bits (81), Expect = 7e-04
Identities = 29/100 (29%), Positives = 35/100 (35%)

Query: 486 SGKGGAGGTGGNGGAGSAGADAPAGSGAMGSTGFAGGAGGDGGNGGGSGASQGNGGNGGN 545
S G G G G +D S G G+G G G G G GNG +GG
Sbjct: 15 STSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGG 74

Query: 546 GGTGGKGGTGGAGMNSLDPLLAAQDGGQGGTGGTGGNAGA 585
GTGG A + P L+ G + G A
Sbjct: 75 SGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSA 114



Score = 35.5 bits (81), Expect = 8e-04
Identities = 36/104 (34%), Positives = 43/104 (41%), Gaps = 3/104 (2%)

Query: 193 AGGIGGGTGGAGGHAWLFGHGGTGGIGGGPGGN--GGWLLGNGGHGGAGGIGGGSGGAGG 250
+GG G G +GG G+G G G + GW N GG G G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 251 NGGWLLGNGGIGGAGGTGGGAGGTGGNAAWLLGGGGTGGAGGIG 294
+G GNG GG GTGG A+ T GAGG+
Sbjct: 62 HGNG-GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 35.1 bits (80), Expect = 0.001
Identities = 30/81 (37%), Positives = 35/81 (43%), Gaps = 7/81 (8%)

Query: 289 GAGGIGGGNGGHGGNGGWLLGNGGNGGLGGDGDGGTGGG----HGGNGGNPGWLLGTAGG 344
G G G G H +G NGG GLG G G G + GG G + GG
Sbjct: 3 GGDGRGHNTGAHSTSGNI---NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 345 GGNGGAGSTGTAGGGSGGTGG 365
G+G G G +GGGSG G
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGN 80



Score = 33.9 bits (77), Expect = 0.002
Identities = 35/104 (33%), Positives = 42/104 (40%), Gaps = 9/104 (8%)

Query: 176 GNGGNAGWLYGRGGV-GGAGGIGGGTGGAGGHAWLFGH---GGTGGIGGGPGGNGGWLLG 231
G G N G G + GG G+G G G + G W + GG G G GG G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG---- 61

Query: 232 NGGHGGAGGIGGGSGGAGGNGGWLLGNGGIGGAGGTGGGAGGTG 275
G+GG G GG G GGN + G + GAGG
Sbjct: 62 -HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 33.9 bits (77), Expect = 0.003
Identities = 28/95 (29%), Positives = 35/95 (36%)

Query: 310 NGGNGGLGGDGDGGTGGGHGGNGGNPGWLLGTAGGGGNGGAGSTGTAGGGSGGTGGDGGT 369
N G G+ +GG G G G + G + GG+GS GGGSG G G
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69

Query: 370 GGRGGLLMGAGAGGHGGTGGAGGAGVNGGGAGGAG 404
GG G G ++ GAGG
Sbjct: 70 NSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 33.5 bits (76), Expect = 0.003
Identities = 28/85 (32%), Positives = 32/85 (37%), Gaps = 6/85 (7%)

Query: 490 GAGGTGGNGGAGSAGADAPAGSGAMGSTGFAGGAGGDG------GNGGGSGASQGNGGNG 543
G G G N GA S + G +G G A G G G GSG G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 544 GNGGTGGKGGTGGAGMNSLDPLLAA 568
GNGG G G G +L + A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAP 87



Score = 33.5 bits (76), Expect = 0.003
Identities = 33/134 (24%), Positives = 45/134 (33%), Gaps = 13/134 (9%)

Query: 376 LMGAGAGGHGGTGGAGGAGVNGGGAGGAGGAGGNGGAGGQAALLFGRGGTGGAGGYGGDG 435
+ G GH + +NGG G G G + G+G + GG+G +GG
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 436 GGGGDGFDGTMAGLGGTGGSGGTGGDGGAPGNGGAGGAGQLLSHSGVAGASGKGGAGGTG 495
G G GG G G G G + A + G GG +
Sbjct: 61 GHGN-------------GGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSI 107

Query: 496 GNGGAGSAGADAPA 509
G +A AD A
Sbjct: 108 SAGALSAAIADIMA 121



Score = 32.8 bits (74), Expect = 0.005
Identities = 34/114 (29%), Positives = 39/114 (34%), Gaps = 14/114 (12%)

Query: 527 GGNGGGSGASQGNGGNGGNGGTGGKGGTGGAGMNSLDPLLAAQDGGQGGTGGTGGNAGAG 586
GG+G G + NGG G G GGA G G + G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGA------------SDGSGWSSENNPWGGGS 50

Query: 587 GTGFTQGADGNAGNGGDGGVGGNGGNGADNTTTAAAGTTGG--AGGAGGAGGTG 638
G+G G GNGG G G G N + AA G A GAGG
Sbjct: 51 GSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.0 bits (72), Expect = 0.009
Identities = 32/106 (30%), Positives = 36/106 (33%), Gaps = 10/106 (9%)

Query: 421 GRGGTGGAGGYGGDGGGGGDGFDGTMAGLGGTGGSGGTGGDGGAPGNGGAGGAGQLLSHS 480
GRG GA G+ GG G G G + G G + N GG H
Sbjct: 6 GRGHNTGAHSTSGNINGGPTG---------LGVGGGASDGSGWSSENNPWGGGSGSGIHW 56

Query: 481 GVAGASGKGGAGGTGGNGGAGSAGADAPAGSGAMGSTGFAG-GAGG 525
G G GG G G G A A A G + GAGG
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 31.6 bits (71), Expect = 0.013
Identities = 33/101 (32%), Positives = 35/101 (34%), Gaps = 2/101 (1%)

Query: 219 GGGPGGNGGWLLGNGGHGGAGGIGGGSGGAGGNGGWLLGNGGIGGAGGTGGGAGGTGG-- 276
G G G N G +G G G GGA GW N GG G+G GG G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 277 NAAWLLGGGGTGGAGGIGGGNGGHGGNGGWLLGNGGNGGLG 317
N GG G GG G L G GGL
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 31.2 bits (70), Expect = 0.016
Identities = 27/96 (28%), Positives = 33/96 (34%), Gaps = 8/96 (8%)

Query: 352 STGTAGGGSGGTGGDGGTGGRGGLLMGAGAGGHGGTGGAGGAGVNGGGAGGAGGAGGNGG 411
S G G + G G G +G G G G+G + GGG+G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 412 AGGQAALLFGRGGTGGAGGYGGDGGGGGDGFDGTMA 447
G GG G G G GG +A
Sbjct: 62 HGN--------GGGNGNSGGGSGTGGNLSAVAAPVA 89



Score = 30.8 bits (69), Expect = 0.021
Identities = 22/77 (28%), Positives = 29/77 (37%)

Query: 572 GQGGTGGTGGNAGAGGTGFTQGADGNAGNGGDGGVGGNGGNGADNTTTAAAGTTGGAGGA 631
G+G G +G G T G + G G N G + + G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 632 GGAGGTGGAAGTGTGGQ 648
GG G +GG +GTG
Sbjct: 66 GGNGNSGGGSGTGGNLS 82



Score = 30.5 bits (68), Expect = 0.027
Identities = 27/101 (26%), Positives = 37/101 (36%), Gaps = 17/101 (16%)

Query: 494 TGGNGGAGSAGADAPAGSGAMGSTGFAGGAGGDGGNGGGSGASQGNGGNGGNGGTGGKGG 553
+GG+G + GA + +G+ G TG G G G+G S + GG+G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 554 TGGAGMNSLDPLLAAQDGGQGGTGGTGGNAGAGGTGFTQGA 594
G GG G G G + A
Sbjct: 62 HGN-----------------GGGNGNSGGGSGTGGNLSAVA 85



Score = 30.5 bits (68), Expect = 0.028
Identities = 30/90 (33%), Positives = 39/90 (43%), Gaps = 13/90 (14%)

Query: 345 GGNGGAGSTGTAGGGSGGTGGDGGTGGRGGLLMGAGAGG-HGGTGGAGGAGVNGGGAGGA 403
GG+G +TG GG G G GG G+G + GG G+G++ GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 404 GGAGGNGGAGGQAALLFGRGGTGGAGGYGG 433
G GGNG +GG G+G G
Sbjct: 63 GNGGGNGNSGG------------GSGTGGN 80



Score = 30.5 bits (68), Expect = 0.029
Identities = 27/79 (34%), Positives = 29/79 (36%), Gaps = 1/79 (1%)

Query: 148 GQAGGAGGSAGLLGNGGSGGAGGTGAPGGNGGNAGWLYGRGGVGGAGGIGGGTGGAGGHA 207
G G A +GG G G GG +GW GG G G GG GH
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 208 WLFGHGGTGGIGGGPGGNG 226
G G G G G GGN
Sbjct: 64 -NGGGNGNSGGGSGTGGNL 81



Score = 29.7 bits (66), Expect = 0.048
Identities = 34/110 (30%), Positives = 38/110 (34%), Gaps = 6/110 (5%)

Query: 402 GAGGAGGNGGAGGQAALLFGRGGTGGAGGYGGDGGGGGDGFDGTMAGLGGTGGSGGTGGD 461
G G G N GA + G G G GGG DG + GGSG
Sbjct: 3 GGDGRGHNTGAHSTS------GNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHW 56

Query: 462 GGAPGNGGAGGAGQLLSHSGVAGASGKGGAGGTGGNGGAGSAGADAPAGS 511
GG G+G GG G SG G A G + GA A S
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVS 106



Score = 29.7 bits (66), Expect = 0.049
Identities = 27/83 (32%), Positives = 32/83 (38%), Gaps = 12/83 (14%)

Query: 619 TAAAGTTGGAGGAGGAGGTGGAAGTGTGGQQGNGGNGGNGGTGGKGGTGGAGMNSLDPLL 678
T A T+G G G GG A G+G N GG G+G G G
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH-------- 62

Query: 679 AAQDGGQGGTGGTGGNAGAGGTG 701
G GG G +GG +G GG
Sbjct: 63 ----GNGGGNGNSGGGSGTGGNL 81


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Rv3512 | cloacin | 37 | 5e-04 | Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 36.6 bits (84), Expect = 5e-04
Identities = 38/104 (36%), Positives = 44/104 (42%), Gaps = 2/104 (1%)

Query: 444 GQGGAGGTGGAGAASSATNGGSGGAGGTGGDGGSGGAGGTGGAGGTGGAAGDGGQGGQGG 503
G G G GA + S NGG G G GG S G+G + GG +G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGG--ASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 504 AGGGAGGQGGAGGAGGTGGNGGNITGGTAGTAGAAGNGGAAGKG 547
G GG G +GG GTGGN + A A GA G
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 36.2 bits (83), Expect = 8e-04
Identities = 37/111 (33%), Positives = 47/111 (42%), Gaps = 5/111 (4%)

Query: 633 GNGTGGAGGNGGGGANGGAGGAGGSGGGTGGNGGAGGDAGDAGNGGNGNGTGNGGNGGNG 692
G G + G NGG G G GG + G+G + + G G+G G G GNG
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 693 GIAGMGGNGGAGTGSGNGGNGGSGGNGGNAGMGGNSGTGSGDGGAGGNGGA 743
GGNG +G GSG GGN + G S G+G + GA
Sbjct: 66 -----GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 36.2 bits (83), Expect = 8e-04
Identities = 26/73 (35%), Positives = 31/73 (42%)

Query: 588 GSGGQGGNAGGGGAGGQGGADGGSGGDGGDAGTGGNGGNGGNRNSGNGTGGAGGNGGGGA 647
G + G GG G G G G + N GG SG GG G+G GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 648 NGGAGGAGGSGGG 660
NG +GG G+GG
Sbjct: 68 NGNSGGGSGTGGN 80



Score = 35.8 bits (82), Expect = 0.001
Identities = 29/78 (37%), Positives = 32/78 (41%)

Query: 948 GAAGNGGNGGNAGAGGNGNGGTGGAGGIGGTGGNGGDAEPGVPPGAGGAGGAGTTGGKGG 1007
G G G N G GN NGG G G GG G + P G G G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1008 TGGNGSGTGSGGTGGDGG 1025
G G+G GG+G G
Sbjct: 63 GNGGGNGNSGGGSGTGGN 80



Score = 34.7 bits (79), Expect = 0.002
Identities = 25/74 (33%), Positives = 32/74 (43%)

Query: 678 GNGNGTGNGGNGGNGGIAGMGGNGGAGTGSGNGGNGGSGGNGGNAGMGGNSGTGSGDGGA 737
G+ G + NGG G+G GGA GSG G G +G+ G+G G+GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 738 GGNGGAAGTGGTGG 751
GN G G
Sbjct: 68 NGNSGGGSGTGGNL 81



Score = 34.7 bits (79), Expect = 0.002
Identities = 30/78 (38%), Positives = 36/78 (46%)

Query: 682 GTGNGGNGGNGGIAGMGGNGGAGTGSGNGGNGGSGGNGGNAGMGGNSGTGSGDGGAGGNG 741
G G G N G +G G G G G G + GSG + N GG SG+G GG G+G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 742 GAAGTGGTGGDGGLTGTG 759
G G +GG G G
Sbjct: 64 NGGGNGNSGGGSGTGGNL 81



Score = 34.7 bits (79), Expect = 0.002
Identities = 30/84 (35%), Positives = 35/84 (41%)

Query: 732 SGDGGAGGNGGAAGTGGTGGDGGLTGTGGTGGSGGTGGDGGNGGNGADNTANMTAQAGGD 791
SG G G N GA T G G G G S G+G N G + + + G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 792 GGNGGDGGFGGGAGAGGGGLTAGA 815
GNGG G GG GG L+A A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVA 85



Score = 34.3 bits (78), Expect = 0.003
Identities = 35/87 (40%), Positives = 42/87 (48%), Gaps = 3/87 (3%)

Query: 658 GGGTGGNGGAGGDAGDAGNGGNGNGTGNGGNGGNGGIAGMGGNGGAGTGSGNGGNGGSGG 717
G G G N GA +G+ G G G G G + G+G + GG GSG+G + G G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGG---GSGSGIHWGGGS 60

Query: 718 NGGNAGMGGNSGTGSGDGGAGGNGGAA 744
GN G GNSG GSG GG A
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAP 87



Score = 33.9 bits (77), Expect = 0.004
Identities = 31/97 (31%), Positives = 37/97 (38%)

Query: 845 GNGGTGGNGGTGGTGGAGIGSLGGGTGGDGGNGGNGGTGGEGGEVGGAGGTGGAAGNGGD 904
G G G T G G LG G G G+G + GG G GG +G+G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 905 GGTGGTGGGDGGAGGTGGTGGTGGLGDPRVGGSGGDG 941
GG G +GGG G G G P + G G
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 33.5 bits (76), Expect = 0.005
Identities = 29/80 (36%), Positives = 32/80 (40%)

Query: 420 SGGAGGAAGAGGAGGGANGTAGNGGQGGAGGTGGAGAASSATNGGSGGAGGTGGDGGSGG 479
SGG G G N G G G GG SS N GG+G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 480 AGGTGGAGGTGGAAGDGGQG 499
G GG G +GG +G GG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 33.5 bits (76), Expect = 0.005
Identities = 24/75 (32%), Positives = 32/75 (42%)

Query: 1004 GKGGTGGNGSGTGSGGTGGDGGTGGGGGNGGTGWNGGKGDTGSGGGAGDGGKAPAGGTGG 1063
G+G G S +G+ G G GGG + G+GW+ G G G+G +G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 1064 AGGDGGAGGKGGSGG 1078
G GG G G
Sbjct: 66 GGNGNSGGGSGTGGN 80



Score = 33.1 bits (75), Expect = 0.006
Identities = 25/84 (29%), Positives = 30/84 (35%)

Query: 600 GAGGQGGADGGSGGDGGDAGTGGNGGNGGNRNSGNGTGGAGGNGGGGANGGAGGAGGSGG 659
G G+G G G G G GG + G+G GGG+ G GGSG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 660 GTGGNGGAGGDAGDAGNGGNGNGT 683
G GG G G G +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 32.8 bits (74), Expect = 0.008
Identities = 35/109 (32%), Positives = 41/109 (37%), Gaps = 1/109 (0%)

Query: 410 GNGGQGGQGGSGGAGGAAGAGGAGGGANGTAGNG-GQGGAGGTGGAGAASSATNGGSGGA 468
G G+G G+ G G G G G A +G G G G+ S GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 469 GGTGGDGGSGGAGGTGGAGGTGGAAGDGGQGGQGGAGGGAGGQGGAGGA 517
G GG+G SGG GTGG A G G G + GA
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 32.8 bits (74), Expect = 0.008
Identities = 35/101 (34%), Positives = 40/101 (39%), Gaps = 2/101 (1%)

Query: 565 AGGDGGAGGTGGDRTVGG--GTVPAGSGGQGGNAGGGGAGGQGGADGGSGGDGGDAGTGG 622
+GGDG TG T G G G G + G G + GGSG G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 623 NGGNGGNRNSGNGTGGAGGNGGGGANGGAGGAGGSGGGTGG 663
+G GGN NSG G+G G A G S G GG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 32.8 bits (74), Expect = 0.008
Identities = 28/86 (32%), Positives = 37/86 (43%), Gaps = 2/86 (2%)

Query: 589 SGGQGGNAGGGGAGGQGGADGGSGGDGGDAGTGGNGGNGGNRNSGNGTGGAGGNGGGGAN 648
SGG G G G +GG G G G G + G+G + + GG+G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGV--GGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 649 GGAGGAGGSGGGTGGNGGAGGDAGDA 674
G G GG+G GG+G G + A
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVA 85



Score = 32.4 bits (73), Expect = 0.010
Identities = 27/84 (32%), Positives = 30/84 (35%)

Query: 878 GNGGTGGEGGEVGGAGGTGGAAGNGGDGGTGGTGGGDGGAGGTGGTGGTGGLGDPRVGGS 937
G G G G +G G G GG G G G G G+ G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 938 GGDGGTGGSGGAAGNGGNGGNAGA 961
G GG G SGG +G GGN A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 32.4 bits (73), Expect = 0.012
Identities = 28/70 (40%), Positives = 29/70 (41%)

Query: 844 GGNGGTGGNGGTGGTGGAGIGSLGGGTGGDGGNGGNGGTGGEGGEVGGAGGTGGAAGNGG 903
G T GN G TG G G+G N GG G G GG G G GNG
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGN 70

Query: 904 DGGTGGTGGG 913
GG GTGG
Sbjct: 71 SGGGSGTGGN 80



Score = 32.0 bits (72), Expect = 0.014
Identities = 32/102 (31%), Positives = 41/102 (40%)

Query: 120 GGKGGNGGIGAAGTTGPVGTGASGGTGGSGGAGGTGGDGGAANGGTAGAGGAGGNGGKGG 179
GG G GA T+G + G +G G G + G+G G G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 180 DGGAGVTSSTAGNSGGAGGSGGKGGDAGAGGAGATPGANGIA 221
G G +S G+ G S A A +TPGA G+A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.0 bits (72), Expect = 0.015
Identities = 27/83 (32%), Positives = 32/83 (38%), Gaps = 1/83 (1%)

Query: 663 GNGGAGGDAGDAGNGGNGNGTGNGGNGGNGGIAGMG-GNGGAGTGSGNGGNGGSGGNGGN 721
G G G + G GN NG G G G G G + G G+G GG G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 722 AGMGGNSGTGSGDGGAGGNGGAA 744
GGN +G G G G A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVA 85



Score = 31.6 bits (71), Expect = 0.017
Identities = 26/80 (32%), Positives = 33/80 (41%)

Query: 242 GAGDGGHGGTGAAGGNGGTGGAGGSGIDGVGGGTGGTGGNGGNGAIGGAGGDAGGSGNSG 301
G G G + G + GN G G G G+G + N G G+G GG G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 302 GNGGIGGKGGNAGAGGAAGS 321
GG G GG +G GG +
Sbjct: 64 NGGGNGNSGGGSGTGGNLSA 83



Score = 31.6 bits (71), Expect = 0.021
Identities = 38/115 (33%), Positives = 46/115 (40%), Gaps = 3/115 (2%)

Query: 700 NGGAGTGSGNGGNGGSGG-NGGNAGMGGNSGTGSGDGGAGGNGGAAGTGGTGGDGGLTGT 758
+GG G G G + SG NGG G+G G G G + N G G+G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWG--GG 59

Query: 759 GGTGGSGGTGGDGGNGGNGADNTANMTAQAGGDGGNGGDGGFGGGAGAGGGGLTA 813
G G GG G GG G G + +A A G G G G L+A
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSA 114



Score = 31.6 bits (71), Expect = 0.022
Identities = 25/84 (29%), Positives = 30/84 (35%)

Query: 809 GGLTAGANGTGGQGGAGGDGGNGAIGGHGPLTDDPGGNGGTGGNGGTGGTGGAGIGSLGG 868
GG G N +GG +G G +D G + GG G+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 869 GTGGDGGNGGNGGTGGEGGEVGGA 892
G GG GN G G G A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 30.8 bits (69), Expect = 0.031
Identities = 35/114 (30%), Positives = 45/114 (39%), Gaps = 1/114 (0%)

Query: 19 NGGNGADNTTTAAAGTTGGAGGAGGAGGTGGTGGAAGTGTGGQQGNGGNGGNGGTGGKGG 78
+GG+G + T A + + GG G G GG +G + GG+G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 79 TGGDGALAGSSGGAGGKGGNGGDAGKAGTG-SAPGTAGTGGDGGKGGNGGIGAA 131
G G S GG+G G A G A T G GG G + AA
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 30.8 bits (69), Expect = 0.034
Identities = 27/81 (33%), Positives = 32/81 (39%)

Query: 898 AAGNGGDGGTGGTGGGDGGAGGTGGTGGTGGLGDPRVGGSGGDGGTGGSGGAAGNGGNGG 957
+ G+G TG GG G G GG D S + GGSG GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 958 NAGAGGNGNGGTGGAGGIGGT 978
+ GGNGN G G G +
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLS 82



Score = 30.8 bits (69), Expect = 0.034
Identities = 28/102 (27%), Positives = 35/102 (34%)

Query: 567 GDGGAGGTGGDRTVGGGTVPAGSGGQGGNAGGGGAGGQGGADGGSGGDGGDAGTGGNGGN 626
G G G G + G +G G G+G + GG G GG G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 627 GGNRNSGNGTGGAGGNGGGGANGGAGGAGGSGGGTGGNGGAG 668
G +GN GG+G G A G T G GG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 30.8 bits (69), Expect = 0.034
Identities = 32/100 (32%), Positives = 35/100 (35%)

Query: 485 GAGGTGGAAGDGGQGGQGGAGGGAGGQGGAGGAGGTGGNGGNITGGTAGTAGAAGNGGAA 544
G G G G G G G GG G + N GG +G+ G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 545 GKGGAGGQGGTGGGTGGQGGAGGDGGAGGTGGDRTVGGGT 584
G GG G G G GTGG A A G T G G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 30.5 bits (68), Expect = 0.040
Identities = 30/85 (35%), Positives = 38/85 (44%), Gaps = 2/85 (2%)

Query: 305 GIGGKGGNAGAGGAAGSNGGTVGANGTGGDGGNGGAAGAATAGSNGGAGTGSAGGNGGTG 364
G G+G N GA +G+ G G G G GG +G ++ + G G+GS GG
Sbjct: 3 GGDGRGHNTGAHSTSGNING--GPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 365 GRGGSGGAGGDGIGGVGGGKGGNGA 389
G G GG G G G GG A
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVA 85



Score = 30.5 bits (68), Expect = 0.043
Identities = 29/80 (36%), Positives = 35/80 (43%), Gaps = 2/80 (2%)

Query: 801 GGGAGAGGGGLTAGANGTGGQGGAGGDGGNGAIGGHGPLTDDPGGNGGTGGNGGTGGTGG 860
G G G G + N GG G G GG A G G +++ GG+G GG G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGG--ASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 861 AGIGSLGGGTGGDGGNGGNG 880
G G G +GG G GGN
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 30.5 bits (68), Expect = 0.044
Identities = 27/85 (31%), Positives = 35/85 (41%)

Query: 148 SGGAGGTGGDGGAANGGTAGAGGAGGNGGKGGDGGAGVTSSTAGNSGGAGGSGGKGGDAG 207
SGG G G + G G G G G G+G +S GG+G GG +G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 208 AGGAGATPGANGIAGNGGDGGDGAA 232
G G + G +G GG+ AA
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 30.5 bits (68), Expect = 0.048
Identities = 30/102 (29%), Positives = 39/102 (38%)

Query: 368 GSGGAGGDGIGGVGGGKGGNGADGEVGGAGGAGGSGPNTSPGGNGGQGGQGGSGGAGGAA 427
G G G + G G G G G + GSG ++ GG G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 428 GAGGAGGGANGTAGNGGQGGAGGTGGAGAASSATNGGSGGAG 469
G GG G + G +G GG A A + + G+GG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 30.5 bits (68), Expect = 0.048
Identities = 38/109 (34%), Positives = 43/109 (39%), Gaps = 11/109 (10%)

Query: 961 AGGNGNGGTGGAGGIGGTGGNGGDAEPGVPPGAGGAGGAGTTGGKGGTGGNGSGTGSGGT 1020
+GG+G G GA G G P G GG + G + N G GSG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGG--------PTGLGVGGGASDGSGWSSENNPWGGGSGSG 53

Query: 1021 GGDGGTGGGGGNGGTGWNGGKGDTGSGGGAGDGGKA---PAGGTGGAGG 1066
GG G G GG G +GG TG A A PA T GAGG
Sbjct: 54 IHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv3514    cloacin  37     7e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 37.0 bits (85), Expect = 7e-04
Identities = 33/102 (32%), Positives = 41/102 (40%)

Query: 479 GAAGTGGTGGMIGTTGNAGVGGAGGQGGDGGAGGAGADADQPGATGGTGFAGGAGGAGGA 538
G G G G T+GN G G G G + G+G ++ GG+G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 539 GGSSGAGGTNGSGGAGGTGGQGGAGGAGGAGADNPTGIGGTG 580
G G G + G G GG A A G A + G GG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 36.6 bits (84), Expect = 9e-04
Identities = 34/109 (31%), Positives = 40/109 (36%)

Query: 505 GGDGGAGGAGADADQPGATGGTGFAGGAGGAGGAGGSSGAGGTNGSGGAGGTGGQGGAGG 564
GGDG GA + GG G GGA G S G G G GG+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 565 AGGAGADNPTGIGGTGGDGGTGGAAGAGGAGGAAGTGGTGGMIGTTGNA 613
G G N G GTGG+ A A G + G G + + A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 36.2 bits (83), Expect = 0.001
Identities = 29/89 (32%), Positives = 35/89 (39%)

Query: 1045 GAGGKGGAGGSSGAGGTNGSGGAGGAGGQGGAGGAGISFSNGSNGGTGGTGGVGGTGGDG 1104
G G+G G+ G G G G G + G+G S N GG G+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1105 GNAGTGAGDPGKGGTGGTGGTGGSGGAGG 1133
GN G G GTGG + A G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 36.2 bits (83), Expect = 0.001
Identities = 30/90 (33%), Positives = 38/90 (42%)

Query: 1244 AGGPGGKGGAGGNAGTGGTNGSGAGGAGGQGGAGGAGISFSNGSNGGTGGTGGVGGTGGD 1303
+GG G G ++ +G NG G G G + G+G S N GG G+G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1304 GGNAGTGAGDPGKGGTGGTGGTGGSGGAGG 1333
GN G G GTGG + A G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 35.8 bits (82), Expect = 0.001
Identities = 34/109 (31%), Positives = 40/109 (36%)

Query: 622 GGDGGAGGAGADADQPGATGGTGFAGGAGGAGKAGGSSSAGGTNSSGSAGGTGRQSGTGG 681
GGDG GA + GG G GGA G SS GS G G+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 682 AGGAGADNPTGIGGTGGDGGTGGAAGAGGAGGAAGTGGTGGMIGTTGNA 730
G G N G GTGG+ A A G + G G + + A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 35.8 bits (82), Expect = 0.001
Identities = 27/89 (30%), Positives = 34/89 (38%)

Query: 1378 GKGGTNGNGGSGGTGGTGGPGGSGGAPTGSGTGGKGGAGGDGGDGADGGAATGVGDGGDG 1437
G G N G+ T G G +G G + G G + + G G+ G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1438 GNGGNGGNGGTGVGSPGGLGGAGGTGGLG 1466
GNGG GN G G G+ G L G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 34.7 bits (79), Expect = 0.003
Identities = 36/101 (35%), Positives = 39/101 (38%), Gaps = 3/101 (2%)

Query: 1321 GTGGTGGSGGAGGSGGANFNGGTGGTGGTGGTGGKGGMGGIAG--DGGPGGDGGNAGVGG 1378
G G G + GA + G N NGG G G GG G GG G G G
Sbjct: 3 GGDGRGHNTGAHSTSG-NINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1379 KGGTNGNGGSGGTGGTGGPGGSGGAPTGSGTGGKGGAGGDG 1419
G GNG SGG GTGG + AP G G G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 34.3 bits (78), Expect = 0.004
Identities = 31/97 (31%), Positives = 37/97 (38%)

Query: 126 GGAGGAGGLLFGNGGAGAAGAPGQAGGPGGPAGLWGNGGPGGAGGSGGGTGGAGGAGGWL 185
GG G G + G+G + GG G WG G G GG G +GG G GG L
Sbjct: 22 GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNL 81

Query: 186 FGVGGAGGVGGAGGGTGGAGGPGGLIWGGGGAGGVGG 222
V G T GAGG I G + +
Sbjct: 82 SAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 34.3 bits (78), Expect = 0.004
Identities = 26/79 (32%), Positives = 34/79 (43%)

Query: 1218 IGGDGGQGGNGGQGDSGSGLGGQPGFAGGPGGKGGAGGNAGTGGTNGSGAGGAGGQGGAG 1277
+ G G+G N G + + G P G GG G + G G+G GG
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 1278 GAGISFSNGSNGGTGGTGG 1296
G G NG++GG GTGG
Sbjct: 61 GHGNGGGNGNSGGGSGTGG 79



Score = 33.9 bits (77), Expect = 0.005
Identities = 32/102 (31%), Positives = 40/102 (39%), Gaps = 1/102 (0%)

Query: 1070 AGGQGGAGGAGISFSNGSNGGTGGTGGVGGTGGDGGNAGTGAGDPGKGGTGGTGGTGGSG 1129
+GG G G ++G+ G GVGG DG + +P GG+G GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSS-ENNPWGGGSGSGIHWGGGS 60

Query: 1130 GAGGSGGANFNGGTGGTGGTGGTGGKGGMGGIAGDGGPGGDG 1171
G G GG +GG GTGG G PG G
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 33.9 bits (77), Expect = 0.005
Identities = 32/102 (31%), Positives = 40/102 (39%), Gaps = 1/102 (0%)

Query: 1270 AGGQGGAGGAGISFSNGSNGGTGGTGGVGGTGGDGGNAGTGAGDPGKGGTGGTGGTGGSG 1329
+GG G G ++G+ G GVGG DG + +P GG+G GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSS-ENNPWGGGSGSGIHWGGGS 60

Query: 1330 GAGGSGGANFNGGTGGTGGTGGTGGKGGMGGIAGDGGPGGDG 1371
G G GG +GG GTGG G PG G
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 33.9 bits (77), Expect = 0.005
Identities = 32/103 (31%), Positives = 39/103 (37%), Gaps = 1/103 (0%)

Query: 793 GAAGTGGTGGMIGTTGNAGVGGAGGQGGDGGAGGAGADADQPGATGGTGFAGGAGGAGGS 852
G G G G T+GN G G G G + G+G ++ GG+G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 853 GGSSCAGGTNGSGGAGGTCGQVVAGGAGISFSNGSNGGTGGTG 895
G G + G G GG VA F S G GG
Sbjct: 63 GNGGGNGNSGGGSGTGGN-LSAVAAPVAFGFPALSTPGAGGLA 104



Score = 33.9 bits (77), Expect = 0.005
Identities = 37/108 (34%), Positives = 42/108 (38%), Gaps = 5/108 (4%)

Query: 1100 TGGDGGNAGTGAGDPGKGGTGGTGGTGGSGGAGGSGGANFNGGTGGTGGTGGTGGKGGMG 1159
+GGDG TGA GG G G GGA G + G G G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1160 GIAGDGGPGGDGGNAGVGGKGGTNGNGGSGGTGGTGGAGGNAGAGGLA 1207
G GG GN+G G G N + + A GAGGLA
Sbjct: 62 -----HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 33.5 bits (76), Expect = 0.007
Identities = 27/89 (30%), Positives = 32/89 (35%)

Query: 1369 GDGGNAGVGGKGGTNGNGGSGGTGGTGGPGGSGGAPTGSGTGGKGGAGGDGGDGADGGAA 1428
G G G T+GN G TG G G S G+ S GG G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1429 TGVGDGGDGGNGGNGGNGGTGVGSPGGLG 1457
G G+ G G G + V +P G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 32.8 bits (74), Expect = 0.011
Identities = 28/102 (27%), Positives = 36/102 (35%)

Query: 596 GAAGTGGTGGMIGTTGNAGVGGAGGQGGDGGAGGAGADADQPGATGGTGFAGGAGGAGKA 655
G G G G T+GN G G G G + G+G ++ GG+G GG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 656 GGSSSAGGTNSSGSAGGTGRQSGTGGAGGAGADNPTGIGGTG 697
G G + GG A G A + G GG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.8 bits (74), Expect = 0.012
Identities = 36/119 (30%), Positives = 45/119 (37%), Gaps = 10/119 (8%)

Query: 713 GAAGTGGTGGMIGTTGNAGVGGAGGSSGAGGTNGSGGAGGTDGQGGAGGAGGAGADNPTG 772
G G G G T+GN G G G G ++GSG + + GG G+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSG--------- 53

Query: 773 IGGTGGDGGTGGAAGAGGAGGAAGTGGTGGMIGTTGNAGVGGAGGQGGDGGAGGAGADA 831
GG G G G G +GG +GTGG + G G G A A A
Sbjct: 54 -IHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 32.8 bits (74), Expect = 0.014
Identities = 34/101 (33%), Positives = 37/101 (36%), Gaps = 2/101 (1%)

Query: 391 GGDGGAGGAGADADQPGATGGTGFAGGAGGAGGAGGSSGAGGTNGSGGAGGQGGAGGAGG 450
GGDG GA + GG G GGA G S G G G GG+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 451 AGADNPTGIGGTGGDGGTGGAAGAGGAGG--AAGTGGTGGM 489
GG G GG A A A G A T G GG+
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGL 103



Score = 32.4 bits (73), Expect = 0.014
Identities = 33/112 (29%), Positives = 37/112 (33%), Gaps = 9/112 (8%)

Query: 1183 NGNGGSGGTGGTGGAGGNAGAGGLANTGGTAGNAGIGGDGGQGGNGGQGDSGSGLGGQPG 1242
+G G G G GN G G + G G GG SG GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1243 FAGGPGGKGGAGGNAGTGGTNGSGAGGAGGQG-------GAGGAGISFSNGS 1287
G GG G G G N S G GAGG +S S G+
Sbjct: 62 --HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 32.4 bits (73), Expect = 0.015
Identities = 40/128 (31%), Positives = 47/128 (36%), Gaps = 23/128 (17%)

Query: 965 SGTGGTGGTGGKGGTGGAGDDSAGGTGGTGGAGGNAGAGGLANTGGTAGNAGIGGDGGQG 1024
SG G G G T G + G G GGA +G N G +GI GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1025 GNGGQGDSGSGLGGQPGFAGGAGGKGGAGGSSGAGGTNGSGGAGGAGG-----QGGAGGA 1079
G GG G +GG SG GG + A A G GAGG
Sbjct: 62 H------------------GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGL 103

Query: 1080 GISFSNGS 1087
+S S G+
Sbjct: 104 AVSISAGA 111



Score = 32.0 bits (72), Expect = 0.023
Identities = 30/101 (29%), Positives = 39/101 (38%)

Query: 363 GGAGGAAGQLFSASGAAGNAGVGGAGGQGGDGGAGGAGADADQPGATGGTGFAGGAGGAG 422
G G S SG G G G G+G + + G +G GG G G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 423 GAGGSSGAGGTNGSGGAGGQGGAGGAGGAGADNPTGIGGTG 463
GG+ +GG +G+GG A A G A + G GG
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 31.2 bits (70), Expect = 0.040
Identities = 25/74 (33%), Positives = 31/74 (41%)

Query: 877 GGAGISFSNGSNGGTGGTGGVGGTGGDGGNAGTGAGDPGKGGTGGTGGTGGSGGAGGSGG 936
G + S N G TG G G G+ + +P GG+G GG G G GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 937 ANFNGGTGGTGGTG 950
+GG GTGG
Sbjct: 68 NGNSGGGSGTGGNL 81



Score = 30.8 bits (69), Expect = 0.045
Identities = 26/82 (31%), Positives = 36/82 (43%)

Query: 856 SCAGGTNGSGGAGGTCGQVVAGGAGISFSNGSNGGTGGTGGVGGTGGDGGNAGTGAGDPG 915
S G + GA T G + G G+ G++ G+G + GG G+ G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 916 KGGTGGTGGTGGSGGAGGSGGA 937
G GG G +GG G GG+ A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSA 83


50  Rv3523  Rv3532  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv3523    -1      14       -4.519603  Probable lipid carrier protein or keto acyl-CoA
Rv3524    -2      14       -5.226110  Probable conserved membrane protein
Rv3525c   -2      13       -5.203744  Possible siderophore-binding protein
Rv3526    -1      11       -5.055000  Oxygenase component of
Rv3527    0       13       -3.761756  Hypothetical protein
Rv3528c   3       11       -2.993440  Unknown protein
Rv3529c   2       12       -1.039153  Conserved hypothetical protein
Rv3530c   1       13       -0.386462  Possible oxidoreductase
Rv3531c   2       14       0.052834   Hypothetical protein
Rv3532    2       11       0.039892   PPE family protein PPE61
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv3530c   DHBDHDRGNASE  81     2e-20    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 81.3 bits (200), Expect = 2e-20
Identities = 60/254 (23%), Positives = 103/254 (40%), Gaps = 11/254 (4%)

Query: 8 KVIVVSGVGPGLGTTLAHRCARDGADLVLAARSAERLDDVAKQIIDTGRRAVAVRTDITD 67
K+ ++G G+G +A A GA + + E+L+ V + R A A D+ D
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 68 DDDVSNLVQATLAAYGKADVLINNAFRVPSMKPLAGTTFEHIRDAIELSALGTLRLIQAF 127
+ + G D+L+N A V + + E +++ G ++
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVA-GVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 128 TPAL-AQSHGAIVNVNSMVIRHSQPKYGTYKMAKSVLLAMSHSLATELGEQGIRVNSVAP 186
+ + + G+IV V S + Y +K+ + + L EL E IR N V+P
Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187

Query: 187 GYIWGDTLKSYFDHQAGKYGTTVDQIYQATAANSD----LKRLPTEDEVASAILFLASDL 242
G D S + + G +Q+ + + LK+L ++A A+LFL S
Sbjct: 188 GSTETDMQWSLWADENGA-----EQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 243 ASGITGQTLDVNCG 256
A IT L V+ G
Sbjct: 243 AGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv3531c   PF03944  31     0.011    delta endotoxin
		>PF03944#delta endotoxin

Length = 633

Score = 30.8 bits (69), Expect = 0.011
Identities = 13/48 (27%), Positives = 22/48 (45%)

Query: 59 SGTGPFTKMGLDNPDTLYFGTRLQANRDYVVSGRRGTTTDLSFQLLGG 106
SG+GP + D + + Q N +YV++G G +F + G
Sbjct: 278 SGSGPQQTQSFTSQDWPFLYSLFQVNSNYVLNGFSGARLSNTFPNIVG 325


51  Rv3585  Rv3596c  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv3585    0       10       4.113448   DNA repair protein RadA (DNA repair protein
Rv3586    2       11       4.304876   Conserved hypothetical protein
Rv3587c   4       10       4.938602   Probable conserved membrane protein
Rv3588c   2       10       3.850463   Beta-carbonic anhydrase CanB
Rv3589    3       9        4.047384   Probable adenine glycosylase MutY
Rv3590c   5       10       5.223410   PE-PGRS family protein PE_PGRS58
Rv3591c   0       10       1.525190   Possible hydrolase
Rv3592    0       12       1.410262   Possible heme degrading protein MhuD
Rv3593    1       10       0.238616   Probable conserved lipoprotein LpqF
Rv3594    2       11       0.468136   Conserved hypothetical protein
Rv3595c   2       10       0.015033   PE-PGRS family protein PE_PGRS59
Rv3596c   3       11       -2.460790  Probable ATP-dependent protease ATP-binding
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv3587c   PF03544  28     0.023    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 28.4 bits (63), Expect = 0.023
Identities = 25/103 (24%), Positives = 35/103 (33%), Gaps = 4/103 (3%)

Query: 27 VVVVGIAVAIVIAFVDSSAGAKPVSADKPASAQSHPGSPAPQAPQPAGQTEGNAAAAP-P 85
VV G+ V ++ A A+P+S A A P P+P + E P P
Sbjct: 27 AVVAGLLYTSVHQVIELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEP 86

Query: 86 QGQNP---ETPTPTAAVQPPPVLKEGDDCPDSTLAVKGLTNAP 125
+ P E P P +P PV K D +
Sbjct: 87 PKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPF 129


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv3590c   cloacin  39     3e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 39.3 bits (91), Expect = 3e-05
Identities = 43/132 (32%), Positives = 50/132 (37%), Gaps = 2/132 (1%)

Query: 449 AGTGGVGGVGGAGGQGGLLFGDGGNGGAGGAGGIGGTGASGGAGGKGGSGLVGGDGGNGG 508
+G G G GA G + G G GG G +S GGSG GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 509 AGGAGGNGGKGGAGGAGGGAGMFSQPGVHG--AGGTGGQGGAGGAGGAGGAAGAGTVVAG 566
G GGNG GG G GG + P G A T G GG + AG + A +
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMA 121

Query: 567 NPGDPGGFGAAG 578
P FG G
Sbjct: 122 ALKGPFKFGLWG 133



Score = 38.2 bits (88), Expect = 1e-04
Identities = 32/91 (35%), Positives = 41/91 (45%)

Query: 136 GGPGGLLWGNGGNGGSGVAGVGGPGGSGGAAGLFGHGGNGGAGGSNAAGAGGVGGAGGAG 195
GGP GL G G + GSG + P G G +G+ GG+G G +GG G GG
Sbjct: 22 GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNL 81

Query: 196 WLVGNGGAGGFGGVGTTVSGNGGAGGAAGAF 226
V A GF + T +G +AGA
Sbjct: 82 SAVAAPVAFGFPALSTPGAGGLAVSISAGAL 112



Score = 35.5 bits (81), Expect = 5e-04
Identities = 31/84 (36%), Positives = 40/84 (47%), Gaps = 5/84 (5%)

Query: 314 AGGAGGLGGSGSDTSEGPVTGGNGGNGGDGGPGAPGGNGA---PGGIGVNTGTGWAYGGN 370
+GG G +G+ ++ G + GG G G GG G + P G G +G W GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHW--GGG 59

Query: 371 GGNGGDGGAGARGGDGGNGGNGLA 394
G+G GG G GG G GGN A
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSA 83



Score = 33.5 bits (76), Expect = 0.003
Identities = 30/97 (30%), Positives = 36/97 (37%), Gaps = 10/97 (10%)

Query: 395 LNGGNGIGGNGGAGGRGGTGAAGGNGGIGGGATGTLTFFGSGGDGGPGGAGANTAGTGGV 454
++GG+G G N GA G GG G+G G G G G + N GG
Sbjct: 1 MSGGDGRGHNTGAHSTSGN-INGGPTGLGV---------GGGASDGSGWSSENNPWGGGS 50

Query: 455 GGVGGAGGQGGLLFGDGGNGGAGGAGGIGGTGASGGA 491
G GG G G G GG+G G A
Sbjct: 51 GSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAP 87



Score = 32.8 bits (74), Expect = 0.004
Identities = 33/97 (34%), Positives = 37/97 (38%), Gaps = 3/97 (3%)

Query: 145 NGGNGGSGVAGVGGPGGSGGAAGLFGHGGNGGAGGSNAAGAGGVGGAGGAGWLVGNGGAG 204
N G + GGP G G G G G+G GG GNGG
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG-HGNGGGN 68

Query: 205 GFGGVGTTVSGNGGAGGAAGAFGNGGVG--GAGGAAV 239
G G G+ GN A A AFG + GAGG AV
Sbjct: 69 GNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAV 105



Score = 32.8 bits (74), Expect = 0.004
Identities = 28/81 (34%), Positives = 32/81 (39%), Gaps = 3/81 (3%)

Query: 383 GGDGGNGGNGLALNGGNGIGGNGGAGGRGGTGAAGGNGGIG---GGATGTLTFFGSGGDG 439
GGDG G GN GG G G GG G GG +G+ +G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 440 GPGGAGANTAGTGGVGGVGGA 460
G GG N+ G G GG A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSA 83



Score = 32.8 bits (74), Expect = 0.004
Identities = 25/76 (32%), Positives = 30/76 (39%)

Query: 291 GANNGAGSGGAGLPGNPGAVPGRAGGAGGLGGSGSDTSEGPVTGGNGGNGGDGGPGAPGG 350
G N GA S + G P + G + G G S + G +G GG G G GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 351 NGAPGGIGVNTGTGWA 366
NG GG G A
Sbjct: 68 NGNSGGGSGTGGNLSA 83



Score = 32.4 bits (73), Expect = 0.005
Identities = 32/128 (25%), Positives = 44/128 (34%), Gaps = 10/128 (7%)

Query: 335 GNGGNGGDGGPGAPGGN--GAPGGIGVNTGTGWAYGGNGGNGGDGGAGARGGDGGNGGNG 392
G G G + G + GN G P G+GV GG G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVG-------GGASDGSGWSSENNPWGGGSGSGIH 55

Query: 393 LALNGGNGIGGNGGAGGRGGTGAAGGNGGIGGGATGTLTFFGSGGDGGPGGAGANTAGTG 452
G+G GG G G GG+G G + + G GG + + A +
Sbjct: 56 WGGGSGHGNGGGNGNSG-GGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSA 114

Query: 453 GVGGVGGA 460
+ + A
Sbjct: 115 AIADIMAA 122



Score = 31.6 bits (71), Expect = 0.009
Identities = 29/102 (28%), Positives = 36/102 (35%)

Query: 174 NGGAGGSNAAGAGGVGGAGGAGWLVGNGGAGGFGGVGTTVSGNGGAGGAAGAFGNGGVGG 233
+GG G + GA G G G G G G + N GG+ GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 234 AGGAAVIGGLPGNGGAGGNAGLIGAGGDGGVGGVGAPGTNGM 275
G G G G GGN + A G + PG G+
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGL 103



Score = 31.6 bits (71), Expect = 0.010
Identities = 31/102 (30%), Positives = 37/102 (36%), Gaps = 2/102 (1%)

Query: 483 GGTGASGGAGGKGGSGLVGGDGGNGGAGGAGGNGGKGGAGGAGGGAGMFSQPGVHGAGGT 542
GG G G SG + +GG G G GG G G S G+H GG+
Sbjct: 3 GGDGRGHNTGAHSTSGNI--NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 543 GGQGGAGGAGGAGGAAGAGTVVAGNPGDPGGFGAAGADGLPG 584
G G G GG+ G + A GF A G G
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 30.8 bits (69), Expect = 0.016
Identities = 37/108 (34%), Positives = 41/108 (37%), Gaps = 8/108 (7%)

Query: 160 GGSGGAAGLFGHGGNGGAGGSNAAGAGGVGGAGGAGWLVGNGGAGGFGGVGTTVSGN--- 216
GG G H +G G G G + G+GW N GG G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 217 ---GGAGGAAGAFGNGGVGGAGGAAVIGGLPG--NGGAGGNAGLIGAG 259
GG G + G G GG A A V G P GAGG A I AG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
Rv3593    BLACTAMASEA  39     3e-05    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 39.0 bits (91), Expect = 3e-05
Identities = 24/137 (17%), Positives = 46/137 (33%), Gaps = 3/137 (2%)

Query: 148 PSIASWRDVDAALSKTGARYSFQVAKVDNGRCDPVAGTNTGESLPLASIFKLYVLHALAG 207
S + + S+ R + ++D + E P+ S FK+ + A+
Sbjct: 21 ASPQPLEQIKLSESQLSGR--VGMIEMDLASGRTLTAWRADERFPMMSTFKVVLCGAVLA 78

Query: 208 AVQHNTVSWDDLLTVTAKSKAVGSSGLELPVGARVSVRTAAEKMIATSDNMATDLLIERL 267
V + + + S E + ++V I SDN A +LL+ +
Sbjct: 79 RVDAGDEQLERKIHYRQQDLVDYSPVSEKHLADGMTVGELCAAAITMSDNSAANLLLATV 138

Query: 268 -GTRAIEEALASAGHHD 283
G + L G +
Sbjct: 139 GGPAGLTAFLRQIGDNV 155


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv3595c   cloacin  39     4e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.5 bits (89), Expect = 4e-05
Identities = 31/95 (32%), Positives = 36/95 (37%)

Query: 152 GGSGGAAGLIGNGGSGGAGGAGAAGGSGGQGGLLYGNGGAGGNGGAATIPGGNGGAGGAG 211
G + GA GN G G G S G G N GG+G GG+G G G
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 212 GNAWLFGNGGAGGLGAAGAAGAAGVNPLTVPAGQG 246
G+G G L A A A G L+ P G
Sbjct: 68 NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 37.4 bits (86), Expect = 1e-04
Identities = 34/108 (31%), Positives = 40/108 (37%), Gaps = 8/108 (7%)

Query: 309 GGAGGIGGTGGEGGIGARGGTGGQGGMGGAGQPGVGGDAGDGGNGGIGGDGGAGGDGGAG 368
GG G TG G G G+GG G G + + GG G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 369 GAGGAGGLFGVSGSSGLGGAAGSGGNGGGGGEPGVAGSPGVGPAGRGG 416
G GG G GG +G+GGN P G P + G GG
Sbjct: 63 GNGGGNG--------NSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 35.1 bits (80), Expect = 6e-04
Identities = 33/93 (35%), Positives = 37/93 (39%), Gaps = 3/93 (3%)

Query: 119 GADGTAPGQNGGAGGLLYGNGGNGAAG---VNAGIAGGSGGAAGLIGNGGSGGAGGAGAA 175
GA T+ NGG GL G G + +G N GGSG G G G GG G +
Sbjct: 12 GAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNS 71

Query: 176 GGSGGQGGLLYGNGGAGGNGGAATIPGGNGGAG 208
GG G GG L G A G GG
Sbjct: 72 GGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 34.7 bits (79), Expect = 7e-04
Identities = 35/106 (33%), Positives = 41/106 (38%), Gaps = 1/106 (0%)

Query: 286 TGGTGGTGGAGGSGGRGGLLVGDGGAGGIGGTGGEGGIGARGGTGGQGGMGGAGQPGVGG 345
+GG G G G + G G G GG G + G G G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 346 DAGDGGNGGIGGDGGAGGDGGAGGAGGAGGLFGVSGSSGLGGAAGS 391
GGNG GG G GG+ A A A G +S + G GG A S
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALS-TPGAGGLAVS 106



Score = 31.2 bits (70), Expect = 0.010
Identities = 33/106 (31%), Positives = 39/106 (36%), Gaps = 6/106 (5%)

Query: 130 GAGGLLYGNGGNGAAGVNAGIAGGSGGAAGLIGNGGSGGAGGAGAAGGSGGQGGLLYGNG 189
G G + G + +G G G G G + GSG + GG G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGG--ASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 190 GAGGNGGAATIPGGNGGAGGAGGNAWLFGNGGAGGLGAAGAAGAAG 235
G G GG G +GG G GGN A G A GA G
Sbjct: 61 GHGNGGGN----GNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 30.5 bits (68), Expect = 0.013
Identities = 27/82 (32%), Positives = 29/82 (35%), Gaps = 1/82 (1%)

Query: 267 TGGTG-GTGGTGLSVGGTGGTGGTGGTGGAGGSGGRGGLLVGDGGAGGIGGTGGEGGIGA 325
+GG G G S G G TG G G S G G + GG G GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 326 RGGTGGQGGMGGAGQPGVGGDA 347
G GG G GG G A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSA 83



Score = 29.3 bits (65), Expect = 0.034
Identities = 29/91 (31%), Positives = 36/91 (39%)

Query: 342 GVGGDAGDGGNGGIGGDGGAGGDGGAGGAGGAGGLFGVSGSSGLGGAAGSGGNGGGGGEP 401
G G + G G+ G G G G + G S ++ GG +GSG + GGG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 402 GVAGSPGVGPAGRGGDGNLGQFGPEGAPGQP 432
G G G G G GNL A G P
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFP 93


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
Rv3596c   HTHFIS  32     0.007    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 32.5 bits (74), Expect = 0.007
Identities = 31/160 (19%), Positives = 59/160 (36%), Gaps = 28/160 (17%)

Query: 518 IIGQEDAVKAVSKAIRRTRAGLKDPKRPSGSFIFAGPSGVGKTELSKALANFLFGDDDAL 577
++G+ A++ + + + R + + + G SG GK +++AL ++ +
Sbjct: 139 LVGRSAAMQEIYRVLARL-------MQTDLTLMITGESGTGKELVARALHDYGKRRNGPF 191

Query: 578 IQIDMGEFHDRFTASRLFGAPPGYVGYEEGGQLTEKVRRKP--FS-----VVLFDEIEKA 630
+ I+M S LFG E G T R F + DEI
Sbjct: 192 VAINMAAIPRDLIESELFGH--------EKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDM 243

Query: 631 HQEIYNSLLQVLEDG---RLTDGQGRTVDFKNTVLIFTSN 667
+ LL+VL+ G + D + ++ +N
Sbjct: 244 PMDAQTRLLRVLQQGEYTTVGGRTPIRSDVR---IVAATN 280


52  Rv3609c  Rv3623  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv3609c   2       13       -0.602130  GTP cyclohydrolase I FolE (GTP-ch-I)
Rv3610c   2       14       -0.627125  Membrane-bound protease FtsH (cell division
Rv3611    4       25       -1.705415  Hypothetical arginine and proline rich protein
Rv3612c   2       18       -2.395576  Conserved hypothetical protein
Rv3613c   1       17       -2.208713  Hypothetical protein
Rv3614c   0       15       -2.283062  ESX-1 secretion-associated protein EspD
Rv3615c   1       15       -2.185560  ESX-1 secretion-associated protein EspC
Rv3616c   2       13       -0.819625  ESX-1 secretion-associated protein A, EspA
Rv3617    1       14       -0.234916  Probable epoxide hydrolase EphA (epoxide
Rv3618    1       15       0.026028   Possible monooxygenase
Rv3619c   -1      15       0.426469   Putative ESAT-6 like protein EsxV (ESAT-6 like
Rv3620c   0       15       1.701422   Putative ESAT-6 like protein EsxW (ESAT-6 like
Rv3621c   -1      13       1.761225   PPE family protein PPE65
Rv3622c   2       13       1.890467   PE family protein PE32
Rv3623    2       13       1.584843   Probable conserved lipoprotein LpqG
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
Rv3610c   HTHFIS  34     0.002    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 34.4 bits (79), Expect = 0.002
Identities = 23/82 (28%), Positives = 32/82 (39%), Gaps = 18/82 (21%)

Query: 199 VLLYGPPGTGKTLLARAV---AGEAGVPFFT-----ISGSDFVEMFVGV------GASRV 244
+++ G GTGK L+ARA+ PF I G GA
Sbjct: 163 LMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTR 222

Query: 245 RD-LFEQAKQNSPCIIFVDEID 265
FEQA+ + +F+DEI
Sbjct: 223 STGRFEQAEGGT---LFLDEIG 241


53  Rv3649  Rv3657c  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias   Product
Rv3649    -1      12       3.271086  Probable helicase
Rv3650    0       15       3.812545  PE family protein PE33
Rv3651    -1      14       3.106811  Conserved hypothetical protein
Rv3652    6       22       6.418509  PE-PGRS family-related protein PE_PGRS60
Rv3653    5       16       6.152767  PE-PGRS family-related protein PE_PGRS61
Rv3654c   4       17       4.035701  Conserved hypothetical protein
Rv3655c   3       17       3.911398  Conserved hypothetical protein
Rv3656c   1       14       3.269523  Conserved hypothetical protein
Rv3657c   1       15       4.067727  Possible conserved alanine rich membrane
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
Rv3652    RTXTOXINA  28     0.010    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 27.6 bits (61), Expect = 0.010
Identities = 10/49 (20%), Positives = 24/49 (48%), Gaps = 5/49 (10%)

Query: 13 AAATDLATLGSTIGAANAAAA-----GSTTALLTAGADEVSAAIAAYSE 56
A+ T ++T+ +++ + +AAA G+ + L + + I S+
Sbjct: 366 ASLTTISTVLASVSSGISAAATTSLVGAPVSALVGAVTGIISGILEASK 414


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv3653    cloacin  32     0.001    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 32.0 bits (72), Expect = 0.001
Identities = 27/102 (26%), Positives = 38/102 (37%), Gaps = 7/102 (6%)

Query: 17 NGANGAPGTGANGGDGGILFGSGGAGGSGAAGMAGGNGGAAGLFGNGGAGGAGGSATAGA 76
N + NGG G+ G G + GSG + GG +G + G G G+
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69

Query: 77 AGAGGNGGAGGL-------LFGTAGAGGNGGLSLGLGVAGGA 111
GG+G G L FG G L + ++ GA
Sbjct: 70 NSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 31.6 bits (71), Expect = 0.002
Identities = 26/104 (25%), Positives = 36/104 (34%)

Query: 49 MAGGNGGAAGLFGNGGAGGAGGSATAGAAGAGGNGGAGGLLFGTAGAGGNGGLSLGLGVA 108
M+GG+G + +G G T G G + G+G GG+G G +
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 109 GGAGGAGGSGGSDTAGHGGTGGAGGLLFGAGEDGTTPGGNGGAG 152
G G G +G GG A G + G GG
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 30.8 bits (69), Expect = 0.003
Identities = 23/74 (31%), Positives = 28/74 (37%)

Query: 109 GGAGGAGGSGGSDTAGHGGTGGAGGLLFGAGEDGTTPGGNGGAGGVAGLFGDGGNGGNAG 168
G GA + G+ G G G GG G+G GG+G G G+G G
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 169 VGTPAGNVGAGGTG 182
G G G GG
Sbjct: 68 NGNSGGGSGTGGNL 81



Score = 29.3 bits (65), Expect = 0.010
Identities = 31/90 (34%), Positives = 36/90 (40%), Gaps = 4/90 (4%)

Query: 74 AGAAGAGGNGGA----GGLLFGTAGAGGNGGLSLGLGVAGGAGGAGGSGGSDTAGHGGTG 129
+G G G N GA G + G G G GG S G G + GG GS GG+G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 130 GAGGLLFGAGEDGTTPGGNGGAGGVAGLFG 159
G G G+ GGN A FG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 28.5 bits (63), Expect = 0.017
Identities = 23/80 (28%), Positives = 27/80 (33%)

Query: 20 NGAPGTGANGGDGGILFGSGGAGGSGAAGMAGGNGGAAGLFGNGGAGGAGGSATAGAAGA 79
+G G G N G G G +G N GG+G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 80 GGNGGAGGLLFGTAGAGGNG 99
GNGG G G +G GGN
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
Rv3656c   MECHCHANNEL  24     0.030    Bacterial mechano-sensitive ion channel signature.
		>MECHCHANNEL#Bacterial mechano-sensitive ion channel signature.

Length = 136

Score = 24.4 bits (53), Expect = 0.030
Identities = 14/37 (37%), Positives = 21/37 (56%), Gaps = 1/37 (2%)

Query: 27 VEYAIGTIAAAAFGAILYTVVTGDSIVSALNRIIGRA 63
V+ A+G I AAFG I+ + + D I+ L +IG
Sbjct: 17 VDLAVGVIIGAAFGKIV-SSLVADIIMPPLGLLIGGI 52


54  Rv3816c  Rv3826  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv3816c   -2      13       -3.018356  Possible acyltransferase
Rv3817    -1      17       -3.629206  Possible phosphotransferase
Rv3818    -1      19       -4.964611  Unknown protein
Rv3819    -1      24       -5.753242  Unknown protein
Rv3820c   -2      17       -3.416940  Possible conserved polyketide synthase
Rv3821    -2      18       -3.628299  Probable conserved integral membrane protein
Rv3822    -2      17       -3.384143  Conserved hypothetical protein
Rv3823c   -2      17       -3.303273  Conserved integral membrane transport protein
Rv3824c   -3      16       -2.606487  Conserved polyketide synthase associated protein
Rv3825c   -3      14       -1.809063  Polyketide synthase Pks2
Rv3826    -1      16       -3.377359  Probable fatty-acid-AMP ligase FadD23
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv3823c   ACRIFLAVINRP  53     9e-09    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 52.5 bits (126), Expect = 9e-09
Identities = 42/240 (17%), Positives = 91/240 (37%), Gaps = 36/240 (15%)

Query: 205 ADLNLTGQRDRSR-IEFAI----------TILLLVILLIIYGNPITMVLPLITIGMSVVV 253
+ + D + ++ +I +L+ +++ + N ++P I + + +
Sbjct: 319 QGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVL-- 376

Query: 254 AQRLVAIAGLAGLGIANQSIIFMSGMMVGAGT--DYAVFLISRYHDYLR-QGADSDQAVK 310
L A LA G + + + M GM++ G D A+ ++ + +A +
Sbjct: 377 ---LGTFAILAAFGY-SINTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATE 432

Query: 311 KALTSIGKVIAASAATVAITFLGMVFT--QLG-ILKTVGPMLGISVAVVFFAAVTLLPAL 367
K+++ I + A ++ F+ M F G I + + ++A+ A+ L PAL
Sbjct: 433 KSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPAL 492

Query: 368 MVL-------------TGRRGWIAPRRDLTRRFWRSSGVHIVRRPKTHLLASALVLVILA 414
G GW D + + +S I+ +LL AL++ +
Sbjct: 493 CATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLLIYALIVAGMV 552


55  Rv3896c  Rv3901c  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv3896c   2       19       1.837613   Conserved hypothetical protein
Rv3897c   2       24       -0.299476  Conserved hypothetical protein
Rv3898c   2       23       -0.525298  Conserved hypothetical protein
Rv3899c   2       23       -0.471255  Conserved hypothetical protein
Rv3900c   3       25       -1.669502  Conserved hypothetical alanine rich protein
Rv3901c   2       21       -1.201454  Possible membrane protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
Rv3897c60KDINNERMP280.029 60kDa inner membrane protein signature.
		>60KDINNERMP#60kDa inner membrane protein signature.

Length = 548

Score = 28.0 bits (62), Expect = 0.029
Identities = 11/37 (29%), Positives = 21/37 (56%), Gaps = 1/37 (2%)

Query: 17 VGGVMGPLTQLPQQAMQAGQGAMQPLMSALQQTYGAE 53
V G+M PLT+ +M + +QP + A+++ G +
Sbjct: 365 VRGIMYPLTKAQYTSMAKMR-MLQPKIQAMRERLGDD 400


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv3899c   PRTACTNFAMLY  30     0.027    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 29.6 bits (66), Expect = 0.027
Identities = 43/188 (22%), Positives = 60/188 (31%), Gaps = 20/188 (10%)

Query: 2 VTGQPAAAGAHSLSEGAMTAMQSGSVPPPQAT---PPITTPPVVSAPTMAAGIEATHGPV 58
VT Q +A L GA+ ++Q +PP + +T P AP + + A+ +
Sbjct: 171 VTVQRSAIVDGGLHIGALQSLQPEDLPPSRVVLRDTNVTAVPASGAPAAVSVLGASELTL 230

Query: 59 DTPANTSG-APPASTGTTGPVAPTVVTAGPVAAPAAPVVGGSAVPAGPLPAYGSDLRPPV 117
D T G A + V T APA V G AVP G +P
Sbjct: 231 DGGHITGGRAAGVAAMQGAVVHLQRATIRRGDAPAGGAVPGGAVPGGAVPGGFGP----- 285

Query: 118 VAAPAVPSVPTAPVSGAPVAPSASSAPSAGGALVSPVERAASKAVAGQAGASSSTMAGAS 177
V S SS A + A A + G +
Sbjct: 286 ------GGFGPVLDGWYGVDVSGSSVELAQSIV-----EAPELGAAIRVGRGARVTVSGG 334

Query: 178 ALSATAGA 185
+LSA G
Sbjct: 335 SLSAPHGN 342


56  Rv0273c  Rv0279c  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv0273c   -2      11       -1.096465  Possible transcriptional regulatory protein
Rv0274    5       17       5.346379   Conserved protein
Rv0275c   10      22       8.112180   Possible transcriptional regulatory protein
Rv0276    11      19       7.740593   Conserved hypothetical protein
Rv0277c   11      21       7.750325   Possible toxin VapC25. Contains PIN domain.
Rv0278c   6       15       6.101805   PE-PGRS family protein PE_PGRS3
Rv0279c   0       10       2.922633   PE-PGRS family protein PE_PGRS4
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv0273c   HTHTETR       54     2e-11    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 54.2 bits (130), Expect = 2e-11
Identities = 27/173 (15%), Positives = 52/173 (30%), Gaps = 8/173 (4%)

Query: 1 MPDFPTQRGRRTQAAIDAAARTVVVRNGILATTVADITAEAGRSAASFYNYYDSKEAMVR 60
M Q + T+ I A + + G+ +T++ +I AG + + Y ++ K +
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 61 QWALRFRDDANQRALSVIRHGLSDRERAYEAAAAHWYTY-----RNRLAEAISVSQLAMV 115
+ + + L D H R RL I + V
Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120

Query: 116 SDDF--AQYWSEICQIPISFITETVKRAQAHGYCVGD-DPQLMAEAIVAMFNQ 165
+ Q +C I +T+K D + A + +
Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISG 173


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv0275c   HTHTETR       44     1e-07    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 43.8 bits (103), Expect = 1e-07
Identities = 29/195 (14%), Positives = 60/195 (30%), Gaps = 13/195 (6%)

Query: 14 AERLATRRRQSLSAGLDLLGSDQHDIAELTIRTICRRAGLSVRYFYESFTDKDEFVGRVF 73
+ R+ L L L Q ++ ++ I + AG++ Y F DK + ++
Sbjct: 6 KQEAQETRQHILDVALRLF--SQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIW 63

Query: 74 DWVVAELVATTQAAVTAVPA--REQTRAGMANIVRTITADARVGRLL-FSTQLANAVITR 130
+ + + P R + +++ + + R L+ V
Sbjct: 64 ELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEM 123

Query: 131 -KRAESSALFAMLSGQHAVDTLHA-------PANDHVKAVAHFAVGGVGQTISAWLAGDV 182
++ + S TL PA+ + A G + + WL
Sbjct: 124 AVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQ 183

Query: 183 RLDPDQLVDQLAALL 197
D + A+L
Sbjct: 184 SFDLKKEARDYVAIL 198


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv0278c   cloacin       44     2e-06    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 44.3 bits (104), Expect = 2e-06
Identities = 42/115 (36%), Positives = 53/115 (46%), Gaps = 3/115 (2%)

Query: 117 NGANGAPGTGANGGDGGWLIGNGGAGGSGAAGVNGGAGGNGGAGGNGGAGGLIGNGGAGG 176
N + NGG G +G G + GSG + N GG G+G + G G GNGG G
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69

Query: 177 AGGVASSGIGGSGGAGGNAMLFG--AGGAGGAGGGVVALTGGAGGAGGAGGNAGL 229
G SG GG+ A + FG A GAGG V+++ GA A A A L
Sbjct: 70 NSG-GGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMAAL 123



Score = 38.2 bits (88), Expect = 2e-04
Identities = 35/114 (30%), Positives = 41/114 (35%), Gaps = 6/114 (5%)

Query: 143 GSGAAGVNGGAGGNGGAGGNGGAGGLIGNGGAGGAGGVASSGIGGSGGAGGNAMLFGAGG 202
G G N GA G NGG GL GGA G +S GG+G G G
Sbjct: 3 GGDGRGHNTGAHSTSG-NINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 203 AGGAGGGVVALTGGAGGAGGAGGNAGLLFG-----AAGVGGAGGFTNGSALGGA 251
G GG + G G + A + FG G GG + AL A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 36.2 bits (83), Expect = 7e-04
Identities = 38/104 (36%), Positives = 43/104 (41%), Gaps = 9/104 (8%)

Query: 722 GTGGAGTNFGAGGNGGN--GGLFGAGGTGGAAGSGGSGITTGGGGHGGNAGLLSLGASGG 779
G G G N GA GN GG G G GGA+ G G G +G+ G SG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 780 AGGSGGASSLAGGAGGTGGNG-----ALLFGFRGAGGAGGHGGA 818
G G +S GG GTGGN + FGF G G A
Sbjct: 63 GNGGGNGNS--GGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 35.8 bits (82), Expect = 7e-04
Identities = 31/110 (28%), Positives = 41/110 (37%), Gaps = 3/110 (2%)

Query: 687 AGGNGGLFANGGAGGAGGFNAAGGNGGNGGLFGTGGTGGAGTNFGAGGNGGNGGLFGAGG 746
+GG+G G +G N G GG G + N GG+G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 747 TGGAAGSGGSGITTGGGGHGGNAGL---LSLGASGGAGGSGGASSLAGGA 793
G G+G SG +G GG+ A G G A S++ GA
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 34.7 bits (79), Expect = 0.002
Identities = 34/111 (30%), Positives = 43/111 (38%), Gaps = 5/111 (4%)

Query: 248 LGGAGGAGGAGGLFATGGV--GGSGGAGSSGGAGGAGGAGGL---FGAGGTGGHGGFADS 302
+ G G G G +T G GG G G GGA G +G G G S
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 303 SFGGVGGAGGAGGLFGAGGEGGSGGHSLVAGGDGGAGGNAGMLALGAAGGA 353
G GG G +GG G GG + + G + AG LA+ + GA
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 34.3 bits (78), Expect = 0.002
Identities = 21/71 (29%), Positives = 25/71 (35%)

Query: 671 TGGHGAAGGVPAGVGGAGGNGGLFANGGAGGAGGFNAAGGNGGNGGLFGTGGTGGAGTNF 730
TG H +G + G G G GG G G G G+G G G
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGN 70

Query: 731 GAGGNGGNGGL 741
GG+G G L
Sbjct: 71 SGGGSGTGGNL 81



Score = 34.3 bits (78), Expect = 0.002
Identities = 31/87 (35%), Positives = 36/87 (41%), Gaps = 3/87 (3%)

Query: 368 IGGAGGAGGNAGLLFGSGG-SGGAGGFGFADGGQGGPGGNAGTVFGSGGAGGNGGVGQGF 426
+ G G G N G SG +GG G G G G G ++ GG+G G G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 427 AGGIGGAGGTPGLIGNGGNGGNGGASA 453
G GG G G G G GGN A A
Sbjct: 61 GHGNGGGNGNSG--GGSGTGGNLSAVA 85



Score = 33.9 bits (77), Expect = 0.004
Identities = 34/110 (30%), Positives = 42/110 (38%), Gaps = 7/110 (6%)

Query: 215 GGAGGAGGAGGNAGLLFGAAGVGGAGGFTNGSALGG-----AGGAGGAGGLFATGGVGGS 269
G GA GN G G+G GG ++GS GG+G G G
Sbjct: 8 GHNTGAHSTSGNIN--GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 270 GGAGSSGGAGGAGGAGGLFGAGGTGGHGGFADSSFGGVGGAGGAGGLFGA 319
GG G+SGG G GG A G + GG+ + AG L A
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 33.9 bits (77), Expect = 0.004
Identities = 37/134 (27%), Positives = 51/134 (38%)

Query: 263 TGGVGGSGGAGSSGGAGGAGGAGGLFGAGGTGGHGGFADSSFGGVGGAGGAGGLFGAGGE 322
+GG G G+ +G G G GG G S GG G+G +G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 323 GGSGGHSLVAGGDGGAGGNAGMLALGAAGGAGGIGGDGGTLTAGGIGGAGGAGGNAGLLF 382
G+GG + +GG G GGN +A A G + G A I + A ++
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMA 121

Query: 383 GSGGSGGAGGFGFA 396
G G +G A
Sbjct: 122 ALKGPFKFGLWGVA 135



Score = 32.0 bits (72), Expect = 0.012
Identities = 37/113 (32%), Positives = 41/113 (36%), Gaps = 24/113 (21%)

Query: 357 GGDGGTLTAGGIGGAGGAGGNAGLLFGSGGSGGAGGFGFADGGQGGPGGNAGTVFGSGGA 416
GGDG G GA GN +GG G G G GSG +
Sbjct: 3 GGDG----RGHNTGAHSTSGN------------------INGGPTGLGVGGGASDGSGWS 40

Query: 417 GGNGGVGQGFAGGIGGAGGTPGLIGNGGNGGNGGASAVTGGNGGIGGTGVLIG 469
N G G GI GG+ GNGG GN G + TGGN V G
Sbjct: 41 SENNPWGGGSGSGIHWGGGSGH--GNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 31.6 bits (71), Expect = 0.017
Identities = 33/103 (32%), Positives = 38/103 (36%), Gaps = 2/103 (1%)

Query: 751 AGSGGSGITTGGGGHGGN--AGLLSLGASGGAGGSGGASSLAGGAGGTGGNGALLFGFRG 808
+G G G TG GN G LG GGA G SS GG G+G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 809 AGGAGGHGGAALTSIQQGGAGGAGGNGGLLFGSAGAGGAGGSG 851
G GG+G + S G F + GAGG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 31.6 bits (71), Expect = 0.019
Identities = 36/116 (31%), Positives = 43/116 (37%), Gaps = 8/116 (6%)

Query: 235 GVGGAGGFTNGSALGGAGGAGGAGGLFATGGVGGSGGAGSSGGAGGAGGAGGLFGAGGTG 294
G T+G+ GG G G GG A+ G G S GG G+G G G G
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGG--ASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 295 GHGGFADSSFGGVGGAGGAGGLFGAGGEGGSGGHSLVAGGDGGAGGNAGMLALGAA 350
G G GG+G G L G +L G GG + AL AA
Sbjct: 66 GGNG------NSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 30.8 bits (69), Expect = 0.027
Identities = 35/111 (31%), Positives = 43/111 (38%), Gaps = 4/111 (3%)

Query: 290 AGGTG-GHGGFADSSFGGVGGAGGAGGLFGAGGEGGSGGHSLVAGGDGGAGGNAGMLALG 348
+GG G GH A S+ G + G G+ G G GSG S GG+G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGV-GGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 349 AAGGAGGIGGDGGTLTAGGIGGAGGAGGNAGLLFGSGGSGGAGGFGFADGG 399
G GG G GG GG A A G F + + GAGG +
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFG--FPALSTPGAGGLAVSISA 109



Score = 30.8 bits (69), Expect = 0.030
Identities = 33/104 (31%), Positives = 42/104 (40%), Gaps = 5/104 (4%)

Query: 575 GNGGNGGHGATNTAATATGGAGGAGGILFGTGGNGGTGGIATGAGGIGGAGGAGGVSLLI 634
G+G GA +T+ GG G G G G + G+G + GG G+G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGV---GGGASDGSGWSSENNPW-GGGSGSGIHWGGG 59

Query: 635 GSGGTGGNGGNSIGVAGIGGAGGRGGDAGLLFGAAGTGGHGAAG 678
G GG GNS G +G GG A + FG GA G
Sbjct: 60 SGHGNGGGNGNSGGGSGTGG-NLSAVAAPVAFGFPALSTPGAGG 102



Score = 30.8 bits (69), Expect = 0.030
Identities = 28/93 (30%), Positives = 35/93 (37%), Gaps = 2/93 (2%)

Query: 338 AGGNAGMLALGAAGGAGGIGGDGGTLTAGGIGGAGGAGGNAGLLFGSGGSGGAGGFGFAD 397
+GG+ GA +G I G L GG G + +G G G G +
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 398 GGQGGPGGNAGTVFGSGGAGGNGGVGQGFAGGI 430
G GG GN+G GSG G V A G
Sbjct: 62 HGNGGGNGNSGG--GSGTGGNLSAVAAPVAFGF 92



Score = 30.5 bits (68), Expect = 0.039
Identities = 29/95 (30%), Positives = 38/95 (40%), Gaps = 9/95 (9%)

Query: 779 GAGGSGGASSLAGGAGGTGGNGALLFGFRGAGGAGGHGGAALTSIQQGGAGGAGGNGGLL 838
G G + GA S +G G G G GG + S + GG G+G
Sbjct: 6 GRGHNTGAHSTSGNINGGPT---------GLGVGGGASDGSGWSSENNPWGGGSGSGIHW 56

Query: 839 FGSAGAGGAGGSGANALGAGTGGTGGDGGHAGVFG 873
G +G G GG+G + G+GTGG FG
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv0279c   cloacin       38     1e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.2 bits (88), Expect = 1e-04
Identities = 31/102 (30%), Positives = 41/102 (40%), Gaps = 4/102 (3%)

Query: 117 NGTNGAPGTGANGGDGGWLIGNGGAGGSGAAGVN----GGAGGNGGAGGLIGNGGAGGAG 172
N + NGG G +G G + GSG + N GG+G GG G+G GG G
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69

Query: 173 GRASTGTGGAGGAGGAAGMLFGAAGVGGPGGFAAAFGATGGA 214
G + AA + FG + PG A + GA
Sbjct: 70 NSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 36.6 bits (84), Expect = 4e-04
Identities = 32/107 (29%), Positives = 45/107 (42%), Gaps = 4/107 (3%)

Query: 628 GTGGLFANGGAGGAGGFGANGGTGGNGLLFGAGGTGGAGTLGADGGAGGHGGLFGAGGTG 687
G G N GA G G TG + G+G + GG G G +G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 688 GAGGSSGGTFGGNGGSGGNAGLLALGASG----GAGGSGGSALNVGG 730
G GG +G + GG+G G + + A A G G+GG A+++
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109



Score = 35.8 bits (82), Expect = 6e-04
Identities = 30/82 (36%), Positives = 38/82 (46%)

Query: 283 AGGTGGSGGTSIINVGGNGGAGGDAGMLSLGAAGGAGGSGGSNPDGGGGAGGIGGDGGTL 342
+GG G T + GN G + GA+ G+G S +NP GGG GI GG+
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 343 FGSGGAGGVCGLGFDAGGAGGA 364
G+GG G G G GG A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSA 83



Score = 34.7 bits (79), Expect = 0.002
Identities = 26/78 (33%), Positives = 33/78 (42%)

Query: 693 SGGTFGGNGGSGGNAGLLALGASGGAGGSGGSALNVGGTGGVGGNGGSGGSLFGFGGAGG 752
SGG G+ + G G G GG++ G + GG GS +GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 753 TGGSSGIGSSGGTGGDGG 770
G G G+SGG G GG
Sbjct: 62 HGNGGGNGNSGGGSGTGG 79



Score = 33.9 bits (77), Expect = 0.003
Identities = 29/110 (26%), Positives = 40/110 (36%), Gaps = 2/110 (1%)

Query: 211 TGGAGGAGGNGGLFADGGVGGAGGATDAGTGGAGGSG--GNGGLFGAGGTGGPGGFGIFG 268
+GG G G G + G G G + GSG +G G G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 269 GGAGGDGGSGGLFGAGGTGGSGGTSIINVGGNGGAGGDAGMLSLGAAGGA 318
G GG G+ G G S + + G + AG L++ + GA
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 33.5 bits (76), Expect = 0.003
Identities = 32/95 (33%), Positives = 37/95 (38%), Gaps = 9/95 (9%)

Query: 682 GAGGTGGAGGSSGGTFGGNGGSGGNAGLLALGASGGAGGSGGSALNVGGTGGVGGNGGSG 741
G G GA +SG GG G G G AS G+G S + GG+G GG
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGG-----ASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 742 GSLFGFGGAGGTGGSSGIGSSGGTGGDGGTAGVFG 776
G G GG G S G +GG FG
Sbjct: 61 GH----GNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 32.8 bits (74), Expect = 0.006
Identities = 33/105 (31%), Positives = 41/105 (39%), Gaps = 1/105 (0%)

Query: 659 AGGTGGAGTLGADGGAGGHGGLFGAGGTGGAGGSSGGTFGGNGGSGGNAGLLALGASGGA 718
+GG G GA +G G G GG G N GG +G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 719 GGSGGSALNVGGTGGVGGNGGSGGSLFGFG-GAGGTGGSSGIGSS 762
G+GG N GG G GGN + + FG A T G+ G+ S
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVS 106



Score = 32.8 bits (74), Expect = 0.007
Identities = 31/93 (33%), Positives = 37/93 (39%), Gaps = 2/93 (2%)

Query: 377 AGGAGGGSFAGAGGTGGA--GGAPGLVGNAGNGGNGGASANGAGAAGGAGGSGVLIGNGG 434
+GG G G GA T G GG GL G G S+ GG+G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 435 NGGSGGTGAPAGTAGAGGLGGQLLGRDGFNAPA 467
+G GG G G +G GG + F PA
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPA 94



Score = 32.4 bits (73), Expect = 0.008
Identities = 32/120 (26%), Positives = 38/120 (31%), Gaps = 12/120 (10%)

Query: 219 GNGGLFADGGVGGAGGATDAGTGGAGGSGGNGGLFGAGGTGGPGGFGIFGGGAGGDGGSG 278
G G + G G + G G G GG G P G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWG--------GGSGSGI 54

Query: 279 GLFGAGGTGGSGGTSIINVGGNGGAGGDAGMLSLGAAGGAGGSGGSNPDGGGGAGGIGGD 338
G G G GG N GG+G + ++ A G S P GG A I
Sbjct: 55 HWGGGSGHGNGGG----NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 32.4 bits (73), Expect = 0.009
Identities = 32/101 (31%), Positives = 38/101 (37%)

Query: 575 SGGTGGSGGFGLDTGGAGGRGGDAGLFLGAAGTGGQAALSQNFIGAGGTAGAGGTGGLFA 634
SGG G G + GG GL +G + G S+N GG+ GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 635 NGGAGGAGGFGANGGTGGNGLLFGAGGTGGAGTLGADGGAG 675
+G GG G G GTGGN A G L G G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 32.0 bits (72), Expect = 0.010
Identities = 37/132 (28%), Positives = 46/132 (34%), Gaps = 16/132 (12%)

Query: 242 GAGGSGGNGGLFGAGGT--GGPGGFGIFGGGAGGDGGSGGLFGAGGTGGSGGTSIINVGG 299
G G G N G G GGP G G+ GG + G G S GG GSG I+ GG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSG----IHWGG 58

Query: 300 NGGAGGDAGMLSLGAAGGAGGSGGSNPDGGGGAGGIGGDGGTLFGSGGAGGVCGLGFDAG 359
G G GG G+ G GG + F + G GL
Sbjct: 59 GSGHGN----------GGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSIS 108

Query: 360 GAGGAGGKAGLL 371
+ A ++
Sbjct: 109 AGALSAAIADIM 120



Score = 32.0 bits (72), Expect = 0.011
Identities = 39/135 (28%), Positives = 47/135 (34%), Gaps = 10/135 (7%)

Query: 139 GGAGGSGAAGVNGGAGGNGGAGGLIGNGGAGGAGGRASTGTGGAGGAGGAAGMLFGAAGV 198
G G + NGG GL GGA G +S GG+G G+
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 199 GGPGGFAAAFGATGGAGGAGGNGGLFADGGVGGAGGATDAGTGGAGGSGGNGGLFGA--- 255
G G G +GG G GGN A G + G GG S G L A
Sbjct: 64 NGGGN-----GNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118

Query: 256 --GGTGGPGGFGIFG 268
GP FG++G
Sbjct: 119 IMAALKGPFKFGLWG 133



Score = 32.0 bits (72), Expect = 0.013
Identities = 31/114 (27%), Positives = 39/114 (34%), Gaps = 5/114 (4%)

Query: 171 AGGRASTGTGGAGGAGGAAGMLFGAAGVGGPGGFAAAFGATGGAGGAGGNGGLFADGGVG 230
+GG GA G GVGG + + + G G G+ GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 231 GAGGATDAGTGGAGGSGGNGGLFGAGGTGGPGGFGIFGGGAGGDGGSGGLFGAG 284
G + +GG G+GGN A P FG G GG AG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAA-----PVAFGFPALSTPGAGGLAVSISAG 110



Score = 31.2 bits (70), Expect = 0.017
Identities = 35/110 (31%), Positives = 44/110 (40%), Gaps = 14/110 (12%)

Query: 619 GAGGTAGAGGTGGLFANGGAGGAGGFGANGGTG----------GNGLLFGAGGTGGAGTL 668
G G GA T G G G G GA+ G+G G+G GG G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 669 GADGGAGGHGGLFGAGGTGGAGGSSGGTFGGNGGSGGNAGLLALGASGGA 718
G +G +GG G G + ++ FG S AG LA+ S GA
Sbjct: 66 GGNGNSGGGSG----TGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 31.2 bits (70), Expect = 0.018
Identities = 25/79 (31%), Positives = 33/79 (41%), Gaps = 3/79 (3%)

Query: 353 GLGFDAGGAGGAGGKAGLLIGAGGAGGAGGGSFAGAGGT---GGAGGAPGLVGNAGNGGN 409
G G + G +G G G G GGA GS + GG+G G +G+G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 410 GGASANGAGAAGGAGGSGV 428
GG +G G+ G S V
Sbjct: 66 GGNGNSGGGSGTGGNLSAV 84



Score = 30.5 bits (68), Expect = 0.036
Identities = 34/121 (28%), Positives = 41/121 (33%), Gaps = 5/121 (4%)

Query: 640 GAGGFGANGGTGGNGLLFGAGGTGGAGTLGADGGAG-GHGGLFGAGGTGGAGGSSGGTFG 698
G G G N G G TG GA G+G GG+G GG+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 699 GNGGSGGNAGLLALGASGGAGGSGGSALNVGGTGGVGGNGGSGGSLFGFGGAGGTGGSSG 758
GNGG GN+G G SG G A V G+GG + +
Sbjct: 63 GNGGGNGNSG----GGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118

Query: 759 I 759
I
Sbjct: 119 I 119



Score = 30.1 bits (67), Expect = 0.041
Identities = 33/113 (29%), Positives = 40/113 (35%), Gaps = 2/113 (1%)

Query: 531 GGDGGSGGAGGILSGIGGTGGSGGIGTTGQGGTGGTGGAALLIGSGGTG-GSGGFGLDTG 589
GGDG G + GG G+G G G + GG+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 590 GAGGRGGDAGLFLGAAGTGGQAALSQNF-IGAGGTAGAGGTGGLFANGGAGGA 641
G GG G++G G G A F A T GAGG + G A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115


57  Rv0450c  Rv0456A  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv0450c   1       15       -4.048247  Probable conserved transmembrane transport
Rv0451c   0       13       -1.160541  Probable conserved membrane protein MmpS4
Rv0452    1       13       -0.215699  Possible transcriptional regulatory protein
Rv0453    1       14       -0.948137  PPE family protein PPE11
Rv0454    -1      14       -2.308502  Conserved hypothetical protein
Rv0455c   -2      12       -1.579471  Conserved protein
Rv0456c   -2      12       -0.598568  enoyl-CoA hydratase EchA2 (enoyl hydrase)
Rv0456A   -2      13       -0.857009  Possible toxin MazF1
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv0450c   ACRIFLAVINRP  50     4e-08    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 50.2 bits (120), Expect = 4e-08
Identities = 41/206 (19%), Positives = 74/206 (35%), Gaps = 13/206 (6%)

Query: 727 IKSIDAIRTAAEESLKGTPLEDAKIYLAGTAAVFHDIS-EGAQWDLLIAAISSLCLIFII 785
+ + AI+ A L+ + K+ F +S L A + L+F++
Sbjct: 300 LDTAKAIK-AKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIM----LVFLV 354

Query: 786 MLIITRAFIAAAVIVGTVALSLGASFGLSVLLWQHILAIHLHWLVLAMSVIVLLAVGSDY 845
M + + A + V + L +F + I + + +VLA+ ++V A+
Sbjct: 355 MYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVE 414

Query: 846 NLLLVSRFKQEIGAGLKTGIIRSMGGTGKVVTNAGLVFAVT---MASMAVSDLRVIGQVG 902
N V R E K +SM + +V + MA S + Q
Sbjct: 415 N---VERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFS 471

Query: 903 TTIGLGLLFDTLIVRSFMTPSIAALL 928
TI + L+ TP++ A L
Sbjct: 472 ITIVSAMALSVLVALIL-TPALCATL 496



Score = 45.6 bits (108), Expect = 1e-06
Identities = 42/238 (17%), Positives = 83/238 (34%), Gaps = 32/238 (13%)

Query: 208 VAVIFIMLLLVYRSIITVVLLLITVGVELTAARGVVAVLGHSGAIGLTTFAVSLLTSLAI 267
+ ++F+++ L +++ ++ I V V L ++A G+S LT F + L AI
Sbjct: 348 IMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINT-LTMFGMVL----AI 402

Query: 268 AAGTDYGIFIIGRYQEARQAGED--KEAAYYTMYRGTAHVILGSGLTIAGA---TFCLSF 322
D I ++ + + KEA +M ++G + ++
Sbjct: 403 GLLVDDAIVVVENVERVMMEDKLPPKEATEKSM-SQIQGALVGIAMVLSAVFIPMAFFGG 461

Query: 323 ARMPYFQTLGIPCAVGMLVAVAVALTLGPAVLHV-------------GSRFGLFDPKRLL 369
+ ++ I M ++V VAL L PA+ G FG F+
Sbjct: 462 STGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDH 521

Query: 370 KVRGWRRVGTVVVRWPLPVLVATCAIALVGLLALPGYKTSYNDRDYLPDFIPANQGYA 427
V + ++ L+ ++A +LP+ +QG
Sbjct: 522 SVNHYTNSVGKILGSTGRYLL-----IYALIVAGMVVLFLRLPSSFLPEE---DQGVF 571



Score = 38.7 bits (90), Expect = 1e-04
Identities = 45/214 (21%), Positives = 84/214 (39%), Gaps = 15/214 (7%)

Query: 153 QGTPLANESVEAVRSIVESTPA--PPGIKAYVTGPSALAADMHHSGDRSMARITMVTVAV 210
QG S +++E+ + P GI TG ++ SG+++ A + ++ V
Sbjct: 827 QGEAAPGTSSGDAMALMENLASKLPAGIGYDWTG---MSYQERLSGNQAPA-LVAISFVV 882

Query: 211 IFIMLLLVYRSIITVVLLLITVGVELTAARGVVAVLGHSGAIGLTTFAVSLLTSLAIAAG 270
+F+ L +Y S V +++ V L ++A + + F V LLT++ ++A
Sbjct: 883 VFLCLAALYESWSIPVSVMLV--VPLGIVGVLLAATLFNQKNDV-YFMVGLLTTIGLSAK 939

Query: 271 TDYGIFIIGRYQEARQA-GEDKEAAYYTMYRGTAHVILGSGLTIAGATFCLSFARMP--- 326
I I+ ++ + G+ A R IL + L L+ +
Sbjct: 940 N--AILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSG 997

Query: 327 YFQTLGIPCAVGMLVAVAVALTLGPAVLHVGSRF 360
+GI GM+ A +A+ P V R
Sbjct: 998 AQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRRC 1031


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv0452    HTHTETR       67     1e-15    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 67.0 bits (163), Expect = 1e-15
Identities = 20/141 (14%), Positives = 51/141 (36%), Gaps = 3/141 (2%)

Query: 16 RTEENKRQRAAALVEAARSLALETGVASVTLTAVAGRAGIHYSAVRRYFTSHKEVLLHLA 75
+T++ ++ +++ A L + GV+S +L +A AG+ A+ +F ++ +
Sbjct: 4 KTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIW 63

Query: 76 AEGWARWSGTVCEQLGEPGPMSAPRVAEALANGLAAD---PLFCDLLANLHLHLEQEVDV 132
+ E + + E L + L + L+ + E ++
Sbjct: 64 ELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEM 123

Query: 133 DRVIEVKRTSIAAVIALVDAI 153
V + +R ++
Sbjct: 124 AVVQQAQRNLCLESYDRIEQT 144


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv0453    PF03544       33     0.002    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 33.0 bits (75), Expect = 0.002
Identities = 20/72 (27%), Positives = 27/72 (37%)

Query: 320 VPQPPVVPAPAPDAVVPTVLPLAGTATPTTAPASAPAAGAAPGPPAGTATATSASVPTSA 379
+ +P P P P V P + PAS A P + TATA ++ TS
Sbjct: 94 IEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSV 153

Query: 380 GGFPPYLVGSGP 391
P L + P
Sbjct: 154 ASGPRALSRNQP 165


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv0456A   TYPE3IMSPROT  26     0.021    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 26.3 bits (58), Expect = 0.021
Identities = 12/64 (18%), Positives = 24/64 (37%), Gaps = 13/64 (20%)

Query: 2 LRGEIWQVDLDPARGSAA-NMRRPAVIVSNDRANAAAIRLDRGVVPVVPVTSNTEKVPIP 60
++ + Q + + N++R +V+V+N A I R + P+P
Sbjct: 234 IKSKRRQFHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKR------------GETPLP 281

Query: 61 GVVA 64
V
Sbjct: 282 LVTF 285


58  Rv0681  Rv0691c  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv0681    3       19       -2.918113  Probable transcriptional regulatory protein
Rv0682    2       16       -2.991961  30S ribosomal protein S12 RpsL
Rv0683    2       11       -2.616721  30S ribosomal protein S7 RpsG
Rv0684    2       12       -2.230946  Probable elongation factor G FusA1 (EF-G)
Rv0685    0       10       -1.008505  Probable iron-regulated elongation factor TU Tuf
Rv0686    -1      12       -1.378298  Probable membrane protein
Rv0687    0       13       -1.138690  Probable short-chain type
Rv0688    0       13       -1.070126  Putative ferredoxin reductase
Rv0689c   -3      14       -0.703927  Hypothetical protein
Rv0690c   -2      13       -1.127676  Conserved hypothetical protein
Rv0691c   -2      13       -1.023353  Probable transcriptional regulatory protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv0681    TETREPRESSOR  50     7e-10    Tetracycline repressor protein signature.
		>TETREPRESSOR#Tetracycline repressor protein signature.

Length = 218

Score = 49.9 bits (119), Expect = 7e-10
Identities = 41/176 (23%), Positives = 71/176 (40%), Gaps = 25/176 (14%)

Query: 5 AKLSRESIVEGALTFLDREGWDSLTINALATQLGTKGPSLYNHVDSLEDLRRAVRIRVID 64
A+L+RES+++ AL L+ G D LT LA +LG + P+LY HV + +RA+ +
Sbjct: 2 ARLNRESVIDAALELLNETGIDGLTTRKLAQKLGIEQPTLYWHVKN----KRALLDALAV 57

Query: 65 DIITMLNRVGAGRARDDAVLVMAGAYRSY-----AHHHPGRYSAFTRMPLGGDDPEYTAA 119
+I+ + A + + S+ + + TR D+ +Y
Sbjct: 58 EILARHHDYSLPAAGESWQSFLRNNAMSFRRALLRYRDGAKVHLGTR----PDEKQYDTV 113

Query: 120 TRGAAAPVIAVLSSYGLDGEQAFYAALEFWSALHGFVLLEMTGVMDDIDTDAVFTD 175
+ ++ G YA SA+ F L V++ + A TD
Sbjct: 114 ET-----QLRFMTENGFSLRDGLYAI----SAVSHFTLGA---VLEQQEHTAALTD 157


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv0684    TCRTETOQM     585    0.0      Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 585 bits (1510), Expect = 0.0
Identities = 162/678 (23%), Positives = 305/678 (44%), Gaps = 71/678 (10%)

Query: 12 RVRNFGIMAHIDAGKTTTTERILYYTGINYKIGEVHDGAATMDWMEQEQERGITITSAAT 71
++ N G++AH+DAGKTT TE +LY +G ++G V G D E++RGITI + T
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61

Query: 72 TTFWKDNQLNIIDTPGHVDFTVEVERNLRVLDGAVAVFDGKEGVEPQSEQVWRQADKYDV 131
+ W++ ++NIIDTPGH+DF EV R+L VLDGA+ + K+GV+ Q+ ++ K +
Sbjct: 62 SFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGI 121

Query: 132 PRICFVNKMDKIGADFYFSVRTMGERLGANAVPIQLPVGAEADFEGVVDLVEMNAKVWRG 191
P I F+NK+D+ G D + + E+L A V Q
Sbjct: 122 PTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQ------------------------- 156

Query: 192 ETKLGETYDTVEIPADLAEQAEEYRTKLLEVVAESDEHLLEKYLGGEELTVDEIKGAIRK 251
+ + ++E++ + V E ++ LLEKY+ G+ L E++
Sbjct: 157 -----KVELYPNMCVTNFTESEQW-----DTVIEGNDDLLEKYMSGKSLEALELEQEESI 206

Query: 252 LTIASEIYPVLCGSAFKNKGVQPMLDAVVDYLPSPLDVPPAIGHAPAKEDEEVVRKATTD 311
++PV GSA N G+ +++ + + S
Sbjct: 207 RFHNCSLFPVYHGSAKNNIGIDNLIEVITNKFYSSTH--------------------RGQ 246

Query: 312 EPFAALAFKIATHPFFGKLTYIRVYSGTVESGSQVINATKGKKERLGKLFQMHSNKENPV 371
FKI +L YIR+YSG + V + K K ++ +++ + + +
Sbjct: 247 SELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKI-KITEMYTSINGELCKI 305

Query: 372 DRASAGHIYAVIG----LKDTTTGDTLSDPNQQIVLESMTFPDPVIEVAIEPKTKSDQEK 427
D+A +G I + L GDT P + E + P P+++ +EP +E
Sbjct: 306 DKAYSGEIVILQNEFLKLNS-VLGDTKLLPQR----ERIENPLPLLQTTVEPSKPQQREM 360

Query: 428 LSLSIQKLAEEDPTFKVHLDSETGQTVIGGMGELHLDILVDRMRREFKVEANVGKPQVAY 487
L ++ ++++ DP + ++DS T + ++ +G++ +++ ++ ++ VE + +P V Y
Sbjct: 361 LLDALLEISDSDPLLRYYVDSATHEIILSFLGKVQMEVTCALLQEKYHVEIEIKEPTVIY 420

Query: 488 KETIKRLVQNVEYTHKKQTGGSGQFAKVIINLEPFTGEEGATYEFESKVTGGRIPREYIP 547
E + EYT + + +A + +++ P G+ ++ES V+ G + + +
Sbjct: 421 MERPLK---KAEYTIHIEVPPNPFWASIGLSVSP--LPLGSGMQYESSVSLGYLNQSFQN 475

Query: 548 SVDAGAQDAMQYGVLAGYPLVNLKVTLLDGAYHEVDSSEMAFKIAGSQVLKKAAALAQPV 607
+V G + + G L G+ + + K+ G Y+ S+ F++ VL++ A
Sbjct: 476 AVMEGIRYGCEQG-LYGWNVTDCKICFKYGLYYSPVSTPADFRMLAPIVLEQVLKKAGTE 534

Query: 608 ILEPIMAVEVTTPEDYMGDVIGDLNSRRGQIQAMEERAGARVVRAHVPLSEMFGYVGDLR 667
+LEP ++ ++ P++Y+ D I + + ++ +P + Y DL
Sbjct: 535 LLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKNNEVILSGEIPARCIQEYRSDLT 594

Query: 668 SKTQGRANYSMVFDSYSE 685
T GR+ Y
Sbjct: 595 FFTNGRSVCLTELKGYHV 612


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv0685    TCRTETOQM     80     3e-18    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 79.9 bits (197), Expect = 3e-18
Identities = 53/153 (34%), Positives = 81/153 (52%), Gaps = 9/153 (5%)

Query: 13 VNIGTIGHVDHGKTTLTAAITKVLHDK--FPDLNETKAFD-QIDNAPEERQRGITINIAH 69
+NIG + HVD GKTTLT ++ L++ +L + DN ERQRGITI
Sbjct: 4 INIGVLAHVDAGKTTLTESL---LYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGI 60

Query: 70 VEYQTDKRHYAHVDAPGHADYIKNMITGAAQMDGAILVVAATDGPMPQTREHVLLARQVG 129
+Q + +D PGH D++ + + +DGAIL+++A DG QTR R++G
Sbjct: 61 TSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMG 120

Query: 130 VPYILVALNKADAVDDEELLELVEMEVRELLAA 162
+P I +NK D + L V +++E L+A
Sbjct: 121 IPTI-FFINKIDQNGID--LSTVYQDIKEKLSA 150


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv0687    DHBDHDRGNASE  124    3e-36    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 124 bits (311), Expect = 3e-36
Identities = 93/278 (33%), Positives = 138/278 (49%), Gaps = 27/278 (9%)

Query: 1 MSARGGSLHGRVAFVTGAARAQGRSHAVRLAREGADIVALDICAPVSGSVTYPPATSEDL 60
M+A+G + G++AF+TGAA+ G + A LA +GA I A+D P E L
Sbjct: 1 MNAKG--IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVD-YNP------------EKL 45

Query: 61 GETVRAVEAEGRKVLAREVDIRDDAELRRLVADGVEQFGRLDIVVANAGVLGWGRLWELT 120
+ V +++AE R A D+RD A + + A + G +DI+V AGVL G + L+
Sbjct: 46 EKVVSSLKAEARHAEAFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLS 105

Query: 121 DEQWETVIGVNLTGTWRTLRATVPAMIDAGNGGSIVVVSSSAGLKATPGNGHYAASKHAL 180
DE+WE VN TG + R+ M+D GSIV V S+ YA+SK A
Sbjct: 106 DEEWEATFSVNSTGVFNASRSVSKYMMDR-RSGSIVTVGSNPAGVPRTSMAAYASSKAAA 164

Query: 181 VALTNTLAIELGEFGIRVNSIHPYSVDTPM-----IEPEAMIQTFAKHPGYVHSFP-PMP 234
V T L +EL E+ IR N + P S +T M + Q G + +F +P
Sbjct: 165 VMFTKCLGLELAEYNIRCNIVSPGSTETDMQWSLWADENGAEQVIK---GSLETFKTGIP 221

Query: 235 LQPKGFMTPDEISDVVVWLAGDGSGALSGNQIPVDKGA 272
L K P +I+D V++L +G ++ + + VD GA
Sbjct: 222 L--KKLAKPSDIADAVLFLVSGQAGHITMHNLCVDGGA 257


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv0691c   HTHTETR       117    2e-35    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 117 bits (295), Expect = 2e-35
Identities = 35/205 (17%), Positives = 71/205 (34%), Gaps = 15/205 (7%)

Query: 4 ESRVGRRRSTTPHHISDVAIELFAAHGFTDVSVDDIARAAGIARRTLFRYYASKNAIPWG 63
+ + T HI DVA+ LF+ G + S+ +IA+AAG+ R ++ ++ K+ +
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 64 DFSTHLAQLQGLLDNIDSRI--QLRDALRAALLAFNTFDESETIRHRKRMRVILQTPELQ 121
+ + + L ++ LR L+ + + T R+ + I+
Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHV--LESTVTEERRRLLMEIIFHKCEF 119

Query: 122 AYSMTMYAGWR-----EVIAKFV-----ARRSGGKTTDFMPQTVAWTMLGVALSAYEHW- 170
M + + E + + D M + A M G E+W
Sbjct: 120 VGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWL 179

Query: 171 LRDESVSLTEALGAAFDVVGAGLDR 195
+S L + ++
Sbjct: 180 FAPQSFDLKKEARDYVAILLEMYLL 204


59Rv0754Rv0758N        Y        YPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
Rv0754312-1.345947PE-PGRS family protein PE_PGRS11
Rv0755c314-2.758797PPE family protein PPE12
Rv0755A-214-0.704488Putative transposase (fragment)
Rv0756c-111-0.686081*Unknown protein
Rv0757-111-1.606801Possible two component system response
Rv0758-112-1.283263Possible two component system response sensor
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv0754    cloacin  42     6e-06    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 42.0 bits (98), Expect = 6e-06
Identities = 34/110 (30%), Positives = 42/110 (38%), Gaps = 6/110 (5%)

Query: 175 GDGGAGYTGGNGGSAGLIGNGGTGGAGFAGGVGGMGGTGGWLMGNGGMGGAG-GVGGNGG 233
G G G+ G ++G I NGG G G GG G GG G+G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNI-NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 234 AGGQALLFGNGGLGGAGGAGGVDGAIGRGGWFIGTGGMATIGGGGNGQSI 283
G G G G G G + + G ++T G GG SI
Sbjct: 62 HGNG----GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSI 107



Score = 41.6 bits (97), Expect = 8e-06
Identities = 36/98 (36%), Positives = 40/98 (40%), Gaps = 8/98 (8%)

Query: 157 GVTGTAAAPNGGPGGLLFGDGGAGYTGGNGGSAGLIGNGGTG-GAGFAGGVGGMGGTGGW 215
G T+ NGGP GL G GG G S GG+G G + GG G G
Sbjct: 12 GAHSTSGNINGGPTGL--GVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGG--- 66

Query: 216 LMGNGGMGGAGGVGGNGGAGGQALLFGNGGLGGAGGAG 253
GNG GG G GGN A + FG L G G
Sbjct: 67 --GNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 37.4 bits (86), Expect = 1e-04
Identities = 39/137 (28%), Positives = 56/137 (40%), Gaps = 13/137 (9%)

Query: 197 TGGAGFAGGVGGMGGTGGWLMGNGGMGGAGGVGGNGGAGGQALLFGNGGLGGAGGAGGVD 256
+GG G G G T G + G G GG G + G+G + GG G+G G
Sbjct: 2 SGGDG-RGHNTGAHSTSGNINGGPTGLGVGG-GASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 257 GAIGRGGWFIGTGGMATIGGGGNGQSIVIDF-VRHGQTPGNAAMLIDTAVPGPGLTALGQ 315
G GG +GG + GG + + + F TPG G ++
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAG---------GLAVSISAG 110

Query: 316 QQAQAIANALAA-KGPY 331
+ AIA+ +AA KGP+
Sbjct: 111 ALSAAIADIMAALKGPF 127


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv0755c   cloacin  35     0.001    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 35.1 bits (80), Expect = 0.001
Identities = 28/93 (30%), Positives = 32/93 (34%), Gaps = 2/93 (2%)

Query: 253 SGNIGNANVGGGNSGDNNFGFGNFGNANIGIGNAGPNMSSPAVPTPGNGNVGIGNGGNGN 312
SG G + G +S N G G G + G SS P G GI GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS- 60

Query: 313 FGGGNTGNANIGLGNVGDGNVGFGNSGSYNFGF 345
G GN G G G G + FGF
Sbjct: 61 -GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGF 92



Score = 32.8 bits (74), Expect = 0.004
Identities = 27/75 (36%), Positives = 36/75 (48%), Gaps = 1/75 (1%)

Query: 310 NGNFGGGNTGNANIGLGNVGDGNVGFGNSGSYNFGFGNTGNNNIGIGLTGSNQIGFGGLN 369
+G G G+ A+ GN+ G G G G + G G + NN G +GS GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 370 SGSGNIGFGNSGTGN 384
G+G G GNSG G+
Sbjct: 62 HGNGG-GNGNSGGGS 75



Score = 30.1 bits (67), Expect = 0.033
Identities = 27/92 (29%), Positives = 37/92 (40%), Gaps = 15/92 (16%)

Query: 226 GSGNIGNFNLGSGNGNVGIGPSSFNVGSGNIGNANVGGGNSGDNNFGFGNFGNANIGIGN 285
G G N S +GN+ GP+ VG GG + G G+ + N G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVG---------GGASDGS---GWSSENNPWGGGSG 51

Query: 286 AGPNMSSPAVPTPGNGNVGIGNGGNGNFGGGN 317
+G + + G GN GN G G+ GGN
Sbjct: 52 SGIHWGGGSGHGNGGGN---GNSGGGSGTGGN 80


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
Rv0757    HTHFIS  106    2e-28    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 106 bits (265), Expect = 2e-28
Identities = 36/136 (26%), Positives = 63/136 (46%)

Query: 21 ARVLVVDDEANIVELLSVSLKFQGFEVYTATNGAQALDRARETRPDAVILDVMMPGMDGF 80
A +LV DD+A I +L+ +L G++V +N A D V+ DV+MP + F
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 81 GVLRRLRADGIDAPALFLTARDSLQDKIAGLTLGGDDYVTKPFSLEEVVARLRVILRRAG 140
+L R++ D P L ++A+++ I G DY+ KPF L E++ + L
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 141 KGNKEPRNVRLTFADI 156
+ + + +
Sbjct: 124 RRPSKLEDDSQDGMPL 139


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv0758    PF06580  37     1e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 36.8 bits (85), Expect = 1e-04
Identities = 20/109 (18%), Positives = 37/109 (33%), Gaps = 26/109 (23%)

Query: 366 VLRNLVANAIQH----TPESADVTVRVGTEGDDAILEVADDGPGMSQEDALRVFERFYRA 421
+++ LV N I+H P+ + ++ + LEV + G +
Sbjct: 259 LVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNT----------- 307

Query: 422 DSSRARASGGTGLGLSIVDS-LVAAHGG--AVTVTTALGEGCCFRVSLP 467
TG GL V L +G + ++ G+ V +P
Sbjct: 308 -------KESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNA-MVLIP 348


60  Rv0872c  Rv0879c  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv0872c   0       7        2.181736   PE-PGRS family protein PE_PGRS15
Rv0873    -1      9        -0.207278  Probable acyl-CoA dehydrogenase FadE10
Rv0874c   -1      11       0.804647   Conserved hypothetical protein
Rv0875c   2       12       -0.056258  Possible conserved exported protein
Rv0876c   0       14       0.114460   Possible conserved transmembrane protein
Rv0877    1       16       0.468091   Conserved hypothetical protein
Rv0878c   2       15       0.668359   PPE family protein PPE13
Rv0879c   -1      12       1.195848   Possible conserved transmembrane protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv0872c   cloacin  37     1e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 37.4 bits (86), Expect = 1e-04
Identities = 30/85 (35%), Positives = 40/85 (47%)

Query: 427 AGGDGGAANTDSAGSSRKAFGGDGGVGGDGASALGTGGEGGIGGQGGNGGAGGLLIGNGG 486
+GGDG NT + +S GG G+G G ++ G+G GG G+G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 487 AGGVGGTAGAGGTGGSGGAGGAGGA 511
G GG +GG G+GG A A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 36.6 bits (84), Expect = 3e-04
Identities = 30/88 (34%), Positives = 34/88 (38%), Gaps = 4/88 (4%)

Query: 486 GAGGVGGTAGAGGTGGSGGAGGAGGAGGGGTNSGPGAAFGGNGNTGGNGGNGGAPGALGG 545
G G G GA T G+ G G GGG + G G + N GG+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 546 KGGSGGLIGRAGSDGGVGAGGAGGAGGA 573
G G S GG G GG A A
Sbjct: 63 GNGGGN----GNSGGGSGTGGNLSAVAA 86



Score = 36.2 bits (83), Expect = 4e-04
Identities = 32/86 (37%), Positives = 37/86 (43%), Gaps = 2/86 (2%)

Query: 461 GTGGEGGIGGQGGN--GGAGGLLIGNGGAGGVGGTAGAGGTGGSGGAGGAGGAGGGGTNS 518
G G G GN GG GL +G G + G G ++ GG G+G G G G N
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 519 GPGAAFGGNGNTGGNGGNGGAPGALG 544
G GG TGGN AP A G
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 33.5 bits (76), Expect = 0.003
Identities = 26/67 (38%), Positives = 28/67 (41%)

Query: 368 NGGSGGNGFDSFASGGTGGAGGTGGAGGRGGLLIGDGGAGGAGGVGGTGGSGAPGGGGGA 427
NGG G G AS G+G + GG G I GG G G GG G SG G GG
Sbjct: 21 NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGN 80

Query: 428 GGDGGAA 434
A
Sbjct: 81 LSAVAAP 87



Score = 33.1 bits (75), Expect = 0.003
Identities = 29/96 (30%), Positives = 36/96 (37%), Gaps = 9/96 (9%)

Query: 484 NGGAGGVGGTAGAGGTGGSGGAGGAGGAG--------GGGTNSGPGAAFGGNGNTGGNGG 535
N GA G G TG G G + G+G GGG+ SG GG+G+ G G
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHW-GGGSGHGNGGGN 68

Query: 536 NGGAPGALGGKGGSGGLIGRAGSDGGVGAGGAGGAG 571
G+ G S A + GAGG
Sbjct: 69 GNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.4 bits (73), Expect = 0.006
Identities = 29/94 (30%), Positives = 35/94 (37%), Gaps = 1/94 (1%)

Query: 510 GAGGGGTNSGPGAAFGG-NGNTGGNGGNGGAPGALGGKGGSGGLIGRAGSDGGVGAGGAG 568
G G G N+G + G NG G G GGA G + G +GS G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 569 GAGGAGGTGGEGGTGGDGKTTDGNPGMGGSPGSA 602
G GG G G G G + P G P +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALS 96



Score = 31.6 bits (71), Expect = 0.009
Identities = 25/76 (32%), Positives = 26/76 (34%)

Query: 404 GGAGGAGGVGGTGGSGAPGGGGGAGGDGGAANTDSAGSSRKAFGGDGGVGGDGASALGTG 463
GG G G SG GG G GG A+ S SS G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 464 GEGGIGGQGGNGGAGG 479
G GG G G G G
Sbjct: 63 GNGGGNGNSGGGSGTG 78



Score = 31.6 bits (71), Expect = 0.011
Identities = 37/119 (31%), Positives = 47/119 (39%), Gaps = 9/119 (7%)

Query: 146 GSGGVGQAGGAGGSAGLIGIGGTGGAGGAGAVGGVGGNGGWLYGNGGAGGLGGTGVAGVN 205
G G G GA ++G I GG G G GG GW N GG G+G+
Sbjct: 3 GGDGRGHNTGAHSTSGNI----NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 206 GGMGAAGGAGGNAYLFGSGGAGGQGGMGAAGADGVNPTPTGTADAGSTGTDQTLGGNAI 264
G GG GN SGG G GG +A A V + G+ G ++ A+
Sbjct: 59 GSGHGNGGGNGN-----SGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGAL 112



Score = 30.8 bits (69), Expect = 0.019
Identities = 27/91 (29%), Positives = 36/91 (39%), Gaps = 9/91 (9%)

Query: 250 AGSTGTDQTLGGNAIGGNGGPGDAGDAMTSGGAGGSGGNAVSTVNGDAVGGEGGKGGEGA 309
+G G G ++ GN G G + G + GSG ++ + G G GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 310 YGGAGGAGGSAASIGNAAIGGNGGAGGNAQA 340
+G GG G S GG G GGN A
Sbjct: 62 HGNGGGNGNS---------GGGSGTGGNLSA 83


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
Rv0876c   RTXTOXINA  30     0.034    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 29.9 bits (67), Expect = 0.034
Identities = 17/48 (35%), Positives = 25/48 (52%), Gaps = 4/48 (8%)

Query: 416 TVLVTVLA-IAAAVAGSLAATAIATL---ITAGSSAIAKASLDASLQH 459
TVL +V + I+AA SL ++ L +T S I +AS A +H
Sbjct: 373 TVLASVSSGISAAATTSLVGAPVSALVGAVTGIISGILEASKQAMFEH 420


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv0878c   cloacin  35     4e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 35.5 bits (81), Expect = 4e-04
Identities = 29/103 (28%), Positives = 39/103 (37%), Gaps = 25/103 (24%)

Query: 263 GNGNDGNTNFGSGNAGFLNIGSGNEGSGNLGFGNAGDDNTGWGNSGDTNTGGFNSGDLNT 322
G G++ + SGN N G LG G D +GW + + GG SG
Sbjct: 6 GRGHNTGAHSTSGNI--------NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGI--- 54

Query: 323 GIGSPVTQGVANSGFGNTGTGHSGFFNSGNSGSGFQNLGNGSS 365
G +G G+ G +GNSG G GN S+
Sbjct: 55 ------------HWGGGSGHGNGG--GNGNSGGGSGTGGNLSA 83



Score = 30.1 bits (67), Expect = 0.022
Identities = 27/75 (36%), Positives = 33/75 (44%), Gaps = 1/75 (1%)

Query: 243 GNGNIGNANLGSGNAGFFNFGNGNDGNTNFGSGNAGFLNIGSGNEGSGNLGFGNAGDDNT 302
G G+ A+ SGN G G G + GSG + N G GSG G +G N
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 303 GW-GNSGDTNTGGFN 316
G GNSG + G N
Sbjct: 66 GGNGNSGGGSGTGGN 80


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv0879c   PF06580  29     0.002    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 28.7 bits (64), Expect = 0.002
Identities = 14/67 (20%), Positives = 26/67 (38%), Gaps = 1/67 (1%)

Query: 2 SVENSQIREPPPLPPVLLEVWPVIAVGALAWLVAAVAAFVVPGLASWRPVTVA-GLATGL 60
S Q + ++L V P V + W VA + + + + +PV LA +
Sbjct: 61 SFIKRQGWLKLNMGQIILRVLPACVVIGMVWFVANTSIWRLLAFINTKPVAFTLPLALSI 120

Query: 61 LGTTIFV 67
+ + V
Sbjct: 121 IFNVVVV 127


61  Rv0977  Rv0985c  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv0977    6       16       4.980558   PE-PGRS family protein PE_PGRS16
Rv0978c   2       13       2.246956   PE-PGRS family protein PE_PGRS17
Rv0979c   0       9        1.350047   Hypothetical protein
Rv0979A   0       9        1.261528   50S ribosomal protein L32 RpmF
Rv0980c   0       10       0.752781   PE-PGRS family protein PE_PGRS18
Rv0981    -2      12       -2.279486  Mycobacterial persistence regulator MRPA (two
Rv0982    1       24       -5.844563  Two component sensor kinase MprB
Rv0983    3       33       -8.076687  Probable serine protease PepD (serine
Rv0984    1       35       -8.957863  Possible pterin-4-alpha-carbinolamine
Rv0985c   2       35       -8.565029  Possible large-conductance ion mechanosensitive
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv0977    cloacin  44     2e-06    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 44.3 bits (104), Expect = 2e-06
Identities = 35/101 (34%), Positives = 43/101 (42%), Gaps = 1/101 (0%)

Query: 468 AGGDGGQGDIGFDGGRGG-DGGPGGGGGAGGDGSGTFNAQANNGGDGGAGGVGGAGGTGG 526
+GGDG + G G +GGP G G GG G+ + NN GG+G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 527 TGGVGADGGRGGDSGRGGDGGNAGHGGAAQFSGRGAYGGEG 567
G G +G GG SG GG+ A F G G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 43.9 bits (103), Expect = 3e-06
Identities = 37/108 (34%), Positives = 46/108 (42%), Gaps = 7/108 (6%)

Query: 357 GGAGAGGDGGAGGIGGTGGNGSIGGAAGNGGNGGRGGA------GGMATAGSDGGNGGGG 410
GG G G + GA G G G G G + G G + GG + +G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 411 GNGGVGVGSAGGAGGTGGDGGAAGAGGAPGHGYFQQPAPQGLPIGTGG 458
GNGG G++GG GTGG+ A A A G P GL +
Sbjct: 63 GNGGGN-GNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109



Score = 42.0 bits (98), Expect = 9e-06
Identities = 30/82 (36%), Positives = 36/82 (43%), Gaps = 2/82 (2%)

Query: 304 GIGEQGGQGGDGGA--GGAGGIGGSAGGIGGSQGAGGHGGDGGQGGAGGSGGVGGGGAGA 361
G G G G GG G+G G GS + + GG G+G G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 362 GGDGGAGGIGGTGGNGSIGGAA 383
GG+G +GG GTGGN S A
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAP 87



Score = 41.2 bits (96), Expect = 2e-05
Identities = 33/99 (33%), Positives = 42/99 (42%), Gaps = 1/99 (1%)

Query: 158 GRGGDAGLFGHGGHGGVGGPGIAGAAGTAGLPGGNGANGGSGGIGGAGGAGGNGGLLFGN 217
GRG + G G+ G G+ G + G + N GG G+G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 218 GGAGGQGGSGGLGGSGGTGGAGMAAG-PAGGTGGIGGIG 255
GG G GG G GG+ A +A G PA T G GG+
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 41.2 bits (96), Expect = 2e-05
Identities = 32/79 (40%), Positives = 36/79 (45%), Gaps = 1/79 (1%)

Query: 312 GGDGGAGGAGGIGGSAGGIGGSQGAGGHGGDGGQGGAGGSGGVGGGGAGAG-GDGGAGGI 370
GGDG G S GG G G GG G GGG+G+G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 371 GGTGGNGSIGGAAGNGGNG 389
G GGNG+ GG +G GGN
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 40.5 bits (94), Expect = 3e-05
Identities = 30/104 (28%), Positives = 33/104 (31%)

Query: 330 IGGSQGAGGHGGDGGQGGAGGSGGVGGGGAGAGGDGGAGGIGGTGGNGSIGGAAGNGGNG 389
+ G G G + G G G G G G DG G G GG
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 390 GRGGAGGMATAGSDGGNGGGGGNGGVGVGSAGGAGGTGGDGGAA 433
G G GG +G G GG V A T G GG A
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 40.1 bits (93), Expect = 4e-05
Identities = 27/80 (33%), Positives = 34/80 (42%)

Query: 538 GDSGRGGDGGNAGHGGAAQFSGRGAYGGEGGSGGAGGNAGGAGTGGTAGSGGAGGFGGNG 597
G GRG + G G G G G S G+G ++ GG +GSG G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 598 ADGGNGGNGGNGGFGGINGT 617
+GG GN G G G N +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLS 82



Score = 39.3 bits (91), Expect = 7e-05
Identities = 36/104 (34%), Positives = 45/104 (43%), Gaps = 3/104 (2%)

Query: 257 IGGAGGVGGHGSALFGHGGINGDGGTGGMGGQGGAGGNGWAAEGITVGIGEQGGQGGDGG 316
+ G G G + A G ING G TG G G + G+GW++E G G G GG
Sbjct: 1 MSGGDGRGHNTGAHSTSGNING-GPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 317 AGGAGGIGGSAGGIGGSQGAGGHGGDGGQGGAGGSGGVGGGGAG 360
+G GG G GG G GG+ A G + GAG
Sbjct: 60 SGHGN--GGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAG 101



Score = 37.4 bits (86), Expect = 3e-04
Identities = 27/90 (30%), Positives = 35/90 (38%)

Query: 558 SGRGAYGGEGGSGGAGGNAGGAGTGGTAGSGGAGGFGGNGADGGNGGNGGNGGFGGINGT 617
SG G G+ GN G TG G G + G G + + GG G+G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 618 FGTNGAGGTGGLGTLLGGHNGNIGLNGATG 647
G G G G G+ GG+ + A G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 36.6 bits (84), Expect = 5e-04
Identities = 35/113 (30%), Positives = 45/113 (39%), Gaps = 14/113 (12%)

Query: 492 GGGAGGDGSGTFNAQANNGGDGGAGGVGGAGGTGGTGGVGADGGRGGDSGRG-GDGGNAG 550
GG G +G + N G GVGG G + G+G + GG SG G GG +G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGG-GASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 551 HGGAAQFSGRGAYGGEGGSGGAGGNAGGAGTGGTAGSGGAGGFGGNGADGGNG 603
HG GG+G +GG +G G + A GF G G
Sbjct: 62 HGNG------------GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 35.8 bits (82), Expect = 9e-04
Identities = 28/83 (33%), Positives = 34/83 (40%), Gaps = 1/83 (1%)

Query: 195 NGGSGGIGGAGGAGGNGGLLFGNGGAGGQGGSG-GLGGSGGTGGAGMAAGPAGGTGGIGG 253
NGG G+G GGA G N GG GSG GG G G G GG+G G
Sbjct: 21 NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGN 80

Query: 254 IGGIGGAGGVGGHGSALFGHGGI 276
+ + G + G GG+
Sbjct: 81 LSAVAAPVAFGFPALSTPGAGGL 103



Score = 35.5 bits (81), Expect = 0.001
Identities = 31/80 (38%), Positives = 35/80 (43%), Gaps = 1/80 (1%)

Query: 183 AGTAGLPGGNGANGGSGGI-GGAGGAGGNGGLLFGNGGAGGQGGSGGLGGSGGTGGAGMA 241
+G G GA+ SG I GG G G GG G+G + GG GSG G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 242 AGPAGGTGGIGGIGGIGGAG 261
G GG G GG G GG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 35.5 bits (81), Expect = 0.001
Identities = 29/81 (35%), Positives = 33/81 (40%), Gaps = 6/81 (7%)

Query: 422 GAGGTGGDGGAAGAGGAPGHGYFQQPAPQGLPIGTGGTGGEGGAGGAGGDGGQGDIGFDG 481
G G G + GA G G P GL +G G + G G + GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGG------PTGLGVGGGASDGSGWSSENNPWGGGSGSGIHW 56

Query: 482 GRGGDGGPGGGGGAGGDGSGT 502
G G G GGG G G GSGT
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGT 77



Score = 33.5 bits (76), Expect = 0.004
Identities = 23/77 (29%), Positives = 27/77 (35%)

Query: 455 GTGGTGGEGGAGGAGGDGGQGDIGFDGGRGGDGGPGGGGGAGGDGSGTFNAQANNGGDGG 514
G G G GA G+ G G G G G G G G+ + GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 515 AGGVGGAGGTGGTGGVG 531
G G GG+G G
Sbjct: 63 GNGGGNGNSGGGSGTGG 79



Score = 32.4 bits (73), Expect = 0.009
Identities = 26/81 (32%), Positives = 30/81 (37%), Gaps = 1/81 (1%)

Query: 201 IGGAGGAGGNGGLLFGNGGA-GGQGGSGGLGGSGGTGGAGMAAGPAGGTGGIGGIGGIGG 259
+ G G G N G +G GG G G GG+ G P GG G G G G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 260 AGGVGGHGSALFGHGGINGDG 280
G GG G G G+
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNL 81



Score = 30.8 bits (69), Expect = 0.025
Identities = 27/74 (36%), Positives = 32/74 (43%), Gaps = 11/74 (14%)

Query: 134 NGGAGGILWGNGGNGGSGAPGQPGGRGGDAGLFGHGGHGGVGGPGIAGAAGTAGLPGGNG 193
NGG G+ G G + GSG + GG +G H G G G G GGNG
Sbjct: 21 NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG-----------GGNG 69

Query: 194 ANGGSGGIGGAGGA 207
+GG G GG A
Sbjct: 70 NSGGGSGTGGNLSA 83



Score = 30.5 bits (68), Expect = 0.037
Identities = 24/71 (33%), Positives = 28/71 (39%), Gaps = 7/71 (9%)

Query: 379 IGGAAGNGGNGGRGGAGGMATAGSDGGNGGGGGNGGVGVGSA----GGAGGTG---GDGG 431
+ G G G N G G G G GGG + G G S GG G+G G G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 432 AAGAGGAPGHG 442
G GG G+
Sbjct: 61 GHGNGGGNGNS 71


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv0978c   cloacin  37     8e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 37.0 bits (85), Expect = 8e-05
Identities = 28/90 (31%), Positives = 35/90 (38%), Gaps = 5/90 (5%)

Query: 122 NGIDGTGQAGGNGGWLWGNGGNGGSGAPGQAGGAGGAAGLIGNGGAGGTGGAVSLARAGT 181
+G DG G G + +G P G GGA+ G G S +
Sbjct: 2 SGGDGRGHNTGAH-----STSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHW 56

Query: 182 AGGAGRGPVGGIGGAGGVGGAGGAAGAVTT 211
GG+G G GG G +GG G GG AV
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 32.0 bits (72), Expect = 0.003
Identities = 24/78 (30%), Positives = 29/78 (37%), Gaps = 6/78 (7%)

Query: 117 IGDGANGIDGTGQA------GGNGGWLWGNGGNGGSGAPGQAGGAGGAAGLIGNGGAGGT 170
+G G DG+G + GG G GG G G G G +GG +G GN A
Sbjct: 27 LGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAA 86

Query: 171 GGAVSLARAGTAGGAGRG 188
A T G G
Sbjct: 87 PVAFGFPALSTPGAGGLA 104



Score = 31.6 bits (71), Expect = 0.005
Identities = 32/103 (31%), Positives = 36/103 (34%), Gaps = 4/103 (3%)

Query: 117 IGDGANGIDGTGQAGGNGGWLWGNGGNGGSGAPGQAGGAGGAAGLIGNGGAGG----TGG 172
I G G+ G A GW N GG G G G G G G G TGG
Sbjct: 20 INGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGG 79

Query: 173 AVSLARAGTAGGAGRGPVGGIGGAGGVGGAGGAAGAVTTITHA 215
+S A A G G GG AG + A+ I A
Sbjct: 80 NLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMAA 122


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv0980c   cloacin  42     4e-06    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 42.0 bits (98), Expect = 4e-06
Identities = 40/121 (33%), Positives = 49/121 (40%), Gaps = 1/121 (0%)

Query: 170 AGGQGLPFEAGANGGAGGAGGWLFGNGGAGGVGGAGGAGTTFGVAGGDGGTGGVGGHGGL 229
+GG G GA+ +G G G G GG G + GG G+G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 230 IGVGGHGGDGGTG-GTGGAVSLARAGTAGGAGGGPAGGIGGAGGVGGAGGAAGAVTTITH 288
G GG G+ G G GTGG +S A A G G GG AG + A+ I
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMA 121

Query: 289 A 289
A
Sbjct: 122 A 122



Score = 39.3 bits (91), Expect = 3e-05
Identities = 26/78 (33%), Positives = 33/78 (42%), Gaps = 1/78 (1%)

Query: 139 GNGGNGGSGAPGQAGGAGGAAGLIGNGGAGGAGGQGLPFEAGANGGAGGAGGWLFGNGGA 198
G+G +GA +G G +G GG G + G G E GG G+G G G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGG-GASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 199 GGVGGAGGAGTTFGVAGG 216
G GG G +G G G
Sbjct: 63 GNGGGNGNSGGGSGTGGN 80



Score = 33.9 bits (77), Expect = 0.001
Identities = 30/117 (25%), Positives = 35/117 (29%), Gaps = 11/117 (9%)

Query: 118 GDGANGIDGTGQAGGNGGWLWGNGGNGGSGAPGQAGGAGGAAGLIGNGGAGGAGGQGLPF 177
GDG G GN NGG G GGA +G G G
Sbjct: 4 GDGRGHNTGAHSTSGNI--------NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIH 55

Query: 178 EAGANGGAGGAGGWLFGNGGAGGVGGAGGAGTTFGVAGGDGGTGGVGGHGGLIGVGG 234
G +G G G GN G G G + VA G G G + +
Sbjct: 56 WGGGSGHGNGGGN---GNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109



Score = 33.1 bits (75), Expect = 0.003
Identities = 29/86 (33%), Positives = 35/86 (40%), Gaps = 13/86 (15%)

Query: 213 VAGGDGGTGGVGGH-------GGLIGVGGHGGDGGTGGTG------GAVSLARAGTAGGA 259
++GGDG G H GG G+G GG G G S + GG+
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 260 GGGPAGGIGGAGGVGGAGGAAGAVTT 285
G G GG G +GG G GG AV
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 29.3 bits (65), Expect = 0.033
Identities = 26/78 (33%), Positives = 33/78 (42%), Gaps = 5/78 (6%)

Query: 117 IGDGANGIDGTGQAGGNGGWLWGNGGN---GGSGAPGQAGGAGGAAGLIGNGGAGGAGGQ 173
+G G DG+G + N W G+G GG G GG G + G G GG A
Sbjct: 27 LGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAA 86

Query: 174 GLPF--EAGANGGAGGAG 189
+ F A + GAGG
Sbjct: 87 PVAFGFPALSTPGAGGLA 104


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
Rv0981    HTHFIS  103    9e-28    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 103 bits (258), Expect = 9e-28
Identities = 45/135 (33%), Positives = 71/135 (52%), Gaps = 1/135 (0%)

Query: 2 RILVVDDDRAVRESLRRSLSFNGYSVELAHDGVEALDMIASDRPDALVLDVMMPRLDGLE 61
ILV DDD A+R L ++LS GY V + + IA+ D +V DV+MP + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VCRQLRGTGDDLPILVLTARDSVSERVAGLDAGADDYLPKPFALEELLARM-RALLRRTK 120
+ +++ DLP+LV++A+++ + + GA DYLPKPF L EL+ + RAL +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 121 PEDAAESMAMRFSDL 135
E + L
Sbjct: 125 RPSKLEDDSQDGMPL 139


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
Rv0983    V8PROTEASE  57     4e-11    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 57.3 bits (138), Expect = 4e-11
Identities = 36/182 (19%), Positives = 72/182 (39%), Gaps = 27/182 (14%)

Query: 165 PSVVMLETDLGRQSEEGSGIILSAEGLILTNNHVIAAAAKPPLG---SPPPKTTVTFSDG 221
V ++ + + SG+++ + +LTN HV+ A P P + +G
Sbjct: 88 APVTYIQVEAPTGTFIASGVVV-GKDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNG 146

Query: 222 RTAPFTVVGADPTSDIAVVRV----QGVS---GLTPISLGSSSDLRVGQPVLAIGSPLGL 274
+ D+A+V+ Q + P ++ ++++ +V Q + G P
Sbjct: 147 GFTAEQITKYSGEGDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGDK 206

Query: 275 EGTVTTGIVSALNRPVSTTGEAGNQNTVLD--AIQTDAAINPGNSGGALVNMNAQLVGVN 332
PV+T E+ + T L A+Q D + GNSG + N +++G++
Sbjct: 207 --------------PVATMWESKGKITYLKGEAMQYDLSTTGGNSGSPVFNEKNEVIGIH 252

Query: 333 SA 334

Sbjct: 253 WG 254


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
Rv0985c   MECHCHANNEL  162    1e-54    Bacterial mechano-sensitive ion channel signature.
		>MECHCHANNEL#Bacterial mechano-sensitive ion channel signature.

Length = 136

Score = 162 bits (412), Expect = 1e-54
Identities = 47/134 (35%), Positives = 66/134 (49%), Gaps = 6/134 (4%)

Query: 1 MLKGFKEFLARGNIVDLAVAVVIGTAFTALVTKFTDSIITPLINR----IGVNAQSDVGI 56
++K F+EF RGN+VDLAV V+IG AF +V+ II P + I +
Sbjct: 3 IIKEFREFAMRGNVVDLAVGVIIGAAFGKIVSSLVADIIMPPLGLLIGGIDFKQFAVTLR 62

Query: 57 LRIGIGGGQTIDLNVLLSAAINFFLIAFAVYFLVVLPYNTLRKKGE-VEQPGDT-QVVLL 114
G + V + +F ++AFA++ + L RKK E P T + VLL
Sbjct: 63 DAQGDIPAVVMHYGVFIQNVFDFLIVAFAIFMAIKLINKLNRKKEEPAAAPAPTKEEVLL 122

Query: 115 TEIRDLLAQTNGDS 128
TEIRDLL + N S
Sbjct: 123 TEIRDLLKEQNNRS 136


62  Rv1232c  Rv1238  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv1232c   -3      14       -1.127060  Conserved protein
Rv1233c   -1      15       -2.616895  Conserved hypothetical membrane protein
Rv1234    -3      13       -3.219081  Probable transmembrane protein
Rv1235    -1      12       -2.528035  Probable sugar-binding lipoprotein LpqY
Rv1236    0       12       -3.167634  Probable sugar-transport integral membrane
Rv1237    1       11       -3.050139  Probable sugar-transport integral membrane
Rv1238    0       12       0.804890   Probable sugar-transport ATP-binding protein ABC
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv1232c   FLGMOTORFLIG  32     0.005    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 31.7 bits (72), Expect = 0.005
Identities = 21/130 (16%), Positives = 51/130 (39%), Gaps = 20/130 (15%)

Query: 167 LAMPGQDVAQL---LDQFEGWKAVDVADAIRGLPPKRRHEVFKALHDKRLADVLQELPEL 223
+++ + +++ L Q E + + + + V + +A + +
Sbjct: 26 VSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFKELMMAQEFIQKGGI 85

Query: 224 DQA-EVLSQ-LGTERAADV---------------LEEMDPDDAADLLAVLNPTEAELLLT 266
D A E+L + LGT++A D+ + DP + + + +P L+L+
Sbjct: 86 DYARELLEKSLGTQKAVDIINNLGSALQSRPFEFVRRADPANILNFIQQEHPQTIALILS 145

Query: 267 RMDPGDSGQV 276
+DP + +
Sbjct: 146 YLDPQKASFI 155



Score = 30.2 bits (68), Expect = 0.015
Identities = 28/164 (17%), Positives = 56/164 (34%), Gaps = 30/164 (18%)

Query: 181 FEGWKAVD---VADAIRGLPPKRRHEVFKALHDKRLADVLQELPELDQAEVLSQLGTERA 237
FE + D + + I+ P+ + L ++ + +L LP Q V ++
Sbjct: 117 FEFVRRADPANILNFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIAL--- 173

Query: 238 ADVLEEMDPDDAADLLAVLNPTEAELLLTR-------------MDPGDSGQVRRLL---- 280
++ P+ ++ VL A L ++ D + ++
Sbjct: 174 ---MDRTSPEVVREVERVLEKKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLE 230

Query: 281 THSPDTAGG----LMTSDPVVLTPDTSIAEALARVRDPDLTPAL 320
P+ A + + +VL D SI L + +L AL
Sbjct: 231 EEDPELAEEIKKKMFVFEDIVLLDDRSIQRVLREIDGQELAKAL 274


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv1233c   PF03544  30     0.004    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 29.9 bits (67), Expect = 0.004
Identities = 13/85 (15%), Positives = 16/85 (18%)

Query: 18 GPPPVGERPPEQPIADAPWAPPASSPMANHPPPAYPPSGYPPAYQPGYPTGYPPPMPPGG 77
P P E PE P P P +P P
Sbjct: 75 EPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAP 134

Query: 78 YAPPGYPPPGTSSAGYGDIPYPPMP 102
P +S + P
Sbjct: 135 ARPTSSTATAATSKPVTSVASGPRA 159


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
Rv1235    MALTOSEBP  49     2e-08    Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 49.3 bits (117), Expect = 2e-08
Identities = 43/162 (26%), Positives = 70/162 (43%), Gaps = 16/162 (9%)

Query: 139 WNHKLYAAPVTTNTQLLWYRPDLVNSPPTDWNAMIAEAARLHAAGEPSWIAVQANQGEGL 198
+N KL A P+ L Y DL+ +PP W + A L A G+ A+ N E
Sbjct: 125 YNGKLIAYPIAVEALSLIYNKDLLPNPPKTWEEIPALDKELKAKGKS---ALMFNLQEPY 181

Query: 199 VVWFNTLLVSAGGSVLS-EDGRHVTL---TDTPAHRAATVSALQILKSVATTPGADPSIT 254
W L+ + GG E+G++ D +A + ++K+ D SI
Sbjct: 182 FTW--PLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDYSI- 238

Query: 255 RTEEGSARLAFEQGKAALEVNWPFVFASMLENAVKGGVPFLP 296
A AF +G+ A+ +N P+ ++++ + V GV LP
Sbjct: 239 ------AEAAFNKGETAMTINGPWAWSNIDTSKVNYGVTVLP 274


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv1238    PF05272  35     4e-04    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 35.4 bits (81), Expect = 4e-04
Identities = 24/122 (19%), Positives = 39/122 (31%), Gaps = 35/122 (28%)

Query: 33 LILVGPSGCGKTTTLNMIAGLEDISSGELRIAGERVNEKAPKDRDIAMVFQSYALYPHMT 92
++L G G GK+T +N + GL+ S I +D Y
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDI---------GTGKDSYEQIAGIVAY---- 645

Query: 93 VRQNIAFPLTLAKMRKADIAQKVSETAKILDLTNLLDRKPSQLSGGQRQRVAMGRAIVRH 152
++ + R+AD + + D R R A GR + H
Sbjct: 646 ---ELS---EMTAFRRADAEAVKAFFSSRKD----------------RYRGAYGRYVQDH 683

Query: 153 PK 154
P+
Sbjct: 684 PR 685


63  Rv1243c  Rv1255c  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv1243c   0       16       1.371538   PE-PGRS family protein PE_PGRS23
Rv1244    -1      14       -0.580425  Probable lipoprotein LpqZ
Rv1245c   -2      11       -1.212105  Probable short-chain type
Rv1246c   -2      10       -0.337344  Toxin RelE
Rv1247c   -2      10       -0.171184  Antitoxin RelB
Rv1248c   -2      8        -0.335450  Multifunctional alpha-ketoglutarate metabolic
Rv1249c   -2      7        -0.070724  Possible membrane protein
Rv1250    -1      7        -0.087124  Probable drug-transport integral membrane
Rv1251c   -1      8        0.286764   Conserved hypothetical protein
Rv1252c   0       9        0.017417   Probable lipoprotein LprE
Rv1253    0       9        0.166700   Probable cold-shock DeaD-box protein A homolog
Rv1254    2       10       0.984676   Probable acyltransferase
Rv1255c   -1      9        0.522380   Probable transcriptional regulatory protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
Rv1243c   FLAGELLIN  37     2e-04    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 37.3 bits (86), Expect = 2e-04
Identities = 33/286 (11%), Positives = 51/286 (17%), Gaps = 15/286 (5%)

Query: 260 NGTDAGSGSNAVNPGVGGGAG---------------GIGGDGTNLGQTDVSGGAGGDGGD 304
NG S N + VG G G+ G N + G +
Sbjct: 132 NGVKVLSQDNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEATVGDLKSSFKN 191

Query: 305 GANFASGGAGGNGGAAQSGFGDAVGGNGGAGGNGGAGGGGGLGGAGGSANVANAGNSIGG 364
+ + G N G V G N +
Sbjct: 192 VTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAENNTAVDLFK 251

Query: 365 NGGAGGNGGIGAPGGAGGAGGNANQDNPPGGNSTGGNGGAGGDGGVGASADVGGAGGFGG 424
+ GG G + + G DG S + G
Sbjct: 252 TTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTTINGEKVTLT 311

Query: 425 SGGRGGLLLGTGGAGGDGGVGGDGGIGAQGGSGGNGGNGGIGADGMANQDGDGGDGGNGG 484
A + + + +
Sbjct: 312 VADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESKIT 371

Query: 485 DGGAGGAGGVGGNGGTGGAGGLFGQSGSPGSGAAGGLGGAGGNGGA 530
GA G+ T +F + G A
Sbjct: 372 VNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKST 417


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv1245c   DHBDHDRGNASE  74     8e-18    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 74.3 bits (182), Expect = 8e-18
Identities = 56/201 (27%), Positives = 92/201 (45%), Gaps = 2/201 (0%)

Query: 2 EGFAGKVAVVTGAGSGIGQALAIELARSGAKVAISDVDTDGLADTEHRLKAISTPVKTDR 61
+G GK+A +TGA GIG+A+A LA GA +A D + + L LKA + +
Sbjct: 4 KGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP 63

Query: 62 LDVTEREAFLAYADAVNEHFGTVNQIYNNAGIAFTGDIEVSQFKDIERVMDVDFWGVVNG 121
DV + A + G ++ + N AG+ G I ++ E V+ GV N
Sbjct: 64 ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNA 123

Query: 122 TKAFLPHLIASGDGHVINISSVFGLFSAPGQAAYNSAKFAVRGFTEALRQEMALAGHPVK 181
+++ +++ G ++ + S AAY S+K A FT+ L E LA + ++
Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLE--LAEYNIR 181

Query: 182 VTTVHPGGVKTAIARNATAAE 202
V PG +T + + A E
Sbjct: 182 CNIVSPGSTETDMQWSLWADE 202


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv1248c   PF03544  34     0.002    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 34.2 bits (78), Expect = 0.002
Identities = 9/61 (14%), Positives = 17/61 (27%)

Query: 42 PEPTSQPAAEPTRVTSPLVAERAAAAAPQAPPKPADTAAAGNGVVAALAAKTAVPPPAEG 101
PEP +P EP + ++ + P+ P + + A
Sbjct: 76 PEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPA 135

Query: 102 D 102

Sbjct: 136 R 136



Score = 32.3 bits (73), Expect = 0.011
Identities = 21/116 (18%), Positives = 31/116 (26%), Gaps = 6/116 (5%)

Query: 38 VDYSPEPTSQPAAEPTRVTSPLVAERAAAAAPQAPPKPADTAAAGNGVVAALAAKTAVPP 97
V PEP +P EP + P E P K
Sbjct: 66 VQPPPEPVVEPEPEPEPIPEP-PKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESR 124

Query: 98 PAEGDEVAVLRGAAAAVVKNMSASLEVPTATSVRAVPAKLLIDNRIVINNQLKRTR 153
PA E A A + +A+ + A + L N+ + + R
Sbjct: 125 PASPFE-----NTAPARPTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALR 175


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv1250    TCRTETB  143    2e-39    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 143 bits (363), Expect = 2e-39
Identities = 90/415 (21%), Positives = 173/415 (41%), Gaps = 15/415 (3%)

Query: 12 SYFRNPWPALWAMMVGFFMIMLDSTVVAIANPTIMAQLRIGYATVVWVTSAYLLAYAVPM 71
S R+ +W ++ FF ++ + V+ ++ P I A+ WV +A++L +++
Sbjct: 8 SNLRHNQILIWLCILSFFSVL-NEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGT 66

Query: 72 LVAGRLGDRFGPKNLYLIGLGVFTVASLGCGLS-SGAGMLIAARVVQGVGAGLLTPQTLS 130
V G+L D+ G K L L G+ + S+ + S +LI AR +QG GA +
Sbjct: 67 AVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMV 126

Query: 131 TITRIFPAHRRGVALGAWGTVASVASLVGPLAGGALVDSMGWEWIFFVNVPVGVIGLILA 190
+ R P RG A G G++ ++ VGP GG + + W ++ + + + +I +
Sbjct: 127 VVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPM-ITIITVPFL 185

Query: 191 AYLIPALPHHPHRFDWFGVGLSGAGMFLIVFGLQQGQSANWQPWIWAVIVGGIGFMSLFV 250
L+ FD G+ L G+ + S + I +V+ +FV
Sbjct: 186 MKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFT---TSYSISFLIVSVL-----SFLIFV 237

Query: 251 YWQARNAREPLIPLEVFNDRNFSLSNLRIAIIAFAGTGMMLPVTFYAQAVCGLSP-THTA 309
+ P + + + F + L II G + V + + V LS +
Sbjct: 238 KHIRKVTD-PFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGS 296

Query: 310 VLFAPTAIVGGVLAPFVGMIIDRSHPLCVLGFGFSVLAIAMTWLLCEMAPGTPIWRLVLP 369
V+ P + + G+++DR PL VL G + L+++ +L T W + +
Sbjct: 297 VIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVS--FLTASFLLETTSWFMTII 354

Query: 370 FIALGVAGAFVWSPLTVTATRNLRPHLAGASSGVFNAVRQLGAVLGSASMAAFMT 424
+ + +F + ++ + +L+ AGA + N L G A + ++
Sbjct: 355 IVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits      Score  E-value  Comments
Rv1252c   cdtoxina  29     0.009    Cytolethal distending toxin A signature.
		>cdtoxina#Cytolethal distending toxin A signature.

Length = 258

Score = 29.3 bits (65), Expect = 0.009
Identities = 16/65 (24%), Positives = 24/65 (36%), Gaps = 3/65 (4%)

Query: 17 VVAALVAATLTGCGSGDSTVAKTPEATPSLSTAHPAPPSSEPSP---PSATAAPPSNHSA 73
+ L+ L GC SG + P+ P P PS + P A P+N +
Sbjct: 10 IAGILIPILLNGCSSGKNKAYLDPKVFPPQVEGGPTVPSPDEPGLPLPGPGPALPTNGAI 69

Query: 74 APVDP 78
+P
Sbjct: 70 PIPEP 74


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv1255c   HTHTETR  62     3e-14    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 62.3 bits (151), Expect = 3e-14
Identities = 30/130 (23%), Positives = 49/130 (37%), Gaps = 3/130 (2%)

Query: 8 SARRTELAADRILDAAERLFTQRDPASIGMNEIAKAAGCSRATLYRYFDSREALRTAYVH 67
+ + + ILD A RLF+Q+ +S + EIAKAAG +R +Y +F + L +
Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64

Query: 68 RETRRLG---REIMVKIADVVEPAERLLVSITTTLRMVRDNPALAAWFTTTRPPIGGEMA 124
+G E K R ++ + + L + GEMA
Sbjct: 65 LSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMA 124

Query: 125 GRSEVIAALA 134
+ L
Sbjct: 125 VVQQAQRNLC 134


64  Rv1473  Rv1488  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv1473    -2      12       -0.744997  Probable macrolide-transport ATP-binding protein
Rv1473A   -2      11       -1.009950  Possible transcriptional regulatory protein
Rv1474c   -2      10       -0.957951  Probable transcriptional regulatory protein
Rv1475c   -2      9        -0.646685  Probable iron-regulated aconitate hydratase Acn
Rv1476    1       8        0.510991   Possible membrane protein
Rv1477    0       9        0.687340   Peptidoglycan hydrolase
Rv1478    1       8        0.270355   Possible invasion protein
Rv1479    2       9        0.023118   Probable transcriptional regulatory protein
Rv1480    0       8        0.102114   Conserved protein
Rv1481    0       8        -0.248001  Probable membrane protein
Rv1482c   -1      7        -0.096553  Conserved hypothetical protein
Rv1483    -1      11       0.094518   3-oxoacyl-[acyl-carrier protein] reductase FabG1
Rv1484    -1      14       0.124361   NADH-dependent enoyl-[acyl-carrier-protein]
Rv1485    -1      14       0.401063   Ferrochelatase HemZ (protoheme ferro-lyase)
Rv1486c   -2      17       -2.338212  Conserved hypothetical protein
Rv1487    -1      16       -2.323017  Conserved membrane protein
Rv1488    -3      11       -0.398826  Possible exported conserved protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv1473    BCTERIALGSPC  32     0.004    Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C signature.

Length = 272

Score = 32.2 bits (73), Expect = 0.004
Identities = 15/53 (28%), Positives = 25/53 (47%), Gaps = 2/53 (3%)

Query: 82 ARDRVLSARGLDVLLTDLEKQQALMAEVADEDERDRAIRRYGQLEERFVALGG 134
D ++ GLD L D E+ + M +AD + R GQ ++ ++ GG
Sbjct: 220 DNDMAVALNGLD--LRDAEQAKKAMERMADVHNFTLTVERDGQRQDIYMEFGG 270


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv1474c   HTHTETR  69     6e-17    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 69.3 bits (169), Expect = 6e-17
Identities = 32/177 (18%), Positives = 63/177 (35%), Gaps = 14/177 (7%)

Query: 1 MPKVSEDHLAARRRQILDGARRCFAEYGYDKATVRRLEQAIGMSRGAIFHHFRDKDALFF 60
M + ++ R+ ILD A R F++ G ++ + +A G++RGAI+ HF+DK LF
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 61 ALAREDTERMAAVAS--------------REGLIGVMRDMLAAPDQFDWLATRLEIARKL 106
+ + + RE LI V+ + + + +
Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120

Query: 107 RNDPDFSRGWAERSAELAAATTDRLRRQKQANRVRDDVPSDVLRCYLDLVLDGLLAR 163
+ E L+ +A + D+ + + + GL+
Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMEN 177


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
Rv1477    GPOSANCHOR  49     4e-08    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 48.5 bits (115), Expect = 4e-08
Identities = 33/199 (16%), Positives = 60/199 (30%), Gaps = 14/199 (7%)

Query: 44 DTIAALIADVAKANQRLQDLSDEVQAEQESVNKAMVDVETARD-------NAAAAEDDLE 96
AAL A A+ + L+ + A+ + + A +
Sbjct: 183 AEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFST 242

Query: 97 VSQRAVKDANAAIAAAQHRFDTFAAATYMNGPSVSYLSASSPDEIIATVTAAKTLSASSQ 156
+K A AA + R A + S +I L A
Sbjct: 243 ADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADS----AKIKTLEAEKAALEAEKA 298

Query: 157 AVMANLQRARTERVNTE---SAARLAKQKADKAAADAKASQDAAVAALTETRRKFDEQRE 213
+ Q R + A+R AK++ + + + A+ RR D RE
Sbjct: 299 DLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASRE 358

Query: 214 EVQRLAAERDAAQARLQAA 232
++L AE + + + +
Sbjct: 359 AKKQLEAEHQKLEEQNKIS 377



Score = 45.1 bits (106), Expect = 4e-07
Identities = 35/193 (18%), Positives = 67/193 (34%), Gaps = 8/193 (4%)

Query: 46 IAALIADVAKANQRLQDLSDEVQAEQESVNKAMVDVETARDNAAAAEDDLEVSQRAVKDA 105
AAL A A + L+ + A+ + + A E LE +
Sbjct: 220 KAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTAD 279

Query: 106 NAAIAAAQHRFD----TFAAATYMNGPSVSYLSASSPDEIIATVTAAKTLSASSQAVMAN 161
+A I + A + + + + D + A+ A K L A Q +
Sbjct: 280 SAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRD-LDASREAKKQLEAEHQKLEEQ 338

Query: 162 LQRARTERVNTE---SAARLAKQKADKAAADAKASQDAAVAALTETRRKFDEQREEVQRL 218
+ + R + A+R AK++ + + + A+ RR D RE +++
Sbjct: 339 NKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQV 398

Query: 219 AAERDAAQARLQA 231
+ A ++L A
Sbjct: 399 EKALEEANSKLAA 411



Score = 36.6 bits (84), Expect = 2e-04
Identities = 28/220 (12%), Positives = 62/220 (28%), Gaps = 27/220 (12%)

Query: 36 LATADPQTDTIAALIADVAKANQRLQDLSDEVQAEQESVNKAMVDVETARDNAAAAEDDL 95
L+ + + A AD+ KA + + S A+ +++ + + + A +
Sbjct: 108 LSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGA 167

Query: 96 EVSQRAVKDANAAIAAAQHRFDTFAA---ATYMNGPSVSYLSASSPDEIIATVTAAKTLS 152
A + A + + A + S ++ + A A
Sbjct: 168 MNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARK 227

Query: 153 ASSQAVMANLQRARTERVNTESAARLAK-------------------QKADKAAADAKAS 193
A + +T +A++ + A + A
Sbjct: 228 ADLEK-----ALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAK 282

Query: 194 QDAAVAALTETRRKFDEQREEVQRLAAERDAAQARLQAAR 233
A + + + Q L A R + + L A+R
Sbjct: 283 IKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASR 322


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
Rv1479    HTHFIS  33     0.002    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 32.9 bits (75), Expect = 0.002
Identities = 32/150 (21%), Positives = 56/150 (37%), Gaps = 28/150 (18%)

Query: 54 IVGQD----QLVERMLVGLLSKGHVLLEGVPGVAKTL---AVETFARVVGGTFSRIQ--- 103
+VG+ ++ + + + +++ G G K L A+ + + G F I
Sbjct: 139 LVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAA 198

Query: 104 FTPDLVPTDIIGTRI--------YRQGREEFDTELGPVVAN--FLLADEINRAPAKVQSA 153
DL+ +++ G GR E A L DEI P Q+
Sbjct: 199 IPRDLIESELFGHEKGAFTGAQTRSTGRFE--------QAEGGTLFLDEIGDMPMDAQTR 250

Query: 154 LLEVMQERHVSIGGRTFPMPSPFLVMATQN 183
LL V+Q+ + G P+ S ++A N
Sbjct: 251 LLRVLQQGEYTTVGGRTPIRSDVRIVAATN 280


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv1481    TCRTETB  29     0.022    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 29.5 bits (66), Expect = 0.022
Identities = 19/112 (16%), Positives = 39/112 (34%), Gaps = 16/112 (14%)

Query: 4 PLLGPMTLSGFAHSWFFLFLF----VVAGLVALYILMQLARQRRMLRFANMELLES---- 55
P +G M W +L L ++ + +L + R + + L+
Sbjct: 156 PAIGGMIAHYI--HWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVF 213

Query: 56 -VAPKRPSRWRHVPAILLVLSLLLFTIAMAGPTH---DVRIPRNRAVVMLVI 103
+ + I+ VLS L+F + T D + +N ++ V+
Sbjct: 214 FMLFTTSYSISFL--IVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVL 263


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv1483    DHBDHDRGNASE  116    1e-33    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 116 bits (291), Expect = 1e-33
Identities = 70/252 (27%), Positives = 124/252 (49%), Gaps = 21/252 (8%)

Query: 16 RSVLVTGGNRGIGLAIAQRLAADGHKVAVTHRGSGAPKGLFGVE-----------CDVTD 64
+ +TG +GIG A+A+ LA+ G +A + + DV D
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 65 SDAVDRAFTAVEEHQGPVEVLVSNAGLSADAFLMRMTEEKFEKVINANLTGAFRVAQRAS 124
S A+D +E GP+++LV+ AG+ + +++E++E + N TG F ++ S
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 125 RSMQRNKFGRMIFIGSVSGSWGIGNQANYAASKAGVIGMARSIARELSKANVTANVVAPG 184
+ M + G ++ +GS + A YA+SKA + + + EL++ N+ N+V+PG
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 185 YIDTDMTRAL-------DERIQQGALQF---IPAKRVGTPAEVAGVVSFLASEDASYISG 234
+TDM +L ++ I+ F IP K++ P+++A V FL S A +I+
Sbjct: 189 STETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHITM 248

Query: 235 AVIPVDGGMGMG 246
+ VDGG +G
Sbjct: 249 HNLCVDGGATLG 260


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv1484    DHBDHDRGNASE  44     2e-07    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 44.3 bits (104), Expect = 2e-07
Identities = 54/272 (19%), Positives = 100/272 (36%), Gaps = 32/272 (11%)

Query: 5 LDGKRILVSGIITDSSIAFHIARVAQEQGAQLVLTGFD----RLRLIQRITDRLPAKAPL 60
++GK ++G I +AR QGA + D +L + A
Sbjct: 6 IEGKIAFITG--AAQGIGEAVARTLASQGAHI--AAVDYNPEKLEKVVSSLKAEARHAEA 61

Query: 61 LELDVQNEEHLASLAGRVTEAIGAGNKLDGVVHSIGFMPQTGMGINPFFDAPYADVSKGI 120
DV++ + + R+ +G +D +V+ G + +
Sbjct: 62 FPADVRDSAAIDEITARIEREMG---PIDILVNVAGVLR-----PGLIHSLSDEEWEATF 113

Query: 121 HISAYSYASMAKALLPIM--NPGGSIVGMDFD----PSRAMPAYNWMTVAKSALESVNRF 174
+++ + ++++ M GSIV + + P +M AY +K+A +
Sbjct: 114 SVNSTGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYA---SSKAAAVMFTKC 170

Query: 175 VAREAGKYGVRSNLVAAGPIRTLAMSAIVGGALGEEAGAQIQLLEEGWDQRAPIGWNMKD 234
+ E +Y +R N+V+ G T ++ G E I+ E + P+ K
Sbjct: 171 LGLELAEYNIRCNIVSPGSTETDMQWSLWADENGAE--QVIKGSLETFKTGIPLK---KL 225

Query: 235 ATP--VAKTVCALLSDWLPATTGDIIYADGGA 264
A P +A V L+S T + DGGA
Sbjct: 226 AKPSDIADAVLFLVSGQAGHITMHNLCVDGGA 257


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
Rv1488    IGASERPTASE  29     0.047    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 28.9 bits (64), Expect = 0.047
Identities = 34/211 (16%), Positives = 65/211 (30%), Gaps = 24/211 (11%)

Query: 158 LRVARVELRSIDPPPS---IQASMEKQMKADREKRAMILTAEGTREAAIKQAEGQKQAQI 214
VA+ + + + A++EK+ KA E Q + +Q+
Sbjct: 1083 NEVAQSGSETKETQTTETKETATVEKEEKAKVETEK-------------TQEVPKVTSQV 1129

Query: 215 LAAEGAKQAAILAAEADRQSRMLRAQGERAAAYLQAQGQAKAIEKTFAAIKAGRPTPEML 274
+ + AE R++ E Q+Q A + A + +
Sbjct: 1130 SPKQEQSETVQPQAEPARENDPTVNIKEP-----QSQTNTTADTEQPAKETSSNVEQPVT 1184

Query: 275 AYQYLQTLPEMARGDANKVWVVPSDFNAALQGFTRLLGKPGEDGVFRFEPSPVEDQPKHA 334
+ T + N P+ + + K R P VE +
Sbjct: 1185 ESTTVNTGNSVVE---NPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSS 1241

Query: 335 ADGDDAEVAGWFSTDTDPSIARAVATAEAIA 365
D + ST+T+ ++ A A A+ +A
Sbjct: 1242 NDRSTVALCDLTSTNTNAVLSDARAKAQFVA 1272


65  Rv1680  Rv1686c  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv1680    -1      11       -0.840196  Hypothetical protein
Rv1681    -1      11       -0.984863  Possible molybdopterin biosynthesis protein
Rv1682    0       11       -1.643605  Probable coiled-coil structural protein
Rv1683    1       10       -0.330308  Possible bifunctional enzyme; long-chain
Rv1684    -1      12       -0.735854  Conserved hypothetical protein
Rv1685c   -1      14       -1.141159  Conserved hypothetical protein
Rv1686c   -1      13       0.076475   Probable conserved integral membrane protein ABC
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv1680    VACJLIPOPROT  30     0.006    VacJ lipoprotein signature.
		>VACJLIPOPROT#VacJ lipoprotein signature.

Length = 251

Score = 30.3 bits (68), Expect = 0.006
Identities = 17/84 (20%), Positives = 29/84 (34%), Gaps = 12/84 (14%)

Query: 4 EPLVVGAVAYTPNVVPIWEGIRGYFQDSESPDTQMDFVLYSNYARLVDSL---------- 53
P+ V Y P P G+ + + E P +++ L + + +
Sbjct: 53 RPVAVAWRDYVPQ--PARNGLSNFTGNLEEPAVMVNYFLQGDPYQGMVHFTRFFLNTILG 110

Query: 54 IAGHIDIAWNTNLAYVRTVLQTGG 77
+ G ID+A N RT G
Sbjct: 111 MGGFIDVAGMANPKLQRTEPHRFG 134


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv1681    PF01206  29     0.006    SirA family protein
		>PF01206#SirA family protein

Length = 76

Score = 29.0 bits (65), Expect = 0.006
Identities = 9/59 (15%), Positives = 21/59 (35%), Gaps = 1/59 (1%)

Query: 2 IIELMRRVVGLAQGATAEVAVYGDRDRDLAERWCANTGNTLVRADVDQTGVGTLVVRRG 60
I++ + + + G V E + TG+ L+ ++ G ++R
Sbjct: 19 ILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHELLEQK-EEDGTYHFRLKRA 76


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv1685c   HTHTETR  71     3e-17    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 70.8 bits (173), Expect = 3e-17
Identities = 29/121 (23%), Positives = 46/121 (38%), Gaps = 11/121 (9%)

Query: 13 RPAGSSDTRERILSSARELFAHNGIDRTSIRAVAAKAGVDAALVHHYFGTKQQLFAAAIH 72
+ +TR+ IL A LF+ G+ TS+ +A AGV ++ +F K LF+
Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64

Query: 73 IPIDPMVIIGPIREAPVEELGYKLPSLLLPIWDSELGAGLIATLRSLISGSDVGLARSFL 132
+ I E +E K P L + LI L S ++ L +
Sbjct: 65 LSES------NIGELELEYQA-KFPGDPLSVL----REILIHVLESTVTEERRRLLMEII 113

Query: 133 E 133

Sbjct: 114 F 114


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv1686c   ABC2TRNSPORT  48     6e-09    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 48.4 bits (115), Expect = 6e-09
Identities = 38/172 (22%), Positives = 74/172 (43%), Gaps = 2/172 (1%)

Query: 53 TMQRERASGTLERILTTPLRRLDLLAGYGTAFSIAAAAQATLACIVAFWFLGFDTAGSPV 112
R T E +L T LR D++ G A++ AA A V LG+ S +
Sbjct: 90 AFGRMEGQRTWEAMLYTQLRLGDIVLGE-MAWAATKAALAGAGIGVVAAALGYTQWLSLL 148

Query: 113 WVFAIAIVNAVLGVGLGLLCSAFARTEFQAVQFIPLVMVPQLLLAGIIVPRALMPTWLEW 172
+ + + + LG++ +A A + + + LV+ P L L+G + P +P +
Sbjct: 149 YALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQT 208

Query: 173 ISNVMPASYALEALQQVGAHPELTGIAVRDVVVVLSFAVASLCLAAVTLRRR 224
+ +P S++++ ++ + + + V + + V L+ LRRR
Sbjct: 209 AARFLPLSHSIDLIRPIMLGHPVVDVCQH-VGALCIYIVIPFFLSTALLRRR 259


66  Rv1816  Rv1822  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv1816    -2      9        0.738457   Possible transcriptional regulatory protein
Rv1817    -1      9        0.529489   Possible flavoprotein
Rv1818c   -2      9        0.913296   PE-PGRS family protein PE_PGRS33
Rv1819c   -1      9        -1.404665  Probable drug-transport transmembrane
Rv1820    -1      10       -1.035426  Probable acetolactate synthase IlvG
Rv1821    1       12       -2.142926  Possible preprotein translocase ATPase SecA2
Rv1822    1       18       -2.669416  Probable
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv1816    HTHTETR  53     1e-10    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 53.1 bits (127), Expect = 1e-10
Identities = 22/160 (13%), Positives = 50/160 (31%), Gaps = 6/160 (3%)

Query: 8 GKRRDAREQIEAKIVELGRRQLLDHGAAGLSLRAIARNLGMVSSAVYRYVSSRDELLTLL 67
K + ++ I+++ R G + SL IA+ G+ A+Y + + +L + +
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 68 LVDAYSDLADTVDRARDDTVADSWSDDVIAIARAVRGWAVTNPARWALL----YGSPVPG 123
+ S++ + + D S + + VT R L+ + G
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLE-STVTEERRRLLMEIIFHKCEFVG 121

Query: 124 YHAPPDRT-AGVATRVVGAFFDAIAAGIATGDIRLTDDVA 162
A + + + I +
Sbjct: 122 EMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTR 161


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv1818c   cloacin  37     1e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 37.4 bits (86), Expect = 1e-04
Identities = 38/118 (32%), Positives = 46/118 (38%), Gaps = 11/118 (9%)

Query: 158 GAAGLFGNGGAGGAGGNVASGTAGFGGAGGA--GGLLYGAGGAGGAGGRAGGGVGGIGGA 215
G G N GA GN+ G G G GGA G G G +G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 216 GGAGGNGGLLFGAGGAGGVGGLAADAGDGGAGGDGGLFFGVGGAGGAGGTGTNVTGGA 273
G GGNG G+G G + +AA G F GAGG +++ GA
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFG---------FPALSTPGAGGLAVSISAGA 111



Score = 37.4 bits (86), Expect = 1e-04
Identities = 27/82 (32%), Positives = 30/82 (36%)

Query: 262 AGGTGTNVTGGAGGAGGNGGLLFGAGGVGGVGGDGVAFLGTAPGGPGGAGGAGGLFGVGG 321
+GG G GA GN GVGG DG + GG+G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 322 AGGAGGIGLVGNGGAGGSGGSA 343
G GG G G G G SA
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSA 83



Score = 33.5 bits (76), Expect = 0.002
Identities = 31/101 (30%), Positives = 38/101 (37%), Gaps = 2/101 (1%)

Query: 140 GNGGAGGSGAAGMPGGNGGAAGLFGNGGAGGAGGNVASGTAGFGGAGGAGGLLYGAGGAG 199
G+G +GA G G G GG G +S +GG G+G G G G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 200 GAGGRAGGGVGGIGGAGGAGGNGGLLFG--AGGAGGVGGLA 238
GG G G G + + FG A G GGLA
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.4 bits (73), Expect = 0.004
Identities = 25/81 (30%), Positives = 33/81 (40%)

Query: 408 GAGGAGGFGFGGAGGAGGLGGKAGLIGDGGDGGAGGNGTGAKGGDGGAGGGAILVGNGGN 467
GA G GG G G GG + G + G G+G+ GG G GNG +
Sbjct: 12 GAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNS 71

Query: 468 GGNAGSGTPNGSAGTGGAGGL 488
GG +G+G + A G
Sbjct: 72 GGGSGTGGNLSAVAAPVAFGF 92



Score = 32.4 bits (73), Expect = 0.005
Identities = 32/107 (29%), Positives = 35/107 (32%), Gaps = 5/107 (4%)

Query: 321 GAGGAGGIGLVGNGGAGGSGGSALLWGDGGAGGAGGVGSTTGGAGGAGGNAGLLVGAGGA 380
G G G + +GG L GGA G S GG G+ G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 381 GGAGALGGGATGVGGAGGNGGTAGLLFGAGGAGGFGFGGAGGAGGLG 427
G G G G G G A A A GF GAGGL
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVA-----APVAFGFPALSTPGAGGLA 104



Score = 32.0 bits (72), Expect = 0.005
Identities = 26/79 (32%), Positives = 31/79 (39%)

Query: 380 AGGAGALGGGATGVGGAGGNGGTAGLLFGAGGAGGFGFGGAGGAGGLGGKAGLIGDGGDG 439
+GG G NGG GL G G + G G+ G G +G+ GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 440 GAGGNGTGAKGGDGGAGGG 458
G G G GG G GG
Sbjct: 62 HGNGGGNGNSGGGSGTGGN 80



Score = 31.6 bits (71), Expect = 0.008
Identities = 22/72 (30%), Positives = 30/72 (41%)

Query: 120 NGANGAPGTGANGGDGGILIGNGGAGGSGAAGMPGGNGGAAGLFGNGGAGGAGGNVASGT 179
N + NGG G+ +G G + GSG + GG +G + G G GN
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69

Query: 180 AGFGGAGGAGGL 191
GG+G G L
Sbjct: 70 NSGGGSGTGGNL 81



Score = 30.8 bits (69), Expect = 0.014
Identities = 40/133 (30%), Positives = 43/133 (32%), Gaps = 2/133 (1%)

Query: 293 GGDGVAFLGTAPGGPGGAGGAGGLFGVGGAGGAGGIGLVGNGGAGGSGGSALLWG--DGG 350
GGDG A G G GVGG G N GG GS + WG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 351 AGGAGGVGSTTGGAGGAGGNAGLLVGAGGAGGAGALGGGATGVGGAGGNGGTAGLLFGAG 410
G G S G G +A A G G G V + G A A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMAA 122

Query: 411 GAGGFGFGGAGGA 423
G F FG G A
Sbjct: 123 LKGPFKFGLWGVA 135



Score = 30.1 bits (67), Expect = 0.026
Identities = 21/71 (29%), Positives = 26/71 (36%)

Query: 362 GGAGGAGGNAGLLVGAGGAGGAGALGGGATGVGGAGGNGGTAGLLFGAGGAGGFGFGGAG 421
GA GN G GG + G G + G G +G+ +G G G G G
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGN 70

Query: 422 GAGGLGGKAGL 432
GG G L
Sbjct: 71 SGGGSGTGGNL 81


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Rv1821    SECA  882    0.0      SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 882 bits (2280), Expect = 0.0
Identities = 285/815 (34%), Positives = 401/815 (49%), Gaps = 116/815 (14%)

Query: 48 RLLGASTEKNRSRSLADVTASAEYDKEAADLSDEKLR-KAAGLLNLDDLAESAD--IPQF 104
++ G+ ++ R V + E LSDE+L+ K A + E + IP+
Sbjct: 8 KVFGSRNDRTLRRMRKVVNIINAMEPEMEKLSDEELKGKTAEFRARLEKGEVLENLIPEA 67

Query: 105 LAIAREAAERRTGLRPFDVQLLGALRMLAGDVIEMATGEGKTLAGAIAAAGYALAGRHVH 164
A+ REA++R G+R FDVQLLG + + + EM TGEGKTL + A AL G+ VH
Sbjct: 68 FAVVREASKRVFGMRHFDVQLLGGMVLNERCIAEMRTGEGKTLTATLPAYLNALTGKGVH 127

Query: 165 VVTINDYLARRDAEWMGPLLDAMGLTVGWITADSTPDERRTAYDRDVTYASVNEIGFDVL 224
VVT+NDYLA+RDAE PL + +GLTVG +R AY D+TY + NE GFD L
Sbjct: 128 VVTVNDYLAQRDAENNRPLFEFLGLTVGINLPGMPAPAKREAYAADITYGTNNEYGFDYL 187

Query: 225 RDQLVTDVNDLVSPNPDVALIDEADSVLVDEALVPLVLAGTTHRETPRLEII-RLVAELV 283
RD + + V AL+DE DS+L+DEA PL+++G + + + +++ L+
Sbjct: 188 RDNMAFSPEERVQRKLHYALVDEVDSILIDEARTPLIISGPAEDSSEMYKRVNKIIPHLI 247

Query: 284 GDKD-------ADEYFATDSDNRNVHLTEHGARKVEKAL-------GGIDLYSEEHVGTT 329
+ + +F+ D +R V+LTE G +E+ L G LYS ++
Sbjct: 248 RQEKEDSETFQGEGHFSVDEKSRQVNLTERGLVLIEELLVKEGIMDEGESLYSPANI-ML 306

Query: 330 LTEVNVALHAHVLLQRDVHYIVRDDAVHLINASRGRIAQLQRWPDGLQAAVEAKEGIETT 389
+ V AL AH L RDV YIV+D V +++ GR Q +RW DGL AVEAKEG++
Sbjct: 307 MHHVTAALRAHALFTRDVDYIVKDGEVIIVDEHTGRTMQGRRWSDGLHQAVEAKEGVQIQ 366

Query: 390 ETGEVLDTITVQALINRYATVCGMTGTALAAGEQLRQFYQLGVSPIPPNKPNIREDEADR 449
+ L +IT Q Y + GMTGTA + Y+L +P N+P IR+D D
Sbjct: 367 NENQTLASITFQNYFRLYEKLAGMTGTADTEAFEFSSIYKLDTVVVPTNRPMIRKDLPDL 426

Query: 450 VYITTAAKNDGIVEHITEVHQRGQPVLVGTRDVAESEELHERLVRRGVPAVVLNAKNDAE 509
VY+T A K I+E I E +GQPVLVGT + +SE + L + G+ VLNAK A
Sbjct: 427 VYMTEAEKIQAIIEDIKERTAKGQPVLVGTISIEKSELVSNELTKAGIKHNVLNAKFHAN 486

Query: 510 EARVIAEAGKYGAVTVSTQMAGRGTDIRLGGSDEA----------------------DHD 547
EA ++A+AG AVT++T MAGRGTDI LGGS +A HD
Sbjct: 487 EAAIVAQAGYPAAVTIATNMAGRGTDIVLGGSWQAEVAALENPTAEQIEKIKADWQVRHD 546

Query: 548 RVAELGGLHVVGTGRHHTERLDNQLRGRAGRQGDPGSSVFFSSWEDDV--------VAAN 599
V E GGLH++GT RH + R+DNQLRGR+GRQGD GSS F+ S ED + V+
Sbjct: 547 AVLEAGGLHIIGTERHESRRIDNQLRGRSGRQGDAGSSRFYLSMEDALMRIFASDRVSGM 606

Query: 600 LDHNKLPMATDENGRIVSPRTGSLLDHAQRVAEGRLLDVHANTWRYNQLIAQQRAIIVER 659
+ KL M E I P + +AQR E R D+ Y+ + QR I +
Sbjct: 607 M--RKLGMKPGE--AIEHPWVTKAIANAQRKVESRNFDIRKQLLEYDDVANDQRRAIYSQ 662

Query: 660 RNTLLRTVTAREEL-------------AELAPKRYEELSD-------------------- 686
RN LL E + A + P+ EE+ D
Sbjct: 663 RNELLDVSDVSETINSIREDVFKATIDAYIPPQSLEEMWDIPGLQERLKNDFDLDLPIAE 722

Query: 687 ------KVSEERL-ETIC-----------------------RQIMLYHLDRGWADHLAYL 716
++ EE L E I + +ML LD W +HLA +
Sbjct: 723 WLDKEPELHEETLRERILAQSIEVYQRKEEVVGAEMMRHFEKGVMLQTLDSLWKEHLAAM 782

Query: 717 ADIRESIHLRALGRQNPLDEFHRMAVDAFASLAAD 751
+R+ IHLR +++P E+ R + FA++
Sbjct: 783 DYLRQGIHLRGYAQKDPKQEYKRESFSMFAAMLES 817


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv1822    TYPE3IMSPROT  30     0.007    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 29.7 bits (67), Expect = 0.007
Identities = 12/88 (13%), Positives = 32/88 (36%), Gaps = 14/88 (15%)

Query: 135 FGFMVGFPTILLGQCDPLWS----HVLLACGWAFLIWGMYAYLWAFVLYAVQMTM----V 186
++ PT + PL +++ C F++ + Y + + Y ++ M +
Sbjct: 162 LVTLLQLPTCGIECITPLLGQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEI 221

Query: 187 VRQM------PKLKGRAHRPAAQNAGER 208
R+ P++K + + +
Sbjct: 222 KREYKEMEGSPEIKSKRRQFHQEIQSRN 249


67  Rv1876  Rv1883c  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv1876    -1      12       -2.433902  Probable bacterioferritin BfrA
Rv1877    -1      12       -1.881992  Probable conserved integral membrane protein
Rv1878    -3      15       -2.143977  Probable glutamine synthetase GlnA3 (glutamine
Rv1879    -1      15       -2.483028  Conserved hypothetical protein
Rv1880c   -2      16       -2.278234  Probable cytochrome P450 140 Cyp140
Rv1881c   0       18       -2.339150  Possible conserved lipoprotein LppE
Rv1882c   0       15       -1.429246  Probable short-chain type
Rv1883c   1       14       -1.920536  Conserved hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
Rv1876    HELNAPAPROT  27     0.026    Helicobacter neutrophil-activating protein A family ...
		>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature.

Length = 153

Score = 26.8 bits (59), Expect = 0.026
Identities = 20/105 (19%), Positives = 40/105 (38%), Gaps = 13/105 (12%)

Query: 47 ESFDEMR-----HAEEITDRILLLDGLPN--YQRIGSLR------IGQTLREQFEADLAI 93
E F+E+ + I +R+L + G P + + E +A +
Sbjct: 48 EKFEELYDHAAETVDTIAERLLAIGGQPVATVKEYTEHASITDGGNETSASEMVQALVND 107

Query: 94 EYDVLNRLKPGIVMCREKQDTTSAVLLEKIVADEEEHIDYLETQL 138
+ + K I + E QD +A L ++ + E+ + L + L
Sbjct: 108 YKQISSESKFVIGLAEENQDNATADLFVGLIEEVEKQVWMLSSYL 152


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv1877    TCRTETB  140    7e-38    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 140 bits (354), Expect = 7e-38
Identities = 96/424 (22%), Positives = 186/424 (43%), Gaps = 18/424 (4%)

Query: 22 SPVRRNIIFTALVFGVLVAATGQTIVVPALPTIVAELGSTVDQ-SWAVTSYLLGGTVVVV 80
S +R N I L + + ++ +LP I + +W T+++L ++
Sbjct: 8 SNLRHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTA 67

Query: 81 VAGKLGDLLGRNRVLLGSVVVFVVGSVLCGLSQTMTMLAI-SRALQGVGAGAISVTAYAL 139
V GKL D LG R+LL +++ GSV+ + + L I +R +QG GA A +
Sbjct: 68 VYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVV 127

Query: 140 AAEVVPLRDRGRYQGVLGAVFGVNTVTGPLLGGWLTDYLSWRWAFWINVPVSIAVLTVAA 199
A +P +RG+ G++G++ + GP +GG + Y+ W++ + +P+ +
Sbjct: 128 VARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIH--WSYLLLIPMITIITVPFL 185

Query: 200 TAVPALARPPKPVIDYLGILVIAVATTALIMATSWGGTTYAWGSATIVGLLIGAAVALGF 259
+ K D GI++++V ++ T T+Y+ LI + ++
Sbjct: 186 MKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFT----TSYSI------SFLIVSVLSFLI 235

Query: 260 FVWLEGRAAAAILPPRLFGSPVFAVCCVLSFVVGFAMLGALTFVPIYLGYVDGAS-ATAS 318
FV + + P L + F + + ++ + G ++ VP + V S A
Sbjct: 236 FVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIG 295

Query: 319 GLRTLPMVIGLLIASTGTGVLVGRTGRYKIFPVAGMALMAVAFLLMSQMDEWTPPLLQSL 378
+ P + ++I G+LV R G + + G+ ++V+FL S + E T ++
Sbjct: 296 SVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNI-GVTFLSVSFLTASFLLETT-SWFMTI 353

Query: 379 YLVVLGAGIGLSMQVLVLIVQNTSSFEDLGVATSGVTFFRVVGASFGTATFGALF-VNFL 437
+V + G+ + V+ IV ++ ++ G S + F + G A G L + L
Sbjct: 354 IIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLL 413

Query: 438 DRRL 441
D+RL
Sbjct: 414 DQRL 417


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv1882c   DHBDHDRGNASE  64     3e-14    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 64.3 bits (156), Expect = 3e-14
Identities = 58/223 (26%), Positives = 93/223 (41%), Gaps = 10/223 (4%)

Query: 2 KAIFITGAGSGMGREGATLFHANGWRVGAIDRNEDGLAALRVQLGAERLWARA--VDVTD 59
K FITGA G+G A + G + A+D N + L + L AE A A DV D
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 60 KAALEGALADFCAGNVGGGLDMMWNNAGIGEGGWFEDVPYEAAVRVVDVNFKAVLTGAYA 119
AA++ A G +D++ N AG+ G + E VN V + +
Sbjct: 69 SAAIDEITARIEREM--GPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126

Query: 120 ALPYLKKAPGSLMFSTSSSSGTYGMPR--IAVYSATKHAVKGLTEALSVEWQRHGVRVAD 177
Y+ + + S+ G+PR +A Y+++K A T+ L +E + +R
Sbjct: 127 VSKYMMDRRSGSIVTVGSNPA--GVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184

Query: 178 VLPGLIDTAILTS--TRQHSDEGPYTISAEQIRAAAPKKGMFR 218
V PG +T + S ++ E S E + P K + +
Sbjct: 185 VSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAK 227


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
Rv1883c   TCRTETOQM  27     0.040    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 26.7 bits (59), Expect = 0.040
Identities = 8/30 (26%), Positives = 15/30 (50%)

Query: 120 TVYYRVFGGWLRQRRNIRDMTKTLQRIKDL 149
Y R++ G L R ++R K +I ++
Sbjct: 265 LAYIRLYSGVLHLRDSVRISEKEKIKITEM 294


68  Rv2041c  Rv2048c  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv2041c   -2      10       -0.405815  Probable sugar-binding lipoprotein
Rv2042c   -2      8        0.738443   Conserved protein
Rv2043c   1       17       1.344331   Pyrazinamidase/nicotinamidase PncA (PZase)
Rv2044c   1       17       1.467839   Conserved hypothetical protein
Rv2045c   1       17       1.484009   Carboxylesterase LipT
Rv2046    0       17       1.585892   Probable lipoprotein LppI
Rv2047c   0       16       1.794154   Conserved hypothetical protein
Rv2048c   0       17       1.827188   Polyketide synthase Pks12
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
Rv2041c   MALTOSEBP  48     4e-08    Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 48.2 bits (114), Expect = 4e-08
Identities = 62/267 (23%), Positives = 110/267 (41%), Gaps = 33/267 (12%)

Query: 79 QQLATFCAGGKCPDVLMAWELTYAELADRGVLLDLNTLLARDQAFAAELKSDSIGALYET 138
++ A G PD++ + A G+L ++ D+AF +L + ++
Sbjct: 71 EKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEITP----DKAFQDKLYPFT----WDA 122

Query: 139 FTFNGGQYAFPEQWSGNFLFYNKQLFDDAGVPPPPGSWERPWSFAEFLDAAQALTKQGRS 198
+NG A+P L YNK L +P PP +WE E + L +G+S
Sbjct: 123 VRYNGKLIAYPIAVEALSLIYNKDL-----LPNPPKTWE------EIPALDKELKAKGKS 171

Query: 199 GRDRQWGFVNAWVSFYAAGLFAMNNGVPWSVP--RMNPTHLNFDHDGFLEAVQFYADL-T 255
N ++ L A + G + + + + D+ G + F DL
Sbjct: 172 AL-----MFNLQEPYFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIK 226

Query: 256 NKHKVAPSAAEQQSMSTADLFSVGKAGIALAGHWRYQTFDRADGLDFDVAPLPIGPRGRA 315
NKH +A S++ A F+ G+ + + G W + D + +++ V LP +G+
Sbjct: 227 NKHM---NADTDYSIAEA-AFNKGETAMTINGPWAWSNIDTSK-VNYGVTVLPTF-KGQP 280

Query: 316 ACSDIGVTGLAIAATSRRKDQAWEFVK 342
+ +GV I A S K+ A EF++
Sbjct: 281 SKPFVGVLSAGINAASPNKELAKEFLE 307


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2043c   ISCHRISMTASE  31     0.003    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 30.8 bits (69), Expect = 0.003
Identities = 22/80 (27%), Positives = 32/80 (40%), Gaps = 7/80 (8%)

Query: 103 YSGFEGVDENGTPLLNWLRQRGVDEVDVVGIATDHCVRQTAEDAVRNGLATRVLVDLTAG 162
YS F+ T LL +R+ G D++ + GI TA +A + + D A
Sbjct: 126 YSAFKR-----TNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVAD 180

Query: 163 VSADTTVAALEEM--RTASV 180
S + ALE R A
Sbjct: 181 FSLEKHQMALEYAAGRCAFT 200


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2047c   PHPHTRNFRASE  65     5e-13    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 65.2 bits (159), Expect = 5e-13
Identities = 26/96 (27%), Positives = 44/96 (45%), Gaps = 2/96 (2%)

Query: 743 GGRVRGRVRIVRPETIDDLQPGEILVAEVTDVGYTAAF--CYAAAVVTELGGPMSHAAVV 800
RV G + V ++ + +++AE TA + T++GG SH+A++
Sbjct: 135 SKRVLGHLIGVETGSLATIAEETVIIAEDLTPSDTAQLNKQFVKGFATDIGGRTSHSAIM 194

Query: 801 AREFGFPCVVDAQGATRFLPPGALVEVDGATGEIHV 836
+R P VV + T + G +V VDG G + V
Sbjct: 195 SRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIV 230


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2048c   NUCEPIMERASE  40     2e-04    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 39.8 bits (93), Expect = 2e-04
Identities = 30/142 (21%), Positives = 45/142 (31%), Gaps = 24/142 (16%)

Query: 1681 TVLITGGTGMAGSAVARHVVAR-HGVRNLVLVSRRGPDA------PGAAELVAELAAAGA 1733
L+TG G G V++ ++ H V G D + EL A
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQV--------VGIDNLNDYYDVSLKQARLELLAQP- 52

Query: 1734 QVQVVACDAADRAALAKVIADIPVQHPLSGVIHTAGALDDAVVMSLTPDRVDVVLRSKVD 1793
Q D ADR + + A + V + L AV SL S +
Sbjct: 53 GFQFHKIDLADREGMTDLFASGHFER----VFISPHRL--AVRYSLENPHAYA--DSNLT 104

Query: 1794 AAWHLHELTRDLDVSAFVMFSS 1815
++ E R + + SS
Sbjct: 105 GFLNILEGCRHNKIQHLLYASS 126



Score = 39.8 bits (93), Expect = 2e-04
Identities = 30/142 (21%), Positives = 45/142 (31%), Gaps = 24/142 (16%)

Query: 3711 TVLITGGTGMAGSAVARHVVAR-HGVRNLVLVSRRGPDA------PGAAELVAELAAAGA 3763
L+TG G G V++ ++ H V G D + EL A
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQV--------VGIDNLNDYYDVSLKQARLELLAQP- 52

Query: 3764 QVQVVACDAADRAALAKVIADIPVQHPLSGVIHTAGALDDAVVMSLTPDRVDVVLRSKVD 3823
Q D ADR + + A + V + L AV SL S +
Sbjct: 53 GFQFHKIDLADREGMTDLFASGHFER----VFISPHRL--AVRYSLENPHAYA--DSNLT 104

Query: 3824 AAWHLHELTRDLDVSAFVMFSS 3845
++ E R + + SS
Sbjct: 105 GFLNILEGCRHNKIQHLLYASS 126


69  Rv2209  Rv2216  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias   Product
Rv2209    -1      9        0.757401  Probable conserved integral membrane protein
Rv2210c   -2      10       0.924784  Branched-chain amino acid transaminase IlvE
Rv2211c   -2      10       1.222288  Probable aminomethyltransferase GcvT (glycine
Rv2212    -1      11       1.403827  Adenylyl cyclase (ATP pyrophosphate-lyase)
Rv2213    -1      12       1.434063  Probable aminopeptidase PepB
Rv2214c   -1      12       1.378826  Possible short-chain dehydrogenase EphD
Rv2215    -1      12       1.830383  DlaT, dihydrolipoamide acyltransferase, E2
Rv2216    -1      16       0.031372  Conserved protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
Rv2209    RTXTOXINA  36     3e-04    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 36.5 bits (84), Expect = 3e-04
Identities = 32/107 (29%), Positives = 45/107 (42%), Gaps = 12/107 (11%)

Query: 66 LGNSLSPLI-LQRAGQLRHLLMAAISATAAALVVCNAAVPWTGVGVAAVF-LATTGAGGV 123
+GN L L L G + +SA +A+ ++ NA T AA L T G V
Sbjct: 225 VGNKLQNLPNLDNIGAGLDTVSGILSAISASFILSNADAD-TRTKAAAGVELTTKVLGNV 283

Query: 124 VTGVSSVAYTDMISSMLPAVRRGELLLTQGAAGSVLATGVTLVIVPM 170
IS + A R + L T AA ++A+ VTL I P+
Sbjct: 284 ---------GKGISQYIIAQRAAQGLSTSAAAAGLIASAVTLAISPL 321


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2214c   DHBDHDRGNASE  109    5e-29    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 109 bits (274), Expect = 5e-29
Identities = 68/226 (30%), Positives = 107/226 (47%), Gaps = 3/226 (1%)

Query: 329 VTGAGSGIGRETALAFAREGAEIVISDIDEATVKDTAAEIAARGGIAYPYVLDVSDAEAV 388
+TGA GIG A A +GA I D + ++ + + A A + DV D+ A+
Sbjct: 13 ITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSAAI 72

Query: 389 EAFAERVSAEHGVPDIVVNNAGIGQAGRFLDTPAEQFDRVLAVNLGGVVNGCRAFGQRLV 448
+ R+ E G DI+VN AG+ + G E+++ +VN GV N R+ + ++
Sbjct: 73 DEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSKYMM 132

Query: 449 ERGTGGHIVNVSSMAAYAPLQSLSAYCTSKAATYMFSDCLRAELDAAGVGLTTICPGVID 508
+R G IV V S A P S++AY +SKAA MF+ CL EL + + PG +
Sbjct: 133 DR-RSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGSTE 191

Query: 509 TNIVATTGFHAPGTDEEKIDGRRGQIDKMFALRSYG-PDKVADAIV 553
T++ + G E+ I G L+ P +ADA++
Sbjct: 192 TDMQWSLWADENG-AEQVIKGSLETFKTGIPLKKLAKPSDIADAVL 236


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2215    PRTACTNFAMLY  33     0.004    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 32.7 bits (74), Expect = 0.004
Identities = 23/56 (41%), Positives = 30/56 (53%), Gaps = 1/56 (1%)

Query: 190 GELARIGVAADIGAAPAPKPAPKPVPEPAPTPKAEPAPSPPAAQPAGAAEGAPYVT 245
G+ + +G A PAP+P P+P P P P+A PAP PPA + AA A T
Sbjct: 562 GQWSLVGAKAPPAPKPAPQPGPQPPQPPQPQPEA-PAPQPPAGRELSAAANAAVNT 616


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2216    NUCEPIMERASE  32     0.002    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 32.4 bits (74), Expect = 0.002
Identities = 31/146 (21%), Positives = 50/146 (34%), Gaps = 24/146 (16%)

Query: 6 VAIAGSSGLIGSALTAALRAADHTVLRI--------VRRAPANSEELHWNPESGEFDPHA 57
+ G++G IG ++ L A H V+ I V A E L F H
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELL----AQPGFQFHK 58

Query: 58 --LTDVDAVVNLCGVNIAQR----------RWSGAFKQSLRDSRITPTEVLSAAVADAGV 105
L D + + +L +R R+S + DS +T + +
Sbjct: 59 IDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKI 118

Query: 106 ATLINASAVGYYGNTKDRVVDENDSA 131
L+ AS+ YG + +DS
Sbjct: 119 QHLLYASSSSVYGLNRKMPFSTDDSV 144


70  Rv2504c  Rv2510c  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv2504c   -1      10       0.153472   Probable succinyl-CoA:3-ketoacid-coenzyme A
Rv2505c   -2      10       0.273968   Probable fatty-acid-CoA ligase FadD35
Rv2506    -1      11       0.650171   Probable transcriptional regulatory protein
Rv2507    -1      13       0.599376   Possible conserved proline rich membrane
Rv2508c   0       12       0.241992   Probable conserved integral membrane leucine and
Rv2509    2       13       -0.405815  Probable short-chain type
Rv2510c   2       11       -0.384802  Conserved protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2504c   TYPE3OMGPROT  28     0.047    Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature.

Length = 607

Score = 27.9 bits (62), Expect = 0.047
Identities = 17/68 (25%), Positives = 25/68 (36%), Gaps = 12/68 (17%)

Query: 124 GTQVADGGLPWRYDASGGVAVVS-PAKETREFDGVTYVLE-----RGIRTD------FAL 171
+ + + WR DAS + VS P + + LE R +T F L
Sbjct: 130 RSGIWEPRFGWRPDASNRLVYVSGPPRYLELVEQTAAALEQQTQIRSEKTGALAIEIFPL 189

Query: 172 VHAWQGDR 179
+A DR
Sbjct: 190 KYASASDR 197


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv2506    HTHTETR  75     1e-18    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 74.7 bits (183), Expect = 1e-18
Identities = 38/196 (19%), Positives = 70/196 (35%), Gaps = 9/196 (4%)

Query: 17 NRRSQLKSDRRFQLLAAAERLFAERGFLAVRLEDIGAAAGVSGPAIYRHFPNKESLLVEL 76
+ Q + R +L A RLF+++G + L +I AAGV+ AIY HF +K L E+
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 77 LVGVSARLLAGARDVT-TRSANLAAALDGLIEFHLDFALGEADLIRIQDRDLAHLPAVAE 135
+ + + + + L ++ L+ + E + + V E
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 136 RQ-VRKAQRQYVEVWVGVLREL------NPGLAEA-DARLMAHAVFGLLNSTPHSMKAAD 187
V++AQR + + L R A + G ++ + A
Sbjct: 123 MAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAP 182

Query: 188 SKPARTVRARAVLRAM 203
AR + +
Sbjct: 183 QSFDLKKEARDYVAIL 198


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
Rv2507    PERTACTIN  30     0.009    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 30.5 bits (68), Expect = 0.009
Identities = 23/67 (34%), Positives = 24/67 (35%)

Query: 53 WSPGGPPPRPPQWPPGPHEASPTQQLPQYWQYDQPPPGGFPPDGLTPPPPQGPRTPRWLW 112
WS G P P P Q PQ Q QPP PP P P R L
Sbjct: 560 WSLVGAKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAGRELS 619

Query: 113 FAAGSAV 119
AA +AV
Sbjct: 620 AAANAAV 626


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2509    DHBDHDRGNASE  76     2e-18    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 76.2 bits (187), Expect = 2e-18
Identities = 52/185 (28%), Positives = 89/185 (48%), Gaps = 3/185 (1%)

Query: 12 AVVTGASQNIGAALATELAARGHHLIVTARREDVLTELAARLADKYRVTVDVRPADLADP 71
A +TGA+Q IG A+A LA++G H+ + L ++ + L + R + PAD+ D
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEAR-HAEAFPADVRDS 69

Query: 72 QERSKLADELAAR--PISILCANAGTATFGPIASLDLAGEKTQVQLNAVAVHDLTLAVLP 129
++ + PI IL AG G I SL + +N+ V + + +V
Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSK 129

Query: 130 GMIERKAGGILISGSAAGNSPIPYNATYAATKAFVNTFSESLRGELRGSGVHVTVLAPGP 189
M++R++G I+ GS P A YA++KA F++ L EL + +++PG
Sbjct: 130 YMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGS 189

Query: 190 VRTEL 194
T++
Sbjct: 190 TETDM 194


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
Rv2510c   TONBPROTEIN  31     0.008    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 31.1 bits (70), Expect = 0.008
Identities = 21/88 (23%), Positives = 32/88 (36%), Gaps = 2/88 (2%)

Query: 425 AIGAAAQASSLQAVYGQTIDRPSAHEILSAKLAPAQEAPAQEAPAP--RGQYDPLPWPDD 482
I A A L Q I+ P+ + +S + + +A P +P P P+
Sbjct: 18 CIHGAVVAGLLYTSVHQVIELPAPAQPISVTMVTPADLEPPQAVQPPPEPVVEPEPEPEP 77

Query: 483 FEVPPMPAPVEPQGPAVWEEILKNPTVK 510
PP APV + P + P K
Sbjct: 78 IPEPPKEAPVVIEKPKPKPKPKPKPVKK 105


71  Rv2586c  Rv2593c  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias   Product
Rv2586c   -2      14       1.295413  Probable protein-export membrane protein SecF
Rv2587c   -2      14       1.759725  Probable protein-export membrane protein SecD
Rv2588c   -2      16       2.163827  Probable conserved membrane protein secretion
Rv2589    -3      15       2.335982  4-aminobutyrate aminotransferase GabT
Rv2590    -2      14       2.419034  Probable fatty-acid-CoA ligase FadD9
Rv2591    2       15       3.763267  PE-PGRS family protein PE_PGRS44
Rv2592c   0       12       1.635324  Probable holliday junction DNA helicase RuvB
Rv2593c   1       12       1.248155  Probable holliday junction DNA helicase RuvA
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2586c   SECFTRNLCASE  250    5e-82    Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 250 bits (639), Expect = 5e-82
Identities = 75/314 (23%), Positives = 144/314 (45%), Gaps = 28/314 (8%)

Query: 58 FEVVGRRRLWFGVSGAIVAVAIASIVFRGFTFGIDFKGGTTVSFPRGSTQVAQVEDVYYR 117
F+ + FG + ++ ++ + G FGIDFKGGTT+ ST V
Sbjct: 14 FDFFRWQWATFGAAIVMMIASVILPLVIGLNFGIDFKGGTTIRTE--STTAIDVGVYRAA 71

Query: 118 ALGSEPQSVVIV--------GAGASATVQIRSETLTSD------QTAKLRDALFEAFGPK 163
E V+I A ++I+ + Q +L + + A
Sbjct: 72 LEPLELGDVIISEVRDPSFREDQHVAMIRIQMQEDGQGAEGQGAQGQELVNKVETALTAV 131

Query: 164 GTDGQPSKQAISDSAVSETWGGQITKKAVIALVVFLVLVALYITVRYERYMTISAITAML 223
P+ + S +V G++ AV +L+ V++ YI VR+E + A+ A++
Sbjct: 132 D----PALKITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVVALV 187

Query: 224 FDLTVTAGVYSLVGFEVTPATVIGLLTILGFSLYDTVIVFDKVEENTHGFQHTTRRTFAE 283
D+ +T G+++++ + TV LLTI G+S+ DTV+VFD++ EN ++ R +
Sbjct: 188 HDVLLTVGLFAVLQLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLR---D 244

Query: 284 QANLAINQTFMRSINTSLIGVLPVLALMVVAVWLLGVGTLKDLALVQLIGIIIGTYSSIF 343
NL++N+T R++ T + +L ++ +++ G ++ + G+ GTYSS++
Sbjct: 245 VMNLSVNETLSRTVMTGMTTLLALVPMLI-----WGGDVIRGFVFAMVWGVFTGTYSSVY 299

Query: 344 FATPLLVTLRERTE 357
A +++ +
Sbjct: 300 VAKNIVLFIGLDRN 313


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2587c   SECFTRNLCASE  58     3e-11    Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 57.9 bits (140), Expect = 3e-11
Identities = 41/270 (15%), Positives = 92/270 (34%), Gaps = 20/270 (7%)

Query: 279 GMDQRGIGYVVDLQFKGPA--ANIWADYTAAHIGTQTAFTLDSQVVSAPQIQEAIPGGRT 336
G+D +G G + + A +G + Q I
Sbjct: 46 GIDFKG-GTTIRTESTTAIDVGVYRAALEPLELGDVIISEVRDPSFREDQHVAMIRIQ-M 103

Query: 337 QISGGDPPFTAATARQLANVLK--YGSLPLSFEPSEAQTVSATLGLSSLRAGMIAGAIGL 394
Q G A ++L N ++ ++ + + + ++V + + + +
Sbjct: 104 QEDGQGAEGQGAQGQELVNKVETALTAVDPALKITSFESVGPKVSGELVWTAVWSLLAAT 163

Query: 395 LLVLVY-SLLYYRVLGLLTALSLVASGSMVFAILVLLGRYINYTLDLAGIAGLIIGIGTT 453
++++ Y + + L ++LV + + +L + L +A L+ G +
Sbjct: 164 VVIMFYIWVRFEWQFALGAVVALVHDVLLTVGLFAVLQLKFD----LTTVAALLTITGYS 219

Query: 454 ADSFVVFFERIKDEIR--EGRSFRSAVPRGWARARKTIVSGNAVTFLAAAVLYFLAIGQV 511
+ VV F+R+++ + + R + V T LA + +
Sbjct: 220 INDTVVVFDRLRENLIKYKTMPLRDVMNLSVNETLSRTVMTGMTTLLALVPMLIWGGDVI 279

Query: 512 KGFAFTLGLTTILDLVVVFLVTWPLVYLAS 541
+GF F + + VF T+ VY+A
Sbjct: 280 RGFVFAM-------VWGVFTGTYSSVYVAK 302



Score = 36.4 bits (84), Expect = 2e-04
Identities = 17/100 (17%), Positives = 35/100 (35%), Gaps = 8/100 (8%)

Query: 13 YLSVFLVMLIGIYLLVFFTGDKHTAPKLGIDLQGGTRVTLTARTPDGSAPSREALAQAQQ 72
+ + ++M+ + L + GID +GGT + + T R AL + +
Sbjct: 24 FGAAIVMMIASVILPLVI------GLNFGIDFKGGTTIRTESTTAIDVGVYRAAL-EPLE 76

Query: 73 IISARVNGLGVSGSEVVVDGDNLVITVPGNDGSEARNLGQ 112
+ ++ + S ++ DG A G
Sbjct: 77 LGDVIISEVR-DPSFREDQHVAMIRIQMQEDGQGAEGQGA 115


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2590    NUCEPIMERASE  44     3e-06    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 43.6 bits (103), Expect = 3e-06
Identities = 53/246 (21%), Positives = 82/246 (33%), Gaps = 65/246 (26%)

Query: 770 TVLLTGATGFLGRYLALEWLDRMDLVNGKLICLVRARSDEEAQARLDATFDSGDPYLVRH 829
L+TGA GF+G +++ L+ V G +D D D L +
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVG-----------------IDNLNDYYDVSLKQA 44

Query: 830 YRE-LGAGRLEVLAGDKGEADL--------GLDRVTWQRLADTV-DLIVDPAALVNHVLP 879
E L + D + + +RV V + +P A +
Sbjct: 45 RLELLAQPGFQFHKIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYAD---- 100

Query: 880 YSQLFGPNAAGTAELLRLALTGKRKPYIYTSTIAV---GEQIPPEAFTEDADI-RAISPT 935
N G +L K + +Y S+ +V ++P F+ D + +S
Sbjct: 101 ------SNLTGFLNILEGCRHNKIQHLLYASSSSVYGLNRKMP---FSTDDSVDHPVSL- 150

Query: 936 RRIDDSYANGYANSKWAGEVLLREAHEQCGLPVTVFRCDMILADTSYTGQLNLPDM---- 991
YA +K A E++ GLP T R T Y G PDM
Sbjct: 151 ----------YAATKKANELMAHTYSHLYGLPATGLR-----FFTVY-GPWGRPDMALFK 194

Query: 992 FTRLML 997
FT+ ML
Sbjct: 195 FTKAML 200


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv2591    cloacin  37     2e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 37.0 bits (85), Expect = 2e-04
Identities = 31/93 (33%), Positives = 40/93 (43%), Gaps = 3/93 (3%)

Query: 124 GANGTAASPNGGDGGILYGNGGN---GFSQTTAGVAGGAGGSAGLIGNGGNGGAGGAGAA 180
GA+ T+ + NGG G+ G G + G+S GG+G G G+G GG G +
Sbjct: 12 GAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNS 71

Query: 181 GGAGGAGGWLLGNGGAGGPGGPTDVPAGTGGAG 213
GG G GG L G P G GG
Sbjct: 72 GGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 34.7 bits (79), Expect = 9e-04
Identities = 29/102 (28%), Positives = 34/102 (33%)

Query: 173 GAGGAGAAGGAGGAGGWLLGNGGAGGPGGPTDVPAGTGGAGGAGGDAPLIGWGGNGGPGG 232
G G G GA G + G G GG +G G G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 233 FAAFGNGGAGGNGGASGSLFGVGGAGGVGGSSEDVGGTGGAG 274
GNG +GG G G+L V G + G GG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.8 bits (74), Expect = 0.004
Identities = 36/104 (34%), Positives = 43/104 (41%), Gaps = 8/104 (7%)

Query: 361 TGGDGGLGGAGAVLIGTGV-GGFGGLGGGSNGTGGAGGA------GGTGATLIGLGAGGG 413
+GGDG GA + GG GLG G + G+G + GG + I G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 414 GGIGGFAVNVGNGVGGLGGQGGQGAAL-IGLGAGGAGGAGGATV 456
G GG N G G G G A + G A GAGG V
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAV 105



Score = 32.8 bits (74), Expect = 0.004
Identities = 34/113 (30%), Positives = 42/113 (37%), Gaps = 10/113 (8%)

Query: 316 GDGGNGGAGTAIGSNAGDGGAGGDSSALIGYAQGGSGGLGGFGESTGGDGGLGGAGAVLI 375
G G N GA + G+ G G GG+ G+ GG G+G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVG--------GGASDGSGWSSENNPWGGGSGSGIHWG 57

Query: 376 GTGVGGFGGLGGGSNGTGGAGGAGGTGATLIGLG--AGGGGGIGGFAVNVGNG 426
G G GG G S G G GG A + G A G GG AV++ G
Sbjct: 58 GGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 32.4 bits (73), Expect = 0.005
Identities = 29/103 (28%), Positives = 41/103 (39%)

Query: 298 GGDGGAGGTAGGRLFSLGGDGGNGGAGTAIGSNAGDGGAGGDSSALIGYAQGGSGGLGGF 357
G + GA T+G G G GGA G ++ + GG S + I + G G GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 358 GESTGGDGGLGGAGAVLIGTGVGGFGGLGGGSNGTGGAGGAGG 400
++GG G GG + + GF L G + G
Sbjct: 68 NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 30.8 bits (69), Expect = 0.015
Identities = 38/139 (27%), Positives = 50/139 (35%), Gaps = 9/139 (6%)

Query: 223 GWGGNGGPGGFAAFGNGGAGGNGGASGSLFGVGGAGGVGGSSEDVGGTGGAGGAGRGLFL 282
G G N G + NGG G G G + G G SSE+ GG+G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGG------ASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 283 GLGGDGGAGGTSNNNGGDGGAGGTAGGRL---FSLGGDGGNGGAGTAIGSNAGDGGAGGD 339
G+GG G S G GG + F G GG +I + A
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADI 119

Query: 340 SSALIGYAQGGSGGLGGFG 358
+AL G + G G+ +G
Sbjct: 120 MAALKGPFKFGLWGVALYG 138



Score = 30.5 bits (68), Expect = 0.022
Identities = 33/116 (28%), Positives = 37/116 (31%), Gaps = 10/116 (8%)

Query: 283 GLGGDGGAGGTSNNNGGDGGAGGTAGGRLFSLGGDGGNGGAGTAIGSNAGDGGAGGDSSA 342
G G + GA TS N G G GG G N G GS GG G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG---- 61

Query: 343 LIGYAQGGSGGLGGFGESTGGDGGLGGAGAVLIGTGVGGFGGLGGGSNGTGGAGGA 398
G+GG G G GG A A + G G G + GA
Sbjct: 62 ------HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 29.3 bits (65), Expect = 0.049
Identities = 29/88 (32%), Positives = 33/88 (37%), Gaps = 11/88 (12%)

Query: 444 GAGGAGGAGGATVVGLGGNGGDGGDGGGLFSIGVGGDGGNAGNGAMPANGGNGGNAGVIA 503
G G G GA NGG G G G G DG + P GG+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVG----GGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 504 NGSFAPSFVGFGGNGGNGVNGGTGGSGG 531
G G GGNG +GG G+GG
Sbjct: 59 GS-------GHGNGGGNGNSGGGSGTGG 79


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2593c   DPTHRIATOXIN  30     0.006    Diphtheria toxin signature.
		>DPTHRIATOXIN#Diphtheria toxin signature.

Length = 567

Score = 30.1 bits (67), Expect = 0.006
Identities = 37/123 (30%), Positives = 53/123 (43%), Gaps = 18/123 (14%)

Query: 74 LLSVSGVGPRLA---MAALAVHDAPALRQVLADG---NVAALTRVPGIGKRGAERMVLEL 127
L +V+G P A AA AV+ A + AD AAL+ +PGIG V+ +
Sbjct: 295 LKTVTGTNPVFAGANYAAWAVNVAQVIDSETADNLEKTTAALSILPGIGS------VMGI 348

Query: 128 RDKV------GVAATGGALSTNGHAVRSPVVEALVGLGFAAKQAEEATDTVLAANHDATT 181
D + A ALS+ A P+V LV +GFAA E+ + H++
Sbjct: 349 ADGAVHHNTEEIVAQSIALSSLMVAQAIPLVGELVDIGFAAYNFVESIINLFQVVHNSYN 408

Query: 182 SSA 184
A
Sbjct: 409 RPA 411


72  Rv2737A  Rv2747  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv2737A   3       12       5.463400   Conserved hypothetical cysteine rich protein
Rv2738c   3       11       5.225851   Conserved hypothetical protein
Rv2739c   2       11       4.235482   Possible alanine rich transferase
Rv2740    4       12       3.255539   Epoxide hydrolase
Rv2741    4       13       2.706810   PE-PGRS family protein PE_PGRS47
Rv2742c   1       17       -0.646765  Conserved hypothetical arginine rich protein
Rv2743c   0       8        0.232447   Possible conserved transmembrane alanine rich
Rv2744c   1       10       -0.102681  Conserved 35 kDa alanine rich protein
Rv2745c   0       12       0.277098   Transcriptional regulatory protein ClgR
Rv2746c   -1      12       -0.173550  Probable PGP synthase PgsA3
Rv2747    0       13       -0.335289  Probable L-glutamate alpha-N-acetyltranferase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2737A   2FE2SRDCTASE  38     4e-07    Ferric iron reductase signature.
		>2FE2SRDCTASE#Ferric iron reductase signature.

Length = 262

Score = 37.7 bits (87), Expect = 4e-07
Identities = 13/26 (50%), Positives = 16/26 (61%)

Query: 32 DLTFRRRSCCLFYRVPAGGKCGDCPL 57
D RR+CC YR+P +CGDC L
Sbjct: 236 DGLLVRRTCCQRYRLPDVQQCGDCTL 261


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv2741    cloacin  39     3e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 39.3 bits (91), Expect = 3e-05
Identities = 37/113 (32%), Positives = 48/113 (42%), Gaps = 5/113 (4%)

Query: 233 GVGGAALGGGAQAAGGNGGAGGVGGLFGAGGAGGAGGFSDTGGTGGAGGAGGLFGPGGGS 292
G G GA + GN G G G G + G+G S+ GG G+G +G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 293 GGVGGFGDTGGTGGDGGSGGLFGVGGAGGHGGFGSAAGGDGGAGGAGGTVFGS 345
G GG G++GG G GG+ A FG A GAGG ++
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVA-----FGFPALSTPGAGGLAVSISAG 110



Score = 35.5 bits (81), Expect = 6e-04
Identities = 29/78 (37%), Positives = 32/78 (41%), Gaps = 1/78 (1%)

Query: 289 GGGSGGVGGFGDTGGTGGDGGSGGLFGVGGAGGHGGFGSAAGGDGGAGGAGGTVFGSGGA 348
G G G G T G +GG GL GGA G+ S GG G+G G G
Sbjct: 4 GDGRGHNTGAHSTSGNI-NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 349 GGAGGVATVAGHGGHGGN 366
G GG G G GGN
Sbjct: 63 GNGGGNGNSGGGSGTGGN 80



Score = 34.3 bits (78), Expect = 0.001
Identities = 30/89 (33%), Positives = 36/89 (40%), Gaps = 1/89 (1%)

Query: 372 GTGGAGGAGGFGGFGGDGGDGGIGGLVGSGGAGGSGGTGTLSGGRGGAGGNAGTFYGSG- 430
G G G G G+ G G VG G + GSG + + GG+G GSG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 431 GAGGAGGESDNGDGGNGGVGGKAGLVGEG 459
G GG G S G G G + A V G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 33.9 bits (77), Expect = 0.002
Identities = 38/115 (33%), Positives = 43/115 (37%), Gaps = 5/115 (4%)

Query: 400 SGGAGGSGGTGTLSGGRGGAGGNAGTFYGSGGAGGAG-GESDNGDGGNGGVGGKAGLVGE 458
SGG G TG S GG G G G + G+G +N GG G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 459 GGNGGDGGATIAGKGGSGGNGGNAWLTGQGGNGGNAAFGKAGTGSVGVGGAGGLL 513
GNGG G GG G GGN G A G G + V + G L
Sbjct: 62 HGNGGGNG----NSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGAL 112



Score = 33.5 bits (76), Expect = 0.002
Identities = 28/102 (27%), Positives = 37/102 (36%), Gaps = 2/102 (1%)

Query: 260 GAGGAGGAGGFSDTGGTGGAGGAGGLFGPGGGSGGVGGFGDTGGTGGDGGSGGLFGVGGA 319
G G G G T G G G G GGG+ G+ G G G+ GG+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGL--GVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 320 GGHGGFGSAAGGDGGAGGAGGTVFGSGGAGGAGGVATVAGHG 361
G G G+ G G G + + A G ++T G
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 33.5 bits (76), Expect = 0.002
Identities = 33/113 (29%), Positives = 41/113 (36%), Gaps = 5/113 (4%)

Query: 160 GAAGLFGNGGAGGAGASNQAGNGGAGGNGGAG-GLIWGTAGTGGNGGFTTFLDAAGGAGG 218
G G N GA + G G G GGA G W + GG + + GG+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 219 AGGAGGLFGAGGAGGVGGAALGGGAQAAG----GNGGAGGVGGLFGAGGAGGA 267
G G GG+G G + A G GAGG+ AG A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 33.5 bits (76), Expect = 0.002
Identities = 21/72 (29%), Positives = 28/72 (38%)

Query: 120 NGANGKPGTGQDGGAGGLLYGSGGNGGSGLAGSGQKGGNGGAAGLFGNGGAGGAGASNQA 179
N +GG GL G G + GSG + G G +G+ GG+G
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69

Query: 180 GNGGAGGNGGAG 191
+GG G GG
Sbjct: 70 NSGGGSGTGGNL 81



Score = 32.8 bits (74), Expect = 0.004
Identities = 35/105 (33%), Positives = 42/105 (40%), Gaps = 5/105 (4%)

Query: 169 GAGGAGASNQAGNGGAGGNGGAGGLIWGTAGTGGNGGFTTFLDAAGGAGGAGGAGGLFGA 228
G G G + A + NGG GL G + G+G + GG+G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 229 GGAGGVGGAALGGGAQAAGGNGGAGGVGGLFG--AGGAGGAGGFS 271
G GG G GG GGN A FG A GAGG +
Sbjct: 63 GNGGGNGN---SGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.0 bits (72), Expect = 0.006
Identities = 30/103 (29%), Positives = 37/103 (35%), Gaps = 1/103 (0%)

Query: 198 AGTGGNGGFTTFLDAAGGAGGAGGAGGLFGAGGAGGVGGAALGGGAQAAGGNGGAGGVGG 257
+G G G T +G G G+ G G + +G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 258 LFGAGGAGGAGGFSDTGGTGGAGGAGGLFG-PGGGSGGVGGFG 299
GG G +GG S TGG A A FG P + G GG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.0 bits (72), Expect = 0.006
Identities = 33/108 (30%), Positives = 41/108 (37%), Gaps = 5/108 (4%)

Query: 326 GSAAGGDGGAGGAGGTVFGSGGAGGAGGVATVAGHGGHGGNAGLLYGTGGAGGAGGFGGF 385
G G + GA G + G G GG A+ G G N G+G GG G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGAS-DGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 386 GGDGGDGGIGGLVGSGGAGGSGGTGTLSG----GRGGAGGNAGTFYGS 429
G GG+G GG G+GG + G GAGG A +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 32.0 bits (72), Expect = 0.007
Identities = 32/112 (28%), Positives = 38/112 (33%), Gaps = 7/112 (6%)

Query: 140 GSGGNGGSGLAGSGQKGGNGGAAGLFGNGGAGGAGASNQAGNGGAGGNGGAGGLIWGTAG 199
G G N G+ GG G G G + N GG+G GG G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH--G 63

Query: 200 TGGNGGFTTFLDAAGGAGGAGGAGGLFG-----AGGAGGVGGAALGGGAQAA 246
GG G + GG A A FG GAGG+ + G AA
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 31.6 bits (71), Expect = 0.009
Identities = 29/103 (28%), Positives = 36/103 (34%), Gaps = 1/103 (0%)

Query: 318 GAGGHGGFGSAAGGDGGAGGAGGTVFGSGGAGGAGGVATVAGHGGHGGNAGLLYGTGGAG 377
G G G A G G + GGA G ++ G G +G+ +G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 378 GAGGFGGFGGDGGDGGIGGLVGSGGAGGSGGTGTLSGGRGGAG 420
G GG G G GG G G L G + G GG
Sbjct: 63 GNGGGNGNSG-GGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
Rv2742c   GALLIDERMIN  30     0.001    Gallidermin signature.
		>GALLIDERMIN#Gallidermin signature.

Length = 52

Score = 30.1 bits (67), Expect = 0.001
Identities = 10/21 (47%), Positives = 13/21 (61%)

Query: 242 NAICAIEDGVEPRVAWWALCT 262
NA + + G EPR+A LCT
Sbjct: 18 NAKESNDSGAEPRIASKFLCT 38


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
Rv2744c   IGASERPTASE  28     0.031    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 28.5 bits (63), Expect = 0.031
Identities = 33/228 (14%), Positives = 68/228 (29%), Gaps = 15/228 (6%)

Query: 24 ADPKVQIQQAIEEAQRTHQALTQQA--AQVIGNQRQLEMRLNRQLADIEKLQVNVRQALT 81
+ Q ++ +EA+ +A TQ AQ ++ + ++ A +EK + + +
Sbjct: 1059 TETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEE---KAKVE 1115

Query: 82 LADQATAAGDAAKATEYNNAAEAF--AAQLVTAEQSVEDLKTLHDQALSAAAQAKKAVER 139
++ + +E A+ ++K Q + A + A E
Sbjct: 1116 TEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKET 1175

Query: 140 NAMVLQQKIAERTKLLSQLEQAKMQEQVSASLRSMSELAAPGNTPSLDEVRDKIERRYAN 199
++ V + V + P RR
Sbjct: 1176 SSNV-------EQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVR 1228

Query: 200 AI-GSAELAESSVQGRMLEVEQAGIQMAGHSRLEQIRASMRGEALPAG 246
++ + E A +S R ++ L RA + AL G
Sbjct: 1229 SVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVALNVG 1276


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2747    SACTRNSFRASE  40     8e-07    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 40.3 bits (94), Expect = 8e-07
Identities = 14/73 (19%), Positives = 31/73 (42%), Gaps = 5/73 (6%)

Query: 61 KVVGCGALHVLWSDLGEIRTVAVDPAMTGHGIGHAIVDRLLQVARDLQLQRVFVLTFET- 119
+G + W+ I +AV G+G A++ + ++ A++ + + T +
Sbjct: 75 NCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDIN 134

Query: 120 ----EFFARHGFT 128
F+A+H F
Sbjct: 135 ISACHFYAKHHFI 147


73  Rv2843  Rv2853  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias   Product
Rv2843    -2      8        0.755173  Probable conserved transmembrane alanine rich
Rv2844    -2      9        0.414762  Conserved alanine rich protein
Rv2845c   -2      8        1.183822  Probable prolyl-tRNA synthetase ProS
Rv2846c   -1      8        1.383191  Possible integral membrane efflux protein EfpA
Rv2847c   -3      8        1.166179  Possible multifunctional enzyme siroheme
Rv2848c   0       11       2.876429  Probable cobyrinic acid A,C-diamide synthase
Rv2849c   0       11       2.468816  Probable cob(I)alamin adenosyltransferase CobO
Rv2850c   -1      10       1.743389  Possible magnesium chelatase
Rv2851c   0       13       0.360859  GCN5-related N-acetyltransferase
Rv2852c   0       12       0.141826  Probable malate:quinone oxidoreductase Mqo
Rv2853    1       11       0.650048  PE-PGRS family protein PE_PGRS48
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2843    PYOCINKILLER  28     0.025    Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 27.8 bits (61), Expect = 0.025
Identities = 24/120 (20%), Positives = 44/120 (36%), Gaps = 13/120 (10%)

Query: 56 QARHDGALAAAAATAIGIPPQVAAALTVVATQRTSHARALATEIARAAGKLVSATSETSS 115
Q R + AA A+ AAA Q + A+ A E AR + +A +
Sbjct: 201 QIRMNTLTAAKASIE-------AAAANKAREQAAAEAKRKAEEQARQQAAIRAANTYAMP 253

Query: 116 SSPSPTDPAAPPPAV------SDVIDSLRTSAGEASRLVATTSGYRAGLLASIAASCTAS 169
++ S AA + + + ++ + R++A+ A AS+ S +
Sbjct: 254 ANGSVVATAAGRGLIQVAQGAASLAQAISDAIAVLGRVLASAPSVMAVGFASLTYSSRTA 313


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv2846c   TCRTETB  113    3e-29    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 113 bits (285), Expect = 3e-29
Identities = 79/368 (21%), Positives = 152/368 (41%), Gaps = 19/368 (5%)

Query: 61 MQLLATMDSTVAIVALPKIQNELSLSDAGRSWVITAYVLTFGGLMLLGGRLGDTIGRKRT 120
+ + ++ V V+LP I N+ + A +WV TA++LTF + G+L D +G KR
Sbjct: 22 LSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRL 81

Query: 121 FIVGVALFTISSVLCAVAWDEATLVI-ARLSQGVGSAIASPTGLALVATTFPKGPARNAA 179
+ G+ + SV+ V +L+I AR QG G+A A P + +V + R A
Sbjct: 82 LLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAA-AFPALVMVVVARYIPKENRGKA 140

Query: 180 TAVFAAMTAIGSVMGLVVGGALTEVSWRWAFLVNVP-IGLVMIYLARTALRETNKERMKL 238
+ ++ A+G +G +GG + W++L+ +P I ++ + L++ + +
Sbjct: 141 FGLIGSIVAMGEGVGPAIGGMIAH-YIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHF 199

Query: 239 DATGAILATLACTAAVFAFSIGPEKGWMSGITIGSGLVALAAAVAFVIVERTAENPVVPF 298
D G IL ++ + F+ ++ + + FV R +P V
Sbjct: 200 DIKGIILMSVGIVFFML-FTTSYSISFLIVSVLSFLI--------FVKHIRKVTDPFVDP 250

Query: 299 HLFRDRNRLVTFSAILLAGGVMFSLTVCIGLYVQDILGYSALRAGVGFI-PFVIAMGIGL 357
L ++ ++ + G + + ++D+ S G I P +++ I
Sbjct: 251 GLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFG 310

Query: 358 GVSSQLVSRFSPRVLTIGGGYLLFGAMLYGSFFMHRGVPYFPNLVMPIVVGGIGIGMAVV 417
+ LV R P + G L + L SF + M I++ + G++
Sbjct: 311 YIGGILVDRRGPLYVLNIGVTFLSVSFLTASFL-----LETTSWFMTIIIVFVLGGLSFT 365

Query: 418 PLTLSAIA 425
+S I
Sbjct: 366 KTVISTIV 373


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2848c   DHBDHDRGNASE  37     1e-04    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 36.6 bits (84), Expect = 1e-04
Identities = 19/88 (21%), Positives = 32/88 (36%), Gaps = 3/88 (3%)

Query: 256 VAIAAGRAFTFGYAEHAEMLRAAGAEVVEFDPLSETLPEGTDAVVLPGGFPEQFTAELSA 315
+A G A G A A L + GA + D E L + ++ E F A++
Sbjct: 10 IAFITGAAQGIGEAV-ARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 316 NDTVRRQINELAAAGAPVHA--ECAGLL 341
+ + + P+ AG+L
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVL 96


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2850c   TONBPROTEIN   34     0.001    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 33.8 bits (77), Expect = 0.001
Identities = 26/164 (15%), Positives = 47/164 (28%), Gaps = 18/164 (10%)

Query: 300 DEALALASVDPEPEPDPPGGGQSANEPASQPNSRSKSTEPGAPSSMGDDPPRPASPRLRS 359
+ +++ V P P A QP P + + P +
Sbjct: 42 AQPISVTMVTPADLEPPQ---------AVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKP 92

Query: 360 SPRPSAPPSKIFRTRALRVPGVGTGAPGRRSRARNASGSVVAAAEVSDPDAHGLHLFATL 419
P+P P + + + V S N + + + ++ + + + A+
Sbjct: 93 KPKPKPKPKPVKKVQEQPKRDVKPVESRPASPFENTAPARLTSSTATAATSKPVTSVASG 152

Query: 420 LAAGERAFGAGPLRPWPDDVRRAIREGREGN-LVIFVVDASGSM 462
A R P RA EG V F V G +
Sbjct: 153 PRALSRNQPQYP--------ARAQALRIEGQVKVKFDVTPDGRV 188


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2853    cloacin       40     2e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 40.1 bits (93), Expect = 2e-05
Identities = 27/81 (33%), Positives = 34/81 (41%)

Query: 348 GAGGNGGKGGFAQATTSVTGGNGGNGGNGHDSNAPGGAGGSGGVGGDGGRGGLLAGNGGT 407
G G G G + ++ GG G G G S+ G + + GG G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 408 GGAGGNGGTGGAGAPGGAGGA 428
G GGNG +GG GG A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSA 83



Score = 38.2 bits (88), Expect = 1e-04
Identities = 30/87 (34%), Positives = 34/87 (39%), Gaps = 1/87 (1%)

Query: 525 GGDGGAGGTGGNGGDGGAGGAPGLGGAGGAGGWLIGQSGSTGGGGAGGAGGAGGAGGAGG 584
G + GA T GN G G G GGA GW + GG G+G G G G GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVG-GGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGG 66

Query: 585 SGGAGGHGDTTSGKNGSSGTAGFDGNP 611
G G G T G + G P
Sbjct: 67 GNGNSGGGSGTGGNLSAVAAPVAFGFP 93



Score = 37.4 bits (86), Expect = 2e-04
Identities = 30/91 (32%), Positives = 40/91 (43%)

Query: 259 NGGNTDLTGGAGGDGNAGSTTVNGGNGGTGGAARNSSGGTGNSFGGAGGAGGDGANGGDG 318
NGG T L G G +G ++ N GG G+ + GG+G+ GG G G G+ G
Sbjct: 21 NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGN 80

Query: 319 GAGGEALTEGGATAVSGAGGKGGNAEASGGA 349
+ A G A+S G G S GA
Sbjct: 81 LSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 36.6 bits (84), Expect = 3e-04
Identities = 31/93 (33%), Positives = 38/93 (40%)

Query: 439 GDNATVTGGNGGTGGDGGSALGTGGAGGAGGLGGHGGAGGLLIGNGGAGGAGGLGGAGGA 498
G ++T NGG G G + G+G + GG G I GG G G GG G +
Sbjct: 12 GAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNS 71

Query: 499 GGAGGEGGAGGAGGEAIPGGASTNSAGGDGGAG 531
GG G GG A + G S G GG
Sbjct: 72 GGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 36.6 bits (84), Expect = 3e-04
Identities = 34/125 (27%), Positives = 41/125 (32%), Gaps = 9/125 (7%)

Query: 482 GNGGAGGAGGLGGAGGAGGAGGEGGAGGAGGEAIPGGASTNSAGGDGGAGGTGGNGGDGG 541
G G G G G G G G G G +S N+ G G G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 542 AGGAPGLGGAGGAGGWLIGQSGSTGGGGAGGAGGAGGAGGAGGSGGAGGHGDTTSGKNGS 601
G GG+G TGG + A A + GAGG + S S
Sbjct: 63 GNGGGNGNSGGGSG---------TGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALS 113

Query: 602 SGTAG 606
+ A
Sbjct: 114 AAIAD 118



Score = 34.3 bits (78), Expect = 0.002
Identities = 28/86 (32%), Positives = 36/86 (41%)

Query: 461 TGGAGGAGGLGGHGGAGGLLIGNGGAGGAGGLGGAGGAGGAGGEGGAGGAGGEAIPGGAS 520
+GG G G H +G + G G G GG G G G G GG+
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 521 TNSAGGDGGAGGTGGNGGDGGAGGAP 546
+ GG+G +GG G GG+ A AP
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAP 87



Score = 33.5 bits (76), Expect = 0.002
Identities = 27/93 (29%), Positives = 35/93 (37%)

Query: 140 NGGNGAAGGPNQAGGAGGNAGLIGNGGAGGAGGVGAVGGKRGTGGLLFGNGGAGGQGGLG 199
N G + G G G G + G+G + GG G+G G G G GG G
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69

Query: 200 LAGINGGSGGQGGHGGNAILFGQGGAGGPGGTG 232
+G G+GG + FG PG G
Sbjct: 70 NSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 32.4 bits (73), Expect = 0.006
Identities = 30/95 (31%), Positives = 43/95 (45%), Gaps = 1/95 (1%)

Query: 258 GNGGNTDLTGGAGGDGNAGSTTVNGGNGGTGGAARNSSGGTGNSFGGAGGAGGDGANGGD 317
G+ T G G G G + G+G ++ N+ G G+ G G G NGG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 318 GG-AGGEALTEGGATAVSGAGGKGGNAEASGGAGG 351
G +GG + T G +AV+ G A ++ GAGG
Sbjct: 68 NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 32.0 bits (72), Expect = 0.008
Identities = 30/93 (32%), Positives = 36/93 (38%), Gaps = 7/93 (7%)

Query: 269 AGGDGNAGSTTVNGGNGGTGGAARNSSGGTGNSFGGAGGAGGDGANGGDGGAGGEALTEG 328
+GGDG N G T G G G G + G+G N GG G + G
Sbjct: 2 SGGDGRGH----NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWG 57

Query: 329 GATAVSGAGGKGGNAEASGGAGGNGGKGGFAQA 361
G SG G GGN + GG+G G A
Sbjct: 58 G---GSGHGNGGGNGNSGGGSGTGGNLSAVAAP 87



Score = 29.7 bits (66), Expect = 0.038
Identities = 31/110 (28%), Positives = 40/110 (36%), Gaps = 1/110 (0%)

Query: 371 GNGGNGHDSNAPGGAGG-SGGVGGDGGRGGLLAGNGGTGGAGGNGGTGGAGAPGGAGGAG 429
G G GH++ A +G +GG G G GG G+G + GG G+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 430 GKADIANSLGDNATVTGGNGGTGGDGGSALGTGGAGGAGGLGGHGGAGGL 479
G + G + G GAGGL AG L
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGAL 112



Score = 29.7 bits (66), Expect = 0.039
Identities = 25/68 (36%), Positives = 32/68 (47%), Gaps = 4/68 (5%)

Query: 121 GADGSTPGQAGGPGGLLYGNGGNGAAGGPNQ----AGGAGGNAGLIGNGGAGGAGGVGAV 176
GA ++ GGP GL G G + +G ++ GG+G G G G GG G
Sbjct: 12 GAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNS 71

Query: 177 GGKRGTGG 184
GG GTGG
Sbjct: 72 GGGSGTGG 79
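
Each hit listed above is summarised by a line of the form "Score = ... bits (...), Expect = ...". The following is a minimal sketch, in Python, of how such lines could be parsed and filtered against an E-value cut-off. The function names are hypothetical, and the default of 0.05 is only assumed to correspond to the "E-value (RPSBlast)" input parameter shown at the top of the report; this is not the tool's own code.

```python
import re

# Matches BLAST-style summary lines such as
# "Score = 27.8 bits (61), Expect = 0.025"
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*([\d.eE+-]+)"
)


def parse_hsp_scores(text):
    """Return (bit_score, raw_score, e_value) for every summary line in text."""
    return [(float(b), int(r), float(e)) for b, r, e in SCORE_RE.findall(text)]


def significant(hsps, e_cutoff=0.05):
    """Keep hits whose E-value is at or below the cut-off (assumed inclusive)."""
    return [h for h in hsps if h[2] <= e_cutoff]


sample = """
Score = 27.8 bits (61), Expect = 0.025
Score = 113 bits (285), Expect = 3e-29
"""

hsps = parse_hsp_scores(sample)
for bits, raw, e_value in significant(hsps):
    print(bits, raw, e_value)
```

Lowering `e_cutoff` (for example to 0.01) would drop the weaker short, low-complexity matches while keeping strong hits such as the TetB one.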


74  Rv2905  Rv2914c  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv2905    -1      14       -1.652002  Probable conserved alanine rich lipoprotein
Rv2906c   -1      14       -0.909354  Probable tRNA (guanine-N1)-methyltransferase
Rv2907c   -1      13       -1.185697  Probable 16S rRNA processing protein RimM
Rv2908c   -1      10       -0.928422  Conserved hypothetical protein
Rv2909c   -1      8        -0.181904  30S ribosomal protein S16 RpsP
Rv2910c   -2      7        0.255539   Conserved hypothetical protein
Rv2911    -2      8        0.717903   Probable penicillin-binding protein DacB2
Rv2912c   -1      7        0.666713   Probable transcriptional regulatory protein
Rv2913c   -2      6        0.807283   Possible D-amino acid aminohydrolase (D-amino
Rv2914c   -2      7        0.978406   Probable transmembrane serine/threonine-protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2905    BLACTAMASEA   29     0.022    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 29.0 bits (65), Expect = 0.022
Identities = 31/156 (19%), Positives = 50/156 (32%), Gaps = 36/156 (23%)

Query: 76 PTRVRQATDEAAAMGATLSVAVLDRATGQLVSNGNT-QIIATASVAKLFIADDLLLAEAE 134
P + Q + + + + +D A+G+ ++ + S K+ + +L
Sbjct: 23 PQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADERFPMMSTFKVVLCGAVLARVDA 82

Query: 135 GK--------------VTLSP--EDHHA--------LDVMLQSSDDGAAERFWSQDGGNA 170
G V SP E H A + SD+ AA + GG A
Sbjct: 83 GDEQLERKIHYRQQDLVDYSPVSEKHLADGMTVGELCAAAITMSDNSAANLLLATVGGPA 142

Query: 171 VVTQVARRYGLRST-----------APPSDGRWWNT 195
+T R+ G T A P D R T
Sbjct: 143 GLTAFLRQIGDNVTRLDRWETELNEALPGDARDTTT 178


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2909c   IGASERPTASE   32     7e-04    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 32.3 bits (73), Expect = 7e-04
Identities = 11/46 (23%), Positives = 14/46 (30%)

Query: 117 PTTEATKPKKKSPAKKAAKAAEPAPQPEQPDTPALGGEQAELTAES 162
T + S + A P P PA E E AE+
Sbjct: 998 TTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAEN 1043


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2911    BLACTAMASEA   33     9e-04    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 33.2 bits (76), Expect = 9e-04
Identities = 29/116 (25%), Positives = 40/116 (34%), Gaps = 9/116 (7%)

Query: 49 DLDSGQVLAGRDQNVAHPPASTIKVLLALVALDELD-----LNSTVVADVADTQAECNCV 103
DL SG+ L + P ST KV+L L +D L + D
Sbjct: 46 DLASGRTLTAWRADERFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSPVS 105

Query: 104 GVKPGRSYTARQLLDGLLLVSGNDAANTLAHMLGGQDVTVAKMNAKAATLGATSTH 159
T +L + +S N AAN L +GG A + A +G T
Sbjct: 106 EKHLADGMTVGELCAAAITMSDNSAANLLLATVGG----PAGLTAFLRQIGDNVTR 157


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2912c   HTHTETR       50     7e-10    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 50.0 bits (119), Expect = 7e-10
Identities = 23/91 (25%), Positives = 38/91 (41%), Gaps = 4/91 (4%)

Query: 1 MARTQQQRREETVARLLQASIDTIIEVGYARASAAVITKRAGVSVGALFRHFETMGDFMA 60
MAR +Q +ET +L ++ + G + S I K AGV+ GA++ HF+ D +
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 61 ATAYEVLRRQLETFTKQVAEIPADRPALPAA 91
+ + E A P P +
Sbjct: 61 E----IWELSESNIGELELEYQAKFPGDPLS 87


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2913c   UREASE        34     0.002    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 34.3 bits (79), Expect = 0.002
Identities = 24/90 (26%), Positives = 38/90 (42%), Gaps = 19/90 (21%)

Query: 17 DVIIRDGLWFDGTGNA--PLTRTLGIRDGVVATVAAGALDETGCPEVVDAAGKWVVPGFI 74
D+ ++DG G A P ++ GV V G EV+ GK V G +
Sbjct: 87 DIGLKDGRIA-AIGKAGNP-----DMQPGVTIIVGPGT-------EVIAGEGKIVTAGGM 133

Query: 75 DVHTHYDAEVLLDPGLRESVRHGVTTVLLG 104
D H H+ ++ E++ G+T +L G
Sbjct: 134 DSHIHFICPQQIE----EALMSGLTCMLGG 159


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2914c   YERSSTKINASE  31     0.013    Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 31.2 bits (70), Expect = 0.013
Identities = 37/152 (24%), Positives = 66/152 (43%), Gaps = 25/152 (16%)

Query: 69 HPHILEVHDRGEFD----GQLWIAMDYVDG---IDATQHMAD-----RFPAVLPVGEVLA 116
HP++ VH + + MD VDG D + +AD + + G +
Sbjct: 190 HPNLANVHGMAVVPYGNRKEEALLMDEVDGWRCSDTLRTLADSWKQGKINSEAYWGTIKF 249

Query: 117 IVTAVAGALDYAHQRGLLHRDVNPANVVLTSQSAGDQRILLADFGIAS----QP-----S 167
I + ++ + G++H D+ P NVV +++G+ ++ D G+ S QP S
Sbjct: 250 IAHRLLDVTNHLAKAGVVHNDIKPGNVVF-DRASGEPVVI--DLGLHSRSGEQPKGFTES 306

Query: 168 YPAPELSAG-ADVDGRADQYALALTAIHLFAG 198
+ APEL G ++D + + T +H G
Sbjct: 307 FKAPELGVGNLGASEKSDVFLVVSTLLHCIEG 338


75  Rv2930  Rv2934  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv2930    -2      8        -0.271925  Fatty-acid-AMP ligase FadD26 (fatty-acid-AMP
Rv2931    -2      8        -0.073356  Phenolpthiocerol synthesis type-I polyketide
Rv2932    -2      9        -0.217391  Phenolpthiocerol synthesis type-I polyketide
Rv2933    -1      10       -0.495114  Phenolpthiocerol synthesis type-I polyketide
Rv2934    -2      12       -1.299889  Phenolpthiocerol synthesis type-I polyketide
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2930    FLGMOTORFLIM  29     0.046    Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 29.1 bits (65), Expect = 0.046
Identities = 18/89 (20%), Positives = 37/89 (41%)

Query: 477 DDIEATIQEITGGRAAAIAVPDDITEQLVAIIEFKRRGSTAEEVMLKLRSVKREVTSAIS 536
D+I+ + I+ G A+ + + + +F+R ++E M L + +
Sbjct: 8 DEIDQLLTAISSGDASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLMHETFARLTT 67

Query: 537 KSHSLRVADLVLVSPGSIPITTSGKIRRS 565
S S ++ +V V S+ T + RS
Sbjct: 68 TSLSAQLRSMVHVHVASVDQLTYEEFIRS 96


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2931    ISCHRISMTASE  31     0.038    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 31.1 bits (70), Expect = 0.038
Identities = 15/68 (22%), Positives = 31/68 (45%), Gaps = 1/68 (1%)

Query: 1768 LRRIIAAELRVPEKELDTDRPFAELGLNSLMAMAIRREAEQFVGIELSATMLFNHPTVKS 1827
+R+ IA L+ +++ + GL+S+ M + + + G E++ L PT++
Sbjct: 235 IRKQIAELLQETPEDITDQEDLLDRGLDSVRIMTLVEQWRR-EGAEVTFVELAERPTIEE 293

Query: 1828 LASYLAKR 1835
L R
Sbjct: 294 WQKLLTTR 301


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2932    DHBDHDRGNASE  43     4e-06    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 43.1 bits (101), Expect = 4e-06
Identities = 30/103 (29%), Positives = 47/103 (45%), Gaps = 1/103 (0%)

Query: 1155 LVIGATGNIGPHLIRQLARMGAKTIVAMARKPGALDELTQCLAATGTDLIAVAADATDPA 1214
+ GA IG + R LA GA I A+ P L+++ L A A AD D A
Sbjct: 12 FITGAAQGIGEAVARTLASQGAH-IAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSA 70

Query: 1215 AMQTLFDRFGTELPPLEGIYLAAFAGRPALLSEMTDDDVTTMF 1257
A+ + R E+ P++ + A RP L+ ++D++ F
Sbjct: 71 AIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATF 113


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2933    DHBDHDRGNASE  36     0.001    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 36.2 bits (83), Expect = 0.001
Identities = 32/154 (20%), Positives = 61/154 (39%), Gaps = 7/154 (4%)

Query: 1805 LIVGGMGGLGFVVARWLAEQGAGLIVLNGRSAPSDEVAAAIAELNASGSRIEVITGDITE 1864
I G G+G VAR LA QGA + +++ ++ L A E D+ +
Sbjct: 12 FITGAAQGIGEAVARTLASQGAHIA---AVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 1865 PDTAERLVRAVEDAGFRLAGVVHSAMVLADEIVLNMTDSAARRVFAPKVTGSWRLHVATA 1924
+ + +E + +V+ A VL ++ +++D F+ TG + + +
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 1925 ----ARDVDWWLTFSSAAALLGTPGQGAYAAANS 1954
R +T S A + AYA++ +
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKA 162


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv2934    DHBDHDRGNASE  35     0.003    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 34.6 bits (79), Expect = 0.003
Identities = 36/165 (21%), Positives = 63/165 (38%), Gaps = 8/165 (4%)

Query: 1437 QGASYVVTGGLGGLGLVVARWLVDRGAGRVVLGGRSDPTDEQCNVLAELQTRAEIVVVRG 1496
+G +TG G+G VAR L +GA + +P + V +
Sbjct: 7 EGKIAFITGAAQGIGEAVARTLASQGAH--IAAVDYNPEKLEKVVSSLKAEARHAEAFPA 64

Query: 1497 DVASP-GVAEKLIETARQSGGQLRGVVHAAAVIEDSLVFSMSRDNLERVWAPKATGALRM 1555
DV + E R+ G + +V+ A V+ L+ S+S + E ++ +TG
Sbjct: 65 DVRDSAAIDEITARIEREMGP-IDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNA 123

Query: 1556 HEATADCELDWWLG--FSSAASLLGSP--GQAAYACASAWLDALV 1596
+ + +D G + ++ G P AAYA + A
Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFT 168


76  Rv3343c  Rv3350c  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv3343c   10      17       0.794661   PPE family protein PPE54
Rv3344c   10      19       3.199528   PE-PGRS family protein PE_PGRS49
Rv3345c   10      15       0.945335   PE-PGRS family protein PE_PGRS50
Rv3346c   8       14       -1.238247  Conserved transmembrane protein
Rv3347c   8       14       -1.180532  PPE family protein PPE55
Rv3348    8       14       -0.497492  Probable transposase
Rv3349c   8       14       -0.448566  Probable transposase
Rv3350c   8       15       -0.647955  PPE family protein PPE56
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv3343c   cloacin       32     0.036    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 32.0 bits (72), Expect = 0.036
Identities = 28/79 (35%), Positives = 34/79 (43%), Gaps = 7/79 (8%)

Query: 1971 GAGNVGGFNVGGGNIGGNNVGLGNVGWGNFGLG-NSGLTP--GLMGLGNIGFGNAGSYNF 2027
G G+ G + GNI G GLG G + G G +S P G G G G +G N
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 2028 GLANMGVGNIGFANTGSGN 2046
G G GN G + GN
Sbjct: 66 G----GNGNSGGGSGTGGN 80



Score = 32.0 bits (72), Expect = 0.036
Identities = 29/106 (27%), Positives = 40/106 (37%), Gaps = 4/106 (3%)

Query: 193 NVGLFNAGSGNVGSYNVGAGNVGSYNVGGGNIGGNNVGLGNVGWGNFGLGNSGLTPGLMG 252
N G + SGN+ G G G + G G NN G G G G SG G
Sbjct: 10 NTGAHST-SGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGN 68

Query: 253 LGNIGFGNAGSYNFG--LANMGVGNIGFANTGSGNFGIGLTGDNLT 296
GN G G+ N A + G + G+G + ++ L+
Sbjct: 69 -GNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALS 113



Score = 31.6 bits (71), Expect = 0.043
Identities = 26/94 (27%), Positives = 37/94 (39%), Gaps = 3/94 (3%)

Query: 1842 GNTTGAPSSGFFNSGAGGVSGFGNVGAMVSGGWNQAPSALLGGGSGVFNAGTLHSGVLNF 1901
GN G P+ GA SG+ + GG GGGSG N G +
Sbjct: 18 GNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHW--GGGSGHGNGGGNGNSGGGS 75

Query: 1902 GSGMSGLFNTSVLGLGAPALVS-GLGSVGQQLSG 1934
G+G + + + G PAL + G G + +S
Sbjct: 76 GTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv3344c   cloacin       34     0.001    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 34.3 bits (78), Expect = 0.001
Identities = 30/99 (30%), Positives = 37/99 (37%), Gaps = 1/99 (1%)

Query: 174 GAGGSGGNGGAGGNATGSGGKGGAGGNGGDGS-FGATSGPASIGVTGAPGGNGGKGGAGG 232
G G + G GN G G GG DGS + + + P G GG G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 233 SNPNGSGGDGGKGGNGGAGGNGGSIGANSGIVGGSGGAG 271
SGG G GGN A + G + G+GG
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 33.9 bits (77), Expect = 0.001
Identities = 31/80 (38%), Positives = 36/80 (45%), Gaps = 2/80 (2%)

Query: 121 TGGSGSGIGGGAGGNGGN--GGAGGTGVVLGGKGGDGGNGDHGGPATNPGSGSRGGAGGS 178
+GG G G GA GN GG G GV G G G + ++ GSG G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 179 GGNGGAGGNATGSGGKGGAG 198
GNGG GN+ G G GG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 33.9 bits (77), Expect = 0.001
Identities = 36/121 (29%), Positives = 41/121 (33%), Gaps = 13/121 (10%)

Query: 13 GAGGNGGAGSAGNGGAGGAGGNGGAGGNGGGGDAGNAGSGGNGGKGGDGVGPGSTGGAGG 72
G G G A + GG G G GG D S N GG G G GG+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 73 KGGAGANGGSSNGNARGGNAGNGGHGGAGGSGDTGGAGGAGGQGGFGGTGGSGSGIGGGA 132
G GGN +GG G GG+ A A G G G + A
Sbjct: 63 GNG-------------GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109

Query: 133 G 133
G
Sbjct: 110 G 110



Score = 33.9 bits (77), Expect = 0.001
Identities = 27/79 (34%), Positives = 35/79 (44%), Gaps = 1/79 (1%)

Query: 69 GAGGKGGAGANGGSSNGNARGGNAGNGGHGGAG-GSGDTGGAGGAGGQGGFGGTGGSGSG 127
G G GA + G+ NG G G G G+G S + GG+G +GG G G+G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 128 IGGGAGGNGGNGGAGGTGV 146
G G G G G + V
Sbjct: 66 GGNGNSGGGSGTGGNLSAV 84



Score = 33.1 bits (75), Expect = 0.003
Identities = 28/90 (31%), Positives = 31/90 (34%), Gaps = 4/90 (4%)

Query: 240 GDGGKGGNGGAGGNGGSIGANSGIVGGSGGAGGAGGAGG----NGSLSSGEGGKGGDGGH 295
G G+G N GA G+I +G GGA G G S GG GH
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 296 GGDGVGGNSSVTQGGSGGGGGAGGAGGSGF 325
G G GNS G G GF
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGF 92



Score = 32.8 bits (74), Expect = 0.003
Identities = 27/84 (32%), Positives = 33/84 (39%)

Query: 193 GKGGAGGNGGDGSFGATSGPASIGVTGAPGGNGGKGGAGGSNPNGSGGDGGKGGNGGAGG 252
G G G N G S G+ G + G G + +NP G G G GG+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 253 NGGSIGANSGIVGGSGGAGGAGGA 276
G NSG G+GG A A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 31.2 bits (70), Expect = 0.009
Identities = 22/69 (31%), Positives = 27/69 (39%)

Query: 4 SPAAHGGSGGAGGNGGAGSAGNGGAGGAGGNGGAGGNGGGGDAGNAGSGGNGGKGGDGVG 63
P G GGA G S N GG+G GG G G+ G G+ G G G +
Sbjct: 23 GPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLS 82

Query: 64 PGSTGGAGG 72
+ A G
Sbjct: 83 AVAAPVAFG 91



Score = 30.8 bits (69), Expect = 0.012
Identities = 30/84 (35%), Positives = 36/84 (42%), Gaps = 5/84 (5%)

Query: 223 GNGGKGGAGGSNPNGSGGDGGKGGNGGAGGNGGSIGANSGIVGGSGGAGGAGGAGGNGSL 282
G G GA ++ N +GG G G GGA G N+ GGSG GG G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG---- 61

Query: 283 SSGEGGKGGDGGHGGDGVGGNSSV 306
G GG G+ G G G S+V
Sbjct: 62 -HGNGGGNGNSGGGSGTGGNLSAV 84



Score = 30.8 bits (69), Expect = 0.012
Identities = 31/80 (38%), Positives = 34/80 (42%), Gaps = 7/80 (8%)

Query: 83 SNGNARGGNAGNGGHGGAGGSGDTGGAGGAGGQGGFGGT-------GGSGSGIGGGAGGN 135
S G+ RG N G G G TG G G G G + GGSGSGI G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 136 GGNGGAGGTGVVLGGKGGDG 155
GNGG G G GG+
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 30.8 bits (69), Expect = 0.015
Identities = 30/83 (36%), Positives = 39/83 (46%), Gaps = 1/83 (1%)

Query: 290 GGDG-GHGGDGVGGNSSVTQGGSGGGGGAGGAGGSGFFGGKGGFGGDGGQGGPNGGGTVG 348
GGDG GH + ++ G +G G G G + GSG+ +GG G G GGG+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 349 TVAGGGGNGGVGGRGGDGVFAGA 371
GG GN G G G + A A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVA 85



Score = 30.5 bits (68), Expect = 0.019
Identities = 32/84 (38%), Positives = 33/84 (39%), Gaps = 5/84 (5%)

Query: 333 GGDGGQGGPNGGGTVGTVAGGGGNGGVGGRGGDGVFAGAGGQGGLGGQGGNGGGSTGGNG 392
GGDG T G + GG GVGG DG G G G GS G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDG-----SGWSSENNPWGGGSGSGIHWG 57

Query: 393 GLGGAGGGGGNAPDGGFGGNGGKG 416
G G G GGGN GG G GG
Sbjct: 58 GGSGHGNGGGNGNSGGGSGTGGNL 81



Score = 30.1 bits (67), Expect = 0.021
Identities = 25/86 (29%), Positives = 31/86 (36%)

Query: 369 AGAGGQGGLGGQGGNGGGSTGGNGGLGGAGGGGGNAPDGGFGGNGGKGGQGGIGGGTQSA 428
+G G+G G G GG GLG GG + G G GI G S
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 429 TGLGGDGGDGGDGGNGGNSGAKAGGA 454
G GG G+ G G G + +
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAP 87



Score = 30.1 bits (67), Expect = 0.023
Identities = 33/109 (30%), Positives = 40/109 (36%), Gaps = 9/109 (8%)

Query: 108 GAGGAGGQGGFGGTGGSGSGIGGGAGGNGGNGGAGGTGVVLGGKGGDGGNGDHGGPATNP 167
G G G G T G+ +G G G GG G GG G+G H G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG---- 58

Query: 168 GSGSRGGAGGSGGNGGAGGNATGSGGKGGAGGNGGDGSFGATSGPASIG 216
G G+GG G G +G+GG A F A S P + G
Sbjct: 59 -----GSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv3345c   cloacin       40     8e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 40.1 bits (93), Expect = 8e-05
Identities = 31/104 (29%), Positives = 37/104 (35%), Gaps = 2/104 (1%)

Query: 1174 LGGNGGAGGNGGVSTTGGD--GGAGGKGGNGGDGGNVGLGGDAGSGGAGGNGGIGTDAGG 1231
+ G G G N G +T G+ GG G G GG G + G G GI G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 1232 AGGAGGAGGNGGSSKSTTTGNAGSGGAGGNGGTGLNGAGGAGGA 1275
G GG GN G T + G L+ G G A
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 39.7 bits (92), Expect = 8e-05
Identities = 41/121 (33%), Positives = 50/121 (41%), Gaps = 5/121 (4%)

Query: 1311 AGGKGGNGSSGAASGSGVVNVTAGHGGNGGNGGNGGNGSAGAGGQGGAGGSAGNGGHGGG 1370
+GG G ++GA S SG +N G GG +G S+ GG GS + G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1371 ATGGDGGNGGNGGNSGNSTGVAGLAGGAAGAGGNGGGTSSAAGHGGSGGSGGSGTTGGAG 1430
GNGG GNSG +G G A G S G GG S +G A
Sbjct: 62 -----HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAI 116

Query: 1431 A 1431
A
Sbjct: 117 A 117



Score = 38.5 bits (89), Expect = 2e-04
Identities = 36/100 (36%), Positives = 46/100 (46%), Gaps = 2/100 (2%)

Query: 962 GNGGNGGNGGKGGTAGNGSGAAGGNGGNGG-SGLNGGDAGNGGNGGGALNQAGFFGTGGK 1020
G G G N G T+GN +G G G GG S +G + N GGG+ + + G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1021 GGNGGNGGAGMINGGLGGFGGAGGGGAVDVAA-TTGGAGG 1059
G GGNG +G +G G A A +T GAGG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 37.8 bits (87), Expect = 4e-04
Identities = 38/104 (36%), Positives = 43/104 (41%), Gaps = 3/104 (2%)

Query: 562 SGGNGGNGGAGATPTVAGENGGAGGNGGHGGSVGNGGAGGAGGNGVAGTGLALNGGNGGN 621
SGG+G GA T NGG G G GG + G+G + N G G GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGG--ASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 622 GGIGGNGGSAAGTGGDGGKGGNGGAGANGQDFSASANGANGGQG 665
G GNGG +GG G GGN A A F A G G
Sbjct: 60 SG-HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 37.0 bits (85), Expect = 6e-04
Identities = 26/85 (30%), Positives = 34/85 (40%)

Query: 599 AGGAGGNGVAGTGLALNGGNGGNGGIGGNGGSAAGTGGDGGKGGNGGAGANGQDFSASAN 658
+GG G G NGG G+G GG++ G+G GG +G + +
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 659 GANGGQGGNGGNGGIGGKGGDAFAT 683
NGG GN G G G A A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 36.6 bits (84), Expect = 9e-04
Identities = 34/109 (31%), Positives = 44/109 (40%), Gaps = 3/109 (2%)

Query: 1167 GPEGNGGLGGNGGAGGNGGVSTTGGDGGAGGKGGNGGDGGNVGLGGDAGSG---GAGGNG 1223
G +G G G GN TG G G G+G N GG +GSG G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1224 GIGTDAGGAGGAGGAGGNGGSSKSTTTGNAGSGGAGGNGGTGLNGAGGA 1272
G G G +GG G GGN + + + G GG ++ + GA
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 36.2 bits (83), Expect = 0.001
Identities = 35/116 (30%), Positives = 40/116 (34%), Gaps = 8/116 (6%)

Query: 1337 GNGGNGGNGGNGSAGAGGQGGAGGSAGNGGHGGGATGGDGGNGGNGGNSGNSTGVAGLAG 1396
G G G N G S GG G GG G +G N GG SG+ G +G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDG-SGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1397 GAAGAGGNGGGTSSAAGHGGSGGSGGSGTTGGAGAAGGNGGAGAGGGSLSTGQSGG 1452
G G G GGSG G A G + G G L+ S G
Sbjct: 62 HGNGGGNGNSG-------GGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 35.8 bits (82), Expect = 0.001
Identities = 29/80 (36%), Positives = 34/80 (42%), Gaps = 3/80 (3%)

Query: 1291 GGDGGNGGNGGHGGDGTTGGAGGKGGNGSSGAASGSG--VVNVTAGHGGNGGNGGNGGNG 1348
GGDG G H G G G GA+ GSG N G G G GG+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLG-VGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1349 SAGAGGQGGAGGSAGNGGHG 1368
GG G +GG +G GG+
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 35.8 bits (82), Expect = 0.002
Identities = 24/76 (31%), Positives = 32/76 (42%)

Query: 747 GNGGDGGKGGSGGNVGNGGNGGAGGNGAAGQAGTPGPTSGDSGTSGTDGGAGGNGGAGGA 806
G G + G + GN+ G G G GA+ +G + G SG+ GG G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 807 GGTLAGHGGNGGKGGN 822
GG GG+G G
Sbjct: 66 GGNGNSGGGSGTGGNL 81



Score = 35.1 bits (80), Expect = 0.003
Identities = 28/97 (28%), Positives = 33/97 (34%), Gaps = 1/97 (1%)

Query: 796 GAGGNGGAGGAGGTLAGHGGNGGKGGNGGQG-GIGGAGERGADGAGPNANGANGENGGSG 854
G G N GA G + G G GG G G G+G + G G+G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 855 GNGGDGGAGGNGGAGGKAQAAGYTDGATGTGGDGGNG 891
G G+ G G G A AA G G G
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 35.1 bits (80), Expect = 0.003
Identities = 27/83 (32%), Positives = 34/83 (40%)

Query: 1350 AGAGGQGGAGGSAGNGGHGGGATGGDGGNGGNGGNSGNSTGVAGLAGGAAGAGGNGGGTS 1409
+G G+G G+ G+ G G G GG SG S+ GG+ GGG+
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1410 SAAGHGGSGGSGGSGTTGGAGAA 1432
G G GGSGT G A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAV 84



Score = 34.7 bits (79), Expect = 0.003
Identities = 29/104 (27%), Positives = 31/104 (29%)

Query: 1136 LGGNGGLGGNGGVSETGFGGAGGNGGYGGPGGPEGNGGLGGNGGAGGNGGVSTTGGDGGA 1195
+ G G G N G T GG G G GG G G G S GG+
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 1196 GGKGGNGGDGGNVGLGGDAGSGGAGGNGGIGTDAGGAGGAGGAG 1239
G G G G G G A GAGG
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 34.7 bits (79), Expect = 0.003
Identities = 36/102 (35%), Positives = 40/102 (39%), Gaps = 3/102 (2%)

Query: 1056 GAGGNGGAGGFASTGLGGPGGAGGPGGAGDFASGVGGVGGAGGDGGAGGVGGFGGQGGIG 1115
G G N GA + GGP G G GGA D SG G G G+ GG G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASD-GSGWSSENNPWGGGSGSGIHWGGGSGHGN 64

Query: 1116 GEGRTGGNGGSGGDGGGGISLGGNGGLGGNGGVSETGFGGAG 1157
G G GN G G GG +S G +S G GG
Sbjct: 65 GGG--NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 34.3 bits (78), Expect = 0.004
Identities = 31/110 (28%), Positives = 37/110 (33%), Gaps = 10/110 (9%)

Query: 379 GGIGATANSPLQAGGAGGNGGHGGLVGNGGTGGAGGAGHAGSTGATGTALQPTGGNGTNG 438
GG G N+ + NGG GL GG G + G+ G G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 439 GAGGHGGNGGNGGAQHGDGGVGGKGGAGGSGGAGGNGFDAATLGSPGADG 488
G GG GN G G G S A F L +PGA G
Sbjct: 63 GNGGGNGNSGGGSG----------TGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 34.3 bits (78), Expect = 0.004
Identities = 28/101 (27%), Positives = 34/101 (33%), Gaps = 1/101 (0%)

Query: 826 GGIGGAGERGADGAGPNANGANGENGGSGGNGGDGGAGGNGGAGGKAQAAGYTDGATGTG 885
GG G GA N NG G GG G G +G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 886 GDGGNGGDGGKAGDGGAGENGLNSGAMLPGGGTVGNPGTGG 926
G+GG G+ G G G G + + G + PG GG
Sbjct: 63 GNGGGNGNSG-GGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 34.3 bits (78), Expect = 0.004
Identities = 37/126 (29%), Positives = 48/126 (38%), Gaps = 10/126 (7%)

Query: 1284 VSFGNAVGGDGGNGGNGGHGGDGTTGGAGGKGGNGSS---------GAASGSGVVNVTAG 1334
+S G+ G + G G+ G TG G G + S G SGSG+
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 1335 HGGNGGNGGNGGNGSAGAGGQGGAGGSAGNGGHGGGATGGDGGNGGNGGNSGNSTGVAGL 1394
GNGG GN G GS G GG A + G +T G GG + S +A +
Sbjct: 61 GHGNGGGNGNSGGGS-GTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADI 119

Query: 1395 AGGAAG 1400
G
Sbjct: 120 MAALKG 125



Score = 33.9 bits (77), Expect = 0.005
Identities = 25/78 (32%), Positives = 31/78 (39%)

Query: 1211 GGDAGSGGAGGNGGIGTDAGGAGGAGGAGGNGGSSKSTTTGNAGSGGAGGNGGTGLNGAG 1270
GGD G + G GG G G GG S ++ N GG+G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1271 GAGGAGGNAGVAGVSFGN 1288
G GG GN+G + GN
Sbjct: 63 GNGGGNGNSGGGSGTGGN 80



Score = 33.9 bits (77), Expect = 0.006
Identities = 24/86 (27%), Positives = 30/86 (34%)

Query: 1347 NGSAGAGGQGGAGGSAGNGGHGGGATGGDGGNGGNGGNSGNSTGVAGLAGGAAGAGGNGG 1406
+G G G GA ++GN G G GG G S + G +G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1407 GTSSAAGHGGSGGSGGSGTTGGAGAA 1432
+ GGSG G A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAP 87



Score = 33.5 bits (76), Expect = 0.008
Identities = 25/71 (35%), Positives = 34/71 (47%)

Query: 876 GYTDGATGTGGDGGNGGDGGKAGDGGAGENGLNSGAMLPGGGTVGNPGTGGNGGNGGNAG 935
G+ GA T G+ G G G G + +G +S GGG+ GG G+G G
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 936 VGGTGGKAGTG 946
G +GG +GTG
Sbjct: 68 NGNSGGGSGTG 78



Score = 33.1 bits (75), Expect = 0.008
Identities = 41/126 (32%), Positives = 44/126 (34%), Gaps = 9/126 (7%)

Query: 1096 AGGDGGAGGVGGFGGQGGIGGEGRTGGNGGSGGDGGGGISLGGNGGLGGNGGVSETGFGG 1155
+GGDG G G I G G TG G G G G S N GG+G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNING-GPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 1156 AGGNGGYGGPGGPEGNGGLGGNGGAGGNGGVSTTGGDGGAGGKGGNGGDGGNVGLGGDAG 1215
GNGG GNG GG G GGN G G G V + A
Sbjct: 61 GHGNGG--------GNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGAL 112

Query: 1216 SGGAGG 1221
S
Sbjct: 113 SAAIAD 118



Score = 33.1 bits (75), Expect = 0.010
Identities = 32/102 (31%), Positives = 38/102 (37%)

Query: 917 GTVGNPGTGGNGGNGGNAGVGGTGGKAGTGSLTGLDGTDGITPNGGNGGNGGNGGKGGTA 976
G G G GN G TG G G+ G + P GG G+G + G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 977 GNGSGAAGGNGGNGGSGLNGGDAGNGGNGGGALNQAGFFGTG 1018
GNG G GG+G G A G AL+ G G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 33.1 bits (75), Expect = 0.010
Identities = 35/102 (34%), Positives = 44/102 (43%), Gaps = 1/102 (0%)

Query: 175 GSGGAGAAGGVGGSGGWLNGNGGAGGAGGTGANGGAGGNAWLFGAGGSGGAGTNGGVGGS 234
G G G G + G +NG G GG GA+ G+G ++ GG G+G + G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGG-GASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 235 GGFVYGNGGAGGIGGIGGIGGNGGDAGLFGNGGAGGAGAAGL 276
G GNG +GG G GG FG GA GL
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGL 103



Score = 32.0 bits (72), Expect = 0.020
Identities = 39/114 (34%), Positives = 47/114 (41%), Gaps = 4/114 (3%)

Query: 285 GDGSDGGNGGTGGN--GGRGGLLVGNGGAGGAG-GVGGDGGKGGAGDPSFAVNNGAGGNG 341
G G + G T GN GG GL VG G + G+G + GG+G GNG
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 342 GHGGNPGVGGAGGAGGLLAGAHGAAGATPTSGGNGGDGGIGATANSPLQAGGAG 395
G GN G GG+G G L A A A P G G + + L A A
Sbjct: 66 GGNGNSG-GGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 32.0 bits (72), Expect = 0.021
Identities = 27/86 (31%), Positives = 32/86 (37%), Gaps = 7/86 (8%)

Query: 901 GAGENGLNSGAMLPGGGTVGNPGTGGNGGNGGNAGVGGTGGKAGTGSLTGLDGTDGITPN 960
G G N+GA G G P G GG G + G + G G +
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGG-------GASDGSGWSSENNPWGGGSGSGIH 55

Query: 961 GGNGGNGGNGGKGGTAGNGSGAAGGN 986
G G GNGG G +G GSG G
Sbjct: 56 WGGGSGHGNGGGNGNSGGGSGTGGNL 81



Score = 31.6 bits (71), Expect = 0.025
Identities = 39/121 (32%), Positives = 47/121 (38%), Gaps = 8/121 (6%)

Query: 1115 GGEGRTGGNGGSGGDGGGGISLGGNGGLGGNGGVSETGFGGAGGNGGYGGPGGPEGNGGL 1174
GG+GR G+ G I+ GG GLG GG S+ + N GG G GG
Sbjct: 3 GGDGR--GHNTGAHSTSGNIN-GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 1175 GGNGGAGGNGGVSTTGGDGGAGGKGGNGGDGGNVGLGGDAGSGGAGGNGGIGTDAGGAGG 1234
G+G GGNG GG+G G V G A S G + AG
Sbjct: 60 SGHGNGGGNGN-----SGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSA 114

Query: 1235 A 1235
A
Sbjct: 115 A 115



Score = 31.6 bits (71), Expect = 0.028
Identities = 29/102 (28%), Positives = 35/102 (34%), Gaps = 2/102 (1%)

Query: 1197 GKGGNGGDGGNVGLGGDAGSGGAGGNGGIGTDAGGAGGAGGAGGNGGSSKSTTTGNAGSG 1256
G G G + G G+ G G G G G + GGS G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1257 GAGGNGGTGLNGAGGAGGAGGNAGVAGVSFGNAVGGDGGNGG 1298
G GG G G+G G +A A V+FG G GG
Sbjct: 63 GNGGGNGNSGGGSGTGG--NLSAVAAPVAFGFPALSTPGAGG 102



Score = 31.6 bits (71), Expect = 0.032
Identities = 21/71 (29%), Positives = 27/71 (38%)

Query: 442 GHGGNGGNGGAQHGDGGVGGKGGAGGSGGAGGNGFDAATLGSPGADGGMGGNGGKGGDGG 501
G G G N GA G + G G GG +G ++ +P G G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 502 KAGDGGAGAAG 512
G G + G
Sbjct: 63 GNGGGNGNSGG 73



Score = 31.2 bits (70), Expect = 0.033
Identities = 25/78 (32%), Positives = 32/78 (41%), Gaps = 1/78 (1%)

Query: 1016 GTGGKGGNGG-NGGAGMINGGLGGFGGAGGGGAVDVAATTGGAGGNGGAGGFASTGLGGP 1074
G G+G N G + +G INGG G G GG ++ G G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1075 GGAGGPGGAGDFASGVGG 1092
G GG G +G + G
Sbjct: 63 GNGGGNGNSGGGSGTGGN 80



Score = 31.2 bits (70), Expect = 0.034
Identities = 26/79 (32%), Positives = 32/79 (40%)

Query: 221 GSGGAGTNGGVGGSGGFVYGNGGAGGIGGIGGIGGNGGDAGLFGNGGAGGAGAAGLPGAA 280
G G G N G + G + G G+GG G GG+G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 281 GLNGGDGSDGGNGGTGGNG 299
G GG+G+ GG GTGGN
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 31.2 bits (70), Expect = 0.041
Identities = 32/104 (30%), Positives = 35/104 (33%)

Query: 250 IGGIGGNGGDAGLFGNGGAGGAGAAGLPGAAGLNGGDGSDGGNGGTGGNGGRGGLLVGNG 309
+ G G G + G G G GL G + G G N GG G G G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 310 GAGGAGGVGGDGGKGGAGDPSFAVNNGAGGNGGHGGNPGVGGAG 353
G G GG G GG G G AV PG GG
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 30.8 bits (69), Expect = 0.043
Identities = 35/111 (31%), Positives = 40/111 (36%), Gaps = 2/111 (1%)

Query: 538 GGAGGVSANPALNGSAGANGTAPTSGGNGGNGGAGATPTVAGENGGAGGNGGHGGSVGNG 597
GG G A + S NG G GG + GG G+G H G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWG--GGS 60

Query: 598 GAGGAGGNGVAGTGLALNGGNGGNGGIGGNGGSAAGTGGDGGKGGNGGAGA 648
G G GGNG +G G G G A T G GG + AGA
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv3347c   cloacin  32     0.034    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 32.4 bits (73), Expect = 0.034
Identities = 24/91 (26%), Positives = 33/91 (36%), Gaps = 8/91 (8%)

Query: 363 GFNTGVANVGSYNTGSFNAGNTNTGGFNPGNVNTGWLNTGNTNTGIANSGNVNTGAFISG 422
G NTG + G+ N G T G + +GW + N G + SG G G
Sbjct: 8 GHNTGAHSTS----GNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 423 NFSNGVLWRGDYEGLWGLSGGSTIPAIPIGL 453
N G+ G G G + A P+
Sbjct: 64 NGGGN----GNSGGGSGTGGNLSAVAAPVAF 90


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv3349c   INTIMIN  31     0.005    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 30.8 bits (69), Expect = 0.005
Identities = 19/67 (28%), Positives = 33/67 (49%), Gaps = 4/67 (5%)

Query: 4 DPAAAYASAIRTPGLLPNAKLVVDHFHVTTLANDALTAVRRRVTWAFHDRRGRKIDPQWA 63
+P AA TP +P + +D+ H T ND L +++ + F ++I+PQ+
Sbjct: 369 NPGAATVGVNYTP--IPLVTMGIDYRHGTGNENDLLYSMQ--FRYQFDKPWSQQIEPQYV 424

Query: 64 NRRRLLT 70
N R L+
Sbjct: 425 NELRTLS 431


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv3350c   PYOCINKILLER  37     0.002    Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 36.7 bits (84), Expect = 0.002
Identities = 46/199 (23%), Positives = 68/199 (34%), Gaps = 17/199 (8%)

Query: 3442 ISVPSIHLGLDPAVHVGSITVNPITVRTPPVLVSYSQGAVTSTSGPTSEIWVKPSFFPGI 3501
+ + + LGL P+V++ ++ TV P L + ++G T+ S +++ P P
Sbjct: 328 LGMDAAKLGLPPSVNLNAVAKASGTVDLPMRLTNEARGNTTTLSVVSTDGVSVPKAVPVR 387

Query: 3502 RIAPSSGGGATSTQGAYFVGPISIPSGTVTFPGFTIPLDPIDIGLPVSLT--IP-GFTI- 3557
A +T G Y V S + P P P S T +P +
Sbjct: 388 MAAY------NATTGLYEVTVPSTTAEAPPLILTWTPASPPGNQNPSSTTPVVPKPVPVY 441

Query: 3558 PGGTLIPTLPLGLALSNGIPPVDIPAIVLDRILLDLHADTTIGPINVPIAGFGGAPGFGN 3617
G TL P I + I AD+ I PI V PG
Sbjct: 442 EGATLTPVKATPETYPGVITLPEDLIIGFP-------ADSGIKPIYVMFRDPRDVPGAAT 494

Query: 3618 STTLPSSGFFNTGAGGGSG 3636
P SG + A G G
Sbjct: 495 GKGQPVSGNWLGAASQGEG 513


77  Rv3420c  Rv3427c  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv3420c   1       17       -0.379903  Ribosomal-protein-alanine acetyltransferase RimI
Rv3421c   0       20       -2.258486  Conserved hypothetical protein
Rv3422c   -1      18       -3.173656  Conserved hypothetical protein
Rv3423c   -1      18       -2.848693  Alanine racemase Alr
Rv3424c   1       23       -4.644413  Hypothetical protein
Rv3425    0       19       -3.478445  PPE family protein PPE57
Rv3426    -1      18       -2.630668  PPE family protein PPE58
Rv3427c   0       17       -1.291636  Possible transposase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv3420c   SACTRNSFRASE  43     5e-08    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 43.4 bits (102), Expect = 5e-08
Identities = 15/66 (22%), Positives = 28/66 (42%), Gaps = 3/66 (4%)

Query: 79 VHTIGVDPAYQGRGIGRRLLRELLDFARG---GVVYLEVRTDNDAALALYRSVGFQRVGL 135
+ I V Y+ +G+G LL + +++A+ + LE + N +A Y F +
Sbjct: 92 IEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGAV 151

Query: 136 RRRYYR 141
Y
Sbjct: 152 DTMLYS 157


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv3422c   ABC2TRNSPORT  31     0.002    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 30.7 bits (69), Expect = 0.002
Identities = 15/47 (31%), Positives = 21/47 (44%), Gaps = 1/47 (2%)

Query: 22 ATLPRVEDTLTLGSRLGEQLCAGDVVVLSGPLGAGKTVLAKGIAMAM 68
A R+E T + L QL GD+V+ A K LA G + +
Sbjct: 89 AAFGRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALA-GAGIGV 134


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
Rv3423c   ALARACEMASE  425    e-151    Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 425 bits (1094), Expect = e-151
Identities = 119/373 (31%), Positives = 179/373 (47%), Gaps = 28/373 (7%)

Query: 36 AEAMVDLGAIEHNVRVLREHAGHAQLMAVVKADGYGHGATRVAQTALGAGAAELGVATVD 95
+A +DL A++ N+ ++R+ A HA++ +VVKA+ YGHG R+ + ++
Sbjct: 5 IQASLDLQALKQNLSIVRQAATHARVWSVVKANAYGHGIERIWSAIGATDG--FALLNLE 62

Query: 96 EALALRADGITAPVL---AWLHPPGIDFGPALLADVQVAVSSLRQLDELLHAVRRTGRTA 152
EA+ LR G P+L + H ++ + V S QL L +A R
Sbjct: 63 EAITLRERGWKGPILMLEGFFHAQDLEIYDQ--HRLTTCVHSNWQLKALQNA--RLKAPL 118

Query: 153 TVTVKVDTGLNRNGVGPAQFPAMLTALRQAMAEDAVRLRGLMSHMVYADKPDDSINDVQA 212
+ +KV++G+NR G P + +LT +Q A V LMSH A+ PD +
Sbjct: 119 DIYLKVNSGMNRLGFQPDR---VLTVWQQLRAMANVGEMTLMSHFAEAEHPDGISGAM-- 173

Query: 213 QRFTAFLAQAREQGVRFEVAHLSNSSATMARPDLTFDLVRPGIAVYGLSPVPALGDM--- 269
A + QA E G+ LSNS+AT+ P+ FD VRPGI +YG SP D+
Sbjct: 174 ----ARIEQAAE-GLECRR-SLSNSAATLWHPEAHFDWVRPGIILYGASPSGQWRDIANT 227

Query: 270 GLVPAMTVKCAVALVKSIRAGEGVSYGHTWIAPRDTNLALLPIGYADGVFRSLGGRLEVL 329
GL P MT+ + V++++AGE V YG + A + + ++ GYADG R VL
Sbjct: 228 GLRPVMTLSSEIIGVQTLKAGERVGYGGRYTARDEQRIGIVAAGYADGYPRHAPTGTPVL 287

Query: 330 INGRRCPGVGRICMDQFMVDLGPGPLDVAEGDEAILFGPGIRGEPTAQDWADLVGTIHYE 389
++G R VG + MD VDL P P G L+G E D A GT+ YE
Sbjct: 288 VDGVRTMTVGTVSMDMLAVDLTPCP-QAGIGTPVELWGK----EIKIDDVAAAAGTVGYE 342

Query: 390 VVTSPRGRITRTY 402
++ + R+
Sbjct: 343 LMCALALRVPVVT 355


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
Rv3425    SECBCHAPRONE  28     0.023    Bacterial protein-transport SecB chaperone protein ...
		>SECBCHAPRONE#Bacterial protein-transport SecB chaperone protein signature.

Length = 170

Score = 27.6 bits (61), Expect = 0.023
Identities = 12/50 (24%), Positives = 26/50 (52%), Gaps = 1/50 (2%)

Query: 5 IPAEYISNIIYEGPGADSLFFASGQLRELAYSVETTAESLEDELDELDEN 54
I Y+ ++ +E P +F + +L++ + T A+ + D+L E+ N
Sbjct: 22 IQRIYVKDVSFEAPNLPHIFQQDWE-PKLSFDLSTEAKQVGDDLYEVCLN 70


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
Rv3427c   HTHFIS  28     0.045    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 27.9 bits (62), Expect = 0.045
Identities = 20/84 (23%), Positives = 29/84 (34%), Gaps = 4/84 (4%)

Query: 48 DEIARRESAALTRRLRRAKFEAQATFEDFDFTANPKLPGAMLRDLAALRWLDAGE-SVIL 106
E+ AL RR + + AM L L + ++++
Sbjct: 109 TELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSA---AMQEIYRVLARLMQTDLTLMI 165

Query: 107 HGPVGVGKTHVAQALVHAVARRGG 130
G G GK VA+AL RR G
Sbjct: 166 TGESGTGKELVARALHDYGKRRNG 189


78  Rv3507  Rv3517  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv3507    17      32       11.772814  PE-PGRS family protein PE_PGRS53
Rv3508    14      29       10.986745  PE-PGRS family protein PE_PGRS54
Rv3509c   14      30       10.655543  Probable acetohydroxyacid synthase IlvX
Rv3510c   12      31       10.087774  Conserved protein
Rv3511    12      32       10.290023  PE-PGRS family protein PE_PGRS55
Rv3512    9       31       9.493548   PE-PGRS family protein PE_PGRS56
Rv3513c   4       25       6.507313   Probable fatty-acid-CoA ligase FadD18 (fragment)
Rv3514    4       25       6.485584   PE-PGRS family protein PE_PGRS57
Rv3515c   -1      13       -0.225502  Fatty-acid-CoA ligase FadD19 (fatty-acid-CoA
Rv3516    -1      13       0.241866   Possible enoyl-CoA hydratase EchA19 (enoyl
Rv3517    0       12       0.058828   Conserved hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv3507    cloacin  39     1e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.9 bits (90), Expect = 1e-04
Identities = 35/103 (33%), Positives = 43/103 (41%)

Query: 1172 NGGNGGNGNDGAVNAGTGGSGGNGGNAGGGGANGGDGGAGGAGGAGGRGGKGIDGGFGGD 1231
+GG+G N GA + +GG G GGGA+ G G + GG G GI G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1232 GGNGGSNNGTGAGGNGGNGGTGGVGSVGAAGGDGGNGGTGGFA 1274
GNGG N +G G G + V G GG A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 37.8 bits (87), Expect = 3e-04
Identities = 30/79 (37%), Positives = 35/79 (44%)

Query: 1154 GKGGDGGNIAAGATGTAGNGGNGGNGNDGAVNAGTGGSGGNGGNAGGGGANGGDGGAGGA 1213
G G G N A +T NGG G G G + G+G S N GG G+ GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1214 GGAGGRGGKGIDGGFGGDG 1232
G GG G G G GG+
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 37.0 bits (85), Expect = 5e-04
Identities = 34/99 (34%), Positives = 37/99 (37%), Gaps = 1/99 (1%)

Query: 837 GNAGDGGNGGNAGAGGNGGSGDFGGNTTSGAS-GSGGNGGNAGTAGSGGAGGTGGTGLSG 895
G G G N G GN G G GAS GSG + N G G+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 896 GNGGNGGNGGNGGDGGNGAHGTVGAQFVPATSLPTPNGG 934
GNGG GN G G G +L TP G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAG 101



Score = 37.0 bits (85), Expect = 5e-04
Identities = 35/109 (32%), Positives = 47/109 (43%), Gaps = 1/109 (0%)

Query: 518 TGGNGGNGTGGVNGADNTLNPDTPGGAGEPGGAGGAGGAGGAAGGPGGTGGTGGNGGNGG 577
+GG+G G + +N G G + G+G + GG+G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 578 NGGNGGNGGNGGNGGNAGNNSTNA-PVGGEGGAGGDGGAGGAGGAANGG 625
+G GGNG +GG G GN S A PV A GAGG + + G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 36.2 bits (83), Expect = 0.001
Identities = 28/76 (36%), Positives = 33/76 (43%)

Query: 1148 GNGSAGGKGGDGGNIAAGATGTAGNGGNGGNGNDGAVNAGTGGSGGNGGNAGGGGANGGD 1207
G G G GNI G TG GG + N GG G+G + GGG +G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 1208 GGAGGAGGAGGRGGKG 1223
GG G +GG G GG
Sbjct: 66 GGNGNSGGGSGTGGNL 81



Score = 36.2 bits (83), Expect = 0.001
Identities = 32/81 (39%), Positives = 37/81 (45%), Gaps = 2/81 (2%)

Query: 436 GKGGNGGIGGAAVTGGVAGDGGTGGKGGTGGAGGAGNDAGSTGNPGGKGGDGGIGGAGGA 495
G G G GA T G G TG G G + G+G S NP G G GI GG+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSG--WSSENNPWGGGSGSGIHWGGGS 60

Query: 496 GGAAGTGNGGHAGNTGDGGDG 516
G G GNG G +G GG+
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNL 81



Score = 35.8 bits (82), Expect = 0.001
Identities = 29/81 (35%), Positives = 38/81 (46%), Gaps = 1/81 (1%)

Query: 1167 TGTAGNGGNGGNGN-DGAVNAGTGGSGGNGGNAGGGGANGGDGGAGGAGGAGGRGGKGID 1225
+G G G N G + G +N G G G GG + G G + + GG G+G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1226 GGFGGDGGNGGSNNGTGAGGN 1246
G GG GN G +GTG +
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLS 82



Score = 35.1 bits (80), Expect = 0.002
Identities = 32/96 (33%), Positives = 41/96 (42%), Gaps = 3/96 (3%)

Query: 1061 GGEGGVGSILGGPGGNGGTGGNASATGTNGVANAGNGGKGGDGGQFGAGGNGGAGGSVTD 1120
G G+I GGP G G GG + +G + N GG G G G+G GG +
Sbjct: 12 GAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG---N 68

Query: 1121 GSAGSTAGNGGNGGNATNGTIAGQPAGGNGSAGGKG 1156
G++G +G GGN G PA AGG
Sbjct: 69 GNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 35.1 bits (80), Expect = 0.002
Identities = 25/84 (29%), Positives = 33/84 (39%)

Query: 1263 GDGGNGGTGGFAGFGGTAGNGGSGGTGGAGGDGGTGGDGGNGVIAGGGGTGGNGGASGAG 1322
G G G G G G +G G G G+G N GG G+G + G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1323 GAGGTGGFAGNGNAGGNGGTGGAS 1346
G GG G +G G+ G + A+
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 34.3 bits (78), Expect = 0.004
Identities = 35/105 (33%), Positives = 44/105 (41%), Gaps = 5/105 (4%)

Query: 1233 GNGGSNNGTGAGGNGGNGGTGGVGSVGAAGGDGGNGGTGGFAGFGGTAGNGGSGGTGGAG 1292
G G + TGA GN G G G G+G + +GG +G+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1293 GDGGTGGDGGNGVIAGGGGTGGNGGASGAGGAGGTGGFAGNGNAG 1337
G+GG G+ G G GTGGN A A A G + G G
Sbjct: 63 GNGGGNGNSGG-----GSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 34.3 bits (78), Expect = 0.004
Identities = 30/85 (35%), Positives = 38/85 (44%), Gaps = 4/85 (4%)

Query: 1184 VNAGTGGSGGNGGNAGGGGANGGDGGAGGAGGAGGRGGKGIDGGFGGDGGNGGSNNGTGA 1243
++ G G G ++ G NGG G G GGA G + G GGS +G
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWG----GGSGSGIHW 56

Query: 1244 GGNGGNGGTGGVGSVGAAGGDGGNG 1268
GG G+G GG G+ G G GGN
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGTGGNL 81



Score = 33.9 bits (77), Expect = 0.004
Identities = 32/102 (31%), Positives = 36/102 (35%), Gaps = 1/102 (0%)

Query: 580 GNGGNGGNGGNGGNAGNNSTNAPVGGEGGAGGDGGAGGAGGAANGGTAGSQ-GTGGVGGD 638
G G G N G +GN + G GG DG + GG +GS GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 639 GGAGGNGGGGKAGTGNSGNFGVDGEAGFSGGAGGNGGVGGAA 680
G GGNG G V F A G GG A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 33.9 bits (77), Expect = 0.005
Identities = 32/103 (31%), Positives = 41/103 (39%)

Query: 641 AGGNGGGGKAGTGNSGNFGVDGEAGFSGGAGGNGGVGGAAGANGGTGGSGGNGGDGGAGG 700
+GG+G G G ++ G G G G + G G ++ N GGSG GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 701 IGGAGGNGIPGTGTEPAGGTGAKGGDGGDGGAGGAGGNAGGAG 743
G GGNG G G+ G A G + AGG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 33.5 bits (76), Expect = 0.006
Identities = 27/78 (34%), Positives = 33/78 (42%)

Query: 206 GGTGGNGGNGALLIGGGGLGGAGGMGGTGGGTGGTGGNGGNGALLIGAGGVGGAGGIGGQ 265
GG G GA G GG G+G GG + G+G + N G+G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 266 GTGAGGAAGAGGTGGNGG 283
G G G GG+G G
Sbjct: 63 GNGGGNGNSGGGSGTGGN 80



Score = 33.5 bits (76), Expect = 0.006
Identities = 27/86 (31%), Positives = 36/86 (41%), Gaps = 5/86 (5%)

Query: 542 GGAGEPGGAGGAGGAGGAAGGPGGTGGTGGNGGNGGNGGNGGNGGNGGNGGNAGNNSTNA 601
GG G G +G GGP G G GG + G+G + N GG +G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGG-----ASDGSGWSSENNPWGGGSGSGIHWG 57

Query: 602 PVGGEGGAGGDGGAGGAGGAANGGTA 627
G G GG+G +GG G +A
Sbjct: 58 GGSGHGNGGGNGNSGGGSGTGGNLSA 83



Score = 33.1 bits (75), Expect = 0.008
Identities = 32/110 (29%), Positives = 40/110 (36%)

Query: 164 NGGAGGAGGSGGAAGGNGGNGGWLFGAGGTGGIGGTGAPGAMGGTGGNGGNGALLIGGGG 223
+GG G +G + NGG G G G+G GG G+G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 224 LGGAGGMGGTGGGTGGTGGNGGNGALLIGAGGVGGAGGIGGQGTGAGGAA 273
G GG G +GGG+G G A + G GG A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 32.8 bits (74), Expect = 0.011
Identities = 28/85 (32%), Positives = 35/85 (41%), Gaps = 2/85 (2%)

Query: 1291 AGGDGGTGGDGGNGVIAGGGGTGGNGGASGAGGAGGTGGFAGNGNAGGNGGTGGASEDGD 1350
+GGDG G G GG G GGA G++ N G G G G
Sbjct: 2 SGGDGR--GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 1351 NGNAGSGATGGTGGNGGTGGDGGAA 1375
+G+ G G +GG GTGG+ A
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAV 84



Score = 32.8 bits (74), Expect = 0.012
Identities = 28/83 (33%), Positives = 31/83 (37%)

Query: 1299 GDGGNGVIAGGGGTGGNGGASGAGGAGGTGGFAGNGNAGGNGGTGGASEDGDNGNAGSGA 1358
G G G G T GN G G G G+G + N GG S G + GSG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1359 TGGTGGNGGTGGDGGAAGLGGVA 1381
G G GG G L VA
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVA 85



Score = 32.4 bits (73), Expect = 0.013
Identities = 31/111 (27%), Positives = 46/111 (41%), Gaps = 7/111 (6%)

Query: 784 GKGGQGGSGGTGGSGAPIGGGAGGTGGSGGHAGKGGAGGIGAQGTTITVPGNGGNAGDGG 843
G G+G + G + I GG G G GG G+ ++ P GG+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGG-------ASDGSGWSSENNPWGGGSGSGIH 55

Query: 844 NGGNAGAGGNGGSGDFGGNTTSGASGSGGNGGNAGTAGSGGAGGTGGTGLS 894
GG +G G GG+G+ GG + +G + S A + G GG +S
Sbjct: 56 WGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVS 106



Score = 32.4 bits (73), Expect = 0.015
Identities = 28/86 (32%), Positives = 35/86 (40%), Gaps = 6/86 (6%)

Query: 1133 GGNATNGTIAGQPAGGNGSAGGKGGDGGNIAAGATGTAGNGGNGGNGNDGAVNAGTGGSG 1192
G ++T+G I G P G G G G + G G G G G+G
Sbjct: 12 GAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG------HGNG 65

Query: 1193 GNGGNAGGGGANGGDGGAGGAGGAGG 1218
G GN+GGG GG+ A A A G
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 32.4 bits (73), Expect = 0.016
Identities = 30/95 (31%), Positives = 36/95 (37%), Gaps = 1/95 (1%)

Query: 1287 GTGGAGGDGGTGGDGGNGVIAGGGGTGGNGGASGAGGAGGTGGFAGNGNAGGNGGTGGAS 1346
G G G + G GN + G G G GGAS G G G+ G GG+
Sbjct: 3 GGDGRGHNTGAHSTSGN-INGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1347 EDGDNGNAGSGATGGTGGNGGTGGDGGAAGLGGVA 1381
GN SG GTGGN A G ++
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALS 96



Score = 32.0 bits (72), Expect = 0.019
Identities = 27/83 (32%), Positives = 33/83 (39%)

Query: 551 GGAGGAGGAAGGPGGTGGTGGNGGNGGNGGNGGNGGNGGNGGNAGNNSTNAPVGGEGGAG 610
GG G G G G+G + N GG G+G + G G GN G N + G GG
Sbjct: 22 GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNL 81

Query: 611 GDGGAGGAGGAANGGTAGSQGTG 633
A A G T G+ G
Sbjct: 82 SAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.0 bits (72), Expect = 0.020
Identities = 32/96 (33%), Positives = 36/96 (37%), Gaps = 2/96 (2%)

Query: 733 GGAGGNAGGAGGQGGNAGQGGAGGAGGNAVIPGDGVGKAPHGDAGGSGGDGGKGGQGGSG 792
G G+ GA GN G G G G G + GGSG GG G G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 793 GTGGSGAPIGGGAGGTGGSGGHAGKGGAGGIGAQGT 828
GG+G GG GTGG+ A G A T
Sbjct: 64 NGGGNGN--SGGGSGTGGNLSAVAAPVAFGFPALST 97



Score = 32.0 bits (72), Expect = 0.020
Identities = 28/79 (35%), Positives = 38/79 (48%)

Query: 1223 GIDGGFGGDGGNGGSNNGTGAGGNGGNGGTGGVGSVGAAGGDGGNGGTGGFAGFGGTAGN 1282
G DG G + S N G G GG GS ++ + GG+G +GG +G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1283 GGSGGTGGAGGDGGTGGDG 1301
G GG G +GG GTGG+
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 32.0 bits (72), Expect = 0.021
Identities = 31/106 (29%), Positives = 37/106 (34%), Gaps = 5/106 (4%)

Query: 618 AGGAANGGTAGSQGTGGV--GGDGGAGGNGGGGKAGTGNSGNFGVDGEAGFSGGAGGNGG 675
+GG G G+ T G GG G G GG +S N + G SG GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSEN---NPWGGGSGSGIHWGG 58

Query: 676 VGGAAGANGGTGGSGGNGGDGGAGGIGGAGGNGIPGTGTEPAGGTG 721
G G GG+G G + G P T AGG
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.0 bits (72), Expect = 0.021
Identities = 30/86 (34%), Positives = 40/86 (46%), Gaps = 6/86 (6%)

Query: 121 GANATTPGGNGGDGGWLFGSGGNGAPGAAGQSGGNGGSAGLWGNGGAGGAGGSGGAAGGN 180
GA++T+ NGG G G G + G + ++ GG +G G GGSG GG
Sbjct: 12 GAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSG----SGIHWGGGSGHGNGGG 67

Query: 181 GGNGGWLFGAGGTGGIGGTGAPGAMG 206
GN G G+G G + AP A G
Sbjct: 68 NGNSGG--GSGTGGNLSAVAAPVAFG 91



Score = 31.6 bits (71), Expect = 0.023
Identities = 29/110 (26%), Positives = 36/110 (32%)

Query: 667 SGGAGGNGGVGGAAGANGGTGGSGGNGGDGGAGGIGGAGGNGIPGTGTEPAGGTGAKGGD 726
SGG G G + + GG G G GGA G P G +G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 727 GGDGGAGGAGGNAGGAGGQGGNAGQGGAGGAGGNAVIPGDGVGKAPHGDA 776
G+GG G G G GG A G + G+ + A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 31.2 bits (70), Expect = 0.036
Identities = 28/105 (26%), Positives = 36/105 (34%), Gaps = 6/105 (5%)

Query: 677 GGAAGANGGTGGSGGNGGDGGAGGIGGAGGNGIPGTGTEPAGGTGAKGGDGGDGGAGGAG 736
G G N G + GN G G G G + G +E GG G G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSE------NNPWGGGSGSGIHWG 57

Query: 737 GNAGGAGGQGGNAGQGGAGGAGGNAVIPGDGVGKAPHGDAGGSGG 781
G +G G G GG+G G + + P G+GG
Sbjct: 58 GGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 31.2 bits (70), Expect = 0.037
Identities = 31/77 (40%), Positives = 36/77 (46%), Gaps = 7/77 (9%)

Query: 565 GTGGTGGNGGNGG-----NGGNGGNGGNGGNGGNAGNNSTNAPVGGEGGAGGD--GGAGG 617
G G G N G NGG G G GG +G +S N P GG G+G GG+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 618 AGGAANGGTAGSQGTGG 634
G NG + G GTGG
Sbjct: 63 GNGGGNGNSGGGSGTGG 79



Score = 30.8 bits (69), Expect = 0.046
Identities = 32/102 (31%), Positives = 41/102 (40%), Gaps = 2/102 (1%)

Query: 314 GTGGKGGQGGDGGTGGAGGAGPVLFGHGGAGGMGGQGGTGGMGGAGGDGTTVIAAGTGGE 373
G G+G G T G GP G GG G + GG G+ + G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 374 GGTGGAAGAGGAAGARGALTS--GGLAGGVGAGGTGGTGGTG 413
G GG +GG +G G L++ +A G A T G GG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv3508    cloacin  38     3e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.2 bits (88), Expect = 3e-04
Identities = 36/107 (33%), Positives = 41/107 (38%), Gaps = 4/107 (3%)

Query: 1782 GGAGGLGGGGGTGGTNGNGGLGGGGGNGGAGGAGGTPTGSGTEGTGGDGGDAGAGGNGGS 1841
G G G + N NGG G G GGA G + + G G G GG+G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 1842 ATGVGNGGNGGDGGNGGDGGNGAPGGFGGGA----GAGGLGGSGAGG 1884
G GG G G AP FG A GAGGL S + G
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 37.8 bits (87), Expect = 5e-04
Identities = 33/79 (41%), Positives = 39/79 (49%), Gaps = 3/79 (3%)

Query: 923 GQGGQGGLGGASTTSINANGGAGGNGGTGGKGGAGGAGTLGV---GGSGGTGGDGGDAGS 979
G G+G GA +TS N NGG G G GG G + GGSG GG +G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 980 GGGGGFGGAAGKAGGGGNG 998
G GGG G + G +G GGN
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 37.8 bits (87), Expect = 5e-04
Identities = 33/79 (41%), Positives = 39/79 (49%), Gaps = 3/79 (3%)

Query: 1124 GQGGQGGLGGASTTSINANGGAGGNGGTGGKGGAGGAGTLGV---GGSGGTGGDGGDAGS 1180
G G+G GA +TS N NGG G G GG G + GGSG GG +G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1181 GGGGGFGGAAGKAGGGGNG 1199
G GGG G + G +G GGN
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 37.4 bits (86), Expect = 7e-04
Identities = 31/79 (39%), Positives = 40/79 (50%), Gaps = 2/79 (2%)

Query: 538 GAGGAGGNTGVGGTNGS--GGQGGTGGAGGAGGAGGVGADNPTGIGGTGGTGGKGGAGGA 595
G G G NTG T+G+ GG G G GGA G ++N GG+G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 596 GGQGGSSGAGGTNGSGGAG 614
G GG+ +GG +G+GG
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 36.6 bits (84), Expect = 9e-04
Identities = 38/119 (31%), Positives = 46/119 (38%), Gaps = 5/119 (4%)

Query: 1094 GAAGNGGNGGAGGAGGAGDNNFNGGQGGAGGQGGQGGLGGASTTSINANGGAGGNGGTGG 1153
G G G N GA G N NGG G G GG G S+ + GG+G GG
Sbjct: 3 GGDGRGHNTGAHSTSG----NINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 1154 KGGAGGAGTLGVGGSGGTGGDGGDAGSGGGGGFGGAAGKAGGGGNGGVGGDGGEGASGL 1212
G G G G G GG+G G + FG A G G V G ++ +
Sbjct: 59 GSGHGNGGGNGNSG-GGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAI 116



Score = 35.1 bits (80), Expect = 0.003
Identities = 36/104 (34%), Positives = 41/104 (39%), Gaps = 5/104 (4%)

Query: 893 GAAGNGGNGGAGGAGGAGDNNFNGGQGGAGGQGGQGGLGGASTTSINANGGAGGNGGTGG 952
G G G N GA G N NGG G G GG G S+ + GG+G GG
Sbjct: 3 GGDGRGHNTGAHSTSG----NINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 953 KGGAGGAGTLGVGGSGGTGGDGGDAGSGGGGGFGGAAGKAGGGG 996
G G G G G GG+G G + FG A G G
Sbjct: 59 GSGHGNGGGNGNSG-GGSGTGGNLSAVAAPVAFGFPALSTPGAG 101



Score = 34.7 bits (79), Expect = 0.004
Identities = 32/108 (29%), Positives = 40/108 (37%), Gaps = 1/108 (0%)

Query: 709 GAGGEGGAGGNSGVGGTNGSGGAGGAGGKGGTGGAGGSGADNPTGAGFAGGAGGTGGAAG 768
G G G G G G G G G + G+G S +NP G G G GG+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 769 AGGAGGATGTGGTGGVVGATGSAGIGGAGGRGGDGGDGASGLGLGLSG 816
G G GG+G G + A G GA GL + +S
Sbjct: 63 GNGGGNGNSGGGSGT-GGNLSAVAAPVAFGFPALSTPGAGGLAVSISA 109



Score = 34.7 bits (79), Expect = 0.004
Identities = 30/83 (36%), Positives = 35/83 (42%)

Query: 1410 GGAGGRGGDGGAGDSGALSQDGNGFAGGQGGQGGVGGNAGAGGINGAGGTGGTGGAGGDG 1469
GG G G GGA D S + N + GG G GG +G G G G +GG G GG+
Sbjct: 22 GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNL 81

Query: 1470 QNGTTGVASEGGAGGQGGDGGQG 1492
VA A G GG
Sbjct: 82 SAVAAPVAFGFPALSTPGAGGLA 104



Score = 34.7 bits (79), Expect = 0.004
Identities = 30/83 (36%), Positives = 35/83 (42%)

Query: 1615 GGAGGRGGDGGAGDSGALSQDGNGFAGGQGGQGGVGGNAGAGGINGAGGTGGTGGAGGDG 1674
GG G G GGA D S + N + GG G GG +G G G G +GG G GG+
Sbjct: 22 GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNL 81

Query: 1675 QNGTTGVASEGGAGGQGGDGGQG 1697
VA A G GG
Sbjct: 82 SAVAAPVAFGFPALSTPGAGGLA 104



Score = 34.7 bits (79), Expect = 0.004
Identities = 31/97 (31%), Positives = 37/97 (38%)

Query: 126 GGAGGAGGLLFGNGGAGAAGAPGQAGGPGGPAGLWGNGGPGGAGGSGGGTGGAGGAGGWL 185
GG G G + G+G + GG G WG G G GG G +GG G GG L
Sbjct: 22 GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNL 81

Query: 186 FGVGGAGGVGGAGGGTGGAGGPGGLIWGGGGAGGVGG 222
V G T GAGG I G + +
Sbjct: 82 SAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 33.9 bits (77), Expect = 0.006
Identities = 34/104 (32%), Positives = 38/104 (36%), Gaps = 1/104 (0%)

Query: 1171 TGGDGGDAGSGGGGGFGGAAGKAGGGGNGGVGGDGGEGASGLGLGLSGFDGGQGGQGGAG 1230
+GGDG +G G G G G GG G G G S G G GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGG-GASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 1231 GSAGAGGINGAGGAGGTGGAGGDGAPATLIGGPDGGDGGQGGIG 1274
G GG +GG GTGG A G P G GG+
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 33.9 bits (77), Expect = 0.006
Identities = 35/102 (34%), Positives = 39/102 (38%)

Query: 759 GAGGTGGAAGAGGAGGATGTGGTGGVVGATGSAGIGGAGGRGGDGGDGASGLGLGLSGFD 818
G G G GA G G TG VG S G G + GG SG+ G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 819 GGQGGQGGAGGSAGAGGINGAGGAGGNGGDGGDGATGAAGLG 860
G GG G +GG +G GG A A G GA GL
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 33.5 bits (76), Expect = 0.008
Identities = 33/101 (32%), Positives = 39/101 (38%)

Query: 1435 AGGQGGQGGVGGNAGAGGINGAGGTGGTGGAGGDGQNGTTGVASEGGAGGQGGDGGQGGI 1494
+GG G G ++ +G ING G GG DG ++ GG G G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1495 GGAGGNAGFGAGVPGDGGIGGTGGAGGAGGAGADGDPSIDG 1535
G GG G G G GG A A G A P G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 33.5 bits (76), Expect = 0.008
Identities = 33/101 (32%), Positives = 39/101 (38%)

Query: 1640 AGGQGGQGGVGGNAGAGGINGAGGTGGTGGAGGDGQNGTTGVASEGGAGGQGGDGGQGGI 1699
+GG G G ++ +G ING G GG DG ++ GG G G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1700 GGAGGNAGFGAGVPGDGGIGGTGGAGGAGGAGADGDPSIDG 1740
G GG G G G GG A A G A P G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 33.5 bits (76), Expect = 0.009
Identities = 35/113 (30%), Positives = 50/113 (44%), Gaps = 2/113 (1%)

Query: 594 GAGGQGGSSGAGGTNGS--GGAGGTGGQGGAGGAGGAGADNPTGIGGAGGTGGTGGAAGA 651
G G+G ++GA T+G+ GG G G GGA G ++N GG+G GG +G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 652 GGAGGAIGTGGTGGAVGSVGNAGIGGTGGTGGVGGAGGAGAAAAAGSSATGGA 704
G GG +GG G G++ G + G G A + + A A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 33.1 bits (75), Expect = 0.012
Identities = 25/85 (29%), Positives = 31/85 (36%)

Query: 1730 AGADGDPSIDGGQGGAGGHGGQGGKGGLNSTGLASAASGDGGNGGAGGAGGNGGAGGLGG 1789
+G DG G +G G G+ + N GG+G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1790 GGGTGGTNGNGGLGGGGGNGGAGGA 1814
G GG +GG G GGN A A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 33.1 bits (75), Expect = 0.012
Identities = 29/88 (32%), Positives = 34/88 (38%), Gaps = 4/88 (4%)

Query: 656 GAIGTGGTGGAVGSVGNAGIGGTGGTGGVGGAGGAGAAAAAGSSATGGAGFAGGAGGEGG 715
G G G GA + GN GG G+G GGA + S G +G GG
Sbjct: 3 GGDGRGHNTGAHSTSGNI----NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGG 58

Query: 716 AGGNSGVGGTNGSGGAGGAGGKGGTGGA 743
G+ GG SGG G GG A
Sbjct: 59 GSGHGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 33.1 bits (75), Expect = 0.013
Identities = 32/82 (39%), Positives = 38/82 (46%), Gaps = 2/82 (2%)

Query: 1821 SGTEGTGGDGGDAGAGGN-GGSATGVGNGGNGGDGGNGGDGGNGAPGGFGGGAGAGGLGG 1879
SG +G G + G GN G TG+G GG DG N GG G G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1880 SGAGGGTDGDDGNGGSPGTDGS 1901
G GGG +G+ G G G + S
Sbjct: 62 HGNGGG-NGNSGGGSGTGGNLS 82



Score = 32.8 bits (74), Expect = 0.015
Identities = 27/80 (33%), Positives = 32/80 (40%)

Query: 1807 GNGGAGGAGGTPTGSGTEGTGGDGGDAGAGGNGGSATGVGNGGNGGDGGNGGDGGNGAPG 1866
G G G G + SG G G G G + GS N GG G+G G G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1867 GFGGGAGAGGLGGSGAGGGT 1886
G GGG G G G G +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLS 82



Score = 32.8 bits (74), Expect = 0.016
Identities = 37/106 (34%), Positives = 43/106 (40%), Gaps = 4/106 (3%)

Query: 569 AGGVGADNPTGIGGTGGTGGKGGAGGAGGQGGSSGAG--GTNGSGGAGGTGGQGGAGGAG 626
+GG G + TG T G G G G G S G+G N G G G GG+G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 627 GAGADNPTGIGGAGGTGGTGGAAGAGGAGG--AIGTGGTGGAVGSV 670
GG GTGG A A A G A+ T G GG S+
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSI 107



Score = 32.4 bits (73), Expect = 0.018
Identities = 25/86 (29%), Positives = 33/86 (38%)

Query: 1738 IDGGQGGAGGHGGQGGKGGLNSTGLASAASGDGGNGGAGGAGGNGGAGGLGGGGGTGGTN 1797
+ GG G G G +N G +G + N GG G G GG +
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 1798 GNGGLGGGGGNGGAGGAGGTPTGSGT 1823
G+G GG G +GG G GG +
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 32.0 bits (72), Expect = 0.024
Identities = 36/117 (30%), Positives = 46/117 (39%), Gaps = 2/117 (1%)

Query: 556 GQGGTGGAGGAGGAGGVGADNPTGIGGTGG-TGGKGGAGGAGGQGGSSGAGGTNGSGGAG 614
G G G GA G PTG+G GG + G G + GG SG+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 615 GTGGQGGAGGAGGAGADNPTGIGGAGGTGGTGGAA-GAGGAGGAIGTGGTGGAVGSV 670
G GG G G G N + + G + GAGG +I G A+ +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADI 119



Score = 32.0 bits (72), Expect = 0.025
Identities = 36/116 (31%), Positives = 46/116 (39%), Gaps = 3/116 (2%)

Query: 1416 GGDGGAGDSGALSQDGNGFAGGQGGQGGVGGNAGAGGINGAGGTGGTGGAGGDGQNGTTG 1475
GGDG ++GA S GN G G G G + G+G + GG G+G G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1476 VASEGGAGGQGGDGGQGGIGGAGGNAGFG---AGVPGDGGIGGTGGAGGAGGAGAD 1528
G GG G G + FG PG GG+ + AG A AD
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 32.0 bits (72), Expect = 0.025
Identities = 36/116 (31%), Positives = 46/116 (39%), Gaps = 3/116 (2%)

Query: 1621 GGDGGAGDSGALSQDGNGFAGGQGGQGGVGGNAGAGGINGAGGTGGTGGAGGDGQNGTTG 1680
GGDG ++GA S GN G G G G + G+G + GG G+G G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1681 VASEGGAGGQGGDGGQGGIGGAGGNAGFG---AGVPGDGGIGGTGGAGGAGGAGAD 1733
G GG G G + FG PG GG+ + AG A AD
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 32.0 bits (72), Expect = 0.029
Identities = 36/107 (33%), Positives = 45/107 (42%), Gaps = 4/107 (3%)

Query: 970 TGGDGGDAGSGGGGGFGGAAGKAGGGGNGGRGGDGGDGASGLGLGLSGFDGGQGGQGGAG 1029
+GGDG +G G G G G GG DG SG + + GG G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDG----SGWSSENNPWGGGSGSGIHWG 57

Query: 1030 GSAGAGGINGAGGAGGNGGDGGDGATGAAGLGDNGGVGGDGGAGGAA 1076
G +G G G G +GG G GG+ + AA + GAGG A
Sbjct: 58 GGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.0 bits (72), Expect = 0.030
Identities = 29/87 (33%), Positives = 34/87 (39%), Gaps = 3/87 (3%)

Query: 483 GAAGTGGTGGVVGAAGKAGIGGTGGQGGAGGAGSAGTDATATGATGGTGFSGGAGGAGGA 542
G G G G +G G TG G G + +G + GG+G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 543 GGNTGVGGTNGSGGQGGTGGAGGAGGA 569
G GG SGG GTGG A A
Sbjct: 63 GNG---GGNGNSGGGSGTGGNLSAVAA 86



Score = 31.6 bits (71), Expect = 0.039
Identities = 42/117 (35%), Positives = 50/117 (42%), Gaps = 5/117 (4%)

Query: 873 GAAGNGGNAGVGLTAKAGDGGAAGNGGNGGAG-GAGGAGDNNFNGGQGGAGGQGGQGGLG 931
G G G N G T+ +GG G G GGA G+G + +NN GG G+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 932 GASTTSINANGGAGGNGGTGGKGGAGGAGTLGVGGSGGTGGDGGDAGSGGGGGFGGA 988
G NG +GG GTGG A A + T G GG A S G A
Sbjct: 63 GNG----GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 31.6 bits (71), Expect = 0.039
Identities = 42/117 (35%), Positives = 50/117 (42%), Gaps = 5/117 (4%)

Query: 1074 GAAGNGGNAGVGLTAKAGDGGAAGNGGNGGAG-GAGGAGDNNFNGGQGGAGGQGGQGGLG 1132
G G G N G T+ +GG G G GGA G+G + +NN GG G+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1133 GASTTSINANGGAGGNGGTGGKGGAGGAGTLGVGGSGGTGGDGGDAGSGGGGGFGGA 1189
G NG +GG GTGG A A + T G GG A S G A
Sbjct: 63 GNG----GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv3511    cloacin  40     2e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 40.1 bits (93), Expect = 2e-05
Identities = 31/89 (34%), Positives = 33/89 (37%)

Query: 270 GAGGTGGNAAWLLGGGGTGGAGGIGGGNGGHGGNGGWLLGNGGNGGLGGDGDGGTGGGHG 329
G G G N G G G GG GW N GG G G GG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 330 GNGGNPGWLLGTAGGGGNGGAGSTGTAGG 358
GNGG G G +G GGN A + A G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 37.0 bits (85), Expect = 2e-04
Identities = 31/79 (39%), Positives = 36/79 (45%), Gaps = 1/79 (1%)

Query: 237 GAGGIGGGSGGAGGNGGWLLGNGGIGGAGGTGGGAGGTGGNAAWLLGGG-GTGGAGGIGG 295
G G G +G +G G G+G GG G+G + N W G G G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 296 GNGGHGGNGGWLLGNGGNG 314
GNGG GN G G GGN
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 37.0 bits (85), Expect = 3e-04
Identities = 35/115 (30%), Positives = 46/115 (40%), Gaps = 2/115 (1%)

Query: 583 AGAGGTGFTQGADGNAGNGGDGGVGGNGGNGADNTTTAAAGTTGGAGGAGGAGGTGGAAG 642
+G G G GA +GN G G G GA + + ++ GG+G GG +G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 643 TGTGGQQGNGGNGGNGGTGGKGGTGGAGMNSLDPLLAAQDGGQGGTGGTGGNAGA 697
G GG GN G G GTGG A + P L+ G + G A
Sbjct: 62 HGNGGGNGNSGGG--SGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSA 114



Score = 36.6 bits (84), Expect = 3e-04
Identities = 28/73 (38%), Positives = 33/73 (45%), Gaps = 5/73 (6%)

Query: 233 GGHGGAGGIGGGSGGAGGNGGWLLGNGGIGGAGGTGGGAGGTGGNAAWLLGGGGTGGAGG 292
G H +G I GG G G GG + G G + GG+G W GGG G GG
Sbjct: 12 GAHSTSGNINGGPTGLGVGGG---ASDGSGWSSENNPWGGGSGSGIHW--GGGSGHGNGG 66

Query: 293 IGGGNGGHGGNGG 305
G +GG G GG
Sbjct: 67 GNGNSGGGSGTGG 79



Score = 36.2 bits (83), Expect = 4e-04
Identities = 28/84 (33%), Positives = 31/84 (36%), Gaps = 5/84 (5%)

Query: 244 GSGGAGGNGGWLLGNGGIGGAGGTGGGAGGTGGNAAWLLGGGGTGGAGGIGGGNGGHGGN 303
G G G N G +G I G G GG + W GG G G GG G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 304 GGWLLGNGGNGGLGGDGDGGTGGG 327
GNGG G G G G G
Sbjct: 63 -----GNGGGNGNSGGGSGTGGNL 81



Score = 35.8 bits (82), Expect = 6e-04
Identities = 35/106 (33%), Positives = 43/106 (40%), Gaps = 6/106 (5%)

Query: 378 GAGAGGHGGTGGAGGAGVNGGGAGGAGGAGGNGGAGGQAALLFGRGGTGGAGGYGGDGGG 437
GA + GG G GV GG + G+G + N GG G G GG G G G
Sbjct: 12 GAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGG------GSGSGIHWGGGSGHGNG 65

Query: 438 GGDGFDGTMAGLGGTGGSGGTGGDGGAPGNGGAGGAGQLLSHSGVA 483
GG+G G +G GG + G P G G +S S A
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 35.5 bits (81), Expect = 7e-04
Identities = 29/100 (29%), Positives = 35/100 (35%)

Query: 486 SGKGGAGGTGGNGGAGSAGADAPAGSGAMGSTGFAGGAGGDGGNGGGSGASQGNGGNGGN 545
S G G G G +D S G G+G G G G G GNG +GG
Sbjct: 15 STSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGG 74

Query: 546 GGTGGKGGTGGAGMNSLDPLLAAQDGGQGGTGGTGGNAGA 585
GTGG A + P L+ G + G A
Sbjct: 75 SGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSA 114



Score = 35.5 bits (81), Expect = 8e-04
Identities = 36/104 (34%), Positives = 43/104 (41%), Gaps = 3/104 (2%)

Query: 193 AGGIGGGTGGAGGHAWLFGHGGTGGIGGGPGGN--GGWLLGNGGHGGAGGIGGGSGGAGG 250
+GG G G +GG G+G G G + GW N GG G G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 251 NGGWLLGNGGIGGAGGTGGGAGGTGGNAAWLLGGGGTGGAGGIG 294
+G GNG GG GTGG A+ T GAGG+
Sbjct: 62 HGNG-GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 35.1 bits (80), Expect = 0.001
Identities = 30/81 (37%), Positives = 35/81 (43%), Gaps = 7/81 (8%)

Query: 289 GAGGIGGGNGGHGGNGGWLLGNGGNGGLGGDGDGGTGGG----HGGNGGNPGWLLGTAGG 344
G G G G H +G NGG GLG G G G + GG G + GG
Sbjct: 3 GGDGRGHNTGAHSTSGNI---NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 345 GGNGGAGSTGTAGGGSGGTGG 365
G+G G G +GGGSG G
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGN 80



Score = 33.9 bits (77), Expect = 0.002
Identities = 35/104 (33%), Positives = 42/104 (40%), Gaps = 9/104 (8%)

Query: 176 GNGGNAGWLYGRGGV-GGAGGIGGGTGGAGGHAWLFGH---GGTGGIGGGPGGNGGWLLG 231
G G N G G + GG G+G G G + G W + GG G G GG G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG---- 61

Query: 232 NGGHGGAGGIGGGSGGAGGNGGWLLGNGGIGGAGGTGGGAGGTG 275
G+GG G GG G GGN + G + GAGG
Sbjct: 62 -HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 33.9 bits (77), Expect = 0.003
Identities = 28/95 (29%), Positives = 35/95 (36%)

Query: 310 NGGNGGLGGDGDGGTGGGHGGNGGNPGWLLGTAGGGGNGGAGSTGTAGGGSGGTGGDGGT 369
N G G+ +GG G G G + G + GG+GS GGGSG G G
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69

Query: 370 GGRGGLLMGAGAGGHGGTGGAGGAGVNGGGAGGAG 404
GG G G ++ GAGG
Sbjct: 70 NSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 33.5 bits (76), Expect = 0.003
Identities = 28/85 (32%), Positives = 32/85 (37%), Gaps = 6/85 (7%)

Query: 490 GAGGTGGNGGAGSAGADAPAGSGAMGSTGFAGGAGGDG------GNGGGSGASQGNGGNG 543
G G G N GA S + G +G G A G G G GSG G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 544 GNGGTGGKGGTGGAGMNSLDPLLAA 568
GNGG G G G +L + A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAP 87



Score = 33.5 bits (76), Expect = 0.003
Identities = 33/134 (24%), Positives = 45/134 (33%), Gaps = 13/134 (9%)

Query: 376 LMGAGAGGHGGTGGAGGAGVNGGGAGGAGGAGGNGGAGGQAALLFGRGGTGGAGGYGGDG 435
+ G GH + +NGG G G G + G+G + GG+G +GG
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 436 GGGGDGFDGTMAGLGGTGGSGGTGGDGGAPGNGGAGGAGQLLSHSGVAGASGKGGAGGTG 495
G G GG G G G G + A + G GG +
Sbjct: 61 GHGN-------------GGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSI 107

Query: 496 GNGGAGSAGADAPA 509
G +A AD A
Sbjct: 108 SAGALSAAIADIMA 121



Score = 32.8 bits (74), Expect = 0.005
Identities = 34/114 (29%), Positives = 39/114 (34%), Gaps = 14/114 (12%)

Query: 527 GGNGGGSGASQGNGGNGGNGGTGGKGGTGGAGMNSLDPLLAAQDGGQGGTGGTGGNAGAG 586
GG+G G + NGG G G GGA G G + G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGA------------SDGSGWSSENNPWGGGS 50

Query: 587 GTGFTQGADGNAGNGGDGGVGGNGGNGADNTTTAAAGTTGG--AGGAGGAGGTG 638
G+G G GNGG G G G N + AA G A GAGG
Sbjct: 51 GSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.0 bits (72), Expect = 0.009
Identities = 32/106 (30%), Positives = 36/106 (33%), Gaps = 10/106 (9%)

Query: 421 GRGGTGGAGGYGGDGGGGGDGFDGTMAGLGGTGGSGGTGGDGGAPGNGGAGGAGQLLSHS 480
GRG GA G+ GG G G G + G G + N GG H
Sbjct: 6 GRGHNTGAHSTSGNINGGPTG---------LGVGGGASDGSGWSSENNPWGGGSGSGIHW 56

Query: 481 GVAGASGKGGAGGTGGNGGAGSAGADAPAGSGAMGSTGFAG-GAGG 525
G G GG G G G A A A G + GAGG
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 31.6 bits (71), Expect = 0.013
Identities = 33/101 (32%), Positives = 35/101 (34%), Gaps = 2/101 (1%)

Query: 219 GGGPGGNGGWLLGNGGHGGAGGIGGGSGGAGGNGGWLLGNGGIGGAGGTGGGAGGTGG-- 276
G G G N G +G G G GGA GW N GG G+G GG G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 277 NAAWLLGGGGTGGAGGIGGGNGGHGGNGGWLLGNGGNGGLG 317
N GG G GG G L G GGL
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 31.2 bits (70), Expect = 0.016
Identities = 27/96 (28%), Positives = 33/96 (34%), Gaps = 8/96 (8%)

Query: 352 STGTAGGGSGGTGGDGGTGGRGGLLMGAGAGGHGGTGGAGGAGVNGGGAGGAGGAGGNGG 411
S G G + G G G +G G G G+G + GGG+G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 412 AGGQAALLFGRGGTGGAGGYGGDGGGGGDGFDGTMA 447
G GG G G G GG +A
Sbjct: 62 HGN--------GGGNGNSGGGSGTGGNLSAVAAPVA 89



Score = 30.8 bits (69), Expect = 0.021
Identities = 22/77 (28%), Positives = 29/77 (37%)

Query: 572 GQGGTGGTGGNAGAGGTGFTQGADGNAGNGGDGGVGGNGGNGADNTTTAAAGTTGGAGGA 631
G+G G +G G T G + G G N G + + G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 632 GGAGGTGGAAGTGTGGQ 648
GG G +GG +GTG
Sbjct: 66 GGNGNSGGGSGTGGNLS 82



Score = 30.5 bits (68), Expect = 0.027
Identities = 27/101 (26%), Positives = 37/101 (36%), Gaps = 17/101 (16%)

Query: 494 TGGNGGAGSAGADAPAGSGAMGSTGFAGGAGGDGGNGGGSGASQGNGGNGGNGGTGGKGG 553
+GG+G + GA + +G+ G TG G G G+G S + GG+G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 554 TGGAGMNSLDPLLAAQDGGQGGTGGTGGNAGAGGTGFTQGA 594
G GG G G G + A
Sbjct: 62 HGN-----------------GGGNGNSGGGSGTGGNLSAVA 85



Score = 30.5 bits (68), Expect = 0.028
Identities = 30/90 (33%), Positives = 39/90 (43%), Gaps = 13/90 (14%)

Query: 345 GGNGGAGSTGTAGGGSGGTGGDGGTGGRGGLLMGAGAGG-HGGTGGAGGAGVNGGGAGGA 403
GG+G +TG GG G G GG G+G + GG G+G++ GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 404 GGAGGNGGAGGQAALLFGRGGTGGAGGYGG 433
G GGNG +GG G+G G
Sbjct: 63 GNGGGNGNSGG------------GSGTGGN 80



Score = 30.5 bits (68), Expect = 0.029
Identities = 27/79 (34%), Positives = 29/79 (36%), Gaps = 1/79 (1%)

Query: 148 GQAGGAGGSAGLLGNGGSGGAGGTGAPGGNGGNAGWLYGRGGVGGAGGIGGGTGGAGGHA 207
G G A +GG G G GG +GW GG G G GG GH
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 208 WLFGHGGTGGIGGGPGGNG 226
G G G G G GGN
Sbjct: 64 -NGGGNGNSGGGSGTGGNL 81



Score = 29.7 bits (66), Expect = 0.048
Identities = 34/110 (30%), Positives = 38/110 (34%), Gaps = 6/110 (5%)

Query: 402 GAGGAGGNGGAGGQAALLFGRGGTGGAGGYGGDGGGGGDGFDGTMAGLGGTGGSGGTGGD 461
G G G N GA + G G G GGG DG + GGSG
Sbjct: 3 GGDGRGHNTGAHSTS------GNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHW 56

Query: 462 GGAPGNGGAGGAGQLLSHSGVAGASGKGGAGGTGGNGGAGSAGADAPAGS 511
GG G+G GG G SG G A G + GA A S
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVS 106



Score = 29.7 bits (66), Expect = 0.049
Identities = 27/83 (32%), Positives = 32/83 (38%), Gaps = 12/83 (14%)

Query: 619 TAAAGTTGGAGGAGGAGGTGGAAGTGTGGQQGNGGNGGNGGTGGKGGTGGAGMNSLDPLL 678
T A T+G G G GG A G+G N GG G+G G G
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH-------- 62

Query: 679 AAQDGGQGGTGGTGGNAGAGGTG 701
G GG G +GG +G GG
Sbjct: 63 ----GNGGGNGNSGGGSGTGGNL 81


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv3512    cloacin  37     5e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 36.6 bits (84), Expect = 5e-04
Identities = 38/104 (36%), Positives = 44/104 (42%), Gaps = 2/104 (1%)

Query: 444 GQGGAGGTGGAGAASSATNGGSGGAGGTGGDGGSGGAGGTGGAGGTGGAAGDGGQGGQGG 503
G G G GA + S NGG G G GG S G+G + GG +G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGG--ASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 504 AGGGAGGQGGAGGAGGTGGNGGNITGGTAGTAGAAGNGGAAGKG 547
G GG G +GG GTGGN + A A GA G
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 36.2 bits (83), Expect = 8e-04
Identities = 37/111 (33%), Positives = 47/111 (42%), Gaps = 5/111 (4%)

Query: 633 GNGTGGAGGNGGGGANGGAGGAGGSGGGTGGNGGAGGDAGDAGNGGNGNGTGNGGNGGNG 692
G G + G NGG G G GG + G+G + + G G+G G G GNG
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 693 GIAGMGGNGGAGTGSGNGGNGGSGGNGGNAGMGGNSGTGSGDGGAGGNGGA 743
GGNG +G GSG GGN + G S G+G + GA
Sbjct: 66 -----GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 36.2 bits (83), Expect = 8e-04
Identities = 26/73 (35%), Positives = 31/73 (42%)

Query: 588 GSGGQGGNAGGGGAGGQGGADGGSGGDGGDAGTGGNGGNGGNRNSGNGTGGAGGNGGGGA 647
G + G GG G G G G + N GG SG GG G+G GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 648 NGGAGGAGGSGGG 660
NG +GG G+GG
Sbjct: 68 NGNSGGGSGTGGN 80



Score = 35.8 bits (82), Expect = 0.001
Identities = 29/78 (37%), Positives = 32/78 (41%)

Query: 948 GAAGNGGNGGNAGAGGNGNGGTGGAGGIGGTGGNGGDAEPGVPPGAGGAGGAGTTGGKGG 1007
G G G N G GN NGG G G GG G + P G G G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1008 TGGNGSGTGSGGTGGDGG 1025
G G+G GG+G G
Sbjct: 63 GNGGGNGNSGGGSGTGGN 80



Score = 34.7 bits (79), Expect = 0.002
Identities = 25/74 (33%), Positives = 32/74 (43%)

Query: 678 GNGNGTGNGGNGGNGGIAGMGGNGGAGTGSGNGGNGGSGGNGGNAGMGGNSGTGSGDGGA 737
G+ G + NGG G+G GGA GSG G G +G+ G+G G+GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 738 GGNGGAAGTGGTGG 751
GN G G
Sbjct: 68 NGNSGGGSGTGGNL 81



Score = 34.7 bits (79), Expect = 0.002
Identities = 30/78 (38%), Positives = 36/78 (46%)

Query: 682 GTGNGGNGGNGGIAGMGGNGGAGTGSGNGGNGGSGGNGGNAGMGGNSGTGSGDGGAGGNG 741
G G G N G +G G G G G G + GSG + N GG SG+G GG G+G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 742 GAAGTGGTGGDGGLTGTG 759
G G +GG G G
Sbjct: 64 NGGGNGNSGGGSGTGGNL 81



Score = 34.7 bits (79), Expect = 0.002
Identities = 30/84 (35%), Positives = 35/84 (41%)

Query: 732 SGDGGAGGNGGAAGTGGTGGDGGLTGTGGTGGSGGTGGDGGNGGNGADNTANMTAQAGGD 791
SG G G N GA T G G G G S G+G N G + + + G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 792 GGNGGDGGFGGGAGAGGGGLTAGA 815
GNGG G GG GG L+A A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVA 85



Score = 34.3 bits (78), Expect = 0.003
Identities = 35/87 (40%), Positives = 42/87 (48%), Gaps = 3/87 (3%)

Query: 658 GGGTGGNGGAGGDAGDAGNGGNGNGTGNGGNGGNGGIAGMGGNGGAGTGSGNGGNGGSGG 717
G G G N GA +G+ G G G G G + G+G + GG GSG+G + G G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGG---GSGSGIHWGGGS 60

Query: 718 NGGNAGMGGNSGTGSGDGGAGGNGGAA 744
GN G GNSG GSG GG A
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAP 87



Score = 33.9 bits (77), Expect = 0.004
Identities = 31/97 (31%), Positives = 37/97 (38%)

Query: 845 GNGGTGGNGGTGGTGGAGIGSLGGGTGGDGGNGGNGGTGGEGGEVGGAGGTGGAAGNGGD 904
G G G T G G LG G G G+G + GG G GG +G+G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 905 GGTGGTGGGDGGAGGTGGTGGTGGLGDPRVGGSGGDG 941
GG G +GGG G G G P + G G
Sbjct: 66 GGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 33.5 bits (76), Expect = 0.005
Identities = 29/80 (36%), Positives = 32/80 (40%)

Query: 420 SGGAGGAAGAGGAGGGANGTAGNGGQGGAGGTGGAGAASSATNGGSGGAGGTGGDGGSGG 479
SGG G G N G G G GG SS N GG+G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 480 AGGTGGAGGTGGAAGDGGQG 499
G GG G +GG +G GG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 33.5 bits (76), Expect = 0.005
Identities = 24/75 (32%), Positives = 32/75 (42%)

Query: 1004 GKGGTGGNGSGTGSGGTGGDGGTGGGGGNGGTGWNGGKGDTGSGGGAGDGGKAPAGGTGG 1063
G+G G S +G+ G G GGG + G+GW+ G G G+G +G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 1064 AGGDGGAGGKGGSGG 1078
G GG G G
Sbjct: 66 GGNGNSGGGSGTGGN 80



Score = 33.1 bits (75), Expect = 0.006
Identities = 25/84 (29%), Positives = 30/84 (35%)

Query: 600 GAGGQGGADGGSGGDGGDAGTGGNGGNGGNRNSGNGTGGAGGNGGGGANGGAGGAGGSGG 659
G G+G G G G G GG + G+G GGG+ G GGSG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 660 GTGGNGGAGGDAGDAGNGGNGNGT 683
G GG G G G +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 32.8 bits (74), Expect = 0.008
Identities = 35/109 (32%), Positives = 41/109 (37%), Gaps = 1/109 (0%)

Query: 410 GNGGQGGQGGSGGAGGAAGAGGAGGGANGTAGNG-GQGGAGGTGGAGAASSATNGGSGGA 468
G G+G G+ G G G G G A +G G G G+ S GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 469 GGTGGDGGSGGAGGTGGAGGTGGAAGDGGQGGQGGAGGGAGGQGGAGGA 517
G GG+G SGG GTGG A G G G + GA
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 32.8 bits (74), Expect = 0.008
Identities = 35/101 (34%), Positives = 40/101 (39%), Gaps = 2/101 (1%)

Query: 565 AGGDGGAGGTGGDRTVGG--GTVPAGSGGQGGNAGGGGAGGQGGADGGSGGDGGDAGTGG 622
+GGDG TG T G G G G + G G + GGSG G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 623 NGGNGGNRNSGNGTGGAGGNGGGGANGGAGGAGGSGGGTGG 663
+G GGN NSG G+G G A G S G GG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 32.8 bits (74), Expect = 0.008
Identities = 28/86 (32%), Positives = 37/86 (43%), Gaps = 2/86 (2%)

Query: 589 SGGQGGNAGGGGAGGQGGADGGSGGDGGDAGTGGNGGNGGNRNSGNGTGGAGGNGGGGAN 648
SGG G G G +GG G G G G + G+G + + GG+G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGV--GGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 649 GGAGGAGGSGGGTGGNGGAGGDAGDA 674
G G GG+G GG+G G + A
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVA 85



Score = 32.4 bits (73), Expect = 0.010
Identities = 27/84 (32%), Positives = 30/84 (35%)

Query: 878 GNGGTGGEGGEVGGAGGTGGAAGNGGDGGTGGTGGGDGGAGGTGGTGGTGGLGDPRVGGS 937
G G G G +G G G GG G G G G G+ G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 938 GGDGGTGGSGGAAGNGGNGGNAGA 961
G GG G SGG +G GGN A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 32.4 bits (73), Expect = 0.012
Identities = 28/70 (40%), Positives = 29/70 (41%)

Query: 844 GGNGGTGGNGGTGGTGGAGIGSLGGGTGGDGGNGGNGGTGGEGGEVGGAGGTGGAAGNGG 903
G T GN G TG G G+G N GG G G GG G G GNG
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGN 70

Query: 904 DGGTGGTGGG 913
GG GTGG
Sbjct: 71 SGGGSGTGGN 80



Score = 32.0 bits (72), Expect = 0.014
Identities = 32/102 (31%), Positives = 41/102 (40%)

Query: 120 GGKGGNGGIGAAGTTGPVGTGASGGTGGSGGAGGTGGDGGAANGGTAGAGGAGGNGGKGG 179
GG G GA T+G + G +G G G + G+G G G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 180 DGGAGVTSSTAGNSGGAGGSGGKGGDAGAGGAGATPGANGIA 221
G G +S G+ G S A A +TPGA G+A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.0 bits (72), Expect = 0.015
Identities = 27/83 (32%), Positives = 32/83 (38%), Gaps = 1/83 (1%)

Query: 663 GNGGAGGDAGDAGNGGNGNGTGNGGNGGNGGIAGMG-GNGGAGTGSGNGGNGGSGGNGGN 721
G G G + G GN NG G G G G G + G G+G GG G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 722 AGMGGNSGTGSGDGGAGGNGGAA 744
GGN +G G G G A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVA 85



Score = 31.6 bits (71), Expect = 0.017
Identities = 26/80 (32%), Positives = 33/80 (41%)

Query: 242 GAGDGGHGGTGAAGGNGGTGGAGGSGIDGVGGGTGGTGGNGGNGAIGGAGGDAGGSGNSG 301
G G G + G + GN G G G G+G + N G G+G GG G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 302 GNGGIGGKGGNAGAGGAAGS 321
GG G GG +G GG +
Sbjct: 64 NGGGNGNSGGGSGTGGNLSA 83



Score = 31.6 bits (71), Expect = 0.021
Identities = 38/115 (33%), Positives = 46/115 (40%), Gaps = 3/115 (2%)

Query: 700 NGGAGTGSGNGGNGGSGG-NGGNAGMGGNSGTGSGDGGAGGNGGAAGTGGTGGDGGLTGT 758
+GG G G G + SG NGG G+G G G G + N G G+G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWG--GG 59

Query: 759 GGTGGSGGTGGDGGNGGNGADNTANMTAQAGGDGGNGGDGGFGGGAGAGGGGLTA 813
G G GG G GG G G + +A A G G G G L+A
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSA 114



Score = 31.6 bits (71), Expect = 0.022
Identities = 25/84 (29%), Positives = 30/84 (35%)

Query: 809 GGLTAGANGTGGQGGAGGDGGNGAIGGHGPLTDDPGGNGGTGGNGGTGGTGGAGIGSLGG 868
GG G N +GG +G G +D G + GG G+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 869 GTGGDGGNGGNGGTGGEGGEVGGA 892
G GG GN G G G A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 30.8 bits (69), Expect = 0.031
Identities = 35/114 (30%), Positives = 45/114 (39%), Gaps = 1/114 (0%)

Query: 19 NGGNGADNTTTAAAGTTGGAGGAGGAGGTGGTGGAAGTGTGGQQGNGGNGGNGGTGGKGG 78
+GG+G + T A + + GG G G GG +G + GG+G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 79 TGGDGALAGSSGGAGGKGGNGGDAGKAGTG-SAPGTAGTGGDGGKGGNGGIGAA 131
G G S GG+G G A G A T G GG G + AA
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115



Score = 30.8 bits (69), Expect = 0.034
Identities = 27/81 (33%), Positives = 32/81 (39%)

Query: 898 AAGNGGDGGTGGTGGGDGGAGGTGGTGGTGGLGDPRVGGSGGDGGTGGSGGAAGNGGNGG 957
+ G+G TG GG G G GG D S + GGSG GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 958 NAGAGGNGNGGTGGAGGIGGT 978
+ GGNGN G G G +
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLS 82



Score = 30.8 bits (69), Expect = 0.034
Identities = 28/102 (27%), Positives = 35/102 (34%)

Query: 567 GDGGAGGTGGDRTVGGGTVPAGSGGQGGNAGGGGAGGQGGADGGSGGDGGDAGTGGNGGN 626
G G G G + G +G G G+G + GG G GG G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 627 GGNRNSGNGTGGAGGNGGGGANGGAGGAGGSGGGTGGNGGAG 668
G +GN GG+G G A G T G GG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 30.8 bits (69), Expect = 0.034
Identities = 32/100 (32%), Positives = 35/100 (35%)

Query: 485 GAGGTGGAAGDGGQGGQGGAGGGAGGQGGAGGAGGTGGNGGNITGGTAGTAGAAGNGGAA 544
G G G G G G G GG G + N GG +G+ G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 545 GKGGAGGQGGTGGGTGGQGGAGGDGGAGGTGGDRTVGGGT 584
G GG G G G GTGG A A G T G G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 30.5 bits (68), Expect = 0.040
Identities = 30/85 (35%), Positives = 38/85 (44%), Gaps = 2/85 (2%)

Query: 305 GIGGKGGNAGAGGAAGSNGGTVGANGTGGDGGNGGAAGAATAGSNGGAGTGSAGGNGGTG 364
G G+G N GA +G+ G G G G GG +G ++ + G G+GS GG
Sbjct: 3 GGDGRGHNTGAHSTSGNING--GPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 365 GRGGSGGAGGDGIGGVGGGKGGNGA 389
G G GG G G G GG A
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVA 85



Score = 30.5 bits (68), Expect = 0.043
Identities = 29/80 (36%), Positives = 35/80 (43%), Gaps = 2/80 (2%)

Query: 801 GGGAGAGGGGLTAGANGTGGQGGAGGDGGNGAIGGHGPLTDDPGGNGGTGGNGGTGGTGG 860
G G G G + N GG G G GG A G G +++ GG+G GG G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGG--ASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 861 AGIGSLGGGTGGDGGNGGNG 880
G G G +GG G GGN
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 30.5 bits (68), Expect = 0.044
Identities = 27/85 (31%), Positives = 35/85 (41%)

Query: 148 SGGAGGTGGDGGAANGGTAGAGGAGGNGGKGGDGGAGVTSSTAGNSGGAGGSGGKGGDAG 207
SGG G G + G G G G G G+G +S GG+G GG +G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 208 AGGAGATPGANGIAGNGGDGGDGAA 232
G G + G +G GG+ AA
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 30.5 bits (68), Expect = 0.048
Identities = 30/102 (29%), Positives = 39/102 (38%)

Query: 368 GSGGAGGDGIGGVGGGKGGNGADGEVGGAGGAGGSGPNTSPGGNGGQGGQGGSGGAGGAA 427
G G G + G G G G G + GSG ++ GG G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 428 GAGGAGGGANGTAGNGGQGGAGGTGGAGAASSATNGGSGGAG 469
G GG G + G +G GG A A + + G+GG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 30.5 bits (68), Expect = 0.048
Identities = 38/109 (34%), Positives = 43/109 (39%), Gaps = 11/109 (10%)

Query: 961 AGGNGNGGTGGAGGIGGTGGNGGDAEPGVPPGAGGAGGAGTTGGKGGTGGNGSGTGSGGT 1020
+GG+G G GA G G P G GG + G + N G GSG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGG--------PTGLGVGGGASDGSGWSSENNPWGGGSGSG 53

Query: 1021 GGDGGTGGGGGNGGTGWNGGKGDTGSGGGAGDGGKA---PAGGTGGAGG 1066
GG G G GG G +GG TG A A PA T GAGG
Sbjct: 54 IHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv3514    cloacin  37     7e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 37.0 bits (85), Expect = 7e-04
Identities = 33/102 (32%), Positives = 41/102 (40%)

Query: 479 GAAGTGGTGGMIGTTGNAGVGGAGGQGGDGGAGGAGADADQPGATGGTGFAGGAGGAGGA 538
G G G G T+GN G G G G + G+G ++ GG+G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 539 GGSSGAGGTNGSGGAGGTGGQGGAGGAGGAGADNPTGIGGTG 580
G G G + G G GG A A G A + G GG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 36.6 bits (84), Expect = 9e-04
Identities = 34/109 (31%), Positives = 40/109 (36%)

Query: 505 GGDGGAGGAGADADQPGATGGTGFAGGAGGAGGAGGSSGAGGTNGSGGAGGTGGQGGAGG 564
GGDG GA + GG G GGA G S G G G GG+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 565 AGGAGADNPTGIGGTGGDGGTGGAAGAGGAGGAAGTGGTGGMIGTTGNA 613
G G N G GTGG+ A A G + G G + + A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 36.2 bits (83), Expect = 0.001
Identities = 29/89 (32%), Positives = 35/89 (39%)

Query: 1045 GAGGKGGAGGSSGAGGTNGSGGAGGAGGQGGAGGAGISFSNGSNGGTGGTGGVGGTGGDG 1104
G G+G G+ G G G G G + G+G S N GG G+G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1105 GNAGTGAGDPGKGGTGGTGGTGGSGGAGG 1133
GN G G GTGG + A G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 36.2 bits (83), Expect = 0.001
Identities = 30/90 (33%), Positives = 38/90 (42%)

Query: 1244 AGGPGGKGGAGGNAGTGGTNGSGAGGAGGQGGAGGAGISFSNGSNGGTGGTGGVGGTGGD 1303
+GG G G ++ +G NG G G G + G+G S N GG G+G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1304 GGNAGTGAGDPGKGGTGGTGGTGGSGGAGG 1333
GN G G GTGG + A G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 35.8 bits (82), Expect = 0.001
Identities = 34/109 (31%), Positives = 40/109 (36%)

Query: 622 GGDGGAGGAGADADQPGATGGTGFAGGAGGAGKAGGSSSAGGTNSSGSAGGTGRQSGTGG 681
GGDG GA + GG G GGA G SS GS G G+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 682 AGGAGADNPTGIGGTGGDGGTGGAAGAGGAGGAAGTGGTGGMIGTTGNA 730
G G N G GTGG+ A A G + G G + + A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 35.8 bits (82), Expect = 0.001
Identities = 27/89 (30%), Positives = 34/89 (38%)

Query: 1378 GKGGTNGNGGSGGTGGTGGPGGSGGAPTGSGTGGKGGAGGDGGDGADGGAATGVGDGGDG 1437
G G N G+ T G G +G G + G G + + G G+ G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1438 GNGGNGGNGGTGVGSPGGLGGAGGTGGLG 1466
GNGG GN G G G+ G L G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 34.7 bits (79), Expect = 0.003
Identities = 36/101 (35%), Positives = 39/101 (38%), Gaps = 3/101 (2%)

Query: 1321 GTGGTGGSGGAGGSGGANFNGGTGGTGGTGGTGGKGGMGGIAG--DGGPGGDGGNAGVGG 1378
G G G + GA + G N NGG G G GG G GG G G G
Sbjct: 3 GGDGRGHNTGAHSTSG-NINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1379 KGGTNGNGGSGGTGGTGGPGGSGGAPTGSGTGGKGGAGGDG 1419
G GNG SGG GTGG + AP G G G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 34.3 bits (78), Expect = 0.004
Identities = 31/97 (31%), Positives = 37/97 (38%)

Query: 126 GGAGGAGGLLFGNGGAGAAGAPGQAGGPGGPAGLWGNGGPGGAGGSGGGTGGAGGAGGWL 185
GG G G + G+G + GG G WG G G GG G +GG G GG L
Sbjct: 22 GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNL 81

Query: 186 FGVGGAGGVGGAGGGTGGAGGPGGLIWGGGGAGGVGG 222
V G T GAGG I G + +
Sbjct: 82 SAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 34.3 bits (78), Expect = 0.004
Identities = 26/79 (32%), Positives = 34/79 (43%)

Query: 1218 IGGDGGQGGNGGQGDSGSGLGGQPGFAGGPGGKGGAGGNAGTGGTNGSGAGGAGGQGGAG 1277
+ G G+G N G + + G P G GG G + G G+G GG
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 1278 GAGISFSNGSNGGTGGTGG 1296
G G NG++GG GTGG
Sbjct: 61 GHGNGGGNGNSGGGSGTGG 79



Score = 33.9 bits (77), Expect = 0.005
Identities = 32/102 (31%), Positives = 40/102 (39%), Gaps = 1/102 (0%)

Query: 1070 AGGQGGAGGAGISFSNGSNGGTGGTGGVGGTGGDGGNAGTGAGDPGKGGTGGTGGTGGSG 1129
+GG G G ++G+ G GVGG DG + +P GG+G GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSS-ENNPWGGGSGSGIHWGGGS 60

Query: 1130 GAGGSGGANFNGGTGGTGGTGGTGGKGGMGGIAGDGGPGGDG 1171
G G GG +GG GTGG G PG G
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 33.9 bits (77), Expect = 0.005
Identities = 32/102 (31%), Positives = 40/102 (39%), Gaps = 1/102 (0%)

Query: 1270 AGGQGGAGGAGISFSNGSNGGTGGTGGVGGTGGDGGNAGTGAGDPGKGGTGGTGGTGGSG 1329
+GG G G ++G+ G GVGG DG + +P GG+G GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSS-ENNPWGGGSGSGIHWGGGS 60

Query: 1330 GAGGSGGANFNGGTGGTGGTGGTGGKGGMGGIAGDGGPGGDG 1371
G G GG +GG GTGG G PG G
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 33.9 bits (77), Expect = 0.005
Identities = 32/103 (31%), Positives = 39/103 (37%), Gaps = 1/103 (0%)

Query: 793 GAAGTGGTGGMIGTTGNAGVGGAGGQGGDGGAGGAGADADQPGATGGTGFAGGAGGAGGS 852
G G G G T+GN G G G G + G+G ++ GG+G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 853 GGSSCAGGTNGSGGAGGTCGQVVAGGAGISFSNGSNGGTGGTG 895
G G + G G GG VA F S G GG
Sbjct: 63 GNGGGNGNSGGGSGTGGN-LSAVAAPVAFGFPALSTPGAGGLA 104



Score = 33.9 bits (77), Expect = 0.005
Identities = 37/108 (34%), Positives = 42/108 (38%), Gaps = 5/108 (4%)

Query: 1100 TGGDGGNAGTGAGDPGKGGTGGTGGTGGSGGAGGSGGANFNGGTGGTGGTGGTGGKGGMG 1159
+GGDG TGA GG G G GGA G + G G G GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1160 GIAGDGGPGGDGGNAGVGGKGGTNGNGGSGGTGGTGGAGGNAGAGGLA 1207
G GG GN+G G G N + + A GAGGLA
Sbjct: 62 -----HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 33.5 bits (76), Expect = 0.007
Identities = 27/89 (30%), Positives = 32/89 (35%)

Query: 1369 GDGGNAGVGGKGGTNGNGGSGGTGGTGGPGGSGGAPTGSGTGGKGGAGGDGGDGADGGAA 1428
G G G T+GN G TG G G S G+ S GG G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 1429 TGVGDGGDGGNGGNGGNGGTGVGSPGGLG 1457
G G+ G G G + V +P G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 32.8 bits (74), Expect = 0.011
Identities = 28/102 (27%), Positives = 36/102 (35%)

Query: 596 GAAGTGGTGGMIGTTGNAGVGGAGGQGGDGGAGGAGADADQPGATGGTGFAGGAGGAGKA 655
G G G G T+GN G G G G + G+G ++ GG+G GG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 656 GGSSSAGGTNSSGSAGGTGRQSGTGGAGGAGADNPTGIGGTG 697
G G + GG A G A + G GG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 32.8 bits (74), Expect = 0.012
Identities = 36/119 (30%), Positives = 45/119 (37%), Gaps = 10/119 (8%)

Query: 713 GAAGTGGTGGMIGTTGNAGVGGAGGSSGAGGTNGSGGAGGTDGQGGAGGAGGAGADNPTG 772
G G G G T+GN G G G G ++GSG + + GG G+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSG--------- 53

Query: 773 IGGTGGDGGTGGAAGAGGAGGAAGTGGTGGMIGTTGNAGVGGAGGQGGDGGAGGAGADA 831
GG G G G G +GG +GTGG + G G G A A A
Sbjct: 54 -IHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 32.8 bits (74), Expect = 0.014
Identities = 34/101 (33%), Positives = 37/101 (36%), Gaps = 2/101 (1%)

Query: 391 GGDGGAGGAGADADQPGATGGTGFAGGAGGAGGAGGSSGAGGTNGSGGAGGQGGAGGAGG 450
GGDG GA + GG G GGA G S G G G GG+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 451 AGADNPTGIGGTGGDGGTGGAAGAGGAGG--AAGTGGTGGM 489
GG G GG A A A G A T G GG+
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGL 103



Score = 32.4 bits (73), Expect = 0.014
Identities = 33/112 (29%), Positives = 37/112 (33%), Gaps = 9/112 (8%)

Query: 1183 NGNGGSGGTGGTGGAGGNAGAGGLANTGGTAGNAGIGGDGGQGGNGGQGDSGSGLGGQPG 1242
+G G G G GN G G + G G GG SG GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1243 FAGGPGGKGGAGGNAGTGGTNGSGAGGAGGQG-------GAGGAGISFSNGS 1287
G GG G G G N S G GAGG +S S G+
Sbjct: 62 --HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 32.4 bits (73), Expect = 0.015
Identities = 40/128 (31%), Positives = 47/128 (36%), Gaps = 23/128 (17%)

Query: 965 SGTGGTGGTGGKGGTGGAGDDSAGGTGGTGGAGGNAGAGGLANTGGTAGNAGIGGDGGQG 1024
SG G G G T G + G G GGA +G N G +GI GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 1025 GNGGQGDSGSGLGGQPGFAGGAGGKGGAGGSSGAGGTNGSGGAGGAGG-----QGGAGGA 1079
G GG G +GG SG GG + A A G GAGG
Sbjct: 62 H------------------GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGL 103

Query: 1080 GISFSNGS 1087
+S S G+
Sbjct: 104 AVSISAGA 111



Score = 32.0 bits (72), Expect = 0.023
Identities = 30/101 (29%), Positives = 39/101 (38%)

Query: 363 GGAGGAAGQLFSASGAAGNAGVGGAGGQGGDGGAGGAGADADQPGATGGTGFAGGAGGAG 422
G G S SG G G G G+G + + G +G GG G G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 423 GAGGSSGAGGTNGSGGAGGQGGAGGAGGAGADNPTGIGGTG 463
GG+ +GG +G+GG A A G A + G GG
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 31.2 bits (70), Expect = 0.040
Identities = 25/74 (33%), Positives = 31/74 (41%)

Query: 877 GGAGISFSNGSNGGTGGTGGVGGTGGDGGNAGTGAGDPGKGGTGGTGGTGGSGGAGGSGG 936
G + S N G TG G G G+ + +P GG+G GG G G GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 937 ANFNGGTGGTGGTG 950
+GG GTGG
Sbjct: 68 NGNSGGGSGTGGNL 81



Score = 30.8 bits (69), Expect = 0.045
Identities = 26/82 (31%), Positives = 36/82 (43%)

Query: 856 SCAGGTNGSGGAGGTCGQVVAGGAGISFSNGSNGGTGGTGGVGGTGGDGGNAGTGAGDPG 915
S G + GA T G + G G+ G++ G+G + GG G+ G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 916 KGGTGGTGGTGGSGGAGGSGGA 937
G GG G +GG G GG+ A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSA 83


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits      Score  E-value  Comments
Rv3517    cdtoxina  30     0.007    Cytolethal distending toxin A signature.
		>cdtoxina#Cytolethal distending toxin A signature.

Length = 258

Score = 30.1 bits (67), Expect = 0.007
Identities = 8/17 (47%), Positives = 11/17 (64%)

Query: 37 GADLTAWSRAQAAWLWS 53
G+ LT WSR + LW+
Sbjct: 87 GSVLTMWSRGAGSSLWA 103


79  Rv3590c  Rv3600c  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv3590c   5       10       5.223410   PE-PGRS family protein PE_PGRS58
Rv3591c   0       10       1.525190   Possible hydrolase
Rv3592    0       12       1.410262   Possible heme degrading protein MhuD
Rv3593    1       10       0.238616   Probable conserved lipoprotein LpqF
Rv3594    2       11       0.468136   Conserved hypothetical protein
Rv3595c   2       10       0.015033   PE-PGRS family protein PE_PGRS59
Rv3596c   3       11       -2.460790  Probable ATP-dependent protease ATP-binding
Rv3597c   1       12       -2.369163  Iron-regulated H-NS-like protein Lsr2
Rv3598c   0       12       -1.344507  Lysyl-tRNA synthetase 1 LysS (lysine--tRNA
Rv3599c   -1      13       1.710105   Hypothetical short protein
Rv3600c   -2      13       1.842755   Conserved protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv3590c   cloacin  39     3e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 39.3 bits (91), Expect = 3e-05
Identities = 43/132 (32%), Positives = 50/132 (37%), Gaps = 2/132 (1%)

Query: 449 AGTGGVGGVGGAGGQGGLLFGDGGNGGAGGAGGIGGTGASGGAGGKGGSGLVGGDGGNGG 508
+G G G GA G + G G GG G +S GGSG GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 509 AGGAGGNGGKGGAGGAGGGAGMFSQPGVHG--AGGTGGQGGAGGAGGAGGAAGAGTVVAG 566
G GGNG GG G GG + P G A T G GG + AG + A +
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMA 121

Query: 567 NPGDPGGFGAAG 578
P FG G
Sbjct: 122 ALKGPFKFGLWG 133



Score = 38.2 bits (88), Expect = 1e-04
Identities = 32/91 (35%), Positives = 41/91 (45%)

Query: 136 GGPGGLLWGNGGNGGSGVAGVGGPGGSGGAAGLFGHGGNGGAGGSNAAGAGGVGGAGGAG 195
GGP GL G G + GSG + P G G +G+ GG+G G +GG G GG
Sbjct: 22 GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNL 81

Query: 196 WLVGNGGAGGFGGVGTTVSGNGGAGGAAGAF 226
V A GF + T +G +AGA
Sbjct: 82 SAVAAPVAFGFPALSTPGAGGLAVSISAGAL 112



Score = 35.5 bits (81), Expect = 5e-04
Identities = 31/84 (36%), Positives = 40/84 (47%), Gaps = 5/84 (5%)

Query: 314 AGGAGGLGGSGSDTSEGPVTGGNGGNGGDGGPGAPGGNGA---PGGIGVNTGTGWAYGGN 370
+GG G +G+ ++ G + GG G G GG G + P G G +G W GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHW--GGG 59

Query: 371 GGNGGDGGAGARGGDGGNGGNGLA 394
G+G GG G GG G GGN A
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSA 83



Score = 33.5 bits (76), Expect = 0.003
Identities = 30/97 (30%), Positives = 36/97 (37%), Gaps = 10/97 (10%)

Query: 395 LNGGNGIGGNGGAGGRGGTGAAGGNGGIGGGATGTLTFFGSGGDGGPGGAGANTAGTGGV 454
++GG+G G N GA G GG G+G G G G G + N GG
Sbjct: 1 MSGGDGRGHNTGAHSTSGN-INGGPTGLGV---------GGGASDGSGWSSENNPWGGGS 50

Query: 455 GGVGGAGGQGGLLFGDGGNGGAGGAGGIGGTGASGGA 491
G GG G G G GG+G G A
Sbjct: 51 GSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAP 87



Score = 32.8 bits (74), Expect = 0.004
Identities = 33/97 (34%), Positives = 37/97 (38%), Gaps = 3/97 (3%)

Query: 145 NGGNGGSGVAGVGGPGGSGGAAGLFGHGGNGGAGGSNAAGAGGVGGAGGAGWLVGNGGAG 204
N G + GGP G G G G G+G GG GNGG
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG-HGNGGGN 68

Query: 205 GFGGVGTTVSGNGGAGGAAGAFGNGGVG--GAGGAAV 239
G G G+ GN A A AFG + GAGG AV
Sbjct: 69 GNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAV 105



Score = 32.8 bits (74), Expect = 0.004
Identities = 28/81 (34%), Positives = 32/81 (39%), Gaps = 3/81 (3%)

Query: 383 GGDGGNGGNGLALNGGNGIGGNGGAGGRGGTGAAGGNGGIG---GGATGTLTFFGSGGDG 439
GGDG G GN GG G G GG G GG +G+ +G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 440 GPGGAGANTAGTGGVGGVGGA 460
G GG N+ G G GG A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSA 83



Score = 32.8 bits (74), Expect = 0.004
Identities = 25/76 (32%), Positives = 30/76 (39%)

Query: 291 GANNGAGSGGAGLPGNPGAVPGRAGGAGGLGGSGSDTSEGPVTGGNGGNGGDGGPGAPGG 350
G N GA S + G P + G + G G S + G +G GG G G GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 351 NGAPGGIGVNTGTGWA 366
NG GG G A
Sbjct: 68 NGNSGGGSGTGGNLSA 83



Score = 32.4 bits (73), Expect = 0.005
Identities = 32/128 (25%), Positives = 44/128 (34%), Gaps = 10/128 (7%)

Query: 335 GNGGNGGDGGPGAPGGN--GAPGGIGVNTGTGWAYGGNGGNGGDGGAGARGGDGGNGGNG 392
G G G + G + GN G P G+GV GG G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVG-------GGASDGSGWSSENNPWGGGSGSGIH 55

Query: 393 LALNGGNGIGGNGGAGGRGGTGAAGGNGGIGGGATGTLTFFGSGGDGGPGGAGANTAGTG 452
G+G GG G G GG+G G + + G GG + + A +
Sbjct: 56 WGGGSGHGNGGGNGNSG-GGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSA 114

Query: 453 GVGGVGGA 460
+ + A
Sbjct: 115 AIADIMAA 122



Score = 31.6 bits (71), Expect = 0.009
Identities = 29/102 (28%), Positives = 36/102 (35%)

Query: 174 NGGAGGSNAAGAGGVGGAGGAGWLVGNGGAGGFGGVGTTVSGNGGAGGAAGAFGNGGVGG 233
+GG G + GA G G G G G G + N GG+ GG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 234 AGGAAVIGGLPGNGGAGGNAGLIGAGGDGGVGGVGAPGTNGM 275
G G G G GGN + A G + PG G+
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGL 103



Score = 31.6 bits (71), Expect = 0.010
Identities = 31/102 (30%), Positives = 37/102 (36%), Gaps = 2/102 (1%)

Query: 483 GGTGASGGAGGKGGSGLVGGDGGNGGAGGAGGNGGKGGAGGAGGGAGMFSQPGVHGAGGT 542
GG G G SG + +GG G G GG G G S G+H GG+
Sbjct: 3 GGDGRGHNTGAHSTSGNI--NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 543 GGQGGAGGAGGAGGAAGAGTVVAGNPGDPGGFGAAGADGLPG 584
G G G GG+ G + A GF A G G
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 30.8 bits (69), Expect = 0.016
Identities = 37/108 (34%), Positives = 41/108 (37%), Gaps = 8/108 (7%)

Query: 160 GGSGGAAGLFGHGGNGGAGGSNAAGAGGVGGAGGAGWLVGNGGAGGFGGVGTTVSGN--- 216
GG G H +G G G G + G+GW N GG G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 217 ---GGAGGAAGAFGNGGVGGAGGAAVIGGLPG--NGGAGGNAGLIGAG 259
GG G + G G GG A A V G P GAGG A I AG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
Rv3593    BLACTAMASEA  39     3e-05    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 39.0 bits (91), Expect = 3e-05
Identities = 24/137 (17%), Positives = 46/137 (33%), Gaps = 3/137 (2%)

Query: 148 PSIASWRDVDAALSKTGARYSFQVAKVDNGRCDPVAGTNTGESLPLASIFKLYVLHALAG 207
S + + S+ R + ++D + E P+ S FK+ + A+
Sbjct: 21 ASPQPLEQIKLSESQLSGR--VGMIEMDLASGRTLTAWRADERFPMMSTFKVVLCGAVLA 78

Query: 208 AVQHNTVSWDDLLTVTAKSKAVGSSGLELPVGARVSVRTAAEKMIATSDNMATDLLIERL 267
V + + + S E + ++V I SDN A +LL+ +
Sbjct: 79 RVDAGDEQLERKIHYRQQDLVDYSPVSEKHLADGMTVGELCAAAITMSDNSAANLLLATV 138

Query: 268 -GTRAIEEALASAGHHD 283
G + L G +
Sbjct: 139 GGPAGLTAFLRQIGDNV 155


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv3595c   cloacin  39     4e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 38.5 bits (89), Expect = 4e-05
Identities = 31/95 (32%), Positives = 36/95 (37%)

Query: 152 GGSGGAAGLIGNGGSGGAGGAGAAGGSGGQGGLLYGNGGAGGNGGAATIPGGNGGAGGAG 211
G + GA GN G G G S G G N GG+G GG+G G G
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 212 GNAWLFGNGGAGGLGAAGAAGAAGVNPLTVPAGQG 246
G+G G L A A A G L+ P G
Sbjct: 68 NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 37.4 bits (86), Expect = 1e-04
Identities = 34/108 (31%), Positives = 40/108 (37%), Gaps = 8/108 (7%)

Query: 309 GGAGGIGGTGGEGGIGARGGTGGQGGMGGAGQPGVGGDAGDGGNGGIGGDGGAGGDGGAG 368
GG G TG G G G+GG G G + + GG G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 369 GAGGAGGLFGVSGSSGLGGAAGSGGNGGGGGEPGVAGSPGVGPAGRGG 416
G GG G GG +G+GGN P G P + G GG
Sbjct: 63 GNGGGNG--------NSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 35.1 bits (80), Expect = 6e-04
Identities = 33/93 (35%), Positives = 37/93 (39%), Gaps = 3/93 (3%)

Query: 119 GADGTAPGQNGGAGGLLYGNGGNGAAG---VNAGIAGGSGGAAGLIGNGGSGGAGGAGAA 175
GA T+ NGG GL G G + +G N GGSG G G G GG G +
Sbjct: 12 GAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNS 71

Query: 176 GGSGGQGGLLYGNGGAGGNGGAATIPGGNGGAG 208
GG G GG L G A G GG
Sbjct: 72 GGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 34.7 bits (79), Expect = 7e-04
Identities = 35/106 (33%), Positives = 41/106 (38%), Gaps = 1/106 (0%)

Query: 286 TGGTGGTGGAGGSGGRGGLLVGDGGAGGIGGTGGEGGIGARGGTGGQGGMGGAGQPGVGG 345
+GG G G G + G G G GG G + G G G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 346 DAGDGGNGGIGGDGGAGGDGGAGGAGGAGGLFGVSGSSGLGGAAGS 391
GGNG GG G GG+ A A A G +S + G GG A S
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALS-TPGAGGLAVS 106



Score = 31.2 bits (70), Expect = 0.010
Identities = 33/106 (31%), Positives = 39/106 (36%), Gaps = 6/106 (5%)

Query: 130 GAGGLLYGNGGNGAAGVNAGIAGGSGGAAGLIGNGGSGGAGGAGAAGGSGGQGGLLYGNG 189
G G + G + +G G G G G + GSG + GG G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGG--ASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 190 GAGGNGGAATIPGGNGGAGGAGGNAWLFGNGGAGGLGAAGAAGAAG 235
G G GG G +GG G GGN A G A GA G
Sbjct: 61 GHGNGGGN----GNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 30.5 bits (68), Expect = 0.013
Identities = 27/82 (32%), Positives = 29/82 (35%), Gaps = 1/82 (1%)

Query: 267 TGGTG-GTGGTGLSVGGTGGTGGTGGTGGAGGSGGRGGLLVGDGGAGGIGGTGGEGGIGA 325
+GG G G S G G TG G G S G G + GG G GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 326 RGGTGGQGGMGGAGQPGVGGDA 347
G GG G GG G A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSA 83



Score = 29.3 bits (65), Expect = 0.034
Identities = 29/91 (31%), Positives = 36/91 (39%)

Query: 342 GVGGDAGDGGNGGIGGDGGAGGDGGAGGAGGAGGLFGVSGSSGLGGAAGSGGNGGGGGEP 401
G G + G G+ G G G G + G S ++ GG +GSG + GGG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 402 GVAGSPGVGPAGRGGDGNLGQFGPEGAPGQP 432
G G G G G GNL A G P
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFP 93


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
Rv3596c   HTHFIS  32     0.007    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 32.5 bits (74), Expect = 0.007
Identities = 31/160 (19%), Positives = 59/160 (36%), Gaps = 28/160 (17%)

Query: 518 IIGQEDAVKAVSKAIRRTRAGLKDPKRPSGSFIFAGPSGVGKTELSKALANFLFGDDDAL 577
++G+ A++ + + + R + + + G SG GK +++AL ++ +
Sbjct: 139 LVGRSAAMQEIYRVLARL-------MQTDLTLMITGESGTGKELVARALHDYGKRRNGPF 191

Query: 578 IQIDMGEFHDRFTASRLFGAPPGYVGYEEGGQLTEKVRRKP--FS-----VVLFDEIEKA 630
+ I+M S LFG E G T R F + DEI
Sbjct: 192 VAINMAAIPRDLIESELFGH--------EKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDM 243

Query: 631 HQEIYNSLLQVLEDG---RLTDGQGRTVDFKNTVLIFTSN 667
+ LL+VL+ G + D + ++ +N
Sbjct: 244 PMDAQTRLLRVLQQGEYTTVGGRTPIRSDVR---IVAATN 280


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv3600c   PF03309  361    e-129    Bvg accessory factor
		>PF03309#Bvg accessory factor

Length = 271

Score = 361 bits (927), Expect = e-129
Identities = 234/272 (86%), Positives = 252/272 (92%), Gaps = 1/272 (0%)

Query: 1 MLLAIDVRNTHTVVGLLSGMKEHAKVVQQWRIRTESEVTADELALTIDGLIGEDSERLTG 60
MLLAIDVRNTHTVVGL+SG +HAKVVQQWRIRTE EVTADELALTIDGLIG+D+ERLTG
Sbjct: 1 MLLAIDVRNTHTVVGLISGSGDHAKVVQQWRIRTEPEVTADELALTIDGLIGDDAERLTG 60

Query: 61 TAALSTVPSVLHEVRIMLDQYWPSVPHVLIEPGVRTGIPLLVDNPKEVGADRIVNCLAAY 120
+ LSTVPSVLHEVR+ML+QYWP+VPHVLIEPGVRTGIPLLVDNPKEVGADRIVNCLAAY
Sbjct: 61 ASGLSTVPSVLHEVRVMLEQYWPNVPHVLIEPGVRTGIPLLVDNPKEVGADRIVNCLAAY 120

Query: 121 DRFRKAAIVVDFGSSICVDVVSAKGEFLGGAIAPGVQVSSDAAAARSAALRRVELARPRS 180
++ AAIVVDFGSSICVDVVSAKGEFLGGAIAPGVQVSSDAAAARSAALRRVEL RPRS
Sbjct: 121 HKYGTAAIVVDFGSSICVDVVSAKGEFLGGAIAPGVQVSSDAAAARSAALRRVELTRPRS 180

Query: 181 VVGKNTVECMQAGAVFGFAGLVDGLVGRIREDVSGFSVDHDVAIVATGHTAPLLLPELHT 240
V+GKNTVECMQAGAVFGFAGLVDGLV RIR+DV GFS DVA+VATGHTAPL+LP+L T
Sbjct: 181 VIGKNTVECMQAGAVFGFAGLVDGLVNRIRDDVDGFS-GADVAVVATGHTAPLVLPDLRT 239

Query: 241 VDHYDQHLTLQGLRLVFERNLEVQRGRLKTAR 272
V+HYD+HLTL GLRLVFERN QRG+LK AR
Sbjct: 240 VEHYDRHLTLDGLRLVFERNRANQRGKLKPAR 271


80  Rv3876  Rv3886c  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
Rv3876    0       18       0.155139   ESX-1 secretion-associated protein EspI.
Rv3877    0       16       0.132249   ESX conserved component EccD1. ESX-1 type VII
Rv3878    0       13       0.929651   ESX-1 secretion-associated protein EspJ.
Rv3879c   0       14       0.029304   ESX-1 secretion-associated protein EspK. Alanine
Rv3880c   -2      13       -0.427705  ESX-1 secretion-associated protein EspL
Rv3881c   0       14       0.055237   Secreted ESX-1 substrate protein B, EspB.
Rv3882c   2       14       0.013157   ESX conserved component EccE1. ESX-1 type VII
Rv3883c   1       15       -0.704325  Membrane-anchored mycosin MycP1 (serine
Rv3884c   1       15       -1.086244  ESX conserved component EccA2. ESX-2 type VII
Rv3885c   3       17       -0.685389  ESX conserved component EccE2. ESX-2 type VII
Rv3886c   4       17       -1.105920  Probable alanine and proline rich
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
Rv3876    TONBPROTEIN  42     3e-06    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 42.3 bits (99), Expect = 3e-06
Identities = 29/126 (23%), Positives = 38/126 (30%), Gaps = 5/126 (3%)

Query: 71 PPPPPPPTPMPIAAGEPPSPEPAASKPPTPPMPIAGPEPAPPKPPTPPMPIAGPEPAPPK 130
P P P + + + P+ P P PEP P P P+ I P+P P
Sbjct: 39 PAPAQPISVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKP 98

Query: 131 PPTP-PMPIAGPAPTPTESQLAPPRPPTPQTPTGAPQQPESPAPHVP----SHGPHQPRR 185
P P P + P P P + A P + GP R
Sbjct: 99 KPKPVKKVQEQPKRDVKPVESRPASPFENTAPARLTSSTATAATSKPVTSVASGPRALSR 158

Query: 186 TAPAPP 191
P P
Sbjct: 159 NQPQYP 164



Score = 40.0 bits (93), Expect = 1e-05
Identities = 28/137 (20%), Positives = 38/137 (27%), Gaps = 6/137 (4%)

Query: 92 PAASKPPTPPMPIAGPEPAPPKPPTPPMPIAGPEPAPPKPPTPPMPIAGPAPTPTESQLA 151
PA ++P + M P PP P+ PEP P P PP P
Sbjct: 39 PAPAQPISVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKP 98

Query: 152 PPRPPTPQTPTGAPQQPESPAPHVPSHGPHQPRRTAPAPPWAKMPIGEPPPAPSRPSASP 211
P+P + P R + P AS
Sbjct: 99 KPKPVKKVQEQPKRDVKPVESRPASPFENTAPARLTSSTA------TAATSKPVTSVASG 152

Query: 212 AEPPTRPAPQHSRRARR 228
+R PQ+ RA+
Sbjct: 153 PRALSRNQPQYPARAQA 169



Score = 36.1 bits (83), Expect = 2e-04
Identities = 25/116 (21%), Positives = 35/116 (30%), Gaps = 5/116 (4%)

Query: 34 PPAPASANLPKPNGQTPPPTSDDLSERFVSAPPPPPPPPPPPPPTPMPIAAGEPPSPEPA 93
P P S + P PP E V P P P P PP P+ I +P
Sbjct: 41 PAQPISVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKP 100

Query: 94 ASKPPTPPMPIAGPEPAPPKPPTPPMPIAGPEPAPPKPPTPPMPIAGPAPTPTESQ 149
P +P +P +P AP + + A P + +
Sbjct: 101 KPVKKVQEQPKRDVKPVESRPASPFE-----NTAPARLTSSTATAATSKPVTSVAS 151



Score = 30.7 bits (69), Expect = 0.014
Identities = 22/122 (18%), Positives = 33/122 (27%)

Query: 109 PAPPKPPTPPMPIAGPEPAPPKPPTPPMPIAGPAPTPTESQLAPPRPPTPQTPTGAPQQP 168
PAP +P + M P PP P+ P P P P P +P
Sbjct: 39 PAPAQPISVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKP 98

Query: 169 ESPAPHVPSHGPHQPRRTAPAPPWAKMPIGEPPPAPSRPSASPAEPPTRPAPQHSRRARR 228
+ P + + + P + P S + + P R R
Sbjct: 99 KPKPVKKVQEQPKRDVKPVESRPASPFENTAPARLTSSTATAATSKPVTSVASGPRALSR 158

Query: 229 GH 230

Sbjct: 159 NQ 160


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv3879c   PF05616  45     1e-06    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 44.7 bits (105), Expect = 1e-06
Identities = 47/174 (27%), Positives = 64/174 (36%), Gaps = 15/174 (8%)

Query: 173 LEDLLQQKSPPPPDVPTLVVPSPGTPG-TPGTPITPGTPITPGTPITPIPGAPVTPITPT 231
LE++L K PD + + G PG + + PGT + G P+T G PV +
Sbjct: 240 LEEILSLKVDANPDK---YIKATGYPGYSEKVEVAPGTKVNMG-PVTDRNGNPVQVVATF 295

Query: 232 PGTPVTPVTPGKPVTPVTPVKPGTPGEPTPITPVTPPVAPATPATPATPVTPAPAPHPQP 291
T V P + PG+ P P+ P V+PA PA P P
Sbjct: 296 GRDSQGNTTVDVQVIPRPDLTPGSAEAPNA-QPL-PEVSPAE--------NPANNPAPNE 345

Query: 292 APAPAPSPGPQPVTPATPGPSGPATPGTPGGEPAPHVKPAALAEQPGVPGQHAG 345
P P+P P P P PGT PA +P + G+ G
Sbjct: 346 NPGTRPNPEPDPDLNPDANPDTDGQPGTRPDSPAVPDRPNGRHRKERKEGEDGG 399


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
Rv3881c   TONBPROTEIN  33     0.002    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 32.7 bits (74), Expect = 0.002
Identities = 18/86 (20%), Positives = 25/86 (29%), Gaps = 1/86 (1%)

Query: 271 EPVNPPKPPPAIKIDPPPPPQEQGLIPGFLMPPSDGSGVTPGTGMPAAPMVPPTG-SPGG 329
P +P P + P PP + +I P V P P
Sbjct: 64 PPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESRPAS 123

Query: 330 GLPADTAAQLTSAGREAAALSGDVAV 355
A+LTS+ AA +V
Sbjct: 124 PFENTAPARLTSSTATAATSKPVTSV 149


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
Rv3883c   SUBTILISIN  176    1e-53    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 176 bits (448), Expect = 1e-53
Identities = 86/343 (25%), Positives = 129/343 (37%), Gaps = 69/343 (20%)

Query: 64 PWSNTYLGVADAHKFATGAGVTVAVIDTGVDAS-PRVP--AEPGGDFVDQAG---NGLSD 117
P + G GV VAV+DTG DA P + G +F D D
Sbjct: 23 PRGVEMIQAPAVWNQTRGRGVKVAVLDTGCDADHPDLKARIIGGRNFTDDDEGDPEIFKD 82

Query: 118 CDAHGTLTASIIAGRPAPTDGFVGVAPDARLLSLRQTSEAFEPVGSQANPNDPNATPAAG 177
+ HGT A IA +G VGVAP+A LL ++ GS G
Sbjct: 83 YNGHGTHVAGTIAATE-NENGVVGVAPEADLLIIK----VLNKQGS-------------G 124

Query: 178 SIRSLARAVVHAANLGVGVINISEAACYKVSRPIDETSLGASIDYAVNVKGVVVVVAAGN 237
+ + + +A V +I++S + P D L ++ AV ++V+ AAGN
Sbjct: 125 QYDWIIQGIYYAIEQKVDIISMS------LGGPEDVPELHEAVKKAVA-SQILVMCAAGN 177

Query: 238 TGGDCVQNPAPDPSTPGDPRGWNNVQTVVTPAWYAPLVLSVGGIGQTGMPSSFSMHGPWV 297
G G + + P Y V+SVG I S FS V
Sbjct: 178 EG-----------------DGDDRTDELGYPGCY-NEVISVGAINFDRHASEFSNSNNEV 219

Query: 298 DVAAPAENIVALGDTGEPVNALQGREGPVPIAGTSFAAAYVSGLAALLRQRFP-----DL 352
D+ AP E+I++ G+ +GTS A +V+G AL++Q DL
Sbjct: 220 DLVAPGEDILSTVPGGKYAT----------FSGTSMATPHVAGALALIKQLANASFERDL 269

Query: 353 TPAQIIHRITATARHPGGGVDDLVGAGVIDAVAA----LTWDI 391
T ++ ++ P G + G G++ A +D
Sbjct: 270 TEPELYAQLIKRTI-PLGNSPKMEGNGLLYLTAVEELSRIFDT 311


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
Rv3885c   PF06580  29     0.048    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.1 bits (65), Expect = 0.048
Identities = 10/57 (17%), Positives = 22/57 (38%)

Query: 24 VLASAGWALGGQLGAVMAVVVGVALVFVQWWGQPAWSWAVLGLRGRRPVKWNDPITL 80
+ GW ++ V+ ++ + W+ W +L +PV + P+ L
Sbjct: 62 FIKRQGWLKLNMGQIILRVLPACVVIGMVWFVANTSIWRLLAFINTKPVAFTLPLAL 118


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
Rv3886c   SUBTILISIN  118    8e-32    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 118 bits (298), Expect = 8e-32
Identities = 57/259 (22%), Positives = 99/259 (38%), Gaps = 63/259 (24%)

Query: 236 GVVGVAPHATIISIRQSSRAFEPVNPSSAGPNSDEKVKAGTLDSVARAVVHAANMGAKVI 295
GVVGVAP A ++ I+ + K +G D + + + +A +I
Sbjct: 102 GVVGVAPEADLLIIKVLN-----------------KQGSGQYDWIIQGIYYAIEQKVDII 144

Query: 296 NISVTACLPAAAPGDQRVLGAALWYAATVKDAVIVAAAGNDGEAGCGNNPMYDPLDPSDP 355
++S+ P D L A+ A +++ AAGN+G+ +
Sbjct: 145 SMSL------GGPEDVPELHEAVKKA-VASQILVMCAAGNEGDGDDRTDE---------- 187

Query: 356 RDWHQVTVVSSPSWFSDYVLSVGAVDAYGAALDKSMSGPWVGVAAPGTHIMGLSPQGGGP 415
+ P + + V+SVGA++ A + S S V + APG I+ P G
Sbjct: 188 --------LGYPGCY-NEVISVGAINFDRHASEFSNSNNEVDLVAPGEDILSTVPGG--- 235

Query: 416 VNAYPPSRPGEKNMPFWGTSFSAAYVSGVAALVRAKFP-----ELTAYQVINRIVQSAHN 470
K F GTS + +V+G AL++ +LT ++ ++++
Sbjct: 236 -----------KYATFSGTSMATPHVAGALALIKQLANASFERDLTEPELYAQLIKRT-I 283

Query: 471 PPAGVDNKLGYGLVDPVAA 489
P G GL+ A
Sbjct: 284 PLGNSPKMEGNGLLYLTAV 302



Score = 64.1 bits (156), Expect = 3e-13
Identities = 24/85 (28%), Positives = 39/85 (45%), Gaps = 7/85 (8%)

Query: 71 PDVAQLAPGFNLVNISKAWQYSTGNGVPVAVIDTGVSPN-PRLP--VVPGGDYIMGEDG- 126
V ++ G ++ W + G GV VAV+DTG + P L ++ G ++ ++G
Sbjct: 17 QQVNEIPRGVEMIQAPAVWNQTRGRGVKVAVLDTGCDADHPDLKARIIGGRNFTDDDEGD 76

Query: 127 ---LSDCDAHGTVVSSIIAAAPLGI 148
D + HGT V+ IAA
Sbjct: 77 PEIFKDYNGHGTHVAGTIAATENEN 101



 
Contact Sachin Pundhir for Bugs/Comments.