PredictBias

Identification of genomic and pathogenicity islands in prokaryotic genomes
A) Input parameters
Genome: 466.gbk
Threshold dinucleotide bias: 2
Threshold codon bias: 4
Threshold %GC bias: 3
E-value (RPSBlast): 0.05
Genome (non-pathogenic):
 
C) Potential GIs and PAIs in NC_009080
S.No  Start          End            Bias  Virulence  Insertion elements  Prediction
1     BMA10247_0001  BMA10247_0027  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias     Product
BMA10247_0001  0       10       -4.168845   chromosomal replication initiation protein
BMA10247_0002  0       13       -4.469314   DNA polymerase III subunit beta
BMA10247_0003  1       14       -4.022329   DNA gyrase subunit B
BMA10247_0004  1       13       -1.811934   transposase subfamily protein
BMA10247_0005  -1      13       -0.302579   transposase
BMA10247_0007  -1      11       1.061567    TonB domain-containing protein
BMA10247_0008  -1      11       -0.068544   hypothetical protein
BMA10247_0009  -2      13       -1.205062   hypothetical protein
BMA10247_0010  -1      13       -1.078100   O-antigen polymerase
BMA10247_0011  2       15       -0.920019   type IV pilin
BMA10247_0012  2       14       -2.176412   TerC family integral membrane protein
BMA10247_0013  2       13       -1.992173   succinyl-CoA synthetase subunit alpha
BMA10247_0014  1       9        -0.682875   succinyl-CoA synthetase subunit beta
BMA10247_0015  -1      9        -0.636172   hypothetical protein
BMA10247_0016  0       11       -0.507061   recombination regulator RecX
BMA10247_0017  2       13       -1.683531   recombinase A
BMA10247_0018  1       12       -1.904057   DNA-binding response regulator
BMA10247_0019  2       13       -2.828499   sensor histidine kinase
BMA10247_0020  3       17       -4.294794   major facilitator family transporter
BMA10247_0021  7       28       -4.271849   hypothetical protein
BMA10247_0023  4       14       -1.232471*  hypothetical protein
BMA10247_0024  0       14       0.605981    IS407A, transposase OrfB
BMA10247_0025  0       11       2.632678    IS407A, transposase OrfA
BMA10247_0026  -1      12       3.107292    molybdenum cofactor biosynthesis protein C
BMA10247_0027  -1      10       3.010749    hypothetical protein
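
The three bias columns can be read against the thresholds listed under "A) Input parameters". The page does not say how PredictBias combines the three measures, so the following is only a minimal sketch under one assumed rule: a measure counts for an ORF when its absolute value reaches the corresponding threshold. The names `OrfBias` and `measures_at_threshold` are hypothetical and are not part of the server.

```python
from dataclasses import dataclass

# Thresholds copied from the "Input parameters" section of this page.
THRESHOLDS = {"dn": 2.0, "cdn": 4.0, "gc": 3.0}


@dataclass
class OrfBias:
    locus_tag: str
    dn: float   # DNBias column
    cdn: float  # CDNBias column
    gc: float   # %GCBias column


def measures_at_threshold(orf, thresholds=THRESHOLDS):
    """List the measures whose magnitude reaches its threshold.

    Assumption: a measure counts when abs(value) >= threshold. This is an
    illustrative rule, not the documented PredictBias decision procedure.
    """
    values = {"dn": orf.dn, "cdn": orf.cdn, "gc": orf.gc}
    return [name for name, value in values.items()
            if abs(value) >= thresholds[name]]


# Two rows taken from the first island table above.
rows = [
    OrfBias("BMA10247_0008", -1, 11, -0.068544),
    OrfBias("BMA10247_0021", 7, 28, -4.271849),
]

for row in rows:
    print(row.locus_tag, measures_at_threshold(row))
```

Under this assumed rule, every ORF in the table reaches the codon-bias threshold of 4, while the dinucleotide and %GC measures vary from row to row.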
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
BMA10247_0001  PERTACTIN  33     0.004    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 32.8 bits (74), Expect = 0.004
Identities = 24/93 (25%), Positives = 31/93 (33%)

Query: 81 PKAGQRSPAGATPLAPRAPLPSANPAPVGPGPACAPAVDAHAPAPAGMNAATAAAVAAAQ 140
P A + +P P+ P P P P P +A AP P +AAA AA
Sbjct: 568 PPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAGRELSAAANAAVN 627

Query: 141 AAQAAQANAAALNADEAADLDLPSLTAHEAAAG 173
A+ A L L + A G
Sbjct: 628 TGGVGLASTLWYAESNALSKRLGELRLNPDAGG 660
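
Each alignment on this page carries a "Score = ..., Expect = ..." line and an "Identities = ..." line in the usual BLAST text layout. The sketch below shows one way to pull those numbers out and compare the E-value with the 0.05 cutoff from the input parameters. It assumes hits at or below the cutoff are the ones retained; `parse_hit` and `parse_expect` are hypothetical helper names, not part of PredictBias or BLAST.

```python
import re

E_VALUE_CUTOFF = 0.05  # "E-value (RPSBlast)" from the input parameters

STATS_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
IDENT_RE = re.compile(r"Identities\s*=\s*(\d+)/(\d+)")


def parse_expect(text):
    """Convert an Expect field to float.

    Very small values are printed without a mantissa on this page
    (for example 'e-118'), which float() rejects unless '1' is prepended.
    """
    text = text.rstrip(",")
    return float("1" + text if text.startswith("e") else text)


def parse_hit(block):
    """Extract bit score, raw score, E-value and identity fraction."""
    stats = STATS_RE.search(block)
    ident = IDENT_RE.search(block)
    if stats is None or ident is None:
        raise ValueError("Score/Identities lines not found")
    matched, aligned = int(ident.group(1)), int(ident.group(2))
    return {
        "bits": float(stats.group(1)),
        "raw_score": int(stats.group(2)),
        "expect": parse_expect(stats.group(3)),
        "identity": matched / aligned,
    }


# Statistics lines copied from the PERTACTIN alignment above.
block = """Score = 32.8 bits (74), Expect = 0.004
Identities = 24/93 (25%), Positives = 31/93 (33%)"""

hit = parse_hit(block)
print(hit)
print("within cutoff:", hit["expect"] <= E_VALUE_CUTOFF)
```

The special case in `parse_expect` matters for the HTHFIS hits further down, whose Expect values appear as `e-118` and `e-129`.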


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
BMA10247_0007  PF03544  39     9e-07    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 39.2 bits (91), Expect = 9e-07
Identities = 18/97 (18%), Positives = 28/97 (28%), Gaps = 2/97 (2%)

Query: 18 AGCAAFAPRDAAKLECTMPVAAYPENAKPLERRATVLVRAMITASGNAENVTVTTSSRNA 77
+ L P YP A+ L V V+ +T G +NV + ++
Sbjct: 147 SKPVTSVASGPRALSRNQPQ--YPARAQALRIEGQVKVKFDVTPDGRVDNVQILSAKPAN 204

Query: 78 AADRAAVDAMSRIACSQTPARGGEPYPFTLTRPFVFE 114
+R +AM R G E
Sbjct: 205 MFEREVKNAMRRWRYEPGKPGSGIVVNILFKINGTTE 241


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
BMA10247_0010  PF06580  29     0.048    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.1 bits (65), Expect = 0.048
Identities = 17/107 (15%), Positives = 41/107 (38%), Gaps = 14/107 (13%)

Query: 205 AALSALLSVGLALTVSRGPWLQVG-----------VMVVAGFWMAFA-QARRDPA--ASR 250
+ + +L+ + R WL++ +V+ W R A ++
Sbjct: 49 SLMGLVLTHAYRSFIKRQGWLKLNMGQIILRVLPACVVIGMVWFVANTSIWRLLAFINTK 108

Query: 251 ARAWAIPVVLGVLFVAVNVAVRWANVHYHLGLAESAADRMRDAGQIA 297
A+ +P+ L ++F V V W+ +++ ++ D ++A
Sbjct: 109 PVAFTLPLALSIIFNVVVVTFMWSLLYFGWHFFKNYKQAEIDQWKMA 155


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_0011  BCTERIALGSPG  42     2e-07    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 42.2 bits (99), Expect = 2e-07
Identities = 18/59 (30%), Positives = 30/59 (50%), Gaps = 5/59 (8%)

Query: 15 RRRLRARGFTLIELMIVLAIVGVVAAYAIPAYQDYLARSRVGEGLALAASARLAVAENA 73
R + RGFTL+E+M+V+ I+GV+A+ +P + A + + ENA
Sbjct: 2 RATDKQRGFTLLEIMVVIVIIGVLASLVVPNLM-----GNKEKADKQKAVSDIVALENA 55


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
BMA10247_0018  HTHFIS  99     8e-26    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 98.8 bits (246), Expect = 8e-26
Identities = 35/118 (29%), Positives = 64/118 (54%), Gaps = 1/118 (0%)

Query: 2 RILIAEDDSILADGLTRSLRQSGYAVDHVRNGVEADTALSMQTFDLLILDLGLPRMSGLE 61
IL+A+DD+ + L ++L ++GY V N ++ DL++ D+ +P + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VLRRLRARNSNLPVLILTAADSVDERVKGLDLGADDYMAKPFALNE-LEARVRALTRR 118
+L R++ +LPVL+++A ++ +K + GA DY+ KPF L E + RAL
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
BMA10247_0019  PF06580  47     7e-08    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 47.2 bits (112), Expect = 7e-08
Identities = 44/198 (22%), Positives = 78/198 (39%), Gaps = 49/198 (24%)

Query: 287 LAGLRTQAEF-ALRHEVNADVAH----SLEQIA----TSSEQAARLVTQLLALARAENRA 337
+A + +A+ AL+ ++N H +L I +A ++T L L R
Sbjct: 154 MASMAQEAQLMALKAQINP---HFMFNALNNIRALILEDPTKAREMLTSLSELMRY---- 206

Query: 338 TGLTFEPVEIASLARQ--AVRDWV---QVALAKQMDLGYESPDTDAPLRIDGQPVMLREM 392
L + SLA + V ++ + ++ + +++ P ML
Sbjct: 207 -SLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQV---PPML--- 259

Query: 393 LGNLIDNAIRY----TPAGGRITVRVRAERAAGAVHLEVEDTGPGIPPNERERVVERFYR 448
+ L++N I++ P GG+I ++ + G V LEVE+TG N +E
Sbjct: 260 VQTLVENGIKHGIAQLPQGGKILLKGTKDN--GTVTLEVENTGSLALKNTKE-------- 309

Query: 449 ILGREGDGSGLGLAIVRE 466
+G GL VRE
Sbjct: 310 -------STGTGLQNVRE 320


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
BMA10247_0020  TCRTETA  35     8e-04    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 34.8 bits (80), Expect = 8e-04
Identities = 47/264 (17%), Positives = 93/264 (35%), Gaps = 28/264 (10%)

Query: 77 AIVFGRLGDLVGRKHTFLITIVIMGISTFVVGFLPGYASIGIAAPVIFIAMRLLQGLALG 136
A V G L D GR+ L+++ + ++ P V++I R++ G+ G
Sbjct: 60 APVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAP-------FLWVLYIG-RIVAGIT-G 110

Query: 137 GEYGGAATYVAEHAPSHRRGFYTSWIQTTATLGLFLSLLVILGVRTAIGEEAFGSWGWRV 196
A Y+A+ R + ++ G+ + G +G G +
Sbjct: 111 ATGAVAGAYIADITDGDERARHFGFMSACFGFGM------VAG--PVLGGLM-GGFSPHA 161

Query: 197 PFVASILLLAVSVWIRLQLNESPVFLRIKAEGKTSKAPLTEAFGQWKNLKIVILALIGLT 256
PF A+ L ++ L + + + PL + + L
Sbjct: 162 PFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASF-----RWARGMTVVAALM 216

Query: 257 AGQAVVWYTGQFYA---LFFLTQTLKVDGASANILIALALLIGTPF-FVFFGSLSDRIGR 312
A ++ GQ A + F D + I +A ++ + + G ++ R+G
Sbjct: 217 AVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGE 276

Query: 313 KPIILAGCLIAALTYFPLFKALTH 336
+ ++ G +IA T + L T
Sbjct: 277 RRALMLG-MIADGTGYILLAFATR 299



Score = 34.8 bits (80), Expect = 8e-04
Identities = 17/42 (40%), Positives = 24/42 (57%)

Query: 287 ILIALALLIGTPFFVFFGSLSDRIGRKPIILAGCLIAALTYF 328
IL+AL L+ G+LSDR GR+P++L AA+ Y
Sbjct: 47 ILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYA 88


2     BMA10247_0082  BMA10247_0097  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias     Product
BMA10247_0082  0       9        3.271661    HAD family hydrolase
BMA10247_0084  0       11       3.877397    carbohydrate ABC transporter ATP-binding
BMA10247_0083  0       11       4.542665    hypothetical protein
BMA10247_0085  -1      11       4.465870    hypothetical protein
BMA10247_0086  1       11       5.245779    LysR family transcriptional regulator
BMA10247_0087  1       12       5.499174    esterase
BMA10247_0088  0       12       5.296582    major facilitator family transporter
BMA10247_0089  0       12       4.840151    transcriptional regulator
BMA10247_0090  2       11       5.074949    xylulokinase
BMA10247_0091  2       11       4.751357    mannitol dehydrogenase
BMA10247_0092  1       11       5.165784    LysR family transcriptional regulator
BMA10247_0093  1       11       4.458416    benzoylformate decarboxylase
BMA10247_0094  2       11       3.826509    aldehyde dehydrogenase
BMA10247_0095  3       12       3.635028    2-dehydropantoate 2-reductase
BMA10247_0096  2       11       2.649399    hypothetical protein
BMA10247_0097  2       12       2.230252    major facilitator family transporter
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
BMA10247_0084  PF05272  30     0.017    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.0 bits (67), Expect = 0.017
Identities = 14/35 (40%), Positives = 17/35 (48%)

Query: 32 VVFVGPSGCGKSTLMRMIAGLEEISGGELLIDGAK 66
VV G G GKSTL+ + GL+ S I K
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGK 633


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
BMA10247_0085  PF06776  34     0.001    Invasion associated locus B
		>PF06776#Invasion associated locus B

Length = 214

Score = 33.8 bits (77), Expect = 0.001
Identities = 20/80 (25%), Positives = 25/80 (31%), Gaps = 11/80 (13%)

Query: 60 RLCRRIGGRHAAGP------APARESPSENSMKTGRRHFVRSVASASAALAAAAWSPARA 113
R+ RR HA PA SP + + RR R+ A A A A
Sbjct: 10 RISRRPVTNHAVPALKAIQMGPAELSPM---LASCRRLARRNGARLMLAGAMAI--ALSF 64

Query: 114 AIDAPASPATALSLTPGRWS 133
A A+ G W
Sbjct: 65 GWSDRADAQGAVRSVHGDWQ 84


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
BMA10247_0087  BLACTAMASEA  30     0.019    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 29.8 bits (67), Expect = 0.019
Identities = 11/35 (31%), Positives = 15/35 (42%)

Query: 57 REDALFRFASVSKPIVSAAAMRAVAAGKLDLDASI 91
R D F S K ++ A + V AG L+ I
Sbjct: 57 RADERFPMMSTFKVVLCGAVLARVDAGDEQLERKI 91


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
BMA10247_0088  TCRTETB  36     3e-04    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 35.6 bits (82), Expect = 3e-04
Identities = 31/155 (20%), Positives = 59/155 (38%), Gaps = 5/155 (3%)

Query: 26 LLALATAGFITIVTEALPAGLLPLMGRDLRVSDALVGQLVTVYAAGSIVAAMPLVAATRG 85
L+ L F +++ E + LP + D A + T + + +
Sbjct: 16 LIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ 75

Query: 86 MRRRPLLLAALAGFVVANTATAASPYYAPVLV-ARCVAGVSAGLLWALLAGYASRMVDAR 144
+ + LLL + + + +L+ AR + G A AL+ +R +
Sbjct: 76 LGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKE 135

Query: 145 QRGRAIAIAMLGAPVAMSVGI-PL-GTALGAALGW 177
RG+A ++G+ VAM G+ P G + + W
Sbjct: 136 NRGKAFG--LIGSIVAMGEGVGPAIGGMIAHYIHW 168


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
BMA10247_0097  TCRTETA  40     1e-05    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 40.2 bits (94), Expect = 1e-05
Identities = 88/424 (20%), Positives = 144/424 (33%), Gaps = 52/424 (12%)

Query: 50 AVLLAAFAIVLDGFDSQLIGFAIPVLIKEWGITRDA---FAPAVAAGLFGMGVGSACAGL 106
+++ + LD LI +P L+++ + D + +A + G
Sbjct: 6 PLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGA 65

Query: 107 FADRFGRRWAIIGSVFVFGAATCAIGFAPNVATIAALRFVAGLGIGGALPTATTMTAEYT 166
+DRFGRR ++ S+ + AP + + R VAG+ G A A+ T
Sbjct: 66 LSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGI-TGATGAVAGAYIADIT 124

Query: 167 PARRRTMMVTATIVCVPAGGMLAGLFAHEVLPAYGWRGLFWLGGALPLALGLLLVRALPE 226
R C GM+AG ++ + F+ AL L LPE
Sbjct: 125 DGDERARHFGFMSACF-GFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPE 183

Query: 227 SPRYLARNPARWRELGALLARMGRPVADGTAFTDLAEARAHEGQRRGVRALFSAAYARDT 286
S + R P R L AR V AL + +
Sbjct: 184 SHKG-ERRPLRREALNP--------------LASFRWARG----MTVVAALMAVFF---I 221

Query: 287 IALWCAFCMCLLAVYSA--FSWLPTMLTSQGLSVSVAGSGLTAYNLGGVLGALGCALAIG 344
+ L L ++ F W T + +S+A G+L +L A+ G
Sbjct: 222 MQLVGQVPAALWVIFGEDRFHWDATTI-----GISLAA--------FGILHSLAQAMITG 268

Query: 345 RFGSRW-PLAFCCAGGAASAAWLLGVDAGSHAGWLI----VGLAAHGFFVNAVQSTMYAL 399
+R G A + + + GW+ V LA+ G + A+Q+ +
Sbjct: 269 PVAARLGERRALMLGMIADGTGYILLAFATR-GWMAFPIMVLLASGGIGMPALQAMLSRQ 327

Query: 400 CTFIYPTPVRATGTAGAVAFGRVGAILSAFAGAYVISAGGANAYLAMLAAAMAVVLVALL 459
++ + A VG +L F Y S N + + AA+ L+ L
Sbjct: 328 VDEERQGQLQGSLAALTSLTSIVGPLL--FTAIYAASITTWNGWAWIAGAAL--YLLCLP 383

Query: 460 ALRR 463
ALRR
Sbjct: 384 ALRR 387


3     BMA10247_0130  BMA10247_0151  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias     Product
BMA10247_0130  2       11       -2.471321   ISBma2, transposase
BMA10247_0131  1       10       -2.169686   mulitcopper oxidase domain-containing protein
BMA10247_0133  3       18       -5.021923   hypothetical protein
BMA10247_0132  4       18       -5.475353   isocitrate dehydrogenase
BMA10247_0134  7       25       -4.614683   RNA pseudouridine synthase
BMA10247_0135  7       28       -4.828830   transposase
BMA10247_0136  7       26       -4.096400   transposase subfamily protein
BMA10247_0138  4       25       -2.106543*  transposase
BMA10247_0139  3       28       0.351178    transposase subfamily protein
BMA10247_0142  3       13       1.120956    hypothetical protein
BMA10247_0141  1       10       1.480240    chorismate mutase
BMA10247_0143  0       11       1.239707    hypothetical protein
BMA10247_0144  0       10       1.000084    hypothetical protein
BMA10247_0145  0       12       0.233135    hypothetical protein
BMA10247_0146  0       12       0.285699    TonB-dependent receptor
BMA10247_0147  2       21       0.104178    hypothetical protein
BMA10247_0148  2       30       -1.735860   hypothetical protein
BMA10247_0149  4       32       -2.680416   hypothetical protein
BMA10247_0150  2       29       -3.983773   hypothetical protein
BMA10247_0151  1       18       -3.247469   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_0146  SURFACELAYER  30     0.028    Lactobacillus surface layer protein signature.
		>SURFACELAYER#Lactobacillus surface layer protein signature.

Length = 439

Score = 30.4 bits (68), Expect = 0.028
Identities = 18/49 (36%), Positives = 25/49 (51%), Gaps = 1/49 (2%)

Query: 13 RLAAACAAALAWPAAHAASTAAAVPADSTPAAAAEMTASGKTLDTVKVT 61
R+ +A AAAL A A+TA V A +T A + + A+ V VT
Sbjct: 6 RIVSAAAAALL-AVAPIAATAMPVNAATTINADSAINANTNAKYDVDVT 53


4     BMA10247_0181  BMA10247_0201  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias     Product
BMA10247_0181  3       13       2.263520    formate dehydrogenase subunit beta
BMA10247_0182  -1      18       -0.310467   formate dehydrogenase subunit gamma
BMA10247_0183  -1      20       -1.839149   hypothetical protein
BMA10247_0184  -2      14       0.230972    molybdate transport repressor
BMA10247_0185  0       15       -1.482208   hypothetical protein
BMA10247_0186  2       17       -2.095054   phospholipid N-methyltransferase PmtA
BMA10247_0187  4       20       -3.836924   transposase
BMA10247_0188  3       21       -3.098801   transposase subfamily protein
BMA10247_0190  -2      12       -2.679151   phosphoglycolate phosphatase
BMA10247_0191  -1      12       -4.095140   3-demethylubiquinone-9 3-methyltransferase
BMA10247_0192  -1      11       -4.435825   OmpA family protein
BMA10247_0194  -2      9        -3.598425   hypothetical protein
BMA10247_0193  -1      8        -1.541468   hypothetical protein
BMA10247_0195  -1      7        -1.310082   DNA gyrase subunit A
BMA10247_0196  0       9        -1.858899   hypothetical protein
BMA10247_0197  0       8        -1.739554   phosphoserine aminotransferase
BMA10247_0198  -2      7        -2.012133   chorismate mutase/prephenate dehydratase
BMA10247_0199  -2      7        -2.281322   bifunctional prephenate
BMA10247_0200  -1      8        -3.859985   cytidylate kinase
BMA10247_0201  -2      8        -3.457770   30S ribosomal protein S1
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits        Score  E-value  Comments
BMA10247_0192  OMPADOMAIN  168    4e-53    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 168 bits (426), Expect = 4e-53
Identities = 75/146 (51%), Positives = 99/146 (67%), Gaps = 3/146 (2%)

Query: 74 AQAPAPAPVAPVAPAITSQKITYQADTLFDFDKAVLKPAGKQKLDELAAKIQGMNVE--V 131
AP AP AP + ++ T ++D LF+F+KA LKP G+ LD+L +++ ++ +
Sbjct: 195 EAAPVVAPAPAPAPEVQTKHFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGS 254

Query: 132 VVATGYTDRIGSDKYNDRLSLRRAQAVKSYLVSKGVPANKVYTEGKGKRNPVTGNTC-KQ 190
VV GYTDRIGSD YN LS RRAQ+V YL+SKG+PA+K+ G G+ NPVTGNTC
Sbjct: 255 VVVLGYTDRIGSDAYNQGLSERRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNV 314

Query: 191 KNRKQLIACLAPDRRVEVEVVGTQEV 216
K R LI CLAPDRRVE+EV G ++V
Sbjct: 315 KQRAALIDCLAPDRRVEIEVKGIKDV 340


5     BMA10247_0243  BMA10247_0262  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias     Product
BMA10247_0243  3       17       -0.021431   ISBma2, transposase
BMA10247_0244  3       22       0.670781    dihydrofolate reductase
BMA10247_0245  2       19       0.269292    sigma-54 dependent DNA-binding transcriptional
BMA10247_0246  2       19       -0.965939   hypothetical protein
BMA10247_0247  0       18       -3.208730   hypothetical protein
BMA10247_0248  1       16       -2.312833   hypothetical protein
BMA10247_0249  1       16       -2.468555   hypothetical protein
BMA10247_0250  1       14       -1.889039   sigma-54 dependent trancsriptional regulator
BMA10247_0251  1       16       -2.335651   thymidylate synthase
BMA10247_0252  1       13       -1.774234   ISBma2, transposase
BMA10247_0255  1       11       0.089518    ArsR family transcriptional regulator
BMA10247_0256  1       15       -2.706787   hypothetical protein
BMA10247_0257  2       14       -3.197334   hypothetical protein
BMA10247_0258  2       14       -3.213113   thioesterase
BMA10247_0259  0       11       -3.015896   fumarate hydratase
BMA10247_0260  0       14       -3.850626   hypothetical protein
BMA10247_0261  0       13       -4.276124   IS407A, transposase OrfB
BMA10247_0262  -1      12       -3.303062   IS407A, transposase OrfA
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
BMA10247_0245  HTHFIS  354    e-118    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 354 bits (910), Expect = e-118
Identities = 145/453 (32%), Positives = 217/453 (47%), Gaps = 46/453 (10%)

Query: 174 VHVARSANEAARRVKPNQPQAGIADL---DGFAPRELPTLEAVLRQQQVGWIALAGDTRI 230
V + +A R + + D+ D A LP ++ V + ++
Sbjct: 30 VRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLPRIKKARPDLPV--LVMSAQNTF 87

Query: 231 NDPDVRRLIRQYCFDYMQGLPPHETIDYLVGHAYGMVALCDLDVTAGAAATGDEMVGACD 290
+ + +DY+ + ++G A + G +VG
Sbjct: 88 MTA--IKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR-RPSKLEDDSQDGMPLVGRSA 144

Query: 291 AMQQLFRTIRKVAATDATVFISGESGTGKELTALAIHERSERRKAPFVAINCGAIPNHLL 350
AMQ+++R + ++ TD T+ I+GESGTGKEL A A+H+ +RR PFVAIN AIP L+
Sbjct: 145 AMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLI 204

Query: 351 QSELFGYERGAFTGASQRKVGRVEAADGGTLFLDEIGDMPLESQASMLRFLQEGKIERLG 410
+SELFG+E+GAFTGA R GR E A+GGTLFLDEIGDMP+++Q +LR LQ+G+ +G
Sbjct: 205 ESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVG 264

Query: 411 GHESIPVDVRIISATHVDLDAAMREGRFRDDLYHRLCVLKLDEPPLRARGKDIEILAHHI 470
G I DVRI++AT+ DL ++ +G FR+DLY+RL V+ L PPLR R +DI L H
Sbjct: 265 GRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHF 324

Query: 471 LHQFRSDGARRIHGFTSCAIEAMYNYHWPGNVRELINRIRRAIVMSDSRQLSAADLDL-- 528
+ Q +G + F A+E M + WPGNVREL N +RR + ++ ++
Sbjct: 325 VQQAEKEG-LDVKRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVITREIIENEL 383

Query: 529 -----------------------------------APFAARQATTLAEARERAERRTIEA 553
A + E I A
Sbjct: 384 RSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILA 443

Query: 554 SLLRHRNHLTEAAAELGVSRATLYRLMVSHGLR 586
+L R + +AA LG++R TL + + G+
Sbjct: 444 ALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
BMA10247_0250  HTHFIS  376    e-129    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 376 bits (968), Expect = e-129
Identities = 133/388 (34%), Positives = 202/388 (52%), Gaps = 40/388 (10%)

Query: 101 FDYVTVPYECDRIVESVGHAYGMVTLSEGLAPAAATVRNEGEMVGTCEAMLALFKMIRKV 160
+DY+ P++ ++ +G A + + ++ +VG AM +++++ ++
Sbjct: 99 YDYLPKPFDLTELIGIIGRA--LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARL 156

Query: 161 ASTDAPVFISGESGTGKELTAVAIHERSSRAGAPFVAINCGAIPPTLLQAELFGYERGAF 220
TD + I+GESGTGKEL A A+H+ R PFVAIN AIP L+++ELFG+E+GAF
Sbjct: 157 MQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAF 216

Query: 221 TGANQRKIGRIEAANGGTLFLDEIGDLPFESQASLLRFLQEHKVERVGGHQSIPVDVRII 280
TGA R GR E A GGTLFLDEIGD+P ++Q LLR LQ+ + VGG I DVRI+
Sbjct: 217 TGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIV 276

Query: 281 SATHVDMQIALRNGRFREDLYHRLCVLKLEEPPLRERGKDIEILARHMLERFKGDAHRRL 340
+AT+ D++ ++ G FREDLY+RL V+ L PPLR+R +DI L RH +++ + + +
Sbjct: 277 AATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEG-LDV 335

Query: 341 RGFTPDAIAALHNYAWPGNVRELINRVRRAIVMSEGRMISAADLELSGYAEVA------- 393
+ F +A+ + + WPGNVREL N VRR + +I+ +E +E+
Sbjct: 336 KRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKA 395

Query: 394 ------------------------------PMSLEEARESAERHAIEVALLRHRGRLADA 423
+ E I AL RG A
Sbjct: 396 AARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKA 455

Query: 424 ARELGVSRVTLYRLLCAYGMRDDDGARA 451
A LG++R TL + + G+ +R+
Sbjct: 456 ADLLGLNRNTLRKKIRELGVSVYRSSRS 483


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
BMA10247_0255  adhesinmafb  32     0.002    Neisseria meningitidis: adhesin MafB signature.
		>adhesinmafb#Neisseria meningitidis: adhesin MafB signature.

Length = 467

Score = 32.0 bits (72), Expect = 0.002
Identities = 17/67 (25%), Positives = 28/67 (41%), Gaps = 3/67 (4%)

Query: 36 SARPAGELTMIAGLSPSAASAHLARLTDGGLLAL---DVRGRHRYYRIATPDIAAAIEAL 92
R A + + ++P A A + G +A + R + P+ A +EA+
Sbjct: 254 GTRYAIDKAAMRNIAPLPAEGKFAVIGGLGSVAGFEKNTREAVDRWIQENPNAAETVEAV 313

Query: 93 ANVAQAA 99
NVA AA
Sbjct: 314 FNVAAAA 320


6     BMA10247_0310  BMA10247_0318  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias     Product
BMA10247_0310  0       14       3.762866    hypothetical protein
BMA10247_0311  -1      13       3.908502    glycosyl hydrolase
BMA10247_0312  2       17       4.358135    hypothetical protein
BMA10247_0313  2       14       4.417968    hypothetical protein
BMA10247_0314  3       16       5.001414    hypothetical protein
BMA10247_0315  2       15       4.801110    DNA translocase FtsK
BMA10247_0316  3       15       3.428731    hypothetical protein
BMA10247_0317  2       12       2.348089    hypothetical protein
BMA10247_0318  1       9        3.022078    phosphoribosylglycinamide formyltransferase 2
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
BMA10247_0315  IGASERPTASE  49     2e-07    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 49.3 bits (117), Expect = 2e-07
Identities = 55/329 (16%), Positives = 96/329 (29%), Gaps = 32/329 (9%)

Query: 280 RAQARPTAPDPRFAPRRPATQAAVSAARNRPMTFTPSRQTTGATPPQPAPPAQTAAPTAE 339
+ R D QA V + + PP PA P++T AE
Sbjct: 984 EVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARV-DEAPVPPPAPATPSETTETVAE 1042

Query: 340 TARKRAPANPARAPLYAWHEKPAERIAPAASVHETLRSIEASAAQWTALAGATSTAATPV 399
+++ + + E A V + +S + Q +A + S
Sbjct: 1043 NSKQESKTVEKN------EQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQ 1096

Query: 400 TARESIAAPAAPSGGAAASAARDGRAPTSAETAAPDGHAPTSAETVAPDGHVPTSAETAA 459
T A A + P +P ETV P AE A
Sbjct: 1097 TTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQS---ETVQP------QAEPAR 1147

Query: 460 PDGHVPTSAETAAPNDHASTSAETVAPDSHAPTSAETAAPDGHASTITEATAPNGHVSAT 519
+ E + +T+A+T P ++ E + T T + +
Sbjct: 1148 ENDPTVNIKEPQSQT---NTTADTEQPAKETSSNVEQPV----TESTTVNTGNSVVENPE 1200

Query: 520 VETSAVAAPAGITQAAPPIAADICPAGEHVIAAVEPACTSDSAAIGAGAIAHAEAGAAAS 579
T A P ++++ + V VEPA TS + +A + + +
Sbjct: 1201 NTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSN---DRSTVALCDLTSTNT 1257

Query: 580 TAET------ASPIGADTHIAPSREADRT 602
A A + + A S+ +
Sbjct: 1258 NAVLSDARAKAQFVALNVGKAVSQHISQL 1286



Score = 41.6 bits (97), Expect = 3e-05
Identities = 53/315 (16%), Positives = 96/315 (30%), Gaps = 51/315 (16%)

Query: 578 ASTAETASPIGADTHIAPSREADRTA-QTAPTAPSPAEATPHVDAPHALDVAARALVGNT 636
+ T + I AD PS + AP P PA ATP +
Sbjct: 994 TTNITTPNNIQADVPSVPSNNEEIARVDEAPVPP-PAPATP----------SETTET-VA 1041

Query: 637 AATAHGAAAVNGSAQRADTASPAASTSGPPAPVAASAASSDRAAPQPVATAAPASIATSG 696
+ + V + Q A + VA A S+ +A Q +
Sbjct: 1042 ENSKQESKTVEKNEQDATETTAQNRE------VAKEAKSNVKANTQ-------TNEVAQS 1088

Query: 697 ALGTMKASGTAGPQPSTIAAQRASAIDDTGQPPSTGHSTHAAVSNELGRRPHAAPDAVTP 756
T + T + +T+ + + ++ ++ + E + V P
Sbjct: 1089 GSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQE-------QSETVQP 1141

Query: 757 VLPPAAAAVPTNASAVQRQALASESAEAAQGVARAAAAGDSRETTQVSPAGARPDGAAPS 816
PA PT + + Q+ + +A+ Q ++ + T + +
Sbjct: 1142 QAEPARENDPT-VNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTV-------NTGN 1193

Query: 817 AAVANPIAPLPDASAITAHEDAPT--------SAAPDAATPVIAAMDSAMPNAVAPASAI 868
+ V NP P + T + ++ S A S + VA
Sbjct: 1194 SVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLT 1253

Query: 869 A--SNAGMSPASASA 881
+ +NA +S A A A
Sbjct: 1254 STNTNAVLSDARAKA 1268


7     BMA10247_0410  BMA10247_0425  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias     Product
BMA10247_0410  0       12       -3.541116   triosephosphate isomerase
BMA10247_0411  -1      14       -5.502683   preprotein translocase subunit SecG
BMA10247_0413  0       11       -4.954559*  NADH dehydrogenase subunit A
BMA10247_0414  -1      12       -2.688945   NADH dehydrogenase subunit B
BMA10247_0415  -2      14       -3.003804   NADH dehydrogenase subunit C
BMA10247_0416  -1      14       -3.112103   NADH dehydrogenase subunit D
BMA10247_0417  0       14       -2.567788   NADH dehydrogenase subunit E
BMA10247_0418  0       14       -2.626161   NADH dehydrogenase I subunit F
BMA10247_0419  0       15       -3.369088   NADH dehydrogenase subunit G
BMA10247_0420  1       17       -5.291608   NADH dehydrogenase subunit H
BMA10247_0421  1       16       -4.695486   NADH dehydrogenase subunit I
BMA10247_0422  0       17       -4.591809   NADH dehydrogenase subunit J
BMA10247_0423  0       16       -4.816606   NADH dehydrogenase subunit K
BMA10247_0424  0       16       -4.187890   NADH dehydrogenase subunit L
BMA10247_0425  -3      14       -3.384958   NADH dehydrogenase subunit M
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits        Score  E-value  Comments
BMA10247_0411  SECGEXPORT  83     8e-24    Protein-export SecG membrane protein signature.
		>SECGEXPORT#Protein-export SecG membrane protein signature.

Length = 110

Score = 82.7 bits (204), Expect = 8e-24
Identities = 46/102 (45%), Positives = 68/102 (66%), Gaps = 1/102 (0%)

Query: 8 IIVVQLLSALGVIGLVLLQHGKGADMGAAFGSGASGSLFGATGSANFLSRTTAVLATIFF 67
++VV L+ A+G++GL++LQ GKGADMGA+FG+GAS +LFG++GS NF++R TA+LAT+FF
Sbjct: 5 LLVVFLIVAIGLVGLIMLQQGKGADMGASFGAGASATLFGSSGSGNFMTRMTALLATLFF 64

Query: 68 VATLALTYLGSYKSAPSVGVLGAAPAPAASAPAASQTPAASA 109
+ +L L + S K+ APA + PA
Sbjct: 65 IISLVLGNINSNKTNKGSEWEN-LSAPAKTEQTQPAAPAKPT 105


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_0420  OUTRMMBRANEA  30     0.014    Outer membrane protein A signature.
		>OUTRMMBRANEA#Outer membrane protein A signature.

Length = 346

Score = 29.9 bits (67), Expect = 0.014
Identities = 16/96 (16%), Positives = 28/96 (29%), Gaps = 10/96 (10%)

Query: 136 YAVILAGWASNSKYAFLGAMR-------AAAQMVSYEISMGFALVLVLMTAGSLNLSEIV 188
Y GW+ F+ A Y+++ + G +
Sbjct: 29 YTGAKLGWSQYHDTGFINNNGPTHENQLGAGAFGGYQVNPYVGFEMGYDWLGRMPY---K 85

Query: 189 GSQQHGFFAGHGVNFLSWNWLPLLPVFVIYFISGIA 224
GS ++G + GV + P+ IY G
Sbjct: 86 GSVENGAYKAQGVQLTAKLGYPITDDLDIYTRLGGM 121


8     BMA10247_0467  BMA10247_0544  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias     Product
BMA10247_0467  0       12       3.104253    amino acid transporter
BMA10247_0468  0       9        4.246179    hypothetical protein
BMA10247_0469  0       9        4.248995    hypothetical protein
BMA10247_0470  2       10       5.112334    hypothetical protein
BMA10247_0471  0       7        4.981461    exodeoxyribonuclease V subunit gamma
BMA10247_0472  1       9        4.459504    exodeoxyribonuclease V subunit beta
BMA10247_0473  4       13       4.079440    exodeoxyribonuclease V subunit alpha
BMA10247_0474  2       22       -0.431743   peptidyl-tRNA hydrolase domain-containing
BMA10247_0476  4       25       -2.086632   hypothetical protein
BMA10247_0477  0       12       -2.819637   hypothetical protein
BMA10247_0478  1       13       -4.465725   lipoprotein
BMA10247_0479  1       13       -2.995359   lipoprotein
BMA10247_0480  1       12       -3.098716   IS407A, transposase OrfA
BMA10247_0481  0       13       -2.612683   IS407A, transposase OrfB
BMA10247_0482  0       13       -1.616372   thiamine biosynthesis protein ThiC
BMA10247_0483  2       13       0.217938    hypothetical protein
BMA10247_0485  0       8        2.076212    hypothetical protein
BMA10247_0486  2       11       1.948572    molybdopterin-binding oxidoreductase
BMA10247_0487  2       13       3.294246    hypothetical protein
BMA10247_0488  3       13       3.476859    hypothetical protein
BMA10247_0489  5       13       3.295431    hypothetical protein
BMA10247_0490  5       13       3.464032    major facilitator family transporter
BMA10247_0491  3       16       3.219119    hypothetical protein
BMA10247_0492  3       16       2.666317    hypothetical protein
BMA10247_0493  3       13       2.388763    hypothetical protein
BMA10247_0494  1       11       1.602012    Ser/Thr protein phosphatase
BMA10247_0496  0       11       1.770700    ABC transporter ATP-binding protein
BMA10247_0497  -2      11       1.016655    ABC transporter permease
BMA10247_0499  -1      11       0.128375    hypothetical protein
BMA10247_0500  -1      12       0.587905    ABC transporter permease
BMA10247_0501  0       10       -0.501847   ABC transporter periplasmic substrate-binding
BMA10247_0502  3       12       -0.190045   LacI family transcriptional regulator
BMA10247_0503  0       15       -1.728095   hypothetical protein
BMA10247_0505  -1      13       -1.756711   hypothetical protein
BMA10247_0506  -1      11       -0.364917   hypothetical protein
BMA10247_0508  -1      12       -0.002593   diguanylate cyclase
BMA10247_0509  -1      13       1.447907    hypothetical protein
BMA10247_0510  0       9        3.313852    hypothetical protein
BMA10247_0511  0       8        2.813257    hypothetical protein
BMA10247_0512  2       9        2.974797    hypothetical protein
BMA10247_0513  4       8        3.125591    hypothetical protein
BMA10247_0514  4       7        3.492265    transcriptional regulator
BMA10247_0515  4       7        3.484877    GntR family transcriptional regulator
BMA10247_0516  3       8        2.319737    AsnC family transcriptional regulator
BMA10247_0517  0       8        3.391580    glucosamine--fructose-6-phosphate
BMA10247_0518  2       13       4.338521    carotenoid 9,10-9',10' cleavage dioxygenase
BMA10247_0519  2       10       3.668363    hypothetical protein
BMA10247_0520  1       10       3.003969    LysR family transcriptional regulator
BMA10247_0521  0       10       2.861923    short chain dehydrogenase
BMA10247_0522  -1      12       3.071418    hypothetical protein
BMA10247_0523  -1      10       1.521597    hypothetical protein
BMA10247_0525  1       9        1.038362    nitrite/sulfite reductase
BMA10247_0526  2       13       0.767015    hypothetical protein
BMA10247_0527  -1      8        0.891236    GntR family transcriptional regulator
BMA10247_0529  -2      10       1.274915    HSP20 family protein
BMA10247_0530  -1      11       1.527685    HSP20 family protein
BMA10247_0531  -1      13       2.230610    hypothetical protein
BMA10247_0532  -1      12       2.063063    hypothetical protein
BMA10247_0533  0       12       1.886671    phosphoenolpyruvate carboxykinase
BMA10247_0535  0       12       2.579222    LysR family transcriptional regulator
BMA10247_0536  2       12       3.316469    malonate transporter MadL subunit
BMA10247_0537  3       13       3.794176    malonate transporter MadM subunit
BMA10247_0538  2       14       3.984984    hypothetical protein
BMA10247_0539  1       11       4.940514    malonate decarboxylase subunit alpha
BMA10247_0540  3       12       6.573722    malonate decarboxylase subunit delta
BMA10247_0541  3       12       6.674326    malonate decarboxylase subunit beta
BMA10247_0542  1       12       6.028653    malonate decarboxylase subunit gamma
BMA10247_0543  2       13       4.017967    phosphoribosyl-dephospho-CoA transferase
BMA10247_0544  2       12       3.008158    triphosphoribosyl-dephospho-CoA synthase MdcB
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_0482  CHLAMIDIAOM6  32     0.007    Chlamydia cysteine-rich outer membrane protein 6 si...
		>CHLAMIDIAOM6#Chlamydia cysteine-rich outer membrane protein 6 signature.

Length = 547

Score = 32.4 bits (73), Expect = 0.007
Identities = 15/61 (24%), Positives = 24/61 (39%), Gaps = 3/61 (4%)

Query: 562 FNLG-LDPDKAREFHDETLPKDSAKVAHFC--SMCGPHFCSMKITQDVREFAAQQGVSEN 618
F LG + P + R E P + + S CG H + +T + E Q ++
Sbjct: 265 FTLGDMQPGEHRTITVEFCPLKRGRATNIATVSYCGGHKNTASVTTVINEPCVQVSIAGA 324

Query: 619 D 619
D
Sbjct: 325 D 325


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
BMA10247_0490  TCRTETA  45     3e-07    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 44.8 bits (106), Expect = 3e-07
Identities = 60/266 (22%), Positives = 95/266 (35%), Gaps = 11/266 (4%)

Query: 66 YATGMLVLAPLG----DRFDRRTLILLQIAGLSAALVVAAAAPTLGVLAAASLAIGILAT 121
YA AP+ DRF RR ++L+ +AG + + A AP L VL + GI
Sbjct: 52 YALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGA 111

Query: 122 IAQQAVPFAAEIAPPAARGQAVGTVMSGLLLGILLARTAAGFVAEYFGWRAVFAASVAAL 181
A + A+I R + G + + G++ G + + FAA AAL
Sbjct: 112 TGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAA--AAL 169

Query: 182 AALAAVIVA-RLPRSSPTSTLPYGKLLASMWQLVRELRGLR--EASMTGGAIFAAFSAFW 238
L + LP S P + + R RG+ A M I
Sbjct: 170 NGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVP 229

Query: 239 PVLTLLLAGAPFHLGPQAAGL-FGIVGAAGALAAPY-AGRFADKRGPRAIISLAIALIAA 296
L ++ FH G+ G +LA G A + G R + L +
Sbjct: 230 AALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGT 289

Query: 297 SFAIFALSGASLIGLVIGVIVLDVGV 322
+ + A + + I V++ G+
Sbjct: 290 GYILLAFATRGWMAFPIMVLLASGGI 315


ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
BMA10247_0496 PF05272 29 0.030 Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.3 bits (65), Expect = 0.030
Identities = 14/37 (37%), Positives = 20/37 (54%), Gaps = 3/37 (8%)

Query: 21 RVLEPLDLAIGAGETLVLLGPSGCGKTTTLRLIAGLD 57
RV+EP ++VL G G GK+T + + GLD
Sbjct: 587 RVMEP---GCKFDYSVVLEGTGGIGKSTLINTLVGLD 620


ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
BMA10247_0521 DHBDHDRGNASE 67 3e-15 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 67.0 bits (163), Expect = 3e-15
Identities = 74/266 (27%), Positives = 119/266 (44%), Gaps = 19/266 (7%)

Query: 1 MADHSIKGKTVIIAGGAKNLGGLIARDLAAQGAQAVAIHYNSAASKGAAAETVAAIEAAG 60
M I+GK I G A+ +G +AR LA+QGA A+ YN + + V++++A
Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLE----KVVSSLKAEA 56

Query: 61 ARAVALQADLTAAGAVEKLFVDTVAAIGRPDIAINTVGKVLKKPFVEITEAEYDEMAAVN 120
A A AD+ + A++++ +G DI +N G + +++ E++ +VN
Sbjct: 57 RHAEAFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVN 116

Query: 121 SKTAFFFLKEAGRHVND--NGKIVTLVTSLLGAFTPFYAAYAGMKAPVEHFTRAAAKEFG 178
S F + +++ D +G IVT+ ++ G AAYA KA FT+ E
Sbjct: 117 STGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELA 176

Query: 179 ARGISVTAVGPGPMDTPFFYPAEGADAVAYHKTAAALSPFSKTGL--------TDIGDVV 230
I V PG +T + + A +L F KTG+ +DI D V
Sbjct: 177 EYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETF-KTGIPLKKLAKPSDIADAV 235

Query: 231 PFIRHLVSD-GWWITGQTILINGGYT 255
F LVS IT + ++GG T
Sbjct: 236 LF---LVSGQAGHITMHNLCVDGGAT 258


ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
BMA10247_0522 IGASERPTASE 44 2e-06 IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 43.9 bits (103), Expect = 2e-06
Identities = 44/288 (15%), Positives = 69/288 (23%), Gaps = 14/288 (4%)

Query: 313 PTRRDKAAVKAAEKERVAPLPEPAE-------TAEGAPMKLKTPAAPTPPAA-PVPASSA 364
P+ A E P P PA AE + + KT A +
Sbjct: 1008 PSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNRE 1067

Query: 365 APGTSASSAVAAPAAAGSGPAASAPAAPVRHAAPAPASATAA--ASAPTAASAPAPTPAS 422
+ S+ A + S A+ A T + P S
Sbjct: 1068 VAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTS 1127

Query: 423 APAPASTPAPASAPT--PTPASAPTPASIPAPAPASAPASTPAPASAPAPAPTTSPASSI 480
+P + P P + PT + + A T PA + S
Sbjct: 1128 QVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTEST 1187

Query: 481 APTAAPFASAIPPARAEKFA-PAVTATTAGSTSTPASAAAPSSPSSPWLPPLLPPLLSPD 539
P P V + ++ + S P + S
Sbjct: 1188 TVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTV 1247

Query: 540 APSPPADTARTAPLAPAASPATAAAAATNATATAGAMQSAPRDDAATN 587
A T A L+ A + A A + Q ++
Sbjct: 1248 ALCDLTSTNTNAVLSDARAKAQFVALNVGKAVSQHISQLE-MNNEGQY 1294



Score = 41.6 bits (97), Expect = 9e-06
Identities = 30/213 (14%), Positives = 54/213 (25%), Gaps = 5/213 (2%)

Query: 312 DPTRRDKAAVKAAEKERVAPLPEPAETAEGAPMKLKTPAAPTPPAAPVPASSAAPGTSAS 371
+ T +++ K A K V + E A+ +T T A V A +
Sbjct: 1060 ETTAQNREVAKEA-KSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEK 1118

Query: 372 SAVAAPAAAGSGPAASAPAAPVRHAAPAPASATAAASAPTAASAPAPTPASAPAPASTPA 431
+ + P A PA + + PA ++
Sbjct: 1119 TQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSN 1178

Query: 432 PASAPTPTPASAPTPASIPAPAPASAPASTPAPASAPAPAPTTSPASSIAPTAA---PFA 488
T + + + P + + P S + P S+ P
Sbjct: 1179 VEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPAT 1238

Query: 489 SAIPPARAEKFAPAVTATTAGSTSTPASAAAPS 521
++ + T S A A A
Sbjct: 1239 TSSNDRSTVALCDLTSTNTNAVLSD-ARAKAQF 1270



Score = 41.6 bits (97), Expect = 1e-05
Identities = 37/246 (15%), Positives = 73/246 (29%), Gaps = 10/246 (4%)

Query: 334 EPAETAEGAPMKLKTPAAPTPPAAPVPASSAAPGTSASSAVAAPAAAGSGPAASAPAAPV 393
P+ + + A PPA P+ + S + A A
Sbjct: 1007 VPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNR 1066

Query: 394 RHAAPAPASATAAASAPTAASAPAPTPASAPAPASTPAP------ASAPTPTPASAPTPA 447
A A ++ A A + + T + A A T P
Sbjct: 1067 EVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVT 1126

Query: 448 SIPAPAPASAPASTPAPASAPAPAPTTSPASSIAPTAAPFASAIPPARAEKFAPAVTATT 507
S +P + P A PT + + T + P A++ + V
Sbjct: 1127 SQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQP---AKETSSNVEQPV 1183

Query: 508 AGSTSTPASAAAPSSPSSPWLPPLLPPLLSPDAPSPPADTARTAPLAPAASPATAAAAAT 567
ST+ + +P + P P ++ ++ + P + R + + + A ++
Sbjct: 1184 TESTTVNTGNSVVENPENT-TPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSN 1242

Query: 568 NATATA 573
+ + A
Sbjct: 1243 DRSTVA 1248


ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
BMA10247_0532 RTXTOXINA 25 0.044 Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 24.5 bits (53), Expect = 0.044
Identities = 16/51 (31%), Positives = 22/51 (43%), Gaps = 2/51 (3%)

Query: 8 AGFHRGARAFGRAPGASVASVASVASGASGASAAS--GASGAAGAAGAAGA 56
A FH+ A + +ASV+SG S A+ S GA +A G
Sbjct: 355 AAFHKETGAIDASLTTISTVLASVSSGISAAATTSLVGAPVSALVGAVTGI 405


ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
BMA10247_0533 FLGMOTORFLIG 34 0.002 Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 33.6 bits (77), Expect = 0.002
Identities = 13/49 (26%), Positives = 24/49 (48%), Gaps = 3/49 (6%)

Query: 566 SPAQYAQVTSMNPDEWRAELALHAELFDKLSARLPDALAETKARIEKRL 614
P + + + S P E + +A L D+ S P+ + E + +EK+L
Sbjct: 148 DPQKASFILSSLPTEVQTNVARRIALMDRTS---PEVVREVERVLEKKL 193


ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
BMA10247_0539 ADHESNFAMILY 30 0.019 Adhesin family signature.
		>ADHESNFAMILY#Adhesin family signature.

Length = 309

Score = 30.2 bits (68), Expect = 0.019
Identities = 28/128 (21%), Positives = 42/128 (32%), Gaps = 19/128 (14%)

Query: 388 LKAGEEADARTPAA---LRRGRKLVVQIGE----------TFGEKNAPMFVEQLDALRLA 434
L+ E P A L G I + F EKN + ++LD L
Sbjct: 127 LEGQNEKGKEDPHAWLNLENGIIFAKNIAKQLSAKDPNNKEFYEKNLKEYTDKLDKLDKE 186

Query: 435 DKLALDLAPVMVYGDDVTHVVTEEGIANLLMCRDADEREHAIRGVAGYTEIGRGRDRRLV 494
K + P + +VT EG + I + E + + LV
Sbjct: 187 SKDKFNKIP-----AEKKLIVTSEGAFKYF-SKAYGVPSAYIWEINTEEEGTPEQIKTLV 240

Query: 495 ERLRERGV 502
E+LR+ V
Sbjct: 241 EKLRQTKV 248


9 BMA10247_0600 BMA10247_0684 Y Y Y Pathogenicity Island (biased-composition)
LocusTag DNBias CDNBias %GCBias Product
BMA10247_0600 5 30 -4.320066 hypothetical protein
BMA10247_0601 5 30 -4.023886 hypothetical protein
BMA10247_0602 2 25 -3.400524 peptidoglycan-binding LysM/M23B peptidase
BMA10247_0604 3 26 -4.292525 IS407A, transposase OrfA
BMA10247_0605 1 23 -1.705933 IS407A, transposase OrfB
BMA10247_0606 2 23 -0.741912 hypothetical protein
BMA10247_0607 2 22 -0.751263 hypothetical protein
BMA10247_0608 2 21 -0.588158 hypothetical protein
BMA10247_0609 4 24 -0.771918 PAAR motif-containing protein
BMA10247_0610 3 16 1.392147 MerR family transcriptional regulator
BMA10247_0611 4 13 3.699412 GntR family transcriptional regulator
BMA10247_0612 3 10 4.962841 hypothetical protein
BMA10247_0614 1 10 5.139142 hypothetical protein
BMA10247_0613 0 9 4.553492 hypothetical protein
BMA10247_0615 0 9 4.817878 malto-oligosyltrehalose synthase
BMA10247_0616 -1 10 4.567431 4-alpha-glucanotransferase
BMA10247_0617 -1 9 3.721874 maltooligosyl trehalose trehalohydrolase
BMA10247_0618 -1 7 2.801519 glycogen debranching protein GlgX
BMA10247_0621 0 8 2.630064 trehalose synthase/ maltokinase
BMA10247_0622 3 11 2.770192 alpha amylase
BMA10247_0623 9 26 -1.645349 hypothetical protein
BMA10247_0625 8 26 -1.537333 hypothetical protein
BMA10247_0624 6 27 -2.673630 poly(3-hydroxybutyrate) depolymerase
BMA10247_0626 4 27 -0.710841 hypothetical protein
BMA10247_0627 2 26 -0.182741 hypothetical protein
BMA10247_0628 -1 20 -0.442397 hypothetical protein
BMA10247_0630 -2 17 -0.341079 hypothetical protein
BMA10247_0631 -2 17 -0.471410 hypothetical protein
BMA10247_0632 -2 15 -0.122773 lipoprotein
BMA10247_0633 -2 18 -1.841322 hypothetical protein
BMA10247_0634 -2 19 -2.862777 LysR family transcriptional regulator
BMA10247_0635 0 20 -3.451878 DNA-binding response regulator
BMA10247_0636 5 28 -3.640718 hypothetical protein
BMA10247_0637 4 15 -0.841469 hypothetical protein
BMA10247_0638 3 14 -0.658454 prophage protein
BMA10247_0640 1 12 0.008056 hypothetical protein
BMA10247_0641 1 12 0.064669 DNA-binding response regulator
BMA10247_0642 1 12 0.535543 hypothetical protein
BMA10247_0643 2 10 0.580117 hemagglutinin family protein
BMA10247_0644 2 12 1.693628 ompA family protein
BMA10247_0645 1 11 1.170534 hypothetical protein
BMA10247_0646 1 15 0.889568 H-NS histone family protein
BMA10247_0647 1 13 1.565414 hypothetical protein
BMA10247_0648 1 13 1.716512 hypothetical protein
BMA10247_0649 0 13 2.842096 hypothetical protein
BMA10247_0650 1 13 2.293741 galactose oxidase
BMA10247_0651 4 18 2.950696 hypothetical protein
BMA10247_0652 0 8 3.660526 hypothetical protein
BMA10247_0654 0 9 3.655497 hypothetical protein
BMA10247_0656 -1 11 2.171776 hypothetical protein
BMA10247_0657 0 13 0.568459 PAAR motif-containing protein
BMA10247_0658 0 12 0.030754 hypothetical protein
BMA10247_0659 0 11 0.379690 Rhs element Vgr protein
BMA10247_0660 0 20 -2.844788 hypothetical protein
BMA10247_0661 3 25 -4.282184 IS407A, transposase OrfB
BMA10247_0662 2 25 -3.054467 IS407A, transposase OrfA
BMA10247_0663 1 24 -2.228067 hypothetical protein
BMA10247_0664 1 19 -0.870523 hypothetical protein
BMA10247_0665 1 13 -0.545551 manganese transport protein MntH
BMA10247_0666 0 15 -0.595500 hypothetical protein
BMA10247_0667 0 13 -0.594554 lipoprotein
BMA10247_0668 1 14 -0.908981 H-NS histone family protein
BMA10247_0669 2 19 -2.008999 amidohydrolase
BMA10247_0670 5 24 -3.642258 major facilitator family transporter
BMA10247_0672 8 35 -4.822959 hypothetical protein
BMA10247_0673 4 26 -3.261668 LysR family transcriptional regulator
BMA10247_0674 8 25 -2.407186 hypothetical protein
BMA10247_0676 5 16 -0.421164 hypothetical protein
BMA10247_0675 2 10 1.167160 hypothetical protein
BMA10247_0677 2 12 0.293655 hypothetical protein
BMA10247_0678 -1 13 1.535115 hypothetical protein
BMA10247_0679 0 14 1.492810 hypothetical protein
BMA10247_0680 0 15 1.336145 fimbrial assembly chaperone
BMA10247_0683 0 12 0.193688 hypothetical protein
BMA10247_0682 1 13 0.266927 hypothetical protein
BMA10247_0684 3 11 1.743748 sensor histidine kinase/response regulator
ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
BMA10247_0602 RTXTOXIND 31 0.004 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 31.3 bits (71), Expect = 0.004
Identities = 11/64 (17%), Positives = 28/64 (43%), Gaps = 12/64 (18%)

Query: 209 VIAAAAGTVVYAGNGLRGYGNLLIVKHDADFLTTYAHNRALLVKEGQTVAQGQKIAEMGD 268
++A A G + ++G +K + + ++VKEG++V +G + ++
Sbjct: 82 IVATANGKLTHSGR-------SKEIKP---IENSIV--KEIIVKEGESVRKGDVLLKLTA 129

Query: 269 TDND 272
+
Sbjct: 130 LGAE 133


ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
BMA10247_0624 PF07675 31 0.011 Cleaved Adhesin
		>PF07675#Cleaved Adhesin

Length = 1358

Score = 31.2 bits (70), Expect = 0.011
Identities = 18/46 (39%), Positives = 25/46 (54%), Gaps = 3/46 (6%)

Query: 376 SYNVYRNGNKVGSS-TSTAYTDAGLIAGTAYSYTVTEIDPSLGESA 420
+Y +YRN ++ S T T Y D L G Y+Y V ++ GESA
Sbjct: 1260 TYTIYRNNTQIASGVTETTYRDPDLATGF-YTYGV-KVVYPNGESA 1303


ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
BMA10247_0635 HTHFIS 79 9e-19 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 78.7 bits (194), Expect = 9e-19
Identities = 34/117 (29%), Positives = 55/117 (47%), Gaps = 1/117 (0%)

Query: 1 MSAARKVLLVEDDEAQANWAKLVLTRGRFDVTHCQTGGQAIRAMTKEVPDAVVLDMRLPD 60
M+ A +L+ +DD A L+R +DV R + D VV D+ +PD
Sbjct: 1 MTGA-TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPD 59

Query: 61 VHGLEVLVWIRRNFFDVPVIVLSNAMQEMQIVEAFSAGADDYVLKPAREAEFLARIA 117
+ ++L I++ D+PV+V+S M ++A GA DY+ KP E + I
Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
BMA10247_0641 HTHFIS 80 2e-19 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.9 bits (197), Expect = 2e-19
Identities = 35/135 (25%), Positives = 58/135 (42%), Gaps = 1/135 (0%)

Query: 4 IYLIEDDEVQARCYAAILQHAGYSVRVLPDGERALREIQRAAPDLIVLDRRLPDIDGLEI 63
I + +DD L AGY VR+ + R I DL+V D +PD + ++
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 64 IAWVRERCAPLPILVLTNAVLETDLVEALEAGADDYLIKPPREREFVARV-NALRRRASI 122
+ +++ LP+LV++ ++A E GA DYL KP E + + AL
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRR 125

Query: 123 SKQFEGTIEIGGYRI 137
+ E + G +
Sbjct: 126 PSKLEDDSQDGMPLV 140


ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
BMA10247_0643 PF03895 41 2e-06 Serum resistance protein DsrA.
		>PF03895#Serum resistance protein DsrA.

Length = 79

Score = 40.6 bits (95), Expect = 2e-06
Identities = 21/77 (27%), Positives = 40/77 (51%)

Query: 1046 VARAAYGGIAAATALTMIPEVDKDKTIAVGIGGGTYRGYQAVALGATARITENIKVRAGV 1105
+++ G+A +AL+M+ + + +V G YR A+A+G +RIT+ +AGV
Sbjct: 1 LSKELQTGLANQSALSMLVQPNGVGKTSVSAAVGGYRDKTALAIGVGSRITDRFTAKAGV 60

Query: 1106 GMSSGGTTAGIGASMQW 1122
++ GAS+ +
Sbjct: 61 AFNTYNGGMSYGASVGY 77


ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
BMA10247_0644 OUTRMMBRANEA 126 8e-37 Outer membrane protein A signature.
		>OUTRMMBRANEA#Outer membrane protein A signature.

Length = 346

Score = 126 bits (317), Expect = 8e-37
Identities = 67/151 (44%), Positives = 95/151 (62%), Gaps = 10/151 (6%)

Query: 87 FQCGEPAQPVAQQPQPAPAAAPAAEPIRLNADAMFAFDRADAASMTEQGRQQLSQLAQRL 146
F GE A VA P PAPA + L +D +F F++A + +G+ L QL +L
Sbjct: 191 FGQGEAAPVVA--PAPAPAPEVQTKHFTLKSDVLFNFNKAT---LKPEGQAALDQLYSQL 245

Query: 147 TDRHAQTVSIV--GYTDRLGSDAYNRQLSQARAKTVGDYLIAAGVPADSVHAEGRGASDP 204
++ + S+V GYTDR+GSDAYN+ LS+ RA++V DYLI+ G+PAD + A G G S+P
Sbjct: 246 SNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQSVVDYLISKGIPADKISARGMGESNP 305

Query: 205 LV--QCDQ-RERAALIACLSPNRRVEVVAAG 232
+ CD ++RAALI CL+P+RRVE+ G
Sbjct: 306 VTGNTCDNVKQRAALIDCLAPDRRVEIEVKG 336


ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
BMA10247_0670 TCRTETB 39 3e-05 Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 38.7 bits (90), Expect = 3e-05
Identities = 30/169 (17%), Positives = 70/169 (41%), Gaps = 5/169 (2%)

Query: 1 MFSLVIPALLTAWGIGKGQAGLIGGATLAAGAIGGLLAGMIADRFGRVRALQITVCWFSL 60
+ ++ +P + + + A + +IG + G ++D+ G R L +
Sbjct: 32 VLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCF 91

Query: 61 FTFLSAFAQNFEQLLVL-KTLQGLGFGGEWTAGAVLLSETIRARHRGKAMGIVQSAWGFG 119
+ + +F LL++ + +QG G V+++ I +RGKA G++ S G
Sbjct: 92 GSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMG 151

Query: 120 WGGAVLLYTLVFSWLPPEWAWRVLFAIGVLPALLVLYIRRAIPEPPRDD 168
G + ++ ++ W L I ++ + V ++ + + + R
Sbjct: 152 EGVGPAIGGMIAHYI----HWSYLLLIPMITIITVPFLMKLLKKEVRIK 196


ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
BMA10247_0684 HTHFIS 63 2e-12 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 62.9 bits (153), Expect = 2e-12
Identities = 30/122 (24%), Positives = 50/122 (40%), Gaps = 10/122 (8%)

Query: 401 RVLVVDDQEMNRIVLRYQLDALGHHARLCASGDEALRALGTAAYDVVLTDCRMPGMDGIA 460
+LV DD R VL L G+ R+ ++ R + D+V+TD MP +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 461 LTAAIRAH-PDARVRATPIVGVTALVSDAEHARCVDAGMTLCIGKP----TTLDALERAL 515
L I+ PD P++ ++A + + + G + KP + + RAL
Sbjct: 65 LLPRIKKARPD-----LPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119

Query: 516 VE 517
E
Sbjct: 120 AE 121


10 BMA10247_0708 BMA10247_0722 Y Y Y Pathogenicity Island (biased-composition)
LocusTag DNBias CDNBias %GCBias Product
BMA10247_0708 -1 21 -3.377298 IS407A, transposase OrfA
BMA10247_0709 0 23 -2.248503 IS407A, transposase OrfB
BMA10247_0711 2 14 1.049230 hypothetical protein
BMA10247_0713 1 13 1.091786 hypothetical protein
BMA10247_0714 2 12 3.035262 MutT/NUDIX NTP pyrophosphatase
BMA10247_0715 1 13 4.309619 hypothetical protein
BMA10247_0716 1 14 4.787580 hypothetical protein
BMA10247_0717 2 13 4.394432 alcohol dehydrogenase
BMA10247_0719 1 13 4.160946 thioesterase
BMA10247_0720 2 12 4.516787 ABC transporter permease
BMA10247_0721 1 12 4.307413 ABC transporter permease/ATP-binding protein
BMA10247_0722 2 13 3.070407 ABC transporter ATP-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
BMA10247_0713 FLGHOOKAP1 28 0.042 Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 27.6 bits (61), Expect = 0.042
Identities = 11/34 (32%), Positives = 16/34 (47%), Gaps = 2/34 (5%)

Query: 190 VDIREEALHELIDRLDDLASEFHSAF--LHEAGK 221
+ R + L + + L LA F AF H+AG
Sbjct: 283 LTFRSQDLDQTRNTLGQLALAFAEAFNTQHKAGF 316


11 BMA10247_0757 BMA10247_0785 Y Y Y Pathogenicity Island (biased-composition)
LocusTag DNBias CDNBias %GCBias Product
BMA10247_0757 2 16 -2.880113 hypothetical protein
BMA10247_0758 2 18 -3.247389 hypothetical protein
BMA10247_0759 0 14 -3.086235 hypothetical protein
BMA10247_0762 0 12 -2.789154 chorismate synthase
BMA10247_0763 3 13 -3.335183 isrso11-transposase orfb protein
BMA10247_0764 0 9 -2.588099 transposase
BMA10247_0765 -2 10 -1.468632 transposase subfamily protein
BMA10247_0766 -2 9 -1.034762 His/Glu/Gln/Arg/opine amino acid ABC transporter
BMA10247_0767 -2 10 -0.824664 amino acid ABC transporter substrate-binding
BMA10247_0768 -2 9 -0.411465 dihydroorotate dehydrogenase 2
BMA10247_0769 0 12 0.095917 arginyl-tRNA-protein transferase
BMA10247_0770 2 13 0.812663 leucyl/phenylalanyl-tRNA--protein transferase
BMA10247_0771 2 12 1.178816 nudix hydrolase
BMA10247_0773 1 13 1.375464 *hypothetical protein
BMA10247_0774 2 11 2.399424 SMR family multidrug efflux pump
BMA10247_0775 2 11 2.333335 3-hydroxyisobutyryl-CoA hydrolase
BMA10247_0776 1 10 1.434382 hypothetical protein
BMA10247_0779 0 9 1.105640 alkanesulfonate monooxygenase
BMA10247_0780 -3 14 -0.301839 aliphatic sulfonate ABC transporter permease
BMA10247_0781 1 18 -1.555162 aliphatic sulfonate ABC transporter ATP-binding
BMA10247_0782 7 29 -7.092685 molybdenum-pterin-binding protein
BMA10247_0783 4 24 -5.458388 hypothetical protein
BMA10247_0784 5 24 -3.761051 hypothetical protein
BMA10247_0785 1 16 -3.576939 isrso15-transposase
ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
BMA10247_0758 RTXTOXINA 27 0.003 Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 27.2 bits (60), Expect = 0.003
Identities = 14/45 (31%), Positives = 21/45 (46%), Gaps = 3/45 (6%)

Query: 7 DSLLERLRRPRGASRVSLCG-GAPLAATASAAAVAASAAARAVAA 50
DSLL + GA SL LA+ + + ++A+A V A
Sbjct: 351 DSLLAAFHKETGAIDASLTTISTVLASVS--SGISAAATTSLVGA 393


ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
BMA10247_0785 PERTACTIN 28 0.012 Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 28.1 bits (62), Expect = 0.012
Identities = 16/41 (39%), Positives = 22/41 (53%)

Query: 38 KRLWFYRKPACAEKAWGQWFEQAQQSGIAALQKFAQRLQGY 78
KRL R A AWG+ F Q QQ A ++F Q++ G+
Sbjct: 647 KRLGELRLNPDAGGAWGRGFAQRQQLDNRAGRRFDQKVAGF 687


12 BMA10247_0809 BMA10247_0839 Y Y Y Pathogenicity Island (biased-composition)
LocusTag DNBias CDNBias %GCBias Product
BMA10247_0809 0 10 -4.419271 sulfate/thiosulfate ABC transporter ATP-binding
BMA10247_0810 1 10 -6.195090 transcriptional regulator CysB-like protein
BMA10247_0811 1 8 -6.422093 hypothetical protein
BMA10247_0812 1 8 -5.436109 IS407A, transposase OrfA
BMA10247_0813 0 8 -4.376507 IS407A, transposase OrfB
BMA10247_0814 0 10 -4.016734 protein kinase
BMA10247_0815 -1 9 -2.173053 hypothetical protein
BMA10247_0816 0 11 -0.669528 SpoVR family protein
BMA10247_0817 1 11 0.645990 dicarboxylic acid transporter PcaT
BMA10247_0818 2 10 2.396530 ABC transporter substrate binding protein
BMA10247_0819 3 9 4.491206 hypothetical protein
BMA10247_0820 -2 12 4.421179 ABC transporter ATP-binding protein
BMA10247_0821 -2 12 5.115586 permease
BMA10247_0822 -1 13 5.571858 hypothetical protein
BMA10247_0823 3 13 5.241408 fimbriae assembly-related protein
BMA10247_0824 2 12 4.962274 Flp pilus assembly protein CpaB
BMA10247_0825 3 14 4.207730 RhcC2
BMA10247_0826 5 13 4.334738 lipoprotein
BMA10247_0827 3 12 4.154237 CpaE protein
BMA10247_0828 2 12 3.796993 CpaF
BMA10247_0829 0 9 2.855671 type II secretion system protein
BMA10247_0830 1 11 2.296996 type II secretion system protein F
BMA10247_0832 2 10 1.878810 hypothetical protein
BMA10247_0834 2 11 2.417519 hypothetical protein
BMA10247_0833 0 18 1.541106 hypothetical protein
BMA10247_0835 -1 18 0.830243 ABC transporter substrate-binding protein
BMA10247_0836 -1 16 2.265744 hypothetical protein
BMA10247_0837 -1 12 3.272129 hypothetical protein
BMA10247_0838 -3 10 3.193455 binding-protein-dependent transport system inner
BMA10247_0839 -2 9 3.081468 amino acid ABC transporter permease
ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
BMA10247_0817 TCRTETB 31 0.010 Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 31.0 bits (70), Expect = 0.010
Identities = 24/111 (21%), Positives = 44/111 (39%), Gaps = 11/111 (9%)

Query: 268 LVNTAGMHAKTASNVMTAALFVYMLMQPVFGALSDKIGRR----MSMILFGTGAVIGTVP 323
+ N + + V TA + + + V+G LSD++G + +I+ G+VIG V
Sbjct: 40 IANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVG 99

Query: 324 LMHALGGVTSPLVAFGLIVVALAIVSFYTSISGLIKAEMFPPEVRAMGVGL 374
L+ + +F ++ ++ A P E R GL
Sbjct: 100 HSFF------SLLIMARFIQGAGAAAF-PALVMVVVARYIPKENRGKAFGL 143


ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
BMA10247_0823 PREPILNPTASE 30 0.003 Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 30.2 bits (68), Expect = 0.003
Identities = 31/145 (21%), Positives = 50/145 (34%), Gaps = 12/145 (8%)

Query: 7 LVASWTLASLALADLRTRRLA---TFAVALVGALYAALALVGAPGDGGFASHAALGAAA- 62
L+ +W L +L DL L T + G L+ L + GD + A
Sbjct: 138 LLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGDAVIGAMAGYLVLWS 197

Query: 63 -FALGAAMFRAGWIAGGDVKLAAVVFLWAGPAHAWPVAFAIGVGGLAVGAVCIAAGRAPR 121
+ + + GD KL A + W G V + G +G I
Sbjct: 198 LYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAFMGIGLILLRNH-- 255

Query: 122 VLAWFAPARGVPYGVALAAGGLLAV 146
++ +P+G LA G +A+
Sbjct: 256 -----HQSKPIPFGPYLAIAGWIAL 275


ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
BMA10247_0825 BCTERIALGSPD 144 2e-39 Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 144 bits (364), Expect = 2e-39
Identities = 68/283 (24%), Positives = 116/283 (40%), Gaps = 16/283 (5%)

Query: 170 VVQTLKPYLRQQEALVNRLTLARPIQVHLRVRITEVDRNITQQLGINWSALGA------- 222
+V + E ++ +L + RP QV + I EV LGI W+ A
Sbjct: 322 IVTAAPDVMNDLERVIAQLDIRRP-QVLVEAIIAEVQDADGLNLGIQWANKNAGMTQFTN 380

Query: 223 SGNFVGGLFNGRTLFDTASKAFDLSPSGAFSVVGGFHTSRYSIDG--VLDALDQEGLITM 280
SG + G ++ S A S G Y + +L AL +
Sbjct: 381 SGLPISTAIAGANQYNKDGTVSSSLAS-ALSSFNGIAAGFYQGNWAMLLTALSSSTKNDI 439

Query: 281 LAEPNLTAISGQTASFLAGGEFPIPVAQDTTGA----ITIQFKPYGVSLDFTPTVLADNR 336
LA P++ + A+F G E P+ TT T++ K G+ L P + +
Sbjct: 440 LATPSIVTLDNMEATFNVGQEVPVLTGSQTTSGDNIFNTVERKTVGIKLKVKPQINEGDS 499

Query: 337 ISLKVRPEVSEIDPTNSVTTGSIKVPALTVRRVDTTVELSSGQSFAIGGLLQSKSSDVLA 396
+ L++ EVS + S T+ + R V+ V + SG++ +GGLL SD
Sbjct: 500 VLLEIEQEVSSVADAASSTSSDLGA-TFNTRTVNNAVLVGSGETVVVGGLLDKSVSDTAD 558

Query: 397 ELPGLARLPVLGKLFSSRNYLNDKTEVVVIVTPYIVQPANPGE 439
++P L +PV+G LF S + K +++ + P +++ +
Sbjct: 559 KVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRDEYR 601


ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
BMA10247_0827 HTHFIS 34 0.001 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 33.7 bits (77), Expect = 0.001
Identities = 29/165 (17%), Positives = 52/165 (31%), Gaps = 20/165 (12%)

Query: 22 GARLVAIVADAASDEVIRNLIADQAMTGAQVARGGIDDAIALMRDLSHGPQHLLVDVSGA 81
GA ++ DAA V+ ++ + + ++ DV
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYD--VRITSNAATLWRWIA--AGDGDLVVTDVV-- 56

Query: 82 AMP----LSDLARLADVCDPSVNVIVIGERNDVGLFRSMLRIGVRDYLVKPL----TVEL 133
MP L R+ P + V+V+ +N G DYL KP + +
Sbjct: 57 -MPDENAFDLLPRIKKA-RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGI 114

Query: 134 VHRALSAADPNAAARAGKAIGFVGARGGVGVTSIAVALARHLADR 178
+ RAL+ + + + G S A+ + R
Sbjct: 115 IGRALAEPKRRPSKLEDDSQDGMPLVG----RSAAMQEIYRVLAR 155


ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
BMA10247_0828 PF05272 30 0.032 Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.7 bits (66), Expect = 0.032
Identities = 18/50 (36%), Positives = 26/50 (52%), Gaps = 4/50 (8%)

Query: 299 IVISGGTGSGKTTLLNAL---SHFIDSHERIVTIEDAAELQLQQPHVVSL 345
+V+ G G GK+TL+N L F D+H I T +D+ E Q+ L
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYE-QIAGIVAYEL 647


ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
BMA10247_0834 PYOCINKILLER 32 0.004 Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 31.7 bits (71), Expect = 0.004
Identities = 30/132 (22%), Positives = 49/132 (37%), Gaps = 4/132 (3%)

Query: 40 RVAAARNELQNAADAAALAGAASLEAGAGAPAWAAAASAAAAALSLNASDGAALSSGDVQ 99
A A+ + + A A AA+ A PA + + AA + + GAA + +
Sbjct: 226 AAAEAKRKAEEQARQQAAIRAANTYA---MPANGSVVATAAGRGLIQVAQGAASLAQAIS 282

Query: 100 TGYWNVTGVPAGLEPTTLAPGEYDVPAVQATVTRAPNQNGGPLSLLMGGLLGLVGTPAAA 159
V G P+ +A G + T + +Q + +G +G P +
Sbjct: 283 DAI-AVLGRVLASAPSVMAVGFASLTYSSRTAEQWQDQTPDSVRYALGMDAAKLGLPPSV 341

Query: 160 TAVAVAGAPATV 171
AVA A TV
Sbjct: 342 NLNAVAKASGTV 353


13 BMA10247_0853 BMA10247_0898 Y Y N Pathogenicity Island (biased-composition)
LocusTag DNBias CDNBias %GCBias Product
BMA10247_0853 2 9 1.282262 hypothetical protein
BMA10247_0855 4 12 1.515380 AraC family transcriptional regulator
BMA10247_0856 3 14 2.391753 ribose ABC transporter periplasmic
BMA10247_0857 4 14 3.085255 carbohydrate ABC transporter ATP-binding
BMA10247_0858 3 15 3.584560 carbohydrate ABC transporter permease
BMA10247_0859 4 14 3.675174 hypothetical protein
BMA10247_0860 3 14 3.335195 zinc-binding dehydrogenase oxidoreductase
BMA10247_0861 3 14 4.348771 carbohydrate kinase
BMA10247_0862 3 15 6.361240 short chain dehydrogenase
BMA10247_0864 2 12 6.191166 extracytoplasmic-function sigma-70 factor
BMA10247_0865 3 13 6.557964 mbtH domain-containing protein
BMA10247_0866 3 11 6.170875 syringomycin biosynthesis enzyme
BMA10247_0867 1 11 5.662916 iron ABC transporter ATP-binding protein
BMA10247_0868 2 12 6.696866 iron-hydroxamate transporter permease subunit
BMA10247_0869 2 12 6.584568 ferric iron reductase FhuF
BMA10247_0870 2 12 6.422699 iron ABC transporter substrate-binding protein
BMA10247_0871 2 13 5.703396 hypothetical protein
BMA10247_0872 2 14 5.593563 cyclic peptide ABC transporter ATP-binding
BMA10247_0873 3 12 5.916072 non-ribosomal peptide synthetase
BMA10247_0874 1 11 4.328088 non-ribosomal peptide synthetase
BMA10247_0875 0 10 1.385490 l-ornithine 5-monooxygenase
BMA10247_0876 0 8 1.205358 TonB-dependent siderophore receptor
BMA10247_0877 1 8 2.711107 hypothetical protein
BMA10247_0878 2 8 3.286043 cobyrinic acid a,c-diamide synthase
BMA10247_0879 0 7 3.327591 cob(I)yrinic acid a,c-diamide
BMA10247_0881 0 8 4.074469 high affinity nickel transporter
BMA10247_0882 0 9 5.124638 cobalamin biosynthesis protein CobW
BMA10247_0883 0 11 5.688982 cobaltochelatase subunit CobN
BMA10247_0884 5 16 7.160628 hypothetical protein
BMA10247_0885 2 20 7.149276 magnesium chelatase subunit ChII
BMA10247_0887 2 15 4.218891 hypothetical protein
BMA10247_0886 3 13 5.315026 hypothetical protein
BMA10247_0889 2 11 4.433661 hypothetical protein
BMA10247_0890 1 13 4.476756 hypothetical protein
BMA10247_0891 1 13 5.659464 hypothetical protein
BMA10247_0892 1 13 5.513622 glycosyl hydrolase
BMA10247_0893 2 12 7.211971 precorrin-3B C(17)-methyltransferase
BMA10247_0894 0 13 7.116144 precorrin-2 C(20)-methyltransferase
BMA10247_0895 1 13 7.396298 precorrin-8X methylmutase
BMA10247_0896 2 15 7.249885 precorrin-3B synthase
BMA10247_0897 1 15 4.895416 hypothetical protein
BMA10247_0898 1 14 4.693444 precorrin-6Y C5,15-methyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
BMA10247_0862 DHBDHDRGNASE 122 4e-36 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 122 bits (308), Expect = 4e-36
Identities = 81/252 (32%), Positives = 118/252 (46%), Gaps = 15/252 (5%)

Query: 9 GRSFLVTGASSGIGRAAAVALRGGGARVVAAARNARELERLAHETGC-----EPLELDVG 63
G+ +TGA+ GIG A A L GA + A N +LE++ E DV
Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVR 67

Query: 64 CDASVRAALSG-ERMRDAFDGLINCAGVTSLAAAIDTTADEFDRVMAVNARGAMLVARHV 122
A++ + ER D L+N AGV + +E++ +VN+ G +R V
Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 123 ARAMIRAGRGGSIVNVSSQAALVALPSHLAYCASKAALDAMTRVLCVELGPHGIRVNSVN 182
++ M R GSIV V S A V S AY +SKAA T+ L +EL + IR N V+
Sbjct: 128 SKYM-MDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186

Query: 183 PTVTLTPMAERAWSDPHASGPMLA--------AIPLGRFARVADVVAPILFLSSDAAAMV 234
P T T M W+D + + ++ IPL + A+ +D+ +LFL S A +
Sbjct: 187 PGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHI 246

Query: 235 SGVALPVDGGYT 246
+ L VDGG T
Sbjct: 247 TMHNLCVDGGAT 258


ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
BMA10247_0867 PF05272 28 0.040 Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.5 bits (63), Expect = 0.040
Identities = 12/23 (52%), Positives = 13/23 (56%)

Query: 36 VTALCGPNGCGKSTLLRTLAGLQ 58
L G G GKSTL+ TL GL
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLD 620


ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
BMA10247_0869 2FE2SRDCTASE 56 2e-11 Ferric iron reductase signature.
		>2FE2SRDCTASE#Ferric iron reductase signature.

Length = 262

Score = 55.8 bits (134), Expect = 2e-11
Identities = 50/186 (26%), Positives = 72/186 (38%), Gaps = 24/186 (12%)

Query: 78 RALVSQWSKYYFNLAASAGFAAALLLGRPLDMAPQRMRVALRGGMPVALLFEADALRPAQ 137
+ L+S W+++Y L A L + LD++P+ VA F D
Sbjct: 89 KPLISLWAQWYIGLMVPPLMLALLTQEKALDVSPEHFHAEFHETGRVA-CFWVDVCEDKN 147

Query: 138 AEPAS---RYAALVDH-LRATIDTLAVLAKLSPRVLWANAGNLLD-YLFEQCAHAPRAGA 192
A P S R L+ L + L +++ +++W+N G L++ YL E G
Sbjct: 148 ATPHSPQHRMETLISQALVPVVQALEATGEINGKLIWSNTGYLINWYLTEM---KQLLGE 204

Query: 193 DA------AWLFGPVDSRGEANPLRLPVRRVKPCSARLPDPFRARRVCCLRNEIPGEDQL 246
A F + GE NPL V L D RR CC R +P Q
Sbjct: 205 ATVESLRHALFFEKTLTNGEDNPLWRTV--------VLRDGLLVRRTCCQRYRLPDVQQ- 255

Query: 247 CGSCPL 252
CG C L
Sbjct: 256 CGDCTL 261


ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
BMA10247_0870 FERRIBNDNGPP 112 1e-30 Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 112 bits (282), Expect = 1e-30
Identities = 77/264 (29%), Positives = 112/264 (42%), Gaps = 15/264 (5%)

Query: 115 PARIVVLEFMFAEDLAALDITPVGMADPAYYPIWIGYDDARFARVSDVGTRQEPSLEAIA 174
P RIV LE++ E L AL I P G+AD Y +W+ + V DVG R EP+LE +
Sbjct: 35 PNRIVALEWLPVELLLALGIVPYGVADTINYRLWVS-EPPLPDSVIDVGLRTEPNLELLT 93

Query: 175 AAKPDLILGVGLRHAPIFDALSRIAPTVLFKYSPNYIEDGRQVTQYDWACAILRTIGCLT 234
KP ++ + P + L+RIAP F +S DG+Q A L + L
Sbjct: 94 EMKPSFMVW-SAGYGPSPEMLARIAPGRGFNFS-----DGKQ--PLAMARKSLTEMADLL 145

Query: 235 GRARDARAVQARVDAGLARDARRIAAAGRAGERVAWLQELGLPDRYWAFTGNSASAGIAR 294
A A+ + + R + G R L L P F NS I
Sbjct: 146 NLQSAAETHLAQYEDFIRSMKPRFV---KRGARPLLLTTLIDPRHMLVFGPNSLFQEILD 202

Query: 295 ALGLE-PWPGEPTREGTAYVTSEDLLKQPDLAVLFVSATEPGVPLDAKLDSSIWRFVPAR 353
G+ W GE G+ V+ + L D+ VL +DA + + +W+ +P
Sbjct: 203 EYGIPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNSKD-MDALMATPLWQAMPFV 261

Query: 354 RAGRVALVERNIWGFGGPMSALRL 377
RAGR V +W +G +SA+
Sbjct: 262 RAGRFQRVP-AVWFYGATLSAMHF 284


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
BMA10247_0885  HTHFIS  44     6e-07    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 44.4 bits (105), Expect = 6e-07
Identities = 39/176 (22%), Positives = 64/176 (36%), Gaps = 14/176 (7%)

Query: 17 DRALPAAYPFSALIGQ-AALQQALLLVA-VDPGLGGVLVSGPRGTAKSTAARALAELLP- 73
+ + L+G+ AA+Q+ ++A + ++++G GT K ARAL +
Sbjct: 127 SKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKR 186

Query: 74 -EGRFVTLPLSASDEQVTGSLDLASALADNT--VRFSPGLVARAHLGVLYVDEINLLPDA 130
G FV + ++A + S T S G +A G L++DEI +P
Sbjct: 187 RNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMD 246

Query: 131 LVDALLDAAASGVNTVERDGVSHSHAARFALVGTMNP------EEGELRPQLLDRF 180
LL G G + +V N +G R L R
Sbjct: 247 AQTRLLRVLQQG--EYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRL 300


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
BMA10247_0898  OMADHESIN  29     0.026    Yersinia outer membrane adhesin signature.
		>OMADHESIN#Yersinia outer membrane adhesin signature.

Length = 455

Score = 29.5 bits (65), Expect = 0.026
Identities = 25/63 (39%), Positives = 28/63 (44%)

Query: 147 ADGATPAAIAGALVARGFGPSAMSVFEHLGGPLERRLDARADAWRDARAAALNVVAIECR 206
A GAT A GA VA G G A V GPL + L A + A A + VAI R
Sbjct: 74 AIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKDGVAIGAR 133

Query: 207 ACA 209
A
Sbjct: 134 AST 136


14  BMA10247_0915  BMA10247_0946  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
BMA10247_0915  0       10       3.018096   arginine deiminase
BMA10247_0916  -1      12       3.334731   arginine/ornithine antiporter
BMA10247_0918  2       13       5.239546   hypothetical protein
BMA10247_0919  2       14       4.988616   amine ABC transporter permease
BMA10247_0920  1       13       4.614264   amine ABC transporter ATP-binding protein
BMA10247_0921  -1      11       3.776123   amine ABC transporter permease
BMA10247_0922  -1      11       2.706713   amine ABC transporter periplasmic amine-binding
BMA10247_0924  0       12       3.343189   EmrB/QacA family drug resistance transporter
BMA10247_0925  0       11       2.606162   AMP-binding protein
BMA10247_0926  -3      12       1.209387   hypothetical protein
BMA10247_0927  -2      11       1.753770   methyl-accepting chemotaxis protein
BMA10247_0928  0       15       2.013151   chemotaxis protein CheW
BMA10247_0930  1       16       -0.156814  hypothetical protein
BMA10247_0931  0       17       -1.731049  AraC family transcriptional regulator
BMA10247_0932  3       17       -3.698782  outer membrane porin
BMA10247_0934  3       13       -4.594739  hypothetical protein
BMA10247_0935  2       11       -4.684395  hypothetical protein
BMA10247_0936  3       11       -4.805322  JmjC domain-containing protein
BMA10247_0939  5       14       -3.324496  IS407A, transposase OrfB
BMA10247_0940  4       13       -2.938855  IS407A, transposase OrfA
BMA10247_0942  4       13       -2.607579  electron transfer flavoprotein-ubiquinone
BMA10247_0943  2       16       -2.046179  short chain dehydrogenase
BMA10247_0944  4       21       -2.352428  thioesterase
BMA10247_0945  4       18       -2.560724  hypothetical protein
BMA10247_0946  -1      14       -3.263310  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_0915  ARGDEIMINASE  513    0.0      Bacterial arginine deiminase signature.
		>ARGDEIMINASE#Bacterial arginine deiminase signature.

Length = 409

Score = 513 bits (1324), Expect = 0.0
Identities = 130/423 (30%), Positives = 227/423 (53%), Gaps = 21/423 (4%)

Query: 8 MSQAIPQVGVHSEVGKLRKVLVCSPGLAHQRLTPSNCDELLFDDVMWVNQAKRDHFDFVS 67
M + + + + SE+G+L+KVL+ PG + LTP LFDD+ ++ A+++H F S
Sbjct: 1 MEEYLNPINIFSEIGRLKKVLLHRPGEELENLTPFIMKNFLFDDIPYLEVARQEHEVFAS 60

Query: 68 KMRERGVEVLEMHNLLTETVQNPAALK------WILDRKITPDNVGIGLVDEVRAWLEGL 121
++ VE+ + +L++E + + AL+ +IL+ +I D ++ ++ + L
Sbjct: 61 ILKNNLVEIEYIEDLISEVLVSSVALENKFISQFILEAEIKTDFT----INLLKDYFSSL 116

Query: 122 EPRALAEFLIGGVAASDIAGAERSKVLTLFRDYLGKSSFVLPPLPNMMFTRDTSCWIYGG 181
+ +I GV ++ S + G + F++ P+PN++FTRD I G
Sbjct: 117 TIDNMISKMISGVVTEELKNYTSSLDDLV----NGANLFIIDPMPNVLFTRDPFASIGNG 172

Query: 182 VTLNPMHWPARRQETLLVAAVYKFHPAFTDAKFDVWYGDPDRDHGMATLEGGDVMPIGRG 241
VT+N M R++ET+ ++K+HP + +W + A+LEGGD + + +G
Sbjct: 173 VTINKMFTKVRQRETIFAEYIFKYHPVYK-ENVPIWLNRWE----EASLEGGDELVLNKG 227

Query: 242 VVLVGMGERTSRQAVGQLAQALFA-KGAAERVIVAGLPNSRASMHLDTVFSFCDRDLVTV 300
++++G+ ERT ++V +LA +LF K + + ++ +P +R+ MHLDTVF+ D + T
Sbjct: 228 LLVIGISERTEAKSVEKLAISLFKNKTSFDTILAFQIPKNRSYMHLDTVFTQIDYSVFTS 287

Query: 301 FPEVVNRIVPFTLRPGGDARYGIDIEREDKPFVDVVAQALGLKSLRVVETGGNDFAAERE 360
F + L + I I++E DV++ LG K + GG+ RE
Sbjct: 288 FTSDDMYFSIYVLTYNPSSSK-IHIKKEKARIKDVLSFYLGRKIDIIKCAGGDLIHGARE 346

Query: 361 QWDDGNNMVCIEPGVVVGYDRNTYTNTLLRKAGVEVITIGSSELGRGRGGGHCMTCPVLR 420
QW+DG N++ I PG ++ Y RN TN L + G++V I SSEL RGRGG CM+ P++R
Sbjct: 347 QWNDGANVLAIAPGEIIAYSRNHVTNKLFEENGIKVHRIPSSELSRGRGGPRCMSMPLIR 406

Query: 421 DPV 423
+ +
Sbjct: 407 EDI 409


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
BMA10247_0924  TCRTETB  93     1e-22    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 93.4 bits (232), Expect = 1e-22
Identities = 74/403 (18%), Positives = 153/403 (37%), Gaps = 16/403 (3%)

Query: 18 FMQNLDSTVVATALPSMARELGVNVVFLSSAITSYLVALTVFIPVSGWIAERFGAKRVFI 77
F L+ V+ +LP +A + + T++++ ++ V G ++++ G KR+ +
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 78 AAIAIFTAASVMCAAANGLAT-LVAARILQGAGGALMVPVGRLILYRGVSRHEMLAATTW 136
I I SV+ + + L+ AR +QGAG A + +++ R + + A
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 137 LTMPALVGPLLGPPLGGFLTDALSWRAVFWINVPVGVAGAALAARLVPASAGERRAPADA 196
+ +G +GP +GG + + W + + +P+ + + D
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKEVRIKGHFDI 201

Query: 197 RGMLLVGAALAALMLGVETAGRGVLPAGAPALCLGAGVALGGLAIRHCRRVAHPAVDLSL 256
+G++L+ + ML + L + + ++H R+V P VD L
Sbjct: 202 KGIILMSVGIVFFMLFTTSYSISFLIVSVLSF---------LIFVKHIRKVTDPFVDPGL 252

Query: 257 L-GIPTFHAATIAGSLFRAGAGALPFLVPLTLQVGFGASASRSGAITLASA-LGSLVMRP 314
IP G +F AG + +VP ++ S + G++ + + ++
Sbjct: 253 GKNIPFMIGVLCGGIIFGTVAGFV-SMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGY 311

Query: 315 MTHAALHRAPMRTVLIAGSVSFAAVLAACATLSPAWPDAAVFALLLVGGLSRSLSFASLG 374
+ + R VL G + + L ++ V G S + +
Sbjct: 312 IGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLG-GLSFTKTVIS 370

Query: 375 ALVFSDVPSERLSAATSFQGTAQQLMRAVGVAVAAGALHLAML 417
+V S + + A S L G+A+ G L + +L
Sbjct: 371 TIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLL 413


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
BMA10247_0927  IGASERPTASE  30     0.024    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 30.4 bits (68), Expect = 0.024
Identities = 24/171 (14%), Positives = 47/171 (27%), Gaps = 5/171 (2%)

Query: 404 ASEVRSLAQRSSSAAKEIKDLINASVQKIHDGSALAGEAGKTMTEVTQAVARVTDIMGEI 463
+ + S++ D S + + ++ V + E
Sbjct: 1002 NIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATET 1061

Query: 464 AAASGEQSRGIEQVNQAIAQMDEVTQQNAALVEEAAAASKSLEEQGRHLTQAVSFFRASA 523
A + E ++ + +A Q +EV Q + E +K + +
Sbjct: 1062 TAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKA-----KVET 1116

Query: 524 ASAAPQARHAAPAKPKAKRGVAAPASAPRAAHAAPTFNKPAPALAAAATAS 574
+ + PK ++ A A PT N P TA
Sbjct: 1117 EKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTAD 1167


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_0932  ECOLNEIPORIN  93     5e-23    E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 92.6 bits (230), Expect = 5e-23
Identities = 88/379 (23%), Positives = 140/379 (36%), Gaps = 62/379 (16%)

Query: 32 ASTAHAQSSVVLYGLIDTSITYANNQRTHGAGSPGSPGWAVTSGALNASRWGLRGREDLG 91
A A + V LYG I + + + +GA + T S+ G +G+EDLG
Sbjct: 12 ALPVAAMADVTLYGTIKAGVETSRSVAHNGAQAASVE--TGTGIVDLGSKIGFKGQEDLG 69

Query: 92 DGVSAIFALENGFSGASGALSQKGVDMFGRQAWIGLKSKEGGALTLGRQYDLILDF--VT 149
+G+ AI+ +E AS A + G RQ++IGLK G L +GR ++ D +
Sbjct: 70 NGLKAIWQVE---QKASIAGTDSG--WGNRQSFIGLK-GGFGKLRVGRLNSVLKDTGDIN 123

Query: 150 PLGASGPGWGGNLAVHPYDNDDSNRNIRINNAVKYTSPTYRGWTLGAMYGFSNTAGPFGN 209
P + G N P R I +V+Y SP + G + Y ++ AG N
Sbjct: 124 PWDSKSDYLGVNKIAEP-----EARLI----SVRYDSPEFAGLSGSVQYALNDNAG-RHN 173

Query: 210 NAAWSAGLSYANGPLKLGAGYLRINRNPNAANANGALSTTDGSATITGGSQQIWAVAGRY 269
+ ++ AG +Y NG + G + QI + Y
Sbjct: 174 SESYHAGFNYKNGGFFVQYGGAYKRH-------------HQVQENVNIEKYQIHRLVSGY 220

Query: 270 -AFGPHSIGAAWSHSATDRVSGVLQGGSIAKLDGKSLVFDNFTLDGRYVVTPRLSLAAAY 328
++ A A + F N VTPR+S A +
Sbjct: 221 DNDALYASVAVQQQDAKLVEENYSHNSQTEVAATLAYRFGN--------VTPRVSYAHGF 272

Query: 329 TYTMGRFDARSGETRPKWNHMVAQADYAFSIRTDAYLEAVYQRVSGGNGIPAFNATIWTL 388
+ + + ++ +V A+Y FS RT A + A + + G G F +T
Sbjct: 273 KGSFDATNYNN-----DYDQVVVGAEYDFSKRTSALVSAGWLQ--EGKGESKFVSTA--- 322

Query: 389 TPSANGNQVVVALGLRHRF 407
+GLRH+F
Sbjct: 323 ----------GGVGLRHKF 331


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_0943  DHBDHDRGNASE  120    5e-35    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 120 bits (301), Expect = 5e-35
Identities = 77/261 (29%), Positives = 125/261 (47%), Gaps = 16/261 (6%)

Query: 7 LEGKVALITGASSGLGQRFAQVLSQAGAKVVLASRRVERLKELRAEIEAAGGAAHVVSLD 66
+EGK+A ITGA+ G+G+ A+ L+ GA + E+L+++ + ++A A D
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 67 VTDYQSIRAAVAHAETEAGTIDILVNNSGVSTMQKLVDVSPADFEYVFDTNTRGAFFVAQ 126
V D +I A E E G IDILVN +GV + +S ++E F N+ G F ++
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 127 EVAKRMMMRAGSGNAKPACRIINIASVAGLRPFSQIGLYAMSKAAVVHMTRAMALEWGRH 186
V+K MM R I+ + S P + + YA SKAA V T+ + LE +
Sbjct: 126 SVSKYMMDRRSGS-------IVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEY 178

Query: 187 GINVNAICPGYIDTEINHYLWETEQGQ---------KLQSMLPRRRVGKPQDLDGLLLLL 237
I N + PG +T++ LW E G ++ +P +++ KP D+ +L L
Sbjct: 179 NIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFL 238

Query: 238 AADESQFINGSIVSADDGLGL 258
+ ++ I + D G L
Sbjct: 239 VSGQAGHITMHNLCVDGGATL 259


15  BMA10247_0959  BMA10247_0979  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
BMA10247_0959  -1      12       -3.859089  endoribonuclease L-PSP
BMA10247_0960  -1      12       -4.234552  GTP pyrophosphokinase
BMA10247_0962  1       17       -6.760794  *hypothetical protein
BMA10247_0963  0       13       -4.624248  threonyl-tRNA synthetase
BMA10247_0964  1       13       -3.933020  translation initiation factor IF-3
BMA10247_0965  0       13       -3.058531  50S ribosomal protein L35
BMA10247_0966  -2      10       -2.561116  50S ribosomal protein L20
BMA10247_0967  -3      10       -2.304731  phenylalanyl-tRNA synthetase subunit alpha
BMA10247_0968  -2      10       -1.655102  phenylalanyl-tRNA synthetase subunit beta
BMA10247_0969  -2      11       -1.725779  integration host factor subunit alpha
BMA10247_0970  -2      11       -2.043503  MerR family transcriptional regulator
BMA10247_0971  -1      12       -2.765242  lipoprotein
BMA10247_0973  3       18       -3.682568  *lipoprotein
BMA10247_0974  2       21       -4.245260  hypothetical protein
BMA10247_0975  2       23       -5.708814  hypothetical protein
BMA10247_0976  4       23       -6.028600  lipoprotein
BMA10247_0978  5       35       -7.404252  IS407A, transposase OrfB
BMA10247_0979  4       35       -6.109302  IS407A, transposase OrfA
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_0969  DNABINDINGHU  119    1e-38    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 119 bits (301), Expect = 1e-38
Identities = 35/89 (39%), Positives = 53/89 (59%)

Query: 37 TKAELAELLFDSVGLNKREAKDMVEAFFEVIRDALENGESVKLSGFGNFQLRDKPQRPGR 96
K +L + ++ L K+++ V+A F + L GE V+L GFGNF++R++ R GR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 97 NPKTGEAIPIAARRVVTFHASQKLKALVE 125
NP+TGE I I A +V F A + LK V+
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAVK 91


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
BMA10247_0971  PF00577  31     0.024    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 30.6 bits (69), Expect = 0.024
Identities = 32/179 (17%), Positives = 58/179 (32%), Gaps = 36/179 (20%)

Query: 480 APWDAMSDLFNRHLLDYSPRSLNDLKLSADGGALRVRGGIKLWNQVPPGVWLPADMKGSL 539
AP + FN L P+++ DL +G ++PPG + D+ +
Sbjct: 40 APLSSAELYFNPRFLADDPQAVADLSRFENG------------QELPPGTY-RVDIYLNN 86

Query: 540 TLLDERHLAFTPTQVSVLGIP--QAKLLRALGIELSSLAPLKRRGAELRGDSLVLDQYTV 597
+ R + F +P L ++G+ +S++ + + L
Sbjct: 87 GYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLA---DDACVPLTSM-- 141

Query: 598 FPPPVLIGHMSQATVEPDG----LRLTFRPAPNAPVLRPPANLPGSYLWLEGGDTKMFN 652
+ AT + D L LT P A + LW G + + N
Sbjct: 142 ---------IHDATAQLDVGQQRLNLTI---PQAFMSNRARGYIPPELWDPGINAGLLN 188


16  BMA10247_1041  BMA10247_1056  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
BMA10247_1041  0       11       4.154958   hypothetical protein
BMA10247_1042  0       13       4.185621   hypothetical protein
BMA10247_1043  1       12       4.231470   hypothetical protein
BMA10247_1044  1       11       3.592248   type II secretion system protein
BMA10247_1045  -2      11       2.802204   type II secretion system protein
BMA10247_1047  -3      9        1.178518   hypothetical protein
BMA10247_1048  -2      12       -0.417709  type II/III secretion system protein
BMA10247_1049  -2      16       -2.079728  pilus assembly protein CpaB
BMA10247_1050  1       14       -4.828185  TadE family protein
BMA10247_1051  1       13       -4.411409  peptidase A24A, prepilin type IV
BMA10247_1052  2       12       -4.667240  IS407A, transposase OrfB
BMA10247_1053  2       11       -4.414754  IS407A, transposase OrfA
BMA10247_1055  2       12       -4.205844  putrescine ABC transporter permease PotI
BMA10247_1056  0       9        -3.257907  putrescine ABC transporter permease PotH
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_1043  PYOCINKILLER  31     0.009    Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 30.5 bits (68), Expect = 0.009
Identities = 28/86 (32%), Positives = 37/86 (43%), Gaps = 3/86 (3%)

Query: 214 LMNQLKLAPAVRAEIRNDATRIAAAARARQRA-LARPGAPGAAASAGATLAASAAGSNGG 272
MN L A A + R AAA A+++A AA A T A A GS
Sbjct: 203 RMNTLTAAKASIEAAAANKAREQAAAEAKRKAEEQARQQ--AAIRAANTYAMPANGSVVA 260

Query: 273 AAAGKGAVAGSGASAPGAAATATAAA 298
AAG+G + + +A A A + A A
Sbjct: 261 TAAGRGLIQVAQGAASLAQAISDAIA 286


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
BMA10247_1047  HTHFIS  38     5e-05    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 38.3 bits (89), Expect = 5e-05
Identities = 11/63 (17%), Positives = 26/63 (41%)

Query: 79 AALRVSHPGLPIVALGSLGEPESALAALRAGVRDFIDFSAPAEDALRITRGLLDHVGDQP 138
++ + P LP++ + + +A+ A G D++ + + I L +P
Sbjct: 67 PRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRRP 126

Query: 139 SRH 141
S+
Sbjct: 127 SKL 129


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_1048  BCTERIALGSPD  138    2e-37    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 138 bits (348), Expect = 2e-37
Identities = 58/249 (23%), Positives = 111/249 (44%), Gaps = 11/249 (4%)

Query: 151 VQVDVRVVEFSRSVLKQAGLNFFKQNNGFTFGSFAPAGLASVTGGG----TSSMSVSANI 206
V V+ + E + G+ + +N G T + + +++ G S+
Sbjct: 347 VLVEAIIAEVQDADGLNLGIQWANKNAGMTQFTNSGLPISTAIAGANQYNKDGTVSSSLA 406

Query: 207 PIASAFN-LVVGSATRGLFADLSILEANNLARVLAQPTLVALSGQSASFLAGGEIPVPVP 265
S+FN + G L+ L ++ +LA P++V L A+F G E+PV
Sbjct: 407 SALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTG 466

Query: 266 QSLGT-----ISIDWKPYGVGLTLTPTVLSPRRIALKVAPESSQLDFVHSITINGVTVPA 320
+ +++ K G+ L + P + + L++ E S + S + +
Sbjct: 467 SQTTSGDNIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSVADAAS-STSSDLGAT 525

Query: 321 LTTRRADTTVELGDGESFAIGGLIDRETTSNVDKVPFLGDLPIIGTFFKHLSYQQNDKEL 380
TR + V +G GE+ +GGL+D+ + DKVP LGD+P+IG F+ S + + + L
Sbjct: 526 FNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNL 585

Query: 381 VIIVTPHLV 389
++ + P ++
Sbjct: 586 MLFIRPTVI 594


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_1051  PREPILNPTASE  53     4e-11    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 53.3 bits (128), Expect = 4e-11
Identities = 31/124 (25%), Positives = 52/124 (41%), Gaps = 10/124 (8%)

Query: 4 LFSIGFFFAWAAAVAIADCRDRRIPNELVLVGLAAVIIFTVCRQNPFGTTLSGALIGGAV 63
+ A+ D +P++L L L ++F + F +L A+IG
Sbjct: 134 TLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNL--LGGF-VSLGDAVIGAMA 190

Query: 64 GLVSLFPFFAL-------RVMGAADVKVFAVLGAWCGLSALPRLWVVASVAAGVHALALM 116
G + L+ + MG D K+ A LGAW G ALP + +++S+ + L+
Sbjct: 191 GYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAFMGIGLI 250

Query: 117 LLTR 120
LL
Sbjct: 251 LLRN 254


17  BMA10247_1161  BMA10247_1193  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
BMA10247_1161  -1      12       -4.271284  phosphate transporter family protein
BMA10247_1162  1       15       -4.753707  hypothetical protein
BMA10247_1163  2       18       -4.676729  replicative DNA helicase
BMA10247_1164  3       16       -1.296385  hypothetical protein
BMA10247_1165  2       13       -0.058292  50S ribosomal protein L9
BMA10247_1166  0       13       1.161657   30S ribosomal protein S18
BMA10247_1167  1       12       2.145117   primosomal replication protein N
BMA10247_1168  1       14       0.893185   30S ribosomal protein S6
BMA10247_1169  2       14       0.986021   hypothetical protein
BMA10247_1170  1       12       0.399802   RNA polymerase sigma factor
BMA10247_1171  2       17       -0.284231  Ser/Thr protein phosphatase
BMA10247_1172  4       18       -1.300390  cytochrome c oxidase
BMA10247_1173  3       20       -1.422435  IS407A, transposase OrfB
BMA10247_1174  3       18       0.166305   IS407A, transposase OrfA
BMA10247_1175  1       14       1.042856   asparaginase
BMA10247_1176  0       15       -1.071023  hypothetical protein
BMA10247_1177  0       11       0.121057   phosphate starvation-inducible protein
BMA10247_1179  2       10       0.724341   hypothetical protein
BMA10247_1178  2       11       0.488512   hypothetical protein
BMA10247_1180  0       10       0.157665   lipoprotein
BMA10247_1181  0       11       0.045681   glycoside hydrolase family protein
BMA10247_1182  -1      10       0.940495   hypothetical protein
BMA10247_1183  -3      9        1.358624   mechanosensitive ion channel family protein
BMA10247_1184  -1      10       1.340013   hypothetical protein
BMA10247_1185  -1      11       1.337103   ankyrin repeat-containing protein
BMA10247_1186  -1      11       2.483135   TatD family hydrolase
BMA10247_1187  -1      11       3.432565   phosphinothricin acetyltransferase
BMA10247_1188  -2      10       3.499882   DNA polymerase III subunit delta'
BMA10247_1189  0       9        2.743394   thymidylate kinase
BMA10247_1190  1       10       2.992525   lipoprotein
BMA10247_1191  1       10       3.349886   glycine cleavage T-protein (aminomethyl
BMA10247_1192  2       11       2.950615   hypothetical protein
BMA10247_1193  2       12       2.735419   acetyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
BMA10247_1165  UREASE  27     0.046    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 26.6 bits (59), Expect = 0.046
Identities = 11/33 (33%), Positives = 17/33 (51%), Gaps = 5/33 (15%)

Query: 121 LKMIGEHGVQVALHTDVV-----VDVTVNVIGD 148
L + E+ VQV +HTD + V+ T+ I
Sbjct: 235 LSVADEYDVQVMIHTDTLNESGFVEDTIAAIKG 267


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_1187  SACTRNSFRASE  30     0.003    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 29.9 bits (67), Expect = 0.003
Identities = 11/56 (19%), Positives = 21/56 (37%)

Query: 88 IYLDEAARGSGLGSRLLEAALAKAPALGVHTALGFIFGHNEPSLRLFARYGFTTWG 143
I + + R G+G+ LL A+ A + N + +A++ F
Sbjct: 95 IAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGA 150


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_1193  SACTRNSFRASE  37     6e-06    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 37.2 bits (86), Expect = 6e-06
Identities = 17/59 (28%), Positives = 25/59 (42%), Gaps = 3/59 (5%)

Query: 74 IGRVSVLADARGRGVGSRLLDALLAEARGRGDALVRLYAQQR---AVAFYLRIGFRIVG 129
I ++V D R +GVG+ LL + A+ + L Q A FY + F I
Sbjct: 92 IEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGA 150


18  BMA10247_1208  BMA10247_1246  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
BMA10247_1208  -1      11       3.352183   peptidyl-prolyl cis-trans isomerase
BMA10247_1209  -1      11       4.103506   acetyltransferase
BMA10247_1210  -1      12       3.110603   phosphoribosylformylglycinamidine synthase
BMA10247_1211  0       12       4.089955   hypothetical protein
BMA10247_1212  -1      11       3.931826   D-amino acid dehydrogenase small subunit
BMA10247_1214  -1      11       3.407378   carbohydrate kinase
BMA10247_1213  -3      12       0.871737   hypothetical protein
BMA10247_1215  -3      10       -0.030168  glucose-6-phosphate isomerase
BMA10247_1216  -2      13       0.012282   hypothetical protein
BMA10247_1217  -2      13       0.080555   ABC transporter ATP-binding protein
BMA10247_1219  -2      14       -0.481070  acyl-CoA thioesterase
BMA10247_1218  2       12       -2.568154  hypothetical protein
BMA10247_1220  2       10       -3.340904  peptidyl-prolyl cis-trans isomerse D
BMA10247_1222  3       11       -4.195609  *carboxymuconolactone decarboxylase
BMA10247_1224  4       9        -4.281340  hypothetical protein
BMA10247_1226  4       8        -4.396525  hypothetical protein
BMA10247_1230  3       8        -2.503557  *ATP-dependent protease La
BMA10247_1231  1       9        -1.522985  ATP-dependent protease ATP-binding subunit ClpX
BMA10247_1232  1       10       0.774679   ATP-dependent Clp protease proteolytic subunit
BMA10247_1233  2       10       2.562609   trigger factor
BMA10247_1234  2       11       4.102183   hypothetical protein
BMA10247_1235  -1      9        3.574521   glycerate kinase
BMA10247_1236  -2      12       1.639015   MarR family transcriptional regulator
BMA10247_1237  -2      13       1.775150   hypothetical protein
BMA10247_1238  -1      13       0.030157   2-dehydropantoate 2-reductase
BMA10247_1239  1       15       -1.979171  LuxR family transcriptional regulator
BMA10247_1240  2       20       -3.446201  porin
BMA10247_1242  2       23       -3.923857  phospholipase D
BMA10247_1243  3       19       -2.157333  hypothetical protein
BMA10247_1244  2       15       -1.232804  IS407A, transposase OrfB
BMA10247_1245  1       13       0.183689   IS407A, transposase OrfA
BMA10247_1246  2       15       0.219217   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_1214  PYOCINKILLER  30     0.024    Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 30.1 bits (67), Expect = 0.024
Identities = 53/222 (23%), Positives = 73/222 (32%), Gaps = 21/222 (9%)

Query: 84 DALVAAAELRRLGFAADAWMPIEVKPDDARWALERARAANVPIDEAAPESFDGYGWLVDG 143
+ L AA AA A E +A+ E I A + G +V
Sbjct: 205 NTLTAAKASIE---AAAANKAREQAAAEAKRKAEEQARQQAAIRAANTYAMPANGSVVAT 261

Query: 144 LFGIGLARPLDGAFAAIAQRIAARARHTGRVLALDVPSGLDSDTGARVGGGTAVTATCTL 203
G GL + GA A++AQ I+ GRVLA S G ++T +
Sbjct: 262 AAGRGLIQVAQGA-ASLAQAISDAIAVLGRVLA--------SAPSVMAVGFASLTYSSRT 312

Query: 204 SFIAAKPGLYTGDGRDLAGEIHVAPLDLGEPPAPAIRLNAPELFEAR--LPERAFASHKG 261
+ T D A + A L P++ LNA LP R +G
Sbjct: 313 AEQWQD---QTPDSVRYALGMDAAKLG----LPPSVNLNAVAKASGTVDLPMRLTNEARG 365

Query: 262 TYGSLGIVGGDTGMCGAPILAARAALFAGAGKVHVGFVGTGA 303
+L +V D + AA A G V T A
Sbjct: 366 NTTTLSVVSTDGVSVPKAVPVRMAAYNATTGLYEVTVPSTTA 407


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits        Score  E-value  Comments
BMA10247_1230  GPOSANCHOR  40     3e-05    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 40.4 bits (94), Expect = 3e-05
Identities = 35/192 (18%), Positives = 73/192 (38%), Gaps = 15/192 (7%)

Query: 92 KVLVEGLQRAQALSIEEQETQFSCEVMPLEPDHADSAETEALRRAIVSQFDQYVKLNKKI 151
L + L+ A S + + E A A+ E ++ K +
Sbjct: 193 AELEKALEGAMNFSTADSAKIKTLEAE-KAALAARKADLEKALEGAMNFSTADSAKIKTL 251

Query: 152 PPEILTSLSGIDEAGRLADTIAAHLPLKLDQKQHILEMFPVIERLEHLLAQLEAEIDILQ 211
E + E + + + + + LE A LE + +L
Sbjct: 252 EAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAE---KAALEAEKADLEHQSQVLN 308

Query: 212 VEKRIRGRVKRQMEKSQREYYLNEQVKAIQKELGEGEEGAD--LEELEKRINAARMPKEA 269
R ++R ++ S+ +Q++A ++L E + ++ + L + ++A+R EA
Sbjct: 309 AN---RQSLRRDLDASREAK---KQLEAEHQKLEEQNKISEASRQSLRRDLDASR---EA 359

Query: 270 KKKADAELKKLK 281
KK+ +AE +KL+
Sbjct: 360 KKQLEAEHQKLE 371


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
BMA10247_1231  HTHFIS  31     0.008    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 31.3 bits (71), Expect = 0.008
Identities = 19/112 (16%), Positives = 40/112 (35%), Gaps = 12/112 (10%)

Query: 51 EAAAAGVEASLSKSDLPSPQEIRDILDQYVIGQERAKKILAVAVYNHYKRL-------KH 103
+A+ G L K E+ I+ + + +R L + + +
Sbjct: 92 KASEKGAYDYLPKP--FDLTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEI 149

Query: 104 LDKKDDVELSKSNILLIGPTGSGKTLLAQTLARL---LNVPFVIADATTLTE 152
+ + +++ G +G+GK L+A+ L N PFV + +
Sbjct: 150 YRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPR 201


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_1240  ECOLNEIPORIN  67     1e-14    E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 67.1 bits (164), Expect = 1e-14
Identities = 72/323 (22%), Positives = 119/323 (36%), Gaps = 37/323 (11%)

Query: 20 AATLAALSGPAHAQSTLTLYGVADAGVQYLSRADGRHAAWRLQN-----YGILPSQLGIK 74
A TLAAL P A + +TLYG AGV SR+ + A L S++G K
Sbjct: 7 ALTLAAL--PVAAMADVTLYGTIKAGV-ETSRSVAHNGAQAASVETGTGIVDLGSKIGFK 63

Query: 75 GEEDLGGGWRARFQLEQGINLNDSTATVPGYAFFRGAYVGMGGPAGTVTLGRQFSTLFDK 134
G+EDLG G +A +Q+EQ ++ + + R +++G+ G G + +GR S L D
Sbjct: 64 GQEDLGNGLKAIWQVEQKASIAGTDSGW----GNRQSFIGLKGGFGKLRVGRLNSVLKDT 119

Query: 135 TLFYDPLWYASYSGQGVLVPLSANFVDHSIKFQSATFAGFDVEALAAMAGIAGNTRAGRV 194
+P S + + S+++ S FAG ++ A N AGR
Sbjct: 120 GDI-NPWDSKSDYLGVNKIAEPEARLI-SVRYDSPEFAGL-SGSVQ----YALNDNAGRH 172

Query: 195 ------LELGGQFTSRGLSASAVLHRSH-GTAQGGADRSAQRRDIGTFAARYAFASLPLT 247
+ + R H ++ R + + +AS+ +
Sbjct: 173 NSESYHAGFNYKNGGFFVQYGGAYKRHHQVQENVNIEKYQIHRLVSGYDNDALYASVAVQ 232

Query: 248 VHAGVQRLTGELDPARTIV-------WGGARYQASGRFGFAGGIYHTDSPTPQVGHPTLF 300
++T V +G + S GF G T+
Sbjct: 233 QQDAKLVEENYSHNSQTEVAATLAYRFGNVTPRVSYAHGFKGSFDATNYNN----DYDQV 288

Query: 301 IASTTCSLSKRTVAYLNLGYAKN 323
+ SKRT A ++ G+ +
Sbjct: 289 VVGAEYDFSKRTSALVSAGWLQE 311


19  BMA10247_1324  BMA10247_1340  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
BMA10247_1324  2       16       -2.700746  undecaprenyl diphosphate synthase
BMA10247_1325  0       16       -3.312701  ribosome recycling factor
BMA10247_1326  -1      11       -1.717528  uridylate kinase
BMA10247_1327  -2      12       0.199955   elongation factor Ts
BMA10247_1328  -3      10       0.469984   30S ribosomal protein S2
BMA10247_1329  -3      8        1.060215   hypothetical protein
BMA10247_1330  -3      7        1.289105   methionine aminopeptidase
BMA10247_1331  -2      7        2.066027   PII uridylyl-transferase
BMA10247_1332  2       10       2.447658   RNA pseudouridine synthase
BMA10247_1333  1       9        1.858207   peptide deformylase
BMA10247_1334  1       9        2.248456   NAD-dependent DNA ligase
BMA10247_1336  2       10       2.774900   hypothetical protein
BMA10247_1335  1       11       1.959793   hypothetical protein
BMA10247_1337  3       12       1.629012   chromosome segregation protein SMC
BMA10247_1338  0       13       1.477025   hypothetical protein
BMA10247_1339  3       16       0.963652   hypothetical protein
BMA10247_1340  2       15       0.589196   succinyldiaminopimelate transaminase
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_1326  CARBMTKINASE  32     0.002    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 31.7 bits (72), Expect = 0.002
Identities = 48/240 (20%), Positives = 79/240 (32%), Gaps = 79/240 (32%)

Query: 6 KRVLLKLSGEALM---GDDAFGINRATIERMVADIAEVVRLGTQLAVVIGG----GNIFR 58
KRV++ L G AL ++ + + IAE++ G ++ + G G++
Sbjct: 3 KRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSLLL 62

Query: 59 GVAGGAAG-------MDRATADYMGMLATMMNALALQDAMRHAGIEARVQSALRMDQV-- 109
+ G A MD A A G + M+ AL++ +R G+E +V + + V
Sbjct: 63 HMDAGQATYGIPAQPMDVAGAMSQGWIGYMIQQ-ALKNELRKRGMEKKVVTIITQTIVDK 121

Query: 110 ------------------------------------------VEPYIRP------RAIRQ 121
V P P I++
Sbjct: 122 NDPAFQNPTKPVGPFYDEETAKRLAREKGWIVKEDSGRGWRRVVPSPDPKGHVEAETIKK 181

Query: 122 L-EEGKVVIFAAGTGNPFFTT-------------DTAAALRGSEVGAEVVLKATKVDGVY 167
L E G +VI + G G P D A EV A++ + T V+G
Sbjct: 182 LVERGVIVIASGGGGVPVILEDGEIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVNGAA 241


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
BMA10247_1332  IGASERPTASE  35     0.001    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 35.0 bits (80), Expect = 0.001
Identities = 36/277 (12%), Positives = 69/277 (24%), Gaps = 12/277 (4%)

Query: 5 NPRPATPGRAPVRSGSLTARKVARPDPKAAGAKPAA-AKPAAKSASAAKPAAPRSAANAA 63
NP + V + ++T + D + + A+ PA P
Sbjct: 982 NPEVEKRNQ-TVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETV 1040

Query: 64 PKRAPGPSRPAAASEGKRVAKPRTAHDAGRTGGERAPAKRATTPGAASAPRTRRTDAKPA 123
+ + S+ +E + + A T A S T+ T
Sbjct: 1041 AENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTET 1100

Query: 124 RRTNERPAGRDERAPRDSDARAFDAGTRGK-DRAPREGARPGARGATGAKFGGAARRSDD 182
+ T + + ++ + E +P A A +
Sbjct: 1101 KETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQS 1160

Query: 183 ADRRTPRATRADSRARDAAPSSFAGKTATAGKRAPQRADDRYGAAGKRTSPRTE------ 236
T + T + + A + + +E
Sbjct: 1161 QTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPK 1220

Query: 237 -RTERTERPARFGERPATRASASGERRPTARAATGSR 272
R R+ R PAT ++S +R A S
Sbjct: 1221 NRHRRSVRSVPHNVEPAT--TSSNDRSTVALCDLTST 1255


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits        Score  E-value  Comments
BMA10247_1337  GPOSANCHOR  54     2e-09    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 54.3 bits (130), Expect = 2e-09
Identities = 46/342 (13%), Positives = 112/342 (32%), Gaps = 19/342 (5%)

Query: 199 AAGVSKYKERRRETENRLHDTRENLTRVEDIVRELGANLEKLEAQAVVATKYKELVADGE 258
+ ++ + + + + D+ A + + + KE + +
Sbjct: 46 RSQTDTLEKVQERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKND 105

Query: 259 EKQRLLWLLRKNEAAAEQDRQRRAIGDAQIELDAQTAKLREVEAQLETLRVAHYSASDAM 318
+ + A + D ++ A+ A A +AK++ +EA+ L A+
Sbjct: 106 KSLSEKASKIQELEARKADLEK-ALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKAL 164

Query: 319 QGAQGALYEANAEVSRLEAQIKFIVESRNRVQAQIAALVAQQEQWRAQADKAQGDLEAAE 378
+GA +A++ LEA+ + + ++ + + A+ + + A
Sbjct: 165 EGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALA 224

Query: 379 EARAVADEKAAIAEDDAAAKHDALPALEARWRDAQTGLNDERGRIAQTEQALKLEAAHQR 438
+A ++ A + + A + LEA + + + ++A +
Sbjct: 225 ARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIK 284

Query: 439 NA-------DQQLQQLQQRHERLKVEAGGLDAPDEAQLEELRMQLAEHEAMLADAQARLA 491
+ + L+ + + L A + LR L +A
Sbjct: 285 TLEAEKAALEAEKADLEHQSQVL-----------NANRQSLRRDLDASREAKKQLEAEHQ 333

Query: 492 DAQEALPRLDAQRRAAHERVQAESAQIHQLEARLAALKQLQE 533
+E +A R++ + A QLEA L++ +
Sbjct: 334 KLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNK 375



Score = 43.1 bits (101), Expect = 6e-06
Identities = 48/278 (17%), Positives = 104/278 (37%), Gaps = 16/278 (5%)

Query: 268 RKNEAAAEQDRQRRAIGDAQIELDAQTAKLREVEAQLETLRVAHYSASDAMQGAQGALYE 327
+E E + + L + +K++E+EA+ L A++GA
Sbjct: 86 HNDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADL-------EKALEGAMNFSTA 138

Query: 328 ANAEVSRLEAQIKFIVESRNRVQAQIAALVAQQEQWRAQADKAQGDLEAAEEARAVADEK 387
+A++ LEA+ + + ++ + + A+ + + A E +A ++
Sbjct: 139 DSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKA 198

Query: 388 AAIAEDDAAAKHDALPALEARWRDAQTGLNDERGRIAQTEQALKLEAAHQRNADQQLQQL 447
A + + A + LEA D + ++A + + + L
Sbjct: 199 LEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAAL 258

Query: 448 QQRHERLKVEAGGLDAPDEAQLEELRMQLAEHEAMLADA-------QARLADAQEALPRL 500
+ R L+ G A +++ AE A+ A+ Q A+ Q L
Sbjct: 259 EARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDL 318

Query: 501 DAQRRAAHERVQAESAQI-HQLEARLAALKQLQENVQT 537
DA R A ++++AE ++ Q + A+ + L+ ++
Sbjct: 319 DASRE-AKKQLEAEHQKLEEQNKISEASRQSLRRDLDA 355



Score = 40.8 bits (95), Expect = 3e-05
Identities = 40/313 (12%), Positives = 95/313 (30%), Gaps = 20/313 (6%)

Query: 734 TEVRAQAERA--TQRVHALQMDVLKLTQAHERYTQRSTQIREELEEIGAQIEEQRALRAE 791
T + T + +Q K + +++ + + + +E +
Sbjct: 37 TNEVSAVATRSQTDTLEKVQERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSN 96

Query: 792 SEANFERHDAELAELQARFEDNQLAFESLDETLTNARQEARERERAATDARFAARQSANR 851
++ ++D L+E ++ ++ + L++ L A +
Sbjct: 97 AKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKI-----------KT 145

Query: 852 IDELKRSIQVAHEQAERVAASLEDARAELETINEQTAHTGLQDALEVRAAKEQALGAARA 911
++ K ++ E+ + +T +A E+AL A
Sbjct: 146 LEAEKAALAARKADLEKALEGAMNFSTADSA-KIKTLEAEKAALEARQAELEKALEGAMN 204

Query: 912 ELDDLTAKLRAADEARLAAERSLQPLRDRITELQLKEQAARMTGEQFAEQLATAEVDEAA 971
+AK++ + + A L + A + + A E +A
Sbjct: 205 FSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAE 264

Query: 972 LKEKLMPDMKPSYLQGEVTRINNAINALGPVNMAALDELAAASERKVFLDAQSADLTNAI 1031
L++ L T + I L A E A + L+A L +
Sbjct: 265 LEKAL------EGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDL 318

Query: 1032 ETLEDAIRKIDQE 1044
+ +A ++++ E
Sbjct: 319 DASREAKKQLEAE 331



Score = 40.8 bits (95), Expect = 3e-05
Identities = 57/268 (21%), Positives = 99/268 (36%), Gaps = 11/268 (4%)

Query: 714 EAKAAAIRAEAAHTQASQALTEVRAQAERATQRVHALQMDVLKLTQAHERYTQRSTQIRE 773
+A E A A T A+ + AL L +A E ST
Sbjct: 187 ALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSA 246

Query: 774 ELEEIGAQIEEQRALRAESEANFERHDAELAELQARFEDNQLAFESLDETLTNARQEARE 833
+++ + A+ A +AE E E A+ + + +L+ + +++
Sbjct: 247 KIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQV 306

Query: 834 RERAATDARFAARQSANRIDELKRSIQVAHEQAERVAASLEDARAELETINEQTAHTGLQ 893
R S +L+ Q EQ + AS + R +L+ E
Sbjct: 307 LNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKK----- 361

Query: 894 DALEVRAAK-EQALGAARAELDDLTAKLRAADEARLAAERSLQPLRDRITELQLK----E 948
LE K E+ + A L L A+ EA+ E++L+ ++ L+ E
Sbjct: 362 -QLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELE 420

Query: 949 QAARMTGEQFAEQLATAEVDEAALKEKL 976
++ ++T ++ AE A E + ALKEKL
Sbjct: 421 ESKKLTEKEKAELQAKLEAEAKALKEKL 448


20    BMA10247_1538    BMA10247_1553    Y    Y    Y    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
BMA10247_1538525-5.512784LysR family transcriptional regulator
BMA10247_1541631-6.671063*hypothetical protein
BMA10247_1542429-6.068185IS407A, transposase OrfB
BMA10247_1543218-2.159722IS407A, transposase OrfA
BMA10247_1544217-1.103485hypothetical protein
BMA10247_15455210.364661hypothetical protein
BMA10247_15466200.642983hypothetical protein
BMA10247_15476210.631838hypothetical protein
BMA10247_15487221.146657D-alanyl-D-alanine carboxypeptidase
BMA10247_15499300.551840ecotin
BMA10247_155017341.513911lipoprotein
BMA10247_155213320.634614hypothetical protein
BMA10247_15517260.479319hypothetical protein
BMA10247_15543171.589325hypothetical protein
BMA10247_15532161.283502hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
BMA10247_1550    cloacin    28    0.012    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 27.8 bits (61), Expect = 0.012
Identities = 18/59 (30%), Positives = 22/59 (37%), Gaps = 1/59 (1%)

Query: 49 GTVNVWGGDGWRDRDHWHGGDDRWHGGWRGGGNWRDGNDWHGGRGNGWQGGRGPAGGRN 107
G + G G D W ++ W GG G +W G HG G G G G N
Sbjct: 23 GPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHW-GGGSGHGNGGGNGNSGGGSGTGGN 80



Score = 26.2 bits (57), Expect = 0.046
Identities = 20/51 (39%), Positives = 23/51 (45%), Gaps = 3/51 (5%)

Query: 74 GGWRGGGNWRDGNDWHGGRGNGWQGGRGPAGGRNVRGGNDWPDGGGNGRGG 124
G G G + N W GG G+G G G G GN GGG+G GG
Sbjct: 32 GASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGN---SGGGSGTGG 79


21    BMA10247_1625    BMA10247_1635    Y    Y    N    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
BMA10247_1625-1113.296809aspartate alpha-decarboxylase
BMA10247_1626-1114.190786ParA family protein
BMA10247_16270123.300542DoxD-like family protein
BMA10247_16281133.660672cobyric acid synthase
BMA10247_16291153.776055cobinamide kinase/cobinamide phosphate
BMA10247_16303143.920843cobalamin biosynthesis protein
BMA10247_16323143.636253cobalamin ABC transporter periplasmic
BMA10247_16333154.310966hypothetical protein
BMA10247_16343114.307909alpha-ribazole-5'-phosphate phosphatase
BMA10247_16351113.497987cobalamin synthase
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
BMA10247_1632    FERRIBNDNGPP    40    7e-06    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 39.9 bits (93), Expect = 7e-06
Identities = 39/186 (20%), Positives = 68/186 (36%), Gaps = 9/186 (4%)

Query: 42 AITLAAPARRVVSLAPHVTELIYAAG----GGAKLVGAVSYSDYPPAAKAIARVGSNKAL 97
A A R+V+L EL+ A G G A + + PP ++ VG
Sbjct: 28 AHAAAIDPNRIVALEWLPVELLLALGIVPYGVADTINYRLWVSEPPLPDSVIDVGLRTEP 87

Query: 98 DLERIAALKPDLIVVWRHGNAEHETERLRALGIPLYFSEPRH-LDDVAASLDKLGLLLGT 156
+LE + +KP +V E A G FS+ + L SL ++ LL
Sbjct: 88 NLELLTEMKPSFMVWSAGYGPSPEMLARIAPGRGFNFSDGKQPLAMARKSLTEMADLLNL 147

Query: 157 HEIASAAADAYRRRIAQLRARYADK--PPVTVFFQAWDKPLITLNGDH-IVSDVIALCGG 213
A Y I ++ R+ + P+ + D + + G + + +++ G
Sbjct: 148 QSAAETHLAQYEDFIRSMKPRFVKRGARPL-LLTTLIDPRHMLVFGPNSLFQEILDEYGI 206

Query: 214 RNVFAR 219
N +
Sbjct: 207 PNAWQG 212


22    BMA10247_1661    BMA10247_1673    Y    Y    Y    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
BMA10247_1661-112-3.395243hypothetical protein
BMA10247_1662-111-4.187354sulfite reductase (NADPH) hemoprotein
BMA10247_1663-114-3.040789transcriptional regulator CysB-like protein
BMA10247_1664019-2.554073hypothetical protein
BMA10247_1666019-2.395782*transposase
BMA10247_1667-120-1.577035transposase subfamily protein
BMA10247_1669-116-0.372837glutamine synthetase
BMA10247_16702192.551441glutamine amidotransferase
BMA10247_16713162.267618hypothetical protein
BMA10247_16722132.651879hypothetical protein
BMA10247_16732112.967631hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
BMA10247_1662    TCRTETB    32    0.007    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 31.8 bits (72), Expect = 0.007
Identities = 13/39 (33%), Positives = 22/39 (56%), Gaps = 3/39 (7%)

Query: 190 KKNDAGEVVASILAGG-GLGRTPIVGAIIRENLPWQHLL 227
+ A ++ SI+A G G+G P +G +I + W +LL
Sbjct: 136 NRGKAFGLIGSIVAMGEGVG--PAIGGMIAHYIHWSYLL 172


23    BMA10247_1710    BMA10247_1719    Y    Y    N    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
BMA10247_17103112.857295hypothetical protein
BMA10247_17112102.896476hypothetical protein
BMA10247_17132112.393530SUF system FeS assembly ATPase SufC, internal
BMA10247_17142131.830044SufD domain-containing protein
BMA10247_1715011-0.009891cysteine desulfurase SufS
BMA10247_1716014-2.250563NifU-like domain-containing protein
BMA10247_1717011-3.063574hypothetical protein
BMA10247_1718-112-3.944068hypothetical protein
BMA10247_1719010-3.766865transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
BMA10247_1711    PF06580    33    0.002    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 32.9 bits (75), Expect = 0.002
Identities = 29/141 (20%), Positives = 50/141 (35%), Gaps = 25/141 (17%)

Query: 40 ALWLASLRG---HAVAGLSPAISGLAWHVHEMVFGFSAAIIVGFLLTAIRAWTSRETLHG 96
W G + + G A + +H M+F + +++ L A R++ R
Sbjct: 11 YYWYCQGIGWGVYTLTGFGFASLYGSPKLHSMIFNIAISLMGLVLTHAYRSFIKR----- 65

Query: 97 APLAALWLPWAAGRLLVWAGPEPLAAVVDSAFLPITAILLLRVLLAARNHRNVFLTVALF 156
WL G++++ P A V + A + LLA N + V T+ L
Sbjct: 66 ----QGWLKLNMGQIILRVLP----ACVVIGMVWFVANTSIWRLLAFINTKPVAFTLPLA 117

Query: 157 L---------FGALNALFHGW 168
L + L+ GW
Sbjct: 118 LSIIFNVVVVTFMWSLLYFGW 138


24    BMA10247_1731    BMA10247_1744    Y    N    Y    Genomic Island
LocusTag    DNBias    CDNBias    %GCBias    Product
BMA10247_17312110.647248succinylglutamate desuccinylase
BMA10247_1732111-0.164018succinylarginine dihydrolase
BMA10247_1734010-0.789735arginine N-succinyltransferase subunit beta
BMA10247_1735011-0.959701arginine N-succinyltransferase, alpha chain
BMA10247_1736214-2.436282bifunctional
BMA10247_1737421-3.630350lipoprotein
BMA10247_1738211-1.996684AraC family transcriptional regulator
BMA10247_1739311-1.866124arginine/ornithine ABC transporter ATP-binding
BMA10247_1740210-1.387323arginine/ornithine ABC transporter permease
BMA10247_1741210-0.307870IS407A, transposase OrfB
BMA10247_1742191.309457IS407A, transposase OrfA
BMA10247_17431101.821551non-hemolytic phospholipase C
BMA10247_1744-1123.021613hypothetical protein
25    BMA10247_1823    BMA10247_1848    Y    Y    Y    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
BMA10247_1823-411-3.026534hypothetical protein
BMA10247_1824-210-3.281513nicotinate phosphoribosyltransferase
BMA10247_1825220-2.592982hypothetical protein
BMA10247_1826114-1.683531ferredoxin
BMA10247_1829-1110.559250**CreA protein
BMA10247_1831-113-0.072072hypothetical protein
BMA10247_1832217-1.124050hypothetical protein
BMA10247_1833016-2.516344hypothetical protein
BMA10247_1834021-3.328967hypothetical protein
BMA10247_1835023-3.168835multidrug efflux pump NorM
BMA10247_1836331-5.673169IS407A, transposase OrfA
BMA10247_1837234-5.958776IS407A, transposase OrfB
BMA10247_1838236-6.526733polysaccharide synthase
BMA10247_1839343-7.727791glycoside hydrolase family protein
BMA10247_1840141-7.623175NAD-dependent epimerase/dehydratase
BMA10247_1841243-9.101223glycosyl transferase family protein
BMA10247_1842343-8.999708glycosyl transferase family protein
BMA10247_1843442-9.779169O-antigen methyl transferase
BMA10247_1844542-9.352728glycosyl transferase family protein
BMA10247_1845543-9.695549NAD-dependent epimerase/dehydratase
BMA10247_1846539-9.389070O-antigen acetylase
BMA10247_1847330-7.661889lipopolysaccharide ABC transporter ATP-binding
BMA10247_1848124-5.575041polysaccharide ABC transporter permease
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
BMA10247_1838    NUCEPIMERASE    72    8e-16    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 72.1 bits (177), Expect = 8e-16
Identities = 53/301 (17%), Positives = 108/301 (35%), Gaps = 50/301 (16%)

Query: 288 VMVTGAGGSIGSELCRQILKFQPAQLIAFD-LSEYAMYRLTEELRERFPDLPVVPIIGDA 346
+VTGA G IG + +++L+ Q++ D L++Y L + E D
Sbjct: 3 YLVTGAAGFIGFHVSKRLLE-AGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 347 KDSLLLDQVMSRYAPHIVFHAAAYKHVPLMEELNAWQALRNNVLGTYRVARAAIRHDVRH 406
D + + + VF + V E N +N+ G + + ++H
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLE-NPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 407 FVLIST---------------DKAVNPTNVMGASKRLAE-MACQALQQTSARTQFETV-- 448
+ S+ D +P ++ A+K+ E MA S
Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTY----SHLYGLPATGL 176

Query: 449 RFGNVLGSAGS---VIPKFQQQIAKGGPVTV-THPEITRFFMTIPEASQLVLQA------ 498
RF V G G + KF + + +G + V + ++ R F I + ++ +++
Sbjct: 177 RFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPH 236

Query: 499 ------------SSMGQGGEIFILDMGEPVKIVDLARDLIRLYGFTEEQIRIEFSGLRPG 546
++ ++ + PV+++D + L G + + L+PG
Sbjct: 237 ADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGI---EAKKNMLPLQPG 293

Query: 547 E 547
+
Sbjct: 294 D 294


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
BMA10247_1840    NUCEPIMERASE    104    5e-28    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 104 bits (262), Expect = 5e-28
Identities = 67/344 (19%), Positives = 130/344 (37%), Gaps = 42/344 (12%)

Query: 3 RVIVTGANGFVGRALCRALLAAGHEVTGL-------------VRRRGVCAEGVSEWVHEA 49
+ +VTGA GF+G + + LL AGH+V G+ R + G H+
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQ--FHKI 59

Query: 50 D--DFDGVADRWPAGLQVDAVVHLAARVHMMRDRSPDPDAAFRASNVAATMRVARAAQQQ 107
D D +G+ D + +G + V R+ + S + A+ SN+ + + +
Sbjct: 60 DLADREGMTDLFASG-HFERVFISPHRLAV--RYSLENPHAYADSNLTGFLNILEGCRHN 116

Query: 108 GARRFVFLS--SVKAIAESDGGTPLCE-NSTPAPQDAYGRSKLEAERALEQLRDELSFDT 164
+ ++ S SV + P +S P Y +K E
Sbjct: 117 KIQHLLYASSSSVYGLNRKM---PFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPA 173

Query: 165 VIVRPPLVYGPGVRAN--FLSLMRAVSRGVPLPL-GAVRARRSMVYVDNLADAVMRCVTE 221
+R VYGP R + +A+ G + + + +R Y+D++A+A++R
Sbjct: 174 TGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDV 233

Query: 222 PAATNGCFHVADSDMPPTIAEL-LDDIGHHLGRPARLLPVPERLLRVAGALTGRAAQ--- 277
+ + V +IA + +IG+ P L+ ++ G A+
Sbjct: 234 IPHADTQWTVETGTPAASIAPYRVYNIGN--SSPVELM----DYIQALEDALGIEAKKNM 287

Query: 278 IDRLTSDLR---LDTTHIRTVLDWRPPRSSEEGLAETACWFKSL 318
+ D+ DT + V+ + P + ++G+ W++
Sbjct: 288 LPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDF 331


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
BMA10247_1845    NUCEPIMERASE    168    2e-51    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 168 bits (426), Expect = 2e-51
Identities = 83/363 (22%), Positives = 136/363 (37%), Gaps = 58/363 (15%)

Query: 6 KILVTGGAGFIGCAISERLAARASRYVVMDNLHPQIHASAVRPGALHEKAE----LVVAD 61
K LVTG AGFIG +S+RL + V +DNL+ + +++ L A+ D
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDY-YDVSLKQARLELLAQPGFQFHKID 60

Query: 62 VTDAGAWDALLSDFQPEIIIHLAAETGTGQSLTEASRHALVNVVGTTRLTDALVKHGIVV 121
+ D L + E + SL +A N+ G + + + +
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNK--I 118

Query: 122 EHILLTSSRAVYGEGAWQKDDGTIVYPGQRGRAQLEAAQWDFPGMTMLPSRADRTEPRPT 181
+H+L SS +VYG +P D + P
Sbjct: 119 QHLLYASSSSVYGLN------------------------------RKMPFSTDDSVDHPV 148

Query: 182 SVYGATKLAQEHVLRAWSLATKTPLSILRLQNVYGPGQSLTNSYTGIVALFSRLAREKKV 241
S+Y ATK A E + +S P + LR VYGP + F++ E K
Sbjct: 149 SLYAATKKANELMAHTYSHLYGLPATGLRFFTVYGPWGRPDMALF----KFTKAMLEGKS 204

Query: 242 IPLYEDGNVTRDFVSIDDVADAIVATLVRTPEA-----------------LSLFDIGSGQ 284
I +Y G + RDF IDD+A+AI+ P A +++IG+
Sbjct: 205 IDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSS 264

Query: 285 ATSILDMARIIAAHYGAPEPQINGAFRDGDVRHAACDLSESLANLGWKPQWSLKRGIGEL 344
++D + + G + + GDV + D +G+ P+ ++K G+
Sbjct: 265 PVELMDYIQALEDALGIEAKKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNF 324

Query: 345 QTW 347
W
Sbjct: 325 VNW 327


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
BMA10247_1848    ABC2TRNSPORT    30    0.007    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 30.3 bits (68), Expect = 0.007
Identities = 16/59 (27%), Positives = 24/59 (40%)

Query: 195 LFTMVLMFLSPVFYPASALPEKYRFWLELNPLTLFIEQSRGILLEGRVPDFHPLGLAFL 253
L ++FLS +P LP ++ PL+ I+ R I+L V D A
Sbjct: 184 LVITPILFLSGAVFPVDQLPIVFQTAARFLPLSHSIDLIRPIMLGHPVVDVCQHVGALC 242


26    BMA10247_1861    BMA10247_1876    Y    Y    N    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
BMA10247_1861526-1.374173hypothetical protein
BMA10247_1862527-1.644659phosphomethylpyrimidine kinase ThiD
BMA10247_1863832-3.195694molecular chaperone GroEL
BMA10247_18641239-3.537190co-chaperonin GroES
BMA10247_18651240-3.113880hypothetical protein
BMA10247_1866932-2.565236hypothetical protein
BMA10247_1867431-3.095682hypothetical protein
BMA10247_1869329-2.763021hypothetical protein
BMA10247_1868223-1.623035hypothetical protein
BMA10247_1870118-1.309839transcriptional regulator family protein
BMA10247_1871114-1.373165alcohol dehydrogenase
BMA10247_1872213-1.886334hypothetical protein
BMA10247_1874413-3.171822hypothetical protein
BMA10247_1873313-3.486441hypothetical protein
BMA10247_1875212-3.601094OmpW family outer membrane protein
BMA10247_1876111-3.043305lipoprotein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
BMA10247_1866    PYOCINKILLER    30    0.028    Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 30.1 bits (67), Expect = 0.028
Identities = 45/160 (28%), Positives = 64/160 (40%), Gaps = 8/160 (5%)

Query: 477 ANALSVANPAALTAAANTVAGTLARAANGTPVAGAIGGLVAALPVANPAGALTSAANNAA 536
A A A A AA A T A ANG+ VA A G + VA A +L A ++A
Sbjct: 228 AEAKRKAEEQARQQAAIRAANTYAMPANGSVVATAAGR--GLIQVAQGAASLAQAISDAI 285

Query: 537 STIATVAGTNPAAAIGGVAGALTGAAGTGVATASQLGSVGSALMGSGAASAGKVLTSGSA 596
+ + V + P+ G A +LT ++ T Q +G AA G +
Sbjct: 286 AVLGRVLASAPSVMAVGFA-SLTYSSRTAEQWQDQTPDSVRYALGMDAAKLGLPPSVNLN 344

Query: 597 AFGSAAASAG-----SLLTTGAAATSSVVNSLGSSVGAVV 631
A A+ + + G T SVV++ G SV V
Sbjct: 345 AVAKASGTVDLPMRLTNEARGNTTTLSVVSTDGVSVPKAV 384


27    BMA10247_1946    BMA10247_1958    Y    Y    Y    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
BMA10247_1946312-2.361641tolQ protein
BMA10247_1947613-2.496539TolR protein
BMA10247_1948714-3.219040TolA protein
BMA10247_1949618-4.510117translocation protein TolB
BMA10247_1950820-5.375679peptidoglycan-associated outer-membrane
BMA10247_19511020-5.305262tol-pal system protein YbgF
BMA10247_19531125-5.951331*IS407A, transposase OrfA
BMA10247_1954823-5.440206IS407A, transposase OrfB
BMA10247_1955215-4.736629hypothetical protein
BMA10247_1956215-4.793650isrso15-transposase
BMA10247_1957-110-4.255099outer membrane porin
BMA10247_1958-18-3.325898ISBma1, transposase
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
BMA10247_1948    IGASERPTASE    60    4e-12    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 60.5 bits (146), Expect = 4e-12
Identities = 34/189 (17%), Positives = 68/189 (35%), Gaps = 10/189 (5%)

Query: 46 QNSTPAGAEAELWTSVPDTSTPQPAPTPPVKVAPPPPPVKNEEADIALQQKRREQQAAAA 105
+TP +A++ SVP + V PP P +E + + ++E +
Sbjct: 996 NITTPNNIQADV-PSVPSNNEEIARVDEAP-VPPPAPATPSETTETVAENSKQESKTVEK 1053

Query: 106 REAQLEEQRRQQQLKAQQ-----LAAQQAAQLAAQKAAEREKQKQAEKLKQQQLAEQQQR 160
E E Q + A++ A Q ++A + +E Q K E++ +
Sbjct: 1054 NEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAK 1113

Query: 161 KLEQQKLEQQKLEQQ---KKQEQLAAQKKADAEKAEKAEKAAKAAAAAKANAAAKAKLDK 217
++ E K+ Q K+++ Q +A+ + K + A + K
Sbjct: 1114 VETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAK 1173

Query: 218 ERQARLAQL 226
E + + Q
Sbjct: 1174 ETSSNVEQP 1182



Score = 41.2 bits (96), Expect = 6e-06
Identities = 18/132 (13%), Positives = 44/132 (33%), Gaps = 8/132 (6%)

Query: 93 LQQKRREQQAAAAREAQLEEQRRQQQLKAQQLAAQQAAQLAAQKAAEREKQKQ--AEKLK 150
Q Q + +++A A + A + + AE K
Sbjct: 988 RNQTVDTTNITTPNNIQADVPSVPS--NNEEIARVDEAPVPPPAPATPSETTETVAENSK 1045

Query: 151 QQQLAEQQQRKLEQQKLEQQKLEQQKKQEQLAAQKKADAEKAEKAEKAAKAAAAAKANAA 210
Q+ ++ Q + + ++ ++ + KA+ + E A+ ++
Sbjct: 1046 QESKTVEKNE----QDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETK 1101

Query: 211 AKAKLDKERQAR 222
A ++KE +A+
Sbjct: 1102 ETATVEKEEKAK 1113



Score = 33.1 bits (75), Expect = 0.002
Identities = 19/145 (13%), Positives = 44/145 (30%), Gaps = 7/145 (4%)

Query: 87 EEADIALQQKRREQQAAAAREAQLEEQRRQQQLKAQQLAAQQAAQLAAQKAAERE----- 141
+EA ++ + + A + E Q + + A ++A + +
Sbjct: 1070 KEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQV 1129

Query: 142 --KQKQAEKLKQQQLAEQQQRKLEQQKLEQQKLEQQKKQEQLAAQKKADAEKAEKAEKAA 199
KQ+Q+E ++ Q ++ K Q + EQ A + ++ E+
Sbjct: 1130 SPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTV 1189

Query: 200 KAAAAAKANAAAKAKLDKERQARLA 224
+ N +
Sbjct: 1190 NTGNSVVENPENTTPATTQPTVNSE 1214


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
BMA10247_1950    OMPADOMAIN    100    5e-28    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 100 bits (250), Expect = 5e-28
Identities = 32/137 (23%), Positives = 56/137 (40%), Gaps = 10/137 (7%)

Query: 33 QGDAVSTQPNPENVAQVTVDPLNDPNSPLAKRSVYFDFDSYSVQDQYQALLQQHAQYLKS 92
QG+A A S V F+F+ +++ + QA L Q L +
Sbjct: 193 QGEAAPVVAPAPAPAPEVQTKHFTLKS-----DVLFNFNKATLKPEGQAALDQLYSQLSN 247

Query: 93 HPQRH--ILIQGNTDERGTSEYNLALGQKRAEAVRRALSLLGVGDAQMEAVSLGKEKPVA 150
+ +++ G TD G+ YN L ++RA++V L G+ ++ A +G+ PV
Sbjct: 248 LDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQSVVDYLISKGIPADKISARGMGESNPVT 307

Query: 151 LGHDEASWAQNRRADLV 167
+RA L+
Sbjct: 308 ---GNTCDNVKQRAALI 321


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
BMA10247_1957    ECOLNEIPORIN    127    4e-36    E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 127 bits (322), Expect = 4e-36
Identities = 89/386 (23%), Positives = 143/386 (37%), Gaps = 62/386 (16%)

Query: 1 MKKTLIVAALSGVFATAAHAQSSVTLYGLIDAGITYTNNQGGHSAWS-----QSTGSVNG 55
MKK+LI L+ + A + VTLYG I AG+ + + + A + + G
Sbjct: 1 MKKSLIALTLAALPVAAM---ADVTLYGTIKAGVETSRSVAHNGAQAASVETGTGIVDLG 57

Query: 56 SRWGLRGAEDLGGGLKAIFVLENGFGINNGTLKQNGREFGRQAFVGLSHEQYGALTLGRQ 115
S+ G +G EDLG GLKAI+ +E I + RQ+F+GL +G L +GR
Sbjct: 58 SKIGFKGQEDLGNGLKAIWQVEQKASIAGT----DSGWGNRQSFIGLK-GGFGKLRVGRL 112

Query: 116 YDSVVDYLG--PLSLTGTQFGGTQFAHPFDNDNLNNSFRINNAVKYTSVNWAGLKFGALY 173
+ D P G + A P + S V+Y S +AGL Y
Sbjct: 113 NSVLKDTGDINPWDSKSDYLGVNKIAEP---EARLIS------VRYDSPEFAGLSGSVQY 163

Query: 174 GFSNNNQFANNRAYSAGVSYSYAGFNIGAGYLQLNNNFGPTVSNASGAVALDNTFVGKRQ 233
++N N+ +Y AG +Y GF + G ++ ++
Sbjct: 164 ALNDNAGRHNSESYHAGFNYKNGGFFVQYGGAYKRHHQV------------QENVNIEKY 211

Query: 234 RVFGGGLNYTFGPATAGFVFTQSRVNRATAIGAGASGVSSGIALDGTFMRFNNYEVNARY 293
++ Y A + + A + S S RF N
Sbjct: 212 QIHRLVSGYD---NDALYASVAVQQQDAKLVEENYSHNSQTEVAATLAYRFG----NVTP 264

Query: 294 AITPAWTVAGSYTYTAGFIENHHPGWNQFNLQTAYALSKRTDVYLQGVYQKVNNDGTGLG 353
++ A GS+ T N++ ++Q + Y SKRT + + + +G G
Sbjct: 265 RVSYAHGFKGSFDAT-----NYNNDYDQVVVGAEYDFSKRTSALVSAGWLQ---EGKGES 316

Query: 354 AYINGIGGMSSTEKQIAVTAGLRHRF 379
++ A GLRH+F
Sbjct: 317 KFV-----------STAGGVGLRHKF 331


28    BMA10247_2081    BMA10247_2096    Y    Y    Y    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
BMA10247_2081211-0.197332LysR family transcriptional regulator
BMA10247_20822110.167630hypothetical protein
BMA10247_20832100.141241LysR family transcriptional regulator
BMA10247_2084211-0.467684hypothetical protein
BMA10247_208519-0.758594DNA topoisomerase IV subunit A
BMA10247_2086012-1.555709DNA topoisomerase IV subunit B
BMA10247_2087015-2.226641ABC transporter ATP-binding protein
BMA10247_2088427-4.130668hypothetical protein
BMA10247_2089428-3.702277rubredoxin
BMA10247_2090524-3.873180hypothetical protein
BMA10247_2091320-3.737629alpha/beta hydrolase
BMA10247_2093319-3.797791transposase
BMA10247_2094015-1.503716transposase subfamily protein
BMA10247_2095117-1.662150hypothetical protein
BMA10247_2096217-2.553097ISBma2, transposase
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
BMA10247_2085    GPOSANCHOR    31    0.023    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 30.8 bits (69), Expect = 0.023
Identities = 19/52 (36%), Positives = 28/52 (53%), Gaps = 7/52 (13%)

Query: 460 ARLEKIKIEKELEELRAEKAKLEELLANESAMKRLMIKE-------IEADAK 504
+R K ++EK LEE ++ A LE+L K+L KE +EA+AK
Sbjct: 391 SREAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEAK 442


29    BMA10247_2117    BMA10247_2186    Y    Y    Y    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
BMA10247_2117419-1.796782hypothetical protein
BMA10247_2118110-3.648705hypothetical protein
BMA10247_2119112-3.325688transposase subfamily protein
BMA10247_2120015-2.153330transposase
BMA10247_21215160.637625hypothetical protein
BMA10247_21226180.787105hypothetical protein
BMA10247_21232150.380893elongation factor G
BMA10247_21247221.703907hypothetical protein
BMA10247_21257211.756564hypothetical protein
BMA10247_21267191.977206hypothetical protein
BMA10247_21273170.563660hypothetical protein
BMA10247_21284170.805920aldo/keto reductase
BMA10247_21305183.993933GntR family transcriptional regulator
BMA10247_21296186.283948hypothetical protein
BMA10247_21312123.530621hypothetical protein
BMA10247_21322123.741525hypothetical protein
BMA10247_21332112.847420hypothetical protein
BMA10247_21351102.959048citrate synthase
BMA10247_21342111.200593hypothetical protein
BMA10247_21363101.790953DHA2 family drug:H+ antiporter-1
BMA10247_2137-193.028085hypothetical protein
BMA10247_2138-1112.651193cyclic diguanylate phosphodiesterase
BMA10247_2139-1124.150390hypothetical protein
BMA10247_2140-1142.201264hypothetical protein
BMA10247_21410141.5912462-dehydropantoate 2-reductase
BMA10247_2142014-0.411633hypothetical protein
BMA10247_21431110.944904transcriptional regulator
BMA10247_21442100.473462chromate transporter
BMA10247_21452110.148219IS407A, transposase OrfB
BMA10247_21461101.557575IS407A, transposase OrfA
BMA10247_2148-1102.480244superoxide dismutase
BMA10247_2149-192.763939exodeoxyribonuclease VII large subunit
BMA10247_2150110-0.551707hypothetical protein
BMA10247_215108-0.991413tetraacyldisaccharide 4'-kinase
BMA10247_2152214-3.867534hypothetical protein
BMA10247_2153-19-3.7505803-deoxy-manno-octulosonate cytidylyltransferase
BMA10247_2154212-4.977203adenylate kinase
BMA10247_2155314-5.516865ISBma2, transposase
BMA10247_2156112-5.468803cold-shock domain-contain protein
BMA10247_2157011-3.684356ATP-dependent Clp protease adaptor protein ClpS
BMA10247_215809-2.008151ATP-dependent Clp protease ATP-binding subunit
BMA10247_2160013-0.229393IS407A, transposase OrfB
BMA10247_2161-191.934452IS407A, transposase OrfA
BMA10247_21620111.013134hypothetical protein
BMA10247_21630121.111066hypothetical protein
BMA10247_2164115-0.1128128-amino-7-oxononanoate synthase
BMA10247_2165218-0.357681UDP-3-O-[3-hydroxymyristoyl] N-acetylglucosamine
BMA10247_2166220-1.030593type I polyketide synthase WcbR
BMA10247_2167542-7.205740capsular polysaccharide biosynthesis protein
BMA10247_2168540-6.959665short chain dehydrogenase/reductase
BMA10247_2169641-9.111572capsule polysaccharide biosynthesis/ export
BMA10247_2170536-8.883332hypothetical protein
BMA10247_2171538-9.320662D-glycero-D-manno-heptose 1,7-bisphosphate
BMA10247_2172439-10.394477D-glycero-D-manno-heptose 1-phosphate
BMA10247_2173440-10.930626phosphoheptose isomerase
BMA10247_2174441-11.168233D-glycero-D-manno-heptose 7-phosphate kinase
BMA10247_2175444-11.715597GDP-D-mannose dehydratase
BMA10247_2176450-12.520760capsular polysaccharide biosynthesis protein
BMA10247_2177553-13.265082capsular polysaccharide biosynthesis protein
BMA10247_2178652-13.209343glycoside hydrolase family protein
BMA10247_2179655-13.279024capsular polysaccharide biosynthesis protein
BMA10247_2180653-12.681264capsular polysaccharide biosynthesis protein
BMA10247_2181751-11.803625glycoside hydrolase family protein
BMA10247_2182745-9.283066capsular polysaccharide export ABC transporter
BMA10247_2183539-7.890248capsular polysaccharide export inner-membrane
BMA10247_2184334-6.549238capsule polysaccharide exporter
BMA10247_2185127-4.975359capsular polysaccharide biosynthesis/export
BMA10247_2186-123-4.527935glycosyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
BMA10247_2123    TCRTETOQM    624    0.0    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 624 bits (1610), Expect = 0.0
Identities = 172/683 (25%), Positives = 295/683 (43%), Gaps = 75/683 (10%)

Query: 9 RYRNIGISAHIDAGKTTTTERILFYTGVSHKIGEVHDGAATMDWMEQEQERGITITSAAT 68
+ NIG+ AH+DAGKTT TE +L+ +G ++G V G D E++RGITI + T
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61

Query: 69 TAFWKGMAGNYPEHRINIIDTPGHVDFTIEVERSMRVLDGACMVYDSVGGVQPQSETVWR 128
+ W+ ++NIIDTPGH+DF EV RS+ VLDGA ++ + GVQ Q+ ++
Sbjct: 62 SFQWEN-------TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFH 114

Query: 129 QANKYKVPRIAFVNKMDRVGADFFRVQRQIGERLKGVAVPIQIPVGAEEHFQGVVDLVKM 188
K +P I F+NK+D+ G D V + I E+L V Q V M
Sbjct: 115 ALRKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQ----------KVELYPNM 164

Query: 189 KAIVWDDESQGVKFTYEDIPANLVELAHEWREKMVEAAAEASEELLEKYLTDHNSLTEDE 248
+ + Q + E +++LLEKY+ SL E
Sbjct: 165 CVTNFTESEQ------------------------WDTVIEGNDDLLEKYM-SGKSLEALE 199

Query: 249 IKAALRKRTIANEIVPMLCGSAFKNKGVQAMLDAVIDYLPSPADVPAILGHDLDDKEAER 308
++ R + P+ GSA N G+ +++ + + S
Sbjct: 200 LEQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITNKFYSSTH---------------- 243

Query: 309 HPSDDEPFSALAFKIMTDPFVGQLIFFRVYSGVVESGDTLLNATKDKKERLGRILQMHAN 368
FKI +L + R+YSGV+ D++ + K+K ++ +
Sbjct: 244 --RGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKI-KITEMYTSING 300

Query: 369 ERKEIKEVRAGDIAAAVG--LK-EATTGDTLCDPGKPIILEKMEFPEPVISQAVEPKTKA 425
E +I + +G+I LK + GDT P + E++E P P++ VEP
Sbjct: 301 ELCKIDKAYSGEIVILQNEFLKLNSVLGDTKLLPQR----ERIENPLPLLQTTVEPSKPQ 356

Query: 426 DQEKMGLALNRLAQEDPSFRVQTDEESGQTIISGMGELHLEIIVDRMKREFGVEATVGKP 485
+E + AL ++ DP R D + + I+S +G++ +E+ ++ ++ VE + +P
Sbjct: 357 QREMLLDALLEISDSDPLLRYYVDSATHEIILSFLGKVQMEVTCALLQEKYHVEIEIKEP 416

Query: 486 QVAYRETVRTVAEDVEGKFVKQSGGRGQYGHAVIKLEPNP-GKGYEFLDEIKGGVIPREF 544
V Y E + E + + + + P P G G ++ + G + + F
Sbjct: 417 TVIYME---RPLKKAEYTIHIEVPPNPFWASIGLSVSPLPLGSGMQYESSVSLGYLNQSF 473

Query: 545 IPAVNKGIEETLKSGVLAGYPVVDVKVHLTFGSYHDVDSNENAFRMAGSMAFKEAMRRAK 604
AV +GI + G L G+ V D K+ +G Y+ S FRM + ++ +++A
Sbjct: 474 QNAVMEGIRYGCEQG-LYGWNVTDCKICFKYGLYYSPVSTPADFRMLAPIVLEQVLKKAG 532

Query: 605 PVLLEPMMAVEVETPEDFMGNVMGDLSSRRGIVQGMEDIAGGGGKLVRAEVPLAEMFGYS 664
LLEP ++ ++ P++++ D + + ++ E+P + Y
Sbjct: 533 TELLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQ--LKNNEVILSGEIPARCIQEYR 590

Query: 665 TSLRSATQGRATYTMEFKHYAET 687
+ L T GR+ E K Y T
Sbjct: 591 SDLTFFTNGRSVCLTELKGYHVT 613


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
BMA10247_2136    TCRTETB    138    3e-38    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 138 bits (349), Expect = 3e-38
Identities = 92/408 (22%), Positives = 171/408 (41%), Gaps = 15/408 (3%)

Query: 17 VMLWLVATGFFMQTLDATIVNTALPSMAASLGESPLRMQSVVIAYSLTMAVMIPVSGWLA 76
+++WL FF L+ ++N +LP +A + P V A+ LT ++ V G L+
Sbjct: 15 ILIWLCILSFF-SVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLS 73

Query: 77 DTLGTRRVFFSAILIFTLGSLLCANAHT-LPLLVAFRVIQGVGGAMLLPVGRLAVLRTFP 135
D LG +R+ I+I GS++ H+ LL+ R IQG G A + + V R P
Sbjct: 74 DQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIP 133

Query: 136 AERYLPALSFVAIPGLIGPLIGPTLGGWLVKIASWHWIFLINVPVGIAGCIATFYSMPDS 195
E A + +G +GP +GG + HW +L+ +P+ + +
Sbjct: 134 KENRGKAFGLIGSIVAMGEGVGPAIGGMIAH--YIHWSYLLLIPMITIITVPFLMKLLKK 191

Query: 196 RNPAAGRFDLKGYLLLTIGMIAISLSLDGLADLGMQHAMVLVLLILSLACFVAYGLYAVR 255
G FD+KG +L+++G++ L L++ +LS FV + +
Sbjct: 192 EVRIKGHFDIKGIILMSVGIVFFMLFTT------SYSISFLIVSVLSFLIFVKHIR---K 242

Query: 256 APQPIFSLELFGIHTFSVGLLGNLFARIGSGAMPYLIPLLLQVSLGYGAFEAG-LMMLPV 314
P L F +G+L ++P +++ E G +++ P
Sbjct: 243 VTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPG 302

Query: 315 AAAGMFSKRIITVLITRHGYRKVLLANTIMVGLMMASFALVSDAMPTWLKIAQLALFGGF 374
+ + I +L+ R G VL + + + + + + ++ I + + GG
Sbjct: 303 TMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGL 362

Query: 375 NSMQFTAMNTLTLKDLGTGGASSGNSLFSLVQMLSMSLGVTVAGALLA 422
+ + T ++T+ L A +G SL + LS G+ + G LL+
Sbjct: 363 SFTK-TVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
BMA10247_2152    SECA    25    0.027    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 24.8 bits (54), Expect = 0.027
Identities = 14/36 (38%), Positives = 20/36 (55%), Gaps = 5/36 (13%)

Query: 33 KLAYPIRDGIPVMLVDEARQTVEGTPVDPAGPARGR 68
KL Y + D + +L+DEAR TP+ +GPA
Sbjct: 202 KLHYALVDEVDSILIDEAR-----TPLIISGPAEDS 232


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_2158  HTHFIS  37  3e-04  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 37.1 bits (86), Expect = 3e-04
Identities = 33/143 (23%), Positives = 59/143 (41%), Gaps = 9/143 (6%)

Query: 152 AKPADANAEGEDAGAQKETPLAQFTQNLNQMAKDGR-IDPLIGRESEVERVVQVLCR--R 208
KP D + LA+ + +++ D + PL+GR + ++ + +VL R +
Sbjct: 103 PKPFDL----TELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQ 158

Query: 209 RKNNPLLVGEAGVGKTAIAEGLAYRITRGEVPDILANAQVYSLD-MGALLAGTKYRGDFE 267
++ GE+G GK +A L R P + N D + + L G + +G F
Sbjct: 159 TDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHE-KGAFT 217

Query: 268 QRLKTVLKELKERPHAILFIDEI 290
++ LF+DEI
Sbjct: 218 GAQTRSTGRFEQAEGGTLFLDEI 240



Score = 32.5 bits (74), Expect = 0.006
Identities = 39/183 (21%), Positives = 63/183 (34%), Gaps = 32/183 (17%)

Query: 446 QDDRSKLQTLDRDLKSVVFGQDPAIDALAAAIKMARAGLGKLDKPIGAFLFSGPTGVGKT 505
+ SKL+ +D +V G+ A+ + + + D + + +G +G GK
Sbjct: 123 KRRPSKLEDDSQDGMPLV-GRSAAMQEIYRVLARL----MQTDLTL---MITGESGTGKE 174

Query: 506 EVAR---QLAFTLGIELIRFDMSEYMERHAVSRLIGAPPGYVGFDQGGLLTEAVTKKPHC 562
VAR + +M+ S L G + G T A T+
Sbjct: 175 LVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGH--------EKGAFTGAQTRSTGR 226

Query: 563 V-------LLLDEIEKAHPDIFNVLLQVMDHGTLT---DNNGRKADFRNVIIIMTTNAGA 612
L LDEI D LL+V+ G T ++D R I+ TN
Sbjct: 227 FEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVR---IVAATNKDL 283

Query: 613 ESM 615
+
Sbjct: 284 KQS 286


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_2166  DHBDHDRGNASE  39  1e-04  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 39.3 bits (91), Expect = 1e-04
Identities = 28/123 (22%), Positives = 48/123 (39%), Gaps = 5/123 (4%)

Query: 2142 LVVGGTGGLGFASARWMVERGARRLTLASRSGELAVAARDEIECWRATLGVAVDIVSCDV 2201
+ G G+G A AR + +GA + +L + DV
Sbjct: 12 FITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSL-----KAEARHAEAFPADV 66

Query: 2202 TDAAAVDAMIAAIVRRDIPLKGVLHSAMTIDDGLVRNLDDARMAAVLAPKVAGAWNLHRA 2261
D+AA+D + A I R P+ +++ A + GL+ +L D A + G +N R+
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126

Query: 2262 TRS 2264

Sbjct: 127 VSK 129


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_2168  DHBDHDRGNASE  70  2e-16  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 70.5 bits (172), Expect = 2e-16
Identities = 66/249 (26%), Positives = 101/249 (40%), Gaps = 26/249 (10%)

Query: 12 ITGASAGLGRALARAYARPGVVLSLGGRDAVRLEESAADCRARGATVFVASIDVRDADAM 71
ITGA+ G+G A+AR A G ++ + +LE+ + +A DVRD+ A+
Sbjct: 13 ITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSAAI 72

Query: 72 R----RWLEQFDDAHPIHLLIANAGVASTLAHGGDWEARERTAAIVDTNFYGAMNAVLPV 127
R + PI +L+ AGV + E A N G NA V
Sbjct: 73 DEITARIEREMG---PIDILVNVAGVLRPGLI--HSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 128 IDRMRARGSGQVALISSLAALRGMAISPAYCASKAALKAWGDSVRPVLKRDGIRLSVVLP 187
M R SG + + S A AY +SKAA + + L IR ++V P
Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187

Query: 188 GFVKTAMSDVFPADKPLLWSPDKAAQYIQRGIAARRAEIAFPALLALGMRLLPLL-PAVM 246
G +T M + LW+ + A+ + +G G+ L L P+ +
Sbjct: 188 GSTETDM-------QWSLWADENGAEQVIKGSLET---------FKTGIPLKKLAKPSDI 231

Query: 247 ADAILGRLS 255
ADA+L +S
Sbjct: 232 ADAVLFLVS 240


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_2175  NUCEPIMERASE  129  4e-37  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 129 bits (326), Expect = 4e-37
Identities = 80/352 (22%), Positives = 136/352 (38%), Gaps = 52/352 (14%)

Query: 4 RVLITGITGMVGSHLADFLLENTDWEIYGLCRWRSPLDNV-SHLLPRINEKNRIRL---- 58
+ L+TG G +G H++ LLE ++ G+ DN+ + + + L
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGH-QVVGI-------DNLNDYYDVSLKQARLELLAQPG 53

Query: 59 ---VYGDLRDYLSIHEAVKQSTPDFVFHLAAQSYPKTSFDSPLDTLETNVQGTANVLEAL 115
DL D + + + VF + + S ++P ++N+ G N+LE
Sbjct: 54 FQFHKIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGC 113

Query: 116 RKNNIDAVTHVCASSEVFGRVPREKLPIDEE-CTFHPASPYAISKVGTDLIGRYYAEAYN 174
R N I + +SS V+G K+P + HP S YA +K +L+ Y+ Y
Sbjct: 114 RHNKIQHLL-YASSSSVYGL--NRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYG 170

Query: 175 MTVMTTRMFTHTGPR-RGDVFAESTFAKQIAMIERGLIPPVVKTGNLDSLRTFADVRDAV 233
+ R FT GP R D+ A F K + G V G + R F + D
Sbjct: 171 LPATGLRFFTVYGPWGRPDM-ALFKFTKAML---EGKSIDVYNYGKM--KRDFTYIDDIA 224

Query: 234 RAYYMLVTINPI-----------------PGAYYNIGGTYSCTVGQMLDTLISMSTSKDV 276
A L + P P YNIG + + + L +D
Sbjct: 225 EAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQAL------EDA 278

Query: 277 IRVETDPE--RLRPIDADLQVPNTRKFEAVTGWKPEISFEKTMEDLLNYWRA 326
+ +E L+P D +T+ V G+ PE + + +++ +N++R
Sbjct: 279 LGIEAKKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRD 330


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_2176  NUCEPIMERASE  45  1e-07  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 44.8 bits (106), Expect = 1e-07
Identities = 59/332 (17%), Positives = 105/332 (31%), Gaps = 82/332 (24%)

Query: 1 MKVFLVGSTGYIGKTLFDA-CSRRWRTLGT-STRDGADIVFSLARAEAFPYEQVSA--GD 56
MK + G+ G+IG + + +G + D D+ AR E D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 57 ------------------VVAVAA------AISSPDACAKDYETAFQVNVTGTLTLIRGV 92
V ++ +P A Y N+TG L ++ G
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHA----Y---ADSNLTGFLNILEG- 112

Query: 93 VARGA---RVIFFSSDTVYGASEQLLSEEAELT--PAGAYGAMKRRVEA---ELGENAAV 144
R +++ SS +VYG + ++ + P Y A K+ E +
Sbjct: 113 -CRHNKIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGL 171

Query: 145 KVIRLSY--VFSLRDR-------FTQYLLGCAKEGKRADIFK--PFSRCVVYLSDVVEGV 193
L + V+ R FT+ +L EGK D++ R Y+ D+ E +
Sbjct: 172 PATGLRFFTVYGPWGRPDMALFKFTKAML----EGKSIDVYNYGKMKRDFTYIDDIAEAI 227

Query: 194 VSLIE-------RWD---------AIDERVINFVGPELVAREDFVEKIRNLAAPELDYGF 237
+ L + +W RV N V D+++ + + E
Sbjct: 228 IRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNM 287

Query: 238 SEP-EGDFFVNRPRIINVSSARFEKLLGRRPR 268
GD + + +++G P
Sbjct: 288 LPLQPGDVLETSA---DTKALY--EVIGFTPE 314


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_2177  PF05043  30  0.007  Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 29.9 bits (67), Expect = 0.007
Identities = 19/76 (25%), Positives = 33/76 (43%), Gaps = 7/76 (9%)

Query: 48 RHLEEIGASLRIDIDE---IESWCVDELKSREVGENDGGKQIDISVTDFILANCRQKRLF 104
H + + +L +E W EL+ + D DI +++FI+ KRL
Sbjct: 414 YHAKFVAETLSYYCSNNFELEVW--TELELSKESLED--SPYDIIISNFIIPPIENKRLI 469

Query: 105 YTMNHPTAALMREIAA 120
Y+ N T +L+ + A
Sbjct: 470 YSNNINTVSLIYLLNA 485


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_2183  ABC2TRNSPORT  38  2e-05  ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 38.0 bits (88), Expect = 2e-05
Identities = 32/139 (23%), Positives = 58/139 (41%), Gaps = 7/139 (5%)

Query: 88 MAVTPNLALMYHRNVKVIDIFIARILLEVVGNTASFFVLMITFHALGLVDYPEDILEVMF 147
M M + +++ DI + + + + + ALG + +++
Sbjct: 94 MEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAGIGVVAAALGYTQWLS----LLY 149

Query: 148 AWVMIIWFG---ASLGFIIGALSEKTELVEKLWHPVTYLMFPLSGAIFMVDWLSPAFQKI 204
A +I G ASLG ++ AL+ + V + LSGA+F VD L FQ
Sbjct: 150 ALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQTA 209

Query: 205 VLWLPMVHGVEMLREGYFG 223
+LP+ H ++++R G
Sbjct: 210 ARFLPLSHSIDLIRPIMLG 228


30  BMA10247_2204  BMA10247_2271  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BMA10247_2204  4  13  -1.683531  molecular chaperone DnaJ
BMA10247_2205  7  12  -1.438833  hypothetical protein
BMA10247_2206  4  12  -0.925463  molecular chaperone DnaK
BMA10247_2207  0  11  0.133664  hypothetical protein
BMA10247_2208  -3  10  -0.494720  heat shock protein GrpE
BMA10247_2210  -2  9  0.583284  hypothetical protein
BMA10247_2209  0  11  1.929872  heat shock protein 15
BMA10247_2211  0  12  2.800199  ferrochelatase
BMA10247_2212  1  13  2.925932  heat-inducible transcription repressor
BMA10247_2213  0  14  3.111967  NAD(+)/NADH kinase
BMA10247_2214  0  13  3.147909  DNA repair protein RecN
BMA10247_2215  -1  13  2.500647  bifunctional glutamine-synthetase
BMA10247_2216  -3  9  0.950523  hypothetical protein
BMA10247_2217  0  10  -0.934633  carbon-nitrogen family hydrolase
BMA10247_2218  0  10  -1.247485  TldD/PmbA family protein
BMA10247_2219  0  11  -2.395924  hypothetical protein
BMA10247_2220  0  10  -1.567905  phospho-2-dehydro-3-deoxyheptonate aldolase
BMA10247_2221  1  13  -0.890330  ISBma2, transposase
BMA10247_2222  2  12  0.818703  glutathione-disulfide reductase
BMA10247_2223  2  14  -0.002372  hypothetical protein
BMA10247_2224  1  15  0.144074  argininosuccinate synthase
BMA10247_2225  1  13  2.024750  hypothetical protein
BMA10247_2226  0  13  1.795983  hypothetical protein
BMA10247_2227  -1  12  0.981544  hypothetical protein
BMA10247_2228  1  11  1.262297  heavy metal-binding domain-containing protein
BMA10247_2229  1  11  1.496545  hypothetical protein
BMA10247_2230  0  8  2.253477  copper-translocating P-type ATPase
BMA10247_2231  2  9  2.524534  hypothetical protein
BMA10247_2232  2  8  1.771979  LemA family protein
BMA10247_2233  3  8  2.100252  lipoprotein
BMA10247_2234  2  8  2.028013  hypothetical protein
BMA10247_2236  1  8  1.863790  outer membrane efflux protein
BMA10247_2237  0  7  0.665014  RND family efflux transporter MFP subunit
BMA10247_2238  1  8  0.552408  CzcA family heavy metal RND efflux protein
BMA10247_2239  0  11  1.219989  hypothetical protein
BMA10247_2240  -1  9  0.563883  hypothetical protein
BMA10247_2241  -1  9  0.062862  streptavidin
BMA10247_2242  -1  10  2.212573  glucosamine--fructose-6-phosphate
BMA10247_2243  -2  11  3.401214  UDP-N-acetylglucosamine pyrophosphorylase
BMA10247_2244  -1  11  3.586262  hypothetical protein
BMA10247_2245  -1  11  3.165748  C32 tRNA thiolase
BMA10247_2246  -1  13  4.057430  dihydroneopterin aldolase
BMA10247_2247  -1  13  4.476851  hypothetical protein
BMA10247_2248  -1  14  3.548016  hypothetical protein
BMA10247_2249  0  15  3.416032  hypothetical protein
BMA10247_2251  -1  15  4.013945  fructokinase
BMA10247_2252  -2  14  3.883732  N-acylglucosamine 2-epimerase
BMA10247_2253  -1  11  3.487352  LacI family transcriptional regulator
BMA10247_2254  1  11  1.534061  methyl-accepting chemotaxis protein
BMA10247_2255  3  12  0.025870  sodium/bile acid symporter family protein
BMA10247_2256  6  20  -0.968945  hypothetical protein
BMA10247_2257  2  26  -3.453443  hypothetical protein
BMA10247_2259  2  27  -2.987525  IS407A, transposase OrfA
BMA10247_2260  -2  15  -0.744564  IS407A, transposase OrfB
BMA10247_2261  -2  13  0.845698  hypothetical protein
BMA10247_2262  0  12  1.203433  hypothetical protein
BMA10247_2264  0  12  1.746392  hypothetical protein
BMA10247_2263  -1  14  1.642687  hypothetical protein
BMA10247_2265  -1  14  2.009279  alkaline phosphatase
BMA10247_2266  1  11  2.827875  alkaline phosphatase
BMA10247_2267  3  10  3.043991  hypothetical protein
BMA10247_2268  3  11  2.548963  cutC family protein
BMA10247_2269  1  12  2.304626  biotin synthase
BMA10247_2270  1  12  3.224681  dithiobiotin synthetase
BMA10247_2271  1  13  3.145129  8-amino-7-oxononanoate synthase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_2206  SHAPEPROTEIN  135  3e-37  Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 135 bits (342), Expect = 3e-37
Identities = 81/382 (21%), Positives = 138/382 (36%), Gaps = 71/382 (18%)

Query: 5 IGIDLGTTNSCVAIMEGNQVKVIENSEGARTTPSIIAYMDDNEVL-VGAPAKRQSVTNPK 63
+ IDLGT N+ + + V + R V VG AK+ P
Sbjct: 13 LSIDLGTANTLIYVKGQGIVLNEPSVVAIRQD----RAGSPKSVAAVGHDAKQMLGRTPG 68

Query: 64 NTLFAVKRLIGRRFEEKEVQKDIGLMPYAIIKADNGDAWVEAHGEKLAPPQVSAEVLRK- 122
N + A++ + + V D V+ ++L+
Sbjct: 69 N-IAAIRPM------KDGVIADF---------------------------FVTEKMLQHF 94

Query: 123 MKKTAEDYLGEPVTEAVITVPAYFNDSQRQATKDAGRIAGLEVKRIINEPTAAALAFGLD 182
+K+ + P ++ VP +R+A +++ + AG +I EP AAA+ GL
Sbjct: 95 IKQVHSNSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLP 154

Query: 183 KAEKGDRKIAVYDLGGGTFDVSIIEIADVDGEMQFEVLSTNGDTFLGGEDFDQRIIDYII 242
+E V D+GGGT +V++I + V + +GG+ FD+ II+Y+
Sbjct: 155 VSE--ATGSMVVDIGGGTTEVAVISLNGV---------VYSSSVRIGGDRFDEAIINYVR 203

Query: 243 GEFKKEQGVDLSKDVLALQRLKEAAEKAKIELSSS----QQTEINLPYITADASGPKHLN 298
+ G + AE+ K E+ S+ + EI + P+
Sbjct: 204 RNYGSLIG-------------EATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVPRGFT 250

Query: 299 LKVTRAKLEALVEDLVERTIEPCRTAIKDAGVKVSDIDD--VILVGGQTRMPKVQEKVKE 356
L + LEAL E L + SDI + ++L GG + + + E
Sbjct: 251 LN-SNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGALLRNLDRLLME 309

Query: 357 FFGKEPRRDVNPDEAVAVGAAI 378
G +P VA G
Sbjct: 310 ETGIPVVVAEDPLTCVARGGGK 331


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_2208  IGASERPTASE  31  0.002  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 31.2 bits (70), Expect = 0.002
Identities = 16/77 (20%), Positives = 24/77 (31%), Gaps = 8/77 (10%)

Query: 2 ENTQENPTDQTTEETGREAQAAEPAAQAAENAAPAAEAA--------LAEAQAKIAELQE 53
T E TE T + + A+ A + E A + K E
Sbjct: 1048 SKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVE 1107

Query: 54 SFLRAKAETENVRRRAQ 70
+AK ETE + +
Sbjct: 1108 KEEKAKVETEKTQEVPK 1124


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_2233  cloacin  32  0.003  Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 32.0 bits (72), Expect = 0.003
Identities = 16/39 (41%), Positives = 22/39 (56%), Gaps = 2/39 (5%)

Query: 241 GGGMGARVGGPFIGGRGGRGGGNDGFRGGGGGFGGGGAS 279
GGG G+ + + GG G GG +G GGG G GG ++
Sbjct: 47 GGGSGSGIH--WGGGSGHGNGGGNGNSGGGSGTGGNLSA 83


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_2237  RTXTOXIND  49  3e-08  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 48.7 bits (116), Expect = 3e-08
Identities = 34/160 (21%), Positives = 51/160 (31%), Gaps = 34/160 (21%)

Query: 216 LDRTGRAQTHIVLASPETGVVSELNVR-DGAMVTPGQTLAKIAGLS-TLWAVIDVPEALA 273
L + Q V+ +P + V +L V +G +VT +TL I TL V
Sbjct: 318 LAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDI 377

Query: 274 SGVRPGMRVDATFEGDPQRR---VSGAIREILPG------VNATTRTLQARLE------L 318
+ G E P R + G ++ I + + + E
Sbjct: 378 GFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEENCLSTGN 437

Query: 319 DNRALTPGMLMRARVGASHAASRLVVPSDAVIATGKRSVV 358
N L+ GM A I TG RSV+
Sbjct: 438 KNIPLSSGM-----------------AVTAEIKTGMRSVI 460



Score = 35.2 bits (81), Expect = 5e-04
Identities = 13/58 (22%), Positives = 23/58 (39%), Gaps = 7/58 (12%)

Query: 211 SVIANLDRTGRAQTHIV-------LASPETGVVSELNVRDGAMVTPGQTLAKIAGLST 261
SV+ ++ A + + E +V E+ V++G V G L K+ L
Sbjct: 75 SVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGA 132


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_2238  ACRIFLAVINRP  665  0.0  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 665 bits (1718), Expect = 0.0
Identities = 219/1055 (20%), Positives = 437/1055 (41%), Gaps = 48/1055 (4%)

Query: 7 RWSIRNRLLVLLATALVAAWGVVSLNRTPLDALPDLSDTQVIVKASYPGKAPRVVEDQVT 66
+ IR + + ++ G +++ + P+ P ++ V V A+YPG + V+D VT
Sbjct: 3 NFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVT 62

Query: 67 YPLTTTLLGVPGAKTIRAYS-SFGDAFVYVLFDDRTDQYWARSRVLEYLNQVQGRLPQGA 125
+ + G+ + + S S G + + F TD A+ +V L LPQ
Sbjct: 63 QVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEV 122

Query: 126 -SVALGPDATGVGWVYEYALVDRSGRRDLGELRALNDWFLKFELKAVPDVAEVASVGGMV 184
+ + + ++ V + ++ +K L + V +V G
Sbjct: 123 QQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQ- 181

Query: 185 RQYQVVLDPDRLRAFGITQAAVVDALGKANSESGG------SVVEMAESEYMVRASGYLR 238
++ LD D L + +T V++ L N + + + + A +
Sbjct: 182 YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241

Query: 239 SLDDFRNVVLRTSESGTPVLLGDVARVQIGPEMRRGIAELNGEGEVAGGVIVMRSGKNAL 298
+ ++F V LR + G+ V L DVARV++G E IA +NG+ AG I + +G NAL
Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGK-PAAGLGIKLATGANAL 300

Query: 299 STIEAVKAKLAELRRSLPAGVELVTTYDRSQLIGRAVDNLKDKLIEEFVVVGLVCALFLF 358
T +A+KAKLAEL+ P G++++ YD + + ++ + L E ++V LV LFL
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 359 HLRSAFVAILSLPLGVLAAFIVMRHQGVNANLMSLGGIAIAIGAMIDAAVVMIENAHKHL 418
++R+ + +++P+ +L F ++ G + N +++ G+ +AIG ++D A+V++EN + +
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 419 ESHEHAHPGAPLSSAARWELIAASAAEVGPALFFSLLIVTLSFVPVFALEGQEGKLFAPL 478
+ E S +++ AL ++++ F+P+ G G ++
Sbjct: 421 MEDK----------LPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQF 470

Query: 479 AFTKTYTIAAAAGLSVTLVPVLMGYLIRGRIPREASNP------LNRL---LVRLYRPLL 529
+ T +A + +++ L P L L++ N N V Y +
Sbjct: 471 SITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSV 530

Query: 530 EATLARPWRAIAIAAAALVLTAIPMSRLGGEFMPPLDEGDLLYMPTALPGISAQKAAELL 589
L R + I A + + RL F+P D+G L M G + ++ ++L
Sbjct: 531 GKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVL 590

Query: 590 QQTDRLIKT--VPEVATVFGKSGRADTATDPAPLEMFETTIRFRPRGEW-RPGMTPGRLV 646
Q V +VF +G + + +P E + ++
Sbjct: 591 DQVTDYYLKNEKANVESVFTVNGF---SFSGQAQNAGMAFVSLKPWEERNGDENSAEAVI 647

Query: 647 DELDRVVKVPGLSNVWVPPIRNRLDMLSTGIKTPVGVKIAGPELAQIDRIAAQVEAAVKR 706
+ V + +++ + + AG + + Q+ +
Sbjct: 648 HRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQ 707

Query: 707 VPG-VTSALAERLNGGRYVDVDIDRRAAARYGLSVGDVQAVVASAIGGENVGEVIAGRER 765
P + S L +++D+ A G+S+ D+ +++A+GG V + I
Sbjct: 708 HPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRV 767

Query: 766 FPINIRYPREVRDSLEKLRALPIVTERGAQILLRDVAAVTIADGPPMIRSENARLSGYVY 825
+ ++ + R E + L + + G + G P + N S +
Sbjct: 768 KKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQ 827

Query: 826 VDIR-GVDLKTAVGAMQRAVAQQVALPPGYSIAWSGQFEYLERAAATLRTVIPVTLAVIF 884
+ G A+ M+ ++ LP G W+G + ++ ++ V+F
Sbjct: 828 GEAAPGTSSGDAMALMENLASK---LPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVF 884

Query: 885 VLLFLTFDSAADALLLMTTVPFALVGGLWFVWALGHAVSVATAVGFIALAGVAAEFGVVM 944
+ L ++S + + +M VP +VG L V VG + G++A+ +++
Sbjct: 885 LCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILI 944

Query: 945 LLYLKRAYERRIAAGEPPNEATLADAIREGAVLRVRPKAMTVAVVLAGLVPIMIGHGSGS 1004
+ + K E+ G+ EATL +R+RP MT + G++P+ I +G+GS
Sbjct: 945 VEFAKDLMEKE---GKGVVEATL-----MAVRMRLRPILMTSLAFILGVLPLAISNGAGS 996

Query: 1005 EVMQRIAAPMVGGMVTAPLLSMFVIPAAWLLLQRR 1039
+ ++GGMV+A LL++F +P +++++R
Sbjct: 997 GAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRRC 1031


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_2243  SSBTLNINHBTR  29  0.021  Streptomyces subtilisin inhibitor signature.
		>SSBTLNINHBTR#Streptomyces subtilisin inhibitor signature.

Length = 144

Score = 29.0 bits (64), Expect = 0.021
Identities = 21/44 (47%), Positives = 23/44 (52%), Gaps = 3/44 (6%)

Query: 21 VLHPLAGRPLLSHVIDTARALAPSRLVVVIGHGAEQVRAAVAAP 64
V PLAG L S A APS LV+ +GHG AA AAP
Sbjct: 18 VCGPLAGASLASPATAPASLYAPSALVLTVGHGES---AATAAP 58


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_2253  HTHTETR  28  0.043  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 28.1 bits (62), Expect = 0.043
Identities = 17/136 (12%), Positives = 34/136 (25%), Gaps = 8/136 (5%)

Query: 2 GTTIRDVAQAANVSIGTVSRALKNQPGLSEATRARIVE-----IAHRMNYDPTQLRPRIK 56
T++ ++A+AA V+ G + K++ L P ++
Sbjct: 31 STSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQAKFPGDPLSVLR 90

Query: 57 -RLTFLLHRQHNNFATTPFFSHVLHGVEDACRERGIVPSLLTTGPTDDVIRQMRPHAPDA 115
L +L + H E E +V + ++
Sbjct: 91 EILIHVLESTVTEERRRLLMEIIFHKCEFV-GEMAVVQQAQRNLCL-ESYDRIEQTLKHC 148

Query: 116 IAVAGFMEPETLEALA 131
I A
Sbjct: 149 IEAKMLPADLMTRRAA 164


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_2264  PF04647  29  0.007  Accessory gene regulator B
		>PF04647#Accessory gene regulator B

Length = 212

Score = 29.0 bits (65), Expect = 0.007
Identities = 5/41 (12%), Positives = 15/41 (36%)

Query: 108 AIVALAGFAISVFTTPFKGMLIIAAALIALFLFILYRPAAT 148
+ + + + + +LI+ A + +L + P
Sbjct: 86 LVFNVLAYIAHLIDPAYFQLLILIAFITSLLALLFLVPVDN 126


31  BMA10247_2287  BMA10247_2294  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BMA10247_2287  2  10  0.243488  short chain dehydrogenase
BMA10247_2288  2  10  -0.371546  short chain dehydrogenase
BMA10247_2289  5  10  -2.026585  thiol:disulfide interchange protein DsbA
BMA10247_2290  4  11  -2.271550  sporulation repeat-containing protein
BMA10247_2291  4  13  -3.660060  arginyl-tRNA synthetase
BMA10247_2292  8  27  -5.153851  hypothetical protein
BMA10247_2293  5  23  -4.506112  hypothetical protein
BMA10247_2294  5  25  -4.175008  ISBma2, transposase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_2287  DHBDHDRGNASE  56  1e-11  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 56.2 bits (135), Expect = 1e-11
Identities = 51/186 (27%), Positives = 79/186 (42%), Gaps = 16/186 (8%)

Query: 7 VVLVTGANRGLGLAFVEGLKAAGAK------------KIYAAARDPARVTTPGVQPVRLD 54
+ +TGA +G+G A L + GA K+ ++ + AR VR
Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDS 69

Query: 55 VTRAQDIAAAARELRDVNLLVNNAGIFRMGSLLAEADGGGLQAQLDTNFFGPLAMARAFA 114
+ A RE+ +++LVN AG+ R G L+ +A N G +R+ +
Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGVLRPG-LIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 115 PVLRENGGGAIINVLS-WLGLPNT--GAYGISKAAAWAATNAIRNELREQRTRVLALHSA 171
+ + G+I+ V S G+P T AY SKAAA T + EL E R +
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 172 YIDTDM 177
+TDM
Sbjct: 189 STETDM 194


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_2288  DHBDHDRGNASE  73  2e-17  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 73.2 bits (179), Expect = 2e-17
Identities = 50/188 (26%), Positives = 81/188 (43%), Gaps = 10/188 (5%)

Query: 9 VFITGASSGLGLALAAEYARHGATLGLVARRADALAEFAP------RFPKASISIYPADV 62
FITGA+ G+G A+A A GA + V + L + R +A +PADV
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEA----FPADV 66

Query: 63 RDADALALAASRFVAAHGCPDVVIANAGISKGAITGEGDLAAFREIMDVNYYGMIATFEP 122
RD+ A+ +R G D+++ AG+ + + + VN G+
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126

Query: 123 FIAPMTAARRGTLVGIASVAGVRGLPGSGAYSASKAAAIKYLEALRVELRPAQVAVVTIA 182
M R G++V + S AY++SKAAA+ + + L +EL + ++
Sbjct: 127 VSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186

Query: 183 PGYIRTPM 190
PG T M
Sbjct: 187 PGSTETDM 194


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_2290  IGASERPTASE  33  0.002  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 32.7 bits (74), Expect = 0.002
Identities = 22/173 (12%), Positives = 50/173 (28%), Gaps = 6/173 (3%)

Query: 58 ASQPQQFDPNRALQGKTPGQPVTPQAAQPAPPNTAPGQAANPSQPPLLPEPQIVEVPSSN 117
A ++ DP ++ T QPA ++ + + +VE P +
Sbjct: 1143 AEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENT 1202

Query: 118 NNGNGSPSASNNAAD-----NGVAVAPKPAEPAPPPAKKPQTAANGSSAPHVANNNAQAS 172
P+ ++ +++ + +V P P + N NA S
Sbjct: 1203 TPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLS 1262

Query: 173 AAATPPKAAQAPKGASSATTTAAKPTSGADANTGYFLQVGAYKTEADAEQQRA 225
A + G + + + + ++ + + Q R
Sbjct: 1263 DARAKAQFVALNVGKAVSQHISQLEMNNEGQYN-VWVSNTSMNKNYSSSQYRR 1314


32  BMA10247_2307  BMA10247_2317  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BMA10247_2307  2  19  0.569771  LysR family transcriptional regulator
BMA10247_2308  4  22  -0.194813  putrescine-binding periplasmic protein
BMA10247_2309  3  22  -0.054109  histone deacetylase/AcuC/AphA family protein
BMA10247_2310  3  23  -1.193335  carbon-nitrogen family hydrolase
BMA10247_2311  3  26  -2.236377  hypothetical protein
BMA10247_2312  2  21  -4.310173  carbon-nitrogen family hydrolase
BMA10247_2314  5  21  -5.679633  hypothetical protein
BMA10247_2315  5  21  -5.026325  PBSX family phage portal protein
BMA10247_2316  3  27  -5.024458  hypothetical protein
BMA10247_2317  3  28  -4.358875  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_2308  MALTOSEBP  30  0.018  Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 29.7 bits (66), Expect = 0.018
Identities = 17/49 (34%), Positives = 27/49 (55%)

Query: 44 FEKETGIKVRLDVYDSNEALQTKLTTGNSGYDLVFPSNDFLARQIQAGL 92
FEK+TGIKV ++ D E ++ G D++F ++D Q+GL
Sbjct: 53 FEKDTGIKVTVEHPDKLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGL 101


33  BMA10247_2333  BMA10247_2341  Y  N  Y  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
BMA10247_2333  2  19  -4.122556  carbon-nitrogen family hydrolase
BMA10247_2335  5  21  -5.679633  hypothetical protein
BMA10247_2336  7  23  -5.799349  PBSX family phage portal protein
BMA10247_2337  9  32  -6.989399  hypothetical protein
BMA10247_2338  8  33  -6.865350  hypothetical protein
BMA10247_2339  6  24  -4.856843  IS407A, transposase OrfB
BMA10247_2340  4  16  -3.595676  IS407A, transposase OrfA
BMA10247_2341  2  14  -2.627332  ISBma1, transposase
34  BMA10247_2481  BMA10247_2490  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BMA10247_2481  3  23  -2.448057  hypothetical protein
BMA10247_2482  5  26  -3.445335  hypothetical protein
BMA10247_2483  5  24  -3.444309  hypothetical protein
BMA10247_2484  0  15  -0.962377  hypothetical protein
BMA10247_2485  1  9  3.872024  hypothetical protein
BMA10247_2486  2  11  3.675057  hypothetical protein
BMA10247_2487  1  10  3.437468  hypothetical protein
BMA10247_2488  2  8  3.925272  hypothetical protein
BMA10247_2489  3  10  3.872024  FHA domain-containing protein
BMA10247_2490  2  9  4.001071  protein kinase domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_2490  YERSSTKINASE  34  0.004  Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 34.3 bits (78), Expect = 0.004
Identities = 17/42 (40%), Positives = 24/42 (57%), Gaps = 2/42 (4%)

Query: 149 QVLDGLAHAHANGVVHRDLKPQNVMVTTRDGEPCAKILDFGI 190
++LD H GVVH D+KP NV+ GEP ++D G+
Sbjct: 253 RLLDVTNHLAKAGVVHNDIKPGNVVFDRASGEPV--VIDLGL 292


35  BMA10247_2581  BMA10247_2592  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BMA10247_2581  1  12  3.060632  transcriptional regulator
BMA10247_2582  3  11  3.207903  4-hydroxybenzoate octaprenyltransferase
BMA10247_2583  1  13  4.249231  phosphonate metabolism
BMA10247_2584  1  12  3.647388  hypothetical protein
BMA10247_2585  1  12  2.193854  phosphonates metabolism transcriptional
BMA10247_2586  2  15  2.731211  phosphonate metabolism protein PhnG
BMA10247_2587  2  16  2.770570  carbon-phosphorus lyase complex subunit
BMA10247_2588  3  14  2.511071  phosphonate metabolism protein PhnI
BMA10247_2589  3  14  2.272838  phosphonate metabolism protein PhnJ
BMA10247_2590  4  13  3.004862  phosphonate C-P lyase system protein PhnK
BMA10247_2591  3  11  3.322222  phosphonate C-P lyase system protein PhnL
BMA10247_2592  2  10  2.651217  phosphonate metabolism protein PhnM
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_2591  PF05272  29  0.016  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.3 bits (65), Expect = 0.016
Identities = 21/68 (30%), Positives = 28/68 (41%), Gaps = 1/68 (1%)

Query: 58 CVALTGPSGAGKSTLLRCLYGNYLANRGTIAVRVGTRAAEHVV-LTASEPHEVIALRRDV 116
V L G G GKSTL+ L G + + G + E + + A E E+ A RR
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIVAYELSEMTAFRRAD 657

Query: 117 IGYVSQFL 124
V F
Sbjct: 658 AEAVKAFF 665


36  BMA10247_2612  BMA10247_2619  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BMA10247_2612  2  31  -1.356734  hypothetical protein
BMA10247_2613  0  31  -2.959448  hypothetical protein
BMA10247_2614  2  21  -4.482514  HSP20 family protein
BMA10247_2615  1  16  -5.238577  HSP20 family protein
BMA10247_2616  2  12  -5.284354  chaperonin, 10 kDa
BMA10247_2617  1  10  -4.425012  hypothetical protein
BMA10247_2618  1  11  -4.524292  hypothetical protein
BMA10247_2619  -1  12  -3.051279  glutamate/aspartate ABC transporter ATP-binding
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_2618  ACRIFLAVINRP  25  0.040  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 25.2 bits (55), Expect = 0.040
Identities = 9/38 (23%), Positives = 21/38 (55%), Gaps = 1/38 (2%)

Query: 14 IEIDDVIVGLLAI-RLNLPENADPRDAISRHLSEAGGP 50
+ +DD IV + + R+ + + P++A + +S+ G
Sbjct: 404 LLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGA 441


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_2619  PF05272  32  0.002  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 32.4 bits (73), Expect = 0.002
Identities = 17/53 (32%), Positives = 24/53 (45%), Gaps = 5/53 (9%)

Query: 29 VVVVCGPSGSGKSTLIKTVNGLEPFQQGEILVNGQSVGDKKTNLSKLRSKVGM 81
VV+ G G GKSTLI T+ GL+ F +G K + ++ V
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHF-----DIGTGKDSYEQIAGIVAY 645


37  BMA10247_2649  BMA10247_2685  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BMA10247_2649  3  13  -2.321999  dicarboxylate/amino acid:cation (Na+ or H+)
BMA10247_2650  6  19  -2.880759  transposase
BMA10247_2651  0  12  -1.060711  transposase subfamily protein
BMA10247_2652  0  12  -0.312625  carbohydrate porin
BMA10247_2653  -1  15  -1.119197  hypothetical protein
BMA10247_2654  2  19  -2.946744  hypothetical protein
BMA10247_2655  2  17  -2.832131  hypothetical protein
BMA10247_2656  1  14  -2.317245  manganese/iron transporter
BMA10247_2657  0  15  -1.960766  hypothetical protein
BMA10247_2658  -1  12  -1.585780  hypothetical protein
BMA10247_2659  0  11  -1.329211  ISBma2, transposase
BMA10247_2661  -1  9  -0.875162  ipgF protein
BMA10247_2662  -1  9  -0.055624  general secretion pathway protein D
BMA10247_2663  2  11  1.128764  general secretion pathway protein E
BMA10247_2664  0  14  1.508515  general secretion pathway protein F
BMA10247_2665  1  12  2.311281  general secretion pathway protein C
BMA10247_2666  0  11  3.075430  general secretion pathway protein G
BMA10247_2667  -1  12  3.753677  general secretion pathway protein H
BMA10247_2668  0  12  3.471712  general secretory pathway protein I
BMA10247_2669  -2  9  4.022174  general secretory pathway protein J
BMA10247_2670  -2  9  4.136307  general secretory pathway protein K
BMA10247_2671  -1  9  4.132894  general secretory pathway protein L
BMA10247_2672  -2  9  3.456157  general secretion pathway protein M
BMA10247_2673  -1  10  3.514604  general secretory pathway protein N
BMA10247_2674  -1  10  4.038819  RND efflux system outer membrane lipoprotein
BMA10247_2675  2  10  2.312473  hypothetical protein
BMA10247_2676  2  10  2.631013  MarR family transcriptional regulator
BMA10247_2679  0  10  2.074297  LysR family transcriptional regulator
BMA10247_2680  -2  11  -0.462779  hypothetical protein
BMA10247_2681  0  11  -0.323604  LrgA family protein
BMA10247_2682  0  11  0.403327  hypothetical protein
BMA10247_2683  3  13  -0.614532  hypothetical protein
BMA10247_2684  3  13  -2.624392  flagellar basal body protein FliL
BMA10247_2685  2  12  -2.570513  flagellar motor switch protein FliM
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_2662  BCTERIALGSPD  403  e-133  Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 403 bits (1037), Expect = e-133
Identities = 215/691 (31%), Positives = 325/691 (47%), Gaps = 88/691 (12%)

Query: 13 TALVVAGIVAAQAAHAQVTLNFVNADIDQVAKAIGAATGKTIIVDPRVKGQLNLVAERPV 72
T L+ A ++ AA + + +F DI + + KT+I+DP V+G + + + +
Sbjct: 13 TLLIFAALLFRPAAAEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDML 72

Query: 73 PEDQALKTLQSALRMQGFALV-QDHGVLKVVPEADAKLQGVPTYIGNAPQVRGDQVVTQV 131
E+Q + S L + GFA++ ++GVLKVV DAK VP AP + GD+VVT+V
Sbjct: 73 NEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGI-GDEVVTRV 131

Query: 132 FELRNESANNLLPVLRPLI--SPNNTITAYPANNTIVVTDYADNVRRIAQIIAGVDSAAG 189
L N +A +L P+LR L + ++ Y +N +++T A ++R+ I+ VD+A
Sbjct: 132 VPLTNVAARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDNAGD 191

Query: 190 SQVAVVPLKNANAIDIAAQLTKLLDPGAIGNTDATLKVTVQADPRTNALLLRASNAQRLA 249
V VPL A+A D+ +T+L + ++ V AD RTNA+L+ R
Sbjct: 192 RSVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNSR-Q 250

Query: 250 AAKKIAQQLDAPSGVPGNMHVVPLRNAEAVKLAKTLRGMLGKGGGESGSSASSNDANAFN 309
+ +QLD GN V+ L+ A+A L + L G+
Sbjct: 251 RIIAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGIS-------------------- 290

Query: 310 QGGSQSGSNFSTGASGTPPLPSGLSSNSSGGAGGTTGGGGLGNAGLLGGDKDKGDDNQPG 369
S + S +
Sbjct: 291 ---------------------STMQSEKQAAKPVAALDKNI------------------- 310

Query: 370 GMIQADAASNSLIITASDPVYRNLRAVIDQLDSRRAQVYIEALVVELQATTSANLGIQWQ 429
+I+A +N+LI+TA+ V +L VI QLD RR QV +EA++ E+Q NLGIQW
Sbjct: 311 -IIKAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWA 369

Query: 430 VANNALYAGTNLVTGQTGLGNSIVNLTAGAVT--NPGGTLGSLG---SITNGLNIGWLHN 484
N +T T G I AGA G SL S NG+ G
Sbjct: 370 NKNAG-------MTQFTNSGLPISTAIAGANQYNKDGTVSSSLASALSSFNGIAAG---- 418

Query: 485 MFGVQGLGALLQFFAGSSDANVLSTPNLVTLDNEEAKIVVGQNVPIPTGSYSNLTSGTTA 544
F LL + S+ ++L+TP++VTLDN EA VGQ VP+ TGS + +
Sbjct: 419 -FYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTGS----QTTSGD 473

Query: 545 NAFNTYDRRDVGLTLHVKPQITEGGILKLQLYTEDSAVVPGTNTTSANSPGPTFTKRSIQ 604
N FNT +R+ VG+ L VKPQI EG + L++ E S+V +++++ G TF R++
Sbjct: 474 NIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSV-ADAASSTSSDLGATFNTRTVN 532

Query: 605 STVLADNGEIIVLGGLMQDNYQVSNTKVPLLGDIPWIGQLFRSEGKTRQKTNLMVFLRPV 664
+ VL +GE +V+GGL+ + + KVPLLGDIP IG LFRS K K NLM+F+RP
Sbjct: 533 NAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPT 592

Query: 665 IINDRETAQAVTSNRYDYIQGVTGAYKSDNN 695
+I DR+ + +S +Y + N
Sbjct: 593 VIRDRDEYRQASSGQYTAFNDAQSKQRGKEN 623


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_2664  BCTERIALGSPF  382  e-133  Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.
Length = 408

Score = 382 bits (982), Expect = e-133
Identities = 174/406 (42%), Positives = 266/406 (65%), Gaps = 2/406 (0%)

Query: 1 MPAFRFEAIDASGRAQKGVIEADSARNARGQLRTQGLTPLVVEPAASAQRGARSQRLALG 60
M + ++A+DA G+ +G EADSAR AR LR +GL PL V+ Q+ + S L+L
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 61 R--KLSQREQAILTRQLASLLVAGLPLDEALAVLTEQAERDYIRELMAAIRAEVLGGHSL 118
R +LS + A+LTRQLA+L+ A +PL+EAL + +Q+E+ ++ +LMAA+R++V+ GHSL
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 119 ANALTQHPRDFPEIYRALVAAGEHTGKLGIVLSRLADYIEERNALKQKILLAFTYPAIVT 178
A+A+ P F +Y A+VAAGE +G L VL+RLADY E+R ++ +I A YP ++T
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180

Query: 179 VIAFGIVTFLLSYVVPQVVNVFASTKQQLPVLTIVMMALSDFVRHWWWAILIGIAAVVYL 238
V+A +V+ LLS VVP+VV F KQ LP+ T V+M +SD VR + +L+ + A
Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240

Query: 239 VKATLSRDGPRLAFDRWLLTAPLAGKLVRGYNTVRFASTLGILTAAGVPILRALQAAGET 298
+ L ++ R++F R LL PL G++ RG NT R+A TL IL A+ VP+L+A++ +G+
Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300

Query: 299 LSNRAMRGNIDDAIVRVREGSALSRALNNVKTFPPVLVHLIRSGEATGDVTTMLDRAAEG 358
+SN R + A VREG +L +AL FPP++ H+I SGE +G++ +ML+RAA+
Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360

Query: 359 ESRELERRTMFLTSLLEPLLILAMGGIVLVIVLAVMLPIIELNNMV 404
+ RE + L EPLL+++M +VL IVLA++ PI++LN ++
Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_2666  BCTERIALGSPG  188  6e-65  Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.
Length = 145

Score = 188 bits (480), Expect = 6e-65
Identities = 67/140 (47%), Positives = 94/140 (67%), Gaps = 3/140 (2%)

Query: 10 QAARRQRGFTLIEIMVVVAILGILAALIVPKIMSRPDEARRIAAKQDIGTIMQALKLYRL 69
+A +QRGFTL+EIMVV+ I+G+LA+L+VP +M ++A + A DI + AL +Y+L
Sbjct: 2 RATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKL 61

Query: 70 DNGRYPTQDQGLNALIQKPTTDPIPNNWKDGGYLERLPNDPWGNSYKYLNPGVHGEIDVF 129
DN YPT +QGL +L++ PT P+ N+ GY++RLP DPWGN Y +NPG HG D+
Sbjct: 62 DNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDLL 121

Query: 130 SYGADGKEGGESNDSDIGSW 149
S G DG+ G E DI +W
Sbjct: 122 SAGPDGEMGTE---DDITNW 138


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_2667  BCTERIALGSPH  51  1e-10  Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.
Length = 170

Score = 51.5 bits (123), Expect = 1e-10
Identities = 20/101 (19%), Positives = 33/101 (32%), Gaps = 15/101 (14%)

Query: 51 RARGFTLLEMLVVLVIAGILVSVASLTLRRNPRTDLREEAQRIALLFETAGDEAQVRARP 110
R RGFTLLEM+++L++ G+ + L + + R +
Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTGQF 61

Query: 111 IAWRATEHGFRF---------------DIRTGDGWRPLRDD 136
++F D +G W PLR
Sbjct: 62 FGVSVHPDRWQFLVLEARDGADPAPADDGWSGYRWLPLRAG 102


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_2668  BCTERIALGSPG  30  0.001  Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.
Length = 145

Score = 30.2 bits (68), Expect = 0.001
Identities = 10/26 (38%), Positives = 18/26 (69%)

Query: 10 RSPARSRGFTMIEVLVALAIIAVALA 35
R+ + RGFT++E++V + II V +
Sbjct: 2 RATDKQRGFTLLEIMVVIVIIGVLAS 27


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_2669  BCTERIALGSPG  34  3e-04  Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.
Length = 145

Score = 33.7 bits (77), Expect = 3e-04
Identities = 17/72 (23%), Positives = 34/72 (47%), Gaps = 3/72 (4%)

Query: 33 RGFTLIEMMIAITILAVIA-ILSWRGLDQIIRGREKVAAAMEDERVFAQMFDQMRIDARR 91
RGFTL+E+M+ I I+ V+A ++ + + ++ A+ D D ++D
Sbjct: 8 RGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQK--AVSDIVALENALDMYKLDNHH 65

Query: 92 AATDDEAGQPAV 103
T ++ + V
Sbjct: 66 YPTTNQGLESLV 77


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_2685  FLGMOTORFLIM  274  4e-93  Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 274 bits (703), Expect = 4e-93
Identities = 82/324 (25%), Positives = 158/324 (48%), Gaps = 10/324 (3%)

Query: 5 EFMSQEEVDALLKGVTGEDDSADEPAEASG---IRPYNIATQERIVRGRMPGLEIINDRF 61
E +SQ+E+D LL ++ D S ++ S I Y+ ++ + +M L ++++ F
Sbjct: 3 EVLSQDEIDQLLTAISSGDASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLMHETF 62

Query: 62 ARLLRIGIFNFMRRTAEISVSQVKVQKYSEFTRNLPIPTNLNLVHVKPLRGTSLFVFDPN 121
ARL + +R + V+ V Y EF R++P P+ L ++ + PL+G ++ DP+
Sbjct: 63 ARLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVLEVDPS 122

Query: 122 LVFFVVDNLFGGDGRFHTRVEGRDFTATEQRIIGKLLNLVFEHYASAWKSVRPLQFEFVR 181
+ F ++D LFGG G+ RD T E ++ ++ + + +W V L+ +
Sbjct: 123 ITFSIIDRLFGGTGQAAKVQ--RDLTDIENSVMEGVIVRILANVRESWTQVIDLRPRLGQ 180

Query: 182 SEMHTQFANVATPNEIVIATQFSIEFGPTGGTLHICMPYSMIEPIRDVLSSPIQGEAL-- 239
E + QFA + P+E+V+ + G G ++ C+PY IEPI LSS ++
Sbjct: 181 IETNPQFAQIVPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFSSVRR 240

Query: 240 EVDRRWVRVLSQQVQSAEVELVADLAEVPTTFEKILNLRTGDVLPLD---ITDSITAKVD 296
+++ VL ++ + ++++VA++ + + IL LR GD++ L + D +
Sbjct: 241 SSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFVLSIG 300

Query: 297 GVPVMECGYGIFNGQYALRVQRMI 320
C G+ + A ++ I
Sbjct: 301 NRKKFLCQPGVVGKKIAAQILERI 324


38  BMA10247_2731  BMA10247_2747  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BMA10247_2731  3  20  -1.029402  alpha-methylacyl-CoA racemase
BMA10247_2734  8  27  -4.104584  hypothetical protein
BMA10247_2735  7  16  -4.636287  hypothetical protein
BMA10247_2736  6  21  -4.943359  lipoprotein
BMA10247_2737  4  20  -5.985502  IS407A, transposase OrfB
BMA10247_2738  4  18  -6.360402  IS407A, transposase OrfA
BMA10247_2739  2  15  -5.901306  hypothetical protein
BMA10247_2741  2  13  -5.196554  PBSX family phage portal protein
BMA10247_2744  1  13  -5.925564  *ClpXP protease specificity-enhancing factor
BMA10247_2745  0  9  -5.135041  stringent starvation protein A
BMA10247_2746  2  9  -4.171395  ubiquinol-cytochrome c reductase, cytochrome c1
BMA10247_2747  1  10  -3.615898  ubiquinol-cytochrome c reductase, cytochrome b
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_2731  SHAPEPROTEIN  32  0.004  Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.
Length = 347

Score = 32.0 bits (73), Expect = 0.004
Identities = 19/67 (28%), Positives = 30/67 (44%), Gaps = 3/67 (4%)

Query: 144 AGQPGDAPFAPPTLVGDLGGGALYLAMGVLAGIVDAR-LRGKGQIVDAAIVDGSANLMNL 202
AG P ++V D+GGG +A+ L G+V + +R G D AI++
Sbjct: 151 AGLPVSEATG--SMVVDIGGGTTEVAVISLNGVVYSSSVRIGGDRFDEAIINYVRRNYGS 208

Query: 203 LLSIHAA 209
L+ A
Sbjct: 209 LIGEATA 215


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_2735  UREASE  30  0.004  Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 29.7 bits (67), Expect = 0.004
Identities = 16/44 (36%), Positives = 21/44 (47%), Gaps = 5/44 (11%)

Query: 106 GGILVYDQFVTP----PTPQPVRQRRLRWGAHGRSNNGDNFYVV 145
GG + P PTPQPV R + +GA+GRS + V
Sbjct: 452 GGTIAAAPMGDPNASIPTPQPVHYRPM-FGAYGRSRTNSSVTFV 494


39  BMA10247_2842  BMA10247_2851  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BMA10247_2842  1  11  3.791754  NAD(P)H-dependent glycerol-3-phosphate
BMA10247_2844  2  11  2.270774  hypothetical protein
BMA10247_2843  -2  8  0.725012  hypothetical protein
BMA10247_2845  -2  9  -0.554168  RNA methyltransferase
BMA10247_2846  -2  9  -0.981966  ComF family protein
BMA10247_2847  -1  11  -1.961464  hypothetical protein
BMA10247_2848  1  13  -3.021180  hypothetical protein
BMA10247_2849  2  13  -3.278428  cytochrome c oxidase subunit II
BMA10247_2850  3  17  -4.456822  cytochrome c oxidase subunit I
BMA10247_2851  1  15  -3.598425  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_2849  OMPADOMAIN  68  1e-14  OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 68.0 bits (166), Expect = 1e-14
Identities = 27/103 (26%), Positives = 46/103 (44%), Gaps = 2/103 (1%)

Query: 391 QADGGAAANAASGAAAQTQAQAPALPAAIYFETGKSELPADAKDAIAAAAEYVKAH--PD 448
Q + A A + Q + L + + F K+ L + + A+ + D
Sbjct: 193 QGEAAPVVAPAPAPAPEVQTKHFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKD 252

Query: 449 AKLALSGFTDKTGSADANAELAKRRAQVVRDALKTAGVAEDRI 491
+ + G+TD+ GS N L++RRAQ V D L + G+ D+I
Sbjct: 253 GSVVVLGYTDRIGSDAYNQGLSERRAQSVVDYLISKGIPADKI 295


40  BMA10247_2861  BMA10247_2908  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BMA10247_2861  2  14  -3.145207  hypothetical protein
BMA10247_2862  3  14  -3.058286  methyl-accepting chemotaxis protein
BMA10247_2863  6  15  -4.725288  hypothetical protein
BMA10247_2864  4  11  -4.130015  D-methionine ABC transporter periplasmic
BMA10247_2865  4  13  -4.280273  ISBma2, transposase
BMA10247_2866  2  10  -3.832654  ISBma1, transposase
BMA10247_2867  -1  12  -2.479212  hypothetical protein
BMA10247_2868  -2  11  0.094711  cytochrome d ubiquinol oxidase, subunit I
BMA10247_2869  -1  11  1.617637  cytochrome d ubiquinol oxidase, subunit II
BMA10247_2870  0  11  3.353807  cyd operon protein YbgT
BMA10247_2871  1  11  3.839512  hypothetical protein
BMA10247_2872  1  12  3.569854  hypothetical protein
BMA10247_2873  2  13  3.712411  beta-N-acetylhexosaminidase
BMA10247_2874  3  12  3.443728  PTS system N-acetylglucosamine-specific
BMA10247_2875  4  11  3.773259  PTS system glucose-glucoside (Glc) family
BMA10247_2876  3  10  3.387616  SIS domain-containing protein
BMA10247_2877  3  11  2.711885  N-acetylglucosamine-6-phosphate deacetylase
BMA10247_2878  2  11  3.484290  GntR family transcriptional regulator
BMA10247_2879  2  12  3.332028  LysR family transcriptional regulator
BMA10247_2880  2  14  2.879206  AMP-binding protein
BMA10247_2881  2  15  2.723319  hypothetical protein
BMA10247_2882  0  13  2.268148  hypothetical protein
BMA10247_2883  1  13  3.297126  hypothetical protein
BMA10247_2884  1  13  3.444674  hypothetical protein
BMA10247_2885  0  12  2.858364  hypothetical protein
BMA10247_2886  -1  12  3.974195  hypothetical protein
BMA10247_2887  -2  13  4.265515  pyridoxal-dependent decarboxylase
BMA10247_2888  -1  13  4.247522  AMP-binding protein
BMA10247_2889  -2  15  4.080661  hypothetical protein
BMA10247_2890  -2  14  3.416888  acyl-CoA dehydrogenase
BMA10247_2891  -1  13  4.495942  citrate synthase-like protein
BMA10247_2892  -2  12  3.866261  UbiE/COQ5 family methlytransferase
BMA10247_2893  -1  11  3.120390  syringomycin synthesis regulator SyrP
BMA10247_2894  -1  9  3.313597  ABC transporter ATP-binding protein
BMA10247_2895  1  9  2.600045  ABC-2 type transporter permease
BMA10247_2896  1  8  2.020172  hypothetical protein
BMA10247_2897  0  9  1.419649  diaminobutyrate--2-oxoglutarate
BMA10247_2899  0  10  2.228293  fatty acid desaturase
BMA10247_2898  1  10  3.080264  hypothetical protein
BMA10247_2900  0  10  2.826920  major facilitator family transporter
BMA10247_2901  1  11  2.743156  hypothetical protein
BMA10247_2902  1  11  2.660507  error-prone DNA polymerase
BMA10247_2903  0  10  1.463211  hypothetical protein
BMA10247_2904  0  11  -0.820421  hypothetical protein
BMA10247_2905  0  11  -2.561729  ABC transporter ATP-binding protein
BMA10247_2906  0  11  -3.407324  ABC transporter permease
BMA10247_2907  0  12  -3.872986  ABC transporter periplasmic substrate-binding
BMA10247_2908  -1  11  -3.024639  ISBma2, transposase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_2873  cloacin  31  0.023  Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 30.8 bits (69), Expect = 0.023
Identities = 31/120 (25%), Positives = 48/120 (40%), Gaps = 8/120 (6%)

Query: 176 VVVDGAAPAVLRYDDTDDELRYVETLPADAQNNSPGNAPP--AAAQPVANRALPSVKRQR 233
V + G P+ + DD + + V +LPAD SP ++ P A V R + VK +R
Sbjct: 134 VALYGVLPSQIAKDDPNMMSKIVTSLPADDITESPVSSLPLDKATVNVNVRVVDDVKDER 193

Query: 234 ALPGALDLRGVELTLPELPSAQVAALRERAGTLGLDGARVPVWGVVAPRRLPADIAVPGG 293
+ GV +++P + A ER G PV + PA + G
Sbjct: 194 QNISVVS--GVPMSVPVVD----AKPTERPGVFTASIPGAPVLNISVNNSTPAVQTLSPG 247


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_2875  PHPHTRNFRASE  513  e-175  Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.
Length = 572

Score = 513 bits (1323), Expect = e-175
Identities = 194/567 (34%), Positives = 312/567 (55%), Gaps = 7/567 (1%)

Query: 300 PNTLAGVCAAPGIAVGTLVRWDDAQIVPPELASGTPAAESRLLDRALAEVDAQLETTVRE 359
+ + G+ A+ G+A+ + + + + + E L AL + +L +
Sbjct: 2 HHKITGIAASSGVAIAKAFIHLEPNVDIEKTSITDVSTEIEKLTAALEKSKEELRAIKDQ 61

Query: 360 ASRRGAIGEAGIFAVHRVLLEDPALVDAARDLI-SLGKSAGYAWRETIRAQTAVLADVDD 418
+A IFA H ++L+DP LVD + I + +A YA +E ++ +D+
Sbjct: 62 TEASMGADKAEIFAAHLLVLDDPELVDGIKGKIENEQMNAEYALKEVSDMFVSMFESMDN 121

Query: 419 TLLAERAADLRDIDKRVLRAL-GYASASARELPAEAVLAAEEFTPSDLASLDRERVAALV 477
+ ERAAD+RD+ KRVL L G + S + E V+ AE+ TPSD A L+++ V
Sbjct: 122 EYMKERAADIRDVSKRVLGHLIGVETGSLATIAEETVIIAEDLTPSDTAQLNKQFVKGFA 181

Query: 478 MARGGATSHAAIIARQLGIPALVAVGDALYAIAQRTQVVVDASAGRLEYAPSALDVERAR 537
GG TSH+AI++R L IPA+V + I V+VD G + P+ +V+
Sbjct: 182 TDIGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVNPTEEEVKAYE 241

Query: 538 HERQRLAGVREANRRMSGEAALTRDGHRIEVAANIATLDDARVALDNGADAVGLLRTELM 597
+R ++ ++ GE + T+DG +E+AANI T D L NG + +GL RTE +
Sbjct: 242 EKRAAFEKQKQEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTEFL 301

Query: 598 FIHRQAAPTASEHQQSYQSIVDALQGRTAIIRTLDVGADKEVDYLTLPPEPNPALGLRGI 657
++ R PT E ++Y+ +V + G+ +IRTLD+G DKE+ YL LP E NP LG R I
Sbjct: 302 YMDRDQLPTEEEQFEAYKEVVQRMDGKPVVIRTLDIGGDKELSYLQLPKELNPFLGFRAI 361

Query: 658 RLAQVRPDLLDDQLRGLLAVKPYGSVRILLPMVTDVGELVRIRKRIDD-----FARAMGR 712
RL + D+ QLR LL YG+++++ PM+ + EL + + + + + +
Sbjct: 362 RLCLEKQDIFRTQLRALLRASTYGNLKVMFPMIATLEELRQAKAIMQEEKDKLLSEGVDV 421

Query: 713 AQAVEVGVMIEVPSAALLADQLAQHADFLSIGTNDLTQYTLAMDRCQADLAAQADGLHPA 772
+ ++EVG+M+E+PS A+ A+ A+ DF SIGTNDL QYT+A DR ++ HPA
Sbjct: 422 SDSIEVGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAADRMNERVSYLYQPYHPA 481

Query: 773 VLRLVDATVRGAEKHGKWVGVCGALGGDPVAVPVLVGLGVTELSVDPVSVPGIKAQVRRL 832
+LRLVD ++ A GKWVG+CG + GD VA+P+L+GLG+ E S+ S+ ++Q+ +L
Sbjct: 482 ILRLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLDEFSMSATSILPARSQLLKL 541

Query: 833 DYQLCRQRAQDLLALESAQAVRAASRE 859
+ + AQ L L++A+ V ++
Sbjct: 542 SKEELKPFAQKALMLDTAEEVEQLVKK 568


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_2895  ABC2TRNSPORT  31  0.006  ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.
Length = 262

Score = 31.1 bits (70), Expect = 0.006
Identities = 33/155 (21%), Positives = 59/155 (38%), Gaps = 7/155 (4%)

Query: 163 YGEFFATGILIMVFMSIGVVSTA-TTIATLRERNTFKMYVCFPVSRF-VFLASLIVSRVI 220
Y F A G++ M+ T + + T++ + + + L + +
Sbjct: 65 YTAFLAAGMVATSAMTAATFETIYAAFGRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATK 124

Query: 221 LMLAASVTLMLAARYLFQVPLPLWSLRALRAIPVVLLGAAMLLSLGTLLASRARSLAAAE 280
LA + ++AA + SL L A+PV+ L SLG ++ + A S
Sbjct: 125 AALAGAGIGVVAAALGY---TQWLSL--LYALPVIALTGLAFASLGMVVTALAPSYDYFI 179

Query: 281 AWCNLIYFPLLFFSDLTIPLRAAPHWLRVVLLVLP 315
+ L+ P+LF S P+ P + LP
Sbjct: 180 FYQTLVITPILFLSGAVFPVDQLPIVFQTAARFLP 214


41  BMA10247_2937  BMA10247_2957  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BMA10247_2937  2  11  -2.886601  ATP-dependent protease La
BMA10247_2938  2  11  -2.891991  hypothetical protein
BMA10247_2939  0  13  -2.696831  HPr kinase/phosphorylase
BMA10247_2940  -1  14  -2.824302  PTS system, nitrogen regulatory IIA protein
BMA10247_2941  -2  13  -1.773077  ribosomal subunit interface protein
BMA10247_2942  -1  12  -0.544266  RNA polymerase factor sigma-54
BMA10247_2943  0  11  0.820045  ABC transporter ATP-binding protein
BMA10247_2944  0  11  1.353506  hypothetical protein
BMA10247_2945  1  10  1.589001  hypothetical protein
BMA10247_2946  3  10  1.371598  3-deoxy-D-manno-octulosonate 8-phosphate
BMA10247_2947  5  9  0.424197  carbohydrate isomerase KpsF/GutQ family protein
BMA10247_2948  4  9  0.412223  monovalent cation:proton antiporter-2 (CPA2)
BMA10247_2950  0  13  -0.870142  LysE family protein
BMA10247_2951  -1  11  -1.339001  nudix hydrolase
BMA10247_2952  -2  11  -1.519033  formyltetrahydrofolate deformylase
BMA10247_2953  -2  12  -1.869185  hypothetical protein
BMA10247_2954  -1  11  -2.153843  hypothetical protein
BMA10247_2955  0  11  -2.988700  excinuclease ABC subunit A
BMA10247_2956  5  20  -3.744723  major facilitator family transporter
BMA10247_2957  4  22  -4.434340  single-stranded DNA-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_2956  TCRTETA  85  3e-20  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 84.9 bits (210), Expect = 3e-20
Identities = 77/368 (20%), Positives = 143/368 (38%), Gaps = 31/368 (8%)

Query: 7 RATTSLAAIFALRMLGLFMIMPVFSVYAKTIPGGENVVL-VGIALGAYGVTQSLLYIFYG 65
R + + AL +G+ +IMPV + + +V GI L Y + Q G
Sbjct: 5 RPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLG 64

Query: 66 WASDKFGRKPVIAAGLLIFALGSFVAAFAHDITWIIVGRVIQGM-GAVSSAVLAFIADLT 124
SD+FGR+PV+ L A+ + A A + + +GR++ G+ GA + A+IAD+T
Sbjct: 65 ALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADIT 124

Query: 125 SEHNRTKAMAMVGGSIGMSFAVAIVGAPI--VFHWVGMSGLFAIVGALSVAAIGVVLWVV 182
R + + G + G + + F AL+ +++
Sbjct: 125 DGDERARHFGFMSACFGFGM---VAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLL 181

Query: 183 PDAPRPVHVPAPFAEVLHNVELLRLNFGVLVLHATQTALFLVVPRLLVDGGLPVA----- 237
P++ + P E L+ + R G+ V+ A F+ + + G +P A
Sbjct: 182 PESHKGERRPLR-REALNPLASFRWARGMTVVAALMAVFFI----MQLVGQVPAALWVIF 236

Query: 238 ----SHWQ-----VYLPVMGL--AFVMMVPAIIVAEKQGRMKPVLLGGIAAILIGQLLLG 286
HW + L G+ + + VA + G + ++L G+ A G +LL
Sbjct: 237 GEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALML-GMIADGTGYILLA 295

Query: 287 VATHTILIVAAILFVYFLGFNILEASQPSLVSKLAPGSRKGAATGVYNTTQSIGLALGGV 346
AT + + V I + +++S+ R+G G S+ +G +
Sbjct: 296 FATRGWMAF--PIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPL 353

Query: 347 VGGVLLKH 354
+ +
Sbjct: 354 LFTAIYAA 361


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_2957  cloacin  46  2e-08  Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 46.2 bits (109), Expect = 2e-08
Identities = 26/65 (40%), Positives = 29/65 (44%)

Query: 109 GGRGGSGGGGGGGDDGGYGGGGGGYGGGRDMERGGGGGRASGGGGAGARSGGGGGGGGGG 168
GG G G GGG D G+ +GGG GGG G GG SGGG G GG
Sbjct: 22 GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNL 81

Query: 169 GGGAS 173
A+
Sbjct: 82 SAVAA 86



Score = 40.5 bits (94), Expect = 2e-06
Identities = 25/65 (38%), Positives = 27/65 (41%)

Query: 109 GGRGGSGGGGGGGDDGGYGGGGGGYGGGRDMERGGGGGRASGGGGAGARSGGGGGGGGGG 168
G + G GG G GGG G G E GG + G G SG G GGG G
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGN 70

Query: 169 GGGAS 173
GG S
Sbjct: 71 SGGGS 75



Score = 38.2 bits (88), Expect = 1e-05
Identities = 25/73 (34%), Positives = 30/73 (41%)

Query: 110 GRGGSGGGGGGGDDGGYGGGGGGYGGGRDMERGGGGGRASGGGGAGARSGGGGGGGGGGG 169
GRG + G + G G G GGG G GGG+G+ GGG G G G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65

Query: 170 GGASRPSAPAGGG 182
GG +G G
Sbjct: 66 GGNGNSGGGSGTG 78



Score = 36.2 bits (83), Expect = 6e-05
Identities = 27/79 (34%), Positives = 30/79 (37%), Gaps = 3/79 (3%)

Query: 107 MLGGRGGSGGGGGGGDDGGYGGGGGGYGGGRDMERGGGGGRASGGGGAGARSG---GGGG 163
M GG G G G GG G G G G G + G G+ SG GGG
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 164 GGGGGGGGASRPSAPAGGG 182
G G GGG + GG
Sbjct: 61 GHGNGGGNGNSGGGSGTGG 79



Score = 30.5 bits (68), Expect = 0.005
Identities = 25/72 (34%), Positives = 27/72 (37%)

Query: 109 GGRGGSGGGGGGGDDGGYGGGGGGYGGGRDMERGGGGGRASGGGGAGARSGGGGGGGGGG 168
GG GSG GGG G GGG G GGG A G A S G GG
Sbjct: 47 GGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVS 106

Query: 169 GGGASRPSAPAG 180
+ +A A
Sbjct: 107 ISAGALSAAIAD 118


42  BMA10247_2998  BMA10247_3015  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BMA10247_2998  2  26  -3.465077  hypothetical protein
BMA10247_2999  2  25  -4.298676  amino acid uptake ABC transporter periplasmic
BMA10247_3000  2  28  -5.123474  amino acid uptake ABC transporter permease
BMA10247_3001  2  28  -5.251319  hydrophobic amino acid ABC transporter
BMA10247_3002  0  26  -4.949297  hydrophobic amino acid ABC transporter
BMA10247_3003  0  25  -4.640382  tRNA uridine 5-carboxymethylaminomethyl
BMA10247_3004  -1  18  -4.017501  16S rRNA methyltransferase GidB
BMA10247_3005  0  17  -4.224324  sporulation initiation inhibitor protein Soj
BMA10247_3006  0  17  -3.914108  stage 0 sporulation protein J
BMA10247_3007  1  19  -3.935041  transporter
BMA10247_3008  1  21  -4.769951  ATP synthase I
BMA10247_3009  1  20  -5.075959  ATP synthase F0F1 subunit A
BMA10247_3010  1  24  -4.922636  ATP synthase F0F1 subunit C
BMA10247_3011  2  23  -4.674008  ATP synthase F0F1 subunit B
BMA10247_3012  2  18  -4.638331  ATP synthase F0F1 subunit delta
BMA10247_3013  0  14  -4.098990  ATP synthase F0F1 subunit alpha
BMA10247_3014  1  13  -4.189674  ATP synthase F0F1 subunit gamma
BMA10247_3015  1  12  -3.878045  ATP synthase F0F1 subunit beta
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_3012  FLGMOTORFLIN  27  0.034  Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 26.8 bits (59), Expect = 0.034
Identities = 24/87 (27%), Positives = 45/87 (51%), Gaps = 9/87 (10%)

Query: 5 ATIARPYAEALFRVAEGGDISAWSTLVQELAQVAQLPEVLSVASSPKVSRTQ--VAELLL 62
AT + A+A+F+ GGD+S +Q++ + +P L+V ++ RT+ + ELL
Sbjct: 28 ATTTKSAADAVFQQLGGGDVSG---AMQDIDLIMDIPVKLTV----ELGRTRMTIKELLR 80

Query: 63 AALKSPLASGAQAKNFVQMLVDNHRIA 89
S +A A + +L++ + IA
Sbjct: 81 LTQGSVVALDGLAGEPLDILINGYLIA 107


43  BMA10247_3060  BMA10247_3074  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BMA10247_3060  2  14  1.406493  glycine cleavage system H protein
BMA10247_3061  2  14  2.755054  glycine cleavage system aminomethyltransferase
BMA10247_3062  -1  11  1.008429  hypothetical protein
BMA10247_3063  -2  12  1.337514  lipoprotein
BMA10247_3064  -1  14  0.166663  Gfo/Idh/MocA family oxidoreductase
BMA10247_3065  2  17  -0.968357  GDSL-like lipase/acylhydrolase domain-containing
BMA10247_3066  3  19  -2.505541  hypothetical protein
BMA10247_3067  3  19  -4.218953  ATP-dependent DNA helicase Rep
BMA10247_3068  7  33  -4.831516  cytochrome c family protein
BMA10247_3070  6  34  -6.633119  *prophage DLP12 integrase
BMA10247_3071  5  34  -7.073249  IS407A, transposase OrfB
BMA10247_3072  2  22  -4.924272  IS407A, transposase OrfA
BMA10247_3073  2  21  -3.914375  hypothetical protein
BMA10247_3074  1  20  -3.526243  lipoprotein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_3066  OMADHESIN  28  0.044  Yersinia outer membrane adhesin signature.
		>OMADHESIN#Yersinia outer membrane adhesin signature.

Length = 455

Score = 27.9 bits (61), Expect = 0.044
Identities = 32/107 (29%), Positives = 45/107 (42%), Gaps = 2/107 (1%)

Query: 69 LSLSAIAASEAFSFAYAWTCRRHRWPLALAAGLAAWAAAASALARLPATPPAATAVAFAA 128
+S+SA S FS YA+ P A ++ A A L P PP A A
Sbjct: 7 ISVSAALISALFSSPYAFADDYDGIPNLTAVQISPNADPALGLE-YPVRPPVPGAGGLNA 65

Query: 129 TCFGQSCLPRGATLAPRAPLSHADLAGRLAAGAALALAVTSLAGALG 175
+ G + GAT + A AG +A G ++A+ L+ ALG
Sbjct: 66 SAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVN-SVAIGPLSKALG 111


44  BMA10247_3177  BMA10247_3183  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BMA10247_3177  0  14  -3.176069  phosphoheptose isomerase
BMA10247_3178  2  15  -4.205632  phospholipid-binding protein
BMA10247_3179  5  19  -5.041003  cytochrome c family protein
BMA10247_3181  3  15  -4.984052  *amino acid permease
BMA10247_3182  4  19  -4.336051  IS407A, transposase OrfB
BMA10247_3183  2  16  -3.251866  IS407A, transposase OrfA
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_3179  SURFACELAYER  26  0.035  Lactobacillus surface layer protein signature.
		>SURFACELAYER#Lactobacillus surface layer protein signature.

Length = 439

Score = 26.2 bits (57), Expect = 0.035
Identities = 22/68 (32%), Positives = 30/68 (44%), Gaps = 3/68 (4%)

Query: 5 KRVKRTMSAAAAAMAVVSCAMAAAPAAHADAGDGLKVARSNACMGCHAVDRKLVGPSFQQ 64
K+ R +SAAAAA+ V+ A A +A A + + VD V PS
Sbjct: 2 KKNLRIVSAAAAALLAVAPIAATAMPVNAATTINADSAINANTNAKYDVD---VTPSISA 58

Query: 65 IAERYKND 72
IA K+D
Sbjct: 59 IAAVAKSD 66


45  BMA10247_3196  BMA10247_3221  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BMA10247_3196  2  14  0.715652  (2Fe-2S) ferredoxin
BMA10247_3198  1  14  1.990887  lipoprotein
BMA10247_3199  0  12  2.496058  ABC transporter periplasmic substrate-binding
BMA10247_3200  0  12  2.959542  ABC transporter system ATP-binding protein
BMA10247_3201  -2  12  3.592364  ABC transporter permease
BMA10247_3202  -2  12  3.636189  2`,3`-cyclic-nucleotide 2`-phosphodiesterase
BMA10247_3203  -1  12  4.224217  hypothetical protein
BMA10247_3204  -1  14  3.457294  biotin--protein ligase
BMA10247_3205  1  16  2.518870  pantothenate kinase
BMA10247_3206  1  11  1.945161  hypothetical protein
BMA10247_3207  0  9  0.889909  cytidyltransferase-like protein
BMA10247_3208  -1  10  1.412174  hypothetical protein
BMA10247_3209  1  11  3.038691  hypothetical protein
BMA10247_3212  0  11  2.313364  hypothetical protein
BMA10247_3213  -1  11  1.519119  enoyl-CoA hydratase
BMA10247_3214  -1  12  -0.126792  fumarylacetoacetate hydrolase
BMA10247_3215  -3  8  -1.137500  IclR family transcriptional regulator
BMA10247_3216  -1  8  -1.968114  hypothetical protein
BMA10247_3217  -1  8  -2.955693  hypothetical protein
BMA10247_3218  0  8  -2.900463  5-methyltetrahydrofolate--homocysteine
BMA10247_3219  0  10  -3.421607  methionine synthase
BMA10247_3220  3  18  -3.868672  ISBma2, transposase
BMA10247_3221  -1  11  -3.202092  outer membrane porin
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_3204  SECA  29  0.026  SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 29.1 bits (65), Expect = 0.026
Identities = 18/49 (36%), Positives = 22/49 (44%), Gaps = 4/49 (8%)

Query: 198 AAAEVDALRARDATLAGGLP----PVALAAVRAGATLTDTFAAALNALA 242
A+ V +R D L GG+ +A G TLT T A LNAL
Sbjct: 74 ASKRVFGMRHFDVQLLGGMVLNERCIAEMRTGEGKTLTATLPAYLNALT 122


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_3205  PF03309  202  6e-67  Bvg accessory factor
		>PF03309#Bvg accessory factor

Length = 271

Score = 202 bits (516), Expect = 6e-67
Identities = 58/279 (20%), Positives = 102/279 (36%), Gaps = 47/279 (16%)

Query: 1 MCLLIDAGNSRIKWALADTARHFVTSGAFEHASDAPDWSTLPAPR------GAWISNVAG 54
M L ID N+ L G+ +HA W P I + G
Sbjct: 1 MLLAIDVRNTHTVVGLIS--------GSGDHAKVVQQWRIRTEPEVTADELALTIDGLIG 52

Query: 55 DAAAA---------------RIDALIEARWPALPRTVVRASAAQCGVTNGYAEPARLGSD 99
D A + ++E WP +P ++ G+ P +G+D
Sbjct: 53 DDAERLTGASGLSTVPSVLHEVRVMLEQYWPNVPHVLIEPGVR-TGIPLLVDNPKEVGAD 111

Query: 100 RWAGLIGAHAAFADEHLLIATFGTATTLEALRADGHFAGGLIAPGWALMMRSLGMHTAQL 159
R + A+ + +++ FG++ ++ + A G F GG IAPG + + +A L
Sbjct: 112 RIVNCLAAYHKYGTAAIVV-DFGSSICVDVVSAKGEFLGGAIAPGVQVSSDAAAARSAAL 170

Query: 160 PTVSIDAATNLLDELAENDAHAPFAIDTPHALSAGCLQAQAGLIE----RAWRDLEKAWQ 215
V + +++ + +T + AG + AGL++ R D++
Sbjct: 171 RRVELTRPRSVIGK------------NTVECMQAGAVFGFAGLVDGLVNRIRDDVDGFSG 218

Query: 216 APVRLVLSGGAADAIVRALTVPHTRHDTLVLTGLALIAH 254
A V +V +G A ++ L L L GL L+
Sbjct: 219 ADVAVVATGHTAPLVLPDLRTVEHYDRHLTLDGLRLVFE 257


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_3206  GPOSANCHOR  30  0.002  Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 30.4 bits (68), Expect = 0.002
Identities = 21/80 (26%), Positives = 29/80 (36%), Gaps = 8/80 (10%)

Query: 70 PALETAPLNASGAAPAAASDSAPGSPAASAPASAVAPASMPASVAAPAAPA----PSSPP 125
A E A L A A+ + D+ PG+ A A + P AP PS+
Sbjct: 451 QAEELAKLRAGKASDSQTPDAKPGNKAVPGKGQAPQAGTKPNQNKAPMKETKRQLPSTGE 510

Query: 126 AAQP----ARAPILPGASAA 141
A P A ++ A A
Sbjct: 511 TANPFFTAAALTVMATAGVA 530


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_3216  PF03544  35  6e-04  Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 34.6 bits (79), Expect = 6e-04
Identities = 18/98 (18%), Positives = 28/98 (28%)

Query: 55 PVQVELLKPQPIERAPAPEKPAADRPRAAPKRAARASAPPAHAPRASAPVSSAAESSTES 114
P Q P+P+ +P + P+ AP + P P+ V
Sbjct: 62 PPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPV 121

Query: 115 SAESPAAASGTEPASAAGGQAAGATSGAAAGASGASAP 152
+ + T PA A ATS +
Sbjct: 122 ESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRA 159


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BMA10247_3221  ECOLNEIPORIN  88  1e-21  E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 88.3 bits (219), Expect = 1e-21
Identities = 90/394 (22%), Positives = 139/394 (35%), Gaps = 71/394 (18%)

Query: 1 MKKSLLALVALSAFAGAAHAQSSVTLYGIIDEGFNINTNAGGKHL-----YNLSSGVMQG 55
MKKSL+AL L+A AA A VTLYG I G + + + V G
Sbjct: 1 MKKSLIALT-LAALPVAAMAD--VTLYGTIKAGVETSRSVAHNGAQAASVETGTGIVDLG 57

Query: 56 SRWGLRGTEDLGGGLKALFVLENGFDVNSGKLNQGGLEFGRQAYVGLSSGFGTVTLGRQY 115
S+ G +G EDLG GLKA++ +E + G RQ+++GL GFG + +GR
Sbjct: 58 SKIGFKGQEDLGNGLKAIWQVEQKASIAGTDSGWGN----RQSFIGLKGGFGKLRVGRLN 113

Query: 116 DSVVDF--VGPLEA-GDQWGGYIAAHPGDLDNFNNAYRVNNAVKFTSANYGGFTFGGLYS 172
+ D + P ++ D G A P + + V++ S + G + Y+
Sbjct: 114 SVLKDTGDINPWDSKSDYLGVNKIAEP---EARLIS------VRYDSPEFAGLSGSVQYA 164

Query: 173 FGGVAGDFSRNQTWSLGAGYTNGPLVLGVGYLNARTPSTAGGLFGNNTTSSTPAAVTTPV 232
AG ++++ G Y NG + G R + +
Sbjct: 165 LNDNAG-RHNSESYHAGFNYKNGGFFVQYGGAYKRHHQVQENVNIEKYQIHR-------L 216

Query: 233 YAGYASAHTYQVIGAGGAYSFGAATVGITYSNIKFMNFASTVFPNQTATFNNAEINFKYQ 292
+GY + Y A A A + + + T + N +
Sbjct: 217 VSGYDNDALY----ASVAVQQQDAKL-VEENYSHNSQTEVAA----TLAYRFG--NVTPR 265

Query: 293 LTPTLLAGAAYDYTQGSKIAGSSAAKYHQGSVGVDYFLSKRTDVYAIGVYQHASGNVIEA 352
++ ++D T + Y Q VG +Y SKRT + E
Sbjct: 266 VSYAHGFKGSFDATNYNND-------YDQVVVGAEYDFSKRTSALVSAGWLQ------EG 312

Query: 353 DGNTVGPATAAINGLTPSSNRNQFAARVGIRHKF 386
G S A VG+RHKF
Sbjct: 313 KG---------------ESKFVSTAGGVGLRHKF 331


46  BMA10247_3251  BMA10247_3262  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
BMA10247_3251  -1      17       -3.406945  type II/IV secretion system protein
BMA10247_3252  1       20       -6.415721  transporter
BMA10247_3253  4       28       -6.729729  IS407A, transposase OrfA
BMA10247_3254  0       19       -4.954177  IS407A, transposase OrfB
BMA10247_3255  -3      14       -3.226273  hypothetical protein
BMA10247_3257  -1      13       -3.555341  *octaprenyl-diphosphate synthase
BMA10247_3258  0       14       -3.578875  50S ribosomal protein L21
BMA10247_3259  0       13       -3.166297  50S ribosomal protein L27
BMA10247_3260  -2      11       -2.868650  GTPase ObgE
BMA10247_3261  -1      12       -2.968084  gamma-glutamyl kinase
BMA10247_3262  -1      12       -3.671710  ISBma2, transposase
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_3261  CARBMTKINASE  36     1e-04    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 36.3 bits (84), Expect = 1e-04
Identities = 34/129 (26%), Positives = 48/129 (37%), Gaps = 10/129 (7%)

Query: 132 GVVPIINENDTVVTDEIKFGDNDTLGALVANLIEGDTLVILTDQPGLFTADPRKDPGATL 191
G VP+I E+ + E D D G +A + D +ILTD G +
Sbjct: 195 GGVPVILEDGEIKGVEAVI-DKDLAGEKLAEEVNADIFMILTDVNGAALYYGTEKEQWLR 253

Query: 192 VAEASAGAPELEAMAGGAGSSIGRGGMLTKILAAKRAAHSGANTVIASGRERDVLVRLAA 251
+ E AGS M K+LAA R G I + E+ V
Sbjct: 254 EVKVEELRKYYEEGHFKAGS------MGPKVLAAIRFIEWGGERAIIAHLEK--AVEALE 305

Query: 252 GEAIGTQLI 260
G+ GTQ++
Sbjct: 306 GKT-GTQVL 313


47  BMA10247_3295  BMA10247_3303  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias   Product
BMA10247_3295  2       14       2.540387  IclR family transcriptional regulator
BMA10247_3296  2       13       1.834895  2-dehydro-3-deoxygalactonokinase
BMA10247_3297  3       12       1.079854  2-dehydro-3-deoxy-6-phosphogalactonate aldolase
BMA10247_3298  2       13       1.264314  short chain dehydrogenase
BMA10247_3299  3       12       0.791557  L-arabinose ABC transporter periplasmic
BMA10247_3300  4       13       1.273609  L-arabinose transporter ATP-binding protein
BMA10247_3301  4       13       1.502064  L-arabinose transporter permease
BMA10247_3302  3       13       2.017356  short chain dehydrogenase/reductase
BMA10247_3303  2       13       2.929995  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_3298  DHBDHDRGNASE  135    8e-41    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 135 bits (341), Expect = 8e-41
Identities = 81/260 (31%), Positives = 130/260 (50%), Gaps = 14/260 (5%)

Query: 4 LAGKVAIVTGAGRGIGAAIARAFVREGAAVAIAELDAA---LAEESADAIARDTAGARVL 60
+ GK+A +TGA +GIG A+AR +GA +A + + S A AR
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE----- 60

Query: 61 AVPTDVARAESVAAALARTERAFGPLDVLVNNAGVNVFGDPLALTDEDWRRCFAIDLDGV 120
A P DV + ++ AR ER GP+D+LVN AGV G +L+DE+W F+++ GV
Sbjct: 61 AFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGV 120

Query: 121 WNGCRAALPGMVERGRGSIVNIASTHAFKIIPGCFPYPVAKHGVLGLTRALGIEYAPRNV 180
+N R+ M++R GSIV + S A Y +K + T+ LG+E A N+
Sbjct: 121 FNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNI 180

Query: 181 RVNAIAPGYIETQLTHDWWSAQPDPQAARRETLALQ-----PMKRIGRPDEVAMTAVFLA 235
R N ++PG ET + W+ + + + P+K++ +P ++A +FL
Sbjct: 181 RCNIVSPGSTETDMQWSLWADE-NGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLV 239

Query: 236 SDEAPFINASCITIDGGRSV 255
S +A I + +DGG ++
Sbjct: 240 SGQAGHITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
BMA10247_3300  PF05272  30     0.038    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.7 bits (66), Expect = 0.038
Identities = 14/38 (36%), Positives = 17/38 (44%), Gaps = 5/38 (13%)

Query: 18 RALD-GISFDVQAGQVHGLMGENGAGKSTLLKILGGEY 54
R ++ G FD L G G GKSTL+ L G
Sbjct: 587 RVMEPGCKFDY----SVVLEGTGGIGKSTLINTLVGLD 620


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_3302  DHBDHDRGNASE  123    3e-36    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 123 bits (310), Expect = 3e-36
Identities = 76/249 (30%), Positives = 113/249 (45%), Gaps = 8/249 (3%)

Query: 26 GRAVLITGGATGIGASFVEHFARQGARVAFVDLDEKAGRALVARLADAAHEPVFVVCDLT 85
G+ ITG A GIG + A QGA +A VD + + +V+ L A D+
Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVR 67

Query: 86 DIGALRGAIDAIRVRIGPIAVLVNNAANDVRHAVADVTPESFDASIAVNLRHQFFAAQAV 145
D A+ I +GPI +LVN A + ++ E ++A+ +VN F A+++V
Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 146 IDDMKRLGGGAIVNLGSIGWMLKNAGYPVYATAKAAVQGLTRALARELGPFGIRVNTLVP 205
M G+IV +GS + YA++KAA T+ L EL + IR N + P
Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187

Query: 206 GWVMTDKQRRLWLDDAGRAAIKAGQCIDAEL--------LPGDLARMALFLAADDSRLIT 257
G TD Q LW D+ G + G + P D+A LFL + + IT
Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHIT 247

Query: 258 AQDVVVDGG 266
++ VDGG
Sbjct: 248 MHNLCVDGG 256


48  BMA10247_3347  BMA10247_3352  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
BMA10247_3347  3       18       -0.869641  flagellar basal body P-ring biosynthesis protein
BMA10247_3348  4       17       -1.956068  flagellar basal body L-ring protein
BMA10247_3349  5       15       -2.057569  flagellar basal body rod protein FlgG
BMA10247_3350  5       12       -1.512841  flagellar basal body rod protein FlgF
BMA10247_3351  4       12       -1.553119  flagellar hook protein FlgE
BMA10247_3352  2       13       -1.019815  flagellar basal body rod modification protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_3347  FLGPRINGFLGI  371    e-129    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 371 bits (953), Expect = e-129
Identities = 164/392 (41%), Positives = 225/392 (57%), Gaps = 27/392 (6%)

Query: 4 RVVRPLVAARRRAAACCALAACMLALAFAPAAARAERLKDLAQIQGVRDNPLIGYGLVVG 63
RV+R + AA +A L+ PA A R+KD+A +Q RDN LIGYGLVVG
Sbjct: 2 RVLRIIAAALVFSALPF--------LSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVG 53

Query: 64 LDGTGDQTMQTPFTTQTLANMLANLGISINNGSANGGGSSAMTNMQLKNVAAVMVTATLP 123
L GTGD +PFT Q++ ML NLGI+ G +N KN+AAVMVTA LP
Sbjct: 54 LQGTGDSLRSSPFTEQSMRAMLQNLGITTQGGQSN-----------AKNIAAVMVTANLP 102

Query: 124 PFARPGEAIDVTVSSLGNAKSLRGGTLLLTPLKGADGQVYALAQGNMAVGGAGASANGSR 183
PFA PG +DVTVSSLG+A SLRGG L++T L GADGQ+YA+AQG + V G A + +
Sbjct: 103 PFASPGSRVDVTVSSLGDATSLRGGNLIMTSLSGADGQIYAVAQGALIVNGFSAQGDAAT 162

Query: 184 VQVNQLAAGRIAGGAIVERSVPNAVAQMNGVLQLQLNDMDYGTAQRIVSAVNS----SFG 239
+ + R+ GAI+ER +P+ L LQL + D+ TA R+ VN+ +G
Sbjct: 163 LTQGVTTSARVPNGAIIERELPSKFKDSV-NLVLQLRNPDFSTAVRVADVVNAFARARYG 221

Query: 240 AGTATALDGRTIQLTAPADSAQQVAFMARLQNLEVSPERAAAKVILNARTGSIVMNQMVT 299
A D + I + P + MA ++NL V + AKV++N RTG+IV+ V
Sbjct: 222 DPIAEPRDSQEIAVQKPRVA-DLTRLMAEIENLTVETD-TPAKVVINERTGTIVIGADVR 279

Query: 300 LQNCAVAHGNLSVVVNTQPVVSQPGPFSNGQTVVAQQSQIQLKQDNGSLRMVTAGANLAE 359
+ AV++G L+V V P V QP PFS GQT V Q+ I Q+ + + G +L
Sbjct: 280 ISRVAVSYGTLTVQVTESPQVIQPAPFSRGQTAVQPQTDIMAMQEGSKV-AIVEGPDLRT 338

Query: 360 VVKALNSLGATPADLMSILQAMKAAGALRADL 391
+V LNS+G +++ILQ +K+AGAL+A+L
Sbjct: 339 LVAGLNSIGLKADGIIAILQGIKSAGALQAEL 370


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_3348  FLGLRINGFLGH  206    3e-69    Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 206 bits (526), Expect = 3e-69
Identities = 127/220 (57%), Positives = 156/220 (70%), Gaps = 7/220 (3%)

Query: 25 AALAAAALAGCAQIPREPITQQPMSAMPPMPPAMQAPGSIY---NPGYAG-RPLFEDQRP 80
++L +L GCA IP P+ Q SA P P A GSI+ P G +PLFED+RP
Sbjct: 12 SSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINYGYQPLFEDRRP 71

Query: 81 RNVGDILTIVIAENINATKSSGANTNRQGNTSFDVPTAG-FLGGLF--NKANLSAQGANK 137
RN+GD LTIV+ EN++A+KSS AN +R G T+F T +L GLF +A++ A G N
Sbjct: 72 RNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNARADVEASGGNT 131

Query: 138 FAATGGASAANTFNGTITVTVTNVLPNGNLVVSGEKQMLINQGNEFVRFSGIVNPNTISG 197
F GGA+A+NTF+GT+TVTV VL NGNL V GEKQ+ INQG EF+RFSG+VNP TISG
Sbjct: 132 FNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRFSGVVNPRTISG 191

Query: 198 QNSVYSTQVADARIEYSAKGYINEAETMGWLQRFFLNIAP 237
N+V STQVADARIEY GYINEA+ MGWLQRFFLN++P
Sbjct: 192 SNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSP 231


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits        Score  E-value  Comments
BMA10247_3349  FLGHOOKAP1  42     1e-06    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 42.3 bits (99), Expect = 1e-06
Identities = 10/48 (20%), Positives = 23/48 (47%)

Query: 213 TLKQGYVESSNVNVVQELVNMIQTQRAYEINSKAVTTSDQMLQTVTQM 260
L S VN+ +E N+ + Q+ Y N++ + T++ + + +
Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545



Score = 40.3 bits (94), Expect = 5e-06
Identities = 19/80 (23%), Positives = 34/80 (42%), Gaps = 14/80 (17%)

Query: 4 SLYIAATGMNAQQAQMDVISNNLANVSTNGFKGSRAVFEDLLYQTVRQPGANSTQQTELP 63
+ A +G+NA QA ++ SNN+++ + G+ RQ + + L
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYT--------------RQTTIMAQANSTLG 48

Query: 64 SGLQLGTGVQQVATERLYTQ 83
+G +G GV +R Y
Sbjct: 49 AGGWVGNGVYVSGVQREYDA 68


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits        Score  E-value  Comments
BMA10247_3350  FLGHOOKAP1  29     0.019    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 29.2 bits (65), Expect = 0.019
Identities = 9/34 (26%), Positives = 18/34 (52%)

Query: 4 LIYTAMTGATQSLEQQSVVANNLANASTTGFRAQ 37
LI AM+G + + +NN+++ + G+ Q
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQ 36


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits        Score  E-value  Comments
BMA10247_3351  FLGHOOKAP1  34     0.001    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 34.2 bits (78), Expect = 0.001
Identities = 17/58 (29%), Positives = 24/58 (41%)

Query: 356 ISAPGSTNHGTLQGSALENSNVDLTSQLVKLITAQRNYQANAQTIKTQQTVDQTLINL 413
SA L S V+L + L Q+ Y ANAQ ++T + LIN+
Sbjct: 488 SSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545



Score = 30.3 bits (68), Expect = 0.017
Identities = 11/31 (35%), Positives = 17/31 (54%)

Query: 6 GLSGLAGASSDLDVIGNNIANANTVGFKGST 36
+SGL A + L+ NNI++ N G+ T
Sbjct: 7 AMSGLNAAQAALNTASNNISSYNVAGYTRQT 37


49  BMA10247_3391  BMA10247_3414  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias   Product
BMA10247_3391  -1      8        3.274557  LuxR family transcriptional regulator
BMA10247_3392  -1      8        3.925716  amino acid permease
BMA10247_3393  1       10       4.839217  lipoprotein
BMA10247_3394  2       10       4.590653  hypothetical protein
BMA10247_3395  0       9        3.747779  flagellar biosynthetic protein FlhB
BMA10247_3396  2       10       2.388281  hypothetical protein
BMA10247_3397  2       12       0.238705  hypothetical protein
BMA10247_3398  3       11       1.270273  flagellar protein FliS
BMA10247_3399  2       13       1.637818  flagellar hook-basal body complex protein FliE
BMA10247_3400  1       11       3.128063  flagellar MS-ring protein
BMA10247_3401  1       9        2.940746  flagellar motor switch protein G
BMA10247_3402  0       9        3.337389  flagellar assembly protein H
BMA10247_3403  -1      9        2.770664  flagellum-specific ATP synthase FliI
BMA10247_3404  -1      8        2.227537  flagellar FliJ protein
BMA10247_3405  0       9        1.843262  flagellar hook-length control protein
BMA10247_3406  0       9        0.777202  GMC family oxidoreductase
BMA10247_3409  -1      10       0.964552  coniferyl aldehyde dehydrogenase
BMA10247_3411  1       15       1.756469  serine/threonine protein kinase
BMA10247_3412  3       14       2.035477  hypothetical protein
BMA10247_3413  2       13       1.029335  hypothetical protein
BMA10247_3414  2       12       1.212730  major facilitator family transporter
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
BMA10247_3391  HTHFIS  85     3e-21    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 84.9 bits (210), Expect = 3e-21
Identities = 31/114 (27%), Positives = 55/114 (48%), Gaps = 2/114 (1%)

Query: 5 ILLVDDHAIVRQGIRQLLIDRGIAREVKEAECGGDALVIAEKSEFDVILLDISLPDMNGI 64
IL+ DD A +R + Q L G +V+ + D+++ D+ +PD N
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGY--DVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 65 EVLKRLKRRLPSTPVLMFSMYREDQFAVRALKAGAAGYLSKTVNAAQMVSAISQ 118
++L R+K+ P PVL+ S A++A + GA YL K + +++ I +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_3395  TYPE3IMSPROT  62     5e-15    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 62.1 bits (151), Expect = 5e-15
Identities = 15/69 (21%), Positives = 28/69 (40%), Gaps = 1/69 (1%)

Query: 12 APRVVAKGYGLVAERIIERARDAGLYVHTAPEMV-SLLMQVDLDARIPPQLYQAVAELLA 70
P V K + + + A + G+ + + +L +D IP + +A AE+L
Sbjct: 280 LPLVTFKYTDAQVQTVRKIAEEEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLR 339

Query: 71 WLYALERDA 79
WL +
Sbjct: 340 WLERQNIEK 348


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
BMA10247_3399  FLGHOOKFLIE  61     9e-16    Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature.

Length = 103

Score = 61.2 bits (148), Expect = 9e-16
Identities = 47/111 (42%), Positives = 62/111 (55%), Gaps = 8/111 (7%)

Query: 3 APVNGIASALQQMQAMAAQAAGGASPATSLAGSGAASAGSFASAMKASLDKISGDQQKAL 62
+ + GI + Q+QA A A S SFA + A+LD+IS Q A
Sbjct: 1 SAIQGIEGVISQLQATAMSARAQES--------LPQPTISFAGQLHAALDRISDTQTAAR 52

Query: 63 GEAHAFEIGAQNVSLNDVMVDMQKANIGFQFGLQVRNKLVSAYNEIMQMSV 113
+A F +G V+LNDVM DMQKA++ Q G+QVRNKLV+AY E+M M V
Sbjct: 53 TQAEKFTLGEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 103


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_3400  FLGMRINGFLIF  462    e-159    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 462 bits (1191), Expect = e-159
Identities = 253/562 (45%), Positives = 359/562 (63%), Gaps = 37/562 (6%)

Query: 53 LSRMKTNPRLPFLIGAALAIAAIVALVLWSRAPDYRVLYSNLSDRDGGAIIAALQQANVP 112
L+R++ NPR+P ++ + A+A +VA+VLW++ PDYR L+SNLSD+DGGAI+A L Q N+P
Sbjct: 16 LNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQMNIP 75

Query: 113 YKFADAGGAILVPANQVHETRLKLAAMGLPKGGSVGFELMDNQKFGISQFAEQVNYQRAL 172
Y+FA+ GAI VPA++VHE RL+LA GLPKGG+VGFEL+D +KFGISQF+EQVNYQRAL
Sbjct: 76 YRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQFSEQVNYQRAL 135

Query: 173 EGELQRTVESSNAVRAARVYLAIPKPSVFVRDREAPSASVLVDLYPGRVLDEGQVLAVTR 232
EGEL RT+E+ V++ARV+LA+PKPS+FVR++++PSASV V L PGR LDEGQ+ AV
Sbjct: 136 EGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALDEGQISAVVH 195

Query: 233 MVSSSVPDMPAKNVTIVDQDGNLLTQT-ASATGLDASQLKYVQQIERNTQKRIDAILAPI 291
+VSS+V +P NVT+VDQ G+LLTQ+ S L+ +QLK+ +E Q+RI+AIL+PI
Sbjct: 196 LVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDLNDAQLKFANDVESRIQRRIEAILSPI 255

Query: 292 FGAGNARSQVSADVDFSKIEQTSESYGPNGTPQQSAIRSQQTSSSTELAQSGASGVPGAL 351
G GN +QV+A +DF+ EQT E Y PNG ++ +RS+Q + S ++ GVPGAL
Sbjct: 256 VGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVGAGYPGGVPGAL 315

Query: 352 SNTPPQPASAPIVA-------------SNGQPAGPAATPVSDRKDSTTNYELDKTVRHVE 398
SN P P API ++ +A P S +++ T+NYE+D+T+RH +
Sbjct: 316 SNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSNYEVDRTIRHTK 375

Query: 399 QSMGTIKRLSVAVVVNYQPSTDAKGRVTMQPLAADKLAQVQQLVKDAMGYDEKRGDSVNV 458
++G I+RLSVAVVVNY+ D K PL AD++ Q++ L ++AMG+ +KRGD++NV
Sbjct: 376 MNVGDIERLSVAVVVNYKTLADGKP----LPLTADQMKQIEDLTREAMGFSDKRGDTLNV 431

Query: 459 VNSAFSAAADPFANLPWWRQPDMIELGKDIAKWLGVAAAAAALYFMFVRPALRR---AFP 515
VNS FSA + LP+W+Q I+ +WL V A L+ VRP L R
Sbjct: 432 VNSPFSAVDNTGGELPFWQQQSFIDQLLAAGRWLLVLVVAWILWRKAVRPQLTRRVEEAK 491

Query: 516 PPAEPAAAAVPALDGPDDMLALDGLPSPDKKQLAEEDEEHPALLAFENERNRYERNLDYA 575
E A + + L+ D + N+R E
Sbjct: 492 AAQEQAQVRQETEEAVEVRLSKDEQLQQRR----------------ANQRLGAEVMSQRI 535

Query: 576 RTIARQDPKIVATVVKNWVSDE 597
R ++ DP++VA V++ W+S++
Sbjct: 536 REMSDNDPRVVALVIRQWMSND 557


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_3401  FLGMOTORFLIG  297    e-102    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 297 bits (762), Expect = e-102
Identities = 114/324 (35%), Positives = 191/324 (58%)

Query: 5 GLNKSALLLMSIGDEEAAQVFKFLAPREVQKIGAAMAALKNVTREQVEDVLNDFVQEAEK 64
G K+A+LL+SIG E +++VFK+L+ E++ + +A L+ +T E ++VL +F +
Sbjct: 17 GKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFKELMMA 76

Query: 65 HTALSLDSSEYIRTVLTKALGEDKAGVLIDRILQGSDTSGIEGLKWMDSAAVAELIKNEH 124
+ +Y R +L K+LG KA +I+ + + E ++ D A + I+ EH
Sbjct: 77 QEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFVRRADPANILNFIQQEH 136

Query: 125 PQIIATILVHLDRDQASEIASCFTERLRNDVLLRIATLDGIQPTALRELDDVLTGLLSGS 184
PQ IA IL +LD +AS I S ++ +V RIA +D P +RE++ VL L+
Sbjct: 137 PQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVLEKKLASL 196

Query: 185 DNLKRAPMGGIRTAAEILNFMTSVHEEAVIENVKQYDPDLAQKIIDQMFVFENLLDLEDR 244
+ GG+ EI+N E+ +IE++++ DP+LA++I +MFVFE+++ L+DR
Sbjct: 197 SSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFEDIVLLDDR 256

Query: 245 AIQLLLKEVESEALIIALKGAPPALRQKFLSNMSQRAAELLAEDLDARGPVRVSEVETQQ 304
+IQ +L+E++ + L ALK +++K NMS+RAA +L ED++ GP R +VE Q
Sbjct: 257 SIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTRRKDVEESQ 316

Query: 305 RKILQVVRNLAESGQIVIGGKAED 328
+KI+ ++R L E G+IVI E+
Sbjct: 317 QKIVSLIRKLEEQGEIVISRGGEE 340


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
BMA10247_3402  FLGFLIH  108    3e-31    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 108 bits (271), Expect = 3e-31
Identities = 64/184 (34%), Positives = 106/184 (57%), Gaps = 4/184 (2%)

Query: 37 AAAALAAELQRVRDAAHAEGLAAGHVEGQALGYQAGYEQGRAKGFDEGRAEAHTHAAQLA 96
A +L +L +++ AH +G AG EG+ G++ GY++G A+G ++G AEA + A +
Sbjct: 36 AEPSLEQQLAQLQMQAHEQGYQAGIAEGRQQGHKQGYQEGLAQGLEQGLAEAKSQQAPIH 95

Query: 97 A----LAASFRDALAGVERDLADDIATLALEIAQQVVRQHVQHDPAALIAAAREVLAAEP 152
A L + F+ L ++ +A + +ALE A+QV+ Q D +ALI +++L EP
Sbjct: 96 ARMQQLVSEFQTTLDALDSVIASRLMQMALEAARQVIGQTPTVDNSALIKQIQQLLQQEP 155

Query: 153 ALAGAPHLIVNPADLPVVEAYLKDELDTLGWSVRTDTSIERGGCRAHASTGEIDATLTTR 212
+G P L V+P DL V+ L L GW +R D ++ GGC+ A G++DA++ TR
Sbjct: 156 LFSGKPQLRVHPDDLQRVDDMLGATLSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATR 215

Query: 213 WERV 216
W+ +
Sbjct: 216 WQEL 219


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
BMA10247_3404  FLGFLIJ  60     2e-14    Flagellar FliJ protein signature.
		>FLGFLIJ#Flagellar FliJ protein signature.

Length = 147

Score = 59.8 bits (144), Expect = 2e-14
Identities = 43/140 (30%), Positives = 74/140 (52%)

Query: 1 MAQSFPLQLLLERAQDDLDTAAKQLGRAQRERTDAQAQLDALMRYRDEYRVRFAESAQSG 60
MA+ L L + A+ +++ AA+ LG +R A+ QL L+ Y++EYR +G
Sbjct: 1 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG 60

Query: 61 MPAGNWRNFQAFLDTLDAAIEQQRRVLAAAQTRIDAARPEWQAKKRTLGSYEILQARGAR 120
+ + W N+Q F+ TL+ AI Q R+ L ++D A W+ KK+ L +++ LQ R +
Sbjct: 61 ITSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQST 120

Query: 121 QDAQRAAKREQRDADEHAAK 140
+ +Q+ DE A +
Sbjct: 121 AALLAENRLDQKKMDEFAQR 140


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
BMA10247_3405  FLGHOOKFLIK  74     2e-16    Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 73.7 bits (180), Expect = 2e-16
Identities = 79/257 (30%), Positives = 109/257 (42%), Gaps = 8/257 (3%)

Query: 213 NGDASAPLAANRAAFDKLLAGAKAPAAQAAPTDASGANPATALANAAANAAQPDASG--A 270
N D +A L+A A K A + T L + AQPD +
Sbjct: 124 NEDVTASLSALFAMLPGFDNTPKVTDAPSTVLPTEKPTLFTKLTSEQLTTAQPDDAPGTP 183

Query: 271 LAALQDAADSARATLAASSAPAALQQAA-PAALAANASAAAASAAPSLAPPVGTPDWTDA 329
L A++ S P+ + AA P AAP L+ P+G+ +W +
Sbjct: 184 AQPLTPLVAEAQSKAEVISTPSPVTAAASPLITPHQTQPLPTVAAPVLSAPLGSHEWQQS 243

Query: 330 LSQKVVFLSNAHQQSAELTLNPPDLGPLQVVLRVADNHAHALFVSQHAQVRDAVEAALPK 389
LSQ + + QQSAEL L+P DLG +Q+ L+V DN A VS H VR A+EAALP
Sbjct: 244 LSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAALPV 303

Query: 390 LREAMEAGGLGLGSASVSDGGFASAQQQQTPQRQSSDGSATRRAFGASTADAALDELAAA 449
LR + G+ LG +++S F+ QQ + Q+QS +A D L
Sbjct: 304 LRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQR-TANHEPLAGEDDDT----LPVP 358

Query: 450 SSGGAARRTVGMVDTFA 466
S VD FA
Sbjct: 359 VSLQGRVTGNSGVDIFA 375


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
BMA10247_3414  TCRTETB  44     6e-07    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 44.5 bits (105), Expect = 6e-07
Identities = 62/361 (17%), Positives = 114/361 (31%), Gaps = 45/361 (12%)

Query: 66 LPEFSKAFGVSPAQSSLALSFATAALAAAVFVAGFVSEALSRHRLMTASLTASSLLTLAA 125
LP+ + F PA ++ + + V G +S+ L RL+ + + ++
Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96

Query: 126 AFAPHWHQLLIL-RALTGLALGGVPAVAMAYLAEEVHPDGLGLAMGLYVGGTAIGGMAGR 184
+ LLI+ R + G PA+ M +A + + G A GL A+G G
Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156

Query: 185 VITGILTDLFSWRIAVGAIGVLGLASMLAFRMLLPPSRH--------------------- 223
I G++ W + + + ++L R
Sbjct: 157 AIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFML 216

Query: 224 ------------FVPRRGLNLAHHRTS----LAHHLRGQRELPVLFAMAFVLMGSFVTLY 267
V + + H R + L + ++ G+
Sbjct: 217 FTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFV 276

Query: 268 NYIGYRLLAPPYSMGQATIGA--IFVVYLVGVVASPLSGRLADTLGRGRVLI---ASLAV 322
+ + Y ++ + + A IG+ IF + ++ + G L D G VL L+V
Sbjct: 277 SMVPY-MMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSV 335

Query: 323 MLGGVALTLLHPVAAIVAGVACVTFGFFAGHAVASGWVGR-LAQHGKGQAAALYLLAYYL 381
+ L + + V G V S V L Q G +L +L
Sbjct: 336 SFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFL 395

Query: 382 G 382

Sbjct: 396 S 396


50  BMA10247_0051  BMA10247_0059  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
BMA10247_0051  -1      9        2.013438   DNA-binding response regulator
BMA10247_0052  -2      9        1.647771   sensor histidine kinase
BMA10247_0053  -2      9        0.842655   serine protease
BMA10247_0054  1       11       -0.943126  hypothetical protein
BMA10247_0055  0       11       -0.314022  hypothetical protein
BMA10247_0056  0       12       -0.158737  carbon monoxide dehydrogenase
BMA10247_0057  1       10       -1.090060  multidrug efflux pump repressor protein BpeR
BMA10247_0058  1       10       -1.697277  multidrug efflux periplasmic linker protein
BMA10247_0059  1       12       -2.356623  inner membrane multidrug efflux protein BpeB
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
BMA10247_0051  HTHFIS  95     6e-25    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 95.3 bits (237), Expect = 6e-25
Identities = 36/124 (29%), Positives = 60/124 (48%), Gaps = 1/124 (0%)

Query: 2 RILLVEDDRMIAEGVRKALKADGCAVDWVQDGDAALTALGGEAYDLLLLDLGLPKRDGID 61
IL+ +DD I + +AL G V + + DL++ D+ +P + D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VLRTLRARGLALPVLILTARDAVADRVKGLDAGADDYLVKPFDLDE-LAARMRALIRRQS 120
+L ++ LPVL+++A++ +K + GA DYL KPFDL E + RAL +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 121 GRSE 124
S+
Sbjct: 125 RPSK 128


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits        Score  E-value  Comments
BMA10247_0053  V8PROTEASE  78     6e-18    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 78.1 bits (192), Expect = 6e-18
Identities = 38/207 (18%), Positives = 71/207 (34%), Gaps = 40/207 (19%)

Query: 69 QRRAAPQLPIDPDDP-----FYQFFRHFYGQIPGMGGGRQPQPDDQPSTSLGSGFIISAD 123
++R + + +D I Q + T + SG ++
Sbjct: 62 EQREHANVILPNNDRHQITDTTNGHYAPVTYI---------QVEAPTGTFIASGVVV-GK 111

Query: 124 GYILTNAHVIDGANVVTVKLTDKR-----------EYKA-KVVGADKQSDVAVLKIDA-- 169
+LTN HV+D + L + A ++ + D+A++K
Sbjct: 112 DTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGEGDLAIVKFSPNE 171

Query: 170 ------SGLPIVKIGDPAQSKVGQWVVAIGSPYGFDNTVTSGIISAKSRALPDENYTPFI 223
+ + + A+++V Q + G P +K + + +
Sbjct: 172 QNKHIGEVVKPATMSNNAETQVNQNITVTGYPGDKPVATMW---ESKGKITYLKGE--AM 226

Query: 224 QTDVPVNPGNSGGPLFNLNGEVIGINS 250
Q D+ GNSG P+FN EVIGI+
Sbjct: 227 QYDLSTTGGNSGSPVFNEKNEVIGIHW 253


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
BMA10247_0057  HTHTETR  126    2e-38    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 126 bits (317), Expect = 2e-38
Identities = 81/209 (38%), Positives = 116/209 (55%), Gaps = 1/209 (0%)

Query: 1 MARRTKEEALATRDRILDAAEHVFFEKGVSHTSLADIAQHAGVTRGAIYWHFASKSELFD 60
MAR+TK+EA TR ILD A +F ++GVS TSL +IA+ AGVTRGAIYWHF KS+LF
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 61 AMFDRVLLPIDELKAGT-GEPHADPLGRIREILIWCLLGAARDPQLRRVFSILFMKCEYV 119
+++ I EL+ + DPL +REILI L + + R + I+F KCE+V
Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120

Query: 120 ADMGPLLQRNREGMRDALRNIEADLAQGVANGQLPADLDTWRATLMLHTLVSGFVRDMLM 179
+M + Q R ++ IE L + LPADL T RA +++ +SG + + L
Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180

Query: 180 LPGEIDAERHAEKLVDGCFDMLRMSPAMR 208
P D ++ A V +M + P +R
Sbjct: 181 APQSFDLKKEARDYVAILLEMYLLCPTLR 209


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
BMA10247_0058  RTXTOXIND  42     4e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 41.7 bits (98), Expect = 4e-06
Identities = 42/266 (15%), Positives = 80/266 (30%), Gaps = 75/266 (28%)

Query: 92 KIDPAPYIAQLNSAKATLAKAQANLATQNALVARYKVLVAANAVSKQQYDDAVAAQGQAA 151
+++ A+ + A + + + + + + + L+ A++K + +A
Sbjct: 206 ELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAV 265

Query: 152 ADVGAGKAAV-------------------------------------------ETAQINL 168
++ K+ + +
Sbjct: 266 NELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQ 325

Query: 169 GYTDVVSPITGRV-GISQVTPGAYVQASQATLMSTVQQLDPVYVDLTQSSLDGLKLRQDI 227
+ + +P++ +V + T G V ++ TLM V + D + V + D +
Sbjct: 326 QASVIRAPVSVKVQQLKVHTEGGVVTTAE-TLMVIVPEDDTLEVTALVQNKDIGFINVG- 383

Query: 228 QSGRIK-------TEGPGAAKVTLILEDGKPYPERGKLQFSDVTVDQTTGSVT--IRAI- 277
Q+ IK G KV I D DQ G V I +I
Sbjct: 384 QNAIIKVEAFPYTRYGYLVGKVKNI--------------NLDAIEDQRLGLVFNVIISIE 429

Query: 278 -----FPNKQRVLLPGMFVRARIEEG 298
NK L GM V A I+ G
Sbjct: 430 ENCLSTGNKNIPLSSGMAVTAEIKTG 455



Score = 30.6 bits (69), Expect = 0.012
Identities = 20/122 (16%), Positives = 35/122 (28%), Gaps = 20/122 (16%)

Query: 1 MRVERVPYRLITVATAAVFLAACGKKESAPPPQTPEVGVVTVQPQPVPVVSELPGRTSAY 60
R V Y ++ A L+ G+ E G +T + +
Sbjct: 55 RRPRLVAYFIMGFLVIAFILSVLGQVEIVATAN----GKLTHSGRSKEIKPIENSIVKEI 110

Query: 61 LVAQVRARVDGIVLRREFTEGSDVKAGQRLYKIDPAPYIAQLNSAKATLAKAQANLATQN 120
+V EG V+ G L K+ A +++L +A+
Sbjct: 111 IVK----------------EGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQ 154

Query: 121 AL 122
L
Sbjct: 155 IL 156


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_0059  ACRIFLAVINRP  1267   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1267 bits (3280), Expect = 0.0
Identities = 673/1035 (65%), Positives = 821/1035 (79%), Gaps = 2/1035 (0%)

Query: 1 MAKFFIDRPIFAWVIAIILMLAGVAAIFTLPIAQYPTIAPPSIQITANYPGASAKTVEDT 60
MA FFI RPIFAWV+AIILM+AG AI LP+AQYPTIAPP++ ++ANYPGA A+TV+DT
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTQVIEQQMSGLDNFLYMSSTSDDSGNATITITFAPGTNPDIAQVQVQNKLSLATPILPQ 120
VTQVIEQ M+G+DN +YMSSTSD +G+ TIT+TF GT+PDIAQVQVQNKL LATP+LPQ
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 121 VVQQLGLSVTKSSSSFLLVLAFNSEDGSMNKYDLANYVASHVKDPISRINGVGTVTLFGS 180
VQQ G+SV KSSSS+L+V F S++ + D+++YVAS+VKD +SR+NGVG V LFG+
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 181 QYAMRIWLDPTKLTNYGLTPVDVTSAISAQNVQIAGGQLGGTPAVPGTVLQATITEATLL 240
QYAMRIWLD L Y LTPVDV + + QN QIA GQLGGTPA+PG L A+I T
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 241 QTPEQFGNILLKVNQDGSQVRLKDVAQIGLGGETYNFDTKYNGQPTAALGIQLATNANAL 300
+ PE+FG + L+VN DGS VRLKDVA++ LGGE YN + NG+P A LGI+LAT ANAL
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 301 ATAKAVRAKIDEMSAYFPHGLVVKYPYDTTPFVRLSIEEVVKTLLEGIVLVFLVMYLFLQ 360
TAKA++AK+ E+ +FP G+ V YPYDTTPFV+LSI EVVKTL E I+LVFLVMYLFLQ
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 361 NLRATIIPTIAVPVVLLGTFAIMSMVGFSINVLSMFGLVLAIGLLVDDAIVVVENVERVM 420
N+RAT+IPTIAVPVVLLGTFAI++ G+SIN L+MFG+VLAIGLLVDDAIVVVENVERVM
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 421 AEEGLPPKEATRKAMGQITGALVGVALVLSAVFVPVAFSGGSVGAIYRQFSLTIVSAMVL 480
E+ LPPKEAT K+M QI GALVG+A+VLSAVF+P+AF GGS GAIYRQFS+TIVSAM L
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 481 SVLVALILTPALCATILKPIPQGHHEEKKGFFGWFNRTFNSSRDKYHVGVHHVIKRSGRW 540
SVLVALILTPALCAT+LKP+ HHE K GFFGWFN TF+ S + Y V ++ +GR+
Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540

Query: 541 LIIYLAVIVAVGLLFVRLPKSFLPDEDQGLMFVIVQTPSGSTQETTARTLANISDYLLTQ 600
L+IY ++ + +LF+RLP SFLP+EDQG+ ++Q P+G+TQE T + L ++DY L
Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600

Query: 601 EKDIVESAFTVNGFSFAGRGQNSGLVFVKLKDYSQRQSSDQKVQALIGRMFGRYAGYKDA 660
EK VES FTVNGFSF+G+ QN+G+ FV LK + +R + +A+I R +D
Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660

Query: 661 LVIPFNPPSIPELGTAAGFDFELTDNAGLGHDALMAARNQLLGMAAKDP-TLRGVRPNGL 719
VIPFN P+I ELGTA GFDFEL D AGLGHDAL ARNQLLGMAA+ P +L VRPNGL
Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720

Query: 720 NDTPQYKVDIDREKANALGVTADAIDQTFSIAWASKYVNNFLDTDGRIKKVYVQSDAPFR 779
DT Q+K+++D+EKA ALGV+ I+QT S A YVN+F+D GR+KK+YVQ+DA FR
Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFID-RGRVKKLYVQADAKFR 779

Query: 780 MTPEDMNIWYVRNGSGGMVPFSAFATGHWTYGSPKLERYNGISAMEIQGQAAPGKSTGQA 839
M PED++ YVR+ +G MVPFSAF T HW YGSP+LERYNG+ +MEIQG+AAPG S+G A
Sbjct: 780 MLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDA 839

Query: 840 MTAMETLAKKLPTGIGYSWTGLSFQEIQSGSQAPILYAISILVVFLCLAALYESWSIPFS 899
M ME LA KLP GIGY WTG+S+QE SG+QAP L AIS +VVFLCLAALYESWSIP S
Sbjct: 840 MALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVS 899

Query: 900 VIMVVPLGVIGALLAATLRGLENDVFFQVGLLTTVGLSAKNAILIVEFARELQQTEKMGP 959
V++VVPLG++G LLAATL +NDV+F VGLLTT+GLSAKNAILIVEFA++L + E G
Sbjct: 900 VMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGV 959

Query: 960 IEAALEAARLRLRPILMTSLAFILGVMLLAISNGAGSASQHAIGTGVIGGMITATFLAIF 1019
+EA L A R+RLRPILMTSLAFILGV+ LAISNGAGS +Q+A+G GV+GGM++AT LAIF
Sbjct: 960 VEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIF 1019

Query: 1020 MIPMFFVKVRAVFSG 1034
+P+FFV +R F G
Sbjct: 1020 FVPVFFVVIRRCFKG 1034


51  BMA10247_0084  BMA10247_0088  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias   Product
BMA10247_0084  0       11       3.877397  carbohydrate ABC transporter ATP-binding
BMA10247_0083  0       11       4.542665  hypothetical protein
BMA10247_0085  -1      11       4.465870  hypothetical protein
BMA10247_0086  1       11       5.245779  LysR family transcriptional regulator
BMA10247_0087  1       12       5.499174  esterase
BMA10247_0088  0       12       5.296582  major facilitator family transporter
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
BMA10247_0084  PF05272  30     0.017    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.0 bits (67), Expect = 0.017
Identities = 14/35 (40%), Positives = 17/35 (48%)

Query: 32 VVFVGPSGCGKSTLMRMIAGLEEISGGELLIDGAK 66
VV G G GKSTL+ + GL+ S I K
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGK 633
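
Each hit block in this report repeats the same two statistics lines (Score/Expect and Identities/Positives). The following is a minimal sketch, not part of the tool itself, of how those lines could be read programmatically and compared with the E-value cutoff of 0.05 listed under the input parameters. The names parse_hit and EVALUE_CUTOFF are hypothetical, and treating the cutoff as inclusive is an assumption.

```python
import re

# Assumption: mirrors the "E-value (RPSBlast) 0.05" input parameter of the report.
EVALUE_CUTOFF = 0.05

# Patterns for the two statistics lines printed at the top of every hit block.
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*([\d.eE+-]+)"
)
IDENT_RE = re.compile(r"Identities\s*=\s*(\d+)/(\d+)")


def parse_hit(lines):
    """Extract bit score, raw score, E-value and identity fraction from one hit block."""
    text = "\n".join(lines)
    score = SCORE_RE.search(text)
    ident = IDENT_RE.search(text)
    if score is None or ident is None:
        raise ValueError("no Score/Identities lines found")
    evalue_text = score.group(3)
    # Older BLAST output may abbreviate 1e-100 as "e-100"; normalise before converting.
    if evalue_text.lower().startswith("e"):
        evalue_text = "1" + evalue_text
    matches, length = int(ident.group(1)), int(ident.group(2))
    return {
        "bits": float(score.group(1)),
        "raw": int(score.group(2)),
        "evalue": float(evalue_text),
        "identity": matches / length,
    }


# Statistics lines copied from the BMA10247_0084 / PF05272 hit above.
hit = parse_hit([
    "Score = 30.0 bits (67), Expect = 0.017",
    "Identities = 14/35 (40%), Positives = 17/35 (48%)",
])
passes_cutoff = hit["evalue"] <= EVALUE_CUTOFF
```

The same function can be applied to any other hit block in this listing, since all of them share this two-line layout.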


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
BMA10247_0085  PF06776  34     0.001    Invasion associated locus B
		>PF06776#Invasion associated locus B

Length = 214

Score = 33.8 bits (77), Expect = 0.001
Identities = 20/80 (25%), Positives = 25/80 (31%), Gaps = 11/80 (13%)

Query: 60 RLCRRIGGRHAAGP------APARESPSENSMKTGRRHFVRSVASASAALAAAAWSPARA 113
R+ RR HA PA SP + + RR R+ A A A A
Sbjct: 10 RISRRPVTNHAVPALKAIQMGPAELSPM---LASCRRLARRNGARLMLAGAMAI--ALSF 64

Query: 114 AIDAPASPATALSLTPGRWS 133
A A+ G W
Sbjct: 65 GWSDRADAQGAVRSVHGDWQ 84


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
BMA10247_0087  BLACTAMASEA  30     0.019    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 29.8 bits (67), Expect = 0.019
Identities = 11/35 (31%), Positives = 15/35 (42%)

Query: 57 REDALFRFASVSKPIVSAAAMRAVAAGKLDLDASI 91
R D F S K ++ A + V AG L+ I
Sbjct: 57 RADERFPMMSTFKVVLCGAVLARVDAGDEQLERKI 91


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
BMA10247_0088  TCRTETB  36     3e-04    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 35.6 bits (82), Expect = 3e-04
Identities = 31/155 (20%), Positives = 59/155 (38%), Gaps = 5/155 (3%)

Query: 26 LLALATAGFITIVTEALPAGLLPLMGRDLRVSDALVGQLVTVYAAGSIVAAMPLVAATRG 85
L+ L F +++ E + LP + D A + T + + +
Sbjct: 16 LIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ 75

Query: 86 MRRRPLLLAALAGFVVANTATAASPYYAPVLV-ARCVAGVSAGLLWALLAGYASRMVDAR 144
+ + LLL + + + +L+ AR + G A AL+ +R +
Sbjct: 76 LGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKE 135

Query: 145 QRGRAIAIAMLGAPVAMSVGI-PL-GTALGAALGW 177
RG+A ++G+ VAM G+ P G + + W
Sbjct: 136 NRGKAFG--LIGSIVAMGEGVGPAIGGMIAHYIHW 168


52  BMA10247_0684  BMA10247_0690  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias   Product
BMA10247_0684  3       11       1.743748  sensor histidine kinase/response regulator
BMA10247_0685  0       11       1.142979  capsular synthesis regulator component B
BMA10247_0686  -1      9        0.974561  hypothetical protein
BMA10247_0688  -1      9        0.452168  drug:H+ antiporter-2 (DHA2) family protein
BMA10247_0689  -1      11       1.597108  hypothetical protein
BMA10247_0690  0       10       0.982526  multidrug resistance protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
BMA10247_0684  HTHFIS  63     2e-12    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 62.9 bits (153), Expect = 2e-12
Identities = 30/122 (24%), Positives = 50/122 (40%), Gaps = 10/122 (8%)

Query: 401 RVLVVDDQEMNRIVLRYQLDALGHHARLCASGDEALRALGTAAYDVVLTDCRMPGMDGIA 460
+LV DD R VL L G+ R+ ++ R + D+V+TD MP +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 461 LTAAIRAH-PDARVRATPIVGVTALVSDAEHARCVDAGMTLCIGKP----TTLDALERAL 515
L I+ PD P++ ++A + + + G + KP + + RAL
Sbjct: 65 LLPRIKKARPD-----LPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119

Query: 516 VE 517
E
Sbjct: 120 AE 121


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
BMA10247_0685  HTHFIS  55     3e-11    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 55.2 bits (133), Expect = 3e-11
Identities = 41/159 (25%), Positives = 65/159 (40%), Gaps = 13/159 (8%)

Query: 5 VLIADDHPLVLLGVRHMLAGMG-DVSIVGEAHDPAGLLALLAATPCDIVITDFAMPEQPA 63
+L+ADD + + L+ G DV I A L +AA D+V+TD MP+
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAAT---LWRWIAAGDGDLVVTDVVMPD--- 59

Query: 64 ADGLAMLTAIRDGYPSVRVIVLTMLDNPVLMHTMRQAGALAVLSKRGDLDEL----PRAL 119
+ +L I+ P + V+V++ + + + GA L K DL EL RAL
Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119

Query: 120 AAVYQGRPFVGTHAGAAGGGAMRGTDAPRQLSPREIEVV 158
A + + G + G A Q R + +
Sbjct: 120 AEPKRRPS--KLEDDSQDGMPLVGRSAAMQEIYRVLARL 156


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
BMA10247_0688  TCRTETB  99     2e-24    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 99.2 bits (247), Expect = 2e-24
Identities = 70/331 (21%), Positives = 138/331 (41%), Gaps = 20/331 (6%)

Query: 41 AFMEVLDTTIVNVALPHIAGTMSASYDEATWTLTSYLVANGIVLPISGFLGRLLGRKRYF 100
+F VL+ ++NV+LP IA + W T++++ I + G L LG KR
Sbjct: 23 SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLL 82

Query: 101 VLCIVAFTICSFLCGIATDLGQLIVF-RVLQGLFGGGLQPNQQSIILDTF-PPEQRNRAF 158
+ I+ S + + L++ R +QG G P +++ + P E R +AF
Sbjct: 83 LFGIIINCFGSVIGFVGHSFFSLLIMARFIQGA-GAAAFPALVMVVVARYIPKENRGKAF 141

Query: 159 SISAVAIVVAPVLGPTLGGWITDNFSWRWVFLLNVPIGVLTSLAVIQLVEDPPWKRGRAR 218
+ + + +GP +GG I W +LL +P+ +T + V L++ K R +
Sbjct: 142 GLIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPM--ITIITVPFLMKLLK-KEVRIK 196

Query: 219 GLSIDYIGITLIAIGLGCLQVMLDRGEDEDWFASTFIRTFAGLTAAGLVGATFWLLYAKK 278
G D GI L+++G+ + F +++ +F ++ + +
Sbjct: 197 G-HFDIKGIILMSVGIVFFML----------FTTSYSISFLIVSVLSFLIFVKHIRKVTD 245

Query: 279 PVVDLSCLKDRNFTLGCVTIATFAVVLYGSAVLVPQLAQQRLGYTAMLAG-LVLSPGALL 337
P VD K+ F +G + + G +VP + + + G +++ PG +
Sbjct: 246 PFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMS 305

Query: 338 ITLEIPIVSKLMPYVQTRFLVCFGFLLLAAS 368
+ + I L+ +++ G L+ S
Sbjct: 306 VIIFGYIGGILVDRRGPLYVLNIGVTFLSVS 336


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
BMA10247_0690  RTXTOXIND  100    6e-25    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 99.5 bits (248), Expect = 6e-25
Identities = 63/414 (15%), Positives = 134/414 (32%), Gaps = 91/414 (21%)

Query: 51 KRPGKKPLVVLAIIVVLLLVGAFVW-WFATRNQVSTDDA--YTDGNAITIAPKVSGYVVA 107
+ P + ++A ++ LV AF+ V+T + G + I P + V
Sbjct: 50 ETPVSRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKE 109

Query: 108 LAIDDNVYVHRGDLLLVIDQRDYQAQVDAARAQLGLAQAQLDAAQVQLDIA------HVQ 161
+ + + V +GD+LL + +A ++ L A+ + Q+ ++
Sbjct: 110 IIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELK 169

Query: 162 FPAQYRQAQA---QIEAAQASFRQALAAYERQHAVDARATSQQAIDVADAQRLTADANVA 218
P + ++ + ++ + ++ Q + + +D A+RLT A +
Sbjct: 170 LPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQ-----KYQKELNLDKKRAERLTVLARIN 224

Query: 219 TARAQA----------------------------RTASLVPQQIRQAQTAVEQRRQQVLQ 250
+ ++R ++ +EQ ++L
Sbjct: 225 RYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILS 284

Query: 251 AQA-----------------------------QLEAAQLALSYCEVRAPSDGWITRRNVQ 281
A+ +L + +RAP + + V
Sbjct: 285 AKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVH 344

Query: 282 -LGSFLQAGAALFAIVTPQ---LWVTANFKESQLERMRAGDRVSVSVDAYP---NLELHG 334
G + L IV P+ L VTA + + + G + V+A+P L G
Sbjct: 345 TEGGVVTTAETLMVIV-PEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVG 403

Query: 335 HVDSIQLGSGSRFSAFPPENATGNFVKIVQRVPVKIAIDGGLPRDPPLGIGLSV 388
V +I L + + G ++ + G ++ PL G++V
Sbjct: 404 KVKNINLDA-------IEDQRLGLVFNVIISIEENCLSTGN--KNIPLSSGMAV 448


53  BMA10247_0823  BMA10247_0834  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias   Product
BMA10247_0823  3       13       5.241408  fimbriae assembly-related protein
BMA10247_0824  2       12       4.962274  Flp pilus assembly protein CpaB
BMA10247_0825  3       14       4.207730  RhcC2
BMA10247_0826  5       13       4.334738  lipoprotein
BMA10247_0827  3       12       4.154237  CpaE protein
BMA10247_0828  2       12       3.796993  CpaF
BMA10247_0829  0       9        2.855671  type II secretion system protein
BMA10247_0830  1       11       2.296996  type II secretion system protein F
BMA10247_0832  2       10       1.878810  hypothetical protein
BMA10247_0834  2       11       2.417519  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_0823  PREPILNPTASE  30     0.003    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 30.2 bits (68), Expect = 0.003
Identities = 31/145 (21%), Positives = 50/145 (34%), Gaps = 12/145 (8%)

Query: 7 LVASWTLASLALADLRTRRLA---TFAVALVGALYAALALVGAPGDGGFASHAALGAAA- 62
L+ +W L +L DL L T + G L+ L + GD + A
Sbjct: 138 LLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGDAVIGAMAGYLVLWS 197

Query: 63 -FALGAAMFRAGWIAGGDVKLAAVVFLWAGPAHAWPVAFAIGVGGLAVGAVCIAAGRAPR 121
+ + + GD KL A + W G V + G +G I
Sbjct: 198 LYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAFMGIGLILLRNH-- 255

Query: 122 VLAWFAPARGVPYGVALAAGGLLAV 146
++ +P+G LA G +A+
Sbjct: 256 -----HQSKPIPFGPYLAIAGWIAL 275


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_0825  BCTERIALGSPD  144    2e-39    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 144 bits (364), Expect = 2e-39
Identities = 68/283 (24%), Positives = 116/283 (40%), Gaps = 16/283 (5%)

Query: 170 VVQTLKPYLRQQEALVNRLTLARPIQVHLRVRITEVDRNITQQLGINWSALGA------- 222
+V + E ++ +L + RP QV + I EV LGI W+ A
Sbjct: 322 IVTAAPDVMNDLERVIAQLDIRRP-QVLVEAIIAEVQDADGLNLGIQWANKNAGMTQFTN 380

Query: 223 SGNFVGGLFNGRTLFDTASKAFDLSPSGAFSVVGGFHTSRYSIDG--VLDALDQEGLITM 280
SG + G ++ S A S G Y + +L AL +
Sbjct: 381 SGLPISTAIAGANQYNKDGTVSSSLAS-ALSSFNGIAAGFYQGNWAMLLTALSSSTKNDI 439

Query: 281 LAEPNLTAISGQTASFLAGGEFPIPVAQDTTGA----ITIQFKPYGVSLDFTPTVLADNR 336
LA P++ + A+F G E P+ TT T++ K G+ L P + +
Sbjct: 440 LATPSIVTLDNMEATFNVGQEVPVLTGSQTTSGDNIFNTVERKTVGIKLKVKPQINEGDS 499

Query: 337 ISLKVRPEVSEIDPTNSVTTGSIKVPALTVRRVDTTVELSSGQSFAIGGLLQSKSSDVLA 396
+ L++ EVS + S T+ + R V+ V + SG++ +GGLL SD
Sbjct: 500 VLLEIEQEVSSVADAASSTSSDLGA-TFNTRTVNNAVLVGSGETVVVGGLLDKSVSDTAD 558

Query: 397 ELPGLARLPVLGKLFSSRNYLNDKTEVVVIVTPYIVQPANPGE 439
++P L +PV+G LF S + K +++ + P +++ +
Sbjct: 559 KVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRDEYR 601


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
BMA10247_0827  HTHFIS  34     0.001    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 33.7 bits (77), Expect = 0.001
Identities = 29/165 (17%), Positives = 52/165 (31%), Gaps = 20/165 (12%)

Query: 22 GARLVAIVADAASDEVIRNLIADQAMTGAQVARGGIDDAIALMRDLSHGPQHLLVDVSGA 81
GA ++ DAA V+ ++ + + ++ DV
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYD--VRITSNAATLWRWIA--AGDGDLVVTDVV-- 56

Query: 82 AMP----LSDLARLADVCDPSVNVIVIGERNDVGLFRSMLRIGVRDYLVKPL----TVEL 133
MP L R+ P + V+V+ +N G DYL KP + +
Sbjct: 57 -MPDENAFDLLPRIKKA-RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGI 114

Query: 134 VHRALSAADPNAAARAGKAIGFVGARGGVGVTSIAVALARHLADR 178
+ RAL+ + + + G S A+ + R
Sbjct: 115 IGRALAEPKRRPSKLEDDSQDGMPLVG----RSAAMQEIYRVLAR 155


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
BMA10247_0828  PF05272  30     0.032    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.7 bits (66), Expect = 0.032
Identities = 18/50 (36%), Positives = 26/50 (52%), Gaps = 4/50 (8%)

Query: 299 IVISGGTGSGKTTLLNAL---SHFIDSHERIVTIEDAAELQLQQPHVVSL 345
+V+ G G GK+TL+N L F D+H I T +D+ E Q+ L
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYE-QIAGIVAYEL 647


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_0834  PYOCINKILLER  32     0.004    Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 31.7 bits (71), Expect = 0.004
Identities = 30/132 (22%), Positives = 49/132 (37%), Gaps = 4/132 (3%)

Query: 40 RVAAARNELQNAADAAALAGAASLEAGAGAPAWAAAASAAAAALSLNASDGAALSSGDVQ 99
A A+ + + A A AA+ A PA + + AA + + GAA + +
Sbjct: 226 AAAEAKRKAEEQARQQAAIRAANTYA---MPANGSVVATAAGRGLIQVAQGAASLAQAIS 282

Query: 100 TGYWNVTGVPAGLEPTTLAPGEYDVPAVQATVTRAPNQNGGPLSLLMGGLLGLVGTPAAA 159
V G P+ +A G + T + +Q + +G +G P +
Sbjct: 283 DAI-AVLGRVLASAPSVMAVGFASLTYSSRTAEQWQDQTPDSVRYALGMDAAKLGLPPSV 341

Query: 160 TAVAVAGAPATV 171
AVA A TV
Sbjct: 342 NLNAVAKASGTV 353


54  BMA10247_0862  BMA10247_0870  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias   Product
BMA10247_0862  3       15       6.361240  short chain dehydrogenase
BMA10247_0864  2       12       6.191166  extracytoplasmic-function sigma-70 factor
BMA10247_0865  3       13       6.557964  mbtH domain-containing protein
BMA10247_0866  3       11       6.170875  syringomycin biosynthesis enzyme
BMA10247_0867  1       11       5.662916  iron ABC transporter ATP-binding protein
BMA10247_0868  2       12       6.696866  iron-hydroxamate transporter permease subunit
BMA10247_0869  2       12       6.584568  ferric iron reductase FhuF
BMA10247_0870  2       12       6.422699  iron ABC transporter substrate-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_0862  DHBDHDRGNASE  122    4e-36    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 122 bits (308), Expect = 4e-36
Identities = 81/252 (32%), Positives = 118/252 (46%), Gaps = 15/252 (5%)

Query: 9 GRSFLVTGASSGIGRAAAVALRGGGARVVAAARNARELERLAHETGC-----EPLELDVG 63
G+ +TGA+ GIG A A L GA + A N +LE++ E DV
Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVR 67

Query: 64 CDASVRAALSG-ERMRDAFDGLINCAGVTSLAAAIDTTADEFDRVMAVNARGAMLVARHV 122
A++ + ER D L+N AGV + +E++ +VN+ G +R V
Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 123 ARAMIRAGRGGSIVNVSSQAALVALPSHLAYCASKAALDAMTRVLCVELGPHGIRVNSVN 182
++ M R GSIV V S A V S AY +SKAA T+ L +EL + IR N V+
Sbjct: 128 SKYM-MDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186

Query: 183 PTVTLTPMAERAWSDPHASGPMLA--------AIPLGRFARVADVVAPILFLSSDAAAMV 234
P T T M W+D + + ++ IPL + A+ +D+ +LFL S A +
Sbjct: 187 PGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHI 246

Query: 235 SGVALPVDGGYT 246
+ L VDGG T
Sbjct: 247 TMHNLCVDGGAT 258


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
BMA10247_0867  PF05272  28     0.040    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.5 bits (63), Expect = 0.040
Identities = 12/23 (52%), Positives = 13/23 (56%)

Query: 36 VTALCGPNGCGKSTLLRTLAGLQ 58
L G G GKSTL+ TL GL
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLD 620


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_0869  2FE2SRDCTASE  56     2e-11    Ferric iron reductase signature.
		>2FE2SRDCTASE#Ferric iron reductase signature.

Length = 262

Score = 55.8 bits (134), Expect = 2e-11
Identities = 50/186 (26%), Positives = 72/186 (38%), Gaps = 24/186 (12%)

Query: 78 RALVSQWSKYYFNLAASAGFAAALLLGRPLDMAPQRMRVALRGGMPVALLFEADALRPAQ 137
+ L+S W+++Y L A L + LD++P+ VA F D
Sbjct: 89 KPLISLWAQWYIGLMVPPLMLALLTQEKALDVSPEHFHAEFHETGRVA-CFWVDVCEDKN 147

Query: 138 AEPAS---RYAALVDH-LRATIDTLAVLAKLSPRVLWANAGNLLD-YLFEQCAHAPRAGA 192
A P S R L+ L + L +++ +++W+N G L++ YL E G
Sbjct: 148 ATPHSPQHRMETLISQALVPVVQALEATGEINGKLIWSNTGYLINWYLTEM---KQLLGE 204

Query: 193 DA------AWLFGPVDSRGEANPLRLPVRRVKPCSARLPDPFRARRVCCLRNEIPGEDQL 246
A F + GE NPL V L D RR CC R +P Q
Sbjct: 205 ATVESLRHALFFEKTLTNGEDNPLWRTV--------VLRDGLLVRRTCCQRYRLPDVQQ- 255

Query: 247 CGSCPL 252
CG C L
Sbjct: 256 CGDCTL 261


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_0870  FERRIBNDNGPP  112    1e-30    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 112 bits (282), Expect = 1e-30
Identities = 77/264 (29%), Positives = 112/264 (42%), Gaps = 15/264 (5%)

Query: 115 PARIVVLEFMFAEDLAALDITPVGMADPAYYPIWIGYDDARFARVSDVGTRQEPSLEAIA 174
P RIV LE++ E L AL I P G+AD Y +W+ + V DVG R EP+LE +
Sbjct: 35 PNRIVALEWLPVELLLALGIVPYGVADTINYRLWVS-EPPLPDSVIDVGLRTEPNLELLT 93

Query: 175 AAKPDLILGVGLRHAPIFDALSRIAPTVLFKYSPNYIEDGRQVTQYDWACAILRTIGCLT 234
KP ++ + P + L+RIAP F +S DG+Q A L + L
Sbjct: 94 EMKPSFMVW-SAGYGPSPEMLARIAPGRGFNFS-----DGKQ--PLAMARKSLTEMADLL 145

Query: 235 GRARDARAVQARVDAGLARDARRIAAAGRAGERVAWLQELGLPDRYWAFTGNSASAGIAR 294
A A+ + + R + G R L L P F NS I
Sbjct: 146 NLQSAAETHLAQYEDFIRSMKPRFV---KRGARPLLLTTLIDPRHMLVFGPNSLFQEILD 202

Query: 295 ALGLE-PWPGEPTREGTAYVTSEDLLKQPDLAVLFVSATEPGVPLDAKLDSSIWRFVPAR 353
G+ W GE G+ V+ + L D+ VL +DA + + +W+ +P
Sbjct: 203 EYGIPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNSKD-MDALMATPLWQAMPFV 261

Query: 354 RAGRVALVERNIWGFGGPMSALRL 377
RAGR V +W +G +SA+
Sbjct: 262 RAGRFQRVP-AVWFYGATLSAMHF 284


55  BMA10247_0994  BMA10247_1001  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
BMA10247_0994  0       12       -0.721851  EmrB/QacA family drug resistance transporter
BMA10247_0995  -1      14       -1.795138  multidrug resistance protein
BMA10247_0996  0       15       -1.856307  RND efflux system outer membrane lipoprotein
BMA10247_0997  1       17       -2.666485  MarR family transcriptional regulator
BMA10247_0998  0       17       -2.736772  hypothetical protein
BMA10247_0999  0       17       -2.502835  GTP-binding protein TypA
BMA10247_1000  -1      16       -2.902882  2-oxoglutarate dehydrogenase E1
BMA10247_1001  -2      13       -2.112716  dihydrolipoamide succinyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
BMA10247_0994  TCRTETB  135    6e-37    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 135 bits (342), Expect = 6e-37
Identities = 84/396 (21%), Positives = 159/396 (40%), Gaps = 16/396 (4%)

Query: 27 VFMNVLDTSIANVAIPTISGDLGVSSDQGTWVITSFAVANAISVPLTGWLTDRIGQVRLF 86
F +VL+ + NV++P I+ D WV T+F + +I + G L+D++G RL
Sbjct: 23 SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLL 82

Query: 87 LASIILFVISSWMCGLAPT-LPFLLASRVLQGAVAGPMIPLSQALLLSSYPRAKAPMALA 145
L II+ S + + + L+ +R +QGA A L ++ P+ A
Sbjct: 83 LFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFG 142

Query: 146 LWSMTTLIAPVAGPILGGWISDNYSWPWIFYVNIPVGIAAAAVTWMIYRSRESAVRRAPI 205
L + GP +GG I+ W ++ IP+ I V +++ ++ +
Sbjct: 143 LIGSIVAMGEGVGPAIGGMIAHYIHWSYLL--LIPM-ITIITVPFLMKLLKKEVRIKGHF 199

Query: 206 DGVGLALLVIWVGSLQIMLDKGKDLDWFASTTIVVLALTALIAFAFFVVWELTAEHPVVD 265
D G+ L+ + G + ML F ++ + + ++++F FV P VD
Sbjct: 200 DIKGIILMSV--GIVFFML--------FTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVD 249

Query: 266 LSLFRMRNFSGGTIALSVGYGLYFGNLVLLPLWLQTQIGYTATDAG-LVMAPVGFFAILL 324
L + F G + + +G G + ++P ++ + + G +++ P I+
Sbjct: 250 PGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIF 309

Query: 325 SPLTGKFLSRTDPRYIATAAFLTFALCFWMRSRYTTGVDEWSLMAPTFVQGIAMAGFFIP 384
+ G + R P Y+ ++ F S + + FV G ++
Sbjct: 310 GYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLG-GLSFTKTV 368

Query: 385 LVSITLSGLPGHRIPAASGLSNFVRIMCGGIGTSIF 420
+ +I S L A L NF + G G +I
Sbjct: 369 ISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIV 404


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
BMA10247_0995  RTXTOXIND  73     6e-16    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 72.6 bits (178), Expect = 6e-16
Identities = 36/270 (13%), Positives = 85/270 (31%), Gaps = 28/270 (10%)

Query: 94 ADSQVALQQAEANLAQTVRQVRGLYVNDDQYRAQVALRQSDLS--------------KAQ 139
+ Q Q E NL + + + ++Y + +S L
Sbjct: 196 STWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVL 255

Query: 140 DDLRRRLAVAQTGAVSQEEISHARDAVKAAQASLDAAGQQLASNRALTANTTVADHPNVL 199
+ + + V + ++ + +A+ Q + L N+
Sbjct: 256 EQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKN-EILDKLRQT--TDNIG 312

Query: 200 AAAAKVRDAYLNNARNTLPAPVTGYVAKRSVR-VGQRVSPGTPLMSVVPLNAV-WVDANF 257
++ + + APV+ V + V G V+ LM +VP + V A
Sbjct: 313 LLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALV 372

Query: 258 KEVQLKHMRIGQPVELTADIYGSSVKYHGKVIGFSAGTGAAFSLLPAQNATGNWIKVVQR 317
+ + + +GQ + + + + +G ++G + + +V
Sbjct: 373 QNKDIGFINVGQNAIIKVEAFPYTR--YGYLVG-------KVKNINLDAIEDQRLGLVFN 423

Query: 318 LPVRVELDPKELKEHPLRIGLSMQVDVDIK 347
+ + +E + + + M V +IK
Sbjct: 424 VIISIEENCLSTGNKNIPLSSGMAVTAEIK 453



Score = 47.1 bits (112), Expect = 8e-08
Identities = 32/207 (15%), Positives = 72/207 (34%), Gaps = 28/207 (13%)

Query: 29 VIAIAAIAYGLYYLLVARFHETTDDAYVNGNVV------QITPQVTGTVIAVKADDTQTV 82
++A + + + +++ + A NG + +I P V + + ++V
Sbjct: 59 LVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESV 118

Query: 83 KSGDPLVVLDPADSQVALQQAEANLAQT---------------VRQVRGLYVNDDQYRAQ 127
+ GD L+ L ++ + +++L Q + ++ L + D+ Y
Sbjct: 119 RKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQN 178

Query: 128 VA----LRQSDLSKAQ-DDLRRRLAVAQ-TGAVSQEEISHARDAVKAAQASLDAAGQQLA 181
V+ LR + L K Q + + + + E + + +L
Sbjct: 179 VSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLD 238

Query: 182 SNRALTANTTVADHPNVLAAAAKVRDA 208
+L +A H VL K +A
Sbjct: 239 DFSSLLHKQAIAKH-AVLEQENKYVEA 264


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
BMA10247_0999  TCRTETOQM  171    5e-48    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 171 bits (435), Expect = 5e-48
Identities = 102/435 (23%), Positives = 172/435 (39%), Gaps = 62/435 (14%)

Query: 5 LRNIAIIAHVDHGKTTLVDQLLRQSGTFRENQQVAE--RVMDSNDIEKERGITILAKNCA 62
+ NI ++AHVD GKTTL + LL SG E V + D+ +E++RGITI +
Sbjct: 3 IINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITS 62

Query: 63 VEYEGTHINIVDTPGHADFGGEVERVLSMVDSVLLLVDAVEGPMPQTRFVTKKALALGLK 122
++E T +NI+DTPGH DF EV R LS++D +LL+ A +G QTR + +G+
Sbjct: 63 FQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIP 122

Query: 123 PIVVINKIDRPGARIDWV-------------INQTFDLFDKLGATE----EQLDFPIV-- 163
I INKID+ G + V I Q +L+ + T EQ D I
Sbjct: 123 TIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEGN 182

Query: 164 -----------------YASGLNGY---ASLDP-----AARDGDMRPLFEAILQHVPVRP 198
+ SL P A + + L E I
Sbjct: 183 DDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITNKFYSST 242

Query: 199 ADPDAPLQLQITSLDYSTYVGRIGVGRITRGRIKPGQPVVMRFGPEGDVLNRKINQVLSF 258
+ L ++ ++YS R+ R+ G + V R + + KI ++ +
Sbjct: 243 HRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSV--RISEKEKI---KITEMYTS 297

Query: 259 QGLERVQVDSAEAGDIVLINGIEDVGIGATICAVEAPEALPMITVDEPTLTMNFLVNSSP 318
E ++D A +G+IV++ E + + + + + I P L +
Sbjct: 298 INGELCKIDKAYSGEIVILQN-EFLKLNSVLGDTKLLPQRERIENPLPLLQTTVEPSKPQ 356

Query: 319 LAGREGKFVTSRQIRDRLMKELNHNVALRVKDTGDETVFEVSGRGELHLTILVENMRRE- 377
+ D L++ V E + +S G++ + + ++ +
Sbjct: 357 QREMLLDALLEISDSDPLLR-------YYVDSATHEII--LSFLGKVQMEVTCALLQEKY 407

Query: 378 GYELAVSRPRVVMQE 392
E+ + P V+ E
Sbjct: 408 HVEIEIKEPTVIYME 422



Score = 33.7 bits (77), Expect = 0.002
Identities = 17/100 (17%), Positives = 32/100 (32%), Gaps = 1/100 (1%)

Query: 387 RVVMQEIDGVKHEPYELLTVDLEDEHQGGVMEELGRRKGEMLDMVSDGRGRTRLEYRIPA 446
V+++ EPY + E+ + + ++D L IPA
Sbjct: 525 EQVLKKAGTELLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQL-KNNEVILSGEIPA 583

Query: 447 RGLIGFQSEFLTLTRGTGLMSHIFDSYAPVKEGSVGERRN 486
R + ++S+ T G + Y V + R
Sbjct: 584 RCIQEYRSDLTFFTNGRSVCLTELKGYHVTTGEPVCQPRR 623


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
BMA10247_1001  RTXTOXIND  30     0.015    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 30.2 bits (68), Expect = 0.015
Identities = 8/86 (9%), Positives = 27/86 (31%), Gaps = 1/86 (1%)

Query: 48 EVPAPAAGVLAQVLQNDGDTVVADQVIATID-TEAKAGAAAAAAGAADVQSAAAPVAAPA 106
E+ ++ +++ +G++V V+ + A+A + + +
Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILS 157

Query: 107 PAAQPAAAASSTAATSPAASKLMAEK 132
+ + P + E+
Sbjct: 158 RSIELNKLPELKLPDEPYFQNVSEEE 183


56  BMA10247_1032  BMA10247_1051  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
BMA10247_1032  -2      12       -0.400709  major facilitator superfamily transporter
BMA10247_1033  -1      10       -0.030269  TetR family transcriptional regulator
BMA10247_1034  -3      9        -0.889396  hypothetical protein
BMA10247_1035  -3      8        0.042173   long-chain-fatty-acid--CoA ligase
BMA10247_1037  -2      10       1.381603   hypothetical protein
BMA10247_1038  -1      9        2.158046   hfq protein
BMA10247_1039  0       10       2.573150   hypothetical protein
BMA10247_1040  0       11       2.994288   sigma-54 interaction domain/Fis family
BMA10247_1041  0       11       4.154958   hypothetical protein
BMA10247_1042  0       13       4.185621   hypothetical protein
BMA10247_1043  1       12       4.231470   hypothetical protein
BMA10247_1044  1       11       3.592248   type II secretion system protein
BMA10247_1045  -2      11       2.802204   type II secretion system protein
BMA10247_1047  -3      9        1.178518   hypothetical protein
BMA10247_1048  -2      12       -0.417709  type II/III secretion system protein
BMA10247_1049  -2      16       -2.079728  pilus assembly protein CpaB
BMA10247_1050  1       14       -4.828185  TadE family protein
BMA10247_1051  1       13       -4.411409  peptidase A24A, prepilin type IV
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
BMA10247_1032  TCRTETA  60     4e-12    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 60.2 bits (146), Expect = 4e-12
Identities = 58/261 (22%), Positives = 103/261 (39%), Gaps = 12/261 (4%)

Query: 61 IGALIFGRLADHFGRRPTLMINIACYSLLELASGFAPSLAALLVLRTLFGVAMGGEWGVG 120
A + G L+D FGRRP L++++A ++ AP L L + R + G+ G V
Sbjct: 58 ACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGIT-GATGAVA 116

Query: 121 SALTMETVPPRARGAVSGLLQAGYPSGYLLASVVFGLLYPYIGWRGMFMIGVLPALLVLY 180
A + R G + A + G + V+ GL+ + F L L L
Sbjct: 117 GAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLT 176

Query: 181 VRAKVPES-PAWKQMEKRARPGLVATLKQNWKLSIYAVVLMTAF--NFFSHGTQDLYPTF 237
+PES ++ +R +A+ + +++ A ++ F L+ F
Sbjct: 177 GCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIF 236

Query: 238 LREQHHFDPHTVSWITIVLNI-GAIVGGLTFGWLSERIGRRRAI---FIAAMIALPVLPL 293
++ H+D T+ I ++ + G ++ R+G RRA+ IA +L
Sbjct: 237 GEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAF 296

Query: 294 ----WAFSTGALALAAGAFLM 310
W + LA+G M
Sbjct: 297 ATRGWMAFPIMVLLASGGIGM 317


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
BMA10247_1033  HTHTETR  67     3e-15    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 67.3 bits (164), Expect = 3e-15
Identities = 21/83 (25%), Positives = 35/83 (42%)

Query: 4 RQASRQSGGTKARILDAAEDLFIEHGFEAMSMRQITSRAAVNLAAVNYHFGSKEALIHAM 63
R+ +++ T+ ILD A LF + G + S+ +I A V A+ +HF K L +
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 64 LSRRLDQLNEERLRILDRFDAQL 86
+ E L +F
Sbjct: 63 WELSESNIGELELEYQAKFPGDP 85


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
BMA10247_1038  cloacin  29     0.018    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 28.9 bits (64), Expect = 0.018
Identities = 25/85 (29%), Positives = 26/85 (30%), Gaps = 7/85 (8%)

Query: 76 GRGPRAGGAHGGGGRPGGREGGGHGPYGSHG----GSREPRGDGGGYGARESRGDGGYGS 131
GRG G G GG G G G S G P G G G G GG G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWG---GGSGH 62

Query: 132 RESRGDGGYGSREPRGDGGYGSREP 156
G+G G G P
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAP 87


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
BMA10247_1039  IGASERPTASE  28     0.047    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 27.7 bits (61), Expect = 0.047
Identities = 19/108 (17%), Positives = 32/108 (29%), Gaps = 9/108 (8%)

Query: 113 LFQQKAFWRVIRTASEARAEAVYRDFAKQSETLAVNELQAAKLESQKALTDRQIAVA--- 169
++A V + + T K E K T++ V
Sbjct: 1067 EVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVT 1126

Query: 170 ------QERASRLQADLSIAREQRAAVATRQKDKLDETVALREQKSER 211
QE++ +Q ARE V ++ T A EQ ++
Sbjct: 1127 SQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKE 1174


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
BMA10247_1040  HTHFIS  298    1e-98    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 298 bits (765), Expect = 1e-98
Identities = 130/475 (27%), Positives = 205/475 (43%), Gaps = 53/475 (11%)

Query: 19 ADIVDRVARCMSSFDVEVIRADN-EELSAERTAMRPSLAIISVSMIE-SGAAFLRTWQAE 76
A I + + +S +V N L A L + V M + + L +
Sbjct: 13 AAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLPRIKKA 72

Query: 77 -IGMPVVWVGA--------------ARDHDPSLYPPEYSHILPLDFTCAELRGMISKLAV 121
+PV+ + A A D+ P P + + ++ + +++
Sbjct: 73 RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPK--PFDLTELIGIIGRA------LAEPKR 124

Query: 122 QLRAHAAKALEPSTLVAHSDCMQALLQEVDTFADCDTNVLLHGETGVGKERIAQLLHEKH 181
+ + + LV S MQ + + + D +++ GE+G GKE +A+ LH+ +
Sbjct: 125 RPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHD-Y 183

Query: 182 SRYGMGEFVPVNCGAIPDGLFESLFFGHAKGSFTGAVGTHKGYFEQAAGGTLFLDEVGDL 241
+ G FV +N AIP L ES FGH KG+FTGA G FEQA GGTLFLDE+GD+
Sbjct: 184 GKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDM 243

Query: 242 PLYQQVKLLRVLEDGAVLRIGATAPVKVDFRLVAASNKKLPQLVKDGLFRADLYYRLAVI 301
P+ Q +LLRVL+ G +G P++ D R+VAA+NK L Q + GLFR DLYYRL V+
Sbjct: 244 PMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVV 303

Query: 302 ELSIPSLEERGPVDKIALFKSFVASIVGEDRLAALPELPYWLAEAVADSYFPGNVRELRN 361
L +P L +R D L + FV E + E + +PGNVREL N
Sbjct: 304 PLRLPPLRDR-AEDIPDLVRHFVQQAEKEGL--DVKRFDQEALELMKAHPWPGNVRELEN 360

Query: 362 LAERVGV------------------------TVRQTGGWDTVRLQRLIAHARSAAQPAPA 397
L R+ + ++ + + + +
Sbjct: 361 LVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFG 420

Query: 398 ESAPDVFVDRSKWDMTERNRVIAALDANGWRRQDTAQHLGISRKVLWEKMRKYQI 452
++ P + E ++AAL A + A LG++R L +K+R+ +
Sbjct: 421 DALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_1043  PYOCINKILLER  31     0.009    Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 30.5 bits (68), Expect = 0.009
Identities = 28/86 (32%), Positives = 37/86 (43%), Gaps = 3/86 (3%)

Query: 214 LMNQLKLAPAVRAEIRNDATRIAAAARARQRA-LARPGAPGAAASAGATLAASAAGSNGG 272
MN L A A + R AAA A+++A AA A T A A GS
Sbjct: 203 RMNTLTAAKASIEAAAANKAREQAAAEAKRKAEEQARQQ--AAIRAANTYAMPANGSVVA 260

Query: 273 AAAGKGAVAGSGASAPGAAATATAAA 298
AAG+G + + +A A A + A A
Sbjct: 261 TAAGRGLIQVAQGAASLAQAISDAIA 286


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
BMA10247_1047  HTHFIS  38     5e-05    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 38.3 bits (89), Expect = 5e-05
Identities = 11/63 (17%), Positives = 26/63 (41%)

Query: 79 AALRVSHPGLPIVALGSLGEPESALAALRAGVRDFIDFSAPAEDALRITRGLLDHVGDQP 138
++ + P LP++ + + +A+ A G D++ + + I L +P
Sbjct: 67 PRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRRP 126

Query: 139 SRH 141
S+
Sbjct: 127 SKL 129


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_1048  BCTERIALGSPD  138    2e-37    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 138 bits (348), Expect = 2e-37
Identities = 58/249 (23%), Positives = 111/249 (44%), Gaps = 11/249 (4%)

Query: 151 VQVDVRVVEFSRSVLKQAGLNFFKQNNGFTFGSFAPAGLASVTGGG----TSSMSVSANI 206
V V+ + E + G+ + +N G T + + +++ G S+
Sbjct: 347 VLVEAIIAEVQDADGLNLGIQWANKNAGMTQFTNSGLPISTAIAGANQYNKDGTVSSSLA 406

Query: 207 PIASAFN-LVVGSATRGLFADLSILEANNLARVLAQPTLVALSGQSASFLAGGEIPVPVP 265
S+FN + G L+ L ++ +LA P++V L A+F G E+PV
Sbjct: 407 SALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTG 466

Query: 266 QSLGT-----ISIDWKPYGVGLTLTPTVLSPRRIALKVAPESSQLDFVHSITINGVTVPA 320
+ +++ K G+ L + P + + L++ E S + S + +
Sbjct: 467 SQTTSGDNIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSVADAAS-STSSDLGAT 525

Query: 321 LTTRRADTTVELGDGESFAIGGLIDRETTSNVDKVPFLGDLPIIGTFFKHLSYQQNDKEL 380
TR + V +G GE+ +GGL+D+ + DKVP LGD+P+IG F+ S + + + L
Sbjct: 526 FNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNL 585

Query: 381 VIIVTPHLV 389
++ + P ++
Sbjct: 586 MLFIRPTVI 594


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_1051  PREPILNPTASE  53     4e-11    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 53.3 bits (128), Expect = 4e-11
Identities = 31/124 (25%), Positives = 52/124 (41%), Gaps = 10/124 (8%)

Query: 4 LFSIGFFFAWAAAVAIADCRDRRIPNELVLVGLAAVIIFTVCRQNPFGTTLSGALIGGAV 63
+ A+ D +P++L L L ++F + F +L A+IG
Sbjct: 134 TLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNL--LGGF-VSLGDAVIGAMA 190

Query: 64 GLVSLFPFFAL-------RVMGAADVKVFAVLGAWCGLSALPRLWVVASVAAGVHALALM 116
G + L+ + MG D K+ A LGAW G ALP + +++S+ + L+
Sbjct: 191 GYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAFMGIGLI 250

Query: 117 LLTR 120
LL
Sbjct: 251 LLRN 254


57  BMA10247_1349  BMA10247_1356  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
BMA10247_1349  -1      9        1.243132   DNA repair protein RadA
BMA10247_1350  -1      8        2.696155   alanine racemase
BMA10247_1351  -1      8        2.376903   lysophospholipid transporter LplT
BMA10247_1352  0       9        4.927686   phosphomethylpyrimidine kinase
BMA10247_1353  -1      12       4.510732   hypothetical protein
BMA10247_1354  -2      8        4.518019   hypothetical protein
BMA10247_1355  -1      10       2.100892   uracil-DNA glycosylase
BMA10247_1356  0       10       -0.164501  ribosomal-protein-alanine acetyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_1349  TCRTETOQM     31     0.011    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 31.0 bits (70), Expect = 0.011
Identities = 16/79 (20%), Positives = 27/79 (34%), Gaps = 17/79 (21%)

Query: 104 LLQSLAQIASERPALYISGEESGAQIALRAQRLALLEGGASAADLKLLAEIQLEKIQATI 163
LL +L +I+ P L + + +I L L ++Q+E A +
Sbjct: 361 LLDALLEISDSDPLLRYYVDSATHEIILS-----------------FLGKVQMEVTCALL 403

Query: 164 DAERPDVAVIDSIQTIYSE 182
+ I IY E
Sbjct: 404 QEKYHVEIEIKEPTVIYME 422


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_1350  ALARACEMASE   438    e-156    Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 438 bits (1127), Expect = e-156
Identities = 207/353 (58%), Positives = 270/353 (76%)

Query: 1 MPRPISATIHTAALANNLSVVRRHAAQSKVWAIVKANAYGHGLARVFPGLRGTDGFGLLD 60
M RPI A++ AL NLS+VR+ A ++VW++VKANAYGHG+ R++ + TDGF LL+
Sbjct: 1 MTRPIQASLDLQALKQNLSIVRQAATHARVWSVVKANAYGHGIERIWSAIGATDGFALLN 60

Query: 61 LDEAVKLRELGWAGPILLLEGFFRSTDIDVIDRYSLTTAVHNDEQMRMLETARLSKPVNV 120
L+EA+ LRE GW GPIL+LEGFF + D+++ D++ LTT VH++ Q++ L+ ARL P+++
Sbjct: 61 LEEAITLRERGWKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKALQNARLKAPLDI 120

Query: 121 QLKMNSGMNRLGYTPEKYRAAWERARACPGIGQITLMTHFSDADGERGVAEQMATFERGA 180
LK+NSGMNRLG+ P++ W++ RA +G++TLM+HF++A+ G++ MA E+ A
Sbjct: 121 YLKVNSGMNRLGFQPDRVLTVWQQLRAMANVGEMTLMSHFAEAEHPDGISGAMARIEQAA 180

Query: 181 QGIAGARSFANSAAVLWHPSAHFDWVRPGIMLYGASPSGRAADIADRGLKPTMTLASELI 240
+G+ RS +NSAA LWHP AHFDWVRPGI+LYGASPSG+ DIA+ GL+P MTL+SE+I
Sbjct: 181 EGLECRRSLSNSAATLWHPEAHFDWVRPGIILYGASPSGQWRDIANTGLRPVMTLSSEII 240

Query: 241 AVQTLAKGQAVGYGSMFVAEDTMRIGVVACGYADGYPRIAPEGTPVVVDGVRTRIVGRVS 300
VQTL G+ VGYG + A D RIG+VA GYADGYPR AP GTPV+VDGVRT VG VS
Sbjct: 241 GVQTLKAGERVGYGGRYTARDEQRIGIVAAGYADGYPRHAPTGTPVLVDGVRTMTVGTVS 300

Query: 301 MDMLTVDLTPVPQAGVGARVELWGETLPIDDVAARCMTVGYELMCAVAPRVPV 353
MDML VDLTP PQAG+G VELWG+ + IDDVAA TVGYELMCA+A RVPV
Sbjct: 301 MDMLAVDLTPCPQAGIGTPVELWGKEIKIDDVAAAAGTVGYELMCALALRVPV 353


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_1351  TCRTETB       29     0.040    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 29.1 bits (65), Expect = 0.040
Identities = 31/139 (22%), Positives = 54/139 (38%), Gaps = 4/139 (2%)

Query: 29 FFSSLADSALLIAAIALLKDLHAPNWMIPLLKLFFVLSYVVLAAFVGAFADSRPKGHVMF 88
FFS L + L ++ + D + P + F+L++ + A G +D ++
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 89 ITNSIKVVGCLIMLFGAHP----LIAYGIVGFGAAAYSPAKYGILTELLPPERLVAANGW 144
I G +I G ++A I G GAAA+ ++ +P E A G
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 145 IEGTTVGSIILGTVLGGAL 163
I +G +GG +
Sbjct: 144 IGSIVAMGEGVGPAIGGMI 162


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_1356  SACTRNSFRASE  44     3e-08    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 44.2 bits (104), Expect = 3e-08
Identities = 21/71 (29%), Positives = 32/71 (45%)

Query: 79 VAPVAQRSGVGLALLREAVRIARAERLDGVLLEVRPSNPRAIRLYERFGFVSVGRRRNYY 138
VA ++ GVG ALL +A+ A+ G++LE + N A Y + F+ Y
Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGAVDTMLY 156

Query: 139 PAKHRSREDAI 149
+ E AI
Sbjct: 157 SNFPTANEIAI 167


58  BMA10247_1414  BMA10247_1427  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
BMA10247_1414  2       11       2.251428   hypothetical protein
BMA10247_1415  1       11       1.709253   peptide synthetase-domain-containing protein
BMA10247_1416  1       11       1.183492   methyltransferase/adenylsulfate kinase
BMA10247_1417  0       10       0.182092   hypothetical protein
BMA10247_1418  0       11       0.271038   multidrug efflux RND membrane fusion protein
BMA10247_1419  0       9        -1.034095  AcrB/AcrD/AcrF family protein
BMA10247_1422  -2      14       -1.511155  GDSL-like lipase/acylhydrolase domain/outer
BMA10247_1423  1       14       -0.327758  hypothetical protein
BMA10247_1424  1       15       -1.707324  hypothetical protein
BMA10247_1425  0       13       -0.826275  hypothetical protein
BMA10247_1427  -1      11       -1.412529  *aspartate kinase
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_1414  TCRTETA       39     2e-05    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 39.4 bits (92), Expect = 2e-05
Identities = 57/271 (21%), Positives = 97/271 (35%), Gaps = 13/271 (4%)

Query: 74 AFTLPIALFALLSGVAADAWDRRTVMLLSQALMFSVALCLVALAAAGAMTPARLLVCMFV 133
+ L A + G +D + RR V+L+S +V ++A A + L + V
Sbjct: 51 LYALMQFACAPVLGALSDRFGRRPVLLVS-LAGAAVDYAIMATAPFLWV----LYIGRIV 105

Query: 134 GGCAGAMFQPAWQSAVTEQVPARELSAAIALDSFSMNFARTAGPALGGFIVASVSPNAAF 193
G GA + + + E + S F AGP LGG + SP+A F
Sbjct: 106 AGITGATG-AVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLM-GGFSPHAPF 163

Query: 194 V---LSGLSYAGLIYVLSRSIRGAAARPPVRERLATMLVQGVRYCGRARGIRGTLIRSSL 250
L RP R A + R+ + + +
Sbjct: 164 FAAAALNGLNFLTGCFLLPESHKGERRP--LRREALNPLASFRWARGMTVVAALMAVFFI 221

Query: 251 FGFLGSPVWALLPLFAKTQFGGEARTYGVLLASFGA-GAASGALGGAAGRARLGREALVR 309
+G AL +F + +F +A T G+ LA+FG + + A+ ARLG +
Sbjct: 222 MQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALM 281

Query: 310 LCTLTFAAGMLATAWSPCQAVAMLGLAVAGG 340
L + G + A++ +A + +
Sbjct: 282 LGMIADGTGYILLAFATRGWMAFPIMVLLAS 312



Score = 35.2 bits (81), Expect = 4e-04
Identities = 31/167 (18%), Positives = 58/167 (34%), Gaps = 8/167 (4%)

Query: 21 LAALRGPFAYRTFAAIWVAS-LVGNIGGSIQTVAASWLMTSMAPSPTMVSLVQTAFTLPI 79
LA+ R AA+ ++ +G + + T + + AF +
Sbjct: 200 LASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILH 259

Query: 80 ALF-ALLSGVAADAWDRRTVMLLSQALMFSVALCLVALAAAGAMTPARLLVCMFVGGCAG 138
+L A+++G A R ++L M + + LA A A ++ + G
Sbjct: 260 SLAQAMITGPVAARLGERRALMLG---MIADGTGYILLAFATRGWMAFPIMVLLASG--- 313

Query: 139 AMFQPAWQSAVTEQVPARELSAAIALDSFSMNFARTAGPALGGFIVA 185
+ PA Q+ ++ QV + + GP L I A
Sbjct: 314 GIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYA 360


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_1418  RTXTOXIND     42     4e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 41.7 bits (98), Expect = 4e-06
Identities = 17/126 (13%), Positives = 41/126 (32%), Gaps = 21/126 (16%)

Query: 87 TVRSQVDGQITHVRFHEGQQVRAGDVLVEIDRRALQATADQATAKLEQDKATLANARLEL 146
++ + + + EG+ VR GDVL+++ +A + + L Q + ++
Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILS 157

Query: 147 ----------------ARHQRLAEMNAAPVQML-----DTWKARVNELHAQIRGDQAAVQ 185
Q ++E + L TW+ + + + +A
Sbjct: 158 RSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERL 217

Query: 186 NARVAV 191
+
Sbjct: 218 TVLARI 223



Score = 29.0 bits (65), Expect = 0.035
Identities = 14/94 (14%), Positives = 39/94 (41%), Gaps = 11/94 (11%)

Query: 113 LVEIDRRALQATADQAT--AKLEQDKATLANARLELARHQRLAEMNAAPVQMLDTWKARV 170
++E + + ++A + ++LEQ ++ + +A+ E +L + + ++
Sbjct: 254 VLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFK---------NEILDKL 304

Query: 171 NELHAQIRGDQAAVQNARVAVDYTTIRAPISGRI 204
+ I + + IRAP+S ++
Sbjct: 305 RQTTDNIGLLTLELAKNEERQQASVIRAPVSVKV 338


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_1419  ACRIFLAVINRP  755    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 755 bits (1950), Expect = 0.0
Identities = 272/1033 (26%), Positives = 495/1033 (47%), Gaps = 26/1033 (2%)

Query: 9 FIRYPVATCLMTAGILFAGVAAYFHLPVAPLPQVEFPTIQVSAVLPGADPVSVASTLAQP 68
FIR P+ ++ ++ AG A LPVA P + P + VSA PGAD +V T+ Q
Sbjct: 5 FIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQV 64

Query: 69 LETQFSKIPYVTQMTSQSTLS-STSIVLQFSLERSIDAAANDVQSAIDAAAAQLPADLPS 127
+E + I + M+S S + S +I L F D A VQ+ + A LP ++
Sbjct: 65 IEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQQ 124

Query: 128 PPTFQKVNPADSPIMLLSAISSTLPLTTID--DYVETRLTKSLSQIDGVGSVSIGGQQKP 185
+ S +M+ +S T D DYV + + +LS+++GVG V + G Q
Sbjct: 125 QGIS-VEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY- 182

Query: 186 SIRIQLDPVKLASRGLSSEDVRRALSGLSGVNPKGVFNGT------TRSYTIYTNGQLTE 239
++RI LD L L+ DV L + G GT + +I +
Sbjct: 183 AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKN 242

Query: 240 PAQWNDAIV-AYRDGTPVRIRDIGQAVLGPEDNTLAAWIDGRRAISVGIYKKPGANTVST 298
P ++ + DG+ VR++D+ + LG E+ + A I+G+ A +GI GAN + T
Sbjct: 243 PEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALDT 302

Query: 299 VDKIRARLPELEASLPPSLKIAVLADRTQTIRASLLDIELTLLLNVVLVVVVIYAFLGSV 358
I+A+L EL+ P +K+ D T ++ S+ ++ TL ++LV +V+Y FL ++
Sbjct: 303 AKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNM 362

Query: 359 RTTIIPAVTVPVSLFGACALMWVCGYSLDNISLMAMTIAVGFVVDDAIVMVENIARH-VE 417
R T+IP + VPV L G A++ GYS++ +++ M +A+G +VDDAIV+VEN+ R +E
Sbjct: 363 RATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMME 422

Query: 418 AGELPLQAALKGLSETSFTIASISLSLVAVLLPLLLMSGIIGRMFREFAVTLSMTIIVSA 477
P +A K +S+ + I++ L AV +P+ G G ++R+F++T+ + +S
Sbjct: 423 DKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSV 482

Query: 478 FVSLTLTPMMASYLLRAHRHDAGRPPRP--GLFERAFARTAAAYERALDVALRHRFVTLC 535
V+L LTP + + LL+ + G F F + Y ++ L L
Sbjct: 483 LVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLL 542

Query: 536 AFLASVAASVFLYVGIPKGFFPQQDTGVITGISEAAQTISVEDMARHSMALAAIIRADPA 595
+ VA V L++ +P F P++D GV + + + E + + +
Sbjct: 543 IYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNEK 602

Query: 596 --VEHCQMAVGGSAYAGTTVNNGRWYITLKPRDQRDA---TADEVIRRLRPQFAKVPGVR 650
VE V G +++G N G +++LKP ++R+ +A+ VI R + + K+
Sbjct: 603 ANVES-VFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGF 661

Query: 651 MYLQAAQDVIIGARLARTQYQLTLQSA-DVGALTTWAPRLLARLSGLP-QLRDVASDQQV 708
+ ++ ++L Q+ ALT +LL + P L V +
Sbjct: 662 VIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGLE 721

Query: 709 NGSALSVAIDRDQAARYGLTPEAIDGTLYDAFGSRQVAQYFTQLSTYKVIMETLPSLQRD 768
+ + + +D+++A G++ I+ T+ A G V + + K+ ++ +
Sbjct: 722 DTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRML 781

Query: 769 PGTLDRIYMKAPSGALVPLSSVARWTTDTVQPLSVNHQSHFPSVTISFNLAPGVSLGEAT 828
P +D++Y+++ +G +VP S+ + + + PS+ I APG S G+A
Sbjct: 782 PEDVDKLYVRSANGEMVPFSAFTT-SHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840

Query: 829 AAIEAARASLRMPPAVVGSFQGTAQAFQSTLATMPMLILSALIVAYLVLGALHGSFIHPW 888
A +E + ++P + + G + + + P L+ + +V +L L AL+ S+ P
Sbjct: 841 ALME--NLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPV 898

Query: 889 TILSTLPSAGVGAIATLWLFKYDFNLIALIGVILLIGIVKKNGIMMVDFAIAATRERNMT 948
+++ +P VG + LF ++ ++G++ IG+ KN I++V+FA +
Sbjct: 899 SVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKG 958

Query: 949 SLDAIRSACLLRLRPIMMTTMTALFGALPLMFTPGMGSELRQPLGYAMVGGLLVSQVLTL 1008
++A A +RLRPI+MT++ + G LPL + G GS + +G ++GG++ + +L +
Sbjct: 959 VVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAI 1018

Query: 1009 FTTPVIYLYLDTL 1021
F PV ++ +
Sbjct: 1019 FFVPVFFVVIRRC 1031



Score = 90.3 bits (224), Expect = 2e-20
Identities = 78/509 (15%), Positives = 163/509 (32%), Gaps = 37/509 (7%)

Query: 4 NLFAVFIRYPVATCLMTAGILFAGVAAYFHLPVAPLPQVEFPTIQVSAVLP-GADPVSVA 62
N + L+ A I+ V + LP + LP+ + LP GA
Sbjct: 528 NSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQ 587

Query: 63 STLAQPLETQF---SKIPYVTQMTSQSTLSSTS-------IVLQFSLERSIDAAANDVQS 112
L Q + + + S + + L+ ER + N ++
Sbjct: 588 KVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEER--NGDENSAEA 645

Query: 113 AIDAAAAQLPADLPSPPTFQKVNPADSPIMLLSAIS---------STLPLTTIDDYVETR 163
I A + L + I+ L + + L +
Sbjct: 646 VIHRAKME----LGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQL 701

Query: 164 LTKSLSQIDGVGSVSIGGQQ-KPSIRIQLDPVKLASRGLSSEDVRRALSGLSGVNPKGVF 222
L + + SV G + ++++D K + G+S D+ + +S G F
Sbjct: 702 LGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDF 761

Query: 223 NGTTRSYTIYTNGQ---LTEPAQWNDAIVAYRDGTPVRIRDIGQAVLGPEDNTLAAWIDG 279
R +Y P + V +G V + L +G
Sbjct: 762 IDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLER-YNG 820

Query: 280 RRAISVGIYKKPGANTVSTVDKIRARLPELEASLPPSLKIAVLADRTQTIRASLLDIELT 339
++ + PG ++ A + L + LP + + R S
Sbjct: 821 LPSMEIQGEAAPG----TSSGDAMALMENLASKLPAGIGYDW-TGMSYQERLSGNQAPAL 875

Query: 340 LLLNVVLVVVVIYAFLGSVRTTIIPAVTVPVSLFGACALMWVCGYSLDNISLMAMTIAVG 399
+ ++ V+V + + A S + + VP+ + G + D ++ + +G
Sbjct: 876 VAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIG 935

Query: 400 FVVDDAIVMVENI-ARHVEAGELPLQAALKGLSETSFTIASISLSLVAVLLPLLLMSGII 458
+AI++VE + G+ ++A L + I SL+ + +LPL + +G
Sbjct: 936 LSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAG 995

Query: 459 GRMFREFAVTLSMTIIVSAFVSLTLTPMM 487
+ + ++ + +++ P+
Sbjct: 996 SGAQNAVGIGVMGGMVSATLLAIFFVPVF 1024


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_1422  IGASERPTASE   31     0.015    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 31.2 bits (70), Expect = 0.015
Identities = 34/191 (17%), Positives = 52/191 (27%), Gaps = 22/191 (11%)

Query: 335 SDRLSLFADVGYTRNFHG--AAGGMNAFDSDVEMFSIGADYKLSEASRAGALLSSGNANG 392
S+ + L Y RN + A N + + Y G L G
Sbjct: 1331 SNNVQLGGVFTYVRNSNNFDKATSKNTL----AQVNFYSKYYADNHWYLGIDLGYGKFQS 1386

Query: 393 SLAGGQGR-IGLHAYRLGVY--HAFERAGLFVRAYAGAGWSR-----YRL--DRAAVLPG 442
L H + G+ AF + G +S + L R V P
Sbjct: 1387 KLQTNHNAKFARHTAQFGLTAGKAFNLGNFGITPIVGVRYSYLSNADFALDQARIKVNPI 1446

Query: 443 AVRASTSGFDFGALVKAGYLFALGGVRLGPVADVGYTQLVARGYTEDGDSILAQNVGVQR 502
+V+ + + D Y + LG + P+ Y G A NV Q+
Sbjct: 1447 SVKTAFAQVDLS------YTYHLGEFSVTPILSARYDANQGSGKINVNGYDFAYNVENQQ 1500

Query: 503 LKGVSAGAGVR 513

Sbjct: 1501 QYNAGLKLKYH 1511



Score = 30.4 bits (68), Expect = 0.024
Identities = 29/184 (15%), Positives = 57/184 (30%), Gaps = 34/184 (18%)

Query: 342 ADVGYTRNFHGAAGGMNAFDSDVEMFSIGADYKLSEASRAGALLSSGNANGSLAGGQGRI 401
++ +N+ + F S +G D +S + G + + + + +
Sbjct: 1299 SNTSMNKNYSSSQ--YRRFSSKSTQTQLGWDQTISNNVQLGGVFTYVRNSNNFDKATSK- 1355

Query: 402 GLHAYRLGVYHAFERAGLFVRAYA----------GAGWSRYRLDRAAVLPGAVRASTSGF 451
+ + + + YA G G + +L A + G
Sbjct: 1356 ----------NTLAQVNFYSKYYADNHWYLGIDLGYGKFQSKLQTNHNAKFARHTAQFG- 1404

Query: 452 DFGALVKAGYLFALGGVRLGPVADVGYTQLVARGYTEDGDSILAQNVGVQRLKGVSAGAG 511
+ AG F LG + P+ V Y+ L + D I + V+ A A
Sbjct: 1405 -----LTAGKAFNLGNFGITPIVGVRYSYLSNADFALDQARIKVNPISVKT-----AFAQ 1454

Query: 512 VRFA 515
V +
Sbjct: 1455 VDLS 1458


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_1427  CARBMTKINASE  36     2e-04    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 36.0 bits (83), Expect = 2e-04
Identities = 33/119 (27%), Positives = 56/119 (47%), Gaps = 15/119 (12%)

Query: 116 IDDERVRRDLDAGKVVIITGFQGV---DPDGHITTL-GRGGSDTSAVAVAAALEADECLI 171
++ E +++ ++ G +VI +G GV DG I + D + +A + AD +I
Sbjct: 174 VEAETIKKLVERGVIVIASGGGGVPVILEDGEIKGVEAVIDKDLAGEKLAEEVNADIFMI 233

Query: 172 YTDVDGVYTTDPRVVEEARRLDSVTFEEMLEMA--------SLGSKVLQ-IRSVEFAGK 221
TDV+G E+ + L V EE+ + S+G KVL IR +E+ G+
Sbjct: 234 LTDVNGAALYYGT--EKEQWLREVKVEELRKYYEEGHFKAGSMGPKVLAAIRFIEWGGE 290


59  BMA10247_1498  BMA10247_1505  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
BMA10247_1498  -1      11       -2.352466  D-alanyl-D-alanine endopeptidase
BMA10247_1499  0       10       -2.386107  ISBma2, transposase
BMA10247_1500  0       12       -2.119470  pyruvate dehydrogenase complex E3 component,
BMA10247_1501  0       11       -1.730629  dihydrolipoamide acetyltransferase
BMA10247_1502  0       11       -1.672424  pyruvate dehydrogenase subunit E1
BMA10247_1503  1       10       -0.700796  hypothetical protein
BMA10247_1504  0       9        -0.550909  sensory box histidine kinase
BMA10247_1505  0       12       -0.065561  LuxR family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_1498  SSBTLNINHBTR  28     0.027    Streptomyces subtilisin inhibitor signature.
		>SSBTLNINHBTR#Streptomyces subtilisin inhibitor signature.

Length = 144

Score = 28.3 bits (62), Expect = 0.027
Identities = 15/50 (30%), Positives = 23/50 (46%)

Query: 15 VATAAVAPADAFAATAKTAQSAKGKKSAAKKSLRAASSSAEPRAKGARKR 64
+A+ A APA +A +A G+ +A LRA + + P A G
Sbjct: 27 LASPATAPASLYAPSALVLTVGHGESAATAAPLRAVTLTCAPTASGTHPA 76


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_1501  RTXTOXIND     36     5e-04    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 35.6 bits (82), Expect = 5e-04
Identities = 12/58 (20%), Positives = 22/58 (37%)

Query: 49 VPSPSAGTVKEVKVKVGDAVSQGSLIVLLDGAQAAAQPAQANGAATSAAQPAAAPAAA 106
+ VKE+ VK G++V +G +++ L A A + + A
Sbjct: 99 IKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQIL 156



Score = 31.0 bits (70), Expect = 0.013
Identities = 12/37 (32%), Positives = 21/37 (56%)

Query: 162 VPSPAAGVVKDIKVKVGDAVSEGSLIVVLEASGGAAA 198
+ +VK+I VK G++V +G +++ L A G A
Sbjct: 99 IKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEAD 135


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_1504  PF06580       32     0.012    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 31.8 bits (72), Expect = 0.012
Identities = 18/85 (21%), Positives = 31/85 (36%), Gaps = 18/85 (21%)

Query: 700 PVLIEQVLV-NLMKNAAEAMQEARPQAENGVIRVVADLEAGFVDIRVIDQGPGVDEATAE 758
P ++ Q LV N +K+ + G I + + G V + V + G + T E
Sbjct: 256 PPMLVQTLVENGIKHGIA------QLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE 309

Query: 759 RLFEPFYSTKSDGMGMGLNICRSII 783
S G G+ N+ +
Sbjct: 310 ----------STGTGL-QNVRERLQ 323


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_1505  HTHFIS        113    2e-31    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 113 bits (283), Expect = 2e-31
Identities = 39/153 (25%), Positives = 67/153 (43%), Gaps = 4/153 (2%)

Query: 11 TVFVVDDDEAVRDSLRWLLEANGYRVQCFSSAEQFLDAYQPAQQAGQIACLILDVRMSGM 70
T+ V DDD A+R L L GY V+ S+A AG ++ DV M
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIA----AGDGDLVVTDVVMPDE 60

Query: 71 SGLELQERLIAENAALPIIFVTGHGDVPMAVSTMKKGAMDFIEKPFDEAELRKLVERMLE 130
+ +L R+ LP++ ++ A+ +KGA D++ KPFD EL ++ R L
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 131 KARNESKSVQEQRAASERLSKLTAREQQVLERI 163
+ + +++ L +A Q++ +
Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVL 153


60  BMA10247_1845  BMA10247_1852  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
BMA10247_1845  5       43       -9.695549  NAD-dependent epimerase/dehydratase
BMA10247_1846  5       39       -9.389070  O-antigen acetylase
BMA10247_1847  3       30       -7.661889  lipopolysaccharide ABC transporter ATP-binding
BMA10247_1848  1       24       -5.575041  polysaccharide ABC transporter permease
BMA10247_1849  -1      17       -2.991104  dTDP-4-dehydrorhamnose reductase
BMA10247_1850  -3      10       -1.174903  dTDP-4-dehydrorhamnose 3,5-epimerase
BMA10247_1851  -2      10       -0.325691  glucose-1-phosphate thymidylyltransferase
BMA10247_1852  -3      9        0.947118   dTDP-glucose 4,6-dehydratase
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_1845  NUCEPIMERASE  168    2e-51    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 168 bits (426), Expect = 2e-51
Identities = 83/363 (22%), Positives = 136/363 (37%), Gaps = 58/363 (15%)

Query: 6 KILVTGGAGFIGCAISERLAARASRYVVMDNLHPQIHASAVRPGALHEKAE----LVVAD 61
K LVTG AGFIG +S+RL + V +DNL+ + +++ L A+ D
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDY-YDVSLKQARLELLAQPGFQFHKID 60

Query: 62 VTDAGAWDALLSDFQPEIIIHLAAETGTGQSLTEASRHALVNVVGTTRLTDALVKHGIVV 121
+ D L + E + SL +A N+ G + + + +
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNK--I 118

Query: 122 EHILLTSSRAVYGEGAWQKDDGTIVYPGQRGRAQLEAAQWDFPGMTMLPSRADRTEPRPT 181
+H+L SS +VYG +P D + P
Sbjct: 119 QHLLYASSSSVYGLN------------------------------RKMPFSTDDSVDHPV 148

Query: 182 SVYGATKLAQEHVLRAWSLATKTPLSILRLQNVYGPGQSLTNSYTGIVALFSRLAREKKV 241
S+Y ATK A E + +S P + LR VYGP + F++ E K
Sbjct: 149 SLYAATKKANELMAHTYSHLYGLPATGLRFFTVYGPWGRPDMALF----KFTKAMLEGKS 204

Query: 242 IPLYEDGNVTRDFVSIDDVADAIVATLVRTPEA-----------------LSLFDIGSGQ 284
I +Y G + RDF IDD+A+AI+ P A +++IG+
Sbjct: 205 IDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSS 264

Query: 285 ATSILDMARIIAAHYGAPEPQINGAFRDGDVRHAACDLSESLANLGWKPQWSLKRGIGEL 344
++D + + G + + GDV + D +G+ P+ ++K G+
Sbjct: 265 PVELMDYIQALEDALGIEAKKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNF 324

Query: 345 QTW 347
W
Sbjct: 325 VNW 327


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_1848  ABC2TRNSPORT  30     0.007    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 30.3 bits (68), Expect = 0.007
Identities = 16/59 (27%), Positives = 24/59 (40%)

Query: 195 LFTMVLMFLSPVFYPASALPEKYRFWLELNPLTLFIEQSRGILLEGRVPDFHPLGLAFL 253
L ++FLS +P LP ++ PL+ I+ R I+L V D A
Sbjct: 184 LVITPILFLSGAVFPVDQLPIVFQTAARFLPLSHSIDLIRPIMLGHPVVDVCQHVGALC 242


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_1849  NUCEPIMERASE  58     7e-12    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 58.3 bits (141), Expect = 7e-12
Identities = 32/160 (20%), Positives = 56/160 (35%), Gaps = 27/160 (16%)

Query: 1 MKILVTGANGQVGWELARSLAVLGQVV-----------PLTRE--------------QAD 35
MK LVTGA G +G+ +++ L G V ++ + D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 36 LGRPETLARIVEDAKPDVVVNAAAYTAVDAAETDGAAANVINGEA-VGVLAAATKRVGGL 94
L E + + + V + AV + + A N + +L
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 95 FVHYSTDYVFDGTKPSPYIETDPT-CPVNAYGASKLLGEL 133
++ S+ V+ + P+ D PV+ Y A+K EL
Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANEL 160


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_1852  NUCEPIMERASE  174    7e-54    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 174 bits (443), Expect = 7e-54
Identities = 90/350 (25%), Positives = 136/350 (38%), Gaps = 45/350 (12%)

Query: 2 ILVTGGAGFIGANFVLDWLAQSDEAVLNVDKLT--YAGNLGTLK-SLQGNPKHVFARVDI 58
LVTG AGFIG + L + V+ +D L Y +L + L P F ++D+
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQ-VVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 59 CDRAAIDALLAQHKPRAIVHFAAESHVDRSIHGPADFVQTNVVGTFTLLEAARQYWSALG 118
DR + L A + V S+ P + +N+ G +LE R
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHN----- 116

Query: 119 TDAKAAFRFLHVSTDEVFGSLSPADPQFSETTPYA-PNSPYSATKAGSDHLVRAYHHTYG 177
L+ S+ V+G L+ P FS P S Y+ATK ++ + Y H YG
Sbjct: 117 ----KIQHLLYASSSSVYG-LNRKMP-FSTDDSVDHPVSLYAATKKANELMAHTYSHLYG 170

Query: 178 LPVLTTNCSNNYGPYQFPEKLIPLMIANALGGKPLPVYGDGQNVRDWLYVGDHCSAIREV 237
LP YGP+ P+ + L GK + VY G+ RD+ Y+ D AI +
Sbjct: 171 LPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRL 230

Query: 238 L------------------ARGVPGETYNVGGWNEKKNLDVVHTLCDLLD-EARPKAGGS 278
A P YN+G + + +D + L D L EA+
Sbjct: 231 QDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKN---- 286

Query: 279 YRDQITYVTDRPGHDRRYAIDARKLERELGWKPAETFETGLAKTVRWYLD 328
+ +PG + D + L +G+ P T + G+ V WY D
Sbjct: 287 ------MLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRD 330


61  BMA10247_1915  BMA10247_1922  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
BMA10247_1915  -2      10       1.753035   D-lactate dehydrogenase
BMA10247_1917  -2      11       1.858040   major facilitator family transporter
BMA10247_1919  -2      12       1.894463   major facilitator family transporter
BMA10247_1920  -2      12       1.349423   fumarylacetoacetase
BMA10247_1921  -3      11       0.639594   homogentisate 1,2-dioxygenase
BMA10247_1922  -2      12       0.847621   4-hydroxybenzoate transporter
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_1915  SECA          34     0.001    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 33.7 bits (77), Expect = 0.001
Identities = 27/87 (31%), Positives = 42/87 (48%), Gaps = 9/87 (10%)

Query: 116 LNRRLPRAVARTREGDFSLNGLLGFDLFGKTVGVIGTGLI--GSVFARIMTGFGMRVLAH 173
L +P A A RE + G+ FD V ++G G++ A + TG G + L
Sbjct: 60 LENLIPEAFAVVREASKRVFGMRHFD-----VQLLG-GMVLNERCIAEMRTGEG-KTLTA 112

Query: 174 SLPPHDDALIALGVRYVPLDALLAEAD 200
+LP + +AL GV V ++ LA+ D
Sbjct: 113 TLPAYLNALTGKGVHVVTVNDYLAQRD 139


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_1917  TCRTETA       49     1e-08    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 49.4 bits (118), Expect = 1e-08
Identities = 94/399 (23%), Positives = 148/399 (37%), Gaps = 37/399 (9%)

Query: 5 LFALAVAAFGIGTTEFVIMGLLPNVARDLGVSIPAA---GMLVSGYALGVTIGAPILAVV 61
L +A+ A GIG +IM +LP + RDL S G+L++ YAL AP+L +
Sbjct: 11 LSTVALDAVGIG----LIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66

Query: 62 TAKMPRKATLLALIGVFIVGNLFCAIAPGYATLMVARVVTAFCHGAFFGIGSVVASNLVA 121
+ + R+ LL + V A AP L + R+V G+ +A ++
Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIA-DITD 125

Query: 122 PNKRAQAIALMFTGLTLANVLGVPLGTALGQALGWRATFWAVTGIGALAAAALAFCVPKR 181
++RA+ M V G LG +G A F+A + L F +P+
Sbjct: 126 GDERARHFGFMSACFGFGMVAGPVLGGLMGG-FSPHAPFFAAAALNGLNFLTGCFLLPES 184

Query: 182 LEMPAAGIAREFGVLRNPQVLMVLGISVLASASLFTVFTYIAPI-----------LEDVT 230
+ + RE NP + A+L VF + + ED
Sbjct: 185 HKGERRPLRRE---ALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRF 241

Query: 231 GFTPHDVTLVLLLFG-LGLTVGGTVGGKLADW---RRMPSLVATLASIGVVLAAFAGTMR 286
+ + + L FG L + G +A RR L G +L AFA
Sbjct: 242 HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGW 301

Query: 287 TPLPALVTIFVWGVLAFAIVPPLQILIVDRAS-HAPNLASTLNQGAFNLGNALGAWLGGT 345
P +V + G+ +P LQ ++ + +L + +G L
Sbjct: 302 MAFPIMVLLASGGI----GMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTA 357

Query: 346 AIHAGVPLAK-LPW-AGAAL---AMAALALTLWSASLER 379
A + W AGAAL + AL LWS + +R
Sbjct: 358 IYAASITTWNGWAWIAGAALYLLCLPALRRGLWSGAGQR 396


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_1919  TCRTETA       47     9e-08    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 46.7 bits (111), Expect = 9e-08
Identities = 95/395 (24%), Positives = 152/395 (38%), Gaps = 25/395 (6%)

Query: 12 LILSVAVVGLGTGATLPLTALALTEAGHGTRIV---GILTAAQAGGGLAVVPFVTAITKR 68
++ +VA+ +G G +P+ L + H + GIL A A A P + A++ R
Sbjct: 10 ILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDR 69

Query: 69 LGARQVIVASVVVLAAATALMQFTSNLVVWGVLRVVCGAALMLLFTIGEAWVNQLADDAT 128
G R V++ S+ A A+M L V + R+V G G A++ + D
Sbjct: 70 FGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAG-AYIADITDGDE 128

Query: 129 RGRVVAIYATNFTLFQMAGPVLVSQIAGMT-HVRFALSGALFLLAL--------PSLASI 179
R R + F +AGPVL + G + H F + AL L S
Sbjct: 129 RARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGE 188

Query: 180 RKTPIADEPHHDAHDRWTRVIPKMPALVVGTAFFALFDTLALSLLPIFAMAR--GVASEA 237
R+ + + A RW R + + AL+ L + +L IF R A+
Sbjct: 189 RRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTI 248

Query: 238 AVLFAAILLFGDTAMQFPIGWLADKLGRERVHLGAGCVVLALLPLLPAVVTTPWLCWPLL 297
+ AA + A G +A +LG ER L G + +L A T W+ +P++
Sbjct: 249 GISLAAFGILHSLAQAMITGPVAARLG-ERRALMLGMIADGTGYILLAFATRGWMAFPIM 307

Query: 298 FVLGAAAGSVYTL----SLVACGERFRGSALVTASSLVSASWSAASFGGPLVAGALMEQF 353
+L + + L S ER +G + ++L S S GPL+ A+
Sbjct: 308 VLLASGGIGMPALQAMLSRQVDEER-QGQLQGSLAALT----SLTSIVGPLLFTAIYAAS 362

Query: 354 GGDALIGVLIVSAIAFVGAALWERRALPMQAARRG 388
I A ++ RR L A +R
Sbjct: 363 ITTWNGWAWIAGAALYLLCLPALRRGLWSGAGQRA 397


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_1922  TCRTETA       44     9e-07    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 43.7 bits (103), Expect = 9e-07
Identities = 78/398 (19%), Positives = 125/398 (31%), Gaps = 59/398 (14%)

Query: 50 VAPSVIAEWGVKKQA---LGPVFSASLFGMLLGALGLSVLADRIGRRPVLIGATLFFALA 106
V P ++ + G + + A L L+DR GRRPVL+ + A+
Sbjct: 27 VLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVD 86

Query: 107 MLATPFATSIPILIALRFVTGLGLGCIMPNAMALVGECSPGAHRVKRM----MIVSCGFT 162
A + +L R V G+ G A A + + + G R + G
Sbjct: 87 YAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAYIADITDGDERARHFGFMSACFGFGMV 145

Query: 163 LGAALGGFVSAALIPAFGWRAVFFVGGAVPLALAAAMAASLPESPQLLVLRGRHDAARAW 222
G LGG + F A FF A+ LPES H R
Sbjct: 146 AGPVLGGLMGG-----FSPHAPFFAAAALNGLNFLTGCFLLPES---------HKGERRP 191

Query: 223 LAKFAPRLAVPPDTRLVVREAGPRGAPVAELFRSGRARVTLLLWAINF-MNLIDLYFLSN 281
L + A P+A + V L A+ F M L+ +
Sbjct: 192 LRREALN-------------------PLASFRWARGMTVVAALMAVFFIMQLVGQVPAAL 232

Query: 282 WLPTVMRDAGYASGTAVIVGTVLQTGGVIGTLS----LGWFIERHGFARVLFACFACATI 337
W+ + A +G L G++ +L+ G R G R L
Sbjct: 233 WVIFGEDRFHW---DATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGT 289

Query: 338 AIGLIGSVAHAFVWLLAAVFVGGFCVVGGQPAVNALAGHYYPTSLRSTGIGWSLGVGRVG 397
L+ ++ V + + G PA+ A+ + G + +
Sbjct: 290 GYILLAFATRGWMAFPIMVLLASGGI--GMPALQAMLSRQVDEERQGQLQGSLAALTSLT 347

Query: 398 SVLGPLVGGQLIA--------LGWSNDALFHAAAVPVL 427
S++GPL+ + A W A + +P L
Sbjct: 348 SIVGPLLFTAIYAASITTWNGWAWIAGAALYLLCLPAL 385


62  BMA10247_2368  BMA10247_2377  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
BMA10247_2368  -3      9        1.402336   HpcH/HpaI aldolase/citrate lyase
BMA10247_2369  -3      8        0.268809   hypothetical protein
BMA10247_2370  -1      10       -0.404444  rod shape-determining protein RodA
BMA10247_2371  -2      12       0.512340   penicillin-binding protein 2
BMA10247_2372  0       11       -0.326449  rod shape-determining protein MreD
BMA10247_2373  -1      12       -0.638851  rod shape-determining protein MreC
BMA10247_2374  0       11       -1.970742  rod shape-determining protein MreB
BMA10247_2375  0       11       -2.344220  aspartyl/glutamyl-tRNA amidotransferase subunit
BMA10247_2376  -1      11       -2.207253  aspartyl/glutamyl-tRNA amidotransferase subunit
BMA10247_2377  -1      12       -1.902055  aspartyl/glutamyl-tRNA amidotransferase subunit
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_2368  PHPHTRNFRASE  44     3e-07    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase

signature.
Length = 572

Score = 44.0 bits (104), Expect = 3e-07
Identities = 36/178 (20%), Positives = 60/178 (33%), Gaps = 34/178 (19%)

Query: 87 RALDAGARTLMFPGVETADEAAHAVRLTRFQAPDAPDGLRGVAGIVRAAAYGMRRDYVQT 146
RA G +MFP + T +E LR I++ + + V
Sbjct: 380 RASTYGNLKVMFPMIATLEE------------------LRQAKAIMQEEKDKLLSEGVDV 421

Query: 147 ANAQIATIVQIESARGVDEAERIAATPGVDCVFVGPADL----------SASLGHLGDTK 196
++ I + +E A A VD +G DL + + +L
Sbjct: 422 SD-SIEVGIMVEIPSTAVAANLFA--KEVDFFSIGTNDLIQYTMAADRMNERVSYLYQPY 478

Query: 197 HPDVAAALEHVLAAGRRAGVPVGI---FAADTAGARQSLEAGFRVVALSADVVWLLRA 251
HP + ++ V+ A G VG+ A D L G ++SA + R+
Sbjct: 479 HPAILRLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLDEFSMSATSILPARS 536


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_2371  cloacin       31     0.018    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 31.2 bits (70), Expect = 0.018
Identities = 22/65 (33%), Positives = 29/65 (44%), Gaps = 1/65 (1%)

Query: 680 SGADGASGASGAGGEPTEHANAGGNPAGGGIAGGAAGTANNGSGAAAPGGM-PGANGAAM 738
+G GAS G +E+ GG G GG +G N G + GG G N +A+
Sbjct: 25 TGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAV 84

Query: 739 GAPPA 743
AP A
Sbjct: 85 AAPVA 89


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_2373  GPOSANCHOR    28     0.046    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 28.5 bits (63), Expect = 0.046
Identities = 16/64 (25%), Positives = 22/64 (34%), Gaps = 3/64 (4%)

Query: 293 KAAKGKKATKGADKSAKAADKGADKDKGAKPAAAPPVPARSRPAGPAQPAAPLKPATAPS 352
K + +KA A A+A A K+K AK A + + P A P
Sbjct: 424 KLTEKEKAELQAKLEAEAK---ALKEKLAKQAEELAKLRAGKASDSQTPDAKPGNKAVPG 480

Query: 353 PGAP 356
G
Sbjct: 481 KGQA 484


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_2374  SHAPEPROTEIN  504    0.0      Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein

signature.
Length = 347

Score = 504 bits (1300), Expect = 0.0
Identities = 247/348 (70%), Positives = 294/348 (84%), Gaps = 2/348 (0%)

Query: 1 MFGFLRSYFSNDLAIDLGTANTLIYMRGKGIVLDEPSVVSIRQEGGPNGKKTIQAVGKEA 60
M R FSNDL+IDLGTANTLIY++G+GIVL+EPSVV+IRQ+ K++ AVG +A
Sbjct: 1 MLKKFRGMFSNDLSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRA-GSPKSVAAVGHDA 59

Query: 61 KQMLGKVPGNIEAIRPMKDGVIADFTVTEQMIKQFIKTAHESRMFSPSPRIIICVPCGST 120
KQMLG+ PGNI AIRPMKDGVIADF VTE+M++ FIK H + PSPR+++CVP G+T
Sbjct: 60 KQMLGRTPGNIAAIRPMKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGAT 119

Query: 121 QVERRAIKEAAHGAGASQVYLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVGVISLG 180
QVERRAI+E+A GAGA +V+LIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEV VISL
Sbjct: 120 QVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLN 179

Query: 181 GIVYKGSVRVGGDKFDEAIVNYIRRNYGMLIGEQTAEAIKKEIGSAFPGSEVKEMEVKGR 240
G+VY SVR+GGD+FDEAI+NY+RRNYG LIGE TAE IK EIGSA+PG EV+E+EV+GR
Sbjct: 180 GVVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGR 239

Query: 241 NLSEGIPRSFTISSNEILEALTDPLNQIVSSVKIALEQTPPELGADIAERGMMLTGGGAL 300
NL+EG+PR FT++SNEILEAL +PL IVS+V +ALEQ PPEL +DI+ERGM+LTGGGAL
Sbjct: 240 NLAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGAL 299

Query: 301 LRDLDRLLAEETGLPVLVAEDPLTCVVRGSGMALERMDKL-GSIFSYE 347
LR+LDRLL EETG+PV+VAEDPLTCV RG G ALE +D G +FS E
Sbjct: 300 LRNLDRLLMEETGIPVVVAEDPLTCVARGGGKALEMIDMHGGDLFSEE 347


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_2377  TYPE4SSCAGA   31     0.013    Type IV secretion system CagA exotoxin signature.
		>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature.

Length = 1147

Score = 30.8 bits (69), Expect = 0.013
Identities = 29/89 (32%), Positives = 43/89 (48%), Gaps = 5/89 (5%)

Query: 395 SNKIAKEIFVTIWDEKAADEGAADRIIEAKGLK-QISDTGALEAIIDEVLAANAKSVEEF 453
+N EIF I E D A KG+K ++SD LE + ++ L KS +EF
Sbjct: 648 ANSQKDEIFALINKEANRDARAIAYAQNLKGIKRELSDK--LENV-NKNLKDFDKSFDEF 704

Query: 454 RAGKDKAFNALVGQAMKATKGKANPQQVN 482
+ GK+K F+ + +KA KG +N
Sbjct: 705 KNGKNKDFSK-AEETLKALKGSVKDLGIN 732


63  BMA10247_2662  BMA10247_2669  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
BMA10247_2662  -1      9        -0.055624  general secretion pathway protein D
BMA10247_2663  2       11       1.128764   general secretion pathway protein E
BMA10247_2664  0       14       1.508515   general secretion pathway protein F
BMA10247_2665  1       12       2.311281   general secretion pathway protein C
BMA10247_2666  0       11       3.075430   general secretion pathway protein G
BMA10247_2667  -1      12       3.753677   general secretion pathway protein H
BMA10247_2668  0       12       3.471712   general secretory pathway protein I
BMA10247_2669  -2      9        4.022174   general secretory pathway protein J
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_2662  BCTERIALGSPD  403    e-133    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D

signature.
Length = 660

Score = 403 bits (1037), Expect = e-133
Identities = 215/691 (31%), Positives = 325/691 (47%), Gaps = 88/691 (12%)

Query: 13 TALVVAGIVAAQAAHAQVTLNFVNADIDQVAKAIGAATGKTIIVDPRVKGQLNLVAERPV 72
T L+ A ++ AA + + +F DI + + KT+I+DP V+G + + + +
Sbjct: 13 TLLIFAALLFRPAAAEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDML 72

Query: 73 PEDQALKTLQSALRMQGFALV-QDHGVLKVVPEADAKLQGVPTYIGNAPQVRGDQVVTQV 131
E+Q + S L + GFA++ ++GVLKVV DAK VP AP + GD+VVT+V
Sbjct: 73 NEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGI-GDEVVTRV 131

Query: 132 FELRNESANNLLPVLRPLI--SPNNTITAYPANNTIVVTDYADNVRRIAQIIAGVDSAAG 189
L N +A +L P+LR L + ++ Y +N +++T A ++R+ I+ VD+A
Sbjct: 132 VPLTNVAARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDNAGD 191

Query: 190 SQVAVVPLKNANAIDIAAQLTKLLDPGAIGNTDATLKVTVQADPRTNALLLRASNAQRLA 249
V VPL A+A D+ +T+L + ++ V AD RTNA+L+ R
Sbjct: 192 RSVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNSR-Q 250

Query: 250 AAKKIAQQLDAPSGVPGNMHVVPLRNAEAVKLAKTLRGMLGKGGGESGSSASSNDANAFN 309
+ +QLD GN V+ L+ A+A L + L G+
Sbjct: 251 RIIAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGIS-------------------- 290

Query: 310 QGGSQSGSNFSTGASGTPPLPSGLSSNSSGGAGGTTGGGGLGNAGLLGGDKDKGDDNQPG 369
S + S +
Sbjct: 291 ---------------------STMQSEKQAAKPVAALDKNI------------------- 310

Query: 370 GMIQADAASNSLIITASDPVYRNLRAVIDQLDSRRAQVYIEALVVELQATTSANLGIQWQ 429
+I+A +N+LI+TA+ V +L VI QLD RR QV +EA++ E+Q NLGIQW
Sbjct: 311 -IIKAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWA 369

Query: 430 VANNALYAGTNLVTGQTGLGNSIVNLTAGAVT--NPGGTLGSLG---SITNGLNIGWLHN 484
N +T T G I AGA G SL S NG+ G
Sbjct: 370 NKNAG-------MTQFTNSGLPISTAIAGANQYNKDGTVSSSLASALSSFNGIAAG---- 418

Query: 485 MFGVQGLGALLQFFAGSSDANVLSTPNLVTLDNEEAKIVVGQNVPIPTGSYSNLTSGTTA 544
F LL + S+ ++L+TP++VTLDN EA VGQ VP+ TGS + +
Sbjct: 419 -FYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTGS----QTTSGD 473

Query: 545 NAFNTYDRRDVGLTLHVKPQITEGGILKLQLYTEDSAVVPGTNTTSANSPGPTFTKRSIQ 604
N FNT +R+ VG+ L VKPQI EG + L++ E S+V +++++ G TF R++
Sbjct: 474 NIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSV-ADAASSTSSDLGATFNTRTVN 532

Query: 605 STVLADNGEIIVLGGLMQDNYQVSNTKVPLLGDIPWIGQLFRSEGKTRQKTNLMVFLRPV 664
+ VL +GE +V+GGL+ + + KVPLLGDIP IG LFRS K K NLM+F+RP
Sbjct: 533 NAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPT 592

Query: 665 IINDRETAQAVTSNRYDYIQGVTGAYKSDNN 695
+I DR+ + +S +Y + N
Sbjct: 593 VIRDRDEYRQASSGQYTAFNDAQSKQRGKEN 623


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_2664  BCTERIALGSPF  382    e-133    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F

signature.
Length = 408

Score = 382 bits (982), Expect = e-133
Identities = 174/406 (42%), Positives = 266/406 (65%), Gaps = 2/406 (0%)

Query: 1 MPAFRFEAIDASGRAQKGVIEADSARNARGQLRTQGLTPLVVEPAASAQRGARSQRLALG 60
M + ++A+DA G+ +G EADSAR AR LR +GL PL V+ Q+ + S L+L
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 61 R--KLSQREQAILTRQLASLLVAGLPLDEALAVLTEQAERDYIRELMAAIRAEVLGGHSL 118
R +LS + A+LTRQLA+L+ A +PL+EAL + +Q+E+ ++ +LMAA+R++V+ GHSL
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 119 ANALTQHPRDFPEIYRALVAAGEHTGKLGIVLSRLADYIEERNALKQKILLAFTYPAIVT 178
A+A+ P F +Y A+VAAGE +G L VL+RLADY E+R ++ +I A YP ++T
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180

Query: 179 VIAFGIVTFLLSYVVPQVVNVFASTKQQLPVLTIVMMALSDFVRHWWWAILIGIAAVVYL 238
V+A +V+ LLS VVP+VV F KQ LP+ T V+M +SD VR + +L+ + A
Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240

Query: 239 VKATLSRDGPRLAFDRWLLTAPLAGKLVRGYNTVRFASTLGILTAAGVPILRALQAAGET 298
+ L ++ R++F R LL PL G++ RG NT R+A TL IL A+ VP+L+A++ +G+
Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300

Query: 299 LSNRAMRGNIDDAIVRVREGSALSRALNNVKTFPPVLVHLIRSGEATGDVTTMLDRAAEG 358
+SN R + A VREG +L +AL FPP++ H+I SGE +G++ +ML+RAA+
Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360

Query: 359 ESRELERRTMFLTSLLEPLLILAMGGIVLVIVLAVMLPIIELNNMV 404
+ RE + L EPLL+++M +VL IVLA++ PI++LN ++
Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_2666  BCTERIALGSPG  188    6e-65    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 188 bits (480), Expect = 6e-65
Identities = 67/140 (47%), Positives = 94/140 (67%), Gaps = 3/140 (2%)

Query: 10 QAARRQRGFTLIEIMVVVAILGILAALIVPKIMSRPDEARRIAAKQDIGTIMQALKLYRL 69
+A +QRGFTL+EIMVV+ I+G+LA+L+VP +M ++A + A DI + AL +Y+L
Sbjct: 2 RATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKL 61

Query: 70 DNGRYPTQDQGLNALIQKPTTDPIPNNWKDGGYLERLPNDPWGNSYKYLNPGVHGEIDVF 129
DN YPT +QGL +L++ PT P+ N+ GY++RLP DPWGN Y +NPG HG D+
Sbjct: 62 DNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDLL 121

Query: 130 SYGADGKEGGESNDSDIGSW 149
S G DG+ G E DI +W
Sbjct: 122 SAGPDGEMGTE---DDITNW 138


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_2667  BCTERIALGSPH  51     1e-10    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H

signature.
Length = 170

Score = 51.5 bits (123), Expect = 1e-10
Identities = 20/101 (19%), Positives = 33/101 (32%), Gaps = 15/101 (14%)

Query: 51 RARGFTLLEMLVVLVIAGILVSVASLTLRRNPRTDLREEAQRIALLFETAGDEAQVRARP 110
R RGFTLLEM+++L++ G+ + L + + R +
Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTGQF 61

Query: 111 IAWRATEHGFRF---------------DIRTGDGWRPLRDD 136
++F D +G W PLR
Sbjct: 62 FGVSVHPDRWQFLVLEARDGADPAPADDGWSGYRWLPLRAG 102


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_2668  BCTERIALGSPG  30     0.001    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 30.2 bits (68), Expect = 0.001
Identities = 10/26 (38%), Positives = 18/26 (69%)

Query: 10 RSPARSRGFTMIEVLVALAIIAVALA 35
R+ + RGFT++E++V + II V +
Sbjct: 2 RATDKQRGFTLLEIMVVIVIIGVLAS 27


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_2669  BCTERIALGSPG  34     3e-04    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 33.7 bits (77), Expect = 3e-04
Identities = 17/72 (23%), Positives = 34/72 (47%), Gaps = 3/72 (4%)

Query: 33 RGFTLIEMMIAITILAVIA-ILSWRGLDQIIRGREKVAAAMEDERVFAQMFDQMRIDARR 91
RGFTL+E+M+ I I+ V+A ++ + + ++ A+ D D ++D
Sbjct: 8 RGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQK--AVSDIVALENALDMYKLDNHH 65

Query: 92 AATDDEAGQPAV 103
T ++ + V
Sbjct: 66 YPTTNQGLESLV 77


64  BMA10247_2685  BMA10247_2693  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
BMA10247_2685  2       12       -2.570513  flagellar motor switch protein FliM
BMA10247_2686  1       13       -1.317566  flagellar motor switch protein FliN
BMA10247_2687  1       15       -0.751565  flagellar protein FliO
BMA10247_2689  0       15       -1.163150  IS407A, transposase OrfA
BMA10247_2690  0       13       0.899197   IS407A, transposase OrfB
BMA10247_2692  -3      9        1.883315   flagellar biosynthesis protein FliQ
BMA10247_2693  -3      7        1.322481   flagellar biosynthetic protein FliR
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_2685  FLGMOTORFLIM  274    4e-93    Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 274 bits (703), Expect = 4e-93
Identities = 82/324 (25%), Positives = 158/324 (48%), Gaps = 10/324 (3%)

Query: 5 EFMSQEEVDALLKGVTGEDDSADEPAEASG---IRPYNIATQERIVRGRMPGLEIINDRF 61
E +SQ+E+D LL ++ D S ++ S I Y+ ++ + +M L ++++ F
Sbjct: 3 EVLSQDEIDQLLTAISSGDASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLMHETF 62

Query: 62 ARLLRIGIFNFMRRTAEISVSQVKVQKYSEFTRNLPIPTNLNLVHVKPLRGTSLFVFDPN 121
ARL + +R + V+ V Y EF R++P P+ L ++ + PL+G ++ DP+
Sbjct: 63 ARLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVLEVDPS 122

Query: 122 LVFFVVDNLFGGDGRFHTRVEGRDFTATEQRIIGKLLNLVFEHYASAWKSVRPLQFEFVR 181
+ F ++D LFGG G+ RD T E ++ ++ + + +W V L+ +
Sbjct: 123 ITFSIIDRLFGGTGQAAKVQ--RDLTDIENSVMEGVIVRILANVRESWTQVIDLRPRLGQ 180

Query: 182 SEMHTQFANVATPNEIVIATQFSIEFGPTGGTLHICMPYSMIEPIRDVLSSPIQGEAL-- 239
E + QFA + P+E+V+ + G G ++ C+PY IEPI LSS ++
Sbjct: 181 IETNPQFAQIVPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFSSVRR 240

Query: 240 EVDRRWVRVLSQQVQSAEVELVADLAEVPTTFEKILNLRTGDVLPLD---ITDSITAKVD 296
+++ VL ++ + ++++VA++ + + IL LR GD++ L + D +
Sbjct: 241 SSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFVLSIG 300

Query: 297 GVPVMECGYGIFNGQYALRVQRMI 320
C G+ + A ++ I
Sbjct: 301 NRKKFLCQPGVVGKKIAAQILERI 324


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_2686  FLGMOTORFLIN  134    2e-43    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 134 bits (339), Expect = 2e-43
Identities = 78/126 (61%), Positives = 97/126 (76%), Gaps = 3/126 (2%)

Query: 38 AMDD-WAAALAEQNQQPIETGATGAGVFRPLSKATASSTHNDIDLILDIPVKMTVELGRT 96
A+DD WA AL EQ ++ A VF+ L S DIDLI+DIPVK+TVELGRT
Sbjct: 14 ALDDLWADALNEQKATTTKSAADA--VFQQLGGGDVSGAMQDIDLIMDIPVKLTVELGRT 71

Query: 97 KIAIRNLLQLAQGSVVELDGLAGEPMDVLVNGCLIAQGEVVVVNDKFGIRLTDIITPSER 156
++ I+ LL+L QGSVV LDGLAGEP+D+L+NG LIAQGEVVVV DK+G+R+TDIITPSER
Sbjct: 72 RMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGVRITDIITPSER 131

Query: 157 IRKLNR 162
+R+L+R
Sbjct: 132 MRRLSR 137


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_2692  TYPE3IMQPROT  69     4e-19    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein

family signature.
Length = 86

Score = 68.6 bits (168), Expect = 4e-19
Identities = 26/85 (30%), Positives = 46/85 (54%)

Query: 4 ENVMTLAHQAMYIGLLLAAPLLLVALAVGLVVSLFQAATQINEATLSFIPKLLAVAATMV 63
++++ ++A+Y+ L+L+ +VA +GL+V LFQ TQ+ E TL F KLL V +
Sbjct: 2 DDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLF 61

Query: 64 IAGPWMLSTMIDYLRETLLRVATLG 88
+ W ++ Y R+ + G
Sbjct: 62 LLSGWYGEVLLSYGRQVIFLALAKG 86


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_2693  TYPE3IMRPROT  161    5e-51    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein

family signature.
Length = 261

Score = 161 bits (409), Expect = 5e-51
Identities = 117/250 (46%), Positives = 158/250 (63%), Gaps = 1/250 (0%)

Query: 1 MFSVTYAQLNGWLTAFLWPFVRMLALVAIAPVTGHRSTPVRVKIGLAGFMALVVAPTLPP 60
M VT Q WL + WP +R+LAL++ AP+ RS P RVK+GLA + +AP+LP
Sbjct: 1 MLQVTSEQWLSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA 60

Query: 61 MPVATVFSAQGVWIIVNQFLIGAALGFTMQIVFAAIEAAGDIIGLSMGLGFATFFDPHSS 120
V VFS +W+ V Q LIG ALGFTMQ FAA+ AG+IIGL MGL FATF DP S
Sbjct: 61 NDVP-VFSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASH 119

Query: 121 GATPVMGRFLNAVAILAFLAFDGHLQVFAALVDSFRLVPVSANLLRAAGWQTLVAFGAAI 180
PV+ R ++ +A+L FL F+GHL + + LVD+F +P+ L + + L G+ I
Sbjct: 120 LNMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLI 179

Query: 181 FEMGLLLALPVVAALLIANLALGILNRAAPQIGIFQVGFPVTMLVGLLLVQLMAPNLIPF 240
F GL+LALP++ LL NLALG+LNR APQ+ IF +GFP+T+ VG+ L+ + P + PF
Sbjct: 180 FLNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPF 239

Query: 241 VGRLFDTGVD 250
LF +
Sbjct: 240 CEHLFSEIFN 249


65  BMA10247_2818  BMA10247_2825  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
BMA10247_2818  3       11       1.217121   C4-dicarboxylate transport transcriptional
BMA10247_2819  7       21       0.967231   hypothetical protein
BMA10247_2820  6       15       3.847442   hypothetical protein
BMA10247_2821  2       15       3.188264   thioesterase domain-containing protein
BMA10247_2822  2       15       2.747880   hypothetical protein
BMA10247_2824  -1      11       1.610092   acetyltransferase
BMA10247_2823  -1      11       -0.059316  hypothetical protein
BMA10247_2825  -2      9        -0.493607  Mg chelatase subunit D/I family protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_2818  HTHFIS        445    e-156    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 445 bits (1145), Expect = e-156
Identities = 152/483 (31%), Positives = 231/483 (47%), Gaps = 47/483 (9%)

Query: 4 RLQVIYIEDDELVRRASVQSLQLAGFDVVGFGSVEAAEKAIVGDATGVIVSDIRLPGASG 63
++ +DD +R Q+L AG+DV + + I ++V+D+ +P +
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 64 LELLAQCRERTPDVPVVLVTGHGDISMAVQAMRDGAYDFIEKPFAAERLTETVRRALERR 123
+LL + ++ PD+PV++++ A++A GAYD++ KPF L + RAL
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 124 ALVLENHALRRELAGQGVVAPRIIGRSPAIEQVRRLIANVAPTDASVLINGDTGAGKELI 183
+L ++GRS A++++ R++A + TD +++I G++G GKEL+
Sbjct: 123 K------RRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELV 176

Query: 184 ARSLHELSPRRDKPFIAVNCGALPEPMFESEMFGYEPGAFTGAAKRRIGKLEYASGGTLF 243
AR+LH+ RR+ PF+A+N A+P + ESE+FG+E GAFTGA R G+ E A GGTLF
Sbjct: 177 ARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLF 236

Query: 244 LDEIESMPLALQVKLLRVLQDGVLERLGSNQPIRVNCRVVAAAKGDMSEHVAAGTFRRDL 303
LDEI MP+ Q +LLRVLQ G +G PIR + R+VAA D+ + + G FR DL
Sbjct: 237 LDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDL 296

Query: 304 LYRLNVVTIALPPLAERREDIVPLFEHFMLDAAVRYGRPAPLLTDRQRASLMQRDWPGNV 363
YRLNVV + LPPL +R EDI L HF + A + G + WPGNV
Sbjct: 297 YYRLNVVPLRLPPLRDRAEDIPDLVRHF-VQQAEKEGLDVKRFDQEALELMKAHPWPGNV 355

Query: 364 RELRNAADRFVLGVTEGIVG---------------------------------------- 383
REL N R + ++
Sbjct: 356 RELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQY 415

Query: 384 DAGPETDEHAEQSLKERVEQFERAVIAETLNRTGGAVATTADKLHVGKATLYEKMKRYGL 443
A + + E +I L T G AD L + + TL +K++ G+
Sbjct: 416 FASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475

Query: 444 SAK 446
S
Sbjct: 476 SVY 478


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_2819  TYPE4SSCAGX   28     0.025    Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein

signature.
Length = 522

Score = 27.8 bits (61), Expect = 0.025
Identities = 19/64 (29%), Positives = 31/64 (48%), Gaps = 8/64 (12%)

Query: 97 EFVAVAMNYDPPMYVANYAQTRQ------LPFKVALDDGSVAK-QFGNVQLTPTTFVIGK 149
++V A+ +P NY Q + +P ++ DDG+ F N+ L P FV+
Sbjct: 386 QYVHNALKRNPVPRNYNYYQAPEKRSKHIMPSEI-FDDGTFTYFGFKNITLQPAIFVVQP 444

Query: 150 DGKI 153
DGK+
Sbjct: 445 DGKL 448


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_2824  SACTRNSFRASE  32     6e-04    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 31.8 bits (72), Expect = 6e-04
Identities = 20/83 (24%), Positives = 30/83 (36%), Gaps = 6/83 (7%)

Query: 47 GEALLVAQARDE--GIVGFVSVWEPERFVHHLYVAGTRLREGIGAALLRALPGW----PA 100
G+A + + G + S W + + VA ++G+G ALL W
Sbjct: 64 GKAAFLYYLENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHF 123

Query: 101 ARYRLKCLVRNERALAFYRAHGF 123
L+ N A FY H F
Sbjct: 124 CGLMLETQDINISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_2825  SYCECHAPRONE  29     0.024    Gram-negative bacterial type III secretion SycE cha...
		>SYCECHAPRONE#Gram-negative bacterial type III secretion SycE

chaperone signature.
Length = 130

Score = 28.9 bits (64), Expect = 0.024
Identities = 22/84 (26%), Positives = 32/84 (38%), Gaps = 8/84 (9%)

Query: 158 ELYLPLPSAAEAALVPGVTVYGAADLPALCAHLADTPDGRLAPVAAPRLDALPAAATADL 217
+L L +P E + GV V C H+ + P G++ P LD T
Sbjct: 14 QLSLSIPDTIEPVI--GVKVG-----EFAC-HITEHPVGQILMFTLPSLDNNDEKETLLS 65

Query: 218 ADVIGQAGAKRALEVAAAGGHHML 241
++ Q K L GGH +L
Sbjct: 66 HNIFSQDILKPILSWDEVGGHPVL 89


66  BMA10247_3339  BMA10247_3353  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
BMA10247_3339  1       10       0.460991   chromate transporter
BMA10247_3340  0       12       0.070855   chromate transporter
BMA10247_3342  1       13       0.161587   uracil-xanthine permease
BMA10247_3343  1       15       0.072395   flagellar hook-associated protein FlgL
BMA10247_3344  1       15       -0.134236  flagellar hook-associated protein FlgK
BMA10247_3345  -2      16       -0.148955  hypothetical protein
BMA10247_3346  0       18       -0.902143  flagellar rod assembly protein/muramidase FlgJ
BMA10247_3347  3       18       -0.869641  flagellar basal body P-ring biosynthesis protein
BMA10247_3348  4       17       -1.956068  flagellar basal body L-ring protein
BMA10247_3349  5       15       -2.057569  flagellar basal body rod protein FlgG
BMA10247_3350  5       12       -1.512841  flagellar basal body rod protein FlgF
BMA10247_3351  4       12       -1.553119  flagellar hook protein FlgE
BMA10247_3352  2       13       -1.019815  flagellar basal body rod modification protein
BMA10247_3353  -1      17       -1.254960  flagellar basal body rod protein FlgC
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_3339  ACRIFLAVINRP  28     0.021    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 28.3 bits (63), Expect = 0.021
Identities = 17/63 (26%), Positives = 30/63 (47%), Gaps = 2/63 (3%)

Query: 110 YVQQGMMPVTAGLVVASAVLISEASNRSALQWGITAAVAAL-AYRTRVHPLWLLAGGALA 168
Y G++ T GL +A+LI E + + G A L A R R+ P+ + + +
Sbjct: 925 YFMVGLL-TTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFIL 983

Query: 169 GLV 171
G++
Sbjct: 984 GVL 986


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_3343  FLAGELLIN     41     6e-06    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 41.2 bits (96), Expect = 6e-06
Identities = 55/369 (14%), Positives = 113/369 (30%), Gaps = 10/369 (2%)

Query: 16 MNDQQAQIAQLYQQVSSGISLTTPADNPLAAAQAVQLSATSATLAQYTQNQTIVQTALQT 75
+N Q+ ++ +++SSG+ + + D+ A A + ++ L Q ++N + QT
Sbjct: 17 LNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQASRNANDGISIAQT 76

Query: 76 EDTTLTSVNDVLNAAYQALMHAGDGGLSDSDRAALAAQIQGSRDHLLTLANTADGAGNYL 135
+ L +N+ L + + A +G SDSD ++ +IQ + + ++N G +
Sbjct: 77 TEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDRVSNQTQFNGVKV 136

Query: 136 FAGFQPTTQPFSNKPGGGVTY------AGDYGARAVQIADTRTVSQGDNGANVFMSVPFL 189
+ G +T G + + + GD ++ +
Sbjct: 137 LSQDNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEATVGDLKSSFKNVTGYD 196

Query: 190 GSLPVPAAGASNTGTGTIGAVSITNPSDPTNTHQFTITFGGTAAAPTYTVTDNSVTPPTT 249
+ +G + + T A T D T +T
Sbjct: 197 TYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAENNTAVDLFKTTKST 256

Query: 250 TAAQAYSSGQGINLGGQTVAVSGKPAVGDTFTVTPAPQAGTDVFATLD----TVIAALKS 305
+ G GG+ V T V T++ T+ A +
Sbjct: 257 AGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTTINGEKVTLTVADIT 316

Query: 306 PVGNSQTASTALTNTMATASTKLMNTMTNVLTVQASVGGRLQEVKAMQAVTTTNTLQTTN 365
+ A+T ++ S + T S E + T+
Sbjct: 317 AGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESKITVNGAE 376

Query: 366 SLSNLTDTN 374
+N
Sbjct: 377 YTANAAGDK 385


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_3344  FLGHOOKAP1    231    4e-70    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 231 bits (591), Expect = 4e-70
Identities = 162/444 (36%), Positives = 253/444 (56%), Gaps = 12/444 (2%)

Query: 3 NTLMNLGVSGLNAALWGLTTTGQNISNAATPGYSVERPVYAEASGQYTSSGYLPQGVSTV 62
++L+N +SGLNAA L T NIS+ GY+ + + A+A+ + G++ GV
Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS 60

Query: 63 TVERQYNQYLSNQLNAAQTQGSSLSTYYTLVAQLNNYVGSPTAGIATAITNYFTGLQTVA 122
V+R+Y+ +++NQL AAQTQ S L+ Y +++++N + + T+ +AT + ++FT LQT+
Sbjct: 61 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV 120

Query: 123 NNAADPSARQTAMSNAQTLASQLVAAGQQYSQLRQSVNSQLTDTVTQINSYTSQIAQLNE 182
+NA DP+ARQ + ++ L +Q Q + VN + +V QIN+Y QIA LN+
Sbjct: 121 SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND 180

Query: 183 QIA--SASSQGQPPNQLLDQRDLAVSKLSQLAGVQV-VQSNGNYSVFLSGGQPLVVGNAS 239
QI+ + G PN LLDQRD VS+L+Q+ GV+V VQ G Y++ ++ G LV G+ +
Sbjct: 181 QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA 240

Query: 240 YQLATVASPSDPSELTI-VSKGVAGSAQPGPTQYLPDVSLTGGALGGLLAFRSQTLDPAQ 298
QLA V S +DPS T+ G AG+ + +P+ L G+LGG+L FRSQ LD +
Sbjct: 241 RQLAAVPSSADPSRTTVAYVDGTAGNIE------IPEKLLNTGSLGGILTFRSQDLDQTR 294

Query: 299 AQLGALAVSFASQVNAQNALGVDMSGNPGGSLFAVGAPAVYANQNNTGSATLSVSFVDGT 358
LG LA++FA N Q+ G D +G+ G FA+G PAV N N G + + D +
Sbjct: 295 NTLGQLALAFAEAFNTQHKAGFDANGDAGEDFFAIGKPAVLQNTKNKGDVAIGATVTDAS 354

Query: 359 QPTTSDYALSYDGAKYTLTDRATGSVVGTATPSSTPPTMTIGGLKLSLSSTPNAGDSFTV 418
+DY +S+D ++ +T R + T TP + + GL+L+ + TP DSFT+
Sbjct: 355 AVLATDYKISFDNNQWQVT-RLASNTTFTVTPDAN-GKVAFDGLELTFTGTPAVNDSFTL 412

Query: 419 LPTRGALDGFSLATANGSAIAAAS 442
P A+ + + + IA AS
Sbjct: 413 KPVSDAIVNMDVLITDEAKIAMAS 436



Score = 83.1 bits (205), Expect = 9e-19
Identities = 46/105 (43%), Positives = 66/105 (62%)

Query: 561 GTNDGRNALALSQLVNSKTMNNGTTTLTGAYAGYVNAIGNAASQLKASSAAQTALVGQIT 620
G +D RN AL L ++ G + AYA V+ IGN + LK SSA Q +V Q++
Sbjct: 441 GDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGNKTATLKTSSATQGNVVTQLS 500

Query: 621 QAQQSVSGVNQNEEAANLMQYQQLYQANAKVIQTANSVFQTVLGL 665
QQS+SGVN +EE NL ++QQ Y ANA+V+QTAN++F ++ +
Sbjct: 501 NQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_3346  FLGFLGJ       227    3e-75    Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 227 bits (579), Expect = 3e-75
Identities = 124/297 (41%), Positives = 173/297 (58%), Gaps = 15/297 (5%)

Query: 15 ALDVQGFDALRSKATAAAPREGVKMVAGQFDAMFTQMMLKSMRDATPSDGLLDSSSSKMY 74
A D Q + L++KA P ++ VA Q + MF QMMLKSMRDA P DGL S +++Y
Sbjct: 12 AWDAQSLNELKAKA-GEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDGLFSSEHTRLY 70

Query: 75 TSMLDQQLAQQMSS-KGIGVADALTKQLLRNANVAPDAQGEGGLAAMNALAKAYANSNGA 133
TSM DQQ+AQQM++ KG+G+A+ + KQ+ + ++ + Y N +
Sbjct: 71 TSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLETVVRYQNQALS 130

Query: 134 PGNGALAGTRGYSAASALTPPLKGNGNSAQADAFVEKMALAAQAASATTGIPARFIVGQA 193
P + + AF+ +++L AQ AS +G+P I+ QA
Sbjct: 131 ------------QLVQKAVPRNYDDSLPGDSKAFLAQLSLPAQLASQQSGVPHHLILAQA 178

Query: 194 ALESGWGKREIRGANGESSYNVFGIKATKGWTGRTVSAVTTEYVNGKPHRVVAQFRAYDS 253
ALESGWG+R+IR NGE SYN+FG+KA+ W G TTEY NG+ +V A+FR Y S
Sbjct: 179 ALESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSS 238

Query: 254 YEHAMTDYANLLKNNPRYASVLNAGHNAEGFAHGMQKAGYATDPHYAKKLISIMQQI 310
Y A++DY LL NPRYA+V A +AE A +Q AGYATDPHYA+KL +++QQ+
Sbjct: 239 YLEALSDYVGLLTRNPRYAAVTTAA-SAEQGAQALQDAGYATDPHYARKLTNMIQQM 294


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_3347  FLGPRINGFLGI  371    e-129    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 371 bits (953), Expect = e-129
Identities = 164/392 (41%), Positives = 225/392 (57%), Gaps = 27/392 (6%)

Query: 4 RVVRPLVAARRRAAACCALAACMLALAFAPAAARAERLKDLAQIQGVRDNPLIGYGLVVG 63
RV+R + AA +A L+ PA A R+KD+A +Q RDN LIGYGLVVG
Sbjct: 2 RVLRIIAAALVFSALPF--------LSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVG 53

Query: 64 LDGTGDQTMQTPFTTQTLANMLANLGISINNGSANGGGSSAMTNMQLKNVAAVMVTATLP 123
L GTGD +PFT Q++ ML NLGI+ G +N KN+AAVMVTA LP
Sbjct: 54 LQGTGDSLRSSPFTEQSMRAMLQNLGITTQGGQSN-----------AKNIAAVMVTANLP 102

Query: 124 PFARPGEAIDVTVSSLGNAKSLRGGTLLLTPLKGADGQVYALAQGNMAVGGAGASANGSR 183
PFA PG +DVTVSSLG+A SLRGG L++T L GADGQ+YA+AQG + V G A + +
Sbjct: 103 PFASPGSRVDVTVSSLGDATSLRGGNLIMTSLSGADGQIYAVAQGALIVNGFSAQGDAAT 162

Query: 184 VQVNQLAAGRIAGGAIVERSVPNAVAQMNGVLQLQLNDMDYGTAQRIVSAVNS----SFG 239
+ + R+ GAI+ER +P+ L LQL + D+ TA R+ VN+ +G
Sbjct: 163 LTQGVTTSARVPNGAIIERELPSKFKDSV-NLVLQLRNPDFSTAVRVADVVNAFARARYG 221

Query: 240 AGTATALDGRTIQLTAPADSAQQVAFMARLQNLEVSPERAAAKVILNARTGSIVMNQMVT 299
A D + I + P + MA ++NL V + AKV++N RTG+IV+ V
Sbjct: 222 DPIAEPRDSQEIAVQKPRVA-DLTRLMAEIENLTVETD-TPAKVVINERTGTIVIGADVR 279

Query: 300 LQNCAVAHGNLSVVVNTQPVVSQPGPFSNGQTVVAQQSQIQLKQDNGSLRMVTAGANLAE 359
+ AV++G L+V V P V QP PFS GQT V Q+ I Q+ + + G +L
Sbjct: 280 ISRVAVSYGTLTVQVTESPQVIQPAPFSRGQTAVQPQTDIMAMQEGSKV-AIVEGPDLRT 338

Query: 360 VVKALNSLGATPADLMSILQAMKAAGALRADL 391
+V LNS+G +++ILQ +K+AGAL+A+L
Sbjct: 339 LVAGLNSIGLKADGIIAILQGIKSAGALQAEL 370


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_3348  FLGLRINGFLGH  206    3e-69    Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 206 bits (526), Expect = 3e-69
Identities = 127/220 (57%), Positives = 156/220 (70%), Gaps = 7/220 (3%)

Query: 25 AALAAAALAGCAQIPREPITQQPMSAMPPMPPAMQAPGSIY---NPGYAG-RPLFEDQRP 80
++L +L GCA IP P+ Q SA P P A GSI+ P G +PLFED+RP
Sbjct: 12 SSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINYGYQPLFEDRRP 71

Query: 81 RNVGDILTIVIAENINATKSSGANTNRQGNTSFDVPTAG-FLGGLF--NKANLSAQGANK 137
RN+GD LTIV+ EN++A+KSS AN +R G T+F T +L GLF +A++ A G N
Sbjct: 72 RNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNARADVEASGGNT 131

Query: 138 FAATGGASAANTFNGTITVTVTNVLPNGNLVVSGEKQMLINQGNEFVRFSGIVNPNTISG 197
F GGA+A+NTF+GT+TVTV VL NGNL V GEKQ+ INQG EF+RFSG+VNP TISG
Sbjct: 132 FNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRFSGVVNPRTISG 191

Query: 198 QNSVYSTQVADARIEYSAKGYINEAETMGWLQRFFLNIAP 237
N+V STQVADARIEY GYINEA+ MGWLQRFFLN++P
Sbjct: 192 SNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSP 231


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_3349  FLGHOOKAP1    42     1e-06    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 42.3 bits (99), Expect = 1e-06
Identities = 10/48 (20%), Positives = 23/48 (47%)

Query: 213 TLKQGYVESSNVNVVQELVNMIQTQRAYEINSKAVTTSDQMLQTVTQM 260
L S VN+ +E N+ + Q+ Y N++ + T++ + + +
Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545



Score = 40.3 bits (94), Expect = 5e-06
Identities = 19/80 (23%), Positives = 34/80 (42%), Gaps = 14/80 (17%)

Query: 4 SLYIAATGMNAQQAQMDVISNNLANVSTNGFKGSRAVFEDLLYQTVRQPGANSTQQTELP 63
+ A +G+NA QA ++ SNN+++ + G+ RQ + + L
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYT--------------RQTTIMAQANSTLG 48

Query: 64 SGLQLGTGVQQVATERLYTQ 83
+G +G GV +R Y
Sbjct: 49 AGGWVGNGVYVSGVQREYDA 68


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits        Score  E-value  Comments
BMA10247_3350  FLGHOOKAP1  29     0.019    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 29.2 bits (65), Expect = 0.019
Identities = 9/34 (26%), Positives = 18/34 (52%)

Query: 4 LIYTAMTGATQSLEQQSVVANNLANASTTGFRAQ 37
LI AM+G + + +NN+++ + G+ Q
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQ 36


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits        Score  E-value  Comments
BMA10247_3351  FLGHOOKAP1  34     0.001    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 34.2 bits (78), Expect = 0.001
Identities = 17/58 (29%), Positives = 24/58 (41%)

Query: 356 ISAPGSTNHGTLQGSALENSNVDLTSQLVKLITAQRNYQANAQTIKTQQTVDQTLINL 413
SA L S V+L + L Q+ Y ANAQ ++T + LIN+
Sbjct: 488 SSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545



Score = 30.3 bits (68), Expect = 0.017
Identities = 11/31 (35%), Positives = 17/31 (54%)

Query: 6 GLSGLAGASSDLDVIGNNIANANTVGFKGST 36
+SGL A + L+ NNI++ N G+ T
Sbjct: 7 AMSGLNAAQAALNTASNNISSYNVAGYTRQT 37


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits        Score  E-value  Comments
BMA10247_3353  FLGHOOKAP1  27     0.029    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 26.8 bits (59), Expect = 0.029
Identities = 10/38 (26%), Positives = 17/38 (44%)

Query: 102 NVDPVQEMVNMISASRSYQANVETLNTAKQLMLKTLTI 139
V+ +E N+ + Y AN + L TA + + I
Sbjct: 508 GVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


67  BMA10247_3395  BMA10247_3405  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias   Product
BMA10247_3395  0       9        3.747779  flagellar biosynthetic protein FlhB
BMA10247_3396  2       10       2.388281  hypothetical protein
BMA10247_3397  2       12       0.238705  hypothetical protein
BMA10247_3398  3       11       1.270273  flagellar protein FliS
BMA10247_3399  2       13       1.637818  flagellar hook-basal body complex protein FliE
BMA10247_3400  1       11       3.128063  flagellar MS-ring protein
BMA10247_3401  1       9        2.940746  flagellar motor switch protein G
BMA10247_3402  0       9        3.337389  flagellar assembly protein H
BMA10247_3403  -1      9        2.770664  flagellum-specific ATP synthase FliI
BMA10247_3404  -1      8        2.227537  flagellar FliJ protein
BMA10247_3405  0       9        1.843262  flagellar hook-length control protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_3395  TYPE3IMSPROT  62     5e-15    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 62.1 bits (151), Expect = 5e-15
Identities = 15/69 (21%), Positives = 28/69 (40%), Gaps = 1/69 (1%)

Query: 12 APRVVAKGYGLVAERIIERARDAGLYVHTAPEMV-SLLMQVDLDARIPPQLYQAVAELLA 70
P V K + + + A + G+ + + +L +D IP + +A AE+L
Sbjct: 280 LPLVTFKYTDAQVQTVRKIAEEEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLR 339

Query: 71 WLYALERDA 79
WL +
Sbjct: 340 WLERQNIEK 348


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_3399  FLGHOOKFLIE   61     9e-16    Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature.

Length = 103

Score = 61.2 bits (148), Expect = 9e-16
Identities = 47/111 (42%), Positives = 62/111 (55%), Gaps = 8/111 (7%)

Query: 3 APVNGIASALQQMQAMAAQAAGGASPATSLAGSGAASAGSFASAMKASLDKISGDQQKAL 62
+ + GI + Q+QA A A S SFA + A+LD+IS Q A
Sbjct: 1 SAIQGIEGVISQLQATAMSARAQES--------LPQPTISFAGQLHAALDRISDTQTAAR 52

Query: 63 GEAHAFEIGAQNVSLNDVMVDMQKANIGFQFGLQVRNKLVSAYNEIMQMSV 113
+A F +G V+LNDVM DMQKA++ Q G+QVRNKLV+AY E+M M V
Sbjct: 53 TQAEKFTLGEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 103


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_3400  FLGMRINGFLIF  462    e-159    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 462 bits (1191), Expect = e-159
Identities = 253/562 (45%), Positives = 359/562 (63%), Gaps = 37/562 (6%)

Query: 53 LSRMKTNPRLPFLIGAALAIAAIVALVLWSRAPDYRVLYSNLSDRDGGAIIAALQQANVP 112
L+R++ NPR+P ++ + A+A +VA+VLW++ PDYR L+SNLSD+DGGAI+A L Q N+P
Sbjct: 16 LNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQMNIP 75

Query: 113 YKFADAGGAILVPANQVHETRLKLAAMGLPKGGSVGFELMDNQKFGISQFAEQVNYQRAL 172
Y+FA+ GAI VPA++VHE RL+LA GLPKGG+VGFEL+D +KFGISQF+EQVNYQRAL
Sbjct: 76 YRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQFSEQVNYQRAL 135

Query: 173 EGELQRTVESSNAVRAARVYLAIPKPSVFVRDREAPSASVLVDLYPGRVLDEGQVLAVTR 232
EGEL RT+E+ V++ARV+LA+PKPS+FVR++++PSASV V L PGR LDEGQ+ AV
Sbjct: 136 EGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALDEGQISAVVH 195

Query: 233 MVSSSVPDMPAKNVTIVDQDGNLLTQT-ASATGLDASQLKYVQQIERNTQKRIDAILAPI 291
+VSS+V +P NVT+VDQ G+LLTQ+ S L+ +QLK+ +E Q+RI+AIL+PI
Sbjct: 196 LVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDLNDAQLKFANDVESRIQRRIEAILSPI 255

Query: 292 FGAGNARSQVSADVDFSKIEQTSESYGPNGTPQQSAIRSQQTSSSTELAQSGASGVPGAL 351
G GN +QV+A +DF+ EQT E Y PNG ++ +RS+Q + S ++ GVPGAL
Sbjct: 256 VGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVGAGYPGGVPGAL 315

Query: 352 SNTPPQPASAPIVA-------------SNGQPAGPAATPVSDRKDSTTNYELDKTVRHVE 398
SN P P API ++ +A P S +++ T+NYE+D+T+RH +
Sbjct: 316 SNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSNYEVDRTIRHTK 375

Query: 399 QSMGTIKRLSVAVVVNYQPSTDAKGRVTMQPLAADKLAQVQQLVKDAMGYDEKRGDSVNV 458
++G I+RLSVAVVVNY+ D K PL AD++ Q++ L ++AMG+ +KRGD++NV
Sbjct: 376 MNVGDIERLSVAVVVNYKTLADGKP----LPLTADQMKQIEDLTREAMGFSDKRGDTLNV 431

Query: 459 VNSAFSAAADPFANLPWWRQPDMIELGKDIAKWLGVAAAAAALYFMFVRPALRR---AFP 515
VNS FSA + LP+W+Q I+ +WL V A L+ VRP L R
Sbjct: 432 VNSPFSAVDNTGGELPFWQQQSFIDQLLAAGRWLLVLVVAWILWRKAVRPQLTRRVEEAK 491

Query: 516 PPAEPAAAAVPALDGPDDMLALDGLPSPDKKQLAEEDEEHPALLAFENERNRYERNLDYA 575
E A + + L+ D + N+R E
Sbjct: 492 AAQEQAQVRQETEEAVEVRLSKDEQLQQRR----------------ANQRLGAEVMSQRI 535

Query: 576 RTIARQDPKIVATVVKNWVSDE 597
R ++ DP++VA V++ W+S++
Sbjct: 536 REMSDNDPRVVALVIRQWMSND 557


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_3401  FLGMOTORFLIG  297    e-102    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 297 bits (762), Expect = e-102
Identities = 114/324 (35%), Positives = 191/324 (58%)

Query: 5 GLNKSALLLMSIGDEEAAQVFKFLAPREVQKIGAAMAALKNVTREQVEDVLNDFVQEAEK 64
G K+A+LL+SIG E +++VFK+L+ E++ + +A L+ +T E ++VL +F +
Sbjct: 17 GKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFKELMMA 76

Query: 65 HTALSLDSSEYIRTVLTKALGEDKAGVLIDRILQGSDTSGIEGLKWMDSAAVAELIKNEH 124
+ +Y R +L K+LG KA +I+ + + E ++ D A + I+ EH
Sbjct: 77 QEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFVRRADPANILNFIQQEH 136

Query: 125 PQIIATILVHLDRDQASEIASCFTERLRNDVLLRIATLDGIQPTALRELDDVLTGLLSGS 184
PQ IA IL +LD +AS I S ++ +V RIA +D P +RE++ VL L+
Sbjct: 137 PQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVLEKKLASL 196

Query: 185 DNLKRAPMGGIRTAAEILNFMTSVHEEAVIENVKQYDPDLAQKIIDQMFVFENLLDLEDR 244
+ GG+ EI+N E+ +IE++++ DP+LA++I +MFVFE+++ L+DR
Sbjct: 197 SSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFEDIVLLDDR 256

Query: 245 AIQLLLKEVESEALIIALKGAPPALRQKFLSNMSQRAAELLAEDLDARGPVRVSEVETQQ 304
+IQ +L+E++ + L ALK +++K NMS+RAA +L ED++ GP R +VE Q
Sbjct: 257 SIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTRRKDVEESQ 316

Query: 305 RKILQVVRNLAESGQIVIGGKAED 328
+KI+ ++R L E G+IVI E+
Sbjct: 317 QKIVSLIRKLEEQGEIVISRGGEE 340


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
BMA10247_3402  FLGFLIH  108    3e-31    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 108 bits (271), Expect = 3e-31
Identities = 64/184 (34%), Positives = 106/184 (57%), Gaps = 4/184 (2%)

Query: 37 AAAALAAELQRVRDAAHAEGLAAGHVEGQALGYQAGYEQGRAKGFDEGRAEAHTHAAQLA 96
A +L +L +++ AH +G AG EG+ G++ GY++G A+G ++G AEA + A +
Sbjct: 36 AEPSLEQQLAQLQMQAHEQGYQAGIAEGRQQGHKQGYQEGLAQGLEQGLAEAKSQQAPIH 95

Query: 97 A----LAASFRDALAGVERDLADDIATLALEIAQQVVRQHVQHDPAALIAAAREVLAAEP 152
A L + F+ L ++ +A + +ALE A+QV+ Q D +ALI +++L EP
Sbjct: 96 ARMQQLVSEFQTTLDALDSVIASRLMQMALEAARQVIGQTPTVDNSALIKQIQQLLQQEP 155

Query: 153 ALAGAPHLIVNPADLPVVEAYLKDELDTLGWSVRTDTSIERGGCRAHASTGEIDATLTTR 212
+G P L V+P DL V+ L L GW +R D ++ GGC+ A G++DA++ TR
Sbjct: 156 LFSGKPQLRVHPDDLQRVDDMLGATLSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATR 215

Query: 213 WERV 216
W+ +
Sbjct: 216 WQEL 219


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
BMA10247_3404  FLGFLIJ  60     2e-14    Flagellar FliJ protein signature.
		>FLGFLIJ#Flagellar FliJ protein signature.

Length = 147

Score = 59.8 bits (144), Expect = 2e-14
Identities = 43/140 (30%), Positives = 74/140 (52%)

Query: 1 MAQSFPLQLLLERAQDDLDTAAKQLGRAQRERTDAQAQLDALMRYRDEYRVRFAESAQSG 60
MA+ L L + A+ +++ AA+ LG +R A+ QL L+ Y++EYR +G
Sbjct: 1 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG 60

Query: 61 MPAGNWRNFQAFLDTLDAAIEQQRRVLAAAQTRIDAARPEWQAKKRTLGSYEILQARGAR 120
+ + W N+Q F+ TL+ AI Q R+ L ++D A W+ KK+ L +++ LQ R +
Sbjct: 61 ITSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQST 120

Query: 121 QDAQRAAKREQRDADEHAAK 140
+ +Q+ DE A +
Sbjct: 121 AALLAENRLDQKKMDEFAQR 140


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
BMA10247_3405  FLGHOOKFLIK  74     2e-16    Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 73.7 bits (180), Expect = 2e-16
Identities = 79/257 (30%), Positives = 109/257 (42%), Gaps = 8/257 (3%)

Query: 213 NGDASAPLAANRAAFDKLLAGAKAPAAQAAPTDASGANPATALANAAANAAQPDASG--A 270
N D +A L+A A K A + T L + AQPD +
Sbjct: 124 NEDVTASLSALFAMLPGFDNTPKVTDAPSTVLPTEKPTLFTKLTSEQLTTAQPDDAPGTP 183

Query: 271 LAALQDAADSARATLAASSAPAALQQAA-PAALAANASAAAASAAPSLAPPVGTPDWTDA 329
L A++ S P+ + AA P AAP L+ P+G+ +W +
Sbjct: 184 AQPLTPLVAEAQSKAEVISTPSPVTAAASPLITPHQTQPLPTVAAPVLSAPLGSHEWQQS 243

Query: 330 LSQKVVFLSNAHQQSAELTLNPPDLGPLQVVLRVADNHAHALFVSQHAQVRDAVEAALPK 389
LSQ + + QQSAEL L+P DLG +Q+ L+V DN A VS H VR A+EAALP
Sbjct: 244 LSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAALPV 303

Query: 390 LREAMEAGGLGLGSASVSDGGFASAQQQQTPQRQSSDGSATRRAFGASTADAALDELAAA 449
LR + G+ LG +++S F+ QQ + Q+QS +A D L
Sbjct: 304 LRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQR-TANHEPLAGEDDDT----LPVP 358

Query: 450 SSGGAARRTVGMVDTFA 466
S VD FA
Sbjct: 359 VSLQGRVTGNSGVDIFA 375


68  BMA10247_3433  BMA10247_3440  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
BMA10247_3433  0       11       -1.616009  ATP-dependent protease ATP-binding subunit HslU
BMA10247_3434  0       11       -0.069388  Fis family transcriptional regulator
BMA10247_3435  0       10       0.193012   sensor histidine kinase
BMA10247_3436  -1      10       -0.534106  hypothetical protein
BMA10247_3437  -2      9        -1.777033  hypothetical protein
BMA10247_3438  -2      8        -1.646962  acetylglutamate kinase
BMA10247_3439  -3      7        -0.062295  HAD-superfamily hydrolase
BMA10247_3440  -2      9        -0.015127  nucleoid occlusion protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
BMA10247_3433  HTHFIS  31     0.016    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.6 bits (69), Expect = 0.016
Identities = 13/68 (19%), Positives = 29/68 (42%), Gaps = 15/68 (22%)

Query: 17 IIGQAKAKKAVAVALRNRWRRQQVAEPLRQEITPKNILMIGPTGVGKTEIAR---RLAKL 73
++G++ A + + ++ + T +++ G +G GK +AR K
Sbjct: 139 LVGRSAAMQEI------YRVLARLMQ------TDLTLMITGESGTGKELVARALHDYGKR 186

Query: 74 ADAPFIKI 81
+ PF+ I
Sbjct: 187 RNGPFVAI 194


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
BMA10247_3434  HTHFIS  88     9e-23    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 87.6 bits (217), Expect = 9e-23
Identities = 30/127 (23%), Positives = 60/127 (47%)

Query: 1 MSDKNFLVIDDNEVFAGTLARGLERRGYAVRQAHNKDEALKLAGAEKFEFITVDLHLGND 60
M+ LV DD+ L + L R GY VR N + A + + D+ + ++
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 SGLSLIAPLCDLQPDARILVLTGYASIATAVQAVKDGADNYLAKPANVESILAALQTNAS 120
+ L+ + +PD +LV++ + TA++A + GA +YL KP ++ ++ + +
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 121 EVQAEEA 127
E + +
Sbjct: 121 EPKRRPS 127



Score = 45.2 bits (107), Expect = 4e-08
Identities = 16/101 (15%), Positives = 32/101 (31%), Gaps = 3/101 (2%)

Query: 75 DARILVLTGYASIATAVQAVKDGADNYLAKPANVESILAALQTNASEVQAEEALENPVVL 134
I+ + I + L+ VE + + + L
Sbjct: 375 TREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGL---YDR 431

Query: 135 SVDRLEWEHIQRVLAENNNNISATARALNMHRRTLQRKLAK 175
+ +E+ I L N A L ++R TL++K+ +
Sbjct: 432 VLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRE 472


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
BMA10247_3438  CARBMTKINASE  44     5e-07    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 43.7 bits (103), Expect = 5e-07
Identities = 27/99 (27%), Positives = 48/99 (48%), Gaps = 6/99 (6%)

Query: 180 IPVISPIGFGEDGLSYNINADLVAGKLATVLNAEKLVMMTNIPGVMDKEG----NLLTDL 235
+PVI G G+ I+ DL KLA +NA+ +++T++ G G L ++
Sbjct: 197 VPVILEDG-EIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVNGAALYYGTEKEQWLREV 255

Query: 236 SAREIDALFEDGT-ISGGMLPKISSALDAAKSGVKSVHI 273
E+ +E+G +G M PK+ +A+ + G + I
Sbjct: 256 KVEELRKYYEEGHFKAGSMGPKVLAAIRFIEWGGERAII 294



Score = 36.7 bits (85), Expect = 8e-05
Identities = 21/60 (35%), Positives = 27/60 (45%), Gaps = 10/60 (16%)

Query: 31 GKTVVIKYGGNAMTEERLKQGF----------ARDVILLKLVGINPVIVHGGGPQIDQAL 80
GK VVI GGNA+ + K + AR + + G VI HG GPQ+ L
Sbjct: 2 GKRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSLL 61


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
BMA10247_3440  HTHTETR  58     2e-12    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 58.1 bits (140), Expect = 2e-12
Identities = 31/183 (16%), Positives = 62/183 (33%), Gaps = 15/183 (8%)

Query: 24 ASRTRPKPGERRVHILQTLASMLESPKSEKITTAALAARLDVSEAALYRHFSSKAQMFEG 83
A +T+ + E R HIL + + +A V+ A+Y HF K+ +F
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 84 LIEFIEETFFGLVNQIAANEPNGVLQA-RSIALMLLNFSAKNPGMTRVLTGEALVGEHER 142
+ E E L + A P L R I + +L + ++ + H+
Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLME----IIFHKC 117

Query: 143 LAERVNQMLERVEASIKQCLR---VALLEAQAHAAGGAPPPVPLPDDYDPALRASLVISY 199
++++ + ++ L+ A P + A ++ Y
Sbjct: 118 EFVGEMAVVQQAQRNLCLESYDRIEQTLKH-CIEAKMLPADL------MTRRAAIIMRGY 170

Query: 200 VLG 202
+ G
Sbjct: 171 ISG 173



 
Contact Sachin Pundhir for Bugs/Comments.