PredictBias

Identification of genomic and pathogenicity islands in prokaryotic genomes
 
A) Input parameters
Genome: 437.gbk
Threshold dinucleotide bias: 2
Threshold codon bias: 4
Threshold %GC bias: 3
E-value (RPSBlast): 0.05
Genome (non-pathogenic):
 
 
C) Potential GIs and PAIs in NC_008390
S.No  Start      End        Bias  Virulence  Insertion elements  Prediction
1     Bamb_0001  Bamb_0009  Y     N          N                   Genomic Island
LocusTag   DNBias  CDNBias  %GCBias     Product
Bamb_0001  3       30       -7.400031   chromosomal replication initiation protein
Bamb_0002  4       33       -7.166884   DNA polymerase III subunit beta
Bamb_0003  4       36       -7.089174   DNA gyrase subunit B
Bamb_0004  5       44       -7.100722   GTPase subunit of restriction endonuclease-like
Bamb_0005  3       40       -4.976367   hypothetical protein
Bamb_0006  1       31       -1.524002   cytochrome B561
Bamb_0007  -1      24       -1.087371   catalase domain-containing protein
Bamb_0008  -1      23       -2.343105   ECF subfamily RNA polymerase sigma-24 factor
Bamb_0009  1       29       -3.293947   transmembrane anti-sigma factor
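In the raw export of this report, each ORF row runs its fields together with no delimiter, for example `Bamb_0001330-7.400031chromosomal replication initiation protein`. The Python sketch below is one hedged way to split such rows. It assumes, from the values visible in this report rather than from any PredictBias documentation, that the locus tag ends in four digits, that DNBias is a single digit with an optional minus sign, that %GCBias always carries six decimals, and that a positive %GCBias has a single-digit integer part. `OrfRow` and `parse_orf_row` are hypothetical names.

```python
import re
from typing import NamedTuple


class OrfRow(NamedTuple):
    locus_tag: str
    dn_bias: int
    cdn_bias: int
    gc_bias: float
    marker: str  # run of '*' printed before some product names
    product: str


# Assumed layout (inferred from this report only):
#   locus tag ending in 4 digits | DNBias: optional '-' + 1 digit |
#   CDNBias: digits | %GCBias: 6 decimals | optional '*' run | product
ROW_RE = re.compile(
    r"^(?P<locus>[A-Za-z]+_\d{4})"
    r"(?P<dn>-?\d)"
    r"(?P<cdn>\d+?)"
    r"(?P<gc>-\d+\.\d{6}|\d\.\d{6})"
    r"(?P<marker>\**)"
    r"(?P<product>.*)$"
)


def parse_orf_row(line: str) -> OrfRow:
    """Split one concatenated ORF row into typed fields."""
    m = ROW_RE.match(line.strip())
    if m is None:
        raise ValueError(f"unrecognised ORF row: {line!r}")
    return OrfRow(
        locus_tag=m["locus"],
        dn_bias=int(m["dn"]),
        cdn_bias=int(m["cdn"]),
        gc_bias=float(m["gc"]),
        marker=m["marker"],
        product=m["product"].strip(),
    )


row = parse_orf_row(
    "Bamb_0001330-7.400031chromosomal replication initiation protein"
)
print(row.locus_tag, row.dn_bias, row.cdn_bias, row.gc_bias, row.product)
```

The lazy `\d+?` for CDNBias lets the regex hand as many digits as needed to the %GCBias field, which is what resolves rows such as `Bamb_0040-194.510345...` into -1, 9 and 4.510345 under the stated assumptions.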
2     Bamb_0027  Bamb_0043  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Bamb_0027  2       13       0.425162    flagellar biosynthetic protein FliR
Bamb_0028  2       12       0.010912    flagellar biosynthesis protein FliQ
Bamb_0029  1       13       -0.126569   flagellar biosynthesis protein FliP
Bamb_0030  0       13       1.137321    flagellar biosynthesis protein, FliO
Bamb_0031  2       14       0.488108    flagellar motor switch protein FliN
Bamb_0032  1       14       0.358797    flagellar motor switch protein FliM
Bamb_0033  2       13       1.309928    flagellar basal body-associated protein FliL
Bamb_0034  2       13       1.585244    LrgB family protein
Bamb_0035  0       11       2.906016    LrgA family protein
Bamb_0036  0       12       2.771945    LysR family transcriptional regulator
Bamb_0037  -1      11       2.790368    EmrB/QacA family drug resistance transporter
Bamb_0038  -1      10       4.526439    MarR family transcriptional regulator
Bamb_0039  -1      10       4.568151    hypothetical protein
Bamb_0040  -1      9        4.510345    RND efflux system outer membrane lipoprotein
Bamb_0041  0       10       4.010528    hypothetical protein
Bamb_0042  0       8        4.000211    general secretion pathway M protein
Bamb_0043  0       9        3.382791    general secretion pathway protein L
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0027  TYPE3IMRPROT  156    6e-49    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.

Length = 261

Score = 156 bits (396), Expect = 6e-49
Identities = 119/256 (46%), Positives = 168/256 (65%), Gaps = 1/256 (0%)

Query: 1 MFSVTYAQLNGWLTAFLWPFVRMLALVATAPVVGHAAVPVRVKIGIAAFMALVVAPTLGA 60
M VT Q WL + WP +R+LAL++TAP++ +VP RVK+G+A + +AP+L A
Sbjct: 1 MLQVTSEQWLSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA 60

Query: 61 MPDVTVFSAPGIWIVVTQFLIGIALGFTMQLVFAAVEAAGDFIGLSMGLGFATFFDPHSN 120
DV VFS +W+ V Q LIGIALGFTMQ FAAV AG+ IGL MGL FATF DP S+
Sbjct: 61 -NDVPVFSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASH 119

Query: 121 GATPVMGRFLNAIAMLAFLAVDGHLQVFAALAASFQTLPVSGDLLHAPGWRTLAAFGATV 180
PV+ R ++ +A+L FL +GHL + + L +F TLP+ G+ L++ + L G+ +
Sbjct: 120 LNMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLI 179

Query: 181 FEMGLLLALPVVAALLIANLALGILNRAAPQIGIFQIGFPVTMLVGLLLVQLMIPNLVPF 240
F GL+LALP++ LL NLALG+LNR APQ+ IF IGFP+T+ VG+ L+ ++P + PF
Sbjct: 180 FLNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPF 239

Query: 241 VSHLFDMGLDAMGRVL 256
HLF + + ++
Sbjct: 240 CEHLFSEIFNLLADII 255
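
Each alignment header in this report carries an `Identities = ..., Positives = ..., Gaps = ...` line. As a minimal Python sketch (the name `parse_alignment_stats` is hypothetical), the counts can be pulled out with a regular expression and the percentages recomputed. Integer division is used because it reproduces the rounding-down visible in the report, where 168/256 is printed as 65%. The `Gaps` field is treated as optional because some alignments in this report omit it.

```python
import re

# Matches the statistics line printed above each alignment in this report.
STATS_RE = re.compile(
    r"Identities = (?P<ident>\d+)/(?P<length>\d+) \(\d+%\), "
    r"Positives = (?P<pos>\d+)/\d+ \(\d+%\)"
    r"(?:, Gaps = (?P<gaps>\d+)/\d+ \(\d+%\))?"
)


def parse_alignment_stats(line):
    """Return (alignment length, raw counts, floored percentages)."""
    m = STATS_RE.search(line)
    if m is None:
        raise ValueError(f"not an alignment statistics line: {line!r}")
    length = int(m["length"])
    counts = {
        "identities": int(m["ident"]),
        "positives": int(m["pos"]),
        "gaps": int(m["gaps"] or 0),  # absent field counts as zero gaps
    }
    percents = {key: 100 * value // length for key, value in counts.items()}
    return length, counts, percents


length, counts, percents = parse_alignment_stats(
    "Identities = 119/256 (46%), Positives = 168/256 (65%), Gaps = 1/256 (0%)"
)
print(length, counts, percents)
```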


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0028  TYPE3IMQPROT  66     4e-18    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 65.9 bits (161), Expect = 4e-18
Identities = 28/85 (32%), Positives = 44/85 (51%)

Query: 4 EQVMTLAHQAMMVGLLLAAPLLLVALAVGLVVSLFQAATQINESTLSFIPKLLAVAATLV 63
+ ++ ++A+ + L+L+ +VA +GL+V LFQ TQ+ E TL F KLL V L
Sbjct: 2 DDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLF 61

Query: 64 IAGPWMLTTMLDYLRQTLLHVATLG 88
+ W +L Y RQ + G
Sbjct: 62 LLSGWYGEVLLSYGRQVIFLALAKG 86


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0029  FLGBIOSNFLIP  289    e-101    Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature.

Length = 245

Score = 289 bits (742), Expect = e-101
Identities = 153/247 (61%), Positives = 196/247 (79%), Gaps = 4/247 (1%)

Query: 6 LRRAARFAPALILGLAPALACAQAAGLPAFNTSPGPNGGTTYSLSVQTMLLLTMLSFLPA 65
+RR AP L+ + P A LP + P P GG ++SL VQT++ +T L+F+PA
Sbjct: 1 MRRLLSVAPVLLWLITPLAF----AQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPA 56

Query: 66 MLLMMTSFTRIIIVLSLLRQALGTATTPPNQVLVGLAMFLTFFVMSPVLDRAYADGYKPF 125
+LLMMTSFTRIIIV LLR ALGT + PPNQVL+GLA+FLTFF+MSPV+D+ Y D Y+PF
Sbjct: 57 ILLMMTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPF 116

Query: 126 SDGSMPMEQAVRRGVAPFKTFMLKQTRETDLALFAKISKAAPMQGPEDVPLSLLVPAFVT 185
S+ + M++A+ +G P + FML+QTRE DL LFA+++ P+QGPE VP+ +L+PA+VT
Sbjct: 117 SEEKISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVT 176

Query: 186 SELKTGFQIGFTIFIPFLIIDMVVASVLMSMGMMMVSPSTVSLPFKLMLFVLVDGWQLLI 245
SELKT FQIGFTIFIPFLIID+V+ASVLM++GMMMV P+T++LPFKLMLFVLVDGWQLL+
Sbjct: 177 SELKTAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLV 236

Query: 246 GSLAQSF 252
GSLAQSF
Sbjct: 237 GSLAQSF 243
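
The `Expect` value for this hit is printed as `e-101`, without a leading mantissa, which Python's `float()` rejects. The hedged sketch below normalises such values and applies the E-value threshold from the input parameters (0.05). Treating the cutoff as inclusive is an assumption, `parse_evalue` is a hypothetical name, and the last entry in `hits` is a made-up value included only to show a rejected hit.

```python
def parse_evalue(text: str) -> float:
    """Convert a BLAST-style E-value string to float, accepting 'e-101'."""
    text = text.strip()
    if text.lower().startswith("e"):
        text = "1" + text  # 'e-101' is shorthand for '1e-101'
    return float(text)


E_VALUE_CUTOFF = 0.05  # "E-value (RPSBlast)" from the input parameters

# (locus tag, hit, E-value string); the first three are taken from this report.
hits = [
    ("Bamb_0029", "FLGBIOSNFLIP", "e-101"),
    ("Bamb_0094", "FLGMOTORFLIN", "0.035"),
    ("Bamb_0905", "TCRTETOQM", "0.040"),
    ("hypothetical_orf", "HYPOTHETICAL_HIT", "0.2"),  # illustrative only
]

# Assumption: a hit is kept when its E-value is at or below the cutoff.
kept = [
    (locus, hit)
    for locus, hit, evalue in hits
    if parse_evalue(evalue) <= E_VALUE_CUTOFF
]
print(kept)
```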


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0031  FLGMOTORFLIN  133    7e-43    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 133 bits (335), Expect = 7e-43
Identities = 75/132 (56%), Positives = 98/132 (74%), Gaps = 3/132 (2%)

Query: 31 AAQEDQGLDD-WAAALAEQNLQPVQAGATGAGVFQPLSKAAASSTHNDIEMILDIPVKMT 89
+ + LDD WA AL EQ ++ A VFQ L S DI++I+DIPVK+T
Sbjct: 8 SDENTGALDDLWADALNEQKATTTKSAADA--VFQQLGGGDVSGAMQDIDLIMDIPVKLT 65

Query: 90 VELGRTKIAIRNLLQLAQGSVVELDGMAGEPMDVLVNGCLIAQGEVVVVNDKFGIRLTDI 149
VELGRT++ I+ LL+L QGSVV LDG+AGEP+D+L+NG LIAQGEVVVV DK+G+R+TDI
Sbjct: 66 VELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGVRITDI 125

Query: 150 ITPAERIRKLNR 161
ITP+ER+R+L+R
Sbjct: 126 ITPSERMRRLSR 137


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0032  FLGMOTORFLIM  272    4e-92    Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 272 bits (697), Expect = 4e-92
Identities = 80/324 (24%), Positives = 158/324 (48%), Gaps = 10/324 (3%)

Query: 5 EFMSQEEVDALLKGV-TGETDTVDEQ--RDLSSVRPYNIATQERIVRGRMPGLEIINDRF 61
E +SQ+E+D LL + +G+ D + D + Y+ ++ + +M L ++++ F
Sbjct: 3 EVLSQDEIDQLLTAISSGDASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLMHETF 62

Query: 62 ARLLRIGIFNFMRRTAEISVSQVKVQKYSEFTRNLPIPTNLNLVHVKPLRGTSLFVFDPN 121
ARL + +R + V+ V Y EF R++P P+ L ++ + PL+G ++ DP+
Sbjct: 63 ARLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVLEVDPS 122

Query: 122 LVFFVVDNLFGGDGRFHTRVEGRDFTATEQRIIGKLLNLVFEHYATAWKSVRPLQFEFVR 181
+ F ++D LFGG G+ RD T E ++ ++ + + +W V L+ +
Sbjct: 123 ITFSIIDRLFGGTGQAAKVQ--RDLTDIENSVMEGVIVRILANVRESWTQVIDLRPRLGQ 180

Query: 182 SEMHTQFANVATPNEIVIVTQFSIEFGPTGGTLHICMPYSMIEPIRDVLSSPIQGEAL-- 239
E + QFA + P+E+V++ + G G ++ C+PY IEPI LSS ++
Sbjct: 181 IETNPQFAQIVPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFSSVRR 240

Query: 240 EVDRRWVRVLSQQVQSAEVELSANLAEIPSTFEKILNLRAGDVLPLE---IEDTITAKVD 296
+++ VL ++ + ++++ A + + + IL LR GD++ L + D +
Sbjct: 241 SSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFVLSIG 300

Query: 297 GVPVMECGYGIFNGQYALRVQKMI 320
C G+ + A ++ + I
Sbjct: 301 NRKKFLCQPGVVGKKIAAQILERI 324


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0037  TCRTETB       120    1e-31    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 120 bits (303), Expect = 1e-31
Identities = 84/398 (21%), Positives = 161/398 (40%), Gaps = 16/398 (4%)

Query: 30 LALGTFMEVLDTSIANVAVPTISGSLGVATSEGTWVISSYSVASAIAVPLTGWLARRVGE 89
L + +F VL+ + NV++P I+ + WV +++ + +I + G L+ ++G
Sbjct: 19 LCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGI 78

Query: 90 VRLFTLSVLAFTIASALCGLA-SNFETLIAFRLLQGLVSGPMVPLSQTILMRSYPPAKRG 148
RL ++ S + + S F LI R +QG + L ++ R P RG
Sbjct: 79 KRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRG 138

Query: 149 LALGLWAMTVIVAPIFGPLLGGWISDNYTWPWIFYINLPIGIFSAACAYFLLRGRETKTS 208
A GL V + GP +GG I+ W ++ I + I L ++
Sbjct: 139 KAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLL---KKEVRI 195

Query: 209 RQRIDAVGLALLVIGVSCLQMMLDLGKDRDWFNSTFIVALALIAVVSLAFMLVWEATEKE 268
+ D G+ L+ +G+ + F +++ ++ +++V+S + +
Sbjct: 196 KGHFDIKGIILMSVGIVFFML----------FTTSYSISFLIVSVLSFLIFVKHIRKVTD 245

Query: 269 PVVDLSLFKDRNFALGALIISFGFMAFFGSVVIFPLWLQTVMGYTAGKAGLATA-PVGLL 327
P VD L K+ F +G L F G V + P ++ V + + G P +
Sbjct: 246 PFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMS 305

Query: 328 ALVLSPLIGRNMHRLDLRMVASFAFIVFAGVSVWNSTFTLDVPFNHVILPRLVQGIGVAC 387
++ + G + R V + + F VS ++F L+ + + + G++
Sbjct: 306 VIIFGYIGGILVDRRGPLYVLNIG-VTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSF 364

Query: 388 FFVPMTTITLSSISDERLASASGLSNFLRTLSGAIGTA 425
++TI SS+ + + L NF LS G A
Sbjct: 365 TKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIA 402


3     Bamb_0084  Bamb_0095  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Bamb_0084  -2      16       -3.114112   tRNA uridine 5-carboxymethylaminomethyl
Bamb_0085  0       20       -3.096494   16S rRNA methyltransferase GidB
Bamb_0086  1       17       -3.335457   cobyrinic acid a,c-diamide synthase
Bamb_0087  2       19       -3.549242   parB-like partition proteins
Bamb_0088  4       22       -4.277061   citrate transporter
Bamb_0089  4       27       -5.248511   hypothetical protein
Bamb_0090  2       21       -4.442926   F0F1-type ATP synthase subunit I-like protein
Bamb_0091  1       21       -4.901006   F0F1 ATP synthase subunit A
Bamb_0092  2       26       -4.951067   F0F1 ATP synthase subunit C
Bamb_0093  3       23       -4.541689   F0F1 ATP synthase subunit B
Bamb_0094  0       17       -3.555560   F0F1 ATP synthase subunit delta
Bamb_0095  0       16       -3.458925   F0F1 ATP synthase subunit alpha
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0094  FLGMOTORFLIN  27     0.035    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 26.8 bits (59), Expect = 0.035
Identities = 25/85 (29%), Positives = 42/85 (49%), Gaps = 5/85 (5%)

Query: 5 ATIARPYAEALFRVAEGGDIAAWSTLVQELAQVARLPEVLSVASSPKVTRTQVVELLLAA 64
AT + A+A+F+ GGD+ S +Q++ + +P L+V TR + ELL
Sbjct: 28 ATTTKSAADAVFQQLGGGDV---SGAMQDIDLIMDIPVKLTVELGR--TRMTIKELLRLT 82

Query: 65 VKSPVAAGAEAKNFVQMLVDNHRIA 89
S VA A + +L++ + IA
Sbjct: 83 QGSVVALDGLAGEPLDILINGYLIA 107


4     Bamb_0202  Bamb_0209  Y     N          Y                   Genomic Island
LocusTag   DNBias  CDNBias  %GCBias     Product
Bamb_0202  3       23       -3.472371   hypothetical protein
Bamb_0203  4       29       -5.663588   phosphoheptose isomerase
Bamb_0204  5       32       -6.515528   transport-associated protein
Bamb_0205  6       36       -7.435703   class I cytochrome c
Bamb_0207  5       36       -6.919099   *Rhs element Vgr protein
Bamb_0208  4       34       -4.790700   hypothetical protein
Bamb_0209  3       35       -4.855599   hypothetical protein
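Across the eleven regions listed in this excerpt, the Prediction column follows directly from the Bias and Virulence flags, while the Insertion elements flag varies independently of the label. The Python sketch below encodes only the two combinations that actually appear here. It is an observation about this table, not a statement of PredictBias's internal rules, and `classify_island` is a hypothetical name.

```python
def classify_island(bias: bool, virulence: bool) -> str:
    """Reproduce the Prediction label from the Bias and Virulence flags."""
    if bias and virulence:
        return "Pathogenicity Island (biased-composition)"
    if bias:
        return "Genomic Island"
    # Regions without compositional bias do not occur in this report,
    # so no label is guessed for them.
    raise ValueError("flag combination does not occur in this report")


PAI = "Pathogenicity Island (biased-composition)"
GI = "Genomic Island"

# (S.No, Bias, Virulence, Insertion elements, Prediction) as listed in the report.
ROWS = [
    (1, "Y", "N", "N", GI),
    (2, "Y", "Y", "N", PAI),
    (3, "Y", "Y", "N", PAI),
    (4, "Y", "N", "Y", GI),
    (5, "Y", "Y", "Y", PAI),
    (6, "Y", "N", "Y", GI),
    (7, "Y", "Y", "N", PAI),
    (8, "Y", "Y", "Y", PAI),
    (9, "Y", "Y", "N", PAI),
    (10, "Y", "N", "N", GI),
    (11, "Y", "Y", "N", PAI),
]

# Collect any region whose listed label disagrees with the two-flag rule.
mismatches = [
    number
    for number, bias, virulence, _insertion, prediction in ROWS
    if classify_island(bias == "Y", virulence == "Y") != prediction
]
print(mismatches)
```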
5     Bamb_0356  Bamb_0382  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Bamb_0356  0       9        -3.697342   Sec-independent protein translocase subunit
Bamb_0357  0       10       -4.299068   peptidase S1 and S6, chymotrypsin/Hap
Bamb_0358  1       13       -5.095941   hypothetical protein
Bamb_0359  3       18       -6.757285   ubiquinol-cytochrome c reductase, iron-sulfur
Bamb_0360  3       18       -7.005155   cytochrome b/b6 domain-containing protein
Bamb_0361  4       20       -6.904794   cytochrome c1
Bamb_0362  2       18       -4.162802   glutathione S-transferase domain-containing
Bamb_0363  0       23       -4.101022   ClpXP protease specificity-enhancing factor
Bamb_0364  0       25       -4.485264   *hypothetical protein
Bamb_0365  -1      26       -4.608970   hypothetical protein
Bamb_0366  0       28       -5.321741   extracellular solute-binding protein
Bamb_0367  0       30       -5.526548   Rhs element Vgr protein
Bamb_0368  1       38       -8.052664   YD repeat-containing protein
Bamb_0369  7       72       -17.622880  hypothetical protein
Bamb_0370  6       66       -16.277524  hypothetical protein
Bamb_0372  4       45       -10.764371  hypothetical protein
Bamb_0373  -1      22       -5.704411   hypothetical protein
Bamb_0374  0       22       -5.391705   hypothetical protein
Bamb_0375  1       17       -3.512947   hypothetical protein
Bamb_0376  2       10       -2.502923   hypothetical protein
Bamb_0377  3       9        -2.776309   hypothetical protein
Bamb_0378  2       9        -3.139485   hypothetical protein
Bamb_0379  4       12       -3.969520   hypothetical protein
Bamb_0380  4       9        -3.018840   hypothetical protein
Bamb_0381  4       9        -2.525068   hypothetical protein
Bamb_0382  2       10       -1.086044   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0357  V8PROTEASE    68     7e-15    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 68.5 bits (167), Expect = 7e-15
Identities = 33/183 (18%), Positives = 63/183 (34%), Gaps = 38/183 (20%)

Query: 116 NLGSGVIVSPEGYILTNQHVVDGADQIEVALA------------DGRTATAKVIGSDPET 163
+ SGV+V +LTN+HVVD AL +G ++ E
Sbjct: 102 FIASGVVVGK-DTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGEG 160

Query: 164 DLAVLKIN--------MTNLPTITLGRSDQSRVGDVVLAIGNPFGVGQTVTMGIISALGR 215
DLA++K + + T+ + +++V + G P ++ +
Sbjct: 161 DLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGD-------KPVATMWE 213

Query: 216 NHLGINTFEN-FIQTDAPINPGNSGGALVDVNGNLLGINTAIYSRSGGSLGIGFAIPVST 274
+ I + +Q D GNSG + + ++GI+ G+ +
Sbjct: 214 SKGKITYLKGEAMQYDLSTTGGNSGSPVFNEKNEVIGIHWG---------GVPNEFNGAV 264

Query: 275 ART 277

Sbjct: 265 FIN 267


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0367  RTXTOXINA     39     1e-04    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 39.2 bits (91), Expect = 1e-04
Identities = 29/142 (20%), Positives = 56/142 (39%), Gaps = 25/142 (17%)

Query: 915 SPEAAQAAASGQLASQLQGMAPAATTA---AMGLASGGSAGAVLGGLASSALPAAASALG 971
+ +AAA +L +++ G + A A G S A GL +SA+ A S L
Sbjct: 263 ADTRTKAAAGVELTTKVLGNVGKGISQYIIAQRAAQGLSTSAAAAGLIASAVTLAISPLS 322

Query: 972 GAGVVSALRTASSLSGAAKQV-----------------AGMVQAARQG---GLAALAAPA 1011
+ + A+ + +++ G + A+ LA++++
Sbjct: 323 FLSIADKFKRANKIEEYSQRFKKLGYDGDSLLAAFHKETGAIDASLTTISTVLASVSSGI 382

Query: 1012 ANAASGALQGALPG--VAGISG 1031
+ AA+ +L GA V ++G
Sbjct: 383 SAAATTSLVGAPVSALVGAVTG 404


6     Bamb_0667  Bamb_0675  Y     N          Y                   Genomic Island
LocusTag   DNBias  CDNBias  %GCBias     Product
Bamb_0667  4       15       1.882590    AraC family transcriptional regulator
Bamb_0668  4       17       0.904256    YiaAB two helix domain-containing protein
Bamb_0669  5       20       1.039408    glycosyl transferase family protein
Bamb_0670  4       21       0.328116    hypothetical protein
Bamb_0671  4       20       -0.261214   hypothetical protein
Bamb_0672  2       16       -1.130217   hypothetical protein
Bamb_0673  3       12       -1.020522   PA-phosphatase-like phosphoesterase
Bamb_0674  2       10       -1.127819   fatty acid desaturase
Bamb_0675  2       10       -1.413918   *hypothetical protein
7     Bamb_0715  Bamb_0764  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Bamb_0715  0       13       3.081864    LysR family transcriptional regulator
Bamb_0716  0       13       3.545667    DSBA oxidoreductase
Bamb_0717  -1      13       3.533458    TRAP-type transport system periplasmic
Bamb_0718  0       12       3.830636    transport-associated protein
Bamb_0719  -1      12       3.584675    hypothetical protein
Bamb_0720  0       13       3.676492    hypothetical protein
Bamb_0721  -1      11       3.195319    DSBA oxidoreductase
Bamb_0722  0       10       2.914395    YheO domain-containing protein
Bamb_0723  2       10       3.939275    ornithine cyclodeaminase/mu-crystallin
Bamb_0724  1       11       3.257627    extracellular solute-binding protein
Bamb_0725  3       12       3.678301    D-amino-acid dehydrogenase
Bamb_0726  3       14       3.187619    hypothetical protein
Bamb_0727  2       12       2.689503    ECF subfamily RNA polymerase sigma-24 factor
Bamb_0728  -2      18       -0.252803   transmembrane anti-sigma factor
Bamb_0729  -1      19       -0.812202   hypothetical protein
Bamb_0730  0       22       -1.607009   hypothetical protein
Bamb_0731  -1      17       0.567879    hypothetical protein
Bamb_0732  1       18       -0.201312   co-chaperonin GroES
Bamb_0733  0       16       0.213050    chaperonin GroEL
Bamb_0734  -1      15       3.096637    phosphomethylpyrimidine kinase type-1
Bamb_0735  -2      13       2.335692    rubredoxin-type Fe(Cys)4 protein
Bamb_0736  -2      12       3.305271    hypothetical protein
Bamb_0737  -1      12       1.900230    hypothetical protein
Bamb_0738  -2      11       2.381472    Holliday junction resolvase-like protein
Bamb_0739  -2      8        0.659149    bifunctional pyrimidine regulatory protein
Bamb_0740  -1      9        -0.879398   aspartate carbamoyltransferase catalytic
Bamb_0741  -1      14       -1.624207   dihydroorotase
Bamb_0742  2       26       -5.699218   phospholipid/glycerol acyltransferase
Bamb_0743  3       38       -9.071891   diadenosine tetraphosphatase
Bamb_0744  4       49       -12.097027  ABC transporter
Bamb_0745  4       54       -12.515855  ABC transporter-like protein
Bamb_0746  3       52       -12.048005  hypothetical protein
Bamb_0747  3       47       -11.329265  NAD-dependent epimerase/dehydratase
Bamb_0748  2       46       -10.818267  FkbM family methyltransferase
Bamb_0749  3       42       -9.376158   hypothetical protein
Bamb_0750  1       30       -6.776703   hypothetical protein
Bamb_0751  0       25       -5.671661   dTDP-glucose 4,6-dehydratase
Bamb_0752  2       32       -5.782787   glucose-1-phosphate thymidylyltransferase
Bamb_0753  3       34       -6.204603   dTDP-4-dehydrorhamnose 3,5-epimerase
Bamb_0754  2       32       -6.245109   dTDP-4-dehydrorhamnose reductase
Bamb_0755  1       31       -6.277883   mannose-1-phosphate
Bamb_0756  1       31       -6.130340   hypothetical protein
Bamb_0757  0       31       -5.782344   type 11 methyltransferase
Bamb_0758  0       29       -4.915549   group 1 glycosyl transferase
Bamb_0759  0       26       -3.330056   GDP-mannose 4,6-dehydratase
Bamb_0760  1       25       -2.899013   NAD-dependent epimerase/dehydratase
Bamb_0761  3       27       -3.570007   group 1 glycosyl transferase
Bamb_0762  2       24       -3.725706   group 1 glycosyl transferase
Bamb_0763  2       19       -3.250479   NAD-dependent epimerase/dehydratase
Bamb_0764  1       17       -3.083316   glycosyl transferase family protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0741  UREASE        32     0.005    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 32.0 bits (73), Expect = 0.005
Identities = 22/84 (26%), Positives = 34/84 (40%), Gaps = 18/84 (21%)

Query: 19 RQADVFVADGKIAALAPAG--------TAPAGFNAEKTIDASGLIVAPGLVDLCARLREP 70
+AD+ + DG+IAA+ AG T G E I G IV G +D P
Sbjct: 84 VKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTE-VIAGEGKIVTAGGMDSHIHFICP 142

Query: 71 GYEHKATLASEMAAAVAGGVTTLV 94
++ A+ G+T ++
Sbjct: 143 ---------QQIEEALMSGLTCML 157


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0747  NUCEPIMERASE  98     2e-25    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 97.5 bits (243), Expect = 2e-25
Identities = 70/343 (20%), Positives = 120/343 (34%), Gaps = 63/343 (18%)

Query: 7 SVLITGAGGVIGHALKQELADSGYSNVVAITSSD------------------------ID 42
L+TGA G IG + + L ++G+ VV I + + ID
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGH-QVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 43 LRDQSATEKMFDELRPTIVFHMAARVYGIMGNMSNRGIAYLD-NVRINTNVVEAARQTGC 101
L D+ +F VF R+ + ++ N AY D N+ N++E R
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRL-AVRYSLEN-PHAYADSNLTGFLNILEGCRHNKI 118

Query: 102 KKFVAMGSTAIYSDQVRLPMSEEQIWVGAPHHSEAPYAHSKRGMLAQLEAYKDQYGMDYA 161
+ + S+++Y ++P S + H + YA +K+ Y YG+
Sbjct: 119 QHLLYASSSSVYGLNRKMPFSTDDS----VDHPVSLYAATKKANELMAHTYSHLYGLPAT 174

Query: 162 FCVSTNLFGPHDKFDEKFGHVIPSLVSKFYRASVLGQPISVWGSGKAERDFLFSGDAAYA 221
++GP + D KF +A + G+ I V+ GK +RDF + D A A
Sbjct: 175 GLRFFTVYGPWGRPDMAL--------FKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEA 226

Query: 222 LRLIAENHTGA--------------------INLATGQSHTIRHTVDTLCQISGFSGSVE 261
+ + + A N+ + + L G
Sbjct: 227 IIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKN 286

Query: 262 WDATKPDGQKLRAY-DISRL-TALGFKPRFSFDEALAITYDWY 302
+P G L D L +GF P + + + +WY
Sbjct: 287 MLPLQP-GDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWY 328


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0748  RTXTOXIND     46     1e-07    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 46.4 bits (110), Expect = 1e-07
Identities = 20/159 (12%), Positives = 66/159 (41%), Gaps = 10/159 (6%)

Query: 223 FDGFIRASESDMIRRAMDLEQQLIESRKQLQEAHEGWSLEKGAREELEVRLNSMTDRAHD 282
F SE +++R +++Q + Q + + ++ ++ R +
Sbjct: 173 EPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQK-------ELNLDKKRAERLTVLARINR 225

Query: 283 EQASQVVLKDSSQDC--VVQNSKISQNEIEQLKARVAASEGDLESHRAQLAELRTRLTES 340
+ V K D ++ I+++ + + + + + +L +++QL ++ + + +
Sbjct: 226 YENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSA 285

Query: 341 E-KSAQVLSTERDTAYQELFESSRHAAWLSQERVRLQER 378
+ + V ++ +L +++ + L+ E + +ER
Sbjct: 286 KEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEER 324


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0751  NUCEPIMERASE  176    9e-55    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 176 bits (449), Expect = 9e-55
Identities = 90/350 (25%), Positives = 135/350 (38%), Gaps = 43/350 (12%)

Query: 2 ILVTGGAGFIGANFVLDWLRHDVEPVLNVDKLT--YAGNLRTL-QSLSGNPKHVFARVDI 58
LVTG AGFIG + L V+ +D L Y +L+ L P F ++D+
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAG-HQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 59 CDRAALDALFAEHKPRAVAHFAAESHVDRSIHGPADFVQTNVVGTFTLLEAARSYWSGLN 118
DR + LFA V V S+ P + +N+ G +LE R
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQ-- 119

Query: 119 DADKAGFRFLHVSTDEVFGSLSPTDPQFSETTPYA-PNSPYSATKAGSDHLVRAYHHTYG 177
L+ S+ V+G L+ P FS P S Y+ATK ++ + Y H YG
Sbjct: 120 -------HLLYASSSSVYG-LNRKMP-FSTDDSVDHPVSLYAATKKANELMAHTYSHLYG 170

Query: 178 LPTLTTNCSNNYGPYQFPEKLIPLMIANALAGKPLPIYGDGQNVRDWLYVGDHCSAIREV 237
LP YGP+ P+ + L GK + +Y G+ RD+ Y+ D AI +
Sbjct: 171 LPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRL 230

Query: 238 L------------------ARGVPGETYNVGGWNEKKNLDVVHTLCDLLDSASPKAAGSY 279
A P YN+G + + +D + L D L +A +
Sbjct: 231 QDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALG---IEAKKNM 287

Query: 280 RDQITYVTDRPGHDRRYAIDARKLERELGWKPAETFETGLAKTVRWYLDN 329
+PG + D + L +G+ P T + G+ V WY D
Sbjct: 288 LPL------QPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDF 331


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0754  NUCEPIMERASE  49     7e-09    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 49.0 bits (117), Expect = 7e-09
Identities = 48/212 (22%), Positives = 72/212 (33%), Gaps = 61/212 (28%)

Query: 10 TILVTGVTGQIGFELLRALQGLG-RVVPCD--------------RSVL----------DL 44
LVTG G IGF + + L G +VV D +L DL
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 45 ADLDRVRAFARDLKPALIVNPAAYTAVDTAESEVELARRLNVDVPRVFAE---------- 94
AD + + + AV R +++ P +A+
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAV-----------RYSLENPHAYADSNLTGFLNIL 110

Query: 95 EAARSGG--TLIHYSTDYVF-DGTKVGAYVETDAPNPLNAYGATKLEGEQAIAAT----- 146
E R L++ S+ V+ K+ + +P++ Y ATK E +A T
Sbjct: 111 EGCRHNKIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANEL-MAHTYSHLY 169

Query: 147 GCAHVILRTSWVYGRRGR------NFLRTMLK 172
G LR VYG GR F + ML+
Sbjct: 170 GLPATGLRFFTVYGPWGRPDMALFKFTKAMLE 201


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0757  RTXTOXIND     33     0.003    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 32.9 bits (75), Expect = 0.003
Identities = 25/159 (15%), Positives = 49/159 (30%), Gaps = 23/159 (14%)

Query: 175 FARVKQLRLQEPPALHEAGARIRLLDVLGGVSPDYAIVAQKSCSEAEAGLFDEAFHEDYG 234
AR++Q R Q L + +L ++ P + V SE E E +
Sbjct: 145 QARLEQTRYQ---ILSRSIELNKLPELKLPDEPYFQNV-----SEEEVLRLTSLIKEQFS 196

Query: 235 LSLAELAARFDAGAARQHEHAAEQIQRTQAEVRRLHGELGVIHEELTR----------TR 284
+ + ++ E A + R V L +
Sbjct: 197 TWQNQKYQKELNLDKKRAE-----RLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAK 251

Query: 285 SELTQAVDELGQVRRDLGRVRSDLGQVNGELKRVHDEAQ 323
+ + ++ + +L +S L Q+ E+ +E Q
Sbjct: 252 HAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQ 290


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0759  NUCEPIMERASE  97     4e-25    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 97.2 bits (242), Expect = 4e-25
Identities = 65/347 (18%), Positives = 121/347 (34%), Gaps = 57/347 (16%)

Query: 7 IITGITGQDGAYLAELLLDKGYTVYG-----TYRRTSSVNFWRIEELGIAKHPNLHLVEY 61
++TG G G ++++ LL+ G+ V G Y S + L + P +
Sbjct: 4 LVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVS----LKQARLELLAQPGFQFHKI 59

Query: 62 DLTDLSASIRLLQTTGATEVYNLAAQSFVGVSFDQPVTTAEITGVGPLNLLEAIRIVNPK 121
DL D L + V+ + V S + P A+ G LN+LE R +
Sbjct: 60 DLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQ 119

Query: 122 IRFYQASTSEMFGKVQAIPQIESTPF-YPRSPYGVAKLYAHWITVNYRESYDIFGCSGIL 180
AS+S ++G + +P +P S Y K + Y Y +
Sbjct: 120 -HLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRF 178

Query: 181 FNHESPLRGR-EFVTRKITDSVAKIKLGQLDVLELGNMDAKRDWGFAKEYVEGMWRMLQA 239
F P GR + K T ++ + K +DV G M KRD+ + + E + R+
Sbjct: 179 FTVYGP-WGRPDMALFKFTKAMLEGK--SIDVYNYGKM--KRDFTYIDDIAEAIIRLQDV 233

Query: 240 DEPDT-------------------FVLATNRTETVRDFVRMAFKAAGVDLEFKGSDEQEI 280
+ + + + D+++ A G++ + Q
Sbjct: 234 IPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQ-- 291

Query: 281 AVDVATGKTLVRVNPKFHRPAEVDLLIGNPEKAKQKLGWEPKTTLEE 327
P +V + + + +G+ P+TT+++
Sbjct: 292 -------------------PGDVLETSADTKALYEVIGFTPETTVKD 319


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0760  NUCEPIMERASE  122    2e-34    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 122 bits (307), Expect = 2e-34
Identities = 75/313 (23%), Positives = 125/313 (39%), Gaps = 42/313 (13%)

Query: 11 RALVTGLGGFTGDYLAESLRAAGYRVFGTTHAADTIEPD---------------TYRVDL 55
+ LVTG GF G ++++ L AG++V G + D + +++DL
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 56 CDRPTLADVVAEVQPDIVAHLAAVSFV--AHGDADAIYRTNVVGTRNLLEALANLENRPR 113
DR + D+ A + V V + + A +N+ G N+LE N+ +
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCR--HNKIQ 119

Query: 114 AVLLASSANIYG-NAAVEIIDESIEPNPANDYAVSKLAMEYMARLWRD--KLPIVIARPF 170
+L ASS+++YG N + + +P + YA +K A E MA + LP R F
Sbjct: 120 HLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFF 179

Query: 171 NYTGVGQSPQFLLPKIVGHFQRGERVIELGNIDVERDFSDVRRVADAYRRLLELSPAGG- 229
G P L K G+ + ++RDF+ + +A+A RL ++ P
Sbjct: 180 TVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHADT 239

Query: 230 -----------------VFNVCSGRAVSLKAVISMMERIAGYAIEVRVNPAFVRANDVRR 272
V+N+ + V L I +E G IE + N ++ DV
Sbjct: 240 QWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALG--IEAKKNMLPLQPGDVLE 297

Query: 273 LQGNDARLQAAIG 285
+ L IG
Sbjct: 298 TSADTKALYEVIG 310


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0763  NUCEPIMERASE  114    2e-31    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 114 bits (286), Expect = 2e-31
Identities = 78/341 (22%), Positives = 130/341 (38%), Gaps = 37/341 (10%)

Query: 1 MRLVITGANGFVGRAVCRRALDAGHTVTAL----------VRRPGACIDGVREWVHGSAD 50
M+ ++TGA GF+G V +R L+AGH V + +++ + + D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 51 WEGLDAAWPADLVA----DCVIHLAARVHVMRDDSPDPDAAFDATNVAGTLRLAEAARKY 106
D DL A + V R+ V R +P A D +N+ G L + E R
Sbjct: 61 LA--DREGMTDLFASGHFERVFISPHRLAV-RYSLENPHAYAD-SNLTGFLNILEGCRHN 116

Query: 107 GVRRIVYASSIKAVGESDSGAPLSESWPAD-PQDAYGRSKLRAEQQLARFGTSAGLDVVI 165
++ ++YASS G + P S D P Y +K E + GL
Sbjct: 117 KIQHLLYASSSSVYGLNRK-MPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATG 175

Query: 166 VRPPLVYGPHVTAN--FLRMMDAVARGMPLPL-GSISARRSIVYVDNLADALLQCATDPR 222
+R VYGP + + A+ G + + +R Y+D++A+A+++
Sbjct: 176 LRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIP 235

Query: 223 AAGECFHVADDDAPSVTGLLRLVGDALGKPARLLPVPTAALRALGKLTGRSATIDRLTGS 282
A + V + R+ P L+ ++AL G A +
Sbjct: 236 HADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDY----IQALEDALGIEA--KKNMLP 289

Query: 283 LQL--------DTGRIKRVLGWQPPYTTRQGLEATAAWYRS 315
LQ DT + V+G+ P T + G++ WYR
Sbjct: 290 LQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRD 330


8     Bamb_0860  Bamb_0871  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Bamb_0860  -2      9        -3.022768   DNA-directed RNA polymerase subunit omega
Bamb_0861  -2      8        -2.006240   (p)ppGpp synthetase I SpoT/RelA
Bamb_0862  2       15       -0.968934   ***transcription elongation factor GreB
Bamb_0863  2       16       -1.047929   porin
Bamb_0864  3       22       -0.218898   hypothetical protein
Bamb_0865  4       13       0.620005    cold-shock DNA-binding domain-containing
Bamb_0866  1       11       1.781256    DNA polymerase III subunit epsilon
Bamb_0867  0       12       1.811031    chorismate mutase
Bamb_0868  2       11       1.344347    hypothetical protein
Bamb_0869  2       11       1.435563    hypothetical protein
Bamb_0870  2       9        1.286104    TonB-dependent receptor
Bamb_0871  5       9        1.224537    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0863  ECOLNEIPORIN  124    7e-35    E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 124 bits (314), Expect = 7e-35
Identities = 92/388 (23%), Positives = 142/388 (36%), Gaps = 65/388 (16%)

Query: 1 MKKTLIVAALAGVAASAAHAQSSVTLYGLIDAGITYTNNQHGHSAW-----QETSGSING 55
MKK+LI LA + +A + VTLYG I AG+ + + + A T G
Sbjct: 1 MKKSLIALTLAALPVAAM---ADVTLYGTIKAGVETSRSVAHNGAQAASVETGTGIVDLG 57

Query: 56 SRWGLRGTEDLGGGLKAIFTLENGFGINNGSLKQNGREFGRQAFVGLAHNSFGSLTLGRQ 115
S+ G +G EDLG GLKAI+ +E I + RQ+F+GL FG L +GR
Sbjct: 58 SKIGFKGQEDLGNGLKAIWQVEQKASIAGT----DSGWGNRQSFIGLKGG-FGKLRVGRL 112

Query: 116 YDSVVDYLG--PLSLTGTQYGGTQFAHPFDNDNLNNSFRINNSVKYQSANYGGLKFGGLY 173
+ D P G + A P + I SV+Y S + GL Y
Sbjct: 113 NSVLKDTGDINPWDSKSDYLGVNKIAEP-------EARLI--SVRYDSPEFAGLSGSVQY 163

Query: 174 GFSNSTGFANNRAYSVGASYSYMGFNVAAAYLQLNNNINALALAASDPGAVAGDWTFAAS 233
+++ G N+ +Y G +Y GF V ++
Sbjct: 164 ALNDNAGRHNSESYHAGFNYKNGGFFVQYGGAYKRHH--------------QVQENVNIE 209

Query: 234 RQRTWGAGLNYTFGPATAGFVFTQTRLTNSAGISAGQSGVS-TGIPLTGGTRFNNYEVNG 292
+ + Y A + + ++ + S S T + T RF N
Sbjct: 210 KYQIHRLVSGYD---NDALYASVAVQQQDAKLVEENYSHNSQTEVAATLAYRFGNV---- 262

Query: 293 RYALTPAFSLAGSYTYTDSRLDGQTPSWHQFNLQADYALSKRTDLYLQSEYQRVNTNGLA 352
++ A GS+ T+ + Q + A+Y SKRT + + + +
Sbjct: 263 TPRVSYAHGFKGSFDATNY-----NNDYDQVVVGAEYDFSKRTSALVSAGWLQEG----- 312

Query: 353 IGANINGLGAASSTNKQIAVTAGMRHRF 380
G ST A G+RH+F
Sbjct: 313 -----KGESKFVST----AGGVGLRHKF 331


9     Bamb_0896  Bamb_0911  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Bamb_0896  2       23       -1.654848   NADH dehydrogenase (quinone)
Bamb_0897  3       34       -4.198678   NADH-ubiquinone oxidoreductase 24 kD
Bamb_0898  4       38       -5.853188   molybdate metabolism transcriptional regulator
Bamb_0899  6       47       -8.721725   hypothetical protein
Bamb_0900  6       50       -9.158224   hypothetical protein
Bamb_0901  6       52       -10.321472  hypothetical protein
Bamb_0902  1       42       -7.783480   hypothetical protein
Bamb_0903  3       40       -7.256970   hypothetical protein
Bamb_0904  3       42       -6.887714   hypothetical protein
Bamb_0905  4       44       -7.874655   hypothetical protein
Bamb_0906  4       44       -8.870964   hypothetical protein
Bamb_0907  6       39       -6.999534   hypothetical protein
Bamb_0908  6       35       -4.959153   hypothetical protein
Bamb_0909  4       27       -4.686670   phage transcriptional regulator AlpA
Bamb_0910  5       25       -4.896218   hypothetical protein
Bamb_0911  0       14       -3.598213   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0905  TCRTETOQM     26     0.040    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 25.6 bits (56), Expect = 0.040
Identities = 23/69 (33%), Positives = 30/69 (43%), Gaps = 9/69 (13%)

Query: 13 VVGDGTPGPGRKK-GVPN---KMTVEVK-----EMIRQALDEAGGVDYLVERAKDPRTAS 63
V+GD P R++ P + TVE EM+ AL E D L+ D T
Sbjct: 326 VLGDTKLLPQRERIENPLPLLQTTVEPSKPQQREMLLDALLEISDSDPLLRYYVDSATHE 385

Query: 64 AFLSLVGKV 72
LS +GKV
Sbjct: 386 IILSFLGKV 394


10    Bamb_1053  Bamb_1058  Y     N          N                   Genomic Island
LocusTag   DNBias  CDNBias  %GCBias     Product
Bamb_1053  0       27       -4.223506   hypothetical protein
Bamb_1054  2       30       -6.009346   polar amino acid ABC transporter inner membrane
Bamb_1055  4       38       -7.216483   polar amino acid ABC transporter inner membrane
Bamb_1056  4       29       -6.189438   ABC transporter-like protein
Bamb_1057  3       25       -4.914924   AraC family transcriptional regulator
Bamb_1058  1       18       -3.659337   transmembrane protein
11  Bamb_1121  Bamb_1134  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Bamb_1121  2       10       0.946087    hypothetical protein
Bamb_1122  3       10       1.653973    hypothetical protein
Bamb_1123  3       9        2.092719    3-methyl-2-oxobutanoate dehydrogenase
Bamb_1124  3       10       1.790148    transketolase, central region
Bamb_1125  2       10       1.793947    branched-chain alpha-keto acid dehydrogenase E2
Bamb_1126  1       10       1.554808    dihydrolipoamide dehydrogenase
Bamb_1127  1       10       1.727016    nucleoside-diphosphate-sugar epimerase-like
Bamb_1128  0       12       1.653737    hypothetical protein
Bamb_1129  1       10       2.077331    cytosine/purines uracil thiamine allantoin
Bamb_1130  0       11       3.209498    LysR family transcriptional regulator
Bamb_1131  1       12       3.756146    major facilitator superfamily transporter
Bamb_1132  0       11       4.001019    allantoate amidohydrolase
Bamb_1133  -2      12       3.905299    histone deacetylase superfamily protein
Bamb_1134  -1      11       3.202607    major facilitator superfamily transporter
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1125  IGASERPTASE   31     0.011    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 31.2 bits (70), Expect = 0.011
Identities = 28/152 (18%), Positives = 48/152 (31%), Gaps = 16/152 (10%)

Query: 25 VEVGQTIKEDQPLADVMTDKAAVEIPS--PVAGKVLALGGRIGEMMAVGSELIRVEVEGN 82
+ I+ D P V ++ + PV A E +A S+ VE N
Sbjct: 997 ITTPNNIQADVP--SVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKN 1054

Query: 83 GNLKPGTKARDAEADATSRPAAVDTPAKS------SKVTEAAEAHDASKAARHTAERAPA 136
T A++ E ++ + S+ E A E+A
Sbjct: 1055 EQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKV 1114

Query: 137 EPRRTEHAA------APRAALAPGERPLASPA 162
E +T+ +P+ + +P A PA
Sbjct: 1115 ETEKTQEVPKVTSQVSPKQEQSETVQPQAEPA 1146


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1134  TCRTETA       45     4e-07    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 44.8 bits (106), Expect = 4e-07
Identities = 82/403 (20%), Positives = 136/403 (33%), Gaps = 31/403 (7%)

Query: 17 RPGGSATLPLLALAAGAFGIGTTEFSPMGLLPVIADGVHVSIPQA---GMLISAYAIGVM 73
+P + L +A A GIG M +LP + + S G+L++ YA+
Sbjct: 2 KPNRPLIVILSTVALDAVGIGLI----MPVLPGLLRDLVHSNDVTAHYGILLALYALMQF 57

Query: 74 VGAPLMTLLLARWSRRSALIALMSIFTIGNLLSAIAPDYTTLLLARLVTSLNHGAFFGLG 133
AP++ L R+ RR L+ ++ + + A AP L + R+V + G
Sbjct: 58 ACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAG 117

Query: 134 SVVAAGLVPRERQASAVATMFMGLTIANVGGVPAATWLGQMIGWRMSFAATAALGLIAIA 193
+ + A + + +A M V G +G F A AAL +
Sbjct: 118 AYI-ADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGG-FSPHAPFFAAAALNGLNFL 175

Query: 194 GLFAALPKGEAGKMPNLRAELSVLTRPVVLGALATTVLGAGAMF-----------TLYTY 242
LP+ G+ LR E T V A+F L+
Sbjct: 176 TGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVI 235

Query: 243 VAPTLEHVTGATPGFVTAMLVLIGVGFSIGNIAGGRLADRSLDATLIGFLVLLIVTMAGF 302
H T G A ++ + G +A R + + ++ +I G+
Sbjct: 236 FGEDRFHWDATTIGISLAAFGILHSLAQA--MITGPVAARLGERRAL--MLGMIADGTGY 291

Query: 303 PLLARTHVGAAATLLVWGVATFAVVPPLQMRVM--RAAHEAPGLASAVNIGAFNLGNALG 360
LLA G A ++ +A+ + P ++ + E G +L + +G
Sbjct: 292 ILLAFATRGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVG 351

Query: 361 AAAGGAAISAGFGYAAVPLVGGLIAAAGLALVALQRMQRRRGA 403
A +A G AG AL L RRG
Sbjct: 352 PLLFTAIYAASITTW-----NGWAWIAGAALYLLCLPALRRGL 389


12  Bamb_1146  Bamb_1153  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Bamb_1146  2       10       3.216244    malonate transporter subunit MadM
Bamb_1147  1       10       4.014270    hypothetical protein
Bamb_1148  2       11       5.647006    malonate decarboxylase subunit delta
Bamb_1149  1       10       5.640034    malonate decarboxylase subunit beta
Bamb_1150  3       9        5.881866    malonate decarboxylase subunit gamma
Bamb_1151  3       8        6.115598    phosphoribosyl-dephospho-CoA transferase
Bamb_1152  3       9        5.519198    triphosphoribosyl-dephospho-CoA synthase
Bamb_1153  2       9        4.750531    acyl-carrier-protein S-malonyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1147  ARGDEIMINASE  30     0.021    Bacterial arginine deiminase signature.
		>ARGDEIMINASE#Bacterial arginine deiminase signature.

Length = 409

Score = 30.2 bits (68), Expect = 0.021
Identities = 10/47 (21%), Positives = 19/47 (40%), Gaps = 6/47 (12%)

Query: 182 QVDRIVDKVPRVDIPGDRVHFV--VEAGRPFYVEPL----FTRDPAA 222
+ +++ V ++ V F ++P+ FTRDP A
Sbjct: 121 MISKMISGVVTEELKNYTSSLDDLVNGANLFIIDPMPNVLFTRDPFA 167


13  Bamb_1177  Bamb_1193  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Bamb_1177  -1      12       -3.404067   phosphate ABC transporter substrate-binding
Bamb_1178  0       10       -3.116594   phosphate transporter permease subunit PstC
Bamb_1179  -1      10       -2.503419   phosphate transporter permease subunit PtsA
Bamb_1180  -2      8        -2.103075   phosphate transporter ATP-binding protein
Bamb_1181  -2      8        -1.650098   phosphate uptake regulator PhoU
Bamb_1182  -2      8        -2.446933   two component transcriptional regulator
Bamb_1183  -3      9        -2.828486   histidine kinase
Bamb_1184  -1      14       -3.854442   polyphosphate kinase
Bamb_1185  2       35       -6.621698   Ppx/GppA phosphatase
Bamb_1186  2       42       -9.406781   hypothetical protein
Bamb_1187  3       46       -9.368712   hypothetical protein
Bamb_1188  4       45       -8.497245   hypothetical protein
Bamb_1189  5       48       -8.929840   hypothetical protein
Bamb_1190  4       44       -8.250513   hypothetical protein
Bamb_1191  4       38       -7.206727   Rhs element Vgr protein
Bamb_1192  5       38       -6.086927   hypothetical protein
Bamb_1193  6       36       -5.422513   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1182  HTHFIS        86     2e-21    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 85.7 bits (212), Expect = 2e-21
Identities = 36/136 (26%), Positives = 64/136 (47%), Gaps = 5/136 (3%)

Query: 5 ILVVEDEPAISELISVNLQHAGHCPIRAYNAEQAQNLISDVLPDLVLLDWMLPGKSGIAF 64
ILV +D+ AI +++ L AG+ NA I+ DLV+ D ++P ++
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 65 ARDLRNNERTKHIPIIMLTARGDEQDKVLGLEIGADDYVTKPFSPKELMARIKAVL---R 121
++ + +P+++++A+ + E GA DY+ KPF EL+ I L +
Sbjct: 66 LPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 122 RRAPQLTEDVVSINGL 137
RR +L +D L
Sbjct: 124 RRPSKLEDDSQDGMPL 139


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1183  PF06580       40     1e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 39.8 bits (93), Expect = 1e-05
Identities = 20/106 (18%), Positives = 35/106 (33%), Gaps = 26/106 (24%)

Query: 328 LVTNAIRY----TPDGGKIFVSWRREGAQGVFSVTDSGFGIPAADLPRLTERFYRVDRSR 383
LV N I++ P GGKI + ++ V ++G
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLAL------------------ 304

Query: 384 SRDTGGTGLGLAIVKHVLQR---HDSHLYVQSEEGRGSTFTARFPA 426
TG GL V+ LQ ++ + + ++G+ P
Sbjct: 305 KNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKV-NAMVLIPG 349


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1191  ICENUCLEATIN  47     2e-07    Ice nucleation protein signature.
		>ICENUCLEATIN#Ice nucleation protein signature.

Length = 1258

Score = 47.4 bits (112), Expect = 2e-07
Identities = 32/120 (26%), Positives = 47/120 (39%)

Query: 547 HDETVHVGHDRTETVDNNETIHIGVDRTETVGNNETLTVGGNRNETIQGMENLLIALTST 606
+D ++ G+ T+T N + G T+T N LT G T +L+ ST
Sbjct: 876 YDSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYESSLIAGYGST 935

Query: 607 ETVGLAKALTVGGGYQVTVAGAVNTSAGLASAEEVGLSKTTMVGKTYTITAGDRIELKTG 666
+T L G G T + +AG S G + + G T TAG + L G
Sbjct: 936 QTASFKSTLMAGYGSSQTAREQSSLTAGYGSTSMAGYDSSLIAGYGSTQTAGYQSTLTAG 995



Score = 47.4 bits (112), Expect = 2e-07
Identities = 38/145 (26%), Positives = 50/145 (34%), Gaps = 1/145 (0%)

Query: 554 GHDRTETVDNNETIHIGVDRTETVGNNETLTVGGNRNETIQGMENLLIALTSTETVGLAK 613
G+ T T ++ G T+T TL G ++T + +L ST G
Sbjct: 915 GYGSTSTAGYESSLIAGYGSTQTASFKSTLMAGYGSSQTAREQSSLTAGYGSTSMAGYDS 974

Query: 614 ALTVGGGYQVTVAGAVNTSAGLASAEEVGLSKTTMVGKTYTITAGDRIELKTG-SSVLIM 672
+L G G T +AG S + S T G T TAG L G S L
Sbjct: 975 SLIAGYGSTQTAGYQSTLTAGYGSTQTAEHSSTLTAGYGSTATAGADSSLIAGYGSSLTS 1034

Query: 673 ESNGHITLRGTQLLIEGSGPVQING 697
+T LI G V G
Sbjct: 1035 GIRSFLTAGYGSTLISGLRSVLTAG 1059



Score = 47.1 bits (111), Expect = 3e-07
Identities = 31/120 (25%), Positives = 47/120 (39%)

Query: 547 HDETVHVGHDRTETVDNNETIHIGVDRTETVGNNETLTVGGNRNETIQGMENLLIALTST 606
D ++ G+ T+T + ++ G T+T LT G T +L+ ST
Sbjct: 252 DDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGST 311

Query: 607 ETVGLAKALTVGGGYQVTVAGAVNTSAGLASAEEVGLSKTTMVGKTYTITAGDRIELKTG 666
+T G T G G T + +AG S G + + G T TAG+ L G
Sbjct: 312 QTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAG 371



Score = 45.9 bits (108), Expect = 5e-07
Identities = 42/160 (26%), Positives = 58/160 (36%), Gaps = 9/160 (5%)

Query: 547 HDETVHVGHDRTETVDNNETIHIGVDRTETVGNNETLTVGGNRNETIQGMENLLIALTST 606
D ++ G+ T+T N + G T+T N LT G T +L+ ST
Sbjct: 828 ADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYDSSLIAGYGST 887

Query: 607 ETVGLAKALTVGGGYQVTVAGAVNTSAGLASAEEVGLSKTTMVGKTYTITAGDRIELKTG 666
+T G LT G G T + + G S G + + G T TA + L G
Sbjct: 888 QTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYESSLIAGYGSTQTASFKSTLMAG 947

Query: 667 --SSVLIMESNGHITLRGTQL-------LIEGSGPVQING 697
SS E + G+ LI G G Q G
Sbjct: 948 YGSSQTAREQSSLTAGYGSTSMAGYDSSLIAGYGSTQTAG 987



Score = 45.1 bits (106), Expect = 9e-07
Identities = 32/113 (28%), Positives = 45/113 (39%)

Query: 554 GHDRTETVDNNETIHIGVDRTETVGNNETLTVGGNRNETIQGMENLLIALTSTETVGLAK 613
G+ T T ++ ++ G T+T G + +LT G +T Q +L ST T G
Sbjct: 243 GYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADS 302

Query: 614 ALTVGGGYQVTVAGAVNTSAGLASAEEVGLSKTTMVGKTYTITAGDRIELKTG 666
+L G G T +AG S + G T TAGD L G
Sbjct: 303 SLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAG 355



Score = 44.7 bits (105), Expect = 1e-06
Identities = 29/120 (24%), Positives = 49/120 (40%)

Query: 547 HDETVHVGHDRTETVDNNETIHIGVDRTETVGNNETLTVGGNRNETIQGMENLLIALTST 606
++ ++ G+ T+T T+ G ++T +LT G +L+ ST
Sbjct: 924 YESSLIAGYGSTQTASFKSTLMAGYGSSQTAREQSSLTAGYGSTSMAGYDSSLIAGYGST 983

Query: 607 ETVGLAKALTVGGGYQVTVAGAVNTSAGLASAEEVGLSKTTMVGKTYTITAGDRIELKTG 666
+T G LT G G T + +AG S G + + G ++T+G R L G
Sbjct: 984 QTAGYQSTLTAGYGSTQTAEHSSTLTAGYGSTATAGADSSLIAGYGSSLTSGIRSFLTAG 1043



Score = 44.7 bits (105), Expect = 1e-06
Identities = 41/154 (26%), Positives = 57/154 (37%), Gaps = 7/154 (4%)

Query: 546 DHDETVHVGHDRTETVDNNETIHIGVDRTETVGNNETLTVGGNRNETIQGMENLLIALTS 605
H + G+ TET ++ T+ G T T G + TL G +T + + S
Sbjct: 171 THQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTAGEESSQMAGYGS 230

Query: 606 TETVGLAKALTVGGGYQVTVAGAVNTSAGLASAEEVGLSKTTMVGKTYTITAGDRIELKT 665
T+T LT G G T + AG S + G + G T TA +L
Sbjct: 231 TQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTA 290

Query: 666 GSSVLIMESNGHITLRGTQLLIEGSGPVQINGKD 699
G S G T LI G G Q G++
Sbjct: 291 GYG-----STG--TAGADSSLIAGYGSTQTAGEE 317



Score = 44.4 bits (104), Expect = 2e-06
Identities = 32/119 (26%), Positives = 46/119 (38%)

Query: 548 DETVHVGHDRTETVDNNETIHIGVDRTETVGNNETLTVGGNRNETIQGMENLLIALTSTE 607
D ++ G+ T+T T G T+T LT G T +L+ ST+
Sbjct: 301 DSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQ 360

Query: 608 TVGLAKALTVGGGYQVTVAGAVNTSAGLASAEEVGLSKTTMVGKTYTITAGDRIELKTG 666
T G +LT G G T + +AG S G + + G T TAG+ G
Sbjct: 361 TAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAG 419



Score = 43.6 bits (102), Expect = 3e-06
Identities = 41/161 (25%), Positives = 55/161 (34%), Gaps = 9/161 (5%)

Query: 548 DETVHVGHDRTETVDNNETIHIGVDRTETVGNNETLTVGGNRNETIQGMENLLIALTSTE 607
D ++ G+ T+T N + G T+T LT G T +L+ ST+
Sbjct: 637 DSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQ 696

Query: 608 TVGLAKALTVGGGYQVTVAGAVNTSAGLASAEEVGLSKTTMVGKTYTITAGDRIELKTG- 666
T G LT G G T + ++G S G + + G T TA L G
Sbjct: 697 TAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGY 756

Query: 667 --------SSVLIMESNGHITLRGTQLLIEGSGPVQINGKD 699
SVL T LI G G Q G
Sbjct: 757 GSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYH 797



Score = 42.4 bits (99), Expect = 6e-06
Identities = 40/162 (24%), Positives = 56/162 (34%), Gaps = 9/162 (5%)

Query: 547 HDETVHVGHDRTETVDNNETIHIGVDRTETVGNNETLTVGGNRNETIQGMENLLIALTST 606
++ ++ G+ T+T T+ G T+T N L G T +L+ ST
Sbjct: 492 YESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGST 551

Query: 607 ETVGLAKALTVGGGYQVTVAGAVNTSAGLASAEEVGLSKTTMVGKTYTITAGDRIELKTG 666
+T LT G G T + +AG S G + + G T TA L G
Sbjct: 552 QTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAG 611

Query: 667 ---------SSVLIMESNGHITLRGTQLLIEGSGPVQINGKD 699
SVL T LI G G Q G +
Sbjct: 612 YGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYN 653



Score = 42.4 bits (99), Expect = 7e-06
Identities = 32/119 (26%), Positives = 47/119 (39%)

Query: 548 DETVHVGHDRTETVDNNETIHIGVDRTETVGNNETLTVGGNRNETIQGMENLLIALTSTE 607
D ++ G+ T+T + ++ G T+T LT G T +L+ ST+
Sbjct: 589 DSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQ 648

Query: 608 TVGLAKALTVGGGYQVTVAGAVNTSAGLASAEEVGLSKTTMVGKTYTITAGDRIELKTG 666
T G LT G G T + +AG S G + + G T TAG L G
Sbjct: 649 TAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAG 707



Score = 42.4 bits (99), Expect = 7e-06
Identities = 31/120 (25%), Positives = 44/120 (36%)

Query: 547 HDETVHVGHDRTETVDNNETIHIGVDRTETVGNNETLTVGGNRNETIQGMENLLIALTST 606
+ G+ T T + ++ G T+T G + LT G +T Q +L ST
Sbjct: 764 EQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDLTTGYGST 823

Query: 607 ETVGLAKALTVGGGYQVTVAGAVNTSAGLASAEEVGLSKTTMVGKTYTITAGDRIELKTG 666
T G +L G G T +AG S + + G T TAG L G
Sbjct: 824 STAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYDSSLIAG 883



Score = 42.0 bits (98), Expect = 8e-06
Identities = 41/159 (25%), Positives = 60/159 (37%), Gaps = 9/159 (5%)

Query: 548 DETVHVGHDRTETVDNNETIHIGVDRTETVGNNETLTVGGNRNETIQGMENLLIALTSTE 607
D ++ G+ T+T + ++ G T+T LT G T +L+ ST+
Sbjct: 733 DSSLIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQ 792

Query: 608 TVGLAKALTVGGGYQVTVAGAVNTSAGLASAEEVGLSKTTMVGKTYTITAGDRIELKTG- 666
T G LT G G T + + G S G + + G T TAG L G
Sbjct: 793 TAGYHSILTAGYGSTQTAQERSDLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGY 852

Query: 667 -SSVLIMESNGHITLRGT-------QLLIEGSGPVQING 697
S+ E++ T G+ LI G G Q G
Sbjct: 853 GSTQTAQENSDLTTGYGSTSTAGYDSSLIAGYGSTQTAG 891



Score = 42.0 bits (98), Expect = 9e-06
Identities = 29/113 (25%), Positives = 43/113 (38%)

Query: 554 GHDRTETVDNNETIHIGVDRTETVGNNETLTVGGNRNETIQGMENLLIALTSTETVGLAK 613
+ T + + + G TET G++ TL G T L+ ST+T G
Sbjct: 163 TYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTAGEES 222

Query: 614 ALTVGGGYQVTVAGAVNTSAGLASAEEVGLSKTTMVGKTYTITAGDRIELKTG 666
+ G G T + +AG S G + + G T TAG+ L G
Sbjct: 223 SQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAG 275



Score = 42.0 bits (98), Expect = 9e-06
Identities = 41/155 (26%), Positives = 54/155 (34%), Gaps = 9/155 (5%)

Query: 554 GHDRTETVDNNETIHIGVDRTETVGNNETLTVGGNRNETIQGMENLLIALTSTETVGLAK 613
G+ T T + ++ G T+T G N LT G +T Q +L ST T G
Sbjct: 819 GYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYDS 878

Query: 614 ALTVGGGYQVTVAGAVNTSAGLASAEEVGLSKTTMVGKTYTITAGDRIELKTG------- 666
+L G G T +AG S + + G T TAG L G
Sbjct: 879 SLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYESSLIAGYGSTQTA 938

Query: 667 --SSVLIMESNGHITLRGTQLLIEGSGPVQINGKD 699
S L+ T R L G G + G D
Sbjct: 939 SFKSTLMAGYGSSQTAREQSSLTAGYGSTSMAGYD 973



Score = 41.3 bits (96), Expect = 1e-05
Identities = 42/161 (26%), Positives = 56/161 (34%), Gaps = 9/161 (5%)

Query: 548 DETVHVGHDRTETVDNNETIHIGVDRTETVGNNETLTVGGNRNETIQGMENLLIALTSTE 607
D ++ G+ T+T T G T+T LT G T +L+ ST+
Sbjct: 397 DSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQ 456

Query: 608 TVGLAKALTVGGGYQVTVAGAVNTSAGLASAEEVGLSKTTMVGKTYTITAGDRIELKTG- 666
T G +LT G G T + +AG S G + + G T TAG L G
Sbjct: 457 TAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGY 516

Query: 667 --------SSVLIMESNGHITLRGTQLLIEGSGPVQINGKD 699
S LI T LI G G Q +
Sbjct: 517 GSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYN 557



Score = 40.5 bits (94), Expect = 3e-05
Identities = 30/119 (25%), Positives = 45/119 (37%)

Query: 548 DETVHVGHDRTETVDNNETIHIGVDRTETVGNNETLTVGGNRNETIQGMENLLIALTSTE 607
D ++ G+ T+T + ++ G T+T LT G T +L+ ST+
Sbjct: 445 DSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQ 504

Query: 608 TVGLAKALTVGGGYQVTVAGAVNTSAGLASAEEVGLSKTTMVGKTYTITAGDRIELKTG 666
T G LT G G T + G S G + + + G T TA L G
Sbjct: 505 TAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAG 563



Score = 40.1 bits (93), Expect = 3e-05
Identities = 31/119 (26%), Positives = 47/119 (39%)

Query: 548 DETVHVGHDRTETVDNNETIHIGVDRTETVGNNETLTVGGNRNETIQGMENLLIALTSTE 607
D ++ G+ T+T + ++ G T+T LT G T +L+ ST+
Sbjct: 349 DSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQ 408

Query: 608 TVGLAKALTVGGGYQVTVAGAVNTSAGLASAEEVGLSKTTMVGKTYTITAGDRIELKTG 666
T G T G G T + +AG S G + + G T TAG+ L G
Sbjct: 409 TAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAG 467



Score = 40.1 bits (93), Expect = 3e-05
Identities = 41/161 (25%), Positives = 56/161 (34%), Gaps = 9/161 (5%)

Query: 548 DETVHVGHDRTETVDNNETIHIGVDRTETVGNNETLTVGGNRNETIQGMENLLIALTSTE 607
D T+ G+ T+T + G T+T LT G T +L+ ST+
Sbjct: 205 DSTLVAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQ 264

Query: 608 TVGLAKALTVGGGYQVTVAGAVNTSAGLASAEEVGLSKTTMVGKTYTITAGDRIELKTGS 667
T G +LT G G T + +AG S G + + G T TAG+ G
Sbjct: 265 TAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGY 324

Query: 668 SVLIMESNGHITLRG---------TQLLIEGSGPVQINGKD 699
G G LI G G Q G+D
Sbjct: 325 GSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGED 365



Score = 40.1 bits (93), Expect = 4e-05
Identities = 31/119 (26%), Positives = 45/119 (37%)

Query: 548 DETVHVGHDRTETVDNNETIHIGVDRTETVGNNETLTVGGNRNETIQGMENLLIALTSTE 607
D ++ G+ T+T + + G T+T LT G T +L+ ST+
Sbjct: 781 DSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDLTTGYGSTSTAGADSSLIAGYGSTQ 840

Query: 608 TVGLAKALTVGGGYQVTVAGAVNTSAGLASAEEVGLSKTTMVGKTYTITAGDRIELKTG 666
T G LT G G T + + G S G + + G T TAG L G
Sbjct: 841 TAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYDSSLIAGYGSTQTAGYNSILTAG 899



Score = 39.7 bits (92), Expect = 4e-05
Identities = 31/119 (26%), Positives = 44/119 (36%)

Query: 548 DETVHVGHDRTETVDNNETIHIGVDRTETVGNNETLTVGGNRNETIQGMENLLIALTSTE 607
D ++ G+ T+T N + G T+T LT G T +L+ ST+
Sbjct: 685 DSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQ 744

Query: 608 TVGLAKALTVGGGYQVTVAGAVNTSAGLASAEEVGLSKTTMVGKTYTITAGDRIELKTG 666
T +LT G G T + G S G + + G T TAG L G
Sbjct: 745 TASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAG 803



Score = 39.4 bits (91), Expect = 6e-05
Identities = 29/119 (24%), Positives = 44/119 (36%)

Query: 548 DETVHVGHDRTETVDNNETIHIGVDRTETVGNNETLTVGGNRNETIQGMENLLIALTSTE 607
+ ++ G+ T+T N + G T+T LT G T +++ ST+
Sbjct: 541 NSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQ 600

Query: 608 TVGLAKALTVGGGYQVTVAGAVNTSAGLASAEEVGLSKTTMVGKTYTITAGDRIELKTG 666
T +LT G G T + G S G + + G T TAG L G
Sbjct: 601 TASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAG 659



Score = 39.4 bits (91), Expect = 6e-05
Identities = 31/113 (27%), Positives = 43/113 (38%)

Query: 554 GHDRTETVDNNETIHIGVDRTETVGNNETLTVGGNRNETIQGMENLLIALTSTETVGLAK 613
G+ T T ++ G T+T G TLT G +T Q +L+ ST T G
Sbjct: 483 GYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANS 542

Query: 614 ALTVGGGYQVTVAGAVNTSAGLASAEEVGLSKTTMVGKTYTITAGDRIELKTG 666
+L G G T + +AG S + G T TAG + G
Sbjct: 543 SLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAG 595



Score = 39.0 bits (90), Expect = 7e-05
Identities = 32/117 (27%), Positives = 43/117 (36%)

Query: 550 TVHVGHDRTETVDNNETIHIGVDRTETVGNNETLTVGGNRNETIQGMENLLIALTSTETV 609
+ G+ T T + ++ G T+T G N LT G +T Q +L ST T
Sbjct: 623 VLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTA 682

Query: 610 GLAKALTVGGGYQVTVAGAVNTSAGLASAEEVGLSKTTMVGKTYTITAGDRIELKTG 666
G +L G G T +AG S + G T TAG L G
Sbjct: 683 GADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAG 739



Score = 37.0 bits (85), Expect = 3e-04
Identities = 37/160 (23%), Positives = 62/160 (38%), Gaps = 9/160 (5%)

Query: 547 HDETVHVGHDRTETVDNNETIHIGVDRTETVGNNETLTVGGNRNETIQGMENLLIALTST 606
+D ++ G+ T+T T+ G T+T ++ TLT G T +L+ S+
Sbjct: 972 YDSSLIAGYGSTQTAGYQSTLTAGYGSTQTAEHSSTLTAGYGSTATAGADSSLIAGYGSS 1031

Query: 607 ETVGLAKALTVGGGYQVTVAGAVNTSAGLASAEEVGLSKTTMVG---------KTYTITA 657
T G+ LT G G + +AG S+ G + G ++ I
Sbjct: 1032 LTSGIRSFLTAGYGSTLISGLRSVLTAGYGSSLISGRRSSLTAGYGSNQIASHRSSLIAG 1091

Query: 658 GDRIELKTGSSVLIMESNGHITLRGTQLLIEGSGPVQING 697
+ ++ S+LI T LI G+ VQ+ G
Sbjct: 1092 PESTQITGNRSMLIAGKGSSQTAGYRSTLISGADSVQMAG 1131



Score = 32.0 bits (72), Expect = 0.010
Identities = 26/104 (25%), Positives = 40/104 (38%), Gaps = 7/104 (6%)

Query: 569 IGVDRTETVGNNETLTVGGNRNETIQGMENLLIALTSTETVGLAKALTVGGGYQVTVAGA 628
I R+ + E+ + GNR+ I G S++T G L G
Sbjct: 1081 IASHRSSLIAGPESTQITGNRSMLIAGK-------GSSQTAGYRSTLISGADSVQMAGER 1133

Query: 629 VNTSAGLASAEEVGLSKTTMVGKTYTITAGDRIELKTGSSVLIM 672
AG S + G + G +TAGDR +L G+ ++M
Sbjct: 1134 GKLIAGADSTQTAGDRSKLLAGNNSYLTAGDRSKLTAGNDCILM 1177



Score = 32.0 bits (72), Expect = 0.011
Identities = 27/117 (23%), Positives = 42/117 (35%)

Query: 547 HDETVHVGHDRTETVDNNETIHIGVDRTETVGNNETLTVGGNRNETIQGMENLLIALTST 606
H ++ G + T+ N + G ++T G TL G + + L+ ST
Sbjct: 1084 HRSSLIAGPESTQITGNRSMLIAGKGSSQTAGYRSTLISGADSVQMAGERGKLIAGADST 1143

Query: 607 ETVGLAKALTVGGGYQVTVAGAVNTSAGLASAEEVGLSKTTMVGKTYTITAGDRIEL 663
+T G L G +T +AG G G +TAG R +L
Sbjct: 1144 QTAGDRSKLLAGNNSYLTAGDRSKLTAGNDCILMAGDRSKLTAGINSILTAGCRSKL 1200


14  Bamb_1203  Bamb_1226  Y  N  Y  Genomic Island
LocusTag   DNBias  CDNBias  %GCBias     Product
Bamb_1203  2       30       -4.122642   aldose 1-epimerase
Bamb_1204  3       41       -6.218380   undecaprenyl pyrophosphate phosphatase
Bamb_1205  4       46       -6.107799   *TIS1421-transposase orfA protein
Bamb_1206  4       41       -6.095950   exonuclease
Bamb_1207  3       33       -4.550947   hypothetical protein
Bamb_1208  3       28       -3.850152   hypothetical protein
Bamb_1209  3       23       -3.779826   hypothetical protein
Bamb_1210  5       22       -3.509728   hypothetical protein
Bamb_1211  4       20       -3.719711   hypothetical protein
Bamb_1212  3       19       -2.514018   hypothetical protein
Bamb_1213  1       20       -2.923159   hypothetical protein
Bamb_1214  0       22       -3.875133   cyclic nucleotide-binding protein
Bamb_1215  0       24       -3.324730   ECF subfamily RNA polymerase sigma-24 factor
Bamb_1216  2       25       -2.954879   hypothetical protein
Bamb_1217  2       24       -2.971450   hypothetical protein
Bamb_1218  3       27       -3.841961   hypothetical protein
Bamb_1219  2       27       -3.615764   hypothetical protein
Bamb_1220  2       26       -3.079512   hypothetical protein
Bamb_1221  1       23       -3.177917   altronate dehydratase
Bamb_1222  2       22       -3.150213   mannitol dehydrogenase domain-containing
Bamb_1223  1       22       -3.347448   alcohol dehydrogenase
Bamb_1224  1       23       -2.824192   amidohydrolase 2
Bamb_1225  2       26       -2.821188   aldo/keto reductase
Bamb_1226  3       25       -2.807159   periplasmic binding protein/LacI transcriptional
15  Bamb_1248  Bamb_1267  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Bamb_1248  2       11       1.329945    hypothetical protein
Bamb_1249  1       12       2.308465    hypothetical protein
Bamb_1250  1       13       3.055525    phosphatidate cytidylyltransferase
Bamb_1251  1       12       3.869981    phospholipid/glycerol acyltransferase
Bamb_1252  0       13       3.757073    CDP-alcohol phosphatidyltransferase
Bamb_1253  -1      12       4.229588    alpha/beta fold family hydrolase
Bamb_1254  -1      11       5.075503    dual specificity protein phosphatase
Bamb_1255  0       10       5.795773    hypothetical protein
Bamb_1256  0       9        5.817923    diguanylate cyclase
Bamb_1257  0       8        5.984612    hypothetical protein
Bamb_1258  0       10       5.673915    cellulose synthase regulator protein
Bamb_1259  1       10       5.201705    endo-1,4-D-glucanase
Bamb_1260  1       10       4.902859    cellulose synthase domain-containing protein
Bamb_1261  -1      11       3.745238    hypothetical protein
Bamb_1262  -1      14       2.679629    chromosome partitioning ATPase
Bamb_1263  0       15       2.434689    cellulose synthase
Bamb_1264  2       14       2.299018    hypothetical protein
Bamb_1265  1       13       1.792145    hypothetical protein
Bamb_1266  1       12       1.573720    pirin domain-containing protein
Bamb_1267  2       11       1.129511    OsmC family protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1253  PERTACTIN     30     0.041    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 29.7 bits (66), Expect = 0.041
Identities = 37/135 (27%), Positives = 54/135 (40%), Gaps = 6/135 (4%)

Query: 374 GVGALIDRTY-LD-SPGWVGIRQRKVHLQELIGAAIGRLRGLGEPVRIVDIAAGHGRYVL 431
G G L+D Y +D S V + Q V +L GAAI RG V ++A HG +
Sbjct: 282 GFGPLLDGWYGVDVSDSTVDLAQSIVEAPQL-GAAIRAGRGARVTVSGGSLSAPHGNVIE 340

Query: 432 DAIATAAERDGAAPDDITLRDYSPPNVEAGRVLIAQRGLEPIARFERGDAFDEASLATLE 491
A+P ITL+ + GR L+ + EP+ G A + + E
Sbjct: 341 TGGGARRFPPPASPLSITLQAGARAQ---GRALLYRVLPEPVKLTLAGGAQGQGDIVATE 397

Query: 492 PRPTLAIVSGLYELF 506
P SG ++
Sbjct: 398 LPPIPGASSGPLDVA 412


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1255  RTXTOXINA     27     0.031    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 27.2 bits (60), Expect = 0.031
Identities = 15/48 (31%), Positives = 25/48 (52%), Gaps = 2/48 (4%)

Query: 15 GALDAGAWAIAAALAAAGAGWWIASGWA--PATIARVLLAVVSTGSGI 60
GA+DA I+ LA+ +G A+ + A ++ ++ AV SGI
Sbjct: 362 GAIDASLTTISTVLASVSSGISAAATTSLVGAPVSALVGAVTGIISGI 409


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1258  FLGHOOKFLIK   32     0.008    Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 32.1 bits (72), Expect = 0.008
Identities = 19/68 (27%), Positives = 28/68 (41%)

Query: 11 SVATFGALHAAALSAAPMPAVPAAASASAGTPAAPSPASGAAVPNAASVASAGLPTTTVH 70
+ T A A P+ + A A + A + PSP + AA P + LPT
Sbjct: 170 QLTTAQPDDAPGTPAQPLTPLVAEAQSKAEVISTPSPVTAAASPLITPHQTQPLPTVAAP 229

Query: 71 LPFASLGA 78
+ A LG+
Sbjct: 230 VLSAPLGS 237


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1261  TYPE3OMOPROT  32     0.006    Type III secretion system outer membrane O protein ...
		>TYPE3OMOPROT#Type III secretion system outer membrane O protein family signature.

Length = 303

Score = 32.3 bits (73), Expect = 0.006
Identities = 22/56 (39%), Positives = 27/56 (48%), Gaps = 8/56 (14%)

Query: 491 DALRHCRPRRAGDVVTADAAHLYVFLFACEPVDAEDALARIFDVPVDTLSDRVVCL 546
D L H P AG V+A A HL V A A R F++PV LS R +C+
Sbjct: 58 DWLEHVSPALAGAAVSAGAEHLVVPWLA--------ATERPFELPVPHLSCRRLCV 105


16  Bamb_1284  Bamb_1310  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Bamb_1284  2       8        0.241729    hypothetical protein
Bamb_1285  1       8        0.324186    L-carnitine dehydratase/bile acid-inducible
Bamb_1286  2       8        1.080235    alanyl-tRNA synthetase
Bamb_1287  2       10       1.747634    hypothetical protein
Bamb_1288  1       9        2.207376    polypeptide-transport-associated
Bamb_1289  1       11       3.078724    filamentous hemagglutinin outer membrane
Bamb_1290  2       9        5.530254    LysR family transcriptional regulator
Bamb_1291  2       10       5.693800    hypothetical protein
Bamb_1292  1       10       5.494666    NUDIX hydrolase
Bamb_1293  2       11       5.074697    thioesterase superfamily protein
Bamb_1294  2       10       5.386561    inner-membrane translocator
Bamb_1295  2       10       4.622901    inner-membrane translocator
Bamb_1296  -1      11       2.588768    ABC transporter-like protein
Bamb_1297  -2      9        2.583283    ABC transporter-like protein
Bamb_1298  -1      9        2.259561    short-chain dehydrogenase/reductase SDR
Bamb_1299  0       11       1.922219    hypothetical protein
Bamb_1300  0       13       1.346494    myo-inositol catabolism IolB domain-containing
Bamb_1301  0       13       1.196968    xylose isomerase domain-containing protein
Bamb_1302  1       14       1.381459    thiamine pyrophosphate binding domain-containing
Bamb_1303  2       14       0.706245    ribokinase-like domain-containing protein
Bamb_1304  3       15       0.039723    periplasmic binding protein/LacI transcriptional
Bamb_1305  3       13       0.326830    ABC transporter-like protein
Bamb_1306  3       13       0.523823    inner-membrane translocator
Bamb_1307  2       13       1.044414    xylose isomerase domain-containing protein
Bamb_1308  2       13       1.611559    inositol 2-dehydrogenase
Bamb_1309  2       13       1.415277    xylose isomerase domain-containing protein
Bamb_1310  2       13       1.645915    oxidoreductase domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1288  PF00577       30     0.033    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 29.8 bits (67), Expect = 0.033
Identities = 36/238 (15%), Positives = 59/238 (24%), Gaps = 48/238 (20%)

Query: 308 YRAGYQVPVGPLSTRLGVA---YSEMHYRLAGEFSDLEYHGRASVQSVFVAQPLVRARQM 364
R Y + T + + YS Y + + +G V Q +
Sbjct: 459 VRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNIETQDGVIQVKPKFT-- 516

Query: 365 DLTAQIQYENKNLHDTYGIFDLQTDKNVGLW-SFSLSGSSEDRWFGGGR----------- 412
+ Y + L + +G + LSGS + W
Sbjct: 517 -DYYNLAYNKRGK------LQLTVTQQLGRTSTLYLSGSHQTYWGTSNVDEQFQAGLNTA 569

Query: 413 -NGVSVTFGAGRLRSND--------------PIGMNTLAKTIGSFSKLNVSALRVQSLGR 457
++ T ++ P + + + + S L
Sbjct: 570 FEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHASASYSMSHDLNG 629

Query: 458 RFQFYAQFNAQLASRNLDSSEKFSLGGPYGVRAYGFSAGSGDQGWQASAELRYRAAPG 515
R A L N S Y V+ G G+ G A L YR G
Sbjct: 630 RMTNLAGVYGTLLEDNNLS---------YSVQTGYAGGGDGNSGSTGYATLNYRGGYG 678


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1289  PF05860       76     9e-19    haemagglutination activity domain.
		>PF05860#haemagglutination activity domain.

Length = 117

Score = 76.4 bits (188), Expect = 9e-19
Identities = 30/107 (28%), Positives = 50/107 (46%), Gaps = 8/107 (7%)

Query: 53 LPTGGAIVGGKGDIATSSDGKAMSVNQHTDKLITNWQDFSIAGGERVSFHQPTDKSIALN 112
LP I Q L ++Q+FS+ F+ PT+ ++
Sbjct: 9 LPINSNITTEGNTRIIER------GTQAGSNLFHSFQEFSVPTSGTAFFNNPTNIQNIIS 62

Query: 113 RVIGTNGSQIQGQLDANGK--VFLVNPNGVVFGKGAQVNVGGLVATT 157
RV G + S I G + AN +FL+NPNG++FG+ A++++GG +
Sbjct: 63 RVTGGSVSNIDGLIRANATANLFLINPNGIIFGQNARLDIGGSFVGS 109


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1298  DHBDHDRGNASE  103    1e-28    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 103 bits (257), Expect = 1e-28
Identities = 76/251 (30%), Positives = 118/251 (47%), Gaps = 14/251 (5%)

Query: 7 KVAIVTGGSKGIGAAIAKALAAEGASVV-VNYASSKAGADAVVSAIVEAGGRAVAVGGDV 65
K+A +TG ++GIG A+A+ LA++GA + V+Y K + VVS++ A A DV
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKL--EKVVSSLKAEARHAEAFPADV 66

Query: 66 SKAADAQRIVDAAIENYGRLDVLVNNSGVYEFAPIEAITEEHYRRQFDTNVFGVLLTTQA 125
+A I G +D+LVN +GV I ++++E + F N GV +++
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126

Query: 126 AVKHL--GEGASIINISSVVTSITPPASAVYSGTKGAVDAITGVLALELGPRKIRVNAIN 183
K++ SI+ + S + + A Y+ +K A T L LEL IR N ++
Sbjct: 127 VSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186

Query: 184 PGMIVTEGTHS--------AGIIGSDLETQVRNDTPLGRLGEPGDIASVAVFLASDDARW 235
PG T+ S +I LE + PL +L +P DIA +FL S A
Sbjct: 187 PGSTETDMQWSLWADENGAEQVIKGSLE-TFKTGIPLKKLAKPSDIADAVLFLVSGQAGH 245

Query: 236 LTGERLVASGG 246
+T L GG
Sbjct: 246 ITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1306  SOPEPROTEIN   31     0.004    Salmonella type III secretion SopE effector protein ...
		>SOPEPROTEIN#Salmonella type III secretion SopE effector protein signature.

Length = 239

Score = 31.2 bits (70), Expect = 0.004
Identities = 19/52 (36%), Positives = 26/52 (50%), Gaps = 8/52 (15%)

Query: 148 IPPFIATLGTMVAARGFAKWFTNGMPVSMLTDPFAAIGAGANPVIIFLVVAA 199
I PF+ +G AA+ G+P + D F GAGANP I L+ +A
Sbjct: 136 IAPFLQEIGE--AAK------NAGLPGTTKNDVFTPSGAGANPFITPLISSA 179


17  Bamb_1350  Bamb_1361  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Bamb_1350  2       13       0.827942    3-oxoacid CoA-transferase subunit A
Bamb_1351  3       14       1.559323    3-oxoacid CoA-transferase subunit B
Bamb_1352  2       15       1.737685    short chain dehydrogenase
Bamb_1353  -1      9        0.939460    polysaccharide deacetylase
Bamb_1354  -3      8        -0.803928   hypothetical protein
Bamb_1355  -3      9        -1.751844   LysR family transcriptional regulator
Bamb_1356  -2      8        -2.742369   alpha/beta fold family hydrolase
Bamb_1357  -1      9        -3.811435   endoribonuclease L-PSP
Bamb_1358  -1      9        -3.936386   (p)ppGpp synthetase I SpoT/RelA
Bamb_1359  -1      12       -4.439510   *threonyl-tRNA synthetase
Bamb_1360  1       13       -4.037380   translation initiation factor IF-3
Bamb_1361  0       14       -3.336033   50S ribosomal protein L35
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1352  DHBDHDRGNASE  66     2e-15    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 66.2 bits (161), Expect = 2e-15
Identities = 46/196 (23%), Positives = 86/196 (43%), Gaps = 18/196 (9%)

Query: 26 LDITDTAAVDAFCAR----VGQFDHVVISAAKTATGPLRALPLADAQAAMDSKFWGAY-- 79
D+ D+AA+D AR +G D +V A G + +L + +A G +
Sbjct: 64 ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNA 123

Query: 80 --RIARSIDIAPGGSLTFVSGYLSVRPSASSVLQGAINAALEALARGLALELAP--VRVN 135
+++ + GS+ V + P S + AA + L LELA +R N
Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183

Query: 136 TVSPGLIATPLWDKL--APDARDAMYAGAAQR----LPARRVGQPEDVANAIVYLAT--T 187
VSPG T + L + + + G+ + +P +++ +P D+A+A+++L +
Sbjct: 184 IVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQA 243

Query: 188 PYATGSTVLIDGGGAI 203
+ T + +DGG +
Sbjct: 244 GHITMHNLCVDGGATL 259


18  Bamb_1388  Bamb_1405  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Bamb_1388  0       17       -3.324332  MarR family transcriptional regulator
Bamb_1389  0       17       -3.404348  GTP-binding protein TypA
Bamb_1390  0       17       -3.625624  2-oxoglutarate dehydrogenase E1 component
Bamb_1391  -2      14       -3.271244  dihydrolipoamide succinyltransferase
Bamb_1392  0       11       -1.937107  dihydrolipoamide dehydrogenase
Bamb_1393  3       15       0.039803   AFG1 family ATPase
Bamb_1394  5       19       0.465172   hypothetical protein
Bamb_1395  4       21       0.092526   hypothetical protein
Bamb_1396  4       21       -0.014524  polypeptide-transport-associated
Bamb_1397  5       27       1.680583   hypothetical protein
Bamb_1398  3       26       -0.225020  hypothetical protein
Bamb_1399  2       28       -1.991283  Flp/Fap pilin component
Bamb_1400  1       24       -1.881645  peptidase A24A, prepilin type IV
Bamb_1401  0       20       -1.137024  TadE family protein
Bamb_1402  1       20       -1.051317  CpaB family Flp pilus assembly protein
Bamb_1403  1       19       -1.489834  type II and III secretion system protein
Bamb_1404  2       20       -1.267941  response regulator receiver protein
Bamb_1405  2       21       -1.814949  type II secretion system protein E
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_1388  FLGMOTORFLIM  28  0.027  Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 27.6 bits (61), Expect = 0.027
Identities = 16/55 (29%), Positives = 26/55 (47%), Gaps = 10/55 (18%)

Query: 112 EGRALAERLPPVFRSVLDELLGG----------FTPEEVGFLKSMLRRILSNYCE 156
+G A+ E P + S++D L GG T E ++ ++ RIL+N E
Sbjct: 112 KGNAVLEVDPSITFSIIDRLFGGTGQAAKVQRDLTDIENSVMEGVIVRILANVRE 166


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_1389  TCRTETOQM  169  3e-47  Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 169 bits (429), Expect = 3e-47
Identities = 99/435 (22%), Positives = 170/435 (39%), Gaps = 62/435 (14%)

Query: 5 LRNIAIIAHVDHGKTTLVDQLLRQSGTFRENQQIAE--RVMDSNDIEKERGITILAKNCA 62
+ NI ++AHVD GKTTL + LL SG E + + D+ +E++RGITI +
Sbjct: 3 IINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITS 62

Query: 63 VEYEGTHINIVDTPGHADFGGEVERVLSMVDSVLLLVDAVEGPMPQTRFVTKKALALGLK 122
++E T +NI+DTPGH DF EV R LS++D +LL+ A +G QTR + +G+
Sbjct: 63 FQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIP 122

Query: 123 PIVIVNKIDRPGARIDWV-------------INQTFDLFDKLGATE----EQLDFPIV-- 163
I +NKID+ G + V I Q +L+ + T EQ D I
Sbjct: 123 TIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEGN 182

Query: 164 -----------------YASGLNGY---ASLDP-----AAREGDMRPLFEAVLQHVPVRP 198
+ SL P A + L E +
Sbjct: 183 DDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITNKFYSST 242

Query: 199 ADPDAPLQLQITSLDYSTYVGRIGVGRITRGRIKPGQPVAMRFGPEGDVLNRKINQVLSF 258
+ L ++ ++YS R+ R+ G + V R + + KI ++ +
Sbjct: 243 HRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSV--RISEKEKI---KITEMYTS 297

Query: 259 TGLERVQVDSAEAGDIVLINGIEDVGIGATICAVDTPEALPMITVDEPTLTMNFLVNSSP 318
E ++D A +G+IV++ E + + + + I P L +
Sbjct: 298 INGELCKIDKAYSGEIVILQN-EFLKLNSVLGDTKLLPQRERIENPLPLLQTTVEPSKPQ 356

Query: 319 LAGREGKFVTSRQIRDRLMKELNHNVALRVKDTGDETVFEVSGRGELHLTILVENMRRE- 377
+ D L++ V E + +S G++ + + ++ +
Sbjct: 357 QREMLLDALLEISDSDPLLR-------YYVDSATHEII--LSFLGKVQMEVTCALLQEKY 407

Query: 378 GYELAVSRPRVVMQE 392
E+ + P V+ E
Sbjct: 408 HVEIEIKEPTVIYME 422



Score = 34.4 bits (79), Expect = 0.001
Identities = 16/100 (16%), Positives = 32/100 (32%), Gaps = 1/100 (1%)

Query: 387 RVVMQEIDGVRHEPYELLTVDVEDEHQGGVMEELGRRKGEMLDMASDGRGRTRLEYKISA 446
V+++ EPY + E+ + + ++D L +I A
Sbjct: 525 EQVLKKAGTELLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVD-TQLKNNEVILSGEIPA 583

Query: 447 RGLIGFQSEFLTLTRGTGLMSHIFDSYAPVKDGSVFERRN 486
R + ++S+ T G + Y V + R
Sbjct: 584 RCIQEYRSDLTFFTNGRSVCLTELKGYHVTTGEPVCQPRR 623


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_1391  RTXTOXIND  32  0.005  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 31.7 bits (72), Expect = 0.005
Identities = 9/92 (9%), Positives = 29/92 (31%), Gaps = 5/92 (5%)

Query: 48 EVPAPAAGVLAQVLQNDGDTVVADQIIATID---TEAKAGAAEAAAGAAEVKPAAAPAAA 104
E+ ++ +++ +G++V ++ + EA +++ A +
Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARL--EQTRYQI 155

Query: 105 APAAQPAAAVASSSAAASPAASKLLAEKGLSA 136
+ + P + E+ L
Sbjct: 156 LSRSIELNKLPELKLPDEPYFQNVSEEEVLRL 187


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_1392  INTIMIN  31  0.015  Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 30.8 bits (69), Expect = 0.015
Identities = 20/81 (24%), Positives = 31/81 (38%), Gaps = 2/81 (2%)

Query: 103 TSGIEFLFKKNKITWLKGHGKFTGKTDAGVQIEVSGE--GETEVVTAKNVIIATGSKARH 160
SG L + T G T K+D Q+ VS + T + A VI +KA
Sbjct: 601 VSGTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANAVIFVDQTKASI 660

Query: 161 LPNVPVDNKIVSDNEGALTFE 181
V++ + A+T+
Sbjct: 661 TEIKADKTTAVANGQDAITYT 681


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_1397  RTXTOXINA  29  0.048  Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family

signature.
Length = 1024

Score = 28.8 bits (64), Expect = 0.048
Identities = 57/301 (18%), Positives = 107/301 (35%), Gaps = 57/301 (18%)

Query: 140 LGNVVGATS-NTVSGLSSTVKALGTGQLSPLAPVTTPVGTVLDTVANGLTAAGTTIGSTL 198
GN++G + N L L T Q + L + +D + + G S L
Sbjct: 124 AGNILGGGAENIGDNLGKAGGILSTFQ-NFLGTALS--SMKIDELIKKQKSGGNVSSSEL 180

Query: 199 SSGAVQQVTQPLSSAITPLVITAGQVTQQVGTTTGLGQPVSGLLGQVGGAISSAGKQVGS 258
+ +++ + Q L + L +QQ+ T + L + G ++ +
Sbjct: 181 AKASIELINQ-LVDTVASLNNNVNSFSQQLNTLGSVLSNTKHL--------NGVGNKLQN 231

Query: 259 TSNQPLVGDVGQLVTAVGNTVTNAGGLVNPNGPNGAAPIPG--LITSLVGGSTTAVQN-- 314
N +G V+ + + ++ + L N + G L T ++G +
Sbjct: 232 LPNLDNIGAGLDTVSGILSAISASFILSNADADTRTKAAAGVELTTKVLGNVGKGISQYI 291

Query: 315 ---GSSSGSSATNPLGGLLSG---LGSTPLG----------------------------- 339
++ G S + GL++ L +PL
Sbjct: 292 IAQRAAQGLSTSAAAAGLIASAVTLAISPLSFLSIADKFKRANKIEEYSQRFKKLGYDGD 351

Query: 340 SLTGAVGGATGGAGGANPLAPVTGLLNTVTAAVGGAAGSGASSNPLAPVTSLVGGVSGTA 399
SL A TG + L ++ +L +V++ + AA +S APV++LVG V+G
Sbjct: 352 SLLAAFHKETGAIDAS--LTTISTVLASVSSGISAAA---TTSLVGAPVSALVGAVTGII 406

Query: 400 S 400
S
Sbjct: 407 S 407


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_1398  cloacin  35  8e-04  Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 35.1 bits (80), Expect = 8e-04
Identities = 34/113 (30%), Positives = 47/113 (41%), Gaps = 8/113 (7%)

Query: 30 GGSGSISKGISGGSGSGGSDSISTSGGGTSGGTSGSTSGGTSGSTSGSTSGSTSGSTSGS 89
G+ S S I+GG G ++ G G S + + GG SGS GS G+ G
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWS--SENNPWGGGSGSGIHWGGGSGHGNGGG- 67

Query: 90 TSGTTSGTSSGTSGTSGVSANPVG---NVLAQGGNVITSLGGTASGLGSTIAN 139
SG SGT G A PV L+ G ++ +A L + IA+
Sbjct: 68 --NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 31.2 bits (70), Expect = 0.010
Identities = 30/79 (37%), Positives = 37/79 (46%), Gaps = 11/79 (13%)

Query: 39 ISGGSGSG-----GSDSISTSGGGTSGGTSGSTSGGTSGSTS------GSTSGSTSGSTS 87
+SGG G G S S + +GG T G G S G+ S+ GS SG G S
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 88 GSTSGTTSGTSSGTSGTSG 106
G +G +G S G SGT G
Sbjct: 61 GHGNGGGNGNSGGGSGTGG 79


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_1400  PREPILNPTASE  43  2e-07  Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family

signature.
Length = 290

Score = 42.9 bits (101), Expect = 2e-07
Identities = 36/142 (25%), Positives = 60/142 (42%), Gaps = 11/142 (7%)

Query: 4 LLSTSIFFAWAALVAAGDIRFRLVRNSLVICGGTAALVSSLIHANPFGISTGQALIGMLV 63
L+ + + D+ L+ + L + L+ +L+ F +S G A+IG +
Sbjct: 134 TLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLL--GGF-VSLGDAVIGAMA 190

Query: 64 GLVSFFP-------LFAMRVMGAADVKVFAVLGAWCGLPILLWLWVIASLAAGVHVLGLM 116
G + + L MG D K+ A LGAW G L + +++SL +GL+
Sbjct: 191 GYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAFMGIGLI 250

Query: 117 LLTRTPLGALWVRGLPAMALAG 138
LL G P +A+AG
Sbjct: 251 LLRNHHQSKPIPFG-PYLAIAG 271


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_1403  BCTERIALGSPD  133  5e-36  Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D

signature.
Length = 660

Score = 133 bits (337), Expect = 5e-36
Identities = 60/252 (23%), Positives = 113/252 (44%), Gaps = 11/252 (4%)

Query: 160 RSVVQVDVRVVEFSRSVLKEAGLNFFKQSNGFAFGAFSPGGLQSVTGGA----TSAFAAT 215
R V V+ + E + G+ + ++ G S + + GA ++
Sbjct: 344 RPQVLVEAIIAEVQDADGLNLGIQWANKNAGMTQFTNSGLPISTAIAGANQYNKDGTVSS 403

Query: 216 GGIPIASAFNLVVNSAGRGIFG-NISILEANNLARVLAQPTLVALSGQSASFLAGGEIPV 274
S+FN + +G + ++ L ++ +LA P++V L A+F G E+PV
Sbjct: 404 SLASALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPV 463

Query: 275 PVPQALGSTA-----IDWKQYGVGLTLTPTVLSQHRIALKVAPESSQLDFQHGVTINSVS 329
S ++ K G+ L + P + + L++ E S + T +S
Sbjct: 464 LTGSQTTSGDNIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSVADAASST-SSDL 522

Query: 330 VPAITTRRADTTVELGDGESFVIGGLIDRETMSNISKVPVLGDLPIIGAFFKSLNYQQND 389
TR + V +G GE+ V+GGL+D+ KVP+LGD+P+IGA F+S + + +
Sbjct: 523 GATFNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSK 582

Query: 390 KELVIIVTPHLV 401
+ L++ + P ++
Sbjct: 583 RNLMLFIRPTVI 594


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_1404  HTHFIS  40  1e-05  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 40.2 bits (94), Expect = 1e-05
Identities = 23/119 (19%), Positives = 38/119 (31%), Gaps = 3/119 (2%)

Query: 24 DEHLRW-LRDTLVSAGMVEAVSLEPGALAQRILGLN-PAIVFIDFSRAQAEASAAAAAVR 81
D +R L L AG + A R + +V D A ++
Sbjct: 12 DAAIRTVLNQALSRAGY-DVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLPRIK 70

Query: 82 LAHPSLPVVALGTLAQPESALAALRAGVRDFIDVSGSAEDALRITRGLLEHAGAEPANR 140
A P LPV+ + +A+ A G D++ + + I L P+
Sbjct: 71 KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRRPSKL 129


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_1405  cloacin  30  0.031  Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 29.7 bits (66), Expect = 0.031
Identities = 12/25 (48%), Positives = 12/25 (48%)

Query: 430 GGFGGSGGGGGFGGGFGRGGGGFNV 454
G G GG G GGG G GG V
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAV 84


19  Bamb_1459  Bamb_1471  Y  N  Y  Genomic Island
LocusTag   DNBias  CDNBias  %GCBias    Product
Bamb_1459  2       12       0.592971   *hypothetical protein
Bamb_1460  1       10       1.385051   small multidrug resistance protein
Bamb_1461  1       10       1.188806   3-hydroxyisobutyryl-CoA hydrolase
Bamb_1462  -1      11       0.636999   hypothetical protein
Bamb_1463  -1      12       0.710692   alkanesulfonate monooxygenase
Bamb_1464  -2      13       -0.587985  binding-protein-dependent transport system inner
Bamb_1465  -2      19       -2.818294  ABC transporter-like protein
Bamb_1466  1       25       -6.654941  molybdenum-pterin binding
Bamb_1467  1       26       -6.348858  hypothetical protein
Bamb_1468  0       26       -5.856876  hypothetical protein
Bamb_1469  0       24       -5.881619  tRNA-dihydrouridine synthase A
Bamb_1470  1       29       -6.261590  *acyltransferase 3
Bamb_1471  0       27       -5.057418  FkbM family methyltransferase
20  Bamb_1501  Bamb_1510  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Bamb_1501  2       11       0.115746   transcriptional regulator CysB-like protein
Bamb_1502  2       10       1.772167   hypothetical protein
Bamb_1503  2       8        2.481080   periplasmic binding protein/LacI transcriptional
Bamb_1504  2       10       2.381022   ABC transporter-like protein
Bamb_1505  -1      10       -0.331904  inner-membrane translocator
Bamb_1506  -1      9        -1.313808  LacI family transcriptional regulator
Bamb_1507  -1      7        -2.322312  ribokinase
Bamb_1508  -1      10       -3.449204  methyl-accepting chemotaxis sensory transducer
Bamb_1509  -1      9        -4.076362  hypothetical protein
Bamb_1510  0       9        -3.616639  serine protein kinase PrkA
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_1508  PF03544  31  0.013  Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 30.7 bits (69), Expect = 0.013
Identities = 21/96 (21%), Positives = 27/96 (28%), Gaps = 5/96 (5%)

Query: 488 EAAAAAQSLDEQAARLRDTAAVFRIDDDAAQPAAAPAARQAPRAVPAPAAVPVSSAAPAA 547
A Q + P AP + P+ P P PV
Sbjct: 56 APADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPV----KKV 111

Query: 548 ARDDRDATPKRAAP-VRKPAGAPAAPAPAAATAGGD 582
+ RD P + P APA P + ATA
Sbjct: 112 EQPKRDVKPVESRPASPFENTAPARPTSSTATAATS 147


21  Bamb_1522  Bamb_1558  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Bamb_1522  2       10       -0.103630  RND efflux system outer membrane lipoprotein
Bamb_1523  2       11       -1.518535  fimbrial protein
Bamb_1524  2       13       -0.845205  fimbrial biogenesis outer membrane usher
Bamb_1525  1       15       -1.077493  pili assembly chaperone
Bamb_1526  1       12       0.035191   fimbrial protein
Bamb_1527  3       13       2.435498   hypothetical protein
Bamb_1528  4       13       4.785740   hypothetical protein
Bamb_1529  4       14       4.933530   extracytoplasmic-function sigma-70 factor
Bamb_1530  4       13       4.848919   MbtH domain-containing protein
Bamb_1531  2       12       4.441787   taurine catabolism dioxygenase TauD/TfdA
Bamb_1532  3       9        5.087966   ABC transporter-like protein
Bamb_1533  3       9        5.193220   iron-hydroxamate transporter permease subunit
Bamb_1534  3       9        5.073546   ferric iron reductase
Bamb_1535  2       8        4.863478   periplasmic binding protein
Bamb_1536  2       8        4.372074   cyclic peptide transporter
Bamb_1537  2       8        4.430322   amino acid adenylation domain-containing
Bamb_1538  2       8        3.479302   amino acid adenylation domain-containing
Bamb_1539  2       11       1.836060   hypothetical protein
Bamb_1540  2       11       0.424873   lysine/ornithine N-monooxygenase
Bamb_1541  3       11       -0.789308  TonB-dependent siderophore receptor
Bamb_1542  -1      17       -2.595792  folate-dependent phosphoribosylglycinamide
Bamb_1543  -1      20       -3.029386  hypothetical protein
Bamb_1544  0       21       -2.423531  SpoVT/AbrB domain-containing protein
Bamb_1545  -1      14       -1.777841  PilT domain-containing protein
Bamb_1546  -1      13       -0.200380  dehydratase
Bamb_1547  0       11       0.537473   IS605 family transposase OrfB
Bamb_1548  0       10       2.746603   hypothetical protein
Bamb_1549  1       11       3.740180   hypothetical protein
Bamb_1550  2       12       3.755185   amidohydrolase
Bamb_1551  2       13       3.806912   cobyrinic acid a,c-diamide synthase
Bamb_1552  -1      10       3.372122   cob(I)yrinic acid a,c-diamide
Bamb_1553  0       9        3.849395   cobalamin biosynthesis protein CbiG
Bamb_1554  0       8        3.776902   uroporphyrin-III C-methyltransferase
Bamb_1555  -1      9        3.276714   hypothetical protein
Bamb_1556  -1      10       3.049790   cobalamin biosynthesis protein CobW
Bamb_1557  -2      9        3.300939   cobaltochelatase subunit CobN
Bamb_1558  1       14       3.112011   magnesium chelatase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_1524  PF00577  679  0.0  Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 679 bits (1753), Expect = 0.0
Identities = 241/865 (27%), Positives = 364/865 (42%), Gaps = 65/865 (7%)

Query: 2 RIRHSFLCVSVLVVGSPSHATEFNSSFLDIDGTSNVDLSQFSQPDFTLPGEYMLDVQVND 61
+R C S FN FL D + DLS+F PG Y +D+ +N+
Sbjct: 27 FVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDIYLNN 86

Query: 62 LFYGLQAIQFIALDASGAGKPCLPPELVARFGLKPSLAKDLPRLQGGRCVDLG-AIEGAT 120
+ + + F D+ PCL +A GL + + L CV L I AT
Sbjct: 87 GYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPLTSMIHDAT 146

Query: 121 VRYLKSDGRLKITVPQAALEFTDSTYLPPSSWSDGIAGAMLDYRVIANTNRNFGSGGGQT 180
+ RL +T+PQA + Y+PP W GI +L+Y N+ +N GG +
Sbjct: 147 AQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQN--RIGGNS 204

Query: 181 NSIQAYGTIGANWDAWRFRGDYQAQSNVGNTAYADRT-FRFSRLYAFRALPSIQSTVTFG 239
+ G N AWR R + N +++ + ++ + R + ++S +T G
Sbjct: 205 HYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLTLG 264

Query: 240 DDYLSSDIFDTFALTGASIRSDDRMLPPSLRGYAPLISGVARTNATVTVSQVGRVLYVTR 299
D Y DIFD GA + SDD MLP S RG+AP+I G+AR A VT+ Q G +Y +
Sbjct: 265 DGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYNST 324

Query: 300 VSPGAFALQNIN-TSVQGTLDVTVDEEDGSVQRFQVTTAAVPFLARTGQLRYKAAVGKPR 358
V PG F + +I G L VT+ E DGS Q F V ++VP L R G RY G+ R
Sbjct: 325 VPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGEYR 384

Query: 359 QFGGAGVTPFFGFGEVAYGLPFDVTLYGGFIAASGYTSIALGVGRDFGTFGAVSADVTHA 418
P F + +GLP T+YGG A Y + G+G++ G GA+S D+T A
Sbjct: 385 SGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMTQA 444

Query: 419 RAHLWWNGATRNGNSYRINYSKHFDGLDADVRFFGYRFSEREYTNFAQFSGDPTAYGL-- 476
+ L + + +G S R Y+K + +++ GYR+S Y NFA +
Sbjct: 445 NSTL-PDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNIE 503

Query: 477 -------------------ANSKQRYSATMSKRFGDTST-YFSYDQTTYW-ARASEQRVG 515
N + + T++++ G TST Y S TYW +++
Sbjct: 504 TQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSNVDEQFQ 563

Query: 516 LTLTRAFSIGALRNLNVSVSAFRTQSAGASGNQFSVTATLPIGGRHTVTSNLTTGSGSTS 575
L AF ++N ++S T++A G + + I H + S+ + S
Sbjct: 564 AGLNTAF-----EDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHAS 618

Query: 576 ANAGYIYDDPAGRT----------------YQINAGATDGRASANASFRQRTSTYQ---- 615
A+ +D T Y + G G + S T Y+
Sbjct: 619 ASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYG 678

Query: 616 -LSAQASTLANAYAAASLEVDGSLVATQYGVSAHANGNAGDTRLLVSTDGVPDVPLS-GT 673
+ S ++ V G ++A GV+ DT +LV G D + T
Sbjct: 679 NANIGYSH-SDDIKQLYYGVSGGVLAHANGVTLGQPL--NDTVVLVKAPGAKDAKVENQT 735

Query: 674 LTHTDSRGYAVLDGISPYNVFDATVNVEKLPLEVQVSNPIQRMVLTDGAIGFVKFSAARG 733
TD RGYAVL + Y ++ L V + N + +V T GAI +F A G
Sbjct: 736 GVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVG 795

Query: 734 SNLYLTLTDAAGKPLPFGASVQDAANGKELGIVGEAGAAFLTQVQPKSALVVRAGERT-- 791
L +TLT KPLPFGA V + + GIV + G +L+ + + V+ GE
Sbjct: 796 IKLLMTLTH-NNKPLPFGAMVTS-ESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENA 853

Query: 792 LCAVN-ALPNQLQLEG-TPIPVTCQ 814
C N LP + Q + T + C+
Sbjct: 854 HCVANYQLPPESQQQLLTQLSAECR 878


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_1532  PF05272  29  0.023  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.3 bits (65), Expect = 0.023
Identities = 26/83 (31%), Positives = 35/83 (42%), Gaps = 9/83 (10%)

Query: 35 VTALCGPNGCGKSTLLRTLAGLQPARAGHVDV-NGKPLASFRRRALARELTMLAQFN--- 90
L G G GKSTL+ TL GL H D+ GK +A EL+ + F
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIVAYELSEMTAFRRAD 657

Query: 91 --QIPSGLTVRE---LVAYGRYA 108
+ + + R+ AYGRY
Sbjct: 658 AEAVKAFFSSRKDRYRGAYGRYV 680


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_1534  2FE2SRDCTASE  78  3e-19  Ferric iron reductase signature.
		>2FE2SRDCTASE#Ferric iron reductase signature.

Length = 262

Score = 78.2 bits (192), Expect = 3e-19
Identities = 58/198 (29%), Positives = 88/198 (44%), Gaps = 16/198 (8%)

Query: 58 DAMVRHYGGDPAQHARALMSQWSKYYFGRAAPAGVVAALTLGRPLDMAPERTFVAL-DDG 116
D + R+ ++ + L+S W+++Y G P ++A LT + LD++PE + G
Sbjct: 75 DHIYRNQPMMIREN-KPLISLWAQWYIGLMVPPLMLALLTQEKALDVSPEHFHAEFHETG 133

Query: 117 MPAALYF--APDALGAPCSEPASRYAGLVAHLGAVIDLLAAMGRVTPRVLWSNAGNLLDY 174
A + D P S + L V+ L A G + +++WSN G L+++
Sbjct: 134 RVACFWVDVCEDKNATPHSPQHRMETLISQALVPVVQALEATGEINGKLIWSNTGYLINW 193

Query: 175 LLDTYRSLPCAA--DPVRDADWLFGASCVRGEPNPLRLPVRDAVPRSALLPTPFRARRVC 232
L + L A + +R A F + GE NPL R V R LL RR C
Sbjct: 194 YLTEMKQLLGEATVESLRHA-LFFEKTLTNGEDNPL---WRTVVLRDGLL-----VRRTC 244

Query: 233 CLRYEIPGETQLCGSCPL 250
C RY +P Q CG C L
Sbjct: 245 CQRYRLPD-VQQCGDCTL 261


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_1535  FERRIBNDNGPP  113  2e-31  Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 113 bits (284), Expect = 2e-31
Identities = 68/272 (25%), Positives = 115/272 (42%), Gaps = 15/272 (5%)

Query: 67 PQRVVALDFMFAESVIALDLVPVGMADTAFYPGWLGYGSDRLAHVTDIGSRQEPGLEAIA 126
P R+VAL+++ E ++AL +VP G+ADT Y W+ V D+G R EP LE +
Sbjct: 35 PNRIVALEWLPVELLLALGIVPYGVADTINYRLWVSEPP-LPDSVIDVGLRTEPNLELLT 93

Query: 127 AVKPDLIIGVGFRHAPIFAALDRIAPTILFQFSPNVSEDGVPVTQLDWMREIFRTIGAVT 186
+KP ++ + P L RIAP F FS L R+ + +
Sbjct: 94 EMKPSFMVW-SAGYGPSPEMLARIAPGRGFNFSDGKQ-------PLAMARKSLTEMADLL 145

Query: 187 GRDARAKAVEAQLDAGIARNAARLKAAGRSGERIALLQDLGLPDRYWAYTGNSTSAGLAR 246
+ A+ AQ + I R + G R LL L P + NS +
Sbjct: 146 NLQSAAETHLAQYEDFIRSMKPRFV---KRGARPLLLTTLIDPRHMLVFGPNSLFQEILD 202

Query: 247 ALGLE-PWPKKPTREGTLYVTSADLLRQRDLAVLFVTATGTDVPLSSKLDSPVWRFVPAL 305
G+ W + G+ V+ L +D+ VL + + + + +P+W+ +P +
Sbjct: 203 EYGIPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNSKD-MDALMATPLWQAMPFV 261

Query: 306 RDHRIALIERNIWGFGGPMSALKLADVMTDTM 337
R R + +W +G +SA+ V+ + +
Sbjct: 262 RAGRFQRVP-AVWFYGATLSAMHFVRVLDNAI 292


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_1541  PF06776  30  0.020  Invasion associated locus B
		>PF06776#Invasion associated locus B

Length = 214

Score = 30.3 bits (68), Expect = 0.020
Identities = 13/62 (20%), Positives = 21/62 (33%), Gaps = 10/62 (16%)

Query: 8 RLRAIA-AAASVTFGMAAGHAFAQTAPAVNAGAATSTDSARNGAAASAGASAPNGPATGT 66
L+AI A ++ +A+ A+ A A GA A A + + A
Sbjct: 23 ALKAIQMGPAELSPMLASCRRLARRNGARLMLA---------GAMAIALSFGWSDRADAQ 73

Query: 67 LP 68

Sbjct: 74 GA 75


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_1550  TYPE4SSCAGA  32  0.005  Type IV secretion system CagA exotoxin signature.
		>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature.

Length = 1147

Score = 32.0 bits (72), Expect = 0.005
Identities = 15/35 (42%), Positives = 20/35 (57%)

Query: 237 AIHAGDAPNVIPDRAQMRLSVRALKPEVRDLLEAR 271
AI+ P+V PD A ++ L PE RDLL+ R
Sbjct: 227 AINQEPVPHVQPDIATTTTDIQGLPPEARDLLDER 261


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_1558  HTHFIS  43  1e-06  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 43.3 bits (102), Expect = 1e-06
Identities = 56/314 (17%), Positives = 95/314 (30%), Gaps = 35/314 (11%)

Query: 7 PRARAVFPFAALVAQQP-----LQQALLLAAIDPSLGGVLVSGPRGTAKSTAARALAELL 61
LV + + L D + ++++G GT K ARAL +
Sbjct: 128 KLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLT---LMITGESGTGKELVARALHDYG 184

Query: 62 P--EGEFVTLPLSASDEQVTGTLDLAHALAA--NGVRFRPGLLARAHRGVLYVDEVNLLA 117
G FV + ++A + + H A G +A G L++DE+ +
Sbjct: 185 KRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMP 244

Query: 118 DGLVDTLLDVAASGVNVVERDGVSHAH--DARFVLVGTMNPE----EGELRPQLLDRFGL 171
LL V G G D R V + + +G R L R +
Sbjct: 245 MDAQTRLLRVLQQG--EYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNV 302

Query: 172 M-VELENCFDAAQRERIVK-ARLAFDLDPDAFRARHASAQRELRDGIHAARARLSALDFD 229
+ + L R+R L F + +++ A + A +
Sbjct: 303 VPLRL-----PPLRDRAEDIPDLV-----RHFVQQAEKEGLDVKRFDQEALELMKAHPWP 352

Query: 230 DAVHA---RVSALCIDAAVDGLRADLVMLRAARALAALEQADAVTVSHVERVADAVLRHR 286
V V L D + +++ + A S ++ AV +
Sbjct: 353 GNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENM 412

Query: 287 RHAGAPPSSGAPPS 300
R A PPS
Sbjct: 413 RQYFASFGDALPPS 426


22  Bamb_1568  Bamb_1584  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Bamb_1568  2       14       2.254170   IclR family transcriptional regulator
Bamb_1569  2       14       3.134168   chitinase
Bamb_1570  2       15       4.485163   hypothetical protein
Bamb_1571  2       12       4.945772   precorrin-3B C(17)-methyltransferase
Bamb_1572  1       12       5.186993   precorrin-2 C(20)-methyltransferase
Bamb_1573  2       13       5.313494   precorrin-8X methylmutase
Bamb_1574  3       12       5.234788   precorrin-3B synthase
Bamb_1575  4       14       4.659410   precorrin-6y C5,15-methyltransferase subunit
Bamb_1576  3       11       2.986503   cobalt-precorrin-6A synthase
Bamb_1577  0       11       2.121985   cobalt-precorrin-6x reductase
Bamb_1578  1       11       3.227255   precorrin-4 C(11)-methyltransferase
Bamb_1579  1       10       3.809780   major facilitator superfamily transporter
Bamb_1580  -1      9        3.585255   MarR family transcriptional regulator
Bamb_1581  0       9        3.280303   glutathione S-transferase domain-containing
Bamb_1582  0       9        3.123235   porin
Bamb_1583  2       10       3.721595   ATP-dependent transcription regulator LuxR
Bamb_1584  2       11       2.664381   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_1579  TCRTETB  32  0.005  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 31.8 bits (72), Expect = 0.005
Identities = 22/124 (17%), Positives = 42/124 (33%), Gaps = 2/124 (1%)

Query: 69 GIGDGAASLLTTIPILLMGVGALSARRLQRVTGIAGGVWLGVALIGFAC-ASRIGAQHAW 127
+ + + T +L +G +L GI + G+ + F +G
Sbjct: 45 NKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFS 104

Query: 128 VLLASACCAGIGIAMVQALLPGFVKTHFA-TRIGGAMGVYSTSIMGGAVLASVVAPFAAA 186
+L+ + G G A AL+ V + G A G+ + + G + + A
Sbjct: 105 LLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAH 164

Query: 187 RWGW 190
W
Sbjct: 165 YIHW 168


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_1581  adhesinmafb  29  0.016  Neisseria meningitidis: adhesin MafB signature.
		>adhesinmafb#Neisseria meningitidis: adhesin MafB signature.

Length = 467

Score = 28.9 bits (64), Expect = 0.016
Identities = 21/74 (28%), Positives = 34/74 (45%), Gaps = 6/74 (8%)

Query: 88 FESGAILIYLADKTGQLMPKDAAGRYETIQWVMFQMGGIGP----MFGQVGFFHKFAGRD 143
+E G D G + D G+ IQ QMG + + G +G+ +F+G
Sbjct: 45 YEPGGKYHLFGDPRGSV--SDRTGKINVIQDYTHQMGNLLIQQANINGTIGYHTRFSGHG 102

Query: 144 YEDKRPRDRYAAES 157
+E+ P D +AA+S
Sbjct: 103 HEEHAPFDNHAADS 116


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_1582  NEISSPPORIN  66  3e-14  Neisseria sp. porin signature.
		>NEISSPPORIN#Neisseria sp. porin signature.

Length = 348

Score = 66.2 bits (161), Expect = 3e-14
Identities = 103/408 (25%), Positives = 149/408 (36%), Gaps = 94/408 (23%)

Query: 1 MKTKLAALAALAGCSALAQAQSSVTLYGVVDTGLLYQSTSAASFRPNAPNTGKVFRMKDG 60
MK L AL A A A + VTLYG + G+ ++R GKV +++ G
Sbjct: 1 MKKSLIALTLAALPVA---AMADVTLYGAIKAGV-------QTYRSVEHTDGKVSKVETG 50

Query: 61 ---GIYSSFWGIKGSEDLGGGYKVNFKL-QGSFDSGTGRLQLSDTPGAVAIFNQIASLGV 116
+ S G KG EDLG G K ++L QG+ +GT N+ + +G+
Sbjct: 51 SEIADFGSKIGFKGQEDLGNGLKAVWQLEQGASVAGTNT----------GWGNKQSFVGL 100

Query: 117 SGPFGTVTAGRQIVPMIYAMADTDVRNAQFFGSVLTAWLGLNTAAGWPATSTNGAIGALY 176
G FGT+ AG + + +T G+ + AW S Y
Sbjct: 101 KGGFGTIRAGS----LNSPLKNT--------GANVNAWESGKFTGNVLEISGMAQREHRY 148

Query: 177 DSNALVYQSPTFAGVSLALEYAP-------------------GGVAGQFQGGTRESVVLR 217
S + Y SP FAG S +++YAP G Q+ G
Sbjct: 149 LS--VRYDSPEFAGFSGSVQYAPKDNSGSNGESYHVGLNYQNSGFFAQYAG--------L 198

Query: 218 YSNYGLNASAVYYNGHDTNPAPGVAPT--------GVDNNRFVYVGAKYTIRDFSVSASY 269
+ YG + Y+ + G DNN +YV +D + Y
Sbjct: 199 FQRYGEGTKKIEYDDQTYSIPSLFVEKLQVHRLVGGYDNNA-LYVSVAAQQQDAKL---Y 254

Query: 270 GNGRNPAHADRVNLDMLSAGVGYRF---TPALQVASAVYYLKDRNRSANRSTAVVLTADY 326
G +H + ++A YRF TP + A D N VV+ A+Y
Sbjct: 255 GAMSGNSHNSQTE---VAATAAYRFGNVTPRVSYAHGFKGTVDSANHDNTYDQVVVGAEY 311

Query: 327 SLSKRTMVYAQAGHVNNRGTMDQMLVYGQPVAPGVGTTAAMVGLRHNF 374
SKRT AG + G +V +TA+ V LRH F
Sbjct: 312 DFSKRTSALVSAGWL-QGGKGADKIV----------STASAVVLRHKF 348


23  Bamb_1616  Bamb_1625  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Bamb_1616  0       24       -3.997734  hypothetical protein
Bamb_1617  -2      21       -3.382968  hypothetical protein
Bamb_1618  -1      20       -2.502475  hypothetical protein
Bamb_1619  0       21       -2.741327  major facilitator superfamily transporter
Bamb_1620  0       23       -3.454750  LysR family transcriptional regulator
Bamb_1621  1       23       -3.236489  LuxR family transcriptional regulator
Bamb_1622  1       22       -2.260124  hypothetical protein
Bamb_1623  1       20       -2.470724  major facilitator superfamily transporter
Bamb_1624  0       20       -3.579543  glyoxalase/bleomycin resistance
Bamb_1625  0       20       -3.743854  O-methyltransferase family protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_1623  TCRTETB  55  4e-10  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 54.5 bits (131), Expect = 4e-10
Identities = 55/297 (18%), Positives = 110/297 (37%), Gaps = 15/297 (5%)

Query: 57 WNTTLFMAASIIGAPLSATVLSRFGPRTAYLVALIVFCAGTLACA-SAKDMPWMLAGRAA 115
W T FM IG + + + G + L +I+ C G++ ++ R
Sbjct: 53 WVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFI 112

Query: 116 QGLGGGILFALSYALIRVVFDERLWSRAMAMVSGMWGVATLCGPAIGGIFAQSGTWRLAF 175
QG G AL ++ + +A ++ + + GPAIGG+ A W ++
Sbjct: 113 QGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHW--SY 170

Query: 176 IALLPVAAVLALIVIVQLPAHEVSGVPAARPAIGKILLLAASVLVVSVASLSREIIGNVV 235
+ L+P+ ++ + +++L EV I I+L++ ++ + + S I +V
Sbjct: 171 LLLIPMITIITVPFLMKLLKKEVR--IKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIV 228

Query: 236 GVGAGLAIALLIARLERGATTRLLPTGAYDIRAPLGAIYACMSLLVVGMTTEIFVPYFLQ 295
V L+ + + + + T + G + + + VPY ++
Sbjct: 229 SV---LSFLIFVKHIRK-VTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMK 284

Query: 296 TIHGYPPLAAGYLTALMAAGWTVGSLVSSGRSPAAVQALVRSGPLVVVIALVTLAIV 352
+H G + G++ + R GPL V+ VT V
Sbjct: 285 DVHQLSTAEIGSVIIF------PGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSV 335


24  Bamb_1666  Bamb_1706  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Bamb_1666  1       12       3.228397   hypothetical protein
Bamb_1667  0       13       2.894473   hypothetical protein
Bamb_1668  1       14       2.253569   sigma-54 dependent trancsriptional regulator
Bamb_1669  0       14       1.443547   beta-lactamase domain-containing protein
Bamb_1670  -1      12       0.211461   hypothetical protein
Bamb_1671  1       14       -1.566356  hypothetical protein
Bamb_1672  1       12       -3.241791  binding-protein-dependent transport system inner
Bamb_1673  2       9        -3.180658  binding-protein-dependent transport system inner
Bamb_1674  0       11       -0.182115  spermidine/putrescine ABC transporter ATPase
Bamb_1675  0       12       -0.333843  extracellular solute-binding protein
Bamb_1676  -2      11       0.887318   hypothetical protein
Bamb_1677  -1      9        0.888956   hypothetical protein
Bamb_1678  -2      9        1.235214   periplasmic chaperone protein
Bamb_1679  0       8        2.084943   fimbrial biogenesis outer membrane usher
Bamb_1680  -1      11       1.690747   OmpW family protein
Bamb_1681  2       10       3.270592   2-nitropropane dioxygenase
Bamb_1682  3       10       3.296080   betaine aldehyde dehydrogenase
Bamb_1683  5       11       3.719870   hypothetical protein
Bamb_1684  5       11       3.943272   major facilitator superfamily transporter
Bamb_1685  4       11       3.926699   2,3-dihydroxybenzoate-2,3-dehydrogenase
Bamb_1686  2       11       3.025298   amino acid adenylation domain-containing
Bamb_1687  0       12       2.248596   hypothetical protein
Bamb_1688  0       11       2.463643   isochorismatase
Bamb_1689  -1      10       2.997438   2,3-dihydroxybenzoate-AMP ligase
Bamb_1690  -2      10       2.755368   isochorismate synthase
Bamb_1691  0       11       2.548250   TonB-dependent receptor, plug
Bamb_1692  0       12       3.445814   AraC family transcriptional regulator
Bamb_1693  1       11       3.322534   Fe3+-hydroxamate ABC transporter periplasmic
Bamb_1694  2       11       2.892284   esterase
Bamb_1695  1       12       2.105973   transport system permease
Bamb_1696  1       12       2.329820   ABC transporter-like protein
Bamb_1697  2       13       1.362426   polysaccharide deacetylase
Bamb_1698  2       13       0.966890   aminoglycoside phosphotransferase
Bamb_1699  1       11       0.140592   cephalosporin hydroxylase
Bamb_1700  0       12       0.081634   glycosyl transferase family protein
Bamb_1701  1       16       -2.083807  asparagine synthase
Bamb_1702  2       25       -5.445978  hypothetical protein
Bamb_1703  1       18       -5.883439  aminoglycoside phosphotransferase
Bamb_1704  2       18       -6.550315  hypothetical protein
Bamb_1705  0       12       -4.102725  ABC transporter ATPase
Bamb_1706  -1      11       -3.338518  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_1668  HTHFIS  332  e-112  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 332 bits (854), Expect = e-112
Identities = 120/376 (31%), Positives = 183/376 (48%), Gaps = 49/376 (13%)

Query: 121 DARGDVIAYVERLTTVRSASAQPSAEGLVGGADAFNAALGALQRVAPSMLPVLLLGESGT 180
G +A +R + +Q LVG + A L R+ + L +++ GESGT
Sbjct: 113 GIIGRALAEPKRRPSKLEDDSQDGM-PLVGRSAAMQEIYRVLARLMQTDLTLMITGESGT 171

Query: 181 GKELFARALHEASARAMGPFVVVDCSGIAETLFESELFGYEKGAFTGANQRKPGLVETAQ 240
GKEL ARALH+ R GPFV ++ + I L ESELFG+EKGAFTGA R G E A+
Sbjct: 172 GKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAE 231

Query: 241 GGTLFLDEIGDVPLPMQVKLLRLIESGTFRRVGGVEALRADFRLVAATHKPLREMIDDGR 300
GGTLFLDEIGD+P+ Q +LLR+++ G + VGG +R+D R+VAAT+K L++ I+ G
Sbjct: 232 GGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGL 291

Query: 301 FRQDLYYRINAFPIPLPALRERQGDVALLAESILRRIANARGNAGDASARPFAARPFVLT 360
FR+DLYYR+N P+ LP LR+R D+ L +++
Sbjct: 292 FREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGL------------DVKRFD 339

Query: 361 ERARACLDAYAWPGNIRELRNVLERACLFADDGTIRVEHLP----AELVAAAAAPQERDA 416
+ A + A+ WPGN+REL N++ R I E + +E+ + +
Sbjct: 340 QEALELMKAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARS 399

Query: 417 DARGLSDAE--------------------------------LVRIARTFDGTRKALAEHV 444
+ +S A ++ G + A+ +
Sbjct: 400 GSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLL 459

Query: 445 GMSERTLYRRMKALGL 460
G++ TL ++++ LG+
Sbjct: 460 GLNRNTLRKKIRELGV 475


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1679  PF00577       494    e-164    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 494 bits (1274), Expect = e-164
Identities = 178/875 (20%), Positives = 300/875 (34%), Gaps = 96/875 (10%)

Query: 16 LSAVASCMPCAAA-TNPGNTAPDTVEFDTGALREHGIDATASRYFAHAPRFMPGTASVRL 74
L+ + A A + + F+ L + F + PGT V +
Sbjct: 23 LAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDI 82

Query: 75 TVNGKRAGRADARFDDNGN-----LCATPGLLRAAGLVVPDALRDATAATARSAGADTPP 129
+N D F+ + C T L + GL TA+ +
Sbjct: 83 YLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGL--------NTASVSGMNLLADDA 134

Query: 130 CYDYRRAYPQTIVTLRPADGAVDLVVPPDAL---DTGRRTPDMFEHGGFAGLVNYDLLAM 186
C L ++L +P + G P++++ G AGL+NY+
Sbjct: 135 CVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGN 194

Query: 187 TTRNP-AGTSQYWQATTEAGFNAADWIVRSSQIAT------VVDGHAGIDHQAAYAQRTF 239
+ +N G S Y ++G N W +R + + H + +R
Sbjct: 195 SVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDI 254

Query: 240 ASRGATLQAGQIVPRSTLFAIGRTFGVQMFPDEALAVT--PGAAARVTGIARTQARVEVR 297
+ L G + +F G Q+ D+ + G A + GIAR A+V ++
Sbjct: 255 IPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIK 314

Query: 298 QLGVLIHASQLPPGPFTLTDLPLVSGSADLDVTVVEATGDVQHFIVPGSSLPGAGLAAAQ 357
Q G I+ S +PPGPFT+ D+ S DL VT+ EA G Q F VP SS+P
Sbjct: 315 QNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHT 374

Query: 358 GLAIAVGRLQNDGYAQ-APWLATATRGWQIRQRARLNAGILVSSPLQSGAASVEFAPLAG 416
+I G ++ Q P +T + + G ++ ++ + G
Sbjct: 375 RYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKN--MG 432

Query: 417 VAAAVGVDLTRAAGR-------NGAQTRIALSSNGDTPLRANVSFVRR---TSGYRELTD 466
A+ VD+T+A +G R + + + N+ V TSGY D
Sbjct: 433 ALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLN-ESGTNIQLVGYRYSTSGYFNFAD 491

Query: 467 AVRS------------------------TDAFTPPARTQFAAALGWHDRTLGMLSFDYTR 502
S A+ + Q L +
Sbjct: 492 TTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRT--STLYLSGSH 549

Query: 503 VAVFDGPAMQRIAGA-WTRPLGRGSLALNVSRT--FGTRGAIGTQVYLSATVPIGT---- 555
+ + A + L+ S T +G + L+ +P
Sbjct: 550 QTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGR-DQMLALNVNIPFSHWLRS 608

Query: 556 ------RSVSAFANVTGDS-----VRSGARYSDTFGRTGSYSVAADYDTAIRSPSIRATV 604
R SA +++ D +G + SYSV Y S +T
Sbjct: 609 DSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNS-GSTG 667

Query: 605 SATPKHVRAIVNAGL---YGADRATLGVNLRGAVALLDGVGMLSPYEIRDTLALARVGDR 661
AT + NA + + D L + G V L + DT+ L +
Sbjct: 668 YATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQP-LNDTVVLVKAPGA 726

Query: 662 AGIELVTPSGPVWTDRHGRAVIASLPAYTQTLVRINTKSLPRNLDLKNGIQTVEAGRGSV 721
++ +G V TD G AV+ Y + V ++T +L N+DL N + V RG++
Sbjct: 727 KDAKVENQTG-VRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAI 785

Query: 722 SRVEFAVEQTRRVLLTVTLSNGAPLPVLSTVVDDDDRFVTVSGSEGRLLLTGAQLTTPLR 781
R EF ++L+T+T N PLP + V + + + G++ L+G L ++
Sbjct: 786 VRAEFKARVGIKLLMTLT-HNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQ 844

Query: 782 VAL--PDGAYCRLAIALPATPPATTRYYERADARC 814
V + A+C LP + + + A C
Sbjct: 845 VKWGEEENAHCVANYQLPPE--SQQQLLTQLSAEC 877


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1684  TCRTETA       70     2e-15    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 70.2 bits (172), Expect = 2e-15
Identities = 80/345 (23%), Positives = 130/345 (37%), Gaps = 40/345 (11%)

Query: 53 HVGAIIGIVGLVWMIAAPRWGRVADTRGRLPAMRAAIGGFVASSLLLTGYVGWALRDGGS 112
H G ++ + L+ AP G ++D GR P + ++ G ++ +
Sbjct: 44 HYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIM------------A 91

Query: 113 AVPPVWVGFAALLVTRAAMGGCYAGLPVAAMAWIADRTPATGRAAVIARFGAAGAIGMVL 172
P +WV + +V G A A+IAD T RA A GMV
Sbjct: 92 TAPFLWVLYIGRIV-----AGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVA 146

Query: 173 APPLAGWLAGFDMTLALAVFALLP-LAGLAGLRRLRDDGAHAVRRASPRLKPTDPRVRLP 231
P L G + GF A L L L G L + +H R R + +P
Sbjct: 147 GPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPE--SHKGERRPLRREALNPLASFR 204

Query: 232 WCSAFALYSAVM-----------IANSVLGFYVIDRLHVRTGDASMVAGYVLGSAGIGLI 280
W + +A+M + ++ + DR H G L + GI
Sbjct: 205 WARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTI----GISLAAFGILHS 260

Query: 281 AAQSVV-GRL-RAVAPLQWLRWGALAGGIGFVSTLAAPAGHPMLLCASYFVAACGMGAAF 338
AQ+++ G + + + L G +A G G++ L A A + + A G G
Sbjct: 261 LAQAMITGPVAARLGERRALMLGMIADGTGYI--LLAFATRGWMAFPIMVLLASG-GIGM 317

Query: 339 PAVAALASTRVEAHEQAACAGTMSMAQGLSMVVAPLAGTMLYELH 383
PA+ A+ S +V+ Q G+++ L+ +V PL T +Y
Sbjct: 318 PALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAAS 362



Score = 34.4 bits (79), Expect = 7e-04
Identities = 17/55 (30%), Positives = 23/55 (41%)

Query: 335 GAAFPAVAALASTRVEAHEQAACAGTMSMAQGLSMVVAPLAGTMLYELHPAAPFV 389
GA A + + E+A G MS G MV P+ G ++ P APF
Sbjct: 110 GATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFF 164


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1685  DHBDHDRGNASE  265    1e-91    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 265 bits (679), Expect = 1e-91
Identities = 137/252 (54%), Positives = 175/252 (69%)

Query: 13 VAVVTGAARGIGAAVVRALAACGVDVAAFDIDADALSRAHEDRPPAPGRVHPFAVDVADA 72
+A +TGAA+GIG AV R LA+ G +AA D + + L + F DV D+
Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDS 69

Query: 73 RAVRAAIEHVEGTVGPIGMLANVAGVLRLAAATSLTDDDWAHCFAVNAHGVFHLSRAVAQ 132
A+ +E +GPI +L NVAGVLR SL+D++W F+VN+ GVF+ SR+V++
Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSK 129

Query: 133 HMIERRAGSIVTVGSNAALVPRTQMAAYAASKAAAHQFTRCLGLELAGHGIRCNIVAPGS 192
+M++RR+GSIVTVGSN A VPRT MAAYA+SKAAA FT+CLGLELA + IRCNIV+PGS
Sbjct: 130 YMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGS 189

Query: 193 TDTPMQRQLWRGPEGPAGVIAGSLETYRTGIPLGRIAAPDDIAGTVLFLLSDAARHVTLH 252
T+T MQ LW G VI GSLET++TGIPL ++A P DIA VLFL+S A H+T+H
Sbjct: 190 TETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHITMH 249

Query: 253 TLCVDGGATLGV 264
LCVDGGATLGV
Sbjct: 250 NLCVDGGATLGV 261


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1688  ISCHRISMTASE  357    e-126    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 357 bits (917), Expect = e-126
Identities = 151/303 (49%), Positives = 201/303 (66%), Gaps = 15/303 (4%)

Query: 1 MAIPKIASYPMP--AELPANRVNWRFDPHRAALLVHDMQDYFLDFYDRAAAPVPTLVAHV 58
MAIP I Y MP +++P N+V+W DP+RA LL+HDMQ+YF+D + A+PV L A++
Sbjct: 1 MAIPAIQPYQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANI 60

Query: 59 RRLIDFARAAGMPVYYTAQPATQAAADRALLTDMWGPGLTAQPSRAAICDALAPASGDTV 118
R+L + G+PV YTAQP +Q DRALLTD WGPGL + P I LAP D V
Sbjct: 61 RKLKNQCVQLGIPVVYTAQPGSQNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLV 120

Query: 119 LDKWRYSAFRRSPLETCLREQARDQLAICGIYAHIGCLMTACDAFMRDVQPFFVADALAD 178
L KWRYSAF+R+ L +R++ RDQL I GIYAHIGCL+TAC+AFM D++ FFV DA+AD
Sbjct: 121 LTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVAD 180

Query: 179 FSEREHRMALDYVAGRCGMAVTTDALVGV---APADDSAP----------MLVAIAAQVA 225
FS +H+MAL+Y AGRC V TD+L+ APAD I Q+A
Sbjct: 181 FSLEKHQMALEYAAGRCAFTVMTDSLLDQLQNAPADVQKTSANTGKKNVFTCENIRKQIA 240

Query: 226 RSLRIPAADLREHDNLIDCGLDSIRMMTLVEQWRAQGYDVTFVQLAEQPTLGAWTQVLQR 285
L+ D+ + ++L+D GLDS+R+MTLVEQWR +G +VTFV+LAE+PT+ W ++L
Sbjct: 241 ELLQETPEDITDQEDLLDRGLDSVRIMTLVEQWRREGAEVTFVELAERPTIEEWQKLLTT 300

Query: 286 AAQ 288
+Q
Sbjct: 301 RSQ 303


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1696  SSPANPROTEIN  29     0.026    Salmonella invasion protein InvJ signature.
		>SSPANPROTEIN#Salmonella invasion protein InvJ signature.

Length = 336

Score = 28.6 bits (63), Expect = 0.026
Identities = 37/163 (22%), Positives = 69/163 (42%), Gaps = 8/163 (4%)

Query: 37 LLGPNGVGKSTLLRALARLASPTGRATFGSFDLLDGSRREHTRQVGYLPQTLPQPSSLLV 96
L +G S L+A+ ++ T AT S D + ++ G + + + + L
Sbjct: 131 LESVSGKKLSADLKAVESVSEVTDNATGISDDNIKALPGDNKAIAG---EGVRKEGAPLA 187

Query: 97 YEAVRSALRATCGGLSDATRDRRLQQVFTRLRLHPLAMSPLDRLSGGQRQMVGLAQVLVR 156
+ + + A G + ++++ V ++L L P ++ L +L+GG +M AQ
Sbjct: 188 RDVAPARMAAANTGKPEDKDHKKVKDV-SQLPLQPTTIADLSQLTGGDEKMPLAAQSKPM 246

Query: 157 DTPLLLLD---EPTSALDLRWQLLALE-AVGEAARQRGAIVLV 195
T D S+L R+Q + +V ARQ G L+
Sbjct: 247 MTIFPTADGVKGEDSSLTYRFQRWGNDYSVNIQARQAGEFSLI 289


25  Bamb_1774  Bamb_1784  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Bamb_1774  1       11       3.317177    lipoyl synthase
Bamb_1775  0       10       2.732358    branched-chain alpha-keto acid dehydrogenase E2
Bamb_1776  0       10       2.426618    transketolase, central region
Bamb_1777  -2      8        2.894134    pyruvate dehydrogenase
Bamb_1778  -2      9        3.164463    ATP-NAD/AcoX kinase
Bamb_1780  0       10       0.725141    Fis family GAF modulated sigma54 specific
Bamb_1781  3       10       -0.710226   carboxymuconolactone decarboxylase
Bamb_1782  3       9        -0.287765   MerR family transcriptional regulator
Bamb_1783  4       11       -0.096867   hypothetical protein
Bamb_1784  2       11       -0.376512   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1780  HTHFIS        314    e-102    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 314 bits (807), Expect = e-102
Identities = 126/326 (38%), Positives = 177/326 (54%), Gaps = 40/326 (12%)

Query: 351 ELALRVASKRLPILVLGETGAGKEVFARAIHDAGARRARPFVAVNCGALPETLIESELFG 410
+ R+ L +++ GE+G GKE+ ARA+HD G RR PFVA+N A+P LIESELFG
Sbjct: 151 RVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFG 210

Query: 411 YAAGAFTGARKHGARGKIALADGGTLFLDEIGDMPLTLQTRLLRVLADGEVVPLGSDTPV 470
+ GAFTGA+ G+ A+GGTLFLDEIGDMP+ QTRLLRVL GE +G TP+
Sbjct: 211 HEKGAFTGAQTRST-GRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPI 269

Query: 471 RVDLDVICATHRDLARMVADGTFREDLYYRLSGATFELPPLRERADVGDVIATVFAEEAQ 530
R D+ ++ AT++DL + + G FREDLYYRL+ LPPLR+RA+ + F ++A+
Sbjct: 270 RSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAE 329

Query: 531 ATG-HVLTLDPTLAAQLAAYPWPGNVRQLRNVLRYACAVCDAACVTRRDLPADLAAQLGA 589
G V D + A+PWPGNVR+L N++R A+ +TR + +L +++
Sbjct: 330 KEGLDVKRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPD 389

Query: 590 GPAG--------------------------------------ALPDDERGRIVAALTAHR 611
P L + E I+AALTA R
Sbjct: 390 SPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATR 449

Query: 612 WRPDAAAKALGISRATLYRRIAKHRI 637
AA LG++R TL ++I + +
Sbjct: 450 GNQIKAADLLGLNRNTLRKKIRELGV 475


26  Bamb_1835  Bamb_1886  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Bamb_1835  1       34       -6.963720   alpha/beta hydrolase domain-containing protein
Bamb_1836  7       52       -11.352130  oxidoreductase dehydrogenase family protein
Bamb_1837  8       59       -12.615656  hypothetical protein
Bamb_1838  9       60       -12.798287  stress responsive alpha-beta barrel
Bamb_1839  9       60       -12.731934  phage integrase family protein
Bamb_1840  10      63       -12.641559  hypothetical protein
Bamb_1841  9       56       -11.184045  hypothetical protein
Bamb_1842  6       48       -8.235265   adenine-specific DNA methylase-like protein
Bamb_1843  4       38       -5.792689   hypothetical protein
Bamb_1844  5       43       -7.549683   hypothetical protein
Bamb_1845  6       42       -8.034362   PAAR repeat-containing protein
Bamb_1846  6       42       -8.186969   hypothetical protein
Bamb_1847  6       41       -7.497530   hypothetical protein
Bamb_1848  6       40       -7.878155   hypothetical protein
Bamb_1849  2       35       -6.262719   hypothetical protein
Bamb_1850  -1      29       -5.087475   D12 class N6 adenine-specific DNA
Bamb_1851  -2      27       -3.218340   hypothetical protein
Bamb_1852  -1      24       -2.406154   glycoside hydrolase
Bamb_1853  -1      24       -2.839995   phage holin family 2 protein
Bamb_1854  -1      21       -3.072159   phage late control D family protein
Bamb_1855  -1      19       -2.986113   phage tail X family protein
Bamb_1856  -2      20       -2.949955   phage P2 GpU family protein
Bamb_1857  0       24       -3.589633   pyocin R2_PP, tail length determination protein
Bamb_1858  2       26       -4.499044   hypothetical protein
Bamb_1859  1       27       -4.073622   phage major tail tube protein
Bamb_1860  1       28       -3.746298   phage tail sheath protein
Bamb_1861  2       32       -3.899884   bacteriophage-acquired protein
Bamb_1862  2       36       -5.683176   phage tail collar domain-containing protein
Bamb_1863  0       35       -5.647494   phage tail protein I
Bamb_1864  0       34       -5.465670   baseplate J family protein
Bamb_1865  1       35       -5.883227   GPW/gp25 family protein
Bamb_1866  0       33       -5.892668   hypothetical protein
Bamb_1867  -1      27       -4.824261   hypothetical protein
Bamb_1868  0       21       -1.828192   phage baseplate assembly protein V
Bamb_1869  0       22       -1.319250   hypothetical protein
Bamb_1870  -1      20       -0.747088   hypothetical protein
Bamb_1871  0       20       -0.426743   hypothetical protein
Bamb_1872  -1      19       -1.492957   hypothetical protein
Bamb_1873  0       21       -2.034768   hypothetical protein
Bamb_1874  0       24       -2.479313   peptidase S14, ClpP
Bamb_1875  1       23       -3.487409   lambda family phage portal protein
Bamb_1876  0       25       -4.188482   hypothetical protein
Bamb_1877  1       26       -4.523568   phage terminase GpA
Bamb_1878  0       29       -4.993344   hypothetical protein
Bamb_1879  1       28       -4.770295   hypothetical protein
Bamb_1880  2       27       -4.740935   virulence-associated E family protein
Bamb_1881  6       51       -9.542896   hypothetical protein
Bamb_1882  6       54       -10.255815  hypothetical protein
Bamb_1883  7       55       -10.899982  XRE family transcriptional regulator
Bamb_1884  7       53       -10.568314  hypothetical protein
Bamb_1885  5       36       -6.892790   hypothetical protein
Bamb_1886  3       32       -6.480365   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1850  AUTOINDCRSYN  29     0.010    Autoinducer synthesis protein signature.
		>AUTOINDCRSYN#Autoinducer synthesis protein signature.

Length = 216

Score = 29.4 bits (66), Expect = 0.010
Identities = 14/39 (35%), Positives = 18/39 (46%), Gaps = 7/39 (17%)

Query: 140 EELSAAHLRL-ANTFIERLDWA-ACI-----DRYDRPHT 171
E S L TF +RL+WA C D+YD +T
Sbjct: 14 ETKSGELFTLRKETFKDRLNWAVQCTDGMEFDQYDNNNT 52


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1861  CHANLCOLICIN  30     0.006    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 30.4 bits (68), Expect = 0.006
Identities = 22/58 (37%), Positives = 29/58 (50%), Gaps = 3/58 (5%)

Query: 196 EAERAEKAAREAEEAARQEEEAARADGEDEMATDAATTTPADSASVSDTAPDVEIDTK 253
EAE AEKA +EAE ++ +E R E E A A++S+ A VEI K
Sbjct: 142 EAEAAEKAFQEAE---QRRKEIEREKAETERQLKLAEAEEKRLAALSEEAKAVEIAQK 196


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1874  TONBPROTEIN   33     0.002    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 32.7 bits (74), Expect = 0.002
Identities = 12/37 (32%), Positives = 14/37 (37%)

Query: 205 VRALVDETEPTDPPQPDTQPNTPPEPNPDPQPVPPAP 241
V E P P+ PEP P P+P AP
Sbjct: 50 VTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAP 86



Score = 30.3 bits (68), Expect = 0.010
Identities = 8/26 (30%), Positives = 10/26 (38%)

Query: 217 PPQPDTQPNTPPEPNPDPQPVPPAPD 242
P QP P P+P+P P
Sbjct: 56 EPPQAVQPPPEPVVEPEPEPEPIPEP 81


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1875  PHPHTRNFRASE  29     0.038    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 29.4 bits (66), Expect = 0.038
Identities = 11/30 (36%), Positives = 16/30 (53%)

Query: 351 DLRDVSDRVLRVLLNEFRRSIEQLQQNVFI 380
D+RDVS RVL L+ S+ + + I
Sbjct: 130 DIRDVSKRVLGHLIGVETGSLATIAEETVI 159


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1877  TONBPROTEIN   29     0.049    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 29.2 bits (65), Expect = 0.049
Identities = 15/66 (22%), Positives = 22/66 (33%), Gaps = 7/66 (10%)

Query: 577 LMTEAHWRVEQVRISQVSLFDAVPILESLPSALPVETLPTVQTDADPPPPIEPVQPVAKP 636
L T H +E +Q PI ++ + +E VQ +P EP
Sbjct: 28 LYTSVHQVIELPAPAQ-------PISVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPE 80

Query: 637 PETPPP 642
P P
Sbjct: 81 PPKEAP 86


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1880  PF05272       563    0.0      Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 563 bits (1452), Expect = 0.0
Identities = 141/468 (30%), Positives = 207/468 (44%), Gaps = 33/468 (7%)

Query: 399 ETPTKPAATSAAAKQP--EWDGREAENGAHTWEQD----LARSDKGTLLPTLGNVHMILS 452
E P K ++ A P G + E+ W D L + L P + L
Sbjct: 401 EPPKKRDPSAGAGTDPGGPGGGDDGEDPFGEWLDDEVARLRLRGRWLLKPRRAALIEALR 460

Query: 453 NHKAWQGVIEQDDFGGRVMKRKAPPFPQGVTGEWTDMDDQRCALWLSQRYG-LSVRTDIV 511
+ A G + D+ + + +A P+ + G D D R A ++ YG
Sbjct: 461 SAPALAGCVAFDELREQPVAVRAFPWRKA-PGPLEDADVLRLADYVETTYGTGEASAQTT 519

Query: 512 MNAVLLVADATHFHDVREYLEGLKWDGVPRVRSMPSTYLRVADS-------EYVQLAFMK 564
A+ + AD H R++++ +WD VPR+ L Y+QL
Sbjct: 520 EQAINVAADMNRVHPFRDWVKAQQWDEVPRLEKWLVHVLGKTPDDYKPRRLRYLQLVGKY 579

Query: 565 WMIAAVARVMEPGCKVDNVLILEGKQGHRKSTALKVLAGAPWFTDTPIQIG-NKDTYAVL 623
++ VARVMEPGCK D ++LEG G KST + L G +F+DT IG KD+Y +
Sbjct: 580 ILMGHVARVMEPGCKFDYSVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQI 639

Query: 624 AGKWVIELAELDSLNKADSSAVKSFFATAVDRFRNFYGKRATDVPRQCVFAGSVNFDTYL 683
AG EL+E+ + +AD+ AVK+FF++ DR+R YG+ D PRQ V + N YL
Sbjct: 640 AGIVAYELSEMTAFRRADAEAVKAFFSSRKDRYRGAYGRYVQDHPRQVVIWCTTNKRQYL 699

Query: 684 KDESGNRRYWPLRVGGLVDIDGIVAVRDQLWAEAVHLYRTGVVWHVE-EHERPLFEIEQA 742
D +GNRR+WP+ V G ++ + R QL+AEA+HLY G + E E F EQ
Sbjct: 700 FDITGNRRFWPVLVPGRANLVWLQKFRGQLFAEALHLYLAGERYFPSPEDEEIYFRPEQE 759

Query: 743 ERYEGDVYEDKI--------AKALE------FVSRTTMEEI--LADILKLDTSKWTLAEQ 786
R + ++ A A E + TT I L L D K + +
Sbjct: 760 LRLVETGVQGRLWALLTREGAPAAEGAAQKGYSVNTTFVTIADLVQALGADPGKSSPMLE 819

Query: 787 RRIGKALKSLGWVRKRESTGSRGWYYVSEQQEPEVERELVAAGDDDSP 834
++ L GW RE++G R Y+ Q P V E A +P
Sbjct: 820 GQVRDWLNENGWEYLRETSGQRRRGYMRPQVWPPVIAEDKEADQAHAP 867


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1881  CHANLCOLICIN  28     0.014    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 27.7 bits (61), Expect = 0.014
Identities = 15/27 (55%), Positives = 18/27 (66%)

Query: 79 ERAARACAALATKAERHAFRDQLTDRL 105
E+AARA AA +A+ A RD LT RL
Sbjct: 68 EQAARAKAAAEAQAKAKANRDALTQRL 94


27  Bamb_1899  Bamb_1913  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Bamb_1899  -1      11       3.058645    BolA family protein
Bamb_1900  -1      11       2.590433    PpiC-type peptidyl-prolyl cis-trans isomerase
Bamb_1901  0       11       3.004812    N-acetyltransferase GCN5
Bamb_1902  -1      10       2.873144    phosphoribosylformylglycinamidine synthase
Bamb_1903  -2      8        2.209671    D-amino-acid dehydrogenase
Bamb_1904  0       9        0.037046    carbohydrate kinase
Bamb_1905  2       9        -2.275141   glucose-6-phosphate isomerase
Bamb_1906  2       8        -2.902811   ABC transporter-like protein
Bamb_1907  4       10       -3.508923   arylesterase
Bamb_1908  4       10       -2.589263   PpiC-type peptidyl-prolyl cis-trans isomerase
Bamb_1909  5       9        -2.585551   **ATP-dependent protease La
Bamb_1910  2       7        -0.891558   ATP-dependent protease ATP-binding subunit ClpX
Bamb_1911  0       8        0.882583    ATP-dependent Clp protease proteolytic subunit
Bamb_1912  -2      8        1.526412    trigger factor
Bamb_1913  -1      9        3.173396    glycerate kinase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1900  IGASERPTASE   28     0.043    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 28.1 bits (62), Expect = 0.043
Identities = 19/104 (18%), Positives = 30/104 (28%), Gaps = 6/104 (5%)

Query: 17 AAPAFAQNIAVVNGTP-IPKSRADAMVAQLVQQGQTDSPQLQQAVRQELVNREILMQEAI 75
A V P P + + Q+ +T Q A NRE+ +
Sbjct: 1015 EEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAK- 1073

Query: 76 REGIPNRPDVKAQVAVAQQTVVLRSMIESFLKKNQPTDAEVKAR 119
N VAQ + + K+ + E KA+
Sbjct: 1074 ----SNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAK 1113


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1909  GPOSANCHOR    42     1e-05    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 41.6 bits (97), Expect = 1e-05
Identities = 34/192 (17%), Positives = 73/192 (38%), Gaps = 15/192 (7%)

Query: 92 KVLVEGLQRAKALSIEEQETQFSCDVMPLEPDHADSAETEALRRAIVSQFDQYVKLNKKI 151
L + L+ A S + + + A A+ E ++ K +
Sbjct: 193 AELEKALEGAMNFSTADSAKIKTLEAE-KAALAARKADLEKALEGAMNFSTADSAKIKTL 251

Query: 152 PPEILTSLSGIDEAGRLADMIAERLPLKLDQKQHILEMFPVIERLEHLLAQLEAEIDILQ 211
E + E + + + + + LE A LE + +L
Sbjct: 252 EAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAE---KAALEAEKADLEHQSQVLN 308

Query: 212 VEKRIRGRVKRQMEKSQREYYLNEQVKAIQKELGEGEEGAD--LEELEKRINAARMPKEA 269
R ++R ++ S+ +Q++A ++L E + ++ + L + ++A+R EA
Sbjct: 309 AN---RQSLRRDLDASREAK---KQLEAEHQKLEEQNKISEASRQSLRRDLDASR---EA 359

Query: 270 KKKADAELKKLK 281
KK+ +AE +KL+
Sbjct: 360 KKQLEAEHQKLE 371



Score = 30.0 bits (67), Expect = 0.040
Identities = 29/103 (28%), Positives = 44/103 (42%), Gaps = 18/103 (17%)

Query: 201 AQLEAEIDILQVEKRI----RGRVKRQMEKSQRE--------YYLNEQVKAIQKELGEGE 248
QLEAE L+ + +I R ++R ++ S+ N ++ A++K E E
Sbjct: 361 KQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELE 420

Query: 249 EGADLEELEKRINAARMPKEAKK------KADAELKKLKLMSP 285
E L E EK A++ EAK K EL KL+
Sbjct: 421 ESKKLTEKEKAELQAKLEAEAKALKEKLAKQAEELAKLRAGKA 463


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1910  HTHFIS        31     0.013    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.6 bits (69), Expect = 0.013
Identities = 19/113 (16%), Positives = 37/113 (32%), Gaps = 16/113 (14%)

Query: 43 LCNEIIRDEAAAAGVEASLSRSDLPSPQEIRDILDQYVIGQERAKKILAVAVYNHYKRLK 102
L E A PS E ++G+ A + +
Sbjct: 102 LPKPFDLTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAA-----------MQEIY 150

Query: 103 HLDKKDDVELSKSNILLIGPTGSGKTLLAQTLARL---LNVPFVIADATTLTE 152
+ + + + +++ G +G+GK L+A+ L N PFV + +
Sbjct: 151 RVLAR--LMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPR 201


28  Bamb_1954  Bamb_2031  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Bamb_1954  2       10       1.543809    GntR family transcriptional regulator
Bamb_1955  0       9        0.852641    L,D-carboxypeptidase A
Bamb_1956  0       11       0.759676    CMP/dCMP deaminase
Bamb_1957  -1      11       1.234833    hypothetical protein
Bamb_1958  0       10       1.190330    major facilitator superfamily transporter
Bamb_1959  -1      15       -0.381758   GntR family transcriptional regulator
Bamb_1960  0       21       -1.713981   oxidoreductase
Bamb_1961  4       35       -4.184850   4Fe-4S ferredoxin
Bamb_1962  6       38       -5.202704   HEAT repeat-containing PBS lyase
Bamb_1963  10      44       -8.180070   hypothetical protein
Bamb_1964  9       42       -9.635047   hypothetical protein
Bamb_1965  9       38       -8.883178   exonuclease
Bamb_1966  8       37       -8.259572   hypothetical protein
Bamb_1967  8       35       -7.914691   ATPase central domain-containing protein
Bamb_1968  8       33       -7.112747   hypothetical protein
Bamb_1969  7       35       -6.722426   hypothetical protein
Bamb_1970  8       36       -6.034730   hypothetical protein
Bamb_1971  8       37       -5.852087   ATPase central domain-containing protein
Bamb_1972  9       35       -5.086423   hypothetical protein
Bamb_1973  9       36       -5.166195   hypothetical protein
Bamb_1974  10      37       -5.936367   dihydrolipoamide dehydrogenase
Bamb_1975  11      34       -6.381778   thioredoxin domain-containing protein
Bamb_1976  12      33       -6.381963   thioredoxin
Bamb_1977  11      33       -6.209363   NADH:flavin oxidoreductase
Bamb_1978  11      33       -5.129031   OsmC family protein
Bamb_1979  11      34       -4.323791   DSBA oxidoreductase
Bamb_1980  10      30       -2.932754   TetR family transcriptional regulator
Bamb_1981  9       28       -1.058560   alkylhydroperoxidase
Bamb_1982  9       26       -0.150249   short-chain dehydrogenase/reductase SDR
Bamb_1983  8       26       1.288964    hypothetical protein
Bamb_1984  7       29       0.948933    hypothetical protein
Bamb_1985  9       31       -0.643907   replication initiator and transcription
Bamb_1986  10      34       -1.573397   cobyrinic acid a,c-diamide synthase
Bamb_1987  10      35       -2.497667   hypothetical protein
Bamb_1988  10      41       -4.718770   Type IV secretory pathway protease TraF-like
Bamb_1989  11      40       -4.665277   hypothetical protein
Bamb_1990  11      42       -6.627350   LysR family transcriptional regulator
Bamb_1991  10      36       -4.265656   hypothetical protein
Bamb_1992  9       35       -3.763813   hypothetical protein
Bamb_1993  10      37       -3.921453   beta-lactamase domain-containing protein
Bamb_1994  11      39       -4.586559   LysR family transcriptional regulator
Bamb_1995  11      41       -5.050089   ThiJ/PfpI domain-containing protein
Bamb_1996  12      43       -5.728265   major facilitator superfamily transporter
Bamb_1997  10      40       -5.877875   hypothetical protein
Bamb_1998  10      38       -5.482393   MarR family transcriptional regulator
Bamb_1999  8       35       -4.223825   ABC transporter-like protein
Bamb_2000  7       30       -1.903678   LysR family transcriptional regulator
Bamb_2001  7       27       -1.260372   lipoprotein
Bamb_2002  6       25       -1.135590   conjugal transfer coupling protein TraG
Bamb_2003  7       26       -1.012274   CopG/DNA-binding domain-containing protein
Bamb_2004  7       27       -0.712943   type II secretion system protein E
Bamb_2005  9       26       -1.022033   conjugal transfer protein TrbC
Bamb_2006  8       26       -1.060173   conjugal transfer trbD transmembrane protein
Bamb_2007  7       25       -1.136373   conjugal transfer ATPase TrbE
Bamb_2008  8       25       -0.441323   conjugal transfer protein TrbJ
Bamb_2009  8       27       -0.106166   lipoprotein
Bamb_2010  9       27       -1.359109   conjugal transfer protein TrbL
Bamb_2011  9       26       -2.187622   conjugal transfer protein TrbF
Bamb_2012  10      26       -2.620325   conjugal transfer protein TrbG/VirB9/CagX
Bamb_2013  10      30       -3.675643   conjugation TrbI family protein
Bamb_2014  11      35       -5.954074   hypothetical protein
Bamb_2015  11      36       -6.423554   hypothetical protein
Bamb_2016  9       32       -5.431322   transposase, IS4 family protein
Bamb_2017  4       20       -4.415434   hypothetical protein
Bamb_2018  2       18       -2.868622   phage transcriptional regulator AlpA
Bamb_2019  2       13       -1.956759   prophage CP4-57 regulatory
Bamb_2020  1       12       -0.391694   hypothetical protein
Bamb_2021  1       11       0.145558    phage integrase family protein
Bamb_2022  -2      10       2.585634    GMP synthase
Bamb_2023  -1      11       3.802804    hypothetical protein
Bamb_2024  0       11       3.565122    inosine 5'-monophosphate dehydrogenase
Bamb_2025  1       11       4.787133    hypothetical protein
Bamb_2026  -1      12       3.116232    hypothetical protein
Bamb_2027  -2      11       2.514656    hypothetical protein
Bamb_2028  -2      11       -0.631455   hypothetical protein
Bamb_2029  0       11       -1.267912   hypothetical protein
Bamb_2030  0       9        -3.259188   hypothetical protein
Bamb_2031  1       10       -3.713285   cyclase/dehydrase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1954  CARBMTKINASE  27     0.041    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 27.5 bits (61), Expect = 0.041
Identities = 15/55 (27%), Positives = 26/55 (47%), Gaps = 6/55 (10%)

Query: 78 VASPTLQDVHEVFEMRRIIELAVVERLATGPG------AKRLKGVAALIDKERKA 126
V SP + E +++++E V+ + G G +KGV A+IDK+
Sbjct: 165 VPSPDPKGHVEAETIKKLVERGVIVIASGGGGVPVILEDGEIKGVEAVIDKDLAG 219


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1958  TCRTETB       39     4e-05    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 38.7 bits (90), Expect = 4e-05
Identities = 28/111 (25%), Positives = 45/111 (40%), Gaps = 4/111 (3%)

Query: 271 GGRIAFGLLGDRLGAKRMLVAGLLAQALGALGYYFVRTLDGFYVVATLFGFIYAGVMP-L 329
G +G L D+LG KR+L+ G++ G++ + + ++A A P L
Sbjct: 64 IGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPAL 123

Query: 330 YAVLARENFPLRMMGTVIGGCAMAGSLGMAAGPVAGGLIVDALGSYGWLYL 380
V+ P G G ++G GP GG+I + W YL
Sbjct: 124 VMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYI---HWSYL 171


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1970  SUBTILISIN    56     2e-10    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 56.4 bits (136), Expect = 2e-10
Identities = 63/295 (21%), Positives = 101/295 (34%), Gaps = 62/295 (21%)

Query: 274 KVAILDGGLPKQHA-IGPWLRSYRVLDEDADDDTEGLE----HGLAV--TSAFLFGPIQP 326
KVA+LD G H + + R +D + D E + HG V T A
Sbjct: 44 KVAVLDTGCDADHPDLKARIIGGRNFTDDDEGDPEIFKDYNGHGTHVAGTIAATENENGV 103

Query: 327 NGAAARPFAYVDHLRVLDKDAETEDPLELYRTLGFVEQVLLSRQYQF------VNLSLGP 380
G A P A + ++VL+K G + ++ Y +++SLG
Sbjct: 104 VGVA--PEADLLIIKVLNKQGS-----------GQYDWIIQGIYYAIEQKVDIISMSLGG 150

Query: 381 DLPIEDTDVHAWTSVIDDLLSDGDTLMTVAIGNNGERDRPSGNARVQVPSDCVNALAVGA 440
DV + ++ L+ A GN G D + P ++VGA
Sbjct: 151 P-----EDVPELHEAVKKAVASQ-ILVMCAAGNEG--DGDDRTDELGYPGCYNEVISVGA 202

Query: 441 SNDTEAGWARAPYSAIGPGRSPGVVKPDLMAFGGDAAKYFHVLASGKKPSLLPQLGTSFA 500
N + +S DL+A G D +L++ GTS A
Sbjct: 203 INFD---RHASEFSNSNNE-------VDLVAPGED------ILSTVPGGKYATFSGTSMA 246

Query: 501 SPYLLRSAVGIRAIL--------GTELTPLAIKALLVHAAEPLEHDKLEVGWGKV 547
+P+ G A++ +LT + A L+ PL + G G +
Sbjct: 247 TPH----VAGALALIKQLANASFERDLTEPELYAQLIKRTIPLGNSPKMEGNGLL 297


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1980  HTHTETR       70     4e-17    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 69.7 bits (170), Expect = 4e-17
Identities = 33/189 (17%), Positives = 64/189 (33%), Gaps = 6/189 (3%)

Query: 2 AVGTRDALIQTAENLMRTRGYTAFSYADLSEAVGIRKASIHHHFPTKEDLGAAIVEEYID 61
A TR ++ A L +G ++ S ++++A G+ + +I+ HF K DL + I E
Sbjct: 9 AQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSES 68

Query: 62 RVR-----TEFGRIEMQHEHVIGRLEAFLQIFRSSADGGLLPLCGALAAEMSALPLSLQQ 116
+ + + L L+ + LL E +QQ
Sbjct: 69 NIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQ 128

Query: 117 LTQRFFDMQLSWLSRALEQGIKRKEIPEGSGAKHKAFLLLSILEGSCFINWATRDEDPLS 176
+ + + L+ I+ K +P + A ++ + NW +
Sbjct: 129 AQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYI-SGLMENWLFAPQSFDL 187

Query: 177 PSVVRLIVE 185
R V
Sbjct: 188 KKEARDYVA 196


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1982  DHBDHDRGNASE  124    1e-36    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 124 bits (312), Expect = 1e-36
Identities = 81/260 (31%), Positives = 125/260 (48%), Gaps = 9/260 (3%)

Query: 1 MNTKQFEGRKLLVVGGTSGIGLEVARMVAEQGGAVVVIGNRAEKAEVARQELAAIAGERK 60
MN K EG+ + G GIG VAR +A QG + + EK E L A A R
Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEA--RH 58

Query: 61 AAAYASDLSDAASVKSLLGRLAAEQGDIDLLVNTAGIYYPKAFLEHSVEDYNNFLDLNRS 120
A A+ +D+ D+A++ + R+ E G ID+LVN AG+ P S E++ +N +
Sbjct: 59 AEAFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNST 118

Query: 121 LFFITQQVAASLVARKKPGSIVNVTAVAARQAIETVPASAYSMAKIGLDAMTRHVAAELA 180
F + + + ++ GSIV V + A + +AY+ +K T+ + ELA
Sbjct: 119 GVFNASRSVSKYMMDRRSGSIVTVGSNPAG--VPRTSMAAYASSKAAAVMFTKCLGLELA 176

Query: 181 EQGIRVNSVSPAMVETKI-----FERFIPGEHLGSALDGFNAFHPLGRNGTPRDVAETVI 235
E IR N VSP ET + + + + +L+ F PL + P D+A+ V+
Sbjct: 177 EYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVL 236

Query: 236 FLLSDKASWVTGAVWDVDGG 255
FL+S +A +T VDGG
Sbjct: 237 FLVSGQAGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1996  TCRTETB       46     1e-07    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 46.4 bits (110), Expect = 1e-07
Identities = 31/158 (19%), Positives = 59/158 (37%), Gaps = 1/158 (0%)

Query: 32 TISADLHASVAQVGTAVTAYALGAVIGAPGLTALATRWPRKRLLLVAMGLFTLGNAVVSL 91
I+ D + A TA+ L IG L+ + KRLLL + + G+ + +
Sbjct: 39 DIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFV 98

Query: 92 SDALTPMLV-ARFASGLGHGVFLAVASSVATQLAGRHRAGAAVAVVFGGLTLALALGVPL 150
+ +L+ ARF G G F A+ V + + G A ++ + + +G +
Sbjct: 99 GHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAI 158

Query: 151 GTYLGSVLSWQVIFMAVAASGAVGFLGLLALMPTDRDD 188
G + + W + + + + L R
Sbjct: 159 GGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIK 196


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_2010  PRTACTNFAMLY  31     0.012    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 31.2 bits (70), Expect = 0.012
Identities = 32/140 (22%), Positives = 45/140 (32%), Gaps = 7/140 (5%)

Query: 280 AVGAAGTAVAIGAAATGLGGAVVAGARMAPAAAKLAGAGARAATSAAGSARLAFQAGSAA 339
A GA +GA+ L G + G R A AA GA A R AG A
Sbjct: 213 ASGAPAAVSVLGASELTLDGGHITGGRAAGVAAM---QGAVVHLQRATIRRGDAPAGGAV 269

Query: 340 AGGGARGAMA----GSSNVAKTGAQSFGRSAASGASGAAQKMTGSFRAGWNGTEAGGGAA 395
GG G G +G + + AQ + + G G
Sbjct: 270 PGGAVPGGAVPGGFGPGGFGPVLDGWYGVDVSGSSVELAQSIVEAPELGAAIRVGRGARV 329

Query: 396 SGAAGSGQAAAGEATDSAAS 415
+ + GS A G ++ +
Sbjct: 330 TVSGGSLSAPHGNVIETGGA 349


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_2011  PF04335       59     6e-13    VirB8 type IV secretion protein
		>PF04335#VirB8 type IV secretion protein

Length = 227

Score = 59.5 bits (144), Expect = 6e-13
Identities = 42/216 (19%), Positives = 70/216 (32%), Gaps = 12/216 (5%)

Query: 20 YQAAAQVWD-ERIGSARVQAKNWRLMAFGCLVLALLMAGGLVWRSAQSIVTPYVIEVDQS 78
Y A W+ +++ +A K ++A LA + + V PYVI VD++
Sbjct: 13 YFEEAASWERDKLAAAERSKKLAWVVAGVAGALATAGVVAVAALTPLKTVEPYVITVDRN 72

Query: 79 GQVRTVGE---AATPYRPADAQIAHHLARFVTLVRSLSIDPIVVRQNWLDAYDYTTDKGA 135
++ +A + LA +V + + +
Sbjct: 73 TGEASIAAKLHGDATITYDEAVRKYFLATYVRYRE--GWIAAAREEYFDAVMVMSARPEQ 130

Query: 136 A-VLNDYASKN--DPFARVGRE-SVTVQITSVTRASDASFNVRWTEQRFVNGVPAGTERW 191
Y + N P + V V+I V+ V +T + V G +
Sbjct: 131 DRWSRFYKTDNPQSPQNILANRTDVFVEIKRVSFLGGNVAQVYFT-KESVTGSNSTKTDA 189

Query: 192 NAVLSI-VLQTPRTEQRLRKNPLGIYVNGLSWSREL 226
A + V TP E KNPLG V E+
Sbjct: 190 VATIKYKVDGTPSKEVDRFKNPLGYQVESYRADVEV 225


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_2012  PF03544       29     0.025    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 28.8 bits (64), Expect = 0.025
Identities = 19/99 (19%), Positives = 33/99 (33%), Gaps = 3/99 (3%)

Query: 27 QGKPPPSISLDEPVQAQPLPELPKPVEVV---AVPEPLALPAQLKPLPEVGEAAPVPEPA 83
+PPP ++ + +P+PE PK VV P+P P +K + + E
Sbjct: 65 AVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESR 124

Query: 84 DEKVRVSRANAEARIAPTREGYVNAIQVWPYSDGALYQV 122
+ A A + + AL +
Sbjct: 125 PASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRN 163


29  Bamb_2095  Bamb_2104  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Bamb_2095  2       9        3.069613    hypothetical protein
Bamb_2096  4       8        2.960237    acriflavin resistance protein
Bamb_2097  4       10       3.002161    RND family efflux transporter MFP subunit
Bamb_2098  3       11       2.444523    RND efflux system outer membrane lipoprotein
Bamb_2099  4       10       0.921464    hypothetical protein
Bamb_2100  3       10       0.492743    two component transcriptional regulator
Bamb_2101  4       12       -1.672933   sensor signal transduction histidine kinase
Bamb_2102  1       12       -3.198446   two component transcriptional regulator
Bamb_2103  1       12       -3.117351   hypothetical protein
Bamb_2104  0       12       -3.402207   cytochrome C oxidase subunit IV
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_2095  TONBPROTEIN   28     0.006    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 28.4 bits (63), Expect = 0.006
Identities = 13/44 (29%), Positives = 14/44 (31%)

Query: 34 VPAPVYVAPAPVYAPPPPPVVYQPAPVYAPAPVYAPAPVYAPAP 77
V P P P P P + APV P P P P
Sbjct: 61 VQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVK 104



Score = 27.6 bits (61), Expect = 0.010
Identities = 12/43 (27%), Positives = 12/43 (27%)

Query: 35 PAPVYVAPAPVYAPPPPPVVYQPAPVYAPAPVYAPAPVYAPAP 77
P V P P P P APV P P P
Sbjct: 56 EPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKP 98


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_2096  ACRIFLAVINRP  624    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 624 bits (1610), Expect = 0.0
Identities = 241/1069 (22%), Positives = 427/1069 (39%), Gaps = 57/1069 (5%)

Query: 4 LVRLALARPYTFIVLALLILIAGPLAALRTPTDIFPDIRIPVISVVWNYAGLQPADMAGR 63
+ + RP VLA+++++AG LA L+ P +P I P +SV NY G +
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 64 IVTYYERTLGTTVNDVAHIESQSFRSYGI-VKIFFQPSVDIRTATAQVTSISQTVLKQMP 122
+ E+ + ++++ ++ S S + + + + FQ D A QV + Q +P
Sbjct: 61 VTQVIEQNM-NGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLP 119

Query: 123 PGTTPPQILNYNASTVPVLQLALTSDTLNEQQ--LGDYATNVIRPQLLSVAGVAIPSPYG 180
I +S+ ++ SD Q + DY + ++ L + GV +G
Sbjct: 120 QEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG 179

Query: 181 GKVRQVQIDLDPQALQAKGLSAQDVATALAQQNQIIPAGT------QKIGRFEYNIRLND 234
+ ++I LD L L+ DV L QN I AG + +I
Sbjct: 180 AQ-YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQT 238

Query: 235 SPLTIDQLNALPIRTV-NGAVIFMRDVAHVRDGFPPQGNIVRVDGRRAVLMSVLKSGSAS 293
++ + +R +G+V+ ++DVA V G I R++G+ A + + + A+
Sbjct: 239 RFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGAN 298

Query: 294 TLDIIAGVKAQLPRIEATLPPSLRLVVMGDQSVFVKGAVSGVAREGLIAAALTSAMILLF 353
LD +KA+L ++ P ++++ D + FV+ ++ V + A L ++ LF
Sbjct: 299 ALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLF 358

Query: 354 LGSWRSTLIIAASIPLAVLAAIAALAAAGETLNVMTLGGLALAVGILVDDATVTIENV-N 412
L + R+TLI ++P+ +L A LAA G ++N +T+ G+ LA+G+LVDDA V +ENV
Sbjct: 359 LQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVER 418

Query: 413 WHLEQGKDVRSAILDGASQIVAPAFVSLLCICIVFVPMLLLDGVARFLFVPMAEAVIFAM 472
+E + A SQI + + VF+PM G ++ + ++ AM
Sbjct: 419 VMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAM 478

Query: 473 IASFVLSRTFVPMMARYLLRPHAAHPAAVLAPHGAPFPTPRSRNPLVAFQQGFERRFAAL 532
S +++ P + LL+P +A F F F
Sbjct: 479 ALSVLVALILTPALCATLLKPVSAEH----------------HENKGGFFGWFNTTFDHS 522

Query: 533 RTGYRAVLGLALAHRARFVVLFLTAVALSFALVPGLGRNFFPSVDAGEIALHVRAPIGTR 592
Y +G L R+++++ VA L L +F P D G ++ P G
Sbjct: 523 VNHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGAT 582

Query: 593 IEETAALFDRVERTVRGVVPPRALASIVDNMGLPNSGINLTYSNSGTIGPQDGDILVSLT 652
E T + D+V L + N+ + ++S G VSL
Sbjct: 583 QERTQKVLDQVTD--------YYLKNEKANVESVFTVNGFSFSGQ---AQNAGMAFVSLK 631

Query: 653 GEH-----APTADYV-KQLRTVLPRAFPGVTFSFLPADIVSQILNFGAPAPIDVQVTGPD 706
+A+ V + + L + G F IV + G
Sbjct: 632 PWEERNGDENSAEAVIHRAKMELGKIRDGFVIPFNMPAIVELG-TATGFDFELIDQAGLG 690

Query: 707 LAANRAYATELLRRIRTVPG-VADARVQQASTYPQFTVSVDRALAAQLGITEQDVTNAVV 765
A +LL P + R QF + VD+ A LG++ D+ +
Sbjct: 691 HDALTQARNQLLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTIS 750

Query: 766 ASLSGTSQVSPTYWLDPHNGVSYPIVAQTPQYRMTSLSDLRALPVTGRSGAPQLLGGLAT 825
+L G + V+ G + Q D+ L V +G T
Sbjct: 751 TALGG-TYVNDFI----DRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTT 805

Query: 826 IVRSQTDAVVSHYDIAPLDDIFATTQDRDLGAVSADIARVLHASAADLPKGSRVTVRGQV 885
+ Y+ P +I G S D ++ A+ LP G G
Sbjct: 806 SHWVYGSPRLERYNGLPSMEIQGEAAP---GTSSGDAMALMENLASKLPAGIGYDWTGMS 862

Query: 886 QTMSSAFGGLLAGLAGAVLLIYLLIVVNFHSWRDAFVIVSALPAALAGIVWMLFVTRTPL 945
+ A +A + ++++L + + SW ++ +P + G++ +
Sbjct: 863 YQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKN 922

Query: 946 SVPALTGAILCMGVATANSILVVTFARERLAH-TADATVAALEAGFTRFRPVMMTALAMI 1004
V + G + +G++ N+IL+V FA++ + A L A R RP++MT+LA I
Sbjct: 923 DVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFI 982

Query: 1005 IGMAPMALGLGDGGEQNAPLGRAVIGGLLCATVATLVFVPVVFSLVHRR 1053
+G+ P+A+ G G +G V+GG++ AT+ + FVPV F ++ R
Sbjct: 983 LGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRRC 1031



Score = 89.5 bits (222), Expect = 4e-20
Identities = 63/358 (17%), Positives = 133/358 (37%), Gaps = 15/358 (4%)

Query: 714 ATELLRRIRTVPGVADARVQQASTYPQFTVSVDRALAAQLGITEQDVTNAVVA--SLSGT 771
A+ + + + GV D VQ + +D L + +T DV N +
Sbjct: 159 ASNVKDTLSRLNGVGD--VQLFGAQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAA 216

Query: 772 SQVSPTYWLDPHNGVSYPIVAQTPQYRMTSLSDLRALPV-TGRSGAPQLLGGLATIVR-S 829
Q+ T L P ++ I+AQT R + + + + G+ L +A +
Sbjct: 217 GQLGGTPAL-PGQQLNASIIAQT---RFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGG 272

Query: 830 QTDAVVSHYDIAP--LDDIFATTQDRDLGAVSADIARVLHASAADLPKGSRVTV-RGQVQ 886
+ V++ + P I T L + I L P+G +V
Sbjct: 273 ENYNVIARINGKPAAGLGIKLATGANAL-DTAKAIKAKLAELQPFFPQGMKVLYPYDTTP 331

Query: 887 TMSSAFGGLLAGLAGAVLLIYLLIVVNFHSWRDAFVIVSALPAALAGIVWMLFVTRTPLS 946
+ + ++ L A++L++L++ + + R + A+P L G +L ++
Sbjct: 332 FVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSIN 391

Query: 947 VPALTGAILCMGVATANSILVVTFARERLAHTADATVAALEAGFTRFR-PVMMTALAMII 1005
+ G +L +G+ ++I+VV + A E ++ + ++ A+ +
Sbjct: 392 TLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSA 451

Query: 1006 GMAPMALGLGDGGEQNAPLGRAVIGGLLCATVATLVFVPVVFSLVHRRDRAPHSESPS 1063
PMA G G ++ + + + L+ P + + + + A H E+
Sbjct: 452 VFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKG 509


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_2097  RTXTOXIND     43     2e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 42.5 bits (100), Expect = 2e-06
Identities = 18/124 (14%), Positives = 37/124 (29%), Gaps = 28/124 (22%)

Query: 90 GYLHAWYVDIGAHVKAGQLLASIDTPDLDQQLQQARADLQSATANE-RLAAVTAARWSEM 148
+ V G V+ G +L + + + ++ L A + R ++ +
Sbjct: 105 SIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNK 164

Query: 149 LAQDSVS---------------------------RQEADEKRSDLDAKRAAVAASTANVR 181
L + + + + +K +LD KRA A +
Sbjct: 165 LPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARIN 224

Query: 182 RLEA 185
R E
Sbjct: 225 RYEN 228



Score = 34.8 bits (80), Expect = 5e-04
Identities = 22/139 (15%), Positives = 45/139 (32%), Gaps = 4/139 (2%)

Query: 117 LDQQLQQARADLQSATANERLAAVTAARWSEMLAQDSVSRQEADEKRSDLDAKRAAVAAS 176
L+Q+ + A + +L + + S V++ +E L +
Sbjct: 255 LEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLL 314

Query: 177 TANVRRLEALESFKRLTAPFDGVVTARKT-DVGALIDAGSGNGAELFTVSDARRLRLYVH 235
T + + E + + AP V K G ++ + V + L +
Sbjct: 315 TLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE---TLMVIVPEDDTLEVTAL 371

Query: 236 IPQDDAGAIRAGMHVALTV 254
+ D G I G + + V
Sbjct: 372 VQNKDIGFINVGQNAIIKV 390


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_2102  HTHFIS        78     1e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 77.6 bits (191), Expect = 1e-18
Identities = 31/120 (25%), Positives = 49/120 (40%)

Query: 2 RVLTVEDDAVTANEIVGELTARGFEVDWIDNGREGMMRAMSASYDAITLDRMLPGADGLA 61
+L +DDA + L+ G++V N + D + D ++P +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 ILTAMRTVGIDTPVLMLSALGDVDERIRGLRAGGDDYLTKPFDSGELSARIEVLLRRRQA 121
+L ++ D PVL++SA I+ G DYL KPFD EL I L +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124


30  Bamb_2201  Bamb_2226  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Bamb_2201  3       12       1.525804    glutamate--ammonia ligase
Bamb_2202  3       10       2.858242    peptidase C26
Bamb_2203  2       10       2.684352    hypothetical protein
Bamb_2204  -1      9        1.717843    hypothetical protein
Bamb_2205  -2      8        1.402644    N-formylglutamate amidohydrolase
Bamb_2206  -3      10       1.600682    N-formimino-L-glutamate deiminase
Bamb_2207  -2      10       0.741380    imidazolonepropionase
Bamb_2208  -2      10       1.015515    hypothetical protein
Bamb_2209  -3      9        1.230894    urocanate hydratase
Bamb_2210  -2      8        2.418712    histidine utilization repressor
Bamb_2211  -1      9        3.120535    histidine ammonia-lyase
Bamb_2212  1       9        3.803149    extracellular solute-binding protein
Bamb_2213  3       11       4.451399    4'-phosphopantetheinyl transferase
Bamb_2214  3       12       3.721613    alpha/beta hydrolase
Bamb_2215  0       10       2.752926    LysR family transcriptional regulator
Bamb_2216  1       9        2.859783    major facilitator superfamily transporter
Bamb_2217  0       10       2.416705    deoxyribodipyrimidine photo-lyase
Bamb_2218  0       9        0.713830    phosphoesterase, DHHA1
Bamb_2219  -1      9        0.443973    short chain dehydrogenase
Bamb_2220  -3      11       -1.614582   major facilitator superfamily transporter
Bamb_2221  2       25       -5.221648   3-carboxymuconate cyclase-like protein
Bamb_2222  3       28       -6.556967   AraC family transcriptional regulator
Bamb_2223  3       27       -6.164976   hypothetical protein
Bamb_2224  4       29       -6.213810   HAD family hydrolase
Bamb_2225  2       23       -5.534196   *Rhs element Vgr protein
Bamb_2226  2       19       -4.576919   EF hand domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_2216  TCRTETA       42     4e-06    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 41.7 bits (98), Expect = 4e-06
Identities = 71/346 (20%), Positives = 116/346 (33%), Gaps = 13/346 (3%)

Query: 50 FGLALALQNLVWGIAQPLTGMIADRFGSVRVIVAGMLLYAAGLVTMALAASIGVFTAGAG 109
+G+ LAL L+ P+ G ++DRFG V++ + A MA A + V G
Sbjct: 45 YGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGR- 103

Query: 110 LVIGIALSGSAFASIYGALSRLFPPDRRGWALGVAGAIGGLGQFCMVPVAQVLIGGIGWQ 169
+V GI +G+ A ++ + D R G A G G PV L+GG
Sbjct: 104 IVAGI--TGATGAVAGAYIADITDGDERARHFGFMSACFGFG-MVAGPVLGGLMGGFSPH 160

Query: 170 HAFVALALVAALLAPLAVLVRDRPAQAASQADGTDQ-SIAAAVREAFTHRGFWLLNAGFF 228
F A A + L + + + + + A+ R A L A FF
Sbjct: 161 APFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFF 220

Query: 229 ACGFQLAFIATHLPAYLLDH-GLPARHASVALALIALTN-VAGTYACGHLGGVLRRKYVL 286
A + D A ++LA + + +A G + L + L
Sbjct: 221 IMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRAL 280

Query: 287 --SVLYLVRALAMAAFVAAPLSPVSVYVFAAVMGFTWLGTVPLTNGVISQVFGVRYIATL 344
++ + AF + V A G +P ++S+ L
Sbjct: 281 MLGMIADGTGYILLAFATRGWMAFPIMVLLASGGI----GMPALQAMLSRQVDEERQGQL 336

Query: 345 FGFVFFGHQLGSFFGVWLGALVYDATHSYLPLWIGSIALGVLAALL 390
G + L S G L +Y A+ + W + L
Sbjct: 337 QGSLAALTSLTSIVGPLLFTAIYAASITTWNGWAWIAGAALYLLCL 382


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_2219  DHBDHDRGNASE  76     2e-19    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 76.2 bits (187), Expect = 2e-19
Identities = 47/164 (28%), Positives = 66/164 (40%), Gaps = 9/164 (5%)

Query: 6 PEAVEDADLDGWRAVFDTNVFGTMTLTQEIVPHMKRQKRGAIVMINTQATRKPFAGESGY 65
P + + W A F N G ++ + +M ++ G+IV + + P + Y
Sbjct: 98 PGLIHSLSDEEWEATFSVNSTGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAY 157

Query: 66 AVSKGALAVAAKYLARELGVHGICANSIHMGWMWSVPTQTYFRQAAAEYGMTEEQIIAPI 125
A SK A + K L EL + I N + G S T + A E G EQ+I
Sbjct: 158 ASSKAAAVMFTKCLGLELAEYNIRCNIVSPG---STETDMQWSLWADENG--AEQVIKGS 212

Query: 126 ASN----IALAKLPTDDDCARAALFLASDYANAVTGATLDANGG 165
I L KL D A A LFL S A +T L +GG
Sbjct: 213 LETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_2226  TONBPROTEIN   33     0.005    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 32.7 bits (74), Expect = 0.005
Identities = 23/88 (26%), Positives = 32/88 (36%), Gaps = 15/88 (17%)

Query: 448 PVSDPKSRWVKVQPKTVKMPAEQPEPPAPAGKGKHKPKPAKKPEP--------------I 493
P + V+P+ P +P AP K KPKP KP+P +
Sbjct: 58 PQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPV 117

Query: 494 ETSTGSPFWVDSSLGLI-NQMTTAPVKG 520
E+ SPF + L + T A K
Sbjct: 118 ESRPASPFENTAPARLTSSTATAATSKP 145


31  Bamb_2270  Bamb_2284  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Bamb_2270  -2      10       -3.332301   acyl-CoA dehydrogenase domain-containing
Bamb_2271  0       14       -4.268119   hypothetical protein
Bamb_2272  0       14       -4.752576   NUDIX hydrolase
Bamb_2273  1       16       -4.842284   hypothetical protein
Bamb_2274  0       16       -4.876628   NADH dehydrogenase subunit N
Bamb_2275  0       16       -5.112741   NADH dehydrogenase subunit M
Bamb_2276  0       17       -3.484961   NADH dehydrogenase subunit L
Bamb_2277  -1      18       -2.966046   NADH dehydrogenase subunit K
Bamb_2278  0       18       -2.872642   NADH dehydrogenase subunit J
Bamb_2279  -1      17       -3.080705   NADH dehydrogenase subunit I
Bamb_2280  -1      17       -2.944420   NADH dehydrogenase subunit H
Bamb_2281  -1      15       -2.831072   NADH dehydrogenase subunit G
Bamb_2282  -1      12       -4.727893   NADH-quinone oxidoreductase subunit F
Bamb_2283  -1      15       -4.776952   NADH dehydrogenase subunit E
Bamb_2284  0       15       -3.190770   NADH dehydrogenase subunit D
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_2280  OUTRMMBRANEA  31     0.009    Outer membrane protein A signature.
		>OUTRMMBRANEA#Outer membrane protein A signature.

Length = 346

Score = 30.7 bits (69), Expect = 0.009
Identities = 15/96 (15%), Positives = 28/96 (29%), Gaps = 10/96 (10%)

Query: 138 YAVILAGWASNSKYAFLGAMR-------AAAQMVSYEISMGFALVLVLMTAGSLNLSEIV 190
Y GW+ F+ A Y+++ + G +
Sbjct: 29 YTGAKLGWSQYHDTGFINNNGPTHENQLGAGAFGGYQVNPYVGFEMGYDWLGRMPY---K 85

Query: 191 GSQQHGFFAGHGVNFLSWNWLPLLPAFVVYFVSGIA 226
GS ++G + GV + P+ +Y G
Sbjct: 86 GSVENGAYKAQGVQLTAKLGYPITDDLDIYTRLGGM 121


32  Bamb_2313  Bamb_2319  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Bamb_2313  0       7        4.184551    glucose-1-dehydrogenase
Bamb_2314  -1      8        4.028231    glycosyltransferase
Bamb_2315  -1      12       3.857042    hypothetical protein
Bamb_2316  -1      11       4.070129    threonyl/alanyl tRNA synthetase
Bamb_2317  -1      12       3.738688    globin
Bamb_2318  0       10       3.929976    hypothetical protein
Bamb_2319  0       11       3.012190    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_2313  DHBDHDRGNASE  94     4e-25    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 94.0 bits (233), Expect = 4e-25
Identities = 67/256 (26%), Positives = 109/256 (42%), Gaps = 19/256 (7%)

Query: 8 KAVLITGASRGIGRATAVLAAKRGWDV-GINYARDAAAAELTARAVRDAGARACVVAGDV 66
K ITGA++GIG A A A +G + ++Y + E +++ A DV
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKL--EKVVSSLKAEARHAEAFPADV 66

Query: 67 ANEADVVAMFDAVTAAFGRLDALVNNAGIVAPSMPLADMPADRLRRMFDTNVLGAYLCAR 126
+ A + + + G +D LVN AG++ P + + + F N G + +R
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPG-LIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 127 EAARRLSTDRGGRGGAIVNVSSIASRLGSPNEYVD-YAGSKGAVDSLTIGLAKELGPHGV 185
++ + R G+IV V S + G P + YA SK A T L EL + +
Sbjct: 126 SVSKYM---MDRRSGSIVTVGSNPA--GVPRTSMAAYASSKAAAVMFTKCLGLELAEYNI 180

Query: 186 RVNAVRPGLIETEIHAS-----GGQPGRAARLGAQ----TPLGRAGEAQEIAEAIVWLLG 236
R N V PG ET++ S G PL + + +IA+A+++L+
Sbjct: 181 RCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVS 240

Query: 237 DAASYTTGALLDIGGG 252
A + T L + GG
Sbjct: 241 GQAGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_2318  ISCHRISMTASE  31     0.015    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 31.1 bits (70), Expect = 0.015
Identities = 18/84 (21%), Positives = 31/84 (36%), Gaps = 8/84 (9%)

Query: 690 PPPVLKDFPAVYLTSFHLPAEQAALLDPLIARYPNLTAIDVAPILAQLQRMMLQVVGAVQ 749
P D P S+ +A LL + Y V A + ++ ++
Sbjct: 10 QMPTASDMPQ-NKVSWVPDPNRAVLLIHDMQNY------FVDAFTAGA-SPVTELSANIR 61

Query: 750 FLFGFTLAAGVLVLYTALAGSRDE 773
L + G+ V+YTA GS++
Sbjct: 62 KLKNQCVQLGIPVVYTAQPGSQNP 85


33  Bamb_2392  Bamb_2434  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Bamb_2392  2       13       0.143876    transmembrane pair domain-containing protein
Bamb_2393  2       12       -0.030602   LysR family transcriptional regulator
Bamb_2394  3       12       -0.816453   hypothetical protein
Bamb_2395  2       8        -1.786354   DNA topoisomerase IV subunit A
Bamb_2396  1       12       -3.396911   DNA topoisomerase IV subunit B
Bamb_2397  1       20       -4.772943   ABC transporter-like protein
Bamb_2398  5       37       -7.356473   hypothetical protein
Bamb_2399  4       40       -7.871761   rubredoxin-type Fe(Cys)4 protein
Bamb_2400  4       42       -8.294594   *phage integrase family protein
Bamb_2401  5       41       -7.187118   helix-turn-helix, Fis-type
Bamb_2402  3       40       -6.420967   prophage CP4-57 regulatory
Bamb_2403  3       39       -6.238026   hypothetical protein
Bamb_2404  4       40       -7.005525   hypothetical protein
Bamb_2405  4       40       -7.280536   hypothetical protein
Bamb_2406  4       38       -7.001640   TOPRIM domain-containing protein
Bamb_2407  4       38       -7.717755   hypothetical protein
Bamb_2408  4       39       -6.989775   hypothetical protein
Bamb_2409  4       38       -6.379893   hypothetical protein
Bamb_2410  4       36       -5.961476   hypothetical protein
Bamb_2411  4       36       -5.784551   hypothetical protein
Bamb_2412  5       36       -5.676601   hypothetical protein
Bamb_2413  6       41       -6.608323   hypothetical protein
Bamb_2414  7       42       -8.733484   phage integrase family protein
Bamb_2415  8       46       -10.839718  hypothetical protein
Bamb_2416  6       42       -9.554353   *hypothetical protein
Bamb_2417  7       45       -10.228025  hypothetical protein
Bamb_2418  7       45       -9.593921   CheA signal transduction histidine kinase
Bamb_2419  6       43       -8.698013   hypothetical protein
Bamb_2420  6       45       -8.258895   hypothetical protein
Bamb_2421  6       47       -9.586204   phage integrase family protein
Bamb_2422  7       58       -11.276200  hypothetical protein
Bamb_2423  8       61       -11.875783  hypothetical protein
Bamb_2424  5       53       -10.196390  hypothetical protein
Bamb_2425  4       48       -8.484546   hypothetical protein
Bamb_2426  3       41       -7.009115   hypothetical protein
Bamb_2427  3       33       -4.208692   resolvase domain-containing protein
Bamb_2428  4       26       -2.661159   hypothetical protein
Bamb_2429  6       17       1.008077    *parallel beta-helix repeat-containing protein
Bamb_2430  5       14       2.175927    short-chain dehydrogenase/reductase SDR
Bamb_2431  4       10       2.215466    TetR family transcriptional regulator
Bamb_2432  5       11       2.413640    ecotin
Bamb_2433  3       11       1.638203    hypothetical protein
Bamb_2434  2       11       1.434442    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_2401  HTHFIS        38     2e-05    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 37.5 bits (87), Expect = 2e-05
Identities = 10/35 (28%), Positives = 18/35 (51%)

Query: 127 RTQMMEALERNGGNKTKTATEFGISRQRLAQIIQQ 161
++ AL GN+ K A G++R L + I++
Sbjct: 438 YPLILAALTATRGNQIKAADLLGLNRNTLRKKIRE 472


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_2430  DHBDHDRGNASE  124    6e-37    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 124 bits (313), Expect = 6e-37
Identities = 85/254 (33%), Positives = 135/254 (53%), Gaps = 15/254 (5%)

Query: 4 LQGKRALVTGGSRGIGAAIAKRLAADGADVAITYEKSAERARAVVADIEALGRRAVAIQA 63
++GK A +TG ++GIG A+A+ LA+ GA +A + + E+ VV+ ++A R A A A
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIA-AVDYNPEKLEKVVSSLKAEARHAEAFPA 64

Query: 64 DSADPVAVRGAVDHAAQTLGGLDILVNNAGIFRAGALDDLTLDDIDATLNVNVRAVIVAS 123
D D A+ + +G +DILVN AG+ R G + L+ ++ +AT +VN V AS
Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNAS 124

Query: 124 QAAARHL--GEGGRIVSTGSCLATRVPDAGMSLYAASKAALIGWTQGLARDLGARGITVN 181
++ ++++ G IV+ GS VP M+ YA+SKAA + +T+ L +L I N
Sbjct: 125 RSVSKYMMDRRSGSIVTVGSN-PAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183

Query: 182 LVHPGSTDTDMNPA--DGEHAGAQRSRMATPQY---------GKAEDVAALVAFVVGPEG 230
+V PGST+TDM + E+ Q + + + K D+A V F+V +
Sbjct: 184 IVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQA 243

Query: 231 RSINGTGLTIDGGA 244
I L +DGGA
Sbjct: 244 GHITMHNLCVDGGA 257


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_2431  HTHTETR       54     3e-11    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 53.9 bits (129), Expect = 3e-11
Identities = 33/199 (16%), Positives = 69/199 (34%), Gaps = 7/199 (3%)

Query: 1 MAERGRPRSFD-KEAALDRAMEVFWRLGYEGASMTDLTAAMGIASPSLYAAFGSKEALFR 59
MA + + + + ++ LD A+ +F + G S+ ++ A G+ ++Y F K LF
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 60 QAIE-HYRETEGREIWDGVEQAGSAHDAIENYLMQTARVFTRLSKPAGCLIVLSALHPAE 118
+ E E+ + G + L+ +++ L++ H E
Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLEST--VTEERRRLLMEIIFHKCE 118

Query: 119 RSD---TVRQMLIAMREQTVAALRTRLGEGVAAGEISAHADLDAIARYYVTVQQGMSIQA 175
V+Q + ++ + L + A + A A G+
Sbjct: 119 FVGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENW 178

Query: 176 RDGASRRDLEAVAQAALAA 194
DL+ A+ +A
Sbjct: 179 LFAPQSFDLKKEARDYVAI 197


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_2433  cloacin       33     2e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 32.8 bits (74), Expect = 2e-04
Identities = 21/72 (29%), Positives = 25/72 (34%)

Query: 36 GYAYGPAYGAAPVYGTVNIWGGGGGGRDWDRGHRDYRRWDRDRGDHGGWGRGGGRRGDWN 95
G+ G + + G G GGG D + W G WG G G
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 96 EGGGGGGRGEGG 107
G GGG G GG
Sbjct: 68 NGNSGGGSGTGG 79


34  Bamb_2487  Bamb_2502  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Bamb_2487  2       15       -0.016189   hypothetical protein
Bamb_2488  0       11       -0.031070   chromosome segregation and condensation protein
Bamb_2489  -2      11       1.822172    pantoate--beta-alanine ligase
Bamb_2490  -1      10       2.544025    aspartate alpha-decarboxylase
Bamb_2491  -1      9        4.312261    cobyrinic acid a,c-diamide synthase
Bamb_2492  1       9        4.688100    DoxX family protein
Bamb_2493  1       10       4.948554    hypothetical protein
Bamb_2494  2       10       5.437005    cobyric acid synthase
Bamb_2495  3       14       5.437342    adenosylcobinamide kinase
Bamb_2496  3       11       5.189765    cobalamin biosynthesis protein
Bamb_2497  3       10       4.817274    threonine-phosphate decarboxylase
Bamb_2498  4       11       5.185716    periplasmic binding protein
Bamb_2499  4       12       5.226017    phosphoglycerate mutase
Bamb_2500  2       12       4.545629    cobalamin synthase
Bamb_2501  1       12       3.988396    nicotinate-nucleotide--dimethylbenzimidazole
Bamb_2502  1       15       3.017276    ABC transporter-like protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_2487  SYCECHAPRONE  25     0.011    Gram-negative bacterial type III secretion SycE cha...
		>SYCECHAPRONE#Gram-negative bacterial type III secretion SycE chaperone signature.

Length = 130

Score = 25.4 bits (55), Expect = 0.011
Identities = 8/28 (28%), Positives = 16/28 (57%)

Query: 18 KPTLEAEQRKGRSLLWDKQPIDLDERAE 45
KP L ++ G +LW++QP++ +
Sbjct: 75 KPILSWDEVGGHPVLWNRQPLNSLDNNS 102


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_2498  FERRIBNDNGPP  42     1e-06    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 42.2 bits (99), Expect = 1e-06
Identities = 40/174 (22%), Positives = 68/174 (39%), Gaps = 7/174 (4%)

Query: 49 AQRVISLAPHATELVYAAG----GGAKLVGTVTYSDYPPAAQAVPRVGDNKALDLERIAA 104
R+++L EL+ A G G A + + PP +V VG +LE +
Sbjct: 35 PNRIVALEWLPVELLLALGIVPYGVADTINYRLWVSEPPLPDSVIDVGLRTEPNLELLTE 94

Query: 105 LKPDLIVVWRHGNAERQTDALRALHIPLFFSEPKHLHDVA-TSLRQLGTLLGTAPVADAA 163
+KP +V + A A FS+ K +A SL ++ LL A+
Sbjct: 95 MKPSFMVWSAGYGPSPEMLARIAPGRGFNFSDGKQPLAMARKSLTEMADLLNLQSAAETH 154

Query: 164 AASFSRDIAALRARYAARA--PVTMFFQVWDRPLTTLNGAHLFNEVIALCGGRN 215
A + I +++ R+ R P+ + + R + LF E++ G N
Sbjct: 155 LAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVFGPNSLFQEILDEYGIPN 208


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_2502  PF05272       30     0.009    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.4 bits (68), Expect = 0.009
Identities = 20/72 (27%), Positives = 28/72 (38%), Gaps = 10/72 (13%)

Query: 8 TADDMSCAAVDLTLKAGERTLLDGFTQAFRPGEIWCVA-------GPNGAGKTTLLATLA 60
T DD + G+ L+ + PG C G G GK+TL+ TL
Sbjct: 561 TPDDYKPRRLRYLQLVGKYILMGHVARVMEPG---CKFDYSVVLEGTGGIGKSTLINTLV 617

Query: 61 GLRQPAGGHVEI 72
GL + H +I
Sbjct: 618 GLDFFSDTHFDI 629


35  Bamb_2572  Bamb_2592  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Bamb_2572  2       14       0.170737    isocitrate dehydrogenase
Bamb_2573  3       15       1.249061    pseudouridine synthase, Rsu
Bamb_2575  1       14       0.797300    hypothetical protein
Bamb_2576  1       12       1.684600    elongation factor G
Bamb_2577  4       15       4.194708    high-affinity nickel-transporter
Bamb_2578  4       16       4.906894    hypothetical protein
Bamb_2579  1       9        3.232539    aldo/keto reductase
Bamb_2580  1       10       3.475875    GntR family transcriptional regulator
Bamb_2581  1       10       4.013355    L-carnitine dehydratase/bile acid-inducible
Bamb_2582  0       9        4.065873    citrate synthase
Bamb_2583  0       10       3.009362    hypothetical protein
Bamb_2584  0       11       2.875057    EmrB/QacA family drug resistance transporter
Bamb_2585  0       11       4.048515    diguanylate phosphodiesterase
Bamb_2586  0       10       2.991410    type 11 methyltransferase
Bamb_2587  -1      10       3.461008    2-dehydropantoate 2-reductase
Bamb_2588  0       8        3.536391    hypothetical protein
Bamb_2589  0       7        3.534181    Crp/FNR family transcriptional regulator
Bamb_2590  0       7        3.180302    chromate transporter
Bamb_2591  2       10       1.941874    superoxide dismutase
Bamb_2592  -1      11       3.185847    exodeoxyribonuclease VII large subunit
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_2576  TCRTETOQM     625    0.0      Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 625 bits (1613), Expect = 0.0
Identities = 174/685 (25%), Positives = 298/685 (43%), Gaps = 77/685 (11%)

Query: 9 RYRNIGISAHIDAGKTTTTERILFYTGVSHKIGEVHDGAATMDWMEQEQERGITITSAAT 68
+ NIG+ AH+DAGKTT TE +L+ +G ++G V G D E++RGITI + T
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61

Query: 69 TAFWKGMAGNYPEHRINIIDTPGHVDFTIEVERSMRVLDGACMVYDSVGGVQPQSETVWR 128
+ W+ ++NIIDTPGH+DF EV RS+ VLDGA ++ + GVQ Q+ ++
Sbjct: 62 SFQWEN-------TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFH 114

Query: 129 QANKYKVPRIAFVNKMDRVGADFFRVQRQIGERLKGVAVPIQIPIGAEEHFQGVVDLVKM 188
K +P I F+NK+D+ G D V + I E+L V Q V M
Sbjct: 115 ALRKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQ----------KVELYPNM 164

Query: 189 KAIVWDDESQGVKFSYEEIPANLVELAHEWRGKMVEAAAEASEELLEKYLHDHESLTEDE 248
+ + Q + E +++LLEKY+ +SL E
Sbjct: 165 CVTNFTESEQ------------------------WDTVIEGNDDLLEKYM-SGKSLEALE 199

Query: 249 IKAALRQRTIANEIVPMLCGSAFKNKGVQAMLDAVIDYLPSPVDVPAILGHDFADPEKPA 308
++ R + P+ GSA N G+ +++ + + S
Sbjct: 200 LEQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITNKFYSSTH---------------- 243

Query: 309 ERHPSDDEPFSSLAFKIMTDPFVGQLIFFRVYSGVVNSGDTVLNATKDKKERLGRILQMH 368
FKI +L + R+YSGV++ D+V + K+K ++ +
Sbjct: 244 ----RGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKI-KITEMYTSI 298

Query: 369 ANERKEIKEVRAGDIAAAVG--LK-EATTGDTLCDPTKPIILEKMEFPEPVISQAVEPKT 425
E +I + +G+I LK + GDT P + E++E P P++ VEP
Sbjct: 299 NGELCKIDKAYSGEIVILQNEFLKLNSVLGDTKLLPQR----ERIENPLPLLQTTVEPSK 354

Query: 426 KADQEKMGLALNRLAQEDPSFRVQTDEESGQTIISGMGELHLEILVDRMKREFGVEATVG 485
+E + AL ++ DP R D + + I+S +G++ +E+ ++ ++ VE +
Sbjct: 355 PQQREMLLDALLEISDSDPLLRYYVDSATHEIILSFLGKVQMEVTCALLQEKYHVEIEIK 414

Query: 486 KPQVAYRETVRTTASDVEGKFVKQSGGRGQYGHAVITLEPNP-GKGYEFLDEIKGGVIPR 544
+P V Y E A E + + +++ P P G G ++ + G + +
Sbjct: 415 EPTVIYMERPLKKA---EYTIHIEVPPNPFWASIGLSVSPLPLGSGMQYESSVSLGYLNQ 471

Query: 545 EFIPAVDKGITETLKSGVLAGYPVVDVKVHLTFGSYHDVDSNENAFRMAGSMAFKEAMRR 604
F AV +GI + G L G+ V D K+ +G Y+ S FRM + ++ +++
Sbjct: 472 SFQNAVMEGIRYGCEQG-LYGWNVTDCKICFKYGLYYSPVSTPADFRMLAPIVLEQVLKK 530

Query: 605 AKPVLLEPMMAVEVETPEDFMGNVMGDLSSRRGIVQGMEDIAGGGGKLVRAEVPLAEMFG 664
A LLEP ++ ++ P++++ D + + ++ E+P +
Sbjct: 531 AGTELLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQ--LKNNEVILSGEIPARCIQE 588

Query: 665 YSTSLRSATQGRATYTMEFKQYAET 689
Y + L T GR+ E K Y T
Sbjct: 589 YRSDLTFFTNGRSVCLTELKGYHVT 613


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_2578  cloacin       48     7e-08    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 47.8 bits (113), Expect = 7e-08
Identities = 42/100 (42%), Positives = 44/100 (44%), Gaps = 1/100 (1%)

Query: 383 SGGHGNG-NGGHGNGNGNGNGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 441
SGG G G N G + +GN NGG G G GGG G G GGG G G GGG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 442 GGGGSGGSGGGGGSGGSGGSGGAGGSGGQGNGGGSTGGHG 481
G G G GGGSG G G ST G G
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAG 101



Score = 46.6 bits (110), Expect = 2e-07
Identities = 33/79 (41%), Positives = 40/79 (50%)

Query: 361 GGNGGGNGGGNGGGNGGGHGGGSGGHGNGNGGHGNGNGNGNGGGGGGGGGGGGGGGGGGG 420
GG+G G+ G +G +GG +G G G+G + N GGG G G GGG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 421 GGGGGGGGGGGGGGGGGGG 439
G GGG G GGG G GG
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 46.2 bits (109), Expect = 2e-07
Identities = 40/79 (50%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 337 GGNGGGHGGGNGGGNGGGNGGGNGGGNGGGNGGGNGGG---NGGGHGGGSGGHGNGNGGH 393
GG+G GH G +G NGG G G GGG G+G N G G GSG H G GH
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 394 GNGNGNGNGGGGGGGGGGG 412
GNG GNGN GGG G GG
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 45.9 bits (108), Expect = 3e-07
Identities = 36/80 (45%), Positives = 41/80 (51%), Gaps = 1/80 (1%)

Query: 353 GGNGGGNGGGNGGGNGGGNGGGNGGGHGGGSGGHGNGNGGHGNGNGNGNGGGGGGGGGGG 412
GG+G G+ G +G NGG G G GGG G+G N G G+G G GGG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGG-ASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 413 GGGGGGGGGGGGGGGGGGGG 432
G GGG G GGG G GG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 45.9 bits (108), Expect = 3e-07
Identities = 41/109 (37%), Positives = 46/109 (42%), Gaps = 1/109 (0%)

Query: 357 GGNGGGNGGGNGGGNGGGNGG-GHGGGSGGHGNGNGGHGNGNGNGNGGGGGGGGGGGGGG 415
GG+G G+ G +G NGG G GG +G+G N G G G G GGG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 416 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGSGGSGGGGGSGGSGGSGGA 464
G GGG G GGG G GG G S G G S GA
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA 111



Score = 45.5 bits (107), Expect = 4e-07
Identities = 32/79 (40%), Positives = 37/79 (46%)

Query: 273 GDGGGNGGGNGVGNGGGHGGGHGGGGGGGGHGGGNGGGHGGGNGGGNGGGSGGGSGGGHG 332
GDG G+ G +G +GG G G GGG G GGG+G G G G GHG
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 333 GGNGGGNGGGHGGGNGGGN 351
G G GN GG G G +
Sbjct: 64 NGGGNGNSGGGSGTGGNLS 82



Score = 45.5 bits (107), Expect = 4e-07
Identities = 37/80 (46%), Positives = 43/80 (53%), Gaps = 1/80 (1%)

Query: 349 GGNGGGNGGGNGGGNGGGNGGGNGGGNGGGHGGGSGGHGNGNGGHGNGNGNGNGGGGGGG 408
GG+G G+ G +G NGG G G GGG GSG + N G G+G+G GGG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSG-WSSENNPWGGGSGSGIHWGGGSG 61

Query: 409 GGGGGGGGGGGGGGGGGGGG 428
G GGG G GGG G GG
Sbjct: 62 HGNGGGNGNSGGGSGTGGNL 81



Score = 44.3 bits (104), Expect = 8e-07
Identities = 41/113 (36%), Positives = 47/113 (41%), Gaps = 2/113 (1%)

Query: 144 GTSGNGASGGGTSGSGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGG 203
G G G + G S SG GG +G G GG + G G S GG SG G GG SG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 204 GTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTS 256
G GG + GG SG G G ++ G T G G S G S
Sbjct: 63 GNGGGNGNSGGGSGTG--GNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALS 113



Score = 44.3 bits (104), Expect = 8e-07
Identities = 38/108 (35%), Positives = 43/108 (39%)

Query: 408 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGSGGSGGGGGSGGSGGSGGAGGS 467
GG G G G G GG G G GGG G G S + GGGSG GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 468 GGQGNGGGSTGGHGIGGHGNGGGNGNGNGNGGAGSGGANGVGNGSGGG 515
G G G S GG G GG+ + G + GA G+ G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 43.9 bits (103), Expect = 1e-06
Identities = 30/78 (38%), Positives = 38/78 (48%)

Query: 233 GGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGHGGHGGGGHGDGGGNGGGNGVGNGGGHGG 292
GG G +G ++ G +GG T G G G G + G G G+G+ GGG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 293 GHGGGGGGGGHGGGNGGG 310
G+GGG G G G G GG
Sbjct: 63 GNGGGNGNSGGGSGTGGN 80



Score = 43.9 bits (103), Expect = 1e-06
Identities = 41/114 (35%), Positives = 49/114 (42%), Gaps = 2/114 (1%)

Query: 133 NGGKGDGSSGGGTSGNGASGGGTSGSGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSG 192
+GG G G + G S +G GG +G G GG + G G S GG SG G GG SG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 193 GGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTS 246
G GG + GG SG G G ++ G T G G S G S
Sbjct: 62 HGNGGGNGNSGGGSGTG--GNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALS 113



Score = 43.9 bits (103), Expect = 1e-06
Identities = 29/81 (35%), Positives = 35/81 (43%)

Query: 243 GGTSGGGTSGGGTSGGGTSGGHGGHGGGGHGDGGGNGGGNGVGNGGGHGGGHGGGGGGGG 302
GG G +G ++ G +GG G G GG G GGG G G GGG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 303 HGGGNGGGHGGGNGGGNGGGS 323
GG G GGG+G G +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSA 83



Score = 43.9 bits (103), Expect = 1e-06
Identities = 37/102 (36%), Positives = 41/102 (40%)

Query: 423 GGGGGGGGGGGGGGGGGGGGGGGSGGSGGGGGSGGSGGSGGAGGSGGQGNGGGSTGGHGI 482
GG G G G G GG G GGG G S GG G+G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 483 GGHGNGGGNGNGNGNGGAGSGGANGVGNGSGGGGGGGSGGSA 524
G G G +G G+G GG S A V G G+GG A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 43.5 bits (102), Expect = 2e-06
Identities = 29/79 (36%), Positives = 32/79 (40%)

Query: 285 GNGGGHGGGHGGGGGGGGHGGGNGGGHGGGNGGGNGGGSGGGSGGGHGGGNGGGNGGGHG 344
G+G GH G G G G GG + G GGG G G G G GHG
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 345 GGNGGGNGGGNGGGNGGGN 363
G G GN GG G G +
Sbjct: 64 NGGGNGNSGGGSGTGGNLS 82



Score = 43.5 bits (102), Expect = 2e-06
Identities = 39/110 (35%), Positives = 45/110 (40%), Gaps = 2/110 (1%)

Query: 154 GTSGSGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGG 213
G G G + G S G GG +G G GG + G G S GG SG G GG SG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 214 GTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGG 263
G GG + GG SG G G ++ G T G G S G
Sbjct: 63 GNGGGNGNSGGGSGTG--GNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110



Score = 43.5 bits (102), Expect = 2e-06
Identities = 34/80 (42%), Positives = 37/80 (46%), Gaps = 1/80 (1%)

Query: 297 GGGGGGHGGGNGGGHGGGNGGGNGGGSGGGSGGGHG-GGNGGGNGGGHGGGNGGGNGGGN 355
GG G GH G G NGG G G GGG+ G G GGG G G G G G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 356 GGGNGGGNGGGNGGGNGGGN 375
G G G GN GG G G +
Sbjct: 63 GNGGGNGNSGGGSGTGGNLS 82



Score = 42.0 bits (98), Expect = 4e-06
Identities = 31/81 (38%), Positives = 38/81 (46%), Gaps = 3/81 (3%)

Query: 305 GGNGGGHGGGNGGGNGGGSGGGSGGGHGGGNGGGNGGGHGGGNGGGNGGGNGGGNGGGNG 364
GG+G GH G +G +GG +G G GG + G GGG+G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLG---VGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59

Query: 365 GGNGGGNGGGNGGGHGGGSGG 385
G+G G G GN GG G G
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGN 80



Score = 42.0 bits (98), Expect = 5e-06
Identities = 32/79 (40%), Positives = 35/79 (44%)

Query: 204 GTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGG 263
G G G + G S G GG +G G GG + G G S GG SG G GG SG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 264 HGGHGGGGHGDGGGNGGGN 282
G G G G G G GG
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 42.0 bits (98), Expect = 5e-06
Identities = 32/78 (41%), Positives = 36/78 (46%)

Query: 189 GTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGG 248
G G G + G S G GG +G G GG + G G S GG SG G GG SG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 249 GTSGGGTSGGGTSGGHGG 266
G GG + GG SG G
Sbjct: 63 GNGGGNGNSGGGSGTGGN 80



Score = 42.0 bits (98), Expect = 5e-06
Identities = 32/79 (40%), Positives = 37/79 (46%)

Query: 254 GTSGGGTSGGHGGHGGGGHGDGGGNGGGNGVGNGGGHGGGHGGGGGGGGHGGGNGGGHGG 313
G G G + G G +G G G G G +G G + GGG G G GGG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 314 GNGGGNGGGSGGGSGGGHG 332
GNGGGNG GG GG+
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 41.6 bits (97), Expect = 7e-06
Identities = 34/80 (42%), Positives = 39/80 (48%), Gaps = 1/80 (1%)

Query: 199 GTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGG 258
G G G + G S G GG +G G GG + G G S GG SG G GG SG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 259 GTSGGHGGHGGGGHGDGGGN 278
G +GG G+ GGG G GG
Sbjct: 63 G-NGGGNGNSGGGSGTGGNL 81



Score = 41.2 bits (96), Expect = 7e-06
Identities = 29/81 (35%), Positives = 35/81 (43%), Gaps = 1/81 (1%)

Query: 327 SGGGHGGGNGGGNGGGHGGGNGGGNGGGNGGGNGGGNGGGNGGGNGGGNGGGHGGGSGGH 386
SGG G N G + G G GG + G GGG+G G G GG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWG-GGS 60

Query: 387 GNGNGGHGNGNGNGNGGGGGG 407
G+GNGG +G G+G GG
Sbjct: 61 GHGNGGGNGNSGGGSGTGGNL 81



Score = 41.2 bits (96), Expect = 8e-06
Identities = 31/82 (37%), Positives = 37/82 (45%), Gaps = 3/82 (3%)

Query: 276 GGNGGGNGVGNGGGHGGGHGGGGGGGGHGGGNGGGHGGGNGGGNGGGSGGGSGGGHGGGN 335
GG+G G+ G G +GG G G GG + G GGGSG G H GG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGI---HWGGG 59

Query: 336 GGGNGGGHGGGNGGGNGGGNGG 357
G GG G +GGG+G G
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNL 81



Score = 40.9 bits (95), Expect = 1e-05
Identities = 37/102 (36%), Positives = 43/102 (42%), Gaps = 2/102 (1%)

Query: 164 GTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGG 223
G G G + G S G GG +G G GG + G G S GG SG G GG SG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 224 GTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGHG 265
G GG + GG SG G G ++ G T G G
Sbjct: 63 GNGGGNGNSGGGSGTG--GNLSAVAAPVAFGFPALSTPGAGG 102



Score = 40.5 bits (94), Expect = 2e-05
Identities = 31/79 (39%), Positives = 34/79 (43%), Gaps = 1/79 (1%)

Query: 209 GTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTS-GGHGGH 267
G G G + G S G GG +G G GG + G G S GG SG G GG GH
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 268 GGGGHGDGGGNGGGNGVGN 286
G GG G G G G
Sbjct: 63 GNGGGNGNSGGGSGTGGNL 81



Score = 40.1 bits (93), Expect = 2e-05
Identities = 35/81 (43%), Positives = 45/81 (55%), Gaps = 1/81 (1%)

Query: 317 GGNGGGSGGGSGGGHGGGNGGGNGGGHGGGNGGGNGGGNGGGNGGGNGGGNGGGNGGGNG 376
GG+G G G+ G NGG G G GGG G+G + N G G G+G GGG+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSEN-NPWGGGSGSGIHWGGGSG 61

Query: 377 GGHGGGSGGHGNGNGGHGNGN 397
G+GGG+G G G+G GN +
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLS 82



Score = 39.7 bits (92), Expect = 2e-05
Identities = 28/89 (31%), Positives = 36/89 (40%)

Query: 440 GGGGGGSGGSGGGGGSGGSGGSGGAGGSGGQGNGGGSTGGHGIGGHGNGGGNGNGNGNGG 499
GG G G +GG G G GG +G G + + G G+G G G G+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 500 AGSGGANGVGNGSGGGGGGGSGGSAGGHG 528
GG G GSG GG + + G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 39.3 bits (91), Expect = 3e-05
Identities = 38/112 (33%), Positives = 45/112 (40%), Gaps = 2/112 (1%)

Query: 125 GSGGTASGNGGKGDGSSGGGTSGNGASGGGTSGSGTSGGGTSGGGTSGGGTSGGGTSGGG 184
G G + G+ GG +G G GG + GSG S GG SG G GG SG G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 185 TSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTS 236
GG + GG SG G G ++ G T G G S G S
Sbjct: 64 NGGGNGNSGGGSGTG--GNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALS 113



Score = 39.3 bits (91), Expect = 3e-05
Identities = 38/114 (33%), Positives = 45/114 (39%), Gaps = 2/114 (1%)

Query: 118 SGGMNGTGSGGTASGNGGKGDGSSGGGTSGNGASGGGTSGSGTSGGGTSGGGTSGGGTSG 177
SGG + G S +G G +G G G + G G S GG SG G GG SG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 178 GGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTS 231
G GG + GG SG G G ++ G T G G S G S
Sbjct: 62 HGNGGGNGNSGGGSGTG--GNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALS 113



Score = 39.3 bits (91), Expect = 3e-05
Identities = 30/89 (33%), Positives = 39/89 (43%)

Query: 438 GGGGGGGGSGGSGGGGGSGGSGGSGGAGGSGGQGNGGGSTGGHGIGGHGNGGGNGNGNGN 497
GG G G +G G G G GG G+G S GG G+G G G+G+
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 498 GGAGSGGANGVGNGSGGGGGGGSGGSAGG 526
G G G +G G+G+GG + A G
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91



Score = 38.9 bits (90), Expect = 4e-05
Identities = 39/81 (48%), Positives = 47/81 (58%), Gaps = 4/81 (4%)

Query: 313 GGNGGGNGGGSGGGSGGGHGGGNGGGNGGGHGGGNGGG---NGGGNGGGNGGGNGGGNGG 369
GG+G G+ G+ SG +GG G G GGG G+G N G G G+G GGG+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 370 GNGGGNGGGHGGGSGGHGNGN 390
GNGGGNG GGGSG GN +
Sbjct: 63 GNGGGNGNS-GGGSGTGGNLS 82



Score = 38.9 bits (90), Expect = 4e-05
Identities = 34/80 (42%), Positives = 39/80 (48%), Gaps = 4/80 (5%)

Query: 224 GTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGH---GGHGGGGHGDGGGNGG 280
G G G + G S G GG +G G GG + G G S + GG G G GGG+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 281 GNGVGNGGGHGGGHGGGGGG 300
GNG G G GGG G GG
Sbjct: 63 GNG-GGNGNSGGGSGTGGNL 81



Score = 35.1 bits (80), Expect = 7e-04
Identities = 32/112 (28%), Positives = 43/112 (38%)

Query: 80 AAGSGAGNTTGGGSPGNGGSGGSGAVGPAGVSGSTGSTSGGMNGTGSGGTASGNGGKGDG 139
+ G G G+ TG S +GG +G G + S N G G + + G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 140 SSGGGTSGNGASGGGTSGSGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTS 191
GG +GN G GT G+ ++ G T G G S G S
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALS 113



Score = 34.7 bits (79), Expect = 0.001
Identities = 37/114 (32%), Positives = 42/114 (36%), Gaps = 2/114 (1%)

Query: 113 STGSTSGGMNGTGSGGTASGNGGKGDGSSGGGTSGNGASGGGTSGSGTSGGGTSGGGTSG 172
S G G G S G G G GG + G+G S G SG G GG SG
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 173 GGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTSGGGTS 226
G GG + GG SG G G ++ G T G G S G S
Sbjct: 62 HGNGGGNGNSGGGSGTG--GNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALS 113



Score = 33.5 bits (76), Expect = 0.002
Identities = 26/71 (36%), Positives = 32/71 (45%)

Query: 458 SGGSGGAGGSGGQGNGGGSTGGHGIGGHGNGGGNGNGNGNGGAGSGGANGVGNGSGGGGG 517
SGG G +G G GG G G G +G+G + GG +G G GGG G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 518 GGSGGSAGGHG 528
G+GG G G
Sbjct: 62 HGNGGGNGNSG 72



Score = 32.8 bits (74), Expect = 0.004
Identities = 30/103 (29%), Positives = 37/103 (35%)

Query: 79 GAAGSGAGNTTGGGSPGNGGSGGSGAVGPAGVSGSTGSTSGGMNGTGSGGTASGNGGKGD 138
A S +GN GG + G G S G + + G SG G G GG G+
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGN 70

Query: 139 GSSGGGTSGNGASGGGTSGSGTSGGGTSGGGTSGGGTSGGGTS 181
G GT GN ++ G T G G S G S
Sbjct: 71 SGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALS 113



Score = 32.4 bits (73), Expect = 0.004
Identities = 31/84 (36%), Positives = 37/84 (44%), Gaps = 5/84 (5%)

Query: 446 SGGSGGGGGSGGSGGSGGA-GGSGGQGNGGGSTGGHGIGGHGNGGGNGNGNGNGGAGSGG 504
SGG G G +G SG GG G G GGG++ G G N G G+G+G G G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 505 ANGVGNGSGGGGGGGSGGSAGGHG 528
GG G G G GG+
Sbjct: 62 ----HGNGGGNGNSGGGSGTGGNL 81


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Bamb_2581  PF06872  32     0.007    EspG protein
		>PF06872#EspG protein

Length = 398

Score = 31.6 bits (71), Expect = 0.007
Identities = 32/121 (26%), Positives = 53/121 (43%), Gaps = 22/121 (18%)

Query: 145 DGAALDTALAEAGLCAALIRTPDE------WAAHDQARALASLPLFEIERIGDAPVEAIG 198
DG++L ++ + L A I TP+ A++Q R L SLP I R P +
Sbjct: 101 DGSSLRISVTNSELIEAEIHTPNNEKFLVLLEANEQNRLLQSLP---INR--HMPYIQVH 155

Query: 199 HGEPDQPLAGV----RVLDLTRIIAGPVAGRTLASHGAQTLLVNGPHLPNIASLVIDNGR 254
H P + L + ++L T ++ TL H QT ++G +++ +D R
Sbjct: 156 HTLPQEELTDLLSMHKLLSFTSKLSA-----TLIPHNNQTDPLSGL--TPFSTVFMDTSR 208

Query: 255 G 255
G
Sbjct: 209 G 209


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Bamb_2584  TCRTETB  140    7e-39    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 140 bits (355), Expect = 7e-39
Identities = 92/408 (22%), Positives = 175/408 (42%), Gaps = 15/408 (3%)

Query: 28 VMLWLVATGFFMQTLDATIVNTALPSMAVSLGESPLRMQSVVIAYSLTMAVMIPVSGWLA 87
+++WL FF L+ ++N +LP +A + P V A+ LT ++ V G L+
Sbjct: 15 ILIWLCILSFF-SVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLS 73

Query: 88 DTFGTRRVFFSAILVFSLGSLLCANAHTLQQLVVF-RVVQGVGGAMLLPVGRLAVLRTFP 146
D G +R+ I++ GS++ H+ L++ R +QG G A + + V R P
Sbjct: 74 DQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIP 133

Query: 147 AERYLSALSFVAIPGLIGPLIGPTLGGWLVKIASWHWIFLINVPVGVAGCIATFYSMPDS 206
E A + +G +GP +GG + HW +L+ +P+ + +
Sbjct: 134 KENRGKAFGLIGSIVAMGEGVGPAIGGMIAH--YIHWSYLLLIPMITIITVPFLMKLLKK 191

Query: 207 RNPAVGRFDLKGYLLLTIGMVAISLSLDGLADLGMQHAAVLVLLILSLACFVAYGLYAVR 266
G FD+KG +L+++G+V L + L++ +LS FV + +
Sbjct: 192 EVRIKGHFDIKGIILMSVGIVFFMLFTT------SYSISFLIVSVLSFLIFVKHIR---K 242

Query: 267 APLPIFSLELFKIHTFSVGLLGNLFARIGSGAMPYLIPLLLQVSLGYSAFEAG-LMMLPV 325
P L K F +G+L ++P +++ S E G +++ P
Sbjct: 243 VTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPG 302

Query: 326 AAAGMFSKRIITQLITRHGYRKVLLVNTIMVGVMMASFALMRDTVPVWVKVVHLALFGGF 385
+ + I L+ R G VL + + V + + + +T ++ ++ + + GG
Sbjct: 303 TMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGL 362

Query: 386 NSMQFTAMNTLTLKDLGTGGASSGNSLFSLVQMLSMSLGVTVAGALLA 433
+ + T ++T+ L A +G SL + LS G+ + G LL+
Sbjct: 363 SFTK-TVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
Bamb_2589  LCRVANTIGEN  28     0.020    Low calcium response V antigen signature.
		>LCRVANTIGEN#Low calcium response V antigen signature.

Length = 326

Score = 28.5 bits (63), Expect = 0.020
Identities = 12/46 (26%), Positives = 22/46 (47%)

Query: 175 RTHIRLSQEKLAAMLSLTRQTTNQLLKALQADGVVRLHVGEIELVD 220
R+ +R +L A L + ++ K L + G + +H I L+D
Sbjct: 150 RSKLREELAELTAELKIYSVIQAEINKHLSSSGTINIHDKSINLMD 195


36  Bamb_2622  Bamb_2647  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Bamb_2622  3       15       1.469828   hypothetical protein
Bamb_2623  3       13       1.726415   flavin reductase domain-containing protein
Bamb_2624  4       11       2.325792   AsnC family transcriptional regulator
Bamb_2625  4       11       3.296324   cyclase family protein
Bamb_2626  3       11       3.658645   kynureninase
Bamb_2627  3       11       3.821206   tryptophan 2,3-dioxygenase
Bamb_2628  3       9        4.470440   major facilitator superfamily transporter
Bamb_2629  2       9        4.695762   2-dehydropantoate 2-reductase
Bamb_2630  1       10       4.501898   aldehyde dehydrogenase
Bamb_2631  1       11       4.774955   benzoylformate decarboxylase
Bamb_2632  2       11       4.828359   LysR family transcriptional regulator
Bamb_2633  2       11       4.578981   mannitol dehydrogenase domain-containing
Bamb_2634  1       10       3.933395   xylulokinase
Bamb_2635  2       12       4.035253   DeoR family transcriptional regulator
Bamb_2636  1       13       3.471343   major facilitator superfamily transporter
Bamb_2637  0       13       2.102385   beta-lactamase
Bamb_2638  -2      15       0.770246   LysR family transcriptional regulator
Bamb_2639  -1      15       1.635799   ABC transporter-like protein
Bamb_2640  0       13       2.425499   HAD family hydrolase
Bamb_2641  0       13       1.846500   binding-protein-dependent transport system inner
Bamb_2642  1       13       2.399992   binding-protein-dependent transport system inner
Bamb_2643  -1      14       2.676807   extracellular solute-binding protein
Bamb_2644  2       14       3.423965   tagatose-6-phosphate kinase
Bamb_2645  0       13       2.761235   ribokinase-like domain-containing protein
Bamb_2646  -1      11       2.054395   sorbitol dehydrogenase
Bamb_2647  -2      9        3.570669   ferric uptake regulator family protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Bamb_2628  TCRTETA  43     2e-06    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 42.9 bits (101), Expect = 2e-06
Identities = 48/210 (22%), Positives = 78/210 (37%), Gaps = 8/210 (3%)

Query: 31 VLLAALAIVLDGFDGQLIGFAIPVLIREWGITRGA---FAPAVAAGLIGMGIGSACAGIV 87
+++ + LD LI +P L+R+ + + +A + + G +
Sbjct: 7 LIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66

Query: 88 ADRFGRRQAVIGSVFLFGVATCAIGFAPDVATIALLRFCAGLGIGGALPTATTMTAEYTP 147
+DRFGRR ++ S+ V + AP + + + R AG+ G A A+ T
Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGI-TGATGAVAGAYIADITD 125

Query: 148 ARRRTMMVTATIVCVPLGGMLAGLFAHEVLPRYGWRGLFFAGGALPL---VLGFVLMRAL 204
R C GM+AG ++ + FFA AL + G L+
Sbjct: 126 GDERARHFGFMSACFGF-GMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPES 184

Query: 205 PESPRYLARRPARWPELGALLARMQRPVAP 234
+ R RR A P AR VA
Sbjct: 185 HKGERRPLRREALNPLASFRWARGMTVVAA 214


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Bamb_2636  TCRTETB  38     7e-05    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 37.6 bits (87), Expect = 7e-05
Identities = 36/155 (23%), Positives = 65/155 (41%), Gaps = 5/155 (3%)

Query: 31 LLALATAGFITILTEALPAGLLPLMSVDLRVTEALIGQLVTVYALGSIVAAIPLVAATRA 90
L+ L F ++L E + LP ++ D A + T + L + +
Sbjct: 16 LIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ 75

Query: 91 MRRRRLLLAALAGFVVSNALTAAS-PYYALTLAARFVAGMSAGLLWALLAGYASRMVDAS 149
+ +RLLL + + + +++L + ARF+ G A AL+ +R +
Sbjct: 76 LGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKE 135

Query: 150 LRGRAIAVAMLGAPVAMSIGI-PA-GTALGAMFGW 182
RG+A ++G+ VAM G+ PA G + W
Sbjct: 136 NRGKAF--GLIGSIVAMGEGVGPAIGGMIAHYIHW 168


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
Bamb_2637  BLACTAMASEA  29     0.022    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 29.4 bits (66), Expect = 0.022
Identities = 15/67 (22%), Positives = 27/67 (40%), Gaps = 5/67 (7%)

Query: 65 REDTLFRLASVSKPIVTAAAMRLVAAGRIELDEPVAHW----LPAFRPTLRDGTPTDITL 120
R D F + S K ++ A + V AG +L+ + H+ L + P +T+
Sbjct: 57 RADERFPMMSTFKVVLCGAVLARVDAGDEQLERKI-HYRQQDLVDYSPVSEKHLADGMTV 115

Query: 121 RHLLSHT 127
L +
Sbjct: 116 GELCAAA 122


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Bamb_2639  PF05272  30     0.015    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.0 bits (67), Expect = 0.015
Identities = 14/35 (40%), Positives = 17/35 (48%)

Query: 32 VVFVGPSGCGKSTLMRMIAGLEDISSGELLIDGAK 66
VV G G GKSTL+ + GL+ S I K
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGK 633


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Bamb_2641  RTXTOXIND  28     0.044    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 28.3 bits (63), Expect = 0.044
Identities = 11/52 (21%), Positives = 18/52 (34%), Gaps = 5/52 (9%)

Query: 7 EGRRAPAAFDLVRR---ALPGALAWLIALLLFFPIFWMAITAFKTEQQAYAS 55
E PA +L+ P +A+ I L + + E A A+
Sbjct: 38 ENEFLPAHLELIETPVSRRPRLVAYFIMGFLVIAFILSVLG--QVEIVATAN 87


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Bamb_2643  MALTOSEBP  34     0.001    Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 34.3 bits (78), Expect = 0.001
Identities = 98/445 (22%), Positives = 165/445 (37%), Gaps = 77/445 (17%)

Query: 6 LDAAARCFAGAALATAACAASA------GTLTIATLNNPDMIELKKLSPAFEKANPDIKL 59
+ AR A +AL T +ASA G L I + L ++ FEK D +
Sbjct: 3 IKTGARILALSALTTMMFSASALAKIEEGKLVIWINGDKGYNGLAEVGKKFEK---DTGI 59

Query: 60 NWVILEENVLRQRATTDITTGSGQFDVMAIGTYETPQWGKRGWLAPMTGLPADYDLNDIV 119
+ + L ++ TG G D++ + + G LA +T P + +
Sbjct: 60 KVTVEHPDKLEEKFPQVAATGDGP-DIIFWAHDRFGGYAQSGLLAEIT--PDKAFQDKLY 116

Query: 120 KTARDSLSYNGQLYALPFYVESSMTFYRKDLFAAKGLKMPDQP-TYDQIAEFADKLTDKA 178
D++ YNG+L A P VE+ Y KDL +P+ P T+++I +L KA
Sbjct: 117 PFTWDAVRYNGKLIAYPIAVEALSLIYNKDL-------LPNPPKTWEEIPALDKEL--KA 167

Query: 179 KGTYGICLRGKAGWGENMAYVSTVVNTFGGRWFD-ENW-----NAQLTSPEWKKAIGFYV 232
KG + + + + ++ GG F EN + + + K + F V
Sbjct: 168 KGKSALMFNLQEPY-----FTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLV 222

Query: 233 NLLKK-----DGPPGASSNGFNENLTLTASGKCAMWIDATVAAGMLYNKQQSQVADKIGF 287
+L+K D + FN+ G+ AM I+ A N S+V G
Sbjct: 223 DLIKNKHMNADTDYSIAEAAFNK-------GETAMTINGPWAWS---NIDTSKV--NYGV 270

Query: 288 AAAPVAATPKGSHWLWAWALAVPKTSKQQDAARKFIA-WATSKQYIEMAGKDEGWASVPP 346
P ++ + + S ++ A++F+ + + + +E KD+
Sbjct: 271 TVLPTFKGQPSKPFVGVLSAGINAASPNKELAKEFLENYLLTDEGLEAVNKDK------- 323

Query: 347 GTRTSTYQRPEYKAAAPFSDFVLKAIETADPNDPSLKKV---PYTGVQYVGIPEFQSFGT 403
P LK+ E DP + G IP+ +F
Sbjct: 324 ----------------PLGAVALKSYEEELAKDPRIAATMENAQKGEIMPNIPQMSAFWY 367

Query: 404 VVGQAIAGAVAGQTTVDQALAAGQA 428
V A+ A +G+ TVD+AL Q
Sbjct: 368 AVRTAVINAASGRQTVDEALKDAQT 392


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_2646  DHBDHDRGNASE  130    8e-39    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 130 bits (327), Expect = 8e-39
Identities = 84/255 (32%), Positives = 124/255 (48%), Gaps = 7/255 (2%)

Query: 3 LEQKVAILTGAASGIGEAVAQRYLDEGARCVLVDVKPASGSLARLIEASPGR-AVAVTAD 61
+E K+A +TGAA GIGEAVA+ +GA VD P + R A A AD
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 62 VTRRDDIERIVATAVERFGGVDILFNNAALFDMRPLLDESWDVFDRLFSVNVKGLFFLMQ 121
V I+ I A G +DIL N A + + S + ++ FSVN G+F +
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 122 AVAQRMVEQGRGGKIVNMSSQAGRRGEALVSHYCATKAAVISYTQSAALALAPHRINVNG 181
+V++ M+++ R G IV + S ++ Y ++KAA + +T+ L LA + I N
Sbjct: 126 SVSKYMMDR-RSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184

Query: 182 IAPGVVDTPMWEQVDALFARYEQRPPG--EKKRLVGEAVPLGRMGAPGDLTGAALFLASA 239
++PG +T M + A EQ G E + +PL ++ P D+ A LFL S
Sbjct: 185 VSPGSTETDMQWSLWADENGAEQVIKGSLETFKT---GIPLKKLAKPSDIADAVLFLVSG 241

Query: 240 DADYITAQTLNVDGG 254
A +IT L VDGG
Sbjct: 242 QAGHITMHNLCVDGG 256


37  Bamb_2730  Bamb_2735  Y  N  N  Genomic Island
LocusTag   DNBias  CDNBias  %GCBias    Product
Bamb_2730  2       13       1.834368   glycerol-3-phosphate dehydrogenase
Bamb_2731  3       12       2.298054   glycerol kinase
Bamb_2732  4       11       2.988101   MIP family channel protein
Bamb_2733  3       10       2.964527   FAD-dependent pyridine nucleotide-disulfide
Bamb_2734  3       11       1.743482   Rieske (2Fe-2S) domain-containing protein
Bamb_2735  2       11       2.034221   fatty acid desaturase
38  Bamb_2807  Bamb_2829  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Bamb_2807  3       10       1.849858   peptidase M48, Ste24p
Bamb_2808  4       8        3.639596   glycosyl transferase family protein
Bamb_2809  4       9        3.597115   endoribonuclease L-PSP
Bamb_2810  2       10       3.271904   short chain dehydrogenase
Bamb_2811  2       10       4.414015   acyl carrier protein
Bamb_2812  2       10       5.026977   hypothetical protein
Bamb_2813  1       11       4.832434   acyl-coenzyme A synthetase/AMP-(fatty) acid
Bamb_2814  1       11       4.943628   acyltransferase-like protein
Bamb_2815  2       12       5.069002   hypothetical protein
Bamb_2816  1       11       4.872880   exporter-like protein
Bamb_2817  2       13       4.110523   polysaccharide deacetylase
Bamb_2818  1       13       1.996836   3-oxoacyl-ACP synthase
Bamb_2819  1       13       1.569900   hypothetical protein
Bamb_2820  1       13       0.614200   3-hydroxylacyl-(acyl carrier protein)
Bamb_2821  -1      12       -0.535243  hypothetical protein
Bamb_2822  -1      14       -0.323831  hypothetical protein
Bamb_2823  -1      14       -1.082877  acetoacetyl-CoA reductase
Bamb_2824  0       14       -0.384040  acetyl-CoA acetyltransferase
Bamb_2825  2       15       -1.004054  phasin family protein
Bamb_2826  3       11       -0.582071  cobyrinic acid a,c-diamide synthase
Bamb_2827  4       14       -0.677612  hypothetical protein
Bamb_2828  4       15       -0.879480  hypothetical protein
Bamb_2829  2       15       -0.857266  pili assembly chaperone
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_2810  DHBDHDRGNASE  103    6e-29    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 103 bits (258), Expect = 6e-29
Identities = 70/247 (28%), Positives = 108/247 (43%), Gaps = 14/247 (5%)

Query: 3 ALVTGGSGALGQAICTALAQAGHEVWVHANRNLAQAEAVAQQIVAAGGTAHAIAFDVTDG 62
A +TG + +G+A+ LA G + + N + E V + A A A DV D
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHI-AAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDS 69

Query: 63 DATLAALAPFVDD-APVQILVNNAGIHDDAPMAGMSRRQWHSVIDVTLNGFFNVTQPLLL 121
A A + P+ ILVN AG+ + +S +W + V G FN ++ +
Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSK 129

Query: 122 PMIRTRRGRIINIASVAGVTGNRGQVNYAAAKAGLIGATKSLSLELASRGITVNAVAPGI 181
M+ R G I+ + S YA++KA + TK L LELA I N V+PG
Sbjct: 130 YMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGS 189

Query: 182 IESPMAD------------LAFPAERIKQLVPAQRAGRPDEVAAMVAYLVSDAAAYVTGQ 229
E+ M + E K +P ++ +P ++A V +LVS A ++T
Sbjct: 190 TETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHITMH 249

Query: 230 VLSVNGG 236
L V+GG
Sbjct: 250 NLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_2823  DHBDHDRGNASE  126    1e-37    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 126 bits (318), Expect = 1e-37
Identities = 79/249 (31%), Positives = 122/249 (48%), Gaps = 10/249 (4%)

Query: 4 RIAVVTGGMGGLGEAISIRLNDAGYQVVVTYSPNNTGADRWLTEMHSAGREFHAYPVDVA 63
+IA +TG G+GEA++ L G + N ++ ++ + + R A+P DV
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIA-AVDYNPEKLEKVVSSLKAEARHAEAFPADVR 67

Query: 64 DYDSCQQCIEKIVREVGPVDILVNNAGITRDMTLRKLDKVNWDAVIRTNLDSVFNMTKPV 123
D + + +I RE+GP+DILVN AG+ R + L W+A N VFN ++ V
Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 124 CDGMVERGWGRIVNIASVNGSKGSIGQTNYAAAKAGMHGFTKSLALETARKGVTVNTVSP 183
M++R G IV + S YA++KA FTK L LE A + N VSP
Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187

Query: 184 GYLATKMVTAI--PQDILDTKI---LPQ----IPAGRLGKPEEVAGLVAYLCSEEAGFVT 234
G T M ++ ++ + I L IP +L KP ++A V +L S +AG +T
Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHIT 247

Query: 235 GSNIAINGG 243
N+ ++GG
Sbjct: 248 MHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_2824  ACRIFLAVINRP  30     0.018    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 30.2 bits (68), Expect = 0.018
Identities = 22/106 (20%), Positives = 40/106 (37%), Gaps = 6/106 (5%)

Query: 171 TRDEQDAFAALSQNKAEAAQKAGRFNDEIVPVSIPQGKGEPLQFATDEFVRHGVTAESLA 230
DE A A + + K E + F +I G F + + G+ ++L
Sbjct: 637 NGDENSAEAVIHRAKMELGKIRDGFVIPFNMPAI-VELGTATGFDFELIDQAGLGHDALT 695

Query: 231 GLKPAFAKE-----GTVTAANASGLNDGAAAVLVMSAQKAAALGLT 271
+ ++ + +GL D A L + +KA ALG++
Sbjct: 696 QARNQLLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVS 741


39  Bamb_2840  Bamb_2862  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Bamb_2840  5       13       0.223812   transporter
Bamb_2841  3       12       1.152603   formyltetrahydrofolate deformylase
Bamb_2842  3       11       1.901757   NUDIX hydrolase
Bamb_2843  2       11       1.236578   lysine exporter protein LysE/YggA
Bamb_2844  1       10       1.161639   adenine phosphoribosyltransferase
Bamb_2845  1       10       0.784368   sodium/hydrogen exchanger
Bamb_2846  -2      12       -0.395995  KpsF/GutQ family protein
Bamb_2847  -1      14       -1.954966  3-deoxy-D-manno-octulosonate 8-phosphate
Bamb_2848  0       15       -2.939846  hypothetical protein
Bamb_2849  1       14       -2.678059  OstA family protein
Bamb_2850  1       13       -3.128145  ABC transporter-like protein
Bamb_2851  1       14       -3.113950  RNA polymerase factor sigma-54
Bamb_2852  -2      11       -1.470927  sigma 54 modulation protein/ribosomal protein
Bamb_2853  -2      10       -0.095277  PTS IIA-like nitrogen-regulatory protein PtsN
Bamb_2854  -1      8        0.759609   HPr kinase/phosphorylase
Bamb_2855  0       8        1.234441   hypothetical protein
Bamb_2856  3       10       0.560903   peptidase S16, lon domain-containing protein
Bamb_2857  1       10       -0.053544  A/G-specific adenine glycosylase
Bamb_2858  2       12       -1.397581  formamidopyrimidine-DNA glycosylase
Bamb_2859  2       13       -1.753120  hypothetical protein
Bamb_2860  1       14       -2.707925  outer membrane lipoprotein LolB
Bamb_2861  0       17       -3.774043  4-diphosphocytidyl-2-C-methyl-D-erythritol
Bamb_2862  -1      14       -3.411031  *ribose-phosphate pyrophosphokinase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_2843  BCTERIALGSPF  29     0.015    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 28.6 bits (64), Expect = 0.015
Identities = 19/94 (20%), Positives = 37/94 (39%), Gaps = 9/94 (9%)

Query: 103 QPLRAIFRQSVIGNLMNPKVTLFFVVFL-----PQFVDPHGAQGVTLQMFE---LGALFM 154
Q +R+ +Q++I + V + V L P+ V+ L + +G
Sbjct: 163 QQMRSRIQQAMIYPCVLTVVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDA 222

Query: 155 LQTAAIFSLFGVGAGAIG-TWLKRRPKAGVWLDR 187
++T + L + AG + + R+ K V R
Sbjct: 223 VRTFGPWMLLALLAGFMAFRVMLRQEKRRVSFHR 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_2859  SYCDCHAPRONE  32  0.004  Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.
Length = 168

Score = 31.8 bits (72), Expect = 0.004
Identities = 17/102 (16%), Positives = 28/102 (27%), Gaps = 1/102 (0%)

Query: 454 PDDPDLRYDYAMAAEKTGHYATMEKQLRELIRTQPDNPQAYNALGYSLADRNQRLPEASK 513
D + Y A ++G Y K + L + + + LG Q A
Sbjct: 33 SDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQ-YDLAIH 91

Query: 514 LIDKALSLAPNDAYIMDSLGWVKYRMGDTTGAAKVLQRAFEL 555
+ + + G+ A L A EL
Sbjct: 92 SYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQEL 133



Score = 31.1 bits (70), Expect = 0.007
Identities = 21/112 (18%), Positives = 37/112 (33%), Gaps = 5/112 (4%)

Query: 478 KQLRELIRTQPDNPQAYNALGYSLADRNQRLPEASKLIDKALSLAPNDAYIMDSLGWVKY 537
L E+ D + +L ++ ++ + +A K+ L D+ LG +
Sbjct: 26 AMLNEIS---SDTLEQLYSLAFN-QYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQ 81

Query: 538 RMGDTTGAAKVLQRAFELQPNAEIGA-HLGEVLWKSGAQDDARIAWRAAQKL 588
MG A + H E L + G +A AQ+L
Sbjct: 82 AMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQEL 133


40  Bamb_2913  Bamb_2931  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias    %GCBias  Product
Bamb_2913       2       13  -0.131300  carboxyl-terminal protease
Bamb_2914       2       11  -0.119383  UBA/THIF-type NAD/FAD binding protein
Bamb_2915       0       10  -0.070918  phosphoenolpyruvate-protein phosphotransferase
Bamb_2916      -1        9  -0.250001  HPr family phosphocarrier protein
Bamb_2917      -2       10  -0.018956  PTS system fructose subfamily transporter
Bamb_2918      -1       10   1.768322  glutathione synthetase
Bamb_2919      -2       10   2.214829  glutamate--cysteine ligase
Bamb_2920      -2        9   2.706322  ammonium transporter
Bamb_2921       1       12   3.238112  nitrogen regulatory protein P-II
Bamb_2922       0       12   3.511259  hypothetical protein
Bamb_2923      -1       11   2.703787  Mg chelatase subunit ChlI
Bamb_2924      -1       13   1.194340  hypothetical protein
Bamb_2925      -1       13   0.989952  redoxin domain-containing protein
Bamb_2926       0       14   1.805121  two component sigma54 specific Fis family
Bamb_2927       0       13   1.902447  sensor signal transduction histidine kinase
Bamb_2928      -1       14   1.186976  C4-dicarboxylate transporter DctA
Bamb_2929       0       15   2.753742  acetate permease
Bamb_2930       0       14   3.007068  hypothetical protein
Bamb_2931       0       13   3.173642  L-carnitine dehydratase/bile acid-inducible
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_2915  PHPHTRNFRASE  595  0.0  Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.
Length = 572

Score = 595 bits (1537), Expect = 0.0
Identities = 218/578 (37%), Positives = 329/578 (56%), Gaps = 10/578 (1%)

Query: 4 SFTLHGIPVSRGIAIGRAYLIAPAALDVAHYLIEANQIDAEVERFRTAREVVHRELNALR 63
+ GI S G+AI +A++ +D+ I + E+E+ A E EL A++
Sbjct: 2 HHKITGIAASSGVAIAKAFIHLEPNVDIEKTSIT--DVSTEIEKLTAALEKSKEELRAIK 59

Query: 64 ADLTDDTPSEVGAFIDVHTMILSDAMLVQETIDLVRTRRYNVEWALTEQLELLTRHFDDI 123
++ H ++L D LV + + N E+AL E ++ F+ +
Sbjct: 60 DQTEASMGADKAEIFAAHLLVLDDPELVDGIKGKIENEQMNAEYALKEVSDMFVSMFESM 119

Query: 124 EDEYLRERKADIEQVVERVLKALAGAPSASQALDRAAARGQNEMIVVAHDIAPADMMQFK 183
++EY++ER ADI V +RVL L G + S A E +++A D+ P+D Q
Sbjct: 120 DNEYMKERAADIRDVSKRVLGHLIGVETGSLATI------AEETVIIAEDLTPSDTAQLN 173

Query: 184 SQSFQAFVTDLGGRTSHTAIVARSLGIPAAVGVQHASALIRQDDLIIVDGDQGIVIVDPA 243
Q + F TD+GGRTSH+AI++RSL IPA VG + + I+ D++IVDG +GIVIV+P
Sbjct: 174 KQFVKGFATDIGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVNPT 233

Query: 244 PIVLEEYSYRQSEKLLEQRKLQRLKFSPTQTLCGTKIELYANIELPDDAKAAVEAGAVGV 303
++ Y +++ ++++ +L P+ T G +EL ANI P D + G G+
Sbjct: 234 EEEVKAYEEKRAAFEKQKQEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGI 293

Query: 304 GLFRSEFLFMHQKEMPEEEEQFAAYKRAVEWMKGMPVTIRTIDVGADKPLEALDEGYETA 363
GL+R+EFL+M + ++P EEEQF AYK V+ M G PV IRT+D+G DK L L E
Sbjct: 294 GLYRTEFLYMDRDQLPTEEEQFEAYKEVVQRMDGKPVVIRTLDIGGDKELSYLQLPKEL- 352

Query: 364 PNPALGLRAIRWSLSEPQMFLTQLRAILRASAFGQVKILIPMLAHAQEIDQTLDLIREAK 423
NP LG RAIR L + +F TQLRA+LRAS +G +K++ PM+A +E+ Q +++E K
Sbjct: 353 -NPFLGFRAIRLCLEKQDIFRTQLRALLRASTYGNLKVMFPMIATLEELRQAKAIMQEEK 411

Query: 424 RQLDDAGLAYDPNVRIGAMIEIPAAAIALPLFLKRFDFLSIGTNDLIQYTLAIDRADNAV 483
+L G+ ++ +G M+EIP+ A+A LF K DF SIGTNDLIQYT+A DR + V
Sbjct: 412 DKLLSEGVDVSDSIEVGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAADRMNERV 471

Query: 484 AHLYDPLHPAVLHLISYTLREAKRAGVSVSVCGEMAGDPTLTRLLLGMGLTEFSMHPSQL 543
++LY P HPA+L L+ ++ A G V +CGEMAGD LLLG+GL EFSM + +
Sbjct: 472 SYLYQPYHPAILRLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLDEFSMSATSI 531

Query: 544 LVVKQEILRAHLKALEKPTADVLAAFEPEEVQAALKRL 581
L + ++L+ + L+ L EEV+ +K+
Sbjct: 532 LPARSQLLKLSKEELKPFAQKALMLDTAEEVEQLVKKT 569


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_2926  HTHFIS  453  e-159  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 453 bits (1168), Expect = e-159
Identities = 160/483 (33%), Positives = 240/483 (49%), Gaps = 47/483 (9%)

Query: 4 RLQVIYIEDDALVRRASVQSLQLAGFDVVGFESAEAADKAIVAETAGVIVSDIRLPGASG 63
++ +DDA +R Q+L AG+DV +A + I A ++V+D+ +P +
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 64 LDLLAQCRERVPDVPVILVTGHGDISMAVQAMRDGAYDFIEKPFAAERLIETVRRALERR 123
DLL + ++ PD+PV++++ A++A GAYD++ KPF LI + RAL
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 124 ELVLENHALRRELAGQNIVAPRIIGRSPAIEQVRKLIANVAPTDASVLINGDTGAGKELI 183
+ +L + ++GRS A++++ +++A + TD +++I G++G GKEL+
Sbjct: 123 K------RRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELV 176

Query: 184 ARSLHELSPRRDKPFIAVNCGALPEPMFESEMFGYEPGAFTGAAKRRVGKLEYASGGTLF 243
AR+LH+ RR+ PF+A+N A+P + ESE+FG+E GAFTGA R G+ E A GGTLF
Sbjct: 177 ARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLF 236

Query: 244 LDEIESMPLALQVKLLRVLQDGVLERLGSNQPIRVNCRVVAAAKGDMSELVAAGTFRRDL 303
LDEI MP+ Q +LLRVLQ G +G PIR + R+VAA D+ + + G FR DL
Sbjct: 237 LDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDL 296

Query: 304 LYRLNVVTIALPPLAERREDIVPLFEHFMLDAAVRYGRPAPVLTDRQRASLMQRDWPGNV 363
YRLNVV + LPPL +R EDI L HF + A + G + WPGNV
Sbjct: 297 YYRLNVVPLRLPPLRDRAEDIPDLVRHF-VQQAEKEGLDVKRFDQEALELMKAHPWPGNV 355

Query: 364 RELRNAADRFVL----------------------------------------GVADMPQD 383
REL N R +M Q
Sbjct: 356 RELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQY 415

Query: 384 AGASGDDAESDQTLKERIEQFERAVIAQALNQTGGAVAATADRLHVGKATLYEKMKRYGL 443
+ GD + + E +I AL T G AD L + + TL +K++ G+
Sbjct: 416 FASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475

Query: 444 SAK 446
S
Sbjct: 476 SVY 478


41  Bamb_2949  Bamb_2957  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias    %GCBias  Product
Bamb_2949      -2       10   3.126083  ABC transporter-like protein
Bamb_2950      -2       11   3.429468  hypothetical protein
Bamb_2951      -3        9   3.559329  5'-nucleotidase domain-containing protein
Bamb_2952      -1        8   3.640768  Sel1 domain-containing protein
Bamb_2953       0        9   2.863057  biotin--protein ligase
Bamb_2954       2       10   2.365699  pantothenate kinase
Bamb_2955       2       11   1.703327  hypothetical protein
Bamb_2956       1        9   1.360605  bifunctional heptose 7-phosphate kinase/heptose
Bamb_2957       2       10   1.731851  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_2954  PF03309  168  2e-53  Bvg accessory factor
		>PF03309#Bvg accessory factor

Length = 271

Score = 168 bits (426), Expect = 2e-53
Identities = 58/278 (20%), Positives = 95/278 (34%), Gaps = 40/278 (14%)

Query: 6 LLIDAGNSRIKWALADA---RRTLVDTGAFGHTRDGGADPDWSALPRPHGAWISNVAGAD 62
L ID N+ L +V + AD I + G D
Sbjct: 3 LAIDVRNTHTVVGLISGSGDHAKVVQQWRIRTEPEVTADE--------LALTIDGLIGDD 54

Query: 63 ---------------VAARLDALLDACWPGLPRTTIRSQPAQCGVTNGYTTPDQLGSDRW 107
V + +L+ WP +P I + G+ P ++G+DR
Sbjct: 55 AERLTGASGLSTVPSVLHEVRVMLEQYWPNVPHVLIEPG-VRTGIPLLVDNPKEVGADRI 113

Query: 108 AGLIGARAAFPGEHLLIATFGTATTLEALRADGRFTGGLIAPGWALMMRALGTHTAQLPT 167
+ A + +++ FG++ ++ + A G F GG IAPG + A +A L
Sbjct: 114 VNCLAAYHKYGTAAIVV-DFGSSICVDVVSAKGEFLGGAIAPGVQVSSDAAAARSAALRR 172

Query: 168 LTTDIASGLLAGAQAEPFQVDTPRSLSAGCLYAQAGLIE----RAWRDLADAWQAPVRLV 223
+ ++ +T + AG ++ AGL++ R D+ A V +V
Sbjct: 173 VELTRPRSVIGK--------NTVECMQAGAVFGFAGLVDGLVNRIRDDVDGFSGADVAVV 224

Query: 224 LAGGAADDVARALTLPHTRHDALILSGLALIAAEGAAQ 261
G A V L L L GL L+ A
Sbjct: 225 ATGHTAPLVLPDLRTVEHYDRHLTLDGLRLVFERNRAN 262


42  Bamb_2990  Bamb_3021  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias    %GCBias  Product
Bamb_2990      -2       10   3.128217  glutathione S-transferase domain-containing
Bamb_2991      -3       11   2.774888  K+ channel inward rectifier domain-containing
Bamb_2992      -2       12   3.433856  sulfate ABC transporter substrate-binding
Bamb_2993      -1       12   4.424844  dihydrodipicolinate synthase
Bamb_2994       1       13   4.359254  LysR family transcriptional regulator
Bamb_2995       1       12   4.254233  major facilitator superfamily transporter
Bamb_2996       1       12   3.832432  TonB-dependent siderophore receptor
Bamb_2997       5       11   4.889196  glutathione S-transferase domain-containing
Bamb_2998       3        8   3.820184  LysR family transcriptional regulator
Bamb_2999       3        8   3.573395  aldehyde oxidase and xanthine dehydrogenase
Bamb_3000       1        9   2.922900  FAD-binding molybdopterin dehydrogenase
Bamb_3001       1       10   2.753084  ferredoxin
Bamb_3002       1       12   2.354993  hypothetical protein
Bamb_3003       1       13   2.139712  phospholipase C
Bamb_3004       0       14   3.045258  hypothetical protein
Bamb_3005       2       15   2.594916  glyoxalase/bleomycin resistance
Bamb_3006       2       15   3.000852  D-isomer specific 2-hydroxyacid dehydrogenase
Bamb_3007       4       15   2.155602  hydroxymethylglutaryl-CoA lyase
Bamb_3008       4       13   1.606052  YbaK/prolyl-tRNA synthetase associated
Bamb_3009       4       13   1.291198  hypothetical protein
Bamb_3010       2       11   0.457857  hypothetical protein
Bamb_3011       0        9   0.645706  AsnC family transcriptional regulator
Bamb_3012      -1       10   0.794485  alpha/beta fold family hydrolase
Bamb_3013      -2       12   1.768758  2-nitropropane dioxygenase
Bamb_3014      -2       12   2.708826  LysR family transcriptional regulator
Bamb_3015      -1       15   2.647729  porin
Bamb_3016      -2       16   3.981278  hypothetical protein
Bamb_3017      -2       18   3.517897  bile acid:sodium symporter
Bamb_3018      -1       19   3.550590  diguanylate cyclase/phosphodiesterase
Bamb_3019       1       18   3.201593  LacI family transcriptional regulator
Bamb_3020       2       16   3.859814  N-acylglucosamine 2-epimerase
Bamb_3021       0       15   4.232407  ribokinase-like domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_2995  TCRTETA  32  0.003  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 32.5 bits (74), Expect = 0.003
Identities = 79/356 (22%), Positives = 118/356 (33%), Gaps = 42/356 (11%)

Query: 80 GIAADRFGDRRVLLTGLVATAAMLALMVITIVPSAHAVPPLM--RVVAAMC-CVGLLGGS 136
G +DRFG R VLL L A A+M A + L R+VA + G + G+
Sbjct: 64 GALSDRFGRRPVLLVSLAGAAVDYAIMAT-----APFLWVLYIGRIVAGITGATGAVAGA 118

Query: 137 V--NGSSGRAVMRWFGERERGLAMSIRQTAVPLGGGVGAALLPSLASHAGFAAVYGALML 194
+ + G R FG A P+ GG+ P AA L
Sbjct: 119 YIADITDGDERARHFG--FMSACFGFGMVAGPVLGGLMGGFSPHAPFF--AAAALNGLNF 174

Query: 195 LCAGSAALTWRWLHEPPDAPAAAHGPTAHRPATQQPPAAAAAAR-SPLASGRVWRIVLGI 253
L L E RP ++ A+ R + + + +
Sbjct: 175 L------TGCFLLPESH--------KGERRPLRREALNPLASFRWARGMTVVAALMAVFF 220

Query: 254 GALCAPQFAVLTFATVFLHDFG-RLGLAGISAAMVALQVGAMVMRVWSGRHTDRHGNRRA 312
Q + F GIS A + + ++ + +G R G RRA
Sbjct: 221 IMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGI-LHSLAQAMITGPVAARLGERRA 279

Query: 313 YLRGSVCVAAGSFALLAAATAGSPHVPLAAIVAILVFAGICVSAWHGVAYTELATLAGAN 372
+ G + G + LLA AT G P I+ +L GI + A + L+
Sbjct: 280 LMLGMIADGTG-YILLAFATRGWMAFP---IMVLLASGGIGMPALQAM----LSRQVDEE 331

Query: 373 HAGTALGMANTIVYLGLFATPLAIPPLLAVS--SWS-VVWLAAALIAGATYPLFAR 425
G G + L PL + A S +W+ W+A A + P R
Sbjct: 332 RQGQLQGSLAALTSLTSIVGPLLFTAIYAASITTWNGWAWIAGAALYLLCLPALRR 387


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_3002  FRAGILYSIN  30  0.001  Fragilysin metallopeptidase (M10C) enterotoxin signat...
		>FRAGILYSIN#Fragilysin metallopeptidase (M10C) enterotoxin signature.
Length = 405

Score = 29.7 bits (66), Expect = 0.001
Identities = 14/39 (35%), Positives = 22/39 (56%), Gaps = 2/39 (5%)

Query: 1 MRRVALVVVLCAATFVAACSDDAPHDAHTADGSPPAGAS 39
M+ V L+++L A +AACS++A + D P AS
Sbjct: 9 MKNVKLLLMLGTAALLAACSNEADSLTTSID--APVTAS 45


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_3015  ECOLNEIPORIN  86  3e-21  E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 86.4 bits (214), Expect = 3e-21
Identities = 81/353 (22%), Positives = 129/353 (36%), Gaps = 46/353 (13%)

Query: 22 VAAAAPVHAQSSVSLYGQVDEWIGATKFPGGDRAWNV-----SGGGMSTSYWGLHGAEDL 76
AA PV A + V+LYG + + ++ + A +G S G G EDL
Sbjct: 9 TLAALPVAAMADVTLYGTIKAGVETSRSVAHNGAQAASVETGTGIVDLGSKIGFKGQEDL 68

Query: 77 GNGYKAIFTLESFFRAQNGQFGRFQGDTFFARNAYVGVSSPYGTVTAGRLTTHLFLSTIL 136
GNG KAI+ +E G + R +++G+ +G + GRL
Sbjct: 69 GNGLKAIWQVEQKASIAGTDSG------WGNRQSFIGLKGGFGKLRVGRL---------- 112

Query: 137 FNPFYDSYTFSPMVYHVFLGLGTFPTYPSDQGAVGDSGWNNALSYTSPSFGGLNFGAMYA 196
+ D+ +P ++ A ++ + Y SP F GL+ YA
Sbjct: 113 NSVLKDTGDINPWD-------SKSDYLGVNKIAEPEARLISV-RYDSPEFAGLSGSVQYA 164

Query: 197 LGNTAGDNRSKKWSAQFNYANGPFAATAVYQYVNFNNGPQDLSSLVTGMKSQGIGLVGAT 256
L + AG + S+ + A FNY NG F Y + ++ V K Q LV
Sbjct: 165 LNDNAGRHNSESYHAGFNYKNGGFFVQYGGAYKRHHQVQEN----VNIEKYQIHRLVS-G 219

Query: 257 YDLKYVKLFGQYMYTKNDQVAGSWHVNTAQGGVSVPLG--VGNAMASYAY------SRDG 308
YD + + ++ ++ + + +Q V+ L GN +Y S D
Sbjct: 220 YDNDALYAS-VAVQQQDAKLVEENYSHNSQTEVAATLAYRFGNVTPRVSYAHGFKGSFDA 278

Query: 309 GGLDQTRQTWAVGYDYPLSKRTDVYAAYM---NDHISGLSSGNTFGAGIRAKF 358
+ VG +Y SKRT + G G+R KF
Sbjct: 279 TNYNNDYDQVVVGAEYDFSKRTSALVSAGWLQEGKGESKFVSTAGGVGLRHKF 331


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_3019  HTHTETR  30  0.014  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 29.6 bits (66), Expect = 0.014
Identities = 12/96 (12%), Positives = 29/96 (30%), Gaps = 5/96 (5%)

Query: 2 GTTIRDVARAAEVSIGTVSRALKNQPGLSEATRARIVE-----IAQRLGYDPAQLRPRIR 56
T++ ++A+AA V+ G + K++ L + P +R
Sbjct: 31 STSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQAKFPGDPLSVLR 90

Query: 57 RLTFLLHRQHNRFPASPFFSHVLHGVEDACRERGIV 92
+ + ++ + E +V
Sbjct: 91 EILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVV 126


43  Bamb_3063  Bamb_3071  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias    %GCBias  Product
Bamb_3063       3       19  -0.197373  flagellar basal body P-ring protein
Bamb_3064       4       21  -0.832291  flagellar basal body L-ring protein
Bamb_3065       5       21  -0.676999  flagellar basal body rod protein FlgG
Bamb_3066       3       16   1.466168  flagellar basal body rod protein FlgF
Bamb_3067       3       16   1.480704  flagellar hook protein FlgE
Bamb_3068       1       14   2.874994  flagellar basal body rod modification protein
Bamb_3069       1       12   2.909444  flagellar basal body rod protein FlgC
Bamb_3070      -1       12   3.192629  flagellar basal body rod protein FlgB
Bamb_3071      -2        9   3.317358  flagellar basal body P-ring biosynthesis protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_3063  FLGPRINGFLGI  370  e-129  Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 370 bits (952), Expect = e-129
Identities = 164/378 (43%), Positives = 221/378 (58%), Gaps = 21/378 (5%)

Query: 19 IAAALVLAACAF---GAPGAHAERLKDLAQIQGVRDNPLIGYGLVVGLDGTGDQTMQTPF 75
IAAALV +A F A R+KD+A +Q RDN LIGYGLVVGL GTGD +PF
Sbjct: 7 IAAALVFSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPF 66

Query: 76 TTQTLANMLANLGISINNGSANGGPSSLSNMQLKNVAAVMVTATLPPFARPGEALDVTVS 135
T Q++ ML NLGI+ G +N KN+AAVMVTA LPPFA PG +DVTVS
Sbjct: 67 TEQSMRAMLQNLGITTQGGQSN----------AKNIAAVMVTANLPPFASPGSRVDVTVS 116

Query: 136 SLGNAKSLRGGTLLLTPLKGADGQVYALAQGNMAVGGAGASANGSRVQVNQLAAGRIVGG 195
SLG+A SLRGG L++T L GADGQ+YA+AQG + V G A + + + + R+ G
Sbjct: 117 SLGDATSLRGGNLIMTSLSGADGQIYAVAQGALIVNGFSAQGDAATLTQGVTTSARVPNG 176

Query: 196 AIVERAVPNAIAQMNGVLQLQLNDMDYGTAQRIVSAVNS----SFGPGTATALDGRTIQL 251
AI+ER +P+ L LQL + D+ TA R+ VN+ +G A D + I +
Sbjct: 177 AIIERELPSKFKDSV-NLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAV 235

Query: 252 AAPADSAQQVAFMARLQNLDVSPDKAAAKVILNARTGSIVMNQMVTLQSCAVAHGNLSVV 311
P + MA ++NL V D AKV++N RTG+IV+ V + AV++G L+V
Sbjct: 236 QKPRVA-DLTRLMAEIENLTVETD-TPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQ 293

Query: 312 VNTQPVVSQPGPFSNGQTVVAQQSQIQLKQDNGALKMVTAGANLADVVKALNSLGATPAD 371
V P V QP PFS GQT V Q+ I Q+ + + G +L +V LNS+G
Sbjct: 294 VTESPQVIQPAPFSRGQTAVQPQTDIMAMQEGSKVA-IVEGPDLRTLVAGLNSIGLKADG 352

Query: 372 LMSILQAMKAAGALRADL 389
+++ILQ +K+AGAL+A+L
Sbjct: 353 IIAILQGIKSAGALQAEL 370


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_3064  FLGLRINGFLGH  215  5e-73  Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 215 bits (550), Expect = 5e-73
Identities = 129/222 (58%), Positives = 163/222 (73%), Gaps = 7/222 (3%)

Query: 14 AVCALAVAALAGCAQIPRDPIIQQPMTAQPPMPMSMQAPGSIY---NPGYAG-RPLFEDQ 69
A+ +L V +L GCA IP P++Q +AQP + A GSI+ P G +PLFED+
Sbjct: 10 AISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINYGYQPLFEDR 69

Query: 70 RPRNIGDILTIMIAENINATKSSGANTNRQGNTDFSVPTAG-FLGGLF--AKANMSAAGA 126
RPRNIGD LTI++ EN++A+KSS AN +R G T+F T +L GLF A+A++ A+G
Sbjct: 70 RPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNARADVEASGG 129

Query: 127 NKFAATGGASAANTFNGTITVTVTNVLPNGNLVVSGEKQMLINQGNEFVRFSGVVNPNTI 186
N F GGA+A+NTF+GT+TVTV VL NGNL V GEKQ+ INQG EF+RFSGVVNP TI
Sbjct: 130 NTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRFSGVVNPRTI 189

Query: 187 SGANSVYSTQVADAKIEYSSKGYINEAETMGWLQRFFLNIAP 228
SG+N+V STQVADA+IEY GYINEA+ MGWLQRFFLN++P
Sbjct: 190 SGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSP 231


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_3065  FLGHOOKAP1  44  4e-07  Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 43.8 bits (103), Expect = 4e-07
Identities = 9/42 (21%), Positives = 23/42 (54%)

Query: 220 EASNVNVVQELVNMIQTQRAYEINSKAVTTSDQMLQTVTQMK 261
S VN+ +E N+ + Q+ Y N++ + T++ + + ++
Sbjct: 505 SISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546



Score = 39.6 bits (92), Expect = 9e-06
Identities = 17/80 (21%), Positives = 33/80 (41%), Gaps = 14/80 (17%)

Query: 4 SLYIAATGMNAQQAQMDVISNNLANTSTNGFKASRAVFEDLLYQTIRQPGANSTQQTELP 63
+ A +G+NA QA ++ SNN+++ + G+ + + + L
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIM--------------AQANSTLG 48

Query: 64 SGLQLGTGVQQVATERLYTQ 83
+G +G GV +R Y
Sbjct: 49 AGGWVGNGVYVSGVQREYDA 68


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_3066  FLGHOOKAP1  28  0.035  Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 28.0 bits (62), Expect = 0.035
Identities = 9/34 (26%), Positives = 18/34 (52%)

Query: 4 LIYTAMTGASQSLDQQAIVANNLANASTTGFRAQ 37
LI AM+G + + +NN+++ + G+ Q
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQ 36


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_3067  FLGHOOKAP1  36  2e-04  Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 36.5 bits (84), Expect = 2e-04
Identities = 18/58 (31%), Positives = 25/58 (43%)

Query: 356 ISAPGSTNHGTLQGSALENSNVDLTSQLVNLITAQRNYQANAQTIKTQQTVDQTLINL 413
SA L S V+L + NL Q+ Y ANAQ ++T + LIN+
Sbjct: 488 SSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545



Score = 33.0 bits (75), Expect = 0.002
Identities = 20/78 (25%), Positives = 33/78 (42%), Gaps = 11/78 (14%)

Query: 6 GLSGLAGASNALDVIGNNIANANTVGFKSSTA----QFSDMYANSIATSVNTQIGIGTAL 61
+SGL A AL+ NNI++ N G+ T S + A +G G +
Sbjct: 7 AMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGG-------WVGNGVYV 59

Query: 62 SSVQQQFGQGTINTTNSS 79
S VQ+++ N ++
Sbjct: 60 SGVQREYDAFITNQLRAA 77


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_3069  FLGHOOKAP1  27  0.032  Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 26.8 bits (59), Expect = 0.032
Identities = 10/38 (26%), Positives = 17/38 (44%)

Query: 102 NVDPVQEMVNMISASRSYQANVETLNTAKTLMLKTLTI 139
V+ +E N+ + Y AN + L TA + + I
Sbjct: 508 GVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


44  Bamb_3100  Bamb_3112  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias    %GCBias  Product
Bamb_3100       0       11   3.377358  two component LuxR family transcriptional
Bamb_3101       0       11   4.028096  amino acid permease-associated protein
Bamb_3102       0       12   5.332165  hypothetical protein
Bamb_3103       2       10   5.354036  PepSY-associated TM helix domain-containing
Bamb_3104       0       11   4.555369  flagellar protein FhlB
Bamb_3105       1       12   3.393996  hypothetical protein
Bamb_3106       1       14   1.769747  hypothetical protein
Bamb_3107       2       12   2.393532  flagellar protein FliS
Bamb_3108       1       13   2.208108  flagellar hook-basal body complex subunit FliE
Bamb_3109       0       13   3.467260  flagellar MS-ring protein
Bamb_3110       0       11   3.560467  flagellar motor switch protein G
Bamb_3111       0       11   3.918552  flagellar assembly protein H
Bamb_3112      -1       10   3.392388  flagellar protein export ATPase FliI
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_3100  HTHFIS  88  2e-22  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 88.0 bits (218), Expect = 2e-22
Identities = 34/114 (29%), Positives = 55/114 (48%), Gaps = 2/114 (1%)

Query: 5 ILLVDDHAIVRQGVRHLLIDRGVAREVTEAETGSDAVAAVDRQTFDVILLDISLPDTNGI 64
IL+ DD A +R + L G +V + + D+++ D+ +PD N
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGY--DVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 65 ELLKRIKRKLPGTPVLMFSMYREDQYAVRALKAGASGYLSKTVNAAQMIGAIQQ 118
+LL RIK+ P PVL+ S A++A + GA YL K + ++IG I +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_3104  TYPE3IMSPROT  61  3e-14  Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.
Length = 354

Score = 60.5 bits (147), Expect = 3e-14
Identities = 17/79 (21%), Positives = 32/79 (40%), Gaps = 2/79 (2%)

Query: 9 AAALVYDPKGGDAAPRVVAKGYGLVAEMIVARARDAGLYVHTAPEMV-SLLMQVDLDDRI 67
A ++Y G P V K + + A + G+ + + +L +D I
Sbjct: 268 AIGILYKR-GETPLPLVTFKYTDAQVQTVRKIAEEEGVPILQRIPLARALYWDALVDHYI 326

Query: 68 PPQLYQAVADLLAWLYSLD 86
P + +A A++L WL +
Sbjct: 327 PAEQIEATAEVLRWLERQN 345


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_3108  FLGHOOKFLIE  65  3e-17  Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature.
Length = 103

Score = 65.1 bits (158), Expect = 3e-17
Identities = 46/112 (41%), Positives = 66/112 (58%), Gaps = 9/112 (8%)

Query: 8 ANVSGIGSVLQQMQAMAAQAGGGVASPTAALAGSGAATAGTFASAMKASLDKISGDQQHA 67
+ + GI V+ Q+QA A A + P + +FA + A+LD+IS Q A
Sbjct: 1 SAIQGIEGVISQLQATAMSARAQESLPQPTI---------SFAGQLHAALDRISDTQTAA 51

Query: 68 LGEAQAFEVGAANVSLNDVMVDMQKANIGFQFGLQVRNKLVSAYNDIMQMSV 119
+A+ F +G V+LNDVM DMQKA++ Q G+QVRNKLV+AY ++M M V
Sbjct: 52 RTQAEKFTLGEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 103


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_3109  FLGMRINGFLIF  477  e-165  Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 477 bits (1229), Expect = e-165
Identities = 250/550 (45%), Positives = 364/550 (66%), Gaps = 23/550 (4%)

Query: 52 ISRMKGNPKLPFVIAVAFAIAAITALVLWSRAPDYRVLYSNLSDRDGGAIIAALQQANVP 111
++R++ NP++P ++A + A+A + A+VLW++ PDYR L+SNLSD+DGGAI+A L Q N+P
Sbjct: 16 LNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQMNIP 75

Query: 112 YKFADAGGAILVPSNQVHETRLKLAALGLPKGGSVGFELMDNQKFGISQFAEQVNYQRAL 171
Y+FA+ GAI VP+++VHE RL+LA GLPKGG+VGFEL+D +KFGISQF+EQVNYQRAL
Sbjct: 76 YRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQFSEQVNYQRAL 135

Query: 172 EGELQRTIESINAVRGARVHLAIPKPSVFVRDKEAPSASVFVDLYPGRVLDEGQVQAITR 231
EGEL RTIE++ V+ ARVHLA+PKPS+FVR++++PSASV V L PGR LDEGQ+ A+
Sbjct: 136 EGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALDEGQISAVVH 195

Query: 232 MVSSGVPDMPAKNVTIVDQDGNLLTQTASASG-LDASQLKYVQQVERNTQKRIDSILAPI 290
+VSS V +P NVT+VDQ G+LLTQ+ ++ L+ +QLK+ VE Q+RI++IL+PI
Sbjct: 196 LVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDLNDAQLKFANDVESRIQRRIEAILSPI 255

Query: 291 FGTGNARSQVSADIDFSKLEQTSESYSPNGTPQQAAIRSQQTSSATELAQGGASGVPGAL 350
G GN +QV+A +DF+ EQT E YSPNG +A +RS+Q + + ++ G GVPGAL
Sbjct: 256 VGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVGAGYPGGVPGAL 315

Query: 351 SNTPPQPASAPIVA-----GNGQN---------GAQSTPVSDRKDQTTNYELDKTIRHVE 396
SN P P API N QN + P S ++++T+NYE+D+TIRH +
Sbjct: 316 SNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSNYEVDRTIRHTK 375

Query: 397 QPMGNVKRLSVAVVVNYQPVADAKGHVTMQPLPPPKLAQVEQLVKDAMGYDEKRGDSVNV 456
+G+++RLSVAVVVNY+ +AD K PL ++ Q+E L ++AMG+ +KRGD++NV
Sbjct: 376 MNVGDIERLSVAVVVNYKTLADGKP----LPLTADQMKQIEDLTREAMGFSDKRGDTLNV 431

Query: 457 VNSAFSSVGDPYADLPWWRQPDMIAMAKEAAKWLGIAAAAAALYFMFVRPAMRRAFPPPE 516
VNS FS+V + +LP+W+Q I A +WL + A L+ VRP + R +
Sbjct: 432 VNSPFSAVDNTGGELPFWQQQSFIDQLLAAGRWLLVLVVAWILWRKAVRPQLTRRVEEAK 491

Query: 517 PVAPALAAPDDPVALDGLPAPDRADEPDPLLLGFENEKNRYERNLDYARTIARQDPKIVA 576
+ + + R + + L N++ E R ++ DP++VA
Sbjct: 492 AAQEQAQVRQE--TEEAV--EVRLSKDEQLQQRRANQRLGAEVMSQRIREMSDNDPRVVA 547

Query: 577 TVVKNWVSDE 586
V++ W+S++
Sbjct: 548 LVIRQWMSND 557


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_3110  FLGMOTORFLIG  297  e-102  Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 297 bits (763), Expect = e-102
Identities = 113/324 (34%), Positives = 188/324 (58%)

Query: 5 GLTKSALLLMSIGEEEAAEVFKFLAPREVQKIGAAMAALKNVTREQVEGVLQEFAKEAEQ 64
G K+A+LL+SIG E +++VFK+L+ E++ + +A L+ +T E + VL EF +
Sbjct: 17 GKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFKELMMA 76

Query: 65 HTALSLDSSEYIRSVLTKALGEDKAGVLIDRILQGSDTSGIEGLKWMDSGAVAELIKNEH 124
+ +Y R +L K+LG KA +I+ + + E ++ D + I+ EH
Sbjct: 77 QEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFVRRADPANILNFIQQEH 136

Query: 125 PQIIATILVHLDRDQASEIASCFTERLRNDVLLRIATLDGIQPAALRELDDVLTGLLSGS 184
PQ IA IL +LD +AS I S ++ +V RIA +D P +RE++ VL L+
Sbjct: 137 PQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVLEKKLASL 196

Query: 185 DNLKRSPMGGIRTAAEILNFMTSNHEEGVLESVRQYDADLAQKIVDQMFVFENLLDLEDR 244
+ + GG+ EI+N E+ ++ES+ + D +LA++I +MFVFE+++ L+DR
Sbjct: 197 SSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFEDIVLLDDR 256

Query: 245 AIQMVLKEVESETLIIALKGAPPALRQKFLANMSQRAAELLAEDLDARGPVRVSEVETQQ 304
+IQ VL+E++ + L ALK +++K NMS+RAA +L ED++ GP R +VE Q
Sbjct: 257 SIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTRRKDVEESQ 316

Query: 305 RRILQIVRNLAESGQIVIGGKAED 328
++I+ ++R L E G+IVI E+
Sbjct: 317 QKIVSLIRKLEEQGEIVISRGGEE 340


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_3111  FLGFLIH  112  9e-33  Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 112 bits (281), Expect = 9e-33
Identities = 71/213 (33%), Positives = 114/213 (53%), Gaps = 10/213 (4%)

Query: 15 YQRWEMASFDPPPPPPPPDDA------AAAAAALAEELQRVRDAAHAEGHAAGHVEGQAL 68
++ W PP P A +L ++L +++ AH +G+ AG EG+
Sbjct: 7 WKTWTPDDLAPPQAEFVPIVEPEETIIEEAEPSLEQQLAQLQMQAHEQGYQAGIAEGRQQ 66

Query: 69 GYQAGFDQGREQGFEAGQAEAREQAAQLAA----LAASFREAVSSAEHDLASDLAQLALD 124
G++ G+ +G QG E G AEA+ Q A + A L + F+ + + + +AS L Q+AL+
Sbjct: 67 GHKQGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRLMQMALE 126

Query: 125 IAQQVVRQHVKHDPAALVAAVRDVLAAEPALSGAPHLAVNPADLPVVEAYLQDDLDTLGW 184
A+QV+ Q D +AL+ ++ +L EP SG P L V+P DL V+ L L GW
Sbjct: 127 AARQVIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGATLSLHGW 186

Query: 185 TVRTDTSIERGGCRAHAATGEVDATLPTRWQRV 217
+R D ++ GGC+ A G++DA++ TRWQ +
Sbjct: 187 RLRGDPTLHPGGCKVSADEGDLDASVATRWQEL 219


45  Bamb_3147  Bamb_3152  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias    %GCBias  Product
Bamb_3147       5       19  -2.990832  outer membrane efflux protein
Bamb_3148       6       20  -3.256783  HlyD family type I secretion membrane fusion
Bamb_3149       6       20  -3.359964  ABC transporter-like protein
Bamb_3150       7       21  -3.613702  hypothetical protein
Bamb_3151       6       20  -2.732561  OmpA/MotB domain-containing protein
Bamb_3152       6       19  -2.381907  YadA domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_3148  RTXTOXIND  250  1e-80  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 250 bits (639), Expect = 1e-80
Identities = 89/455 (19%), Positives = 188/455 (41%), Gaps = 59/455 (12%)

Query: 14 APRLRAGDAAYMSDIREALLVRSSAGAQLILYLIAIVLGAGLVWAHFARVEEVTRSEATV 73
P + ++ E + S +L+ Y I L + + +VE V + +
Sbjct: 31 TPVREKDENEFLPAHLELIETPVSRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKL 90

Query: 74 VSPSREQLIQSLEGGIVQSVAVREGEVVEKGQLLAKIDPARAQSSYREVLTKALELKASV 133
R + I+ +E IV+ + V+EGE V KG +L K+ A++ + + L+ +
Sbjct: 91 THSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQ 150

Query: 134 SRARAEAYGV------PLDFP----------EDVKRESGLVAQATATYRARRR------- 170
+R + + + L P E+V R + L+ + +T++ ++
Sbjct: 151 TRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLD 210

Query: 171 -------ALDEQVTALEKSQALVRREIAMSEPLAAKGLVSEVEILRMRRQSTDIAAQIAE 223
+ ++ E + + + L K +++ +L + + ++
Sbjct: 211 KKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRV 270

Query: 224 RRSR---------------------FTAEASTELSRLEQELAQTNEVLAGRADVLARTDV 262
+S+ F E +L + + LA + + +
Sbjct: 271 YKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVI 330

Query: 263 VAPMRGVVKNIRIRTAGGVVQSGEHIMEIAPLDGRVLVEARIKPSDVAFLRPGLPVLVKL 322
AP+ V+ +++ T GGVV + E +M I P D + V A ++ D+ F+ G ++K+
Sbjct: 331 RAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKV 390

Query: 323 SAYDFSIYGGLHGHVVSLGPDTLKDDQKAAMGRPDANYYRLMVETDSDALAAAGKRLPVL 382
A+ ++ YG L G V ++ D ++D + + +++ + + L+ K +P+
Sbjct: 391 EAFPYTRYGYLVGKVKNINLDAIEDQR-------LGLVFNVIISIEENCLSTGNKNIPLS 443

Query: 383 PGMQATVDIRTGEKTVLDYLLKPIF-KAREAFRER 416
GM T +I+TG ++V+ YLL P+ E+ RER
Sbjct: 444 SGMAVTAEIKTGMRSVISYLLSPLEESVTESLRER 478


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_3150  PF07675  32  0.022  Cleaved Adhesin
		>PF07675#Cleaved Adhesin

Length = 1358

Score = 31.6 bits (71), Expect = 0.022
Identities = 26/103 (25%), Positives = 44/103 (42%), Gaps = 8/103 (7%)

Query: 36 NDSKPTVSGRGDPGSTIHLLVDGVEVGSVVVGANGTWSVALTQPL-NDGEYRLTARASND 94
++ + S + GS + + DGV G+ V A+G +V +T+ + +G Y + SN
Sbjct: 259 PQNQASYSIQASAGSYVAISKDGVLYGTGVANASGVATVNMTKQITENGNYDVVITRSN- 317

Query: 95 VGMSVPSTSYGIQVDVTPPSQP--KIEAATEGAQPTLSGHAEA 135
IQ P QP + A +G + TL A +
Sbjct: 318 ----YLPVIKQIQAGEPSPYQPVSNLTATAQGQKVTLKWDAPS 356


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Bamb_3151  OMPADOMAIN  105  8e-29  OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 105 bits (262), Expect = 8e-29
Identities = 48/142 (33%), Positives = 69/142 (48%), Gaps = 12/142 (8%)

Query: 112 AQPQPVVQAQPAAEPVAQ-RHVLLQGSANFAFDSAALTPSARQELDRFLD--VNREARFR 168
+ PVV PA P Q +H L+ F F+ A L P + LD+ N + +
Sbjct: 194 GEAAPVVAPAPAPAPEVQTKHFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDG 253

Query: 169 RVTVTGYTDSQGAHAHNVRLSEARARAVATYLRTGGLHAEHFTTVGKGAAEPVASNATAE 228
V V GYTD G+ A+N LSE RA++V YL + G+ A+ + G G + PV N
Sbjct: 254 SVVVLGYTDRIGSDAYNQGLSERRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDN 313

Query: 229 GR---------AQNRRVEIELE 241
+ A +RRVEIE++
Sbjct: 314 VKQRAALIDCLAPDRRVEIEVK 335


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_3152  OMADHESIN     75     3e-15    Yersinia outer membrane adhesin signature.
		>OMADHESIN#Yersinia outer membrane adhesin signature.

Length = 455

Score = 74.5 bits (182), Expect = 3e-15
Identities = 66/172 (38%), Positives = 102/172 (59%), Gaps = 16/172 (9%)

Query: 676 GAGGLNAIAVGLQAVASSDHSVAIGSIAQTGVDQPYSVAMGSMVTTNGAGALAIGSRAKA 735
GAGGLNA A G+ HS+AIG+ A+ + +VA+G+ G ++AIG +KA
Sbjct: 59 GAGGLNASAKGI-------HSIAIGATAEAA--KGAAVAVGAGSIATGVNSVAIGPLSKA 109

Query: 736 NADNAVAVGNNGVAAVGKSSIAIGDKAMTAAGTVNSVAMGKSANVAQNVTDAIALGANAS 795
D+AV G A K +AIG +A T+ VA+G ++ + +++A+G ++
Sbjct: 110 LGDSAVTYGAASTAQ--KDGVAIGARASTSD---TGVAVGFNSKA--DAKNSVAIGHSSH 162

Query: 796 VASGNNGGIALGANSVADRGNALSVGSNSLQRQIVNVAKGTKNNDAVNVSQL 847
VA+ + IA+G S DR N++S+G SL RQ+ ++A GTK+ DAVNV+QL
Sbjct: 163 VAANHGYSIAIGDRSKTDRENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQL 214



Score = 70.7 bits (172), Expect = 6e-14
Identities = 66/177 (37%), Positives = 102/177 (57%), Gaps = 9/177 (5%)

Query: 58 AIGASASTRPATSGLGGAVAIGNKATAAGNNVAFGANASALGEEGAIALGTGANAGGKSS 117
A+G RP G GG N + +++A GA A A + A+A+G G+ A G +S
Sbjct: 46 ALGLEYPVRPPVPGAGGL----NASAKGIHSIAIGATAEA-AKGAAVAVGAGSIATGVNS 100

Query: 118 IALGNEAQATGWGAYALGRKAKAAAESSLAIGDSSMATRAGTMAIGSQAAAAAENAIAIG 177
+A+G ++A G A G A A + +AIG + + G +A+G + A A+N++AIG
Sbjct: 101 VAIGPLSKALGDSAVTYG-AASTAQKDGVAIGARASTSDTG-VAVGFNSKADAKNSVAIG 158

Query: 178 QSA--AARGADSLALGSYSEADRDNTVSVGTAGFERQIVNVGRGTQATDAVNIAQLK 232
S+ AA S+A+G S+ DR+N+VS+G RQ+ ++ GT+ TDAVN+AQLK
Sbjct: 159 HSSHVAANHGYSIAIGDRSKTDRENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQLK 215



Score = 68.0 bits (165), Expect = 4e-13
Identities = 58/161 (36%), Positives = 89/161 (55%), Gaps = 7/161 (4%)

Query: 3122 GLQAVSNSDHSVAIGSIAQTGVDQPYAVAMGSMVTTNGAGALAIGSRAKANADNAVAVGN 3181
GL A + HS+AIG+ A+ + AVA+G+ G ++AIG +KA D+AV G
Sbjct: 62 GLNASAKGIHSIAIGATAEAA--KGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGA 119

Query: 3182 NGVVAVGKSSIAIGDKAMTNAGTVNSIAIGTNANVQQNVADAIALGANSQALSSNSVALG 3241
K +AIG +A T+ +A+G N+ + AI ++ A S+A+G
Sbjct: 120 ASTAQ--KDGVAIGARASTSD---TGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIG 174

Query: 3242 ANSIANRANALSIGKAGAERQIVNVAKGTQDTDAVSLAQLK 3282
S +R N++SIG RQ+ ++A GT+DTDAV++AQLK
Sbjct: 175 DRSKTDRENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQLK 215



Score = 48.0 bits (113), Expect = 7e-07
Identities = 40/136 (29%), Positives = 63/136 (46%), Gaps = 3/136 (2%)

Query: 3981 GSGSNLIGGTGGSGKDSAEIVAAKPGNGNGNIAVGSGSQIVDGKNNAAAIGAGSKVSADN 4040
G S IG + DSA A +A+G+ + D A+G SK A N
Sbjct: 97 GVNSVAIGPLSKALGDSAVTYGAASTAQKDGVAIGARASTSD---TGVAVGFNSKADAKN 153

Query: 4041 GTALGQGASVSSGADNSVALGQGSQATEANTVSVGSDGHERRIVNVADGVKATDAVSKGQ 4100
A+G + V++ S+A+G S+ N+VS+G + R++ ++A G K TDAV+ Q
Sbjct: 154 SVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQ 213

Query: 4101 FDRALGGMQGQINDIS 4116
+ + Q N S
Sbjct: 214 LKKEIEKTQENTNKRS 229



Score = 43.7 bits (102), Expect = 1e-05
Identities = 30/57 (52%), Positives = 37/57 (64%)

Query: 2965 GREAIARGPESVAIGANAWATRPQAMALGSGSRANGVNSVAIGYNSVADDDNTVAVG 3021
G A A+G S+AIGA A A + A+A+G+GS A GVNSVAIG S A D+ V G
Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYG 118



Score = 43.3 bits (101), Expect = 2e-05
Identities = 33/98 (33%), Positives = 54/98 (55%), Gaps = 2/98 (2%)

Query: 2976 VAIGANAWATRPQAMALGSGSR--ANGVNSVAIGYNSVADDDNTVAVGNVGEERRVVHLA 3033
VA+G N+ A ++A+G S AN S+AIG S D +N+V++G+ R++ HLA
Sbjct: 141 VAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHESLNRQLTHLA 200

Query: 3034 AGVDDTDAVNMRQLTDAMHSANTKLDAKMTRMVRDVES 3071
AG DTDAVN+ QL + + + ++ + +
Sbjct: 201 AGTKDTDAVNVAQLKKEIEKTQENTNKRSAELLANANA 238



Score = 41.4 bits (96), Expect = 8e-05
Identities = 56/218 (25%), Positives = 93/218 (42%), Gaps = 17/218 (7%)

Query: 3223 AIALGANSQALSSNSVALGANSIANRANALSIG---KAGAERQIVNVAKGTQDTDAVSLA 3279
+IA+GA ++A +VA+GA SIA N+++IG KA + + A T D V++
Sbjct: 72 SIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKDGVAIG 131

Query: 3280 QLKGLADVVAGGTGFDKNGDVTAPTYTIDGKEYHNVNDALQAAAKSGGDGSSGTDPNAVA 3339
+D G N A G H + + A GD S N+V+
Sbjct: 132 ARASTSDT---GVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAI--GDRSKTDRENSVS 186

Query: 3340 YDGELKDKVTLAGQNGTTLSNVAAGKADTDAVNVSQLKSSGLVGEDGKSRAAVTYDKNTD 3399
E ++ L+++AAG DTDAVNV+QLK ++ ++ + N +
Sbjct: 187 IGHESLNR---------QLTHLAAGTKDTDAVNVAQLKKEIEKTQENTNKRSAELLANAN 237

Query: 3400 GTPNYKSATLAGEGGTTLTNVKAGALSATSTDAVNGSQ 3437
+ KS+++ G + A L +A S+
Sbjct: 238 AYADNKSSSVLGIANNYTDSKSAETLENARKEAFAQSK 275



Score = 37.6 bits (86), Expect = 0.001
Identities = 26/59 (44%), Positives = 39/59 (66%)

Query: 527 AAGMQANALGKNSVAIGSQANATNVDTLAIGSGARASGVNSIAIGVNSVAADANTVSIG 585
A G+ A+A G +S+AIG+ A A +A+G+G+ A+GVNS+AIG S A + V+ G
Sbjct: 60 AGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYG 118



Score = 35.6 bits (81), Expect = 0.004
Identities = 33/115 (28%), Positives = 60/115 (52%), Gaps = 17/115 (14%)

Query: 537 KNSVAIGSQANATNVDTLAIGSGARASGVNSIAIGVNS-VAAD---------------AN 580
K+ VAIG++A+ ++ +A+G ++A NS+AIG +S VAA+ N
Sbjct: 125 KDGVAIGARASTSDTG-VAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDREN 183

Query: 581 TVSIGDVGATRRIVNVSDGVDDTDAVNMKQLTNVMHSANTKLDAKMTRMVRDVES 635
+VSIG R++ +++ G DTDAVN+ QL + + + ++ + +
Sbjct: 184 SVSIGHESLNRQLTHLAAGTKDTDAVNVAQLKKEIEKTQENTNKRSAELLANANA 238



Score = 34.9 bits (79), Expect = 0.008
Identities = 52/215 (24%), Positives = 89/215 (41%), Gaps = 9/215 (4%)

Query: 785 TDAIALGANASVASGNNGGIALGANSVADRGNALSVGSNSLQRQIVNVAKGTKNNDAVNV 844
+IA+GA A A G +A+GA S+A N++++G S V G A +
Sbjct: 70 IHSIAIGATAEAAKG--AAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYG-----AAST 122

Query: 845 SQLTGVTNALGGGAGIGTDGNITAPTYKVGDTTYNNVGDALDAMAKNGGSDPNAVSYDSA 904
+Q GV A+G A G K +G + A +G S +
Sbjct: 123 AQKDGV--AIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTD 180

Query: 905 TKDKVTLAGGATGTTLSNVKAGTADMDAVNVSQLKSSGLIGEDGKSLAAITYDKNTDGTP 964
++ V++ + L+++ AGT D DAVNV+QLK ++ + + N +
Sbjct: 181 RENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQLKKEIEKTQENTNKRSAELLANANAYA 240

Query: 965 NYKSATLAGEGGTTLTNVKAGALSATSTDAVNGSQ 999
+ KS+++ G + A L +A S+
Sbjct: 241 DNKSSSVLGIANNYTDSKSAETLENARKEAFAQSK 275


46  Bamb_0018  Bamb_0032  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Bamb_0018  0       11       0.290407   hypothetical protein
Bamb_0019  1       12       0.103063   type III restriction enzyme, res subunit
Bamb_0020  -1      12       -0.357716  adenine-specific DNA-methyltransferase
Bamb_0021  -1      11       -0.519812  outer membrane protein (porin)-like protein
Bamb_0022  0       12       0.704316   two component transcriptional regulator
Bamb_0023  0       12       0.882275   sensor signal transduction histidine kinase
Bamb_0024  1       13       -0.387433  binding-protein-dependent transport system inner
Bamb_0025  0       13       0.929889   ABC transporter-like protein
Bamb_0026  0       13       1.300475   nitrate/sulfonate/bicarbonate ABC transporter
Bamb_0027  2       13       0.425162   flagellar biosynthetic protein FliR
Bamb_0028  2       12       0.010912   flagellar biosynthesis protein FliQ
Bamb_0029  1       13       -0.126569  flagellar biosynthesis protein FliP
Bamb_0030  0       13       1.137321   flagellar biosynthesis protein, FliO
Bamb_0031  2       14       0.488108   flagellar motor switch protein FliN
Bamb_0032  1       14       0.358797   flagellar motor switch protein FliM
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0018  cloacin       32     6e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 31.6 bits (71), Expect = 6e-04
Identities = 17/45 (37%), Positives = 22/45 (48%)

Query: 23 NAAAGGNGGGGGNGGGHGAGGSGGNAGGMSGGHMSGQALSNSNGF 67
G+G G G GHG GG GN+GG SG + A++ F
Sbjct: 46 WGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAF 90



Score = 27.8 bits (61), Expect = 0.012
Identities = 12/35 (34%), Positives = 15/35 (42%)

Query: 18 GAVSANAAAGGNGGGGGNGGGHGAGGSGGNAGGMS 52
G + GG G G GG +GG G G +S
Sbjct: 48 GGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLS 82



Score = 26.2 bits (57), Expect = 0.043
Identities = 13/35 (37%), Positives = 17/35 (48%)

Query: 18 GAVSANAAAGGNGGGGGNGGGHGAGGSGGNAGGMS 52
G+ S GG+G G G G G+ GGSG +
Sbjct: 49 GSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSA 83


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0021  ECOLNEIPORIN  58     2e-11    E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 57.5 bits (139), Expect = 2e-11
Identities = 55/252 (21%), Positives = 99/252 (39%), Gaps = 42/252 (16%)

Query: 34 LKRKTLALSIAAAGLCAGTHAHAQSSVQLYGLMDLSFPTYRTHADANGKHVIGMGNEGEP 93
+K+ +AL++AA A + V LYG + T R+ A NG +
Sbjct: 1 MKKSLIALTLAA------LPVAAMADVTLYGTIKAGVETSRSVAH-NGAQAASVETGTGI 53

Query: 94 WFSGSRWGLRGAEDIGGGTKIIFRLESEFVVANGQMEDEGQIFDRDAWVGVEDERFGKLT 153
GS+ G +G ED+G G K I+++E + +A + +R +++G++ FGKL
Sbjct: 54 VDLGSKIGFKGQEDLGNGLKAIWQVEQKASIAGT----DSGWGNRQSFIGLKGG-FGKLR 108

Query: 154 AGFQNTIARDSAAIYGDAYGSAKLTTEEGGWTNSNNFKQMIFYA---AGPTGTRYNNGLA 210
G N++ +D T + W + +++ + A A RY++
Sbjct: 109 VGRLNSVLKD--------------TGDINPWDSKSDYLGVNKIAEPEARLISVRYDS--- 151

Query: 211 WKKLFSNGIFASAGYQFSNSTAFATGSAYQLALGYNGGPFNVSGFYNHVNH-------NG 263
F+ G+ S Y +++ +Y Y G F V + H N
Sbjct: 152 --PEFA-GLSGSVQYALNDNAGRHNSESYHAGFNYKNGGFFVQYGGAYKRHHQVQENVNI 208

Query: 264 FRNQTFSVGGNY 275
+ Q + Y
Sbjct: 209 EKYQIHRLVSGY 220


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0022  HTHFIS        96     4e-25    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 95.7 bits (238), Expect = 4e-25
Identities = 28/126 (22%), Positives = 59/126 (46%), Gaps = 1/126 (0%)

Query: 2 KLLLVEDNAELAHWIVNLLRGEDFAVDCVGDGERADTVLKTERYDAVLLDMRLPGISGKE 61
+L+ +D+A + + L + V + + D V+ D+ +P + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VLARLRRRNDNVPVLMLTAHGSVDDKVDCFGAGADDYVVKPFESRELVARI-RALIRRQA 120
+L R+++ ++PVL+++A + + GA DY+ KPF+ EL+ I RAL +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 121 GVATTQ 126
+ +
Sbjct: 125 RPSKLE 130


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0023  PF06580       53     1e-09    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 52.9 bits (127), Expect = 1e-09
Identities = 24/128 (18%), Positives = 45/128 (35%), Gaps = 26/128 (20%)

Query: 338 LGERLDV--AGSDSLLTALV-----MNLVDNAVRY----TQPGGCVTVVARRGGDTVVLD 386
+RL + +++ V LV+N +++ GG + + + TV L+
Sbjct: 236 FEDRLQFENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLE 295

Query: 387 VVDNGPGIPAEARPHVFKRFYRVAADTEGSGLGLAIVRE-IAQAHGGSAMLAPGPGNRGI 445
V + G + E +G GL VRE + +G A + +
Sbjct: 296 VENTGSLALKNTK--------------ESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKV 341

Query: 446 VVTVRLPA 453
V +P
Sbjct: 342 NAMVLIPG 349


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0027  TYPE3IMRPROT  156    6e-49    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.

Length = 261

Score = 156 bits (396), Expect = 6e-49
Identities = 119/256 (46%), Positives = 168/256 (65%), Gaps = 1/256 (0%)

Query: 1 MFSVTYAQLNGWLTAFLWPFVRMLALVATAPVVGHAAVPVRVKIGIAAFMALVVAPTLGA 60
M VT Q WL + WP +R+LAL++TAP++ +VP RVK+G+A + +AP+L A
Sbjct: 1 MLQVTSEQWLSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA 60

Query: 61 MPDVTVFSAPGIWIVVTQFLIGIALGFTMQLVFAAVEAAGDFIGLSMGLGFATFFDPHSN 120
DV VFS +W+ V Q LIGIALGFTMQ FAAV AG+ IGL MGL FATF DP S+
Sbjct: 61 -NDVPVFSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASH 119

Query: 121 GATPVMGRFLNAIAMLAFLAVDGHLQVFAALAASFQTLPVSGDLLHAPGWRTLAAFGATV 180
PV+ R ++ +A+L FL +GHL + + L +F TLP+ G+ L++ + L G+ +
Sbjct: 120 LNMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLI 179

Query: 181 FEMGLLLALPVVAALLIANLALGILNRAAPQIGIFQIGFPVTMLVGLLLVQLMIPNLVPF 240
F GL+LALP++ LL NLALG+LNR APQ+ IF IGFP+T+ VG+ L+ ++P + PF
Sbjct: 180 FLNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPF 239

Query: 241 VSHLFDMGLDAMGRVL 256
HLF + + ++
Sbjct: 240 CEHLFSEIFNLLADII 255


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0028  TYPE3IMQPROT  66     4e-18    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 65.9 bits (161), Expect = 4e-18
Identities = 28/85 (32%), Positives = 44/85 (51%)

Query: 4 EQVMTLAHQAMMVGLLLAAPLLLVALAVGLVVSLFQAATQINESTLSFIPKLLAVAATLV 63
+ ++ ++A+ + L+L+ +VA +GL+V LFQ TQ+ E TL F KLL V L
Sbjct: 2 DDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLF 61

Query: 64 IAGPWMLTTMLDYLRQTLLHVATLG 88
+ W +L Y RQ + G
Sbjct: 62 LLSGWYGEVLLSYGRQVIFLALAKG 86


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0029  FLGBIOSNFLIP  289    e-101    Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature.

Length = 245

Score = 289 bits (742), Expect = e-101
Identities = 153/247 (61%), Positives = 196/247 (79%), Gaps = 4/247 (1%)

Query: 6 LRRAARFAPALILGLAPALACAQAAGLPAFNTSPGPNGGTTYSLSVQTMLLLTMLSFLPA 65
+RR AP L+ + P A LP + P P GG ++SL VQT++ +T L+F+PA
Sbjct: 1 MRRLLSVAPVLLWLITPLAF----AQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPA 56

Query: 66 MLLMMTSFTRIIIVLSLLRQALGTATTPPNQVLVGLAMFLTFFVMSPVLDRAYADGYKPF 125
+LLMMTSFTRIIIV LLR ALGT + PPNQVL+GLA+FLTFF+MSPV+D+ Y D Y+PF
Sbjct: 57 ILLMMTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPF 116

Query: 126 SDGSMPMEQAVRRGVAPFKTFMLKQTRETDLALFAKISKAAPMQGPEDVPLSLLVPAFVT 185
S+ + M++A+ +G P + FML+QTRE DL LFA+++ P+QGPE VP+ +L+PA+VT
Sbjct: 117 SEEKISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVT 176

Query: 186 SELKTGFQIGFTIFIPFLIIDMVVASVLMSMGMMMVSPSTVSLPFKLMLFVLVDGWQLLI 245
SELKT FQIGFTIFIPFLIID+V+ASVLM++GMMMV P+T++LPFKLMLFVLVDGWQLL+
Sbjct: 177 SELKTAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLV 236

Query: 246 GSLAQSF 252
GSLAQSF
Sbjct: 237 GSLAQSF 243


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0031  FLGMOTORFLIN  133    7e-43    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 133 bits (335), Expect = 7e-43
Identities = 75/132 (56%), Positives = 98/132 (74%), Gaps = 3/132 (2%)

Query: 31 AAQEDQGLDD-WAAALAEQNLQPVQAGATGAGVFQPLSKAAASSTHNDIEMILDIPVKMT 89
+ + LDD WA AL EQ ++ A VFQ L S DI++I+DIPVK+T
Sbjct: 8 SDENTGALDDLWADALNEQKATTTKSAADA--VFQQLGGGDVSGAMQDIDLIMDIPVKLT 65

Query: 90 VELGRTKIAIRNLLQLAQGSVVELDGMAGEPMDVLVNGCLIAQGEVVVVNDKFGIRLTDI 149
VELGRT++ I+ LL+L QGSVV LDG+AGEP+D+L+NG LIAQGEVVVV DK+G+R+TDI
Sbjct: 66 VELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGVRITDI 125

Query: 150 ITPAERIRKLNR 161
ITP+ER+R+L+R
Sbjct: 126 ITPSERMRRLSR 137


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0032  FLGMOTORFLIM  272    4e-92    Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 272 bits (697), Expect = 4e-92
Identities = 80/324 (24%), Positives = 158/324 (48%), Gaps = 10/324 (3%)

Query: 5 EFMSQEEVDALLKGV-TGETDTVDEQ--RDLSSVRPYNIATQERIVRGRMPGLEIINDRF 61
E +SQ+E+D LL + +G+ D + D + Y+ ++ + +M L ++++ F
Sbjct: 3 EVLSQDEIDQLLTAISSGDASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLMHETF 62

Query: 62 ARLLRIGIFNFMRRTAEISVSQVKVQKYSEFTRNLPIPTNLNLVHVKPLRGTSLFVFDPN 121
ARL + +R + V+ V Y EF R++P P+ L ++ + PL+G ++ DP+
Sbjct: 63 ARLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVLEVDPS 122

Query: 122 LVFFVVDNLFGGDGRFHTRVEGRDFTATEQRIIGKLLNLVFEHYATAWKSVRPLQFEFVR 181
+ F ++D LFGG G+ RD T E ++ ++ + + +W V L+ +
Sbjct: 123 ITFSIIDRLFGGTGQAAKVQ--RDLTDIENSVMEGVIVRILANVRESWTQVIDLRPRLGQ 180

Query: 182 SEMHTQFANVATPNEIVIVTQFSIEFGPTGGTLHICMPYSMIEPIRDVLSSPIQGEAL-- 239
E + QFA + P+E+V++ + G G ++ C+PY IEPI LSS ++
Sbjct: 181 IETNPQFAQIVPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFSSVRR 240

Query: 240 EVDRRWVRVLSQQVQSAEVELSANLAEIPSTFEKILNLRAGDVLPLE---IEDTITAKVD 296
+++ VL ++ + ++++ A + + + IL LR GD++ L + D +
Sbjct: 241 SSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFVLSIG 300

Query: 297 GVPVMECGYGIFNGQYALRVQKMI 320
C G+ + A ++ + I
Sbjct: 301 NRKKFLCQPGVVGKKIAAQILERI 324


47  Bamb_0045  Bamb_0055  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Bamb_0045  1       11       1.591644   general secretion pathway protein J
Bamb_0046  1       10       1.218192   general secretion pathway protein I
Bamb_0047  0       8        0.711078   general secretion pathway protein H
Bamb_0048  -1      8        -0.148283  general secretion pathway protein G
Bamb_0049  0       7        -0.127604  general secretion pathway protein C
Bamb_0050  -1      8        -0.564652  general secretion pathway protein F
Bamb_0051  -1      9        0.342575   general secretory pathway protein E
Bamb_0052  -1      8        -0.301053  general secretion pathway protein D
Bamb_0053  0       9        -0.915094  lytic transglycosylase catalytic subunit
Bamb_0054  -1      9        -0.513192  cobalamin synthesis protein, P47K
Bamb_0055  -2      9        0.156797   histone family protein DNA-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0045  BCTERIALGSPG  37     2e-05    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 36.8 bits (85), Expect = 2e-05
Identities = 18/62 (29%), Positives = 32/62 (51%), Gaps = 5/62 (8%)

Query: 21 SRRVRGFTLIELMIAIAILAVVAILAWRGLDQIMRGRDK--VASAMEDERVFAQMFDQMR 78
+ + RGFTL+E+M+ I I+ V+A L + +M ++K A+ D D +
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLV---VPNLMGNKEKADKQKAVSDIVALENALDMYK 60

Query: 79 ID 80
+D
Sbjct: 61 LD 62


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0046  BCTERIALGSPH  28     0.007    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 28.4 bits (63), Expect = 0.007
Identities = 13/55 (23%), Positives = 26/55 (47%), Gaps = 8/55 (14%)

Query: 13 RGFTMIEVLVALAIIAVALAASIRAVGTMANNASDLHHRLLAGWSADNALAQLRL 67
RGFT++E+++ L ++ V+ + A +++ A + AQLR
Sbjct: 4 RGFTLLEMMLILLLMGVSAGMVLLAFPASRDDS--------AAQTLARFEAQLRF 50


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0047  BCTERIALGSPH  47     4e-09    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 46.9 bits (111), Expect = 4e-09
Identities = 18/84 (21%), Positives = 30/84 (35%), Gaps = 10/84 (11%)

Query: 34 RARGFTLLEMLVVLVIAGLLVSLASLSLTRNPRTDLREEAQRIALLFETAGDEAQVRARP 93
R RGFTLLEM+++L++ G+ + L+ + + R +
Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTGQF 61

Query: 94 IAWQPTAHGFRFDVSSPDGWRTLR 117
G PD W+ L
Sbjct: 62 F-------GVSVH---PDRWQFLV 75


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0048  BCTERIALGSPG  188    1e-64    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 188 bits (478), Expect = 1e-64
Identities = 66/139 (47%), Positives = 92/139 (66%), Gaps = 3/139 (2%)

Query: 11 AVRRQRGFTLIEIMVVVAILGILAALIVPKIMSRPDEARRIAAKQDIGTIMQALKLYRLD 70
A +QRGFTL+EIMVV+ I+G+LA+L+VP +M ++A + A DI + AL +Y+LD
Sbjct: 3 ATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLD 62

Query: 71 NGRYPTQEQGLNALIQKPSTDPIPNNWKDGGYLERLPNDPWGNGYKYLNPGVHGEIDVFS 130
N YPT QGL +L++ P+ P+ N+ GY++RLP DPWGN Y +NPG HG D+ S
Sbjct: 63 NHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDLLS 122

Query: 131 YGADGKEGGESNDSDIGSW 149
G DG+ G E DI +W
Sbjct: 123 AGPDGEMGTE---DDITNW 138


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0050  BCTERIALGSPF  380    e-132    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 380 bits (978), Expect = e-132
Identities = 166/406 (40%), Positives = 262/406 (64%), Gaps = 2/406 (0%)

Query: 1 MPAFRFEAIDSAGRPQKGVIDADSARGARGQLRTQGLTPLVVEPAASATRGARSQRLAFG 60
M + ++A+D+ G+ +G +ADSAR AR LR +GL PL V+ + + S L+
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 61 R--KLSQREQAILTRQLASLLIAGLPLDEALGVLTEQAERDYIRELMAAIRAEVLGGHSL 118
R +LS + A+LTRQLA+L+ A +PL+EAL + +Q+E+ ++ +LMAA+R++V+ GHSL
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 119 ANALGQHPRDFPEIYRALVAAGEHTGKLGIVLSRLADYIEQSNALKQKILLAFTYPGIVT 178
A+A+ P F +Y A+VAAGE +G L VL+RLADY EQ ++ +I A YP ++T
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180

Query: 179 LIAFGIVTFLLSYVVPQVVNVFASTKQQLPILTVVMMALSEFVRHWWWAILITVALVVWF 238
++A +V+ LLS VVP+VV F KQ LP+ T V+M +S+ VR + +L+ +
Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240

Query: 239 VKATLSRDGPRLAFDRWVLTAPLAGKLVRGYNTVRFASTLGILTAAGVPILRALQAAGET 298
+ L ++ R++F R +L PL G++ RG NT R+A TL IL A+ VP+L+A++ +G+
Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300

Query: 299 LSNRAMRANIDDAIVRVREGSALSRALNNVKTFPPVLVHLIRSGEATGDVTTMLDRAAEG 358
+SN R + A VREG +L +AL FPP++ H+I SGE +G++ +ML+RAA+
Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360

Query: 359 EARELERRTMFLTSLLEPLLILAMGGIVLVIVLAVMLPIIELNNMV 404
+ RE + L EPLL+++M +VL IVLA++ PI++LN ++
Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0052  BCTERIALGSPD  394    e-129    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 394 bits (1013), Expect = e-129
Identities = 201/692 (29%), Positives = 325/692 (46%), Gaps = 87/692 (12%)

Query: 13 TTLIVAGIIVSQAAYAQVTLNFVNADIDQVAKAIGAATGKTIIVDPRVKGQLNLVSERSV 72
T LI A ++ AA + + +F DI + + KT+I+DP V+G + + S +
Sbjct: 13 TLLIFAALLFRPAAAEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDML 72

Query: 73 PEDQALKTLQSALRMQGFALV-QDHGVLKVVPEADAKLQGVPTYIGNAPRARGDQVITQV 131
E+Q + S L + GFA++ ++GVLKVV DAK VP +A GD+V+T+V
Sbjct: 73 NEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPV-ASDAAPGIGDEVVTRV 131

Query: 132 FELHNESANNLLPVLRPLI--SPNNTVTAYPANNTIVVTDYADNVRRIAQIIAGVDSAAG 189
L N +A +L P+LR L + +V Y +N +++T A ++R+ I+ VD+A
Sbjct: 132 VPLTNVAARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDNAGD 191

Query: 190 AQVQVIPLRNANAIDLAAQLQKMLDPGAIGNSDATLKVSVTADPRTNSLMLRASSASRLA 249
V +PL A+A D+ + ++ + ++ +V AD RTN++++ SR
Sbjct: 192 RSVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNSR-Q 250

Query: 250 AAKRIVQQLDAPSGVPGNMHVVPLRNADAVKLAKTLRGMLGKGGNDSGSSASSNDANSFN 309
+++QLD GN V+ L+ A A L +
Sbjct: 251 RIIAMIKQLDRQQATQGNTKVIYLKYAKASDLVEV------------------------- 285

Query: 310 QSGGSSASGNFSTGTSGTPPLPSGGLGGGSSSSSAYGSGSSGSGGVGSGGLLGGDKDKSD 369
L G SS+ + + +
Sbjct: 286 -------------------------LTGISSTMQSEKQAAKPVAALDKNI---------- 310

Query: 370 DNQPGGMIQADAATNSLIITASDPVYRNLRSVIDQLDARRAQVYIEALIVELNSTTQGNL 429
+I+A TN+LI+TA+ V +L VI QLD RR QV +EA+I E+ NL
Sbjct: 311 ------IIKAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNL 364

Query: 430 GIQWQVASGQFLGGTNLAPTAGTGLGNSIVNLTSGG-TAATTGLAANLAGLSQGLNIGWL 488
GIQW + TN T + + G +++ ++ G++ G
Sbjct: 365 GIQWANKNAGMTQFTNSGLPISTAIAGANQYNKDGTVSSSLASALSSFNGIAAG------ 418

Query: 489 HNMFGVQGLGALLQYFAGVSDANVLSTPNLITLDNEEAKIVVGQNVPIATGSYSNLTSGT 548
F LL + + ++L+TP+++TLDN EA VGQ VP+ TGS + +
Sbjct: 419 ---FYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTGS----QTTS 471

Query: 549 TSNAFNTYDRRDVGLTLHVKPQITDGGILKLQLYTEDSAV--VSGTTNAQTGPTFTKRSI 606
N FNT +R+ VG+ L VKPQI +G + L++ E S+V + +T++ G TF R++
Sbjct: 472 GDNIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSVADAASSTSSDLGATFNTRTV 531

Query: 607 QSTILADNGEIIVLGGLMQDNYQVSNSKVPLLGDIPWIGQLFRSESKQRQKTNLMVFLRP 666
+ +L +GE +V+GGL+ + + KVPLLGDIP IG LFRS SK+ K NLM+F+RP
Sbjct: 532 NNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRP 591

Query: 667 VIISDRSTAQEVTANRYDYIQGVTGAYKSDNN 698
+I DR ++ ++ +Y + N
Sbjct: 592 TVIRDRDEYRQASSGQYTAFNDAQSKQRGKEN 623


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0055  DNABINDINGHU  108    1e-34    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 108 bits (271), Expect = 1e-34
Identities = 45/88 (51%), Positives = 58/88 (65%)

Query: 2 NKQELIDAVAAQTGASKAQTGETLDTLLEVIKKAVSKGDAVQLIGFGSFGSGKRAARTGR 61
NKQ+LI VA T +K + +D + + ++KG+ VQLIGFG+F +RAAR GR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 62 NPKTGETIKIPAAKTVKFTAGKAFKDAV 89
NP+TGE IKI A+K F AGKA KDAV
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAV 90


48  Bamb_0108  Bamb_0115  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Bamb_0108  -2      8        1.506160   TetR family transcriptional regulator
Bamb_0109  -1      9        1.594919   aminotransferase
Bamb_0110  -1      8        2.190595   extracellular solute-binding protein
Bamb_0111  0       9        2.447836   FAD linked oxidase domain-containing protein
Bamb_0112  0       9        1.853863   PadR-like family transcriptional regulator
Bamb_0113  0       10       1.357638   MerR family transcriptional regulator
Bamb_0114  -1      12       1.457728   heavy metal translocating P-type ATPase
Bamb_0115  -2      14       1.452013   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0108  HTHTETR       64     1e-14    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 63.9 bits (155), Expect = 1e-14
Identities = 29/142 (20%), Positives = 50/142 (35%), Gaps = 2/142 (1%)

Query: 26 RPRQSRAQATSDALQQAFVQLLLERGYAKATIREIAAVAGVSIGTFYEYFGDKQSLAALC 85
R + AQ T + ++L ++G + ++ EIA AGV+ G Y +F DK L +
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 86 IHRRVQALADRLRDAAHGLGGAPRAELAAALVDVQVDAI--GADAALWGAFFALERQVSP 143
+ + + G P + L L+ V + L F V
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 144 LAAYRRHYDAYVALWRDAFAQA 165
+A ++ D Q
Sbjct: 123 MAVVQQAQRNLCLESYDRIEQT 144


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0112  RTXTOXIND     32     0.002    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 31.7 bits (72), Expect = 0.002
Identities = 12/70 (17%), Positives = 27/70 (38%)

Query: 117 DAEDARHQLERRIAALEAERERLESLRNAAQNDQVPRLFLLQNEHALVLLNAELDWARSV 176
+ ++ R E+ RL+ + + + +L+ E+ V EL +S
Sbjct: 215 ERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQ 274

Query: 177 VEHLKIGALR 186
+E ++ L
Sbjct: 275 LEQIESEILS 284


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0113  YERSSTKINASE  27     0.040    Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 26.6 bits (58), Expect = 0.040
Identities = 12/31 (38%), Positives = 20/31 (64%)

Query: 81 NSLLDEHIGHVDARLAELTHLRDQLTELRRQ 111
+SL+DEH+ +L ELT + ++L L R+
Sbjct: 700 SSLMDEHLVEQREKLRELTTIAERLNRLERE 730


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0115  cloacin       27     0.020    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 27.4 bits (60), Expect = 0.020
Identities = 12/23 (52%), Positives = 14/23 (60%)

Query: 9 GGHGRGHGGGHGGGGHGDGHHGG 31
GG G G+GGG+G G G G G
Sbjct: 58 GGSGHGNGGGNGNSGGGSGTGGN 80



Score = 26.2 bits (57), Expect = 0.050
Identities = 12/27 (44%), Positives = 13/27 (48%)

Query: 10 GHGRGHGGGHGGGGHGDGHHGGAGREG 36
G G G GHG GG GG+G G
Sbjct: 53 GIHWGGGSGHGNGGGNGNSGGGSGTGG 79


49  Bamb_0148  Bamb_0156  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Bamb_0148  1       10       1.625893   hypothetical protein
Bamb_0149  1       12       1.530518   NAD-dependent epimerase/dehydratase
Bamb_0150  0       11       1.419823   methyltransferase
Bamb_0151  0       12       0.903878   hypothetical protein
Bamb_0152  -1      13       0.088935   hypothetical protein
Bamb_0153  0       12       -0.558566  hypothetical protein
Bamb_0154  1       14       -0.483498  hypothetical protein
Bamb_0155  2       12       -0.499453  flagellar hook-associated 2 domain-containing
Bamb_0156  -1      12       -0.161744  flagellin
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0148  SYCDCHAPRONE  46     7e-08    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 46.1 bits (109), Expect = 7e-08
Identities = 15/91 (16%), Positives = 32/91 (35%), Gaps = 1/91 (1%)

Query: 38 ALAHHQADRLEEAETLYRRILDAEPRHADALHLLGLIGHQYGRYHEATELIMAAIEIKP- 96
A +Q+ + E+A +++ + + + LG G+Y A +
Sbjct: 43 AFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIK 102

Query: 97 DATYYYNLGNVMQANNRPAAAAECFRLAIEL 127
+ + ++ + A A LA EL
Sbjct: 103 EPRFPFHAAECLLQKGELAEAESGLFLAQEL 133


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0149  NUCEPIMERASE  183    9e-58    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 183 bits (466), Expect = 9e-58
Identities = 81/336 (24%), Positives = 143/336 (42%), Gaps = 40/336 (11%)

Query: 5 SVLVTGGAGFLGSHLCERLVHAGYDVMCVDNFHTGSKRNIEH----LIGQVNFEVIRHDV 60
LVTG AGF+G H+ +RL+ AG+ V+ +DN + +++ L+ Q F+ + D+
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 61 W-------LPLYVEADRVFNMACPASPVHYQ-SDPVSTVKTAVLGAINMLGLAKRCG-AR 111
L +RVF + V Y +P + + + G +N+L +
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLA-VRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 112 ILQASTSEVYGDAQQHPQQESYWGNVN-PNGLRACYDEGKRCAETLFFDYHRQHGVDIRV 170
+L AS+S VYG ++ P +V+ P L Y K+ E + Y +G+
Sbjct: 121 LLYASSSSVYGLNRKMPFSTD--DSVDHPVSL---YAATKKANELMAHTYSHLYGLPATG 175

Query: 171 VRIFNTYGPRMRADDGRVVSNFIMQALRGEPITLYGDGSQTRSFCYVDDLVEGLLRMM-- 228
+R F YGP R D + F L G+ I +Y G R F Y+DD+ E ++R+
Sbjct: 176 LRFFTVYGPWGRPD--MALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDV 233

Query: 229 ----NQDDDTGP------------INLGNPSEITIRELAECVLRLTGSKSRIEYRPLPAD 272
+ N+GN S + + + + + G +++ PL
Sbjct: 234 IPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQPG 293

Query: 273 DPLQRRPDIGRARQRLDWQPGIALEDGLKETIAHFR 308
D L+ D + + + P ++DG+K + +R
Sbjct: 294 DVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYR 329


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0151  SYCDCHAPRONE  42     2e-06    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 42.2 bits (99), Expect = 2e-06
Identities = 20/98 (20%), Positives = 34/98 (34%), Gaps = 1/98 (1%)

Query: 38 PDAMHFLGLLACQLKQYDAGLALMERSLAARP-DASYFNNLGNMLRECGRLDDAIAHYRR 96
+ ++ L Q +Y+ + + D+ +F LG + G+ D AI Y
Sbjct: 36 LEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSY 95

Query: 97 AVALRPDYPEAHNNLGNALRDARDPAEAMQSCSRAIEL 134
+ P + L + AEA A EL
Sbjct: 96 GAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQEL 133



Score = 33.0 bits (75), Expect = 0.003
Identities = 18/109 (16%), Positives = 30/109 (27%), Gaps = 3/109 (2%)

Query: 237 SDDASLHNNYAGVLRDAGDLDAAAAHYARAIALDASLAAAHANLSGVRRRQARYAQALVH 296
SD + A +G + A + LD + L R+ +Y A+
Sbjct: 33 SDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHS 92

Query: 297 AQEAIRIAPHLADAHNQAGNAHHGLGDLVAAQACYRTALEL---NPADS 342
+ A G+L A++ A EL
Sbjct: 93 YSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTEFK 141



Score = 32.2 bits (73), Expect = 0.005
Identities = 20/96 (20%), Positives = 31/96 (32%), Gaps = 5/96 (5%)

Query: 138 YAEAYNNLGNVLQDLGELDAAAASYGKAIAFHPAYAEAHSNLGNVLRTQERHADAIVHYR 197
Y+ A+N G+ + A + + LG + ++ AI Y
Sbjct: 40 YSLAFN-----QYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYS 94

Query: 198 RAIELSPALPAACHGLGLSLWALGELSEAVSVLGAA 233
+ P L GEL+EA S L A
Sbjct: 95 YGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLA 130


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0152  SYCDCHAPRONE  52     9e-10    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 51.9 bits (124), Expect = 9e-10
Identities = 19/86 (22%), Positives = 39/86 (45%)

Query: 285 QQGEYEESLRLCRHAIELDPELADAYNFLGLAYHNLDRMAASELSHRHAIDLNPDDADAH 344
Q G+YE++ ++ + LD + + LG + + + S+ + ++ +
Sbjct: 48 QSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFP 107

Query: 345 HNLAAALFRLDKLDEAMSEYRIAQEL 370
+ A L + +L EA S +AQEL
Sbjct: 108 FHAAECLLQKGELAEAESGLFLAQEL 133



Score = 40.7 bits (95), Expect = 5e-06
Identities = 24/133 (18%), Positives = 43/133 (32%), Gaps = 10/133 (7%)

Query: 3 DIQQALQEALTHHQAGRLGEAKTLYDAILHAQPGQPDAMHFLGLLACQLKQYDAGLALME 62
+ Q A++ L T+ + + ++ L Q +Y+ + +
Sbjct: 10 EYQLAMESFL--------KGGGTIAM-LNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQ 60

Query: 63 RSLAERP-DASYFNNLGNMLRECGRLDDAIAHYRRAVALRPDYPEAHNNLGNALRDARDP 121
D+ +F LG + G+ D AI Y + P + L +
Sbjct: 61 ALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGEL 120

Query: 122 AEAMRSCSRAIEL 134
AEA A EL
Sbjct: 121 AEAESGLFLAQEL 133



Score = 31.1 bits (70), Expect = 0.010
Identities = 14/103 (13%), Positives = 33/103 (32%)

Query: 301 ELDPELADAYNFLGLAYHNLDRMAASELSHRHAIDLNPDDADAHHNLAAALFRLDKLDEA 360
E+ + + L + + + + L+ D+ L A + + D A
Sbjct: 30 EISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLA 89

Query: 361 MSEYRIAQELGVDPVKIQLTLGDILWAKRDFSGAVAAFREAVE 403
+ Y + + + + L K + + A + A E
Sbjct: 90 IHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQE 132


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0153  SYCDCHAPRONE  36     3e-04    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.
Length = 168

Score = 35.7 bits (82), Expect = 3e-04
Identities = 20/105 (19%), Positives = 43/105 (40%), Gaps = 9/105 (8%)

Query: 10 NAAFVHHQAGRFDDARMLYEAIRRDEPDQPDATHFLGLLAC--QLGQFPAGLALMERAIA 67
+ AF +Q+G+++DA +++A+ + FLGL AC +GQ+ +
Sbjct: 41 SLAFNQYQSGKYEDAHKVFQALCVLDHYDSR--FFLGLGACRQAMGQYDLAIHSYSYGAI 98

Query: 68 LRA-DPVYLNNFGNMLRAHGRLGDAIGAYRRAIALAPDYAEAHSN 111
+ +P + + + G+ A + LA + +
Sbjct: 99 MDIKEPRFPFHAAE---CLLQKGELAEA-ESGLFLAQELIADKTE 139



Score = 31.1 bits (70), Expect = 0.011
Identities = 18/110 (16%), Positives = 35/110 (31%)

Query: 127 LSCAQALALRPDYAPAFNNLGNALQDKGELDAAARAYEKAIALDPGYAQARFNQGNVLRA 186
+ A + D +L G+ + A + ++ LD ++ G +A
Sbjct: 23 GTIAMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQA 82

Query: 187 QRRPDEAIACYREAIALQPHLHAAHHALGVLLFERDDREAAIASLTRAAE 236
+ D AI Y + L ++ + A + L A E
Sbjct: 83 MGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQE 132


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Bamb_0156  FLAGELLIN  98     2e-25    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 98.2 bits (244), Expect = 2e-25
Identities = 96/268 (35%), Positives = 134/268 (50%), Gaps = 4/268 (1%)

Query: 2 LNINTNILSLTTQTNLSGSQSALSQAINRLSSGKRVNTAADDAAGLAISTTQTAAINALT 61
INTN LSL TQ NL+ SQS+LS AI RLSSG R+N+A DDAAG AI+ T+ I LT
Sbjct: 2 QVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLT 61

Query: 62 QGVSNANNGISMIQTAAGALQSTVDNLQRIRTLAVESGDGSLDSNARANLQAEVTTRLGE 121
Q NAN+GIS+ QT GAL +NLQR+R L+V++ +G+ + ++Q E+ RL E
Sbjct: 62 QASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEE 121

Query: 122 IDRVATQTTFNGQTILSNAGNVTFQVGASANQTVAVNFGATVWTSTGAGLSL----SGLT 177
IDRV+ QT FNG +LS + QVGA+ +T+ ++ S G T
Sbjct: 122 IDRVSNQTQFNGVKVLSQDNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEAT 181

Query: 178 VSDQTSAQSAITAIDTALKNVNTFQATLGAAQNTFQAAITTTQTQATNMSAARSQITDAD 237
V D S+ +T DT N ++ + + T + +A TD
Sbjct: 182 VGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDA 241

Query: 238 FATETANLSKAQVLQQAGISVLAQANSL 265
+L K A A ++
Sbjct: 242 ENNTAVDLFKTTKSTAGTAEAKAIAGAI 269



Score = 71.2 bits (174), Expect = 4e-16
Identities = 69/261 (26%), Positives = 116/261 (44%), Gaps = 2/261 (0%)

Query: 14 QTNLSGSQSALSQAINRLSSGKRVNTAADDAAGLAISTTQTAAINALTQGVSNANNGISM 73
+ S + ++A + K T T N VS NG +
Sbjct: 249 LFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTTINGEKV 308

Query: 74 IQTAAGALQSTVDNLQRIRTLAVESGDGSLDSNARANLQAEVTTRLGEIDRVATQTTFNG 133
T A + + ++ + + + + ++ + G
Sbjct: 309 TLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNES--AKLSDLEANNAVKG 366

Query: 134 QTILSNAGNVTFQVGASANQTVAVNFGATVWTSTGAGLSLSGLTVSDQTSAQSAITAIDT 193
++ ++ G A T+A T++G ++ + + S + + +ID+
Sbjct: 367 ESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPLASIDS 426

Query: 194 ALKNVNTFQATLGAAQNTFQAAITTTQTQATNMSAARSQITDADFATETANLSKAQVLQQ 253
AL V+ +++LGA QN F +AIT TN+++ARS+I DAD+ATE +N+SKAQ+LQQ
Sbjct: 427 ALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQILQQ 486

Query: 254 AGISVLAQANSLPQQVLKLLQ 274
AG SVLAQAN +PQ VL LL+
Sbjct: 487 AGTSVLAQANQVPQNVLSLLR 507


50  Bamb_0168  Bamb_0175  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Bamb_0168   1      13        2.306349  response regulator receiver protein
Bamb_0169   1      12        2.388461  CheA signal transduction histidine kinase
Bamb_0170   1      13        2.009002  CheW protein
Bamb_0171   2      13        2.516897  methyl-accepting chemotaxis sensory transducer
Bamb_0172   0      15        2.398122  chemotaxis protein CheR
Bamb_0173  -2      15        1.802576  chemoreceptor glutamine deamidase CheD
Bamb_0174  -2      15        1.438916  chemotaxis-specific methylesterase
Bamb_0175  -2      13        0.654156  response regulator receiver protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Bamb_0168  HTHFIS  85     1e-22    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 84.9 bits (210), Expect = 1e-22
Identities = 36/120 (30%), Positives = 61/120 (50%), Gaps = 2/120 (1%)

Query: 4 TILAIDDSATMRALLQATLAQAGYDVTVAPDGEAGFDMAATVPYDLVLTDQNMPRRSGLE 63
TIL DD A +R +L L++AGYDV + + + A DLV+TD MP + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 64 VIAALRKLSAYADTPILVLTTEGSDAFKDAAREAGATGWIEKPIDPAVLVDLVATLSEQT 123
++ ++K A D P+LV++ + + A E GA ++ KP D L+ ++ +
Sbjct: 65 LLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Bamb_0169  PF06580  47     2e-07    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 46.8 bits (111), Expect = 2e-07
Identities = 21/151 (13%), Positives = 49/151 (32%), Gaps = 52/151 (34%)

Query: 446 ELDKSLIERIIDPLT--HLVRNSLDHGIETVDKRVAAGKDAVGQLVLSAAHHGGNIVIEV 503
+++ ++++ + P+ LV N + HGI G+++L G + +EV
Sbjct: 245 QINPAIMDVQVPPMLVQTLVENGIKHGIA--------QLPQGGKILLKGTKDNGTVTLEV 296

Query: 504 SDDGGGLNRERILAKAAKQGMQVSDNISDDEVWQLIFAPGFSTAETVTDVSGRGVGMDVV 563
+ G + G G+ V
Sbjct: 297 ENTGSLALKNTK--------------------------------------ESTGTGLQNV 318

Query: 564 KRNIQSMGG---HVEISSQAGRGTTTRIVLP 591
+ +Q + G +++S + G +++P
Sbjct: 319 RERLQMLYGTEAQIKLSEKQG-KVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
Bamb_0171  IGASERPTASE  37     2e-04    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 37.4 bits (86), Expect = 2e-04
Identities = 32/198 (16%), Positives = 63/198 (31%), Gaps = 24/198 (12%)

Query: 401 EVRSLAQRSASAAKEIKQLIGDSAERVESGSALVARAGSTMDEIVQAVRRVTDIMGEISA 460
+V S+ + A+ + + A S + S +
Sbjct: 1006 DVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNE--------QD 1057

Query: 461 ASDEQSTGIEQVNRAVGQMDSVTQQNAALVEEAAAAAASLEEQTRQMKAIVSSWRVTGGI 520
A++ + E A + + TQ N E A + + + E QT + K
Sbjct: 1058 ATETTAQNREVAKEAKSNVKANTQTN----EVAQSGSETKETQTTETKE----------- 1102

Query: 521 AMAPVRGAARPAAQTQAPASPSESRHEAAPVAHAPQAAQPAAQPARRAAPAPHAAAPATR 580
A V + +T+ + + +P + QP A+PAR P + P ++
Sbjct: 1103 -TATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQ 1161

Query: 581 NAAKPATDSAARASKDAP 598
T+ A+ +
Sbjct: 1162 TNTTADTEQPAKETSSNV 1179


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0173  BACYPHPHTASE  31     0.007    Salmonella/Yersinia modular tyrosine phosphatase si...
		>BACYPHPHTASE#Salmonella/Yersinia modular tyrosine phosphatase signature.
Length = 468

Score = 30.5 bits (68), Expect = 0.007
Identities = 26/75 (34%), Positives = 35/75 (46%), Gaps = 7/75 (9%)

Query: 176 TEREAALAREADRVRAGRQRAHVELFAAKRPAAPPPARPRIELFGARGAGGAQTTNAKTG 235
+ +AL VR G R+H++ + P PP RP G GAG A+ T T
Sbjct: 133 SHSHSALHAPGTPVREG-LRSHLD---PRTPPLPPRERPHTS--GHHGAGEARATAPSTV 186

Query: 236 SPYAGSPSAANLSRK 250
SPY G + A LS +
Sbjct: 187 SPY-GPEARAELSSR 200


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Bamb_0174  HTHFIS  69     7e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 68.7 bits (168), Expect = 7e-15
Identities = 34/146 (23%), Positives = 65/146 (44%), Gaps = 15/146 (10%)

Query: 1 MQKIKVLCVDDSALIRSLMTEIINSQP-DMTVCATAPDPLVARDLIKQHNPDVLTLDVEM 59
M +L DD A IR+++ + ++ D+ + + A I + D++ DV M
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAAT---LWRWIAAGDGDLVVTDVVM 57

Query: 60 PRMDGLDFLEKLMRLRP-MPVVMVSSLTERGSEITLRALELGAVDFVTKPRVGIRDGMLD 118
P + D L ++ + RP +PV+++S+ ++A E GA D++ KP D
Sbjct: 58 PDENAFDLLPRIKKARPDLPVLVMSAQNT--FMTAIKASEKGAYDYLPKP--------FD 107

Query: 119 YAEKLADKIRAASRARVRQTPQPQAA 144
E + RA + + R + +
Sbjct: 108 LTELIGIIGRALAEPKRRPSKLEDDS 133


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Bamb_0175  HTHFIS  88     1e-23    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 88.3 bits (219), Expect = 1e-23
Identities = 33/110 (30%), Positives = 53/110 (48%), Gaps = 4/110 (3%)

Query: 1 MDKSMKILVVDDFPTMRRIVRNLLKELGYTNVDEAEDGAAGLARLRGGGFDFVISDWNMP 60
M + ILV DD +R ++ L GY V + A + G D V++D MP
Sbjct: 1 MTGA-TILVADDDAAIRTVLNQALSRAGYD-VRITSNAATLWRWIAAGDGDLVVTDVVMP 58

Query: 61 NLDGLAMLKEIRADATLTHLPVLMVTAESKKENIIAAAQAGASGYVVKPF 110
+ + +L I+ LPVL+++A++ I A++ GA Y+ KPF
Sbjct: 59 DENAFDLLPRIKKA--RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPF 106


51  Bamb_0309  Bamb_0317  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Bamb_0309  -2       9        0.819271  hypothetical protein
Bamb_0310  -3       8       -0.081657  type IV pilus secretin PilQ
Bamb_0311  -3      10       -1.392627  shikimate kinase
Bamb_0312  -2      12       -0.543079  3-dehydroquinate synthase
Bamb_0313  -3      13       -0.061506  deoxyguanosinetriphosphate
Bamb_0314  -3      11       -0.491376  glycerol-3-phosphate transporter periplasmic
Bamb_0315  -2      11       -0.120675  binding-protein-dependent transport system inner
Bamb_0316  -1      14       -0.778005  glycerol-3-phosphate transporter membrane
Bamb_0317  -1      14       -0.715375  glycerol-3-phosphate transporter ATP-binding
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Bamb_0309  PERTACTIN  31     0.008    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 30.8 bits (69), Expect = 0.008
Identities = 26/92 (28%), Positives = 34/92 (36%)

Query: 189 ALPGARLPGGAAPMLAGASGDDPFGGAGSLPVADDAVPRLAGTIRDARAGLALFDAGDGG 248
A+PG +PGG P+L G G D L + P+L IR R G
Sbjct: 273 AVPGGAVPGGFGPLLDGWYGVDVSDSTVDLAQSIVEAPQLGAAIRAGRGARVTVSGGSLS 332

Query: 249 FATVARGEALGAARVMRVEADAVTLATADGAR 280
E G AR A +++ GAR
Sbjct: 333 APHGNVIETGGGARRFPPPASPLSITLQAGAR 364


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0310  BCTERIALGSPD  209    8e-62    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.
Length = 660

Score = 209 bits (532), Expect = 8e-62
Identities = 106/438 (24%), Positives = 178/438 (40%), Gaps = 46/438 (10%)

Query: 133 GLAAAFDALARFTGLNIIVGEQVRGTVTLRLNNVRWRDAFDTLLDTHGLAMSRRGNVIWV 192
G AA L G++ TV L W A D + L + +
Sbjct: 171 GRAAVIKRLLTIVERVDNAGDRSVVTVPLS-----WASAADVVKLVTELNKDTSKSALPG 225

Query: 193 TPAAELAARERERF-------ETHAR-AAELEPL--------ASRTFALHYPRALDVQRL 236
+ A + A ER + R A ++ L ++ L Y +A D+ +
Sbjct: 226 SMVANVVADERTNAVLVSGEPNSRQRIIAMIKQLDRQQATQGNTKVIYLKYAKASDLVEV 285

Query: 237 L-----------AGATGQRLLSKRGAAAADPRTNLLFVTDLAPRIAQIAGLIDAIDRPSR 285
L A L K A +TN L VT + + +I +D
Sbjct: 286 LTGISSTMQSEKQAAKPVAALDKNIIIKAHGQTNALIVTAAPDVMNDLERVIAQLDIRRP 345

Query: 286 QVRIEARIVEGEQGFSRNLGARIALRAQGRAP---TADGAASATDTRNALDLAARPLGGF 342
QV +EA I E + NLG + A + G + ++A N +
Sbjct: 346 QVLVEAIIAEVQDADGLNLGIQWANKNAGMTQFTNSGLPISTAIAGANQYNKDGTVSSSL 405

Query: 343 EAATAGFTLFAA-PLSRVLDVELSALEAQGRGQIVSSPRVVTADRVKAIVEQGSELPYQ- 400
+A + F AA + L+AL + + I+++P +VT D ++A G E+P
Sbjct: 406 ASALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLT 465

Query: 401 ----AKVGNGVSGVQFRRATLKLEVEPQITPDGRVVLDLDVTKDSIGEPTAA-----GPA 451
N + V+ + +KL+V+PQI V+L+++ S+ + ++ G
Sbjct: 466 GSQTTSGDNIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSVADAASSTSSDLGAT 525

Query: 452 IHTKHVQTRVEVENGGTVAIGGIYEQLNRDDVTRVPLLGKIPVLGALFRHRARRDQRSEL 511
+T+ V V V +G TV +GG+ ++ D +VPLLG IPV+GALFR +++ + L
Sbjct: 526 FNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNL 585

Query: 512 VVFITPTVVGTQCATSSA 529
++FI PTV+ + A
Sbjct: 586 MLFIRPTVIRDRDEYRQA 603



Score = 67.6 bits (165), Expect = 5e-14
Identities = 49/292 (16%), Positives = 109/292 (37%), Gaps = 21/292 (7%)

Query: 126 SLNLQGAGLAAAFDALARFTGLNIIVGEQVRGTVTLR----LNNVRWRDAFDTLLDTHGL 181
S + +G + + +++ +I+ VRGT+T+R LN ++ F ++LD +G
Sbjct: 31 SASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDMLNEEQYYQFFLSVLDVYGF 90

Query: 182 AMSRRGNVIWVTPAAELAARERERFETHARAAELEPLASRTFALHYPRALDVQRLLAGAT 241
A+ N + ++ A + A + + +R L A D+ LL
Sbjct: 91 AVINMNNGVLKVVRSKDAKTAAVPVASDAAPGIGDEVVTRVVPLTNVAARDLAPLLRQLN 150

Query: 242 GQRLLSKRGAAAADPRTNLLFVTDLAPRIAQIAGLIDAIDRPSRQVRIEARIVEGEQGFS 301
+ G+ +N+L +T A I ++ +++ +D + + +
Sbjct: 151 DN---AGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDNAGDRSVVTVPLSWASAADV 207

Query: 302 RNLGARIALRAQGRA-PTADGAASATDTR-NALDLAARPLGGFEAATAGFTLFAAPLSRV 359
L + A P + A D R NA+ ++ P + + +
Sbjct: 208 VKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEP-----NSRQRII----AMIKQ 258

Query: 360 LDVELSALEAQGRGQIVSSPRVVTADRVKAIVEQGSELPYQAKVGNGVSGVQ 411
LD + + QG +++ +D V+ + S + + + V+ +
Sbjct: 259 LDRQQA---TQGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAKPVAALD 307


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0311  CARBMTKINASE  27     0.032    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 27.5 bits (61), Expect = 0.032
Identities = 15/39 (38%), Positives = 21/39 (53%)

Query: 66 ESQVIADLTQRENIVLATGGGAVLRAENRDCLKGHGIVI 104
E++ I L +R IV+A+GGG V +KG VI
Sbjct: 175 EAETIKKLVERGVIVIASGGGGVPVILEDGEIKGVEAVI 213


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Bamb_0314  MALTOSEBP  41     7e-06    Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 40.9 bits (95), Expect = 7e-06
Identities = 46/192 (23%), Positives = 76/192 (39%), Gaps = 15/192 (7%)

Query: 124 EKAFVPTIASYYSDA--KTGHLVSMPFNSSTPVLYYNKDAFKKAGLDPNQPPKTWADVQA 181
+KAF + + DA G L++ P L YNKD L PN PPKTW ++ A
Sbjct: 108 DKAFQDKLYPFTWDAVRYNGKLIAYPIAVEALSLIYNKD------LLPN-PPKTWEEIPA 160

Query: 182 DAEKLRKSGMTCGFTTGWQGWIQLENYSVWHALPFASRNNGFDGADAVLEFNKPQQIAHI 241
++L+ G + + + + F N +D D ++ + A +
Sbjct: 161 LDKELKAKGKSALMFNLQEPYFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAK--AGL 218

Query: 242 AFLQQMQKDGTFTYAGRKDEASAKFYSGDCGILTTSSGALANVQKFAKFSYGTGMMPYDA 301
FL + K+ A A F G+ + A +N+ +K +YG ++P
Sbjct: 219 TFLVDLIKNKHMNADTDYSIAEAAFNKGETAMTINGPWAWSNIDT-SKVNYGVTVLP--- 274

Query: 302 NVKGAPQNAIIG 313
KG P +G
Sbjct: 275 TFKGQPSKPFVG 286


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Bamb_0317  PF05272  36     2e-04    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 36.2 bits (83), Expect = 2e-04
Identities = 15/33 (45%), Positives = 19/33 (57%)

Query: 33 VVLVGPSGCGKSTLLRMIAGLETVTDGEIAIGD 65
VVL G G GKSTL+ + GL+ +D IG
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGT 631


52  Bamb_0608  Bamb_0614  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Bamb_0608  -1      13       -0.716113  major facilitator superfamily transporter
Bamb_0609  -1      13        0.462114  preprotein translocase subunit SecF
Bamb_0610   0      12        0.725843  preprotein translocase subunit SecD
Bamb_0611   0      12        0.905188  preprotein translocase subunit YajC
Bamb_0612   0      13        1.340496  queuine tRNA-ribosyltransferase
Bamb_0613   0      12        1.546236  S-adenosylmethionine--tRNA
Bamb_0614  -1      11        1.071223  ATP-dependent DNA helicase RecG
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Bamb_0608  TCRTETA  32     0.004    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 32.1 bits (73), Expect = 0.004
Identities = 75/368 (20%), Positives = 125/368 (33%), Gaps = 51/368 (13%)

Query: 70 FMRPLGAIVLGAYADRAGRKGALTLSILLMMAGTLVIAVLPTYGTIGVAAPLILVAARLM 129
M+ A VLGA +DR GR+ L +S+ ++A P +L R++
Sbjct: 54 LMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFL--------WVLYIGRIV 105

Query: 130 QGFSAGGEFGSATAFLAEHVPGR-RGFFASWQVASQGLTTLLAAGFGTVLNAQLTAEQMA 188
G + G A A++A+ G R + A G + G + M
Sbjct: 106 AGIT-GATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGL---------MG 155

Query: 189 AWGWRIPFFFGLLLGPVAYYI-------RSKVDETPEFLAAESTATPLR--DTFASHKAR 239
+ PFF L + + K + P A + R A
Sbjct: 156 GFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAAL 215

Query: 240 LVAAMGVVVLGTV-ATYLVLFMPTYGVKQLGLAPSAAFAAILVVGVIQ-----MAFAPLV 293
+ + ++G V A V+F G + + ++ G++ M P+
Sbjct: 216 MAVFFIMQLVGQVPAALWVIF----GEDRFHWDATTIGISLAAFGILHSLAQAMITGPVA 271

Query: 294 GHWSDRYGRVRVMIAPALGILVLIYPAFAYLVAHPGFGTLIALQVLLAFLMTGYFAALPG 353
+R + MIA G ++L + ++ + VLLA G AL
Sbjct: 272 ARLGERRALMLGMIADGTGYILLAFATRGWMA--------FPIMVLLASGGIG-MPALQA 322

Query: 354 LLSEVFPVQTRTTGMSLAYNVAVTIFGG-FGPFIIAWLIRATGMKTAPSFYLMFAAVLSL 412
+LS V G A+T GP + + A+ + T + + A L L
Sbjct: 323 MLSRQ--VDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAAS-ITTWNGWAWIAGAALYL 379

Query: 413 AALFVLRR 420
L LRR
Sbjct: 380 LCLPALRR 387



Score = 29.8 bits (67), Expect = 0.022
Identities = 19/77 (24%), Positives = 37/77 (48%), Gaps = 5/77 (6%)

Query: 240 LVAAMGVVVLGTVATYLVL-FMPTYGVKQLGLAPSAAFAAILVV---GVIQMAFAPLVGH 295
L+ + V L V L++ +P ++ L + +++ ++Q A AP++G
Sbjct: 7 LIVILSTVALDAVGIGLIMPVLPGL-LRDLVHSNDVTAHYGILLALYALMQFACAPVLGA 65

Query: 296 WSDRYGRVRVMIAPALG 312
SDR+GR V++ G
Sbjct: 66 LSDRFGRRPVLLVSLAG 82


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0609  SECFTRNLCASE  320    e-111    Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 320 bits (822), Expect = e-111
Identities = 97/320 (30%), Positives = 170/320 (53%), Gaps = 17/320 (5%)

Query: 1 MEFFRIRKDIPFMRHALVFNVISLVTFLAAVFFLFHRGLHLSVEFTGGTVIEVQYQQAAQ 60
++ + + F R ++V +A+V GL+ ++F GGT I + A
Sbjct: 5 LKLVPEKTNFDFFRWQWATFGAAIVMMIASVILPLVIGLNFGIDFKGGTTIRTESTTAID 64

Query: 61 LEPVRATLGKLGYADAQVQNFGTSR------NVLIRLQLKEGLTSAQQ--------SDQV 106
+ RA L L D + +IR+Q++E A+ ++V
Sbjct: 65 VGVYRAALEPLELGDVIISEVRDPSFREDQHVAMIRIQMQEDGQGAEGQGAQGQELVNKV 124

Query: 107 MTALKAQSPDVSLQRVEFVGPQVGRELATDGLLALACVVIGIVIYLSIRFEWKYAVAGII 166
TAL A P + + E VGP+V EL + +L + I+ Y+ +RFEW++A+ ++
Sbjct: 125 ETALTAVDPALKITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVV 184

Query: 167 ANLHDVVIILGFFAFFQWEFSLAVLAAILAVLGYSVNESVVIFDRIRETFRRERRMSVSE 226
A +HDV++ +G FA Q +F L +AA+L + GYS+N++VV+FDR+RE + + M + +
Sbjct: 185 ALVHDVLLTVGLFAVLQLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRD 244

Query: 227 VINHAITTTMSRTIITHTSTEMMVLSMFFFGGPTLHYFALALTVGIMFGIYSSVFVAGSL 286
V+N ++ T+SRT++T +T + ++ M +GG + F A+ G+ G YSSV+VA ++
Sbjct: 245 VMNLSVNETLSRTVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAKNI 304

Query: 287 AMWLGIKREDLIKEKKGAHD 306
+++G+ R KEKK D
Sbjct: 305 VLFIGLDRN---KEKKDPSD 321


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0610  SECFTRNLCASE  79     3e-18    Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 79.5 bits (196), Expect = 3e-18
Identities = 53/249 (21%), Positives = 108/249 (43%), Gaps = 14/249 (5%)

Query: 382 KGKGEVLTVATIQSELGDRFQITGQPTPQAAADLALLLRAGSLAAPMDIIEERTIGPSLG 441
+ V + E G + G + + L A A + E ++GP +
Sbjct: 91 REDQHVAMIRIQMQEDGQGAEGQGAQGQELVNKVETALTAVDPALKITSFE--SVGPKVS 148

Query: 442 ADNIKMGVHSVIWGFCAIAVFM-IAYYMLFGVISVIGLSVNLLLLVAVLSLMQATLTLPG 500
+ + V S++ I ++ + + F + +V+ L ++LL V + +++Q L
Sbjct: 149 GELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVVALVHDVLLTVGLFAVLQLKFDLTT 208

Query: 501 IAAIALALGMAIDSNVLINERVREELRA--GQPPQ----LAIQSGYAHAWATILDSNVTT 554
+AA+ G +I+ V++ +R+RE L P + L++ + T++ +TT
Sbjct: 209 VAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRDVMNLSVNETLSR---TVMTG-MTT 264

Query: 555 LIAGLALLAFGSGPVRAFAIVHCLGILTSMFSAVFFSRGLVNLWYGGRKKLKSLAIGQVW 614
L+A + +L +G +R F G+ T +S+V+ ++ +V R K K + +
Sbjct: 265 LLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAKNIVLFIGLDRNKEKKDPSDKFF 324

Query: 615 RPEGATAGA 623
GA GA
Sbjct: 325 S-NGAQDGA 332


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits  Score  E-value  Comments
Bamb_0614  SECA  35     0.002    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 34.8 bits (80), Expect = 0.002
Identities = 28/100 (28%), Positives = 40/100 (40%), Gaps = 6/100 (6%)

Query: 360 AQARVVDEIAHDLTLPHPMQRLLQGDV-----GSGKTVVAALAATQAIDAGYQAALMAPT 414
A RV D+ L M L + + G GKT+ A L A G ++
Sbjct: 74 ASKRVFGMRHFDVQLLGGM-VLNERCIAEMRTGEGKTLTATLPAYLNALTGKGVHVVTVN 132

Query: 415 EILAEQHARKLRAWLEPLGVSVAWLAGSLKAKEKRAAIEA 454
+ LA++ A R E LG++V + A KR A A
Sbjct: 133 DYLAQRDAENNRPLFEFLGLTVGINLPGMPAPAKREAYAA 172


53  Bamb_0682  Bamb_0692  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Bamb_0682  -1      14       -0.282513  short-chain dehydrogenase/reductase SDR
Bamb_0683   0      19       -0.016921  serine hydroxymethyltransferase
Bamb_0684   5      25        1.663008  transcriptional regulator NrdR
Bamb_0685   6      23        2.087949  Tfp pilus assembly protein FimT-like protein
Bamb_0686   4      21        1.697772  hypothetical protein
Bamb_0687   2      20        0.417935  prepilin-type cleavage/methylation-like protein
Bamb_0688  -2      18       -1.796140  hypothetical protein
Bamb_0689   0      10       -2.719133  Tfp pilus assembly protein PilE
Bamb_0690   2       9       -2.200384  hypothetical protein
Bamb_0691   1      10       -2.051731  membrane protein
Bamb_0692  -1      10       -1.866121  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0682  DHBDHDRGNASE  87     1e-22    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.
Length = 261

Score = 87.4 bits (216), Expect = 1e-22
Identities = 52/180 (28%), Positives = 85/180 (47%), Gaps = 4/180 (2%)

Query: 2 IVFVTGASAGFGAAIARAFVKGGHRVVATARRKDRLDAL-AAELGDALLP--FELDVRDR 58
I F+TGA+ G G A+AR G + A ++L+ + ++ +A F DVRD
Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDS 69

Query: 59 AAVEAVPAALPAEFAALDVLVNNAGLALGVEPAHKASLDEWQTMIDTNCSGLVTVTHALL 118
AA++ + A + E +D+LVN AG+ L H S +EW+ N +G+ + ++
Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGV-LRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 119 PGMIARGRGHIFNLGSVAGTYPYPGGNVYGATKAFVRQFSLNLRADLIGTPLRVTDIEPG 178
M+ R G I +GS P Y ++KA F+ L +L +R + PG
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0685  BCTERIALGSPH  27     0.036    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.
Length = 170

Score = 26.8 bits (59), Expect = 0.036
Identities = 16/62 (25%), Positives = 24/62 (38%), Gaps = 3/62 (4%)

Query: 17 GFTLVELMVAIAL---AASIGLFAAPAFNQWHMRERVDARSRALLGALSFARTEATRLGV 73
GFTL+E+M+ + L +A + L A PA + + L GV
Sbjct: 5 GFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTGQFFGV 64

Query: 74 RV 75
V
Sbjct: 65 SV 66


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0686  PRTACTNFAMLY  31     0.002    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 30.8 bits (69), Expect = 0.002
Identities = 32/115 (27%), Positives = 49/115 (42%), Gaps = 6/115 (5%)

Query: 7 SMRGTSLLEAVLAVALLAVVMLAVAGTQLAMTRAQRATIWRERALWLADARIERRRAAAG 66
++ A AV++L L + G + RA + + L A I R A AG
Sbjct: 207 NVTAVPASGAPAAVSVLGASELTLDGGHITGGRAAGVAAMQGAVVHLQRATIRRGDAPAG 266

Query: 67 ADDGIAALVAASLPGGAMTLDHGPGGVRYMIVGWRGAGAAVSTRCEGAGATVTPP 121
A+ ++PGGA+ GPGG ++ GW G + S+ E A + V P
Sbjct: 267 G-----AVPGGAVPGGAVPGGFGPGGFGPVLDGWYGVDVSGSS-VELAQSIVEAP 315


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0689  BCTERIALGSPG  41     2e-07    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.
Length = 145

Score = 41.0 bits (96), Expect = 2e-07
Identities = 16/56 (28%), Positives = 29/56 (51%)

Query: 6 RMRRTAAFTLLELMIVLAIVAVLAGWGIPSYREHVARMHRASAVAALYRAAQYLEM 61
+ FTLLE+M+V+ I+ VLA +P+ + + + AV+ + L+M
Sbjct: 3 ATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDM 58


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0692  PREPILNPTASE  27     0.009    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.
Length = 290

Score = 27.5 bits (61), Expect = 0.009
Identities = 6/26 (23%), Positives = 11/26 (42%)

Query: 73 DYVHEHPWTSIGVAAGVGVLIGLLIN 98
+ H PW + ++IG +N
Sbjct: 6 ELAHGLPWLYFSLVFLFSLMIGSFLN 31


54  Bamb_0747  Bamb_0768  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Bamb_0747   3      47       -11.329265  NAD-dependent epimerase/dehydratase
Bamb_0748   2      46       -10.818267  FkbM family methyltransferase
Bamb_0749   3      42        -9.376158  hypothetical protein
Bamb_0750   1      30        -6.776703  hypothetical protein
Bamb_0751   0      25        -5.671661  dTDP-glucose 4,6-dehydratase
Bamb_0752   2      32        -5.782787  glucose-1-phosphate thymidylyltransferase
Bamb_0753   3      34        -6.204603  dTDP-4-dehydrorhamnose 3,5-epimerase
Bamb_0754   2      32        -6.245109  dTDP-4-dehydrorhamnose reductase
Bamb_0755   1      31        -6.277883  mannose-1-phosphate
Bamb_0756   1      31        -6.130340  hypothetical protein
Bamb_0757   0      31        -5.782344  type 11 methyltransferase
Bamb_0758   0      29        -4.915549  group 1 glycosyl transferase
Bamb_0759   0      26        -3.330056  GDP-mannose 4,6-dehydratase
Bamb_0760   1      25        -2.899013  NAD-dependent epimerase/dehydratase
Bamb_0761   3      27        -3.570007  group 1 glycosyl transferase
Bamb_0762   2      24        -3.725706  group 1 glycosyl transferase
Bamb_0763   2      19        -3.250479  NAD-dependent epimerase/dehydratase
Bamb_0764   1      17        -3.083316  glycosyl transferase family protein
Bamb_0765   0      14        -2.609232  polysaccharide biosynthesis protein CapD
Bamb_0766   0      15        -1.839075  curculin domain-containing protein
Bamb_0767  -3      12        -0.172840  glycosyl transferase family protein
Bamb_0768  -1      13         1.239937  UDP-glucose 4-epimerase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0747  NUCEPIMERASE  98     2e-25    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 97.5 bits (243), Expect = 2e-25
Identities = 70/343 (20%), Positives = 120/343 (34%), Gaps = 63/343 (18%)

Query: 7 SVLITGAGGVIGHALKQELADSGYSNVVAITSSD------------------------ID 42
L+TGA G IG + + L ++G+ VV I + + ID
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGH-QVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 43 LRDQSATEKMFDELRPTIVFHMAARVYGIMGNMSNRGIAYLD-NVRINTNVVEAARQTGC 101
L D+ +F VF R+ + ++ N AY D N+ N++E R
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRL-AVRYSLEN-PHAYADSNLTGFLNILEGCRHNKI 118

Query: 102 KKFVAMGSTAIYSDQVRLPMSEEQIWVGAPHHSEAPYAHSKRGMLAQLEAYKDQYGMDYA 161
+ + S+++Y ++P S + H + YA +K+ Y YG+
Sbjct: 119 QHLLYASSSSVYGLNRKMPFSTDDS----VDHPVSLYAATKKANELMAHTYSHLYGLPAT 174

Query: 162 FCVSTNLFGPHDKFDEKFGHVIPSLVSKFYRASVLGQPISVWGSGKAERDFLFSGDAAYA 221
++GP + D KF +A + G+ I V+ GK +RDF + D A A
Sbjct: 175 GLRFFTVYGPWGRPDMAL--------FKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEA 226

Query: 222 LRLIAENHTGA--------------------INLATGQSHTIRHTVDTLCQISGFSGSVE 261
+ + + A N+ + + L G
Sbjct: 227 IIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKN 286

Query: 262 WDATKPDGQKLRAY-DISRL-TALGFKPRFSFDEALAITYDWY 302
+P G L D L +GF P + + + +WY
Sbjct: 287 MLPLQP-GDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWY 328


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Bamb_0748  RTXTOXIND  46     1e-07    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 46.4 bits (110), Expect = 1e-07
Identities = 20/159 (12%), Positives = 66/159 (41%), Gaps = 10/159 (6%)

Query: 223 FDGFIRASESDMIRRAMDLEQQLIESRKQLQEAHEGWSLEKGAREELEVRLNSMTDRAHD 282
F SE +++R +++Q + Q + + ++ ++ R +
Sbjct: 173 EPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQK-------ELNLDKKRAERLTVLARINR 225

Query: 283 EQASQVVLKDSSQDC--VVQNSKISQNEIEQLKARVAASEGDLESHRAQLAELRTRLTES 340
+ V K D ++ I+++ + + + + + +L +++QL ++ + + +
Sbjct: 226 YENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSA 285

Query: 341 E-KSAQVLSTERDTAYQELFESSRHAAWLSQERVRLQER 378
+ + V ++ +L +++ + L+ E + +ER
Sbjct: 286 KEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEER 324


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0751  NUCEPIMERASE  176    9e-55    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 176 bits (449), Expect = 9e-55
Identities = 90/350 (25%), Positives = 135/350 (38%), Gaps = 43/350 (12%)

Query: 2 ILVTGGAGFIGANFVLDWLRHDVEPVLNVDKLT--YAGNLRTL-QSLSGNPKHVFARVDI 58
LVTG AGFIG + L V+ +D L Y +L+ L P F ++D+
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAG-HQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 59 CDRAALDALFAEHKPRAVAHFAAESHVDRSIHGPADFVQTNVVGTFTLLEAARSYWSGLN 118
DR + LFA V V S+ P + +N+ G +LE R
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQ-- 119

Query: 119 DADKAGFRFLHVSTDEVFGSLSPTDPQFSETTPYA-PNSPYSATKAGSDHLVRAYHHTYG 177
L+ S+ V+G L+ P FS P S Y+ATK ++ + Y H YG
Sbjct: 120 -------HLLYASSSSVYG-LNRKMP-FSTDDSVDHPVSLYAATKKANELMAHTYSHLYG 170

Query: 178 LPTLTTNCSNNYGPYQFPEKLIPLMIANALAGKPLPIYGDGQNVRDWLYVGDHCSAIREV 237
LP YGP+ P+ + L GK + +Y G+ RD+ Y+ D AI +
Sbjct: 171 LPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRL 230

Query: 238 L------------------ARGVPGETYNVGGWNEKKNLDVVHTLCDLLDSASPKAAGSY 279
A P YN+G + + +D + L D L +A +
Sbjct: 231 QDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALG---IEAKKNM 287

Query: 280 RDQITYVTDRPGHDRRYAIDARKLERELGWKPAETFETGLAKTVRWYLDN 329
+PG + D + L +G+ P T + G+ V WY D
Sbjct: 288 LPL------QPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDF 331


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0754  NUCEPIMERASE  49     7e-09    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 49.0 bits (117), Expect = 7e-09
Identities = 48/212 (22%), Positives = 72/212 (33%), Gaps = 61/212 (28%)

Query: 10 TILVTGVTGQIGFELLRALQGLG-RVVPCD--------------RSVL----------DL 44
LVTG G IGF + + L G +VV D +L DL
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 45 ADLDRVRAFARDLKPALIVNPAAYTAVDTAESEVELARRLNVDVPRVFAE---------- 94
AD + + + AV R +++ P +A+
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAV-----------RYSLENPHAYADSNLTGFLNIL 110

Query: 95 EAARSGG--TLIHYSTDYVF-DGTKVGAYVETDAPNPLNAYGATKLEGEQAIAAT----- 146
E R L++ S+ V+ K+ + +P++ Y ATK E +A T
Sbjct: 111 EGCRHNKIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANEL-MAHTYSHLY 169

Query: 147 GCAHVILRTSWVYGRRGR------NFLRTMLK 172
G LR VYG GR F + ML+
Sbjct: 170 GLPATGLRFFTVYGPWGRPDMALFKFTKAMLE 201


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Bamb_0757  RTXTOXIND  33     0.003    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 32.9 bits (75), Expect = 0.003
Identities = 25/159 (15%), Positives = 49/159 (30%), Gaps = 23/159 (14%)

Query: 175 FARVKQLRLQEPPALHEAGARIRLLDVLGGVSPDYAIVAQKSCSEAEAGLFDEAFHEDYG 234
AR++Q R Q L + +L ++ P + V SE E E +
Sbjct: 145 QARLEQTRYQ---ILSRSIELNKLPELKLPDEPYFQNV-----SEEEVLRLTSLIKEQFS 196

Query: 235 LSLAELAARFDAGAARQHEHAAEQIQRTQAEVRRLHGELGVIHEELTR----------TR 284
+ + ++ E A + R V L +
Sbjct: 197 TWQNQKYQKELNLDKKRAE-----RLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAK 251

Query: 285 SELTQAVDELGQVRRDLGRVRSDLGQVNGELKRVHDEAQ 323
+ + ++ + +L +S L Q+ E+ +E Q
Sbjct: 252 HAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQ 290


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0759  NUCEPIMERASE  97     4e-25    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 97.2 bits (242), Expect = 4e-25
Identities = 65/347 (18%), Positives = 121/347 (34%), Gaps = 57/347 (16%)

Query: 7 IITGITGQDGAYLAELLLDKGYTVYG-----TYRRTSSVNFWRIEELGIAKHPNLHLVEY 61
++TG G G ++++ LL+ G+ V G Y S + L + P +
Sbjct: 4 LVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVS----LKQARLELLAQPGFQFHKI 59

Query: 62 DLTDLSASIRLLQTTGATEVYNLAAQSFVGVSFDQPVTTAEITGVGPLNLLEAIRIVNPK 121
DL D L + V+ + V S + P A+ G LN+LE R +
Sbjct: 60 DLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQ 119

Query: 122 IRFYQASTSEMFGKVQAIPQIESTPF-YPRSPYGVAKLYAHWITVNYRESYDIFGCSGIL 180
AS+S ++G + +P +P S Y K + Y Y +
Sbjct: 120 -HLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRF 178

Query: 181 FNHESPLRGR-EFVTRKITDSVAKIKLGQLDVLELGNMDAKRDWGFAKEYVEGMWRMLQA 239
F P GR + K T ++ + K +DV G M KRD+ + + E + R+
Sbjct: 179 FTVYGP-WGRPDMALFKFTKAMLEGK--SIDVYNYGKM--KRDFTYIDDIAEAIIRLQDV 233

Query: 240 DEPDT-------------------FVLATNRTETVRDFVRMAFKAAGVDLEFKGSDEQEI 280
+ + + + D+++ A G++ + Q
Sbjct: 234 IPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQ-- 291

Query: 281 AVDVATGKTLVRVNPKFHRPAEVDLLIGNPEKAKQKLGWEPKTTLEE 327
P +V + + + +G+ P+TT+++
Sbjct: 292 -------------------PGDVLETSADTKALYEVIGFTPETTVKD 319


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0760  NUCEPIMERASE  122    2e-34    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 122 bits (307), Expect = 2e-34
Identities = 75/313 (23%), Positives = 125/313 (39%), Gaps = 42/313 (13%)

Query: 11 RALVTGLGGFTGDYLAESLRAAGYRVFGTTHAADTIEPD---------------TYRVDL 55
+ LVTG GF G ++++ L AG++V G + D + +++DL
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 56 CDRPTLADVVAEVQPDIVAHLAAVSFV--AHGDADAIYRTNVVGTRNLLEALANLENRPR 113
DR + D+ A + V V + + A +N+ G N+LE N+ +
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCR--HNKIQ 119

Query: 114 AVLLASSANIYG-NAAVEIIDESIEPNPANDYAVSKLAMEYMARLWRD--KLPIVIARPF 170
+L ASS+++YG N + + +P + YA +K A E MA + LP R F
Sbjct: 120 HLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFF 179

Query: 171 NYTGVGQSPQFLLPKIVGHFQRGERVIELGNIDVERDFSDVRRVADAYRRLLELSPAGG- 229
G P L K G+ + ++RDF+ + +A+A RL ++ P
Sbjct: 180 TVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHADT 239

Query: 230 -----------------VFNVCSGRAVSLKAVISMMERIAGYAIEVRVNPAFVRANDVRR 272
V+N+ + V L I +E G IE + N ++ DV
Sbjct: 240 QWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALG--IEAKKNMLPLQPGDVLE 297

Query: 273 LQGNDARLQAAIG 285
+ L IG
Sbjct: 298 TSADTKALYEVIG 310


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0763  NUCEPIMERASE  114    2e-31    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 114 bits (286), Expect = 2e-31
Identities = 78/341 (22%), Positives = 130/341 (38%), Gaps = 37/341 (10%)

Query: 1 MRLVITGANGFVGRAVCRRALDAGHTVTAL----------VRRPGACIDGVREWVHGSAD 50
M+ ++TGA GF+G V +R L+AGH V + +++ + + D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 51 WEGLDAAWPADLVA----DCVIHLAARVHVMRDDSPDPDAAFDATNVAGTLRLAEAARKY 106
D DL A + V R+ V R +P A D +N+ G L + E R
Sbjct: 61 LA--DREGMTDLFASGHFERVFISPHRLAV-RYSLENPHAYAD-SNLTGFLNILEGCRHN 116

Query: 107 GVRRIVYASSIKAVGESDSGAPLSESWPAD-PQDAYGRSKLRAEQQLARFGTSAGLDVVI 165
++ ++YASS G + P S D P Y +K E + GL
Sbjct: 117 KIQHLLYASSSSVYGLNRK-MPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATG 175

Query: 166 VRPPLVYGPHVTAN--FLRMMDAVARGMPLPL-GSISARRSIVYVDNLADALLQCATDPR 222
+R VYGP + + A+ G + + +R Y+D++A+A+++
Sbjct: 176 LRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIP 235

Query: 223 AAGECFHVADDDAPSVTGLLRLVGDALGKPARLLPVPTAALRALGKLTGRSATIDRLTGS 282
A + V + R+ P L+ ++AL G A +
Sbjct: 236 HADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDY----IQALEDALGIEA--KKNMLP 289

Query: 283 LQL--------DTGRIKRVLGWQPPYTTRQGLEATAAWYRS 315
LQ DT + V+G+ P T + G++ WYR
Sbjct: 290 LQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRD 330


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0765  NUCEPIMERASE  72     1e-15    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 71.7 bits (176), Expect = 1e-15
Identities = 54/298 (18%), Positives = 110/298 (36%), Gaps = 44/298 (14%)

Query: 285 VMVTGAGGSIGSELCRQILRFAPAQLVAFD-LSEYAMYRLAEELRERFPDQPVVPIIGDA 343
+VTGA G IG + +++L A Q+V D L++Y L + E D
Sbjct: 3 YLVTGAAGFIGFHVSKRLLE-AGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 344 KDSLLLDQVMSRHAPHIVFHAAAYKHVPLMEEHNAWQALRNNVLGTYRVACAAIRHDVRH 403
D + + + VF + V E N +N+ G + + ++H
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLE-NPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 404 FVLIST---------------DKAVNPTNVMGASKRLAEMVCQALQQTSGGTQFETVRFG 448
+ S+ D +P ++ A+K+ E++ G +RF
Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLY-GLPATGLRFF 179

Query: 449 NVLGSAGS---VIPKFQQQIAKGGPVTV-THPEITRFFMTIPEASQLVLQAS-------- 496
V G G + KF + + +G + V + ++ R F I + ++ +++
Sbjct: 180 TVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHADT 239

Query: 497 --SMGHGG--------EIFILEMGQPVRIVDLARDLIRLYGFSEGQIRIDFTGLRPGE 544
++ G ++ + PV ++D + L G + + + L+PG+
Sbjct: 240 QWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGI---EAKKNMLPLQPGD 294


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0766  SUBTILISIN    49     4e-08    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 48.7 bits (116), Expect = 4e-08
Identities = 20/88 (22%), Positives = 34/88 (38%), Gaps = 6/88 (6%)

Query: 312 SFAWSDIALAINRAVSDNTARVINMSIGGCENWAPTAAIDTLFQLAVAQGQTFSVSSGDS 371
S + I I A+ +I+MS+GG E+ + + AVA ++G+
Sbjct: 123 SGQYDWIIQGIYYAIEQK-VDIISMSLGGPED---VPELHEAVKKAVASQILVMCAAGNE 178

Query: 372 GSVAYGCNGTSVQYPATSPYVVAVGGTT 399
G + YP V++VG
Sbjct: 179 GD--GDDRTDELGYPGCYNEVISVGAIN 204


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0767  60KDINNERMP   29     0.033    60kDa inner membrane protein signature.
		>60KDINNERMP#60kDa inner membrane protein signature.

Length = 548

Score = 29.1 bits (65), Expect = 0.033
Identities = 14/50 (28%), Positives = 20/50 (40%), Gaps = 3/50 (6%)

Query: 164 LASMVSFMMFASLAYVAFHVNDPVVMSASII-MMGAVLGFFLWNFPAGLI 212
L ++ MF V DP M I+ M + F FP+GL+
Sbjct: 467 LPILMGVTMFFIQKMSPTTVTDP--MQQKIMTFMPVIFTVFFLWFPSGLV 514


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0768  NUCEPIMERASE  164    2e-50    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 164 bits (418), Expect = 2e-50
Identities = 81/353 (22%), Positives = 148/353 (41%), Gaps = 54/353 (15%)

Query: 6 TILVTGGAGYIGSHTAVELLDNGYDVVIVDNLVNSKAESVR--RIEKITGRTPAFHQVDV 63
LVTG AG+IG H + LL+ G+ VV +DNL + S++ R+E + FH++D+
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 64 CDEAALAKVFDAHPITGTIHFAALKAVGESVAKPLEYYQNNIGGLLTVLKVMRERNVRQF 123
D + +F + AV S+ P Y +N+ G L +L+ R ++
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121

Query: 124 VFSSSATVYGVPERSPIDES----FPLSATNPYGQSKLMAEQV------LRDLEVSDPSW 173
+++SS++VYG+ + P P+S Y +K E + L L
Sbjct: 122 LYASSSSVYGLNRKMPFSTDDSVDHPVS---LYAATKKANELMAHTYSHLYGL------- 171

Query: 174 RVATLRYFNPVGAHASGLIGEDPAGIPNNLMPYVAQVAVGKLEKLRVFGSDYPTPDGTGV 233
LR+F G P G P ++ + A+ + + + V+ G
Sbjct: 172 PATGLRFFTVYG----------PWGRP-DMALFKFTKAMLEGKSIDVYN------YGKMK 214

Query: 234 RDYIHVVDLAKGHIAALDALATRDASF---------------VVNLGTGQGYSVLEVVRA 278
RD+ ++ D+A+ I D + D + V N+G +++ ++A
Sbjct: 215 RDFTYIDDIAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQA 274

Query: 279 FEKASGRPVPYELVARRPGDIAECYANPQAAADLIGWRATLGIDEMCADHWKW 331
E A G ++ +PGD+ E A+ +A ++IG+ + + + W
Sbjct: 275 LEDALGIEAKKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNW 327


55  Bamb_0886  Bamb_0892  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Bamb_0886  0       11       1.769963   amidase
Bamb_0887  -1      12       1.528524   GntR family transcriptional regulator
Bamb_0888  -1      11       2.167370   peptidase C26
Bamb_0889  -1      9        2.219436   hypothetical protein
Bamb_0890  -1      8        1.542699   hypothetical protein
Bamb_0891  -2      9        1.637574   general substrate transporter
Bamb_0892  -2      10       2.296483   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0886  MICOLLPTASE   32     0.008    Microbial collagenase metalloprotease (M9) signature.
		>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature.

Length = 1104

Score = 31.6 bits (71), Expect = 0.008
Identities = 11/79 (13%), Positives = 32/79 (40%), Gaps = 7/79 (8%)

Query: 303 DINRFGFSPIEAYAWHRPLLAQHRDRYDPRVLSRILKGEPASAADYLDLLAARQAMLDEA 362
D +G +A + + ++ ++ + I + + DY+ +++ + D+
Sbjct: 581 DFYNYG------FALSNYMYNNNMGMFN-KMTNYIKNNDVSGYKDYIASMSSDYGLNDKY 633

Query: 363 AHTVWSRFDALVAPTVPVV 381
+ S + + VP+V
Sbjct: 634 QDYMDSLLNNIDNLDVPLV 652


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0888  SSBTLNINHBTR  29     0.014    Streptomyces subtilisin inhibitor signature.
		>SSBTLNINHBTR#Streptomyces subtilisin inhibitor signature.

Length = 144

Score = 29.4 bits (65), Expect = 0.014
Identities = 20/68 (29%), Positives = 28/68 (41%), Gaps = 2/68 (2%)

Query: 41 SDPAHAPEVPATQSSVAQAAAAEDVAADGA--SPVTAASEPGAAGTTPAAESASEPASEP 98
+ PA AP S++ + AA A VT P A+GT PAA +A
Sbjct: 28 ASPATAPASLYAPSALVLTVGHGESAATAAPLRAVTLTCAPTASGTHPAAAAACAELRAA 87

Query: 99 AAEPTAAP 106
+P+A
Sbjct: 88 HGDPSALA 95


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0889  RTXTOXIND     29     0.012    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 29.4 bits (66), Expect = 0.012
Identities = 14/63 (22%), Positives = 27/63 (42%), Gaps = 1/63 (1%)

Query: 148 QANRAQQLQADLSVARSQQAEVAQRQQSAREQTQALQVE-KRAAQVQLRDLQEQVRQLEK 206
Q N+ + +L V +SQ ++ SA+E+ Q + K +LR + + L
Sbjct: 257 QENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTL 316

Query: 207 QTE 209
+
Sbjct: 317 ELA 319



Score = 27.9 bits (62), Expect = 0.035
Identities = 8/68 (11%), Positives = 23/68 (33%), Gaps = 1/68 (1%)

Query: 141 LERVIALQANRAQQLQADLSVARSQQAEVAQRQQSAREQTQALQVEKRA-AQVQLRDLQE 199
E N + ++ L S+ + Q + + ++K + L
Sbjct: 257 QENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTL 316

Query: 200 QVRQLEKQ 207
++ + E++
Sbjct: 317 ELAKNEER 324


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0891  TCRTETA       34     7e-04    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 34.4 bits (79), Expect = 7e-04
Identities = 24/91 (26%), Positives = 36/91 (39%), Gaps = 10/91 (10%)

Query: 239 VVIAGMGMVIMTTVSFYMITAYTPTFGKEVLHLSSLDALVVTVCVGLSNLVWLPLSGALS 298
V + +G+ ++ V ++ + H L A L P+ GALS
Sbjct: 14 VALDAVGIGLIMPVLPGLLRDLVHS-NDVTAHYGILLA-----LYALMQFACAPVLGALS 67

Query: 299 DRIGRRPVLIA----FTVLTLLSAYPAVLWL 325
DR GRRPVL+ V + A LW+
Sbjct: 68 DRFGRRPVLLVSLAGAAVDYAIMATAPFLWV 98


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_0892  SYCDCHAPRONE  48     2e-08    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 47.6 bits (113), Expect = 2e-08
Identities = 18/105 (17%), Positives = 37/105 (35%)

Query: 52 LGTNPADADALHLFGVLRHQQGQHAEAADLVGRAVELRPGDAALQLNLGNALKALGRLDE 111
+ + L+ ++Q G++ +A + L D+ L LG +A+G+ D
Sbjct: 29 NEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDL 88

Query: 112 AIERFRNALTLAPEFPLAHYNLGNAYAALQRHEDAVDAFGRALRL 156
AI + + + P ++ +A A L
Sbjct: 89 AIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQEL 133



Score = 47.2 bits (112), Expect = 2e-08
Identities = 19/105 (18%), Positives = 36/105 (34%), Gaps = 3/105 (2%)

Query: 25 MDSAFDRAYAAHRAGRLAEAEHGYRAALGTNPADADALHLFGVLRHQQGQHAEAADLVGR 84
++ + A+ +++G+ +A ++A + D+ G R GQ+ A
Sbjct: 36 LEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSY 95

Query: 85 AVELRPGDAALQLNLGNALKALGRLDEAIERFRNALTLA---PEF 126
+ + + L G L EA A L EF
Sbjct: 96 GAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTEF 140



Score = 46.8 bits (111), Expect = 4e-08
Identities = 20/83 (24%), Positives = 30/83 (36%)

Query: 200 NLAMALNAMGRADDAIAHFQAAIAAQPRFVAAHFNLGNTFEALGRHGEAAAAFEAALALH 259
+LA G+ +DA FQA LG +A+G++ A ++ +
Sbjct: 41 SLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMD 100

Query: 260 PPFPLALFGLANALCALGRQREA 282
P F A L G EA
Sbjct: 101 IKEPRFPFHAAECLLQKGELAEA 123



Score = 43.4 bits (102), Expect = 5e-07
Identities = 20/105 (19%), Positives = 32/105 (30%)

Query: 154 LRLTPDDASIHNNLGNALNALGRHDDALAAFHRALELRPGHAGAHNNLAMALNAMGRADD 213
++ D +L G+++DA F L + L AMG+ D
Sbjct: 29 NEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDL 88

Query: 214 AIAHFQAAIAAQPRFVAAHFNLGNTFEALGRHGEAAAAFEAALAL 258
AI + + F+ G EA + A L
Sbjct: 89 AIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQEL 133



Score = 42.6 bits (100), Expect = 9e-07
Identities = 25/127 (19%), Positives = 46/127 (36%), Gaps = 10/127 (7%)

Query: 112 AIERF-RNALTLA------PEFPLAHYNLGNAYAALQRHEDAVDAFGRALRLTPDDASIH 164
A+E F + T+A + Y+L ++EDA F L D+
Sbjct: 14 AMESFLKGGGTIAMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFF 73

Query: 165 NNLGNALNALGRHDDALAAFHRALELRPGHAGAHNNLA---MALNAMGRADDAIAHFQAA 221
LG A+G++D A+ ++ + + A + + A+ + Q
Sbjct: 74 LGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQEL 133

Query: 222 IAAQPRF 228
IA + F
Sbjct: 134 IADKTEF 140



Score = 39.5 bits (92), Expect = 1e-05
Identities = 16/70 (22%), Positives = 30/70 (42%)

Query: 277 GRQREALPYYERAVGLDPSFSLAWLNLGNAHHALGAHEMALRAFDQALRVAPDLKLAQLH 336
G+ +A ++ LD S +L LG A+G +++A+ ++ + H
Sbjct: 50 GKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFH 109

Query: 337 RAVTLLTLGD 346
A LL G+
Sbjct: 110 AAECLLQKGE 119


56  Bamb_1072  Bamb_1081  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Bamb_1072  -1      9        2.071727   hypothetical protein
Bamb_1073  1       13       1.486940   glyoxalase/bleomycin resistance
Bamb_1074  1       15       1.241787   2Fe-2S iron-sulfur cluster binding
Bamb_1075  1       14       1.297130   aldehyde oxidase and xanthine dehydrogenase
Bamb_1076  1       15       1.070852   outer membrane protein (porin)
Bamb_1077  1       12       0.670885   AraC family transcriptional regulator
Bamb_1078  0       10       0.110832   acriflavin resistance protein
Bamb_1079  0       10       0.005109   acriflavin resistance protein
Bamb_1080  -1      11       0.076844   RND family efflux transporter MFP subunit
Bamb_1081  -2      10       0.000207   IclR family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1072  IGASERPTASE   44     2e-06    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 43.9 bits (103), Expect = 2e-06
Identities = 37/191 (19%), Positives = 66/191 (34%), Gaps = 11/191 (5%)

Query: 323 AVQESVHADAQEAAVVD---VTDEVALVPAATAEATVVT---EATEVADTHGEAKDGRKR 376
A SV ++ +E A VD V P+ T E E+ V +A + +
Sbjct: 1005 ADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQ 1064

Query: 377 ARKSAAKKTGAKKGGEAKGGEAKGAGRQAAAKPDEAAHGGAKHGGDKHAQGMPAEATHRD 436
R+ A + AK +A + A Q+ ++ E K + T +
Sbjct: 1065 NREVAKE---AKSNVKANTQTNEVA--QSGSETKETQTTETKETATVEKEEKAKVETEKT 1119

Query: 437 MEHREERHVAAPTAGEYGAPAAAAQPAHEAAPAAESTGEAAGETVAAKPKKPARKTAPRA 496
E + +P + A+PA E P + A ++PA++T+
Sbjct: 1120 QEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNV 1179

Query: 497 RRPRKTATAAE 507
+P +T
Sbjct: 1180 EQPVTESTTVN 1190


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1076  ECOLNEIPORIN  66     2e-14    E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 66.4 bits (162), Expect = 2e-14
Identities = 70/345 (20%), Positives = 113/345 (32%), Gaps = 57/345 (16%)

Query: 24 AQSSVTLYGIVDTGIQYYNNAAGGGAVAGMPSLTGEVP---SRFGLRGVEDLGGGYRAFF 80
A + VTLYG + G++ + A GA A + S+ G +G EDLG G +A +
Sbjct: 17 AMADVTLYGTIKAGVETSRSVAHNGAQAASVETGTGIVDLGSKIGFKGQEDLGNGLKAIW 76

Query: 81 VLENGFAPNSGTLNYGGRLFGRQANVGIESPYGALTLGRQMNMSMRVLLNADVIGP---- 136
+E + RQ+ +G++ +G L +GR + VL + I P
Sbjct: 77 QVEQ----KASIAGTDSGWGNRQSFIGLKGGFGKLRVGRLNS----VLKDTGDINPWDSK 128

Query: 137 ----SIHSMASFDSYLPNARSDNALGYLGRFGGVTLGGTYSTGRDDAGPAGPSATHCAGN 192
++ +A ++ L + R D+ F G++ Y+ D+AG
Sbjct: 129 SDYLGVNKIAEPEARLISVRYDSP-----EFAGLSGSVQYA-LNDNAGRHNS-------- 174

Query: 193 VAGDPVACRQYTMMVAYDAPQFGAAASY--------DVMHGGAGASAPLSSPGYTDTRTI 244
Y Y F GY +
Sbjct: 175 --------ESYHAGFNYKNGGFFVQYGGAYKRHHQVQENVNIEKYQIHRLVSGYDNDALY 226

Query: 245 VDAYVKFGIAKLGAGWIRRNTAAAAHSQSDIFFAGGTVQATPALSFDAQALRYLLRGRFD 304
V+ AKL N+ + F TP +S+ A +
Sbjct: 227 ASVAVQQQDAKLVEENYSHNSQTEVAATLAYRFGN----VTPRVSY-AHGFKGSFDATNY 281

Query: 305 SNL---FVARANYSLSKRTMVYTSVAYMTNSALGTSAVAAGGTVG 346
+N V A Y SKRT S ++ + V+ G VG
Sbjct: 282 NNDYDQVVVGAEYDFSKRTSALVSAGWLQEGKGESKFVSTAGGVG 326


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1078  ACRIFLAVINRP  761    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 761 bits (1966), Expect = 0.0
Identities = 274/1102 (24%), Positives = 505/1102 (45%), Gaps = 95/1102 (8%)

Query: 3 LSRPFITRPVATTLLALGVALAGLFAFIKLPVSPLPQVDFPTISVQASLPGASPETVATS 62
++ FI RP+ +LA+ + +AG A ++LPV+ P + P +SV A+ PGA +TV +
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 63 VTSPLERHLGSIADVSEMTSTST-VGNARIILQFGLNRDIDGAARDVQAAINAARADLPA 121
VT +E+++ I ++ M+STS G+ I L F D D A VQ + A LP
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 122 SLKSNPTYRKVNPADSPIMIVSLTSET--SSPAKLYDAASTVLQQSLSQIDGIGQVSVSG 179
++ + S +M+ S+ ++ + D ++ ++ +LS+++G+G V + G
Sbjct: 121 EVQ-QQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG 179

Query: 180 SANPAVRVELEPHALFHYGIGLEDVRAALASANANSPKGAIEFGPK------HYQLYTND 233
+ A+R+ L+ L Y + DV L N G + P + +
Sbjct: 180 AQY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQT 238

Query: 234 QASHASQYSDLVV-AYRNGAAVRLSDLSEVVDSVEDLRNLGLSNGKRAVLVILYRSPGAN 292
+ + ++ + + +G+ VRL D++ V E+ + NGK A + + + GAN
Sbjct: 239 RFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGAN 298

Query: 293 IIETIDRVRTALPQLTASLPADITVTPVLDRSTTIRASLKDTEHTLLIAISLVVMVVFLF 352
++T ++ L +L P + V D + ++ S+ + TL AI LV +V++LF
Sbjct: 299 ALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLF 358

Query: 353 LRNWRATLIPSVAVPISIIGTFGAMYLLGFSIDNLSLMALIVATGFVVDDAIVVLENITR 412
L+N RATLIP++AVP+ ++GTF + G+SI+ L++ +++A G +VDDAIVV+EN+ R
Sbjct: 359 LQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVER 418

Query: 413 HI-ENGKPRLQAAFDGAREVGFTVLSMSISLVAVFLPILLMGGIVGRLFREFALTLSLAI 471
+ E+ P +A ++ ++ +++ L AVF+P+ GG G ++R+F++T+ A+
Sbjct: 419 VMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAM 478

Query: 472 AVSLAVSLTVTPMMCARLLREPHDAHEE--GRLGRFLERFFTRMQRGYERSLSWALRRPL 529
A+S+ V+L +TP +CA LL+ H E G + F Y S+ L
Sbjct: 479 ALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTG 538

Query: 530 LVLLILFATIGLNVYLYIIVPKGFFPQQDTGLMIGGIRADQSTSFQAMKQKFTEMMRIVQ 589
LLI + V L++ +P F P++D G+ + I+ + + ++ ++
Sbjct: 539 RYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYL 598

Query: 590 SN--PNVQSAAGFTG----GTQTNSGFMFVTLKDRTER---KLSADQVIQQLRRPLADVA 640
N NV+S G G N+G FV+LK ER + SA+ VI + + L +
Sbjct: 599 KNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIR 658

Query: 641 GASTFLQAAQDIRVGGRQSNAQYQFT-LLGDSSADLYKWGP-LLTEALQKRSELTDVNSD 698
I G + ++ G L + LL A Q + L V +
Sbjct: 659 DGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPN 718

Query: 699 QQQGGLEAMVTIDRATAARLGIKPAQIDNTLYDAFGQRQVSTIYNPLNQYHVVMEVAPKY 758
+ + + +D+ A LG+ + I+ T+ A G V+ + + ++ K+
Sbjct: 719 GLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKF 778

Query: 759 WQSPQMLNEVWISTSGGSANGSQSTNAAAGTFVATSAGTSSAGTAATSAAAIASDSARNQ 818
P+ ++++++ + A G V S A +
Sbjct: 779 RMLPEDVDKLYVRS-------------ANGEMVPFS--------------AFTTSHWVYG 811

Query: 819 ALNSIASSGKSSASSGAAVSTSKSTMIPLSAIATFGPSTTPLSVNHQGLFVATTISFNLP 878
+ +G S + S+ ++ + ++ LP
Sbjct: 812 SPRLERYNGLPSMEIQGEAAPGTSSGDAMALME--------------------NLASKLP 851

Query: 879 PGVSLSQATQVIYQTMAQIGVPPTIVGSFQGTAQAFQQSLNNQPILILAALLAVYIVLGI 938
G+ + G + + S N P L+ + + V++ L
Sbjct: 852 AGIGY----------------------DWTGMSYQERLSGNQAPALVAISFVVVFLCLAA 889

Query: 939 LYESYIHPITILSTLPSAGVGALLALLLFKTEFSIIALIGVILLIGIVKKNAIMMVDFAI 998
LYES+ P++++ +P VG LLA LF + + ++G++ IG+ KNAI++V+FA
Sbjct: 890 LYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAK 949

Query: 999 DQTRNGNKSSYDAIHEACLLRFRPIMMTTMAALLGALPLAFGHGDGAELRAPLGIAIAGG 1058
D K +A A +R RPI+MT++A +LG LPLA +G G+ + +GI + GG
Sbjct: 950 DLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGG 1009

Query: 1059 LIMSQVLTLYTTPVVYLYMDRL 1080
++ + +L ++ PV ++ + R
Sbjct: 1010 MVSATLLAIFFVPVFFVVIRRC 1031



Score = 97.2 bits (242), Expect = 2e-22
Identities = 82/507 (16%), Positives = 169/507 (33%), Gaps = 33/507 (6%)

Query: 2 NLSRPFITRPVATTLLALGVALAGLFAFIKLPVSPLPQVDFPTISVQASLP-GASPETVA 60
N + L+ + + F++LP S LP+ D LP GA+ E
Sbjct: 528 NSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQ 587

Query: 61 TSVTSPLERHLGSIAD-------VSEMTSTSTVGNARIIL-QFGLNRDIDGAARDVQAAI 112
+ + +L + V+ + + NA + + +G +A I
Sbjct: 588 KVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVI 647

Query: 113 NAARADLPASLKSNPTYRKVNPADSPIMIVSLTSETS-----SPAKLYDAASTVLQQSLS 167
+ A+ +L + E L A + +L +
Sbjct: 648 HRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQ 707

Query: 168 QIDGIGQVSVSGSAN-PAVRVELEPHALFHYGIGLEDVRAALASANANSPKGAIEFGPKH 226
+ V +G + ++E++ G+ L D+ +++A + +
Sbjct: 708 HPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRV 767

Query: 227 YQLYT---NDQASHASQYSDLVVAYRNGAAVRLSDLSEVVDSVEDLRNLGLSNGKRAVLV 283
+LY L V NG V S + V L NG ++ +
Sbjct: 768 KKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHW-VYGSPRLERYNGLPSMEI 826

Query: 284 ILYRSPGANIIETIDRVRTALPQLTASLPADITVTPVLDRSTTIRASLKDTEHTLLIAIS 343
+PG + + + L + LPA I T + + + + ++
Sbjct: 827 QGEAAPGTSSGD----AMALMENLASKLPAGIGYD-----WTGMSYQERLSGNQAPALVA 877

Query: 344 LVVMVVFLFL----RNWRATLIPSVAVPISIIGTFGAMYLLGFSIDNLSLMALIVATGFV 399
+ +VVFL L +W + + VP+ I+G A L D ++ L+ G
Sbjct: 878 ISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLS 937

Query: 400 VDDAIVVLENIT-RHIENGKPRLQAAFDGAREVGFTVLSMSISLVAVFLPILLMGGIVGR 458
+AI+++E + GK ++A R +L S++ + LP+ + G
Sbjct: 938 AKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSG 997

Query: 459 LFREFALTLSLAIAVSLAVSLTVTPMM 485
+ + + + +++ P+
Sbjct: 998 AQNAVGIGVMGGMVSATLLAIFFVPVF 1024



Score = 61.4 bits (149), Expect = 2e-11
Identities = 27/168 (16%), Positives = 62/168 (36%), Gaps = 1/168 (0%)

Query: 924 LILAALLAVYIVLGILYESYIHPITILSTLPSAGVGALLALLLFKTEFSIIALIGVILLI 983
L A +L +V+ + ++ + +P +G L F + + + G++L I
Sbjct: 344 LFEAIMLVF-LVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAI 402

Query: 984 GIVKKNAIMMVDFAIDQTRNGNKSSYDAIHEACLLRFRPIMMTTMAALLGALPLAFGHGD 1043
G++ +AI++V+ +A ++ ++ M +P+AF G
Sbjct: 403 GLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGS 462

Query: 1044 GAELRAPLGIAIAGGLIMSQVLTLYTTPVVYLYMDRLRVWGEKRRNRR 1091
+ I I + +S ++ L TP + + +
Sbjct: 463 TGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGG 510


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1079  ACRIFLAVINRP  810    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 810 bits (2093), Expect = 0.0
Identities = 286/1036 (27%), Positives = 496/1036 (47%), Gaps = 29/1036 (2%)

Query: 4 SRLFILRPVGTALLMAAIMLAGLVALRFLPLAALPEVDYPTIQVQTFYPGASPEVMTSSV 63
+ FI RP+ +L +M+AG +A+ LP+A P + P + V YPGA + + +V
Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61

Query: 64 TAPLERQFGQMPSLNQMSSQS-SAGASVITLQFSLDLPLDIAEQEVQAAINAAGNLLPSD 122
T +E+ + +L MSS S SAG+ ITL F DIA+ +VQ + A LLP +
Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121

Query: 123 LPAPPIYAKVNPADAPVLTLAITSKTLPLTQ--VQDLTDTRLAMKISQIAGVGLVSLSGG 180
+ I + + ++ S TQ + D + + +S++ GVG V L G
Sbjct: 122 VQQQGIS-VEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 181 NRPAVRIQANPTALAQYGMNLDDLRTTISNLNVNTPKGNFDGP------TRAYTINANDQ 234
A+RI + L +Y + D+ + N G G +I A +
Sbjct: 181 QY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 235 LTSADQY-NSAVVAYKSGRPVMLTDVAKVVAGSENTKLGAWVNAEPAIILNVQRQPGANV 293
+ +++ + G V L DVA+V G EN + A +N +PA L ++ GAN
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 294 IATVDAIKAQLPKLQETLPAALDVQVVTDRTTMIRAAVRDVQFELLLAVVLVVLVMYLFL 353
+ T AIKA+L +LQ P + V D T ++ ++ +V L A++LV LVMYLFL
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 354 ANVYATIIPSLSVPLSLIGTLAVMYMAGFSLNNLSLMALTIATGFVVDDAIVMIENIARY 413
N+ AT+IP+++VP+ L+GT A++ G+S+N L++ + +A G +VDDAIV++EN+ R
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 414 VEEGHT-GLEAALKGSRQIGFTIISLTVSLIAVLIPLLFMGDVVGRLFHEFAITLAVTIV 472
+ E EA K QI ++ + + L AV IP+ F G G ++ +F+IT+ +
Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479

Query: 473 ISAIVSLTLVPMMCAKLLRHSPPPESH---RFEARVHRVIDAVIARYGVALEWVLNRQGS 529
+S +V+L L P +CA LL+ F + D + Y ++ +L G
Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 530 TLVVALLTLALTALLYVYIPKGFFPAQDTGVIQAITQAPQSISYGAMAERQQALAAEILK 589
L++ L +A +L++ +P F P +D GV + Q P + + + LK
Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 590 D--PNVESLTSFIGVDGSNITLNSGRMLINLKARDHRS---ESSAQIIRDLQQRVANVTG 644
+ NVES+ + G S N+G ++LK + R+ S+ +I + + +
Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659

Query: 645 ITLFMQSVQDLTIDSTVSPTQYQFMLTS---PNPDEFATWVPKLVTRLQKEPS-LADVAT 700
F+ I + T + F L D +L+ + P+ L V
Sbjct: 660 --GFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRP 717

Query: 701 DLQSNGQSVYIEIDRASAARFGITPATVDNALYDAFGQRIVSTIFTQSNQYRVILESEPK 760
+ + +E+D+ A G++ + ++ + A G V+ + ++ ++++ K
Sbjct: 718 NGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAK 777

Query: 761 EQHYAESLNDIYLPSAGGGQVPLSSIASFHERPSPLLIAHLSQFPSTTISFNLAPGASLG 820
+ E ++ +Y+ SA G VP S+ + H + + PS I APG S G
Sbjct: 778 FRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSG 837

Query: 821 EAVKAIGAAEKDIGLPGSFQTRFQGAALAFQASLSNQLFLILAAVITMYIVLGVLYESYI 880
+A+ + LP + G + + S + L+ + + +++ L LYES+
Sbjct: 838 DAMALMENLASK--LPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWS 895

Query: 881 HPITILSTLPSAGVGALLALMITGHDLDIIGIIGIVLLIGIVKKNAIMMIDFALEAERVE 940
P++++ +P VG LLA + D+ ++G++ IG+ KNAI++++FA + E
Sbjct: 896 IPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKE 955

Query: 941 GKPPREAIYQACLLRFRPILMTTLAALLGAVPLIVGAGAGSELRQPLGIAIAGGLIVSQV 1000
GK EA A +R RPILMT+LA +LG +PL + GAGS + +GI + GG++ + +
Sbjct: 956 GKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATL 1015

Query: 1001 LTLFTTPVIYLGFDSL 1016
L +F PV ++
Sbjct: 1016 LAIFFVPVFFVVIRRC 1031


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1080  RTXTOXIND     50     1e-08    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 49.8 bits (119), Expect = 1e-08
Identities = 32/166 (19%), Positives = 62/166 (37%), Gaps = 28/166 (16%)

Query: 73 PAAMANIPQPVS-----------------VATATQGEMPIVLSALGTVTPLANV-TVKTQ 114
PA + I PVS + G++ IV +A G +T +K
Sbjct: 43 PAHLELIETPVSRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPI 102

Query: 115 LSGYLQSVAFQEGQLVKKGDLLAQIDPRP-------YQVALETAEGTYARDAALLATARL 167
+ ++ + +EG+ V+KGD+L ++ Q +L A R L + L
Sbjct: 103 ENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIEL 162

Query: 168 DLKRYQTLLSQ---DSIASQTVDTQASLVKQYEGTVKTDQAAIDSA 210
+ L + +++ + V SL+K+ T + + +
Sbjct: 163 NKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELN 208



Score = 35.2 bits (81), Expect = 6e-04
Identities = 19/105 (18%), Positives = 39/105 (37%), Gaps = 6/105 (5%)

Query: 145 QVALETAEGTYARDAALLAT--ARLDLKRYQTLLSQDSIASQTVDTQASLVKQY-EGTVK 201
+ A+ E Y L ++L+ + L +++ T + ++ + + T
Sbjct: 251 KHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDN 310

Query: 202 TDQ--AAIDSAKLNLTYARITAPVSGRV-GLRQVDPGNYVTAGDT 243
+ + + I APVS +V L+ G VT +T
Sbjct: 311 IGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET 355


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1081  NEISSPPORIN   30     0.008    Neisseria sp. porin signature.
		>NEISSPPORIN#Neisseria sp. porin signature.

Length = 348

Score = 30.3 bits (68), Expect = 0.008
Identities = 15/35 (42%), Positives = 22/35 (62%)

Query: 57 SRITATLVSAGFLFQLPDSERFVLTASVLELSHGF 91
S+ T+ LVSAG+L +++ V TAS + L H F
Sbjct: 314 SKRTSALVSAGWLQGGKGADKIVSTASAVVLRHKF 348


57  Bamb_1340  Bamb_1347  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Bamb_1340  0       8        0.728952   phospholipid/glycerol acyltransferase
Bamb_1341  -1      8        0.586578   chorismate synthase
Bamb_1342  -1      9        0.117620   LacI family transcriptional regulator
Bamb_1343  -1      10       -0.542638  ribokinase
Bamb_1344  -1      10       -1.351007  major facilitator superfamily transporter
Bamb_1345  -2      9        -2.107687  dihydrodipicolinate synthetase
Bamb_1346  -1      10       -2.108404  electron-transferring-flavoprotein
Bamb_1347  0       10       -1.085246  short chain dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1340  TCRTETA       30     0.034    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 29.8 bits (67), Expect = 0.034
Identities = 46/261 (17%), Positives = 90/261 (34%), Gaps = 13/261 (4%)

Query: 76 AIFILPFVLFSATSGQIADKYDKATLTRFVKTFEIALMLVGAAGF-VTHSATLLYLCTFM 134
A++ L + G ++D++ + R V +A V A +LY+ +
Sbjct: 50 ALYALMQFACAPVLGALSDRFGR----RPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIV 105

Query: 135 MGMHSTLFGPVKYSYLPQHLGEHELVGGNGLVEMGTFIAILIGTIIGGAAAGIEGSGERV 194
G+ + G V +Y+ E G + ++ G ++GG G
Sbjct: 106 AGI-TGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFF 164

Query: 195 LAVSVVVIALAGRLVAQRVPPTPAPQPDLVINWNPFSETWRNLGLAKQNRTVFLSLLGIS 254
A ++ + +P NP + G+ TV +L+ +
Sbjct: 165 AAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGM-----TVVAALMAVF 219

Query: 255 WL-WFVGATFLTSFFNFAKDVLSASPDVVTVLLATFSV-GIGLGSLLCERLSQRRVEIGL 312
++ VG + F +D + + LA F + +++ ++ R E
Sbjct: 220 FIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRA 279

Query: 313 VPLGSIGISVFAIELYFASRS 333
+ LG I I L FA+R
Sbjct: 280 LMLGMIADGTGYILLAFATRG 300


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1344  TCRTETB       60     6e-12    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 60.3 bits (146), Expect = 6e-12
Identities = 68/409 (16%), Positives = 144/409 (35%), Gaps = 78/409 (19%)

Query: 34 LDRGTLAVASSAIRNDLGLSLSEMGLLLSAFSWSYALCQFPVGGLVDRIGPRRLLGVGLI 93
L+ L V+ I ND + + +AF ++++ G L D++G +RLL G+I
Sbjct: 28 LNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGII 87

Query: 94 VWSLAQASGGIV-STFGWFIVARIVLGIGEAPQFPSAARVVSNWFPLRARGTPTGIFNAA 152
+ G + S F I+AR + G G A VV+ + P RG G+ +
Sbjct: 88 INCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSI 147

Query: 153 SPLGTALAPLLLAVLVASFNWRWAFVA---------------------------TGALGL 185
+G + P + ++ +W + + G + +
Sbjct: 148 VAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILM 207

Query: 186 VVAVVWFALYRDPAR---------------AQLTAAERAYLDADAQTAVAMPKLTFADWR 230
V +V+F L+ + ++D +
Sbjct: 208 SVGIVFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPF-------MI 260

Query: 231 SLFSHGTTWGMLIGFFGSVYLNWVYLTWLPGYLTMERHMSLIRTGFAASVPFLCGFVGSL 290
+ G +G + GF ++ +P + +S G SV G + +
Sbjct: 261 GVLCGGIIFGTVAGF----------VSMVPYMMKDVHQLSTAEIG---SVIIFPGTMSVI 307

Query: 291 VAGWLSDVVTRRSRSPVVSRRNAVVVAMLGM----VAFTIPAALVQSNTV--ALACISVV 344
+ G++ + +V RR + V +G+ V+F + L+++ + + + V+
Sbjct: 308 IFGYIGGI--------LVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVL 359

Query: 345 IFLANAASACSWALATAAAPPSRIASLGAIQNFGGFIGGALAPILTGVI 393
L+ + S ++++ A + + NF F+ + G +
Sbjct: 360 GGLSFTKTVISTIVSSSLKQQEAGAGMSLL-NFTSFLSEGTGIAIVGGL 407


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits      Score   E-value   Comments
Bamb_1345   PF03627   32      0.002     PapG
		>PF03627#PapG

Length = 336

Score = 32.2 bits (73), Expect = 0.002
Identities = 10/20 (50%), Positives = 11/20 (55%)

Query: 136 ARLPSDLPLGLYECPAPYRR 155
LP+DLPLG Y PY
Sbjct: 158 VALPADLPLGDYSVTIPYTS 177


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
Bamb_1347   DHBDHDRGNASE   114     9e-33     2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 114 bits (286), Expect = 9e-33
Identities = 73/257 (28%), Positives = 125/257 (48%), Gaps = 15/257 (5%)

Query: 7 LEGKVALVTGASSGLGQRFAQVLSQAGAKVVLASRRVERLKELRAEIEAAGGAAHVVSLD 66
+EGK+A +TGA+ G+G+ A+ L+ GA + E+L+++ + ++A A D
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 67 VTDVQSIKAAIAHAETEAGTIDILVNNSGVSTMQKLVDVTPADFEFVFDTNTRGAFFVAQ 126
V D +I A E E G IDILVN +GV + ++ ++E F N+ G F ++
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 127 EVAKRMMMRANGNGKPPYRIINIASVAGLRVFPQIGLYAMSKSAVVQMTRAMALEWGRHG 186
V+K MM R +G+ I+ + S + YA SK+A V T+ + LE +
Sbjct: 126 SVSKYMMDRRSGS------IVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYN 179

Query: 187 INVNAICPGYIDTEINHYLWETEQGQ---------KLQSILPRRRVGKPQDLDGLLLLLA 237
I N + PG +T++ LW E G ++ +P +++ KP D+ +L L
Sbjct: 180 IRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLV 239

Query: 238 ADESQFINGSIISADDG 254
+ ++ I + D G
Sbjct: 240 SGQAGHITMHNLCVDGG 256


58   Bamb_1382   Bamb_1405   N   Y   N   Pathogenicity Island (unbiased-composition)
LocusTag    DNBias   CDNBias   %GCBias     Product
Bamb_1382   -2       13        -0.847932   translation initiation factor IF-2
Bamb_1383   -3       7         -0.641961   ribosome-binding factor A
Bamb_1384   -1       10        -0.656612   tRNA pseudouridine synthase B
Bamb_1385   -1       14        -1.908869   EmrB/QacA family drug resistance transporter
Bamb_1386   -1       18        -2.117416   secretion protein HlyD family protein
Bamb_1387   0        17        -2.471412   RND efflux system outer membrane lipoprotein
Bamb_1388   0        17        -3.324332   MarR family transcriptional regulator
Bamb_1389   0        17        -3.404348   GTP-binding protein TypA
Bamb_1390   0        17        -3.625624   2-oxoglutarate dehydrogenase E1 component
Bamb_1391   -2       14        -3.271244   dihydrolipoamide succinyltransferase
Bamb_1392   0        11        -1.937107   dihydrolipoamide dehydrogenase
Bamb_1393   3        15        0.039803    AFG1 family ATPase
Bamb_1394   5        19        0.465172    hypothetical protein
Bamb_1395   4        21        0.092526    hypothetical protein
Bamb_1396   4        21        -0.014524   polypeptide-transport-associated
Bamb_1397   5        27        1.680583    hypothetical protein
Bamb_1398   3        26        -0.225020   hypothetical protein
Bamb_1399   2        28        -1.991283   Flp/Fap pilin component
Bamb_1400   1        24        -1.881645   peptidase A24A, prepilin type IV
Bamb_1401   0        20        -1.137024   TadE family protein
Bamb_1402   1        20        -1.051317   CpaB family Flp pilus assembly protein
Bamb_1403   1        19        -1.489834   type II and III secretion system protein
Bamb_1404   2        20        -1.267941   response regulator receiver protein
Bamb_1405   2        21        -1.814949   type II secretion system protein E
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits        Score   E-value   Comments
Bamb_1382   TCRTETOQM   71      1e-14     Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 71.4 bits (175), Expect = 1e-14
Identities = 66/280 (23%), Positives = 101/280 (36%), Gaps = 82/280 (29%)

Query: 478 VMGHVDHGKTSLLDHIRRAKVAAGEAG------------------GITQHIGAYHVETPR 519
V+ HVD GKT+L + + A E G GIT G +
Sbjct: 8 VLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQWEN 67

Query: 520 GVITFLDTPGHEAFTAMRARGAKATDIVVLVVAADDGVMPQTKEAIAHAKAGGVPIVVAI 579
+ +DTPGH F A R D +L+++A DGV QT+ + G+P + I
Sbjct: 68 TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIPTIFFI 127

Query: 580 NKIDKPEANPDRVKQE----LVAEGVV-----------------PEEYG----------- 607
NKID+ + V Q+ L AE V+ E++
Sbjct: 128 NKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEGNDDLLE 187

Query: 608 ----GDSP-----------------FVPV---SAKTGVGIDDLLENVLLQAEVLELKAPI 643
G S PV SAK +GID+L+E + + +
Sbjct: 188 KYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVI-----TNKFYSST 242

Query: 644 E---APAKGIVIEAKLDKGKGPVATILVQSGTLNRGDVVL 680
+ G V + + + + +A I + SG L+ D V
Sbjct: 243 HRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVR 282


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits      Score   E-value   Comments
Bamb_1385   TCRTETB   133     2e-36     Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 133 bits (337), Expect = 2e-36
Identities = 85/396 (21%), Positives = 158/396 (39%), Gaps = 16/396 (4%)

Query: 27 VFMNVLDTSIANVAIPTISGDLGVSSDQGTWVITSFAVANAISVPLTGWLTDRFGQVRLF 86
F +VL+ + NV++P I+ D WV T+F + +I + G L+D+ G RL
Sbjct: 23 SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLL 82

Query: 87 LASILLFVISSWMCGLAPT-LPFLLASRVLQGAVAGPMIPLSQSLLLSSYPRAKAPMALA 145
L I++ S + + + L+ +R +QGA A L ++ P+ A
Sbjct: 83 LFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFG 142

Query: 146 LWSMTTLIAPVAGPILGGWISDNYSWPWIFYVNIPVGIAAALATWSIYRTRESTVRRAPI 205
L + GP +GG I+ W ++ IP+ I + + ++ +
Sbjct: 143 LIGSIVAMGEGVGPAIGGMIAHYIHWSYLL--LIPM-ITIITVPFLMKLLKKEVRIKGHF 199

Query: 206 DGVGLALLVLWVGSLQIMLDKGKDLDWFASTTIIALALIAVISFAFFVIWELTAEHPVVD 265
D G+ L+ VG + ML F ++ I+ +++V+SF FV P VD
Sbjct: 200 DIKGIILMS--VGIVFFML--------FTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVD 249

Query: 266 LSLFRMRNFTGGTIALAVGYGLYFGNLVLLPLWLQTQIGYTATDAG-LVMAPVGLFAILL 324
L + F G + + +G G + ++P ++ + + G +++ P + I+
Sbjct: 250 PGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIF 309

Query: 325 SPLTGKFLPRTDPRYIATAAFLTFALCFWMRSRYTTGVDEWSLTLPTLVQGIAMAGFFIP 384
+ G + R P Y+ ++ F S + + V G ++
Sbjct: 310 GYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLG-GLSFTKTV 368

Query: 385 LVSITLSGLPGHRIPAASGLSNFVRIMCGGIGTSIF 420
+ +I S L A L NF + G G +I
Sbjct: 369 ISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIV 404


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits        Score   E-value   Comments
Bamb_1386   RTXTOXIND   71      1e-15     Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 71.4 bits (175), Expect = 1e-15
Identities = 44/270 (16%), Positives = 85/270 (31%), Gaps = 28/270 (10%)

Query: 94 ADSQVALQQAEANLAQTVRQVRGLFVNDDQYRAQVALRQSDLSKAQDDLRRRVAVAQTGA 153
+ Q Q E NL + + + ++Y + +S L L + A+A+
Sbjct: 196 STWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSS-LLHKQAIAKHAV 254

Query: 154 VSQE--------EISHARDAVRAAQASVDAAQQQLASNRALTANTTIASHPNVMAAAAKV 205
+ QE E+ + + ++ + +A+++ L N + +
Sbjct: 255 LEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLL 314

Query: 206 RD----AYLANARNVLPAPVTGYVAKRSVQ-VGQRVSPGNPLMSVVPLNAV-WVDANFKE 259
+V+ APV+ V + V G V+ LM +VP + V A +
Sbjct: 315 TLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQN 374

Query: 260 VQLKHMRIGQPVEL--TADIYGSSAVYHGKVVGFSAGTGSAFSLLPAQNATGNWIKVVQR 317
+ + +GQ + A Y GKV + G V+
Sbjct: 375 KDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDA-------IEDQRLGLVFNVIIS 427

Query: 318 LPVRIEIDPKELEKHPLRIGLSMQVDVNIK 347
+ + + M V IK
Sbjct: 428 IEENCLST----GNKNIPLSSGMAVTAEIK 453



Score = 46.7 bits (111), Expect = 9e-08
Identities = 26/161 (16%), Positives = 54/161 (33%), Gaps = 21/161 (13%)

Query: 56 VNGNVVQITPQITGTVIAVKADDTQTVKAGDPLVVLDPADSQVALQQAEANLAQT----- 110
+G +I P V + + ++V+ GD L+ L ++ + +++L Q
Sbjct: 92 HSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQT 151

Query: 111 ----------VRQVRGLFVNDDQYRAQVA----LRQSDLSKAQDDLRRRVAVA--QTGAV 154
+ ++ L + D+ Y V+ LR + L K Q +
Sbjct: 152 RYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDK 211

Query: 155 SQEEISHARDAVRAAQASVDAAQQQLASNRALTANTTIASH 195
+ E + + + +L +L IA H
Sbjct: 212 KRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKH 252


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
Bamb_1388   FLGMOTORFLIM   28      0.027     Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 27.6 bits (61), Expect = 0.027
Identities = 16/55 (29%), Positives = 26/55 (47%), Gaps = 10/55 (18%)

Query: 112 EGRALAERLPPVFRSVLDELLGG----------FTPEEVGFLKSMLRRILSNYCE 156
+G A+ E P + S++D L GG T E ++ ++ RIL+N E
Sbjct: 112 KGNAVLEVDPSITFSIIDRLFGGTGQAAKVQRDLTDIENSVMEGVIVRILANVRE 166


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits        Score   E-value   Comments
Bamb_1389   TCRTETOQM   169     3e-47     Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 169 bits (429), Expect = 3e-47
Identities = 99/435 (22%), Positives = 170/435 (39%), Gaps = 62/435 (14%)

Query: 5 LRNIAIIAHVDHGKTTLVDQLLRQSGTFRENQQIAE--RVMDSNDIEKERGITILAKNCA 62
+ NI ++AHVD GKTTL + LL SG E + + D+ +E++RGITI +
Sbjct: 3 IINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITS 62

Query: 63 VEYEGTHINIVDTPGHADFGGEVERVLSMVDSVLLLVDAVEGPMPQTRFVTKKALALGLK 122
++E T +NI+DTPGH DF EV R LS++D +LL+ A +G QTR + +G+
Sbjct: 63 FQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIP 122

Query: 123 PIVIVNKIDRPGARIDWV-------------INQTFDLFDKLGATE----EQLDFPIV-- 163
I +NKID+ G + V I Q +L+ + T EQ D I
Sbjct: 123 TIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEGN 182

Query: 164 -----------------YASGLNGY---ASLDP-----AAREGDMRPLFEAVLQHVPVRP 198
+ SL P A + L E +
Sbjct: 183 DDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITNKFYSST 242

Query: 199 ADPDAPLQLQITSLDYSTYVGRIGVGRITRGRIKPGQPVAMRFGPEGDVLNRKINQVLSF 258
+ L ++ ++YS R+ R+ G + V R + + KI ++ +
Sbjct: 243 HRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSV--RISEKEKI---KITEMYTS 297

Query: 259 TGLERVQVDSAEAGDIVLINGIEDVGIGATICAVDTPEALPMITVDEPTLTMNFLVNSSP 318
E ++D A +G+IV++ E + + + + I P L +
Sbjct: 298 INGELCKIDKAYSGEIVILQN-EFLKLNSVLGDTKLLPQRERIENPLPLLQTTVEPSKPQ 356

Query: 319 LAGREGKFVTSRQIRDRLMKELNHNVALRVKDTGDETVFEVSGRGELHLTILVENMRRE- 377
+ D L++ V E + +S G++ + + ++ +
Sbjct: 357 QREMLLDALLEISDSDPLLR-------YYVDSATHEII--LSFLGKVQMEVTCALLQEKY 407

Query: 378 GYELAVSRPRVVMQE 392
E+ + P V+ E
Sbjct: 408 HVEIEIKEPTVIYME 422



Score = 34.4 bits (79), Expect = 0.001
Identities = 16/100 (16%), Positives = 32/100 (32%), Gaps = 1/100 (1%)

Query: 387 RVVMQEIDGVRHEPYELLTVDVEDEHQGGVMEELGRRKGEMLDMASDGRGRTRLEYKISA 446
V+++ EPY + E+ + + ++D L +I A
Sbjct: 525 EQVLKKAGTELLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVD-TQLKNNEVILSGEIPA 583

Query: 447 RGLIGFQSEFLTLTRGTGLMSHIFDSYAPVKDGSVFERRN 486
R + ++S+ T G + Y V + R
Sbjct: 584 RCIQEYRSDLTFFTNGRSVCLTELKGYHVTTGEPVCQPRR 623


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits        Score   E-value   Comments
Bamb_1391   RTXTOXIND   32      0.005     Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 31.7 bits (72), Expect = 0.005
Identities = 9/92 (9%), Positives = 29/92 (31%), Gaps = 5/92 (5%)

Query: 48 EVPAPAAGVLAQVLQNDGDTVVADQIIATID---TEAKAGAAEAAAGAAEVKPAAAPAAA 104
E+ ++ +++ +G++V ++ + EA +++ A +
Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARL--EQTRYQI 155

Query: 105 APAAQPAAAVASSSAAASPAASKLLAEKGLSA 136
+ + P + E+ L
Sbjct: 156 LSRSIELNKLPELKLPDEPYFQNVSEEEVLRL 187


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits      Score   E-value   Comments
Bamb_1392   INTIMIN   31      0.015     Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 30.8 bits (69), Expect = 0.015
Identities = 20/81 (24%), Positives = 31/81 (38%), Gaps = 2/81 (2%)

Query: 103 TSGIEFLFKKNKITWLKGHGKFTGKTDAGVQIEVSGE--GETEVVTAKNVIIATGSKARH 160
SG L + T G T K+D Q+ VS + T + A VI +KA
Sbjct: 601 VSGTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANAVIFVDQTKASI 660

Query: 161 LPNVPVDNKIVSDNEGALTFE 181
V++ + A+T+
Sbjct: 661 TEIKADKTTAVANGQDAITYT 681


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits        Score   E-value   Comments
Bamb_1397   RTXTOXINA   29      0.048     Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 28.8 bits (64), Expect = 0.048
Identities = 57/301 (18%), Positives = 107/301 (35%), Gaps = 57/301 (18%)

Query: 140 LGNVVGATS-NTVSGLSSTVKALGTGQLSPLAPVTTPVGTVLDTVANGLTAAGTTIGSTL 198
GN++G + N L L T Q + L + +D + + G S L
Sbjct: 124 AGNILGGGAENIGDNLGKAGGILSTFQ-NFLGTALS--SMKIDELIKKQKSGGNVSSSEL 180

Query: 199 SSGAVQQVTQPLSSAITPLVITAGQVTQQVGTTTGLGQPVSGLLGQVGGAISSAGKQVGS 258
+ +++ + Q L + L +QQ+ T + L + G ++ +
Sbjct: 181 AKASIELINQ-LVDTVASLNNNVNSFSQQLNTLGSVLSNTKHL--------NGVGNKLQN 231

Query: 259 TSNQPLVGDVGQLVTAVGNTVTNAGGLVNPNGPNGAAPIPG--LITSLVGGSTTAVQN-- 314
N +G V+ + + ++ + L N + G L T ++G +
Sbjct: 232 LPNLDNIGAGLDTVSGILSAISASFILSNADADTRTKAAAGVELTTKVLGNVGKGISQYI 291

Query: 315 ---GSSSGSSATNPLGGLLSG---LGSTPLG----------------------------- 339
++ G S + GL++ L +PL
Sbjct: 292 IAQRAAQGLSTSAAAAGLIASAVTLAISPLSFLSIADKFKRANKIEEYSQRFKKLGYDGD 351

Query: 340 SLTGAVGGATGGAGGANPLAPVTGLLNTVTAAVGGAAGSGASSNPLAPVTSLVGGVSGTA 399
SL A TG + L ++ +L +V++ + AA +S APV++LVG V+G
Sbjct: 352 SLLAAFHKETGAIDAS--LTTISTVLASVSSGISAAA---TTSLVGAPVSALVGAVTGII 406

Query: 400 S 400
S
Sbjct: 407 S 407


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits      Score   E-value   Comments
Bamb_1398   cloacin   35      8e-04     Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 35.1 bits (80), Expect = 8e-04
Identities = 34/113 (30%), Positives = 47/113 (41%), Gaps = 8/113 (7%)

Query: 30 GGSGSISKGISGGSGSGGSDSISTSGGGTSGGTSGSTSGGTSGSTSGSTSGSTSGSTSGS 89
G+ S S I+GG G ++ G G S + + GG SGS GS G+ G
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWS--SENNPWGGGSGSGIHWGGGSGHGNGGG- 67

Query: 90 TSGTTSGTSSGTSGTSGVSANPVG---NVLAQGGNVITSLGGTASGLGSTIAN 139
SG SGT G A PV L+ G ++ +A L + IA+
Sbjct: 68 --NGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIAD 118



Score = 31.2 bits (70), Expect = 0.010
Identities = 30/79 (37%), Positives = 37/79 (46%), Gaps = 11/79 (13%)

Query: 39 ISGGSGSG-----GSDSISTSGGGTSGGTSGSTSGGTSGSTS------GSTSGSTSGSTS 87
+SGG G G S S + +GG T G G S G+ S+ GS SG G S
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 88 GSTSGTTSGTSSGTSGTSG 106
G +G +G S G SGT G
Sbjct: 61 GHGNGGGNGNSGGGSGTGG 79


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
Bamb_1400   PREPILNPTASE   43      2e-07     Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 42.9 bits (101), Expect = 2e-07
Identities = 36/142 (25%), Positives = 60/142 (42%), Gaps = 11/142 (7%)

Query: 4 LLSTSIFFAWAALVAAGDIRFRLVRNSLVICGGTAALVSSLIHANPFGISTGQALIGMLV 63
L+ + + D+ L+ + L + L+ +L+ F +S G A+IG +
Sbjct: 134 TLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLL--GGF-VSLGDAVIGAMA 190

Query: 64 GLVSFFP-------LFAMRVMGAADVKVFAVLGAWCGLPILLWLWVIASLAAGVHVLGLM 116
G + + L MG D K+ A LGAW G L + +++SL +GL+
Sbjct: 191 GYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAFMGIGLI 250

Query: 117 LLTRTPLGALWVRGLPAMALAG 138
LL G P +A+AG
Sbjct: 251 LLRNHHQSKPIPFG-PYLAIAG 271


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
Bamb_1403   BCTERIALGSPD   133     5e-36     Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 133 bits (337), Expect = 5e-36
Identities = 60/252 (23%), Positives = 113/252 (44%), Gaps = 11/252 (4%)

Query: 160 RSVVQVDVRVVEFSRSVLKEAGLNFFKQSNGFAFGAFSPGGLQSVTGGA----TSAFAAT 215
R V V+ + E + G+ + ++ G S + + GA ++
Sbjct: 344 RPQVLVEAIIAEVQDADGLNLGIQWANKNAGMTQFTNSGLPISTAIAGANQYNKDGTVSS 403

Query: 216 GGIPIASAFNLVVNSAGRGIFG-NISILEANNLARVLAQPTLVALSGQSASFLAGGEIPV 274
S+FN + +G + ++ L ++ +LA P++V L A+F G E+PV
Sbjct: 404 SLASALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPV 463

Query: 275 PVPQALGSTA-----IDWKQYGVGLTLTPTVLSQHRIALKVAPESSQLDFQHGVTINSVS 329
S ++ K G+ L + P + + L++ E S + T +S
Sbjct: 464 LTGSQTTSGDNIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSVADAASST-SSDL 522

Query: 330 VPAITTRRADTTVELGDGESFVIGGLIDRETMSNISKVPVLGDLPIIGAFFKSLNYQQND 389
TR + V +G GE+ V+GGL+D+ KVP+LGD+P+IGA F+S + + +
Sbjct: 523 GATFNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSK 582

Query: 390 KELVIIVTPHLV 401
+ L++ + P ++
Sbjct: 583 RNLMLFIRPTVI 594


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score   E-value   Comments
Bamb_1404   HTHFIS   40      1e-05     FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 40.2 bits (94), Expect = 1e-05
Identities = 23/119 (19%), Positives = 38/119 (31%), Gaps = 3/119 (2%)

Query: 24 DEHLRW-LRDTLVSAGMVEAVSLEPGALAQRILGLN-PAIVFIDFSRAQAEASAAAAAVR 81
D +R L L AG + A R + +V D A ++
Sbjct: 12 DAAIRTVLNQALSRAGY-DVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLPRIK 70

Query: 82 LAHPSLPVVALGTLAQPESALAALRAGVRDFIDVSGSAEDALRITRGLLEHAGAEPANR 140
A P LPV+ + +A+ A G D++ + + I L P+
Sbjct: 71 KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRRPSKL 129


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits      Score   E-value   Comments
Bamb_1405   cloacin   30      0.031     Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 29.7 bits (66), Expect = 0.031
Identities = 12/25 (48%), Positives = 12/25 (48%)

Query: 430 GGFGGSGGGGGFGGGFGRGGGGFNV 454
G G GG G GGG G GG V
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLSAV 84


59   Bamb_1411   Bamb_1418   N   Y   N   Pathogenicity Island (unbiased-composition)
LocusTag    DNBias   CDNBias   %GCBias     Product
Bamb_1411   -1       11        -1.407901   sigma-54 dependent trancsriptional regulator
Bamb_1412   -2       10        -0.730311   hypothetical protein
Bamb_1413   -2       8         -0.771445   RNA chaperone Hfq
Bamb_1414   -1       7         -0.879398   hypothetical protein
Bamb_1415   -2       6         -0.992871   hypothetical protein
Bamb_1416   0        8         -0.160942   AMP-dependent synthetase and ligase
Bamb_1417   1        8         1.200488    TetR family transcriptional regulator
Bamb_1418   0        8         0.575993    major facilitator superfamily transporter
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score   E-value   Comments
Bamb_1411   HTHFIS   293     1e-96     FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 293 bits (752), Expect = 1e-96
Identities = 134/482 (27%), Positives = 202/482 (41%), Gaps = 67/482 (13%)

Query: 19 ADIVDRVARCMASFDVEVIRADNAEISPER-AALRPSLAIISVTMIE-TGAAFLRDWQA- 75
A I + + ++ +V NA AA L + V M + L +
Sbjct: 13 AAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLPRIKKA 72

Query: 76 NIGMPVVWVGA---------ARDHDASQY---PPDYSHILPLDFTCAELRGMIGKLVTQL 123
+PV+ + A A + A Y P D + ++ + +
Sbjct: 73 RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP------KRRP 126

Query: 124 RAHAAETLQPSELVAHSESMQALLHEVDTFADCDTNVLLHGETGVGKERIAQLLHQKHSR 183
++ LV S +MQ + + D +++ GE+G GKE +A+ LH + +
Sbjct: 127 SKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHD-YGK 185

Query: 184 YRNGEFVPVNCGAIPDGLFESLFFGHAKGSFTGAVVAHKGYFEQAAGGTLFLDEVGDLPL 243
RNG FV +N AIP L ES FGH KG+FTGA G FEQA GGTLFLDE+GD+P+
Sbjct: 186 RRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPM 245

Query: 244 YQQVKLLRVLEDGAVLRVGATSPIKVDFRLVAASNKKLPQLVKDGLFRADLYYRLAVIEL 303
Q +LLRVL+ G VG +PI+ D R+VAA+NK L Q + GLFR DLYYRL V+ L
Sbjct: 246 DAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPL 305

Query: 304 SIPSLEERGAVDKIALFKSFVAEVVGDERLAQLSDLPYWLADAVADS----YFPGNVREL 359
+P L +R D L + FV ++ + + +PGNVREL
Sbjct: 306 RLPPLRDRAE-DIPDLVRHFV------QQAEKEGLDVKRFDQEALELMKAHPWPGNVREL 358

Query: 360 RNLAER-----------------------------VGVTVRQTGGWDAARLQRLVAHARN 390
NL R + A + + + +
Sbjct: 359 ENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFAS 418

Query: 391 SAQPVPVESAAEVFVDRSKWDMNERSRVIAALDANGWRRQDTALQLGISRKVLWEKMRKY 450
+P + + E ++AAL A + A LG++R L +K+R+
Sbjct: 419 FGDALPPSGLYDRVLAEM-----EYPLILAALTATRGNQIKAADLLGLNRNTLRKKIREL 473

Query: 451 QI 452
+
Sbjct: 474 GV 475


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits        Score   E-value   Comments
Bamb_1412   RTXTOXIND   34      4e-04     Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 34.0 bits (78), Expect = 4e-04
Identities = 10/112 (8%), Positives = 31/112 (27%), Gaps = 7/112 (6%)

Query: 124 DETRAEAIYRDFSHQAERLAVNELRAAKLESQKAQTDR-------QIALTQERARRLQAD 176
AEA + + + R L + +
Sbjct: 128 TALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRL 187

Query: 177 ISIAREQQAAVVDRQKSVRNETAALQAQQAELQSQLRALQQQVRSLQREANA 228
S+ +EQ + +++ +A++ + +++ + R + +
Sbjct: 188 TSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDD 239


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits      Score   E-value   Comments
Bamb_1417   HTHTETR   69      4e-16     TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 68.9 bits (168), Expect = 4e-16
Identities = 20/82 (24%), Positives = 33/82 (40%)

Query: 20 PGNRQAGGTKARILDAAEDLFIEHGFEAMSMRQITSRAAVNLAAVNYHFGSKEALIHAML 79
++A T+ ILD A LF + G + S+ +I A V A+ +HF K L +
Sbjct: 4 KTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIW 63

Query: 80 SRRLDQLNQERLGILDRFDAQL 101
+ + L +F
Sbjct: 64 ELSESNIGELELEYQAKFPGDP 85


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits      Score   E-value   Comments
Bamb_1418   TCRTETA   68      1e-14     Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 67.9 bits (166), Expect = 1e-14
Identities = 74/312 (23%), Positives = 125/312 (40%), Gaps = 15/312 (4%)

Query: 11 TIAAYLGWTLDAFDFFLMVFVLKDIAAEFASTIPAVA---FALTLTLAMRPIGALIFGRL 67
I LDA L++ VL + + + A L L M+ A + G L
Sbjct: 7 LIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66

Query: 68 ADRFGRRPTLMVNIACYSLLELASGFAPSLTALLVLRALFGIAMGGEWGVGSALTMETVP 127
+DRFGRRP L+V++A ++ AP L L + R + GI G V A +
Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAYIADITD 125

Query: 128 THARGFVSGLLQAGYPSGYLLASVVFGLLYQYIGWRGMFMVGVLPALLVLYVRAHVPES- 186
R G + A + G + V+ GL+ + F L L L +PES
Sbjct: 126 GDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESH 185

Query: 187 PAWKQMEKRPRPSLGATLQQNWKLTIYAIVLMTAF--NFFSHGTQDLYPTFLREQHHFDP 244
++ +R + A+ + +T+ A ++ F L+ F ++ H+D
Sbjct: 186 KGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDA 245

Query: 245 HTVSW-ITIVLNLGAIVGGLSFGAISERIGRRRAIFVAALIALPVLPLWAF-SSGPVA-- 300
T+ + L ++ + G ++ R+G RRA+ + + L AF + G +A
Sbjct: 246 TTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFP 305

Query: 301 ----LAAGAFLM 308
LA+G M
Sbjct: 306 IMVLLASGGIGM 317


60   Bamb_1428   Bamb_1442   N   Y   N   Pathogenicity Island (unbiased-composition)
LocusTag    DNBias   CDNBias   %GCBias     Product
Bamb_1428   -1       9         -1.562220   chitinase
Bamb_1429   -2       10        0.377358    N-acetylglucosamine-binding protein A
Bamb_1430   -1       13        1.577556    general secretion pathway protein G
Bamb_1431   0        12        1.853439    lytic transglycosylase catalytic subunit
Bamb_1432   0        12        2.491608    type II secretion system protein
Bamb_1433   -1       9         2.363001    type II secretion system protein E
Bamb_1434   -1       10        2.445090    hypothetical protein
Bamb_1435   -1       9         1.319619    hypothetical protein
Bamb_1436   -2       10        1.663175    hypothetical protein
Bamb_1437   -3       10        1.062544    hypothetical protein
Bamb_1438   -3       11        0.808725    type II and III secretion system protein
Bamb_1439   -1       19        -1.323710   general secretion pathway GspG related
Bamb_1440   -1       22        -2.633841   type II secretion system protein G
Bamb_1441   2        25        -4.671966   hypothetical protein
Bamb_1442   -1       11        -3.367680   two component transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits      Score   E-value   Comments
Bamb_1428   cloacin   45      1e-06     Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 45.1 bits (106), Expect = 1e-06
Identities = 24/64 (37%), Positives = 30/64 (46%)

Query: 850 SGWCSGAPTAYEPGKGFAWSDAWTLYGDDEGGNGGNGGNGGGNGGGEGGNGGGNGGGEGG 909
SG +G PT G G + W+ + GG G+G + GG G G G GN GG G
Sbjct: 17 SGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSG 76

Query: 910 NGGG 913
GG
Sbjct: 77 TGGN 80



Score = 32.8 bits (74), Expect = 0.007
Identities = 14/52 (26%), Positives = 19/52 (36%)

Query: 876 GDDEGGNGGNGGNGGGNGGGEGGNGGGNGGGEGGNGGGDHPQYKEGTKYNAG 927
GD G N G G GG G G G G +G ++ + G+
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIH 55



Score = 31.6 bits (71), Expect = 0.015
Identities = 18/61 (29%), Positives = 22/61 (36%), Gaps = 14/61 (22%)

Query: 867 AWSDAWTLYGDDEGGNGGNGGNGG--------------GNGGGEGGNGGGNGGGEGGNGG 912
A S + + G G G G + G G+G GG G GG GN G
Sbjct: 13 AHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSG 72

Query: 913 G 913
G
Sbjct: 73 G 73


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
Bamb_1430   BCTERIALGSPG   128     6e-41     Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 128 bits (322), Expect = 6e-41
Identities = 45/131 (34%), Positives = 70/131 (53%), Gaps = 3/131 (2%)

Query: 10 KRRGQQGFTLLELLVVLLIIALLAGYVGPKLFSQVDKAKVKSTQAQMKTLGDAVTQFRLD 69
Q+GFTLLE++VV++II +LA V P L +KA + + + L +A+ ++LD
Sbjct: 3 ATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLD 62

Query: 70 TGNYPTADEGLDALVVQPQGADG---WNGPYLAKAVPKDGWGRAYQWNVPGRDGEAEIVS 126
+YPT ++GL++LV P +N K +P D WG Y PG G +++S
Sbjct: 63 NHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDLLS 122

Query: 127 LGRDGRVGGSG 137
G DG +G
Sbjct: 123 AGPDGEMGTED 133


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
Bamb_1432   BCTERIALGSPF   198     1e-61     Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 198 bits (504), Expect = 1e-61
Identities = 109/403 (27%), Positives = 196/403 (48%), Gaps = 16/403 (3%)

Query: 1 MQAFRVRVL-ADGKIAQQTIEAASEMEVRARLAEKGGVVLEVRRDKRIGRRRAPKF---- 55
M + + L A GK + T EA S + R L E+G V L V ++ ++
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 56 --------ALALFLQELSTLLDAGLVLFEALEALRDKADSGKDAKYVIDRLLAVMVEGQP 107
LAL ++L+TL+ A + L EAL+A+ +++ ++ ++ + + ++EG
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQ-LMAAVRSKVMEGHS 119

Query: 108 LSKALARQPAIFPPLLVATVESSEGSGQLPVVLKRYQQYEVRIEQVRKRVTGALIYPAVV 167
L+ A+ P F L A V + E SG L VL R Y + +Q+R R+ A+IYP V+
Sbjct: 120 LADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVL 179

Query: 168 IGVGIAILLFMAFFVIPRFAVVFESL-ATLPPTAHAMLWWANLLRENGMAVGLAVAASFV 226
V IA++ + V+P+ F + LP + ++ ++ +R G + LA+ A F+
Sbjct: 180 TVVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFM 239

Query: 227 GAGLAIRSAAFKRAAQRLMWRAPKVRDVCALFALTRFYRTVGLLIAGGTPVIVALELSGK 286
+ +R + + R + P + + R+ RT+ +L A P++ A+ +SG
Sbjct: 240 AFRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGD 299

Query: 287 VLP-EHFRARLAAALIDMRAGRPVAAVLAAHALTTSVAERLLRVGEQSGDLGGMCEHIAQ 345
V+ ++ R RL+ A +R G + L AL + ++ GE+SG+L M E A
Sbjct: 300 VMSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAAD 359

Query: 346 FHDGALDRAIEMLSKVFEPVLMLAVGATVGAVVLLLYMPIFEL 388
D + + +FEP+L++++ A V +VL + PI +L
Sbjct: 360 NQDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQL 402


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
Bamb_1435   CHANLCOLICIN   29      0.010     Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 28.9 bits (64), Expect = 0.010
Identities = 19/104 (18%), Positives = 45/104 (43%), Gaps = 4/104 (3%)

Query: 42 DQLRDEVDGIESRIDERQRAIERERRRQKTMTPEQRRIERVLAEQRMSEKGSGLSVVDWI 101
+Q R E++ ++ + + + E E +R ++ E + +E +A++++S S + +D
Sbjct: 154 EQRRKEIEREKAETERQLKLAEAEEKRLAALSEEAKAVE--IAQKKLSAAQSEVVKMD-- 209

Query: 102 EQAWTPQIALKSLTVDKAGREARIEGGAAELSHIYIFVDRLNDR 145
+ T L S + + G EL+ L++
Sbjct: 210 GEIKTLNSRLSSSIHARDAEMKTLAGKRNELAQASAKYKELDEL 253


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score   E-value   Comments
Bamb_1437   TONBPROTEIN   27      0.028     Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 27.3 bits (60), Expect = 0.028
Identities = 10/38 (26%), Positives = 11/38 (28%)

Query: 45 ASLGAEPANDSPPPPVAEPEPEPPSLDEPIDAQPVAEP 82
EP PPP EPEP P +
Sbjct: 51 TPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVV 88



Score = 26.5 bits (58), Expect = 0.050
Identities = 15/34 (44%), Positives = 16/34 (47%)

Query: 50 EPANDSPPPPVAEPEPEPPSLDEPIDAQPVAEPP 83
A PP PV EPEPEP + EP PV
Sbjct: 58 PQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEK 91


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
Bamb_1438   BCTERIALGSPD   165     9e-45     Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 165 bits (418), Expect = 9e-45
Identities = 79/379 (20%), Positives = 161/379 (42%), Gaps = 41/379 (10%)

Query: 266 RTFFLSHADAKSVMAALRQM---------------IKPKDVYV--DERVNAVVMRDTPET 308
+ +L +A A ++ L + K++ + + NA+++ P+
Sbjct: 270 KVIYLKYAKASDLVEVLTGISSTMQSEKQAAKPVAALDKNIIIKAHGQTNALIVTAAPDV 329

Query: 309 IQVAERVVMGLDIPQSEVTLDVQVLEVNMNDSLDLGVQ----YPGKIQFNALGGVEGGAL 364
+ ERV+ LDI + +V ++ + EV D L+LG+Q G QF G A+
Sbjct: 330 MNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWANKNAGMTQFTNSGLPISTAI 389

Query: 365 TLGDLLRLNR-------------DRVGVSSESGGLALAIDLLQKQGKTKTLANPKIRVRN 411
+ + + + G A+ + L K LA P I +
Sbjct: 390 AGANQYNKDGTVSSSLASALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLD 449

Query: 412 MEKANIKIGERVPIVT--TTNANGVVTESVSYQDVGLMLKVEPRISLNEEVSVKVNMEVS 469
+A +G+ VP++T T + + +V + VG+ LKV+P+I+ + V +++ EVS
Sbjct: 450 NMEATFNVGQEVPVLTGSQTTSGDNIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVS 509

Query: 470 SILTKETTKTGLVAYSLGTRNAETLMTAKNGETQILAGLVKRNESDSVAGLPGVSGLPVL 529
S+ ++ + + + TR + +GET ++ GL+ ++ SD+ +P + +PV+
Sbjct: 510 SVADAASSTSSDLGATFNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVI 569

Query: 530 GRLFGSNGRSNERSEIVLLITPHIERNLDLPMSAVTTFLSGTESRVTTEALTLESAQAVP 589
G LF S + + ++L I P + R+ D S + +A + + +
Sbjct: 570 GALFRSTSKKVSKRNLMLFIRPTVIRDRD-----EYRQASSGQYTAFNDAQSKQRGKENN 624

Query: 590 ARLPSSDGPDLDAPIEPAT 608
+ + D ++ + A
Sbjct: 625 DAMLNQDLLEIYPRQDTAA 643



Score = 32.6 bits (74), Expect = 0.008
Identities = 27/165 (16%), Positives = 49/165 (29%), Gaps = 20/165 (12%)

Query: 180 SLNFKQQPLANIFDVISRVSGVNFVFDRDVDTSHAATLF-AERTTAEDAINLL---LRTN 235
S +FK + + +S+ + D V + T+ + E L
Sbjct: 31 SASFKGTDIQEFINTVSKNLNKTVIIDPSVRGT--ITVRSYDMLNEEQYYQFFLSVLDVY 88

Query: 236 QLEKKVLDRHTLLVYPSQPEKARNY-----------TEFAIRTFFLSHADAKSVMAALRQ 284
++ L V S+ K E R L++ A+ + LRQ
Sbjct: 89 GFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGIGDEVVTRVVPLTNVAARDLAPLLRQ 148

Query: 285 MI---KPKDVYVDERVNAVVMRDTPETIQVAERVVMGLDIPQSEV 326
+ V E N ++M I+ +V +D
Sbjct: 149 LNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDNAGDRS 193


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits           Score   E-value   Comments
Bamb_1439   BCTERIALGSPG   54      4e-12     Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 54.1 bits (130), Expect = 4e-12
Identities = 31/113 (27%), Positives = 55/113 (48%), Gaps = 15/113 (13%)

Query: 1 MNGRRRARGFTLIELMVAMSLLALLATVALPLTDLVKRRSDEAELRRVLVVIRSALDAYK 60
M + RGFTL+E+MV + ++ +LA++ +P K ++D+ + +V + +ALD YK
Sbjct: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYK 60

Query: 61 AAADDGRIERSVDASGYP---PDLRALVDGVEDKKSPDGAR-LYFLRKLPADP 109
+D YP L +LV+ ++++LPADP
Sbjct: 61 -----------LDNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADP 102


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1440  BCTERIALGSPG  73     1e-19    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 73.0 bits (179), Expect = 1e-19
Identities = 37/130 (28%), Positives = 58/130 (44%), Gaps = 20/130 (15%)

Query: 9 RRARRAAGFTLIELLVVMAIIAALTAFVAPGYLKQSDRAKETVLRHNLNTLRQSIDDYRA 68
R + GFTL+E++VV+ II L + V P + ++A + ++ L ++D Y+
Sbjct: 2 RATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKL 61

Query: 69 DHGRDPDT---LDALVEK-------------RYLRELPLDPLTGKRGSWQPQAGETGGIA 112
D+ P T L++LVE Y++ LP DP P GE G
Sbjct: 62 DNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNP--GEHGAY- 118

Query: 113 DVKS-GAKGR 121
D+ S G G
Sbjct: 119 DLLSAGPDGE 128


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Bamb_1442  HTHFIS  59     2e-12    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 58.7 bits (142), Expect = 2e-12
Identities = 21/94 (22%), Positives = 47/94 (50%), Gaps = 1/94 (1%)

Query: 20 LDKAGYDTQVRHDGDSFIELVRTQRVDVLLLDWDVPGKSGIEVMRWARESFADALPIIMM 79
L +AGYD ++ + + + D+++ D +P ++ +++ +++ D LP+++M
Sbjct: 23 LSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLPRIKKARPD-LPVLVM 81

Query: 80 TQHDGENDIVFGLNSGADDYLIKPLRERELVARV 113
+ + + GA DYL KP EL+ +
Sbjct: 82 SAQNTFMTAIKASEKGAYDYLPKPFDLTELIGII 115


61  Bamb_1519  Bamb_1524  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Bamb_1519  0       9        1.553779   TetR family transcriptional regulator
Bamb_1520  0       10       1.414028   periplasmic multidrug efflux lipoprotein
Bamb_1521  0       11       0.491922   multidrug efflux protein
Bamb_1522  2       10       -0.103630  RND efflux system outer membrane lipoprotein
Bamb_1523  2       11       -1.518535  fimbrial protein
Bamb_1524  2       13       -0.845205  fimbrial biogenesis outer membrane usher
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Bamb_1519  HTHTETR  107    3e-31    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 107 bits (269), Expect = 3e-31
Identities = 45/177 (25%), Positives = 90/177 (50%), Gaps = 3/177 (1%)

Query: 1 MARKTREESLAIKHRILDAAELVLLEKGVAQTAMADLAEAAGMSRGAVYGHYRNKMEVCL 60
MARKT++E+ + ILD A + ++GV+ T++ ++A+AAG++RGA+Y H+++K ++
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 61 ALCDRAFARTSEGFDAVDSLPA---FATLRRAASHYLQQCGEPGSMQRVLVILYTKCEQS 117
+ + + + E + + LR H L+ + ++ I++ KCE
Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120

Query: 118 EENGALQRRRMLLELQMLRITKALLRRAIAAGEIAADLDVHLAAVHLVSLLEGVFAS 174
E +Q+ + L L+ + L+ I A + ADL AA+ + + G+ +
Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMEN 177


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Bamb_1520  RTXTOXIND  41     6e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 41.0 bits (96), Expect = 6e-06
Identities = 19/132 (14%), Positives = 38/132 (28%), Gaps = 5/132 (3%)

Query: 70 VRARVAGIVTARTYEEGQEVKQGAVLFRIDPAPLKAARDAAQGALAKAQAAAL---AASD 126
++ IV +EG+ V++G VL ++ +A Q +L +A+ S
Sbjct: 99 IKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSR 158

Query: 127 KRRRYDDLVRDHAVSERDHTEAVADDTRAKADVASAKAELAR--AQLQLDYATVTAPISG 184
+ + R + + + Q +L+ A
Sbjct: 159 SIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLT 218

Query: 185 RARRALVTEGAL 196
R E
Sbjct: 219 VLARINRYENLS 230



Score = 34.4 bits (79), Expect = 8e-04
Identities = 14/101 (13%), Positives = 40/101 (39%), Gaps = 10/101 (9%)

Query: 103 LKAARDAAQGALAKAQAAALAASDKRRRYDDLVRDHAVSERDHTEAVADDTRAKADVASA 162
+ L + ++ L+A ++ + L ++ + + + ++
Sbjct: 264 AVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKL---------RQTTDNIGLL 314

Query: 163 KAELARAQLQLDYATVTAPISGR-ARRALVTEGALVGQDQA 202
ELA+ + + + + AP+S + + + TEG +V +
Sbjct: 315 TLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET 355


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1521  ACRIFLAVINRP  1082   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1082 bits (2799), Expect = 0.0
Identities = 525/1032 (50%), Positives = 714/1032 (69%), Gaps = 6/1032 (0%)

Query: 1 MARFFIDRPVFAWVIALFIMLGGAFAIRALPVAQYPDIAPPVVSIYATYPGASAQVVEES 60
MA FFI RP+FAWV+A+ +M+ GA AI LPVAQYP IAPP VS+ A YPGA AQ V+++
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTALIEREMNGAPGLLYTSATS-SAGAASLYLTFKQGVNADLAAVEVQNRLKTVEARLPE 119
VT +IE+ MNG L+Y S+TS SAG+ ++ LTF+ G + D+A V+VQN+L+ LP+
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 120 PVRRDGIQVEKAADNIQLVVSLTSDDGRMSAVQLGEYASANVVQALRRVDGVGKVQFWGA 179
V++ GI VEK++ + +V SD+ + + +Y ++NV L R++GVG VQ +GA
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 180 EYAMRIWPDPVKLAGHGLTASDIASAVRAHNARVTVGDIGRSAVPDSAPIAATVFADAPL 239
+YAMRIW D L + LT D+ + ++ N ++ G +G + + A++ A
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 240 KTPADFGAIALRAQPDGSALHLRDVARIEFGGNDYNYPSYVNGKVATGMGIKLAPGSNAV 299
K P +FG + LR DGS + L+DVAR+E GG +YN + +NGK A G+GIKLA G+NA+
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 300 STEKRVRATMDELSAYFPPGVKYQIPYETSSFVRVSMNKVVTTLIEAGVLVFLVMFLFMQ 359
T K ++A + EL +FP G+K PY+T+ FV++S+++VV TL EA +LVFLVM+LF+Q
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 360 NLRATLIPTLVVPVALAGTFGVMYAAGFSINVLTMFGMVLAIGILVDDAIVVVENVERLM 419
N+RATLIPT+ VPV L GTF ++ A G+SIN LTMFGMVLAIG+LVDDAIVVVENVER+M
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 420 VEEGLAPYDATVKAMRQISGAIVGITVVLTSVFVPMAFFGGAVGNIYRQFALSLAVSIGF 479
+E+ L P +AT K+M QI GA+VGI +VL++VF+PMAFFGG+ G IYRQF++++ ++
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 480 SAFLALSLTPALCATLLKPVSGDHHE-KRGFFGWFNGFVARSTQRYATRVGAMLKKPLRW 538
S +AL LTPALCATLLKPVS +HHE K GFFGWFN S Y VG +L R+
Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540

Query: 539 LVVYGALTAAAALMLTQLPSAFLPDEDQGNFMVMVIRPQGTPLAETMQSVREVESYIRRD 598
L++Y + A ++ +LPS+FLP+EDQG F+ M+ P G T + + +V Y ++
Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600

Query: 599 EPAAY--TFALGGFNLYGEGPNGGMIFVTLKNWKERKAARDHVQAIVARINERFAGTANT 656
E A F + GF+ G+ N GM FV+LK W+ER + +A++ R +
Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660

Query: 657 TVFAMNSPALPDLGSTSGFDFRLQNRGGLDYAAFSAAREQLLAAGGKDPA-LTDVMFAGT 715
V N PA+ +LG+ +GFDF L ++ GL + A + AR QLL + PA L V G
Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720

Query: 716 QDAPQLKLDIDRAKASALGVSMDEINTTLAVMFGSDYIGDFMHGTQVRRVIVQADGLHRL 775
+D Q KL++D+ KA ALGVS+ +IN T++ G Y+ DF+ +V+++ VQAD R+
Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780

Query: 776 DPADVQKLRVRNAGGEMVPLAAFATLHWTLGPPQLTRYNGFPSFTINGSAAPGHSSGEAM 835
P DV KL VR+A GEMVP +AF T HW G P+L RYNG PS I G AAPG SSG+AM
Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840

Query: 836 AAIERLAAKLPAGIGHAWSGQSFEERLSGAQAPMLFALSVLVVFLALAALYESWSIPFAV 895
A +E LA+KLPAGIG+ W+G S++ERLSG QAP L A+S +VVFL LAALYESWSIP +V
Sbjct: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900

Query: 896 MLVVPLGVIGAVLGVTLRAMPNDIYFKVGLIATIGLSAKNAILIVEVAKDLVAQR-MSLV 954
MLVVPLG++G +L TL ND+YF VGL+ TIGLSAKNAILIVE AKDL+ + +V
Sbjct: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960

Query: 955 DAALEAARLRLRPIVMTSLAFGVGVLPLAFASGAASGAQMAIGTGVLGGMITATVLAVFL 1014
+A L A R+RLRPI+MTSLAF +GVLPLA ++GA SGAQ A+G GV+GGM++AT+LA+F
Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020

Query: 1015 VPLFFVIVGRLF 1026
VP+FFV++ R F
Sbjct: 1021 VPVFFVVIRRCF 1032


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Bamb_1524  PF00577  679    0.0      Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 679 bits (1753), Expect = 0.0
Identities = 241/865 (27%), Positives = 364/865 (42%), Gaps = 65/865 (7%)

Query: 2 RIRHSFLCVSVLVVGSPSHATEFNSSFLDIDGTSNVDLSQFSQPDFTLPGEYMLDVQVND 61
+R C S FN FL D + DLS+F PG Y +D+ +N+
Sbjct: 27 FVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDIYLNN 86

Query: 62 LFYGLQAIQFIALDASGAGKPCLPPELVARFGLKPSLAKDLPRLQGGRCVDLG-AIEGAT 120
+ + + F D+ PCL +A GL + + L CV L I AT
Sbjct: 87 GYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPLTSMIHDAT 146

Query: 121 VRYLKSDGRLKITVPQAALEFTDSTYLPPSSWSDGIAGAMLDYRVIANTNRNFGSGGGQT 180
+ RL +T+PQA + Y+PP W GI +L+Y N+ +N GG +
Sbjct: 147 AQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQN--RIGGNS 204

Query: 181 NSIQAYGTIGANWDAWRFRGDYQAQSNVGNTAYADRT-FRFSRLYAFRALPSIQSTVTFG 239
+ G N AWR R + N +++ + ++ + R + ++S +T G
Sbjct: 205 HYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLTLG 264

Query: 240 DDYLSSDIFDTFALTGASIRSDDRMLPPSLRGYAPLISGVARTNATVTVSQVGRVLYVTR 299
D Y DIFD GA + SDD MLP S RG+AP+I G+AR A VT+ Q G +Y +
Sbjct: 265 DGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYNST 324

Query: 300 VSPGAFALQNIN-TSVQGTLDVTVDEEDGSVQRFQVTTAAVPFLARTGQLRYKAAVGKPR 358
V PG F + +I G L VT+ E DGS Q F V ++VP L R G RY G+ R
Sbjct: 325 VPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGEYR 384

Query: 359 QFGGAGVTPFFGFGEVAYGLPFDVTLYGGFIAASGYTSIALGVGRDFGTFGAVSADVTHA 418
P F + +GLP T+YGG A Y + G+G++ G GA+S D+T A
Sbjct: 385 SGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMTQA 444

Query: 419 RAHLWWNGATRNGNSYRINYSKHFDGLDADVRFFGYRFSEREYTNFAQFSGDPTAYGL-- 476
+ L + + +G S R Y+K + +++ GYR+S Y NFA +
Sbjct: 445 NSTL-PDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNIE 503

Query: 477 -------------------ANSKQRYSATMSKRFGDTST-YFSYDQTTYW-ARASEQRVG 515
N + + T++++ G TST Y S TYW +++
Sbjct: 504 TQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSNVDEQFQ 563

Query: 516 LTLTRAFSIGALRNLNVSVSAFRTQSAGASGNQFSVTATLPIGGRHTVTSNLTTGSGSTS 575
L AF ++N ++S T++A G + + I H + S+ + S
Sbjct: 564 AGLNTAF-----EDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHAS 618

Query: 576 ANAGYIYDDPAGRT----------------YQINAGATDGRASANASFRQRTSTYQ---- 615
A+ +D T Y + G G + S T Y+
Sbjct: 619 ASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYG 678

Query: 616 -LSAQASTLANAYAAASLEVDGSLVATQYGVSAHANGNAGDTRLLVSTDGVPDVPLS-GT 673
+ S ++ V G ++A GV+ DT +LV G D + T
Sbjct: 679 NANIGYSH-SDDIKQLYYGVSGGVLAHANGVTLGQPL--NDTVVLVKAPGAKDAKVENQT 735

Query: 674 LTHTDSRGYAVLDGISPYNVFDATVNVEKLPLEVQVSNPIQRMVLTDGAIGFVKFSAARG 733
TD RGYAVL + Y ++ L V + N + +V T GAI +F A G
Sbjct: 736 GVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVG 795

Query: 734 SNLYLTLTDAAGKPLPFGASVQDAANGKELGIVGEAGAAFLTQVQPKSALVVRAGERT-- 791
L +TLT KPLPFGA V + + GIV + G +L+ + + V+ GE
Sbjct: 796 IKLLMTLTH-NNKPLPFGAMVTS-ESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENA 853

Query: 792 LCAVN-ALPNQLQLEG-TPIPVTCQ 814
C N LP + Q + T + C+
Sbjct: 854 HCVANYQLPPESQQQLLTQLSAECR 878


62  Bamb_1874  Bamb_1881  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Bamb_1874  0       24       -2.479313  peptidase S14, ClpP
Bamb_1875  1       23       -3.487409  lambda family phage portal protein
Bamb_1876  0       25       -4.188482  hypothetical protein
Bamb_1877  1       26       -4.523568  phage terminase GpA
Bamb_1878  0       29       -4.993344  hypothetical protein
Bamb_1879  1       28       -4.770295  hypothetical protein
Bamb_1880  2       27       -4.740935  virulence-associated E family protein
Bamb_1881  6       51       -9.542896  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
Bamb_1874  TONBPROTEIN  33     0.002    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 32.7 bits (74), Expect = 0.002
Identities = 12/37 (32%), Positives = 14/37 (37%)

Query: 205 VRALVDETEPTDPPQPDTQPNTPPEPNPDPQPVPPAP 241
V E P P+ PEP P P+P AP
Sbjct: 50 VTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAP 86



Score = 30.3 bits (68), Expect = 0.010
Identities = 8/26 (30%), Positives = 10/26 (38%)

Query: 217 PPQPDTQPNTPPEPNPDPQPVPPAPD 242
P QP P P+P+P P
Sbjct: 56 EPPQAVQPPPEPVVEPEPEPEPIPEP 81


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1875  PHPHTRNFRASE  29     0.038    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 29.4 bits (66), Expect = 0.038
Identities = 11/30 (36%), Positives = 16/30 (53%)

Query: 351 DLRDVSDRVLRVLLNEFRRSIEQLQQNVFI 380
D+RDVS RVL L+ S+ + + I
Sbjct: 130 DIRDVSKRVLGHLIGVETGSLATIAEETVI 159


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
Bamb_1877  TONBPROTEIN  29     0.049    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 29.2 bits (65), Expect = 0.049
Identities = 15/66 (22%), Positives = 22/66 (33%), Gaps = 7/66 (10%)

Query: 577 LMTEAHWRVEQVRISQVSLFDAVPILESLPSALPVETLPTVQTDADPPPPIEPVQPVAKP 636
L T H +E +Q PI ++ + +E VQ +P EP
Sbjct: 28 LYTSVHQVIELPAPAQ-------PISVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPE 80

Query: 637 PETPPP 642
P P
Sbjct: 81 PPKEAP 86


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Bamb_1880  PF05272  563    0.0      Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 563 bits (1452), Expect = 0.0
Identities = 141/468 (30%), Positives = 207/468 (44%), Gaps = 33/468 (7%)

Query: 399 ETPTKPAATSAAAKQP--EWDGREAENGAHTWEQD----LARSDKGTLLPTLGNVHMILS 452
E P K ++ A P G + E+ W D L + L P + L
Sbjct: 401 EPPKKRDPSAGAGTDPGGPGGGDDGEDPFGEWLDDEVARLRLRGRWLLKPRRAALIEALR 460

Query: 453 NHKAWQGVIEQDDFGGRVMKRKAPPFPQGVTGEWTDMDDQRCALWLSQRYG-LSVRTDIV 511
+ A G + D+ + + +A P+ + G D D R A ++ YG
Sbjct: 461 SAPALAGCVAFDELREQPVAVRAFPWRKA-PGPLEDADVLRLADYVETTYGTGEASAQTT 519

Query: 512 MNAVLLVADATHFHDVREYLEGLKWDGVPRVRSMPSTYLRVADS-------EYVQLAFMK 564
A+ + AD H R++++ +WD VPR+ L Y+QL
Sbjct: 520 EQAINVAADMNRVHPFRDWVKAQQWDEVPRLEKWLVHVLGKTPDDYKPRRLRYLQLVGKY 579

Query: 565 WMIAAVARVMEPGCKVDNVLILEGKQGHRKSTALKVLAGAPWFTDTPIQIG-NKDTYAVL 623
++ VARVMEPGCK D ++LEG G KST + L G +F+DT IG KD+Y +
Sbjct: 580 ILMGHVARVMEPGCKFDYSVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQI 639

Query: 624 AGKWVIELAELDSLNKADSSAVKSFFATAVDRFRNFYGKRATDVPRQCVFAGSVNFDTYL 683
AG EL+E+ + +AD+ AVK+FF++ DR+R YG+ D PRQ V + N YL
Sbjct: 640 AGIVAYELSEMTAFRRADAEAVKAFFSSRKDRYRGAYGRYVQDHPRQVVIWCTTNKRQYL 699

Query: 684 KDESGNRRYWPLRVGGLVDIDGIVAVRDQLWAEAVHLYRTGVVWHVE-EHERPLFEIEQA 742
D +GNRR+WP+ V G ++ + R QL+AEA+HLY G + E E F EQ
Sbjct: 700 FDITGNRRFWPVLVPGRANLVWLQKFRGQLFAEALHLYLAGERYFPSPEDEEIYFRPEQE 759

Query: 743 ERYEGDVYEDKI--------AKALE------FVSRTTMEEI--LADILKLDTSKWTLAEQ 786
R + ++ A A E + TT I L L D K + +
Sbjct: 760 LRLVETGVQGRLWALLTREGAPAAEGAAQKGYSVNTTFVTIADLVQALGADPGKSSPMLE 819

Query: 787 RRIGKALKSLGWVRKRESTGSRGWYYVSEQQEPEVERELVAAGDDDSP 834
++ L GW RE++G R Y+ Q P V E A +P
Sbjct: 820 GQVRDWLNENGWEYLRETSGQRRRGYMRPQVWPPVIAEDKEADQAHAP 867


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_1881  CHANLCOLICIN  28     0.014    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 27.7 bits (61), Expect = 0.014
Identities = 15/27 (55%), Positives = 18/27 (66%)

Query: 79 ERAARACAALATKAERHAFRDQLTDRL 105
E+AARA AA +A+ A RD LT RL
Sbjct: 68 EQAARAKAAAEAQAKAKANRDALTQRL 94


63  Bamb_2095  Bamb_2102  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Bamb_2095  2       9        3.069613   hypothetical protein
Bamb_2096  4       8        2.960237   acriflavin resistance protein
Bamb_2097  4       10       3.002161   RND family efflux transporter MFP subunit
Bamb_2098  3       11       2.444523   RND efflux system outer membrane lipoprotein
Bamb_2099  4       10       0.921464   hypothetical protein
Bamb_2100  3       10       0.492743   two component transcriptional regulator
Bamb_2101  4       12       -1.672933  sensor signal transduction histidine kinase
Bamb_2102  1       12       -3.198446  two component transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
Bamb_2095  TONBPROTEIN  28     0.006    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 28.4 bits (63), Expect = 0.006
Identities = 13/44 (29%), Positives = 14/44 (31%)

Query: 34 VPAPVYVAPAPVYAPPPPPVVYQPAPVYAPAPVYAPAPVYAPAP 77
V P P P P P + APV P P P P
Sbjct: 61 VQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVK 104



Score = 27.6 bits (61), Expect = 0.010
Identities = 12/43 (27%), Positives = 12/43 (27%)

Query: 35 PAPVYVAPAPVYAPPPPPVVYQPAPVYAPAPVYAPAPVYAPAP 77
P V P P P P APV P P P
Sbjct: 56 EPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKP 98


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_2096  ACRIFLAVINRP  624    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 624 bits (1610), Expect = 0.0
Identities = 241/1069 (22%), Positives = 427/1069 (39%), Gaps = 57/1069 (5%)

Query: 4 LVRLALARPYTFIVLALLILIAGPLAALRTPTDIFPDIRIPVISVVWNYAGLQPADMAGR 63
+ + RP VLA+++++AG LA L+ P +P I P +SV NY G +
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 64 IVTYYERTLGTTVNDVAHIESQSFRSYGI-VKIFFQPSVDIRTATAQVTSISQTVLKQMP 122
+ E+ + ++++ ++ S S + + + + FQ D A QV + Q +P
Sbjct: 61 VTQVIEQNM-NGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLP 119

Query: 123 PGTTPPQILNYNASTVPVLQLALTSDTLNEQQ--LGDYATNVIRPQLLSVAGVAIPSPYG 180
I +S+ ++ SD Q + DY + ++ L + GV +G
Sbjct: 120 QEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG 179

Query: 181 GKVRQVQIDLDPQALQAKGLSAQDVATALAQQNQIIPAGT------QKIGRFEYNIRLND 234
+ ++I LD L L+ DV L QN I AG + +I
Sbjct: 180 AQ-YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQT 238

Query: 235 SPLTIDQLNALPIRTV-NGAVIFMRDVAHVRDGFPPQGNIVRVDGRRAVLMSVLKSGSAS 293
++ + +R +G+V+ ++DVA V G I R++G+ A + + + A+
Sbjct: 239 RFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGAN 298

Query: 294 TLDIIAGVKAQLPRIEATLPPSLRLVVMGDQSVFVKGAVSGVAREGLIAAALTSAMILLF 353
LD +KA+L ++ P ++++ D + FV+ ++ V + A L ++ LF
Sbjct: 299 ALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLF 358

Query: 354 LGSWRSTLIIAASIPLAVLAAIAALAAAGETLNVMTLGGLALAVGILVDDATVTIENV-N 412
L + R+TLI ++P+ +L A LAA G ++N +T+ G+ LA+G+LVDDA V +ENV
Sbjct: 359 LQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVER 418

Query: 413 WHLEQGKDVRSAILDGASQIVAPAFVSLLCICIVFVPMLLLDGVARFLFVPMAEAVIFAM 472
+E + A SQI + + VF+PM G ++ + ++ AM
Sbjct: 419 VMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAM 478

Query: 473 IASFVLSRTFVPMMARYLLRPHAAHPAAVLAPHGAPFPTPRSRNPLVAFQQGFERRFAAL 532
S +++ P + LL+P +A F F F
Sbjct: 479 ALSVLVALILTPALCATLLKPVSAEH----------------HENKGGFFGWFNTTFDHS 522

Query: 533 RTGYRAVLGLALAHRARFVVLFLTAVALSFALVPGLGRNFFPSVDAGEIALHVRAPIGTR 592
Y +G L R+++++ VA L L +F P D G ++ P G
Sbjct: 523 VNHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGAT 582

Query: 593 IEETAALFDRVERTVRGVVPPRALASIVDNMGLPNSGINLTYSNSGTIGPQDGDILVSLT 652
E T + D+V L + N+ + ++S G VSL
Sbjct: 583 QERTQKVLDQVTD--------YYLKNEKANVESVFTVNGFSFSGQ---AQNAGMAFVSLK 631

Query: 653 GEH-----APTADYV-KQLRTVLPRAFPGVTFSFLPADIVSQILNFGAPAPIDVQVTGPD 706
+A+ V + + L + G F IV + G
Sbjct: 632 PWEERNGDENSAEAVIHRAKMELGKIRDGFVIPFNMPAIVELG-TATGFDFELIDQAGLG 690

Query: 707 LAANRAYATELLRRIRTVPG-VADARVQQASTYPQFTVSVDRALAAQLGITEQDVTNAVV 765
A +LL P + R QF + VD+ A LG++ D+ +
Sbjct: 691 HDALTQARNQLLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTIS 750

Query: 766 ASLSGTSQVSPTYWLDPHNGVSYPIVAQTPQYRMTSLSDLRALPVTGRSGAPQLLGGLAT 825
+L G + V+ G + Q D+ L V +G T
Sbjct: 751 TALGG-TYVNDFI----DRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTT 805

Query: 826 IVRSQTDAVVSHYDIAPLDDIFATTQDRDLGAVSADIARVLHASAADLPKGSRVTVRGQV 885
+ Y+ P +I G S D ++ A+ LP G G
Sbjct: 806 SHWVYGSPRLERYNGLPSMEIQGEAAP---GTSSGDAMALMENLASKLPAGIGYDWTGMS 862

Query: 886 QTMSSAFGGLLAGLAGAVLLIYLLIVVNFHSWRDAFVIVSALPAALAGIVWMLFVTRTPL 945
+ A +A + ++++L + + SW ++ +P + G++ +
Sbjct: 863 YQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKN 922

Query: 946 SVPALTGAILCMGVATANSILVVTFARERLAH-TADATVAALEAGFTRFRPVMMTALAMI 1004
V + G + +G++ N+IL+V FA++ + A L A R RP++MT+LA I
Sbjct: 923 DVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFI 982

Query: 1005 IGMAPMALGLGDGGEQNAPLGRAVIGGLLCATVATLVFVPVVFSLVHRR 1053
+G+ P+A+ G G +G V+GG++ AT+ + FVPV F ++ R
Sbjct: 983 LGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRRC 1031



Score = 89.5 bits (222), Expect = 4e-20
Identities = 63/358 (17%), Positives = 133/358 (37%), Gaps = 15/358 (4%)

Query: 714 ATELLRRIRTVPGVADARVQQASTYPQFTVSVDRALAAQLGITEQDVTNAVVA--SLSGT 771
A+ + + + GV D VQ + +D L + +T DV N +
Sbjct: 159 ASNVKDTLSRLNGVGD--VQLFGAQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAA 216

Query: 772 SQVSPTYWLDPHNGVSYPIVAQTPQYRMTSLSDLRALPV-TGRSGAPQLLGGLATIVR-S 829
Q+ T L P ++ I+AQT R + + + + G+ L +A +
Sbjct: 217 GQLGGTPAL-PGQQLNASIIAQT---RFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGG 272

Query: 830 QTDAVVSHYDIAP--LDDIFATTQDRDLGAVSADIARVLHASAADLPKGSRVTV-RGQVQ 886
+ V++ + P I T L + I L P+G +V
Sbjct: 273 ENYNVIARINGKPAAGLGIKLATGANAL-DTAKAIKAKLAELQPFFPQGMKVLYPYDTTP 331

Query: 887 TMSSAFGGLLAGLAGAVLLIYLLIVVNFHSWRDAFVIVSALPAALAGIVWMLFVTRTPLS 946
+ + ++ L A++L++L++ + + R + A+P L G +L ++
Sbjct: 332 FVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSIN 391

Query: 947 VPALTGAILCMGVATANSILVVTFARERLAHTADATVAALEAGFTRFR-PVMMTALAMII 1005
+ G +L +G+ ++I+VV + A E ++ + ++ A+ +
Sbjct: 392 TLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSA 451

Query: 1006 GMAPMALGLGDGGEQNAPLGRAVIGGLLCATVATLVFVPVVFSLVHRRDRAPHSESPS 1063
PMA G G ++ + + + L+ P + + + + A H E+
Sbjct: 452 VFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKG 509


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Bamb_2097  RTXTOXIND  43     2e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 42.5 bits (100), Expect = 2e-06
Identities = 18/124 (14%), Positives = 37/124 (29%), Gaps = 28/124 (22%)

Query: 90 GYLHAWYVDIGAHVKAGQLLASIDTPDLDQQLQQARADLQSATANE-RLAAVTAARWSEM 148
+ V G V+ G +L + + + ++ L A + R ++ +
Sbjct: 105 SIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNK 164

Query: 149 LAQDSVS---------------------------RQEADEKRSDLDAKRAAVAASTANVR 181
L + + + + +K +LD KRA A +
Sbjct: 165 LPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARIN 224

Query: 182 RLEA 185
R E
Sbjct: 225 RYEN 228



Score = 34.8 bits (80), Expect = 5e-04
Identities = 22/139 (15%), Positives = 45/139 (32%), Gaps = 4/139 (2%)

Query: 117 LDQQLQQARADLQSATANERLAAVTAARWSEMLAQDSVSRQEADEKRSDLDAKRAAVAAS 176
L+Q+ + A + +L + + S V++ +E L +
Sbjct: 255 LEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLL 314

Query: 177 TANVRRLEALESFKRLTAPFDGVVTARKT-DVGALIDAGSGNGAELFTVSDARRLRLYVH 235
T + + E + + AP V K G ++ + V + L +
Sbjct: 315 TLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE---TLMVIVPEDDTLEVTAL 371

Query: 236 IPQDDAGAIRAGMHVALTV 254
+ D G I G + + V
Sbjct: 372 VQNKDIGFINVGQNAIIKV 390


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Bamb_2102  HTHFIS  78     1e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 77.6 bits (191), Expect = 1e-18
Identities = 31/120 (25%), Positives = 49/120 (40%)

Query: 2 RVLTVEDDAVTANEIVGELTARGFEVDWIDNGREGMMRAMSASYDAITLDRMLPGADGLA 61
+L +DDA + L+ G++V N + D + D ++P +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 ILTAMRTVGIDTPVLMLSALGDVDERIRGLRAGGDDYLTKPFDSGELSARIEVLLRRRQA 121
+L ++ D PVL++SA I+ G DYL KPFD EL I L +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124


64  Bamb_2170  Bamb_2176  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Bamb_2170  0       15       -1.986335  peptidase S11, D-alanyl-D-alanine
Bamb_2171  1       14       -2.160841  phasin family protein
Bamb_2172  0       14       -1.687276  dihydrolipoamide dehydrogenase
Bamb_2173  -1      13       -1.124076  dihydrolipoamide acetyltransferase
Bamb_2174  -2      11       -1.128731  pyruvate dehydrogenase subunit E1
Bamb_2175  -3      8        -0.414917  multi-sensor signal transduction histidine
Bamb_2176  -3      10       -0.265499  two component LuxR family transcriptional
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
Bamb_2170  BLACTAMASEA  33     0.001    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 33.2 bits (76), Expect = 0.001
Identities = 32/140 (22%), Positives = 54/140 (38%), Gaps = 13/140 (9%)

Query: 134 YVVDQNTGEPLFDKNSHAVVPIASISKLMTAMVVLDAKSPMTDQL----EVTDED-RDYE 188
+D +G L + P+ S K++ VL +QL +D DY
Sbjct: 43 IEMDLASGRTLTAWRADERFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYS 102

Query: 189 KGTGSRLSVGSVLSREDMLHIALMASENRAAAALSRYYPGGRPAFIAAMNAKAKSLGMND 248
+ L+ G ++ ++ A+ S+N AA L G A + A + +G N
Sbjct: 103 PVSEKHLADG--MTVGELCAAAITMSDNSAANLLLATVGG-----PAGLTAFLRQIGDNV 155

Query: 249 THFE-NSTGLSSSNVSSARD 267
T + T L+ + ARD
Sbjct: 156 TRLDRWETELNEALPGDARD 175


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Bamb_2172  RTXTOXIND  31     0.021    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 30.6 bits (69), Expect = 0.021
Identities = 14/44 (31%), Positives = 22/44 (50%)

Query: 45 SMEVPSDVAGTVKEVKVKAGEKVSQGTIIAIVEAAATDAAPVKA 88
S E+ VKE+ VK GE V +G ++ + A +A +K
Sbjct: 96 SKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKT 139


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Bamb_2173  RTXTOXIND  35     7e-04    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 35.2 bits (81), Expect = 7e-04
Identities = 15/37 (40%), Positives = 22/37 (59%)

Query: 166 VPSPAAGVVKEIKVKVGDSVSEGTLIVLLDAAGAPAA 202
+ +VKEI VK G+SV +G +++ L A GA A
Sbjct: 99 IKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEAD 135



Score = 32.1 bits (73), Expect = 0.006
Identities = 14/71 (19%), Positives = 23/71 (32%), Gaps = 1/71 (1%)

Query: 49 VPSPAGGTVKEVKVKVGDSVSEGSLIILLEG-GAAAQVNGAAAPAAAPAPAAAPAPAAPA 107
+ VKE+ VK G+SV +G +++ L GA A +
Sbjct: 99 IKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSR 158

Query: 108 AAAPAAAPAAS 118
+ P
Sbjct: 159 SIELNKLPELK 169


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Bamb_2176  HTHFIS  112    3e-31    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 112 bits (282), Expect = 3e-31
Identities = 39/153 (25%), Positives = 67/153 (43%), Gaps = 4/153 (2%)

Query: 11 TVFVVDDDEAVRDSLRWLLEANGYRVQCFSSAEQFLDAYQPAQQAGQIACLILDVRMSGM 70
T+ V DDD A+R L L GY V+ S+A AG ++ DV M
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIA----AGDGDLVVTDVVMPDE 60

Query: 71 SGLELQERLIADNAALPIIFVTGHGDVPMAVSTMKKGAMDFIEKPFDEAELRKLVERMLD 130
+ +L R+ LP++ ++ A+ +KGA D++ KPFD EL ++ R L
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 131 KARSESKSVQEQRAASERLSKLTAREQQVLERI 163
+ + +++ L +A Q++ +
Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVL 153


65  Bamb_2249  Bamb_2253  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Bamb_2249  -2      13       -2.933830  aromatic amino acid aminotransferase
Bamb_2250  -1      17       -3.736376  3-hydroxybutyrate dehydrogenase
Bamb_2251  1       20       -3.311156  aldo/keto reductase
Bamb_2252  1       22       -2.989101  hypothetical protein
Bamb_2253  0       23       -1.764126  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Bamb_2249  PF05272  32     0.005    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 32.0 bits (72), Expect = 0.005
Identities = 17/75 (22%), Positives = 23/75 (30%), Gaps = 25/75 (33%)

Query: 305 LHAAWVQELGEMRDRIRAMRNGLVERLKASGVDRDFSFINAQRGMFSYSGLTSAQVDRLR 364
+ EL EM A R E +K +F S++ DR R
Sbjct: 639 IAGIVAYELSEMT----AFRRADAEAVK--------AFF-------------SSRKDRYR 673

Query: 365 EEFGIYAVGTGRICV 379
+G Y R V
Sbjct: 674 GAYGRYVQDHPRQVV 688


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_2250  DHBDHDRGNASE  103    8e-29    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 103 bits (259), Expect = 8e-29
Identities = 73/261 (27%), Positives = 121/261 (46%), Gaps = 11/261 (4%)

Query: 2 AADLSGKTAVVTGAASGIGKEIALELAKAGAAVAIADLNQDGANAVADEINKAGGKAIGV 61
A + GK A +TGAA GIG+ +A LA GA +A D N + V + A
Sbjct: 3 AKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAF 62

Query: 62 AMDVTNEEAVNSGIDKVAEAFGSVDILVSNAGIQIVNPIENYAFSDWKKMQAIHVDGAFL 121
DV + A++ ++ G +DILV+ AG+ I + + +W+ +++ G F
Sbjct: 63 PADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122

Query: 122 TTKAALKHMYKDDRGGVVIYMGSVHSHEASPLKSAYVTAKHGLLGLARVLAKEGAKHNVR 181
+++ K+M D R G ++ +GS + +AY ++K + + L E A++N+R
Sbjct: 123 ASRSVSKYM-MDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIR 181

Query: 182 SHVVCPGFVRTPLVDKQIPEQAKELGVSEEEVIK----KVMLGNTVDGVFTTVQDVAQTV 237
++V PG T D Q A E G E+VIK G + D+A V
Sbjct: 182 CNIVSPGSTET---DMQWSLWADENG--AEQVIKGSLETFKTGIPL-KKLAKPSDIADAV 235

Query: 238 LFLSAFPSAALTGQSVVVSHG 258
LFL + + +T ++ V G
Sbjct: 236 LFLVSGQAGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Bamb_2252  PF06872  30     0.010    EspG protein
		>PF06872#EspG protein

Length = 398

Score = 30.1 bits (67), Expect = 0.010
Identities = 23/82 (28%), Positives = 39/82 (47%), Gaps = 7/82 (8%)

Query: 160 ALVLIDPSYEDKKDYART---VTCVTECLKRFATGCYAIWYPQVARVESQRFPEQLKRLQ 216
A +++D + + DY + +TC + LK G +W P+ ++ E Q+F L L+
Sbjct: 27 ASLVLDATIKINSDYKKPWNEMTCAEKLLKILTLG---LWNPKYSQDERQQFQGLLTVLE 83

Query: 217 PNNWLHLTL-TVSNPPADGLGL 237
P + H L V +DG L
Sbjct: 84 PVSPAHNELGRVYAKFSDGSSL 105


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_2253  YERSSTKINASE  28     0.010    Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 27.8 bits (61), Expect = 0.010
Identities = 17/49 (34%), Positives = 24/49 (48%), Gaps = 2/49 (4%)

Query: 50 GGLVLQTAPLSSEPIVEPAGMRTPAGQGPNSSVPLFVAPYINVPGWGAS 98
G +V A S EP+V G+ + +G+ P F AP + V GAS
Sbjct: 274 GNVVFDRA--SGEPVVIDLGLHSRSGEQPKGFTESFKAPELGVGNLGAS 320


66  Bamb_2341  Bamb_2346  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias   Product
Bamb_2341  1       10       2.885232  hypothetical protein
Bamb_2342  -1      10       1.992872  PRC-barrel domain-containing protein
Bamb_2343  0       11       1.266336  major facilitator superfamily transporter
Bamb_2344  0       11       1.276370  hypothetical protein
Bamb_2345  -1      9        1.724102  hypothetical protein
Bamb_2346  -2      9        1.918248  lipid A ABC transporter ATPase/inner membrane
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Bamb_2341  cloacin  32     0.003    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 32.4 bits (73), Expect = 0.003
Identities = 19/65 (29%), Positives = 25/65 (38%)

Query: 30 GGSGSLSKGTGGSGSGSGDTTASTGGTGNGTSGTSASTGGTGSTGSTGNGASGSSANGVG 89
GG L G G S + + G G+G+ G G+ G GN GS G
Sbjct: 22 GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNL 81

Query: 90 QTLAA 94
+AA
Sbjct: 82 SAVAA 86



Score = 32.4 bits (73), Expect = 0.003
Identities = 27/109 (24%), Positives = 43/109 (39%), Gaps = 6/109 (5%)

Query: 32 SGSLSKGTGGSGSGSGDTTASTGGTGNGTSGTSASTGGTGSTGSTGNGASGSSANGVGQT 91
+G + G G+ G +S G SG+ GG G+ G + +G G
Sbjct: 21 NGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGN 80

Query: 92 LAASSNIVTAGGGAISGVGTAIGAQTLPGTNPATTQGLGAVVQDVGAAV 140
L+A + V G A+S G A ++ L A + D+ AA+
Sbjct: 81 LSAVAAPVAFGFPALSTPGAGGLAVSISA------GALSAAIADIMAAL 123



Score = 28.5 bits (63), Expect = 0.050
Identities = 27/91 (29%), Positives = 40/91 (43%), Gaps = 7/91 (7%)

Query: 53 TGGTGNGTSGTSASTGGTGSTGSTGNGASGSSANGVGQTLAASSNIVTAGGGAISGVGTA 112
+GG G G + + ST G + G TG G G +++G G + + GGG+ SG+
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPW----GGGSGSGIHWG 57

Query: 113 IGAQTLPG---TNPATTQGLGAVVQDVGAAV 140
G+ G N G G + V A V
Sbjct: 58 GGSGHGNGGGNGNSGGGSGTGGNLSAVAAPV 88


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
Bamb_2342  TONBPROTEIN  33     0.001    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 33.0 bits (75), Expect = 0.001
Identities = 15/71 (21%), Positives = 21/71 (29%)

Query: 33 LLPTQNPPAPISEALIEPVRETAGEPLTMPPVPAPTHPEEPEAPKKPHREVPRPKPVQRA 92
L P E ++EP E P P +P+ KP + +R
Sbjct: 54 DLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRD 113

Query: 93 EPPAPPPPPPP 103
P P P
Sbjct: 114 VKPVESRPASP 124



Score = 31.9 bits (72), Expect = 0.003
Identities = 20/67 (29%), Positives = 25/67 (37%), Gaps = 1/67 (1%)

Query: 40 PAPISEALIEPVRETAGEPLTMPPVPAPTHPEEPEAPKKPHREVPRPKPVQRAEPPAPPP 99
PAP + V EP P EPE +P E P+ PV +P P
Sbjct: 39 PAPAQPISVTMVTPADLEPPQAVQPPPEP-VVEPEPEPEPIPEPPKEAPVVIEKPKPKPK 97

Query: 100 PPPPPLV 106
P P P+
Sbjct: 98 PKPKPVK 104


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Bamb_2343  TCRTETA  56     1e-10    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 56.0 bits (135), Expect = 1e-10
Identities = 38/144 (26%), Positives = 58/144 (40%), Gaps = 6/144 (4%)

Query: 255 VIAACIIVPQAIVAMLSPWVGRSSQRWGRRPILLLGFSALPVRALLFAGVSSPYLLVPVQ 314
++ A + Q A P +G S R+GRRP+LL+ + V + A ++L +
Sbjct: 47 ILLALYALMQFACA---PVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGR 103

Query: 315 MLDGISAAVFGVMLPLIAADVAGGKGRYNLCIGLFGLAAGIGATLSTAVAGYVADHFGNT 374
++ GI+ A V I AD+ G R G G G + G + F
Sbjct: 104 IVAGITGATGAVAGAYI-ADITDGDERARH-FGFMSACFGFGMVAGPVLGGLMGG-FSPH 160

Query: 375 TSFFGLAAAGALAALLVWLAMPET 398
FF AA L L +PE+
Sbjct: 161 APFFAAAALNGLNFLTGCFLLPES 184


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_2346  ACRIFLAVINRP  30     0.032    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 30.2 bits (68), Expect = 0.032
Identities = 13/36 (36%), Positives = 19/36 (52%), Gaps = 1/36 (2%)

Query: 141 GVMVTLVRDSLTVMFLLGYLFYLNWRLTLIVAVILP 176
V+ TL + V ++ YLF N R TLI + +P
Sbjct: 339 EVVKTLFEAIMLVFLVM-YLFLQNMRATLIPTIAVP 373


67  Bamb_2360  Bamb_2365  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Bamb_2360  -1      13       1.274953   cell division protein FtsK
Bamb_2361  -1      11       -0.410520  3-carboxymuconate cyclase-like protein
Bamb_2362  -3      9        0.631407   glycoside hydrolase 15-like protein
Bamb_2363  -3      11       0.629545   polyhydroxyalkanoate depolymerase
Bamb_2364  -2      11       0.851258   TetR family transcriptional regulator
Bamb_2365  -2      13       1.477358   ferredoxin
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_2360  PYOCINKILLER  37     8e-04    Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 36.7 bits (84), Expect = 8e-04
Identities = 50/252 (19%), Positives = 84/252 (33%), Gaps = 20/252 (7%)

Query: 897 SQPAASTPVSPAAVSSGASGSAFTATSSASATPSAALSSRVPDASAIGQPSMSTAAAQTA 956
+ AA + AA + +A A A + R + A+ P+ + A A
Sbjct: 206 TLTAAKASIEAAAANKAREQAAAEAKRKAEEQARQQAAIRAANTYAM--PANGSVVATAA 263

Query: 957 TGGTAPAAAAGVAAFAASSSGTLAPTTPIAAASPTAFGGTPTSPSSALASVAATPTAT-- 1014
G A G A+ A + S +A + A++P+ S + + + T
Sbjct: 264 GRGLIQVAQ-GAASLAQAISDAIAVLGRVLASAPSVMAVGFASLTYSSRTAEQWQDQTPD 322

Query: 1015 ----------APTTPPTSANPSA--SASPIVSVPGAASIDASTSTPPIAPTPAPQAQPFT 1062
A P S N +A AS V +P + +A +T ++
Sbjct: 323 SVRYALGMDAAKLGLPPSVNLNAVAKASGTVDLPMRLTNEARGNTTTLSVVSTDGVSVPK 382

Query: 1063 AQPAPAAAPSSWTMPGAAATPTTTGTTIPTATTAPLPTATLPAATLPPAAEPTALAEPST 1122
A P AA ++ T P+TT P T T P P++ + +P
Sbjct: 383 AVPVRMAAYNATTGLYEVTVPSTTAEAPPLILTW---TPASPPGNQNPSSTTPVVPKPVP 439

Query: 1123 PAPDAPAAPERP 1134
A P +
Sbjct: 440 VYEGATLTPVKA 451



Score = 33.2 bits (75), Expect = 0.009
Identities = 48/260 (18%), Positives = 90/260 (34%), Gaps = 14/260 (5%)

Query: 797 AASAAASAVGASGPTTAPSSVSAPFAAAAPSAPSAPSATSA--TSAPPATSATATSSTGA 854
+AA +++ A+ A +A A +A A T A PA + ++ G
Sbjct: 206 TLTAAKASIEAAAANKAREQAAAEAKRKAEEQARQQAAIRAANTYAMPANGSVVATAAGR 265

Query: 855 SGMPAAQPAATAPTVTASSQTASTVTASSAPASTGASFNAAASQPAASTPVSPAAVSSGA 914
+ AQ AA+ + + +SAP+ F A+ + + +
Sbjct: 266 GLIQVAQGAASLAQAISDAIAVLGRVLASAPSVMAVGF-ASLTYSSRTAEQWQDQTPDSV 324

Query: 915 SGSAFTATSSASATPSAALSSRVPDASAIGQPSMSTAAAQTATGGTAPAAAAGVAAFAAS 974
+ + PS L++ + + P T A+ T + + GV+ A
Sbjct: 325 RYALGMDAAKLGLPPSVNLNAVAKASGTVDLPMRLTNEARGNTTTLSVVSTDGVSVPKAV 384

Query: 975 SSGTLAPTTPIAAASPTAFGGTPTSPSSALASVAATPTATAPTTPPTSANPSASASPIVS 1034
A A+ + T S ++ + T P +PP + NPS++
Sbjct: 385 PVRMAAYN-----ATTGLYEVTVPSTTAEAPPLILT---WTPASPPGNQNPSSTTP---V 433

Query: 1035 VPGAASIDASTSTPPIAPTP 1054
VP + + P+ TP
Sbjct: 434 VPKPVPVYEGATLTPVKATP 453


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
Bamb_2363  FLGHOOKFLIK  29     0.036    Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 29.4 bits (65), Expect = 0.036
Identities = 11/58 (18%), Positives = 16/58 (27%), Gaps = 2/58 (3%)

Query: 428 PVTTTLAAVPSSKQPETTRATVAKRVRAKAPAATVAPARTPAAKAAPAAKSAGSTRAK 485
T + PS+ P K + T P P A P ++K
Sbjct: 142 DNTPKVTDAPSTVLPTEKPTLFTKLTSEQ--LTTAQPDDAPGTPAQPLTPLVAEAQSK 197


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Bamb_2364  HTHTETR  74     2e-18    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 73.9 bits (181), Expect = 2e-18
Identities = 33/189 (17%), Positives = 63/189 (33%), Gaps = 10/189 (5%)

Query: 5 KIKRDPEGTRRRILMAAAEEFASGGLFGARVDQIARRAETNERMLYYYFGSKEQLFTAVL 64
K K++ + TR+ IL A F+ G+ + +IA+ A +Y++F K LF+ +
Sbjct: 4 KTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIW 63

Query: 65 EHAFSALTEAERVLDLDGVAPVEAVTR---LAHFIWDYYRDHPELLRLINNENLHEARYL 121
E + S + E E +V R + + LL I +
Sbjct: 64 ELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEM 123

Query: 122 HKSTR-IREMMSPIVAKLGNVLMRGQKAGLFRGDVDPLHFYVTLSGL------GYYIVSN 174
+ R + ++ L +A + D+ + + G +
Sbjct: 124 AVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQ 183

Query: 175 RFTLAATLG 183
F L
Sbjct: 184 SFDLKKEAR 192


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Bamb_2365  RTXTOXIND  33     0.002    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 32.9 bits (75), Expect = 0.002
Identities = 16/148 (10%), Positives = 42/148 (28%), Gaps = 14/148 (9%)

Query: 181 QQQADAARERHDRRLARQRREREAAEARAAARR---AASASAAKAAPEAAEPGKQPDAPS 237
+AD + + AR + R +R+ +E
Sbjct: 131 GAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSL 190

Query: 238 AAPADDADAKKRAIIAAALERARKKKEELSEQGTGPRNTERVSAAVQAQIDAAEARRK-- 295
++ L++ R ++ + + E +S ++++D +
Sbjct: 191 IKEQFSTWQNQKYQKELNLDKKRAERLTVLARI---NRYENLSRVEKSRLDDFSSLLHKQ 247

Query: 296 -----RLAEQQAQRDAQAAAAGDENESD 318
+ EQ+ + +A +S
Sbjct: 248 AIAKHAVLEQENKY-VEAVNELRVYKSQ 274


68  Bamb_2430  Bamb_2438  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Bamb_2430  5       14       2.175927   short-chain dehydrogenase/reductase SDR
Bamb_2431  4       10       2.215466   TetR family transcriptional regulator
Bamb_2432  5       11       2.413640   ecotin
Bamb_2433  3       11       1.638203   hypothetical protein
Bamb_2434  2       11       1.434442   hypothetical protein
Bamb_2435  1       10       1.541846   hypothetical protein
Bamb_2436  0       12       1.442835   major facilitator superfamily transporter
Bamb_2437  -2      12       0.472994   ATPase-like protein
Bamb_2438  -1      10       -0.464522  alpha,alpha-trehalose-phosphate synthase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_2430  DHBDHDRGNASE  124    6e-37    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 124 bits (313), Expect = 6e-37
Identities = 85/254 (33%), Positives = 135/254 (53%), Gaps = 15/254 (5%)

Query: 4 LQGKRALVTGGSRGIGAAIAKRLAADGADVAITYEKSAERARAVVADIEALGRRAVAIQA 63
++GK A +TG ++GIG A+A+ LA+ GA +A + + E+ VV+ ++A R A A A
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIA-AVDYNPEKLEKVVSSLKAEARHAEAFPA 64

Query: 64 DSADPVAVRGAVDHAAQTLGGLDILVNNAGIFRAGALDDLTLDDIDATLNVNVRAVIVAS 123
D D A+ + +G +DILVN AG+ R G + L+ ++ +AT +VN V AS
Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNAS 124

Query: 124 QAAARHL--GEGGRIVSTGSCLATRVPDAGMSLYAASKAALIGWTQGLARDLGARGITVN 181
++ ++++ G IV+ GS VP M+ YA+SKAA + +T+ L +L I N
Sbjct: 125 RSVSKYMMDRRSGSIVTVGSN-PAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183

Query: 182 LVHPGSTDTDMNPA--DGEHAGAQRSRMATPQY---------GKAEDVAALVAFVVGPEG 230
+V PGST+TDM + E+ Q + + + K D+A V F+V +
Sbjct: 184 IVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQA 243

Query: 231 RSINGTGLTIDGGA 244
I L +DGGA
Sbjct: 244 GHITMHNLCVDGGA 257


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Bamb_2431  HTHTETR  54     3e-11    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 53.9 bits (129), Expect = 3e-11
Identities = 33/199 (16%), Positives = 69/199 (34%), Gaps = 7/199 (3%)

Query: 1 MAERGRPRSFD-KEAALDRAMEVFWRLGYEGASMTDLTAAMGIASPSLYAAFGSKEALFR 59
MA + + + + ++ LD A+ +F + G S+ ++ A G+ ++Y F K LF
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 60 QAIE-HYRETEGREIWDGVEQAGSAHDAIENYLMQTARVFTRLSKPAGCLIVLSALHPAE 118
+ E E+ + G + L+ +++ L++ H E
Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLEST--VTEERRRLLMEIIFHKCE 118

Query: 119 RSD---TVRQMLIAMREQTVAALRTRLGEGVAAGEISAHADLDAIARYYVTVQQGMSIQA 175
V+Q + ++ + L + A + A A G+
Sbjct: 119 FVGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENW 178

Query: 176 RDGASRRDLEAVAQAALAA 194
DL+ A+ +A
Sbjct: 179 LFAPQSFDLKKEARDYVAI 197


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Bamb_2433  cloacin  33     2e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 32.8 bits (74), Expect = 2e-04
Identities = 21/72 (29%), Positives = 25/72 (34%)

Query: 36 GYAYGPAYGAAPVYGTVNIWGGGGGGRDWDRGHRDYRRWDRDRGDHGGWGRGGGRRGDWN 95
G+ G + + G G GGG D + W G WG G G
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67

Query: 96 EGGGGGGRGEGG 107
G GGG G GG
Sbjct: 68 NGNSGGGSGTGG 79


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Bamb_2436  TCRTETB  111    1e-28    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 111 bits (278), Expect = 1e-28
Identities = 76/398 (19%), Positives = 159/398 (39%), Gaps = 16/398 (4%)

Query: 25 LAVLDGAIANVALPTIARDLHASDAASIWIVNAYQLAVTITLLPLASLGERVGYRRIYIA 84
+VL+ + NV+LP IA D + A++ W+ A+ L +I L +++G +R+ +
Sbjct: 25 FSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLF 84

Query: 85 GLALFTAASLGCALAGS-LPMLAVMRVIQGFGAAGIMSVNAALVRMIYPSSMLGRGLSIN 143
G+ + S+ + S +L + R IQG GAA ++ +V P G+ +
Sbjct: 85 GIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLI 144

Query: 144 AMVVALSSAIGPTVASAILSFASWPWLFAVNVPIGIAAVLGSVRALPANPLHDAPYDFAS 203
+VA+ +GP + I + W +L + + I I V ++ L +D
Sbjct: 145 GSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPM-ITIITVPFLMKLLKKEVRIKGHFDIKG 203

Query: 204 ALM--NACVFGLLITAVDGLGHGERHAYVAAELAVAFVVGYFFVKRQLSQPAPLLPVDLM 261
++ VF +L T + L V+ + FVK P + L
Sbjct: 204 IILMSVGIVFFMLFTTSYSISF----------LIVSVLSFLIFVKHIRKVTDPFVDPGLG 253

Query: 262 RIPMFALSIYTSMASFTSQMLAFVALPFWLQNSLGFSQVETG-LYMTPWPLVIVFAAPLA 320
+ F + + F + +P+ +++ S E G + + P + ++ +
Sbjct: 254 KNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIG 313

Query: 321 GVLSDRYSAGILGGIGLALFAAGLLSLATIGAHPGTVDIVWRMALCGAGFGLFQSPNNRA 380
G+L DR + IG+ + L+ + + + + + G ++ +
Sbjct: 314 GILVDRRGPLYVLNIGVTFLSVSFLTASFL-LETTSWFMTIIIVFVLGGLSFTKTVISTI 372

Query: 381 MLSSAPRERSGGAGGMLSTARLTGQTLGAALVALIFGL 418
+ SS ++ +G +L+ + G A+V + +
Sbjct: 373 VSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSI 410


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_2438  TYPE3IMPPROT  29     0.029    Type III secretion system inner membrane P protein ...
		>TYPE3IMPPROT#Type III secretion system inner membrane P protein family signature.

Length = 224

Score = 29.0 bits (65), Expect = 0.029
Identities = 17/77 (22%), Positives = 33/77 (42%), Gaps = 3/77 (3%)

Query: 92 YRSDLTRFDRQEYAGYLRVNAM---LAKQLAALLQPDDLIWVHDYHLLPFAHCLRELGVK 148
YR L ++ +E + + ++ + + D I L A+ L E+
Sbjct: 99 YRDYLIKYSDRELVQFFENAQLKRQYGEETETVKRDKDEIEKPSIFALLPAYALSEIKSA 158

Query: 149 NPIGFFLHIPFPSPDML 165
IGF+L++PF D++
Sbjct: 159 FKIGFYLYLPFVVVDLV 175


69  Bamb_2636  Bamb_2649  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Bamb_2636  1       13       3.471343   major facilitator superfamily transporter
Bamb_2637  0       13       2.102385   beta-lactamase
Bamb_2638  -2      15       0.770246   LysR family transcriptional regulator
Bamb_2639  -1      15       1.635799   ABC transporter-like protein
Bamb_2640  0       13       2.425499   HAD family hydrolase
Bamb_2641  0       13       1.846500   binding-protein-dependent transport system inner
Bamb_2642  1       13       2.399992   binding-protein-dependent transport system inner
Bamb_2643  -1      14       2.676807   extracellular solute-binding protein
Bamb_2644  2       14       3.423965   tagatose-6-phosphate kinase
Bamb_2645  0       13       2.761235   ribokinase-like domain-containing protein
Bamb_2646  -1      11       2.054395   sorbitol dehydrogenase
Bamb_2647  -2      9        3.570669   ferric uptake regulator family protein
Bamb_2648  -1      8        2.489902   periplasmic solute binding protein
Bamb_2649  -1      10       2.519355   ABC transporter-like protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Bamb_2636  TCRTETB  38     7e-05    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 37.6 bits (87), Expect = 7e-05
Identities = 36/155 (23%), Positives = 65/155 (41%), Gaps = 5/155 (3%)

Query: 31 LLALATAGFITILTEALPAGLLPLMSVDLRVTEALIGQLVTVYALGSIVAAIPLVAATRA 90
L+ L F ++L E + LP ++ D A + T + L + +
Sbjct: 16 LIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ 75

Query: 91 MRRRRLLLAALAGFVVSNALTAAS-PYYALTLAARFVAGMSAGLLWALLAGYASRMVDAS 149
+ +RLLL + + + +++L + ARF+ G A AL+ +R +
Sbjct: 76 LGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKE 135

Query: 150 LRGRAIAVAMLGAPVAMSIGI-PA-GTALGAMFGW 182
RG+A ++G+ VAM G+ PA G + W
Sbjct: 136 NRGKAF--GLIGSIVAMGEGVGPAIGGMIAHYIHW 168


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
Bamb_2637  BLACTAMASEA  29     0.022    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 29.4 bits (66), Expect = 0.022
Identities = 15/67 (22%), Positives = 27/67 (40%), Gaps = 5/67 (7%)

Query: 65 REDTLFRLASVSKPIVTAAAMRLVAAGRIELDEPVAHW----LPAFRPTLRDGTPTDITL 120
R D F + S K ++ A + V AG +L+ + H+ L + P +T+
Sbjct: 57 RADERFPMMSTFKVVLCGAVLARVDAGDEQLERKI-HYRQQDLVDYSPVSEKHLADGMTV 115

Query: 121 RHLLSHT 127
L +
Sbjct: 116 GELCAAA 122


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Bamb_2639  PF05272  30     0.015    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.0 bits (67), Expect = 0.015
Identities = 14/35 (40%), Positives = 17/35 (48%)

Query: 32 VVFVGPSGCGKSTLMRMIAGLEDISSGELLIDGAK 66
VV G G GKSTL+ + GL+ S I K
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGK 633


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Bamb_2641  RTXTOXIND  28     0.044    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 28.3 bits (63), Expect = 0.044
Identities = 11/52 (21%), Positives = 18/52 (34%), Gaps = 5/52 (9%)

Query: 7 EGRRAPAAFDLVRR---ALPGALAWLIALLLFFPIFWMAITAFKTEQQAYAS 55
E PA +L+ P +A+ I L + + E A A+
Sbjct: 38 ENEFLPAHLELIETPVSRRPRLVAYFIMGFLVIAFILSVLG--QVEIVATAN 87


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Bamb_2643  MALTOSEBP  34     0.001    Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 34.3 bits (78), Expect = 0.001
Identities = 98/445 (22%), Positives = 165/445 (37%), Gaps = 77/445 (17%)

Query: 6 LDAAARCFAGAALATAACAASA------GTLTIATLNNPDMIELKKLSPAFEKANPDIKL 59
+ AR A +AL T +ASA G L I + L ++ FEK D +
Sbjct: 3 IKTGARILALSALTTMMFSASALAKIEEGKLVIWINGDKGYNGLAEVGKKFEK---DTGI 59

Query: 60 NWVILEENVLRQRATTDITTGSGQFDVMAIGTYETPQWGKRGWLAPMTGLPADYDLNDIV 119
+ + L ++ TG G D++ + + G LA +T P + +
Sbjct: 60 KVTVEHPDKLEEKFPQVAATGDGP-DIIFWAHDRFGGYAQSGLLAEIT--PDKAFQDKLY 116

Query: 120 KTARDSLSYNGQLYALPFYVESSMTFYRKDLFAAKGLKMPDQP-TYDQIAEFADKLTDKA 178
D++ YNG+L A P VE+ Y KDL +P+ P T+++I +L KA
Sbjct: 117 PFTWDAVRYNGKLIAYPIAVEALSLIYNKDL-------LPNPPKTWEEIPALDKEL--KA 167

Query: 179 KGTYGICLRGKAGWGENMAYVSTVVNTFGGRWFD-ENW-----NAQLTSPEWKKAIGFYV 232
KG + + + + ++ GG F EN + + + K + F V
Sbjct: 168 KGKSALMFNLQEPY-----FTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLV 222

Query: 233 NLLKK-----DGPPGASSNGFNENLTLTASGKCAMWIDATVAAGMLYNKQQSQVADKIGF 287
+L+K D + FN+ G+ AM I+ A N S+V G
Sbjct: 223 DLIKNKHMNADTDYSIAEAAFNK-------GETAMTINGPWAWS---NIDTSKV--NYGV 270

Query: 288 AAAPVAATPKGSHWLWAWALAVPKTSKQQDAARKFIA-WATSKQYIEMAGKDEGWASVPP 346
P ++ + + S ++ A++F+ + + + +E KD+
Sbjct: 271 TVLPTFKGQPSKPFVGVLSAGINAASPNKELAKEFLENYLLTDEGLEAVNKDK------- 323

Query: 347 GTRTSTYQRPEYKAAAPFSDFVLKAIETADPNDPSLKKV---PYTGVQYVGIPEFQSFGT 403
P LK+ E DP + G IP+ +F
Sbjct: 324 ----------------PLGAVALKSYEEELAKDPRIAATMENAQKGEIMPNIPQMSAFWY 367

Query: 404 VVGQAIAGAVAGQTTVDQALAAGQA 428
V A+ A +G+ TVD+AL Q
Sbjct: 368 AVRTAVINAASGRQTVDEALKDAQT 392


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_2646  DHBDHDRGNASE  130    8e-39    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 130 bits (327), Expect = 8e-39
Identities = 84/255 (32%), Positives = 124/255 (48%), Gaps = 7/255 (2%)

Query: 3 LEQKVAILTGAASGIGEAVAQRYLDEGARCVLVDVKPASGSLARLIEASPGR-AVAVTAD 61
+E K+A +TGAA GIGEAVA+ +GA VD P + R A A AD
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 62 VTRRDDIERIVATAVERFGGVDILFNNAALFDMRPLLDESWDVFDRLFSVNVKGLFFLMQ 121
V I+ I A G +DIL N A + + S + ++ FSVN G+F +
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 122 AVAQRMVEQGRGGKIVNMSSQAGRRGEALVSHYCATKAAVISYTQSAALALAPHRINVNG 181
+V++ M+++ R G IV + S ++ Y ++KAA + +T+ L LA + I N
Sbjct: 126 SVSKYMMDR-RSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184

Query: 182 IAPGVVDTPMWEQVDALFARYEQRPPG--EKKRLVGEAVPLGRMGAPGDLTGAALFLASA 239
++PG +T M + A EQ G E + +PL ++ P D+ A LFL S
Sbjct: 185 VSPGSTETDMQWSLWADENGAEQVIKGSLETFKT---GIPLKKLAKPSDIADAVLFLVSG 241

Query: 240 DADYITAQTLNVDGG 254
A +IT L VDGG
Sbjct: 242 QAGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_2648  ADHESNFAMILY  131    2e-38    Adhesin family signature.
		>ADHESNFAMILY#Adhesin family signature.

Length = 309

Score = 131 bits (332), Expect = 2e-38
Identities = 75/311 (24%), Positives = 128/311 (41%), Gaps = 37/311 (11%)

Query: 18 LLAASAAALSIAA-----PACAQAATVNVVAAENFYGDVASQIGGRHVAVTSILSNPDQD 72
LL +A+ + A + VVA + D+ I G + + SI+ QD
Sbjct: 7 LLVLFLSAIILVACASGKKDTTSGQKLKVVATNSIIADITKNIAGDKIDLHSIVP-IGQD 65

Query: 73 PHLFEASPKTARALQHAQVVIYNGAN----YDPWMSKLLGASKQAKRA-TIVVADLVGK- 126
PH +E P+ + A ++ YNG N + W +KL+ +K+ + V+D V
Sbjct: 66 PHEYEPLPEDVKKTSEADLIFYNGINLETGGNAWFTKLVENAKKTENKDYFAVSDGVDVI 125

Query: 127 -------KAGDNPHVWYDPATMPAAARAIAAEFGRADPANKADYDANLQKFVASL----K 175
K ++PH W + A+ IA + DP NK Y+ NL+++ L K
Sbjct: 126 YLEGQNEKGKEDPHAWLNLENGIIFAKNIAKQLSAKDPNNKEFYEKNLKEYTDKLDKLDK 185

Query: 176 PVDDKVAALRAQYKGVPVTATEPVFGYMSDAIGLDMRNQRFQLATMNDTEASAQDVAAFE 235
DK + A+ K + VT +E F Y S A G+ + + E + + +
Sbjct: 186 ESKDKFNKIPAEKKLI-VT-SEGAFKYFSKAYGV---PSAYIWEINTEEEGTPEQIKTLV 240

Query: 236 NDLRKKQVRVLIYNSQAEAPMTKRLLKIARDGGVP------SVSVTETQPAGKTFQQWMT 289
LR+ +V L S + K + ++D +P + S+ E G ++ M
Sbjct: 241 EKLRQTKVPSLFVESSVDDRPMKTV---SQDTNIPIYAQIFTDSIAEQGKEGDSYYSMMK 297

Query: 290 GQLDALAAALS 300
LD +A L+
Sbjct: 298 YNLDKIAEGLA 308


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Bamb_2649  PF05272  30     0.013    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.0 bits (67), Expect = 0.013
Identities = 19/70 (27%), Positives = 33/70 (47%), Gaps = 4/70 (5%)

Query: 2 SPTPHALALDRVTLELGGRTILRDVSFSIEPG---EFVGVL-GPNGAGKTTLMRAVLGLV 57
+P + R +G ++ V+ +EPG ++ VL G G GK+TL+ ++GL
Sbjct: 561 TPDDYKPRRLRYLQLVGKYILMGHVARVMEPGCKFDYSVVLEGTGGIGKSTLINTLVGLD 620

Query: 58 PVSGGTLSVG 67
S +G
Sbjct: 621 FFSDTHFDIG 630


70  Bamb_2655  Bamb_2663  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Bamb_2655  -1      10       0.010052   hydrophobe/amphiphile efflux-1 (HAE1) family
Bamb_2656  -2      6        2.254636   RND family efflux transporter MFP subunit
Bamb_2657  1       8        2.305368   TetR family transcriptional regulator
Bamb_2658  0       8        2.647118   isochorismatase hydrolase
Bamb_2659  -2      9        3.218182   AraC family transcriptional regulator
Bamb_2660  -2      10       2.470040   carbon monoxide dehydrogenase subunit G
Bamb_2661  -2      10       2.357900   hypothetical protein
Bamb_2662  -3      9        1.917575   hypothetical protein
Bamb_2663  -2      10       1.824249   protease Do
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_2655  ACRIFLAVINRP  1267   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1267 bits (3279), Expect = 0.0
Identities = 676/1035 (65%), Positives = 823/1035 (79%), Gaps = 2/1035 (0%)

Query: 1 MAKFFIDRPIFAWVIAIILMLAGVAAIFTLPISQYPTIAPPSIQITANYPGASAKTVEDT 60
MA FFI RPIFAWV+AIILM+AG AI LP++QYPTIAPP++ ++ANYPGA A+TV+DT
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTQVIEQQMSGLDNFLYMSSTSDDSGNATITLTFAPGTNADIAQVQVQNKLSLATPVLPQ 120
VTQVIEQ M+G+DN +YMSSTSD +G+ TITLTF GT+ DIAQVQVQNKL LATP+LPQ
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 121 VVQQLGLSVTKSSSSFLLVLAFNSEDGSMNKYDLANFVASHVKDPISRLNGVGTVTLFGS 180
VQQ G+SV KSSSS+L+V F S++ + D++++VAS+VKD +SRLNGVG V LFG+
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 181 QYAMRIWLDPTRLTNYGLTPVDVSSAIAAQNVQIAGGQIGGTPAKPGTVLQATITESTLL 240
QYAMRIWLD L Y LTPVDV + + QN QIA GQ+GGTPA PG L A+I T
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 241 QTPEQFGNILLKVNQDGSQVRLKDVSKIELGGENYNFDTKYNGQPTAALGIQLATNANAL 300
+ PE+FG + L+VN DGS VRLKDV+++ELGGENYN + NG+P A LGI+LAT ANAL
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 301 ATAKAVRAKIDELAPFFPHGLVVKYPYDTTPFVKLSIEEVVKTLLEGIVLVFLVMYLFLQ 360
TAKA++AK+ EL PFFP G+ V YPYDTTPFV+LSI EVVKTL E I+LVFLVMYLFLQ
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 361 NLRATIIPTIAVPVVLLGTFAIMSMVGFSINTLSMFGLVLAIGLLVDDAIVVVENVERVM 420
N+RAT+IPTIAVPVVLLGTFAI++ G+SINTL+MFG+VLAIGLLVDDAIVVVENVERVM
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 421 AEEGLSPKEATRKAMGQITGALVGVALVLSAVFVPVAFSGGSVGAIYRQFSLTIVSAMVL 480
E+ L PKEAT K+M QI GALVG+A+VLSAVF+P+AF GGS GAIYRQFS+TIVSAM L
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 481 SVLVALILTPALCATILKPIPQGHHEEKKGFFGWFNRTFNNSREKYHVGVHHVIKRSGRW 540
SVLVALILTPALCAT+LKP+ HHE K GFFGWFN TF++S Y V ++ +GR+
Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540

Query: 541 LIIYLVVIVAVGLLFVRLPKSFLPDEDQGLMFVIVQTPSGSTQETTARTLANISDYLLKD 600
L+IY +++ + +LF+RLP SFLP+EDQG+ ++Q P+G+TQE T + L ++DY LK+
Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600

Query: 601 EKEIVESAFTVNGFSFAGRGQNSGLVFVRLKDYSQRQHANQKVQALIGRMFGRYAGYKDA 660
EK VES FTVNGFSF+G+ QN+G+ FV LK + +R +A+I R +D
Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660

Query: 661 MVIPFNPPSIPELGTAAGFDFELTDNAGLGHNALMAARNQLLGMAAKDP-TLQGVRPNGL 719
VIPFN P+I ELGTA GFDFEL D AGLGH+AL ARNQLLGMAA+ P +L VRPNGL
Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720

Query: 720 NDTPQYKVDIDREKANALGVTAEAIDQTFSIAWASKYVNNFLDTDGRIKKVYVQSDAPFR 779
DT Q+K+++D+EKA ALGV+ I+QT S A YVN+F+D GR+KK+YVQ+DA FR
Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFID-RGRVKKLYVQADAKFR 779

Query: 780 MTPEDLNVWYVRNGSGGMVPFGAFATGHWTYGSPKLERYNGVSAMEIQGQAAPGKSTGQA 839
M PED++ YVR+ +G MVPF AF T HW YGSP+LERYNG+ +MEIQG+AAPG S+G A
Sbjct: 780 MLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDA 839

Query: 840 MTAMEGLAKKLPVGIGYSWTGLSFQEIQSGSQAPVLYAISILVVFLCLAALYESWSIPFS 899
M ME LA KLP GIGY WTG+S+QE SG+QAP L AIS +VVFLCLAALYESWSIP S
Sbjct: 840 MALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVS 899

Query: 900 VIMVVPLGVVGALLAATMRGLENDVFFQVGLLTTVGLSAKNAILIVEFARELQMTEKMGP 959
V++VVPLG+VG LLAAT+ +NDV+F VGLLTT+GLSAKNAILIVEFA++L E G
Sbjct: 900 VMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGV 959

Query: 960 IEAALEAARLRLRPILMTSLAFILGVMPLAISNGAGSASQHAIGTGVIGGMITATFLAIF 1019
+EA L A R+RLRPILMTSLAFILGV+PLAISNGAGS +Q+A+G GV+GGM++AT LAIF
Sbjct: 960 VEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIF 1019

Query: 1020 MIPMFFVKIRAIFSG 1034
+P+FFV IR F G
Sbjct: 1020 FVPVFFVVIRRCFKG 1034



Score = 71.8 bits (176), Expect = 1e-14
Identities = 54/324 (16%), Positives = 113/324 (34%), Gaps = 15/324 (4%)

Query: 724 QYKVDIDREKANALGVTAEAIDQTFS---IAWASKYVNNFLDTDGRIKKVYVQSDAPFRM 780
++ +D + N +T + A+ + G+ + + F+
Sbjct: 183 AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK- 241

Query: 781 TPEDLNVWYVR-NGSGGMVPFGAFATGHWTYGSPKLE---RYNGVSAMEIQGQAAPGKST 836
PE+ +R N G +V A G R NG A + + A G +
Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARV--ELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 837 ----GQAMTAMEGLAKKLPVGIGYSWTGLSFQEIQSGSQAPVLYAI-SILVVFLCLAALY 891
+ L P G+ + + +Q V +I++VFL +
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 892 ESWSIPFSVIMVVPLGVVGALLAATMRGLENDVFFQVGLLTTVGLSAKNAILIVEFAREL 951
++ + VP+ ++G G + G++ +GL +AI++VE +
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 952 QMTEKMGPIEAALEAARLRLRPILMTSLAFILGVMPLAISNGAGSASQHAIGTGVIGGMI 1011
M +K+ P EA ++ ++ ++ +P+A G+ A ++ M
Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479

Query: 1012 TATFLAIFMIPMFFVKIRAIFSGE 1035
+ +A+ + P + S E
Sbjct: 480 LSVLVALILTPALCATLLKPVSAE 503


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Bamb_2656  RTXTOXIND  40     1e-05    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 40.2 bits (94), Expect = 1e-05
Identities = 40/215 (18%), Positives = 70/215 (32%), Gaps = 34/215 (15%)

Query: 100 AQLNSAKATLAKAQANLVTQNALVARYKVLVAANAVSKQEYDNAVATQ-GQAAADVAAGK 158
+ A L ++ L + + K + Q + N + + Q ++
Sbjct: 259 NKYVEAVNELRVYKSQLEQIESEILSAK---EEYQLVTQLFKNEILDKLRQTTDNIGLLT 315

Query: 159 AAVDTAQINLGYTDVVSPITGRV-GISQVTPGAYVQASQATLMSTVQQLDPVYVD-LTQS 216
+ + + + +P++ +V + T G V ++ TLM V + D + V L Q+
Sbjct: 316 LELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE-TLMVIVPEDDTLEVTALVQN 374

Query: 217 SL-----EGLKLRQDVQSGRLKTSGPGAAKVSLILEDGKTYSDAGKLQFSDVTVDQTTGS 271
G V++ G KV I D DQ G
Sbjct: 375 KDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNI--------------NLDAIEDQRLGL 420

Query: 272 VT--IRAV------FPNPGRVLLPGMFVRARIEEG 298
V I ++ N L GM V A I+ G
Sbjct: 421 VFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTG 455



Score = 29.4 bits (66), Expect = 0.028
Identities = 12/34 (35%), Positives = 17/34 (50%), Gaps = 1/34 (2%)

Query: 63 AQVRARVDGIVLR-REFTEGGDVKAGQRLYKIDP 95
+ +RA V V + + TEGG V + L I P
Sbjct: 328 SVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVP 361


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Bamb_2657  HTHTETR  117    6e-35    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 117 bits (294), Expect = 6e-35
Identities = 78/208 (37%), Positives = 115/208 (55%)

Query: 1 MVRRTKEEALETRNRILDAAEHVFFEKGVSHTSLADIAQHAGVTRGAIYWHFANKSELFD 60
M R+TK+EA ETR ILD A +F ++GVS TSL +IA+ AGVTRGAIYWHF +KS+LF
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 61 AMFDRVFLPIDELKRMPHDAPGGNPLDTIRKILIWCLLGVQRDSQLRRVFSILFMKCEYV 120
+++ I EL+ G+PL +R+ILI L + + R + I+F KCE+V
Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120

Query: 121 ADMEPLLQRNRAGMSEALHAIDADLAVAVRLKLLPERLDTWRATLMLHTLVSGFVRDMLM 180
+M + Q R E+ I+ L + K+LP L T RA +++ +SG + + L
Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180

Query: 181 LPDEIDAEQHAEKLVDGCFDMLRYSPAM 208
P D ++ A V +M P +
Sbjct: 181 APQSFDLKKEARDYVAILLEMYLLCPTL 208


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_2658  ISCHRISMTASE  42     1e-06    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 41.5 bits (97), Expect = 1e-06
Identities = 26/128 (20%), Positives = 46/128 (35%), Gaps = 12/128 (9%)

Query: 9 ASRRALIVIDVQNEYVTGNLPIEYPPIDTSLANIGRAIDAAHAVGVPVIVV-----QHVA 63
+R L++ D+Q Y P+ ANI + + +G+PV+ Q+
Sbjct: 28 PNRAVLLIHDMQ-NYFVDAFTAGASPVTELSANIRKLKNQCVQLGIPVVYTAQPGSQNPD 86

Query: 64 PAG--APIFAPGTDGVALHPVVADR----PYAHLIVKAQASAFAGTDLAAWLDAHGIDTL 117
+ PG + + ++ K + SAF T+L + G D L
Sbjct: 87 DRALLTDFWGPGLNSGPYEEKIITELAPEDDDLVLTKWRYSAFKRTNLLEMMRKEGRDQL 146

Query: 118 SVAGYMTH 125
+ G H
Sbjct: 147 IITGIYAH 154


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
Bamb_2663  V8PROTEASE  72     6e-16    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 72.0 bits (176), Expect = 6e-16
Identities = 33/157 (21%), Positives = 60/157 (38%), Gaps = 26/157 (16%)

Query: 125 LGSGFIVSADGYILTNAHVIDGANVVTVKLTDKR-----------EYKA-KVVGSDKQSD 172
+ SG +V +LTN HV+D + L + A ++ + D
Sbjct: 103 IASGVVV-GKDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGEGD 161

Query: 173 VAVLKIDA--------SGLPTVKIGDPGQSKVGQWVVAIGSPYGFDNTVTSGIISAKSRA 224
+A++K + + + +++V Q + G P +K +
Sbjct: 162 LAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGDKPVATMW---ESKGKI 218

Query: 225 LPDENYTPFIQTDVPVNPGNSGGPLFNLQGEVIGINS 261
+ +Q D+ GNSG P+FN + EVIGI+
Sbjct: 219 TYLKGE--AMQYDLSTTGGNSGSPVFNEKNEVIGIHW 253


71  Bamb_2831  Bamb_2838  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Bamb_2831  0       15       -1.897749  fimbrial biogenesis outer membrane usher
Bamb_2832  -2      19       -2.682404  pili assembly chaperone
Bamb_2833  -1      16       -2.181779  fimbrial protein
Bamb_2834  -2      10       -1.592372  Hpt sensor hybrid histidine kinase
Bamb_2835  -1      11       -1.959679  two component LuxR family transcriptional
Bamb_2836  -1      11       -1.666501  transposase IS3/IS911 family protein
Bamb_2837  -1      13       -0.655332  single-stranded DNA-binding protein
Bamb_2838  0       14       -0.629628  major facilitator superfamily transporter
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Bamb_2831  PF00577  748    0.0      Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 748 bits (1932), Expect = 0.0
Identities = 267/893 (29%), Positives = 401/893 (44%), Gaps = 84/893 (9%)

Query: 20 EADGATALAYNFDSRLLLGTPLGVANIERFNRTHAVDPGRYQVDLYVNDRFVSRRDITFR 79
++ F+ R L P VA++ RF + PG Y+VD+Y+N+ +++ RD+TF
Sbjct: 38 AQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDIYLNNGYMATRDVTFN 97

Query: 80 NTDDGA-LYPCLSDALLTSAGVLLRDVADARLPHAAAGQAAPDGTQPSPEPTPDHADALP 138
D + PCL+ A L S G+ V+ L A
Sbjct: 98 TGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDA----------------------- 134

Query: 139 ADAPTAFCGPLTKRVPGAQTTFDLTRLRLDITVPQFEMRVAPRGAVDPASLDAGEAAAYV 198
C PLT + A D+ + RL++T+PQ M RG + P D G A +
Sbjct: 135 -------CVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLL 187

Query: 199 NYDASYYTS-SAYGVRANSVYTGLNAGVNVGLWRVRQQSSFTYNGGTGNSIS--RWNNIR 255
NY+ S + + G ++ Y L +G+N+G WR+R ++++YN +S S +W +I
Sbjct: 188 NYNFSGNSVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHIN 247

Query: 256 TYAERPLIGMRSQLTIGQSFTSGSLFSTVGYTGVRLESDDRMLPDSMRGYAPVVNGVAQT 315
T+ ER +I +RS+LT+G +T G +F + + G +L SDD MLPDS RG+APV++G+A+
Sbjct: 248 TWLERDIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARG 307

Query: 316 NARVVVNQNGHVIYQTTVAPGPFRIADLNPTSYQGDIDVEVHEANGQVSRFTVPFSAVPN 375
A+V + QNG+ IY +TV PGPF I D+ GD+ V + EA+G FTVP+S+VP
Sbjct: 308 TAQVTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPL 367

Query: 376 SMRPGLSHYSVTLGQVRQIEG--SHARFADLTYQRGLTNSLTANGAVRVSLDYQSVLAGA 433
R G + YS+T G+ R RF T GL T G +++ Y++ G
Sbjct: 368 LQREGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGI 427

Query: 434 VLGT-RIGAFGWNTTWSHARNAQGGWLNGWMSAITYSHTFTPTQTTFSLAGYRYSTKGYR 492
+GA + T +++ +G Y+ + + T L GYRYST GY
Sbjct: 428 GKNMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYF 487

Query: 493 EFIDALSAREAYRRGETWA-------------SSTYQQRDQFTLNVNQDFGKYGALSLSA 539
F D +R ET + Y +R + L V Q G+ L LS
Sbjct: 488 NFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSG 547

Query: 540 TTSSYYESRPHDTQVQLSYNNHYHSISYNLSFVRQKTATVVAPGSGPMQNLLPGYGRAGA 599
+ +Y+ + D Q Q N + I++ LS+ K A + G
Sbjct: 548 SHQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNA-----------------WQKGR 590

Query: 600 DTRTSNVLMLTVSIPLG--------SGPRTASLSGSVSHGNDQGTSYQASLSGIADRAQT 651
D L L V+IP S R AS S S+SH + + A + G
Sbjct: 591 DQM----LALNVNIPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNN 646

Query: 652 LSYGLSLS---GETRNGARTYSGNLQKNLSMITAGASYSNGDHFWQVGATARGAIVAHRG 708
LSY + G N T L A YS+ D Q+ G ++AH
Sbjct: 647 LSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHAN 706

Query: 709 GVTFGPYLGDTFGIVEAKGARGATVRSGMGARVDRFGYAIVPSLTPYRYTDVALETQGID 768
GVT G L DT +V+A GA+ A V + G R D GYA++P T YR VAL+T +
Sbjct: 707 GVTLGQPLNDTVVLVKAPGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLA 766

Query: 769 RDTELIGNQVRVAPYAGSAVLLKFATLTGHAVLIQGARADGERLPLGANVLDGKGTSIGV 828
+ +L V P G+ V +F G +L+ + + LP GA V S G+
Sbjct: 767 DNVDLDNAVANVVPTRGAIVRAEFKARVGIKLLMT-LTHNNKPLPFGAMVTSESSQSSGI 825

Query: 829 VGQGGLAYARVPSAHGTVRVRWGKRREDQCAMRYDLPAQSATKAPIVRIRAQC 881
V G Y G V+V+WG+ C Y LP +S + + ++ A+C
Sbjct: 826 VADNGQVYLSGMPLAGKVQVKWGEEENAHCVANYQLPPESQ-QQLLTQLSAEC 877


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
Bamb_2833  NEISSPPORIN  28     0.028    Neisseria sp. porin signature.
		>NEISSPPORIN#Neisseria sp. porin signature.

Length = 348

Score = 28.0 bits (62), Expect = 0.028
Identities = 43/180 (23%), Positives = 61/180 (33%), Gaps = 13/180 (7%)

Query: 24 MKKNLLVLMLAAAPVFAFAQSSNTIQFQGEVTDQTCAVTVNGNASSPTVLLPTVSTADLA 83
MKK+L+ L LAA PV A A + + V +G S AD
Sbjct: 1 MKKSLIALTLAALPVAAMADVTLYGAIKAGVQTYRSVEHTDGKVSKVET---GSEIADFG 57

Query: 84 TAGGTAGETHFTLGLSGCTAPTQTARAINT-VFVGNQVTTNGNLGNTGT--ATNVALQLL 140
+ G G+ GL Q A T GN+ + G G GT A ++ L
Sbjct: 58 SKIGFKGQEDLGNGLKAVWQLEQGASVAGTNTGWGNKQSFVGLKGGFGTIRAGSLNSPLK 117

Query: 141 DPANATTPFNLSGASGYAAPGLSLAVGDTSASYDFAVRYITENGSATAGSVLGSVQYAIN 200
+ + + + L ++ +VRY S GSVQYA
Sbjct: 118 NTGANVNAWE---SGKFTGNVLEISGMAQREHRYLSVRY----DSPEFAGFSGSVQYAPK 170


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Bamb_2834  HTHFIS  59     4e-11    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 59.1 bits (143), Expect = 4e-11
Identities = 34/147 (23%), Positives = 54/147 (36%), Gaps = 4/147 (2%)

Query: 581 ILVIDDHRTNLAVLDRQIRATGCVPVLAETGRQALESFSTGRFDLVLMDVDLGDIDGFTL 640
ILV DD VL++ + G + + G DLV+ DV + D + F L
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 641 TQRFRDLEAGAGTHTPIVSISASSEPDHHERALACGMDGLLDKPIRADVLRTVVSLWCDL 700
R + A P++ +SA + +A G L KP L ++
Sbjct: 66 LPRIKK----ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 701 DLRQQSPAPTPTQGAANLASTYAALLQ 727
R+ S +Q L AA+ +
Sbjct: 122 PKRRPSKLEDDSQDGMPLVGRSAAMQE 148


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Bamb_2837  cloacin  42     6e-07    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 42.0 bits (98), Expect = 6e-07
Identities = 28/71 (39%), Positives = 31/71 (43%), Gaps = 1/71 (1%)

Query: 109 GGRGGAGGGGGGSDEGGYGGGYG-GGGGGGRGGEQMERGGGGGGGRAGGAPRGGAAGGGQ 167
GG G G GGG SD G+ GGG G G G G GG G + G GG
Sbjct: 22 GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNL 81

Query: 168 SRPSAPAGGGF 178
S +AP GF
Sbjct: 82 SAVAAPVAFGF 92



Score = 30.5 bits (68), Expect = 0.004
Identities = 27/80 (33%), Positives = 29/80 (36%), Gaps = 9/80 (11%)

Query: 107 MLGGRGGAGGGGGGSDEG-------GYGGGYGGGGGGGRGGEQMERGGGGGGG--RAGGA 157
M GG G G S G G G G G G G E GGG G G GG+
Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60

Query: 158 PRGGAAGGGQSRPSAPAGGG 177
G G G S + GG
Sbjct: 61 GHGNGGGNGNSGGGSGTGGN 80


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Bamb_2838  TCRTETA  87     7e-21    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 86.8 bits (215), Expect = 7e-21
Identities = 76/369 (20%), Positives = 141/369 (38%), Gaps = 33/369 (8%)

Query: 17 RATTSLAAIFALRMLGLFMIMPVFSVYAKT-IPGGDNVLLVGIALGAYGVTQSLFYIFYG 75
R + + AL +G+ +IMPV + + D GI L Y + Q G
Sbjct: 5 RPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLG 64

Query: 76 WASDKFGRKPVIATGLVIFALGSFVAAFAHDITWIIVGRVIQGM-GAVSSAVLAFIADLT 134
SD+FGR+PV+ L A+ + A A + + +GR++ G+ GA + A+IAD+T
Sbjct: 65 ALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADIT 124

Query: 135 SEQNRTKAMAMVGGSIGVSFAVAIVGAPI--VFHWVGMSGLFTIVGVLSILAIGVVVWIV 192
R + + G + G + + F L+ L +++
Sbjct: 125 DGDERARHFGFMSACFGFGM---VAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLL 181

Query: 193 PDAAKPVHVPAPFAEVLHNGELLRLNFGVLVLHATQTALFLVVPRLLVDGGLPVAAHWKV 252
P++ K P E L+ R G+ V+ A + V ++ G AA W +
Sbjct: 182 PESHKGERRPLR-REALNPLASFRWARGMTVV-----AALMAVFFIMQLVGQVPAALWVI 235

Query: 253 Y---------------LPVMGL--AFVMMVPAIIVAEKRGKMKPVLLGGILAILIGQLLL 295
+ L G+ + + VA + G+ + ++L G++A G +LL
Sbjct: 236 FGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALML-GMIADGTGYILL 294

Query: 296 GSAPHTILIVAAILFVYFLGFNILEASQPSLVSKLAPGSRKGAATGVYNTTQSIGLALGG 355
A + + V I + +++S+ R+G G S+ +G
Sbjct: 295 AFATRGWMAF--PIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGP 352

Query: 356 IVGGWLLKH 364
++ +
Sbjct: 353 LLFTAIYAA 361


72  Bamb_2965  Bamb_2972  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Bamb_2965  0       13       -0.924043  methionine synthase
Bamb_2966  0       13       -0.550812  methionine synthase
Bamb_2967  2       14       -0.040058  hypothetical protein
Bamb_2968  0       12       1.177099   arginyl-tRNA synthetase
Bamb_2969  0       11       1.301492   sporulation domain-containing protein
Bamb_2970  -1      11       -0.190116  DSBA oxidoreductase
Bamb_2971  -2      11       0.107438   short chain dehydrogenase
Bamb_2972  -2      11       -0.062979  extracellular solute-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_2965  YERSINIAYOPE  28     0.040    Yersinia virulence determinant YopE protein signature.
		>YERSINIAYOPE#Yersinia virulence determinant YopE protein signature.

Length = 219

Score = 28.2 bits (62), Expect = 0.040
Identities = 21/96 (21%), Positives = 36/96 (37%), Gaps = 3/96 (3%)

Query: 224 SGTVTDASGRILSGQTVEAFWNSL--RHAKPLTFGLNCALGAALMRPYIAELAKLCDTYV 281
S +V + SGR +S QT + + N+L R P L + L + + +
Sbjct: 20 SSSVGEMSGRSVSQQTSDQYANNLAGRTESPQGSSLASRIIERLSSVAHSVIGFI-QRMF 78

Query: 282 SCYPNAGLPNPMSDTGFDETPDVTSGLLKEFAQAGL 317
S + + P +P S +K+ A L
Sbjct: 79 SEGSHKPVVTPAPTPAQMPSPTSFSDSIKQLAAETL 114


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
Bamb_2969  IGASERPTASE  34     9e-04    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 33.9 bits (77), Expect = 9e-04
Identities = 26/152 (17%), Positives = 41/152 (26%), Gaps = 18/152 (11%)

Query: 77 GQPVPQAAQPAPPNTAPGQAANQTQGGLLPEPQIVEVPPSGNANGS---NTTASNNATSG 133
Q V P N + + + VPP A S T A N+
Sbjct: 989 NQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQES 1048

Query: 134 NGVAVAPKPADNTPP-------------PKKTQQAQQQQQGGEDDLARFAAQKQAQQAAA 180
V + A T TQ + Q G E + K+
Sbjct: 1049 KTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEK 1108

Query: 181 QKQQQQQLAANTPKPAPSATSATSAAAAKPPT 212
+++ + + + P TS S + T
Sbjct: 1109 EEKAKVETEKT--QEVPKVTSQVSPKQEQSET 1138



Score = 29.6 bits (66), Expect = 0.019
Identities = 34/179 (18%), Positives = 54/179 (30%), Gaps = 21/179 (11%)

Query: 50 VAPPPTDTGASQPQQFDPNRALQGKTPGQPVPQAAQPAPPNTAPGQAANQTQGGLLPEPQ 109
V PP T + + N + KT + A + N + A + + + Q
Sbjct: 1025 VPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQN---REVAKEAKSNVKANTQ 1081

Query: 110 IVEVPPSGNANGSNTTASNNATSG-----------NGVAVAPKPADNTPPPKKTQQAQQQ 158
EV SG+ T T+ PK P ++ + Q
Sbjct: 1082 TNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQP 1141

Query: 159 QQGGEDDLARFAAQKQAQQAAAQKQQQQQLAANTPKPAPSATSATSAAAAKPPTANDAN 217
Q A A + + Q Q A+T +PA +S + T N N
Sbjct: 1142 Q-------AEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGN 1193



Score = 28.5 bits (63), Expect = 0.045
Identities = 28/174 (16%), Positives = 51/174 (29%), Gaps = 6/174 (3%)

Query: 51 APPPTDTGASQPQQFDPNRALQGKTPGQPVPQAAQPAPPNTAPGQAANQTQGGLLPEPQI 110
P + A P N + PVP A P T A N Q E
Sbjct: 997 ITTPNNIQADVPSV-PSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNE 1055

Query: 111 VEVPPSGNANGSNTTASNN----ATSGNGVA-VAPKPADNTPPPKKTQQAQQQQQGGEDD 165
+ + N + + T N VA + + K ++++ + +
Sbjct: 1056 QDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVE 1115

Query: 166 LARFAAQKQAQQAAAQKQQQQQLAANTPKPAPSATSATSAAAAKPPTANDANTG 219
+ + + KQ+Q + +PA + + T A+T
Sbjct: 1116 TEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTE 1169


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_2971  DHBDHDRGNASE  73     2e-17    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 73.2 bits (179), Expect = 2e-17
Identities = 48/184 (26%), Positives = 79/184 (42%), Gaps = 2/184 (1%)

Query: 7 VFITGASSGLGLAMAEEYARQGATLALVARRTDALDAFARRFPKLS--VSVYSADVRDAD 64
FITGA+ G+G A+A A QGA +A V + L+ + + ADVRD+
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSA 70

Query: 65 ALATAAASFIAAHGCPDVVIANAGISQGAVTGQGDLATFRDVMDINYYGMVATFEPFVGP 124
A+ A G D+++ AG+ + + + +N G+
Sbjct: 71 AIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSKY 130

Query: 125 MTAARHGTLVGVASVAGVRGLPGSGAYSASKSAAIKYLEALRVELRPAGVGVVTIAPGYI 184
M R G++V V S AY++SK+AA+ + + L +EL + ++PG
Sbjct: 131 MMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGST 190

Query: 185 RTPM 188
T M
Sbjct: 191 ETDM 194


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_2972  BINARYTOXINB  29     0.049    Binary toxin B family signature.
		>BINARYTOXINB#Binary toxin B family signature.

Length = 764

Score = 29.3 bits (65), Expect = 0.049
Identities = 25/95 (26%), Positives = 41/95 (43%), Gaps = 16/95 (16%)

Query: 217 DALIRYDVNPTYWGTKPKVDRLIYAITPDASVRMQK------VKAGECQIALSPKPQDVL 270
+A IRY VN GT P IY + P S+ + K +KA E Q++ P +
Sbjct: 390 NANIRY-VNT---GTAP-----IYNVLPTTSLVLGKNQTLATIKAKENQLSQILAPNNYY 440

Query: 271 AAKGESALKVVQTPAFMTAFVALN-TQKKPLDNEK 304
+K + + + F + + +N Q L+ K
Sbjct: 441 PSKNLAPIALNAQDDFSSTPITMNYNQFLELEKTK 475


73  Bamb_3055  Bamb_3069  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Bamb_3055  0       12       2.055664   chromate transporter
Bamb_3056  0       12       2.063213   chromate transporter
Bamb_3057  -1      14       2.038384   DNA-binding transcriptional activator GcvA
Bamb_3058  0       15       1.953901   uracil-xanthine permease
Bamb_3059  0       14       1.747875   flagellar hook-associated protein FlgL
Bamb_3060  0       16       1.302807   flagellar hook-associated protein FlgK
Bamb_3061  -2      16       1.088353   YcgR family protein
Bamb_3062  -1      18       0.360727   flagellar rod assembly protein/muramidase FlgJ
Bamb_3063  3       19       -0.197373  flagellar basal body P-ring protein
Bamb_3064  4       21       -0.832291  flagellar basal body L-ring protein
Bamb_3065  5       21       -0.676999  flagellar basal body rod protein FlgG
Bamb_3066  3       16       1.466168   flagellar basal body rod protein FlgF
Bamb_3067  3       16       1.480704   flagellar hook protein FlgE
Bamb_3068  1       14       2.874994   flagellar basal body rod modification protein
Bamb_3069  1       12       2.909444   flagellar basal body rod protein FlgC
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_3055  ACRIFLAVINRP  28     0.021    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 28.3 bits (63), Expect = 0.021
Identities = 19/62 (30%), Positives = 31/62 (50%), Gaps = 2/62 (3%)

Query: 110 YVQQGLMPVTAGLVAASAVLISEASNRTAIQWGITAACAVL-AWRTRIHPLWLLAAGALI 168
Y GL+ T GL A +A+LI E + + G A L A R R+ P+ + + ++
Sbjct: 925 YFMVGLL-TTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFIL 983

Query: 169 GL 170
G+
Sbjct: 984 GV 985


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Bamb_3059  FLAGELLIN  46     2e-07    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 46.2 bits (109), Expect = 2e-07
Identities = 56/390 (14%), Positives = 121/390 (31%), Gaps = 7/390 (1%)

Query: 15 QMNDQQAQLAQLYQQIASGVSLQTPADNPVGAAQAVQLSMTSATLSQYATNQTAALASLQ 74
+N Q+ L+ ++++SG+ + + D+ G A A + + L+Q + N ++ Q
Sbjct: 16 NLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQASRNANDGISIAQ 75

Query: 75 AEDQALQSVSGVLTGVQTLVVRAGDGSLADSDRSALATQLQGYRDQLMTLANSNDGAGNY 134
+ AL ++ L V+ L V+A +G+ +DSD ++ ++Q +++ ++N G
Sbjct: 76 TTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDRVSNQTQFNGVK 135

Query: 135 LFAGLNNSSAPFTSSPNGSVSY------VGDSGTRQVQIGDSSSVAQGDTGSAVFMSVPS 188
+ + N ++ +++ V G + GD S+
Sbjct: 136 VLSQDNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEATVGDLKSSFKNVTGY 195

Query: 189 LGSAPVPSAGAANTGTGTITAVTVTTPSAATNGHQFSIAFGGTPAAPTYTVTDNSAKPPT 248
A + + +G + T + T A T D +
Sbjct: 196 DTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAENNTAVDLFKTTKS 255

Query: 249 TTPAQAYTAGASIALGGGMTVAVSGTPAAGDTFAVTPGPQASGGADIFSTLDSMISALKT 308
T A A GG G +G + + +
Sbjct: 256 TAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTK-TGNDGNGKVSTTINGEKVTLTVAD 314

Query: 309 PVTGNPVAAAALSNALMTGSIKVGNTMRNVTTIQASVGGREQEVKAMQAVNQTASLQTTS 368
G AA + V N + + +++A AV + +
Sbjct: 315 ITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESKITVNG 374

Query: 369 NLTDLTSTNMTTTISQYLQVQNALTGAQKA 398
+ T++ +
Sbjct: 375 AEYTANAAGDKVTLAGKTMFIDKTASGVST 404


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
Bamb_3060  FLGHOOKAP1  218    3e-65    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 218 bits (557), Expect = 3e-65
Identities = 151/444 (34%), Positives = 236/444 (53%), Gaps = 15/444 (3%)

Query: 3 NTLMNLGVSGLNAALWGLTTTGQNISNAATPGYSVERPVYAEASGQYTSSGYMPQGVNTV 62
++L+N +SGLNAA L T NIS+ GY+ + + A+A+ + G++ GV
Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS 60

Query: 63 TVQRQYSQYLSDQLNSAQSQGGALSTWYSLVAQLNNYVGSPTAGISTAITGYFTGLQNVA 122
VQR+Y ++++QL +AQ+Q L+ Y +++++N + + T+ ++T + +FT LQ +
Sbjct: 61 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV 120

Query: 123 NNASDPSVRQTAISNAQVLADQLKAAGQQYDALRQSVNTQLTSTVSQINTYTSQIAQLNQ 182
+NA DP+ RQ I ++ L +Q K Q + VN + ++V QIN Y QIA LN
Sbjct: 121 SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND 180

Query: 183 QIA--AASSQGQPPNQLMDQRDLAVSNLSSLAGVQV-VRNDSGYSVFLAGGQPLVVADKS 239
QI+ G PN L+DQRD VS L+ + GV+V V++ Y++ +A G LV +
Sbjct: 181 QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA 240

Query: 240 YQLATVTSPSDPSELTVVSQGIAGANPPGPNQALPDTSLSGGTLGGLLAFRSQTLDPAQA 299
QLA V S +DPS TV N +P+ L+ G+LGG+L FRSQ LD +
Sbjct: 241 RQLAAVPSSADPSRTTVAYVDGTAG-----NIEIPEKLLNTGSLGGILTFRSQDLDQTRN 295

Query: 300 QLGALATSFAAQVNGQNALGIDLSGKPGGNLFAVANPAVYSNQGNTGTASLSVSFANAAQ 359
LG LA +FA N Q+ G D +G G + FA+ PAV N N G ++ + +A+
Sbjct: 296 TLGQLALAFAEAFNTQHKAGFDANGDAGEDFFAIGKPAVLQNTKNKGDVAIGATVTDASA 355

Query: 360 PTTSDYTLSYDGTNYTLTDRASGSVVGQSTSMPASIGGLAFS----FASGSMNAGDQFTV 415
+DY +S+D + +T AS + T P + G +AF +G+ D FT+
Sbjct: 356 VLATDYKISFDNNQWQVTRLASNTT---FTVTPDANGKVAFDGLELTFTGTPAVNDSFTL 412

Query: 416 LPTRGALNGFGLATTSGSAIAAAS 439
P A+ + T + IA AS
Sbjct: 413 KPVSDAIVNMDVLITDEAKIAMAS 436



Score = 86.5 bits (214), Expect = 7e-20
Identities = 57/166 (34%), Positives = 83/166 (50%), Gaps = 23/166 (13%)

Query: 521 TITSTTQPAPAGVMNGVTVTLSGAPSDGDSFTIGPYAGGT-------------------- 560
T T T +G+ +T +G P+ DSFT+ P +
Sbjct: 380 TFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAIVNMDVLITDEAKIAMASEED 439

Query: 561 ---SDGSNALALSQLVTAKSLGGGTTTLTGAYAKYVNAIGNTASQLKSSSAAQTSLVGQI 617
SD N AL L + GG + AYA V+ IGN + LK+SSA Q ++V Q+
Sbjct: 440 AGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGNKTATLKTSSATQGNVVTQL 499

Query: 618 TTAQQSVSGVNQNEEAANLMQYQQLYQANAKVIQTAATLFQTVLGL 663
+ QQS+SGVN +EE NL ++QQ Y ANA+V+QTA +F ++ +
Sbjct: 500 SNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Bamb_3062  FLGFLGJ  219    9e-72    Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 219 bits (558), Expect = 9e-72
Identities = 127/319 (39%), Positives = 174/319 (54%), Gaps = 41/319 (12%)

Query: 16 ALDVQGFDALRAQARQSPQAGAKAVAGQFDAMFTQMMLKSMRDATPDGGLLDSHTSKMYT 75
A D Q + L+A+A + P A + VA Q + MF QMMLKSMRDA P GL S +++YT
Sbjct: 12 AWDAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDGLFSSEHTRLYT 71

Query: 76 SMLDQQLAQQMSK-RGIGVADALMKQLMRNAGQGGGTAADVGAAGLGAAGLGAAGAGTSG 134
SM DQQ+AQQM+ +G+G+A+ ++KQ+ Q
Sbjct: 72 SMYDQQIAQQMTAGKGLGLAEMMVKQM--TPEQ------------------------PLP 105

Query: 135 NEGSLAAMNAMARAYANAANNGGLAGARGYSAGSALTPPLKGASGVQ----DADAFVDRL 190
E + AA N L S L + D+ AF+ +L
Sbjct: 106 EESTPAAPMKFPLETVVRYQNQAL---------SQLVQKAVPRNYDDSLPGDSKAFLAQL 156

Query: 191 AAPAQAASASTGIPARFIVGQAALESGWGKREIRASDGSTSYNVFGIKANKGWTGRTVSA 250
+ PAQ AS +G+P I+ QAALESGWG+R+IR +G SYN+FG+KA+ W G
Sbjct: 157 SLPAQLASQQSGVPHHLILAQAALESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEI 216

Query: 251 LTTEYVNGTPRRVVAKFRAYDSYEHAMTDYANLLKNNPRYAGVLSASRSVEGFAHGMQKA 310
TTEY NG ++V AKFR Y SY A++DY LL NPRYA V +A+ + E A +Q A
Sbjct: 217 TTTEYENGEAKKVKAKFRVYSSYLEALSDYVGLLTRNPRYAAVTTAASA-EQGAQALQDA 275

Query: 311 GYATDPNYAKKLISIMQQI 329
GYATDP+YA+KL +++QQ+
Sbjct: 276 GYATDPHYARKLTNMIQQM 294


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_3063  FLGPRINGFLGI  370    e-129    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 370 bits (952), Expect = e-129
Identities = 164/378 (43%), Positives = 221/378 (58%), Gaps = 21/378 (5%)

Query: 19 IAAALVLAACAF---GAPGAHAERLKDLAQIQGVRDNPLIGYGLVVGLDGTGDQTMQTPF 75
IAAALV +A F A R+KD+A +Q RDN LIGYGLVVGL GTGD +PF
Sbjct: 7 IAAALVFSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPF 66

Query: 76 TTQTLANMLANLGISINNGSANGGPSSLSNMQLKNVAAVMVTATLPPFARPGEALDVTVS 135
T Q++ ML NLGI+ G +N KN+AAVMVTA LPPFA PG +DVTVS
Sbjct: 67 TEQSMRAMLQNLGITTQGGQSN----------AKNIAAVMVTANLPPFASPGSRVDVTVS 116

Query: 136 SLGNAKSLRGGTLLLTPLKGADGQVYALAQGNMAVGGAGASANGSRVQVNQLAAGRIVGG 195
SLG+A SLRGG L++T L GADGQ+YA+AQG + V G A + + + + R+ G
Sbjct: 117 SLGDATSLRGGNLIMTSLSGADGQIYAVAQGALIVNGFSAQGDAATLTQGVTTSARVPNG 176

Query: 196 AIVERAVPNAIAQMNGVLQLQLNDMDYGTAQRIVSAVNS----SFGPGTATALDGRTIQL 251
AI+ER +P+ L LQL + D+ TA R+ VN+ +G A D + I +
Sbjct: 177 AIIERELPSKFKDSV-NLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAV 235

Query: 252 AAPADSAQQVAFMARLQNLDVSPDKAAAKVILNARTGSIVMNQMVTLQSCAVAHGNLSVV 311
P + MA ++NL V D AKV++N RTG+IV+ V + AV++G L+V
Sbjct: 236 QKPRVA-DLTRLMAEIENLTVETD-TPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQ 293

Query: 312 VNTQPVVSQPGPFSNGQTVVAQQSQIQLKQDNGALKMVTAGANLADVVKALNSLGATPAD 371
V P V QP PFS GQT V Q+ I Q+ + + G +L +V LNS+G
Sbjct: 294 VTESPQVIQPAPFSRGQTAVQPQTDIMAMQEGSKVA-IVEGPDLRTLVAGLNSIGLKADG 352

Query: 372 LMSILQAMKAAGALRADL 389
+++ILQ +K+AGAL+A+L
Sbjct: 353 IIAILQGIKSAGALQAEL 370


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_3064  FLGLRINGFLGH  215    5e-73    Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 215 bits (550), Expect = 5e-73
Identities = 129/222 (58%), Positives = 163/222 (73%), Gaps = 7/222 (3%)

Query: 14 AVCALAVAALAGCAQIPRDPIIQQPMTAQPPMPMSMQAPGSIY---NPGYAG-RPLFEDQ 69
A+ +L V +L GCA IP P++Q +AQP + A GSI+ P G +PLFED+
Sbjct: 10 AISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINYGYQPLFEDR 69

Query: 70 RPRNIGDILTIMIAENINATKSSGANTNRQGNTDFSVPTAG-FLGGLF--AKANMSAAGA 126
RPRNIGD LTI++ EN++A+KSS AN +R G T+F T +L GLF A+A++ A+G
Sbjct: 70 RPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNARADVEASGG 129

Query: 127 NKFAATGGASAANTFNGTITVTVTNVLPNGNLVVSGEKQMLINQGNEFVRFSGVVNPNTI 186
N F GGA+A+NTF+GT+TVTV VL NGNL V GEKQ+ INQG EF+RFSGVVNP TI
Sbjct: 130 NTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRFSGVVNPRTI 189

Query: 187 SGANSVYSTQVADAKIEYSSKGYINEAETMGWLQRFFLNIAP 228
SG+N+V STQVADA+IEY GYINEA+ MGWLQRFFLN++P
Sbjct: 190 SGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSP 231


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
Bamb_3065  FLGHOOKAP1  44     4e-07    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 43.8 bits (103), Expect = 4e-07
Identities = 9/42 (21%), Positives = 23/42 (54%)

Query: 220 EASNVNVVQELVNMIQTQRAYEINSKAVTTSDQMLQTVTQMK 261
S VN+ +E N+ + Q+ Y N++ + T++ + + ++
Sbjct: 505 SISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546



Score = 39.6 bits (92), Expect = 9e-06
Identities = 17/80 (21%), Positives = 33/80 (41%), Gaps = 14/80 (17%)

Query: 4 SLYIAATGMNAQQAQMDVISNNLANTSTNGFKASRAVFEDLLYQTIRQPGANSTQQTELP 63
+ A +G+NA QA ++ SNN+++ + G+ + + + L
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIM--------------AQANSTLG 48

Query: 64 SGLQLGTGVQQVATERLYTQ 83
+G +G GV +R Y
Sbjct: 49 AGGWVGNGVYVSGVQREYDA 68


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
Bamb_3066  FLGHOOKAP1  28     0.035    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 28.0 bits (62), Expect = 0.035
Identities = 9/34 (26%), Positives = 18/34 (52%)

Query: 4 LIYTAMTGASQSLDQQAIVANNLANASTTGFRAQ 37
LI AM+G + + +NN+++ + G+ Q
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQ 36


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
Bamb_3067  FLGHOOKAP1  36     2e-04    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 36.5 bits (84), Expect = 2e-04
Identities = 18/58 (31%), Positives = 25/58 (43%)

Query: 356 ISAPGSTNHGTLQGSALENSNVDLTSQLVNLITAQRNYQANAQTIKTQQTVDQTLINL 413
SA L S V+L + NL Q+ Y ANAQ ++T + LIN+
Sbjct: 488 SSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545



Score = 33.0 bits (75), Expect = 0.002
Identities = 20/78 (25%), Positives = 33/78 (42%), Gaps = 11/78 (14%)

Query: 6 GLSGLAGASNALDVIGNNIANANTVGFKSSTA----QFSDMYANSIATSVNTQIGIGTAL 61
+SGL A AL+ NNI++ N G+ T S + A +G G +
Sbjct: 7 AMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGG-------WVGNGVYV 59

Query: 62 SSVQQQFGQGTINTTNSS 79
S VQ+++ N ++
Sbjct: 60 SGVQREYDAFITNQLRAA 77


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
Bamb_3069  FLGHOOKAP1  27     0.032    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 26.8 bits (59), Expect = 0.032
Identities = 10/38 (26%), Positives = 17/38 (44%)

Query: 102 NVDPVQEMVNMISASRSYQANVETLNTAKTLMLKTLTI 139
V+ +E N+ + Y AN + L TA + + I
Sbjct: 508 GVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


74  Bamb_3104  Bamb_3114  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias   Product
Bamb_3104  0       11       4.555369  flagellar protein FhlB
Bamb_3105  1       12       3.393996  hypothetical protein
Bamb_3106  1       14       1.769747  hypothetical protein
Bamb_3107  2       12       2.393532  flagellar protein FliS
Bamb_3108  1       13       2.208108  flagellar hook-basal body complex subunit FliE
Bamb_3109  0       13       3.467260  flagellar MS-ring protein
Bamb_3110  0       11       3.560467  flagellar motor switch protein G
Bamb_3111  0       11       3.918552  flagellar assembly protein H
Bamb_3112  -1      10       3.392388  flagellar protein export ATPase FliI
Bamb_3113  0       10       2.979373  flagellar export protein FliJ
Bamb_3114  -1      9        2.871481  flagellar hook-length control protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_3104  TYPE3IMSPROT  61     3e-14    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 60.5 bits (147), Expect = 3e-14
Identities = 17/79 (21%), Positives = 32/79 (40%), Gaps = 2/79 (2%)

Query: 9 AAALVYDPKGGDAAPRVVAKGYGLVAEMIVARARDAGLYVHTAPEMV-SLLMQVDLDDRI 67
A ++Y G P V K + + A + G+ + + +L +D I
Sbjct: 268 AIGILYKR-GETPLPLVTFKYTDAQVQTVRKIAEEEGVPILQRIPLARALYWDALVDHYI 326

Query: 68 PPQLYQAVADLLAWLYSLD 86
P + +A A++L WL +
Sbjct: 327 PAEQIEATAEVLRWLERQN 345


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
Bamb_3108  FLGHOOKFLIE  65     3e-17    Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature.

Length = 103

Score = 65.1 bits (158), Expect = 3e-17
Identities = 46/112 (41%), Positives = 66/112 (58%), Gaps = 9/112 (8%)

Query: 8 ANVSGIGSVLQQMQAMAAQAGGGVASPTAALAGSGAATAGTFASAMKASLDKISGDQQHA 67
+ + GI V+ Q+QA A A + P + +FA + A+LD+IS Q A
Sbjct: 1 SAIQGIEGVISQLQATAMSARAQESLPQPTI---------SFAGQLHAALDRISDTQTAA 51

Query: 68 LGEAQAFEVGAANVSLNDVMVDMQKANIGFQFGLQVRNKLVSAYNDIMQMSV 119
+A+ F +G V+LNDVM DMQKA++ Q G+QVRNKLV+AY ++M M V
Sbjct: 52 RTQAEKFTLGEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 103


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_3109  FLGMRINGFLIF  477    e-165    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 477 bits (1229), Expect = e-165
Identities = 250/550 (45%), Positives = 364/550 (66%), Gaps = 23/550 (4%)

Query: 52 ISRMKGNPKLPFVIAVAFAIAAITALVLWSRAPDYRVLYSNLSDRDGGAIIAALQQANVP 111
++R++ NP++P ++A + A+A + A+VLW++ PDYR L+SNLSD+DGGAI+A L Q N+P
Sbjct: 16 LNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQMNIP 75

Query: 112 YKFADAGGAILVPSNQVHETRLKLAALGLPKGGSVGFELMDNQKFGISQFAEQVNYQRAL 171
Y+FA+ GAI VP+++VHE RL+LA GLPKGG+VGFEL+D +KFGISQF+EQVNYQRAL
Sbjct: 76 YRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQFSEQVNYQRAL 135

Query: 172 EGELQRTIESINAVRGARVHLAIPKPSVFVRDKEAPSASVFVDLYPGRVLDEGQVQAITR 231
EGEL RTIE++ V+ ARVHLA+PKPS+FVR++++PSASV V L PGR LDEGQ+ A+
Sbjct: 136 EGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALDEGQISAVVH 195

Query: 232 MVSSGVPDMPAKNVTIVDQDGNLLTQTASASG-LDASQLKYVQQVERNTQKRIDSILAPI 290
+VSS V +P NVT+VDQ G+LLTQ+ ++ L+ +QLK+ VE Q+RI++IL+PI
Sbjct: 196 LVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDLNDAQLKFANDVESRIQRRIEAILSPI 255

Query: 291 FGTGNARSQVSADIDFSKLEQTSESYSPNGTPQQAAIRSQQTSSATELAQGGASGVPGAL 350
G GN +QV+A +DF+ EQT E YSPNG +A +RS+Q + + ++ G GVPGAL
Sbjct: 256 VGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVGAGYPGGVPGAL 315

Query: 351 SNTPPQPASAPIVA-----GNGQN---------GAQSTPVSDRKDQTTNYELDKTIRHVE 396
SN P P API N QN + P S ++++T+NYE+D+TIRH +
Sbjct: 316 SNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSNYEVDRTIRHTK 375

Query: 397 QPMGNVKRLSVAVVVNYQPVADAKGHVTMQPLPPPKLAQVEQLVKDAMGYDEKRGDSVNV 456
+G+++RLSVAVVVNY+ +AD K PL ++ Q+E L ++AMG+ +KRGD++NV
Sbjct: 376 MNVGDIERLSVAVVVNYKTLADGKP----LPLTADQMKQIEDLTREAMGFSDKRGDTLNV 431

Query: 457 VNSAFSSVGDPYADLPWWRQPDMIAMAKEAAKWLGIAAAAAALYFMFVRPAMRRAFPPPE 516
VNS FS+V + +LP+W+Q I A +WL + A L+ VRP + R +
Sbjct: 432 VNSPFSAVDNTGGELPFWQQQSFIDQLLAAGRWLLVLVVAWILWRKAVRPQLTRRVEEAK 491

Query: 517 PVAPALAAPDDPVALDGLPAPDRADEPDPLLLGFENEKNRYERNLDYARTIARQDPKIVA 576
+ + + R + + L N++ E R ++ DP++VA
Sbjct: 492 AAQEQAQVRQE--TEEAV--EVRLSKDEQLQQRRANQRLGAEVMSQRIREMSDNDPRVVA 547

Query: 577 TVVKNWVSDE 586
V++ W+S++
Sbjct: 548 LVIRQWMSND 557


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_3110  FLGMOTORFLIG  297    e-102    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 297 bits (763), Expect = e-102
Identities = 113/324 (34%), Positives = 188/324 (58%)

Query: 5 GLTKSALLLMSIGEEEAAEVFKFLAPREVQKIGAAMAALKNVTREQVEGVLQEFAKEAEQ 64
G K+A+LL+SIG E +++VFK+L+ E++ + +A L+ +T E + VL EF +
Sbjct: 17 GKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFKELMMA 76

Query: 65 HTALSLDSSEYIRSVLTKALGEDKAGVLIDRILQGSDTSGIEGLKWMDSGAVAELIKNEH 124
+ +Y R +L K+LG KA +I+ + + E ++ D + I+ EH
Sbjct: 77 QEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFVRRADPANILNFIQQEH 136

Query: 125 PQIIATILVHLDRDQASEIASCFTERLRNDVLLRIATLDGIQPAALRELDDVLTGLLSGS 184
PQ IA IL +LD +AS I S ++ +V RIA +D P +RE++ VL L+
Sbjct: 137 PQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVLEKKLASL 196

Query: 185 DNLKRSPMGGIRTAAEILNFMTSNHEEGVLESVRQYDADLAQKIVDQMFVFENLLDLEDR 244
+ + GG+ EI+N E+ ++ES+ + D +LA++I +MFVFE+++ L+DR
Sbjct: 197 SSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFEDIVLLDDR 256

Query: 245 AIQMVLKEVESETLIIALKGAPPALRQKFLANMSQRAAELLAEDLDARGPVRVSEVETQQ 304
+IQ VL+E++ + L ALK +++K NMS+RAA +L ED++ GP R +VE Q
Sbjct: 257 SIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTRRKDVEESQ 316

Query: 305 RRILQIVRNLAESGQIVIGGKAED 328
++I+ ++R L E G+IVI E+
Sbjct: 317 QKIVSLIRKLEEQGEIVISRGGEE 340


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Bamb_3111  FLGFLIH  112    9e-33    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 112 bits (281), Expect = 9e-33
Identities = 71/213 (33%), Positives = 114/213 (53%), Gaps = 10/213 (4%)

Query: 15 YQRWEMASFDPPPPPPPPDDA------AAAAAALAEELQRVRDAAHAEGHAAGHVEGQAL 68
++ W PP P A +L ++L +++ AH +G+ AG EG+
Sbjct: 7 WKTWTPDDLAPPQAEFVPIVEPEETIIEEAEPSLEQQLAQLQMQAHEQGYQAGIAEGRQQ 66

Query: 69 GYQAGFDQGREQGFEAGQAEAREQAAQLAA----LAASFREAVSSAEHDLASDLAQLALD 124
G++ G+ +G QG E G AEA+ Q A + A L + F+ + + + +AS L Q+AL+
Sbjct: 67 GHKQGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRLMQMALE 126

Query: 125 IAQQVVRQHVKHDPAALVAAVRDVLAAEPALSGAPHLAVNPADLPVVEAYLQDDLDTLGW 184
A+QV+ Q D +AL+ ++ +L EP SG P L V+P DL V+ L L GW
Sbjct: 127 AARQVIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGATLSLHGW 186

Query: 185 TVRTDTSIERGGCRAHAATGEVDATLPTRWQRV 217
+R D ++ GGC+ A G++DA++ TRWQ +
Sbjct: 187 RLRGDPTLHPGGCKVSADEGDLDASVATRWQEL 219


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Bamb_3113  FLGFLIJ  64     1e-15    Flagellar FliJ protein signature.
		>FLGFLIJ#Flagellar FliJ protein signature.

Length = 147

Score = 63.7 bits (154), Expect = 1e-15
Identities = 44/140 (31%), Positives = 73/140 (52%)

Query: 1 MAHGFPLQLLLDRAQEDLDTATKQLGTAQRDRTAAAEQLDALLRYRDEYHARFSQSAQHG 60
MA L L D A+++++ A + LG +R A EQL L+ Y++EY + G
Sbjct: 1 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG 60

Query: 61 MPAGNWRNFQAFIDTLDAAIAQQRTVLAAAELRIDEARPNWQQKKRTVGSYETLQARGVA 120
+ + W N+Q FI TL+ AI Q R L ++D A +W++KK+ + +++TLQ R
Sbjct: 61 ITSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQST 120

Query: 121 QETQRAAKREQRDADEHAAK 140
+ +Q+ DE A +
Sbjct: 121 AALLAENRLDQKKMDEFAQR 140


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
Bamb_3114  FLGHOOKFLIK  66     4e-14    Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 66.4 bits (161), Expect = 4e-14
Identities = 83/326 (25%), Positives = 119/326 (36%), Gaps = 6/326 (1%)

Query: 92 TDDANATPNADAAALAAAAAVQAQLQARVNDAAPAGAAADTAATAAQTTAVSGQPDATAA 151
T D + T A AA A L + AA A G+P +
Sbjct: 9 TADVDTTTLPGGKASDAAQDFLALLSEALAGETTTDKAAPQLLVATDKPTTKGEPLISDI 68

Query: 152 LTNHASKDAAAEPALPASGREALQAALAKLTGGAGAIAMPATGTAAATAATAPASTTSAS 211
+++ + Q+ LT A +
Sbjct: 69 VSDAQQANLLIPVDETPPVINDEQSTSTPLTTAQTMALAAVADKNTTKDEKADDLNEDVT 128

Query: 212 AAAAPLTPKVPTFDRTLADAKGALAPQPT------PTQATAQALQAGAAGQPAAHALAAT 265
A+ + L +P FD T PT + Q A P A T
Sbjct: 129 ASLSALFAMLPGFDNTPKVTDAPSTVLPTEKPTLFTKLTSEQLTTAQPDDAPGTPAQPLT 188

Query: 266 EEAASPAADASVAAAATAAAAAQANLQASPAASSVAAANAHVLAPHVGTADWTDALSQKV 325
A + A V + + AA + L + A VL+ +G+ +W +LSQ +
Sbjct: 189 PLVAEAQSKAEVISTPSPVTAAASPLITPHQTQPLPTVAAPVLSAPLGSHEWQQSLSQHI 248

Query: 326 VFLSNAHQQSAELTLNPPDLGPLQVVLRVADNHAHALFVSQHAQVRDAVEAALPKLREAM 385
+ QQSAEL L+P DLG +Q+ L+V DN A VS H VR A+EAALP LR +
Sbjct: 249 SLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAALPVLRTQL 308

Query: 386 EAGGLGLGSATVSDGGFASQQQNPQQ 411
G+ LG + +S F+ QQQ Q
Sbjct: 309 AESGIQLGQSNISGESFSGQQQAASQ 334


75  Bamb_3139  Bamb_3152  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Bamb_3139  -1      12        0.240076  ATP-dependent protease ATP-binding subunit HslU
Bamb_3140   1      12        1.003041  response regulator receiver protein
Bamb_3141   0      13        1.374399  sensor signal transduction histidine kinase
Bamb_3142   0      15        0.033985  hypothetical protein
Bamb_3143  -1      15       -0.356624  acetylglutamate kinase
Bamb_3144  -1      18       -0.282806  pyrimidine 5'-nucleotidase
Bamb_3145  -1      21       -2.301072  nucleoid occlusion protein
Bamb_3146   0      22       -2.213541  hypothetical protein
Bamb_3147   5      19       -2.990832  outer membrane efflux protein
Bamb_3148   6      20       -3.256783  HlyD family type I secretion membrane fusion
Bamb_3149   6      20       -3.359964  ABC transporter-like protein
Bamb_3150   7      21       -3.613702  hypothetical protein
Bamb_3151   6      20       -2.732561  OmpA/MotB domain-containing protein
Bamb_3152   6      19       -2.381907  YadA domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Bamb_3139  HTHFIS  31     0.012    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.6 bits (69), Expect = 0.012
Identities = 12/36 (33%), Positives = 18/36 (50%), Gaps = 3/36 (8%)

Query: 49 TPKNILMIGPTGVGKTEIAR---RLAKLADAPFVKI 81
T +++ G +G GK +AR K + PFV I
Sbjct: 159 TDLTLMITGESGTGKELVARALHDYGKRRNGPFVAI 194


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Bamb_3140  HTHFIS  90     1e-23    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 90.3 bits (224), Expect = 1e-23
Identities = 30/127 (23%), Positives = 61/127 (48%)

Query: 1 MSENNFLVIDDNEVFAGTLARGLERRGYAVQQAHDKEAALRLAAGGKFQFITVDLHLGED 60
M+ LV DD+ L + L R GY V+ + R A G + D+ + ++
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 SGLSLIAPLCDLQPDARILVLTGYASIATAVQAVKEGADNYLAKPANIESILAALQTNAS 120
+ L+ + +PD +LV++ + TA++A ++GA +YL KP ++ ++ + +
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 121 EVQADEA 127
E + +
Sbjct: 121 EPKRRPS 127



Score = 44.4 bits (105), Expect = 7e-08
Identities = 15/101 (14%), Positives = 32/101 (31%), Gaps = 3/101 (2%)

Query: 75 DARILVLTGYASIATAVQAVKEGADNYLAKPANIESILAALQTNASEVQADEALENPVVL 134
I+ + I + L+ +E + + + L
Sbjct: 375 TREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGL---YDR 431

Query: 135 SVDRLEWEHIQRVLAENNNNISATARALNMHRRTLQRKLAK 175
+ +E+ I L N A L ++R TL++K+ +
Sbjct: 432 VLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRE 472


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_3143  CARBMTKINASE  43     6e-07    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 43.3 bits (102), Expect = 6e-07
Identities = 26/99 (26%), Positives = 48/99 (48%), Gaps = 6/99 (6%)

Query: 225 IPVISPIGFGEDGLSYNINADLVAGKLATVLNAEKLLMMTNIPGVM----DKDGNLLTDL 280
+PVI G G+ I+ DL KLA +NA+ +++T++ G + L ++
Sbjct: 197 VPVILEDG-EIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVNGAALYYGTEKEQWLREV 255

Query: 281 SAREIDALFEDGT-ISGGMLPKISSALDAAKSGVKSVHI 318
E+ +E+G +G M PK+ +A+ + G + I
Sbjct: 256 KVEELRKYYEEGHFKAGSMGPKVLAAIRFIEWGGERAII 294



Score = 36.7 bits (85), Expect = 9e-05
Identities = 21/60 (35%), Positives = 27/60 (45%), Gaps = 10/60 (16%)

Query: 76 GKTVVIKYGGNAMTEERLKQGF----------ARDVILLKLVGINPVIVHGGGPQIDQAL 125
GK VVI GGNA+ + K + AR + + G VI HG GPQ+ L
Sbjct: 2 GKRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSLL 61


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Bamb_3145  HTHTETR  46     2e-08    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 45.8 bits (108), Expect = 2e-08
Identities = 30/154 (19%), Positives = 53/154 (34%), Gaps = 15/154 (9%)

Query: 16 EKITTAALAARLDVSEAALYRHFASKAKMYEGLIEFIEQALFGLVNQIVAKEPNGVLQA- 74
+ +A V+ A+Y HF K+ ++ + E E + L + AK P L
Sbjct: 30 SSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQAKFPGDPLSVL 89

Query: 75 RTIALTMLNFTAKNPGMTRVL----TGEALIGEDERLTERVNQLLDRIEATVKQCLRVAR 130
R I + +L T ++ +GE + + L ++Q L+
Sbjct: 90 REILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCLESYDRIEQTLKHCI 149

Query: 131 TEANAPQDGAAPFTLPADYDPAARASLLVSYVIG 164
LPAD A ++ Y+ G
Sbjct: 150 EAKM----------LPADLMTRRAAIIMRGYISG 173


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Bamb_3148  RTXTOXIND  250    1e-80    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 250 bits (639), Expect = 1e-80
Identities = 89/455 (19%), Positives = 188/455 (41%), Gaps = 59/455 (12%)

Query: 14 APRLRAGDAAYMSDIREALLVRSSAGAQLILYLIAIVLGAGLVWAHFARVEEVTRSEATV 73
P + ++ E + S +L+ Y I L + + +VE V + +
Sbjct: 31 TPVREKDENEFLPAHLELIETPVSRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKL 90

Query: 74 VSPSREQLIQSLEGGIVQSVAVREGEVVEKGQLLAKIDPARAQSSYREVLTKALELKASV 133
R + I+ +E IV+ + V+EGE V KG +L K+ A++ + + L+ +
Sbjct: 91 THSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQ 150

Query: 134 SRARAEAYGV------PLDFP----------EDVKRESGLVAQATATYRARRR------- 170
+R + + + L P E+V R + L+ + +T++ ++
Sbjct: 151 TRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLD 210

Query: 171 -------ALDEQVTALEKSQALVRREIAMSEPLAAKGLVSEVEILRMRRQSTDIAAQIAE 223
+ ++ E + + + L K +++ +L + + ++
Sbjct: 211 KKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRV 270

Query: 224 RRSR---------------------FTAEASTELSRLEQELAQTNEVLAGRADVLARTDV 262
+S+ F E +L + + LA + + +
Sbjct: 271 YKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVI 330

Query: 263 VAPMRGVVKNIRIRTAGGVVQSGEHIMEIAPLDGRVLVEARIKPSDVAFLRPGLPVLVKL 322
AP+ V+ +++ T GGVV + E +M I P D + V A ++ D+ F+ G ++K+
Sbjct: 331 RAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKV 390

Query: 323 SAYDFSIYGGLHGHVVSLGPDTLKDDQKAAMGRPDANYYRLMVETDSDALAAAGKRLPVL 382
A+ ++ YG L G V ++ D ++D + + +++ + + L+ K +P+
Sbjct: 391 EAFPYTRYGYLVGKVKNINLDAIEDQR-------LGLVFNVIISIEENCLSTGNKNIPLS 443

Query: 383 PGMQATVDIRTGEKTVLDYLLKPIF-KAREAFRER 416
GM T +I+TG ++V+ YLL P+ E+ RER
Sbjct: 444 SGMAVTAEIKTGMRSVISYLLSPLEESVTESLRER 478


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Bamb_3150  PF07675  32     0.022    Cleaved Adhesin
		>PF07675#Cleaved Adhesin

Length = 1358

Score = 31.6 bits (71), Expect = 0.022
Identities = 26/103 (25%), Positives = 44/103 (42%), Gaps = 8/103 (7%)

Query: 36 NDSKPTVSGRGDPGSTIHLLVDGVEVGSVVVGANGTWSVALTQPL-NDGEYRLTARASND 94
++ + S + GS + + DGV G+ V A+G +V +T+ + +G Y + SN
Sbjct: 259 PQNQASYSIQASAGSYVAISKDGVLYGTGVANASGVATVNMTKQITENGNYDVVITRSN- 317

Query: 95 VGMSVPSTSYGIQVDVTPPSQP--KIEAATEGAQPTLSGHAEA 135
IQ P QP + A +G + TL A +
Sbjct: 318 ----YLPVIKQIQAGEPSPYQPVSNLTATAQGQKVTLKWDAPS 356


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
Bamb_3151  OMPADOMAIN  105    8e-29    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 105 bits (262), Expect = 8e-29
Identities = 48/142 (33%), Positives = 69/142 (48%), Gaps = 12/142 (8%)

Query: 112 AQPQPVVQAQPAAEPVAQ-RHVLLQGSANFAFDSAALTPSARQELDRFLD--VNREARFR 168
+ PVV PA P Q +H L+ F F+ A L P + LD+ N + +
Sbjct: 194 GEAAPVVAPAPAPAPEVQTKHFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDG 253

Query: 169 RVTVTGYTDSQGAHAHNVRLSEARARAVATYLRTGGLHAEHFTTVGKGAAEPVASNATAE 228
V V GYTD G+ A+N LSE RA++V YL + G+ A+ + G G + PV N
Sbjct: 254 SVVVLGYTDRIGSDAYNQGLSERRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDN 313

Query: 229 GR---------AQNRRVEIELE 241
+ A +RRVEIE++
Sbjct: 314 VKQRAALIDCLAPDRRVEIEVK 335


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Bamb_3152  OMADHESIN  75     3e-15    Yersinia outer membrane adhesin signature.
		>OMADHESIN#Yersinia outer membrane adhesin signature.

Length = 455

Score = 74.5 bits (182), Expect = 3e-15
Identities = 66/172 (38%), Positives = 102/172 (59%), Gaps = 16/172 (9%)

Query: 676 GAGGLNAIAVGLQAVASSDHSVAIGSIAQTGVDQPYSVAMGSMVTTNGAGALAIGSRAKA 735
GAGGLNA A G+ HS+AIG+ A+ + +VA+G+ G ++AIG +KA
Sbjct: 59 GAGGLNASAKGI-------HSIAIGATAEAA--KGAAVAVGAGSIATGVNSVAIGPLSKA 109

Query: 736 NADNAVAVGNNGVAAVGKSSIAIGDKAMTAAGTVNSVAMGKSANVAQNVTDAIALGANAS 795
D+AV G A K +AIG +A T+ VA+G ++ + +++A+G ++
Sbjct: 110 LGDSAVTYGAASTAQ--KDGVAIGARASTSD---TGVAVGFNSKA--DAKNSVAIGHSSH 162

Query: 796 VASGNNGGIALGANSVADRGNALSVGSNSLQRQIVNVAKGTKNNDAVNVSQL 847
VA+ + IA+G S DR N++S+G SL RQ+ ++A GTK+ DAVNV+QL
Sbjct: 163 VAANHGYSIAIGDRSKTDRENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQL 214



Score = 70.7 bits (172), Expect = 6e-14
Identities = 66/177 (37%), Positives = 102/177 (57%), Gaps = 9/177 (5%)

Query: 58 AIGASASTRPATSGLGGAVAIGNKATAAGNNVAFGANASALGEEGAIALGTGANAGGKSS 117
A+G RP G GG N + +++A GA A A + A+A+G G+ A G +S
Sbjct: 46 ALGLEYPVRPPVPGAGGL----NASAKGIHSIAIGATAEA-AKGAAVAVGAGSIATGVNS 100

Query: 118 IALGNEAQATGWGAYALGRKAKAAAESSLAIGDSSMATRAGTMAIGSQAAAAAENAIAIG 177
+A+G ++A G A G A A + +AIG + + G +A+G + A A+N++AIG
Sbjct: 101 VAIGPLSKALGDSAVTYG-AASTAQKDGVAIGARASTSDTG-VAVGFNSKADAKNSVAIG 158

Query: 178 QSA--AARGADSLALGSYSEADRDNTVSVGTAGFERQIVNVGRGTQATDAVNIAQLK 232
S+ AA S+A+G S+ DR+N+VS+G RQ+ ++ GT+ TDAVN+AQLK
Sbjct: 159 HSSHVAANHGYSIAIGDRSKTDRENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQLK 215



Score = 68.0 bits (165), Expect = 4e-13
Identities = 58/161 (36%), Positives = 89/161 (55%), Gaps = 7/161 (4%)

Query: 3122 GLQAVSNSDHSVAIGSIAQTGVDQPYAVAMGSMVTTNGAGALAIGSRAKANADNAVAVGN 3181
GL A + HS+AIG+ A+ + AVA+G+ G ++AIG +KA D+AV G
Sbjct: 62 GLNASAKGIHSIAIGATAEAA--KGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGA 119

Query: 3182 NGVVAVGKSSIAIGDKAMTNAGTVNSIAIGTNANVQQNVADAIALGANSQALSSNSVALG 3241
K +AIG +A T+ +A+G N+ + AI ++ A S+A+G
Sbjct: 120 ASTAQ--KDGVAIGARASTSD---TGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIG 174

Query: 3242 ANSIANRANALSIGKAGAERQIVNVAKGTQDTDAVSLAQLK 3282
S +R N++SIG RQ+ ++A GT+DTDAV++AQLK
Sbjct: 175 DRSKTDRENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQLK 215



Score = 48.0 bits (113), Expect = 7e-07
Identities = 40/136 (29%), Positives = 63/136 (46%), Gaps = 3/136 (2%)

Query: 3981 GSGSNLIGGTGGSGKDSAEIVAAKPGNGNGNIAVGSGSQIVDGKNNAAAIGAGSKVSADN 4040
G S IG + DSA A +A+G+ + D A+G SK A N
Sbjct: 97 GVNSVAIGPLSKALGDSAVTYGAASTAQKDGVAIGARASTSD---TGVAVGFNSKADAKN 153

Query: 4041 GTALGQGASVSSGADNSVALGQGSQATEANTVSVGSDGHERRIVNVADGVKATDAVSKGQ 4100
A+G + V++ S+A+G S+ N+VS+G + R++ ++A G K TDAV+ Q
Sbjct: 154 SVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQ 213

Query: 4101 FDRALGGMQGQINDIS 4116
+ + Q N S
Sbjct: 214 LKKEIEKTQENTNKRS 229



Score = 43.7 bits (102), Expect = 1e-05
Identities = 30/57 (52%), Positives = 37/57 (64%)

Query: 2965 GREAIARGPESVAIGANAWATRPQAMALGSGSRANGVNSVAIGYNSVADDDNTVAVG 3021
G A A+G S+AIGA A A + A+A+G+GS A GVNSVAIG S A D+ V G
Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYG 118



Score = 43.3 bits (101), Expect = 2e-05
Identities = 33/98 (33%), Positives = 54/98 (55%), Gaps = 2/98 (2%)

Query: 2976 VAIGANAWATRPQAMALGSGSR--ANGVNSVAIGYNSVADDDNTVAVGNVGEERRVVHLA 3033
VA+G N+ A ++A+G S AN S+AIG S D +N+V++G+ R++ HLA
Sbjct: 141 VAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHESLNRQLTHLA 200

Query: 3034 AGVDDTDAVNMRQLTDAMHSANTKLDAKMTRMVRDVES 3071
AG DTDAVN+ QL + + + ++ + +
Sbjct: 201 AGTKDTDAVNVAQLKKEIEKTQENTNKRSAELLANANA 238



Score = 41.4 bits (96), Expect = 8e-05
Identities = 56/218 (25%), Positives = 93/218 (42%), Gaps = 17/218 (7%)

Query: 3223 AIALGANSQALSSNSVALGANSIANRANALSIG---KAGAERQIVNVAKGTQDTDAVSLA 3279
+IA+GA ++A +VA+GA SIA N+++IG KA + + A T D V++
Sbjct: 72 SIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKDGVAIG 131

Query: 3280 QLKGLADVVAGGTGFDKNGDVTAPTYTIDGKEYHNVNDALQAAAKSGGDGSSGTDPNAVA 3339
+D G N A G H + + A GD S N+V+
Sbjct: 132 ARASTSDT---GVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAI--GDRSKTDRENSVS 186

Query: 3340 YDGELKDKVTLAGQNGTTLSNVAAGKADTDAVNVSQLKSSGLVGEDGKSRAAVTYDKNTD 3399
E ++ L+++AAG DTDAVNV+QLK ++ ++ + N +
Sbjct: 187 IGHESLNR---------QLTHLAAGTKDTDAVNVAQLKKEIEKTQENTNKRSAELLANAN 237

Query: 3400 GTPNYKSATLAGEGGTTLTNVKAGALSATSTDAVNGSQ 3437
+ KS+++ G + A L +A S+
Sbjct: 238 AYADNKSSSVLGIANNYTDSKSAETLENARKEAFAQSK 275



Score = 37.6 bits (86), Expect = 0.001
Identities = 26/59 (44%), Positives = 39/59 (66%)

Query: 527 AAGMQANALGKNSVAIGSQANATNVDTLAIGSGARASGVNSIAIGVNSVAADANTVSIG 585
A G+ A+A G +S+AIG+ A A +A+G+G+ A+GVNS+AIG S A + V+ G
Sbjct: 60 AGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYG 118



Score = 35.6 bits (81), Expect = 0.004
Identities = 33/115 (28%), Positives = 60/115 (52%), Gaps = 17/115 (14%)

Query: 537 KNSVAIGSQANATNVDTLAIGSGARASGVNSIAIGVNS-VAAD---------------AN 580
K+ VAIG++A+ ++ +A+G ++A NS+AIG +S VAA+ N
Sbjct: 125 KDGVAIGARASTSDTG-VAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDREN 183

Query: 581 TVSIGDVGATRRIVNVSDGVDDTDAVNMKQLTNVMHSANTKLDAKMTRMVRDVES 635
+VSIG R++ +++ G DTDAVN+ QL + + + ++ + +
Sbjct: 184 SVSIGHESLNRQLTHLAAGTKDTDAVNVAQLKKEIEKTQENTNKRSAELLANANA 238



Score = 34.9 bits (79), Expect = 0.008
Identities = 52/215 (24%), Positives = 89/215 (41%), Gaps = 9/215 (4%)

Query: 785 TDAIALGANASVASGNNGGIALGANSVADRGNALSVGSNSLQRQIVNVAKGTKNNDAVNV 844
+IA+GA A A G +A+GA S+A N++++G S V G A +
Sbjct: 70 IHSIAIGATAEAAKG--AAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYG-----AAST 122

Query: 845 SQLTGVTNALGGGAGIGTDGNITAPTYKVGDTTYNNVGDALDAMAKNGGSDPNAVSYDSA 904
+Q GV A+G A G K +G + A +G S +
Sbjct: 123 AQKDGV--AIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTD 180

Query: 905 TKDKVTLAGGATGTTLSNVKAGTADMDAVNVSQLKSSGLIGEDGKSLAAITYDKNTDGTP 964
++ V++ + L+++ AGT D DAVNV+QLK ++ + + N +
Sbjct: 181 RENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQLKKEIEKTQENTNKRSAELLANANAYA 240

Query: 965 NYKSATLAGEGGTTLTNVKAGALSATSTDAVNGSQ 999
+ KS+++ G + A L +A S+
Sbjct: 241 DNKSSSVLGIANNYTDSKSAETLENARKEAFAQSK 275


76  Bamb_3165  Bamb_3171  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias   Product
Bamb_3165  -2      12       0.567366  rod shape-determining protein MreB
Bamb_3166  -3      11       1.545086  rod shape-determining protein MreC
Bamb_3167  -2      12       0.916663  rod shape-determining protein MreD
Bamb_3168  -2      12       0.653551  peptidoglycan glycosyltransferase
Bamb_3169  -2      10       0.554600  rod shape-determining protein RodA
Bamb_3170  -2      10       0.872471  Sel1 domain-containing protein
Bamb_3171  -1      11       0.066736  2-dehydro-3-deoxyglucarate aldolase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_3165  SHAPEPROTEIN  504    0.0      Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 504 bits (1300), Expect = 0.0
Identities = 247/348 (70%), Positives = 294/348 (84%), Gaps = 2/348 (0%)

Query: 1 MFGFLRSYFSNDLAIDLGTANTLIYMRGKGIVLDEPSVVSIRQEGGPNGKKTIQAVGKEA 60
M R FSNDL+IDLGTANTLIY++G+GIVL+EPSVV+IRQ+ K++ AVG +A
Sbjct: 1 MLKKFRGMFSNDLSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRA-GSPKSVAAVGHDA 59

Query: 61 KQMLGKVPGNIEAIRPMKDGVIADFTVTEQMIKQFIKTAHESRMFSPSPRIIICVPCGST 120
KQMLG+ PGNI AIRPMKDGVIADF VTE+M++ FIK H + PSPR+++CVP G+T
Sbjct: 60 KQMLGRTPGNIAAIRPMKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGAT 119

Query: 121 QVERRAIKEAAHGAGASQVYLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVGVISLG 180
QVERRAI+E+A GAGA +V+LIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEV VISL
Sbjct: 120 QVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLN 179

Query: 181 GIVYKGSVRVGGDKFDEAIVNYIRRNYGMLIGEQTAEAIKKEIGSAFPGSEVKEMEVKGR 240
G+VY SVR+GGD+FDEAI+NY+RRNYG LIGE TAE IK EIGSA+PG EV+E+EV+GR
Sbjct: 180 GVVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGR 239

Query: 241 NLSEGIPRSFTISSNEILEALTDPLNQIVSSVKIALEQTPPELGADIAERGMMLTGGGAL 300
NL+EG+PR FT++SNEILEAL +PL IVS+V +ALEQ PPEL +DI+ERGM+LTGGGAL
Sbjct: 240 NLAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGAL 299

Query: 301 LRDLDRLLAEETGLPVLVAEDPLTCVVRGSGMALERMDKL-GSIFSYE 347
LR+LDRLL EETG+PV+VAEDPLTCV RG G ALE +D G +FS E
Sbjct: 300 LRNLDRLLMEETGIPVVVAEDPLTCVARGGGKALEMIDMHGGDLFSEE 347


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
Bamb_3166  IGASERPTASE  31     0.009    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 30.8 bits (69), Expect = 0.009
Identities = 12/79 (15%), Positives = 26/79 (32%), Gaps = 5/79 (6%)

Query: 276 QNDVPPRPAEPEPAADKKGKKGAKAAAKGE-KAEKAEKADANAKPAAAAAPGAKP----A 330
N+V +E + + K+ A + + K E + + + + +
Sbjct: 1082 TNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQP 1141

Query: 331 PAAPAAPAQPAAAAAKPAA 349
A PA P +P +
Sbjct: 1142 QAEPARENDPTVNIKEPQS 1160


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Bamb_3168  OMADHESIN  30     0.047    Yersinia outer membrane adhesin signature.
		>OMADHESIN#Yersinia outer membrane adhesin signature.

Length = 455

Score = 29.9 bits (66), Expect = 0.047
Identities = 23/84 (27%), Positives = 36/84 (42%), Gaps = 2/84 (2%)

Query: 639 QNPNNEAAAVAAAASATEPVSAPVVGDASKPAAVAAGFTALPQPVVPTAASAASAA--DA 696
+ P A + A+A ++ +A+K AAVA G ++ V A S A D+
Sbjct: 54 RPPVPGAGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDS 113

Query: 697 ASAPDASSAAQPSDASAAAPMAAS 720
A A+S AQ + A + S
Sbjct: 114 AVTYGAASTAQKDGVAIGARASTS 137


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Bamb_3171  PHPHTRNFRASE  37     6e-05    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 37.1 bits (86), Expect = 6e-05
Identities = 33/171 (19%), Positives = 54/171 (31%), Gaps = 34/171 (19%)

Query: 87 RALDAGARTLMFPCIETADEAAHAVRLTRFPSPDSPDGLRGVAGMVRAAAFGMRRDYLQT 146
RA G +MFP I T +E LR +++ + + +
Sbjct: 380 RASTYGNLKVMFPMIATLEE------------------LRQAKAIMQEEKDKLLSEGVDV 421

Query: 147 ANAQIAVIVQIESARGIDEVERIAATPGVDCLYVGPADLA----------ASLGHLGDSR 196
++ I V + +E A VD +G DL + +L
Sbjct: 422 SD-SIEVGIMVEIPSTAVAANLFA--KEVDFFSIGTNDLIQYTMAADRMNERVSYLYQPY 478

Query: 197 HPDVETAMARVLAAGKQAGVAVGI---FASDTAIARQYREAGYRMITLSAD 244
HP + + V+ A G VG+ A D G ++SA
Sbjct: 479 HPAILRLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLDEFSMSAT 529



 
Contact Sachin Pundhir for Bugs/Comments.