PredictBias

Identification of genomic and pathogenicity islands in prokaryotic genomes
 
A) Input parameters
Genome: example.gbk
Threshold dinucleotide bias: 2
Threshold codon bias: 4
Threshold %GC bias: 3
E-value (RPSBlast): 0.05
Genome (non-pathogenic):
 
 
C) Potential GIs and PAIs in BX571965
S.No  Start     End       Bias  Virulence  Insertion elements  Prediction
1     BPSL0008  BPSL0027  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
BPSL0008   3       14       1.442819    general secretory pathway protein E
BPSL0009   1       17       1.878230    general secretory pathway protein F
BPSL0010   2       15       2.066824    putative general secretory pathway protein
BPSL0011   1       13       2.953482    general secretory pathway protein G
BPSL0012   0       14       3.790500    general secretory pathway protein H
BPSL0013   0       14       3.569075    general secretory pathway protein I
BPSL0014   -2      10       4.246534    general secretory pathway protein J
BPSL0015   -2      9        4.340040    general secretory pathway protein K
BPSL0016   -2      10       4.703839    general secretory pathway protein L
BPSL0017   -1      11       2.812033    general secretory pathway protein M
BPSL0018   -1      10       2.755765    general secretory pathway protein N
BPSL0019   -1      12       3.019488    outer membrane efflux lipoprotein
BPSL0020   2       14       1.429074    putative membrane protein
BPSL0021   1       13       0.859975    MarR family protein
BPSL0022   1       12       -0.315585   putative transporter protein
BPSL0023   1       11       0.046639    LysR family transcription regulatory protein
BPSL0024   0       10       0.374405    LrgA family protein
BPSL0025   1       11       -0.710637   putative membrane protein
BPSL0026   2       11       -0.973985   flagellar basal body-associated protein FliL
BPSL0027   2       10       -0.763229   flagellar motor switch protein FliM
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL0009  BCTERIALGSPF  382    e-133    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F

signature.
Length = 408

Score = 382 bits (982), Expect = e-133
Identities = 174/406 (42%), Positives = 266/406 (65%), Gaps = 2/406 (0%)

Query: 1 MPAFRFEAIDASGRAQKGVIEADSARNARGQLRTQGLTPLVVEPAASAQRGARSQRLALG 60
M + ++A+DA G+ +G EADSAR AR LR +GL PL V+ Q+ + S L+L
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 61 R--KLSQREQAILTRQLASLLVAGLPLDEALAVLTEQAERDYIRELMAAIRAEVLGGHSL 118
R +LS + A+LTRQLA+L+ A +PL+EAL + +Q+E+ ++ +LMAA+R++V+ GHSL
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 119 ANALTQHPRDFPEIYRALVAAGEHTGKLGIVLSRLADYIEERNALKQKILLAFTYPAIVT 178
A+A+ P F +Y A+VAAGE +G L VL+RLADY E+R ++ +I A YP ++T
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180

Query: 179 VIAFGIVTFLLSYVVPQVVNVFASTKQQLPVLTIVMMALSDFVRHWWWAILIGIAAVVYL 238
V+A +V+ LLS VVP+VV F KQ LP+ T V+M +SD VR + +L+ + A
Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240

Query: 239 VKATLSRDGPRLAFDRWLLTAPLAGKLVRGYNTVRFASTLGILTAAGVPILRALQAAGET 298
+ L ++ R++F R LL PL G++ RG NT R+A TL IL A+ VP+L+A++ +G+
Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300

Query: 299 LSNRAMRGNIDDAIVRVREGSALSRALNNVKTFPPVLVHLIRSGEATGDVTTMLDRAAEG 358
+SN R + A VREG +L +AL FPP++ H+I SGE +G++ +ML+RAA+
Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360

Query: 359 ESRELERRTMFLTSLLEPLLILAMGGIVLVIVLAVMLPIIELNNMV 404
+ RE + L EPLL+++M +VL IVLA++ PI++LN ++
Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406
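The statistics lines in each alignment block can be read programmatically. The following is a minimal sketch, not part of PredictBias itself: the function names (parse_expect, parse_hit, truncated_percent, passes_cutoff) are hypothetical, and only the line formats and the 0.05 E-value setting are taken from this report. It assumes that an Expect value written as "e-133" abbreviates 1e-133, and that the percentages are truncated rather than rounded, which agrees with the figures shown here (174/406 is printed as 42%).

```python
import re

# Cutoff taken from "E-value (RPSBlast)" in section A of this report.
E_VALUE_CUTOFF = 0.05

# Patterns for the two statistics lines of an alignment block.
SCORE_RE = re.compile(r"Score = ([\d.]+) bits \((\d+)\), Expect = (\S+)")
FRACTION_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_expect(text):
    """Convert an Expect field to float; 'e-133' is assumed to mean 1e-133."""
    return float("1" + text if text.startswith("e") else text)


def truncated_percent(numerator, denominator):
    """Integer percentage, truncated (assumed to match the report's display)."""
    return 100 * numerator // denominator


def parse_hit(score_line, stats_line):
    """Return bit score, raw score, Expect and the count fields of one hit."""
    match = SCORE_RE.search(score_line)
    if match is None:
        raise ValueError("not a Score/Expect line: " + score_line)
    hit = {
        "bits": float(match.group(1)),
        "raw": int(match.group(2)),
        "expect": parse_expect(match.group(3)),
    }
    # Gaps may be absent on ungapped alignments; only present fields are stored.
    for name, num, den, pct in FRACTION_RE.findall(stats_line):
        hit[name.lower()] = (int(num), int(den), int(pct))
    return hit


def passes_cutoff(hit, cutoff=E_VALUE_CUTOFF):
    """True if the hit's Expect value is at or below the configured cutoff."""
    return hit["expect"] <= cutoff


if __name__ == "__main__":
    hit = parse_hit(
        "Score = 382 bits (982), Expect = e-133",
        "Identities = 174/406 (42%), Positives = 266/406 (65%), Gaps = 2/406 (0%)",
    )
    for key in ("identities", "positives", "gaps"):
        num, den, pct = hit[key]
        print(key, truncated_percent(num, den) == pct)
    print("passes cutoff:", passes_cutoff(hit))
```

Applied to the BPSL0009 hit above, the recomputed percentages agree with the printed ones and the hit falls well below the 0.05 cutoff.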


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL0011  BCTERIALGSPG  188    6e-65    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 188 bits (480), Expect = 6e-65
Identities = 67/140 (47%), Positives = 94/140 (67%), Gaps = 3/140 (2%)

Query: 10 QAARRQRGFTLIEIMVVVAILGILAALIVPKIMSRPDEARRIAAKQDIGTIMQALKLYRL 69
+A +QRGFTL+EIMVV+ I+G+LA+L+VP +M ++A + A DI + AL +Y+L
Sbjct: 2 RATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKL 61

Query: 70 DNGRYPTQDQGLNALIQKPTTDPIPNNWKDGGYLERLPNDPWGNSYKYLNPGVHGEIDVF 129
DN YPT +QGL +L++ PT P+ N+ GY++RLP DPWGN Y +NPG HG D+
Sbjct: 62 DNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDLL 121

Query: 130 SYGADGKEGGESNDSDIGSW 149
S G DG+ G E DI +W
Sbjct: 122 SAGPDGEMGTE---DDITNW 138


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL0012  BCTERIALGSPH  51     1e-10    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H

signature.
Length = 170

Score = 50.7 bits (121), Expect = 1e-10
Identities = 20/101 (19%), Positives = 33/101 (32%), Gaps = 15/101 (14%)

Query: 11 RARGFTLLEMLVVLVIAGILVSVASLTLRRNPRTDLREEAQRIALLFETAGDEAQVRARP 70
R RGFTLLEM+++L++ G+ + L + + R +
Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTGQF 61

Query: 71 IAWRATEHGFRF---------------DIRTGDGWRPLRDD 96
++F D +G W PLR
Sbjct: 62 FGVSVHPDRWQFLVLEARDGADPAPADDGWSGYRWLPLRAG 102


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL0013  BCTERIALGSPG  30     0.001    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 30.2 bits (68), Expect = 0.001
Identities = 10/26 (38%), Positives = 18/26 (69%)

Query: 8 RSPARSRGFTMIEVLVALAIIAVALA 33
R+ + RGFT++E++V + II V +
Sbjct: 2 RATDKQRGFTLLEIMVVIVIIGVLAS 27


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL0014  BCTERIALGSPG  33     3e-04    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 33.3 bits (76), Expect = 3e-04
Identities = 17/72 (23%), Positives = 34/72 (47%), Gaps = 3/72 (4%)

Query: 33 RGFTLIEMMIAITILAVIA-ILSWRGLDQIIRGREKVAAAMEDERVFAQMFDQMRIDARR 91
RGFTL+E+M+ I I+ V+A ++ + + ++ A+ D D ++D
Sbjct: 8 RGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQK--AVSDIVALENALDMYKLDNHH 65

Query: 92 AATDDEAGQPAV 103
T ++ + V
Sbjct: 66 YPTTNQGLESLV 77


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL0019  PF05616  32     0.007    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 32.0 bits (72), Expect = 0.007
Identities = 22/64 (34%), Positives = 26/64 (40%), Gaps = 10/64 (15%)

Query: 495 PAGAAAPAAMPAAAVAPAAMPAAAVAPAARPAA---------VVAAAGPDTQARRPRATP 545
P A AP A P V+PA PA AP P + A PDT +P P
Sbjct: 317 PGSAEAPNAQPLPEVSPAENPANNPAPNENPGTRPNPEPDPDLNPDANPDTDG-QPGTRP 375

Query: 546 AAPA 549
+PA
Sbjct: 376 DSPA 379


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL0022  TCRTETB  122    3e-32    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 122 bits (307), Expect = 3e-32
Identities = 85/404 (21%), Positives = 162/404 (40%), Gaps = 18/404 (4%)

Query: 28 IGLALGTFMEVLDTSIANVAVPTISGSLGVATSEGTWVISSYSVASAIAVPLTGWLARRV 87
I L + +F VL+ + NV++P I+ + WV +++ + +I + G L+ ++
Sbjct: 17 IWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQL 76

Query: 88 GEVRLFTLSVLAFTIASALCGLAEN-FETLIAFRLLQGLVSGPMVPLSQTILMRSYPPAR 146
G RL ++ S + + + F LI R +QG + L ++ R P
Sbjct: 77 GIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKEN 136

Query: 147 RGLALGLWAMTVIVAPIFGPLLGGWISDNYTWPWIFYINLPIGVFSAACAFFLLR-GRET 205
RG A GL V + GP +GG I+ W ++ +P+ + FL++ ++
Sbjct: 137 RGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLL--LIPM--ITIITVPFLMKLLKKE 192

Query: 206 KTTKQRIDAIGLALLVIGVSCLQMMLDLGKDRDWFNSTFITSLALIAVVSLAFMLVWEST 265
K D G+ L+ +G+ + F +++ S +++V+S +
Sbjct: 193 VRIKGHFDIKGIILMSVGIVFFML----------FTTSYSISFLIVSVLSFLIFVKHIRK 242

Query: 266 EKEPVVDLSLFKDRNFALGAMIISFGFMAFFGSVVIFPLWLQTVMGYTAGLAGLATA-PV 324
+P VD L K+ F +G + F G V + P ++ V + G P
Sbjct: 243 VTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPG 302

Query: 325 GILALVLSPMIGRNMHRLDLRMVASFAFVVFAVVSIWNSMFTLDVPFNHVILPRLVQGIG 384
+ ++ + G + R V + +V + S + I+ V G G
Sbjct: 303 TMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLG-G 361

Query: 385 VACFFVPMTTITLSSIPDERLASASGLSNFLRTLSGAIGTAVSS 428
++ ++TI SS+ + + L NF LS G A+
Sbjct: 362 LSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVG 405


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL0027  FLGMOTORFLIM  276    2e-93    Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 276 bits (706), Expect = 2e-93
Identities = 82/324 (25%), Positives = 159/324 (49%), Gaps = 10/324 (3%)

Query: 5 EFMSQEEVDALLKGVTGEDDSADEPAEASG---IRPYNIATQERIVRGRMPGLEIINDRF 61
E +SQ+E+D LL ++ D S ++ S I Y+ ++ + +M L ++++ F
Sbjct: 3 EVLSQDEIDQLLTAISSGDASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLMHETF 62

Query: 62 ARLLRIGIFNFMRRTAEISVSQVKVQKYSEFTRNLPIPTNLNLVHVKPLRGTSLFVFDPN 121
ARL + +R + V+ V Y EF R++P P+ L ++ + PL+G ++ DP+
Sbjct: 63 ARLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVLEVDPS 122

Query: 122 LVFFVVDNLFGGDGRFHTRVEGRDFTATEQRIIGKLLNLVFEHYASAWKSVRPLQFEFVR 181
+ F ++D LFGG G+ RD T E ++ ++ + + +W V L+ +
Sbjct: 123 ITFSIIDRLFGGTGQAAKVQ--RDLTDIENSVMEGVIVRILANVRESWTQVIDLRPRLGQ 180

Query: 182 SEMHTQFANVATPNEIVIVTQFSIEFGPTGGTLHICMPYSMIEPIRDVLSSPIQGEAL-- 239
E + QFA + P+E+V++ + G G ++ C+PY IEPI LSS ++
Sbjct: 181 IETNPQFAQIVPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFSSVRR 240

Query: 240 EVDRRWVRVLSQQVQSAEVELVADLAEVPTTFEKILNLRTGDVLPLD---ITDSITAKVD 296
+++ VL ++ + ++++VA++ + + IL LR GD++ L + D +
Sbjct: 241 SSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFVLSIG 300

Query: 297 GVPVMECGYGIFNGQYALRVQRMI 320
C G+ + A ++ I
Sbjct: 301 NRKKFLCQPGVVGKKIAAQILERI 324


2     BPSL0072  BPSL0094  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
BPSL0072   -1      11       -3.154705   putative type-b cytochrome
BPSL0073   -1      12       -3.167190   DNA gyrase subunit B
BPSL0074   0       13       -3.103216   DNA polymerase III, beta chain
BPSL0075   0       13       -3.223561   chromosomal replication initiator protein DnaA
BPSL0075a  -1      16       -2.186170   50S ribosomal protein L34
BPSL0076   -2      19       -3.532966   ribonuclease P protein component
BPSL0077   -1      16       -2.881848   conserved hypothetical protein
BPSL0078   -1      15       -2.880238   putative membrane protein
BPSL0079   -1      19       -2.776933   hypothetical protein
BPSL0080   -2      19       -2.317755   putative tRNA modification GTPase
BPSL0081   0       19       -3.032431   putative phage integrase
BPSL0082   1       19       -2.799239   hypothetical protein
BPSL0083   3       33       -5.837955   hypothetical protein
BPSL0084   3       40       -7.391265   hypothetical protein
BPSL0085   4       39       -7.243164   conserved hypothetical protein
BPSL0086   5       40       -7.886479   hypothetical protein
BPSL0087   5       43       -8.930230   putative DNA-binding protein
BPSL0088   4       32       -6.965467   conserved hypothetical protein
BPSL0089   4       31       -7.013410   hypothetical protein
BPSL0090   4       27       -6.107982   putative transposase protein
BPSL0091   2       26       -6.854991   putative transposase protein
BPSL0092   2       25       -6.665288   putative lipoprotein
BPSL0093   0       17       -4.845347   putative lipoprotein
BPSL0094   0       26       -5.406482   putative lipoprotein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
BPSL0075  PERTACTIN  33     0.003    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 33.2 bits (75), Expect = 0.003
Identities = 24/93 (25%), Positives = 31/93 (33%)

Query: 81 PKAGQRSPAGATPLAPRAPLPSANPAPVAPGPASAPAVDAHAPAPAGMNAATAAAVAAAQ 140
P A + +P P+ P P P P P +A AP P +AAA AA
Sbjct: 568 PPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAGRELSAAANAAVN 627

Query: 141 AAQAAQANAAALNADEAADLDLPSLTAHEAAAG 173
A+ A L L + A G
Sbjct: 628 TGGVGLASTLWYAESNALSKRLGELRLNPDAGG 660


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
BPSL0078  60KDINNERMP  490    e-171    60kDa inner membrane protein signature.
		>60KDINNERMP#60kDa inner membrane protein signature.

Length = 548

Score = 490 bits (1263), Expect = e-171
Identities = 204/576 (35%), Positives = 320/576 (55%), Gaps = 46/576 (7%)

Query: 1 MDIKRTVLWVIFFMSAVMLFDNWQRSHGRPSMFFPNVTQTNTASNATNGNGASGASAAAA 60
MD +R +L + + M++ W++ Q T + T
Sbjct: 1 MDSQRNLLVIALLFVSFMIWQAWEQDKNPQ-----PQAQQTTQTTTT------------- 42

Query: 61 ANALPAAATGAAPATTAPAAQAQLVRFSTDVYNGEIDTRGGTLAKLTLTK---AGDGKQP 117
AA AA + Q +L+ TDV + I+TRGG + + L + QP
Sbjct: 43 ------AAGSAADQGVPASGQGKLISVKTDVLDLTINTRGGDVEQALLPAYPKELNSTQP 96

Query: 118 DLSVTLFDHTANHTYLARTGLLGGDFPN-----HNDVYAQVAGPTSLAADQNTLKLSFES 172
L + + Y A++GL G D P+ +Y LA QN L++
Sbjct: 97 ---FQLLETSPQFIYQAQSGLTGRDGPDNPANGPRPLYNVEKDAYVLAEGQNELQVPMTY 153

Query: 173 PVKGGVKVVKTYTFTRGSYVIGVDTKIENVGAAPVTPSVYMELVRD-----NSSVETPMF 227
G KT+ RG Y + V+ ++N G P+ S + +L + + + F
Sbjct: 154 TDAAGNTFTKTFVLKRGDYAVNVNYNVQNAGEKPLEISSFGQLKQSITLPPHLDTGSSNF 213

Query: 228 S-HTFLGPAVYTDQKHFQKITFGDIDKNKADYVTSADNGWIAMVQHYFASAWIPQSGAKR 286
+ HTF G A T + ++K F I N+ ++S GW+AM+Q YFA+AWIP +
Sbjct: 214 ALHTFRGAAYSTPDEKYEKYKFDTIADNENLNISS-KGGWVAMLQQYFATAWIPHNDGTN 272

Query: 287 DIYVEKIDPTLYRVGVKQPVAAIAPGQSADVSARLFAGPEEERMLEGIAPGLELVKDYGW 346
+ Y + + +G K + PGQ+ +++ L+ GPE + + +AP L+L DYGW
Sbjct: 273 NFYTANLGNGIAAIGYKSQPVLVQPGQTGAMNSTLWVGPEIQDKMAAVAPHLDLTVDYGW 332

Query: 347 VTIIAKPLFWLLEKIHGFVGNWGWAIVLLTLLIKAVFFPLSAASYKSMARMKEITPRMQA 406
+ I++PLF LL+ IH FVGNWG++I+++T +++ + +PL+ A Y SMA+M+ + P++QA
Sbjct: 333 LWFISQPLFKLLKWIHSFVGNWGFSIIIITFIVRGIMYPLTKAQYTSMAKMRMLQPKIQA 392

Query: 407 LRERFKSDPQKMNAALMELYKTEKVNPFGGCLPVVIQIPVFISLYWVLLASVEMRGAPWV 466
+RER D Q+++ +M LYK EKVNP GGC P++IQ+P+F++LY++L+ SVE+R AP+
Sbjct: 393 MRERLGDDKQRISQEMMALYKAEKVNPLGGCFPLLIQMPIFLALYYMLMGSVELRQAPFA 452

Query: 467 LWIHDLSQRDPYFILPVLMAVSMFVQTKLNPTP-PDPVQAKMMMFMPIAFSVMFFFFPAG 525
LWIHDLS +DPY+ILP+LM V+MF K++PT DP+Q K+M FMP+ F+V F +FP+G
Sbjct: 453 LWIHDLSAQDPYYILPILMGVTMFFIQKMSPTTVTDPMQQKIMTFMPVIFTVFFLWFPSG 512

Query: 526 LVLYYVVNNVLSIAQQYYITRTL---GGAAAKKKAS 558
LVLYY+V+N+++I QQ I R L G + +KK S
Sbjct: 513 LVLYYIVSNLVTIIQQQLIYRGLEKRGLHSREKKKS 548


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL0080  PF05272  37     2e-04    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 36.6 bits (84), Expect = 2e-04
Identities = 25/123 (20%), Positives = 40/123 (32%), Gaps = 9/123 (7%)

Query: 191 IDFLEAADARGKLAHIR--ERLAHVLGDARQGALLREGLSV----VLAGQPNVGKSSLLN 244
+ L K +R + + + ++ G VL G +GKS+L+N
Sbjct: 555 VHVLGKTPDDYKPRRLRYLQLVGKYILMGHVARVMEPGCKFDYSVVLEGTGGIGKSTLIN 614

Query: 245 ALAGAELAIVTPI-AGTTRDKVAQTIQIEGIPLHIIDTAGLRETEDEVEKIGIARTWGEI 303
L G + T GT +D Q I L + R + E K +
Sbjct: 615 TLVGLDFFSDTHFDIGTGKDSYEQIAGIVAYELSEMT--AFRRADAEAVKAFFSSRKDRY 672

Query: 304 ERA 306
A
Sbjct: 673 RGA 675


3     BPSL0129  BPSL0160  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
BPSL0129   2       18       -1.654751   *prophage integrase
BPSL0130   1       21       -0.700763   conserved hypothetical phage protein
BPSL0130a  3       33       -2.193383   hypothetical phage protein
BPSL0131   0       35       -3.116212   hypothetical phage protein
BPSL0132   4       36       -3.660089   hypothetical phage protein
BPSL0133   3       34       -2.999128   hypothetical phage protein
BPSL0134   5       35       -3.350478   putative phage-encoded membrane protein
BPSL0135   5       36       -3.661828   conserved hypothetical phage protein
BPSL0136   8       40       -5.891525   hypothetical phage protein
BPSL0137   7       45       -8.595009   hypothetical phage protein
BPSL0138   6       53       -10.849667  putative phage protein
BPSL0139   2       39       -8.848043   putative phage DNA-binding protein
BPSL0140   0       33       -8.165303   hypothetical phage protein
BPSL0141   0       22       -5.236921   putative phage DNA-binding protein
BPSL0142   -1      20       -4.712634   putative phage-encoded membrane protein
BPSL0143   -1      17       -3.185627   hypothetical phage protein
BPSL0144   -1      15       -2.633092   putative phage protein
BPSL0145   -1      15       -2.181619   putative phage protein
BPSL0146   1       16       -1.366304   putative phage-encoded membrane protein
BPSL0147   4       15       0.420697    putative phage protein
BPSL0148   4       14       0.604566    putative phage protein
BPSL0149   2       13       0.559883    phage major tail tube protein
BPSL0150   1       12       1.267516    phage major tail sheath protein
BPSL0151   1       12       1.658563    putative phage tail fiber assembly protein
BPSL0152   1       11       0.743422    phage-related tail fiber protein
BPSL0153   -2      15       -1.285093   putative phage protein
BPSL0154   -1      16       -1.765674   phage baseplate assembly protein
BPSL0155   1       20       -2.619525   phage baseplate assembly protein
BPSL0156   4       21       -2.624454   phage baseplate assembly protein
BPSL0157   3       23       -2.630200   phage-encoded modification methylase
BPSL0158   3       23       -2.095050   putative phage protein
BPSL0159   2       19       -0.421553   phage tail completion protein
BPSL0160   2       19       -0.066847   phage tail completion protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL0131  CHANLCOLICIN  26     0.037    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 26.2 bits (57), Expect = 0.037
Identities = 20/70 (28%), Positives = 29/70 (41%), Gaps = 2/70 (2%)

Query: 33 AGATFGIHMERHAPHGHPEKWTVTHLASGMAAGVGPTRDAAIAHAAANLERN--KRRLRD 90
G G E A KW+ L A + AA A A A R+ +RL+D
Sbjct: 37 GGGKGGSKSESSAAIHATAKWSTAQLKKTQAEQAARAKAAAEAQAKAKANRDALTQRLKD 96

Query: 91 MLDEAMTARA 100
+++EA+ A
Sbjct: 97 IVNEALRHNA 106


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
BPSL0146  GPOSANCHOR  32     0.012    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 32.0 bits (72), Expect = 0.012
Identities = 28/167 (16%), Positives = 58/167 (34%), Gaps = 2/167 (1%)

Query: 20 TKPLKNVLNSNKGLAQALKQTRGELAELGKQQKAVASFREMRTGLAGTAEKLGEARTRVN 79
K L+ + L++ A + + A + E +
Sbjct: 213 IKTLEAEKAALAARKADLEKALEGAMN--FSTADSAKIKTLEAEKAALEARQAELEKALE 270

Query: 80 GLATALRAADQPSRQMIADFEKAKQSAARLSIEHEKQSARVRELRAQLASTGIDTRQLAE 139
G A + + A+ + A L + + +A + LR L ++ +QL
Sbjct: 271 GAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEA 330

Query: 140 HERTLRSNIAQTTAAMQTQTRQLEAMAEREKKLGAARGKMQALQGVA 186
+ L + A+ Q+ R L+A E +K+L A K++ ++
Sbjct: 331 EHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKIS 377


4     BPSL0170  BPSL0175  Y     N          N                   Genomic Island
LocusTag   DNBias  CDNBias  %GCBias     Product
BPSL0170   0       10       -3.397744   phage major capsid protein precursor
BPSL0171   1       13       -4.378863   putative phage capsid scaffolding protein
BPSL0172   0       12       -4.445055   phage terminase, ATPase subunit
BPSL0173   1       17       -4.642543   putative phage portal vertex protein
BPSL0174   0       21       -5.662236   putative phage DNA-binding protein
BPSL0175   -2      17       -3.574759   conserved hypothetical phage protein
5     BPSL0214  BPSL0234  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
BPSL0214   2       14       1.047999    hypothetical protein
BPSL0215   1       13       1.407560    aldo/keto reductase family protein
BPSL0216   1       12       2.720542    putative membrane protein
BPSL0217   0       9        2.178954    putative membrane protein
BPSL0218   -2      8        1.884243    putative transporter protein
BPSL0219   0       9        1.259470    hypothetical protein
BPSL0220   0       8        2.650347    conserved hypothetical protein
BPSL0221   0       9        2.953529    putative coniferyl aldehyde dehydrogenase
BPSL0223   0       10       3.900643    putative acyl-CoA dehydrogenase
BPSL0224   1       9        3.362914    putative GMC oxidoreductase
BPSL0225   1       12       3.519864    putative flagellar hook-length control protein
BPSL0226   2       14       2.058300    flagellar fliJ protein
BPSL0227   3       12       1.656567    flagellum-specific ATP synthase
BPSL0228   1       12       0.629670    flagellar assembly protein
BPSL0229   2       10       2.678934    flagellar motor switch protein
BPSL0230   0       9        4.054433    flagellar M-ring protein
BPSL0231   2       10       4.689393    flagellar hook-basal body complex protein
BPSL0232   0       10       5.031523    flagellar protein
BPSL0233   -2      8        4.159905    conserved hypothetical protein
BPSL0234   -1      8        3.487850    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL0218  TCRTETB  44     6e-07    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 44.5 bits (105), Expect = 6e-07
Identities = 62/361 (17%), Positives = 114/361 (31%), Gaps = 45/361 (12%)

Query: 66 LPEFSKAFGVSPAQSSLALSFATAALAAAVFVAGFVSEALSRHRLMTASLTASSLLTLAA 125
LP+ + F PA ++ + + V G +S+ L RL+ + + ++
Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96

Query: 126 AFAPHWHQLLIL-RALTGLALGGVPAVAMAYLAEEVHPDGLGLAMGLYVGGTAIGGMAGR 184
+ LLI+ R + G PA+ M +A + + G A GL A+G G
Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156

Query: 185 VITGILTDLFSWRIAVGAIGVLGLASMLAFRMLLPPSRH--------------------- 223
I G++ W + + + ++L R
Sbjct: 157 AIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFML 216

Query: 224 ------------FVPRRGLNLAHHRTS----LAHHLRGQRELPVLFAMAFVLMGSFVTLY 267
V + + H R + L + ++ G+
Sbjct: 217 FTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFV 276

Query: 268 NYIGYRLLAPPYSMGQATIGA--IFVVYLVGVVASPLSGRLADTLGRGRVLI---ASLAV 322
+ + Y ++ + + A IG+ IF + ++ + G L D G VL L+V
Sbjct: 277 SMVPY-MMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSV 335

Query: 323 MLGGVALTLLHPVAAIVAGVACVTFGFFAGHAVASGWVGR-LAQHGKGQAAALYLLAYYL 381
+ L + + V G V S V L Q G +L +L
Sbjct: 336 SFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFL 395

Query: 382 G 382

Sbjct: 396 S 396


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
BPSL0225  FLGHOOKFLIK  74     2e-16    Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 74.1 bits (181), Expect = 2e-16
Identities = 79/257 (30%), Positives = 109/257 (42%), Gaps = 8/257 (3%)

Query: 216 NGDASAPLAANGAAFDKLLAGAKAPAAQAAPTDASGANPATALANAAANAAQPDASG--A 273
N D +A L+A A K A + T L + AQPD +
Sbjct: 124 NEDVTASLSALFAMLPGFDNTPKVTDAPSTVLPTEKPTLFTKLTSEQLTTAQPDDAPGTP 183

Query: 274 LAALQDAADSARATLAASSAPAALQQAA-PAALAANASAAAASAAPSLAPPVGTPDWTDA 332
L A++ S P+ + AA P AAP L+ P+G+ +W +
Sbjct: 184 AQPLTPLVAEAQSKAEVISTPSPVTAAASPLITPHQTQPLPTVAAPVLSAPLGSHEWQQS 243

Query: 333 LSQKVVFLSNAHQQSAELTLNPPDLGPLQVVLRVADNHAHALFVSQHAQVRDAVEAALPK 392
LSQ + + QQSAEL L+P DLG +Q+ L+V DN A VS H VR A+EAALP
Sbjct: 244 LSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAALPV 303

Query: 393 LREAMEAGGLGLGSASVSDGGFASAQQQQTPQRQSSDGSATRRAFGASTADAALDELAAA 452
LR + G+ LG +++S F+ QQ + Q+QS +A D L
Sbjct: 304 LRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQR-TANHEPLAGEDDDT----LPVP 358

Query: 453 SSGGAARRTVGMVDTFA 469
S VD FA
Sbjct: 359 VSLQGRVTGNSGVDIFA 375


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL0226  FLGFLIJ  60     2e-14    Flagellar FliJ protein signature.
		>FLGFLIJ#Flagellar FliJ protein signature.

Length = 147

Score = 59.8 bits (144), Expect = 2e-14
Identities = 43/140 (30%), Positives = 74/140 (52%)

Query: 1 MAQSFPLQLLLERAQDDLDTAAKQLGRAQRERTDAQAQLDALMRYRDEYRVRFAESAQSG 60
MA+ L L + A+ +++ AA+ LG +R A+ QL L+ Y++EYR +G
Sbjct: 1 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG 60

Query: 61 MPAGNWRNFQAFLDTLDAAIEQQRRVLAAAQTRIDAARPEWQAKKRTLGSYEILQARGAR 120
+ + W N+Q F+ TL+ AI Q R+ L ++D A W+ KK+ L +++ LQ R +
Sbjct: 61 ITSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQST 120

Query: 121 QDAQRAAKREQRDADEHAAK 140
+ +Q+ DE A +
Sbjct: 121 AALLAENRLDQKKMDEFAQR 140


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL0228  FLGFLIH  109    1e-31    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 109 bits (273), Expect = 1e-31
Identities = 64/184 (34%), Positives = 106/184 (57%), Gaps = 4/184 (2%)

Query: 37 AAAALAAELQRVRDAAHAEGLAAGHVEGQALGYQAGYEQGRAKGFDEGQAEAHTHAAQLA 96
A +L +L +++ AH +G AG EG+ G++ GY++G A+G ++G AEA + A +
Sbjct: 36 AEPSLEQQLAQLQMQAHEQGYQAGIAEGRQQGHKQGYQEGLAQGLEQGLAEAKSQQAPIH 95

Query: 97 A----LAASFRDALAGVERDLADDIATLALEIAQQVVRQHVQHDPAALIAAAREVLAAEP 152
A L + F+ L ++ +A + +ALE A+QV+ Q D +ALI +++L EP
Sbjct: 96 ARMQQLVSEFQTTLDALDSVIASRLMQMALEAARQVIGQTPTVDNSALIKQIQQLLQQEP 155

Query: 153 ALAGAPHLIVNPADLPVVEAYLKDELDTLGWSVRTDTSIERGGCRAHASTGEIDATLTTR 212
+G P L V+P DL V+ L L GW +R D ++ GGC+ A G++DA++ TR
Sbjct: 156 LFSGKPQLRVHPDDLQRVDDMLGATLSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATR 215

Query: 213 WERV 216
W+ +
Sbjct: 216 WQEL 219


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL0229  FLGMOTORFLIG  298    e-102    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 298 bits (765), Expect = e-102
Identities = 114/324 (35%), Positives = 191/324 (58%)

Query: 5 GLNKSALLLMSIGEEEAAQVFKFLAPREVQKIGAAMAALKNVTREQVEDVLNDFVQEAEK 64
G K+A+LL+SIG E +++VFK+L+ E++ + +A L+ +T E ++VL +F +
Sbjct: 17 GKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFKELMMA 76

Query: 65 HTALSLDSSEYIRTVLTKALGEDKAGVLIDRILQGSDTSGIEGLKWMDSAAVAELIKNEH 124
+ +Y R +L K+LG KA +I+ + + E ++ D A + I+ EH
Sbjct: 77 QEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFVRRADPANILNFIQQEH 136

Query: 125 PQIIATILVHLDRDQASEIASCFTERLRNDVLLRIATLDGIQPTALRELDDVLTGLLSGS 184
PQ IA IL +LD +AS I S ++ +V RIA +D P +RE++ VL L+
Sbjct: 137 PQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVLEKKLASL 196

Query: 185 DNLKRAPMGGIRTAAEILNFMTSVHEEAVIENVKQYDPDLAQKIIDQMFVFENLLDLEDR 244
+ GG+ EI+N E+ +IE++++ DP+LA++I +MFVFE+++ L+DR
Sbjct: 197 SSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFEDIVLLDDR 256

Query: 245 AIQLLLKEVESEALIIALKGAPPALRQKFLSNMSQRAAELLAEDLDARGPVRVSEVETQQ 304
+IQ +L+E++ + L ALK +++K NMS+RAA +L ED++ GP R +VE Q
Sbjct: 257 SIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTRRKDVEESQ 316

Query: 305 RKILQVVRNLAESGQIVIGGKAED 328
+KI+ ++R L E G+IVI E+
Sbjct: 317 QKIVSLIRKLEEQGEIVISRGGEE 340


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL0230  FLGMRINGFLIF  468    e-162    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 468 bits (1206), Expect = e-162
Identities = 254/562 (45%), Positives = 360/562 (64%), Gaps = 37/562 (6%)

Query: 53 LSRMKTNPRLPFLIGAALAIAAIVALVLWSRAPDYRVLYSNLSDRDGGAIIAALQQANVP 112
L+R++ NPR+P ++ + A+A +VA+VLW++ PDYR L+SNLSD+DGGAI+A L Q N+P
Sbjct: 16 LNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQMNIP 75

Query: 113 YKFADAGGAILVPANQVHETRLKLAAMGLPKGGSVGFELMDNQKFGISQFAEQVNYQRAL 172
Y+FA+ GAI VPA++VHE RL+LA GLPKGG+VGFEL+D +KFGISQF+EQVNYQRAL
Sbjct: 76 YRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQFSEQVNYQRAL 135

Query: 173 EGELQRTVESINAVRAARVHLAIPKPSVFVRDREAPSASVLVDLYPGRVLDEGQVLAVTR 232
EGEL RT+E++ V++ARVHLA+PKPS+FVR++++PSASV V L PGR LDEGQ+ AV
Sbjct: 136 EGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALDEGQISAVVH 195

Query: 233 MVSSSVPDMPAKNVTIVDQDGNLLTQT-ASATGLDASQLKYVQQIERNTQKRIDAILAPI 291
+VSS+V +P NVT+VDQ G+LLTQ+ S L+ +QLK+ +E Q+RI+AIL+PI
Sbjct: 196 LVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDLNDAQLKFANDVESRIQRRIEAILSPI 255

Query: 292 FGAGNARSQVSADVDFSKIEQTSESYGPNGTPQQSAIRSQQTSSSTELAQSGASGVPGAL 351
G GN +QV+A +DF+ EQT E Y PNG ++ +RS+Q + S ++ GVPGAL
Sbjct: 256 VGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVGAGYPGGVPGAL 315

Query: 352 SNTPPQPASAPIVA-------------SNGQPAGPAATPVSDRKDSTTNYELDKTVRHVE 398
SN P P API ++ +A P S +++ T+NYE+D+T+RH +
Sbjct: 316 SNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSNYEVDRTIRHTK 375

Query: 399 QSMGTIKRLSVAVVVNYQPSTDAKGRVTMQPLAADKLAQVQQLVKDAMGYDEKRGDSVNV 458
++G I+RLSVAVVVNY+ D K PL AD++ Q++ L ++AMG+ +KRGD++NV
Sbjct: 376 MNVGDIERLSVAVVVNYKTLADGKP----LPLTADQMKQIEDLTREAMGFSDKRGDTLNV 431

Query: 459 VNSAFSAAADPFANLPWWRQPDMIELGKDIAKWLGVAAAAAALYFMFVRPALRR---AFP 515
VNS FSA + LP+W+Q I+ +WL V A L+ VRP L R
Sbjct: 432 VNSPFSAVDNTGGELPFWQQQSFIDQLLAAGRWLLVLVVAWILWRKAVRPQLTRRVEEAK 491

Query: 516 PPAEPAAAAVPALDGPDDMLALDGLPSPDKKQLAEEDEEHPALLAFENERNRYERNLDYA 575
E A + + L+ D + N+R E
Sbjct: 492 AAQEQAQVRQETEEAVEVRLSKDEQLQQRR----------------ANQRLGAEVMSQRI 535

Query: 576 RTIARQDPKIVATVVKNWVSDE 597
R ++ DP++VA V++ W+S++
Sbjct: 536 REMSDNDPRVVALVIRQWMSND 557


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
BPSL0231  FLGHOOKFLIE  62     7e-16    Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE

signature.
Length = 103

Score = 61.6 bits (149), Expect = 7e-16
Identities = 46/111 (41%), Positives = 62/111 (55%), Gaps = 8/111 (7%)

Query: 3 APVNGIASALQQMQAMAAQAAGGTSPATSLAGSGAASAGSFASAMKASLDKISGDQQKAL 62
+ + GI + Q+QA A +A SFA + A+LD+IS Q A
Sbjct: 1 SAIQGIEGVISQLQATA-MSARAQESLPQ-------PTISFAGQLHAALDRISDTQTAAR 52

Query: 63 GEAHAFEIGAQNVSLNDVMVDMQKANIGFQFGLQVRNKLVSAYNEIMQMSV 113
+A F +G V+LNDVM DMQKA++ Q G+QVRNKLV+AY E+M M V
Sbjct: 53 TQAEKFTLGEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 103


6     BPSL0250  BPSL0272  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
BPSL0250   2       12       0.896560    putative dipeptide transport system permease
BPSL0251   1       11       1.597183    putative dipeptide transport system permease
BPSL0252   0       10       1.605430    putative dipeptide transport system ATP-binding
BPSL0253   0       10       2.809853    putative dipeptide transport system ATP-binding
BPSL0254   -1      11       3.417511    putative exported protein
BPSL0255   -1      11       2.346965    putative membrane protein
BPSL0256   -1      11       3.290401    putative membrane protein
BPSL0257   -1      10       2.295503    LamB/YcsF family protein
BPSL0258   0       9        2.884791    conserved hypothetical protein
BPSL0259   0       9        2.281095    conserved hypothetical protein
BPSL0260   1       10       2.466180    conserved hypothetical protein
BPSL0261   1       10       2.856694    5-formyltetrahydrofolate cyclo-ligase family
BPSL0262   0       10       2.373067    putative transglycosylase
BPSL0263   0       12       3.417940    conserved hypothetical protein
BPSL0264   -1      12       3.880056    conserved hypothetical protein
BPSL0265   -1      11       3.759239    tRNA nucleotidyltransferase
BPSL0266   0       13       2.404888    conserved hypothetical protein
BPSL0267   0       11       2.171237    putative flagella synthesis protein
BPSL0268   3       11       0.678472    putative negative regulator of flagellin
BPSL0269   3       11       0.434294    putative flagella basal body P-ring formation
BPSL0270   4       16       -1.559018   putative flagellar basal-body rod protein
BPSL0271   3       18       -1.462464   flagellar basal-body rod protein
BPSL0272   3       20       -0.366716   putative basal-body rod modification protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL0254  FLGLRINGFLGH  28     0.037    Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 28.4 bits (63), Expect = 0.037
Identities = 14/40 (35%), Positives = 21/40 (52%), Gaps = 3/40 (7%)

Query: 10 GLAWPPAGALAAGSVAAAPLPQAPIPAPSMSLPGFHAPPP 49
G AW P+ L G+ +A P+P P P + S+ F + P
Sbjct: 21 GCAWIPSTPLVQGATSAQPVP-GPTPVANGSI--FQSAQP 57


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL0264  cloacin  30     0.010    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 29.7 bits (66), Expect = 0.010
Identities = 22/64 (34%), Positives = 32/64 (50%), Gaps = 11/64 (17%)

Query: 99 SVSAEMHAGFPALRSEMPLNVRESHPGRGATPAALADVARIDELWRTCVAASGGPFLFGA 158
+V+A + GFPAL + + S GA AA+AD+ +AA GPF FG
Sbjct: 83 AVAAPVAFGFPALSTPGAGGLAVSISA-GALSAAIADI----------MAALKGPFKFGL 131

Query: 159 FSIA 162
+ +A
Sbjct: 132 WGVA 135


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
BPSL0271  FLGHOOKAP1  27     0.029    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 26.8 bits (59), Expect = 0.029
Identities = 10/38 (26%), Positives = 17/38 (44%)

Query: 102 NVDPVQEMVNMISASRSYQANVETLNTAKQLMLKTLTI 139
V+ +E N+ + Y AN + L TA + + I
Sbjct: 508 GVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


7     BPSL0315  BPSL0322  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
BPSL0315   0       14       4.338185    conserved hypothetical protein
BPSL0316   -1      12       4.424720    hypothetical protein
BPSL0317   -2      12       2.976950    conserved hypothetical protein
BPSL0318   -2      13       3.595833    putative membrane protein
BPSL0319   -1      14       3.618188    hypothetical protein
BPSL0320   -1      15       4.389189    PfkB family carbohydrate kinase
BPSL0321   -2      14       3.195840    conserved hypothetical protein
BPSL0322   -2      10       3.086246    LacI family regulatory protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL0322  HTHTETR  28     0.043    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 28.1 bits (62), Expect = 0.043
Identities = 17/136 (12%), Positives = 34/136 (25%), Gaps = 8/136 (5%)

Query: 2 GTTIRDVAQAANVSIGTVSRALKNQPGLSEATRARIVE-----IAHRMNYDPTQLRPRIK 56
T++ ++A+AA V+ G + K++ L P ++
Sbjct: 31 STSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQAKFPGDPLSVLR 90

Query: 57 -RLTFLLHRQHNNFATTPFFSHVLHGVEDACRERGIVPSLLTTGPTDDVIRQMRPHAPDA 115
L +L + H E E +V + ++
Sbjct: 91 EILIHVLESTVTEERRRLLMEIIFHKCEFV-GEMAVVQQAQRNLCL-ESYDRIEQTLKHC 148

Query: 116 IAVAGFMEPETLEALA 131
I A
Sbjct: 149 IEAKMLPADLMTRRAA 164


8  BPSL0361  BPSL0366  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
BPSL0361  1  12  3.264438  putative alkaline phosphatase
BPSL0362  3  11  3.482001  putative exported protein
BPSL0363  3  12  3.228870  CutC family protein
BPSL0364  1  13  3.143341  biotin synthase
BPSL0365  3  14  4.152962  dethiobiotin synthetase
BPSL0366  2  14  3.766195  8-amino-7-oxononanoate synthase
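
The three bias columns can be compared against the thresholds listed under "Input parameters" on this page (dinucleotide bias 2, codon bias 4, %GC bias 3). The sketch below only illustrates such a comparison for island 8. It is not the PredictBias algorithm: the use of absolute values, the ">=" comparison, the name `exceeds`, and the reading of each row as DNBias, CDNBias, %GCBias are all assumptions.

```python
# Thresholds as listed under "Input parameters" on this page.
THRESHOLDS = {"dn": 2, "cdn": 4, "gc": 3}

# (LocusTag, DNBias, CDNBias, %GCBias) for island 8. The column
# boundaries are inferred from the flattened page and are an assumption.
ISLAND_8 = [
    ("BPSL0361", 1, 12, 3.264438),
    ("BPSL0362", 3, 11, 3.482001),
    ("BPSL0363", 3, 12, 3.228870),
    ("BPSL0364", 1, 13, 3.143341),
    ("BPSL0365", 3, 14, 4.152962),
    ("BPSL0366", 2, 14, 3.766195),
]


def exceeds(rows, thresholds=THRESHOLDS):
    """Map each locus tag to the metrics whose absolute value meets its threshold.

    Assumption: a metric counts as exceeding when abs(value) >= threshold.
    The page does not state how the tool applies its thresholds.
    """
    flagged = {}
    for tag, dn, cdn, gc in rows:
        values = {"dn": dn, "cdn": cdn, "gc": gc}
        flagged[tag] = sorted(
            name for name, value in values.items()
            if abs(value) >= thresholds[name]
        )
    return flagged


if __name__ == "__main__":
    for tag, metrics in exceeds(ISLAND_8).items():
        print(tag, ", ".join(metrics) or "none")
```

Changing the comparison or dropping `abs` gives different results for rows with negative bias values, so the output should be read as a convenience view of the table, not as a reproduction of the prediction.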
9  BPSL0388  BPSL0402  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BPSL0388  1  9  3.509492  putative exported protein
BPSL0389  1  11  2.168977  IclR family regulatory protein
BPSL0390  1  11  1.712410  fumarylacetoacetate (FAA) hydrolase family
BPSL0391  2  12  2.492041  enoyl-CoA hydratase/isomerase family protein
BPSL0392  0  14  2.901148  conserved hypothetical protein
BPSL0393  0  13  3.591250  putative patatin-like phospholipase
BPSL0394  -1  14  4.622594  conserved hypothetical protein
BPSL0395  -2  13  4.080374  putative cytidylyltransferase
BPSL0396  -2  12  4.056762  putative exported protein
BPSL0397  0  14  3.402429  Bordetella pertussis Bvg accessory factor family
BPSL0398  -1  13  2.945910  putative biotin ligase
BPSL0399  0  15  2.487166  putative membrane protein
BPSL0400  0  14  2.453066  putative 2',3'-cyclic-nucleotide
BPSL0401  1  14  3.247201  putative membrane protein
BPSL0402  0  13  3.223111  ABC transporter system ATP-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0388  PF03544  41  5e-06  Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 40.7 bits (95), Expect = 5e-06
Identities = 25/144 (17%), Positives = 43/144 (29%), Gaps = 2/144 (1%)

Query: 2 RRAACVLALVLALHWLAALWLVRFREPFRPVEPD-HVPVQVELLKPQPIERAPAPEKPAA 60
RR L + +H L+ P P+ V ++ P +E A + P
Sbjct: 12 RRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELPAPAQPISVTMVAPADLEPPQAVQPPPE 71

Query: 61 DRPRAAPKRAARAPAPPAHAPRASAPVSSAAESSTESSAESPAAASGTEPASAAGGQAAG 120
P+ P PP AP + + + +P +
Sbjct: 72 PVVEPEPE-PEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFE 130

Query: 121 ATSGAAAGASGASAPPGEAAQGVK 144
T+ A +S A+A + V
Sbjct: 131 NTAPARPTSSTATAATSKPVTSVA 154


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0396  GPOSANCHOR  30  0.002  Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 30.4 bits (68), Expect = 0.002
Identities = 21/80 (26%), Positives = 29/80 (36%), Gaps = 8/80 (10%)

Query: 68 PALETAPLNAPGAAPAAASDSAPGSPAASAPASAVAPASMPASVAAPAAPA----PSSPP 123
A E A L A A+ + D+ PG+ A A + P AP PS+
Sbjct: 451 QAEELAKLRAGKASDSQTPDAKPGNKAVPGKGQAPQAGTKPNQNKAPMKETKRQLPSTGE 510

Query: 124 AAQP----ARAPILPGASAA 139
A P A ++ A A
Sbjct: 511 TANPFFTAAALTVMATAGVA 530


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0397  PF03309  203  5e-67  Bvg accessory factor
		>PF03309#Bvg accessory factor

Length = 271

Score = 203 bits (517), Expect = 5e-67
Identities = 58/279 (20%), Positives = 102/279 (36%), Gaps = 47/279 (16%)

Query: 4 MCLLIDAGNSRIKWALADTARHFVTSGAFEHASDAPDWSTLPAPR------GAWISNVAG 57
M L ID N+ L G+ +HA W P I + G
Sbjct: 1 MLLAIDVRNTHTVVGLIS--------GSGDHAKVVQQWRIRTEPEVTADELALTIDGLIG 52

Query: 58 DAAAA---------------RIDALIEARWPALPRTVVRASAAQCGVTNGYAEPARLGSD 102
D A + ++E WP +P ++ G+ P +G+D
Sbjct: 53 DDAERLTGASGLSTVPSVLHEVRVMLEQYWPNVPHVLIEPGVR-TGIPLLVDNPKEVGAD 111

Query: 103 RWAGLIGAHAAFADEHLLIATFGTATTLEALRADGHFAGGLIAPGWALMMRSLGMHTAQL 162
R + A+ + +++ FG++ ++ + A G F GG IAPG + + +A L
Sbjct: 112 RIVNCLAAYHKYGTAAIVV-DFGSSICVDVVSAKGEFLGGAIAPGVQVSSDAAAARSAAL 170

Query: 163 PTVSIDAATNLLDELAENDAHAPFAIDTPHALSAGCLQAQAGLIE----RAWRDLEKAWQ 218
V + +++ + +T + AG + AGL++ R D++
Sbjct: 171 RRVELTRPRSVIGK------------NTVECMQAGAVFGFAGLVDGLVNRIRDDVDGFSG 218

Query: 219 APVRLVLSGGAADAIVRALTVPHTRHDTLVLTGLALIAH 257
A V +V +G A ++ L L L GL L+
Sbjct: 219 ADVAVVATGHTAPLVLPDLRTVEHYDRHLTLDGLRLVFE 257


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0398  SECA  29  0.027  SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 29.1 bits (65), Expect = 0.027
Identities = 18/49 (36%), Positives = 22/49 (44%), Gaps = 4/49 (8%)

Query: 198 AAAEVDALRARDATLAGGLP----PVALAAVRAGATLTDTFAAALNALA 242
A+ V +R D L GG+ +A G TLT T A LNAL
Sbjct: 74 ASKRVFGMRHFDVQLLGGMVLNERCIAEMRTGEGKTLTATLPAYLNALT 122


10  BPSL0467  BPSL0506  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BPSL0467  1  11  3.134165  putative exported protein
BPSL0468  1  11  3.231044  putative branched-chain amino acid transport
BPSL0469  0  10  3.322318  ABC transport system ATP-binding protein
BPSL0470  0  10  3.145017  conserved hypothetical protein
BPSL0471  -1  9  2.929213  putative DNA polymerase III alpha subunit
BPSL0472  -1  9  2.291182  conserved hypothetical protein
BPSL0473  0  8  2.468566  putative transporter protein
BPSL0474  -1  9  3.326940  putative fatty acid desaturase
BPSL0476  -1  13  4.294055  putative diaminobutyrate--2-oxoglutarate
BPSL0477  -1  15  4.939018  putative membrane protein
BPSL0478  -2  16  3.856737  putative membrane protein
BPSL0479  -2  16  4.561848  ABC transport system ATP-binding protein
BPSL0480  -1  14  4.765937  conserved hypothetical protein
BPSL0481  -2  14  4.781361  hypothetical protein
BPSL0482  -1  14  4.509344  conserved hypothetical protein
BPSL0483  -1  13  3.383752  conserved hypothetical protein
BPSL0484  0  14  3.974584  hypothetical protein
BPSL0485  0  14  3.797566  putative AMP-binding enzyme
BPSL0486  -1  14  2.671546  putative pyridoxal-dependent decarboxylase
BPSL0487  1  15  3.335311  hypothetical protein
BPSL0488  1  14  3.304816  hypothetical protein
BPSL0489  1  13  3.749107  hypothetical protein
BPSL0490  1  11  3.800502  hypothetical protein
BPSL0491  2  11  3.033416  putative acyl carrier protein
BPSL0492  2  11  3.647929  hypothetical protein
BPSL0493  3  10  3.980856  putative AMP-binding enzyme
BPSL0494  3  13  3.708755  LysR family regulatory protein
BPSL0495  1  13  4.012788  GntR family regulatory protein
BPSL0496  1  14  3.754714  putative N-acetylglucosamine-6-phosphate
BPSL0497  0  13  3.266327  putative phosphosugar-binding protein
BPSL0498  -1  13  1.893486  putative multiphosphoryl transfer protein
BPSL0499  -2  13  0.552473  phosphotransferase system, IIbc component
BPSL0500  -1  12  -0.124479  putative chitobiase
BPSL0500A  -1  9  -3.153043  putative exported protein
BPSL0501  0  8  -2.875643  cytochrome d ubiquinol oxidase subunit II
BPSL0502  -2  8  -1.995058  cytochrome d ubiquinol oxidase subunit I
BPSL0503  -2  7  -0.567983  putative membrane protein
BPSL0504  -1  6  0.282618  RNA polymerase sigma-32 factor
BPSL0505  -1  8  2.545959  putative exported protein
BPSL0506  0  9  3.355341  conserved hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0478  ABC2TRNSPORT  32  0.003  ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 31.8 bits (72), Expect = 0.003
Identities = 33/155 (21%), Positives = 60/155 (38%), Gaps = 7/155 (4%)

Query: 163 YGEFFATGILIMAFMSIGVVSTA-TTIATLRERNTFKMYVCFPVSRF-VFLASLIVSRVI 220
Y F A G++ + M+ T + + T++ + + + L + +
Sbjct: 65 YTAFLAAGMVATSAMTAATFETIYAAFGRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATK 124

Query: 221 LMLAASVTLMLAARYLFQVPLPLWSLRALRAIPVVLLGAAMLLSLGTLLASRARSLAAAE 280
LA + ++AA + SL L A+PV+ L SLG ++ + A S
Sbjct: 125 AALAGAGIGVVAAALGY---TQWLSL--LYALPVIALTGLAFASLGMVVTALAPSYDYFI 179

Query: 281 AWCNLIYFPLLFFSDLTIPLRAAPHWLRVVLLVLP 315
+ L+ P+LF S P+ P + LP
Sbjct: 180 FYQTLVITPILFLSGAVFPVDQLPIVFQTAARFLP 214


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0498  PHPHTRNFRASE  511  1e-175  Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 511 bits (1318), Expect = e-175
Identities = 194/567 (34%), Positives = 311/567 (54%), Gaps = 7/567 (1%)

Query: 308 PNTLAGVCAAPGIAVGTLVRWDDAQIVPPELASGTPAAESRLLDRALAEVDAQLETTVRE 367
+ + G+ A+ G+A+ + + + + + E L AL + +L +
Sbjct: 2 HHKITGIAASSGVAIAKAFIHLEPNVDIEKTSITDVSTEIEKLTAALEKSKEELRAIKDQ 61

Query: 368 ASRRGAIGEAGIFAVHRVLLEDPALVDAARDLI-SLGKSAGYAWRETIRAQTAVLADVDD 426
+A IFA H ++L+DP LVD + I + +A YA +E ++ +D+
Sbjct: 62 TEASMGADKAEIFAAHLLVLDDPELVDGIKGKIENEQMNAEYALKEVSDMFVSMFESMDN 121

Query: 427 TLLAERAADLRDIDKRVLRAL-GYASASARELPAEAVLAAEEFTPSDLASLDRERVAALV 485
+ ERAAD+RD+ KRVL L G + S + E V+ AE+ TPSD A L+++ V
Sbjct: 122 EYMKERAADIRDVSKRVLGHLIGVETGSLATIAEETVIIAEDLTPSDTAQLNKQFVKGFA 181

Query: 486 MARGGATSHAAIIARQLGIPALVAVGDALYAIAQRTQVVVDASAGRLEYAPSALDVERAR 545
GG TSH+AI++R L IPA+V + I V+VD G + P+ +V+
Sbjct: 182 TDIGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVNPTEEEVKAYE 241

Query: 546 HERQRLAGVREANRRMSGEAALTRDGHRIEVAANIATLDDARVALDNGADAVGLLRTELM 605
+R ++ ++ GE + T+DG +E+AANI T D L NG + +GL RTE +
Sbjct: 242 EKRAAFEKQKQEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTEFL 301

Query: 606 FIHRQAAPTASEHQQSYQSIVDALQGRTAIIRTLDVGADKEVDYLTLPPEPNPALGLRGI 665
++ R PT E ++Y+ +V + G+ +IRTLD+G DKE+ YL LP E NP LG R I
Sbjct: 302 YMDRDQLPTEEEQFEAYKEVVQRMDGKPVVIRTLDIGGDKELSYLQLPKELNPFLGFRAI 361

Query: 666 RLAQVRPDLLDDQLRGLLAVKPYGSVRILLPMVTDVGELVRIRKRIDD-----FARAMGR 720
RL + D+ QLR LL YG+++++ PM+ + EL + + + + + +
Sbjct: 362 RLCLEKQDIFRTQLRALLRASTYGNLKVMFPMIATLEELRQAKAIMQEEKDKLLSEGVDV 421

Query: 721 AQAVEVGVMIEVPSAALLADQLAQHADFLSIGTNDLTQYTLAMDRCQADLAAQADGLHPA 780
+ ++EVG+M+E+PS A+ A+ A+ DF SIGTNDL QYT+A DR ++ HPA
Sbjct: 422 SDSIEVGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAADRMNERVSYLYQPYHPA 481

Query: 781 VLRLVDATVRGAEKHGKWVGVCGALGGDPVAVPVLAGLGVTELSVDPVSVPGIKAQVRRL 840
+LRLVD ++ A GKWVG+CG + GD VA+P+L GLG+ E S+ S+ ++Q+ +L
Sbjct: 482 ILRLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLDEFSMSATSILPARSQLLKL 541

Query: 841 DYQLCRQRAQDLLALESAQAVRAASRE 867
+ + AQ L L++A+ V ++
Sbjct: 542 SKEELKPFAQKALMLDTAEEVEQLVKK 568
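
The "Score = ..., Expect = ..." lines on this page include a bare-exponent form ("Expect = e-175") that a plain float conversion rejects. The sketch below shows one way to read these lines and compare the E-value with the 0.05 cutoff listed under "Input parameters". The names `parse_score_line` and `passes_cutoff` are hypothetical, and treating the cutoff as inclusive is an assumption; this is not code from PredictBias or BLAST.

```python
import re

# Matches lines such as "Score = 29.7 bits (66), Expect = 0.010".
SCORE_RE = re.compile(r"Score =\s*([\d.]+) bits \((\d+)\), Expect = (\S+)")


def parse_score_line(line):
    """Return (bit_score, raw_score, e_value) parsed from a score line."""
    m = SCORE_RE.search(line)
    if m is None:
        raise ValueError("not a score line: %r" % line)
    bits, raw, expect = m.groups()
    if expect.startswith("e"):
        # Bare exponent such as "e-175": prepend the implied mantissa.
        expect = "1" + expect
    return float(bits), int(raw), float(expect)


def passes_cutoff(line, cutoff=0.05):
    """True when the E-value is at or below the cutoff (inclusive by assumption)."""
    return parse_score_line(line)[2] <= cutoff


if __name__ == "__main__":
    for text in (
        "Score = 511 bits (1318), Expect = e-175",
        "Score = 29.7 bits (66), Expect = 0.010",
    ):
        print(parse_score_line(text), passes_cutoff(text))
```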


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0500  cloacin  31  0.023  Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 30.8 bits (69), Expect = 0.023
Identities = 31/120 (25%), Positives = 48/120 (40%), Gaps = 8/120 (6%)

Query: 176 VVVDGAAPAVLRYDDTDDELRYVETLPADAQNNSPGNAPP--AAAQPVANRALPSVKRQR 233
V + G P+ + DD + + V +LPAD SP ++ P A V R + VK +R
Sbjct: 134 VALYGVLPSQIAKDDPNMMSKIVTSLPADDITESPVSSLPLDKATVNVNVRVVDDVKDER 193

Query: 234 ALPGALDLRGVELTLPELPSAQVAALRERAGTLGLDGARVPVWGVVAPRRLPADIAVPGG 293
+ GV +++P + A ER G PV + PA + G
Sbjct: 194 QNISVVS--GVPMSVPVVD----AKPTERPGVFTASIPGAPVLNISVNNSTPAVQTLSPG 247


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0506  TYPE3IMSPROT  34  1e-04  Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 34.0 bits (78), Expect = 1e-04
Identities = 14/42 (33%), Positives = 21/42 (50%), Gaps = 1/42 (2%)

Query: 51 KRETKQQFIDAITAGRRRYRQIEIQSQDVL-PVGDATYVVAG 91
KRE K+ +RR EIQS+++ V ++ VVA
Sbjct: 222 KREYKEMEGSPEIKSKRRQFHQEIQSRNMRENVKRSSVVVAN 263


11  BPSL0529  BPSL0600  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BPSL0529  2  11  -2.502729  conserved hypothetical protein
BPSL0530  0  13  -2.306599  putative HPr(Ser) kinase/phosphatase
BPSL0531  -2  13  -2.363920  putative nitrogen regulatory IIA protein
BPSL0532  -3  13  -1.292024  putatve PTS system, EIIa component
BPSL0533  -2  12  -0.057290  putative RNA polymerase sigma-54 factor
BPSL0534  0  12  1.260697  ABC transporter system, ATP-binding protein
BPSL0535  1  11  1.455621  OstA-like protein
BPSL0536  2  11  1.575980  putative exported protein
BPSL0537  4  11  2.042258  conserved hypothetical protein
BPSL0538  4  10  1.357668  conserved hypothetical protein
BPSL0539  5  12  0.647360  sodium/hydrogen exchanger family protein
BPSL0540  0  16  -0.537512  putative adenine phosphoribosyltransferase
BPSL0541  -1  13  -0.790639  LysE type translocator
BPSL0542  -2  11  -0.854080  NUDIX domain family protein
BPSL0543  2  19  -5.669234  putative formyltetrahydrofolate deformylase
BPSL0544  3  22  -6.148039  putative membrane protein
BPSL0545  5  27  -6.713471  excinuclease ABC subunit A
BPSL0546  8  44  -8.751436  putative transporter protein
BPSL0547  9  51  -10.012707  single-strand binding protein
BPSL0548  10  56  -10.960263  hypothetical protein
BPSL0549A  8  53  -8.996305  putative DNA-binding protein
BPSL0550  8  50  -9.417705  hypothetical protein
BPSL0551  9  51  -9.918502  hypothetical protein
BPSL0552  10  51  -9.720207  hypothetical protein
BPSL0553  11  50  -9.279012  putative DNA-binding protein
BPSL0554  11  47  -8.868611  hypothetical phage protein
BPSL0555  12  46  -8.379995  putative membrane protein
BPSL0556  13  51  -7.996008  hypothetical protein
BPSL0557  14  48  -8.004279  hypothetical protein
BPSL0557A  13  47  -7.942763  putative phage protein
BPSL0558  14  51  -9.543141  putative DNA-binding protein
BPSL0559  12  54  -10.304597  hypothetical protein
BPSL0560  12  57  -11.230505  hypothetical protein
BPSL0561  11  55  -11.730282  putative exported protein
BPSL0562  10  56  -11.804909  putative DNA-binding protein
BPSL0563  10  55  -12.187784  hypothetical protein
BPSL0564  10  54  -11.859604  hypothetical protein
BPSL0565  11  52  -11.664110  hypothetical protein
BPSL0566  9  49  -11.123845  hypothetical protein
BPSL0567  9  51  -10.822784  hypothetical protein
BPSL0568  11  52  -10.369528  hypothetical protein
BPSL0569  11  59  -11.437152  conserved hypothetical protein
BPSL0570  11  60  -11.711964  conserved hypothetical protein
BPSL0571  11  61  -12.140483  putative membrane protein
BPSL0572  12  62  -12.648062  hypothetical protein
BPSL0573  11  63  -12.395733  putative exported protein
BPSL0574  10  61  -13.088510  subtilase family protein
BPSL0574A  8  51  -10.544395  hypothetical phage protein
BPSL0574B  7  43  -7.882958  hypothetical protein
BPSL0575  6  41  -7.625225  hypothetical protein
BPSL0576  5  37  -6.120026  hypothetical protein
BPSL0577  5  37  -9.203571  phage integrase family protein
BPSL0578  6  37  -10.200120  dienelactone hydrolase family protein
BPSL0579  6  43  -11.432430  hypothetical protein
BPSL0580  8  51  -14.985118  hypothetical protein
BPSL0581  8  52  -15.428994  conserved hypothetical protein
BPSL0582  9  55  -16.027952  hypothetical protein
BPSL0583  9  53  -13.787803  hypothetical protein
BPSL0584  7  54  -13.553596  putative membrane protein
BPSL0585  7  53  -14.073746  hypothetical protein
BPSL0586  2  11  -0.898898  hypothetical protein
BPSL0586a  0  8  0.371176  hypothetical protein
BPSL0587  0  9  1.311584  phage integrase family protein
BPSL0588  0  8  1.751861  hypothetical protein
BPSL0589  1  10  2.758237  conserved hypothetical protein
BPSL0590  1  10  2.770961  putative membrane protein
BPSL0591  0  7  1.104529  hypothetical protein
BPSL0592  1  8  4.075082  hypothetical protein
BPSL0593  2  9  4.046789  hypothetical protein
BPSL0594  2  9  3.924560  hypothetical protein
BPSL0595  2  8  4.427837  hypothetical protein
BPSL0596  2  10  4.387344  putative DNA-binding protein
BPSL0597  1  9  4.550149  putative protein kinase
BPSL0598  0  9  2.517780  hypothetical protein
BPSL0599  0  9  2.844700  hypothetical protein
BPSL0600  0  9  3.148278  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0546  TCRTETA  86  1e-20  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 86.0 bits (213), Expect = 1e-20
Identities = 77/368 (20%), Positives = 143/368 (38%), Gaps = 31/368 (8%)

Query: 17 RATTSLAAIFALRMLGLFMIMPVFSVYAKTIPGGENVVL-VGIALGAYGVTQSLLYIFYG 75
R + + AL +G+ +IMPV + + +V GI L Y + Q G
Sbjct: 5 RPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLG 64

Query: 76 WASDKFGRKPVIAAGLLIFALGSFVAAFAHDITWIIVGRVIQGM-GAVSSAVLAFIADLT 134
SD+FGR+PV+ L A+ + A A + + +GR++ G+ GA + A+IAD+T
Sbjct: 65 ALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADIT 124

Query: 135 SEHNRTKAMAMVGGSIGMSFAVAIVGAPI--VFHWVGMSGLFAIVGALSVAAIGVVLWVV 192
R + + G + G + + F AL+ +++
Sbjct: 125 DGDERARHFGFMSACFGFGM---VAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLL 181

Query: 193 PDAPRPVHVPAPFAEVLHNVELLRLNFGVLVLHATQTALFLVVPRLLVDGGLPVA----- 247
P++ + P E L+ + R G+ V+ A F+ + + G +P A
Sbjct: 182 PESHKGERRPLR-REALNPLASFRWARGMTVVAALMAVFFI----MQLVGQVPAALWVIF 236

Query: 248 ----SHWQ-----VYLPVMGL--AFVMMVPAIIVAEKQGRMKPVLLGGIAAILIGQLLLG 296
HW + L G+ + + VA + G + ++L G+ A G +LL
Sbjct: 237 GEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALML-GMIADGTGYILLA 295

Query: 297 VATHTILIVAAILFVYFLGFNILEASQPSLVSKLAPGSRKGAATGVYNTTQSIGLALGGV 356
AT + + V I + +++S+ R+G G S+ +G +
Sbjct: 296 FATRGWMAF--PIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPL 353

Query: 357 VGGVLLKH 364
+ +
Sbjct: 354 LFTAIYAA 361


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0547  cloacin  46  3e-08  Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 45.9 bits (108), Expect = 3e-08
Identities = 29/71 (40%), Positives = 32/71 (45%), Gaps = 4/71 (5%)

Query: 109 GGRGGSGGGGGGGDDGGYGG----GGGGYGGGRDMERGGGGGRASGGGGAGARSGGGGGA 164
GG G G GGG D G+ GGG G G G G G G G +G SG GG
Sbjct: 22 GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNL 81

Query: 165 SRPSAPAGGGF 175
S +AP GF
Sbjct: 82 SAVAAPVAFGF 92



Score = 33.1 bits (75), Expect = 5e-04
Identities = 22/69 (31%), Positives = 24/69 (34%), Gaps = 3/69 (4%)

Query: 109 GGRGGSGGGGGGGDDGGYGGGGGGYGGGRDMER---GGGGGRASGGGGAGARSGGGGGAS 165
G + G GG G GGG G G E GGG G GG GGG +
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGN 70

Query: 166 RPSAPAGGG 174
GG
Sbjct: 71 SGGGSGTGG 79



Score = 32.4 bits (73), Expect = 0.001
Identities = 21/62 (33%), Positives = 24/62 (38%), Gaps = 6/62 (9%)

Query: 113 GSGGGGGGGDDGGYGGGGGGYGGGRDMERGGGGGRASGGGGAGARSGGGGGASRPSAPAG 172
G G G G GG G G GG G GGG +GG + + G S P
Sbjct: 47 GGGSGSGIHWGGGSGHGNGGGNGN------SGGGSGTGGNLSAVAAPVAFGFPALSTPGA 100

Query: 173 GG 174
GG
Sbjct: 101 GG 102



Score = 29.7 bits (66), Expect = 0.007
Identities = 17/53 (32%), Positives = 18/53 (33%)

Query: 109 GGRGGSGGGGGGGDDGGYGGGGGGYGGGRDMERGGGGGRASGGGGAGARSGGG 161
GG G GGG G GG G GG + G G GG G
Sbjct: 58 GGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAG 110


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0555  CHANLCOLICIN  31  0.004  Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 31.2 bits (70), Expect = 0.004
Identities = 33/151 (21%), Positives = 56/151 (37%), Gaps = 26/151 (17%)

Query: 131 GAKLVQTDEILTALEQYKAENPHLAKRVSKIAGYVKDIDKEVLVQSPK-VPASAIFNERS 189
G K+ +E L A E+YK L K+ SK D++ + + V
Sbjct: 380 GKKIGNVNEALAAFEKYKDV---LNKKFSKA-------DRDAIFNALASVKYDDWAKH-- 427

Query: 190 LRTALGFTKYARVVQIFGIVFTAYDLDVAAEQSVQTKSIRP--IRREAVRQMGGWSGATA 247
++A+ ++I G V YD+ + T +P + E G S A
Sbjct: 428 ------LDQFAKYLKITGHVSFGYDVVSDILKIKDTGDWKPLFLTLEKKAADAGVSYVVA 481

Query: 248 GARIGAALGASIGVETGPG-AIITGAIGGLL 277
G ++G+ G AI+TG + +
Sbjct: 482 LL-FSLLAGTTLGI---WGIAIVTGILCSYI 508


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0558  HTHTETR  28  0.004  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 28.4 bits (63), Expect = 0.004
Identities = 6/40 (15%), Positives = 16/40 (40%)

Query: 4 PVERALVAAGDDLALARRRRGLSTSSMAERAGISKKTLYR 43
+ ++ L + S +A+ AG+++ +Y
Sbjct: 11 ETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYW 50


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0566  GPOSANCHOR  37  3e-04  Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 36.6 bits (84), Expect = 3e-04
Identities = 39/279 (13%), Positives = 92/279 (32%), Gaps = 38/279 (13%)

Query: 232 LQFLTDKISELEAARRALSDTLTKLKQDRTLPRKDANKARASADTLRSTLNAARQAEFAA 291
++ L + + LEA + L L T A L + +A A
Sbjct: 178 IKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGA 237

Query: 292 KDRLAALELDIEDSRLFVGELKSRLQSLGESKETRTYFSNLQFQFCPSCLAELPKATADS 351
+ A I+ L++R L ++ E F + A++ A+
Sbjct: 238 MNFSTADSAKIKTLEAEKAALEARQAELEKALEG-------AMNFSTADSAKIKTLEAE- 289

Query: 352 HVCHLCTGSLGDGRADTQLLRMKNELNIQLKESTTLIETRLKDAEKLRVEVPHLSQRVRK 411
+ + ++++ + +L L + + + ++ Q++ +
Sbjct: 290 -----------KAALEAEKADLEHQSQVLNANRQSL-RRDLDASREAKKQLEAEHQKLEE 337

Query: 412 LEQDYAAAATSWSSELEIALEEHARKLGAVD--EEIRQAYE----------------QQK 453
+ A+ S +L+ + E + EE + E +++
Sbjct: 338 QNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQ 397

Query: 454 LATVITDLQKRRDALTAEGKRLQESIIRLEQQQAERKVE 492
+ + + + AL K L+ES E+++AE + +
Sbjct: 398 VEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAK 436


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0571  PERTACTIN  31  0.016  Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 31.2 bits (70), Expect = 0.016
Identities = 21/64 (32%), Positives = 26/64 (40%), Gaps = 3/64 (4%)

Query: 521 AMAARNPRPSPPTKITQNPPSSTPAPRPVVPQPMSAPRPIPQ---PVERGHREPPAAASP 577
A A P+P+P P P P PQP P+ P+ P RE AAA+
Sbjct: 565 AKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAGRELSAAANA 624

Query: 578 AATT 581
A T
Sbjct: 625 AVNT 628


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0574  SUBTILISIN  113  5e-30  Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 113 bits (285), Expect = 5e-30
Identities = 76/338 (22%), Positives = 122/338 (36%), Gaps = 46/338 (13%)

Query: 138 PNYPPSEGASFSPAWHLQKAGFPRAWQTTKGEGIRIAHLDIGWWPNHYSAPLKVRKDLGY 197
+ + P ++ P W T+G G+++A LD G +H LK R G
Sbjct: 11 QVIKQEQQVNEIP-RGVEMIQAPAVWNQTRGRGVKVAVLDTGCDADHPD--LKARIIGGR 67

Query: 198 NFVEGNSNTVDPGVGPNKGHGTATLALLAGNAVSLCGKAGQSEGDQVYRGFIGGAPSAEI 257
NF + + + GHGT +A +G AP A++
Sbjct: 68 NFTDDDEGDPEI-FKDYNGHGTHVAGTIAATENENGV--------------VGVAPEADL 112

Query: 258 VPVRIAGVDGSVVYLYGETMARGLAYAINPGDGRRCDVVSLSHGG-LPMKSWAHAVNMLY 316
+ +++ GS + + +G+ YAI D++S+S GG + AV
Sbjct: 113 LIIKVLNKQGS---GQYDWIIQGIYYAIEQK----VDIISMSLGGPEDVPELHEAVKKAV 165

Query: 317 DAGVVVVAAAGDSYWAVLTDIATHFTVYPSAFYRVVTSTGVTFDDGPYKRDRLGVMQGCW 376
+ ++V+ AAG+ D T YP + V++ + FD +
Sbjct: 166 ASQILVMCAAGNE---GDGDDRTDELGYPGCYNEVISVGAINFDRHASE-----FSNSNN 217

Query: 377 GPDKVMKKAVGAY-TPNVPWMCYNTKYGWDMNGAGTSASTPQMAAACALWLAKYGSVFPN 435
D V A G VP Y T +GTS +TP +A A AL + F
Sbjct: 218 EVDLV---APGEDILSTVPGGKYATF-------SGTSMATPHVAGALALIKQLANASFER 267

Query: 436 DWRRVAACRAALGRSVADAEKDFSEIGLGRLDVSAMIE 473
D A L + G G L ++A+ E
Sbjct: 268 DLTEPEL-YAQLIKRTIPLGNSPKMEGNGLLYLTAVEE 304


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0590  SALSPVBPROT  60  6e-11  Salmonella virulence plasmid 65kDa B protein signature.
		>SALSPVBPROT#Salmonella virulence plasmid 65kDa B protein signature.

Length = 591

Score = 60.1 bits (145), Expect = 6e-11
Identities = 55/208 (26%), Positives = 81/208 (38%), Gaps = 41/208 (19%)

Query: 12 LNLPSGGGSVSGDGGDFSVDLNTGTATLKFDLTVPAGPNGITPPHTLQYSAGAGDGAFGI 71
LP GG ++S G D G A++ L + A G P L YS+G G+G FG+
Sbjct: 18 PFLPKGGKALSQSGPD-------GLASITLPLPISAE-RGFAPALALHYSSGGGNGPFGV 69

Query: 72 GWSLGLMTIRRR-----------------------ITPATGAAEPAPPGACSLVGVGELV 108
GWS M+I R T +TG A P P + V
Sbjct: 70 GWSCATMSIARSTSHGVPQYNDSDEFLGPDGEVLVQTLSTGDA-PNPVTCFAYGDVSFPQ 128

Query: 109 DMGARRFRPIVDATGLLIEFTGAS------WTATDKTDTQYTLGTSANARIG---GGALP 159
R++P +++ +E+ + W D + LG +A AR+ +
Sbjct: 129 SYTVTRYQPRTESSFYRLEYWVGNSNGDDFWLLHDSNGILHLLGKTAAARLSDPQAASHT 188

Query: 160 AAWLVDRCADSAGNAIAYTWLDVGGARV 187
A WLV+ AG I Y++L G V
Sbjct: 189 AQWLVEESVTPAGEHIYYSYLAENGDNV 216


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0591  CHANLCOLICIN  35  0.002  Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 35.1 bits (80), Expect = 0.002
Identities = 37/196 (18%), Positives = 77/196 (39%), Gaps = 13/196 (6%)

Query: 524 VSQASGQINAAQQQLAVAQAQAQAYQAGVALAQTRATNAAKNAQ-EYGSLNSQVIVIQAT 582
+S+ + + AQ++L+ AQ++ + +R +++ E +L + +
Sbjct: 184 LSEEAKAVEIAQKKLSAAQSEVVKMDGEIKTLNSRLSSSIHARDAEMKTLAGKRNELAQA 243

Query: 583 GQQVSGGDDGDYNGVSAMANQYLSGQ-RISGDSATVAAATNLAANRL---SQQFQIDSMN 638
+ D+ +S AN L + V A + + + +I+ +N
Sbjct: 244 SAKYKELDE-LVKKLSPRANDPLQNRPFFEATRRRVGAGKIREEKQKQVTASETRINRIN 302

Query: 639 RTTAEMQQALAQAQAQLAAANAQVSAAGANLAVAQLNAQAAAQTLGVFDADTFTPQVWKA 698
++Q+A++Q A A+V A NL AQ N + DA T ++
Sbjct: 303 ADITQIQKAISQVSNNRNAGIARVHEAEENLKKAQNNLLNSQIK----DAVDATVSFYQT 358

Query: 699 MGNFVDQIYERYMNMA 714
+ ++ E+Y MA
Sbjct: 359 LT---EKYGEKYSKMA 371


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0597  YERSSTKINASE  34  0.004  Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 33.9 bits (77), Expect = 0.004
Identities = 17/42 (40%), Positives = 24/42 (57%), Gaps = 2/42 (4%)

Query: 149 QVLDGLAHAHANGVVHRDLKPQNVMVTTRDGEPCAKILDFGI 190
++LD H GVVH D+KP NV+ GEP ++D G+
Sbjct: 253 RLLDVTNHLAKAGVVHNDIKPGNVVFDRASGEPV--VIDLGL 292


12  BPSL0707  BPSL0787  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BPSL0707  2  10  3.166803  conserved hypothetical protein
BPSL0708  1  10  3.811131  putative LysR family transcriptional regulator
BPSL0709  0  10  4.043173  putative ATP/GTP binding protein
BPSL0710  0  11  3.790196  putative membrane protein
BPSL0711  0  10  4.035145  putative ATP-binding transmembrane ABC
BPSL0712  1  9  3.983724  putative transmembrane transporter
BPSL0713  0  8  3.225667  conserved hypothetical protein
BPSL0716  -1  9  4.148832  conserved hypothetical protein
BPSL0717  -1  10  3.467336  putative phosphoribosyl transferase protein
BPSL0718  -1  10  3.916140  putative membrane protein
BPSL0719  -2  10  3.437072  putative alcohol dehydrogenase cytochrome c
BPSL0720  0  10  4.283981  putative membrane protein
BPSL0721  -1  10  3.967986  putative cytochrome c oxidase subunit I
BPSL0723  0  11  4.370760  putative cytochrome c oxidase polypeptide II
BPSL0724  -1  11  4.063883  putative thiamine pyrophosphate requiring
BPSL0725  -1  10  5.088540  putative mandelate racemase
BPSL0726  0  8  4.195232  putative membrane protein
BPSL0727  0  8  2.791724  hypothetical protein
BPSL0728  -1  9  2.126315  putative glucose dehydrogenase
BPSL0729  -1  9  1.332156  conserved hypothetical protein
BPSL0730  -1  11  0.466357  family S45 unassigned peptidase
BPSL0731  -1  16  -3.231077  LysR family transcriptional regulator
BPSL0732  0  18  -3.926864  putative two-component regulator histidine
BPSL0733  3  31  -6.435183  putative exported protein
BPSL0734  5  34  -6.671100  putative two-component transcriptional response
BPSL0735  7  42  -7.076906  hypothetical protein
BPSL0736  8  43  -9.639769  hypothetical protein
BPSL0737  9  45  -9.983040  hypothetical protein
BPSL0738  11  42  -8.867365  hypothetical protein
BPSL0739  10  34  -7.232728  hypothetical protein
BPSL0740  9  35  -7.174482  hypothetical protein
BPSL0741  9  36  -7.737799  conserved hypothetical protein
BPSL0742  11  38  -8.212806  putative membrane protein
BPSL0743  11  37  -7.890775  conserved hypothetical protein
BPSL0744  11  42  -9.666597  putative phage-related integrase
BPSL0745  13  52  -11.109039  hypothetical protein
BPSL0746  13  53  -11.448323  hypothetical protein
BPSL0747  13  54  -11.610360  hypothetical protein
BPSL0747a  12  53  -11.438239  hypothetical protein
BPSL0748  12  56  -11.988656  hypothetical protein
BPSL0749  11  55  -11.906627  hypothetical protein
BPSL0750  11  55  -12.059221  hypothetical protein
BPSL0751  11  57  -12.372103  hypothetical protein
BPSL0752  10  57  -12.391333  hypothetical protein
BPSL0753  9  60  -12.812772  hypothetical protein
BPSL0754  8  56  -12.641602  putative DNA-binding protein
BPSL0756  8  56  -11.879264  hypothetical protein
BPSL0757  8  54  -11.558469  putative phosphoesterase
BPSL0758  7  54  -11.240944  hypothetical protein
BPSL0759  11  61  -11.841857  hypothetical protein
BPSL0760  11  60  -11.084107  hypothetical protein
BPSL0761  12  60  -10.910913  putative RNA 2'-phosphotransferase
BPSL0762  12  60  -10.989327  putative helicase SNF2 family protein
BPSL0764  13  63  -11.289780  hypothetical protein
BPSL0765  11  60  -10.927849  putative helicase family protein
BPSL0766  12  61  -11.591934  hypothetical protein
BPSL0767  11  51  -9.530479  putative phospholipase protein
BPSL0768  8  38  -7.973055  conserved hypothetical protein
BPSL0769  4  26  -5.056561  hypothetical protein
BPSL0770  2  21  -4.012409  conserved hypothetical protein
BPSL0771  -1  10  -1.649278  hypothetical protein
BPSL0772  0  9  -1.407150  phage integrase family protein
BPSL0773  -1  9  -0.644874  *putative transmembrane transport protein
BPSL0774  1  11  -0.673051  two-component sensor kinase transcriptional
BPSL0775  2  14  -2.138910  response regulator transcription regulatory
BPSL0776  2  15  -2.379682  putative recombinase A
BPSL0777  2  16  -1.249349  RecX family regulatory protein
BPSL0778  -2  13  -0.780939  conserved hypothetical protein
BPSL0779  -2  13  -0.891354  succinyl-CoA synthetase beta chain
BPSL0780  -3  10  0.286575  succinyl-CoA ligase alpha-chain
BPSL0781  -2  9  1.049023  putative membrane protein
BPSL0782  0  10  2.391352  putative type 4 fimbrial pilin protein
BPSL0783  -1  9  2.575084  putative membrane protein
BPSL0784  -1  10  3.453549  putative exported protein
BPSL0785  -1  12  3.502299  putative lipoprotein
BPSL0786  -2  10  3.045567  molybdenum cofactor biosynthesis protein c
BPSL0787  -1  10  3.064288  putative exported protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0721  PF05616  31  0.002  Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 31.3 bits (70), Expect = 0.002
Identities = 23/71 (32%), Positives = 28/71 (39%), Gaps = 5/71 (7%)

Query: 111 APPGPPAP-ASPASPRTTDRAPAHAPARDPLDAANREPNAPGEPDELGELG----EPDVP 165
AP P P SPA + AP P P + + N PD G+ G P VP
Sbjct: 322 APNAQPLPEVSPAENPANNPAPNENPGTRPNPEPDPDLNPDANPDTDGQPGTRPDSPAVP 381

Query: 166 DEPDDAASKTR 176
D P+ K R
Sbjct: 382 DRPNGRHRKER 392


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0732  HTHFIS  58  8e-11  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 57.5 bits (139), Expect = 8e-11
Identities = 34/122 (27%), Positives = 52/122 (42%), Gaps = 15/122 (12%)

Query: 484 RALVVDDNENARETLGAMLATLGIRVDLRGTGKEGLRCFGECQHDIVVLDLELPDISGFE 543
LV DD+ R L L+ G V + R D+VV D+ +PD + F+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 544 VAEQIRWATSSDAARKTTILGVSAYES------ALLKGDHAIFDAFIPKPIHLDTLGGIV 597
+ +I+ A +L +SA + A KG +D ++PKP L L GI+
Sbjct: 65 LLPRIK-----KARPDLPVLVMSAQNTFMTAIKASEKG---AYD-YLPKPFDLTELIGII 115

Query: 598 SR 599
R
Sbjct: 116 GR 117


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0734  HTHFIS  38  5e-05  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 37.5 bits (87), Expect = 5e-05
Identities = 22/125 (17%), Positives = 47/125 (37%), Gaps = 12/125 (9%)

Query: 188 ARIAVVDDSPDVAETICEYFAEKGVAAIAYYDSVSFRKALEVEDFDGYILDWLLGEETAA 247
A I V DD + + + + G ++ + + + D D + D ++ +E A
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 248 PLVRGIRASENADAPIFLLTGKISTGEASEDEIADIVSSFNARCEE---KPVRLPILFAE 304
L+ I+ D P+ +++ ++ + + + KP L L
Sbjct: 64 DLLPRIK-KARPDLPVLVMSA--------QNTFMTAIKASEKGAYDYLPKPFDLTELIGI 114

Query: 305 VARAL 309
+ RAL
Sbjct: 115 IGRAL 119


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0773  TCRTETA  35  8e-04  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 34.8 bits (80), Expect = 8e-04
Identities = 47/264 (17%), Positives = 93/264 (35%), Gaps = 28/264 (10%)

Query: 81 AIVFGRLGDLVGRKHTFLITIVIMGISTFVVGFLPGYASIGIAAPVIFIAMRLLQGLALG 140
A V G L D GR+ L+++ + ++ P V++I R++ G+ G
Sbjct: 60 APVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAP-------FLWVLYIG-RIVAGIT-G 110

Query: 141 GEYGGAATYVAEHAPSHRRGFYTSWIQTTATLGLFLSLLVILGVRTAIGEEAFGSWGWRV 200
A Y+A+ R + ++ G+ + G +G G +
Sbjct: 111 ATGAVAGAYIADITDGDERARHFGFMSACFGFGM------VAG--PVLGGLM-GGFSPHA 161

Query: 201 PFVASILLLAVSVWIRLQLNESPVFLRIKAEGKTSKAPLTEAFGQWKNLKIVILALIGLT 260
PF A+ L ++ L + + + PL + + L
Sbjct: 162 PFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASF-----RWARGMTVVAALM 216

Query: 261 AGQAVVWYTGQFYA---LFFLTQTLKVDGASANILIALALLIGTPF-FVFFGSLSDRIGR 316
A ++ GQ A + F D + I +A ++ + + G ++ R+G
Sbjct: 217 AVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGE 276

Query: 317 KPIILAGCLIAALTYFPLFKALTH 340
+ ++ G +IA T + L T
Sbjct: 277 RRALMLG-MIADGTGYILLAFATR 299



Score = 34.8 bits (80), Expect = 8e-04
Identities = 17/42 (40%), Positives = 24/42 (57%)

Query: 291 ILIALALLIGTPFFVFFGSLSDRIGRKPIILAGCLIAALTYF 332
IL+AL L+ G+LSDR GR+P++L AA+ Y
Sbjct: 47 ILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYA 88


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0774  PF06580  47  1e-07  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 46.8 bits (111), Expect = 1e-07
Identities = 49/229 (21%), Positives = 86/229 (37%), Gaps = 53/229 (23%)

Query: 300 LAGLRTQAEF-ALRHEVNA-------DVARSLEQIATSSEQAARLVTQLLALARAENRAT 351
+A + +A+ AL+ ++N + R+L I +A ++T L L R
Sbjct: 154 MASMAQEAQLMALKAQINPHFMFNALNNIRAL--ILEDPTKAREMLTSLSELMRY----- 206

Query: 352 GLTFEPVEIASLARQ--AVRDWV---QAALAKQMDLGYEGPDTDAPLRIDGQPVMLREML 406
L + SLA + V ++ ++ + +++ P ML +
Sbjct: 207 SLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQV---PPML---V 260

Query: 407 GNLIDNAIRY----TPAGGRITVRVHAERAAGAVHLEVEDTGPGIPPNERERVVERFYRI 462
L++N I++ P GG+I ++ + G V LEVE+TG N +E
Sbjct: 261 QTLVENGIKHGIAQLPQGGKILLK--GTKDNGTVTLEVENTGSLALKNTKE--------- 309

Query: 463 LGREGDGSGLGLAIVRE-IVAQHGGTLTIDDNVYQTSPRLAGTLVRVSI 510
+G GL VRE + +G I S + V I
Sbjct: 310 ------STGTGLQNVRERLQMLYGTEAQIK-----LSEKQGKVNAMVLI 347


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0775  HTHFIS  99  6e-26  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 98.8 bits (246), Expect = 6e-26
Identities = 35/118 (29%), Positives = 64/118 (54%), Gaps = 1/118 (0%)

Query: 2 RILIAEDDSILADGLTRSLRQSGYAVDHVRNGVEADTALSMQTFDLLILDLGLPRMSGLE 61
IL+A+DD+ + L ++L ++GY V N ++ DL++ D+ +P + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VLRRLRARNSNLPVLILTAADSVDERVKGLDLGADDYMAKPFALNE-LEARVRALTRR 118
+L R++ +LPVL+++A ++ +K + GA DY+ KPF L E + RAL
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0782  BCTERIALGSPH  41  4e-07  Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 41.1 bits (96), Expect = 4e-07
Identities = 29/124 (23%), Positives = 44/124 (35%), Gaps = 23/124 (18%)

Query: 1 MRARGFTLIELMIVLAIVGVVAAYAIPAYQDYLARSRVGEGLALAAS--ARLAVAENAAS 58
MR RGFTL+E+M++L ++GV A + A+ SR A A+L +
Sbjct: 1 MRQRGFTLLEMMLILLLMGVSAGMVLLAFPA----SRDDSAAQTLARFEAQLRFVQQRGL 56

Query: 59 GNGFSGGYVSPPATRNVDSIRVDDDSGQIVV-----AFTTRVAAAGANTLVLVPSAPDQA 113
G G + V D Q +V A G + +P +
Sbjct: 57 QTGQFFG------------VSVHPDRWQFLVLEARDGADPAPADDGWSGYRWLPLRAGRV 104

Query: 114 DTPT 117
T
Sbjct: 105 ATSG 108


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0785  PF03544  40  9e-07  Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 39.6 bits (92), Expect = 9e-07
Identities = 19/97 (19%), Positives = 29/97 (29%), Gaps = 2/97 (2%)

Query: 18 AGCATFAPRDAAKLECTMPVAAYPENAKPLERRATVLVRAMITASGNAENVTVTTSSRNA 77
+ T L P YP A+ L V V+ +T G +NV + ++
Sbjct: 147 SKPVTSVASGPRALSRNQPQ--YPARAQALRIEGQVKVKFDVTPDGRVDNVQILSAKPAN 204

Query: 78 AADRAAVDAMSRIACSQTPARGGEPYPFTLTRPFVFE 114
+R +AM R G E
Sbjct: 205 MFEREVKNAMRRWRYEPGKPGSGIVVNILFKINGTTE 241


13  BPSL0832  BPSL0844  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
BPSL0832   1       13       3.869743    putative haloacid dehalogenase hydrolase
BPSL0833   0       12       3.528385    putative sugar ABC transporter protein
BPSL0834   -1      12       4.805715    putative exported protein
BPSL0835   0       12       5.689915    LysR family transcriptional regulator
BPSL0836   0       12       5.924717    family S12 unassigned peptidase
BPSL0837   0       13       5.680340    putative transporter protein
BPSL0838   0       12       5.199585    putative transcriptional regulator
BPSL0839   1       12       5.530714    putative carbohydrate kinase
BPSL0840   2       12       5.234021    putative D-arabinitol 4-dehydrogenase
BPSL0841   1       12       5.449981    putative LysR family transcriptional regulator
BPSL0842   1       12       4.803057    putative benzoylformate decarboxylase
BPSL0843   2       11       4.198037    aldehyde dehydrogenase family protein
BPSL0844   3       12       4.042733    putative ketopantoate reductase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0833  PF05272  30  0.021  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.0 bits (67), Expect = 0.021
Identities = 14/35 (40%), Positives = 17/35 (48%)

Query: 50 VVFVGPSGCGKSTLMRMIAGLEEISGGELLIDGAK 84
VV G G GKSTL+ + GL+ S I K
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGK 633


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0834  PF06776  30  0.020  Invasion associated locus B
		>PF06776#Invasion associated locus B

Length = 214

Score = 29.5 bits (66), Expect = 0.020
Identities = 11/49 (22%), Positives = 15/49 (30%), Gaps = 2/49 (4%)

Query: 1 MKTGRRHFVRSVASASAALAAAAWSPARAAIDAPASPATALSLTPGRWS 49
+ + RR R+ A A A A A A+ G W
Sbjct: 38 LASCRRLARRNGARLMLAGAMAI--ALSFGWSDRADAQGAVRSVHGDWQ 84



Score = 28.7 bits (64), Expect = 0.039
Identities = 7/37 (18%), Positives = 13/37 (35%)

Query: 10 RSVASASAALAAAAWSPARAAIDAPASPATALSLTPG 46
+++ A L+ S R A A A ++
Sbjct: 25 KAIQMGPAELSPMLASCRRLARRNGARLMLAGAMAIA 61


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0836  BLACTAMASEA  30  0.018  Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 29.8 bits (67), Expect = 0.018
Identities = 11/35 (31%), Positives = 15/35 (42%)

Query: 57 REDALFRFASVSKPIVSAAAMRAVAAGKLDLDASI 91
R D F S K ++ A + V AG L+ I
Sbjct: 57 RADERFPMMSTFKVVLCGAVLARVDAGDEQLERKI 91


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0837  TCRTETB  35  4e-04  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 35.2 bits (81), Expect = 4e-04
Identities = 31/155 (20%), Positives = 59/155 (38%), Gaps = 5/155 (3%)

Query: 26 LLALATAGFITIVTEALPAGLLPLMGRDLRVSDALVGQLVTVYAAGSIVAAIPLVAATRG 85
L+ L F +++ E + LP + D A + T + + +
Sbjct: 16 LIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ 75

Query: 86 MRRRPLLLAALAGFVVANTATAASPYYAPVLV-ARCVAGVSAGLLWALLAGYASRMVDAR 144
+ + LLL + + + +L+ AR + G A AL+ +R +
Sbjct: 76 LGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKE 135

Query: 145 QRGRAIAIAMLGAPVAMSVGI-PL-GTALGAALGW 177
RG+A ++G+ VAM G+ P G + + W
Sbjct: 136 NRGKAFG--LIGSIVAMGEGVGPAIGGMIAHYIHW 168


14  BPSL0874  BPSL0887  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
BPSL0874   -1      10       3.339136    putative short-chain dehydrogenase family
BPSL0875   -1      10       2.368008    putative adenylate kinase
BPSL0876   -3      8        2.098109    putative 3-deoxy-manno-octulosonate
BPSL0877   -4      9        2.969038    conserved hypothetical protein
BPSL0878   -3      8        3.008958    putative tetraacyldisaccharide 4'-kinase
BPSL0879   -3      9        2.443674    putative exodeoxyribonuclease VII large subunit
BPSL0880   -3      10       2.171794    putative superoxide dismutase
BPSL0881   -3      10       3.285849    hypothetical protein
BPSL0882   -2      11       3.892186    putative chromate transporter protein
BPSL0883   1       11       2.148562    putative regulatory protein
BPSL0884   1       9        2.350913    putative membrane protein
BPSL0885   1       9        3.572932    putative ketopantoate reductase
BPSL0886   1       10       2.368063    conserved hypothetical protein
BPSL0887   2       10       1.970565    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0874  DHBDHDRGNASE  73  2e-17  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 72.8 bits (178), Expect = 2e-17
Identities = 55/199 (27%), Positives = 75/199 (37%), Gaps = 16/199 (8%)

Query: 3 IRDNVFLITGGASGLGAGTARLLTEAGGKVVLADLNQQAGEALARELGGAF-----VKCD 57
I + ITG A G+G AR L G + D N + E + L D
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 58 VAREEDAQAAVAA-AAKLGTLRGLVNCAGIAPAAKTVGKDGPHPLELFAKTITVNLIGTF 116
V A ++G + LVN AG+ G E + T +VN G F
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVL----RPGLIHSLSDEEWEATFSVNSTGVF 121

Query: 117 NMIRVAAAAMAANEPAQTGERGVIVSTASVAAFDGQIGQAAYAASKAGVAGMTLPIARDL 176
N R + M G IV+ S A + AAYA+SKA T + +L
Sbjct: 122 NASRSVSKYMMDRR------SGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLEL 175

Query: 177 SRNAIRVMTIAPGIFETPM 195
+ IR ++PG ET M
Sbjct: 176 AEYNIRCNIVSPGSTETDM 194


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0877  SECA  25  0.027  SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 24.8 bits (54), Expect = 0.027
Identities = 14/36 (38%), Positives = 20/36 (55%), Gaps = 5/36 (13%)

Query: 33 KLAYPIRDGIPVMLVDEARQTVEGTPVDPAGPARGR 68
KL Y + D + +L+DEAR TP+ +GPA
Sbjct: 202 KLHYALVDEVDSILIDEAR-----TPLIISGPAEDS 232


15  BPSL0936  BPSL0951  Y  N  Y  Genomic Island
LocusTag   DNBias  CDNBias  %GCBias     Product
BPSL0936   -1      20       -3.582504   putative dihydrodipicolinate synthetase
BPSL0937   4       41       -10.071591  putative class II aldolase
BPSL0938   8       51       -11.798281  putative DNA-binding protein
BPSL0938A  9       52       -11.801939  hypothetical protein
BPSL0939   10      50       -11.902892  putative DeoR family regulatory protein
BPSL0940   11      43       -10.552702  conserved hypothetical protein
BPSL0941   11      42       -10.483696  hypothetical protein
BPSL0942   12      40       -9.634711   hypothetical protein
BPSL0943   11      34       -8.854222   putative insertion element protein
BPSL0944   12      34       -9.021605   putative phage integrase/recombinase protein
BPSL0945   12      34       -9.085371   conserved hypothetical protein
BPSL0946   11      38       -10.586205  conserved hypothetical protein
BPSL0947   12      44       -10.261560  putative type I restriction enzyme specificity
BPSL0948   12      38       -8.791300   putative type I restriction-modification
BPSL0949   12      47       -5.544295   hypothetical protein
BPSL0950   5       31       -4.421135   insertion element hypothetical protein
BPSL0951   1       19       -3.096684   putative replication protein
16  BPSL0974  BPSL0985  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
BPSL0974   -1      11       3.333026    conserved hypothetical protein
BPSL0975   0       11       3.949616    conserved hypothetical protein
BPSL0976   3       12       4.740432    putative outer membrane receptor protein
BPSL0977   3       15       4.739049    putative transmembrane ABC transporter permease
BPSL0978   2       14       4.005374    putative ATP-binding ABC transporter protein
BPSL0979   2       14       4.199805    putative
BPSL0980   1       13       4.907552    putative cobalamin [5'-phosphate] synthase
BPSL0981   0       14       4.917827    phosphoglycerate mutase family
BPSL0982   -1      12       4.595387    putative membrane protein
BPSL0983   -1      12       6.026749    putative vitamin B12 transport protein
BPSL0984   -1      11       5.486697    putative cobalamin biosynthesis aminotransferase
BPSL0985   -1      12       3.854078    putative cobalamin biosynthesis protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0975  BACINVASINB  27  0.015  Salmonella/Shigella invasin protein B signature.
		>BACINVASINB#Salmonella/Shigella invasin protein B signature.

Length = 593

Score = 27.4 bits (60), Expect = 0.015
Identities = 22/89 (24%), Positives = 33/89 (37%), Gaps = 4/89 (4%)

Query: 23 QTERLALEEQVAQLRNEAQTLHAELEQLRDERNALAAERDTLSAKIDDAQVKLNAILEKL 82
Q + +E Q+ E QT E ++ D A + DT + D A KL KL
Sbjct: 112 QAMIESQKEMGIQVSKEFQTALGEAQEATDLYEASIKKTDTAKSVYDAATKKLTQAQNKL 171

Query: 83 ----PRTKNVPDAENQLDLLAPQANDEGE 107
P AE ++ +A + E
Sbjct: 172 QSLDPADPGYAQAEAAVEQAGKEATEAKE 200


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0976  SSBTLNINHBTR  30  0.016  Streptomyces subtilisin inhibitor signature.
		>SSBTLNINHBTR#Streptomyces subtilisin inhibitor signature.

Length = 144

Score = 29.8 bits (66), Expect = 0.016
Identities = 36/109 (33%), Positives = 49/109 (44%), Gaps = 8/109 (7%)

Query: 8 AALAALSGLPCIALAQGDASASSASFASSVS--YAPAAA--SPADADSALSTAPAAAAAS 63
A AA GL A+ A AS AS A++ + YAP+A + +SA + AP A
Sbjct: 5 ARWAATLGLTATAVCGPLAGASLASPATAPASLYAPSALVLTVGHGESAATAAPLRAVTL 64

Query: 64 PASGAARGAEAVSADAASAV--ASGASSASPARAASAAQL--APVVVTA 108
+ A G +A A + + A G SA A + APVVVT
Sbjct: 65 TCAPTASGTHPAAAAACAELRAAHGDPSALAAEDSVMCTREYAPVVVTV 113


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0983  FERRIBNDNGPP  40  8e-06  Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 39.5 bits (92), Expect = 8e-06
Identities = 39/186 (20%), Positives = 68/186 (36%), Gaps = 9/186 (4%)

Query: 42 AITLAAPARRVVSLAPHVTELIYAAG----GGAKLVGAVSYSDYPPAAKAIARVGSNKAL 97
A A R+V+L EL+ A G G A + + PP ++ VG
Sbjct: 28 AHAAAIDPNRIVALEWLPVELLLALGIVPYGVADTINYRLWVSEPPLPDSVIDVGLRTEP 87

Query: 98 DLERIAALKPDLIVVWRHGNAEHETERLRALGIPLYFSEPRH-LDDVAASLDKLGLLLGT 156
+LE + +KP +V E A G FS+ + L SL ++ LL
Sbjct: 88 NLELLTEMKPSFMVWSAGYGPSPEMLARIAPGRGFNFSDGKQPLAMARKSLTEMADLLNL 147

Query: 157 HEIASAAADAYRRRIAQLRARYADK--PPVTVFFQAWDKPLITLNGDH-IVSDVIALCGG 213
A Y I ++ R+ + P+ + D + + G + + +++ G
Sbjct: 148 QSAAETHLAQYEDFIRSMKPRFVKRGARPL-LLTTLIDPRHMLVFGPNSLFQEILDEYGI 206

Query: 214 RNVFAR 219
N +
Sbjct: 207 PNAWQG 212


17  BPSL1047  BPSL1069  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
BPSL1047   3       14       2.740062    hypothetical protein
BPSL1048   1       15       1.788351    putative sugar transport, integral membrane
BPSL1049   5       18       1.044373    conserved hypothetical protein
BPSL1050   5       18       1.275910    hypothetical protein
BPSL1051   5       17       0.434765    putative lipoprotein
BPSL1053   3       21       -0.356118   ecotin precursor
BPSL1054   1       21       -1.336895   murein-DD-endopeptidase
BPSL1055   2       23       -1.522133   putative exported protein
BPSL1056   1       33       -4.003176   *conserved hypothetical protein
BPSL1057   0       36       -4.914122   hypothetical protein
BPSL1058   3       45       -4.260147   hypothetical protein
BPSL1059   3       33       -3.530949   translation initiation factor IF-1
BPSL1060   4       28       -2.767261   hypothetical protein
BPSL1061   4       29       -2.986746   putative dehydrogenase
BPSL1062   3       28       -2.881475   putative membrane protein
BPSL1064   0       12       -1.116754   rubredoxin
BPSL1065   1       10       -0.346859   conserved hypothetical protein
BPSL1066   2       12       0.019467    putative ABC transport system, ATP-binding
BPSL1067   2       12       0.574023    topoisomerase IV subunit B
BPSL1069   2       12       0.169269    topoisomerase IV subunit A
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL1048  TCRTETB  118  6e-31  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 118 bits (296), Expect = 6e-31
Identities = 76/397 (19%), Positives = 157/397 (39%), Gaps = 14/397 (3%)

Query: 43 LAVLDGAIANVALPTIARDLRASDAASIWIVNAYQLAVTISLLPLASLGDRIGYRRVYIA 102
+VL+ + NV+LP IA D A++ W+ A+ L +I L D++G +R+ +
Sbjct: 25 FSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLF 84

Query: 103 GLMLFTAASLGCALSGT-LPALATLRVIQGFGAAGIMSVNTALVRMIYPSSQLGRGVAIN 161
G+++ S+ + + L R IQG GAA ++ +V P G+ +
Sbjct: 85 GIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLI 144

Query: 162 AMVVALSSAVGPTVASAVLAVAPWPWLFAINVPIGVAAVCGSLRALPANPGRSAPYDFIG 221
+VA+ VGP + + W +L I + I + V ++ L +D G
Sbjct: 145 GSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPM-ITIITVPFLMKLLKKEVRIKGHFDIKG 203

Query: 222 AVMNACVFGLLIVSVDGLGHGGNRASVALTALVAAVIGYF-FVKRQLTQPAPLLPVDLMR 280
++ + V + S ++ L+ +V+ + FVK P + L +
Sbjct: 204 IIL-------MSVGIVFFMLFTTSYS--ISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGK 254

Query: 281 VPIFALSIGTSVASFTSQMLAFVALPFWLQNTLGFSQVQTG-LYMTPWPLVIVVAAPLAG 339
F + + F + +P+ +++ S + G + + P + +++ + G
Sbjct: 255 NIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGG 314

Query: 340 VLSDRYSAGALGGIGLALFASGLFALATIGAHPTPVDIVWRMALCGAGFGLFQSPNNRAI 399
+L DR + IG+ + + + T + + G ++ + +
Sbjct: 315 ILVDRRGPLYVLNIGVTFLSVSFLTASFL-LETTSWFMTIIIVFVLGGLSFTKTVISTIV 373

Query: 400 LSSAPRERAGGASGMLGTARLTGQTLGAALVALIFGI 436
SS ++ AG +L + G A+V + I
Sbjct: 374 SSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSI 410


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL1053  cloacin  28  0.012  Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 27.8 bits (61), Expect = 0.012
Identities = 18/59 (30%), Positives = 22/59 (37%), Gaps = 1/59 (1%)

Query: 49 GTVNVWGGDGWRDRDHWRGGDDRWHGGWRGGGNWRDGNDWHGGRGNGWQGGRGPAGGRN 107
G + G G D W ++ W GG G +W G HG G G G G N
Sbjct: 23 GPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHW-GGGSGHGNGGGNGNSGGGSGTGGN 80



Score = 26.2 bits (57), Expect = 0.048
Identities = 20/51 (39%), Positives = 23/51 (45%), Gaps = 3/51 (5%)

Query: 74 GGWRGGGNWRDGNDWHGGRGNGWQGGRGPAGGRNVRGGNDWPDGGGNGRGG 124
G G G + N W GG G+G G G G GN GGG+G GG
Sbjct: 32 GASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGN---SGGGSGTGG 79


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL1070  GPOSANCHOR  31  0.025  Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 30.8 bits (69), Expect = 0.025
Identities = 19/52 (36%), Positives = 28/52 (53%), Gaps = 7/52 (13%)

Query: 460 ARLEKIKIEKELEELRAEKAKLEELLANESAMKRLMIKE-------IEADAK 504
+R K ++EK LEE ++ A LE+L K+L KE +EA+AK
Sbjct: 391 SREAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEAK 442


18  BPSL1104  BPSL1110  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
BPSL1104   -1      11       3.353639    putative electron transport-related protein
BPSL1105   -1      12       3.223345    putative TetR family regulatory protein
BPSL1106   -1      11       3.796192    putative depolymerase/histone-like protein
BPSL1107   0       11       4.560140    conserved hypothetical protein
BPSL1108   2       12       5.053618    putative exported protein
BPSL1109   2       11       4.951012    putative membrane protein
BPSL1110   0       11       3.526707    putative lipoprotein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL1104  IGASERPTASE  39  3e-05  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 38.5 bits (89), Expect = 3e-05
Identities = 17/141 (12%), Positives = 38/141 (26%), Gaps = 3/141 (2%)

Query: 148 SQQQADAARARHDARAARLKREREAAEARAAARRAASAAAAHAPASSAAAPAAPAADDAD 207
S+Q++ + RE A+ + +A + A + S
Sbjct: 1044 SKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKET 1103

Query: 208 AKKRAIIAAALERARKKKEALAAQGAGPKNTEGVSAAVQAQIDAAEARRRRLAEQRDAAD 267
A A +E + ++ PK + + QA+ ++
Sbjct: 1104 ATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAE---PARENDPTVNIKEPQS 1160

Query: 268 EPGRPDDANAAGDDASPPSKT 288
+ D + S +
Sbjct: 1161 QTNTTADTEQPAKETSSNVEQ 1181


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL1105  HTHTETR  72  1e-17  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 72.0 bits (176), Expect = 1e-17
Identities = 34/194 (17%), Positives = 71/194 (36%), Gaps = 11/194 (5%)

Query: 5 KIKRDPEGTRRRILLAAAEEFATGGLFGARVDQIARRAETNERMLYYYFGSKEQLFTAVL 64
K K++ + TR+ IL A F+ G+ + +IA+ A +Y++F K LF+ +
Sbjct: 4 KTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIW 63

Query: 65 EYAFSALMEAERAIDLEGVAPVEAITR---LAHFVWDYYRDHPDLLRLLNNENLHEARYL 121
E + S + E E + ++ R + + LL + +
Sbjct: 64 ELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEM 123

Query: 122 QKSTRIREMI-SPIVKTLDGVLERGQKAGLFRTDIDSLRFYVTLSGL------GYYMVSN 174
+ + + ++ L+ +A + D+ + R + + G +
Sbjct: 124 AVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQ 183

Query: 175 RFTLAAIFGRDFSA 188
F L RD+ A
Sbjct: 184 SFDLKKE-ARDYVA 196


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL1109  IGASERPTASE  45  5e-06  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 44.7 bits (105), Expect = 5e-06
Identities = 54/322 (16%), Positives = 97/322 (30%), Gaps = 42/322 (13%)

Query: 534 NDHAPTSAETVAPDGHVPTSAETAAPDSHAPTSAETAAPDGHAPTSAETATPNDHASTSA 593
N +TV + A S + E A D ATP++ T A
Sbjct: 982 NPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVA 1041

Query: 594 ETAAPDSHAPTSAETAA--PDGHASTITEAAAPNGHVSATVETSAVAAPVGITQAAPPIA 651
E + +S E A + + A N V A +T+ VA T+
Sbjct: 1042 ENSKQESKTVEKNEQDATETTAQNREVAKEAKSN--VKANTQTNEVAQSGSETKETQTTE 1099

Query: 652 ADTCPAGEHVIAAVEPAGTSDSAAIGAGAIAHAEAGAAASTAETASPIGVDTHIAPSREA 711
E E A T +T V + ++P +E
Sbjct: 1100 TKETATVE------------------------KEEKAKVETEKTQEVPKVTSQVSPKQEQ 1135

Query: 712 DRTAQTAPTAPSPAEATPHVDAPHALDVAARALVGNTAATAHGAAAVDGSAQRADTASPA 771
T Q + T ++ P + NT A A S
Sbjct: 1136 SETVQPQAEPARENDPTVNIKEPQSQT--------NTTADTEQPAKETSSNVEQPVTEST 1187

Query: 772 ASTSGPPAPVAASAASSDRAAPQPVATAAPASIATSGALGTMKAIGAAGPQPSTIAAQRA 831
+G V + ++ A QP + ++ + +++++ +P+T ++
Sbjct: 1188 TVNTGN--SVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSV-PHNVEPATTSSNDR 1244

Query: 832 SAIDDTGQPPSTGHSTHAAVSN 853
S + T +T+A +S+
Sbjct: 1245 STV---ALCDLTSTNTNAVLSD 1263



Score = 43.1 bits (101), Expect = 1e-05
Identities = 44/291 (15%), Positives = 81/291 (27%), Gaps = 31/291 (10%)

Query: 379 RAQARPAAPDPRFAPRRPATQAAVSAARNRPMTFTPSRQTTGATPPQPAPRAQTA----- 433
+ R D QA V + + PP PA ++T
Sbjct: 984 EVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARV-DEAPVPPPAPATPSETTETVAE 1042

Query: 434 -----APTAETARKRAPANPARAPLYAWHEKPAERIAPAAS--VHETLRSIEASAAQWTA 486
+ T E + A A+ A K + + + E +
Sbjct: 1043 NSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKE 1102

Query: 487 LAGATSTAATPVTARESMAAPAAPSGGAAASAAPDGHAPTSAETAAPNDHAPTSAETVAP 546
A V ++ P S + + P AE A ND E +
Sbjct: 1103 TATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQP-QAEPARENDPTVNIKEPQSQ 1161

Query: 547 DGHVPTS---AETAAPDSHAPTSAETAAPDGHAP------TSAETATPNDHASTS----- 592
+ A+ + + P + T G++ T+ T P ++ +S
Sbjct: 1162 TNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKN 1221

Query: 593 --AETAAPDSHAPTSAETAAPDGHASTITEAAAPNGHVSATVETSAVAAPV 641
+ H A T++ D + + + N + + + A A V
Sbjct: 1222 RHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTN-AVLSDARAKAQFV 1271



Score = 38.9 bits (90), Expect = 3e-04
Identities = 47/318 (14%), Positives = 85/318 (26%), Gaps = 39/318 (12%)

Query: 690 ASTAETASPIGVDTHIAPSREADRTA-QTAPTAPSPAEATPHVDAPHALDVAARALVGNT 748
+ T + I D PS + AP P PA ATP +
Sbjct: 994 TTNITTPNNIQADVPSVPSNNEEIARVDEAPVPP-PAPATP----------SETTET-VA 1041

Query: 749 AATAHGAAAVDGSAQRADTASPAASTSGPPAPVAASAASSDRAAPQPVATAAPASIATSG 808
+ + V+ + Q A + VA A S+ +A Q +
Sbjct: 1042 ENSKQESKTVEKNEQDATETTAQNRE------VAKEAKSNVKANTQ-------TNEVAQS 1088

Query: 809 ALGTMKAIGAAGPQPSTIAAQRASAIDDTGQPPSTGHSTHAAVSNELGRRPHAAPDAVTP 868
T + + +T+ + + ++ ++ + E +
Sbjct: 1089 GSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARE 1148

Query: 869 ALPPAAAARAAAVPTSASAVQRQALASESAEAAQGVARAAAAGDSRETTQVSPAGARPDK 928
P + T+ +A Q S+ Q V + + +P P
Sbjct: 1149 NDPTVNIKEPQS-QTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVE-NPENTTPAT 1206

Query: 929 AAPSAAGANPIAPLPGASAITAHEDAPTSAAPDAATPVIAAMDSAMPNAVAPASAIA--S 986
P+ S+ S A S + VA + +
Sbjct: 1207 TQPTVNSE---------SSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNT 1257

Query: 987 NAGMSPASASAAAPRMAS 1004
NA +S A A A +
Sbjct: 1258 NAVLSDARAKAQFVALNV 1275



Score = 34.7 bits (79), Expect = 0.005
Identities = 37/279 (13%), Positives = 65/279 (23%), Gaps = 33/279 (11%)

Query: 300 PPPASAMPAPTIAAAKPAAATMPPSGLSKAERLAAPTGGAAAPLAAPAAAVTSPAAFAPA 359
PPPA A P+ T + + + T A A A
Sbjct: 1026 PPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNR--EVAKEAKSNVKAN--TQ 1081

Query: 360 ATGIAKPIGSTAAVAALGKRAQARPAAPDPRFAPRRPATQAAVSAARNRPMTFTPSRQTT 419
+A+ T + A + + ++ P
Sbjct: 1082 TNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQP 1141

Query: 420 GATP-PQPAPRAQTAAPTAETARKRAPANPARAPLYAWHEKPAERIAPAASVHETLRSIE 478
A P + P P ++T PA+ ET ++E
Sbjct: 1142 QAEPARENDPTVNIKEPQSQTNTTADTEQPAK---------------------ETSSNVE 1180

Query: 479 ASAAQWTALAGATSTAATPVTARESMAAPAA-------PSGGAAASAAPDGHAPTSAETA 531
+ T + S P + P P S H A T+
Sbjct: 1181 QPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTS 1240

Query: 532 APNDHAPTSAETVAPDGHVPTSAETAAPDSHAPTSAETA 570
+ + + + + + S A A +
Sbjct: 1241 SNDRSTVALCDLTSTNTNAVLSDARAKAQFVALNVGKAV 1279


19  BPSL1131  BPSL1156  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
BPSL1131   2       12       -0.047011   conserved hypothetical protein
BPSL1132   2       22       -4.636223   hypothetical protein
BPSL1133   1       29       -6.837304   hypothetical protein
BPSL1134   2       30       -7.339155   hypothetical protein
BPSL1135   2       28       -7.845287   putative membrane protein
BPSL1136   3       36       -9.245167   hypothetical protein
BPSL1137   4       39       -10.172858  hypothetical protein
BPSL1138   2       26       -6.666714   hypothetical protein
BPSL1139   1       21       -4.467085   putative phage-related protein
BPSL1140   1       20       -4.409394   hypothetical protein
BPSL1141   2       22       -5.063642   putative phage-related protein
BPSL1142   2       22       -4.925691   putative phage terminase
BPSL1143   2       26       -5.672691   putative exported protein
BPSL1144   4       33       -5.918727   putative phage-related protein
BPSL1146   7       44       -9.157548   hypothetical protein
BPSL1147   6       44       -8.706607   hypothetical protein
BPSL1148   6       44       -9.247604   hypothetical protein
BPSL1149   6       47       -8.698813   hypothetical protein
BPSL1150   8       44       -9.471122   hypothetical protein
BPSL1151   9       40       -7.629515   hypothetical protein
BPSL1152   9       46       -9.716843   hypothetical protein
BPSL1153   7       41       -8.710023   hypothetical protein
BPSL1153A  1       25       -6.547964   hypothetical protein
BPSL1154   -2      20       -4.736119   hypothetical protein
BPSL1155   -2      18       -3.515878   conserved hypothetical protein
BPSL1156   -2      16       -3.047659   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL1152  PF05272  28  0.002  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.1 bits (62), Expect = 0.002
Identities = 10/27 (37%), Positives = 13/27 (48%)

Query: 48 DIGSDIAERAAIELGELRSVSRAEVSA 74
D IA A EL E+ + RA+ A
Sbjct: 634 DSYEQIAGIVAYELSEMTAFRRADAEA 660


20  BPSL1174  BPSL1182  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
BPSL1174   0       13       4.122098    two-component system, sensor kinase protein
BPSL1175   -2      10       3.620913    two-component system, regulatory protein KdpE
BPSL1176   -1      11       3.766649    conserved hypothetical protein
BPSL1177   0       12       4.219552    putative chaperone
BPSL1178   -1      11       4.660578    putative membrane protein
BPSL1179   -1      10       4.608109    putative membrane protein
BPSL1180   -2      8        3.591660    putative globin
BPSL1181   -2      7        4.192198    putative alanyl-tRNA synthetase
BPSL1182   -2      7        3.921113    conserved hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL1175  HTHFIS  89  1e-22  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 89.1 bits (221), Expect = 1e-22
Identities = 39/157 (24%), Positives = 71/157 (45%), Gaps = 3/157 (1%)

Query: 7 TVVLIEDEKQIRRFVRSALEEEGIAVFDAETGRQGLIEAATRKPDLAIVDLGLPDGDGLD 66
T+++ +D+ IR + AL G V A DL + D+ +PD + D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 67 VIRELR-GWSEMPVIVLSARTHEEEKVAALDAGADDYLTKPFGVSELLARIRAHL--RRR 123
++ ++ ++PV+V+SA+ + A + GA DYL KPF ++EL+ I L +R
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 124 NQAGAAESPVVRFGDVSVDLALRRVWRGGEVVHLTPL 160
+ + V A++ ++R + T L
Sbjct: 125 RPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDL 161


21  BPSL1209  BPSL1223  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
BPSL1209   -1      11       -3.147615   triosephosphate isomerase
BPSL1210   -2      13       -5.116797   general secretory pathway, protein-export
BPSL1211   -2      11       -4.577197   *NADH dehydrogenase I chain A
BPSL1212   -2      13       -2.374807   NADH dehydrogenase I chain B
BPSL1213   -2      15       -2.690297   NADH dehydrogenase I chain C
BPSL1214   -2      15       -2.784999   NADH dehydrogenase I chain D
BPSL1215   -1      15       -2.245157   putative NADH dehydrogenase I chain E
BPSL1216   -1      15       -2.304831   NADH dehydrogenase I chain F
BPSL1217   0       16       -2.969463   putative NADH dehydrogenase I chain G
BPSL1218   1       18       -4.836236   NADH dehydrogenase I chain H
BPSL1219   0       17       -4.319367   putative NADH dehydrogenase I chain I
BPSL1220   0       18       -4.295440   NADH dehydrogenase I chain J
BPSL1221   0       17       -4.488246   NADH-ubiquinone oxidoreductase I chain K
BPSL1222   -1      17       -3.860267   NADH-ubiquinone oxidoreductase I chain L
BPSL1223   -3      14       -3.193629   NADH dehydrogenase I chain M
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL1210  SECGEXPORT  83  8e-24  Protein-export SecG membrane protein signature.
		>SECGEXPORT#Protein-export SecG membrane protein signature.

Length = 110

Score = 82.7 bits (204), Expect = 8e-24
Identities = 46/102 (45%), Positives = 68/102 (66%), Gaps = 1/102 (0%)

Query: 8 IIVVQLLSALGVIGLVLLQHGKGADMGAAFGSGASGSLFGATGSANFLSRTTAVLATIFF 67
++VV L+ A+G++GL++LQ GKGADMGA+FG+GAS +LFG++GS NF++R TA+LAT+FF
Sbjct: 5 LLVVFLIVAIGLVGLIMLQQGKGADMGASFGAGASATLFGSSGSGNFMTRMTALLATLFF 64

Query: 68 VATLALTYLGSYKSAPSVGVLGAAPAPAASAPAASQTPAASA 109
+ +L L + S K+ APA + PA
Sbjct: 65 IISLVLGNINSNKTNKGSEWEN-LSAPAKTEQTQPAAPAKPT 105


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL1218  OUTRMMBRANEA  30  0.013  Outer membrane protein A signature.
		>OUTRMMBRANEA#Outer membrane protein A signature.

Length = 346

Score = 29.9 bits (67), Expect = 0.013
Identities = 16/96 (16%), Positives = 28/96 (29%), Gaps = 10/96 (10%)

Query: 138 YAVILAGWASNSKYAFLGAMR-------AAAQMVSYEISMGFALVLVLMTAGSLNLSEIV 190
Y GW+ F+ A Y+++ + G +
Sbjct: 29 YTGAKLGWSQYHDTGFINNNGPTHENQLGAGAFGGYQVNPYVGFEMGYDWLGRMPY---K 85

Query: 191 GSQQHGFFAGHGVNFLSWNWLPLLPVFVIYFISGIA 226
GS ++G + GV + P+ IY G
Sbjct: 86 GSVENGAYKAQGVQLTAKLGYPITDDLDIYTRLGGM 121


22  BPSL1256  BPSL1297  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
BPSL1256   2       10       -0.330044   putative transposase
BPSL1257   0       11       -0.385199   putative phosphorous compounds
BPSL1258   0       11       -0.215337   putative integral membrane protein/sensor
BPSL1259   0       11       -0.453237   hypothetical protein
BPSL1260   0       11       -0.076568   hypothetical protein
BPSL1261   1       13       0.587988    putative cytochrome c precursor
BPSL1262   2       15       0.187676    putative cytochrome c precursor
BPSL1263   2       14       0.260393    putative membrane protein
BPSL1264   2       14       0.233831    putative cytochrome c oxidase subunit II related
BPSL1265   1       15       0.405095    cytochrome c oxidase subunit 1
BPSL1266   1       14       0.755366    putative cytochrome c related protein
BPSL1267   0       12       0.996182    conserved hypothetical protein
BPSL1268   -2      11       2.031120    conserved hypothetical protein
BPSL1269   -3      11       2.580410    AhpC/TSA family membrane protein
BPSL1270   -3      11       2.346722    conserved hypothetical protein
BPSL1271   -2      13       4.032575    putative transport system, membrane protein
BPSL1272   0       11       4.676667    putative transport system, membrane protein
BPSL1273   0       12       4.137391    putative transport system, membrane protein
BPSL1274   0       12       4.021395    putative IclR-family transcriptional regulatory
BPSL1275   0       14       2.946647    putative exported alkaline phosphatase
BPSL1276   0       14       2.117960    hypothetical protein
BPSL1277   0       13       3.264897    conserved hypothetical protein
BPSL1278   1       12       3.280139    putative exported protein
BPSL1279   1       9        4.278922    hypothetical protein
BPSL1280   0       9        4.281907    putative AsnC-family transcriptional regulator
BPSL1281   0       8        4.373276    putative ABC transport system, ATP-binding
BPSL1282   0       8        5.126541    putative ABC transport system, membrane protein
BPSL1283   0       9        4.609122    putative ABC transport system, iron-binding
BPSL1284   0       8        4.173716    putative membrane protein
BPSL1285   -3      13       0.724654    putative amino acid transport system, membrane
BPSL1286   -1      13       0.885307    putative membrane protein
BPSL1287   4       19       -0.691357   putative exodeoxyribonuclease V gamma chain
BPSL1288   8       21       1.310262    putative exodeoxyribonuclease V beta chain
BPSL1289   5       15       1.400286    putative exodeoxyribonuclease V alpha chain
BPSL1290   3       13       0.483653    putative release factor
BPSL1290a  1       10       1.570935    conserved hypothetical protein
BPSL1291   0       9        2.424103    putative membrane protein
BPSL1292   0       11       3.112377    putative lipoprotein
BPSL1293   0       11       2.400696    osmotically inducible lipoprotein B precursor
BPSL1294   2       12       2.784116    thiamine biosynthesis protein
BPSL1295   4       13       3.945705    hypothetical protein
BPSL1296   3       14       4.141190    putative exported protein
BPSL1297   3       15       3.480253    putative exported protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL1265  PF01540  29  0.015  Adhesin lipoprotein
		>PF01540#Adhesin lipoprotein

Length = 475

Score = 28.9 bits (64), Expect = 0.015
Identities = 26/84 (30%), Positives = 38/84 (45%), Gaps = 3/84 (3%)

Query: 11 RTGRALADLLLKQQDFEVTALVRRPDFA--LPGAKVVVADLTGDFSSAFN-GITHAIYAA 67
+ G+ AD LKQ + L + PD++ L +A+ T F A + G AI +
Sbjct: 35 KNGKEKADAALKQANALAEELKKNPDYSKILETLNKEIAEATKSFKEAGSYGDYPAIISK 94

Query: 68 GSAESEGATEEEQIDRDAVARAAD 91
SA E A E+Q A + AD
Sbjct: 95 LSAAVENAKSEQQKVDQANKKIAD 118


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
BPSL1266	ACRIFLAVINRP	745	0.0	Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 745 bits (1925), Expect = 0.0
Identities = 279/1104 (25%), Positives = 502/1104 (45%), Gaps = 100/1104 (9%)

Query: 3 LARPFITRPVATTLLALGIALAGLFAFVKLPVSPLPQVDFPTILVQASLPGASPETVATS 62
+A FI RP+ +LA+ + +AG A ++LPV+ P + P + V A+ PGA +TV +
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 63 VTSPLERHLGSIADVAEMTSMS-SVGNARIVLQFNLNRDIDGAARDVQAAINAARADLPA 121
VT +E+++ I ++ M+S S S G+ I L F D D A VQ + A LP
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 122 SLKSNPTYRKVNPADSPIMVVSLTS--KTASPAKLYDAASTVLQQSLSQIDGIGQVSLSG 179
++ + S +MV S + + D ++ ++ +LS+++G+G V L G
Sbjct: 121 EVQ-QQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG 179

Query: 180 SANPAVRVELEPQALFHYGIGLEDVRAALASANANSPKGAIEAGP------HRYQLYTND 233
+ A+R+ L+ L Y + DV L N G + P +
Sbjct: 180 AQY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQT 238

Query: 234 QATKAAQYKDLVI-AYRNHAAVSLSDVSSVVDSVEDLRNLGLMNGERAVLVILYRSPGAN 292
+ ++ + + + + V L DV+ V E+ + +NG+ A + + + GAN
Sbjct: 239 RFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGAN 298

Query: 293 IIDTIERVKAALPQLTAALPADIQVTPVLDRSRTIRASLADTEHTLIIAVSLVVMVVFLF 352
+DT + +KA L +L P ++V D + ++ S+ + TL A+ LV +V++LF
Sbjct: 299 ALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLF 358

Query: 353 LRNWRATLIPSVAVPISIVGTFGAMYLLGFSLNNLSLMALIVATGFVVDDAIVVLENIAR 412
L+N RATLIP++AVP+ ++GTF + G+S+N L++ +++A G +VDDAIVV+EN+ R
Sbjct: 359 LQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVER 418

Query: 413 HI-ENGTPRLQAAFDGAREVGFTVLSISLSLVAVFLPILLMGGIVGRLFREFALTLSLAI 471
+ E+ P +A ++ ++ I++ L AVF+P+ GG G ++R+F++T+ A+
Sbjct: 419 VMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAM 478

Query: 472 AVSLVVSLTLTPMMCARLLPEAHAPRDE--GRVARWLERGFEWMQRGYERTLSWALRHPF 529
A+S++V+L LTP +CA LL A E G W F+ Y ++ L
Sbjct: 479 ALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTG 538

Query: 530 TILMTLVATIALNIALYIVVPKGFFPQQDTGLMIGGIQADQTTSFQAMKLRFTEMMRIIR 589
L+ +A + L++ +P F P++D G+ + IQ + + + ++
Sbjct: 539 RYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYL 598

Query: 590 ANP-----NVANVAGFT-GGAQTNSGFMFVALKDKPQR---KLSADQVIQQLRPQLAEVA 640
N +V V GF+ G N+G FV+LK +R + SA+ VI + + +L ++
Sbjct: 599 KNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIR 658

Query: 641 GARTFLQAAQDIRAGGRQSNAQYQFT-LLGDSTAELYKWGP-ILTEALQKRPELADVNSD 698
I G + ++ G L + +L A Q L V +
Sbjct: 659 DGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPN 718

Query: 699 QQQGGLEAMVTIDRATAARLGIKPAQIDNTLYDAFGQRQVSTIYNPLNQYHVVMEVAPQY 758
+ + + +D+ A LG+ + I+ T+ A G V+ + + ++ ++
Sbjct: 719 GLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKF 778

Query: 759 WQSPEMLKQIYISTSGGSASGVQTTNAAAGTYVATTARASTAGAAAQSAAAIAADSARNQ 818
PE + ++Y+ ++ G V + +V + R R
Sbjct: 779 RMLPEDVDKLYVRSANGEM--VPFSAFTTSHWVYGSPRLE-----------------RYN 819

Query: 819 ALNSIASSG--KSSASSGAAVSTSKSTMVPLSAIASFGPSTTPLAVNHQGLFVATTISFN 876
L S+ G SSG A++ ++
Sbjct: 820 GLPSMEIQGEAAPGTSSGDAMALMENLAS------------------------------K 849

Query: 877 LPPGVSLSKATQVIYQTMAEVGVPPTIQGSFQGTAQAFQESLKDQPILILAALAAVYIVL 936
LP G+ G + P L+ + V++ L
Sbjct: 850 LPAGIGY------------------DWTGMSYQERLSG----NQAPALVAISFVVVFLCL 887

Query: 937 GILYESYIHPVTILSTLPSAGVGALLGLLLFKTEFSIIALIGVILLIGIVKKNAIMMVDF 996
LYES+ PV+++ +P VG LL LF + + ++G++ IG+ KNAI++V+F
Sbjct: 888 AALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEF 947

Query: 997 AIDA-SRQGKSSFDAIHEACLLRFRPIMMTTMAALLGALPLAFGRGDGAEMRAPLGIAIA 1055
A D ++GK +A A +R RPI+MT++A +LG LPLA G G+ + +GI +
Sbjct: 948 AKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVM 1007

Query: 1056 GGLIVSQMLTLYTTPVVYLYMDRL 1079
GG++ + +L ++ PV ++ + R
Sbjct: 1008 GGMVSATLLAIFFVPVFFVVIRRC 1031



Score = 96.1 bits (239), Expect = 4e-22
Identities = 83/503 (16%), Positives = 167/503 (33%), Gaps = 25/503 (4%)

Query: 2 NLARPFITRPVATTLLALGIALAGLFAFVKLPVSPLPQVDFPTILVQASLP-GASPETVA 60
N + L+ I + F++LP S LP+ D L LP GA+ E
Sbjct: 528 NSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQ 587

Query: 61 TSVTSPLERHLGSIAD----VAEMTSMSSVGNAR----IVLQFNLNRDIDGAARDVQAAI 112
+ + +L + V + S G A+ + + +G +A I
Sbjct: 588 KVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVI 647

Query: 113 NAARADLPASLKSNPTYRKVNPADSPIMVVSLTSKT-----ASPAKLYDAASTVLQQSLS 167
+ A+ +L + + L A + +L +
Sbjct: 648 HRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQ 707

Query: 168 QIDGIGQVSLSGSAN-PAVRVELEPQALFHYGIGLEDVRAALASANANSPKGAIEAGPHR 226
+ V +G + ++E++ + G+ L D+ +++A +
Sbjct: 708 HPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRV 767

Query: 227 YQLYT---NDQATKAAQYKDLVIAYRNHAAVSLSDVSSVVDSVEDLRNLGLMNGERAVLV 283
+LY L + N V S ++ V L NG ++ +
Sbjct: 768 KKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHW-VYGSPRLERYNGLPSMEI 826

Query: 284 ILYRSPGANIIDTIERVKAALPQLTAALPADIQVTPVLDRSRTIRASLADTEHTLIIAVS 343
+PG + D A + L + LPA I S R S + I+
Sbjct: 827 QGEAAPGTSSGD----AMALMENLASKLPAGIGYD-WTGMSYQERLSGNQAPALVAISFV 881

Query: 344 LVVMVVFLFLRNWRATLIPSVAVPISIVGTFGAMYLLGFSLNNLSLMALIVATGFVVDDA 403
+V + + +W + + VP+ IVG A L + ++ L+ G +A
Sbjct: 882 VVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNA 941

Query: 404 IVVLENI-ARHIENGTPRLQAAFDGAREVGFTVLSISLSLVAVFLPILLMGGIVGRLFRE 462
I+++E + G ++A R +L SL+ + LP+ + G
Sbjct: 942 ILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNA 1001

Query: 463 FALTLSLAIAVSLVVSLTLTPMM 485
+ + + + ++++ P+
Sbjct: 1002 VGIGVMGGMVSATLLAIFFVPVF 1024



Score = 59.9 bits (145), Expect = 5e-11
Identities = 37/225 (16%), Positives = 84/225 (37%), Gaps = 4/225 (1%)

Query: 870 ATTISFNLPPGVSLSKATQVIYQTMAEV--GVPPTIQGS-FQGTAQAFQESLKDQPILIL 926
A + L G + + I +AE+ P ++ T Q S+ + +
Sbjct: 286 AAGLGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLF 345

Query: 927 AALAAVYIVLGILYESYIHPVTILSTLPSAGVGALLGLLLFKTEFSIIALIGVILLIGIV 986
A+ V++V+ + ++ + +P +G L F + + + G++L IG++
Sbjct: 346 EAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLL 405

Query: 987 KKNAIMMVDFAIDASRQGKSSF-DAIHEACLLRFRPIMMTTMAALLGALPLAFGRGDGAE 1045
+AI++V+ + K +A ++ ++ M +P+AF G
Sbjct: 406 VDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGA 465

Query: 1046 MRAPLGIAIAGGLIVSQMLTLYTTPVVYLYMDRLRVWAEKRRDRR 1090
+ I I + +S ++ L TP + + +
Sbjct: 466 IYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGG 510


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
BPSL1267	ACRIFLAVINRP	802	0.0	Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 802 bits (2072), Expect = 0.0
Identities = 284/1035 (27%), Positives = 499/1035 (48%), Gaps = 31/1035 (2%)

Query: 4 SRVFILRPVGTALLMAAIMLAGLVALRFLPLAALPEVDYPTIQVQTFYPGASPEVMTSSV 63
+ FI RP+ +L +M+AG +A+ LP+A P + P + V YPGA + + +V
Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61

Query: 64 TAPLERQFGQMPSLNQMSSQS-SAGASVITLQFSLDLPLDIAEQEVQAAINAAGNLLPSD 122
T +E+ + +L MSS S SAG+ ITL F DIA+ +VQ + A LLP +
Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121

Query: 123 LPAPPIYAKVNPADAPVITLAVTSKTLPLTQ--VQDLADTRLAMKISQVSGVGLVSLSGG 180
+ I + + ++ S TQ + D + + +S+++GVG V L G
Sbjct: 122 VQQQGIS-VEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 181 NRPAVRIQANPLALASYGLNLDDLRTTISNLNVNTPKGNFDGP------TRAYTINANDQ 234
A+RI + L Y L D+ + N G G +I A +
Sbjct: 181 QY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 235 LTSADQYNDAVV-AYKNGRPVMLTDVAKIVAGSENTKLGAWVDAEPAIILNVQRQPGANV 293
+ +++ + +G V L DVA++ G EN + A ++ +PA L ++ GAN
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 294 IQTVDNVKAILPKLQESLPAALDVQIVTDRTTMIRAAVRDVQFELGLAVALVVLVMYLFL 353
+ T +KA L +LQ P + V D T ++ ++ +V L A+ LV LVMYLFL
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 354 ANVYATIIPSLSVPLSLIGTLAVMYLSGFSLNNLSLMALTIATGFVVDDAIVMIENIARY 413
N+ AT+IP+++VP+ L+GT A++ G+S+N L++ + +A G +VDDAIV++EN+ R
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 414 -VEEGDSALEAALKGSKQIGFTIISLTVSLIAVLIPLLFMGDVVGRLFHEFAITLAVTIV 472
+E+ EA K QI ++ + + L AV IP+ F G G ++ +F+IT+ +
Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479

Query: 473 ISAVVSLTLVPMMCAKLLRHTPPPESHRFEAKVHGLIERV----IERYGVALQWVLDRQR 528
+S +V+L L P +CA LL+ E H + G + Y ++ +L
Sbjct: 480 LSVLVALILTPALCATLLK-PVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTG 538

Query: 529 ATLVVAVLTLALTALLYVVIPKGFFPTQDTGVIQAITQAPQSVSYGAMAERQQALAAEIL 588
L++ L +A +L++ +P F P +D GV + Q P + + + L
Sbjct: 539 RYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYL 598

Query: 589 KH--PDVVSLTSFIGVDGANITLNSGRMLINLKPRDERS---ESASDVIRSLQRQVANVT 643
K+ +V S+ + G + N+G ++LKP +ER+ SA VI + ++ +
Sbjct: 599 KNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIR 658

Query: 644 GISLYMQPVQDLTIDSTVSPTQYQFMLTS---PNPDEFATWVPKLVDRLRKEPS-LADVA 699
+ P I + T + F L D +L+ + P+ L V
Sbjct: 659 DGFVI--PFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVR 716

Query: 700 TDLQNSGKSVYIEIDRTSAARFGITPATVDNALYDAYGQRIVSTIFTQSNQYRVILESEP 759
+ +E+D+ A G++ + ++ + A G V+ + ++ ++++
Sbjct: 717 PNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADA 776

Query: 760 QMQHYTDSLNGIYLPSAGGGQVPLSAIATFRERPAPLLVSHLSQFPATTISFNLAPGASL 819
+ + + ++ +Y+ SA G VP SA T + + P+ I APG S
Sbjct: 777 KFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSS 836

Query: 820 GEAVKAIDAAERELGLPASFQTRFQGAALAFQASLSNQLFLILAAIVTMYIVLGVLYESY 879
G+A+ ++ +L PA + G + + S + L+ + V +++ L LYES+
Sbjct: 837 GDAMALMENLASKL--PAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESW 894

Query: 880 IHPITILSTLPSAGVGALLALMITGHDLDIIGIIGIVLLIGIVKKNAIMMIDFALEAERV 939
P++++ +P VG LLA + D+ ++G++ IG+ KNAI++++FA +
Sbjct: 895 SIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEK 954

Query: 940 EGKPPREAIYQACLLRFRPILMTTLAALLGAVPLIVGSGAGSELRQPLGIAIAGGLIVSQ 999
EGK EA A +R RPILMT+LA +LG +PL + +GAGS + +GI + GG++ +
Sbjct: 955 EGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSAT 1014

Query: 1000 VLTLFTTPVIYLGFD 1014
+L +F PV ++
Sbjct: 1015 LLAIFFVPVFFVVIR 1029


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
BPSL1268	RTXTOXIND	48	4e-08	Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 48.3 bits (115), Expect = 4e-08
Identities = 27/149 (18%), Positives = 57/149 (38%), Gaps = 16/149 (10%)

Query: 84 AARGEMPVVLNALGTVTPLANV-TVRTQLSGYLQAVSFQEGQIVKKGDVLAQIDPRP--- 139
+ G++ +V A G +T ++ + ++ + +EG+ V+KGDVL ++
Sbjct: 75 SVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEA 134

Query: 140 ----YQISLANAQGALARDEALLATARLDLKRYQTLVAQ---DSIAKQTADTQASLVKQY 192
Q SL A+ R + L + L+ L + +++++ SL+K+
Sbjct: 135 DTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKE- 193

Query: 193 EGTVQIDRAAIDSAKLNLAYARITAPVSG 221
Q + L + A
Sbjct: 194 ----QFSTWQNQKYQKELNLDKKRAERLT 218



Score = 38.3 bits (89), Expect = 5e-05
Identities = 33/182 (18%), Positives = 61/182 (33%), Gaps = 26/182 (14%)

Query: 141 QISLANAQGALARDEALLAT--ARLDLKRYQTLVAQDSIAKQTADTQASLVKQY-EGTVQ 197
+ ++ + L ++L+ + L A++ T + ++ + + T
Sbjct: 251 KHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDN 310

Query: 198 ID--RAAIDSAKLNLAYARITAPVSGRV-GLRQVDPGNYVTPSDT--------NGIVVIT 246
I + + + I APVS +V L+ G VT ++T + + V
Sbjct: 311 IGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTA 370

Query: 247 QLQPMSVIFTTSEDNLPAILKQVGAGGKLSVTAYNRNNTTPLETGV-LDTLDNQIDTATG 305
+Q + F AI+K V A+ L V LD D G
Sbjct: 371 LVQNKDIGFINVG--QNAIIK---------VEAFPYTRYGYLVGKVKNINLDAIEDQRLG 419

Query: 306 TV 307
V
Sbjct: 420 LV 421


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
BPSL1273	NUCEPIMERASE	35	2e-04	Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 34.8 bits (80), Expect = 2e-04
Identities = 22/126 (17%), Positives = 37/126 (29%), Gaps = 30/126 (23%)

Query: 1 MKIALFGATGMIGSRIAAEAARRGHQVTAL-------------SRNPAASGANVQAKAAD 47
MK + GA G IG ++ GHQV + +R + Q D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 48 LFDPASIA--------------AALAGQDVVASAYGPKQEEASKVVAVAKALVDGARKAG 93
L D + V S P S + +++G R
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLA--VRYSLENPHAYADSNLTGFLN-ILEGCRHNK 117

Query: 94 VKRVVV 99
++ ++
Sbjct: 118 IQHLLY 123


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
BPSL1290	CHLAMIDIAOM6	32	0.007	Chlamydia cysteine-rich outer membrane protein 6 si...
		>CHLAMIDIAOM6#Chlamydia cysteine-rich outer membrane protein 6 signature.

Length = 547

Score = 32.4 bits (73), Expect = 0.007
Identities = 15/61 (24%), Positives = 24/61 (39%), Gaps = 3/61 (4%)

Query: 562 FNLG-LDPDKAREFHDETLPKDSAKVAHFC--SMCGPHFCSMKITQDVREFAAQQGVSEN 618
F LG + P + R E P + + S CG H + +T + E Q ++
Sbjct: 265 FTLGDMQPGEHRTITVEFCPLKRGRATNIATVSYCGGHKNTASVTTVINEPCVQVSIAGA 324

Query: 619 D 619
D
Sbjct: 325 D 325


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
BPSL1296	TCRTETA	45	3e-07	Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 44.8 bits (106), Expect = 3e-07
Identities = 60/266 (22%), Positives = 95/266 (35%), Gaps = 11/266 (4%)

Query: 66 YATGMLVLAPLG----DRFDRRTLILLQIAGLSAALVVAAAAPTLGVLAAASLAIGILAT 121
YA AP+ DRF RR ++L+ +AG + + A AP L VL + GI
Sbjct: 52 YALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGA 111

Query: 122 IAQQAVPFAAEIAPPAARGQAVGTVMSGLLLGILLARTAAGFVAEYFGWRAVFAASVAAL 181
A + A+I R + G + + G++ G + + FAA AAL
Sbjct: 112 TGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAA--AAL 169

Query: 182 AALAAVIVA-RLPRSSPTSTLPYGKLLASMWQLVRELRGLR--EASMTGGAIFAAFSAFW 238
L + LP S P + + R RG+ A M I
Sbjct: 170 NGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVP 229

Query: 239 PVLTLLLAGAPFHLGPQAAGL-FGIVGAAGALAAPY-AGRFADKRGPRAIISLAIALIAA 296
L ++ FH G+ G +LA G A + G R + L +
Sbjct: 230 AALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGT 289

Query: 297 SFAIFALSGASLIGLVIGVIVLDVGV 322
+ + A + + I V++ G+
Sbjct: 290 GYILLAFATRGWMAFPIMVLLASGGI 315


23	BPSL1308	BPSL1335	Y	Y	N	Pathogenicity Island (biased-composition)
LocusTag	DNBias	CDNBias	%GCBias	Product
BPSL1308	3	10	3.664719	putative exported protein
BPSL1309	3	8	3.901009	hypothetical protein
BPSL1310	3	8	3.863018	putative membrane protein
BPSL1311	3	9	2.665934	putative membrane protein
BPSL1312	1	8	3.427092	conserved hypothetical protein
BPSL1313	1	12	4.114057	putative membrane protein
BPSL1314	0	11	3.390294	conserved hypothetical protein
BPSL1315	-1	11	2.606086	putative GntR-family regulatory protein
BPSL1316	-2	11	2.546976	putative AsnC-family regulatory protein
BPSL1317	-3	13	2.779773	glucosamine--fructose-6-phosphate
BPSL1318	0	12	1.451053	putative dioxygenase
BPSL1319	2	10	1.028164	conserved hypothetical protein
BPSL1320	0	8	0.451171	putative LysR-family transcriptional regulator
BPSL1321	-1	10	0.818335	putative dehydrogenase
BPSL1322	-1	11	1.137792	putative lipoprotein
BPSL1323	-1	12	1.328120	conserved hypothetical protein
BPSL1323a	-1	13	2.040378	putative nitrite/sulfite reductase
BPSL1324	0	13	2.344504	putative membrane protein
BPSL1325	1	13	2.969555	putative GntR-family transcriptional regulator
BPSL1326	2	13	4.075055	putative heat shock protein
BPSL1327	3	13	4.317630	putative heat shock protein
BPSL1328	2	15	4.603533	conserved hypothetical protein
BPSL1329	2	13	5.495626	phosphoenolpyruvate carboxykinase [GTP]
BPSL1330	5	14	7.242089	putative 3-hydroxyacyl-CoA dehydrogenase
BPSL1331	5	14	7.086336	putative LysR-family transcriptional regulator
BPSL1332	2	12	6.246158	putative malonate transport-related system
BPSL1333	2	13	5.463083	putative malonate transport-related system
BPSL1334	-1	8	4.622440	putative malonate decarboxylase alpha-subunit
BPSL1335	-1	8	3.633023	putative malonate decarboxylase delta-subunit
ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
BPSL1316	DHBDHDRGNASE	68	2e-15	2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 67.8 bits (165), Expect = 2e-15
Identities = 74/265 (27%), Positives = 119/265 (44%), Gaps = 18/265 (6%)

Query: 1 MADHSIKGKTVIIAGGAKNLGGLIARDLAAQGAQAVAIHYNSAASKGAAETVAAIEAAGA 60
M I+GK I G A+ +G +AR LA+QGA A+ YN + + V++++A
Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYN---PEKLEKVVSSLKAEAR 57

Query: 61 RAVALQADLTAAGAVEKLFVDTVAAIGRPDIAINTVGKVLKKPFVEITEAEYDEMAAVNS 120
A A AD+ + A++++ +G DI +N G + +++ E++ +VNS
Sbjct: 58 HAEAFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNS 117

Query: 121 KTAFFFLKEAGRHVND--NGKIVTLVTSLLGAFTPFYAAYAGMKAPVEHFTRAAAKEFGA 178
F + +++ D +G IVT+ ++ G AAYA KA FT+ E
Sbjct: 118 TGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAE 177

Query: 179 RGISVTAVGPGPMDTPFFYPAEGADAVAYHKTAAALSPFSKTGL--------TDIGDVVP 230
I V PG +T + + A +L F KTG+ +DI D V
Sbjct: 178 YNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETF-KTGIPLKKLAKPSDIADAVL 236

Query: 231 FIRHLVSD-GWWITGQTILINGGYT 254
F LVS IT + ++GG T
Sbjct: 237 F---LVSGQAGHITMHNLCVDGGAT 258


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
BPSL1317	IGASERPTASE	45	6e-07	IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 45.1 bits (106), Expect = 6e-07
Identities = 36/246 (14%), Positives = 71/246 (28%), Gaps = 4/246 (1%)

Query: 204 EPAETAEGAPMKLKTPAAPTPPAAPVPASSAAPGTSASSAVAAPAAAGSGPAASAPAAPV 263
P+ + + A PPA P+ + S + A A
Sbjct: 1007 VPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNR 1066

Query: 264 RHAAPAPASATAAASAPTAASAPAPTPASAPAPASTPAPASAPTPASAPTPTPASAPTPA 323
A A ++ A A + + T + A A T P
Sbjct: 1067 EVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVT 1126

Query: 324 SIPAPAPASAPASTPAPASAPAPAPTTNPASSIAPAAAPFASAIPPARAEKFAPAVTATT 383
S +P + P A PT N + + P A++ + V
Sbjct: 1127 SQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQP---AKETSSNVEQPV 1183

Query: 384 AGSASTPASAAAPSSPSSPWLPPLLPPLLSPDAPSPPADTARTAPLAPAASPATAAAAAT 443
S + + +P + P P ++ ++ + P + R + + + A ++
Sbjct: 1184 TESTTVNTGNSVVENPENT-TPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSN 1242

Query: 444 NATATA 449
+ + A
Sbjct: 1243 DRSTVA 1248



Score = 40.8 bits (95), Expect = 1e-05
Identities = 32/219 (14%), Positives = 59/219 (26%), Gaps = 11/219 (5%)

Query: 182 DPTRRDKAAVKAAEKERVAPLPEPAETAEGAPMKLKTPAAPTPPAAPVPASSAAPGTSAS 241
+ T +++ K A K V + E A+ +T T A V A +
Sbjct: 1060 ETTAQNREVAKEA-KSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEK 1118

Query: 242 SAVAAPAAAGSGPAASAPAAPVRHAAPAPASATAAASAPTAASAPAPTPASAPAPASTPA 301
+ + P A PA + PT + + A PA
Sbjct: 1119 TQEVPKVTSQVSPKQEQSETVQPQAEPAR------ENDPTVNIKEPQSQTNTTADTEQPA 1172

Query: 302 PASAPTPASAPTPTPASAPTPASIPAPAPASAPASTPAPASAPAPAPTTNPASSIAPAAA 361
++ T + + + P + + P S + P S+
Sbjct: 1173 KETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPH 1232

Query: 362 ---PFASAIPPARAEKFAPAVTATTAGSASTPASAAAPS 397
P ++ + T + + A A A
Sbjct: 1233 NVEPATTSSNDRSTVALCDLTSTNT-NAVLSDARAKAQF 1270


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
BPSL1324	FLGMOTORFLIG	34	0.002	Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 33.6 bits (77), Expect = 0.002
Identities = 13/49 (26%), Positives = 24/49 (48%), Gaps = 3/49 (6%)

Query: 571 SPAQYAQVTSMNPDEWRAELALHAELFDKLSARLPDALAETKARIEKRL 619
P + + + S P E + +A L D+ S P+ + E + +EK+L
Sbjct: 148 DPQKASFILSSLPTEVQTNVARRIALMDRTS---PEVVREVERVLEKKL 193


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
BPSL1329	ADHESNFAMILY	30	0.019	Adhesin family signature.
		>ADHESNFAMILY#Adhesin family signature.

Length = 309

Score = 30.2 bits (68), Expect = 0.019
Identities = 28/128 (21%), Positives = 42/128 (32%), Gaps = 19/128 (14%)

Query: 390 LKAGEEADARTPAA---LRRGRKLVVQIGE----------TFGEKNAPMFVEQLDALRLA 436
L+ E P A L G I + F EKN + ++LD L
Sbjct: 127 LEGQNEKGKEDPHAWLNLENGIIFAKNIAKQLSAKDPNNKEFYEKNLKEYTDKLDKLDKE 186

Query: 437 DKLALDLAPVMVYGDDVTHVVTEEGIANLLMCRDADEREHAIRGVAGYTEIGRGRDRRLV 496
K + P + +VT EG + I + E + + LV
Sbjct: 187 SKDKFNKIP-----AEKKLIVTSEGAFKYF-SKAYGVPSAYIWEINTEEEGTPEQIKTLV 240

Query: 497 ERLRERGV 504
E+LR+ V
Sbjct: 241 EKLRQTKV 248


24	BPSL1382	BPSL1414	Y	Y	Y	Pathogenicity Island (biased-composition)
LocusTag	DNBias	CDNBias	%GCBias	Product
BPSL1382	2	25	-4.820206	conserved hypothetical protein
BPSL1383	3	30	-6.090239	putative acetyl-CoA synthetase
BPSL1384	0	34	-5.967264	subfamily M23B unassigned peptidase
BPSL1385	2	35	-6.262588	conserved hypothetical protein
BPSL1387	2	32	-4.136991	putative undecaprenol kinase
BPSL1388	3	37	-6.950059	*hypothetical protein
BPSL1389	4	37	-7.125950	hypothetical protein
BPSL1390	1	29	-5.739753	hypothetical protein
BPSL1391	-2	18	-1.339797	hypothetical protein
BPSL1392	-2	13	-0.142617	hypothetical protein
BPSL1393	-2	10	0.336450	hypothetical protein
BPSL1394	-2	7	2.459480	putative exported avidin family protein
BPSL1395	-1	8	2.668923	putative exported endonuclease
BPSL1396	-1	7	4.191245	histone deacetylase family protein
BPSL1397	-1	7	2.203748	putative membrane protein
BPSL1398	0	8	1.501022	putative porin signal peptide protein
BPSL1399	0	9	-0.093658	putative GerE-family transcriptional regulator
BPSL1401	2	8	-2.451877	putative reductase
BPSL1403	3	10	-4.015592	putative MarR-family transcriptional regulator
BPSL1404	3	11	-4.033381	putative kinase
BPSL1405	2	12	-2.866042	trigger factor
BPSL1406	-2	13	-0.945747	ATP-dependent Clp protease proteolytic subunit
BPSL1406A	-1	12	0.185590	ATP-dependent Clp protease ATP-binding subunit
BPSL1408	-3	11	0.304261	ATP-dependent protease
BPSL1409	-2	11	2.379443	*hypothetical protein
BPSL1411	-1	12	3.417482	hypothetical protein
BPSL1412	0	12	3.738862	hypothetical protein
BPSL1413	0	13	3.307650	hypothetical protein
BPSL1414	-1	11	3.789647	*putative peptidyl-prolyl cis-trans isomerase
ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
BPSL1396	TCRTETA	31	0.009	Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 30.9 bits (70), Expect = 0.009
Identities = 38/135 (28%), Positives = 57/135 (42%), Gaps = 4/135 (2%)

Query: 254 AQTSGNVLAIASLMGIAGAALASYLGGRAARRAMLLAGYGILAASLVALAAAPNANGYTL 313
G +LA+ +LM A A + L R RR +LL A +A AP +
Sbjct: 42 TAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYI 101

Query: 314 A--IFGFKFAWTFVLPFMLASVAAVDATGRLIATLNLVIGSGLAAGPLAAGLMLDGGGTL 371
+ G A V +A + D R ++ G G+ AGP+ GLM GG +
Sbjct: 102 GRIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLM--GGFSP 159

Query: 372 RALFSIAAAVSLVSL 386
A F AAA++ ++
Sbjct: 160 HAPFFAAAALNGLNF 174


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
BPSL1397	ECOLNEIPORIN	67	1e-14	E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 67.1 bits (164), Expect = 1e-14
Identities = 72/323 (22%), Positives = 118/323 (36%), Gaps = 37/323 (11%)

Query: 20 AATLAALSGPAHAQSTLTLYGVADAGVQYLSRADGRHAAWRLQN-----YGILPSQLGIK 74
A TLAAL P A + +TLYG AGV SR+ + A L S++G K
Sbjct: 7 ALTLAAL--PVAAMADVTLYGTIKAGV-ETSRSVAHNGAQAASVETGTGIVDLGSKIGFK 63

Query: 75 GEEDLGGGWRARFQLEQGINLNDGTATVPGYAFFRGAYVGMGGPAGTVTLGRQFSTLFDK 134
G+EDLG G +A +Q+EQ ++ + R +++G+ G G + +GR S L D
Sbjct: 64 GQEDLGNGLKAIWQVEQKASIAGTDSGW----GNRQSFIGLKGGFGKLRVGRLNSVLKDT 119

Query: 135 TLFYDPLWYASYSGQGVLVPLSANFVDHSIKFQSATFAGFDVEALAAMAGIAGNTRAGRV 194
+P S + + S+++ S FAG ++ A N AGR
Sbjct: 120 GDI-NPWDSKSDYLGVNKIAEPEARLI-SVRYDSPEFAGL-SGSVQ----YALNDNAGRH 172

Query: 195 ------LELGGQFTSRGLSASAVLHRSH-GTAQGGADRSAQRRDIGTFAARYAFASLPLT 247
+ + R H ++ R + + +AS+ +
Sbjct: 173 NSESYHAGFNYKNGGFFVQYGGAYKRHHQVQENVNIEKYQIHRLVSGYDNDALYASVAVQ 232

Query: 248 VHAGVQRLTGELDPARTIV-------WGGARYQASGRFGFAGGIYHTDSPTPQVGHPTLF 300
++T V +G + S GF G T+
Sbjct: 233 QQDAKLVEENYSHNSQTEVAATLAYRFGNVTPRVSYAHGFKGSFDATNYNN----DYDQV 288

Query: 301 IASTTCSLSKRTVAYLNLGYAKN 323
+ SKRT A ++ G+ +
Sbjct: 289 VVGAEYDFSKRTSALVSAGWLQE 311


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
BPSL1404	HTHFIS	31	0.008	FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 31.3 bits (71), Expect = 0.008
Identities = 19/112 (16%), Positives = 40/112 (35%), Gaps = 12/112 (10%)

Query: 58 EAAAAGVEASLSKSDLPSPQEIRDILDQYVIGQERAKKILAVAVYNHYKRL-------KH 110
+A+ G L K E+ I+ + + +R L + + +
Sbjct: 92 KASEKGAYDYLPKP--FDLTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEI 149

Query: 111 LDKKDDVELSKSNILLIGPTGSGKTLLAQTLARL---LNVPFVIADATTLTE 159
+ + +++ G +G+GK L+A+ L N PFV + +
Sbjct: 150 YRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPR 201


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
BPSL1405	GPOSANCHOR	40	3e-05	Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 40.4 bits (94), Expect = 3e-05
Identities = 35/192 (18%), Positives = 73/192 (38%), Gaps = 15/192 (7%)

Query: 92 KVLVEGLQRAQALSIEEQETQFSCEVMPLEPDHADSAETEALRRAIVSQFDQYVKLNKKI 151
L + L+ A S + + E A A+ E ++ K +
Sbjct: 193 AELEKALEGAMNFSTADSAKIKTLEAE-KAALAARKADLEKALEGAMNFSTADSAKIKTL 251

Query: 152 PPEILTSLSGIDEAGRLADTIAAHLPLKLDQKQHILEMFPVIERLEHLLAQLEAEIDILQ 211
E + E + + + + + LE A LE + +L
Sbjct: 252 EAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAE---KAALEAEKADLEHQSQVLN 308

Query: 212 VEKRIRGRVKRQMEKSQREYYLNEQVKAIQKELGEGEEGAD--LEELEKRINAARMPKEA 269
R ++R ++ S+ +Q++A ++L E + ++ + L + ++A+R EA
Sbjct: 309 AN---RQSLRRDLDASREAK---KQLEAEHQKLEEQNKISEASRQSLRRDLDASR---EA 359

Query: 270 KKKADAELKKLK 281
KK+ +AE +KL+
Sbjct: 360 KKQLEAEHQKLE 371


25	BPSL1454	BPSL1459	Y	N	N	Genomic Island
LocusTag	DNBias	CDNBias	%GCBias	Product
BPSL1454	2	11	2.511144	conserved hypothetical protein
BPSL1455	2	12	1.523033	2-hydroxy-3-oxopropionate reductase
BPSL1456	3	14	0.441569	hydroxypyruvate isomerase
BPSL1457	2	14	-1.264134	glyoxylate carboligase
BPSL1458	1	15	-4.887931	putative LysR-family transcriptional regulator
BPSL1459	-1	14	-3.931694	putative cytochrome C oxidase
26	BPSL1568	BPSL1581	Y	Y	N	Pathogenicity Island (biased-composition)
LocusTag	DNBias	CDNBias	%GCBias	Product
BPSL1568	-2	12	3.293041	putative membrane protein
BPSL1569	-2	13	3.737701	putative transcriptional regulatory protein
BPSL1570	0	14	4.274412	putative MerR-family transcriptional regulator
BPSL1571	0	14	3.892777	putative outer membrane lipoprotein
BPSL1572	1	13	4.113681	putative drug-resistance cell envelope-related
BPSL1573	1	12	3.447494	putative lipoprotein
BPSL1574	1	13	3.237267	putative TetR-family regulatory protein
BPSL1575	1	12	2.140202	putative transport-related, membrane protein
BPSL1576	1	11	1.980044	putative membrane protein
BPSL1577	2	9	2.574119	putative LysR-family transcriptional regulator
BPSL1578	-1	9	1.853336	putative membrane protein
BPSL1579	1	10	2.776247	conserved hypothetical protein
BPSL1580	2	12	2.375277	putative sugar kinase
BPSL1581	3	12	2.447058	putative transport related, membrane protein
ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
BPSL1568	RTXTOXIND	38	6e-05	Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 37.5 bits (87), Expect = 6e-05
Identities = 30/198 (15%), Positives = 58/198 (29%), Gaps = 28/198 (14%)

Query: 1 MNRSGSRAALLIGVALIAAACHRKEAAPSAPRPVVAVPAQADGAAAAVSLPGEIQPRYAT 60
+ SR L+ ++ + +VA A+G EI+P
Sbjct: 49 IETPVSRRPRLVAYFIMGFLVIAFILSVLGQVEIVAT---ANGKLTHSGRSKEIKP---- 101

Query: 61 PLSFRIAGKLVER-KVRLGDIVKKGQVVALLDTSDVARNAASAQAQLDAATHALTFAQQQ 119
I +V+ V+ G+ V+KG V+ L + Q+ L A +Q
Sbjct: 102 -----IENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQAR-----LEQT 151

Query: 120 RERDRAQARENLIAPAQLEQTENAYASARAQRDQAAQQLA----------LAKNQLQYAT 169
R + +++ E P E + + + L + +L
Sbjct: 152 RYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDK 211

Query: 170 LVADHAGYITAEQADTGQ 187
A+ +
Sbjct: 212 KRAERLTVLARINRYENL 229



Score = 34.8 bits (80), Expect = 5e-04
Identities = 10/71 (14%), Positives = 27/71 (38%)

Query: 100 ASAQAQLDAATHALTFAQQQRERDRAQARENLIAPAQLEQTENAYASARAQRDQAAQQLA 159
+ A+++ + + + + + + IA + + EN Y A + QL
Sbjct: 217 LTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLE 276

Query: 160 LAKNQLQYATL 170
++++ A
Sbjct: 277 QIESEILSAKE 287


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
BPSL1569	HTHTETR	62	6e-14	TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 61.6 bits (149), Expect = 6e-14
Identities = 41/203 (20%), Positives = 76/203 (37%), Gaps = 10/203 (4%)

Query: 11 RLTREQSKDLTRERLLSAAHAIFTKKGYVAASVEDIASAAGYTRGAFYSNFRSKAELLIE 70
R T++++++ TR+ +L A +F+++G + S+ +IA AAG TRGA Y +F+ K++L E
Sbjct: 3 RKTKQEAQE-TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 71 LLKRDHEEAEADLQKIFE--SGGTREQMEA---HALEYYSQFFRNNPAFLLWGEAKLQAT 125
+ + + G + H LE R +
Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVG 121

Query: 126 RDAKFRARFNEFVKEKRDRFTHYILTFAERVGTPLLLPADVLALGLMSLCDGVQSYHAAD 185
A + E DR + E P L A+ + G+
Sbjct: 122 EMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFA 181

Query: 186 PRHVTGDAAQQVLAGFFARVVLA 208
P+ D ++ A + ++L
Sbjct: 182 PQSF--DLKKE--ARDYVAILLE 200


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
BPSL1576	TCRTETB	31	0.009	Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 31.0 bits (70), Expect = 0.009
Identities = 32/139 (23%), Positives = 53/139 (38%), Gaps = 2/139 (1%)

Query: 244 IGVYGFVLWLPSIVKNGSALGMVATGWLSALP-YLAATIAMLAASWASDRLGSRKGFVWP 302
V GFV +P ++K+ L G + P ++ I DR G
Sbjct: 270 GTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIG 329

Query: 303 FLLIGAAAFAASYTLGSTHFWLSYALLVVAGAAMYAPYGPFFAIVPELLPKNVAGGAMAL 362
+ + AS+ L +T ++++ ++ V G + IV L + AG M+L
Sbjct: 330 VTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKT-VISTIVSSSLKQQEAGAGMSL 388

Query: 363 INSMGALGSFVGSYAVGYL 381
+N L G VG L
Sbjct: 389 LNFTSFLSEGTGIAIVGGL 407



Score = 30.6 bits (69), Expect = 0.013
Identities = 24/140 (17%), Positives = 55/140 (39%), Gaps = 5/140 (3%)

Query: 35 AAAGINQDLGISKGLSSLIGALFFLGYFFFQIPGAIYAERRSVKTLVFWSLVLWGACASL 94
+ I D ++ + F L + +++ +K L+ + +++ S+
Sbjct: 36 SLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCF-GSV 94

Query: 95 TGVV--SNIPSLMAIRFLLGVVEAAVMPAML-IFISNWFTKRERSRANTFLILGNPVTVL 151
G V S L+ RF+ G AA PA++ + ++ + K R +A + +
Sbjct: 95 IGFVGHSFFSLLIMARFIQGA-GAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEG 153

Query: 152 WMSVVSGYLVHEFGWRHMFV 171
+ G + H W ++ +
Sbjct: 154 VGPAIGGMIAHYIHWSYLLL 173



Score = 30.6 bits (69), Expect = 0.014
Identities = 27/109 (24%), Positives = 46/109 (42%), Gaps = 6/109 (5%)

Query: 268 TGWLSALPYLAATIAMLAASWASDRLGSRKGFVWPFLLIGAAAFAASYTLGSTHF-WLSY 326
T W++ L +I SD+LG ++ ++ ++ + +G + F L
Sbjct: 51 TNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGF--VGHSFFSLLIM 108

Query: 327 ALLVVA-GAAMYAPYGPFFAIVPELLPKNVAGGAMALINSMGALGSFVG 374
A + GAA + +V +PK G A LI S+ A+G VG
Sbjct: 109 ARFIQGAGAAAFPAL--VMVVVARYIPKENRGKAFGLIGSIVAMGEGVG 155


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
BPSL1578	HTHTETR	35	3e-04	TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 34.6 bits (79), Expect = 3e-04
Identities = 15/107 (14%), Positives = 40/107 (37%), Gaps = 1/107 (0%)

Query: 12 ATISDVAREAGTGKTSVSRYLNGETNVLSADLRQRIETAIERLNYRPNQMARGL-KRGRN 70
++ ++A+ AG + ++ + ++++ S E + R
Sbjct: 32 TSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQAKFPGDPLSVLRE 91

Query: 71 RLLGLLAADLTNPYTVEVLRGVEAACHALGYMPLICHAANELEMERR 117
L+ +L + +T ++ + C +G M ++ A L +E
Sbjct: 92 ILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCLESY 138


27  BPSL1600  BPSL1735  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BPSL1600   -1      18       3.464043   putative transport-related, membrane protein
BPSL1601   -1      17       2.486968   putative membrane linked regulatory protein
BPSL1602   -1      17       0.504716   putative cytochrome c-related lipoprotein
BPSL1603   -2      16       1.230933   hypothetical protein
BPSL1604   -2      14       1.883884   hypothetical protein
BPSL1605   -2      14       1.913721   putative lipoprotein
BPSL1606   -3      11       2.008416   putative membrane protein
BPSL1607   -3      10       2.204352   putative copper-related ABC transport system,
BPSL1609   0       8        4.266490   putative copper-binding periplasmic protein
BPSL1611   2       8        4.331763   nitrous-oxide reductase precursor
BPSL1612   1       8        3.496293   putative thiamine biosynthesis lipoprotein
BPSL1613   0       8        2.166733   putative ABC transport system, ATP-binding
BPSL1614   0       8        0.263175   putative ABC transport system, membrane protein
BPSL1615   -1      10       0.627445   putative methyl-accepting chemotaxis
BPSL1616   -1      15       -1.835960  hypothetical protein
BPSL1618   3       28       -3.849116  putative glycerol utilisation-related protein
BPSL1619   3       29       -2.485158  hypothetical protein
BPSL1621   3       30       -3.254446  putative iron-sulphur protein
BPSL1622   4       29       -4.244813  putative transport related membrane protein
BPSL1623   5       22       -1.992497  putative hydroxylase
BPSL1624   5       20       -2.190163  hypothetical protein
BPSL1625   5       21       -2.415295  hypothetical protein
BPSL1626   7       18       -1.787895  putative exported protein
BPSL1627   6       19       -2.040342  subfamily S9C non-peptidase homologue
BPSL1628   5       19       -2.073629  hypothetical protein
BPSL1629   1       17       -2.081588  conserved hypothetical protein
BPSL1630   1       18       -2.199862  putative fimbrial subunit type 1 precursor
BPSL1631   1       17       -2.171967  putative fimbrial assembly chaperone precursor
BPSL1632   0       22       -5.239504  putative fimbrial usher protein
BPSL1633   0       22       -4.999270  putative exported fimbria-related protein
BPSL1634   0       25       -5.007269  hypothetical protein
BPSL1635   4       43       -8.618853  putative outer membrane protein
BPSL1636   6       45       -9.402427  putative membrane protein
BPSL1637   4       41       -8.448201  putative two-component system, response
BPSL1638   1       34       -5.000685  putative two-component regulatory system, sensor
BPSL1639   0       30       -5.404524  conserved hypothetical protein
BPSL1642   0       29       -5.179398  putative regulatory protein
BPSL1643   -1      28       -5.191263  putative lipase
BPSL1644   -1      26       -5.450408  putative GntR-family regulatory protein
BPSL1645   0       25       -5.485458  conserved hypothetical protein
BPSL1646   0       24       -5.553227  putative hydrolase
BPSL1647   1       26       -5.731435  putative oxygenase
BPSL1649   2       31       -6.223200  putative monooxygenase
BPSL1650   4       33       -6.594777  putative betaine aldehyde dehydrogenase
BPSL1651   4       36       -7.235106  conserved hypothetical protein
BPSL1652   5       37       -7.355403  putative ABC transport system, substrate-binding
BPSL1653   5       36       -7.440308  putative ABC transport system, permease protein
BPSL1654   7       37       -8.063722  putative ABC transport system, permease protein
BPSL1655   7       36       -8.376334  putative ABC transport system, ATP-binding
BPSL1656   7       34       -6.586975  putative GntR-family regulatory protein
BPSL1657   11      21       -3.836933  succinate-semialdehyde dehydrogenase [NADP+]
BPSL1658   11      21       -3.808691  putative outer membrane porin protein
BPSL1658a  10      21       -3.828002  hypothetical protein
BPSL1659   10      21       -3.671815  insertion element hypothetical protein
BPSL1660   10      20       -3.447840  insertion element hypothetical protein
BPSL1661   8       21       -3.417970  putative DNA-biding protein, H-NS-like
BPSL1662   3       29       -3.971740  putative exported protein
BPSL1663   3       30       -4.058074  putative outer membrane protein
BPSL1664   4       30       -3.884264  putative hemolysin-related protein
BPSL1665   4       33       -4.880353  hypothetical protein
BPSL1667   6       46       -6.526209  conserved hypothetical protein
BPSL1668A  7       47       -8.974114  putative toxin transport-related membrane
BPSL1669   6       34       -7.605032  putative toxin-related secretion protein
BPSL1670   6       29       -5.830856  hypothetical protein
BPSL1671   2       20       -4.306631  hypothetical protein
BPSL1672   1       14       -2.397443  putative adenylylsulfate kinase
BPSL1673   -2      9        -0.518903  putative two component system, response
BPSL1674   -2      11       -0.277496  putative transposase
BPSL1675   -1      10       -0.032907  putative exported protein
BPSL1676   -1      8        0.514320   hypothetical protein
BPSL1677   0       8        0.494171   putative outer membrane porin protein precursor
BPSL1678   0       10       0.165058   putative transposase
BPSL1679   2       15       -0.381666  putative exported histidine ammonia-lyase
BPSL1680   2       18       -0.954050  putative transport-related, integral membrane
BPSL1681   1       19       -1.181733  putative urocanate hydratase
BPSL1682   2       25       -1.550683  conserved hypothetical protein
BPSL1683   4       30       -2.475497  putative LysR-family transcriptional regulatory
BPSL1684   5       28       -2.348085  putative allantoinase
BPSL1685   2       23       -1.430051  family M20 unassigned peptidase
BPSL1686   3       24       -2.073099  putative RNA polymerase sigma factor
BPSL1687   3       25       -1.651111  putative membrane protein
BPSL1688   4       27       -2.844739  putative membrane protein
BPSL1689   3       28       -3.797002  putative membrane protein
BPSL1690   4       32       -5.524330  conserved hypothetical protein
BPSL1691   7       42       -8.215100  conserved hypothetical protein
BPSL1692   9       41       -7.534296  putative exported protein
BPSL1693   7       41       -7.934811  putative GntR-family transcriptional regulator
BPSL1694   7       44       -7.654604  putative recombinase
BPSL1695   8       52       -8.789901  hypothetical protein
BPSL1696   7       56       -8.372665  putative invertase
BPSL1699   9       58       -9.366284  transposase
BPSL1700   6       43       -6.212944  hypothetical protein
BPSL1702   6       44       -6.573068  putative membrane protein
BPSL1703   8       41       -6.749815  putative HNS-like protein
BPSL1704   8       40       -6.639041  putative exported oxidase
BPSL1705   9       39       -5.982484  putative exported protein
BPSL1706   4       27       -2.515463  putative penicillin amidase
BPSL1707   2       18       -2.129700  putative carbamoyl transferase
BPSL1708   -2      10       3.875036   putative non-ribosomal antibiotic-related
BPSL1708A  -2      11       3.972517   putative acetyltransferase
BPSL1709   -2      11       4.052280   putative threonine aldolase
BPSL1710   -2      11       4.263702   putative bifunctional protein (ligase and
BPSL1711   -2      12       4.076281   putative cysteine synthase
BPSL1712   -1      12       4.497125   hypothetical protein
BPSL1713   -3      11       2.572229   putative membrane protein
BPSL1715   -1      13       1.837560   putative kinase
BPSL1717   0       12       -1.619484  putative argininosuccinate lyase
BPSL1719   -2      7        -2.984944  putative argininosuccinate synthase
BPSL1720   -2      7        -4.004087  putative formyl transferase
BPSL1721   -2      10       -5.481077  hypothetical protein
BPSL1722   -1      15       -3.562750  putative histidinol-phosphate aminotransferase
BPSL1723   0       14       -2.381969  conserved hypothetical protein
BPSL1724   1       17       -0.578132  conserved hypothetical protein
BPSL1725   1       14       1.124614   putative non-ribosomal peptide synthase
BPSL1726   -1      13       2.445532   putative exported porin
BPSL1728   -3      10       2.272205   putative AraC-family transcriptional regulator
BPSL1730   -1      10       3.943352   putative transmembrane protein
BPSL1731   -1      12       3.265169   chemotaxis protein CheW2
BPSL1732   -2      12       3.130975   putative methyl-accepting chemotaxis citrate
BPSL1734   0       13       4.654710   conserved hypothetical protein
BPSL1735   0       14       4.818525   putative AMP-binding enzyme
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL1610  ABC2TRNSPORT  56     2e-11    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 56.1 bits (135), Expect = 2e-11
Identities = 62/201 (30%), Positives = 100/201 (49%), Gaps = 10/201 (4%)

Query: 17 ASPLRILFGLTQPLLYLFVLGAALRSGTYAEIGG--YQAYIFPGVVGLSLM----FTAIS 70
A+ +L L +PL+YLF LGA L +GG Y A++ G+V S M F I
Sbjct: 30 AALASLLGHLAEPLIYLFGLGAGL-GVMVGRVGGVSYTAFLAAGMVATSAMTAATFETIY 88

Query: 71 AAVGIVHDRQTGLLNALLVSPVRRVDIALGKIGAGALLAWLQALLLLPFSPAIGIGLTAP 130
AA G + ++T A+L + +R DI LG++ A A L + + A+G
Sbjct: 89 AAFGRMEGQRT--WEAMLYTQLRLGDIVLGEMAWAATKAALAGAGIGVVAAALGY-TQWL 145

Query: 131 RLALLVAAMAFAALAFSALGLALALPFRSVIVFPVVSNTLLLPMFFLSGGLYPLDLAPDW 190
L + +A LAF++LG+ + S F ++ P+ FLSG ++P+D P
Sbjct: 146 SLLYALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIV 205

Query: 191 IRAAAAFDPAAYGVDLMRGVL 211
+ AA F P ++ +DL+R ++
Sbjct: 206 FQTAARFLPLSHSIDLIRPIM 226


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL1615  PF07675  31     0.008    Cleaved Adhesin
		>PF07675#Cleaved Adhesin

Length = 1358

Score = 30.8 bits (69), Expect = 0.008
Identities = 21/59 (35%), Positives = 28/59 (47%), Gaps = 3/59 (5%)

Query: 181 DYVIADPEPRGGRLAME-RGVTWAARRHDHRF--GAHYPWTLRLTPPQDGAPASVEIDT 236
DY I +PEP G++ + G AR D F G Y +T+R DG VE D+
Sbjct: 472 DYCITNPEPASGKMWIAGDGGNQPARYDDFAFEAGKKYTFTMRRAGMGDGTDMEVEDDS 530


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL1628  PF00577  787    0.0      Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 787 bits (2034), Expect = 0.0
Identities = 292/873 (33%), Positives = 450/873 (51%), Gaps = 47/873 (5%)

Query: 11 FSRIRVTMLAAALTALSATAR----GQQALEFDPAFLELGGGQGGADLSVYATSNRVLPG 66
+ R+ L A A L F+P FL Q ADLS + + PG
Sbjct: 18 IRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLA-DDPQAVADLSRFENGQELPPG 76

Query: 67 VYPISVFVNGEAIERRDITFVSESARDGREDAIPCLSARMFDEWGVDIAAFAKLAQAGED 126
Y + +++N + RD+TF D + +PCL+ G++ A+ + + +D
Sbjct: 77 TYRVDIYLNNGYMATRDVTFN---TGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADD 133

Query: 127 ACVDIADSVPHARTEFDSHQLRLNVTVPQAALKRRARGAVDPARWDQGIDAALLDYQLSA 186
ACV + + A + D Q RLN+T+PQA + RARG + P WD GI+A LL+Y S
Sbjct: 134 ACVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSG 193

Query: 187 AQYAGGNFASARSRTTLYAGLRGAVNLGAWRLSHTSSFLRGL-----DGRNRFQIVNTFV 241
+ Y L+ +N+GAWRL +++ +N++Q +NT++
Sbjct: 194 NSVQNR---IGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWL 250

Query: 242 QRDIAGWNSRLTAGEGTTPANIFDGFQFLGVQLNTDETMLPDSLQGYAPTVHGVAQTNAQ 301
+RDI SRLT G+G T +IFDG F G QL +D+ MLPDS +G+AP +HG+A+ AQ
Sbjct: 251 ERDIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQ 310

Query: 302 VTIRQNGFVIYSTYVPPGPFTIDDLYPTSSSGNLEVTITEADGHVTTFTQPYSAVPMLLR 361
VTI+QNG+ IY++ VPPGPFTI+D+Y +SG+L+VTI EADG FT PYS+VP+L R
Sbjct: 311 VTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQR 370

Query: 362 DGSWRYNVTAGQYR-DGISGSHPSFAMATLARGLAGEFSLYGGFIGAGMYQSVLVGIGKN 420
+G RY++TAG+YR P F +TL GL +++YGG A Y++ GIGKN
Sbjct: 371 EGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKN 430

Query: 421 LGSIGAVSLDVTHARSAVDLADSSTVSGHAFRVLYAKAVGSWGTDFRLLAYRYSTAGYRS 480
+G++GA+S+D+T A S L D S G + R LY K++ GT+ +L+ YRYST+GY +
Sbjct: 431 MGALGALSVDMTQANS--TLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFN 488

Query: 481 FADAVQLRDGSEPAAL------------------GAKRQRLEGTVNQRLGRLGSMYATVA 522
FAD R KR +L+ TV Q+LGR ++Y + +
Sbjct: 489 FADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGS 548

Query: 523 VQTYWGSAARSTVYQLGHSGNWGRASYGLYAAYSKGSGVPSSWN-VSLSLSMPLEVFFGG 581
QTYWG++ +Q G + + ++ L + +K + ++L++++P +
Sbjct: 549 HQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRS 608

Query: 582 ARVRAPAGGSANVSYFVSRNNENHVNQQMTASGSSSEQ-RLNYSVGVAHS----SESDVS 636
A+ SY +S + + G+ E L+YSV ++ S +
Sbjct: 609 DSKSQW--RHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGST 666

Query: 637 GSVSASYLAPFGRYDASIGSGRGYTQAAFTAAGGMLWHGTGVLFTQPLGETVAVVDVPNV 696
G + +Y +G + Q + +GG+L H GV QPL +TV +V P
Sbjct: 667 GYATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGA 726

Query: 697 RGVRFEMHPGVSTDRAGEAVIPRLNPYRVNRIAVDQRRMPQDVEIRNPVSEVVPTRAAVV 756
+ + E GV TD G AV+P YR NR+A+D + +V++ N V+ VVPTR A+V
Sbjct: 727 KDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIV 786

Query: 757 QTHFDSVVGLRALFTLMRADGSFPPQGATAENDEGQVLGVVGMDGETFVAGLPAAEGHFV 816
+ F + VG++ L TL + P GA ++ Q G+V +G+ +++G+P A G
Sbjct: 787 RAEFKARVGIKLLMTL-THNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLA-GKVQ 844

Query: 817 VRWGAARQNRCRVNYALPGKAAIGAYLAVEAIC 849
V+WG C NY LP ++ + A C
Sbjct: 845 VKWGEEENAHCVANYQLPPESQQQLLTQLSAEC 877


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
BPSL1631  OMADHESIN  50     3e-08    Yersinia outer membrane adhesin signature.
		>OMADHESIN#Yersinia outer membrane adhesin signature.

Length = 455

Score = 50.3 bits (119), Expect = 3e-08
Identities = 52/159 (32%), Positives = 79/159 (49%)

Query: 873 ATGNNASASGTSSTAGGANAIASGENSTANGANSTASGNGSSAFGESAAAAGDGSTALGS 932
A G NASA G S A GA A A+ + A GA S A+G S A G + A GD + G+
Sbjct: 60 AGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGA 119

Query: 933 NAVASGVGSVATGAGSVASGANSSAYGTGSNATGAGSVAIGQGATASGSNSVALGTGSVA 992
+ A G S + + + + ++A + ++ A+ S+A+G S
Sbjct: 120 ASTAQKDGVAIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKT 179

Query: 993 SEDNTVSVGSAGSERRITNVAAGVNATDAVNVGQLNSAV 1031
+N+VS+G R++T++AAG TDAVNV QL +
Sbjct: 180 DRENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQLKKEI 218



Score = 39.9 bits (92), Expect = 5e-05
Identities = 79/305 (25%), Positives = 124/305 (40%), Gaps = 13/305 (4%)

Query: 439 ASGDNATASGTNSTANGTNSTASGDNSTASGTNASASGENSTATGTDSTASGSNSTANGT 498
A G NA+A G +S A G + A+ + A G + A+G NS A G S A G ++ G
Sbjct: 60 AGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGA 119

Query: 499 NSTASGDNSTASGTNASATGENSTATGTDSTASGSNSTANGTNSTASGDN--STASGTNA 556
STA D A G AS T + A G +S A NS A G +S + ++ S A G +
Sbjct: 120 ASTAQKD-GVAIGARAS-TSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRS 177

Query: 557 SASGENSTATGTDSTASGSNSTANGTNSTASGDNSTASGTNASATGENSTATGTDSTASG 616
ENS + G +S A GT T + + A + ++T
Sbjct: 178 KTDRENSVSIGHESLNRQLTHLAAGTKDTDAVN---------VAQLKKEIEKTQENTNKR 228

Query: 617 SNSTANGTNSTASGDNSTASGTNASATGDNSTASGTNASATGENSTATGTDSTASGSNST 676
S N+ A +S+ G + T S + NA + + + SNS
Sbjct: 229 SAELLANANAYADNKSSSVLGIANNYTDSKSAETLENARKEAFAQSKDVLNMAKAHSNSV 288

Query: 677 ANGTNSTASGDNSTASGTNASATGENSTATGTDSTASGSNSTANGANSTASGDNSTASGT 736
A T TA ++ + T E++ ++ AS + + ++ T NS T
Sbjct: 289 ARTTLETAEEHANSVARTTLETAEEHANKKSAEALASANVYADSKSSHTLKTANSYTDVT 348

Query: 737 NASAT 741
+++T
Sbjct: 349 VSNST 353



Score = 39.5 bits (91), Expect = 6e-05
Identities = 81/333 (24%), Positives = 125/333 (37%), Gaps = 9/333 (2%)

Query: 444 ATASGTNSTANGTNSTASGDNSTASGTNASASGENSTATGTDSTASGSNSTANGTNSTAS 503
A A + N T S + A G A G +++A G +S A G + A+
Sbjct: 23 AFADDYDGIPNLTAVQISPNADPALGLEYPVRPPVPGAGGLNASAKGIHSIAIGATAEAA 82

Query: 504 GDNSTASGTNASATGENSTATGTDSTASGSNSTANGTNSTASGDNSTASGTNASASGENS 563
+ A G + ATG NS A G S A G ++ G STA D A G AS S +
Sbjct: 83 KGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKD-GVAIGARASTS-DTG 140

Query: 564 TATGTDSTASGSNSTANGTNSTASGDN--STASGTNASATGENSTATGTDSTASGSNSTA 621
A G +S A NS A G +S + ++ S A G + ENS + G +S A
Sbjct: 141 VAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHESLNRQLTHLA 200

Query: 622 NGTNST-----ASGDNSTASGTNASATGDNSTASGTNASATGENSTATGTDSTASGSNST 676
GT T A + + NA A ++S+ G + + S S
Sbjct: 201 AGTKDTDAVNVAQLKKEIEKTQENTNKRSAELLANANAYADNKSSSVLGIANNYTDSKSA 260

Query: 677 ANGTNSTASGDNSTASGTNASATGENSTATGTDSTASGSNSTANGANSTASGDNSTASGT 736
N+ + N + NS A T TA ++ + +++
Sbjct: 261 ETLENARKEAFAQSKDVLNMAKAHSNSVARTTLETAEEHANSVARTTLETAEEHANKKSA 320

Query: 737 NASATGENSTATGTDSTASGSNSTANGTNSTAS 769
A A+ + + T +NS + T S ++
Sbjct: 321 EALASANVYADSKSSHTLKTANSYTDVTVSNST 353



Score = 39.5 bits (91), Expect = 6e-05
Identities = 86/340 (25%), Positives = 130/340 (38%), Gaps = 13/340 (3%)

Query: 628 ASGDNSTASGTNASATGDNSTASGTNASATGENSTATGTDSTASGSNSTANGTNSTASGD 687
A + + T + + A G A G +++A G +S A G + A+
Sbjct: 25 ADDYDGIPNLTAVQISPNADPALGLEYPVRPPVPGAGGLNASAKGIHSIAIGATAEAAKG 84

Query: 688 NSTASGTNASATGENSTATGTDSTASGSNSTANGANSTASGDNSTASGTNASATGENSTA 747
+ A G + ATG NS A G S A G ++ GA STA D A G AS T + A
Sbjct: 85 AAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKD-GVAIGARAS-TSDTGVA 142

Query: 748 TGTDSTASGSNSTANGTNSTASGNN--STASGTNASATGENSTATGTDSAASGTNSTANG 805
G +S A NS A G +S + N+ S A G + ENS + G +S A G
Sbjct: 143 VGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHESLNRQLTHLAAG 202

Query: 806 TNSTASGDNSTASGTNASATGENSTATGTASTASGSNSTANGANSTASGAGATATGENAA 865
T T + + A + +T S AN+ A ++ G
Sbjct: 203 TKDTDAVN---------VAQLKKEIEKTQENTNKRSAELLANANAYADNKSSSVLGIANN 253

Query: 866 ATGAGATATGNNASASGTSSTAGGANAIASGENSTANGANSTASGNGSSAFGESAAAAGD 925
T + + T NA + + N + NS A TA + +S + A +
Sbjct: 254 YTDSKSAETLENARKEAFAQSKDVLNMAKAHSNSVARTTLETAEEHANSVARTTLETAEE 313

Query: 926 GSTALGSNAVASGVGSVATGAGSVASGANSSAYGTGSNAT 965
+ + A+AS + + ANS T SN+T
Sbjct: 314 HANKKSAEALASANVYADSKSSHTLKTANSYTDVTVSNST 353



Score = 37.2 bits (85), Expect = 4e-04
Identities = 102/425 (24%), Positives = 169/425 (39%), Gaps = 39/425 (9%)

Query: 691 ASGTNASATGENSTATGTDSTASGSNSTANGANSTASGDNSTASGTNASATGENSTATGT 750
A G NASA G +S A G + A+ + A GA S A+G NS A G + A G+++ G
Sbjct: 60 AGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGA 119

Query: 751 DSTASGSNSTANGTNSTASGNNSTASGTNASATGENSTATGTDSAASGTNSTANGTNSTA 810
STA ST+ + A G N+ A +NS A G S + AN S A
Sbjct: 120 ASTAQKDGVAIGARASTS--DTGVAVGFNSKADAKNSVAIGHSSHVA-----ANHGYSIA 172

Query: 811 SGDNSTASGTNASATGENSTATGTASTASGSNSTANGANSTASGAGATATGENAAATGAG 870
GD S N+ + G S A+G+ T + N T EN A
Sbjct: 173 IGDRSKTDRENSVSIGHESLNRQLTHLAAGTKDT-DAVNVAQLKKEIEKTQENTNKRSAE 231

Query: 871 ATATGNNASASGTSSTAGGANAIASGENSTANGANSTASGNGSSAFGESAAAAGDGSTAL 930
A N + + +SS G AN N +S ++ +A E+ A + D
Sbjct: 232 LLANANAYADNKSSSVLGIAN----------NYTDSKSAETLENARKEAFAQSKDVLNMA 281

Query: 931 GSNAVASGVGSVATGAGSVASGANSSAYGTGSNATGAGSVAIGQGATASGSNSVALGTGS 990
+++ + ++ T S A ++ +A + A+ + S S +
Sbjct: 282 KAHSNSVARTTLETAEEHANSVARTTLETAEEHANKKSAEALASANVYADSKSSHTLKTA 341

Query: 991 VASEDNTVSVGSAGSERRITNVAAGVNATDAVNVGQLNSAVSGIRNQMDGMQGQIDTLAR 1050
+ D TVS + + R + + N++D + ++D
Sbjct: 342 NSYTDVTVSNSTKKAIRESNQYT--------------DHKFRQLDNRLDKLDTRVD---- 383

Query: 1051 DAYSGIAAATALTMIPDVDPGKTLAVGIGTANFKGYQASALGATARITQNLKVKTGVSYS 1110
G+A++ AL + + G ++ QA A+G+ R+ +N+ +K GV+Y+
Sbjct: 384 ---KGLASSAALNSLFQPYGVGKVNFTAGVGGYRSSQALAIGSGYRVNENVALKAGVAYA 440

Query: 1111 GSNYV 1115
GS+ V
Sbjct: 441 GSSDV 445



Score = 35.3 bits (80), Expect = 0.001
Identities = 40/119 (33%), Positives = 53/119 (44%)

Query: 877 NASASGTSSTAGGANAIASGENSTANGANSTASGNGSSAFGESAAAAGDGSTALGSNAVA 936
+ SA+ S+ A A + N S N A G A G NA A
Sbjct: 8 SVSAALISALFSSPYAFADDYDGIPNLTAVQISPNADPALGLEYPVRPPVPGAGGLNASA 67

Query: 937 SGVGSVATGAGSVASGANSSAYGTGSNATGAGSVAIGQGATASGSNSVALGTGSVASED 995
G+ S+A GA + A+ + A G GS ATG SVAIG + A G ++V G S A +D
Sbjct: 68 KGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKD 126


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
BPSL1633  HTHFIS  44     2e-07    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 44.4 bits (105), Expect = 2e-07
Identities = 19/84 (22%), Positives = 38/84 (45%), Gaps = 6/84 (7%)

Query: 10 KVVVADDHPIVLRAVTDYVNSLPGFHVVASVSSGDALLSAMREQEVNLVVTDFTMHQAND 69
++VADD + + ++ G+ V S+ L + + +LVVTD M
Sbjct: 5 TILVADDDAAIRTVLNQALSRA-GYDVRI-TSNAATLWRWIAAGDGDLVVTDVVM----P 58

Query: 70 DKDGLRLISHLMRAYERTPIIVFT 93
D++ L+ + +A P++V +
Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMS 82


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
BPSL1634  HTHFIS  81     7e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 81.0 bits (200), Expect = 7e-18
Identities = 36/146 (24%), Positives = 60/146 (41%), Gaps = 1/146 (0%)

Query: 854 TVLIAEDNLLNRSLLLDQLTTLGVRVIEAKNGEEALALLLKEPVDVVMTDIDMPMMDGFQ 913
T+L+A+D+ R++L L+ G V N + D+V+TD+ MP + F
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 914 LLAEMRRLGMTMPVYAVSASARPEDVAEGRARGFTDYLAKPVSLERLETVVRACCSAP-A 972
LL +++ +PV +SA + +G DYL KP L L ++ + P
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 973 GARADEDAQDELPGLPDVPPAYASAF 998
ED + L A +
Sbjct: 125 RPSKLEDDSQDGMPLVGRSAAMQEIY 150


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL1655  ECOLNEIPORIN  92     4e-23    E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 92.2 bits (229), Expect = 4e-23
Identities = 74/338 (21%), Positives = 117/338 (34%), Gaps = 52/338 (15%)

Query: 10 ATAQSSVTLYGVIDEGIDYVNNSGGQHLW--RMRDGTYDGMYGSRWGLKGSEDLGGGLSA 67
A + VTLYG I G++ + + GT GS+ G KG EDLG GL A
Sbjct: 15 VAAMADVTLYGTIKAGVETSRSVAHNGAQAASVETGTGIVDLGSKIGFKGQEDLGNGLKA 74

Query: 68 LFKLEAGFSLENGQMRQGGREFGRQAYVGLSKTDLGTVTFGRQYDSVVDF--VQPVTAVG 125
++++E S+ G RQ+++GL K G + GR + D + P +
Sbjct: 75 IWQVEQKASIAGTDSGWG----NRQSFIGL-KGGFGKLRVGRLNSVLKDTGDINPWDSKS 129

Query: 126 QFGGPFVRGGDIDNTDNSFRVDNSIKYASPSFGGFTFGGMYSFTNSNAPGLGTTGMWSLG 185
+ G I + S++Y SP F G + Y+ ++ + + G
Sbjct: 130 DYLG----VNKIAEPEARL---ISVRYDSPEFAGLSGSVQYALNDNAGRHNSES--YHAG 180

Query: 186 AAYSHGGFNAGAAYFYAKNPAARFTDGNFIGNTTGAAIGASGPFSYVGAPRNERIMGIGA 245
Y +GGF Y ++ + + I + + A
Sbjct: 181 FNYKNGGFFVQYGGAYKRH--HQVQENVNIEKYQIHRLVSG--------------YDNDA 224

Query: 246 DYAFGSATAGIDYTNTKFDDANGTTSSVTFSNYEVWGQY-----KVTPAATLGAAYVYTD 300
YA + A ++ S EV VTP +Y +
Sbjct: 225 LYA---SVAVQQQDAKLVEENYSHN-----SQTEVAATLAYRFGNVTPR----VSYAHGF 272

Query: 301 GK-VNYNGARPKYHQVSLMGSYSVSKRTSFYAMAGFQQ 337
+ Y QV + Y SKRTS AG+ Q
Sbjct: 273 KGSFDATNYNNDYDQVVVGAEYDFSKRTSALVSAGWLQ 310


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
BPSL1659  OMPADOMAIN  111    1e-30    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 111 bits (280), Expect = 1e-30
Identities = 59/180 (32%), Positives = 89/180 (49%), Gaps = 12/180 (6%)

Query: 123 QYQVRF--LGGLAYRGYWADSACRDIAARYADAAGLGVIAVAPCNPSDVAAPLPERVELP 180
Q+ + R ++ R+ V+A AP +V L
Sbjct: 163 QWTNNIGDAHTIGTRP-DNGMLSLGVSYRFGQGEAAPVVAPAPAPAPEVQTK---HFTLK 218

Query: 181 TDTLFAFDKGGFEDISADGRRQLGDLVASIKAKIFSINHLIVTGYTDRLGSDEHNARLSS 240
+D LF F+K + +G+ L L + + ++V GYTDR+GSD +N LS
Sbjct: 219 SDVLFNFNK---ATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLSE 275

Query: 241 ERARTVADYMIAEGIPAAKITAVGRGAADPVV--VCNNGEQ-PELIRCLQKNRRVEIRIK 297
RA++V DY+I++GIPA KI+A G G ++PV C+N +Q LI CL +RRVEI +K
Sbjct: 276 RRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEVK 335


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
BPSL1661  RTXTOXINA  48     9e-07    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 47.7 bits (113), Expect = 9e-07
Identities = 24/78 (30%), Positives = 38/78 (48%)

Query: 2983 AGADTVTGSSGRDMLNGGAGNDTIVGNGGVDVLEGGGGNDMLVVNGDNIAHFTTPGSYWD 3042
G DT++G +G D L GG GND ++G G + L GG G+D V G+++A G +
Sbjct: 762 KGNDTLSGGNGDDQLYGGDGNDKLIGVAGNNYLNGGDGDDEFQVQGNSLAKNVLFGGKGN 821

Query: 3043 GGIMMGGEINTLQFDANN 3060
+ + L +
Sbjct: 822 DKLYGSEGADLLDGGEGD 839



Score = 45.3 bits (107), Expect = 4e-06
Identities = 28/77 (36%), Positives = 36/77 (46%), Gaps = 8/77 (10%)

Query: 2984 GADTVTGSSGRDMLNGGAGNDTIVGNGGVDVLEGGGGNDML--------VVNGDNIAHFT 3035
G D + G+ G D L G GNDT+ G G D L GG GND L + GD F
Sbjct: 745 GDDLIEGNDGNDRLYGDKGNDTLSGGNGDDQLYGGDGNDKLIGVAGNNYLNGGDGDDEFQ 804

Query: 3036 TPGSYWDGGIMMGGEIN 3052
G+ ++ GG+ N
Sbjct: 805 VQGNSLAKNVLFGGKGN 821



Score = 39.6 bits (92), Expect = 2e-04
Identities = 22/60 (36%), Positives = 30/60 (50%), Gaps = 1/60 (1%)

Query: 2986 DTVTGSSGRDMLNGGAGNDTIVGNGGVDVLEGGGGNDMLVVNGDNIAHFTTPG-SYWDGG 3044
D G+ G D++ G GND + G+ G D L GG G+D L N G +Y +GG
Sbjct: 738 DIFHGADGDDLIEGNDGNDRLYGDKGNDTLSGGNGDDQLYGGDGNDKLIGVAGNNYLNGG 797



Score = 38.4 bits (89), Expect = 5e-04
Identities = 20/43 (46%), Positives = 23/43 (53%)

Query: 2982 TAGADTVTGSSGRDMLNGGAGNDTIVGNGGVDVLEGGGGNDML 3024
T AD GS D+ +G G+D I GN G D L G GND L
Sbjct: 725 TTRADKFFGSKFTDIFHGADGDDLIEGNDGNDRLYGDKGNDTL 767


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
BPSL1665  RTXTOXIND  274    5e-89    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 274 bits (701), Expect = 5e-89
Identities = 94/439 (21%), Positives = 204/439 (46%), Gaps = 14/439 (3%)

Query: 43 SALGLEEASIAPARRAAALIPTVMLALLIVLVLWATFFKIDIIAAGQGKVIPSTTVQQLS 102
+ L L E ++ R A ++ L++ + + +++I+A GK+ S +++
Sbjct: 44 AHLELIETPVSRRPRLVAYF---IMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIK 100

Query: 103 TLEGGIVRELLVREGQIVKKGQPLVRLDPVVAQGAVTEQAATREGLMASIARLQAEADGK 162
+E IV+E++V+EG+ V+KG L++L + A+ + ++ R Q +
Sbjct: 101 PIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSI 160

Query: 163 ----------ATPLYPAGLKPEIVSEEEHVRAQRAEALNSTIEVLQQQRAAKQAEAADYR 212
Y + E V + ++ + + K+AE
Sbjct: 161 ELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVL 220

Query: 213 GRIPQYVNNQHLLDDQIQRMLPLVGVGSVAPNEITNLQRERGNLAAQIITTREGAAQASA 272
RI +Y N + ++ L+ ++A + + + + ++ + Q +
Sbjct: 221 ARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIES 280

Query: 273 QIAEASHKIEEKISTFRSEAREELARKQVQLQALEGTLSGKQDILDRTLIRSPVNGIVKT 332
+I A + + F++E ++L + + L L+ ++ ++IR+PV+ V+
Sbjct: 281 EILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQ 340

Query: 333 LYITTIGGVASPGKSVIDIVPTNDSLLIEARIQPQDIAYIRVGDDAKVRITAFDSGALGS 392
L + T GGV + ++++ IVP +D+L + A +Q +DI +I VG +A +++ AF G
Sbjct: 341 LKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGY 400

Query: 393 LDAKVELISPDSQADERSGSLYYKVQVRTHSSVVATQVGDLNILPGMVADVDVITGRRTI 452
L KV+ I+ D+ D+R G L + V + + ++T ++ + GM ++ TG R++
Sbjct: 401 LVGKVKNINLDAIEDQRLG-LVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSV 459

Query: 453 MSYILRPIVRGMSRAMSER 471
+SY+L P+ ++ ++ ER
Sbjct: 460 ISYLLSPLEESVTESLRER 478


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL1666  SYCDCHAPRONE  33     0.005    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 33.0 bits (75), Expect = 0.005
Identities = 21/126 (16%), Positives = 45/126 (35%), Gaps = 3/126 (2%)

Query: 898 LAPDDADAVLLRAELALDTGDFDEALSQFERLREQRPDAPESYANLIPALAALERRDDAI 957
++ D + + A +G +++A F+ L + L A+ + D AI
Sbjct: 31 ISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAI 90

Query: 958 AALQRALELNSKHPGALNNGVQFYLRTQQYDKA---MELAQRYVGAHGELASAHTMCGLV 1014
+ ++ K P + + L+ + +A + LAQ + E T +
Sbjct: 91 HSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTEFKELSTRVSSM 150

Query: 1015 YHNLKA 1020
+K
Sbjct: 151 LEAIKL 156


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
BPSL1669  HTHFIS  75     8e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 75.3 bits (185), Expect = 8e-18
Identities = 37/163 (22%), Positives = 63/163 (38%), Gaps = 13/163 (7%)

Query: 3 IYLIEDDEIQAQYYQSMLVEHGWQVKLLLDGERAFREIQRMPPDLIILDRRLPDLDGLEV 62
I + +DD L G+ V++ + +R I DL++ D +PD + ++
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 63 LMWVRKNYSNIPVLILTNAILESEVVAALEAGADDYVIKPPRKQEFVARVKALYRRATET 122
L ++K ++PVL+++ + A E GA DY+ KP E + + RA
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELI----GIIGRALAE 121

Query: 123 RTLSELIEIGPYRIQTSEKVVYFHHEAITLSPKEYEIIELLAR 165
R E + S EI +LAR
Sbjct: 122 PKR---------RPSKLEDDSQDGMPLVGRSAAMQEIYRVLAR 155


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL1674  ECOLNEIPORIN  92     4e-23    E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 92.2 bits (229), Expect = 4e-23
Identities = 92/377 (24%), Positives = 136/377 (36%), Gaps = 57/377 (15%)

Query: 1 MKKLLIALPLAAAATTHAQSSVTLYGVLEDGVDYVSNVQGKHL----VQLASGV-TAGSR 55
MKK LIAL LAA A + VTLYG ++ GV+ +V V+ +G+ GS+
Sbjct: 1 MKKSLIALTLAALPVA-AMADVTLYGTIKAGVETSRSVAHNGAQAASVETGTGIVDLGSK 59

Query: 56 WGVRGTEDLGGGLSAIFRLESGFDINSGRLGSGLAFSRNAYVGVGDAKLGTLTLGRQWDS 115
G +G EDLG GL AI+++E I G G +R +++G+ G L +GR
Sbjct: 60 IGFKGQEDLGNGLKAIWQVEQKASIAGTDSGWG---NRQSFIGLK-GGFGKLRVGRLNSV 115

Query: 116 IVDY--VEPFTLNGNI-GGYYFAHPNDMDNTDNGFPISNAVKYRSPTIAGFTFGGLYAFG 172
+ D + P+ + G A P IS V+Y SP AG + YA
Sbjct: 116 LKDTGDINPWDSKSDYLGVNKIAEPEA-------RLIS--VRYDSPEFAGLSGSVQYALN 166

Query: 173 GQPGRFSDNATFSVGANYAAGPVGFGIGYLRINNPGVSTQGYQNYPGFTNAVYGNYLDAA 232
GR ++ ++ G NY G G Y+ + V
Sbjct: 167 DNAGR-HNSESYHAGFNYKNGGFFVQYGGA-----------YKRHHQVQENVNIEKYQIH 214

Query: 233 RAQKVFGVGASYQVV---QWLKLLADFTNTNFQQGSAGHDATFQNYELSALVKPTPAVTI 289
R + A Y V Q L + ++ Q AT + + + A
Sbjct: 215 RLVSGYDNDALYASVAVQQQDAKLVEENYSHNSQTEVA--ATLAYRFGNVTPRVSYAHGF 272

Query: 290 GAGYTYTTGRDHATNAEPKYHQFNLSVEYALSKRTSVYAMGAFQKAAGDAPVAQIAGFNP 349
+ T + Y Q + EY SKRTS + +
Sbjct: 273 KGSFDATNYNND-------YDQVVVGAEYDFSKRTSALVSAGWLQEG-----------KG 314

Query: 350 SGNQKQAVGRAGIRHVF 366
G G+RH F
Sbjct: 315 ESKFVSTAGGVGLRHKF 331


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL1686  PF08280  28     0.020    M protein trans-acting positive regulator
		>PF08280#M protein trans-acting positive regulator

Length = 530

Score = 28.3 bits (63), Expect = 0.020
Identities = 23/95 (24%), Positives = 39/95 (41%), Gaps = 17/95 (17%)

Query: 21 RSFLSELTRHLRG--FLRKRIPQFDADIEDLVQEILLAVHNARHTYRADEPLTAWVHAIA 78
SFLS + HL+ +L + +D ILLA+ RH + P T +
Sbjct: 209 HSFLSHSSTHLKTSPWLSESFSFYD---------ILLALSWKRHQFSVTIPQTRIFQQLK 259

Query: 79 RYKLMDFFRTRARREALHDPLDDHTDI-FSEPDDD 112
+ + D + +R D ++ + + FS D D
Sbjct: 260 KLFVYDSLKKSSR-----DIIETYCQLNFSAGDLD 289


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL1702  FLGFLIH  31     0.002    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 31.3 bits (70), Expect = 0.002
Identities = 28/111 (25%), Positives = 47/111 (42%), Gaps = 6/111 (5%)

Query: 76 GLVQQLSLREIQFESLTEAMTTNSSSGMLVFHMMAALAQFERSLISERTCAGMAAARARG 135
GL Q L + E+ ++ ++ LV L + + S + AAR
Sbjct: 75 GLAQ--GLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRLMQMALEAAR--- 129

Query: 136 QILGRRPALNEKQRAQALKLLLTQ-PIKCVAKQFNVHPRTLQRLQKAHQAT 185
Q++G+ P ++ + ++ LL Q P+ Q VHP LQR+ AT
Sbjct: 130 QVIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGAT 180


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
BPSL1705  OMADHESIN  49     1e-07    Yersinia outer membrane adhesin signature.
		>OMADHESIN#Yersinia outer membrane adhesin signature.

Length = 455

Score = 49.1 bits (116), Expect = 1e-07
Identities = 55/153 (35%), Positives = 77/153 (50%), Gaps = 14/153 (9%)

Query: 1382 GPGADASGSNSTAVGGAASASGANATALGQASNASGNNSTALGQASSASGSGSTAVGQGA 1441
G A A G +S A+G A A+ A A+G S A+G NS A+G S A G + G +
Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAAS 121

Query: 1442 SASGDGSS------------AFGQGAIASGTNSTALG--AHSTASAPNSVAIGANSVASA 1487
+A DG + A G + A NS A+G +H A+ S+AIG S
Sbjct: 122 TAQKDGVAIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDR 181

Query: 1488 PNTVSFGSQGHERRLTNVAPGMDGTDAANMSQL 1520
N+VS G + R+LT++A G TDA N++QL
Sbjct: 182 ENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQL 214



Score = 44.1 bits (103), Expect = 4e-06
Identities = 54/133 (40%), Positives = 75/133 (56%), Gaps = 4/133 (3%)

Query: 1088 GPAATASGASGIAIGDTANAAATGAVAIGQTAVATGGQAVSIGVANTASGDGAVAIGDPN 1147
G A+A G IAIG TA AA AVA+G ++ATG +V+IG + A GD AV G +
Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAAS 121

Query: 1148 VATGTGAVALGANNSANGQGAVALGNANVATGTGSLALGSTSTAAG--GGSIALGTNAIA 1205
A G VA+GA S + G VA+G + A S+A+G +S A G SIA+G +
Sbjct: 122 TAQKDG-VAIGARASTSDTG-VAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKT 179

Query: 1206 NNANDVALGSGSV 1218
+ N V++G S+
Sbjct: 180 DRENSVSIGHESL 192



Score = 39.9 bits (92), Expect = 8e-05
Identities = 80/300 (26%), Positives = 134/300 (44%), Gaps = 32/300 (10%)

Query: 319 GGQSQAASAGAIAIGQSALATGGQAVSVGVGNTANGNGAVAIGDPNVATGTGAVALGANN 378
G + A +IAIG +A A G AV+VG G+ A G +VAIG + A G AV GA +
Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAAS 121

Query: 379 TATGQGAVALGNADIATGQGSVALGNVSTAAGAGSVAFGSNAVANNTNDVALGSGSVTAA 438
TA G VA+G ++ + G VA G N+ A+ N VA+G S AA
Sbjct: 122 TAQKDG---------------VAIGARASTSDTG-VAVGFNSKADAKNSVAIGHSSHVAA 165

Query: 439 PNPTGSATIGGTTYSFEGTNPTSVVSVGAVGAERQITNVAAGQLTATSTDAVNGSQL--- 495
N S IG + T+ + VS+G RQ+T++AAG TDAVN +QL
Sbjct: 166 -NHGYSIAIGDRS----KTDRENSVSIGHESLNRQLTHLAAG---TKDTDAVNVAQLKKE 217

Query: 496 -----YSTNQAINTLSTSTSTGLSSANSSIASLSTGLASSGNLASLSTSTSTGLSSANSS 550
+TN+ L + + + +SS+ ++ S + +L + + +
Sbjct: 218 IEKTQENTNKRSAELLANANAYADNKSSSVLGIANNYTDSKSAETLENARKEAFAQSKDV 277

Query: 551 IASLSTSTSTGLSTTNSNIGSLSTGLSTTNSTVASLSTSTSTGLSSATSSITSLSTSTSS 610
+ +++ TT + ++ T A + + + A++++ + S S+ +
Sbjct: 278 LNMAKAHSNSVARTTLETAEEHANSVARTTLETAEEHANKKSAEALASANVYADSKSSHT 337



Score = 38.0 bits (87), Expect = 3e-04
Identities = 45/130 (34%), Positives = 72/130 (55%), Gaps = 4/130 (3%)

Query: 308 AQALASNAIAIGGQSQAASAGAIAIGQSALATGGQAVSVGVGNTANGNGAVAIGDPNVAT 367
A A ++IAIG ++AA A+A+G ++ATG +V++G + A G+ AV G + A
Sbjct: 65 ASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQ 124

Query: 368 GTGAVALGANNTATGQGAVALGNADIATGQGSVALGNVSTAAG--AGSVAFGSNAVANNT 425
G VA+GA + + G VA+G A + SVA+G+ S A S+A G + +
Sbjct: 125 KDG-VAIGARASTSDTG-VAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRE 182

Query: 426 NDVALGSGSV 435
N V++G S+
Sbjct: 183 NSVSIGHESL 192



Score = 35.6 bits (81), Expect = 0.002
Identities = 32/111 (28%), Positives = 56/111 (50%), Gaps = 1/111 (0%)

Query: 241 AQATGSDSIAMGSEAAASSSSTTAIGQYATASNTNATALGAGGTSAATGVIASGAGAVAL 300
A A G SIA+G+ A A+ + A+G + A+ N+ A+G + + GA + A
Sbjct: 65 ASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQ 124

Query: 301 GGNSTQGAQALASN-AIAIGGQSQAASAGAIAIGQSALATGGQAVSVGVGN 350
GA+A S+ +A+G S+A + ++AIG S+ S+ +G+
Sbjct: 125 KDGVAIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGD 175


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL1716  ARGDEIMINASE  29     0.039    Bacterial arginine deiminase signature.
		>ARGDEIMINASE#Bacterial arginine deiminase signature.

Length = 409

Score = 28.6 bits (64), Expect = 0.039
Identities = 11/48 (22%), Positives = 18/48 (37%), Gaps = 2/48 (4%)

Query: 263 PTSGAAFMVAEWLRAQRDDGRTIVFIAPDEGHRYADTVYDDAWLRGQG 310
+G + R Q +DG ++ IAP E Y+ + G
Sbjct: 334 KCAGGDLIHGA--REQWNDGANVLAIAPGEIIAYSRNHVTNKLFEENG 379


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL1728  ECOLNEIPORIN  93     3e-23    E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 92.9 bits (231), Expect = 3e-23
Identities = 90/380 (23%), Positives = 146/380 (38%), Gaps = 64/380 (16%)

Query: 32 ASTAHAQSSVVLYGLIDTSITYANNQRTHGAGSPGSPGWAVTSGALNASRWGLRGREDLG 91
A A + V LYG I + + + +GA + T S+ G +G+EDLG
Sbjct: 12 ALPVAAMADVTLYGTIKAGVETSRSVAHNGAQAASVE--TGTGIVDLGSKIGFKGQEDLG 69

Query: 92 DGVSAIFALENGFSGASGALSQKGVDMFGRQAWIGLKSKEGGALTLGRQYDLILDF--VT 149
+G+ AI+ +E AS A + G RQ++IGLK G L +GR ++ D +
Sbjct: 70 NGLKAIWQVE---QKASIAGTDSG--WGNRQSFIGLK-GGFGKLRVGRLNSVLKDTGDIN 123

Query: 150 PLGASGPGWGGNLAVHPYDNDDSNRNIRINNAVKYTSPTYRGWTLGAMYGFSNTAGPFGN 209
P + G N P R I +V+Y SP + G + Y ++ AG N
Sbjct: 124 PWDSKSDYLGVNKIAEP-----EARLI----SVRYDSPEFAGLSGSVQYALNDNAG-RHN 173

Query: 210 NAAWSAGLSYANGPLKLGAGYLRINRNPNAANANGALSTTDGSATITGGSQQIWAVAGRY 269
+ ++ AG +Y NG + G + QI + Y
Sbjct: 174 SESYHAGFNYKNGGFFVQYGGAYKRH-------------HQVQENVNIEKYQIHRLVSGY 220

Query: 270 -AFGPHSIGAAWSHSATDRVSGVLQGGSIAKLDGNSLVFDNFTLDGRY-VVTPRLSLAAA 327
++ A A L + + + TL R+ VTPR+S A
Sbjct: 221 DNDALYASVAVQQQDAK------LVEENYSHNSQTEVA---ATLAYRFGNVTPRVSYAHG 271

Query: 328 YTYTMGRFDARSGETRPKWNHMVAQADYAFSIRTDAYLAAVYQRVSGGNGIPAFNATIWT 387
+ + + + ++ +V A+Y FS RT A ++A + + G G F +T
Sbjct: 272 FKGSFDATNYNN-----DYDQVVVGAEYDFSKRTSALVSAGWLQ--EGKGESKFVSTA-- 322

Query: 388 LTPSANGNQVVVALGLRHRF 407
+GLRH+F
Sbjct: 323 -----------GGVGLRHKF 331


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
BPSL1732  IGASERPTASE  32     0.008    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 32.0 bits (72), Expect = 0.008
Identities = 23/139 (16%), Positives = 42/139 (30%), Gaps = 5/139 (3%)

Query: 436 SALAGEAGKTMTEVTQAVARVTDIMGEIAAASGEQSRGIEQVNQAIAQMDEVTQQNAALV 495
S + + ++ V + E A + E ++ + +A Q +EV Q +
Sbjct: 1034 SETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETK 1093

Query: 496 EEAAAASKSLEEQGRHLTQAVSFFRASAASAAPQARHAAQAKPKAKRGVAAPAPAPRAAH 555
E +K + + + +Q PK ++ A A
Sbjct: 1094 ETQTTETKETATVEKEEKA-----KVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARE 1148

Query: 556 AAPTFNKPAPALAAAATAS 574
PT N P TA
Sbjct: 1149 NDPTVNIKEPQSQTNTTAD 1167


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL1735  TCRTETB  92     3e-22    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 92.3 bits (229), Expect = 3e-22
Identities = 74/403 (18%), Positives = 153/403 (37%), Gaps = 16/403 (3%)

Query: 18 FMQNLDSTVVATALPSMARELGVNVVFLSSAITSYLVALTVFIPVSGWIAERFGAKRVFI 77
F L+ V+ +LP +A + + T++++ ++ V G ++++ G KR+ +
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 78 AAIAIFTAASVMCAAANGLAT-LVAARILQGAGGALMVPVGRLILYRGVSRHEMLAATTW 136
I I SV+ + + L+ AR +QGAG A + +++ R + + A
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 137 LTMPALVGPLLGPPLGGFLTDALSWRAVFWINVPVGVAGAALAARLVPASAGERRAPADA 196
+ +G +GP +GG + + W + + +P+ + + D
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKEVRIKGHFDI 201

Query: 197 RGMLLVGAALAALMLGVETAGRDVLPAGAPALCLGAGVALGGLTIRHCRRVAHPVVDLSL 256
+G++L+ + ML + L + + ++H R+V P VD L
Sbjct: 202 KGIILMSVGIVFFMLFTTSYSISFLIVSVLSF---------LIFVKHIRKVTDPFVDPGL 252

Query: 257 L-GIPTFHAATIAGSLFRAGAGALPFLVPLTLQVGFGASASRSGAITLASA-LGSLVMRP 314
IP G +F AG + +VP ++ S + G++ + + ++
Sbjct: 253 GKNIPFMIGVLCGGIIFGTVAGFV-SMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGY 311

Query: 315 MTHAALHRAPMRTVLIAGSVSFAAVLAACATLSPAWPDAAVFALLLVGGLSRSLSFASLG 374
+ + R VL G + + L ++ V G S + +
Sbjct: 312 IGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLG-GLSFTKTVIS 370

Query: 375 ALVFSDVPSERLSAATSFQGTAQQLMRAVGVAVAAGALHLAML 417
+V S + + A S L G+A+ G L + +L
Sbjct: 371 TIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLL 413


28  BPSL1748  BPSL1790  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
BPSL1748  0       11       3.300664   ornithine carbamoyltransferase, catabolic
BPSL1750  1       11       3.815160   carbamate kinase
BPSL1751  1       11       4.661891   putative short-chain dehydrogenase
BPSL1752  1       13       5.341910   conserved hypothetical protein
BPSL1753  2       13       6.142334   putative LysR-family transcriptional regulator
BPSL1754  2       14       6.470726   putative glutathione S-transferase
BPSL1755  3       16       7.557385   putative MarR-family transcriptional regulator
BPSL1756  1       15       7.338943   putative amino-acid transport-related exported
BPSL1757  3       14       7.380619   putative MarR-family regulatory protein
BPSL1758  1       13       6.450976   putative transport-related membrane protein
BPSL1759  0       14       5.631999   putative lipoprotein
BPSL1760  1       14       4.184608   precorrin-4 C11-methyltransferase
BPSL1761  1       12       4.607937   putative precorrin-6X reductase
BPSL1762  0       12       5.575904   putative cobalamin biosynthesis-related protein
BPSL1763  -1      9        4.369096   precorrin-6Y C5,15-methyltransferase
BPSL1764  -1      9        4.678388   putative oxidoreductase
BPSL1765  0       10       4.031612   precorrin-8X methylmutase
BPSL1766  0       10       4.569364   precorrin-2 C20-methyltransferase
BPSL1767  0       11       3.791458   precorrin-3b C17-methyltransferase
BPSL1768  1       10       3.751516   putative exported chitinase
BPSL1769  2       12       2.745873   putative exported protein
BPSL1770  -1      11       1.847645   putative carboxylesterase
BPSL1771  -1      11       2.643226   conserved hypothetical protein
BPSL1772  1       12       4.675565   putative magnesium chelatase protein
BPSL1773  2       13       6.284460   putative cobalamin biosynthesis-related protein
BPSL1774  2       15       5.964100   putative cobalamin biosynthesis-related protein
BPSL1775  1       14       6.064580   high-affinity nickel transport protein
BPSL1776  2       13       6.790027   putative cobalamin biosynthesis related protein
BPSL1777  2       13       6.974831   cobalamin adenosyltransferase
BPSL1778  2       13       7.093217   cobyrinic acid A,C-diamide synthase
BPSL1779  2       12       5.965924   putative siderophore biosynthesis related
BPSL1780  4       12       6.432834   putative iron uptake receptor precursor
BPSL1781  3       13       6.806087   putative L-ornithine 5-monooxygenase
BPSL1782  2       13       6.611243   putative siderophore-related non-ribosomal
BPSL1783  3       15       6.730956   putative siderophore related non-ribosomal
BPSL1784  2       15       4.726406   putative siderophore biosynthesis related ABC
BPSL1785  3       17       3.406952   conserved hypothetical protein
BPSL1786  4       17       3.635319   putative iron transport-related exported
BPSL1787  4       25       3.079705   putative iron transport-related membrane
BPSL1789  5       23       2.476368   putative iron transport-related membrane
BPSL1788  3       21       1.603776   putative iron transport-related ATP-binding
BPSL1790  3       20       1.137733   conserved hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
BPSL1755  LCRVANTIGEN  30     0.008    Low calcium response V antigen signature.
		>LCRVANTIGEN#Low calcium response V antigen signature.

Length = 326

Score = 30.0 bits (67), Expect = 0.008
Identities = 16/58 (27%), Positives = 24/58 (41%), Gaps = 5/58 (8%)

Query: 46 RAELVVNTAELDLDEIVALLARAHGKGQDVARVHSG-----DPSLYGAIGEQIRRLAA 98
R EL TAEL + ++ H +H D +LYG E+I + +A
Sbjct: 154 REELAELTAELKIYSVIQAEINKHLSSSGTINIHDKSINLMDKNLYGYTDEEIFKASA 211


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
BPSL1758  OMADHESIN  29     0.027    Yersinia outer membrane adhesin signature.
		>OMADHESIN#Yersinia outer membrane adhesin signature.

Length = 455

Score = 29.5 bits (65), Expect = 0.027
Identities = 25/63 (39%), Positives = 28/63 (44%)

Query: 147 ADGATPAAIAGALVARGFGPSAMSVFEHLGGPLERRLDARADAWRDARAAALNVVAIECR 206
A GAT A GA VA G G A V GPL + L A + A A + VAI R
Sbjct: 74 AIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKDGVAIGAR 133

Query: 207 ACA 209
A
Sbjct: 134 AST 136


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
BPSL1767  HTHFIS  43     1e-06    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 42.9 bits (101), Expect = 1e-06
Identities = 39/171 (22%), Positives = 63/171 (36%), Gaps = 14/171 (8%)

Query: 3 AAYPFSALIGQ-AALQQALLLVA-VDPGLGGVLVSGPRGTAKSTAARALAELLP--EGRF 58
+ L+G+ AA+Q+ ++A + ++++G GT K ARAL + G F
Sbjct: 132 DSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPF 191

Query: 59 VTLPLSASDEQVTGSLDLASALADNT--VRFSPGLVARAHLGVLYVDEINLLPDALVDAL 116
V + ++A + S T S G +A G L++DEI +P L
Sbjct: 192 VAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRL 251

Query: 117 LDAAASGVNTVERDGVSHSHAARFALVGTMNP------EEGELRPQLLDRF 161
L G G + +V N +G R L R
Sbjct: 252 LRVLQQG--EYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRL 300


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL1781  FERRIBNDNGPP  116    2e-32    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 116 bits (292), Expect = 2e-32
Identities = 78/264 (29%), Positives = 113/264 (42%), Gaps = 15/264 (5%)

Query: 59 PARIVVLEFMFAEDLAALDITPVGMADPAYYPIWIGYDDARFARVSDVGTRQEPSLEAIA 118
P RIV LE++ E L AL I P G+AD Y +W+ + V DVG R EP+LE +
Sbjct: 35 PNRIVALEWLPVELLLALGIVPYGVADTINYRLWVS-EPPLPDSVIDVGLRTEPNLELLT 93

Query: 119 AAKPDLILGVGLRHAPIFDALSRIAPTVLFKYSPNYIEDGRQVTQYDWARAILRTIGCLT 178
KP ++ + P + L+RIAP F +S DG+Q AR L + L
Sbjct: 94 EMKPSFMVW-SAGYGPSPEMLARIAPGRGFNFS-----DGKQ--PLAMARKSLTEMADLL 145

Query: 179 GRARDARAVQARVDAGLARDARRIAAAGRAGERVAWLQELGLPDRYWAFTGNSASAGIAR 238
A A+ + + R + G R L L P F NS I
Sbjct: 146 NLQSAAETHLAQYEDFIRSMKPRFV---KRGARPLLLTTLIDPRHMLVFGPNSLFQEILD 202

Query: 239 ALGLE-PWPGEPTREGTAYVTSEDLLKQPDLAVLFVSATEPGVPLDAKLDSSIWRFVPAR 297
G+ W GE G+ V+ + L D+ VL +DA + + +W+ +P
Sbjct: 203 EYGIPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNSKD-MDALMATPLWQAMPFV 261

Query: 298 RAGRVALVERNIWGFGGPMSALRL 321
RAGR V +W +G +SA+
Sbjct: 262 RAGRFQRVP-AVWFYGATLSAMHF 284


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL1782  2FE2SRDCTASE  57     6e-12    Ferric iron reductase signature.
		>2FE2SRDCTASE#Ferric iron reductase signature.

Length = 262

Score = 57.4 bits (138), Expect = 6e-12
Identities = 51/186 (27%), Positives = 73/186 (39%), Gaps = 24/186 (12%)

Query: 78 RALVSQWSKYYFNLAASAGFAAALLLGRPLDMAPQRMRVALRGGMPVALLFEADALRPAQ 137
+ L+S W+++Y L A L + LD++P+ VA F D
Sbjct: 89 KPLISLWAQWYIGLMVPPLMLALLTQEKALDVSPEHFHAEFHETGRVA-CFWVDVCEDKN 147

Query: 138 AEPAS---RYAALVDH-LRATIDTLAALAKLSPRVLWANAGNLLD-YLFEQCAHAPRAGA 192
A P S R L+ L + L A +++ +++W+N G L++ YL E G
Sbjct: 148 ATPHSPQHRMETLISQALVPVVQALEATGEINGKLIWSNTGYLINWYLTEM---KQLLGE 204

Query: 193 DA------AWLFGPVDSRGEANPLRLPVRRVKPCSARLPDPFRARRVCCLRNEIPGEDQL 246
A F + GE NPL V L D RR CC R +P Q
Sbjct: 205 ATVESLRHALFFEKTLTNGEDNPLWRTV--------VLRDGLLVRRTCCQRYRLPDVQQ- 255

Query: 247 CGSCPL 252
CG C L
Sbjct: 256 CGDCTL 261


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL1784  PF05272  28     0.041    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.5 bits (63), Expect = 0.041
Identities = 12/23 (52%), Positives = 13/23 (56%)

Query: 36 VTALCGPNGCGKSTLLRTLAGLQ 58
L G G GKSTL+ TL GL
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLD 620


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL1789  DHBDHDRGNASE  122    4e-36    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 122 bits (308), Expect = 4e-36
Identities = 81/252 (32%), Positives = 118/252 (46%), Gaps = 15/252 (5%)

Query: 9 GRSFLVTGASSGIGRAAAVALRGGGARVVAAARNARELERLAHETGC-----EPLELDVG 63
G+ +TGA+ GIG A A L GA + A N +LE++ E DV
Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVR 67

Query: 64 CDASVRAALSG-ERMRDAFDGLINCAGVTSLAAAIDTTADEFDRVMAVNARGAMLVARHV 122
A++ + ER D L+N AGV + +E++ +VN+ G +R V
Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 123 ARAMIRAGRGGSIVNVSSQAALVALPSHLAYCASKAALDAMTRVLCVELGPHGIRVNSVN 182
++ M R GSIV V S A V S AY +SKAA T+ L +EL + IR N V+
Sbjct: 128 SKYM-MDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186

Query: 183 PTVTLTPMAERAWSDPHASGPMLA--------AIPLGRFARVADVVAPILFLSSDAAAMV 234
P T T M W+D + + ++ IPL + A+ +D+ +LFL S A +
Sbjct: 187 PGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHI 246

Query: 235 SGVALPVDGGYT 246
+ L VDGG T
Sbjct: 247 TMHNLCVDGGAT 258


29  BPSL1799  BPSL1831  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
BPSL1799  2       14       2.710551   hypothetical protein
BPSL1800  1       13       2.701528   hypothetical protein
BPSL1801  0       13       3.438899   putative ABC transport system, membrane protein
BPSL1802  0       14       3.673296   putative exported protein
BPSL1803  -1      14       3.040178   putative fimbrial chaperone
BPSL1804  -1      13       3.407812   putative outer membrane usher protein precursor
BPSL1805  -1      12       3.105586   putative type-1 fimbrial protein
BPSL1806  0       12       2.877570   multidrug efflux system putative membrane
BPSL1807  -2      11       3.144188   multidrug efflux system transporter protein
BPSL1808  -2      10       3.409678   multidrug efflux system putative membrane fusion
BPSL1809  -1      10       3.839035   TetR family regulatory protein
BPSL1810  1       11       4.561011   subfamily M23B unassigned peptidase
BPSL1811  3       12       4.916421   putative amino acid transport system, membrane
BPSL1812  4       13       5.594975   putative amino acid transport system, membrane
BPSL1813  3       16       4.606034   putative amino acid transport system, exported
BPSL1814  2       14       5.358322   putative membrane protein
BPSL1815  2       14       5.666136   putative membrane protein
BPSL1816  -1      14       6.036667   putative membrane protein
BPSL1817  -2      14       5.553599   putative fimbriae-related membrane protein
BPSL1818  -2      13       4.833465   putative membrane protein
BPSL1819  -1      10       4.807156   putative fimbriae assembly-related protein
BPSL1820  -2      10       2.275333   putative fimbriae assembly-related protein
BPSL1821  -1      12       -0.430508  putative lipoprotein
BPSL1822  -1      13       -1.012556  putative fimbriae assembly-related protein
BPSL1823  -1      12       -3.110060  putative fimbriae-assembly related protein
BPSL1824  -1      9        -2.339708  putative fimbriae assembly-related protein
BPSL1825  -1      9        -1.471696  putative fimbriae assembly-related protein
BPSL1826  -2      10       -1.117377  putative membrane protein
BPSL1827  -2      9        0.115041   putative ABC transport system, ATP-binding
BPSL1828  0       11       1.879985   putative ABC transport system, substrate-binding
BPSL1829  2       11       3.725129   putative ABC transport system, permease protein
BPSL1831  2       15       2.235255   conserved hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL1800  PF00577  677    0.0      Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 677 bits (1749), Expect = 0.0
Identities = 226/851 (26%), Positives = 353/851 (41%), Gaps = 60/851 (7%)

Query: 2 RIRHSFLCVFMLAAGSHARATEFNASFLSIDGRNDVDLSQFAQADYTLPGTYLLDVQVND 61
+R C F A + FN FL+ D + DLS+F PGTY +D+ +N+
Sbjct: 27 FVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDIYLNN 86

Query: 62 VFFGLQPIEFVAHDDGQGARACVAPELVAQFGLKKSLVENLPRTMGGRCADLASL-DGVT 120
+ + + F D QG C+ +A GL + V + C L S+ T
Sbjct: 87 GYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPLTSMIHDAT 146

Query: 121 IRYQKGEGRLKITIAQAALEFADASYLPPERWSDGVDGAMLDYRVLANANHAFGRGAQQN 180
+ G+ RL +TI QA + Y+PPE W G++ +L+Y + N R +
Sbjct: 147 AQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYN--FSGNSVQNRIGGNS 204

Query: 181 NAVQAYGTIGANWGAWRFRGDYQAQ-TRAGGAVYAERAFRFNQLYAYRALPSIRSTLSFG 239
+ G N GAWR R + + + ++ ++ + R + +RS L+ G
Sbjct: 205 HYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLTLG 264

Query: 240 EIYVDSDIFSTFSMSGVAMKSDDRMLPPSMRGYAPLVTGVARTNAIVKVMQDSRVLYMTK 299
+ Y DIF + G + SDD MLP S RG+AP++ G+AR A V + Q+ +Y +
Sbjct: 265 DGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYNST 324

Query: 300 VSPGAFALSNLN-TSVQGTLDVVVEEEDGTVQRFQVATAAVPFLAREGQLRYKTAIGQPR 358
V PG F ++++ G L V ++E DG+ Q F V ++VP L REG RY G+ R
Sbjct: 325 VPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGEYR 384

Query: 359 TFGGAGITPWFGFAEAAYGLPFDVTVYGGLIAASGYTSVAFGVGRDFGRFGALSADVTHA 418
+ P F + +GLP T+YGG A Y + FG+G++ G GALS D+T A
Sbjct: 385 SGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMTQA 444

Query: 419 RATLWWNGRTKRGNSYRINYSKHVDALDADVRFFGYRFSERDYTNFQQFSGDPTASGL-- 476
+TL + G S R Y+K ++ +++ GYR+S Y NF +
Sbjct: 445 NSTL-PDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNIE 503

Query: 477 -------------------ANGKQRYSAMLSKRFGDTST-YFSYDQTTYW-ARPSDRRIG 515
N + + ++++ G TST Y S TYW D +
Sbjct: 504 TQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSNVDEQFQ 563

Query: 516 VTLTRAFSLGALKSVNLGFSAFRTQGAGGGGNQVSLTATLPLGER-----------QTLT 564
L AF + L +S + G ++L +P + +
Sbjct: 564 AGLNTAFEDI---NWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHASAS 620

Query: 565 SSVSAGEGGTSVNAGYLYDGA---NGRTYQLYGGTTDGRASANASLRQRTPSYQ-----L 616
S+S G N +Y N +Y + G G + S T +Y+
Sbjct: 621 YSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNA 680

Query: 617 TAQASTVANAYASASLEVDGSFVATRYGVTAHANGNAGDTRLLVSTDGVPGVPLS-GSYA 675
S ++ V G +A GVT DT +LV G + +
Sbjct: 681 NIGYS-HSDDIKQLYYGVSGGVLAHANGVTLGQPL--NDTVVLVKAPGAKDAKVENQTGV 737

Query: 676 RTNARGYAVIDGVSPYNVYDATVSVEKLGLDTDVTNPIQRTVLTDGAIGYIRFNAARGRN 735
RT+ RGYAV+ + Y + L + D+ N + V T GAI F A G
Sbjct: 738 RTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIK 797

Query: 736 VFVTLTGDGGAPVPFGASVQDAATGKELGIVGEAGAAYLTQVQPRAKLVVRAGAKTICT- 794
+ +TLT + P+PFGA V + + GIV + G YL+ + K+ V+ G +
Sbjct: 798 LLMTLTHNNK-PLPFGAMVTS-ESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENAHC 855

Query: 795 --PAALPDTLQ 803
LP Q
Sbjct: 856 VANYQLPPESQ 866


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
BPSL1802  RTXTOXIND  33     0.002    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 33.3 bits (76), Expect = 0.002
Identities = 18/104 (17%), Positives = 34/104 (32%), Gaps = 2/104 (1%)

Query: 382 APRLTLPIFAGGRNRANLDVADARKHIAVAEYEKTIQTAFREV--ADALAARDQIDAQLA 439
P L LP +N + +V I Q +E+ A R + A++
Sbjct: 165 LPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARIN 224

Query: 440 AQQAVYGADAERLRLAQRRYDSGVASYLELLDAQRSTFESGQEL 483
+ + + RL + +L+ + E+ EL
Sbjct: 225 RYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNEL 268


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL1803  ACRIFLAVINRP  1079   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1079 bits (2791), Expect = 0.0
Identities = 516/1032 (50%), Positives = 701/1032 (67%), Gaps = 6/1032 (0%)

Query: 1 MARFFIDRPVFAWVISLFIMLGGIFAIRALPVAQYPDIAPPVVSLYATYPGASAQVVEES 60
MA FFI RP+FAWV+++ +M+ G AI LPVAQYP IAPP VS+ A YPGA AQ V+++
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTAVIEREMNGVPGLLYTSATS-SAGQASLSLTFKQGVSADLAAVDVQNRLKIVEARLPE 119
VT VIE+ MNG+ L+Y S+TS SAG +++LTF+ G D+A V VQN+L++ LP+
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 120 PVRRDGISIEKAADNAQIIVSLTSEDGRLSGVELGEYASANVLQALRRVEGVGKVQFWGA 179
V++ GIS+EK++ + ++ S++ + ++ +Y ++NV L R+ GVG VQ +GA
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 180 EYAMRIWPDPVKMAALGLTASDIASAVRAHNARVTIGDVGRSAVPDSAPIAATVLADAPL 239
+YAMRIW D + LT D+ + ++ N ++ G +G + + A+++A
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 240 TTPDAFGAIALRARADGSTLYLRDVARIEFGGNDYNYPSFVNGKTATGMGIKLAPGSNAV 299
P+ FG + LR +DGS + L+DVAR+E GG +YN + +NGK A G+GIKLA G+NA+
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 300 ATEKRVRATMEELAKFFPPGVKYQIPYETASFVRVSMSKVVTTLVEAGVLVFAVMFLFMQ 359
T K ++A + EL FFP G+K PY+T FV++S+ +VV TL EA +LVF VM+LF+Q
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 360 NFRATLIPTLVVPVALLGTFGAMLAAGFSINVLTMFGMVLAIGILVDDAIVVVENVERLM 419
N RATLIPT+ VPV LLGTF + A G+SIN LTMFGMVLAIG+LVDDAIVVVENVER+M
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 420 VEEKLPPYEATVKAMKQISGAIVGITVVLTSVFVPMAFFGGAVGNIYRQFAFALAVSIGF 479
+E+KLPP EAT K+M QI GA+VGI +VL++VF+PMAFFGG+ G IYRQF+ + ++
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 480 SAFLALSLTPALCATLLKPVADDHHE-KDGFFGWFNRFVARSTHRYTRRVGRVLERPLRW 538
S +AL LTPALCATLLKPV+ +HHE K GFFGWFN S + YT VG++L R+
Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540

Query: 539 LVVYGALTAAAALLITKLPAAFLPDEDQGNFMVMVIRPQGTPLAETMQSVRRVEEYVRTH 598
L++Y + A +L +LP++FLP+EDQG F+ M+ P G T + + +V +Y +
Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600

Query: 599 SPSAY--TFALGGYNLYGEGPNGGMIFVTMKDWKERKRARDQVQAIIAEINAHFAGTPNT 656
+ F + G++ G+ N GM FV++K W+ER + +A+I +
Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660

Query: 657 MVFAINMPALPDLGLTGGFDFRLQDRGGLGYGAFVAAREKLLAEGRKDPV-LTDLMFAGT 715
V NMPA+ +LG GFDF L D+ GLG+ A AR +LL + P L + G
Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720

Query: 716 QDAPQLKLDIDRAKASALGVSMEEINATLAVMFGSDYIGDFMHGSQVRRVIVQADGRHRL 775
+D Q KL++D+ KA ALGVS+ +IN T++ G Y+ DF+ +V+++ VQAD + R+
Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780

Query: 776 DAADVTKLRVRNAKGEMVPLAAFATLHWTMGPPQLTRYNGFPSFTINGAASAGHSSGEAM 835
DV KL VR+A GEMVP +AF T HW G P+L RYNG PS I G A+ G SSG+AM
Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840

Query: 836 AAIERIASTLPAGTGYAWSGQSYEERLSGAQAPMLFALSVLVVFLALAALYESWSIPFAV 895
A +E +AS LPAG GY W+G SY+ERLSG QAP L A+S +VVFL LAALYESWSIP +V
Sbjct: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900

Query: 896 MLVVPLGVIGAVAGVTLRGMPNDIYFKVGLIATIGLSAKNAILIVEVAKDLVAQR-MSLA 954
MLVVPLG++G + TL ND+YF VGL+ TIGLSAKNAILIVE AKDL+ + +
Sbjct: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960

Query: 955 DAALEAARLRLRPIVMTSLAFGVGVLPLAFATGAASGAQIAIGTGVLGGVISATLFAIFL 1014
+A L A R+RLRPI+MTSLAF +GVLPLA + GA SGAQ A+G GV+GG++SATL AIF
Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020

Query: 1015 VPLFFVCVGRVF 1026
VP+FFV + R F
Sbjct: 1021 VPVFFVVIRRCF 1032


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
BPSL1804  RTXTOXIND  40     1e-05    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 40.2 bits (94), Expect = 1e-05
Identities = 20/133 (15%), Positives = 41/133 (30%), Gaps = 5/133 (3%)

Query: 67 EVRARVAGIVTARTYEEGQEVKRGAVLFRIDPAPFKAARDAAAGALEKARAAHLAALDKR 126
E++ IV +EG+ V++G VL ++ +A +L +AR
Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILS 157

Query: 127 RRYDELVRDRAVSERDHTEALADERQAKAAVASARAELA-----RAQLQLDYATVTAPID 181
R + + E + + + + + + Q +L+ A
Sbjct: 158 RSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERL 217

Query: 182 GRARRALVTEGAL 194
R E
Sbjct: 218 TVLARINRYENLS 230



Score = 35.2 bits (81), Expect = 4e-04
Identities = 18/100 (18%), Positives = 38/100 (38%), Gaps = 10/100 (10%)

Query: 102 KAARDAAAGALEKARAAHLAALDKRRRYDELVRDRAVSERDHTEALADERQAKAAVASAR 161
LE+ + L+A ++ + +L + E L RQ +
Sbjct: 265 VNELRVYKSQLEQIESEILSAKEEYQLVTQLFK---------NEILDKLRQTTDNIGLLT 315

Query: 162 AELARAQLQLDYATVTAPIDGR-ARRALVTEGALVGQDQA 200
ELA+ + + + + AP+ + + + TEG +V +
Sbjct: 316 LELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET 355


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL1805  HTHTETR  117    5e-35    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 117 bits (295), Expect = 5e-35
Identities = 53/210 (25%), Positives = 100/210 (47%), Gaps = 4/210 (1%)

Query: 1 MARKTREESLNTKNRILDAAELVLLEKGVGQTAMADIAEAAGMSRGAVYGHFNGKIEVCV 60
MARKT++E+ T+ ILD A + ++GV T++ +IA+AAG++RGA+Y HF K ++
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 61 AVCDRAFSRAVEGFDLSDERPA---LATLRLAASHYLHQCGEPGSMQRVLEILYMKCEQS 117
+ + + S E + L+ LR H L + ++EI++ KCE
Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120

Query: 118 EENAPLMRRRALYELQTLRIAKALLRRAVAAGELDASLDVHLAGVYLLSLLEGIFGSMIW 177
E A + + + L++ + L+ + A L A L A + + + G+ + ++
Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180

Query: 178 TTRLRGDRWRDAEAMLDAGVDTLRASPALR 207
+ D ++A + ++ P LR
Sbjct: 181 APQSF-DLKKEARDYVAILLEMYLLCPTLR 209


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL1810  PYOCINKILLER  32     0.004    Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 32.1 bits (72), Expect = 0.004
Identities = 30/132 (22%), Positives = 49/132 (37%), Gaps = 4/132 (3%)

Query: 23 RVAAARNELQNAADAAALAGAASLEAGAGAPAWAAAASAAAAALSLNASDGAALSSGDVQ 82
A A+ + + A A AA+ A PA + + AA + + GAA + +
Sbjct: 226 AAAEAKRKAEEQARQQAAIRAANTYA---MPANGSVVATAAGRGLIQVAQGAASLAQAIS 282

Query: 83 TGYWNVTGVPAGLEPTTLAPGEYDVPAVQATVTRAPNQNGGPLSLLMGGLLGLVGTPAAA 142
V G P+ +A G + T + +Q + +G +G P +
Sbjct: 283 DAI-AVLGRVLASAPSVMAVGFASLTYSSRTAEQWQDQTPDSVRYALGMDAAKLGLPPSV 341

Query: 143 TAVAVAGAPATV 154
AVA A TV
Sbjct: 342 NLNAVAKASGTV 353


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL1812  SYCDCHAPRONE  31     0.004    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 31.1 bits (70), Expect = 0.004
Identities = 20/83 (24%), Positives = 32/83 (38%)

Query: 54 SVAESALAAGDAELAATLFERALKADPRSLPAQVGLGDAMYQTGELARAGVLYAQAAAAA 113
S+A + +G E A +F+ D +GLG G+ A Y+ A
Sbjct: 41 SLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMD 100

Query: 114 PDDPRAQLGLARVALRERHLDDA 136
+PR A L++ L +A
Sbjct: 101 IKEPRFPFHAAECLLQKGELAEA 123


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL1815  PF05272  30     0.032    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.7 bits (66), Expect = 0.032
Identities = 18/50 (36%), Positives = 26/50 (52%), Gaps = 4/50 (8%)

Query: 303 IVISGGTGSGKTTLLNAL---SHFIDSHERIVTIEDAAELQLQQPHVVSL 349
+V+ G G GK+TL+N L F D+H I T +D+ E Q+ L
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYE-QIAGIVAYEL 647


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
BPSL1816  HTHFIS  34     0.001    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 33.7 bits (77), Expect = 0.001
Identities = 29/165 (17%), Positives = 52/165 (31%), Gaps = 20/165 (12%)

Query: 22 GARLVAIVADAASDEVIRNLIADQAMTGAQVARGGIDDAIALMRDLSHGPQHLLVDVSGA 81
GA ++ DAA V+ ++ + + ++ DV
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYD--VRITSNAATLWRWIA--AGDGDLVVTDVV-- 56

Query: 82 AMP----LSDLARLADVCDPSVNVIVIGERNDVGLFRSMLRIGVRDYLVKPL----TVEL 133
MP L R+ P + V+V+ +N G DYL KP + +
Sbjct: 57 -MPDENAFDLLPRIKKA-RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGI 114

Query: 134 VHRALSAADPNAAARAGKAIGFVGARGGVGVTSIAVALARHLADR 178
+ RAL+ + + + G S A+ + R
Sbjct: 115 IGRALAEPKRRPSKLEDDSQDGMPLVG----RSAAMQEIYRVLAR 155


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL1818  BCTERIALGSPD  143    4e-39    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 143 bits (361), Expect = 4e-39
Identities = 68/283 (24%), Positives = 116/283 (40%), Gaps = 16/283 (5%)

Query: 127 VVQTLKPYLRQQEALVNRLTLARPIQVHLRVRITEVDRNITQQLGINWSALGA------- 179
+V + E ++ +L + RP QV + I EV LGI W+ A
Sbjct: 322 IVTAAPDVMNDLERVIAQLDIRRP-QVLVEAIIAEVQDADGLNLGIQWANKNAGMTQFTN 380

Query: 180 SGNFVGGLFNGRTLFDTASKAFDLSPSGAFSVVGGFHTSRYSIDG--VLDALDQEGLITM 237
SG + G ++ S A S G Y + +L AL +
Sbjct: 381 SGLPISTAIAGANQYNKDGTVSSSLAS-ALSSFNGIAAGFYQGNWAMLLTALSSSTKNDI 439

Query: 238 LAEPNLTAISGQTASFLAGGEFPIPVAQDTTGA----ITIQFKPYGVSLDFTPTVLADNR 293
LA P++ + A+F G E P+ TT T++ K G+ L P + +
Sbjct: 440 LATPSIVTLDNMEATFNVGQEVPVLTGSQTTSGDNIFNTVERKTVGIKLKVKPQINEGDS 499

Query: 294 ISLKVRPEVSEIDPTNSVTTGSIKVPALTVRRVDTTVELSSGQSFAIGGLLQSKSSDVLA 353
+ L++ EVS + S T+ + R V+ V + SG++ +GGLL SD
Sbjct: 500 VLLEIEQEVSSVADAASSTSSDLGA-TFNTRTVNNAVLVGSGETVVVGGLLDKSVSDTAD 558

Query: 354 ELPGLARLPVLGKLFSSRNYLNDKTEVVVIVTPYIVQPANPGE 396
++P L +PV+G LF S + K +++ + P +++ +
Sbjct: 559 KVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRDEYR 601


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL1820  PREPILNPTASE  32     8e-04    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 32.1 bits (73), Expect = 8e-04
Identities = 31/148 (20%), Positives = 49/148 (33%), Gaps = 18/148 (12%)

Query: 20 LVASWTLASLALADLRTRRLATFAVALVGALYAALALAGAPGDGGFASHAALGAAA---- 75
L+ +W L +L DL L + L+ L G A +GA A
Sbjct: 138 LLLTWVLVALTFIDLDKMLLP--DQLTLPLLWGGLLFNLLGGFVSLGD-AVIGAMAGYLV 194

Query: 76 ----FALGAAMFRAGWIAGGDVKLAAVVFLWAGPAHAWPVAFAIGVGGLAVGAVCIAAGR 131
+ + + GD KL A + W G V + G +G I
Sbjct: 195 LWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAFMGIGLILLRN 254

Query: 132 APRVLAWFAPARGVPYGVALAAGGLLAV 159
++ +P+G LA G +A+
Sbjct: 255 H-------HQSKPIPFGPYLAIAGWIAL 275


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL1825  TCRTETA  30     0.021    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 29.8 bits (67), Expect = 0.021
Identities = 56/306 (18%), Positives = 102/306 (33%), Gaps = 53/306 (17%)

Query: 57 TTQLLNTAGVFAAGF-LMRPIGGWLFGRIADKHGRRAAMMISVLMMCGGSLVIAVLPTYA 115
+ + G+ A + LM+ + G ++D+ GRR +++S+ ++A P
Sbjct: 38 SNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLW 97

Query: 116 QIGALAPLLLLVARLFQGLSVGGEYGTSATYMSEVALQGRR----GFF-ASFQYVTLIGG 170
+L + R+ G++ G + Y++++ R GF A F + + G
Sbjct: 98 --------VLYIGRIVAGIT-GATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGP 148

Query: 171 QLCALLVLVILQQTLSSDALKAWGWRIPFVVGAAAALIS-----LYLRKSLDETSTSESR 225
L L+ PF AA ++ L +S R
Sbjct: 149 VLGGLMGGF--------------SPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRR 194

Query: 226 KAKDAGT-IRGVWQHKG-AFLTVVGFTAGGSLIFYTFTTYMQKYLVNTAGMHAKTASNVM 283
+A + R A L V F L+ + + A T +
Sbjct: 195 EALNPLASFRWARGMTVVAALMAVFFIM--QLVGQVPAALWVIFGEDRFHWDATTIGISL 252

Query: 284 TA-----ALFVYMLMQPMFGALSDKIGRRMSMILFGTG----AVIGTVP------LMHAL 328
A +L M+ P+ L ++ + MI GTG A ++ A
Sbjct: 253 AAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLAS 312

Query: 329 GGVTSP 334
GG+ P
Sbjct: 313 GGIGMP 318


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BPSL1829   AEROLYSIN     32     0.006    Aerolysin signature.
		>AEROLYSIN#Aerolysin signature.

Length = 493

Score = 32.3 bits (73), Expect = 0.006
Identities = 23/76 (30%), Positives = 38/76 (50%), Gaps = 2/76 (2%)

Query: 258 GLAKMQASLAGTVRAVRVGSESIATAARQIAAGNIDLSSRTEEQAAALEQTASSMEELTG 317
GL+ MQ +LA +R VR G +A Q AGNI++ + A + + A S++
Sbjct: 404 GLSTMQNNLARVLRPVRAGITGDFSAESQF-AGNIEIGAPVPLAADSKVRRARSVDGAGQ 462

Query: 318 TVRRNAD-NARQASAL 332
+R +A++ S L
Sbjct: 463 GLRLEIPLDAQELSGL 478


30  BPSL1887  BPSL1901  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
BPSL1887   0       12       3.443095    putative acyl-CoA synthetase
BPSL1888   1       12       3.587094    putative exported protein
BPSL1889   1       12       4.104556    putative exported protein
BPSL1890   0       12       3.447494    Hfq protein
BPSL1891   1       12       3.493422    putative exported protein
BPSL1892   0       12       2.886067    putative sigma-54 related transcriptional
BPSL1893   -1      10       2.129688    putative membrane protein
BPSL1894   -1      11       2.529238    putative lipoprotein
BPSL1895   -1      13       0.740384    putative lipoprotein
BPSL1896   -1      13       2.772638    putative outer-membrane protein
BPSL1897   2       19       3.088105    putative fimbriae-related outer membrane
BPSL1898   2       15       1.902948    putative type II/IV secretion system ATP-binding
BPSL1899   3       15       1.613518    putative fimbriae assembly protein
BPSL1900   3       15       1.792630    putative exported fimbriae assembly protein
BPSL1901   2       14       1.611448    putative membrane fimbriae assembly protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BPSL1887   HTHFIS        297    3e-98    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 297 bits (763), Expect = 3e-98
Identities = 130/475 (27%), Positives = 204/475 (42%), Gaps = 53/475 (11%)

Query: 19 ADIVDRVARCMSSFDVEVIRADN-EELSAERTAMRPSLAIISVSMIE-SGAAFLRTWQAE 76
A I + + +S +V N L A L + V M + + L +
Sbjct: 13 AAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLPRIKKA 72

Query: 77 -IGMPVVWVGA--------------ARDHDPSLYPPEYSHILPLDFTCAELRGMISKLAV 121
+PV+ + A A D+ P P + + ++ + +++
Sbjct: 73 RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPK--PFDLTELIGIIGRA------LAEPKR 124

Query: 122 QLRAHAAKALEPSTLVAHSDCMQALLQEVDTFADCDTNVLLHGETGVGKERIAQLLHEKH 181
+ + + LV S MQ + + + D +++ GE+G GKE +A+ LH+ +
Sbjct: 125 RPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHD-Y 183

Query: 182 SRYGMGEFVPVNCGAIPDGLFESLFFGHAKGSFTGAVGTHKGYFEQAAGGTLFLDEVGDL 241
+ G FV +N AIP L ES FGH KG+FTGA G FEQA GGTLFLDE+GD+
Sbjct: 184 GKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDM 243

Query: 242 PLYQQVKLLRVLEDGAVLRIGATAPVKVDFRLVAASNKKLPQLVKDGLFRADLYYRLAVI 301
P+ Q +LLRVL+ G +G P++ D R+VAA+NK L Q + GLFR DLYYRL V+
Sbjct: 244 PMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVV 303

Query: 302 ELSIPSLEERGPVDKIALFKSFVASIVGEDRLAALPELPYWLAEAVADSYFPGNVRELRN 361
L +P L +R D L + FV E + E + +PGNVREL N
Sbjct: 304 PLRLPPLRDR-AEDIPDLVRHFVQQAEKEGL--DVKRFDQEALELMKAHPWPGNVRELEN 360

Query: 362 LAERVGV------------------------TVRQTGGWDTARLQRLIAHARSAAQPAPA 397
L R+ + + + + + +
Sbjct: 361 LVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFG 420

Query: 398 ESAPDVFVDRSKWDMTERNRVIAALDANGWRRQDTAQHLGISRKVLWEKMRKYQI 452
++ P + E ++AAL A + A LG++R L +K+R+ +
Sbjct: 421 DALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BPSL1889   cloacin       27     0.031    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 26.6 bits (58), Expect = 0.031
Identities = 13/32 (40%), Positives = 14/32 (43%)

Query: 99 GSAAGMSGMSGGGGGGGGGGGAGYSLAPASGS 130
GS G SG G GGG G G S + S
Sbjct: 51 GSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLS 82


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BPSL1890   PYOCINKILLER  32     0.004    Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 31.7 bits (71), Expect = 0.004
Identities = 29/86 (33%), Positives = 37/86 (43%), Gaps = 3/86 (3%)

Query: 224 LMNQLKLAPAVRTEIRNDATRIAAAARARQRA-LARPGAPGAAASAGATLAASAAGSNGG 282
MN L A A + R AAA A+++A AA A T A A GS
Sbjct: 203 RMNTLTAAKASIEAAAANKAREQAAAEAKRKAEEQARQQ--AAIRAANTYAMPANGSVVA 260

Query: 283 AAAGKGAVAGAGASAPGAAATATAAA 308
AAG+G + A +A A A + A A
Sbjct: 261 TAAGRGLIQVAQGAASLAQAISDAIA 286


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BPSL1894   HTHFIS        38     5e-05    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 38.3 bits (89), Expect = 5e-05
Identities = 11/63 (17%), Positives = 26/63 (41%)

Query: 79 AALRVSHPGLPIVALGSLGEPESALAALRAGVRDFIDFSAPAEDALRITRGLLDHVGDQP 138
++ + P LP++ + + +A+ A G D++ + + I L +P
Sbjct: 67 PRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRRP 126

Query: 139 SRH 141
S+
Sbjct: 127 SKL 129


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BPSL1895   BCTERIALGSPD  138    2e-37    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 138 bits (349), Expect = 2e-37
Identities = 58/249 (23%), Positives = 111/249 (44%), Gaps = 11/249 (4%)

Query: 160 VQVDVRVVEFSRSVLKQAGLNFFKQNNGFTFGSFAPAGLASVTGGG----TSSMSVSANI 215
V V+ + E + G+ + +N G T + + +++ G S+
Sbjct: 347 VLVEAIIAEVQDADGLNLGIQWANKNAGMTQFTNSGLPISTAIAGANQYNKDGTVSSSLA 406

Query: 216 PIASAFN-LVVGSATRGLFADLSILEANNLARVLAQPTLVALSGQSASFLAGGEIPVPVP 274
S+FN + G L+ L ++ +LA P++V L A+F G E+PV
Sbjct: 407 SALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTG 466

Query: 275 QSLGT-----ISIDWKPYGVGLTLTPTVLSPRRIALKVAPESSQLDFVHSITINGVTVPA 329
+ +++ K G+ L + P + + L++ E S + S + +
Sbjct: 467 SQTTSGDNIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSVADAAS-STSSDLGAT 525

Query: 330 LTTRRADTTVELGDGESFAIGGLIDRETTSNVDKVPFLGDLPIIGTFFKHLSYQQNDKEL 389
TR + V +G GE+ +GGL+D+ + DKVP LGD+P+IG F+ S + + + L
Sbjct: 526 FNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNL 585

Query: 390 VIIVTPHLV 398
++ + P ++
Sbjct: 586 MLFIRPTVI 594


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BPSL1898   PREPILNPTASE  54     3e-11    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 53.7 bits (129), Expect = 3e-11
Identities = 31/124 (25%), Positives = 52/124 (41%), Gaps = 10/124 (8%)

Query: 4 LFSIGFFFAWAAAVAIADCRDRRIPNELVLVGLAAVIIFTVCRQNPFGTTLSGALIGGAV 63
+ A+ D +P++L L L ++F + F +L A+IG
Sbjct: 134 TLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNL--LGGF-VSLGDAVIGAMA 190

Query: 64 GLVSLFPFFAL-------RVMGAADVKVFAVLGAWCGLPALPRLWVVASVAAGVHALALM 116
G + L+ + MG D K+ A LGAW G ALP + +++S+ + L+
Sbjct: 191 GYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAFMGIGLI 250

Query: 117 LLTR 120
LL
Sbjct: 251 LLRN 254


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BPSL1901   cloacin       45     6e-07    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 44.7 bits (105), Expect = 6e-07
Identities = 33/117 (28%), Positives = 51/117 (43%), Gaps = 1/117 (0%)

Query: 30 GGSGTISKGLDGSGSGSGGGNAISTTGGSGSGGTSGAGGSGSGGSGSSGSTGGLSGGGGS 89
G+ + S ++G +G G G S G S GGSGSG G +G +GGG
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSG-IHWGGGSGHGNGGGNG 69

Query: 90 TSGGGSTSGGGSTSGGTSTSSSINALGTIAGNTGGIISGAGSTVSGLGTVVGSQTLP 146
SGGGS +GG ++ + AL T + AG+ + + ++ + P
Sbjct: 70 NSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMAALKGP 126



Score = 40.5 bits (94), Expect = 1e-05
Identities = 33/123 (26%), Positives = 46/123 (37%), Gaps = 2/123 (1%)

Query: 38 GLDGSGSGSGGGNAI-STTGGSGSGGTSGAGGSGSGGSGSSGSTGGLSGGG-GSTSGGGS 95
G DG G +G + + GG G G GSG S + GG SG G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 96 TSGGGSTSGGTSTSSSINALGTIAGNTGGIISGAGSTVSGLGTVVGSQTLPGVNPQTTQA 155
+GGG+ + G + + N A G + + GL + + L A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMAA 122

Query: 156 LGG 158
L G
Sbjct: 123 LKG 125



Score = 33.9 bits (77), Expect = 0.001
Identities = 31/122 (25%), Positives = 48/122 (39%), Gaps = 1/122 (0%)

Query: 58 SGSGGTSGAGGSGSGGSGSSGSTGGLSGGGGSTSGGGSTSGGGSTSGGTSTSSSINALGT 117
SG G G+ S +G GL GGG++ G G +S GG+ +
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 118 IAGNTGGIISGAGSTVSGLGTVVGSQTLPGVNPQTTQALGGIVQDL-GGAVSALGSGVTS 176
G SG GS G + V + G +T GG+ + GA+SA + + +
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMA 121

Query: 177 GI 178
+
Sbjct: 122 AL 123


31  BPSL1924  BPSL1957  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
BPSL1924   3       29       -3.358007   N utilization substance protein A
BPSL1925   2       33       -4.323006   conserved hypothetical protein
BPSL1926   3       28       -5.327355   conserved hypothetical protein
BPSL1927   2       20       -4.590229   conserved hypothetical protein
BPSL1929   3       20       -4.074495   putative transcriptional regulator
BPSL1930   1       16       -3.574669   hypothetical protein
BPSL1931   2       23       -5.052558   hypothetical protein
BPSL1932   0       16       -4.048892   hypothetical protein
BPSL1933   -2      13       -3.561256   putative exported protein
BPSL1934   -2      14       -3.450022   putative exported protein
BPSL1935   -3      11       -3.139807   hypothetical protein
BPSL1936   -2      11       -3.312812   conserved hypothetical protein
BPSL1937   -2      10       -2.092737   transposase
BPSL1938   0       13       -2.596462   putative exported protein
BPSL1939   1       13       -3.625249   putative exported protein
BPSL1940   -1      12       -4.353170   hypothetical protein
BPSL1941   -1      11       -3.744466   *putative membrane protein
BPSL1942   -1      12       -3.584294   putative exported protein
BPSL1943   -2      10       -2.584562   putative transcriptional regulator
BPSL1944   -3      9        -1.952866   integration host factor alpha-subunit
BPSL1945   -3      9        -0.512045   phenylalanyl-tRNA synthetase beta chain
BPSL1946   0       10       1.779090    phenylalanyl-tRNA synthetase alpha chain
BPSL1947   1       12       2.735839    50S ribosomal protein L20
BPSL1948   1       11       3.659517    50S ribosomal protein L35
BPSL1949   1       10       3.238745    translation initiation factor IF-3
BPSL1950   -1      10       2.612235    threonyl-tRNA synthetase
BPSL1951   -1      10       1.983835    *GTP pyrophosphokinase
BPSL1952   -1      8        0.095113    conserved hypothetical protein
BPSL1953   1       10       -0.874050   putative hydrolase
BPSL1954   1       11       -2.105165   putative LysR-family transcriptional regulator
BPSL1955   2       10       -1.965298   putative exported protein
BPSL1956   2       11       -1.993331   putative polysaccharide deacetylase
BPSL1957   2       11       -1.359844   putative dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BPSL1937   PF00577       31     0.025    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 30.6 bits (69), Expect = 0.025
Identities = 32/179 (17%), Positives = 58/179 (32%), Gaps = 36/179 (20%)

Query: 480 APWDAMSDLFNRHLLDYSPRSLNDLKLSADGGALRVRGGIKLWNQVPPGVWLPADMKGSL 539
AP + FN L P+++ DL +G ++PPG + D+ +
Sbjct: 40 APLSSAELYFNPRFLADDPQAVADLSRFENG------------QELPPGTY-RVDIYLNN 86

Query: 540 TLLDERHLAFTPTQVSVLGIP--QAKLLRALGIELSSLAPLKRRGAELRGDSLVLDQYTV 597
+ R + F +P L ++G+ +S++ + + L
Sbjct: 87 GYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLA---DDACVPLTSM-- 141

Query: 598 FPPPVLIGHMSQATVEPDG----LRLTFRPAPNAPVLRPPANLPGSYLWLEGGDTKMFN 652
+ AT + D L LT P A + LW G + + N
Sbjct: 142 ---------IHDATAQLDVGQQRLNLTI---PQAFMSNRARGYIPPELWDPGINAGLLN 188


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BPSL1939   DNABINDINGHU  117    3e-38    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 117 bits (296), Expect = 3e-38
Identities = 35/89 (39%), Positives = 53/89 (59%)

Query: 18 TKAELAELLFDSVGLNKREAKDMVEAFFEVIRDALENGESVKLSGFGNFQLRDKPQRPGR 77
K +L + ++ L K+++ V+A F + L GE V+L GFGNF++R++ R GR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 78 NPKTGEAIPIAARRVVTFHASQKLKALVE 106
NP+TGE I I A +V F A + LK V+
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAVK 91


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BPSL1947   SECYTRNLCASE  27     0.020    Preprotein translocase SecY subunit signature.
		>SECYTRNLCASE#Preprotein translocase SecY subunit signature.

Length = 437

Score = 27.0 bits (60), Expect = 0.020
Identities = 10/34 (29%), Positives = 15/34 (44%)

Query: 63 SVQIFISDMANFPGMNEVWDAWVAQGATPPRATV 96
S+ + +A F G N W +WV Q T +
Sbjct: 284 SLLYIPALVAQFAGGNSGWKSWVEQNLTKGDHPI 317


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BPSL1951   PYOCINKILLER  28     0.025    Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 27.8 bits (61), Expect = 0.025
Identities = 19/68 (27%), Positives = 27/68 (39%), Gaps = 2/68 (2%)

Query: 22 AATLAPAHADTTGLIEPAHLSVDGSLPAAQRDAQILAARRYDTFWHNGDPALARAALADD 81
AA+LA A +D ++ S + A A + + R W + P R AL D
Sbjct: 274 AASLAQAISDAIAVLGRVLASAPSVM--AVGFASLTYSSRTAEQWQDQTPDSVRYALGMD 331

Query: 82 FADRTPPP 89
A PP
Sbjct: 332 AAKLGLPP 339


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BPSL1952   PRTACTNFAMLY  29     0.038    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 28.9 bits (64), Expect = 0.038
Identities = 20/102 (19%), Positives = 33/102 (32%), Gaps = 10/102 (9%)

Query: 39 ALGAAAAPGRALAAGATATADTGAASLAGGSLRRSPAGEP---------EAAHGAFWPNG 89
A R +G + +A G GG+ R +P P A A
Sbjct: 319 AAIRVGRGARVTVSGGSLSAPHGNVIETGGARRFAPQAAPLSITLQAGAHAQGKALLYRV 378

Query: 90 ARLVISISMQFEAGGQPPTGADSPFPPVDFPPQVPVDLASAT 131
+ +++ A Q A P + P+D+A A+
Sbjct: 379 LPEPVKLTLTGGADAQGDIVATEL-PSIPGTSIGPLDVALAS 419


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BPSL1953   DHBDHDRGNASE  74     6e-18    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 73.9 bits (181), Expect = 6e-18
Identities = 60/249 (24%), Positives = 98/249 (39%), Gaps = 19/249 (7%)

Query: 10 VLVIGGSSGIGAAAARAFAVLDADVTIASRDANKLAAAARAIDG-PRPVRQAVLDTTDAP 68
+ G + GIG A AR A A + + KL ++ R D D+
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSA 70

Query: 69 AVDA----FFAEAGPFDHVVMSAAHTPGGPVRKLPLADAQAAMDSKFWGAY----RVARA 120
A+D E GP D +V A G + L + +A G + V++
Sbjct: 71 AIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSKY 130

Query: 121 ARIAPGGSLTFVSGFLSVRPSASAVLQGAINAALEALARGLALELAP--VRVNTVSPGLV 178
GS+ V + P S + AA + L LELA +R N VSPG
Sbjct: 131 MMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGST 190

Query: 179 ATPLWSKL--GDAAREAMYASAAAR----LPARRVGQPEDIANAIVYLAATR--YATGST 230
T + L + E + + +P +++ +P DIA+A+++L + + + T
Sbjct: 191 ETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHITMHN 250

Query: 231 VLVDGGGAI 239
+ VDGG +
Sbjct: 251 LCVDGGATL 259


32  BPSL1993  BPSL2006  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
BPSL1993   1       14       3.437599    putative oxidoreductase
BPSL1994   1       13       4.595512    putative oxidoreductase
BPSL1995   2       13       4.789976    putative transcriptional regulatory protein
BPSL1996   1       14       4.444496    putative sugar ABC transport system,
BPSL1997   2       15       4.646374    putative sugar ABC transport system, membrane
BPSL1998   2       15       4.813912    putative ABC transport system, ATP-binding
BPSL1999   1       14       4.241145    putative carbohydrate kinase
BPSL2000   2       11       2.011744    putative TPP-binding acetolactate synthase
BPSL2001   1       9        2.123090    putative amine catabolism-related protein
BPSL2002   0       10       3.102463    conserved hypothetical protein
BPSL2003   0       11       3.547412    putative branched amino acid related transport
BPSL2004   -1      7        2.648294    putative branched amino acid related transport
BPSL2005   -2      10       2.757884    putative branched amino acid transport system,
BPSL2006   -1      11       3.449222    putative thioesterase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BPSL1993   PF05272       30     0.015    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.7 bits (66), Expect = 0.015
Identities = 15/42 (35%), Positives = 19/42 (45%), Gaps = 7/42 (16%)

Query: 41 LLGDNGAGKSTLIKTLAGVHPPSDGQYLVDGKPVLFDSPKDA 82
L G G GKSTLI TL G+ + D + KD+
Sbjct: 601 LEGTGGIGKSTLINTLVGL------DFFSDT-HFDIGTGKDS 635


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BPSL1996   PHPHTRNFRASE  29     0.040    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 28.6 bits (64), Expect = 0.040
Identities = 15/90 (16%), Positives = 34/90 (37%), Gaps = 5/90 (5%)

Query: 99 NGATVMVYGEVAGTIQGSPAPLYQRPRFVDDAQW----DAYAERVDAFARYTRAQGV-RL 153
T + + G + P + + A+ ++ +A+ +
Sbjct: 207 KEVTEKIQHGDMVIVDGIEGIVIVNPTEEEVKAYEEKRAAFEKQKQEWAKLVGEPSTTKD 266

Query: 154 GYHHHMGAYVESPADVDRLMASTSDAVGLL 183
G H + A + +P DVD ++A+ + +GL
Sbjct: 267 GAHVELAANIGTPKDVDGVLANGGEGIGLY 296


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BPSL2005   FLGHOOKAP1    28     0.039    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 27.6 bits (61), Expect = 0.039
Identities = 11/34 (32%), Positives = 16/34 (47%), Gaps = 2/34 (5%)

Query: 190 VDIREEALHELIDRLDDLASEFHSAF--LHEAGK 221
+ R + L + + L LA F AF H+AG
Sbjct: 283 LTFRSQDLDQTRNTLGQLALAFAEAFNTQHKAGF 316


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BPSL2006   V8PROTEASE    48     6e-08    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 47.7 bits (113), Expect = 6e-08
Identities = 32/154 (20%), Positives = 54/154 (35%), Gaps = 26/154 (16%)

Query: 119 GSGFIVGADGIILTTAYVVGQASEATVRLIDRR-----------EFKA-RVLAVDDSSDV 166
SG +VG +LT +VV L F A ++ D+
Sbjct: 104 ASGVVVG-KDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGEGDL 162

Query: 167 AVLQIDATK--------LPTVRLGDSSRVRTGEPVLTIGTPDGSANTVTTGIVSATARML 218
A+++ + + + +++ + + + G P G T + ++
Sbjct: 163 AIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYP-GDKPVATMW--ESKGKIT 219

Query: 219 PDGGRFPFFQTDVTGNLDNSGGPVFNRAGEVIGI 252
G + TG NSG PVFN EVIGI
Sbjct: 220 YLKGEAMQYDLSTTGG--NSGSPVFNEKNEVIGI 251


33  BPSL2020  BPSL2086  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
BPSL2020   2       11       2.766663    putative membrane attached glycosyl hydrolase
BPSL2021   1       12       2.456776    hypothetical protein
BPSL2022   2       11       3.436518    Di-haem cytochrome c peroxidase
BPSL2023   3       13       2.788589    putative acid phosphatase
BPSL2024   3       12       2.952148    conserved hypothetical protein
BPSL2025   2       12       2.918827    HlyD family secretion protein
BPSL2026   4       11       2.539054    putative drug-resistance related membrane
BPSL2027   1       10       2.360130    putative drug-resistance related outer-membrane
BPSL2028   -2      9        1.307554    putative two component regulatory system,
BPSL2029   -1      9        0.686840    putative two component regulatory system, sensor
BPSL2030   -3      9        0.047030    putative lipoprotein
BPSL2031   -2      11       -0.071154   putative fimbriae-related protein
BPSL2032   0       12       -0.364962   putative fimbriae-assembly chaperone
BPSL2033   1       18       -3.149036   putative exported protein
BPSL2034   7       43       -9.343203   putative exported protein
BPSL2035   8       48       -11.126086  putative exported protein
BPSL2036   5       35       -6.321400   putative LysR-family transcriptional regulator
BPSL2037   5       36       -6.387149   putative transport-related membrane protein
BPSL2038   5       35       -6.233674   putative hydrolase
BPSL2039   5       35       -5.516968   putative regulatory protein
BPSL2040   0       23       -0.977710   putative exported protein
BPSL2041   2       25       -0.499834   hypothetical protein
BPSL2042   1       21       -2.767899   putative exported protein
BPSL2042A  4       30       -5.362324   putative membrane protein
BPSL2043   5       34       -6.700951   hypothetical protein
BPSL2044   6       33       -6.635656   conserved hypothetical protein
BPSL2045   5       30       -5.619409   hypothetical protein
BPSL2046   2       20       -2.719918   hypothetical protein
BPSL2047   5       24       -3.365074   putative lipoprotein
BPSL2048   1       13       1.395366    hypothetical protein
BPSL2048A  -1      9        2.852256    putative lipoprotein
BPSL2049   -1      8        3.521986    conserved hypothetical protein
BPSL2050   -1      8        3.428879    conserved hypothetical protein
BPSL2051   0       10       2.192240    conserved hypothetical protein
BPSL2052   -1      11       2.612530    conserved hypothetical protein
BPSL2053   -1      11       1.707851    conserved hypothetical protein
BPSL2054   1       12       2.189268    conserved hypothetical protein
BPSL2055   0       12       1.645463    conserved hypothetical protein
BPSL2056   1       11       1.549218    putative membrane protein
BPSL2057   2       11       2.070603    conserved hypothetical protein
BPSL2058   2       10       1.405758    hypothetical protein
BPSL2059   1       11       1.013785    conserved hypothetical protein
BPSL2060   1       11       0.460385    putative oxidase
BPSL2061   1       11       0.497830    putative exported protein
BPSL2062   3       12       0.199063    conserved hypothetical protein
BPSL2063   2       12       -0.458308   putative exported protein
BPSL2065   -2      13       -0.656366   conserved hypothetical protein
BPSL2066   -2      13       -0.569101   conserved hypothetical protein
BPSL2067   -2      14       -0.913126   OmpA family protein
BPSL2068   1       13       -1.194434   putative membrane protein
BPSL2069   0       9        1.894983    hypothetical protein
BPSL2070   -1      8        2.348702    putative two component system, response
BPSL2071   -1      9        2.940132    putative exported protein
BPSL2072   -1      8        3.138842    putative membrane protein
BPSL2073   -1      9        3.687995    putative two component system, response
BPSL2074   -1      10       5.020165    putative LysR-family transcriptional regulator
BPSL2075   0       10       5.125527    hypothetical protein
BPSL2076   0       9        5.647076    putative membrane protein
BPSL2077   1       8        5.101675    hypothetical protein
BPSL2078   2       8        4.621367    poly(3-hydroxybutyrate) depolymerase precursor
BPSL2079   2       11       3.511844    putative alpha-amylase-related protein
BPSL2080   1       14       1.402005    putative trehalose synthase protein
BPSL2081   3       22       -1.280466   1,4-alpha-glucan branching enzyme
BPSL2082   1       22       -1.199491   putative glycogen operon related protein
BPSL2083   1       21       -0.542078   putative trehalose trehalohydrolase protein
BPSL2084   2       22       -1.695931   putative glucanotransferase
BPSL2084A  1       29       -5.634951   putative glycosyl hydrolase
BPSL2085   1       29       -5.787886   conserved hypothetical protein
BPSL2086   0       25       -4.129944   conserved hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BPSL2020   RTXTOXIND     99     4e-25    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 99 bits (249), Expect = 4e-25
Identities = 63/414 (15%), Positives = 134/414 (32%), Gaps = 91/414 (21%)

Query: 51 KRPGKKPLVVLAIIVVLLLVGAFVW-WFATRNQVSTDDA--YTDGNAITIAPKVSGYVVA 107
+ P + ++A ++ LV AF+ V+T + G + I P + V
Sbjct: 50 ETPVSRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKE 109

Query: 108 LAIDDNVYVHRGDLLLVIDQRDYQAQVDAARAQLGLAQAQLDAAQVQLDIA------HVQ 161
+ + + V +GD+LL + +A ++ L A+ + Q+ ++
Sbjct: 110 IIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELK 169

Query: 162 FPAQYRQAQA---QIEAAQASFRQALAAYERQHAVDARATSQQAIDVADAQRLTADANVA 218
P + ++ + ++ + ++ Q + + +D A+RLT A +
Sbjct: 170 LPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQ-----KYQKELNLDKKRAERLTVLARIN 224

Query: 219 TARAQA----------------------------RTASLVPQQIRQAQTAVEQRRQQVLQ 250
+ ++R ++ +EQ ++L
Sbjct: 225 RYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILS 284

Query: 251 AQA-----------------------------QLEAAQLALSYCEVRAPSDGWITRRNVQ 281
A+ +L + +RAP + + V
Sbjct: 285 AKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVH 344

Query: 282 -LGSFLQAGAALFAIVTPQ---LWVTANFKESQLERMRAGDRVSVSVDAYP---NLELHG 334
G + L IV P+ L VTA + + + G + V+A+P L G
Sbjct: 345 TEGGVVTTAETLMVIV-PEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVG 403

Query: 335 HVDSIQLGSGSRFSAFPPENATGNFVKIVQRVPVKIAIDGGLPRDPPLGIGLSV 388
V +I L + + G ++ + G ++ PL G++V
Sbjct: 404 KVKNINLDA-------IEDQRLGLVFNVIISIEENCLSTGN--KNIPLSSGMAV 448


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BPSL2021   TCRTETB       102    2e-25    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 102 bits (256), Expect = 2e-25
Identities = 71/331 (21%), Positives = 140/331 (42%), Gaps = 20/331 (6%)

Query: 41 AFMEVLDTTIVNVALPHIAGTMSASYDEATWTLTSYLVANGIVLPISGFLGRLLGRKRYF 100
+F VL+ ++NV+LP IA + W T++++ I + G L LG KR
Sbjct: 23 SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLL 82

Query: 101 VLCIVAFTICSFLCGIATDLGQLIVF-RVLQGLFGGGLQPNQQSIILDTF-PPEQRNRAF 158
+ I+ S + + L++ R +QG G P +++ + P E R +AF
Sbjct: 83 LFGIIINCFGSVIGFVGHSFFSLLIMARFIQGA-GAAAFPALVMVVVARYIPKENRGKAF 141

Query: 159 SISAVAIVVAPVLGPTLGGWITDNFSWRWVFLLNVPIGVLTSLAVIQLVEDPPWKRGRAR 218
+ + + +GP +GG I W +LL +P+ +T + V L++ K R +
Sbjct: 142 GLIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPM--ITIITVPFLMKLLK-KEVRIK 196

Query: 219 GLSIDYIGITLIAIGLGCLQVMLDRGEDEDWFASTFIRTFAVLTVAGLVGATFWLLYAKK 278
G D GI L+++G+ + F +++ +F +++V + +
Sbjct: 197 G-HFDIKGIILMSVGIVFFML----------FTTSYSISFLIVSVLSFLIFVKHIRKVTD 245

Query: 279 PVVDLSCLKDRNFALGCVTIATFAVVLYGSAVLVPQLAQQRLGYTAMLAG-LVLSPGALL 337
P VD K+ F +G + + G +VP + + + G +++ PG +
Sbjct: 246 PFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMS 305

Query: 338 ITLEIPIVSKLMPYVQTRFLVCFGFLLLAAS 368
+ + I L+ +++ G L+ S
Sbjct: 306 VIIFGYIGGILVDRRGPLYVLNIGVTFLSVS 336


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BPSL2024   HTHFIS        54     8e-11    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 53.7 bits (129), Expect = 8e-11
Identities = 41/158 (25%), Positives = 64/158 (40%), Gaps = 13/158 (8%)

Query: 2 LIADDHPLVLLGVRHMLAGMG-DVSIVGEAHDPAGLLALLAATPCDIVITDFAMPEQPAA 60
L+ADD + + L+ G DV I A L +AA D+V+TD MP+
Sbjct: 7 LVADDDAAIRTVLNQALSRAGYDVRITSNAAT---LWRWIAAGDGDLVVTDVVMPD---E 60

Query: 61 DGLAMLTAIRDGYPSVRVIVLTMLDNPVLMHTMRQAGALAVLSKRGDLDEL----PRALA 116
+ +L I+ P + V+V++ + + + GA L K DL EL RALA
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 117 AVYQGRPFVGTHAGAAGGGAMRGTDAPRQLSPREIEVV 154
+ + G + G A Q R + +
Sbjct: 121 EPKRRPS--KLEDDSQDGMPLVGRSAAMQEIYRVLARL 156


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BPSL2025   HTHFIS        63     1e-12    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 63.3 bits (154), Expect = 1e-12
Identities = 30/122 (24%), Positives = 50/122 (40%), Gaps = 10/122 (8%)

Query: 440 RVLVVDDQEMNRIVLRYQLDALGHHARLCASGDEALRALGTAAYDVVLTDCRMPGMDGIA 499
+LV DD R VL L G+ R+ ++ R + D+V+TD MP +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 500 LTAAIRAH-PDARVRATPIVGVTALVSDAEHARCVDAGMTLCIGKP----TTLDALERAL 554
L I+ PD P++ ++A + + + G + KP + + RAL
Sbjct: 65 LLPRIKKARPD-----LPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119

Query: 555 VE 556
E
Sbjct: 120 AE 121


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BPSL2027   PF00577       455    e-150    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 455 bits (1171), Expect = e-150
Identities = 166/808 (20%), Positives = 267/808 (33%), Gaps = 89/808 (11%)

Query: 37 GTLYLELVVN-ALSTGRIVPVRYRDGIYYARA----GDLAQASVRTGAQP-------DAL 84
GT +++ +N R V D LA + T + DA
Sbjct: 76 GTYRVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDAC 135

Query: 85 VDL-SRLDGVQVEYESAEQRLKLTVPPDWLPRQTLG--SPRLYDRTPAAVSFGLLFNYDV 141
V L S + + + +QRL LT+P ++ + G P L+D A L NY+
Sbjct: 136 VPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINA----GLLNYNF 191

Query: 142 YANSPT--LGTSYTSAWTEQRLFDRWGTVTNTGVYRRDYGGGAGGVGSNRYLRYDTFWRY 199
NS +G + A+ + G Y GS ++ W
Sbjct: 192 SGNSVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLE 251

Query: 200 SDQDRLR-TYTAGDVITGALSWSSAVRLGGVSVERDFKVRPDIVTYPLPQFSGQAAVPTA 258
D LR T GD T + + G + D + PD P G A
Sbjct: 252 RDIIPLRSRLTLGDGYTQGDIFDG-INFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQ 310

Query: 259 VDLFINGSKTTTGQVNPGPFTMNNVPFINGAGEATVVTTDALGRQVATTIPFYVANTLLQ 318
V + NG V PGPFT+N++ +G+ V +A G T+P+ L +
Sbjct: 311 VTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQR 370

Query: 319 KGLSDYSLSAGAMRRDYGIRSFSYGKFAASGTARHGLTDYLTLEGHVEGGERFALGGLGF 378
+G + YS++AG R + T HGL T+ G + +R+ G
Sbjct: 371 EGHTRYSITAGEYRSGNAQQE---KPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGI 427

Query: 379 DLGIGMFGVLGVAATQSRLAGASGRQY---------------------AFGYSYASQRF- 416
+G G L V TQ+ Q+ GY Y++ +
Sbjct: 428 GKNMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYF 487

Query: 417 SVSLQRIQRTNGFRDLS--------VYDLPANVAYRLVRSSTQATGALNLGALG----GT 464
+ + R NG+ + R Q T LG
Sbjct: 488 NFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSG 547

Query: 465 LGAGYFDVRGADGTRTRIANLSYTRPLWRRATLYASVNKTVGEHGVAAQLQLIV--PLG- 521
Y+ D N ++ W TL S+ K + G L L V P
Sbjct: 548 SHQTYWGTSNVDEQFQAGLNTAFEDINW---TLSYSLTKNAWQKGRDQMLALNVNIPFSH 604

Query: 522 ----------EPGVVTGALARDANNSFSERVQYSRSVPSDGGLGWNL--AYAGGGSHYQ- 568
+ +++ D N + ++ D L +++ YAGGG
Sbjct: 605 WLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSG 664

Query: 569 ---QADATWRNRYFQAQGGVYGYGAGRGYARWGEVQGSVVVMDGAVLPANRVDDAFVLID 625
A +R Y A G + + V G V+ V ++D VL+
Sbjct: 665 STGYATLNYRGGYGNANIGYSHSDDIKQL--YYGVSGGVLAHANGVTLGQPLNDTVVLVK 722

Query: 626 TQGRGGVPVRYENQLVGKTDGGGHLLVPWAPSYYAGKYEIDPLDLPSNVRVPIVERRVAV 685
G V ENQ +TD G+ ++P+A Y + +D L NV + V
Sbjct: 723 APGAKDAKV--ENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVP 780

Query: 686 RDHGGALVTFPIRRIVCAQIALVDAAGRPVAIGSRVLHEESGETALVGWQGETYLEGLSA 745
F R + + +P+ G+ V E S + +V G+ YL G+
Sbjct: 781 TRGAIVRAEFKARVG-IKLLMTLTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPL 839

Query: 746 LNHLRVR--TPDGRTCRATFAADIDAAQ 771
++V+ + C A + ++ Q
Sbjct: 840 AGKVQVKWGEEENAHCVANYQLPPESQQ 867


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BPSL2033   TCRTETB       41     5e-06    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 41.4 bits (97), Expect = 5e-06
Identities = 59/353 (16%), Positives = 121/353 (34%), Gaps = 55/353 (15%)

Query: 27 VDTQMFSLVIPALLTAWGIGKGQAGLIGGATLAAGAIGGLLAGMIADRFGRVRALQITVC 86
++ + ++ +P + + + A + +IG + G ++D+ G R L +
Sbjct: 28 LNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGII 87

Query: 87 WFSLFTFLSAFAQNFEQLLVL-KTLQGLGFGGEWTAGAVLLSETIRARHRGKAMGIVQSA 145
+ + +F LL++ + +QG G V+++ I +RGKA G++ S
Sbjct: 88 INCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSI 147

Query: 146 WGFGWGGAVLLYTLVFSWLPPEWAWRVLFAIGVLPALLVLYIRRAIPEPPRDDAR----- 200
G G + ++ ++ W L I ++ + V ++ + + + R
Sbjct: 148 VAMGEGVGPAIGGMIAHYI----HWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKG 203

Query: 201 ----------------------VAVSTSAAVAQTAPARASAKSIFDPSV------LRMTI 232
+ VS + + R DP + + +
Sbjct: 204 IILMSVGIVFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVL 263

Query: 233 VGGLIGVGAHGGYHAITTWLPTYLKTERHLSVLGTG------AYLAVIIVAFIIGCMTSA 286
GG+I G + +P +K LS G ++VII +I G
Sbjct: 264 CGGIIFGTVAG----FVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGG----- 314

Query: 287 YLQDRIGRRRNLMLFSACCVVTVNLYVMLPLDNVAMLLLGFPLGFFAAGIPAT 339
L DR G +L ++V+ L + + F G+ T
Sbjct: 315 ILVDRRGPL--YVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFT 365


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BPSL2062   OUTRMMBRANEA  127    2e-37    Outer membrane protein A signature.
		>OUTRMMBRANEA#Outer membrane protein A signature.

Length = 346

Score = 127 bits (321), Expect = 2e-37
Identities = 68/151 (45%), Positives = 95/151 (62%), Gaps = 10/151 (6%)

Query: 87 FQCGEPAQPVAQQPQPAPAAAPAAEPIRLNADAMFAFDRADAASMTEQGRQQLSQLAQRL 146
F GE A VA P PAPA + L +D +F F++A + +G+ L QL +L
Sbjct: 191 FGQGEAAPVVA--PAPAPAPEVQTKHFTLKSDVLFNFNKAT---LKPEGQAALDQLYSQL 245

Query: 147 TDRHAQTVSIV--GYTDRLGSDAYNRQLSQARAKTVGDYLIAAGVPADSVHAEGRGASDP 204
++ + S+V GYTDR+GSDAYN+ LS+ RA++V DYLI+ G+PAD + A G G S+P
Sbjct: 246 SNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQSVVDYLISKGIPADKISARGMGESNP 305

Query: 205 LV--QCDQ-RERAALIACLAPNRRVEVVAAG 232
+ CD ++RAALI CLAP+RRVE+ G
Sbjct: 306 VTGNTCDNVKQRAALIDCLAPDRRVEIEVKG 336


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL2063  PF03895  39     4e-06    Serum resistance protein DsrA.
		>PF03895#Serum resistance protein DsrA.

Length = 79

Score = 39.4 bits (92), Expect = 4e-06
Identities = 21/77 (27%), Positives = 40/77 (51%)

Query: 1014 VARAAYGGIAAATALTMIPEVDKDKTIAVGIGGGTYRGYQAVALGATARITENIKVRAGV 1073
+++ G+A +AL+M+ + + +V G YR A+A+G +RIT+ +AGV
Sbjct: 1 LSKELQTGLANQSALSMLVQPNGVGKTSVSAAVGGYRDKTALAIGVGSRITDRFTAKAGV 60

Query: 1074 GMSSGGTTAGIGASMQW 1090
++ GAS+ +
Sbjct: 61 AFNTYNGGMSYGASVGY 77


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
BPSL2065  HTHFIS  82     4e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 81.8 bits (202), Expect = 4e-20
Identities = 35/143 (24%), Positives = 61/143 (42%), Gaps = 4/143 (2%)

Query: 1 MSRQKVVLIYLIEDDEVQARCYAAILQHAGYSVRVLPDGERALREIQRAAPDLIVLDRRL 60
M+ +++ +DD L AGY VR+ + R I DL+V D +
Sbjct: 1 MTGATILVA---DDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVM 57

Query: 61 PDIDGLEIIAWVRERCAPLPILVLTNAVLETDLVEALEAGADDYLIKPPREREFVARV-N 119
PD + +++ +++ LP+LV++ ++A E GA DYL KP E + +
Sbjct: 58 PDENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117

Query: 120 ALRRRASISKQFEGTIEIGGYRI 142
AL + E + G +
Sbjct: 118 ALAEPKRRPSKLEDDSQDGMPLV 140


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
BPSL2068  HTHFIS  79     9e-19    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 78.7 bits (194), Expect = 9e-19
Identities = 34/117 (29%), Positives = 55/117 (47%), Gaps = 1/117 (0%)

Query: 1 MSAARKVLLVEDDEAQANWAKLVLTRGRFDVTHCQTGGQAIRAMTKEVPDAVVLDMRLPD 60
M+ A +L+ +DD A L+R +DV R + D VV D+ +PD
Sbjct: 1 MTGA-TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPD 59

Query: 61 VHGLEVLVWIRRNFFDVPVIVLSNAMQEMQIVEAFSAGADDYVLKPAREAEFLARIA 117
+ ++L I++ D+PV+V+S M ++A GA DY+ KP E + I
Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL2073  PF07675  31     0.011    Cleaved Adhesin
		>PF07675#Cleaved Adhesin

Length = 1358

Score = 31.2 bits (70), Expect = 0.011
Identities = 18/46 (39%), Positives = 25/46 (54%), Gaps = 3/46 (6%)

Query: 376 SYNVYRNGNKVGSS-TSTAYTDAGLIAGTAYSYTVTEIDPSLGESA 420
+Y +YRN ++ S T T Y D L G Y+Y V ++ GESA
Sbjct: 1260 TYTIYRNNTQIASGVTETTYRDPDLATGF-YTYGV-KVVYPNGESA 1303


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL2076  PRTACTNFAMLY  30     0.032    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 30.4 bits (68), Expect = 0.032
Identities = 19/63 (30%), Positives = 26/63 (41%), Gaps = 2/63 (3%)

Query: 213 RTEAPPRTASIVADLDALERFGWHDDAWLRARASLDLAHAPVSIYEVHPESWLRVAAEGN 272
+ RT + A L+A RF D +L +A L + A Y + LRV EG
Sbjct: 754 AVKGKYRTHGVGASLEAGRRFTHADGWFLEPQAELAVFRAGGGAYRA--ANGLRVRDEGG 811

Query: 273 RSA 275
S
Sbjct: 812 SSV 814


34  BPSL2224  BPSL2229  Y  N  N  Genomic Island
LocusTag   DNBias  CDNBias  %GCBias     Product
BPSL2224   0       11       3.152481    putative phosphoserine aminotransferase
BPSL2225   0       12       3.069830    putative membrane protein
BPSL2226   0       10       3.449499    hypothetical protein
BPSL2227   0       10       3.823852    putative transketolase
BPSL2228   1       9        4.038021    putative transketolase
BPSL2229   1       8        4.142574    hypothetical protein
35  BPSL2568  BPSL2584  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
BPSL2568   1       26       -3.103840   guanylate kinase
BPSL2569   2       28       -4.373340   conserved hypothetical protein
BPSL2570   3       29       -4.675755   ribonuclease PH
BPSL2571   3       26       -2.245726   conserved hypothetical protein
BPSL2572   2       26       -1.703364   putative coproporphyrinogen III oxidase family
BPSL2573   2       29       -1.598963   *hypothetical protein
BPSL2574   3       29       -2.979535   putative exported protein
BPSL2575   2       30       -3.042340   hypothetical protein
BPSL2576   2       30       -3.115258   hypothetical protein
BPSL2577   1       32       -4.182241   hypothetical protein
BPSL2578   2       38       -5.429171   hypothetical protein
BPSL2579   3       39       -6.248497   hypothetical protein
BPSL2580   2       46       -7.153052   hypothetical protein
BPSL2581   3       34       -5.485422   hypothetical protein
BPSL2582   3       35       -5.174766   hypothetical protein
BPSL2583   3       35       -4.181794   hypothetical protein
BPSL2584   2       28       -2.648957   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL2575  PHPHTRNFRASE  28     0.032    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 28.2 bits (63), Expect = 0.032
Identities = 13/74 (17%), Positives = 28/74 (37%), Gaps = 12/74 (16%)

Query: 54 LGKAYLAGE------TADTEQIDREIEKLEAALREARKTQEGAAAAAALLEAKATTLLHE 107
+ KA++ E + EIEKL AAL +++ A+ + ++ +
Sbjct: 16 IAKAFIHLEPNVDIEKTSITDVSTEIEKLTAALEKSK------EELRAIKDQTEASMGAD 69

Query: 108 EGALRQQQAALARD 121
+ + + D
Sbjct: 70 KAEIFAAHLLVLDD 83
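
In the island summary rows of this listing, the Prediction column follows the Bias and Virulence flags: rows with Bias = Y and Virulence = Y are labelled "Pathogenicity Island (biased-composition)", and rows with Bias = Y and Virulence = N are labelled "Genomic Island". A minimal Python sketch of that observed pattern; the function name `label_island` is hypothetical, and the rule is inferred only from the rows shown here, so PredictBias itself may apply further criteria:

```python
def label_island(bias, virulence):
    """Map Y/N flags to the label seen in the summary rows of this listing.

    Assumption: inferred from the rows shown here only. Flag combinations
    that do not occur in this listing return None instead of a guessed label.
    """
    if bias == "Y" and virulence == "Y":
        return "Pathogenicity Island (biased-composition)"
    if bias == "Y" and virulence == "N":
        return "Genomic Island"
    return None

# Flags taken from summary rows 34 and 35 of this listing.
print(label_island("Y", "N"))
print(label_island("Y", "Y"))
```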


36  BPSL2670  BPSL2682  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
BPSL2670   1       18       -3.737061   lipopolysaccharide heptosyltransferase-1
BPSL2671   2       29       -5.173099   phosphoglucomutase
BPSL2672   1       36       -6.308981   putative lipopolysaccharide biosynthesis
BPSL2673   2       42       -7.465938   putative glycosyl transferase
BPSL2674   1       40       -7.357771   putative glycosyl transferase
BPSL2675   2       42       -8.775537   UDP-glucose 4-epimerase
BPSL2676   2       41       -8.663779   putative undecaprenyl phosphate
BPSL2677   3       40       -9.426260   putative epimerase/dehydratase
BPSL2678   4       41       -8.959814   putative undecaprenyl phosphate
BPSL2679   4       42       -9.348939   putative epimerase/dehydratase
BPSL2680   4       38       -9.004077   putative glycosyl transferase
BPSL2681   2       29       -7.297770   putative glycosyl transferase
BPSL2682   1       23       -5.205920   putative O-antigen methyl transferase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL2670  NUCEPIMERASE  158    9e-48    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 158 bits (400), Expect = 9e-48
Identities = 84/349 (24%), Positives = 144/349 (41%), Gaps = 46/349 (13%)

Query: 6 TILVTGGAGYIGSHTAVELLAHGYDVVIADNLVNSKREAI--ARIEKITGKTPAFHETDV 63
LVTG AG+IG H + LL G+ VV DNL + ++ AR+E + FH+ D+
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 64 SDERALARIFDAHPITAAIHFAALKAVGESVAKPIEYYRNNLDSLLSLLRVMRERAVKRI 123
+D + +F + AV S+ P Y +NL L++L R ++ +
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121

Query: 124 VFSSSATVYGVPERSPIDE----TFPLSATNPYGQTKLMAEQILRDVEAADPSWRVAT-- 177
+++SS++VYG+ + P P+S Y TK E + + +
Sbjct: 122 LYASSSSVYGLNRKMPFSTDDSVDHPVS---LYAATKKANELM---AHTYSHLYGLPATG 175

Query: 178 LRYFNPVGAHESGLIGEDPAGIPNNLMPYVAQVAVGKLEKLRVFGSDYPTPDGTGVRDYI 237
LR+F G P G P ++ + A+ + + + V+ G RD+
Sbjct: 176 LRFFTVYG----------PWGRP-DMALFKFTKAMLEGKSIDVYN------YGKMKRDFT 218

Query: 238 HVVDLARGHIAALDALERRDASLTV---------------NLGTGRGYSVLEVVRAFEKA 282
++ D+A I D + D TV N+G +++ ++A E A
Sbjct: 219 YIDDIAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDA 278

Query: 283 SGRAVPYELVARRPGDVAECYANPAAAAETIGWKAERDLERMCADHWRW 331
G ++ +PGDV E A+ A E IG+ E ++ + W
Sbjct: 279 LGIEAKKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNW 327


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL2672  NUCEPIMERASE  72     8e-16    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 72.1 bits (177), Expect = 8e-16
Identities = 53/301 (17%), Positives = 108/301 (35%), Gaps = 50/301 (16%)

Query: 288 VMVTGAGGSIGSELCRQILKFQPAQLIAFD-LSEYAMYRLTEELRERFPDLPVVPIIGDA 346
+VTGA G IG + +++L+ Q++ D L++Y L + E D
Sbjct: 3 YLVTGAAGFIGFHVSKRLLE-AGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 347 KDSLLLDQVMSRYAPHIVFHAAAYKHVPLMEELNAWQALRNNVLGTYRVARAAIRHDVRH 406
D + + + VF + V E N +N+ G + + ++H
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLE-NPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 407 FVLIST---------------DKAVNPTNVMGASKRLAE-MACQALQQTSARTQFETV-- 448
+ S+ D +P ++ A+K+ E MA S
Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTY----SHLYGLPATGL 176

Query: 449 RFGNVLGSAGS---VIPKFQQQIAKGGPVTV-THPEITRFFMTIPEASQLVLQA------ 498
RF V G G + KF + + +G + V + ++ R F I + ++ +++
Sbjct: 177 RFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPH 236

Query: 499 ------------SSMGQGGEIFILDMGEPVKIVDLARDLIRLYGFTEEQIRIEFSGLRPG 546
++ ++ + PV+++D + L G + + L+PG
Sbjct: 237 ADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGI---EAKKNMLPLQPG 293

Query: 547 E 547
+
Sbjct: 294 D 294


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL2674  NUCEPIMERASE  107    6e-29    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 107 bits (268), Expect = 6e-29
Identities = 68/344 (19%), Positives = 130/344 (37%), Gaps = 42/344 (12%)

Query: 3 RVIVTGANGFVGRALCRALLAAGHEVTGL-------------VRRRGVCTEGVSEWVHEA 49
+ +VTGA GF+G + + LL AGH+V G+ R + G H+
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQ--FHKI 59

Query: 50 D--DFDGVADRWPAGLQVDAVVHLAARVHMMRDRSPDPDAAFRASNVAATMRVARAARQQ 107
D D +G+ D + +G + V R+ + S + A+ SN+ + + R
Sbjct: 60 DLADREGMTDLFASG-HFERVFISPHRLAV--RYSLENPHAYADSNLTGFLNILEGCRHN 116

Query: 108 GARRFVFLS--SVKAIAESDGGTPLCE-NSTPAPQDAYGRSKLEAERALEQLRDELSFDT 164
+ ++ S SV + P +S P Y +K E
Sbjct: 117 KIQHLLYASSSSVYGLNRKM---PFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPA 173

Query: 165 VIVRPPLVYGPGVRAN--FLSLMRAVSRGVPLPL-GAVRARRSMVYVDNLADAVMRCVTE 221
+R VYGP R + +A+ G + + + +R Y+D++A+A++R
Sbjct: 174 TGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDV 233

Query: 222 PAATNGCFHVADSDMPPTIAEL-LDDIGHHLGRPARLLPVPERLLRVAGALTGRAAQ--- 277
+ + V +IA + +IG+ P L+ ++ G A+
Sbjct: 234 IPHADTQWTVETGTPAASIAPYRVYNIGN--SSPVELM----DYIQALEDALGIEAKKNM 287

Query: 278 IDRLTSDLR---LDTTHIRTVLDWRPPRSSEEGLAETACWFKSL 318
+ D+ DT + V+ + P + ++G+ W++
Sbjct: 288 LPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDF 331


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL2679  NUCEPIMERASE  168    2e-51    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 168 bits (428), Expect = 2e-51
Identities = 83/363 (22%), Positives = 136/363 (37%), Gaps = 58/363 (15%)

Query: 13 KILVTGGAGFIGCAISERLAARASRYVVMDNLHPQIHASAVRPGALHEKAE----LVVAD 68
K LVTG AGFIG +S+RL + V +DNL+ + +++ L A+ D
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDY-YDVSLKQARLELLAQPGFQFHKID 60

Query: 69 VTDAGAWDALLSDFQPEIIIHLAAETGTGQSLTEASRHALVNVVGTTRLTDALVKHGIVV 128
+ D L + E + SL +A N+ G + + + +
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNK--I 118

Query: 129 EHILLTSSRAVYGEGAWQKDDGTIVYPGQRGRAQLEAAQWDFPGMTMLPSRADRTEPRPT 188
+H+L SS +VYG +P D + P
Sbjct: 119 QHLLYASSSSVYGLN------------------------------RKMPFSTDDSVDHPV 148

Query: 189 SVYGATKLAQEHVLRAWSLATKTPLSILRLQNVYGPGQSLTNSYTGIVALFSRLAREKKV 248
S+Y ATK A E + +S P + LR VYGP + F++ E K
Sbjct: 149 SLYAATKKANELMAHTYSHLYGLPATGLRFFTVYGPWGRPDMALF----KFTKAMLEGKS 204

Query: 249 IPLYEDGNVTRDFVSIDDVADAIVATLARTPEA-----------------LSLFDIGSGQ 291
I +Y G + RDF IDD+A+AI+ P A +++IG+
Sbjct: 205 IDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSS 264

Query: 292 ATSILDMARIIAAHYGAPEPQINGAFRDGDVRHAACDLSESLANLGWKPQWSLKRGIGEL 351
++D + + G + + GDV + D +G+ P+ ++K G+
Sbjct: 265 PVELMDYIQALEDALGIEAKKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNF 324

Query: 352 QTW 354
W
Sbjct: 325 VNW 327


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL2682  ABC2TRNSPORT  30     0.007    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 30.3 bits (68), Expect = 0.007
Identities = 16/59 (27%), Positives = 24/59 (40%)

Query: 195 LFTMVLMFLSPVFYPASALPEKYRFWLELNPLTLFIEQSRGILLEGRVPDFHPLGLAFL 253
L ++FLS +P LP ++ PL+ I+ R I+L V D A
Sbjct: 184 LVITPILFLSGAVFPVDQLPIVFQTAARFLPLSHSIDLIRPIMLGHPVVDVCQHVGALC 242


37  BPSL2694  BPSL2699  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
BPSL2694   4       22       -0.290678   dihydroorotase-like protein
BPSL2695   5       25       -1.945041   aspartate carbamoyltransferase
BPSL2696   4       23       -1.298754   bifunctional regulator/uracil
BPSL2697   4       24       -1.007049   conserved hypothetical protein
BPSL2698   6       24       -0.348227   conserved hypothetical protein
BPSL2699   5       21       -0.446876   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL2699  PYOCINKILLER  30     0.039    Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 30.1 bits (67), Expect = 0.039
Identities = 26/79 (32%), Positives = 35/79 (44%), Gaps = 2/79 (2%)

Query: 528 ANALSVANPAALTAAANTVAGTLARAANGTPVAGAIGGLVAALPVANPAGALTSAANNAA 587
A A A A AA A T A ANG+ VA A G + VA A +L A ++A
Sbjct: 228 AEAKRKAEEQARQQAAIRAANTYAMPANGSVVATAAGR--GLIQVAQGAASLAQAISDAI 285

Query: 588 STIATVAGTNPAAAIGGVA 606
+ + V + P+ G A
Sbjct: 286 AVLGRVLASAPSVMAVGFA 304


38  BPSL2716  BPSL2724  Y  N  N  Genomic Island
LocusTag   DNBias  CDNBias  %GCBias     Product
BPSL2716   -2      9        3.150009    LysE-family membrane transport protein
BPSL2717   0       8        3.277154    putative hydrolase
BPSL2718   0       8        3.712045    putative membrane transport protein
BPSL2719   3       9        4.405337    putative membrane protein
BPSL2720   3       9        4.625587    putative lipoprotein
BPSL2721   2       9        4.746196    probable exported hydrolase
BPSL2722   1       9        3.717280    putative lipoprotein
BPSL2723   0       10       3.670355    putative exported protein
BPSL2724   1       10       3.563356    putative isomerase
39  BPSL2790  BPSL2808  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
BPSL2790   5       39       -6.721322   putative acyl-CoA transferase
BPSL2791   4       35       -7.183531   UDP-3-O-[3-hydroxymyristoyl] N-acetylglucosamine
BPSL2792   5       36       -8.802191   capsular polysaccharide biosynthesis fatty acid
BPSL2793   4       37       -9.053632   putative capsular polysaccharide biosynthesis
BPSL2794   4       39       -10.369836  putative capsular polysaccharide biosynthesis
BPSL2795   4       40       -10.803813  putative capsule polysaccharide
BPSL2796   4       41       -11.031564  putative D-glycero-d-manno-heptose
BPSL2797   3       44       -11.463142  putative D-glycero-d-manno-heptose 1-phosphate
BPSL2798   4       49       -12.199318  putative sedoheptulose 7-phosphate isomerase
BPSL2799   5       51       -12.885003  putative sugar kinase
BPSL2800   5       51       -12.786087  putative GDP sugar epimerase/dehydratase
BPSL2801   6       53       -12.932472  putative capsular polysaccharide biosynthesis
BPSL2802   6       54       -13.287489  putative capsular polysaccharide biosynthesis
BPSL2803   7       52       -12.249347  putative glycosyl transferase
BPSL2804   7       47       -10.776776  putative capsular polysaccharide biosynthesis
BPSL2805   6       41       -7.570203   putative capsule polysaccharide biosynthesis
BPSL2806   4       36       -6.301066   putative glycosyltransferase
BPSL2806a  2       31       -4.938237   putative ATP-binding ABC transporter capsular
BPSL2807   1       27       -4.550137   putative capsular polysaccharide export ABC
BPSL2808   0       24       -4.147673   putative capsule polysaccharide export ABC
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL2797  NUCEPIMERASE  129    4e-37    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 129 bits (326), Expect = 4e-37
Identities = 80/352 (22%), Positives = 136/352 (38%), Gaps = 52/352 (14%)

Query: 4 RVLITGITGMVGSHLADFLLENTDWEIYGLCRWRSPLDNV-SHLLPRINEKNRIRL---- 58
+ L+TG G +G H++ LLE ++ G+ DN+ + + + L
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGH-QVVGI-------DNLNDYYDVSLKQARLELLAQPG 53

Query: 59 ---VYGDLRDYLSIHEAVKQSTPDFVFHLAAQSYPKTSFDSPLDTLETNVQGTANVLEAL 115
DL D + + + VF + + S ++P ++N+ G N+LE
Sbjct: 54 FQFHKIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGC 113

Query: 116 RKNNIDAVTHVCASSEVFGRVPREKLPIDEE-CTFHPASPYAISKVGTDLIGRYYAEAYN 174
R N I + +SS V+G K+P + HP S YA +K +L+ Y+ Y
Sbjct: 114 RHNKIQHLL-YASSSSVYGL--NRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYG 170

Query: 175 MTVMTTRMFTHTGPR-RGDVFAESTFAKQIAMIERGLIPPVVKTGNLDSLRTFADVRDAV 233
+ R FT GP R D+ A F K + G V G + R F + D
Sbjct: 171 LPATGLRFFTVYGPWGRPDM-ALFKFTKAML---EGKSIDVYNYGKM--KRDFTYIDDIA 224

Query: 234 RAYYMLVTINPI-----------------PGAYYNIGGTYSCTVGQMLDTLISMSTSKDV 276
A L + P P YNIG + + + L +D
Sbjct: 225 EAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQAL------EDA 278

Query: 277 IRVETDPE--RLRPIDADLQVPNTRKFEAVTGWKPEISFEKTMEDLLNYWRA 326
+ +E L+P D +T+ V G+ PE + + +++ +N++R
Sbjct: 279 LGIEAKKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRD 330


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL2798  NUCEPIMERASE  45     1e-07    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 44.8 bits (106), Expect = 1e-07
Identities = 59/332 (17%), Positives = 105/332 (31%), Gaps = 82/332 (24%)

Query: 1 MKVFLVGSTGYIGKTLFDA-CSRRWRTLGT-STRDGADIVFSLARAEAFPYEQVSA--GD 56
MK + G+ G+IG + + +G + D D+ AR E D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 57 ------------------VVAVAA------AISSPDACAKDYETAFQVNVTGTLTLIRGV 92
V ++ +P A Y N+TG L ++ G
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHA----Y---ADSNLTGFLNILEG- 112

Query: 93 VARGA---RVIFFSSDTVYGASEQLLSEEAELT--PAGAYGAMKRRVEA---ELGENAAV 144
R +++ SS +VYG + ++ + P Y A K+ E +
Sbjct: 113 -CRHNKIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGL 171

Query: 145 KVIRLSY--VFSLRDR-------FTQYLLGCAKEGKRADIFK--PFSRCVVYLSDVVEGV 193
L + V+ R FT+ +L EGK D++ R Y+ D+ E +
Sbjct: 172 PATGLRFFTVYGPWGRPDMALFKFTKAML----EGKSIDVYNYGKMKRDFTYIDDIAEAI 227

Query: 194 VSLIE-------RWD---------AIDERVINFVGPELVAREDFVEKIRNLAAPELDYGF 237
+ L + +W RV N V D+++ + + E
Sbjct: 228 IRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNM 287

Query: 238 SEP-EGDFFVNRPRIINVSSARFEKLLGRRPR 268
GD + + +++G P
Sbjct: 288 LPLQPGDVLETSA---DTKALY--EVIGFTPE 314


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL2799  PF05043  31     0.005    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 31.1 bits (70), Expect = 0.005
Identities = 18/76 (23%), Positives = 32/76 (42%), Gaps = 7/76 (9%)

Query: 140 RHLEEIGASLRIDIDE---IESWCVDELKTREVGENDGGKQIDISVTDFILANCRQKRLF 196
H + + +L +E W EL + + DI +++FI+ KRL
Sbjct: 414 YHAKFVAETLSYYCSNNFELEVWTELELSKESLED----SPYDIIISNFIIPPIENKRLI 469

Query: 197 YTMNHPTAALMREIAA 212
Y+ N T +L+ + A
Sbjct: 470 YSNNINTVSLIYLLNA 485


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL2805  ABC2TRNSPORT  38     2e-05    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 38.0 bits (88), Expect = 2e-05
Identities = 32/139 (23%), Positives = 58/139 (41%), Gaps = 7/139 (5%)

Query: 88 MAVTPNLALMYHRNVKVIDIFIARILLEVVGNTASFFVLMITFHALGLVDYPEDILEVMF 147
M M + +++ DI + + + + + ALG + +++
Sbjct: 94 MEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAGIGVVAAALGYTQWLS----LLY 149

Query: 148 AWVMIIWFG---ASLGFIIGALSEKTELVEKLWHPVTYLMFPLSGAIFMVDWLSPAFQKI 204
A +I G ASLG ++ AL+ + V + LSGA+F VD L FQ
Sbjct: 150 ALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQTA 209

Query: 205 VLWLPMVHGVEMLREGYFG 223
+LP+ H ++++R G
Sbjct: 210 ARFLPLSHSIDLIRPIMLG 228


40  BPSL2826  BPSL2834  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
BPSL2826   3       13       -0.490923   putative
BPSL2827   2       13       -0.492943   putative deoxyguanosine kinase/deoxyadenosine
BPSL2828   -3      12       0.627776    putative 3-methyl-2-oxobutanoate
BPSL2829   -1      11       1.165221    conserved hypothetical protein
BPSL2830   0       13       2.306016    putative DnaJ chaperone protein
BPSL2831   0       14       3.171779    putative DnaK chaperone protein
BPSL2832   1       14       3.324493    putative heat shock protein
BPSL2833   0       15       3.518586    putative heat shock protein 15
BPSL2834   0       15       3.391668    putative ferrochelatase protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL2827  SHAPEPROTEIN  135    3e-37    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 135 bits (342), Expect = 3e-37
Identities = 81/382 (21%), Positives = 138/382 (36%), Gaps = 71/382 (18%)

Query: 5 IGIDLGTTNSCVAIMEGNQVKVIENSEGARTTPSIIAYMDDNEVL-VGAPAKRQSVTNPK 63
+ IDLGT N+ + + V + R V VG AK+ P
Sbjct: 13 LSIDLGTANTLIYVKGQGIVLNEPSVVAIRQD----RAGSPKSVAAVGHDAKQMLGRTPG 68

Query: 64 NTLFAVKRLIGRRFEEKEVQKDIGLMPYAIIKADNGDAWVEAHGEKLAPPQVSAEVLRK- 122
N + A++ + + V D V+ ++L+
Sbjct: 69 N-IAAIRPM------KDGVIADF---------------------------FVTEKMLQHF 94

Query: 123 MKKTAEDYLGEPVTEAVITVPAYFNDSQRQATKDAGRIAGLEVKRIINEPTAAALAFGLD 182
+K+ + P ++ VP +R+A +++ + AG +I EP AAA+ GL
Sbjct: 95 IKQVHSNSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLP 154

Query: 183 KAEKGDRKIAVYDLGGGTFDVSIIEIADVDGEMQFEVLSTNGDTFLGGEDFDQRIIDYII 242
+E V D+GGGT +V++I + V + +GG+ FD+ II+Y+
Sbjct: 155 VSE--ATGSMVVDIGGGTTEVAVISLNGV---------VYSSSVRIGGDRFDEAIINYVR 203

Query: 243 GEFKKEQGVDLSKDVLALQRLKEAAEKAKIELSSS----QQTEINLPYITADASGPKHLN 298
+ G + AE+ K E+ S+ + EI + P+
Sbjct: 204 RNYGSLIG-------------EATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVPRGFT 250

Query: 299 LKVTRAKLEALVEDLVERTIEPCRTAIKDAGVKVSDIDD--VILVGGQTRMPKVQEKVKE 356
L + LEAL E L + SDI + ++L GG + + + E
Sbjct: 251 LN-SNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGALLRNLDRLLME 309

Query: 357 FFGKEPRRDVNPDEAVAVGAAI 378
G +P VA G
Sbjct: 310 ETGIPVVVAEDPLTCVARGGGK 331


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
BPSL2829  IGASERPTASE  31     0.002    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 31.2 bits (70), Expect = 0.002
Identities = 16/77 (20%), Positives = 24/77 (31%), Gaps = 8/77 (10%)

Query: 2 ENTQENPTDQTTEETGREAQAAEPAAQAAENAAPAAEAA--------LAEAQAKIAELQE 53
T E TE T + + A+ A + E A + K E
Sbjct: 1048 SKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVE 1107

Query: 54 SFLRAKAETENVRRRAQ 70
+AK ETE + +
Sbjct: 1108 KEEKAKVETEKTQEVPK 1124


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL2833  TCRTETB  29     0.035    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 28.7 bits (64), Expect = 0.035
Identities = 19/89 (21%), Positives = 35/89 (39%), Gaps = 10/89 (11%)

Query: 25 LASLAACIAKRGFEVVFEADTAQAIGSAGYPALTP---AEIGARADVAVVLGGDGTMLGM 81
S+ + F ++ A Q G+A +PAL A + + G G+++ M
Sbjct: 91 FGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAM 150

Query: 82 GRQLAPYKTPLIG---INHGRLGFITDIP 107
G + P IG ++ ++ IP
Sbjct: 151 GEGVG----PAIGGMIAHYIHWSYLLLIP 175


41  BPSL2848  BPSL2857  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
BPSL2848   2       14       2.113744    putative glycolate oxidase iron-sulfur subunit
BPSL2849   2       14       1.910824    conserved hypothetical protein
BPSL2850   2       14       3.001523    putative pyrroline-5-carboxylate reductase
BPSL2851   2       16       3.196459    putative phosphonate transport protein PhnE
BPSL2852   2       15       3.140835    putative phosphonates-binding periplasmic
BPSL2853   1       13       2.496842    putative phosphonates transport ATP-binding
BPSL2854   0       12       3.860078    putative phosphonate metabolism PhnM protein
BPSL2855   1       13       4.467456    putative phosphonates transport ATP-binding
BPSL2856   3       11       3.334590    putative phosphonates transport ATP-binding
BPSL2857   1       13       3.033416    putative phosphonate metabolism PhnJ protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL2852  PF05272  30     0.007    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.4 bits (68), Expect = 0.007
Identities = 21/68 (30%), Positives = 28/68 (41%), Gaps = 1/68 (1%)

Query: 61 CVALTGPSGAGKSTLLRCLYGNYLANRGTIAVRAGARAAEHVV-LTASEPHEVIALRRDV 119
V L G G GKSTL+ L G + + G + E + + A E E+ A RR
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIVAYELSEMTAFRRAD 657

Query: 120 IGYVSQFL 127
V F
Sbjct: 658 AEAVKAFF 665


42  BPSL2916  BPSL2921  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
BPSL2916   2       21       -4.435659   50S ribosomal protein L13
BPSL2917   0       17       -4.274241   putative OsmC-like protein
BPSL2918   1       12       -5.176392   conserved hypothetical protein
BPSL2919   2       11       -5.815929   dihydroorotase
BPSL2920   1       13       -4.895139   family C44 non-peptidase homologue
BPSL2921   0       13       -3.558000   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL2920  ACRIFLAVINRP  25     0.040    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 25.2 bits (55), Expect = 0.040
Identities = 9/38 (23%), Positives = 21/38 (55%), Gaps = 1/38 (2%)

Query: 14 IEIDDVIVGLLAI-RLNLPENADPRDAISRHLSEAGGP 50
+ +DD IV + + R+ + + P++A + +S+ G
Sbjct: 404 LLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGA 441


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL2921  PF05272  32     0.002    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 32.4 bits (73), Expect = 0.002
Identities = 17/53 (32%), Positives = 24/53 (45%), Gaps = 5/53 (9%)

Query: 29 VVVVCGPSGSGKSTLIKTVNGLEPFQQGEILVNGQSVGDKKTNLSKLRSKVGM 81
VV+ G G GKSTLI T+ GL+ F +G K + ++ V
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHF-----DIGTGKDSYEQIAGIVAY 645


43  BPSL2960  BPSL2968  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
BPSL2960   2       13       2.797861    Glyoxalase/Bleomycin resistance
BPSL2961   4       14       1.186133    conserved hypothetical protein
BPSL2962   4       13       1.637163    conserved hypothetical protein
BPSL2963   3       12       1.204686    putative exported ribonuclease
BPSL2964   3       14       1.666753    NADP-dependent malic enzyme
BPSL2965   3       14       1.537268    putative thiamine-monophosphate kinase
BPSL2966   2       14       2.344536    putative phosphatidylglycerophosphatase
BPSL2967   2       15       2.977874    putative competence-damaged related protein
BPSL2968   0       14       3.213892    putative orotidine 5'-phosphate decarboxylase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL2965  DHBDHDRGNASE  123    3e-36    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 123 bits (310), Expect = 3e-36
Identities = 76/249 (30%), Positives = 113/249 (45%), Gaps = 8/249 (3%)

Query: 26 GRAVLITGGATGIGASFVEHFARQGARVAFVDLDEKAGRALVARLADAAHEPVFVVCDLT 85
G+ ITG A GIG + A QGA +A VD + + +V+ L A D+
Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVR 67

Query: 86 DIGALRGAIDAIRVRIGPIAVLVNNAANDVRHAVADVTPESFDASIAVNLRHQFFAAQAV 145
D A+ I +GPI +LVN A + ++ E ++A+ +VN F A+++V
Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 146 IDDMKRLGGGAIVNLGSIGWMLKNAGYPVYATAKAAVQGLTRALARELGPFGIRVNTLVP 205
M G+IV +GS + YA++KAA T+ L EL + IR N + P
Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187

Query: 206 GWVMTDKQRRLWLDDAGRAAIKAGQCIDAEL--------LPGDLARMALFLAADDSRLIT 257
G TD Q LW D+ G + G + P D+A LFL + + IT
Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHIT 247

Query: 258 AQDVVVDGG 266
++ VDGG
Sbjct: 248 MHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL2967  PF06438  29     0.028    Heme acquisition protein HasAp
		>PF06438#Heme acquisition protein HasAp

Length = 205

Score = 29.1 bits (65), Expect = 0.028
Identities = 20/97 (20%), Positives = 31/97 (31%), Gaps = 12/97 (12%)

Query: 4 NGGDVAAALRFDNIGKVFPGVRALDGISFDVQAGQVH----GLMGENGAGKSTLLKILGG 59
GG + D+ F + LD + G VH GLM + + + L
Sbjct: 99 TGGASSGGYALDSQEVSFSNL-GLDSPIAQGRDGTVHKVVYGLMSGDSSALQGQIDALLK 157

Query: 60 EYQP-----DSGSVLVDGRAMRFPSAAASIAAGVAVI 91
P + L + A+ AA V V+
Sbjct: 158 AVDPSLSINSTFDQLAAAGVAH--ATPAAAAAEVGVV 192


44  BPSL3032  BPSL3038  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
BPSL3032   -1      14       -3.154426   UDP-N-acetylmuramoylalanine--D-glutamate ligase
BPSL3033   5       25       -5.366471   phospho-N-acetylmuramoyl-pentapeptide-
BPSL3034   5       23       -5.818915   UDP-N-acetylmuramoylalanyl-D-glutamyl-2,
BPSL3035   3       20       -4.845768   UDP-N-acetylmuramoylalanyl-D-glutamate--2,
BPSL3036   2       16       -3.729089   peptidoglycan synthetase FtsI
BPSL3037   0       18       -3.610716   cell division protein FtsL
BPSL3038   1       24       -3.632843   S-adenosyl-methyltransferase MraW
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL3032  56KDTSANTIGN  27     0.017    Rickettsia 56kDa type-specific antigen protein sign...
		>56KDTSANTIGN#Rickettsia 56kDa type-specific antigen protein signature.

Length = 533

Score = 26.8 bits (59), Expect = 0.017
Identities = 18/58 (31%), Positives = 28/58 (48%), Gaps = 7/58 (12%)

Query: 24 NQQRQIFIQLQRAQSQEHQLQQDYAQLQYQQSA-------LSKTSRIEQLATSSLKMQ 74
NQ F+ +AQ Q+ Q QQ AQ Q++ L+ + +I QL +K+Q
Sbjct: 328 NQIHLNFVMPPQAQQQQGQGQQQQAQATAQEAVAAAAVRLLNGSDQIAQLYKDLVKLQ 385


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL3036  ECOLNEIPORIN  88     1e-21    E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 88.3 bits (219), Expect = 1e-21
Identities = 90/394 (22%), Positives = 139/394 (35%), Gaps = 71/394 (18%)

Query: 1 MKKSLLALVALSAFAGAAHAQSSVTLYGIIDEGFNINTNAGGKHL-----YNLSSGVMQG 55
MKKSL+AL L+A AA A VTLYG I G + + + V G
Sbjct: 1 MKKSLIALT-LAALPVAAMAD--VTLYGTIKAGVETSRSVAHNGAQAASVETGTGIVDLG 57

Query: 56 SRWGLRGTEDLGGGLKALFVLENGFDVNSGKLNQGGLEFGRQAYVGLSSGFGTVTLGRQY 115
S+ G +G EDLG GLKA++ +E + G RQ+++GL GFG + +GR
Sbjct: 58 SKIGFKGQEDLGNGLKAIWQVEQKASIAGTDSGWGN----RQSFIGLKGGFGKLRVGRLN 113

Query: 116 DSVVDF--VGPLEA-GDQWGGYIAAHPGDLDNFNNAYRVNNAVKFTSANYGGFTFGGLYS 172
+ D + P ++ D G A P + + V++ S + G + Y+
Sbjct: 114 SVLKDTGDINPWDSKSDYLGVNKIAEP---EARLIS------VRYDSPEFAGLSGSVQYA 164

Query: 173 FGGVAGDFSRNQTWSLGAGYTNGPLVLGVGYLNARTPSTAGGLFGNNTTSSTPAAVTTPV 232
AG ++++ G Y NG + G R + +
Sbjct: 165 LNDNAG-RHNSESYHAGFNYKNGGFFVQYGGAYKRHHQVQENVNIEKYQIHR-------L 216

Query: 233 YAGYASAHTYQVIGAGGAYSFGAATVGITYSNIKFMNFASTVFPNQTATFNNAEINFKYQ 292
+GY + Y A A A + + + T + N +
Sbjct: 217 VSGYDNDALY----ASVAVQQQDAKL-VEENYSHNSQTEVAA----TLAYRFG--NVTPR 265

Query: 293 LTPTLLAGAAYDYTQGSKIAGSSAAKYHQGSVGVDYFLSKRTDVYAIGVYQHASGNVIEA 352
++ ++D T + Y Q VG +Y SKRT + E
Sbjct: 266 VSYAHGFKGSFDATNYNND-------YDQVVVGAEYDFSKRTSALVSAGWLQ------EG 312

Query: 353 DGNTVGPATAAINGLTPSSNRNQFAARVGIRHKF 386
G S A VG+RHKF
Sbjct: 313 KG---------------ESKFVSTAGGVGLRHKF 331


45  BPSL3102  BPSL3121  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
BPSL3102   2       11       -2.456308   putative bacteriocin secretion protein
BPSL3103   2       10       -2.707823   colicin V processing peptidase
BPSL3104   3       13       -3.609264   putative outer membrane bacteriocin efflux
BPSL3105   2       12       -3.101512   putative membrane protein
BPSL3106   3       11       -2.935036   putative bacteriophage-related peptidase
BPSL3107   3       12       -2.974064   putative membrane protein
BPSL3108   3       21       -5.349498   conserved hypothetical protein
BPSL3109   5       34       -8.916750   putative outer membrane protein
BPSL3110   6       36       -9.777952   conserved hypothetical protein
BPSL3111   10      47       -11.624939  protease associated ATPase ClpB
BPSL3113   11      55       -13.475434  conserved hypothetical protein
BPSL3114   11      52       -12.587965  conserved hypothetical protein
BPSL3115   9       44       -11.503785  conserved hypothetical protein
BPSL3116   7       35       -10.067726  conserved hypothetical protein
BPSL3117   5       27       -9.064632   conserved hypothetical protein
BPSL3118   4       25       -8.779608   conserved hypothetical protein
BPSL3119   1       13       -5.457054   putative lipoprotein
BPSL3120   0       9        -4.763979   putative lipoprotein
BPSL3121   0       8        -4.275355   conserved hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
BPSL3108  IGASERPTASE  30     0.011    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 29.6 bits (66), Expect = 0.011
Identities = 30/204 (14%), Positives = 62/204 (30%), Gaps = 10/204 (4%)

Query: 7 AKLSGVVLACGIIAGCASQPTPPTTEAFNKSLADADAVAKTGDQERAIGLYQQLAKSDPT 66
K + V I Q P+ + N+ +A D A ++ +
Sbjct: 986 EKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVP-PPAPATPSETTETVAENS 1044

Query: 67 REEPWSRIAQIQFQQGHYGQAIVAAQEALQRDKTDRQAKSVLAVAGLRIATESLGELRQD 126
++E + Q Q A+EA K + Q VA T+
Sbjct: 1045 KQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQT---NEVAQSGSETKETQTTETK 1101

Query: 127 SSLAGDAKSDAQALAKQLRDTLGEAALFPPEQQATKPVVKKRRIVRRAKPVHEAPRAAES 186
+ + + A+ ++ ++ + P+Q+ + + +A+P E
Sbjct: 1102 ETATVEKEEKAKVETEKTQEVPKVTSQVSPKQE------QSETVQPQAEPARENDPTVNI 1155

Query: 187 ETAAAPATPPAAPAQPAATPAPAP 210
+ + A QPA +
Sbjct: 1156 KEPQSQTNTTADTEQPAKETSSNV 1179



Score = 28.5 bits (63), Expect = 0.029
Identities = 32/191 (16%), Positives = 62/191 (32%), Gaps = 32/191 (16%)

Query: 23 ASQPTPPTTEAFNKSLADADAVAKTGDQERAIGLYQQLAKSDPTREEPWSRIAQIQFQQG 82
+ T P N AD +V ++ I + P P +
Sbjct: 994 TTNITTP-----NNIQADVPSVPSNNEE---IARVDEAPVPPPAPATPSETTETV----- 1040

Query: 83 HYGQAIVAAQEALQRDKTDRQAKSVLAVAGLRIATESLGELRQDSSLAGDAKSDAQALAK 142
A + QE+ +K ++ A A +A E+ ++ ++ A+S ++
Sbjct: 1041 ----AENSKQESKTVEKNEQDATETTAQNR-EVAKEAKSNVKANTQTNEVAQSGSETKET 1095

Query: 143 QLRDTLGEAALFPPEQQATKPVVKKRRIVRRAKPVHEAPRAAESETAAAPATPPAAPAQP 202
Q +T ++ AT +K ++ E P+ + +P + QP
Sbjct: 1096 QTTET---------KETATVEKEEKAKVETEKT--QEVPKVT---SQVSPKQEQSETVQP 1141

Query: 203 AATPAPAPAKA 213
A PA
Sbjct: 1142 QAEPARENDPT 1152



Score = 28.1 bits (62), Expect = 0.038
Identities = 22/130 (16%), Positives = 41/130 (31%), Gaps = 22/130 (16%)

Query: 87 AIVAAQEALQRDKTDRQAKSVLAVAGLRIATESLGELRQDSSLAGDAKSDAQALAKQLRD 146
I A ++ + + V AT S ++A ++K +++ + K
Sbjct: 1002 NIQADVPSVPSNNEEIARVDEAPVPPPAPATPS----ETTETVAENSKQESKTVEKN--- 1054

Query: 147 TLGEAALFPPEQQATKPVVKKRRIVRRAKP-VHEAPRAAESETAAAPATPPAAPAQPAAT 205
EQ AT+ + R + + AK V + E A + Q T
Sbjct: 1055 ----------EQDATETTAQNREVAKEAKSNVKANTQTNE----VAQSGSETKETQTTET 1100

Query: 206 PAPAPAKAAG 215
A +
Sbjct: 1101 KETATVEKEE 1110


46  BPSL3250  BPSL3274  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
BPSL3250   -1      10       3.184956    putative outer membrane protein
BPSL3251   -1      9        2.754112    conserved hypothetical protein
BPSL3252   0       12       0.422637    NADP-dependent malic enzyme
BPSL3253   -1      11       -0.938188   orotate phosphoribosyltransferase
BPSL3254   2       22       -4.009239   putative lipoprotein
BPSL3254A  6       44       -12.621464  putative flavoprotein (regulator)
BPSL3254B  5       43       -12.732075  N-acetyl-gamma-glutamyl-phosphate reductase
BPSL3255   6       45       -13.518513  putative lipoprotein
BPSL3256   7       46       -14.052254  putative lipoprotein
BPSL3257   10      57       -15.367042  putative outer membrane protein
BPSL3258   10      54       -15.480164  putative LysR-family transcriptional regulator
BPSL3259   10      44       -8.373700   putative inner membrane transport protein
BPSL3260   10      43       -8.360169   putative amino acid transport protein
BPSL3261   10      43       -8.539562   putative oxidoreductase
BPSL3262   9       45       -8.731795   conserved hypothetical protein
BPSL3263   10      50       -8.954949   putative membrane protein
BPSL3264   10      52       -9.070479   putative exported protein
BPSL3265   9       56       -10.681625  putative membrane protein
BPSL3266   9       53       -9.792398   putative amino acid permease
BPSL3267   7       43       -7.694101   putative plasmid recombinase
BPSL3268   4       35       -5.867144   conserved hypothetical protein
BPSL3269   2       25       -3.421838   putative plasmid conjugal transfer protein
BPSL3270   -1      16       -0.661800   conserved hypothetical protein
BPSL3271   -2      11       1.696167    hypothetical protein
BPSL3272   0       10       1.396433    putative plasmid conjugal transfer protein
BPSL3273   0       9        1.185590    putative plasmid conjugal transfer protein
BPSL3274   2       11       0.310956    putative plasmid conjugal transfer protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
BPSL3258  GPOSANCHOR  35     0.002    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 34.7 bits (79), Expect = 0.002
Identities = 50/368 (13%), Positives = 122/368 (33%), Gaps = 53/368 (14%)

Query: 121 SKNKEIEVEIDKLEDELGSVENKTGLRHSYQEKKKAHADKKKAAQDAQDELDKKIFDKAN 180
+ +++ L ++ ++ + ++ + + A L+ + A
Sbjct: 96 NAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAA 155

Query: 181 KNPTGIKHNSLYKDANYDVR----KLKLDIKTVQDKKIAPLDDADRSSKVKLLSEVALPD 236
+ K + + L+ + ++ ++ + + +
Sbjct: 156 RKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKT 215

Query: 237 IEKRLSFNSSLPALAKSTVELITKKIKPSAPIQELLNDAILQNWVKSGIPLHEATRETCG 296
+E + +L A + + + S + +
Sbjct: 216 LEAEKA---ALAARKADLEKALEGAMNFSTADSAKIKTLEAEK----------------- 255

Query: 297 FCGSPLPADLWKRLGDHFNQESQDLEKDLDSVLTKISNEKTRIKNVVTVDRKNFYSANQA 356
+LEK L+ + + + +IK + A +A
Sbjct: 256 ---------------AALEARQAELEKALEGAMNFSTADSAKIKTL---------EAEKA 291

Query: 357 TFDNLKADLDDAINNHEKSLQSLEDELNAR---KKDIFTERSTIDVQDNTVSISQKVSLV 413
+ KADL+ + QSL +L+A KK + E ++ Q N +S + + SL
Sbjct: 292 ALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQ-NKISEASRQSLR 350

Query: 414 NELIERNNKTTTSLEEDQKIARNELRLSEISQFTIDIDLAGEEKKIKALEDQITKAKDEL 473
+L + + + LE + + + ++SE S+ ++ DL + K +E + +A +L
Sbjct: 351 RDL-DASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQVEKALEEANSKL 409

Query: 474 DAVEAEGK 481
A+E K
Sbjct: 410 AALEKLNK 417



Score = 32.3 bits (73), Expect = 0.008
Identities = 40/364 (10%), Positives = 95/364 (26%), Gaps = 43/364 (11%)

Query: 127 EVEIDKLEDELGSVENKTGLRHSYQEKKKAHADKKKAAQDAQDELDKKIFDKANKNPTGI 186
++K+++ E + + K D E +K KN +
Sbjct: 49 TDTLEKVQERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSL 108

Query: 187 KHNSLYKDANYDVRKLKLDIKTVQDKKIAPLDDADRSSKVKLLSEVALPDIEKRLSFNSS 246
+ + ++ + + + ++ +
Sbjct: 109 SEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAAL-----AARKADLEKA 163

Query: 247 LPALAKSTVELITKKIKPSAPIQELLNDAILQNWVKSGIPLHEATRETCGFCGSPLPADL 306
L + K A L E + L
Sbjct: 164 LEGAMNFSTADSAKIKTLEAEKAALEARQ------------AELEKA------------L 199

Query: 307 WKRLGDHFNQESQDLEKDLDSVLTKISNEKTRIKNVVTVDRKNFYSANQATFDNLKADLD 366
+ ++ + + ++ SA T + KA L+
Sbjct: 200 EGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALE 259

Query: 367 DAINNHEKSLQSLEDELNARKKDIFTERSTIDVQDNTVSISQKVSLVNELIERNNKTTT- 425
EK+L+ + A I T + + + ++++ N ++
Sbjct: 260 ARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAE---KADLEHQSQVLNANRQSLRR 316

Query: 426 ----------SLEEDQKIARNELRLSEISQFTIDIDLAGEEKKIKALEDQITKAKDELDA 475
LE + + + ++SE S+ ++ DL + K LE + K +++
Sbjct: 317 DLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKI 376

Query: 476 VEAE 479
EA
Sbjct: 377 SEAS 380


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL3262  cloacin  35     5e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 35.5 bits (81), Expect = 5e-04
Identities = 29/107 (27%), Positives = 44/107 (41%), Gaps = 2/107 (1%)

Query: 302 GAAASAGSAVMAGATSAAGGASALKAAFQSAQQHAAQGAGNFSGSGGSGSGAIGGSSGGA 361
A S + G T G A + S++ + G G GSG G+ GG
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG--HGNGGGN 68

Query: 362 SSSSGGSGSFGSFMSNAGRVAADMGSSLANGASQVAKEKAASMMDSA 408
+S GGSG+ G+ + A VA + GA +A +A + +A
Sbjct: 69 GNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAA 115


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
BPSL3263  TYPE4SSCAGX  28     0.038    Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein signature.

Length = 522

Score = 28.2 bits (62), Expect = 0.038
Identities = 26/122 (21%), Positives = 52/122 (42%), Gaps = 4/122 (3%)

Query: 134 DAQRQALADAQVNGSEAQKRANDAVLKGVDQQQQTLVTDAANLRSLQAQASSAQGQMQAI 193
+ Q++AL + +AQK D +++++ + ANL +L S+ Q
Sbjct: 142 EEQKKALEKEKEAKEQAQKAQKDKR----EKRKEERAKNRANLENLTNAMSNPQNLSNNK 197

Query: 194 QAANQLASAQTNQLLQLRGLLVAQQAAAATRAQIVADREAQQAAAGVQLRDGSNVTHSAP 253
+ + + N+L Q+ L Q+ A A + + + +QA V+ R ++
Sbjct: 198 NLSELIKQQRENELDQMERLEDMQEQAQANALKQIEELNKKQAEEAVRQRAKDKISIKTD 257

Query: 254 KS 255
KS
Sbjct: 258 KS 259


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL3271  SURFACELAYER  26     0.049    Lactobacillus surface layer protein signature.
		>SURFACELAYER#Lactobacillus surface layer protein signature.

Length = 439

Score = 25.8 bits (56), Expect = 0.049
Identities = 21/64 (32%), Positives = 28/64 (43%), Gaps = 3/64 (4%)

Query: 3 RTMSAAAAAMAVVSCAMAAAPAAHADAGDGLKVARSNACMGCHAVDRKLVGPSFQQIAER 62
R +SAAAAA+ V+ A A +A A + + VD V PS IA
Sbjct: 6 RIVSAAAAALLAVAPIAATAMPVNAATTINADSAINANTNAKYDVD---VTPSISAIAAV 62

Query: 63 YKND 66
K+D
Sbjct: 63 AKSD 66


47  BPSL3337  BPSL3349  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
BPSL3337  1       23       -5.758855   putative short chain dehydrogenase
BPSL3338  1       28       -7.028251   putative acetyltransferase
BPSL3339  6       44       -10.508550  Rieske [2Fe-2S] domain protein
BPSL3340  9       54       -12.782300  hypothetical protein
BPSL3341  9       58       -13.654154  putative outer membrane protein
BPSL3342  6       56       -12.278110  putative amino acid efflux protein
BPSL3343  7       47       -9.259903   acid phosphatase
BPSL3344  7       48       -9.159462   putative lipoprotein
BPSL3345  9       47       -8.291570   putative lipoprotein
BPSL3346  9       49       -8.450169   putative lipoprotein
BPSL3347  8       47       -8.212386   putative hydrolase
BPSL3348  11      49       -8.591074   putative methyl-accepting chemotaxis protein
BPSL3349  10      41       -5.887725   putative oxidoreductase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL3347  PHAGEIV  105    8e-27    Gene IV protein signature.
		>PHAGEIV#Gene IV protein signature.

Length = 426

Score = 105 bits (263), Expect = 8e-27
Identities = 61/254 (24%), Positives = 117/254 (46%), Gaps = 20/254 (7%)

Query: 217 NELVIVGSRDEVAMLRKLVPDLDTAPGEVVVRGWVYEVANTDSANSAWSIAVRLLSGQL- 275
N LV+ +D + L + + +D ++++ G ++EV D+ + +S A G +
Sbjct: 170 NLLVVSAPKDILDNLPQFLSTVDLPTDQILIEGLIFEVQQGDALD--FSFAAGSQRGTVA 227

Query: 276 ------RLSSGDTSSDASAMRFTGPGVDAAISALNADSRFKVVSSPHVRIVSGERVRLNV 329
RL+S +S+ S F G + ++ AL +S K++S P + +SG++ ++V
Sbjct: 228 GGVNTDRLTSVLSSAGGSFGIFNGDVLGLSVRALKTNSHSKILSVPRILTLSGQKGSISV 287

Query: 330 GQQVP--TQSSVSYQGSSGTPVQSITYQDAGLIFDVEPTVM-RDVIELKVREEISDFVAT 386
GQ VP T + P Q++ Q+ G+ V P M I L + + ++
Sbjct: 288 GQNVPFITGRVTGESANVNNPFQTVERQNVGISMSVFPVAMAGGNIVLDITSKADSLSSS 347

Query: 387 KTGVDTSPTKNTRQLQTVTRLKDGELVVLGGLIQDRDATARSGYSWLPS------FFDGR 440
D N R + T L+DG+ ++LGGL ++ + SG +L F R
Sbjct: 348 TQASDV--ITNQRSIATTVNLRDGQTLLLGGLTDYKNTSQDSGVPFLSKIPLIGLLFSSR 405

Query: 441 SSSKQRTEVLLVLQ 454
S S + + + ++++
Sbjct: 406 SDSNEESTLYVLVK 419


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL3349  PF05616  74     5e-16    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 74.0 bits (181), Expect = 5e-16
Identities = 58/192 (30%), Positives = 83/192 (43%), Gaps = 25/192 (13%)

Query: 395 QVVIDPNVQPDPNANPNPNPGTNPGTNPGTNPGTNPGTNPGTNPGTNPSTNPGTDPSTNP 454
QV+ P++ P PN P P +P NP NP P NPGT P NP DP NP
Sbjct: 308 QVIPRPDLTPGSAEAPNAQP--LPEVSPAENPANNP--APNENPGTRP--NPEPDPDLNP 361

Query: 455 GTNPGTNPGTNPGTNPDPKPDP----------KPDPKPDPKFCALYPDASACAPLGSAN- 503
NP T+ PGT PD P + + + C +PD AC L N
Sbjct: 362 DANPDTD--GQPGTRPDSPAVPDRPNGRHRKERKEGEDGGLLCKFFPDILACDRLPEPNP 419

Query: 504 --DVDVRRESKSVSLAPISIGLTNGVCPHP--YEVEVFGAPLRFDYA--PICELAAKLRP 557
D+++ E+ +V I + CP P + V V + +F ++ C +A +LR
Sbjct: 420 AEDLNLPSETVNVEFQKSGIFQDSAQCPAPVTFTVTVLDSSRQFAFSFENACTIAERLRY 479

Query: 558 LVLLLGALLAGL 569
++L L +A
Sbjct: 480 MLLALAWAVAAF 491


48  BPSL3392  BPSL3408  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
BPSL3392  -2      14       -3.086883  putative TetR-family transcriptional regulator
BPSL3393  -1      16       -3.351005  putative acyl-CoA dehydrogenase
BPSL3394  0       17       -3.372782  putative fatty-acid CoA ligase
BPSL3395  2       23       -4.187974  putative periplasmic amino acid binding
BPSL3396  1       25       -4.489953  bifunctional PutA protein [includes: proline
BPSL3397  1       20       -4.636743  hypothetical protein
BPSL3398  1       21       -4.195363  primosomal protein N'
BPSL3399  1       19       -3.204174  uroporphyrinogen decarboxylase
BPSL3400  0       17       -3.238412  cyclohexadienyl dehydratase
BPSL3401  -1      18       -3.841006  conserved hypothetical protein
BPSL3402  -1      19       -3.627253  putative long-chain-fatty-acid--CoA ligase
BPSL3403  1       25       -4.215123  ATP synthase epsilon chain
BPSL3404  1       25       -4.590217  ATP synthase beta chain
BPSL3405  2       27       -4.985866  ATP synthase gamma chain
BPSL3406  2       27       -4.884321  ATP synthase alpha chain
BPSL3407  1       24       -4.157310  ATP synthase delta chain
BPSL3408  2       25       -3.235314  ATP synthase B chain
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL3399  FLGMOTORFLIN  27     0.035    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 26.8 bits (59), Expect = 0.035
Identities = 24/87 (27%), Positives = 45/87 (51%), Gaps = 9/87 (10%)

Query: 5 ATIARPYAEALFRVAEGGDISAWSTLVQELAQVAQLPEVLSVASSPKVSRTQ--VAELLL 62
AT + A+A+F+ GGD+S +Q++ + +P L+V ++ RT+ + ELL
Sbjct: 28 ATTTKSAADAVFQQLGGGDVSG---AMQDIDLIMDIPVKLTV----ELGRTRMTIKELLR 80

Query: 63 AALKSPLASGAQAKNFVQMLVDNHRIA 89
S +A A + +L++ + IA
Sbjct: 81 LTQGSVVALDGLAGEPLDILINGYLIA 107


49  BPSL0004  BPSL0019  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
BPSL0004  -1      8        -0.595290  DNA-binding protein HU-alpha
BPSL0005  -1      9        -0.151620  putative cobalamin synthesis protein/P47K
BPSL0006  -1      9        -0.509486  putative exported protein
BPSL0007  -1      11       0.328560   general secretory pathway protein D
BPSL0008  3       14       1.442819   general secretory pathway protein E
BPSL0009  1       17       1.878230   general secretory pathway protein F
BPSL0010  2       15       2.066824   putative general secretory pathway protein
BPSL0011  1       13       2.953482   general secretory pathway protein G
BPSL0012  0       14       3.790500   general secretory pathway protein H
BPSL0013  0       14       3.569075   general secretory pathway protein I
BPSL0014  -2      10       4.246534   general secretory pathway protein J
BPSL0015  -2      9        4.340040   general secretory pathway protein K
BPSL0016  -2      10       4.703839   general secretory pathway protein L
BPSL0017  -1      11       2.812033   general secretory pathway protein M
BPSL0018  -1      10       2.755765   general secretory pathway protein N
BPSL0019  -1      12       3.019488   outer membrane efflux lipoprotein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL0004  DNABINDINGHU  109    2e-35    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 109 bits (275), Expect = 2e-35
Identities = 45/88 (51%), Positives = 58/88 (65%)

Query: 2 NKQELIDAVAAQTGASKAQTGETLDTLLEVIKKAVSKGDSVQLIGFGSFGSGKRAARTGR 61
NKQ+LI VA T +K + +D + + ++KG+ VQLIGFG+F +RAAR GR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 62 NPKTGETIKIPAAKTVKFTAGKAFKDAV 89
NP+TGE IKI A+K F AGKA KDAV
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAV 90


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL0007  BCTERIALGSPD  403    e-133    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 403 bits (1037), Expect = e-133
Identities = 215/691 (31%), Positives = 324/691 (46%), Gaps = 88/691 (12%)

Query: 13 TALVVAGIVAAQAAHAQVTLNFVNADIDQVAKAIGAATGKTIIVDPRVKGQLNLVAERPV 72
T L+ A ++ AA + + +F DI + + KT+I+DP V+G + + + +
Sbjct: 13 TLLIFAALLFRPAAAEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDML 72

Query: 73 PEDQALKTLQSALRMQGFALV-QDHGVLKVVPEADAKLQGVPTYIGNAPQARGDQVVTQV 131
E+Q + S L + GFA++ ++GVLKVV DAK VP AP GD+VVT+V
Sbjct: 73 NEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGI-GDEVVTRV 131

Query: 132 FELRNESANNLLPVLRPLI--SPNNTITAYPANNTIVVTDYADNVRRIAQIIAGVDSAAG 189
L N +A +L P+LR L + ++ Y +N +++T A ++R+ I+ VD+A
Sbjct: 132 VPLTNVAARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDNAGD 191

Query: 190 SQVAVVPLKNANAIDIAAQLTKLLDPGAIGNTDATLKVTVQADPRTNALLLRASNAQRLA 249
V VPL A+A D+ +T+L + ++ V AD RTNA+L+ R
Sbjct: 192 RSVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNSR-Q 250

Query: 250 TAKKIAQQLDAPSGVPGNMHVVPLRNAEAVKLAKTLRGMLGKGGGESGSSASSNDANAFN 309
+ +QLD GN V+ L+ A+A L + L G+
Sbjct: 251 RIIAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGIS-------------------- 290

Query: 310 QGGSQSGSNFSTGASGTPPLPSGLSSNSSGGAGGTTGGGGLGNAGLLGGDKDKGDDNQPG 369
S + S +
Sbjct: 291 ---------------------STMQSEKQAAKPVAALDKNI------------------- 310

Query: 370 GMIQADAASNSLIITASDPVYRNLRAVIDQLDARRAQVYIEALVVELQATTSANLGIQWQ 429
+I+A +N+LI+TA+ V +L VI QLD RR QV +EA++ E+Q NLGIQW
Sbjct: 311 -IIKAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWA 369

Query: 430 VANNALYAGTNLVTGQTGLGNSIVNLTAGAVT--NPGGTLGSLG---SITNGLNIGWLHN 484
N +T T G I AGA G SL S NG+ G
Sbjct: 370 NKNAG-------MTQFTNSGLPISTAIAGANQYNKDGTVSSSLASALSSFNGIAAG---- 418

Query: 485 MFGVQGLGALLQFFAGSSDANVLSTPNLVTLDNEEAKIVVGQNVPIPTGSYSNLTSGTTA 544
F LL + S+ ++L+TP++VTLDN EA VGQ VP+ TGS + +
Sbjct: 419 -FYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTGS----QTTSGD 473

Query: 545 NAFNTYDRRDVGLTLHVKPQITEGGILKLQLYTEDSAVVPGTNTTSANSPGPTFTKRSIQ 604
N FNT +R+ VG+ L VKPQI EG + L++ E S+V +++++ G TF R++
Sbjct: 474 NIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSV-ADAASSTSSDLGATFNTRTVN 532

Query: 605 STVLADNGEIIVLGGLMQDNYQVSNTKVPLLGDIPWIGQLFRSEGKTRQKTNLMVFLRPV 664
+ VL +GE +V+GGL+ + + KVPLLGDIP IG LFRS K K NLM+F+RP
Sbjct: 533 NAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPT 592

Query: 665 IINDRETAQAVTSNRYDYIQGVTGAYKSDNN 695
+I DR+ + +S +Y + N
Sbjct: 593 VIRDRDEYRQASSGQYTAFNDAQSKQRGKEN 623


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL0009  BCTERIALGSPF  382    e-133    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 382 bits (982), Expect = e-133
Identities = 174/406 (42%), Positives = 266/406 (65%), Gaps = 2/406 (0%)

Query: 1 MPAFRFEAIDASGRAQKGVIEADSARNARGQLRTQGLTPLVVEPAASAQRGARSQRLALG 60
M + ++A+DA G+ +G EADSAR AR LR +GL PL V+ Q+ + S L+L
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 61 R--KLSQREQAILTRQLASLLVAGLPLDEALAVLTEQAERDYIRELMAAIRAEVLGGHSL 118
R +LS + A+LTRQLA+L+ A +PL+EAL + +Q+E+ ++ +LMAA+R++V+ GHSL
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 119 ANALTQHPRDFPEIYRALVAAGEHTGKLGIVLSRLADYIEERNALKQKILLAFTYPAIVT 178
A+A+ P F +Y A+VAAGE +G L VL+RLADY E+R ++ +I A YP ++T
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180

Query: 179 VIAFGIVTFLLSYVVPQVVNVFASTKQQLPVLTIVMMALSDFVRHWWWAILIGIAAVVYL 238
V+A +V+ LLS VVP+VV F KQ LP+ T V+M +SD VR + +L+ + A
Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240

Query: 239 VKATLSRDGPRLAFDRWLLTAPLAGKLVRGYNTVRFASTLGILTAAGVPILRALQAAGET 298
+ L ++ R++F R LL PL G++ RG NT R+A TL IL A+ VP+L+A++ +G+
Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300

Query: 299 LSNRAMRGNIDDAIVRVREGSALSRALNNVKTFPPVLVHLIRSGEATGDVTTMLDRAAEG 358
+SN R + A VREG +L +AL FPP++ H+I SGE +G++ +ML+RAA+
Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360

Query: 359 ESRELERRTMFLTSLLEPLLILAMGGIVLVIVLAVMLPIIELNNMV 404
+ RE + L EPLL+++M +VL IVLA++ PI++LN ++
Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL0011  BCTERIALGSPG  188    6e-65    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 188 bits (480), Expect = 6e-65
Identities = 67/140 (47%), Positives = 94/140 (67%), Gaps = 3/140 (2%)

Query: 10 QAARRQRGFTLIEIMVVVAILGILAALIVPKIMSRPDEARRIAAKQDIGTIMQALKLYRL 69
+A +QRGFTL+EIMVV+ I+G+LA+L+VP +M ++A + A DI + AL +Y+L
Sbjct: 2 RATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKL 61

Query: 70 DNGRYPTQDQGLNALIQKPTTDPIPNNWKDGGYLERLPNDPWGNSYKYLNPGVHGEIDVF 129
DN YPT +QGL +L++ PT P+ N+ GY++RLP DPWGN Y +NPG HG D+
Sbjct: 62 DNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDLL 121

Query: 130 SYGADGKEGGESNDSDIGSW 149
S G DG+ G E DI +W
Sbjct: 122 SAGPDGEMGTE---DDITNW 138


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL0012  BCTERIALGSPH  51     1e-10    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 50.7 bits (121), Expect = 1e-10
Identities = 20/101 (19%), Positives = 33/101 (32%), Gaps = 15/101 (14%)

Query: 11 RARGFTLLEMLVVLVIAGILVSVASLTLRRNPRTDLREEAQRIALLFETAGDEAQVRARP 70
R RGFTLLEM+++L++ G+ + L + + R +
Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTGQF 61

Query: 71 IAWRATEHGFRF---------------DIRTGDGWRPLRDD 96
++F D +G W PLR
Sbjct: 62 FGVSVHPDRWQFLVLEARDGADPAPADDGWSGYRWLPLRAG 102


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL0013  BCTERIALGSPG  30     0.001    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 30.2 bits (68), Expect = 0.001
Identities = 10/26 (38%), Positives = 18/26 (69%)

Query: 8 RSPARSRGFTMIEVLVALAIIAVALA 33
R+ + RGFT++E++V + II V +
Sbjct: 2 RATDKQRGFTLLEIMVVIVIIGVLAS 27


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL0014  BCTERIALGSPG  33     3e-04    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 33.3 bits (76), Expect = 3e-04
Identities = 17/72 (23%), Positives = 34/72 (47%), Gaps = 3/72 (4%)

Query: 33 RGFTLIEMMIAITILAVIA-ILSWRGLDQIIRGREKVAAAMEDERVFAQMFDQMRIDARR 91
RGFTL+E+M+ I I+ V+A ++ + + ++ A+ D D ++D
Sbjct: 8 RGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQK--AVSDIVALENALDMYKLDNHH 65

Query: 92 AATDDEAGQPAV 103
T ++ + V
Sbjct: 66 YPTTNQGLESLV 77


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL0019  PF05616  32     0.007    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 32.0 bits (72), Expect = 0.007
Identities = 22/64 (34%), Positives = 26/64 (40%), Gaps = 10/64 (15%)

Query: 495 PAGAAAPAAMPAAAVAPAAMPAAAVAPAARPAA---------VVAAAGPDTQARRPRATP 545
P A AP A P V+PA PA AP P + A PDT +P P
Sbjct: 317 PGSAEAPNAQPLPEVSPAENPANNPAPNENPGTRPNPEPDPDLNPDANPDTDG-QPGTRP 375

Query: 546 AAPA 549
+PA
Sbjct: 376 DSPA 379


50  BPSL0027  BPSL0032  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
BPSL0027  2       10       -0.763229  flagellar motor switch protein FliM
BPSL0028  1       13       1.216454   probable flagellar motor switch protein
BPSL0029  -1      11       2.085258   flagellar protein
BPSL0030  -1      12       2.334424   flagellar biosynthetic protein
BPSL0031  -3      9        2.252436   flagellar biosynthetic protein
BPSL0032  -2      8        1.648050   flagellar biosynthetic protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL0027  FLGMOTORFLIM  276    2e-93    Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 276 bits (706), Expect = 2e-93
Identities = 82/324 (25%), Positives = 159/324 (49%), Gaps = 10/324 (3%)

Query: 5 EFMSQEEVDALLKGVTGEDDSADEPAEASG---IRPYNIATQERIVRGRMPGLEIINDRF 61
E +SQ+E+D LL ++ D S ++ S I Y+ ++ + +M L ++++ F
Sbjct: 3 EVLSQDEIDQLLTAISSGDASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLMHETF 62

Query: 62 ARLLRIGIFNFMRRTAEISVSQVKVQKYSEFTRNLPIPTNLNLVHVKPLRGTSLFVFDPN 121
ARL + +R + V+ V Y EF R++P P+ L ++ + PL+G ++ DP+
Sbjct: 63 ARLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVLEVDPS 122

Query: 122 LVFFVVDNLFGGDGRFHTRVEGRDFTATEQRIIGKLLNLVFEHYASAWKSVRPLQFEFVR 181
+ F ++D LFGG G+ RD T E ++ ++ + + +W V L+ +
Sbjct: 123 ITFSIIDRLFGGTGQAAKVQ--RDLTDIENSVMEGVIVRILANVRESWTQVIDLRPRLGQ 180

Query: 182 SEMHTQFANVATPNEIVIVTQFSIEFGPTGGTLHICMPYSMIEPIRDVLSSPIQGEAL-- 239
E + QFA + P+E+V++ + G G ++ C+PY IEPI LSS ++
Sbjct: 181 IETNPQFAQIVPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFSSVRR 240

Query: 240 EVDRRWVRVLSQQVQSAEVELVADLAEVPTTFEKILNLRTGDVLPLD---ITDSITAKVD 296
+++ VL ++ + ++++VA++ + + IL LR GD++ L + D +
Sbjct: 241 SSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFVLSIG 300

Query: 297 GVPVMECGYGIFNGQYALRVQRMI 320
C G+ + A ++ I
Sbjct: 301 NRKKFLCQPGVVGKKIAAQILERI 324


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL0028  FLGMOTORFLIN  134    3e-43    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 134 bits (338), Expect = 3e-43
Identities = 78/126 (61%), Positives = 97/126 (76%), Gaps = 3/126 (2%)

Query: 41 AMDD-WAAALAEQNQQPIETGATGAGVFRPLSKATASSTHNDIDLILDIPVKMTVELGRT 99
A+DD WA AL EQ ++ A VF+ L S DIDLI+DIPVK+TVELGRT
Sbjct: 14 ALDDLWADALNEQKATTTKSAADA--VFQQLGGGDVSGAMQDIDLIMDIPVKLTVELGRT 71

Query: 100 KIAIRNLLQLAQGSVVELDGLAGEPMDVLVNGCLIAQGEVVVVNDKFGIRLTDIITPSER 159
++ I+ LL+L QGSVV LDGLAGEP+D+L+NG LIAQGEVVVV DK+G+R+TDIITPSER
Sbjct: 72 RMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGVRITDIITPSER 131

Query: 160 IRKLNR 165
+R+L+R
Sbjct: 132 MRRLSR 137


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL0030  FLGBIOSNFLIP  288    e-101    Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature.

Length = 245

Score = 288 bits (739), Expect = e-101
Identities = 153/242 (63%), Positives = 192/242 (79%), Gaps = 1/242 (0%)

Query: 11 RWLPAILIGLAPALACAQAAGLPAFNSAPGPNGGTTYSLSVQTMLLLTMLSFLPAMLLMM 70
R L + L + A LP S P P GG ++SL VQT++ +T L+F+PA+LLMM
Sbjct: 3 RLLSVAPVLLW-LITPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLMM 61

Query: 71 TSFTRIIIVLSLLRQAIGTASTPPSQVLVGLALFLTLFVMSPVLDRAYNDAYKPFSEGTL 130
TSFTRIIIV LLR A+GT S PP+QVL+GLALFLT F+MSPV+D+ Y DAY+PFSE +
Sbjct: 62 TSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEKI 121

Query: 131 QMDQAVQRGTAPFKAFMLKQTRETDLALFAKISKAAPMQGPEDVPLSLLVPAFVTSELKT 190
M +A+++G P + FML+QTRE DL LFA+++ P+QGPE VP+ +L+PA+VTSELKT
Sbjct: 122 SMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELKT 181

Query: 191 GFQIGFTIFIPFLIIDMVVASVLMSMGMMMVSPATVSLPFKLMLFVLVDGWQLLIGSLAQ 250
FQIGFTIFIPFLIID+V+ASVLM++GMMMV PAT++LPFKLMLFVLVDGWQLL+GSLAQ
Sbjct: 182 AFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLAQ 241

Query: 251 SF 252
SF
Sbjct: 242 SF 243


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL0031  TYPE3IMQPROT  69     4e-19    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 68.6 bits (168), Expect = 4e-19
Identities = 26/85 (30%), Positives = 46/85 (54%)

Query: 4 ENVMTLAHQAMYIGLLLAAPLLLVALAVGLVVSLFQAATQINEATLSFIPKLLAVAATMV 63
++++ ++A+Y+ L+L+ +VA +GL+V LFQ TQ+ E TL F KLL V +
Sbjct: 2 DDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLF 61

Query: 64 IAGPWMLSTMIDYLRETLLRVATLG 88
+ W ++ Y R+ + G
Sbjct: 62 LLSGWYGEVLLSYGRQVIFLALAKG 86


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL0032  TYPE3IMRPROT  162    3e-51    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.

Length = 261

Score = 162 bits (411), Expect = 3e-51
Identities = 117/250 (46%), Positives = 158/250 (63%), Gaps = 1/250 (0%)

Query: 1 MFSVTYAQLNGWLTAFLWPFVRMLALVAIAPVTGHRSTPVRVKIGLAGFMALVVAPTLPP 60
M VT Q WL + WP +R+LAL++ AP+ RS P RVK+GLA + +AP+LP
Sbjct: 1 MLQVTSEQWLSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA 60

Query: 61 IPVATVFSAQGVWIIVNQFLIGAALGFTMQIVFAAIEAAGDIIGLSMGLGFATFFDPHSS 120
V VFS +W+ V Q LIG ALGFTMQ FAA+ AG+IIGL MGL FATF DP S
Sbjct: 61 NDVP-VFSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASH 119

Query: 121 GATPVMGRFLNAVAILAFLAFDGHLQVFAALVDSFRLVPVSANLLRAAGWQTLVAFGAAI 180
PV+ R ++ +A+L FL F+GHL + + LVD+F +P+ L + + L G+ I
Sbjct: 120 LNMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLI 179

Query: 181 FEMGLLLALPVVAALLIANLALGILNRAAPQIGIFQVGFPVTMLVGLLLVQLMAPNLIPF 240
F GL+LALP++ LL NLALG+LNR APQ+ IF +GFP+T+ VG+ L+ + P + PF
Sbjct: 180 FLNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPF 239

Query: 241 VGRLFDTGVD 250
LF +
Sbjct: 240 CEHLFSEIFN 249


51  BPSL0039  BPSL0044  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
BPSL0039  -1      11       0.111099   sensor kinase protein
BPSL0040  -1      12       -0.596116  response regulator protein
BPSL0041  -1      12       -0.288083  outer membrane porin lipoprotein precursor
BPSL0042  -1      11       0.239510   type III restriction-modification system
BPSL0043  -2      10       0.910445   type III restriction system endonuclease
BPSL0044  -2      12       1.954644   putative outer membrane porin protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL0039  PF06580  54     3e-10    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 54.5 bits (131), Expect = 3e-10
Identities = 24/128 (18%), Positives = 45/128 (35%), Gaps = 22/128 (17%)

Query: 334 RIDLGAELDDDLQVAGSESLLSALLMNLVDNAVRYAHE----GGRVTVSARRDGDAVVLE 389
R+ +++ + + L+ LV+N +++ GG++ + +D V LE
Sbjct: 239 RLQFENQINPAIM---DVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLE 295

Query: 390 VVDDGPGIPAEARPHVFKRFYRVARDEEGTGLGLAIVEE-IAQSHGGAVSLATGPGNRGV 448
V + G +E TG GL V E + +G + V
Sbjct: 296 VENTGSLALKN--------------TKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKV 341

Query: 449 RMTVRLPA 456
V +P
Sbjct: 342 NAMVLIPG 349


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
BPSL0040  HTHFIS  96     3e-25    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 96.1 bits (239), Expect = 3e-25
Identities = 30/119 (25%), Positives = 60/119 (50%), Gaps = 1/119 (0%)

Query: 2 KLLLVEDNAELAHWIVDLLRGEGFGVDSAPDGESADTVLKAQRYDALLLDMRLPGMSGKE 61
+L+ +D+A + + L G+ V + + + A D ++ D+ +P + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 LLARLRRRGDNVPVLMLTAHGSVDDKVDCFSAGADDYVVKPFESRELVARI-RALIRRQ 119
LL R+++ ++PVL+++A + + GA DY+ KPF+ EL+ I RAL +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL0041  ECOLNEIPORIN  64     1e-13    E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 64.1 bits (156), Expect = 1e-13
Identities = 56/228 (24%), Positives = 94/228 (41%), Gaps = 28/228 (12%)

Query: 1 MKRQYLALSIATAACAAPQAHAQSSVQLYGLIDLSVPTYRSHANAKGDHVIGMGLGGEPW 60
MK+ +AL++A AA + V LYG I V T RS A+ G + G
Sbjct: 1 MKKSLIALTLAALPVAA-----MADVTLYGTIKAGVETSRSVAH-NGAQAASVETGTGIV 54

Query: 61 FSGSRWGLKGAEDIGGGTKVIFRLESEYTVADGNMEDPGQIFDRDAWVGVENDTFGKLTA 120
GS+ G KG ED+G G K I+++E + ++A + +R +++G++ FGKL
Sbjct: 55 DLGSKIGFKGQEDLGNGLKAIWQVEQKASIAGTD----SGWGNRQSFIGLKGG-FGKLRV 109

Query: 121 GFQNTIARDAAAIYGDPYGSAKLTTEEGGWTNANNFKQMIFYAAGATGTRYNNGLAWKKL 180
G N++ +D I +P+ S RY++ +
Sbjct: 110 GRLNSVLKDTGDI--NPWDSKSDYLGVNKIAEPEARL---------ISVRYDS----PEF 154

Query: 181 FGNGIFASAGYAFSNSTSFGQNSTYQVALGYNGGPFNVSGFFSHVNHA 228
G+ S YA +++ + +Y Y G F V ++ H
Sbjct: 155 A--GLSGSVQYALNDNAGRHNSESYHAGFNYKNGGFFVQYGGAYKRHH 200


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL0044  ECOLNEIPORIN  74     5e-17    E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 74.5 bits (183), Expect = 5e-17
Identities = 80/356 (22%), Positives = 118/356 (33%), Gaps = 75/356 (21%)

Query: 1 MKK--FAVAAAGLAVATGAHASDGSVTLFGLIDAGVSYVSNEGGKRNVYFDDGIAVPNLW 58
MKK A+ A L VA A VTL+G I AGV + +
Sbjct: 1 MKKSLIALTLAALPVAAMA-----DVTLYGTIKAGVETSRSVAHNGAQAASVETGTGIVD 55

Query: 59 -----GLRGTEDLGGGAKAIFELTSQYALGNGAALPTPGSMFSRTALVGLWSERLGSVTL 113
G +G EDLG G KAI+++ + T +R + +GL G + +
Sbjct: 56 LGSKIGFKGQEDLGNGLKAIWQVEQ-----KASIAGTDSGWGNRQSFIGLKGG-FGKLRV 109

Query: 114 GQQYDFMTDSLTFGSFDGAFRYGGLYNFRQGPFSKLGIPDNPTGSFDFDRLAGSSRVPNS 173
G+ + D+ +D Y G+ + P+ S
Sbjct: 110 GRLNSVLKDTGDINPWDSKSDYLGVNKIAE--------PEA---------------RLIS 146

Query: 174 VKYTSANLNGLVFGLMYGFGNQAGGGLAANSTVSAGLKYETGSFAL--GAAYVEVKYPQM 231
V+Y S GL + Y + AG + + AG Y+ G F + G AY Q
Sbjct: 147 VRYDSPEFAGLSGSVQYALNDNAGRH--NSESYHAGFNYKNGGFFVQYGGAYKRHHQVQE 204

Query: 232 NNGHDGLRNWGLGARYALSAFDLNL-LYTNTRNT--LTGAAIDVIQAGVRYVGAPWTIGA 288
N + + L + Y L + ++ + Q V A
Sbjct: 205 NVNIEKYQIHRLVSGY--DNDALYASVAVQQQDAKLVEENYSHNSQTEV---------AA 253

Query: 289 NYEYMKGNAQLDRNYAH----------------QVTATAQYALSKRTSAYVETVYQ 328
Y GN +YAH QV A+Y SKRTSA V +
Sbjct: 254 TLAYRFGNVTPRVSYAHGFKGSFDATNYNNDYDQVVVGAEYDFSKRTSALVSAGWL 309


52  BPSL0180  BPSL0189  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
BPSL0180  -4      9        1.705128   HpcH/HpaI aldolase family protein
BPSL0181  -3      9        0.596054   putative exported protein
BPSL0182  -1      11       -0.074171  rod shape-determining protein
BPSL0183  -2      13       0.838943   penicillin-binding protein
BPSL0184  0       12       0.010093   putative rod shape-determining protein
BPSL0185  -1      12       -0.300897  putative rod shape-determining protein
BPSL0186  -1      11       -1.635736  putative rod shape-determining protein
BPSL0187  -2      11       -1.482931  glutamyl-tRNA amidotransferase subunit C
BPSL0188  -2      10       -1.025876  glutamyl-tRNA amidotransferase subunit A
BPSL0189  -2      11       -0.867316  glutamyl-tRNA amidotransferase subunit B
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL0180  PHPHTRNFRASE  44     3e-07    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 44.0 bits (104), Expect = 3e-07
Identities = 36/178 (20%), Positives = 60/178 (33%), Gaps = 34/178 (19%)

Query: 87 RALDAGARTLMFPGVETADEAAHAVRLTRFQAPDAPDGLRGVAGIVRAAAYGMRRDYVQT 146
RA G +MFP + T +E LR I++ + + V
Sbjct: 380 RASTYGNLKVMFPMIATLEE------------------LRQAKAIMQEEKDKLLSEGVDV 421

Query: 147 ANAQIATIVQIESARGVDEAERIAATPGVDCVFVGPADL----------SASLGHLGDTK 196
++ I + +E A A VD +G DL + + +L
Sbjct: 422 SD-SIEVGIMVEIPSTAVAANLFA--KEVDFFSIGTNDLIQYTMAADRMNERVSYLYQPY 478

Query: 197 HPDVAAALEHVLAAGRRAGVPVGI---FAADTAGARQSLEAGFRVVALSADVVWLLRA 251
HP + ++ V+ A G VG+ A D L G ++SA + R+
Sbjct: 479 HPAILRLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLDEFSMSATSILPARS 536


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL0183  cloacin  37     2e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 37.4 bits (86), Expect = 2e-04
Identities = 30/88 (34%), Positives = 36/88 (40%), Gaps = 9/88 (10%)

Query: 681 SGADGASGASGASGASG--------ASGASGASGAGGEPTEHANAGGNSAGGGIAGGAAG 732
SG DG +GA SG GAS G +E+ GG S G GG +G
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 733 TANNGSGAAAPGGM-PGANGAATGAPPA 759
N G + GG G N +A AP A
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVA 89



Score = 37.4 bits (86), Expect = 2e-04
Identities = 26/82 (31%), Positives = 32/82 (39%), Gaps = 2/82 (2%)

Query: 681 SGADGASGASGASGASGASGASGASGAGGEPTEHANAGGNSAGGGIAGGAAGTANNGSGA 740
+G GAS SG S + G G + GN G G +GG +GT N S
Sbjct: 25 TGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAV 84

Query: 741 AAPG--GMPGANGAATGAPPAS 760
AAP G P + G S
Sbjct: 85 AAPVAFGFPALSTPGAGGLAVS 106


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
BPSL0185  GPOSANCHOR  28     0.046    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 28.5 bits (63), Expect = 0.046
Identities = 16/64 (25%), Positives = 22/64 (34%), Gaps = 3/64 (4%)

Query: 293 KAAKGKKATKGADKSAKAADKGADKDKGAKPAAAPPVPARSRPAGPAQPAAPLKPATAPS 352
K + +KA A A+A A K+K AK A + + P A P
Sbjct: 424 KLTEKEKAELQAKLEAEAK---ALKEKLAKQAEELAKLRAGKASDSQTPDAKPGNKAVPG 480

Query: 353 PGAP 356
G
Sbjct: 481 KGQA 484


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL0186  SHAPEPROTEIN  504    0.0      Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 504 bits (1300), Expect = 0.0
Identities = 247/348 (70%), Positives = 294/348 (84%), Gaps = 2/348 (0%)

Query: 1 MFGFLRSYFSNDLAIDLGTANTLIYMRGKGIVLDEPSVVSIRQEGGPNGKKTIQAVGKEA 60
M R FSNDL+IDLGTANTLIY++G+GIVL+EPSVV+IRQ+ K++ AVG +A
Sbjct: 1 MLKKFRGMFSNDLSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRA-GSPKSVAAVGHDA 59

Query: 61 KQMLGKVPGNIEAIRPMKDGVIADFTVTEQMIKQFIKTAHESRMFSPSPRIIICVPCGST 120
KQMLG+ PGNI AIRPMKDGVIADF VTE+M++ FIK H + PSPR+++CVP G+T
Sbjct: 60 KQMLGRTPGNIAAIRPMKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGAT 119

Query: 121 QVERRAIKEAAHGAGASQVYLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVGVISLG 180
QVERRAI+E+A GAGA +V+LIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEV VISL
Sbjct: 120 QVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLN 179

Query: 181 GIVYKGSVRVGGDKFDEAIVNYIRRNYGMLIGEQTAEAIKKEIGSAFPGSEVKEMEVKGR 240
G+VY SVR+GGD+FDEAI+NY+RRNYG LIGE TAE IK EIGSA+PG EV+E+EV+GR
Sbjct: 180 GVVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGR 239

Query: 241 NLSEGIPRSFTISSNEILEALTDPLNQIVSSVKIALEQTPPELGADIAERGMMLTGGGAL 300
NL+EG+PR FT++SNEILEAL +PL IVS+V +ALEQ PPEL +DI+ERGM+LTGGGAL
Sbjct: 240 NLAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGAL 299

Query: 301 LRDLDRLLAEETGLPVLVAEDPLTCVVRGSGMALERMDKL-GSIFSYE 347
LR+LDRLL EETG+PV+VAEDPLTCV RG G ALE +D G +FS E
Sbjct: 300 LRNLDRLLMEETGIPVVVAEDPLTCVARGGGKALEMIDMHGGDLFSEE 347


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
BPSL0189  TYPE4SSCAGA  31     0.013    Type IV secretion system CagA exotoxin signature.
		>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature.

Length = 1147

Score = 30.8 bits (69), Expect = 0.013
Identities = 29/89 (32%), Positives = 43/89 (48%), Gaps = 5/89 (5%)

Query: 395 SNKIAKEIFVTIWDEKAADEGAADRIIEAKGLK-QISDTGALEAIIDEVLAANAKSVEEF 453
+N EIF I E D A KG+K ++SD LE + ++ L KS +EF
Sbjct: 648 ANSQKDEIFALINKEANRDARAIAYAQNLKGIKRELSDK--LENV-NKNLKDFDKSFDEF 704

Query: 454 RAGKDKAFNALVGQAMKATKGKANPQQVN 482
+ GK+K F+ + +KA KG +N
Sbjct: 705 KNGKNKDFSK-AEETLKALKGSVKDLGIN 732


53  BPSL0198  BPSL0203  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
BPSL0198  0       11       -1.224660  TetR family transcription regulatory protein
BPSL0199  0       11       -1.351324  haloacid dehalogenase-like hydrolase
BPSL0200  0       12       -2.387756  putative acetylglutamate kinase
BPSL0201  1       11       -2.859033  sensor kinase protein
BPSL0202  2       13       -2.516471  response regulator protein
BPSL0203  1       12       -1.919177  ATP-dependent Hsl protease ATP-binding subunit
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL0198  HTHTETR  59     9e-13    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 58.9 bits (142), Expect = 9e-13
Identities = 31/183 (16%), Positives = 62/183 (33%), Gaps = 15/183 (8%)

Query: 24 ANRTRPKPGERRVHILQTLASMLEAPKSEKITTAALAARLDVSEAALYRHFSSKAQMFEG 83
A +T+ + E R HIL + + +A V+ A+Y HF K+ +F
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 84 LIEFIEETFFGLVNQIAANEPNGVLQA-RSIALMLLNFSAKNPGMTRVLTGEALVGEHER 142
+ E E L + A P L R I + +L + ++ + H+
Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLME----IIFHKC 117

Query: 143 LAERVNQMLERVEASIKQCLR---VALLEAQAHAAGGAPPPVPLPDDYDPALRASLVISY 199
++++ + ++ L+ A P + A ++ Y
Sbjct: 118 EFVGEMAVVQQAQRNLCLESYDRIEQTLKH-CIEAKMLPADL------MTRRAAIIMRGY 170

Query: 200 VLG 202
+ G
Sbjct: 171 ISG 173


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL0200  CARBMTKINASE  44     5e-07    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 43.7 bits (103), Expect = 5e-07
Identities = 27/99 (27%), Positives = 48/99 (48%), Gaps = 6/99 (6%)

Query: 180 IPVISPIGFGEDGLSYNINADLVAGKLATVLNAEKLVMMTNIPGVMDKEG----NLLTDL 235
+PVI G G+ I+ DL KLA +NA+ +++T++ G G L ++
Sbjct: 197 VPVILEDG-EIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVNGAALYYGTEKEQWLREV 255

Query: 236 SAREIDALFEDGT-ISGGMLPKISSALDAAKSGVKSVHI 273
E+ +E+G +G M PK+ +A+ + G + I
Sbjct: 256 KVEELRKYYEEGHFKAGSMGPKVLAAIRFIEWGGERAII 294



Score = 36.7 bits (85), Expect = 8e-05
Identities = 21/60 (35%), Positives = 27/60 (45%), Gaps = 10/60 (16%)

Query: 31 GKTVVIKYGGNAMTEERLKQGF----------ARDVILLKLVGINPVIVHGGGPQIDQAL 80
GK VVI GGNA+ + K + AR + + G VI HG GPQ+ L
Sbjct: 2 GKRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSLL 61


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
BPSL0202  HTHFIS  88     9e-23    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 87.6 bits (217), Expect = 9e-23
Identities = 30/127 (23%), Positives = 60/127 (47%)

Query: 4 MSDKNFLVIDDNEVFAGTLARGLERRGYAVRQAHNKDEALKLAGAEKFEFITVDLHLGND 63
M+ LV DD+ L + L R GY VR N + A + + D+ + ++
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 64 SGLSLIAPLCDLQPDARILVLTGYASIATAVQAVKDGADNYLAKPANVESILAALQTNAS 123
+ L+ + +PD +LV++ + TA++A + GA +YL KP ++ ++ + +
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 124 EVQAEEA 130
E + +
Sbjct: 121 EPKRRPS 127



Score = 45.2 bits (107), Expect = 4e-08
Identities = 16/101 (15%), Positives = 32/101 (31%), Gaps = 3/101 (2%)

Query: 78 DARILVLTGYASIATAVQAVKDGADNYLAKPANVESILAALQTNASEVQAEEALENPVVL 137
I+ + I + L+ VE + + + L
Sbjct: 375 TREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGL---YDR 431

Query: 138 SVDRLEWEHIQRVLAENNNNISATARALNMHRRTLQRKLAK 178
+ +E+ I L N A L ++R TL++K+ +
Sbjct: 432 VLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRE 472


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
BPSL0203  HTHFIS  31     0.014    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.6 bits (69), Expect = 0.014
Identities = 13/68 (19%), Positives = 29/68 (42%), Gaps = 15/68 (22%)

Query: 17 IIGQAKAKKAVAVALRNRWRRQQVAEPLRQEITPKNILMIGPTGVGKTEIAR---RLAKL 73
++G++ A + + ++ + T +++ G +G GK +AR K
Sbjct: 139 LVGRSAAMQEI------YRVLARLMQ------TDLTLMITGESGTGKELVARALHDYGKR 186

Query: 74 ADAPFIKI 81
+ PF+ I
Sbjct: 187 RNGPFVAI 194


54  BPSL0225  BPSL0235  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
BPSL0225  1       12       3.519864   putative flagellar hook-length control protein
BPSL0226  2       14       2.058300   flagellar fliJ protein
BPSL0227  3       12       1.656567   flagellum-specific ATP synthase
BPSL0228  1       12       0.629670   flagellar assembly protein
BPSL0229  2       10       2.678934   flagellar motor switch protein
BPSL0230  0       9        4.054433   flagellar M-ring protein
BPSL0231  2       10       4.689393   flagellar hook-basal body complex protein
BPSL0232  0       10       5.031523   flagellar protein
BPSL0233  -2      8        4.159905   conserved hypothetical protein
BPSL0234  -1      8        3.487850   hypothetical protein
BPSL0235  -1      11       2.462175   putative export system protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
BPSL0225  FLGHOOKFLIK  74     2e-16    Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 74.1 bits (181), Expect = 2e-16
Identities = 79/257 (30%), Positives = 109/257 (42%), Gaps = 8/257 (3%)

Query: 216 NGDASAPLAANGAAFDKLLAGAKAPAAQAAPTDASGANPATALANAAANAAQPDASG--A 273
N D +A L+A A K A + T L + AQPD +
Sbjct: 124 NEDVTASLSALFAMLPGFDNTPKVTDAPSTVLPTEKPTLFTKLTSEQLTTAQPDDAPGTP 183

Query: 274 LAALQDAADSARATLAASSAPAALQQAA-PAALAANASAAAASAAPSLAPPVGTPDWTDA 332
L A++ S P+ + AA P AAP L+ P+G+ +W +
Sbjct: 184 AQPLTPLVAEAQSKAEVISTPSPVTAAASPLITPHQTQPLPTVAAPVLSAPLGSHEWQQS 243

Query: 333 LSQKVVFLSNAHQQSAELTLNPPDLGPLQVVLRVADNHAHALFVSQHAQVRDAVEAALPK 392
LSQ + + QQSAEL L+P DLG +Q+ L+V DN A VS H VR A+EAALP
Sbjct: 244 LSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAALPV 303

Query: 393 LREAMEAGGLGLGSASVSDGGFASAQQQQTPQRQSSDGSATRRAFGASTADAALDELAAA 452
LR + G+ LG +++S F+ QQ + Q+QS +A D L
Sbjct: 304 LRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQR-TANHEPLAGEDDDT----LPVP 358

Query: 453 SSGGAARRTVGMVDTFA 469
S VD FA
Sbjct: 359 VSLQGRVTGNSGVDIFA 375


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL0226  FLGFLIJ  60     2e-14    Flagellar FliJ protein signature.
		>FLGFLIJ#Flagellar FliJ protein signature.

Length = 147

Score = 59.8 bits (144), Expect = 2e-14
Identities = 43/140 (30%), Positives = 74/140 (52%)

Query: 1 MAQSFPLQLLLERAQDDLDTAAKQLGRAQRERTDAQAQLDALMRYRDEYRVRFAESAQSG 60
MA+ L L + A+ +++ AA+ LG +R A+ QL L+ Y++EYR +G
Sbjct: 1 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG 60

Query: 61 MPAGNWRNFQAFLDTLDAAIEQQRRVLAAAQTRIDAARPEWQAKKRTLGSYEILQARGAR 120
+ + W N+Q F+ TL+ AI Q R+ L ++D A W+ KK+ L +++ LQ R +
Sbjct: 61 ITSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQST 120

Query: 121 QDAQRAAKREQRDADEHAAK 140
+ +Q+ DE A +
Sbjct: 121 AALLAENRLDQKKMDEFAQR 140


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL0228  FLGFLIH  109    1e-31    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 109 bits (273), Expect = 1e-31
Identities = 64/184 (34%), Positives = 106/184 (57%), Gaps = 4/184 (2%)

Query: 37 AAAALAAELQRVRDAAHAEGLAAGHVEGQALGYQAGYEQGRAKGFDEGQAEAHTHAAQLA 96
A +L +L +++ AH +G AG EG+ G++ GY++G A+G ++G AEA + A +
Sbjct: 36 AEPSLEQQLAQLQMQAHEQGYQAGIAEGRQQGHKQGYQEGLAQGLEQGLAEAKSQQAPIH 95

Query: 97 A----LAASFRDALAGVERDLADDIATLALEIAQQVVRQHVQHDPAALIAAAREVLAAEP 152
A L + F+ L ++ +A + +ALE A+QV+ Q D +ALI +++L EP
Sbjct: 96 ARMQQLVSEFQTTLDALDSVIASRLMQMALEAARQVIGQTPTVDNSALIKQIQQLLQQEP 155

Query: 153 ALAGAPHLIVNPADLPVVEAYLKDELDTLGWSVRTDTSIERGGCRAHASTGEIDATLTTR 212
+G P L V+P DL V+ L L GW +R D ++ GGC+ A G++DA++ TR
Sbjct: 156 LFSGKPQLRVHPDDLQRVDDMLGATLSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATR 215

Query: 213 WERV 216
W+ +
Sbjct: 216 WQEL 219


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL0229  FLGMOTORFLIG  298    e-102    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 298 bits (765), Expect = e-102
Identities = 114/324 (35%), Positives = 191/324 (58%)

Query: 5 GLNKSALLLMSIGEEEAAQVFKFLAPREVQKIGAAMAALKNVTREQVEDVLNDFVQEAEK 64
G K+A+LL+SIG E +++VFK+L+ E++ + +A L+ +T E ++VL +F +
Sbjct: 17 GKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFKELMMA 76

Query: 65 HTALSLDSSEYIRTVLTKALGEDKAGVLIDRILQGSDTSGIEGLKWMDSAAVAELIKNEH 124
+ +Y R +L K+LG KA +I+ + + E ++ D A + I+ EH
Sbjct: 77 QEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFVRRADPANILNFIQQEH 136

Query: 125 PQIIATILVHLDRDQASEIASCFTERLRNDVLLRIATLDGIQPTALRELDDVLTGLLSGS 184
PQ IA IL +LD +AS I S ++ +V RIA +D P +RE++ VL L+
Sbjct: 137 PQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVLEKKLASL 196

Query: 185 DNLKRAPMGGIRTAAEILNFMTSVHEEAVIENVKQYDPDLAQKIIDQMFVFENLLDLEDR 244
+ GG+ EI+N E+ +IE++++ DP+LA++I +MFVFE+++ L+DR
Sbjct: 197 SSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFEDIVLLDDR 256

Query: 245 AIQLLLKEVESEALIIALKGAPPALRQKFLSNMSQRAAELLAEDLDARGPVRVSEVETQQ 304
+IQ +L+E++ + L ALK +++K NMS+RAA +L ED++ GP R +VE Q
Sbjct: 257 SIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTRRKDVEESQ 316

Query: 305 RKILQVVRNLAESGQIVIGGKAED 328
+KI+ ++R L E G+IVI E+
Sbjct: 317 QKIVSLIRKLEEQGEIVISRGGEE 340


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL0230  FLGMRINGFLIF  468    e-162    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 468 bits (1206), Expect = e-162
Identities = 254/562 (45%), Positives = 360/562 (64%), Gaps = 37/562 (6%)

Query: 53 LSRMKTNPRLPFLIGAALAIAAIVALVLWSRAPDYRVLYSNLSDRDGGAIIAALQQANVP 112
L+R++ NPR+P ++ + A+A +VA+VLW++ PDYR L+SNLSD+DGGAI+A L Q N+P
Sbjct: 16 LNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQMNIP 75

Query: 113 YKFADAGGAILVPANQVHETRLKLAAMGLPKGGSVGFELMDNQKFGISQFAEQVNYQRAL 172
Y+FA+ GAI VPA++VHE RL+LA GLPKGG+VGFEL+D +KFGISQF+EQVNYQRAL
Sbjct: 76 YRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQFSEQVNYQRAL 135

Query: 173 EGELQRTVESINAVRAARVHLAIPKPSVFVRDREAPSASVLVDLYPGRVLDEGQVLAVTR 232
EGEL RT+E++ V++ARVHLA+PKPS+FVR++++PSASV V L PGR LDEGQ+ AV
Sbjct: 136 EGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALDEGQISAVVH 195

Query: 233 MVSSSVPDMPAKNVTIVDQDGNLLTQT-ASATGLDASQLKYVQQIERNTQKRIDAILAPI 291
+VSS+V +P NVT+VDQ G+LLTQ+ S L+ +QLK+ +E Q+RI+AIL+PI
Sbjct: 196 LVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDLNDAQLKFANDVESRIQRRIEAILSPI 255

Query: 292 FGAGNARSQVSADVDFSKIEQTSESYGPNGTPQQSAIRSQQTSSSTELAQSGASGVPGAL 351
G GN +QV+A +DF+ EQT E Y PNG ++ +RS+Q + S ++ GVPGAL
Sbjct: 256 VGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVGAGYPGGVPGAL 315

Query: 352 SNTPPQPASAPIVA-------------SNGQPAGPAATPVSDRKDSTTNYELDKTVRHVE 398
SN P P API ++ +A P S +++ T+NYE+D+T+RH +
Sbjct: 316 SNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSNYEVDRTIRHTK 375

Query: 399 QSMGTIKRLSVAVVVNYQPSTDAKGRVTMQPLAADKLAQVQQLVKDAMGYDEKRGDSVNV 458
++G I+RLSVAVVVNY+ D K PL AD++ Q++ L ++AMG+ +KRGD++NV
Sbjct: 376 MNVGDIERLSVAVVVNYKTLADGKP----LPLTADQMKQIEDLTREAMGFSDKRGDTLNV 431

Query: 459 VNSAFSAAADPFANLPWWRQPDMIELGKDIAKWLGVAAAAAALYFMFVRPALRR---AFP 515
VNS FSA + LP+W+Q I+ +WL V A L+ VRP L R
Sbjct: 432 VNSPFSAVDNTGGELPFWQQQSFIDQLLAAGRWLLVLVVAWILWRKAVRPQLTRRVEEAK 491

Query: 516 PPAEPAAAAVPALDGPDDMLALDGLPSPDKKQLAEEDEEHPALLAFENERNRYERNLDYA 575
E A + + L+ D + N+R E
Sbjct: 492 AAQEQAQVRQETEEAVEVRLSKDEQLQQRR----------------ANQRLGAEVMSQRI 535

Query: 576 RTIARQDPKIVATVVKNWVSDE 597
R ++ DP++VA V++ W+S++
Sbjct: 536 REMSDNDPRVVALVIRQWMSND 557


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
BPSL0231  FLGHOOKFLIE  62     7e-16    Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature.

Length = 103

Score = 61.6 bits (149), Expect = 7e-16
Identities = 46/111 (41%), Positives = 62/111 (55%), Gaps = 8/111 (7%)

Query: 3 APVNGIASALQQMQAMAAQAAGGTSPATSLAGSGAASAGSFASAMKASLDKISGDQQKAL 62
+ + GI + Q+QA A +A SFA + A+LD+IS Q A
Sbjct: 1 SAIQGIEGVISQLQATA-MSARAQESLPQ-------PTISFAGQLHAALDRISDTQTAAR 52

Query: 63 GEAHAFEIGAQNVSLNDVMVDMQKANIGFQFGLQVRNKLVSAYNEIMQMSV 113
+A F +G V+LNDVM DMQKA++ Q G+QVRNKLV+AY E+M M V
Sbjct: 53 TQAEKFTLGEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 103


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL0235  TYPE3IMSPROT  62     4e-15    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 62.5 bits (152), Expect = 4e-15
Identities = 17/81 (20%), Positives = 32/81 (39%), Gaps = 1/81 (1%)

Query: 10 AVLAYDAKGGDTAPRVVAKGYGLVAERIIERARDAGLYVHTAPEMV-SLLMQVDLDARIP 68
A+ +G P V K + + + A + G+ + + +L +D IP
Sbjct: 268 AIGILYKRGETPLPLVTFKYTDAQVQTVRKIAEEEGVPILQRIPLARALYWDALVDHYIP 327

Query: 69 PQLYQAVAELLAWLYALERDA 89
+ +A AE+L WL +
Sbjct: 328 AEQIEATAEVLRWLERQNIEK 348


55  BPSL0271  BPSL0285  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
BPSL0271  3       18       -1.462464  flagellar basal-body rod protein
BPSL0272  3       20       -0.366716  putative basal-body rod modification protein
BPSL0273  0       20       -0.498373  putative flagellar hook protein
BPSL0274  -2      17       0.237782   flagellar basal-body rod protein
BPSL0275  0       17       0.249067   flagellar basal-body rod protein
BPSL0276  1       17       0.483835   flagellar L-ring protein precursor
BPSL0277  1       15       0.557644   flagellar P-ring protein precursor
BPSL0278  0       12       0.398592   putative peptidoglycan hydrolase
BPSL0279  1       12       0.969704   conserved hypothetical protein
BPSL0280  1       13       1.171104   putative flagellar hook-associated protein
BPSL0281  0       13       1.158510   putative flagellar hook-associated protein
BPSL0282  0       11       0.832906   putative permease
BPSL0283  -2      11       0.750744   LysR family regulatory protein
BPSL0284  -1      11       0.141528   putative chromate transport protein
BPSL0285  -2      12       -1.759703  putative chromate transporter
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
BPSL0271  FLGHOOKAP1  27     0.029    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 26.8 bits (59), Expect = 0.029
Identities = 10/38 (26%), Positives = 17/38 (44%)

Query: 102 NVDPVQEMVNMISASRSYQANVETLNTAKQLMLKTLTI 139
V+ +E N+ + Y AN + L TA + + I
Sbjct: 508 GVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
BPSL0273  FLGHOOKAP1  34     0.001    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 34.2 bits (78), Expect = 0.001
Identities = 17/58 (29%), Positives = 24/58 (41%)

Query: 356 ISAPGSTNHGTLQGSALENSNVDLTSQLVKLITAQRNYQANAQTIKTQQTVDQTLINL 413
SA L S V+L + L Q+ Y ANAQ ++T + LIN+
Sbjct: 488 SSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545



Score = 30.3 bits (68), Expect = 0.017
Identities = 11/31 (35%), Positives = 17/31 (54%)

Query: 6 GLSGLAGASSDLDVIGNNIANANTVGFKGST 36
+SGL A + L+ NNI++ N G+ T
Sbjct: 7 AMSGLNAAQAALNTASNNISSYNVAGYTRQT 37


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
BPSL0274  FLGHOOKAP1  29     0.019    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 29.2 bits (65), Expect = 0.019
Identities = 9/34 (26%), Positives = 18/34 (52%)

Query: 4 LIYTAMTGATQSLEQQSVVANNLANASTTGFRAQ 37
LI AM+G + + +NN+++ + G+ Q
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQ 36


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
BPSL0275  FLGHOOKAP1  42     1e-06    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 42.3 bits (99), Expect = 1e-06
Identities = 10/48 (20%), Positives = 23/48 (47%)

Query: 213 TLKQGYVESSNVNVVQELVNMIQTQRAYEINSKAVTTSDQMLQTVTQM 260
L S VN+ +E N+ + Q+ Y N++ + T++ + + +
Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545



Score = 40.3 bits (94), Expect = 5e-06
Identities = 19/80 (23%), Positives = 34/80 (42%), Gaps = 14/80 (17%)

Query: 4 SLYIAATGMNAQQAQMDVISNNLANVSTNGFKGSRAVFEDLLYQTVRQPGANSTQQTELP 63
+ A +G+NA QA ++ SNN+++ + G+ RQ + + L
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYT--------------RQTTIMAQANSTLG 48

Query: 64 SGLQLGTGVQQVATERLYTQ 83
+G +G GV +R Y
Sbjct: 49 AGGWVGNGVYVSGVQREYDA 68


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL0276  FLGLRINGFLGH  205    1e-68    Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 205 bits (522), Expect = 1e-68
Identities = 128/222 (57%), Positives = 156/222 (70%), Gaps = 7/222 (3%)

Query: 25 AALAAAALALAGCAQIPREPITQQPMSAMPPMPPAMQAPGSIY---NPGYAG-RPLFEDQ 80
A + L+L GCA IP P+ Q SA P P A GSI+ P G +PLFED+
Sbjct: 10 AISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINYGYQPLFEDR 69

Query: 81 RPRNVGDILTIVIAENINATKSSGANTNRQGNTSFDVPTAG-FLGGLF--NKANLSAQGA 137
RPRN+GD LTIV+ EN++A+KSS AN +R G T+F T +L GLF +A++ A G
Sbjct: 70 RPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNARADVEASGG 129

Query: 138 NKFAATGGASAANTFNGTITVTVTNVLPNGNLVVSGEKQMLINQGNEFVRFSGIVNPNTI 197
N F GGA+A+NTF+GT+TVTV VL NGNL V GEKQ+ INQG EF+RFSG+VNP TI
Sbjct: 130 NTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRFSGVVNPRTI 189

Query: 198 SGQNSVYSTQVADARIEYSAKGYINEAETMGWLQRFFLNIAP 239
SG N+V STQVADARIEY GYINEA+ MGWLQRFFLN++P
Sbjct: 190 SGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSP 231


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL0277  FLGPRINGFLGI  371    e-129    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 371 bits (953), Expect = e-129
Identities = 164/392 (41%), Positives = 225/392 (57%), Gaps = 27/392 (6%)

Query: 4 RVVRPLVAARRRAAACCALAACMLALAFAPAAARAERLKDLAQIQGVRDNPLIGYGLVVG 63
RV+R + AA +A L+ PA A R+KD+A +Q RDN LIGYGLVVG
Sbjct: 2 RVLRIIAAALVFSALPF--------LSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVG 53

Query: 64 LDGTGDQTMQTPFTTQTLANMLANLGISINNGSANGGGSSAMTNMQLKNVAAVMVTATLP 123
L GTGD +PFT Q++ ML NLGI+ G +N KN+AAVMVTA LP
Sbjct: 54 LQGTGDSLRSSPFTEQSMRAMLQNLGITTQGGQSN-----------AKNIAAVMVTANLP 102

Query: 124 PFARPGEAIDVTVSSLGNAKSLRGGTLLLTPLKGADGQVYALAQGNMAVGGAGASANGSR 183
PFA PG +DVTVSSLG+A SLRGG L++T L GADGQ+YA+AQG + V G A + +
Sbjct: 103 PFASPGSRVDVTVSSLGDATSLRGGNLIMTSLSGADGQIYAVAQGALIVNGFSAQGDAAT 162

Query: 184 VQVNQLAAGRIAGGAIVERSVPNAVAQMNGVLQLQLNDMDYGTAQRIVSAVNS----SFG 239
+ + R+ GAI+ER +P+ L LQL + D+ TA R+ VN+ +G
Sbjct: 163 LTQGVTTSARVPNGAIIERELPSKFKDSV-NLVLQLRNPDFSTAVRVADVVNAFARARYG 221

Query: 240 AGTATALDGRTIQLTAPADSAQQVAFMARLQNLEVSPERAAAKVILNARTGSIVMNQMVT 299
A D + I + P + MA ++NL V + AKV++N RTG+IV+ V
Sbjct: 222 DPIAEPRDSQEIAVQKPRVA-DLTRLMAEIENLTVETD-TPAKVVINERTGTIVIGADVR 279

Query: 300 LQNCAVAHGNLSVVVNTQPVVSQPGPFSNGQTVVAQQSQIQLKQDNGSLRMVTAGANLAD 359
+ AV++G L+V V P V QP PFS GQT V Q+ I Q+ + + G +L
Sbjct: 280 ISRVAVSYGTLTVQVTESPQVIQPAPFSRGQTAVQPQTDIMAMQEGSKV-AIVEGPDLRT 338

Query: 360 VVKALNSLGATPADLMSILQAMKAAGALRADL 391
+V LNS+G +++ILQ +K+AGAL+A+L
Sbjct: 339 LVAGLNSIGLKADGIIAILQGIKSAGALQAEL 370


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL0278  FLGFLGJ  227    3e-75    Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 227 bits (579), Expect = 3e-75
Identities = 124/297 (41%), Positives = 173/297 (58%), Gaps = 15/297 (5%)

Query: 15 ALDVQGFDALRSKATAAAPREGVKMVAGQFDAMFTQMMLKSMRDATPSDGLLDSSSSKMY 74
A D Q + L++KA P ++ VA Q + MF QMMLKSMRDA P DGL S +++Y
Sbjct: 12 AWDAQSLNELKAKA-GEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDGLFSSEHTRLY 70

Query: 75 TSMLDQQLAQQMSS-KGIGVADALTKQLLRNANVAPDAQGEGGLAAMNALAKAYANSNGA 133
TSM DQQ+AQQM++ KG+G+A+ + KQ+ + ++ + Y N +
Sbjct: 71 TSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLETVVRYQNQALS 130

Query: 134 PGNGALAGTRGYSAASALTPPLKGNGNSAQADAFVEKMALAAQAASATTGIPARFIVGQA 193
P + + AF+ +++L AQ AS +G+P I+ QA
Sbjct: 131 ------------QLVQKAVPRNYDDSLPGDSKAFLAQLSLPAQLASQQSGVPHHLILAQA 178

Query: 194 ALESGWGKREIRGANGESSYNVFGIKATKGWTGRTVSAVTTEYVNGKPHRVVAQFRAYDS 253
ALESGWG+R+IR NGE SYN+FG+KA+ W G TTEY NG+ +V A+FR Y S
Sbjct: 179 ALESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSS 238

Query: 254 YEHAMTDYANLLKNNPRYASVLNAGHNAEGFAHGMQKAGYATDPHYAKKLISIMQQI 310
Y A++DY LL NPRYA+V A +AE A +Q AGYATDPHYA+KL +++QQ+
Sbjct: 239 YLEALSDYVGLLTRNPRYAAVTTAA-SAEQGAQALQDAGYATDPHYARKLTNMIQQM 294


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
BPSL0280  FLGHOOKAP1  231    4e-70    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 231 bits (591), Expect = 4e-70
Identities = 162/444 (36%), Positives = 253/444 (56%), Gaps = 12/444 (2%)

Query: 3 NTLMNLGVSGLNAALWGLTTTGQNISNAATPGYSVERPVYAEASGQYTSSGYLPQGVSTV 62
++L+N +SGLNAA L T NIS+ GY+ + + A+A+ + G++ GV
Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS 60

Query: 63 TVERQYNQYLSNQLNAAQTQGSSLSTYYTLVAQLNNYVGSPTAGIATAITNYFTGLQTVA 122
V+R+Y+ +++NQL AAQTQ S L+ Y +++++N + + T+ +AT + ++FT LQT+
Sbjct: 61 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV 120

Query: 123 NNAADPSARQTAMSNAQTLASQLVAAGQQYSQLRQSVNSQLTDTVTQINSYTSQIAQLNE 182
+NA DP+ARQ + ++ L +Q Q + VN + +V QIN+Y QIA LN+
Sbjct: 121 SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND 180

Query: 183 QIA--SASSQGQPPNQLLDQRDLAVSKLSQLAGVQV-VQSNGNYSVFLSGGQPLVVGNAS 239
QI+ + G PN LLDQRD VS+L+Q+ GV+V VQ G Y++ ++ G LV G+ +
Sbjct: 181 QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA 240

Query: 240 YQLATVASPSDPSELTI-VSKGVAGSAQPGPTQYLPDVSLTGGALGGLLAFRSQTLDPAQ 298
QLA V S +DPS T+ G AG+ + +P+ L G+LGG+L FRSQ LD +
Sbjct: 241 RQLAAVPSSADPSRTTVAYVDGTAGNIE------IPEKLLNTGSLGGILTFRSQDLDQTR 294

Query: 299 AQLGALAVSFASQVNAQNALGVDMSGNPGGSLFAVGAPAVYANQNNTGSATLSVSFVDGT 358
LG LA++FA N Q+ G D +G+ G FA+G PAV N N G + + D +
Sbjct: 295 NTLGQLALAFAEAFNTQHKAGFDANGDAGEDFFAIGKPAVLQNTKNKGDVAIGATVTDAS 354

Query: 359 QPTTSDYALSYDGAKYTLTDRATGSVVGTATPSSTPPTMTIGGLKLSLSSTPNAGDSFTV 418
+DY +S+D ++ +T R + T TP + + GL+L+ + TP DSFT+
Sbjct: 355 AVLATDYKISFDNNQWQVT-RLASNTTFTVTPDAN-GKVAFDGLELTFTGTPAVNDSFTL 412

Query: 419 LPTRGALDGFSLATANGSAIAAAS 442
P A+ + + + IA AS
Sbjct: 413 KPVSDAIVNMDVLITDEAKIAMAS 436



Score = 83.1 bits (205), Expect = 9e-19
Identities = 46/105 (43%), Positives = 66/105 (62%)

Query: 561 GTNDGRNALALSQLVNSKTMNNGTTTLTGAYAGYVNAIGNAASQLKASSAAQTALVGQIT 620
G +D RN AL L ++ G + AYA V+ IGN + LK SSA Q +V Q++
Sbjct: 441 GDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGNKTATLKTSSATQGNVVTQLS 500

Query: 621 QAQQSVSGVNQNEEAANLMQYQQLYQANAKVIQTANSVFQTVLGL 665
QQS+SGVN +EE NL ++QQ Y ANA+V+QTAN++F ++ +
Sbjct: 501 NQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
BPSL0281  FLAGELLIN  41     6e-06    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 41.2 bits (96), Expect = 6e-06
Identities = 55/369 (14%), Positives = 113/369 (30%), Gaps = 10/369 (2%)

Query: 16 MNDQQAQIAQLYQQVSSGISLTTPADNPLAAAQAVQLSATSATLAQYTQNQTIVQTALQT 75
+N Q+ ++ +++SSG+ + + D+ A A + ++ L Q ++N + QT
Sbjct: 17 LNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQASRNANDGISIAQT 76

Query: 76 EDTTLTSVNDVLNAAYQALMHAGDGGLSDSDRAALAAQIQGSRDHLLTLANTADGAGNYL 135
+ L +N+ L + + A +G SDSD ++ +IQ + + ++N G +
Sbjct: 77 TEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDRVSNQTQFNGVKV 136

Query: 136 FAGFQPTTQPFSNKPGGGVTY------AGDYGARAVQIADTRTVSQGDNGANVFMSVPFL 189
+ G +T G + + + GD ++ +
Sbjct: 137 LSQDNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEATVGDLKSSFKNVTGYD 196

Query: 190 GSLPVPAAGASNTGTGTIGAVSITNPSDPTNTHQFTITFGGTAAAPTYTVTDNSVTPPTT 249
+ +G + + T A T D T +T
Sbjct: 197 TYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAENNTAVDLFKTTKST 256

Query: 250 TAAQAYSSGQGINLGGQTVAVSGKPAVGDTFTVTPAPQAGTDVFATLD----TVIAALKS 305
+ G GG+ V T V T++ T+ A +
Sbjct: 257 AGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTTINGEKVTLTVADIT 316

Query: 306 PVGNSQTASTALTNTMATASTKLMNTMTNVLTVQASVGGRLQEVKAMQAVTTTNTLQTTN 365
+ A+T ++ S + T S E + T+
Sbjct: 317 AGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESKITVNGAE 376

Query: 366 SLSNLTDTN 374
+N
Sbjct: 377 YTANAAGDK 385


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL0285  ACRIFLAVINRP  28     0.021    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 28.3 bits (63), Expect = 0.021
Identities = 17/63 (26%), Positives = 30/63 (47%), Gaps = 2/63 (3%)

Query: 110 YVQQGMMPVTAGLVVASAVLISEASNRSALQWGITAAVAAL-AYRTRVHPLWLLAGGALA 168
Y G++ T GL +A+LI E + + G A L A R R+ P+ + + +
Sbjct: 925 YFMVGLL-TTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFIL 983

Query: 169 GLV 171
G++
Sbjct: 984 GVL 986


56  BPSL0427  BPSL0435  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
BPSL0427  0       8        2.861662   C4-dicarboxylate transport transcriptional
BPSL0428  1       12       2.697935   putative membrane protein
BPSL0429  1       12       2.970592   conserved hypothetical protein
BPSL0430  -2      11       2.595180   putative thioesterase
BPSL0431  -2      9        1.432842   acetyltransferase (GNAT) family protein
BPSL0432  -1      10       1.306557   putative magnesium chelatase subunit
BPSL0433  -2      11       -0.467128  conserved hypothetical protein
BPSL0434  -1      11       -0.490857  nitrogen regulatory protein P-II 1
BPSL0435  0       9        -0.169918  ammonium transporter family protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
BPSL0427  HTHFIS  445    e-156    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 445 bits (1145), Expect = e-156
Identities = 152/483 (31%), Positives = 231/483 (47%), Gaps = 47/483 (9%)

Query: 4 RLQVIYIEDDELVRRASVQSLQLAGFDVVGFGSVEAAEKAIVGDATGVIVSDIRLPGASG 63
++ +DD +R Q+L AG+DV + + I ++V+D+ +P +
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 64 LELLAQCRERTPDVPVVLVTGHGDISMAVQAMRDGAYDFIEKPFAAERLTETVRRALERR 123
+LL + ++ PD+PV++++ A++A GAYD++ KPF L + RAL
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 124 ALVLENHALRRELAGQGVVAPRIIGRSPAIEQVRRLIANVAPTDASVLINGDTGAGKELI 183
+L ++GRS A++++ R++A + TD +++I G++G GKEL+
Sbjct: 123 K------RRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELV 176

Query: 184 ARSLHELSPRRDKPFIAVNCGALPEPMFESEMFGYEPGAFTGAAKRRIGKLEYASGGTLF 243
AR+LH+ RR+ PF+A+N A+P + ESE+FG+E GAFTGA R G+ E A GGTLF
Sbjct: 177 ARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLF 236

Query: 244 LDEIESMPLALQVKLLRVLQDGVLERLGSNQPIRVNCRVVAAAKGDMSEHVAAGTFRRDL 303
LDEI MP+ Q +LLRVLQ G +G PIR + R+VAA D+ + + G FR DL
Sbjct: 237 LDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDL 296

Query: 304 LYRLNVVTIALPPLAERREDIVPLFEHFMLDAAVRYGRPAPLLTDRQRASLMQRDWPGNV 363
YRLNVV + LPPL +R EDI L HF + A + G + WPGNV
Sbjct: 297 YYRLNVVPLRLPPLRDRAEDIPDLVRHF-VQQAEKEGLDVKRFDQEALELMKAHPWPGNV 355

Query: 364 RELRNAADRFVLGVTEGIVG---------------------------------------- 383
REL N R + ++
Sbjct: 356 RELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQY 415

Query: 384 DAGPETDEHAEQSLKERVEQFERAVIAETLNRTGGAVATTADKLHVGKATLYEKMKRYGL 443
A + + E +I L T G AD L + + TL +K++ G+
Sbjct: 416 FASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475

Query: 444 SAK 446
S
Sbjct: 476 SVY 478


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0428  TYPE4SSCAGX  28  0.025  Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein signature.

Length = 522

Score = 27.8 bits (61), Expect = 0.025
Identities = 19/64 (29%), Positives = 31/64 (48%), Gaps = 8/64 (12%)

Query: 97 EFVAVAMNYDPPMYVANYAQTRQ------LPFKVALDDGSVAK-QFGNVQLTPTTFVIGK 149
++V A+ +P NY Q + +P ++ DDG+ F N+ L P FV+
Sbjct: 386 QYVHNALKRNPVPRNYNYYQAPEKRSKHIMPSEI-FDDGTFTYFGFKNITLQPAIFVVQP 444

Query: 150 DGKI 153
DGK+
Sbjct: 445 DGKL 448


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0431  SACTRNSFRASE  32  6e-04  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 31.8 bits (72), Expect = 6e-04
Identities = 20/83 (24%), Positives = 30/83 (36%), Gaps = 6/83 (7%)

Query: 47 GEALLVAQARDE--GIVGFVSVWEPERFVHHLYVAGTRLREGIGAALLRALPGW----PA 100
G+A + + G + S W + + VA ++G+G ALL W
Sbjct: 64 GKAAFLYYLENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHF 123

Query: 101 ARYRLKCLVRNERALAFYRAHGF 123
L+ N A FY H F
Sbjct: 124 CGLMLETQDINISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0432  SYCECHAPRONE  29  0.024  Gram-negative bacterial type III secretion SycE cha...
		>SYCECHAPRONE#Gram-negative bacterial type III secretion SycE chaperone signature.

Length = 130

Score = 28.9 bits (64), Expect = 0.024
Identities = 22/84 (26%), Positives = 32/84 (38%), Gaps = 8/84 (9%)

Query: 158 ELYLPLPSAAEAALVPGVTVYGAADLPALCAHLADTPDGRLAPVAAPRLDALPAAATADL 217
+L L +P E + GV V C H+ + P G++ P LD T
Sbjct: 14 QLSLSIPDTIEPVI--GVKVG-----EFAC-HITEHPVGQILMFTLPSLDNNDEKETLLS 65

Query: 218 ADVIGQAGAKRALEVAAAGGHHML 241
++ Q K L GGH +L
Sbjct: 66 HNIFSQDILKPILSWDEVGGHPVL 89


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0435  PYOCINKILLER  31  0.010  Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 31.3 bits (70), Expect = 0.010
Identities = 19/72 (26%), Positives = 29/72 (40%), Gaps = 1/72 (1%)

Query: 17 AMADDASSAPAAASAATASDTSAGAAASAPAASAAPAAPAAPAASAPAAASAAAPASAAA 76
A A + + AAA A ++ A A+ AA+ A PA + A AA + A
Sbjct: 216 AAAANKAREQAAAEAKRKAEEQARQQAAIRAANTY-AMPANGSVVATAAGRGLIQVAQGA 274

Query: 77 APAAPTAPFSVD 88
A A ++
Sbjct: 275 ASLAQAISDAIA 286


57  BPSL0806  BPSL0815  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
BPSL0806  -2       9        2.468384  response regulator protein
BPSL0807  -2       9        2.080273  sensor kinase protein
BPSL0808  -3      10        1.373762  subfamily S1C unassigned peptidase
BPSL0809   0      12       -0.438013  putative exported protein
BPSL0810   0      13        0.171378  conserved hypothetical protein
BPSL0811   0      12       -0.282664  putative membrane protein
BPSL0812  -2      11        0.671718  TetR family regulatory protein
BPSL0814  -1      10        0.854021  putative RND family acriflavine resistance
BPSL0815  -1      12        0.263620  putative RND family acriflavine resistance
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0806  HTHFIS  95  6e-25  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 95.3 bits (237), Expect = 6e-25
Identities = 36/124 (29%), Positives = 60/124 (48%), Gaps = 1/124 (0%)

Query: 2 RILLVEDDRMIAEGVRKALKADGCAVDWVQDGDAALTALGGEAYDLLLLDLGLPKRDGID 61
IL+ +DD I + +AL G V + + DL++ D+ +P + D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VLRTLRARGLALPVLILTARDAVADRVKGLDAGADDYLVKPFDLDE-LAARMRALIRRQS 120
+L ++ LPVL+++A++ +K + GA DYL KPFDL E + RAL +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 121 GRSE 124
S+
Sbjct: 125 RPSK 128


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0808  V8PROTEASE  78  8e-18  V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 77.7 bits (191), Expect = 8e-18
Identities = 38/207 (18%), Positives = 71/207 (34%), Gaps = 40/207 (19%)

Query: 58 QRRAAPQLPIDPDDP-----FYQFFRHFYGQIPGMGGGRQPQPDDQPSTSLGSGFIISAD 112
++R + + +D I Q + T + SG ++
Sbjct: 62 EQREHANVILPNNDRHQITDTTNGHYAPVTYI---------QVEAPTGTFIASGVVV-GK 111

Query: 113 GYILTNAHVIDGANVVTVKLTDKR-----------EYKA-KVVGADKQSDVAVLKIDA-- 158
+LTN HV+D + L + A ++ + D+A++K
Sbjct: 112 DTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGEGDLAIVKFSPNE 171

Query: 159 ------SGLPIVKIGDPAQSKVGQWVVAIGSPYGFDNTVTSGIISAKSRALPDENYTPFI 212
+ + + A+++V Q + G P +K + + +
Sbjct: 172 QNKHIGEVVKPATMSNNAETQVNQNITVTGYPGDKPVATMW---ESKGKITYLKGE--AM 226

Query: 213 QTDVPVNPGNSGGPLFNLNGEVIGINS 239
Q D+ GNSG P+FN EVIGI+
Sbjct: 227 QYDLSTTGGNSGSPVFNEKNEVIGIHW 253


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0812  HTHTETR  126  2e-38  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 126 bits (317), Expect = 2e-38
Identities = 81/209 (38%), Positives = 115/209 (55%), Gaps = 1/209 (0%)

Query: 1 MARRTKEEALATRDRILDAAEHVFFEKGVSHTSLADIAQHAGVTRGAIYWHFASKSELFD 60
MAR+TK+EA TR ILD A +F ++GVS TSL +IA+ AGVTRGAIYWHF KS+LF
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 61 AMFDRVLLPIDELKAGT-GEPHADPLGRIREILIWCLLGAARDPQLRRVFSILFMKCEYV 119
+++ I EL+ + DPL +REILI L + + R + I+F KCE+V
Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120

Query: 120 ADMGPLLQRNREGMRDALRNIEADLAQGVANGQLPADLDTWRATLMLHTLVSGFVRDMLM 179
+M + Q R ++ IE L + LPADL T RA +++ +SG + + L
Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180

Query: 180 LPGEIDAERHAEKLVDGCFDMLRTSPAMR 208
P D ++ A V +M P +R
Sbjct: 181 APQSFDLKKEARDYVAILLEMYLLCPTLR 209


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0814  RTXTOXIND  42  4e-06  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 41.7 bits (98), Expect = 4e-06
Identities = 42/266 (15%), Positives = 80/266 (30%), Gaps = 75/266 (28%)

Query: 92 KIDPAPYIAQLNSAKATLAKAQANLATQNALVARYKVLVAANAVSKQQYDDAVAAQGQAA 151
+++ A+ + A + + + + + + + L+ A++K + +A
Sbjct: 206 ELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAV 265

Query: 152 ADVGAGKAAV-------------------------------------------ETAQINL 168
++ K+ + +
Sbjct: 266 NELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQ 325

Query: 169 GYTDVVSPITGRV-GISQVTPGAYVQASQATLMSTVQQLDPVYVDLTQSSLDGLKLRQDI 227
+ + +P++ +V + T G V ++ TLM V + D + V + D +
Sbjct: 326 QASVIRAPVSVKVQQLKVHTEGGVVTTAE-TLMVIVPEDDTLEVTALVQNKDIGFINVG- 383

Query: 228 QSGRIK-------TEGPGAAKVTLILEDGKPYPERGKLQFSDVTVDQTTGSVT--IRAI- 277
Q+ IK G KV I D DQ G V I +I
Sbjct: 384 QNAIIKVEAFPYTRYGYLVGKVKNI--------------NLDAIEDQRLGLVFNVIISIE 429

Query: 278 -----FPNKQRVLLPGMFVRARIEEG 298
NK L GM V A I+ G
Sbjct: 430 ENCLSTGNKNIPLSSGMAVTAEIKTG 455



Score = 30.6 bits (69), Expect = 0.012
Identities = 20/122 (16%), Positives = 35/122 (28%), Gaps = 20/122 (16%)

Query: 1 MRVERVPYRLITVATAAVFLAACGKKESAPPPQTPEVGVVTVQPQPVPVVSELPGRTSAY 60
R V Y ++ A L+ G+ E G +T + +
Sbjct: 55 RRPRLVAYFIMGFLVIAFILSVLGQVEIVATAN----GKLTHSGRSKEIKPIENSIVKEI 110

Query: 61 LVAQVRARVDGIVLRREFTEGSDVKAGQRLYKIDPAPYIAQLNSAKATLAKAQANLATQN 120
+V EG V+ G L K+ A +++L +A+
Sbjct: 111 IVK----------------EGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQ 154

Query: 121 AL 122
L
Sbjct: 155 IL 156


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0815  ACRIFLAVINRP  1272  0.0  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1272 bits (3292), Expect = 0.0
Identities = 674/1035 (65%), Positives = 822/1035 (79%), Gaps = 2/1035 (0%)

Query: 1 MAKFFIDRPIFAWVIAIILMLAGVAAIFTLPIAQYPTIAPPSIQITANYPGASAKTVEDT 60
MA FFI RPIFAWV+AIILM+AG AI LP+AQYPTIAPP++ ++ANYPGA A+TV+DT
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTQVIEQQMSGLDNFLYMSSTSDDSGNATITITFAPGTNPDIAQVQVQNKLSLATPILPQ 120
VTQVIEQ M+G+DN +YMSSTSD +G+ TIT+TF GT+PDIAQVQVQNKL LATP+LPQ
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 121 VVQQLGLSVTKSSSSFLLVLAFNSEDGSMNKYDLANYVASHVKDPISRINGVGTVTLFGS 180
VQQ G+SV KSSSS+L+V F S++ + D+++YVAS+VKD +SR+NGVG V LFG+
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 181 QYAMRIWLDPTKLTNYGLTPVDVTSAISAQNVQIAGGQLGGTPAVPGTVLQATITEATLL 240
QYAMRIWLD L Y LTPVDV + + QN QIA GQLGGTPA+PG L A+I T
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 241 QTPEQFGNILLKVNQDGSQVRLKDVAQIGLGGETYNFDTKYNGQPTAALGIQLATNANAL 300
+ PE+FG + L+VN DGS VRLKDVA++ LGGE YN + NG+P A LGI+LAT ANAL
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 301 ATAKAVRAKIDEMSAYFPHGLVVKYPYDTTPFVRLSIEEVVKTLLEGIVLVFLVMYLFLQ 360
TAKA++AK+ E+ +FP G+ V YPYDTTPFV+LSI EVVKTL E I+LVFLVMYLFLQ
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 361 NLRATIIPTIAVPVVLLGTFAIMSMVGFSINVLSMFGLVLAIGLLVDDAIVVVENVERVM 420
N+RAT+IPTIAVPVVLLGTFAI++ G+SIN L+MFG+VLAIGLLVDDAIVVVENVERVM
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 421 AEEGLPPKEATRKAMGQITGALVGVALVLSAVFVPVAFSGGSVGAIYRQFSLTIVSAMVL 480
E+ LPPKEAT K+M QI GALVG+A+VLSAVF+P+AF GGS GAIYRQFS+TIVSAM L
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 481 SVLVALILTPALCATILKPIPQGHHEEKKGFFGWFNRTFNSSRDKYHVGVHHVIKRSGRW 540
SVLVALILTPALCAT+LKP+ HHE K GFFGWFN TF+ S + Y V ++ +GR+
Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540

Query: 541 LIIYLAVIVAVGLLFVRLPKSFLPDEDQGLMFVIVQTPSGSTQETTARTLANISDYLLTQ 600
L+IY ++ + +LF+RLP SFLP+EDQG+ ++Q P+G+TQE T + L ++DY L
Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600

Query: 601 EKDIVESAFTVNGFSFAGRGQNSGLVFVKLKDYSQRQSSDQKVQALIGRMFGRYAGYKDA 660
EK VES FTVNGFSF+G+ QN+G+ FV LK + +R + +A+I R +D
Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660

Query: 661 LVIPFNPPSIPELGTAAGFDFELTDNAGLGHDALMAARNQLLGMAAKDP-TLRGVRPNGL 719
VIPFN P+I ELGTA GFDFEL D AGLGHDAL ARNQLLGMAA+ P +L VRPNGL
Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720

Query: 720 NDTPQYKVDIDREKANALGVTADAIDQTFSIAWASKYVNNFLDTDGRIKKVYVQSDAPFR 779
DT Q+K+++D+EKA ALGV+ I+QT S A YVN+F+D GR+KK+YVQ+DA FR
Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFID-RGRVKKLYVQADAKFR 779

Query: 780 MTPEDMNIWYVRNGSGGMVPFSAFATGHWTYGSPKLERYNGISAMEIQGQAAPGKSTGQA 839
M PED++ YVR+ +G MVPFSAF T HW YGSP+LERYNG+ +MEIQG+AAPG S+G A
Sbjct: 780 MLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDA 839

Query: 840 MTAMETLAKKLPTGIGYSWTGLSFQEIQSGSQAPILYAISILVVFLCLAALYESWSIPFS 899
M ME LA KLP GIGY WTG+S+QE SG+QAP L AIS +VVFLCLAALYESWSIP S
Sbjct: 840 MALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVS 899

Query: 900 VIMVVPLGVIGALLAATLRGLENDVFFQVGLLTTVGLSAKNAILIVEFARELQQTENMGP 959
V++VVPLG++G LLAATL +NDV+F VGLLTT+GLSAKNAILIVEFA++L + E G
Sbjct: 900 VMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGV 959

Query: 960 IEAALEAARLRLRPILMTSLAFILGVMPLAISNGAGSASQHAIGTGVIGGMITATFLAIF 1019
+EA L A R+RLRPILMTSLAFILGV+PLAISNGAGS +Q+A+G GV+GGM++AT LAIF
Sbjct: 960 VEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIF 1019

Query: 1020 MIPMFFVKVRAVFSG 1034
+P+FFV +R F G
Sbjct: 1020 FVPVFFVVIRRCFKG 1034


58  BPSL0829  BPSL0837  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
BPSL0829   0      13       -0.157333  putative periplasmic ABC transporter
BPSL0830   0      13        0.612273  putative transmembrane ABC transporter protein
BPSL0831   0      11        2.167689  putative ABC transporter permease protein
BPSL0832   1      13        3.869743  putative haloacid dehalogenase hydrolase
BPSL0833   0      12        3.528385  putative sugar ABC transporter protein
BPSL0834  -1      12        4.805715  putative exported protein
BPSL0835   0      12        5.689915  LysR family transcriptional regulator
BPSL0836   0      12        5.924717  family S12 unassigned peptidase
BPSL0837   0      13        5.680340  putative transporter protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0829  MALTOSEBP  33  0.002  Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 32.8 bits (74), Expect = 0.002
Identities = 69/316 (21%), Positives = 116/316 (36%), Gaps = 55/316 (17%)

Query: 125 DSLSYNGQLYALPFYVESSMTFYRKDLFAAKGLKMPEQP-TYEQIAEFADKLTDRANGTY 183
D++ YNG+L A P VE+ Y KDL +P P T+E+I +L +A G
Sbjct: 121 DAVRYNGKLIAYPIAVEALSLIYNKDL-------LPNPPKTWEEIPALDKEL--KAKGKS 171

Query: 184 GICLRGKAGWGENMAYVSTVVNTFGGRWFD-ENW-----NAQLTSPEWKKAINFYVNLLK 237
+ + + + ++ GG F EN + + + K + F V+L+K
Sbjct: 172 ALMFNLQEPY-----FTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIK 226

Query: 238 KNGPPGASSNGFNENLTLTASGKCAMWIDATVAAGMLYNKQQSQVAEKIGFAAAPVAATP 297
+ E G+ AM I+ A N S+V G P
Sbjct: 227 NKHMNADTDYSIAE--AAFNKGETAMTINGPWAWS---NIDTSKV--NYGVTVLPTFKGQ 279

Query: 298 KGSHWLWAWALAIPKTSKQQDAAKKFV-TWATSKQYVEMVGKDEGWASVPPGTRQSTYQR 356
++ + I S ++ AK+F+ + + + +E V KD+
Sbjct: 280 PSKPFVGVLSAGINAASPNKELAKEFLENYLLTDEGLEAVNKDK---------------- 323

Query: 357 AEYKAAAPFSEFVLKAIQTADPTDPSLKKV---PYTGVQYVGIPEFQSFGTVVGQAIAGA 413
P LK+ + DP + G IP+ +F V A+ A
Sbjct: 324 -------PLGAVALKSYEEELAKDPRIAATMENAQKGEIMPNIPQMSAFWYAVRTAVINA 376

Query: 414 VAGQTSVDQALAAGQA 429
+G+ +VD+AL Q
Sbjct: 377 ASGRQTVDEALKDAQT 392


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0833  PF05272  30  0.021  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.0 bits (67), Expect = 0.021
Identities = 14/35 (40%), Positives = 17/35 (48%)

Query: 50 VVFVGPSGCGKSTLMRMIAGLEEISGGELLIDGAK 84
VV G G GKSTL+ + GL+ S I K
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGK 633


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0834  PF06776  30  0.020  Invasion associated locus B
		>PF06776#Invasion associated locus B

Length = 214

Score = 29.5 bits (66), Expect = 0.020
Identities = 11/49 (22%), Positives = 15/49 (30%), Gaps = 2/49 (4%)

Query: 1 MKTGRRHFVRSVASASAALAAAAWSPARAAIDAPASPATALSLTPGRWS 49
+ + RR R+ A A A A A A+ G W
Sbjct: 38 LASCRRLARRNGARLMLAGAMAI--ALSFGWSDRADAQGAVRSVHGDWQ 84



Score = 28.7 bits (64), Expect = 0.039
Identities = 7/37 (18%), Positives = 13/37 (35%)

Query: 10 RSVASASAALAAAAWSPARAAIDAPASPATALSLTPG 46
+++ A L+ S R A A A ++
Sbjct: 25 KAIQMGPAELSPMLASCRRLARRNGARLMLAGAMAIA 61


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0836  BLACTAMASEA  30  0.018  Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 29.8 bits (67), Expect = 0.018
Identities = 11/35 (31%), Positives = 15/35 (42%)

Query: 57 REDALFRFASVSKPIVSAAAMRAVAAGKLDLDASI 91
R D F S K ++ A + V AG L+ I
Sbjct: 57 RADERFPMMSTFKVVLCGAVLARVDAGDEQLERKI 91


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL0837  TCRTETB  35  4e-04  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 35.2 bits (81), Expect = 4e-04
Identities = 31/155 (20%), Positives = 59/155 (38%), Gaps = 5/155 (3%)

Query: 26 LLALATAGFITIVTEALPAGLLPLMGRDLRVSDALVGQLVTVYAAGSIVAAIPLVAATRG 85
L+ L F +++ E + LP + D A + T + + +
Sbjct: 16 LIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ 75

Query: 86 MRRRPLLLAALAGFVVANTATAASPYYAPVLV-ARCVAGVSAGLLWALLAGYASRMVDAR 144
+ + LLL + + + +L+ AR + G A AL+ +R +
Sbjct: 76 LGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKE 135

Query: 145 QRGRAIAIAMLGAPVAMSVGI-PL-GTALGAALGW 177
RG+A ++G+ VAM G+ P G + + W
Sbjct: 136 NRGKAFG--LIGSIVAMGEGVGPAIGGMIAHYIHW 168


59  BPSL1265  BPSL1273  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
BPSL1265   1      15        0.405095  cytochrome c oxidase subunit 1
BPSL1266   1      14        0.755366  putative cytochrome c related protein
BPSL1267   0      12        0.996182  conserved hypothetical protein
BPSL1268  -2      11        2.031120  conserved hypothetical protein
BPSL1269  -3      11        2.580410  AhpC/TSA family membrane protein
BPSL1270  -3      11        2.346722  conserved hypothetical protein
BPSL1271  -2      13        4.032575  putative transport system, membrane protein
BPSL1272   0      11        4.676667  putative transport system, membrane protein
BPSL1273   0      12        4.137391  putative transport system, membrane protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL1265  PF01540  29  0.015  Adhesin lipoprotein
		>PF01540#Adhesin lipoprotein

Length = 475

Score = 28.9 bits (64), Expect = 0.015
Identities = 26/84 (30%), Positives = 38/84 (45%), Gaps = 3/84 (3%)

Query: 11 RTGRALADLLLKQQDFEVTALVRRPDFA--LPGAKVVVADLTGDFSSAFN-GITHAIYAA 67
+ G+ AD LKQ + L + PD++ L +A+ T F A + G AI +
Sbjct: 35 KNGKEKADAALKQANALAEELKKNPDYSKILETLNKEIAEATKSFKEAGSYGDYPAIISK 94

Query: 68 GSAESEGATEEEQIDRDAVARAAD 91
SA E A E+Q A + AD
Sbjct: 95 LSAAVENAKSEQQKVDQANKKIAD 118


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL1266  ACRIFLAVINRP  745  0.0  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 745 bits (1925), Expect = 0.0
Identities = 279/1104 (25%), Positives = 502/1104 (45%), Gaps = 100/1104 (9%)

Query: 3 LARPFITRPVATTLLALGIALAGLFAFVKLPVSPLPQVDFPTILVQASLPGASPETVATS 62
+A FI RP+ +LA+ + +AG A ++LPV+ P + P + V A+ PGA +TV +
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 63 VTSPLERHLGSIADVAEMTSMS-SVGNARIVLQFNLNRDIDGAARDVQAAINAARADLPA 121
VT +E+++ I ++ M+S S S G+ I L F D D A VQ + A LP
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 122 SLKSNPTYRKVNPADSPIMVVSLTS--KTASPAKLYDAASTVLQQSLSQIDGIGQVSLSG 179
++ + S +MV S + + D ++ ++ +LS+++G+G V L G
Sbjct: 121 EVQ-QQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG 179

Query: 180 SANPAVRVELEPQALFHYGIGLEDVRAALASANANSPKGAIEAGP------HRYQLYTND 233
+ A+R+ L+ L Y + DV L N G + P +
Sbjct: 180 AQY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQT 238

Query: 234 QATKAAQYKDLVI-AYRNHAAVSLSDVSSVVDSVEDLRNLGLMNGERAVLVILYRSPGAN 292
+ ++ + + + + V L DV+ V E+ + +NG+ A + + + GAN
Sbjct: 239 RFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGAN 298

Query: 293 IIDTIERVKAALPQLTAALPADIQVTPVLDRSRTIRASLADTEHTLIIAVSLVVMVVFLF 352
+DT + +KA L +L P ++V D + ++ S+ + TL A+ LV +V++LF
Sbjct: 299 ALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLF 358

Query: 353 LRNWRATLIPSVAVPISIVGTFGAMYLLGFSLNNLSLMALIVATGFVVDDAIVVLENIAR 412
L+N RATLIP++AVP+ ++GTF + G+S+N L++ +++A G +VDDAIVV+EN+ R
Sbjct: 359 LQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVER 418

Query: 413 HI-ENGTPRLQAAFDGAREVGFTVLSISLSLVAVFLPILLMGGIVGRLFREFALTLSLAI 471
+ E+ P +A ++ ++ I++ L AVF+P+ GG G ++R+F++T+ A+
Sbjct: 419 VMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAM 478

Query: 472 AVSLVVSLTLTPMMCARLLPEAHAPRDE--GRVARWLERGFEWMQRGYERTLSWALRHPF 529
A+S++V+L LTP +CA LL A E G W F+ Y ++ L
Sbjct: 479 ALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTG 538

Query: 530 TILMTLVATIALNIALYIVVPKGFFPQQDTGLMIGGIQADQTTSFQAMKLRFTEMMRIIR 589
L+ +A + L++ +P F P++D G+ + IQ + + + ++
Sbjct: 539 RYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYL 598

Query: 590 ANP-----NVANVAGFT-GGAQTNSGFMFVALKDKPQR---KLSADQVIQQLRPQLAEVA 640
N +V V GF+ G N+G FV+LK +R + SA+ VI + + +L ++
Sbjct: 599 KNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIR 658

Query: 641 GARTFLQAAQDIRAGGRQSNAQYQFT-LLGDSTAELYKWGP-ILTEALQKRPELADVNSD 698
I G + ++ G L + +L A Q L V +
Sbjct: 659 DGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPN 718

Query: 699 QQQGGLEAMVTIDRATAARLGIKPAQIDNTLYDAFGQRQVSTIYNPLNQYHVVMEVAPQY 758
+ + + +D+ A LG+ + I+ T+ A G V+ + + ++ ++
Sbjct: 719 GLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKF 778

Query: 759 WQSPEMLKQIYISTSGGSASGVQTTNAAAGTYVATTARASTAGAAAQSAAAIAADSARNQ 818
PE + ++Y+ ++ G V + +V + R R
Sbjct: 779 RMLPEDVDKLYVRSANGEM--VPFSAFTTSHWVYGSPRLE-----------------RYN 819

Query: 819 ALNSIASSG--KSSASSGAAVSTSKSTMVPLSAIASFGPSTTPLAVNHQGLFVATTISFN 876
L S+ G SSG A++ ++
Sbjct: 820 GLPSMEIQGEAAPGTSSGDAMALMENLAS------------------------------K 849

Query: 877 LPPGVSLSKATQVIYQTMAEVGVPPTIQGSFQGTAQAFQESLKDQPILILAALAAVYIVL 936
LP G+ G + P L+ + V++ L
Sbjct: 850 LPAGIGY------------------DWTGMSYQERLSG----NQAPALVAISFVVVFLCL 887

Query: 937 GILYESYIHPVTILSTLPSAGVGALLGLLLFKTEFSIIALIGVILLIGIVKKNAIMMVDF 996
LYES+ PV+++ +P VG LL LF + + ++G++ IG+ KNAI++V+F
Sbjct: 888 AALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEF 947

Query: 997 AIDA-SRQGKSSFDAIHEACLLRFRPIMMTTMAALLGALPLAFGRGDGAEMRAPLGIAIA 1055
A D ++GK +A A +R RPI+MT++A +LG LPLA G G+ + +GI +
Sbjct: 948 AKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVM 1007

Query: 1056 GGLIVSQMLTLYTTPVVYLYMDRL 1079
GG++ + +L ++ PV ++ + R
Sbjct: 1008 GGMVSATLLAIFFVPVFFVVIRRC 1031



Score = 96.1 bits (239), Expect = 4e-22
Identities = 83/503 (16%), Positives = 167/503 (33%), Gaps = 25/503 (4%)

Query: 2 NLARPFITRPVATTLLALGIALAGLFAFVKLPVSPLPQVDFPTILVQASLP-GASPETVA 60
N + L+ I + F++LP S LP+ D L LP GA+ E
Sbjct: 528 NSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQ 587

Query: 61 TSVTSPLERHLGSIAD----VAEMTSMSSVGNAR----IVLQFNLNRDIDGAARDVQAAI 112
+ + +L + V + S G A+ + + +G +A I
Sbjct: 588 KVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVI 647

Query: 113 NAARADLPASLKSNPTYRKVNPADSPIMVVSLTSKT-----ASPAKLYDAASTVLQQSLS 167
+ A+ +L + + L A + +L +
Sbjct: 648 HRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQ 707

Query: 168 QIDGIGQVSLSGSAN-PAVRVELEPQALFHYGIGLEDVRAALASANANSPKGAIEAGPHR 226
+ V +G + ++E++ + G+ L D+ +++A +
Sbjct: 708 HPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRV 767

Query: 227 YQLYT---NDQATKAAQYKDLVIAYRNHAAVSLSDVSSVVDSVEDLRNLGLMNGERAVLV 283
+LY L + N V S ++ V L NG ++ +
Sbjct: 768 KKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHW-VYGSPRLERYNGLPSMEI 826

Query: 284 ILYRSPGANIIDTIERVKAALPQLTAALPADIQVTPVLDRSRTIRASLADTEHTLIIAVS 343
+PG + D A + L + LPA I S R S + I+
Sbjct: 827 QGEAAPGTSSGD----AMALMENLASKLPAGIGYD-WTGMSYQERLSGNQAPALVAISFV 881

Query: 344 LVVMVVFLFLRNWRATLIPSVAVPISIVGTFGAMYLLGFSLNNLSLMALIVATGFVVDDA 403
+V + + +W + + VP+ IVG A L + ++ L+ G +A
Sbjct: 882 VVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNA 941

Query: 404 IVVLENI-ARHIENGTPRLQAAFDGAREVGFTVLSISLSLVAVFLPILLMGGIVGRLFRE 462
I+++E + G ++A R +L SL+ + LP+ + G
Sbjct: 942 ILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNA 1001

Query: 463 FALTLSLAIAVSLVVSLTLTPMM 485
+ + + + ++++ P+
Sbjct: 1002 VGIGVMGGMVSATLLAIFFVPVF 1024



Score = 59.9 bits (145), Expect = 5e-11
Identities = 37/225 (16%), Positives = 84/225 (37%), Gaps = 4/225 (1%)

Query: 870 ATTISFNLPPGVSLSKATQVIYQTMAEV--GVPPTIQGS-FQGTAQAFQESLKDQPILIL 926
A + L G + + I +AE+ P ++ T Q S+ + +
Sbjct: 286 AAGLGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLF 345

Query: 927 AALAAVYIVLGILYESYIHPVTILSTLPSAGVGALLGLLLFKTEFSIIALIGVILLIGIV 986
A+ V++V+ + ++ + +P +G L F + + + G++L IG++
Sbjct: 346 EAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLL 405

Query: 987 KKNAIMMVDFAIDASRQGKSSF-DAIHEACLLRFRPIMMTTMAALLGALPLAFGRGDGAE 1045
+AI++V+ + K +A ++ ++ M +P+AF G
Sbjct: 406 VDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGA 465

Query: 1046 MRAPLGIAIAGGLIVSQMLTLYTTPVVYLYMDRLRVWAEKRRDRR 1090
+ I I + +S ++ L TP + + +
Sbjct: 466 IYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGG 510


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL1267  ACRIFLAVINRP  802  0.0  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 802 bits (2072), Expect = 0.0
Identities = 284/1035 (27%), Positives = 499/1035 (48%), Gaps = 31/1035 (2%)

Query: 4 SRVFILRPVGTALLMAAIMLAGLVALRFLPLAALPEVDYPTIQVQTFYPGASPEVMTSSV 63
+ FI RP+ +L +M+AG +A+ LP+A P + P + V YPGA + + +V
Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61

Query: 64 TAPLERQFGQMPSLNQMSSQS-SAGASVITLQFSLDLPLDIAEQEVQAAINAAGNLLPSD 122
T +E+ + +L MSS S SAG+ ITL F DIA+ +VQ + A LLP +
Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121

Query: 123 LPAPPIYAKVNPADAPVITLAVTSKTLPLTQ--VQDLADTRLAMKISQVSGVGLVSLSGG 180
+ I + + ++ S TQ + D + + +S+++GVG V L G
Sbjct: 122 VQQQGIS-VEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 181 NRPAVRIQANPLALASYGLNLDDLRTTISNLNVNTPKGNFDGP------TRAYTINANDQ 234
A+RI + L Y L D+ + N G G +I A +
Sbjct: 181 QY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 235 LTSADQYNDAVV-AYKNGRPVMLTDVAKIVAGSENTKLGAWVDAEPAIILNVQRQPGANV 293
+ +++ + +G V L DVA++ G EN + A ++ +PA L ++ GAN
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 294 IQTVDNVKAILPKLQESLPAALDVQIVTDRTTMIRAAVRDVQFELGLAVALVVLVMYLFL 353
+ T +KA L +LQ P + V D T ++ ++ +V L A+ LV LVMYLFL
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 354 ANVYATIIPSLSVPLSLIGTLAVMYLSGFSLNNLSLMALTIATGFVVDDAIVMIENIARY 413
N+ AT+IP+++VP+ L+GT A++ G+S+N L++ + +A G +VDDAIV++EN+ R
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 414 -VEEGDSALEAALKGSKQIGFTIISLTVSLIAVLIPLLFMGDVVGRLFHEFAITLAVTIV 472
+E+ EA K QI ++ + + L AV IP+ F G G ++ +F+IT+ +
Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479

Query: 473 ISAVVSLTLVPMMCAKLLRHTPPPESHRFEAKVHGLIERV----IERYGVALQWVLDRQR 528
+S +V+L L P +CA LL+ E H + G + Y ++ +L
Sbjct: 480 LSVLVALILTPALCATLLK-PVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTG 538

Query: 529 ATLVVAVLTLALTALLYVVIPKGFFPTQDTGVIQAITQAPQSVSYGAMAERQQALAAEIL 588
L++ L +A +L++ +P F P +D GV + Q P + + + L
Sbjct: 539 RYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYL 598

Query: 589 KH--PDVVSLTSFIGVDGANITLNSGRMLINLKPRDERS---ESASDVIRSLQRQVANVT 643
K+ +V S+ + G + N+G ++LKP +ER+ SA VI + ++ +
Sbjct: 599 KNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIR 658

Query: 644 GISLYMQPVQDLTIDSTVSPTQYQFMLTS---PNPDEFATWVPKLVDRLRKEPS-LADVA 699
+ P I + T + F L D +L+ + P+ L V
Sbjct: 659 DGFVI--PFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVR 716

Query: 700 TDLQNSGKSVYIEIDRTSAARFGITPATVDNALYDAYGQRIVSTIFTQSNQYRVILESEP 759
+ +E+D+ A G++ + ++ + A G V+ + ++ ++++
Sbjct: 717 PNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADA 776

Query: 760 QMQHYTDSLNGIYLPSAGGGQVPLSAIATFRERPAPLLVSHLSQFPATTISFNLAPGASL 819
+ + + ++ +Y+ SA G VP SA T + + P+ I APG S
Sbjct: 777 KFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSS 836

Query: 820 GEAVKAIDAAERELGLPASFQTRFQGAALAFQASLSNQLFLILAAIVTMYIVLGVLYESY 879
G+A+ ++ +L PA + G + + S + L+ + V +++ L LYES+
Sbjct: 837 GDAMALMENLASKL--PAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESW 894

Query: 880 IHPITILSTLPSAGVGALLALMITGHDLDIIGIIGIVLLIGIVKKNAIMMIDFALEAERV 939
P++++ +P VG LLA + D+ ++G++ IG+ KNAI++++FA +
Sbjct: 895 SIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEK 954

Query: 940 EGKPPREAIYQACLLRFRPILMTTLAALLGAVPLIVGSGAGSELRQPLGIAIAGGLIVSQ 999
EGK EA A +R RPILMT+LA +LG +PL + +GAGS + +GI + GG++ +
Sbjct: 955 EGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSAT 1014

Query: 1000 VLTLFTTPVIYLGFD 1014
+L +F PV ++
Sbjct: 1015 LLAIFFVPVFFVVIR 1029


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL1268  RTXTOXIND  48  4e-08  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 48.3 bits (115), Expect = 4e-08
Identities = 27/149 (18%), Positives = 57/149 (38%), Gaps = 16/149 (10%)

Query: 84 AARGEMPVVLNALGTVTPLANV-TVRTQLSGYLQAVSFQEGQIVKKGDVLAQIDPRP--- 139
+ G++ +V A G +T ++ + ++ + +EG+ V+KGDVL ++
Sbjct: 75 SVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEA 134

Query: 140 ----YQISLANAQGALARDEALLATARLDLKRYQTLVAQ---DSIAKQTADTQASLVKQY 192
Q SL A+ R + L + L+ L + +++++ SL+K+
Sbjct: 135 DTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKE- 193

Query: 193 EGTVQIDRAAIDSAKLNLAYARITAPVSG 221
Q + L + A
Sbjct: 194 ----QFSTWQNQKYQKELNLDKKRAERLT 218



Score = 38.3 bits (89), Expect = 5e-05
Identities = 33/182 (18%), Positives = 61/182 (33%), Gaps = 26/182 (14%)

Query: 141 QISLANAQGALARDEALLAT--ARLDLKRYQTLVAQDSIAKQTADTQASLVKQY-EGTVQ 197
+ ++ + L ++L+ + L A++ T + ++ + + T
Sbjct: 251 KHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDN 310

Query: 198 ID--RAAIDSAKLNLAYARITAPVSGRV-GLRQVDPGNYVTPSDT--------NGIVVIT 246
I + + + I APVS +V L+ G VT ++T + + V
Sbjct: 311 IGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTA 370

Query: 247 QLQPMSVIFTTSEDNLPAILKQVGAGGKLSVTAYNRNNTTPLETGV-LDTLDNQIDTATG 305
+Q + F AI+K V A+ L V LD D G
Sbjct: 371 LVQNKDIGFINVG--QNAIIK---------VEAFPYTRYGYLVGKVKNINLDAIEDQRLG 419

Query: 306 TV 307
V
Sbjct: 420 LV 421


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL1273  NUCEPIMERASE  35  2e-04  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 34.8 bits (80), Expect = 2e-04
Identities = 22/126 (17%), Positives = 37/126 (29%), Gaps = 30/126 (23%)

Query: 1 MKIALFGATGMIGSRIAAEAARRGHQVTAL-------------SRNPAASGANVQAKAAD 47
MK + GA G IG ++ GHQV + +R + Q D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 48 LFDPASIA--------------AALAGQDVVASAYGPKQEEASKVVAVAKALVDGARKAG 93
L D + V S P S + +++G R
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLA--VRYSLENPHAYADSNLTGFLN-ILEGCRHNK 117

Query: 94 VKRVVV 99
++ ++
Sbjct: 118 IQHLLY 123


60  BPSL1562  BPSL1569  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
BPSL1562   0       8        2.159565  putrescine transport system permease protein
BPSL1563   0       9        2.446018  putrescine transport system permease protein
BPSL1564  -1      10        2.300196  putative membrane protein
BPSL1565  -1      11        2.441893  hypothetical protein
BPSL1566  -1      11        2.844555  putative metallo-beta-lactamase family protein
BPSL1567   0      12        2.302806  putative transcriptional regulatory protein
BPSL1568  -2      12        3.293041  putative membrane protein
BPSL1569  -2      13        3.737701  putative transcriptional regulatory protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL1562  HTHFIS  333  e-112  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 333 bits (855), Expect = e-112
Identities = 124/357 (34%), Positives = 178/357 (49%), Gaps = 42/357 (11%)

Query: 145 ERLTTVRSASAKPSGEGLVGGSDAFNAALSALQRVAPSMLPVLLLGESGTGKELFARALH 204
+ + G LVG S A L R+ + L +++ GESGTGKEL ARALH
Sbjct: 122 PKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALH 181

Query: 205 EASARAMGPFVVVDCSGIAETLFESELFGYEKGAFTGASARKPGLVETAQGGTLFLDEIG 264
+ R GPFV ++ + I L ESELFG+EKGAFTGA R G E A+GGTLFLDEIG
Sbjct: 182 DYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIG 241

Query: 265 DVPLSMQVKLLRLIESGTFRRVGGVEALCADFRLVAATHKPLKAMIGDGRFRPDLYYRIS 324
D+P+ Q +LLR+++ G + VGG + +D R+VAAT+K LK I G FR DLYYR++
Sbjct: 242 DMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLN 301

Query: 325 AYPISLPAVRERPGDMPLLVDSILRRIAALGPVAGQHFVVAPDALARLEAYAWPGNIREL 384
P+ LP +R+R D+P LV +++ G +AL ++A+ WPGN+REL
Sbjct: 302 VVPLRLPPLRDRAEDIPDLVRHFVQQAEKEG---LDVKRFDQEALELMKAHPWPGNVREL 358

Query: 385 RNVLDRACLLTDDGVIRVEHLPDEVAGGARIEPGAPAKLSDDELARIARAFDGTRRAL-- 442
N++ R L VI E + +E+ P A L+ + R+
Sbjct: 359 ENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFAS 418

Query: 443 -------------------------------------AERVGMSERTLYRRLRALGI 462
A+ +G++ TL +++R LG+
Sbjct: 419 FGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
BPSL1565  LCRVANTIGEN  31     0.006    Low calcium response V antigen signature.
		>LCRVANTIGEN#Low calcium response V antigen signature.

Length = 326

Score = 31.2 bits (70), Expect = 0.006
Identities = 14/57 (24%), Positives = 25/57 (43%)

Query: 293 QRWLELFRHYAGDDPATQLKFREALANEPELMTGTWADDALLGFVREAMQHLAPARR 349
Q ++ + + P TQ + R +A +T DD +L + ++M H AR
Sbjct: 95 QNGIKRVKEFLESSPNTQWELRAFMAVMHFSLTADRIDDDILKVIVDSMNHHGDARS 151


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL1567  ACRIFLAVINRP  433    e-137    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 433 bits (1115), Expect = e-137
Identities = 223/1062 (20%), Positives = 423/1062 (39%), Gaps = 75/1062 (7%)

Query: 13 LSAWALRHQALVVYLIALATIAGILAYSRLAQSEDPPFTFRVMVIRTFWPGATARQVQEQ 72
++ + +R L + +AG LA +L ++ P + + +PGA A+ VQ+
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 73 VTDRIGRKLQEMPAIDYLRSYS-RPGESMLFFAMKDSAPVKDVPQTWYQVRKKVGDISMT 131
VT I + + + + Y+ S S G + + QV+ K+ +
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQV---QVQNKLQLATPL 117

Query: 132 LPPGVQGP-FFNDEFGDVYTNIYTLEGDG--FSPAQLHDYAD-QLRVVLLRVPGVAKVDY 187
LP VQ ++ Y + D + + DY ++ L R+ GV V
Sbjct: 118 LPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQL 177

Query: 188 FGDPDQRIFVEIDNTRLARLGISPQQIAQAINAQNDVASPGVLTAAHD------RVFIRP 241
FG + + +D L + ++P + + QND + G L I
Sbjct: 178 FG-AQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIA 236

Query: 242 SGQYESVAAIADTLIRVN--GRTFRLGELATIKRGYDDPPVTQMRTIGRNANGRAVLGIG 299
++++ +RVN G RL ++A ++ G + + NG+ G+G
Sbjct: 237 QTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELG------GENYNVIARINGKPAAGLG 290

Query: 300 VTMQPGGDVIRLGKALDASAKALQAQLPAGLALTEVSSMPHAVARSVDDFLEAVAEAVAI 359
+ + G + + KA+ A LQ P G+ + V S+ + ++ + EA+ +
Sbjct: 291 IKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIML 350

Query: 360 VLIVSLVSLG-LRTGMVVVISIPVVLAVTALFMYLFDIGLHKVSLGTLVLALGLLVDDAI 418
V +V + L +R ++ I++PVVL T + F ++ +++ +VLA+GLLVDDAI
Sbjct: 351 VFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAI 410

Query: 419 IAVEMMA-VKLEQGFSRARAAAFAYTSTAFPMLTGTLVTVSGFLPIALAKSSTGEYTRSI 477
+ VE + V +E A + + ++ +V + F+P+A STG R
Sbjct: 411 VVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQF 470

Query: 478 FEVSAIALIASWFAAVVLIPLLGYHMLPERKHPRQDAAGAPHAP-DAAHDHAHGHDIYDT 536
A+ S A++L P L +L + G + DH+
Sbjct: 471 SITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHS-------V 523

Query: 537 RFYTRLRVWIKWCIERRFIVLAITIALFVVALAGFSLVPQQFFPSSDRPELLVDLRLPEG 596
YT + + L I + + F +P F P D+ L ++LP G
Sbjct: 524 NHYTNS---VGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAG 580

Query: 597 ASFSATLKEAERLEKLIAK--RPEIDHAVNFVGSGAPRFYLPLDQQLQLPNFAQFVITAK 654
A+ T K +++ K + ++ G Q N ++ K
Sbjct: 581 ATQERTQKVLDQVTDYYLKNEKANVESVFTVNGFSFSG---------QAQNAGMAFVSLK 631

Query: 655 SVDAR---EKLSAWLAPVLREQFPAARTRISRLENGPPV-------GYPVQ-FRVSGDSI 703
+ R E + + + + R N P + G+ + +G
Sbjct: 632 PWEERNGDENSAEAVIHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGH 691

Query: 704 ATVRAIAEKVAATMR---ADARATNVQFDWDEPAERSVRFELDQHKARELNVSSQDVASF 760
+ ++ A + D + E+DQ KA+ L VS D+
Sbjct: 692 DALTQARNQLLGMAAQHPASLVSVRPNGLEDTA---QFKLEVDQEKAQALGVSLSDINQT 748

Query: 761 LAMTLSGTTLTQYRERDKLIAVDLRAPRAQRIDPASLAGLAMPTPNG-PVPLGSLGRFHD 819
++ L GT + + +R ++ + ++A R+ P + L + + NG VP + H
Sbjct: 749 ISTALGGTYVNDFIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHW 808

Query: 820 TLEYGVVWERDRQPTITVQSDVTAGAQGIDVTHAIDAKLDALRAQLPVGYRIEIGGSVEE 879
+ + P++ +Q + G + A ++ L ++LP G + G +
Sbjct: 809 VYGSPRLERYNGLPSMEIQGEAAPG----TSSGDAMALMENLASKLPAGIGYDWTGMSYQ 864

Query: 880 STKGQTSINAQMPLMVIAVLTLLMIQLQSFSRVLMVVLTAPLGMIGVVGTLLLFGKPFGF 939
A + + + V L +S+S + V+L PLG++GV+ LF +
Sbjct: 865 ERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDV 924

Query: 940 VAMLGVIAMFGIIMRNSVILVDQIEQDIAA-GHGRFDAIVGATVRRFRPITLTAAAAVLA 998
M+G++ G+ +N++++V+ + + G G +A + A R RPI +T+ A +L
Sbjct: 925 YFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILG 984

Query: 999 LIPLLRSNFFG-----PMATALMGGITSATVLTLFFLPALYA 1035
++PL SN G + +MGG+ SAT+L +FF+P +
Sbjct: 985 VLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFV 1026



Score = 84.1 bits (208), Expect = 2e-18
Identities = 92/533 (17%), Positives = 185/533 (34%), Gaps = 63/533 (11%)

Query: 550 IERRFIVLAITIALFVVALAGFSLVPQQFFPSSDRPELLVDLRLPEGASFSATLKEAERL 609
I R + I L + +P +P+ P + V P GA A+ +
Sbjct: 6 IRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYP-GAD-------AQTV 57

Query: 610 EKLIAKRPE-----IDHAVNFVGSGAPRFYLPLDQQLQL---PNFAQFVITAKSVDAREK 661
+ + + E ID+ + + + + Q P+ AQ V + K
Sbjct: 58 QDTVTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQ-------VQVQNK 110

Query: 662 LSAWLAPVLREQFP-AARTRISRLENGPPVGYPVQFRVSGDSIATVRAIAEKVAATMRAD 720
L P + + +E V VS + T I++ VA+ ++
Sbjct: 111 LQL-----ATPLLPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDT 165

Query: 721 AR----ATNVQFDWDEPAERSVRFELDQHKARELNVSSQDVASFL--------AMTLSGT 768
+VQ A+ ++R LD + ++ DV + L A L GT
Sbjct: 166 LSRLNGVGDVQLF---GAQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGT 222

Query: 769 TLTQYRERDKLIAVDLRAPRAQRIDPASLAGLAMPTPNG-PVPLGSLGRFHDTLE-YGVV 826
++ + I R + +L +G V L + R E Y V+
Sbjct: 223 PALPGQQLNASIIAQTRFKNPEEFGKVTLRV----NSDGSVVRLKDVARVELGGENYNVI 278

Query: 827 WERDRQPTITVQSDVTAGAQGIDVTHAIDAKLDALRAQLPVGYRIEI----GGSVEESTK 882
+ +P + + GA +D AI AKL L+ P G ++ V+ S
Sbjct: 279 ARINGKPAAGLGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSI- 337

Query: 883 GQTSINAQMPLMVIAVLTLLMIQLQSFSRVLMVVLTAPLGMIGVVGTLLLFGKPFGFVAM 942
+ +++ L + + LQ+ L+ + P+ ++G L FG + M
Sbjct: 338 -HEVVKTLFEAIMLVFLVMYLF-LQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTM 395

Query: 943 LGVIAMFGIIMRNSVILVDQIEQDIAAGHGRF-DAIVGATVRRFRPITLTAAAAVLALIP 1001
G++ G+++ +++++V+ +E+ + +A + + + A IP
Sbjct: 396 FGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIP 455

Query: 1002 LL-----RSNFFGPMATALMGGITSATVLTLFFLPALYAAWFRVKPDERDPEP 1049
+ + + ++ + + ++ L PAL A + E
Sbjct: 456 MAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENK 508


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
BPSL1568  RTXTOXIND  38     6e-05    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 37.5 bits (87), Expect = 6e-05
Identities = 30/198 (15%), Positives = 58/198 (29%), Gaps = 28/198 (14%)

Query: 1 MNRSGSRAALLIGVALIAAACHRKEAAPSAPRPVVAVPAQADGAAAAVSLPGEIQPRYAT 60
+ SR L+ ++ + +VA A+G EI+P
Sbjct: 49 IETPVSRRPRLVAYFIMGFLVIAFILSVLGQVEIVAT---ANGKLTHSGRSKEIKP---- 101

Query: 61 PLSFRIAGKLVER-KVRLGDIVKKGQVVALLDTSDVARNAASAQAQLDAATHALTFAQQQ 119
I +V+ V+ G+ V+KG V+ L + Q+ L A +Q
Sbjct: 102 -----IENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQAR-----LEQT 151

Query: 120 RERDRAQARENLIAPAQLEQTENAYASARAQRDQAAQQLA----------LAKNQLQYAT 169
R + +++ E P E + + + L + +L
Sbjct: 152 RYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDK 211

Query: 170 LVADHAGYITAEQADTGQ 187
A+ +
Sbjct: 212 KRAERLTVLARINRYENL 229



Score = 34.8 bits (80), Expect = 5e-04
Identities = 10/71 (14%), Positives = 27/71 (38%)

Query: 100 ASAQAQLDAATHALTFAQQQRERDRAQARENLIAPAQLEQTENAYASARAQRDQAAQQLA 159
+ A+++ + + + + + + IA + + EN Y A + QL
Sbjct: 217 LTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLE 276

Query: 160 LAKNQLQYATL 170
++++ A
Sbjct: 277 QIESEILSAKE 287


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL1569  HTHTETR  62     6e-14    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 61.6 bits (149), Expect = 6e-14
Identities = 41/203 (20%), Positives = 76/203 (37%), Gaps = 10/203 (4%)

Query: 11 RLTREQSKDLTRERLLSAAHAIFTKKGYVAASVEDIASAAGYTRGAFYSNFRSKAELLIE 70
R T++++++ TR+ +L A +F+++G + S+ +IA AAG TRGA Y +F+ K++L E
Sbjct: 3 RKTKQEAQE-TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 71 LLKRDHEEAEADLQKIFE--SGGTREQMEA---HALEYYSQFFRNNPAFLLWGEAKLQAT 125
+ + + G + H LE R +
Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVG 121

Query: 126 RDAKFRARFNEFVKEKRDRFTHYILTFAERVGTPLLLPADVLALGLMSLCDGVQSYHAAD 185
A + E DR + E P L A+ + G+
Sbjct: 122 EMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFA 181

Query: 186 PRHVTGDAAQQVLAGFFARVVLA 208
P+ D ++ A + ++L
Sbjct: 182 PQSF--DLKKE--ARDYVAILLE 200


61  BPSL1627  BPSL1633  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
BPSL1627  6       19       -2.040342  subfamily S9C non-peptidase homologue
BPSL1628  5       19       -2.073629  hypothetical protein
BPSL1629  1       17       -2.081588  conserved hypothetical protein
BPSL1630  1       18       -2.199862  putative fimbrial subunit type 1 precursor
BPSL1631  1       17       -2.171967  putative fimbrial assembly chaperone precursor
BPSL1632  0       22       -5.239504  putative fimbrial usher protein
BPSL1633  0       22       -4.999270  putative exported fimbria-related protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL1628  PF00577  787    0.0      Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 787 bits (2034), Expect = 0.0
Identities = 292/873 (33%), Positives = 450/873 (51%), Gaps = 47/873 (5%)

Query: 11 FSRIRVTMLAAALTALSATAR----GQQALEFDPAFLELGGGQGGADLSVYATSNRVLPG 66
+ R+ L A A L F+P FL Q ADLS + + PG
Sbjct: 18 IRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLA-DDPQAVADLSRFENGQELPPG 76

Query: 67 VYPISVFVNGEAIERRDITFVSESARDGREDAIPCLSARMFDEWGVDIAAFAKLAQAGED 126
Y + +++N + RD+TF D + +PCL+ G++ A+ + + +D
Sbjct: 77 TYRVDIYLNNGYMATRDVTFN---TGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADD 133

Query: 127 ACVDIADSVPHARTEFDSHQLRLNVTVPQAALKRRARGAVDPARWDQGIDAALLDYQLSA 186
ACV + + A + D Q RLN+T+PQA + RARG + P WD GI+A LL+Y S
Sbjct: 134 ACVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSG 193

Query: 187 AQYAGGNFASARSRTTLYAGLRGAVNLGAWRLSHTSSFLRGL-----DGRNRFQIVNTFV 241
+ Y L+ +N+GAWRL +++ +N++Q +NT++
Sbjct: 194 NSVQNR---IGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWL 250

Query: 242 QRDIAGWNSRLTAGEGTTPANIFDGFQFLGVQLNTDETMLPDSLQGYAPTVHGVAQTNAQ 301
+RDI SRLT G+G T +IFDG F G QL +D+ MLPDS +G+AP +HG+A+ AQ
Sbjct: 251 ERDIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQ 310

Query: 302 VTIRQNGFVIYSTYVPPGPFTIDDLYPTSSSGNLEVTITEADGHVTTFTQPYSAVPMLLR 361
VTI+QNG+ IY++ VPPGPFTI+D+Y +SG+L+VTI EADG FT PYS+VP+L R
Sbjct: 311 VTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQR 370

Query: 362 DGSWRYNVTAGQYR-DGISGSHPSFAMATLARGLAGEFSLYGGFIGAGMYQSVLVGIGKN 420
+G RY++TAG+YR P F +TL GL +++YGG A Y++ GIGKN
Sbjct: 371 EGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKN 430

Query: 421 LGSIGAVSLDVTHARSAVDLADSSTVSGHAFRVLYAKAVGSWGTDFRLLAYRYSTAGYRS 480
+G++GA+S+D+T A S L D S G + R LY K++ GT+ +L+ YRYST+GY +
Sbjct: 431 MGALGALSVDMTQANS--TLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFN 488

Query: 481 FADAVQLRDGSEPAAL------------------GAKRQRLEGTVNQRLGRLGSMYATVA 522
FAD R KR +L+ TV Q+LGR ++Y + +
Sbjct: 489 FADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGS 548

Query: 523 VQTYWGSAARSTVYQLGHSGNWGRASYGLYAAYSKGSGVPSSWN-VSLSLSMPLEVFFGG 581
QTYWG++ +Q G + + ++ L + +K + ++L++++P +
Sbjct: 549 HQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRS 608

Query: 582 ARVRAPAGGSANVSYFVSRNNENHVNQQMTASGSSSEQ-RLNYSVGVAHS----SESDVS 636
A+ SY +S + + G+ E L+YSV ++ S +
Sbjct: 609 DSKSQW--RHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGST 666

Query: 637 GSVSASYLAPFGRYDASIGSGRGYTQAAFTAAGGMLWHGTGVLFTQPLGETVAVVDVPNV 696
G + +Y +G + Q + +GG+L H GV QPL +TV +V P
Sbjct: 667 GYATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGA 726

Query: 697 RGVRFEMHPGVSTDRAGEAVIPRLNPYRVNRIAVDQRRMPQDVEIRNPVSEVVPTRAAVV 756
+ + E GV TD G AV+P YR NR+A+D + +V++ N V+ VVPTR A+V
Sbjct: 727 KDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIV 786

Query: 757 QTHFDSVVGLRALFTLMRADGSFPPQGATAENDEGQVLGVVGMDGETFVAGLPAAEGHFV 816
+ F + VG++ L TL + P GA ++ Q G+V +G+ +++G+P A G
Sbjct: 787 RAEFKARVGIKLLMTL-THNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLA-GKVQ 844

Query: 817 VRWGAARQNRCRVNYALPGKAAIGAYLAVEAIC 849
V+WG C NY LP ++ + A C
Sbjct: 845 VKWGEEENAHCVANYQLPPESQQQLLTQLSAEC 877


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
BPSL1631  OMADHESIN  50     3e-08    Yersinia outer membrane adhesin signature.
		>OMADHESIN#Yersinia outer membrane adhesin signature.

Length = 455

Score = 50.3 bits (119), Expect = 3e-08
Identities = 52/159 (32%), Positives = 79/159 (49%)

Query: 873 ATGNNASASGTSSTAGGANAIASGENSTANGANSTASGNGSSAFGESAAAAGDGSTALGS 932
A G NASA G S A GA A A+ + A GA S A+G S A G + A GD + G+
Sbjct: 60 AGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGA 119

Query: 933 NAVASGVGSVATGAGSVASGANSSAYGTGSNATGAGSVAIGQGATASGSNSVALGTGSVA 992
+ A G S + + + + ++A + ++ A+ S+A+G S
Sbjct: 120 ASTAQKDGVAIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKT 179

Query: 993 SEDNTVSVGSAGSERRITNVAAGVNATDAVNVGQLNSAV 1031
+N+VS+G R++T++AAG TDAVNV QL +
Sbjct: 180 DRENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQLKKEI 218



Score = 39.9 bits (92), Expect = 5e-05
Identities = 79/305 (25%), Positives = 124/305 (40%), Gaps = 13/305 (4%)

Query: 439 ASGDNATASGTNSTANGTNSTASGDNSTASGTNASASGENSTATGTDSTASGSNSTANGT 498
A G NA+A G +S A G + A+ + A G + A+G NS A G S A G ++ G
Sbjct: 60 AGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGA 119

Query: 499 NSTASGDNSTASGTNASATGENSTATGTDSTASGSNSTANGTNSTASGDN--STASGTNA 556
STA D A G AS T + A G +S A NS A G +S + ++ S A G +
Sbjct: 120 ASTAQKD-GVAIGARAS-TSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRS 177

Query: 557 SASGENSTATGTDSTASGSNSTANGTNSTASGDNSTASGTNASATGENSTATGTDSTASG 616
ENS + G +S A GT T + + A + ++T
Sbjct: 178 KTDRENSVSIGHESLNRQLTHLAAGTKDTDAVN---------VAQLKKEIEKTQENTNKR 228

Query: 617 SNSTANGTNSTASGDNSTASGTNASATGDNSTASGTNASATGENSTATGTDSTASGSNST 676
S N+ A +S+ G + T S + NA + + + SNS
Sbjct: 229 SAELLANANAYADNKSSSVLGIANNYTDSKSAETLENARKEAFAQSKDVLNMAKAHSNSV 288

Query: 677 ANGTNSTASGDNSTASGTNASATGENSTATGTDSTASGSNSTANGANSTASGDNSTASGT 736
A T TA ++ + T E++ ++ AS + + ++ T NS T
Sbjct: 289 ARTTLETAEEHANSVARTTLETAEEHANKKSAEALASANVYADSKSSHTLKTANSYTDVT 348

Query: 737 NASAT 741
+++T
Sbjct: 349 VSNST 353



Score = 39.5 bits (91), Expect = 6e-05
Identities = 81/333 (24%), Positives = 125/333 (37%), Gaps = 9/333 (2%)

Query: 444 ATASGTNSTANGTNSTASGDNSTASGTNASASGENSTATGTDSTASGSNSTANGTNSTAS 503
A A + N T S + A G A G +++A G +S A G + A+
Sbjct: 23 AFADDYDGIPNLTAVQISPNADPALGLEYPVRPPVPGAGGLNASAKGIHSIAIGATAEAA 82

Query: 504 GDNSTASGTNASATGENSTATGTDSTASGSNSTANGTNSTASGDNSTASGTNASASGENS 563
+ A G + ATG NS A G S A G ++ G STA D A G AS S +
Sbjct: 83 KGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKD-GVAIGARASTS-DTG 140

Query: 564 TATGTDSTASGSNSTANGTNSTASGDN--STASGTNASATGENSTATGTDSTASGSNSTA 621
A G +S A NS A G +S + ++ S A G + ENS + G +S A
Sbjct: 141 VAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHESLNRQLTHLA 200

Query: 622 NGTNST-----ASGDNSTASGTNASATGDNSTASGTNASATGENSTATGTDSTASGSNST 676
GT T A + + NA A ++S+ G + + S S
Sbjct: 201 AGTKDTDAVNVAQLKKEIEKTQENTNKRSAELLANANAYADNKSSSVLGIANNYTDSKSA 260

Query: 677 ANGTNSTASGDNSTASGTNASATGENSTATGTDSTASGSNSTANGANSTASGDNSTASGT 736
N+ + N + NS A T TA ++ + +++
Sbjct: 261 ETLENARKEAFAQSKDVLNMAKAHSNSVARTTLETAEEHANSVARTTLETAEEHANKKSA 320

Query: 737 NASATGENSTATGTDSTASGSNSTANGTNSTAS 769
A A+ + + T +NS + T S ++
Sbjct: 321 EALASANVYADSKSSHTLKTANSYTDVTVSNST 353



Score = 39.5 bits (91), Expect = 6e-05
Identities = 86/340 (25%), Positives = 130/340 (38%), Gaps = 13/340 (3%)

Query: 628 ASGDNSTASGTNASATGDNSTASGTNASATGENSTATGTDSTASGSNSTANGTNSTASGD 687
A + + T + + A G A G +++A G +S A G + A+
Sbjct: 25 ADDYDGIPNLTAVQISPNADPALGLEYPVRPPVPGAGGLNASAKGIHSIAIGATAEAAKG 84

Query: 688 NSTASGTNASATGENSTATGTDSTASGSNSTANGANSTASGDNSTASGTNASATGENSTA 747
+ A G + ATG NS A G S A G ++ GA STA D A G AS T + A
Sbjct: 85 AAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKD-GVAIGARAS-TSDTGVA 142

Query: 748 TGTDSTASGSNSTANGTNSTASGNN--STASGTNASATGENSTATGTDSAASGTNSTANG 805
G +S A NS A G +S + N+ S A G + ENS + G +S A G
Sbjct: 143 VGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHESLNRQLTHLAAG 202

Query: 806 TNSTASGDNSTASGTNASATGENSTATGTASTASGSNSTANGANSTASGAGATATGENAA 865
T T + + A + +T S AN+ A ++ G
Sbjct: 203 TKDTDAVN---------VAQLKKEIEKTQENTNKRSAELLANANAYADNKSSSVLGIANN 253

Query: 866 ATGAGATATGNNASASGTSSTAGGANAIASGENSTANGANSTASGNGSSAFGESAAAAGD 925
T + + T NA + + N + NS A TA + +S + A +
Sbjct: 254 YTDSKSAETLENARKEAFAQSKDVLNMAKAHSNSVARTTLETAEEHANSVARTTLETAEE 313

Query: 926 GSTALGSNAVASGVGSVATGAGSVASGANSSAYGTGSNAT 965
+ + A+AS + + ANS T SN+T
Sbjct: 314 HANKKSAEALASANVYADSKSSHTLKTANSYTDVTVSNST 353



Score = 37.2 bits (85), Expect = 4e-04
Identities = 102/425 (24%), Positives = 169/425 (39%), Gaps = 39/425 (9%)

Query: 691 ASGTNASATGENSTATGTDSTASGSNSTANGANSTASGDNSTASGTNASATGENSTATGT 750
A G NASA G +S A G + A+ + A GA S A+G NS A G + A G+++ G
Sbjct: 60 AGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGA 119

Query: 751 DSTASGSNSTANGTNSTASGNNSTASGTNASATGENSTATGTDSAASGTNSTANGTNSTA 810
STA ST+ + A G N+ A +NS A G S + AN S A
Sbjct: 120 ASTAQKDGVAIGARASTS--DTGVAVGFNSKADAKNSVAIGHSSHVA-----ANHGYSIA 172

Query: 811 SGDNSTASGTNASATGENSTATGTASTASGSNSTANGANSTASGAGATATGENAAATGAG 870
GD S N+ + G S A+G+ T + N T EN A
Sbjct: 173 IGDRSKTDRENSVSIGHESLNRQLTHLAAGTKDT-DAVNVAQLKKEIEKTQENTNKRSAE 231

Query: 871 ATATGNNASASGTSSTAGGANAIASGENSTANGANSTASGNGSSAFGESAAAAGDGSTAL 930
A N + + +SS G AN N +S ++ +A E+ A + D
Sbjct: 232 LLANANAYADNKSSSVLGIAN----------NYTDSKSAETLENARKEAFAQSKDVLNMA 281

Query: 931 GSNAVASGVGSVATGAGSVASGANSSAYGTGSNATGAGSVAIGQGATASGSNSVALGTGS 990
+++ + ++ T S A ++ +A + A+ + S S +
Sbjct: 282 KAHSNSVARTTLETAEEHANSVARTTLETAEEHANKKSAEALASANVYADSKSSHTLKTA 341

Query: 991 VASEDNTVSVGSAGSERRITNVAAGVNATDAVNVGQLNSAVSGIRNQMDGMQGQIDTLAR 1050
+ D TVS + + R + + N++D + ++D
Sbjct: 342 NSYTDVTVSNSTKKAIRESNQYT--------------DHKFRQLDNRLDKLDTRVD---- 383

Query: 1051 DAYSGIAAATALTMIPDVDPGKTLAVGIGTANFKGYQASALGATARITQNLKVKTGVSYS 1110
G+A++ AL + + G ++ QA A+G+ R+ +N+ +K GV+Y+
Sbjct: 384 ---KGLASSAALNSLFQPYGVGKVNFTAGVGGYRSSQALAIGSGYRVNENVALKAGVAYA 440

Query: 1111 GSNYV 1115
GS+ V
Sbjct: 441 GSSDV 445



Score = 35.3 bits (80), Expect = 0.001
Identities = 40/119 (33%), Positives = 53/119 (44%)

Query: 877 NASASGTSSTAGGANAIASGENSTANGANSTASGNGSSAFGESAAAAGDGSTALGSNAVA 936
+ SA+ S+ A A + N S N A G A G NA A
Sbjct: 8 SVSAALISALFSSPYAFADDYDGIPNLTAVQISPNADPALGLEYPVRPPVPGAGGLNASA 67

Query: 937 SGVGSVATGAGSVASGANSSAYGTGSNATGAGSVAIGQGATASGSNSVALGTGSVASED 995
G+ S+A GA + A+ + A G GS ATG SVAIG + A G ++V G S A +D
Sbjct: 68 KGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKD 126


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
BPSL1633  HTHFIS  44     2e-07    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 44.4 bits (105), Expect = 2e-07
Identities = 19/84 (22%), Positives = 38/84 (45%), Gaps = 6/84 (7%)

Query: 10 KVVVADDHPIVLRAVTDYVNSLPGFHVVASVSSGDALLSAMREQEVNLVVTDFTMHQAND 69
++VADD + + ++ G+ V S+ L + + +LVVTD M
Sbjct: 5 TILVADDDAAIRTVLNQALSRA-GYDVRI-TSNAATLWRWIAAGDGDLVVTDVVM----P 58

Query: 70 DKDGLRLISHLMRAYERTPIIVFT 93
D++ L+ + +A P++V +
Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMS 82


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
BPSL1634  HTHFIS  81     7e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 81.0 bits (200), Expect = 7e-18
Identities = 36/146 (24%), Positives = 60/146 (41%), Gaps = 1/146 (0%)

Query: 854 TVLIAEDNLLNRSLLLDQLTTLGVRVIEAKNGEEALALLLKEPVDVVMTDIDMPMMDGFQ 913
T+L+A+D+ R++L L+ G V N + D+V+TD+ MP + F
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 914 LLAEMRRLGMTMPVYAVSASARPEDVAEGRARGFTDYLAKPVSLERLETVVRACCSAP-A 972
LL +++ +PV +SA + +G DYL KP L L ++ + P
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 973 GARADEDAQDELPGLPDVPPAYASAF 998
ED + L A +
Sbjct: 125 RPSKLEDDSQDGMPLVGRSAAMQEIY 150


62  BPSL1658  BPSL1664  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BPSL1658   11      21       -3.808691  putative outer membrane porin protein
BPSL1658a  10      21       -3.828002  hypothetical protein
BPSL1659   10      21       -3.671815  insertion element hypothetical protein
BPSL1660   10      20       -3.447840  insertion element hypothetical protein
BPSL1661   8       21       -3.417970  putative DNA-binding protein, H-NS-like
BPSL1662   3       29       -3.971740  putative exported protein
BPSL1663   3       30       -4.058074  putative outer membrane protein
BPSL1664   4       30       -3.884264  putative hemolysin-related protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
BPSL1659  OMPADOMAIN  111    1e-30    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 111 bits (280), Expect = 1e-30
Identities = 59/180 (32%), Positives = 89/180 (49%), Gaps = 12/180 (6%)

Query: 123 QYQVRF--LGGLAYRGYWADSACRDIAARYADAAGLGVIAVAPCNPSDVAAPLPERVELP 180
Q+ + R ++ R+ V+A AP +V L
Sbjct: 163 QWTNNIGDAHTIGTRP-DNGMLSLGVSYRFGQGEAAPVVAPAPAPAPEVQTK---HFTLK 218

Query: 181 TDTLFAFDKGGFEDISADGRRQLGDLVASIKAKIFSINHLIVTGYTDRLGSDEHNARLSS 240
+D LF F+K + +G+ L L + + ++V GYTDR+GSD +N LS
Sbjct: 219 SDVLFNFNK---ATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLSE 275

Query: 241 ERARTVADYMIAEGIPAAKITAVGRGAADPVV--VCNNGEQ-PELIRCLQKNRRVEIRIK 297
RA++V DY+I++GIPA KI+A G G ++PV C+N +Q LI CL +RRVEI +K
Sbjct: 276 RRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEVK 335


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
BPSL1661  RTXTOXINA  48     9e-07    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 47.7 bits (113), Expect = 9e-07
Identities = 24/78 (30%), Positives = 38/78 (48%)

Query: 2983 AGADTVTGSSGRDMLNGGAGNDTIVGNGGVDVLEGGGGNDMLVVNGDNIAHFTTPGSYWD 3042
G DT++G +G D L GG GND ++G G + L GG G+D V G+++A G +
Sbjct: 762 KGNDTLSGGNGDDQLYGGDGNDKLIGVAGNNYLNGGDGDDEFQVQGNSLAKNVLFGGKGN 821

Query: 3043 GGIMMGGEINTLQFDANN 3060
+ + L +
Sbjct: 822 DKLYGSEGADLLDGGEGD 839



Score = 45.3 bits (107), Expect = 4e-06
Identities = 28/77 (36%), Positives = 36/77 (46%), Gaps = 8/77 (10%)

Query: 2984 GADTVTGSSGRDMLNGGAGNDTIVGNGGVDVLEGGGGNDML--------VVNGDNIAHFT 3035
G D + G+ G D L G GNDT+ G G D L GG GND L + GD F
Sbjct: 745 GDDLIEGNDGNDRLYGDKGNDTLSGGNGDDQLYGGDGNDKLIGVAGNNYLNGGDGDDEFQ 804

Query: 3036 TPGSYWDGGIMMGGEIN 3052
G+ ++ GG+ N
Sbjct: 805 VQGNSLAKNVLFGGKGN 821



Score = 39.6 bits (92), Expect = 2e-04
Identities = 22/60 (36%), Positives = 30/60 (50%), Gaps = 1/60 (1%)

Query: 2986 DTVTGSSGRDMLNGGAGNDTIVGNGGVDVLEGGGGNDMLVVNGDNIAHFTTPG-SYWDGG 3044
D G+ G D++ G GND + G+ G D L GG G+D L N G +Y +GG
Sbjct: 738 DIFHGADGDDLIEGNDGNDRLYGDKGNDTLSGGNGDDQLYGGDGNDKLIGVAGNNYLNGG 797



Score = 38.4 bits (89), Expect = 5e-04
Identities = 20/43 (46%), Positives = 23/43 (53%)

Query: 2982 TAGADTVTGSSGRDMLNGGAGNDTIVGNGGVDVLEGGGGNDML 3024
T AD GS D+ +G G+D I GN G D L G GND L
Sbjct: 725 TTRADKFFGSKFTDIFHGADGDDLIEGNDGNDRLYGDKGNDTL 767


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
BPSL1665  RTXTOXIND  274    5e-89    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 274 bits (701), Expect = 5e-89
Identities = 94/439 (21%), Positives = 204/439 (46%), Gaps = 14/439 (3%)

Query: 43 SALGLEEASIAPARRAAALIPTVMLALLIVLVLWATFFKIDIIAAGQGKVIPSTTVQQLS 102
+ L L E ++ R A ++ L++ + + +++I+A GK+ S +++
Sbjct: 44 AHLELIETPVSRRPRLVAYF---IMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIK 100

Query: 103 TLEGGIVRELLVREGQIVKKGQPLVRLDPVVAQGAVTEQAATREGLMASIARLQAEADGK 162
+E IV+E++V+EG+ V+KG L++L + A+ + ++ R Q +
Sbjct: 101 PIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSI 160

Query: 163 ----------ATPLYPAGLKPEIVSEEEHVRAQRAEALNSTIEVLQQQRAAKQAEAADYR 212
Y + E V + ++ + + K+AE
Sbjct: 161 ELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVL 220

Query: 213 GRIPQYVNNQHLLDDQIQRMLPLVGVGSVAPNEITNLQRERGNLAAQIITTREGAAQASA 272
RI +Y N + ++ L+ ++A + + + + ++ + Q +
Sbjct: 221 ARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIES 280

Query: 273 QIAEASHKIEEKISTFRSEAREELARKQVQLQALEGTLSGKQDILDRTLIRSPVNGIVKT 332
+I A + + F++E ++L + + L L+ ++ ++IR+PV+ V+
Sbjct: 281 EILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQ 340

Query: 333 LYITTIGGVASPGKSVIDIVPTNDSLLIEARIQPQDIAYIRVGDDAKVRITAFDSGALGS 392
L + T GGV + ++++ IVP +D+L + A +Q +DI +I VG +A +++ AF G
Sbjct: 341 LKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGY 400

Query: 393 LDAKVELISPDSQADERSGSLYYKVQVRTHSSVVATQVGDLNILPGMVADVDVITGRRTI 452
L KV+ I+ D+ D+R G L + V + + ++T ++ + GM ++ TG R++
Sbjct: 401 LVGKVKNINLDAIEDQRLG-LVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSV 459

Query: 453 MSYILRPIVRGMSRAMSER 471
+SY+L P+ ++ ++ ER
Sbjct: 460 ISYLLSPLEESVTESLRER 478


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL1666  SYCDCHAPRONE  33     0.005    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 33.0 bits (75), Expect = 0.005
Identities = 21/126 (16%), Positives = 45/126 (35%), Gaps = 3/126 (2%)

Query: 898 LAPDDADAVLLRAELALDTGDFDEALSQFERLREQRPDAPESYANLIPALAALERRDDAI 957
++ D + + A +G +++A F+ L + L A+ + D AI
Sbjct: 31 ISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAI 90

Query: 958 AALQRALELNSKHPGALNNGVQFYLRTQQYDKA---MELAQRYVGAHGELASAHTMCGLV 1014
+ ++ K P + + L+ + +A + LAQ + E T +
Sbjct: 91 HSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTEFKELSTRVSSM 150

Query: 1015 YHNLKA 1020
+K
Sbjct: 151 LEAIKL 156


63  BPSL1738  BPSL1744  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
BPSL1738  0       11       2.353931   putative methyltransferase
BPSL1740  -2      12       -0.197572  putative ABC transport system, exported protein
BPSL1741  -2      12       -0.417240  putative ABC transport system, membrane protein
BPSL1742  -1      13       0.376779   putative ABC transport system, ATP-binding
BPSL1743  -1      13       1.198152   putative ABC transport system, membrane protein
BPSL1744  -1      12       1.070987   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL1741  PF06917  26     0.041    Periplasmic pectate lyase
		>PF06917#Periplasmic pectate lyase

Length = 555

Score = 26.0 bits (57), Expect = 0.041
Identities = 12/37 (32%), Positives = 17/37 (45%), Gaps = 3/37 (8%)

Query: 52 NWSALAEIRRRLHGMYWKRRRIGVWLFSFWDRSDAAE 88
+W L R HG Y K+R V+ +D + AE
Sbjct: 173 DWKTLDLGR---HGNYSKQRDPQVFTHPRYDVVNPAE 206


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL1743  ARGDEIMINASE  515    0.0      Bacterial arginine deiminase signature.
		>ARGDEIMINASE#Bacterial arginine deiminase signature.

Length = 409

Score = 515 bits (1327), Expect = 0.0
Identities = 130/423 (30%), Positives = 227/423 (53%), Gaps = 21/423 (4%)

Query: 1 MSQAIPQVGVHSEVGKLRKVLVCSPGLAHQRLTPSNCDELLFDDVMWVNQAKRDHFDFVS 60
M + + + + SE+G+L+KVL+ PG + LTP LFDD+ ++ A+++H F S
Sbjct: 1 MEEYLNPINIFSEIGRLKKVLLHRPGEELENLTPFIMKNFLFDDIPYLEVARQEHEVFAS 60

Query: 61 KMRERGVEVLEMHNLLTETVQNPAALK------WILDRKITPDNVGIGLVDEVRAWLEGL 114
++ VE+ + +L++E + + AL+ +IL+ +I D ++ ++ + L
Sbjct: 61 ILKNNLVEIEYIEDLISEVLVSSVALENKFISQFILEAEIKTDFT----INLLKDYFSSL 116

Query: 115 EPRALAEFLIGGVAASDIAGAERSKVLTLFRDYLGKSSFVLPPLPNMMFTRDTSCWIYGG 174
+ +I GV ++ S + G + F++ P+PN++FTRD I G
Sbjct: 117 TIDNMISKMISGVVTEELKNYTSSLDDLV----NGANLFIIDPMPNVLFTRDPFASIGNG 172

Query: 175 VTLNPMHWPARRQETLLVAAVYKFHPAFTDAKFDVWYGDPDRDHGMATLEGGDVMPIGRG 234
VT+N M R++ET+ ++K+HP + +W + A+LEGGD + + +G
Sbjct: 173 VTINKMFTKVRQRETIFAEYIFKYHPVYK-ENVPIWLNRWE----EASLEGGDELVLNKG 227

Query: 235 VVLVGMGERTSRQAVGQLAQALFA-KGAAERVIVAGLPNSRASMHLDTVFSFCDRDLVTV 293
++++G+ ERT ++V +LA +LF K + + ++ +P +R+ MHLDTVF+ D + T
Sbjct: 228 LLVIGISERTEAKSVEKLAISLFKNKTSFDTILAFQIPKNRSYMHLDTVFTQIDYSVFTS 287

Query: 294 FPEVVNRIVPFTLRPGGDARYGIDIEREDKPFVDVVAQALGLKSLRVVETGGNDFAAERE 353
F + L + I I++E DV++ LG K + GG+ RE
Sbjct: 288 FTSDDMYFSIYVLTYNPSSSK-IHIKKEKARIKDVLSFYLGRKIDIIKCAGGDLIHGARE 346

Query: 354 QWDDGNNMVCIEPGVVVGYDRNTYTNTLLRKAGVEVITIGSSELGRGRGGGHCMTCPVLR 413
QW+DG N++ I PG ++ Y RN TN L + G++V I SSEL RGRGG CM+ P++R
Sbjct: 347 QWNDGANVLAIAPGEIIAYSRNHVTNKLFEENGIKVHRIPSSELSRGRGGPRCMSMPLIR 406

Query: 414 DPV 416
+ +
Sbjct: 407 EDI 409


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL1745  CARBMTKINASE  404    e-145    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 404 bits (1040), Expect = e-145
Identities = 144/310 (46%), Positives = 193/310 (62%), Gaps = 13/310 (4%)

Query: 2 RIVIALGGNALLQRNQPMTEVQQRENVKIAVAQIAQ-IAPGNELVIAHGNGPQVGLLALQ 60
R+VIALGGNAL QR Q + + +NV+ QIA+ IA G E+VI HGNGPQVG L L
Sbjct: 4 RVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSLLLH 63

Query: 61 ---GAAYPAVAPYPLDVLGAQTEGMIGYLIEQEMGNLLPP---DAPFATLLTQVEVDPAD 114
G A + P+DV GA ++G IGY+I+Q + N L + T++TQ VD D
Sbjct: 64 MDAGQATYGIPAQPMDVAGAMSQGWIGYMIQQALKNELRKRGMEKKVVTIITQTIVDKND 123

Query: 115 PAFEHPTKPIGPVYSRDEAERLAQEKGWHIAPD-GDKFRRVVPSPRPRRIFEIRPVKWLL 173
PAF++PTKP+GP Y + A+RLA+EKGW + D G +RRVVPSP P+ E +K L+
Sbjct: 124 PAFQNPTKPVGPFYDEETAKRLAREKGWIVKEDSGRGWRRVVPSPDPKGHVEAETIKKLV 183

Query: 174 EKGTIVICAGGGGIPTRYDANGKLSGVEAVIDKDLCASLLARELSADLLVIATDVDGAYL 233
E+G IVI +GGGG+P + +G++ GVEAVIDKDL LA E++AD+ +I TDV+GA L
Sbjct: 184 ERGVIVIASGGGGVPVILE-DGEIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVNGAAL 242

Query: 234 DWGKPTQALIEAAHPDELERL----GFAAGSMGPKVQAAIEFARQTGHDAVIGSLADIVA 289
+G + + +EL + F AGSMGPKV AAI F G A+I L V
Sbjct: 243 YYGTEKEQWLREVKVEELRKYYEEGHFKAGSMGPKVLAAIRFIEWGGERAIIAHLEKAVE 302

Query: 290 IAEGRAGTRI 299
EG+ GT++
Sbjct: 303 ALEGKTGTQV 312


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL1746  DHBDHDRGNASE  90     1e-23    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 90.1 bits (223), Expect = 1e-23
Identities = 50/187 (26%), Positives = 81/187 (43%), Gaps = 6/187 (3%)

Query: 1 MTGKRILVTGAGSGFGREVALRLAAKGHCVIAGVQIAPQITELSAEAARRGLALDAVKLD 60
+ GK +TGA G G VA LA++G + A ++ ++ + +A D
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 61 VT-CARERAQAARWD-----VDVLLNNAGAGEAGALVDLPVDIVRELFETNVFGPLELTQ 114
V A AR + +D+L+N AG G + L + F N G ++
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 115 QVARGMIARGRGRIVFVSSIAGLITGAYTGAYCASKHALEAIAEAMHLELAAHGVQIAVV 174
V++ M+ R G IV V S + AY +SK A + + LELA + ++ +V
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 175 NPGPYRT 181
+PG T
Sbjct: 186 SPGSTET 192


64  BPSL1781  BPSL1789  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias   Product
BPSL1781  3       13       6.806087  putative L-ornithine 5-monooxygenase
BPSL1782  2       13       6.611243  putative siderophore-related non-ribosomal
BPSL1783  3       15       6.730956  putative siderophore-related non-ribosomal
BPSL1784  2       15       4.726406  putative siderophore biosynthesis related ABC
BPSL1785  3       17       3.406952  conserved hypothetical protein
BPSL1786  4       17       3.635319  putative iron transport-related exported
BPSL1787  4       25       3.079705  putative iron transport-related membrane
BPSL1789  5       23       2.476368  putative iron transport-related membrane
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL1781  FERRIBNDNGPP  116    2e-32    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 116 bits (292), Expect = 2e-32
Identities = 78/264 (29%), Positives = 113/264 (42%), Gaps = 15/264 (5%)

Query: 59 PARIVVLEFMFAEDLAALDITPVGMADPAYYPIWIGYDDARFARVSDVGTRQEPSLEAIA 118
P RIV LE++ E L AL I P G+AD Y +W+ + V DVG R EP+LE +
Sbjct: 35 PNRIVALEWLPVELLLALGIVPYGVADTINYRLWVS-EPPLPDSVIDVGLRTEPNLELLT 93

Query: 119 AAKPDLILGVGLRHAPIFDALSRIAPTVLFKYSPNYIEDGRQVTQYDWARAILRTIGCLT 178
KP ++ + P + L+RIAP F +S DG+Q AR L + L
Sbjct: 94 EMKPSFMVW-SAGYGPSPEMLARIAPGRGFNFS-----DGKQ--PLAMARKSLTEMADLL 145

Query: 179 GRARDARAVQARVDAGLARDARRIAAAGRAGERVAWLQELGLPDRYWAFTGNSASAGIAR 238
A A+ + + R + G R L L P F NS I
Sbjct: 146 NLQSAAETHLAQYEDFIRSMKPRFV---KRGARPLLLTTLIDPRHMLVFGPNSLFQEILD 202

Query: 239 ALGLE-PWPGEPTREGTAYVTSEDLLKQPDLAVLFVSATEPGVPLDAKLDSSIWRFVPAR 297
G+ W GE G+ V+ + L D+ VL +DA + + +W+ +P
Sbjct: 203 EYGIPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNSKD-MDALMATPLWQAMPFV 261

Query: 298 RAGRVALVERNIWGFGGPMSALRL 321
RAGR V +W +G +SA+
Sbjct: 262 RAGRFQRVP-AVWFYGATLSAMHF 284


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL1782  2FE2SRDCTASE  57     6e-12    Ferric iron reductase signature.
		>2FE2SRDCTASE#Ferric iron reductase signature.

Length = 262

Score = 57.4 bits (138), Expect = 6e-12
Identities = 51/186 (27%), Positives = 73/186 (39%), Gaps = 24/186 (12%)

Query: 78 RALVSQWSKYYFNLAASAGFAAALLLGRPLDMAPQRMRVALRGGMPVALLFEADALRPAQ 137
+ L+S W+++Y L A L + LD++P+ VA F D
Sbjct: 89 KPLISLWAQWYIGLMVPPLMLALLTQEKALDVSPEHFHAEFHETGRVA-CFWVDVCEDKN 147

Query: 138 AEPAS---RYAALVDH-LRATIDTLAALAKLSPRVLWANAGNLLD-YLFEQCAHAPRAGA 192
A P S R L+ L + L A +++ +++W+N G L++ YL E G
Sbjct: 148 ATPHSPQHRMETLISQALVPVVQALEATGEINGKLIWSNTGYLINWYLTEM---KQLLGE 204

Query: 193 DA------AWLFGPVDSRGEANPLRLPVRRVKPCSARLPDPFRARRVCCLRNEIPGEDQL 246
A F + GE NPL V L D RR CC R +P Q
Sbjct: 205 ATVESLRHALFFEKTLTNGEDNPLWRTV--------VLRDGLLVRRTCCQRYRLPDVQQ- 255

Query: 247 CGSCPL 252
CG C L
Sbjct: 256 CGDCTL 261


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL1784  PF05272  28     0.041    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.5 bits (63), Expect = 0.041
Identities = 12/23 (52%), Positives = 13/23 (56%)

Query: 36 VTALCGPNGCGKSTLLRTLAGLQ 58
L G G GKSTL+ TL GL
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLD 620


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL1789  DHBDHDRGNASE  122    4e-36    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 122 bits (308), Expect = 4e-36
Identities = 81/252 (32%), Positives = 118/252 (46%), Gaps = 15/252 (5%)

Query: 9 GRSFLVTGASSGIGRAAAVALRGGGARVVAAARNARELERLAHETGC-----EPLELDVG 63
G+ +TGA+ GIG A A L GA + A N +LE++ E DV
Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVR 67

Query: 64 CDASVRAALSG-ERMRDAFDGLINCAGVTSLAAAIDTTADEFDRVMAVNARGAMLVARHV 122
A++ + ER D L+N AGV + +E++ +VN+ G +R V
Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 123 ARAMIRAGRGGSIVNVSSQAALVALPSHLAYCASKAALDAMTRVLCVELGPHGIRVNSVN 182
++ M R GSIV V S A V S AY +SKAA T+ L +EL + IR N V+
Sbjct: 128 SKYM-MDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186

Query: 183 PTVTLTPMAERAWSDPHASGPMLA--------AIPLGRFARVADVVAPILFLSSDAAAMV 234
P T T M W+D + + ++ IPL + A+ +D+ +LFL S A +
Sbjct: 187 PGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHI 246

Query: 235 SGVALPVDGGYT 246
+ L VDGG T
Sbjct: 247 TMHNLCVDGGAT 258


65  BPSL1799  BPSL1819  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias   Product
BPSL1799  2       14       2.710551  hypothetical protein
BPSL1800  1       13       2.701528  hypothetical protein
BPSL1801  0       13       3.438899  putative ABC transport system, membrane protein
BPSL1802  0       14       3.673296  putative exported protein
BPSL1803  -1      14       3.040178  putative fimbrial chaperone
BPSL1804  -1      13       3.407812  putative outer membrane usher protein precursor
BPSL1805  -1      12       3.105586  putative type-1 fimbrial protein
BPSL1806  0       12       2.877570  multidrug efflux system putative membrane
BPSL1807  -2      11       3.144188  multidrug efflux system transporter protein
BPSL1808  -2      10       3.409678  multidrug efflux system putative membrane fusion
BPSL1809  -1      10       3.839035  TetR family regulatory protein
BPSL1810  1       11       4.561011  subfamily M23B unassigned peptidase
BPSL1811  3       12       4.916421  putative amino acid transport system, membrane
BPSL1812  4       13       5.594975  putative amino acid transport system, membrane
BPSL1813  3       16       4.606034  putative amino acid transport system, exported
BPSL1814  2       14       5.358322  putative membrane protein
BPSL1815  2       14       5.666136  putative membrane protein
BPSL1816  -1      14       6.036667  putative membrane protein
BPSL1817  -2      14       5.553599  putative fimbriae-related membrane protein
BPSL1818  -2      13       4.833465  putative membrane protein
BPSL1819  -1      10       4.807156  putative fimbriae assembly-related protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL1800  PF00577  677    0.0      Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 677 bits (1749), Expect = 0.0
Identities = 226/851 (26%), Positives = 353/851 (41%), Gaps = 60/851 (7%)

Query: 2 RIRHSFLCVFMLAAGSHARATEFNASFLSIDGRNDVDLSQFAQADYTLPGTYLLDVQVND 61
+R C F A + FN FL+ D + DLS+F PGTY +D+ +N+
Sbjct: 27 FVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDIYLNN 86

Query: 62 VFFGLQPIEFVAHDDGQGARACVAPELVAQFGLKKSLVENLPRTMGGRCADLASL-DGVT 120
+ + + F D QG C+ +A GL + V + C L S+ T
Sbjct: 87 GYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPLTSMIHDAT 146

Query: 121 IRYQKGEGRLKITIAQAALEFADASYLPPERWSDGVDGAMLDYRVLANANHAFGRGAQQN 180
+ G+ RL +TI QA + Y+PPE W G++ +L+Y + N R +
Sbjct: 147 AQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYN--FSGNSVQNRIGGNS 204

Query: 181 NAVQAYGTIGANWGAWRFRGDYQAQ-TRAGGAVYAERAFRFNQLYAYRALPSIRSTLSFG 239
+ G N GAWR R + + + ++ ++ + R + +RS L+ G
Sbjct: 205 HYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLTLG 264

Query: 240 EIYVDSDIFSTFSMSGVAMKSDDRMLPPSMRGYAPLVTGVARTNAIVKVMQDSRVLYMTK 299
+ Y DIF + G + SDD MLP S RG+AP++ G+AR A V + Q+ +Y +
Sbjct: 265 DGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYNST 324

Query: 300 VSPGAFALSNLN-TSVQGTLDVVVEEEDGTVQRFQVATAAVPFLAREGQLRYKTAIGQPR 358
V PG F ++++ G L V ++E DG+ Q F V ++VP L REG RY G+ R
Sbjct: 325 VPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGEYR 384

Query: 359 TFGGAGITPWFGFAEAAYGLPFDVTVYGGLIAASGYTSVAFGVGRDFGRFGALSADVTHA 418
+ P F + +GLP T+YGG A Y + FG+G++ G GALS D+T A
Sbjct: 385 SGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMTQA 444

Query: 419 RATLWWNGRTKRGNSYRINYSKHVDALDADVRFFGYRFSERDYTNFQQFSGDPTASGL-- 476
+TL + G S R Y+K ++ +++ GYR+S Y NF +
Sbjct: 445 NSTL-PDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNIE 503

Query: 477 -------------------ANGKQRYSAMLSKRFGDTST-YFSYDQTTYW-ARPSDRRIG 515
N + + ++++ G TST Y S TYW D +
Sbjct: 504 TQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSNVDEQFQ 563

Query: 516 VTLTRAFSLGALKSVNLGFSAFRTQGAGGGGNQVSLTATLPLGER-----------QTLT 564
L AF + L +S + G ++L +P + +
Sbjct: 564 AGLNTAFEDI---NWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHASAS 620

Query: 565 SSVSAGEGGTSVNAGYLYDGA---NGRTYQLYGGTTDGRASANASLRQRTPSYQ-----L 616
S+S G N +Y N +Y + G G + S T +Y+
Sbjct: 621 YSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNA 680

Query: 617 TAQASTVANAYASASLEVDGSFVATRYGVTAHANGNAGDTRLLVSTDGVPGVPLS-GSYA 675
S ++ V G +A GVT DT +LV G + +
Sbjct: 681 NIGYS-HSDDIKQLYYGVSGGVLAHANGVTLGQPL--NDTVVLVKAPGAKDAKVENQTGV 737

Query: 676 RTNARGYAVIDGVSPYNVYDATVSVEKLGLDTDVTNPIQRTVLTDGAIGYIRFNAARGRN 735
RT+ RGYAV+ + Y + L + D+ N + V T GAI F A G
Sbjct: 738 RTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIK 797

Query: 736 VFVTLTGDGGAPVPFGASVQDAATGKELGIVGEAGAAYLTQVQPRAKLVVRAGAKTICT- 794
+ +TLT + P+PFGA V + + GIV + G YL+ + K+ V+ G +
Sbjct: 798 LLMTLTHNNK-PLPFGAMVTS-ESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENAHC 855

Query: 795 --PAALPDTLQ 803
LP Q
Sbjct: 856 VANYQLPPESQ 866


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
BPSL1802  RTXTOXIND  33     0.002    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 33.3 bits (76), Expect = 0.002
Identities = 18/104 (17%), Positives = 34/104 (32%), Gaps = 2/104 (1%)

Query: 382 APRLTLPIFAGGRNRANLDVADARKHIAVAEYEKTIQTAFREV--ADALAARDQIDAQLA 439
P L LP +N + +V I Q +E+ A R + A++
Sbjct: 165 LPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARIN 224

Query: 440 AQQAVYGADAERLRLAQRRYDSGVASYLELLDAQRSTFESGQEL 483
+ + + RL + +L+ + E+ EL
Sbjct: 225 RYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNEL 268


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL1803  ACRIFLAVINRP  1079   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1079 bits (2791), Expect = 0.0
Identities = 516/1032 (50%), Positives = 701/1032 (67%), Gaps = 6/1032 (0%)

Query: 1 MARFFIDRPVFAWVISLFIMLGGIFAIRALPVAQYPDIAPPVVSLYATYPGASAQVVEES 60
MA FFI RP+FAWV+++ +M+ G AI LPVAQYP IAPP VS+ A YPGA AQ V+++
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTAVIEREMNGVPGLLYTSATS-SAGQASLSLTFKQGVSADLAAVDVQNRLKIVEARLPE 119
VT VIE+ MNG+ L+Y S+TS SAG +++LTF+ G D+A V VQN+L++ LP+
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 120 PVRRDGISIEKAADNAQIIVSLTSEDGRLSGVELGEYASANVLQALRRVEGVGKVQFWGA 179
V++ GIS+EK++ + ++ S++ + ++ +Y ++NV L R+ GVG VQ +GA
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 180 EYAMRIWPDPVKMAALGLTASDIASAVRAHNARVTIGDVGRSAVPDSAPIAATVLADAPL 239
+YAMRIW D + LT D+ + ++ N ++ G +G + + A+++A
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 240 TTPDAFGAIALRARADGSTLYLRDVARIEFGGNDYNYPSFVNGKTATGMGIKLAPGSNAV 299
P+ FG + LR +DGS + L+DVAR+E GG +YN + +NGK A G+GIKLA G+NA+
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 300 ATEKRVRATMEELAKFFPPGVKYQIPYETASFVRVSMSKVVTTLVEAGVLVFAVMFLFMQ 359
T K ++A + EL FFP G+K PY+T FV++S+ +VV TL EA +LVF VM+LF+Q
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 360 NFRATLIPTLVVPVALLGTFGAMLAAGFSINVLTMFGMVLAIGILVDDAIVVVENVERLM 419
N RATLIPT+ VPV LLGTF + A G+SIN LTMFGMVLAIG+LVDDAIVVVENVER+M
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 420 VEEKLPPYEATVKAMKQISGAIVGITVVLTSVFVPMAFFGGAVGNIYRQFAFALAVSIGF 479
+E+KLPP EAT K+M QI GA+VGI +VL++VF+PMAFFGG+ G IYRQF+ + ++
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 480 SAFLALSLTPALCATLLKPVADDHHE-KDGFFGWFNRFVARSTHRYTRRVGRVLERPLRW 538
S +AL LTPALCATLLKPV+ +HHE K GFFGWFN S + YT VG++L R+
Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540

Query: 539 LVVYGALTAAAALLITKLPAAFLPDEDQGNFMVMVIRPQGTPLAETMQSVRRVEEYVRTH 598
L++Y + A +L +LP++FLP+EDQG F+ M+ P G T + + +V +Y +
Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600

Query: 599 SPSAY--TFALGGYNLYGEGPNGGMIFVTMKDWKERKRARDQVQAIIAEINAHFAGTPNT 656
+ F + G++ G+ N GM FV++K W+ER + +A+I +
Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660

Query: 657 MVFAINMPALPDLGLTGGFDFRLQDRGGLGYGAFVAAREKLLAEGRKDPV-LTDLMFAGT 715
V NMPA+ +LG GFDF L D+ GLG+ A AR +LL + P L + G
Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720

Query: 716 QDAPQLKLDIDRAKASALGVSMEEINATLAVMFGSDYIGDFMHGSQVRRVIVQADGRHRL 775
+D Q KL++D+ KA ALGVS+ +IN T++ G Y+ DF+ +V+++ VQAD + R+
Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780

Query: 776 DAADVTKLRVRNAKGEMVPLAAFATLHWTMGPPQLTRYNGFPSFTINGAASAGHSSGEAM 835
DV KL VR+A GEMVP +AF T HW G P+L RYNG PS I G A+ G SSG+AM
Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840

Query: 836 AAIERIASTLPAGTGYAWSGQSYEERLSGAQAPMLFALSVLVVFLALAALYESWSIPFAV 895
A +E +AS LPAG GY W+G SY+ERLSG QAP L A+S +VVFL LAALYESWSIP +V
Sbjct: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900

Query: 896 MLVVPLGVIGAVAGVTLRGMPNDIYFKVGLIATIGLSAKNAILIVEVAKDLVAQR-MSLA 954
MLVVPLG++G + TL ND+YF VGL+ TIGLSAKNAILIVE AKDL+ + +
Sbjct: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960

Query: 955 DAALEAARLRLRPIVMTSLAFGVGVLPLAFATGAASGAQIAIGTGVLGGVISATLFAIFL 1014
+A L A R+RLRPI+MTSLAF +GVLPLA + GA SGAQ A+G GV+GG++SATL AIF
Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020

Query: 1015 VPLFFVCVGRVF 1026
VP+FFV + R F
Sbjct: 1021 VPVFFVVIRRCF 1032


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
BPSL1804  RTXTOXIND  40     1e-05    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 40.2 bits (94), Expect = 1e-05
Identities = 20/133 (15%), Positives = 41/133 (30%), Gaps = 5/133 (3%)

Query: 67 EVRARVAGIVTARTYEEGQEVKRGAVLFRIDPAPFKAARDAAAGALEKARAAHLAALDKR 126
E++ IV +EG+ V++G VL ++ +A +L +AR
Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILS 157

Query: 127 RRYDELVRDRAVSERDHTEALADERQAKAAVASARAELA-----RAQLQLDYATVTAPID 181
R + + E + + + + + + Q +L+ A
Sbjct: 158 RSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERL 217

Query: 182 GRARRALVTEGAL 194
R E
Sbjct: 218 TVLARINRYENLS 230



Score = 35.2 bits (81), Expect = 4e-04
Identities = 18/100 (18%), Positives = 38/100 (38%), Gaps = 10/100 (10%)

Query: 102 KAARDAAAGALEKARAAHLAALDKRRRYDELVRDRAVSERDHTEALADERQAKAAVASAR 161
LE+ + L+A ++ + +L + E L RQ +
Sbjct: 265 VNELRVYKSQLEQIESEILSAKEEYQLVTQLFK---------NEILDKLRQTTDNIGLLT 315

Query: 162 AELARAQLQLDYATVTAPIDGR-ARRALVTEGALVGQDQA 200
ELA+ + + + + AP+ + + + TEG +V +
Sbjct: 316 LELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET 355


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL1805  HTHTETR  117    5e-35    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 117 bits (295), Expect = 5e-35
Identities = 53/210 (25%), Positives = 100/210 (47%), Gaps = 4/210 (1%)

Query: 1 MARKTREESLNTKNRILDAAELVLLEKGVGQTAMADIAEAAGMSRGAVYGHFNGKIEVCV 60
MARKT++E+ T+ ILD A + ++GV T++ +IA+AAG++RGA+Y HF K ++
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 61 AVCDRAFSRAVEGFDLSDERPA---LATLRLAASHYLHQCGEPGSMQRVLEILYMKCEQS 117
+ + + S E + L+ LR H L + ++EI++ KCE
Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120

Query: 118 EENAPLMRRRALYELQTLRIAKALLRRAVAAGELDASLDVHLAGVYLLSLLEGIFGSMIW 177
E A + + + L++ + L+ + A L A L A + + + G+ + ++
Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180

Query: 178 TTRLRGDRWRDAEAMLDAGVDTLRASPALR 207
+ D ++A + ++ P LR
Sbjct: 181 APQSF-DLKKEARDYVAILLEMYLLCPTLR 209


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL1810  PYOCINKILLER  32     0.004    Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 32.1 bits (72), Expect = 0.004
Identities = 30/132 (22%), Positives = 49/132 (37%), Gaps = 4/132 (3%)

Query: 23 RVAAARNELQNAADAAALAGAASLEAGAGAPAWAAAASAAAAALSLNASDGAALSSGDVQ 82
A A+ + + A A AA+ A PA + + AA + + GAA + +
Sbjct: 226 AAAEAKRKAEEQARQQAAIRAANTYA---MPANGSVVATAAGRGLIQVAQGAASLAQAIS 282

Query: 83 TGYWNVTGVPAGLEPTTLAPGEYDVPAVQATVTRAPNQNGGPLSLLMGGLLGLVGTPAAA 142
V G P+ +A G + T + +Q + +G +G P +
Sbjct: 283 DAI-AVLGRVLASAPSVMAVGFASLTYSSRTAEQWQDQTPDSVRYALGMDAAKLGLPPSV 341

Query: 143 TAVAVAGAPATV 154
AVA A TV
Sbjct: 342 NLNAVAKASGTV 353


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL1812  SYCDCHAPRONE  31     0.004    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 31.1 bits (70), Expect = 0.004
Identities = 20/83 (24%), Positives = 32/83 (38%)

Query: 54 SVAESALAAGDAELAATLFERALKADPRSLPAQVGLGDAMYQTGELARAGVLYAQAAAAA 113
S+A + +G E A +F+ D +GLG G+ A Y+ A
Sbjct: 41 SLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMD 100

Query: 114 PDDPRAQLGLARVALRERHLDDA 136
+PR A L++ L +A
Sbjct: 101 IKEPRFPFHAAECLLQKGELAEA 123


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL1815  PF05272  30     0.032    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.7 bits (66), Expect = 0.032
Identities = 18/50 (36%), Positives = 26/50 (52%), Gaps = 4/50 (8%)

Query: 303 IVISGGTGSGKTTLLNAL---SHFIDSHERIVTIEDAAELQLQQPHVVSL 349
+V+ G G GK+TL+N L F D+H I T +D+ E Q+ L
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYE-QIAGIVAYEL 647


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
BPSL1816  HTHFIS  34     0.001    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 33.7 bits (77), Expect = 0.001
Identities = 29/165 (17%), Positives = 52/165 (31%), Gaps = 20/165 (12%)

Query: 22 GARLVAIVADAASDEVIRNLIADQAMTGAQVARGGIDDAIALMRDLSHGPQHLLVDVSGA 81
GA ++ DAA V+ ++ + + ++ DV
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYD--VRITSNAATLWRWIA--AGDGDLVVTDVV-- 56

Query: 82 AMP----LSDLARLADVCDPSVNVIVIGERNDVGLFRSMLRIGVRDYLVKPL----TVEL 133
MP L R+ P + V+V+ +N G DYL KP + +
Sbjct: 57 -MPDENAFDLLPRIKKA-RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGI 114

Query: 134 VHRALSAADPNAAARAGKAIGFVGARGGVGVTSIAVALARHLADR 178
+ RAL+ + + + G S A+ + R
Sbjct: 115 IGRALAEPKRRPSKLEDDSQDGMPLVG----RSAAMQEIYRVLAR 155


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL1818  BCTERIALGSPD  143    4e-39    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 143 bits (361), Expect = 4e-39
Identities = 68/283 (24%), Positives = 116/283 (40%), Gaps = 16/283 (5%)

Query: 127 VVQTLKPYLRQQEALVNRLTLARPIQVHLRVRITEVDRNITQQLGINWSALGA------- 179
+V + E ++ +L + RP QV + I EV LGI W+ A
Sbjct: 322 IVTAAPDVMNDLERVIAQLDIRRP-QVLVEAIIAEVQDADGLNLGIQWANKNAGMTQFTN 380

Query: 180 SGNFVGGLFNGRTLFDTASKAFDLSPSGAFSVVGGFHTSRYSIDG--VLDALDQEGLITM 237
SG + G ++ S A S G Y + +L AL +
Sbjct: 381 SGLPISTAIAGANQYNKDGTVSSSLAS-ALSSFNGIAAGFYQGNWAMLLTALSSSTKNDI 439

Query: 238 LAEPNLTAISGQTASFLAGGEFPIPVAQDTTGA----ITIQFKPYGVSLDFTPTVLADNR 293
LA P++ + A+F G E P+ TT T++ K G+ L P + +
Sbjct: 440 LATPSIVTLDNMEATFNVGQEVPVLTGSQTTSGDNIFNTVERKTVGIKLKVKPQINEGDS 499

Query: 294 ISLKVRPEVSEIDPTNSVTTGSIKVPALTVRRVDTTVELSSGQSFAIGGLLQSKSSDVLA 353
+ L++ EVS + S T+ + R V+ V + SG++ +GGLL SD
Sbjct: 500 VLLEIEQEVSSVADAASSTSSDLGA-TFNTRTVNNAVLVGSGETVVVGGLLDKSVSDTAD 558

Query: 354 ELPGLARLPVLGKLFSSRNYLNDKTEVVVIVTPYIVQPANPGE 396
++P L +PV+G LF S + K +++ + P +++ +
Sbjct: 559 KVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRDEYR 601


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL1820  PREPILNPTASE  32     8e-04    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 32.1 bits (73), Expect = 8e-04
Identities = 31/148 (20%), Positives = 49/148 (33%), Gaps = 18/148 (12%)

Query: 20 LVASWTLASLALADLRTRRLATFAVALVGALYAALALAGAPGDGGFASHAALGAAA---- 75
L+ +W L +L DL L + L+ L G A +GA A
Sbjct: 138 LLLTWVLVALTFIDLDKMLLP--DQLTLPLLWGGLLFNLLGGFVSLGD-AVIGAMAGYLV 194

Query: 76 ----FALGAAMFRAGWIAGGDVKLAAVVFLWAGPAHAWPVAFAIGVGGLAVGAVCIAAGR 131
+ + + GD KL A + W G V + G +G I
Sbjct: 195 LWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAFMGIGLILLRN 254

Query: 132 APRVLAWFAPARGVPYGVALAAGGLLAV 159
++ +P+G LA G +A+
Sbjct: 255 H-------HQSKPIPFGPYLAIAGWIAL 275


66  BPSL1881  BPSL1901  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BPSL1881   -3      12       -0.251857  putative phospholipase
BPSL1882   -2      10       -0.005020  conserved hypothetical protein
BPSL1883   -4      9        -0.639787  putative sulfate transporter membrane protein
BPSL1884   -2      10       0.680340   conserved hypothetical protein
BPSL1885   -2      11       1.500577   putative acetyltransferase
BPSL1885A  -2      10       2.261907   putative metabolite transport, membrane protein
BPSL1886   0       12       2.769848   putative TetR-family regulatory protein
BPSL1887   0       12       3.443095   putative acyl-CoA synthetase
BPSL1888   1       12       3.587094   putative exported protein
BPSL1889   1       12       4.104556   putative exported protein
BPSL1890   0       12       3.447494   Hfq protein
BPSL1891   1       12       3.493422   putative exported protein
BPSL1892   0       12       2.886067   putative sigma-54 related transcriptional
BPSL1893   -1      10       2.129688   putative membrane protein
BPSL1894   -1      11       2.529238   putative lipoprotein
BPSL1895   -1      13       0.740384   putative lipoprotein
BPSL1896   -1      13       2.772638   putative outer-membrane protein
BPSL1897   2       19       3.088105   putative fimbriae-related outer membrane
BPSL1898   2       15       1.902948   putative type II/IV secretion system ATP-binding
BPSL1899   3       15       1.613518   putative fimbriae assembly protein
BPSL1900   3       15       1.792630   putative exported fimbriae assembly protein
BPSL1901   2       14       1.611448   putative membrane fimbriae assembly protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL1881  TCRTETA  60     4e-12    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 60.2 bits (146), Expect = 4e-12
Identities = 58/261 (22%), Positives = 103/261 (39%), Gaps = 12/261 (4%)

Query: 59 IGALIFGRLADHFGRRPTLMINIACYSLLELASGFAPSLAALLVLRTLFGVAMGGEWGVG 118
A + G L+D FGRRP L++++A ++ AP L L + R + G+ G V
Sbjct: 58 ACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGIT-GATGAVA 116

Query: 119 SALTMETVPPRARGAVSGLLQAGYPSGYLLASVVFGLLYPYIGWRGMFMIGVLPALLVLY 178
A + R G + A + G + V+ GL+ + F L L L
Sbjct: 117 GAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLT 176

Query: 179 VRAKVPES-PAWKQMEKRARPGLVATLKQNWKLSIYAVVLMTAF--NFFSHGTQDLYPTF 235
+PES ++ +R +A+ + +++ A ++ F L+ F
Sbjct: 177 GCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIF 236

Query: 236 LREQHHFDPHTVSWITIVLNI-GAIVGGLTFGWLSERIGRRRAI---FIAAMIALPVLPL 291
++ H+D T+ I ++ + G ++ R+G RRA+ IA +L
Sbjct: 237 GEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAF 296

Query: 292 ----WAFSTGALALAAGAFLM 308
W + LA+G M
Sbjct: 297 ATRGWMAFPIMVLLASGGIGM 317


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL1882  HTHTETR  66     8e-15    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 65.8 bits (160), Expect = 8e-15
Identities = 21/79 (26%), Positives = 35/79 (44%)

Query: 4 RQASRQSGGTKARILDAAEDLFIEHGFEAMSMRQITSRAAVNLAAVNYHFGSKEALIHAM 63
R+ +++ T+ ILD A LF + G + S+ +I A V A+ +HF K L +
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 64 LSRRLDQLNEERLRILDRF 82
+ E L +F
Sbjct: 63 WELSESNIGELELEYQAKF 81


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BPSL1885A  cloacin  30     0.008    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 29.7 bits (66), Expect = 0.008
Identities = 25/85 (29%), Positives = 26/85 (30%), Gaps = 7/85 (8%)

Query: 76 GRGPRAGGAHGGGGRPGGREGGGHGPYGSHG----GSREPRGDGGGYGARESRGDGGYGS 131
GRG G G GG G G G S G P G G G G GG G
Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWG---GGSGH 62

Query: 132 RESRGDGGYGSRESRGDGGYGSREP 156
G+G G G P
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAP 87


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
BPSL1886  IGASERPTASE  28     0.043    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 28.1 bits (62), Expect = 0.043
Identities = 19/108 (17%), Positives = 32/108 (29%), Gaps = 9/108 (8%)

Query: 119 LFQQKAFWRVIRTASEARAEAVYRDFAKQSETLAVNELQAAKLESQKALTDRQIAVA--- 175
++A V + + T K E K T++ V
Sbjct: 1067 EVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVT 1126

Query: 176 ------QERASRLQADLSIAREQRAAVATRQKDKLDETVALREQKSER 217
QE++ +Q ARE V ++ T A EQ ++
Sbjct: 1127 SQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKE 1174


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
BPSL1887  HTHFIS  297    3e-98    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 297 bits (763), Expect = 3e-98
Identities = 130/475 (27%), Positives = 204/475 (42%), Gaps = 53/475 (11%)

Query: 19 ADIVDRVARCMSSFDVEVIRADN-EELSAERTAMRPSLAIISVSMIE-SGAAFLRTWQAE 76
A I + + +S +V N L A L + V M + + L +
Sbjct: 13 AAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLPRIKKA 72

Query: 77 -IGMPVVWVGA--------------ARDHDPSLYPPEYSHILPLDFTCAELRGMISKLAV 121
+PV+ + A A D+ P P + + ++ + +++
Sbjct: 73 RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPK--PFDLTELIGIIGRA------LAEPKR 124

Query: 122 QLRAHAAKALEPSTLVAHSDCMQALLQEVDTFADCDTNVLLHGETGVGKERIAQLLHEKH 181
+ + + LV S MQ + + + D +++ GE+G GKE +A+ LH+ +
Sbjct: 125 RPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHD-Y 183

Query: 182 SRYGMGEFVPVNCGAIPDGLFESLFFGHAKGSFTGAVGTHKGYFEQAAGGTLFLDEVGDL 241
+ G FV +N AIP L ES FGH KG+FTGA G FEQA GGTLFLDE+GD+
Sbjct: 184 GKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDM 243

Query: 242 PLYQQVKLLRVLEDGAVLRIGATAPVKVDFRLVAASNKKLPQLVKDGLFRADLYYRLAVI 301
P+ Q +LLRVL+ G +G P++ D R+VAA+NK L Q + GLFR DLYYRL V+
Sbjct: 244 PMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVV 303

Query: 302 ELSIPSLEERGPVDKIALFKSFVASIVGEDRLAALPELPYWLAEAVADSYFPGNVRELRN 361
L +P L +R D L + FV E + E + +PGNVREL N
Sbjct: 304 PLRLPPLRDR-AEDIPDLVRHFVQQAEKEGL--DVKRFDQEALELMKAHPWPGNVRELEN 360

Query: 362 LAERVGV------------------------TVRQTGGWDTARLQRLIAHARSAAQPAPA 397
L R+ + + + + + +
Sbjct: 361 LVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFG 420

Query: 398 ESAPDVFVDRSKWDMTERNRVIAALDANGWRRQDTAQHLGISRKVLWEKMRKYQI 452
++ P + E ++AAL A + A LG++R L +K+R+ +
Sbjct: 421 DALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL1889  cloacin  27     0.031    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 26.6 bits (58), Expect = 0.031
Identities = 13/32 (40%), Positives = 14/32 (43%)

Query: 99 GSAAGMSGMSGGGGGGGGGGGAGYSLAPASGS 130
GS G SG G GGG G G S + S
Sbjct: 51 GSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLS 82


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL1890  PYOCINKILLER  32     0.004    Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 31.7 bits (71), Expect = 0.004
Identities = 29/86 (33%), Positives = 37/86 (43%), Gaps = 3/86 (3%)

Query: 224 LMNQLKLAPAVRTEIRNDATRIAAAARARQRA-LARPGAPGAAASAGATLAASAAGSNGG 282
MN L A A + R AAA A+++A AA A T A A GS
Sbjct: 203 RMNTLTAAKASIEAAAANKAREQAAAEAKRKAEEQARQQ--AAIRAANTYAMPANGSVVA 260

Query: 283 AAAGKGAVAGAGASAPGAAATATAAA 308
AAG+G + A +A A A + A A
Sbjct: 261 TAAGRGLIQVAQGAASLAQAISDAIA 286


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
BPSL1894  HTHFIS  38     5e-05    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 38.3 bits (89), Expect = 5e-05
Identities = 11/63 (17%), Positives = 26/63 (41%)

Query: 79 AALRVSHPGLPIVALGSLGEPESALAALRAGVRDFIDFSAPAEDALRITRGLLDHVGDQP 138
++ + P LP++ + + +A+ A G D++ + + I L +P
Sbjct: 67 PRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRRP 126

Query: 139 SRH 141
S+
Sbjct: 127 SKL 129


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL1895  BCTERIALGSPD  138    2e-37    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 138 bits (349), Expect = 2e-37
Identities = 58/249 (23%), Positives = 111/249 (44%), Gaps = 11/249 (4%)

Query: 160 VQVDVRVVEFSRSVLKQAGLNFFKQNNGFTFGSFAPAGLASVTGGG----TSSMSVSANI 215
V V+ + E + G+ + +N G T + + +++ G S+
Sbjct: 347 VLVEAIIAEVQDADGLNLGIQWANKNAGMTQFTNSGLPISTAIAGANQYNKDGTVSSSLA 406

Query: 216 PIASAFN-LVVGSATRGLFADLSILEANNLARVLAQPTLVALSGQSASFLAGGEIPVPVP 274
S+FN + G L+ L ++ +LA P++V L A+F G E+PV
Sbjct: 407 SALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTG 466

Query: 275 QSLGT-----ISIDWKPYGVGLTLTPTVLSPRRIALKVAPESSQLDFVHSITINGVTVPA 329
+ +++ K G+ L + P + + L++ E S + S + +
Sbjct: 467 SQTTSGDNIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSVADAAS-STSSDLGAT 525

Query: 330 LTTRRADTTVELGDGESFAIGGLIDRETTSNVDKVPFLGDLPIIGTFFKHLSYQQNDKEL 389
TR + V +G GE+ +GGL+D+ + DKVP LGD+P+IG F+ S + + + L
Sbjct: 526 FNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNL 585

Query: 390 VIIVTPHLV 398
++ + P ++
Sbjct: 586 MLFIRPTVI 594


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL1898  PREPILNPTASE  54     3e-11    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 53.7 bits (129), Expect = 3e-11
Identities = 31/124 (25%), Positives = 52/124 (41%), Gaps = 10/124 (8%)

Query: 4 LFSIGFFFAWAAAVAIADCRDRRIPNELVLVGLAAVIIFTVCRQNPFGTTLSGALIGGAV 63
+ A+ D +P++L L L ++F + F +L A+IG
Sbjct: 134 TLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNL--LGGF-VSLGDAVIGAMA 190

Query: 64 GLVSLFPFFAL-------RVMGAADVKVFAVLGAWCGLPALPRLWVVASVAAGVHALALM 116
G + L+ + MG D K+ A LGAW G ALP + +++S+ + L+
Sbjct: 191 GYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAFMGIGLI 250

Query: 117 LLTR 120
LL
Sbjct: 251 LLRN 254


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL1901  cloacin  45     6e-07    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 44.7 bits (105), Expect = 6e-07
Identities = 33/117 (28%), Positives = 51/117 (43%), Gaps = 1/117 (0%)

Query: 30 GGSGTISKGLDGSGSGSGGGNAISTTGGSGSGGTSGAGGSGSGGSGSSGSTGGLSGGGGS 89
G+ + S ++G +G G G S G S GGSGSG G +G +GGG
Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSG-IHWGGGSGHGNGGGNG 69

Query: 90 TSGGGSTSGGGSTSGGTSTSSSINALGTIAGNTGGIISGAGSTVSGLGTVVGSQTLP 146
SGGGS +GG ++ + AL T + AG+ + + ++ + P
Sbjct: 70 NSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMAALKGP 126



Score = 40.5 bits (94), Expect = 1e-05
Identities = 33/123 (26%), Positives = 46/123 (37%), Gaps = 2/123 (1%)

Query: 38 GLDGSGSGSGGGNAI-STTGGSGSGGTSGAGGSGSGGSGSSGSTGGLSGGG-GSTSGGGS 95
G DG G +G + + GG G G GSG S + GG SG G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 96 TSGGGSTSGGTSTSSSINALGTIAGNTGGIISGAGSTVSGLGTVVGSQTLPGVNPQTTQA 155
+GGG+ + G + + N A G + + GL + + L A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMAA 122

Query: 156 LGG 158
L G
Sbjct: 123 LKG 125



Score = 33.9 bits (77), Expect = 0.001
Identities = 31/122 (25%), Positives = 48/122 (39%), Gaps = 1/122 (0%)

Query: 58 SGSGGTSGAGGSGSGGSGSSGSTGGLSGGGGSTSGGGSTSGGGSTSGGTSTSSSINALGT 117
SG G G+ S +G GL GGG++ G G +S GG+ +
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 118 IAGNTGGIISGAGSTVSGLGTVVGSQTLPGVNPQTTQALGGIVQDL-GGAVSALGSGVTS 176
G SG GS G + V + G +T GG+ + GA+SA + + +
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMA 121

Query: 177 GI 178
+
Sbjct: 122 AL 123


67  BPSL1908  BPSL1918  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
BPSL1908  -1      14       -1.621972  putative exported heme utilisation related
BPSL1909  0       13       -1.655307  putative exported protein
BPSL1910  0       12       -0.819483  putative transposase
BPSL1911  0       13       0.234438   conserved hypothetical protein
BPSL1912  -1      12       0.178372   dihydrolipoamide dehydrogenase
BPSL1913  -1      12       0.315897   dihydrolipoamide succinyltransferase component
BPSL1914  -1      12       -0.620701  2-oxoglutarate dehydrogenase E1 component
BPSL1915  0       11       -1.107451  putative transposase
BPSL1916  1       11       0.280791   GTP-binding protein
BPSL1917  1       11       0.390960   putative transcriptional regulatory protein
BPSL1918  1       10       0.923717   putative exported protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
BPSL1908  RTXTOXIND  29     0.028    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 29.4 bits (66), Expect = 0.028
Identities = 8/83 (9%), Positives = 26/83 (31%), Gaps = 3/83 (3%)

Query: 48 EVPAPAAGVLAQVLQNDGDTVVADQVIATID---TEAKAGAAAAAAGAADVQPAAAPVAA 104
E+ ++ +++ +G++V V+ + EA ++ A ++ + +
Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILS 157

Query: 105 PAPAAQPAAAAASSTAAASPAAS 127
+ S
Sbjct: 158 RSIELNKLPELKLPDEPYFQNVS 180


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
BPSL1911  TCRTETOQM  171    5e-48    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 171 bits (435), Expect = 5e-48
Identities = 102/435 (23%), Positives = 172/435 (39%), Gaps = 62/435 (14%)

Query: 5 LRNIAIIAHVDHGKTTLVDQLLRQSGTFRENQQVAE--RVMDSNDIEKERGITILAKNCA 62
+ NI ++AHVD GKTTL + LL SG E V + D+ +E++RGITI +
Sbjct: 3 IINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITS 62

Query: 63 VEYEGTHINIVDTPGHADFGGEVERVLSMVDSVLLLVDAVEGPMPQTRFVTKKALALGLK 122
++E T +NI+DTPGH DF EV R LS++D +LL+ A +G QTR + +G+
Sbjct: 63 FQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIP 122

Query: 123 PIVVINKIDRPGARIDWV-------------INQTFDLFDKLGATE----EQLDFPIV-- 163
I INKID+ G + V I Q +L+ + T EQ D I
Sbjct: 123 TIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEGN 182

Query: 164 -----------------YASGLNGY---ASLDP-----AARDGDMRPLFEAILQHVPVRP 198
+ SL P A + + L E I
Sbjct: 183 DDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITNKFYSST 242

Query: 199 ADPDAPLQLQITSLDYSTYVGRIGVGRITRGRIKPGQPVVMRFGPEGDVLNRKINQVLSF 258
+ L ++ ++YS R+ R+ G + V R + + KI ++ +
Sbjct: 243 HRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSV--RISEKEKI---KITEMYTS 297

Query: 259 QGLERVQVDSAEAGDIVLINGIEDVGIGATICAVEAPEALPMITVDEPTLTMNFLVNSSP 318
E ++D A +G+IV++ E + + + + + I P L +
Sbjct: 298 INGELCKIDKAYSGEIVILQN-EFLKLNSVLGDTKLLPQRERIENPLPLLQTTVEPSKPQ 356

Query: 319 LAGREGKFVTSRQIRDRLMKELNHNVALRVKDTGDETVFEVSGRGELHLTILVENMRRE- 377
+ D L++ V E + +S G++ + + ++ +
Sbjct: 357 QREMLLDALLEISDSDPLLR-------YYVDSATHEII--LSFLGKVQMEVTCALLQEKY 407

Query: 378 GYELAVSRPRVVMQE 392
E+ + P V+ E
Sbjct: 408 HVEIEIKEPTVIYME 422



Score = 33.7 bits (77), Expect = 0.002
Identities = 17/100 (17%), Positives = 32/100 (32%), Gaps = 1/100 (1%)

Query: 387 RVVMQEIDGVKHEPYELLTVDLEDEHQGGVMEELGRRKGEMLDMVSDGRGRTRLEYRIPA 446
V+++ EPY + E+ + + ++D L IPA
Sbjct: 525 EQVLKKAGTELLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQL-KNNEVILSGEIPA 583

Query: 447 RGLIGFQSEFLTLTRGTGLMSHIFDSYAPVKEGSVGERRN 486
R + ++S+ T G + Y V + R
Sbjct: 584 RCIQEYRSDLTFFTNGRSVCLTELKGYHVTTGEPVCQPRR 623


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
BPSL1914  RTXTOXIND  72     8e-16    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 72.2 bits (177), Expect = 8e-16
Identities = 40/268 (14%), Positives = 92/268 (34%), Gaps = 24/268 (8%)

Query: 94 ADSQVALQQAEANLAQTVRQVRGLYVNDDQYRAQVALRQSDLSKAQDDLRRRLAVAQTGA 153
+ Q Q E NL + + + ++Y + +S L L + A+A+
Sbjct: 196 STWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSS-LLHKQAIAKHAV 254

Query: 154 VSQE--------EISHARDAVKAAQASLDAAGQQLASNRALTANTTVADHPNVLAAAAKV 205
+ QE E+ + ++ ++ + +A ++ L N + +
Sbjct: 255 LEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLL 314

Query: 206 RN----AYLNNARNTLPAPVTGYVAKRSVQ-VGQRVSPGTPLMSVVPLNAV-WVDANFKE 259
+ + APV+ V + V G V+ LM +VP + V A +
Sbjct: 315 TLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQN 374

Query: 260 VQLKHMRIGQPVELTADIYGSSVKYHGKVIGFSAGTGAAFSLLPAQNATGNWIKVVQRLP 319
+ + +GQ + + + + +G ++G + + +V +
Sbjct: 375 KDIGFINVGQNAIIKVEAFPYTR--YGYLVG-------KVKNINLDAIEDQRLGLVFNVI 425

Query: 320 VRVELDPKELKEHPLRIGLSMQVDVDIK 347
+ +E + + + M V +IK
Sbjct: 426 ISIEENCLSTGNKNIPLSSGMAVTAEIK 453



Score = 47.5 bits (113), Expect = 6e-08
Identities = 22/165 (13%), Positives = 57/165 (34%), Gaps = 20/165 (12%)

Query: 29 VIAIAAIAYGLYYLLVARFHETTDDAYVNGNVV------QITPQVTGTVIAVKADDTQTV 82
++A + + + +++ + A NG + +I P V + + ++V
Sbjct: 59 LVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESV 118

Query: 83 KSGDPLVVLDPADSQVALQQAEANLAQTVRQVRGLYVNDDQYRAQVALRQSDLSKAQ--- 139
+ GD L+ L ++ + +++L Q +Q R Q+ R +L+K
Sbjct: 119 RKGDVLLKLTALGAEADTLKTQSSLLQA---------RLEQTRYQILSRSIELNKLPELK 169

Query: 140 --DDLRRRLAVAQTGAVSQEEISHARDAVKAAQASLDAAGQQLAS 182
D+ + + I + + + + +
Sbjct: 170 LPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRA 214


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL1915  TCRTETB  135    6e-37    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 135 bits (342), Expect = 6e-37
Identities = 84/396 (21%), Positives = 159/396 (40%), Gaps = 16/396 (4%)

Query: 27 VFMNVLDTSIANVAIPTISGDLGVSSDQGTWVITSFAVANAISVPLTGWLTDRIGQVRLF 86
F +VL+ + NV++P I+ D WV T+F + +I + G L+D++G RL
Sbjct: 23 SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLL 82

Query: 87 LASIILFVISSWMCGLAPT-LPFLLASRVLQGAVAGPMIPLSQALLLSSYPRAKAPMALA 145
L II+ S + + + L+ +R +QGA A L ++ P+ A
Sbjct: 83 LFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFG 142

Query: 146 LWSMTTLIAPVAGPILGGWISDNYSWPWIFYVNIPVGIAAAAVTWMIYRSRESAVRRAPI 205
L + GP +GG I+ W ++ IP+ I V +++ ++ +
Sbjct: 143 LIGSIVAMGEGVGPAIGGMIAHYIHWSYLL--LIPM-ITIITVPFLMKLLKKEVRIKGHF 199

Query: 206 DGVGLALLVIWVGSLQIMLDKGKDLDWFASTTIVVLALTALIAFAFFVVWELTAEHPVVD 265
D G+ L+ + G + ML F ++ + + ++++F FV P VD
Sbjct: 200 DIKGIILMSV--GIVFFML--------FTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVD 249

Query: 266 LSLFRMRNFSGGTIALSVGYGLYFGNLVLLPLWLQTQIGYTATDAG-LVMAPVGFFAILL 324
L + F G + + +G G + ++P ++ + + G +++ P I+
Sbjct: 250 PGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIF 309

Query: 325 SPLTGKFLSRTDPRYIATAAFLTFALCFWMRSRYTTGVDEWSLMAPTFVQGIAMAGFFIP 384
+ G + R P Y+ ++ F S + + FV G ++
Sbjct: 310 GYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLG-GLSFTKTV 368

Query: 385 LVSITLSGLPGHRIPAASGLSNFVRIMCGGIGTSIF 420
+ +I S L A L NF + G G +I
Sbjct: 369 ISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIV 404


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
BPSL1918  IGASERPTASE  74     2e-15    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 73.9 bits (181), Expect = 2e-15
Identities = 44/298 (14%), Positives = 97/298 (32%), Gaps = 28/298 (9%)

Query: 60 DKRKITLTRRHTS----EIKQADATGKARTIQVEVRKKRTFVKRDDVSETGADQAQAQTD 115
D ++L + K + G+ EV K+ V +++ QA +
Sbjct: 951 DHLNVSLVGNTVDLGAWKYKLRNVNGRYDLYNPEVEKRNQTVDTTNITTPNNIQADVPSV 1010

Query: 116 EQAEAELKRREE--EARREAELLEKQAQELRERQERLEREEAERRAREEAAEAERRRAEE 173
E+ R +E + + + E ++ + + A+ R +
Sbjct: 1011 PSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAK 1070

Query: 174 EAAAK-RAAAAQAEAAQQAAAAREQAQRAQSEPAEQSAQDEARAAAERAA---------- 222
EA + +A E AQ + +E E A +++A+ E+
Sbjct: 1071 EAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVS 1130

Query: 223 ----QREAAKKAEDAARE-----AADKARAEQEEIRKRREAAEAEARAIREMM--NTPRR 271
Q E + + ARE + +++ + A+ + + + + +T
Sbjct: 1131 PKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVN 1190

Query: 272 AQVKAVEPPKPAEPPAAKAAEAKGTLHKPAKPAGEAAAARPAAKKPASGAPAPAAAPA 329
VE P+ P + + +KP + + P +PA+ + + A
Sbjct: 1191 TGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVA 1248



Score = 63.2 bits (153), Expect = 5e-12
Identities = 36/321 (11%), Positives = 90/321 (28%), Gaps = 14/321 (4%)

Query: 119 EAELKRREEEARREAELLEKQAQELRERQERLEREEAERRAREEAAEA-ERRRAEEEAAA 177
E E + + + QA E + A A E A
Sbjct: 984 EVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAEN 1043

Query: 178 KRAAAAQAEAAQQAAAAREQAQRAQSEPAEQSAQDEARAAAERAAQREAAKKAEDAAREA 237
+ + E +Q A R ++ A+ + + + + E + +E
Sbjct: 1044 SKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKET 1103

Query: 238 ADKARAEQEEIRKRREAAEAEARAIREMMNTPRRAQVKAVEPPKPAEPPAAKAAEAKGTL 297
A + E+ ++ + + + +P++ Q + V+P K
Sbjct: 1104 ATVEKEEKAKVETEKTQEVPKVTS----QVSPKQEQSETVQPQAEPARENDPTVNIK--- 1156

Query: 298 HKPAKPAGEAAAARPAAKKPASGAPA-PAAAPAGDRTKKPGTGKSGWQDDAAKRRGIKTR 356
++ A +PA + ++ + ++ ++
Sbjct: 1157 EPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESS 1216

Query: 357 GDSSGGVDRGWRGGPKGRGKHQDSASSFQAPTEPIVREVHVPETISVADLAHKMSIKASE 416
R R P H ++ + V + T + A L+ +
Sbjct: 1217 NKPKNRHRRSVRSVP-----HNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFV 1271

Query: 417 VIKVMMKMGQMVTINQVLDQE 437
+ V + Q ++ ++ ++
Sbjct: 1272 ALNVGKAVSQHISQLEMNNEG 1292



Score = 56.2 bits (135), Expect = 6e-10
Identities = 32/224 (14%), Positives = 75/224 (33%), Gaps = 27/224 (12%)

Query: 71 TSEIKQADATGKARTIQVEVRKKRTF-VKRDDVSETGADQAQAQTDEQAEAELKRREEEA 129
+E + T + R + E + + ++V+++G++ + QT E E +EE+A
Sbjct: 1053 KNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKA 1112

Query: 130 RREAELLEKQAQELRERQERLEREEAERRAREEAAEAERRRAEEEAAAKRAAAAQAEAAQ 189
+ E E + QE+ + ++ ++ + + AE R + + A
Sbjct: 1113 KVETE----KTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADT 1168

Query: 190 QAAAAREQAQRAQSEPAEQSAQDEARAAAERAAQREAAKKAEDAAREAADKARAEQEEIR 249
+ A ++ +P +S + + + + + + + R
Sbjct: 1169 EQPA--KETSSNVEQPVTESTTVNTGNSVVENPENT----TPATTQPTVNSESSNKPKNR 1222

Query: 250 KRREAAEAEARAIREMMNTPRRAQVKAVEPPKPAEPPAAKAAEA 293
RR R+ VEP + + A
Sbjct: 1223 HRRS----------------VRSVPHNVEPATTSSNDRSTVALC 1250


68  BPSL1947  BPSL1952  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias   Product
BPSL1947  1       12       2.735839  50S ribosomal protein L20
BPSL1948  1       11       3.659517  50S ribosomal protein L35
BPSL1949  1       10       3.238745  translation initiation factor IF-3
BPSL1950  -1      10       2.612235  threonyl-tRNA synthetase
BPSL1951  -1      10       1.983835  *GTP pyrophosphokinase
BPSL1952  -1      8        0.095113  conserved hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL1947  SECYTRNLCASE  27     0.020    Preprotein translocase SecY subunit signature.
		>SECYTRNLCASE#Preprotein translocase SecY subunit signature.

Length = 437

Score = 27.0 bits (60), Expect = 0.020
Identities = 10/34 (29%), Positives = 15/34 (44%)

Query: 63 SVQIFISDMANFPGMNEVWDAWVAQGATPPRATV 96
S+ + +A F G N W +WV Q T +
Sbjct: 284 SLLYIPALVAQFAGGNSGWKSWVEQNLTKGDHPI 317


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL1951  PYOCINKILLER  28     0.025    Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 27.8 bits (61), Expect = 0.025
Identities = 19/68 (27%), Positives = 27/68 (39%), Gaps = 2/68 (2%)

Query: 22 AATLAPAHADTTGLIEPAHLSVDGSLPAAQRDAQILAARRYDTFWHNGDPALARAALADD 81
AA+LA A +D ++ S + A A + + R W + P R AL D
Sbjct: 274 AASLAQAISDAIAVLGRVLASAPSVM--AVGFASLTYSSRTAEQWQDQTPDSVRYALGMD 331

Query: 82 FADRTPPP 89
A PP
Sbjct: 332 AAKLGLPP 339


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL1952  PRTACTNFAMLY  29     0.038    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 28.9 bits (64), Expect = 0.038
Identities = 20/102 (19%), Positives = 33/102 (32%), Gaps = 10/102 (9%)

Query: 39 ALGAAAAPGRALAAGATATADTGAASLAGGSLRRSPAGEP---------EAAHGAFWPNG 89
A R +G + +A G GG+ R +P P A A
Sbjct: 319 AAIRVGRGARVTVSGGSLSAPHGNVIETGGARRFAPQAAPLSITLQAGAHAQGKALLYRV 378

Query: 90 ARLVISISMQFEAGGQPPTGADSPFPPVDFPPQVPVDLASAT 131
+ +++ A Q A P + P+D+A A+
Sbjct: 379 LPEPVKLTLTGGADAQGDIVATEL-PSIPGTSIGPLDVALAS 419


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL1953  DHBDHDRGNASE  74     6e-18    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 73.9 bits (181), Expect = 6e-18
Identities = 60/249 (24%), Positives = 98/249 (39%), Gaps = 19/249 (7%)

Query: 10 VLVIGGSSGIGAAAARAFAVLDADVTIASRDANKLAAAARAIDG-PRPVRQAVLDTTDAP 68
+ G + GIG A AR A A + + KL ++ R D D+
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSA 70

Query: 69 AVDA----FFAEAGPFDHVVMSAAHTPGGPVRKLPLADAQAAMDSKFWGAY----RVARA 120
A+D E GP D +V A G + L + +A G + V++
Sbjct: 71 AIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSKY 130

Query: 121 ARIAPGGSLTFVSGFLSVRPSASAVLQGAINAALEALARGLALELAP--VRVNTVSPGLV 178
GS+ V + P S + AA + L LELA +R N VSPG
Sbjct: 131 MMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGST 190

Query: 179 ATPLWSKL--GDAAREAMYASAAAR----LPARRVGQPEDIANAIVYLAATR--YATGST 230
T + L + E + + +P +++ +P DIA+A+++L + + + T
Sbjct: 191 ETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHITMHN 250

Query: 231 VLVDGGGAI 239
+ VDGG +
Sbjct: 251 LCVDGGATL 259


69  BPSL2017  BPSL2026  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias   Product
BPSL2017  -1      11       1.983056  glutaminyl-tRNA synthetase
BPSL2018  -1      11       1.698848  NUDIX family hydrolase
BPSL2019  0       11       2.209132  putative exported protein
BPSL2020  2       11       2.766663  putative membrane attached glycosyl hydrolase
BPSL2021  1       12       2.456776  hypothetical protein
BPSL2022  2       11       3.436518  Di-haem cytochrome c peroxidase
BPSL2023  3       13       2.788589  putative acid phosphatase
BPSL2024  3       12       2.952148  conserved hypothetical protein
BPSL2025  2       12       2.918827  HlyD family secretion protein
BPSL2026  4       11       2.539054  putative drug-resistance related membrane
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL2017  SSBTLNINHBTR  28     0.036    Streptomyces subtilisin inhibitor signature.
		>SSBTLNINHBTR#Streptomyces subtilisin inhibitor signature.

Length = 144

Score = 28.3 bits (62), Expect = 0.036
Identities = 28/92 (30%), Positives = 40/92 (43%), Gaps = 6/92 (6%)

Query: 37 GASAAAAVAPAALAVPAASAARPAPLAQPAAPAIVDSQPQTRAQVYEAVKQMTALGRQLF 96
G +A A P A A A+ A PA L P+A + ++ A A L
Sbjct: 12 GLTATAVCGPLAGASLASPATAPASLYAPSALVLTVGHGESAAT---AAPLRAVT---LT 65

Query: 97 FDPSLSGSGKLACASCHSPQHAFGPPNALPAQ 128
P+ SG+ A A+C + A G P+AL A+
Sbjct: 66 CAPTASGTHPAAAAACAELRAAHGDPSALAAE 97


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
BPSL2020  RTXTOXIND  99     4e-25    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 99 bits (249), Expect = 4e-25
Identities = 63/414 (15%), Positives = 134/414 (32%), Gaps = 91/414 (21%)

Query: 51 KRPGKKPLVVLAIIVVLLLVGAFVW-WFATRNQVSTDDA--YTDGNAITIAPKVSGYVVA 107
+ P + ++A ++ LV AF+ V+T + G + I P + V
Sbjct: 50 ETPVSRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKE 109

Query: 108 LAIDDNVYVHRGDLLLVIDQRDYQAQVDAARAQLGLAQAQLDAAQVQLDIA------HVQ 161
+ + + V +GD+LL + +A ++ L A+ + Q+ ++
Sbjct: 110 IIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELK 169

Query: 162 FPAQYRQAQA---QIEAAQASFRQALAAYERQHAVDARATSQQAIDVADAQRLTADANVA 218
P + ++ + ++ + ++ Q + + +D A+RLT A +
Sbjct: 170 LPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQ-----KYQKELNLDKKRAERLTVLARIN 224

Query: 219 TARAQA----------------------------RTASLVPQQIRQAQTAVEQRRQQVLQ 250
+ ++R ++ +EQ ++L
Sbjct: 225 RYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILS 284

Query: 251 AQA-----------------------------QLEAAQLALSYCEVRAPSDGWITRRNVQ 281
A+ +L + +RAP + + V
Sbjct: 285 AKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVH 344

Query: 282 -LGSFLQAGAALFAIVTPQ---LWVTANFKESQLERMRAGDRVSVSVDAYP---NLELHG 334
G + L IV P+ L VTA + + + G + V+A+P L G
Sbjct: 345 TEGGVVTTAETLMVIV-PEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVG 403

Query: 335 HVDSIQLGSGSRFSAFPPENATGNFVKIVQRVPVKIAIDGGLPRDPPLGIGLSV 388
V +I L + + G ++ + G ++ PL G++V
Sbjct: 404 KVKNINLDA-------IEDQRLGLVFNVIISIEENCLSTGN--KNIPLSSGMAV 448


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL2021  TCRTETB  102    2e-25    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 102 bits (256), Expect = 2e-25
Identities = 71/331 (21%), Positives = 140/331 (42%), Gaps = 20/331 (6%)

Query: 41 AFMEVLDTTIVNVALPHIAGTMSASYDEATWTLTSYLVANGIVLPISGFLGRLLGRKRYF 100
+F VL+ ++NV+LP IA + W T++++ I + G L LG KR
Sbjct: 23 SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLL 82

Query: 101 VLCIVAFTICSFLCGIATDLGQLIVF-RVLQGLFGGGLQPNQQSIILDTF-PPEQRNRAF 158
+ I+ S + + L++ R +QG G P +++ + P E R +AF
Sbjct: 83 LFGIIINCFGSVIGFVGHSFFSLLIMARFIQGA-GAAAFPALVMVVVARYIPKENRGKAF 141

Query: 159 SISAVAIVVAPVLGPTLGGWITDNFSWRWVFLLNVPIGVLTSLAVIQLVEDPPWKRGRAR 218
+ + + +GP +GG I W +LL +P+ +T + V L++ K R +
Sbjct: 142 GLIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPM--ITIITVPFLMKLLK-KEVRIK 196

Query: 219 GLSIDYIGITLIAIGLGCLQVMLDRGEDEDWFASTFIRTFAVLTVAGLVGATFWLLYAKK 278
G D GI L+++G+ + F +++ +F +++V + +
Sbjct: 197 G-HFDIKGIILMSVGIVFFML----------FTTSYSISFLIVSVLSFLIFVKHIRKVTD 245

Query: 279 PVVDLSCLKDRNFALGCVTIATFAVVLYGSAVLVPQLAQQRLGYTAMLAG-LVLSPGALL 337
P VD K+ F +G + + G +VP + + + G +++ PG +
Sbjct: 246 PFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMS 305

Query: 338 ITLEIPIVSKLMPYVQTRFLVCFGFLLLAAS 368
+ + I L+ +++ G L+ S
Sbjct: 306 VIIFGYIGGILVDRRGPLYVLNIGVTFLSVS 336


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
BPSL2024  HTHFIS  54     8e-11    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 53.7 bits (129), Expect = 8e-11
Identities = 41/158 (25%), Positives = 64/158 (40%), Gaps = 13/158 (8%)

Query: 2 LIADDHPLVLLGVRHMLAGMG-DVSIVGEAHDPAGLLALLAATPCDIVITDFAMPEQPAA 60
L+ADD + + L+ G DV I A L +AA D+V+TD MP+
Sbjct: 7 LVADDDAAIRTVLNQALSRAGYDVRITSNAAT---LWRWIAAGDGDLVVTDVVMPD---E 60

Query: 61 DGLAMLTAIRDGYPSVRVIVLTMLDNPVLMHTMRQAGALAVLSKRGDLDEL----PRALA 116
+ +L I+ P + V+V++ + + + GA L K DL EL RALA
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 117 AVYQGRPFVGTHAGAAGGGAMRGTDAPRQLSPREIEVV 154
+ + G + G A Q R + +
Sbjct: 121 EPKRRPS--KLEDDSQDGMPLVGRSAAMQEIYRVLARL 156


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
BPSL2025  HTHFIS  63     1e-12    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 63.3 bits (154), Expect = 1e-12
Identities = 30/122 (24%), Positives = 50/122 (40%), Gaps = 10/122 (8%)

Query: 440 RVLVVDDQEMNRIVLRYQLDALGHHARLCASGDEALRALGTAAYDVVLTDCRMPGMDGIA 499
+LV DD R VL L G+ R+ ++ R + D+V+TD MP +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 500 LTAAIRAH-PDARVRATPIVGVTALVSDAEHARCVDAGMTLCIGKP----TTLDALERAL 554
L I+ PD P++ ++A + + + G + KP + + RAL
Sbjct: 65 LLPRIKKARPD-----LPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119

Query: 555 VE 556
E
Sbjct: 120 AE 121


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL2027  PF00577  455    e-150    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 455 bits (1171), Expect = e-150
Identities = 166/808 (20%), Positives = 267/808 (33%), Gaps = 89/808 (11%)

Query: 37 GTLYLELVVN-ALSTGRIVPVRYRDGIYYARA----GDLAQASVRTGAQP-------DAL 84
GT +++ +N R V D LA + T + DA
Sbjct: 76 GTYRVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDAC 135

Query: 85 VDL-SRLDGVQVEYESAEQRLKLTVPPDWLPRQTLG--SPRLYDRTPAAVSFGLLFNYDV 141
V L S + + + +QRL LT+P ++ + G P L+D A L NY+
Sbjct: 136 VPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINA----GLLNYNF 191

Query: 142 YANSPT--LGTSYTSAWTEQRLFDRWGTVTNTGVYRRDYGGGAGGVGSNRYLRYDTFWRY 199
NS +G + A+ + G Y GS ++ W
Sbjct: 192 SGNSVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLE 251

Query: 200 SDQDRLR-TYTAGDVITGALSWSSAVRLGGVSVERDFKVRPDIVTYPLPQFSGQAAVPTA 258
D LR T GD T + + G + D + PD P G A
Sbjct: 252 RDIIPLRSRLTLGDGYTQGDIFDG-INFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQ 310

Query: 259 VDLFINGSKTTTGQVNPGPFTMNNVPFINGAGEATVVTTDALGRQVATTIPFYVANTLLQ 318
V + NG V PGPFT+N++ +G+ V +A G T+P+ L +
Sbjct: 311 VTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQR 370

Query: 319 KGLSDYSLSAGAMRRDYGIRSFSYGKFAASGTARHGLTDYLTLEGHVEGGERFALGGLGF 378
+G + YS++AG R + T HGL T+ G + +R+ G
Sbjct: 371 EGHTRYSITAGEYRSGNAQQE---KPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGI 427

Query: 379 DLGIGMFGVLGVAATQSRLAGASGRQY---------------------AFGYSYASQRF- 416
+G G L V TQ+ Q+ GY Y++ +
Sbjct: 428 GKNMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYF 487

Query: 417 SVSLQRIQRTNGFRDLS--------VYDLPANVAYRLVRSSTQATGALNLGALG----GT 464
+ + R NG+ + R Q T LG
Sbjct: 488 NFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSG 547

Query: 465 LGAGYFDVRGADGTRTRIANLSYTRPLWRRATLYASVNKTVGEHGVAAQLQLIV--PLG- 521
Y+ D N ++ W TL S+ K + G L L V P
Sbjct: 548 SHQTYWGTSNVDEQFQAGLNTAFEDINW---TLSYSLTKNAWQKGRDQMLALNVNIPFSH 604

Query: 522 ----------EPGVVTGALARDANNSFSERVQYSRSVPSDGGLGWNL--AYAGGGSHYQ- 568
+ +++ D N + ++ D L +++ YAGGG
Sbjct: 605 WLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSG 664

Query: 569 ---QADATWRNRYFQAQGGVYGYGAGRGYARWGEVQGSVVVMDGAVLPANRVDDAFVLID 625
A +R Y A G + + V G V+ V ++D VL+
Sbjct: 665 STGYATLNYRGGYGNANIGYSHSDDIKQL--YYGVSGGVLAHANGVTLGQPLNDTVVLVK 722

Query: 626 TQGRGGVPVRYENQLVGKTDGGGHLLVPWAPSYYAGKYEIDPLDLPSNVRVPIVERRVAV 685
G V ENQ +TD G+ ++P+A Y + +D L NV + V
Sbjct: 723 APGAKDAKV--ENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVP 780

Query: 686 RDHGGALVTFPIRRIVCAQIALVDAAGRPVAIGSRVLHEESGETALVGWQGETYLEGLSA 745
F R + + +P+ G+ V E S + +V G+ YL G+
Sbjct: 781 TRGAIVRAEFKARVG-IKLLMTLTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPL 839

Query: 746 LNHLRVR--TPDGRTCRATFAADIDAAQ 771
++V+ + C A + ++ Q
Sbjct: 840 AGKVQVKWGEEENAHCVANYQLPPESQQ 867


70  BPSL2061  BPSL2068  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
BPSL2061  1       11       0.497830   putative exported protein
BPSL2062  3       12       0.199063   conserved hypothetical protein
BPSL2063  2       12       -0.458308  putative exported protein
BPSL2065  -2      13       -0.656366  conserved hypothetical protein
BPSL2066  -2      13       -0.569101  conserved hypothetical protein
BPSL2067  -2      14       -0.913126  OmpA family protein
BPSL2068  1       13       -1.194434  putative membrane protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL2062  OUTRMMBRANEA  127    2e-37    Outer membrane protein A signature.
		>OUTRMMBRANEA#Outer membrane protein A signature.

Length = 346

Score = 127 bits (321), Expect = 2e-37
Identities = 68/151 (45%), Positives = 95/151 (62%), Gaps = 10/151 (6%)

Query: 87 FQCGEPAQPVAQQPQPAPAAAPAAEPIRLNADAMFAFDRADAASMTEQGRQQLSQLAQRL 146
F GE A VA P PAPA + L +D +F F++A + +G+ L QL +L
Sbjct: 191 FGQGEAAPVVA--PAPAPAPEVQTKHFTLKSDVLFNFNKAT---LKPEGQAALDQLYSQL 245

Query: 147 TDRHAQTVSIV--GYTDRLGSDAYNRQLSQARAKTVGDYLIAAGVPADSVHAEGRGASDP 204
++ + S+V GYTDR+GSDAYN+ LS+ RA++V DYLI+ G+PAD + A G G S+P
Sbjct: 246 SNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQSVVDYLISKGIPADKISARGMGESNP 305

Query: 205 LV--QCDQ-RERAALIACLAPNRRVEVVAAG 232
+ CD ++RAALI CLAP+RRVE+ G
Sbjct: 306 VTGNTCDNVKQRAALIDCLAPDRRVEIEVKG 336


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL2063  PF03895  39     4e-06    Serum resistance protein DsrA.
		>PF03895#Serum resistance protein DsrA.

Length = 79

Score = 39.4 bits (92), Expect = 4e-06
Identities = 21/77 (27%), Positives = 40/77 (51%)

Query: 1014 VARAAYGGIAAATALTMIPEVDKDKTIAVGIGGGTYRGYQAVALGATARITENIKVRAGV 1073
+++ G+A +AL+M+ + + +V G YR A+A+G +RIT+ +AGV
Sbjct: 1 LSKELQTGLANQSALSMLVQPNGVGKTSVSAAVGGYRDKTALAIGVGSRITDRFTAKAGV 60

Query: 1074 GMSSGGTTAGIGASMQW 1090
++ GAS+ +
Sbjct: 61 AFNTYNGGMSYGASVGY 77


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
BPSL2065  HTHFIS  82     4e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 81.8 bits (202), Expect = 4e-20
Identities = 35/143 (24%), Positives = 61/143 (42%), Gaps = 4/143 (2%)

Query: 1 MSRQKVVLIYLIEDDEVQARCYAAILQHAGYSVRVLPDGERALREIQRAAPDLIVLDRRL 60
M+ +++ +DD L AGY VR+ + R I DL+V D +
Sbjct: 1 MTGATILVA---DDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVM 57

Query: 61 PDIDGLEIIAWVRERCAPLPILVLTNAVLETDLVEALEAGADDYLIKPPREREFVARV-N 119
PD + +++ +++ LP+LV++ ++A E GA DYL KP E + +
Sbjct: 58 PDENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117

Query: 120 ALRRRASISKQFEGTIEIGGYRI 142
AL + E + G +
Sbjct: 118 ALAEPKRRPSKLEDDSQDGMPLV 140


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
BPSL2068  HTHFIS  79     9e-19    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 78.7 bits (194), Expect = 9e-19
Identities = 34/117 (29%), Positives = 55/117 (47%), Gaps = 1/117 (0%)

Query: 1 MSAARKVLLVEDDEAQANWAKLVLTRGRFDVTHCQTGGQAIRAMTKEVPDAVVLDMRLPD 60
M+ A +L+ +DD A L+R +DV R + D VV D+ +PD
Sbjct: 1 MTGA-TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPD 59

Query: 61 VHGLEVLVWIRRNFFDVPVIVLSNAMQEMQIVEAFSAGADDYVLKPAREAEFLARIA 117
+ ++L I++ D+PV+V+S M ++A GA DY+ KP E + I
Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


71  BPSL2177  BPSL2184  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias   Product
BPSL2177  -1      9        1.287630  hypothetical protein
BPSL2178  -2      8        1.793993  putative cardiolipin synthetase
BPSL2179  -1      9        1.826476  glutathione peroxidase
BPSL2181  -2      10       3.828584  putative ABC transport system, ATP-binding
BPSL2182  -3      11       3.989547  hypothetical protein
BPSL2183  -1      14       1.994062  DNA repair protein
BPSL2184  -2      15       1.430443  alanine racemase, catabolic
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
BPSL2178  TCRTETOQM  31     0.011    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 31.0 bits (70), Expect = 0.011
Identities = 16/79 (20%), Positives = 27/79 (34%), Gaps = 17/79 (21%)

Query: 104 LLQSLAQIASERPALYISGEESGAQIALRAQRLALLEGGASAADLKLLAEIQLEKIQATI 163
LL +L +I+ P L + + +I L L ++Q+E A +
Sbjct: 361 LLDALLEISDSDPLLRYYVDSATHEIILS-----------------FLGKVQMEVTCALL 403

Query: 164 DAERPDVAVIDSIQTIYSE 182
+ I IY E
Sbjct: 404 QEKYHVEIEIKEPTVIYME 422


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
BPSL2179  ALARACEMASE  438    e-156    Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 438 bits (1127), Expect = e-156
Identities = 207/353 (58%), Positives = 270/353 (76%)

Query: 1 MPRPISATIHTAALANNLSVVRRHAAQSKVWAIVKANAYGHGLARVFPGLRGTDGFGLLD 60
M RPI A++ AL NLS+VR+ A ++VW++VKANAYGHG+ R++ + TDGF LL+
Sbjct: 1 MTRPIQASLDLQALKQNLSIVRQAATHARVWSVVKANAYGHGIERIWSAIGATDGFALLN 60

Query: 61 LDEAVKLRELGWAGPILLLEGFFRSTDIDVIDRYSLTTAVHNDEQMRMLETARLSKPVNV 120
L+EA+ LRE GW GPIL+LEGFF + D+++ D++ LTT VH++ Q++ L+ ARL P+++
Sbjct: 61 LEEAITLRERGWKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKALQNARLKAPLDI 120

Query: 121 QLKMNSGMNRLGYTPEKYRAAWERARACPGIGQITLMTHFSDADGERGVAEQMATFERGA 180
LK+NSGMNRLG+ P++ W++ RA +G++TLM+HF++A+ G++ MA E+ A
Sbjct: 121 YLKVNSGMNRLGFQPDRVLTVWQQLRAMANVGEMTLMSHFAEAEHPDGISGAMARIEQAA 180

Query: 181 QGIAGARSFANSAAVLWHPSAHFDWVRPGIMLYGASPSGRAADIADRGLKPTMTLASELI 240
+G+ RS +NSAA LWHP AHFDWVRPGI+LYGASPSG+ DIA+ GL+P MTL+SE+I
Sbjct: 181 EGLECRRSLSNSAATLWHPEAHFDWVRPGIILYGASPSGQWRDIANTGLRPVMTLSSEII 240

Query: 241 AVQTLAKGQAVGYGSMFVAEDTMRIGVVACGYADGYPRIAPEGTPVVVDGVRTRIVGRVS 300
VQTL G+ VGYG + A D RIG+VA GYADGYPR AP GTPV+VDGVRT VG VS
Sbjct: 241 GVQTLKAGERVGYGGRYTARDEQRIGIVAAGYADGYPRHAPTGTPVLVDGVRTMTVGTVS 300

Query: 301 MDMLTVDLTPVPQAGVGARVELWGETLPIDDVAARCMTVGYELMCAVAPRVPV 353
MDML VDLTP PQAG+G VELWG+ + IDDVAA TVGYELMCA+A RVPV
Sbjct: 301 MDMLAVDLTPCPQAGIGTPVELWGKEIKIDDVAAAAGTVGYELMCALALRVPV 353


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL2180  TCRTETB  29     0.049    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 28.7 bits (64), Expect = 0.049
Identities = 31/139 (22%), Positives = 54/139 (38%), Gaps = 4/139 (2%)

Query: 13 FFSSLADSALLIAAIALLKDLHAPNWMIPLLKLFFVLSYVVLAAFVGAFADSRPKGHVMF 72
FFS L + L ++ + D + P + F+L++ + A G +D ++
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 73 ITNSIKVVGCLIMLFGAHP----LIAYGIVGFGAAAYSPAKYGILTELLPPERLVAANGW 128
I G +I G ++A I G GAAA+ ++ +P E A G
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 129 IEGTTVGSIILGTVLGGAL 147
I +G +GG +
Sbjct: 144 IGSIVAMGEGVGPAIGGMI 162


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL2184  SACTRNSFRASE  44     3e-08    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 44.2 bits (104), Expect = 3e-08
Identities = 21/71 (29%), Positives = 32/71 (45%)

Query: 74 VAPVAQRSGVGLALLREAVRIARAERLDGVLLEVRPSNPRAIRLYERFGFVSVGRRRNYY 133
VA ++ GVG ALL +A+ A+ G++LE + N A Y + F+ Y
Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGAVDTMLY 156

Query: 134 PAKHRSREDAI 144
+ E AI
Sbjct: 157 SNFPTANEIAI 167


72  BPSL2231  BPSL2239  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
BPSL2231  0       11       2.697577   hypothetical protein
BPSL2232  -1      11       2.179843   putative sulfur metabolism-related protein
BPSL2233  -1      11       1.736741   hypothetical protein
BPSL2234  -1      9        0.403381   putative siderophore non-ribosomal peptide
BPSL2235  -1      8        0.721902   putative exported protein
BPSL2236  -2      9        0.687129   putative membrane protein
BPSL2237  -1      8        0.429466   putative non-ribosomal peptide synthase
BPSL2238  -1      10       -0.282900  putative adenylylsulfate kinase
BPSL2239  -1      10       -0.373550  putative exported protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL2231  TCRTETA  38     5e-05    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 38.3 bits (89), Expect = 5e-05
Identities = 57/271 (21%), Positives = 97/271 (35%), Gaps = 13/271 (4%)

Query: 74 AFTLPIALFALLSGVAADAWDRRTVMLLSQALMFSVALCLVALAAAGAMTPARLLVCMFV 133
+ L A + G +D + RR V+L+S +V ++A A + L + V
Sbjct: 51 LYALMQFACAPVLGALSDRFGRRPVLLVS-LAGAAVDYAIMATAPFLWV----LYIGRIV 105

Query: 134 GGCAGAMFQPAWQSAVTEQVPARELSAAIALDSFSMNFARTAGPALGGFIVASVSPNAAF 193
G GA + + + E + S F AGP LGG + SP+A F
Sbjct: 106 AGITGATG-AVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLM-GGFSPHAPF 163

Query: 194 V---LSGLSYAGLIYALSRSIRGAAARPPVRERLATMLVQGVRYCGRARGIRGTLIRSSL 250
L RP R A + R+ + + +
Sbjct: 164 FAAAALNGLNFLTGCFLLPESHKGERRP--LRREALNPLASFRWARGMTVVAALMAVFFI 221

Query: 251 FGFLGSPVWALLPLFAKTQFGGEARTYGVLLASFGA-GAASGALGGAAGRARLGREALVR 309
+G AL +F + +F +A T G+ LA+FG + + A+ ARLG +
Sbjct: 222 MQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALM 281

Query: 310 LCTLTFAAGMLATAWSPCQAVAMLGLAVAGG 340
L + G + A++ +A + +
Sbjct: 282 LGMIADGTGYILLAFATRGWMAFPIMVLLAS 312



Score = 35.2 bits (81), Expect = 5e-04
Identities = 31/167 (18%), Positives = 58/167 (34%), Gaps = 8/167 (4%)

Query: 21 LAALRGPFAYRTFAAIWVAS-LVGNIGGSIQTVAASWLMTSMAPSPTMVSLVQTAFTLPI 79
LA+ R AA+ ++ +G + + T + + AF +
Sbjct: 200 LASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILH 259

Query: 80 ALF-ALLSGVAADAWDRRTVMLLSQALMFSVALCLVALAAAGAMTPARLLVCMFVGGCAG 138
+L A+++G A R ++L M + + LA A A ++ + G
Sbjct: 260 SLAQAMITGPVAARLGERRALMLG---MIADGTGYILLAFATRGWMAFPIMVLLASG--- 313

Query: 139 AMFQPAWQSAVTEQVPARELSAAIALDSFSMNFARTAGPALGGFIVA 185
+ PA Q+ ++ QV + + GP L I A
Sbjct: 314 GIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYA 360


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL2234  RTXTOXIND  43  2e-06  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 42.9 bits (101), Expect = 2e-06
Identities = 17/126 (13%), Positives = 42/126 (33%), Gaps = 21/126 (16%)

Query: 87 TVRSQVDGQITHVRFREGQQVRAGDVLVEIDRRALQAAADQATAKLEQDKATLANARLEL 146
++ + + + +EG+ VR GDVL+++ +A + + L Q + ++
Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILS 157

Query: 147 ----------------ARHQRLAEMNAAPVQML-----DTWKARVNELHAQIRGDQAAVQ 185
Q ++E + L TW+ + + + +A
Sbjct: 158 RSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERL 217

Query: 186 NARVAV 191
+
Sbjct: 218 TVLARI 223


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL2235  ACRIFLAVINRP  755  0.0  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 755 bits (1951), Expect = 0.0
Identities = 273/1033 (26%), Positives = 495/1033 (47%), Gaps = 26/1033 (2%)

Query: 9 FIRYPVATCLMTAGILFAGVAAYFHLPVAPLPQVEFPTIQVSAVLPGADPVSVASTLAQP 68
FIR P+ ++ ++ AG A LPVA P + P + VSA PGAD +V T+ Q
Sbjct: 5 FIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQV 64

Query: 69 LETQFSKIPYVTQMTSQSTLS-STSIVLQFSLERSIDAAANDVQSAIDAAAAQLPADLPS 127
+E + I + M+S S + S +I L F D A VQ+ + A LP ++
Sbjct: 65 IEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQQ 124

Query: 128 PPTFQKVNPADSPIMLLSAISSTLPLTTID--DYVETRLTKSLSQIDGVGSVSIGGQQKP 185
+ S +M+ +S T D DYV + + +LS+++GVG V + G Q
Sbjct: 125 QGIS-VEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY- 182

Query: 186 SIRIQLDPVKLASRGLSSEDVRRALSGLSGVNPKGVFNGT------TRSYTIYTNGQLTE 239
++RI LD L L+ DV L + G GT + +I +
Sbjct: 183 AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKN 242

Query: 240 PAQWNDAIV-AYRDGTPVRIRDIGQAVLGPEDNTLAAWIDGRRAISVGIYKKPGANTVST 298
P ++ + DG+ VR++D+ + LG E+ + A I+G+ A +GI GAN + T
Sbjct: 243 PEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALDT 302

Query: 299 VDKILARLPELEASLPPSLKIAVLADRTQTIRASLLDIELTLLLNVVLVVVVIYAFLGSV 358
I A+L EL+ P +K+ D T ++ S+ ++ TL ++LV +V+Y FL ++
Sbjct: 303 AKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNM 362

Query: 359 RTTIIPAVTVPVSLFGACALMWVCGYSLDNISLMAMTIAVGFVVDDAIVMVENIARH-VE 417
R T+IP + VPV L G A++ GYS++ +++ M +A+G +VDDAIV+VEN+ R +E
Sbjct: 363 RATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMME 422

Query: 418 AGERPLQAALKGLSETSFTIASISLSLVAVLLPLLLMSGIIGRMFREFAVTLSMTIIVSA 477
P +A K +S+ + I++ L AV +P+ G G ++R+F++T+ + +S
Sbjct: 423 DKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSV 482

Query: 478 FVSLTLTPMMASYLLRAHRHDAGRPPRP--GLFERAFARTAAAYERALDVALRHRFVTLC 535
V+L LTP + + LL+ + G F F + Y ++ L L
Sbjct: 483 LVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLL 542

Query: 536 AFFASVAASVFLYVGIPKGFFPQQDTGVITGISEAAQTISVEDMARHSMALAAIIRADPA 595
+ VA V L++ +P F P++D GV + + + E + + +
Sbjct: 543 IYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNEK 602

Query: 596 --VEHCQMAVGGSAYAGTTVNNGRWYITLKPRDQRDA---TADEVIRRLRPQFAKVPGVR 650
VE V G +++G N G +++LKP ++R+ +A+ VI R + + K+
Sbjct: 603 ANVES-VFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGF 661

Query: 651 MYLQAAQDVIIGARLARTQYQLTLQSA-DVGALTTWAPRLLARLSGLP-QLRDVASDQQV 708
+ ++ ++L Q+ ALT +LL + P L V +
Sbjct: 662 VIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGLE 721

Query: 709 NGSALSVAIDRDQAARYGLTPEAIDGTLYDAFGSRQVAQYFTQLSTYKVIMETLPSLQRD 768
+ + + +D+++A G++ I+ T+ A G V + + K+ ++ +
Sbjct: 722 DTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRML 781

Query: 769 PGTLDRIYMKAPSGALVPLSSVARWTTDTVQPLSVNHQSHFPSVTISFNLAPGVSLGEAT 828
P +D++Y+++ +G +VP S+ + + + PS+ I APG S G+A
Sbjct: 782 PEDVDKLYVRSANGEMVPFSAFTT-SHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840

Query: 829 AAIEAARASLRMPPAVVGSFQGTAQAFQSTLATMPMLILSALIVAYLVLGALYGSFIHPW 888
A +E + ++P + + G + + + P L+ + +V +L L ALY S+ P
Sbjct: 841 ALME--NLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPV 898

Query: 889 TILSTLPSAGVGAIATLWLFKYDFNLIALIGVILLIGIVKKNGIMMVDFAIAATRERNMT 948
+++ +P VG + LF ++ ++G++ IG+ KN I++V+FA +
Sbjct: 899 SVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKG 958

Query: 949 SLDAIRSACLLRLRPIMMTTMTALFGALPLMLTPGMGSELRQPLGYAMVGGLLVSQVLTL 1008
++A A +RLRPI+MT++ + G LPL ++ G GS + +G ++GG++ + +L +
Sbjct: 959 VVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAI 1018

Query: 1009 FTTPVIYLYLDTL 1021
F PV ++ +
Sbjct: 1019 FFVPVFFVVIRRC 1031



Score = 89.9 bits (223), Expect = 3e-20
Identities = 78/509 (15%), Positives = 164/509 (32%), Gaps = 37/509 (7%)

Query: 4 NLFAVFIRYPVATCLMTAGILFAGVAAYFHLPVAPLPQVEFPTIQVSAVLP-GADPVSVA 62
N + L+ A I+ V + LP + LP+ + LP GA
Sbjct: 528 NSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQ 587

Query: 63 STLAQPLETQF---SKIPYVTQMTSQSTLSSTS-------IVLQFSLERSIDAAANDVQS 112
L Q + + + S + + L+ ER + N ++
Sbjct: 588 KVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEER--NGDENSAEA 645

Query: 113 AIDAAAAQLPADLPSPPTFQKVNPADSPIMLLSAIS---------STLPLTTIDDYVETR 163
I A + L + I+ L + + L +
Sbjct: 646 VIHRAKME----LGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQL 701

Query: 164 LTKSLSQIDGVGSVSIGGQQ-KPSIRIQLDPVKLASRGLSSEDVRRALSGLSGVNPKGVF 222
L + + SV G + ++++D K + G+S D+ + +S G F
Sbjct: 702 LGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDF 761

Query: 223 NGTTRSYTIYTNGQ---LTEPAQWNDAIVAYRDGTPVRIRDIGQAVLGPEDNTLAAWIDG 279
R +Y P + V +G V + L +G
Sbjct: 762 IDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLER-YNG 820

Query: 280 RRAISVGIYKKPGANTVSTVDKILARLPELEASLPPSLKIAVLADRTQTIRASLLDIELT 339
++ + PG ++ +A + L + LP + + R S
Sbjct: 821 LPSMEIQGEAAPG----TSSGDAMALMENLASKLPAGIGYDW-TGMSYQERLSGNQAPAL 875

Query: 340 LLLNVVLVVVVIYAFLGSVRTTIIPAVTVPVSLFGACALMWVCGYSLDNISLMAMTIAVG 399
+ ++ V+V + + A S + + VP+ + G + D ++ + +G
Sbjct: 876 VAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIG 935

Query: 400 FVVDDAIVMVENI-ARHVEAGERPLQAALKGLSETSFTIASISLSLVAVLLPLLLMSGII 458
+AI++VE + G+ ++A L + I SL+ + +LPL + +G
Sbjct: 936 LSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAG 995

Query: 459 GRMFREFAVTLSMTIIVSAFVSLTLTPMM 487
+ + ++ + +++ P+
Sbjct: 996 SGAQNAVGIGVMGGMVSATLLAIFFVPVF 1024


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL2237  IGASERPTASE  30  0.026  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 30.4 bits (68), Expect = 0.026
Identities = 34/191 (17%), Positives = 52/191 (27%), Gaps = 22/191 (11%)

Query: 335 SDRLSLFADVGYTRNFHG--AAGGMNAFDSDVEMFSIGADYKLSEASRAGALLSSGNANG 392
S+ + L Y RN + A N + + Y G L G
Sbjct: 1331 SNNVQLGGVFTYVRNSNNFDKATSKNTL----AQVNFYSKYYADNHWYLGIDLGYGKFQS 1386

Query: 393 SLAGGQGR-IGLHAYRLGVY--HAFERAGLFVRAYAGAGWSR-----YRL--DRAAVLPG 442
L H + G+ AF + G +S + L R V P
Sbjct: 1387 KLQTNHNAKFARHTAQFGLTAGKAFNLGNFGITPIVGVRYSYLSNADFALDQARIKVNPI 1446

Query: 443 AVRASTSGFDFGALVKAGYLFALGGVRLGPVADVGYTQLVARGYTEDGDPILAQNVGVQR 502
+V+ + + D Y + LG + P+ Y G A NV Q+
Sbjct: 1447 SVKTAFAQVDLS------YTYHLGEFSVTPILSARYDANQGSGKINVNGYDFAYNVENQQ 1500

Query: 503 LKGVSAGAGVR 513

Sbjct: 1501 QYNAGLKLKYH 1511


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL2239  CARBMTKINASE  36  2e-04  Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 36.0 bits (83), Expect = 2e-04
Identities = 33/119 (27%), Positives = 56/119 (47%), Gaps = 15/119 (12%)

Query: 116 IDDERVRRDLDAGKVVIITGFQGV---DPDGHITTL-GRGGSDTSAVAVAAALEADECLI 171
++ E +++ ++ G +VI +G GV DG I + D + +A + AD +I
Sbjct: 174 VEAETIKKLVERGVIVIASGGGGVPVILEDGEIKGVEAVIDKDLAGEKLAEEVNADIFMI 233

Query: 172 YTDVDGVYTTDPRVVEEARRLDSVTFEEMLEMA--------SLGSKVLQ-IRSVEFAGK 221
TDV+G E+ + L V EE+ + S+G KVL IR +E+ G+
Sbjct: 234 LTDVNGAALYYGT--EKEQWLREVKVEELRKYYEEGHFKAGSMGPKVLAAIRFIEWGGE 290


73  BPSL2297  BPSL2307  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BPSL2297   -1   15   -1.372668  putative membrane protein
BPSL2298    0   15   -1.639020  putative iron-sulfur binding protein
BPSL2299   -1   14   -1.265130  conserved hypothetical protein
BPSL2300   -1   13   -0.837129  IclR regulatory protein
BPSL2301    0   13   -0.747585  putative transposase
BPSL2302    0   13   -0.075984  family S11 unassigned peptidase
BPSL2303    0   14    0.368956  phasin-like protein
BPSL2304    0   15    0.038219  putative dihydrolipoamide dehydrogenase
BPSL2305    0   15   -0.400317  dihydrolipoamide acetyltransferase component of
BPSL2306    0   16   -0.065172  pyruvate dehydrogenase E1 component
BPSL2307    0   16   -0.476781  sensor kinase protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL2297  SSBTLNINHBTR  28  0.027  Streptomyces subtilisin inhibitor signature.
		>SSBTLNINHBTR#Streptomyces subtilisin inhibitor signature.

Length = 144

Score = 28.3 bits (62), Expect = 0.027
Identities = 15/50 (30%), Positives = 23/50 (46%)

Query: 15 VATAAVAPADAFAATAKTAQSAKGKKSAAKKSLRAASSSAEPRAKGARKR 64
+A+ A APA +A +A G+ +A LRA + + P A G
Sbjct: 27 LASPATAPASLYAPSALVLTVGHGESAATAAPLRAVTLTCAPTASGTHPA 76


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL2300  RTXTOXIND  36  5e-04  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 35.6 bits (82), Expect = 5e-04
Identities = 12/58 (20%), Positives = 22/58 (37%)

Query: 49 VPSPSAGTVKEVKVKVGDAVSQGSLIVLLDGAQAAAQPAQANGAATSAAQPAAAPAAA 106
+ VKE+ VK G++V +G +++ L A A + + A
Sbjct: 99 IKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQIL 156



Score = 31.0 bits (70), Expect = 0.014
Identities = 12/37 (32%), Positives = 21/37 (56%)

Query: 162 VPSPAAGVVKDIKVKVGDAVSEGSLIVVLEASGGAAA 198
+ +VK+I VK G++V +G +++ L A G A
Sbjct: 99 IKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEAD 135


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL2302  PF06580  32  0.012  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 31.8 bits (72), Expect = 0.012
Identities = 18/85 (21%), Positives = 31/85 (36%), Gaps = 18/85 (21%)

Query: 711 PVLIEQVLV-NLMKNAAEAMQEARPQAENGVIRVVADLEAGFVDIRVIDQGPGVDEATAE 769
P ++ Q LV N +K+ + G I + + G V + V + G + T E
Sbjct: 256 PPMLVQTLVENGIKHGIA------QLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE 309

Query: 770 RLFEPFYSTKSDGMGMGLNICRSII 794
S G G+ N+ +
Sbjct: 310 ----------STGTGL-QNVRERLQ 323


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL2303  HTHFIS  113  2e-31  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 113 bits (283), Expect = 2e-31
Identities = 39/153 (25%), Positives = 67/153 (43%), Gaps = 4/153 (2%)

Query: 11 TVFVVDDDEAVRDSLRWLLEANGYRVQCFSSAEQFLDAYQPAQQAGQIACLILDVRMSGM 70
T+ V DDD A+R L L GY V+ S+A AG ++ DV M
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIA----AGDGDLVVTDVVMPDE 60

Query: 71 SGLELQERLIAENAALPIIFVTGHGDVPMAVSTMKKGAMDFIEKPFDEAELRKLVERMLE 130
+ +L R+ LP++ ++ A+ +KGA D++ KPFD EL ++ R L
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 131 KARNESKSVQEQRAASERLSKLTAREQQVLERI 163
+ + +++ L +A Q++ +
Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVL 153


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL2307  TCRTETA  44  8e-07  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 43.7 bits (103), Expect = 8e-07
Identities = 66/352 (18%), Positives = 120/352 (34%), Gaps = 51/352 (14%)

Query: 52 TEFGLIAAMPVLTGSLIRVPLGIWTDRYGGRIVFFILMLVTVVPIWLISYATELWQFLVL 111
+G++ A+ L LG +DR+G R V + + V +++ A LW +
Sbjct: 43 AHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIG 102

Query: 112 GLFVGLAGGSFSVGTPYVA---------RWFPKARQGLAMGVFGAGNSGAAVNKFVAPAL 162
+ G+ G + +V Y+A R F G FG G VA +
Sbjct: 103 RIVAGITGATGAVAGAYIADITDGDERARHF-----GFMSACFGFG--------MVAGPV 149

Query: 163 IA--AAGTWTIVPRVYAVAMLAMALLFWLFSATDPAHRSTNATSLRA-------QLRVLK 213
+ G P A A+ + L F + A +
Sbjct: 150 LGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGM 209

Query: 214 NPRVWRYSQYYSVVFGGYVGLSLWLTQYYVGEYGFGIQSAAFLAACFSLPGGVLRA-IGG 272
+ ++ + G V +LW + + + + A F + + +A I G
Sbjct: 210 TVVAALMAVFFIMQLVGQVPAALW-VIFGEDRFHWDATTIGISLAAFGILHSLAQAMITG 268

Query: 273 WLSDRYGAYRTTWWVMWVCWVMFFLLSYPPTDFTIRAAHGPLGFHLSLTPVAFTALLFAV 332
++ R G R M + LL++ A G + F + + LL +
Sbjct: 269 PVAARLGERRALMLGMIADGTGYILLAF--------ATRGWMAFPIMV-------LLASG 313

Query: 333 GIAMAVGKASVFKFIADEFPNDIGAVSGVVGLAGGLAGFALPILFGALVDLT 384
GI M +A + + + +E G + G + L P+LF A+ +
Sbjct: 314 GIGMPALQAMLSRQVDEE---RQGQLQGSLAALTSLTSIVGPLLFTAIYAAS 362


74  BPSL2471  BPSL2475  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BPSL2471    2   10    3.184904  conserved hypothetical protein
BPSL2472    1    8    2.703003  putative multidrug resistance protein
BPSL2473    1   11    0.930146  fumarate hydratase class II
BPSL2474    1   14    0.356899  long-chain acyl-CoA thioester hydrolase
BPSL2474A   0   11    0.608965  putative exported protein
BPSL2474B  -1    9    0.584797  ArsR family regulatory protein
BPSL2475   -3    9    0.078624  putative prolin-rich exported protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL2471  adhesinmafb  32  0.002  Neisseria meningitidis: adhesin MafB signature.
		>adhesinmafb#Neisseria meningitidis: adhesin MafB signature.

Length = 467

Score = 32.0 bits (72), Expect = 0.002
Identities = 17/67 (25%), Positives = 28/67 (41%), Gaps = 3/67 (4%)

Query: 36 SARPAGELTMIAGLSPSAASAHLARLTDGGLLAL---DVRGRHRYYRIATPDIAAAIEAL 92
R A + + ++P A A + G +A + R + P+ A +EA+
Sbjct: 254 GTRYAIDKAAMRNIAPLPAEGKFAVIGGLGSVAGFEKNTREAVDRWIQENPNAAETVEAV 313

Query: 93 ANVAQAA 99
NVA AA
Sbjct: 314 FNVAAAA 320


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL2472  IGASERPTASE  38  2e-04  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 38.1 bits (88), Expect = 2e-04
Identities = 38/249 (15%), Positives = 63/249 (25%), Gaps = 18/249 (7%)

Query: 619 PQPGVAQPTAPHAPGTPPNAMRPDAARPNEARPAPAPSARNGVPRPPAAVENPGMRDEA- 677
P+ T T PN ++ D A VP P A + A
Sbjct: 983 PEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAE 1042

Query: 678 RAPGEAPRPQPSWTQPHPPIQQQR--ANEGGPRASGEPNAPLNYRSPTQNALPPIRSTPT 735
+ E+ + + Q R A E +S ++ T
Sbjct: 1043 NSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKE 1102

Query: 736 PTHSAPPAPPPAERAQPQPGPAPRNAMRAPEAPRQEVAPPAPRNEYRAPAPAPRPQIEAP 795
E + Q P + + + + V P A PA P +
Sbjct: 1103 TATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQA------EPARENDPTVNIK 1156

Query: 796 RMEAPRMPAPRAEAPRMEPRPAPPPPAVP-------HNPPPAPRQEPPHQARP--DQQHG 846
++ E P E P ++ P P +P + +
Sbjct: 1157 EPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESS 1216

Query: 847 FAPRREERR 855
P+ RR
Sbjct: 1217 NKPKNRHRR 1225


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL2474  HTHFIS  376  e-129  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 376 bits (966), Expect = e-129
Identities = 133/388 (34%), Positives = 202/388 (52%), Gaps = 40/388 (10%)

Query: 101 FDYVTVPYECDRIVESVGHAYGMVTLSEGLAPAAATVRNEGEMVGTCEAMLALFKMIRKV 160
+DY+ P++ ++ +G A + + ++ +VG AM +++++ ++
Sbjct: 99 YDYLPKPFDLTELIGIIGRA--LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARL 156

Query: 161 ASTDAPVFISGESGTGKELTAVAIHERSSRAGAPFVAINCGAIPPTLLQAELFGYERGAF 220
TD + I+GESGTGKEL A A+H+ R PFVAIN AIP L+++ELFG+E+GAF
Sbjct: 157 MQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAF 216

Query: 221 TGANQRKIGRIEAANGGTLFLDEIGDLPFESQASLLRFLQEHKVERVGGHQSIPVDVRII 280
TGA R GR E A GGTLFLDEIGD+P ++Q LLR LQ+ + VGG I DVRI+
Sbjct: 217 TGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIV 276

Query: 281 SATHVDMQIALRNGRFREDLYHRLCVLKLEEPPLRERGKDIEILARHMLERFKGDAHRRL 340
+AT+ D++ ++ G FREDLY+RL V+ L PPLR+R +DI L RH +++ + + +
Sbjct: 277 AATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEG-LDV 335

Query: 341 RGFTPDAIAALHNYAWPGNVRELINRVRRAIVMSEGRMISAADLELSGYAEVA------- 393
+ F +A+ + + WPGNVREL N VRR + +I+ +E +E+
Sbjct: 336 KRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKA 395

Query: 394 ------------------------------PMSLEEARESAERHAIEVALLRHRGRLADA 423
+ E I AL RG A
Sbjct: 396 AARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKA 455

Query: 424 ARELGVSRVTLYRLLCAYGMRDDGGARA 451
A LG++R TL + + G+ +R+
Sbjct: 456 ADLLGLNRNTLRKKIRELGVSVYRSSRS 483


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL2475  HTHFIS  357  e-121  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 357 bits (917), Expect = e-121
Identities = 145/453 (32%), Positives = 216/453 (47%), Gaps = 46/453 (10%)

Query: 53 VHVARSANEAARRVKPNQPQAGIADL---DGFAPRELPTLEAVLRQQQVGWIALAGDTRI 109
V + +A R + + D+ D A LP ++ V + ++
Sbjct: 30 VRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLPRIKKARPDLPV--LVMSAQNTF 87

Query: 110 NDPDVRRLIRQYCFDYMQGLPPHETIDYLVGHAYGMVALCDLDVTAGAAATGDEMVGACD 169
+ + +DY+ + ++G A + G +VG
Sbjct: 88 MTA--IKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR-RPSKLEDDSQDGMPLVGRSA 144

Query: 170 AMQQLFRTIRKVAATDATVFISGESGTGKELTALAIHERSERRKAPFVAINCGAIPNHLL 229
AMQ+++R + ++ TD T+ I+GESGTGKEL A A+H+ +RR PFVAIN AIP L+
Sbjct: 145 AMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLI 204

Query: 230 QSELFGYERGAFTGASQRKVGRVEAADGGTLFLDEIGDMPLESQASMLRFLQEGKIERLG 289
+SELFG+E+GAFTGA R GR E A+GGTLFLDEIGDMP+++Q +LR LQ+G+ +G
Sbjct: 205 ESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVG 264

Query: 290 GHESIPVDVRIISATHVDLDAAMREGRFRDDLYHRLCVLKLDEPPLRARGKDIEILAHHI 349
G I DVRI++AT+ DL ++ +G FR+DLY+RL V+ L PPLR R +DI L H
Sbjct: 265 GRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHF 324

Query: 350 LHQFRSDGARRIHGFTSCAIEAMYNYHWPGNVRELINRIRRAIVMSDSRQLSAADLDL-- 407
+ Q +G + F A+E M + WPGNVREL N +RR + ++ ++
Sbjct: 325 VQQAEKEG-LDVKRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVITREIIENEL 383

Query: 408 -----------------------------------APFAALQATTLAEARERAERRTIEA 432
A + E I A
Sbjct: 384 RSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILA 443

Query: 433 SLLRHRNRLTEAAAELGVSRATLYRLMVSHGLR 465
+L R +AA LG++R TL + + G+
Sbjct: 444 ALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476


75  BPSL2679  BPSL2686  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BPSL2679    4   42   -9.348939  putative epimerase/dehydratase
BPSL2680    4   38   -9.004077  putative glycosyl transferase
BPSL2681    2   29   -7.297770  putative glycosyl transferase
BPSL2682    1   23   -5.205920  putative O-antigen methyl transferase
BPSL2683   -1   16   -2.621983  putative glycosyl transferase
BPSL2684   -3   10   -0.805782  putative epimerase/dehydratase
BPSL2685   -3   10    0.072390  putative O-antigen acetylase
BPSL2686   -4    9    1.323736  ABC transporter, ATP-binding component
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL2679  NUCEPIMERASE  168  2e-51  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 168 bits (428), Expect = 2e-51
Identities = 83/363 (22%), Positives = 136/363 (37%), Gaps = 58/363 (15%)

Query: 13 KILVTGGAGFIGCAISERLAARASRYVVMDNLHPQIHASAVRPGALHEKAE----LVVAD 68
K LVTG AGFIG +S+RL + V +DNL+ + +++ L A+ D
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDY-YDVSLKQARLELLAQPGFQFHKID 60

Query: 69 VTDAGAWDALLSDFQPEIIIHLAAETGTGQSLTEASRHALVNVVGTTRLTDALVKHGIVV 128
+ D L + E + SL +A N+ G + + + +
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNK--I 118

Query: 129 EHILLTSSRAVYGEGAWQKDDGTIVYPGQRGRAQLEAAQWDFPGMTMLPSRADRTEPRPT 188
+H+L SS +VYG +P D + P
Sbjct: 119 QHLLYASSSSVYGLN------------------------------RKMPFSTDDSVDHPV 148

Query: 189 SVYGATKLAQEHVLRAWSLATKTPLSILRLQNVYGPGQSLTNSYTGIVALFSRLAREKKV 248
S+Y ATK A E + +S P + LR VYGP + F++ E K
Sbjct: 149 SLYAATKKANELMAHTYSHLYGLPATGLRFFTVYGPWGRPDMALF----KFTKAMLEGKS 204

Query: 249 IPLYEDGNVTRDFVSIDDVADAIVATLARTPEA-----------------LSLFDIGSGQ 291
I +Y G + RDF IDD+A+AI+ P A +++IG+
Sbjct: 205 IDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSS 264

Query: 292 ATSILDMARIIAAHYGAPEPQINGAFRDGDVRHAACDLSESLANLGWKPQWSLKRGIGEL 351
++D + + G + + GDV + D +G+ P+ ++K G+
Sbjct: 265 PVELMDYIQALEDALGIEAKKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNF 324

Query: 352 QTW 354
W
Sbjct: 325 VNW 327


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL2682  ABC2TRNSPORT  30  0.007  ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 30.3 bits (68), Expect = 0.007
Identities = 16/59 (27%), Positives = 24/59 (40%)

Query: 195 LFTMVLMFLSPVFYPASALPEKYRFWLELNPLTLFIEQSRGILLEGRVPDFHPLGLAFL 253
L ++FLS +P LP ++ PL+ I+ R I+L V D A
Sbjct: 184 LVITPILFLSGAVFPVDQLPIVFQTAARFLPLSHSIDLIRPIMLGHPVVDVCQHVGALC 242


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL2683  NUCEPIMERASE  58  7e-12  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 58.3 bits (141), Expect = 7e-12
Identities = 32/160 (20%), Positives = 56/160 (35%), Gaps = 27/160 (16%)

Query: 1 MKILVTGANGQVGWELARSLAVLGQVV-----------PLTRE--------------QAD 35
MK LVTGA G +G+ +++ L G V ++ + D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 36 LGRPETLARIVEDAKPDVVVNAAAYTAVDAAETDGAAANVINGEA-VGVLAAATKRVGGL 94
L E + + + V + AV + + A N + +L
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 95 FVHYSTDYVFDGTKPSPYIETDPT-CPVNAYGASKLLGEL 133
++ S+ V+ + P+ D PV+ Y A+K EL
Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANEL 160


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL2686  NUCEPIMERASE  175  3e-54  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 175 bits (445), Expect = 3e-54
Identities = 90/350 (25%), Positives = 136/350 (38%), Gaps = 45/350 (12%)

Query: 2 ILVTGGAGFIGANFVLDWLAQSDEAVLNVDKLT--YAGNLGTLK-SLQGNPKHVFARVDI 58
LVTG AGFIG + L + V+ +D L Y +L + L P F ++D+
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQ-VVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 59 CDRAAIDALLAQHKPRAIVHFAAESHVDRSIHGPADFVQTNVVGTFTLLEAARQYWSALG 118
DR + L A + V S+ P + +N+ G +LE R
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHN----K 117

Query: 119 PDAKAAFRFLHVSTDEVFGSLSPADPQFSETTPYA-PNSPYSATKAGSDHLVRAYHHTYG 177
L+ S+ V+G L+ P FS P S Y+ATK ++ + Y H YG
Sbjct: 118 IQ-----HLLYASSSSVYG-LNRKMP-FSTDDSVDHPVSLYAATKKANELMAHTYSHLYG 170

Query: 178 LPVLTTNCSNNYGPYQFPEKLIPLMIANALGGKPLPVYGDGQNVRDWLYVGDHCSAIREV 237
LP YGP+ P+ + L GK + VY G+ RD+ Y+ D AI +
Sbjct: 171 LPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRL 230

Query: 238 L------------------ARGVPGETYNVGGWNEKKNLDVVHTLCDLLD-EARPKAGGS 278
A P YN+G + + +D + L D L EA+
Sbjct: 231 QDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKN---- 286

Query: 279 YRDQITYVTDRPGHDRRYAIDARKLERELGWKPAETFETGLAKTVRWYLD 328
+ +PG + D + L +G+ P T + G+ V WY D
Sbjct: 287 ------MLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRD 330


76  BPSL2734  BPSL2740  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BPSL2734   -2   12    2.002488  putative membrane protein
BPSL2735   -2   11    2.376674  putative membrane protein
BPSL2736   -2   12    2.223005  putative AraC-family transcriptional regulatory
BPSL2737   -2   13    2.259531  putative aminotransferase
BPSL2738   -3   14    1.729131  putative LysR-family transcriptional regulator
BPSL2739   -3   13    1.356026  probable D-lactate dehydrogenase
BPSL2740   -3   12    1.666025  putative oxidoreductase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL2734  SECA  34  0.001  SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 33.7 bits (77), Expect = 0.001
Identities = 27/87 (31%), Positives = 42/87 (48%), Gaps = 9/87 (10%)

Query: 116 LNRRLPRAVARTREGDFSLNGLLGFDLFGKTVGVIGTGLI--GSVFARIMTGFGMRVLAH 173
L +P A A RE + G+ FD V ++G G++ A + TG G + L
Sbjct: 60 LENLIPEAFAVVREASKRVFGMRHFD-----VQLLG-GMVLNERCIAEMRTGEG-KTLTA 112

Query: 174 SLPPHDDALIALGVRYVPLDALLAEAD 200
+LP + +AL GV V ++ LA+ D
Sbjct: 113 TLPAYLNALTGKGVHVVTVNDYLAQRD 139


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL2736  TCRTETA  51  5e-09  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 50.6 bits (121), Expect = 5e-09
Identities = 95/399 (23%), Positives = 149/399 (37%), Gaps = 37/399 (9%)

Query: 5 LFALAVAAFGIGTTEFVIMGLLPNVARDLGVSIPAA---GMLVSGYALGVTIGAPILAVV 61
L +A+ A GIG +IM +LP + RDL S G+L++ YAL AP+L +
Sbjct: 11 LSTVALDAVGIG----LIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66

Query: 62 TAKMPRKAALLALIGVFIVGNLFCAIAPGYATLMVARVVTAFCHGAFFGIGSVVASNLVA 121
+ + R+ LL + V A AP L + R+V G+ +A ++
Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIA-DITD 125

Query: 122 PNKRAQAIALMFTGLTLANVLGVPLGTALGQAFGWRATFWAVTGIGALAAAALAFCVPKR 181
++RA+ M V G LG +G F A F+A + L F +P+
Sbjct: 126 GDERARHFGFMSACFGFGMVAGPVLGGLMGG-FSPHAPFFAAAALNGLNFLTGCFLLPES 184

Query: 182 LEMPAAGIAREFGVLRNPQVLMVLGISVLASASLFTVFTYIAPI-----------LEDVT 230
+ + RE NP + A+L VF + + ED
Sbjct: 185 HKGERRPLRRE---ALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRF 241

Query: 231 GFTPHDVTLVLLLFG-LGLTVGGTVGGKLADW---RRIPSLVATLASIGVVLAAFAGTMR 286
+ + + L FG L + G +A RR L G +L AFA
Sbjct: 242 HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGW 301

Query: 287 TPLPALVTIFVWGVLAFAIVPPLQILIVDRAS-HAPNLASTLNQGAFNLGNALGAWLGGT 345
P +V + G+ +P LQ ++ + +L + +G L
Sbjct: 302 MAFPIMVLLASGGI----GMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTA 357

Query: 346 AIHAGVPLAK-LPW-AGAAL---AMAALALTLWSASLER 379
A + W AGAAL + AL LWS + +R
Sbjct: 358 IYAASITTWNGWAWIAGAALYLLCLPALRRGLWSGAGQR 396


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL2737  TCRTETA  47  7e-08  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 47.1 bits (112), Expect = 7e-08
Identities = 95/395 (24%), Positives = 152/395 (38%), Gaps = 25/395 (6%)

Query: 21 LILSVAVVGLGTGATLPLTALALTEAGHGTRIV---GILTAAQAGGGLAVVPFVTAITKR 77
++ +VA+ +G G +P+ L + H + GIL A A A P + A++ R
Sbjct: 10 ILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDR 69

Query: 78 LGARQVIVASVVVLAAATALMQFTSNLVVWGVLRVVCGAALMLLFTIGEAWVNQLADDAT 137
G R V++ S+ A A+M L V + R+V G G A++ + D
Sbjct: 70 FGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAG-AYIADITDGDE 128

Query: 138 RGRVVAIYATNFTLFQMAGPVLVSQIAGMTH-----VRFALSGTLFLLAL----PSLASI 188
R R + F +AGPVL + G + AL+G FL S
Sbjct: 129 RARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGE 188

Query: 189 RKTPIADEPHHDAHDRWTRVIPKMPALVVGTAFFALFDTLALSLLPIFAMAR--GVASEA 246
R+ + + A RW R + + AL+ L + +L IF R A+
Sbjct: 189 RRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTI 248

Query: 247 AVLFAAILLFGDTAMQFPIGWLADKLGRERVHLGAGCVVLALLPLLPAVVTTPWLCWPLL 306
+ AA + A G +A +LG ER L G + +L A T W+ +P++
Sbjct: 249 GISLAAFGILHSLAQAMITGPVAARLG-ERRALMLGMIADGTGYILLAFATRGWMAFPIM 307

Query: 307 FVLGAAAGSVYTL----SLVACGERFRGSALVTASSLVSASWSAASFGGPLVAGALMEQF 362
+L + + L S ER +G + ++L S S GPL+ A+
Sbjct: 308 VLLASGGIGMPALQAMLSRQVDEER-QGQLQGSLAALT----SLTSIVGPLLFTAIYAAS 362

Query: 363 GGDALIGVLIVSAIAFVGAALWERRALPMQAARRG 397
I A ++ RR L A +R
Sbjct: 363 ITTWNGWAWIAGAALYLLCLPALRRGLWSGAGQRA 397


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL2740  TCRTETA  44  8e-07  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 44.0 bits (104), Expect = 8e-07
Identities = 77/398 (19%), Positives = 124/398 (31%), Gaps = 59/398 (14%)

Query: 50 VAPSVIAEWGVKKQA---LGPVFSASLFGMLLGALGLSVLADRIGRRPVLIGATLFFALA 106
V P ++ + G + + A L L+DR GRRPVL+ + A+
Sbjct: 27 VLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVD 86

Query: 107 MLATPFATSIPILIALRFVTGLGLGCIMPNAMALVGECSPSAHRVKRM----MIVSCGFT 162
A + +L R V G+ G A A + + + R + G
Sbjct: 87 YAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAYIADITDGDERARHFGFMSACFGFGMV 145

Query: 163 LGAALGGFVSAALIPAFGWRAVFFVGGAVPLALAAAMAASLPESPQLLVLRGRHDAARAW 222
G LGG + F A FF A+ LPES H R
Sbjct: 146 AGPVLGGLMGG-----FSPHAPFFAAAALNGLNFLTGCFLLPES---------HKGERRP 191

Query: 223 LAKFAPRLAVPPDTRLVVREAGPRGAPVAELFRSGRARVTLLLWAINF-MNLIDLYFLSN 281
L + A P+A + V L A+ F M L+ +
Sbjct: 192 LRREALN-------------------PLASFRWARGMTVVAALMAVFFIMQLVGQVPAAL 232

Query: 282 WLPTVMRDAGYASGTAVIVGTVLQTGGVIGTLS----LGWFIERHGFARVLFACFACATI 337
W+ + A +G L G++ +L+ G R G R L
Sbjct: 233 WVIFGEDRFHW---DATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGT 289

Query: 338 AIGLIGSVAHAFVWLLAAVFVGGFCVVGGQPAVNALAGHYYPTSLRSMGIGWSLGVGRVG 397
L+ ++ V + + G PA+ A+ + G + +
Sbjct: 290 GYILLAFATRGWMAFPIMVLLASGGI--GMPALQAMLSRQVDEERQGQLQGSLAALTSLT 347

Query: 398 SVLGPLVGGQLIA--------LGWSNDALFHAAAVPVL 427
S++GPL+ + A W A + +P L
Sbjct: 348 SIVGPLLFTAIYAASITTWNGWAWIAGAALYLLCLPAL 385


77  BPSL3142  BPSL3146  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
BPSL3142    0   10   -1.262489  imidazole glycerol phosphate synthase subunit
BPSL3143    0   10   -1.122702  phosphoribosylformimino-5-aminoimidazole
BPSL3144   -1   11   -0.812618  imidazole glycerol phosphate synthase subunit
BPSL3145   -2   12   -0.526621  putative membrane protein
BPSL3146   -4   11    0.731835  imidazoleglycerol-phosphate dehydratase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL3143  ABC2TRNSPORT  74  1e-17  ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 73.8 bits (181), Expect = 1e-17
Identities = 60/243 (24%), Positives = 100/243 (41%), Gaps = 6/243 (2%)

Query: 7 LFYKEILRFWKVSFQTVLAPVVTALLYLTIFGHALTGRVNVYPGVEYVSFLVPGLVMMSV 66
++ + + + K + ++L + L+YL G L V GV Y +FL G+V S
Sbjct: 19 VWRRNYIAWKKAALASLLGHLAEPLIYLFGLGAGLGVMVGRVGGVSYTAFLAAGMVATSA 78

Query: 67 LQNA-FANSSSSLIQSKITGNLVFMLLPPLSSADIFGAYVLASVVRGLAVGAGVFVVTVW 125
+ A F ++ + + ML L DI + + + GAG+ VV
Sbjct: 79 MTAATFETIYAAFGRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAGIGVVAAA 138

Query: 126 FIPMSFAAPLYIVAFALFGSAILGTLGLIAGIWAEKFDQLAAFQNFLIMPLTFLSGVFYS 185
+ + LY + +LG++ A +D +Q +I P+ FLSG +
Sbjct: 139 LGYTQWLSLLYALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFP 198

Query: 186 THSLPPVWREVSRLNPFFYMIDGFRYGFFG--IADVNPLASLS---VVAGFFVLLALIAM 240
LP V++ +R P + ID R G + DV +V FF+ AL+
Sbjct: 199 VDQLPIVFQTAARFLPLSHSIDLIRPIMLGHPVVDVCQHVGALCIYIVIPFFLSTALLRR 258

Query: 241 RLL 243
RLL
Sbjct: 259 RLL 261


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL3144  PF05272  28  0.039  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.5 bits (63), Expect = 0.039
Identities = 11/19 (57%), Positives = 13/19 (68%)

Query: 39 LLGPNGAGKTTLISILAGL 57
L G G GK+TLI+ L GL
Sbjct: 601 LEGTGGIGKSTLINTLVGL 619


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
BPSL3146  FLGMOTORFLIG  28  0.024  Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 28.2 bits (63), Expect = 0.024
Identities = 12/73 (16%), Positives = 22/73 (30%)

Query: 74 RTTQLAMGRNWRTATPAQQQQVIEQFKQLLIRTYSGALAQLKPDQQIQYPPFRADADATD 133
R + A ++ +Q +T + L+ L P + T+
Sbjct: 107 NLGSALQSRPFEFVRRADPANILNFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTN 166

Query: 134 VVVRTVAMNNGQP 146
V R M+ P
Sbjct: 167 VARRIALMDRTSP 179


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL3147  VACJLIPOPROT  224    2e-74    VacJ lipoprotein signature.
		>VACJLIPOPROT#VacJ lipoprotein signature.

Length = 251

Score = 224 bits (572), Expect = 2e-74
Identities = 85/220 (38%), Positives = 114/220 (51%), Gaps = 8/220 (3%)

Query: 32 AAAALSGCATVQTPTKG--DPFEGFNRTMYTFNDKV-DQYALKPVARGYQWAVPQPMRDS 88
L GCA+ T +G DP EGFNRTMY FN V D Y ++PVA ++ VPQP R+
Sbjct: 11 GTTLLVGCASSGTDQQGRSDPLEGFNRTMYNFNFNVLDPYIVRPVAVAWRDYVPQPARNG 70

Query: 89 VTNFFSNIGDVYIAANNLVQLKIADGVGDIMRVVINTVFGVGGLFDVATLAKLPKHAND- 147
++NF N+ + + N +Q G+ R +NT+ G+GG DVA +A +
Sbjct: 71 LSNFTGNLEEPAVMVNYFLQGDPYQGMVHFTRFFLNTILGMGGFIDVAGMANPKLQRTEP 130

Query: 148 --FGVTLGHYGVPSGPYLVLPLLGPSTVRDTAGLAVDYAGNPLTYVRPDGVSWGLFGLNL 205
FG TLGHYGV GPY+ LP G T+RD G D A P+ +S G + L
Sbjct: 131 HRFGSTLGHYGVGYGPYVQLPFYGSFTLRDDGGDMAD-ALYPVLSWLTWPMSVGKWTLEG 189

Query: 206 VNTRANLLGAGDVLEAAAIDKYSFVRNAYLQRRQALIGGA 245
+ TRA LL + +L + D Y VR AY QR + G
Sbjct: 190 IETRAQLLDSDGLLR-QSSDPYIMVREAYFQRHDFIANGG 228


78  BPSL3295  BPSL3308  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
BPSL3295  -2      9        0.350122   putative oxidoreductase
BPSL3296  -1      10       0.172904   putative dienelactone hydrolase
BPSL3297  -2      10       0.339694   putative amidase
BPSL3298  -2      13       -0.137674  5,10-methylenetetrahydrofolate reductase
BPSL3299  -1      12       0.841561   putative membrane protein
BPSL3300  -1      11       0.568435   adenosylhomocysteinase
BPSL3301  0       11       1.559381   RNA polymerase sigma factor for flagellar
BPSL3302  1       11       1.570768   flagellar biosynthesis protein FlhG
BPSL3303  2       11       1.316432   flagellar biosynthesis protein FlhF
BPSL3304  2       12       1.079696   flagellar biosynthesis protein FlhA
BPSL3305  2       11       -0.473473  flagellar biosynthetic protein FlhB
BPSL3306  2       12       -0.273905  conserved hypothetical protein
BPSL3307  0       12       -2.337489  Gly/Ala/Ser-rich lipoprotein
BPSL3308  -1      13       -2.431186  putative exported protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
BPSL3295  TYPE3IMSPROT  359    e-125    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 359 bits (922), Expect = e-125
Identities = 108/344 (31%), Positives = 181/344 (52%), Gaps = 2/344 (0%)

Query: 12 DRTEAATPKRREKAREEGQVARSRELASFALLSAGFYGAWMLSGPIGEHLRTMLHTAFSF 71
++TE TPK+ AR++GQVA+S+E+ S AL+ A LS EH ++
Sbjct: 4 EKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLM--LIPA 61

Query: 72 DRAAAFDTNRMLSHAGTLSLEGLYALAPVLALTGVAALAAPMAMGGWLVSTKTFELKFER 131
+++ + + + LE Y P+L + + A+A+ + G+L+S + + ++
Sbjct: 62 EQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIKK 121

Query: 132 LNPITGLGRIFSIQGPIQLGMSIAKTLVVGGIGGIAIWRSKDELLGLATQPLHAALADAL 191
+NPI G RIFSI+ ++ SI K +++ + I I + LL L T +
Sbjct: 122 INPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLLG 181

Query: 192 HLVAVCCGMTVAGMLVVAGLDVPYQLWQYNKKLRMTKEEVKREHRENEGDPHVKGRIRQQ 251
++ + G +V++ D ++ +QY K+L+M+K+E+KRE++E EG P +K + RQ
Sbjct: 182 QILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQF 241

Query: 252 QRAMARRRMMANVPTADVVVTNPTHFAVALKYTDGEMRAPKVVAKGVNLVAARIRELAAE 311
+ + R M NV + VVV NPTH A+ + Y GE P V K + +R++A E
Sbjct: 242 HQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAEE 301

Query: 312 HHVPLLEAPPLARALYHNVELEREIPGTLYSAVAEVLAWVYQLK 355
VP+L+ PLARALY + ++ IP A AEVL W+ +
Sbjct: 302 EGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQN 345


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL3297  cloacin  32     0.004    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 32.0 bits (72), Expect = 0.004
Identities = 14/43 (32%), Positives = 20/43 (46%)

Query: 28 GGGGDGGSNASVNTGTGGGDTSAGGGSNGGTGGTGGSGSTPLA 70
GGG G + +G G G + G GTGG + + P+A
Sbjct: 47 GGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVA 89



Score = 30.1 bits (67), Expect = 0.022
Identities = 17/56 (30%), Positives = 22/56 (39%)

Query: 17 AAATAALVAACGGGGDGGSNASVNTGTGGGDTSAGGGSNGGTGGTGGSGSTPLASN 72
A +T+ + G G AS +G + GGGS G GGSG N
Sbjct: 13 AHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGN 68



Score = 29.7 bits (66), Expect = 0.025
Identities = 21/61 (34%), Positives = 27/61 (44%), Gaps = 7/61 (11%)

Query: 28 GGGGDGGSNASVNTGTGGGDTS-------AGGGSNGGTGGTGGSGSTPLASNQAAITVST 80
GG DG +S N GGG S +G G+ GG G +GG T + A V+
Sbjct: 31 GGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAF 90

Query: 81 G 81
G
Sbjct: 91 G 91


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
BPSL3300  HTHFIS  87     5e-23    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 86.8 bits (215), Expect = 5e-23
Identities = 32/110 (29%), Positives = 52/110 (47%), Gaps = 4/110 (3%)

Query: 12 MDKSMKILVVDDFPTMRRIVRNLLKELGYSNVDEAEDGLAGLARLRGGGYDFVISDWNMP 71
M + ILV DD +R ++ L GY V + + G D V++D MP
Sbjct: 1 MTGA-TILVADDDAAIRTVLNQALSRAGYD-VRITSNAATLWRWIAAGDGDLVVTDVVMP 58

Query: 72 NLDGLAMLKEIRADASLTHLPVLMVTAESKKENIIAAAQAGASGYVVKPF 121
+ + +L I+ LPVL+++A++ I A++ GA Y+ KPF
Sbjct: 59 DENAFDLLPRIKKARP--DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPF 106


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
BPSL3301  HTHFIS  66     4e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 66.4 bits (162), Expect = 4e-14
Identities = 31/143 (21%), Positives = 61/143 (42%), Gaps = 13/143 (9%)

Query: 7 KIKVLCVDDSALIRSLMTEIINSQPDMEVCATAPDPLVARELIKQHNPDVLTLDVEMPRM 66
+L DD A IR+++ + ++ + I + D++ DV MP
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAG--YDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 67 DGLDFLEKLMRLRP-MPVVMVSSLTERGSEITLRALELGAVDFVTKPRVGIRDGMLDYSE 125
+ D L ++ + RP +PV+++S+ ++A E GA D++ KP D +E
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNT--FMTAIKASEKGAYDYLPKP--------FDLTE 110

Query: 126 KLADKVRAASRARVRQNPQPHAA 148
+ RA + + R + +
Sbjct: 111 LIGIIGRALAEPKRRPSKLEDDS 133


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
BPSL3306  PF06580  46     3e-07    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 46.0 bits (109), Expect = 3e-07
Identities = 21/151 (13%), Positives = 50/151 (33%), Gaps = 52/151 (34%)

Query: 449 ELDKSLIERIIDPLT--HLVRNSLDHGIETVEARRAAGKDAVGQLVLSAAHHGGNIVIEV 506
+++ ++++ + P+ LV N + HGI G+++L G + +EV
Sbjct: 245 QINPAIMDVQVPPMLVQTLVENGIKHGIA--------QLPQGGKILLKGTKDNGTVTLEV 296

Query: 507 SDDGAGLNRERILAKAAKQGMQISENISDDEVWNLIFAPGFSTAEVVTDVSGRGVGMDVV 566
+ G+ + G G+ V
Sbjct: 297 ENTGSLALKNTK--------------------------------------ESTGTGLQNV 318

Query: 567 KRNIQSMGG---HVEISSQAGRGTTTRIVLP 594
+ +Q + G +++S + G +++P
Sbjct: 319 RERLQMLYGTEAQIKLSEKQG-KVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
BPSL3307  HTHFIS  71     8e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 71.4 bits (175), Expect = 8e-18
Identities = 38/114 (33%), Positives = 58/114 (50%), Gaps = 2/114 (1%)

Query: 4 TILAIDDSATMRTLLSATLGEAGYDVTVASDGEVGLDVALATRFDLVLTDHHMPRKNGLE 63
TIL DD A +RT+L+ L AGYDV + S+ A DLV+TD MP +N +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 64 LIVALRRQLGYEATPILVLTTENGDAFKDAARAAGATGWIEKPIDPDALIELVA 117
L+ +++ P+LV++ +N A GA ++ KP D LI ++
Sbjct: 65 LLPRIKKA--RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
BPSL3308  OMPADOMAIN  40     1e-05    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 39.5 bits (92), Expect = 1e-05
Identities = 25/117 (21%), Positives = 51/117 (43%), Gaps = 9/117 (7%)

Query: 182 FAMSSDAVEPYMRDILREIGKTLNDV---PNRIIVQGHTDAVPYAGGEKGYSNWELSADR 238
F + ++P + L ++ L+++ ++V G+TD + G Y N LS R
Sbjct: 223 FNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRI----GSDAY-NQGLSERR 277

Query: 239 ANASRRELIAGGMDEAKVLRV-LGLASTQNLNKADPLDPENRRISIIVLNRKSELAL 294
A + LI+ G+ K+ +G ++ N D + I + +R+ E+ +
Sbjct: 278 AQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEV 334



 
Contact Sachin Pundhir for Bugs/Comments.