PredictBias

Identification of genomic and pathogenicity islands in prokaryotic genomes
 
A) Input parameters
Genome: 2243.gbk
Threshold dinucleotide bias: 2
Threshold codon bias: 4
Threshold %GC bias: 3
E-value (RPSBlast): 0.05
Genome (non-pathogenic): (none given)
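As a rough illustration of how the E-value (RPSBlast) parameter of 0.05 could act as a cut-off on virulence-factor hits, the Python sketch below keeps only hits whose E-value is at or below that value. The `Hit` class, the `passes_cutoff` function, and the inclusive comparison are assumptions made for this example and are not taken from PredictBias itself. The first three sample hits are copied from this report; the last one is hypothetical.

```python
from dataclasses import dataclass

# E-value (RPSBlast) from the input parameters of this report.
E_VALUE_CUTOFF = 0.05


@dataclass
class Hit:
    """One similarity hit (hypothetical container, not a PredictBias type)."""
    locus_tag: str
    profile: str
    score: float    # bit score
    e_value: float  # expectation value


def passes_cutoff(hit: Hit, cutoff: float = E_VALUE_CUTOFF) -> bool:
    """Return True when the hit's E-value is at or below the cut-off.

    Treating the cut-off as inclusive is an assumption of this sketch.
    """
    return hit.e_value <= cutoff


hits = [
    Hit("Sbal_0145", "BCTERIALGSPC", 29.5, 0.017),  # from this report
    Hit("Sbal_0146", "PF06057", 27.5, 0.039),       # from this report
    Hit("Sbal_0171", "IGASERPTASE", 69.7, 2e-14),   # from this report
    Hit("Sbal_9999", "EXAMPLE", 20.0, 0.2),         # hypothetical, above cut-off
]

# Locus tags of the hits that would be kept under the assumed rule.
reported = [h.locus_tag for h in hits if passes_cutoff(h)]
print(reported)
```

Under this assumed rule, a smaller cut-off would keep fewer and stronger hits, while a larger one would admit more marginal matches.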
 
 
C) Potential GIs and PAIs in NC_009052
S.No  Start      End        Bias  Virulence  Insertion elements  Prediction
1     Sbal_0060  Sbal_0066  Y     N          N                   Genomic Island
LocusTag   DNBias  CDNBias    %GCBias  Product
Sbal_0060       0       24  -3.592898  3-deoxy-D-manno-octulosonic-acid transferase
Sbal_0061       1       28  -7.070250  3-deoxy-D-manno-octulosonic-acid kinase
Sbal_0062       1       32  -8.420137  glycosyl transferase family protein
Sbal_0063       2       34  -9.347985  hypothetical protein
Sbal_0064       2       31  -8.197418  hypothetical protein
Sbal_0065       0       24  -6.003360  glycosyl transferase family protein
Sbal_0066       0       20  -5.095569  group 1 glycosyl transferase
2     Sbal_0144  Sbal_0151  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias    %GCBias  Product
Sbal_0144       3       17   1.821571  MATE efflux family protein
Sbal_0145       3       17   2.097191  polysaccharide deacetylase
Sbal_0146       3       20   2.108573  electron transport protein SCO1/SenC
Sbal_0147       3       18   2.177735  protoheme IX farnesyltransferase
Sbal_0148       2       18   2.380395  cytochrome oxidase assembly
Sbal_0149       3       17   2.889373  hypothetical protein
Sbal_0150       2       15   2.933125  hypothetical protein
Sbal_0151       2       14   2.696125  cytochrome c oxidase subunit III
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_0145  BCTERIALGSPC  30     0.017    Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C

signature.
Length = 272

Score = 29.5 bits (66), Expect = 0.017
Identities = 28/105 (26%), Positives = 48/105 (45%), Gaps = 13/105 (12%)

Query: 1 MVKRVLLALIGLMTFSAHAVVILQYH-HVSETTP-AATSVTPAQFREQMQFLAD-DGFKV 57
+++R+L L LM + ++ + + + P ++ +TPAQ R+Q L D F V
Sbjct: 13 VIRRILFYL--LMLLFCQQLAMIFWRIGLPDNAPVSSVQITPAQARQQPVTLNDFTLFGV 70

Query: 58 IPLSQVVEAIKQKQ--DLPAKTVAITF------DDGYRSIATTAH 94
P A+ Q +LP T+ ++ DD RSIA +
Sbjct: 71 SPEKNKAGALDASQMSNLPPSTLNLSLTGVMAGDDDSRSIAIISK 115


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_0146  PF06057       27     0.039    Type IV secretory pathway VirJ component
		>PF06057#Type IV secretory pathway VirJ component

Length = 243

Score = 27.5 bits (61), Expect = 0.039
Identities = 14/42 (33%), Positives = 19/42 (45%), Gaps = 4/42 (9%)

Query: 70 IGFTFCPDVCPTTLNKLAAAYPDLNKIAPLQVVFLSVDPKRD 111
IG++F +V P LN++ A Y L V LS D
Sbjct: 122 IGYSFGAEVIPFVLNEMPARYRK----NVLGAVLLSPSQSSD 159


3     Sbal_0164  Sbal_0171  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias    %GCBias  Product
Sbal_0164       4       20  -1.473403  hypothetical protein
Sbal_0165       2       18  -1.115031  hypothetical protein
Sbal_0166       5       16   1.021159  hypothetical protein
Sbal_0167       3       17   1.798442  hypothetical protein
Sbal_0168       2       15   1.772668  NapC/NirT cytochrome c domain-containing
Sbal_0169       2       15   1.709446  hypothetical protein
Sbal_0170       3       12   2.352938  putative methyltransferase
Sbal_0171       2       11   2.271008  signal recognition particle-docking protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_0171  IGASERPTASE   70     2e-14    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 69.7 bits (170), Expect = 2e-14
Identities = 45/315 (14%), Positives = 95/315 (30%), Gaps = 17/315 (5%)

Query: 4 KGFFSWFRKDKSQDEVVAETPVVTPTQDTEAAERLEQERAEAQRLAVEAEAQAAAAKLAA 63
G + + + + +T +T + +A E EA A +
Sbjct: 975 NGRYDLYNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPS 1034

Query: 64 EQLAAEQAQAERIVQEQAAIEAQRLAEQQAEAARLAAAQLEAEQLAKVQAERIAQEQAQI 123
E + AE QE +E EQ A ++ E + V+A E AQ
Sbjct: 1035 ET---TETVAENSKQESKTVEKN---EQDATETTAQNREVAKEAKSNVKANTQTNEVAQS 1088

Query: 124 EAQRLAEQQAETARLAAAQLEAEQLAVDKLAAEQAAAEQLAQAQAKAEAERIAQEQAQIE 183
++ ++ T A +E E+ A + Q + +Q K E Q QA+
Sbjct: 1089 GSE--TKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPA 1146

Query: 184 AQR----LAEQQAEAARLAAAQLESERLAANNLAQAQAKAEAERIDHEQAQIEAQRLAEQ 239
+ ++ A + + ++N+ Q ++ + +
Sbjct: 1147 RENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPAT 1206

Query: 240 QAEAARLAAAQLEAERARVAAEQAAEALAAEQLAAEALAREQAEALAQQQAEATQTAGPV 299
++ R R + + + +A +T T +
Sbjct: 1207 TQPTVNSESSNKPKNRHRRSVRS-----VPHNVEPATTSSNDRSTVALCDLTSTNTNAVL 1261

Query: 300 TELQPEPQARPVKEG 314
++ + + Q + G
Sbjct: 1262 SDARAKAQFVALNVG 1276



Score = 40.8 bits (95), Expect = 2e-05
Identities = 32/202 (15%), Positives = 59/202 (29%), Gaps = 15/202 (7%)

Query: 11 RKDKSQDEVVAETPVVTPTQDTEAAERLEQERAEAQRLAVEAEAQAAAAKLAAEQLAAEQ 70
K K + E E P VT + +QE++E + E + E +
Sbjct: 1110 EKAKVETEKTQEVPKVT------SQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTN 1163

Query: 71 AQAERIVQEQAAIEAQRLAEQQAEAARLAAAQLEAEQLAKVQAERIAQEQAQIEAQRLAE 130
A+ EQ A E EQ + + + Q E+ +
Sbjct: 1164 TTAD---TEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPK 1220

Query: 131 QQAETARLAAAQLEAEQLAVDKLAAEQAAAEQLAQAQAKAEAERIAQEQAQIEAQRLAEQ 190
+ + + + V+ +A + A+ +AQ +A
Sbjct: 1221 NRHRRS------VRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVALN 1274

Query: 191 QAEAARLAAAQLESERLAANNL 212
+A +QLE N+
Sbjct: 1275 VGKAVSQHISQLEMNNEGQYNV 1296


4     Sbal_0221  Sbal_0239  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias    %GCBias  Product
Sbal_0221       2       15   0.247763  Sel1 domain-containing protein
Sbal_0222       1       15  -0.904649  hypothetical protein
Sbal_0223       1       14  -1.334536  hypothetical protein
Sbal_0224       2       19  -1.875291  cytochrome c-type biogenesis protein CcmE
Sbal_0225       2       18  -2.338706  Heme exporter protein CcmD
Sbal_0226       1       19  -1.209502  heme exporter protein CcmC
Sbal_0227      -1       17  -1.493856  heme exporter protein CcmB
Sbal_0228      -1       19  -1.687365  cytochrome c biogenesis protein CcmA
Sbal_0229       0       21  -1.459319  cytochrome c class I
Sbal_0230      -1       20  -1.752114  hypothetical protein
Sbal_0231       0       19  -1.557528  cytochrome c-type biogenesis protein CcmF
Sbal_0232       1       17  -3.405801  periplasmic protein thiol--disulfide
Sbal_0233       0       13  -1.547460  cytochrome C biogenesis protein
Sbal_0234       1       17  -0.036356  alkyl hydroperoxide reductase
Sbal_0235       1       21   0.689340  *hypothetical protein
Sbal_0236       1       25   1.941662  hypothetical protein
Sbal_0237       0       26   2.486704  N-acetyltransferase GCN5
Sbal_0238      -1       28   3.298817  integral membrane sensor signal transduction
Sbal_0239      -2       29   3.264916  two component transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_0228  PF05272       32     0.002    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 32.0 bits (72), Expect = 0.002
Identities = 9/20 (45%), Positives = 13/20 (65%)

Query: 41 IEGPNGAGKTSLLRILAGLS 60
+EG G GK++L+ L GL
Sbjct: 601 LEGTGGIGKSTLINTLVGLD 620


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_0231  PF06580       30     0.036    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.8 bits (67), Expect = 0.036
Identities = 18/109 (16%), Positives = 38/109 (34%), Gaps = 15/109 (13%)

Query: 3 PELGHFALIIGVAFAFLLISVPLIGVARKDQYLVRYAWPLTYGMFFFITV---SVVVLAY 59
P+L I ++ L+++ ++ R W L M I + VV+
Sbjct: 37 PKLHSMIFNIAISLMGLVLTHAY------RSFIKRQGW-LKLNMGQIILRVLPACVVIGM 89

Query: 60 SFAVDDFSVAYVAHHSNSQLPIFFKISAVWGGHEGSLLFWVFALSTWAA 108
+ V + S+ + N++ P+ F + + V W+
Sbjct: 90 VWFVANTSIWRLLAFINTK-PVAFTLPLA----LSIIFNVVVVTFMWSL 133


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_0237  SACTRNSFRASE  28     0.009    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 28.4 bits (63), Expect = 0.009
Identities = 12/39 (30%), Positives = 21/39 (53%)

Query: 85 NVYVNAHYRNKGLGKLLVNAVVDYAKALGLQKIYLFTAD 123
++ V YR KG+G L++ +++AK + L T D
Sbjct: 94 DIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQD 132


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_0239  HTHFIS        76     2e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 76.4 bits (188), Expect = 2e-18
Identities = 30/129 (23%), Positives = 61/129 (47%)

Query: 2 RLLLIEDDTDLVARLIPALNKAGYTVEHADNGIDGAFLGEEENFEAVILDLGLPGKPGLQ 61
+L+ +DD + L AL++AGY V N + + V+ D+ +P +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VLGQWRQKGLAMPVLILTARDAWHERVDGLKAGADDYLGKPFHIEELLARLEVLIRRHFG 121
+L + ++ +PVL+++A++ + + + GA DYL KPF + EL+ + +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 122 RADNVLQHA 130
R + +
Sbjct: 125 RPSKLEDDS 133


5     Sbal_0322  Sbal_0355  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias    %GCBias  Product
Sbal_0322       0       20   3.359374  rhodanese domain-containing protein
Sbal_0323       0       22   3.623990  XRE family transcriptional regulator
Sbal_0324       2       22   3.298044  benzoate transporter
Sbal_0325       2       22   3.223543  N-acetyltransferase GCN5
Sbal_0326       1       21   3.371925  hypothetical protein
Sbal_0327       0       20   4.335689  3-oxoacyl-(acyl carrier protein) synthase II
Sbal_0328       1       20   4.114864  3-ketoacyl-ACP reductase
Sbal_0329       1       20   3.790168  thioester dehydrase family protein
Sbal_0330       2       19   3.742991  3-oxoacyl-ACP synthase
Sbal_0331       1       19   3.799505  hypothetical protein
Sbal_0332       1       18   3.720340  monooxygenase FAD-binding
Sbal_0333       1       18   3.520401  hypothetical protein
Sbal_0334       0       20   3.160670  hypothetical protein
Sbal_0335      -1       20   3.033101  thioesterase superfamily protein
Sbal_0336      -1       20   2.986132  histidine ammonia-lyase
Sbal_0337      -1       19   1.881879  glycosyl transferase family protein
Sbal_0338       0       19   1.893750  thioester dehydrase family protein
Sbal_0339       0       18   2.485326  aconitate hydratase
Sbal_0340       0       13   1.322263  hypothetical protein
Sbal_0341      -1       15   3.156148  acyl carrier protein
Sbal_0342      -1       17   3.735368  acyl carrier protein
Sbal_0343      -1       18   3.376153  phospholipid/glycerol acyltransferase
Sbal_0344       0       19   3.259165  hypothetical protein
Sbal_0345       0       19   2.479188  hypothetical protein
Sbal_0346      -1       19   3.176612  ATP-dependent DNA helicase RecG
Sbal_0347       0       19   2.047455  two component LuxR family transcriptional
Sbal_0348      -1       18   2.290924  integral membrane sensor signal transduction
Sbal_0349      -1       20   2.743477  hypothetical protein
Sbal_0350      -1       20   2.594760  CaCA family Na(+)/Ca(+) antiporter
Sbal_0351       0       16   3.748574  AMP-dependent synthetase and ligase
Sbal_0352      -1       13   3.590034  putative endoribonuclease L-PSP
Sbal_0353      -1       13   3.320085  bifunctional (p)ppGpp synthetase II/
Sbal_0354       0       13   3.430785  DNA-directed RNA polymerase subunit omega
Sbal_0355      -1       14   3.119169  guanylate kinase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_0325  SACTRNSFRASE  33     2e-04    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 33.4 bits (76), Expect = 2e-04
Identities = 21/78 (26%), Positives = 35/78 (44%), Gaps = 2/78 (2%)

Query: 67 NNLAGCGALKWLDAEHAEIKSMRTAAPYKQQGIASKILQHLINDAKTAGVKRLSLETGSM 126
NN G ++ +A I+ + A Y+++G+ + +L I AK L LET +
Sbjct: 74 NNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDI 133

Query: 127 DFFNPARLLYSKFGFEIC 144
+ A Y+K F I
Sbjct: 134 NI--SACHFYAKHHFIIG 149


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_0328  DHBDHDRGNASE  105    2e-29    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 105 bits (262), Expect = 2e-29
Identities = 68/248 (27%), Positives = 113/248 (45%), Gaps = 15/248 (6%)

Query: 5 VLVTGSSRGIGKAIALKLAQAGFDIALHYHSNQTAADDTATQIRALGVNVSLLKFDVADR 64
+TG+++GIG+A+A LA G IA N + + ++A + DV D
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIA-AVDYNPEKLEKVVSSLKAEARHAEAFPADVRDS 69

Query: 65 ATVKAAIEADIEANGAYYGVILNAGINRDTAFPAMTESEWDSVIHTNLDGFYNVIHPCVM 124
A + G ++ AG+ R ++++ EW++ N G +N V
Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS-VS 128

Query: 125 PMVQGRKGGRIITLASVSGIAGNRGQVNYSASKAGIIGATKALSLELAKRKITVNCIAPG 184
+ R+ G I+T+ S Y++SKA + TK L LELA+ I N ++PG
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 185 LIETDM----------VADIPKDMVEQL---VPMRRMGKPNEIAALAAFLMSDDAAYITR 231
ETDM + K +E +P++++ KP++IA FL+S A +IT
Sbjct: 189 STETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHITM 248

Query: 232 QVISVNGG 239
+ V+GG
Sbjct: 249 HNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_0333  ACRIFLAVINRP  35     0.001    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 35.2 bits (81), Expect = 0.001
Identities = 27/151 (17%), Positives = 51/151 (33%), Gaps = 21/151 (13%)

Query: 697 LLALALGIALLLFSLNFGFKKAAVVVAVPALAALLTLATLGLTGSPLSLFHALALILVFG 756
A+ L ++ L +AVP + L T A L G ++ ++L G
Sbjct: 345 FEAIMLVFLVMYLFLQNMRATLIPTIAVP-VVLLGTFAILAAFGYSINTLTMFGMVLAIG 403

Query: 757 IGIDYSL----------------FFASAQNHGKAVMMAVFMSACSTLLAFGLLAFSQTQA 800
+ +D ++ + + + A+ A F +AF
Sbjct: 404 LLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGST 463

Query: 801 ---IHYFGLTLSLGIGFTFLLSPLILTTTLA 828
F +T+ + + L++ LILT L
Sbjct: 464 GAIYRQFSITIVSAMALSVLVA-LILTPALC 493



Score = 35.2 bits (81), Expect = 0.001
Identities = 27/155 (17%), Positives = 60/155 (38%), Gaps = 23/155 (14%)

Query: 691 RLLTLKLLALALGIALLLFSLNFGFKKAAVVVAVPALAALLTLATLGLTGSPLSLFHALA 750
+ L ++ + + L L +L + V+ V L + L L ++ +
Sbjct: 871 QAPALVAISFVV-VFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVG 929

Query: 751 LILVFGIG-------IDYSLFFASAQNHG--KAVMMA-------VFMSACSTLLAFGLLA 794
L+ G+ ++++ + G +A +MA + M++ + +L LA
Sbjct: 930 LLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLA 989

Query: 795 FSQTQAIHYFGLTLSLGIGFTFLLSPLILTTTLAL 829
S G ++GIG ++ ++ T LA+
Sbjct: 990 ISNGAGS---GAQNAVGIG---VMGGMVSATLLAI 1018



Score = 31.0 bits (70), Expect = 0.028
Identities = 40/243 (16%), Positives = 83/243 (34%), Gaps = 37/243 (15%)

Query: 276 LGLASLLGVIALVWLAFRSVMPLLLAIVTISSGLLLAVTFTLSVFGELHLLTLVFGTSLI 335
L A +L V +++L +++ L+ + + LL + ++ LT+ I
Sbjct: 344 LFEAIML-VFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAI 402

Query: 336 GIAIDYSFHFY--CERLSDSERSAKATVAYI------FPTVTLAFITSALAYVGIGLAPF 387
G+ +D + ER+ ++ V +A + SA V I +A F
Sbjct: 403 GLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSA---VFIPMAFF 459

Query: 388 PG-----MQQVAIFCAAGLLGAYLTLI-----LAYPLLASSRLPEGSRPLALAGTYLASL 437
G +Q +I + + + L + L LL G + +
Sbjct: 460 GGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTF 519

Query: 438 TQLSKRFTTPLG-------------MGMFALVILVWCLVGVTKLTVDDD--IRHLQQSPA 482
+T +G + A +++++ + + L +D + Q PA
Sbjct: 520 DHSVNHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPA 579

Query: 483 SVT 485
T
Sbjct: 580 GAT 582


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_0346  SECA          41     2e-05    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 41.0 bits (96), Expect = 2e-05
Identities = 30/84 (35%), Positives = 39/84 (46%), Gaps = 8/84 (9%)

Query: 294 MRLVQGDV-----GSGKTLVAAMAA-LQAIENGYQVAMMAPTELLAEQHATNFAAWFEPL 347
M L + + G GKTL A + A L A+ G V ++ + LA++ A N FE L
Sbjct: 92 MVLNERCIAEMRTGEGKTLTATLPAYLNALT-GKGVHVVTVNDYLAQRDAENNRPLFEFL 150

Query: 348 GLKVGW-LAGKLKGKVRAQSLADI 370
GL VG L G R ADI
Sbjct: 151 GLTVGINLPGMPAPAKREAYAADI 174


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_0347  HTHFIS        76     1e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 76.4 bits (188), Expect = 1e-18
Identities = 29/117 (24%), Positives = 51/117 (43%), Gaps = 1/117 (0%)

Query: 2 KILLAEDQAMVRGALAALLTLAGGFNITQASDGDEALSLLKQQSFDLLLTDIEMPGRTGL 61
IL+A+D A +R L L+ AG +++ S+ + DL++TD+ MP
Sbjct: 5 TILVADDDAAIRTVLNQALSRAG-YDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 ELAAWLKDQHSQTKVVVITTFGRAGYIKRAIEAGVGGFLLKDAPSETLVNAIQQVMA 118
+L +K V+V++ +A E G +L K L+ I + +A
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_0348  PF06580       37     8e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 37.2 bits (86), Expect = 8e-05
Identities = 65/358 (18%), Positives = 117/358 (32%), Gaps = 53/358 (14%)

Query: 1 MISTHLQLERKLAWVYLINLVFYL---IPLAINAYPAWKIALSFAVLIPFIASYF-WAYK 56
M STH Q + + I Y A ++ F + I + AY+
Sbjct: 1 MASTHRQANKYYWYCQGIGWGVYTLTGFGFASLYGSPKLHSMIFNIAISLMGLVLTHAYR 60

Query: 57 CNQNSAYRPILMMVAIATAITPINPGSISLFTFAAFFIGF-FYPLRTCLLAIAALIGLLF 115
L M I + P A IG ++ T + + A I
Sbjct: 61 SFIKRQGWLKLNMGQIILRVLP-----------ACVVIGMVWFVANTSIWRLLAFINTKP 109

Query: 116 ALNEIYDFNSYYFPLYGSGLVLGVGMFG------VAERRRHQHKLKEQQSTQEISTLAAM 169
+ S F + + + FG + Q K+ ++ L A
Sbjct: 110 VAFTLPLALSIIFNVVVVTFMWSLLYFGWHFFKNYKQAEIDQWKMASMAQEAQLMALKAQ 169

Query: 170 VERERIARDLHDIMGHSLSSIALKAELAEKLLAKQEYQLATIQLNELGQIARESLSQIRH 229
+ + L++I + I A ++L L ++ R SL
Sbjct: 170 INPHFMFNALNNIR----ALILEDPTKAREMLTS------------LSELMRYSLRYSNA 213

Query: 230 TVSDYKHKV-LADSVTQLCKLLREKGISVELTGNIPKLPARMESQLGLIVTELVNNILRH 288
++ + DS QL + E + E N + ++ ++V LV N ++H
Sbjct: 214 RQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPP---MLVQTLVENGIKH 270

Query: 289 SGASQC------IIDFIQQADRLVVEVKDNGP----SKPIAEGNGLTGIRERLDSLGG 336
G +Q ++ + + +EV++ G + + G GL +RERL L G
Sbjct: 271 -GIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQMLYG 327


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_0353  PF07328       32     0.003    T-DNA border endonuclease VirD1
		>PF07328#T-DNA border endonuclease VirD1

Length = 144

Score = 31.9 bits (72), Expect = 0.003
Identities = 17/68 (25%), Positives = 32/68 (47%), Gaps = 4/68 (5%)

Query: 506 PEQIEKVI----RDTKHTTLDSLLADIGLGNAMSIVIAQRLIGDNLENQESRDGHMMPIR 561
P +++KVI + + D+ +A++GL ++ IA R IG +EN + +
Sbjct: 16 PARVDKVISVKMTEAELAEFDAQIAELGLNRNRALRIAARRIGGFVENDAKTVELLRDMS 75

Query: 562 GAEGMLVT 569
A + T
Sbjct: 76 RAIAGVAT 83


6     Sbal_0386  Sbal_0419  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias    %GCBias  Product
Sbal_0386      -1       15   3.193775  2-isopropylmalate synthase
Sbal_0387      -1       15   3.418507  3-isopropylmalate dehydrogenase
Sbal_0388      -1       14   3.078182  isopropylmalate isomerase large subunit
Sbal_0389      -1       13   2.177922  isopropylmalate isomerase small subunit
Sbal_0390      -2       13   2.138860  aromatic hydrocarbon degradation membrane
Sbal_0391      -1       13   2.755883  glycerol kinase
Sbal_0392       0       11   2.859370  hypothetical protein
Sbal_0393       0       10   3.039009  cell division protein MraZ
Sbal_0394       0       10   2.947909  S-adenosyl-methyltransferase MraW
Sbal_0395       1       11   3.123502  cell division protein FtsL
Sbal_0396       1       11   3.127791  peptidoglycan glycosyltransferase
Sbal_0397       1       13   3.208282  UDP-N-acetylmuramoylalanyl-D-glutamate--2,
Sbal_0398       1       14   3.124625  UDP-N-acetylmuramoylalanyl-D-glutamyl-2,
Sbal_0399       1       16   2.819340  phospho-N-acetylmuramoyl-pentapeptide-
Sbal_0400       2       15   3.230674  UDP-N-acetylmuramoyl-L-alanyl-D-glutamate
Sbal_0401       1       16   3.384686  cell division protein FtsW
Sbal_0402       1       14   3.034986  undecaprenyldiphospho-muramoylpentapeptide
Sbal_0403       1       14   2.529019  UDP-N-acetylmuramate--L-alanine ligase
Sbal_0404       0       15   1.593971  cell division protein FtsQ
Sbal_0405       4       19   1.306664  cell division protein FtsA
Sbal_0406       3       20   0.361669  cell division protein FtsZ
Sbal_0407       3       17  -0.478301  UDP-3-O-[3-hydroxymyristoyl] N-acetylglucosamine
Sbal_0408       2       17  -0.283854  hypothetical protein
Sbal_0409       2       15  -0.628083  peptidase M23B
Sbal_0410       1       14  -0.714323  preprotein translocase subunit SecA
Sbal_0411      -1       12  -0.683640  ***anti-RNA polymerase sigma 70 factor
Sbal_0412      -2       14  -0.537565  hypothetical protein
Sbal_0413       0       12   0.949927  uroporphyrinogen decarboxylase
Sbal_0414       0       11   1.483219  putative PAS/PAC sensor protein
Sbal_0415       0       13   2.444844  diguanylate cyclase/phosphodiesterase
Sbal_0416       0       13   3.019507  short chain dehydrogenase
Sbal_0417       0       13   3.227782  hypothetical protein
Sbal_0418       0       14   3.274893  hypothetical protein
Sbal_0419       0       16   3.633580  phosphoribosylamine--glycine ligase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_0405  SHAPEPROTEIN  68     8e-15    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein

signature.
Length = 347

Score = 68.2 bits (167), Expect = 8e-15
Identities = 51/221 (23%), Positives = 91/221 (41%), Gaps = 20/221 (9%)

Query: 150 SGMRMEAKVHIVTC----ANDMAKNITK-SVERCGLKVDDLVFSGIASADAVLTFDEKDL 204
S M ++ C A + + + S + G + L+ +A+A +
Sbjct: 100 SNSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEAT 159

Query: 205 GVCIVDIGGGTTDIAVYTNGALRHCAVVPVAGNQVTNDIAKIFR------TPSSHAEQIK 258
G +VDIGGGTT++AV + + + + V + G++ I R + AE+IK
Sbjct: 160 GSMVVDIGGGTTEVAVISLNGVVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIK 219

Query: 259 VQFACARSSMVSREDSIEVPS---VGGRPSR-SMSRHTLAEVVEPRYQELFELVLKELKD 314
+ A IEV G P +++ + + E ++ + V+ L+
Sbjct: 220 HEIGSAYPG--DEVREIEVRGRNLAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQ 277

Query: 315 SGLE---DQIAAGIVLTGGTASIQGVVDIAEATFGMPVRVA 352
E D G+VLTGG A ++ + + G+PV VA
Sbjct: 278 CPPELASDISERGMVLTGGGALLRNLDRLLMEETGIPVVVA 318


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_0410  SECA          1316   0.0      SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 1316 bits (3406), Expect = 0.0
Identities = 651/907 (71%), Positives = 762/907 (84%), Gaps = 7/907 (0%)

Query: 1 MFGKLLTKVFGSRNDRTLKGLQKIVISINALEADYEKLTDEALKAKTAEFRERLAAGASL 60
M KLLTKVFGSRNDRTL+ ++K+V INA+E + EKL+DE LK KTAEFR RL G L
Sbjct: 1 MLIKLLTKVFGSRNDRTLRRMRKVVNIINAMEPEMEKLSDEELKGKTAEFRARLEKGEVL 60

Query: 61 DSIMAEAFATVREASKRVFDMRHFDVQLLGGMVLDSNRIAEMRTGEGKTLTATLPAYLNA 120
++++ EAFA VREASKRVF MRHFDVQLLGGMVL+ IAEMRTGEGKTLTATLPAYLNA
Sbjct: 61 ENLIPEAFAVVREASKRVFGMRHFDVQLLGGMVLNERCIAEMRTGEGKTLTATLPAYLNA 120

Query: 121 LTGKGVHVITVNDYLARRDAENNRPLFEFLGLTVGINVAGLGQHEKKAAYNADITYGTNN 180
LTGKGVHV+TVNDYLA+RDAENNRPLFEFLGLTVGIN+ G+ K+ AY ADITYGTNN
Sbjct: 121 LTGKGVHVVTVNDYLAQRDAENNRPLFEFLGLTVGINLPGMPAPAKREAYAADITYGTNN 180

Query: 181 EFGFDYLRDNMAFSPQERVQRPLHYALIDEVDSILIDEARTPLIISGAAEDSSELYIKIN 240
E+GFDYLRDNMAFSP+ERVQR LHYAL+DEVDSILIDEARTPLIISG AEDSSE+Y ++N
Sbjct: 181 EYGFDYLRDNMAFSPEERVQRKLHYALVDEVDSILIDEARTPLIISGPAEDSSEMYKRVN 240

Query: 241 TLIPNLIRQDKEDTEEYVGEGDYSIDEKAKQVHFTERGQEKVENLLIERGMLAEGDSLYS 300
+IP+LIRQ+KED+E + GEG +S+DEK++QV+ TERG +E LL++ G++ EG+SLYS
Sbjct: 241 KIIPHLIRQEKEDSETFQGEGHFSVDEKSRQVNLTERGLVLIEELLVKEGIMDEGESLYS 300

Query: 301 AANISLLHHVNAALRAHTLFERDVDYIVQDNEVIIVDEHTGRTMPGRRWSEGLHQAVEAK 360
ANI L+HHV AALRAH LF RDVDYIV+D EVIIVDEHTGRTM GRRWS+GLHQAVEAK
Sbjct: 301 PANIMLMHHVTAALRAHALFTRDVDYIVKDGEVIIVDEHTGRTMQGRRWSDGLHQAVEAK 360

Query: 361 EGVHIQNENQTLASITFQNYFRQYEKLAGMTGTADTEAFEFQHIYGLDTVVVPTNRPMVR 420
EGV IQNENQTLASITFQNYFR YEKLAGMTGTADTEAFEF IY LDTVVVPTNRPM+R
Sbjct: 361 EGVQIQNENQTLASITFQNYFRLYEKLAGMTGTADTEAFEFSSIYKLDTVVVPTNRPMIR 420

Query: 421 KDMADLVYLTADEKYQAIIKDIKDCRERGQPVLVGTVSIEQSELLARLMVQEKIPHEVLN 480
KD+ DLVY+T EK QAII+DIK+ +GQPVLVGT+SIE+SEL++ + + I H VLN
Sbjct: 421 KDLPDLVYMTEAEKIQAIIEDIKERTAKGQPVLVGTISIEKSELVSNELTKAGIKHNVLN 480

Query: 481 AKFHEREAEIVAQAGRTGSVTIATNMAGRGTDIVLGGNWNMEIDELDNPTAEQKAKIKAD 540
AKFH EA IVAQAG +VTIATNMAGRGTDIVLGG+W E+ L+NPTAEQ KIKAD
Sbjct: 481 AKFHANEAAIVAQAGYPAAVTIATNMAGRGTDIVLGGSWQAEVAALENPTAEQIEKIKAD 540

Query: 541 WQIRHDEVVAAGGLHILGTERHESRRIDNQLRGRAGRQGDAGSSRFYLSMEDSLMRIFAS 600
WQ+RHD V+ AGGLHI+GTERHESRRIDNQLRGR+GRQGDAGSSRFYLSMED+LMRIFAS
Sbjct: 541 WQVRHDAVLEAGGLHIIGTERHESRRIDNQLRGRSGRQGDAGSSRFYLSMEDALMRIFAS 600

Query: 601 DRVSGMMKKLGMEEGEAIEHPWVSRAIENAQRKVEARNFDIRKQLLEFDDVANDQRQVVY 660
DRVSGMM+KLGM+ GEAIEHPWV++AI NAQRKVE+RNFDIRKQLLE+DDVANDQR+ +Y
Sbjct: 601 DRVSGMMRKLGMKPGEAIEHPWVTKAIANAQRKVESRNFDIRKQLLEYDDVANDQRRAIY 660

Query: 661 AQRNELMDAESIEDTIQNIQDDVIGAVIDQYIPPQSVEELWDIPGLEQRLHQEFMLKLPI 720
+QRNEL+D + +TI +I++DV A ID YIPPQS+EE+WDIPGL++RL +F L LPI
Sbjct: 661 SQRNELLDVSDVSETINSIREDVFKATIDAYIPPQSLEEMWDIPGLQERLKNDFDLDLPI 720

Query: 721 QEWLDKEDDLHEESLRERIITAWGDAYKAKEEMVGAQVLRQFEKAVMLQTLDGLWKEHLA 780
EWLDKE +LHEE+LRERI+ + Y+ KEE+VGA+++R FEK VMLQTLD LWKEHLA
Sbjct: 721 AEWLDKEPELHEETLRERILAQSIEVYQRKEEVVGAEMMRHFEKGVMLQTLDSLWKEHLA 780

Query: 781 AMDHLRQGIHLRGYAQKNPKQEYKRESFELFQQLLNTLKHDVISVLSKVQVQAQSDVEEM 840
AMD+LRQGIHLRGYAQK+PKQEYKRESF +F +L +LK++VIS LSKVQV+ +VEE+
Sbjct: 781 AMDYLRQGIHLRGYAQKDPKQEYKRESFSMFAAMLESLKYEVISTLSKVQVRMPEEVEEL 840

Query: 841 EARRREEDAKIQRDYQHAAAESLVGSSDEHEAVTAQAPMIRDGEKVGRNDPCPCGSGRKY 900
E +RR E ++ + Q + + D+ A A KVGRNDPCPCGSG+KY
Sbjct: 841 EQQRRMEAERLAQ-MQQLSHQ------DDDSAAAAALAAQTGERKVGRNDPCPCGSGKKY 893

Query: 901 KQCHGKL 907
KQCHG+L
Sbjct: 894 KQCHGRL 900


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_0416  DHBDHDRGNASE  73     2e-17    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 73.2 bits (179), Expect = 2e-17
Identities = 48/184 (26%), Positives = 80/184 (43%), Gaps = 2/184 (1%)

Query: 3 GLTGKVVIITGASEGIGRALAVAMARMGCQLVISARNETRLASLALEIANYGLPPFVFAA 62
G+ GK+ ITGA++GIG A+A +A G + N +L + + F A
Sbjct: 5 GIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPA 64

Query: 63 DVSRAEQCEALIEATVAHYGHLDILINNAGMTMWSRFDELTQLSVLEDIMRVNYLGPAYL 122
DV + + + G +DIL+N AG+ L+ E VN G
Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEW-EATFSVNSTGVFNA 123

Query: 123 THAALPHLKASK-GQVVVVASVAGLTGVPTRSGYAASKHAVIGFFDSLRIELADDNVAVT 181
+ + ++ + G +V V S + + YA+SK A + F L +ELA+ N+
Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183

Query: 182 VICP 185
++ P
Sbjct: 184 IVSP 187


7     Sbal_0452  Sbal_0478  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias    %GCBias  Product
Sbal_0452       0       21  -3.132226  flavoprotein oxygenase
Sbal_0453       1       21  -4.112273  hypothetical protein
Sbal_0454       0       23  -2.807659  transposase IS3/IS911 family protein
Sbal_0455      -2       23  -3.018776  integrase catalytic subunit
Sbal_0456       0       24  -3.782857  hypothetical protein
Sbal_0458      -1       18  -1.866003  flavoprotein oxygenase
Sbal_0459      -1       15  -0.866142  transposase IS3/IS911 family protein
Sbal_0460      -1       13   0.236431  integrase catalytic subunit
Sbal_0461      -1       13   1.671807  transposase, IS4 family protein
Sbal_0463       0       15   3.039891  hypothetical protein
Sbal_0464       1       20   4.313955  FMN reductase
Sbal_0465       0       18   4.519701  UbiD family decarboxylase
Sbal_0466      -1       18   4.730687  hypothetical protein
Sbal_0467       1       20   4.206824  major facilitator superfamily transporter
Sbal_0468       2       21   3.029940  short-chain dehydrogenase/reductase SDR
Sbal_0469       2       21   1.768437  acetyl-CoA carboxylase, biotin carboxyl carrier
Sbal_0470       3       22   2.680517  3-dehydroquinate dehydratase
Sbal_0471       4       21   3.045641  peptidyl-tRNA hydrolase domain-containing
Sbal_0472       2       20   4.486501  hypothetical protein
Sbal_0473       1       23   5.724685  hypothetical protein
Sbal_0474       1       23   5.657716  hypothetical protein
Sbal_0475       0       23   5.541582  outer membrane efflux protein
Sbal_0476       0       22   4.918238  outer membrane efflux family protein
Sbal_0477      -1       20   4.566645  RND family efflux transporter MFP subunit
Sbal_0478      -1       21   4.106839  CzcA family heavy metal efflux protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_0467  TCRTETA       54     3e-10    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 54.4 bits (131), Expect = 3e-10
Identities = 62/338 (18%), Positives = 113/338 (33%), Gaps = 23/338 (6%)

Query: 40 MTLVPYIASDLGVD---VAHVSYAISAYALGVVVGSPIIMVLAVRVRRRTLLIALAALMA 96
M ++P + DL AH ++ YAL +P++ L+ R RR +L+ A A
Sbjct: 25 MPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAA 84

Query: 97 VANGLSALAPSLNWLIFFRFLSGLPHGAYFGVAMLLAASLVPPEMKARAVSRVIIGLTLA 156
V + A AP L L R ++G+ GA VA A + + +AR +
Sbjct: 85 VDYAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAYIADITDGDERARHFGFMSACFGFG 143

Query: 157 TIIGVPFATWMGQTVGWRSGIGIVAILATITAVMVYFLAPDQAVAADASPRKELQ----- 211
+ G MG + A L + + FL P+ + + P +
Sbjct: 144 MVAGPVLGGLMGG-FSPHAPFFAAAALNGLNFLTGCFLLPE-SHKGERRPLRREALNPLA 201

Query: 212 ------TLKNREVWLTLGIAAIGFGGIFCVYTYLAETLIQVTQVEPFKIPIMMAVFGI-G 264
+ + + G + + I I +A FGI
Sbjct: 202 SFRWARGMTVVAALMAVFFIMQLVGQVPA--ALWVIFGEDRFHWDATTIGISLAAFGILH 259

Query: 265 ATLGTLVCGWAADK-SALAAAFWSLVLSTVVLAIYPSLTGHYWALMPV-VFFVGCGLGLA 322
+ ++ G A + A ++ I + W P+ V G+G+
Sbjct: 260 SLAQAMITGPVAARLGERRALMLGMIADGTGY-ILLAFATRGWMAFPIMVLLASGGIGMP 318

Query: 323 TIVQARLMDVAPDGQAMTGALVQCAFNLANAIGPWVGS 360
+ V + Q + +L + +GP + +
Sbjct: 319 ALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFT 356


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_0468  DHBDHDRGNASE  52     4e-10    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 51.6 bits (123), Expect = 4e-10
Identities = 41/183 (22%), Positives = 80/183 (43%), Gaps = 5/183 (2%)

Query: 2 ILITGASSGLGAAIASLYAKENEPLTLTGRNAERLQTVANALTPFSNKPIAAITADLASE 61
ITGA+ G+G A+A A + + N E+L+ V ++L + A AD+
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHA-EAFPADVRDS 69

Query: 62 SSLEALFDGL---TQAPKTVIHCAGSGYFGAIETQGASDIHSLLNNNVTSTILLVRELVK 118
++++ + + +++ AG G I + + + + N T R + K
Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSK 129

Query: 119 RYKDQ-TVTVVVVMSTAALAAKAGESTYCAAKWAVRGFIESVRLELKQSPMKLIAVYPGG 177
D+ + ++V V S A + + Y ++K A F + + LEL + ++ V PG
Sbjct: 130 YMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGS 189

Query: 178 MDT 180
+T
Sbjct: 190 TET 192


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_0469  RTXTOXIND     28     0.015    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 27.9 bits (62), Expect = 0.015
Identities = 8/29 (27%), Positives = 13/29 (44%)

Query: 120 IQAERDGVVSAIWAKDGDEVAFDQPLFTL 148
I+ + +V I K+G+ V L L
Sbjct: 99 IKPIENSIVKEIIVKEGESVRKGDVLLKL 127


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_0477  RTXTOXIND     54     4e-10    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 54.4 bits (131), Expect = 4e-10
Identities = 36/182 (19%), Positives = 64/182 (35%), Gaps = 22/182 (12%)

Query: 126 RATATLVVDRDRTATLAPQLDVRVLARHVVPGQEVKKGEPLLTLGGSAVAQAQADYINAA 185
R V++ R + L + +A+H V QE + +N
Sbjct: 225 RYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQE----------------NKYVEAVNEL 268

Query: 186 AEWSRVKRMSEGAVSVSRRMQAQVDAELKRAILEAIKMTPAQIRTLE----STPEAIGSY 241
+ E + ++ V K IL+ ++ T I L E +
Sbjct: 269 RVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQAS 328

Query: 242 QLLAPIDGRVQQ-DIAMLGQVFSAGTPLMQLT-DESYLWVEAQLTPTQTAHITVGSAALV 299
+ AP+ +VQQ + G V + LM + ++ L V A + I VG A++
Sbjct: 329 VIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAII 388

Query: 300 QV 301
+V
Sbjct: 389 KV 390



Score = 38.3 bits (89), Expect = 5e-05
Identities = 24/148 (16%), Positives = 53/148 (35%), Gaps = 5/148 (3%)

Query: 118 IANLNLDIRATATLVVDRDRTATLAPQLDVRVLARHVVPGQEVKKGEPLLTLGG----SA 173
+ + + A L R+ + P + V V G+ V+KG+ LL L +
Sbjct: 77 LGQVEIVATANGKLTHS-GRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEAD 135

Query: 174 VAQAQADYINAAAEWSRVKRMSEGAVSVSRRMQAQVDAELKRAILEAIKMTPAQIRTLES 233
+ Q+ + A E +R + +S D + + E + + +
Sbjct: 136 TLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQF 195

Query: 234 TPEAIGSYQLLAPIDGRVQQDIAMLGQV 261
+ YQ +D + + + +L ++
Sbjct: 196 STWQNQKYQKELNLDKKRAERLTVLARI 223


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_0478  ACRIFLAVINRP  659    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 659 bits (1702), Expect = 0.0
Identities = 224/1075 (20%), Positives = 434/1075 (40%), Gaps = 73/1075 (6%)

Query: 9 AIKNRLLVVLALLAMIVASVVMLPKLNLDAFPDVTNVQVTINTAAEGLAAEEVEKLISYP 68
I+ + + + +++A + + +L + +P + V+++ G A+ V+ ++
Sbjct: 5 FIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQV 64

Query: 69 VESAMYALPAVTEVRSLS-RTGLSIVTVVFAEGTDIYFARQQVFEQLQAAREMIPSGVGV 127
+E M + + + S S G +T+ F GTD A+ QV +LQ A ++P V
Sbjct: 65 IEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQQ 124

Query: 128 PEIGPNTSGLGQIYQYILRAEPNSGIDAAELRSLNDYLVKLIMMPVGGVTEVLSFGGDVR 187
I S + ++ N G ++ VK + + GV +V FG
Sbjct: 125 QGISVEKSSSSYLMVAGFVSD-NPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQ-Y 182

Query: 188 QYQVQVDPNKLRAYGLSMAQVTEALESNNRNAGGWFMDQGQE------QLVVRGYGMLPA 241
++ +D + L Y L+ V L+ N + +
Sbjct: 183 AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKN 242

Query: 242 GEQGLAAIAQIPLTEDK-GTPVRVGDIAQVDFGSEIRVGAVTMTRRDEAGKVQNLGEVVA 300
E+ ++ L + G+ VR+ D+A+V+ G E + N
Sbjct: 243 PEE----FGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARI----------NGKPAAG 288

Query: 301 GVVLKRMGANTKATIDDIGARVSLIEQALPDGVSFEVFYDQAELVDKAVTTVRDALLMAF 360
+ GAN T I A+++ ++ P G+ YD V ++ V L A
Sbjct: 289 LGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAI 348

Query: 361 VFIVVILALFLVNIRATLLVLLSIPVSIGLALMVMSYYGLSANLMSLGGLAVAIGMLVDG 420
+ + +++ LFL N+RATL+ +++PV + +++ +G S N +++ G+ +AIG+LVD
Sbjct: 349 MLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDD 408

Query: 421 SVVMVENIFKHLTQPDRRHLLEARTRADGEADPYHSDEDGGQQANMAVRIMLAAKEVCSP 480
++V+VEN+ + + ED + M ++
Sbjct: 409 AIVVVENVERVMM------------------------EDKLPPKEATEKSM---SQIQGA 441

Query: 481 IFFATAIIIVVFAPLFALEGVEGKLFQPMAVSIILAMISALLVALIAVPALAVYLFK--- 537
+ ++ VF P+ G G +++ +++I+ AM ++LVALI PAL L K
Sbjct: 442 LVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVS 501

Query: 538 -------RGVVLKQSVVLGPLDAAYRKLLTATLARPKVVMLSALLMFALSLLLLPRLGTE 590
G + Y + L +L L+ A ++L RL +
Sbjct: 502 AEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSS 561

Query: 591 FVPELEEGTINLRVTLAPTASLGTSLAVAPKLEAILLEFPEVEYALSRIGAPELGGDPEP 650
F+PE ++G + L A+ + V ++ L+ + S +
Sbjct: 562 FLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNEKANVE-SVFTVNGFSFSGQA 620

Query: 651 VSNIEVYIGLKPISEWQSASSRLE--LQRLMEEKLSVFPGLLLTFSQPIATRVDELLSGV 708
+ ++ LKP E + E + R E + G ++ F+ P + EL +
Sbjct: 621 QNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGFVIPFNMP---AIVELGTAT 677

Query: 709 KAQLA-IKIFGPDLAVLSERGQALTDLVAKIPGAV-DVSLEQVSGEAQLVVRPKRELLAR 766
I G L++ L + A+ P ++ V + AQ + +E
Sbjct: 678 GFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQA 737

Query: 767 YGISVDQVMSLVSQGIGGGSAGQVIDGNARYDINVRLAAEFRTSPDAIKDLLLSGTNGAT 826
G+S+ + +S +GG ID + V+ A+FR P+ + L + NG
Sbjct: 738 LGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEM 797

Query: 827 VRLGEVASVEVEMAPPNIRRDDVQRRVVVQANVA-GRDMGSVVKDIYALVPQADLPAGYT 885
V + P + R + + +Q A G G + + L + LPAG
Sbjct: 798 VPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAMALMENLASK--LPAGIG 855

Query: 886 VIIGGQYENQQRAQQKLMLVVPISIALIALLLYFSFGSFKQVLLIMANVPLALIGGIVAL 945
G ++ + + +V IS ++ L L + S+ + +M VPL ++G ++A
Sbjct: 856 YDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAA 915

Query: 946 YVSGTYLSVPSSIGFITLFGVAVLNGVVLVDSINQ-RRQSGEALYDCVYEGTVGRLRPVL 1004
+ V +G +T G++ N +++V+ + G+ + + RLRP+L
Sbjct: 916 TLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPIL 975

Query: 1005 MTALTSALGLIPILLSSGVGSEIQKPLAVVIIGGLFSSTALTLLVLPTLYRWLYR 1059
MT+L LG++P+ +S+G GS Q + + ++GG+ S+T L + +P + + R
Sbjct: 976 MTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRR 1030



Score = 113 bits (285), Expect = 1e-27
Identities = 81/544 (14%), Positives = 184/544 (33%), Gaps = 61/544 (11%)

Query: 10 IKNRLLVVLALLAMIVASVVMLPKLNLDAFPDVTNVQVTIN-TAAEGLAAEEVEKLI--- 65
+ + +L ++ VV+ +L P+ G E +K++
Sbjct: 534 LGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQV 593

Query: 66 -SYPVESAMYALPAVTEVRSLSRTGLS----IVTVVFAEGTDIYFARQQVFEQLQAAREM 120
Y +++ + +V V S +G + + V + + A+
Sbjct: 594 TDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKME 653

Query: 121 ---IPSGVGVPEIGPNTSGLGQIYQYILRAEPNSGIDAAELRSLNDYLVKLIMMPVGGVT 177
I G +P P LG + +G+ L + L+ + +
Sbjct: 654 LGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLV 713

Query: 178 EV-LSFGGDVRQYQVQVDPNKLRAYGLSMAQVTEALES--NNRNAGGWFMDQGQEQLVVR 234
V + D Q++++VD K +A G+S++ + + + + + ++L V+
Sbjct: 714 SVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQ 773

Query: 235 GYGMLPAGEQGLAAIAQIPLTEDKGTPVRVGDIAQVDFGSEIRVGAVTMTRRDEAGKVQN 294
+ ++ + G V + G+ + R + ++
Sbjct: 774 AD---AKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVY----GSPRLERYNGLPSMEI 826

Query: 295 LGEVVAGVVLKRMGANTKATIDDIGARVSLIEQALPDGVSFEVFYDQAELVDKAVTTVRD 354
GE G D A + + LP G+ ++ + + +
Sbjct: 827 QGEAAPGTSS-----------GDAMALMENLASKLPAGIGYD-WTGMSYQERLSGNQAPA 874

Query: 355 ALLMAFVFIVVILALFLVNIRATLLVLLSIPVSIGLALMVMSYYGLSANLMSLGGLAVAI 414
+ ++FV + + LA + + V+L +P+ I L+ + + ++ + GL I
Sbjct: 875 LVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTI 934

Query: 415 GMLVDGSVVMVENIFKHLTQPDRRHLLEARTRADGEADPYHSDEDGGQQANMAVRIMLAA 474
G+ ++++VE + L+E + EA ++A
Sbjct: 935 GLSAKNAILIVEFA---------KDLMEKEGKGVVEA------------------TLMAV 967

Query: 475 KEVCSPIFFATAIIIVVFAPLFALEGVEGKLFQPMAVSIILAMISALLVALIAVPALAVY 534
+ PI + I+ PL G + + ++ M+SA L+A+ VP V
Sbjct: 968 RMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVV 1027

Query: 535 LFKR 538
+ +
Sbjct: 1028 IRRC 1031



Score = 102 bits (257), Expect = 3e-24
Identities = 88/516 (17%), Positives = 188/516 (36%), Gaps = 38/516 (7%)

Query: 565 RPKVVMLSALLMFALSLLLLPRLGTEFVPELEEGTINLRVTLAPTASLGT-SLAVAPKLE 623
RP + A+++ L + +L P + +++ P A T V +E
Sbjct: 8 RPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSAN-YPGADAQTVQDTVTQVIE 66

Query: 624 AILLEFPEVEYALSRIGAPELGGDPEPVSNIEVYIGLKPISEWQSASSRLELQRLMEEKL 683
+ + Y S + ++ + + + ++ A Q ++ KL
Sbjct: 67 QNMNGIDNLMYMSST---------SDSAGSVTITLTFQSGTDPDIA------QVQVQNKL 111

Query: 684 SVFPGLLLTFSQPIATRVDELLSGVKAQLAIKIFGPDLAVLSERGQALT---DLVAKIPG 740
+ LL Q V++ S P + D ++++ G
Sbjct: 112 QLATPLLPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNG 171

Query: 741 AVDVSLEQVSGEAQLVVRPKRELLARYGISVDQVMSLVS---QGIGGGS--AGQVIDGNA 795
DV L + + + +LL +Y ++ V++ + I G + G
Sbjct: 172 VGDVQL--FGAQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQ- 228

Query: 796 RYDINVRLAAEFRTSPDAIKDLLLSGTNGATVRLGEVASVEVEMAPPNIR-RDDVQRRVV 854
+ + ++ F+ + K L ++G+ VRL +VA VE+ N+ R + +
Sbjct: 229 QLNASIIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAG 288

Query: 855 VQANVA-GRDMGSVVKDIYALVP--QADLPAGYTVIIGGQYENQQRAQQKLMLVVP---I 908
+ +A G + K I A + Q P G V+ Y+ Q + VV
Sbjct: 289 LGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLY--PYDTTPFVQLSIHEVVKTLFE 346

Query: 909 SIALIALLLYFSFGSFKQVLLIMANVPLALIGGIVALYVSGTYLSVPSSIGFITLFGVAV 968
+I L+ L++Y + + L+ VP+ L+G L G ++ + G + G+ V
Sbjct: 347 AIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLV 406

Query: 969 LNGVVLVDSINQRRQS-GEALYDCVYEGTVGRLRPVLMTALTSALGLIPILLSSGVGSEI 1027
+ +V+V+++ + + + ++ A+ + IP+ G I
Sbjct: 407 DDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAI 466

Query: 1028 QKPLAVVIIGGLFSSTALTLLVLPTLYRWLYRGDKR 1063
+ ++ I+ + S + L++ P L L +
Sbjct: 467 YRQFSITIVSAMALSVLVALILTPALCATLLKPVSA 502


8  Sbal_0557  Sbal_0562  Y  N  N  Genomic Island
LocusTag   DNBias  CDNBias  %GCBias    Product
Sbal_0557  3       21       -0.683811  tRNA delta(2)-isopentenylpyrophosphate
Sbal_0558  4       25       -1.000392  RNA chaperone Hfq
Sbal_0559  5       25       -1.291478  HSR1-like GTP-binding protein
Sbal_0560  5       30       -1.539248  HflK protein
Sbal_0561  6       32       -1.818993  HflC protein
Sbal_0562  4       29       -0.816444  ubiquinol-cytochrome c reductase, iron-sulfur
9  Sbal_0580  Sbal_0615  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Sbal_0580  -1      16       3.015393   2'-5' RNA ligase
Sbal_0581  -1      17       3.048021   diguanylate cyclase/phosphodiesterase
Sbal_0582  0       17       3.061417   ATP-dependent helicase HrpB
Sbal_0583  1       15       0.713986   penicillin-binding protein 1B
Sbal_0584  0       16       1.665278   PpiC-type peptidyl-prolyl cis-trans isomerase
Sbal_0585  0       17       2.309800   hypothetical protein
Sbal_0586  -1      18       2.765244   hypothetical protein
Sbal_0587  -1      16       3.406893   hypothetical protein
Sbal_0588  -1      17       3.905919   methyl-accepting chemotaxis sensory transducer
Sbal_0589  0       20       4.508189   molydopterin dinucleotide-binding region
Sbal_0590  1       22       4.461088   polysulfide reductase NrfD
Sbal_0591  2       22       4.309398   4Fe-4S ferredoxin
Sbal_0592  3       21       3.747161   integral membrane sensor signal transduction
Sbal_0593  4       21       3.044366   response regulator receiver protein
Sbal_0594  4       18       2.260048   hypothetical protein
Sbal_0595  1       10       -0.455106  hypothetical protein
Sbal_0596  0       11       -0.814642  hypothetical protein
Sbal_0597  -1      11       -0.620383  hypothetical protein
Sbal_0598  0       11       -0.053605  sodium:dicarboxylate symporter
Sbal_0599  0       12       0.062912   IstB ATP binding domain-containing protein
Sbal_0600  0       11       0.282251   integrase catalytic subunit
Sbal_0601  1       14       2.152721   extracellular solute-binding protein
Sbal_0602  2       13       2.450139   binding-protein-dependent transport system inner
Sbal_0603  0       12       1.977803   ABC transporter-like protein
Sbal_0604  -1      11       1.534685   gamma-glutamyltransferase
Sbal_0605  -1      16       1.352271   Dyp-type peroxidase family protein
Sbal_0606  0       17       2.306684   LysR family transcriptional regulator
Sbal_0607  0       18       1.967920   NAD-dependent epimerase/dehydratase
Sbal_0608  -1      18       1.370609   arginine repressor
Sbal_0609  0       20       1.584706   malate dehydrogenase
Sbal_0610  2       17       2.898853   putative thiol-disulfide oxidoreductase DCC
Sbal_0611  3       18       3.925805   short chain dehydrogenase
Sbal_0612  2       16       3.995401   5-formyltetrahydrofolate cyclo-ligase
Sbal_0613  1       18       3.938591   hypothetical protein
Sbal_0614  1       23       3.629036   yecA family protein
Sbal_0615  0       21       3.321558   2-polyprenyl-6-methoxyphenol 4-hydroxylase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Sbal_0593  HTHFIS  90     5e-23    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 89.5 bits (222), Expect = 5e-23
Identities = 31/150 (20%), Positives = 58/150 (38%), Gaps = 4/150 (2%)

Query: 7 VYLIDDDDSVRRSLRFMLESYGLKITDFDSAEAFFTAVDLTLPGCALVDVRMPGLSGPQL 66
+ + DDD ++R L L G + +A + + + DV MP + L
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 67 HLELVAKNSPLAVIYLTGHGDVPMAVEALKLGAVDFFQKPADGAKLAEAVLKALEHT--- 123
+ L V+ ++ A++A + GA D+ KP D +L + +AL
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRR 125

Query: 124 -KAHHQDNQYLETYQALTPREREILNLIAQ 152
D+Q + +EI ++A+
Sbjct: 126 PSKLEDDSQDGMPLVGRSAAMQEIYRVLAR 155


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
Sbal_0594  GPOSANCHOR  52     1e-08    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 52.0 bits (124), Expect = 1e-08
Identities = 55/320 (17%), Positives = 102/320 (31%), Gaps = 18/320 (5%)

Query: 599 EYAASEQELRIRLSKAEEAHTSAQEMQAEAESQLVAINGELDNLSRELTFARTAYKNSRD 658
EL LS A+E + +E S++ + +L + L A
Sbjct: 82 ALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSA 141

Query: 659 DLRRLFDEKRSEQDKINKALSDRKAHAGQRLTQLDGELKQLKHQHELWLEEQKEQALEAR 718
++ L EK + KA G K + E E ++ LE
Sbjct: 142 KIKTLEAEK---AALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKA 198

Query: 719 MEKQAYWQEVIGALDNQLGQIKATIEGRRESAKIEQKACETWYKNELKSRGVDEENILKL 778
+E + A L KA + R+ + + + + E L
Sbjct: 199 LEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAAL 258

Query: 779 KQQIRELETKISRAEQRRSDVLRFDDWYQHTWLMRKPKLQTQLADVKR----------AV 828
+ + ELE + A + T K L+ + AD++ ++
Sbjct: 259 EARQAELEKALEGAMNFSTADSA----KIKTLEAEKAALEAEKADLEHQSQVLNANRQSL 314

Query: 829 SEIDQQLKAKTLDVKTRRQQLETERKASDAAQVEASENLTKLRAVMRKLAELKLPSNNEE 888
+ ++ Q+LE + K S+A++ +L R ++L E + E+
Sbjct: 315 RRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQL-EAEHQKLEEQ 373

Query: 889 AQGSLGERLRQGEDLLLKRD 908
+ S R DL R+
Sbjct: 374 NKISEASRQSLRRDLDASRE 393



Score = 31.2 bits (70), Expect = 0.028
Identities = 49/347 (14%), Positives = 114/347 (32%), Gaps = 28/347 (8%)

Query: 360 WRTDVENLSERHKLQTEKHQDIEAAYNARRSKIGEQLNRELEGLHSDQDKQREARDKQRE 419
+ + + K+ D+ A + N EL S+ + + R +
Sbjct: 55 VQERADKFEIENNTLKLKNSDLSFNNKAL-----KDHNDELTEELSNAKE--KLRKNDKS 107

Query: 420 VARTDIDALELQWRNQMDAGKASFSEQEYQFKLTAAELKLRVDGVTYTEEEKLSLAIFDE 479
++ EL+ R D KA + +A L + + ++
Sbjct: 108 LSEKASKIQELEARKA-DLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADL----EK 162

Query: 480 RIHRADEEQESCNAKVERLSSDERKLRAKRDQANEALRIATLRVNERQTALDELHHMLFP 539
+ A + +AK++ L +++ L A++ + +AL A + L
Sbjct: 163 ALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAA 222

Query: 540 QSHTL--LEFLRKEAQGWEQSLGKVIAPELLHRTDLHPSLVTESSEAFFGVHLDLKAIDV 597
+ LE + A + + I + L +
Sbjct: 223 LAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKA-------------L 269

Query: 598 PEYAASEQELRIRLSKAEEAHTSAQEMQAEAESQLVAINGELDNLSRELTFARTAYKNSR 657
++ E + + +A+ E Q +N +L R+L +R A K
Sbjct: 270 EGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLE 329

Query: 658 DDLRRLFDEKRSEQDKINKALSDRKAHAGQRLTQLDGELKQLKHQHE 704
+ ++L ++ + + ++L + + QL+ E ++L+ Q++
Sbjct: 330 AEHQKLEEQNKI-SEASRQSLRRDLDASREAKKQLEAEHQKLEEQNK 375


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_0603  PF05272  32     0.005    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 31.6 bits (71), Expect = 0.005
Identities = 12/32 (37%), Positives = 16/32 (50%)

Query: 32 ILALLGPSGCGKTTLLRAVVGLQAISQGEIQI 63
+ L G G GK+TL+ +VGL S I
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHFDI 629


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_0607  NUCEPIMERASE  39     6e-06    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 39.4 bits (92), Expect = 6e-06
Identities = 30/123 (24%), Positives = 47/123 (38%), Gaps = 23/123 (18%)

Query: 1 MKIAILGATGWIGGAILKEALSRGHQVTAL-----VRDPS-------KLSATDVAVHAVD 48
MK + GA G+IG + K L GHQV + D S L+ H +D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 49 LE-QPLVAQTFA--GVDVVI-----AAVGGRAQQNHDLVASTV---QHLLDVLPNAKVPR 97
L + + FA + V AV + H S + ++L+ + K+
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 98 LLW 100
LL+
Sbjct: 121 LLY 123


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_0608  ARGREPRESSOR  145    1e-47    Bacterial arginine repressor signature.
		>ARGREPRESSOR#Bacterial arginine repressor signature.

Length = 149

Score = 145 bits (367), Expect = 1e-47
Identities = 43/150 (28%), Positives = 71/150 (47%), Gaps = 5/150 (3%)

Query: 6 NQDDLVRIFKSILKEERFGSQSEIVTALQAEGFGNINQSKVSRMLSKFGAVRTRNAKQEM 65
N+ + I+ +Q E+V L+ +G+ N+ Q+ VSR + + V+
Sbjct: 2 NKGQRHIKIREIITANEIETQDELVDILKKDGY-NVTQATVSRDIKELHLVKVPTNNGSY 60

Query: 66 VYCLPAELGVPTAGSPLKNLV---LDVDHNQAMIVVRTSPGAAQLIARLLDSIGKPEGIL 122
Y LPA+ ++L+ + +D +IV++T PG AQ I L+D++ E I+
Sbjct: 61 KYSLPADQRFNPLSKLKRSLMDAFVKIDSASHLIVLKTMPGNAQAIGALMDNLDWEE-IM 119

Query: 123 GTIAGDDTIFICPSSIQDIADTLETIKSLF 152
GTI GDDTI I + D + I L
Sbjct: 120 GTICGDDTILIICRTHDDTKVVQKKILELL 149


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_0611  DHBDHDRGNASE  48     8e-09    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 48.1 bits (114), Expect = 8e-09
Identities = 37/192 (19%), Positives = 71/192 (36%), Gaps = 22/192 (11%)

Query: 5 IIITGVGKRIGYALAKHFLAQGQQVIG-----TYRSHYDSIDELNALGATLYPCDFYDDT 59
ITG + IG A+A+ +QG + S + A A +P D D
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSA 70

Query: 60 QVQSLIDEL-TQLPQIRAIIHNASDWLPDPVLTKNEPLKSTTFAPSQVLQRMMQVHVSVP 118
+ + + ++ I +++ A P + + ++ TF+ V+ +
Sbjct: 71 AIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFS----------VNSTGV 120

Query: 119 YQLNLALEAQLRAAAGDEIGGSDVIHITDYVAEKGSQKHIAYAASKAALHNLTLSFAAKF 178
+ + ++ + D GS V + A AYA+SKAA T +
Sbjct: 121 FNASRSVSKYMM----DRRSGSIVT-VGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLEL 175

Query: 179 APE-VKVNSIAP 189
A ++ N ++P
Sbjct: 176 AEYNIRCNIVSP 187


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
Sbal_0612  OMS28PORIN  29     0.018    OMS28 porin signature.
		>OMS28PORIN#OMS28 porin signature.

Length = 257

Score = 28.6 bits (63), Expect = 0.018
Identities = 14/44 (31%), Positives = 26/44 (59%), Gaps = 1/44 (2%)

Query: 46 NRNQLRKSIRTARKSLSETEQIQASLSASQRMLDALLAQNAQHV 89
N++ K + ++ ++ EQ++ +L AS+R LD + Q AQ V
Sbjct: 166 NKSPNNKELELTKEEFAKVEQVKETLMASERALDETV-QEAQKV 208


10  Sbal_0722  Sbal_0728  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Sbal_0722  3       19       -2.550570  ABC transporter-like protein
Sbal_0723  4       23       -3.059874  ABC transporter
Sbal_0724  4       26       -3.148971  transposase, IS4 family protein
Sbal_0725  4       31       -4.169811  integrase catalytic subunit
Sbal_0726  3       30       -3.890653  transposase IS3/IS911 family protein
Sbal_0728  3       28       -3.265479  protease domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_0723  ABC2TRNSPORT  70     2e-16    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 70.0 bits (171), Expect = 2e-16
Identities = 48/215 (22%), Positives = 96/215 (44%), Gaps = 1/215 (0%)

Query: 37 LYFLIFGNLVGSRIGEMGGVSYMEFIAPGLIMMSVITNS-YSNVASSFYSAKFQRNLEEL 95
+Y G +G +G +GGVSY F+A G++ S +T + + + ++F + QR E +
Sbjct: 44 IYLFGLGAGLGVMVGRVGGVSYTAFLAAGMVATSAMTAATFETIYAAFGRMEGQRTWEAM 103

Query: 96 MVAPVPHYVLIAGYVGGGVARGLCVGLIVTLVAMFFVDISLHHAGLVVMTVFLTSVLFSL 155
+ + ++ G + + G + +VA + + LT + F+
Sbjct: 104 LYTQLRLGDIVLGEMAWAATKAALAGAGIGVVAAALGYTQWLSLLYALPVIALTGLAFAS 163

Query: 156 GGLINAVFAKSFDDISIIPTFVLTPLTYLGGVFYSLSLLPPFWQGVSALNPVVYMINVFR 215
G++ A S+D T V+TP+ +L G + + LP +Q + P+ + I++ R
Sbjct: 164 LGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQTAARFLPLSHSIDLIR 223

Query: 216 YGFLGFADISVPLSIAIMIGFCVALWTLAYYLVSR 250
LG + V + + + V + L+ L+ R
Sbjct: 224 PIMLGHPVVDVCQHVGALCIYIVIPFFLSTALLRR 258


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
Sbal_0728  SUBTILISIN  92     3e-22    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 92.2 bits (229), Expect = 3e-22
Identities = 44/159 (27%), Positives = 66/159 (41%), Gaps = 19/159 (11%)

Query: 8 GHGTHVAGTAVGNKVSTTFKEIPVELSGVAPAAYLMVYKALYSKADCTGGSGSNIMLMEA 67
GHGTHVAGT + GVAP A L++ K L GSG +++
Sbjct: 85 GHGTHVAGTIAATENENGV-------VGVAPEADLLIIKVLNK-----QGSGQYDWIIQG 132

Query: 68 LEHAVNDGADVINNSWGGGAGGDPASSPYKTMFEAAEAAGVVVVTAAGNDGNGPQTIGC- 126
+ +A+ D+I+ S GG + A A+ ++V+ AAGN+G+G
Sbjct: 133 IYYAIEQKVDIISMSLGGPEDVP----ELHEAVKKAVASQILVMCAAGNEGDGDDRTDEL 188

Query: 127 --PACIESGITVANTTTGRFFANSFNAGGDDLLAIPSSD 163
P C I+V R + N+ + L P D
Sbjct: 189 GYPGCYNEVISVGAINFDRHASEFSNSNNEVDLVAPGED 227



Score = 62.2 bits (151), Expect = 3e-12
Identities = 34/130 (26%), Positives = 51/130 (39%), Gaps = 25/130 (19%)

Query: 285 DNLNSTSSRGPDGNQNILKPDIAAPGTNILSAFSPDDGGEDFNMISGTSMASPHVAGAAA 344
+ + S+ + D+ APG +ILS G + SGTSMA+PHVAGA A
Sbjct: 207 RHASEFSNSNNE-------VDLVAPGEDILS----TVPGGKYATFSGTSMATPHVAGALA 255

Query: 345 LMSQL-----HPEWSANDIKTALTSTAKFEGILDDDAVTPATPFDMGAGRMDLDAAAKAV 399
L+ QL + + ++ L +P G G + L A +
Sbjct: 256 LIKQLANASFERDLTEPELYAQLIKRTI---------PLGNSPKMEGNGLLYLTAVEELS 306

Query: 400 LTFDKPSIAS 409
FD +A
Sbjct: 307 RIFDTQRVAG 316


11  Sbal_0769  Sbal_0778  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Sbal_0769  2       16       -0.175947  hypothetical protein
Sbal_0770  -1      13       -2.894262  hypothetical protein
Sbal_0771  -1      13       -3.472358  diguanylate cyclase
Sbal_0772  0       12       -4.321023  acetyltransferase
Sbal_0773  0       14       -4.948177  hypothetical protein
Sbal_0774  0       14       -5.186365  N-acetyltransferase GCN5
Sbal_0775  -1      14       -5.125591  HsdR family type I site-specific
Sbal_0776  -1      22       -5.666666  anticodon nuclease
Sbal_0777  -1      20       -4.527001  IstB ATP binding domain-containing protein
Sbal_0778  0       15       -3.361115  integrase catalytic subunit
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_0774  SACTRNSFRASE  38     3e-06    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 38.0 bits (88), Expect = 3e-06
Identities = 21/83 (25%), Positives = 32/83 (38%), Gaps = 4/83 (4%)

Query: 40 IARDENGKLIGGVGGRTIYKNFL-INVVWVDDQTRGTGLGHKLMALAEAEAKQRGCLVAQ 98
+ EN IG + R+ + + I + V R G+G L+ A AK+
Sbjct: 69 LYYLEN-NCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLM 127

Query: 99 VDTLSIQAPV--FYEKQGFEIIG 119
++T I FY K F I
Sbjct: 128 LETQDINISACHFYAKHHFIIGA 150


12  Sbal_0810  Sbal_0828  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Sbal_0810  0       21       -3.617327  glutathione S-transferase domain-containing
Sbal_0811  1       22       -4.547154  hypothetical protein
Sbal_0812  2       22       -4.901924  extracellular solute-binding protein
Sbal_0813  2       25       -4.872679  IstB ATP binding domain-containing protein
Sbal_0814  3       25       -5.301311  integrase catalytic subunit
Sbal_0815  1       20       -3.523042  anticodon nuclease
Sbal_0816  1       19       -3.355148  restriction modification system DNA specificity
Sbal_0817  3       19       -2.682747  hypothetical protein
Sbal_0818  3       17       -2.280258  integrase catalytic subunit
Sbal_0819  2       16       -2.643054  type I restriction enzyme EcoprrI specificity
Sbal_0820  2       17       -2.178063  hypothetical protein
Sbal_0821  0       17       -2.246441  type I restriction-modification system, M
Sbal_0822  0       22       -0.217248  hypothetical protein
Sbal_0823  1       18       0.426427   helix-turn-helix domain-containing protein
Sbal_0824  2       19       1.430650   phage integrase family protein
Sbal_0825  3       23       1.927446   hypothetical protein
Sbal_0826  3       24       1.709713   S-adenosylmethionine synthetase
Sbal_0827  3       19       2.004810   transketolase
Sbal_0828  2       15       1.849266   erythrose 4-phosphate dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_0824  PF08280  30     0.005    M protein trans-acting positive regulator
		>PF08280#M protein trans-acting positive regulator

Length = 530

Score = 30.2 bits (68), Expect = 0.005
Identities = 12/45 (26%), Positives = 16/45 (35%)

Query: 101 QLAISDYLFPSPRKSGRPMSYSCYSTIIRRWASQLGYDSYLYGTH 145
LF SP G Y+ I+ W ++L YL H
Sbjct: 363 HFIPETNLFVSPYYKGNQKLYTSLKLIVEEWMAKLPGKRYLNHKH 407


13  Sbal_0912  Sbal_0926  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Sbal_0912  3       15       1.325789   integrase catalytic subunit
Sbal_0913  3       15       1.539550   cytochrome d ubiquinol oxidase subunit II
Sbal_0914  3       14       1.501303   cytochrome bd ubiquinol oxidase subunit I
Sbal_0915  3       15       1.518896   GntR family transcriptional regulator
Sbal_0916  3       13       1.209538   O-acetylhomoserine/O-acetylserine sulfhydrylase
Sbal_0917  3       12       0.491669   hypothetical protein
Sbal_0918  -1      11       0.826277   rRNA (guanine-N(2)-)-methyltransferase
Sbal_0919  -2      11       0.233793   2OG-Fe(II) oxygenase
Sbal_0920  -1      12       0.304162   BolA family protein
Sbal_0921  -1      13       -0.102383  TRAP dicarboxylate transporter- DctP subunit
Sbal_0922  1       16       -0.681239  S-ribosylhomocysteinase
Sbal_0923  2       18       -0.608605  TonB-dependent receptor
Sbal_0924  3       34       -1.554746  Na(+)-translocating NADH-quinone reductase
Sbal_0925  2       30       -1.340513  Na(+)-translocating NADH-quinone reductase
Sbal_0926  2       28       -2.141338  Na(+)-translocating NADH-quinone reductase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
Sbal_0917  FLGHOOKFLIK  33     0.008    Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 32.5 bits (73), Expect = 0.008
Identities = 34/166 (20%), Positives = 65/166 (39%), Gaps = 15/166 (9%)

Query: 5 DDVAQLKAELAQLQSLHLSQQSS---LSRQLAEFSTKLDTLSQQIATEDASDTSLSMAAD 61
D A L A A L + + + + E T L+ + T D + A
Sbjct: 126 DVTASLSALFAMLPGFDNTPKVTDAPSTVLPTEKPTLFTKLTSEQLTTAQPDDAPGTPAQ 185

Query: 62 SMTAGAASIAAVVPAADNAPTLTYAIHTPILESTPVEPVPVEPSPWQQNAVQGDPWQRNT 121
+T A + + P+ A +P++ +P+P +P + WQ+
Sbjct: 186 PLTPLVAEAQSKA-EVISTPSPVTAAASPLITPHQTQPLPTVAAPVLSAPLGSHEWQQ-- 242

Query: 122 KNTSAEQVAKTEYQAQGQQLSDEVKLQ----ASVQVASQFDDLLSQ 163
+ ++ ++ + QGQQ S E++L VQ++ + DD +Q
Sbjct: 243 --SLSQHISL--FTRQGQQ-SAELRLHPQDLGEVQISLKVDDNQAQ 283


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
Sbal_0922  LUXSPROTEIN  271    6e-97    Bacterial autoinducer-2 (AI-2) production protein Lu...
		>LUXSPROTEIN#Bacterial autoinducer-2 (AI-2) production protein LuxS signature.

Length = 171

Score = 271 bits (695), Expect = 6e-97
Identities = 131/168 (77%), Positives = 150/168 (89%)

Query: 2 PLLDSFTVDHTRMNAPAVRVAKHMSTPKGDAITVFDLRFCAPNKDILSERGIHTLEHLFA 61
PLLDSFTVDHTRMNAPAVRVAK M TPKGD ITVFDLRF APNKDILSE+GIHTLEHL+A
Sbjct: 1 PLLDSFTVDHTRMNAPAVRVAKTMQTPKGDTITVFDLRFTAPNKDILSEKGIHTLEHLYA 60

Query: 62 GFMRDHLNGSNVEIIDISPMGCRTGFYMSLIGEPTERQVADAWLAAMEDVLKVVEQSEIP 121
GFMR+HLNG +VEIIDISPMGCRTGFYMSLIG P+E+QVADAW+AAMEDVLKV Q++IP
Sbjct: 61 GFMRNHLNGDSVEIIDISPMGCRTGFYMSLIGTPSEQQVADAWIAAMEDVLKVENQNKIP 120

Query: 122 ELNEYQCGTYEMHSLEQAQDIARNIIAAGVSVNRNDDLKLSDEILGNL 169
ELNEYQCGT MHSL++A+ IA+NI+ GV+VN+ND+L L + +L L
Sbjct: 121 ELNEYQCGTAAMHSLDEAKQIAKNILEVGVAVNKNDELALPESMLREL 168


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
Sbal_0923  ECOLIPORIN  29     0.046    E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 29.5 bits (66), Expect = 0.046
Identities = 14/39 (35%), Positives = 18/39 (46%)

Query: 418 EKVDVNRKVNLAYAGLEAADFSDSDWMPQLGVLYHAGDW 456
E N LA+AGL+ D+ D+ GVLY W
Sbjct: 87 EGEGANSWTRLAFAGLKFGDYGSFDYGRNYGVLYDVEGW 125


14  Sbal_1055  Sbal_1078  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Sbal_1055  1       27       -4.145858  lipoprotein signal peptidase
Sbal_1056  4       34       -5.534478  peptidylprolyl isomerase, FKBP-type
Sbal_1057  4       35       -5.846559  4-hydroxy-3-methylbut-2-enyl diphosphate
Sbal_1058  5       38       -6.856778  type IV pilus modification protein PilV
Sbal_1059  5       38       -7.041902  type IV pilus assembly protein PilW
Sbal_1060  4       36       -6.444617  type IV pilus assembly protein PilX
Sbal_1061  3       34       -6.409343  type IV pilin biogenesis protein
Sbal_1062  1       31       -6.261783  type IV pilus biogenesis protein PilE
Sbal_1063  1       29       -6.127058  hypothetical protein
Sbal_1064  0       27       -5.206201  type IV pilus biogenesis protein
Sbal_1066  0       23       -5.018078  integrase catalytic subunit
Sbal_1067  -1      21       -4.637741  IstB ATP binding domain-containing protein
Sbal_1068  -1      20       -4.106896  type IV pilus biogenesis protein
Sbal_1069  -1      18       -3.639385  nitrogen regulatory protein P-II
Sbal_1070  -1      22       -4.220961  FAD-dependent pyridine nucleotide-disulfide
Sbal_1071  0       23       -3.973694  regulatory protein, LacI
Sbal_1072  0       25       -4.564032  TonB-dependent receptor, plug
Sbal_1073  1       24       -5.573614  integrase catalytic subunit
Sbal_1074  1       26       -6.695817  transposase IS3/IS911 family protein
Sbal_1075  0       25       -6.263437  TonB-dependent receptor
Sbal_1076  0       24       -5.537871  integrase catalytic subunit
Sbal_1077  -1      21       -4.796660  SapC family protein
Sbal_1078  -2      19       -3.290610  tryptophan halogenase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_1058  BCTERIALGSPG  32     7e-04    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 31.8 bits (72), Expect = 7e-04
Identities = 10/24 (41%), Positives = 18/24 (75%), Gaps = 2/24 (8%)

Query: 21 QRGFSLIEVLVALVIL--VIGLIG 42
QRGF+L+E++V +VI+ + L+
Sbjct: 7 QRGFTLLEIMVVIVIIGVLASLVV 30


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_1062  BCTERIALGSPG  52     1e-11    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 52.2 bits (125), Expect = 1e-11
Identities = 20/61 (32%), Positives = 38/61 (62%)

Query: 6 KGFTLIEVMITVVIIGILAAIAYPSYTQYIALSARSEGLAALMRIANLQEQYYLDNRAYA 65
+GFTL+E+M+ +VIIG+LA++ P+ + + + ++ ++ + N + Y LDN Y
Sbjct: 8 RGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDNHHYP 67

Query: 66 T 66
T
Sbjct: 68 T 68


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_1064  BCTERIALGSPG  35     3e-05    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 35.2 bits (81), Expect = 3e-05
Identities = 12/28 (42%), Positives = 20/28 (71%)

Query: 6 KGFTLVELMVTIAVAAILLAIGVPSLTS 33
+GFTL+E+MV I + +L ++ VP+L
Sbjct: 8 RGFTLLEIMVVIVIIGVLASLVVPNLMG 35


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_1068  BCTERIALGSPG  30     0.002    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 29.9 bits (67), Expect = 0.002
Identities = 13/50 (26%), Positives = 30/50 (60%), Gaps = 3/50 (6%)

Query: 5 QKGFSLIELITTLSISTILFTVGTPSFT---DLSDQIRADSNIRTIQQTL 51
Q+GF+L+E++ + I +L ++ P+ + +D+ +A S+I ++ L
Sbjct: 7 QRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENAL 56


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_1072  ACRIFLAVINRP  33     0.001    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 33.3 bits (76), Expect = 0.001
Identities = 21/89 (23%), Positives = 29/89 (32%), Gaps = 10/89 (11%)

Query: 80 STVDAITAEDIGKFPDKNVAESLQRIPGVTIQRQFGEGAGVSI-----RGAGQDLTLTT- 133
S T +DI + NV ++L R+ GV + FG + I LT
Sbjct: 144 SDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQYAMRIWLDADLLNKYKLTPVDV 203

Query: 134 ---LNGQNV-ASTGWFVLEPAKRSFNYEL 158
L QN + G PA
Sbjct: 204 INQLKVQNDQIAAGQLGGTPALPGQQLNA 232


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Sbal_1074  HTHFIS  26     0.043    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 25.6 bits (56), Expect = 0.043
Identities = 10/59 (16%), Positives = 21/59 (35%), Gaps = 6/59 (10%)

Query: 7 HKSYPQAFKDEAVLMVLEQ-GYSVADAAKSLGVSTSLLYNWKEKHEALKQGITLEESER 64
+ + +L L + AA LG++ + L + G+++ S R
Sbjct: 429 YDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIREL-----GVSVYRSSR 482


15  Sbal_1133  Sbal_1142  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Sbal_1133  -1      22       -3.742936  glycoside hydrolase family protein
Sbal_1134  0       27       -5.852294  fructokinase
Sbal_1136  2       29       -6.898569  phage integrase family protein
Sbal_1137  1       25       -5.058542  hypothetical protein
Sbal_1138  1       22       -3.925854  IstB ATP binding domain-containing protein
Sbal_1139  3       18       -2.796392  integrase catalytic subunit
Sbal_1140  2       14       -1.929524  hypothetical protein
Sbal_1141  1       14       -1.135232  hypothetical protein
Sbal_1142  2       16       -0.981318  transposase IS3/IS911 family protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_1134  ACETATEKNASE  33     0.001    Acetate kinase family signature.
		>ACETATEKNASE#Acetate kinase family signature.

Length = 400

Score = 33.2 bits (76), Expect = 0.001
Identities = 16/69 (23%), Positives = 24/69 (34%), Gaps = 10/69 (14%)

Query: 184 FISGTGFVRDFRAAGGVADSGIEIAQMMQAGDPLATQAFDRFIDRLARSLAHVINMMDP- 242
+G DFR + GD A A + F R+ +++ M
Sbjct: 273 VYGISGISSDFRDL---------EDAAFKNGDKRAQLALNVFAYRVKKTIGSYAAAMGGV 323

Query: 243 DVIVLGGGV 251
DVIV G+
Sbjct: 324 DVIVFTAGI 332


16  Sbal_1252  Sbal_1308  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Sbal_1252  2       22       -3.472261  DEAD/DEAH box helicase
Sbal_1253  3       26       -4.637041  hypothetical protein
Sbal_1254  1       24       -3.728391  integrase catalytic subunit
Sbal_1255  1       26       -4.364415  hypothetical protein
Sbal_1256  1       24       -3.839579  integrase catalytic subunit
Sbal_1257  4       25       -6.243872  IstB ATP binding domain-containing protein
Sbal_1258  2       25       -5.064028  transposase IS3/IS911 family protein
Sbal_1259  2       25       -5.064028  integrase catalytic subunit
Sbal_1260  3       25       -6.564406  hypothetical protein
Sbal_1261  3       23       -4.165448  helix-turn-helix domain-containing protein
Sbal_1262  3       24       -4.604243  hypothetical protein
Sbal_1263  1       23       -1.475199  integrase catalytic subunit
Sbal_1264  2       25       -2.624997  transposase IS3/IS911 family protein
Sbal_1265  2       27       -2.672551  hypothetical protein
Sbal_1266  -1      23       -0.807786  integrase catalytic subunit
Sbal_1267  0       18       -1.834119  hypothetical protein
Sbal_1268  1       23       0.115469   integrase catalytic subunit
Sbal_1269  4       22       0.661294   bacteriophage CI repressor
Sbal_1270  8       22       4.115433   putative regulator for prophage CP-933T
Sbal_1271  10      22       3.529915   phage regulatory CII family protein
Sbal_1272  4       19       2.162292   hypothetical protein
Sbal_1273  3       18       1.921817   hypothetical protein
Sbal_1274  2       18       0.997496   hypothetical protein
Sbal_1275  0       18       -0.049298  hypothetical protein
Sbal_1276  1       21       0.592172   hypothetical protein
Sbal_1277  1       23       1.485219   bacteriophage replication gene A
Sbal_1278  1       22       1.380898   phage transcriptional activator, Ogr/delta
Sbal_1279  2       20       1.025866   prevent-host-death family protein
Sbal_1280  3       20       2.028602   hypothetical protein
Sbal_1281  3       22       3.031282   PBSX family phage portal protein
Sbal_1282  3       22       2.986218   hypothetical protein
Sbal_1283  3       28       3.172541   phage capsid scaffolding
Sbal_1284  5       30       4.114145   P2 family phage major capsid protein
Sbal_1285  5       33       6.285654   hypothetical protein
Sbal_1286  4       32       5.630482   hypothetical protein
Sbal_1287  5       35       5.964998   hypothetical protein
Sbal_1288  5       33       5.495362   hypothetical protein
Sbal_1289  7       30       4.935330   hypothetical protein
Sbal_1290  6       25       3.144853   hypothetical protein
Sbal_1291  7       27       0.362443   prophage PSPPH06 tail tube protein
Sbal_1292  7       29       0.032402   TraR/DksA family transcriptional regulator
Sbal_1293  3       23       0.620478   glycoside hydrolase
Sbal_1294  2       26       0.984238   hypothetical protein
Sbal_1295  2       29       2.412166   hypothetical protein
Sbal_1296  2       30       3.349999   hypothetical protein
Sbal_1297  1       24       0.316078   hypothetical protein
Sbal_1298  1       24       0.326923   TP901 family phage tail tape measure protein
Sbal_1299  0       24       -0.806149  hypothetical protein
Sbal_1300  0       22       -1.031321  putative bacteriophage protein
Sbal_1301  0       22       -2.405785  hypothetical protein
Sbal_1302  0       21       -3.184794  hypothetical protein
Sbal_1303  0       16       -1.130593  hypothetical protein
Sbal_1304  -1      15       -1.308743  hypothetical protein
Sbal_1305  0       14       -0.742462  hypothetical protein
Sbal_1306  -1      13       -1.107233  hypothetical protein
Sbal_1307  -1      21       -1.794846  SsrA-binding protein
Sbal_1308  -1      24       -3.479057  cyclase/dehydrase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_1294  cloacin  29     0.010    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 29.3 bits (65), Expect = 0.010
Identities = 16/41 (39%), Positives = 24/41 (58%), Gaps = 3/41 (7%)

Query: 53 QTDLNNATVALKAAEIEKDRLRLDAALTAKTLTVREQERNK 93
QTD+NN A AA EK DAAL++ + R+++ +K
Sbjct: 401 QTDVNNKQAAFDAAAKEKS--DADAALSS-AMESRKKKEDK 438


17  Sbal_1419  Sbal_1424  Y  N  N  Genomic Island
LocusTag   DNBias  CDNBias  %GCBias     Product
Sbal_1419  2       14       2.665512    hypothetical protein
Sbal_1420  3       16       3.009373    hypothetical protein
Sbal_1421  2       15       3.131745    hypothetical protein
Sbal_1422  2       14       3.106927    PfaD family protein
Sbal_1423  3       14       2.979525    Beta-hydroxyacyl-(acyl-carrier-protein)
Sbal_1424  2       13       2.650995    PfaB family protein
18  Sbal_1434  Sbal_1439  Y  N  N  Genomic Island
LocusTag   DNBias  CDNBias  %GCBias     Product
Sbal_1434  2       15       -1.795825   hypothetical protein
Sbal_1435  2       16       -1.459493   hypothetical protein
Sbal_1436  3       14       -0.781105   hypothetical protein
Sbal_1437  3       14       -0.076701   N-acetyltransferase GCN5
Sbal_1438  3       15       -0.180624   hypothetical protein
Sbal_1439  2       17       0.863048    hypothetical protein
19  Sbal_1473  Sbal_1478  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Sbal_1473  2       15       0.733547    dTDP-4-dehydrorhamnose reductase
Sbal_1474  2       14       0.253627    3'(2'),5'-bisphosphate nucleotidase
Sbal_1475  2       16       -0.408013   fructokinase
Sbal_1476  3       18       -1.240928   hypothetical protein
Sbal_1477  3       16       -1.157443   hypothetical protein
Sbal_1478  2       19       -0.910718   decaheme cytochrome c
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_1473  NUCEPIMERASE  37     1e-04    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 36.7 bits (85), Expect = 1e-04
Identities = 35/181 (19%), Positives = 53/181 (29%), Gaps = 48/181 (26%)

Query: 73 QLDITDAANIAAVFDQYRPSWVINCAAYNAVDAAEHDAIEAHRVNALGPELLAQQCLQSG 132
++D+ D + +F V AV + + N G + + C +
Sbjct: 58 KIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNK 117

Query: 133 -ARLLHVSSDYVFGGHTVCGIAHAAERAVCDARESGVEQHQNPDLAPNSNPNHLPRPFVE 191
LL+ SS V+G + PF
Sbjct: 118 IQHLLYASSSSVYGLNR-------------------------------------KMPFST 140

Query: 192 LD-APEPLSTYGKSKLLGELKVVA---ILGERATIVRTSWLYGQNG------HNFVKTML 241
D P+S Y +K EL + G AT +R +YG G F K ML
Sbjct: 141 DDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFFTVYGPWGRPDMALFKFTKAML 200

Query: 242 N 242

Sbjct: 201 E 201


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_1475  ACETATEKNASE  29     0.022    Acetate kinase family signature.
		>ACETATEKNASE#Acetate kinase family signature.

Length = 400

Score = 29.0 bits (65), Expect = 0.022
Identities = 9/47 (19%), Positives = 17/47 (36%), Gaps = 1/47 (2%)

Query: 218 IDEGDAIAVAAFDRYMDRLARSLAHVINMLDP-DAIVLGGGMSNVAA 263
GD A A + + R+ +++ + D IV G+
Sbjct: 291 FKNGDKRAQLALNVFAYRVKKTIGSYAAAMGGVDVIVFTAGIGENGP 337


20  Sbal_1576  Sbal_1589  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Sbal_1576  2       21       -4.442413   gluconate transporter
Sbal_1577  3       27       -5.988287   catalase domain-containing protein
Sbal_1578  2       29       -6.173205   IstB ATP binding domain-containing protein
Sbal_1579  1       27       -5.426402   integrase catalytic subunit
Sbal_1580  1       25       -4.385460   DNA methylase N-4/N-6 domain-containing protein
Sbal_1581  1       21       -1.089481   mobile mystery protein B
Sbal_1582  0       14       -0.374759   mobile mystery protein A
Sbal_1583  2       12       0.129370    phage integrase family protein
Sbal_1584  4       16       0.717974    phage integrase family protein
Sbal_1585  5       22       0.182540    phage SPO1 DNA polymerase domain-containing
Sbal_1586  5       20       0.233513    hypothetical protein
Sbal_1587  4       19       0.268971    outer membrane protein MtrB
Sbal_1588  4       20       0.319615    cytochrome C family protein
Sbal_1589  4       18       0.278826    decaheme cytochrome c
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
Sbal_1583  TONBPROTEIN  39     1e-05    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 39.2 bits (91), Expect = 1e-05
Identities = 16/53 (30%), Positives = 24/53 (45%), Gaps = 1/53 (1%)

Query: 379 SEPLKPYIPQTNEAPTPAVKVKQKRLPERKPTAKPKPKAVKKVK-QVKPLQTP 430
+P E P A V +K P+ KP KP K ++ K VKP+++
Sbjct: 68 VVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESR 120


21  Sbal_1600  Sbal_1606  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Sbal_1600  2       25       -0.992714   cyclophilin type peptidyl-prolyl cis-trans
Sbal_1601  1       21       -0.659194   cysteinyl-tRNA synthetase
Sbal_1602  3       24       -0.782851   bifunctional 5,10-methylene-tetrahydrofolate
Sbal_1603  3       26       -1.051803   ***trigger factor
Sbal_1604  2       20       -0.681134   ATP-dependent Clp protease proteolytic subunit
Sbal_1605  2       21       -0.669258   ATP-dependent protease ATP-binding subunit ClpX
Sbal_1606  2       19       -0.534644   ATP-dependent protease La
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Sbal_1605  HTHFIS  30     0.018    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.2 bits (68), Expect = 0.018
Identities = 15/70 (21%), Positives = 29/70 (41%), Gaps = 13/70 (18%)

Query: 64 KLPTPHELRAHLDDYVIGQDRAKKVLSVAVYNHYKRLKNASPKDGIELGKSNILLIGPTG 123
+ P+ E + ++G+ S A+ Y+ L D +++ G +G
Sbjct: 124 RRPSKLEDDSQDGMPLVGR-------SAAMQEIYRVLARLMQTD------LTLMITGESG 170

Query: 124 SGKTLLAETL 133
+GK L+A L
Sbjct: 171 TGKELVARAL 180


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Sbal_1606  HTHFIS  34     0.002    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 34.4 bits (79), Expect = 0.002
Identities = 45/211 (21%), Positives = 76/211 (36%), Gaps = 37/211 (17%)

Query: 261 NMPAEAKEKALAELNKLRMMSP---MSAEATV---VRSY----VDWMTSVPWSQRSKIKR 310
MP E L + K R P MSA+ T +++ D++ P+ I
Sbjct: 56 VMPDENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPK-PFDLTELIGI 114

Query: 311 D---------LAKAQEVLDTDHFGLEKVKERILEYLAVQSRVRQLKGPILCLVGPPGVGK 361
E D L + E V +R+ Q ++ + G G GK
Sbjct: 115 IGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLM-ITGESGTGK 173

Query: 362 TSLGQSIAKATGRK---YVRVALGGVRD---EAEIRGHRRTYIGSMPGKVIQKMAKVGVK 415
+ +++ R+ +V + + + E+E+ GH + G+ G + +
Sbjct: 174 ELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEK---GAFTGAQTRSTGRFEQA 230

Query: 416 N--PLFLLDEIDKMSSDMRGDPASALLEVLD 444
LFL DEI M D + + LL VL
Sbjct: 231 EGGTLFL-DEIGDMPMDAQ----TRLLRVLQ 256


22  Sbal_1628  Sbal_1655  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Sbal_1628  1       21       -3.241484   hypothetical protein
Sbal_1629  1       21       -3.437858   ATP-dependent DNA helicase DinG
Sbal_1630  2       26       -3.974268   DNA polymerase II
Sbal_1631  4       32       -5.057845   porin
Sbal_1632  3       33       -5.195709   TonB-dependent receptor, plug
Sbal_1633  2       28       -4.574180   TonB-dependent receptor
Sbal_1634  1       22       -3.743234   integrase catalytic subunit
Sbal_1636  1       25       -3.799023   transposase, IS4 family protein
Sbal_1637  1       29       -4.535249   IstB ATP binding domain-containing protein
Sbal_1638  2       30       -4.016920   integrase catalytic subunit
Sbal_1640  4       32       -4.290740   hypothetical protein
Sbal_1641  3       30       -4.408115   MotA/TolQ/ExbB proton channel
Sbal_1642  2       23       -3.309146   MotA/TolQ/ExbB proton channel
Sbal_1643  3       23       -3.296196   biopolymer transport protein ExbD/TolR
Sbal_1644  3       21       -2.861094   TonB family protein
Sbal_1645  3       20       -3.325761   hypothetical protein
Sbal_1646  2       15       -2.215788   diguanylate cyclase
Sbal_1647  1       13       -0.883527   hypothetical protein
Sbal_1648  3       16       -0.272297   PpiC-type peptidyl-prolyl cis-trans isomerase
Sbal_1649  1       14       0.003194    N-acetyltransferase GCN5
Sbal_1650  6       14       1.220394    RNA-binding S4 domain-containing protein
Sbal_1651  3       13       0.253887    hypothetical protein
Sbal_1652  4       13       0.010375    LysR family transcriptional regulator
Sbal_1653  3       14       -0.616829   hypothetical protein
Sbal_1654  4       14       -1.071345   nuclease SbcCD subunit D
Sbal_1655  4       13       -1.270186   SMC domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
Sbal_1631  ECOLIPORIN  70     4e-15    E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 69.6 bits (170), Expect = 4e-15
Identities = 100/417 (23%), Positives = 163/417 (39%), Gaps = 52/417 (12%)

Query: 1 MNKTLVATALAAIFLVPSVSAIEIYKDNKNAVEIGGFIDARVINTQGETEVVNG-ASRIN 59
M + ++A + A+ + A EIY + N +++ G +D + ++ +G + +
Sbjct: 1 MKRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSK--DGDQTYMR 58

Query: 60 FGFNRE--LTDGWKAFAKLEWGVNPVGSSDIVYNNRFESVQEEFFYNRLGYAGLSHDTYG 117
GF E + D + + E+ V N E + RL +AGL YG
Sbjct: 59 VGFKGETQINDQLTGYGQWEYNVQ---------ANTTEGEGANSW-TRLAFAGLKFGDYG 108

Query: 118 TLTIGKQWGAWYDVVYNTNYGFVWDGNTAGVYTYNKDDGAVNGVGRGDKTVQYRNA--FG 175
+ G+ +G YDV T+ + G++ Y N G NGV YRN FG
Sbjct: 109 SFDYGRNYGVLYDVEGWTDMLPEFGGDSYT-YADNYMTGRANGV------ATYRNTDFFG 161

Query: 176 DV---SFAVQAQLKNSSFYTCDTTDDITQAQCQADW--ESGDKAAQQVEYNYTYGGALTY 230
V +FA+Q Q KN S D D ++GD Y+ G +
Sbjct: 162 LVDGLNFALQYQGKNESQSADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGA 221

Query: 231 KATDMLTLTAGINRGEFEVSFGNGEQRTAIDVIYGAGITWGNFDNDGLYAAA------NF 284
T +N G + G++ A + AG+ +D + +Y A N
Sbjct: 222 AYTTSDRTNEQVNAG---GTIAGGDKADA----WTAGL---KYDANNIYLATMYSETRNM 271

Query: 285 NRQENHDTDNIGRLIKDAYGIESLVSYKFDNGLRPFISYNVLDAGKDYVIQPNFNADPND 344
D G + E Y+FD GLRP +S+ ++ GKD + N N D D
Sbjct: 272 TPYGKTDKGYDGGVANKTQNFEVTAQYQFDFGLRPAVSF-LMSKGKD-LTYNNVNGDDKD 329

Query: 345 EFKRQFLVVGLHFVWDPNTVLYIEARKDYSDFTSADKDQEARMALSESDGVAIGIRY 401
K + VG + ++ N Y++ + + D D +S D VA+G+ Y
Sbjct: 330 LVK--YADVGATYYFNKNFSTYVDYKINLLD---DDDPFYKDAGISTDDIVALGMVY 381


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_1644  PF03544  102    4e-29    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 102 bits (256), Expect = 4e-29
Identities = 35/169 (20%), Positives = 65/169 (38%), Gaps = 11/169 (6%)

Query: 39 TPVIEITMDRQDSKAQNKPRVVPKPPPPPEQPQKPDTTPPDSSSNID----TSMSFNMGG 94
P E + K PKP P P+ P S N
Sbjct: 75 EPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAP 134

Query: 95 VEAGGASTG-FKLGNMMTRDGDATPIVRIEPQYPIAAARDGKEGWVQLRFTINELGGIDD 153
++ + + + R +PQYP A EG V+++F + G +D+
Sbjct: 135 ARPTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIEGQVKVKFDVTPDGRVDN 194

Query: 154 VEVIQAEPKRLFDKEAIRALKKWKYKPKIVDGKPLKQTGQTVQLDFTLD 202
V+++ A+P +F++E A+++W+Y+P +G V + F ++
Sbjct: 195 VQILSAKPANMFEREVKNAMRRWRYEP------GKPGSGIVVNILFKIN 237


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_1645  SYCDCHAPRONE  30     0.011    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 29.9 bits (67), Expect = 0.011
Identities = 11/52 (21%), Positives = 21/52 (40%)

Query: 197 AYFNQKKYKKAVGVLEVMVPLFPEDGRLWVQLAQFYLMVEDYDKSLATYDLA 248
+ KY+ A V + + L D R ++ L + YD ++ +Y
Sbjct: 45 NQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYG 96


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_1649  SACTRNSFRASE  26     0.046    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 26.4 bits (58), Expect = 0.046
Identities = 12/73 (16%), Positives = 26/73 (35%), Gaps = 1/73 (1%)

Query: 97 LAAEGQGKGYATESLMAVIDWACLSFNVHKFVGHCAKDNHASARVLEKCGFRLEGLLRQQ 156
+A + + KG T L I+WA + + N ++ K F + +
Sbjct: 97 VAKDYRKKGVGTALLHKAIEWA-KENHFCGLMLETQDINISACHFYAKHHFIIGAVDTML 155

Query: 157 FKMGENWFDESVF 169
+ + ++F
Sbjct: 156 YSNFPTANEIAIF 168


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
Sbal_1655  IGASERPTASE  54     3e-09    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 53.9 bits (129), Expect = 3e-09
Identities = 52/307 (16%), Positives = 106/307 (34%), Gaps = 16/307 (5%)

Query: 198 AADISALVKAQRSRRDGILQSAGLASDDELSNELAKLTPELALA--QSAKEQALQQQQLI 255
+I A V + S + I + ++ T +A Q +K +Q
Sbjct: 1000 PNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDAT 1059

Query: 256 IKASDAAQHLLAEFAQFDTLTQTAAALEAQQESIVAQT-HKLNLANQAQRLAPMVEVFLA 314
+ + + TQT ++ E+ QT A + VE
Sbjct: 1060 ETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKT 1119

Query: 315 REQEAKAANLAFSHAQTALTQAKQAFDDAELKAQDLPVLEASLLEQEQAKQQLNALGPQL 374
+E + ++ Q+ Q + AE ++ P + ++ Q++ A Q
Sbjct: 1120 QEVPKVTSQVSPKQEQSETVQPQ-----AEPARENDPTVNI---KEPQSQTNTTADTEQP 1171

Query: 375 -RELDRLNKTLEQEQAQLVRAKAQLQNSKNELTAATQKRRELESALPPLQANSDTRLSLQ 433
+E + E + + ++N +N A TQ ES+ P + + S
Sbjct: 1172 AKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRS-- 1229

Query: 434 QAHQQQQQLLSTYQQWQQVAARVSS--TKEKLAEAKAQGQQLNAEHQQAQVAHKALLLTW 491
H + S+ + ++S T L++A+A+ Q + +A H + L
Sbjct: 1230 VPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVALNVGKAVSQHISQLEMN 1289

Query: 492 HQGQAAI 498
++GQ +
Sbjct: 1290 NEGQYNV 1296



Score = 30.4 bits (68), Expect = 0.046
Identities = 53/358 (14%), Positives = 109/358 (30%), Gaps = 43/358 (12%)

Query: 361 EQAKQQLNALGPQLRELDRLNKTL--------------------------EQEQA-QLVR 393
E +L + D LN +L E E+ Q V
Sbjct: 934 EPNHNELTLFDASKAQRDHLNVSLVGNTVDLGAWKYKLRNVNGRYDLYNPEVEKRNQTVD 993

Query: 394 AK--AQLQNSKNELTAATQKRREL----ESALPPLQANSDTRLSLQQAHQQQQQLLSTYQ 447
N + ++ + E+ E+ +PP + + + A +Q+ + +
Sbjct: 994 TTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEK 1053

Query: 448 QWQQVAARVSSTKEKLAEAKAQGQQLNAEHQQAQVAHKALLLTWHQGQAAILARQLQQDE 507
Q + +E EAK + A Q +VA Q ++++E
Sbjct: 1054 NEQDATETTAQNREVAKEAK---SNVKANTQTNEVAQSGSETKETQTTETKETATVEKEE 1110

Query: 508 PCPVCGSQTHPQPAQSQEPLPSDEALQLAQ-DTETTAQEVLSKARAEYRGLDAQFKILQQ 566
V +T P + + P E + Q E + + E + +Q
Sbjct: 1111 KAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQ 1170

Query: 567 QSQDLVAQLGTAVDISQDQHAHTLSQYA---FSLTQAEQAHQDLQQLQQEIVLLHAQETH 623
+++ + + V ++ +T + + T A + + H +
Sbjct: 1171 PAKETSSNVEQPV--TESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVR 1228

Query: 624 LQQKLEQGQTHDSALQSKVDLLQGQLAHIQQSVPPALATLEALTAAITQN-QQQISQI 680
+ T S +S V L + + A A + + + + Q ISQ+
Sbjct: 1229 SVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVALNVGKAVSQHISQL 1286


23  Sbal_1807  Sbal_1830  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Sbal_1807  2       21       0.306222    DoxX family protein
Sbal_1808  1       23       -0.908991   nitroreductase
Sbal_1809  3       24       -1.850491   N-acetyltransferase GCN5
Sbal_1810  1       20       -2.621782   prolyl 4-hydroxylase subunit alpha
Sbal_1811  2       19       -1.954687   N-acetyltransferase GCN5
Sbal_1812  4       17       -2.191878   N-acetyltransferase GCN5
Sbal_1813  3       14       -1.660515   hypothetical protein
Sbal_1814  0       13       0.053976    hypothetical protein
Sbal_1815  -1      13       0.060561    hypothetical protein
Sbal_1816  -1      14       -1.296772   hypothetical protein
Sbal_1817  -3      14       -2.909317   hypothetical protein
Sbal_1818  -2      16       -3.837604   hypothetical protein
Sbal_1819  0       20       -5.118719   PKD domain-containing protein
Sbal_1820  7       43       -12.272748  hypothetical protein
Sbal_1822  5       40       -11.054397  hypothetical protein
Sbal_1823  4       39       -9.994101   hypothetical protein
Sbal_1824  2       31       -8.055902   hypothetical protein
Sbal_1825  3       28       -7.245466   hypothetical protein
Sbal_1826  2       24       -6.455247   hypothetical protein
Sbal_1827  3       19       -3.303953   integrase catalytic subunit
Sbal_1828  1       19       -0.843623   IstB ATP binding domain-containing protein
Sbal_1829  2       20       -0.594670   hypothetical protein
Sbal_1830  2       16       -0.067320   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_1812  SACTRNSFRASE  41     5e-07    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 40.7 bits (95), Expect = 5e-07
Identities = 20/57 (35%), Positives = 30/57 (52%)

Query: 80 LILNDVYVTQHARCVGIGRALVQQAASYAKAHNMSYLMLETQQKNQRAQGLYEGLGF 136
++ D+ V + R G+G AL+ +A +AK ++ LMLETQ N A Y F
Sbjct: 90 ALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
Sbal_1819  MICOLLPTASE  76     4e-16    Microbial collagenase metalloprotease (M9) signature.
		>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature.

Length = 1104

Score = 76.3 bits (187), Expect = 4e-16
Identities = 34/128 (26%), Positives = 60/128 (46%), Gaps = 9/128 (7%)

Query: 784 APVASFTQVVNGAAVQLTST-STDSDGQIVSAEWSFGDNTVAVGEVVTHSYSQSGEYLVT 842
A + S + V+ + T S D DG+I + EW FGD + TH Y+++GEY V
Sbjct: 777 AVIKSDSSVIVEEEINFDGTESKDEDGEIKAYEWDFGDGEKSNEAKATHKYNKTGEYEVK 836

Query: 843 LTVTDNDGLTHSTSQTVTVVVGEVKQP------PVAQIQRINLLF-VDMFISTSYDTDGV 895
LTVTDN+G ++ S+ + VV + P ++ N + +M + + +
Sbjct: 837 LTVTDNNGGINTESKKI-KVVEDKPVEVINESEPNNDFEKANQIAKSNMLVKGTLSEEDY 895

Query: 896 IKQHKWTF 903
++ +
Sbjct: 896 SDKYYFDV 903



Score = 40.5 bits (94), Expect = 4e-05
Identities = 18/55 (32%), Positives = 32/55 (58%), Gaps = 1/55 (1%)

Query: 889 SYDTDGVIKQHKWTFDNGTRAN-GQVVLRLARRGQHTVELTVKDNDKLTDTTTLT 942
S D DG IK ++W F +G ++N + + + G++ V+LTV DN+ +T +
Sbjct: 798 SKDEDGEIKAYEWDFGDGEKSNEAKATHKYNKTGEYEVKLTVTDNNGGINTESKK 852


24  Sbal_1923  Sbal_1946  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Sbal_1923  0       17       -4.392332   hypothetical protein
Sbal_1924  0       18       -4.812187   methyl-accepting chemotaxis sensory transducer
Sbal_1925  1       24       -6.785711   *IstB ATP binding domain-containing protein
Sbal_1926  1       23       -6.407630   integrase catalytic subunit
Sbal_1927  0       21       -5.356005   phage integrase family protein
Sbal_1928  1       20       -3.204470   integrase catalytic subunit
Sbal_1929  1       21       -3.974378   transposase IS3/IS911 family protein
Sbal_1930  1       21       -4.075826   phage integrase
Sbal_1931  2       21       -2.556726   transposase IS3/IS911 family protein
Sbal_1932  2       22       -3.527013   integrase catalytic subunit
Sbal_1934  3       23       -4.256945   transposase IS3/IS911 family protein
Sbal_1935  3       23       -4.373828   integrase catalytic subunit
Sbal_1936  3       24       -3.745818   GntR family transcriptional regulator
Sbal_1937  2       24       -3.333064   PhzF family phenazine biosynthesis protein
Sbal_1938  0       18       -3.517568   hypothetical protein
Sbal_1940  1       18       -2.594438   MerR family transcriptional regulator
Sbal_1941  2       17       -2.199817   hypothetical protein
Sbal_1942  2       16       -2.005898   integrase catalytic subunit
Sbal_1943  3       16       -2.600724   two component LuxR family transcriptional
Sbal_1944  3       16       -2.821354   integral membrane sensor hybrid histidine
Sbal_1945  5       17       -2.293661   hypothetical protein
Sbal_1946  5       16       -2.089011   YscC/HrcC family type III secretion outer
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Sbal_1929  HTHFIS  26     0.043    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 25.6 bits (56), Expect = 0.043
Identities = 10/59 (16%), Positives = 21/59 (35%), Gaps = 6/59 (10%)

Query: 7 HKSYPQAFKDEAVLMVLEQ-GYSVADAAKSLGVSTSLLYNWKEKHEALKQGITLEESER 64
+ + +L L + AA LG++ + L + G+++ S R
Sbjct: 429 YDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIREL-----GVSVYRSSR 482


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Sbal_1943  HTHFIS  62     1e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 62.2 bits (151), Expect = 1e-13
Identities = 31/159 (19%), Positives = 66/159 (41%), Gaps = 14/159 (8%)

Query: 11 ILVVDDHSLIFDGLLGCLAPYPELNL-IGSVEDGLAVYEKCLKLRPDLVFMDLKLPGMGG 69
ILV DD + I L L+ + DLV D+ +P
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAA---GDGDLVVTDVVMPDENA 62

Query: 70 FDVIRQLRQRWPEMMIIMLTATVEEKSAREALDVGANGYVLKYSPKSTLLAAIKCVCKGK 129
FD++ ++++ P++ +++++A +A +A + GA Y+ K + L+ I
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL--- 119

Query: 130 NFIDPSLDERQIAALSGVVDGDMPLL--TPREQQVLKLI 166
+ +R+ + L MPL+ + Q++ +++
Sbjct: 120 -----AEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVL 153


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Sbal_1944  HTHFIS  73     2e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 72.9 bits (179), Expect = 2e-15
Identities = 26/120 (21%), Positives = 45/120 (37%), Gaps = 8/120 (6%)

Query: 696 KILLVDDVETNRDIIGKMLLELGQQVIAVNSGEAALEKGTRHIFDLVLMDIRMPGLDGYQ 755
IL+ DD R ++ + L G V ++ DLV+ D+ MP + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 756 TTQQWRHSEDILDGDCPIFALTANANPKEHDTIEA--AGMNSYITKPVSLKQLNHALEAA 813
+ + D P+ ++A I+A G Y+ KP L +L + A
Sbjct: 65 LLPRIKK----ARPDLPVLVMSAQNTF--MTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_1946  TYPE3OMGPROT  441    e-152    Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature.

Length = 607

Score = 441 bits (1135), Expect = e-152
Identities = 157/503 (31%), Positives = 263/503 (52%), Gaps = 24/503 (4%)

Query: 6 SLLLLCQMGLAQAAPLTNIKWQGEPFVMISRGTALTSVIQDFASNYGVPVIVSNKVNDNY 65
+LLLL AQ W P+V +++G +L ++ DF +NY V+VS+K+ND
Sbjct: 16 TLLLLSSYSWAQELD-----WLPIPYVYVAKGESLRDLLTDFGANYDATVVVSDKINDKV 70

Query: 66 IGQIQQQDPQSVIQDLTRRFGLVWYYNNDVLYVYKASEINSEVLPLTSLSAAKVDHYLRS 125
GQ + +PQ +Q + + LVWYY+ +VLY++K SE+ S ++ L AA++ L+
Sbjct: 71 SGQFEHDNPQDFLQHIASLYNLVWYYDGNVLYIFKNSEVASRLIRLQESEAAELKQALQR 130

Query: 126 AGVLDKGVCSIKSMAGISGLQVTGVPECINSVTKLTAQLDANAKQTTE--NQETVKVYPL 183
+G+ + + A + V+G P + V + A L+ + +E ++++PL
Sbjct: 131 SGIWEPR-FGWRPDASNRLVYVSGPPRYLELVEQTAAALEQQTQIRSEKTGALAIEIFPL 189

Query: 184 KYASATDSVYQYRSQPVSIPGLVTVLKEMDQGTQV-------ANAVAGSVSNISGPVFAA 236
KYASA+D YR V+ PG+ T+L+ + + + + A
Sbjct: 190 KYASASDRTIHYRDDEVAAPGVATILQRVLSDATIQQVTVDNQRIPQAATRASAQARVEA 249

Query: 237 DPRQNAIIVRGSARDMATYGSLIRQLDTKPTMIEVSVSIFDVDASDFKQLGIDWSASAKL 296
DP NAIIVR S M Y LI LD IEV++SI D++A +LG+DW +
Sbjct: 250 DPSLNAIIVRDSPERMPMYQRLIHALDKPSARIEVALSIVDINADQLTELGVDWRVGIRT 309

Query: 297 GGGSVSFN---------SGDSSDNFSTVIGNTGNFMLRLNALEKNSKAKVLSRPSVVTLN 347
G + + + + R+N LE A+V+SRP+++T
Sbjct: 310 GNNHQVVIKTTGDQSNIASNGALGSLVDARGLDYLLARVNLLENEGSAQVVSRPTLLTQE 369

Query: 348 NVQAVLDKNVTFYTKLEGDKVAKLESVTTGSLLRVTPRLIDEVGHQAVMLDLNIQDGQQS 407
N QAV+D + T+Y K+ G +VA+L+ +T G++LR+TPR++ + + L+L+I+DG Q
Sbjct: 370 NAQAVIDHSETYYVKVTGKEVAELKGITYGTMLRMTPRVLTQGDKSEISLNLHIEDGNQK 429

Query: 408 QAVSRSEPLPQVQNSEISTQATLKSGESLLLGGFVQDRDETTQNKIPLLGDLPLLGGLFR 467
S E +P + + + T A + G+SL++GG +D +K+PLLGD+P +G LFR
Sbjct: 430 PNSSGIEGIPTISRTVVDTVARVGHGQSLIIGGIYRDELSVALSKVPLLGDIPYIGALFR 489

Query: 468 STDHHTQSVMRLFLIKAEPVNQG 490
T+ +RLF+I+ +++G
Sbjct: 490 RKSELTRRTVRLFIIEPRIIDEG 512


25  Sbal_1956  Sbal_1980  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Sbal_1956  3       19       -1.944594   type III secretion low calcium response
Sbal_1957  2       20       -2.237206   hypothetical protein
Sbal_1958  1       22       -2.266388   helix-turn-helix domain-containing protein
Sbal_1959  2       22       -0.788160   type III secretion system needle protein
Sbal_1960  1       18       -0.158808   type III secretion system protein SsaH family
Sbal_1961  1       17       0.006541    YscI/HrpB family type III secretion apparatus
Sbal_1962  1       16       0.070757    YscJ/HrcJ family type III secretion apparatus
Sbal_1963  2       17       0.274394    hypothetical protein
Sbal_1964  1       16       0.301979    type III secretion system apparatus protein
Sbal_1965  0       16       0.690664    TyeA family type III secretion effector delivery
Sbal_1966  0       17       0.368035    TIR chaperone family protein
Sbal_1967  0       16       0.567840    hypothetical protein
Sbal_1968  2       16       1.230135    hypothetical protein
Sbal_1969  3       15       1.035068    secretion system apparatus protein SsaV
Sbal_1970  4       17       1.058013    type III secretion system ATPase
Sbal_1971  5       17       -0.517400   hypothetical protein
Sbal_1972  4       18       -0.358030   hypothetical protein
Sbal_1973  3       18       -1.190450   hypothetical protein
Sbal_1974  3       15       -2.046929   type III secretion system protein
Sbal_1975  3       16       -1.993565   HrpO family type III secretion protein
Sbal_1976  2       16       -1.446170   type III secretion protein SpaR/YscT/HrcT
Sbal_1977  3       17       -1.347544   secretion system apparatus protein SsaU
Sbal_1978  4       17       -1.317613   cupin
Sbal_1979  2       15       -0.634859   hypothetical protein
Sbal_1980  2       14       -0.750669   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_1956  SYCDCHAPRONE  94     3e-27    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 94.2 bits (234), Expect = 3e-27
Identities = 44/148 (29%), Positives = 68/148 (45%)

Query: 9 DFEKLEAACQLALVNQQTLAEQVGLTSQDLEQTYQSGTSKYQMGLPAEAIVDFTYLVMHQ 68
D ++ + A + L T+A ++S LEQ Y ++YQ G +A F L +
Sbjct: 7 DTQEYQLAMESFLKGGGTIAMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLD 66

Query: 69 PWDRRFHLGLGSCLHWLGEYQHALTFYGYALVMDACSPDASFRIAQCFLSLNDDAAAIEA 128
+D RF LGLG+C +G+Y A+ Y Y +MD P F A+C L + A A
Sbjct: 67 HYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESG 126

Query: 129 LQMAISQSFSKPEHHFVGEQAQQLLSAL 156
L +A K E + + +L A+
Sbjct: 127 LFLAQELIADKTEFKELSTRVSSMLEAI 154


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Sbal_1957  RTXTOXIND  36     2e-04    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 36.0 bits (83), Expect = 2e-04
Identities = 23/188 (12%), Positives = 62/188 (32%), Gaps = 19/188 (10%)

Query: 185 LVVGTLWSAVVSPPSLPAHIAGTVNIVNMARRSAEPVVEGVIG--LV-DNTSTK---VLD 238
LV+ + S + + A G + + + +P+ ++ +V + S + VL
Sbjct: 68 LVIAFILSVL-GQVEIVATANGKL-THSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLL 125

Query: 239 KLKGTSQNKSLEAEVSSVKAYQLRQLQASALTQQGNMKLSEAQLAVKDSQAKEKTAQFDA 298
KL SS+ +L Q + L++ + +L +
Sbjct: 126 KLTALGAEADTLKTQSSLLQARLEQTRYQILSRS----IELNKLPELKLPDEPYFQNVSE 181

Query: 299 EIRMKQSQQLRGTNQALQQQLADKDGLLLQSQNQFEQLQSRFDKSNVQLSGVMQQLHMLQ 358
E ++ + ++ Q Q Q + ++ ++ +++ + +
Sbjct: 182 EEVLRLTSLIKEQFSTWQNQKY-------QKELNLDKKRAERLTVLARINRYENLSRVEK 234

Query: 359 QQLAELQP 366
+L +
Sbjct: 235 SRLDDFSS 242


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_1962  FLGMRINGFLIF  79     4e-19    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 79.2 bits (195), Expect = 4e-19
Identities = 48/190 (25%), Positives = 81/190 (42%), Gaps = 10/190 (5%)

Query: 22 LYRDLPQDEANQMVALLMLNHIDASAEADQKSGNVSLKIEKDQFINAVELLRQNGFPKPH 81
L+ +L + +VA L +I +G+ ++++ D+ L Q G PK
Sbjct: 53 LFSNLSDQDGGAIVAQLTQMNIPY----RFANGSGAIEVPADKVHELRLRLAQQGLPKGG 108

Query: 82 YANIEDLFPSGQLVTSPAQEEAKMGYLKEQQLERTLSSMDGVISARVSIAEPAPDTGRQL 141
E L + S E+ E +L RT+ ++ V SARV +A P P +
Sbjct: 109 AVGFE-LLDQEKFGISQFSEQVNYQRALEGELARTIETLGPVKSARVHLAMPKPSLFVRE 167

Query: 142 AQTKSASVYIKYSPQANLTNTE-NQIKSLVQNAVPGLSYDNISVFLQAASYRYQAITQPT 200
++ SASV + P L + + + LV +AV GL N+++ Q+ +TQ
Sbjct: 168 QKSPSASVTVTLEPGRALDEGQISAVVHLVSSAVAGLPPGNVTLVDQSGHL----LTQSN 223

Query: 201 SSSSSQLLAQ 210
+S AQ
Sbjct: 224 TSGRDLNDAQ 233


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_1965  PF07201  32     0.003    Hypersensitivity response secretion protein HrpJ
		>PF07201#Hypersensitivity response secretion protein HrpJ

Length = 293

Score = 31.7 bits (72), Expect = 0.003
Identities = 37/190 (19%), Positives = 68/190 (35%), Gaps = 26/190 (13%)

Query: 79 LEQLLQQLGDTQATTVNKLVEQFASLG----EGNELLSQLKQLGLDSGNMMLLMMALVVS 134
+ +LL L ++ ++++L E ++L L+ +
Sbjct: 103 VSELLSLLSNSPNISLSQLKAYLEGKSEEPSEQFKMLCGLRDALKGRPEL---------- 152

Query: 135 GKLGQSAHNKLRKLLTELLAQEGAEIALFAALEGVA-----LDHAGLQALQQLYQQAVRG 189
+ + + L + ++G I L A + A LQ L+ Y+ AV G
Sbjct: 153 ----AHLSHLVEQALVSMAEEQGETIVLGARITPEAYRESQSGVNPLQPLRDTYRDAVMG 208

Query: 190 DASLAKWFELLQHL---PDRRKRIRVLLRALSEPLSDQHSGRNMVKIAAAVDDLRRLLIF 246
+ + LQ D I L +ALS L Q SG K+ + DL++L F
Sbjct: 209 YQGIYAIWSDLQKRFPNGDIDSVILFLQKALSADLQSQQSGSGREKLGIVISDLQKLKEF 268

Query: 247 LTIEEHCHML 256
++ +
Sbjct: 269 GSVSDQVKGF 278


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_1966  PF05932  75     2e-20    Tir chaperone protein (CesT)
		>PF05932#Tir chaperone protein (CesT)

Length = 127

Score = 75.2 bits (185), Expect = 2e-20
Identities = 23/122 (18%), Positives = 47/122 (38%), Gaps = 11/122 (9%)

Query: 4 HDQLLAQFGQALGL-PLVFDANGQCLLMLDEKLMISI--RYGDVQWTFYCMLAQQPLAEK 60
+ LL F ++L + PLVFD +G C +++D +++ Y + +L +
Sbjct: 6 YKTLLDDFSRSLEMQPLVFDDHGTCNMIIDNTFALTLSCDYARERLLLIGLLEPHKDIPQ 65

Query: 61 HYWQACLQLNLQLAEQGKGCICYEPQADALLYLTFIPMP---ASALQLREFLGDLADTYQ 117
Q L L + + ++ LY + +P S L+ + L + +
Sbjct: 66 ---QCLLAGALNPLLNAGPGLGLDEKSG--LYHAYQSIPREKLSVPTLKREMAGLLEWMR 120

Query: 118 AL 119

Sbjct: 121 GW 122


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_1973  FLGMOTORFLIM  29     0.026    Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 28.7 bits (64), Expect = 0.026
Identities = 11/42 (26%), Positives = 21/42 (50%)

Query: 224 SLLPKMDAIQSPLIAEIGRVSLSLAKLGAMMAGDKLTLAVTL 265
L K+ + ++AE+G + LS+ + + GD + L T
Sbjct: 249 VLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTH 290


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_1974  TYPE3IMPPROT  209    7e-71    Type III secretion system inner membrane P protein ...
		>TYPE3IMPPROT#Type III secretion system inner membrane P protein family signature.

Length = 224

Score = 209 bits (534), Expect = 7e-71
Identities = 80/215 (37%), Positives = 129/215 (60%), Gaps = 7/215 (3%)

Query: 8 VQLIIMLFCLSLLPLFAVMGTSFLKLAIVFSMLRNALGIQQIPPNMAIYGLALILTLFTM 67
+ LI +L +LLP GT F+K +IVF M+RNALG+QQIP NM + G+AL+L++F M
Sbjct: 5 ISLIALLAFSTLLPFIIASGTCFVKFSIVFVMVRNALGLQQIPSNMTLNGVALLLSMFVM 64

Query: 68 APVGMAINDNLKATPIVFDAPNVFEQINTEAIAPYRAFLDKNTSNTQIEFFANIGHKVWP 127
P+ + + F+ + + E + YR +L K + ++FF N K
Sbjct: 65 WPIMHDAYVYFEDEDVTFNDISSLSKHVDEGLDGYRDYLIKYSDRELVQFFENAQLKRQY 124

Query: 128 EKYQQV-------LTKDSLLVMVPAFTMSQLIEAFKIGLLIYLPFVAIDLIVSNILLAMG 180
+ + + K S+ ++PA+ +S++ AFKIG +YLPFV +DL+VS++LLA+G
Sbjct: 125 GEETETVKRDKDEIEKPSIFALLPAYALSEIKSAFKIGFYLYLPFVVVDLVVSSVLLALG 184

Query: 181 MMMVSPMTIALPFKLLIFILMGGWEKLISQLMMSF 215
MMM+SP+TI+ P KL++F+ + GW L L++ +
Sbjct: 185 MMMMSPVTISTPIKLVLFVALDGWTLLSKGLILQY 219


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_1975  TYPE3IMQPROT  71     3e-20    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 70.9 bits (174), Expect = 3e-20
Identities = 33/83 (39%), Positives = 50/83 (60%)

Query: 6 IVHFTSELLWMVLLLSLPVVIVASVVGVLVSLIQALTQIQDQTLQFLIKLIAVCVTLVVC 65
+V ++ L++VL+LS IVA+++G+LV L Q +TQ+Q+QTL F IKL+ VC+ L +
Sbjct: 4 LVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFLL 63

Query: 66 YHWMGSSLLNYASMAFDQISQMG 88
W G LL+Y G
Sbjct: 64 SGWYGEVLLSYGRQVIFLALAKG 86


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_1976  TYPE3IMRPROT  126    4e-37    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.

Length = 261

Score = 126 bits (317), Expect = 4e-37
Identities = 45/238 (18%), Positives = 105/238 (44%), Gaps = 5/238 (2%)

Query: 1 MTTLLPNLLTAQLPVLALCMMRPLGMMLLLPLFKGGAMGSALIRNSLILMFALPTVLAMD 60
M + + L + ++R L ++ P+ ++ ++ L +M +
Sbjct: 1 MLQVTSEQWLSWLNLYFWPLLRVLALISTAPILSERSVPK-RVKLGLAMMITFA-IAPSL 58

Query: 61 EMQPILQQADTWMLISLFGKEMIVGVLLGFCAAIPFWAIDMAGFVIDTMRGASMSTVLNP 120
+ + + +++ +++++G+ LGF F A+ AG +I G S +T ++P
Sbjct: 59 PANDVPVFSFFALWLAV--QQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDP 116

Query: 121 LMGLQSSIYGMLFTQVLTVLFLVSGGFNFLLTALYQSYQQLPPGFNLTLAQPLMVFIAHE 180
L + + + +LFL G +L++ L ++ LP G + +
Sbjct: 117 ASHLNMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAG 176

Query: 181 WQLMCQLCLSFAMPAMVIMILVDVALGLVNRSAQQLNVFFLSMPIKSALVLLLLIYSL 238
+ L A+P + +++ +++ALGL+NR A QL++F + P+ + + L+ +
Sbjct: 177 SLIF-LNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALM 233


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_1977  TYPE3IMSPROT  356    e-125    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 356 bits (916), Expect = e-125
Identities = 119/348 (34%), Positives = 190/348 (54%)

Query: 2 AEKTEKPTEKRLREARNRGQVIKSAEIVTGLQMAIILGYFLYEGPALVQAMMALIDLTIN 61
EKTE+PT K++R+AR +GQV KS E+V+ + + + + L+ +
Sbjct: 3 GEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPAE 62

Query: 62 AINLPLETAAEQIVGTFAMLALRFLGGLTLVLVFTIVVGNSVQTGPVWATESIMPSMNKL 121
LP A +V + L V + + VQ G + + E+I P + K+
Sbjct: 63 QSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIKKI 122

Query: 122 NVMNNAKQLISLKSLFELAKNLVKVTVLSLVFYYLLHRYVNAFQYLPLCEEACGISVIST 181
N + AK++ S+KSL E K+++KV +LS++ + ++ + LP C C ++
Sbjct: 123 NPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLLGQ 182

Query: 182 MITWLWGSFLGCYLIFGIADYAFQRYSLMKELKMSKDDTKQEYKDSEGNPEMKQKRRETQ 241
++ L +++ IADYAF+ Y +KELKMSKD+ K+EYK+ EG+PE+K KRR+
Sbjct: 183 ILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQFH 242

Query: 242 REVASGSLANNVRKATVVVRNPTHIAVCLYYCEGETPLPKVLEKAEDHMALHIVALAEKA 301
+E+ S ++ NV++++VVV NPTHIA+ + Y GETPLP V K D + +AE+
Sbjct: 243 QEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAEEE 302

Query: 302 GVPIVENIPLARALFKHVETGDVIPESLFEPVAELLRLVMAISYDNMK 349
GVPI++ IPLARAL+ IP E AE+LR + + +
Sbjct: 303 GVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQNIEKQH 350


26  Sbal_2090  Sbal_2108  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Sbal_2090  0       23       -4.224718   transposase IS3/IS911 family protein
Sbal_2091  -1      23       -4.709076   integrase catalytic subunit
Sbal_2092  0       20       -4.682136   phage integrase family protein
Sbal_2093  0       19       -4.877509   integrase catalytic subunit
Sbal_2094  -1      17       -4.388695   IstB ATP binding domain-containing protein
Sbal_2095  1       17       -3.184477   hypothetical protein
Sbal_2096  3       17       -2.170683   hypothetical protein
Sbal_2097  2       14       -2.373257   ATP-dependent DNA helicase RecQ
Sbal_2098  3       11       -2.301582   hypothetical protein
Sbal_2099  1       11       -0.149631   ecotin
Sbal_2100  2       12       -0.394159   cation diffusion facilitator family transporter
Sbal_2101  0       11       -0.920877   MarR family transcriptional regulator
Sbal_2103  0       9        -1.923042   hypothetical protein
Sbal_2104  0       12       -1.428144   diguanylate cyclase
Sbal_2105  1       14       -2.695589   CoA-binding domain-containing protein
Sbal_2106  1       24       -5.942638   methyl-accepting chemotaxis sensory transducer
Sbal_2107  0       19       -4.993173   SpoIIAA family protein
Sbal_2108  1       18       -4.489671   response regulator receiver protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_2092  PF05272  31     0.011    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.8 bits (69), Expect = 0.011
Identities = 11/49 (22%), Positives = 19/49 (38%), Gaps = 5/49 (10%)

Query: 194 KAIIQLGLQGGFRRSELADIKVQYVSFL-RNKLKVRLPYSKSNQQGQRE 241
+L FRR++ +K +F K + R Y + Q R+
Sbjct: 642 IVAYELSEMTAFRRADAEAVK----AFFSSRKDRYRGAYGRYVQDHPRQ 686


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Sbal_2108  HTHFIS  71     1e-16    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 71.4 bits (175), Expect = 1e-16
Identities = 26/104 (25%), Positives = 49/104 (47%)

Query: 11 ILVIDDDVMMSQTISDFIHGKGYHVIVCNNLEEAFSELSQYKIDLILINFWQPDGTALIL 70
ILV DDD + ++ + GY V + +N + ++ DL++ + PD A L
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 71 LEHLNEEKITTPVIVISNTKEHQSVLECFRMGVLDFVVKPINLE 114
L + + + PV+V+S + ++ G D++ KP +L
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLT 109


27  Sbal_2121  Sbal_2150  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Sbal_2121  2       12       -2.817917   hypothetical protein
Sbal_2122  1       13       -2.553791   ABC transporter-like protein
Sbal_2123  0       15       -3.654205   hypothetical protein
Sbal_2125  0       17       -3.692766   helix-turn-helix domain-containing protein
Sbal_2126  1       18       -3.228550   XRE family transcriptional regulator
Sbal_2127  2       18       -2.233140   response regulator receiver modulated
Sbal_2128  1       17       -1.869528   hypothetical protein
Sbal_2129  0       16       -1.811399   integral membrane sensor hybrid histidine
Sbal_2130  0       14       -1.699083   PAS/PAC sensor-containing diguanylate cyclase
Sbal_2131  1       14       -2.337034   hypothetical protein
Sbal_2132  1       16       -2.823629   hypothetical protein
Sbal_2134  0       17       -3.885677   PepSY-associated TM helix domain-containing
Sbal_2135  0       19       -4.167487   hypothetical protein
Sbal_2136  -1      18       -4.317153   TonB-dependent siderophore receptor
Sbal_2137  0       24       -5.374203   hypothetical protein
Sbal_2138  1       25       -5.035956   integrase catalytic subunit
Sbal_2139  -2      21       -4.095463   CsbD family protein
Sbal_2140  -1      19       -3.362933   hypothetical protein
Sbal_2141  -2      18       -3.394440   IstB ATP binding domain-containing protein
Sbal_2142  0       17       -2.974544   integrase catalytic subunit
Sbal_2143  0       10       -1.357928   hypothetical protein
Sbal_2144  -1      14       -1.338706   **MATE efflux family protein
Sbal_2145  2       22       -2.248803   riboflavin synthase subunit alpha
Sbal_2146  3       31       -2.296890   hypothetical protein
Sbal_2147  2       33       -1.740072   hypothetical protein
Sbal_2148  1       27       -1.715915   threonyl-tRNA synthetase
Sbal_2149  1       29       -1.918286   translation initiation factor IF-3
Sbal_2150  2       26       -2.263786   50S ribosomal protein L35
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Sbal_2127  HTHFIS  49     2e-08    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 48.7 bits (116), Expect = 2e-08
Identities = 23/163 (14%), Positives = 53/163 (32%), Gaps = 23/163 (14%)

Query: 3 ILIIDDQRFIRETMKADIHKFLVDDNVQIYEACNGNQGIELIIDIEVVLDLIIIDLKMDQ 62
IL+ DD IR + + L + N I + DL++ D+ M
Sbjct: 6 ILVADDDAAIRTVLN----QALSRAGYDVRITSNAATLWRWIAAGDG--DLVVTDVVMPD 59

Query: 63 GDGIAVINMLASEPRFAAIPIAVISS-SDKRTLELVDNIITVLRLNLLGVFAKPINVNDI 121
+ ++ + +P+ V+S+ + I KP ++ ++
Sbjct: 60 ENAFDLLPRIKK--ARPDLPVLVMSAQNT------FMTAIKASEKGAYDYLPKPFDLTEL 111

Query: 122 LTPLLEASTDR--------HHAHQHQSIVQNRAANINVRQLIK 156
+ + A + + +V AA + +++
Sbjct: 112 IGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLA 154


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Sbal_2129  HTHFIS  54     2e-09    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 53.7 bits (129), Expect = 2e-09
Identities = 16/118 (13%), Positives = 38/118 (32%), Gaps = 11/118 (9%)

Query: 526 DQQKVLIIDDNLFNLEICRAMLEHYHFQTFSTDNTEQALKMLVKHLPQIVIVDYRLQEMN 585
+L+ DD+ + L + T N + + +V+ D + + N
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 586 GLQLVRQMQQVLQNLVSDSPIEHHCRFFLLSA-NDCDDIPELASFPEVHFMQKPFSAE 642
L+ ++++ ++SA N + + ++ KPF
Sbjct: 62 AFDLLPRIKK----------ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLT 109


28  Sbal_2169  Sbal_2201  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Sbal_2169  -1      21       -3.191350   *hypothetical protein
Sbal_2170  0       23       -3.116807   multi anti extrusion protein MatE
Sbal_2172  1       28       -4.569256   permease
Sbal_2173  1       30       -6.485601   resolvase domain-containing protein
Sbal_2174  0       27       -6.057611   transposase Tn3 family protein
Sbal_2175  3       36       -10.053845  ABC transporter-like protein
Sbal_2176  2       30       -7.209703   endo-1,4-beta-glucanase
Sbal_2177  1       28       -8.024910   hypothetical protein
Sbal_2178  0       25       -3.092861   secretion protein HlyD family protein
Sbal_2179  0       22       -0.442282   transposase, IS4 family protein
Sbal_2180  1       25       -1.585056   resolvase domain-containing protein
Sbal_2181  1       26       -1.594978   IstB ATP binding domain-containing protein
Sbal_2182  2       25       -1.557397   hypothetical protein
Sbal_2183  1       26       -1.787503   transposase Tn3 family protein
Sbal_2184  2       37       -5.225383   hypothetical protein
Sbal_2185  3       36       -5.414041   hypothetical protein
Sbal_2186  3       33       -4.944148   hypothetical protein
Sbal_2187  3       29       -3.824555   hypothetical protein
Sbal_2188  3       31       -5.038964   hypothetical protein
Sbal_2189  3       34       -4.405797   zonular occludens toxin
Sbal_2190  5       37       -5.465243   hypothetical protein
Sbal_2191  4       39       -6.588083   RstR-like phage repressor protein
Sbal_2192  2       34       -5.883621   phage/plasmid replication protein
Sbal_2193  2       38       -6.931236   hypothetical protein
Sbal_2194  1       31       -5.603042   hypothetical protein
Sbal_2195  0       30       -5.746048   triple helix repeat-containing collagen
Sbal_2196  -1      20       -4.312701   zonular occludens toxin
Sbal_2197  0       17       -2.852625   phage integrase family protein
Sbal_2198  0       20       -2.923990   hypothetical protein
Sbal_2199  1       22       -2.563157   response regulator receiver modulated metal
Sbal_2200  3       27       -2.274743   hypothetical protein
Sbal_2201  2       15       -0.130167   cbb3-type cytochrome c oxidase subunit I
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_2176  cloacin  34     2e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 33.5 bits (76), Expect = 2e-04
Identities = 17/49 (34%), Positives = 24/49 (48%)

Query: 70 GVSAACNNRPGYYGSSGSSGGSSGSSAGYWPSVGSGSGSGGGGSGAGNG 118
GV ++ G+ + GG SGS + G G+G G G SG G+G
Sbjct: 28 GVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSG 76



Score = 27.4 bits (60), Expect = 0.021
Identities = 23/65 (35%), Positives = 26/65 (40%), Gaps = 5/65 (7%)

Query: 55 GMTPMGAAAGAFVTAGVSAACNNRPGYYGSSGSSGGSSGSSAGYWPSVGSGSGSGGGGSG 114
G T +G GA +G S+ N G GS GG SG G G SGGG
Sbjct: 23 GPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG-----GGNGNSGGGSGT 77

Query: 115 AGNGK 119
GN
Sbjct: 78 GGNLS 82


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Sbal_2178  RTXTOXIND  161    9e-47    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 161 bits (408), Expect = 9e-47
Identities = 84/418 (20%), Positives = 160/418 (38%), Gaps = 61/418 (14%)

Query: 39 IVIFTLFFLFYSDYSKKEKV---KGYLVLTNGLARVYSHTNGVVSDISVKEGDVVTKGDT 95
I+ F + S + E V G L + + N +V +I VKEG+ V KGD
Sbjct: 64 IMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDV 123

Query: 96 LLSV----SNDKYLRNTLSSDNEKIKEIDKQIVLV----------------------EGQ 129
LL + + L+ S ++++ QI+ E +
Sbjct: 124 LLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEE 183

Query: 130 LLQYNSLFHERESR-------LNSIIRFLEQEQSELVLQGKLIRNRVDLAKERLADIKTL 182
+L+ SL E+ S + E+ ++ + N + K RL D +L
Sbjct: 184 VLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSL 243

Query: 183 HANDYASQSEVKTQLDLVLDFQQRIQEY-------NTVVLKSETNLANALNDKAKLPFER 235
++ V Q + ++ ++ Y + +L ++ ++
Sbjct: 244 LHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDK 303

Query: 236 ----QQQIDQLNIELSKLHNNKVAIQENSNLVLKAPISGVVSSIK-FNIGEFAPSGEYLV 290
I L +EL+K + V++AP+S V +K G + E L+
Sbjct: 304 LRQTTDNIGLLTLELAK------NEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLM 357

Query: 291 TIIPTDSKLEAEVFIPTRAIAFVKKGDEVNLKLDAFPFQKFGSVHGRVSHVSMNIIFSAE 350
I+P D LE + + I F+ G +K++AFP+ ++G + G+V ++ + +
Sbjct: 358 VIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNI------NLD 411

Query: 351 TASKLSFSEPVYRVKVDLTKQYIKAYGNETSLIPGMLLQADINTGSRTLVEWLLEPLF 408
V+ V + + + + L GM + A+I TG R+++ +LL PL
Sbjct: 412 AIED-QRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVISYLLSPLE 468


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Sbal_2195  RTXTOXINA  45     9e-07    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 45.0 bits (106), Expect = 9e-07
Identities = 31/100 (31%), Positives = 44/100 (44%)

Query: 379 KGKQGEQGVAGINGLDGKDGNDGKDGIDGINGVDGVDGVDGLAGINGIDGKDGIDGIDGI 438
G + G +G D +GNDG D + G G D + G +G + G DG D + G+ G
Sbjct: 732 FGSKFTDIFHGADGDDLIEGNDGNDRLYGDKGNDTLSGGNGDDQLYGGDGNDKLIGVAGN 791

Query: 439 NGIDGKDGKDGKDGAQGIQGIQGLQGIAGIAGLDGKDGEN 478
N ++G DG D L G G L G +G +
Sbjct: 792 NYLNGGDGDDEFQVQGNSLAKNVLFGGKGNDKLYGSEGAD 831



Score = 44.6 bits (105), Expect = 1e-06
Identities = 32/100 (32%), Positives = 48/100 (48%), Gaps = 3/100 (3%)

Query: 383 GEQGVAGINGLDGKD---GNDGKDGIDGINGVDGVDGVDGLAGINGIDGKDGIDGIDGIN 439
G G I G DG D G+ G D + G NG D + G DG + G+ G + ++G DG +
Sbjct: 742 GADGDDLIEGNDGNDRLYGDKGNDTLSGGNGDDQLYGGDGNDKLIGVAGNNYLNGGDGDD 801

Query: 440 GIDGKDGKDGKDGAQGIQGIQGLQGIAGIAGLDGKDGENV 479
+ K+ G +G L G G LDG +G+++
Sbjct: 802 EFQVQGNSLAKNVLFGGKGNDKLYGSEGADLLDGGEGDDL 841



Score = 43.8 bits (103), Expect = 2e-06
Identities = 30/103 (29%), Positives = 42/103 (40%)

Query: 377 NLKGKQGEQGVAGINGLDGKDGNDGKDGIDGINGVDGVDGVDGLAGINGIDGKDGIDGID 436
++G G + G G D G +G D + G +G D + GV G +NG DG D
Sbjct: 748 LIEGNDGNDRLYGDKGNDTLSGGNGDDQLYGGDGNDKLIGVAGNNYLNGGDGDDEFQVQG 807

Query: 437 GINGIDGKDGKDGKDGAQGIQGIQGLQGIAGIAGLDGKDGENV 479
+ G G D G +G L G G L G G ++
Sbjct: 808 NSLAKNVLFGGKGNDKLYGSEGADLLDGGEGDDLLKGGYGNDI 850



Score = 39.6 bits (92), Expect = 4e-05
Identities = 29/91 (31%), Positives = 37/91 (40%)

Query: 377 NLKGKQGEQGVAGINGLDGKDGNDGKDGIDGINGVDGVDGVDGLAGINGIDGKDGIDGID 436
NL V + G D G D +G DG D ++G G + + G G D +
Sbjct: 709 NLTETDNLYSVEELIGTTRADKFFGSKFTDIFHGADGDDLIEGNDGNDRLYGDKGNDTLS 768

Query: 437 GINGIDGKDGKDGKDGAQGIQGIQGLQGIAG 467
G NG D G DG D G+ G L G G
Sbjct: 769 GGNGDDQLYGGDGNDKLIGVAGNNYLNGGDG 799



Score = 37.6 bits (87), Expect = 2e-04
Identities = 25/95 (26%), Positives = 38/95 (40%)

Query: 385 QGVAGINGLDGKDGNDGKDGIDGINGVDGVDGVDGLAGINGIDGKDGIDGIDGINGIDGK 444
+ + G D G+ D G +G D ++G DG + G G D + G +G + + G
Sbjct: 720 EELIGTTRADKFFGSKFTDIFHGADGDDLIEGNDGNDRLYGDKGNDTLSGGNGDDQLYGG 779

Query: 445 DGKDGKDGAQGIQGIQGLQGIAGIAGLDGKDGENV 479
DG D G G + G G +NV
Sbjct: 780 DGNDKLIGVAGNNYLNGGDGDDEFQVQGNSLAKNV 814



Score = 34.6 bits (79), Expect = 0.001
Identities = 27/105 (25%), Positives = 37/105 (35%)

Query: 163 GSGVENNFSFKGDKGNKGDKGDRGEKGEQGEQGEKGEKGEKGEIGETGENGVDGVNGLDG 222
G + G+ GDKG+ G G+ G G IG G N ++G +G D
Sbjct: 743 ADGDDLIEGNDGNDRLYGDKGNDTLSGGNGDDQLYGGDGNDKLIGVAGNNYLNGGDGDDE 802

Query: 223 LNGKDGLNGSTGATGATGATGATGATGATGLQGERGLQGERGQIG 267
+ G G G+ GA L G G +G G
Sbjct: 803 FQVQGNSLAKNVLFGGKGNDKLYGSEGADLLDGGEGDDLLKGGYG 847



Score = 30.3 bits (68), Expect = 0.027
Identities = 27/97 (27%), Positives = 38/97 (39%), Gaps = 6/97 (6%)

Query: 389 GINGLDGKDGNDGKDGIDGINGVDGVDGVDGLAGINGIDGKDGIDGIDGINGIDGKDGKD 448
L D + + G D G +G DG D I+G DG + + G G D
Sbjct: 706 NGKNLTETDNLYSVEELIGTTRADKFFGSKFTDIFHGADGDDLIEGNDGNDRLYGDKGND 765

Query: 449 GKDGAQGIQGIQG------LQGIAGIAGLDGKDGENV 479
G G + G L G+AG L+G DG++
Sbjct: 766 TLSGGNGDDQLYGGDGNDKLIGVAGNNYLNGGDGDDE 802



Score = 30.3 bits (68), Expect = 0.029
Identities = 36/129 (27%), Positives = 44/129 (34%), Gaps = 3/129 (2%)

Query: 165 GVENNFSFKGDKGNKGDKGDRGEKGEQGEQGEKGEKGEKGEIGETGENGVDGVNGLDGLN 224
G F G K G G+ +G G G+KG +G NG D L G +
Sbjct: 724 GTTRADKFFGSKFTDIFHGADGDDLIEGNDGNDRLYGDKGNDTLSGGNGDDQ---LYGGD 780

Query: 225 GKDGLNGSTGATGATGATGATGATGATGLQGERGLQGERGQIGLTGERGLTGERGEIGLT 284
G D L G G G G + L G +G L G G G G
Sbjct: 781 GNDKLIGVAGNNYLNGGDGDDEFQVQGNSLAKNVLFGGKGNDKLYGSEGADLLDGGEGDD 840

Query: 285 GLQGLAGLD 293
L+G G D
Sbjct: 841 LLKGGYGND 849


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Sbal_2199  HTHFIS  49     4e-08    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 48.7 bits (116), Expect = 4e-08
Identities = 22/90 (24%), Positives = 36/90 (40%), Gaps = 9/90 (10%)

Query: 28 KVLVVDDEPDVHTVTKLALSRFKLDGRALTFINAYSAEQAKELLNQERDIAIAFIDVVME 87
+LV DD+ + TV ALSR D R +A + D + DVVM
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRI-----TSNAATLWRWI-AAGDGDLVVTDVVMP 58

Query: 88 SDHAGLELVKWIREELQNKTTRLILRTGQP 117
D +L+ I++ +++ + Q
Sbjct: 59 -DENAFDLLPRIKKA--RPDLPVLVMSAQN 85


29  Sbal_2291  Sbal_2298  Y  N  N  Genomic Island
LocusTag   DNBias  CDNBias  %GCBias     Product
Sbal_2291  3       17       -1.417846   glyoxalase/bleomycin resistance
Sbal_2292  4       17       -1.038207   hypothetical protein
Sbal_2294  4       17       -1.158991   thiamine pyrophosphate binding domain-containing
Sbal_2295  3       17       -0.770079   FAD dependent oxidoreductase
Sbal_2296  1       16       -1.121147   hypothetical protein
Sbal_2297  2       16       -1.669879   glycine betaine/L-proline ABC transporter
Sbal_2298  2       17       -1.837540   binding-protein-dependent transport system inner
30  Sbal_2310  Sbal_2321  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Sbal_2310  2       25       -3.356211   hypothetical protein
Sbal_2311  2       25       -3.316202   hypothetical protein
Sbal_2312  2       25       -3.084847   phage integrase family protein
Sbal_2313  2       27       -3.349635   metal dependent phosphohydrolase
Sbal_2314  1       23       -5.986901   phage integrase family protein
Sbal_2315  2       24       -7.781823   hypothetical protein
Sbal_2316  2       23       -7.606956   Cro/CI family transcriptional regulator
Sbal_2317  0       26       -7.071628   RNA-directed DNA polymerase
Sbal_2319  0       25       -6.915888   hypothetical protein
Sbal_2320  0       24       -6.470051   CDP-glycerol:poly(glycerophosphate)
Sbal_2321  0       19       -4.017263   cytidyltransferase-like protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_2315  ACRIFLAVINRP  24     0.047    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 24.4 bits (53), Expect = 0.047
Identities = 16/67 (23%), Positives = 28/67 (41%), Gaps = 6/67 (8%)

Query: 5 YLTTEELSARIRYDARTIRQCLKDAVLFEGVHYIRPFGGRKILYIWERVEESMLLGASAH 64
T +++S Y A ++ L GV ++ FG + + IW + +
Sbjct: 148 GTTQDDIS---DYVASNVKDTLSRL---NGVGDVQLFGAQYAMRIWLDADLLNKYKLTPV 201

Query: 65 DLINQLN 71
D+INQL
Sbjct: 202 DVINQLK 208


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_2321  LPSBIOSNTHSS  41     4e-07    Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature.

Length = 166

Score = 40.6 bits (95), Expect = 4e-07
Identities = 15/57 (26%), Positives = 29/57 (50%), Gaps = 5/57 (8%)

Query: 3 SIITYGTYDLFHYGHVRLFQRLKAMGDKLIVGVSTDEFNSLKGKVAFFNYQQRIEMI 59
+ I G++D +GH+ + +R + D++ V V + K F+ Q+R+E I
Sbjct: 2 NAIYPGSFDPITFGHLDIIERGCRLFDQVYVAVLRN-----PNKQPMFSVQERLEQI 53


31  Sbal_2331  Sbal_2342  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Sbal_2331  2       18       -3.079756   alkylhydroperoxidase
Sbal_2332  0       17       -2.825347   hypothetical protein
Sbal_2333  1       18       -3.413472   transposase IS3/IS911 family protein
Sbal_2334  1       18       -3.759402   integrase catalytic subunit
Sbal_2335  1       17       -3.996436   FHA domain-containing protein
Sbal_2336  1       14       -3.483618   protein kinase
Sbal_2337  2       13       -0.886669   hypothetical protein
Sbal_2338  3       15       -0.880063   hypothetical protein
Sbal_2339  2       16       0.013832    hypothetical protein
Sbal_2340  2       19       0.768838    hypothetical protein
Sbal_2341  2       22       1.230318    hypothetical protein
Sbal_2342  2       16       1.146142    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_2336  YERSSTKINASE  42     1e-05    Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 41.6 bits (97), Expect = 1e-05
Identities = 48/189 (25%), Positives = 85/189 (44%), Gaps = 32/189 (16%)

Query: 131 HPNIVSVFDVD----SDQDRYFIVMEYLDG----ESLDQIIRRYKPKGLSLTAAMKLLEQ 182
HPN+ +V + ++ ++M+ +DG ++L + +K ++ A ++
Sbjct: 190 HPNLANVHGMAVVPYGNRKEEALLMDEVDGWRCSDTLRTLADSWKQGKINSEAYWGTIKF 249

Query: 183 IA----DALNYAHKRGIVHADLKPANIMVDR-HGHIKILDFGVAHRMQLNYDAYAAAPLS 237
IA D N+ K G+VH D+KP N++ DR G ++D G+ R S
Sbjct: 250 IAHRLLDVTNHLAKAGVVHNDIKPGNVVFDRASGEPVVIDLGLHSR-------------S 296

Query: 238 QNSPINGFTPAYASTDL-LAEKTPAVSDDVFSFACVIYELLTSKHPFDRMPADKA-QAQH 295
P GFT ++ + +L + + DVF V+ LL F++ P K Q
Sbjct: 297 GEQP-KGFTESFKAPELGVGNLGASEKSDVF---LVVSTLLHCIEGFEKNPEIKPNQGLR 352

Query: 296 KIASKPSHL 304
I S+P+H+
Sbjct: 353 FITSEPAHV 361


32  Sbal_2389  Sbal_2394  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Sbal_2389  2       18       1.570513    nucleoside diphosphate kinase
Sbal_2390  2       18       1.469958    hypothetical protein
Sbal_2391  2       17       1.428859    alpha-L-glutamate ligase
Sbal_2392  3       18       0.897853    ferredoxin, 2Fe-2S type, ISC system
Sbal_2393  2       18       0.597602    chaperone protein HscA
Sbal_2394  2       17       -0.554140   co-chaperone HscB
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_2393  SHAPEPROTEIN  110    2e-28    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 110 bits (276), Expect = 2e-28
Identities = 72/368 (19%), Positives = 138/368 (37%), Gaps = 62/368 (16%)

Query: 22 VGIDLGTTNSLVAAVRSGETATLPDELGQHSLPSIVRYTQDSVEVGALAALSSAQDPQNT 81
+ IDLGT N+L+ G + PS+V A +
Sbjct: 13 LSIDLGTANTLIYVKGQGIVL---------NEPSVV------------AIRQDRAGSPKS 51

Query: 82 IVSV----KRFMGRSLADIKAGEQSFPYEFAESENGLPLFVTP--QGQVNPVQVSAEILR 135
+ +V K+ +GR+ +I A + P G + V+ ++L+
Sbjct: 52 VAAVGHDAKQMLGRTPGNIAA-------------------IRPMKDGVIADFFVTEKMLQ 92

Query: 136 PLIARA-EKTLGGELQGVVITVPAYFDDAQRQGTKDAAALLGVKVLRLLNEPTAAAIAYG 194
I + + V++ VP +R+ +++A G + + L+ EP AAAI G
Sbjct: 93 HFIKQVHSNSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGAG 152

Query: 195 LDSKQEGVIAIYDLGGGTFDISILRLNRGVFEVLATGGDSALGGDDFDHLLQAHMQQVWQ 254
L + + D+GGGT +++++ LN V +GGD FD + ++++ +
Sbjct: 153 LPVSEATGSMVVDIGGGTTEVAVISLNGVV-----YSSSVRIGGDRFDEAIINYVRRNYG 207

Query: 255 LSDIDSQLSRQLLIESRRVKEALTDAAETEAKVI---LADGTELTQIVSKAEFDAMIAAL 311
I + ++ E + A E +V LA+G ++ E +
Sbjct: 208 SL-IGEATAERIKHE---IGSAYPGDEVREIEVRGRNLAEGVPRGFTLNSNEILEALQEP 263

Query: 312 VKKTIASCRRTLRD-AGVTTDEVLE--TVMVGGSTRVPLVREQVEAFFGKPPLTSIDPDR 368
+ +++ L ++ E V+ GG + + + G P + + DP
Sbjct: 264 LTGIVSAVMVALEQCPPELASDISERGMVLTGGGALLRNLDRLLMEETGIPVVVAEDPLT 323

Query: 369 VVAIGAAI 376
VA G
Sbjct: 324 CVARGGGK 331


33  Sbal_2416  Sbal_2433  Y  N  N  Genomic Island
LocusTag   DNBias  CDNBias  %GCBias     Product
Sbal_2416  1       17       3.316982    beta-hexosaminidase
Sbal_2417  0       17       2.750334    L-serine dehydratase 1
Sbal_2418  0       18       2.992712    S-formylglutathione hydrolase
Sbal_2419  1       18       2.736709    alcohol dehydrogenase
Sbal_2420  1       19       2.544653    LysR family transcriptional regulator
Sbal_2421  2       18       3.011460    FAD dependent oxidoreductase
Sbal_2422  2       19       3.002951    hypothetical protein
Sbal_2423  2       20       3.291225    TonB-dependent receptor
Sbal_2424  3       19       3.261895    helicase c2
Sbal_2425  3       19       3.553266    ATP phosphoribosyltransferase
Sbal_2426  3       19       3.875869    histidinol dehydrogenase
Sbal_2427  2       19       3.040945    histidinol-phosphate aminotransferase
Sbal_2428  0       18       2.927656    imidazole glycerol-phosphate
Sbal_2429  3       14       2.129169    imidazole glycerol phosphate synthase subunit
Sbal_2430  2       16       1.933554    1-(5-phosphoribosyl)-5-[(5-
Sbal_2431  3       17       1.783304    imidazole glycerol phosphate synthase subunit
Sbal_2432  3       16       0.789977    bifunctional phosphoribosyl-AMP
Sbal_2433  3       15       0.384516    aromatic amino acid transporter
34  Sbal_2509  Sbal_2516  Y  N  N  Genomic Island
LocusTag   DNBias  CDNBias  %GCBias     Product
Sbal_2509  2       29       -0.370004   ferric uptake regulator
Sbal_2510  3       30       0.343888    N-acetyltransferase GCN5
Sbal_2511  4       33       0.049510    GreA/GreB family elongation factor
Sbal_2512  4       35       0.201702    succinyl-CoA synthetase subunit alpha
Sbal_2513  4       32       0.059272    succinyl-CoA synthetase subunit beta
Sbal_2514  5       30       0.134368    2-oxoglutarate dehydrogenase, E2 subunit,
Sbal_2515  5       31       -0.227777   2-oxoglutarate dehydrogenase E1 component
Sbal_2516  3       27       -0.758200   succinate dehydrogenase iron-sulfur subunit
35  Sbal_2641  Sbal_2668  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Sbal_2641  0       20       -3.217943   NAD-dependent epimerase/dehydratase
Sbal_2642  1       21       -4.306436   metal dependent phosphohydrolase
Sbal_2643  1       23       -5.472817   hypothetical protein
Sbal_2644  3       24       -5.945429   hypothetical protein
Sbal_2645  1       21       -4.842269   integrase catalytic subunit
Sbal_2646  2       25       -6.395023   integrase catalytic subunit
Sbal_2647  5       28       -8.016874   IstB ATP binding domain-containing protein
Sbal_2649  5       28       -7.710485   DEAD/DEAH box helicase
Sbal_2650  3       23       -5.507860   transposase IS3/IS911 family protein
Sbal_2651  3       17       -2.436240   integrase catalytic subunit
Sbal_2652  3       14       -2.233423   DEAD/DEAH box helicase
Sbal_2653  4       13       -1.655975   DNA ligase (NAD(+))
Sbal_2654  3       14       0.677690    DNA ligase (NAD(+))
Sbal_2655  3       13       0.617269    cell division protein ZipA
Sbal_2656  1       11       0.503021    chromosome segregation protein SMC
Sbal_2657  1       11       -0.129587   putative sulfate transport protein CysZ
Sbal_2659  1       13       -0.213789   transposase IS3/IS911 family protein
Sbal_2660  1       15       -0.631999   integrase catalytic subunit
Sbal_2662  -1      15       -1.733924   RDD domain-containing protein
Sbal_2663  0       25       -3.862349   cysteine synthase A
Sbal_2665  1       24       -3.883209   C4-dicarboxylate transporter/malic acid
Sbal_2666  0       27       -3.674490   hypothetical protein
Sbal_2667  1       24       -3.624750   IstB ATP binding domain-containing protein
Sbal_2668  2       24       -3.441490   integrase catalytic subunit
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Sbal_2650  HTHFIS  26     0.043    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 25.6 bits (56), Expect = 0.043
Identities = 10/59 (16%), Positives = 21/59 (35%), Gaps = 6/59 (10%)

Query: 7 HKSYPQAFKDEAVLMVLEQ-GYSVADAAKSLGVSTSLLYNWKEKHEALKQGITLEESER 64
+ + +L L + AA LG++ + L + G+++ S R
Sbjct: 429 YDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIREL-----GVSVYRSSR 482


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
Sbal_2656  GPOSANCHOR  40     4e-05    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 40.4 bits (94), Expect = 4e-05
Identities = 37/269 (13%), Positives = 89/269 (33%), Gaps = 2/269 (0%)

Query: 227 KTHAELLVMRYQELQDQMASLSEQIRAIEVQQAAAQSLAQTDELQTTELQVQLAQLAEQE 286
K L + L+D L+E++ + + + EL+ + A L +
Sbjct: 70 KLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKAL 129

Query: 287 QRAVEAYYLTGTEIAKLEQQLQNQKQRDAQLETQLAQVKEQILQNTDKLNGYKASLSALE 346
+ A+ +I LE + R A LE L ++ K+ +A +ALE
Sbjct: 130 EGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALE 189

Query: 347 LELAKLGPQHDEQQEIMDELQSQWEMSIERSQQQAEVARQQAVDVSQHKLQLELRRSQLA 406
A+L + ++ + A + +++
Sbjct: 190 ARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIK 249

Query: 407 HQQQLLLHKQQQSSEQQAQLIALVDSDLASNITPLQQEIASLEDATRLQTEINHQQEQLV 466
+ + + + + + ++ ++ A + ++ Q Q++
Sbjct: 250 TLEA-EKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLE-HQSQVL 307

Query: 467 SASTQALDDARQSAEHIQQQLTATKARHE 495
+A+ Q+L ++ ++QL A + E
Sbjct: 308 NANRQSLRRDLDASREAKKQLEAEHQKLE 336



Score = 37.7 bits (87), Expect = 3e-04
Identities = 24/185 (12%), Positives = 47/185 (25%)

Query: 168 AGISRYKERRRETENRIRHTRENLERLGDIRSELGKQLDKLAQQAKAAKQYRELKQAERK 227
A + + E + LE + L+K + A K +
Sbjct: 123 ADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLE 182

Query: 228 THAELLVMRYQELQDQMASLSEQIRAIEVQQAAAQSLAQTDELQTTELQVQLAQLAEQEQ 287
L R EL+ + A + ++ + +L+ L
Sbjct: 183 AEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFST 242

Query: 288 RAVEAYYLTGTEIAKLEQQLQNQKQRDAQLETQLAQVKEQILQNTDKLNGYKASLSALEL 347
E A LE + ++ +I + +A + LE
Sbjct: 243 ADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEH 302

Query: 348 ELAKL 352
+ L
Sbjct: 303 QSQVL 307



Score = 34.3 bits (78), Expect = 0.003
Identities = 35/231 (15%), Positives = 83/231 (35%), Gaps = 2/231 (0%)

Query: 612 KTEGAQSLVQLSKEQAQLAQNILDLQQSLHVQQDKMTELAATLQQQRQQLTAGAQKLHQL 671
+ +SL + + + +L DL+++L + T +A ++ + A A + L
Sbjct: 101 LRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADL 160

Query: 672 ELDKATKSMQLSGLTEQVKVRAEQQNKLEAALSASLIDLERLSEQCEVLAEQETELDDAL 731
E + + ++K ++ LEA + LE + + L+
Sbjct: 161 EKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEK 220

Query: 732 QASIDIQRQLTQDTQADSVRHQALKARITEVERQLSTTTSALQAVTMRMAVSTEQIELQQ 791
A + L + + A A+I +E + + + + + +
Sbjct: 221 AALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADS 280

Query: 792 VRVNELIHTRESVLAELAKVAQLSGVQNSVQLTEQLKQLLQQQHEQQQGLK 842
++ L + ++ AE A + S Q + L++ L E ++ L+
Sbjct: 281 AKIKTLEAEKAALEAEKADLEHQS--QVLNANRQSLRRDLDASREAKKQLE 329



Score = 31.6 bits (71), Expect = 0.021
Identities = 35/206 (16%), Positives = 79/206 (38%), Gaps = 7/206 (3%)

Query: 175 ERRRETENRIRHTRENLERLGDIRSELGKQLDKLAQQAKAAKQYRELKQAERKTHAELLV 234
+ + + + LE+ + + +A K E ++A+ + +++L
Sbjct: 249 KTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLN 308

Query: 235 MRYQELQDQMASLSEQIRAIEVQQAAAQSLAQTDELQTTELQVQLAQLAEQEQRAVEAYY 294
Q L+ + + E + +E + + + E L+ L + + ++ +EA
Sbjct: 309 ANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDA-SREAKKQLEA-- 365

Query: 295 LTGTEIAKLEQQLQNQKQRDAQLETQLAQVKEQILQNTDKLNGYKASLSALELELAKLGP 354
E KLE+Q + + L L +E Q L + L+ALE +L
Sbjct: 366 ----EHQKLEEQNKISEASRQSLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEE 421

Query: 355 QHDEQQEIMDELQSQWEMSIERSQQQ 380
++ ELQ++ E + +++
Sbjct: 422 SKKLTEKEKAELQAKLEAEAKALKEK 447


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Sbal_2659  HTHFIS  26     0.043    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 25.6 bits (56), Expect = 0.043
Identities = 10/59 (16%), Positives = 21/59 (35%), Gaps = 6/59 (10%)

Query: 7 HKSYPQAFKDEAVLMVLEQ-GYSVADAAKSLGVSTSLLYNWKEKHEALKQGITLEESER 64
+ + +L L + AA LG++ + L + G+++ S R
Sbjct: 429 YDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIREL-----GVSVYRSSR 482


36  Sbal_2713  Sbal_2727  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Sbal_2713  -1      17       -3.554775   intracellular septation protein A
Sbal_2714  1       19       -3.732149   YciI-like protein
Sbal_2715  1       21       -4.268216   exonuclease III
Sbal_2716  1       23       -5.196564   hypothetical protein
Sbal_2717  1       21       -4.451716   transposase, IS4 family protein
Sbal_2718  1       22       -4.429171   hypothetical protein
Sbal_2719  1       20       -3.765721   integrase catalytic subunit
Sbal_2720  0       18       -3.436918   IstB ATP binding domain-containing protein
Sbal_2721  1       13       -1.143420   hypothetical protein
Sbal_2723  -1      13       -0.235122   CRP/FNR family transcriptional regulator
Sbal_2724  0       13       2.145463    hypothetical protein
Sbal_2725  1       14       2.320426    hypothetical protein
Sbal_2726  1       15       2.345889    DSBA oxidoreductase
Sbal_2727  1       17       3.766137    methyl-accepting chemotaxis sensory transducer
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
Sbal_2714  adhesinmafb  25     0.042    Neisseria meningitidis: adhesin MafB signature.
		>adhesinmafb#Neisseria meningitidis: adhesin MafB signature.

Length = 467

Score = 25.4 bits (55), Expect = 0.042
Identities = 9/44 (20%), Positives = 14/44 (31%)

Query: 54 AGFSGSLVVADFESLVAAKHWADADPYIEAGVYKSVVVKPFKRV 97
G GS+ + + A W +P V V +V
Sbjct: 279 IGGLGSVAGFEKNTREAVDRWIQENPNAAETVEAVFNVAAAAKV 322


37  Sbal_2752  Sbal_2763  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Sbal_2752  -1      13       -4.958603   chorismate synthase
Sbal_2753  0       16       -6.008766   N5-glutamine S-adenosyl-L-methionine-dependent
Sbal_2754  1       18       -6.452120   hypothetical protein
Sbal_2755  0       16       -5.431163   phosphohistidine phosphatase, SixA
Sbal_2756  -1      14       -4.657644   peptidase M16 domain-containing protein
Sbal_2757  0       18       -4.470790   PAS/PAC and GAF sensor-containing diguanylate
Sbal_2758  0       14       2.008684    hypothetical protein
Sbal_2759  0       12       2.814495    hypothetical protein
Sbal_2760  0       12       2.889214    multifunctional fatty acid oxidation complex
Sbal_2761  0       11       3.708339    3-ketoacyl-CoA thiolase
Sbal_2762  1       12       3.672012    ATPase
Sbal_2763  1       13       3.478147    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Sbal_2762  HTHFIS  33     0.001    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 33.3 bits (76), Expect = 0.001
Identities = 35/139 (25%), Positives = 53/139 (38%), Gaps = 19/139 (13%)

Query: 28 LIALIANG--HLLVEGPPGLAKT---RAVKALCDGVEGDFHRIQ---FTPDLLPADLTG- 78
++A + L++ G G K RA+ G F I DL+ ++L G
Sbjct: 152 VLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGH 211

Query: 79 -----TDIYRSQTGTFEFEAGPIFHNLILADEINRAPAKVQSALLEAMAEGQVT-VGKHS 132
T TG FE G + DEI P Q+ LL + +G+ T VG +
Sbjct: 212 EKGAFTGAQTRSTGRFEQAEG----GTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRT 267

Query: 133 YKLPPLFLVMATQNPLENE 151
+ +V AT L+
Sbjct: 268 PIRSDVRIVAATNKDLKQS 286


38  Sbal_2776  Sbal_2784  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Sbal_2776  2       12       1.629548    hypothetical protein
Sbal_2777  1       12       1.996170    rhodanese domain-containing protein
Sbal_2778  2       14       1.757206    N-acetyltransferase GCN5
Sbal_2779  2       15       2.210356    peptidase S8/S53 subtilisin kexin sedolisin
Sbal_2780  0       16       2.185104    YaeQ family protein
Sbal_2781  1       17       2.047147    siroheme synthase
Sbal_2782  2       21       0.928771    rhodanese domain-containing protein
Sbal_2783  2       19       1.013334    hypothetical protein
Sbal_2784  2       19       0.944673    preprotein translocase subunit SecF
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
Sbal_2779  SUBTILISIN  170    2e-49    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 170 bits (433), Expect = 2e-49
Identities = 69/217 (31%), Positives = 110/217 (50%), Gaps = 10/217 (4%)

Query: 152 STPWGQTFVGATQLSDSQAG-NRTICIIDSGYDRSHSELGGNNVTGTN--NSGTGNWFEP 208
P G + A + + G + ++D+G D H +L + G N + G+
Sbjct: 21 EIPRGVEMIQAPAVWNQTRGRGVKVAVLDTGCDADHPDLKARIIGGRNFTDDDEGDPEIF 80

Query: 209 GNNNAHGTHVAGTIAAIANNDGVIGVMPNQTANIHVVKVFNESGWGYSSSLVAAVDTCVA 268
+ N HGTHVAGTIAA N +GV+GV P A++ ++KV N+ G G ++ + +
Sbjct: 81 KDYNGHGTHVAGTIAATENENGVVGVAPE--ADLLIIKVLNKQGSGQYDWIIQGIYYAIE 138

Query: 269 NGANVVTMSLGGAGSSTTERNALAAHYNNGVLLIAAAGNAGDSTHS-----YPASYDGVM 323
++++MSLGG A+ + +L++ AAGN GD YP Y+ V+
Sbjct: 139 QKVDIISMSLGGPEDVPELHEAVKKAVASQILVMCAAGNEGDGDDRTDELGYPGCYNEVI 198

Query: 324 SVASVDNHKDHSAFSQYTNQVEISGPGEAILSTVTRG 360
SV +++ + S FS N+V++ PGE ILSTV G
Sbjct: 199 SVGAINFDRHASEFSNSNNEVDLVAPGEDILSTVPGG 235



Score = 63.3 bits (154), Expect = 1e-12
Identities = 19/71 (26%), Positives = 29/71 (40%), Gaps = 7/71 (9%)

Query: 507 NKDYEYYNGTSMATPHVSGVATLVWS-----YHPECSAAQVRNALKMTAEDLGTAGRDNY 561
Y ++GTSMATPHV+G L+ + + + ++ L LG
Sbjct: 234 GGKYATFSGTSMATPHVAGALALIKQLANASFERDLTEPELYAQLIKRTIPLG--NSPKM 291

Query: 562 YGYGLVNAVAA 572
G GL+ A
Sbjct: 292 EGNGLLYLTAV 302


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_2784  SECFTRNLCASE  316    e-110    Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 316 bits (811), Expect = e-110
Identities = 113/309 (36%), Positives = 180/309 (58%), Gaps = 14/309 (4%)

Query: 2 LEILSLKHTVNFLRHALPISIMSAVLVLGSLVSLATNGINWGLDFTGGTVVEMEFTNPVD 61
L+++ K +F R + V+++ S++ G+N+G+DF GGT + E T +D
Sbjct: 5 LKLVPEKTNFDFFRWQWATFGAAIVMMIASVILPLVIGLNFGIDFKGGTTIRTESTTAID 64

Query: 62 LNALRVQLTTPDSEGAIVQNFGSSR------DVLVRLQVKE--------GVKSDVQVKSV 107
+ R L + I+ ++R+Q++E G + V V
Sbjct: 65 VGVYRAALEPLELGDVIISEVRDPSFREDQHVAMIRIQMQEDGQGAEGQGAQGQELVNKV 124

Query: 108 MEAAQKVDPQVQQKRVEFVGPQVGKELAEQGALAVLVALICIMIYVSFRFEWRLAFGSVA 167
A VDP ++ E VGP+V EL ++L A + IM Y+ RFEW+ A G+V
Sbjct: 125 ETALTAVDPALKITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVV 184

Query: 168 ALAHDVIVTLGVFSVFQLEFDLTVLAGLLTVVGYSLNDTIVVFDRIRENFLKMRKSDPEE 227
AL HDV++T+G+F+V QL+FDLT +A LLT+ GYS+NDT+VVFDR+REN +K + +
Sbjct: 185 ALVHDVLLTVGLFAVLQLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRD 244

Query: 228 VVNTSITQTMSRTIITTGTTLVVVVALFLKGGTMIHGFATALLMGIFVGTYSSIYVASFL 287
V+N S+ +T+SRT++T TTL+ +V + + GG +I GF A++ G+F GTYSS+YVA +
Sbjct: 245 VMNLSVNETLSRTVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAKNI 304

Query: 288 AIKLGINRE 296
+ +G++R
Sbjct: 305 VLFIGLDRN 313


39  Sbal_2825  Sbal_2835  Y  N  N  Genomic Island
LocusTag   DNBias  CDNBias  %GCBias     Product
Sbal_2825  1       19       3.332382    small multidrug resistance protein
Sbal_2826  1       20       3.511064    AMP-dependent synthetase and ligase
Sbal_2827  2       27       4.013415    MerR family transcriptional regulator
Sbal_2828  3       26       4.156583    acyl-CoA dehydrogenase domain-containing
Sbal_2829  3       25       3.836037    propionyl-CoA carboxylase
Sbal_2830  2       22       2.750516    enoyl-CoA hydratase/isomerase
Sbal_2831  1       21       2.717537    carbamoyl-phosphate synthase L chain
Sbal_2832  1       22       1.695510    pyruvate carboxyltransferase
Sbal_2833  3       18       1.288065    3-oxoacid CoA-transferase subunit A
Sbal_2834  2       18       0.736269    3-oxoacid CoA-transferase subunit B
Sbal_2835  2       16       -1.017753   hypothetical protein
40  Sbal_2867  Sbal_2899  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Sbal_2867  -1      15       -3.596627   GAF sensor signal transduction histidine kinase
Sbal_2868  -1      23       -4.288962   response regulator receiver modulated metal
Sbal_2869  0       30       -5.256698   lipoprotein
Sbal_2870  -1      28       -5.374795   hypothetical protein
Sbal_2871  0       28       -5.629158   IstB ATP binding domain-containing protein
Sbal_2872  -1      28       -5.644308   integrase catalytic subunit
Sbal_2874  -1      29       -6.663562   integrase catalytic subunit
Sbal_2875  0       32       -7.686594   IstB ATP binding domain-containing protein
Sbal_2876  0       38       -9.498973   polysaccharide biosynthesis protein CapD
Sbal_2877  1       44       -12.965655  sugar transferase
Sbal_2878  3       48       -15.030430  NAD-dependent epimerase/dehydratase
Sbal_2879  4       52       -17.509106  glycosyl transferase family protein
Sbal_2880  6       52       -18.842045  putative lipopolysaccharide biosynthesis
Sbal_2881  8       53       -19.197838  glycosyl transferase family protein
Sbal_2882  7       52       -18.601306  group 1 glycosyl transferase
Sbal_2883  6       47       -16.522092  hypothetical protein
Sbal_2884  3       42       -13.842595  glycosyl transferase family protein
Sbal_2885  2       36       -11.119065  polysaccharide biosynthesis protein
Sbal_2886  1       34       -8.514096   group 1 glycosyl transferase
Sbal_2887  0       29       -5.823683   dTDP-4-dehydrorhamnose 3,5-epimerase
Sbal_2888  0       29       -5.501391   dTDP-4-dehydrorhamnose reductase
Sbal_2889  0       27       -5.078200   glucose-1-phosphate thymidylyltransferase
Sbal_2890  0       26       -4.846612   dTDP-glucose-4,6-dehydratase
Sbal_2891  0       26       -4.989625   lipopolysaccharide biosynthesis protein
Sbal_2893  1       24       -4.254964   polysaccharide export protein
Sbal_2894  1       21       -3.604982   transcriptional activator RfaH
Sbal_2895  2       21       -3.220084   amino acid/peptide transporter
Sbal_2896  3       22       -3.637213   response regulator receiver protein
Sbal_2897  1       22       -3.502534   VacJ family lipoprotein
Sbal_2898  2       22       -3.009018   hypothetical protein
Sbal_2899  2       24       -2.667504   FlhB domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_2867  PF06580  39     3e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 39.1 bits (91), Expect = 3e-05
Identities = 28/161 (17%), Positives = 62/161 (38%), Gaps = 31/161 (19%)

Query: 266 NTMQDGLCLIERNLSRAAELV--------HNFKRTAADQSILERERFNLKNYIFQIFSSL 317
N + + LI + ++A E++ ++ + + A Q L E + +Y+ + S
Sbjct: 177 NALNNIRALILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQ-LAS-- 233

Query: 318 KPLMR-KKNITLKVELDDNIFIDSYPGAIAQIFTNLVANSFRHAFPDDFAGEKQIVIVVE 376
++ + + + +++ I P + Q LV N +H G +I++
Sbjct: 234 ---IQFEDRLQFENQINPAIMDVQVPPMLVQT---LVENGIKHGIAQLPQG-GKILLKGT 286

Query: 377 KEGSQIKMTYQDNGIGMSDEVKAKAFEPFFTTARQSGGTGL 417
K+ + + ++ G K +S GTGL
Sbjct: 287 KDNGTVTLEVENTGSLALKNTK------------ESTGTGL 315


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Sbal_2868  HTHFIS  46     2e-07    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 46.4 bits (110), Expect = 2e-07
Identities = 28/169 (16%), Positives = 57/169 (33%), Gaps = 25/169 (14%)

Query: 24 KVAIIDDEPGIHDVTRFALKNLTLDNRVLQFYSCYSAAEGLALLQTETDIALAFIDVVME 83
+ + DD+ I V AL D R +AA + L DVVM
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVR-----ITSNAATLWRWIAAGD-GDLVVTDVVMP 58

Query: 84 TDHAGLELVQKIRTELNNHSTRIILRTGQ--PGQAPE-------DQVIRDFDINDYKAKT 134
D +L+ +I+ +++ + Q A + D + + FD+ +
Sbjct: 59 -DENAFDLLPRIKKA--RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGII 115

Query: 135 ELTAARLKSCVYTSLRSYRDIK-IIEQSQ------RGMEKVIAASTSVL 176
A K +D ++ +S R + +++ +++
Sbjct: 116 GRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLM 164


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_2876  NUCEPIMERASE  57     6e-11    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 57.1 bits (138), Expect = 6e-11
Identities = 53/298 (17%), Positives = 101/298 (33%), Gaps = 44/298 (14%)

Query: 283 VMVTGAGGSIGSELCRQILKQSPKKLVLFELSEFALYTIDRELSATAFELGLDVKILPIM 342
+VTGA G IG + +++L+ + + + L+++ Y D L EL
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDY--Y--DVSLKQARLELLAQPGFQFHK 58

Query: 343 GSVQRENRVQAVMQSFRVQTVYHAAAYKHVPLVEHNVVEGVRNNIFGTLYTARAAIAAKV 402
+ + + S + V+ + V N +N+ G L K+
Sbjct: 59 IDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKI 118

Query: 403 ETFVLIST---------------DKAVRPTNIMGTTKRMAELALQALSKETHQTRFSMVR 447
+ + S+ D P ++ TK+ EL S + + +R
Sbjct: 119 QHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYS-HLYGLPATGLR 177

Query: 448 FGNVLGSSGS---VVPLFRKQIANGGPVTV-THRDITRFFMTIPEASQLVIQA------- 496
F V G G + F K + G + V + + R F I + ++ +I+
Sbjct: 178 FFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHA 237

Query: 497 -----------GAMGKGGDVFVLDMGKAVKIVDLAAKMIRLSGFEVKDELNP--DGDI 541
A V+ + V+++D + G E K + P GD+
Sbjct: 238 DTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQPGDV 295


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_2878  NUCEPIMERASE  84     1e-20    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 84.1 bits (208), Expect = 1e-20
Identities = 65/345 (18%), Positives = 116/345 (33%), Gaps = 58/345 (16%)

Query: 1 MLTGATGFVGGAVLTQLMQQPD--LAIRTLGRRITVKL-----SVNTSANVTHFIGEIDS 53
++TGA GF+G V +L++ + I L V L + ++
Sbjct: 4 LVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDLAD 63

Query: 54 ATDYSAALCD--IDVIIHCAARAHIMYDEVADPLAEYRRVNVEGTLNLARQGVAAGIKRF 111
+ + + R + Y + +P A Y N+ G LN+ I+
Sbjct: 64 REGMTDLFASGHFERVFISPHRLAVRY-SLENPHA-YADSNLTGFLNILEGCRHNKIQHL 121

Query: 112 VYLSSIKVNGESTSVGSPFYESDSL--PAKDFYGQSKAEAERQLIELSKETGLEVVIIRP 169
+Y SS V G + + PF DS+ P Y +K E S GL +R
Sbjct: 122 LYASSSSVYGLNRKM--PFSTDDSVDHPVS-LYAATKKANELMAHTYSHLYGLPATGLRF 178

Query: 170 TLVYGPGVKANFA--ALMNLVSKGIPLP-FGCVTQNKRSLVSIANLVDLIITCIDHPKAA 226
VYGP + + A + +G + + KR I ++ + II D A
Sbjct: 179 FTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKM-KRDFTYIDDIAEAIIRLQDVIPHA 237

Query: 227 NQ-----------------VFLVSDDHDVSTSEMVREMAKALDKSTWQLPIPIWCYKLVG 269
+ V+ + + V + ++ + AL
Sbjct: 238 DTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEA-------------- 283

Query: 270 KLFNKSEVVDRLTGSLQV---DISHTKEILGWKPPQTLQEGFKQT 311
K ++ G + D E++G+ P T+++G K
Sbjct: 284 ----KKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNF 324


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_2888  NUCEPIMERASE  54     2e-10    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 53.6 bits (129), Expect = 2e-10
Identities = 34/164 (20%), Positives = 62/164 (37%), Gaps = 33/164 (20%)

Query: 1 MKILVTGSNGQVGSCLVKQLSQMPEIEFWAVD-------------RTQL----------- 36
MK LVTG+ G +G + K+L + + +D R +L
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGH-QVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKI 59

Query: 37 DITDAAAVAKLVNDFKPHAIINAAAHTAVDKA-EDEVALSYAINRDGPQWLAEAANSAGA 95
D+ D + L + + AV + E+ A + N G + E
Sbjct: 60 DLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYA-DSNLTGFLNILEGCRHNK- 117

Query: 96 VMLHI---STDYVFAGDKQGEYRETDAID-PQGVYGKSKLAGEL 135
+ H+ S+ V+ +++ + D++D P +Y +K A EL
Sbjct: 118 -IQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANEL 160


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_2890  NUCEPIMERASE  176    9e-55    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 176 bits (449), Expect = 9e-55
Identities = 81/358 (22%), Positives = 146/358 (40%), Gaps = 48/358 (13%)

Query: 1 MKILVTGGAGFIGSAVVRFIINNTQDSVINVDKLT--YAGNL-ESLLSVEKNQRYAFEQV 57
MK LVTG AGFIG V + ++ V+ +D L Y +L ++ L + + F ++
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAG-HQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKI 59

Query: 58 DICDRVELDRVFNKYQPDAVMHLAAESHVDRSITGPGDFIQTNIVGTYTLLEAARHYWMQ 117
D+ DR + +F + V V S+ P + +N+ G +LE RH +Q
Sbjct: 60 DLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQ 119

Query: 118 LDAERKSAFRFHHISTDEVYGDLPHPDEVVPGSELALFTEITPYAPSSPYSASKASSDHL 177
+ S+ VYG + +P S + + P S Y+A+K +++ +
Sbjct: 120 ---------HLLYASSSSVYGL----NRKMPFST-----DDSVDHPVSLYAATKKANELM 161

Query: 178 VRAWLRTYGLPTIVTNCSNNYGPYHFPEKLIPLVILNALEGKALPIYGKGDQIRDWLYVE 237
+ YGLP YGP+ P+ + LEGK++ +Y G RD+ Y++
Sbjct: 162 AHTYSHLYGLPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYID 221

Query: 238 DHARALYKVV------------------TEGKIGETYNIGGHNEKQNLEVVQTICSILDS 279
D A A+ ++ YNIG + + ++ +Q + L
Sbjct: 222 DIAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALG- 280

Query: 280 LVPKATPYAEQITYVTDRPGHDRRYAIDATKMSNELKWQPEETFETGLRKTIEWYLAN 337
+A + + +PG + D + + + PE T + G++ + WY
Sbjct: 281 --IEA-----KKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDF 331


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Sbal_2896  HTHFIS  90     7e-22    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 89.9 bits (223), Expect = 7e-22
Identities = 27/100 (27%), Positives = 45/100 (45%)

Query: 8 VLLVEDDPVFRQIVASFLDTRGAQVTQACDGEEGLSLFKSQHFDVVLADLSMPKLGGLDM 67
+L+ +DD R ++ L G V + + D+V+ D+ MP D+
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 68 LKEMTRLAPLVPSVVISGNNVMADVVEALRIGASDYLVKP 107
L + + P +P +V+S N ++A GA DYL KP
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKP 105


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_2897  VACJLIPOPROT  229    1e-77    VacJ lipoprotein signature.
		>VACJLIPOPROT#VacJ lipoprotein signature.

Length = 251

Score = 229 bits (585), Expect = 1e-77
Identities = 85/222 (38%), Positives = 128/222 (57%), Gaps = 4/222 (1%)

Query: 44 PRDPFEGFNRAMWDFNYLYLDRYLYRPVAHGYNDYIPMPAKTGINNFVQNLEEPSSLVNN 103
DP EGFNR M++FN+ LD Y+ RPVA + DY+P PA+ G++NF NLEEP+ +VN
Sbjct: 28 RSDPLEGFNRTMYNFNFNVLDPYIVRPVAVAWRDYVPQPARNGLSNFTGNLEEPAVMVNY 87

Query: 104 VLQGKWGWAANAGGRFTINSTVGLLGVIDVADMMGMSRKQDE---FNEVLGYYGVPNGPY 160
LQG RF +N+ +G+ G IDVA M ++ E F LG+YGV GPY
Sbjct: 88 FLQGDPYQGMVHFTRFFLNTILGMGGFIDVAGMANPKLQRTEPHRFGSTLGHYGVGYGPY 147

Query: 161 FMAPFAGPYVVRELASDWVDGLYFPLSELTMWQTIVKWGLKNLHSRASAIDQERLVDNAL 220
PF G + +R+ D D LY LS LT ++ KW L+ + +RA +D + L+ +
Sbjct: 148 VQLPFYGSFTLRDDGGDMADALYPVLSWLTWPMSVGKWTLEGIETRAQLLDSDGLLRQSS 207

Query: 221 DPYAFVKDAYLQHMDYKVYDGNV-PQKQDDDELLDQYMQELE 261
DPY V++AY Q D+ G + PQ+ + + + +++++
Sbjct: 208 DPYIMVREAYFQRHDFIANGGELKPQENPNAQAIQDDLKDID 249


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_2899  TYPE3IMSPROT  56     7e-13    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 56.3 bits (136), Expect = 7e-13
Identities = 16/93 (17%), Positives = 34/93 (36%), Gaps = 9/93 (9%)

Query: 10 AVALSYDGRN--APKIVATGEGLIAEEIIALAKANGVYIHQDPHLSHFL-QLLELGEEIP 66
A+ + Y P + + + +A+ GV I Q L+ L + IP
Sbjct: 268 AIGILYKRGETPLPLVTFKYTDAQVQTVRKIAEEEGVPILQRIPLARALYWDALVDHYIP 327

Query: 67 KELYLLIAELIAFVYMLDGKFPEQWNNMHQKIV 99
E AE++ ++ + + H +++
Sbjct: 328 AEQIEATAEVLRWLERQNIE------KQHSEML 354


41  Sbal_2928  Sbal_2933  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Sbal_2928  0       23       -3.754116   two component, sigma54 specific, Fis family
Sbal_2929  2       26       -4.649697   PAS/PAC sensor signal transduction histidine
Sbal_2930  2       29       -5.635236   sigma-54 dependent transcriptional regulator
Sbal_2931  4       34       -6.919328   flagellar protein FliS
Sbal_2932  3       28       -5.904363   hypothetical protein
Sbal_2933  1       20       -3.622108   flagellar hook-associated 2 domain-containing
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Sbal_2928  HTHFIS  457    e-161    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 457 bits (1178), Expect = e-161
Identities = 167/483 (34%), Positives = 249/483 (51%), Gaps = 42/483 (8%)

Query: 1 MSEAKLLLVEDDASLREALLDTLMLAQYECIDVASGEDAILALKQHQFDLVISDVQMQGI 60
M+ A +L+ +DDA++R L L A Y+ ++ + DLV++DV M
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 GGLGLLNFLQQHHPKLPVLLMTAYATIGSAVDAIKLGAVDYLAKPFAPEVLLNQVSRYLP 120
LL +++ P LPVL+M+A T +A+ A + GA DYL KPF L+ + R L
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 121 LKQNVDQPVVAD-----------EKSLALLALAQRVAASDASVMILGPSGSGKEVLARYI 169
+ + D + + R+ +D ++MI G SG+GKE++AR +
Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARAL 180

Query: 170 HQHSSRADQAFVAINCAAIPENMLEATLFGYEKGAFTGAYQACPGKFEQAQGGTLLLDEI 229
H + R + FVAIN AAIP +++E+ LFG+EKGAFTGA G+FEQA+GGTL LDEI
Sbjct: 181 HDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEI 240

Query: 230 SEMDLGLQAKLLRVLQEREVERLGGRKTIKLDVRVLATSNRDLKAVVAAGGFREDLYYRI 289
+M + Q +LLRVLQ+ E +GGR I+ DVR++A +N+DLK + G FREDLYYR+
Sbjct: 241 GDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRL 300

Query: 290 NVFPLAWPALSQRPADILPLARHLLVKHAKALNVADVPELDENARRRLLSHRWPGNVREL 349
NV PL P L R DI L RH + + A+ + V D+ A + +H WPGNVREL
Sbjct: 301 NVVPLRLPPLRDRAEDIPDLVRHFV-QQAEKEGLD-VKRFDQEALELMKAHPWPGNVREL 358

Query: 350 DNVIQRALILRAGQVITANDIIIDAQDVILG--------------------------GED 383
+N+++R L VIT I + + I
Sbjct: 359 ENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFAS 418

Query: 384 LDQFVAEPDGLGEELKAQEHVIILETLNQCQGSRKLVAEKLGISARTLRYKMARMRDMGI 443
+ L E+ +IL L +G++ A+ LG++ TLR K +R++G+
Sbjct: 419 FGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKK---IRELGV 475

Query: 444 QLP 446
+
Sbjct: 476 SVY 478


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_2929  PF06580  34     8e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 34.1 bits (78), Expect = 8e-04
Identities = 32/185 (17%), Positives = 71/185 (38%), Gaps = 34/185 (18%)

Query: 167 LSDAAKAKFQQKLVDRLNELERQVNDMLLMAKGRQDELGDLITLAEVIDNVLANCEPIAA 226
L D KA ++++ L+EL R + ++LA+ + V + + +
Sbjct: 187 LEDPTKA---REMLTSLSELMRYS---------LRYSNARQVSLADELTVVDSYLQLASI 234

Query: 227 KQGCDLSFD-DVSSSSMLANSNALSSAINNLVMNSIEAGAT------EIRIQAAEEGDQL 279
+ L F+ ++ + + + + LV N I+ G +I ++ ++ +
Sbjct: 235 QFEDRLQFENQINPA--IMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTV 292

Query: 280 LLNVIDNGKGLDANMQQKVLEPFFTTKSQGTGLGLA-VVQSVVRNHGGQLQLSCLPNKGC 338
L V + G N + + TG GL V + + +G + Q+ +G
Sbjct: 293 TLEVENTGSLALKNTK------------ESTGTGLQNVRERLQMLYGTEAQIKLSEKQGK 340

Query: 339 TVSLV 343
++V
Sbjct: 341 VNAMV 345


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Sbal_2930  HTHFIS  432    e-150    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 432 bits (1112), Expect = e-150
Identities = 171/481 (35%), Positives = 262/481 (54%), Gaps = 21/481 (4%)

Query: 7 RILLIGPPSERLNRLCCIFDFLGEQIAQI-DAEKLSASLQDTRFRALVILTDVMDADA-- 63
IL+ + L G + +A L + +++TDV+ D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGD--LVVTDVVMPDENA 62

Query: 64 ---LKNIAGQHPWQPMLLL---GNVDDLQVSNILG---NIEEPLTYPQLTELLHFCQVFG 114
L I P P+L++ ++ G + +P +L ++
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 115 QVKRPQVPTSANQTKLFRSLVGRSDGIANVRHLINQVATSEATVLVLGQSGTGKEVVARN 174
+ + ++ + LVGRS + + ++ ++ ++ T+++ G+SGTGKE+VAR
Sbjct: 123 KRRPSKLEDDSQD---GMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARA 179

Query: 175 IHYLSERRDGPFIPVNCGAIPPELLESELFGHEKGSFTGAICSRKGRFELAEGGTLFLDE 234
+H +RR+GPF+ +N AIP +L+ESELFGHEKG+FTGA GRFE AEGGTLFLDE
Sbjct: 180 LHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDE 239

Query: 235 IGDMPLQMQVKLLRVLQERVFERVGGTKTINADVRVVAATHRDLETMISVNEFREDLYYR 294
IGDMP+ Q +LLRVLQ+ + VGG I +DVR+VAAT++DL+ I+ FREDLYYR
Sbjct: 240 IGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYR 299

Query: 295 LNVFPIEMPALCDRKDDVPLLLQELVSRVYNEGRGKVRFTQRAIESLKEHAWSGNVRELS 354
LNV P+ +P L DR +D+P L++ V + EG RF Q A+E +K H W GNVREL
Sbjct: 300 LNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGNVRELE 359

Query: 355 NLVERLTILYPGGLVDVNDLPVKYRHIDVPEYCVEMSEEQQERDALASIFSDEEPVEIPE 414
NLV RLT LYP ++ + + R ++P+ +E + + +++ EE +
Sbjct: 360 NLVRRLTALYPQDVITREIIENELRS-EIPDSPIEKAAARSGSLSISQAV--EENMRQYF 416

Query: 415 TRFPSELPPEGVNLKDLLAELEIDMIRQALELQDNVVARAAEMLGIRRTTLVEKMRKYGM 474
F LPP G+ +LAE+E +I AL +AA++LG+ R TL +K+R+ G+
Sbjct: 417 ASFGDALPPSGL-YDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475

Query: 475 T 475
+
Sbjct: 476 S 476


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Sbal_2933  FLAGELLIN  29     0.044    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 28.9 bits (64), Expect = 0.044
Identities = 26/228 (11%), Positives = 54/228 (23%), Gaps = 2/228 (0%)

Query: 4 TATGIGSGLKINEIVQVLVDAEKKPKEAMFNKKEDSIKAKVSAMGTLKSALSTFQDALKK 63
G+ + V + V T KS T +
Sbjct: 207 VDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIA 266

Query: 64 LQTGDALNQRKITVSNETYLTATADKTAQAGSYGIKVEQLAVNHKVAGINVADPTLPVGE 123
T+ T G + V VA I +
Sbjct: 267 GAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTTINGEKVTLTVADITAGAANVDAAT 326

Query: 124 GSLDFGINGKNFSIDVSATDSIAAIAKKVNESSDNVGVTATVITSDAGSRLIFSSNKSGE 183
+ + + D + K+++ N V + G+ ++
Sbjct: 327 LQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESKITVNGAEYTANAAGDKV 386

Query: 184 DNQISITANDTSGTGLNDMFGAGNITSLQDAKNAIVYIDN--QKVTSQ 229
D + +G++ + + + N + ID+ KV +
Sbjct: 387 TLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPLASIDSALSKVDAV 434


42  Sbal_2945  Sbal_2968  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Sbal_2945  -1      22       -3.419454   flagellar basal body rod modification protein
Sbal_2946  0       22       -3.853229   flagellar basal body rod protein FlgC
Sbal_2947  1       21       -4.264755   flagellar basal body rod protein FlgB
Sbal_2948  1       20       -4.111669   protein-glutamate O-methyltransferase
Sbal_2949  0       19       -3.982265   putative CheW protein
Sbal_2950  1       19       -3.352730   flagellar basal body P-ring biosynthesis protein
Sbal_2951  2       19       -4.072363   anti-sigma-28 factor FlgM
Sbal_2952  1       20       -4.276280   FlgN family protein
Sbal_2953  2       22       -4.261783   hypothetical protein
Sbal_2954  2       23       -5.213155   hypothetical protein
Sbal_2955  2       26       -5.425225   hypothetical protein
Sbal_2956  3       31       -6.141295   *hypothetical protein
Sbal_2957  4       35       -6.344815   hypothetical protein
Sbal_2958  5       34       -6.233044   N-acetylneuraminic acid synthase
Sbal_2959  5       32       -5.688083   phosphoribosylglycinamide synthetase
Sbal_2960  5       29       -5.345952   short-chain dehydrogenase/reductase SDR
Sbal_2961  5       29       -5.403966   transketolase, central region
Sbal_2962  3       28       -5.516839   type 11 methyltransferase
Sbal_2963  3       29       -7.528745   type 12 methyltransferase
Sbal_2964  3       27       -7.437285   glycoside hydrolase
Sbal_2965  2       26       -7.892903   acylneuraminate cytidylyltransferase
Sbal_2966  0       24       -7.429615   DegT/DnrJ/EryC1/StrS aminotransferase
Sbal_2967  0       22       -7.155160   polysaccharide biosynthesis protein CapD
Sbal_2968  0       17       -5.152966   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
Sbal_2946  FLGHOOKAP1  33     3e-04    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 33.0 bits (75), Expect = 3e-04
Identities = 9/38 (23%), Positives = 18/38 (47%)

Query: 99 NVNVMEEMADMISASRSYQMNVQVAEAAKSMLQQTLGM 136
VN+ EE ++ + Y N QV + A ++ + +
Sbjct: 508 GVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545



Score = 29.5 bits (66), Expect = 0.003
Identities = 16/67 (23%), Positives = 29/67 (43%), Gaps = 6/67 (8%)

Query: 5 SIFDVAGSGMSAQSVRLNTTASNIANADSVSSSVDKTYRARHPIFEAEMAKAQSQQQASQ 64
S+ + A SG++A LNT ++NI++ + Y + I + +
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYN------VAGYTRQTTIMAQANSTLGAGGWVGN 55

Query: 65 GVAVKGI 71
GV V G+
Sbjct: 56 GVYVSGV 62


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Sbal_2949  HTHFIS  61     1e-12    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 61.4 bits (149), Expect = 1e-12
Identities = 23/128 (17%), Positives = 52/128 (40%), Gaps = 12/128 (9%)

Query: 180 HIMVIDDSAVARKQIIRSLESLNLQIDTAKDGREALDKLKEIAKEMDNVADEIPLIISDI 239
I+V DD A R + ++L + + + A + L+++D+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIA---------AGDGDLVVTDV 55

Query: 240 EMPEMDGYTLTAEIRDDPKLKHIKVVLHTSLSGVFNQAMVQKVGANDFIAK-FNPDELAA 298
MP+ + + L I+ + V++ ++ + + GA D++ K F+ EL
Sbjct: 56 VMPDENAFDLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIG 113

Query: 299 AVNKHLSL 306
+ + L+
Sbjct: 114 IIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_2960  DHBDHDRGNASE  125    4e-37    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 125 bits (315), Expect = 4e-37
Identities = 89/257 (34%), Positives = 121/257 (47%), Gaps = 14/257 (5%)

Query: 4 LTGKVALITGASRGIGAGIAEAFAIEGADLIINYRTNDDAAFRVVSKLKNLGRKVVAIRA 63
+ GK+A ITGA++GIG +A A +GA I N + +VVS LK R A A
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAH-IAAVDYNPEKLEKVVSSLKAEARHAEAFPA 64

Query: 64 DVSKRSEIQNLIHLAQIEFGRIDILVNNAGINQRGWFNEVTDEAWDMIMGTNLKGPFMCC 123
DV + I + + E G IDILVN AG+ + G + ++DE W+ N G F
Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNAS 124

Query: 124 QEVFPLMKANGGGRIINISSVAGQYHGPKT--VHYAVSKAGLNSLTKVLARYGAEYNVLV 181
+ V M G I+ + S P+T YA SKA TK L AEYN+
Sbjct: 125 RSVSKYMMDRRSGSIVTVGSNPA--GVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 182 NAVAPGLVRTD-QTIDEIDSPAGARVLDMTL--------LKKAGRIEDISSACVFLASDE 232
N V+PG TD Q D +V+ +L LKK + DI+ A +FL S +
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 233 QQYMTGQILAVSGGAIL 249
++T L V GGA L
Sbjct: 243 AGHITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_2967  NUCEPIMERASE  81     2e-19    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 80.6 bits (199), Expect = 2e-19
Identities = 42/245 (17%), Positives = 85/245 (34%), Gaps = 54/245 (22%)

Query: 6 TILITGGTGSFGQKYTKTILAKY-----------------KPKRLIILSRDELKQYEMQQ 48
L+TG G G +K +L K RL +L++ + ++
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHK--- 58

Query: 49 IYNAPCMRYFIGDVRDGDRLEQAFKDVDF--VIHAAALKQVPAAEYNPMECIKTNIYGAE 106
D+ D + + F F V + V + NP +N+ G
Sbjct: 59 -----------IDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFL 107

Query: 107 NVIRAAISNNVSKVIALST---------------DKAANPINLYGATKLASDKLFVAANN 151
N++ N + ++ S+ D +P++LY ATK A++ + ++
Sbjct: 108 NILEGCRHNKIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSH 167

Query: 152 IVGSGKTRFAAVRYGNVVGSRGS---VVPFFKQLIANGADALPITHQDMTRFWISLQDGV 208
+ G +R+ V G G + F + + G + M R + + D
Sbjct: 168 LYG---LPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIA 224

Query: 209 DFVLK 213
+ +++
Sbjct: 225 EAIIR 229


43  Sbal_2981  Sbal_2986  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Sbal_2981  2       16       1.842670    putative transglycosylase
Sbal_2982  2       18       1.823010    zinc-binding CMP/dCMP deaminase
Sbal_2983  2       18       1.461503    GMP synthase
Sbal_2984  2       17       1.270709    inosine 5'-monophosphate dehydrogenase
Sbal_2985  2       14       1.322263    exodeoxyribonuclease VII large subunit
Sbal_2986  2       17       1.134010    GTP-binding protein EngA
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Sbal_2986  TCRTETOQM  33     0.003    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 32.9 bits (75), Expect = 0.003
Identities = 38/159 (23%), Positives = 67/159 (42%), Gaps = 35/159 (22%)

Query: 199 IKLAIIGKPNVGKSTLTNRIL----GEERVVVFDEPGTTRDSIYIPMER----------- 243
I + ++ + GK+TLT +L + D+ T D+ + +R
Sbjct: 4 INIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSF 63

Query: 244 --EGREYVIIDTAGVRRRSKVHQVIEKFSVIKTLKAVEDANVVLLIIDAREGIAEQDLGL 301
E + IIDT G + + E V ++L ++ A +L+I A++G+ Q L
Sbjct: 64 QWENTKVNIIDTPG-----HMDFLAE---VYRSLSVLDGA---ILLISAKDGVQAQTRIL 112

Query: 302 LGFALNAGRALVIAVNKWD--GID-----QGIKDRVKSE 333
G + +NK D GID Q IK+++ +E
Sbjct: 113 FHALRKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAE 151


44  Sbal_3131  Sbal_3141  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Sbal_3131  0       17       -3.343133   amidase
Sbal_3132  2       24       -6.882524   5-
Sbal_3133  4       26       -8.296225   hypothetical protein
Sbal_3134  5       28       -7.141866   hypothetical protein
Sbal_3135  5       27       -7.111175   putative reverse transcriptase
Sbal_3136  3       27       -5.677712   hypothetical protein
Sbal_3137  3       26       -4.746113   hypothetical protein
Sbal_3138  3       27       -4.561351   hypothetical protein
Sbal_3139  3       27       -3.771455   transposase, IS4 family protein
Sbal_3140  3       25       -3.829202   IstB ATP binding domain-containing protein
Sbal_3141  2       16       -1.180866   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_3139  FLGPRINGFLGI  27     0.028    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 26.8 bits (59), Expect = 0.028
Identities = 8/23 (34%), Positives = 12/23 (52%)

Query: 87 KQLLGGTLSMRNYNAQVGETYAM 109
L GG L M + + G+ YA+
Sbjct: 122 TSLRGGNLIMTSLSGADGQIYAV 144


45  Sbal_3234  Sbal_3239  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
Sbal_3234  4       29       -0.801147   polynucleotide phosphorylase/polyadenylase
Sbal_3235  4       23       -0.563289   diguanylate cyclase/phosphodiesterase
Sbal_3236  6       29       -0.374052   30S ribosomal protein S15
Sbal_3237  5       27       -0.374000   tRNA pseudouridine synthase B
Sbal_3238  6       30       -0.665579   ribosome-binding factor A
Sbal_3239  4       26       -0.300724   translation initiation factor IF-2
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Sbal_3239  TCRTETOQM  72     5e-15    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 72.2 bits (177), Expect = 5e-15
Identities = 51/202 (25%), Positives = 78/202 (38%), Gaps = 30/202 (14%)

Query: 387 IMGHVDHGKTSLLDYIRRAKVAAGEAG------------------GITQHIGAYHVETEN 428
++ HVD GKT+L + + A E G GIT G + EN
Sbjct: 8 VLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQWEN 67

Query: 429 GMITFLDTPGHAAFTAMRARGAKATDIVVLVVAADDGVMPQTIEAIQHAKAGNVPLIVAV 488
+ +DTPGH F A R D +L+++A DGV QT + +P I +
Sbjct: 68 TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIPTIFFI 127

Query: 489 NKMDKPEADIDRV----KSELSQHGVMS-------EDWGGDNMFAFVSAKTGEGVDELLE 537
NK+D+ D+ V K +LS V+ + + EG D+LLE
Sbjct: 128 NKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEGNDDLLE 187

Query: 538 GILLQAEVLELKAVRDGMAAGV 559
+ + LE + +
Sbjct: 188 K-YMSGKSLEALELEQEESIRF 208


46  Sbal_3295  Sbal_3316  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias    %GCBias  Product
Sbal_3295       2       16   2.935417  putative lipoprotein
Sbal_3296       1       15   2.219251  hypothetical protein
Sbal_3297       1       13   2.092922  hypothetical protein
Sbal_3298       1       14   2.203977  endonuclease/exonuclease/phosphatase
Sbal_3299       0       17   2.465513  hypothetical protein
Sbal_3300       0       18   2.278427  magnesium transporter
Sbal_3301       1       17   2.039260  methyl-accepting chemotaxis sensory transducer
Sbal_3302       2       20   2.210844  extracellular solute-binding protein
Sbal_3303       2       21   2.824293  pseudouridine synthase
Sbal_3304       2       21   2.619697  carbamoyl phosphate synthase large subunit
Sbal_3305      -1       15   3.065505  carbamoyl phosphate synthase small subunit
Sbal_3306       0       17   3.303620  dihydrodipicolinate reductase
Sbal_3307       1       15   2.788327  FKBP-type peptidylprolyl isomerase
Sbal_3308       0       17   1.632487  peptidase M48 Ste24p
Sbal_3309       1       17  -1.128437  N-acetyltransferase GCN5
Sbal_3310      -1       19  -2.569667  DEAD/DEAH box helicase
Sbal_3311       1       17  -3.393342  hypothetical protein
Sbal_3313       3       22  -2.900380  transposase IS3/IS911 family protein
Sbal_3314       2       18  -2.130063  integrase catalytic subunit
Sbal_3316       2       18  -1.311117  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_3301  FbpA_PF05833  29     0.023    Fibronectin-binding protein
		>FbpA_PF05833#Fibronectin-binding protein

Length = 577

Score = 29.5 bits (66), Expect = 0.023
Identities = 11/64 (17%), Positives = 23/64 (35%), Gaps = 8/64 (12%)

Query: 6 QLKAENKALKERLIQLEQQRQNEIDELRGMIRENETMQQQSRQNADHYTEVIACQNQGGE 65
K ++ LK + L++ N I+ + ++ + D + GE
Sbjct: 289 YAKDKSDRLKSKSSDLQKIVMNNINRCTKKDKILNNTLKKCE-DKDIF-------KLYGE 340

Query: 66 MLNA 69
+L A
Sbjct: 341 LLTA 344


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_3307  INFPOTNTIATR  145    1e-45    Macrophage infectivity potentiator signature.
		>INFPOTNTIATR#Macrophage infectivity potentiator signature.

Length = 233

Score = 145 bits (366), Expect = 1e-45
Identities = 75/203 (36%), Positives = 116/203 (57%), Gaps = 5/203 (2%)

Query: 6 STVEQQASYGVGRQMGEQLAANSFDGVDIPAVQAGLADAFAGLESAVS---MQDLQVAFT 62
+T + + SY +G +G+ D ++ + G+ D +G + ++ M+D+ F
Sbjct: 28 TTDKDKLSYSIGADLGKNFKNQGID-INPDVLAKGMQDGMSGAQLILTEEQMKDVLSKFQ 86

Query: 63 -EISGRIQAAQEQAAAAASSEGDAFLAENAKRDGVTVTDSGLQFEVLVQGDGATPTYEDT 121
++ + A + A ++GDAFL+ N + G+ V SGLQ++++ G GA P DT
Sbjct: 87 KDLMAKRSAEFNKKAEENKAKGDAFLSANKSKPGIVVLPSGLQYKIIDAGTGAKPGKSDT 146

Query: 122 VRTHYHGSFINGDVFDSSVVRGQPAEFPVSGVIAGWTEALQLMPVGTKLKLFVPHHLAYG 181
V Y G+ I+G VFDS+ G+PA F VS VI GWTEALQLMP G+ ++FVP LAYG
Sbjct: 147 VTVEYTGTLIDGTVFDSTEKAGKPATFQVSQVIPGWTEALQLMPAGSTWEVFVPADLAYG 206

Query: 182 ERGAGASIPPYSTLVFEVELLDI 204
R G I P TL+F++ L+ +
Sbjct: 207 PRSVGGPIGPNETLIFKIHLISV 229


47  Sbal_3344  Sbal_3365  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias    %GCBias  Product
Sbal_3344       2       20  -6.478491  dihydroorotase
Sbal_3345       3       26  -8.207180  hypothetical protein
Sbal_3346       2       25  -6.633359  hypothetical protein
Sbal_3348       2       25  -6.246556  NACHT family-like NTPase
Sbal_3349       1       22  -3.825732  hypothetical protein
Sbal_3350       0       19  -0.821678  hypothetical protein
Sbal_3351      -1       19   2.661294  N-acetyltransferase GCN5
Sbal_3352       0       19   3.335558  GntR family transcriptional regulator
Sbal_3353       1       18   3.017281  hypothetical protein
Sbal_3354       0       17   2.812637  N-acetyltransferase GCN5
Sbal_3355       0       17   3.205364  cell division protein ZipA
Sbal_3356       0       16   3.395047  glycerol dehydrogenase
Sbal_3357      -1       16   3.623610  aldehyde dehydrogenase
Sbal_3358       0       18   3.449824  periplasmic binding protein
Sbal_3359       0       16   3.646727  hypothetical protein
Sbal_3360       0       20   4.247384  cob(I)yrinic acid a,c-diamide
Sbal_3361       0       20   4.323906  cobyric acid synthase
Sbal_3362       1       23   5.391013  cobalbumin biosynthesis protein
Sbal_3363       1       25   4.961185  cobalamin synthase
Sbal_3364       0       22   4.505316  nicotinate-nucleotide--dimethylbenzimidazole
Sbal_3365       0       23   4.409888  transport system permease
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_3351  SACTRNSFRASE  43     4e-08    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 43.4 bits (102), Expect = 4e-08
Identities = 23/91 (25%), Positives = 34/91 (37%), Gaps = 5/91 (5%)

Query: 57 CAVAREGENLLGFVHFVFHRSTWAETEYCYLEDLFVAPAARGKLVGKQLIEFVQQAANER 116
+ N +G + RS W Y +ED+ VA R K VG L+ + A E
Sbjct: 67 AFLYYLENNCIGRIKI---RSNW--NGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKEN 121

Query: 117 DCARLYWHTHETNHTAQKLYDWIGQKSGMIE 147
L T + N +A Y G ++
Sbjct: 122 HFCGLMLETQDINISACHFYAKHHFIIGAVD 152


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_3354  SACTRNSFRASE  34     1e-04    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 33.8 bits (77), Expect = 1e-04
Identities = 14/56 (25%), Positives = 25/56 (44%), Gaps = 3/56 (5%)

Query: 66 IVDIAVDPEHQGKGLGRLIMQHIMAYLDREAFEGAYI-TLMADVP--ELYEKFGFK 118
I DIAV +++ KG+G ++ + + F G + T ++ Y K F
Sbjct: 92 IEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFI 147


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_3358  FERRIBNDNGPP  40     7e-06    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 39.5 bits (92), Expect = 7e-06
Identities = 44/180 (24%), Positives = 67/180 (37%), Gaps = 16/180 (8%)

Query: 18 PHCVLADPAKRIIALSPHAVEMLYAIGAGDAIVAATDYADY------PEAAKKIPRIGGY 71
H DP RI+AL VE+L A+G VA D +Y P + +G
Sbjct: 28 AHAAAIDP-NRIVALEWLPVELLLALGIVPYGVA--DTINYRLWVSEPPLPDSVIDVGLR 84

Query: 72 YGIQMERVMELNPDLIVVWDTGNKA--EDINQL-RTLGFNLYGSDPKTLEGVAKELEELG 128
+E + E+ P + VW G E + ++ GFN + + L K L E+
Sbjct: 85 TEPNLELLTEMKPSFM-VWSAGYGPSPEMLARIAPGRGFN-FSDGKQPLAMARKSLTEMA 142

Query: 129 KLTGHVEEASKAAAAYRAELIRLRTDNASKSE-PKVFYQLWSTPLMTV-SKNSWIQQIIS 186
L A A Y + ++ + P + L M V NS Q+I+
Sbjct: 143 DLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVFGPNSLFQEILD 202


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_3362  PF07328  27     0.030    T-DNA border endonuclease VirD1
		>PF07328#T-DNA border endonuclease VirD1

Length = 144

Score = 27.3 bits (60), Expect = 0.030
Identities = 18/65 (27%), Positives = 29/65 (44%), Gaps = 11/65 (16%)

Query: 120 IPSASSTSNDEALASWQAEKTAFVDSLADLEGVVVLISNEVGSGIVPLGELSRQFVDEAG 179
I A++ ++D A S+ AE+ L+ L V + PL E+SR+ D
Sbjct: 88 IAKAANRTHDPAYHSFMAERKVLGLELSKLSAV-----------LAPLMEVSRRRSDGLE 136

Query: 180 WLNQA 184
L +A
Sbjct: 137 RLQKA 141


48  Sbal_3512  Sbal_3517  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias    %GCBias  Product
Sbal_3512       4       29  -1.205706  nitrate reductase cytochrome c-type subunit
Sbal_3513       3       29  -2.417097  hypothetical protein
Sbal_3514       3       23  -2.561711  LysR family transcriptional regulator
Sbal_3515       4       25  -2.146949  elongation factor G
Sbal_3516       2       16  -1.994620  diguanylate cyclase
Sbal_3517       2       14  -1.753956  IstB ATP binding domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_3512  PF06291  26     0.030    Lambda prophage Bor protein
		>PF06291#Lambda prophage Bor protein

Length = 102

Score = 26.2 bits (57), Expect = 0.030
Identities = 11/29 (37%), Positives = 15/29 (51%)

Query: 1 MRKILTLTALLVAITGCSGQQTDTNVAPV 29
M+K+L AL + ITGC+ Q P
Sbjct: 6 MKKMLFSAALAMLITGCAQQTFTVGNKPT 34


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Sbal_3515  TCRTETOQM  539    0.0      Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 539 bits (1389), Expect = 0.0
Identities = 170/668 (25%), Positives = 292/668 (43%), Gaps = 60/668 (8%)

Query: 6 KYRNIGIFAHVDAGKTTTTERILKLTGKIHKIGEVHDGESTTDFMVQEAERGITIQSAAV 65
K NIG+ AHVDAGKTT TE +L +G I ++G V G + TD + E +RGITIQ+
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61

Query: 66 SCFWKDHRFNVIDTPGHVDFTVEVYRSLKVLDGGIGVFCGSGGVEPQSETNWRYANESEV 125
S W++ + N+IDTPGH+DF EVYRSL VLDG I + GV+ Q+ + + +
Sbjct: 62 SFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGI 121

Query: 126 ARIIFVNKLDRMGADFLRVVKQTKDVLAATPLVMVLPIGVEDEFTGVVDLLTRKAYVWDD 185
I F+NK+D+ G D V + K+ L+A ++ +K ++ +
Sbjct: 122 PTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIK------------------QKVELYPN 163

Query: 186 SGLPENFEVLDVPADMVDMVEEYREMLIETAVEQDDAVMEAYMEGEEPSMEDIKRCIRTG 245
+ + + +T +E +D ++E YM G+ ++++
Sbjct: 164 ----------MCVTNFTESEQ------WDTVIEGNDDLLEKYMSGKSLEALELEQEESIR 207

Query: 246 TRKLAFFPTYCGSAFKNKGMQLVLDAVVDYLPAPDEVDPQPLTDEEGNETGEFAIVSADE 305
+ FP Y GSA N G+ +++ + + +
Sbjct: 208 FHNCSLFPVYHGSAKNNIGIDNLIEVITNKFYSSTH--------------------RGQS 247

Query: 306 PLKALAFKI-MDDRFGALTFVRIYSGRLKKGDTILNSATGKTERIGRMCEMYADDRIEIE 364
L FKI ++ L ++R+YSG L D++ S K +I M + +I+
Sbjct: 248 ELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEK-IKITEMYTSINGELCKID 306

Query: 365 SAQAGDIIAIVGMKNVQTGHTLCDVKHPVTLEAMVFPEPVISIAVAPKDKGGSEKMGIAI 424
A +G+I+ + + ++ L D K E + P P++ V P E + A+
Sbjct: 307 KAYSGEIVILQN-EFLKLNSVLGDTKLLPQRERIENPLPLLQTTVEPSKPQQREMLLDAL 365

Query: 425 GKMIAEDPSFRVETDEDSGETILKGMGELHLDIKVDILKRTYGVELIVGEPQVAYRETIT 484
++ DP R D + E IL +G++ +++ +L+ Y VE+ + EP V Y E
Sbjct: 366 LEISDSDPLLRYYVDSATHEIILSFLGKVQMEVTCALLQEKYHVEIEIKEPTVIYMERPL 425

Query: 485 QMVEDQYTHKKQSGGSGQFGKIEYIIRPGEANTGFVFKSSVVGGSVPKEFWPAVEKGFAS 544
+ E YT + + + I + P +G ++SSV G + + F AV +G
Sbjct: 426 KKAE--YTIHIEVPPNPFWASIGLSVSPLPLGSGMQYESSVSLGYLNQSFQNAVMEGIRY 483

Query: 545 MMNTGTIAGFPVLDVEFELTDGAFHAVDSSAIAFEIAAKGAFRQSIAKAKPQLLEPIMKV 604
G + G+ V D + G +++ S+ F + A Q + KA +LLEP +
Sbjct: 484 GCEQG-LYGWNVTDCKICFKYGLYYSPVSTPADFRMLAPIVLEQVLKKAGTELLEPYLSF 542

Query: 605 DVFSPDDNVGDVIGDLNRRRGMIKDQVAGVTGVRVKADVPLSEMFGYIGTLRTMTSGRGQ 664
+++P + + D + I D V + ++P + Y L T+GR
Sbjct: 543 KIYAPQEYLSRAYTDAPKYCANIVDTQLKNNEVILSGEIPARCIQEYRSDLTFFTNGRSV 602

Query: 665 FSMEFSHY 672
E Y
Sbjct: 603 CLTELKGY 610


49  Sbal_3532  Sbal_3565  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias    %GCBias  Product
Sbal_3532      -1       19   3.235518  hypothetical protein
Sbal_3533       0       19   3.257250  peptidase M50
Sbal_3534      -1       19   3.576577  RND efflux system outer membrane lipoprotein
Sbal_3535      -1       18   3.252523  ABC transporter-like protein
Sbal_3536      -1       14   1.701521  RND family efflux transporter MFP subunit
Sbal_3537       0       16   1.116970  ModE family transcriptional regulator
Sbal_3538       1       15   0.319522  molybdenum ABC transporter periplasmic
Sbal_3539       2       16   0.087682  molybdate ABC transporter permease
Sbal_3540       2       17  -0.693407  molybdate transporter ATP-binding protein
Sbal_3541       3       20  -1.373516  phage integrase family protein
Sbal_3542       1       19  -2.750291  XRE family transcriptional regulator
Sbal_3543       1       21  -3.232328  hypothetical protein
Sbal_3544       0       20  -3.120586  transposase IS3/IS911 family protein
Sbal_3545       1       22  -3.883104  integrase catalytic subunit
Sbal_3546       2       27  -5.533707  transposase IS3/IS911 family protein
Sbal_3547       2       24  -4.672039  integrase catalytic subunit
Sbal_3548       3       24  -4.476074  IstB ATP binding domain-containing protein
Sbal_3550       2       25  -3.875743  integrase catalytic subunit
Sbal_3551       2       25  -6.056522  hypothetical protein
Sbal_3552       1       24  -7.291313  hypothetical protein
Sbal_3553       1       20  -5.606033  DNA-directed DNA polymerase
Sbal_3554       1       17  -4.732300  putative prophage repressor
Sbal_3555       1       17  -4.273539  integrase catalytic subunit
Sbal_3556       1       16  -4.406118  hypothetical protein
Sbal_3557       2       18  -2.244834  hypothetical protein
Sbal_3558       3       22   0.332468  phage integrase family protein
Sbal_3559       0       18  -0.303133  ATPase central domain-containing protein
Sbal_3560       1       19  -1.047296  hypothetical protein
Sbal_3561       0       16  -1.604918  DNA repair protein RadC
Sbal_3562       0       17  -2.376971  hypothetical protein
Sbal_3563      -1       16  -2.261391  hypothetical protein
Sbal_3564      -1       13  -2.055364  *methyl-accepting chemotaxis sensory transducer
Sbal_3565       0       16  -3.226592  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Sbal_3534  RTXTOXIND  34     0.002    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 33.6 bits (77), Expect = 0.002
Identities = 26/189 (13%), Positives = 55/189 (29%), Gaps = 11/189 (5%)

Query: 70 QPELDKLVSQVLSTNNDL-----TLATLTLRKARLQAGLTRDDLYPQLSANLGASRNQPL 124
+P + +V +++ + L LT A T+ L L A L +R Q L
Sbjct: 100 KPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSL---LQARLEQTRYQIL 156

Query: 125 DGGDATKSYQANLSVSYEVDLWGKVSANIDQAQWAALASA-EDRESTTQSLVATTASLYW 183
+ + + + + VS + + ++
Sbjct: 157 S--RSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRA 214

Query: 184 QIGYLNQRVDLSQKSIAYSKQTLDLTQRQYDSGAVTQVNVLEAQRSMAGQEAAHSQLIQQ 243
+ + R++ + K LD A+ + VLE + Q
Sbjct: 215 ERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQ 274

Query: 244 LTVAKNALA 252
L ++ +
Sbjct: 275 LEQIESEIL 283


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Sbal_3536  RTXTOXIND  64     3e-13    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 64.1 bits (156), Expect = 3e-13
Identities = 34/181 (18%), Positives = 60/181 (33%), Gaps = 39/181 (21%)

Query: 106 NLKDALASLKSINAQFRAKQAQIRQAKLEFSRQQQMLADKASSRADY-----EVADAN-- 158
NL A ++ A+ + R K +L +A ++ + +A
Sbjct: 208 NLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNE 267

Query: 159 LTVYQAELEQLEAQKQQAEINV-----------------------------DSARIDLGY 189
L VY+++LEQ+E++ A+
Sbjct: 268 LRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQA 327

Query: 190 TKITAPMDGTVVYSAV-EVGQTVNANQTTPTIVEMAQLDTMTVKAQISEADVVNVHPGQA 248
+ I AP+ V V G V +T IV + DT+ V A + D+ ++ GQ
Sbjct: 328 SVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIV--PEDDTLEVTALVQNKDIGFINVGQN 385

Query: 249 V 249

Sbjct: 386 A 386



Score = 42.9 bits (101), Expect = 2e-06
Identities = 31/182 (17%), Positives = 69/182 (37%), Gaps = 16/182 (8%)

Query: 7 MKKSSKRKLLLALSGLILLGGGAYFMWHKPETAPTYVTEAVRRGDIENSVLANGMLQAS- 65
S+R L+A I+ F+ + + + +E ANG L S
Sbjct: 50 ETPVSRRPRLVAY--FIMGFLVIAFIL-------SVLGQ------VEIVATANGKLTHSG 94

Query: 66 KLVSVGAQVSGQILSLPLALGDEVKKGDLIAQIDSLAQQNNLKDALASLKSINAQFRAKQ 125
+ + + + + + G+ V+KGD++ ++ +L + + +SL + Q
Sbjct: 95 RSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQ 154

Query: 126 AQIRQAKLEFSRQQQMLADKASSRADYEVADANLTVYQAELEQLEAQKQQAEINVDSARI 185
R +L + ++ + E ++ + + + QK Q E+N+D R
Sbjct: 155 ILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRA 214

Query: 186 DL 187
+
Sbjct: 215 ER 216


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Sbal_3546  HTHFIS  25     0.017    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 25.2 bits (55), Expect = 0.017
Identities = 7/45 (15%), Positives = 15/45 (33%), Gaps = 1/45 (2%)

Query: 7 HKSYPQAFKDEAVLMVLEQ-GYSVADAAKSLGVSTSLLYNWKEKH 50
+ + +L L + AA LG++ + L +
Sbjct: 429 YDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIREL 473


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
Sbal_3560  TYPE4SSCAGX  27     0.043    Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein

signature.
Length = 522

Score = 27.1 bits (59), Expect = 0.043
Identities = 21/62 (33%), Positives = 33/62 (53%), Gaps = 2/62 (3%)

Query: 104 AIDVGDKSKLPDTIKLIIDAVEKGELDEFILKAAKFPTKKAKASAEASKESKSEAKSDTS 163
A+ D + T KLI+DA + EL+E K A K+AK A+ +++ K E + +
Sbjct: 116 ALMTRDYQEFLKTKKLIVDAPDPKELEE--QKKALEKEKEAKEQAQKAQKDKREKRKEER 173

Query: 164 AK 165
AK
Sbjct: 174 AK 175


50  Sbal_3578  Sbal_3639  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias    %GCBias  Product
Sbal_3578       2       20  -2.699174  outer membrane channel protein
Sbal_3579       0       15  -1.242480  hypothetical protein
Sbal_3581       0       18  -0.929866  hypothetical protein
Sbal_3582       0       20  -0.354849  enoyl-CoA hydratase/isomerase
Sbal_3583       2       19   0.663051  phage shock protein C, PspC
Sbal_3584       3       20  -0.080368  hypothetical protein
Sbal_3585       4       18  -0.059930  tRNA-dihydrouridine synthase A
Sbal_3586       5       20   0.061906  phage integrase family protein
Sbal_3587       0       16  -0.711722  hypothetical protein
Sbal_3588       0       15  -0.809944  hypothetical protein
Sbal_3589       0       15  -1.327033  hypothetical protein
Sbal_3590      -1       14  -0.861741  hypothetical protein
Sbal_3591       3       15  -0.651150  hypothetical protein
Sbal_3592       2       14  -0.735159  TP901 family phage tail tape measure protein
Sbal_3593       1       16   1.800028  hypothetical protein
Sbal_3594       0       16   3.389308  hypothetical protein
Sbal_3595       1       17   3.717420  hypothetical protein
Sbal_3596       0       17   3.589386  hypothetical protein
Sbal_3597       0       18   3.292257  gifsy-2 prophage; putative RecA/RadA
Sbal_3598       0       17   3.118751  peptidase S14 ClpP
Sbal_3599       0       21   3.371549  lambda family phage portal protein
Sbal_3600       1       22   2.466077  hypothetical protein
Sbal_3601       1       18   0.127961  phage terminase GpA
Sbal_3602      -1       21  -1.589588  phage DNA packaging Nu1
Sbal_3603      -1       22  -1.484560  hypothetical protein
Sbal_3604       0       25  -1.496564  hypothetical protein
Sbal_3605      -2       37  -0.248759  glycoside hydrolase
Sbal_3606      -1       39   0.764926  hypothetical protein
Sbal_3607       0       39   3.623332  oligoribonuclease
Sbal_3608       1       42   4.211985  hypothetical protein
Sbal_3609       1       44   4.918113  XRE family transcriptional regulator
Sbal_3610       2       43   4.817302  phage integrase family protein
Sbal_3611       0       30   4.046684  replication P family protein
Sbal_3612      -1       24   0.265212  putative replication protein
Sbal_3613       1       24  -2.391149  hypothetical protein
Sbal_3614       2       25  -2.411170  hypothetical protein
Sbal_3615       4       27  -3.805008  hypothetical protein
Sbal_3616       2       25  -3.068782  putative phage repressor
Sbal_3617       3       28  -3.291087  DNA methylase N-4/N-6 domain-containing protein
Sbal_3618       1       24  -1.028807  hypothetical protein
Sbal_3619       4       29   3.229973  hypothetical protein
Sbal_3620       3       31   2.910050  hypothetical protein
Sbal_3621       2       30   2.454967  hypothetical protein
Sbal_3622      -1       23   0.805438  hypothetical protein
Sbal_3623      -1       18  -0.097841  hypothetical protein
Sbal_3624      -1       21  -0.695459  hypothetical protein
Sbal_3625      -1       22  -4.605442  hypothetical protein
Sbal_3626      -1       25  -6.190558  hypothetical protein
Sbal_3627       0       28  -7.580436  integrase catalytic subunit
Sbal_3628       1       30  -8.834242  N-acetyltransferase GCN5
Sbal_3629       1       28  -7.331644  hypothetical protein
Sbal_3630       0       20  -4.604742  hypothetical protein
Sbal_3631       0       17  -3.024981  hypothetical protein
Sbal_3632       0       17  -1.583904  heme/copper-type cytochrome/quinol oxidase
Sbal_3633       1       13   0.146618  hypothetical protein
Sbal_3634       1       10   0.128859  putative hydroxylase
Sbal_3635       0       12   0.005405  TonB-dependent siderophore receptor
Sbal_3636       0       15   0.182310  putative chemotaxis protein CheX
Sbal_3637       1       15   0.315615  alanine racemase
Sbal_3638       3       20  -0.375387  replicative DNA helicase
Sbal_3639       3       25  -1.707638  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
Sbal_3579  OMPADOMAIN  31     0.004    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 31.1 bits (70), Expect = 0.004
Identities = 21/73 (28%), Positives = 27/73 (36%), Gaps = 6/73 (8%)

Query: 1 MKKTLLASAVLGLCMATSAQAATVVG--FTIGGDYWRADTSGTFADKGQPQQTFDYSSSA 58
MKKT +A AV AT AQAA +T W F + P + A
Sbjct: 1 MKKTAIAIAVALAGFATVAQAAPKDNTWYTGAKLGWSQYHDTGFINNNGPTHENQLGAGA 60

Query: 59 QGSY----WIAVE 67
G Y ++ E
Sbjct: 61 FGGYQVNPYVGFE 73


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
Sbal_3592  GPOSANCHOR  43     7e-06    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 43.1 bits (101), Expect = 7e-06
Identities = 37/277 (13%), Positives = 80/277 (28%), Gaps = 22/277 (7%)

Query: 21 EAKKSEQALQELGRESEKLNEQLDDLKRQ-QEAIKAIDSLTESINKGERAYVDNAQALDK 79
K + EL E E+L + E I L E+A
Sbjct: 79 NNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTA 138

Query: 80 LKQEQKQANTEAKNLEKSQQDAAASTAKLETEYSQTAAQLASYDSQLASARAEVERLTTT 139
+ K E L + D + + +A++ + +++ A+ A L
Sbjct: 139 DSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKA 198

Query: 140 QNKGAQASQAQAKALSAAKSDLQQLEAAQKNTATSATKLANELEQERSELTRLGTEVEKA 199
S A + + +++ L A + + + N + +++ L E
Sbjct: 199 LEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAAL 258

Query: 200 GRNKAEYALKVKSARTELNQLGSSLGRNKAELDKQQTVLNKAGID--------------- 244
+AE ++ A + + +AE +
Sbjct: 259 EARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDL 318

Query: 245 ------MGKLADASQELKTKQAGAEAALKGVNDKLAQ 275
+L Q+L+ + +EA+ + + L
Sbjct: 319 DASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDA 355



Score = 36.6 bits (84), Expect = 7e-04
Identities = 37/216 (17%), Positives = 71/216 (32%), Gaps = 1/216 (0%)

Query: 19 SAEAKKSEQALQELGRESEKLNEQLDDLKRQQEAIKA-IDSLTESINKGERAYVDNAQAL 77
SA+ K E L +L + L+ A A I +L D +AL
Sbjct: 175 SAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKAL 234

Query: 78 DKLKQEQKQANTEAKNLEKSQQDAAASTAKLETEYSQTAAQLASYDSQLASARAEVERLT 137
+ + + K LE + A A+LE + +++ + AE L
Sbjct: 235 EGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALE 294

Query: 138 TTQNKGAQASQAQAKALSAAKSDLQQLEAAQKNTATSATKLANELEQERSELTRLGTEVE 197
+ SQ + + DL A+K KL + + + L +++
Sbjct: 295 AEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLD 354

Query: 198 KAGRNKAEYALKVKSARTELNQLGSSLGRNKAELDK 233
+ K + + + + +S + +LD
Sbjct: 355 ASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDA 390



Score = 30.8 bits (69), Expect = 0.041
Identities = 60/345 (17%), Positives = 115/345 (33%), Gaps = 4/345 (1%)

Query: 733 EYVTAKQVEDSKLRSKEIQESITRDLDDITNQTKRMKGESSIAYEGLIKTAVKYRGSIDQ 792
+ + K + E+ DL+ S + L
Sbjct: 100 KLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKAD 159

Query: 793 LSIAQRNQLNDILLSGKYNGDLEKTYRELTATLVRANRETEIEAEFKNKAADASKKKAEE 852
L A +N LE L A + E F + K E
Sbjct: 160 LEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAE 219

Query: 853 DKKAAEAADALAASQGAINNSTKAYAQALKDIEAKQATLNSLYEQGKLSADDLVTASANL 912
A L + N + A + +K +EA++A L + + + + + + S
Sbjct: 220 KAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTAD 279

Query: 913 HQTIKAYNVEVDNSNVKAVTQTELTSAFISKRKELQTQYEKGLLTEKELNISLQELAASH 972
IK E + + + R+ L+ + +K+L Q+L +
Sbjct: 280 SAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQN 339

Query: 973 --TKAVEQSNKSIAATGLLSDAQLDLQEKILRTEKEVRDLEAA-LKDDSKAS-AELTIIK 1028
++A QS + + QL+ + + L + ++ + L+ D AS ++
Sbjct: 340 KISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQVE 399

Query: 1029 AKLAKEEANLADLKRESVELSKIENATYVELLILQRDYEAQLEAL 1073
L + + LA L++ + EL + + T E LQ EA+ +AL
Sbjct: 400 KALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEAKAL 444


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_3603  SECYTRNLCASE  29     0.005    Preprotein translocase SecY subunit signature.
		>SECYTRNLCASE#Preprotein translocase SecY subunit signature.

Length = 437

Score = 29.0 bits (65), Expect = 0.005
Identities = 10/53 (18%), Positives = 19/53 (35%), Gaps = 5/53 (9%)

Query: 31 LTFSAYVSSLMSTIGGAFTLNE-----IAILLGIFFALVTLLANVFYQELRRR 78
L F + ++ S + I I L+ + VF ++ +RR
Sbjct: 193 LMFISIAATFPSALWAIKKQGTLAGGWIEFGTVIAVGLIMVALVVFVEQAQRR 245


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
Sbal_3637  ALARACEMASE  439    e-157    Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 439 bits (1130), Expect = e-157
Identities = 160/350 (45%), Positives = 217/350 (62%), Gaps = 6/350 (1%)

Query: 6 RAEISSSALQTNLAALRQQAPASRVMAVVKANGYGHGLLNVANCLVSADGFGLARLDEAL 65
+A + AL+ NL+ +RQ A +RV +VVKAN YGHG+ + + + + DGF L L+EA+
Sbjct: 6 QASLDLQALKQNLSIVRQAATHARVWSVVKANAYGHGIERIWSAIGATDGFALLNLEEAI 65

Query: 66 ELRAGGVTARLLLLEGFFRATDLPLLVGHDIDTVVHHSSQLEMLEQTVLSKPVTVWLKVD 125
LR G +L+LEGFF A DL + H + T VH + QL+ L+ L P+ ++LKV+
Sbjct: 66 TLRERGWKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKALQNARLKAPLDIYLKVN 125

Query: 126 SGMHRLGFTPEQFSTVYDRLMACPNVAKPIHLMTHFACADEPDNTYTSVQMAAFNSLTAG 185
SGM+RLGF P++ TV+ +L A NV + LM+HFA A+ PD MA G
Sbjct: 126 SGMNRLGFQPDRVLTVWQQLRAMANV-GEMTLMSHFAEAEHPDGISG--AMARIEQAAEG 182

Query: 186 LPGFRTLANSAGALYWPQSQGDWIRPGIALYGVSPVT--GDCGANHGLVPAMELVSQLIA 243
L R+L+NSA L+ P++ DW+RPGI LYG SP D AN GL P M L S++I
Sbjct: 183 LECRRSLSNSAATLWHPEAHFDWVRPGIILYGASPSGQWRDI-ANTGLRPVMTLSSEIIG 241

Query: 244 VRDHKANQPVGYGCFWTAKQDTRLGVVAIGYGDGYPRNAPEGTPVWVNGRRVPIVGRVSM 303
V+ KA + VGYG +TA+ + R+G+VA GY DGYPR+AP GTPV V+G R VG VSM
Sbjct: 242 VQTLKAGERVGYGGRYTARDEQRIGIVAAGYADGYPRHAPTGTPVLVDGVRTMTVGTVSM 301

Query: 304 DMLTVDLGQDAQDKVGDSALLWGKALPVEEVAEHIGTIAYELVTKLTPRV 353
DML VDL Q +G LWGK + +++VA GT+ YEL+ L RV
Sbjct: 302 DMLAVDLTPCPQAGIGTPVELWGKEIKIDDVAAAAGTVGYELMCALALRV 351


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
Sbal_3639  V8PROTEASE  47     1e-07    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 47.3 bits (112), Expect = 1e-07
Identities = 17/51 (33%), Positives = 26/51 (50%), Gaps = 1/51 (1%)

Query: 650 SVPVNFLS-SVDTTGGNSGSPVFNGKGELVGLNFDSTYEAITKDWFFNPTI 699
+ + + TTGGNSGSPVFN K E++G+++ F N +
Sbjct: 220 YLKGEAMQYDLSTTGGNSGSPVFNEKNEVIGIHWGGVPNEFNGAVFINENV 270


51  Sbal_3703  Sbal_3717  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias    %GCBias  Product
Sbal_3703       2       24   0.299381  carboxymuconolactone decarboxylase
Sbal_3704       2       25  -0.005072  diguanylate cyclase
Sbal_3705       1       23   0.493272  excalibur domain-containing protein
Sbal_3706       1       20   0.230033  hypothetical protein
Sbal_3707       1       17   1.040408  SMC domain-containing protein
Sbal_3708       0       17  -0.501986  chaperonin GroEL
Sbal_3709      -1       14  -1.158150  co-chaperonin GroES
Sbal_3710      -1       12  -2.817695  MATE efflux family protein
Sbal_3711       0       15  -4.957162  LysR family transcriptional regulator
Sbal_3712       0       22  -6.313065  RND family efflux transporter MFP subunit
Sbal_3713       0       24  -7.255674  integrase catalytic subunit
Sbal_3714       1       21  -7.730472  IstB ATP binding domain-containing protein
Sbal_3715       1       21  -7.058768  type I restriction-modification system, M
Sbal_3716       1       21  -6.022455  anticodon nuclease
Sbal_3717       0       13  -3.093971  restriction modification system DNA specificity
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Sbal_3707  RTXTOXIND  33     0.005    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 33.3 bits (76), Expect = 0.005
Identities = 28/242 (11%), Positives = 73/242 (30%), Gaps = 41/242 (16%)

Query: 375 TTIVGELHVHANPFIKAEQGLVHL----------------RELTLLKQRYQAAIDQ---- 414
+IV E+ V ++ L+ L + L + RYQ
Sbjct: 104 NSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELN 163

Query: 415 -------------RRVASDNLRRYFANFSQRVGATVNSEHELLRYLANPGFQPDTAWWKI 461
+ V+ + + R + ++ N +++ L + T
Sbjct: 164 KLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTV---- 219

Query: 462 GYNPTAGGVSLSQQAVNYALQLEAADTAIQQNLKQRAQLITERDRLFEARIALAEHAERR 521
A + +L+ + + + + ++ + ++ EA L + +
Sbjct: 220 ----LARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQL 275

Query: 522 KQASVAIAKARQRVDVFEAQNTALITAVVEEETRIAAEVRIQVAYNKLLSLLRHYRSELP 581
+Q I A++ + I + + T + +++A N+ R+ +
Sbjct: 276 EQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVS 335

Query: 582 GT 583

Sbjct: 336 VK 337


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_3710  SECFTRNLCASE  34     0.001    Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 33.7 bits (77), Expect = 0.001
Identities = 25/114 (21%), Positives = 46/114 (40%), Gaps = 14/114 (12%)

Query: 169 MALAAIINLILDPLLIFGIGPFPRLEIQGAAIATLFAWLVALSLSGYLLIVRRKMLEWTV 228
AL A++ L+ D LL G+ +L+ +A L L+++GY + TV
Sbjct: 178 FALGAVVALVHDVLLTVGLFAVLQLKFDLTTVAAL------LTITGY-------SINDTV 224

Query: 229 FDIDRMRANWAKLAHIAQPAALMNLINP-LANAIIMAMLAHIDHSAVAAFGAGT 281
DR+R N K + + +N L+ ++ M + + +G
Sbjct: 225 VVFDRLRENLIKYKTMPLRDVMNLSVNETLSRTVMTGMTTLLALVPMLIWGGDV 278


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Sbal_3712  RTXTOXIND  49     2e-08    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 49.1 bits (117), Expect = 2e-08
Identities = 28/178 (15%), Positives = 63/178 (35%), Gaps = 28/178 (15%)

Query: 104 EAEHELLAADFKRKVELLNRKLISQSEFDSTQAQLKSAKAALAAARDQLSYTRLTAPFSG 163
+ E++L+ FK ++ + T + LA ++ + + AP S
Sbjct: 286 KEEYQLVTQLFKNEI---------LDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSV 336

Query: 164 TIAKRLVDNH-QIVQANQGVLTL-QNNNLLDVSIQVPESMAAGLKQYTDQAHFTAKVRFS 221
+ + V +V + ++ + ++ L+V+ V Q A ++
Sbjct: 337 KVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIG--FINVGQN---AIIKVE 391

Query: 222 AFPEQSF---DAKFKEYSTQVTPGTQ---AYEVVFSLPQP------QDIQLLPGMSAE 267
AFP + K K + + + V+ S+ + ++I L GM+
Sbjct: 392 AFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVT 449



Score = 31.7 bits (72), Expect = 0.005
Identities = 14/104 (13%), Positives = 37/104 (35%), Gaps = 2/104 (1%)

Query: 68 SGQLTELTLVEGQRVAQGSLLAQLDDRDAKNNLMTREAEHELLAADFKRK-VELLNRKLI 126
+ + E+ + EG+ V +G +L +L A+ + + ++ + R + + +L
Sbjct: 104 NSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELN 163

Query: 127 SQSEFD-STQAQLKSAKAALAAARDQLSYTRLTAPFSGTIAKRL 169
E + ++ L + + + K L
Sbjct: 164 KLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKEL 207


52  Sbal_3726  Sbal_3731  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias    %GCBias  Product
Sbal_3726      -3       17   3.013208  thiol:disulfide interchange protein
Sbal_3727      -2       18   3.545404  sodium/hydrogen exchanger
Sbal_3728      -2       15   3.763029  galactokinase
Sbal_3729      -1       17   3.444933  aldose 1-epimerase
Sbal_3730      -1       17   3.186524  hypothetical protein
Sbal_3731      -2       17   3.320039  alcohol dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_3727  TYPE3IMSPROT  29     0.036    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein

family signature.
Length = 354

Score = 29.3 bits (66), Expect = 0.036
Identities = 27/159 (16%), Positives = 56/159 (35%), Gaps = 13/159 (8%)

Query: 146 VAIGILIMQDIFAVLFLTISKGDVPSVWAFALLLLPLAKPLIYKAFDRVGHGELLVLFGL 205
+ + + ++ + + +P A + ++ + Y F + + L +
Sbjct: 43 MGLSDYYFEHFSKLMLIPAEQSYLPFSQALSYVVDNVLLEFFYLCFPLLT---VAALMAI 99

Query: 206 VMALVVGAWLFESVGLKPDLGAL--IIGI-LLAGHKKSSELAKSLFYFKELFLVAFFLTI 262
+V +L +KPD+ + I G + K E KS+ L ++ + +
Sbjct: 100 ASHVVQYGFLISGEAIKPDIKKINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIK 159

Query: 263 G----LNGLPTVSDIVLAALLVLLVPLKILLFVYILTRF 297
G L LPT + + LL + L V F
Sbjct: 160 GNLVTLLQLPTCG---IECITPLLGQILRQLMVICTVGF 195


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Sbal_3728  RTXTOXINA  29     0.033    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family

signature.
Length = 1024

Score = 29.2 bits (65), Expect = 0.033
Identities = 11/41 (26%), Positives = 21/41 (51%), Gaps = 4/41 (9%)

Query: 124 LAAGLSSSGALVVAFGTAISDTSQLHLSPMAVAQLAQRGEH 164
A GLS+S A +A++ L +SP++ +A + +
Sbjct: 296 AAQGLSTSAAAAGLIASAVT----LAISPLSFLSIADKFKR 332


53  Sbal_3759  Sbal_3775  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias    %GCBias  Product
Sbal_3759       2       12   0.641419  hypothetical protein
Sbal_3760       0       15  -1.084804  HAD family hydrolase
Sbal_3761       0       15  -1.398184  hypothetical protein
Sbal_3762       1       22  -2.919230  hydratase/decarboxylase family protein
Sbal_3763       0       21  -3.565706  TonB family protein
Sbal_3764      -1       17  -2.486635  IstB ATP binding domain-containing protein
Sbal_3765      -2       13  -0.738929  integrase catalytic subunit
Sbal_3766      -1       15   1.225503  hypothetical protein
Sbal_3767      -1       11   0.965091  hypothetical protein
Sbal_3769       0       13   2.586031  cytochrome c family protein
Sbal_3770      -1       14   3.151870  response regulator receiver protein
Sbal_3771       0       15   3.137122  secretion protein HlyD family protein
Sbal_3772      -1       14   2.794403  MarR family transcriptional regulator
Sbal_3773      -2       14   2.803052  methyl-accepting chemotaxis sensory transducer
Sbal_3774      -1       19   3.505862  5,10-methylenetetrahydrofolate reductase
Sbal_3775      -2       18   3.541920  bifunctional aspartate kinase II/homoserine
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_3763  PF03544  64     6e-14    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 64.2 bits (156), Expect = 6e-14
Identities = 22/79 (27%), Positives = 36/79 (45%)

Query: 285 QPIFRTLPDYPMSYAQRGKSGWVQLKFTIDEHGFVKNPEILASKGGALFEKESIETLDKW 344
+ + R P YP G V++KF + G V N +IL++K +FE+E + +W
Sbjct: 158 RALSRNQPQYPARAQALRIEGQVKVKFDVTPDGRVDNVQILSAKPANMFEREVKNAMRRW 217

Query: 345 RYAPKFENGKAVYSASISL 363
RY P V + +
Sbjct: 218 RYEPGKPGSGIVVNILFKI 236


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Sbal_3771  RTXTOXIND  55     1e-10    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 55.2 bits (133), Expect = 1e-10
Identities = 37/197 (18%), Positives = 67/197 (34%), Gaps = 18/197 (9%)

Query: 1 MTPDQQFARLV-KFAMFGFVMVFGYFMLADTMMPLTPQAMATRVVT------KVTPQISG 53
TP + RLV F M V+ F +L + A A +T ++ P +
Sbjct: 50 ETPVSRRPRLVAYFIMGFLVIAFILSVLG----QVEIVATANGKLTHSGRSKEIKPIENS 105

Query: 54 KIQTIAVTNNQAVAKGDLLFQVDPAPYELAVSQAKLALEQARQDNAELDASLLAAKAD-- 111
++ I V ++V KGD+L ++ E + + +L QAR + + + +
Sbjct: 106 IVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKL 165

Query: 112 ----VNANKTTSQQKRSEAKRLDALYATH-GVSQQQRDQADSDAAAAEANLLAANARLEK 166
+ E RL +L Q Q+ Q + + A L AR+ +
Sbjct: 166 PELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINR 225

Query: 167 LKVSRGSYGEENLKVRQ 183
+
Sbjct: 226 YENLSRVEKSRLDDFSS 242

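The statistics lines that head each alignment block above ("Score = ... bits (...), Expect = ..." and "Identities = a/b (p%)") can be read programmatically. The following is a minimal Python sketch, not part of PredictBias itself: the function names `parse_expect` and `parse_hit` are hypothetical, and comparing against 0.05 assumes that the "E-value (RPSBlast)" input parameter shown on this page acts as an inclusive upper cut-off.

```python
import re

# Patterns for the two statistics lines that head each alignment block.
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
IDENT_RE = re.compile(r"Identities\s*=\s*(\d+)/(\d+)")


def parse_expect(text):
    """Convert a BLAST Expect string to float.

    Very small values are printed without the leading digit
    (for example "e-103"), so a "1" is restored before conversion.
    """
    return float("1" + text if text.startswith("e") else text)


def parse_hit(score_line, identities_line):
    """Return bit score, raw score, E-value and percent identity."""
    s = SCORE_RE.search(score_line)
    i = IDENT_RE.search(identities_line)
    if s is None or i is None:
        raise ValueError("not a BLAST statistics line")
    matches, length = int(i.group(1)), int(i.group(2))
    return {
        "bits": float(s.group(1)),
        "raw": int(s.group(2)),
        "expect": parse_expect(s.group(3)),
        "identity_pct": 100.0 * matches / length,
    }


hit = parse_hit(
    "Score = 64.2 bits (156), Expect = 6e-14",
    "Identities = 22/79 (27%), Positives = 36/79 (45%)",
)
# Assumption: 0.05 is the E-value threshold listed under "Input parameters".
passes_threshold = hit["expect"] <= 0.05
```

For the Sbal_3763 hit, 22/79 is about 27.8, which the report prints as 27%, so the percentages in this output seem to be truncated rather than rounded.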

54  Sbal_3811  Sbal_3826  Y        Y        N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Sbal_3811  -1      14       3.232166   ABC transporter
Sbal_3812  -1      12       2.277048   ABC transporter
Sbal_3813  -2      12       2.077820   secretion protein HlyD family protein
Sbal_3814  -2      12       1.530028   outer membrane efflux protein
Sbal_3815  -3      13       0.917422   peptidase U62 modulator of DNA gyrase
Sbal_3816  -2      14       1.069947   nitrilase/cyanide hydratase and apolipoprotein
Sbal_3817  -3      14       0.970313   hypothetical protein
Sbal_3818  1       16       -0.103931  ribonuclease G
Sbal_3819  2       18       -0.054979  maf protein
Sbal_3820  2       18       -0.305455  rod shape-determining protein MreD
Sbal_3821  2       19       -0.302578  rod shape-determining protein MreC
Sbal_3822  2       20       -1.058332  rod shape-determining protein MreB
Sbal_3823  3       23       -1.742229  hypothetical protein
Sbal_3824  3       24       -1.555573  MSHA biogenesis protein MshP
Sbal_3825  3       24       -2.123053  MSHA biogenesis protein MshO
Sbal_3826  2       20       -1.121303  MSHA pilin protein MshD
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Sbal_3813  RTXTOXIND  49     8e-09    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 49.4 bits (118), Expect = 8e-09
Identities = 28/186 (15%), Positives = 64/186 (34%), Gaps = 14/186 (7%)

Query: 7 LALVALVALGLILAYGLKLAYSPQPSLLQGQI--EAREYNVSSKVPGRVEQVLVRRGDSV 64
++ + + IL+ ++ + G++ R + V++++V+ G+SV
Sbjct: 62 YFIMGFLVIAFILSVLGQVE---IVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESV 118

Query: 65 AEGDLLFAINSPELDAKLMQAEGGRDAAKAMQLEANNGARSQEVMAAKEQWLKAQAAATL 124
+GD+L + + +A ++ + A+ Q +RS E+ E L +
Sbjct: 119 RKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQN 178

Query: 125 AKTTNTRVENLFNEGVAARQKRDEAFTQWQAAKYTEQAALAMYQMAQEGARVETKAAAAG 184
E + E F+ WQ KY ++ L + +
Sbjct: 179 VSE---------EEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENL 229

Query: 185 NARMAE 190
+
Sbjct: 230 SRVEKS 235


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
Sbal_3817  IGASERPTASE  49     2e-07    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 48.5 bits (115), Expect = 2e-07
Identities = 38/197 (19%), Positives = 57/197 (28%), Gaps = 11/197 (5%)

Query: 1263 AEPVVEELERKSKEIEIPEAILPVVGVERPAAPTEAASHNADITQPVEAKSTEAQSTEAK 1322
E V E +++SK +E E + EA S+ TQ E + +++ E +
Sbjct: 1037 TETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQ 1096

Query: 1323 PIEIKSTEIKPTEVKPSNAKSEPTQAPLSQISEPQLQATQPEALEPQAVKPEEARTEESK 1382
E K T E K + + P Q Q E ++PQA E +
Sbjct: 1097 TTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQE-QSETVQPQAEPARENDPTVNI 1155

Query: 1383 SEELKPATDQIKAPDLKLVQPQATPSLEVPNTPATPENQPAVEPIKQIQGDQNAHQPVAM 1442
E +T N P QP
Sbjct: 1156 KEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENP---ENTTPATTQPTVN 1212

Query: 1443 SE-------QSRRQRQS 1452
SE + RR +S
Sbjct: 1213 SESSNKPKNRHRRSVRS 1229



Score = 38.5 bits (89), Expect = 3e-04
Identities = 33/195 (16%), Positives = 55/195 (28%), Gaps = 31/195 (15%)

Query: 1278 EIPEAILPVVGVERPAAPTEAASHNADITQPVEAKSTEAQSTEAKPIEIKSTEIKP--TE 1335
E E + P R PT T + +P + S+ ++ TE
Sbjct: 1134 EQSETVQPQAEPARENDPTVNIKEPQSQTN--------TTADTEQPAKETSSNVEQPVTE 1185

Query: 1336 VKPSNAKSEPTQAPLSQISEPQLQATQPEALEPQAVKPEEARTEESKSEELKPATDQIKA 1395
N + + P + TQP + KP+ +S P +
Sbjct: 1186 STTVNTGNSVVENPENTTPA----TTQPTVNSESSNKPKNRHRRSVRSV---PHNVEPAT 1238

Query: 1396 PDLKLVQPQATPSLEVPNTPATPENQPAV-------------EPIKQIQGDQNAHQPVAM 1442
A L NT A + A + I Q++ + V +
Sbjct: 1239 TSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVALNVGKAVSQHISQLEMNNEGQYNVWV 1298

Query: 1443 SE-QSRRQRQSAVYR 1456
S + S+ YR
Sbjct: 1299 SNTSMNKNYSSSQYR 1313



Score = 32.3 bits (73), Expect = 0.017
Identities = 30/181 (16%), Positives = 50/181 (27%), Gaps = 2/181 (1%)

Query: 1249 EVISEIRFRVTGTMAEPVVEELERKSKEIEIPEAILPVVGVERPAAPTEAASHNADITQP 1308
EV E + V V + ++KE + E + A E
Sbjct: 1067 EVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVT 1126

Query: 1309 VEAKSTEAQSTEAKPIEIKSTEIKPTEVKPSNAKSEPTQAPLSQISEPQLQATQPEALEP 1368
+ + QS +P + E PT T A Q ++ + E
Sbjct: 1127 SQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTES 1186

Query: 1369 QAVKPEEARTEESKSEELKPATDQIKAPDLKLVQPQATPSLEVPNTPATPENQPAVEPIK 1428
V + E ++ PAT Q +P+ V + P E +
Sbjct: 1187 TTVNTGNSVVENPENTT--PATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDR 1244

Query: 1429 Q 1429

Sbjct: 1245 S 1245


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
Sbal_3821  IGASERPTASE  30     0.022    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 29.6 bits (66), Expect = 0.022
Identities = 25/102 (24%), Positives = 41/102 (40%), Gaps = 9/102 (8%)

Query: 237 EVLTEDGQSYARVTAQPLAALDRIRYVLLIWPSPDSGVTLPNQPTVPAADHSLIENSSKI 296
+V TE Q +VT+Q ++ V P + N PTV + + ++
Sbjct: 1113 KVETEKTQEVPKVTSQVSPKQEQSETVQ-----PQAEPARENDPTVNIKEPQ-SQTNTTA 1166

Query: 297 GSASPAEGTSADATKPVTAPAATATVAKPA---TETTPPATE 335
+ PA+ TS++ +PVT T TTP T+
Sbjct: 1167 DTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQ 1208


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_3822  SHAPEPROTEIN  558    0.0      Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 558 bits (1440), Expect = 0.0
Identities = 317/348 (91%), Positives = 334/348 (95%), Gaps = 1/348 (0%)

Query: 1 MFKKLRGIFSNDLSIDLGTANTLIYVRGEGIVLNEPSVVAIRGERGGSGQKSVAAVGTEA 60
M KK RG+FSNDLSIDLGTANTLIYV+G+GIVLNEPSVVAIR +R GS KSVAAVG +A
Sbjct: 1 MLKKFRGMFSNDLSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGS-PKSVAAVGHDA 59

Query: 61 KQMLGRTPGNIQAIRPMKDGVIADFYVTEKMLQHFIKQVHNNSFFRPSPRVLVCVPVGAT 120
KQMLGRTPGNI AIRPMKDGVIADF+VTEKMLQHFIKQVH+NSF RPSPRVLVCVPVGAT
Sbjct: 60 KQMLGRTPGNIAAIRPMKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGAT 119

Query: 121 QVERRAIRESAMGAGAREVYLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAIISLN 180
QVERRAIRESA GAGAREV+LIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVA+ISLN
Sbjct: 120 QVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLN 179

Query: 181 GVVYSSSVRIGGDKFDDAIINYVRRNYGSLIGEATAERIKHTIGTAYPGDEVLEIEVRGR 240
GVVYSSSVRIGGD+FD+AIINYVRRNYGSLIGEATAERIKH IG+AYPGDEV EIEVRGR
Sbjct: 180 GVVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGR 239

Query: 241 NLAEGVPRSFTLNSNEILEALQEPLSGIVSAVMVALEQSPPELASDISERGMVLTGGGAL 300
NLAEGVPR FTLNSNEILEALQEPL+GIVSAVMVALEQ PPELASDISERGMVLTGGGAL
Sbjct: 240 NLAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGAL 299

Query: 301 LRDLDRLLMQETGIPVMVADDPLTCVARGGGKALEMIDMHGGDLFSEE 348
LR+LDRLLM+ETGIPV+VA+DPLTCVARGGGKALEMIDMHGGDLFSEE
Sbjct: 300 LRNLDRLLMEETGIPVVVAEDPLTCVARGGGKALEMIDMHGGDLFSEE 347


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_3825  BCTERIALGSPG  33     4e-04    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 33.3 bits (76), Expect = 4e-04
Identities = 12/24 (50%), Positives = 19/24 (79%)

Query: 8 RMQTSKRGFTLVEMVTVILILGIL 31
R +RGFTL+E++ VI+I+G+L
Sbjct: 2 RATDKQRGFTLLEIMVVIVIIGVL 25


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_3826  BCTERIALGSPH  37     1e-05    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 37.2 bits (86), Expect = 1e-05
Identities = 18/43 (41%), Positives = 30/43 (69%), Gaps = 3/43 (6%)

Query: 14 RQLGFTLIELVIGMLVIGIAIVMLTSMLFPQA--DRAASTLHR 54
RQ GFTL+E+++ +L++G++ M+ + FP + D AA TL R
Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGMVL-LAFPASRDDSAAQTLAR 43


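Each island in this listing opens with a summary row holding the serial number, the first and last locus tag, three Y/N flags and the prediction. Going by the column header at the top of the page (S.No, Start, End, Bias, Virulence, Insertion elements, Prediction), such a row could be decoded as sketched below. This is an illustrative Python sketch only: `parse_island_row` is a hypothetical helper, and the pattern assumes locus tags of the form letters, underscore, four digits, as in the Sbal_ tags shown here. It accepts the row with or without spaces between the leading fields.

```python
import re

# Serial number, start tag, end tag, three Y/N flags, then the prediction.
# Assumption: locus tags look like "Sbal_3811" (letters, "_", four digits).
ISLAND_ROW_RE = re.compile(
    r"^(\d+)\s*([A-Za-z]+_\d{4})\s*([A-Za-z]+_\d{4})\s*"
    r"([YN])\s+([YN])\s+([YN])\s*(.+)$"
)


def parse_island_row(line):
    """Split an island summary row into named fields."""
    m = ISLAND_ROW_RE.match(line.strip())
    if m is None:
        raise ValueError("not an island summary row")
    return {
        "serial": int(m.group(1)),
        "start": m.group(2),
        "end": m.group(3),
        "bias": m.group(4) == "Y",
        "virulence": m.group(5) == "Y",
        "insertion_elements": m.group(6) == "Y",
        "prediction": m.group(7),
    }


row = parse_island_row(
    "55Sbal_3932Sbal_3958Y        Y        Y"
    "Pathogenicity Island (biased-composition)"
)
```

Here `row["insertion_elements"]` is true for island 55, consistent with the integrase and IstB entries listed in its ORF table.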
55  Sbal_3932  Sbal_3958  Y        Y        Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Sbal_3932  -1      14       3.010430   alcohol dehydrogenase
Sbal_3933  -2      14       3.312115   antibiotic biosynthesis monooxygenase
Sbal_3934  -2      14       3.471653   fumarate reductase iron-sulfur subunit
Sbal_3935  -2      14       3.425820   fumarate reductase flavoprotein subunit
Sbal_3936  -2      10       1.039549   fumarate reductase respiratory complex
Sbal_3937  -2      11       0.113987   fumarate reductase cytochrome b-556 subunit
Sbal_3938  -2      11       -0.601864  50S ribosomal protein L11 methyltransferase
Sbal_3939  -2      14       -2.330005  nifR3 family TIM-barrel protein
Sbal_3940  -1      17       -3.397076  DNA-binding protein Fis
Sbal_3941  -2      16       -3.445349  UvrD/REP helicase
Sbal_3942  1       21       -4.567751  hypothetical protein
Sbal_3943  1       22       -3.551724  type IV pilus assembly PilZ
Sbal_3944  1       22       -3.793972  integrase catalytic subunit
Sbal_3945  4       24       -2.626685  IstB ATP binding domain-containing protein
Sbal_3946  4       27       -2.149138  OmpA/MotB domain-containing protein
Sbal_3948  5       25       -1.850265  integrase catalytic subunit
Sbal_3950  4       24       -3.142681  flagellar biosynthesis sigma factor
Sbal_3951  2       24       -3.630097  flagellar basal body-associated protein FliL
Sbal_3952  1       25       -2.887411  flagellar hook-length control protein
Sbal_3953  2       26       -3.392218  hypothetical protein
Sbal_3954  3       30       -3.725751  flagellar protein FliS
Sbal_3955  2       29       -4.048199  flagellar hook-associated 2 domain-containing
Sbal_3956  3       29       -3.763827  flagellin domain-containing protein
Sbal_3957  2       25       -3.220032  flagellin domain-containing protein
Sbal_3958  2       25       -3.544180  flagellin domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_3940  DNABINDNGFIS  118    1e-38    DNA-binding protein FIS signature.
		>DNABINDNGFIS#DNA-binding protein FIS signature.

Length = 98

Score = 118 bits (297), Expect = 1e-38
Identities = 61/101 (60%), Positives = 83/101 (82%), Gaps = 3/101 (2%)

Query: 1 MFDQTTNTEVHQLTVGKIETANGTIKPQLLRDAVKRAVTNFFAQLDGQEAQEVYEMVLSE 60
MF+Q N++V LTV + + + + + LRD+VK+A+ N+FAQL+GQ+ ++YE+VL+E
Sbjct: 1 MFEQRVNSDV--LTVSTVNSQDQVTQ-KPLRDSVKQALKNYFAQLNGQDVNDLYELVLAE 57

Query: 61 VEAPLLDIIMQHTRGNQTRAANMLGINRGTLRKKLKKYGMN 101
VE PLLD++MQ+TRGNQTRAA M+GINRGTLRKKLKKYGMN
Sbjct: 58 VEQPLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN 98


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_3941  BONTOXILYSIN  32     0.013    Bontoxilysin signature.
		>BONTOXILYSIN#Bontoxilysin signature.

Length = 1196

Score = 32.2 bits (73), Expect = 0.013
Identities = 14/63 (22%), Positives = 24/63 (38%), Gaps = 5/63 (7%)

Query: 338 EYFSRFYYVEKSPFEFEVEGEYFQYLNDNDIRTLKGERVKSFGELYIANWLFSNGIEYVY 397
E S + EV YF YL+++ IR ++ + N++F +Y
Sbjct: 1014 EELSVL---DNPITSEEVIRNYFSYLDNSYIRDSSKSLLEYNKNYQLYNYVFPE--TSLY 1068

Query: 398 EAK 400
E
Sbjct: 1069 EVN 1071


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
Sbal_3946  OMPADOMAIN  41     4e-06    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 40.7 bits (95), Expect = 4e-06
Identities = 30/105 (28%), Positives = 44/105 (41%), Gaps = 9/105 (8%)

Query: 169 FTRGSAQMQPYFEELLLSLGPILKNV---TNSMVISGHTDSTPYAGNLFTNWELSSERAL 225
F A ++P + L L L N+ S+V+ G+TD G+ N LS RA
Sbjct: 223 FNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRI---GSDAYNQGLSERRAQ 279

Query: 226 LARRVLERGGVKRDQVIQVTGMADQIPYIATDTAAAANRRIEALI 270
L G+ D+ I GM + P + +T +R ALI
Sbjct: 280 SVVDYLISKGIPADK-ISARGMGESNP-VTGNTCDNVKQR-AALI 321


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
Sbal_3952  FLGHOOKFLIK  41     5e-06    Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 41.4 bits (96), Expect = 5e-06
Identities = 29/94 (30%), Positives = 49/94 (52%)

Query: 248 AASATQWGPVSLTPMASLAQQSQEILTPLREHLRFQVDQHIKKAELRLDPPELGKIELNI 307
TQ P P+ S S E L +H+ Q + AELRL P +LG++++++
Sbjct: 216 TPHQTQPLPTVAAPVLSAPLGSHEWQQSLSQHISLFTRQGQQSAELRLHPQDLGEVQISL 275

Query: 308 RLEGDRLQVQMHAVNPAIRDALLNGLDRLRVDLA 341
+++ ++ Q+QM + + +R AL L LR LA
Sbjct: 276 KVDDNQAQIQMVSPHQHVRAALEAALPVLRTQLA 309


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Sbal_3956  FLAGELLIN  114    3e-31    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 114 bits (287), Expect = 3e-31
Identities = 79/274 (28%), Positives = 127/274 (46%), Gaps = 18/274 (6%)

Query: 4 VHTNYASIVAQGAVNKSNNLLTNAMERLSTGLRINSASDDAAGLQIANRMSANVKGMETA 63
++TN S++ Q +NKS + L++A+ERLS+GLRINSA DDAAG IANR ++N+KG+ A
Sbjct: 4 INTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQA 63

Query: 64 SRNISDATSMLQTADGALEELTTIANRQKELATQASNGVNSTADLKALDDEFKQLNAEIT 123
SRN +D S+ QT +GAL E+ R +EL+ QA+NG NS +DLK++ DE +Q EI
Sbjct: 64 SRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEID 123

Query: 124 RIVENTTYAGNNLFKDATDGVLVKGVTFQIGSDAAEKMSVTLGAID------------KT 171
R+ T + G + + Q+G++ E +++ L ID
Sbjct: 124 RVSNQTQFNGVKVLSQDN------QMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGP 177

Query: 172 VAGDLLTSAAANTAIGAVDTFLAKVGTERSTLGANINRLGHTAANLGSVTENTKAAAGRI 231
+ ++ + DT+ R + + TA + A
Sbjct: 178 KEATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLT 237

Query: 232 MDADFAVESANMTRNQLLVQAGTTVLSSANQNTG 265
D + ++ + + A G
Sbjct: 238 TDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKG 271



Score = 73.5 bits (180), Expect = 6e-17
Identities = 53/276 (19%), Positives = 101/276 (36%), Gaps = 10/276 (3%)

Query: 7 NYASIVAQGAVNKSNNLLTNAMERLSTGLRINSASDDAAGLQIANRMSANVKGMETASRN 66
+ A N + L + + + + G + + ++
Sbjct: 232 ANGQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKT 291

Query: 67 ISDATSMLQTADGALEELTTIANRQKELATQASNGVNSTADLKALDDEFKQLNAEITRIV 126
+D + T + T+A+ A + + S+ ++ + + T+
Sbjct: 292 GNDGNGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNE 351

Query: 127 ENTTYAGNNLFKDATDGVLVKGVTFQIGSDAAEKMSVTLGAIDKTVAG----------DL 176
+ + + A +K+++ +
Sbjct: 352 SAKLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAA 411

Query: 177 LTSAAANTAIGAVDTFLAKVGTERSTLGANINRLGHTAANLGSVTENTKAAAGRIMDADF 236
+ + ++D+ L+KV RS+LGA NR NLG+ N +A RI DAD+
Sbjct: 412 AAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADY 471

Query: 237 AVESANMTRNQLLVQAGTTVLSSANQNTGLVMGLLR 272
A E +NM++ Q+L QAGT+VL+ ANQ V+ LLR
Sbjct: 472 ATEVSNMSKAQILQQAGTSVLAQANQVPQNVLSLLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Sbal_3957  FLAGELLIN  107    2e-28    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 107 bits (267), Expect = 2e-28
Identities = 80/269 (29%), Positives = 125/269 (46%), Gaps = 13/269 (4%)

Query: 4 VHTNYASIVAQGAVNKSNNLLTNAMERLSTGLRINSASDDAAGLQIANRMSANVKGMETA 63
++TN S++ Q +NKS + L++A+ERLS+GLRINSA DDAAG IANR ++N+KG+ A
Sbjct: 4 INTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQA 63

Query: 64 SRNISDATSMLQTADGALEELTTIANRQKELATQAANGVNSADDLTALNDEFTQLNTEIT 123
SRN +D S+ QT +GAL E+ R +EL+ QA NG NS DL ++ DE Q EI
Sbjct: 64 SRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEID 123

Query: 124 RILDNTTYAGNNLFAKLEAGVTFQIGAGTGEKLVVTTTAIDDAALAAGDLTT-------- 175
R+ + T + G + ++ + + Q+GA GE + + ID +L
Sbjct: 124 RVSNQTQFNGVKVLSQ-DNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEATV 182

Query: 176 ----GANAAIALVDTFIAAVGTERSTLGANINRLGHTAANLASVTENTKAAAGRIMDADF 231
+ + DT+ R + + TA + A D
Sbjct: 183 GDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAE 242

Query: 232 AVESANMTRNQLLVQAGTTVLSSANQNTG 260
+ ++ + + A G
Sbjct: 243 NNTAVDLFKTTKSTAGTAEAKAIAGAIKG 271



Score = 68.5 bits (167), Expect = 3e-15
Identities = 54/266 (20%), Positives = 88/266 (33%), Gaps = 4/266 (1%)

Query: 6 TNYASIVAQGAVNKSNNLLTNAMERLSTGLRINSASDDAAGLQIANRMSANVKGMETASR 65
N ++ + + + D G+ G S
Sbjct: 242 ENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVST 301

Query: 66 NISDATSMLQTADGALEELTTIANRQKELATQAANGVNSADDLTALNDEFTQLNTEITR- 124
I+ L AD A + + VN + +++
Sbjct: 302 TINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEAN 361

Query: 125 ---ILDNTTYAGNNLFAKLEAGVTFQIGAGTGEKLVVTTTAIDDAALAAGDLTTGANAAI 181
++ + AG + T + A +
Sbjct: 362 NAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPL 421

Query: 182 ALVDTFIAAVGTERSTLGANINRLGHTAANLASVTENTKAAAGRIMDADFAVESANMTRN 241
A +D+ ++ V RS+LGA NR NL + N +A RI DAD+A E +NM++
Sbjct: 422 ASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKA 481

Query: 242 QLLVQAGTTVLSSANQNTGLVMGLLR 267
Q+L QAGT+VL+ ANQ V+ LLR
Sbjct: 482 QILQQAGTSVLAQANQVPQNVLSLLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Sbal_3958  FLAGELLIN  110    1e-29    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 110 bits (275), Expect = 1e-29
Identities = 79/269 (29%), Positives = 124/269 (46%), Gaps = 12/269 (4%)

Query: 4 VHTNYASIVAQGAVNKSNNLLTNAMERLSTGLRINSASDDAAGLQIANRMSANVKGMETA 63
++TN S++ Q +NKS + L++A+ERLS+GLRINSA DDAAG IANR ++N+KG+ A
Sbjct: 4 INTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQA 63

Query: 64 SRNISDATSMLQTADGALEELTTIANRQKELATQAANGVNSADDLTALDAEFQQLSLEVD 123
SRN +D S+ QT +GAL E+ R +EL+ QA NG NS DL ++ E QQ E+D
Sbjct: 64 SRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEID 123

Query: 124 RIAKNTTYAGNNLFTAIDGGVTFQIGAGTSETMKVT-----------SAAPVALANTVKL 172
R++ T + G + + D + Q+GA ET+ + V +
Sbjct: 124 RVSNQTQFNGVKVLSQ-DNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEATV 182

Query: 173 DTGDNARLAITAVDDFIKTVGTSRSTLGANINRLGHTAANLASVTENTKAAAGRIMDADF 232
++ +T D + R + + TA + A D
Sbjct: 183 GDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAE 242

Query: 233 AVESANMTRNQLLVQAGTTVLSSANQNTG 261
+ ++ + + A G
Sbjct: 243 NNTAVDLFKTTKSTAGTAEAKAIAGAIKG 271



Score = 73.2 bits (179), Expect = 6e-17
Identities = 53/210 (25%), Positives = 84/210 (40%)

Query: 59 GMETASRNISDATSMLQTADGALEELTTIANRQKELATQAANGVNSADDLTALDAEFQQL 118
+ T ++ GA K + T NG + DD T ++
Sbjct: 298 KVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSD 357

Query: 119 SLEVDRIAKNTTYAGNNLFTAIDGGVTFQIGAGTSETMKVTSAAPVALANTVKLDTGDNA 178
+ + + N + AG + + T++ L N +
Sbjct: 358 LEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKST 417

Query: 179 RLAITAVDDFIKTVGTSRSTLGANINRLGHTAANLASVTENTKAAAGRIMDADFAVESAN 238
+ ++D + V RS+LGA NR NL + N +A RI DAD+A E +N
Sbjct: 418 ANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSN 477

Query: 239 MTRNQLLVQAGTTVLSSANQNTGLVMGLLR 268
M++ Q+L QAGT+VL+ ANQ V+ LLR
Sbjct: 478 MSKAQILQQAGTSVLAQANQVPQNVLSLLR 507


56  Sbal_3967  Sbal_3982  Y        Y        N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Sbal_3967  2       18       -2.324043  flagellar basal-body rod protein FlgF
Sbal_3968  2       20       -3.205702  flagellar hook protein FlgE
Sbal_3969  4       19       -3.685455  flagellar hook capping protein
Sbal_3970  4       18       -4.219096  flagellar basal-body rod protein FlgC
Sbal_3971  3       19       -1.783150  flagellar basal body rod protein FlgB
Sbal_3972  3       19       -0.924565  SAF domain-containing protein
Sbal_3973  2       20       -0.795584  hypothetical protein
Sbal_3974  1       21       -0.639919  hypothetical protein
Sbal_3975  1       22       -0.573625  hypothetical protein
Sbal_3976  1       23       -0.637481  FliI/YscN family ATPase
Sbal_3977  0       23       -2.476041  flagellar assembly protein H
Sbal_3978  0       23       -2.984449  flagellar motor switch protein G
Sbal_3979  2       25       -3.404466  flagellar MS-ring protein
Sbal_3980  3       24       -4.455823  flagellar hook-basal body complex subunit FliE
Sbal_3981  2       23       -3.764966  sigma-54 dependent transcriptional regulator
Sbal_3982  2       21       -3.993339  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
Sbal_3968  FLGHOOKAP1  35     7e-04    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 34.5 bits (79), Expect = 7e-04
Identities = 20/57 (35%), Positives = 29/57 (50%), Gaps = 5/57 (8%)

Query: 2 SFNIALSGLQATTQDLNTISNNIANASTSGFRGGR----SEFASIYNGGQAG-GVGV 53
N A+SGL A LNT SNNI++ + +G+ +++ GG G GV V
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYV 59



Score = 33.0 bits (75), Expect = 0.002
Identities = 14/41 (34%), Positives = 21/41 (51%)

Query: 353 LEGSNVDTTAEMVNLMSAQRNYQSNAKVLDVNSTMQQALLN 393
S V+ E NL Q+ Y +NA+VL + + AL+N
Sbjct: 504 QSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALIN 544


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
Sbal_3970  FLGHOOKAP1  30     0.002    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 30.3 bits (68), Expect = 0.002
Identities = 7/39 (17%), Positives = 19/39 (48%)

Query: 97 SNVNTIEEMADMMAASRSFETSVEIMNRARSMQQGLLQL 135
S VN EE ++ + + + +++ A ++ L+ +
Sbjct: 507 SGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits      Score  E-value  Comments
Sbal_3977  FLGFLIH   58     2e-12    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 57.9 bits (139), Expect = 2e-12
Identities = 38/179 (21%), Positives = 80/179 (44%), Gaps = 2/179 (1%)

Query: 41 QQAFDEGYDEGVIQGKSAGYEAGLEEGRIAGHAAGFHQGKLDGQSAGRSSIDEQLNSLLV 100
+ + ++ + +Q GY+AG+ EGR GH G+ +G G G + Q +
Sbjct: 37 EPSLEQQLAQLQMQAHEQGYQAGIAEGRQQGHKQGYQEGLAQGLEQGLAEAKSQQAPIHA 96

Query: 101 PLGALRELLEDGHAKQVREQQNLILDLVRRVSQQVIRCELTLQPQQILKLVEETLSALPD 160
+ L + + ++ + ++QVI T+ ++K +++ L P
Sbjct: 97 RMQQLVSEFQTTLDALDSVIASRLMQMALEAARQVIGQTPTVDNSALIKQIQQLLQQEPL 156

Query: 161 DQVDVKIHLEPSAVVKLKEL--SEDKIRGWNLIADSSISAGSCRIVSDKSDADASVETR 217
++ + P + ++ ++ + + GW L D ++ G C++ +D+ D DASV TR
Sbjct: 157 FSGKPQLRVHPDDLQRVDDMLGATLSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATR 215


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_3978  FLGMOTORFLIG  173    8e-54    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 173 bits (441), Expect = 8e-54
Identities = 81/324 (25%), Positives = 170/324 (52%), Gaps = 1/324 (0%)

Query: 6 QAAMLLLSMGEEGAAMVMAHLDRNDVQHLSHKMARLSSITQQEAEAVLSRFFQRYKEQSG 65
+AA+LL+S+G E ++ V +L + +++ L+ ++A+L +IT + + VL F + Q
Sbjct: 20 KAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFKELMMAQEF 79

Query: 66 IARASRSYLQKTLDIALGDRVSKSLIDSIYGDEIKVLVKRLEWVDPQLLAREITHEHCQL 125
I + Y ++ L+ +LG + + +I+++ + + DP + I EH Q
Sbjct: 80 IQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFVRRADPANILNFIQQEHPQT 139

Query: 126 QAVLLGLLPPESAAKILKMLPSDSQDEVLVRIAQLGELDRNVVEELRELVERCMLMAMEK 185
A++L L P+ A+ IL LP++ Q V RIA + VV E+ ++E+ + +
Sbjct: 140 IALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVLEKKLASLSSE 199

Query: 186 SHTQVAGVKQVADILNRFE-GDREQLMEMIKLHDKQMAIDVTDNMFDFIILGRQKQETLQ 244
+T GV V +I+N + + ++E ++ D ++A ++ MF F + ++Q
Sbjct: 200 DYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFEDIVLLDDRSIQ 259

Query: 245 TLLGQVPSETLSLALKGIDFELKDSLLNALPKRMSSAIETQIEALGGVPVSRASGARKEI 304
+L ++ + L+ ALK +D +++ + + KR +S ++ +E LG ++++I
Sbjct: 260 RVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTRRKDVEESQQKI 319

Query: 305 MELAKQLMQEGEIELQLFEEQVVV 328
+ L ++L ++GEI + E+ V+
Sbjct: 320 VSLIRKLEEQGEIVISRGGEEDVL 343


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_3979  FLGMRINGFLIF  315    e-103    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 315 bits (808), Expect = e-103
Identities = 158/575 (27%), Positives = 279/575 (48%), Gaps = 60/575 (10%)

Query: 17 SSSGFIAGMTQKWHRFNR-GDRQVIALAL-LAVVVASVIVLMLWTATAGYRPLYGSQENV 74
S++ A + NR I L + + VA V+ ++LW T YR L+ + +
Sbjct: 1 SATASTATQPKPLEWLNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQ 60

Query: 75 DTSQVLNVLDAEGIDYRLDANSGAVLVAEEQVGNARMILAAKGVKAKVPSGMEALDNTAL 134
D ++ L I YR SGA+ V ++V R+ LA +G+ G E LD
Sbjct: 61 DGGAIVAQLTQMNIPYRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKF 120

Query: 135 GTSQFMEQAKYRNSLEGELARTIMSLKLVRAARVHLAIPKQTLFIRQEPELPTASVMLQL 194
G SQF EQ Y+ +LEGELARTI +L V++ARVHLA+PK +LF+R++ + P+ASV + L
Sbjct: 121 GISQFSEQVNYQRALEGELARTIETLGPVKSARVHLAMPKPSLFVREQ-KSPSASVTVTL 179

Query: 195 DPNTRLSESQVEAIVNLVAGSVTGLTASNIKVVDQDGRYLSENISGNQDLSQSRNKQLQY 254
+P L E Q+ A+V+LV+ +V GL N+ +VDQ G L+++ + +DL+ + QL++
Sbjct: 180 EPGRALDEGQISAVVHLVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDLN---DAQLKF 236

Query: 255 TRELENSLVANASSMLEPVLGQENFQVRVTAKVNFNQVEETKESLDPQ------NVVTQE 308
++E+ + ++L P++G N +VTA+++F E+T+E P + +++
Sbjct: 237 ANDVESRIQRRIEAILSPIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQ 296

Query: 309 RTSVDDSSNSIAAGIPGALSNKPPQAGKAAADDKT-----------------------RN 345
+ G+PGALSN+P +A R+
Sbjct: 297 LNISEQVGAGYPGGVPGALSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRS 356

Query: 346 LKQEESRQYDVGRSVRHVRYQQMQLENLSVSVLINSATSQGA----FNDEAQLAKFGNMV 401
++ E+ Y+V R++RH + +E LSV+V++N T + Q+ + ++
Sbjct: 357 TQRNETSNYEVDRTIRHTKMNVGDIERLSVAVVVNYKTLADGKPLPLTAD-QMKQIEDLT 415

Query: 402 KDAIGFSAARGDSFTINAFEFTPTVTAEFTPSPWWQSENY----QAYLRYIIGGILGFGL 457
++A+GFS RGD+ + F V P+WQ +++ A R+++ ++ + L
Sbjct: 416 REAMGFSDKRGDTLNVVNSPF-SAVDNTGGELPFWQQQSFIDQLLAAGRWLLVLVVAWIL 474

Query: 458 ILFVLRPLVKHLTRTAQMTAPRIEPVALSAAPAGALDGPVADAASNQPHQLPSAEWLGSQ 517
+RP LTR + E + A++ ++ Q Q + + LG++
Sbjct: 475 WRKAVRPQ---LTRRVEEAKAAQEQAQVRQETEEAVEVRLSKDEQLQ--QRRANQRLGAE 529

Query: 518 GLPEPGSPLTVKMEHLALLANKEPARVAEVIAHWI 552
V + + +++ +P VA VI W+
Sbjct: 530 ----------VMSQRIREMSDNDPRVVALVIRQWM 554


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
Sbal_3980  FLGHOOKFLIE  49     9e-11    Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature.

Length = 103

Score = 48.5 bits (115), Expect = 9e-11
Identities = 20/72 (27%), Positives = 34/72 (47%), Gaps = 1/72 (1%)

Query: 42 SFTELIKSKVSAVNQDQNQSSMAMAAVDSGKSD-DLVGAMVASQKASLSFATMLQIRNRL 100
SF + + + ++ Q + G+ L M QKAS+S +Q+RN+L
Sbjct: 32 SFAGQLHAALDRISDTQTAARTQAEKFTLGEPGVALNDVMTDMQKASVSMQMGIQVRNKL 91

Query: 101 VQAFDDVMKMPI 112
V A+ +VM M +
Sbjct: 92 VAAYQEVMSMQV 103


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Sbal_3981  HTHFIS  379    e-130    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 379 bits (976), Expect = e-130
Identities = 138/417 (33%), Positives = 209/417 (50%), Gaps = 45/417 (10%)

Query: 51 VKSYLARFPCRNIVALLAPEQGELAAAAMRAGVQDYLLIPVETEQLLASIQR----LRRL 106
+ P ++ + A A A G DYL P + +L+ I R +R
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRR 125

Query: 107 ELPDSS-------LVVSAAVSRQLLMLAHRAATTEASVLLLGESGTGKEPLARYIHRHSS 159
LV +A +++ + R T+ ++++ GESGTGKE +AR +H +
Sbjct: 126 PSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGK 185

Query: 160 RSHKPFVAINCAAIPESILESVLFGHVKGAFTGAICDKAGKFEQANGGTLLLDEIGEMPL 219
R + PFVAIN AAIP ++ES LFGH KGAFTGA G+FEQA GGTL LDEIG+MP+
Sbjct: 186 RRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPM 245

Query: 220 PLQAKLLRVLQEREVERLGGQHAIPLDIRIIASTNRDLRQAVEFGHFREDLFYRLDVLPL 279
Q +LLRVLQ+ E +GG+ I D+RI+A+TN+DL+Q++ G FREDL+YRL+V+PL
Sbjct: 246 DAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPL 305

Query: 280 KIAPLRERKADILPLAEHFLGLYGQSDNASRCYFSEHARQVLVTYDWPGNVRELENCIQR 339
++ PLR+R DI L HF+ + + F + A +++ + WPGNVRELEN ++R
Sbjct: 306 RLPPLRDRAEDIPDLVRHFVQQAEK-EGLDVKRFDQEALELMKAHPWPGNVRELENLVRR 364

Query: 340 ALVMRRGQAIQVAELGLNIQEETLEL------------------------------EPLG 369
+ I + ++ E + + L
Sbjct: 365 LTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALP 424

Query: 370 ATVGLKASKQQAEFQYIIDVLKRFNGQRTLSAQALGMTTRALRYRLVQMREAGIDIE 426
+ + E+ I+ L G + +A LG+ LR + +RE G+ +
Sbjct: 425 PSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKK---IRELGVSVY 478


57  Sbal_4184  Sbal_4189  Y        Y        N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Sbal_4184  2       15       1.330334   phospholipid/glycerol acyltransferase
Sbal_4185  3       16       1.498283   TetR family transcriptional regulator
Sbal_4186  3       16       1.191553   N-acetyltransferase GCN5
Sbal_4187  2       16       0.850384   3'(2'),5'-bisphosphate nucleotidase
Sbal_4188  2       17       0.630550   ADP-ribose diphosphatase NudE
Sbal_4189  3       16       0.622287   Ig family protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_4185  HTHTETR  41     8e-07    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 41.2 bits (96), Expect = 8e-07
Identities = 16/76 (21%), Positives = 29/76 (38%)

Query: 1 MARRKEHSHDEIRAMAIQAATELLTELGVVGLSLRKVASQIGYVPSTLINIFGSYNYLLL 60
MAR+ + E R + A L ++ GV SL ++A G + F + L
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 61 AVSESTLRALHDRLAG 76
+ E + + +
Sbjct: 61 EIWELSESNIGELELE 76


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_4186  SACTRNSFRASE  29     0.006    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 28.8 bits (64), Expect = 0.006
Identities = 14/41 (34%), Positives = 23/41 (56%), Gaps = 1/41 (2%)

Query: 50 VDNLLQGYLLSAQSSDSTCMWILSIAVSEDARGKGVGKRLM 90
++N G + +S+ + I IAV++D R KGVG L+
Sbjct: 72 LENNCIGRI-KIRSNWNGYALIEDIAVAKDYRKKGVGTALL 111


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
Sbal_4189  OMPADOMAIN  46     8e-07    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 46.5 bits (110), Expect = 8e-07
Identities = 39/180 (21%), Positives = 56/180 (31%), Gaps = 26/180 (14%)

Query: 2309 AVVLAGTVSQANAA---DNWYVEGFVGQAQVDSSRRDLQPQAAAGVVTSVDDKDTAFGLS 2365
AV LAG + A AA + WY +G +Q + + G
Sbjct: 9 AVALAGFATVAQAAPKDNTWYTGAKLGWSQYHDT-------GFINNNGPTHENQLGAGAF 61

Query: 2366 VGYQWTPMVAIEFGYADFGNGSARIEGASLTPAQYHEQVKAVTPVLADGVMLGLRFTLLQ 2425
GYQ P V E GY G R+ ++ A GV L +
Sbjct: 62 GGYQVNPYVGFEMGYDWLG----RMPYKGSVENGAYK---------AQGVQLTAKLGYPI 108

Query: 2426 HDAWRFEVPIGLFRWQADISSTMGNSRLTTELDGTDWYAGVRFSYQVSDAWSVGLGYQYV 2485
D +G W+AD S + T G Y ++ + L YQ+
Sbjct: 109 TDDLDIYTRLGGMVWRADTKSNVYGKNHDT---GVSPVFAGGVEYAITPEIATRLEYQWT 165


58  Sbal_4242  Sbal_4256  Y        Y        N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Sbal_4242  1       14       -3.121554  glyoxalase/bleomycin resistance
Sbal_4243  1       15       -3.698355  hypothetical protein
Sbal_4244  1       17       -5.490648  peptidase M48 Ste24p
Sbal_4245  6       23       -8.198435  hypothetical protein
Sbal_4246  1       14       -4.093349  AraC family transcriptional regulator
Sbal_4247  0       12       -0.618357  branched-chain amino acid transport
Sbal_4248  -1      12       0.343413   AzlC family protein
Sbal_4249  -1      11       1.486969   class I and II aminotransferase
Sbal_4250  -1      15       3.236007   hypothetical protein
Sbal_4251  0       17       4.232745   histidine ammonia-lyase
Sbal_4252  -1      12       2.858443   urocanate hydratase
Sbal_4253  0       12       0.909225   histidine utilization repressor
Sbal_4254  1       12       0.218997   imidazolonepropionase
Sbal_4255  2       13       -0.280833  siderophore-interacting protein
Sbal_4256  2       14       -0.490443  MarR family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Sbal_4254  UREASE  43     2e-06    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 42.8 bits (101), Expect = 2e-06
Identities = 22/52 (42%), Positives = 29/52 (55%), Gaps = 8/52 (15%)

Query: 352 TLNAAKALGIEDNVGSLVVGKQADFCLWDIVTPAQLAYSYGVNPCKDVVKNG 403
T+N A A G+ +GSL VGK+AD LW+ PA +GV P V+ G
Sbjct: 410 TINPAIAHGLSHEIGSLEVGKRADLVLWN---PAF----FGVKP-DMVLLGG 453



Score = 32.8 bits (75), Expect = 0.002
Identities = 19/61 (31%), Positives = 29/61 (47%), Gaps = 6/61 (9%)

Query: 23 YGAITNAAIAVKDGKIAWLGPRSE---LPAFDVL---SIPVYRGKGGWITPGLIDAHTHL 76
+ I A I +KDG+IA +G P ++ V G+G +T G +D+H H
Sbjct: 80 HWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGKIVTAGGMDSHIHF 139

Query: 77 I 77
I
Sbjct: 140 I 140


59  Sbal_4298  Sbal_4325  Y        Y        Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Sbal_4298  -2      25       3.136579   major facilitator superfamily transporter
Sbal_4299  -1      27       3.164242   LytTR family two component transcriptional
Sbal_4300  -2      23       2.966029   signal transduction histidine kinase LytS
Sbal_4301  0       15       -0.757120  pirin domain-containing protein
Sbal_4302  1       15       -2.087403  integral membrane sensor signal transduction
Sbal_4303  1       15       -3.194022  two component transcriptional regulator
Sbal_4304  2       16       -3.766536  TrkA domain-containing protein
Sbal_4305  2       18       -4.036344  TrkH family potassium uptake protein
Sbal_4307  4       25       -5.074058  putative DNA mismatch repair protein
Sbal_4308  1       19       -1.068079  transposase IS3/IS911 family protein
Sbal_4309  1       19       -0.820788  hypothetical protein
Sbal_4310  1       20       -2.109462  transposase IS3/IS911 family protein
Sbal_4311  1       24       -1.932299  integrase catalytic subunit
Sbal_4312  1       20       -2.109462  integrase catalytic subunit
Sbal_4314  1       24       -3.252684  hypothetical protein
Sbal_4315  3       28       -5.228193  IS66 Orf2 family protein
Sbal_4317  4       30       -5.396596  hypothetical protein
Sbal_4318  4       27       -4.912877  integrase catalytic subunit
Sbal_4319  4       28       -5.789296  transposase IS3/IS911 family protein
Sbal_4321  5       32       -6.628436  hypothetical protein
Sbal_4322  4       33       -7.260943  hypothetical protein
Sbal_4323  2       26       -4.883384  hypothetical protein
Sbal_4324  1       20       -3.542450  IstB ATP binding domain-containing protein
Sbal_4325  1       21       -3.175810  integrase catalytic subunit
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
Sbal_4298TCRTETA354e-04 Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 35.2 bits (81), Expect = 4e-04
Identities = 54/278 (19%), Positives = 101/278 (36%), Gaps = 39/278 (14%)

Query: 43 PVSQVAFVFGLL----SLSLAVASSMAGKLQERFGVRNVTLGAGLLLGLGFLLTAQASNL 98
+ V +G+L +L + + G L +RFG R V L + + + + A A L
Sbjct: 37 HSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFL 96

Query: 99 MMLYLCAGILVGFADGTGY--------LMTLSNCVKWFPERKGLISALAIGAYGLGSLGF 150
+LY+ I+ G TG + + F G +SA +G G +
Sbjct: 97 WVLYI-GRIVAGITGATGAVAGAYIADITDGDERARHF----GFMSA----CFGFGMVAG 147

Query: 151 KYINVLLLENTGLETTFQLWGLIAMALVLCGGMLMKDA------PAQSAASQLAESRDFT 204
+ L+ F + L G L+ ++ P + A S F
Sbjct: 148 PVLGGLMGG-FSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLAS--FR 204

Query: 205 LAEAMRKPQYWMLALMFLSACMSG----LYVIGVAKDIGEKMVDLPVLVAANAVAVIAMA 260
A M ++A+ F+ + L+VI + + +AA + +
Sbjct: 205 WARGMT-VVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGI----LH 259

Query: 261 NLSGRLVLGILSDKIPRIRVISLAQIITLVGMVLLLFI 298
+L+ ++ G ++ ++ R + L I G +LL F
Sbjct: 260 SLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFA 297


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
Sbal_4299	HTHFIS	68	4e-15	FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 67.5 bits (165), Expect = 4e-15
Identities = 24/132 (18%), Positives = 59/132 (44%), Gaps = 6/132 (4%)

Query: 3 KAIIVEDEYLAREELE-YLVKSHSEIDIVASFEDGLEAFKYLQDHEVDVVFLDIQIPSID 61
++ +D+ R L L ++ ++ I ++ + + D+V D+ +P +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRW---IAAGDGDLVVTDVVMPDEN 61

Query: 62 GLLLAKNLHKSTHPPHVVFVTAYKEF--AVEAFELEAFDYILKPYNEPRIISLLQKIELA 119
L + K+ V+ ++A F A++A E A+DY+ KP++ +I ++ +
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 120 GRQAPKPQHEAA 131
++ P + +
Sbjct: 122 PKRRPSKLEDDS 133


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
Sbal_4300	PF06580	203	2e-62	Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 203 bits (517), Expect = 2e-62
Identities = 60/205 (29%), Positives = 110/205 (53%), Gaps = 13/205 (6%)

Query: 351 EQLQEMTRKAEFTALQSKINPHFLFNALNAISSLIRIRPQQARELIANLADYLRYNLAKG 410
++ M ++A+ AL+++INPHF+FNALN I +LI P +ARE++ +L++ +RY+L
Sbjct: 152 WKMASMAQEAQLMALKAQINPHFMFNALNNIRALILEDPTKAREMLTSLSELMRYSLRYS 211

Query: 411 D-ELIDIQEEVKQVRDYVAIEQARFGDKLEVVFDVDD--VHFCVPCLLLQPLVENAILHG 467
+ + + +E+ V Y+ + +F D+L+ ++ + VP +L+Q LVEN I HG
Sbjct: 212 NARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVENGIKHG 271

Query: 468 IQPRSAPGRVTIEVKKLDAGIRVAVRDTGYGISQEVIDGVAAGRIESSSIGLMNVHQRVK 527
I G++ ++ K + + + V +TG ES+ GL NV +R++
Sbjct: 272 IAQLPQGGKILLKGTKDNGTVTLEVENTG--------SLALKNTKESTGTGLQNVRERLQ 323

Query: 528 LLYGE--GLQLKRLEPGTEVSFYLP 550
+LYG ++L + +P
Sbjct: 324 MLYGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
Sbal_4303	HTHFIS	92	8e-24	FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 92.2 bits (229), Expect = 8e-24
Identities = 35/112 (31%), Positives = 60/112 (53%), Gaps = 1/112 (0%)

Query: 4 KVLVVDDEAQIHTFMRISLEAEGFEYHGAASIASALAQYQAQRPHVLVLDLGLPDGDGIS 63
+LV DD+A I T + +L G++ ++ A+ A ++V D+ +PD +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 64 LLQNLRQHDK-VPVLILTARDQEEEKIRLLEAGANDYLSKPFGIRELIARIK 114
LL +++ +PVL+++A++ I+ E GA DYL KPF + ELI I
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
Sbal_4308	HTHFIS	26	0.043	FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 25.6 bits (56), Expect = 0.043
Identities = 10/59 (16%), Positives = 21/59 (35%), Gaps = 6/59 (10%)

Query: 7 HKSYPQAFKDEAVLMVLEQ-GYSVADAAKSLGVSTSLLYNWKEKHEALKQGITLEESER 64
+ + +L L + AA LG++ + L + G+++ S R
Sbjct: 429 YDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIREL-----GVSVYRSSR 482


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
Sbal_4322	IGASERPTASE	30	0.023	IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 30.0 bits (67), Expect = 0.023
Identities = 20/99 (20%), Positives = 36/99 (36%), Gaps = 2/99 (2%)

Query: 371 SKSTTDELRLSLLKRKNESKKETPNNEASEEAKVITESKGKFVTIPGDLS-RPHHPEFAE 429
+ + + + KET E E+AKV TE + + +S + E +
Sbjct: 1081 QTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQ 1140

Query: 430 RITELQYEDDLHALIEDP-ALIQRASEEEHPLDMLKSAG 467
E E+D I++P + ++ E P S
Sbjct: 1141 PQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNV 1179


60	Sbal_0247	Sbal_0251	N	Y	N	Pathogenicity Island (unbiased-composition)
LocusTag	DNBias	CDNBias	%GCBias	Product
Sbal_0247	-1	11	2.195249	short-chain dehydrogenase/reductase SDR
Sbal_0248	0	11	1.876437	aldehyde dehydrogenase
Sbal_0249	1	18	0.761935	Fis family GAF modulated sigma54 specific
Sbal_0250	1	21	-0.906908	alcohol dehydrogenase
Sbal_0251	0	20	-2.453796	flavocytochrome c
ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
Sbal_0247	DHBDHDRGNASE	98	8e-27	2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 98.2 bits (244), Expect = 8e-27
Identities = 71/257 (27%), Positives = 113/257 (43%), Gaps = 16/257 (6%)

Query: 6 IALITGASRGLGKNTAQKLAAQGIDIILTYQTNAAAAAEVVAEIEWLGRKAVALPLDVSD 65
IA ITGA++G+G+ A+ LA+QG I N +VV+ ++ R A A P DV D
Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHI-AAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 66 SGSFAEFAIQVSTVLAQTWQRESFNYLINNAGIGIHVPMAETSIEQFDTLMNIHVKGPFF 125
S + E + + + + L+N AG+ + S E+++ +++ G F
Sbjct: 69 SAAIDE---ITARIEREM---GPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122

Query: 126 LTQTLLPQLMD--GGSIVNISTGLTRFAIPGFGAYATMKGAVETMTKYWAKELGSRGIRV 183
++++ +MD GSIV + + AYA+ K A TK EL IR
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 184 NVLAPGAIETDFGGGAVRDNEQMNQFLAQQTA-------LGRVGLPDDIGGAISALLSPA 236
N+++PG+ ETD D Q + L ++ P DI A+ L+S
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 237 AAWINAQRIEASGGMFL 253
A I + GG L
Sbjct: 243 AGHITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
Sbal_0249	HTHFIS	331	e-108	FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 331 bits (849), Expect = e-108
Identities = 118/359 (32%), Positives = 189/359 (52%), Gaps = 29/359 (8%)

Query: 296 RDPQLERAWQHANKVITKQIPLLVLGETGVGKEQFVKKLHAQSARRTEHLVAVNCAALPA 355
R ++ ++ +++ + L++ GE+G GKE + LH RR VA+N AA+P
Sbjct: 142 RSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPR 201

Query: 356 ELVESELFGYQAGAFTGANRTGFIGKIRQAHGGFLFLDEIGEMPLAAQSRLLRVLQEREV 415
+L+ESELFG++ GAFTGA G+ QA GG LFLDEIG+MP+ AQ+RLLRVLQ+ E
Sbjct: 202 DLIESELFGHEKGAFTGAQTRS-TGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEY 260

Query: 416 VPVGSNQSFKVDIQIIAATHMDLEQQVAQGLFRQDLFYRLNGLQVRLPALRERQ-DIERI 474
VG + D++I+AAT+ DL+Q + QGLFR+DL+YRLN + +RLP LR+R DI +
Sbjct: 261 TTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDL 320

Query: 475 IH---KLHRKHRIAPQEICPELLGLLMQHDWPGNLRELDNLMQVACLMAEGDDTLTWQHL 531
+ + K + + E L L+ H WPGN+REL+NL++ + D +T + +
Sbjct: 321 VRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQ-DVITREII 379

Query: 532 PDYLAQKLACEPLKVDPLNAQLLNTQLLNEEQPLGEEVKTGQNSASHPLAGKVVSGKVTS 591
+ L ++ P++ + + + V+
Sbjct: 380 ENELRSEIPDSPIEKAAARS---------GSLSISQAVEENMRQYFASFGD--------- 421

Query: 592 GNIAVQPTANAVQSDSLHEAIYSNVLQAYQACGGNVSQCAKRLGISRNALYRRLKQMGL 650
+ + L E Y +L A A GN + A LG++RN L ++++++G+
Sbjct: 422 -----ALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
Sbal_0250	NUCEPIMERASE	30	0.014	Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 29.8 bits (67), Expect = 0.014
Identities = 13/28 (46%), Positives = 18/28 (64%), Gaps = 2/28 (7%)

Query: 151 ILVTGASGGVGS-VAVTLLANAGYRVVA 177
LVTGA+G +G V+ LL G++VV
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEA-GHQVVG 29


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
Sbal_0251	HTHFIS	29	0.008	FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.0 bits (65), Expect = 0.008
Identities = 10/31 (32%), Positives = 17/31 (54%), Gaps = 1/31 (3%)

Query: 39 KWDKEIEILIVGSGFAGLAAAIEATRKGAKD 69
K ++ +L++ S AI+A+ KGA D
Sbjct: 71 KARPDLPVLVM-SAQNTFMTAIKASEKGAYD 100


61	Sbal_0259	Sbal_0267	N	Y	N	Pathogenicity Island (unbiased-composition)
LocusTag	DNBias	CDNBias	%GCBias	Product
Sbal_0259	1	12	-0.118301	TetR family transcriptional regulator
Sbal_0260	0	13	1.613782	integral membrane sensor signal transduction
Sbal_0261	0	12	1.883782	two component transcriptional regulator
Sbal_0262	-1	13	1.358564	hypothetical protein
Sbal_0263	-1	14	1.020954	cation diffusion facilitator family transporter
Sbal_0264	-1	17	0.975664	hypothetical protein
Sbal_0266	-2	17	1.331263	nitrogen metabolism transcriptional regulator,
Sbal_0267	-2	18	0.255685	signal transduction histidine kinase, nitrogen
ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
Sbal_0259	HTHTETR	39	5e-06	TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 39.2 bits (91), Expect = 5e-06
Identities = 14/82 (17%), Positives = 31/82 (37%), Gaps = 3/82 (3%)

Query: 4 WEQRSHYLIEVAQRSLIGHKTFD-LCRSHLVAASQISKGTIYNHFTTEADLVVAVACAEY 62
++ ++++VA L + + A+ +++G IY HF ++DL +
Sbjct: 9 AQETRQHILDVA-LRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSE 67

Query: 63 EEWLAAAKRDM-QRYPDPLTRF 83
+ DPL+
Sbjct: 68 SNIGELELEYQAKFPGDPLSVL 89


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
Sbal_0260	PF06580	39	3e-05	Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 39.1 bits (91), Expect = 3e-05
Identities = 21/118 (17%), Positives = 46/118 (38%), Gaps = 12/118 (10%)

Query: 286 EAEQLEKLISELLELSRVKLSTNETKVHLGLAESLSQVLDDAEFEAEQQGK--TITIDID 343
+ + ++++ L EL R L + + LA+ L+ V + + Q I+
Sbjct: 189 DPTKAREMLTSLSELMRYSLRYSNARQVS-LADELTVVDSYLQLASIQFEDRLQFENQIN 247

Query: 344 EEIELAHFPKSLSRAIENLLRNAIRYAASD------IHLQASATADQVQITIKDDGPG 395
I P L ++ L+ N I++ + I L+ + V + +++ G
Sbjct: 248 PAIMDVQVPPML---VQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSL 302


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
Sbal_0261	HTHFIS	97	2e-25	FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 97.2 bits (242), Expect = 2e-25
Identities = 44/163 (26%), Positives = 76/163 (46%), Gaps = 3/163 (1%)

Query: 2 SRILLIDDDLGLSELLGQLLELEGFQLTLAYDGKQGLDLALSADYDLILLDVMLPKLNGF 61
+ IL+ DDD + +L Q L G+ + + + + D DL++ DV++P N F
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 EVLRALRQH-KQTPVLMLTARGDEIDRVVGLEIGADDYLPKPFNDRELIARIRAIIRRSN 120
++L +++ PVL+++A+ + + E GA DYLPKPF+ ELI I +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 121 LTTQEIHAAPAQEFGDLRLDPSRQEAYCNEQLIILTGTEFTLL 163
++ + + QE Y L L T+ TL+
Sbjct: 124 RRPSKLEDDSQDGMPLVGRSAAMQEIY--RVLARLMQTDLTLM 164


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
Sbal_0266	HTHFIS	560	0.0	FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 560 bits (1445), Expect = 0.0
Identities = 197/473 (41%), Positives = 294/473 (62%), Gaps = 11/473 (2%)

Query: 7 VWILDDDSSIRWVLEKALQGAKLSTASFAAAESLWQALEISQPHVIVSDIRMPGTDGLSL 66
+ + DDD++IR VL +AL A + A +LW+ + ++V+D+ MP + L
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 67 LERLQVHYPHIPVIIMTAHSDLDSAVSAYQAGAFEYLPKPFDIDEAISLVERALTHATEQ 126
L R++ P +PV++M+A + +A+ A + GA++YLPKPFD+ E I ++ RAL +
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRR 125

Query: 127 SPAPAQEAQVKTPEIIGEAPAMQEVFRAIGRLSRSSISVLINGQSGTGKELVAGALHKHS 186
P+ ++ ++G + AMQE++R + RL ++ ++++I G+SGTGKELVA ALH +
Sbjct: 126 -PSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYG 184

Query: 187 PRKDKPFIALNMAAIPKDLIESELFGHEKGAFTGAANVRQGRFEQANGGTLFLDEIGDMP 246
R++ PF+A+NMAAIP+DLIESELFGHEKGAFTGA GRFEQA GGTLFLDEIGDMP
Sbjct: 185 KRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMP 244

Query: 247 LDVQTRLLRVLADGQFYRVGGHNAVQVDVRIIAATHQDLELLVQKGGFREDLFHRLNVIR 306
+D QTRLLRVL G++ VGG ++ DVRI+AAT++DL+ + +G FREDL++RLNV+
Sbjct: 245 MDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVP 304

Query: 307 VHLPPLSQRREDIPQLATHFLASAAKEIGVETKIMTKETAVKLSQLPWPGNVRQLENTCR 366
+ LPPL R EDIP L HF+ A KE G++ K +E + PWPGNVR+LEN R
Sbjct: 305 LRLPPLRDRAEDIPDLVRHFVQQAEKE-GLDVKRFDQEALELMKAHPWPGNVRELENLVR 363

Query: 367 WLTVMASGQEILPQDLPPELLKDPVSVTHTAKGSQDWQSALTEWIDQKLSE--------- 417
LT + I + + EL + ++ ++++ +++ + +
Sbjct: 364 RLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDAL 423

Query: 418 GNSDLLTEVQPAFERILLETALRHTQGHKQEAAKRLGWGRNTLTRKLKELSMD 470
S L V E L+ AL T+G++ +AA LG RNTL +K++EL +
Sbjct: 424 PPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
Sbal_0267	PF06580	39	3e-05	Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 38.7 bits (90), Expect = 3e-05
Identities = 36/188 (19%), Positives = 70/188 (37%), Gaps = 33/188 (17%)

Query: 166 TLIIEQADRLRNLVDRL-------LGPQRPTQHSLHNIHQVVQKVYKLVEMALPANIQLK 218
LI+E + R ++ L L Q SL + VV +L + +Q +
Sbjct: 184 ALILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFE 243

Query: 219 RDYDPSIPDIEMDPDQMQQAVLNILQNAVQALEHTGGEILLRTRTQHQVTIGSQRHKLVL 278
+P+I D+++ P +Q V N +++ + L GG+ILL+ + +
Sbjct: 244 NQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQ-GGKILLKGTKDNGT----------V 292

Query: 279 TLSIIDNGPGIPPELMDTLFYPMVTGREQGSGLGLSIAHNIARLHSG---RIDCLSSAGH 335
TL + + G ++ +G GL ++ G +I G
Sbjct: 293 TLEVENTGSLALKN------------TKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGK 340

Query: 336 TEFIISLP 343
++ +P
Sbjct: 341 VNAMVLIP 348


62	Sbal_0346	Sbal_0353	N	Y	N	Pathogenicity Island (unbiased-composition)
LocusTag	DNBias	CDNBias	%GCBias	Product
Sbal_0346	-1	19	3.176612	ATP-dependent DNA helicase RecG
Sbal_0347	0	19	2.047455	two component LuxR family transcriptional
Sbal_0348	-1	18	2.290924	integral membrane sensor signal transduction
Sbal_0349	-1	20	2.743477	hypothetical protein
Sbal_0350	-1	20	2.594760	CaCA family Na(+)/Ca(+) antiporter
Sbal_0351	0	16	3.748574	AMP-dependent synthetase and ligase
Sbal_0352	-1	13	3.590034	putative endoribonuclease L-PSP
Sbal_0353	-1	13	3.320085	bifunctional (p)ppGpp synthetase II/
ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
Sbal_0346	SECA	41	2e-05	SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 41.0 bits (96), Expect = 2e-05
Identities = 30/84 (35%), Positives = 39/84 (46%), Gaps = 8/84 (9%)

Query: 294 MRLVQGDV-----GSGKTLVAAMAA-LQAIENGYQVAMMAPTELLAEQHATNFAAWFEPL 347
M L + + G GKTL A + A L A+ G V ++ + LA++ A N FE L
Sbjct: 92 MVLNERCIAEMRTGEGKTLTATLPAYLNALT-GKGVHVVTVNDYLAQRDAENNRPLFEFL 150

Query: 348 GLKVGW-LAGKLKGKVRAQSLADI 370
GL VG L G R ADI
Sbjct: 151 GLTVGINLPGMPAPAKREAYAADI 174


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
Sbal_0347	HTHFIS	76	1e-18	FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 76.4 bits (188), Expect = 1e-18
Identities = 29/117 (24%), Positives = 51/117 (43%), Gaps = 1/117 (0%)

Query: 2 KILLAEDQAMVRGALAALLTLAGGFNITQASDGDEALSLLKQQSFDLLLTDIEMPGRTGL 61
IL+A+D A +R L L+ AG +++ S+ + DL++TD+ MP
Sbjct: 5 TILVADDDAAIRTVLNQALSRAG-YDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 ELAAWLKDQHSQTKVVVITTFGRAGYIKRAIEAGVGGFLLKDAPSETLVNAIQQVMA 118
+L +K V+V++ +A E G +L K L+ I + +A
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
Sbal_0348	PF06580	37	8e-05	Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 37.2 bits (86), Expect = 8e-05
Identities = 65/358 (18%), Positives = 117/358 (32%), Gaps = 53/358 (14%)

Query: 1 MISTHLQLERKLAWVYLINLVFYL---IPLAINAYPAWKIALSFAVLIPFIASYF-WAYK 56
M STH Q + + I Y A ++ F + I + AY+
Sbjct: 1 MASTHRQANKYYWYCQGIGWGVYTLTGFGFASLYGSPKLHSMIFNIAISLMGLVLTHAYR 60

Query: 57 CNQNSAYRPILMMVAIATAITPINPGSISLFTFAAFFIGF-FYPLRTCLLAIAALIGLLF 115
L M I + P A IG ++ T + + A I
Sbjct: 61 SFIKRQGWLKLNMGQIILRVLP-----------ACVVIGMVWFVANTSIWRLLAFINTKP 109

Query: 116 ALNEIYDFNSYYFPLYGSGLVLGVGMFG------VAERRRHQHKLKEQQSTQEISTLAAM 169
+ S F + + + FG + Q K+ ++ L A
Sbjct: 110 VAFTLPLALSIIFNVVVVTFMWSLLYFGWHFFKNYKQAEIDQWKMASMAQEAQLMALKAQ 169

Query: 170 VERERIARDLHDIMGHSLSSIALKAELAEKLLAKQEYQLATIQLNELGQIARESLSQIRH 229
+ + L++I + I A ++L L ++ R SL
Sbjct: 170 INPHFMFNALNNIR----ALILEDPTKAREMLTS------------LSELMRYSLRYSNA 213

Query: 230 TVSDYKHKV-LADSVTQLCKLLREKGISVELTGNIPKLPARMESQLGLIVTELVNNILRH 288
++ + DS QL + E + E N + ++ ++V LV N ++H
Sbjct: 214 RQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPP---MLVQTLVENGIKH 270

Query: 289 SGASQC------IIDFIQQADRLVVEVKDNGP----SKPIAEGNGLTGIRERLDSLGG 336
G +Q ++ + + +EV++ G + + G GL +RERL L G
Sbjct: 271 -GIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQMLYG 327


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
Sbal_0353	PF07328	32	0.003	T-DNA border endonuclease VirD1
		>PF07328#T-DNA border endonuclease VirD1

Length = 144

Score = 31.9 bits (72), Expect = 0.003
Identities = 17/68 (25%), Positives = 32/68 (47%), Gaps = 4/68 (5%)

Query: 506 PEQIEKVI----RDTKHTTLDSLLADIGLGNAMSIVIAQRLIGDNLENQESRDGHMMPIR 561
P +++KVI + + D+ +A++GL ++ IA R IG +EN + +
Sbjct: 16 PARVDKVISVKMTEAELAEFDAQIAELGLNRNRALRIAARRIGGFVENDAKTVELLRDMS 75

Query: 562 GAEGMLVT 569
A + T
Sbjct: 76 RAIAGVAT 83


63	Sbal_0477	Sbal_0487	N	Y	N	Pathogenicity Island (unbiased-composition)
LocusTag	DNBias	CDNBias	%GCBias	Product
Sbal_0477	-1	20	4.566645	RND family efflux transporter MFP subunit
Sbal_0478	-1	21	4.106839	CzcA family heavy metal efflux protein
Sbal_0479	1	18	2.225971	antibiotic biosynthesis monooxygenase
Sbal_0480	1	17	2.278033	large-conductance mechanosensitive channel
Sbal_0481	1	18	2.585879	LysR family transcriptional regulator
Sbal_0482	-1	17	2.869009	secretion protein HlyD family protein
Sbal_0483	0	18	2.701262	EmrB/QacA family drug resistance transporter
Sbal_0484	0	19	0.374637	N-acetyltransferase GCN5
Sbal_0485	-1	19	-1.317787	antibiotic biosynthesis monooxygenase
Sbal_0486	0	21	-1.007184	radical SAM domain-containing protein
Sbal_0487	-1	21	-1.140651	isochorismatase hydrolase
ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
Sbal_0477	RTXTOXIND	54	4e-10	Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 54.4 bits (131), Expect = 4e-10
Identities = 36/182 (19%), Positives = 64/182 (35%), Gaps = 22/182 (12%)

Query: 126 RATATLVVDRDRTATLAPQLDVRVLARHVVPGQEVKKGEPLLTLGGSAVAQAQADYINAA 185
R V++ R + L + +A+H V QE + +N
Sbjct: 225 RYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQE----------------NKYVEAVNEL 268

Query: 186 AEWSRVKRMSEGAVSVSRRMQAQVDAELKRAILEAIKMTPAQIRTLE----STPEAIGSY 241
+ E + ++ V K IL+ ++ T I L E +
Sbjct: 269 RVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQAS 328

Query: 242 QLLAPIDGRVQQ-DIAMLGQVFSAGTPLMQLT-DESYLWVEAQLTPTQTAHITVGSAALV 299
+ AP+ +VQQ + G V + LM + ++ L V A + I VG A++
Sbjct: 329 VIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAII 388

Query: 300 QV 301
+V
Sbjct: 389 KV 390



Score = 38.3 bits (89), Expect = 5e-05
Identities = 24/148 (16%), Positives = 53/148 (35%), Gaps = 5/148 (3%)

Query: 118 IANLNLDIRATATLVVDRDRTATLAPQLDVRVLARHVVPGQEVKKGEPLLTLGG----SA 173
+ + + A L R+ + P + V V G+ V+KG+ LL L +
Sbjct: 77 LGQVEIVATANGKLTHS-GRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEAD 135

Query: 174 VAQAQADYINAAAEWSRVKRMSEGAVSVSRRMQAQVDAELKRAILEAIKMTPAQIRTLES 233
+ Q+ + A E +R + +S D + + E + + +
Sbjct: 136 TLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQF 195

Query: 234 TPEAIGSYQLLAPIDGRVQQDIAMLGQV 261
+ YQ +D + + + +L ++
Sbjct: 196 STWQNQKYQKELNLDKKRAERLTVLARI 223


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
Sbal_0478	ACRIFLAVINRP	659	0.0	Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 659 bits (1702), Expect = 0.0
Identities = 224/1075 (20%), Positives = 434/1075 (40%), Gaps = 73/1075 (6%)

Query: 9 AIKNRLLVVLALLAMIVASVVMLPKLNLDAFPDVTNVQVTINTAAEGLAAEEVEKLISYP 68
I+ + + + +++A + + +L + +P + V+++ G A+ V+ ++
Sbjct: 5 FIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQV 64

Query: 69 VESAMYALPAVTEVRSLS-RTGLSIVTVVFAEGTDIYFARQQVFEQLQAAREMIPSGVGV 127
+E M + + + S S G +T+ F GTD A+ QV +LQ A ++P V
Sbjct: 65 IEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQQ 124

Query: 128 PEIGPNTSGLGQIYQYILRAEPNSGIDAAELRSLNDYLVKLIMMPVGGVTEVLSFGGDVR 187
I S + ++ N G ++ VK + + GV +V FG
Sbjct: 125 QGISVEKSSSSYLMVAGFVSD-NPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQ-Y 182

Query: 188 QYQVQVDPNKLRAYGLSMAQVTEALESNNRNAGGWFMDQGQE------QLVVRGYGMLPA 241
++ +D + L Y L+ V L+ N + +
Sbjct: 183 AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKN 242

Query: 242 GEQGLAAIAQIPLTEDK-GTPVRVGDIAQVDFGSEIRVGAVTMTRRDEAGKVQNLGEVVA 300
E+ ++ L + G+ VR+ D+A+V+ G E + N
Sbjct: 243 PEE----FGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARI----------NGKPAAG 288

Query: 301 GVVLKRMGANTKATIDDIGARVSLIEQALPDGVSFEVFYDQAELVDKAVTTVRDALLMAF 360
+ GAN T I A+++ ++ P G+ YD V ++ V L A
Sbjct: 289 LGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAI 348

Query: 361 VFIVVILALFLVNIRATLLVLLSIPVSIGLALMVMSYYGLSANLMSLGGLAVAIGMLVDG 420
+ + +++ LFL N+RATL+ +++PV + +++ +G S N +++ G+ +AIG+LVD
Sbjct: 349 MLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDD 408

Query: 421 SVVMVENIFKHLTQPDRRHLLEARTRADGEADPYHSDEDGGQQANMAVRIMLAAKEVCSP 480
++V+VEN+ + + ED + M ++
Sbjct: 409 AIVVVENVERVMM------------------------EDKLPPKEATEKSM---SQIQGA 441

Query: 481 IFFATAIIIVVFAPLFALEGVEGKLFQPMAVSIILAMISALLVALIAVPALAVYLFK--- 537
+ ++ VF P+ G G +++ +++I+ AM ++LVALI PAL L K
Sbjct: 442 LVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVS 501

Query: 538 -------RGVVLKQSVVLGPLDAAYRKLLTATLARPKVVMLSALLMFALSLLLLPRLGTE 590
G + Y + L +L L+ A ++L RL +
Sbjct: 502 AEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSS 561

Query: 591 FVPELEEGTINLRVTLAPTASLGTSLAVAPKLEAILLEFPEVEYALSRIGAPELGGDPEP 650
F+PE ++G + L A+ + V ++ L+ + S +
Sbjct: 562 FLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNEKANVE-SVFTVNGFSFSGQA 620

Query: 651 VSNIEVYIGLKPISEWQSASSRLE--LQRLMEEKLSVFPGLLLTFSQPIATRVDELLSGV 708
+ ++ LKP E + E + R E + G ++ F+ P + EL +
Sbjct: 621 QNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGFVIPFNMP---AIVELGTAT 677

Query: 709 KAQLA-IKIFGPDLAVLSERGQALTDLVAKIPGAV-DVSLEQVSGEAQLVVRPKRELLAR 766
I G L++ L + A+ P ++ V + AQ + +E
Sbjct: 678 GFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQA 737

Query: 767 YGISVDQVMSLVSQGIGGGSAGQVIDGNARYDINVRLAAEFRTSPDAIKDLLLSGTNGAT 826
G+S+ + +S +GG ID + V+ A+FR P+ + L + NG
Sbjct: 738 LGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEM 797

Query: 827 VRLGEVASVEVEMAPPNIRRDDVQRRVVVQANVA-GRDMGSVVKDIYALVPQADLPAGYT 885
V + P + R + + +Q A G G + + L + LPAG
Sbjct: 798 VPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAMALMENLASK--LPAGIG 855

Query: 886 VIIGGQYENQQRAQQKLMLVVPISIALIALLLYFSFGSFKQVLLIMANVPLALIGGIVAL 945
G ++ + + +V IS ++ L L + S+ + +M VPL ++G ++A
Sbjct: 856 YDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAA 915

Query: 946 YVSGTYLSVPSSIGFITLFGVAVLNGVVLVDSINQ-RRQSGEALYDCVYEGTVGRLRPVL 1004
+ V +G +T G++ N +++V+ + G+ + + RLRP+L
Sbjct: 916 TLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPIL 975

Query: 1005 MTALTSALGLIPILLSSGVGSEIQKPLAVVIIGGLFSSTALTLLVLPTLYRWLYR 1059
MT+L LG++P+ +S+G GS Q + + ++GG+ S+T L + +P + + R
Sbjct: 976 MTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRR 1030



Score = 113 bits (285), Expect = 1e-27
Identities = 81/544 (14%), Positives = 184/544 (33%), Gaps = 61/544 (11%)

Query: 10 IKNRLLVVLALLAMIVASVVMLPKLNLDAFPDVTNVQVTIN-TAAEGLAAEEVEKLI--- 65
+ + +L ++ VV+ +L P+ G E +K++
Sbjct: 534 LGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQV 593

Query: 66 -SYPVESAMYALPAVTEVRSLSRTGLS----IVTVVFAEGTDIYFARQQVFEQLQAAREM 120
Y +++ + +V V S +G + + V + + A+
Sbjct: 594 TDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKME 653

Query: 121 ---IPSGVGVPEIGPNTSGLGQIYQYILRAEPNSGIDAAELRSLNDYLVKLIMMPVGGVT 177
I G +P P LG + +G+ L + L+ + +
Sbjct: 654 LGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLV 713

Query: 178 EV-LSFGGDVRQYQVQVDPNKLRAYGLSMAQVTEALES--NNRNAGGWFMDQGQEQLVVR 234
V + D Q++++VD K +A G+S++ + + + + + ++L V+
Sbjct: 714 SVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQ 773

Query: 235 GYGMLPAGEQGLAAIAQIPLTEDKGTPVRVGDIAQVDFGSEIRVGAVTMTRRDEAGKVQN 294
+ ++ + G V + G+ + R + ++
Sbjct: 774 AD---AKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVY----GSPRLERYNGLPSMEI 826

Query: 295 LGEVVAGVVLKRMGANTKATIDDIGARVSLIEQALPDGVSFEVFYDQAELVDKAVTTVRD 354
GE G D A + + LP G+ ++ + + +
Sbjct: 827 QGEAAPGTSS-----------GDAMALMENLASKLPAGIGYD-WTGMSYQERLSGNQAPA 874

Query: 355 ALLMAFVFIVVILALFLVNIRATLLVLLSIPVSIGLALMVMSYYGLSANLMSLGGLAVAI 414
+ ++FV + + LA + + V+L +P+ I L+ + + ++ + GL I
Sbjct: 875 LVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTI 934

Query: 415 GMLVDGSVVMVENIFKHLTQPDRRHLLEARTRADGEADPYHSDEDGGQQANMAVRIMLAA 474
G+ ++++VE + L+E + EA ++A
Sbjct: 935 GLSAKNAILIVEFA---------KDLMEKEGKGVVEA------------------TLMAV 967

Query: 475 KEVCSPIFFATAIIIVVFAPLFALEGVEGKLFQPMAVSIILAMISALLVALIAVPALAVY 534
+ PI + I+ PL G + + ++ M+SA L+A+ VP V
Sbjct: 968 RMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVV 1027

Query: 535 LFKR 538
+ +
Sbjct: 1028 IRRC 1031



Score = 102 bits (257), Expect = 3e-24
Identities = 88/516 (17%), Positives = 188/516 (36%), Gaps = 38/516 (7%)

Query: 565 RPKVVMLSALLMFALSLLLLPRLGTEFVPELEEGTINLRVTLAPTASLGT-SLAVAPKLE 623
RP + A+++ L + +L P + +++ P A T V +E
Sbjct: 8 RPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSAN-YPGADAQTVQDTVTQVIE 66

Query: 624 AILLEFPEVEYALSRIGAPELGGDPEPVSNIEVYIGLKPISEWQSASSRLELQRLMEEKL 683
+ + Y S + ++ + + + ++ A Q ++ KL
Sbjct: 67 QNMNGIDNLMYMSST---------SDSAGSVTITLTFQSGTDPDIA------QVQVQNKL 111

Query: 684 SVFPGLLLTFSQPIATRVDELLSGVKAQLAIKIFGPDLAVLSERGQALT---DLVAKIPG 740
+ LL Q V++ S P + D ++++ G
Sbjct: 112 QLATPLLPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNG 171

Query: 741 AVDVSLEQVSGEAQLVVRPKRELLARYGISVDQVMSLVS---QGIGGGS--AGQVIDGNA 795
DV L + + + +LL +Y ++ V++ + I G + G
Sbjct: 172 VGDVQL--FGAQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQ- 228

Query: 796 RYDINVRLAAEFRTSPDAIKDLLLSGTNGATVRLGEVASVEVEMAPPNIR-RDDVQRRVV 854
+ + ++ F+ + K L ++G+ VRL +VA VE+ N+ R + +
Sbjct: 229 QLNASIIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAG 288

Query: 855 VQANVA-GRDMGSVVKDIYALVP--QADLPAGYTVIIGGQYENQQRAQQKLMLVVP---I 908
+ +A G + K I A + Q P G V+ Y+ Q + VV
Sbjct: 289 LGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLY--PYDTTPFVQLSIHEVVKTLFE 346

Query: 909 SIALIALLLYFSFGSFKQVLLIMANVPLALIGGIVALYVSGTYLSVPSSIGFITLFGVAV 968
+I L+ L++Y + + L+ VP+ L+G L G ++ + G + G+ V
Sbjct: 347 AIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLV 406

Query: 969 LNGVVLVDSINQRRQS-GEALYDCVYEGTVGRLRPVLMTALTSALGLIPILLSSGVGSEI 1027
+ +V+V+++ + + + ++ A+ + IP+ G I
Sbjct: 407 DDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAI 466

Query: 1028 QKPLAVVIIGGLFSSTALTLLVLPTLYRWLYRGDKR 1063
+ ++ I+ + S + L++ P L L +
Sbjct: 467 YRQFSITIVSAMALSVLVALILTPALCATLLKPVSA 502


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
Sbal_0480	MECHCHANNEL	170	8e-58	Bacterial mechano-sensitive ion channel signature.
		>MECHCHANNEL#Bacterial mechano-sensitive ion channel signature.

Length = 136

Score = 170 bits (431), Expect = 8e-58
Identities = 85/136 (62%), Positives = 110/136 (80%), Gaps = 1/136 (0%)

Query: 1 MSLIKEFKAFASRGNVIDMAVGIIIGAAFGKIVSSFVADIIMPPIGIILGGVNFSDLSIV 60
MS+IKEF+ FA RGNV+D+AVG+IIGAAFGKIVSS VADIIMPP+G+++GG++F ++
Sbjct: 1 MSIIKEFREFAMRGNVVDLAVGVIIGAAFGKIVSSLVADIIMPPLGLLIGGIDFKQFAVT 60

Query: 61 LQAAQGDAPSVVIAYGKFIQTIIDFTIIAFAIFMGVKAINRLKRKEEVAPKAPAAPTKDQ 120
L+ AQGD P+VV+ YG FIQ + DF I+AFAIFM +K IN+L RK+E P A APTK++
Sbjct: 61 LRDAQGDIPAVVMHYGVFIQNVFDFLIVAFAIFMAIKLINKLNRKKE-EPAAAPAPTKEE 119

Query: 121 ELLSEIRDLLKAQQEK 136
LL+EIRDLLK Q +
Sbjct: 120 VLLTEIRDLLKEQNNR 135


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
Sbal_0482	RTXTOXIND	97	2e-24	Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 97.2 bits (242), Expect = 2e-24
Identities = 43/296 (14%), Positives = 98/296 (33%), Gaps = 32/296 (10%)

Query: 71 LAQLEDNQFSAKVSQAEASLASSKADLQTLAAKVELQRALITQASAGVVAAESDKIRAQQ 130
+ + + S + ++ + ++ +RA A + E+ +
Sbjct: 176 FQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKS 235

Query: 131 QLSRSKKLKVSNYSSQDDVDQLQAGFDSAAARLDEAKA--------VLVAKQRELAVFN- 181
+L L ++ V + + + A L K+ +L AK+ V
Sbjct: 236 RLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQL 295

Query: 182 ------AQLDQAGSVVEQADATLELAKIQLNDTRVTAPFSGVIGKRGAM-VGQYVQPGQA 234
+L Q + L + + + + AP S + + G V +
Sbjct: 296 FKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET 355

Query: 235 LYSLVPDGAV-WITANFKETQIQHMQPGQSVLVSLDAFPDKTFIGVIDSLSPASGAKFSL 293
L +VP+ +TA + I + GQ+ ++ ++AFP + G + K
Sbjct: 356 LMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRY-GYLV-------GKVKN 407

Query: 294 LPAENATGNFTKIVQRIPVRIRLDLSEEEAH-----MLPGLSAVVKVDTASGTAIS 344
+ + +V V I ++ + + G++ ++ T + IS
Sbjct: 408 INLDAIEDQRLGLVFN--VIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVIS 461


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
Sbal_0483	TCRTETB	129	8e-35	Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 129 bits (327), Expect = 8e-35
Identities = 90/421 (21%), Positives = 176/421 (41%), Gaps = 19/421 (4%)

Query: 25 TDYERGSRRSWIAVFGGLIGAFMAILDIQITNASMKEIQGSLGATLEEGSWISTAYLVAE 84
T Y + + R + I +F ++L+ + N S+ +I +W++TA+++
Sbjct: 3 TSYSQSNLRHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTF 62

Query: 85 MIAIPLSGWLSTGLSVRRYLLWTTAAFIFASVLCSMAWN-LEAMIAFRALQGFFGGALIP 143
I + G LS L ++R LL+ F SV+ + + +I R +QG A
Sbjct: 63 SIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPA 122

Query: 144 LAFRLILEFLPDNKRAVGMALFGVTATFAPSIGPTLGGWLTEQFSWHYLFYINVPPGLLV 203
L ++ ++P R L G +GP +GG + W YL +P ++
Sbjct: 123 LVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLL--LIPMITII 180

Query: 204 MAMLAYGLEKQSVVWDKLKNVDLAGIVTMALGMGCLEVVLEEGNRKDWFGSELIRNLAII 263
L K+ V + D+ GI+ M++G+ + F + + I+
Sbjct: 181 TVPFLMKLLKKEVRIKG--HFDIKGIILMSVGIVFFML----------FTTSYSISFLIV 228

Query: 264 AVVNLVLFVWIQLRRKEPLVNLRLLGKRDFVLSTVAYFLLGMALFGAIYLIPLYLSQVHD 323
+V++ ++FV + +P V+ L F++ + ++ + G + ++P + VH
Sbjct: 229 SVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQ 288

Query: 324 YTPLEIGGVIMWMGFPQLLVL-PLVPKLMERFDSRYLAAFGFLMFAISYYMNSQMTADYA 382
+ EIG VI++ G +++ + L++R Y+ G ++S+ S +
Sbjct: 289 LSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFL--LET 346

Query: 383 GPQMIASQVVRALG-QPFILVPIGMLATMHLKPHENASASTVLNVMRNLGGAFGIALVAT 441
+ +V LG F I + + LK E + ++LN L GIA+V
Sbjct: 347 TSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGG 406

Query: 442 L 442
L
Sbjct: 407 L 407


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_0484  SACTRNSFRASE  38     6e-06    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 37.6 bits (87), Expect = 6e-06
Identities = 21/72 (29%), Positives = 31/72 (43%), Gaps = 5/72 (6%)

Query: 75 ASIGRVVVSPAGRGKGLAMPLMQHAIESALTTWPDAGIQIGAQDY-LKA--FYQKLGFVA 131
A I + V+ R KG+ L+ AIE A G+ + QD + A FY K F+
Sbjct: 90 ALIEDIAVAKDYRKKGVGTALLHKAIEWAKEN-HFCGLMLETQDINISACHFYAKHHFII 148

Query: 132 CS-EMYLEDGIP 142
+ + L P
Sbjct: 149 GAVDTMLYSNFP 160


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_0487  ISCHRISMTASE  55     5e-11    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 54.6 bits (131), Expect = 5e-11
Identities = 48/209 (22%), Positives = 78/209 (37%), Gaps = 25/209 (11%)

Query: 30 PTIRTMTQAQAPTELNANTTAVLVIDFQNEYFTGSMP--IPNGKQALGKAKQVVKFAHQN 87
PT M Q + + N +L+ D Q YF + + +++ Q
Sbjct: 12 PTASDMPQNKVSWVPDPNRAVLLIHDMQ-NYFVDAFTAGASPVTELSANIRKLKNQCVQL 70

Query: 88 AMPVYFVRHLGPAA-----------GPLFAEGSVNAEFHQDLQPLDIDFVINKATPSSFV 136
+PV + G GP G + +L P D D V+ K S+F
Sbjct: 71 GIPVVYTAQPGSQNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLVLTKWRYSAFK 130

Query: 137 GTNLDQQLKEKGIKTLVITGLMTHMCVSSTARDAVPMGYDVIIAEDATATRDLATWDGSI 196
TNL + ++++G L+ITG+ H+ TA +A DA A D S+
Sbjct: 131 RTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVA-------DFSL 183

Query: 197 VDHATLQRAAIAGVADVFAEIKTTQAVLN 225
H + A+ A A T ++L+
Sbjct: 184 EKH----QMALEYAAGRCAFTVMTDSLLD 208


64  Sbal_0502  Sbal_0508  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Sbal_0502  0       14       -1.511426  multi-sensor signal transduction histidine
Sbal_0503  1       13       -1.213474  response regulator receiver protein
Sbal_0504  1       12       -1.131809  response regulator receiver modulated
Sbal_0505  0       16       -0.778751  alpha-L-glutamate ligase
Sbal_0506  1       17       -1.513790  hypothetical protein
Sbal_0507  -1      14       -2.753453  histone family protein DNA-binding protein
Sbal_0508  0       15       -3.186045  response regulator receiver protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_0502  PF06580  35     0.001    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 34.8 bits (80), Expect = 0.001
Identities = 39/252 (15%), Positives = 73/252 (28%), Gaps = 83/252 (32%)

Query: 463 FILATINNVSERKRIEVQRAEHMQELERINQELDRFAYIASHDLKSPLRGIEQLTSWLAE 522
F+ +NN+ + +A M L + + ++ LR LA+
Sbjct: 174 FMFNALNNIRALILEDPTKAREM---------LTSLSEL----MRYSLRYSNARQVSLAD 220

Query: 523 DLSDNTNENVQKYLGLIQSRIHRMVLLLDGLLMFSRIGRVDTETTEVNSRQLAEDMFALV 582
+L V YL L + F + + Q+ + +
Sbjct: 221 EL-----TVVDSYLQLASIQ-------------FEDRLQFEN--------QINPAIMDVQ 254

Query: 583 APPQGFELVLKGEFPNFHTVRALLELVIRNLISNAIKH---HDLGTGVITVLFEAADKHY 639
PP ++++ L+ N IKH G I + +
Sbjct: 255 VPP----------------------MLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTV 292

Query: 640 LFSVIDDGPGISSAYQNKVFEMFQTLKPRDEVEGSGLGLSLVKKTVESLGGN---IQLKS 696
V + G L ++ E +G GL V++ ++ L G I+L
Sbjct: 293 TLEVENTGS----------------LALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSE 336

Query: 697 QGRGCCFYFTWP 708
+ P
Sbjct: 337 KQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Sbal_0503  HTHFIS  48     2e-09    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 47.9 bits (114), Expect = 2e-09
Identities = 24/112 (21%), Positives = 44/112 (39%), Gaps = 10/112 (8%)

Query: 8 QQVTILLVDDDDVDYMAVQRAMRQLRLLNPLVRARDGIEALSILTSLDTIKGPYLILLDL 67
TIL+ DDD + +A+ + + + + + L++ D+
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGY--DVRITSNAATLWRWIAA----GDGDLVVTDV 55

Query: 68 NMPRMNGFEFLERIRS-DPSLSSSVVFMLTTSSTDEDRMKAYSHHVAGYMVK 118
MP N F+ L RI+ P L V +++ +T +KA Y+ K
Sbjct: 56 VMPDENAFDLLPRIKKARPDL---PVLVMSAQNTFMTAIKASEKGAYDYLPK 104


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Sbal_0504  HTHFIS  61     7e-12    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 60.6 bits (147), Expect = 7e-12
Identities = 29/102 (28%), Positives = 48/102 (47%), Gaps = 3/102 (2%)

Query: 3 LLLIDDDEVDRTAIIRALRQSKLTFNVIEANCAFDGLNLALERHFDGILLDYLLPDANGL 62
+L+ DDD RT + +AL S+ ++V + A D ++ D ++PD N
Sbjct: 6 ILVADDDAAIRTVLNQAL--SRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 63 EVLIKLNAMTQDQTVVVMLSRYEDEKLAQRCIELGAQDFLLK 104
++L ++ D V+VM S A + E GA D+L K
Sbjct: 64 DLLPRIKKARPDLPVLVM-SAQNTFMTAIKASEKGAYDYLPK 104


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_0507  DNABINDINGHU  109    2e-35    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 109 bits (275), Expect = 2e-35
Identities = 45/88 (51%), Positives = 66/88 (75%)

Query: 2 NKTELIAKIAENADLTKVEAARALKSFEAAITESMKNGDKISIVGFGSFETATRAARTGR 61
NK +LIAK+AE +LTK ++A A+ + +A++ + G+K+ ++GFG+FE RAAR GR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 62 NPQTGKEIQIAEATVPKFKAGKTLRDSV 89
NPQTG+EI+I + VP FKAGK L+D+V
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAV 90


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Sbal_0508  HTHFIS  61     8e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 61.0 bits (148), Expect = 8e-13
Identities = 25/107 (23%), Positives = 44/107 (41%), Gaps = 3/107 (2%)

Query: 146 RVLVVDDSRMARNVIKRTIGNLGMKLITEAEDGAQAIELMKNNMFDLVITDYNMPSVDGL 205
+LV DD R V+ + + G + + A + DLV+TD MP +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 206 ALTQYIRNESQQSHIPILMVSSEANDTHLSNVSQAGVNALCDKPFEP 252
L I+ + +P+L++S++ S+ G KPF+
Sbjct: 64 DLLPRIKKA--RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDL 108



Score = 45.6 bits (108), Expect = 1e-07
Identities = 34/155 (21%), Positives = 60/155 (38%), Gaps = 6/155 (3%)

Query: 10 SILLVEPSDIQRRIIIQRLQQEGILSIQTAENIEAAKDIIARHKPDLIASAMHFDDGTAI 69
+IL+ + R ++ Q L + G ++ N IA DL+ + + D A
Sbjct: 5 TILVADDDAAIRTVLNQALSRAG-YDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 70 DLLGYLRASADCKDIQFMLVSSECRREQLEIFRQSGVVAILPKPFSAEHLATALNATIDL 129
DLL ++ + D+ +++S++ + G LPKPF L + +
Sbjct: 64 DLLPRIKKAR--PDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 130 LSHDELDLSNFDVQDVRVLVVDDSRM--ARNVIKR 162
L + D QD LV + M V+ R
Sbjct: 122 PKRRPSKLED-DSQDGMPLVGRSAAMQEIYRVLAR 155


65  Sbal_0607  Sbal_0612  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias   Product
Sbal_0607  0       18       1.967920  NAD-dependent epimerase/dehydratase
Sbal_0608  -1      18       1.370609  arginine repressor
Sbal_0609  0       20       1.584706  malate dehydrogenase
Sbal_0610  2       17       2.898853  putative thiol-disulfide oxidoreductase DCC
Sbal_0611  3       18       3.925805  short chain dehydrogenase
Sbal_0612  2       16       3.995401  5-formyltetrahydrofolate cyclo-ligase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_0607  NUCEPIMERASE  39     6e-06    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 39.4 bits (92), Expect = 6e-06
Identities = 30/123 (24%), Positives = 47/123 (38%), Gaps = 23/123 (18%)

Query: 1 MKIAILGATGWIGGAILKEALSRGHQVTAL-----VRDPS-------KLSATDVAVHAVD 48
MK + GA G+IG + K L GHQV + D S L+ H +D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 49 LE-QPLVAQTFA--GVDVVI-----AAVGGRAQQNHDLVASTV---QHLLDVLPNAKVPR 97
L + + FA + V AV + H S + ++L+ + K+
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 98 LLW 100
LL+
Sbjct: 121 LLY 123


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_0608  ARGREPRESSOR  145    1e-47    Bacterial arginine repressor signature.
		>ARGREPRESSOR#Bacterial arginine repressor signature.

Length = 149

Score = 145 bits (367), Expect = 1e-47
Identities = 43/150 (28%), Positives = 71/150 (47%), Gaps = 5/150 (3%)

Query: 6 NQDDLVRIFKSILKEERFGSQSEIVTALQAEGFGNINQSKVSRMLSKFGAVRTRNAKQEM 65
N+ + I+ +Q E+V L+ +G+ N+ Q+ VSR + + V+
Sbjct: 2 NKGQRHIKIREIITANEIETQDELVDILKKDGY-NVTQATVSRDIKELHLVKVPTNNGSY 60

Query: 66 VYCLPAELGVPTAGSPLKNLV---LDVDHNQAMIVVRTSPGAAQLIARLLDSIGKPEGIL 122
Y LPA+ ++L+ + +D +IV++T PG AQ I L+D++ E I+
Sbjct: 61 KYSLPADQRFNPLSKLKRSLMDAFVKIDSASHLIVLKTMPGNAQAIGALMDNLDWEE-IM 119

Query: 123 GTIAGDDTIFICPSSIQDIADTLETIKSLF 152
GTI GDDTI I + D + I L
Sbjct: 120 GTICGDDTILIICRTHDDTKVVQKKILELL 149


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_0611  DHBDHDRGNASE  48     8e-09    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 48.1 bits (114), Expect = 8e-09
Identities = 37/192 (19%), Positives = 71/192 (36%), Gaps = 22/192 (11%)

Query: 5 IIITGVGKRIGYALAKHFLAQGQQVIG-----TYRSHYDSIDELNALGATLYPCDFYDDT 59
ITG + IG A+A+ +QG + S + A A +P D D
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSA 70

Query: 60 QVQSLIDEL-TQLPQIRAIIHNASDWLPDPVLTKNEPLKSTTFAPSQVLQRMMQVHVSVP 118
+ + + ++ I +++ A P + + ++ TF+ V+ +
Sbjct: 71 AIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFS----------VNSTGV 120

Query: 119 YQLNLALEAQLRAAAGDEIGGSDVIHITDYVAEKGSQKHIAYAASKAALHNLTLSFAAKF 178
+ + ++ + D GS V + A AYA+SKAA T +
Sbjct: 121 FNASRSVSKYMM----DRRSGSIVT-VGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLEL 175

Query: 179 APE-VKVNSIAP 189
A ++ N ++P
Sbjct: 176 AEYNIRCNIVSP 187


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
Sbal_0612  OMS28PORIN  29     0.018    OMS28 porin signature.
		>OMS28PORIN#OMS28 porin signature.

Length = 257

Score = 28.6 bits (63), Expect = 0.018
Identities = 14/44 (31%), Positives = 26/44 (59%), Gaps = 1/44 (2%)

Query: 46 NRNQLRKSIRTARKSLSETEQIQASLSASQRMLDALLAQNAQHV 89
N++ K + ++ ++ EQ++ +L AS+R LD + Q AQ V
Sbjct: 166 NKSPNNKELELTKEEFAKVEQVKETLMASERALDETV-QEAQKV 208


66  Sbal_0749  Sbal_0753  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias   Product
Sbal_0749  -1      13       1.315208  two component transcriptional regulator
Sbal_0750  0       15       1.660002  integral membrane sensor signal transduction
Sbal_0751  0       15       1.925227  hypothetical protein
Sbal_0752  0       17       2.235254  MltA-interacting MipA family protein
Sbal_0753  0       18       2.547319  aldo/keto reductase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Sbal_0749  HTHFIS  76     6e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 75.6 bits (186), Expect = 6e-18
Identities = 37/126 (29%), Positives = 56/126 (44%), Gaps = 1/126 (0%)

Query: 6 HILVVEDDISLAEWISDYLLDHGYEVTVASQGDFALEMIAEEIPDLVLLDVMLPVKNGFD 65
ILV +DD ++ ++ L GY+V + S IA DLV+ DV++P +N FD
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 66 VCKEARAFYAG-PILFMTACVEDGDEIRGLDVGADDYLTKPIRPQVLLARIKALLRRVGD 124
+ + P+L M+A I+ + GA DYL KP L+ I L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 125 EEQKQQ 130
K +
Sbjct: 125 RPSKLE 130


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_0750  PF06580  29     0.042    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 28.7 bits (64), Expect = 0.042
Identities = 23/156 (14%), Positives = 46/156 (29%), Gaps = 36/156 (23%)

Query: 258 RDLDTMEDLVMTLLSYARLDEANIQPDWQSIELNAWLLEKYQGQVYPDFSVELVSYPTAL 317
D +++ +L R S+ +++ Y + + + L
Sbjct: 188 EDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSY-------LQLASIQFEDRL 240

Query: 318 K--IKTDPKYLSMQVNNLL-----NNALRFG------KAKIRLTLAVEEGATWLHVDDDG 364
+ + +P + +QV +L N ++ G KI L + G L V++ G
Sbjct: 241 QFENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTG 300

Query: 365 PGIDELESMQVIKPFVRGQHSRGNSGHGMGLAIVDR 400
+ G GL V
Sbjct: 301 SLALK----------------NTKESTGTGLQNVRE 320


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
Sbal_0752  IGASERPTASE  30     0.013    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 29.6 bits (66), Expect = 0.013
Identities = 28/115 (24%), Positives = 44/115 (38%), Gaps = 6/115 (5%)

Query: 127 IHLGTGTLSTKFQ--HDVTNVYDGFQADITYYHPINLGFGDLVPYAGVHYFSKDFANYYT 184
I LG G +K Q H+ Q +T NLG + P GV Y A++
Sbjct: 1377 IDLGYGKFQSKLQTNHNAKFARHTAQFGLTAGKAFNLGNFGITPIVGVRYSYLSNADFAL 1436

Query: 185 G---VTSSEATAQRPAYQADGTFAYKLGYALVIPI-TKHLDITQATGYSHIAANM 235
+ + + + Q D ++ Y LG V PI + D Q +G ++
Sbjct: 1437 DQARIKVNPISVKTAFAQVDLSYTYHLGEFSVTPILSARYDANQGSGKINVNGYD 1491


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
Sbal_0753  HELNAPAPROT  32     0.001    Helicobacter neutrophil-activating protein A family ...
		>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature.

Length = 153

Score = 32.2 bits (73), Expect = 0.001
Identities = 19/88 (21%), Positives = 36/88 (40%), Gaps = 16/88 (18%)

Query: 109 IHQAVDASLARLQIDTIDLYQIHWPDRNTNFFG--ELFYDQQDQEHQTPILETLEALAEV 166
+ +++ L+ + L++ HW + +FF E F +E ET++ +AE
Sbjct: 13 VENSLNTQLSNWFLLYSKLHRFHWYVKGPHFFTLHEKF-----EELYDHAAETVDTIAER 67

Query: 167 IRQGKVRYIGVSNETPWGLMK-YLQLAE 193
+ IG P +K Y + A
Sbjct: 68 LLA-----IGGQ---PVATVKEYTEHAS 87


67  Sbal_0786  Sbal_0793  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Sbal_0786  -1      11       1.275613   two-component response regulator
Sbal_0787  -1      15       1.594549   hypothetical protein
Sbal_0788  0       14       1.033030   aspartate kinase III
Sbal_0789  0       12       0.511248   succinylglutamate desuccinylase/aspartoacylase
Sbal_0790  1       13       -0.820690  Mg2+ transporter
Sbal_0791  0       13       -1.596614  hypothetical protein
Sbal_0792  -1      12       -1.543974  two component LuxR family transcriptional
Sbal_0793  -2      14       -1.484968  nitrate/nitrite sensor protein NarQ
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Sbal_0786  HTHFIS  85     4e-21    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 84.9 bits (210), Expect = 4e-21
Identities = 31/131 (23%), Positives = 62/131 (47%), Gaps = 1/131 (0%)

Query: 1 MQNPHILIVEDEAVTRNTLRSIFEAEGYVVTEANDGAEMHKAMQENKINLVVMDINLPGK 60
M IL+ +D+A R L GY V ++ A + + + +LVV D+ +P +
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 NGLLLARELREIN-NIGLIILTGRDNEVDKILGLEIGADDYITKPFNPRELTIRARNLLT 119
N L +++ ++ +++++ ++ + I E GA DY+ KPF+ EL L
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 120 RVNSAGAEVEE 130
+++E+
Sbjct: 121 EPKRRPSKLED 131


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_0788  DHBDHDRGNASE  29     0.025    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 29.2 bits (65), Expect = 0.025
Identities = 24/86 (27%), Positives = 42/86 (48%), Gaps = 5/86 (5%)

Query: 10 GTSVADYNAMNRCADIVLANPHCRLVVVSASSGVTNLLVELTQESINDDGRLQRLK-QIA 68
+S A +C + LA + R +VS S T++ L + ++G Q +K +
Sbjct: 158 ASSKAAAVMFTKCLGLELAEYNIRCNIVSPGSTETDMQWSLWAD---ENGAEQVIKGSLE 214

Query: 69 QIQYAI-LDKLGRPNDVAAALDKLLS 93
+ I L KL +P+D+A A+ L+S
Sbjct: 215 TFKTGIPLKKLAKPSDIADAVLFLVS 240


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Sbal_0792  HTHFIS  61     2e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 61.4 bits (149), Expect = 2e-13
Identities = 27/159 (16%), Positives = 61/159 (38%), Gaps = 9/159 (5%)

Query: 6 SVLVVDDHPLLRKGICQLIASDPDFSLFGEAGGGLDALTAVATDEPDIILLDLNMKGMTG 65
++LV DD +R + Q S + + +A + D+++ D+ M
Sbjct: 5 TILVADDDAAIRTVLNQ-ALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 66 LDTLNAMRQEGVTSRIVILTVSDAKQDVIRLLRAGADGYLLKDTEPDLLLEKLKNAMLGH 125
D L +++ +++++ + I+ GA YL K + L+ + A+
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL--- 119

Query: 126 RVISDEVEEYLYELKDATDEQEWISSLTPRELQILEQLA 164
E + +L+D + + + + +I LA
Sbjct: 120 ----AEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLA 154


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_0793  PF06580  41     9e-06    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 41.0 bits (96), Expect = 9e-06
Identities = 27/155 (17%), Positives = 55/155 (35%), Gaps = 17/155 (10%)

Query: 407 TVEAQLTEINEGVSTAYVQLRELLSTFRLTIK-EPDLKSALETMLEQLRTKTSI------ 459
+ I E + A L L R +++ + +L L + + +
Sbjct: 178 ALNNIRALILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFE 237

Query: 460 -KITLDYKLAPQWLEAKQHIHILQITREATLNAIKHA-----EASLINIHCYKDDEGMVN 513
++ + ++ P ++ + ++Q E N IKH + I + KD+ G V
Sbjct: 238 DRLQFENQINPAIMDVQVPPMLVQTLVE---NGIKHGIAQLPQGGKILLKGTKDN-GTVT 293

Query: 514 INVCDNGIGIGHLKERDQHFGIGIMHERASKLSGK 548
+ V + G + G+ + ER L G
Sbjct: 294 LEVENTGSLALKNTKESTGTGLQNVRERLQMLYGT 328


68  Sbal_0987  Sbal_0994  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias   Product
Sbal_0987  -2      12       1.803302  hypothetical protein
Sbal_0988  -2      13       1.498884  pseudouridine synthase
Sbal_0989  -2      17       1.282984  phosphoribosylglycinamide formyltransferase 2
Sbal_0990  -3      17       0.692034  hypothetical protein
Sbal_0991  -2      17       1.514910  secretion protein HlyD family protein
Sbal_0992  -2      16       1.305546  ATPase central domain-containing protein
Sbal_0993  0       17       1.732533  hypothetical protein
Sbal_0994  0       17       1.750132  sulfate ABC transporter ATPase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_0987  SSBTLNINHBTR  31     0.005    Streptomyces subtilisin inhibitor signature.
		>SSBTLNINHBTR#Streptomyces subtilisin inhibitor signature.

Length = 144

Score = 31.0 bits (69), Expect = 0.005
Identities = 18/68 (26%), Positives = 31/68 (45%), Gaps = 5/68 (7%)

Query: 27 MSACQPASEPSKLKTNPDASHT-AEVTSATAMPRAPLTQEVYIWQRQWRPASQTALVQSQ 85
+ A P+ T+P A+ AE+ +A P A ++ + R++ P T
Sbjct: 59 LRAVTLTCAPTASGTHPAAAAACAELRAAHGDPSALAAEDSVMCTREYAPVVVTV----D 114

Query: 86 GVFQGLRI 93
GV+QG R+
Sbjct: 115 GVWQGRRL 122


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_0989  PF06057  31     0.005    Type IV secretory pathway VirJ component
		>PF06057#Type IV secretory pathway VirJ component

Length = 243

Score = 31.3 bits (71), Expect = 0.005
Identities = 10/28 (35%), Positives = 13/28 (46%), Gaps = 2/28 (7%)

Query: 17 GCGELGKEVAIELQRLGVEVIGVD--RY 42
G L K V LQ+ G V+G +Y
Sbjct: 62 GWATLDKAVGGILQQQGWPVVGWSSLKY 89


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Sbal_0991  RTXTOXIND  60     4e-12    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 60.2 bits (146), Expect = 4e-12
Identities = 52/320 (16%), Positives = 98/320 (30%), Gaps = 80/320 (25%)

Query: 66 ITPAVKGLVSRVEVQPNTPVKQGDVLFRIDPIPFEAVVK--------------RKRAALV 111
I P +V + V+ V++GDVL ++ + EA R +
Sbjct: 99 IKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSR 158

Query: 112 AAEL--------------------EVPQLAAALESAKANVER----VNADKDRNKSVYER 147
+ EL EV +L + ++ + + + D+ ++
Sbjct: 159 SIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLT 218

Query: 148 YESGHRKGGANSPFTALELDNKRQL----------YLASEAQLTAARSE----------- 186
+ + S LD+ L L E + A +E
Sbjct: 219 VLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQI 278

Query: 187 -----ELRMRLA-----YESNIDG----VNTKVAGLQGDLASALYDLKQTVVRAPADGIV 232
+ +++ I + L +LA + +V+RAP V
Sbjct: 279 ESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKV 338

Query: 233 TQMALR-PGAMAVPLPLRPVMSFIPDEQRYFAGAFWQNSLL-RLKEGDEAEIILDAAPGK 290
Q+ + G V +M +P++ A QN + + G A I ++A P
Sbjct: 339 QQLKVHTEG--GVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYT 396

Query: 291 ---VFKGKVAKVLPAMAEGE 307
GKV + E +
Sbjct: 397 RYGYLVGKVKNINLDAIEDQ 416


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_0994  PF05272  29     0.044    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.9 bits (64), Expect = 0.044
Identities = 11/38 (28%), Positives = 17/38 (44%), Gaps = 7/38 (18%)

Query: 30 MIGLLGPSGSGKTTLLRIIAGLEGADSGNIYFGDRDVT 67
+ L G G GK+TL+ + GL+ +F D
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLD-------FFSDTHFD 628


69  Sbal_1241  Sbal_1248  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Sbal_1241  -1      10       1.182286   two component LuxR family transcriptional
Sbal_1242  -1      10       1.023628   integral membrane sensor signal transduction
Sbal_1243  -2      12       0.353602   hypothetical protein
Sbal_1244  -1      12       0.640054   carotenoid oxygenase
Sbal_1245  -1      13       0.573215   hypothetical protein
Sbal_1246  -1      9        -0.004445  methyl-accepting chemotaxis sensory transducer
Sbal_1247  -1      9        0.147732   hypothetical protein
Sbal_1248  -1      9        -0.109678  TetR family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Sbal_1241  HTHFIS  71     1e-16    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 71.0 bits (174), Expect = 1e-16
Identities = 21/112 (18%), Positives = 51/112 (45%), Gaps = 2/112 (1%)

Query: 12 LVEDQQLVRQGIASLLAISDNIRVVWQAEDGQDALSQLANNPVDVLLSDIRMPNLDGIAM 71
+ +D +R + L+ + V + +A D++++D+ MP+ + +
Sbjct: 8 VADDDAAIRTVLNQALSRAG-YDVR-ITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 72 LKQIRQNANSLPVIMLTTFDDSELFLNSLQAGANGFLLKDVSLDKLLHAIET 123
L +I++ LPV++++ + + + + GA +L K L +L+ I
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_1242  PF06580  31     0.006    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 31.4 bits (71), Expect = 0.006
Identities = 19/106 (17%), Positives = 44/106 (41%), Gaps = 20/106 (18%)

Query: 291 LVLQEGISNAVRHG-----KANQLQLSMEDSQSVLVLQLSDNGVGLTRVAARNASAKSGT 345
+++Q + N ++HG + ++ L + L++ + G + + K T
Sbjct: 258 MLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALK------NTKEST 311

Query: 346 GLNGTGQFGTGLGGMQERLQP-FNGKVQLRANDSAPGCQLTLTLPA 390
GTGL ++ERLQ + + Q++ ++ + +P
Sbjct: 312 --------GTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIPG 349


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Sbal_1246  FLAGELLIN  30     0.026    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 30.0 bits (67), Expect = 0.026
Identities = 30/256 (11%), Positives = 72/256 (28%), Gaps = 14/256 (5%)

Query: 74 AHDISVQTSKIAIGSAEVSHFIDLLNKSIESNGEHASAIAVAAGQLSHTTAQLGDNAADI 133
K+ + +A D + + + + A G
Sbjct: 217 DTTAPTVPDKVYVNAANGQLTTDDAENNTAVDLFKTTKSTAGTAE---AKAIAGAIKGGK 273

Query: 134 LGQAQEAERVSVQGRSQAQKG-----VAAIRSLSTDIDTAAEQVQALKSRAEEIQKITEV 188
G + + V+ ++ I + A A A +Q V
Sbjct: 274 EGDTFDYKGVTFTIDTKTGNDGNGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNV 333

Query: 189 INSVAEQTNLLALNAAIEAARAGEQGRGFAVVADEVRSLAGKTAGATQDIGKMLLEIRSE 248
SV E+A+ + AV + ++ G A K+ L ++
Sbjct: 334 YTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTM 393

Query: 249 TDKTSGLMERVVTQTADVVA------AMGELDAHFTEISASVTQSAHALGDMEDSLKQYN 302
+ + A + +D+ +++ A + + ++
Sbjct: 394 FIDKTASGVSTLINEDAAAAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLG 453

Query: 303 NTTNDISRSVTQIRDS 318
NT +++ + ++I D+
Sbjct: 454 NTVTNLNSARSRIEDA 469


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_1248  HTHTETR  57     2e-12    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 57.3 bits (138), Expect = 2e-12
Identities = 28/167 (16%), Positives = 60/167 (35%), Gaps = 3/167 (1%)

Query: 2 RNAEFDREQVLRGAMAAFMHKGYTKTSMQDLTQATGLHPGSIYCAFTNKRGLLIAAIEQY 61
+ A+ R+ +L A+ F +G + TS+ ++ +A G+ G+IY F +K L E
Sbjct: 7 QEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELS 66

Query: 62 QLDRNAQFNSLFAN-SKNVLTNLKTYLDQIVAECLSCDSAQACLLTKALNEVAEQDVEIR 120
+ + A + L+ L+ L ++ ++ + + + ++ +
Sbjct: 67 ESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVV 126

Query: 121 D-IINQYLQSWQQALTQQFTSAAEQGLLEGHRSDEQRAQYFMMGIYG 166
+ Q E +L +RA M G
Sbjct: 127 QQAQRNLCLESYDRIEQTLKHCIEAKMLPADL-MTRRAAIIMRGYIS 172


70  Sbal_1395  Sbal_1406  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias   Product
Sbal_1395  -1      13       1.616468  TetR family transcriptional regulator
Sbal_1396  -1      14       2.118585  hypothetical protein
Sbal_1397  -1      17       2.029043  TonB-dependent heme/hemoglobin receptor family
Sbal_1398  0       16       1.822202  PhnA protein
Sbal_1399  0       18       1.570791  major facilitator superfamily transporter
Sbal_1400  0       17       1.341566  MarR family transcriptional regulator
Sbal_1401  1       19       1.245950  succinylglutamate desuccinylase/aspartoacylase
Sbal_1402  1       19       1.064336  methyl-accepting chemotaxis sensory transducer
Sbal_1403  0       16       0.547794  fumarylacetoacetate (FAA) hydrolase
Sbal_1404  1       17       0.937603  hypothetical protein
Sbal_1405  -1      17       1.406814  beta-N-acetylhexosaminidase
Sbal_1406  -1      17       1.268949  ROK family protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_1395  HTHTETR  54     3e-11    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 53.9 bits (129), Expect = 3e-11
Identities = 29/163 (17%), Positives = 63/163 (38%), Gaps = 5/163 (3%)

Query: 8 DRQEKLI-LAMELFWQKGFAETSISDLVGHLGINRFSLYNSFGDKQNLYRECLSFYLDNY 66
+ ++ ++ +A+ LF Q+G + TS+ ++ G+ R ++Y F DK +L+ E N
Sbjct: 11 ETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNI 70

Query: 67 SFGASDTLLHEKAGLAE-IAAYLARFVALQREQKYGCFMQNAVLEKSL--DDESVLQECQ 123
+ + L + ++ + + K + +V+Q+ Q
Sbjct: 71 GELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQ 130

Query: 124 RLFCRLQTS-FTQVLQDCQARGELLVNVQPHQVAAFLVLQLQG 165
R C Q L+ C L ++ + A + + G
Sbjct: 131 RNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISG 173


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_1396  FERRIBNDNGPP  28     0.046    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 27.6 bits (61), Expect = 0.046
Identities = 13/46 (28%), Positives = 22/46 (47%), Gaps = 2/46 (4%)

Query: 50 PALQFIEQMQPSILALSPRLTAVPKKVGGSLMRPQRDSRFSKDKTP 95
P L+ + +M+PS + S P+ + + + P R FS K P
Sbjct: 87 PNLELLTEMKPSFMVWSAGYGPSPEML--ARIAPGRGFNFSDGKQP 130


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_1399  TCRTETB  31     0.010    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 30.6 bits (69), Expect = 0.010
Identities = 35/192 (18%), Positives = 68/192 (35%), Gaps = 9/192 (4%)

Query: 36 LPSIQEDISLSFTLASMLTLLPVLAMGLGCFAGFSIAKRLGFNTVMTGSLLLLIVATAMR 95
LP I D + + + +L +G ++ +LG ++ +++ + +
Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96

Query: 96 FWAMD-ASWLICSALLAGVGIA-LIQTIMPAMIKLNFGERVPLMMGLYVTAIMGGAALAA 153
F S LI + + G G A +M + + E GL + + G +
Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGV-- 154

Query: 154 SSAPFIGMNLGWRAGLGHWTWLGVVALALWLMVKHNAALPNQTAEQTVQLSFWRFRRSWL 213
P IG G A HW++L ++ + + V L + +
Sbjct: 155 --GPAIG---GMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSV 209

Query: 214 LAIFFALGTSCY 225
+FF L T+ Y
Sbjct: 210 GIVFFMLFTTSY 221


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Sbal_1402  RTXTOXIND  35     7e-04    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 35.2 bits (81), Expect = 7e-04
Identities = 20/123 (16%), Positives = 43/123 (34%), Gaps = 8/123 (6%)

Query: 42 IETLAVPVQKQSNSLQLVLLKMSRLATLAHSQQDTAALTKSQQAFTALQKKYQSIENELT 101
T+ + + N ++ ++ ++L H Q ++ A + KY NEL
Sbjct: 216 RLTVLARINRYENLSRVEKSRLDDFSSLLHKQ------AIAKHAVLEQENKYVEAVNELR 269

Query: 102 ERVADQSKMQTSLHEAQARYQAYLQQSQAMFSAKLANEQAKQQYQQLFQRFNDAKTNASN 161
+ ++++ + A+ YQ Q + KL Q L +
Sbjct: 270 VYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLR--QTTDNIGLLTLELAKNEERQQA 327

Query: 162 AMI 164
++I
Sbjct: 328 SVI 330


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_1403  PF07824  28     0.018    Type III secretion chaperone
		>PF07824#Type III secretion chaperone

Length = 120

Score = 27.6 bits (61), Expect = 0.018
Identities = 14/54 (25%), Positives = 23/54 (42%)

Query: 80 AIGLDLTKRDLQSKLKAKGLPWERAKAFDGAALFSPFVAIDDAEAPLHFTLSIN 133
A+G+ D Q+ + + K D L PF A+ + L + LS+N
Sbjct: 11 ALGIPSIDTDDQAIMLDDDVLIYIEKEGDSINLLCPFCALPENINDLIYALSLN 64


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Sbal_1406  PERTACTIN  28     0.049    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 28.1 bits (62), Expect = 0.049
Identities = 10/28 (35%), Positives = 15/28 (53%)

Query: 9 GGTKLMLAQVEGKTLLDTWRYPVPADGN 36
LA +GK + T+RY + A+GN
Sbjct: 530 SAATFTLANKDGKVDIGTYRYRLAANGN 557


71  Sbal_1521  Sbal_1524  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias   Product
Sbal_1521  1       20       4.082223  TetR family transcriptional regulator
Sbal_1522  0       20       3.405853  secretion protein HlyD family protein
Sbal_1523  1       17       2.782125  ABC transporter-like protein
Sbal_1524  1       14       1.118741  ABC transporter
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_1521  HTHTETR  75     1e-18    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 75.0 bits (184), Expect = 1e-18
Identities = 29/162 (17%), Positives = 59/162 (36%), Gaps = 6/162 (3%)

Query: 31 SDARQRLITAALSLFSHRSYPTVSTREIAREAEVDAALIRYYFGSKAGLFEQMVRETLEP 90
+ RQ ++ AL LFS + + S EIA+ A V I ++F K+ LF ++ +
Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESN 69

Query: 91 VLTRLREISAAEAPNN---VGEIMQTYYRVMAPNPGLPRLIIRVLQEGDGSEPYRIILSV 147
+ E A + + EI+ L+ + + + ++
Sbjct: 70 IGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQA 129

Query: 148 FEQVLTLSRQWLESTL---VNSGLLKEGVDPDLARLSFVSLM 186
+ S +E TL + + +L + A + +
Sbjct: 130 QRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYI 171


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Sbal_1522  RTXTOXIND  55     2e-10    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 55.2 bits (133), Expect = 2e-10
Identities = 33/176 (18%), Positives = 62/176 (35%), Gaps = 17/176 (9%)

Query: 80 TVERDRLTLTAPVGELITQVNVVEGQQVKAGEVLLTLDSTSANARLALRQAELEQAKAKL 139
T + ++ ++ V EG+ V+ G+VLL L + A A Q+ L Q A+L
Sbjct: 91 THSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQ--ARL 148

Query: 140 SEAVTGARLEDIERAKAVLDGANASVKEAQRAFERTNRLYATKVLSQADLDTARAARDTS 199
+ IE + + F+ + +VL L + T
Sbjct: 149 EQTRYQILSRSIEL-----NKLPELKLPDEPYFQNVS---EEEVLRLTSL--IKEQFSTW 198

Query: 200 LAKQAEAEQSLRLLENGTRSEQLEQAKAAVAAASASVAIEKKALADLSLVAARDAV 255
++ + E +L + + A + +EK L D S + + A+
Sbjct: 199 QNQKYQKELNLD-----KKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAI 249



Score = 51.0 bits (122), Expect = 4e-09
Identities = 31/232 (13%), Positives = 81/232 (34%), Gaps = 15/232 (6%)

Query: 102 VEGQQVKAGEVLLTLDSTSANARLALRQAELEQAKAKLSEAVTGARLEDIERAKAVLDGA 161
V ++V L+ ++ + ++ L++ +A+ + AR+ E V
Sbjct: 179 VSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVL--ARINRYENLSRVEKSR 236

Query: 162 NASVKE-AQRAFERTNRLYATK---VLSQADLDTARAARDTSLAKQAEAEQSLRLLENGT 217
+ + + + V + +L ++ + ++ A++ +L+
Sbjct: 237 LDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLF 296

Query: 218 RSE---QLEQAKAAVAAASASVAIEKKALADLSLVAARDAVVDTLP-WRVGDRIAAGTQL 273
++E +L Q + + +A ++ + A V L G + L
Sbjct: 297 KNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETL 356

Query: 274 IGLLASEDPY-VRVYLPATWLDRVKAGDSVNILVD---GREIP-ITGTVRNI 320
+ ++ +D V + + + G + I V+ + G V+NI
Sbjct: 357 MVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNI 408


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_1523  adhesinb      29     0.019    Adhesin B signature.
		>adhesinb#Adhesin B signature.

Length = 310

Score = 29.4 bits (66), Expect = 0.019
Identities = 15/87 (17%), Positives = 26/87 (29%), Gaps = 12/87 (13%)

Query: 220 SPQQLMAAMGARVVEISGDDL------------RNLKQSLISESAVLSAAQIGSRLRVLV 267
P+ + A ++ +G +L N K+ + +S L
Sbjct: 73 LPEDVKKTSQADLIFYNGINLETGGNAWFTKLVENAKKKENKDYYAVSEGVDVIYLEGQS 132

Query: 268 RSDIEDPLAWLKPRVASRTMEEVRASL 294
EDP AWL + + L
Sbjct: 133 EKGKEDPHAWLNLENGIIYAQNIAKRL 159


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_1524  ABC2TRNSPORT  40     6e-06    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 40.3 bits (94), Expect = 6e-06
Identities = 47/200 (23%), Positives = 91/200 (45%), Gaps = 24/200 (12%)

Query: 186 GVILTMTMVMFT----SAAIVREREQGNMEFLITTPVRPLELMLGKI--------VPYVI 233
G++ T M T AA R Q E ++ T +R +++LG++ +
Sbjct: 72 GMVATSAMTAATFETIYAAFGRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAG 131

Query: 234 VGFVQVTIILSAG-HLLFDVPIRGGIDSIALAAMLFICASLTLGLVISTIAKTQLQSMQM 292
+G V + + LL+ +P+ IAL + F +LG+V++ +A + +
Sbjct: 132 IGVVAAALGYTQWLSLLYALPV------IALTGLAFA----SLGMVVTALAPSYDYFIFY 181

Query: 293 TVFILLPSILLSGFMFPYEAMPIAAQWIAEALPATHFMRMSRAIVLRDAQVMDLQFDALW 352
++ P + LSG +FP + +PI Q A LP +H + + R I+L V+D+
Sbjct: 182 QTLVITPILFLSGAVFPVDQLPIVFQTAARFLPLSHSIDLIRPIML-GHPVVDVCQHVGA 240

Query: 353 MIGFTCVGLFIASMRFSKRL 372
+ + + F+++ +RL
Sbjct: 241 LCIYIVIPFFLSTALLRRRL 260


72  Sbal_1605  Sbal_1612  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Sbal_1605  2       21       -0.669258  ATP-dependent protease ATP-binding subunit ClpX
Sbal_1606  2       19       -0.534644  ATP-dependent protease La
Sbal_1607  1       17       -0.096231  histone family protein DNA-binding protein
Sbal_1608  -1      13       0.574818   PpiC-type peptidyl-prolyl cis-trans isomerase
Sbal_1609  -2      14       0.742338   TOBE domain-containing protein
Sbal_1610  -2      12       0.828113   trans-2-enoyl-CoA reductase
Sbal_1611  -1      14       1.286003   ABC transporter-like protein
Sbal_1612  -1      13       1.026997   oligopeptide/dipeptide ABC transporter ATPase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_1605  HTHFIS        30     0.018    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.2 bits (68), Expect = 0.018
Identities = 15/70 (21%), Positives = 29/70 (41%), Gaps = 13/70 (18%)

Query: 64 KLPTPHELRAHLDDYVIGQDRAKKVLSVAVYNHYKRLKNASPKDGIELGKSNILLIGPTG 123
+ P+ E + ++G+ S A+ Y+ L D +++ G +G
Sbjct: 124 RRPSKLEDDSQDGMPLVGR-------SAAMQEIYRVLARLMQTD------LTLMITGESG 170

Query: 124 SGKTLLAETL 133
+GK L+A L
Sbjct: 171 TGKELVARAL 180


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_1606  HTHFIS        34     0.002    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 34.4 bits (79), Expect = 0.002
Identities = 45/211 (21%), Positives = 76/211 (36%), Gaps = 37/211 (17%)

Query: 261 NMPAEAKEKALAELNKLRMMSP---MSAEATV---VRSY----VDWMTSVPWSQRSKIKR 310
MP E L + K R P MSA+ T +++ D++ P+ I
Sbjct: 56 VMPDENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPK-PFDLTELIGI 114

Query: 311 D---------LAKAQEVLDTDHFGLEKVKERILEYLAVQSRVRQLKGPILCLVGPPGVGK 361
E D L + E V +R+ Q ++ + G G GK
Sbjct: 115 IGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLM-ITGESGTGK 173

Query: 362 TSLGQSIAKATGRK---YVRVALGGVRD---EAEIRGHRRTYIGSMPGKVIQKMAKVGVK 415
+ +++ R+ +V + + + E+E+ GH + G+ G + +
Sbjct: 174 ELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEK---GAFTGAQTRSTGRFEQA 230

Query: 416 N--PLFLLDEIDKMSSDMRGDPASALLEVLD 444
LFL DEI M D + + LL VL
Sbjct: 231 EGGTLFL-DEIGDMPMDAQ----TRLLRVLQ 256


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_1607  DNABINDINGHU  119    4e-39    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 119 bits (300), Expect = 4e-39
Identities = 53/88 (60%), Positives = 69/88 (78%)

Query: 2 NKSELIEKIASGADISKAAAGRALDSFIAAVTEGLKEGDKISLVGFGTFEVRERAERTGR 61
NK +LI K+A +++K + A+D+ +AV+ L +G+K+ L+GFG FEVRERA R GR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 62 NPQTGEEIKIAAAKIPAFKAGKALKDAV 89
NPQTGEEIKI A+K+PAFKAGKALKDAV
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAV 90


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_1611  HTHFIS        29     0.022    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.0 bits (65), Expect = 0.022
Identities = 13/28 (46%), Positives = 16/28 (57%)

Query: 42  TLAIVGEAGSGKSTLARILVGAEPRSGG 69
           TL I GE+G+GK  +AR L     R  G
Sbjct: 162 TLMITGESGTGKELVARALHDYGKRRNG 189


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_1612  HTHFIS        30     0.011    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.2 bits (68), Expect = 0.011
Identities = 9/16 (56%), Positives = 14/16 (87%)

Query: 38  LVGESGSGRSLLARAI 53
           + GESG+G+ L+ARA+
Sbjct: 165 ITGESGTGKELVARAL 180


73  Sbal_1951  Sbal_1962  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Sbal_1951  0       18       0.260500   type III secretion low calcium response
Sbal_1952  0       17       -0.035435  secretion system effector
Sbal_1953  0       17       -1.268079  putative pathogenicity island effector protein
Sbal_1954  0       19       -1.394557  secretion system effector SseE
Sbal_1955  1       19       -1.806408  hypothetical protein
Sbal_1956  3       19       -1.944594  type III secretion low calcium response
Sbal_1957  2       20       -2.237206  hypothetical protein
Sbal_1958  1       22       -2.266388  helix-turn-helix domain-containing protein
Sbal_1959  2       22       -0.788160  type III secretion system needle protein
Sbal_1960  1       18       -0.158808  type III secretion system protein SsaH family
Sbal_1961  1       17       0.006541   YscI/HrpB family type III secretion apparatus
Sbal_1962  1       16       0.070757   YscJ/HrcJ family type III secretion apparatus
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_1951  SYCDCHAPRONE  94     5e-27    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 93.8 bits (233), Expect = 5e-27
Identities = 34/148 (22%), Positives = 59/148 (39%), Gaps = 2/148 (1%)

Query: 16 LEHFLQRGGSLRMLADVEQSDLNVLYQYALQLMACRDQQGAKRIFYLLMRIDQWNYDYCF 75
+E FL+ GG++ ML ++ L LY A + A ++F L +D ++ +
Sbjct: 15 MESFLKGGGTIAMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFL 74

Query: 76 SLGICCQQLHEHEEAIFCLGRAGMIKVDNPLPAYHAGLSYLALGNHDYAKRSFNASLRWC 135
LG C Q + +++ AI ++ + P +HA L G A+ +
Sbjct: 75 GLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELI 134

Query: 136 EGHPESTGIAARAQRGLA--TLAKENSH 161
E ++ R L L KE H
Sbjct: 135 ADKTEFKELSTRVSSMLEAIKLKKEMEH 162


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_1955  CLENTEROTOXN  29     0.035    Clostridium enterotoxin signature.
		>CLENTEROTOXN#Clostridium enterotoxin signature.

Length = 319

Score = 29.2 bits (65), Expect = 0.035
Identities = 22/132 (16%), Positives = 40/132 (30%), Gaps = 18/132 (13%)

Query: 246 SSEDNSLRYAVTPSRYELLNCVAAHGMEDEGLTRVLYQAKVGNTNLGALYGLPAPKDAPQ 305
+E + T +Y+ + ++ + D+G L + T A
Sbjct: 124 PNEYVYYKVYATYRKYQAIR-ISHGNISDDGSIYKLTGIWLSKT------------SADS 170

Query: 306 LDNVDD--FILCDEDINLGVSQTDVYADEATFYQGIGQHQTTTTGDN--CYKLLQLNINE 361
L N+D I E L V TD+ + + T ++ L +
Sbjct: 171 LGNIDQGSLIETGERCVLTVPSTDIEKEILDLAAATERLNLTDALNSNPAGNLYDWR-SS 229

Query: 362 GLHYLATKANPH 373
+ K N H
Sbjct: 230 NSYPWTQKLNLH 241


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_1956  SYCDCHAPRONE  94     3e-27    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 94.2 bits (234), Expect = 3e-27
Identities = 44/148 (29%), Positives = 68/148 (45%)

Query: 9 DFEKLEAACQLALVNQQTLAEQVGLTSQDLEQTYQSGTSKYQMGLPAEAIVDFTYLVMHQ 68
D ++ + A + L T+A ++S LEQ Y ++YQ G +A F L +
Sbjct: 7 DTQEYQLAMESFLKGGGTIAMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLD 66

Query: 69 PWDRRFHLGLGSCLHWLGEYQHALTFYGYALVMDACSPDASFRIAQCFLSLNDDAAAIEA 128
+D RF LGLG+C +G+Y A+ Y Y +MD P F A+C L + A A
Sbjct: 67 HYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESG 126

Query: 129 LQMAISQSFSKPEHHFVGEQAQQLLSAL 156
L +A K E + + +L A+
Sbjct: 127 LFLAQELIADKTEFKELSTRVSSMLEAI 154


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_1957  RTXTOXIND     36     2e-04    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 36.0 bits (83), Expect = 2e-04
Identities = 23/188 (12%), Positives = 62/188 (32%), Gaps = 19/188 (10%)

Query: 185 LVVGTLWSAVVSPPSLPAHIAGTVNIVNMARRSAEPVVEGVIG--LV-DNTSTK---VLD 238
LV+ + S + + A G + + + +P+ ++ +V + S + VL
Sbjct: 68 LVIAFILSVL-GQVEIVATANGKL-THSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLL 125

Query: 239 KLKGTSQNKSLEAEVSSVKAYQLRQLQASALTQQGNMKLSEAQLAVKDSQAKEKTAQFDA 298
KL SS+ +L Q + L++ + +L +
Sbjct: 126 KLTALGAEADTLKTQSSLLQARLEQTRYQILSRS----IELNKLPELKLPDEPYFQNVSE 181

Query: 299 EIRMKQSQQLRGTNQALQQQLADKDGLLLQSQNQFEQLQSRFDKSNVQLSGVMQQLHMLQ 358
E ++ + ++ Q Q Q + ++ ++ +++ + +
Sbjct: 182 EEVLRLTSLIKEQFSTWQNQKY-------QKELNLDKKRAERLTVLARINRYENLSRVEK 234

Query: 359 QQLAELQP 366
+L +
Sbjct: 235 SRLDDFSS 242


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_1962  FLGMRINGFLIF  79     4e-19    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 79.2 bits (195), Expect = 4e-19
Identities = 48/190 (25%), Positives = 81/190 (42%), Gaps = 10/190 (5%)

Query: 22 LYRDLPQDEANQMVALLMLNHIDASAEADQKSGNVSLKIEKDQFINAVELLRQNGFPKPH 81
L+ +L + +VA L +I +G+ ++++ D+ L Q G PK
Sbjct: 53 LFSNLSDQDGGAIVAQLTQMNIPY----RFANGSGAIEVPADKVHELRLRLAQQGLPKGG 108

Query: 82 YANIEDLFPSGQLVTSPAQEEAKMGYLKEQQLERTLSSMDGVISARVSIAEPAPDTGRQL 141
E L + S E+ E +L RT+ ++ V SARV +A P P +
Sbjct: 109 AVGFE-LLDQEKFGISQFSEQVNYQRALEGELARTIETLGPVKSARVHLAMPKPSLFVRE 167

Query: 142 AQTKSASVYIKYSPQANLTNTE-NQIKSLVQNAVPGLSYDNISVFLQAASYRYQAITQPT 200
++ SASV + P L + + + LV +AV GL N+++ Q+ +TQ
Sbjct: 168 QKSPSASVTVTLEPGRALDEGQISAVVHLVSSAVAGLPPGNVTLVDQSGHL----LTQSN 223

Query: 201 SSSSSQLLAQ 210
+S AQ
Sbjct: 224 TSGRDLNDAQ 233


74  Sbal_1973  Sbal_1977  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Sbal_1973  3       18       -1.190450  hypothetical protein
Sbal_1974  3       15       -2.046929  type III secretion system protein
Sbal_1975  3       16       -1.993565  HrpO family type III secretion protein
Sbal_1976  2       16       -1.446170  type III secretion protein SpaR/YscT/HrcT
Sbal_1977  3       17       -1.347544  secretion system apparatus protein SsaU
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_1973  FLGMOTORFLIM  29     0.026    Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 28.7 bits (64), Expect = 0.026
Identities = 11/42 (26%), Positives = 21/42 (50%)

Query: 224 SLLPKMDAIQSPLIAEIGRVSLSLAKLGAMMAGDKLTLAVTL 265
            L  K+  +   ++AE+G + LS+  +  +  GD + L  T
Sbjct: 249 VLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTH 290


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_1974  TYPE3IMPPROT  209    7e-71    Type III secretion system inner membrane P protein ...
		>TYPE3IMPPROT#Type III secretion system inner membrane P protein family signature.

Length = 224

Score = 209 bits (534), Expect = 7e-71
Identities = 80/215 (37%), Positives = 129/215 (60%), Gaps = 7/215 (3%)

Query: 8 VQLIIMLFCLSLLPLFAVMGTSFLKLAIVFSMLRNALGIQQIPPNMAIYGLALILTLFTM 67
+ LI +L +LLP GT F+K +IVF M+RNALG+QQIP NM + G+AL+L++F M
Sbjct: 5 ISLIALLAFSTLLPFIIASGTCFVKFSIVFVMVRNALGLQQIPSNMTLNGVALLLSMFVM 64

Query: 68 APVGMAINDNLKATPIVFDAPNVFEQINTEAIAPYRAFLDKNTSNTQIEFFANIGHKVWP 127
P+ + + F+ + + E + YR +L K + ++FF N K
Sbjct: 65 WPIMHDAYVYFEDEDVTFNDISSLSKHVDEGLDGYRDYLIKYSDRELVQFFENAQLKRQY 124

Query: 128 EKYQQV-------LTKDSLLVMVPAFTMSQLIEAFKIGLLIYLPFVAIDLIVSNILLAMG 180
+ + + K S+ ++PA+ +S++ AFKIG +YLPFV +DL+VS++LLA+G
Sbjct: 125 GEETETVKRDKDEIEKPSIFALLPAYALSEIKSAFKIGFYLYLPFVVVDLVVSSVLLALG 184

Query: 181 MMMVSPMTIALPFKLLIFILMGGWEKLISQLMMSF 215
MMM+SP+TI+ P KL++F+ + GW L L++ +
Sbjct: 185 MMMMSPVTISTPIKLVLFVALDGWTLLSKGLILQY 219


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_1975  TYPE3IMQPROT  71     3e-20    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 70.9 bits (174), Expect = 3e-20
Identities = 33/83 (39%), Positives = 50/83 (60%)

Query: 6 IVHFTSELLWMVLLLSLPVVIVASVVGVLVSLIQALTQIQDQTLQFLIKLIAVCVTLVVC 65
+V ++ L++VL+LS IVA+++G+LV L Q +TQ+Q+QTL F IKL+ VC+ L +
Sbjct: 4 LVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFLL 63

Query: 66 YHWMGSSLLNYASMAFDQISQMG 88
W G LL+Y G
Sbjct: 64 SGWYGEVLLSYGRQVIFLALAKG 86


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_1976  TYPE3IMRPROT  126    4e-37    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.

Length = 261

Score = 126 bits (317), Expect = 4e-37
Identities = 45/238 (18%), Positives = 105/238 (44%), Gaps = 5/238 (2%)

Query: 1 MTTLLPNLLTAQLPVLALCMMRPLGMMLLLPLFKGGAMGSALIRNSLILMFALPTVLAMD 60
M + + L + ++R L ++ P+ ++ ++ L +M +
Sbjct: 1 MLQVTSEQWLSWLNLYFWPLLRVLALISTAPILSERSVPK-RVKLGLAMMITFA-IAPSL 58

Query: 61 EMQPILQQADTWMLISLFGKEMIVGVLLGFCAAIPFWAIDMAGFVIDTMRGASMSTVLNP 120
+ + + +++ +++++G+ LGF F A+ AG +I G S +T ++P
Sbjct: 59 PANDVPVFSFFALWLAV--QQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDP 116

Query: 121 LMGLQSSIYGMLFTQVLTVLFLVSGGFNFLLTALYQSYQQLPPGFNLTLAQPLMVFIAHE 180
L + + + +LFL G +L++ L ++ LP G + +
Sbjct: 117 ASHLNMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAG 176

Query: 181 WQLMCQLCLSFAMPAMVIMILVDVALGLVNRSAQQLNVFFLSMPIKSALVLLLLIYSL 238
+ L A+P + +++ +++ALGL+NR A QL++F + P+ + + L+ +
Sbjct: 177 SLIF-LNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALM 233


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_1977  TYPE3IMSPROT  356    e-125    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 356 bits (916), Expect = e-125
Identities = 119/348 (34%), Positives = 190/348 (54%)

Query: 2 AEKTEKPTEKRLREARNRGQVIKSAEIVTGLQMAIILGYFLYEGPALVQAMMALIDLTIN 61
EKTE+PT K++R+AR +GQV KS E+V+ + + + + L+ +
Sbjct: 3 GEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPAE 62

Query: 62 AINLPLETAAEQIVGTFAMLALRFLGGLTLVLVFTIVVGNSVQTGPVWATESIMPSMNKL 121
LP A +V + L V + + VQ G + + E+I P + K+
Sbjct: 63 QSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIKKI 122

Query: 122 NVMNNAKQLISLKSLFELAKNLVKVTVLSLVFYYLLHRYVNAFQYLPLCEEACGISVIST 181
N + AK++ S+KSL E K+++KV +LS++ + ++ + LP C C ++
Sbjct: 123 NPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLLGQ 182

Query: 182 MITWLWGSFLGCYLIFGIADYAFQRYSLMKELKMSKDDTKQEYKDSEGNPEMKQKRRETQ 241
++ L +++ IADYAF+ Y +KELKMSKD+ K+EYK+ EG+PE+K KRR+
Sbjct: 183 ILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQFH 242

Query: 242 REVASGSLANNVRKATVVVRNPTHIAVCLYYCEGETPLPKVLEKAEDHMALHIVALAEKA 301
+E+ S ++ NV++++VVV NPTHIA+ + Y GETPLP V K D + +AE+
Sbjct: 243 QEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAEEE 302

Query: 302 GVPIVENIPLARALFKHVETGDVIPESLFEPVAELLRLVMAISYDNMK 349
GVPI++ IPLARAL+ IP E AE+LR + + +
Sbjct: 303 GVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQNIEKQH 350


75  Sbal_2005  Sbal_2014  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Sbal_2005  0       20       -2.557306  transposase IS3/IS911 family protein
Sbal_2006  0       18       -2.678881  integrase catalytic subunit
Sbal_2008  -1      17       -2.597606  integrase catalytic subunit
Sbal_2009  -1      17       -2.436386  transposase IS3/IS911 family protein
Sbal_2010  -2      16       -2.348213  transposase, IS4 family protein
Sbal_2011  -2      16       -2.500043  diguanylate cyclase
Sbal_2012  -1      16       -2.313466  response regulator receiver modulated metal
Sbal_2013  -1      15       -2.458265  integral membrane sensor signal transduction
Sbal_2014  -2      16       -2.642397  phosphate-selective porin O and P
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_2005  HTHFIS        26     0.043    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 25.6 bits (56), Expect = 0.043
Identities = 10/59 (16%), Positives = 21/59 (35%), Gaps = 6/59 (10%)

Query: 7 HKSYPQAFKDEAVLMVLEQ-GYSVADAAKSLGVSTSLLYNWKEKHEALKQGITLEESER 64
+ + +L L + AA LG++ + L + G+++ S R
Sbjct: 429 YDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIREL-----GVSVYRSSR 482


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_2009  HTHFIS        26     0.043    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 25.6 bits (56), Expect = 0.043
Identities = 10/59 (16%), Positives = 21/59 (35%), Gaps = 6/59 (10%)

Query: 7 HKSYPQAFKDEAVLMVLEQ-GYSVADAAKSLGVSTSLLYNWKEKHEALKQGITLEESER 64
+ + +L L + AA LG++ + L + G+++ S R
Sbjct: 429 YDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIREL-----GVSVYRSSR 482


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_2012  HTHFIS        51     5e-09    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 51.4 bits (123), Expect = 5e-09
Identities = 20/89 (22%), Positives = 37/89 (41%), Gaps = 9/89 (10%)

Query: 27 VLVVDDEPDIVNVTRLTLNSFTYKNKTLNVQHAYSASEAIEIISHSTDLALILLDVVMET 86
+LV DD+ I V L+ Y V+ +A+ I+ L++ DVVM
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYD-----VRITSNAATLWRWIAAGD-GDLVVTDVVMP- 58

Query: 87 DDAGLKLVKWIRNELGNHMVRIVLRTGQP 115
D+ L+ I+ + +++ + Q
Sbjct: 59 DENAFDLLPRIKKA--RPDLPVLVMSAQN 85


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_2013  PF06580       44     9e-07    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 44.1 bits (104), Expect = 9e-07
Identities = 22/127 (17%), Positives = 38/127 (29%), Gaps = 24/127 (18%)

Query: 443 FENTISPEISCDGYPGALGQVISNLL-----HNAAVHAFEPT-DSGTITISAEIKDDYVT 496
FE+ + E + P + + +L N H G I + + VT
Sbjct: 236 FEDRLQFENQIN--PAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVT 293

Query: 497 IYVTDNGKGMSEVILARIWQPFFTTKLGSGGSGLGLSICRNIVVGILGG--SLIASSAEG 554
+ V + G K +G GL R + + G + S +G
Sbjct: 294 LEVENTGSLA--------------LKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQG 339

Query: 555 QGTCFTL 561
+ L
Sbjct: 340 KVNAMVL 346


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_2014  ENTEROVIROMP  40     4e-06    Enterobacterial virulence outer membrane protein si...
		>ENTEROVIROMP#Enterobacterial virulence outer membrane protein signature.

Length = 171

Score = 40.3 bits (94), Expect = 4e-06
Identities = 22/101 (21%), Positives = 36/101 (35%), Gaps = 13/101 (12%)

Query: 306 KEAEALGQDKQKGYYIE----PSYRFNESFGVF----ARYNAWDNKAGNDVDTEITQTNI 357
K A D K Y P+YR N+ ++ Y + + +
Sbjct: 71 KSRTASSGDYNKNQYYGITAGPAYRINDWASIYGVVGVGYGKFQTTEYPTYKHDTSDYGF 130

Query: 358 ----GVNYWLHENVVFKADYEKAG-GAKDADGFNLGVGYQF 393
G+ + ENV YE++ + D + GVGY+F
Sbjct: 131 SYGAGLQFNPMENVALDFSYEQSRIRSVDVGTWIAGVGYRF 171


76  Sbal_2111  Sbal_2118  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Sbal_2111  0       14       -0.988706  Fis family transcriptional regulator
Sbal_2112  0       14       -0.420693  CheA signal transduction histidine kinase
Sbal_2113  -1      15       -0.364208  putative CheW protein
Sbal_2114  -1      15       -0.076120  methyl-accepting chemotaxis sensory transducer
Sbal_2115  -1      18       -1.380394  protein-glutamate O-methyltransferase
Sbal_2116  -1      14       -1.656582  chemoreceptor glutamine deamidase CheD
Sbal_2117  0       10       -1.271290  response regulator receiver modulated CheB
Sbal_2118  -1      10       -1.638536  response regulator receiver modulated
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_2111  HTHFIS        87     2e-23    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 86.8 bits (215), Expect = 2e-23
Identities = 30/122 (24%), Positives = 53/122 (43%), Gaps = 3/122 (2%)

Query: 1 MSK-KILVVDDSAAIRQMVEATLKSANYQVVLAKDGREALDLCNGQRFDFILTDQNMPRM 59
M+ ILV DD AAIR ++ L A Y V + + D ++TD MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 60 DGLTLIKSLRAMSAFMRTPIIMLTTEAGDDMKAQGKAAGATGWMVKPFDPQKLLAITAKV 119
+ L+ ++ P+++++ + + GA ++ KPFD +L+ I +
Sbjct: 61 NAFDLLPRIKKAR--PDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 120 LG 121
L
Sbjct: 119 LA 120


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_2112  PF06580       43     3e-06    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 42.5 bits (100), Expect = 3e-06
Identities = 24/151 (15%), Positives = 51/151 (33%), Gaps = 52/151 (34%)

Query: 281 EIDKGMIEKLVDPLT--HLVRNSLDHGIEKPDKRLAAGKSEVGVLSLKASQRGGNIVIAV 338
+I+ +++ V P+ LV N + HGI + + G + LK ++ G + + V
Sbjct: 245 QINPAIMDVQVPPMLVQTLVENGIKHGIAQ--------LPQGGKILLKGTKDNGTVTLEV 296

Query: 339 HDNGAGLNRERIIQKARENGLQVADNISDKQVWQLIFAAGFSTAVEVTDVSGRGVGMDVV 398
+ G+ + G G+ V
Sbjct: 297 ENTGSLALKNTK--------------------------------------ESTGTGLQNV 318

Query: 399 RRNIEALGG---RIDIESTEGQGSTFEIQLP 426
R ++ L G +I + +G+ + + +P
Sbjct: 319 RERLQMLYGTEAQIKLSEKQGKVNAM-VLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_2117  HTHFIS        69     7e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 68.7 bits (168), Expect = 7e-15
Identities = 29/118 (24%), Positives = 51/118 (43%), Gaps = 6/118 (5%)

Query: 3 IKVLVVDDSALMRSLLGKMIEADPELSLVGQAADAFEAKDLVNQFRPDVITLDIEMPKVD 62
+LV DD A +R++L + + + ++A + D++ D+ MP +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRI--TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 63 GLTFLDRLMKARPTAVVMISSLTEQG-ADATFNALALGAVDFIPKPKLDSPQGIHDYQ 119
L R+ KARP V++ ++ Q A GA D++PKP D + I
Sbjct: 62 AFDLLPRIKKARPDLPVLV--MSAQNTFMTAIKASEKGAYDYLPKP-FDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_2118  HTHFIS        73     1e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 72.6 bits (178), Expect = 1e-15
Identities = 28/164 (17%), Positives = 65/164 (39%), Gaps = 6/164 (3%)

Query: 255 KVLLVDDQQSMVDYFSSLLRSHGLIVKGLSSAEQVLPALEQFEPDLFIFDLYMPEVNGLE 314
+L+ DD ++ + L G V+ S+A + + + DL + D+ MP+ N +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 315 LAKMIRQLDKYTSSPILVLSSDDTMQNKVSIIQAGSDDLISKQTAPSLFVA---QVISRA 371
L I++ P+LV+S+ +T + + G+ D + K + + + ++
Sbjct: 65 LLPRIKKARPDL--PVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 372 QRGHDIRSSASRDSLTGLLNHTQILVTARRCYNVARRINSQVCI 415
+R + L+ + + R + + + I
Sbjct: 123 KRRPS-KLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMI 165



Score = 55.2 bits (133), Expect = 3e-10
Identities = 30/135 (22%), Positives = 59/135 (43%), Gaps = 2/135 (1%)

Query: 131 HIAIIEDDGNVGAMITKQLREFGFSVQHFLNFTSFLVVQNETPFDLILLDLILPDWTEEA 190
I + +DD + ++ + L G+ V+ N + DL++ D+++PD
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 191 LFEAATEFEKNNTRVFVLSSRGDFDMRLLAIRANVSEYFVKPAETTLLVRKIHQSLKMSE 250
L + + + V V+S++ F + A +Y KP + T L+ I ++L +
Sbjct: 65 LLPRIKK-ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 251 KQPLKVLLVDDQQSM 265
++P L D Q M
Sbjct: 124 RRP-SKLEDDSQDGM 137


77  Sbal_2361  Sbal_2365  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Sbal_2361  -1      16       -1.774132  phosphoenolpyruvate-protein phosphotransferase
Sbal_2362  0       16       -1.702680  PTS system glucose-specific transporter
Sbal_2363  0       15       -1.269099  major facilitator superfamily transporter
Sbal_2364  0       17       -0.927693  methyl-accepting chemotaxis sensory transducer
Sbal_2365  -1      16       0.868981   porin
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_2361  PHPHTRNFRASE  530    0.0      Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 530 bits (1367), Expect = 0.0
Identities = 189/555 (34%), Positives = 308/555 (55%), Gaps = 5/555 (0%)

Query: 3 ITGIIVSSGIAFGQALHLIHTEHHLDYRPIPLSKIPQQQGKFAKALQELLAQLTH--SQA 60
ITGI SSG+A +A IH E ++D ++ + + K AL++ +L Q
Sbjct: 5 ITGIAASSGVAIAKAF--IHLEPNVDIEKTSITDVSTEIEKLTAALEKSKEELRAIKDQT 62

Query: 61 ALDSDSENYQLIEADLLLLEDDELIEQVNDAIRTLQLSASVAVERIFAHQANELQSLDDP 120
++ ++ A LL+L+D EL++ + I Q++A A++ + + +S+D+
Sbjct: 63 EASMGADKAEIFAAHLLVLDDPELVDGIKGKIENEQMNAEYALKEVSDMFVSMFESMDNE 122

Query: 121 YLANRAQDVRCLGQRVVAAINGHLNQGLEKLDRPTILLAQDLTPAEFALLPRENLCGIVL 180
Y+ RA D+R + +RV+ + G L + T+++A+DLTP++ A L ++ + G
Sbjct: 123 YMKERAADIRDVSKRVLGHLIGVETGSLATIAEETVIIAEDLTPSDTAQLNKQFVKGFAT 182

Query: 181 KTGGLTSHTAILARAAGIPAILSCQFDADSIPNGTPLVLDALNGELCVKPNPDQQARLTV 240
GG TSH+AI++R+ IPA++ + + I +G +++D + G + V P ++
Sbjct: 183 DIGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVNPTEEEVKAYEE 242

Query: 241 TFHHEQARRAALQTYKDGPAQTQDGHIVGLMANVGNLNDITHVSDVGADGIGLFRTEFML 300
+ ++ P+ T+DG V L AN+G D+ V G +GIGL+RTEF+
Sbjct: 243 KRAAFEKQKQEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTEFLY 302

Query: 301 MNVSTLPDEKAQYSLYCDALHALGGKTFTIRTLDIGADKELPCLCQEIEDNPALGLRGIR 360
M+ LP E+ Q+ Y + + + GK IRTLDIG DKEL L E NP LG R IR
Sbjct: 303 MDRDQLPTEEEQFEAYKEVVQRMDGKPVVIRTLDIGGDKELSYLQLPKELNPFLGFRAIR 362

Query: 361 YTLAHPDLFKTQLRAILRAANHGPIRLMFPMVNQVEELDEVFALIAQCQDALEEEEKGYG 420
L D+F+TQLRA+LRA+ +G +++MFPM+ +EEL + A++ + +D L E
Sbjct: 363 LCLEKQDIFRTQLRALLRASTYGNLKVMFPMIATLEELRQAKAIMQEEKDKLLSEGVDVS 422

Query: 421 E-LSYGIVVETPAAVFNLNAMLPRLDFVSIGTNDLTQYAMAADRTNPQLTRDYPSLSPAI 479
+ + GI+VE P+ N +DF SIGTNDL QY MAADR N +++ Y PAI
Sbjct: 423 DSIEVGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAADRMNERVSYLYQPYHPAI 482

Query: 480 LALINMTIVQAKAANVKVSLCGELASSPQIVPLLIGMGLDELSVNLSSLLEVKAAICQGN 539
L L++M I A + V +CGE+A +PLL+G+GLDE S++ +S+L ++ + + +
Sbjct: 483 LRLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLDEFSMSATSILPARSQLLKLS 542

Query: 540 IQQFSALAHTALQQD 554
++ A AL D
Sbjct: 543 KEELKPFAQKALMLD 557


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_2363  TCRTETA       38     5e-05    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 38.3 bits (89), Expect = 5e-05
Identities = 33/170 (19%), Positives = 66/170 (38%), Gaps = 11/170 (6%)

Query: 33 MVWPFLAVILYE--KFALSATEVGMVLSSAAIISVFTSFVGSSLSDRIGRHKLMYLTGIL 90
++ P L +L + G++L+ A++ + V +LSDR GR ++ ++
Sbjct: 23 LIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAG 82

Query: 91 YIISFSLLAEANSVEGYVVVMTLCSMATSLWRPLTSAAIGDIIAD---PKTRELAMQSLY 147
+ ++++A A + + + + + T A G IAD R +
Sbjct: 83 AAVDYAIMATAPFLWVLYIGRIVAGITGA-----TGAVAGAYIADITDGDERARHFGFMS 137

Query: 148 FIVNVGCAVGPMLGVWLGLTGKQSSFYLTAVAFAVLLALLYWGFNHQTRQ 197
G GP+LG +G + F+ A A L L ++ +
Sbjct: 138 ACFGFGMVAGPVLGGLMGGFSPHAPFFA-AAALNGLNFLTGCFLLPESHK 186



Score = 32.5 bits (74), Expect = 0.004
Identities = 27/130 (20%), Positives = 52/130 (40%), Gaps = 9/130 (6%)

Query: 320 AMVIISTQFLLLKLMARFPLVKRIQIGLLLLICSQIWLAFNSLDLFWGW-IGAIVVMSVA 378
+ ++ + + AR + + +G++ I LAF + GW I+V+ +
Sbjct: 257 ILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATR----GWMAFPIMVLLAS 312

Query: 379 ETILFPTMNVHIDRLAPDHLRGAYFGA-ASFYDLGFALAPLCGGIILDHFGGQW---LFL 434
I P + + R + +G G+ A+ L + PL I W ++
Sbjct: 313 GGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASITTWNGWAWI 372

Query: 435 TGAALSVLVI 444
GAAL +L +
Sbjct: 373 AGAALYLLCL 382


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_2364  FLAGELLIN     30     0.039    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 29.6 bits (66), Expect = 0.039
Identities = 20/179 (11%), Positives = 47/179 (26%), Gaps = 3/179 (1%)

Query: 383 ATAMHEMATTSSDVARNAQGASSAAKEADEATNVGSKVVSDTTNAINALSSKIDMAVAEV 442
E + + +G + K + + + + K+ + VA++
Sbjct: 256 TAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTTINGEKVTLTVADI 315

Query: 443 NTLGSATDNIATILKVINDIADQTNLLALNAAI--EAARAGDSGRGFAVVADEVRTLAQR 500
+ D + + E+A+ D AV + T+
Sbjct: 316 TAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESKITVNGA 375

Query: 501 TQQSTTQIRNMIEQLQTGARAVAEVMSQSKDNAKDAVTLAQGANTALDKIREAILQISD 559
+ + +T S +DA + L I A+ ++
Sbjct: 376 EYTANAAGDKVTLAGKTMFIDKTAS-GVSTLINEDAAAAKKSTANPLASIDSALSKVDA 433


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_2365  ECOLNEIPORIN  48     3e-08    E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 47.9 bits (114), Expect = 3e-08
Identities = 58/326 (17%), Positives = 109/326 (33%), Gaps = 49/326 (15%)

Query: 63 SLYGSLRPTLEYQDKADDVWD----------IGDALSRLGVKADTEFAPNWHAIAQGEWK 112
+LYG+++ +E I D S++G K + AI Q E K
Sbjct: 22 TLYGTIKAGVETSRSVAHNGAQAASVETGTGIVDLGSKIGFKGQEDLGNGLKAIWQVEQK 81

Query: 113 IRLDDNGRFGEARLAFAGIGSPFGQITFGRQRPVQY--------TLVAEYIDIFNNA--- 161
+ R +F G+ FG++ GR V ++Y+ + A
Sbjct: 82 ASIAGTDSGWGNRQSFIGLKGGFGKLRVGRLNSVLKDTGDINPWDSKSDYLGVNKIAEPE 141

Query: 162 NSPFGYNQESPFF--VNNALIYELKLKPVTVMASAIFDGNSGGSGADTINVGAGFDKNGL 219
+SP F ++ ++ Y L N+G +++ + G + G
Sbjct: 142 ARLISVRYDSPEFAGLSGSVQYALN-------------DNAGRHNSESYHAGFNYKNGGF 188

Query: 220 HLGAAYLQQDVYANANRTG---KEQLTGAVISYEFSSGIYAAVGYQAKDYEFETAVNRTG 276
+ + + K Q+ V Y+ + +YA+V Q +D +
Sbjct: 189 FVQYGGAYKR-HHQVQENVNIEKYQIHRLVSGYD-NDALYASVAVQQQDAKLVEENYSHN 246

Query: 277 STFDSA--LAIPFANAYKLKLGYFWFKDG-IEDASSQD-YDGYNLTLEWQIAANVRTHLE 332
S + A LA F N ++ Y G + + + YD + E+ + +
Sbjct: 247 SQTEVAATLAYRFGNV-TPRVSYAHGFKGSFDATNYNNDYDQVVVGAEYDFSKRTSALVS 305

Query: 333 Y-LQQNNDQRED--DTIIALGIRYDF 355
Q T +G+R+ F
Sbjct: 306 AGWLQEGKGESKFVSTAGGVGLRHKF 331


78  Sbal_2488  Sbal_2498  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Sbal_2488  3       15       -0.340989  phosphoenolpyruvate synthase
Sbal_2489  1       13       -0.430651  hypothetical protein
Sbal_2490  1       13       -0.546716  phospho-2-dehydro-3-deoxyheptonate aldolase
Sbal_2491  -1      12       -0.438829  glycoside hydrolase family protein
Sbal_2492  -1      12       -1.196057  thioesterase superfamily protein
Sbal_2493  -1      20       -2.718488  two component LuxR family transcriptional
Sbal_2494  -1      20       -2.817657  transcriptional regulator CysB
Sbal_2495  -1      21       -3.032080  hypothetical protein
Sbal_2496  -1      21       -2.867905  DNA topoisomerase I
Sbal_2497  0       24       -3.674685  succinylarginine dihydrolase
Sbal_2498  0       25       -3.992369  Ig domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_2488  PHPHTRNFRASE  297    3e-93    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 297 bits (761), Expect = 3e-93
Identities = 111/418 (26%), Positives = 187/418 (44%), Gaps = 65/418 (15%)

Query: 384 QPGDVLVTDMTDPDWEPIMK-RASAIVTNRGGRTCHAAIIARELGVPAVVGCGDVTDRIK 442
+ ++ D+T D + K T+ GGRT H+AI++R L +PAVVG +VT++I+
Sbjct: 155 EETVIIAEDLTPSDTAQLNKQFVKGFATDIGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQ 214

Query: 443 NGQIVTVSCAEG---------DTGFIYEGKQEFEVISNRVDSLPELP--------MKIMM 485
+G +V V EG + E + FE L P +++
Sbjct: 215 HGDMVIVDGIEGIVIVNPTEEEVKAYEEKRAAFEKQKQEWAKLVGEPSTTKDGAHVELAA 274

Query: 486 NVGNPDRAFDFARLPNEGVGLARLEFIINRMIGIHPKALLEFNQQDAALQTEINEMIAGY 545
N+G P EG+GL R EF+ M L TE E Y
Sbjct: 275 NIGTPKDVDGVLANGGEGIGLYRTEFLY--MD-------------RDQLPTE-EEQFEAY 318

Query: 546 ESPVEFYIARLVEGIATIGSAFYPKKVIVRMSDFKSNEYANLVGGDRYEPEEENPMLGFR 605
+ V+ K V++R D ++ + + P+E NP LGFR
Sbjct: 319 KEVVQ---------------RMDGKPVVIRTLDIGGDKELSYL----QLPKELNPFLGFR 359

Query: 606 GASRYISESFRDCFALECEAIKRVRNDMGLKNVEVMIPFVRTVKEAEQVIGLLKEQGLER 665
+ + +D F + A+ R N++VM P + T++E Q +++E+ +
Sbjct: 360 AIRLCLEK--QDIFRTQLRALLRAS---TYGNLKVMFPMIATLEELRQAKAIMQEEKDKL 414

Query: 666 GKDG------LRVIMMCEVPSNALLADQFLEHFDGFSIGSNDLTQLTLGLDRDSGIISHL 719
+G + V +M E+PS A+ A+ F + D FSIG+NDL Q T+ DR + +S+L
Sbjct: 415 LSEGVDVSDSIEVGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAADRMNERVSYL 474

Query: 720 FDERDEAVKMLLSLAIKAAKTKGAYIGICGQGPSDHADFAAWLVEQGIDTVSLNPDTV 777
+ A+ L+ + IKAA ++G ++G+CG+ D L+ G+D S++ ++
Sbjct: 475 YQPYHPAILRLVDMVIKAAHSEGKWVGMCGEMAGDE-VAIPLLLGLGLDEFSMSATSI 531


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
Sbal_2491  MICOLLPTASE  37     7e-04    Microbial collagenase metalloprotease (M9) signature.
		>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature.

Length = 1104

Score = 36.6 bits (84), Expect = 7e-04
Identities = 38/177 (21%), Positives = 59/177 (33%), Gaps = 18/177 (10%)

Query: 81 WENKGVCDGAQNQAPTLVILQPQNNVSVNLGDVVVLQADAS-DVDGTVSSVNW-FANGQA 138
D N+ P VI +++ SV + + + S D DG + + W F +G+
Sbjct: 761 MNTDTNTDVHVNKEPKAVI---KSDSSVIVEEEINFDGTESKDEDGEIKAYEWDFGDGEK 817

Query: 139 VTSPWTT---NAIGSVQLKAVATDDKGATTEKSVVLTVINVTSENLPPSTEILSPLND-- 193
T N G ++K TD+ G +S I V + P ND
Sbjct: 818 SNEAKATHKYNKTGEYEVKLTVTDNNGGINTES---KKIKVVEDKPVEVINESEPNNDFE 874

Query: 194 SAVTIGDSVTISANASDPDTGDSITKVEFYLDSQ---LIATDNSAPYSATWQAAGVG 247
A I S + + D K F + + I +N TW G
Sbjct: 875 KANQIAKSNMLVKGTLSEE--DYSDKYYFDVAKKGNVKITLNNLNSVGITWTLYKEG 929


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_2492  TYPE3OMGPROT  29     0.007    Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature.

Length = 607

Score = 29.1 bits (65), Expect = 0.007
Identities = 13/44 (29%), Positives = 25/44 (56%), Gaps = 1/44 (2%)

Query: 79 VTVSSDRIDFKKPIPAGTLAELIARVIHVGNTSLKVEVNIYVED 122
V V+ + K I GT+ + RV+ G+ S ++ +N+++ED
Sbjct: 383 VKVTGKEVAELKGITYGTMLRMTPRVLTQGDKS-EISLNLHIED 425


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Sbal_2493  HTHFIS  79     2e-19    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.5 bits (196), Expect = 2e-19
Identities = 32/111 (28%), Positives = 53/111 (47%), Gaps = 6/111 (5%)

Query: 8 IIIADDHPLFRNALRQALTTAFEHAQWFEADSADALQAVL-DVRSVDYDLVLLDLQMPGS 66
I++ADD R L QAL+ A ++ + + + D DLV+ D+ MP
Sbjct: 6 ILVADDDAAIRTVLNQALSRAG-----YDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 67 HGYSTLIHLRSHYPDLPVVVISAHEDINTISRAIHYGSSGFIPKSASMETL 117
+ + L ++ PDLPV+V+SA T +A G+ ++PK + L
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTEL 111


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_2498  INTIMIN  42     1e-05    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 41.6 bits (97), Expect = 1e-05
Identities = 58/331 (17%), Positives = 117/331 (35%), Gaps = 49/331 (14%)

Query: 25 SISNDGGTTPTPGVVTVTMGISNIDISAIAPAEVTAKVVDSKLGPLAGILVTFSLSDSNI 84
++ ++G GV T ++ TA V + + A + V+F++
Sbjct: 547 TVLSNGQVVDQVGVTDFTADKTSAKADGTEAITYTATVKKNGV-AQANVPVSFNI---VS 602

Query: 85 GSFTPSIGTALTDVNGVASITLVTATIAG------------AGTVSAKLDSGESGKIGFN 132
G+ S +A T+ +G A++TL + A +A + ++
Sbjct: 603 GTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANAVIFVDQTKASITE 662

Query: 133 MKGDGGVAGGGAQVSLVLT---DANGAPI--------QSISTLSPGKLVATVTGIKKPTI 181
+K D A Q ++ T P+ ++ LS G K T+
Sbjct: 663 IKADKTTAVANGQDAITYTVKVMKGDKPVSNQEVTFTTTLGKLSNSTEKTDTNGYAKVTL 722

Query: 182 VTFSSPIGDLPIKTAVTNAQGKA-TVDIYAGSALGAG--EVMASLVTGEVGNTIVVVGAT 238
+ + + + + KA V+ + + G E++ + V G++ + G
Sbjct: 723 TSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNIEIVGTGVKGKLPTVWLQYGQV 782

Query: 239 NVVMGSGN---PFVKGKAAVSSTTVSAG-------GTATISVLIQDDLGNAFTQPVDVNF 288
N+ GN + A++S S+G GT TISV+ D+ Q
Sbjct: 783 NLKASGGNGKYTWRSANPAIASVDASSGQVTLKEKGTTTISVISSDN------QTATYTI 836

Query: 289 SSTCATKIPAQAQISSPVSSSNGVATSTYLA 319
++ + + +S V+ ++ V T
Sbjct: 837 ATPNSLIV---PNMSKRVTYNDAVNTCKNFG 864


79  Sbal_2710  Sbal_2714  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Sbal_2710  -3      17        1.086547  IucA/IucC family protein
Sbal_2711  -2      17       -0.266022  TonB-dependent siderophore receptor
Sbal_2712  -2      17       -1.692205  ferric iron reductase
Sbal_2713  -1      17       -3.554775  intracellular septation protein A
Sbal_2714   1      19       -3.732149  YciI-like protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_2710  PF04183  621    0.0      IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 621 bits (1603), Expect = 0.0
Identities = 168/593 (28%), Positives = 291/593 (49%), Gaps = 22/593 (3%)

Query: 42 LTPAYWQAANRHLVKKILCEFTHEKIITPTLYGQKARLNHYELRLKDSTYYFSARHYQLD 101
+ W NR LV K+L E +E++ G + Y + L + + F A
Sbjct: 1 MNHKDWDLVNRRLVAKMLSELEYEQVFHAESQGD----DRYCINLPGAQWRFIAERGIWG 56

Query: 102 HLAIDADSIRVSVAGQEQALDAMSLIISLKSDLGISETLLPTYLEEITSTLYSKAYKL-A 160
L IDA ++R ++ + A +L++ LK L +S+ + +++++ +TL L A
Sbjct: 57 WLWIDAQTLRC----ADEPVLAQTLLMQLKQVLSMSDATVAEHMQDLYATLLGDLQLLKA 112

Query: 161 HQAIPAATLARADYQSIEAGMTEGHPVFIANNGRIGFDMQDYRQFAPESAMPMQLVWLGV 220
+ + A+ L + ++ + GHP F+ N GR G+ + ++APE A +L WL V
Sbjct: 113 RRGLSASDLINLNADRLQC-LLSGHPKFVFNKGRRGWGKEALERYAPEYANTFRLHWLAV 171

Query: 221 RKSKTTFAALENLSHDALLKEELG-QQFTDFQQRLKTQQHDPQDFYFMPVHPWQWREKIA 279
++ + + LL + Q+F F Q + D ++ +PVHPWQW++KIA
Sbjct: 172 KREHMIWRCDNEMDIHQLLTAAMDPQEFARFSQVWQENGLD-HNWLPLPVHPWQWQQKIA 230

Query: 280 RVFAGDIARGDLVYLGEGSEQYQVQQSIRTFFNLSSPQKCYVKTALSILNMGFMRGLSPL 339
F D A G +V LGE +Q+ QQS+RT N S +K L+I N RG+
Sbjct: 231 TDFIADFAEGRMVSLGEFGDQWLAQQSLRTLTNASRRGGLDIKLPLTIYNTSCYRGIPGR 290

Query: 340 YMSCTPQINAWVANLVESDPYFTQQGFVILKEIAAIGYHHHYYEQALTQDSAYKKMLSAL 399
Y++ P + W+ + +D Q G VIL E AA H Y Y++ML +
Sbjct: 291 YIAAGPLASRWLQQVFATDATLVQSGAVILGEPAAGYVSHEGYAALARAPYRYQEMLGVI 350

Query: 400 WRESPLPHIAPKQNLMTMAALLHTDHEDKALISALITASGLPAKDWLSRYLNLYLSPLLH 459
WRE+P + P ++ + MA L+ D ++ L A I SGL A+ WL++ + + PL H
Sbjct: 351 WRENPCRWLKPDESPVLMATLMECDENNQPLAGAYIDRSGLDAETWLTQLFRVVVVPLYH 410

Query: 460 AFFAYDLVFMPHGENLILVLDEYVPVKILMKDIGEEVAVLNGTSP----LPDDVKRLAVS 515
Y + + HG+N+ L + E VP ++L+KD ++ ++ P LP +V+ +
Sbjct: 411 LLCRYGVALIAHGQNITLAMKEGVPQRVLLKDFQGDMRLVKEEFPEMDSLPQEVRDVTSR 470

Query: 516 LEEEMKLNYILLDIFDCIFRYLAPLIDEQTSVSESQFWELVADNVRDYQAQHPHLAAKFA 575
L + ++ + F + R+++PL+ + V E +F++L+A + DY +HP ++ +FA
Sbjct: 471 LSADYLIHDLQTGHFVTVLRFISPLMV-RLGVPERRFYQLLAAVLSDYMKKHPQMSERFA 529

Query: 576 QYDLFKDSFVRTCLNRIQLNNNQQMIDLADREKNL-RFAGGIDNPLAAFRQSH 627
+ LF+ +R LN ++L DL + L + + NPL Q +
Sbjct: 530 LFSLFRPQIIRVVLNPVKL----TWPDLDGGSRMLPNYLEDLQNPLWLVTQEY 578


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_2711  PRTACTNFAMLY  30     0.030    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 30.4 bits (68), Expect = 0.030
Identities = 31/130 (23%), Positives = 43/130 (33%), Gaps = 21/130 (16%)

Query: 231 DSGSVRGRVVAAYQDKDSFQDRYEQQRTTLYGIVETDIGDSTLFTLGVDYQDATPSGTMS 290
D+G GR A Q D+ R Q + G F LG D+ A G
Sbjct: 645 DAGGAWGRGFAQRQQLDNRAGRRFDQ--KVAG-----------FELGADHAVAVAGGRWH 691

Query: 291 GGLPLFYSDGSRTNYDRATSTAPDWGSAHTQGLNTFASLEHRFDNGWNLKGTYTYGDNSL 350
G Y+ G R G HT ++ + D+G+ L T
Sbjct: 692 LGGLAGYTRGDRG--------FTGDGGGHTDSVHVGGYATYIADSGFYLDATLRASRLEN 743

Query: 351 EFDVLWATGY 360
+F V + GY
Sbjct: 744 DFKVAGSDGY 753


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_2712  2FE2SRDCTASE  101    6e-27    Ferric iron reductase signature.
		>2FE2SRDCTASE#Ferric iron reductase signature.

Length = 262

Score = 101 bits (253), Expect = 6e-27
Identities = 69/249 (27%), Positives = 94/249 (37%), Gaps = 71/249 (28%)

Query: 118 PSRKDTRKALHSLWGQWYFGLLVPPMMEWIFNAPKTDVETLHWQPQSVFMQVHPSGRVAK 177
P K L SLW QWY GL+VPP+M + K L P+ + H +GRVA
Sbjct: 82 PMMIRENKPLISLWAQWYIGLMVPPLMLALLTQEK----ALDVSPEHFHAEFHETGRVAC 137

Query: 178 FEFKIAKYQPITALTFKKHHGIEPLSRTNTKPSIKIDTEVHSPLSPYKPSVDKELVLQGF 237
F + + + T HSP +
Sbjct: 138 FWVDVCEDKNATP---------------------------HSPQHRM----------ETL 160

Query: 238 ILNLLQSSVDRLLTLSPVPAKLYWSHLGYLIHWYLGELG--LSQQQNQQLKQALFRRTTF 295
I L V L + KL WS+ GYLI+WYL E+ L + + L+ ALF T
Sbjct: 161 ISQALVPVVQALEATGEINGKLIWSNTGYLINWYLTEMKQLLGEATVESLRHALFFEKTL 220

Query: 296 LDGSTNPLYNSINLLIEPEQDSATPYTVARIVTSTASRSKPSPKIHCIRRTCCLRYQLAN 355
+G NPL+ ++ L +D +RRTCC RY+L +
Sbjct: 221 TNGEDNPLWRTVVL-----RDGLL-----------------------VRRTCCQRYRLPD 252

Query: 356 TGQCHDCPL 364
QC DC L
Sbjct: 253 VQQCGDCTL 261


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
Sbal_2714  adhesinmafb  25     0.042    Neisseria meningitidis: adhesin MafB signature.
		>adhesinmafb#Neisseria meningitidis: adhesin MafB signature.

Length = 467

Score = 25.4 bits (55), Expect = 0.042
Identities = 9/44 (20%), Positives = 14/44 (31%)

Query: 54 AGFSGSLVVADFESLVAAKHWADADPYIEAGVYKSVVVKPFKRV 97
G GS+ + + A W +P V V +V
Sbjct: 279 IGGLGSVAGFEKNTREAVDRWIQENPNAAETVEAVFNVAAAAKV 322


80  Sbal_2905  Sbal_2949  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Sbal_2905   0      21       -2.051350  chemotaxis-specific methylesterase
Sbal_2906   0      20       -1.980156  CheA signal transduction histidine kinase
Sbal_2907  -1      20       -2.442721  chemotaxis phosphatase, CheZ
Sbal_2908  -1      18       -1.964123  response regulator receiver protein
Sbal_2909  -1      18       -1.733683  flagellar biosynthesis sigma factor
Sbal_2910   0      17       -1.695805  cobyrinic acid ac-diamide synthase
Sbal_2911  -1      17       -1.841739  flagellar biosynthesis regulator FlhF
Sbal_2912  -1      18       -2.126718  flagellar biosynthesis protein FlhA
Sbal_2913  -2      20       -2.467936  flagellar biosynthesis protein FlhB
Sbal_2914   0      19       -2.939911  flagellar biosynthesis protein FliR
Sbal_2915   1      20       -3.150596  flagellar biosynthetic protein FliQ
Sbal_2916   3      21       -3.465690  flagellar biosynthesis protein FliP
Sbal_2917   3      22       -3.564695  flagellar biosynthesis protein FliO
Sbal_2918   0      19       -1.620138  flagellar motor switch protein
Sbal_2919   1      19       -1.231222  flagellar motor switch protein FliM
Sbal_2920   1      16       -0.715394  flagellar basal body-associated protein FliL
Sbal_2921   0      14       -0.274948  flagellar hook-length control protein
Sbal_2922  -1      13        0.924828  flagellar export protein FliJ
Sbal_2923  -2      14        0.970259  flagellum-specific ATP synthase
Sbal_2924   0      13       -0.208565  flagellar assembly protein H
Sbal_2925  -2      13       -0.475876  flagellar motor switch protein G
Sbal_2926  -1      16       -1.034462  flagellar MS-ring protein
Sbal_2927   0      19       -2.131006  flagellar hook-basal body complex subunit FliE
Sbal_2928   0      23       -3.754116  two component, sigma54 specific, Fis family
Sbal_2929   2      26       -4.649697  PAS/PAC sensor signal transduction histidine
Sbal_2930   2      29       -5.635236  sigma-54 dependent trancsriptional regulator
Sbal_2931   4      34       -6.919328  flagellar protein FliS
Sbal_2932   3      28       -5.904363  hypothetical protein
Sbal_2933   1      20       -3.622108  flagellar hook-associated 2 domain-containing
Sbal_2934   0      15       -1.807936  flagellar protein FlaG protein
Sbal_2935  -1      14       -1.494698  flagellin domain-containing protein
Sbal_2936  -1      13       -0.814564  flagellin domain-containing protein
Sbal_2937  -2      12       -0.183521  flagellar hook-associated protein FlgL
Sbal_2938  -2      11        0.690812  flagellar hook-associated protein FlgK
Sbal_2939  -1      16       -0.136509  flagellar rod assembly protein/muramidase FlgJ
Sbal_2940   1      17       -0.510690  flagellar basal body P-ring protein
Sbal_2941   0      17       -0.697149  flagellar basal body L-ring protein
Sbal_2942   0      20       -0.853898  flagellar basal body rod protein FlgG
Sbal_2943  -1      23       -1.483443  flagellar basal body rod protein FlgF
Sbal_2944  -1      25       -2.711305  flagellar hook protein FlgE
Sbal_2945  -1      22       -3.419454  flagellar basal body rod modification protein
Sbal_2946   0      22       -3.853229  flagellar basal body rod protein FlgC
Sbal_2947   1      21       -4.264755  flagellar basal body rod protein FlgB
Sbal_2948   1      20       -4.111669  protein-glutamate O-methyltransferase
Sbal_2949   0      19       -3.982265  putative CheW protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Sbal_2905  HTHFIS  69     4e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 69.5 bits (170), Expect = 4e-15
Identities = 32/168 (19%), Positives = 65/168 (38%), Gaps = 9/168 (5%)

Query: 2 AIKVLVVDDSSFFRRRVSEIVNQDPELEVIATASNGAEAVKMAAELNPQVITMDIEMPVM 61
+LV DD + R +++ +++ + SN A + A + ++ D+ MP
Sbjct: 3 GATILVADDDAAIRTVLNQALSR--AGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 62 DGITAVREIMAKCP-TPILMFSSLTHDGAKATLDALDAGALDFLPKRFEDIATNKDDAIL 120
+ + I P P+L+ S+ + + A + GA D+LPK F D+
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSA--QNTFMTAIKASEKGAYDYLPKPF-DLTELIGIIGR 117

Query: 121 LLQQRVKALGRRRMFRPIARPVVASTPSMRPTSSVLGTTSIASHTPAT 168
L + + + P+V + +M+ + + T T
Sbjct: 118 ALAEPKRRPSKLEDDSQDGMPLVGRSAAMQ---EIYRVLARLMQTDLT 162


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_2906  PF06580  45     6e-07    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 45.2 bits (107), Expect = 6e-07
Identities = 18/105 (17%), Positives = 37/105 (35%), Gaps = 23/105 (21%)

Query: 439 TLNKEIDLIMV---------GEETDLDKNLVEALADPLVH------LVRNSVDHGIEMPN 483
+L E+ ++ + + + A+ D V LV N + HGI
Sbjct: 217 SLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVENGIKHGIA--- 273

Query: 484 EREANGKPRTGTITLSASQEGDHILLKIEDDGAGMDPEKLKKIAI 528
P+ G I L +++ + L++E+ G+ +
Sbjct: 274 -----QLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGT 313


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
Sbal_2907  SALSPVBPROT  29     0.029    Salmonella virulence plasmid 65kDa B protein signature.
		>SALSPVBPROT#Salmonella virulence plasmid 65kDa B protein signature.

Length = 591

Score = 28.6 bits (63), Expect = 0.029
Identities = 14/37 (37%), Positives = 19/37 (51%)

Query: 154 IRLRELLNQILMAQDFQDLTGQMIRRVIDLVMEVESN 190
IRL L Q+LM F D G+ V L++E + N
Sbjct: 302 IRLHRLCRQVLMFHHFPDELGEADTLVSRLLLEYDEN 338


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Sbal_2908  HTHFIS  90     3e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 89.9 bits (223), Expect = 3e-24
Identities = 33/105 (31%), Positives = 51/105 (48%), Gaps = 3/105 (2%)

Query: 6 KILIVDDFSTMRRIIKNLLRDLGFNNTQEADDGSTALPMLQKGDFDFVVTDWNMPGMQGI 65
IL+ DD + +R ++ L G++ + +T + GD D VVTD MP
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 66 DLLKAIRADDSLKHLPVLMVTAEAKREQIIAAAQAGVNGYVVKPF 110
DLL I+ LPVL+++A+ I A++ G Y+ KPF
Sbjct: 64 DLLPRIKKAR--PDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPF 106


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_2911  PF05272  31     0.013    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.8 bits (69), Expect = 0.013
Identities = 9/25 (36%), Positives = 12/25 (48%)

Query: 240 VKQGGVVALVGPTGVGKTTSLAKLA 264
K V L G G+GK+T + L
Sbjct: 593 CKFDYSVVLEGTGGIGKSTLINTLV 617


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_2913  TYPE3IMSPROT  331    e-114    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 331 bits (850), Expect = e-114
Identities = 94/347 (27%), Positives = 178/347 (51%), Gaps = 2/347 (0%)

Query: 6 SGERSEEPTGRRIEQAREKGQIARSKELGTAAVLISAACGFYMLGPSLATSLTRVFETVF 65
SGE++E+PT ++I AR+KGQ+A+SKE+ + A++++ + L +++ +
Sbjct: 2 SGEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLM--LI 59

Query: 66 TMDRAQIFDTEEMFNVWGVVASEIAWPMAKIMLLIVVVAFIGNVALGGMNFSTQAMMPKA 125
+++ + ++ + V V E + ++ + ++A +V G S +A+ P
Sbjct: 60 PAEQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDI 119

Query: 126 SKMSPAAGFKRMFGVQALVELTKGIAKFSVVAFSAYLLLSFYFNDIMLLSSDHLPGNVYH 185
K++P G KR+F +++LVE K I K +++ ++++ ++ L + +
Sbjct: 120 KKINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPL 179

Query: 186 ALDLLVWMFILLCSSILLIVVIDVPFQIWNHNKQLKMTKQEVKDEYKDTEGKPEVKGRVR 245
+L + ++ ++I + D F+ + + K+LKM+K E+K EYK+ EG PE+K + R
Sbjct: 180 LGQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRR 239

Query: 246 QMQHELAQRRMMAEVPNADVIVVNPEHFAVAIKYDVQRSAAPFVIAKGVDDVAFKIREIA 305
Q E+ R M V + V+V NP H A+ I Y + P V K D +R+IA
Sbjct: 240 QFHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIA 299

Query: 306 REHNIAIVSAPPLARAIYHTTKLDQQIPEGLFTAVAQILAYVFQLRQ 352
E + I+ PLARA+Y +D IP A A++L ++ +
Sbjct: 300 EEEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQNI 346


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_2914  TYPE3IMRPROT  124    1e-36    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.

Length = 261

Score = 124 bits (313), Expect = 1e-36
Identities = 93/243 (38%), Positives = 143/243 (58%), Gaps = 1/243 (0%)

Query: 15 YMWPLFRVASMLMVMVVFGAATTPSRVRLLLAMAITFAIAPVLPPVQNADLFSLSAVFIT 74
Y WPL RV +++ + + P RV+L LAM ITFAIAP LP + +FS A+++
Sbjct: 16 YFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLP-ANDVPVFSFFALWLA 74

Query: 75 AQQIIIGVAMGFVTQMVMQVFVLTGQIIGMQTSLGFASMVDPGSGQQTPVIGNFFLLLAT 134
QQI+IG+A+GF Q G+IIG+Q L FA+ VDP S PV+ +LA
Sbjct: 75 VQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLARIMDMLAL 134

Query: 135 LIFLAVDGHLLMIRMLVASFETLPISNQGLTLTSYRALADWGSYMFGAALTMSISAIIAL 194
L+FL +GHL +I +LV +F TLPI + L ++ AL GS +F L +++ I L
Sbjct: 135 LLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIFLNGLMLALPLITLL 194

Query: 195 LLVNLSFGVMTRAAPQLNIFSIGFPITMIGGLFILWLTLTPVMEHFDEVWAAAQVLLCDM 254
L +NL+ G++ R APQL+IF IGFP+T+ G+ ++ + + + +++ LL D+
Sbjct: 195 LTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFCEHLFSEIFNLLADI 254

Query: 255 LAL 257
++
Sbjct: 255 ISE 257


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_2915  TYPE3IMQPROT  48     3e-11    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 48.2 bits (115), Expect = 3e-11
Identities = 21/78 (26%), Positives = 40/78 (51%)

Query: 4 EALIDIFREALAVIVMMVSAIVLPGLGIGLIVAVFQAATSINEQTLSFLPRLLVTLFGLM 63
+ L+ +AL +++++ + IGL+V +FQ T + EQTL F +LL L
Sbjct: 2 DDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLF 61

Query: 64 FMGHWLVETLMDFFVEMV 81
+ W E L+ + +++
Sbjct: 62 LLSGWYGEVLLSYGRQVI 79


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_2916  FLGBIOSNFLIP  278    3e-97    Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature.

Length = 245

Score = 278 bits (713), Expect = 3e-97
Identities = 126/244 (51%), Positives = 184/244 (75%), Gaps = 3/244 (1%)

Query: 4 RILALVGLVILLCMPSAWAADGVLPAVTVTTGPDGSTEYSVTMQILLLMTSLSFLPAMLI 63
R+L++ +++ L P A+A LP +T P G +S+ +Q L+ +TSL+F+PA+L+
Sbjct: 3 RLLSVAPVLLWLITPLAFAQ---LPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILL 59

Query: 64 MLTSFTRIIIVLSILRQAIGLQQTPSNQVLIGMSLFMTFFIMAPVFDRIYDEGVKPYIEE 123
M+TSFTRIIIV +LR A+G P NQVL+G++LF+TFFIM+PV D+IY + +P+ EE
Sbjct: 60 MMTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEE 119

Query: 124 QLTLQQAFEKGKEPLKGFMLGQVRTTDLKTFIEISGYKNIKSPEEAPMSVLIPAFITSEL 183
++++Q+A EKG +PL+ FML Q R DL F ++ ++ PE PM +L+PA++TSEL
Sbjct: 120 KISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSEL 179

Query: 184 KTAFQIGFMLFVPFLVLDLVVASILMAMGMMMLSPMIVSLPFKIMLFVLVDGWSLVLGTL 243
KTAFQIGF +F+PFL++DLV+AS+LMA+GMMM+ P ++LPFK+MLFVLVDGW L++G+L
Sbjct: 180 KTAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSL 239

Query: 244 ANSF 247
A SF
Sbjct: 240 AQSF 243


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_2918  FLGMOTORFLIN  109    4e-34    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 109 bits (274), Expect = 4e-34
Identities = 53/119 (44%), Positives = 79/119 (66%)

Query: 7 DDWAAAMAEQALEEANAIELDELVDDSRPITKAEAAKLDTILDIPVTISMEVGRSYISIR 66
D WA A+ EQ + +D I+DIPV +++E+GR+ ++I+
Sbjct: 17 DLWADALNEQKATTTKSAADAVFQQLGGGDVSGAMQDIDLIMDIPVKLTVELGRTRMTIK 76

Query: 67 NLLQLNQGSVVELDRVAGEPLDVMVNGTLIAHGEVVVVNDKFGIRLTDVISQTERIKKL 125
LL+L QGSVV LD +AGEPLD+++NG LIA GEVVVV DK+G+R+TD+I+ +ER+++L
Sbjct: 77 ELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGVRITDIITPSERMRRL 135


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_2919  FLGMOTORFLIM  249    6e-83    Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 249 bits (637), Expect = 6e-83
Identities = 87/326 (26%), Positives = 163/326 (50%), Gaps = 11/326 (3%)

Query: 1 MSDLLSQDEIDALLHGVDDVDDDEIDAVGE----DARSYDFSSQDRIVRGRMPTLEIVNE 56
M+++LSQDEID LL + D DA YDF D+ + +M TL +++E
Sbjct: 1 MTEVLSQDEIDQLLTAISSGDASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLMHE 60

Query: 57 RFARHLRISMFNMMRRAAEVSINGVQMLKFGEYVHTLFVPTSLNMVRFSPLKGTALITME 116
FAR S+ +R V + V L + E++ ++ P++L ++ PLKG A++ ++
Sbjct: 61 TFARLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVLEVD 120

Query: 117 ARLVFILVDNFFGGDGRFHAKIEGREFTPTERRIVQLLLKIIFEDYKDAWAPVMDVEFDY 176
+ F ++D FGG G+ R+ T E +++ ++ I + +++W V+D+
Sbjct: 121 PSITFSIIDRLFGGTGQAAKVQ--RDLTDIENSVMEGVIVRILANVRESWTQVIDLRPRL 178

Query: 177 LDSEVNPAMANIVSPTEVVVINSFHIEVDGGGGDFHITMPYSMIEPIRELLDAG--VQSD 234
E NP A IV P+E+VV+ + +V G + +PY IEPI L + S
Sbjct: 179 GQIETNPQFAQIVPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFSSV 238

Query: 235 KQDTDMRWSQALHDEIMDVKVGFDANIVEHELTLKDVMNFKAGDIIPIE---LPEYIMMK 291
++ + ++ L D++ V + A + L+++D++ + GDII + + + ++
Sbjct: 239 RRSSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFVLS 298

Query: 292 IEDLPTYRCKMGRSRDNLALKIYEKI 317
I + + C+ G +A +I E+I
Sbjct: 299 IGNRKKFLCQPGVVGKKIAAQILERI 324


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
Sbal_2921  FLGHOOKFLIK  51     5e-09    Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 51.4 bits (122), Expect = 5e-09
Identities = 36/132 (27%), Positives = 64/132 (48%), Gaps = 5/132 (3%)

Query: 411 MKQQLITMVSQGIQHAEIRLDPPELGHMLVKIQVHGDQTQVQFHVTQTQTRDLVEQAMPR 470
+ Q + QG Q AE+RL P +LG + + ++V +Q Q+Q R +E A+P
Sbjct: 244 LSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAALPV 303

Query: 471 LRELLQEQGMQLADSHVSQGGQGERREGGFGDGGGSNGADVDEISAEE-----LHLGLNQ 525
LR L E G+QL S++S +++ A+ + ++ E+ + + L
Sbjct: 304 LRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVSLQG 363

Query: 526 ATSVNSGIDYYA 537
+ NSG+D +A
Sbjct: 364 RVTGNSGVDIFA 375


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_2922  FLGFLIJ  44     2e-08    Flagellar FliJ protein signature.
		>FLGFLIJ#Flagellar FliJ protein signature.

Length = 147

Score = 44.0 bits (103), Expect = 2e-08
Identities = 39/145 (26%), Positives = 69/145 (47%)

Query: 1 MANADPLLLVLKLANDAEEQAALLLKSAQLECQKRLNQLSALNNYRLEYMKQMQSQQGQA 60
MA L + LA E AA LL + CQ+ QL L +Y+ EY + S
Sbjct: 1 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG 60

Query: 61 ISASHYHQFHRFIRQIDDAITQQNRVVADGETQKEYRQQHWLEKQKKRKAVELLLASKEK 120
I+++ + + +FI+ ++ AITQ + + + + W EK+++ +A + L +
Sbjct: 61 ITSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQST 120

Query: 121 KRQVVEQKREQKMTDEFASQQFYRR 145
+ E + +QK DEFA + R+
Sbjct: 121 AALLAENRLDQKKMDEFAQRAAMRK 145


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_2924  FLGFLIH  89     7e-23    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 89.1 bits (220), Expect = 7e-23
Identities = 57/201 (28%), Positives = 102/201 (50%), Gaps = 4/201 (1%)

Query: 50 AAKPTTVESVSPPTMAEIEDIRAQAEEEGFA---EGKQQGYEQGLEKGRLEGLEQGHTEG 106
A + P IE+ E++ + +QGY+ G+ +GR +G +QG+ EG
Sbjct: 16 APPQAEFVPIVEPEETIIEEAEPSLEQQLAQLQMQAHEQGYQAGIAEGRQQGHKQGYQEG 75

Query: 107 LAQGHEQGLETGLAQAKVLLSRFEALLTQFEKPLQLLDGDIELSLLNLSMTLAKSVIGHE 166
LAQG EQGL +Q + +R + L+++F+ L LD I L+ +++ A+ VIG
Sbjct: 76 LAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRLMQMALEAARQVIGQT 135

Query: 167 LKTHPEQVLSALRLGIESLPIKEQAVTIRLHPDDVILVEQLYSTAQLTRSKWELEVDPTL 226
++ ++ ++ P+ +R+HPDD+ V+ + A L+ W L DPTL
Sbjct: 136 PTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLG-ATLSLHGWRLRGDPTL 194

Query: 227 SAGDCILSSHRSLVDLTLSSR 247
G C +S+ +D ++++R
Sbjct: 195 HPGGCKVSADEGDLDASVATR 215


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_2925  FLGMOTORFLIG  287    1e-97    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 287 bits (735), Expect = 1e-97
Identities = 109/350 (31%), Positives = 195/350 (55%), Gaps = 7/350 (2%)

Query: 1 MAENKTKEVAPAAPPAFNIKDISGVEKTAILLLSLSEADAASILKHLEPKQVQKVGMAMA 60
M E K KE+ ++ ++G +K AILL+S+ ++ + K+L ++++ + +A
Sbjct: 1 MEEKKEKEIL-------DVSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIA 53

Query: 61 AMDEFGQEKVIGVHKLFLDDIQKYSSIGFNSEEFVRKALTAALGEDKAGNLIEQIIMGSG 120
++ E V F + + I ++ R+ L +LG KA ++I +
Sbjct: 54 KLETITSELKDNVLLEFKELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQ 113

Query: 121 AKGLDSLKWMDARQVATIIQNEHPQIQTIVLSYLEPDQAAEIFGQFPENTRLDLMMRIAN 180
++ + ++ D + IQ EHPQ ++LSYL+P +A+ I P + ++ RIA
Sbjct: 114 SRPFEFVRRADPANILNFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIAL 173

Query: 181 LEEVQPAALQELNDIMEKQFAGQGGAQAAKMGGLKAAANIMNYLDTGIESQLMETMRESD 240
++ P ++E+ ++EK+ A GG+ I+N D E ++E++ E D
Sbjct: 174 MDRTSPEVVREVERVLEKKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEED 233

Query: 241 EEMAQQIQDLMFVFENLIDVDDRGIQALLREVQQDVLMKALKGTDDQLKEKILGNMSKRA 300
E+A++I+ MFVFE+++ +DDR IQ +LRE+ L KALK D ++EKI NMSKRA
Sbjct: 234 PELAEEIKKKMFVFEDIVLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRA 293

Query: 301 AELLRDDLEAMGPIRISEVEVAQKEILSIARRLSDSGEIMLGGGGGDEFL 350
A +L++D+E +GP R +VE +Q++I+S+ R+L + GEI++ GG ++ L
Sbjct: 294 ASMLKEDMEFLGPTRRKDVEESQQKIVSLIRKLEEQGEIVISRGGEEDVL 343


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_2926  FLGMRINGFLIF  305    5e-99    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 305 bits (782), Expect = 5e-99
Identities = 161/560 (28%), Positives = 266/560 (47%), Gaps = 42/560 (7%)

Query: 25 NLGGVDMMRQVTMILALAICLALAVFVMLWAQEPEYRPL-GKMETQEMVQVLDVLDKNKV 83
L + ++ +I+A + +A+ V ++LWA+ P+YR L + Q+ ++ L + +
Sbjct: 15 WLNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQMNI 74

Query: 84 KYQIDVD--VIKVPEDKYQEVKMMLSRAGVDSPAASQDFLNQDSGFGVSQRMEQARLKHS 141
Y+ I+VP DK E+++ L++ G+ A L FG+SQ EQ + +
Sbjct: 75 PYRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQFSEQVNYQRA 134

Query: 142 QEENLARAIEQLQSVSRAKVILALPKENVFARNASKPSATVVINTRRG-GLGQGEVDAIV 200
E LAR IE L V A+V LA+PK ++F R PSA+V + G L +G++ A+V
Sbjct: 135 LEGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALDEGQISAVV 194

Query: 201 DIVASAVQGLEPSRVTVTDSNGRLLNSGSQDGASATARRELELVQQKEAEYRTKIESILV 260
+V+SAV GL P VT+ D +G LL + G +L+ E+ + +IE+IL
Sbjct: 195 HLVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDL-NDAQLKFANDVESRIQRRIEAILS 253

Query: 261 PILGPDNFTSQVDVSMDFTAVEQTSKRYNPDLPALRSEMTVENNTT-----GGSSGGIPG 315
PI+G N +QV +DF EQT + Y+P+ A ++ + G GG+PG
Sbjct: 254 PIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVGAGYPGGVPG 313

Query: 316 ALSNQPP---------------MESNIPQDAT-KATESVTAGNSHREATRNFELDTTISH 359
ALSNQP N PQ +T + S ++ R T N+E+D TI H
Sbjct: 314 ALSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSNYEVDRTIRH 373

Query: 360 TRQQVGAVRRISVSVAVDFKPGAAGENGQVARVARTEQELTNIRRLLEGAVGFSSQRGDV 419
T+ VG + R+SV+V V++K A G+ + T ++ I L A+GFS +RGD
Sbjct: 374 TKMNVGDIERLSVAVVVNYKTLADGKP-----LPLTADQMKQIEDLTREAMGFSDKRGDT 428

Query: 420 LEVVTVPFMDQLVEDLPALELWEQPWFWRAIKLGIGALVILVLILAVVRPMLKRLIYPDS 479
L VV PF + L W+Q F + L L+L V + ++ + P
Sbjct: 429 LNVVNSPF-SAVDNTGGELPFWQQQSFIDQLLAAGRWL----LVLVVAWILWRKAVRPQL 483

Query: 480 VNMPEDGRLGNELAEIEDQYAADTLGMLNTQEAEYSYADDGSIHIPNLHKDDDMIKAIRA 539
E+ + E A++ + L+ + E + + + M + IR
Sbjct: 484 TRRVEEAKAAQEQAQVRQETEEAVEVRLS--KDEQLQQRRANQRLGA----EVMSQRIRE 537

Query: 540 LVANEPELSTQVVKNWLQDN 559
+ N+P + V++ W+ ++
Sbjct: 538 MSDNDPRVVALVIRQWMSND 557


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
Sbal_2927  FLGHOOKFLIE  57     6e-14    Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature.

Length = 103

Score = 56.6 bits (136), Expect = 6e-14
Identities = 29/86 (33%), Positives = 46/86 (53%)

Query: 26 QPNIMQQVNNTSGADFGQLLSQAVGNVSGLQSTSSNLATRLEMGDTTVSLSDTVIAREKA 85
Q+ F L A+ +S Q+ + A + +G+ V+L+D + +KA
Sbjct: 18 MSARAQESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTLGEPGVALNDVMTDMQKA 77

Query: 86 SVAFEATVQVRNKLVEAYKEIMSMPV 111
SV+ + +QVRNKLV AY+E+MSM V
Sbjct: 78 SVSMQMGIQVRNKLVAAYQEVMSMQV 103


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Sbal_2928  HTHFIS  457    e-161    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 457 bits (1178), Expect = e-161
Identities = 167/483 (34%), Positives = 249/483 (51%), Gaps = 42/483 (8%)

Query: 1 MSEAKLLLVEDDASLREALLDTLMLAQYECIDVASGEDAILALKQHQFDLVISDVQMQGI 60
M+ A +L+ +DDA++R L L A Y+ ++ + DLV++DV M
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 GGLGLLNFLQQHHPKLPVLLMTAYATIGSAVDAIKLGAVDYLAKPFAPEVLLNQVSRYLP 120
LL +++ P LPVL+M+A T +A+ A + GA DYL KPF L+ + R L
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 121 LKQNVDQPVVAD-----------EKSLALLALAQRVAASDASVMILGPSGSGKEVLARYI 169
+ + D + + R+ +D ++MI G SG+GKE++AR +
Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARAL 180

Query: 170 HQHSSRADQAFVAINCAAIPENMLEATLFGYEKGAFTGAYQACPGKFEQAQGGTLLLDEI 229
H + R + FVAIN AAIP +++E+ LFG+EKGAFTGA G+FEQA+GGTL LDEI
Sbjct: 181 HDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEI 240

Query: 230 SEMDLGLQAKLLRVLQEREVERLGGRKTIKLDVRVLATSNRDLKAVVAAGGFREDLYYRI 289
+M + Q +LLRVLQ+ E +GGR I+ DVR++A +N+DLK + G FREDLYYR+
Sbjct: 241 GDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRL 300

Query: 290 NVFPLAWPALSQRPADILPLARHLLVKHAKALNVADVPELDENARRRLLSHRWPGNVREL 349
NV PL P L R DI L RH + + A+ + V D+ A + +H WPGNVREL
Sbjct: 301 NVVPLRLPPLRDRAEDIPDLVRHFV-QQAEKEGLD-VKRFDQEALELMKAHPWPGNVREL 358

Query: 350 DNVIQRALILRAGQVITANDIIIDAQDVILG--------------------------GED 383
+N+++R L VIT I + + I
Sbjct: 359 ENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFAS 418

Query: 384 LDQFVAEPDGLGEELKAQEHVIILETLNQCQGSRKLVAEKLGISARTLRYKMARMRDMGI 443
+ L E+ +IL L +G++ A+ LG++ TLR K +R++G+
Sbjct: 419 FGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKK---IRELGV 475

Query: 444 QLP 446
+
Sbjct: 476 SVY 478
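
Note on the alignment statistics: each alignment block above opens with an "Identities / Positives / Gaps" line. The sketch below shows one way to read that line and recompute the percentages. The names parse_stats and percent are hypothetical helpers, not part of PredictBias or BLAST, and the truncating division is an assumption that happens to match the figures printed in this listing (for example 167/483 is reported as 34%).

```python
import re

# Pattern for the statistics line printed above each alignment block.
# The "Gaps" field is optional because ungapped alignments omit it.
STATS = re.compile(
    r"Identities = (\d+)/(\d+) \((\d+)%\), "
    r"Positives = (\d+)/(\d+) \((\d+)%\)"
    r"(?:, Gaps = (\d+)/(\d+) \((\d+)%\))?"
)


def parse_stats(line):
    """Return identities, positives and gaps as (count, length, percent) tuples."""
    m = STATS.search(line)
    if m is None:
        raise ValueError("not a BLAST statistics line")
    g = [int(x) if x is not None else None for x in m.groups()]
    return {
        "identities": tuple(g[0:3]),
        "positives": tuple(g[3:6]),
        # None when the report prints no Gaps field
        "gaps": tuple(g[6:9]) if g[6] is not None else None,
    }


def percent(count, length):
    """Integer percentage by truncation (assumed rounding rule)."""
    return count * 100 // length
```

For instance, parse_stats applied to the statistics line of the Sbal_2928 hit yields (167, 483, 34) for identities, and percent(167, 483) reproduces the printed 34.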


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_2929  PF06580  34     8e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 34.1 bits (78), Expect = 8e-04
Identities = 32/185 (17%), Positives = 71/185 (38%), Gaps = 34/185 (18%)

Query: 167 LSDAAKAKFQQKLVDRLNELERQVNDMLLMAKGRQDELGDLITLAEVIDNVLANCEPIAA 226
L D KA ++++ L+EL R + ++LA+ + V + + +
Sbjct: 187 LEDPTKA---REMLTSLSELMRYS---------LRYSNARQVSLADELTVVDSYLQLASI 234

Query: 227 KQGCDLSFD-DVSSSSMLANSNALSSAINNLVMNSIEAGAT------EIRIQAAEEGDQL 279
+ L F+ ++ + + + + LV N I+ G +I ++ ++ +
Sbjct: 235 QFEDRLQFENQINPA--IMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTV 292

Query: 280 LLNVIDNGKGLDANMQQKVLEPFFTTKSQGTGLGLA-VVQSVVRNHGGQLQLSCLPNKGC 338
L V + G N + + TG GL V + + +G + Q+ +G
Sbjct: 293 TLEVENTGSLALKNTK------------ESTGTGLQNVRERLQMLYGTEAQIKLSEKQGK 340

Query: 339 TVSLV 343
++V
Sbjct: 341 VNAMV 345


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Sbal_2930  HTHFIS  432    e-150    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 432 bits (1112), Expect = e-150
Identities = 171/481 (35%), Positives = 262/481 (54%), Gaps = 21/481 (4%)

Query: 7 RILLIGPPSERLNRLCCIFDFLGEQIAQI-DAEKLSASLQDTRFRALVILTDVMDADA-- 63
IL+ + L G + +A L + +++TDV+ D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGD--LVVTDVVMPDENA 62

Query: 64 ---LKNIAGQHPWQPMLLL---GNVDDLQVSNILG---NIEEPLTYPQLTELLHFCQVFG 114
L I P P+L++ ++ G + +P +L ++
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 115 QVKRPQVPTSANQTKLFRSLVGRSDGIANVRHLINQVATSEATVLVLGQSGTGKEVVARN 174
+ + ++ + LVGRS + + ++ ++ ++ T+++ G+SGTGKE+VAR
Sbjct: 123 KRRPSKLEDDSQD---GMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARA 179

Query: 175 IHYLSERRDGPFIPVNCGAIPPELLESELFGHEKGSFTGAICSRKGRFELAEGGTLFLDE 234
+H +RR+GPF+ +N AIP +L+ESELFGHEKG+FTGA GRFE AEGGTLFLDE
Sbjct: 180 LHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDE 239

Query: 235 IGDMPLQMQVKLLRVLQERVFERVGGTKTINADVRVVAATHRDLETMISVNEFREDLYYR 294
IGDMP+ Q +LLRVLQ+ + VGG I +DVR+VAAT++DL+ I+ FREDLYYR
Sbjct: 240 IGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYR 299

Query: 295 LNVFPIEMPALCDRKDDVPLLLQELVSRVYNEGRGKVRFTQRAIESLKEHAWSGNVRELS 354
LNV P+ +P L DR +D+P L++ V + EG RF Q A+E +K H W GNVREL
Sbjct: 300 LNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGNVRELE 359

Query: 355 NLVERLTILYPGGLVDVNDLPVKYRHIDVPEYCVEMSEEQQERDALASIFSDEEPVEIPE 414
NLV RLT LYP ++ + + R ++P+ +E + + +++ EE +
Sbjct: 360 NLVRRLTALYPQDVITREIIENELRS-EIPDSPIEKAAARSGSLSISQAV--EENMRQYF 416

Query: 415 TRFPSELPPEGVNLKDLLAELEIDMIRQALELQDNVVARAAEMLGIRRTTLVEKMRKYGM 474
F LPP G+ +LAE+E +I AL +AA++LG+ R TL +K+R+ G+
Sbjct: 417 ASFGDALPPSGL-YDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475

Query: 475 T 475
+
Sbjct: 476 S 476
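
Note on the E-value notation: some hits above report the expectation value without a leading digit ("Expect = e-150"), which is commonly read as 1e-150. The sketch below parses a "Score / Expect" line under that assumption. parse_score is a hypothetical helper, not a PredictBias or BLAST function.

```python
import re

# Bit score, raw score in parentheses, then the expectation value.
SCORE = re.compile(r"Score = ([\d.]+) bits \((\d+)\), Expect = ([\d.eE+-]+)")


def parse_score(line):
    """Return (bit_score, raw_score, e_value) from a BLAST score line."""
    m = SCORE.search(line)
    if m is None:
        raise ValueError("not a BLAST score line")
    bits, raw, expect = m.groups()
    # Assumption: a bare exponent such as "e-150" stands for 1e-150.
    if expect.startswith(("e", "E")):
        expect = "1" + expect
    return float(bits), int(raw), float(expect)
```

With this reading, E-values from different hits can be compared numerically, for example to sort the hits of an island by significance.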


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Sbal_2933  FLAGELLIN  29     0.044    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 28.9 bits (64), Expect = 0.044
Identities = 26/228 (11%), Positives = 54/228 (23%), Gaps = 2/228 (0%)

Query: 4 TATGIGSGLKINEIVQVLVDAEKKPKEAMFNKKEDSIKAKVSAMGTLKSALSTFQDALKK 63
G+ + V + V T KS T +
Sbjct: 207 VDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIA 266

Query: 64 LQTGDALNQRKITVSNETYLTATADKTAQAGSYGIKVEQLAVNHKVAGINVADPTLPVGE 123
T+ T G + V VA I +
Sbjct: 267 GAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTTINGEKVTLTVADITAGAANVDAAT 326

Query: 124 GSLDFGINGKNFSIDVSATDSIAAIAKKVNESSDNVGVTATVITSDAGSRLIFSSNKSGE 183
+ + + D + K+++ N V + G+ ++
Sbjct: 327 LQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESKITVNGAEYTANAAGDKV 386

Query: 184 DNQISITANDTSGTGLNDMFGAGNITSLQDAKNAIVYIDN--QKVTSQ 229
D + +G++ + + + N + ID+ KV +
Sbjct: 387 TLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPLASIDSALSKVDAV 434


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Sbal_2935  FLAGELLIN  131    3e-37    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 131 bits (330), Expect = 3e-37
Identities = 90/271 (33%), Positives = 130/271 (47%), Gaps = 8/271 (2%)

Query: 2 AITVNTNVTSLKAQKNLNSSATGLASSMERLSSGLRINSAKDDAAGLAISNRLNSQVRGL 61
A +NTN SL Q NLN S + L+S++ERLSSGLRINSAKDDAAG AI+NR S ++GL
Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60

Query: 62 DVGMRNANDAISIAQISEGAMQEQTNMLQRMRDLTIQAENGSNSTSDIGAIKSELDALAT 121
RNAND ISIAQ +EGA+ E N LQR+R+L++QA NG+NS SD+ +I+ E+
Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120

Query: 122 EITAIANTTAFGNTKLLDGSFSAGKTFQVGHQNGEDIKVSVGKVTASAIKVN------AS 175
EI ++N T F K+L QVG +GE I + + K+ ++ ++
Sbjct: 121 EIDRVSNQTQFNGVKVLSQDNQ--MKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPK 178

Query: 176 VILVSTAGQRSTSLSNIDAAIKTIDSQRAKLGAIQNRLAYNISNSANTQANVADAKSRIV 235
V +++ D + R + + + A
Sbjct: 179 EATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTT 238

Query: 236 DVDFAKETSQMTKNQVLQQTGSAMLAQANQL 266
D + K + A A +
Sbjct: 239 DDAENNTAVDLFKTTKSTAGTAEAKAIAGAI 269



Score = 86.6 bits (214), Expect = 2e-21
Identities = 63/215 (29%), Positives = 101/215 (46%), Gaps = 6/215 (2%)

Query: 60 GLDVGMRNANDAISIAQISEGAMQEQTNMLQRMRDLTIQAENGSNSTSDIGAIKSELDAL 119
+ + +++A I+ GA LQ +++ NG + D +S +
Sbjct: 298 KVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSD 357

Query: 120 ATEITAIANTTAFGNTKLLDGSFSAGKTFQVGHQNGEDIKVSVGKVTASAIKVNASVILV 179
A+ + + +AG + K TAS + +
Sbjct: 358 LEANNAVKGESKITVNGAEYTANAAGDKVTLA------GKTMFIDKTASGVSTLINEDAA 411

Query: 180 STAGQRSTSLSNIDAAIKTIDSQRAKLGAIQNRLAYNISNSANTQANVADAKSRIVDVDF 239
+ + L++ID+A+ +D+ R+ LGAIQNR I+N NT N+ A+SRI D D+
Sbjct: 412 AAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADY 471

Query: 240 AKETSQMTKNQVLQQTGSAMLAQANQLPQVALSLL 274
A E S M+K Q+LQQ G+++LAQANQ+PQ LSLL
Sbjct: 472 ATEVSNMSKAQILQQAGTSVLAQANQVPQNVLSLL 506


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Sbal_2936  FLAGELLIN  127    8e-36    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 127 bits (320), Expect = 8e-36
Identities = 93/271 (34%), Positives = 129/271 (47%), Gaps = 8/271 (2%)

Query: 2 AITVNTNVTSMKAQKNLNTSSSGLASSMERLSSGLRINSAKDDAAGLAISNRLNSQVRGL 61
A +NTN S+ Q NLN S S L+S++ERLSSGLRINSAKDDAAG AI+NR S ++GL
Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60

Query: 62 DVGMRNANDAISIAQISEGAMQEQTNMLQRMRDLTIQAENGANSTSDIDAIKSEIDALAS 121
RNAND ISIAQ +EGA+ E N LQR+R+L++QA NG NS SD+ +I+ EI
Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120

Query: 122 EISAIANSTAFGNTKLLDGTFSAGKTFQVGHQNKEDITVSVSKVTASALKV------QAS 175
EI ++N T F K+L QVG + E IT+ + K+ +L +
Sbjct: 121 EIDRVSNQTQFNGVKVLSQDNQ--MKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPK 178

Query: 176 VVKVSTAAQRSASLTNIDAAIKAIDTQRSNLGAIQNRLAYNISNSANTQANVADAKSRIV 235
V ++T D + R ++ + + A
Sbjct: 179 EATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTT 238

Query: 236 DVDFAKETATMTKNQVLQQTGSAMLAQANQL 266
D + K + A A +
Sbjct: 239 DDAENNTAVDLFKTTKSTAGTAEAKAIAGAI 269



Score = 80.5 bits (198), Expect = 3e-19
Identities = 53/181 (29%), Positives = 85/181 (46%)

Query: 94 DLTIQAENGANSTSDIDAIKSEIDALASEISAIANSTAFGNTKLLDGTFSAGKTFQVGHQ 153
L + + + ++++S + + A + + G +
Sbjct: 326 TLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESKITVNGAEYTANAAGDK 385

Query: 154 NKEDITVSVSKVTASALKVQASVVKVSTAAQRSASLTNIDAAIKAIDTQRSNLGAIQNRL 213
TAS + + + + L +ID+A+ +D RS+LGAIQNR
Sbjct: 386 VTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPLASIDSALSKVDAVRSSLGAIQNRF 445

Query: 214 AYNISNSANTQANVADAKSRIVDVDFAKETATMTKNQVLQQTGSAMLAQANQLPQVALSL 273
I+N NT N+ A+SRI D D+A E + M+K Q+LQQ G+++LAQANQ+PQ LSL
Sbjct: 446 DSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQILQQAGTSVLAQANQVPQNVLSL 505

Query: 274 L 274
L
Sbjct: 506 L 506


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Sbal_2937  FLAGELLIN  57     5e-11    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 57.4 bits (138), Expect = 5e-11
Identities = 58/360 (16%), Positives = 113/360 (31%), Gaps = 12/360 (3%)

Query: 20 QTATSKILDQLSSGKKVNTAGDDPVASQGIDNLNQKNALVDQFMKNIDYATNRLAVAESK 79
Q++ S +++LSSG ++N+A DD + + Q +N + + E
Sbjct: 21 QSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQASRNANDGISIAQTTEGA 80

Query: 80 LGSAEDLTGLMRDQVMRAINGTLSGTERQMIADEMKGSLEELLSIANSKDESGNYMFSGF 139
L + +R+ ++A NGT S ++ + I DE++ LEE+ ++N +G + S
Sbjct: 81 LNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDRVSNQTQFNGVKVLSQ- 139

Query: 140 STDKEPFAFDNSTPPKIVYSGDSGVRNSLVQTGVAMGTNI--PGDSAFMKAPNGLGDYSV 197
+ N + V++ + G GD D
Sbjct: 140 DNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEATVGDLKSSFKNVTGYDTYA 199

Query: 198 NYLASQQGEFSVKTAKIADAATYVADTYTFNFTDNGAGGTNLQVLDSANNPVANVANFDA 257
+ + + A V D N + V F
Sbjct: 200 VGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTD-------DAENNTAVDLFKT 252

Query: 258 TNPVSFNGIDVKIDGKPSAGDSFTMEPQAEVSIFDTISSAIALIEDPNSANTSQGRAQLA 317
T + I G G V+ TI + + + T G
Sbjct: 253 TKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTF--TIDTKTGNDGNGKVSTTINGEKVTL 310

Query: 318 QILNNIDSGVNQISSARSVAGNNLKVVESYTETHTEEKVVNTSALSLLEDLDYASAITEF 377
+ + N ++ + N V + T ++ ++ LS LE + ++
Sbjct: 311 TVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESKI 370


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
Sbal_2938  FLGHOOKAP1  212    5e-63    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 212 bits (540), Expect = 5e-63
Identities = 125/455 (27%), Positives = 193/455 (42%), Gaps = 19/455 (4%)

Query: 4 DLLNIARTGVLASQSQLGVTSNNIANANTAGYHRQVATQSTLESQRLGNSFYGTGTYVND 63
L+N A +G+ A+Q+ L SNNI++ N AGY RQ + S + G G YV+
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSG 61

Query: 64 VKRIYNDYAARELRIGQTTLSAAEASYGKLSELDQLFSQIGKVVPQSLNDLFAGMNSVAD 123
V+R Y+ + +LR QT S A Y ++S++D + S + + D F + ++
Sbjct: 62 VQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLVS 121

Query: 124 LPADLGIRSSTLTNAQQVASSLNQMQSYLNGQLDQTNDQITGMTKRINEIGTELAKLNLE 183
D R + + ++ + + YL Q Q N I +IN ++A LN +
Sbjct: 122 NAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLNDQ 181

Query: 184 LMKSPNQDA-----QLLDKQDALVQELSQYAQVNVIPQENGAKSIMLGGSVMLVSGEIAM 238
+ + A LLD++D LV EL+Q V V Q+ G +I + LV G A
Sbjct: 182 ISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTAR 241

Query: 239 SMGTQAGNPFPKELQLNSSIGSQSVTVDPSKL--GGQLGAMFDYRDQTLIPAGHELDQLA 296
+ + P + G+ P KL G LG + +R Q L + L QLA
Sbjct: 242 QLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQLA 301

Query: 297 LGVADNFNKMQAQGIDLNGQVGANIFKDINDPMMSLGRVAGFSGNTGNATLGVTVDDTSL 356
L A+ FN G D NG G + F + V + N G+ +G TV D S
Sbjct: 302 LAFAEAFNTQHKAGFDANGDAGEDFFA------IGKPAVLQNTKNKGDVAIGATVTDASA 355

Query: 357 LTGGAYELSF--TAPATYELRDTETGTITPLTLTGSTLSGGSGFSIDIKAGAMASGDRFA 414
+ Y++SF L T T+TP G G A D F
Sbjct: 356 VLATDYKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFT----GTPAVNDSFT 411

Query: 415 IRPTAGASNGIEVVMKDPKGIAAASPKITADAANS 449
++P + A ++V++ D IA AS + D+ N
Sbjct: 412 LKPVSDAIVNMDVLITDEAKIAMASEEDAGDSDNR 446



Score = 87.3 bits (216), Expect = 4e-20
Identities = 37/104 (35%), Positives = 56/104 (53%)

Query: 535 AEGDNSNAVAMAKLSESKVMNNGKSTLADVFENTKLEIGSKTKAAEVRTGSAEAVYQQAY 594
+ DN N A+ L + G + D + + +IG+KT + + + V Q
Sbjct: 441 GDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGNKTATLKTSSATQGNVVTQLS 500

Query: 595 ARVESESGVNLDEEAANLMRFQQAYQASARIMTTAQQIFDTLLS 638
+ +S SGVNLDEE NL RFQQ Y A+A+++ TA IFD L++
Sbjct: 501 NQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALIN 544


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_2939  FLGFLGJ  153    7e-46    Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 153 bits (388), Expect = 7e-46
Identities = 66/151 (43%), Positives = 94/151 (62%), Gaps = 1/151 (0%)

Query: 219 GSREEFLATLYPHAEKAAKALGTQPEVLLAQSALETGWGQKIVRGNNGAPSHNLFNIKAD 278
G + FLA L A+ A++ G ++LAQ+ALE+GWGQ+ +R NG PS+NLF +KA
Sbjct: 147 GDSKAFLAQLSLPAQLASQQSGVPHHLILAQAALESGWGQRQIRRENGEPSYNLFGVKAS 206

Query: 279 RRWQGDKANVSTLEFEHGVAVQQKADFRVYSDFEHSFNDFVSFIAEGDRYQDAKKVAASP 338
W+G ++T E+E+G A + KA FRVYS + + +D+V + RY A AAS
Sbjct: 207 GNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYLEALSDYVGLLTRNPRYA-AVTTAASA 265

Query: 339 TQFIRALQDAGYATDPRYAEKVIKVMQSISE 369
Q +ALQDAGYATDP YA K+ ++Q +
Sbjct: 266 EQGAQALQDAGYATDPHYARKLTNMIQQMKS 296



Score = 88.2 bits (218), Expect = 9e-22
Identities = 39/91 (42%), Positives = 61/91 (67%), Gaps = 3/91 (3%)

Query: 12 DLGGLDSLRAQAQKDEKGALKKVAQQFEGIFVQMLMKSMRDANAVFQSDSPLNSQYTKFY 71
D L+ L+A+A +D ++ VA+Q EG+FVQM++KSMRDA D +S++T+ Y
Sbjct: 14 DAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALP---KDGLFSSEHTRLY 70

Query: 72 EQMRDQQLSVDLSDKGVLGLADMMVQQLSPE 102
M DQQ++ ++ LGLA+MMV+Q++PE
Sbjct: 71 TSMYDQQIAQQMTAGKGLGLAEMMVKQMTPE 101


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_2940  FLGPRINGFLGI  369    e-129    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 369 bits (949), Expect = e-129
Identities = 158/367 (43%), Positives = 222/367 (60%), Gaps = 14/367 (3%)

Query: 5 LVLAVAVLVFSLPSQAE--RIKDIANVQGVRSNQLIGYGLVVGLPGTGEKTS---YTEQT 59
LV + + + P+QA+ RIKDIA++Q R NQLIGYGLVVGL GTG+ +TEQ+
Sbjct: 11 LVFSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQS 70

Query: 60 FMTMLKNFGINLPDNVKPKIKNVAVVAVHADMPAFIKPGQDLDVTVSSLGEAKSLRGGTL 119
ML+N GI + KN+A V V A++P F PG +DVTVSSLG+A SLRGG L
Sbjct: 71 MRAMLQNLGITTQGG-QSNAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNL 129

Query: 120 LQTFLKGVDGNVYAIAQGSLVVSGFSADGLDGSKVIQNTPTVGRIPNGAIVERSVATPFS 179
+ T L G DG +YA+AQG+L+V+GFSA G D + + Q T R+PNGAI+ER + + F
Sbjct: 130 IMTSLSGADGQIYAVAQGALIVNGFSAQG-DAATLTQGVTTSARVPNGAIIERELPSKFK 188

Query: 180 TGDYLTFNLRRSDFSTAQRMADAINEL----LGPDMARPLDATSVQVSAPRDVSQRVSFL 235
L LR DFSTA R+AD +N G +A P D+ + V PR V+ +
Sbjct: 189 DSVNLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPR-VADLTRLM 247

Query: 236 ATLENLDVIPAEESAKVIVNSRTGTIVVGQNVRLLPAAITHGGMTVTIAEATQVSQPNAL 295
A +ENL + + AKV++N RTGTIV+G +VR+ A+++G +TV + E+ QV QP
Sbjct: 248 AEIENL-TVETDTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQPAPF 306

Query: 296 ANGQTTVTSNSTITATESDRRMFMFNPGTTLDELVRAVNLVGAAPSDVLAILEALKVAGA 355
+ GQT V + I A + ++ + G L LV +N +G ++AIL+ +K AGA
Sbjct: 307 SRGQTAVQPQTDIMAMQEGSKVAIVE-GPDLRTLVAGLNSIGLKADGIIAILQGIKSAGA 365

Query: 356 LHGELII 362
L EL++
Sbjct: 366 LQAELVL 372


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_2941  FLGLRINGFLGH  144    3e-45    Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 144 bits (365), Expect = 3e-45
Identities = 74/227 (32%), Positives = 113/227 (49%), Gaps = 18/227 (7%)

Query: 4 YLVLAVALL-LAACSSTQKKPLADDPFYAPVYPEAPPTKIAATGSIYQDSQ-----ASSL 57
Y + ++ +L L C+ PL A P P A GSI+Q +Q L
Sbjct: 9 YAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTP---VANGSIFQSAQPINYGYQPL 65

Query: 58 YSDIRAHKVGDIITIVLKESTQAKKSAGNQIKKGSDMSLDPIFAGGSNISV-----GGVP 112
+ D R +GD +TIVL+E+ A KS+ + + F + G
Sbjct: 66 FEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTN----FGFDTVPRYLQGLFGNAR 121

Query: 113 LDLRYKDSMNTKRESDADQSNSLDGSISANVMQVLNNGSLVIRGEKWISINNGDEFIRVT 172
D+ + A+ SN+ G+++ V QVL NG+L + GEK I+IN G EFIR +
Sbjct: 122 ADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRFS 181

Query: 173 GLVRSQDIKPDNTIDSTRMANARIQYSGTGTFADAQKVGWLSQFFMS 219
G+V + I NT+ ST++A+ARI+Y G G +AQ +GWL +FF++
Sbjct: 182 GVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLN 228


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
Sbal_2942  FLGHOOKAP1  42     1e-06    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 42.3 bits (99), Expect = 1e-06
Identities = 18/119 (15%), Positives = 39/119 (32%), Gaps = 4/119 (3%)

Query: 145 DNATSITVSAEGEVSVKTPGTADNQVVGQLTMTDFINPSGLDPMGQNLYTETG---ASGT 201
+ I +++E + + Q + +L ++ G A+
Sbjct: 427 TDEAKIAMASEEDAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGNKTATLK 486

Query: 202 PIQGTASLDGMGAIRQGALETSNVNVTEELVNLIESQRIYEMNSKVISAVDQMLSYVTQ 260
T + + S VN+ EE NL Q+ Y N++V+ + + +
Sbjct: 487 TSSATQGNV-VTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALIN 544



Score = 35.7 bits (82), Expect = 2e-04
Identities = 9/36 (25%), Positives = 20/36 (55%)

Query: 5 LWISKTGLDAQQTDIAVISNNVANASTVGYKKSRAV 40
+ + +GL+A Q + SNN+++ + GY + +
Sbjct: 4 INNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTI 39


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
Sbal_2944  FLGHOOKAP1  40     2e-05    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 39.9 bits (93), Expect = 2e-05
Identities = 15/35 (42%), Positives = 21/35 (60%)

Query: 2 SFNIALSGIAAAQKDLNTTANNIANANTIGFKESR 36
N A+SG+ AAQ LNT +NNI++ N G+
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQT 37



Score = 37.6 bits (87), Expect = 8e-05
Identities = 12/49 (24%), Positives = 25/49 (51%)

Query: 412 SISSSALEQSNIDLTTELVDLISAQRNFQANSRTLEVNNTLQQTVLQIR 460
+S+ S ++L E +L Q+ + AN++ L+ N + ++ IR
Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
Sbal_2946  FLGHOOKAP1  33     3e-04    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 33.0 bits (75), Expect = 3e-04
Identities = 9/38 (23%), Positives = 18/38 (47%)

Query: 99 NVNVMEEMADMISASRSYQMNVQVAEAAKSMLQQTLGM 136
VN+ EE ++ + Y N QV + A ++ + +
Sbjct: 508 GVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545



Score = 29.5 bits (66), Expect = 0.003
Identities = 16/67 (23%), Positives = 29/67 (43%), Gaps = 6/67 (8%)

Query: 5 SIFDVAGSGMSAQSVRLNTTASNIANADSVSSSVDKTYRARHPIFEAEMAKAQSQQQASQ 64
S+ + A SG++A LNT ++NI++ + Y + I + +
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYN------VAGYTRQTTIMAQANSTLGAGGWVGN 55

Query: 65 GVAVKGI 71
GV V G+
Sbjct: 56 GVYVSGV 62


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Sbal_2949  HTHFIS  61     1e-12    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 61.4 bits (149), Expect = 1e-12
Identities = 23/128 (17%), Positives = 52/128 (40%), Gaps = 12/128 (9%)

Query: 180 HIMVIDDSAVARKQIIRSLESLNLQIDTAKDGREALDKLKEIAKEMDNVADEIPLIISDI 239
I+V DD A R + ++L + + + A + L+++D+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIA---------AGDGDLVVTDV 55

Query: 240 EMPEMDGYTLTAEIRDDPKLKHIKVVLHTSLSGVFNQAMVQKVGANDFIAK-FNPDELAA 298
MP+ + + L I+ + V++ ++ + + GA D++ K F+ EL
Sbjct: 56 VMPDENAFDLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIG 113

Query: 299 AVNKHLSL 306
+ + L+
Sbjct: 114 IIGRALAE 121


81  Sbal_2967  Sbal_2974  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Sbal_2967  0       22       -7.155160  polysaccharide biosynthesis protein CapD
Sbal_2968  0       17       -5.152966  hypothetical protein
Sbal_2969  -1      15       2.096198   hypothetical protein
Sbal_2970  -1      17       2.091299   hypothetical protein
Sbal_2971  -2      15       1.701434   hypothetical protein
Sbal_2972  -2      16       1.771664   TetR family transcriptional regulator
Sbal_2973  -1      13       1.795747   RND family efflux transporter MFP subunit
Sbal_2974  -1      9        0.958355   acriflavin resistance protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_2967  NUCEPIMERASE  81     2e-19    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 80.6 bits (199), Expect = 2e-19
Identities = 42/245 (17%), Positives = 85/245 (34%), Gaps = 54/245 (22%)

Query: 6 TILITGGTGSFGQKYTKTILAKY-----------------KPKRLIILSRDELKQYEMQQ 48
L+TG G G +K +L K RL +L++ + ++
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHK--- 58

Query: 49 IYNAPCMRYFIGDVRDGDRLEQAFKDVDF--VIHAAALKQVPAAEYNPMECIKTNIYGAE 106
D+ D + + F F V + V + NP +N+ G
Sbjct: 59 -----------IDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFL 107

Query: 107 NVIRAAISNNVSKVIALST---------------DKAANPINLYGATKLASDKLFVAANN 151
N++ N + ++ S+ D +P++LY ATK A++ + ++
Sbjct: 108 NILEGCRHNKIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSH 167

Query: 152 IVGSGKTRFAAVRYGNVVGSRGS---VVPFFKQLIANGADALPITHQDMTRFWISLQDGV 208
+ G +R+ V G G + F + + G + M R + + D
Sbjct: 168 LYG---LPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIA 224

Query: 209 DFVLK 213
+ +++
Sbjct: 225 EAIIR 229


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_2972  HTHTETR  69     1e-16    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 68.9 bits (168), Expect = 1e-16
Identities = 38/181 (20%), Positives = 66/181 (36%), Gaps = 8/181 (4%)

Query: 21 RSEQKRQQVLVAAIDLFCRQGFPHTSMDEVAKLAGVSKQTVYSHYGSKDELFVAAIE--S 78
+++ RQ +L A+ LF +QG TS+ E+AK AGV++ +Y H+ K +LF E
Sbjct: 8 EAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSE 67

Query: 79 KCVGHDLNDDLLSDPTKPESALTQFALQFGEMIVSPEAITVFKACVAQSESHPEVSQLFF 138
+G + P P S L + + E V+ E + + +
Sbjct: 68 SNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQ 127

Query: 139 DAGT---KHIVGLLADYLVTVEALGQYRFGNAHHSAVRLCLMLFGELKLKLELGLAADEL 195
A + L A R +++ G + +E L A +
Sbjct: 128 QAQRNLCLESYDRIEQTLKHCIEAKMLP---ADLMTRRAAIIMRGYISGLMENWLFAPQS 184

Query: 196 V 196

Sbjct: 185 F 185


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Sbal_2973  RTXTOXIND  43     2e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 42.5 bits (100), Expect = 2e-06
Identities = 27/164 (16%), Positives = 58/164 (35%), Gaps = 18/164 (10%)

Query: 101 RLLEAERQ--EIQASLAQTQADVDLATSTL---KRNQELKKSGYVSEQLLDENRSQLNSL 155
+LE E + E L ++ ++ S + K +L + +E L ++ N +
Sbjct: 253 AVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDN-I 311

Query: 156 AAAKNRLLASQHANQLKLDKSILVAPFDGTISR-RLHNLGEVVAAGSPIFTLVGNINP-E 213
L N+ + S++ AP + + ++H G VV + +V + E
Sbjct: 312 GLLTLEL----AKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLE 367

Query: 214 AYIGVPVALAEQFHAEQQVQVSVQ--DQT----FTAEIAGISAE 251
V + Q + V+ T ++ I+ +
Sbjct: 368 VTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLD 411



Score = 35.2 bits (81), Expect = 4e-04
Identities = 26/109 (23%), Positives = 46/109 (42%), Gaps = 9/109 (8%)

Query: 75 SGKLNELQADSGIKVKQGQILAILDTRLLEAERQEIQASLAQTQADVDLATSTLKRNQEL 134
+ + E+ G V++G +L L EA+ + Q+SL Q + + L R+ EL
Sbjct: 104 NSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQ-TRYQILSRSIEL 162

Query: 135 KKSGYVS-------EQLLDENRSQLNSLAAAKNRLLASQHAN-QLKLDK 175
K + + + +E +L SL + +Q +L LDK
Sbjct: 163 NKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDK 211


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_2974  ACRIFLAVINRP  378    e-116    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 378 bits (971), Expect = e-116
Identities = 207/1047 (19%), Positives = 433/1047 (41%), Gaps = 54/1047 (5%)

Query: 1 MIKAFVENGRLVSLVIALLLVAGFGAISSLPRTEDPHITNRFASVITPYPGASAERVEAL 60
M F+ ++ +L++AG AI LP + P I SV YPGA A+ V+
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTEVLENQLRRLEEIKLIQSTS-RPGISVIQLELKDTVMETDPVWSR--ARDLLADAKIN 117
VT+V+E + ++ + + STS G I L + TDP ++ ++ L A
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQS---GTDPDIAQVQVQNKLQLATPL 117

Query: 118 LPEGIPSPTL-DDQIGYAYTAILSLVWNSNTPVRVDILNRYAKELQSRLRLLSGTDFVKL 176
LP+ + + ++ +Y + V ++ + DI + A ++ L L+G V+L
Sbjct: 118 LPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQL 177

Query: 177 YGAPTEEMLVQLDGNKMSQLQLTPASIAHILSDADSKIAAGEINNS------EFRALVEV 230
+GA M + LD + +++ +LTP + + L + +IAAG++ + + A +
Sbjct: 178 FGAQ-YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIA 236

Query: 231 SGELDSLTRIRQVPLKIDNQGQIIRLGDIATVTRQAKTPADSVALVDGEQGVFVAARMLN 290
+ +V L++++ G ++RL D+A V + + +A ++G+ + ++
Sbjct: 237 QTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENY-NVIARINGKPAAGLGIKLAT 295

Query: 291 NSRVDLWQAQVNSVVDELNHEVPANIQIQWLFEQNSYTSDRLGGLVVNLLQGFVIILLVL 350
+ + + + EL P +++ + ++ + + +V L + +++ LV+
Sbjct: 296 GANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVM 355

Query: 351 LLTLG-LRNAIIVAISLPLTALFTLACMKYIGLPIHQMSVTGLVVALGIMVDNAIVIVDA 409
L L +R +I I++P+ L T A + G I+ +++ G+V+A+G++VD+AIV+V+
Sbjct: 356 YLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVEN 415

Query: 410 ISQRRQ-QGMSRLAAVSETIHHLWLPLAGSTITTILAFAPIVLMPGAAGEFVGGIAMSVM 468
+ + + A +++ + L G + F P+ G+ G +++++
Sbjct: 416 VERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIV 475

Query: 469 FALLGSYLISHTIIAGLAGRF-------SHEGKHD--AWYQHGINLPLVSQYFQASLRIA 519
A+ S L++ + L HE K W+ + + ++ S+
Sbjct: 476 SAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSV--NHYTNSVGKI 533

Query: 520 LKRPILSALLIGITPVLGFYASGKMTEQFFPPSDRDMFQIEVYLAPHVSLENTLNQVQLM 579
L L+ + ++ F P D+ +F + L + E T + +
Sbjct: 534 LGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQV 593

Query: 580 DKKL--HAVNGITQVDWVVGGNTPSFYYNLTQRQQGATNYAQAMVK-----VTDFERANE 632
+ + V V G + Q A +K D A
Sbjct: 594 TDYYLKNEKANVESVFTVNGFSFSG--------QAQNAGMAFVSLKPWEERNGDENSAEA 645

Query: 633 LIPELQQQFDS---AFPEAQVLVRKLEQGPPFNAPVELM-IFGPNLDTLRSLGDEVRNIL 688
+I + + F + +E G EL+ G D L +++ +
Sbjct: 646 VIHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMA 705

Query: 689 AQTP-DVLHTRATLSAGAPKLWLQVNEDASLMSGLSLTDIAKQIQMSTTGVIGGSVLEQT 747
AQ P ++ R + L+V+++ + G+SL+DI + I + G +++
Sbjct: 706 AQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRG 765

Query: 748 ESLSVRVRLGDGSREQVTRLSEIQLVTPSGESVALSALAYNEVQVSRSAIPRRNGQRVNT 807
+ V+ R + ++ + + +GE V SA + + R NG
Sbjct: 766 RVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSME 825

Query: 808 IEAYIVSGVLPAQVLNDIKDKVAQLALPSGYRIEIGGESAKRNEAVVNLLSNVMLVVTLL 867
I+ G + +++ ++ LP+G + G S + + + V + ++
Sbjct: 826 IQGEAAPGTSSGDAMALMENLASK--LPAGIGYDWTGMSYQERLSGNQAPALVAISFVVV 883

Query: 868 LATVVLSFNSFRLTAIILLSAIQSAGLGLLAVFVFGYPFGFPVIIGLLGLMGLAINAAIV 927
+ + S+ + ++L LLA +F ++GLL +GL+ AI+
Sbjct: 884 FLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAIL 943

Query: 928 ILAELEDMPSAR-LGDMETIVSTVSSCGRHISSTTITTVGGFIPLII---AGGGFWPPFA 983
I+ +D+ G +E + V R I T++ + G +PL I AG G
Sbjct: 944 IVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVG 1003

Query: 984 IAIAGGTLLTTLLSLVWVPTMYLLLMK 1010
I + GG + TLL++ +VP ++++ +
Sbjct: 1004 IGVMGGMVSATLLAIFFVPVFFVVIRR 1030


82  Sbal_3082  Sbal_3088  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias   Product
Sbal_3082  -1      11       1.930507  FKBP-type peptidylprolyl isomerase
Sbal_3083  -2      11       2.215776  glycoside hydrolase family protein
Sbal_3084  -2      11       2.402418  ROK family protein
Sbal_3085  -2      12       3.384347  peptidase M24
Sbal_3086  -2      14       3.119458  metal dependent phosphohydrolase
Sbal_3087  -1      15       3.348494  methyl-accepting chemotaxis sensory transducer
Sbal_3088  1       13       2.876727  DEAD/DEAH box helicase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_3082  INFPOTNTIATR  136    7e-43    Macrophage infectivity potentiator signature.
		>INFPOTNTIATR#Macrophage infectivity potentiator signature.

Length = 233

Score = 136 bits (343), Expect = 7e-43
Identities = 64/132 (48%), Positives = 86/132 (65%), Gaps = 2/132 (1%)

Query: 25 KAAQENIRLGNEFLTQNKTKEGVITTASGLQYQVLTKGDGTVHPKASDTVTVHYHGTLID 84
K A+EN G+ FL+ NK+K G++ SGLQY+++ G G P SDTVTV Y GTLID
Sbjct: 99 KKAEENKAKGDAFLSANKSKPGIVVLPSGLQYKIIDAGTGA-KPGKSDTVTVEYTGTLID 157

Query: 85 GTVFDSSVDRGEPIAFPLNRVIKGWTEGVQLMVVGDKVRFFIPSELAYGNSST-GKIGGG 143
GTVFDS+ G+P F +++VI GWTE +QLM G F+P++LAYG S G IG
Sbjct: 158 GTVFDSTEKAGKPATFQVSQVIPGWTEALQLMPAGSTWEVFVPADLAYGPRSVGGPIGPN 217

Query: 144 SVLIFDVELLNI 155
LIF + L+++
Sbjct: 218 ETLIFKIHLISV 229


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
Sbal_3083  MICOLLPTASE  48     1e-07    Microbial collagenase metalloprotease (M9) signature.
		>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature.

Length = 1104

Score = 48.2 bits (114), Expect = 1e-07
Identities = 35/170 (20%), Positives = 64/170 (37%), Gaps = 15/170 (8%)

Query: 542 WEIDADNGDILNAMHEGLGHGEGTTPPVNKAPIANAGADVNVTGPADVVLNGSGSRDPEN 601
++D + + + + G+ T VNK P A +D +V ++ +G+ S+D +
Sbjct: 744 HKVDGNGNYVYDVVFHGMNTDTNTDVHVNKEPKAVIKSDSSVIVEEEINFDGTESKDEDG 803

Query: 602 EALTYLWTQVSGPTIAIANADMANAAIQLAATQTDVAYSFSLKVTDPEGLSATDSVTVTN 661
E Y W G ++ A A + T Y L VTD G T+S +
Sbjct: 804 EIKAYEWDFGDGEK-----SNEAKATHKYNKTGE---YEVKLTVTDNNGGINTESKKI-- 853

Query: 662 KVDTPNQAPVVSVAA-----TATVEAGKTVSIVASASDADGDALTYAWTV 706
KV V++ + + K+ +V + + Y + V
Sbjct: 854 KVVEDKPVEVINESEPNNDFEKANQIAKSNMLVKGTLSEEDYSDKYYFDV 903


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Sbal_3087  FLAGELLIN  30     0.032    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 29.6 bits (66), Expect = 0.032
Identities = 14/87 (16%), Positives = 35/87 (40%), Gaps = 4/87 (4%)

Query: 282 QLSGAMEEMSSTITEVAQNTHLTSTSINTAYDLCLKSSANMKANTQKVEQLAKSVADAAN 341
++ +T+ ++N + + T + + N Q+V +L+ + N
Sbjct: 48 AIANRFTSNIKGLTQASRNANDGISIAQTTE----GALNEINNNLQRVRELSVQATNGTN 103

Query: 342 NAHQLNKEAEQVANAMGEIDSIAEQTN 368
+ L +++ + EID ++ QT
Sbjct: 104 SDSDLKSIQDEIQQRLEEIDRVSNQTQ 130


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits  Score  E-value  Comments
Sbal_3088  SECA  33     0.003    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 32.9 bits (75), Expect = 0.003
Identities = 37/175 (21%), Positives = 65/175 (37%), Gaps = 46/175 (26%)

Query: 220 SQVVYPVEQRRKRELLSELIGK-KNWQQVLVFTATRDAADTLVKELNLDGIPSEVVHGDK 278
+VY E + + ++ ++ + Q VLV T + + ++ + EL GI V++
Sbjct: 424 PDLVYMTEAEKIQAIIEDIKERTAKGQPVLVGTISIEKSELVSNELTKAGIKHNVLNA-- 481

Query: 279 GQGSRRRALREFVAGDVR---VLVATEVAARGLDI---------------PSLEYVVNYD 320
+ A VA V +AT +A RG DI P+ E +
Sbjct: 482 -KFHANEA--AIVAQAGYPAAVTIATNMAGRGTDIVLGGSWQAEVAALENPTAEQIEKIK 538

Query: 321 LPFLAED---------YV-----H---RI-----GRTGRAGKTGVAISFVSREEE 353
+ ++ H RI GR+GR G G + ++S E+
Sbjct: 539 ADWQVRHDAVLEAGGLHIIGTERHESRRIDNQLRGRSGRQGDAGSSRFYLSMEDA 593


83  Sbal_3148  Sbal_3155  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Sbal_3148  -2      16       0.515941   hybrid sensory histidine kinase BarA
Sbal_3149  -1      15       -0.046139  hypothetical protein
Sbal_3150  -1      14       1.083554   hypothetical protein
Sbal_3151  -1      14       1.216850   LysR family transcriptional regulator
Sbal_3152  -1      13       1.074949   auxin efflux carrier
Sbal_3153  -1      13       1.226027   recombination and repair protein
Sbal_3154  0       14       1.920823   phosphatidylglycerophosphatase A
Sbal_3155  0       12       2.587965   thiamine-monophosphate kinase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Sbal_3148  HTHFIS  64     1e-12    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 64.1 bits (156), Expect = 1e-12
Identities = 28/124 (22%), Positives = 48/124 (38%), Gaps = 2/124 (1%)

Query: 677 QSLTVLAVDDNFANLKLIDTLLSELVTTVVAVNSGDEAVKQAKTRTFDLIFMDIQMPGTD 736
T+L DD+ A +++ LS V ++ + DL+ D+ MP +
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 737 GISATKQIRQGSMNRNTPIIAVTAHAIAEERELILGSGMDGYLPKPIDEAALKDVIHRWI 796
+I+ + P++ ++A G YLPKP D L +I R +
Sbjct: 62 AFDLLPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119

Query: 797 TRPK 800
PK
Sbjct: 120 AEPK 123


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
Sbal_3149  BACINVASINB  28     0.023    Salmonella/Shigella invasin protein B signature.
		>BACINVASINB#Salmonella/Shigella invasin protein B signature.

Length = 593

Score = 28.2 bits (62), Expect = 0.023
Identities = 17/41 (41%), Positives = 25/41 (60%)

Query: 142 EALDDFVFAHEVMEEEKELQNSLLEIIEENPKITAELVKGL 182
EAL DF+ A M++ ++ +EI EN K+TAEL K +
Sbjct: 533 EALADFMLARFAMDQIQQWLKQSVEIFGENQKVTAELQKAM 573


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
Sbal_3153  GPOSANCHOR  37     2e-04    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 37.0 bits (85), Expect = 2e-04
Identities = 55/328 (16%), Positives = 105/328 (32%), Gaps = 28/328 (8%)

Query: 59 ANKTEVSAR--FSLDDIPLAKRWLEDNDLELDDECILRRTIGSDGRSRAYINGNPVPLTQ 116
T + L K + E+++ + + ++A +
Sbjct: 34 VVNTNEVSAVATRSQTDTLEKVQERADKFEIENNTLKLKNSDLSFNNKALKD-------H 86

Query: 117 LKLLGQLLIGIHGQHAHHAMLKSEHQLTLLDSYANHRLLIDTVAASFQRCKQIEADLKQL 176
L + L + + SE + + A L + + A +K L
Sbjct: 87 NDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTL 146

Query: 177 EASQHERIARKQLVQYQVEELDEFDLKVDEFDEIEQEHKRLANGTELIDTCQASLDILTE 236
EA + ARK ++ +E F + K L ++ QA L E
Sbjct: 147 EAEKAALAARKADLEKALEGAMNFST------ADSAKIKTLEAEKAALEARQAEL----E 196

Query: 237 GEENNIESLLNRVVSLAEDLQSYDPALSNINTMLNDALIQVQESAGELQHYLSKLELDPT 296
+ + + L++ AL+ L AL + + LE +
Sbjct: 197 KALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAE-- 254

Query: 297 HFAYLEERLSKAMQLARKHHVSPNKLAEHHLALKAELSTLDSDESKLEEIQLQVDASRAA 356
A LE R ++ + + L+AE + L+++++ LE ++A+R
Sbjct: 255 -KAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQ- 312

Query: 357 YLSNAQKLSQSRARYAK---ELDKLVTQ 381
S + L SR + E KL Q
Sbjct: 313 --SLRRDLDASREAKKQLEAEHQKLEEQ 338



Score = 36.2 bits (83), Expect = 4e-04
Identities = 39/217 (17%), Positives = 71/217 (32%), Gaps = 14/217 (6%)

Query: 167 KQIEADLKQLEASQHERIARKQLVQYQVEELD-EFDLKVDEFDEIEQEHKRLANGTELID 225
+ A A K ++ + EL+ + ++ + K L +
Sbjct: 165 EGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALA 224

Query: 226 TCQASLDILTEGEENNIESLLNRVVSLAEDLQSYDPALSNINTMLNDALIQVQESAGELQ 285
+A L+ EG N + ++ +L + + L L AL +
Sbjct: 225 ARKADLEKALEGAMNFSTADSAKIKTLEAEKAA----LEARQAELEKALEGAMNFSTADS 280

Query: 286 HYLSKLELDPTHFAYLEERLSKAMQLARKHHVSPNKLAEHHLALKAELSTLDSDESKLEE 345
+ LE A LE + ++ + + L A + L+++ KLEE
Sbjct: 281 AKIKTLE---AEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEE 337

Query: 346 IQLQVDASRAAYLSNAQKLSQSRARYAK---ELDKLV 379
Q S A+ S + L SR + E KL
Sbjct: 338 ---QNKISEASRQSLRRDLDASREAKKQLEAEHQKLE 371


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_3155  TYPE3IMQPROT  27     0.028    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 27.0 bits (60), Expect = 0.028
Identities = 9/39 (23%), Positives = 16/39 (41%)

Query: 76 LSDLAAMGAEPAWMTLALTLPEVDETWLSGFSEGLFEAA 114
+ DL G + ++ L L+ + G GLF+
Sbjct: 1 MDDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTV 39


84  Sbal_3239  Sbal_3250  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Sbal_3239  4       26       -0.300724  translation initiation factor IF-2
Sbal_3240  1       16       -0.321363  transcription elongation factor NusA
Sbal_3241  0       17       -0.021803  hypothetical protein
Sbal_3242  1       16       0.142210   **preprotein translocase subunit SecG
Sbal_3243  0       15       0.092596   triosephosphate isomerase
Sbal_3244  0       17       -0.006376  phosphoglucosamine mutase
Sbal_3245  1       16       0.213958   dihydropteroate synthase
Sbal_3246  1       16       0.131675   ATP-dependent metalloprotease FtsH
Sbal_3247  0       15       -0.212995  23S rRNA methyltransferase J
Sbal_3248  0       15       -0.194113  hypothetical protein
Sbal_3249  0       15       0.399597   preprotein translocase subunit SecF
Sbal_3250  -1      12       0.399172   preprotein translocase subunit SecD
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Sbal_3239  TCRTETOQM  72     5e-15    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 72.2 bits (177), Expect = 5e-15
Identities = 51/202 (25%), Positives = 78/202 (38%), Gaps = 30/202 (14%)

Query: 387 IMGHVDHGKTSLLDYIRRAKVAAGEAG------------------GITQHIGAYHVETEN 428
++ HVD GKT+L + + A E G GIT G + EN
Sbjct: 8 VLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQWEN 67

Query: 429 GMITFLDTPGHAAFTAMRARGAKATDIVVLVVAADDGVMPQTIEAIQHAKAGNVPLIVAV 488
+ +DTPGH F A R D +L+++A DGV QT + +P I +
Sbjct: 68 TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIPTIFFI 127

Query: 489 NKMDKPEADIDRV----KSELSQHGVMS-------EDWGGDNMFAFVSAKTGEGVDELLE 537
NK+D+ D+ V K +LS V+ + + EG D+LLE
Sbjct: 128 NKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEGNDDLLE 187

Query: 538 GILLQAEVLELKAVRDGMAAGV 559
+ + LE + +
Sbjct: 188 K-YMSGKSLEALELEQEESIRF 208


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
Sbal_3242  SECGEXPORT  118    4e-38    Protein-export SecG membrane protein signature.
		>SECGEXPORT#Protein-export SecG membrane protein signature.

Length = 110

Score = 118 bits (297), Expect = 4e-38
Identities = 61/111 (54%), Positives = 82/111 (73%), Gaps = 1/111 (0%)

Query: 1 MYEVLVVIYLLVALGLIGLVLIQQGKGADMGASFGAGASGTLFGSSGSGNFLTRTTAILA 60
MYE L+V++L+VA+GL+GL+++QQGKGADMGASFGAGAS TLFGSSGSGNF+TR TA+LA
Sbjct: 1 MYEALLVVFLIVAIGLVGLIMLQQGKGADMGASFGAGASATLFGSSGSGNFMTRMTALLA 60

Query: 61 IAFFTLSLLIGNLSANHAKNEDSWKNLGSDTEQVTQPVEQGTQKSETKIPD 111
FF +SL++GN+++N W+NL S + Q K + IP+
Sbjct: 61 TLFFIISLVLGNINSNKTNKGSEWENL-SAPAKTEQTQPAAPAKPTSDIPN 110


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits      Score  E-value  Comments
Sbal_3243  adhesinb  31     0.003    Adhesin B signature.
		>adhesinb#Adhesin B signature.

Length = 310

Score = 31.4 bits (71), Expect = 0.003
Identities = 18/95 (18%), Positives = 34/95 (35%), Gaps = 16/95 (16%)

Query: 142 REARRTFEVIAEELDIVIQKNGTMAFDNAIIAY----EPLWAVGTGKSATPEQAQEVHAF 197
+EA+ F I E +++ G F AY +W + T + TP+Q + +
Sbjct: 186 KEAKEKFNNIPGEKKMIVTSEG--CFKYFSKAYNVPSAYIWEINTEEEGTPDQIKTLVEK 243

Query: 198 IRKRLSEVSPFIGENIRILYGGSVTPSNAADLFAQ 232
+RK + L+ S ++
Sbjct: 244 LRKT----------KVPSLFVESSVDDRPMKTVSK 268


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Sbal_3246  HTHFIS  34     0.002    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 34.0 bits (78), Expect = 0.002
Identities = 23/82 (28%), Positives = 32/82 (39%), Gaps = 18/82 (21%)

Query: 198 VLMVGPPGTGKTLLAKAIAGESK---VPFFT-----ISGSDFVEMFVGV------GASRV 243
+++ G GTGK L+A+A+ K PF I G GA
Sbjct: 163 LMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTR 222

Query: 244 RD-MFEQAKKSAPCIIFIDEID 264
FEQA+ +F+DEI
Sbjct: 223 STGRFEQAEGGT---LFLDEIG 241


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_3249  SECFTRNLCASE  248    3e-83    Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 248 bits (634), Expect = 3e-83
Identities = 91/306 (29%), Positives = 161/306 (52%), Gaps = 14/306 (4%)

Query: 2 KNINLTKWRYVSSAISILLMITSLAIIGVKGFNWGLDFTGGVVTEVQLDRKITSSELQPL 61
N + +W++ + +I++MI S+ + V G N+G+DF GG + I +
Sbjct: 12 TNFDFFRWQWATFGAAIVMMIASVILPLVIGLNFGIDFKGGTTIRTESTTAIDVGVYRAA 71

Query: 62 LNAAYQQEVSVISASEPG----------RWVLRYGDIKSADTEQSNVDIQ----QALAPL 107
L +V + +P R ++ + ++ AL +
Sbjct: 72 LEPLELGDVIISEVRDPSFREDQHVAMIRIQMQEDGQGAEGQGAQGQELVNKVETALTAV 131

Query: 108 NSEVQVLNSSVVGPQIGQELAEQGGLALLVAMLCILGYLSYRFEWRLASGALFALVHDVV 167
+ +++ + VGP++ EL +LL A + I+ Y+ RFEW+ A GA+ ALVHDV+
Sbjct: 132 DPALKITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVVALVHDVL 191

Query: 168 FVLAFFALTQMEFNLTVLAAVLAILGYSLNDSIIIADRIRELLIAKPKLAIQEINNQAIV 227
+ FA+ Q++F+LT +AA+L I GYS+ND++++ DR+RE LI + ++++ N ++
Sbjct: 192 LTVGLFAVLQLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRDVMNLSVN 251

Query: 228 ATFSRTMVTSGTTLMTVGALWIMGGGPLEGFSIAMFIGILTGTFSSISVGTSLPELLGLT 287
T SRT++T TTL+ + + I GG + GF AM G+ TGT+SS+ V ++ +GL
Sbjct: 252 ETLSRTVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAKNIVLFIGLD 311

Query: 288 PEHYKE 293
K+
Sbjct: 312 RNKEKK 317


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_3250  SECFTRNLCASE  78     1e-17    Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 78.0 bits (192), Expect = 1e-17
Identities = 30/172 (17%), Positives = 82/172 (47%), Gaps = 4/172 (2%)

Query: 422 VTIVEERTIGPTLGAENIENGFAALGLGMGITLLFMALWYR-RLGWVANIALISNMVILF 480
+ I ++GP + E + +L + + ++ + + + A +AL+ ++++
Sbjct: 135 LKITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVVALVHDVLLTV 194

Query: 481 GLLALIPGAVLTLPGIAGLVLTVGMAVDTNVLIFERIKDKLKEGRSFALA--IDTGFDSA 538
GL A++ L +A L+ G +++ V++F+R+++ L + ++ L ++ +
Sbjct: 195 GLFAVL-QLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRDVMNLSVNET 253

Query: 539 FSTIFDANFTTMITAVVLYSIGNGPIQGFALTLGLGLLTSMFTGIFASRALI 590
S TT++ V + G I+GF + G+ T ++ ++ ++ ++
Sbjct: 254 LSRTVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAKNIV 305


85  Sbal_3401  Sbal_3410  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Sbal_3401  -2      14       0.036439   major facilitator superfamily transporter
Sbal_3402  -2      13       0.492344   hypothetical protein
Sbal_3403  -2      12       0.424348   response regulator receiver modulated
Sbal_3404  -2      12       1.335766   integral membrane sensor signal transduction
Sbal_3405  -2      12       1.079558   adenylylsulfate kinase
Sbal_3406  -2      10       0.404633   TrkA domain-containing protein
Sbal_3407  -2      11       -0.758901  sulfate adenylyltransferase subunit 1
Sbal_3408  0       12       -1.086285  sulfate adenylyltransferase subunit 2
Sbal_3409  -1      12       -1.203550  uroporphyrin-III C-methyltransferase
Sbal_3410  -2      11       -1.166264  phospholipase A(1)
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_3401  TCRTETA  36     2e-04    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 36.3 bits (84), Expect = 2e-04
Identities = 31/143 (21%), Positives = 48/143 (33%), Gaps = 12/143 (8%)

Query: 277 VVNLLFAPAIGRFIGRIGERNALTVEYVGLIIVFISYALVEQAHMAAALY---VIDHLLF 333
++ AP +G R G R L V L + YA++ A LY ++ +
Sbjct: 54 LMQFACAPVLGALSDRFGRRPVLLV---SLAGAAVDYAIMATAPFLWVLYIGRIVAGITG 110

Query: 334 AMAIAMKTYFQKIADSKDIAAT---MSVSFTINHIAAVIIPVILGLLWLTDPALVFYIGA 390
A Y I D + A MS F +A PV+ GL+ P F+ A
Sbjct: 111 ATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAG---PVLGGLMGGFSPHAPFFAAA 167

Query: 391 GFAVCSLILALNVPRHPEPGNET 413
+ + + G
Sbjct: 168 ALNGLNFLTGCFLLPESHKGERR 190



Score = 32.1 bits (73), Expect = 0.004
Identities = 23/151 (15%), Positives = 58/151 (38%), Gaps = 11/151 (7%)

Query: 232 YWLYYLLTFFSGARRQIFMVFAGFMMVEKFGYSVSEITALFLINYVVNLLF-APAIGRFI 290
+++++ ++++F ++F + + I +++ L A G
Sbjct: 216 MAVFFIMQLVGQVPAALWVIFG----EDRFHWDATTIGISLAAFGILHSLAQAMITGPVA 271

Query: 291 GRIGERNALTVEYVGLIIVFISYALVEQAHMAAALYVIDHLLFAMAIAM---KTYFQKIA 347
R+GER AL + + +I A + MA + V LL + I M + +
Sbjct: 272 ARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMV---LLASGGIGMPALQAMLSRQV 328

Query: 348 DSKDIAATMSVSFTINHIAAVIIPVILGLLW 378
D + + + +++ P++ ++
Sbjct: 329 DEERQGQLQGSLAALTSLTSIVGPLLFTAIY 359


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Sbal_3403  HTHFIS  44     2e-06    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 44.1 bits (104), Expect = 2e-06
Identities = 26/153 (16%), Positives = 54/153 (35%), Gaps = 15/153 (9%)

Query: 27 KILTVDDDSNFQRSTAFALSTLKVLDCKIELAQAFSYAEACQVLTKENDFAIALIDVVME 86
IL DDD+ + ALS ++ + A + + D + + DVVM
Sbjct: 5 TILVADDDAAIRTVLNQALS-----RAGYDVRITSNAATLWRWIAAG-DGDLVVTDVVMP 58

Query: 87 TEDAGLRLVRAIREVLGNEKIRIILLTGQPGMAPIFDVMRNYDINDYWTKS---ELSADR 143
E+ L+ I++ + +++++ Q DY K
Sbjct: 59 DEN-AFDLLPRIKKA--RPDLPVLVMSAQNTFMTAIKASEK-GAYDYLPKPFDLTELIGI 114

Query: 144 LQTILTTNLRSYQQISSIANAKRGLQLIAESSG 176
+ L R ++ +++ G+ L+ S+
Sbjct: 115 IGRALAEPKRRPSKLE--DDSQDGMPLVGRSAA 145


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_3404  PF06580  51     1e-08    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 50.6 bits (121), Expect = 1e-08
Identities = 27/127 (21%), Positives = 45/127 (35%), Gaps = 22/127 (17%)

Query: 497 ELEIDVDANIELNSYPGALGQSLENLVTNAITHAFEGRVN-GQIKISAKMIEEQIVEITV 555
+ E ++ I P L ++ LV N I H G+I + + V + V
Sbjct: 241 QFENQINPAIMDVQVPPML---VQTLVENGIKHGIAQLPQGGKILLKGTK-DNGTVTLEV 296

Query: 556 SDNGIGMSEETMKQIFDPFFTTRRGNGGTGLGLHLTYQLVSQLLGGK--ITVSSTLGKGS 613
+ G + T + TG GL + + L G + I +S GK +
Sbjct: 297 ENTGSLALKNTKE--------------STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVN 342

Query: 614 VFSLTIP 620
+ IP
Sbjct: 343 AM-VLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Sbal_3407  TCRTETOQM  67     6e-14    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 67.2 bits (164), Expect = 6e-14
Identities = 46/155 (29%), Positives = 70/155 (45%), Gaps = 17/155 (10%)

Query: 41 VDDGKSTLIGRLLHDSAQIYEDQLASLKSDSAKMGTTGEAIDLALLVDGLQAEREQGITI 100
VD GK+TL LL++S I +L S+ + + D ER++GITI
Sbjct: 12 VDAGKTTLTESLLYNSGAI--TELGSVDKGTTRT-------------DNTLLERQRGITI 56

Query: 101 DVAYRYFSSDKRKFIIADTPGHEQYTRNMATGASTCDLAVILVDARYGVQTQTKRHAFIA 160
F + K I DTPGH + + S D A++L+ A+ GVQ QT+
Sbjct: 57 QTGITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHAL 116

Query: 161 SLLGIRHFVVAVNKMDLLGFD-EQVFNRIRADFTD 194
+GI + +NK+D G D V+ I+ +
Sbjct: 117 RKMGIPT-IFFINKIDQNGIDLSTVYQDIKEKLSA 150


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_3410  PHPHLIPASEA1  217    1e-71    Bacterial phospholipase A1 protein signature.
		>PHPHLIPASEA1#Bacterial phospholipase A1 protein signature.

Length = 289

Score = 217 bits (553), Expect = 1e-71
Identities = 102/303 (33%), Positives = 156/303 (51%), Gaps = 26/303 (8%)

Query: 3 RLYSGIAMAGLLACTSINAEESLVEGRVKDE-----------LATAELPFVITPHKVNYI 51
R G + + ++ A+E+ V+ V D L + PF + P+ NY+
Sbjct: 2 RTLQGWLLPVFMLPMAVYAQEATVK-EVHDAPAVRGSIIANMLQEHDNPFTLYPYDTNYL 60

Query: 52 LPATYSPDPNMAPFAEDALINPYTLDEFEAKFQISFKFPIWYNVFGDNGHLFFAYTNQSY 111
+ S ++ A + + E KFQ+S FP+W + G N L +YT +S+
Sbjct: 61 IYTQTS---DLNKEAIASYDWAENARKDEVKFQLSLAFPLWRGILGPNSVLGASYTQKSW 117

Query: 112 WQVYNKDTSSPFRETNHEPEVFMLFNNDWKIGSVTNSFWGVGAVHQSNGKSGPLSRSWNR 171
WQ+ N + SSPFRETN+EP++F+ F D++ T +G H SNG+S P SRSWNR
Sbjct: 118 WQLSNSEESSPFRETNYEPQLFLGFATDYRFAGWTLRDVEMGYNHDSNGRSDPTSRSWNR 177

Query: 172 LYATMIFDAGPLAFSTKVWWRIPEDEKTDPHQARGDDNPNIDDYIGRAEFIGVYGIDEHR 231
LY ++ + G K W+ + DDNP+I Y+G + Y + +
Sbjct: 178 LYTRLMAENGNWLVEVKPWYVVGNT----------DDNPDITKYMGYYQLKIGYHLGDAV 227

Query: 232 FTLTLKTNLEDIDRGSAELTWSYPIVGNLRLYTQYFNGYGESLIDYNYHNQRIGIGISLN 291
+ + N G AEL SYPI ++RLYTQ ++GYGESLIDYN++ R+G+G+ LN
Sbjct: 228 LSAKGQYNWNT-GYGGAELGLSYPITKHVRLYTQVYSGYGESLIDYNFNQTRVGVGVMLN 286

Query: 292 DIL 294
D+
Sbjct: 287 DLF 289


86  Sbal_3673  Sbal_3679  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Sbal_3673  -2      13       2.545231   magnesium transporter
Sbal_3674  -3      16       1.286724   molybdate ABC transporter substrate-binding
Sbal_3675  -2      14       0.568815   OmpA/MotB domain-containing protein
Sbal_3676  -1      15       -0.813155  transposase IS3/IS911 family protein
Sbal_3677  0       14       -0.756991  hypothetical protein
Sbal_3678  0       13       -1.127913  hypothetical protein
Sbal_3679  -1      12       -1.412008  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_3673  FLGMOTORFLIG  31     0.014    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 30.5 bits (69), Expect = 0.014
Identities = 19/124 (15%), Positives = 43/124 (34%), Gaps = 13/124 (10%)

Query: 2 PVDNSENDHT---GHSLDQLNQALSSGMFVHVRNMLQK-MAASDIALILESSPPSARQVL 57
+ + D+ L + + G + R +L+K + I+ + +
Sbjct: 57 TITSELKDNVLLEFKELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSA----- 111

Query: 58 WQLIDQEQIGDILDELSEELKDPLIRSMSPERVAKATASMDTDDLAYILRSLPDAVYKQV 117
Q + + + I+ P+ +A + +D ++IL SLP V V
Sbjct: 112 ----LQSRPFEFVRRADPANILNFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNV 167

Query: 118 LQSM 121
+ +
Sbjct: 168 ARRI 171


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
Sbal_3675  OMPADOMAIN  97     4e-26    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 97.3 bits (242), Expect = 4e-26
Identities = 38/123 (30%), Positives = 63/123 (51%), Gaps = 11/123 (8%)

Query: 117 LNMPNEVTFGVDQTELSDGAKRVLNSVAVVAKEYSKT--QLNVLGYTDSSGSDSYNLRLS 174
+ ++V F ++ L + L+ + + VLGYTD GSD+YN LS
Sbjct: 215 FTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLS 274

Query: 175 QVRAGEVGNYLMSKGVASARVKSKGMGEASPIASNANANGR---------AQNRRVEIVL 225
+ RA V +YL+SKG+ + ++ ++GMGE++P+ N N + A +RRVEI +
Sbjct: 275 ERRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEV 334

Query: 226 TPT 228

Sbjct: 335 KGI 337


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_3676  PF04183  24     0.049    IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 24.5 bits (53), Expect = 0.049
Identities = 6/44 (13%), Positives = 15/44 (34%)

Query: 20 GRLLSDVARQYGLSAKAVYQWVRESDLQPQQRECALMSEIAQLQ 63
R +S + + G+ + YQ + ++ + A
Sbjct: 489 LRFISPLMVRLGVPERRFYQLLAAVLSDYMKKHPQMSERFALFS 532


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
Sbal_3679  FLGHOOKAP1  29     0.014    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 28.8 bits (64), Expect = 0.014
Identities = 16/60 (26%), Positives = 34/60 (56%), Gaps = 1/60 (1%)

Query: 112 QQYLTNKRLSEIADRLNTIDREISSLDGKINNLTDKADLLKQKNSLLNEKNQLLDERSRL 171
Q N + D++N ++I+SL+ +I+ LT N+LL++++QL+ E +++
Sbjct: 153 QDKQVNIAIGASVDQINNYAKQIASLNDQISRLTGVGA-GASPNNLLDQRDQLVSELNQI 211


87  Sbal_3727  Sbal_3735  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias   Product
Sbal_3727  -2      18       3.545404  sodium/hydrogen exchanger
Sbal_3728  -2      15       3.763029  galactokinase
Sbal_3729  -1      17       3.444933  aldose 1-epimerase
Sbal_3730  -1      17       3.186524  hypothetical protein
Sbal_3731  -2      17       3.320039  alcohol dehydrogenase
Sbal_3732  -2      17       2.831235  collagenase
Sbal_3733  -1      17       1.873932  Sel1 domain-containing protein
Sbal_3734  -2      17       1.272870  acriflavin resistance protein
Sbal_3735  -1      14       0.480304  RND family efflux transporter MFP subunit
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_3727  TYPE3IMSPROT  29     0.036    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 29.3 bits (66), Expect = 0.036
Identities = 27/159 (16%), Positives = 56/159 (35%), Gaps = 13/159 (8%)

Query: 146 VAIGILIMQDIFAVLFLTISKGDVPSVWAFALLLLPLAKPLIYKAFDRVGHGELLVLFGL 205
+ + + ++ + + +P A + ++ + Y F + + L +
Sbjct: 43 MGLSDYYFEHFSKLMLIPAEQSYLPFSQALSYVVDNVLLEFFYLCFPLLT---VAALMAI 99

Query: 206 VMALVVGAWLFESVGLKPDLGAL--IIGI-LLAGHKKSSELAKSLFYFKELFLVAFFLTI 262
+V +L +KPD+ + I G + K E KS+ L ++ + +
Sbjct: 100 ASHVVQYGFLISGEAIKPDIKKINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIK 159

Query: 263 G----LNGLPTVSDIVLAALLVLLVPLKILLFVYILTRF 297
G L LPT + + LL + L V F
Sbjct: 160 GNLVTLLQLPTCG---IECITPLLGQILRQLMVICTVGF 195


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Sbal_3728  RTXTOXINA  29     0.033    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 29.2 bits (65), Expect = 0.033
Identities = 11/41 (26%), Positives = 21/41 (51%), Gaps = 4/41 (9%)

Query: 124 LAAGLSSSGALVVAFGTAISDTSQLHLSPMAVAQLAQRGEH 164
A GLS+S A +A++ L +SP++ +A + +
Sbjct: 296 AAQGLSTSAAAAGLIASAVT----LAISPLSFLSIADKFKR 332


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
Sbal_3732  MICOLLPTASE  300    3e-88    Microbial collagenase metalloprotease (M9) signature.
		>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature.

Length = 1104

Score = 300 bits (770), Expect = 3e-88
Identities = 109/557 (19%), Positives = 219/557 (39%), Gaps = 47/557 (8%)

Query: 114 SDFVGKSG-QDLVDQLSQSTPECVGKLYSLKGSAATALFSEANVISVANAIATKAKDYTG 172
D + + DLV+ + + E V L++ + T + V ++ + + YT
Sbjct: 95 FDELNRMNYSDLVELIKTISYENVPDLFNFNDGSYTFFSNRDRVQAIIYGLEDSGRTYTA 154

Query: 173 VDVQHLESHIYFVRAALYVQFYSPNDVPAYSSAAKASLKSALNALFANAAIWTVSDDNAG 232
D + + + + F+RA Y+ FY+ + K A+ A+ N+ + G
Sbjct: 155 DDDKGIPTLVEFLRAGYYLGFYNKQLSYLNTPQLKNECLPAMKAIQYNSNFRLGTKAQDG 214

Query: 233 VLKEALILIDSAELGADFNHVTIKVLTDYDANWQASFAMNAAANSVFTTLFRAQWNDDMQ 292
V++ LI +A + + I VL+D+ N + + N+VF + + +
Sbjct: 215 VVEALGRLIGNASADPEVINNCIYVLSDFKDNIDKYGSNYSKGNAVFNLMKGIDYYTNSV 274

Query: 293 -----ALFARDQGILDALNNFQLE------HRDLLGTNAEYLLVNSVKELSRLYYIDAMR 341
A++ + ++ + D L + +L+ N++ R+ R
Sbjct: 275 IYNTKGYDAKNTEFYNRIDPYMERLESLCTIGDKLNNDNAWLVNNALYYTGRM---GKFR 331

Query: 342 PRVTQLVKNILSSTSKTEP----SKVLWYAAAEMADYYDRSHCNDYNICGFKAQLEADAL 397
+ + L K P + ++ S ND + KA L
Sbjct: 332 ED-PSISQRALERAMKEYPYLSYQYIEAANDLDLNFGGKNSSGNDIDFNKIKADAREKYL 390

Query: 398 PFNWKCSDSLKI-RAQD-LYQDQAKWACDVLTSQESYFHSKLETGMQPVGQDNNDDLELV 455
P + D + +A D + +++ K ++ F ++ + +D L +V
Sbjct: 391 PKTYTFDDGKFVVKAGDKVTEEKIKRLYWASKEVKAQFMRVVQNDKALEEGNPDDILTVV 450

Query: 456 IFGSSSEYKSLANSIFGINTDNGGMYLEGSPAGLKNQARFIAYEAEWRTPDFHVWNL-QH 514
I+ S EYK L I G +TDNGG+Y+E N F YE + + L +H
Sbjct: 451 IYNSPEEYK-LNRIINGFSTDNGGIYIE-------NIGTFFTYERTPEESIYTLEELFRH 502

Query: 515 EYVHYLDGRYNLFGDFSRGTS---ANTIWWIEGLAEYIS---------YRDANTAAIAMG 562
E+ HYL GRY + G + +G W+ EG AE+ + R + T +A
Sbjct: 503 EFTHYLQGRYVVPGMWGQGEFYQEGVLTWYEEGTAEFFAGSTRTDGIKPRKSVTQGLAYD 562

Query: 563 ETGEFMLSTIFKNNYESGQDRIYRWGYLAVRFMFEHHRDDVRQILAYLRNDQYAEYQTFM 622
L + Y S Y +G+ +M+ ++ ++ Y++N+ + Y+ ++
Sbjct: 563 RNNRMSLYGVLHAKYGS--WDFYNYGFALSNYMYNNNMGMFNKMTNYIKNNDVSGYKDYI 620

Query: 623 DGIGTRY--DNEWQGWL 637
+ + Y ++++Q ++
Sbjct: 621 ASMSSDYGLNDKYQDYM 637



Score = 75.1 bits (184), Expect = 1e-15
Identities = 36/184 (19%), Positives = 63/184 (34%), Gaps = 26/184 (14%)

Query: 510 WNLQHEYVHYLDGRYNLFGDFSRGTSANTIWWIEGLAEYISYRDANTAAIA-MGETGEFM 568
+ L +Y Y+D N + ++ + A+ I+ + ++ + + +
Sbjct: 627 YGLNDKYQDYMDSLLNNIDNLDVPLVSD-EYVNGHEAKDINEITNDIKEVSNIKDLSSNV 685

Query: 569 LSTIFKNNYESGQDRIYRWGYLAVRFMFEHH-----RDDVRQILAYLRNDQYAEYQTF-- 621
+ F Y+ R Y+ R E + + IL L + Y+T
Sbjct: 686 EKSQFFTTYD------MRGTYVGGRSQGEENDWKDMNSKLNDILKELSKKSWNGYKTVTA 739

Query: 622 ------MDGIGTR-YDNEWQGWLASGLSTADDGIVDKGPSDV-DAEPSGREGNWTGPAGT 673
+DG G YD + G T D V+K P V ++ S GT
Sbjct: 740 YFVNHKVDGNGNYVYDVVFHGMNT---DTNTDVHVNKEPKAVIKSDSSVIVEEEINFDGT 796

Query: 674 ISKD 677
SKD
Sbjct: 797 ESKD 800


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_3734  ACRIFLAVINRP  647    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 647 bits (1671), Expect = 0.0
Identities = 301/1032 (29%), Positives = 509/1032 (49%), Gaps = 44/1032 (4%)

Query: 8 IRHPIFASVLSIMAVLLGLIAFQKLDIQYFPEHTTHSASVNASIAGASADFMSSNVADKL 67
IR PIFA VL+I+ ++ G +A +L + +P + SV+A+ GA A + V +
Sbjct: 6 IRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQVI 65

Query: 68 IAAASGIDKVDTM-STDCSEGRCSLTIKFNDDTS-DIEYTNLMNKLRSSVEGINDFPQSM 125
+GID + M ST S G ++T+ F T DI + NKL+ + PQ
Sbjct: 66 EQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLAT---PLLPQE- 121

Query: 126 IDKPTVTDDTSATDSASNIITFVNAGGMEKQAMYDYISQQLVPQLKQVQGVGAVWGPYGG 185
+ + ++ + S++ + G + + DY++ + L ++ GVG V G
Sbjct: 122 VQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDV--QLFG 179

Query: 186 SQKAVRVWLNPEQMKALNIKAADVVGTLGSYNASFTSG------AIKGKSRDFSINPLNQ 239
+Q A+R+WL+ + + + DV+ L N +G A+ G+ + SI +
Sbjct: 180 AQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 240 VETLEDVKDLVIKVS-EGKIIRVADVADVVMGEESLSPSILSIGGHSAMSLQILPLSNAN 298
+ E+ + ++V+ +G ++R+ DVA V +G E+ + I I G A L I + AN
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYN-VIARINGKPAAGLGIKLATGAN 298

Query: 299 PVTVASNIKAEIARMQQHLPQGLEMTLAYNQADFIEASIDEGFSALIEAVILVSLIVVLF 358
+ A IKA++A +Q PQG+++ Y+ F++ SI E L EA++LV L++ LF
Sbjct: 299 ALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLF 358

Query: 359 LGSLRAASIPIITIPVCVIGVFAVMSALGFSINVLTILAIILAIGLVVDDAIVVVENCYR 418
L ++RA IP I +PV ++G FA+++A G+SIN LT+ ++LAIGL+VDDAIVVVEN R
Sbjct: 359 LQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVER 418

Query: 419 HI-ENGETPFNAAIKGCQEIIFPIIAMTLTLAAVYLPIGLMSGLTADLFRQFSFTLAAAV 477
+ E+ P A K +I ++ + + L+AV++P+ G T ++RQFS T+ +A+
Sbjct: 419 VMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAM 478

Query: 478 MISGVVALTLSPMMSAYLINTTEQQPK-----WFSRVEHALQQLNDMYIKELDKRFTRKR 532
+S +VAL L+P + A L+ + +F + Y + K
Sbjct: 479 ALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTG 538

Query: 533 LMLGAAVVLIGLAGIAYWQLPKILLPAEDSGFIDVASNGPTGVGRQYHLNHNAELNGVMD 592
L +++ + + +LP LP ED G P G ++ ++
Sbjct: 539 RYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYL 598

Query: 593 EHPAVGANLSY------IEGEPVN----HVLLKPWGERS---EGIDEVISDLMTKSKESV 639
++ + G+ N V LKPW ER+ + VI + +
Sbjct: 599 KNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIR 658

Query: 640 SAYNMSFSIRSANNLSIANNLRLELTTLDRNK---DELNDTAAKVQKLLEDYPG-LNNVG 695
+ + F++ + L A EL +D+ D L ++ + +P L +V
Sbjct: 659 DGFVIPFNMPAIVELGTATGFDFEL--IDQAGLGHDALTQARNQLLGMAAQHPASLVSVR 716

Query: 696 NSVLRDQLRYDLSIDRNAIILSGVSYGDVTNALSTFLGSVKAADLHATDGFTYPIQVQVN 755
+ L D ++ L +D+ GVS D+ +ST LG D G + VQ +
Sbjct: 717 PNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDF-IDRGRVKKLYVQAD 775

Query: 756 LDKLSDFKVLNKLYVTSESGQALPLSQFVSIKQTTAESNIKTFMGLDSAELTADVMPGYS 815
+ ++KLYV S +G+ +P S F + ++ + GL S E+ + PG S
Sbjct: 776 AKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTS 835

Query: 816 TDEIKAYLDEQLPTLLNDAQGFKYNGVVKDLMDSQAGTQSLFLLALVFIYLILAAQFESF 875
+ + A + E L + L G+ + G+ S +L ++ V ++L LAA +ES+
Sbjct: 836 SGDAMALM-ENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESW 894

Query: 876 VDPLIILLTVPLCIVGALLTLTLFGQSVNIYSQIGLLTLVGLVTKHGILLVEFANK-QQY 934
P+ ++L VPL IVG LL TLF Q ++Y +GLLT +GL K+ IL+VEFA +
Sbjct: 895 SIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEK 954

Query: 935 QGLSAIEAARSSAKSRLRPILMTSLTMILSAIPLALASGPGSLGLANIGLVLVGGLLAGT 994
+G +EA + + RLRPILMTSL IL +PLA+++G GS +G+ ++GG+++ T
Sbjct: 955 EGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSAT 1014

Query: 995 FFSLFVVPVAYV 1006
++F VPV +V
Sbjct: 1015 LLAIFFVPVFFV 1026



Score = 91.4 bits (227), Expect = 1e-20
Identities = 63/362 (17%), Positives = 122/362 (33%), Gaps = 22/362 (6%)

Query: 662 LELTTLDRNKDELNDTAAK-VQKLLEDYPGLNNVGNSVLRDQLRYDLSIDRNAIILSGVS 720
+D+++D A V+ L G+ +V + +R + +D + + ++
Sbjct: 142 FVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQYAMR--IWLDADLLNKYKLT 199

Query: 721 YGDVTNALS-----TFLGSVKAADLHATDGFTYPIQVQVNLDKLSDFKVLNKLYVTSESG 775
DV N L G + I Q +F + G
Sbjct: 200 PVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKNPEEFG--KVTLRVNSDG 257

Query: 776 QALPLSQFVSIKQTTAESNIK-TFMGLDSAELTADVMPGYST----DEIKAYLDEQLPTL 830
+ L ++ N+ G +A L + G + IKA L E P
Sbjct: 258 SVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALDTAKAIKAKLAELQPFF 317

Query: 831 LNDAQGFKYNGVVKDLMDSQAGTQSLF---LLALVFIYLILAAQFESFVDPLIILLTVPL 887
QG K Q + A++ ++L++ ++ LI + VP+
Sbjct: 318 ---PQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPV 374

Query: 888 CIVGALLTLTLFGQSVNIYSQIGLLTLVGLVTKHGILLVEFANK-QQYQGLSAIEAARSS 946
++G L FG S+N + G++ +GL+ I++VE + L EA S
Sbjct: 375 VLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKS 434

Query: 947 AKSRLRPILMTSLTMILSAIPLALASGPGSLGLANIGLVLVGGLLAGTFFSLFVVPVAYV 1006
++ ++ + IP+A G + +V + +L + P
Sbjct: 435 MSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCA 494

Query: 1007 AM 1008
+
Sbjct: 495 TL 496


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Sbal_3735  RTXTOXIND  51     6e-09    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 50.6 bits (121), Expect = 6e-09
Identities = 38/192 (19%), Positives = 70/192 (36%), Gaps = 29/192 (15%)

Query: 123 AEQDNTKAKADLDKAKSTLALAKTKLERIEDLL---IKEPFALAKQDVDELRENVNLADA 179
A + K+ L++ +S + AK + + + L I + ++ L LA
Sbjct: 264 AVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLE--LAKN 321

Query: 180 DFRQKQATMNDYLIKAPFDG---QLTSFSQSIGSQIGAGTALVTLYSLN-PVEVRYAISQ 235
+ RQ+ + I+AP QL ++ G + L+ + + +EV +
Sbjct: 322 EERQQASV-----IRAPVSVKVQQLKVHTE--GGVVTTAETLMVIVPEDDTLEVTALVQN 374

Query: 236 NDFGKAQKGQKVNVTVEAYGNKVFKGL---VNYVAP--AVDESSG-------RVEVHAAL 283
D G GQ + VEA+ + L V + D+ G +E +
Sbjct: 375 KDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEENCLS 434

Query: 284 -DNPEFKLAPGM 294
N L+ GM
Sbjct: 435 TGNKNIPLSSGM 446



Score = 46.7 bits (111), Expect = 1e-07
Identities = 23/108 (21%), Positives = 44/108 (40%), Gaps = 7/108 (6%)

Query: 105 ISAIHFSNGDKVTKGQVIAEQDNTKAKADLDKAKSTLALAKTKLERIEDLLIKEPFALAK 164
+ I G+ V KG V+ + A+AD K +S+L A+ + R + L ++
Sbjct: 107 VKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILS----RSIEL 162

Query: 165 QDVDELRENVNLADADFRQKQATMNDYLIKAPFDGQLTSFSQSIGSQI 212
+ EL+ + +++ LIK F T +Q ++
Sbjct: 163 NKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFS---TWQNQKYQKEL 207


88  Sbal_3752  Sbal_3758  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias   Product
Sbal_3752  0       17       2.474645  porin
Sbal_3753  0       16       2.445299  single-strand binding protein
Sbal_3754  -1      16       2.296597  major facilitator superfamily transporter
Sbal_3755  -1      16       2.279921  excinuclease ABC subunit A
Sbal_3756  -1      15       1.178814  peptidase M28
Sbal_3757  -1      16       1.586025  DEAD/DEAH box helicase
Sbal_3758  1       13       0.811963  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_3752  ECOLNEIPORIN  65     5e-14    E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 65.2 bits (159), Expect = 5e-14
Identities = 59/296 (19%), Positives = 105/296 (35%), Gaps = 23/296 (7%)

Query: 74 GLTHTNSDDVWDVGDANSRIGFAAEHQLGNGLVGFAKGEFKVDIKDDGDFGDARKAYVGL 133
G + + + D S+IGF + LGNGL + E K I R++++GL
Sbjct: 41 GAQAASVETGTGIVDLGSKIGFKGQEDLGNGLKAIWQVEQKASIAGTDSGWGNRQSFIGL 100

Query: 134 KGFFGSVAIGKQAVTQEIISDPVDIFNRSGTPLAYDS-ASPFRLNNLVTYRK-EFGDLLF 191
KG FG + +G+ + D ++ ++ L + A P V Y EF L
Sbjct: 101 KGGFGKLRVGRLNSVLKDTGD-INPWDSKSDYLGVNKIAEPEARLISVRYDSPEFAGLSG 159

Query: 192 SADAQFDGNKGSGGSDFVNAGIRYKTDLIYIAAAFYNKELEDNKDENT------FGVTLA 245
S + N G S+ +AG YK ++ K ++ +
Sbjct: 160 SVQYALNDNAGRHNSESYHAGFNYKNGGFFVQYGGAYKRHHQVQENVNIEKYQIHRLVSG 219

Query: 246 KSFNDLYLAAAYQNIEKDDIDGTTLDGSTLDVVASYPITD-------SYKIKLGVSRYDD 298
+ LY + A Q + ++ S +V A+ SY S
Sbjct: 220 YDNDALYASVAVQQQDAKLVEENYSHNSQTEVAATLAYRFGNVTPRVSYAHGFKGSFDAT 279

Query: 299 GGNDITSAKYNAYNTTLEWHKTPQFSTFIEY---QKTDFEYRETNDQFMLGMRYNF 351
N+ Y+ E+ + + S + Q+ E + + +G+R+ F
Sbjct: 280 NYNND----YDQVVVGAEYDFSKRTSALVSAGWLQEGKGESKFVSTAGGVGLRHKF 331


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_3753  PF03544  29     0.013    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 29.2 bits (65), Expect = 0.013
Identities = 18/98 (18%), Positives = 33/98 (33%), Gaps = 4/98 (4%)

Query: 125 PMGGGMPQNAGYQSAPQQAAPAQNQYAPAPQAAPAYQAPAQQQYAAPAPAQQQYGQQQAQ 184
P+ M A + P + P P+ P + P + P + + +
Sbjct: 49 PISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKP----KPKPK 104

Query: 185 PQQGGYAPKPQAAPAPAYQAPAAPAQRPAPQPQQNFTP 222
P+ +P+ P PA+P + AP + T
Sbjct: 105 PKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTA 142


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_3754  TCRTETB  86     3e-20    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 86.1 bits (213), Expect = 3e-20
Identities = 73/358 (20%), Positives = 139/358 (38%), Gaps = 48/358 (13%)

Query: 48 LWVGIAIGAYGLTQAVLQIPMGILSDKYGRKPIILIGLVLFAIGSLIAANADSIYGV-VF 106
WV A+ LT ++ G LSD+ G K ++L G+++ GS+I S + + +
Sbjct: 52 NWV---NTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIM 108

Query: 107 GRAVQGMGAIA--AAVLALAADLTRDEQRTKVMAIIGMCIGGSFALSLLVGPIVAQHVGL 164
R +QG GA A A V+ + A E R K +IG + + +G ++A ++
Sbjct: 109 ARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHW 168

Query: 165 SGLFFLTAILAVTGMLIVQFLVPNPISHAP---KGDTLATPARLKRML-------TDPQL 214
S L + I +T +++ L KG L + + ML + +
Sbjct: 169 SYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIV 228

Query: 215 FRLDAGIFILHL-----------------VLTAVFVALPLDLVDAGLVKEKHWMLYF--- 254
L IF+ H+ + V + AG V +M+
Sbjct: 229 SVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQ 288

Query: 255 --PAFVGAFFL---MVPLIIIG------VKRKNTKAMFQIALVIMIVALLAMALFSN-NL 302
A +G+ + + +II G V R+ + I + + V+ L +
Sbjct: 289 LSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTS 348

Query: 303 WVLSFAVVLFFTGFNYLEASLPSLIAKFCPVGEKGSAMGVYSTSQFLGAFCGGMLGGG 360
W ++ +V G ++ + + ++++ E G+ M + + + FL G + GG
Sbjct: 349 WFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGG 406



Score = 31.0 bits (70), Expect = 0.010
Identities = 18/111 (16%), Positives = 43/111 (38%)

Query: 274 RKNTKAMFQIALVIMIVALLAMALFSNNLWVLSFAVVLFFTGFNYLEASLPSLIAKFCPV 333
+ K + ++I + + + +L A + G A + ++A++ P
Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134

Query: 334 GEKGSAMGVYSTSQFLGAFCGGMLGGGAFQLVGAVGVFIVAVILMSIWLFL 384
+G A G+ + +G G +GG + + ++ +I + FL
Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFL 185


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_3758  BCTERIALGSPG  49     5e-10    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 48.7 bits (116), Expect = 5e-10
Identities = 18/61 (29%), Positives = 30/61 (49%), Gaps = 3/61 (4%)

Query: 3 MSLKKRVQGFTLIELVVVIIILGILAVIAAPKFMNLQRDAKVNAVKGYLGQMQDMTKMLH 62
M + +GFTL+E++VVI+I+G+LA + P M + A + + L
Sbjct: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAV---SDIVALENALD 57

Query: 63 M 63
M
Sbjct: 58 M 58


89  Sbal_3821  Sbal_3838  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Sbal_3821   2      19       -0.302578  rod shape-determining protein MreC
Sbal_3822   2      20       -1.058332  rod shape-determining protein MreB
Sbal_3823   3      23       -1.742229  hypothetical protein
Sbal_3824   3      24       -1.555573  MSHA biogenesis protein MshP
Sbal_3825   3      24       -2.123053  MSHA biogenesis protein MshO
Sbal_3826   2      20       -1.121303  MSHA pilin protein MshD
Sbal_3827   0      18       -0.033199  methylation site containing protein
Sbal_3828  -1      17        1.410244  methylation site containing protein
Sbal_3829  -2      17        1.991475  MSHA pilin protein MshB
Sbal_3830  -2      16        1.844499  hypothetical protein
Sbal_3831  -1      16        1.935454  type II secretion system protein
Sbal_3832   0      15        1.588204  type II secretion system protein E
Sbal_3833   0      15        1.453301  hypothetical protein
Sbal_3834   1      17       -0.243723  MSHA biogenesis protein MshM
Sbal_3835  -1      17       -1.418996  pilus (MSHA type) biogenesis protein MshL
Sbal_3836  -2      15       -1.919603  MSHA biogenesis protein MshK
Sbal_3837  -2      14       -2.153521  MSHA biogenesis protein MshJ
Sbal_3838  -2      14       -1.645446  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
Sbal_3821  IGASERPTASE  30     0.022    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 29.6 bits (66), Expect = 0.022
Identities = 25/102 (24%), Positives = 41/102 (40%), Gaps = 9/102 (8%)

Query: 237 EVLTEDGQSYARVTAQPLAALDRIRYVLLIWPSPDSGVTLPNQPTVPAADHSLIENSSKI 296
+V TE Q +VT+Q ++ V P + N PTV + + ++
Sbjct: 1113 KVETEKTQEVPKVTSQVSPKQEQSETVQ-----PQAEPARENDPTVNIKEPQ-SQTNTTA 1166

Query: 297 GSASPAEGTSADATKPVTAPAATATVAKPA---TETTPPATE 335
+ PA+ TS++ +PVT T TTP T+
Sbjct: 1167 DTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQ 1208


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_3822  SHAPEPROTEIN  558    0.0      Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 558 bits (1440), Expect = 0.0
Identities = 317/348 (91%), Positives = 334/348 (95%), Gaps = 1/348 (0%)

Query: 1 MFKKLRGIFSNDLSIDLGTANTLIYVRGEGIVLNEPSVVAIRGERGGSGQKSVAAVGTEA 60
M KK RG+FSNDLSIDLGTANTLIYV+G+GIVLNEPSVVAIR +R GS KSVAAVG +A
Sbjct: 1 MLKKFRGMFSNDLSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGS-PKSVAAVGHDA 59

Query: 61 KQMLGRTPGNIQAIRPMKDGVIADFYVTEKMLQHFIKQVHNNSFFRPSPRVLVCVPVGAT 120
KQMLGRTPGNI AIRPMKDGVIADF+VTEKMLQHFIKQVH+NSF RPSPRVLVCVPVGAT
Sbjct: 60 KQMLGRTPGNIAAIRPMKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGAT 119

Query: 121 QVERRAIRESAMGAGAREVYLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAIISLN 180
QVERRAIRESA GAGAREV+LIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVA+ISLN
Sbjct: 120 QVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLN 179

Query: 181 GVVYSSSVRIGGDKFDDAIINYVRRNYGSLIGEATAERIKHTIGTAYPGDEVLEIEVRGR 240
GVVYSSSVRIGGD+FD+AIINYVRRNYGSLIGEATAERIKH IG+AYPGDEV EIEVRGR
Sbjct: 180 GVVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGR 239

Query: 241 NLAEGVPRSFTLNSNEILEALQEPLSGIVSAVMVALEQSPPELASDISERGMVLTGGGAL 300
NLAEGVPR FTLNSNEILEALQEPL+GIVSAVMVALEQ PPELASDISERGMVLTGGGAL
Sbjct: 240 NLAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGAL 299

Query: 301 LRDLDRLLMQETGIPVMVADDPLTCVARGGGKALEMIDMHGGDLFSEE 348
LR+LDRLLM+ETGIPV+VA+DPLTCVARGGGKALEMIDMHGGDLFSEE
Sbjct: 300 LRNLDRLLMEETGIPVVVAEDPLTCVARGGGKALEMIDMHGGDLFSEE 347


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_3825  BCTERIALGSPG  33     4e-04    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 33.3 bits (76), Expect = 4e-04
Identities = 12/24 (50%), Positives = 19/24 (79%)

Query: 8 RMQTSKRGFTLVEMVTVILILGIL 31
R +RGFTL+E++ VI+I+G+L
Sbjct: 2 RATDKQRGFTLLEIMVVIVIIGVL 25


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_3826  BCTERIALGSPH  37     1e-05    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 37.2 bits (86), Expect = 1e-05
Identities = 18/43 (41%), Positives = 30/43 (69%), Gaps = 3/43 (6%)

Query: 14 RQLGFTLIELVIGMLVIGIAIVMLTSMLFPQA--DRAASTLHR 54
RQ GFTL+E+++ +L++G++ M+ + FP + D AA TL R
Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGMVL-LAFPASRDDSAAQTLAR 43


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_3827  BCTERIALGSPH  43     5e-08    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 43.4 bits (102), Expect = 5e-08
Identities = 27/82 (32%), Positives = 43/82 (52%), Gaps = 1/82 (1%)

Query: 8 KQAGFTLVELVTTIILISILAVVVLPRLFTQSSYSAYSLRNEFISELRQVQQKALNNTDR 67
+Q GFTL+E++ ++L+ + A +VL SA F ++LR VQQ+ L +
Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQT-GQ 60

Query: 68 CFRVTVSVTGYQVSQFSARNGA 89
F V+V +Q AR+GA
Sbjct: 61 FFGVSVHPDRWQFLVLEARDGA 82


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_3828  BCTERIALGSPG  48     8e-10    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 48.3 bits (115), Expect = 8e-10
Identities = 18/53 (33%), Positives = 32/53 (60%), Gaps = 4/53 (7%)

Query: 1 MKRQQGFTLIELVVVIIILGILAVTAAPKFINLQGDAR----VSALNGLKASI 49
+Q+GFTL+E++VVI+I+G+LA P + + A VS + L+ ++
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENAL 56


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_3829  BCTERIALGSPG  44     5e-08    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 43.7 bits (103), Expect = 5e-08
Identities = 15/36 (41%), Positives = 27/36 (75%)

Query: 4 KQTGFSLIELVIVIVILGLLAATAIPRFLNVTDDAE 39
KQ GF+L+E+++VIVI+G+LA+ +P + + A+
Sbjct: 6 KQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKAD 41


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_3831  BCTERIALGSPF  302    e-102    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 302 bits (776), Expect = e-102
Identities = 116/407 (28%), Positives = 207/407 (50%), Gaps = 6/407 (1%)

Query: 1 MPIYQYRGRSGQGQSVTGQLDAASESAAADMLLARGIIPLEVKVAKVVK----SFSLTKL 56
M Y Y+ QG+ G +A S A +L RG++PL V + + S L+
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 57 FGGKVALEELQIFTRQMYSLTRSGIPILRAIAGLSETAHSQRMKDALNDISEQLTAGRPL 116
+++ +L + TRQ+ +L + +P+ A+ +++ + + + + ++ G L
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 117 SSSMNQHPDVFDSLFVSMVHVGENTGKLEDAFIQLSGYIEREQETRRRIKSAMRYPMFVL 176
+ +M P F+ L+ +MV GE +G L+ +L+ Y E+ Q+ R RI+ AM YP VL
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPC-VL 179

Query: 177 IAIALAMV-ILNIMVIPKFAEMFSRFGADLPWATKVLIGTSNLFVNYWALMLVALIGTII 235
+A+A+V IL +V+PK E F LP +T+VL+G S+ + ML+AL+ +
Sbjct: 180 TVVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFM 239

Query: 236 GIRYWHHTEKGEKQWDKWKLHIPAVGSIIERSTLARYCRSFSMMLSAGVPMTQALSLVAD 295
R EK + + LH+P +G I ARY R+ S++ ++ VP+ QA+ + D
Sbjct: 240 AFRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGD 299

Query: 296 AVDNAYMHDKIVGMRRGIESGDSMLRVSNQSKLFTPLVLQMVAVGEETGQIDQLLNDAAD 355
+ N Y ++ + G S+ + Q+ LF P++ M+A GE +G++D +L AAD
Sbjct: 300 VMSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAAD 359

Query: 356 FYEGEVDYDLKNLTAKLEPILIGFVAVIVLVLALGIYLPMWDMLNVV 402
+ E + EP+L+ +A +VL + L I P+ + ++
Sbjct: 360 NQDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
Sbal_3833  IGASERPTASE  34     0.002    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 33.9 bits (77), Expect = 0.002
Identities = 33/257 (12%), Positives = 72/257 (28%), Gaps = 16/257 (6%)

Query: 63 AEPAEATASAQAQEQHTLTAQNEPVRMESLVSKESSPSVDAAAEPLKLATAQKIAAAMTA 122
+ QA + E R++ +P+ + T + +A
Sbjct: 994 TTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSE-------TTETVAENSKQ 1046

Query: 123 NSEEFEPSASDVASSQAPELGTAKEAEPERQAQPQQKPQADVSLALAADKVDTVSEPSHS 182
S+ E + D + A AKEA+ +A Q A + E +
Sbjct: 1047 ESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATV 1106

Query: 183 QPNR-----ATSSVRSAEVTVAAPSALMMSERASVAADNEGNASNGLNVQEVKADAGTHQ 237
+ + +VT SE A+ +N++E ++ T
Sbjct: 1107 EKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNT-- 1164

Query: 238 LAPKEQGQMAITEVKLTPKQLAKKRFTLASEAERDGKLKEAISYYEQTLALEPSMHEARK 297
+ Q A + + + + + + + T+ E S +
Sbjct: 1165 --TADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNR 1222

Query: 298 QLAALHYGQGQLAKASE 314
++ + A+
Sbjct: 1223 HRRSVRSVPHNVEPATT 1239


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_3835  BCTERIALGSPD  177    3e-50    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 177 bits (450), Expect = 3e-50
Identities = 72/293 (24%), Positives = 127/293 (43%), Gaps = 26/293 (8%)

Query: 257 PQAGLVTIRAFPSELRQVRTFLNSAESHLQRQVILEAKIIEVTLSDGYQQGIQWDNVLGH 316
Q + + A P + + + + + QV++EA I EV +DG GIQW N
Sbjct: 316 GQTNALIVTAAPDVMNDLERVIAQLDIR-RPQVLVEAIIAEVQDADGLNLGIQWANKNAG 374

Query: 317 VGN-------TNVNFGTSKGPGLSDKITSAIGGVTS------LSIKGSDFTTMINLLDTQ 363
+ + + ++S++ S ++ ++ L +
Sbjct: 375 MTQFTNSGLPISTAIAGANQYNKDGTVSSSLASALSSFNGIAAGFYQGNWAMLLTALSSS 434

Query: 364 GDVDVLSSPRVTASNNQKAVIKVGTDEYFVTDVSSTTVGGTPPVTTPQVELTPFFSGIAL 423
D+L++P + +N +A VG + +T S T G T + + GI L
Sbjct: 435 TKNDILATPSIVTLDNMEATFNVGQEVPVLT--GSQTTSGDNIFNTVERKTV----GIKL 488

Query: 424 DVTPQIDSDGNVLLHVHPSVIDVKEQTKDIKVSDASLELPLAQSEIRESDTVIRAASGDV 483
V PQI+ +VLL + V V + S S +L + R + + SG+
Sbjct: 489 KVKPQINEGDSVLLEIEQEVSSVADAA-----SSTSSDLGATFN-TRTVNNAVLVGSGET 542

Query: 484 VIIGGLMKSESIEVVSQVPLLGDIPYLGELFKNRSKQKKKTELIILLKPTVVG 536
V++GGL+ + +VPLLGDIP +G LF++ SK+ K L++ ++PTV+
Sbjct: 543 VVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIR 595


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_3838  PF06580  29     0.014    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 28.7 bits (64), Expect = 0.014
Identities = 11/53 (20%), Positives = 20/53 (37%), Gaps = 2/53 (3%)

Query: 25 LGAYAAGFLVLFAALGGYSYWQVSELQQAQQLAAQQ--KLQFDTQKQALEAQI 75
L +V F Y W + + ++ + + + Q AL+AQI
Sbjct: 118 LSIIFNVVVVTFMWSLLYFGWHFFKNYKQAEIDQWKMASMAQEAQLMALKAQI 170


90  Sbal_3868  Sbal_3878  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Sbal_3868   0      15       -0.410316  ATP-dependent protease ATP-binding subunit HslU
Sbal_3869   1      15       -0.472347  hypothetical protein
Sbal_3870   1      13       -0.147695  integrase catalytic subunit
Sbal_3871   1      13        0.929324  hypothetical protein
Sbal_3872   2      14        2.370321  deoxyribodipyrimidine photolyase-like protein
Sbal_3873   1      13        3.026162  C factor cell-cell signaling protein
Sbal_3874   0      13        3.096313  peptidase M4 thermolysin
Sbal_3876  -2      17        3.596110  two component Fis family transcriptional
Sbal_3877  -1      18        3.441364  integral membrane sensor signal transduction
Sbal_3878  -2      17        2.941701  dTDP-4-dehydrorhamnose reductase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Sbal_3868  HTHFIS  31     0.013    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.6 bits (69), Expect = 0.013
Identities = 11/36 (30%), Positives = 18/36 (50%), Gaps = 3/36 (8%)

Query: 49 TPKNILMIGPTGVGKTEIAR---RLAKLANAPFIKV 81
T +++ G +G GK +AR K N PF+ +
Sbjct: 159 TDLTLMITGESGTGKELVARALHDYGKRRNGPFVAI 194


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_3873  DHBDHDRGNASE  59     1e-12    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 59.3 bits (143), Expect = 1e-12
Identities = 47/241 (19%), Positives = 88/241 (36%), Gaps = 47/241 (19%)

Query: 3 VLIVGGSGGIGQAMVKQVQETYPDATVHATYRHHLPNDRQNNIQWHA----------LDV 52
I G + GIG+A V T H + P + + DV
Sbjct: 11 AFITGAAQGIGEA----VARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADV 66

Query: 53 TNEAEIKQLSEQLTE----LDWLINCVGILHTQDKGPEKSLQSLDIDFFQHNLTLNTLPS 108
+ A I +++ ++ +D L+N G+L G + SL + ++ ++N+
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRP---GL---IHSLSDEEWEATFSVNSTGV 120

Query: 109 VMLAKHFCHALKQSDSARFAVISAKVGSITDNRLGGWYSYRASKAALNMFLKTLAIEWQR 168
++ + S + + + + +Y +SKAA MF K L +E
Sbjct: 121 FNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMA---AYASSKAAAVMFTKCLGLELAE 177

Query: 169 NMKHCVVLSLHPGTTDTPLSQP------------------FQQSVPKGKLFTPEYVANCL 210
C ++S PG+T+T + F+ +P KL P +A+ +
Sbjct: 178 YNIRCNIVS--PGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAV 235

Query: 211 L 211
L
Sbjct: 236 L 236


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
Sbal_3874  THERMOLYSIN  360    e-118    Thermolysin metalloprotease (M4) family signature.
		>THERMOLYSIN#Thermolysin metalloprotease (M4) family signature.

Length = 544

Score = 360 bits (926), Expect = e-118
Identities = 133/490 (27%), Positives = 191/490 (38%), Gaps = 51/490 (10%)

Query: 44 SQFNL--DAGSQLKVEKKLDLGQGKQKQRLQQYFHDVPVYGFSVATSQSSMGFYSDMSGR 101
+ F L A +L + G R +Q G + + S +SG
Sbjct: 64 NTFQLGGQARERLSLIGNKLDELGHTVMRFEQAIAASLCMGAVLVAHVNDGELSS-LSGT 122

Query: 102 VLKNIEKSADFVKPTLTANKALDIAIRGKSEK-AVAGLKAENKQAKLWLYLDDAAKTRLV 160
++ N++K + ++ +A IA + +++ AE + + D RL
Sbjct: 123 LIPNLDKRTLKTEAAISIQQAEMIAKQDVADRVTKERPAAEEGKPTRLVIYPDEETPRLA 182

Query: 161 YVTSFVVYGDEPSRPFTMIDAHSGEVLKRWEGINHA-ASGTGPGGNIKTGQYEYGTDFSY 219
Y + P MIDA G+VL +W ++ A G P T G
Sbjct: 183 YEVNVRFLTPVPGNWIYMIDAADGKVLNKWNQMDEAKPGGAQPVAGTSTVGVGRG----- 237

Query: 220 LDVEVSGDT---CTMNSPNVKTVNLNGATSGATAFSYTCPRNTV-----------KEING 265
V GD T S L T G+ F+Y TV +
Sbjct: 238 ----VLGDQKYINTTYSSYYGYYYLQDNTRGSGIFTYDGRNRTVLPGSLWADGDNQFFAS 293

Query: 266 AYSPLNDAHYFGNVIYNMYSEWYN---TAPLTFQLTMRVHYSSNYENAFWDGSAMTFGDG 322
+ DAHY+ V+Y+ Y + + VHY Y NAFW+GS M +GDG
Sbjct: 294 YDAAAVDAHYYAGVVYDYYKNVHGRLSYDGSNAAIRSTVHYGRGYNNAFWNGSQMVYGDG 353

Query: 323 -ATTFYPLV-SLDVSAHEVSHGFTEQNSGLIYDAQSGGMNEAFSDMAGEAAEFYMHGTND 380
TF P +DV HE++H T+ +GL+Y +SG +NEA SD+ G EFY + D
Sbjct: 354 DGQTFLPFSGGIDVVGHELTHAVTDYTAGLVYQNESGAINEAMSDIFGTLVEFYANRNPD 413

Query: 381 WLVGADIFK---GNGALRYMADPTLDGISIGHIDDYYDGID---VHHSSGVFNKAFYTLA 434
W +G DI+ ALR M+DP G + Y D VH +SG+ NKA Y L+
Sbjct: 414 WEIGEDIYTPGVAGDALRSMSDPAKYGDPDHYSKRYTGTQDNGGVHTNSGIINKAAYLLS 473

Query: 435 N--------LPGWDTRTAFQTFVVANQLYWTADSLFWQGACGVKSAATDLG----LSADD 482
+ G + F A Y T S F Q AA DL +
Sbjct: 474 QGGVHYGVSVTGIGRDKMGKIFYRALVYYLTPTSNFSQLRAACVQAAADLYGSTSQEVNS 533

Query: 483 VVTAFAAVGI 492
V AF AVG+
Sbjct: 534 VKQAFNAVGV 543


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Sbal_3876  HTHFIS  92     3e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 92.2 bits (229), Expect = 3e-24
Identities = 26/129 (20%), Positives = 64/129 (49%)

Query: 3 RLLIVEDDLSLASILGRRLTRHGFECRLTHDASDALLVAREFRPSHILLDMKLAEANGLG 62
+L+ +DD ++ ++L + L+R G++ R+T +A+ ++ D+ + + N
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 63 LIVPLRNLLPKVTMVLLTGYASIATAVEAIRLGADNYLAKPVDTQTLLAALEMEGHSHTL 122
L+ ++ P + +++++ + TA++A GA +YL KP D L+ +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 123 QEDEVDDSP 131
+ +++D
Sbjct: 125 RPSKLEDDS 133



Score = 47.1 bits (112), Expect = 1e-08
Identities = 13/39 (33%), Positives = 22/39 (56%)

Query: 135 KRLEWEHIQQVLNANQGNVSATARQLGMHRRTLQRKLLK 173
+E+ I L A +GN A LG++R TL++K+ +
Sbjct: 434 AEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRE 472


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_3878  NUCEPIMERASE  71     3e-16    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 71.4 bits (175), Expect = 3e-16
Identities = 44/182 (24%), Positives = 70/182 (38%), Gaps = 24/182 (13%)

Query: 3 KIMVTGATGLLGRAVVKQLELTGHEVV-----------------ATGFSRASERVHKLDL 45
K +VTGA G +G V K+L GH+VV ++ + HK+DL
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 46 TAPLAVEAFIAREQPQVIVHCAAERRPDVSEQNPQAALALNLTASQALAMAAKANN-AWL 104
+ A + + S +NP A NLT + + N L
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121

Query: 105 IYISTDYVFDGTQ--PKYAEDAATHPVNFYGESKLKGEEIVLNTSADFAV----LRLPIL 158
+Y S+ V+ + P +D+ HPV+ Y +K E + S + + LR +
Sbjct: 122 LYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFFTV 181

Query: 159 YG 160
YG
Sbjct: 182 YG 183


91  Sbal_3952  Sbal_3970  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Sbal_3952   1      25       -2.887411  flagellar hook-length control protein
Sbal_3953   2      26       -3.392218  hypothetical protein
Sbal_3954   3      30       -3.725751  flagellar protein FliS
Sbal_3955   2      29       -4.048199  flagellar hook-associated 2 domain-containing
Sbal_3956   3      29       -3.763827  flagellin domain-containing protein
Sbal_3957   2      25       -3.220032  flagellin domain-containing protein
Sbal_3958   2      25       -3.544180  flagellin domain-containing protein
Sbal_3959   0      23       -2.959273  flagellin domain-containing protein
Sbal_3960  -1      21       -2.566393  hypothetical protein
Sbal_3961  -1      19       -1.804689  flagellar hook-associated protein 3
Sbal_3962  -1      19       -0.912244  flagellar hook-associated protein FlgK
Sbal_3963   1      18       -1.191148  peptidoglycan hydrolase
Sbal_3964   1      19       -1.068310  flagellar basal body P-ring protein
Sbal_3965   0      19       -1.381833  flagellar basal body L-ring protein
Sbal_3966   0      20       -1.783150  flagellar basal-body rod protein FlgG
Sbal_3967   2      18       -2.324043  flagellar basal-body rod protein FlgF
Sbal_3968   2      20       -3.205702  flagellar hook protein FlgE
Sbal_3969   4      19       -3.685455  flagellar hook capping protein
Sbal_3970   4      18       -4.219096  flagellar basal-body rod protein FlgC
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
Sbal_3952  FLGHOOKFLIK  41     5e-06    Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 41.4 bits (96), Expect = 5e-06
Identities = 29/94 (30%), Positives = 49/94 (52%)

Query: 248 AASATQWGPVSLTPMASLAQQSQEILTPLREHLRFQVDQHIKKAELRLDPPELGKIELNI 307
TQ P P+ S S E L +H+ Q + AELRL P +LG++++++
Sbjct: 216 TPHQTQPLPTVAAPVLSAPLGSHEWQQSLSQHISLFTRQGQQSAELRLHPQDLGEVQISL 275

Query: 308 RLEGDRLQVQMHAVNPAIRDALLNGLDRLRVDLA 341
+++ ++ Q+QM + + +R AL L LR LA
Sbjct: 276 KVDDNQAQIQMVSPHQHVRAALEAALPVLRTQLA 309


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Sbal_3956  FLAGELLIN  114    3e-31    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 114 bits (287), Expect = 3e-31
Identities = 79/274 (28%), Positives = 127/274 (46%), Gaps = 18/274 (6%)

Query: 4 VHTNYASIVAQGAVNKSNNLLTNAMERLSTGLRINSASDDAAGLQIANRMSANVKGMETA 63
++TN S++ Q +NKS + L++A+ERLS+GLRINSA DDAAG IANR ++N+KG+ A
Sbjct: 4 INTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQA 63

Query: 64 SRNISDATSMLQTADGALEELTTIANRQKELATQASNGVNSTADLKALDDEFKQLNAEIT 123
SRN +D S+ QT +GAL E+ R +EL+ QA+NG NS +DLK++ DE +Q EI
Sbjct: 64 SRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEID 123

Query: 124 RIVENTTYAGNNLFKDATDGVLVKGVTFQIGSDAAEKMSVTLGAID------------KT 171
R+ T + G + + Q+G++ E +++ L ID
Sbjct: 124 RVSNQTQFNGVKVLSQDN------QMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGP 177

Query: 172 VAGDLLTSAAANTAIGAVDTFLAKVGTERSTLGANINRLGHTAANLGSVTENTKAAAGRI 231
+ ++ + DT+ R + + TA + A
Sbjct: 178 KEATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLT 237

Query: 232 MDADFAVESANMTRNQLLVQAGTTVLSSANQNTG 265
D + ++ + + A G
Sbjct: 238 TDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKG 271



Score = 73.5 bits (180), Expect = 6e-17
Identities = 53/276 (19%), Positives = 101/276 (36%), Gaps = 10/276 (3%)

Query: 7 NYASIVAQGAVNKSNNLLTNAMERLSTGLRINSASDDAAGLQIANRMSANVKGMETASRN 66
+ A N + L + + + + G + + ++
Sbjct: 232 ANGQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKT 291

Query: 67 ISDATSMLQTADGALEELTTIANRQKELATQASNGVNSTADLKALDDEFKQLNAEITRIV 126
+D + T + T+A+ A + + S+ ++ + + T+
Sbjct: 292 GNDGNGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNE 351

Query: 127 ENTTYAGNNLFKDATDGVLVKGVTFQIGSDAAEKMSVTLGAIDKTVAG----------DL 176
+ + + A +K+++ +
Sbjct: 352 SAKLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAA 411

Query: 177 LTSAAANTAIGAVDTFLAKVGTERSTLGANINRLGHTAANLGSVTENTKAAAGRIMDADF 236
+ + ++D+ L+KV RS+LGA NR NLG+ N +A RI DAD+
Sbjct: 412 AAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADY 471

Query: 237 AVESANMTRNQLLVQAGTTVLSSANQNTGLVMGLLR 272
A E +NM++ Q+L QAGT+VL+ ANQ V+ LLR
Sbjct: 472 ATEVSNMSKAQILQQAGTSVLAQANQVPQNVLSLLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Sbal_3957  FLAGELLIN  107    2e-28    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 107 bits (267), Expect = 2e-28
Identities = 80/269 (29%), Positives = 125/269 (46%), Gaps = 13/269 (4%)

Query: 4 VHTNYASIVAQGAVNKSNNLLTNAMERLSTGLRINSASDDAAGLQIANRMSANVKGMETA 63
++TN S++ Q +NKS + L++A+ERLS+GLRINSA DDAAG IANR ++N+KG+ A
Sbjct: 4 INTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQA 63

Query: 64 SRNISDATSMLQTADGALEELTTIANRQKELATQAANGVNSADDLTALNDEFTQLNTEIT 123
SRN +D S+ QT +GAL E+ R +EL+ QA NG NS DL ++ DE Q EI
Sbjct: 64 SRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEID 123

Query: 124 RILDNTTYAGNNLFAKLEAGVTFQIGAGTGEKLVVTTTAIDDAALAAGDLTT-------- 175
R+ + T + G + ++ + + Q+GA GE + + ID +L
Sbjct: 124 RVSNQTQFNGVKVLSQ-DNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEATV 182

Query: 176 ----GANAAIALVDTFIAAVGTERSTLGANINRLGHTAANLASVTENTKAAAGRIMDADF 231
+ + DT+ R + + TA + A D
Sbjct: 183 GDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAE 242

Query: 232 AVESANMTRNQLLVQAGTTVLSSANQNTG 260
+ ++ + + A G
Sbjct: 243 NNTAVDLFKTTKSTAGTAEAKAIAGAIKG 271



Score = 68.5 bits (167), Expect = 3e-15
Identities = 54/266 (20%), Positives = 88/266 (33%), Gaps = 4/266 (1%)

Query: 6 TNYASIVAQGAVNKSNNLLTNAMERLSTGLRINSASDDAAGLQIANRMSANVKGMETASR 65
N ++ + + + D G+ G S
Sbjct: 242 ENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVST 301

Query: 66 NISDATSMLQTADGALEELTTIANRQKELATQAANGVNSADDLTALNDEFTQLNTEITR- 124
I+ L AD A + + VN + +++
Sbjct: 302 TINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEAN 361

Query: 125 ---ILDNTTYAGNNLFAKLEAGVTFQIGAGTGEKLVVTTTAIDDAALAAGDLTTGANAAI 181
++ + AG + T + A +
Sbjct: 362 NAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPL 421

Query: 182 ALVDTFIAAVGTERSTLGANINRLGHTAANLASVTENTKAAAGRIMDADFAVESANMTRN 241
A +D+ ++ V RS+LGA NR NL + N +A RI DAD+A E +NM++
Sbjct: 422 ASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKA 481

Query: 242 QLLVQAGTTVLSSANQNTGLVMGLLR 267
Q+L QAGT+VL+ ANQ V+ LLR
Sbjct: 482 QILQQAGTSVLAQANQVPQNVLSLLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Sbal_3958  FLAGELLIN  110    1e-29    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 110 bits (275), Expect = 1e-29
Identities = 79/269 (29%), Positives = 124/269 (46%), Gaps = 12/269 (4%)

Query: 4 VHTNYASIVAQGAVNKSNNLLTNAMERLSTGLRINSASDDAAGLQIANRMSANVKGMETA 63
++TN S++ Q +NKS + L++A+ERLS+GLRINSA DDAAG IANR ++N+KG+ A
Sbjct: 4 INTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQA 63

Query: 64 SRNISDATSMLQTADGALEELTTIANRQKELATQAANGVNSADDLTALDAEFQQLSLEVD 123
SRN +D S+ QT +GAL E+ R +EL+ QA NG NS DL ++ E QQ E+D
Sbjct: 64 SRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEID 123

Query: 124 RIAKNTTYAGNNLFTAIDGGVTFQIGAGTSETMKVT-----------SAAPVALANTVKL 172
R++ T + G + + D + Q+GA ET+ + V +
Sbjct: 124 RVSNQTQFNGVKVLSQ-DNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEATV 182

Query: 173 DTGDNARLAITAVDDFIKTVGTSRSTLGANINRLGHTAANLASVTENTKAAAGRIMDADF 232
++ +T D + R + + TA + A D
Sbjct: 183 GDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAE 242

Query: 233 AVESANMTRNQLLVQAGTTVLSSANQNTG 261
+ ++ + + A G
Sbjct: 243 NNTAVDLFKTTKSTAGTAEAKAIAGAIKG 271



Score = 73.2 bits (179), Expect = 6e-17
Identities = 53/210 (25%), Positives = 84/210 (40%)

Query: 59 GMETASRNISDATSMLQTADGALEELTTIANRQKELATQAANGVNSADDLTALDAEFQQL 118
+ T ++ GA K + T NG + DD T ++
Sbjct: 298 KVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSD 357

Query: 119 SLEVDRIAKNTTYAGNNLFTAIDGGVTFQIGAGTSETMKVTSAAPVALANTVKLDTGDNA 178
+ + + N + AG + + T++ L N +
Sbjct: 358 LEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKST 417

Query: 179 RLAITAVDDFIKTVGTSRSTLGANINRLGHTAANLASVTENTKAAAGRIMDADFAVESAN 238
+ ++D + V RS+LGA NR NL + N +A RI DAD+A E +N
Sbjct: 418 ANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSN 477

Query: 239 MTRNQLLVQAGTTVLSSANQNTGLVMGLLR 268
M++ Q+L QAGT+VL+ ANQ V+ LLR
Sbjct: 478 MSKAQILQQAGTSVLAQANQVPQNVLSLLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Sbal_3959  FLAGELLIN  110    9e-30    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 110 bits (276), Expect = 9e-30
Identities = 87/268 (32%), Positives = 132/268 (49%), Gaps = 9/268 (3%)

Query: 4 VHTNYASIVAQGAVNKSNNLLTNAMERLSTGLRINSASDDAAGLQIANRMSANVKGMETA 63
++TN S++ Q +NKS + L++A+ERLS+GLRINSA DDAAG IANR ++N+KG+ A
Sbjct: 4 INTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQA 63

Query: 64 SRNISDATSMLQTADGALEELTTIANRQKELATQASNGVNSAADLTALNDEFTQLNAEIT 123
SRN +D S+ QT +GAL E+ R +EL+ QA+NG NS +DL ++ DE Q EI
Sbjct: 64 SRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEID 123

Query: 124 RIVENTTYAGNKLFDTGVLTSGTGVKFQIGAGTTETMDVKLGAI----PKTVTGTLTGGT 179
R+ T + G K+ +K Q+GA ET+ + L I + G
Sbjct: 124 RVSNQTQFNGVKVLSQ-----DNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPK 178

Query: 180 ANAAIALVDTFLAAVGTERSTLGANINRLGHTAANLASVTENTKAAAGRIMDADFAVESA 239
L +F G + +GAN R+ + + + T ++A +
Sbjct: 179 EATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTT 238

Query: 240 NMTRNQLLVQAGTTVLSSANQNTGLVMG 267
+ N V T S+A +
Sbjct: 239 DDAENNTAVDLFKTTKSTAGTAEAKAIA 266



Score = 73.5 bits (180), Expect = 6e-17
Identities = 55/266 (20%), Positives = 89/266 (33%), Gaps = 1/266 (0%)

Query: 6 TNYASIVAQGAVNKSNNLLTNAMERLSTGLRINSASDDAAGLQIANRMSANVKGMETASR 65
N ++ + + + D G+ G S
Sbjct: 242 ENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVST 301

Query: 66 NISDATSMLQTADGALEELTTIANRQKELATQASNGVNSAADLTALNDEFTQLNAEITRI 125
I+ L AD A + ++ VN + +++
Sbjct: 302 TINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEAN 361

Query: 126 VENTTYAGNKLFDTGVLTSGTGVKFQIGAGTTETMDVKLGAIPKTVTGTLTG-GTANAAI 184
+ + + G K + T G + +
Sbjct: 362 NAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPL 421

Query: 185 ALVDTFLAAVGTERSTLGANINRLGHTAANLASVTENTKAAAGRIMDADFAVESANMTRN 244
A +D+ L+ V RS+LGA NR NL + N +A RI DAD+A E +NM++
Sbjct: 422 ASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKA 481

Query: 245 QLLVQAGTTVLSSANQNTGLVMGLLR 270
Q+L QAGT+VL+ ANQ V+ LLR
Sbjct: 482 QILQQAGTSVLAQANQVPQNVLSLLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Sbal_3961  FLAGELLIN  37     1e-04    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 36.6 bits (84), Expect = 1e-04
Identities = 28/140 (20%), Positives = 57/140 (40%)

Query: 1 MRVTMQNLYTNNLNSLQNTTYDVARLNQMLSKGVSILSPSDDPIGVVRVMDNQRDLALVQ 60
+ +L N+L + ++ + LS G+ I S DD G ++ +
Sbjct: 2 QVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLT 61

Query: 61 QYVKNIDSLSTSMSRAETYLSSMVETQQRMKEISIATNSSNLSKEDRASYASEMEELLKG 120
Q +N + + E L+ + QR++E+S+ + S D S E+++ L+
Sbjct: 62 QASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEE 121

Query: 121 LVDNINATDESGNYLFSGNA 140
+ N T +G + S +
Sbjct: 122 IDRVSNQTQFNGVKVLSQDN 141


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
Sbal_3962  FLGHOOKAP1  145    2e-40    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 145 bits (367), Expect = 2e-40
Identities = 88/319 (27%), Positives = 152/319 (47%), Gaps = 8/319 (2%)

Query: 2 SMLNIGKSGLLASMAALNATSNNVANAMVAGYSRQQVMLSSVGGGAYGS---GAGVFVDG 58
S++N SGL A+ AALN SNN+++ VAGY+RQ +++ G GV+V G
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSG 61

Query: 59 VRRISDQYEVAQLWQTTSAVGFSKVQSSYLRQAEQVFGAEGNNVSKGLDQLFAALNSSME 118
V+R D + QL + + + + + + ++++ + F +L + +
Sbjct: 62 VQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLVS 121

Query: 119 QPNLIAYRQGVLNEAKAVAQRVNAINDNIDSQRNQINGQLGSSVKEINSQLNIIASFNRD 178
A RQ ++ +++ + + + + Q Q+N +G+SV +IN+ IAS N
Sbjct: 122 NAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLNDQ 181

Query: 179 IQAASVTGTIPPA--LQDSRDAAIDDLAAILDIRVVEDSQGMVNISLARGEPLLTGNTAA 236
I + G L D RD + +L I+ + V G NI++A G L+ G+TA
Sbjct: 182 ISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTAR 241

Query: 237 KLA---SAPDPANPKNDLVTIQFGSSQFNVDETAGGSLGALLTYRDVQLADSQEYIDELA 293
+LA S+ DP+ V G+ + GSLG +LT+R L ++ + +LA
Sbjct: 242 QLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQLA 301

Query: 294 VMLATEFNSILASGTDLNG 312
+ A FN+ +G D NG
Sbjct: 302 LAFAEAFNTQHKAGFDANG 320



Score = 69.2 bits (169), Expect = 1e-14
Identities = 34/109 (31%), Positives = 60/109 (55%), Gaps = 3/109 (2%)

Query: 347 DGTQGDNTNLKALVQLANKELSFTSLGSNTSLAESFSSKVGQLGSASRQAISFAKTSVDL 406
D DN N +AL+ L + + +G S ++++S V +G+ + + + T ++
Sbjct: 439 DAGDSDNRNGQALLDLQSNSKT---VGGAKSFNDAYASLVSDIGNKTATLKTSSATQGNV 495

Query: 407 QKDAQSQWASTSGVNPDEEGINLIIYQQSYMANAKVISTADQLFQTMLS 455
+Q S SGVN DEE NL +QQ Y+ANA+V+ TA+ +F +++
Sbjct: 496 VTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALIN 544


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_3963  FLGFLGJ  46     9e-09    Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 45.9 bits (108), Expect = 9e-09
Identities = 30/91 (32%), Positives = 54/91 (59%), Gaps = 10/91 (10%)

Query: 10 LNKLNADDLIKANGEQGA-LKLVSQQFEAQFLQTVLKQMRSATDAMADEDNPLTTKSNNG 68
LN+L A KA + A ++ V++Q E F+Q +LK MR DA+ + L + +
Sbjct: 18 LNELKA----KAGEDPAANIRPVARQVEGMFVQMMLKSMR---DALPKDG--LFSSEHTR 68

Query: 69 IYQDLHDAELASRLSQVNGMGLAEVMTKQLS 99
+Y ++D ++A +++ G+GLAE+M KQ++
Sbjct: 69 LYTSMYDQQIAQQMTAGKGLGLAEMMVKQMT 99


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_3964  FLGPRINGFLGI  331    e-114    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 331 bits (849), Expect = e-114
Identities = 140/370 (37%), Positives = 211/370 (57%), Gaps = 14/370 (3%)

Query: 5 LLLLLLATGTLQAEEQSRYLMDVVDVQGLRDNQLVGYGLVVGLNGTGDR-SQVKFTSQSV 63
+ L T A+ + + D+ +Q RDNQL+GYGLVVGL GTGD FT QS+
Sbjct: 12 VFSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQSM 71

Query: 64 VNMLKQFGVQIDDKTDPKLKNVAAVAVHATITSLASPGQSLDVTVSSLGDAKSLQGGTLL 123
ML+ G+ KN+AAV V A + ASPG +DVTVSSLGDA SL+GG L+
Sbjct: 72 RAMLQNLGITTQG-GQSNAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNLI 130

Query: 124 MTPLRAVDGEVYAVAQGNLVVGGISAAGRNGSSVTVNVPTVGTIPNGALLEASIKSNFSD 183
MT L DG++YAVAQG L+V G SA G + +++T V T +PNGA++E + S F D
Sbjct: 131 MTSLSGADGQIYAVAQGALIVNGFSAQG-DAATLTQGVTTSARVPNGAIIERELPSKFKD 189

Query: 184 NEDIILNLKDPNFKTARNIERAVNEL----FGPDIARAQDHAKVLVHAPKSNRERVTFMS 239
+ +++L L++P+F TA + VN +G IA +D ++ V P + M+
Sbjct: 190 SVNLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKP-RVADLTRLMA 248

Query: 240 MLEELKIDQGRRSPRIVFNSRTGTVVLGGDVVVRKAAVSHGNLTVTIVERENVSQPNGAY 299
+E L ++ ++V N RTGT+V+G DV + + AVS+G LTV + E V QP
Sbjct: 249 EIENLTVET-DTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQPAPF- 306

Query: 300 LGNAAGETVVTNDSQVLVEQGNKRMFVWPEGTSIEEIVRAVNSLGATPMDLMAILEALSE 359
+ G+T V + ++ Q ++ + EG + +V +NS+G ++AIL+ +
Sbjct: 307 ---SRGQTAVQPQTDIMAMQEGSKVAI-VEGPDLRTLVAGLNSIGLKADGIIAILQGIKS 362

Query: 360 AGSLEADLVV 369
AG+L+A+LV+
Sbjct: 363 AGALQAELVL 372


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_3965  FLGLRINGFLGH  149    4e-47    Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 149 bits (377), Expect = 4e-47
Identities = 70/224 (31%), Positives = 109/224 (48%), Gaps = 15/224 (6%)

Query: 7 LCFALLSGCMSHIPDKETKPGTKEWAPPEIDYSLPDAKDGSLYRPGFMLT-----LFKDK 61
L L+GC + IP + P + + +GS+++ + LF+D+
Sbjct: 14 LLVLSLTGC-AWIP---STPLVQGATSAQPVPGPTPVANGSIFQSAQPINYGYQPLFEDR 69

Query: 62 RAFREGDILTVALDEKTYSSKKADTKTNKEQDMGMGLTGNIGSQS-----ANADGKTSFS 116
R GD LT+ L E +SK + +++ G A AD + S
Sbjct: 70 RPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNARADVEASGG 129

Query: 117 RGFNGAGSSTQQNQLSGSITVTVSKVLPNGTLLIRGEKWLRLNQGDEYLRLLGIIRTDDI 176
FNG G + N SG++TVTV +VL NG L + GEK + +NQG E++R G++ I
Sbjct: 130 NTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRFSGVVNPRTI 189

Query: 177 GNNNTISSQRIADARIIYGGQGAIADSNAMGWASRYFNSPWFPL 220
+NT+ S ++ADARI Y G G I ++ MGW R+F + P+
Sbjct: 190 SGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLN-LSPM 232


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
Sbal_3966  FLGHOOKAP1  42     2e-06    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 41.9 bits (98), Expect = 2e-06
Identities = 11/47 (23%), Positives = 20/47 (42%)

Query: 213 ALRQGALEGANVNVVEEMVEMISTQRAYEMNAKVVSASDDMLKFLNQ 259
L + VN+ EE + Q+ Y NA+V+ ++ + L
Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALIN 544



Score = 37.6 bits (87), Expect = 4e-05
Identities = 9/37 (24%), Positives = 18/37 (48%)

Query: 3 SALWVSKTGLTAQDTKMTAIANNLANVNTTGFKRDRV 39
S + + +GL A + +NN+++ N G+ R
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTT 38


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
Sbal_3968  FLGHOOKAP1  35     7e-04    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 34.5 bits (79), Expect = 7e-04
Identities = 20/57 (35%), Positives = 29/57 (50%), Gaps = 5/57 (8%)

Query: 2 SFNIALSGLQATTQDLNTISNNIANASTSGFRGGR----SEFASIYNGGQAG-GVGV 53
N A+SGL A LNT SNNI++ + +G+ +++ GG G GV V
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYV 59



Score = 33.0 bits (75), Expect = 0.002
Identities = 14/41 (34%), Positives = 21/41 (51%)

Query: 353 LEGSNVDTTAEMVNLMSAQRNYQSNAKVLDVNSTMQQALLN 393
S V+ E NL Q+ Y +NA+VL + + AL+N
Sbjct: 504 QSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALIN 544


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
Sbal_3970  FLGHOOKAP1  30     0.002    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 30.3 bits (68), Expect = 0.002
Identities = 7/39 (17%), Positives = 19/39 (48%)

Query: 97 SNVNTIEEMADMMAASRSFETSVEIMNRARSMQQGLLQL 135
S VN EE ++ + + + +++ A ++ L+ +
Sbjct: 507 SGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


92  Sbal_3977  Sbal_3987  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Sbal_3977  0       23       -2.476041  flagellar assembly protein H
Sbal_3978  0       23       -2.984449  flagellar motor switch protein G
Sbal_3979  2       25       -3.404466  flagellar MS-ring protein
Sbal_3980  3       24       -4.455823  flagellar hook-basal body complex subunit FliE
Sbal_3981  2       23       -3.764966  sigma-54 dependent transcriptional regulator
Sbal_3982  2       21       -3.993339  hypothetical protein
Sbal_3983  1       18       -2.155177  surface presentation of antigens (SPOA) protein
Sbal_3984  0       15       -1.668603  flagellar biosynthesis protein FliP
Sbal_3985  0       15       -1.015571  export protein FliQ
Sbal_3986  -1      14       -0.373633  flagellar biosynthetic protein FliR
Sbal_3987  -2      13       -0.022467  flagellar biosynthetic protein FlhB
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_3977  FLGFLIH  58     2e-12    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 57.9 bits (139), Expect = 2e-12
Identities = 38/179 (21%), Positives = 80/179 (44%), Gaps = 2/179 (1%)

Query: 41 QQAFDEGYDEGVIQGKSAGYEAGLEEGRIAGHAAGFHQGKLDGQSAGRSSIDEQLNSLLV 100
+ + ++ + +Q GY+AG+ EGR GH G+ +G G G + Q +
Sbjct: 37 EPSLEQQLAQLQMQAHEQGYQAGIAEGRQQGHKQGYQEGLAQGLEQGLAEAKSQQAPIHA 96

Query: 101 PLGALRELLEDGHAKQVREQQNLILDLVRRVSQQVIRCELTLQPQQILKLVEETLSALPD 160
+ L + + ++ + ++QVI T+ ++K +++ L P
Sbjct: 97 RMQQLVSEFQTTLDALDSVIASRLMQMALEAARQVIGQTPTVDNSALIKQIQQLLQQEPL 156

Query: 161 DQVDVKIHLEPSAVVKLKEL--SEDKIRGWNLIADSSISAGSCRIVSDKSDADASVETR 217
++ + P + ++ ++ + + GW L D ++ G C++ +D+ D DASV TR
Sbjct: 157 FSGKPQLRVHPDDLQRVDDMLGATLSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATR 215


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_3978  FLGMOTORFLIG  173    8e-54    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 173 bits (441), Expect = 8e-54
Identities = 81/324 (25%), Positives = 170/324 (52%), Gaps = 1/324 (0%)

Query: 6 QAAMLLLSMGEEGAAMVMAHLDRNDVQHLSHKMARLSSITQQEAEAVLSRFFQRYKEQSG 65
+AA+LL+S+G E ++ V +L + +++ L+ ++A+L +IT + + VL F + Q
Sbjct: 20 KAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFKELMMAQEF 79

Query: 66 IARASRSYLQKTLDIALGDRVSKSLIDSIYGDEIKVLVKRLEWVDPQLLAREITHEHCQL 125
I + Y ++ L+ +LG + + +I+++ + + DP + I EH Q
Sbjct: 80 IQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFVRRADPANILNFIQQEHPQT 139

Query: 126 QAVLLGLLPPESAAKILKMLPSDSQDEVLVRIAQLGELDRNVVEELRELVERCMLMAMEK 185
A++L L P+ A+ IL LP++ Q V RIA + VV E+ ++E+ + +
Sbjct: 140 IALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVLEKKLASLSSE 199

Query: 186 SHTQVAGVKQVADILNRFE-GDREQLMEMIKLHDKQMAIDVTDNMFDFIILGRQKQETLQ 244
+T GV V +I+N + + ++E ++ D ++A ++ MF F + ++Q
Sbjct: 200 DYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFEDIVLLDDRSIQ 259

Query: 245 TLLGQVPSETLSLALKGIDFELKDSLLNALPKRMSSAIETQIEALGGVPVSRASGARKEI 304
+L ++ + L+ ALK +D +++ + + KR +S ++ +E LG ++++I
Sbjct: 260 RVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTRRKDVEESQQKI 319

Query: 305 MELAKQLMQEGEIELQLFEEQVVV 328
+ L ++L ++GEI + E+ V+
Sbjct: 320 VSLIRKLEEQGEIVISRGGEEDVL 343


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_3979  FLGMRINGFLIF  315    e-103    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 315 bits (808), Expect = e-103
Identities = 158/575 (27%), Positives = 279/575 (48%), Gaps = 60/575 (10%)

Query: 17 SSSGFIAGMTQKWHRFNR-GDRQVIALAL-LAVVVASVIVLMLWTATAGYRPLYGSQENV 74
S++ A + NR I L + + VA V+ ++LW T YR L+ + +
Sbjct: 1 SATASTATQPKPLEWLNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQ 60

Query: 75 DTSQVLNVLDAEGIDYRLDANSGAVLVAEEQVGNARMILAAKGVKAKVPSGMEALDNTAL 134
D ++ L I YR SGA+ V ++V R+ LA +G+ G E LD
Sbjct: 61 DGGAIVAQLTQMNIPYRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKF 120

Query: 135 GTSQFMEQAKYRNSLEGELARTIMSLKLVRAARVHLAIPKQTLFIRQEPELPTASVMLQL 194
G SQF EQ Y+ +LEGELARTI +L V++ARVHLA+PK +LF+R++ + P+ASV + L
Sbjct: 121 GISQFSEQVNYQRALEGELARTIETLGPVKSARVHLAMPKPSLFVREQ-KSPSASVTVTL 179

Query: 195 DPNTRLSESQVEAIVNLVAGSVTGLTASNIKVVDQDGRYLSENISGNQDLSQSRNKQLQY 254
+P L E Q+ A+V+LV+ +V GL N+ +VDQ G L+++ + +DL+ + QL++
Sbjct: 180 EPGRALDEGQISAVVHLVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDLN---DAQLKF 236

Query: 255 TRELENSLVANASSMLEPVLGQENFQVRVTAKVNFNQVEETKESLDPQ------NVVTQE 308
++E+ + ++L P++G N +VTA+++F E+T+E P + +++
Sbjct: 237 ANDVESRIQRRIEAILSPIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQ 296

Query: 309 RTSVDDSSNSIAAGIPGALSNKPPQAGKAAADDKT-----------------------RN 345
+ G+PGALSN+P +A R+
Sbjct: 297 LNISEQVGAGYPGGVPGALSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRS 356

Query: 346 LKQEESRQYDVGRSVRHVRYQQMQLENLSVSVLINSATSQGA----FNDEAQLAKFGNMV 401
++ E+ Y+V R++RH + +E LSV+V++N T + Q+ + ++
Sbjct: 357 TQRNETSNYEVDRTIRHTKMNVGDIERLSVAVVVNYKTLADGKPLPLTAD-QMKQIEDLT 415

Query: 402 KDAIGFSAARGDSFTINAFEFTPTVTAEFTPSPWWQSENY----QAYLRYIIGGILGFGL 457
++A+GFS RGD+ + F V P+WQ +++ A R+++ ++ + L
Sbjct: 416 REAMGFSDKRGDTLNVVNSPF-SAVDNTGGELPFWQQQSFIDQLLAAGRWLLVLVVAWIL 474

Query: 458 ILFVLRPLVKHLTRTAQMTAPRIEPVALSAAPAGALDGPVADAASNQPHQLPSAEWLGSQ 517
+RP LTR + E + A++ ++ Q Q + + LG++
Sbjct: 475 WRKAVRPQ---LTRRVEEAKAAQEQAQVRQETEEAVEVRLSKDEQLQ--QRRANQRLGAE 529

Query: 518 GLPEPGSPLTVKMEHLALLANKEPARVAEVIAHWI 552
V + + +++ +P VA VI W+
Sbjct: 530 ----------VMSQRIREMSDNDPRVVALVIRQWM 554


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
Sbal_3980  FLGHOOKFLIE  49     9e-11    Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature.

Length = 103

Score = 48.5 bits (115), Expect = 9e-11
Identities = 20/72 (27%), Positives = 34/72 (47%), Gaps = 1/72 (1%)

Query: 42 SFTELIKSKVSAVNQDQNQSSMAMAAVDSGKSD-DLVGAMVASQKASLSFATMLQIRNRL 100
SF + + + ++ Q + G+ L M QKAS+S +Q+RN+L
Sbjct: 32 SFAGQLHAALDRISDTQTAARTQAEKFTLGEPGVALNDVMTDMQKASVSMQMGIQVRNKL 91

Query: 101 VQAFDDVMKMPI 112
V A+ +VM M +
Sbjct: 92 VAAYQEVMSMQV 103


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Sbal_3981  HTHFIS  379    e-130    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 379 bits (976), Expect = e-130
Identities = 138/417 (33%), Positives = 209/417 (50%), Gaps = 45/417 (10%)

Query: 51 VKSYLARFPCRNIVALLAPEQGELAAAAMRAGVQDYLLIPVETEQLLASIQR----LRRL 106
+ P ++ + A A A G DYL P + +L+ I R +R
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRR 125

Query: 107 ELPDSS-------LVVSAAVSRQLLMLAHRAATTEASVLLLGESGTGKEPLARYIHRHSS 159
LV +A +++ + R T+ ++++ GESGTGKE +AR +H +
Sbjct: 126 PSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGK 185

Query: 160 RSHKPFVAINCAAIPESILESVLFGHVKGAFTGAICDKAGKFEQANGGTLLLDEIGEMPL 219
R + PFVAIN AAIP ++ES LFGH KGAFTGA G+FEQA GGTL LDEIG+MP+
Sbjct: 186 RRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPM 245

Query: 220 PLQAKLLRVLQEREVERLGGQHAIPLDIRIIASTNRDLRQAVEFGHFREDLFYRLDVLPL 279
Q +LLRVLQ+ E +GG+ I D+RI+A+TN+DL+Q++ G FREDL+YRL+V+PL
Sbjct: 246 DAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPL 305

Query: 280 KIAPLRERKADILPLAEHFLGLYGQSDNASRCYFSEHARQVLVTYDWPGNVRELENCIQR 339
++ PLR+R DI L HF+ + + F + A +++ + WPGNVRELEN ++R
Sbjct: 306 RLPPLRDRAEDIPDLVRHFVQQAEK-EGLDVKRFDQEALELMKAHPWPGNVRELENLVRR 364

Query: 340 ALVMRRGQAIQVAELGLNIQEETLEL------------------------------EPLG 369
+ I + ++ E + + L
Sbjct: 365 LTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALP 424

Query: 370 ATVGLKASKQQAEFQYIIDVLKRFNGQRTLSAQALGMTTRALRYRLVQMREAGIDIE 426
+ + E+ I+ L G + +A LG+ LR + +RE G+ +
Sbjct: 425 PSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKK---IRELGVSVY 478


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_3983  FLGMOTORFLIN  77     1e-21    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 77.2 bits (190), Expect = 1e-21
Identities = 34/66 (51%), Positives = 50/66 (75%)

Query: 42 LPVQVTLELASAEMSLGELNRMGEGDVIALDRMVGEPLDIRVNGALLGRGEVVEVAGRYG 101
+PV++T+EL M++ EL R+ +G V+ALD + GEPLDI +NG L+ +GEVV VA +YG
Sbjct: 60 IPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYG 119

Query: 102 VRLLEI 107
VR+ +I
Sbjct: 120 VRITDI 125


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_3984  FLGBIOSNFLIP  226    1e-76    Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature.

Length = 245

Score = 226 bits (577), Expect = 1e-76
Identities = 114/239 (47%), Positives = 161/239 (67%), Gaps = 1/239 (0%)

Query: 5 VLLLSILLFAPQALASEGLTLFTLDSAQDSQAVNIKLEILALMTAISFLPIMLMMLTSFT 64
+ +L ++ + + Q+ ++ ++ L +T+++F+P +L+M+TSFT
Sbjct: 6 SVAPVLLWLITPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLMMTSFT 65

Query: 65 RIIIVLAILRQALGLQQSPPNRVLVGIALILTIFIMRPVGDKIYKEAYLPYDQGKIELME 124
RIIIV +LR ALG +PPN+VL+G+AL LT FIM PV DKIY +AY P+ + KI + E
Sbjct: 66 RIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEKISMQE 125

Query: 125 AVSIAKVPLTRFMLAQTRATDLEQMLKIANEPTHMKTAEEVPFFVLMPAFVLSELKTAFQ 184
A+ PL FML QTR DL ++AN ++ E VP +L+PA+V SELKTAFQ
Sbjct: 126 ALEKGAQPLREFMLRQTREADLGLFARLANTGP-LQGPEAVPMRILLPAYVTSELKTAFQ 184

Query: 185 IGFLLFLPFLVIDLVVASVLMSMGMMMLSPLIISLPFKLMVFVLVDGWSMTVSTLTASF 243
IGF +F+PFL+IDLV+ASVLM++GMMM+ P I+LPFKLM+FVLVDGW + V +L SF
Sbjct: 185 IGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLAQSF 243


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_3985  TYPE3IMQPROT  47     1e-10    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 46.7 bits (111), Expect = 1e-10
Identities = 20/80 (25%), Positives = 39/80 (48%)

Query: 3 INELTSLFADSMFLVIIMVSVLVTPGLILGLIVAVFQAATQVNEQTLSFLPRLIITLLMV 62
+++L +++LV+I+ I+GL+V +FQ TQ+ EQTL F +L+ L +
Sbjct: 1 MDDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCL 60

Query: 63 LFSGHWLIQQISDLFDRLFM 82
W + + ++
Sbjct: 61 FLLSGWYGEVLLSYGRQVIF 80


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_3986  TYPE3IMRPROT  124    8e-37    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.

Length = 261

Score = 124 bits (314), Expect = 8e-37
Identities = 80/256 (31%), Positives = 130/256 (50%), Gaps = 2/256 (0%)

Query: 1 MLSLTSTELSMLIGSLWWPFCRIMGAFMIMPLLGNSYVPATVRIFLALSIAALIAPMLPP 60
ML +TS + + +WP R++ P+L VP V++ LA+ I IAP LP
Sbjct: 1 MLQVTSEQWLSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA 60

Query: 61 VPPVDALSLGSLFLAVEQLLIGFMLALFLTILIHVMTMLGTIMSMQMGLAMAVMNDPANG 120
S +L+LAV+Q+LIG L + + G I+ +QMGL+ A DPA+
Sbjct: 61 NDVPV-FSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASH 119

Query: 121 DTNPILSEWLQIFGTLIFLALDGHLVGLNIIVDSFRLWPIG-NGIFDLPLMGLISRMGWL 179
P+L+ + + L+FL +GHL ++++VD+F PIG + + L +
Sbjct: 120 LNMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLI 179

Query: 180 FAASLMLAIPAVLAMLMVNITFGVLSRAAPSLNIFSLGFPMTLLMGLICVFLSLSGIPTR 239
F LMLA+P + +L +N+ G+L+R AP L+IF +GFP+TL +G+ + + I
Sbjct: 180 FLNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPF 239

Query: 240 YSDLCLDALTAMYQFI 255
L + + I
Sbjct: 240 CEHLFSEIFNLLADII 255


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_3987  TYPE3IMSPROT  315    e-108    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 315 bits (809), Expect = e-108
Identities = 113/350 (32%), Positives = 182/350 (52%), Gaps = 6/350 (1%)

Query: 8 DKTEEATPQKKRKAREEGQVPRSKDLASAALVVGCSAMLTTNADWFATRVSGLTKYNMLL 67
+KTE+ TP+K R AR++GQV +SK++ S AL+V SAML +D++ S L ML+
Sbjct: 4 EKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKL----MLI 59

Query: 68 TRADLEQPD--IMMRHLGASLVEMLSILGPLFIMVALLAAVAGALPGGPIFNFGNANFKY 125
P + + L+E + PL + AL+A + + G + +
Sbjct: 60 PAEQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDI 119

Query: 126 SRIDPLAGVGRIFSAQSLVELLKSCLKIVLLIGIMLVFLNGHLQELLSYNQRPIDEAVRD 185
+I+P+ G RIFS +SLVE LKS LK+VLL ++ + + G+L LL I+
Sbjct: 120 KKINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPL 179

Query: 186 GINLLSQDMFYLGTGLLVIAFIDVPYQYWHNSKELRMSRQEVKDEHTQQEGKPEIKAKIR 245
+L Q M G +VI+ D ++Y+ KEL+MS+ E+K E+ + EG PEIK+K R
Sbjct: 180 LGQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRR 239

Query: 246 QIQQRMARSRADTTIPKADVLLVNPTHYAVALKYNPDLADAPYVITKGTEELALYMRELA 305
Q Q + + ++ V++ NPTH A+ + Y P V K T+ +R++A
Sbjct: 240 QFHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIA 299

Query: 306 KKHDIEIIDIPPLTRAIYHSTQVDQQIPSALFIAIAHVLSYVMQIKASRK 355
++ + I+ PL RA+Y VD IP+ A A VL ++ + ++
Sbjct: 300 EEEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQNIEKQ 349


93  Sbal_4002  Sbal_4008  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Sbal_4002  1       13       -0.616519  hypothetical protein
Sbal_4003  2       14       -0.893173  HemY domain-containing protein
Sbal_4004  2       14       -1.229772  outer membrane adhesin like protein
Sbal_4005  -1      13       -2.553505  ABC transporter-like protein
Sbal_4006  0       12       -1.697871  HlyD family type I secretion membrane fusion
Sbal_4007  1       12       -1.657019  TolC family type I secretion outer membrane
Sbal_4008  0       14       -1.834432  OmpA/MotB domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Sbal_4002  RTXTOXIND  29     0.025    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 29.4 bits (66), Expect = 0.025
Identities = 14/77 (18%), Positives = 31/77 (40%), Gaps = 6/77 (7%)

Query: 81 FMLYQQMQQQLLAQDAKNIALQDQLQQALLQPNQRIGQLEQQQLNDAKT-----YQELTK 135
F +Q + Q K A + A + + + ++E+ +L+D +
Sbjct: 195 FSTWQNQKYQKELNLDKKRA-ERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHA 253

Query: 136 LAEDQNQLQDRVNKLAQ 152
+ E +N+ + VN+L
Sbjct: 254 VLEQENKYVEAVNELRV 270



Score = 28.6 bits (64), Expect = 0.045
Identities = 9/72 (12%), Positives = 27/72 (37%), Gaps = 5/72 (6%)

Query: 81 FMLYQQMQQQLLAQDAKNIALQDQLQQALLQPNQRIGQLEQQQLNDAKTYQELTKLAEDQ 140
+ + + + + + +Q++ +L + + Q N+ L KL +
Sbjct: 254 VLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEI-----LDKLRQTT 308

Query: 141 NQLQDRVNKLAQ 152
+ + +LA+
Sbjct: 309 DNIGLLTLELAK 320


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
Sbal_4004  CABNDNGRPT  88     9e-20    NodO calcium binding signature.
		>CABNDNGRPT#NodO calcium binding signature.

Length = 479

Score = 88.1 bits (218), Expect = 9e-20
Identities = 40/172 (23%), Positives = 64/172 (37%), Gaps = 6/172 (3%)

Query: 1987 GSDTINGGNGDDILFGDAIN--FNGISGQGYVAIKDYVADQLGIAAVTDAQVHRYITEHA 2044
+ T G+ + + + + A + ++ I +
Sbjct: 260 ANMTTRTGDSVYGFNSNTDRDFYTATDSSKALIFSVWDAGGTDTFDFSGYSNNQRINLNE 319

Query: 2045 SDFDQSGASDKADVLIGGQGNDILYGQGGNDQLYGGNGNDLIFGGAGNDTIIGGLGNDKL 2104
F G + G + G GND L G + ++++ GGAGND + GG G D L
Sbjct: 320 GSFSDVGGLKGNVSIAHGVTIENAIGGSGNDILVGNSADNILQGGAGNDVLYGGAGADTL 379

Query: 2105 TGGTGADTFVWQAG----ESGTDHITDFNIHEDKLDLRDLLQGENTNTLDSY 2152
GG G DTFV+ +G + D I DF DK+DL + +
Sbjct: 380 YGGAGRDTFVYGSGQDSTVAAYDWIADFQKGIDKIDLSAFRNEGQLSFVQDQ 431



Score = 53.4 bits (128), Expect = 7e-09
Identities = 34/118 (28%), Positives = 53/118 (44%), Gaps = 9/118 (7%)

Query: 1417 TGTRTIDNTSGQDQLLGTGGNDHLVSANGGGDLLYGMDGDDILVGSDAVQGDSLYGGTGN 1476
+ + I+ G +G + ++ + G G+DILVG+ A + L GG GN
Sbjct: 310 SNNQRINLNEGSFSDVGGLKGNVSIAHGVTIENAIGGSGNDILVGNSA--DNILQGGAGN 367

Query: 1477 DVLVAGLGNDGLYGGAGTDIAVLLGNRADYIIEKGTGYSSNDRWFNFFVTENGIGVTK 1534
DVL G G D LYGGAG D V Y + + ++ D +F + I ++
Sbjct: 368 DVLYGGAGADTLYGGAGRDTFV-------YGSGQDSTVAAYDWIADFQKGIDKIDLSA 418



Score = 49.6 bits (118), Expect = 1e-07
Identities = 31/120 (25%), Positives = 47/120 (39%), Gaps = 3/120 (2%)

Query: 1375 ADKPVVNVILTDNGIPLYSNFKTSGITTEQFRTGDFTNAPFNTGTRTIDNTSGQDQLLGT 1434
+ K ++ + G + S G F++ G +I + + +G
Sbjct: 287 SSKALIFSVWDAGGTDTFDFSGYSNNQRINLNEGSFSDVGGLKGNVSIAHGVTIENAIGG 346

Query: 1435 GGNDHLVSANGGGDLLYGMDGDDILVGSDAVQGDSLYGGTGNDVLVAGLGNDGLYGGAGT 1494
GND LV N ++L G G+D+L G D+LYGG G D V G G D
Sbjct: 347 SGNDILV-GNSADNILQGGAGNDVLYGGAG--ADTLYGGAGRDTFVYGSGQDSTVAAYDW 403



Score = 42.3 bits (99), Expect = 2e-05
Identities = 31/135 (22%), Positives = 46/135 (34%), Gaps = 25/135 (18%)

Query: 1402 TEQFRTGDFTNAPFNTGTRTIDNTSGQDQLLGTGGNDHLVSANGGGDLLYGMDGDDILVG 1461
T G + AP I G + TG + + ++N D D L+
Sbjct: 234 TGADYNGHYGGAPMIDDIAAIQRLYGANMTTRTGDSVYGFNSNTDRDFYTATDSSKALIF 293

Query: 1462 SDAVQG-----------------------DSLYGGTGNDVLVAGLGNDGLYGGAGTDIAV 1498
S G + G GN + G+ + GG+G DI
Sbjct: 294 SVWDAGGTDTFDFSGYSNNQRINLNEGSFSDVGGLKGNVSIAHGVTIENAIGGSGNDI-- 351

Query: 1499 LLGNRADYIIEKGTG 1513
L+GN AD I++ G G
Sbjct: 352 LVGNSADNILQGGAG 366



Score = 33.8 bits (77), Expect = 0.009
Identities = 16/72 (22%), Positives = 21/72 (29%), Gaps = 1/72 (1%)

Query: 1416 NTGTRTIDNTSGQDQLLGTGGNDHLVSANGGGDLLYGMDGDDILVGSDAVQGDSLYGGTG 1475
N+ + +G D L G G D L G D G D V + D G
Sbjct: 355 NSADNILQGGAGNDVLYGGAGADTL-YGGAGRDTFVYGSGQDSTVAAYDWIADFQKGIDK 413

Query: 1476 NDVLVAGLGNDG 1487
D+
Sbjct: 414 IDLSAFRNEGQL 425


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
Sbal_4006  RTXTOXIND  310    e-103    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 310 bits (796), Expect = e-103
Identities = 86/431 (19%), Positives = 193/431 (44%), Gaps = 11/431 (2%)

Query: 29 RLIIWALAAMVVCFLLWAGFAKLDKVTTGTGKVIPSSQVQVIQSLDGGIMQELYVQEGEM 88
RL+ + + +V + + +++ V T GK+ S + + I+ ++ I++E+ V+EGE
Sbjct: 58 RLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGES 117

Query: 89 VTKGQPLVRIDDTRFRSDYAQQEQEVFGLKTNAIRMRAELDSILISDMTSDWREQVLITK 148
V KG L+++ +D + + + + R + SI E + +
Sbjct: 118 VRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSI----------ELNKLPE 167

Query: 149 KALVFPENIIAAEPALVKRQQEEYNGRLDNLSNQLEILVRQIQQRQQEIDDLASKTTTLT 208
L V R + NQ + +++ E + ++
Sbjct: 168 LKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYE 227

Query: 209 TSMQLISRELELTRPLAKKGIVPEVELLKLERTVNDLQGELNSMRLLRPKVKAAMDEAIL 268
++ L+ L K + + +L+ E + EL + ++++ + A
Sbjct: 228 NLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKE 287

Query: 269 KRREAVFVYAADLRAQLNETQTRLSRMNEAQVGAQDKVSKAIITSPVNGTIKTTHINTLG 328
+ + ++ ++ +L +T + + +++ ++I +PV+ ++ ++T G
Sbjct: 288 EYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEG 347

Query: 329 GVVQPGVDIIEIVPSEDQLLIETKILPKDIAFLHPGLPAVVKITAYDFTRYGGLKGTVEH 388
GVV ++ IVP +D L + + KDI F++ G A++K+ A+ +TRYG L G V++
Sbjct: 348 GVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKN 407

Query: 389 ISADTSQDEEGNSYYLIRVRTAESSLTKNDGTQMPIIPGMLTSVDVITGQRSILEYILNP 448
I+ D +D+ + + + E+ L+ +P+ GM + ++ TG RS++ Y+L+P
Sbjct: 408 INLDAIEDQRLGLVFNVIISIEENCLST-GNKNIPLSSGMAVTAEIKTGMRSVISYLLSP 466

Query: 449 ILRAKDTALRE 459
+ + +LRE
Sbjct: 467 LEESVTESLRE 477


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
Sbal_4008  OMPADOMAIN  86     3e-22    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 86.1 bits (213), Expect = 3e-22
Identities = 33/118 (27%), Positives = 53/118 (44%), Gaps = 12/118 (10%)

Query: 77 NILFPNDSAYIAPEYYPQIEEVAIFLRQY--PTTKVTIEGHTSRTGTDERNAVLSQDRAN 134
++LF + A + PE ++++ L V + G+T R G+D N LS+ RA
Sbjct: 220 DVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQ 279

Query: 135 AVTAVLAERFSIDRSRLTAIGYGSSRPVVLEQTPDAEIR---------NRRVVAEVTG 183
+V L + I +++A G G S PV + + R +RRV EV G
Sbjct: 280 SVVDYLISK-GIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEVKG 336


94  Sbal_4185  Sbal_4192  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Sbal_4185  3       16       1.498283   TetR family transcriptional regulator
Sbal_4186  3       16       1.191553   N-acetyltransferase GCN5
Sbal_4187  2       16       0.850384   3'(2'),5'-bisphosphate nucleotidase
Sbal_4188  2       17       0.630550   ADP-ribose diphosphatase NudE
Sbal_4189  3       16       0.622287   Ig family protein
Sbal_4190  1       19       -1.125358  phage tail collar domain-containing protein
Sbal_4191  1       18       -1.227886  N-acetyltransferase GCN5
Sbal_4192  1       17       -0.763124  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_4185  HTHTETR  41     8e-07    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 41.2 bits (96), Expect = 8e-07
Identities = 16/76 (21%), Positives = 29/76 (38%)

Query: 1 MARRKEHSHDEIRAMAIQAATELLTELGVVGLSLRKVASQIGYVPSTLINIFGSYNYLLL 60
MAR+ + E R + A L ++ GV SL ++A G + F + L
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 61 AVSESTLRALHDRLAG 76
+ E + + +
Sbjct: 61 EIWELSESNIGELELE 76


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_4186  SACTRNSFRASE  29     0.006    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 28.8 bits (64), Expect = 0.006
Identities = 14/41 (34%), Positives = 23/41 (56%), Gaps = 1/41 (2%)

Query: 50 VDNLLQGYLLSAQSSDSTCMWILSIAVSEDARGKGVGKRLM 90
++N G + +S+ + I IAV++D R KGVG L+
Sbjct: 72 LENNCIGRI-KIRSNWNGYALIEDIAVAKDYRKKGVGTALL 111


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
Sbal_4189  OMPADOMAIN  46     8e-07    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 46.5 bits (110), Expect = 8e-07
Identities = 39/180 (21%), Positives = 56/180 (31%), Gaps = 26/180 (14%)

Query: 2309 AVVLAGTVSQANAA---DNWYVEGFVGQAQVDSSRRDLQPQAAAGVVTSVDDKDTAFGLS 2365
AV LAG + A AA + WY +G +Q + + G
Sbjct: 9 AVALAGFATVAQAAPKDNTWYTGAKLGWSQYHDT-------GFINNNGPTHENQLGAGAF 61

Query: 2366 VGYQWTPMVAIEFGYADFGNGSARIEGASLTPAQYHEQVKAVTPVLADGVMLGLRFTLLQ 2425
GYQ P V E GY G R+ ++ A GV L +
Sbjct: 62 GGYQVNPYVGFEMGYDWLG----RMPYKGSVENGAYK---------AQGVQLTAKLGYPI 108

Query: 2426 HDAWRFEVPIGLFRWQADISSTMGNSRLTTELDGTDWYAGVRFSYQVSDAWSVGLGYQYV 2485
D +G W+AD S + T G Y ++ + L YQ+
Sbjct: 109 TDDLDIYTRLGGMVWRADTKSNVYGKNHDT---GVSPVFAGGVEYAITPEIATRLEYQWT 165


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_4191  SACTRNSFRASE  41     4e-07    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 41.1 bits (96), Expect = 4e-07
Identities = 20/94 (21%), Positives = 42/94 (44%), Gaps = 6/94 (6%)

Query: 71 YILFYHQQAVGKVMLDISEYRIHLVDFIII-PSMRGRGFGSAILAAIKQEAMKRHLP-VG 128
++ + +G++ + + L++ I + R +G G+A+L + A + H +
Sbjct: 68 FLYYLENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLM 127

Query: 129 LSVESENTQAKKLYLQHGFKPESYSGAYESMLWR 162
L + N A Y +H F GA ++ML+
Sbjct: 128 LETQDINISACHFYAKHHFI----IGAVDTMLYS 157


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_4192  VACCYTOTOXIN  32     0.007    Helicobacter pylori vacuolating cytotoxin signature.
		>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature.

Length = 1291

Score = 31.5 bits (71), Expect = 0.007
Identities = 21/69 (30%), Positives = 29/69 (42%), Gaps = 2/69 (2%)

Query: 239 IIDSHQRVAGRHMAGGQSLLPTELQTQQMENTDTEPSLDEFRTTVLQYLAQLSLALHQSS 298
+IDSH R M S Q T + E +T+ LQ L LS A+ +S
Sbjct: 916 LIDSHDAGYARTMIDATSANEITKQLNTATTTLNNIASLEHKTSGLQTL-SLSNAMILNS 974

Query: 299 RVI-LLDYH 306
R++ L H
Sbjct: 975 RLVNLSRRH 983


95  Sbal_4200  Sbal_4207  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias   Product
Sbal_4200  1       20       2.379150  general secretion pathway protein J
Sbal_4201  1       17       1.484449  general secretion pathway protein I
Sbal_4202  0       15       1.281531  general secretion pathway protein H
Sbal_4203  1       14       1.057577  general secretion pathway protein G
Sbal_4204  1       13       1.463774  general secretion pathway protein F
Sbal_4205  2       15       1.596030  general secretory pathway protein E
Sbal_4206  0       17       1.161505  general secretion pathway protein D
Sbal_4207  -2      18       1.775547  general secretion pathway protein C
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_4200  BCTERIALGSPG  31     0.002    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 31.4 bits (71), Expect = 0.002
Identities = 16/41 (39%), Positives = 27/41 (65%), Gaps = 3/41 (7%)

Query: 3 LKLTSVQRGFTLLEMLIAIAIFAMIGLASNAVLSTVLTNDE 43
++ T QRGFTLLE+++ I I IG+ ++ V+ ++ N E
Sbjct: 1 MRATDKQRGFTLLEIMVVIVI---IGVLASLVVPNLMGNKE 38


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_4201  PilS_PF08805  29     0.003    PilS N terminal
		>PilS_PF08805#PilS N terminal

Length = 185

Score = 29.1 bits (65), Expect = 0.003
Identities = 11/32 (34%), Positives = 17/32 (53%)

Query: 5 KGMTLLEVIVALAVFSIAAVSITKSLGEQMAN 36
KG TL+EV++ + V + A S K +N
Sbjct: 26 KGATLMEVLLVVGVIVVLAASAYKLYSMVQSN 57


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_4202  BCTERIALGSPH  84     5e-23    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 84.2 bits (208), Expect = 5e-23
Identities = 44/171 (25%), Positives = 70/171 (40%), Gaps = 39/171 (22%)

Query: 4 LRHAGFTLMEVMLVILLMGLTAAAVTMSIGNSGPQQALDRTARQFIAATEMVLDETVLSG 63
+R GFTL+E+ML++LLMG++A V ++ S A AR F A V + +G
Sbjct: 1 MRQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLAR-FEAQLRFVQQRGLQTG 59

Query: 64 QFIGIVIEKTSYQFVFYKDG---------------KWEPLDKDRLLSEKQMEPGVVMNLV 108
QF G+ + +QF+ + +W PL R+ +
Sbjct: 60 QFFGVSVHPDRWQFLVLEARDGADPAPADDGWSGYRWLPLRAGRVATSGS---------- 109

Query: 109 LDGLPLVQDDEEDDSWFEEPLIEPSADDKKKHPEPQVMLFPSGEMSAFELT 159
+ G L + ++W P V++FP GEM+ F LT
Sbjct: 110 IAGGKLNLAFAQGEAW-------------TPGDNPDVLIFPGGEMTPFRLT 147


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_4203  BCTERIALGSPG  227    2e-80    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 227 bits (581), Expect = 2e-80
Identities = 97/140 (69%), Positives = 118/140 (84%)

Query: 1 MQMNKKHQGFTLLEVMVVIVILGILASMVVPNLMGNKDKADQQKAVSDIVALENALDMYK 60
M+ K +GFTLLE+MVVIVI+G+LAS+VVPNLMGNK+KAD+QKAVSDIVALENALDMYK
Sbjct: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYK 60

Query: 61 LDNGIYPTTEQGLEALVQKPTISPEPRNYREDGYVKRLPEDPWRNKYLLLSPGENGKLDI 120
LDN YPTT QGLE+LV+ PT+ P NY ++GY+KRLP DPW N Y+L++PGE+G D+
Sbjct: 61 LDNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDL 120

Query: 121 FTAGPDGQPGTEDDIGNWNL 140
+AGPDG+ GTEDDI NW L
Sbjct: 121 LSAGPDGEMGTEDDITNWGL 140


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
Sbal_4204BCTERIALGSPF5060.0 Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 506 bits (1304), Expect = 0.0
Identities = 229/407 (56%), Positives = 304/407 (74%), Gaps = 1/407 (0%)

Query: 1 MPAFEYKALDAKGKQLKGVIEADTARHARSQLRDQRMMPLEILPVSEKEAKAKSSSFSF- 59
M + Y+ALDA+GK+ +G EAD+AR AR LR++ ++PL + + K+ S+ S
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 60 FKRGISVAELALITRQIATLVAAGLPIEESLKAVGQQCEKDRLASMIMAVRSRVVEGYSL 119
K +S ++LAL+TRQ+ATLVAA +P+EE+L AV +Q EK L+ ++ AVRS+V+EG+SL
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 120 ADSLAEFPHIFDDLYRAMVASGEKSGHLEVVLNRLADYTERRQQLKSKLTQAMIYPAVLT 179
AD++ FP F+ LY AMVA+GE SGHL+ VLNRLADYTE+RQQ++S++ QAMIYP VLT
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180

Query: 180 TVAIGVISILLAAVVPKVVGQFEHMGAELPASTRFLISASDFVQNYGVFVVIALVMLFAL 239
VAI V+SILL+ VVPKVV QF HM LP STR L+ SD V+ +G ++++AL+ F
Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240

Query: 240 FRRMLKSPAFRMKYDNFLLSMPVVGRVSKGLNTARFARTLSILSASSVPLLDGMRIASEV 299
FR ML+ R+ + LL +P++GR+++GLNTAR+ARTLSIL+AS+VPLL MRI+ +V
Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300

Query: 300 LQNVRVRAAVDDATARVREGTSLGAALTNTKLFPAMMLYMIASGEKSGQLEQMLERAADN 359
+ N R + AT VREG SL AL T LFP MM +MIASGE+SG+L+ MLERAADN
Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360

Query: 360 QDREFEGNVNIALGVFEPMLVVSMACVVLFIVMAILQPILALNNLIS 406
QDREF + +ALG+FEP+LVVSMA VVLFIV+AILQPIL LN L+S
Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLMS 407


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_4206  BCTERIALGSPD  605    0.0      Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 605 bits (1562), Expect = 0.0
Identities = 331/683 (48%), Positives = 450/683 (65%), Gaps = 37/683 (5%)

Query: 6 IRRKLIAGIVAGAAMFSSQFAWSEQYAANFKGTDIQEFINIVGKNLNKTIIVDPTIRGKI 65
IR + ++ A +F A +E+++A+FKGTDIQEFIN V KNLNKT+I+DP++RG I
Sbjct: 7 IRSFSLTLLIFAALLFRP--AAAEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTI 64

Query: 66 NVRSYDLLNDEQYYQFFLNVLQVYGYAIVEMENNVIKVIKDKDAKTAAIRVANDAEPGIG 125
VRSYD+LN+EQYYQFFL+VL VYG+A++ M N V+KV++ KDAKTAA+ VA+DA PGIG
Sbjct: 65 TVRSYDMLNEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGIG 124

Query: 126 DEMVTRIVALYNTEAKQLAPLLRQLNDNAGGGNVVNYDPSNVLMLSGRAAVVNKLVEIVR 185
DE+VTR+V L N A+ LAPLLRQLNDNAG G+VV+Y+PSNVL+++GRAAV+ +L+ IV
Sbjct: 125 DEVVTRVVPLTNVAARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVE 184

Query: 186 RVDKQGDTSVQVVPLEFASAGEMVRIIDTLYRATANQSQMPGQAPKVVADERINAVVVSG 245
RVD GD SV VPL +ASA ++V+++ L + T+ + VVADER NAV+VSG
Sbjct: 185 RVDNAGDRSVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSG 244

Query: 246 DEKSRQRVVELIHRLDAEQASTGNTKVRYLRYAKAEDLVEVLTGFAQKLEGDKDPSAQAA 305
+ SRQR++ +I +LD +QA+ GNTKV YL+YAKA DLVEVLTG + ++ +K A
Sbjct: 245 EPNSRQRIIAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEK--QAAKP 302

Query: 306 GGKRRNEINIMAHAETNALVISAEPDQMRTIESVINQLDIRRAQVLVEAIIVEVAEGDNV 365
I I AH +TNAL+++A PD M +E VI QLDIRR QVLVEAII EV + D +
Sbjct: 303 VAALDKNIIIKAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGL 362

Query: 366 GFGVQWASKAGGGTQFNNLGPTIGEVGAGVWAAQTEKASQTCTGSGDNKTCTDNPDKRGD 425
G+QWA+K G TQF N G I AG + +K G
Sbjct: 363 NLGIQWANKNAGMTQFTNSGLPISTAIAG----------------------ANQYNKDGT 400

Query: 426 VT-LLAQALGKVNGMAWGVAMGDFGALIQAVSSDTNSNVLATPSITTLDNQEASFIVGDE 484
V+ LA AL NG+A G G++ L+ A+SS T +++LATPSI TLDN EA+F VG E
Sbjct: 401 VSSSLASALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQE 460

Query: 485 VPILTGSTASSSNSNPFQTVERKEVGVKLKVVPQINEGNAVKLTIEQEVSGVNG-----N 539
VP+LTGS ++S N F TVERK VG+KLKV PQINEG++V L IEQEVS V +
Sbjct: 461 VPVLTGSQ-TTSGDNIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSVADAASSTS 519

Query: 540 TGVDISFATRRLTTTVMADSGQIVVLGGLINEEVQESIQKVPFLGDIPIIGHLFKSSSSK 599
+ + +F TR + V+ SG+ VV+GGL+++ V ++ KVP LGDIP+IG LF+S+S K
Sbjct: 520 SDLGATFNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKK 579

Query: 600 KTKKNLMIFIKPTIIRDGITMEGIAGRKYNYFRALQLEQ--QERGVNLMPNTKVPVLEEW 657
+K+NLM+FI+PT+IRD + +Y F Q +Q +E ++ + +
Sbjct: 580 VSKRNLMLFIRPTVIRDRDEYRQASSGQYTAFNDAQSKQRGKENNDAMLNQDLLEIYP-- 637

Query: 658 NQSEYLPPEVNAILERYKEGKGL 680
Q +V+A ++ + G L
Sbjct: 638 RQDTAAFRQVSAAIDAFNLGGNL 660


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
Sbal_4207  BCTERIALGSPC  182    3e-58    Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C signature.

Length = 272

Score = 182 bits (463), Expect = 3e-58
Identities = 70/288 (24%), Positives = 138/288 (47%), Gaps = 36/288 (12%)

Query: 17 KPLSRIVFWLGFIVIMLLAAQITWKL-VPTSSSASAWSPTPVSVNGKGAGQVELAGLQQL 75
+ RI+F+L ++ A I W++ +P ++ S+ TP + L
Sbjct: 12 SVIRRILFYLLMLLFCQQLAMIFWRIGLPDNAPVSSVQITPAQARQQPVT------LNDF 65

Query: 76 GLFGKADATSDKPKVEAVETVTDAPKTTLSIQLTGVVASTADQKGLAIIESNGSQDTYSL 135
LFG + + ++A +++ P +TL++ LTGV+A D + +AII + Q + +
Sbjct: 66 TLFGVSPEKNKAGALDA-SQMSNLPPSTLNLSLTGVMAGDDDSRSIAIISKDNEQFSRGV 124

Query: 136 GDKIKGTSASLKEVYADRIIITNAGRYETLMLDGLVYTSQSPANQQLQQAKSNKAGSAVS 195
+++ G +A + + DR+++ GRYE L L +
Sbjct: 125 NEEVPGYNAKIVSIRPDRVVLQYQGRYEVLGLYSQEDSGSDGVPG--------------- 169

Query: 196 RVDQRNNADISQELAESRTELLADPSKITDYIAISPVRQGDSVAGYRLNPGKDANLFKQA 255
A ++++L + + ++DY++ SP+ + + GYRLNPG ++ F +
Sbjct: 170 -------AQVNEQLQQR------ASTTMSDYVSFSPIMNDNKLQGYRLNPGPKSDSFYRV 216

Query: 256 GFKANDLAKSINGYDLTVMSQALEMMSQLSELTEVSIMVEREGQLVEI 303
G + ND+A ++NG DL QA + M +++++ ++ VER+GQ +I
Sbjct: 217 GLQDNDMAVALNGLDLRDAEQAKKAMERMADVHNFTLTVERDGQRQDI 264


96  Sbal_4295  Sbal_4303  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
Sbal_4295   0      24        2.724306  major facilitator superfamily transporter
Sbal_4296  -1      23        2.695729  NLP/P60 protein
Sbal_4297  -1      23        2.802803  hypothetical protein
Sbal_4298  -2      25        3.136579  major facilitator superfamily transporter
Sbal_4299  -1      27        3.164242  LytTR family two component transcriptional
Sbal_4300  -2      23        2.966029  signal transduction histidine kinase LytS
Sbal_4301   0      15       -0.757120  pirin domain-containing protein
Sbal_4302   1      15       -2.087403  integral membrane sensor signal transduction
Sbal_4303   1      15       -3.194022  two component transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_4295  TCRTETB  111    8e-29    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 111 bits (280), Expect = 8e-29
Identities = 86/430 (20%), Positives = 170/430 (39%), Gaps = 29/430 (6%)

Query: 30 FLAAVDQTLLATATPAIVEDLGGLR-QASWITIGYMLAMAASVPIYGWLGDNFGRAKILM 88
F + +++ +L + P I D +W+ +ML + +YG L D G ++L+
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 89 IAIVIFALGSIVSA-SAGTMDHMIAGRILQGMGGGGLMSLSQSLIGELVPIRQRARFQGY 147
I+I GS++ +I R +QG G +L ++ +P R + G
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 148 FAAMFTLASVGGPVIGGIVVHAYSWHWLFWANIPLA-MLAVWRLNGLHKRSVKPVRNGKF 206
++ + GP IGG++ H W +L IP+ ++ V L L K+ V+ G F
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYIHWSYLL--LIPMITIITVPFLMKLLKKEVR--IKGHF 199

Query: 207 DLVGVVLFPTIITALLYWLSVAGQEFAWLSATSLGFAVFVVVGILGLLLWERRLASPFLP 266
D+ G++L I + + + F +S S F +FV R++ PF+
Sbjct: 200 DIKGIILMSVGIVFFMLFTTSYSISFLIVSVLS--FLIFVKH--------IRKVTDPFVD 249

Query: 267 LDLLAKKAVYMPLLTAALFAACLFAMIFFLPIYLQVGLHTNPAKTG-LLLLPMTFGIVTG 325
L + +L + + + +P ++ + A+ G +++ P T ++
Sbjct: 250 PGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIF 309

Query: 326 STIAGRLLSKEVAPKWLPTFGMGLAFIGLLLISFVPPNANVIGGLGV-LVGIGLGTVMPS 384
I G L+ + P ++ G+ + L SF+ + + + V GL
Sbjct: 310 GYIGGILVDR-RGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTV 368

Query: 385 VQLVVQSVSGKARLSQITAMVSLCRSMGAAIGTALFSVLLYSLLPLTGSELGIAAIKTLP 444
+ +V S + ++++ + G A+ LL + + + LP
Sbjct: 369 ISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLL---------SIPLLDQRLLP 419

Query: 445 TEVVHHAFQY 454
EV + Y
Sbjct: 420 MEVDQSTYLY 429


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_4298  TCRTETA  35     4e-04    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 35.2 bits (81), Expect = 4e-04
Identities = 54/278 (19%), Positives = 101/278 (36%), Gaps = 39/278 (14%)

Query: 43 PVSQVAFVFGLL----SLSLAVASSMAGKLQERFGVRNVTLGAGLLLGLGFLLTAQASNL 98
+ V +G+L +L + + G L +RFG R V L + + + + A A L
Sbjct: 37 HSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFL 96

Query: 99 MMLYLCAGILVGFADGTGY--------LMTLSNCVKWFPERKGLISALAIGAYGLGSLGF 150
+LY+ I+ G TG + + F G +SA +G G +
Sbjct: 97 WVLYI-GRIVAGITGATGAVAGAYIADITDGDERARHF----GFMSA----CFGFGMVAG 147

Query: 151 KYINVLLLENTGLETTFQLWGLIAMALVLCGGMLMKDA------PAQSAASQLAESRDFT 204
+ L+ F + L G L+ ++ P + A S F
Sbjct: 148 PVLGGLMGG-FSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLAS--FR 204

Query: 205 LAEAMRKPQYWMLALMFLSACMSG----LYVIGVAKDIGEKMVDLPVLVAANAVAVIAMA 260
A M ++A+ F+ + L+VI + + +AA + +
Sbjct: 205 WARGMT-VVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGI----LH 259

Query: 261 NLSGRLVLGILSDKIPRIRVISLAQIITLVGMVLLLFI 298
+L+ ++ G ++ ++ R + L I G +LL F
Sbjct: 260 SLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFA 297


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Sbal_4299  HTHFIS  68     4e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 67.5 bits (165), Expect = 4e-15
Identities = 24/132 (18%), Positives = 59/132 (44%), Gaps = 6/132 (4%)

Query: 3 KAIIVEDEYLAREELE-YLVKSHSEIDIVASFEDGLEAFKYLQDHEVDVVFLDIQIPSID 61
++ +D+ R L L ++ ++ I ++ + + D+V D+ +P +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRW---IAAGDGDLVVTDVVMPDEN 61

Query: 62 GLLLAKNLHKSTHPPHVVFVTAYKEF--AVEAFELEAFDYILKPYNEPRIISLLQKIELA 119
L + K+ V+ ++A F A++A E A+DY+ KP++ +I ++ +
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 120 GRQAPKPQHEAA 131
++ P + +
Sbjct: 122 PKRRPSKLEDDS 133


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
Sbal_4300  PF06580  203    2e-62    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 203 bits (517), Expect = 2e-62
Identities = 60/205 (29%), Positives = 110/205 (53%), Gaps = 13/205 (6%)

Query: 351 EQLQEMTRKAEFTALQSKINPHFLFNALNAISSLIRIRPQQARELIANLADYLRYNLAKG 410
++ M ++A+ AL+++INPHF+FNALN I +LI P +ARE++ +L++ +RY+L
Sbjct: 152 WKMASMAQEAQLMALKAQINPHFMFNALNNIRALILEDPTKAREMLTSLSELMRYSLRYS 211

Query: 411 D-ELIDIQEEVKQVRDYVAIEQARFGDKLEVVFDVDD--VHFCVPCLLLQPLVENAILHG 467
+ + + +E+ V Y+ + +F D+L+ ++ + VP +L+Q LVEN I HG
Sbjct: 212 NARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVENGIKHG 271

Query: 468 IQPRSAPGRVTIEVKKLDAGIRVAVRDTGYGISQEVIDGVAAGRIESSSIGLMNVHQRVK 527
I G++ ++ K + + + V +TG ES+ GL NV +R++
Sbjct: 272 IAQLPQGGKILLKGTKDNGTVTLEVENTG--------SLALKNTKESTGTGLQNVRERLQ 323

Query: 528 LLYGE--GLQLKRLEPGTEVSFYLP 550
+LYG ++L + +P
Sbjct: 324 MLYGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
Sbal_4303  HTHFIS  92     8e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 92.2 bits (229), Expect = 8e-24
Identities = 35/112 (31%), Positives = 60/112 (53%), Gaps = 1/112 (0%)

Query: 4 KVLVVDDEAQIHTFMRISLEAEGFEYHGAASIASALAQYQAQRPHVLVLDLGLPDGDGIS 63
+LV DD+A I T + +L G++ ++ A+ A ++V D+ +PD +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 64 LLQNLRQHDK-VPVLILTARDQEEEKIRLLEAGANDYLSKPFGIRELIARIK 114
LL +++ +PVL+++A++ I+ E GA DYL KPF + ELI I
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116



 
Contact Sachin Pundhir for Bugs/Comments.