PredictBias

Identification of genomic and pathogenicity islands in prokaryotic genomes
 
A) Input parameters
Genome: 2245.gbk
Threshold dinucleotide bias: 2
Threshold codon bias: 4
Threshold %GC bias: 3
E-value (RPSBlast): 0.05
Genome (non-pathogenic):
 
 
C) Potential GIs and PAIs in NC_009997
S.No | Start | End | Bias | Virulence | Insertion elements | Prediction
1 | Sbal195_0098 | Sbal195_0108 | Y | Y | N | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
Sbal195_0098 | -1 | 15 | 3.893892 | MarR family transcriptional regulator
Sbal195_0099 | -1 | 16 | 4.097951 | siderophore-interacting protein
Sbal195_0100 | -2 | 15 | 3.656434 | imidazolonepropionase
Sbal195_0101 | -1 | 17 | 3.922215 | histidine utilization repressor
Sbal195_0102 | -2 | 19 | 3.635716 | urocanate hydratase
Sbal195_0103 | -1 | 20 | 3.429642 | histidine ammonia-lyase
Sbal195_0104 | 1 | 19 | 3.245681 | molybdopterin oxidoreductase Fe4S4 region
Sbal195_0105 | 1 | 18 | 3.558516 | formate dehydrogenase subunit alpha
Sbal195_0106 | 0 | 18 | 4.124253 | formate dehydrogenase subunit beta
Sbal195_0107 | -1 | 17 | 3.587052 | formate dehydrogenase subunit gamma
Sbal195_0108 | -1 | 18 | 3.570430 | formate dehydrogenase accessory protein FdhE
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Sbal195_0100 | UREASE | 43 | 2e-06 | Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 42.8 bits (101), Expect = 2e-06
Identities = 22/52 (42%), Positives = 29/52 (55%), Gaps = 8/52 (15%)

Query: 352 TLNAAKALGIEDNVGSLLVGKQADFCLWDIATPAQLAYSYGVNPCKDVVKNG 403
T+N A A G+ +GSL VGK+AD LW+ PA +GV P V+ G
Sbjct: 410 TINPAIAHGLSHEIGSLEVGKRADLVLWN---PAF----FGVKP-DMVLLGG 453



Score = 30.9 bits (70), Expect = 0.011
Identities = 18/61 (29%), Positives = 29/61 (47%), Gaps = 6/61 (9%)

Query: 23 YGAITNAAMAVKDGKIAWLGPRSE---LPAFDVL---SIPVYRGKGGWITPGLIDAHTHL 76
+ I A + +KDG+IA +G P ++ V G+G +T G +D+H H
Sbjct: 80 HWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGKIVTAGGMDSHIHF 139

Query: 77 I 77
I
Sbjct: 140 I 140
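
Each hit block in this report carries a statistics line of the form "Score = 42.8 bits (101), Expect = 2e-06". The following is a minimal sketch, not part of PredictBias itself, of how such lines could be parsed and checked against the E-value cutoff of 0.05 listed under Input parameters. The function names `parse_stats` and `passes_cutoff` are hypothetical, and treating the cutoff as inclusive (`<=`) is an assumption.

```python
import re

# Matches BLAST-style statistics lines such as
# "Score = 42.8 bits (101), Expect = 2e-06".
STATS_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*([\d.eE+-]+)"
)


def parse_stats(line):
    """Return (bit_score, raw_score, e_value), or None if the line has no stats."""
    match = STATS_RE.search(line)
    if match is None:
        return None
    bits, raw, expect = match.groups()
    # Some BLAST output abbreviates 1e-100 as "e-100", which float() rejects.
    if expect.startswith(("e", "E")):
        expect = "1" + expect
    return float(bits), int(raw), float(expect)


def passes_cutoff(line, cutoff=0.05):
    """True if the line holds an E-value at or below the cutoff (assumed inclusive)."""
    stats = parse_stats(line)
    return stats is not None and stats[2] <= cutoff
```

A line such as "Score = 327 bits (839), Expect = e-100" is handled by the prefix fix, while lines without statistics (for example "Length = 570") yield None.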


2 | Sbal195_0165 | Sbal195_0170 | Y | Y | N | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
Sbal195_0165 | 5 | 15 | 2.364642 | aspartyl/asparaginyl beta-hydroxylase
Sbal195_0166 | 4 | 15 | 2.357569 | phytanoyl-CoA dioxygenase
Sbal195_0167 | 5 | 16 | 2.711410 | hypothetical protein
Sbal195_0168 | 5 | 16 | 3.076178 | N-acetyltransferase GCN5
Sbal195_0169 | 5 | 17 | 3.259984 | tail collar domain-containing protein
Sbal195_0170 | 4 | 16 | 2.912769 | outer membrane adhesin-like protein
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Sbal195_0168 | SACTRNSFRASE | 41 | 5e-07 | Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 40.7 bits (95), Expect = 5e-07
Identities = 19/94 (20%), Positives = 41/94 (43%), Gaps = 6/94 (6%)

Query: 71 YILFYHQQAVGKVMLDISEYRIHLVDFIII-PSMRGRGFGSAILAAIKQEAMKRQLP-VG 128
++ + +G++ + + L++ I + R +G G+A+L + A + +
Sbjct: 68 FLYYLENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLM 127

Query: 129 LSVESENTQAKKLYLQHGFKPESYSGAYESMLWR 162
L + N A Y +H F GA ++ML+
Sbjct: 128 LETQDINISACHFYAKHHFI----IGAVDTMLYS 157


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Sbal195_0170 | OMPADOMAIN | 48 | 4e-07 | OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 48.4 bits (115), Expect = 4e-07
Identities = 39/180 (21%), Positives = 57/180 (31%), Gaps = 26/180 (14%)

Query: 3505 AVLLAGTVSQANAA---DNWYVEGFVGQAQVDSSRRDLQPQTAAGVVTSVDDKDTAFGLS 3561
AV LAG + A AA + WY +G +Q + + G
Sbjct: 9 AVALAGFATVAQAAPKDNTWYTGAKLGWSQYHDTG-------FINNNGPTHENQLGAGAF 61

Query: 3562 VGYQWTPMVAIELGYADFGNGSARIEGASLTPAQYHEQVKAVTPVLADGVMLGLRFTLLQ 3621
GYQ P V E+GY G R+ ++ A GV L +
Sbjct: 62 GGYQVNPYVGFEMGYDWLG----RMPYKGSVENGAYK---------AQGVQLTAKLGYPI 108

Query: 3622 HDAWRFEVPIGLFRWQADISSTMGNSRLTTELDGTDWYAGVRFSYQVSDAWSVGLGYQYV 3681
D +G W+AD S + T G Y ++ + L YQ+
Sbjct: 109 TDDLDIYTRLGGMVWRADTKSNVYGKNHDT---GVSPVFAGGVEYAITPEIATRLEYQWT 165


3 | Sbal195_0325 | Sbal195_0371 | Y | Y | N | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
Sbal195_0325 | 2 | 18 | 3.211725 | hypothetical protein
Sbal195_0326 | 2 | 17 | 2.576332 | rhodanese domain-containing protein
Sbal195_0327 | 0 | 16 | 3.091351 | XRE family transcriptional regulator
Sbal195_0328 | 0 | 16 | 3.228297 | benzoate transporter
Sbal195_0329 | 0 | 17 | 2.687234 | N-acetyltransferase GCN5
Sbal195_0330 | 0 | 16 | 3.237521 | RNA-binding S1 domain-containing protein
Sbal195_0331 | 0 | 19 | 3.431434 | hypothetical protein
Sbal195_0332 | -1 | 19 | 4.492762 | 3-oxoacyl-(acyl carrier protein) synthase II
Sbal195_0333 | 1 | 19 | 4.186930 | 3-ketoacyl-ACP reductase
Sbal195_0334 | 2 | 19 | 3.916382 | thioester dehydrase family protein
Sbal195_0335 | 2 | 18 | 3.934899 | 3-oxoacyl-ACP synthase
Sbal195_0336 | 1 | 19 | 3.859343 | hypothetical protein
Sbal195_0337 | 1 | 19 | 3.897220 | FAD-binding monooxygenase
Sbal195_0338 | 2 | 19 | 3.735620 | hypothetical protein
Sbal195_0339 | 1 | 21 | 3.338356 | hypothetical protein
Sbal195_0340 | 0 | 20 | 3.134390 | thioesterase superfamily protein
Sbal195_0341 | 0 | 20 | 3.141337 | histidine ammonia-lyase
Sbal195_0342 | 0 | 20 | 2.088735 | glycosyl transferase family protein
Sbal195_0343 | 0 | 18 | 1.781676 | thioester dehydrase family protein
Sbal195_0344 | 0 | 19 | 2.471928 | aconitate hydratase
Sbal195_0345 | 0 | 16 | 1.355527 | hypothetical protein
Sbal195_0346 | -1 | 14 | 3.034982 | acyl carrier protein
Sbal195_0347 | -1 | 16 | 3.688110 | acyl carrier protein
Sbal195_0348 | -1 | 17 | 3.350275 | phospholipid/glycerol acyltransferase
Sbal195_0349 | 1 | 18 | 3.247509 | hypothetical protein
Sbal195_0350 | 1 | 19 | 2.530328 | hypothetical protein
Sbal195_0351 | 0 | 19 | 3.219168 | ATP-dependent DNA helicase RecG
Sbal195_0352 | 0 | 20 | 2.186829 | two component LuxR family transcriptional
Sbal195_0353 | -1 | 19 | 2.320985 | integral membrane sensor signal transduction
Sbal195_0354 | -1 | 20 | 2.781187 | hypothetical protein
Sbal195_0355 | -1 | 20 | 2.633258 | CaCA family Na(+)/Ca(+) antiporter
Sbal195_0356 | 0 | 16 | 3.745364 | AMP-dependent synthetase and ligase
Sbal195_0357 | -1 | 14 | 3.742112 | putative endoribonuclease L-PSP
Sbal195_0358 | -1 | 14 | 3.483288 | bifunctional (p)ppGpp synthetase II/
Sbal195_0359 | -1 | 14 | 3.692401 | DNA-directed RNA polymerase subunit omega
Sbal195_0360 | -1 | 15 | 3.448349 | guanylate kinase
Sbal195_0361 | -1 | 15 | 3.022531 | hypothetical protein
Sbal195_0362 | -1 | 14 | 1.011106 | hypothetical protein
Sbal195_0363 | 0 | 15 | 0.588169 | aminoglycoside phosphotransferase
Sbal195_0364 | 0 | 13 | -0.516959 | major facilitator superfamily transporter
Sbal195_0365 | 0 | 18 | -5.080779 | hypothetical protein
Sbal195_0366 | 2 | 25 | -7.682624 | filamentation induced by cAMP protein fic
Sbal195_0367 | 1 | 27 | -8.243008 | hypothetical protein
Sbal195_0368 | 2 | 30 | -8.908040 | filamentation induced by cAMP protein fic
Sbal195_0369 | 2 | 28 | -8.159283 | hypothetical protein
Sbal195_0370 | 1 | 28 | -8.022684 | hypothetical protein
Sbal195_0371 | 0 | 22 | -6.113055 | histidine kinase
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Sbal195_0329 | SACTRNSFRASE | 35 | 3e-05 | Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 35.3 bits (81), Expect = 3e-05
Identities = 21/79 (26%), Positives = 34/79 (43%), Gaps = 2/79 (2%)

Query: 55 NNLAGCGALKWLDAEHAEIKSMRTAAPYKQQGIASKILQHLINDAKSAGVKRLSLETGSM 114
NN G ++ +A I+ + A Y+++G+ + +L I AK L LET +
Sbjct: 74 NNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDI 133

Query: 115 DFFNPARLLYCKFGFEICG 133
+ A Y K F I
Sbjct: 134 NI--SACHFYAKHHFIIGA 150


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Sbal195_0333 | DHBDHDRGNASE | 102 | 2e-28 | 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 102 bits (256), Expect = 2e-28
Identities = 68/248 (27%), Positives = 114/248 (45%), Gaps = 15/248 (6%)

Query: 5 VLVTGSSRGIGKAIALKLAAAGFDIALHYHSNQTAADATAVEVRALGVNVSLLKFDVAER 64
+TG+++GIG+A+A LA+ G IA N + ++A + DV +
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIA-AVDYNPEKLEKVVSSLKAEARHAEAFPADVRDS 69

Query: 65 AAVKAAIEADIEANGAYYGVILNAGINRDTAFPAMTESEWDSVIHTNLDGFYNVIHPCVM 124
AA+ G ++ AG+ R ++++ EW++ N G +N V
Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS-VS 128

Query: 125 PMVQGRKGGRIITLASVSGIAGNRGQVNYSASKAGIIGATKALSLELAKRKITVNCIAPG 184
+ R+ G I+T+ S Y++SKA + TK L LELA+ I N ++PG
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 185 LIETDM----------VADIPKDMVEQL---VPMRRMGKPNEIAALAAFLMSDDAAYITR 231
ETDM + K +E +P++++ KP++IA FL+S A +IT
Sbjct: 189 STETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHITM 248

Query: 232 QVISVNGG 239
+ V+GG
Sbjct: 249 HNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Sbal195_0338 | ACRIFLAVINRP | 35 | 0.001 | Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 35.2 bits (81), Expect = 0.001
Identities = 27/151 (17%), Positives = 51/151 (33%), Gaps = 21/151 (13%)

Query: 700 LLALALGIALLLFSLNFGFKKAAVVVAVPALAALLTLATLGLTGSPLSLFHALALILIFG 759
A+ L ++ L +AVP + L T A L G ++ ++L G
Sbjct: 345 FEAIMLVFLVMYLFLQNMRATLIPTIAVP-VVLLGTFAILAAFGYSINTLTMFGMVLAIG 403

Query: 760 IGIDYSL----------------FFASAQNHGKTVMMAVFMSACSTLLAFGLLAFSQTQA 803
+ +D ++ + + + A+ A F +AF
Sbjct: 404 LLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGST 463

Query: 804 ---IHYFGLTLSLGIGFTFLLSPLILTTTLA 831
F +T+ + + L++ LILT L
Sbjct: 464 GAIYRQFSITIVSAMALSVLVA-LILTPALC 493



Score = 34.4 bits (79), Expect = 0.002
Identities = 27/155 (17%), Positives = 56/155 (36%), Gaps = 23/155 (14%)

Query: 694 RLLTLKLLALALGIALLLFSLNFGFKKAAVVVAVPALAALLTLATLGLTGSPLSLFHALA 753
+ L ++ + + L L +L + V+ V L + L L ++ +
Sbjct: 871 QAPALVAISFVV-VFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVG 929

Query: 754 LILIFGIGIDYSLFFAS-----AQNHGKTVMMAV-----------FMSACSTLLAFGLLA 797
L+ G+ ++ + GK V+ A M++ + +L LA
Sbjct: 930 LLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLA 989

Query: 798 FSQTQAIHYFGLTLSLGIGFTFLLSPLILTTTLAL 832
S G ++GIG ++ ++ T LA+
Sbjct: 990 ISNGAGS---GAQNAVGIG---VMGGMVSATLLAI 1018


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Sbal195_0351 | SECA | 41 | 1e-05 | SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 41.4 bits (97), Expect = 1e-05
Identities = 30/84 (35%), Positives = 39/84 (46%), Gaps = 8/84 (9%)

Query: 294 MRLVQGDV-----GSGKTLVAAMAA-LQAIENGYQVAMMAPTELLAEQHATNFAAWFEPL 347
M L + + G GKTL A + A L A+ G V ++ + LA++ A N FE L
Sbjct: 92 MVLNERCIAEMRTGEGKTLTATLPAYLNALT-GKGVHVVTVNDYLAQRDAENNRPLFEFL 150

Query: 348 GLKVGW-LAGKLKGKARAQSLADI 370
GL VG L G R ADI
Sbjct: 151 GLTVGINLPGMPAPAKREAYAADI 174


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Sbal195_0352 | HTHFIS | 76 | 2e-18 | FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 75.6 bits (186), Expect = 2e-18
Identities = 29/117 (24%), Positives = 51/117 (43%), Gaps = 1/117 (0%)

Query: 2 KILLAEDQAMVRGALAALLTLAGGFNITQASDGDEALGLLKQQSFDLLLTDIEMPGRTGL 61
IL+A+D A +R L L+ AG +++ S+ + DL++TD+ MP
Sbjct: 5 TILVADDDAAIRTVLNQALSRAG-YDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 ELAAWLKDQHSQTKVVVITTFGRAGYIKRAIEAGVGGFLLKDAPSETLVNAIQQVMA 118
+L +K V+V++ +A E G +L K L+ I + +A
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Sbal195_0353 | PF06580 | 37 | 1e-04 | Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 36.8 bits (85), Expect = 1e-04
Identities = 70/379 (18%), Positives = 124/379 (32%), Gaps = 57/379 (15%)

Query: 1 MTSTHLQLERKLAWVYLINLVFYL---IPLTINAYPAWKIALSFAVLIPFIASYF-WAYK 56
M STH Q + + I Y ++ F + I + AY+
Sbjct: 1 MASTHRQANKYYWYCQGIGWGVYTLTGFGFASLYGSPKLHSMIFNIAISLMGLVLTHAYR 60

Query: 57 CTQNSAYRPILMMVAIATAITPINPGSISLFTFAAFFIGF-FYPLRTCLLAIAALIGLLF 115
L M I + P A IG ++ T + + A I
Sbjct: 61 SFIKRQGWLKLNMGQIILRVLP-----------ACVVIGMVWFVANTSIWRLLAFINTKP 109

Query: 116 ALNEIYDFNSYYFPLYGSGLVLGVGMFG------VAERRRHQHKLKEQQSTQEISTLAAM 169
+ S F + + + FG + Q K+ ++ L A
Sbjct: 110 VAFTLPLALSIIFNVVVVTFMWSLLYFGWHFFKNYKQAEIDQWKMASMAQEAQLMALKAQ 169

Query: 170 VERERIARDLHDIMGHSLSSIALKAELAEKLLAKQEYQLATIQLNELGQIARESLSQIR- 228
+ + L++I + I A ++L L ++ R SL
Sbjct: 170 INPHFMFNALNNIR----ALILEDPTKAREMLTS------------LSELMRYSLRYSNA 213

Query: 229 HTVSDYKHKGLADSVTQLCKLLREKGVSVELTGNIPKLPARMESQLGLIVTELVNNILRH 288
VS + DS QL + E + E N + ++ ++V LV N ++H
Sbjct: 214 RQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPP---MLVQTLVENGIKH 270

Query: 289 SGASQC------IIDFIQQADRLVVEVKDNGP----SKPIAEGNGLTGIRERLDSLGG-- 336
G +Q ++ + + +EV++ G + + G GL +RERL L G
Sbjct: 271 -GIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQMLYGTE 329

Query: 337 -SLSYNLEQG-YAFTVSLP 353
+ + +QG V +P
Sbjct: 330 AQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Sbal195_0358 | PF07328 | 32 | 0.003 | T-DNA border endonuclease VirD1
		>PF07328#T-DNA border endonuclease VirD1

Length = 144

Score = 31.9 bits (72), Expect = 0.003
Identities = 17/68 (25%), Positives = 32/68 (47%), Gaps = 4/68 (5%)

Query: 506 PEQIEKVI----RDTKHTTLDSLLADIGLGNAMSIVIAQRLIGDNLENQESRDGHMMPIR 561
P +++KVI + + D+ +A++GL ++ IA R IG +EN + +
Sbjct: 16 PARVDKVISVKMTEAELAEFDAQIAELGLNRNRALRIAARRIGGFVENDAKTVELLRDMS 75

Query: 562 GAEGMLVT 569
A + T
Sbjct: 76 RAIAGVAT 83


4 | Sbal195_0398 | Sbal195_0416 | Y | Y | N | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
Sbal195_0398 | -1 | 14 | 3.151351 | 3-isopropylmalate dehydrogenase
Sbal195_0399 | -1 | 14 | 2.784047 | isopropylmalate isomerase large subunit
Sbal195_0400 | -1 | 13 | 1.917867 | isopropylmalate isomerase small subunit
Sbal195_0401 | -2 | 13 | 1.864973 | aromatic hydrocarbon degradation membrane
Sbal195_0402 | -1 | 13 | 2.519751 | glycerol kinase
Sbal195_0403 | 0 | 10 | 2.925656 | hypothetical protein
Sbal195_0404 | 0 | 10 | 3.026711 | cell division protein MraZ
Sbal195_0405 | 0 | 10 | 3.063181 | S-adenosyl-methyltransferase MraW
Sbal195_0406 | 0 | 11 | 3.245004 | cell division protein FtsL
Sbal195_0407 | 1 | 11 | 3.220923 | peptidoglycan glycosyltransferase
Sbal195_0408 | 0 | 12 | 3.312051 | UDP-N-acetylmuramoylalanyl-D-glutamate--2,
Sbal195_0409 | 0 | 14 | 3.348111 | UDP-N-acetylmuramoylalanyl-D-glutamyl-2,
Sbal195_0410 | 1 | 16 | 2.992453 | phospho-N-acetylmuramoyl-pentapeptide-
Sbal195_0411 | 1 | 15 | 3.286991 | UDP-N-acetylmuramoyl-L-alanyl-D-glutamate
Sbal195_0412 | 1 | 16 | 3.428273 | cell division protein FtsW
Sbal195_0413 | 1 | 14 | 3.156427 | undecaprenyldiphospho-muramoylpentapeptide
Sbal195_0414 | 1 | 15 | 2.666327 | UDP-N-acetylmuramate--L-alanine ligase
Sbal195_0415 | 0 | 15 | 1.602424 | polypeptide-transport-associated
Sbal195_0416 | 3 | 19 | 1.224712 | cell division protein FtsA
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Sbal195_0416 | SHAPEPROTEIN | 68 | 8e-15 | Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 68.2 bits (167), Expect = 8e-15
Identities = 51/221 (23%), Positives = 91/221 (41%), Gaps = 20/221 (9%)

Query: 150 SGMRMEAKVHIVTC----ANDMAKNITK-SVERCGLKVDDLVFSGIASADAVLTFDEKDL 204
S M ++ C A + + + S + G + L+ +A+A +
Sbjct: 100 SNSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEAT 159

Query: 205 GVCIVDIGGGTTDIAVYTNGALRHCAVVPVAGNQVTNDIAKIFR------TPSSHAEQIK 258
G +VDIGGGTT++AV + + + + V + G++ I R + AE+IK
Sbjct: 160 GSMVVDIGGGTTEVAVISLNGVVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIK 219

Query: 259 VQFACARSSMVSREDSIEVPS---VGGRPSR-SMSRHTLAEVVEPRYQELFELVLKELKD 314
+ A IEV G P +++ + + E ++ + V+ L+
Sbjct: 220 HEIGSAYPG--DEVREIEVRGRNLAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQ 277

Query: 315 SGLE---DQIAAGIVLTGGTASIQGVVDIAEATFGMPVRVA 352
E D G+VLTGG A ++ + + G+PV VA
Sbjct: 278 CPPELASDISERGMVLTGGGALLRNLDRLLMEETGIPVVVA 318


5 | Sbal195_0510 | Sbal195_0524 | Y | Y | N | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
Sbal195_0510 | 3 | 24 | -1.879176 | hypothetical protein
Sbal195_0511 | 3 | 25 | -1.491321 | MSHA pilin protein MshB
Sbal195_0512 | 2 | 25 | -1.747845 | methylation site containing protein
Sbal195_0513 | 2 | 21 | -1.222417 | methylation site containing protein
Sbal195_0514 | 2 | 19 | -0.372206 | MSHA pilin protein MshD
Sbal195_0515 | 2 | 19 | -0.301716 | MSHA biogenesis protein MshO
Sbal195_0516 | 2 | 19 | -0.107214 | MSHA biogenesis protein MshP
Sbal195_0517 | 0 | 16 | -0.047244 | hypothetical protein
Sbal195_0518 | -3 | 13 | 0.897961 | rod shape-determining protein MreB
Sbal195_0519 | -3 | 14 | 1.008916 | rod shape-determining protein MreC
Sbal195_0520 | -3 | 12 | 0.835872 | rod shape-determining protein MreD
Sbal195_0521 | -2 | 10 | 1.427668 | maf protein
Sbal195_0522 | -2 | 11 | 2.039885 | ribonuclease G
Sbal195_0523 | -1 | 12 | 2.234580 | hypothetical protein
Sbal195_0524 | -1 | 15 | 3.275097 | nitrilase/cyanide hydratase and apolipoprotein
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Sbal195_0511 | BCTERIALGSPG | 44 | 5e-08 | Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 43.7 bits (103), Expect = 5e-08
Identities = 15/36 (41%), Positives = 27/36 (75%)

Query: 4 KQTGFSLIELVIVIVILGLLAATAIPRFLNVTDDAE 39
KQ GF+L+E+++VIVI+G+LA+ +P + + A+
Sbjct: 6 KQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKAD 41


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Sbal195_0512 | BCTERIALGSPG | 50 | 2e-10 | Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 50.3 bits (120), Expect = 2e-10
Identities = 18/53 (33%), Positives = 31/53 (58%), Gaps = 4/53 (7%)

Query: 1 MKRQQGFTLIELVVVIIILGILAVTAAPKFINLQGDARA----STIQGMKGAI 49
+Q+GFTL+E++VVI+I+G+LA P + + A S I ++ A+
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENAL 56


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Sbal195_0513 | BCTERIALGSPH | 42 | 2e-07 | Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 41.9 bits (98), Expect = 2e-07
Identities = 27/82 (32%), Positives = 43/82 (52%), Gaps = 1/82 (1%)

Query: 8 KQAGFTLVELVTTIILISILAVVVLPRLFTQSSYSAYSLRNEFISELRQVQQKALNNTDR 67
+Q GFTL+E++ ++L+ + A +VL SA F ++LR VQQ+ L +
Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQT-GQ 60

Query: 68 CFRVTVSGTGYQVSQFSARNGA 89
F V+V +Q AR+GA
Sbjct: 61 FFGVSVHPDRWQFLVLEARDGA 82


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Sbal195_0514 | BCTERIALGSPH | 37 | 2e-05 | Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 36.8 bits (85), Expect = 2e-05
Identities = 16/58 (27%), Positives = 31/58 (53%), Gaps = 6/58 (10%)

Query: 23 QQGFTLIELVIGMLVIAIAIVMLTSMLFPQA--DRAASTLHRVRSA-ELA--HSVMNE 75
Q+GFTL+E+++ +L++ ++ M+ + FP + D AA TL R + +
Sbjct: 3 QRGFTLLEMMLILLLMGVSAGMVL-LAFPASRDDSAAQTLARFEAQLRFVQQRGLQTG 59


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Sbal195_0515 | BCTERIALGSPG | 30 | 0.004 | Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 30.2 bits (68), Expect = 0.004
Identities = 12/27 (44%), Positives = 21/27 (77%), Gaps = 2/27 (7%)

Query: 5 LSAVNKKSTLGFTLVEMVTVILILGIL 31
+ A +K+ GFTL+E++ VI+I+G+L
Sbjct: 1 MRATDKQR--GFTLLEIMVVIVIIGVL 25


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Sbal195_0518 | SHAPEPROTEIN | 558 | 0.0 | Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 558 bits (1440), Expect = 0.0
Identities = 317/348 (91%), Positives = 334/348 (95%), Gaps = 1/348 (0%)

Query: 1 MFKKLRGIFSNDLSIDLGTANTLIYVRGEGIVLNEPSVVAIRGERGGSGQKSVAAVGTEA 60
M KK RG+FSNDLSIDLGTANTLIYV+G+GIVLNEPSVVAIR +R GS KSVAAVG +A
Sbjct: 1 MLKKFRGMFSNDLSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGS-PKSVAAVGHDA 59

Query: 61 KQMLGRTPGNIQAIRPMKDGVIADFYVTEKMLQHFIKQVHNNSFFRPSPRVLVCVPVGAT 120
KQMLGRTPGNI AIRPMKDGVIADF+VTEKMLQHFIKQVH+NSF RPSPRVLVCVPVGAT
Sbjct: 60 KQMLGRTPGNIAAIRPMKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGAT 119

Query: 121 QVERRAIRESAMGAGAREVYLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAIISLN 180
QVERRAIRESA GAGAREV+LIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVA+ISLN
Sbjct: 120 QVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLN 179

Query: 181 GVVYSSSVRIGGDKFDDAIINYVRRNYGSLIGEATAERIKHTIGTAYPGDEVLEIEVRGR 240
GVVYSSSVRIGGD+FD+AIINYVRRNYGSLIGEATAERIKH IG+AYPGDEV EIEVRGR
Sbjct: 180 GVVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGR 239

Query: 241 NLAEGVPRSFTLNSNEILEALQEPLSGIVSAVMVALEQSPPELASDISERGMVLTGGGAL 300
NLAEGVPR FTLNSNEILEALQEPL+GIVSAVMVALEQ PPELASDISERGMVLTGGGAL
Sbjct: 240 NLAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGAL 299

Query: 301 LRDLDRLLMQETGIPVMVADDPLTCVARGGGKALEMIDMHGGDLFSEE 348
LR+LDRLLM+ETGIPV+VA+DPLTCVARGGGKALEMIDMHGGDLFSEE
Sbjct: 300 LRNLDRLLMEETGIPVVVAEDPLTCVARGGGKALEMIDMHGGDLFSEE 347


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Sbal195_0519 | IGASERPTASE | 33 | 0.002 | IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 32.7 bits (74), Expect = 0.002
Identities = 22/97 (22%), Positives = 38/97 (39%), Gaps = 6/97 (6%)

Query: 237 EVLTEDGQSYARVTAQPLAALDRIRYVLLIWPSPDSGVTLPNQPTVPAADHSLIENSSKI 296
+V TE Q +VT+Q ++ V P + N PTV + + ++
Sbjct: 1113 KVETEKTQEVPKVTSQVSPKQEQSETVQ-----PQAEPARENDPTVNIKEPQ-SQTNTTA 1166

Query: 297 GSASPAEGTSADTTKPVTTPAATVAKPATETTPPATE 333
+ PA+ TS++ +PVT + P T
Sbjct: 1167 DTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTT 1203


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Sbal195_0523 | IGASERPTASE | 42 | 2e-05 | IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 42.0 bits (98), Expect = 2e-05
Identities = 35/177 (19%), Positives = 59/177 (33%), Gaps = 28/177 (15%)

Query: 1263 AEPVVEELERKSKEIEIPEAILPVVGVERPAAPTEAQSTEAKPTETKPTETKLTEAQSTE 1322
E V E +++SK +E E + A T AQ+ E + + +
Sbjct: 1037 TETVAENSKQESKTVEKNE---------QDATETTAQNREVAKEAKSNVKANTQTNEVAQ 1087

Query: 1323 VKPNDAKSEPTQAPLSPISEPQLQEAQSEALELQDIKPEEARSEESKSEEPKPSTDQIKA 1382
+++ T+ E E + +A + E + + + P +Q +
Sbjct: 1088 SGSETKETQTTET-----KETATVEKEEKAKVETEKTQEVPK----VTSQVSPKQEQSET 1138

Query: 1383 PDLKLVQPQATPSPE-VPVT----PATPENQPAVEPIKQIQGDQNAHQPVAMSEQSR 1434
VQPQA P+ E P P + N A + N QPV S
Sbjct: 1139 -----VQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVN 1190



Score = 38.9 bits (90), Expect = 2e-04
Identities = 30/147 (20%), Positives = 51/147 (34%), Gaps = 2/147 (1%)

Query: 1293 AAPTEAQSTEAKPTETKPTETKLTEAQSTEVKPNDAKSEPTQAPLSPISEPQLQEAQSEA 1352
A + +++ E + TETK T T E ++ + + +SP + Q + Q +A
Sbjct: 1085 VAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSP-KQEQSETVQPQA 1143

Query: 1353 LELQDIKPEEARSEESKSEEPKPSTDQIKAPDLKLVQPQATPSPEVPVTPATPENQPAVE 1412
++ P E T+Q V+ T S V + EN P
Sbjct: 1144 EPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVEN-PENT 1202

Query: 1413 PIKQIQGDQNAHQPVAMSEQSRRQRQS 1439
Q N+ + RR +S
Sbjct: 1203 TPATTQPTVNSESSNKPKNRHRRSVRS 1229


6 | Sbal195_0543 | Sbal195_0557 | Y | Y | Y | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
Sbal195_0543 | 0 | 20 | -3.837878 | type 11 methyltransferase
Sbal195_0544 | 0 | 28 | -7.054703 | nicotinamide mononucleotide transporter PnuC
Sbal195_0545 | 0 | 27 | -6.184515 | purine phosphorylase family 1
Sbal195_0546 | 0 | 20 | -5.033774 | cyclic nucleotide-binding protein
Sbal195_0547 | 0 | 20 | -4.750708 | hypothetical protein
Sbal195_0548 | 0 | 20 | -4.089431 | OmpA domain-containing protein
Sbal195_0549 | 1 | 17 | -3.368316 | hypothetical protein
Sbal195_0550 | -1 | 16 | 1.710992 | transposase IS4 family protein
Sbal195_0551 | -1 | 17 | 2.687429 | phosphoribosylaminoimidazolesuccinocarboxamide
Sbal195_0552 | -1 | 14 | 2.633258 | hypothetical protein
Sbal195_0553 | -1 | 16 | 3.028203 | activator of Hsp90 ATPase 1 family protein
Sbal195_0554 | -1 | 17 | 2.723641 | pentapeptide repeat-containing protein
Sbal195_0555 | 1 | 18 | 3.503472 | molybdopterin oxidoreductase
Sbal195_0556 | -2 | 18 | 3.301882 | 4Fe-4S ferredoxin
Sbal195_0557 | -2 | 19 | 3.307363 | polysulfide reductase NrfD
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Sbal195_0548 | OMPADOMAIN | 69 | 3e-16 | OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 69.2 bits (169), Expect = 3e-16
Identities = 51/206 (24%), Positives = 75/206 (36%), Gaps = 29/206 (14%)

Query: 1 MKNTILTLAAIISLASVSVYSHAENMKNNDVAENGIYVGANYGY------LKVDGKDDFD 54
MK T + +A + +V A +N Y GA G+ ++
Sbjct: 1 MKKTAIAIA-VALAGFATVAQAAPK-------DNTWYTGAKLGWSQYHDTGFINNNGPTH 52

Query: 55 DNSDVIQGLVGYRFNQYLAIEGGYVNFGDY---GNSLSNA-ETDGYTAALKVSYPIVDRV 110
+N GY+ N Y+ E GY G G+ + A + G K+ YPI D +
Sbjct: 53 ENQLGAGAFGGYQVNPYVGFEMGYDWLGRMPYKGSVENGAYKAQGVQLTAKLGYPITDDL 112

Query: 111 ELYAKGGQLWYSTDYDVLGFSGNKDDEGV--FAGAGVAFKVTDRFLINAEYTWYDAGITV 168
++Y + G + + D G D GV GV + +T EY W
Sbjct: 113 DIYTRLGGMVWRADTKSNV-YGKNHDTGVSPVFAGGVEYAITPEIATRLEYQW------T 165

Query: 169 ENVSSGA--DTDTDFKQASLGVEYRF 192
N+ T D SLGV YRF
Sbjct: 166 NNIGDAHTIGTRPDNGMLSLGVSYRF 191


7 | Sbal195_0580 | Sbal195_0629 | Y | Y | Y | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
Sbal195_0580 | 1 | 22 | -4.469917 | major facilitator superfamily transporter
Sbal195_0581 | 2 | 30 | -5.979110 | ssDNA-binding protein
Sbal195_0582 | 3 | 30 | -7.002836 | integrase family protein
Sbal195_0583 | 3 | 30 | -6.954517 | integrase family protein
Sbal195_0584 | 2 | 27 | -6.113518 | hypothetical protein
Sbal195_0585 | 3 | 25 | -5.815670 | hypothetical protein
Sbal195_0586 | 2 | 22 | -3.789708 | hypothetical protein
Sbal195_0587 | 2 | 25 | -4.990257 | ATPase AAA
Sbal195_0588 | 2 | 24 | -4.428636 | hypothetical protein
Sbal195_0589 | 2 | 24 | -4.127888 | transposase IS4 family protein
Sbal195_0590 | 3 | 25 | -5.944478 | ISSod3, transposase
Sbal195_0591 | 3 | 28 | -7.300564 | hypothetical protein
Sbal195_0592 | 3 | 34 | -9.717577 | nucleotide sugar dehydrogenase
Sbal195_0593 | 1 | 29 | -9.091548 | transposase IS4 family protein
Sbal195_0594 | 3 | 31 | -10.618108 | hypothetical protein
Sbal195_0595 | 5 | 35 | -11.694541 | hypothetical protein
Sbal195_0596 | 1 | 24 | -5.506883 | hypothetical protein
Sbal195_0597 | 1 | 22 | -3.583206 | transposase IS4 family protein
Sbal195_0598 | 1 | 21 | -2.879315 | hypothetical protein
Sbal195_0599 | 1 | 21 | -1.375696 | hypothetical protein
Sbal195_0600 | 1 | 21 | -0.789879 | hypothetical protein
Sbal195_0601 | 1 | 21 | 0.810781 | hypothetical protein
Sbal195_0602 | 2 | 22 | 2.350772 | virulence plasmid 65kDa B protein
Sbal195_0603 | 2 | 29 | 4.925793 | YD repeat-containing protein
Sbal195_0604 | 2 | 32 | 7.175300 | YD repeat-containing protein
Sbal195_0605 | 2 | 30 | 7.188456 | resolvase domain-containing protein
Sbal195_0606 | 2 | 28 | 5.747736 | hypothetical protein
Sbal195_0607 | 1 | 28 | 1.107436 | putative integrase protein
Sbal195_0608 | 0 | 22 | -2.805707 | putative integrase protein
Sbal195_0609 | 3 | 22 | -3.083125 | type II secretion system protein
Sbal195_0610 | 4 | 22 | -3.252286 | transposase, IS4 family protein
Sbal195_0611 | 5 | 24 | -4.004492 | transposase, IS4 family protein
Sbal195_0612 | 4 | 27 | -6.996479 | hypothetical protein
Sbal195_0613 | 4 | 28 | -7.078769 | winged helix family two component response
Sbal195_0614 | 5 | 30 | -7.339494 | Ig domain-containing protein
Sbal195_0615 | 6 | 35 | -10.158249 | Ig domain-containing protein
Sbal195_0616 | 3 | 38 | -12.139970 | hypothetical protein
Sbal195_0617 | 3 | 28 | -8.426951 | P pilus assembly protein porin PapC-like
Sbal195_0618 | 3 | 26 | -6.026446 | hypothetical protein
Sbal195_0619 | 4 | 23 | -5.109279 | hypothetical protein
Sbal195_0620 | 2 | 20 | -3.124633 | hypothetical protein
Sbal195_0621 | 3 | 21 | -2.676317 | transposase IS4 family protein
Sbal195_0622 | 5 | 23 | -2.866846 | Ig domain-containing protein
Sbal195_0623 | 4 | 33 | -7.051160 | OmpA domain-containing protein
Sbal195_0624 | 2 | 30 | -6.177217 | transposase, IS4 family protein
Sbal195_0625 | 4 | 31 | -5.865638 | transposase, IS4 family protein
Sbal195_0626 | 3 | 26 | -4.936113 | hypothetical protein
Sbal195_0627 | 0 | 24 | -3.936200 | hypothetical protein
Sbal195_0628 | -1 | 22 | -4.108315 | hypothetical protein
Sbal195_0629 | -1 | 20 | -3.102966 | transposase IS3/IS911 family protein
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Sbal195_0580 | TCRTETB | 86 | 3e-20 | Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 86.1 bits (213), Expect = 3e-20
Identities = 73/358 (20%), Positives = 139/358 (38%), Gaps = 48/358 (13%)

Query: 48 LWVGIAIGAYGLTQAVLQIPMGILSDKYGRKPIILIGLVLFAIGSLIAANADSIYGV-VF 106
WV A+ LT ++ G LSD+ G K ++L G+++ GS+I S + + +
Sbjct: 52 NWV---NTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIM 108

Query: 107 GRAVQGMGAIA--AAVLALAADLTRDEQRTKVMAIIGMCIGGSFALSLLVGPIVAQHVGL 164
R +QG GA A A V+ + A E R K +IG + + +G ++A ++
Sbjct: 109 ARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHW 168

Query: 165 SGLFFLTAILAVTGMLIVQFLVPNPISHAP---KGDTLATPARLKRML-------TDPQL 214
S L + I +T +++ L KG L + + ML + +
Sbjct: 169 SYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIV 228

Query: 215 FRLDAGIFILHL-----------------VLTAVFVALPLDLVDAGLVKEKHWMLYF--- 254
L IF+ H+ + V + AG V +M+
Sbjct: 229 SVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQ 288

Query: 255 --PAFVGAFFL---MVPLIIIG------VKRKNTKAMFQIALVIMIVALLAMALFSN-NL 302
A +G+ + + +II G V R+ + I + + V+ L +
Sbjct: 289 LSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTS 348

Query: 303 WVLSFAVVLFFTGFNYLEASLPSLIAKFCPVGEKGSAMGVYSTSQFLGAFCGGMLGGG 360
W ++ +V G ++ + + ++++ E G+ M + + + FL G + GG
Sbjct: 349 WFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGG 406



Score = 31.0 bits (70), Expect = 0.010
Identities = 18/111 (16%), Positives = 43/111 (38%)

Query: 274 RKNTKAMFQIALVIMIVALLAMALFSNNLWVLSFAVVLFFTGFNYLEASLPSLIAKFCPV 333
+ K + ++I + + + +L A + G A + ++A++ P
Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134

Query: 334 GEKGSAMGVYSTSQFLGAFCGGMLGGGAFQLVGAVGVFIVAVILMSIWLFL 384
+G A G+ + +G G +GG + + ++ +I + FL
Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFL 185


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Sbal195_0581 | PF03544 | 29 | 0.013 | Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 29.2 bits (65), Expect = 0.013
Identities = 18/98 (18%), Positives = 33/98 (33%), Gaps = 4/98 (4%)

Query: 125 PMGGGMPQNAGYQSAPQQAAPAQNQYAPAPQAAPAYQAPAQQQYAAPAPAQQQYGQQQAQ 184
P+ M A + P + P P+ P + P + P + + +
Sbjct: 49 PISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKP----KPKPK 104

Query: 185 PQQGGYAPKPQAAPAPAYQAPAAPAQRPAPQPQQNFTP 222
P+ +P+ P PA+P + AP + T
Sbjct: 105 PKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTA 142


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Sbal195_0591 | INTIMIN | 40 | 5e-05 | Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 40.1 bits (93), Expect = 5e-05
Identities = 29/144 (20%), Positives = 54/144 (37%), Gaps = 9/144 (6%)

Query: 698 NRDQDNWQGLAAWLDIRSYKIANTILIGEALETIATSANHFVFCIAPLDVERVNLNVSVD 757
N N G A + ++S K ++ + E + + V I + + D
Sbjct: 610 NSANTNGSGKAT-VTLKSDKPGQVVVSAKTAEMTSALNANAV--IFVDQTKASITEIKAD 666

Query: 758 NVISNLNNKGKVEATLYHSNN-QPATGMDVSFKVNNGALFSNGADSIIVRTGSDGKASVE 816
+ N + + T+ +P + +V+F G L + +T ++G A V
Sbjct: 667 KTTAVANGQDAITYTVKVMKGDKPVSNQEVTFTTTLGKL-----SNSTEKTDTNGYAKVT 721

Query: 817 LTNATEGVVTVTANYKDASNVEPA 840
LT+ T G V+A D + A
Sbjct: 722 LTSTTPGKSLVSARVSDVAVDVKA 745



Score = 33.5 bits (76), Expect = 0.006
Identities = 15/53 (28%), Positives = 23/53 (43%), Gaps = 3/53 (5%)

Query: 777 NNQPATGMDVSFKVNNGALFSNGADSIIVRTGSDGKASVELTNATEGVVTVTA 829
N + VSF + +G + + T GKA+V L + G V V+A
Sbjct: 587 NGVAQANVPVSFNIVSGTAVLSANSA---NTNGSGKATVTLKSDKPGQVVVSA 636


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
Sbal195_0595 | TYPE3OMGPROT | 32 | 0.002 | Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature.

Length = 607

Score = 31.8 bits (72), Expect = 0.002
Identities = 22/76 (28%), Positives = 36/76 (47%), Gaps = 4/76 (5%)

Query: 37 QGEISHTKRKIKILSKINITLSLLKSNEQYG--IKRLFIKIKNTVIDEVKHHILQLNNLL 94
+ E+S K+ +L I +L + + RLFI I+ +IDE H L L N
Sbjct: 465 RDELSVALSKVPLLGDIPYIGALFRRKSELTRRTVRLFI-IEPRIIDEGIAHHLALGNGQ 523

Query: 95 K-KKGIQLIVEVEDQA 109
+ GI + E+ +Q+
Sbjct: 524 DLRTGILTVDEISNQS 539


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0598  ANTHRAXTOXNA  25     0.035    Anthrax toxin LF subunit signature.
		>ANTHRAXTOXNA#Anthrax toxin LF subunit signature.

Length = 800

Score = 25.5 bits (55), Expect = 0.035
Identities = 21/67 (31%), Positives = 35/67 (52%), Gaps = 5/67 (7%)

Query: 27 VGYFPKSTTGDLYLKKVA----ALYLNDYWDILNSLSDCDIKYALNKYKNFQRYNKLIES 82
VG + S D + KK + A YL+DY++ N + + K ++ ++ Q YN+ IE+
Sbjct: 676 VGVYKDSGDKDEFAKKESVKKIAGYLSDYYNSANHIFSQEKKRKISIFRGIQAYNE-IEN 734

Query: 83 TLASSLI 89
L S I
Sbjct: 735 VLKSKQI 741


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0602  SALSPVBPROT   327    e-100    Salmonella virulence plasmid 65kDa B protein signature.
		>SALSPVBPROT#Salmonella virulence plasmid 65kDa B protein signature.

Length = 591

Score = 327 bits (839), Expect = e-100
Identities = 173/409 (42%), Positives = 235/409 (57%), Gaps = 46/409 (11%)

Query: 16 IVPLSLPKGGGAVTSMGMAPGNVGADGTASFSIPLPISAGRSGSTLTPPVAISYNSGAGN 75
I P LPKGG A++ G DG AS ++PLPISA R P +A+ Y+SG GN
Sbjct: 15 ITPPFLPKGGKALSQ-------SGPDGLASITLPLPISAERG---FAPALALHYSSGGGN 64

Query: 76 GIFGLGWQLPVMRISRRTRYGVPSFDESSDDQTDQYLGPDGEVLLPLLNEEGLVIIETRT 135
G FG+GW M I+R T +GVP +++S D++LGPDGEVL+ L+ T
Sbjct: 65 GPFGVGWSCATMSIARSTSHGVPQYNDS-----DEFLGPDGEVLVQTLSTGDAPNPVTCF 119

Query: 136 TFRELEFEQAYQVCRYQPRVEGSFSRIERWWRDGEPASTFWLIHDATGHLHCLGKTIAGR 195
+ ++ F Q+Y V RYQPR E SF R+E +W FWL+HD+ G LH LGKT A R
Sbjct: 120 AYGDVSFPQSYTVTRYQPRTESSFYRLE-YWVGNSNGDDFWLLHDSNGILHLLGKTAAAR 178

Query: 196 VASPQLLTDSYNPRIGEWLLEESVSPTGEHMIYCYQDENNVGVSPDKQYS---LGTLPHL 252
++ PQ S+ +WL+EESV+P GEH+ Y Y EN V + + + +L
Sbjct: 179 LSDPQ--AASH---TAQWLVEESVTPAGEHIYYSYLAENGDNVDLNGNEAGRDRSAMRYL 233

Query: 253 MEIRYGNLHPAANLYLWSSDNVNSATSGVEWLFKLIFDYGARGLDPHSQPQDKIEPDQHW 312
+++YGN PAA+LYLW+S AT V+WLF L+FDYG RG+DP P W
Sbjct: 234 SKVQYGNATPAADLYLWTS-----ATPAVQWLFTLVFDYGERGVDPQVPP--AFTAQNSW 286

Query: 313 TARHDPFSRFDYGYEVRCHRLCRQIIMFHQNFTELNQGCPTVVGRLILDYDENAVLSRLI 372
AR DPFS ++YG+E+R HRLCRQ++MFH EL + T+V RL+L+YDEN +L++L
Sbjct: 287 LARQDPFSLYNYGFEIRLHRLCRQVLMFHHFPDELGEA-DTLVSRLLLEYDENPILTQLC 345

Query: 373 GARLWAYDTDG---------KPQSQPPLFLNYTSFDTSSRPDA-WHVFQ 411
AR AY+ DG P PP + SSRP + W + +
Sbjct: 346 AARTLAYEGDGYRRAPVNNMMPPPPPPPMMG----GNSSRPKSKWAIVE 390
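
The summary lines in these alignments follow the usual BLAST text layout, and an E-value is sometimes printed without a leading digit (as in `Expect = e-100` for this hit). A minimal Python sketch for reading such lines is given below. The function names `parse_evalue` and `parse_hit_summary` and the dictionary keys are illustrative assumptions; they are not part of PredictBias or BLAST.

```python
import re

# Patterns for the two summary lines that precede each alignment block.
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
IDENT_RE = re.compile(r"Identities\s*=\s*(\d+)/(\d+)")


def parse_evalue(text):
    """Convert an E-value string to float; 'e-100' is read as 1e-100."""
    if text.lower().startswith("e"):
        text = "1" + text
    return float(text)


def parse_hit_summary(score_line, identities_line):
    """Hypothetical helper: bit score, raw score, E-value, identity fraction."""
    s = SCORE_RE.search(score_line)
    i = IDENT_RE.search(identities_line)
    if s is None or i is None:
        raise ValueError("line does not look like a BLAST summary line")
    matches, length = int(i.group(1)), int(i.group(2))
    return {
        "bits": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "evalue": parse_evalue(s.group(3)),
        "identity": matches / length,  # recomputed, not the rounded percent
    }


# Summary lines copied from the SALSPVBPROT hit of Sbal195_0602.
summary = parse_hit_summary(
    "Score = 327 bits (839), Expect = e-100",
    "Identities = 173/409 (42%), Positives = 235/409 (57%), Gaps = 46/409 (11%)",
)
print(summary)
```

Recomputing the identity fraction from the two counts avoids relying on the rounded percentage printed in parentheses.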


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0614  INTIMIN       43     6e-06    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 43.1 bits (101), Expect = 6e-06
Identities = 24/120 (20%), Positives = 42/120 (35%)

Query: 745 NALANNTETNQVSVTVRDARNALVSGEEVSFSASNGATVETPTVLTNSNGVAIASIKSTQ 804
+A A+ TE + TV+ A + S A + + TN +G A ++KS +
Sbjct: 569 SAKADGTEAITYTATVKKNGVAQANVPVSFNIVSGTAVLSANSANTNGSGKATVTLKSDK 628

Query: 805 SGISTITASFNGTTKTVEVTFILVNCQSLGQSLEGACIDIFDADNNGKLFTSSPSVAYLD 864
G ++A T + ++ Q+ E N T + V D
Sbjct: 629 PGQVVVSAKTAEMTSALNANAVIFVDQTKASITEIKADKTTAVANGQDAITYTVKVMKGD 688


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0615  INTIMIN       41     2e-05    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 41.2 bits (96), Expect = 2e-05
Identities = 35/164 (21%), Positives = 55/164 (33%), Gaps = 10/164 (6%)

Query: 738 VERVNLNVSVDNVTSNGAEKNTVEATLYHSNNQPATGVDVSFKVNNGALFSNGTDSIIVR 797
V + + ++G E T AT+ N V VSF + +G + +
Sbjct: 558 VGVTDFTADKTSAKADGTEAITYTATV-KKNGVAQANVPVSFNIVSGTAVLSANSA---N 613

Query: 798 TGRDGKASVEVASTTEGVVTVTAIYRDSTN-LNPDYTGITKVVNINFSTYLPDNIVYEG- 855
T GKA+V + S G V V+A + T+ LN + + + D
Sbjct: 614 TNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANAVIFVDQTKASITEIKADKTTAVAN 673

Query: 856 ----ISLTPLVSYNQAVAAGLEIDSTLELGYTIAVVNHIDATKY 895
I+ T V + E+ T LG D Y
Sbjct: 674 GQDAITYTVKVMKGDKPVSNQEVTFTTTLGKLSNSTEKTDTNGY 717



Score = 37.7 bits (87), Expect = 3e-04
Identities = 21/101 (20%), Positives = 39/101 (38%), Gaps = 9/101 (8%)

Query: 721 VSTTNTPLLVFCIAPLDVERVNLNVSVDNVTSNGAEKNTVEATLYHSNN-QPATGVDVSF 779
S N ++F + + D T+ ++ + T+ +P + +V+F
Sbjct: 642 TSALNANAVIF---VDQTKASITEIKADKTTAVANGQDAITYTVKVMKGDKPVSNQEVTF 698

Query: 780 KVNNGALFSNGTDSIIVRTGRDGKASVEVASTTEGVVTVTA 820
G L + +T +G A V + STT G V+A
Sbjct: 699 TTTLGKL-----SNSTEKTDTNGYAKVTLTSTTPGKSLVSA 734


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0617  PF00577       36     7e-04    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 36.0 bits (83), Expect = 7e-04
Identities = 31/224 (13%), Positives = 68/224 (30%), Gaps = 23/224 (10%)

Query: 411 QIGARYIYEDLFSADYFLG-YFSTGDIYQSANIKFGRLSLSAKAFDLDYNTRNFTLSNQL 469
+R ++ + D + D Y A K G+L L+ +T + S+Q
Sbjct: 492 TTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQT 551

Query: 470 YGTNPY--KNFSISYSKPFFGGNGYLNYNNYSSKNYYGINNDNSIVIEKPIDYIYNINAL 527
Y + F + F N L+Y+ +KN + D + + I
Sbjct: 552 YWGTSNVDEQFQAGLNTAFEDINWTLSYS--LTKNAWQKGRDQMLALNVNI--------- 600

Query: 528 NNVLNNALNNGSYSTISNENYNIGWSTNIYSGTLTLNSNYNSNSAYDEVKFGIYWSQRFG 587
++ S ++ + S ++ +N + +S + G
Sbjct: 601 ------PFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTG 654

Query: 588 KSVAGGLSVMTNNKGSSQYNNS---LSLNASNDNWYANHTVMAS 628
+ G + + + Y ++ S+ + S
Sbjct: 655 YAGGGDGNSGSTGYATLNYRGGYGNANIGYSHSDDIKQLYYGVS 698


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0622  INTIMIN       43     6e-06    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 43.1 bits (101), Expect = 6e-06
Identities = 17/81 (20%), Positives = 32/81 (39%)

Query: 743 NALANNTVTNQVSVTVRDANNAPVAGQEVIFNASNNATVVTQTVLTDGNGVAIASIRSPQ 802
+A A+ T + TV+ A S A + + T+G+G A +++S +
Sbjct: 569 SAKADGTEAITYTATVKKNGVAQANVPVSFNIVSGTAVLSANSANTNGSGKATVTLKSDK 628

Query: 803 SGISTVTASFNGTTKTVDVTF 823
G V+A T ++
Sbjct: 629 PGQVVVSAKTAEMTSALNANA 649



Score = 42.4 bits (99), Expect = 1e-05
Identities = 40/148 (27%), Positives = 59/148 (39%), Gaps = 10/148 (6%)

Query: 730 VDNDKSTIAVIFNNALANNTVTNQVSVTVRDANN-APVAGQEVIFNASNNATVVTQTVLT 788
+ I A+AN ++ TV+ PV+ QEV F + + T T
Sbjct: 656 TKASITEIKADKTTAVANGQDA--ITYTVKVMKGDKPVSNQEVTFTTTLGKLSNS-TEKT 712

Query: 789 DGNGVAIASIRSPQSGISTVTASFNGTT---KTVDVTFNCQ-SLAGACIDIFDTG-SGKL 843
D NG A ++ S G S V+A + K +V F ++ I+I TG GKL
Sbjct: 713 DTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNIEIVGTGVKGKL 772

Query: 844 FTNSPSVAYLNSIGGSAADGTYTETGTN 871
T +N + S +G YT N
Sbjct: 773 PTVWLQYGQVN-LKASGGNGKYTWRSAN 799


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0623  OMPADOMAIN    60     5e-13    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 59.6 bits (144), Expect = 5e-13
Identities = 40/203 (19%), Positives = 67/203 (33%), Gaps = 25/203 (12%)

Query: 1 MRKFNLAVVIPLSIMSCSAVASYSDSSLELGVSAGQFNLKDS-----TGSYSGPSVGFNF 55
M+K +A+ + L+ + A A+ D++ G G D+ G +G
Sbjct: 1 MKKTAIAIAVALAGFATVAQAAPKDNTWYTGAKLGWSQYHDTGFINNNGPTHENQLGAGA 60

Query: 56 I--RNFNDWFSFEGNYL------SSFNMDNANYDIQASTFSLAPVFTYHINDTFSIYGKG 107
N + FE Y +++N Y Q + Y I D IY +
Sbjct: 61 FGGYQVNPYVGFEMGYDWLGRMPYKGSVENGAYKAQGVQLTAKL--GYPITDDLDIYTRL 118

Query: 108 GASSMRITSSERNGLDFSYNTIGWFYGFGLNTSINNRINVRLGYETVTGDTGIEILGVTA 167
G R + + + G+ +I I RL Y+ +G
Sbjct: 119 GGMVWRADTKSNVYGKNHDTGVSPVFAGGVEYAITPEIATRLEYQWTNNIGDAHTIGTRP 178

Query: 168 DGFSIQSSHTKISVISLGATYRF 190
D ++SLG +YRF
Sbjct: 179 D----------NGMLSLGVSYRF 191


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0624  FLGPRINGFLGI  25     0.034    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 25.3 bits (55), Expect = 0.034
Identities = 9/42 (21%), Positives = 19/42 (45%)

Query: 23 TLIITPPDYTSISKLAKIINVQNNNSSRGPIMHVVDTTSLKV 64
L + PD+++ ++A ++N PI D+ + V
Sbjct: 194 VLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAV 235


8  Sbal195_0653  Sbal195_0658  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias    %GCBias  Product
Sbal195_0653      -2       18   3.479412  protein kinase
Sbal195_0654      -2       18   3.317043  RND family efflux transporter MFP subunit
Sbal195_0655      -2       17   3.528908  acriflavin resistance protein
Sbal195_0656      -2       15   3.857298  Sel1 domain-containing protein
Sbal195_0657      -3       17   3.596957  collagenase
Sbal195_0658      -3       17   3.140015  PIG3 family NAD(P)H quinone oxidoreductase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0654  RTXTOXIND     50     1e-08    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 49.8 bits (119), Expect = 1e-08
Identities = 38/192 (19%), Positives = 70/192 (36%), Gaps = 29/192 (15%)

Query: 123 AEQDNTKAKADLDKAKSTLALAKTKLERIEDLL---IKEPFALAKQDVDELRENVNLADA 179
A + K+ L++ +S + AK + + + L I + ++ L LA
Sbjct: 264 AVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLE--LAKN 321

Query: 180 DFRQKQATMNDYLIKAPFDG---QLTSFSQSIGSQIGAGTALVTLYSLN-PVEVRYAISQ 235
+ RQ+ + I+AP QL ++ G + L+ + + +EV +
Sbjct: 322 EERQQASV-----IRAPVSVKVQQLKVHTE--GGVVTTAETLMVIVPEDDTLEVTALVQN 374

Query: 236 NDFGKGQKGQKVNVTVEAYGNKVFKGL---VNYVAP--AVDESSG-------RVEVHAAL 283
D G GQ + VEA+ + L V + D+ G +E +
Sbjct: 375 KDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEENCLS 434

Query: 284 -DNPEFKLAPGM 294
N L+ GM
Sbjct: 435 TGNKNIPLSSGM 446



Score = 47.1 bits (112), Expect = 8e-08
Identities = 23/108 (21%), Positives = 44/108 (40%), Gaps = 7/108 (6%)

Query: 105 ISAIHFSNGDKVTKGQVIAEQDNTKAKADLDKAKSTLALAKTKLERIEDLLIKEPFALAK 164
+ I G+ V KG V+ + A+AD K +S+L A+ + R + L ++
Sbjct: 107 VKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILS----RSIEL 162

Query: 165 QDVDELRENVNLADADFRQKQATMNDYLIKAPFDGQLTSFSQSIGSQI 212
+ EL+ + +++ LIK F T +Q ++
Sbjct: 163 NKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFS---TWQNQKYQKEL 207


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0655  ACRIFLAVINRP  651    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 651 bits (1680), Expect = 0.0
Identities = 301/1032 (29%), Positives = 509/1032 (49%), Gaps = 44/1032 (4%)

Query: 8 IRHPIFASVLSIMAVLLGLIAFQKLDIQYFPEHTTHSASVNASIAGASADFMSSNVADKL 67
IR PIFA VL+I+ ++ G +A +L + +P + SV+A+ GA A + V +
Sbjct: 6 IRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQVI 65

Query: 68 IAAASGIDKVDTM-STDCSEGRCSLTIKFNDDTS-DIEYTNLMNKLRSSVEGINDFPQSM 125
+GID + M ST S G ++T+ F T DI + NKL+ + PQ
Sbjct: 66 EQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLAT---PLLPQE- 121

Query: 126 IDKPTVTDDTSATDSASNIITFVNAGGMEKQAMYDYISQQLVPQLKQVQGVGAVWGPYGG 185
+ + ++ + S++ + G + + DY++ + L ++ GVG V G
Sbjct: 122 VQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDV--QLFG 179

Query: 186 SQKAVRVWLNPEQMKALNIKAADVVGTLGSYNASFTSG------AIKGKSRDFSINPLNQ 239
+Q A+R+WL+ + + + DV+ L N +G A+ G+ + SI +
Sbjct: 180 AQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 240 VETLEDVKDLVIKVS-EGKIIRVADVADVVMGEESLSPSILSIGGHSAMSLQILPLSNAN 298
+ E+ + ++V+ +G ++R+ DVA V +G E+ + I I G A L I + AN
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYN-VIARINGKPAAGLGIKLATGAN 298

Query: 299 PVTVASNIKAEIARMQQHLPQGLEMTLAYNQADFIEASIDEGFSALIEAVILVSLIVVLF 358
+ A IKA++A +Q PQG+++ Y+ F++ SI E L EA++LV L++ LF
Sbjct: 299 ALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLF 358

Query: 359 LGSLRAASIPIITIPVCVIGVFAVMSALGFSINVLTILAIILAIGLVVDDAIVVVENCYR 418
L ++RA IP I +PV ++G FA+++A G+SIN LT+ ++LAIGL+VDDAIVVVEN R
Sbjct: 359 LQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVER 418

Query: 419 HI-ENGETPFNAAIKGCQEIIFPIIAMTLTLAAVYLPIGLMSGLTADLFRQFSFTLAAAV 477
+ E+ P A K +I ++ + + L+AV++P+ G T ++RQFS T+ +A+
Sbjct: 419 VMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAM 478

Query: 478 MISGVVALTLSPMMSAYLINTTEQQPK-----WFSRVEHALQQLNDLYIKELDKWFTRKR 532
+S +VAL L+P + A L+ + +F + Y + K
Sbjct: 479 ALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTG 538

Query: 533 LMLGAAVVLIGLAGIAYWQLPKILLPAEDSGFIDVASNGPTGVGRQYHLNHNAELNGVMD 592
L +++ + + +LP LP ED G P G ++ ++
Sbjct: 539 RYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYL 598

Query: 593 EHPAVGANLSY------IEGEPVN----HVLLKPWGERS---EGIDDVISDLMSKSKESV 639
++ + G+ N V LKPW ER+ + VI + +
Sbjct: 599 KNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIR 658

Query: 640 SAYNMSFSIRSANNLSIANNLRLELTTLDRNK---DELNDTAAKVQKLLEDYPG-LNNVG 695
+ + F++ + L A EL +D+ D L ++ + +P L +V
Sbjct: 659 DGFVIPFNMPAIVELGTATGFDFEL--IDQAGLGHDALTQARNQLLGMAAQHPASLVSVR 716

Query: 696 NSVLRDQLRYDLSIDRNAIILSGVSYGDVTNALSTFLGSVKAADLHATDGFTYPIQVQVN 755
+ L D ++ L +D+ GVS D+ +ST LG D G + VQ +
Sbjct: 717 PNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDF-IDRGRVKKLYVQAD 775

Query: 756 LDKLSDFKVLNKLYVTSESGQALPLSQFVSIKQTTAESNIKTFMGLDSAELTADVMPGYS 815
+ ++KLYV S +G+ +P S F + ++ + GL S E+ + PG S
Sbjct: 776 AKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTS 835

Query: 816 TDEIKAYLDEQLPTLLNDAQGFKYNGVVKDLMDSQAGTQSLFLLALVFIYLILAAQFESF 875
+ + A + E L + L G+ + G+ S +L ++ V ++L LAA +ES+
Sbjct: 836 SGDAMALM-ENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESW 894

Query: 876 VDPLIILLTVPLCIVGALLTLTLFGQSVNIYSQIGLLTLVGLVTKHGILLVEFANK-QQD 934
P+ ++L VPL IVG LL TLF Q ++Y +GLLT +GL K+ IL+VEFA +
Sbjct: 895 SIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEK 954

Query: 935 QGLSAIEAARSSAKSRLRPILMTSLTMILSAIPLALASGPGSLGLANIGLVLVGGLLAGT 994
+G +EA + + RLRPILMTSL IL +PLA+++G GS +G+ ++GG+++ T
Sbjct: 955 EGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSAT 1014

Query: 995 FFSLFVVPVAYV 1006
++F VPV +V
Sbjct: 1015 LLAIFFVPVFFV 1026



Score = 93.4 bits (232), Expect = 2e-21
Identities = 63/362 (17%), Positives = 123/362 (33%), Gaps = 22/362 (6%)

Query: 662 LELTTLDRNKDELNDTAAK-VQKLLEDYPGLNNVGNSVLRDQLRYDLSIDRNAIILSGVS 720
+D+++D A V+ L G+ +V + +R + +D + + ++
Sbjct: 142 FVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQYAMR--IWLDADLLNKYKLT 199

Query: 721 YGDVTNALS-----TFLGSVKAADLHATDGFTYPIQVQVNLDKLSDFKVLNKLYVTSESG 775
DV N L G + I Q +F + G
Sbjct: 200 PVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKNPEEFG--KVTLRVNSDG 257

Query: 776 QALPLSQFVSIKQTTAESNIK-TFMGLDSAELTADVMPGYST----DEIKAYLDEQLPTL 830
+ L ++ N+ G +A L + G + IKA L E P
Sbjct: 258 SVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALDTAKAIKAKLAELQPFF 317

Query: 831 LNDAQGFKYNGVVKDLMDSQAGTQSLF---LLALVFIYLILAAQFESFVDPLIILLTVPL 887
QG K Q + A++ ++L++ ++ LI + VP+
Sbjct: 318 ---PQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPV 374

Query: 888 CIVGALLTLTLFGQSVNIYSQIGLLTLVGLVTKHGILLVEFANK-QQDQGLSAIEAARSS 946
++G L FG S+N + G++ +GL+ I++VE + + L EA S
Sbjct: 375 VLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKS 434

Query: 947 AKSRLRPILMTSLTMILSAIPLALASGPGSLGLANIGLVLVGGLLAGTFFSLFVVPVAYV 1006
++ ++ + IP+A G + +V + +L + P
Sbjct: 435 MSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCA 494

Query: 1007 AM 1008
+
Sbjct: 495 TL 496


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0657  MICOLLPTASE   300    4e-88    Microbial collagenase metalloprotease (M9) signature.
		>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature.

Length = 1104

Score = 300 bits (770), Expect = 4e-88
Identities = 104/553 (18%), Positives = 216/553 (39%), Gaps = 39/553 (7%)

Query: 143 SDFVGKSGQA-LVDQLSQSTPECVGKLYSLKGSAATALFSEANVISVANAIATKAKDYTG 201
D + + + LV+ + + E V L++ + T + V ++ + + YT
Sbjct: 95 FDELNRMNYSDLVELIKTISYENVPDLFNFNDGSYTFFSNRDRVQAIIYGLEDSGRTYTA 154

Query: 202 VDVQHLESHIYFVRAALYVQFYSPNDVPAYSSAAKASLKSALNALFANAAIWTVSDDNAG 261
D + + + + F+RA Y+ FY+ + K A+ A+ N+ + G
Sbjct: 155 DDDKGIPTLVEFLRAGYYLGFYNKQLSYLNTPQLKNECLPAMKAIQYNSNFRLGTKAQDG 214

Query: 262 VLKEALILIDSAELGADFNHATIKVLMDYDANWQASFAMNAAANSVFTTLFRAQWNDDMQ 321
V++ LI +A + + I VL D+ N + + N+VF + + +
Sbjct: 215 VVEALGRLIGNASADPEVINNCIYVLSDFKDNIDKYGSNYSKGNAVFNLMKGIDYYTNSV 274

Query: 322 -----ALFARDQGILDALNNFQLE------HRDLLGTNAEYLLVNSVKELSRLYYIDSMR 370
A++ + ++ + D L + +L+ N++ R+
Sbjct: 275 IYNTKGYDAKNTEFYNRIDPYMERLESLCTIGDKLNNDNAWLVNNALYYTGRMGKFREDP 334

Query: 371 PRVTQLVKNILSSTSKTEPSKVLWYAAAEMADYYDRSHCNDYNICGFKAQLEADTLPFNW 430
+ ++ + + ++ S ND + KA LP +
Sbjct: 335 SISQRALERAMKEYPYLSYQYIEAANDLDLNFGGKNSSGNDIDFNKIKADAREKYLPKTY 394

Query: 431 KCSDSLKI-RAQD-LYQDQAKWACDVLTSQESYFHSKLETGMQPVGQDNNDDLELVIFGS 488
D + +A D + +++ K ++ F ++ + +D L +VI+ S
Sbjct: 395 TFDDGKFVVKAGDKVTEEKIKRLYWASKEVKAQFMRVVQNDKALEEGNPDDILTVVIYNS 454

Query: 489 SSEYKSLANSIFGINTDNGGMYLEGSPAGLKNQARFIAYEAEWRTPDFHVWNL-QHEYVH 547
EYK L I G +TDNGG+Y+E N F YE + + L +HE+ H
Sbjct: 455 PEEYK-LNRIINGFSTDNGGIYIE-------NIGTFFTYERTPEESIYTLEELFRHEFTH 506

Query: 548 YLDGRYNLFGDFSRGTS---ANTIWWIEGLAEYIS---------YRDANTAAIAMGETGE 595
YL GRY + G + +G W+ EG AE+ + R + T +A
Sbjct: 507 YLQGRYVVPGMWGQGEFYQEGVLTWYEEGTAEFFAGSTRTDGIKPRKSVTQGLAYDRNNR 566

Query: 596 FMLSTIFKNNYESGQDRIYRWGYLAVRFMFEHHRDDVRQILAYLRNDQYAEYQTFMDGIG 655
L + Y S Y +G+ +M+ ++ ++ Y++N+ + Y+ ++ +
Sbjct: 567 MSLYGVLHAKYGS--WDFYNYGFALSNYMYNNNMGMFNKMTNYIKNNDVSGYKDYIASMS 624

Query: 656 TRY--DNEWQGWL 666
+ Y ++++Q ++
Sbjct: 625 SDYGLNDKYQDYM 637



Score = 74.8 bits (183), Expect = 1e-15
Identities = 36/184 (19%), Positives = 63/184 (34%), Gaps = 26/184 (14%)

Query: 539 WNLQHEYVHYLDGRYNLFGDFSRGTSANTIWWIEGLAEYISYRDANTAAIA-MGETGEFM 597
+ L +Y Y+D N + ++ + A+ I+ + ++ + + +
Sbjct: 627 YGLNDKYQDYMDSLLNNIDNLDVPLVSD-EYVNGHEAKDINEITNDIKEVSNIKDLSSNV 685

Query: 598 LSTIFKNNYESGQDRIYRWGYLAVRFMFEHH-----RDDVRQILAYLRNDQYAEYQTF-- 650
+ F Y+ R Y+ R E + + IL L + Y+T
Sbjct: 686 EKSQFFTTYD------MRGTYVGGRSQGEENDWKDMNSKLNDILKELSKKSWNGYKTVTA 739

Query: 651 ------MDGIGTR-YDNEWQGWLASGLSTADDGIVDKGPSDV-DAEPSGREGNWTGPAGT 702
+DG G YD + G T D V+K P V ++ S GT
Sbjct: 740 YFVNHKVDGNGNYVYDVVFHGMNT---DTNTDVHVNKEPKAVIKSDSSVIVEEEINFDGT 796

Query: 703 ISKD 706
SKD
Sbjct: 797 ESKD 800
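
The input parameters of this report list an RPSBlast E-value of 0.05. As a rough illustration of how the hits of one island could be screened against such a cutoff and ranked, here is a small Python sketch using the three hits reported for island 8. The name `significant_hits`, the tuple layout, and the choice of `<=` for the comparison are assumptions made for this example; the page does not state how PredictBias applies the cutoff internally.

```python
# Hits reported for island 8, copied from the summary rows of this report:
# (locus tag, matched signature, E-value).
hits = [
    ("Sbal195_0654", "RTXTOXIND", 1e-08),
    ("Sbal195_0655", "ACRIFLAVINRP", 0.0),
    ("Sbal195_0657", "MICOLLPTASE", 4e-88),
]

# E-value (RPSBlast) listed under the input parameters of the report.
EVALUE_CUTOFF = 0.05


def significant_hits(hits, cutoff=EVALUE_CUTOFF):
    """Keep hits at or below the cutoff, most significant (smallest E-value) first."""
    kept = [h for h in hits if h[2] <= cutoff]
    return sorted(kept, key=lambda h: h[2])


ranked = significant_hits(hits)
for locus, signature, evalue in ranked:
    print(locus, signature, evalue)
```

Sorting by E-value puts the strongest matches first, which makes it easier to separate near-certain family matches from weak hits that only just pass the cutoff.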


9  Sbal195_0739  Sbal195_0804  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias    %GCBias  Product
Sbal195_0739       3       27  -0.980187  major facilitator superfamily transporter
Sbal195_0740       3       25  -2.082415  hypothetical protein
Sbal195_0741       4       21  -0.684040  30S ribosomal protein S6
Sbal195_0742       2       14  -0.082791  primosomal replication protein N
Sbal195_0743       1       14  -0.205994  30S ribosomal protein S18
Sbal195_0744       0       12  -0.381882  50S ribosomal protein L9
Sbal195_0745       1        9  -0.119828  hypothetical protein
Sbal195_0746       0       12   0.151655  replicative DNA helicase
Sbal195_0747      -1       15  -0.566742  alanine racemase
Sbal195_0748      -2       16  -1.937320  CheC domain-containing protein
Sbal195_0749      -1       18  -3.489393  TonB-dependent siderophore receptor
Sbal195_0750       1       24  -5.620710  putative hydroxylase
Sbal195_0751       3       29  -7.405728  hypothetical protein
Sbal195_0752       3       32  -8.723674  hypothetical protein
Sbal195_0753       2       27  -7.185224  N-acetyltransferase GCN5
Sbal195_0754       3       24  -6.500452  hypothetical protein
Sbal195_0755      -2       23  -1.431014  hypothetical protein
Sbal195_0756       0       26   1.409153  hypothetical protein
Sbal195_0757       0       32   2.604961  hypothetical protein
Sbal195_0758       3       38   4.594042  GP46
Sbal195_0759       4       38   4.755499  hypothetical protein
Sbal195_0760       2       22  -2.100701  hypothetical protein
Sbal195_0761       2       25  -6.754094  hypothetical protein
Sbal195_0762       1       28  -8.956346  hypothetical protein
Sbal195_0763       1       28  -8.523094  hypothetical protein
Sbal195_0764       1       31  -9.043059  hypothetical protein
Sbal195_0765       0       29  -7.746521  hypothetical protein
Sbal195_0766      -1       26  -4.185576  hypothetical protein
Sbal195_0767      -1       21  -0.740155  hypothetical protein
Sbal195_0768       0       24   2.397186  putative phage repressor
Sbal195_0769       2       36   4.845929  hypothetical protein
Sbal195_0770       2       43   4.826868  hypothetical protein
Sbal195_0771       1       41   3.939671  hypothetical protein
Sbal195_0772       1       41   3.806569  hypothetical protein
Sbal195_0773       0       39   3.586375  hypothetical protein
Sbal195_0774      -1       41   1.425909  replication P family protein
Sbal195_0775      -2       38  -0.196931  integrase family protein
Sbal195_0776       1       28  -1.464538  XRE family transcriptional regulator
Sbal195_0777       2       25  -1.487894  hypothetical protein
Sbal195_0778       0       23  -1.349872  oligoribonuclease
Sbal195_0779       0       18   0.604462  hypothetical protein
Sbal195_0780       0       22   2.402283  glycoside hydrolase
Sbal195_0781       0       21   3.467561  hypothetical protein
Sbal195_0782       0       18   3.241640  hypothetical protein
Sbal195_0783       0       18   3.513619  phage DNA packaging Nu1
Sbal195_0784       0       18   3.733010  phage terminase GpA
Sbal195_0785       1       18   3.809494  hypothetical protein
Sbal195_0786       1       16   3.330488  lambda family phage portal protein
Sbal195_0787       1       14   1.811340  peptidase S14 ClpP
Sbal195_0788       1       15  -0.438352  gifsy-2 prophage; putative RecA/RadA
Sbal195_0789       0       16  -1.300149  hypothetical protein
Sbal195_0790       2       16  -2.058972  hypothetical protein
Sbal195_0791       2       19  -3.975937  hypothetical protein
Sbal195_0792       4       15  -2.736527  hypothetical protein
Sbal195_0793       5       14  -2.604383  hypothetical protein
Sbal195_0794       1       11  -1.389731  DNA-binding protein
Sbal195_0795       0       13  -1.963867  Rha family phage regulatory protein
Sbal195_0796       0       15  -2.207683  QacE-like protein
Sbal195_0797       0       14  -1.349658  TP901 family phage tail tape measure protein
Sbal195_0798       7       17  -0.783409  hypothetical protein
Sbal195_0799       5       18  -3.684006  hypothetical protein
Sbal195_0800       4       25  -4.715605  hypothetical protein
Sbal195_0801       1       20  -3.237335  hypothetical protein
Sbal195_0802       0       18  -2.988904  hypothetical protein
Sbal195_0803      -2       24  -4.679796  hypothetical protein
Sbal195_0804      -2       22  -4.771664  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0739  TCRTETB       45     3e-07    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 45.3 bits (107), Expect = 3e-07
Identities = 30/162 (18%), Positives = 56/162 (34%), Gaps = 1/162 (0%)

Query: 221 APAYASNLGLPPEKVATYMTATILAGLLAQWPMGKLSDIMSRSRLIRINCILLGILALAI 280
P A++ PP TA +L + GKLSD + RL+ I+ ++
Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96

Query: 281 ALTPYHPVISLVMTFLFGILGFTFYPLATALANSRVEQNERVGLSATILLTFGLGASIGP 340
+ + ++ F+ G F L + + + R I +G +GP
Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156

Query: 341 LIASTLMQWLGNSMLYGFMSACTLILFVRLRYVHSQQKVETN 382
I + ++ S L T+I L + ++
Sbjct: 157 AIGGMIAHYIHWSYLLLIPMI-TIITVPFLMKLLKKEVRIKG 197



Score = 31.0 bits (70), Expect = 0.010
Identities = 26/137 (18%), Positives = 55/137 (40%), Gaps = 12/137 (8%)

Query: 213 IVGSFYGLAPAYASNLGLPPEKVATYMTATILAGLLAQWPMGKLSDIMSRSRLIRINCIL 272
++ + L+ A ++ + P + I+ G + G L D ++ I
Sbjct: 282 MMKDVHQLSTAEIGSVIIFPG-----TMSVIIFGYIG----GILVDRRGPLYVLNIGVTF 332

Query: 273 LGILALAIALTPYHP--VISLVMTFLFGILGFTFYPLATALANSRVEQNERVGLSATILL 330
L + L + +++++ F+ G L FT ++T +++S +Q G+S
Sbjct: 333 LSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFT 392

Query: 331 TFGLGASIGPLIASTLM 347
+F L G I L+
Sbjct: 393 SF-LSEGTGIAIVGGLL 408


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0745  V8PROTEASE    46     3e-07    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 45.8 bits (108), Expect = 3e-07
Identities = 16/51 (31%), Positives = 26/51 (50%), Gaps = 1/51 (1%)

Query: 650 SVPVNFLS-SVDTTGGNSGSPVFNGKGELVGLNFDSTYEAITKDWFFSPTI 699
+ + + TTGGNSGSPVFN K E++G+++ F + +
Sbjct: 220 YLKGEAMQYDLSTTGGNSGSPVFNEKNEVIGIHWGGVPNEFNGAVFINENV 270


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0747  ALARACEMASE   438    e-157    Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 438 bits (1129), Expect = e-157
Identities = 160/350 (45%), Positives = 218/350 (62%), Gaps = 6/350 (1%)

Query: 6 RAEISSSALQTNLAALRQQAPASRVMAVVKANGYGHGLLNVANCLVSADGFGLARLDEAL 65
+A + AL+ NL+ +RQ A +RV +VVKAN YGHG+ + + + + DGF L L+EA+
Sbjct: 6 QASLDLQALKQNLSIVRQAATHARVWSVVKANAYGHGIERIWSAIGATDGFALLNLEEAI 65

Query: 66 ELRAGGVTARLLLLEGFFRATDLPLLVGHDIDTVVHHSSQLEMLEQTVLSKPVTVWLKVD 125
LR G +L+LEGFF A DL + H + T VH + QL+ L+ L P+ ++LKV+
Sbjct: 66 TLRERGWKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKALQNARLKAPLDIYLKVN 125

Query: 126 SGMHRLGFTPEQFSTVYDRLMACSNVAKPIHLMTHFACADEPDNTYTSVQMAAFNTLTAG 185
SGM+RLGF P++ TV+ +L A +NV + LM+HFA A+ PD MA G
Sbjct: 126 SGMNRLGFQPDRVLTVWQQLRAMANV-GEMTLMSHFAEAEHPDGISG--AMARIEQAAEG 182

Query: 186 LPGFRTLANSAGALYWPQSQGDWIRPGIALYGVSPVT--GDCGANHGLVPAMELVSQLIA 243
L R+L+NSA L+ P++ DW+RPGI LYG SP D AN GL P M L S++I
Sbjct: 183 LECRRSLSNSAATLWHPEAHFDWVRPGIILYGASPSGQWRDI-ANTGLRPVMTLSSEIIG 241

Query: 244 VRDHKANQPVGYGCFWTAKQDTRLGVVAIGYGDGYPRNAPEGTPVWVNGRRVPIVGRVSM 303
V+ KA + VGYG +TA+ + R+G+VA GY DGYPR+AP GTPV V+G R VG VSM
Sbjct: 242 VQTLKAGERVGYGGRYTARDEQRIGIVAAGYADGYPRHAPTGTPVLVDGVRTMTVGTVSM 301

Query: 304 DMLTVDLGQDAQDKVGDSALLWGKALPVEEVAEHIGTIAYELVTKLTPRV 353
DML VDL Q +G LWGK + +++VA GT+ YEL+ L RV
Sbjct: 302 DMLAVDLTPCPQAGIGTPVELWGKEIKIDDVAAAAGTVGYELMCALALRV 351


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0761  MYCMG045      26     0.032    Hypothetical mycoplasma lipoprotein (MG045) signature.
		>MYCMG045#Hypothetical mycoplasma lipoprotein (MG045) signature.

Length = 483

Score = 25.8 bits (56), Expect = 0.032
Identities = 22/96 (22%), Positives = 34/96 (35%), Gaps = 17/96 (17%)

Query: 14 IELLLSLTRISSPEVIAALTLHYTSALPAERAAARHGIELSNFMRGQKKLEQIA------ 67
+ L L+ S + A Y S L ER +H + + +K + A
Sbjct: 14 VSLSSILSSCGSTTFVLANFESYISPLLLERVQEKHPLTFLTYPSNEKLINGFANNTYSV 73

Query: 68 ------ATVEAIK-----AIDWAKLQLTHLQSSSLQ 92
A E I+ IDW++ L SSS +
Sbjct: 74 AVASTYAVSELIERDLLSPIDWSQFNLKKSSSSSDK 109


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0797  GPOSANCHOR    42     2e-05    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 41.6 bits (97), Expect = 2e-05
Identities = 36/277 (12%), Positives = 79/277 (28%), Gaps = 22/277 (7%)

Query: 21 EAKKSEQALQELGRESEKLNEQLDDLKRQ-QEAIKAIDSLTESINKGERAYVDNAQALDK 79
K + EL E E+L + E I L E+A
Sbjct: 79 NNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTA 138

Query: 80 LKQEQKQANTEAKNLEKSQQDAAASTAKLETEYSQTAAQLASYDSQLASARAEVERLTTT 139
+ K E L + D + + +A++ + +++ A+ A L
Sbjct: 139 DSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKA 198

Query: 140 QNKGAQASQAQAKALSAAKTDLQQLESAQKNTATSATKLANELEQERSEFTRLGSEVEKA 199
S A + + + + L + + + + N + ++ L +E
Sbjct: 199 LEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAAL 258

Query: 200 GRNKAEYALKVKSARTELNQLGSSLGRNKAELDKQQTVLNKAGID--------------- 244
+AE ++ A + + +AE +
Sbjct: 259 EARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDL 318

Query: 245 ------MGKLADASQELKTKQAGAEAALKGVNDKLAQ 275
+L Q+L+ + +EA+ + + L
Sbjct: 319 DASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDA 355



Score = 37.4 bits (86), Expect = 5e-04
Identities = 37/216 (17%), Positives = 71/216 (32%), Gaps = 1/216 (0%)

Query: 19 SAEAKKSEQALQELGRESEKLNEQLDDLKRQQEAIKA-IDSLTESINKGERAYVDNAQAL 77
SA+ K E L +L + L+ A A I +L D +AL
Sbjct: 175 SAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKAL 234

Query: 78 DKLKQEQKQANTEAKNLEKSQQDAAASTAKLETEYSQTAAQLASYDSQLASARAEVERLT 137
+ + + K LE + A A+LE + +++ + AE L
Sbjct: 235 EGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALE 294

Query: 138 TTQNKGAQASQAQAKALSAAKTDLQQLESAQKNTATSATKLANELEQERSEFTRLGSEVE 197
+ SQ + + DL A+K KL + + + L +++
Sbjct: 295 AEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLD 354

Query: 198 KAGRNKAEYALKVKSARTELNQLGSSLGRNKAELDK 233
+ K + + + + +S + +LD
Sbjct: 355 ASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDA 390



Score = 30.8 bits (69), Expect = 0.041
Identities = 58/347 (16%), Positives = 112/347 (32%), Gaps = 8/347 (2%)

Query: 733 EYVTAKQVEDSKLRSKEIQESITRDLDDITNQTKRMKGESSIAYEGLIKTAVKYSGSIDQ 792
+ + K + E+ DL+ S + L +
Sbjct: 100 KLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKAD 159

Query: 793 LSIAQRNQLNDILLSGKYNGDLEKTYRELTATLVRANRETEIEAEFKNKAADASKKKAEE 852
L A +N LE L A + E F + K E
Sbjct: 160 LEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAE 219

Query: 853 DKKAAEAADALAASQGAINDSTKAYAQALKDIEAKQATLNSLYEQGKLSADDLVTASANL 912
A L + + + A + +K +EA++A L + + + + + + S
Sbjct: 220 KAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTAD 279

Query: 913 HQTIKAYNVEVDNSNVKAVTQTELTSAFISKRKELQTQYEKGLLTEKELNISLQELA--- 969
IK E + + + R+ L+ + +K+L Q+L
Sbjct: 280 SAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQN 339

Query: 970 -ASHTKSVEQSNKSIAATGLLSDAQLDLQEKILRTEKEVRDLEAA-LKDDSKAS-AELTI 1026
S A+ + QL+ + + L + ++ + L+ D AS
Sbjct: 340 KISEASRQSLRRDLDASRE--AKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQ 397

Query: 1027 IKAKLAKEEANLADLKRESVELSKIENATYVELLILQRDYEAQLEAL 1073
++ L + + LA L++ + EL + + T E LQ EA+ +AL
Sbjct: 398 VEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEAKAL 444


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0800  UREASE        31     0.006    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 31.2 bits (71), Expect = 0.006
Identities = 12/62 (19%), Positives = 24/62 (38%)

Query: 147 GYDTQAYFIGDIESSVANDAGVFTIVSGESVSADATNSNGVTTNSGALGAQLYNTDGSNS 206
G D+ +FI + A +G+ ++ G + A T + T + + D
Sbjct: 132 GMDSHIHFICPQQIEEALMSGLTCMLGGGTGPAHGTLATTCTPGPWHIARMIEAADAFPM 191

Query: 207 NI 208
N+
Sbjct: 192 NL 193


10  Sbal195_0821  Sbal195_0832  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias    %GCBias  Product
Sbal195_0821       0       16  -3.358516  porin
Sbal195_0822      -1       13  -2.100520  porin
Sbal195_0823      -1       16  -1.375613  UBA/THIF-type NAD/FAD-binding protein
Sbal195_0824      -1       18  -0.752023  integral membrane sensor signal transduction
Sbal195_0825      -1       18   0.454940  hypothetical protein
Sbal195_0826       0       19   1.310507  hypothetical protein
Sbal195_0827       0       17   1.942162  methyl-accepting chemotaxis sensory transducer
Sbal195_0828      -1       19   3.416863  *molybdate transporter ATP-binding protein
Sbal195_0829      -2       19   3.790070  molybdate ABC transporter permease
Sbal195_0830      -1       18   3.320404  molybdenum ABC transporter periplasmic
Sbal195_0831      -2       18   3.318190  ModE family transcriptional regulator
Sbal195_0832      -2       17   3.018505  RND family efflux transporter MFP subunit
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0821  NEISSPPORIN   54     3e-10    Neisseria sp. porin signature.
		>NEISSPPORIN#Neisseria sp. porin signature.

Length = 348

Score = 53.8 bits (129), Expect = 3e-10
Identities = 42/200 (21%), Positives = 84/200 (42%), Gaps = 22/200 (11%)

Query: 1 MKKTFISASVASVLALASFGALAEGPSFYGRLELALTHTDTAATLQGGTNGINTTNYADE 60
MKK+ I A LA A+A+ + YG ++ + + G + + T
Sbjct: 1 MKKSLI----ALTLAALPVAAMAD-VTLYGAIKAGVQTYRSVEHTDGKVSKVET------ 49

Query: 61 NNAGTYLENNFSLLGVKGSEKIADGFEAIYQMEFQVENTSTTGDVFKARNTYLGLKTNAG 120
G+ + + S +G KG E + +G +A++Q+E Q + + T + + +++GLK G
Sbjct: 50 ---GSEIADFGSKIGFKGQEDLGNGLKAVWQLE-QGASVAGTNTGWGNKQSFVGLKGGFG 105

Query: 121 TVLIGRNDTVFKQAEGGVDIFGNTNADIDRLISGQTRSADGMW----YYSPKIAGLVTLN 176
T+ G ++ K V+ + + + L + + Y SP+ AG +
Sbjct: 106 TIRAGSLNSPLKNTGANVNAWESGKFTGNVLEISGMAQREHRYLSVRYDSPEFAG---FS 162

Query: 177 ATYLFDDNDTTAKTSESLYA 196
+ + D + ES +
Sbjct: 163 GSVQYAPKDNSGSNGESYHV 182


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0822  ECOLNEIPORIN  47     4e-08    E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 47.1 bits (112), Expect = 4e-08
Identities = 68/366 (18%), Positives = 121/366 (33%), Gaps = 47/366 (12%)

Query: 1 MKKTLLTTAIISSLTLYSFTALADGPEFYGHADLAITNSDT----GYATQNQKDGTVLEN 56
MKK+L+ A+ + + A YG + S + G + + GT + +
Sbjct: 1 MKKSLI--ALTLAALPVAAMADVT---LYGTIKAGVETSRSVAHNGAQAASVETGTGIVD 55

Query: 57 NFSWLGVKGSEAVSPYFDIIYQMEFGVENFDNSNKTFNARNTFLGIRSAAGTALVGRNDT 116
S +G KG E + I+Q+E + N R +F+G++ G VGR ++
Sbjct: 56 LGSKIGFKGQEDLGNGLKAIWQVEQKASIAGTDSGWGN-RQSFIGLKGGFGKLRVGRLNS 114

Query: 117 VFKSA------EGSFDLFGNTNADIDLLVAGQTRSADGISYYSPKIADLVTLNATYLMSD 170
V K + D G +A + Y SP+ A L + Y ++D
Sbjct: 115 VLKDTGDINPWDSKSDYLGVNK------IAEPEARLISVRYDSPEFAGLSG-SVQYALND 167

Query: 171 NYDQVDNNGDEVYSKDHMYALSATLGDKAFKAQNYYVAAAYSDGIDNVEAYRGVAQAKFG 230
N + E Y Y + ++ + + + +R V+
Sbjct: 168 N---AGRHNSESYHAGFNYKNGGFFVQYGGAYKRHHQVQENVNI-EKYQIHRLVSGYDND 223

Query: 231 DVILG--ALYQHSEHVDDKFANLSGDTYFVNAAYLIGDLKLKMMYGKDDSGLGKYVSRYV 288
+ Q ++ V++ +++ S AY G++ ++ Y G
Sbjct: 224 ALYASVAVQQQDAKLVEENYSHNSQTEVAATLAYRFGNVTPRVSYAH---GFKGSFD-AT 279

Query: 289 GDQNNSGLEQVRDVDLQQFSVGADYRLSTKTMVYGHYTRFDGDLMLADVKQDLNDNVFTV 348
N D Q VGA+Y S +T + V
Sbjct: 280 NYNN----------DYDQVVVGAEYDFSKRTSALVSAGWLQEG----KGESKFVSTAGGV 325

Query: 349 GLRVDF 354
GLR F
Sbjct: 326 GLRHKF 331


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0824  PF06580       35     6e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 34.8 bits (80), Expect = 6e-04
Identities = 11/61 (18%), Positives = 22/61 (36%), Gaps = 4/61 (6%)

Query: 406 EISVEPTLNIQTNISLLNQIVSNILSNAFTHAFQGRED-NLIHISATVDEESILISIQNN 464
E + P + +L Q ++ N H I + T D ++ + ++N
Sbjct: 243 ENQINPAIMDVQVPPMLVQT---LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENT 299

Query: 465 G 465
G
Sbjct: 300 G 300


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0832  RTXTOXIND     64     3e-13    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 64.1 bits (156), Expect = 3e-13
Identities = 34/181 (18%), Positives = 59/181 (32%), Gaps = 39/181 (21%)

Query: 106 NLKDALASLKSINAQFRAKQAQIRQAKLEFSRQQQMLADKASSRADY-----EVADAN-- 158
NL A ++ A+ + R K +L +A ++ + +A
Sbjct: 208 NLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNE 267

Query: 159 LTVYQADLEQLEAQKQQAEINV-----------------------------DSARIDLGY 189
L VY++ LEQ+E++ A+
Sbjct: 268 LRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQA 327

Query: 190 TKITAPMDGTVVYSAV-EVGQTVNANQTTPTIVEMAQLDTMTVKAQISEADVVNVHPGQA 248
+ I AP+ V V G V +T IV + DT+ V A + D+ ++ GQ
Sbjct: 328 SVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIV--PEDDTLEVTALVQNKDIGFINVGQN 385

Query: 249 V 249

Sbjct: 386 A 386



Score = 46.0 bits (109), Expect = 2e-07
Identities = 31/182 (17%), Positives = 68/182 (37%), Gaps = 16/182 (8%)

Query: 7 MKKSSKRKLLLALSGLILLGGGAYFMWHKPETAPAYVTEAVRRGDIENSVLANGMLQAS- 65
S+R L+A I+ F+ + + + +E ANG L S
Sbjct: 50 ETPVSRRPRLVAY--FIMGFLVIAFIL-------SVLGQ------VEIVATANGKLTHSG 94

Query: 66 KLVSVGAQVSGQILSLPLALGDEVKKGDLIAQIDSLAQQNNLKDALASLKSINAQFRAKQ 125
+ + + + + + G+ V+KGD++ ++ +L + + +SL + Q
Sbjct: 95 RSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQ 154

Query: 126 AQIRQAKLEFSRQQQMLADKASSRADYEVADANLTVYQADLEQLEAQKQQAEINVDSARI 185
R +L + ++ + E ++ + + QK Q E+N+D R
Sbjct: 155 ILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRA 214

Query: 186 DL 187
+
Sbjct: 215 ER 216


11  Sbal195_0845  Sbal195_0853  Y  N  Y  Genomic Island
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_0845  2       21       -0.433749  beta-lactamase
Sbal195_0846  2       21       -0.682623  LysR family transcriptional regulator
Sbal195_0847  3       23       -1.085618  carbamoyl-phosphate synthase L chain
Sbal195_0848  -1      20       -2.358917  ISSod10, transposase OrfA
Sbal195_0849  3       26       -2.043001  ISSod10, transposase OrfA
Sbal195_0851  3       23       -1.450163  ISSod10, transposase OrfB
Sbal195_0852  3       24       -1.571186  transposase IS116/IS110/IS902 family protein
Sbal195_0853  3       24       -1.290509  transposase IS116/IS110/IS902 family protein
12  Sbal195_0975  Sbal195_0980  Y  N  N  Genomic Island
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_0975  1       20       -6.055348  DSBA oxidoreductase
Sbal195_0976  2       20       -5.361666  heat shock protein DnaJ domain-containing
Sbal195_0977  2       23       -5.900396  dihydropteridine reductase
Sbal195_0978  1       23       -6.018380  N-acetyltransferase GCN5
Sbal195_0979  1       18       -4.810986  hypothetical protein
Sbal195_0980  0       15       -4.375052  hypothetical protein
13  Sbal195_0998  Sbal195_1018  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_0998  -1      18       4.025856   Na+ dependent nucleoside transporter
Sbal195_0999  -1      18       3.961970   LysR family transcriptional regulator
Sbal195_1000  -1      16       3.761078   dienelactone hydrolase
Sbal195_1001  -1      12       -1.207534  gluconate 2-dehydrogenase
Sbal195_1002  1       15       0.433570   2Fe-2S iron-sulfur cluster-binding
Sbal195_1003  1       15       1.028244   aldehyde oxidase and xanthine dehydrogenase
Sbal195_1004  1       14       -0.241630  hypothetical protein
Sbal195_1005  0       15       0.409491   hypothetical protein
Sbal195_1006  1       14       0.898378   hypothetical protein
Sbal195_1007  1       23       4.662557   B12-dependent methionine synthase
Sbal195_1008  1       22       4.896396   phosphoglycerate mutase
Sbal195_1009  0       20       4.041025   ABC transporter-like protein
Sbal195_1010  0       20       4.173021   transport system permease
Sbal195_1011  1       17       3.846933   nicotinate-nucleotide--dimethylbenzimidazole
Sbal195_1012  1       19       3.589469   cobalamin synthase
Sbal195_1013  0       17       4.053071   cobalbumin biosynthesis protein
Sbal195_1014  0       16       3.731420   cobyric acid synthase
Sbal195_1015  0       16       3.370403   cob(I)yrinic acid a,c-diamide
Sbal195_1016  0       17       3.083202   hypothetical protein
Sbal195_1017  0       16       3.135073   periplasmic-binding protein
Sbal195_1018  0       18       3.602579   aldehyde dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0998  PREPILNPTASE  30     0.015    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 30.2 bits (68), Expect = 0.015
Identities = 28/116 (24%), Positives = 51/116 (43%), Gaps = 10/116 (8%)

Query: 3 AILGMISILFFAWLLSVNRKNIPYRTVILALGLQIIFALLVLYVPAGKAVLQSVTAGVSS 62
A L + +L + +++ +P + + L ++F LL +V G AV+ ++ +
Sbjct: 136 AALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGDAVIGAMAGYLVL 195

Query: 63 VIAY-------GNEGIGFLFGDLA-TGKVGFVFAINVLGIIIFFSALISALYHIGL 110
Y G EG+G +GD +G L I++ S+L+ A IGL
Sbjct: 196 WSLYWAFKLLTGKEGMG--YGDFKLLAALGAWLGWQALPIVLLLSSLVGAFMGIGL 249


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1005  CHANLCOLICIN  30     0.002    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 30.4 bits (68), Expect = 0.002
Identities = 20/74 (27%), Positives = 31/74 (41%), Gaps = 5/74 (6%)

Query: 33 LTQGLNEFAMQQKQTELARQQATKDRQLAEYQIQQELQQNAAEKSRLAKQNEAARLRKAE 92
LTQ L + + + +R + + A Q E+ RLAK E AR ++AE
Sbjct: 90 LTQRLKDIVNEALRHNASRTPSATELAHANNAAMQA----EDERLRLAKAEEKAR-KEAE 144

Query: 93 AWRKYYLVPEDCKN 106
A K + E +
Sbjct: 145 AAEKAFQEAEQRRK 158


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1007  BCTERIALGSPD  32     0.018    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 31.8 bits (72), Expect = 0.018
Identities = 13/67 (19%), Positives = 29/67 (43%), Gaps = 5/67 (7%)

Query: 354 AGLEPLTIDAQSLFVNVGERTN---VTGSAKFLKLIKDGKFEQALDVAREQVESGAQIID 410
+P+ +++ + +TN VT + + ++ LD+ R QV A I +
Sbjct: 298 QAAKPVAALDKNIIIKAHGQTNALIVTAAPDVMNDLER--VIAQLDIRRPQVLVEAIIAE 355

Query: 411 INMDEGM 417
+ +G+
Sbjct: 356 VQDADGL 362


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1017  FERRIBNDNGPP  38     4e-05    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 37.6 bits (87), Expect = 4e-05
Identities = 44/180 (24%), Positives = 68/180 (37%), Gaps = 16/180 (8%)

Query: 18 PHSVLADPAKRIIALSPHAVEMLYAIGAGDAIVAATDYADY------PEAAKKIPRIGGY 71
H+ DP RI+AL VE+L A+G VA D +Y P + +G
Sbjct: 28 AHAAAIDP-NRIVALEWLPVELLLALGIVPYGVA--DTINYRLWVSEPPLPDSVIDVGLR 84

Query: 72 YGIQMERVMELNPDLIVVWDTGNKA--EDINQL-KALGFNLYGSDPKTLEGVAKELEELG 128
+E + E+ P + VW G E + ++ GFN + + L K L E+
Sbjct: 85 TEPNLELLTEMKPSFM-VWSAGYGPSPEMLARIAPGRGFN-FSDGKQPLAMARKSLTEMA 142

Query: 129 KLTGHVEEASKAAAAYRAELIRLRTDNASKSE-PKVFYQLWSTPLMTV-SKNSWIQQIIS 186
L A A Y + ++ + P + L M V NS Q+I+
Sbjct: 143 DLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVFGPNSLFQEILD 202


14  Sbal195_1157  Sbal195_1171  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_1157  1       30       -4.480407  lipoprotein signal peptidase
Sbal195_1158  4       34       -5.663875  FKBP-type peptidylprolyl isomerase
Sbal195_1159  5       35       -6.005485  4-hydroxy-3-methylbut-2-enyl diphosphate
Sbal195_1160  5       39       -7.013328  type IV pilus modification protein PilV
Sbal195_1161  5       39       -7.330865  type IV pilus assembly protein PilW
Sbal195_1162  5       39       -7.162373  type IV pilus assembly protein PilX
Sbal195_1163  6       37       -7.154244  type IV pilin biogenesis protein
Sbal195_1164  1       30       -5.999836  type IV pilus biogenesis protein PilE
Sbal195_1165  0       26       -5.927348  hypothetical protein
Sbal195_1166  2       32       -6.376392  type IV pilus biogenesis protein
Sbal195_1167  1       32       -6.958579  type IV pilus biogenesis protein
Sbal195_1168  2       33       -7.520915  nitrogen regulatory protein P-II
Sbal195_1169  1       31       -6.974295  FAD-dependent pyridine nucleotide-disulfide
Sbal195_1170  0       29       -6.712003  LacI family transcriptional regulator
Sbal195_1171  0       26       -5.448639  TonB-dependent receptor
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1160  BCTERIALGSPG  32     7e-04    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 31.8 bits (72), Expect = 7e-04
Identities = 10/24 (41%), Positives = 18/24 (75%), Gaps = 2/24 (8%)

Query: 21 QRGFSLIEVLVALVIL--VIGLIG 42
QRGF+L+E++V +VI+ + L+
Sbjct: 7 QRGFTLLEIMVVIVIIGVLASLVV 30


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1164  BCTERIALGSPG  52     1e-11    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 52.2 bits (125), Expect = 1e-11
Identities = 20/61 (32%), Positives = 38/61 (62%)

Query: 6 KGFTLIEVMITVVIIGILAAIAYPSYTQYIALSARSEGLAALMRIANLQEQYYLDNRVYA 65
+GFTL+E+M+ +VIIG+LA++ P+ + + + ++ ++ + N + Y LDN Y
Sbjct: 8 RGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDNHHYP 67

Query: 66 T 66
T
Sbjct: 68 T 68


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1166  BCTERIALGSPG  35     3e-05    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 35.2 bits (81), Expect = 3e-05
Identities = 12/28 (42%), Positives = 20/28 (71%)

Query: 6 KGFTLVELMVTIAVAAILLTIGVPSLTS 33
+GFTL+E+MV I + +L ++ VP+L
Sbjct: 8 RGFTLLEIMVVIVIIGVLASLVVPNLMG 35


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1167  BCTERIALGSPG  33     2e-04    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 32.9 bits (75), Expect = 2e-04
Identities = 14/50 (28%), Positives = 30/50 (60%), Gaps = 3/50 (6%)

Query: 5 QKGFSLIELMTTLSISTILFTVGTPSFT---DLSDQIRADSNIRTIQQTL 51
Q+GF+L+E+M + I +L ++ P+ + +D+ +A S+I ++ L
Sbjct: 7 QRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENAL 56


15  Sbal195_1274  Sbal195_1285  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_1274  2       14       -0.686315  sigma E regulatory protein MucB/RseB
Sbal195_1275  3       15       -0.567743  sigma E positive regulator RseC/MucC
Sbal195_1276  3       16       -0.560048  GTP-binding protein LepA
Sbal195_1277  2       15       0.464231   signal peptidase I
Sbal195_1278  2       18       0.474363   ribonuclease III
Sbal195_1279  0       13       0.707284   GTP-binding protein Era
Sbal195_1280  -1      12       0.449506   DNA repair protein RecO
Sbal195_1281  -2      13       0.803270   pyridoxine 5'-phosphate synthase
Sbal195_1282  -2      13       0.783824   4'-phosphopantetheinyl transferase
Sbal195_1283  -1      13       0.198090   hypothetical protein
Sbal195_1284  0       13       0.600247   hypothetical protein
Sbal195_1285  2       17       0.693550   cytochrome c assembly protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1276  TCRTETOQM     153    1e-41    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 153 bits (387), Expect = 1e-41
Identities = 99/455 (21%), Positives = 177/455 (38%), Gaps = 93/455 (20%)

Query: 1 MKQIRNFSIIAHIDHGKSTLSDRLIQVCGGLTD-REMDA--QVLDSMDLERERGITIKAQ 57
MK I N ++AH+D GK+TL++ L+ G +T+ +D D+ LER+RGITI+
Sbjct: 1 MK-IINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTG 59

Query: 58 SVTLDYKAKDGLVYQLNFIDTPGHVDFSYEVSRSLAACEGALLVVDAGQGVEAQTLANCY 117
+ + ++N IDTPGH+DF EV RSL+ +GA+L++ A GV+AQT +
Sbjct: 60 ITSFQW-----ENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFH 114

Query: 118 TALDMNLDVVPILNKIDLPQADPERVAAEIEDIVGIDAI----------------DAVRC 161
M + + +NKID D V +I++ + + +
Sbjct: 115 ALRKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQ 174

Query: 162 SAKTGVGVDEVLEVIVAKIPPPEGDPNAPLQALIID------------------------ 197
G D++LE ++ + +
Sbjct: 175 WDTVIEGNDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVI 234

Query: 198 -SWF--------------------DNYLGVVSLVRIKHGSLKKGDKFKVMSTGQNHTADR 236
+ F ++ +R+ G L D + ++
Sbjct: 235 TNKFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSV------RISEKEK 288

Query: 237 VGI---FTPKQTDKTELKTGEVGFVIAGLKEI--HGAPVGDTLTLAKNGA-EKPLPGFKK 290
+ I +T + ++ G ++ E + +GDT L + E PLP +
Sbjct: 289 IKITEMYTSINGELCKIDKAYSGEIVILQNEFLKLNSVLGDTKLLPQRERIENPLPLLQT 348

Query: 291 VKPQVYAGVFPISTDEYENFRDALNKLSLNDASLFFEPESSSALGFGFRIGYLGLLHMEI 350
V P + E DAL ++S +D L + +S++ + +LG + ME+
Sbjct: 349 T-------VEPSKPQQREMLLDALLEISDSDPLLRYYVDSATH---EIILSFLGKVQMEV 398

Query: 351 VQERLEREYNLELITTAPTVVY-EVVMTSGETIYV 384
L+ +Y++E+ PTV+Y E + E
Sbjct: 399 TCALLQEKYHVEIEIKEPTVIYMERPLKKAEYTIH 433



Score = 34.4 bits (79), Expect = 0.001
Identities = 15/67 (22%), Positives = 28/67 (41%), Gaps = 1/67 (1%)

Query: 398 EMREPIVEANILVPKEYLGNVITLCIEKRGTQVNMVYHGNQVAVTYHLPMAEVVMDFFDR 457
E+ EP + I P+EYL T + V+ N+V ++ +P + ++
Sbjct: 534 ELLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKNNEVILSGEIPARC-IQEYRSD 592

Query: 458 LKSTSRG 464
L + G
Sbjct: 593 LTFFTNG 599


16  Sbal195_1545  Sbal195_1572  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_1545  -1      18       5.409467   hypothetical protein
Sbal195_1546  -1      19       5.159697   hypothetical protein
Sbal195_1547  1       20       5.063249   hypothetical protein
Sbal195_1548  1       21       4.394641   TetR family transcriptional regulator
Sbal195_1549  1       20       4.106765   secretion protein HlyD family protein
Sbal195_1550  2       19       3.201341   ABC transporter-like protein
Sbal195_1551  0       16       1.538891   ABC transporter
Sbal195_1552  -2      16       -2.074941  acetyl-CoA hydrolase/transferase
Sbal195_1553  1       19       -4.422473  hypothetical protein
Sbal195_1554  0       19       -3.883621  beta-lactamase
Sbal195_1555  2       21       -5.008891  transposase IS3/IS911 family protein
Sbal195_1556  0       20       -4.040446  integrase catalytic subunit
Sbal195_1557  0       18       -3.670836  hypothetical protein
Sbal195_1558  -1      16       -0.718697  hypothetical protein
Sbal195_1559  0       16       0.645102   hypothetical protein
Sbal195_1560  -2      15       0.560608   lysine exporter protein LysE/YggA
Sbal195_1561  -2      14       1.049334   hypothetical protein
Sbal195_1562  -1      16       1.219200   hypothetical protein
Sbal195_1563  0       16       1.280422   putative esterase
Sbal195_1564  0       18       1.894630   hypothetical protein
Sbal195_1565  2       17       1.171021   amino acid-binding ACT domain-containing
Sbal195_1566  1       18       1.031835   binding-protein-dependent transport system inner
Sbal195_1567  2       14       -2.142346  phosphate ABC transporter permease
Sbal195_1568  3       19       -6.571360  phosphate transporter ATP-binding protein
Sbal195_1569  1       20       -7.169281  transcriptional regulator PhoU
Sbal195_1570  0       21       -7.657651  hypothetical protein
Sbal195_1571  1       17       -5.405488  hypothetical protein
Sbal195_1572  1       17       -5.172894  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1548  HTHTETR       75     1e-18    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 75.0 bits (184), Expect = 1e-18
Identities = 29/162 (17%), Positives = 59/162 (36%), Gaps = 6/162 (3%)

Query: 31 SDARQRLITAALSLFSHRSYPTVSTREIAREAEVDAALIRYYFGSKAGLFEQMVRETLEP 90
+ RQ ++ AL LFS + + S EIA+ A V I ++F K+ LF ++ +
Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESN 69

Query: 91 VLTRLREISAAEAPNN---VGEIMQTYYRVMAPNPGLPRLIIRVLQEGDGSEPYRIILSV 147
+ E A + + EI+ L+ + + + ++
Sbjct: 70 IGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQA 129

Query: 148 FEQVLTLSRQWLESTL---VNSGLLKEGVDPDLARLSFVSLM 186
+ S +E TL + + +L + A + +
Sbjct: 130 QRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYI 171


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1549  RTXTOXIND     55     3e-10    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 54.8 bits (132), Expect = 3e-10
Identities = 32/176 (18%), Positives = 62/176 (35%), Gaps = 17/176 (9%)

Query: 86 TVERDRLTLTAPVGELITQVNVVEGQQVKAGEVLLTLDSTSANARLALRQAELEQAKAKL 145
T + ++ ++ V EG+ V+ G+VLL L + A A Q+ L Q A+L
Sbjct: 91 THSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQ--ARL 148

Query: 146 SEAVTGARLEDIERAKAVLDGAKASVKEAQRAFERTNRLYATKVLSQADLDTARAARDTS 205
+ IE + + F+ + +VL L + T
Sbjct: 149 EQTRYQILSRSIEL-----NKLPELKLPDEPYFQNVS---EEEVLRLTSL--IKEQFSTW 198

Query: 206 LAKQAEAEQSLRLLENGTRSEQLEQAKAAVAAASASVAIEQKALADLSLVAARDAV 261
++ + E +L + + A + +E+ L D S + + A+
Sbjct: 199 QNQKYQKELNLD-----KKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAI 249



Score = 49.1 bits (117), Expect = 2e-08
Identities = 31/232 (13%), Positives = 78/232 (33%), Gaps = 15/232 (6%)

Query: 108 VEGQQVKAGEVLLTLDSTSANARLALRQAELEQAKAKLSEAVTGARLEDIERAKAVLDGA 167
V ++V L+ ++ + ++ L++ +A+ + AR+ E V
Sbjct: 179 VSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVL--ARINRYENLSRVEKSR 236

Query: 168 KASVKE-AQRAFERTNRLYATK---VLSQADLDTARAARDTSLAKQAEAEQSLRLLENGT 223
+ + + + V + +L ++ + ++ A++ +L+
Sbjct: 237 LDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLF 296

Query: 224 RSEQLEQAKAAVAAASASVAIEQKALADLS---LVAARDAVVDTLP-WRVGDRIAAGTQL 279
++E L++ + K + A V L G + L
Sbjct: 297 KNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETL 356

Query: 280 IGLLASEDPY-VRVYLPATWLDRVKAGDKVNIRVDG----REMPIAGTVRNI 326
+ ++ +D V + + + G I+V+ R + G V+NI
Sbjct: 357 MVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNI 408


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1551  ABC2TRNSPORT  40     8e-06    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 39.9 bits (93), Expect = 8e-06
Identities = 48/200 (24%), Positives = 91/200 (45%), Gaps = 24/200 (12%)

Query: 186 GVILTMTMVMFT----SAAIVREREQGNMEFLITTPVRPLELMLGKI--------VPYVI 233
G++ T M T AA R Q E ++ T +R +++LG++ +
Sbjct: 72 GMVATSAMTAATFETIYAAFGRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAG 131

Query: 234 VGFVQVTIILSAG-HLLFDVPIRGGIDSIALAAMLFICASLTLGLVISTIAKTQLQSMQM 292
+G V + + LL+ +P+ IAL + F +LG+V++ +A + +
Sbjct: 132 IGVVAAALGYTQWLSLLYALPV------IALTGLAFA----SLGMVVTALAPSYDYFIFY 181

Query: 293 TVFILLPSILLSGFMFPYEAMPIAAQWIAEALPATHFMRMSRAIVLRDAQVMDLQFDALW 352
++ P + LSG +FP + +PI Q A LP +H + + R I+L V+D+
Sbjct: 182 QTLVITPILFLSGAVFPVDQLPIVFQTAARFLPLSHSIDLIRPIML-GHPVVDVCQHVGA 240

Query: 353 MIGFTCIGLFIASMRFSKRL 372
+ + I F+++ +RL
Sbjct: 241 LCIYIVIPFFLSTALLRRRL 260


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1555  HTHFIS        26     0.043    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 25.6 bits (56), Expect = 0.043
Identities = 10/59 (16%), Positives = 21/59 (35%), Gaps = 6/59 (10%)

Query: 7 HKSYPQAFKDEAVLMVLEQ-GYSVADAAKSLGVSTSLLYNWKEKHQALQQGITLEESER 64
+ + +L L + AA LG++ + L + G+++ S R
Sbjct: 429 YDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIREL-----GVSVYRSSR 482


17  Sbal195_1604  Sbal195_1612  Y  N  N  Genomic Island
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_1604  2       15       2.151468   glycerate kinase
Sbal195_1605  -2      16       0.848054   gluconate transporter
Sbal195_1606  0       16       0.488788   catalase domain-containing protein
Sbal195_1607  2       19       0.564747   transcriptional regulator CdaR
Sbal195_1608  4       21       -0.112322  phage SPO1 DNA polymerase domain-containing
Sbal195_1609  4       19       0.062064   hypothetical protein
Sbal195_1610  4       19       0.019833   outer membrane protein MtrB
Sbal195_1611  4       19       0.124667   cytochrome C family protein
Sbal195_1612  3       18       0.070380   decaheme cytochrome c
18  Sbal195_1653  Sbal195_1677  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_1653  -1      21       -3.711980  DNA polymerase II
Sbal195_1654  2       33       -5.512105  porin
Sbal195_1655  1       33       -5.334524  TonB-dependent receptor
Sbal195_1656  1       27       -4.396340  transposase IS4 family protein
Sbal195_1657  2       31       -4.962860  transposase, IS4 family protein
Sbal195_1658  3       33       -4.867126  transposase, IS4 family protein
Sbal195_1659  3       34       -4.594713  TonB-dependent receptor
Sbal195_1660  3       29       -3.391899  transposase IS4 family protein
Sbal195_1661  3       33       -3.476224  transposase, IS4 family protein
Sbal195_1662  4       33       -4.657143  hypothetical protein
Sbal195_1663  4       31       -4.944448  MotA/TolQ/ExbB proton channel
Sbal195_1664  2       23       -4.012004  MotA/TolQ/ExbB proton channel
Sbal195_1665  2       23       -4.012959  biopolymer transport protein ExbD/TolR
Sbal195_1666  3       21       -3.479844  TonB family protein
Sbal195_1667  3       19       -3.972960  hypothetical protein
Sbal195_1668  2       17       -2.757367  diguanylate cyclase
Sbal195_1669  1       14       -1.160009  hypothetical protein
Sbal195_1670  2       17       -0.168261  PpiC-type peptidyl-prolyl cis-trans isomerase
Sbal195_1671  1       16       0.074887   N-acetyltransferase GCN5
Sbal195_1672  5       15       1.166714   RNA-binding S4 domain-containing protein
Sbal195_1673  3       13       0.276687   hypothetical protein
Sbal195_1674  4       13       0.074203   LysR family transcriptional regulator
Sbal195_1675  3       14       -0.570545  hypothetical protein
Sbal195_1676  3       13       -0.962929  nuclease SbcCD subunit D
Sbal195_1677  3       12       -1.193487  SMC domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1654  ECOLIPORIN    70     3e-15    E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 69.9 bits (171), Expect = 3e-15
Identities = 99/417 (23%), Positives = 165/417 (39%), Gaps = 52/417 (12%)

Query: 1 MNKTLVATALAAMFLVPSVSAIEIYKDNKNAVEIGGFIDARVINTQGETEVVNG-ASRIN 59
M + ++A + A+ + A EIY + N +++ G +D + ++ +G + +
Sbjct: 1 MKRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSK--DGDQTYMR 58

Query: 60 FGFNRE--LTDGWKAFAKLEWGVNPVGNSDIVYNNRFESVQEEFFYNRLGYAGLSHDTYG 117
GF E + D + + E+ V N E + RL +AGL YG
Sbjct: 59 VGFKGETQINDQLTGYGQWEYNVQ---------ANTTEGEGANSW-TRLAFAGLKFGDYG 108

Query: 118 TLTIGKQWGAWYDVVYNTNYGFVWDGNTAGVYTYNKDDGAVNGVGRGDKTVQYRNA--FG 175
+ G+ +G YDV T+ + G++ Y N G NGV YRN FG
Sbjct: 109 SFDYGRNYGVLYDVEGWTDMLPEFGGDSYT-YADNYMTGRANGV------ATYRNTDFFG 161

Query: 176 DV---SFAVQAQLKNS--SFYTCDTTDDITQAQCQANWESGDKAAQQVEYNYTYGGALTY 230
V +FA+Q Q KN S + + +++GD Y+ G +
Sbjct: 162 LVDGLNFALQYQGKNESQSADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGA 221

Query: 231 KVTDMLTLTAGVNRGEFDVSFGNGEQTTAVDLIYGAGITWGNFDNDGLYAAA------NV 284
T VN G + G++ A + AG+ +D + +Y A N+
Sbjct: 222 AYTTSDRTNEQVNAG---GTIAGGDKADA----WTAGL---KYDANNIYLATMYSETRNM 271

Query: 285 NRQENHDTDNIGRLIKDAYGIESLVSYKFDNGLRPFISYNVLDAGKDYVIQPNFNADPND 344
D G + E Y+FD GLRP +S+ ++ GKD + N N D D
Sbjct: 272 TPYGKTDKGYDGGVANKTQNFEVTAQYQFDFGLRPAVSF-LMSKGKD-LTYNNVNGDDKD 329

Query: 345 EFKRQFLVVGLHFVWDPNTVLYIEARKDYSDFTSADKDQEARMALSESDGVAIGIRY 401
K + VG + ++ N Y++ + + D D +S D VA+G+ Y
Sbjct: 330 LVK--YADVGATYYFNKNFSTYVDYKINLLD---DDDPFYKDAGISTDDIVALGMVY 381


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1666  PF03544       101    1e-28    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 101 bits (253), Expect = 1e-28
Identities = 34/169 (20%), Positives = 64/169 (37%), Gaps = 11/169 (6%)

Query: 39 TPVIEITMDRQDSKAQNKPRVVPKPPPPPEQPQKPDTTPPDSSSNID----TAMSFNMGG 94
P E + K PKP P P+ P + N
Sbjct: 75 EPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAP 134

Query: 95 VEAGGASTG-FKLGNMMTRDGDATPIVRIEPQYPIAAARDGKEGWVQLRFTINELGGIDD 153
++ + + + R +PQYP A EG V+++F + G +D+
Sbjct: 135 ARPTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIEGQVKVKFDVTPDGRVDN 194

Query: 154 VEVIQAEPKRLFDKEAIRALKKWKYKPKIVDGKPLKQPGQTVQLDFTLD 202
V+++ A+P +F++E A+++W+Y+P G V + F ++
Sbjct: 195 VQILSAKPANMFEREVKNAMRRWRYEP------GKPGSGIVVNILFKIN 237


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1667  SYCDCHAPRONE  29     0.021    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 29.1 bits (65), Expect = 0.021
Identities = 11/51 (21%), Positives = 21/51 (41%)

Query: 197 YFNQKKYKKAVGVLEVMVPLFPDDGRLWVQLAQFYLMVEDYDKSLATYDLA 247
+ KY+ A V + + L D R ++ L + YD ++ +Y
Sbjct: 46 QYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYG 96


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1677  IGASERPTASE   49     8e-08    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 49.3 bits (117), Expect = 8e-08
Identities = 48/306 (15%), Positives = 104/306 (33%), Gaps = 14/306 (4%)

Query: 198 AADISALVKDQRSRRDGILQSAGLASDDELSNELAKLTPELALA--QSAKEQALQQQQLI 255
+I A V S + I + ++ T +A Q +K +Q
Sbjct: 1000 PNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDAT 1059

Query: 256 IKASDAAQHLLAEFAQFDTLTQTAAALEAQQESIVAQTHKLNLAEQAQRLAPMIEVFLAR 315
+ + + TQT ++ E+ QT + ++ + + +
Sbjct: 1060 ETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTE---TKETATVEKEEKAKVET 1116

Query: 316 EQEAKAANLAFSHAQTALTQAKQAFDDAELKAQDLPVLEASLLEQEQAKQQLNALGPQL- 374
E+ + + S Q++ AE ++ P + ++ Q++ A Q
Sbjct: 1117 EKTQEVPKVT-SQVSPKQEQSETVQPQAEPARENDPTVNI---KEPQSQTNTTADTEQPA 1172

Query: 375 RELDRLNKTLEQEQAQLVKAKTQLQISKNELTAASQKRRELESALPQLQANSDTRLTLQQ 434
+E + E + + ++ +N A +Q ES+ N R
Sbjct: 1173 KETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESS--NKPKNRHRRSVRSV 1230

Query: 435 AHQQQQQLLSTYQQWQQVAARVSS--TKAKLANAKAQGQQLNAEHQQAQVAHKALLITWH 492
H + S+ + ++S T A L++A+A+ Q + +A H + L +
Sbjct: 1231 PHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVALNVGKAVSQHISQLEMNN 1290

Query: 493 QGQAAI 498
+GQ +
Sbjct: 1291 EGQYNV 1296



Score = 43.9 bits (103), Expect = 4e-06
Identities = 49/325 (15%), Positives = 95/325 (29%), Gaps = 32/325 (9%)

Query: 277 QTAAALEAQQESIVAQTHKLNLAEQAQRLAPMIEVFLAREQEAKAANLAFSHAQTALTQA 336
+ E+ V +E + +A +QE+K A Q
Sbjct: 1012 SNNEEIARVDEAPVPPPAPATPSETTETVA------ENSKQESKTVEKNEQDATETTAQN 1065

Query: 337 KQAFDDAELKAQDLPVLEASLLEQEQAKQQLNALGPQLRELDRLNKTLEQEQAQLVKAKT 396
++ +A+ ++A+ E A+ Q E ++E+A++ KT
Sbjct: 1066 REVAKEAK------SNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKT 1119

Query: 397 QLQISKNELTAASQKRRELESALPQLQANSDTRLTLQQAHQQQQQLLSTYQQWQQVAARV 456
Q + Q++ E + +D + +++ Q T +Q A
Sbjct: 1120 QEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADT----EQPAKET 1175

Query: 457 SSTKAKLANAKAQGQQLNAEHQQAQVAHKALLITWHQGQAAILARQLQQDEPCPVCGSQI 516
SS + N+ + + A ++P +
Sbjct: 1176 SSNVEQPVTESTTVNTGNSVVENPENTTPA--------TTQPTVNSESSNKPKNRHRRSV 1227

Query: 517 HPQPAQSQEPL---PSDEALQLAQDAETTAQEVLSKARA--EYRGLQTQLETLQQQAQ-- 569
P + + L T VLS ARA ++ L Q +Q
Sbjct: 1228 RSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVALNVGKAVSQHISQLE 1287

Query: 570 -DLAAQLGTAVDISQDQHAHTLSQY 593
+ Q V + ++ SQY
Sbjct: 1288 MNNEGQYNVWVSNTSMNKNYSSSQY 1312


19  Sbal195_1719  Sbal195_1750  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_1719  2       21       -0.479582  phage integrase family protein
Sbal195_1720  3       20       -0.497497  integron integrase
Sbal195_1721  2       21       0.127984   integrase domain-containing protein
Sbal195_1722  1       21       -0.986594  transposase IS4 family protein
Sbal195_1723  1       22       -2.369651  integrase family protein
Sbal195_1724  0       23       -4.658409  putative transposase
Sbal195_1725  2       30       -8.701655  hypothetical protein
Sbal195_1726  4       32       -9.243460  heat shock protein DnaJ-like protein
Sbal195_1727  5       33       -8.620303  hypothetical protein
Sbal195_1728  7       31       -8.151056  hypothetical protein
Sbal195_1729  7       28       -7.778460  hypothetical protein
Sbal195_1730  7       28       -7.360200  hypothetical protein
Sbal195_1731  5       31       -7.146616  hypothetical protein
Sbal195_1732  3       31       -7.017702  hypothetical protein
Sbal195_1733  2       32       -8.059144  hypothetical protein
Sbal195_1734  3       32       -7.937804  hypothetical protein
Sbal195_1735  2       31       -6.678304  LysR family transcriptional regulator
Sbal195_1736  2       27       -5.771606  short chain dehydrogenase
Sbal195_1737  2       25       -6.098900  IS91 family transposase
Sbal195_1738  0       23       -3.467804  hypothetical protein
Sbal195_1739  2       22       -4.148730  hypothetical protein
Sbal195_1740  1       20       -3.563323  integrase catalytic subunit
Sbal195_1741  2       21       -3.957543  transposase IS3/IS911 family protein
Sbal195_1742  1       21       -4.050329  hypothetical protein
Sbal195_1743  1       19       -2.680093  peptidase M23B
Sbal195_1745  1       18       -1.377299  hypothetical protein
Sbal195_1746  1       17       -0.291498  hypothetical protein
Sbal195_1747  1       17       0.007775   23S rRNA methyltransferase A
Sbal195_1748  1       18       0.544122   cold-shock DNA-binding domain-containing
Sbal195_1749  1       15       0.647009   sulfate transporter
Sbal195_1750  2       19       0.769376   ribonuclease
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1735  PF05043       28     0.035    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 28.4 bits (63), Expect = 0.035
Identities = 17/64 (26%), Positives = 30/64 (46%), Gaps = 3/64 (4%)

Query: 2 NKLDRLDIKQLRVFQALIREQSA---SKAASQLGLTQQAVSEQLKKLRDVFEDRLFLRKT 58
+ L + +QL + + L + S+ A L T++AV + L ++ F D +F T
Sbjct: 3 DLLSKKSHRQLELLELLFEHKRWFHRSELAELLNCTERAVKDDLSHVKSAFPDLIFHSST 62

Query: 59 NGFV 62
NG
Sbjct: 63 NGIR 66


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1736  DHBDHDRGNASE  41     1e-06    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 41.2 bits (96), Expect = 1e-06
Identities = 29/124 (23%), Positives = 49/124 (39%), Gaps = 7/124 (5%)

Query: 39 VDITDENSIRA----LYEKVGHFDAVVNTVGFCEYATFADMTESQWMATVMSKMMGQISL 94
D+ D +I + ++G D +VN G +++ +W AT G +
Sbjct: 64 ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNA 123

Query: 95 VRIGQDYIAD--NGSFTLISGILNVKPIPYAIADATTSGAIDTFVKCVAHEM-PRGTRIN 151
R Y+ D +GS + P A A++ A F KC+ E+ R N
Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183

Query: 152 VVNP 155
+V+P
Sbjct: 184 IVSP 187


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1741  HTHFIS        26     0.027    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 25.9 bits (57), Expect = 0.027
Identities = 10/59 (16%), Positives = 21/59 (35%), Gaps = 6/59 (10%)

Query: 7 HKSYPQAFKDEAVLMVLEQ-GHSVADAAKSLGVSTSLLYNWKEKHQALQQGITLEESER 64
+ + +L L + AA LG++ + L + G+++ S R
Sbjct: 429 YDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIREL-----GVSVYRSSR 482


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1750  IGASERPTASE   53     6e-09    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 53.1 bits (127), Expect = 6e-09
Identities = 38/293 (12%), Positives = 83/293 (28%), Gaps = 12/293 (4%)

Query: 540 PALKGFAAPQKVEQAPSPTVKVEAPQPGFFSKLVSAVSAMFAPSEKAEPVKVVETKTADT 599
P ++ +P ++A P S + A + P ++T +T
Sbjct: 983 PEVEKRNQTVDTTNITTPN-NIQADVPSVPSN--NEEIARVDEAPVPPPAPATPSETTET 1039

Query: 600 SAANANRRNRRNDTRRPRNAQDADKAKEGTREPRSRNPKKPADAAVSTSTQERPVREKEE 659
A N+ + ++ + + + +E +E +S V+ S E +E
Sbjct: 1040 VAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSE-----TKE 1094

Query: 660 AVKRPAKAEPKPRVQAPKDVVADVEADAPKQEVARERRQRRNMRRKVRIDNGHNTPDNAI 719
K + V + + PK V + ++ V+ ++
Sbjct: 1095 TQTTETKETATVEKEEKAKVETEKTQEVPK--VTSQVSPKQEQSETVQPQAEPARENDPT 1152

Query: 720 LIAPEDAAEVLAEIAAINAAAASTISVDTKAEVAQAPAETKAPRTRRQPRKEAAPALEAA 779
+ E ++ A A S + + V ++ P +
Sbjct: 1153 VNIKEPQSQTNTT--ADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPT 1210

Query: 780 ENIAVEEKAMSSAAAETPAVDAVKTEEQAEVVTTEVAAPADAVSQDNDAVDAE 832
N K + +V A D S + +AV ++
Sbjct: 1211 VNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSD 1263



Score = 50.4 bits (120), Expect = 4e-08
Identities = 43/273 (15%), Positives = 89/273 (32%), Gaps = 33/273 (12%)

Query: 714 TPDNAILIAPEDAAEVLAEIAAINAAAASTISVDTKAEVAQAPAETKAPRTRRQPRKEAA 773
T N I EIA ++ A + T +E + AE ++ + E
Sbjct: 998 TTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQD 1057

Query: 774 PALEAAENIAVEEKAMSSAAAETPAVDAVKTEEQAEVVTTEVAAPADAVSQDNDAVDAES 833
A+N V ++A S+ A T + ++ + + T
Sbjct: 1058 ATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTET----------------K 1101

Query: 834 ETADDQAKREQRDGQRRSRRSPRHLRAAGQRRRRDEDDQGTSTPA----------QFVPN 883
ETA + + + + +++ P+ ++ + E Q + PA +
Sbjct: 1102 ETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQ 1161

Query: 884 DELGADQEYPSEVASVRVEAPVVATKTDAVTETQVTAKSVEVETAQASEAPAVEASAVVK 943
AD E P++ S VE PV + T + VE + + + + +
Sbjct: 1162 TNTTADTEQPAKETSSNVEQPVTESTTVNTGNS-------VVENPENTTPATTQPTVNSE 1214

Query: 944 AATKVETPANDVIAAETKPVEAKSVETKATEAE 976
++ K + + + VE + +
Sbjct: 1215 SSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTV 1247


20  Sbal195_1848  Sbal195_1872  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
Sbal195_1848  2       25       -2.003839   N-acetyltransferase GCN5
Sbal195_1849  1       22       -2.761558   prolyl 4-hydroxylase subunit alpha
Sbal195_1850  1       20       -1.919211   N-acetyltransferase GCN5
Sbal195_1851  3       17       -2.297576   N-acetyltransferase GCN5
Sbal195_1852  3       14       -1.988405   hypothetical protein
Sbal195_1853  -1      12       0.040985    hypothetical protein
Sbal195_1854  -1      11       0.303349    hypothetical protein
Sbal195_1855  -1      13       -0.489865   hypothetical protein
Sbal195_1856  -1      13       -0.676416   hypothetical protein
Sbal195_1857  -1      17       -2.543412   hypothetical protein
Sbal195_1858  -2      17       -3.262895   PKD domain-containing protein
Sbal195_1859  1       30       -7.652385   hypothetical protein
Sbal195_1860  0       24       -6.505619   peptidase S1 and S6 chymotrypsin/Hap
Sbal195_1861  0       17       -5.055782   hypothetical protein
Sbal195_1862  2       17       -5.647072   putative esterase
Sbal195_1863  3       20       -3.102478   hypothetical protein
Sbal195_1864  1       20       -0.838964   hypothetical protein
Sbal195_1865  2       21       -0.542113   hypothetical protein
Sbal195_1866  2       18       0.229128    hypothetical protein
Sbal195_1867  1       17       0.620969    hypothetical protein
Sbal195_1868  2       18       1.226964    heat shock protein DnaJ domain-containing
Sbal195_1869  1       18       1.794961    Ppx/GppA phosphatase
Sbal195_1870  1       18       1.949998    polyphosphate kinase
Sbal195_1871  0       17       3.267831    putative chaperone
Sbal195_1872  -1      16       3.022970    CreA family protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Sbal195_1851  SACTRNSFRASE  41  5e-07  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 40.7 bits (95), Expect = 5e-07
Identities = 20/57 (35%), Positives = 30/57 (52%)

Query: 80 LILNDVYVTQHARCVGIGRALVQQAASYAKAHNMSYLMLETQQKNQRAQGLYEGLGF 136
++ D+ V + R G+G AL+ +A +AK ++ LMLETQ N A Y F
Sbjct: 90 ALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Sbal195_1858  MICOLLPTASE  76  4e-16  Microbial collagenase metalloprotease (M9) signature.
		>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature.

Length = 1104

Score = 76.3 bits (187), Expect = 4e-16
Identities = 34/128 (26%), Positives = 60/128 (46%), Gaps = 9/128 (7%)

Query: 784 APVASFTQVVNGAAVQLTST-STDSDGQIVSAEWSFGDNTVAVGEVVTHSYSQSGEYLVT 842
A + S + V+ + T S D DG+I + EW FGD + TH Y+++GEY V
Sbjct: 777 AVIKSDSSVIVEEEINFDGTESKDEDGEIKAYEWDFGDGEKSNEAKATHKYNKTGEYEVK 836

Query: 843 LTVTDNDGLTHSTSQTVTVVVGEVKQP------PVAQIQRINLLF-VDMFISTSYDTDGV 895
LTVTDN+G ++ S+ + VV + P ++ N + +M + + +
Sbjct: 837 LTVTDNNGGINTESKKI-KVVEDKPVEVINESEPNNDFEKANQIAKSNMLVKGTLSEEDY 895

Query: 896 IKQHKWTF 903
++ +
Sbjct: 896 SDKYYFDV 903



Score = 40.5 bits (94), Expect = 4e-05
Identities = 18/55 (32%), Positives = 31/55 (56%), Gaps = 1/55 (1%)

Query: 889 SYDTDGVIKQHKWTFDNGTRAN-GQVVLRLARRGQHTVELTVKDNDKLTGTTTLT 942
S D DG IK ++W F +G ++N + + + G++ V+LTV DN+ T +
Sbjct: 798 SKDEDGEIKAYEWDFGDGEKSNEAKATHKYNKTGEYEVKLTVTDNNGGINTESKK 852


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Sbal195_1860  V8PROTEASE  39  1e-05  V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 38.8 bits (90), Expect = 1e-05
Identities = 31/202 (15%), Positives = 66/202 (32%), Gaps = 47/202 (23%)

Query: 48 GVLINSQWILTVAHTIFYDYVGKSLMVGSKNYEIESVHIHPDYSEPDKSLLKGDLAPLMR 107
GV++ +LT H + + ++ P D G A +
Sbjct: 106 GVVVGKDTLLTNKHVVDATHGDPH-----------ALKAFPSAINQDNYPNGGFTAEQIT 154

Query: 108 FFKSRSDIALIKLT------SPVSGIEPINI-YTGKSEEGKKITVYGKGATGNGVTGEYP 160
+ D+A++K + ++P + +++ + ITV G YP
Sbjct: 155 KYSGEGDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTG-----------YP 203

Query: 161 DTKSLRVMSHFQNVIESAEGNWLAFKFDEPANALSLEGMHGSGDSGGASVIFEDSIPFLV 220
K + M + I +G + + G+SG S +F + ++
Sbjct: 204 GDKPVATMWESKGKITYLKGEAMQYDLSTT-----------GGNSG--SPVFNEKNE-VI 249

Query: 221 GLSSWQLGHGDISTFKGGLYGT 242
G+ + + F G ++
Sbjct: 250 GIHWGGVPN----EFNGAVFIN 267


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Sbal195_1869  SHAPEPROTEIN  31  0.010  Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 31.3 bits (71), Expect = 0.010
Identities = 16/36 (44%), Positives = 23/36 (63%)

Query: 158 NLVIDIGGGSTEVVIGKKNTPTQLSSLRCGCVSFNE 193
++V+DIGGG+TEV + N SS+R G F+E
Sbjct: 161 SMVVDIGGGTTEVAVISLNGVVYSSSVRIGGDRFDE 196


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Sbal195_1871  SHAPEPROTEIN  41  6e-06  Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 41.3 bits (97), Expect = 6e-06
Identities = 24/81 (29%), Positives = 42/81 (51%), Gaps = 11/81 (13%)

Query: 192 AAKRAGFVDVDFLFEPLAAGMDYEASLTDNKTVLVVDVGGGTTDCSVVKMGPAHQQKADR 251
+A+ AG +V + EP+AA + +++ +VVD+GGGTT+ +V+ +
Sbjct: 129 SAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLN--------- 179

Query: 252 SEDFLGHSGQRIGGNDLDIAL 272
+ S RIGG+ D A+
Sbjct: 180 --GVVYSSSVRIGGDRFDEAI 198


21  Sbal195_1959  Sbal195_2043  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
Sbal195_1959  2       29       5.111890    hypothetical protein
Sbal195_1960  2       26       5.116160    GP46
Sbal195_1961  3       24       4.586268    C-5 cytosine-specific DNA methylase
Sbal195_1962  2       24       3.196865    hypothetical protein
Sbal195_1963  2       25       3.520604    hypothetical protein
Sbal195_1964  3       25       4.090657    hypothetical protein
Sbal195_1965  2       20       1.168414    phage-like protein
Sbal195_1966  1       20       -0.232751   hypothetical protein
Sbal195_1967  1       22       1.506450    hypothetical protein
Sbal195_1968  2       22       1.509662    hypothetical protein
Sbal195_1969  3       26       1.960023    hypothetical protein
Sbal195_1970  1       25       1.709700    XRE family transcriptional regulator
Sbal195_1971  3       34       4.993895    XRE family transcriptional regulator
Sbal195_1972  3       38       6.106696    hypothetical protein
Sbal195_1973  4       35       5.544091    hypothetical protein
Sbal195_1974  3       35       5.562551    hypothetical protein
Sbal195_1975  3       34       5.538457    hypothetical protein
Sbal195_1976  1       29       2.394561    replication P family protein
Sbal195_1977  1       23       -5.403967   hypothetical protein
Sbal195_1978  3       27       -8.128733   integrase family protein
Sbal195_1979  7       38       -11.354516  XRE family transcriptional regulator
Sbal195_1980  8       35       -11.775513  hypothetical protein
Sbal195_1981  9       37       -11.888438  hypothetical protein
Sbal195_1982  8       37       -11.263563  metal dependent phosphohydrolase
Sbal195_1983  9       37       -10.638902  hypothetical protein
Sbal195_1984  5       30       -8.323225   hypothetical protein
Sbal195_1985  5       29       -8.217523   hypothetical protein
Sbal195_1986  5       27       -6.747163   hypothetical protein
Sbal195_1987  5       26       -6.819085   DEAD/DEAH box helicase
Sbal195_1988  4       23       -4.799030   hypothetical protein
Sbal195_1989  3       22       -4.669147   amidohydrolase 3
Sbal195_1990  5       33       -4.996666   hypothetical protein
Sbal195_1991  5       31       -4.920699   hypothetical protein
Sbal195_1992  3       27       -4.632864   hypothetical protein
Sbal195_1993  1       27       -4.784324   putative transcriptional regulator
Sbal195_1995  1       25       -4.845575   bifunctional antitoxin/transcriptional repressor
Sbal195_1996  1       24       -4.916600   hypothetical protein
Sbal195_1997  1       20       -3.411824   addiction module antitoxin
Sbal195_1998  -1      17       -0.218976   chloramphenicol acetyltransferase
Sbal195_1999  -1      18       0.601969    putative metal dependent phosphohydrolase
Sbal195_2000  -1      19       1.588406    integrase family protein
Sbal195_2001  1       27       4.810129    hypothetical protein
Sbal195_2002  1       27       4.501326    GP46
Sbal195_2003  1       25       3.164738    C-5 cytosine-specific DNA methylase
Sbal195_2004  1       27       1.516307    hypothetical protein
Sbal195_2005  2       28       1.901347    hypothetical protein
Sbal195_2006  2       29       1.853654    hypothetical protein
Sbal195_2007  2       28       2.095624    hypothetical protein
Sbal195_2008  1       23       1.994466    hypothetical protein
Sbal195_2009  3       23       3.581506    hypothetical protein
Sbal195_2010  1       17       0.553175    hypothetical protein
Sbal195_2011  1       20       -0.981348   hypothetical protein
Sbal195_2012  1       22       -2.058824   hypothetical protein
Sbal195_2013  1       26       -3.965480   integrase family protein
Sbal195_2014  1       30       -6.646161   putative transposase
Sbal195_2015  4       36       -8.918616   hypothetical protein
Sbal195_2016  5       33       -8.095339   hypothetical protein
Sbal195_2017  1       24       -4.793892   hypothetical protein
Sbal195_2018  2       22       -3.156957   hypothetical protein
Sbal195_2019  2       21       0.117535    hypothetical protein
Sbal195_2020  2       24       3.584385    putative phage repressor
Sbal195_2021  3       34       5.884214    hypothetical protein
Sbal195_2022  4       36       6.258989    hypothetical protein
Sbal195_2023  4       34       6.089593    hypothetical protein
Sbal195_2024  3       33       6.022231    hypothetical protein
Sbal195_2025  4       33       5.905327    hypothetical protein
Sbal195_2026  3       15       -0.944981   replication P family protein
Sbal195_2027  3       17       -2.265685   hypothetical protein
Sbal195_2028  3       16       -2.322250   integrase family protein
Sbal195_2029  3       20       -3.573839   XRE family transcriptional regulator
Sbal195_2030  3       19       -3.544719   hypothetical protein
Sbal195_2031  3       20       -3.782602   YD repeat-containing protein
Sbal195_2032  4       24       -4.699055   hypothetical protein
Sbal195_2033  4       23       -4.905860   hypothetical protein
Sbal195_2034  3       22       -5.133843   hypothetical protein
Sbal195_2035  2       18       -4.468348   transcriptional regulator-like protein
Sbal195_2036  1       20       -5.609754   hypothetical protein
Sbal195_2037  0       19       -5.051974   Sel1 domain-containing protein
Sbal195_2038  1       19       -4.412242   hypothetical protein
Sbal195_2039  2       17       -2.203375   peptidase S24/S26 domain-containing protein
Sbal195_2040  2       15       -2.198829   DNA-directed DNA polymerase
Sbal195_2041  2       16       -1.988719   hypothetical protein
Sbal195_2042  2       15       -0.805895   hypothetical protein
Sbal195_2043  2       14       -0.891033   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Sbal195_1989  UREASE  39  8e-05  Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 38.6 bits (90), Expect = 8e-05
Identities = 13/27 (48%), Positives = 21/27 (77%)

Query: 655 TLHPAMQHNIGDKLGSLEKGKLADMVV 681
T++PA+ H + ++GSLE GK AD+V+
Sbjct: 410 TINPAIAHGLSHEIGSLEVGKRADLVL 436


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Sbal195_2015  ACRIFLAVINRP  29  0.025  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 29.0 bits (65), Expect = 0.025
Identities = 11/35 (31%), Positives = 16/35 (45%)

Query: 6 ESVFTTWSQGPSKTEQERAENAERQIRQAIHASEK 40
+ VF T Q P+ QER + Q+ +EK
Sbjct: 568 QGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNEK 602


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Sbal195_2031  SALSPVBPROT  42  2e-05  Salmonella virulence plasmid 65kDa B protein signature.
		>SALSPVBPROT#Salmonella virulence plasmid 65kDa B protein signature.

Length = 591

Score = 42.4 bits (99), Expect = 2e-05
Identities = 44/178 (24%), Positives = 70/178 (39%), Gaps = 16/178 (8%)

Query: 296 GQASYQIPIDLPPGRNGVQPSVSLSYNSQGGNGILGVGWSLNAGSSISRCGATFAQ---- 351
G AS +P+ + R G P+++L Y+S GGNG GVGWS S Q
Sbjct: 34 GLASITLPLPISAER-GFAPALALHYSSGGGNGPFGVGWSCATMSIARSTSHGVPQYNDS 92

Query: 352 ------DGFTRAVTFSA--STDRLCLDGQRLIATTGSYGASNAEYRTEMDSFVKVVQQGN 403
DG T S + + + ++ SY + + RTE F ++
Sbjct: 93 DEFLGPDGEVLVQTLSTGDAPNPVTCFAYGDVSFPQSYTVTRYQPRTESS-FYRLEYWVG 151

Query: 404 INDSNSRFTVYKPDGNSATYGANANSRFV-PSGLSTALSWKVTQESYSDGANTIDYKY 460
++ + + ++ +G G A +R P S W V +ES + I Y Y
Sbjct: 152 NSNGDDFWLLHDSNGILHLLGKTAAARLSDPQAASHTAQWLV-EESVTPAGEHIYYSY 208


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Sbal195_2036  HTHFIS  31  0.002  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 31.3 bits (71), Expect = 0.002
Identities = 10/47 (21%), Positives = 21/47 (44%), Gaps = 1/47 (2%)

Query: 7 EKVVELEGIENAIVRSAAELC-YIQPQGPALVIIDIKLPNAAVLEWV 52
+ + G + I +AA L +I LV+ D+ +P+ + +
Sbjct: 20 NQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLL 66


22  Sbal195_2120  Sbal195_2152  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
Sbal195_2120  2       20       -1.456089   hypothetical protein
Sbal195_2121  3       23       -3.262553   hypothetical protein
Sbal195_2122  4       25       -5.566207   hypothetical protein
Sbal195_2123  4       23       -4.774149   hypothetical protein
Sbal195_2124  5       24       -3.498347   Mor transcription activator domain-containing
Sbal195_2125  3       20       -3.018359   hypothetical protein
Sbal195_2126  2       22       -0.305730   hypothetical protein
Sbal195_2127  2       19       1.154927    hypothetical protein
Sbal195_2128  3       23       5.057500    hypothetical protein
Sbal195_2129  2       23       5.359115    glycoside hydrolase
Sbal195_2130  -1      20       4.829853    hypothetical protein
Sbal195_2131  0       20       4.989837    zinc finger-like protein
Sbal195_2132  1       19       5.416799    hypothetical protein
Sbal195_2133  1       19       5.702095    hypothetical protein
Sbal195_2134  1       16       3.667741    hypothetical protein
Sbal195_2135  1       14       2.382917    prophage MuSo1, portal protein
Sbal195_2136  2       15       0.078388    hypothetical protein
Sbal195_2137  3       21       -2.485712   SPP1 family phage head morphogenesis protein
Sbal195_2138  2       25       -3.291663   phage virion morphogenesis protein
Sbal195_2139  2       22       -3.072263   hypothetical protein
Sbal195_2140  1       19       -1.477555   hypothetical protein
Sbal195_2141  1       16       -0.080462   hypothetical protein
Sbal195_2142  3       19       2.179302    hypothetical protein
Sbal195_2143  2       18       3.553938    prophage MuSo1, protein Gp32
Sbal195_2144  3       19       3.590127    prophage MuSo1, major head subunit
Sbal195_2145  3       19       3.804345    Rho termination factor domain-containing
Sbal195_2146  0       17       3.227647    hypothetical protein
Sbal195_2147  -1      13       2.276275    hypothetical protein
Sbal195_2148  -1      13       1.918972    hypothetical protein
Sbal195_2149  -1      12       1.344812    phage tape measure protein
Sbal195_2150  6       17       -1.447808   hypothetical protein
Sbal195_2151  5       18       -1.458558   hypothetical protein
Sbal195_2152  5       25       -3.020378   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Sbal195_2149  GPOSANCHOR  44  3e-06  Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 44.3 bits (104), Expect = 3e-06
Identities = 47/370 (12%), Positives = 110/370 (29%), Gaps = 3/370 (0%)

Query: 830 TGAAVPETLKAQAATLGLTKELSDLTAKQYGYTDSVKELSPEQAKLSRAVAETEARLKQC 889
G V + AT T L + + + L + + LS + +
Sbjct: 31 AGLVVNTNEVSAVATRSQTDTLEKVQERADKFEIENNTLKLKNSDLSFNNKALKDHNDEL 90

Query: 890 RDVMNSSTVSSKAKAKAQQDLISLQGKLSDQTKQLSEVQALEAANYEQIKSKYAAVSDEM 949
+ ++++ + K+ + S +L + L + +K + E
Sbjct: 91 TEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEK 150

Query: 950 LRLEQAYKDGGITAEEYLRQKERLVEVLRILQRLMGGLEEGEQETDEQVKKTTKTLIEQR 1009
L D E + ++ L+ LE + E ++ ++
Sbjct: 151 AALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADS 210

Query: 1010 EELEQLEETTGRATEYVNLFAGAYAHLNKQFNFNEDSTEKLNARVDQLTNSIMNNMRVNT 1069
+++ LE A + + L A L +
Sbjct: 211 AKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALE 270

Query: 1070 GFWGVLAQLSNQAFIREKQIINETLLTRKWTEELESSSISLDRVNQISREAKWNIRELGD 1129
G S + E + + + + + + + ++ ++L
Sbjct: 271 GAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEA 330

Query: 1130 EDLKPLQAAIDATRDRILGLRDDINATLGSLKDEMDQLNNNQAAIEKRRYEQQQAELKAQ 1189
E K L+ + LR D++A+ + K + + + + E + L+
Sbjct: 331 EHQK-LEEQNKISEASRQSLRRDLDASREAKKQLEAEHQ--KLEEQNKISEASRQSLRRD 387

Query: 1190 LDAARTAQDK 1199
LDA+R A+ +
Sbjct: 388 LDASREAKKQ 397



Score = 37.0 bits (85), Expect = 6e-04
Identities = 41/276 (14%), Positives = 84/276 (30%)

Query: 749 LTILRAKFEEQQTYLDATAKGAEALEQAYKDLGLTSSHALEQVNTKAEAAFNLIKNNREP 808
I + + + L K + + + L + + + I+
Sbjct: 62 FEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEAR 121

Query: 809 IEQQKDAFLAWAKAALTAAEATGAAVPETLKAQAATLGLTKELSDLTAKQYGYTDSVKEL 868
+ A + + E A L K L + +K L
Sbjct: 122 KADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTL 181

Query: 869 SPEQAKLSRAVAETEARLKQCRDVMNSSTVSSKAKAKAQQDLISLQGKLSDQTKQLSEVQ 928
E+A L AE E L+ + + + K + L + + L +
Sbjct: 182 EAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFS 241

Query: 929 ALEAANYEQIKSKYAAVSDEMLRLEQAYKDGGITAEEYLRQKERLVEVLRILQRLMGGLE 988
++A + ++++ AA+ LE+A + + + + L L+ LE
Sbjct: 242 TADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLE 301

Query: 989 EGEQETDEQVKKTTKTLIEQREELEQLEETTGRATE 1024
Q + + + L RE +QLE + E
Sbjct: 302 HQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEE 337


23  Sbal195_2177  Sbal195_2182  Y  N  N  Genomic Island
LocusTag      DNBias  CDNBias  %GCBias     Product
Sbal195_2177  -1      24       -4.272372   phosphogluconate dehydratase
Sbal195_2178  2       34       -9.376229   keto-hydroxyglutarate-aldolase/keto-deoxy-
Sbal195_2179  1       25       -7.579986   resolvase domain-containing protein
Sbal195_2180  -1      21       -6.915838   hypothetical protein
Sbal195_2181  -1      15       -4.843378   hypothetical protein
Sbal195_2182  -1      11       -3.167847   hypothetical protein
24  Sbal195_2217  Sbal195_2249  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
Sbal195_2217  0       22       -3.137433   cytochrome c oxidase, cbb3-type subunit III
Sbal195_2218  0       21       -3.652709   cbb3-type cytochrome oxidase subunit
Sbal195_2219  0       17       -3.584588   cbb3-type cytochrome c oxidase subunit II
Sbal195_2220  0       18       -4.824089   cbb3-type cytochrome c oxidase subunit I
Sbal195_2221  0       22       -7.345374   hypothetical protein
Sbal195_2222  0       23       -7.260010   response regulator receiver modulated metal
Sbal195_2223  2       28       -6.367426   hypothetical protein
Sbal195_2224  1       20       -4.202500   phage integrase family protein
Sbal195_2225  2       21       -3.844781   hypothetical protein
Sbal195_2226  2       21       -4.691749   cytoplasmic chaperone TorD family protein
Sbal195_2227  2       20       -3.854345   dimethylsulfoxide reductase subunit B
Sbal195_2228  3       17       -2.880337   anaerobic dimethyl sulfoxide reductase subunit
Sbal195_2229  2       15       -2.616148   outer membrane protein
Sbal195_2230  3       17       -4.587363   cytochrome C family protein
Sbal195_2231  3       19       -5.796730   CRP/FNR family transcriptional regulator
Sbal195_2232  4       20       -4.389514   hypothetical protein
Sbal195_2233  3       18       -4.328010   hypothetical protein
Sbal195_2234  2       19       -4.175352   ATPase AAA
Sbal195_2235  1       17       -3.527694   integrase catalytic subunit
Sbal195_2236  0       20       -5.616818   TnsA endonuclease
Sbal195_2238  0       20       -5.819381   filamentation induced by cAMP protein fic
Sbal195_2239  0       20       -5.582704   hypothetical protein
Sbal195_2240  0       19       -5.296776   transposase
Sbal195_2241  2       23       -6.724504   integrase family protein
Sbal195_2242  2       20       -6.188581   hypothetical protein
Sbal195_2243  1       17       -3.975937   radical SAM domain-containing protein
Sbal195_2244  2       15       -2.563127   hypothetical protein
Sbal195_2245  3       16       -2.287234   transposase, IS4 family protein
Sbal195_2246  3       17       -2.888437   two component LuxR family transcriptional
Sbal195_2247  3       16       -3.054224   integral membrane sensor hybrid histidine
Sbal195_2248  6       17       -2.874168   hypothetical protein
Sbal195_2249  5       17       -2.718424   YscC/HrcC family type III secretion outer
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Sbal195_2222  HTHFIS  46  3e-07  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 45.6 bits (108), Expect = 3e-07
Identities = 19/90 (21%), Positives = 33/90 (36%), Gaps = 9/90 (10%)

Query: 28 KVLVVDDEPDVHTVTKLALSRFKLDGRPLTFINAYSAEQAKELMNQEHDIAIAFIDVVME 87
+LV DD+ + TV ALSR +A + + DVVM
Sbjct: 5 TILVADDDAAIRTVLNQALSR-----AGYDVRITSNAATLWRWIAAGD-GDLVVTDVVMP 58

Query: 88 SDHAGLELVKWIREELQNKTTRLILRTGQP 117
D +L+ I++ +++ + Q
Sbjct: 59 -DENAFDLLPRIKKA--RPDLPVLVMSAQN 85


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Sbal195_2233  PREPILNPTASE  30  0.016  Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 30.2 bits (68), Expect = 0.016
Identities = 11/33 (33%), Positives = 15/33 (45%), Gaps = 8/33 (24%)

Query: 53 QCPCCQHPLTWQQH--LFSH------CLHCKQP 77
CP C HP+T ++ L S C C+ P
Sbjct: 73 CCPHCNHPITALENIPLLSWLWLRGRCRGCQAP 105


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Sbal195_2246  HTHFIS  63  6e-14  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 63.3 bits (154), Expect = 6e-14
Identities = 21/102 (20%), Positives = 43/102 (42%), Gaps = 4/102 (3%)

Query: 11 ILVVDDHSLIFDGLRGCLAPYPELNL-IGSVEDGLAVYEKCLKLRPDLVFMDLKLPGMGG 69
ILV DD + I L L+ + DLV D+ +P
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAA---GDGDLVVTDVVMPDENA 62

Query: 70 LDVIRQLRQRWPEMMIIMLTGTIEEKSAREALDVGANGYVLK 111
D++ ++++ P++ +++++ +A +A + GA Y+ K
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPK 104


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Sbal195_2247  HTHFIS  74  1e-15  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 74.1 bits (182), Expect = 1e-15
Identities = 27/120 (22%), Positives = 46/120 (38%), Gaps = 8/120 (6%)

Query: 696 KILLVDDVETNRDIISKMLLELGQQVIAVSSGEAALEKGTRHIFDLVLMDIRMPGLDGYQ 755
IL+ DD R ++++ L G V S+ DLV+ D+ MP + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 756 TTQQWRHSENILDGDCPIFALTANANPKEHDTIEA--AGMNSYITKPVSLKQLNHALEAA 813
+ + D P+ ++A I+A G Y+ KP L +L + A
Sbjct: 65 LLPRIKK----ARPDLPVLVMSAQNTF--MTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Sbal195_2249  TYPE3OMGPROT  440  e-152  Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature.

Length = 607

Score = 440 bits (1133), Expect = e-152
Identities = 156/503 (31%), Positives = 262/503 (52%), Gaps = 24/503 (4%)

Query: 6 SLLLLCQMGLAQAAPLTNIKWQGEPFVMISRGTALTSVIQDFASNYGVPVIVSNKVNDNY 65
+LLLL AQ W P+V +++G +L ++ DF +NY V+VS+K+ND
Sbjct: 16 TLLLLSSYSWAQELD-----WLPIPYVYVAKGESLRDLLTDFGANYDATVVVSDKINDKV 70

Query: 66 IGQIQEQDPKSVIQDLTRRYGLVWYYNNEVLYVYKASEINSEVLPLTSLSATKVDHYLRS 125
GQ + +P+ +Q + Y LVWYY+ VLY++K SE+ S ++ L A ++ L+
Sbjct: 71 SGQFEHDNPQDFLQHIASLYNLVWYYDGNVLYIFKNSEVASRLIRLQESEAAELKQALQR 130

Query: 126 AGVLDKGVCNIKSMAGISGLQVTGVPECINSVTKLTAQLDANAKQTTE--NQETVKVYPL 183
+G+ + + A + V+G P + V + A L+ + +E ++++PL
Sbjct: 131 SGIWEPR-FGWRPDASNRLVYVSGPPRYLELVEQTAAALEQQTQIRSEKTGALAIEIFPL 189

Query: 184 KYASATDSIYEYRSQPVSIPGLVTVLKEMDQGTQV-------ANAVAGSVSNISGPVFAA 236
KYASA+D YR V+ PG+ T+L+ + + + + A
Sbjct: 190 KYASASDRTIHYRDDEVAAPGVATILQRVLSDATIQQVTVDNQRIPQAATRASAQARVEA 249

Query: 237 DPRQNAIIVRGSARDMATYGSLIRQLDTKPSMIEVSVSIFDVDASDFKQLGIDWSASAKL 296
DP NAIIVR S M Y LI LD + IEV++SI D++A +LG+DW +
Sbjct: 250 DPSLNAIIVRDSPERMPMYQRLIHALDKPSARIEVALSIVDINADQLTELGVDWRVGIRT 309

Query: 297 GGGSVSFN---------SGDSSDNFSTVIGNTGNFMMRLNALEKNSKAKVLSRPSVVTLN 347
G + + + + R+N LE A+V+SRP+++T
Sbjct: 310 GNNHQVVIKTTGDQSNIASNGALGSLVDARGLDYLLARVNLLENEGSAQVVSRPTLLTQE 369

Query: 348 NVQAVLDKNVTFYTKLEGDKVAKLESVTTGSLLRVTPRLIDEVGHQAVMLDLNIQDGQQS 407
N QAV+D + T+Y K+ G +VA+L+ +T G++LR+TPR++ + + L+L+I+DG Q
Sbjct: 370 NAQAVIDHSETYYVKVTGKEVAELKGITYGTMLRMTPRVLTQGDKSEISLNLHIEDGNQK 429

Query: 408 QAVSRSEPLPQVQNSEISTQATLKSGESLLLGGFVQDRDETTQNKIPLLGDLPLLGGLFR 467
S E +P + + + T A + G+SL++GG +D +K+PLLGD+P +G LFR
Sbjct: 430 PNSSGIEGIPTISRTVVDTVARVGHGQSLIIGGIYRDELSVALSKVPLLGDIPYIGALFR 489

Query: 468 STDHHTQSVMRLFLIKAEPVNQG 490
T+ +RLF+I+ +++G
Sbjct: 490 RKSELTRRTVRLFIIEPRIIDEG 512


25  Sbal195_2258  Sbal195_2263  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
Sbal195_2258  2       22       -2.386326   hypothetical protein
Sbal195_2259  3       23       -2.931714   type III secretion low calcium response
Sbal195_2260  2       24       -2.687142   hypothetical protein
Sbal195_2261  2       24       -4.247170   hypothetical protein
Sbal195_2262  2       24       -3.673503   AraC family transcriptional regulator
Sbal195_2263  2       23       -1.977398   type III secretion system needle protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Sbal195_2258  CLENTEROTOXN  29  0.036  Clostridium enterotoxin signature.
		>CLENTEROTOXN#Clostridium enterotoxin signature.

Length = 319

Score = 29.2 bits (65), Expect = 0.036
Identities = 22/132 (16%), Positives = 40/132 (30%), Gaps = 18/132 (13%)

Query: 246 SSEDNSLRYAVTPSRYELLNCVAAHGMEDEGLARVLYQAKVGNTNLGALYGLPAPKDAPQ 305
+E + T +Y+ + ++ + D+G L + T A
Sbjct: 124 PNEYVYYKVYATYRKYQAIR-ISHGNISDDGSIYKLTGIWLSKT------------SADS 170

Query: 306 LDNVDD--FILCDEDINLGVSQTDVYADEETFYQGIGQHQTTTTGDN--CYKLLQLNIND 361
L N+D I E L V TD+ + + T ++ L +
Sbjct: 171 LGNIDQGSLIETGERCVLTVPSTDIEKEILDLAAATERLNLTDALNSNPAGNLYDWR-SS 229

Query: 362 GLHYLATKANPH 373
+ K N H
Sbjct: 230 NSYPWTQKLNLH 241


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Sbal195_2259  SYCDCHAPRONE  93  7e-27  Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 93.1 bits (231), Expect = 7e-27
Identities = 43/148 (29%), Positives = 68/148 (45%)

Query: 9 DFEKLEAACQLALVNQQTLAEQVGLTSQDLELIYQSGTSKYQMGLPAEAIVDFTYLVMHQ 68
D ++ + A + L T+A ++S LE +Y ++YQ G +A F L +
Sbjct: 7 DTQEYQLAMESFLKGGGTIAMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLD 66

Query: 69 PWDRRFHLGLGSCLHWLGEYQHALTFYGYALLMDACSPEASFRIAQCFLSLNDDAAAIEA 128
+D RF LGLG+C +G+Y A+ Y Y +MD P F A+C L + A A
Sbjct: 67 HYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESG 126

Query: 129 LQMAISQSYSKPEHHFVGDQAQQLLSAL 156
L +A K E + + +L A+
Sbjct: 127 LFLAQELIADKTEFKELSTRVSSMLEAI 154


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Sbal195_2260  RTXTOXIND  38  6e-05  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 37.5 bits (87), Expect = 6e-05
Identities = 8/114 (7%), Positives = 38/114 (33%), Gaps = 1/114 (0%)

Query: 259 YQLRQLQASALTQQGNMKLSEAQL-ALKESQANEKTAQFDAEIRMKQSERFRGTNQTLQQ 317
+ Q+S L + + +++ ++ E + + E +++
Sbjct: 134 ADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKE 193

Query: 318 QLAEKEGLLLQSQNQFEQLQSRFDKSNVQLSGVMQQLQMLQQQLAELQPARARN 371
Q + + Q + ++ ++ +++ ++ + +L + +
Sbjct: 194 QFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQ 247



Score = 29.0 bits (65), Expect = 0.037
Identities = 18/129 (13%), Positives = 49/129 (37%), Gaps = 13/129 (10%)

Query: 244 SQNKSFEAEVSSVKAYQLRQLQASALTQQGNMKLSEAQLALKESQANEKTAQFDAEIRMK 303
Q +++ + + L + +A LT + E +++S+ ++ ++
Sbjct: 193 EQFSTWQNQKYQKEL-NLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLH------ 245

Query: 304 QSERFRGTNQTL--QQQLAEKEGLLLQSQNQFEQLQSRFDKSNVQLSGVMQQLQMLQQQL 361
++ + L + + E L ++Q EQ++S + + V Q + + L
Sbjct: 246 --KQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFK--NEIL 301

Query: 362 AELQPARAR 370
+L+
Sbjct: 302 DKLRQTTDN 310


26  Sbal195_2272  Sbal195_2302  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
Sbal195_2272  2       18       0.278186    type III secretion apparatus
Sbal195_2273  2       18       -0.042701   secretion system apparatus protein SsaV
Sbal195_2274  4       18       0.284265    type III secretion system ATPase
Sbal195_2275  4       20       -1.289995   hypothetical protein
Sbal195_2276  3       17       -1.279785   hypothetical protein
Sbal195_2277  3       18       -2.226928   hypothetical protein
Sbal195_2278  3       20       -3.486834   type III secretion system protein
Sbal195_2279  2       17       -3.104855   HrpO family type III secretion protein
Sbal195_2280  0       20       -3.701305   type III secretion protein SpaR/YscT/HrcT
Sbal195_2281  1       22       -5.063802   secretion system apparatus protein SsaU
Sbal195_2282  2       24       -6.199676   integrase catalytic subunit
Sbal195_2283  4       24       -6.323264   transposase IS3/IS911 family protein
Sbal195_2284  3       25       -6.632687   DNA-directed DNA polymerase
Sbal195_2285  5       32       -7.692452   hypothetical protein
Sbal195_2286  2       28       -6.818871   hypothetical protein
Sbal195_2287  2       29       -6.277934   hypothetical protein
Sbal195_2288  2       31       -6.159972   hypothetical protein
Sbal195_2289  0       32       -6.083783   hypothetical protein
Sbal195_2290  1       32       -5.685683   helix-destabilizing protein
Sbal195_2291  1       31       -6.061777   replication protein gene II/X
Sbal195_2292  2       31       -6.587421   XRE family transcriptional regulator
Sbal195_2293  0       32       -6.083783   hypothetical protein
Sbal195_2294  1       32       -5.685683   helix-destabilizing protein
Sbal195_2295  1       31       -6.061777   replication protein gene II/X
Sbal195_2296  2       31       -6.587421   XRE family transcriptional regulator
Sbal195_2297  0       32       -6.083783   hypothetical protein
Sbal195_2298  1       32       -5.685683   helix-destabilizing protein
Sbal195_2299  1       31       -6.061777   replication protein gene II/X
Sbal195_2300  1       27       -5.264752   XRE family transcriptional regulator
Sbal195_2301  0       25       -4.843527   hypothetical protein
Sbal195_2302  0       21       -3.210898   helix-destabilizing protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Sbal195_2276  PF06340  29  0.007  Vibrio cholerae toxin co-regulated pilus biosynthesis pr...
		>PF06340#Vibrio cholerae toxin co-regulated pilus biosynthesis protein F (TcpF)

Length = 338

Score = 28.8 bits (64), Expect = 0.007
Identities = 7/39 (17%), Positives = 19/39 (48%)

Query: 27 EMRRLFNRYFCGQQEDDNAISTKDRLTAKALLSRDGGVY 65
+++L+ ++ Q D I T+D++ + +G +
Sbjct: 110 SLQKLYIDFYLAQTTFDWEIPTRDQIETLVNYANEGKLS 148


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Sbal195_2277  FLGMOTORFLIM  28  0.047  Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 27.9 bits (62), Expect = 0.047
Identities = 9/44 (20%), Positives = 19/44 (43%)

Query: 224 SLLPKMDAIQPPLTADIGRVSLPLAKLGAMMTGDKLTLEVTLNN 267
L K+ + + A++G + L + + + GD + L T
Sbjct: 249 VLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVG 292


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Sbal195_2278  TYPE3IMPPROT  210  3e-71  Type III secretion system inner membrane P protein ...
		>TYPE3IMPPROT#Type III secretion system inner membrane P protein family signature.

Length = 224

Score = 210 bits (537), Expect = 3e-71
Identities = 81/215 (37%), Positives = 129/215 (60%), Gaps = 7/215 (3%)

Query: 8 IQLIIMLFCLSLLPLFAVMGTSFLKLAIVFSMLRNALGIQQIPPNMAIYGLALILTLFTM 67
I LI +L +LLP GT F+K +IVF M+RNALG+QQIP NM + G+AL+L++F M
Sbjct: 5 ISLIALLAFSTLLPFIIASGTCFVKFSIVFVMVRNALGLQQIPSNMTLNGVALLLSMFVM 64

Query: 68 APVGMAINDNLKATPIVFDAPNVFEQINTEAIAPYRAFLEKNTSNTQIEFFANIGHKVWP 127
P+ + + F+ + + E + YR +L K + ++FF N K
Sbjct: 65 WPIMHDAYVYFEDEDVTFNDISSLSKHVDEGLDGYRDYLIKYSDRELVQFFENAQLKRQY 124

Query: 128 EKYQQV-------LTKDSLLVMVPAFTMSQLIEAFKIGLLIYLPFVAIDLIVSNILLAMG 180
+ + + K S+ ++PA+ +S++ AFKIG +YLPFV +DL+VS++LLA+G
Sbjct: 125 GEETETVKRDKDEIEKPSIFALLPAYALSEIKSAFKIGFYLYLPFVVVDLVVSSVLLALG 184

Query: 181 MMMVSPMTIALPFKLLIFILMGGWEKLISQLMMSF 215
MMM+SP+TI+ P KL++F+ + GW L L++ +
Sbjct: 185 MMMMSPVTISTPIKLVLFVALDGWTLLSKGLILQY 219


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Sbal195_2279  TYPE3IMQPROT  70  7e-20  Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 70.2 bits (172), Expect = 7e-20
Identities = 33/83 (39%), Positives = 50/83 (60%)

Query: 6 IVHFTSELLWMVLLLSLPVVIVASVVGVLVSLIQALTQIQDQTLQFLIKLIAVCVTLVVC 65
+V ++ L++VL+LS IVA+++G+LV L Q +TQ+Q+QTL F IKL+ VC+ L +
Sbjct: 4 LVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFLL 63

Query: 66 YHWMGSSLLNYASMAFDQISQMG 88
W G LL+Y G
Sbjct: 64 SGWYGEVLLSYGRQVIFLALAKG 86


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
Sbal195_2280  TYPE3IMRPROT  126  2e-37  Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.

Length = 261

Score = 126 bits (319), Expect = 2e-37
Identities = 47/238 (19%), Positives = 104/238 (43%), Gaps = 5/238 (2%)

Query: 1 MTTQLPNLLTAQLPVLALCMMRPLGMMLLLPLFKGGAMGSALIRNSLILMFALPTVLAMD 60
M + L + ++R L ++ P+ ++ ++ L +M +
Sbjct: 1 MLQVTSEQWLSWLNLYFWPLLRVLALISTAPILSERSVPK-RVKLGLAMMITFA-IAPSL 58

Query: 61 EMQPILQQADTWMLISLFGKEIIVGMLLGFCAAIPFWAIDMAGFVIDTMRGASMSTVLNP 120
+ + + +++ ++I++G+ LGF F A+ AG +I G S +T ++P
Sbjct: 59 PANDVPVFSFFALWLAV--QQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDP 116

Query: 121 LMGLQSSIYGMLFTQVLTVLFLVSGGFNFLLTALYQSYQQLPPGFNLTLSQPLMVFIAHE 180
L + + + +LFL G +L++ L ++ LP G S +
Sbjct: 117 ASHLNMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAG 176

Query: 181 WQLMCQLCLSFAMPAMVIMILVDVALGLVNRSAQQLNVFFLSMPIKSALVLLLLIYSL 238
+ L A+P + +++ +++ALGL+NR A QL++F + P+ + + L+ +
Sbjct: 177 SLIF-LNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALM 233


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_2281  TYPE3IMSPROT  356    e-124    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.
Length = 354

Score = 356 bits (915), Expect = e-124
Identities = 120/346 (34%), Positives = 192/346 (55%)

Query: 2 AEKTEKPTEKRLREARNRGQVIKSAEIVTGLQMAIILGYFLYEGPALVQAIMALIDLTIH 61
EKTE+PT K++R+AR +GQV KS E+V+ + + + + L+ +
Sbjct: 3 GEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPAE 62

Query: 62 AINLPLETAAEQIVGTFAVLALRFLGGLTLVLVFTIVVGNLVQTGPVWAAESIMPSMDKL 121
LP A +V + L V + ++VQ G + + E+I P + K+
Sbjct: 63 QSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIKKI 122

Query: 122 NVMNNAKQLISLKSLFELAKNLVKVTVLSLVFYYLLHRYVNAFQYLPLCGEACGISVIST 181
N + AK++ S+KSL E K+++KV +LS++ + ++ + LP CG C ++
Sbjct: 123 NPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLLGQ 182

Query: 182 MITWLWGSFLGCYLIFGIADYAFQRYSLMKELKMSKDDTKQEYKDSEGNPEMKQKRRETQ 241
++ L +++ IADYAF+ Y +KELKMSKD+ K+EYK+ EG+PE+K KRR+
Sbjct: 183 ILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQFH 242

Query: 242 REVASGSLASNVRKATVVVRNPTHIAVCLYYSEGETPLPKVLEKAEDHMALHIVALAEKA 301
+E+ S ++ NV++++VVV NPTHIA+ + Y GETPLP V K D + +AE+
Sbjct: 243 QEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAEEE 302

Query: 302 GVPIVENIPLARALFKHVEAGDVIPESLFEPVAELLRLVMTISYDN 347
GVPI++ IPLARAL+ IP E AE+LR + + +
Sbjct: 303 GVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQNIEK 348


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits      Score  E-value  Comments
Sbal195_2287  adhesinb  29     0.019    Adhesin B signature.
		>adhesinb#Adhesin B signature.

Length = 310

Score = 28.7 bits (64), Expect = 0.019
Identities = 27/120 (22%), Positives = 45/120 (37%), Gaps = 27/120 (22%)

Query: 140 AWTDVSASANKLRIAIETLLTTIDPE------------LAKIKVLHERIEAFSKSQPEVA 187
AW ++ + I L+ DP + K+ L + + + P
Sbjct: 141 AWLNLE-NGIIYAQNIAKRLSEKDPANKETYEKNLKAYVEKLSALDKEAKEKFNNIPGEK 199

Query: 188 KLLMAIKWLGNEASHEGALKEYDLAFAYEVMELSINRLFDDSE---DKIKQLVELVNKNK 244
K+++ + EG K Y + AY V I + + E D+IK LVE + K K
Sbjct: 200 KMIV---------TSEGCFK-Y-FSKAYNVPSAYIWEINTEEEGTPDQIKTLVEKLRKTK 248


27  Sbal195_2322  Sbal195_2346  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias     %GCBias  Product
Sbal195_2322       2       19   -1.111793  outer membrane lipoprotein carrier protein LolA
Sbal195_2323       2       21   -1.029839  cell division protein FtsK
Sbal195_2324       2       34   -2.575953  leucine-responsive transcriptional regulator
Sbal195_2325       1       33   -1.912197  alanine dehydrogenase
Sbal195_2326       2       32   -1.987566  thioredoxin reductase
Sbal195_2327       2       31   -2.712654  50S ribosomal protein L20
Sbal195_2328       1       22   -2.580275  50S ribosomal protein L35
Sbal195_2329      -2       13   -1.572545  translation initiation factor IF-3
Sbal195_2330      -1       12   -1.910439  threonyl-tRNA synthetase
Sbal195_2331       0       15   -2.275321  hypothetical protein
Sbal195_2332       0       15   -2.969458  hypothetical protein
Sbal195_2333       0       17   -3.536171  riboflavin synthase subunit alpha
Sbal195_2334       0       14   -3.277703  MATE efflux family protein
Sbal195_2335       0       19   -4.432327  **integrase catalytic subunit
Sbal195_2336       1       16   -3.916596  transposase IS3/IS911 family protein
Sbal195_2337       1       15   -3.765279  hypothetical protein
Sbal195_2338       0       14   -3.187065  hypothetical protein
Sbal195_2339       0       14   -2.332675  TonB-dependent siderophore receptor
Sbal195_2340      -1       15   -2.163790  hypothetical protein
Sbal195_2341      -2       13   -2.528316  PepSY-associated TM helix domain-containing
Sbal195_2342      -3       13   -2.715579  TonB-dependent siderophore receptor
Sbal195_2343       1       17   -2.734791  hypothetical protein
Sbal195_2344       0       18   -3.620741  hypothetical protein
Sbal195_2345       0       17   -4.009502  PAS/PAC sensor-containing diguanylate cyclase
Sbal195_2346       0       16   -3.801032  integral membrane sensor hybrid histidine
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Sbal195_2346  HTHFIS  53     2e-09    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 52.9 bits (127), Expect = 2e-09
Identities = 16/118 (13%), Positives = 37/118 (31%), Gaps = 11/118 (9%)

Query: 513 DQQKVLIIDDNLFNLEICRAMLEHYHFQTFSTDNTEQALKMLVNHLPQIVIVDYRLQGMN 572
+L+ DD+ + L + T N + + +V+ D + N
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 573 GLQLVRQMQQVLQSLESNSPIEHQCRFFLLSA-NDCDDIPELASFPEVHFMQKPFSAE 629
L+ ++++ ++SA N + + ++ KPF
Sbjct: 62 AFDLLPRIKK----------ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLT 109


28  Sbal195_2515  Sbal195_2523  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias     %GCBias  Product
Sbal195_2515       1       21   -5.180239  hypothetical protein
Sbal195_2516       6       30   -9.096794  hypothetical protein
Sbal195_2517       5       28   -8.874846  RNA-directed DNA polymerase
Sbal195_2518       2       27   -8.274876  ATP-dependent OLD family endonuclease
Sbal195_2519      -1       18   -5.110033  hypothetical protein
Sbal195_2520      -1       16   -3.586135  hypothetical protein
Sbal195_2521      -1       13   -0.337935  hypothetical protein
Sbal195_2522       0       18    2.529952  hypothetical protein
Sbal195_2523       1       18    3.212703  beta-hexosaminidase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_2515  BCTERIALGSPF  31     0.028    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.
Length = 408

Score = 30.6 bits (69), Expect = 0.028
Identities = 12/49 (24%), Positives = 23/49 (46%), Gaps = 4/49 (8%)

Query: 691 RYRFLNAQNLPLAPAMQGDSSDFYQRPLQQVLREQRWFVLAGRAAQIEK 739
Y+ L+AQ + DS+ R +Q+LRE+ L+ + ++
Sbjct: 5 HYQALDAQGKKCRGTQEADSA----RQARQLLRERGLVPLSVDENRGDQ 49


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Sbal195_2519  PF04183  30     0.009    IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 30.2 bits (68), Expect = 0.009
Identities = 11/47 (23%), Positives = 20/47 (42%), Gaps = 2/47 (4%)

Query: 251 RLPSSHFTFNQDIMVLGKFSIPALSERVTPTYLGNEIRTLITHAQQV 297
LP + + F + + G I A + R + +TL+ +QV
Sbjct: 40 NLPGAQWRFIAERGIWGWLWIDAQTLRCADEPV--LAQTLLMQLKQV 84


29  Sbal195_2534  Sbal195_2547  Y  N  Y  Genomic Island
LocusTag      DNBias  CDNBias     %GCBias  Product
Sbal195_2534       2       19    2.672706  transposase, IS4 family protein
Sbal195_2535       2       19    2.765463  hypothetical protein
Sbal195_2536       3       20    3.388946  ATP phosphoribosyltransferase
Sbal195_2537       3       20    3.642588  histidinol dehydrogenase
Sbal195_2538       2       20    2.757422  histidinol-phosphate aminotransferase
Sbal195_2539       1       20    2.880312  imidazole glycerol-phosphate
Sbal195_2540       2       16    1.790300  imidazole glycerol phosphate synthase subunit
Sbal195_2541       2       16    1.578954  1-(5-phosphoribosyl)-5-[(5-
Sbal195_2542       2       17    1.391308  imidazole glycerol phosphate synthase subunit
Sbal195_2543       2       15    0.666195  bifunctional phosphoribosyl-AMP
Sbal195_2544       2       15    0.466350  aromatic amino acid transporter
Sbal195_2545       2       14   -1.080610  hypothetical protein
Sbal195_2546      -1       17   -3.946188  hypothetical protein
Sbal195_2547      -2       16   -3.998505  hypothetical protein
30  Sbal195_2601  Sbal195_2611  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias     %GCBias  Product
Sbal195_2601       3       15   -0.391856  phosphoenolpyruvate synthase
Sbal195_2602       1       13   -0.499370  hypothetical protein
Sbal195_2603       1       14   -0.601996  phospho-2-dehydro-3-deoxyheptonate aldolase
Sbal195_2604      -1       12   -0.444441  glycoside hydrolase
Sbal195_2605      -1       11   -1.063976  thioesterase superfamily protein
Sbal195_2606      -1       22   -3.131565  two component LuxR family transcriptional
Sbal195_2607      -1       22   -3.187219  transcriptional regulator CysB
Sbal195_2608       0       24   -3.369125  hypothetical protein
Sbal195_2609      -1       23   -3.286541  DNA topoisomerase I
Sbal195_2610       0       26   -4.070267  succinylarginine dihydrolase
Sbal195_2611       1       28   -4.397701  Ig domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_2601  PHPHTRNFRASE  297    3e-93    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.
Length = 572

Score = 297 bits (761), Expect = 3e-93
Identities = 111/418 (26%), Positives = 187/418 (44%), Gaps = 65/418 (15%)

Query: 384 QPGDVLVTDMTDPDWEPIMK-RASAIVTNRGGRTCHAAIIARELGVPAVVGCGDVTDRIK 442
+ ++ D+T D + K T+ GGRT H+AI++R L +PAVVG +VT++I+
Sbjct: 155 EETVIIAEDLTPSDTAQLNKQFVKGFATDIGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQ 214

Query: 443 NGQIVTVSCAEG---------DTGFIYEGKQEFEVISNRVDSLPELP--------MKIMM 485
+G +V V EG + E + FE L P +++
Sbjct: 215 HGDMVIVDGIEGIVIVNPTEEEVKAYEEKRAAFEKQKQEWAKLVGEPSTTKDGAHVELAA 274

Query: 486 NVGNPDRAFDFARLPNEGVGLARLEFIINRMIGIHPKALLEFNQQDAALQTEINEMIAGY 545
N+G P EG+GL R EF+ M L TE E Y
Sbjct: 275 NIGTPKDVDGVLANGGEGIGLYRTEFLY--MD-------------RDQLPTE-EEQFEAY 318

Query: 546 ESPVEFYIARLVEGIATIGSAFYPKKVIVRMSDFKSNEYANLVGGDRYEPEEENPMLGFR 605
+ V+ K V++R D ++ + + P+E NP LGFR
Sbjct: 319 KEVVQ---------------RMDGKPVVIRTLDIGGDKELSYL----QLPKELNPFLGFR 359

Query: 606 GASRYISESFRDCFALECEAIKRVRNDMGLKNVEVMIPFVRTVKEAEQVIGLLKEQGLER 665
+ + +D F + A+ R N++VM P + T++E Q +++E+ +
Sbjct: 360 AIRLCLEK--QDIFRTQLRALLRAS---TYGNLKVMFPMIATLEELRQAKAIMQEEKDKL 414

Query: 666 GKDG------LRVIMMCEVPSNALLADQFLEHFDGFSIGSNDLTQLTLGLDRDSGIISHL 719
+G + V +M E+PS A+ A+ F + D FSIG+NDL Q T+ DR + +S+L
Sbjct: 415 LSEGVDVSDSIEVGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAADRMNERVSYL 474

Query: 720 FDERDEAVKMLLSLAIKAAKTKGAYIGICGQGPSDHADFAAWLVEQGIDTVSLNPDTV 777
+ A+ L+ + IKAA ++G ++G+CG+ D L+ G+D S++ ++
Sbjct: 475 YQPYHPAILRLVDMVIKAAHSEGKWVGMCGEMAGDE-VAIPLLLGLGLDEFSMSATSI 531


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
Sbal195_2604  MICOLLPTASE  36     0.001    Microbial collagenase metalloprotease (M9) signature.
		>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature.

Length = 1104

Score = 35.8 bits (82), Expect = 0.001
Identities = 34/162 (20%), Positives = 58/162 (35%), Gaps = 28/162 (17%)

Query: 81 WENKGVCDGAQNQAPTLVILQPQNNVSVNLGDVVLLQADAS-DVDGTVASVNW-FANGQA 138
D N+ P VI +++ SV + + + S D DG + + W F +G+
Sbjct: 761 MNTDTNTDVHVNKEPKAVI---KSDSSVIVEEEINFDGTESKDEDGEIKAYEWDFGDGEK 817

Query: 139 VTSPWTT---NAIGSVQLKAVATDDKGATTEKSVVLTVINPTSENLPPMIEILLLVNDSA 195
T N G ++K TD+ G +S + V+ + +I N+S
Sbjct: 818 SNEAKATHKYNKTGEYEVKLTVTDNNGGINTESKKIKVV---EDKPVEVI------NESE 868

Query: 196 VNVGDSVTITANASDPDTGDSITKVEFYLDSQLIATDNSAPY 237
N +D + + I K + L D S Y
Sbjct: 869 PN-----------NDFEKANQIAKSNMLVKGTLSEEDYSDKY 899


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_2605  TYPE3OMGPROT  29     0.007    Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature.
Length = 607

Score = 29.1 bits (65), Expect = 0.007
Identities = 13/44 (29%), Positives = 25/44 (56%), Gaps = 1/44 (2%)

Query: 79 VTVSSDRIDFKKPIPAGTLAELIARVIHVGNTSLKVEVNIYVED 122
V V+ + K I GT+ + RV+ G+ S ++ +N+++ED
Sbjct: 383 VKVTGKEVAELKGITYGTMLRMTPRVLTQGDKS-EISLNLHIED 425


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Sbal195_2606  HTHFIS  79     2e-19    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.5 bits (196), Expect = 2e-19
Identities = 32/111 (28%), Positives = 54/111 (48%), Gaps = 6/111 (5%)

Query: 8 IIIADDHPLFRNALRQALTTAFEHAQWFEADSAEALQSVL-DVRSIDYDLVLLDLQMPGS 66
I++ADD R L QAL+ A ++ ++ + + D DLV+ D+ MP
Sbjct: 6 ILVADDDAAIRTVLNQALSRAG-----YDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 67 HGYSTLIHLRSHYPDLPVVVISAHEDINTISRAIHYGSSGFIPKSASMETL 117
+ + L ++ PDLPV+V+SA T +A G+ ++PK + L
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTEL 111


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Sbal195_2611  INTIMIN  42     1e-05    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 42.4 bits (99), Expect = 1e-05
Identities = 48/204 (23%), Positives = 80/204 (39%), Gaps = 17/204 (8%)

Query: 30 GGTTPTPGVVTVTLSISNSDSVSVATPAEVKATVVDSKTGPLAGVVVSFKLDNDALGSFT 89
G GV T +++ + ATV + A V VSF + + G+
Sbjct: 552 GQVVDQVGVTDFTADKTSAKADG-TEAITYTATVKKNGV-AQANVPVSFNIVS---GTAV 606

Query: 90 PSTGTQLTDSSGVATVKLDTATLAGAGNVTASVASGASITKGFYSKGDGVVQPGTGNKLK 149
S + T+ SG ATV L + G V+A A S + V+
Sbjct: 607 LSANSANTNGSGKATVTLKSDKP-GQVVVSAKTAEMTSAL-----NANAVIFVDQTKASI 660

Query: 150 LSLQNVQGQTVTKISSAVPGTVSAIYTNGSDEPLVGKVITFTSNLGKFSPQSGTALTNAQ 209
++ + V A+ TV + D+P+ + +TFT+ LGK S + T+
Sbjct: 661 TEIKADKTTAVANGQDAITYTVKVMK---GDKPVSNQEVTFTTTLGKLSNSTEK--TDTN 715

Query: 210 GLAKIAITAGPVAGAGNIIAKVDE 233
G AK+ +T+ G + A+V +
Sbjct: 716 GYAKVTLTST-TPGKSLVSARVSD 738



Score = 40.1 bits (93), Expect = 6e-05
Identities = 64/299 (21%), Positives = 98/299 (32%), Gaps = 40/299 (13%)

Query: 378 TGLPTTNVSAAQPSKVTVTL---VDKDATPLVGKVVSFSSSLGNFLPTKGTALTDSIGRA 434
T SA +T V K+ VSF+ G + + +A T+ G+A
Sbjct: 561 TDFTADKTSAKADGTEAITYTATVKKNGVAQANVPVSFNIVSGTAVLSANSANTNGSGKA 620

Query: 435 SITLTAGSIEGAGEVTASY--GTAKAIVGFVTAGDDIDPIEASPEISFDIYDCNGVAAWD 492
++TL + G V+A T+ V D + NG A
Sbjct: 621 TVTLKSDKP-GQVVVSAKTAEMTSALNANAVIFVDQTKASITEIKADKTTAVANGQDAIT 679

Query: 493 KTLKNFEVCKITDNITNDKPGIIGAKVTRSGSTQALQQVLVTAATTLGAISPNSGTAITN 552
T+K + DKP + +VT + TTLG ++ T T+
Sbjct: 680 YTVKVMK---------GDKP-VSNQEVTFT--------------TTLGK--LSNSTEKTD 713

Query: 553 ADGKAILDLYANGNVGAGEVSLKVKD-ATSTKAFEI---GRVNISLDIKTSVGNNSLPAG 608
+G A + L + G VS +V D A KA E+ + I VG
Sbjct: 714 TNGYAKVTLTST-TPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNIEIVGTGVKGKL 772

Query: 609 GSTIVEVTVFNPDGSLSTGQPFTLEFSSECVAAGKAVIDSPIVTNAGKGYSTYRSTGCS 667
+ ++ N S G+ + S A S VT KG +T
Sbjct: 773 PTVWLQYGQVNLKASGGNGK---YTWRSANPAIASVDASSGQVTLKEKGTTTISVISSD 828


31  Sbal195_2622  Sbal195_2629  Y  N  N  Genomic Island
LocusTag      DNBias  CDNBias     %GCBias  Product
Sbal195_2622       2       30   -0.529360  ferric uptake regulator
Sbal195_2623       3       31    0.301503  N-acetyltransferase GCN5
Sbal195_2624       5       33    0.091121  GreA/GreB family elongation factor
Sbal195_2625       5       36    0.208803  succinyl-CoA synthetase subunit alpha
Sbal195_2626       5       33    0.068675  succinyl-CoA synthetase subunit beta
Sbal195_2627       5       31    0.147757  2-oxoglutarate dehydrogenase, E2 subunit,
Sbal195_2628       5       31   -0.201302  2-oxoglutarate dehydrogenase E1 component
Sbal195_2629       3       28   -0.911886  succinate dehydrogenase iron-sulfur subunit
32  Sbal195_2846  Sbal195_2858  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias     %GCBias  Product
Sbal195_2846      -1       14   -4.966821  chorismate synthase
Sbal195_2847       0       17   -5.994380  N5-glutamine S-adenosyl-L-methionine-dependent
Sbal195_2848       1       19   -6.557534  hypothetical protein
Sbal195_2849       2       20   -6.906972  phosphohistidine phosphatase SixA
Sbal195_2850       0       17   -5.763111  peptidase M16 domain-containing protein
Sbal195_2851       0       20   -5.344457  PAS/PAC and GAF sensor(s)-containing diguanylate
Sbal195_2852       2       15    0.636375  hypothetical protein
Sbal195_2853       2       13    2.447582  hypothetical protein
Sbal195_2854       1       11    2.738795  hypothetical protein
Sbal195_2855       1       11    2.948264  multifunctional fatty acid oxidation complex
Sbal195_2856       0       11    3.756854  3-ketoacyl-CoA thiolase
Sbal195_2857       1       14    3.725259  ATPase
Sbal195_2858       1       14    3.664558  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Sbal195_2857  HTHFIS  33     0.001    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 33.3 bits (76), Expect = 0.001
Identities = 35/139 (25%), Positives = 53/139 (38%), Gaps = 19/139 (13%)

Query: 28 LIALIANG--HLLVEGPPGLAKT---RAVKALCDGVEGDFHRIQ---FTPDLLPADLTG- 78
++A + L++ G G K RA+ G F I DL+ ++L G
Sbjct: 152 VLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGH 211

Query: 79 -----TDIYRSQTGTFEFEAGPIFHNLILADEINRAPAKVQSALLEAMAEGQVT-VGKHS 132
T TG FE G + DEI P Q+ LL + +G+ T VG +
Sbjct: 212 EKGAFTGAQTRSTGRFEQAEG----GTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRT 267

Query: 133 YKLPPLFLVMATQNPLENE 151
+ +V AT L+
Sbjct: 268 PIRSDVRIVAATNKDLKQS 286


33  Sbal195_2871  Sbal195_2930  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias     %GCBias  Product
Sbal195_2871       2       12    1.555118  hypothetical protein
Sbal195_2872       1       12    1.960440  rhodanese domain-containing protein
Sbal195_2873       1       13    1.709516  N-acetyltransferase GCN5
Sbal195_2874       1       14    2.128207  peptidase S8/S53 subtilisin kexin sedolisin
Sbal195_2875      -1       15    2.096277  YaeQ family protein
Sbal195_2876       0       17    1.934402  siroheme synthase
Sbal195_2877       1       22    0.981789  rhodanese domain-containing protein
Sbal195_2878       2       19    0.985298  hypothetical protein
Sbal195_2879       1       20    0.857896  preprotein translocase subunit SecF
Sbal195_2880       0       18    1.056941  preprotein translocase subunit SecD
Sbal195_2881      -1       16    0.433407  preprotein translocase subunit YajC
Sbal195_2882      -1       15   -0.041639  queuine tRNA-ribosyltransferase
Sbal195_2883       0       15    0.632036  S-adenosylmethionine--tRNA
Sbal195_2884       0       18    1.458711  hypothetical protein
Sbal195_2885      -1       18    1.741396  hypothetical protein
Sbal195_2886       0       21    3.019095  pseudouridine synthase
Sbal195_2887       0       21    3.292671  hypothetical protein
Sbal195_2888       1       24    3.809828  transcriptional activator Ogr/delta
Sbal195_2889       1       23    3.705383  late control D family protein
Sbal195_2890       1       20    3.520641  P2 GpU family protein
Sbal195_2891       1       20    3.221688  TP901 family phage tail tape measure protein
Sbal195_2892       2       20    2.923714  P2 GpE family protein
Sbal195_2893       2       19    2.496340  tail E family protein
Sbal195_2894       1       19    2.965705  phage major tail tube protein
Sbal195_2895       1       17    2.653558  tail sheath protein
Sbal195_2896       1       17    2.377970  hypothetical protein
Sbal195_2897       0       17    3.343197  hypothetical protein
Sbal195_2898       0       19   -0.805200  hypothetical protein
Sbal195_2899       2       18   -2.812694  phage tail protein I
Sbal195_2900       2       21   -3.581059  baseplate J family protein
Sbal195_2901       1       23   -4.223745  GPW/gp25 family protein
Sbal195_2902       1       25   -4.085640  phage baseplate assembly protein V
Sbal195_2903       2       25   -3.773522  ATP-binding protein involved in virulence-like
Sbal195_2904       2       21   -1.072086  hypothetical protein
Sbal195_2905       5       22    2.837757  phage virion morphogenesis protein
Sbal195_2906       5       24    3.398469  P2 phage tail completion R family protein
Sbal195_2907       6       28    3.739370  hypothetical protein
Sbal195_2908       5       25    4.477450  peptidoglycan-binding domain-containing protein
Sbal195_2909       5       22    3.476319  hypothetical protein
Sbal195_2910       3       24    3.389561  putative phage-like transmembrane protein
Sbal195_2911       0       24    3.116934  tail X family protein
Sbal195_2912      -1       23    2.659125  head completion protein
Sbal195_2913      -1       21    2.498487  small terminase subunit
Sbal195_2914      -1       19    1.744760  P2 family phage major capsid protein
Sbal195_2915       0       20    1.523189  capsid scaffolding
Sbal195_2916       0       20    1.506330  hypothetical protein
Sbal195_2917       2       23    1.134745  PBSX family phage portal protein
Sbal195_2918       4       22    1.158578  hypothetical protein
Sbal195_2919       5       23    1.246680  hypothetical protein
Sbal195_2920       4       23    1.864027  replication gene A
Sbal195_2921       2       22    2.902558  hypothetical protein
Sbal195_2922       2       20    2.709362  hypothetical protein
Sbal195_2923       0       20    1.319502  hypothetical protein
Sbal195_2924       1       20    0.281837  hypothetical protein
Sbal195_2925       1       25   -5.772962  hypothetical protein
Sbal195_2926       2       27   -7.029775  hypothetical protein
Sbal195_2927       0       25   -6.221745  XRE family transcriptional regulator
Sbal195_2928      -1       20   -4.306560  hypothetical protein
Sbal195_2929       0       19   -3.888481  hypothetical protein
Sbal195_2930       0       15   -3.210001  KAP P-loop domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
Sbal195_2874  SUBTILISIN  170    2e-49    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 170 bits (433), Expect = 2e-49
Identities = 69/217 (31%), Positives = 110/217 (50%), Gaps = 10/217 (4%)

Query: 152 STPWGQTFVGATQLSDSQAG-NRTICIIDSGYDRSHSELGGNNVTGTN--NSGTGNWFEP 208
P G + A + + G + ++D+G D H +L + G N + G+
Sbjct: 21 EIPRGVEMIQAPAVWNQTRGRGVKVAVLDTGCDADHPDLKARIIGGRNFTDDDEGDPEIF 80

Query: 209 GNNNAHGTHVAGTIAAIANNDGVIGVMPNQTANIHVVKVFNESGWGYSSSLVAAVDTCVA 268
+ N HGTHVAGTIAA N +GV+GV P A++ ++KV N+ G G ++ + +
Sbjct: 81 KDYNGHGTHVAGTIAATENENGVVGVAPE--ADLLIIKVLNKQGSGQYDWIIQGIYYAIE 138

Query: 269 NGANVVTMSLGGAGSSTTERNALAAHYNNGVLLIAAAGNAGDSTHS-----YPASYDGVM 323
++++MSLGG A+ + +L++ AAGN GD YP Y+ V+
Sbjct: 139 QKVDIISMSLGGPEDVPELHEAVKKAVASQILVMCAAGNEGDGDDRTDELGYPGCYNEVI 198

Query: 324 SVASVDNHKDHSAFSQYTNQVEISGPGEAILSTVTRG 360
SV +++ + S FS N+V++ PGE ILSTV G
Sbjct: 199 SVGAINFDRHASEFSNSNNEVDLVAPGEDILSTVPGG 235



Score = 63.3 bits (154), Expect = 1e-12
Identities = 19/71 (26%), Positives = 29/71 (40%), Gaps = 7/71 (9%)

Query: 507 NKDYEYYNGTSMATPHVSGVATLVWS-----YHPECSAAQVRNALKMTAEDLGTAGRDNY 561
Y ++GTSMATPHV+G L+ + + + ++ L LG
Sbjct: 234 GGKYATFSGTSMATPHVAGALALIKQLANASFERDLTEPELYAQLIKRTIPLG--NSPKM 291

Query: 562 YGYGLVNAVAA 572
G GL+ A
Sbjct: 292 EGNGLLYLTAV 302


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_2879  SECFTRNLCASE  316    e-110    Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 316 bits (811), Expect = e-110
Identities = 113/309 (36%), Positives = 180/309 (58%), Gaps = 14/309 (4%)

Query: 2 LEILSLKHTVNFLRHALPISIMSAVLVLGSLVSLATNGINWGLDFTGGTVVEMEFTNPVD 61
L+++ K +F R + V+++ S++ G+N+G+DF GGT + E T +D
Sbjct: 5 LKLVPEKTNFDFFRWQWATFGAAIVMMIASVILPLVIGLNFGIDFKGGTTIRTESTTAID 64

Query: 62 LNALRVQLTTPDSEGAIVQNFGSSR------DVLVRLQVKE--------GVKSDVQVKSV 107
+ R L + I+ ++R+Q++E G + V V
Sbjct: 65 VGVYRAALEPLELGDVIISEVRDPSFREDQHVAMIRIQMQEDGQGAEGQGAQGQELVNKV 124

Query: 108 MEAAQKVDPQVQQKRVEFVGPQVGKELAEQGALAVLVALICIMIYVSFRFEWRLAFGSVA 167
A VDP ++ E VGP+V EL ++L A + IM Y+ RFEW+ A G+V
Sbjct: 125 ETALTAVDPALKITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVV 184

Query: 168 ALAHDVIVTLGVFSVFQLEFDLTVLAGLLTVVGYSLNDTIVVFDRIRENFLKMRKSDPEE 227
AL HDV++T+G+F+V QL+FDLT +A LLT+ GYS+NDT+VVFDR+REN +K + +
Sbjct: 185 ALVHDVLLTVGLFAVLQLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRD 244

Query: 228 VVNTSITQTMSRTIITTGTTLVVVVALFLKGGTMIHGFATALLMGIFVGTYSSIYVASFL 287
V+N S+ +T+SRT++T TTL+ +V + + GG +I GF A++ G+F GTYSS+YVA +
Sbjct: 245 VMNLSVNETLSRTVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAKNI 304

Query: 288 AIKLGINRE 296
+ +G++R
Sbjct: 305 VLFIGLDRN 313


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_2880  SECFTRNLCASE  79     6e-18    Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 78.7 bits (194), Expect = 6e-18
Identities = 31/165 (18%), Positives = 80/165 (48%), Gaps = 4/165 (2%)

Query: 434 VSIVEERTIGPSLGAENIQNGVQAMVWGMAVVLLFMLVYYR-GFGLIANIALTANLVMVV 492
+ I ++GP + E + V +++ V++ ++ V + F L A +AL ++++ V
Sbjct: 135 LKITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVVALVHDVLLTV 194

Query: 493 GVMSMIPGAVLTLPGIAGMVLTVGMAVDGNVLIYERIREELRA--GRSVQQAIHEGYGNA 550
G+ +++ L +A ++ G +++ V++++R+RE L ++ ++
Sbjct: 195 GLFAVL-QLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRDVMNLSVNET 253

Query: 551 FSTIADANITTFLTALILFAVGTGAVKGFAVTLMIGIATSMFTAI 595
S +TT L + + G ++GF ++ G+ T ++++
Sbjct: 254 LSRTVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSV 298


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Sbal195_2914  AEROLYSIN  30     0.011    Aerolysin signature.
		>AEROLYSIN#Aerolysin signature.

Length = 493

Score = 30.4 bits (68), Expect = 0.011
Identities = 16/42 (38%), Positives = 21/42 (50%), Gaps = 1/42 (2%)

Query: 79 DTTQSDRQAIDPTDLDALGY-DCTQTNFDTALRYAKIDMWAK 119
D TQSDRQ + A+ D Q+ +D LRY W+K
Sbjct: 211 DVTQSDRQLVKTVVGWAVNDSDTPQSGYDVTLRYDTATNWSK 252


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Sbal195_2916  PF03309  31     0.015    Bvg accessory factor
		>PF03309#Bvg accessory factor

Length = 271

Score = 30.5 bits (69), Expect = 0.015
Identities = 20/85 (23%), Positives = 28/85 (32%), Gaps = 4/85 (4%)

Query: 267 VASGMAIHAKWRQTY-ISTPSSITHDAYPFWTGTLFNRGRPKADRIEI-DVSHSALANGR 324
+ SG HAK Q + I T +T D L + S L R
Sbjct: 16 LISGSGDHAKVVQQWRIRTEPEVTADELALTIDGLIGDDAERLTGASGLSTVPSVLHEVR 75

Query: 325 RCEDGQWRQV--VTVDDAIRKGCNL 347
+ W V V ++ +R G L
Sbjct: 76 VMLEQYWPNVPHVLIEPGVRTGIPL 100


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_2919  FIMREGULATRY  40     5e-08    Escherichia coli: P pili regulatory PapB protein si...
		>FIMREGULATRY#Escherichia coli: P pili regulatory PapB protein signature.
Length = 104

Score = 40.3 bits (94), Expect = 5e-08
Identities = 18/72 (25%), Positives = 33/72 (45%)

Query: 3 TLIQGCESTEQFEILLKLTGITSEDKKNALRAHLVEGLPAKRAYARFHVTQQHFSLALML 62
L+ G S F +L+ ++ I S+ A++ +LV G K ++ + +FS L
Sbjct: 22 VLLPGSMSEMHFFLLIGISSIHSDRVILAMKDYLVGGHSRKEVCEKYQMNNGYFSTTLGR 81

Query: 63 LNKKADLAMQYV 74
L + LA +
Sbjct: 82 LIRLNALAARLA 93


34  Sbal195_2939  Sbal195_2945  Y  N  Y  Genomic Island
LocusTag      DNBias  CDNBias     %GCBias  Product
Sbal195_2939      -2       18   -3.900791  methylated-DNA--protein-cysteine
Sbal195_2940      -2       20   -5.322758  AraC family transcriptional regulator
Sbal195_2941       0       26   -7.802719  ABC transporter-like protein
Sbal195_2942       2       31   -8.890974  integrase family protein
Sbal195_2943       1       29   -7.196426  hypothetical protein
Sbal195_2944       1       25   -5.887161  hypothetical protein
Sbal195_2945       0       21   -5.332193  hypothetical protein
35  Sbal195_2969  Sbal195_2979  Y  N  N  Genomic Island
LocusTag      DNBias  CDNBias     %GCBias  Product
Sbal195_2969       1       15    3.041158  6-phosphogluconate dehydrogenase
Sbal195_2970       1       18    3.391552  small multidrug resistance protein
Sbal195_2971       1       20    3.663290  AMP-dependent synthetase and ligase
Sbal195_2972       3       27    3.879164  MerR family transcriptional regulator
Sbal195_2973       3       27    4.026701  acyl-CoA dehydrogenase domain-containing
Sbal195_2974       3       26    3.815864  propionyl-CoA carboxylase
Sbal195_2975       2       24    2.976714  enoyl-CoA hydratase/isomerase
Sbal195_2976       2       23    2.927655  carbamoyl-phosphate synthase L chain
Sbal195_2977       0       22    1.757552  pyruvate carboxyltransferase
Sbal195_2978       2       17    1.522147  3-oxoacid CoA-transferase subunit A
Sbal195_2979       2       17    1.226080  3-oxoacid CoA-transferase subunit B
36  Sbal195_3014  Sbal195_3041  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias     %GCBias  Product
Sbal195_3014       0       22   -3.150681  response regulator receiver modulated metal
Sbal195_3015       0       26   -3.849834  lipoprotein
Sbal195_3016       0       30   -5.508400  hypothetical protein
Sbal195_3017       1       37   -8.054335  dTDP-glucose 4,6-dehydratase
Sbal195_3018       1       37   -9.916943  phosphoglucosamine mutase
Sbal195_3019       2       39  -11.979145  nucleotide sugar dehydrogenase
Sbal195_3020       5       33  -10.276460  glycosyl transferase family protein
Sbal195_3021       5       33   -9.679667  hypothetical protein
Sbal195_3022       4       29   -8.397338  hypothetical protein
Sbal195_3023       4       23   -5.993946  glycosyl transferase family protein
Sbal195_3024       5       25   -5.131650  hexapaptide repeat-containing transferase
Sbal195_3025       3       28   -5.080479  polysaccharide biosynthesis protein
Sbal195_3026       3       33   -7.507048  DegT/DnrJ/EryC1/StrS aminotransferase
Sbal195_3027       2       36   -8.905204  WxcM-like protein
Sbal195_3028       2       36   -9.113144  WxcM domain-containing protein
Sbal195_3029       3       37   -9.319405  glucose-1-phosphate thymidylyltransferase
Sbal195_3030       1       32   -7.238019  dTDP-glucose-4,6-dehydratase
Sbal195_3031       1       32   -7.435504  glycosyl transferase family protein
Sbal195_3032       0       29   -5.685507  exopolysaccharide biosynthesis polyprenyl
Sbal195_3033      -1       26   -4.736455  lipopolysaccharide biosynthesis protein
Sbal195_3034      -1       25   -4.181557  S23 ribosomal protein
Sbal195_3035       0       26   -2.617578  polysaccharide export protein
Sbal195_3036       2       25   -2.092890  transcriptional acivator RfaH
Sbal195_3037       2       24   -1.720422  amino acid/peptide transporter
Sbal195_3038       2       27   -1.941041  response regulator receiver protein
Sbal195_3039       3       28   -1.757423  VacJ family lipoprotein
Sbal195_3040       4       28   -1.394669  hypothetical protein
Sbal195_3041       2       24   -2.610506  FlhB domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Sbal195_3014  HTHFIS  47     2e-07    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 46.7 bits (111), Expect = 2e-07
Identities = 27/169 (15%), Positives = 57/169 (33%), Gaps = 25/169 (14%)

Query: 24 KVAIIDDEPGIHEVTRFALKNLTLDNRVLQFYSCYSAAEGLALLQTETDIALAFIDVVME 83
+ + DD+ I V AL D R +AA + L DVVM
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVR-----ITSNAATLWRWIAAGD-GDLVVTDVVMP 58

Query: 84 TDHAGLELVQKIRTELNNHSTRIILRTGQ--PGQAPE-------DQVIRDFDINDYKAKT 134
D +L+ +I+ +++ + Q A + D + + FD+ +
Sbjct: 59 -DENAFDLLPRIKKA--RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGII 115

Query: 135 ELTAARLKSCVYTSLRSYRDIK-IIEQSQ------KGMEKVIAASTSVL 176
A K +D ++ +S + + +++ +++
Sbjct: 116 GRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLM 164


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3017  NUCEPIMERASE  188    2e-59    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 188 bits (479), Expect = 2e-59
Identities = 83/361 (22%), Positives = 138/361 (38%), Gaps = 53/361 (14%)

Query: 1 MRVLVTGGAGFIGSALVRMLIEQTTCVVINFDKLTYASDL---ESLASIADSERYHFIQA 57
M+ LVTG AGFIG + + L+E V+ D L D+ ++ + + F +
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGH-QVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKI 59

Query: 58 DIGDRAKLDQVFQDYLPDVVMHLAAESHVDRSINGPAEFIQTNIVGTYTLLEACRCYFQS 117
D+ DR + +F + V V S+ P + +N+ G +LE CR
Sbjct: 60 DLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCR----- 114

Query: 118 LNTDKQKVFRFHHISTDEVYGSLGDTGLFSETTAYD-PSSPYSASKASADHLVRAWHRTY 176
K+ + S+ VYG L FS + D P S Y+A+K + + + + Y
Sbjct: 115 ----HNKIQHLLYASSSSVYG-LNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLY 169

Query: 177 GLPIVITNCSNNYGPFQYPEKLIPLMVLNALAGKQLPVYGNGQQVRDWLYVDDHVRALFL 236
GLP YGP+ P+ + L GK + VY G+ RD+ Y+DD A+
Sbjct: 170 GLPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIR 229

Query: 237 VV------------------TQGTVGETYNIGGTNERSNLEVVHQICDLLEELVPTHAQA 278
+ YNIG ++ ++ + LE+
Sbjct: 230 LQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYI----QALEDA------- 278

Query: 279 LAADGVGFRSLVEHVTDRAGHDVR--YAIDASKIQRELGWQPLESFDSGLRKTVEWIVAR 336
+G + + + G DV A D + +G+ P + G++ V W
Sbjct: 279 -----LGIEAKKNMLPLQPG-DVLETSA-DTKALYEVIGFTPETTVKDGVKNFVNWYRDF 331

Query: 337 Y 337
Y
Sbjct: 332 Y 332


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3030  NUCEPIMERASE  174    7e-54    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 174 bits (443), Expect = 7e-54
Identities = 81/361 (22%), Positives = 141/361 (39%), Gaps = 51/361 (14%)

Query: 1 MKVLVTGGAGFIGSAVVRHIICNTQDSVINVDKLT--YAGNLESLT-SVADNARYTFEKV 57
MK LVTG AGFIG V + ++ V+ +D L Y +L+ + + F K+
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAG-HQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKI 59

Query: 58 DICDRGELDRIFLQYQPDAIMHLAAESHVDRSITGPSDFIQTNIIGTYTLLEAARHYWIQ 117
D+ DR + +F + + V S+ P + +N+ G +LE RH IQ
Sbjct: 60 DLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQ 119

Query: 118 LDSERKAAFRFHHISTDEVYGDLPHPDEHEGQVVNQELPLFTETTPYAPSSPYSASKASS 177
+ S+ VYG N+++P T+ + P S Y+A+K ++
Sbjct: 120 ---------HLLYASSSSVYGL------------NRKMPFSTDDSVDHPVSLYAATKKAN 158

Query: 178 DHLVRAWLRTYGLPTIVTNCSNNYGPYHFPEKLIPLVILNALEGKPLPIYGKGDQIRDWL 237
+ + + YGLP YGP+ P+ + LEGK + +Y G RD+
Sbjct: 159 ELMAHTYSHLYGLPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFT 218

Query: 238 YVEDHARALYKVV------------------TEGKVGETYNIGGHNEKRNLEVVQTICSI 279
Y++D A A+ ++ YNIG + ++ +Q +
Sbjct: 219 YIDDIAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDA 278

Query: 280 LDVLMPKGTPYAEQITYVTDRLGHDRRYAIDASKMSAELNWQPQETFETGLLKTVEWYLA 339
L + K + + G + D + + + P+ T + G+ V WY
Sbjct: 279 LGIEAKK--------NMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRD 330

Query: 340 N 340

Sbjct: 331 F 331


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
Sbal195_3034  ECOLIPORIN  26     0.032    E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 26.4 bits (58), Expect = 0.032
Identities = 23/98 (23%), Positives = 46/98 (46%), Gaps = 13/98 (13%)

Query: 4 QKLEVWQ--LSYELSSSIYIATKDLRDWGFRDQITRSGLSVPSN---IAEGMERYGAKEQ 58
K + W L Y+ +++IY+AT + +T G + +A + + Q
Sbjct: 243 DKADAWTAGLKYD-ANNIYLATM----YSETRNMTPYGKTDKGYDGGVANKTQNFEVTAQ 297

Query: 59 IQFLYIAKASLAELITQAMIGKDIGYLEPNYVDELLIK 96
QF + + +++ L+++ GKD+ Y N D+ L+K
Sbjct: 298 YQFDFGLRPAVSFLMSK---GKDLTYNNVNGDDKDLVK 332


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Sbal195_3038  HTHFIS  90     8e-22    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 89.9 bits (223), Expect = 8e-22
Identities = 29/114 (25%), Positives = 49/114 (42%), Gaps = 4/114 (3%)

Query: 8 VLLVEDDPVFRQIVASFLDTRGAQVTQACDGEEGLSLFKSQHFDIVLADLSMPKLGGLDM 67
+L+ +DD R ++ L G V + + D+V+ D+ MP D+
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 68 LKEMTRLAPLVPSVVISGNNVMADVVEALRIGASDYLVKPVSDLFIIEQAIKQS 121
L + + P +P +V+S N ++A GA DYL KP F + + I
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKP----FDLTELIGII 115


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3039  VACJLIPOPROT  229    1e-77    VacJ lipoprotein signature.
		>VACJLIPOPROT#VacJ lipoprotein signature.

Length = 251

Score = 229 bits (586), Expect = 1e-77
Identities = 85/222 (38%), Positives = 128/222 (57%), Gaps = 4/222 (1%)

Query: 44 PRDPFEGFNRAMWDFNYLFLDRYLYRPVAHGYNDYIPMPAKTGVNNFVQNLEEPSSLVNN 103
DP EGFNR M++FN+ LD Y+ RPVA + DY+P PA+ G++NF NLEEP+ +VN
Sbjct: 28 RSDPLEGFNRTMYNFNFNVLDPYIVRPVAVAWRDYVPQPARNGLSNFTGNLEEPAVMVNY 87

Query: 104 VLQGKWGWAANAGGRFTINSTVGLLGVIDVADMMGMSRKQDE---FNEVLGYYGVPNGPY 160
LQG RF +N+ +G+ G IDVA M ++ E F LG+YGV GPY
Sbjct: 88 FLQGDPYQGMVHFTRFFLNTILGMGGFIDVAGMANPKLQRTEPHRFGSTLGHYGVGYGPY 147

Query: 161 FMAPFAGPYVVRELASDWVDGLYFPLSELTIWQTIVKWGLKNLHSRASAIDQERLVDNAL 220
PF G + +R+ D D LY LS LT ++ KW L+ + +RA +D + L+ +
Sbjct: 148 VQLPFYGSFTLRDDGGDMADALYPVLSWLTWPMSVGKWTLEGIETRAQLLDSDGLLRQSS 207

Query: 221 DPYAFVKDAYLQHMDYKVYDGNV-PQKQDDDELLDQYMQELE 261
DPY V++AY Q D+ G + PQ+ + + + +++++
Sbjct: 208 DPYIMVREAYFQRHDFIANGGELKPQENPNAQAIQDDLKDID 249


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3040  CHANLCOLICIN  33     0.005    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 33.1 bits (75), Expect = 0.005
Identities = 68/359 (18%), Positives = 129/359 (35%), Gaps = 26/359 (7%)

Query: 278 GIPLSNTNKGPVTNLNGSSGSSSSLNSQTQATQATQATQATQATQATQATQATQATQATQ 337
G+P + + +T LNG+ S S + ++++ A AT A +T + TQA Q
Sbjct: 11 GVPYDDKGQVIITLLNGTPDGSGSGGGGGKGGSKSESSAAIHAT-AKWSTAQLKKTQAEQ 69

Query: 338 ATQATQATQATQATQATQATQATQ-------------ATQATQATQATQATQATQATQAT 384
A +A A +A QA TQ A++ AT+ A A +
Sbjct: 70 AARAKAAAEA-QAKAKANRDALTQRLKDIVNEALRHNASRTPSATELAHANNAAMQAEDE 128

Query: 385 QATQATQATQATQATQATQATQATQATQATQATQATQATQATQATQATQATQATQATKTN 444
+ A +A + +A A +A Q + + + + + + +A + A +
Sbjct: 129 RLRLAKAEEKARKEAEA--AEKAFQEAEQRRKEIEREKAETERQLKLAEAEEKRLAALSE 186

Query: 445 DAIPVKVTMPTMLSARGSNQSLATPSVLINSTQSQINQPSSATATIEQTTRNSSPLGFSL 504
+A V++ + +A+ + +NS S A RN L
Sbjct: 187 EAKAVEIAQKKLSAAQSEVVKMDGEIKTLNSRLSSSIHARDAEMKTLAGKRN------EL 240

Query: 505 ATASLNVPSQDPKVNNVLVMQN-PKSLAPTPPLTNVATNIGAQNEEAVEEI-AAVSPKNI 562
A AS D V + N P P T G EE +++ A+ + N
Sbjct: 241 AQASAKYKELDELVKKLSPRANDPLQNRPFFEATRRRVGAGKIREEKQKQVTASETRINR 300

Query: 563 LGLNTQKNERHGNDTKTDSTMKVADVLQKAFN-KAGALPVELSRSNNSSNLASELLKHL 620
+ + + ++ + + +A V + N K + S+ ++ + + L
Sbjct: 301 INADITQIQKAISQVSNNRNAGIARVHEAEENLKKAQNNLLNSQIKDAVDATVSFYQTL 359


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3041  TYPE3IMSPROT  56     7e-13    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 56.3 bits (136), Expect = 7e-13
Identities = 16/93 (17%), Positives = 34/93 (36%), Gaps = 9/93 (9%)

Query: 10 AVALSYDGRN--APKIVATGEGLIAEEIIALAKANGVYIHQDPHLSHFL-QLLELGEEIP 66
A+ + Y P + + + +A+ GV I Q L+ L + IP
Sbjct: 268 AIGILYKRGETPLPLVTFKYTDAQVQTVRKIAEEEGVPILQRIPLARALYWDALVDHYIP 327

Query: 67 KELYLLIAELIAFVYMLDGKFPEQWNNMHQKIV 99
E AE++ ++ + + H +++
Sbjct: 328 AEQIEATAEVLRWLERQNIE------KQHSEML 354


37  Sbal195_3070  Sbal195_3084  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_3070  1       27       -4.571909  Fis family two component sigma54 specific
Sbal195_3071  2       32       -5.939883  PAS/PAC sensor signal transduction histidine
Sbal195_3072  3       35       -6.623656  sigma-54 dependent transcriptional regulator
Sbal195_3073  4       42       -8.100477  flagellar protein FliS
Sbal195_3074  3       34       -6.493380  hypothetical protein
Sbal195_3075  3       31       -5.567332  flagellar hook-associated 2 domain-containing
Sbal195_3076  1       24       -2.805707  flagellar protein FlaG protein
Sbal195_3077  0       23       -1.985680  flagellin domain-containing protein
Sbal195_3078  1       21       -2.062662  transposase, IS4
Sbal195_3080  1       22       -2.324789  transposase IS4 family protein
Sbal195_3081  2       29       -3.614713  transposase, IS4
Sbal195_3082  4       33       -4.422804  integrase catalytic subunit
Sbal195_3083  3       30       -4.180946  transposase IS3/IS911 family protein
Sbal195_3084  2       21       -2.410715  flagellin domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Sbal195_3070  HTHFIS  455    e-160    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 455 bits (1171), Expect = e-160
Identities = 167/483 (34%), Positives = 250/483 (51%), Gaps = 42/483 (8%)

Query: 1 MSEAKLLLVEDDASLREALLDTLMLAQYECIDVASGEDAILALKQHQFDLVISDVQMQGI 60
M+ A +L+ +DDA++R L L A Y+ ++ + DLV++DV M
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 GGLGLLNFLQQHHPKLPVLLMTAYATIGSAVDAIKLGAVDYLAKPFAPEVLLNQVSRYLP 120
LL +++ P LPVL+M+A T +A+ A + GA DYL KPF L+ + R L
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 121 LKQNVDQPVVAD-----------EKSLALLALAQRVAASDASVMILGPSGSGKEVLARYI 169
+ + D + + R+ +D ++MI G SG+GKE++AR +
Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARAL 180

Query: 170 HQHSSRADQAFVAINCAAIPENMLEATLFGYEKGAFTGAYQACPGKFEQAQGGTLLLDEI 229
H + R + FVAIN AAIP +++E+ LFG+EKGAFTGA G+FEQA+GGTL LDEI
Sbjct: 181 HDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEI 240

Query: 230 SEMDLGLQAKLLRVLQEREVERLGGRKIIKLDVRVLATSNRDLKAVVAAGGFREDLYYRI 289
+M + Q +LLRVLQ+ E +GGR I+ DVR++A +N+DLK + G FREDLYYR+
Sbjct: 241 GDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRL 300

Query: 290 NVFPLAWPALSQRPADILPLARHLLVKHAKALNVADVPELDENARRRLLSHRWPGNVREL 349
NV PL P L R DI L RH + + A+ + V D+ A + +H WPGNVREL
Sbjct: 301 NVVPLRLPPLRDRAEDIPDLVRHFV-QQAEKEGLD-VKRFDQEALELMKAHPWPGNVREL 358

Query: 350 DNVIQRALILRAGQVITANDIIIDAQDVILGA--------------------------ED 383
+N+++R L VIT I + + I +
Sbjct: 359 ENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFAS 418

Query: 384 LDQFVAEPDGLGEELKAQEHVIILETLNQCQGSRKLVAEKLGISARTLRYKMARMRDMGI 443
+ L E+ +IL L +G++ A+ LG++ TLR K +R++G+
Sbjct: 419 FGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKK---IRELGV 475

Query: 444 QLP 446
+
Sbjct: 476 SVY 478


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Sbal195_3071  PF06580  34     7e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 34.1 bits (78), Expect = 7e-04
Identities = 19/95 (20%), Positives = 37/95 (38%), Gaps = 19/95 (20%)

Query: 256 LVMNSIEAGAT------EIRIQAKEEGDQLLLNVIDNGKGLDANMQQKVLEPFFTTKSQG 309
LV N I+ G +I ++ ++ + L V + G N + +
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTK------------ES 310

Query: 310 TGLGLA-VVQSVVRNHGGQLQLSCLPNKGCTVSLV 343
TG GL V + + +G + Q+ +G ++V
Sbjct: 311 TGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMV 345


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Sbal195_3072  HTHFIS  433    e-151    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 433 bits (1114), Expect = e-151
Identities = 171/481 (35%), Positives = 262/481 (54%), Gaps = 21/481 (4%)

Query: 7 RILLIGPSSERLNRLCCIFDFLGEQIAQI-DAEKLSASLQDTRFRALVILTDVMDADA-- 63
IL+ + L G + +A L + +++TDV+ D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGD--LVVTDVVMPDENA 62

Query: 64 ---LKNIAGQHPWQPMLLL---GNVDDLQVSNILG---NIEEPLTYPQLTELLHFCQVFG 114
L I P P+L++ ++ G + +P +L ++
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 115 QVKRPQVPTSANQTKLFRSLVGRSDGIANVRHLINQVATSEATVLVLGQSGTGKEVVARN 174
+ + ++ + LVGRS + + ++ ++ ++ T+++ G+SGTGKE+VAR
Sbjct: 123 KRRPSKLEDDSQD---GMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARA 179

Query: 175 IHYLSERRDGPFIPVNCGAIPPELLESELFGHEKGSFTGAICSRKGRFELAEGGTLFLDE 234
+H +RR+GPF+ +N AIP +L+ESELFGHEKG+FTGA GRFE AEGGTLFLDE
Sbjct: 180 LHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDE 239

Query: 235 IGDMPLQMQVKLLRVLQERVFERVGGTKTINADVRVVAATHRDLETMISVNEFREDLYYR 294
IGDMP+ Q +LLRVLQ+ + VGG I +DVR+VAAT++DL+ I+ FREDLYYR
Sbjct: 240 IGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYR 299

Query: 295 LNVFPIEMPALCDRKDDVPLLLQELVSRVYNEGRGKVRFTQRAIESLKEHAWSGNVRELS 354
LNV P+ +P L DR +D+P L++ V + EG RF Q A+E +K H W GNVREL
Sbjct: 300 LNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGNVRELE 359

Query: 355 NLVERLTILYPGGLVDVNDLPVKYRHIDVPEYCVEMSEEQQERDALASIFSDEEPVEIPE 414
NLV RLT LYP ++ + + R ++P+ +E + + +++ EE +
Sbjct: 360 NLVRRLTALYPQDVITREIIENELRS-EIPDSPIEKAAARSGSLSISQAV--EENMRQYF 416

Query: 415 TRFPSELPPEGVNLKDLLAELEIDMIRQALELQDNVVARAAEMLGIRRTTLVEKMRKYGM 474
F LPP G+ +LAE+E +I AL +AA++LG+ R TL +K+R+ G+
Sbjct: 417 ASFGDALPPSGL-YDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475

Query: 475 T 475
+
Sbjct: 476 S 476


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Sbal195_3077  FLAGELLIN  139    2e-40    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 139 bits (352), Expect = 2e-40
Identities = 94/270 (34%), Positives = 129/270 (47%), Gaps = 9/270 (3%)

Query: 2 AITVNTNVTSMKAQKNLNASGNALATSMERLSSGLRINSAKDDAAGLAISNRLNSQVRGL 61
A +NTN S+ Q NLN S ++L++++ERLSSGLRINSAKDDAAG AI+NR S ++GL
Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60

Query: 62 DVGMRNANDGISVAQIAEGAMQEQTNMLQRMRDLSVQAVNGANSTSDRDALQAEIDQLAL 121
RNANDGIS+AQ EGA+ E N LQR+R+LSVQA NG NS SD ++Q EI Q
Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120

Query: 122 EITAISSTTAFGDTKLLDSSFAGKSFQVGHQEGENISISISGTNATALGVNAL------- 174
EI +S+ T F K+L QVG +GE I+I + + +LG++
Sbjct: 121 EIDRVSNQTQFNGVKVLSQDN-QMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKE 179

Query: 175 -AVSTDILASTATGAIDDAIKAIDTQRAKLGATQNRLSHNISNSANTQANVADAKSRIVD 233
V + D + R + + + A D
Sbjct: 180 ATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTD 239

Query: 234 VDFAKETSQMTKNQVLQQTGSAMLAQANQL 263
+ K + A A +
Sbjct: 240 DAENNTAVDLFKTTKSTAGTAEAKAIAGAI 269



Score = 83.2 bits (205), Expect = 3e-20
Identities = 57/265 (21%), Positives = 97/265 (36%)

Query: 7 TNVTSMKAQKNLNASGNALATSMERLSSGLRINSAKDDAAGLAISNRLNSQVRGLDVGMR 66
N T++ K ++ + D G+ + + G
Sbjct: 242 ENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVST 301

Query: 67 NANDGISVAQIAEGAMQEQTNMLQRMRDLSVQAVNGANSTSDRDALQAEIDQLALEITAI 126
N +A+ ++ + N D ++ A
Sbjct: 302 TINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEAN 361

Query: 127 SSTTAFGDTKLLDSSFAGKSFQVGHQEGENISISISGTNATALGVNALAVSTDILASTAT 186
++ + + + + + + +N A + +
Sbjct: 362 NAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPL 421

Query: 187 GAIDDAIKAIDTQRAKLGATQNRLSHNISNSANTQANVADAKSRIVDVDFAKETSQMTKN 246
+ID A+ +D R+ LGA QNR I+N NT N+ A+SRI D D+A E S M+K
Sbjct: 422 ASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKA 481

Query: 247 QVLQQTGSAMLAQANQLPQVALSLL 271
Q+LQQ G+++LAQANQ+PQ LSLL
Sbjct: 482 QILQQAGTSVLAQANQVPQNVLSLL 506


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Sbal195_3084  FLAGELLIN  144    7e-42    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 144 bits (363), Expect = 7e-42
Identities = 101/270 (37%), Positives = 136/270 (50%), Gaps = 9/270 (3%)

Query: 2 AITVNTNVTSMKAQKNLNASGNALATSMERLSSGLRINSAKDDAAGLAISNRLNSQVRGL 61
A +NTN S+ Q NLN S ++L++++ERLSSGLRINSAKDDAAG AI+NR S ++GL
Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60

Query: 62 DVGMRNANDGISVAQIAEGAMQEQTNMLQRMRDLSVQAVNGANSTSDKDAIQAEIDQLAL 121
RNANDGIS+AQ EGA+ E N LQR+R+LSVQA NG NS SD +IQ EI Q
Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120

Query: 122 EITAISNTTAFGDTKLLSGGFTAKNFQVGHQEGENISISISGTDASTLGVEGLLVSSDGA 181
EI +SN T F K+LS QVG +GE I+I + D +LG++G V+
Sbjct: 121 EIDRVSNQTQFNGVKVLSQDNQ-MKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKE 179

Query: 182 AS--------TSIGLIDTAIKTIDTQRAKLGATQNRLSHNISNSANTQSNVADAKSRIVD 233
A+ ++ DT + R + + + A D
Sbjct: 180 ATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTD 239

Query: 234 VDFAKETSAMTKNQVLQQTGSAMLAQANQL 263
+ K + A A +
Sbjct: 240 DAENNTAVDLFKTTKSTAGTAEAKAIAGAI 269



Score = 87.4 bits (216), Expect = 1e-21
Identities = 57/265 (21%), Positives = 97/265 (36%)

Query: 7 TNVTSMKAQKNLNASGNALATSMERLSSGLRINSAKDDAAGLAISNRLNSQVRGLDVGMR 66
N T++ K ++ + D G+ + + G
Sbjct: 242 ENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVST 301

Query: 67 NANDGISVAQIAEGAMQEQTNMLQRMRDLSVQAVNGANSTSDKDAIQAEIDQLALEITAI 126
N +A+ ++ + N D ++ A
Sbjct: 302 TINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEAN 361

Query: 127 SNTTAFGDTKLLSGGFTAKNFQVGHQEGENISISISGTDASTLGVEGLLVSSDGAASTSI 186
+ + +TA + + ++ + + +
Sbjct: 362 NAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPL 421

Query: 187 GLIDTAIKTIDTQRAKLGATQNRLSHNISNSANTQSNVADAKSRIVDVDFAKETSAMTKN 246
ID+A+ +D R+ LGA QNR I+N NT +N+ A+SRI D D+A E S M+K
Sbjct: 422 ASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKA 481

Query: 247 QVLQQTGSAMLAQANQLPQVALSLL 271
Q+LQQ G+++LAQANQ+PQ LSLL
Sbjct: 482 QILQQAGTSVLAQANQVPQNVLSLL 506


38  Sbal195_3097  Sbal195_3125  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_3097  0       23       -3.303977  flagellar hook protein FlgE
Sbal195_3098  0       21       -3.718293  flagellar basal body rod modification protein
Sbal195_3099  0       22       -4.074902  flagellar basal body rod protein FlgC
Sbal195_3100  1       21       -4.430760  flagellar basal body rod protein FlgB
Sbal195_3101  1       20       -4.221471  protein-glutamate O-methyltransferase
Sbal195_3102  1       20       -4.241381  response regulator receiver modulated CheW
Sbal195_3103  0       21       -3.557986  flagellar basal body P-ring biosynthesis protein
Sbal195_3104  2       23       -4.318682  anti-sigma-28 factor FlgM
Sbal195_3105  2       25       -5.633505  FlgN family protein
Sbal195_3106  1       25       -5.122263  hypothetical protein
Sbal195_3107  2       25       -5.626447  hypothetical protein
Sbal195_3108  3       24       -5.265515  hypothetical protein
Sbal195_3109  3       21       -5.079434  *hypothetical protein
Sbal195_3110  0       16       -4.449142  hypothetical protein
Sbal195_3111  -1      16       -3.156216  hypothetical protein
Sbal195_3112  0       16       -3.737656  transposase IS3/IS911 family protein
Sbal195_3113  0       18       -3.775423  integrase catalytic subunit
Sbal195_3114  0       21       -4.223885  phage integrase family protein
Sbal195_3115  1       23       -4.748358  hypothetical protein
Sbal195_3116  1       24       -4.726049  hypothetical protein
Sbal195_3117  2       24       -4.733409  NAD-dependent epimerase/dehydratase
Sbal195_3118  1       24       -5.365088  type 11 methyltransferase
Sbal195_3119  1       23       -6.426375  DegT/DnrJ/EryC1/StrS aminotransferase
Sbal195_3120  1       23       -6.894429  hypothetical protein
Sbal195_3121  2       25       -7.318296  N-acylneuraminate-9-phosphate synthase
Sbal195_3122  2       24       -7.398612  UDP-N-acetylglucosamine 2-epimerase
Sbal195_3123  1       24       -7.153841  hypothetical protein
Sbal195_3124  0       23       -5.696052  hypothetical protein
Sbal195_3125  0       16       -3.583945  N-acylneuraminate cytidylyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
Sbal195_3097  FLGHOOKAP1  40     2e-05    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 39.9 bits (93), Expect = 2e-05
Identities = 15/35 (42%), Positives = 22/35 (62%)

Query: 2 SFNIALSGISAAQKDLNTTANNIANANTIGFKESR 36
N A+SG++AAQ LNT +NNI++ N G+
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQT 37



Score = 37.6 bits (87), Expect = 1e-04
Identities = 12/49 (24%), Positives = 25/49 (51%)

Query: 405 SISSSALEQSNIDLTTELVDLISAQRNFQANSRTLEVNNTLQQTVLQIR 453
+S+ S ++L E +L Q+ + AN++ L+ N + ++ IR
Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
Sbal195_3099  FLGHOOKAP1  33     3e-04    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 33.0 bits (75), Expect = 3e-04
Identities = 9/38 (23%), Positives = 18/38 (47%)

Query: 99 NVNVMEEMADMISASRSYQMNVQVAEAAKSMLQQTLGM 136
VN+ EE ++ + Y N QV + A ++ + +
Sbjct: 508 GVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545



Score = 29.9 bits (67), Expect = 0.003
Identities = 16/67 (23%), Positives = 29/67 (43%), Gaps = 6/67 (8%)

Query: 5 SIFDVAGSGMSAQSVRLNTTASNIANADSVSSSVDKTYRSRHPIFEAEMAKAQSQQQASQ 64
S+ + A SG++A LNT ++NI++ + Y + I + +
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYN------VAGYTRQTTIMAQANSTLGAGGWVGN 55

Query: 65 GVAVKGI 71
GV V G+
Sbjct: 56 GVYVSGV 62


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Sbal195_3102  HTHFIS  61     1e-12    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 61.4 bits (149), Expect = 1e-12
Identities = 23/128 (17%), Positives = 52/128 (40%), Gaps = 12/128 (9%)

Query: 180 HIMVIDDSAVARKQIIRSLESLNLQIDTAKDGREALDKLKEIAKEMDNVADEIPLIISDI 239
I+V DD A R + ++L + + + A + L+++D+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIA---------AGDGDLVVTDV 55

Query: 240 EMPEMDGYTLTAEIRDDPKLKHIKVVLHTSLSGVFNQAMVQKVGANDFIAK-FNPDELAA 298
MP+ + + L I+ + V++ ++ + + GA D++ K F+ EL
Sbjct: 56 VMPDENAFDLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIG 113

Query: 299 AVNKHLSL 306
+ + L+
Sbjct: 114 IIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Sbal195_3112  HTHFIS  26     0.043    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 25.6 bits (56), Expect = 0.043
Identities = 10/59 (16%), Positives = 21/59 (35%), Gaps = 6/59 (10%)

Query: 7 HKSYPQAFKDEAVLMVLEQ-GYSVADAAKSLGVSTSLLYNWKEKHQALQQGITLEESER 64
+ + +L L + AA LG++ + L + G+++ S R
Sbjct: 429 YDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIREL-----GVSVYRSSR 482


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3117  NUCEPIMERASE  189    4e-60    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 189 bits (482), Expect = 4e-60
Identities = 75/332 (22%), Positives = 132/332 (39%), Gaps = 29/332 (8%)

Query: 3 KVLVTGADGFIGSHLVEMLVAQGYQVRALSQYNSFNYWGWLEN----IDCLDEVEVICGD 58
K LVTGA GFIG H+ + L+ G+QV + N + Y L+ + + D
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDY-YDVSLKQARLELLAQPGFQFHKID 60

Query: 59 IRDPHFCKHLCKD--IDVIYHLAALIAIPYSYIAPDSYLDTNAKGTLNICQAALENNVSR 116
+ D L + ++ +A+ YS P +Y D+N G LNI + N +
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 117 VIHTSTSEVYGTAKYVPIDEQHPL-QPQSPYSASKMAADAMAMSFHNSFELPLTIARPFN 175
+++ S+S VYG + +P + P S Y+A+K A + MA ++ + + LP T R F
Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFFT 180

Query: 176 TYGPRQSARAVIPTIISQIAAGATQIKLGDISPTRDFNYVLDTCRGFIALA---AHDNCI 232
YGP + + G + RDF Y+ D I L H +
Sbjct: 181 VYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHADTQ 240

Query: 233 --------------GETLNISSNYEISIEDTLNIIKQNMHSDVEFITDDARLRPQQSEVF 278
NI ++ + + D + ++ + + + Q +V
Sbjct: 241 WTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLP----LQPGDVL 296

Query: 279 RLWGDNSKIKTLTGYQPQFDIHIGLKETITWF 310
D + + G+ P+ + G+K + W+
Sbjct: 297 ETSADTKALYEVIGFTPETTVKDGVKNFVNWY 328


39  Sbal195_3372  Sbal195_3417  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_3372  0       19       -3.232057  CRISPR-associated Csy4 family protein
Sbal195_3373  -1      22       -4.157268  CRISPR-associated Csy3 family protein
Sbal195_3374  0       25       -5.213398  CRISPR-associated Csy2 family protein
Sbal195_3375  -1      25       -5.527500  CRISPR-associated Csy1 family protein
Sbal195_3376  -1      24       -5.404147  CRISPR-associated helicase Cas3 family protein
Sbal195_3377  1       25       -6.428380  CRISPR-associated Cas1 family protein
Sbal195_3378  3       28       -7.278515  hypothetical protein
Sbal195_3379  3       29       -6.935694  hypothetical protein
Sbal195_3380  4       28       -5.789386  XRE family transcriptional regulator
Sbal195_3381  2       27       -5.145328  HipA domain-containing protein
Sbal195_3382  4       28       -6.308418  hypothetical protein
Sbal195_3383  3       27       -6.531949  hypothetical protein
Sbal195_3384  2       32       -5.763689  hypothetical protein
Sbal195_3385  1       30       -4.897886  type II secretion system protein E
Sbal195_3386  3       18       -1.502751  hypothetical protein
Sbal195_3387  2       20       -0.735225  hypothetical protein
Sbal195_3388  4       30       3.673982   hypothetical protein
Sbal195_3389  4       34       5.537194   hypothetical protein
Sbal195_3390  3       35       5.712291   XRE family transcriptional regulator
Sbal195_3391  3       35       5.762807   integrase family protein
Sbal195_3392  4       35       6.300212   hypothetical protein
Sbal195_3393  3       31       5.131838   replication P family protein
Sbal195_3394  2       23       2.661797   putative replication protein
Sbal195_3395  3       24       2.915744   hypothetical protein
Sbal195_3396  2       23       2.346889   hypothetical protein
Sbal195_3397  1       21       2.119911   hypothetical protein
Sbal195_3398  1       20       1.191093   XRE family transcriptional regulator
Sbal195_3399  2       24       2.573840   XRE family transcriptional regulator
Sbal195_3400  4       33       4.362026   hypothetical protein
Sbal195_3401  3       32       3.669436   hypothetical protein
Sbal195_3402  3       29       3.437634   hypothetical protein
Sbal195_3403  3       29       3.830114   hypothetical protein
Sbal195_3404  3       32       4.138634   hypothetical protein
Sbal195_3405  1       23       -0.613495  hypothetical protein
Sbal195_3406  -1      18       -1.960382  hypothetical protein
Sbal195_3407  1       27       -2.074411  hypothetical protein
Sbal195_3408  0       20       -2.430787  GP46
Sbal195_3409  0       22       -2.897284  hypothetical protein
Sbal195_3410  0       20       -2.464703  integrase family protein
Sbal195_3411  1       23       -1.804787  lipoprotein NlpI
Sbal195_3412  4       29       -0.803471  polynucleotide phosphorylase/polyadenylase
Sbal195_3413  4       24       -0.477516  diguanylate cyclase/phosphodiesterase
Sbal195_3414  6       29       -0.259435  30S ribosomal protein S15
Sbal195_3415  5       28       -0.245173  tRNA pseudouridine synthase B
Sbal195_3416  6       30       -0.483667  ribosome-binding factor A
Sbal195_3417  4       26       -0.130347  translation initiation factor IF-2
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
Sbal195_3385  SALSPVBPROT  31     0.016    Salmonella virulence plasmid 65kDa B protein signature.
		>SALSPVBPROT#Salmonella virulence plasmid 65kDa B protein signature.

Length = 591

Score = 31.3 bits (70), Expect = 0.016
Identities = 24/89 (26%), Positives = 45/89 (50%), Gaps = 12/89 (13%)

Query: 604 SEILKKDTSVIAILIDSALMQSTPAEWLGKLREQLAWSGTPVLFLIPPNQNETKILLNKF 663
S++LK+ T++ I+ID A M ++P + AW +L + ++ +IL +
Sbjct: 481 SDVLKEYTTIGNIIIDKAFMSTSPDK---------AWINDTILNIYLEKGHKGRILGDV- 530

Query: 664 HAHAMEYDENMTPPTEIVNKLQSILTKGT 692
AH E + PP + K++SI+ G+
Sbjct: 531 -AHFKGEAEMLFPPNTKL-KIESIVNCGS 557


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3386  BCTERIALGSPG  46     4e-08    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 45.6 bits (108), Expect = 4e-08
Identities = 28/98 (28%), Positives = 43/98 (43%), Gaps = 5/98 (5%)

Query: 7 RKNNNRGFNLLEIMVVVAIIGILAVVAVPLYKDYIIRAQVTEAFVFADAERIKVIEKRIE 66
+ RGF LLEIMVV+ IIG+LA + VP +A +A I +E ++
Sbjct: 3 ATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKA-----VSDIVALENALD 57

Query: 67 STNVDIATFSEPKVHMTSLMWVPVINNQPVENSVIGYI 104
+D + + SL+ P + + GYI
Sbjct: 58 MYKLDNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYI 95


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3387  BCTERIALGSPG  29     0.007    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 29.5 bits (66), Expect = 0.007
Identities = 19/78 (24%), Positives = 37/78 (47%), Gaps = 8/78 (10%)

Query: 142 IVIISVFIVIVTSFFYAKQDKVIAPESALAQQLMVVVNGIEKYRLENNKTP---EKLSDL 198
IVII V +V ++K + A++ ++ + N ++ Y+L+N+ P + L L
Sbjct: 19 IVIIGVLASLVVPNLMGNKEKADK-QKAVSD-IVALENALDMYKLDNHHYPTTNQGLESL 76

Query: 199 LEFPR---EAVEWRIDQY 213
+E P A + + Y
Sbjct: 77 VEAPTLPPLAANYNKEGY 94


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3411  SYCDCHAPRONE  30     0.010    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 29.5 bits (66), Expect = 0.010
Identities = 13/71 (18%), Positives = 21/71 (29%)

Query: 69 NEQRARFHYDRGVIYDSVGLRLLARIDFMQALKLQPDLADAYNFLGIYYTQEGEYASAYE 128
+ +RF G ++G LA + + Q+GE A A
Sbjct: 66 DHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAES 125

Query: 129 AFDGVLELAPN 139
EL +
Sbjct: 126 GLFLAQELIAD 136


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Sbal195_3417  TCRTETOQM  72     5e-15    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 72.2 bits (177), Expect = 5e-15
Identities = 51/202 (25%), Positives = 78/202 (38%), Gaps = 30/202 (14%)

Query: 387 IMGHVDHGKTSLLDYIRRAKVAAGEAG------------------GITQHIGAYHVETEN 428
++ HVD GKT+L + + A E G GIT G + EN
Sbjct: 8 VLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQWEN 67

Query: 429 GMITFLDTPGHAAFTAMRARGAKATDIVVLVVAADDGVMPQTIEAIQHAKAGNVPLIVAV 488
+ +DTPGH F A R D +L+++A DGV QT + +P I +
Sbjct: 68 TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIPTIFFI 127

Query: 489 NKMDKPEADIDRV----KSELSQHGVMS-------EDWGGDNMFAFVSAKTGEGVDELLE 537
NK+D+ D+ V K +LS V+ + + EG D+LLE
Sbjct: 128 NKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEGNDDLLE 187

Query: 538 GILLQAEVLELKAVRDGMAAGV 559
+ + LE + +
Sbjct: 188 K-YMSGKSLEALELEQEESIRF 208


40  Sbal195_3475  Sbal195_3503  Y  N  Y  Genomic Island
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_3475  0       26       -4.144293  transposase IS4 family protein
Sbal195_3476  0       26       -5.389556  LysR family transcriptional regulator
Sbal195_3477  -1      24       -4.631140  glyoxalase/bleomycin resistance
Sbal195_3478  -1      24       -4.493336  XRE family transcriptional regulator
Sbal195_3479  0       21       -3.938358  HipA domain-containing protein
Sbal195_3480  -1      20       -3.685265  NUDIX hydrolase
Sbal195_3481  0       18       -3.406266  N-acetyltransferase GCN5
Sbal195_3482  2       16       -1.336643  PepSY-associated TM helix domain-containing
Sbal195_3483  1       19       0.441754   hypothetical protein
Sbal195_3484  1       21       1.857342   TonB-dependent receptor
Sbal195_3485  3       31       5.235694   hypothetical protein
Sbal195_3486  3       31       5.344701   XRE family transcriptional regulator
Sbal195_3487  3       32       5.507654   integrase family protein
Sbal195_3488  3       34       6.224102   hypothetical protein
Sbal195_3489  3       33       6.001435   replication P family protein
Sbal195_3490  1       24       4.024562   hypothetical protein
Sbal195_3491  2       27       0.955406   hypothetical protein
Sbal195_3492  0       32       -7.981267  hypothetical protein
Sbal195_3493  0       31       -7.715084  hypothetical protein
Sbal195_3494  1       31       -8.858638  hypothetical protein
Sbal195_3495  1       27       -7.859903  putative phage repressor
Sbal195_3496  0       29       -7.945129  hypothetical protein
Sbal195_3497  -1      24       -6.192437  hypothetical protein
Sbal195_3498  3       27       3.810594   hypothetical protein
Sbal195_3499  3       28       3.392559   hypothetical protein
Sbal195_3500  3       28       2.870788   hypothetical protein
Sbal195_3501  3       26       4.098300   hypothetical protein
Sbal195_3502  2       26       4.764697   hypothetical protein
Sbal195_3503  1       29       5.063454   hypothetical protein
41  Sbal195_3516  Sbal195_3533  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_3516  2       16       2.789021   putative lipoprotein
Sbal195_3517  1       14       1.853675   hypothetical protein
Sbal195_3518  2       15       1.652866   hypothetical protein
Sbal195_3519  1       14       1.920427   endonuclease/exonuclease/phosphatase
Sbal195_3520  0       16       2.292349   hypothetical protein
Sbal195_3521  0       17       2.124946   magnesium transporter
Sbal195_3522  2       17       1.876848   methyl-accepting chemotaxis sensory transducer
Sbal195_3523  2       21       2.203459   extracellular solute-binding protein, family 3
Sbal195_3524  2       21       2.775824   pseudouridine synthase
Sbal195_3525  2       21       2.536198   carbamoyl phosphate synthase large subunit
Sbal195_3526  -1      15       2.793062   carbamoyl phosphate synthase small subunit
Sbal195_3527  1       17       3.072744   dihydrodipicolinate reductase
Sbal195_3528  1       15       0.508123   FKBP-type peptidylprolyl isomerase
Sbal195_3529  2       17       -2.315133  peptidase M48 Ste24p
Sbal195_3530  2       18       -4.700075  N-acetyltransferase GCN5
Sbal195_3531  2       16       -3.601552  DEAD/DEAH box helicase
Sbal195_3532  3       23       -4.735784  hypothetical protein
Sbal195_3533  1       18       -4.147304  sugar-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3522  FbpA_PF05833  29     0.025    Fibronectin-binding protein
		>FbpA_PF05833#Fibronectin-binding protein

Length = 577

Score = 29.5 bits (66), Expect = 0.025
Identities = 10/64 (15%), Positives = 23/64 (35%), Gaps = 8/64 (12%)

Query: 6 QLKAENKALKERLIQLEQQRQNEIDELRSMIRENETMQQQSRQNADHYTEVIACQNQGGD 65
K ++ LK + L++ N I+ + ++ + D + G+
Sbjct: 289 YAKDKSDRLKSKSSDLQKIVMNNINRCTKKDKILNNTLKKCE-DKDIF-------KLYGE 340

Query: 66 MLNA 69
+L A
Sbjct: 341 LLTA 344


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3528  INFPOTNTIATR  145    1e-45    Macrophage infectivity potentiator signature.
		>INFPOTNTIATR#Macrophage infectivity potentiator signature.

Length = 233

Score = 145 bits (366), Expect = 1e-45
Identities = 76/203 (37%), Positives = 116/203 (57%), Gaps = 5/203 (2%)

Query: 6 STVEQQASYGVGRQMGEQLAANSFDGVDIPAVQAGLADAFAGLESAVS---MQDLQVAFT 62
+T + + SY +G +G+ D ++ + G+ D +G + ++ M+D+ F
Sbjct: 28 TTDKDKLSYSIGADLGKNFKNQGID-INPDVLAKGMQDGMSGAQLILTEEQMKDVLSKFQ 86

Query: 63 -EISGRIQAAQEQAAAAASAEGDAFLAENAKRDGVTVTDSGLQFEVLVQGDGATPTYEDT 121
++ + A + A A+GDAFL+ N + G+ V SGLQ++++ G GA P DT
Sbjct: 87 KDLMAKRSAEFNKKAEENKAKGDAFLSANKSKPGIVVLPSGLQYKIIDAGTGAKPGKSDT 146

Query: 122 VRTHYHGSFINGDVFDSSVVRGQPAEFPVSGVIAGWTEALQLMPVGTKLKLFVPHHLAYG 181
V Y G+ I+G VFDS+ G+PA F VS VI GWTEALQLMP G+ ++FVP LAYG
Sbjct: 147 VTVEYTGTLIDGTVFDSTEKAGKPATFQVSQVIPGWTEALQLMPAGSTWEVFVPADLAYG 206

Query: 182 ERGAGASIPPYSTLVFEVELLDI 204
R G I P TL+F++ L+ +
Sbjct: 207 PRSVGGPIGPNETLIFKIHLISV 229


42  Sbal195_3558  Sbal195_3570  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_3558  3       33       -1.445396  Na(+)-translocating NADH-quinone reductase
Sbal195_3559  1       18       -0.621537  Na(+)-translocating NADH-quinone reductase
Sbal195_3560  0       16       -0.663277  Na(+)-translocating NADH-quinone reductase
Sbal195_3561  -1      12       -0.088118  Na(+)-translocating NADH-quinone reductase
Sbal195_3562  -1      12       0.217799   Na(+)-translocating NADH-quinone reductase
Sbal195_3563  -2      12       0.028326   Na(+)-translocating NADH-quinone reductase
Sbal195_3564  -2      12       0.650136   TonB-dependent receptor
Sbal195_3565  2       12       0.423742   S-ribosylhomocysteinase
Sbal195_3566  2       13       1.058055   TRAP dicarboxylate transporter subunit DctP
Sbal195_3567  2       15       1.235089   BolA family protein
Sbal195_3568  2       15       1.379960   2OG-Fe(II) oxygenase
Sbal195_3569  2       15       1.575310   rRNA (guanine-N(2)-)-methyltransferase
Sbal195_3570  3       17       1.682914   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
Sbal195_3565  LUXSPROTEIN  271    6e-97    Bacterial autoinducer-2 (AI-2) production protein Lu...
		>LUXSPROTEIN#Bacterial autoinducer-2 (AI-2) production protein LuxS signature.

Length = 171

Score = 271 bits (695), Expect = 6e-97
Identities = 131/168 (77%), Positives = 150/168 (89%)

Query: 2 PLLDSFTVDHTRMNAPAVRVAKHMSTPKGDAITVFDLRFCAPNKDILSERGIHTLEHLFA 61
PLLDSFTVDHTRMNAPAVRVAK M TPKGD ITVFDLRF APNKDILSE+GIHTLEHL+A
Sbjct: 1 PLLDSFTVDHTRMNAPAVRVAKTMQTPKGDTITVFDLRFTAPNKDILSEKGIHTLEHLYA 60

Query: 62 GFMRDHLNGSNVEIIDISPMGCRTGFYMSLIGEPTERQVADAWLAAMEDVLKVVEQSEIP 121
GFMR+HLNG +VEIIDISPMGCRTGFYMSLIG P+E+QVADAW+AAMEDVLKV Q++IP
Sbjct: 61 GFMRNHLNGDSVEIIDISPMGCRTGFYMSLIGTPSEQQVADAWIAAMEDVLKVENQNKIP 120

Query: 122 ELNEYQCGTYEMHSLEQAQDIARNIIAAGVSVNRNDDLKLSDEILGNL 169
ELNEYQCGT MHSL++A+ IA+NI+ GV+VN+ND+L L + +L L
Sbjct: 121 ELNEYQCGTAAMHSLDEAKQIAKNILEVGVAVNKNDELALPESMLREL 168


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
Sbal195_3570  FLGHOOKFLIK  33     0.009    Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 32.5 bits (73), Expect = 0.009
Identities = 34/166 (20%), Positives = 62/166 (37%), Gaps = 15/166 (9%)

Query: 5 DDVAQLKAELAQLQSLHLSQQFS---LSRQLAEFSTKLDTLSQQIATEDASDTTVSMAAV 61
D A L A A L + + + + E T L+ + T D A
Sbjct: 126 DVTASLSALFAMLPGFDNTPKVTDAPSTVLPTEKPTLFTKLTSEQLTTAQPDDAPGTPAQ 185

Query: 62 SMTAGAASIAAVVPATDNAPTLTYAIHTPILESAPVAPVPVEPNPWQQNAVQGDPWQRNT 121
+T A + + P+ A +P++ P+P P + WQ+
Sbjct: 186 PLTPLVAEAQSKA-EVISTPSPVTAAASPLITPHQTQPLPTVAAPVLSAPLGSHEWQQ-- 242

Query: 122 KNTSAEQVAKTEYQAQGQQLSDEVKLQ----ASVQVASQFDDLLSQ 163
+ ++ ++ + QGQQ S E++L VQ++ + DD +Q
Sbjct: 243 --SLSQHISL--FTRQGQQ-SAELRLHPQDLGEVQISLKVDDNQAQ 283


43  Sbal195_3654  Sbal195_3664  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias    %GCBias  Product
Sbal195_3654       2       18   1.868732  citrate transporter
Sbal195_3655       2       24   1.557317  hypothetical protein
Sbal195_3656       2       24   2.056452  fructose-1,6-bisphosphate aldolase
Sbal195_3657       0       19   1.439207  phosphoglycerate kinase
Sbal195_3658       0       18  -0.428271  erythrose 4-phosphate dehydrogenase
Sbal195_3659       1       23  -4.186385  transketolase
Sbal195_3660       3       23  -6.132294  S-adenosylmethionine synthetase
Sbal195_3661       4       27  -7.351894  hypothetical protein
Sbal195_3662       3       24  -6.819221  RNA-directed DNA polymerase
Sbal195_3663       2       22  -6.709044  hypothetical protein
Sbal195_3664       2       21  -6.040211  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3658  BINARYTOXINB  29     0.027    Binary toxin B family signature.
		>BINARYTOXINB#Binary toxin B family signature.

Length = 764

Score = 29.3 bits (65), Expect = 0.027
Identities = 18/67 (26%), Positives = 30/67 (44%), Gaps = 1/67 (1%)

Query: 248 FEAISVRVPTINVTAIDLSVTLEKTVDIATVNQVLESA-ANGRFNGILGYTDEPLVSCDF 306
F + + + A++ S LE T T+ + L+ A NG L Y + + DF
Sbjct: 522 FNGKDLNLVERRIAAVNPSDPLETTKPDMTLKEALKIAFGFNEPNGNLQYQGKDITEFDF 581

Query: 307 NHDPRSS 313
N D ++S
Sbjct: 582 NFDQQTS 588


44  Sbal195_3793  Sbal195_3802  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias    %GCBias  Product
Sbal195_3793      -1       25  -6.883536  short chain dehydrogenase
Sbal195_3794      -1       19  -3.796867  hypothetical protein
Sbal195_3795      -1       20  -3.486742  tRNA synthetase class II
Sbal195_3796       0       23  -2.827677  extracellular solute-binding protein
Sbal195_3797       0       23  -1.692983  hypothetical protein
Sbal195_3798       0       19  -0.770128  NACHT family-like NTPase
Sbal195_3799       1       23   3.662232  glycine dehydrogenase
Sbal195_3800       1       18   3.945758  glycine cleavage system protein H
Sbal195_3801       2       16   3.903285  glycine cleavage system aminomethyltransferase
Sbal195_3802       3       17   3.807262  UbiH/UbiF/VisC/COQ6 family ubiquinone
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3793  NUCEPIMERASE  28     0.024    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 28.2 bits (63), Expect = 0.024
Identities = 10/30 (33%), Positives = 17/30 (56%)

Query: 1 MKIVVVGASGTIGQAIVRLFHSTQHEVIQV 30
MK +V GA+G IG + + H+V+ +
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGI 30


45  Sbal195_3841  Sbal195_3865  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias    %GCBias  Product
Sbal195_3841      -3       11  -3.094928  hypothetical protein
Sbal195_3842      -3       10  -3.239604  two-component response regulator
Sbal195_3843      -2       12  -3.412652  peptidase S9B dipeptidylpeptidase IV subunit
Sbal195_3844       1       20  -5.806761  fructose-1,6-bisphosphatase
Sbal195_3845       2       22  -7.025535  NERD domain-containing protein
Sbal195_3846       1       21  -6.582428  hypothetical protein
Sbal195_3847       0       16  -3.191303  transposase IS116/IS110/IS902 family protein
Sbal195_3848      -1       15  -1.200858  transposase IS116/IS110/IS902 family protein
Sbal195_3849      -2       16  -0.690080  transport-associated
Sbal195_3850      -1       18   0.694796  hypothetical protein
Sbal195_3851       0       18   1.393089  hypothetical protein
Sbal195_3852       2       20   2.737693  hypothetical protein
Sbal195_3853       3       21   3.049472  sodium:dicarboxylate symporter
Sbal195_3854       3       20   3.284472  hypothetical protein
Sbal195_3855       2       21   3.792678  hypothetical protein
Sbal195_3856       1       20   3.890051  hypothetical protein
Sbal195_3857       0       18   4.121828  hypothetical protein
Sbal195_3858      -1       15   3.564169  two component LuxR family transcriptional
Sbal195_3859      -1       15   3.094523  histidine kinase
Sbal195_3860       0       17   2.855110  4Fe-4S ferredoxin
Sbal195_3861       0       17   2.391500  polysulfide reductase NrfD
Sbal195_3862       1       16   1.694876  molydopterin dinucleotide-binding region
Sbal195_3863       1       14   0.647767  methyl-accepting chemotaxis sensory transducer
Sbal195_3864       0       16   2.980320  hypothetical protein
Sbal195_3865      -1       17   3.101949  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Sbal195_3842  HTHFIS  83     1e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 83.3 bits (206), Expect = 1e-20
Identities = 31/131 (23%), Positives = 61/131 (46%), Gaps = 1/131 (0%)

Query: 1 MQNPHILIVEDEAVTRNTLRSIFEAEGYVVTEANDGAEMHKAMQENKINLVVMDINLPGK 60
M IL+ +D+A R L GY V ++ A + + + +LVV D+ +P +
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 NGLLLARELREIN-NIGLIFLTGRDNEVDKILGLEIGADDYITKPFNPRELTIRARNLLT 119
N L +++ ++ ++ ++ ++ + I E GA DY+ KPF+ EL L
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 120 RVNSAGAEVEE 130
+++E+
Sbjct: 121 EPKRRPSKLED 131


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3849  SECETRNLCASE  25     0.047    Bacterial translocase SecE signature.
		>SECETRNLCASE#Bacterial translocase SecE signature.

Length = 127

Score = 25.2 bits (55), Expect = 0.047
Identities = 11/25 (44%), Positives = 15/25 (60%)

Query: 2 QTMKLTFVTAGVVLVGSLLLNGCDG 26
+T+ T + A V V SL+L G DG
Sbjct: 89 ETLHTTLIVAAVTAVMSLILWGLDG 113


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
Sbal195_3857  GPOSANCHOR  54     3e-09    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 53.9 bits (129), Expect = 3e-09
Identities = 56/320 (17%), Positives = 103/320 (32%), Gaps = 18/320 (5%)

Query: 599 EYAASEQELRIRLSKAEEAHTSAQEMQAEAESQLVAINGELDNLSRELTFARTAYKNSRD 658
EL LS A+E + +E S++ + +L + L A
Sbjct: 82 ALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSA 141

Query: 659 DLRRLFDEKRSEQDKINKALSERKAQAGQRLTQLDGELKQLKHQHELWLEEQKEQALEAR 718
++ L EK + KA G K + E E ++ LE
Sbjct: 142 KIKTLEAEK---AALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKA 198

Query: 719 MEKQAYWQEVIGALDNQLGQIKATIEGRRESAKIELKACETWYKNELKSRGVDEENILKL 778
+E + A L KA + R+ + L+ + + E L
Sbjct: 199 LEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAAL 258

Query: 779 KQQIRELETKISRAEQRRSDVLRFDDWYQHTWLMRKPKLQTQLADVKR----------AV 828
+ + ELE + A + T K L+ + AD++ ++
Sbjct: 259 EARQAELEKALEGAMNFSTADSA----KIKTLEAEKAALEAEKADLEHQSQVLNANRQSL 314

Query: 829 SEIDQQLKAKTLDVKTRRQQLETERKASDAAQVEASENLTKLRAVMRKLAELKLPTNNEE 888
+ ++ Q+LE + K S+A++ +L R ++L E + E+
Sbjct: 315 RRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQL-EAEHQKLEEQ 373

Query: 889 AQGSLGERLRQGEDLLLKRD 908
+ S R DL R+
Sbjct: 374 NKISEASRQSLRRDLDASRE 393



Score = 33.5 bits (76), Expect = 0.006
Identities = 48/347 (13%), Positives = 114/347 (32%), Gaps = 28/347 (8%)

Query: 360 WRTDVENLSERHKLQTEKHQDIEAAYNARRSKIGEQLNRELEGLHADQDKQREARDKQRE 419
+ + + K+ D+ A + ++L EL + + R +
Sbjct: 55 VQERADKFEIENNTLKLKNSDLSFNNKALKDHN-DELTEELSNA------KEKLRKNDKS 107

Query: 420 VARTDIDALELQWRNQMDAGKASFSEQEYQFKLTAAELKLRVDGVTYTEEEKLSLAIFDE 479
++ EL+ R D KA + +A L + + ++
Sbjct: 108 LSEKASKIQELEARKA-DLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADL----EK 162

Query: 480 RIHRADEEQESCNAKVERLSSDERKLRAKRDQANEALRIATLRVNERQTALDELHHMLFP 539
+ A + +AK++ L +++ L A++ + +AL A + L
Sbjct: 163 ALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAA 222

Query: 540 QSHTL--LEFLRKEAQGWEQSLGKVIAPELLHRTDLHPSLVSESSEAFFGVHLDLKAIDV 597
+ LE + A + + I + L +
Sbjct: 223 LAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKA-------------L 269

Query: 598 PEYAASEQELRIRLSKAEEAHTSAQEMQAEAESQLVAINGELDNLSRELTFARTAYKNSR 657
++ E + + +A+ E Q +N +L R+L +R A K
Sbjct: 270 EGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLE 329

Query: 658 DDLRRLFDEKRSEQDKINKALSERKAQAGQRLTQLDGELKQLKHQHE 704
+ ++L ++ + + ++L + + QL+ E ++L+ Q++
Sbjct: 330 AEHQKLEEQNKI-SEASRQSLRRDLDASREAKKQLEAEHQKLEEQNK 375


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Sbal195_3858  HTHFIS  89     5e-23    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 89.1 bits (221), Expect = 5e-23
Identities = 27/118 (22%), Positives = 48/118 (40%)

Query: 7 VYLIDDDDSVRRSLRFMLESYGLKITDFDSAEAFFTAVDLTLPGCALVDVRMPGLSGPQL 66
+ + DDD ++R L L G + +A + + + DV MP + L
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 67 HLELVSKNSPLAVIYLTGHGDVPMAVEALKLGAVDFFQKPADGAKLAEAVVKALEHAK 124
+ L V+ ++ A++A + GA D+ KP D +L + +AL K
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123


46  Sbal195_3884  Sbal195_3889  Y  N  N  Genomic Island
LocusTag      DNBias  CDNBias    %GCBias  Product
Sbal195_3884       3       27  -1.262846  glutamine amidotransferase of anthranilate
Sbal195_3885       4       31  -2.258168  ClpXP protease specificity-enhancing factor
Sbal195_3886       4       29  -1.942095  stringent starvation protein A
Sbal195_3887       4       25  -1.633888  cytochrome c1
Sbal195_3888       4       26  -1.342023  cytochrome b/b6 domain-containing protein
Sbal195_3889       3       21  -0.594131  ubiquinol-cytochrome c reductase, iron-sulfur
47  Sbal195_3968  Sbal195_3981  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias    %GCBias  Product
Sbal195_3968      -2       20   4.078803  EmrB/QacA family drug resistance transporter
Sbal195_3969      -2       20   4.566124  secretion protein HlyD family protein
Sbal195_3970       0       20   4.906576  LysR family transcriptional regulator
Sbal195_3971       0       21   5.085254  large-conductance mechanosensitive channel
Sbal195_3972       1       22   5.233429  antibiotic biosynthesis monooxygenase
Sbal195_3973       0       22   5.288625  CzcA family heavy metal efflux protein
Sbal195_3974       3       20   3.852770  RND family efflux transporter MFP subunit
Sbal195_3975       3       20   2.560223  outer membrane efflux protein
Sbal195_3976       1       19   0.851287  hypothetical protein
Sbal195_3977       1       19   2.458636  hypothetical protein
Sbal195_3978       0       20   3.677535  hypothetical protein
Sbal195_3979      -1       16   4.256554  peptidyl-tRNA hydrolase domain-containing
Sbal195_3980      -1       17   4.236011  3-dehydroquinate dehydratase
Sbal195_3981       0       18   4.080504  acetyl-CoA carboxylase, biotin carboxyl carrier
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Sbal195_3968  TCRTETB  130    7e-35    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 130 bits (328), Expect = 7e-35
Identities = 90/421 (21%), Positives = 176/421 (41%), Gaps = 19/421 (4%)

Query: 25 TDYERGSRRSWIAVFGGLIGAFMAILDIQITNASMKEIQGSLGATLEEGSWISTAYLVAE 84
T Y + + R + I +F ++L+ + N S+ +I +W++TA+++
Sbjct: 3 TSYSQSNLRHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTF 62

Query: 85 MIAIPLSGWLSTGLSVRRYLLWTTAAFIFASVLCSMAWN-LEAMIAFRALQGFFGGALIP 143
I + G LS L ++R LL+ F SV+ + + +I R +QG A
Sbjct: 63 SIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPA 122

Query: 144 LAFRLILEFLPDNKRAVGMALFGVTATFAPSIGPTLGGWLTEQFSWHYLFYINVPPGLLV 203
L ++ ++P R L G +GP +GG + W YL +P ++
Sbjct: 123 LVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLL--LIPMITII 180

Query: 204 MAMLAYGLEKQSVVWDKLKNVDLAGIVTMALGMGCLEVVLEEGNRKDWFGSELIRNLAII 263
L K+ V + D+ GI+ M++G+ + F + + I+
Sbjct: 181 TVPFLMKLLKKEVRIKG--HFDIKGIILMSVGIVFFML----------FTTSYSISFLIV 228

Query: 264 AVVNLVLFVWIQLRRKEPLVNLRLLGKRDFVLSTVAYFLLGMALFGAIYLIPLYLSQVHD 323
+V++ ++FV + +P V+ L F++ + ++ + G + ++P + VH
Sbjct: 229 SVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQ 288

Query: 324 YTPLEIGGVIMWMGFPQLLVL-PLVPKLMERFDSRYLAAFGFLMFAISYYMNSQMTADYA 382
+ EIG VI++ G +++ + L++R Y+ G ++S+ S +
Sbjct: 289 LSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFL--LET 346

Query: 383 GPQMIASQVVRALG-QPFILVPIGMLATMHLKPHENASASTVLNVMRNLGGAFGIALVAT 441
+ +V LG F I + + LK E + ++LN L GIA+V
Sbjct: 347 TSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGG 406

Query: 442 L 442
L
Sbjct: 407 L 407


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Sbal195_3969  RTXTOXIND  99     6e-25    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 98.7 bits (246), Expect = 6e-25
Identities = 42/294 (14%), Positives = 96/294 (32%), Gaps = 28/294 (9%)

Query: 71 LAQLEDNQFSAKVSQAEASLASSKADLQTLAAKVELQRALITQASAGVVAAESDKIRAQQ 130
+ + + S + ++ + ++ +RA A + E+ +
Sbjct: 176 FQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKS 235

Query: 131 QLSRSKKLKVSNYSSQDDVDQLQAGFDSAAARLDEAKA--------VLVAKQRELAVFN- 181
+L L ++ V + + + A L K+ +L AK+ V
Sbjct: 236 RLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQL 295

Query: 182 ------AQLDQAGSVVEQADATLELAKIQLNDTRVTAPFSGVIGKRGAM-VGQYVQPGQA 234
+L Q + L + + + + AP S + + G V +
Sbjct: 296 FKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET 355

Query: 235 LYSLVPDGAV-WITANFKETQIQHMQPGQSVQVSLDAFPDKTFIGVIDSLSPASGAKFSL 293
L +VP+ +TA + I + GQ+ + ++AFP + G + K
Sbjct: 356 LMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRY-GYLV-------GKVKN 407

Query: 294 LPAENATGNFTKIVQRIPVRIRLDLSEAEAHML---PGLSAVVKVDTASGTAIS 344
+ + +V + + I + + G++ ++ T + IS
Sbjct: 408 INLDAIEDQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVIS 461


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
Sbal195_3971  MECHCHANNEL  170    8e-58    Bacterial mechano-sensitive ion channel signature.
		>MECHCHANNEL#Bacterial mechano-sensitive ion channel signature.

Length = 136

Score = 170 bits (431), Expect = 8e-58
Identities = 85/136 (62%), Positives = 110/136 (80%), Gaps = 1/136 (0%)

Query: 1 MSLIKEFKAFASRGNVIDMAVGIIIGAAFGKIVSSFVADIIMPPIGIILGGVNFSDLSIV 60
MS+IKEF+ FA RGNV+D+AVG+IIGAAFGKIVSS VADIIMPP+G+++GG++F ++
Sbjct: 1 MSIIKEFREFAMRGNVVDLAVGVIIGAAFGKIVSSLVADIIMPPLGLLIGGIDFKQFAVT 60

Query: 61 LQAAQGDAPSVVIAYGKFIQTIIDFTIIAFAIFMGVKAINRLKRKEEVAPKAPAAPTKDQ 120
L+ AQGD P+VV+ YG FIQ + DF I+AFAIFM +K IN+L RK+E P A APTK++
Sbjct: 61 LRDAQGDIPAVVMHYGVFIQNVFDFLIVAFAIFMAIKLINKLNRKKE-EPAAAPAPTKEE 119

Query: 121 ELLSEIRDLLKAQQEK 136
LL+EIRDLLK Q +
Sbjct: 120 VLLTEIRDLLKEQNNR 135


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3973  ACRIFLAVINRP  662    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 662 bits (1709), Expect = 0.0
Identities = 225/1075 (20%), Positives = 434/1075 (40%), Gaps = 73/1075 (6%)

Query: 9 AIKNRLLVVLALLAMIVASVVMLPKLNLDAFPDVTNVQVTINTAAEGLAAEEVEKLISYP 68
I+ + + + +++A + + +L + +P + V+++ G A+ V+ ++
Sbjct: 5 FIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQV 64

Query: 69 VESAMYALPAVTEVRSLS-RTGLSIVTVVFAEGTDIYFARQQVFEQLQAAREMIPSGVGV 127
+E M + + + S S G +T+ F GTD A+ QV +LQ A ++P V
Sbjct: 65 IEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQQ 124

Query: 128 PEIGPNTSGLGQIYQYILRAEPNSGIDAAELRSLNDYLVKLIMMPVGGVTEVLSFGGDVR 187
I S + ++ N G ++ VK + + GV +V FG
Sbjct: 125 QGISVEKSSSSYLMVAGFVSD-NPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQ-Y 182

Query: 188 QYQVQVDPNKLRAYGLSMAQVTEALESNNRNAGGWFMDQGQE------QLVVRGYGMLPA 241
++ +D + L Y L+ V L+ N + +
Sbjct: 183 AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKN 242

Query: 242 GEEGLAAIAQIPLTEDK-GTPVRVGDIAQVDFGSEIRVGAVTMTRRDEAGNVQNLGEVVA 300
EE ++ L + G+ VR+ D+A+V+ G E + N
Sbjct: 243 PEE----FGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARI----------NGKPAAG 288

Query: 301 GVVLKRMGANTKATIDDIGARVSLIEQALPDGVSFEVFYDQAELVDKAVTTVRDALLMAF 360
+ GAN T I A+++ ++ P G+ YD V ++ V L A
Sbjct: 289 LGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAI 348

Query: 361 VFIVVILALFLVNIRATLLVLLSIPVSIGLALMVMSYYGLSANLMSLGGLAVAIGMLVDG 420
+ + +++ LFL N+RATL+ +++PV + +++ +G S N +++ G+ +AIG+LVD
Sbjct: 349 MLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDD 408

Query: 421 SVVMVENIFKHLTQPDRRHLLEARTRADGEADPYHSDEDGGQQANMAVRIMLAAKEVCSP 480
++V+VEN+ + + ED + M ++
Sbjct: 409 AIVVVENVERVMM------------------------EDKLPPKEATEKSM---SQIQGA 441

Query: 481 IFFATAIIIVVFAPLFALEGVEGKLFQPMAVSIILAMISALLVALIAVPALAVYLFK--- 537
+ ++ VF P+ G G +++ +++I+ AM ++LVALI PAL L K
Sbjct: 442 LVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVS 501

Query: 538 -------RGVVLKQSVVLAPLDAAYRKLLTATLARPKVVMLSALLMFALSLLLLPRLGTE 590
G + Y + L +L L+ A ++L RL +
Sbjct: 502 AEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSS 561

Query: 591 FVPELEEGTINLRVTLAPTASLGTSLAVAPKLEAILLEFPEVEYALSRIGAPELGGDPEP 650
F+PE ++G + L A+ + V ++ L+ + S +
Sbjct: 562 FLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNEKANVE-SVFTVNGFSFSGQA 620

Query: 651 VSNIEVYIGLKPISEWQSASSRLE--LQRLMEEKLSVFPGLLLTFSQPIATRVDELLSGV 708
+ ++ LKP E + E + R E + G ++ F+ P + EL +
Sbjct: 621 QNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGFVIPFNMP---AIVELGTAT 677

Query: 709 KAQLA-IKIFGPDLAVLSERGQALTDLVAKIPGAV-DVSLEQVSGEAQLVVRPKRELLAR 766
I G L++ L + A+ P ++ V + AQ + +E
Sbjct: 678 GFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQA 737

Query: 767 YGISVDQVMSLVSQGIGGASAGQVIDGNARYDINVRLAAEFRTSPDAIKDLLLSGTNGAT 826
G+S+ + +S +GG ID + V+ A+FR P+ + L + NG
Sbjct: 738 LGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEM 797

Query: 827 VRLGEVASVEVEMAPPNIRRDDVQRRVVVQANVA-GRDMGSVVKDIYALVPQADLPAGYT 885
V + P + R + + +Q A G G + + L + LPAG
Sbjct: 798 VPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAMALMENLASK--LPAGIG 855

Query: 886 VIIGGQYENQQRAQQKLMLVVPISIALIALLLYFSFGSFKQVLLIMANVPLALIGGIVAL 945
G ++ + + +V IS ++ L L + S+ + +M VPL ++G ++A
Sbjct: 856 YDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAA 915

Query: 946 YVSGTYLSVPSSIGFITLFGVAVLNGVVLVDSINQ-RRQSGEALYDCVYEGTVGRLRPVL 1004
+ V +G +T G++ N +++V+ + G+ + + RLRP+L
Sbjct: 916 TLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPIL 975

Query: 1005 MTALTSALGLIPILLSSGVGSEIQKPLAVVIIGGLFSSTALTLLVLPTLYRWLYR 1059
MT+L LG++P+ +S+G GS Q + + ++GG+ S+T L + +P + + R
Sbjct: 976 MTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRR 1030



Score = 113 bits (284), Expect = 2e-27
Identities = 81/544 (14%), Positives = 185/544 (34%), Gaps = 61/544 (11%)

Query: 10 IKNRLLVVLALLAMIVASVVMLPKLNLDAFPDVTNVQVTIN-TAAEGLAAEEVEKLI--- 65
+ + +L ++ VV+ +L P+ G E +K++
Sbjct: 534 LGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQV 593

Query: 66 -SYPVESAMYALPAVTEVRSLSRTGLS----IVTVVFAEGTDIYFARQQVFEQLQAAREM 120
Y +++ + +V V S +G + + V + + A+
Sbjct: 594 TDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKME 653

Query: 121 ---IPSGVGVPEIGPNTSGLGQIYQYILRAEPNSGIDAAELRSLNDYLVKLIMMPVGGVT 177
I G +P P LG + +G+ L + L+ + +
Sbjct: 654 LGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLV 713

Query: 178 EV-LSFGGDVRQYQVQVDPNKLRAYGLSMAQVTEALES--NNRNAGGWFMDQGQEQLVVR 234
V + D Q++++VD K +A G+S++ + + + + + ++L V+
Sbjct: 714 SVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQ 773

Query: 235 GYGMLPAGEEGLAAIAQIPLTEDKGTPVRVGDIAQVDFGSEIRVGAVTMTRRDEAGNVQN 294
+ ++ + G V + G+ + R + +++
Sbjct: 774 AD---AKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVY----GSPRLERYNGLPSMEI 826

Query: 295 LGEVVAGVVLKRMGANTKATIDDIGARVSLIEQALPDGVSFEVFYDQAELVDKAVTTVRD 354
GE G D A + + LP G+ ++ + + +
Sbjct: 827 QGEAAPGTSS-----------GDAMALMENLASKLPAGIGYD-WTGMSYQERLSGNQAPA 874

Query: 355 ALLMAFVFIVVILALFLVNIRATLLVLLSIPVSIGLALMVMSYYGLSANLMSLGGLAVAI 414
+ ++FV + + LA + + V+L +P+ I L+ + + ++ + GL I
Sbjct: 875 LVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTI 934

Query: 415 GMLVDGSVVMVENIFKHLTQPDRRHLLEARTRADGEADPYHSDEDGGQQANMAVRIMLAA 474
G+ ++++VE + L+E + EA ++A
Sbjct: 935 GLSAKNAILIVEFA---------KDLMEKEGKGVVEA------------------TLMAV 967

Query: 475 KEVCSPIFFATAIIIVVFAPLFALEGVEGKLFQPMAVSIILAMISALLVALIAVPALAVY 534
+ PI + I+ PL G + + ++ M+SA L+A+ VP V
Sbjct: 968 RMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVV 1027

Query: 535 LFKR 538
+ +
Sbjct: 1028 IRRC 1031



Score = 102 bits (257), Expect = 2e-24
Identities = 89/515 (17%), Positives = 190/515 (36%), Gaps = 36/515 (6%)

Query: 565 RPKVVMLSALLMFALSLLLLPRLGTEFVPELEEGTINLRVTLAPTASLGT-SLAVAPKLE 623
RP + A+++ L + +L P + +++ P A T V +E
Sbjct: 8 RPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSAN-YPGADAQTVQDTVTQVIE 66

Query: 624 AILLEFPEVEYALSRIGAPELGGDPEPVSNIEVYIGLKPISEWQSASSRLELQRLMEEKL 683
+ + Y S + ++ + + + ++ A Q ++ KL
Sbjct: 67 QNMNGIDNLMYMSST---------SDSAGSVTITLTFQSGTDPDIA------QVQVQNKL 111

Query: 684 SVFPGLLLTFSQPIATRVDELLSGVKAQLAIKIFGPDLAVLSERGQALT---DLVAKIPG 740
+ LL Q V++ S P + D ++++ G
Sbjct: 112 QLATPLLPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNG 171

Query: 741 AVDVSLEQVSGEAQLVVRPKRELLARYGISVDQVMSLVSQGIGGASAGQVIDGNA----R 796
DV L + + + +LL +Y ++ V++ + +AGQ+ A +
Sbjct: 172 VGDVQL--FGAQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQ 229

Query: 797 YDINVRLAAEFRTSPDAIKDLLLSGTNGATVRLGEVASVEVEMAPPNIR-RDDVQRRVVV 855
+ ++ F+ + K L ++G+ VRL +VA VE+ N+ R + + +
Sbjct: 230 LNASIIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGL 289

Query: 856 QANVA-GRDMGSVVKDIYALVP--QADLPAGYTVIIGGQYENQQRAQQKLMLVVP---IS 909
+A G + K I A + Q P G V+ Y+ Q + VV +
Sbjct: 290 GIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLY--PYDTTPFVQLSIHEVVKTLFEA 347

Query: 910 IALIALLLYFSFGSFKQVLLIMANVPLALIGGIVALYVSGTYLSVPSSIGFITLFGVAVL 969
I L+ L++Y + + L+ VP+ L+G L G ++ + G + G+ V
Sbjct: 348 IMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVD 407

Query: 970 NGVVLVDSINQRRQS-GEALYDCVYEGTVGRLRPVLMTALTSALGLIPILLSSGVGSEIQ 1028
+ +V+V+++ + + + ++ A+ + IP+ G I
Sbjct: 408 DAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIY 467

Query: 1029 KPLAVVIIGGLFSSTALTLLVLPTLYRWLYRGDKR 1063
+ ++ I+ + S + L++ P L L +
Sbjct: 468 RQFSITIVSAMALSVLVALILTPALCATLLKPVSA 502


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Sbal195_3974  RTXTOXIND  53     1e-09    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 52.9 bits (127), Expect = 1e-09
Identities = 36/182 (19%), Positives = 64/182 (35%), Gaps = 22/182 (12%)

Query: 126 RATATLVVDRDRTATLAPQLDARVLARHVVPGQEVKKGEPLLTLGGAAVAQAQADYINAA 185
R V++ R + L + +A+H V QE + +N
Sbjct: 225 RYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQE----------------NKYVEAVNEL 268

Query: 186 AEWSRVKRMSEGAVSVSRRMQAQVDAELKRAILEAIKMTPTQIRALE----STPEAIGSY 241
+ E + ++ V K IL+ ++ T I L E +
Sbjct: 269 RVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQAS 328

Query: 242 QLLAPIDGRVQQ-DIAMLGQVFSAGTPLMQLT-DESYLWVEAQLTPTQTAHITVGSAALV 299
+ AP+ +VQQ + G V + LM + ++ L V A + I VG A++
Sbjct: 329 VIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAII 388

Query: 300 QV 301
+V
Sbjct: 389 KV 390



Score = 41.4 bits (97), Expect = 5e-06
Identities = 26/148 (17%), Positives = 55/148 (37%), Gaps = 5/148 (3%)

Query: 118 IANLNLDIRATATLVVDRDRTATLAPQLDARVLARHVVPGQEVKKGEPLLTLGG----AA 173
+ + + A L R+ + P ++ V V G+ V+KG+ LL L A
Sbjct: 77 LGQVEIVATANGKLTHS-GRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEAD 135

Query: 174 VAQAQADYINAAAEWSRVKRMSEGAVSVSRRMQAQVDAELKRAILEAIKMTPTQIRALES 233
+ Q+ + A E +R + +S D + + E + T + +
Sbjct: 136 TLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQF 195

Query: 234 TPEAIGSYQLLAPIDGRVQQDIAMLGQV 261
+ YQ +D + + + +L ++
Sbjct: 196 STWQNQKYQKELNLDKKRAERLTVLARI 223


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Sbal195_3981  RTXTOXIND  28     0.015    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 27.9 bits (62), Expect = 0.015
Identities = 8/29 (27%), Positives = 13/29 (44%)

Query: 120 IQAERDGVVSAIWAKDGDEVAFDQPLFTL 148
I+ + +V I K+G+ V L L
Sbjct: 99 IKPIENSIVKEIIVKEGESVRKGDVLLKL 127


48  Sbal195_3997  Sbal195_4043  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias    %GCBias  Product
Sbal195_3997       2       14  -1.904124  polysulfide reductase NrfD
Sbal195_3998       2       14  -2.241724  4Fe-4S ferredoxin
Sbal195_3999       1       16  -2.624220  formate-dependent nitrite reductase NrfG
Sbal195_4000       1       18  -2.629900  FKBP-type peptidylprolyl isomerase
Sbal195_4001       1       18  -2.690367  rhodanese domain-containing protein
Sbal195_4002       1       17  -2.668266  cytochrome c
Sbal195_4003       2       17  -1.827672  cytochrome c nitrate reductase biogenesis
Sbal195_4004       0       14  -1.238034  cytochrome C biogenesis protein
Sbal195_4005      -1       14   0.706413  alkyl hydroperoxide reductase
Sbal195_4006       0       16   1.293567  hypothetical protein
Sbal195_4007       1       18   2.019540  putative mitomycin resistance protein
Sbal195_4008       1       17   2.319216  hypothetical protein
Sbal195_4009       1       18   1.452899  redoxin domain-containing protein
Sbal195_4010       2       14   2.485274  2-nitropropane dioxygenase
Sbal195_4011       1       13   0.755173  glyoxalase/bleomycin resistance
Sbal195_4012       0       11   0.226156  hypothetical protein
Sbal195_4013       0       12   0.275273  periplasmic-binding protein/LacI transcriptional
Sbal195_4014       0       13   0.633939  4-hydroxybenzoate octaprenyltransferase
Sbal195_4015      -1       15   0.905952  DNA-dependent helicase II
Sbal195_4016       1       18  -0.312942  hypothetical protein
Sbal195_4017       1       18   1.183523  hypothetical protein
Sbal195_4018       2       17   1.673828  NUDIX hydrolase
Sbal195_4019       2       16   1.473838  zinc metallopeptidase
Sbal195_4020       1       17   2.019471  hypothetical protein
Sbal195_4021       1       16   2.548272  hypothetical protein
Sbal195_4022       0       15   2.376593  hypothetical protein
Sbal195_4023      -1       16   2.262888  hypothetical protein
Sbal195_4024       0       14   2.214210  TRAP transporter solute receptor TAXI family
Sbal195_4025       0       13   1.913202  TRAP transporter, 4TM/12TM fusion protein
Sbal195_4026       2       10  -0.049568  hypothetical protein
Sbal195_4027       1       10  -0.022043  PAS/PAC sensor-containing diguanylate cyclase
Sbal195_4028       0       10   0.146739  FKBP-type peptidylprolyl isomerase
Sbal195_4029       0       10   1.207926  thioredoxin
Sbal195_4030       0       10   1.389718  anion transporter
Sbal195_4031       0       12   2.304620  major facilitator superfamily transporter
Sbal195_4032       0       16   3.445892  pseudouridine synthase
Sbal195_4033      -1       13   3.163410  hypothetical protein
Sbal195_4034       0       12   3.109789  permease
Sbal195_4035       0       12   2.886669  zinc-responsive transcriptional regulator
Sbal195_4036      -1       10   1.955918  bifunctional
Sbal195_4037      -1        9   1.373757  phosphoribosylamine--glycine ligase
Sbal195_4038      -1       10   0.735296  hypothetical protein
Sbal195_4039      -1       12  -0.824823  hypothetical protein
Sbal195_4040      -1       12  -0.735822  short chain dehydrogenase
Sbal195_4041       0       17  -0.582176  PAS/PAC sensor-containing diguanylate
Sbal195_4042       2       25  -0.036295  uroporphyrinogen decarboxylase
Sbal195_4043       3       27  -0.046237  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_4000  INFPOTNTIATR  155    4e-49    Macrophage infectivity potentiator signature.
		>INFPOTNTIATR#Macrophage infectivity potentiator signature.

Length = 233

Score = 155 bits (394), Expect = 4e-49
Identities = 89/227 (39%), Positives = 128/227 (56%), Gaps = 9/227 (3%)

Query: 21 ALFVSVTSFAAPSLKSDADKTSYSIGASIGNYISGQVYNQVELGSEVNIDLVVQGFVDAL 80
A+ ++ + A SL +D DK SYSIGA +G Q G ++N D++ +G D +
Sbjct: 14 AMSTAMAATDATSLTTDKDKLSYSIGADLGKNFKNQ-------GIDINPDVLAKGMQDGM 66

Query: 81 KDKQ-QLTDEEVVTYLNQRAEELNAARKILAEKEMAETKKASADYLAQNAKQSNVKVTAS 139
Q LT+E++ L++ ++L A R K+ E K +L+ N + + V S
Sbjct: 67 SGAQLILTEEQMKDVLSKFQKDLMAKRSAEFNKKAEENKAKGDAFLSANKSKPGIVVLPS 126

Query: 140 GLQYQVITQGSGQKPNPEDVVTVEYVGTLIDGTEFENTVGRKEHTRFALMTVIPGWEEGL 199
GLQY++I G+G KP D VTVEY GTLIDGT F++T + F + VIPGW E L
Sbjct: 127 GLQYKIIDAGTGAKPGKSDTVTVEYTGTLIDGTVFDSTEKAGKPATFQVSQVIPGWTEAL 186

Query: 200 KLMPMGSKYRFVVPASLAYGTEAV-GIIPPESALIFEIELKNIEKPS 245
+LMP GS + VPA LAYG +V G I P LIF+I L +++K +
Sbjct: 187 QLMPAGSTWEVFVPADLAYGPRSVGGPIGPNETLIFKIHLISVKKAA 233


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_4021  VACJLIPOPROT  29     0.002    VacJ lipoprotein signature.
		>VACJLIPOPROT#VacJ lipoprotein signature.

Length = 251

Score = 28.7 bits (64), Expect = 0.002
Identities = 14/35 (40%), Positives = 16/35 (45%), Gaps = 1/35 (2%)

Query: 2 MKTPLIMLVLCSTLLTSGCAELACSARTDVDPYEP 36
MK L L L +TLL GCA + DP E
Sbjct: 1 MKLRLSALALGTTLLV-GCASSGTDQQGRSDPLEG 34


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_4028  INFPOTNTIATR  69     1e-17    Macrophage infectivity potentiator signature.
		>INFPOTNTIATR#Macrophage infectivity potentiator signature.

Length = 233

Score = 68.9 bits (168), Expect = 1e-17
Identities = 38/99 (38%), Positives = 51/99 (51%), Gaps = 2/99 (2%)

Query: 9 LQVGEGKEAVKGALITTQYRGFLQDGTQFDSSYDRGQAFQCVIGTGRVIKGWDQGIMGMK 68
+ G G + K +T +Y G L DGT FDS+ G+ +VI GW + + M
Sbjct: 133 IDAGTGAKPGKSDTVTVEYTGTLIDGTVFDSTEKAGKP--ATFQVSQVIPGWTEALQLMP 190

Query: 69 VGGKRKLLVPAHLAYGERQVGAHIKPNSDLTFEIELLEV 107
G ++ VPA LAYG R VG I PN L F+I L+ V
Sbjct: 191 AGSTWEVFVPADLAYGPRSVGGPIGPNETLIFKIHLISV 229


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Sbal195_4031  TCRTETA  32     0.005    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 31.7 bits (72), Expect = 0.005
Identities = 60/359 (16%), Positives = 124/359 (34%), Gaps = 31/359 (8%)

Query: 14 SLFVPVAGLSLFALASGYLMSLIPLSLTFFELSTSLAP---LLASIFYLGLLLGAPCIAP 70
L V ++ ++L A+ G +M ++P L S + +L +++ L AP +
Sbjct: 6 PLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGA 65

Query: 71 IVTRIGHSKAFILFLNILLCSVVAMILIPKSGVWL--ASRLVAGFAVAGIFVVVESWLLM 128
+ R G + +L +++ +V I+ +W+ R+VAG A V +++
Sbjct: 66 LSDRFG--RRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGA-TGAVAGAYIAD 122

Query: 129 ADTQKQRAKRLGLYMTALYG-GTAIGQLAIDYLGTKGNLPYLVVMGLLAAASLPALLVKR 187
+RA+ G +M+A +G G G + +G L +
Sbjct: 123 ITDGDERARHFG-FMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLL 181

Query: 188 GQPQVSEQQSMSLSALKNLSQPAIMGCLVSGLLLGPIY-----------GLLPIYVALDM 236
+ E++ + AL L+ + L ++ L I+
Sbjct: 182 PESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRF 241

Query: 237 GLDQQTGQFMALIIVGGMLVQPLVSYLSPIFNK-----SGLIVSFSLLGIAALLLLSQHS 291
D T + G+L + ++ L++ G +LL
Sbjct: 242 HWDATTIGIS--LAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATR 299

Query: 292 SMTLIIGFLLLGASAFALYPIAISLACDNLPASQMVSVAQVMLLSY-SVGSVIGPLVAS 349
+LL + + A+ + Q L + S+ S++GPL+ +
Sbjct: 300 GWMAFPIMVLLASGGIGM--PALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFT 356


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_4033  CHANLCOLICIN  29     0.027    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 28.9 bits (64), Expect = 0.027
Identities = 19/101 (18%), Positives = 38/101 (37%)

Query: 158 VTSAITNSIKGPVEINSVQIENIDFSNAYEKSVEDRMRAEVEVQTQLQNLEKERVSAQIA 217
VT++ T + +I +Q SN + AE ++ NL ++ +
Sbjct: 291 VTASETRINRINADITQIQKAISQVSNNRNAGIARVHEAEENLKKAQNNLLNSQIKDAVD 350

Query: 218 VTQAQAQADSQLARAKAEAESIRIKGDAEASAIKSRAEALA 258
T + Q ++ K + + ++ I + EALA
Sbjct: 351 ATVSFYQTLTEKYGEKYSKMAQELADKSKGKKIGNVNEALA 391


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_4040  DHBDHDRGNASE  73     2e-17    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 73.2 bits (179), Expect = 2e-17
Identities = 48/184 (26%), Positives = 80/184 (43%), Gaps = 2/184 (1%)

Query: 3 GLTGKVVIITGASEGIGRALAVAMARMGCQLVISARNETRLASLALEIANYGLPPFVFAA 62
G+ GK+ ITGA++GIG A+A +A G + N +L + + F A
Sbjct: 5 GIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPA 64

Query: 63 DVSRAEQCEALIEATVAHYGHLDILINNAGMTMWSRFDELTQLSVLEDIMRVNYLGPAYL 122
DV + + + G +DIL+N AG+ L+ E VN G
Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEW-EATFSVNSTGVFNA 123

Query: 123 THAALPHLKASK-GQVVVVASVAGLTGVPTRSGYAASKHAVIGFFDSLRIELADDNVAVT 181
+ + ++ + G +V V S + + YA+SK A + F L +ELA+ N+
Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183

Query: 182 VICP 185
++ P
Sbjct: 184 IVSP 187


49  Sbal195_4093  Sbal195_4100  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias    %GCBias  Product
Sbal195_4093       9       33   4.521693  adenylate cyclase
Sbal195_4094       9       32   4.385835  porphobilinogen deaminase
Sbal195_4095       9       32   4.085961  uroporphyrinogen III synthase HEM4
Sbal195_4096       9       31   3.782683  hypothetical protein
Sbal195_4097       9       31   3.741876  HemY domain-containing protein
Sbal195_4098       9       32   3.629986  outer membrane adhesin-like protein
Sbal195_4099      -1       14  -2.520691  type I secretion system ATPase
Sbal195_4100       0       13  -3.087276  HlyD family type I secretion membrane fusion
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_4096  RTXTOXIND     29     0.036    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 29.0 bits (65), Expect = 0.036
Identities = 13/77 (16%), Positives = 32/77 (41%), Gaps = 6/77 (7%)

Query: 81 FMLYQQMQQQLLVQDAKNIALQDQLQQALLQPNQRIGQLEQQQLNDAKT-----YQELTK 135
F +Q + Q + K A + A + + + ++E+ +L+D +
Sbjct: 195 FSTWQNQKYQKELNLDKKRA-ERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHA 253

Query: 136 LAEDQNQLQDRINKLAQ 152
+ E +N+ + +N+L
Sbjct: 254 VLEQENKYVEAVNELRV 270


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_4098  CABNDNGRPT    79     2e-16    NodO calcium binding signature.
		>CABNDNGRPT#NodO calcium binding signature.

Length = 479

Score = 79.2 bits (195), Expect = 2e-16
Identities = 40/172 (23%), Positives = 64/172 (37%), Gaps = 6/172 (3%)

Query: 6410 GSDTINGGNGDDILFGDAIN--FNGISGQGYVAIKDYVADQLGIAAVTDAQVHRYITEHA 6467
+ T G+ + + + + A + ++ I +
Sbjct: 260 ANMTTRTGDSVYGFNSNTDRDFYTATDSSKALIFSVWDAGGTDTFDFSGYSNNQRINLNE 319

Query: 6468 SDFDQSGASDKADVLIGGQGNDILYGQGGNDQLYGGNGNDLIFGGAGNDTIIGGLGNDKL 6527
F G + G + G GND L G + ++++ GGAGND + GG G D L
Sbjct: 320 GSFSDVGGLKGNVSIAHGVTIENAIGGSGNDILVGNSADNILQGGAGNDVLYGGAGADTL 379

Query: 6528 TGGTGADTFVWQAG----ESGTDHITDFNIHEDKLDLRDLLQGENTNTLDSY 6575
GG G DTFV+ +G + D I DF DK+DL + +
Sbjct: 380 YGGAGRDTFVYGSGQDSTVAAYDWIADFQKGIDKIDLSAFRNEGQLSFVQDQ 431



Score = 48.4 bits (115), Expect = 8e-07
Identities = 32/89 (35%), Positives = 41/89 (46%), Gaps = 3/89 (3%)

Query: 5831 GDFTTAPFNTGTRTIDNTSGQDQLLGTGGNDHLVSANGGGDLLYGMDGDDILVGSDAVQG 5890
F + ++ R N + G GN + + G G+DILVG+ A
Sbjct: 302 DTFDFSGYSNNQRINLNEGSFSDVGGLKGNVSIAHGVTI-ENAIGGSGNDILVGNSA--D 358

Query: 5891 DSLYGGTGNDVLVAGLGNDGLYGGAGTDI 5919
+ L GG GNDVL G G D LYGGAG D
Sbjct: 359 NILQGGAGNDVLYGGAGADTLYGGAGRDT 387



Score = 44.2 bits (104), Expect = 2e-05
Identities = 31/120 (25%), Positives = 46/120 (38%), Gaps = 3/120 (2%)

Query: 5798 ADKPVVNVILTDNGIPLYSNFKTSGITTEQFRTGDFTTAPFNTGTRTIDNTSGQDQLLGT 5857
+ K ++ + G + S G F+ G +I + + +G
Sbjct: 287 SSKALIFSVWDAGGTDTFDFSGYSNNQRINLNEGSFSDVGGLKGNVSIAHGVTIENAIGG 346

Query: 5858 GGNDHLVSANGGGDLLYGMDGDDILVGSDAVQGDSLYGGTGNDVLVAGLGNDGLYGGAGT 5917
GND LV N ++L G G+D+L G D+LYGG G D V G G D
Sbjct: 347 SGNDILV-GNSADNILQGGAGNDVLYGGAG--ADTLYGGAGRDTFVYGSGQDSTVAAYDW 403



Score = 34.6 bits (79), Expect = 0.014
Identities = 30/135 (22%), Positives = 45/135 (33%), Gaps = 25/135 (18%)

Query: 5825 TEQFRTGDFTTAPFNTGTRTIDNTSGQDQLLGTGGNDHLVSANGGGDLLYGMDGDDILVG 5884
T G + AP I G + TG + + ++N D D L+
Sbjct: 234 TGADYNGHYGGAPMIDDIAAIQRLYGANMTTRTGDSVYGFNSNTDRDFYTATDSSKALIF 293

Query: 5885 SDAVQG-----------------------DSLYGGTGNDVLVAGLGNDGLYGGAGTDIAV 5921
S G + G GN + G+ + GG+G DI
Sbjct: 294 SVWDAGGTDTFDFSGYSNNQRINLNEGSFSDVGGLKGNVSIAHGVTIENAIGGSGNDI-- 351

Query: 5922 LLGNRADYIIEKSTG 5936
L+GN AD I++ G
Sbjct: 352 LVGNSADNILQGGAG 366


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_4100  RTXTOXIND     310    e-103    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 310 bits (796), Expect = e-103
Identities = 86/431 (19%), Positives = 193/431 (44%), Gaps = 11/431 (2%)

Query: 29 RLIIWALAAMVVCFLLWAGFAKLDKVTTGTGKVIPSSQVQVIQSLDGGIMQELYVQEGEM 88
RL+ + + +V + + +++ V T GK+ S + + I+ ++ I++E+ V+EGE
Sbjct: 58 RLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGES 117

Query: 89 VTKGQPLVRIDDTRFRSDYAQQEQEVFGLKTNAIRMRAELDSILISDMTSDWREQVLITK 148
V KG L+++ +D + + + + R + SI E + +
Sbjct: 118 VRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSI----------ELNKLPE 167

Query: 149 KALVFPENIIAAEPALVKRQQEEYNGRLDNLSNQLEILVRQIQQRQQEIDDLASKTTTLT 208
L V R + NQ + +++ E + ++
Sbjct: 168 LKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYE 227

Query: 209 TSMQLISRELELTRPLAKKGIVPEVELLKLERTVNDLQGELNSMRLLRPKVKAAMDEAIL 268
++ L+ L K + + +L+ E + EL + ++++ + A
Sbjct: 228 NLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKE 287

Query: 269 KRREAVFVYAADLRAQLNETQTRLSRMNEAQVGAQDKVSKAIITSPVNGTIKTTHINTLG 328
+ + ++ ++ +L +T + + +++ ++I +PV+ ++ ++T G
Sbjct: 288 EYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEG 347

Query: 329 GVVQPGVDIIEIVPSEDQLLIETKILPKDIAFLHPGLPAVVKITAYDFTRYGGLKGTVEH 388
GVV ++ IVP +D L + + KDI F++ G A++K+ A+ +TRYG L G V++
Sbjct: 348 GVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKN 407

Query: 389 ISADTSQDEEGNSYYLIRVRTAESSLTKNDGTQMPIIPGMLTSVDVITGQRSILEYILNP 448
I+ D +D+ + + + E+ L+ +P+ GM + ++ TG RS++ Y+L+P
Sbjct: 408 INLDAIEDQRLGLVFNVIISIEENCLST-GNKNIPLSSGMAVTAEIKTGMRSVISYLLSP 466

Query: 449 ILRAKDTALRE 459
+ + +LRE
Sbjct: 467 LEESVTESLRE 477


50  Sbal195_4116  Sbal195_4126  Y  N  N  Genomic Island
LocusTag      DNBias  CDNBias  %GCBias     Product
Sbal195_4116  -1      15       -3.412386   sodium:neurotransmitter symporter
Sbal195_4117  0       25       -5.930936   import inner membrane translocase subunit Tim44
Sbal195_4118  -2      27       -6.952799   hypothetical protein
Sbal195_4119  -2      19       -2.484113   hypothetical protein
Sbal195_4120  -1      15       0.851571    putative lipoprotein
Sbal195_4121  0       15       3.008071    curli production assembly/transport protein
Sbal195_4122  1       18       4.817036    hypothetical protein
Sbal195_4123  0       19       5.339936    hypothetical protein
Sbal195_4124  0       18       4.392240    serine--pyruvate transaminase
Sbal195_4125  -1      21       4.483677    threonine dehydratase
Sbal195_4126  0       21       3.609716    dihydroxy-acid dehydratase
51  Sbal195_4247  Sbal195_4263  Y  N  Y  Genomic Island
LocusTag      DNBias  CDNBias  %GCBias     Product
Sbal195_4247  2       24       -5.800824   RNA methyltransferase
Sbal195_4248  3       25       -5.817314   hypothetical protein
Sbal195_4249  3       16       -4.187821   RNA-directed DNA polymerase
Sbal195_4250  1       12       -3.460160   transposase IS3/IS911 family protein
Sbal195_4251  0       8        -2.728523   integrase catalytic subunit
Sbal195_4252  0       8        -1.449787   hypothetical protein
Sbal195_4253  -1      10       1.083503    hypothetical protein
Sbal195_4254  0       11       0.521171    peptidase S9 prolyl oligopeptidase
Sbal195_4255  -1      13       -0.074964   peptidase M16 domain-containing protein
Sbal195_4256  0       13       -0.198853   peptidase M16 domain-containing protein
Sbal195_4257  2       18       -0.801085   amidohydrolase
Sbal195_4258  3       28       -2.760933   hypothetical protein
Sbal195_4259  3       28       -2.886799   hypothetical protein
Sbal195_4260  1       22       -1.781156   hypothetical protein
Sbal195_4261  1       19       -1.469537   ArsR family transcriptional regulator
Sbal195_4262  1       21       -2.254192   NIPSNAP family protein
Sbal195_4263  2       24       -1.744478   antibiotic biosynthesis monooxygenase
52  Sbal195_4310  Sbal195_4326  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
Sbal195_4310  2       17       2.916384    hypothetical protein
Sbal195_4311  2       16       2.543168    cytochrome c oxidase subunit II
Sbal195_4312  2       19       1.873179    cytochrome c oxidase subunit I
Sbal195_4313  3       21       1.639374    cytochrome C oxidase assembly protein
Sbal195_4314  3       20       1.927043    cytochrome c oxidase subunit III
Sbal195_4315  2       18       1.696928    hypothetical protein
Sbal195_4316  2       15       1.761463    hypothetical protein
Sbal195_4317  2       15       1.702261    hypothetical protein
Sbal195_4318  1       12       0.406184    cytochrome oxidase assembly
Sbal195_4319  -1      10       -0.094296   protoheme IX farnesyltransferase
Sbal195_4320  0       12       -1.591516   electron transport protein SCO1/SenC
Sbal195_4321  1       14       -3.007354   polysaccharide deacetylase
Sbal195_4322  1       19       -4.591798   MATE efflux family protein
Sbal195_4323  2       22       -5.552742   peptidase S9 prolyl oligopeptidase
Sbal195_4324  2       30       -6.836045   putative DNA uptake protein
Sbal195_4325  2       28       -5.955135   flavocytochrome c
Sbal195_4326  1       24       -4.839379   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_4320  PF06057       27     0.039    Type IV secretory pathway VirJ component
		>PF06057#Type IV secretory pathway VirJ component

Length = 243

Score = 27.5 bits (61), Expect = 0.039
Identities = 14/42 (33%), Positives = 19/42 (45%), Gaps = 4/42 (9%)

Query: 70 IGFTFCPDVCPTTLNKLAAAYPDLNKIAPLQVVFLSVDPKRD 111
IG++F +V P LN++ A Y L V LS D
Sbjct: 122 IGYSFGAEVIPFVLNEMPARYRK----NVLGAVLLSPSQSSD 159


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_4321  BCTERIALGSPC  30     0.017    Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C

signature.
Length = 272

Score = 29.5 bits (66), Expect = 0.017
Identities = 28/105 (26%), Positives = 48/105 (45%), Gaps = 13/105 (12%)

Query: 1 MVKRVLLALIGLMTFSAHAVVILQYH-HVSETTP-AATSVTPAQFREQMQFLAD-DGFKV 57
+++R+L L LM + ++ + + + P ++ +TPAQ R+Q L D F V
Sbjct: 13 VIRRILFYL--LMLLFCQQLAMIFWRIGLPDNAPVSSVQITPAQARQQPVTLNDFTLFGV 70

Query: 58 IPLSQVVEAIKQKQ--DLPAKTVAITF------DDGYRSIATTAH 94
P A+ Q +LP T+ ++ DD RSIA +
Sbjct: 71 SPEKNKAGALDASQMSNLPPSTLNLSLTGVMAGDDDSRSIAIISK 115


53  Sbal195_4394  Sbal195_4428  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
Sbal195_4394  0       17       -4.241643   hypothetical protein
Sbal195_4395  1       22       -6.641713   glutathione reductase
Sbal195_4396  3       28       -8.800090   integrase family protein
Sbal195_4397  2       28       -8.776794   hypothetical protein
Sbal195_4398  2       29       -9.232536   transposase mutator type
Sbal195_4399  3       34       -10.085787  hypothetical protein
Sbal195_4400  3       25       -6.468232   hypothetical protein
Sbal195_4401  3       21       -4.465626   hypothetical protein
Sbal195_4402  2       20       -4.118614   hypothetical protein
Sbal195_4404  2       20       -4.144990   hypothetical protein
Sbal195_4405  2       19       -3.357252   hypothetical protein
Sbal195_4406  2       17       -3.429233   YD repeat-containing protein
Sbal195_4407  0       18       -4.386944   DNA-directed DNA polymerase
Sbal195_4408  1       28       -6.156518   peptidase S24/S26 domain-containing protein
Sbal195_4409  2       29       -6.201373   hypothetical protein
Sbal195_4410  1       28       -6.163666   XRE family transcriptional regulator
Sbal195_4411  3       32       -7.306394   hypothetical protein
Sbal195_4412  3       37       -7.909952   type III restriction protein res subunit
Sbal195_4413  4       39       -7.755859   hypothetical protein
Sbal195_4414  4       38       -7.613968   integrase catalytic subunit
Sbal195_4415  5       39       -7.870815   transposase IS3/IS911 family protein
Sbal195_4416  4       34       -7.191080   hypothetical protein
Sbal195_4417  0       22       -3.777996   ATPase central domain-containing protein
Sbal195_4418  -1      16       -2.290156   ATP-dependent OLD family endonuclease
Sbal195_4419  -2      13       -0.816182   UvrD/REP helicase
Sbal195_4420  -2      15       1.187095    hypothetical protein
Sbal195_4421  -1      15       1.395800    secretion protein HlyD family protein
Sbal195_4422  -1      15       1.901662    fusaric acid resistance protein region
Sbal195_4423  -1      18       2.367886    sodium/proline symporter
Sbal195_4424  -2      18       3.366676    gamma-glutamyl kinase
Sbal195_4425  -2      18       3.578394    gamma-glutamyl phosphate reductase
Sbal195_4426  -2      17       3.502294    TonB-dependent siderophore receptor
Sbal195_4427  -2      17       4.076169    hypothetical protein
Sbal195_4428  -2      16       3.502695    RND family efflux transporter MFP subunit
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_4394  IGASERPTASE   28     0.022    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 28.1 bits (62), Expect = 0.022
Identities = 25/156 (16%), Positives = 52/156 (33%), Gaps = 11/156 (7%)

Query: 27 ADARQLLELEPESEPSLESLSQSKQTAPLPLGTLLDSEGKPINLPDQEQSSFEYSAPTLT 86
++ + + + E ++ T + E K + + + S +
Sbjct: 1034 SETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSG---S 1090

Query: 87 PTESSKTTKSTKATQSTKATKSKKLSRKQQLASREHVANDPNCRWLDKRMDQLEAQLGGK 146
T+ ++TT++ + K K+K + K Q + P + Q E
Sbjct: 1091 ETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPA---- 1146

Query: 147 QDNAATHQADELSARQKEWQCLKCDAEGPAQNDHSN 182
++N T E ++ D E PA+ SN
Sbjct: 1147 RENDPTVNIKEPQSQTNT----TADTEQPAKETSSN 1178


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_4416  SUBTILISIN    44     7e-07    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 44.1 bits (104), Expect = 7e-07
Identities = 18/51 (35%), Positives = 25/51 (49%), Gaps = 5/51 (9%)

Query: 284 LCILDTGVNICHPLLQ----PFINEVDQFSVNPDWSPSDDNGHGTGMAGLA 330
+ +LDTG + HP L+ N D +P+ D NGHGT +AG
Sbjct: 45 VAVLDTGCDADHPDLKARIIGGRNFTDDDEGDPE-IFKDYNGHGTHVAGTI 94


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_4421  RTXTOXIND     62     7e-13    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 61.8 bits (150), Expect = 7e-13
Identities = 41/215 (19%), Positives = 87/215 (40%), Gaps = 31/215 (14%)

Query: 77 TRYKATIAELNAKAESQKLAWELAKHKYKRRIGLTNDNLVSKETFDEAFINTELARTSYE 136
YK+ + ++ ++ S K ++L +K I +L +T+
Sbjct: 269 RVYKSQLEQIESEILSAKEEYQLVTQLFKNEI------------------LDKLRQTTDN 310

Query: 137 LAQ--AQLNTAKIDLARTQIHAPENGTLINLSLR-NGNYVSKGNSVFSLV-KQDSLYITG 192
+ +L + + I AP + + L + G V+ ++ +V + D+L +T
Sbjct: 311 IGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTA 370

Query: 193 YFEETKIPLVHIGQNADVSLMSGGQVLHGKVTSIGKAIANTNVTTNGQLLPQIGQTFNWV 252
+ I +++GQNA + + + +G + GK N+ + ++G FN +
Sbjct: 371 LVQNKDIGFINVGQNAIIKVEAFPYTRYGYLV--GKV---KNINLDAIEDQRLGLVFNVI 425

Query: 253 RLSQRIPVDIELDSIPKDIELSVGMTVSIQLQTDK 287
I + K+I LS GM V+ +++T
Sbjct: 426 I---SIEENCLSTGN-KNIPLSSGMAVTAEIKTGM 456



Score = 48.3 bits (115), Expect = 2e-08
Identities = 24/155 (15%), Positives = 56/155 (36%), Gaps = 9/155 (5%)

Query: 9 LTLIVVAVAGIAGHWIWSHYLYSPWTRDGRVRA--EIITIAPDVYGWVNQLNVKDNQIVN 66
+ ++ IA + T +G++ I P V ++ VK+ + V
Sbjct: 60 VAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVR 119

Query: 67 KGDVLFTVDDTRYKATIAELNAKAESQKLA---WELAKHKYKR----RIGLTNDNLVSKE 119
KGDVL + +A + + +L +++ + + L ++
Sbjct: 120 KGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNV 179

Query: 120 TFDEAFINTELARTSYELAQAQLNTAKIDLARTQI 154
+ +E T L + + Q Q +++L + +
Sbjct: 180 SEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRA 214


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_4424  CARBMTKINASE  46     7e-08    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 46.4 bits (110), Expect = 7e-08
Identities = 33/126 (26%), Positives = 51/126 (40%), Gaps = 16/126 (12%)

Query: 116 KDTIFSLLEHGLL---------PIINENDAVTADKLKVGDNDNLSAMVAAAADADTLIIC 166
+TI L+E G++ P+I E+ + + V D D +A +AD +I
Sbjct: 176 AETIKKLVERGVIVIASGGGGVPVILEDGEIKGVE-AVIDKDLAGEKLAEEVNADIFMIL 234

Query: 167 SDVNGLYTQNPHENPDAQLIKQVTEINAEIYAMAGGASSAVGTGGMRTKIQAAKKAISHG 226
+DVNG + Q +++V Y G G M K+ AA + I G
Sbjct: 235 TDVNGAALY--YGTEKEQWLREVKVEELRKYYEEGH----FKAGSMGPKVLAAIRFIEWG 288

Query: 227 IETFII 232
E II
Sbjct: 289 GERAII 294


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_4428  RTXTOXIND     50     6e-09    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 50.2 bits (120), Expect = 6e-09
Identities = 27/145 (18%), Positives = 50/145 (34%), Gaps = 10/145 (6%)

Query: 3 YLMIGLYLLCFAVHASAVPVTVALPQTGSTNELLTLSGSIKSARVARLSARTDGLVAKVL 62
Y ++G ++ F + V + G LT SG K + + +V +++
Sbjct: 62 YFIMGFLVIAFILSVLG-QVEIVATANGK----LTHSGRSKE-----IKPIENSIVKEII 111

Query: 63 VDAGSQVRAGQPLLALDDTLAVHQLAQRQADVMAAKTMLAEKQRLLTEAQTLSAQQLFPE 122
V G VR G LL L A + Q+ ++ A+ Q L + +L
Sbjct: 112 VKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLP 171

Query: 123 TERAIRQAALTEAEANLQSLTAALA 147
E + + E + +
Sbjct: 172 DEPYFQNVSEEEVLRLTSLIKEQFS 196



Score = 42.5 bits (100), Expect = 1e-06
Identities = 32/194 (16%), Positives = 63/194 (32%), Gaps = 19/194 (9%)

Query: 86 QLAQRQADVMAAKTMLAEKQRLLTEAQTLSAQQLFPETERAIRQAALTEAEANLQSLTAA 145
+ + ++ K+ L + + + A+ Q + + I L + N+ LT
Sbjct: 260 KYVEAVNELRVYKSQLEQIESEILSAK-EEYQLVTQLFKNEILDK-LRQTTDNIGLLTLE 317

Query: 146 LAHQKEVVARHQLMAPFAGVISSKQTET-GEWVNVGTEIFTLV-SQEQLWLDIQVPQEMF 203
LA +E + AP + + + T G V + +V + L + V +
Sbjct: 318 LAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDI 377

Query: 204 QSVATATHVDIAADMLPEQQFIGHVNALV-------PVSDHNARSFLVRLTIPEADGL-- 254
+ + I + P G++ V F V ++I E
Sbjct: 378 GFINVGQNAIIKVEAFP-YTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEENCLSTG 436

Query: 255 -----LMPGTSATA 263
L G + TA
Sbjct: 437 NKNIPLSSGMAVTA 450


54  Sbal195_0055  Sbal195_0068  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
Sbal195_0055  -1      24       2.529200    TrkA domain-containing protein
Sbal195_0056  -2      23       2.744158    two component transcriptional regulator
Sbal195_0057  -2      21       2.381686    integral membrane sensor signal transduction
Sbal195_0058  -2      20       2.254470    pirin domain-containing protein
Sbal195_0059  -1      21       2.333951    signal transduction histidine kinase LytS
Sbal195_0060  -1      19       2.352851    LytTR family two component transcriptional
Sbal195_0061  -1      16       2.501536    major facilitator superfamily transporter
Sbal195_0062  0       17       2.345499    hypothetical protein
Sbal195_0063  -1      15       2.212882    NLP/P60 protein
Sbal195_0064  -1      16       2.545569    major facilitator superfamily transporter
Sbal195_0065  -1      17       2.141326    LacI family transcriptional regulator
Sbal195_0066  -1      12       2.190998    sucrose phosphorylase
Sbal195_0067  0       14       2.295161    major facilitator superfamily transporter
Sbal195_0068  0       14       2.247206    TonB-dependent receptor
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0055  NUCEPIMERASE  27     0.042    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 27.4 bits (61), Expect = 0.042
Identities = 13/28 (46%), Positives = 16/28 (57%), Gaps = 1/28 (3%)

Query: 6 VIGLGRF-GVAVSRELIHLGHTVTGVDN 32
V G F G VS+ L+ GH V G+DN
Sbjct: 5 VTGAAGFIGFHVSKRLLEAGHQVVGIDN 32


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0056  HTHFIS        93     6e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 92.6 bits (230), Expect = 6e-24
Identities = 35/112 (31%), Positives = 60/112 (53%), Gaps = 1/112 (0%)

Query: 4 KVLVVDDEAQIHTFMRISLEAEGFEYHGAASIASALAQYQAQRPHVLVLDLGLPDGDGIS 63
+LV DD+A I T + +L G++ ++ A+ A ++V D+ +PD +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 64 LLQTLRQHDK-VPVLILTARDQEEEKIRLLEAGANDYLSKPFGIRELIARIK 114
LL +++ +PVL+++A++ I+ E GA DYL KPF + ELI I
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0059  PF06580       202    2e-62    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 202 bits (516), Expect = 2e-62
Identities = 60/205 (29%), Positives = 110/205 (53%), Gaps = 13/205 (6%)

Query: 351 EQLQEMTRKAEFTALQSKINPHFLFNALNAISSLIRIRPQQARELIANLADYLRYNLAKG 410
++ M ++A+ AL+++INPHF+FNALN I +LI P +ARE++ +L++ +RY+L
Sbjct: 152 WKMASMAQEAQLMALKAQINPHFMFNALNNIRALILEDPTKAREMLTSLSELMRYSLRYS 211

Query: 411 D-ELIDIQEEVKQVRDYVAIEQARFGDKLEVVFDVDD--VHFCVPCLLLQPLVENAILHG 467
+ + + +E+ V Y+ + +F D+L+ ++ + VP +L+Q LVEN I HG
Sbjct: 212 NARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVENGIKHG 271

Query: 468 IQPRSAPGRVTIEVKKLDSGIRVAVRDTGYGISQEVIDGVAAGRIESSSIGLMNVHQRVK 527
I G++ ++ K + + + V +TG ES+ GL NV +R++
Sbjct: 272 IAQLPQGGKILLKGTKDNGTVTLEVENTG--------SLALKNTKESTGTGLQNVRERLQ 323

Query: 528 LLYGE--GLQLKRLEPGTEVSFYLP 550
+LYG ++L + +P
Sbjct: 324 MLYGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0060  HTHFIS        68     4e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 67.5 bits (165), Expect = 4e-15
Identities = 24/132 (18%), Positives = 59/132 (44%), Gaps = 6/132 (4%)

Query: 3 KAIIVEDEYLAREELE-YLVKSHSEIDIVASFEDGLEAFKYLQDHEVDVVFLDIQIPSID 61
++ +D+ R L L ++ ++ I ++ + + D+V D+ +P +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRW---IAAGDGDLVVTDVVMPDEN 61

Query: 62 GLLLAKNLHKSTHPPHVVFVTAYKEF--AVEAFELEAFDYILKPYNEPRIISLLQKIELA 119
L + K+ V+ ++A F A++A E A+DY+ KP++ +I ++ +
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 120 GRQAPKPQHEAA 131
++ P + +
Sbjct: 122 PKRRPSKLEDDS 133


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0061  TCRTETA       35     6e-04    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 34.8 bits (80), Expect = 6e-04
Identities = 54/278 (19%), Positives = 101/278 (36%), Gaps = 39/278 (14%)

Query: 43 PVSQVAFVFGLL----SLSLAVASSMAGKLQERFGVRNVTLGAGLLLGLGFLLTAQASNL 98
+ V +G+L +L + + G L +RFG R V L + + + + A A L
Sbjct: 37 HSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFL 96

Query: 99 MMLYLCAGILVGFADGTGY--------LMTLSNCVKWFPERKGLISALAIGAYGLGSLGF 150
+LY+ I+ G TG + + F G +SA +G G +
Sbjct: 97 WVLYI-GRIVAGITGATGAVAGAYIADITDGDERARHF----GFMSA----CFGFGMVAG 147

Query: 151 KYINVLLLENTGLETTFQLWGLIAMALVLCGGMLMKDA------PAQSAASQQAESRDFT 204
+ L+ F + L G L+ ++ P + A S F
Sbjct: 148 PVLGGLMGG-FSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLAS--FR 204

Query: 205 LAEAMRKPQYWMLALMFLSACMSG----LYVIGVAKDIGEKMVDLPVLVAANAVAVIAMA 260
A M ++A+ F+ + L+VI + + +AA + +
Sbjct: 205 WARGMT-VVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGI----LH 259

Query: 261 NLSGRLVLGILSDKIPRIRVISLAQIITLVGMVLLLFV 298
+L+ ++ G ++ ++ R + L I G +LL F
Sbjct: 260 SLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFA 297


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0064  TCRTETB       110    3e-28    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 110 bits (276), Expect = 3e-28
Identities = 85/430 (19%), Positives = 170/430 (39%), Gaps = 29/430 (6%)

Query: 30 FLAAVDQTLLATATPAIVEDLGGLR-QASWITIGYMLAMAASVPIYGWLGDNFGRAKILM 88
F + +++ +L + P I D +W+ +ML + +YG L D G ++L+
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 89 IAIVIFALGSIVSA-SAGTMDHMIAGRILQGMGGGGLMSLSQSLIGELVPIRQRARFQGY 147
I+I GS++ +I R +QG G +L ++ +P R + G
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 148 FAAMFTLASVGGPVIGGIVVHAYSWHWLFWANIPLA-MLAVWRLNGLHKHSVKPVRQGKF 206
++ + GP IGG++ H W +L IP+ ++ V L L K V+ +G F
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYIHWSYLL--LIPMITIITVPFLMKLLKKEVR--IKGHF 199

Query: 207 DLVGVVLFPTIITALLYWLSVAGQEFAWLSATSLGFAVFVVFGILGLLLWERRLASPFLP 266
D+ G++L I + + + F +S S F +FV R++ PF+
Sbjct: 200 DIKGIILMSVGIVFFMLFTTSYSISFLIVSVLS--FLIFVKH--------IRKVTDPFVD 249

Query: 267 LDLLAKKAVYMPLLTAALFAACLFAMIFFLPIYLQVGLHTNPAKTG-LLLLPMTFGIVTG 325
L + +L + + + +P ++ + A+ G +++ P T ++
Sbjct: 250 PGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIF 309

Query: 326 STIAGRLLSKDVAPKWLPTFGMGLAFIGLILISFVPPNANVIGGLGV-LVGIGLGTVMPS 384
I G L+ + P ++ G+ + + SF+ + + + V GL
Sbjct: 310 GYIGGILVDR-RGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTV 368

Query: 385 VQLVVQSVSGKARLSQITAMVSLCRSMGAAIGTALFSVLLYSLLPLTGSELGIAAIKTLP 444
+ +V S + ++++ + G A+ LL + + + LP
Sbjct: 369 ISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLL---------SIPLLDQRLLP 419

Query: 445 TEVVHHAFQY 454
EV + Y
Sbjct: 420 MEVDQSTYLY 429


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0067  TCRTETA       61     3e-12    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 60.6 bits (147), Expect = 3e-12
Identities = 68/370 (18%), Positives = 129/370 (34%), Gaps = 46/370 (12%)

Query: 22 LMFFMFAMTSDAVGV-----IIPELISQFGLSMSQASAFHYMPMIFIAMSGLF---LGFL 73
L+ + + DAVG+ ++P L+ S + + + ++ M LG L
Sbjct: 7 LIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66

Query: 74 ADKIGRKLTILLGLLLFALACFMFALGESFYYFLFLLAFVGTAIGVFKTGALGLIGDIST 133
+D+ GR+ +L+ L A+ + A F + L++ V G A I DI T
Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATA-PFLWVLYIGRIVAGITGATGAVAGAYIADI-T 124

Query: 134 SSKQHSSTMNTVEGYFGVGAMIGPAIVSYLLISGVSWKYLYFGAGC-----FCLVLCWL- 187
+ + + FG G + GP + + G S +F A F L
Sbjct: 125 DGDERARHFGFMSACFGFGMVAGPVLGGLM--GGFSPHAPFFAAAALNGLNFLTGCFLLP 182

Query: 188 ----AYRADYPQIKRSSTDAINLASTFKMMKNPYALGFSL-AIGLYVATEVAIYV----- 237
R + + + A ++ A+ F + +G A I+
Sbjct: 183 ESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFH 242

Query: 238 WMPTLLQSYQGDYTTLAAYALT-IFFTLRAGGRFLGGWVLDRFPWQQVMFWFSFAISACY 296
W T + +LAA+ + G R +M +
Sbjct: 243 WDATTIG------ISLAAFGILHSLAQAMITGPVAARLGERRA----LMLGMIADGTGYI 292

Query: 297 LGSMI---YGIEAAVILLPLSGLFMSMMYPTLNSKGISCFPVDQHGSVAGVILFFTAVSA 353
L + + ++LL G+ M + S+ + ++ G + G + T++++
Sbjct: 293 LLAFATRGWMAFPIMVLLASGGIGMP-ALQAMLSRQVD---EERQGQLQGSLAALTSLTS 348

Query: 354 AVGPLLMGFV 363
VGPLL +
Sbjct: 349 IVGPLLFTAI 358


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0068  ECOLIPORIN    33     0.005    E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 33.0 bits (75), Expect = 0.005
Identities = 60/276 (21%), Positives = 101/276 (36%), Gaps = 67/276 (24%)

Query: 411 DDTSVTLGYYNATQ-NIGMS----WMWNSYLMEVKGDNAALLDVVAADGTAYSDNGLYGY 465
D T + +G+ TQ N ++ W +N +G+ A +A G + D G + Y
Sbjct: 53 DQTYMRVGFKGETQINDQLTGYGQWEYNVQANTTEGEGANSWTRLAFAGLKFGDYGSFDY 112

Query: 466 GVPYWGNCCQRNYDTDYTIKAPYLALASSFGDLSLDASVRYDSGDASG------------ 513
G RNY Y ++ + + FG S + Y +G A+G
Sbjct: 113 G---------RNYGVLYDVEG-WTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGL 162

Query: 514 ----NYAGSVQSQVDMNLDGVISIPEQSVSSIDNANPQPVNYDWSYTSYSLGANYQFASD 569
N+A Q + + ++I + ++ D+ N D + + Y
Sbjct: 163 VDGLNFALQYQGKNESQSADDVNIGTNNRNNGDDIRYD--NGD----GFGISTTYDIGMG 216

Query: 570 LAAFARLSHGGRANADRLLFGKVRADGSVAKEDAVDIVDQYELGVKYRYDDLSVFATAFY 629
+A A + R N +++ G A G A D + G+KY D +++ Y
Sbjct: 217 FSAGAAYTTSDRTN-EQVNAGGTIAGGDKA--------DAWTAGLKY--DANNIYLATMY 265

Query: 630 SET-------------------EEQNFEATSQRFFD 646
SET + QNFE T+Q FD
Sbjct: 266 SETRNMTPYGKTDKGYDGGVANKTQNFEVTAQYQFD 301


55  Sbal195_0152  Sbal195_0159  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
Sbal195_0152  0       16       1.152422    general secretion pathway protein C
Sbal195_0153  1       17       1.345128    general secretion pathway protein D
Sbal195_0154  1       21       2.242398    general secretory pathway protein E
Sbal195_0155  -1      20       1.344249    general secretion pathway protein F
Sbal195_0156  0       16       1.402068    general secretion pathway protein G
Sbal195_0157  -1      17       1.729916    general secretion pathway protein H
Sbal195_0158  -1      15       1.354569    general secretion pathway protein I
Sbal195_0159  -1      13       1.637160    general secretion pathway protein J
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0152  BCTERIALGSPC  182    3e-58    Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C

signature.
Length = 272

Score = 182 bits (462), Expect = 3e-58
Identities = 70/288 (24%), Positives = 138/288 (47%), Gaps = 36/288 (12%)

Query: 17 KPLSRIVFWLGFIVIMLLAAQITWKL-VPTSSSASAWSPTPVSVNGKGAGQVDLAGLQQL 75
+ RI+F+L ++ A I W++ +P ++ S+ TP + L
Sbjct: 12 SVIRRILFYLLMLLFCQQLAMIFWRIGLPDNAPVSSVQITPAQARQQPVT------LNDF 65

Query: 76 GLFGKADATSDKPKVEAVETVTDAPKTTLSIQLTGVVASTADQKGLAIIESNGSQDTYSL 135
LFG + + ++A +++ P +TL++ LTGV+A D + +AII + Q + +
Sbjct: 66 TLFGVSPEKNKAGALDA-SQMSNLPPSTLNLSLTGVMAGDDDSRSIAIISKDNEQFSRGV 124

Query: 136 GDKIKGTSASLKEVYADRIIITNAGRYETLMLDGLVYTSQSPANQQLQQAKSNKAGSAVS 195
+++ G +A + + DR+++ GRYE L L +
Sbjct: 125 NEEVPGYNAKIVSIRPDRVVLQYQGRYEVLGLYSQEDSGSDGVPG--------------- 169

Query: 196 RVDQRNNADISQELAESRTELLADPSKITDYIAISPVRQGDSVAGYRLNPGKDANLFKQA 255
A ++++L + + ++DY++ SP+ + + GYRLNPG ++ F +
Sbjct: 170 -------AQVNEQLQQR------ASTTMSDYVSFSPIMNDNKLQGYRLNPGPKSDSFYRV 216

Query: 256 GFKANDLAKSINGYDLTVMSQALEMMSQLSELTEVSIMVEREGQLVEI 303
G + ND+A ++NG DL QA + M +++++ ++ VER+GQ +I
Sbjct: 217 GLQDNDMAVALNGLDLRDAEQAKKAMERMADVHNFTLTVERDGQRQDI 264


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0153  BCTERIALGSPD  600    0.0      Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D

signature.
Length = 660

Score = 600 bits (1549), Expect = 0.0
Identities = 330/683 (48%), Positives = 449/683 (65%), Gaps = 37/683 (5%)

Query: 6 IRRKLIAGIVAGAAMFSSQFAWSEQYAANFKGTDIQEFINIVGKNLNKTIIVDPTIRGKI 65
IR + ++ A +F A +E+++A+FKGTDIQEFIN V KNLNKT+I+DP++RG I
Sbjct: 7 IRSFSLTLLIFAALLFRP--AAAEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTI 64

Query: 66 NVRSYDLLNDEQYYQFFLNVLQVYGYAIVEMENNVIKVIKDKDAKTAAIRVANDAEPGIG 125
VRSYD+LN+EQYYQFFL+VL VYG+A++ M N V+KV++ KDAKTAA+ VA+DA PGIG
Sbjct: 65 TVRSYDMLNEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGIG 124

Query: 126 DEMVTRIVALYNTEAKQLAPLLRQLNDNAGGGNVVNYDPSNVLMLSGRAAVVNKLVEIVR 185
DE+VTR+V L N A+ LAPLLRQLNDNAG G+VV+Y+PSNVL+++GRAAV+ +L+ IV
Sbjct: 125 DEVVTRVVPLTNVAARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVE 184

Query: 186 RVDKQGDTSVQVVPLEFASAGEMVRIIDTLYRATANQSQMPGQAPKVVADERINAVVVSG 245
RVD GD SV VPL +ASA ++V+++ L + T+ + VVADER NAV+VSG
Sbjct: 185 RVDNAGDRSVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSG 244

Query: 246 DEKSRQRVVELIHRLDAEQASTGNTKVRYLRYAKAEDLVEVLTGFAQKLEGEKDPNAQAA 305
+ SRQR++ +I +LD +QA+ GNTKV YL+YAKA DLVEVLTG + ++ EK A
Sbjct: 245 EPNSRQRIIAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEK--QAAKP 302

Query: 306 GGKRRNEINIMAHAETNALVISAEPDQMRTIESVINQLDIRRAQVLVEAIIVEVAEGDNV 365
I I AH +TNAL+++A PD M +E VI QLDIRR QVLVEAII EV + D +
Sbjct: 303 VAALDKNIIIKAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGL 362

Query: 366 GFGVQWASKSGLGTQFNNLGPTIGEIGAGIWQAQDVKASQTCTGSGDNQTCTDNPDTKGD 425
G+QWA+K+ TQF N G I AG + + G
Sbjct: 363 NLGIQWANKNAGMTQFTNSGLPISTAIAG----------------------ANQYNKDGT 400

Query: 426 VT-LLAQALGKVNGMAWGVAMGDFGALIQAVSSDTNSNVLATPSITTLDNQEASFIVGDE 484
V+ LA AL NG+A G G++ L+ A+SS T +++LATPSI TLDN EA+F VG E
Sbjct: 401 VSSSLASALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQE 460

Query: 485 VPILTGSTASSSNSNPFQTVERKEVGVKLKVVPQINEGNAVKLTIEQEVSGVNG-----N 539
VP+LTGS ++S N F TVERK VG+KLKV PQINEG++V L IEQEVS V +
Sbjct: 461 VPVLTGSQ-TTSGDNIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSVADAASSTS 519

Query: 540 TGVDISFATRRLTTTVMADSGQIVVLGGLINEEVQESIQKVPFLGDIPIIGHLFKSSSSK 599
+ + +F TR + V+ SG+ VV+GGL+++ V ++ KVP LGDIP+IG LF+S+S K
Sbjct: 520 SDLGATFNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKK 579

Query: 600 KTKKNLMIFIKPTIIRDGITMEGIAGRKYNYFRALQLEQ--QERGVNLMPNTKVPVLEEW 657
+K+NLM+FI+PT+IRD + +Y F Q +Q +E ++ + +
Sbjct: 580 VSKRNLMLFIRPTVIRDRDEYRQASSGQYTAFNDAQSKQRGKENNDAMLNQDLLEIYP-- 637

Query: 658 NQSEYLPPEVNAILERYKEGKGL 680
Q +V+A ++ + G L
Sbjct: 638 RQDTAAFRQVSAAIDAFNLGGNL 660


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0155  BCTERIALGSPF  506    0.0      Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F

signature.
Length = 408

Score = 506 bits (1304), Expect = 0.0
Identities = 229/407 (56%), Positives = 304/407 (74%), Gaps = 1/407 (0%)

Query: 1 MPAFEYKALDAKGKQLKGVIEADTARHARSQLRDQRMMPLEILPVSEKEAKAKSSSFSF- 59
M + Y+ALDA+GK+ +G EAD+AR AR LR++ ++PL + + K+ S+ S
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 60 FKRGISVAELALITRQIATLVAAGLPIEESLKAVGQQCEKDRLASMIMAVRSRVVEGYSL 119
K +S ++LAL+TRQ+ATLVAA +P+EE+L AV +Q EK L+ ++ AVRS+V+EG+SL
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 120 ADSLAEFPHIFDDLYRAMVASGEKSGHLEVVLNRLADYTERRQQLKSKLTQAMIYPAVLT 179
AD++ FP F+ LY AMVA+GE SGHL+ VLNRLADYTE+RQQ++S++ QAMIYP VLT
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180

Query: 180 TVAIGVISILLAAVVPKVVGQFEHMGAELPASTRFLISASDFVQNYGVFVVIALVMLFAL 239
VAI V+SILL+ VVPKVV QF HM LP STR L+ SD V+ +G ++++AL+ F
Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240

Query: 240 FRRMLKSPAFRMKYDNFLLSMPVVGRVSKGLNTARFARTLSILSASSVPLLDGMRIASEV 299
FR ML+ R+ + LL +P++GR+++GLNTAR+ARTLSIL+AS+VPLL MRI+ +V
Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300

Query: 300 LQNVRVRAAVDDATARVREGTSLGAALTNTKLFPAMMLYMIASGEKSGQLEQMLERAADN 359
+ N R + AT VREG SL AL T LFP MM +MIASGE+SG+L+ MLERAADN
Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360

Query: 360 QDREFEGNVNIALGVFEPMLVVSMACVVLFIVMAILQPILALNNLIS 406
QDREF + +ALG+FEP+LVVSMA VVLFIV+AILQPIL LN L+S
Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLMS 407


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0156  BCTERIALGSPG  229    6e-81    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 229 bits (584), Expect = 6e-81
Identities = 97/144 (67%), Positives = 119/144 (82%)

Query: 1 MQMNKKHQGFTLLEVMVVIVILGILASMVVPNLMGNKDKADQQKAVSDIVALENALDMYK 60
M+ K +GFTLLE+MVVIVI+G+LAS+VVPNLMGNK+KAD+QKAVSDIVALENALDMYK
Sbjct: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYK 60

Query: 61 LDNGIYPTTEQGLEALVQKPTISPEPRNYREDGYVKRLPEDPWRNKYLLLSPGENGKLDI 120
LDN YPTT QGLE+LV+ PT+ P NY ++GY+KRLP DPW N Y+L++PGE+G D+
Sbjct: 61 LDNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDL 120

Query: 121 FTAGPDGQPGTEDDIGNWNLQNFQ 144
+AGPDG+ GTEDDI NW L +
Sbjct: 121 LSAGPDGEMGTEDDITNWGLSKKK 144


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0157  BCTERIALGSPH  86     1e-23    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H

signature.
Length = 170

Score = 86.2 bits (213), Expect = 1e-23
Identities = 44/171 (25%), Positives = 70/171 (40%), Gaps = 39/171 (22%)

Query: 17 LRHAGFTLMEVMLVILLMGLTAAAVTMSIGNSGPQQALDRTARQFIAATEMVLDETVLSG 76
+R GFTL+E+ML++LLMG++A V ++ S A AR F A V + +G
Sbjct: 1 MRQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLAR-FEAQLRFVQQRGLQTG 59

Query: 77 QFIGIVIEKTSYQFVFYKDG---------------KWEPLDKDRLLSEKQMEPGVVMNLV 121
QF G+ + +QF+ + +W PL R+ +
Sbjct: 60 QFFGVSVHPDRWQFLVLEARDGADPAPADDGWSGYRWLPLRAGRVATSGS---------- 109

Query: 122 LDGLPLVQDDEEDDSWFEEPLIEPSADDKKKHPEPQVMLFPSGEMSAFELT 172
+ G L + ++W P V++FP GEM+ F LT
Sbjct: 110 IAGGKLNLAFAQGEAW-------------TPGDNPDVLIFPGGEMTPFRLT 147


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0158  PilS_PF08805  29     0.003    PilS N terminal
		>PilS_PF08805#PilS N terminal

Length = 185

Score = 29.1 bits (65), Expect = 0.003
Identities = 11/32 (34%), Positives = 17/32 (53%)

Query: 5 KGMTLLEVIVALAVFSIAAVSITKSLGEQMAN 36
KG TL+EV++ + V + A S K +N
Sbjct: 26 KGATLMEVLLVVGVIVVLAASAYKLYSMVQSN 57


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0159  BCTERIALGSPG  31     0.002    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 31.4 bits (71), Expect = 0.002
Identities = 16/41 (39%), Positives = 27/41 (65%), Gaps = 3/41 (7%)

Query: 3 LKLTSVQRGFTLLEMLIAIAIFAMIGLASNAVLSTVLTNDE 43
++ T QRGFTLLE+++ I I IG+ ++ V+ ++ N E
Sbjct: 1 MRATDKQRGFTLLEIMVVIVI---IGVLASLVVPNLMGNKE 38


56  Sbal195_0250  Sbal195_0272  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_0250  -1      10       2.129879   short-chain dehydrogenase/reductase SDR
Sbal195_0251  -1      10       1.297428   aldehyde dehydrogenase
Sbal195_0252  0       15       0.726200   Fis family GAF modulated sigma54 specific
Sbal195_0253  -1      14       -0.269071  catalase
Sbal195_0254  -1      17       -0.466803  ankyrin
Sbal195_0255  -2      13       -0.212695  quinone oxidoreductase
Sbal195_0256  -2      12       -0.843168  flavocytochrome c
Sbal195_0257  1       12       -0.520502  tetraheme cytochrome c
Sbal195_0258  1       11       -0.191261  LysR family transcriptional regulator
Sbal195_0259  1       13       -0.219201  TetR family transcriptional regulator
Sbal195_0260  0       13       -0.289544  integral membrane sensor signal transduction
Sbal195_0261  0       13       0.756878   two component transcriptional regulator
Sbal195_0262  -1      14       1.099516   hypothetical protein
Sbal195_0263  -1      15       0.683340   cation diffusion facilitator family transporter
Sbal195_0264  0       17       0.484820   hypothetical protein
Sbal195_0265  -2      17       0.611806   OmpA domain-containing protein
Sbal195_0266  -2      18       1.613974   nitrogen metabolism transcriptional regulator
Sbal195_0267  -2      18       0.440560   signal transduction histidine kinase, nitrogen
Sbal195_0268  -1      21       -0.448406  hypothetical protein
Sbal195_0269  -1      10       0.827157   glutathione S-transferase domain-containing
Sbal195_0270  -2      10       0.737489   ThiJ/PfpI domain-containing protein
Sbal195_0271  -2      9        -0.301311  iron-containing alcohol dehydrogenase
Sbal195_0272  -2      9        -0.841585  TetR family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0250  DHBDHDRGNASE  99     5e-27    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 99.0 bits (246), Expect = 5e-27
Identities = 71/257 (27%), Positives = 112/257 (43%), Gaps = 16/257 (6%)

Query: 6 IALITGASRGLGKNTALKLAAQGIDIILTYQTNAAAAAEVVAEIEWLGRKAVALPLDVSD 65
IA ITGA++G+G+ A LA+QG I N +VV+ ++ R A A P DV D
Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHI-AAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 66 SGSFAEFATQVSTVLAHTWQRESFNYLINNAGIGIHVPMAETSIEQFDTLMNIHVKGPFF 125
S + E ++ + + L+N AG+ + S E+++ +++ G F
Sbjct: 69 SAAIDEITARIER------EMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122

Query: 126 LTQTLLPLLMD--GGSIVNISTGLTRFAIPGFGAYATMKGAVETMTKYWAKELGPRGIRV 183
++++ +MD GSIV + + AYA+ K A TK EL IR
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 184 NVLAPGAIETDFGGGAVRDNEQMNQFLAQQTA-------LGRVGLPDDIGGAISALLSPA 236
N+++PG+ ETD D Q + L ++ P DI A+ L+S
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 237 AAWINAQRIEASGGMFL 253
A I + GG L
Sbjct: 243 AGHITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0252  HTHFIS        320    e-104    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 320 bits (822), Expect = e-104
Identities = 117/359 (32%), Positives = 191/359 (53%), Gaps = 29/359 (8%)

Query: 296 RDPQLERAWQHANKVITKQIPLLVLGETGVGKEQFVKKLHAQSARRTEHLVAVNCAALPA 355
R ++ ++ +++ + L++ GE+G GKE + LH RR VA+N AA+P
Sbjct: 142 RSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPR 201

Query: 356 ELVESELFGYQAGAFTGANRTGFIGKIRQAHGGFLFLDEIGEMPLAAQSRLLRVLQEREV 415
+L+ESELFG++ GAFTGA G+ QA GG LFLDEIG+MP+ AQ+RLLRVLQ+ E
Sbjct: 202 DLIESELFGHEKGAFTGAQTRS-TGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEY 260

Query: 416 VPVGSNQSFKVDIQIIAATHMDLEQQVAQGLFRQDLFYRLNGLQVRLPALRERQ-DIERI 474
VG + D++I+AAT+ DL+Q + QGLFR+DL+YRLN + +RLP LR+R DI +
Sbjct: 261 TTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDL 320

Query: 475 IH---KLHRKHRIAPQAICPELLGQLMLHDWPGNLRELDNLMQVACLMAEGDDTLTWQHL 531
+ + K + + E L + H WPGN+REL+NL++ + D +T + +
Sbjct: 321 VRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQ-DVITREII 379

Query: 532 PDYLAQKLACDLLKVDPLNAQLLNTLQLNEEQPLGEEVKIGQNSASHPLAGKVVSGKVTS 591
+ L ++ ++ + + + V+ + A
Sbjct: 380 ENELRSEIPDSPIEKAAARS---------GSLSISQAVE---ENMRQYFA---------- 417

Query: 592 GNTATLPTATAVQSDSLHEAIYSNVLQAYKASDGNVSQCAKRLGISRNALYRRLKQMGL 650
+ + + L E Y +L A A+ GN + A LG++RN L ++++++G+
Sbjct: 418 -SFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0255  NUCEPIMERASE  29     0.023    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 29.0 bits (65), Expect = 0.023
Identities = 13/28 (46%), Positives = 18/28 (64%), Gaps = 2/28 (7%)

Query: 151 VLVTGASGGVGS-VAVTLLANAGYRVVA 177
LVTGA+G +G V+ LL G++VV
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEA-GHQVVG 29


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0256  HTHFIS        31     0.015    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.6 bits (69), Expect = 0.015
Identities = 10/31 (32%), Positives = 18/31 (58%), Gaps = 1/31 (3%)

Query: 39 KWDKEIEILIVGSGFSGLAAAIEATRKGAKD 69
K ++ +L++ S + AI+A+ KGA D
Sbjct: 71 KARPDLPVLVM-SAQNTFMTAIKASEKGAYD 100


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0259  HTHTETR       39     5e-06    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 39.2 bits (91), Expect = 5e-06
Identities = 24/135 (17%), Positives = 56/135 (41%), Gaps = 9/135 (6%)

Query: 4 WEQRTDYLVEVAQRCL--RGHQSFDLYRSHLVAASQISKGTIYNHFTTEADLVVAVACAQ 61
++ ++++VA R +G S L + A+ +++G IY HF ++DL +
Sbjct: 9 AQETRQHILDVALRLFSQQGVSSTSL--GEIAKAAGVTRGAIYWHFKDKSDLFSEIWELS 66

Query: 62 YQDWL-ISAKQDRQQYSDP---FECYLFHHCQRLHDVLAHKRFVIERVMPNQELLQQASE 117
+ + + + DP L H + +R ++E + E + + +
Sbjct: 67 ESNIGELELEYQAKFPGDPLSVLREILIHVLESTV-TEERRRLLMEIIFHKCEFVGEMAV 125

Query: 118 VYRHRFSDLLDQYKK 132
V + + + L+ Y +
Sbjct: 126 VQQAQRNLCLESYDR 140


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0260  PF06580       38     4e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 38.3 bits (89), Expect = 4e-05
Identities = 21/118 (17%), Positives = 47/118 (39%), Gaps = 12/118 (10%)

Query: 282 EAEQLEKLISELLELSRVKLSTNETKVHLGLAESLSQVLDDAEFEAEQQGKSIT--IDID 339
+ + ++++ L EL R L + + LA+ L+ V + + Q + I+
Sbjct: 189 DPTKAREMLTSLSELMRYSLRYSNARQVS-LADELTVVDSYLQLASIQFEDRLQFENQIN 247

Query: 340 EEIELAHFPKSLSRAIENLLRNAIRYAASD------IQLQASATADQVKITIKDDGPG 391
I P L ++ L+ N I++ + I L+ + V + +++ G
Sbjct: 248 PAIMDVQVPPML---VQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSL 302


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0261  HTHFIS        96     2e-25    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 96.4 bits (240), Expect = 2e-25
Identities = 44/163 (26%), Positives = 76/163 (46%), Gaps = 3/163 (1%)

Query: 2 SRILLIDDDLGLSELLGQLLELEGFQLTLAYDGKQGLDLALSSDYDLILLDVMLPKLNGF 61
+ IL+ DDD + +L Q L G+ + + + + D DL++ DV++P N F
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 EVLRALRQH-KQTPVLMLTARGDEIDRVVGLEIGADDYLPKPFNDRELIARIRAIIRRSN 120
++L +++ PVL+++A+ + + E GA DYLPKPF+ ELI I +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 121 LTTQEIHAAPAQEFGDLRLDPSRQEAYCNEQLIILTGTEFTLL 163
++ + + QE Y L L T+ TL+
Sbjct: 124 RRPSKLEDDSQDGMPLVGRSAAMQEIY--RVLARLMQTDLTLM 164


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0265  OMPADOMAIN    68     7e-16    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 67.6 bits (165), Expect = 7e-16
Identities = 47/199 (23%), Positives = 70/199 (35%), Gaps = 22/199 (11%)

Query: 2 MNKLSIVAISILSAFAATQVSAATDTTGFYVGGAL-------NRVTADAFDGSETGTGVG 54
M K +I AI++ A AT AA +Y G L + E G G
Sbjct: 1 MKKTAI-AIAVALAGFATVAQAAPKDNTWYTGAKLGWSQYHDTGFINNNGPTHENQLGAG 59

Query: 55 VYGGYNFNEWFGLEANLFATGDL----GDKDVDISAGALSFTPKFTAQINDIFSAYAKVG 110
+GGY N + G E G + ++ A + T K I D Y ++G
Sbjct: 60 AFGGYQVNPYVGFEMGYDWLGRMPYKGSVENGAYKAQGVQLTAKLGYPITDDLDIYTRLG 119

Query: 111 IASMAVNVDGYGYDEDF-TGFGWTYGVGVNAAVTEHLNIRVSYDVTT--GDLDADRSYLG 167
+ Y ++ TG + GV A+T + R+ Y T GD
Sbjct: 120 GMVWRADTKSNVYGKNHDTGVSPVFAGGVEYAITPEIATRLEYQWTNNIGDAHTI----- 174

Query: 168 LKDIDTDIKQFAVGVHYQF 186
D ++GV Y+F
Sbjct: 175 --GTRPDNGMLSLGVSYRF 191


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0266  HTHFIS        560    0.0      FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 560 bits (1445), Expect = 0.0
Identities = 197/473 (41%), Positives = 294/473 (62%), Gaps = 11/473 (2%)

Query: 7 VWILDDDSSIRWVLEKALQGAKLSTASFAAAESLWQALEISQPHVIVSDIRMPGTDGLSL 66
+ + DDD++IR VL +AL A + A +LW+ + ++V+D+ MP + L
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 67 LERLQVHYPHIPVIIMTAHSDLDSAVSAYQAGAFEYLPKPFDIDEAISLVERALTHATEQ 126
L R++ P +PV++M+A + +A+ A + GA++YLPKPFD+ E I ++ RAL +
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRR 125

Query: 127 SPAPAQEAQVKTPEIIGEAPAMQEVFRAIGRLSRSSISVLINGQSGTGKELVAGALHKHS 186
P+ ++ ++G + AMQE++R + RL ++ ++++I G+SGTGKELVA ALH +
Sbjct: 126 -PSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYG 184

Query: 187 PRKDKPFIALNMAAIPKDLIESELFGHEKGAFTGAANVRQGRFEQANGGTLFLDEIGDMP 246
R++ PF+A+NMAAIP+DLIESELFGHEKGAFTGA GRFEQA GGTLFLDEIGDMP
Sbjct: 185 KRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMP 244

Query: 247 LDVQTRLLRVLADGQFYRVGGHNAVQVDVRIIAATHQDLELLVQKGGFREDLFHRLNVIR 306
+D QTRLLRVL G++ VGG ++ DVRI+AAT++DL+ + +G FREDL++RLNV+
Sbjct: 245 MDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVP 304

Query: 307 VHLPPLSQRREDIPQLATHFLASAAKEIGVETKIMTKETAVKLSQLPWPGNVRQLENTCR 366
+ LPPL R EDIP L HF+ A KE G++ K +E + PWPGNVR+LEN R
Sbjct: 305 LRLPPLRDRAEDIPDLVRHFVQQAEKE-GLDVKRFDQEALELMKAHPWPGNVRELENLVR 363

Query: 367 WLTVMASGQEILPQDLPPELLKDPVSVTHTAKGSQDWQSALTEWIDQKLSE--------- 417
LT + I + + EL + ++ ++++ +++ + +
Sbjct: 364 RLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDAL 423

Query: 418 GNSDLLTEVQPAFERILLETALRHTQGHKQEAAKRLGWGRNTLTRKLKELSMD 470
S L V E L+ AL T+G++ +AA LG RNTL +K++EL +
Sbjct: 424 PPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0267  PF06580       39     3e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 38.7 bits (90), Expect = 3e-05
Identities = 36/188 (19%), Positives = 70/188 (37%), Gaps = 33/188 (17%)

Query: 166 TLIIEQADRLRNLVDRL-------LGPQRPTQHSLHNIHQVVQKVYKLVEMALPANIQLK 218
LI+E + R ++ L L Q SL + VV +L + +Q +
Sbjct: 184 ALILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFE 243

Query: 219 RDYDPSIPDIEMDPDQMQQAVLNILQNAVQALEHTGGEILLRTRTQHQVTIGSQRHKLVL 278
+P+I D+++ P +Q V N +++ + L GG+ILL+ + +
Sbjct: 244 NQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQ-GGKILLKGTKDNGT----------V 292

Query: 279 TLSIIDNGPGIPPELMDTLFYPMVTGREQGSGLGLSIAHNIARLHSG---RIDCLSSAGH 335
TL + + G ++ +G GL ++ G +I G
Sbjct: 293 TLEVENTGSLALKN------------TKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGK 340

Query: 336 TEFIISLP 343
++ +P
Sbjct: 341 VNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0272  HTHTETR       60     1e-13    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 60.4 bits (146), Expect = 1e-13
Identities = 39/209 (18%), Positives = 64/209 (30%), Gaps = 13/209 (6%)

Query: 1 MKIETQSTRQHILDIGYKLIVRKGFSSVGLSLLLQAAEVPKGSFYHYFKSKEQFGEALIT 60
K E Q TRQHILD+ +L ++G SS L + +AA V +G+ Y +FK K +
Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64

Query: 61 DYFEKYQLDLDALFNDSTLTGYQRLMQYWQQWLHVQADGCVDQKCLVVKLSAEVADLSEA 120
L + L + + + A
Sbjct: 65 LSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMA 124

Query: 121 MRVALLKGSAG-IIDRLTTCVQVGINDSSIAEQ-DPQSTAEM-------LYHMWLGAS-- 169
+ + DR+ ++ I + + A + L WL A
Sbjct: 125 VVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQS 184

Query: 170 --LMNKLGHSPAALERALVTTKAILTPKT 196
L + A L + + P T
Sbjct: 185 FDLKKEARDYVAILLEMYLLCPTLRNPAT 213


57  Sbal195_0351  Sbal195_0358  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_0351  0       19       3.219168   ATP-dependent DNA helicase RecG
Sbal195_0352  0       20       2.186829   two component LuxR family transcriptional
Sbal195_0353  -1      19       2.320985   integral membrane sensor signal transduction
Sbal195_0354  -1      20       2.781187   hypothetical protein
Sbal195_0355  -1      20       2.633258   CaCA family Na(+)/Ca(+) antiporter
Sbal195_0356  0       16       3.745364   AMP-dependent synthetase and ligase
Sbal195_0357  -1      14       3.742112   putative endoribonuclease L-PSP
Sbal195_0358  -1      14       3.483288   bifunctional (p)ppGpp synthetase II/
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0351  SECA          41     1e-05    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 41.4 bits (97), Expect = 1e-05
Identities = 30/84 (35%), Positives = 39/84 (46%), Gaps = 8/84 (9%)

Query: 294 MRLVQGDV-----GSGKTLVAAMAA-LQAIENGYQVAMMAPTELLAEQHATNFAAWFEPL 347
M L + + G GKTL A + A L A+ G V ++ + LA++ A N FE L
Sbjct: 92 MVLNERCIAEMRTGEGKTLTATLPAYLNALT-GKGVHVVTVNDYLAQRDAENNRPLFEFL 150

Query: 348 GLKVGW-LAGKLKGKARAQSLADI 370
GL VG L G R ADI
Sbjct: 151 GLTVGINLPGMPAPAKREAYAADI 174


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0352  HTHFIS        76     2e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 75.6 bits (186), Expect = 2e-18
Identities = 29/117 (24%), Positives = 51/117 (43%), Gaps = 1/117 (0%)

Query: 2 KILLAEDQAMVRGALAALLTLAGGFNITQASDGDEALGLLKQQSFDLLLTDIEMPGRTGL 61
IL+A+D A +R L L+ AG +++ S+ + DL++TD+ MP
Sbjct: 5 TILVADDDAAIRTVLNQALSRAG-YDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 ELAAWLKDQHSQTKVVVITTFGRAGYIKRAIEAGVGGFLLKDAPSETLVNAIQQVMA 118
+L +K V+V++ +A E G +L K L+ I + +A
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0353  PF06580       37     1e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 36.8 bits (85), Expect = 1e-04
Identities = 70/379 (18%), Positives = 124/379 (32%), Gaps = 57/379 (15%)

Query: 1 MTSTHLQLERKLAWVYLINLVFYL---IPLTINAYPAWKIALSFAVLIPFIASYF-WAYK 56
M STH Q + + I Y ++ F + I + AY+
Sbjct: 1 MASTHRQANKYYWYCQGIGWGVYTLTGFGFASLYGSPKLHSMIFNIAISLMGLVLTHAYR 60

Query: 57 CTQNSAYRPILMMVAIATAITPINPGSISLFTFAAFFIGF-FYPLRTCLLAIAALIGLLF 115
L M I + P A IG ++ T + + A I
Sbjct: 61 SFIKRQGWLKLNMGQIILRVLP-----------ACVVIGMVWFVANTSIWRLLAFINTKP 109

Query: 116 ALNEIYDFNSYYFPLYGSGLVLGVGMFG------VAERRRHQHKLKEQQSTQEISTLAAM 169
+ S F + + + FG + Q K+ ++ L A
Sbjct: 110 VAFTLPLALSIIFNVVVVTFMWSLLYFGWHFFKNYKQAEIDQWKMASMAQEAQLMALKAQ 169

Query: 170 VERERIARDLHDIMGHSLSSIALKAELAEKLLAKQEYQLATIQLNELGQIARESLSQIR- 228
+ + L++I + I A ++L L ++ R SL
Sbjct: 170 INPHFMFNALNNIR----ALILEDPTKAREMLTS------------LSELMRYSLRYSNA 213

Query: 229 HTVSDYKHKGLADSVTQLCKLLREKGVSVELTGNIPKLPARMESQLGLIVTELVNNILRH 288
VS + DS QL + E + E N + ++ ++V LV N ++H
Sbjct: 214 RQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPP---MLVQTLVENGIKH 270

Query: 289 SGASQC------IIDFIQQADRLVVEVKDNGP----SKPIAEGNGLTGIRERLDSLGG-- 336
G +Q ++ + + +EV++ G + + G GL +RERL L G
Sbjct: 271 -GIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQMLYGTE 329

Query: 337 -SLSYNLEQG-YAFTVSLP 353
+ + +QG V +P
Sbjct: 330 AQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0358  PF07328       32     0.003    T-DNA border endonuclease VirD1
		>PF07328#T-DNA border endonuclease VirD1

Length = 144

Score = 31.9 bits (72), Expect = 0.003
Identities = 17/68 (25%), Positives = 32/68 (47%), Gaps = 4/68 (5%)

Query: 506 PEQIEKVI----RDTKHTTLDSLLADIGLGNAMSIVIAQRLIGDNLENQESRDGHMMPIR 561
P +++KVI + + D+ +A++GL ++ IA R IG +EN + +
Sbjct: 16 PARVDKVISVKMTEAELAEFDAQIAELGLNRNRALRIAARRIGGFVENDAKTVELLRDMS 75

Query: 562 GAEGMLVT 569
A + T
Sbjct: 76 RAIAGVAT 83


58  Sbal195_0416  Sbal195_0423  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_0416  3       19       1.224712   cell division protein FtsA
Sbal195_0417  1       20       0.679427   cell division protein FtsZ
Sbal195_0418  0       15       -0.321374  UDP-3-O-[3-hydroxymyristoyl] N-acetylglucosamine
Sbal195_0419  0       15       -0.075720  hypothetical protein
Sbal195_0420  0       17       -0.092390  peptidase M23B
Sbal195_0421  0       18       -0.234475  preprotein translocase subunit SecA
Sbal195_0422  0       16       -0.123222  ***delta-aminolevulinic acid dehydratase
Sbal195_0423  1       14       -0.615150  diguanylate cyclase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0416  SHAPEPROTEIN  68     8e-15    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein

signature.
Length = 347

Score = 68.2 bits (167), Expect = 8e-15
Identities = 51/221 (23%), Positives = 91/221 (41%), Gaps = 20/221 (9%)

Query: 150 SGMRMEAKVHIVTC----ANDMAKNITK-SVERCGLKVDDLVFSGIASADAVLTFDEKDL 204
S M ++ C A + + + S + G + L+ +A+A +
Sbjct: 100 SNSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEAT 159

Query: 205 GVCIVDIGGGTTDIAVYTNGALRHCAVVPVAGNQVTNDIAKIFR------TPSSHAEQIK 258
G +VDIGGGTT++AV + + + + V + G++ I R + AE+IK
Sbjct: 160 GSMVVDIGGGTTEVAVISLNGVVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIK 219

Query: 259 VQFACARSSMVSREDSIEVPS---VGGRPSR-SMSRHTLAEVVEPRYQELFELVLKELKD 314
+ A IEV G P +++ + + E ++ + V+ L+
Sbjct: 220 HEIGSAYPG--DEVREIEVRGRNLAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQ 277

Query: 315 SGLE---DQIAAGIVLTGGTASIQGVVDIAEATFGMPVRVA 352
E D G+VLTGG A ++ + + G+PV VA
Sbjct: 278 CPPELASDISERGMVLTGGGALLRNLDRLLMEETGIPVVVA 318


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0417  TONBPROTEIN   29     0.022    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 29.2 bits (65), Expect = 0.022
Identities = 21/96 (21%), Positives = 35/96 (36%), Gaps = 5/96 (5%)

Query: 292 TVVVGAVIDPEMSDELRVTVVATGIGAEKRPDIQLVSKPAPRPEPVVVEPKVEAYVEEAV 351
T V + P + + VT+V E +Q +P PEP EP+ +
Sbjct: 30 TSVHQVIELPAPAQPISVTMVTP-ADLEPPQAVQPPPEPVVEPEP---EPEPIPEPPKEA 85

Query: 352 HVNYAAPKGNVLPAAPQPAPQPAPSTKHELDYLDIP 387
V PK P P+P + K ++ ++
Sbjct: 86 PVVIEKPKPKPKP-KPKPVKKVQEQPKRDVKPVESR 120


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0421  SECA          1316   0.0      SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 1316 bits (3407), Expect = 0.0
Identities = 651/907 (71%), Positives = 758/907 (83%), Gaps = 7/907 (0%)

Query: 1 MFGKLLTKVFGSRNDRTLKGLQKIVISINALEADYEKLTDEALKAKTAEFRERLAAGASL 60
M KLLTKVFGSRNDRTL+ ++K+V INA+E + EKL+DE LK KTAEFR RL G L
Sbjct: 1 MLIKLLTKVFGSRNDRTLRRMRKVVNIINAMEPEMEKLSDEELKGKTAEFRARLEKGEVL 60

Query: 61 DSIMAEAFATVREASKRVFDMRHFDVQLLGGMVLDSNRIAEMRTGEGKTLTATLPAYLNA 120
++++ EAFA VREASKRVF MRHFDVQLLGGMVL+ IAEMRTGEGKTLTATLPAYLNA
Sbjct: 61 ENLIPEAFAVVREASKRVFGMRHFDVQLLGGMVLNERCIAEMRTGEGKTLTATLPAYLNA 120

Query: 121 LTGKGVHVITVNDYLARRDAENNRPLFEFLGLTVGINVAGLGQHEKKAAYNADITYGTNN 180
LTGKGVHV+TVNDYLA+RDAENNRPLFEFLGLTVGIN+ G+ K+ AY ADITYGTNN
Sbjct: 121 LTGKGVHVVTVNDYLAQRDAENNRPLFEFLGLTVGINLPGMPAPAKREAYAADITYGTNN 180

Query: 181 EFGFDYLRDNMAFSPQERVQRPLHYALIDEVDSILIDEARTPLIISGAAEDSSELYIKIN 240
E+GFDYLRDNMAFSP+ERVQR LHYAL+DEVDSILIDEARTPLIISG AEDSSE+Y ++N
Sbjct: 181 EYGFDYLRDNMAFSPEERVQRKLHYALVDEVDSILIDEARTPLIISGPAEDSSEMYKRVN 240

Query: 241 TLIPNLIRQDKEDTEEYVGEGDYSIDEKAKQVHFTERGQEKVENLLIERGMLAEGDSLYS 300
+IP+LIRQ+KED+E + GEG +S+DEK++QV+ TERG +E LL++ G++ EG+SLYS
Sbjct: 241 KIIPHLIRQEKEDSETFQGEGHFSVDEKSRQVNLTERGLVLIEELLVKEGIMDEGESLYS 300

Query: 301 AANISLLHHVNAALRAHTLFERDVDYIVQDNEVIIVDEHTGRTMPGRRWSEGLHQAVEAK 360
ANI L+HHV AALRAH LF RDVDYIV+D EVIIVDEHTGRTM GRRWS+GLHQAVEAK
Sbjct: 301 PANIMLMHHVTAALRAHALFTRDVDYIVKDGEVIIVDEHTGRTMQGRRWSDGLHQAVEAK 360

Query: 361 EGVHIQNENQTLASITFQNYFRQYEKLAGMTGTADTEAFEFQHIYGLDTVVVPTNRPMVR 420
EGV IQNENQTLASITFQNYFR YEKLAGMTGTADTEAFEF IY LDTVVVPTNRPM+R
Sbjct: 361 EGVQIQNENQTLASITFQNYFRLYEKLAGMTGTADTEAFEFSSIYKLDTVVVPTNRPMIR 420

Query: 421 KDMADLVYLTADEKYQAIIKDIKDCRERGQPVLVGTVSIEQSELLARLMVQEKIPHEVLN 480
KD+ DLVY+T EK QAII+DIK+ +GQPVLVGT+SIE+SEL++ + + I H VLN
Sbjct: 421 KDLPDLVYMTEAEKIQAIIEDIKERTAKGQPVLVGTISIEKSELVSNELTKAGIKHNVLN 480

Query: 481 AKFHEREAEIVAQAGRTGSVTIATNMAGRGTDIVLGGNWNMEIDELDNPTAEQKAKIKAD 540
AKFH EA IVAQAG +VTIATNMAGRGTDIVLGG+W E+ L+NPTAEQ KIKAD
Sbjct: 481 AKFHANEAAIVAQAGYPAAVTIATNMAGRGTDIVLGGSWQAEVAALENPTAEQIEKIKAD 540

Query: 541 WQIRHDEVVAAGGLHILGTERHESRRIDNQLRGRAGRQGDAGSSRFYLSMEDSLMRIFAS 600
WQ+RHD V+ AGGLHI+GTERHESRRIDNQLRGR+GRQGDAGSSRFYLSMED+LMRIFAS
Sbjct: 541 WQVRHDAVLEAGGLHIIGTERHESRRIDNQLRGRSGRQGDAGSSRFYLSMEDALMRIFAS 600

Query: 601 DRVSGMMKKLGMEEGEAIEHPWVSRAIENAQRKVEARNFDIRKQLLEFDDVANDQRQVVY 660
DRVSGMM+KLGM+ GEAIEHPWV++AI NAQRKVE+RNFDIRKQLLE+DDVANDQR+ +Y
Sbjct: 601 DRVSGMMRKLGMKPGEAIEHPWVTKAIANAQRKVESRNFDIRKQLLEYDDVANDQRRAIY 660

Query: 661 AQRNELMDAESIEDTIQNIQDDVIGAVIDQYIPPQSVEELWDIPGLEQRLHQEFMLKLPI 720
+QRNEL+D + +TI +I++DV A ID YIPPQS+EE+WDIPGL++RL +F L LPI
Sbjct: 661 SQRNELLDVSDVSETINSIREDVFKATIDAYIPPQSLEEMWDIPGLQERLKNDFDLDLPI 720

Query: 721 QEWLDKEDDLHEESLRERIITAWGDAYKAKEEMVGAQVLRQFEKAVMLQTLDGLWKEHLA 780
EWLDKE +LHEE+LRERI+ + Y+ KEE+VGA+++R FEK VMLQTLD LWKEHLA
Sbjct: 721 AEWLDKEPELHEETLRERILAQSIEVYQRKEEVVGAEMMRHFEKGVMLQTLDSLWKEHLA 780

Query: 781 AMDHLRQGIHLRGYAQKNPKQEYKRESFELFQQLLNTLKHDVISVLSKVQVQAQSDVEEM 840
AMD+LRQGIHLRGYAQK+PKQEYKRESF +F +L +LK++VIS LSKVQV+ +VEE+
Sbjct: 781 AMDYLRQGIHLRGYAQKDPKQEYKRESFSMFAAMLESLKYEVISTLSKVQVRMPEEVEEL 840

Query: 841 EARRREEDAKIQRDYQHAAAESLVGGGDEHEAVTAQAPMIRDGEKVGRNDPCPCGSGRKY 900
E +RR E A + D+ A A KVGRNDPCPCGSG+KY
Sbjct: 841 EQQRRMEAE-------RLAQMQQLSHQDDDSAAAAALAAQTGERKVGRNDPCPCGSGKKY 893

Query: 901 KQCHGKL 907
KQCHG+L
Sbjct: 894 KQCHGRL 900


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0423  PF02370       31     0.009    M protein repeat
		>PF02370#M protein repeat

Length = 168

Score = 30.8 bits (69), Expect = 0.009
Identities = 22/83 (26%), Positives = 33/83 (39%), Gaps = 9/83 (10%)

Query: 380 SLVLAIRYN--DERKAKLRIQQEALKQAQKIRSAREE-----ALKAEAESNEKLEQMVQE 432
+ L YN E +KL+ Q E + S RE AL E + K E Q+
Sbjct: 12 NGKLITEYNKLVEENSKLQKQLE--EYLDSSDSKRENDPQYRALMGENQDLRKREGQYQD 69

Query: 433 RTLELEITLRELHEVNQKLTEQS 455
+ ELE +E E ++ +
Sbjct: 70 KIEELEKERKEKQERPERREKFE 92



Score = 30.8 bits (69), Expect = 0.010
Identities = 13/69 (18%), Positives = 32/69 (46%)

Query: 389 DERKAKLRIQQEALKQAQKIRSAREEALKAEAESNEKLEQMVQERTLELEITLRELHEVN 448
D RK + + Q + + ++ + +E + E + ++ QE+ + + ++L
Sbjct: 59 DLRKREGQYQDKIEELEKERKEKQERPERREKFERQHQDKHYQEQQKKHQQEQQQLEAEK 118

Query: 449 QKLTEQSTI 457
QKL ++ I
Sbjct: 119 QKLAKEKQI 127


59  Sbal195_0449  Sbal195_0462  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_0449  -1      16       2.265069   hypothetical protein
Sbal195_0450  -2      17       3.265677   hypothetical protein
Sbal195_0451  1       13       2.590921   hypothetical protein
Sbal195_0452  3       14       2.370857   dTDP-4-dehydrorhamnose reductase
Sbal195_0453  2       14       1.489617   integral membrane sensor signal transduction
Sbal195_0454  1       16       -0.145954  two component Fis family transcriptional
Sbal195_0455  1       18       -0.372771  response regulator receiver protein
Sbal195_0456  1       19       -0.231837  peptidase M4 thermolysin
Sbal195_0457  -1      18       -0.013801  C factor cell-cell signaling protein
Sbal195_0458  1       20       0.202914   deoxyribodipyrimidine photolyase-like protein
Sbal195_0459  0       23       0.175911   hypothetical protein
Sbal195_0460  0       24       1.624318   hypothetical protein
Sbal195_0461  0       25       1.334379   hypothetical protein
Sbal195_0462  -1      23       1.226080   ATP-dependent protease ATP-binding subunit HslU
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0449  RTXTOXIND     51     4e-09    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 50.6 bits (121), Expect = 4e-09
Identities = 44/256 (17%), Positives = 89/256 (34%), Gaps = 43/256 (16%)

Query: 36 WYLLLLLVIAPVAIVGWILLRP-HLFVLASGIVTT--EPLEVRAPSTGDVSTIRVKPGDT 92
++++ LVIA +L + A+G +T E++ V I VK G++
Sbjct: 62 YFIMGFLVIA----FILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGES 117

Query: 93 LNVGTPILSISDPQLNAQITELERQLAQL------------NIEDLSLNSVILKQLQTRI 140
+ G +L ++ A + + L Q +IE L + L
Sbjct: 118 VRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQ 177

Query: 141 DVANEGVTRQDILLKS-YENFQR-------KGVVPTSDMATVLQAHTASKMALEQAKVDL 192
+V+ E V R L+K + +Q ++ TVL + K L
Sbjct: 178 NVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRL 237

Query: 193 ----------------MQAKQKQLVELSAGVVTQSRRSIELQLARLKAQQSQLQIKALTA 236
+ ++ + VE + + +++ L A++ + L
Sbjct: 238 DDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFK 297

Query: 237 TRVVDILVQSGEHIVE 252
++D L Q+ ++I
Sbjct: 298 NEILDKLRQTTDNIGL 313



Score = 34.0 bits (78), Expect = 7e-04
Identities = 34/208 (16%), Positives = 70/208 (33%), Gaps = 37/208 (17%)

Query: 107 LNAQITELERQLAQLNIEDLSLNSVILKQLQTRIDVANEGVTRQDILLKSYENFQRKGVV 166
+ Q + + Q Q + + L L N + L + + K +
Sbjct: 191 IKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKS-RLDDFSSLLHKQAI 249

Query: 167 PTSDMATVLQAHTASKMALEQAKVDLMQAKQK-QLVELSAGVVTQSRRS----------- 214
+ + + L K L Q + + + +VTQ ++
Sbjct: 250 AKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTD 309

Query: 215 ----IELQLARLKAQQSQLQIKALTATRVVDI-------LVQSGEHIV----EDRPLALL 259
+ L+LA+ + +Q I+A + +V + +V + E ++ ED L
Sbjct: 310 NIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTL--- 366

Query: 260 AGRDNPVVLAFLEPKYLNYTTIGQQATI 287
V A ++ K + + +GQ A I
Sbjct: 367 ------EVTALVQNKDIGFINVGQNAII 388


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0452  NUCEPIMERASE  71     4e-16    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 71.0 bits (174), Expect = 4e-16
Identities = 44/182 (24%), Positives = 70/182 (38%), Gaps = 24/182 (13%)

Query: 3 KIMVTGATGLLGRAVVKQLELTGHEVV-----------------ATGFSRASERVHKLDL 45
K +VTGA G +G V K+L GH+VV ++ + HK+DL
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 46 TAPLAVEAFIAREQPQVIVHCAAERRPDVSEQNPQAALALNLTASQALAMAAKANN-AWL 104
+ A + + S +NP A NLT + + N L
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121

Query: 105 IYISTDYVFDGTQ--PKYAEDAATHPVNFYGESKLKGEEIVLNTSDDFAV----LRLPIL 158
+Y S+ V+ + P +D+ HPV+ Y +K E + S + + LR +
Sbjct: 122 LYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFFTV 181

Query: 159 YG 160
YG
Sbjct: 182 YG 183


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0454  HTHFIS        92     3e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 92.2 bits (229), Expect = 3e-24
Identities = 26/129 (20%), Positives = 64/129 (49%)

Query: 3 RLLIVEDDLSLASILGRRLTRHGFECRLTHDASDALLVAREFRPSHILLDMKLAEANGLG 62
+L+ +DD ++ ++L + L+R G++ R+T +A+ ++ D+ + + N
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 63 LIVPLRNLLPKVTMVLLTGYASIATAVEAIRLGADNYLAKPVDTQTLLAALEMEGHSHTL 122
L+ ++ P + +++++ + TA++A GA +YL KP D L+ +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 123 QEDEVDDSP 131
+ +++D
Sbjct: 125 RPSKLEDDS 133



Score = 47.1 bits (112), Expect = 1e-08
Identities = 13/39 (33%), Positives = 22/39 (56%)

Query: 135 KRLEWEHIQQVLNANQGNVSATARQLGMHRRTLQRKLLK 173
+E+ I L A +GN A LG++R TL++K+ +
Sbjct: 434 AEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRE 472


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0455  HTHFIS        29     1e-04    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.0 bits (65), Expect = 1e-04
Identities = 7/21 (33%), Positives = 13/21 (61%)

Query: 1 MSATARQLGMHRRTLQRKLLK 21
A LG++R TL++K+ +
Sbjct: 452 QIKAADLLGLNRNTLRKKIRE 472


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0456  THERMOLYSIN   361    e-118    Thermolysin metalloprotease (M4) family signature.
		>THERMOLYSIN#Thermolysin metalloprotease (M4) family signature.

Length = 544

Score = 361 bits (927), Expect = e-118
Identities = 133/490 (27%), Positives = 191/490 (38%), Gaps = 51/490 (10%)

Query: 44 SQFNL--DAGSQLKVEKKLDLGQGKQKQRLQQYFHDVPVYGFSVATSQSSMGFYSDMSGR 101
+ F L A +L + G R +Q G + + S +SG
Sbjct: 64 NTFQLGGQARERLSLIGNKLDELGHTVMRFEQAIAASLCMGAVLVAHVNDGELSS-LSGT 122

Query: 102 VLKNIEKSADFVKPTLTANKALDIAIRGKSEK-AVAGLKAENKQAKLWLYLDDAAKTRLV 160
++ N++K + ++ +A IA + +++ AE + + D RL
Sbjct: 123 LIPNLDKRTLKTEAAISIQQAEMIAKQDVADRVTKERPAAEEGKPTRLVIYPDEETPRLA 182

Query: 161 YVTSFVVYGDEPSRPFTMIDAHSGEVLKRWEGINHA-ASGTGPGGNIKTGQYEYGTDFSY 219
Y + P MIDA G+VL +W ++ A G P T G
Sbjct: 183 YEVNVRFLTPVPGNWIYMIDAADGKVLNKWNQMDEAKPGGAQPVAGTSTVGVGRG----- 237

Query: 220 LDVEVSGDT---CTMNSPNVKTVNLNGATSGATAFSYTCPRNTV-----------KEING 265
V GD T S L T G+ F+Y TV +
Sbjct: 238 ----VLGDQKYINTTYSSYYGYYYLQDNTRGSGIFTYDGRNRTVLPGSLWADGDNQFFAS 293

Query: 266 AYSPLNDAHYFGNVIYNMYSEWYN---TAPLTFQLTMRVHYSSNYENAFWDGSAMTFGDG 322
+ DAHY+ V+Y+ Y + + VHY Y NAFW+GS M +GDG
Sbjct: 294 YDAAAVDAHYYAGVVYDYYKNVHGRLSYDGSNAAIRSTVHYGRGYNNAFWNGSQMVYGDG 353

Query: 323 -ATTFYPLV-SLDVSAHEVSHGFTEQNSGLIYDAQSGGMNEAFSDMAGEAAEFYMHGTND 380
TF P +DV HE++H T+ +GL+Y +SG +NEA SD+ G EFY + D
Sbjct: 354 DGQTFLPFSGGIDVVGHELTHAVTDYTAGLVYQNESGAINEAMSDIFGTLVEFYANRNPD 413

Query: 381 WLVGADIFK---GNGALRYMADPTLDGISIGHIDDYYDGID---VHHSSGVFNKAFYTLA 434
W +G DI+ ALR M+DP G + Y D VH +SG+ NKA Y L+
Sbjct: 414 WEIGEDIYTPGVAGDALRSMSDPAKYGDPDHYSKRYTGTQDNGGVHTNSGIINKAAYLLS 473

Query: 435 N--------LPGWDTRTAFQTFVVANQLYWTADSLFWQGACGVKSAATDLG----LSADD 482
+ G + F A Y T S F Q AA DL +
Sbjct: 474 QGGVHYGVSVTGIGRDKMGKIFYRALVYYLTPTSNFSQLRAACVQAAADLYGSTSQEVNS 533

Query: 483 VVTAFAAVGI 492
V AF AVG+
Sbjct: 534 VKQAFNAVGV 543


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0457  DHBDHDRGNASE  59     1e-12    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 59.3 bits (143), Expect = 1e-12
Identities = 46/241 (19%), Positives = 87/241 (36%), Gaps = 47/241 (19%)

Query: 3 VLIVGGSGGIGQAMVKQVQEAYPEATVHATYRHHLPQDRQNNIQWHA----------LDV 52
I G + GIG+A V H + P+ + + DV
Sbjct: 11 AFITGAAQGIGEA----VARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADV 66

Query: 53 TNEAEIKQLSEQLTE----LDWLINCVGILHTQDKGPEKSLQSLDLAFFQHNLTLNTLPS 108
+ A I +++ ++ +D L+N G+L G + SL ++ ++N+
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRP---GL---IHSLSDEEWEATFSVNSTGV 120

Query: 109 VMLAKHFCHALKQSDSARFAVISAKVGSITDNRLGGWYSYRASKAALNMFLKTLSIEWQR 168
++ + S + + + + +Y +SKAA MF K L +E
Sbjct: 121 FNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMA---AYASSKAAAVMFTKCLGLELAE 177

Query: 169 NMKHCVVLSLHPGTTDTPLSQP------------------FQQSVPKGKLFTPEYVANCL 210
C ++S PG+T+T + F+ +P KL P +A+ +
Sbjct: 178 YNIRCNIVS--PGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAV 235

Query: 211 L 211
L
Sbjct: 236 L 236


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Sbal195_0462  HTHFIS  31     0.013    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.6 bits (69), Expect = 0.013
Identities = 11/36 (30%), Positives = 18/36 (50%), Gaps = 3/36 (8%)

Query: 49 TPKNILMIGPTGVGKTEIAR---RLAKLANAPFIKV 81
T +++ G +G GK +AR K N PF+ +
Sbjct: 159 TDLTLMITGESGTGKELVARALHDYGKRRNGPFVAI 194


60  Sbal195_0502  Sbal195_0519  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_0502   0      15        1.256446  hypothetical protein
Sbal195_0503  -1      15        1.450269  MSHA biogenesis protein MshJ
Sbal195_0504  -2      16        1.777850  MSHA biogenesis protein MshK
Sbal195_0505  -2      15        1.770957  pilus (MSHA type) biogenesis protein MshL
Sbal195_0506  -3      16        1.972881  MSHA biogenesis protein MshM
Sbal195_0507  -1      17        1.566591  hypothetical protein
Sbal195_0508  -1      17        0.246739  type II secretion system protein E
Sbal195_0509   1      18       -0.944537  type II secretion system protein
Sbal195_0510   3      24       -1.879176  hypothetical protein
Sbal195_0511   3      25       -1.491321  MSHA pilin protein MshB
Sbal195_0512   2      25       -1.747845  methylation site containing protein
Sbal195_0513   2      21       -1.222417  methylation site containing protein
Sbal195_0514   2      19       -0.372206  MSHA pilin protein MshD
Sbal195_0515   2      19       -0.301716  MSHA biogenesis protein MshO
Sbal195_0516   2      19       -0.107214  MSHA biogenesis protein MshP
Sbal195_0517   0      16       -0.047244  hypothetical protein
Sbal195_0518  -3      13        0.897961  rod shape-determining protein MreB
Sbal195_0519  -3      14        1.008916  rod shape-determining protein MreC
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Sbal195_0502  PF06580  29     0.015    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 28.7 bits (64), Expect = 0.015
Identities = 11/53 (20%), Positives = 20/53 (37%), Gaps = 2/53 (3%)

Query: 25 LGAYAAGFLVLFAALGGYSYWQVSELQQAQQLAAQQ--KLQFDTQKQALEAQI 75
L +V F Y W + + ++ + + + Q AL+AQI
Sbjct: 118 LSIIFNVVVVTFMWSLLYFGWHFFKNYKQAEIDQWKMASMAQEAQLMALKAQI 170


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0505  BCTERIALGSPD  179    5e-51    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 179 bits (455), Expect = 5e-51
Identities = 72/293 (24%), Positives = 129/293 (44%), Gaps = 26/293 (8%)

Query: 257 PQAGLVTIRAFPSELRQVRTFLNSAESHLQRQVILEAKIIEVTLSDGYQQGIQWENVLGH 316
Q + + A P + + + + + QV++EA I EV +DG GIQW N
Sbjct: 316 GQTNALIVTAAPDVMNDLERVIAQLDIR-RPQVLVEAIIAEVQDADGLNLGIQWANKNAG 374

Query: 317 VGN-------TNVNFGTSKGPGLSDKITSAIGGVTS------LSIKGSDFTTMINLLDTQ 363
+ + + ++S++ S ++ ++ L +
Sbjct: 375 MTQFTNSGLPISTAIAGANQYNKDGTVSSSLASALSSFNGIAAGFYQGNWAMLLTALSSS 434

Query: 364 GDVDVLSSPRVTASNNQKAVIKVGTDEYFVTDVSSTTVAGATPVTTPQVELTPFFSGIAL 423
D+L++P + +N +A VG + +T S T +G T + + GI L
Sbjct: 435 TKNDILATPSIVTLDNMEATFNVGQEVPVLT--GSQTTSGDNIFNTVERKTV----GIKL 488

Query: 424 DVTPQIDSDGNVLLHVHPSVIDVKEQTKDIKVSDASLELPLAQSEIRESDTVIRAASGDV 483
V PQI+ +VLL + V V + S S +L + R + + SG+
Sbjct: 489 KVKPQINEGDSVLLEIEQEVSSVADAA-----SSTSSDLGATFN-TRTVNNAVLVGSGET 542

Query: 484 VIIGGLMKSENTEVVSQVPLLGDIPFLGELFKNRSKQKKKTELIILLKPTVVG 536
V++GGL+ ++ +VPLLGDIP +G LF++ SK+ K L++ ++PTV+
Sbjct: 543 VVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIR 595


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
Sbal195_0507  IGASERPTASE  41     1e-05    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 40.8 bits (95), Expect = 1e-05
Identities = 33/164 (20%), Positives = 63/164 (38%), Gaps = 23/164 (14%)

Query: 63 AEPAEATASSQAQEQNTLTAQ----TESVRIDSVASEEASPNVDAAAKPLKLATTQKIAA 118
+E E A + QE T+ TE+ + ++EA NV A + ++A +
Sbjct: 1034 SETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSG---- 1089

Query: 119 AMTANSEEFEPSATEVASSQAPELGTAKEAEHEQQAQSQSQPQQKPQAD------VSLEL 172
+ ++E + + T+ ++ E AK + Q + Q P+ + E
Sbjct: 1090 ---SETKETQTTETKETATVEKE-EKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEP 1145

Query: 173 ANDQVDTVS-----EPSHSQPSHSQPSSATSSVRSAEVTVAAPS 211
A + TV+ +++ QP+ TSS VT +
Sbjct: 1146 ARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTV 1189


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0509  BCTERIALGSPF  302    e-102    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 302 bits (776), Expect = e-102
Identities = 116/407 (28%), Positives = 207/407 (50%), Gaps = 6/407 (1%)

Query: 1 MPIYQYRGRSGQGQSVTGQLDAASESAAADMLLARGIIPLEVKVAKVVK----SFSLAQL 56
M Y Y+ QG+ G +A S A +L RG++PL V + + S L+
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 57 FGGKVALEELQIFTRQMYSLTRSGIPILRAIAGLSETAHSQRMKDALNDISEQLTAGRPL 116
+++ +L + TRQ+ +L + +P+ A+ +++ + + + + ++ G L
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 117 SSSMNQHPDVFDSLFVSMVHVGENTGKLEDAFIQLSGYIEREQETRRRIKSAMRYPMFVL 176
+ +M P F+ L+ +MV GE +G L+ +L+ Y E+ Q+ R RI+ AM YP VL
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPC-VL 179

Query: 177 ISIALAMV-ILNIMVIPKFAEMFSRFGADLPWATKVLIGTSNLFVNYWALMLVALIGTII 235
+A+A+V IL +V+PK E F LP +T+VL+G S+ + ML+AL+ +
Sbjct: 180 TVVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFM 239

Query: 236 GIRYWHHTEKGEKQWDKWKLHIPAVGSIIERSTLARYCRSFSMMLSAGVPMTQALSLVAD 295
R EK + + LH+P +G I ARY R+ S++ ++ VP+ QA+ + D
Sbjct: 240 AFRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGD 299

Query: 296 AVDNAYMHDKIVGMRRGIESGDSMLRVSNQSKLFTPLVLQMVAVGEETGQIDQLLNDAAD 355
+ N Y ++ + G S+ + Q+ LF P++ M+A GE +G++D +L AAD
Sbjct: 300 VMSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAAD 359

Query: 356 FYEGEVDYDLKNLTAKLEPILIGFVAVIVLVLALGIYLPMWDMLNVV 402
+ E + EP+L+ +A +VL + L I P+ + ++
Sbjct: 360 NQDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0511  BCTERIALGSPG  44     5e-08    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 43.7 bits (103), Expect = 5e-08
Identities = 15/36 (41%), Positives = 27/36 (75%)

Query: 4 KQTGFSLIELVIVIVILGLLAATAIPRFLNVTDDAE 39
KQ GF+L+E+++VIVI+G+LA+ +P + + A+
Sbjct: 6 KQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKAD 41


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0512  BCTERIALGSPG  50     2e-10    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 50.3 bits (120), Expect = 2e-10
Identities = 18/53 (33%), Positives = 31/53 (58%), Gaps = 4/53 (7%)

Query: 1 MKRQQGFTLIELVVVIIILGILAVTAAPKFINLQGDARA----STIQGMKGAI 49
+Q+GFTL+E++VVI+I+G+LA P + + A S I ++ A+
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENAL 56


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0513  BCTERIALGSPH  42     2e-07    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 41.9 bits (98), Expect = 2e-07
Identities = 27/82 (32%), Positives = 43/82 (52%), Gaps = 1/82 (1%)

Query: 8 KQAGFTLVELVTTIILISILAVVVLPRLFTQSSYSAYSLRNEFISELRQVQQKALNNTDR 67
+Q GFTL+E++ ++L+ + A +VL SA F ++LR VQQ+ L +
Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQT-GQ 60

Query: 68 CFRVTVSGTGYQVSQFSARNGA 89
F V+V +Q AR+GA
Sbjct: 61 FFGVSVHPDRWQFLVLEARDGA 82


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0514  BCTERIALGSPH  37     2e-05    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 36.8 bits (85), Expect = 2e-05
Identities = 16/58 (27%), Positives = 31/58 (53%), Gaps = 6/58 (10%)

Query: 23 QQGFTLIELVIGMLVIAIAIVMLTSMLFPQA--DRAASTLHRVRSA-ELA--HSVMNE 75
Q+GFTL+E+++ +L++ ++ M+ + FP + D AA TL R + +
Sbjct: 3 QRGFTLLEMMLILLLMGVSAGMVL-LAFPASRDDSAAQTLARFEAQLRFVQQRGLQTG 59


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0515  BCTERIALGSPG  30     0.004    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 30.2 bits (68), Expect = 0.004
Identities = 12/27 (44%), Positives = 21/27 (77%), Gaps = 2/27 (7%)

Query: 5 LSAVNKKSTLGFTLVEMVTVILILGIL 31
+ A +K+ GFTL+E++ VI+I+G+L
Sbjct: 1 MRATDKQR--GFTLLEIMVVIVIIGVL 25


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0518  SHAPEPROTEIN  558    0.0      Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 558 bits (1440), Expect = 0.0
Identities = 317/348 (91%), Positives = 334/348 (95%), Gaps = 1/348 (0%)

Query: 1 MFKKLRGIFSNDLSIDLGTANTLIYVRGEGIVLNEPSVVAIRGERGGSGQKSVAAVGTEA 60
M KK RG+FSNDLSIDLGTANTLIYV+G+GIVLNEPSVVAIR +R GS KSVAAVG +A
Sbjct: 1 MLKKFRGMFSNDLSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGS-PKSVAAVGHDA 59

Query: 61 KQMLGRTPGNIQAIRPMKDGVIADFYVTEKMLQHFIKQVHNNSFFRPSPRVLVCVPVGAT 120
KQMLGRTPGNI AIRPMKDGVIADF+VTEKMLQHFIKQVH+NSF RPSPRVLVCVPVGAT
Sbjct: 60 KQMLGRTPGNIAAIRPMKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGAT 119

Query: 121 QVERRAIRESAMGAGAREVYLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAIISLN 180
QVERRAIRESA GAGAREV+LIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVA+ISLN
Sbjct: 120 QVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLN 179

Query: 181 GVVYSSSVRIGGDKFDDAIINYVRRNYGSLIGEATAERIKHTIGTAYPGDEVLEIEVRGR 240
GVVYSSSVRIGGD+FD+AIINYVRRNYGSLIGEATAERIKH IG+AYPGDEV EIEVRGR
Sbjct: 180 GVVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGR 239

Query: 241 NLAEGVPRSFTLNSNEILEALQEPLSGIVSAVMVALEQSPPELASDISERGMVLTGGGAL 300
NLAEGVPR FTLNSNEILEALQEPL+GIVSAVMVALEQ PPELASDISERGMVLTGGGAL
Sbjct: 240 NLAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGAL 299

Query: 301 LRDLDRLLMQETGIPVMVADDPLTCVARGGGKALEMIDMHGGDLFSEE 348
LR+LDRLLM+ETGIPV+VA+DPLTCVARGGGKALEMIDMHGGDLFSEE
Sbjct: 300 LRNLDRLLMEETGIPVVVAEDPLTCVARGGGKALEMIDMHGGDLFSEE 347


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
Sbal195_0519  IGASERPTASE  33     0.002    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 32.7 bits (74), Expect = 0.002
Identities = 22/97 (22%), Positives = 38/97 (39%), Gaps = 6/97 (6%)

Query: 237 EVLTEDGQSYARVTAQPLAALDRIRYVLLIWPSPDSGVTLPNQPTVPAADHSLIENSSKI 296
+V TE Q +VT+Q ++ V P + N PTV + + ++
Sbjct: 1113 KVETEKTQEVPKVTSQVSPKQEQSETVQ-----PQAEPARENDPTVNIKEPQ-SQTNTTA 1166

Query: 297 GSASPAEGTSADTTKPVTTPAATVAKPATETTPPATE 333
+ PA+ TS++ +PVT + P T
Sbjct: 1167 DTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTT 1203


61  Sbal195_0617  Sbal195_0624  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_0617   3      28       -8.426951  P pilus assembly protein porin PapC-like
Sbal195_0618   3      26       -6.026446  hypothetical protein
Sbal195_0619   4      23       -5.109279  hypothetical protein
Sbal195_0620   2      20       -3.124633  hypothetical protein
Sbal195_0621   3      21       -2.676317  transposase IS4 family protein
Sbal195_0622   5      23       -2.866846  Ig domain-containing protein
Sbal195_0623   4      33       -7.051160  OmpA domain-containing protein
Sbal195_0624   2      30       -6.177217  transposase, IS4 family protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Sbal195_0617  PF00577  36     7e-04    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 36.0 bits (83), Expect = 7e-04
Identities = 31/224 (13%), Positives = 68/224 (30%), Gaps = 23/224 (10%)

Query: 411 QIGARYIYEDLFSADYFLG-YFSTGDIYQSANIKFGRLSLSAKAFDLDYNTRNFTLSNQL 469
+R ++ + D + D Y A K G+L L+ +T + S+Q
Sbjct: 492 TTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQT 551

Query: 470 YGTNPY--KNFSISYSKPFFGGNGYLNYNNYSSKNYYGINNDNSIVIEKPIDYIYNINAL 527
Y + F + F N L+Y+ +KN + D + + I
Sbjct: 552 YWGTSNVDEQFQAGLNTAFEDINWTLSYS--LTKNAWQKGRDQMLALNVNI--------- 600

Query: 528 NNVLNNALNNGSYSTISNENYNIGWSTNIYSGTLTLNSNYNSNSAYDEVKFGIYWSQRFG 587
++ S ++ + S ++ +N + +S + G
Sbjct: 601 ------PFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTG 654

Query: 588 KSVAGGLSVMTNNKGSSQYNNS---LSLNASNDNWYANHTVMAS 628
+ G + + + Y ++ S+ + S
Sbjct: 655 YAGGGDGNSGSTGYATLNYRGGYGNANIGYSHSDDIKQLYYGVS 698


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Sbal195_0622  INTIMIN  43     6e-06    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 43.1 bits (101), Expect = 6e-06
Identities = 17/81 (20%), Positives = 32/81 (39%)

Query: 743 NALANNTVTNQVSVTVRDANNAPVAGQEVIFNASNNATVVTQTVLTDGNGVAIASIRSPQ 802
+A A+ T + TV+ A S A + + T+G+G A +++S +
Sbjct: 569 SAKADGTEAITYTATVKKNGVAQANVPVSFNIVSGTAVLSANSANTNGSGKATVTLKSDK 628

Query: 803 SGISTVTASFNGTTKTVDVTF 823
G V+A T ++
Sbjct: 629 PGQVVVSAKTAEMTSALNANA 649



Score = 42.4 bits (99), Expect = 1e-05
Identities = 40/148 (27%), Positives = 59/148 (39%), Gaps = 10/148 (6%)

Query: 730 VDNDKSTIAVIFNNALANNTVTNQVSVTVRDANN-APVAGQEVIFNASNNATVVTQTVLT 788
+ I A+AN ++ TV+ PV+ QEV F + + T T
Sbjct: 656 TKASITEIKADKTTAVANGQDA--ITYTVKVMKGDKPVSNQEVTFTTTLGKLSNS-TEKT 712

Query: 789 DGNGVAIASIRSPQSGISTVTASFNGTT---KTVDVTFNCQ-SLAGACIDIFDTG-SGKL 843
D NG A ++ S G S V+A + K +V F ++ I+I TG GKL
Sbjct: 713 DTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNIEIVGTGVKGKL 772

Query: 844 FTNSPSVAYLNSIGGSAADGTYTETGTN 871
T +N + S +G YT N
Sbjct: 773 PTVWLQYGQVN-LKASGGNGKYTWRSAN 799


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
Sbal195_0623  OMPADOMAIN  60     5e-13    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 59.6 bits (144), Expect = 5e-13
Identities = 40/203 (19%), Positives = 67/203 (33%), Gaps = 25/203 (12%)

Query: 1 MRKFNLAVVIPLSIMSCSAVASYSDSSLELGVSAGQFNLKDS-----TGSYSGPSVGFNF 55
M+K +A+ + L+ + A A+ D++ G G D+ G +G
Sbjct: 1 MKKTAIAIAVALAGFATVAQAAPKDNTWYTGAKLGWSQYHDTGFINNNGPTHENQLGAGA 60

Query: 56 I--RNFNDWFSFEGNYL------SSFNMDNANYDIQASTFSLAPVFTYHINDTFSIYGKG 107
N + FE Y +++N Y Q + Y I D IY +
Sbjct: 61 FGGYQVNPYVGFEMGYDWLGRMPYKGSVENGAYKAQGVQLTAKL--GYPITDDLDIYTRL 118

Query: 108 GASSMRITSSERNGLDFSYNTIGWFYGFGLNTSINNRINVRLGYETVTGDTGIEILGVTA 167
G R + + + G+ +I I RL Y+ +G
Sbjct: 119 GGMVWRADTKSNVYGKNHDTGVSPVFAGGVEYAITPEIATRLEYQWTNNIGDAHTIGTRP 178

Query: 168 DGFSIQSSHTKISVISLGATYRF 190
D ++SLG +YRF
Sbjct: 179 D----------NGMLSLGVSYRF 191


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0624  FLGPRINGFLGI  25     0.034    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 25.3 bits (55), Expect = 0.034
Identities = 9/42 (21%), Positives = 19/42 (45%)

Query: 23 TLIITPPDYTSISKLAKIINVQNNNSSRGPIMHVVDTTSLKV 64
L + PD+++ ++A ++N PI D+ + V
Sbjct: 194 VLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAV 235


62  Sbal195_0654  Sbal195_0668  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias   Product
Sbal195_0654  -2      18       3.317043  RND family efflux transporter MFP subunit
Sbal195_0655  -2      17       3.528908  acriflavin resistance protein
Sbal195_0656  -2      15       3.857298  Sel1 domain-containing protein
Sbal195_0657  -3      17       3.596957  collagenase
Sbal195_0658  -3      17       3.140015  PIG3 family NAD(P)H quinone oxidoreductase
Sbal195_0659  -2      16       1.932978  hypothetical protein
Sbal195_0660  -3      17       2.226018  aldose 1-epimerase
Sbal195_0661  -1      17       1.720016  galactokinase
Sbal195_0662  -1      18       1.976993  sodium/hydrogen exchanger
Sbal195_0663  -1      18       2.200098  thiol:disulfide interchange protein
Sbal195_0664  -1      15       1.645771  CutA1 divalent ion tolerance protein
Sbal195_0665  -2      16       1.949397  FxsA cytoplasmic membrane protein
Sbal195_0666  -2      16       1.743191  hypothetical protein
Sbal195_0667  -1      16       1.687632  acriflavin resistance protein
Sbal195_0668  -1      18       0.823222  RND family efflux transporter MFP subunit
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Sbal195_0654  RTXTOXIND  50     1e-08    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 49.8 bits (119), Expect = 1e-08
Identities = 38/192 (19%), Positives = 70/192 (36%), Gaps = 29/192 (15%)

Query: 123 AEQDNTKAKADLDKAKSTLALAKTKLERIEDLL---IKEPFALAKQDVDELRENVNLADA 179
A + K+ L++ +S + AK + + + L I + ++ L LA
Sbjct: 264 AVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLE--LAKN 321

Query: 180 DFRQKQATMNDYLIKAPFDG---QLTSFSQSIGSQIGAGTALVTLYSLN-PVEVRYAISQ 235
+ RQ+ + I+AP QL ++ G + L+ + + +EV +
Sbjct: 322 EERQQASV-----IRAPVSVKVQQLKVHTE--GGVVTTAETLMVIVPEDDTLEVTALVQN 374

Query: 236 NDFGKGQKGQKVNVTVEAYGNKVFKGL---VNYVAP--AVDESSG-------RVEVHAAL 283
D G GQ + VEA+ + L V + D+ G +E +
Sbjct: 375 KDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEENCLS 434

Query: 284 -DNPEFKLAPGM 294
N L+ GM
Sbjct: 435 TGNKNIPLSSGM 446



Score = 47.1 bits (112), Expect = 8e-08
Identities = 23/108 (21%), Positives = 44/108 (40%), Gaps = 7/108 (6%)

Query: 105 ISAIHFSNGDKVTKGQVIAEQDNTKAKADLDKAKSTLALAKTKLERIEDLLIKEPFALAK 164
+ I G+ V KG V+ + A+AD K +S+L A+ + R + L ++
Sbjct: 107 VKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILS----RSIEL 162

Query: 165 QDVDELRENVNLADADFRQKQATMNDYLIKAPFDGQLTSFSQSIGSQI 212
+ EL+ + +++ LIK F T +Q ++
Sbjct: 163 NKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFS---TWQNQKYQKEL 207


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0655  ACRIFLAVINRP  651    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 651 bits (1680), Expect = 0.0
Identities = 301/1032 (29%), Positives = 509/1032 (49%), Gaps = 44/1032 (4%)

Query: 8 IRHPIFASVLSIMAVLLGLIAFQKLDIQYFPEHTTHSASVNASIAGASADFMSSNVADKL 67
IR PIFA VL+I+ ++ G +A +L + +P + SV+A+ GA A + V +
Sbjct: 6 IRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQVI 65

Query: 68 IAAASGIDKVDTM-STDCSEGRCSLTIKFNDDTS-DIEYTNLMNKLRSSVEGINDFPQSM 125
+GID + M ST S G ++T+ F T DI + NKL+ + PQ
Sbjct: 66 EQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLAT---PLLPQE- 121

Query: 126 IDKPTVTDDTSATDSASNIITFVNAGGMEKQAMYDYISQQLVPQLKQVQGVGAVWGPYGG 185
+ + ++ + S++ + G + + DY++ + L ++ GVG V G
Sbjct: 122 VQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDV--QLFG 179

Query: 186 SQKAVRVWLNPEQMKALNIKAADVVGTLGSYNASFTSG------AIKGKSRDFSINPLNQ 239
+Q A+R+WL+ + + + DV+ L N +G A+ G+ + SI +
Sbjct: 180 AQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 240 VETLEDVKDLVIKVS-EGKIIRVADVADVVMGEESLSPSILSIGGHSAMSLQILPLSNAN 298
+ E+ + ++V+ +G ++R+ DVA V +G E+ + I I G A L I + AN
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYN-VIARINGKPAAGLGIKLATGAN 298

Query: 299 PVTVASNIKAEIARMQQHLPQGLEMTLAYNQADFIEASIDEGFSALIEAVILVSLIVVLF 358
+ A IKA++A +Q PQG+++ Y+ F++ SI E L EA++LV L++ LF
Sbjct: 299 ALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLF 358

Query: 359 LGSLRAASIPIITIPVCVIGVFAVMSALGFSINVLTILAIILAIGLVVDDAIVVVENCYR 418
L ++RA IP I +PV ++G FA+++A G+SIN LT+ ++LAIGL+VDDAIVVVEN R
Sbjct: 359 LQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVER 418

Query: 419 HI-ENGETPFNAAIKGCQEIIFPIIAMTLTLAAVYLPIGLMSGLTADLFRQFSFTLAAAV 477
+ E+ P A K +I ++ + + L+AV++P+ G T ++RQFS T+ +A+
Sbjct: 419 VMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAM 478

Query: 478 MISGVVALTLSPMMSAYLINTTEQQPK-----WFSRVEHALQQLNDLYIKELDKWFTRKR 532
+S +VAL L+P + A L+ + +F + Y + K
Sbjct: 479 ALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTG 538

Query: 533 LMLGAAVVLIGLAGIAYWQLPKILLPAEDSGFIDVASNGPTGVGRQYHLNHNAELNGVMD 592
L +++ + + +LP LP ED G P G ++ ++
Sbjct: 539 RYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYL 598

Query: 593 EHPAVGANLSY------IEGEPVN----HVLLKPWGERS---EGIDDVISDLMSKSKESV 639
++ + G+ N V LKPW ER+ + VI + +
Sbjct: 599 KNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIR 658

Query: 640 SAYNMSFSIRSANNLSIANNLRLELTTLDRNK---DELNDTAAKVQKLLEDYPG-LNNVG 695
+ + F++ + L A EL +D+ D L ++ + +P L +V
Sbjct: 659 DGFVIPFNMPAIVELGTATGFDFEL--IDQAGLGHDALTQARNQLLGMAAQHPASLVSVR 716

Query: 696 NSVLRDQLRYDLSIDRNAIILSGVSYGDVTNALSTFLGSVKAADLHATDGFTYPIQVQVN 755
+ L D ++ L +D+ GVS D+ +ST LG D G + VQ +
Sbjct: 717 PNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDF-IDRGRVKKLYVQAD 775

Query: 756 LDKLSDFKVLNKLYVTSESGQALPLSQFVSIKQTTAESNIKTFMGLDSAELTADVMPGYS 815
+ ++KLYV S +G+ +P S F + ++ + GL S E+ + PG S
Sbjct: 776 AKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTS 835

Query: 816 TDEIKAYLDEQLPTLLNDAQGFKYNGVVKDLMDSQAGTQSLFLLALVFIYLILAAQFESF 875
+ + A + E L + L G+ + G+ S +L ++ V ++L LAA +ES+
Sbjct: 836 SGDAMALM-ENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESW 894

Query: 876 VDPLIILLTVPLCIVGALLTLTLFGQSVNIYSQIGLLTLVGLVTKHGILLVEFANK-QQD 934
P+ ++L VPL IVG LL TLF Q ++Y +GLLT +GL K+ IL+VEFA +
Sbjct: 895 SIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEK 954

Query: 935 QGLSAIEAARSSAKSRLRPILMTSLTMILSAIPLALASGPGSLGLANIGLVLVGGLLAGT 994
+G +EA + + RLRPILMTSL IL +PLA+++G GS +G+ ++GG+++ T
Sbjct: 955 EGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSAT 1014

Query: 995 FFSLFVVPVAYV 1006
++F VPV +V
Sbjct: 1015 LLAIFFVPVFFV 1026



Score = 93.4 bits (232), Expect = 2e-21
Identities = 63/362 (17%), Positives = 123/362 (33%), Gaps = 22/362 (6%)

Query: 662 LELTTLDRNKDELNDTAAK-VQKLLEDYPGLNNVGNSVLRDQLRYDLSIDRNAIILSGVS 720
+D+++D A V+ L G+ +V + +R + +D + + ++
Sbjct: 142 FVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQYAMR--IWLDADLLNKYKLT 199

Query: 721 YGDVTNALS-----TFLGSVKAADLHATDGFTYPIQVQVNLDKLSDFKVLNKLYVTSESG 775
DV N L G + I Q +F + G
Sbjct: 200 PVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKNPEEFG--KVTLRVNSDG 257

Query: 776 QALPLSQFVSIKQTTAESNIK-TFMGLDSAELTADVMPGYST----DEIKAYLDEQLPTL 830
+ L ++ N+ G +A L + G + IKA L E P
Sbjct: 258 SVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALDTAKAIKAKLAELQPFF 317

Query: 831 LNDAQGFKYNGVVKDLMDSQAGTQSLF---LLALVFIYLILAAQFESFVDPLIILLTVPL 887
QG K Q + A++ ++L++ ++ LI + VP+
Sbjct: 318 ---PQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPV 374

Query: 888 CIVGALLTLTLFGQSVNIYSQIGLLTLVGLVTKHGILLVEFANK-QQDQGLSAIEAARSS 946
++G L FG S+N + G++ +GL+ I++VE + + L EA S
Sbjct: 375 VLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKS 434

Query: 947 AKSRLRPILMTSLTMILSAIPLALASGPGSLGLANIGLVLVGGLLAGTFFSLFVVPVAYV 1006
++ ++ + IP+A G + +V + +L + P
Sbjct: 435 MSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCA 494

Query: 1007 AM 1008
+
Sbjct: 495 TL 496


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
Sbal195_0657  MICOLLPTASE  300    4e-88    Microbial collagenase metalloprotease (M9) signature.
		>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature.

Length = 1104

Score = 300 bits (770), Expect = 4e-88
Identities = 104/553 (18%), Positives = 216/553 (39%), Gaps = 39/553 (7%)

Query: 143 SDFVGKSGQA-LVDQLSQSTPECVGKLYSLKGSAATALFSEANVISVANAIATKAKDYTG 201
D + + + LV+ + + E V L++ + T + V ++ + + YT
Sbjct: 95 FDELNRMNYSDLVELIKTISYENVPDLFNFNDGSYTFFSNRDRVQAIIYGLEDSGRTYTA 154

Query: 202 VDVQHLESHIYFVRAALYVQFYSPNDVPAYSSAAKASLKSALNALFANAAIWTVSDDNAG 261
D + + + + F+RA Y+ FY+ + K A+ A+ N+ + G
Sbjct: 155 DDDKGIPTLVEFLRAGYYLGFYNKQLSYLNTPQLKNECLPAMKAIQYNSNFRLGTKAQDG 214

Query: 262 VLKEALILIDSAELGADFNHATIKVLMDYDANWQASFAMNAAANSVFTTLFRAQWNDDMQ 321
V++ LI +A + + I VL D+ N + + N+VF + + +
Sbjct: 215 VVEALGRLIGNASADPEVINNCIYVLSDFKDNIDKYGSNYSKGNAVFNLMKGIDYYTNSV 274

Query: 322 -----ALFARDQGILDALNNFQLE------HRDLLGTNAEYLLVNSVKELSRLYYIDSMR 370
A++ + ++ + D L + +L+ N++ R+
Sbjct: 275 IYNTKGYDAKNTEFYNRIDPYMERLESLCTIGDKLNNDNAWLVNNALYYTGRMGKFREDP 334

Query: 371 PRVTQLVKNILSSTSKTEPSKVLWYAAAEMADYYDRSHCNDYNICGFKAQLEADTLPFNW 430
+ ++ + + ++ S ND + KA LP +
Sbjct: 335 SISQRALERAMKEYPYLSYQYIEAANDLDLNFGGKNSSGNDIDFNKIKADAREKYLPKTY 394

Query: 431 KCSDSLKI-RAQD-LYQDQAKWACDVLTSQESYFHSKLETGMQPVGQDNNDDLELVIFGS 488
D + +A D + +++ K ++ F ++ + +D L +VI+ S
Sbjct: 395 TFDDGKFVVKAGDKVTEEKIKRLYWASKEVKAQFMRVVQNDKALEEGNPDDILTVVIYNS 454

Query: 489 SSEYKSLANSIFGINTDNGGMYLEGSPAGLKNQARFIAYEAEWRTPDFHVWNL-QHEYVH 547
EYK L I G +TDNGG+Y+E N F YE + + L +HE+ H
Sbjct: 455 PEEYK-LNRIINGFSTDNGGIYIE-------NIGTFFTYERTPEESIYTLEELFRHEFTH 506

Query: 548 YLDGRYNLFGDFSRGTS---ANTIWWIEGLAEYIS---------YRDANTAAIAMGETGE 595
YL GRY + G + +G W+ EG AE+ + R + T +A
Sbjct: 507 YLQGRYVVPGMWGQGEFYQEGVLTWYEEGTAEFFAGSTRTDGIKPRKSVTQGLAYDRNNR 566

Query: 596 FMLSTIFKNNYESGQDRIYRWGYLAVRFMFEHHRDDVRQILAYLRNDQYAEYQTFMDGIG 655
L + Y S Y +G+ +M+ ++ ++ Y++N+ + Y+ ++ +
Sbjct: 567 MSLYGVLHAKYGS--WDFYNYGFALSNYMYNNNMGMFNKMTNYIKNNDVSGYKDYIASMS 624

Query: 656 TRY--DNEWQGWL 666
+ Y ++++Q ++
Sbjct: 625 SDYGLNDKYQDYM 637



Score = 74.8 bits (183), Expect = 1e-15
Identities = 36/184 (19%), Positives = 63/184 (34%), Gaps = 26/184 (14%)

Query: 539 WNLQHEYVHYLDGRYNLFGDFSRGTSANTIWWIEGLAEYISYRDANTAAIA-MGETGEFM 597
+ L +Y Y+D N + ++ + A+ I+ + ++ + + +
Sbjct: 627 YGLNDKYQDYMDSLLNNIDNLDVPLVSD-EYVNGHEAKDINEITNDIKEVSNIKDLSSNV 685

Query: 598 LSTIFKNNYESGQDRIYRWGYLAVRFMFEHH-----RDDVRQILAYLRNDQYAEYQTF-- 650
+ F Y+ R Y+ R E + + IL L + Y+T
Sbjct: 686 EKSQFFTTYD------MRGTYVGGRSQGEENDWKDMNSKLNDILKELSKKSWNGYKTVTA 739

Query: 651 ------MDGIGTR-YDNEWQGWLASGLSTADDGIVDKGPSDV-DAEPSGREGNWTGPAGT 702
+DG G YD + G T D V+K P V ++ S GT
Sbjct: 740 YFVNHKVDGNGNYVYDVVFHGMNT---DTNTDVHVNKEPKAVIKSDSSVIVEEEINFDGT 796

Query: 703 ISKD 706
SKD
Sbjct: 797 ESKD 800


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Sbal195_0661  RTXTOXINA  29     0.033    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 29.2 bits (65), Expect = 0.033
Identities = 11/41 (26%), Positives = 21/41 (51%), Gaps = 4/41 (9%)

Query: 124 LAAGLSSSGALVVAFGTAISDTSQLHLSPMAVAQLAQRGEH 164
A GLS+S A +A++ L +SP++ +A + +
Sbjct: 296 AAQGLSTSAAAAGLIASAVT----LAISPLSFLSIADKFKR 332


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0662  TYPE3IMSPROT  30     0.033    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 29.7 bits (67), Expect = 0.033
Identities = 27/159 (16%), Positives = 56/159 (35%), Gaps = 13/159 (8%)

Query: 146 VAIGILIMQDIFAVLFLTISKGDVPSVWAFALLLLPLAKPLIYKAFDRVGHGELLVLFGL 205
+ + + ++ + + +P A + ++ + Y F + + L +
Sbjct: 43 MGLSDYYFEHFSKLMLIPAEQSYLPFSQALSYVVDNVLLEFFYLCFPLLT---VAALMAI 99

Query: 206 VMALVVGAWLFESVGLKPDLGAL--IIGI-LLAGHKKSSELAKSLFYFKELFLVAFFLTI 262
+V +L +KPD+ + I G + K E KS+ L ++ + +
Sbjct: 100 ASHVVQYGFLISGEAIKPDIKKINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIK 159

Query: 263 G----LNGLPTVSDIVLAALLVLLVPLKILLFVYILTRF 297
G L LPT + + LL + L V F
Sbjct: 160 GNLVTLLQLPTCG---IECITPLLGQILRQLMVICTVGF 195


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0667  ACRIFLAVINRP  492    e-159    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 492 bits (1269), Expect = e-159
Identities = 212/1049 (20%), Positives = 443/1049 (42%), Gaps = 58/1049 (5%)

Query: 3 IAEYSIRHKVISWMFVLLLLVGGGVSFTGLGQLEFPEFTIKEALVITAYPGASPEQVEEE 62
+A + IR + +W+ ++L++ G ++ L ++P V YPGA + V++
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 63 VTLPLEDALQQLDAIKHVTSI-NSAGLSQIQIEIKESYDKTSLPQVWDEVRRKVNDTAGS 121
VT +E + +D + +++S +SAG I + + T +V+ K+
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQ---SGTDPDIAQVQVQNKLQLATPL 117

Query: 122 LPPGTTAPQVMDDFGD---VYGILFNLSGPDYSNRELSNYAD-YLRRELVLVPGVKKVSV 177
LP + + + F P + ++S+Y ++ L + GV V +
Sbjct: 118 LPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQL 177

Query: 178 AGSVTEQVVIEISQQKLSALGLDQSYIYGLVNNQNVVSNAGSLVIGDN------RIRIHP 231
G+ + I + L+ L + + QN AG L I
Sbjct: 178 FGA-QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIA 236

Query: 232 TGEFSNVQDLARLIVSPPGSTELIYLGDIANIEKDYDETPNVLYHNKGEAALSLGISFSS 291
F N ++ ++ + ++ L D+A +E + + N G+ A LGI ++
Sbjct: 237 QTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARIN-GKPAAGLGIKLAT 295

Query: 292 GVNVVEVGQKVSNRLAELESQRPIGMNLATVYNQSQAVDETVNGFLINLLESIAIVIAVL 351
G N ++ + + +LAEL+ P GM + Y+ + V +++ + L E+I +V V+
Sbjct: 296 GANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVM 355

Query: 352 LLFMG-LRSGLLMGLILLLTILGTFIVMKVLGIELQLISLGALIIALGMLVDNAIVVTEG 410
LF+ +R+ L+ + + + +LGTF ++ G + +++ +++A+G+LVD+AIVV E
Sbjct: 356 YLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVEN 415

Query: 411 ILIGLRRGKTR-LEAAKQIVAQTQWPLLGATVIAIIAFAPIGLSQNAAGEFCRSLFQVLM 469
+ + K EA ++ ++Q Q L+G ++ F P+ + G R ++
Sbjct: 416 VERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIV 475

Query: 470 ISLFISWITAITLTPFFCHLLFKDAPSDE-EAQDPYKGWF-------FSLYRASLTLALR 521
++ +S + A+ LTP C L K ++ E + + GWF + Y S+ L
Sbjct: 476 SAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILG 535

Query: 522 FRLVSILLVVAMLFSAVVGFGHIKNVFFPASNTPIFFVDIWMPEGTDIKATERFTADIER 581
+L+ ++ VV F + + F P + +F I +P G + T++ +
Sbjct: 536 STGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTD 595

Query: 582 LMLSLNEQKDVGLKHLTTVIG-------QGSQRFVLPYQPEKGYPAFAQLIVEMQDLAAV 634
L NE+ +V + + TV G Q + + +P + + A
Sbjct: 596 YYLK-NEKANV--ESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIH-RAK 651

Query: 635 KAYMPELETLLNQRFPQAQYRLKNMENGPSPAAKIEARFYGDNPEVLRALGDQAEAIFHA 694
+ + P + + ++ + + + +A
Sbjct: 652 MELGKIRDGFV---IPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQH 708

Query: 695 EPSMDGIRHNWRNQVPLIRPQLENAQARETGISKQDLDNALLVNFSGKQIGLYRETSHLL 754
S+ +R N + +++ +A+ G+S D++ + G + + + +
Sbjct: 709 PASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVK 768

Query: 755 PIVARAPAEERLQADSLWKLQIWSSEHNTFVPATQVVSNFNTEWEN--PLVMRRDRMRML 812
+ +A A+ R+ + + KL + S + VP + + + W P + R + + +
Sbjct: 769 KLYVQADAKFRMLPEDVDKLYV-RSANGEMVPFSAFTT---SHWVYGSPRLERYNGLPSM 824

Query: 813 AVMADPKLGSD-ETADSVLRKVKDKVEAISLPAGYHLEWGGEFETAGEAQTAVFSSIPMG 871
+ + G+ A +++ + K LPAG +W G + + + +
Sbjct: 825 EIQGEAAPGTSSGDAMALMENLASK-----LPAGIGYDWTGMSYQERLSGNQAPALVAIS 879

Query: 872 YLAMFLITVFLFNSVRQPLVIWFTVPLALIGVSAGLLLFDAPFSFMALLGLLSLSGMVIK 931
++ +FL L+ S P+ + VPL ++GV LF+ ++GLL+ G+ K
Sbjct: 880 FVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAK 939

Query: 932 NGIVLVDQIN-LELSEGKPAYFALVDSCVSRVRPVMMAAITTMLGMIPLISDAFFGS--- 987
N I++V+ L EGK A + + R+RP++M ++ +LG++PL GS
Sbjct: 940 NAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQ 999

Query: 988 --MAITIIFGLGFASLLTLIVLPVMYSLV 1014
+ I ++ G+ A+LL + +PV + ++
Sbjct: 1000 NAVGIGVMGGMVSATLLAIFFVPVFFVVI 1028



Score = 74.5 bits (183), Expect = 1e-15
Identities = 39/209 (18%), Positives = 95/209 (45%), Gaps = 13/209 (6%)

Query: 822 SDETADSVLRKVKDKVEAI--SLPAGYHLEWGGEFETAGEAQTAVFSSIPMGYLAMFL-- 877
+ A + +K K+ + P G ++ ++T Q ++ + + A+ L
Sbjct: 295 TGANALDTAKAIKAKLAELQPFFPQG--MKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVF 352

Query: 878 ITVFLF-NSVRQPLVIWFTVPLALIGVSAGLLLFDAPFSFMALLGLLSLSGMVIKNGIVL 936
+ ++LF ++R L+ VP+ L+G A L F + + + G++ G+++ + IV+
Sbjct: 353 LVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVV 412

Query: 937 VDQINLELSEGKPAYFALVDSCVSRVR-PVMMAAITTMLGMIPL-----ISDAFFGSMAI 990
V+ + + E K + +S+++ ++ A+ IP+ + A + +I
Sbjct: 413 VENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSI 472

Query: 991 TIIFGLGFASLLTLIVLPVMYSLVFNIKA 1019
TI+ + + L+ LI+ P + + + +
Sbjct: 473 TIVSAMALSVLVALILTPALCATLLKPVS 501


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0668  RTXTOXIND     45     2e-07    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 45.2 bits (107), Expect = 2e-07
Identities = 29/178 (16%), Positives = 62/178 (34%), Gaps = 28/178 (15%)

Query: 104 EAEHELLAADFKRKIELLNRKLISQSEFDSTQAQLKSAKAALAAARDQLSYTRLTAPFSG 163
+ E++L+ FK +I + T + LA ++ + + AP S
Sbjct: 286 KEEYQLVTQLFKNEI---------LDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSV 336

Query: 164 TIAKRLVDNH-QIVQANQGVLTL-QNNNLLDVSIQVPEAMAAGLKQYTDQAHFTAKVRFS 221
+ + V +V + ++ + ++ L+V+ V Q A ++
Sbjct: 337 KVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIG--FINVGQN---AIIKVE 391

Query: 222 AFPEQSF---DAKFKEYSTQVTPGTQ---AYEVVFSLPQPL------DIQLLPGMSAE 267
AFP + K K + + + V+ S+ + +I L GM+
Sbjct: 392 AFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVT 449



Score = 31.3 bits (71), Expect = 0.005
Identities = 15/104 (14%), Positives = 37/104 (35%), Gaps = 2/104 (1%)

Query: 68 SGQLTELTLVEGQRVAQGSLLAQLDDRDAKNNLMTREAEHELLAADFKRK-IELLNRKLI 126
+ + E+ + EG+ V +G +L +L A+ + + ++ + R I + +L
Sbjct: 104 NSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELN 163

Query: 127 SQSEFD-STQAQLKSAKAALAAARDQLSYTRLTAPFSGTIAKRL 169
E + ++ L + + + K L
Sbjct: 164 KLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKEL 207


63  Sbal195_0701  Sbal195_0711  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_0701  0       19       -3.416125  OmpA/MotB domain-containing protein
Sbal195_0702  -1      16       -2.066354  Ig domain-containing protein
Sbal195_0703  -1      16       -0.833581  GAF sensor-containing diguanylate
Sbal195_0704  -3      14       0.660349   hypothetical protein
Sbal195_0705  -2      15       1.524068   hypothetical protein
Sbal195_0706  -2      13       2.738227   hypothetical protein
Sbal195_0707  -2      12       2.656699   hypothetical protein
Sbal195_0708  -2      12       2.234197   transposase IS3/IS911 family protein
Sbal195_0709  -1      12       2.275316   OmpA/MotB domain-containing protein
Sbal195_0710  0       13       1.538547   molybdate ABC transporter substrate-binding
Sbal195_0711  2       17       1.199224   magnesium transporter
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0701  OMPADOMAIN    167    6e-51    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 167 bits (424), Expect = 6e-51
Identities = 85/328 (25%), Positives = 143/328 (43%), Gaps = 33/328 (10%)

Query: 60 YMGSEIGINHYQHGC-ESWSIDCDKNSTMASFFAGYQFNHHLAFEVAYIDLGKAEATYLE 118
Y G+++G + Y + + +N A F GYQ N ++ FE+ Y LG+ Y
Sbjct: 29 YTGAKLGWSQYHDTGFINNNGPTHENQLGAGAFGGYQVNPYVGFEMGYDWLGRM--PYKG 86

Query: 119 ASKQNIYQGSMRGLNLSSVASIDFGADLALFAKAGVFNWHGENKGPFSTIKADD-WSPSF 177
+ + Y +G+ L++ DL ++ + G W + K D SP F
Sbjct: 87 SVENGAY--KAQGVQLTAKLGYPITDDLDIYTRLGGMVWRADTKSNVYGKNHDTGVSPVF 144

Query: 178 GVGMTYQLSDSWQARLQYEYFHHLGNDDIGGTNA--HATSLGISYQFGRSRPEVITSTVI 235
G+ Y ++ RL+Y++ +++G+ GT SLG+SY+FG+ +
Sbjct: 145 AGGVEYAITPEIATRLEYQWTNNIGDAHTIGTRPDNGMLSLGVSYRFGQGEAAPV----- 199

Query: 236 NSVTNTVIKATPIELEEVTFP--VLFNFDSSELF--FVDSLQIIINRLIQF--PQATVIL 289
V A ++ + T VLFNF+ + L +L + ++L +V++
Sbjct: 200 --VAPAPAPAPEVQTKHFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVV 257

Query: 290 RGYSDSQGSSEYNLALSKRRTDSITQYLTEHGVKPQQIIAEHYGEQYPVTDKISEQHKHL 349
GY+D GS YN LS+RR S+ YL G+ +I A GE PVT + K
Sbjct: 258 LGYTDRIGSDAYNQGLSERRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQR 317

Query: 350 ---------NRRVQVLL---PQTIIQPQ 365
+RRV++ + + QPQ
Sbjct: 318 AALIDCLAPDRRVEIEVKGIKDVVTQPQ 345


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0702  FLAGELLIN     31     0.045    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 30.8 bits (69), Expect = 0.045
Identities = 35/307 (11%), Positives = 76/307 (24%), Gaps = 9/307 (2%)

Query: 425 DGSKKDITQLAAWSSSEPSVAAIQFSGALSGVANTLAIGNTDITVSFEGMSKTTTLTVNQ 484
+ Q+ A ++ + G+ G + TV S +
Sbjct: 138 SQDNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEATVGDLKSSFKNVTGYDT 197

Query: 485 AVVESVQITPQNPSVPVGVEGQFTAIAFYSDKTTVDVTHSANWQVDDYSIAAVIPNGDSA 544
V + + V G T DK V+ + D + AV +
Sbjct: 198 YAVGANKYRV---DVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAENNTAVDLFKTTK 254

Query: 545 GYAKALGEGTNQLSVK------FSGQTASTSISVSAAKLDSISLTPSIAEAPAGTTLQYQ 598
A ++K T + D + T
Sbjct: 255 STAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTTINGEKVTLTVAD 314

Query: 599 VFGLFSDGTNHDLTTFAHYQTSDNALASIDSHGLATAHQYNAKPVTVTASYDGLQSTATL 658
+ ++ L + + TS + A + T
Sbjct: 315 ITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESKITVNG 374

Query: 659 KVTAGLLDHIEVTPAAQSIAVGHKGLLQARAFYSDKTSSDITSLATWSVNDGDIASVDNT 718
+VT A +++ + + D ++ ++ + D ++ VD
Sbjct: 375 AEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPLASIDSALSKVDAV 434

Query: 719 EAESGSV 725
+ G++
Sbjct: 435 RSSLGAI 441


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0705  FLGHOOKAP1    28     0.017    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 28.4 bits (63), Expect = 0.017
Identities = 16/60 (26%), Positives = 34/60 (56%), Gaps = 1/60 (1%)

Query: 112 QQYLTNKRLSEIADRLNAIDREISSLDGKINNLTDKADLLKQKNSLLNEKNQLLDERSRL 171
Q N + D++N ++I+SL+ +I+ LT N+LL++++QL+ E +++
Sbjct: 153 QDKQVNIAIGASVDQINNYAKQIASLNDQISRLTGVGA-GASPNNLLDQRDQLVSELNQI 211


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0708  PF04183       24     0.049    IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 24.5 bits (53), Expect = 0.049
Identities = 6/44 (13%), Positives = 15/44 (34%)

Query: 20 GRLLSDVARQYGLSAKAVYQWVRESDLQPQQRECALMSEIAQLQ 63
R +S + + G+ + YQ + ++ + A
Sbjct: 489 LRFISPLMVRLGVPERRFYQLLAAVLSDYMKKHPQMSERFALFS 532


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0709  OMPADOMAIN    97     4e-26    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 97.3 bits (242), Expect = 4e-26
Identities = 38/123 (30%), Positives = 63/123 (51%), Gaps = 11/123 (8%)

Query: 117 LNMPNEVTFGVDQTELSDGAKRVLNSVAVVAKEYSKT--QLNVLGYTDSSGSDSYNLRLS 174
+ ++V F ++ L + L+ + + VLGYTD GSD+YN LS
Sbjct: 215 FTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLS 274

Query: 175 QVRAGEVGNYLMSKGVASARVKSKGMGEASPIASNANANGR---------AQNRRVEIVL 225
+ RA V +YL+SKG+ + ++ ++GMGE++P+ N N + A +RRVEI +
Sbjct: 275 ERRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEV 334

Query: 226 TPT 228

Sbjct: 335 KGI 337


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0711  FLGMOTORFLIG  31     0.014    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 30.5 bits (69), Expect = 0.014
Identities = 19/124 (15%), Positives = 43/124 (34%), Gaps = 13/124 (10%)

Query: 2 PVDNSENDHT---GHSLDQLNQALSSGMFVHVRNMLQK-MAASDIALILESSPPSARQVL 57
+ + D+ L + + G + R +L+K + I+ + +
Sbjct: 57 TITSELKDNVLLEFKELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSA----- 111

Query: 58 WQLIDQEQIGDILDELSEELKDPLIRSMSPERVAKATASMDTDDLAYILRSLPDAVYKQV 117
Q + + + I+ P+ +A + +D ++IL SLP V V
Sbjct: 112 ----LQSRPFEFVRRADPANILNFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNV 167

Query: 118 LQSM 121
+ +
Sbjct: 168 ARRI 171


64  Sbal195_0855  Sbal195_0871  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_0855  0       25       1.046332   elongation factor G
Sbal195_0856  0       18       1.842845   LysR family transcriptional regulator
Sbal195_0857  0       19       1.582045   hypothetical protein
Sbal195_0858  -1      19       1.285156   nitrate reductase cytochrome c-type subunit
Sbal195_0859  0       18       1.200765   nitrate reductase catalytic subunit
Sbal195_0860  -2      21       0.048444   NapD family protein
Sbal195_0861  -2      21       0.413202   hypothetical protein
Sbal195_0862  1       15       1.454807   prepilin-type cleavage/methylation-like protein
Sbal195_0863  1       16       1.837961   prepilin-type cleavage/methylation-like protein
Sbal195_0864  0       15       2.677821   methylation site containing protein
Sbal195_0865  -2      12       1.676587   methylation site containing protein
Sbal195_0866  -1      12       1.661002   hypothetical protein
Sbal195_0867  -1      12       1.553049   hypothetical protein
Sbal195_0868  -2      10       1.425665   ABC transporter-like protein
Sbal195_0869  -2      10       1.224206   amino acid carrier protein
Sbal195_0870  -2      11       0.530881   multi-sensor hybrid histidine kinase
Sbal195_0871  0       19       0.039550   response regulator receiver modulated metal
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0855  TCRTETOQM     541    0.0      Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 541 bits (1395), Expect = 0.0
Identities = 170/668 (25%), Positives = 292/668 (43%), Gaps = 60/668 (8%)

Query: 6 KYRNIGIFAHVDAGKTTTTERILKLTGKIHKIGEVHDGESTTDFMVQEAERGITIQSAAV 65
K NIG+ AHVDAGKTT TE +L +G I ++G V G + TD + E +RGITIQ+
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61

Query: 66 SCFWKDHRFNVIDTPGHVDFTVEVYRSLKVLDGGIGVFCGSGGVEPQSETNWRYANESEV 125
S W++ + N+IDTPGH+DF EVYRSL VLDG I + GV+ Q+ + + +
Sbjct: 62 SFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGI 121

Query: 126 ARIIFVNKLDRMGADFLRVVKQTKDVLAATPLVMVLPIGVEDEFTGVVDLLTRKAYVWDD 185
I F+NK+D+ G D V + K+ L+A ++ +K ++ +
Sbjct: 122 PTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIK------------------QKVELYPN 163

Query: 186 SGLPENFEVLDVPADMVDMVEEYREMLIETAVEQDDAVMEAYMEGEEPSMEDIKRCIRTG 245
+ + + +T +E +D ++E YM G+ ++++
Sbjct: 164 ----------MCVTNFTESEQ------WDTVIEGNDDLLEKYMSGKSLEALELEQEESIR 207

Query: 246 TRKLAFFPTYCGSAFKNKGMQLVLDAVVDYLPAPDEVDPQPLTDEEGNETGEFAIVSADE 305
+ FP Y GSA N G+ +++ + + +
Sbjct: 208 FHNCSLFPVYHGSAKNNIGIDNLIEVITNKFYSSTH--------------------RGQS 247

Query: 306 PLKALAFKI-MDDRFGALTFVRIYSGRLKKGDTILNSATGKTERIGRMCEMYADDRIEIE 364
L FKI ++ L ++R+YSG L D++ S K +I M + +I+
Sbjct: 248 ELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEK-IKITEMYTSINGELCKID 306

Query: 365 SAQAGDIIAIVGMKNVQTGHTLCDVKHPVTLEAMVFPEPVISIAVAPKDKGGSEKMGIAI 424
A +G+I+ + + ++ L D K E + P P++ V P E + A+
Sbjct: 307 KAYSGEIVILQN-EFLKLNSVLGDTKLLPQRERIENPLPLLQTTVEPSKPQQREMLLDAL 365

Query: 425 GKMIAEDPSFRVETDEDSGETILKGMGELHLDIKVDILKRTYGVELIVGEPQVAYRETIT 484
++ DP R D + E IL +G++ +++ +L+ Y VE+ + EP V Y E
Sbjct: 366 LEISDSDPLLRYYVDSATHEIILSFLGKVQMEVTCALLQEKYHVEIEIKEPTVIYMERPL 425

Query: 485 QMVEDQYTHKKQSGGSGQFGKIEYIIRPGEPNTGFVFKSSVVGGSVPKEFWPAVEKGFAS 544
+ E YT + + + I + P +G ++SSV G + + F AV +G
Sbjct: 426 KKAE--YTIHIEVPPNPFWASIGLSVSPLPLGSGMQYESSVSLGYLNQSFQNAVMEGIRY 483

Query: 545 MMNTGTIAGFPVLDVEFELTDGAFHAVDSSAIAFEIAAKGAFRQSIAKAKPQLLEPIMKV 604
G + G+ V D + G +++ S+ F + A Q + KA +LLEP +
Sbjct: 484 GCEQG-LYGWNVTDCKICFKYGLYYSPVSTPADFRMLAPIVLEQVLKKAGTELLEPYLSF 542

Query: 605 DVFSPDDNVGDVIGDLNRRRGMIKDQVAGVTGVRVKADVPLSEMFGYIGTLRTMTSGRGQ 664
+++P + + D + I D V + ++P + Y L T+GR
Sbjct: 543 KIYAPQEYLSRAYTDAPKYCANIVDTQLKNNEVILSGEIPARCIQEYRSDLTFFTNGRSV 602

Query: 665 FSMEFSHY 672
E Y
Sbjct: 603 CLTELKGY 610


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0858  PF06291       27     0.024    Lambda prophage Bor protein
		>PF06291#Lambda prophage Bor protein

Length = 102

Score = 26.5 bits (58), Expect = 0.024
Identities = 11/29 (37%), Positives = 15/29 (51%)

Query: 1 MRKILTLTALLVAITGCSGQQTDTNAAPV 29
M+K+L AL + ITGC+ Q P
Sbjct: 6 MKKMLFSAALAMLITGCAQQTFTVGNKPT 34


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0861  RTXTOXINA     30     0.042    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 29.5 bits (66), Expect = 0.042
Identities = 21/75 (28%), Positives = 31/75 (41%), Gaps = 14/75 (18%)

Query: 143 KGAKGNNIFNDAIVSCESLNLTGSSTIDGYDSRKGAYGDSFNN----AQGNNQLNKHGKG 198
G+K +IF+ A G I+G D YGD N+ G++QL G G
Sbjct: 732 FGSKFTDIFHGA---------DGDDLIEGNDGNDRLYGDKGNDTLSGGNGDDQLYG-GDG 781

Query: 199 NVTTVEPNADVTLSG 213
N + + L+G
Sbjct: 782 NDKLIGVAGNNYLNG 796


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0862  BCTERIALGSPG  31     0.001    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 31.4 bits (71), Expect = 0.001
Identities = 17/55 (30%), Positives = 29/55 (52%), Gaps = 6/55 (10%)

Query: 17 RQSGFSLSELMIAMV-LGLIIMIAVINFF-----APLKATVEESKRLENAADALR 65
+Q GF+L E+M+ +V +G++ + V N A + V + LENA D +
Sbjct: 6 KQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYK 60


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0864  BCTERIALGSPG  36     2e-05    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 35.6 bits (82), Expect = 2e-05
Identities = 17/48 (35%), Positives = 24/48 (50%), Gaps = 7/48 (14%)

Query: 8 GFTLVELMVTVAIISILGSLALPSY-------RDVMAREQLTAAANEL 48
GFTL+E+MV + II +L SL +P+ A + A N L
Sbjct: 9 GFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENAL 56


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0865  BCTERIALGSPG  49     2e-10    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 48.7 bits (116), Expect = 2e-10
Identities = 18/64 (28%), Positives = 35/64 (54%)

Query: 8 EKGFTLIELMIVVAIIGILAAIAIPSFSEYLKQGRRFDAQQYLMTSVQALERNYSRQGKY 67
++GFTL+E+M+V+ IIG+LA++ +P+ ++ + A ++ AL+ Y
Sbjct: 7 QRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDNHHY 66

Query: 68 PAAQ 71
P
Sbjct: 67 PTTN 70


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0870  HTHFIS        72     7e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 72.2 bits (177), Expect = 7e-15
Identities = 24/141 (17%), Positives = 52/141 (36%), Gaps = 13/141 (9%)

Query: 1286 SVLVVDDNATARDIMCTTLESMGFRVDTVRSGEEAISRCLLQAYEVALIDWKMPNMDGLE 1345
++LV DD+A R ++ L G+ V + ++ + D MP+ + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 1346 TARQIQLQAQSQSQSQSQPQSHPQPKILMVSAHADHEFLTQIEQLALAGYISKPISASRL 1405
+I+ ++ P +L++SA + + Y+ KP + L
Sbjct: 65 LLPRIK-------------KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTEL 111

Query: 1406 LDGIMNAIGREGILPVRRRTE 1426
+ I A+ P + +
Sbjct: 112 IGIIGRALAEPKRRPSKLEDD 132



Score = 71.4 bits (175), Expect = 1e-14
Identities = 29/134 (21%), Positives = 55/134 (41%), Gaps = 2/134 (1%)

Query: 1436 LQGKRILLVEDNEMNLEVASEFLEQVGIILSIATNGQIALDKLSQQHFDLVLMDCQMPVM 1495
+ G IL+ +D+ V ++ L + G + I +N ++ DLV+ D MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 1496 DGYQATQAIRKRPELANLPVVAMTANAMAGDRDMCIRAGMNDHIAKPIEVNVLYQTLLKY 1555
+ + I+K +LPV+ M+A G D++ KP ++ L + +
Sbjct: 61 NAFDLLPRIKKA--RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 1556 LAPASETVAVVTSA 1569
LA + +
Sbjct: 119 LAEPKRRPSKLEDD 132


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0871  HTHFIS        83     7e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 83.3 bits (206), Expect = 7e-20
Identities = 38/160 (23%), Positives = 63/160 (39%), Gaps = 8/160 (5%)

Query: 1 MEKATILVVDDTPENIDILIGILGD-DYKVKVAIDGPRALALVAKSRPDLILLDVMMPGM 59
M ATILV DD +L L Y V++ + +A DL++ DV+MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 60 NGYEVCKLLKQ-DPLTSHIPIIFVTALSESSDEAQGFALGAVDYITKPVSAPVVKARVKT 118
N +++ +K+ P +P++ ++A + + GA DY+ KP + +
Sbjct: 61 NAFDLLPRIKKARP---DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117

Query: 119 HLALY--DQKRLLEQQVKIRTHELEETRF-EIIRRLGRAA 155
LA +L + EI R L R
Sbjct: 118 ALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLM 157


65  Sbal195_0960  Sbal195_0969  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_0960  -2      12       1.092642   phospholipase A(1)
Sbal195_0961  -2      13       1.302613   uroporphyrin-III C-methyltransferase
Sbal195_0962  -2      12       0.373659   sulfate adenylyltransferase subunit 2
Sbal195_0963  -2      14       0.350153   sulfate adenylyltransferase subunit 1
Sbal195_0964  -2      15       -0.052644  TrkA domain-containing protein
Sbal195_0965  -1      14       -0.753682  adenylylsulfate kinase
Sbal195_0966  -1      13       -1.052035  integral membrane sensor signal transduction
Sbal195_0967  -2      11       -1.200193  response regulator receiver modulated
Sbal195_0968  -2      12       -1.261790  hypothetical protein
Sbal195_0969  -1      14       -1.434617  major facilitator superfamily transporter
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0960  PHPHLIPASEA1  217    1e-71    Bacterial phospholipase A1 protein signature.
		>PHPHLIPASEA1#Bacterial phospholipase A1 protein signature.

Length = 289

Score = 217 bits (553), Expect = 1e-71
Identities = 102/303 (33%), Positives = 156/303 (51%), Gaps = 26/303 (8%)

Query: 3 RLYSGIAMAGLLACTSINAEESLVEGRVKDE-----------LATAELPFVITPHKVNYI 51
R G + + ++ A+E+ V+ V D L + PF + P+ NY+
Sbjct: 2 RTLQGWLLPVFMLPMAVYAQEATVK-EVHDAPAVRGSIIANMLQEHDNPFTLYPYDTNYL 60

Query: 52 LPATYSPDPNMAPFAEDALINPYTLDEFEAKFQISFKFPIWYNVFGDNGHLFFAYTNQSY 111
+ S ++ A + + E KFQ+S FP+W + G N L +YT +S+
Sbjct: 61 IYTQTS---DLNKEAIASYDWAENARKDEVKFQLSLAFPLWRGILGPNSVLGASYTQKSW 117

Query: 112 WQVYNKDTSSPFRETNHEPEVFMLFNNDWKIGSVTNSFWGIGAVHQSNGKSGPLSRSWNR 171
WQ+ N + SSPFRETN+EP++F+ F D++ T +G H SNG+S P SRSWNR
Sbjct: 118 WQLSNSEESSPFRETNYEPQLFLGFATDYRFAGWTLRDVEMGYNHDSNGRSDPTSRSWNR 177

Query: 172 LYATMIFDAGPLAFSTKVWWRIPEDEKTDPHQARGDDNPNIDDYIGRAEFIGVYGIDEHR 231
LY ++ + G K W+ + DDNP+I Y+G + Y + +
Sbjct: 178 LYTRLMAENGNWLVEVKPWYVVGNT----------DDNPDITKYMGYYQLKIGYHLGDAV 227

Query: 232 FTLTLKTNLEDIDRGSAELTWSYPIVGNLRLYTQYFNGYGESLIDYNYHNQRIGIGISLN 291
+ + N G AEL SYPI ++RLYTQ ++GYGESLIDYN++ R+G+G+ LN
Sbjct: 228 LSAKGQYNWNT-GYGGAELGLSYPITKHVRLYTQVYSGYGESLIDYNFNQTRVGVGVMLN 286

Query: 292 DIL 294
D+
Sbjct: 287 DLF 289


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0963  TCRTETOQM     67     6e-14    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 67.2 bits (164), Expect = 6e-14
Identities = 46/155 (29%), Positives = 70/155 (45%), Gaps = 17/155 (10%)

Query: 41 VDDGKSTLIGRLLHDSAQIYEDQLASLKSDSAKMGTTGEAIDLALLVDGLQAEREQGITI 100
VD GK+TL LL++S I +L S+ + + D ER++GITI
Sbjct: 12 VDAGKTTLTESLLYNSGAI--TELGSVDKGTTRT-------------DNTLLERQRGITI 56

Query: 101 DVAYRYFSSDKRKFIIADTPGHEQYTRNMATGASTCDLAVILVDARYGVQTQTKRHAFIA 160
F + K I DTPGH + + S D A++L+ A+ GVQ QT+
Sbjct: 57 QTGITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHAL 116

Query: 161 SLLGIRHFVVAVNKMDLLGFD-EQVFNRIRADFTD 194
+GI + +NK+D G D V+ I+ +
Sbjct: 117 RKMGIPT-IFFINKIDQNGIDLSTVYQDIKEKLSA 150


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0966  PF06580       53     2e-09    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 52.6 bits (126), Expect = 2e-09
Identities = 30/125 (24%), Positives = 45/125 (36%), Gaps = 21/125 (16%)

Query: 497 ELEIDIDANIEMNSYPGALGQSLENFVTNAITHAFEGR-GNGQIKISAKMIEDQIVEITV 555
+ E I+ I P L Q+L V N I H G+I + ++ V + V
Sbjct: 241 QFENQINPAIMDVQVPPMLVQTL---VENGIKHGIAQLPQGGKILLKGTK-DNGTVTLEV 296

Query: 556 SDNGIGMSAETMKQIFDPFFTTRRGNGGTGLGLHLTYQLVSQLLGGK--ITVSSKLGKGS 613
+ G T + TG GL + + L G + I +S K GK +
Sbjct: 297 ENTGSLALKNTKE--------------STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVN 342

Query: 614 VFSLI 618
LI
Sbjct: 343 AMVLI 347


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0967  HTHFIS        45     1e-06    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 44.8 bits (106), Expect = 1e-06
Identities = 26/153 (16%), Positives = 54/153 (35%), Gaps = 15/153 (9%)

Query: 27 KILTVDDDSNFQRSTAFALSTLKVLDCKIELSQAFSYAEACQVLTKENDFAIALVDVVME 86
IL DDD+ + ALS ++ + A + + D + + DVVM
Sbjct: 5 TILVADDDAAIRTVLNQALS-----RAGYDVRITSNAATLWRWIAAG-DGDLVVTDVVMP 58

Query: 87 TEDAGLRLVRAIREVLGNEKIRIILLTGQPGMAPIFDVMRDYDINDYWTKS---ELSADR 143
E+ L+ I++ + +++++ Q DY K
Sbjct: 59 DEN-AFDLLPRIKKA--RPDLPVLVMSAQNTFMTAIKASEK-GAYDYLPKPFDLTELIGI 114

Query: 144 LQTILTTNLRSYQQISSIANAKRGLQLIAESSG 176
+ L R ++ +++ G+ L+ S+
Sbjct: 115 IGRALAEPKRRPSKLE--DDSQDGMPLVGRSAA 145


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_0969  TCRTETA       35     5e-04    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 34.8 bits (80), Expect = 5e-04
Identities = 32/143 (22%), Positives = 48/143 (33%), Gaps = 12/143 (8%)

Query: 249 VVNLLFAPAIGRFIGRIGERNALTVEYVGLIIVFISYALVEQAHMAAALY---VIDHLLF 305
++ AP +G R G R L V L + YA++ A LY ++ +
Sbjct: 54 LMQFACAPVLGALSDRFGRRPVLLV---SLAGAAVDYAIMATAPFLWVLYIGRIVAGITG 110

Query: 306 AMAIAMKTYFQKIADSKDIAAT---MSVSFTINHIAAVIIPVLLGLLWLTDPALVFYIGA 362
A Y I D + A MS F +A PVL GL+ P F+ A
Sbjct: 111 ATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAG---PVLGGLMGGFSPHAPFFAAA 167

Query: 363 GFAVCSLILALNVPRHPEPGNET 385
+ + + G
Sbjct: 168 ALNGLNFLTGCFLLPESHKGERR 190



Score = 30.9 bits (70), Expect = 0.009
Identities = 24/151 (15%), Positives = 58/151 (38%), Gaps = 11/151 (7%)

Query: 204 YWLYYLLTFFSGARRQIFMVFAGFMMVEKFGYSVSEITALFLINYVVNLLF-APAIGRFI 262
+++++ ++++F ++F + + I +++ L A G
Sbjct: 216 MAVFFIMQLVGQVPAALWVIFG----EDRFHWDATTIGISLAAFGILHSLAQAMITGPVA 271

Query: 263 GRIGERNALTVEYVGLIIVFISYALVEQAHMAAALYVIDHLLFAMAIAM---KTYFQKIA 319
R+GER AL + + +I A + MA + V LL + I M + +
Sbjct: 272 ARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMV---LLASGGIGMPALQAMLSRQV 328

Query: 320 DSKDIAATMSVSFTINHIAAVIIPVLLGLLW 350
D + + + +++ P+L ++
Sbjct: 329 DEERQGQLQGSLAALTSLTSIVGPLLFTAIY 359


66  Sbal195_1091  Sbal195_1096  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_1091  -2      14       1.395809   pseudouridine synthase
Sbal195_1092  -2      16       1.353683   phosphoribosylglycinamide formyltransferase 2
Sbal195_1093  -3      15       1.060931   hypothetical protein
Sbal195_1094  -3      14       1.220048   secretion protein HlyD family protein
Sbal195_1095  -3      14       1.792316   ATPase central domain-containing protein
Sbal195_1096  -1      15       1.949887   sulfate ABC transporter ATPase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1091  RTXTOXIND     34     0.002    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 34.0 bits (78), Expect = 0.002
Identities = 22/144 (15%), Positives = 44/144 (30%), Gaps = 18/144 (12%)

Query: 122 QINQINEQLFAVENHPDIPRLSTELETEQAQAQAELDAHRQVMIDSRQSRKAQRNQLAA- 180
+ I EQ +N + E + +AE + + ++++L
Sbjct: 187 LTSLIKEQFSTWQNQ------KYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDF 240

Query: 181 -QLAAN---PTDETL-RENAITEAKLS-QESISEKNQLRDIKRYWDERIHTISQ-----A 229
L L +EN EA + S+ Q+ E ++Q
Sbjct: 241 SSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEI 300

Query: 230 LSQLTDERDALRQQRKRLSAALQQ 253
L +L D + L+ ++
Sbjct: 301 LDKLRQTTDNIGLLTLELAKNEER 324


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1092  PF06057       31     0.005    Type IV secretory pathway VirJ component
		>PF06057#Type IV secretory pathway VirJ component

Length = 243

Score = 31.3 bits (71), Expect = 0.005
Identities = 10/28 (35%), Positives = 13/28 (46%), Gaps = 2/28 (7%)

Query: 17 GCGELGKEVAIELQRLGVEVIGVD--RY 42
G L K V LQ+ G V+G +Y
Sbjct: 62 GWATLDKAVGGILQQQGWPVVGWSSLKY 89


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1094  RTXTOXIND     61     2e-12    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 61.0 bits (148), Expect = 2e-12
Identities = 52/320 (16%), Positives = 98/320 (30%), Gaps = 80/320 (25%)

Query: 66 ITPAVKGLVSRVEVQPNTPVKQGDVLFRIDPIPFEAVVK--------------RKRAALV 111
I P +V + V+ V++GDVL ++ + EA R +
Sbjct: 99 IKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSR 158

Query: 112 AAEL--------------------EVPQLAAALESAKANVER----VNADKDRNKSAYER 147
+ EL EV +L + ++ + + + D+ ++
Sbjct: 159 SIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLT 218

Query: 148 YESGHRKGGANSPFTALELDNKRQL----------YLASEAQLTAARSE----------- 186
+ + S LD+ L L E + A +E
Sbjct: 219 VLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQI 278

Query: 187 -----ELRMRLA-----YESNIDG----VNTKVAGLQGDLASALYDLEQTVVRAPADGIV 232
+ +++ I + L +LA + +V+RAP V
Sbjct: 279 ESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKV 338

Query: 233 TQMALR-PGAMAVPLPLRPVMSFIPDEQRYFAGAFWQNSLL-RLKEGDEAEIILDAAPGK 290
Q+ + G V +M +P++ A QN + + G A I ++A P
Sbjct: 339 QQLKVHTEG--GVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYT 396

Query: 291 ---VFKGKVAKVLPAMAEGE 307
GKV + E +
Sbjct: 397 RYGYLVGKVKNINLDAIEDQ 416


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1095  HTHFIS        29     0.041    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.4 bits (66), Expect = 0.041
Identities = 12/49 (24%), Positives = 24/49 (48%), Gaps = 5/49 (10%)

Query: 266 VQQAKALDAPKGILLLGVQGSGKSLAAKAV---AGVWQRPLLRLDMAAL 311
+ + D +++ G G+GK L A+A+ P + ++MAA+
Sbjct: 153 LARLMQTDLT--LMITGESGTGKELVARALHDYGKRRNGPFVAINMAAI 199


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1096  PF05272       29     0.044    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.9 bits (64), Expect = 0.044
Identities = 11/38 (28%), Positives = 17/38 (44%), Gaps = 7/38 (18%)

Query: 30 MIGLLGPSGSGKTTLLRIIAGLEGADSGNIYFGDRDVT 67
+ L G G GK+TL+ + GL+ +F D
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLD-------FFSDTHFD 628


67  Sbal195_1160  Sbal195_1167  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_1160  5       39       -7.013328  type IV pilus modification protein PilV
Sbal195_1161  5       39       -7.330865  type IV pilus assembly protein PilW
Sbal195_1162  5       39       -7.162373  type IV pilus assembly protein PilX
Sbal195_1163  6       37       -7.154244  type IV pilin biogenesis protein
Sbal195_1164  1       30       -5.999836  type IV pilus biogenesis protein PilE
Sbal195_1165  0       26       -5.927348  hypothetical protein
Sbal195_1166  2       32       -6.376392  type IV pilus biogenesis protein
Sbal195_1167  1       32       -6.958579  type IV pilus biogenesis protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1160  BCTERIALGSPG  32     7e-04    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 31.8 bits (72), Expect = 7e-04
Identities = 10/24 (41%), Positives = 18/24 (75%), Gaps = 2/24 (8%)

Query: 21 QRGFSLIEVLVALVIL--VIGLIG 42
QRGF+L+E++V +VI+ + L+
Sbjct: 7 QRGFTLLEIMVVIVIIGVLASLVV 30


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1164  BCTERIALGSPG  52     1e-11    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 52.2 bits (125), Expect = 1e-11
Identities = 20/61 (32%), Positives = 38/61 (62%)

Query: 6 KGFTLIEVMITVVIIGILAAIAYPSYTQYIALSARSEGLAALMRIANLQEQYYLDNRVYA 65
+GFTL+E+M+ +VIIG+LA++ P+ + + + ++ ++ + N + Y LDN Y
Sbjct: 8 RGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDNHHYP 67

Query: 66 T 66
T
Sbjct: 68 T 68


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1166  BCTERIALGSPG  35     3e-05    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 35.2 bits (81), Expect = 3e-05
Identities = 12/28 (42%), Positives = 20/28 (71%)

Query: 6 KGFTLVELMVTIAVAAILLTIGVPSLTS 33
+GFTL+E+MV I + +L ++ VP+L
Sbjct: 8 RGFTLLEIMVVIVIIGVLASLVVPNLMG 35


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1167  BCTERIALGSPG  33     2e-04    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 32.9 bits (75), Expect = 2e-04
Identities = 14/50 (28%), Positives = 30/50 (60%), Gaps = 3/50 (6%)

Query: 5 QKGFSLIELMTTLSISTILFTVGTPSFT---DLSDQIRADSNIRTIQQTL 51
Q+GF+L+E+M + I +L ++ P+ + +D+ +A S+I ++ L
Sbjct: 7 QRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENAL 56


68  Sbal195_1319  Sbal195_1331  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_1319  0       12       1.186057   two component LuxR family transcriptional
Sbal195_1320  0       12       1.012651   integral membrane sensor signal transduction
Sbal195_1321  -1      14       0.455228   hypothetical protein
Sbal195_1322  0       13       0.646276   carotenoid oxygenase
Sbal195_1323  0       14       0.788155   hypothetical protein
Sbal195_1324  -1      10       0.164551   methyl-accepting chemotaxis sensory transducer
Sbal195_1325  -1      10       0.276417   hypothetical protein
Sbal195_1326  0       10       0.211299   TetR family transcriptional regulator
Sbal195_1327  -2      10       0.124235   NADH:flavin oxidoreductase
Sbal195_1328  -1      9        -0.762264  peptidase S16 lon domain-containing protein
Sbal195_1329  -1      9        -1.668345  PAS/PAC and GAF sensor(s)-containing diguanylate
Sbal195_1330  0       13       -2.147513  DEAD/DEAH box helicase
Sbal195_1331  1       17       -3.801012  transposase IS3/IS911 family protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1319  HTHFIS        70     3e-16    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 69.9 bits (171), Expect = 3e-16
Identities = 20/112 (17%), Positives = 51/112 (45%), Gaps = 2/112 (1%)

Query: 12 LVEDQQLVRQGIASLLAISDNIRVLWQAEDGQDALSQLDNNPVDVLLSDIRMPNLDGIAM 71
+ +D +R + L+ + V + + D++++D+ MP+ + +
Sbjct: 8 VADDDAAIRTVLNQALSRAG-YDVR-ITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 72 LKQIRQSANSLPVIMLTTFDDSELFLNSLQAGANGFLLKDVSLDKLLHAIET 123
L +I+++ LPV++++ + + + + GA +L K L +L+ I
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1320  PF06580          33  0.002    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 32.9 bits (75), Expect = 0.002
Identities = 19/101 (18%), Positives = 44/101 (43%), Gaps = 15/101 (14%)

Query: 291 LVLQEGISNAVRHG-----KANQLQLSMEDSQNALVLQLSDNGVGLTRVAARNASAKSGT 345
+++Q + N ++HG + ++ L + L++ + G + + K T
Sbjct: 258 MLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALK------NTKEST 311

Query: 346 GLYGTGLSGMQERLQP-FNGKVQLRANDLAPGCQLTLTLPA 385
GTGL ++ERLQ + + Q++ ++ + +P
Sbjct: 312 ---GTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIPG 349


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1324  FLAGELLIN        30  0.033    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 29.6 bits (66), Expect = 0.033
Identities = 30/256 (11%), Positives = 73/256 (28%), Gaps = 14/256 (5%)

Query: 74 AHDISVQTSKIAIGSAEVSHFIDLLNKSIESNGEHASAIAVAAGQLSHTTAQLGDNAADI 133
K+ + +A D + + + + A G
Sbjct: 217 DTTAPTVPDKVYVNAANGQLTTDDAENNTAVDLFKTTKSTAGTAE---AKAIAGAIKGGK 273

Query: 134 LGQAQEAERVSVQGRSQAQKG-----VSAIRSLSTDIDTAAEQVQALKSRAEEIQKITEV 188
G + + V+ ++ + I + A A A +Q V
Sbjct: 274 EGDTFDYKGVTFTIDTKTGNDGNGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNV 333

Query: 189 INSVAEQTNLLALNAAIEAARAGEQGRGFAVVADEVRSLAGKTAGATQDIGKMLLEIRSE 248
SV E+A+ + AV + ++ G A K+ L ++
Sbjct: 334 YTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTM 393

Query: 249 TDKTSGLMERVVTQTADVVA------AMGELDAHFTEISASVTQSAHALGDMEDSLKQYN 302
+ + A + +D+ +++ A + + ++
Sbjct: 394 FIDKTASGVSTLINEDAAAAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLG 453

Query: 303 NTTNDISRSVTQIRDS 318
NT +++ + ++I D+
Sbjct: 454 NTVTNLNSARSRIEDA 469


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1326  HTHTETR          60  2e-13    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 59.6 bits (144), Expect = 2e-13
Identities = 30/167 (17%), Positives = 62/167 (37%), Gaps = 3/167 (1%)

Query: 2 RNAEFDREQVLRGAMAAFMHKGYTKTSMQDLTQATGLHPGSIYCAFTNKRGLLIAAIEQY 61
+ A+ R+ +L A+ F +G + TS+ ++ +A G+ G+IY F +K L E
Sbjct: 7 QEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELS 66

Query: 62 QLDRNAQFNSLFAK-SENVLTNLKTYLDHIVAECLSCDSAQACLLTKALNEVAEQDVEIR 120
+ + AK + L+ L+ L H++ ++ + + + ++ +
Sbjct: 67 ESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVV 126

Query: 121 D-IINQYLQSWQQALTQQFTSAAEQGLLEGHRSDEQRAQYFMMGIYG 166
+ Q E +L +RA M G
Sbjct: 127 QQAQRNLCLESYDRIEQTLKHCIEAKMLPADL-MTRRAAIIMRGYIS 172


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1328  HTHFIS           31  0.028    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.6 bits (69), Expect = 0.028
Identities = 12/39 (30%), Positives = 17/39 (43%), Gaps = 6/39 (15%)

Query: 339 LFGYVENATFRGTVFTDFSLIRPGSLHKANGGVLLMDAV 377
LFG+ + A FT G +A GG L +D +
Sbjct: 208 LFGHEKGA------FTGAQTRSTGRFEQAEGGTLFLDEI 240


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1331  HTHFIS           26  0.043    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 25.6 bits (56), Expect = 0.043
Identities = 10/59 (16%), Positives = 21/59 (35%), Gaps = 6/59 (10%)

Query: 7 HKSYPQAFKDEAVLMVLEQ-GYSVADAAKSLGVSTSLLYNWKEKHQALQQGITLEESER 64
+ + +L L + AA LG++ + L + G+++ S R
Sbjct: 429 YDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIREL-----GVSVYRSSR 482


69  Sbal195_1418  Sbal195_1428  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias    %GCBias  Product
Sbal195_1418      -1       12   1.517605  TetR family transcriptional regulator
Sbal195_1419      -1       13   2.093061  hypothetical protein
Sbal195_1420      -1       16   1.949767  TonB-dependent heme/hemoglobin receptor family
Sbal195_1421       0       17   1.860191  PhnA protein
Sbal195_1422       0       19   1.755141  major facilitator superfamily transporter
Sbal195_1423       0       20   1.249610  MarR family transcriptional regulator
Sbal195_1424      -1       21   0.428329  succinylglutamate desuccinylase/aspartoacylase
Sbal195_1425       0       19  -0.711349  methyl-accepting chemotaxis sensory transducer
Sbal195_1426       0       19  -2.134402  fumarylacetoacetate (FAA) hydrolase
Sbal195_1427       0       16  -0.669275  hypothetical protein
Sbal195_1428       0       15  -0.346258  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1418  HTHTETR          57  2e-12    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 56.9 bits (137), Expect = 2e-12
Identities = 31/163 (19%), Positives = 63/163 (38%), Gaps = 5/163 (3%)

Query: 8 DRREKLI-LAMELFWQKGFAETSISDLVGHLGINRFSLYNSFGDKQKLYRECLSFYLDNY 66
+ R+ ++ +A+ LF Q+G + TS+ ++ G+ R ++Y F DK L+ E N
Sbjct: 11 ETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNI 70

Query: 67 SFGASDTLLHEKAGLAE-IAAYLARFVALQREQKYGCFMQNAVLEKSL--DDESVLQECQ 123
+ + L + ++ + + K + +V+Q+ Q
Sbjct: 71 GELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQ 130

Query: 124 RLFCRLQTS-FTQVLQDCQARGELLANVQPHQVAAFLVLQLQG 165
R C Q L+ C L A++ + A + + G
Sbjct: 131 RNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISG 173


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1419  FERRIBNDNGPP     28  0.046    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 27.6 bits (61), Expect = 0.046
Identities = 13/46 (28%), Positives = 22/46 (47%), Gaps = 2/46 (4%)

Query: 50 PALQFIEQMQPSILALSPRLTAVPKKVGGSLMRPQRDSRFSKDKTP 95
P L+ + +M+PS + S P+ + + + P R FS K P
Sbjct: 87 PNLELLTEMKPSFMVWSAGYGPSPEML--ARIAPGRGFNFSDGKQP 130


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1422  TCRTETB          31  0.008    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 31.0 bits (70), Expect = 0.008
Identities = 35/192 (18%), Positives = 68/192 (35%), Gaps = 9/192 (4%)

Query: 36 LPSIQEDISLSFTLASMLTLLPVLAMGLGCFAGFSIAKRLGFNTVMTGSLLLLIVATAMR 95
LP I D + + + +L +G ++ +LG ++ +++ + +
Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96

Query: 96 FWAMD-ASWLICSALLAGIGIA-LIQTIMPAMIKLNFGERVPLMMGLYVTAIMGGAALAA 153
F S LI + + G G A +M + + E GL + + G +
Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGV-- 154

Query: 154 SSAPFIGMNLGWRAGLGHWTWLGVVALALWMMVKHNAALPNQTAEQTVQLSFWRFRRSWL 213
P IG G A HW++L ++ + + V L + +
Sbjct: 155 --GPAIG---GMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSV 209

Query: 214 LAIFFALGTSCY 225
+FF L T+ Y
Sbjct: 210 GIVFFMLFTTSY 221


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1425  RTXTOXIND        35  8e-04    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 35.2 bits (81), Expect = 8e-04
Identities = 20/123 (16%), Positives = 43/123 (34%), Gaps = 8/123 (6%)

Query: 42 IETLAVPVQKQSNSLQLVLLKMSRLATLAHSQQDTAALTKSQQAFTALQKKYQSIENELT 101
T+ + + N ++ ++ ++L H Q ++ A + KY NEL
Sbjct: 216 RLTVLARINRYENLSRVEKSRLDDFSSLLHKQ------AIAKHAVLEQENKYVEAVNELR 269

Query: 102 ERVADQSKMQTSLHEAQARYQAYLQQSQAMFSAKLANEQAKQQYQQLFQRFNDAKTNASN 161
+ ++++ + A+ YQ Q + KL Q L +
Sbjct: 270 VYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLR--QTTDNIGLLTLELAKNEERQQA 327

Query: 162 AMI 164
++I
Sbjct: 328 SVI 330


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1426  PF07824          28  0.018    Type III secretion chaperone
		>PF07824#Type III secretion chaperone

Length = 120

Score = 27.6 bits (61), Expect = 0.018
Identities = 14/54 (25%), Positives = 23/54 (42%)

Query: 80 AIGLDLTKRDLQSKLKAKGLPWERAKAFDGAALFSPFVAIDDAEAPLHFTLSIN 133
A+G+ D Q+ + + K D L PF A+ + L + LS+N
Sbjct: 11 ALGIPSIDTDDQAIMLDDDVLIYIEKEGDSINLLCPFCALPENINDLIYALSLN 64


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1428  HTHFIS           30  0.001    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.8 bits (67), Expect = 0.001
Identities = 12/56 (21%), Positives = 22/56 (39%), Gaps = 3/56 (5%)

Query: 19 FKPNGIPMTQLEQVPLAEDEIEAMRLVDLL---GMQQQEAAKSMGVSRQTLANLVK 71
F G + E+E ++ L Q +AA +G++R TL ++
Sbjct: 416 FASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIR 471


70  Sbal195_1548  Sbal195_1555  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias    %GCBias  Product
Sbal195_1548       1       21   4.394641  TetR family transcriptional regulator
Sbal195_1549       1       20   4.106765  secretion protein HlyD family protein
Sbal195_1550       2       19   3.201341  ABC transporter-like protein
Sbal195_1551       0       16   1.538891  ABC transporter
Sbal195_1552      -2       16  -2.074941  acetyl-CoA hydrolase/transferase
Sbal195_1553       1       19  -4.422473  hypothetical protein
Sbal195_1554       0       19  -3.883621  beta-lactamase
Sbal195_1555       2       21  -5.008891  transposase IS3/IS911 family protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1548  HTHTETR          75  1e-18    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 75.0 bits (184), Expect = 1e-18
Identities = 29/162 (17%), Positives = 59/162 (36%), Gaps = 6/162 (3%)

Query: 31 SDARQRLITAALSLFSHRSYPTVSTREIAREAEVDAALIRYYFGSKAGLFEQMVRETLEP 90
+ RQ ++ AL LFS + + S EIA+ A V I ++F K+ LF ++ +
Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESN 69

Query: 91 VLTRLREISAAEAPNN---VGEIMQTYYRVMAPNPGLPRLIIRVLQEGDGSEPYRIILSV 147
+ E A + + EI+ L+ + + + ++
Sbjct: 70 IGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQA 129

Query: 148 FEQVLTLSRQWLESTL---VNSGLLKEGVDPDLARLSFVSLM 186
+ S +E TL + + +L + A + +
Sbjct: 130 QRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYI 171


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1549  RTXTOXIND        55  3e-10    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 54.8 bits (132), Expect = 3e-10
Identities = 32/176 (18%), Positives = 62/176 (35%), Gaps = 17/176 (9%)

Query: 86 TVERDRLTLTAPVGELITQVNVVEGQQVKAGEVLLTLDSTSANARLALRQAELEQAKAKL 145
T + ++ ++ V EG+ V+ G+VLL L + A A Q+ L Q A+L
Sbjct: 91 THSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQ--ARL 148

Query: 146 SEAVTGARLEDIERAKAVLDGAKASVKEAQRAFERTNRLYATKVLSQADLDTARAARDTS 205
+ IE + + F+ + +VL L + T
Sbjct: 149 EQTRYQILSRSIEL-----NKLPELKLPDEPYFQNVS---EEEVLRLTSL--IKEQFSTW 198

Query: 206 LAKQAEAEQSLRLLENGTRSEQLEQAKAAVAAASASVAIEQKALADLSLVAARDAV 261
++ + E +L + + A + +E+ L D S + + A+
Sbjct: 199 QNQKYQKELNLD-----KKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAI 249



Score = 49.1 bits (117), Expect = 2e-08
Identities = 31/232 (13%), Positives = 78/232 (33%), Gaps = 15/232 (6%)

Query: 108 VEGQQVKAGEVLLTLDSTSANARLALRQAELEQAKAKLSEAVTGARLEDIERAKAVLDGA 167
V ++V L+ ++ + ++ L++ +A+ + AR+ E V
Sbjct: 179 VSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVL--ARINRYENLSRVEKSR 236

Query: 168 KASVKE-AQRAFERTNRLYATK---VLSQADLDTARAARDTSLAKQAEAEQSLRLLENGT 223
+ + + + V + +L ++ + ++ A++ +L+
Sbjct: 237 LDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLF 296

Query: 224 RSEQLEQAKAAVAAASASVAIEQKALADLS---LVAARDAVVDTLP-WRVGDRIAAGTQL 279
++E L++ + K + A V L G + L
Sbjct: 297 KNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETL 356

Query: 280 IGLLASEDPY-VRVYLPATWLDRVKAGDKVNIRVDG----REMPIAGTVRNI 326
+ ++ +D V + + + G I+V+ R + G V+NI
Sbjct: 357 MVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNI 408


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1551  ABC2TRNSPORT     40  8e-06    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 39.9 bits (93), Expect = 8e-06
Identities = 48/200 (24%), Positives = 91/200 (45%), Gaps = 24/200 (12%)

Query: 186 GVILTMTMVMFT----SAAIVREREQGNMEFLITTPVRPLELMLGKI--------VPYVI 233
G++ T M T AA R Q E ++ T +R +++LG++ +
Sbjct: 72 GMVATSAMTAATFETIYAAFGRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAG 131

Query: 234 VGFVQVTIILSAG-HLLFDVPIRGGIDSIALAAMLFICASLTLGLVISTIAKTQLQSMQM 292
+G V + + LL+ +P+ IAL + F +LG+V++ +A + +
Sbjct: 132 IGVVAAALGYTQWLSLLYALPV------IALTGLAFA----SLGMVVTALAPSYDYFIFY 181

Query: 293 TVFILLPSILLSGFMFPYEAMPIAAQWIAEALPATHFMRMSRAIVLRDAQVMDLQFDALW 352
++ P + LSG +FP + +PI Q A LP +H + + R I+L V+D+
Sbjct: 182 QTLVITPILFLSGAVFPVDQLPIVFQTAARFLPLSHSIDLIRPIML-GHPVVDVCQHVGA 240

Query: 353 MIGFTCIGLFIASMRFSKRL 372
+ + I F+++ +RL
Sbjct: 241 LCIYIVIPFFLSTALLRRRL 260


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1555  HTHFIS           26  0.043    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 25.6 bits (56), Expect = 0.043
Identities = 10/59 (16%), Positives = 21/59 (35%), Gaps = 6/59 (10%)

Query: 7 HKSYPQAFKDEAVLMVLEQ-GYSVADAAKSLGVSTSLLYNWKEKHQALQQGITLEESER 64
+ + +L L + AA LG++ + L + G+++ S R
Sbjct: 429 YDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIREL-----GVSVYRSSR 482


71  Sbal195_1628  Sbal195_1635  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias    %GCBias  Product
Sbal195_1628       1       21  -0.786258  ATP-dependent protease ATP-binding subunit ClpX
Sbal195_1629       1       19  -0.717988  ATP-dependent protease La
Sbal195_1630       1       16  -0.423388  histone family protein DNA-binding protein
Sbal195_1631      -1       13   0.245123  PpiC-type peptidyl-prolyl cis-trans isomerase
Sbal195_1632      -1       13   0.675339  TOBE domain-containing protein
Sbal195_1633      -1       12   0.632642  trans-2-enoyl-CoA reductase
Sbal195_1634       0       15   1.122377  ABC transporter-like protein
Sbal195_1635      -1       15   0.827371  oligopeptide/dipeptide ABC transporter ATPase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1628  HTHFIS           30  0.018    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.2 bits (68), Expect = 0.018
Identities = 15/70 (21%), Positives = 29/70 (41%), Gaps = 13/70 (18%)

Query: 64 KLPTPHELRAHLDDYVIGQDRAKKVLSVAVYNHYKRLKNASPKDGIELGKSNILLIGPTG 123
+ P+ E + ++G+ S A+ Y+ L D +++ G +G
Sbjct: 124 RRPSKLEDDSQDGMPLVGR-------SAAMQEIYRVLARLMQTD------LTLMITGESG 170

Query: 124 SGKTLLAETL 133
+GK L+A L
Sbjct: 171 TGKELVARAL 180


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1629  HTHFIS           34  0.002    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 34.4 bits (79), Expect = 0.002
Identities = 45/211 (21%), Positives = 76/211 (36%), Gaps = 37/211 (17%)

Query: 262 NMPAEAKEKALAELNKLRMMSP---MSAEATV---VRSY----VDWMTSVPWSQRSKIKR 311
MP E L + K R P MSA+ T +++ D++ P+ I
Sbjct: 56 VMPDENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPK-PFDLTELIGI 114

Query: 312 D---------LAKAQEVLDTDHFGLEKVKERILEYLAVQSRVRQLKGPILCLVGPPGVGK 362
E D L + E V +R+ Q ++ + G G GK
Sbjct: 115 IGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLM-ITGESGTGK 173

Query: 363 TSLGQSIAKATGRK---YVRVALGGVRD---EAEIRGHRRTYIGSMPGKVIQKMAKVGVK 416
+ +++ R+ +V + + + E+E+ GH + G+ G + +
Sbjct: 174 ELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEK---GAFTGAQTRSTGRFEQA 230

Query: 417 N--PLFLLDEIDKMSSDMRGDPASALLEVLD 445
LFL DEI M D + + LL VL
Sbjct: 231 EGGTLFL-DEIGDMPMDAQ----TRLLRVLQ 256


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1630  DNABINDINGHU    119  4e-39    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 119 bits (300), Expect = 4e-39
Identities = 53/88 (60%), Positives = 69/88 (78%)

Query: 2 NKSELIEKIASGADISKAAAGRALDSFIAAVTEGLKEGDKISLVGFGTFEVRERAERTGR 61
NK +LI K+A +++K + A+D+ +AV+ L +G+K+ L+GFG FEVRERA R GR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 62 NPQTGEEIKIAAAKIPAFKAGKALKDAV 89
NPQTGEEIKI A+K+PAFKAGKALKDAV
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAV 90


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1634  HTHFIS           29  0.022    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.0 bits (65), Expect = 0.022
Identities = 13/28 (46%), Positives = 16/28 (57%)

Query: 42 TLAIVGEAGSGKSTLARILVGAEPRSGG 69
TL I GE+G+GK +AR L R G
Sbjct: 162 TLMITGESGTGKELVARALHDYGKRRNG 189


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1635  HTHFIS           30  0.011    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.2 bits (68), Expect = 0.011
Identities = 9/16 (56%), Positives = 14/16 (87%)

Query: 38 LVGESGSGRSLLARAI 53
+ GESG+G+ L+ARA+
Sbjct: 165 ITGESGTGKELVARAL 180


72  Sbal195_1869  Sbal195_1876  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias    %GCBias  Product
Sbal195_1869       1       18   1.794961  Ppx/GppA phosphatase
Sbal195_1870       1       18   1.949998  polyphosphate kinase
Sbal195_1871       0       17   3.267831  putative chaperone
Sbal195_1872      -1       16   3.022970  CreA family protein
Sbal195_1873      -1       16   2.961790  cystathionine beta-lyase
Sbal195_1874      -1       15   2.659098  integral membrane sensor signal transduction
Sbal195_1875      -1       16   2.160642  two component transcriptional regulator
Sbal195_1876      -2       13   1.390895  OmpA/MotB domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1869  SHAPEPROTEIN     31  0.010    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 31.3 bits (71), Expect = 0.010
Identities = 16/36 (44%), Positives = 23/36 (63%)

Query: 158 NLVIDIGGGSTEVVIGKKNTPTQLSSLRCGCVSFNE 193
++V+DIGGG+TEV + N SS+R G F+E
Sbjct: 161 SMVVDIGGGTTEVAVISLNGVVYSSSVRIGGDRFDE 196


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1871  SHAPEPROTEIN     41  6e-06    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 41.3 bits (97), Expect = 6e-06
Identities = 24/81 (29%), Positives = 42/81 (51%), Gaps = 11/81 (13%)

Query: 192 AAKRAGFVDVDFLFEPLAAGMDYEASLTDNKTVLVVDVGGGTTDCSVVKMGPAHQQKADR 251
+A+ AG +V + EP+AA + +++ +VVD+GGGTT+ +V+ +
Sbjct: 129 SAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLN--------- 179

Query: 252 SEDFLGHSGQRIGGNDLDIAL 272
+ S RIGG+ D A+
Sbjct: 180 --GVVYSSSVRIGGDRFDEAI 198


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1875  HTHFIS           64  4e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 64.5 bits (157), Expect = 4e-14
Identities = 29/136 (21%), Positives = 53/136 (38%), Gaps = 7/136 (5%)

Query: 3 RIAIVEDEAAIRENYKDVLQQHGYSVQTYADRPSAMLAFNTRLPDLAIIDIGLGNEIDGG 62
I + +D+AAIR L + GY V+ ++ + DL + D+ + +E
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE--NA 62

Query: 63 FMLCQSLRAMSSTLPIIFLTARDSDFDTVCGLRLGADDYLSKEVSFPHLTARLAALFRRS 122
F L ++ LP++ ++A+++ + GA DYL K L
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGI-----IGR 117

Query: 123 ELAASQTAQENLLERG 138
LA + L +
Sbjct: 118 ALAEPKRRPSKLEDDS 133


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_1876  OMPADOMAIN       65  3e-14    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 65.0 bits (158), Expect = 3e-14
Identities = 28/92 (30%), Positives = 48/92 (52%), Gaps = 2/92 (2%)

Query: 154 ELALGLNVQFRTGSSEVESHFLPQLDDVAEVM-NLSP-ELNLELKGYADRRGDVSYNQAL 211
L +V F + ++ LD + + NL P + ++ + GY DR G +YNQ L
Sbjct: 214 HFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGL 273

Query: 212 SEQRLLEVRGYLIKQGVAAERMTTQAFGALSP 243
SE+R V YLI +G+ A++++ + G +P
Sbjct: 274 SERRAQSVVDYLISKGIPADKISARGMGESNP 305


73  Sbal195_2254  Sbal195_2260  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias    %GCBias  Product
Sbal195_2254       0       18  -0.285650  type III secretion low calcium response
Sbal195_2255       1       18  -0.291337  secretion system effector
Sbal195_2256       1       21  -1.114457  putative pathogenicity island effector protein
Sbal195_2257       1       22  -1.999139  secretion system effector SseE
Sbal195_2258       2       22  -2.386326  hypothetical protein
Sbal195_2259       3       23  -2.931714  type III secretion low calcium response
Sbal195_2260       2       24  -2.687142  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_2254  SYCDCHAPRONE     90  1e-25    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 90.0 bits (223), Expect = 1e-25
Identities = 32/148 (21%), Positives = 58/148 (39%), Gaps = 2/148 (1%)

Query: 9 LEHFLQRGGSLRMLADVEQSDLNVLYQYALQLMACRDQQGAKRIFYLLMRIEQWNYDYCF 68
+E FL+ GG++ ML ++ L LY A + A ++F L ++ ++ +
Sbjct: 15 MESFLKGGGTIAMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFL 74

Query: 69 SLGICCQQLHEHEEAIFCLGRAGMIKVDNPLPAYHAGLSYLALGNHDYAKRSFNASLRWC 128
LG C Q + +++ AI ++ + P +HA L G A+ +
Sbjct: 75 GLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELI 134

Query: 129 EGHPESTGIAASAQRGLA--TLAKENSH 154
E ++ L L KE H
Sbjct: 135 ADKTEFKELSTRVSSMLEAIKLKKEMEH 162


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_2258  CLENTEROTOXN     29  0.036    Clostridium enterotoxin signature.
		>CLENTEROTOXN#Clostridium enterotoxin signature.

Length = 319

Score = 29.2 bits (65), Expect = 0.036
Identities = 22/132 (16%), Positives = 40/132 (30%), Gaps = 18/132 (13%)

Query: 246 SSEDNSLRYAVTPSRYELLNCVAAHGMEDEGLARVLYQAKVGNTNLGALYGLPAPKDAPQ 305
+E + T +Y+ + ++ + D+G L + T A
Sbjct: 124 PNEYVYYKVYATYRKYQAIR-ISHGNISDDGSIYKLTGIWLSKT------------SADS 170

Query: 306 LDNVDD--FILCDEDINLGVSQTDVYADEETFYQGIGQHQTTTTGDN--CYKLLQLNIND 361
L N+D I E L V TD+ + + T ++ L +
Sbjct: 171 LGNIDQGSLIETGERCVLTVPSTDIEKEILDLAAATERLNLTDALNSNPAGNLYDWR-SS 229

Query: 362 GLHYLATKANPH 373
+ K N H
Sbjct: 230 NSYPWTQKLNLH 241


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_2259  SYCDCHAPRONE     93  7e-27    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 93.1 bits (231), Expect = 7e-27
Identities = 43/148 (29%), Positives = 68/148 (45%)

Query: 9 DFEKLEAACQLALVNQQTLAEQVGLTSQDLELIYQSGTSKYQMGLPAEAIVDFTYLVMHQ 68
D ++ + A + L T+A ++S LE +Y ++YQ G +A F L +
Sbjct: 7 DTQEYQLAMESFLKGGGTIAMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLD 66

Query: 69 PWDRRFHLGLGSCLHWLGEYQHALTFYGYALLMDACSPEASFRIAQCFLSLNDDAAAIEA 128
+D RF LGLG+C +G+Y A+ Y Y +MD P F A+C L + A A
Sbjct: 67 HYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESG 126

Query: 129 LQMAISQSYSKPEHHFVGDQAQQLLSAL 156
L +A K E + + +L A+
Sbjct: 127 LFLAQELIADKTEFKELSTRVSSMLEAI 154


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_2260  RTXTOXIND        38  6e-05    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 37.5 bits (87), Expect = 6e-05
Identities = 8/114 (7%), Positives = 38/114 (33%), Gaps = 1/114 (0%)

Query: 259 YQLRQLQASALTQQGNMKLSEAQL-ALKESQANEKTAQFDAEIRMKQSERFRGTNQTLQQ 317
+ Q+S L + + +++ ++ E + + E +++
Sbjct: 134 ADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKE 193

Query: 318 QLAEKEGLLLQSQNQFEQLQSRFDKSNVQLSGVMQQLQMLQQQLAELQPARARN 371
Q + + Q + ++ ++ +++ ++ + +L + +
Sbjct: 194 QFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQ 247



Score = 29.0 bits (65), Expect = 0.037
Identities = 18/129 (13%), Positives = 49/129 (37%), Gaps = 13/129 (10%)

Query: 244 SQNKSFEAEVSSVKAYQLRQLQASALTQQGNMKLSEAQLALKESQANEKTAQFDAEIRMK 303
Q +++ + + L + +A LT + E +++S+ ++ ++
Sbjct: 193 EQFSTWQNQKYQKEL-NLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLH------ 245

Query: 304 QSERFRGTNQTL--QQQLAEKEGLLLQSQNQFEQLQSRFDKSNVQLSGVMQQLQMLQQQL 361
++ + L + + E L ++Q EQ++S + + V Q + + L
Sbjct: 246 --KQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFK--NEIL 301

Query: 362 AELQPARAR 370
+L+
Sbjct: 302 DKLRQTTDN 310


74  Sbal195_2276  Sbal195_2281  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias    %GCBias  Product
Sbal195_2276       3       17  -1.279785  hypothetical protein
Sbal195_2277       3       18  -2.226928  hypothetical protein
Sbal195_2278       3       20  -3.486834  type III secretion system protein
Sbal195_2279       2       17  -3.104855  HrpO family type III secretion protein
Sbal195_2280       0       20  -3.701305  type III secretion protein SpaR/YscT/HrcT
Sbal195_2281       1       22  -5.063802  secretion system apparatus protein SsaU
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_2276  PF06340          29  0.007    Vibrio cholerae toxin co-regulated pilus biosynthesis pr...
		>PF06340#Vibrio cholerae toxin co-regulated pilus biosynthesis protein F (TcpF)

Length = 338

Score = 28.8 bits (64), Expect = 0.007
Identities = 7/39 (17%), Positives = 19/39 (48%)

Query: 27 EMRRLFNRYFCGQQEDDNAISTKDRLTAKALLSRDGGVY 65
+++L+ ++ Q D I T+D++ + +G +
Sbjct: 110 SLQKLYIDFYLAQTTFDWEIPTRDQIETLVNYANEGKLS 148


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_2277  FLGMOTORFLIM     28  0.047    Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 27.9 bits (62), Expect = 0.047
Identities = 9/44 (20%), Positives = 19/44 (43%)

Query: 224 SLLPKMDAIQPPLTADIGRVSLPLAKLGAMMTGDKLTLEVTLNN 267
L K+ + + A++G + L + + + GD + L T
Sbjct: 249 VLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVG 292


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_2278  TYPE3IMPPROT    210  3e-71    Type III secretion system inner membrane P protein ...
		>TYPE3IMPPROT#Type III secretion system inner membrane P protein family signature.

Length = 224

Score = 210 bits (537), Expect = 3e-71
Identities = 81/215 (37%), Positives = 129/215 (60%), Gaps = 7/215 (3%)

Query: 8 IQLIIMLFCLSLLPLFAVMGTSFLKLAIVFSMLRNALGIQQIPPNMAIYGLALILTLFTM 67
I LI +L +LLP GT F+K +IVF M+RNALG+QQIP NM + G+AL+L++F M
Sbjct: 5 ISLIALLAFSTLLPFIIASGTCFVKFSIVFVMVRNALGLQQIPSNMTLNGVALLLSMFVM 64

Query: 68 APVGMAINDNLKATPIVFDAPNVFEQINTEAIAPYRAFLEKNTSNTQIEFFANIGHKVWP 127
P+ + + F+ + + E + YR +L K + ++FF N K
Sbjct: 65 WPIMHDAYVYFEDEDVTFNDISSLSKHVDEGLDGYRDYLIKYSDRELVQFFENAQLKRQY 124

Query: 128 EKYQQV-------LTKDSLLVMVPAFTMSQLIEAFKIGLLIYLPFVAIDLIVSNILLAMG 180
+ + + K S+ ++PA+ +S++ AFKIG +YLPFV +DL+VS++LLA+G
Sbjct: 125 GEETETVKRDKDEIEKPSIFALLPAYALSEIKSAFKIGFYLYLPFVVVDLVVSSVLLALG 184

Query: 181 MMMVSPMTIALPFKLLIFILMGGWEKLISQLMMSF 215
MMM+SP+TI+ P KL++F+ + GW L L++ +
Sbjct: 185 MMMMSPVTISTPIKLVLFVALDGWTLLSKGLILQY 219


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_2279  TYPE3IMQPROT     70  7e-20    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 70.2 bits (172), Expect = 7e-20
Identities = 33/83 (39%), Positives = 50/83 (60%)

Query: 6 IVHFTSELLWMVLLLSLPVVIVASVVGVLVSLIQALTQIQDQTLQFLIKLIAVCVTLVVC 65
+V ++ L++VL+LS IVA+++G+LV L Q +TQ+Q+QTL F IKL+ VC+ L +
Sbjct: 4 LVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFLL 63

Query: 66 YHWMGSSLLNYASMAFDQISQMG 88
W G LL+Y G
Sbjct: 64 SGWYGEVLLSYGRQVIFLALAKG 86


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_2280  TYPE3IMRPROT    126  2e-37    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.

Length = 261

Score = 126 bits (319), Expect = 2e-37
Identities = 47/238 (19%), Positives = 104/238 (43%), Gaps = 5/238 (2%)

Query: 1 MTTQLPNLLTAQLPVLALCMMRPLGMMLLLPLFKGGAMGSALIRNSLILMFALPTVLAMD 60
M + L + ++R L ++ P+ ++ ++ L +M +
Sbjct: 1 MLQVTSEQWLSWLNLYFWPLLRVLALISTAPILSERSVPK-RVKLGLAMMITFA-IAPSL 58

Query: 61 EMQPILQQADTWMLISLFGKEIIVGMLLGFCAAIPFWAIDMAGFVIDTMRGASMSTVLNP 120
+ + + +++ ++I++G+ LGF F A+ AG +I G S +T ++P
Sbjct: 59 PANDVPVFSFFALWLAV--QQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDP 116

Query: 121 LMGLQSSIYGMLFTQVLTVLFLVSGGFNFLLTALYQSYQQLPPGFNLTLSQPLMVFIAHE 180
L + + + +LFL G +L++ L ++ LP G S +
Sbjct: 117 ASHLNMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAG 176

Query: 181 WQLMCQLCLSFAMPAMVIMILVDVALGLVNRSAQQLNVFFLSMPIKSALVLLLLIYSL 238
+ L A+P + +++ +++ALGL+NR A QL++F + P+ + + L+ +
Sbjct: 177 SLIF-LNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALM 233


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_2281  TYPE3IMSPROT    356  e-124    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 356 bits (915), Expect = e-124
Identities = 120/346 (34%), Positives = 192/346 (55%)

Query: 2 AEKTEKPTEKRLREARNRGQVIKSAEIVTGLQMAIILGYFLYEGPALVQAIMALIDLTIH 61
EKTE+PT K++R+AR +GQV KS E+V+ + + + + L+ +
Sbjct: 3 GEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPAE 62

Query: 62 AINLPLETAAEQIVGTFAVLALRFLGGLTLVLVFTIVVGNLVQTGPVWAAESIMPSMDKL 121
LP A +V + L V + ++VQ G + + E+I P + K+
Sbjct: 63 QSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIKKI 122

Query: 122 NVMNNAKQLISLKSLFELAKNLVKVTVLSLVFYYLLHRYVNAFQYLPLCGEACGISVIST 181
N + AK++ S+KSL E K+++KV +LS++ + ++ + LP CG C ++
Sbjct: 123 NPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLLGQ 182

Query: 182 MITWLWGSFLGCYLIFGIADYAFQRYSLMKELKMSKDDTKQEYKDSEGNPEMKQKRRETQ 241
++ L +++ IADYAF+ Y +KELKMSKD+ K+EYK+ EG+PE+K KRR+
Sbjct: 183 ILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQFH 242

Query: 242 REVASGSLASNVRKATVVVRNPTHIAVCLYYSEGETPLPKVLEKAEDHMALHIVALAEKA 301
+E+ S ++ NV++++VVV NPTHIA+ + Y GETPLP V K D + +AE+
Sbjct: 243 QEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAEEE 302

Query: 302 GVPIVENIPLARALFKHVEAGDVIPESLFEPVAELLRLVMTISYDN 347
GVPI++ IPLARAL+ IP E AE+LR + + +
Sbjct: 303 GVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQNIEK 348


75  Sbal195_2358  Sbal195_2366  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias    %GCBias  Product
Sbal195_2358       0       15  -0.170832  response regulator receiver modulated
Sbal195_2359       0       13  -0.171196  response regulator receiver modulated CheB
Sbal195_2360       1       12  -0.681617  chemoreceptor glutamine deamidase CheD
Sbal195_2361       0       11  -2.436351  protein-glutamate O-methyltransferase
Sbal195_2362      -1       11  -2.648183  methyl-accepting chemotaxis sensory transducer
Sbal195_2363       1       14  -3.567355  putative CheW protein
Sbal195_2364       0       12  -1.713275  signal transduction histidine kinase CheA
Sbal195_2365       1       13  -2.109961  response regulator receiver protein
Sbal195_2366      -1       10  -2.452979  response regulator receiver protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_2358  HTHFIS           73  1e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 72.6 bits (178), Expect = 1e-15
Identities = 28/164 (17%), Positives = 65/164 (39%), Gaps = 6/164 (3%)

Query: 255 KVLLVDDQQSMVDYFSSLLRSHGLMVKGLSSAEQVLPALEQFEPDLFIFDLYMPEVNGLE 314
+L+ DD ++ + L G V+ S+A + + + DL + D+ MP+ N +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 315 LAKMIRQLDKYTSSPILVLSSDDTMQNKVSIIQAGSDDLISKQTAPSLFVA---QVISRA 371
L I++ P+LV+S+ +T + + G+ D + K + + + ++
Sbjct: 65 LLPRIKKARPDL--PVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 372 QRGHDIRSSASRDSLTGLLNHTQILVAARRCYNVARRINSQVCI 415
+R + L+ + + R + + + I
Sbjct: 123 KRRPS-KLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMI 165



Score = 55.2 bits (133), Expect = 4e-10
Identities = 30/135 (22%), Positives = 59/135 (43%), Gaps = 2/135 (1%)

Query: 131 HIAIIEDDGNVGAMITKQLREFGFSVQHFLNFTSFLVVQNETPFDLILLDLILPDWTEEA 190
I + +DD + ++ + L G+ V+ N + DL++ D+++PD
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 191 LFEAATEFEKNNTRVFVLSSRGDFDMRLLAIRANVSEYFVKPAETTLLVRKIHQSLKMSE 250
L + + + V V+S++ F + A +Y KP + T L+ I ++L +
Sbjct: 65 LLPRIKK-ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 251 KQPLKVLLVDDQQSM 265
++P L D Q M
Sbjct: 124 RRP-SKLEDDSQDGM 137


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Sbal195_2359  HTHFIS  69     7e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 68.7 bits (168), Expect = 7e-15
Identities = 29/118 (24%), Positives = 51/118 (43%), Gaps = 6/118 (5%)

Query: 3 IKVLVVDDSALMRSLLGKMIEADPELSLVGQAADAFEAKDLVNQFRPDVITLDIEMPKVD 62
+LV DD A +R++L + + + ++A + D++ D+ MP +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRI--TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 63 GLTFLDRLMKARPTAVVMISSLTEQG-ADATFNALALGAVDFIPKPKLDSPQGIHDYQ 119
L R+ KARP V++ ++ Q A GA D++PKP D + I
Sbjct: 62 AFDLLPRIKKARPDLPVLV--MSAQNTFMTAIKASEKGAYDYLPKP-FDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Sbal195_2364  PF06580  44     2e-06    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 43.7 bits (103), Expect = 2e-06
Identities = 24/151 (15%), Positives = 51/151 (33%), Gaps = 52/151 (34%)

Query: 440 EIDKGMIEKLVDPLT--HLVRNSLDHGIEKPEKRLAAGKSEVGVLSLKASQRGGNIVIAV 497
+I+ +++ V P+ LV N + HGI + + G + LK ++ G + + V
Sbjct: 245 QINPAIMDVQVPPMLVQTLVENGIKHGIAQ--------LPQGGKILLKGTKDNGTVTLEV 296

Query: 498 HDNGAGLNRERIIQKARESGLQVADNSSDKQIWQLIFAAGFSTAVEVTDVSGRGVGMDVV 557
+ G+ + G G+ V
Sbjct: 297 ENTGSLALKNTK--------------------------------------ESTGTGLQNV 318

Query: 558 RRNIEALGG---RIDIESTEGQGSTFEIQLP 585
R ++ L G +I + +G+ + + +P
Sbjct: 319 RERLQMLYGTEAQIKLSEKQGKVNAM-VLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Sbal195_2365  HTHFIS  88     1e-23    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 87.6 bits (217), Expect = 1e-23
Identities = 29/122 (23%), Positives = 53/122 (43%), Gaps = 3/122 (2%)

Query: 1 MSK-KILIVDDSAAIRQMVEATLKSANYQVVLAKDGREALDICNGQKFDFILTDQNMPRM 59
M+ IL+ DD AAIR ++ L A Y V + + D ++TD MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 60 DGLTLIKSLRAMSAFMRTPIIMLTTEAGDDMKAQGKAAGATGWMVKPFDPQKLLAITAKV 119
+ L+ ++ P+++++ + + GA ++ KPFD +L+ I +
Sbjct: 61 NAFDLLPRIKKAR--PDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 120 LG 121
L
Sbjct: 119 LA 120


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Sbal195_2366  HTHFIS  65     3e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 65.2 bits (159), Expect = 3e-13
Identities = 27/122 (22%), Positives = 55/122 (45%)

Query: 11 ILVVDDDAIASQRISDFIHSKGYNVIVCNDLEEVFFEITQNTVDLILINYWLKDGTALAL 70
ILV DDDA ++ + GY+V + ++ ++ I DL++ + + D A L
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 71 LNKLNEEKQETPVIVMSETKESQNVLACFSMGVLDFVVKPINVEIFWYKVECLLSRVQLQ 130
L ++ + + + PV+VMS + G D++ KP ++ + L+ + +
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRR 125

Query: 131 HK 132

Sbjct: 126 PS 127


76  Sbal195_2601  Sbal195_2611  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_2601   3      15       -0.391856  phosphoenolpyruvate synthase
Sbal195_2602   1      13       -0.499370  hypothetical protein
Sbal195_2603   1      14       -0.601996  phospho-2-dehydro-3-deoxyheptonate aldolase
Sbal195_2604  -1      12       -0.444441  glycoside hydrolase
Sbal195_2605  -1      11       -1.063976  thioesterase superfamily protein
Sbal195_2606  -1      22       -3.131565  two component LuxR family transcriptional
Sbal195_2607  -1      22       -3.187219  transcriptional regulator CysB
Sbal195_2608   0      24       -3.369125  hypothetical protein
Sbal195_2609  -1      23       -3.286541  DNA topoisomerase I
Sbal195_2610   0      26       -4.070267  succinylarginine dihydrolase
Sbal195_2611   1      28       -4.397701  Ig domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_2601  PHPHTRNFRASE  297    3e-93    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 297 bits (761), Expect = 3e-93
Identities = 111/418 (26%), Positives = 187/418 (44%), Gaps = 65/418 (15%)

Query: 384 QPGDVLVTDMTDPDWEPIMK-RASAIVTNRGGRTCHAAIIARELGVPAVVGCGDVTDRIK 442
+ ++ D+T D + K T+ GGRT H+AI++R L +PAVVG +VT++I+
Sbjct: 155 EETVIIAEDLTPSDTAQLNKQFVKGFATDIGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQ 214

Query: 443 NGQIVTVSCAEG---------DTGFIYEGKQEFEVISNRVDSLPELP--------MKIMM 485
+G +V V EG + E + FE L P +++
Sbjct: 215 HGDMVIVDGIEGIVIVNPTEEEVKAYEEKRAAFEKQKQEWAKLVGEPSTTKDGAHVELAA 274

Query: 486 NVGNPDRAFDFARLPNEGVGLARLEFIINRMIGIHPKALLEFNQQDAALQTEINEMIAGY 545
N+G P EG+GL R EF+ M L TE E Y
Sbjct: 275 NIGTPKDVDGVLANGGEGIGLYRTEFLY--MD-------------RDQLPTE-EEQFEAY 318

Query: 546 ESPVEFYIARLVEGIATIGSAFYPKKVIVRMSDFKSNEYANLVGGDRYEPEEENPMLGFR 605
+ V+ K V++R D ++ + + P+E NP LGFR
Sbjct: 319 KEVVQ---------------RMDGKPVVIRTLDIGGDKELSYL----QLPKELNPFLGFR 359

Query: 606 GASRYISESFRDCFALECEAIKRVRNDMGLKNVEVMIPFVRTVKEAEQVIGLLKEQGLER 665
+ + +D F + A+ R N++VM P + T++E Q +++E+ +
Sbjct: 360 AIRLCLEK--QDIFRTQLRALLRAS---TYGNLKVMFPMIATLEELRQAKAIMQEEKDKL 414

Query: 666 GKDG------LRVIMMCEVPSNALLADQFLEHFDGFSIGSNDLTQLTLGLDRDSGIISHL 719
+G + V +M E+PS A+ A+ F + D FSIG+NDL Q T+ DR + +S+L
Sbjct: 415 LSEGVDVSDSIEVGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAADRMNERVSYL 474

Query: 720 FDERDEAVKMLLSLAIKAAKTKGAYIGICGQGPSDHADFAAWLVEQGIDTVSLNPDTV 777
+ A+ L+ + IKAA ++G ++G+CG+ D L+ G+D S++ ++
Sbjct: 475 YQPYHPAILRLVDMVIKAAHSEGKWVGMCGEMAGDE-VAIPLLLGLGLDEFSMSATSI 531


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
Sbal195_2604  MICOLLPTASE  36     0.001    Microbial collagenase metalloprotease (M9) signature.
		>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature.

Length = 1104

Score = 35.8 bits (82), Expect = 0.001
Identities = 34/162 (20%), Positives = 58/162 (35%), Gaps = 28/162 (17%)

Query: 81 WENKGVCDGAQNQAPTLVILQPQNNVSVNLGDVVLLQADAS-DVDGTVASVNW-FANGQA 138
D N+ P VI +++ SV + + + S D DG + + W F +G+
Sbjct: 761 MNTDTNTDVHVNKEPKAVI---KSDSSVIVEEEINFDGTESKDEDGEIKAYEWDFGDGEK 817

Query: 139 VTSPWTT---NAIGSVQLKAVATDDKGATTEKSVVLTVINPTSENLPPMIEILLLVNDSA 195
T N G ++K TD+ G +S + V+ + +I N+S
Sbjct: 818 SNEAKATHKYNKTGEYEVKLTVTDNNGGINTESKKIKVV---EDKPVEVI------NESE 868

Query: 196 VNVGDSVTITANASDPDTGDSITKVEFYLDSQLIATDNSAPY 237
N +D + + I K + L D S Y
Sbjct: 869 PN-----------NDFEKANQIAKSNMLVKGTLSEEDYSDKY 899


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_2605  TYPE3OMGPROT  29     0.007    Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature.

Length = 607

Score = 29.1 bits (65), Expect = 0.007
Identities = 13/44 (29%), Positives = 25/44 (56%), Gaps = 1/44 (2%)

Query: 79 VTVSSDRIDFKKPIPAGTLAELIARVIHVGNTSLKVEVNIYVED 122
V V+ + K I GT+ + RV+ G+ S ++ +N+++ED
Sbjct: 383 VKVTGKEVAELKGITYGTMLRMTPRVLTQGDKS-EISLNLHIED 425


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Sbal195_2606  HTHFIS  79     2e-19    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.5 bits (196), Expect = 2e-19
Identities = 32/111 (28%), Positives = 54/111 (48%), Gaps = 6/111 (5%)

Query: 8 IIIADDHPLFRNALRQALTTAFEHAQWFEADSAEALQSVL-DVRSIDYDLVLLDLQMPGS 66
I++ADD R L QAL+ A ++ ++ + + D DLV+ D+ MP
Sbjct: 6 ILVADDDAAIRTVLNQALSRAG-----YDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 67 HGYSTLIHLRSHYPDLPVVVISAHEDINTISRAIHYGSSGFIPKSASMETL 117
+ + L ++ PDLPV+V+SA T +A G+ ++PK + L
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTEL 111


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Sbal195_2611  INTIMIN  42     1e-05    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 42.4 bits (99), Expect = 1e-05
Identities = 48/204 (23%), Positives = 80/204 (39%), Gaps = 17/204 (8%)

Query: 30 GGTTPTPGVVTVTLSISNSDSVSVATPAEVKATVVDSKTGPLAGVVVSFKLDNDALGSFT 89
G GV T +++ + ATV + A V VSF + + G+
Sbjct: 552 GQVVDQVGVTDFTADKTSAKADG-TEAITYTATVKKNGV-AQANVPVSFNIVS---GTAV 606

Query: 90 PSTGTQLTDSSGVATVKLDTATLAGAGNVTASVASGASITKGFYSKGDGVVQPGTGNKLK 149
S + T+ SG ATV L + G V+A A S + V+
Sbjct: 607 LSANSANTNGSGKATVTLKSDKP-GQVVVSAKTAEMTSAL-----NANAVIFVDQTKASI 660

Query: 150 LSLQNVQGQTVTKISSAVPGTVSAIYTNGSDEPLVGKVITFTSNLGKFSPQSGTALTNAQ 209
++ + V A+ TV + D+P+ + +TFT+ LGK S + T+
Sbjct: 661 TEIKADKTTAVANGQDAITYTVKVMK---GDKPVSNQEVTFTTTLGKLSNSTEK--TDTN 715

Query: 210 GLAKIAITAGPVAGAGNIIAKVDE 233
G AK+ +T+ G + A+V +
Sbjct: 716 GYAKVTLTST-TPGKSLVSARVSD 738



Score = 40.1 bits (93), Expect = 6e-05
Identities = 64/299 (21%), Positives = 98/299 (32%), Gaps = 40/299 (13%)

Query: 378 TGLPTTNVSAAQPSKVTVTL---VDKDATPLVGKVVSFSSSLGNFLPTKGTALTDSIGRA 434
T SA +T V K+ VSF+ G + + +A T+ G+A
Sbjct: 561 TDFTADKTSAKADGTEAITYTATVKKNGVAQANVPVSFNIVSGTAVLSANSANTNGSGKA 620

Query: 435 SITLTAGSIEGAGEVTASY--GTAKAIVGFVTAGDDIDPIEASPEISFDIYDCNGVAAWD 492
++TL + G V+A T+ V D + NG A
Sbjct: 621 TVTLKSDKP-GQVVVSAKTAEMTSALNANAVIFVDQTKASITEIKADKTTAVANGQDAIT 679

Query: 493 KTLKNFEVCKITDNITNDKPGIIGAKVTRSGSTQALQQVLVTAATTLGAISPNSGTAITN 552
T+K + DKP + +VT + TTLG ++ T T+
Sbjct: 680 YTVKVMK---------GDKP-VSNQEVTFT--------------TTLGK--LSNSTEKTD 713

Query: 553 ADGKAILDLYANGNVGAGEVSLKVKD-ATSTKAFEI---GRVNISLDIKTSVGNNSLPAG 608
+G A + L + G VS +V D A KA E+ + I VG
Sbjct: 714 TNGYAKVTLTST-TPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNIEIVGTGVKGKL 772

Query: 609 GSTIVEVTVFNPDGSLSTGQPFTLEFSSECVAAGKAVIDSPIVTNAGKGYSTYRSTGCS 667
+ ++ N S G+ + S A S VT KG +T
Sbjct: 773 PTVWLQYGQVNLKASGGNGK---YTWRSANPAIASVDASSGQVTLKEKGTTTISVISSD 828


77  Sbal195_2811  Sbal195_2819  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_2811  -3      18        1.190472  IucA/IucC family protein
Sbal195_2812  -2      16        0.156259  TonB-dependent siderophore receptor
Sbal195_2813  -1      16       -1.231133  ferric iron reductase
Sbal195_2814   0      14       -2.704820  intracellular septation protein A
Sbal195_2815  -1      13       -0.106956  YciI-like protein
Sbal195_2816  -2      12       -0.032955  exonuclease III
Sbal195_2817  -2      13        1.717319  hypothetical protein
Sbal195_2818  -2      14        2.684858  integrase catalytic subunit
Sbal195_2819  -1      14        3.120893  transposase IS3/IS911 family protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Sbal195_2811  PF04183  625    0.0      IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 625 bits (1614), Expect = 0.0
Identities = 168/593 (28%), Positives = 291/593 (49%), Gaps = 22/593 (3%)

Query: 42 LTPAYWQAANRHLVKKILCEFTHEKIITPTLYGQKAGLNHYELRLKNSTYYFSARHYQLD 101
+ W NR LV K+L E +E++ + G + Y + L + + F A
Sbjct: 1 MNHKDWDLVNRRLVAKMLSELEYEQVFHA----ESQGDDRYCINLPGAQWRFIAERGIWG 56

Query: 102 HLAIDADSIRVSVAGQEQALDAMSLIISLKNDLGISETLLPTYLEEITSTLYSKAYKL-A 160
L IDA ++R ++ + A +L++ LK L +S+ + +++++ +TL L A
Sbjct: 57 WLWIDAQTLRC----ADEPVLAQTLLMQLKQVLSMSDATVAEHMQDLYATLLGDLQLLKA 112

Query: 161 HQAIPAATLARADYQSIEAGMTEGHPVFIANNGRIGFDMQDYRQFAPESAMPMQLVWLGV 220
+ + A+ L + ++ + GHP F+ N GR G+ + ++APE A +L WL V
Sbjct: 113 RRGLSASDLINLNADRLQC-LLSGHPKFVFNKGRRGWGKEALERYAPEYANTFRLHWLAV 171

Query: 221 RKSKTTFAALENLSHDALLKEELG-QQFTDFQQRLKAQQHDPQDFYFMPVHPWQWREKIA 279
++ + + LL + Q+F F Q + D ++ +PVHPWQW++KIA
Sbjct: 172 KREHMIWRCDNEMDIHQLLTAAMDPQEFARFSQVWQENGLD-HNWLPLPVHPWQWQQKIA 230

Query: 280 RVFAGDIARGDLVYLGEGSQQYQVQQSIRTFFNLSSPQKCYVKTALSILNMGFMRGLSPL 339
F D A G +V LGE Q+ QQS+RT N S +K L+I N RG+
Sbjct: 231 TDFIADFAEGRMVSLGEFGDQWLAQQSLRTLTNASRRGGLDIKLPLTIYNTSCYRGIPGR 290

Query: 340 YMSCTPQINAWVANLVESDPYFTQQGFVILKEIAAIGYHHHYYEQALTQDSAYKKMLSAL 399
Y++ P + W+ + +D Q G VIL E AA H Y Y++ML +
Sbjct: 291 YIAAGPLASRWLQQVFATDATLVQSGAVILGEPAAGYVSHEGYAALARAPYRYQEMLGVI 350

Query: 400 WRESPLPHIAPKQNLMTMAALLHTDHEDKALISALITASGLPAKDWLSRYLNLYLSPLLH 459
WRE+P + P ++ + MA L+ D ++ L A I SGL A+ WL++ + + PL H
Sbjct: 351 WRENPCRWLKPDESPVLMATLMECDENNQPLAGAYIDRSGLDAETWLTQLFRVVVVPLYH 410

Query: 460 AFFAYDLVFMPHGENLILVLDEYVPVKILMKDIGEEVAVLNGTSP----LPDDVKRLAVS 515
Y + + HG+N+ L + E VP ++L+KD ++ ++ P LP +V+ +
Sbjct: 411 LLCRYGVALIAHGQNITLAMKEGVPQRVLLKDFQGDMRLVKEEFPEMDSLPQEVRDVTSR 470

Query: 516 LEEEMKLNYILLDIFDCIFRYLAPLLDEQTSVSESQFWELVADNVRDYQAQHPHLAAKFA 575
L + ++ + F + R+++PL+ + V E +F++L+A + DY +HP ++ +FA
Sbjct: 471 LSADYLIHDLQTGHFVTVLRFISPLMV-RLGVPERRFYQLLAAVLSDYMKKHPQMSERFA 529

Query: 576 QYDLFKDSFVRTCLNRIQLNNNQQMIDLADREKNL-RFAGGIDNPLAAFRQSH 627
+ LF+ +R LN ++L DL + L + + NPL Q +
Sbjct: 530 LFSLFRPQIIRVVLNPVKL----TWPDLDGGSRMLPNYLEDLQNPLWLVTQEY 578


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_2812  PRTACTNFAMLY  30     0.034    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 30.4 bits (68), Expect = 0.034
Identities = 31/130 (23%), Positives = 42/130 (32%), Gaps = 21/130 (16%)

Query: 231 DSGSVRGRVVAAYQDKDSFQDRYEQQRTTLYGIVETDIGDSTLFTLGVDYQDATPSGTMS 290
D+G GR A Q D+ R Q + G F LG D+ A G
Sbjct: 645 DAGGAWGRGFAQRQQLDNRAGRRFDQ--KVAG-----------FELGADHAVAVAGGRWH 691

Query: 291 GGLPLFYSDGSRTNYDRATSTAPDWGSAHTQGLNTFASLEHRFDNGWNLKGTYTYGDNSL 350
G Y+ G R G HT ++ + D+G+ L T
Sbjct: 692 LGGLAGYTRGDRG--------FTGDGGGHTDSVHVGGYATYIADSGFYLDATLRASRLEN 743

Query: 351 KFDVLWATGY 360
F V + GY
Sbjct: 744 DFKVAGSDGY 753


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_2813  2FE2SRDCTASE  111    1e-30    Ferric iron reductase signature.
		>2FE2SRDCTASE#Ferric iron reductase signature.

Length = 262

Score = 111 bits (279), Expect = 1e-30
Identities = 76/322 (23%), Positives = 113/322 (35%), Gaps = 108/322 (33%)

Query: 45 LSATSSPTGLSFDEWSCADTYQTLLARFAIAHPSAETLTETKTENETETDKTAATSQTPA 104
L + ++ +WS + +LLA +
Sbjct: 46 LDEPAPLNAMTLAQWSSPNVLSSLLAVY-------------------------------- 73

Query: 105 FASKSVLVSKNAGANRKDTRKALYSLWGQWYFGLLVPPMMEWIFNAPQTDLKSIHWQPQS 164
+ R++ K L SLW QWY GL+VPP+M + + + P+
Sbjct: 74 ---SDHIYRNQPMMIREN--KPLISLWAQWYIGLMVPPLMLALLTQEKA----LDVSPEH 124

Query: 165 IFMQLHPSGRVAKFEFNIAKHQPNTALTFKKHHGIEPLCQINTKPSIKIDTEEHSPLSPY 224
+ H +GRVA F ++ + + T HSP
Sbjct: 125 FHAEFHETGRVACFWVDVCEDKNAT---------------------------PHSPQHRM 157

Query: 225 KPPVDKELVLQGFILNLLQPSVERLLTLSPVPVKLYWSHLGYLIHWYLGELG--LTEQHS 282
+ I L P V+ L + KL WS+ GYLI+WYL E+ L E
Sbjct: 158 ----------ETLISQALVPVVQALEATGEINGKLIWSNTGYLINWYLTEMKQLLGEATV 207

Query: 283 QQLKQALFRRTTFLDGSTNPLYNSINLLIEPERDSATPNTVARIVTSTASRSKPSPKIHC 342
+ L+ ALF T +G NPL+ ++ L RD
Sbjct: 208 ESLRHALFFEKTLTNGEDNPLWRTVVL-----RDGLL----------------------- 239

Query: 343 IRRTCCLRYQLANTGQCHDCPL 364
+RRTCC RY+L + QC DC L
Sbjct: 240 VRRTCCQRYRLPDVQQCGDCTL 261


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
Sbal195_2815  adhesinmafb  25     0.042    Neisseria meningitidis: adhesin MafB signature.
		>adhesinmafb#Neisseria meningitidis: adhesin MafB signature.

Length = 467

Score = 25.4 bits (55), Expect = 0.042
Identities = 9/44 (20%), Positives = 14/44 (31%)

Query: 54 AGFSGSLVVADFESLVAAKHWADADPYIEAGVYKSVVVKPFKRV 97
G GS+ + + A W +P V V +V
Sbjct: 279 IGGLGSVAGFEKNTREAVDRWIQENPNAAETVEAVFNVAAAAKV 322


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Sbal195_2819  HTHFIS  26     0.043    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 25.6 bits (56), Expect = 0.043
Identities = 10/59 (16%), Positives = 21/59 (35%), Gaps = 6/59 (10%)

Query: 7 HKSYPQAFKDEAVLMVLEQ-GYSVADAAKSLGVSTSLLYNWKEKHQALQQGITLEESER 64
+ + +L L + AA LG++ + L + G+++ S R
Sbjct: 429 YDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIREL-----GVSVYRSSR 482


78  Sbal195_2992  Sbal195_2998  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_2992  -1      13       -0.013138  peptidyl-dipeptidase Dcp
Sbal195_2993  -2      10       -0.388040  Na+/H+ antiporter NhaC
Sbal195_2994  -2      14        0.295860  electron transfer flavoprotein subunit alpha
Sbal195_2995  -2      13        0.034692  electron transfer flavoprotein subunit
Sbal195_2996  -1      14        0.592596  histone family protein nucleoid-structuring
Sbal195_2997  -2      14        1.104322  amidohydrolase
Sbal195_2998  -1      11        2.106002  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
Sbal195_2992  BACINVASINB  30     0.038    Salmonella/Shigella invasin protein B signature.
		>BACINVASINB#Salmonella/Shigella invasin protein B signature.

Length = 593

Score = 30.1 bits (67), Expect = 0.038
Identities = 47/204 (23%), Positives = 74/204 (36%), Gaps = 29/204 (14%)

Query: 157 RTNLGLTPEAVRLVEVYHQRFIMAGAKLTDEQKVKIRALNEEQSTLTNE--FSQRLLRLT 214
+T LG EA L E ++ A + K +A N+ QS + ++Q +
Sbjct: 130 QTALGEAQEATDLYEASIKKTDTAKSVYDAATKKLTQAQNKLQSLDPADPGYAQAEAAVE 189

Query: 215 KEIAVVVESKNELAGLTDSDITSAANDAKAAGHDGKYLINITNTTRQPVLASLENRELRQ 274
+ E+K L TD+ A DAKA ++ T
Sbjct: 190 QAGKEATEAKEALDKATDA-TVKAGTDAKAKAEKADNILTKFQGTAN------------- 235

Query: 275 RIWEASANRGLTGENETASLVSRLAQLRAERAALLGYENWASYRLAPQMAKTPEAVYSMF 334
AS N+ GE + S V+RL L A ++G S L +A +F
Sbjct: 236 ---AASQNQVSQGEQDNLSNVARLTMLMAMFIEIVGKNTEES--LQNDLA--------LF 282

Query: 335 GSMVPAVVANTEKEAADIQAMIDK 358
++ A EK++A+ Q K
Sbjct: 283 NALQEGRQAEMEKKSAEFQEETRK 306


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_2996  PYOCINKILLER  28     0.010    Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 28.2 bits (62), Expect = 0.010
Identities = 17/77 (22%), Positives = 29/77 (37%), Gaps = 13/77 (16%)

Query: 20 ELSVEELRDLADKLDK-------ILVERESMAEEEEQAMAARNAKIEEIRQQMEAVG--- 69
+L E + L +++ I + A E+ A A R A EE +Q A+
Sbjct: 191 KLFTEAISSLQIRMNTLTAAKASIEAAAANKAREQAAAEAKRKA--EEQARQQAAIRAAN 248

Query: 70 -LSIDDLGGVAVKATAK 85
++ G V A +
Sbjct: 249 TYAMPANGSVVATAAGR 265


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Sbal195_2997  UREASE  42     8e-06    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 42.4 bits (100), Expect = 8e-06
Identities = 16/29 (55%), Positives = 19/29 (65%)

Query: 919 TINPAKQLRVDEFVGSLTPGKMADIVLWN 947
TINPA + +GSL GK AD+VLWN
Sbjct: 410 TINPAIAHGLSHEIGSLEVGKRADLVLWN 438



Score = 34.3 bits (79), Expect = 0.003
Identities = 34/124 (27%), Positives = 48/124 (38%), Gaps = 39/124 (31%)

Query: 585 IKNATLWTSDKQGILEHADLLMANGRIEKIGQ----------QLSTPSGYQVLDATGKHL 634
I NA + D GI++ AD+ + +GRI IG+ + G +V+ GK +
Sbjct: 72 ITNALI--LDHWGIVK-ADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGKIV 128

Query: 635 TAGIVDEHSHIAINGGTNEGTDAVTSEVRIGDIINPDDISIYRALAGGVTSAQLLHGSAN 694
TAG +D H H I P I AL G+T +L G
Sbjct: 129 TAGGMDSHIH----------------------FICPQ--QIEEALMSGLTC--MLGGGTG 162

Query: 695 PIGG 698
P G
Sbjct: 163 PAHG 166


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
Sbal195_2998  TYPE4SSCAGX  31     0.012    Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein signature.

Length = 522

Score = 30.9 bits (69), Expect = 0.012
Identities = 18/59 (30%), Positives = 33/59 (55%), Gaps = 5/59 (8%)

Query: 201 PEDEKEKTKAIEKNQQAIDQLNIAFEDGYRYFLSDKAKGKDN-----NKDEDPQSATNN 254
P++ +E+ KA+EK ++A +Q A +D ++AK + N N +PQ+ +NN
Sbjct: 138 PKELEEQKKALEKEKEAKEQAQKAQKDKREKRKEERAKNRANLENLTNAMSNPQNLSNN 196


79  Sbal195_3034  Sbal195_3041  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_3034  -1      25       -4.181557  S23 ribosomal protein
Sbal195_3035   0      26       -2.617578  polysaccharide export protein
Sbal195_3036   2      25       -2.092890  transcriptional activator RfaH
Sbal195_3037   2      24       -1.720422  amino acid/peptide transporter
Sbal195_3038   2      27       -1.941041  response regulator receiver protein
Sbal195_3039   3      28       -1.757423  VacJ family lipoprotein
Sbal195_3040   4      28       -1.394669  hypothetical protein
Sbal195_3041   2      24       -2.610506  FlhB domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
Sbal195_3034  ECOLIPORIN  26     0.032    E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 26.4 bits (58), Expect = 0.032
Identities = 23/98 (23%), Positives = 46/98 (46%), Gaps = 13/98 (13%)

Query: 4 QKLEVWQ--LSYELSSSIYIATKDLRDWGFRDQITRSGLSVPSN---IAEGMERYGAKEQ 58
K + W L Y+ +++IY+AT + +T G + +A + + Q
Sbjct: 243 DKADAWTAGLKYD-ANNIYLATM----YSETRNMTPYGKTDKGYDGGVANKTQNFEVTAQ 297

Query: 59 IQFLYIAKASLAELITQAMIGKDIGYLEPNYVDELLIK 96
QF + + +++ L+++ GKD+ Y N D+ L+K
Sbjct: 298 YQFDFGLRPAVSFLMSK---GKDLTYNNVNGDDKDLVK 332


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Sbal195_3038  HTHFIS  90     8e-22    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 89.9 bits (223), Expect = 8e-22
Identities = 29/114 (25%), Positives = 49/114 (42%), Gaps = 4/114 (3%)

Query: 8 VLLVEDDPVFRQIVASFLDTRGAQVTQACDGEEGLSLFKSQHFDIVLADLSMPKLGGLDM 67
+L+ +DD R ++ L G V + + D+V+ D+ MP D+
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 68 LKEMTRLAPLVPSVVISGNNVMADVVEALRIGASDYLVKPVSDLFIIEQAIKQS 121
L + + P +P +V+S N ++A GA DYL KP F + + I
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKP----FDLTELIGII 115


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3039  VACJLIPOPROT  229    1e-77    VacJ lipoprotein signature.
		>VACJLIPOPROT#VacJ lipoprotein signature.

Length = 251

Score = 229 bits (586), Expect = 1e-77
Identities = 85/222 (38%), Positives = 128/222 (57%), Gaps = 4/222 (1%)

Query: 44 PRDPFEGFNRAMWDFNYLFLDRYLYRPVAHGYNDYIPMPAKTGVNNFVQNLEEPSSLVNN 103
DP EGFNR M++FN+ LD Y+ RPVA + DY+P PA+ G++NF NLEEP+ +VN
Sbjct: 28 RSDPLEGFNRTMYNFNFNVLDPYIVRPVAVAWRDYVPQPARNGLSNFTGNLEEPAVMVNY 87

Query: 104 VLQGKWGWAANAGGRFTINSTVGLLGVIDVADMMGMSRKQDE---FNEVLGYYGVPNGPY 160
LQG RF +N+ +G+ G IDVA M ++ E F LG+YGV GPY
Sbjct: 88 FLQGDPYQGMVHFTRFFLNTILGMGGFIDVAGMANPKLQRTEPHRFGSTLGHYGVGYGPY 147

Query: 161 FMAPFAGPYVVRELASDWVDGLYFPLSELTIWQTIVKWGLKNLHSRASAIDQERLVDNAL 220
PF G + +R+ D D LY LS LT ++ KW L+ + +RA +D + L+ +
Sbjct: 148 VQLPFYGSFTLRDDGGDMADALYPVLSWLTWPMSVGKWTLEGIETRAQLLDSDGLLRQSS 207

Query: 221 DPYAFVKDAYLQHMDYKVYDGNV-PQKQDDDELLDQYMQELE 261
DPY V++AY Q D+ G + PQ+ + + + +++++
Sbjct: 208 DPYIMVREAYFQRHDFIANGGELKPQENPNAQAIQDDLKDID 249


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3040  CHANLCOLICIN  33     0.005    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 33.1 bits (75), Expect = 0.005
Identities = 68/359 (18%), Positives = 129/359 (35%), Gaps = 26/359 (7%)

Query: 278 GIPLSNTNKGPVTNLNGSSGSSSSLNSQTQATQATQATQATQATQATQATQATQATQATQ 337
G+P + + +T LNG+ S S + ++++ A AT A +T + TQA Q
Sbjct: 11 GVPYDDKGQVIITLLNGTPDGSGSGGGGGKGGSKSESSAAIHAT-AKWSTAQLKKTQAEQ 69

Query: 338 ATQATQATQATQATQATQATQATQ-------------ATQATQATQATQATQATQATQAT 384
A +A A +A QA TQ A++ AT+ A A +
Sbjct: 70 AARAKAAAEA-QAKAKANRDALTQRLKDIVNEALRHNASRTPSATELAHANNAAMQAEDE 128

Query: 385 QATQATQATQATQATQATQATQATQATQATQATQATQATQATQATQATQATQATQATKTN 444
+ A +A + +A A +A Q + + + + + + +A + A +
Sbjct: 129 RLRLAKAEEKARKEAEA--AEKAFQEAEQRRKEIEREKAETERQLKLAEAEEKRLAALSE 186

Query: 445 DAIPVKVTMPTMLSARGSNQSLATPSVLINSTQSQINQPSSATATIEQTTRNSSPLGFSL 504
+A V++ + +A+ + +NS S A RN L
Sbjct: 187 EAKAVEIAQKKLSAAQSEVVKMDGEIKTLNSRLSSSIHARDAEMKTLAGKRN------EL 240

Query: 505 ATASLNVPSQDPKVNNVLVMQN-PKSLAPTPPLTNVATNIGAQNEEAVEEI-AAVSPKNI 562
A AS D V + N P P T G EE +++ A+ + N
Sbjct: 241 AQASAKYKELDELVKKLSPRANDPLQNRPFFEATRRRVGAGKIREEKQKQVTASETRINR 300

Query: 563 LGLNTQKNERHGNDTKTDSTMKVADVLQKAFN-KAGALPVELSRSNNSSNLASELLKHL 620
+ + + ++ + + +A V + N K + S+ ++ + + L
Sbjct: 301 INADITQIQKAISQVSNNRNAGIARVHEAEENLKKAQNNLLNSQIKDAVDATVSFYQTL 359


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3041  TYPE3IMSPROT  56     7e-13    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 56.3 bits (136), Expect = 7e-13
Identities = 16/93 (17%), Positives = 34/93 (36%), Gaps = 9/93 (9%)

Query: 10 AVALSYDGRN--APKIVATGEGLIAEEIIALAKANGVYIHQDPHLSHFL-QLLELGEEIP 66
A+ + Y P + + + +A+ GV I Q L+ L + IP
Sbjct: 268 AIGILYKRGETPLPLVTFKYTDAQVQTVRKIAEEEGVPILQRIPLARALYWDALVDHYIP 327

Query: 67 KELYLLIAELIAFVYMLDGKFPEQWNNMHQKIV 99
E AE++ ++ + + H +++
Sbjct: 328 AEQIEATAEVLRWLERQNIE------KQHSEML 354


80  Sbal195_3047  Sbal195_3077  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_3047   0      21       -2.009658  chemotaxis-specific methylesterase
Sbal195_3048   0      21       -1.972630  signal transduction histidine kinase CheA
Sbal195_3049   0      20       -2.406252  chemotaxis phosphatase CheZ
Sbal195_3050  -1      18       -1.886002  response regulator receiver protein
Sbal195_3051  -2      18       -1.647564  flagellar biosynthesis sigma factor
Sbal195_3052  -1      17       -1.632434  cobyrinic acid ac-diamide synthase
Sbal195_3053  -2      16       -1.854221  flagellar biosynthesis regulator FlhF
Sbal195_3054  -1      17       -2.289116  flagellar biosynthesis protein FlhA
Sbal195_3055   0      21       -2.744804  flagellar biosynthesis protein FlhB
Sbal195_3056   1      19       -3.031767  flagellar biosynthesis protein FliR
Sbal195_3057   1      20       -3.356094  flagellar biosynthetic protein FliQ
Sbal195_3058   5      22       -3.858769  flagellar biosynthesis protein FliP
Sbal195_3059   5      21       -3.828095  flagellar biosynthesis protein FliO
Sbal195_3060   1      18       -1.867338  flagellar motor switch protein
Sbal195_3061   1      17       -1.489266  flagellar motor switch protein FliM
Sbal195_3062   1      14       -1.106990  flagellar basal body-associated protein FliL
Sbal195_3063   1      14       -0.622103  flagellar hook-length control protein
Sbal195_3064   0      14        0.998937  flagellar export protein FliJ
Sbal195_3065  -1      15        0.942223  flagellum-specific ATP synthase
Sbal195_3066   0      14       -0.236601  flagellar assembly protein H
Sbal195_3067  -1      15       -0.561607  flagellar motor switch protein G
Sbal195_3068  -1      18       -1.253247  flagellar MS-ring protein
Sbal195_3069   0      20       -2.739816  flagellar hook-basal body complex subunit FliE
Sbal195_3070   1      27       -4.571909  Fis family two component sigma54 specific
Sbal195_3071   2      32       -5.939883  PAS/PAC sensor signal transduction histidine
Sbal195_3072   3      35       -6.623656  sigma-54 dependent transcriptional regulator
Sbal195_3073   4      42       -8.100477  flagellar protein FliS
Sbal195_3074   3      34       -6.493380  hypothetical protein
Sbal195_3075   3      31       -5.567332  flagellar hook-associated 2 domain-containing
Sbal195_3076   1      24       -2.805707  flagellar protein FlaG protein
Sbal195_3077   0      23       -1.985680  flagellin domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Sbal195_3047  HTHFIS  66     5e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 66.4 bits (162), Expect = 5e-14
Identities = 31/168 (18%), Positives = 65/168 (38%), Gaps = 9/168 (5%)

Query: 2 AIKVLVVDDSSFFRRRVSEIVNQDPELEVIATASNGAEAVKMAAELNPQVITMDIEMPVM 61
+LV DD + R +++ +++ + SN A + A + ++ D+ MP
Sbjct: 3 GATILVADDDAAIRTVLNQALSR--AGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 62 DGITAVREIMAKCP-TPILMFSSLTHDGAKATLDALDAGALDFLPKRFEDIATNKDDAIL 120
+ + I P P+L+ S+ + + A + GA D+LPK F D+
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSA--QNTFMTAIKASEKGAYDYLPKPF-DLTELIGIIGR 117

Query: 121 LLQQRVKALGRRRMFRPIARPVVASTPSVRPTSSVLGTTSIASHTPAT 168
L + + + P+V + +++ + + T T
Sbjct: 118 ALAEPKRRPSKLEDDSQDGMPLVGRSAAMQ---EIYRVLARLMQTDLT 162


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Sbal195_3048  PF06580  45     6e-07    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 45.2 bits (107), Expect = 6e-07
Identities = 18/105 (17%), Positives = 37/105 (35%), Gaps = 23/105 (21%)

Query: 430 TLNKEIDLIMV---------GEETDLDKNLVEALADPLVH------LVRNSVDHGIEMPN 474
+L E+ ++ + + + A+ D V LV N + HGI
Sbjct: 217 SLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVENGIKHGIA--- 273

Query: 475 EREANGKPRTGTITLSASQEGDHILLKIEDDGAGMDPEKLKKIAI 519
P+ G I L +++ + L++E+ G+ +
Sbjct: 274 -----QLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGT 313


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Sbal195_3050  HTHFIS  90     3e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 89.9 bits (223), Expect = 3e-24
Identities = 33/105 (31%), Positives = 51/105 (48%), Gaps = 3/105 (2%)

Query: 6 KILIVDDFSTMRRIIKNLLRDLGFNNTQEADDGSTALPMLQKGDFDFVVTDWNMPGMQGI 65
IL+ DD + +R ++ L G++ + +T + GD D VVTD MP
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 66 DLLKAIRADDSLKHLPVLMVTAEAKREQIIAAAQAGVNGYVVKPF 110
DLL I+ LPVL+++A+ I A++ G Y+ KPF
Sbjct: 64 DLLPRIKKAR--PDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPF 106


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Sbal195_3053  PF05272  31     0.013    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.8 bits (69), Expect = 0.013
Identities = 9/25 (36%), Positives = 12/25 (48%)

Query: 240 VKQGGVVALVGPTGVGKTTSLAKLA 264
K V L G G+GK+T + L
Sbjct: 593 CKFDYSVVLEGTGGIGKSTLINTLV 617


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3055  TYPE3IMSPROT  331    e-114    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 331 bits (851), Expect = e-114
Identities = 93/347 (26%), Positives = 179/347 (51%), Gaps = 2/347 (0%)

Query: 6 SGERSEEPTGRRLEQAREKGQIARSKELGTAAVLISAACGFYMLGPSLATSLTRVFETVF 65
SGE++E+PT +++ AR+KGQ+A+SKE+ + A++++ + L +++ +
Sbjct: 2 SGEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLM--LI 59

Query: 66 TMDRAQIFDTEEMFNVWGVVASEIAWPMAKIMLLIVVVAFIGNVALGGMNFSTQAMMPKA 125
+++ + ++ + V V E + ++ + ++A +V G S +A+ P
Sbjct: 60 PAEQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDI 119

Query: 126 SKMSPAAGLKRMFGVQALVELTKGIAKFSVVAFSAYLLLSFYFNDIMLLSSDHLPGNVYH 185
K++P G KR+F +++LVE K I K +++ ++++ ++ L + +
Sbjct: 120 KKINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPL 179

Query: 186 ALDLLVWMFILLCSSILLIVVIDVPFQIWNHNKQLKMTKQEVKDEYKDTEGKPEVKGRVR 245
+L + ++ ++I + D F+ + + K+LKM+K E+K EYK+ EG PE+K + R
Sbjct: 180 LGQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRR 239

Query: 246 QMQRELAQRRMMTEVPNADVIVVNPEHFAVAIKYDVQRSAAPFVIAKGVDDVAFKIREIA 305
Q +E+ R M V + V+V NP H A+ I Y + P V K D +R+IA
Sbjct: 240 QFHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIA 299

Query: 306 REHNIAIVSAPPLARAIYHTTKLDQQIPEGLFTAVAQILAYVFQLRQ 352
E + I+ PLARA+Y +D IP A A++L ++ +
Sbjct: 300 EEEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQNI 346


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3056  TYPE3IMRPROT  124    1e-36    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.

Length = 261

Score = 124 bits (313), Expect = 1e-36
Identities = 93/243 (38%), Positives = 143/243 (58%), Gaps = 1/243 (0%)

Query: 15 YMWPLFRVASMLMVMVVFGAATTPSRVRLLLAMAITFAIAPVLPPVQNADLFSLSAVFIT 74
Y WPL RV +++ + + P RV+L LAM ITFAIAP LP + +FS A+++
Sbjct: 16 YFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLP-ANDVPVFSFFALWLA 74

Query: 75 AQQIIIGVAMGFVTQMVMQVFVLTGQIIGMQTSLGFASMVDPGSGQQTPVIGNFFLLLAT 134
QQI+IG+A+GF Q G+IIG+Q L FA+ VDP S PV+ +LA
Sbjct: 75 VQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLARIMDMLAL 134

Query: 135 LIFLAVDGHLLMIRMLVASFETLPISNQGLTLTSYRALADWGSYMFGAALTMSISAIIAL 194
L+FL +GHL +I +LV +F TLPI + L ++ AL GS +F L +++ I L
Sbjct: 135 LLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIFLNGLMLALPLITLL 194

Query: 195 LLVNLSFGVMTRAAPQLNIFSIGFPITMIGGLFILWLTLTPVMEHFDEVWAAAQVLLCDM 254
L +NL+ G++ R APQL+IF IGFP+T+ G+ ++ + + + +++ LL D+
Sbjct: 195 LTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFCEHLFSEIFNLLADI 254

Query: 255 LAL 257
++
Sbjct: 255 ISE 257


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3057  TYPE3IMQPROT  48     3e-11    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 48.2 bits (115), Expect = 3e-11
Identities = 21/78 (26%), Positives = 40/78 (51%)

Query: 4 EALIDIFREALAVIVMMVSAIVLPGLGIGLIVAVFQAATSINEQTLSFLPRLLVTLFGLM 63
+ L+ +AL +++++ + IGL+V +FQ T + EQTL F +LL L
Sbjct: 2 DDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLF 61

Query: 64 FMGHWLVETLMDFFVEMV 81
+ W E L+ + +++
Sbjct: 62 LLSGWYGEVLLSYGRQVI 79


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3058  FLGBIOSNFLIP  278    4e-97    Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature.

Length = 245

Score = 278 bits (712), Expect = 4e-97
Identities = 126/244 (51%), Positives = 184/244 (75%), Gaps = 3/244 (1%)

Query: 4 RMLALVGLVILLCMPSAWAADGVLPAVTVTTGPDGSTEYSVTMQILLLMTSLSFLPAMLI 63
R+L++ +++ L P A+A LP +T P G +S+ +Q L+ +TSL+F+PA+L+
Sbjct: 3 RLLSVAPVLLWLITPLAFAQ---LPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILL 59

Query: 64 MLTSFTRIIIVLSILRQAIGLQQTPSNQVLIGMSLFMTFFIMAPVFDRIYDEGVKPYIEE 123
M+TSFTRIIIV +LR A+G P NQVL+G++LF+TFFIM+PV D+IY + +P+ EE
Sbjct: 60 MMTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEE 119

Query: 124 QLTLQQAFEKGKEPLKGFMLGQVRTTDLKTFIEISGYKNIKSPEEAPMSVLIPAFITSEL 183
++++Q+A EKG +PL+ FML Q R DL F ++ ++ PE PM +L+PA++TSEL
Sbjct: 120 KISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSEL 179

Query: 184 KTAFQIGFMLFVPFLVLDLVVASILMAMGMMMLSPMIVSLPFKIMLFVLVDGWSLVLGTL 243
KTAFQIGF +F+PFL++DLV+AS+LMA+GMMM+ P ++LPFK+MLFVLVDGW L++G+L
Sbjct: 180 KTAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSL 239

Query: 244 ANSF 247
A SF
Sbjct: 240 AQSF 243


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3060  FLGMOTORFLIN  109    4e-34    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 109 bits (274), Expect = 4e-34
Identities = 53/119 (44%), Positives = 79/119 (66%)

Query: 7 DDWAAAMAEQALEEANAIELDELVDDSRPITKAEAAKLDTILDIPVTISMEVGRSYISIR 66
D WA A+ EQ + +D I+DIPV +++E+GR+ ++I+
Sbjct: 17 DLWADALNEQKATTTKSAADAVFQQLGGGDVSGAMQDIDLIMDIPVKLTVELGRTRMTIK 76

Query: 67 NLLQLNQGSVVELDRVAGEPLDVMVNGTLIAHGEVVVVNDKFGIRLTDVISQTERIKKL 125
LL+L QGSVV LD +AGEPLD+++NG LIA GEVVVV DK+G+R+TD+I+ +ER+++L
Sbjct: 77 ELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGVRITDIITPSERMRRL 135


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3061  FLGMOTORFLIM  251    1e-83    Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 251 bits (642), Expect = 1e-83
Identities = 87/326 (26%), Positives = 163/326 (50%), Gaps = 11/326 (3%)

Query: 1 MSDLLSQDEIDALLHGVDDVDDDDVDAVGE----DARSYDFSSQDRIVRGRMPTLEIVNE 56
M+++LSQDEID LL + D DA YDF D+ + +M TL +++E
Sbjct: 1 MTEVLSQDEIDQLLTAISSGDASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLMHE 60

Query: 57 RFARHLRISMFNMMRRAAEVSINGVQMLKFGEYVHTLFVPTSLNMVRFSPLKGTALITME 116
FAR S+ +R V + V L + E++ ++ P++L ++ PLKG A++ ++
Sbjct: 61 TFARLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVLEVD 120

Query: 117 ARLVFILVDNFFGGDGRFHAKIEGREFTPTERRIVQLLLKIIFEDYKDAWAPVMDVEFDY 176
+ F ++D FGG G+ R+ T E +++ ++ I + +++W V+D+
Sbjct: 121 PSITFSIIDRLFGGTGQAAKVQ--RDLTDIENSVMEGVIVRILANVRESWTQVIDLRPRL 178

Query: 177 LDSEVNPAMANIVSPTEVVVINSFHIEVDGGGGDFHITMPYSMIEPIRELLDAG--VQSD 234
E NP A IV P+E+VV+ + +V G + +PY IEPI L + S
Sbjct: 179 GQIETNPQFAQIVPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFSSV 238

Query: 235 KQDTDMRWSQALHDEIMDVKVGFDANIVEHELTLKDVMNFKAGDIIPIE---LPEYIMMK 291
++ + ++ L D++ V + A + L+++D++ + GDII + + + ++
Sbjct: 239 RRSSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFVLS 298

Query: 292 IEDLPTYRCKMGRSRDNLALKIHEKI 317
I + + C+ G +A +I E+I
Sbjct: 299 IGNRKKFLCQPGVVGKKIAAQILERI 324


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
Sbal195_3063  FLGHOOKFLIK  50     1e-08    Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 50.2 bits (119), Expect = 1e-08
Identities = 35/132 (26%), Positives = 63/132 (47%), Gaps = 5/132 (3%)

Query: 592 MKQQLITMVSQGIQHAEIRLDPPELGHMLVKIQVHGDQTQVQFHVTQTQTRDLVEQAMPR 651
+ Q + QG Q AE+RL P +LG + + ++V +Q Q+Q R +E A+P
Sbjct: 244 LSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAALPV 303

Query: 652 LRELLQEQGMQLADSHVSQGGQGERREGGFGDGGGSNGTDVDEISAEE-----LHLGLNQ 706
LR L E G+QL S++S +++ + + ++ E+ + + L
Sbjct: 304 LRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVSLQG 363

Query: 707 ATSVNSGIDYYA 718
+ NSG+D +A
Sbjct: 364 RVTGNSGVDIFA 375


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Sbal195_3064  FLGFLIJ  44     2e-08    Flagellar FliJ protein signature.
		>FLGFLIJ#Flagellar FliJ protein signature.

Length = 147

Score = 44.0 bits (103), Expect = 2e-08
Identities = 39/145 (26%), Positives = 70/145 (48%)

Query: 1 MANADPLLLVLKLANDAEEQAALLLKSAQLECQKRLNQLSALNNYRLEYMKQMQSQQGQA 60
MA L + LA E AA LL + CQ+ QL L +Y+ EY + S
Sbjct: 1 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG 60

Query: 61 ISASHYHQFHRFIRQIDDAITQQNRVVADGEKQKEYRQQHWLEKQKKRKAVELLLASKEK 120
I+++ + + +FI+ ++ AITQ + + ++ + W EK+++ +A + L +
Sbjct: 61 ITSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQST 120

Query: 121 KRQVVEQKREQKMTDEFASQQFYRR 145
+ E + +QK DEFA + R+
Sbjct: 121 AALLAENRLDQKKMDEFAQRAAMRK 145


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Sbal195_3066  FLGFLIH  89     6e-23    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 89.1 bits (220), Expect = 6e-23
Identities = 57/201 (28%), Positives = 102/201 (50%), Gaps = 4/201 (1%)

Query: 50 AAKPTTVESVSPPTMAEIEDIRAQAEEEGFA---EGKQQGYEQGLEKGRLEGLEQGHTEG 106
A + P IE+ E++ + +QGY+ G+ +GR +G +QG+ EG
Sbjct: 16 APPQAEFVPIVEPEETIIEEAEPSLEQQLAQLQMQAHEQGYQAGIAEGRQQGHKQGYQEG 75

Query: 107 LAQGHEQGLETGLAQAKVLLSRFEALLTQFEKPLQLLDGDIELSLLNLSMTLAKSVIGHE 166
LAQG EQGL +Q + +R + L+++F+ L LD I L+ +++ A+ VIG
Sbjct: 76 LAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRLMQMALEAARQVIGQT 135

Query: 167 LKTHPEQVLSALRLGIESLPIKEQAVTIRLHPDDVILVEQLYSTAQLTRSKWELEVDPTL 226
++ ++ ++ P+ +R+HPDD+ V+ + A L+ W L DPTL
Sbjct: 136 PTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLG-ATLSLHGWRLRGDPTL 194

Query: 227 SAGDCILSSHRSLVDLTLSSR 247
G C +S+ +D ++++R
Sbjct: 195 HPGGCKVSADEGDLDASVATR 215


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3067  FLGMOTORFLIG  287    1e-97    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 287 bits (735), Expect = 1e-97
Identities = 109/350 (31%), Positives = 195/350 (55%), Gaps = 7/350 (2%)

Query: 1 MAENKTKEVAPAAPPAFNIKDISGVEKTAILLLSLSEADAASILKHLEPKQVQKVGMAMA 60
M E K KE+ ++ ++G +K AILL+S+ ++ + K+L ++++ + +A
Sbjct: 1 MEEKKEKEIL-------DVSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIA 53

Query: 61 AMDEFGQEKVIGVHKLFLDDIQKYSSIGFNSEEFVRKALTAALGEDKAGNLIEQIIMGSG 120
++ E V F + + I ++ R+ L +LG KA ++I +
Sbjct: 54 KLETITSELKDNVLLEFKELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQ 113

Query: 121 AKGLDSLKWMDARQVATIIQNEHPQIQTIVLSYLEPDQAAEIFGQFPENTRLDLMMRIAN 180
++ + ++ D + IQ EHPQ ++LSYL+P +A+ I P + ++ RIA
Sbjct: 114 SRPFEFVRRADPANILNFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIAL 173

Query: 181 LEEVQPAALQELNDIMEKQFAGQGGAQAAKMGGLKAAANIMNYLDTGIESQLMETMRESD 240
++ P ++E+ ++EK+ A GG+ I+N D E ++E++ E D
Sbjct: 174 MDRTSPEVVREVERVLEKKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEED 233

Query: 241 EEMAQQIQDLMFVFENLIDVDDRGIQALLREVQQDVLMKALKGTDDQLKEKILGNMSKRA 300
E+A++I+ MFVFE+++ +DDR IQ +LRE+ L KALK D ++EKI NMSKRA
Sbjct: 234 PELAEEIKKKMFVFEDIVLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRA 293

Query: 301 AELLRDDLEAMGPIRISEVEVAQKEILSIARRLSDSGEIMLGGGGGDEFL 350
A +L++D+E +GP R +VE +Q++I+S+ R+L + GEI++ GG ++ L
Sbjct: 294 ASMLKEDMEFLGPTRRKDVEESQQKIVSLIRKLEEQGEIVISRGGEEDVL 343


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3068  FLGMRINGFLIF  303    2e-98    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 303 bits (778), Expect = 2e-98
Identities = 162/566 (28%), Positives = 261/566 (46%), Gaps = 54/566 (9%)

Query: 25 NLGGVDMMRQVTMILALAICLALAVFVMLWAQEPEYRPL-GKMETQEMVQVLDVLDKNKV 83
L + ++ +I+A + +A+ V ++LWA+ P+YR L + Q+ ++ L + +
Sbjct: 15 WLNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQMNI 74

Query: 84 KYQIDVD--VIKVPEDKYQEVKMMLSRAGVDSPAASQDFLNQDSGFGVSQRMEQARLKHS 141
Y+ I+VP DK E+++ L++ G+ A L FG+SQ EQ + +
Sbjct: 75 PYRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQFSEQVNYQRA 134

Query: 142 QEENLARAIEQLQSVSRAKVILALPKENVFARNASKPSATVVINTRRG-GLGQGEVDAIV 200
E LAR IE L V A+V LA+PK ++F R PSA+V + G L +G++ A+V
Sbjct: 135 LEGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALDEGQISAVV 194

Query: 201 DIVASAVQGLEPSRVTVTDSNGRLLNSGSQDGASATARRELELVQQKEAEYRTKIESILV 260
+V+SAV GL P VT+ D +G LL + G +L+ E+ + +IE+IL
Sbjct: 195 HLVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDL-NDAQLKFANDVESRIQRRIEAILS 253

Query: 261 PILGPDNFTSQVDVSMDFTAVEQTSKRYNPDLPSLRSEMTVENNTT-----GGSSGGIPG 315
PI+G N +QV +DF EQT + Y+P+ + ++ + G GG+PG
Sbjct: 254 PIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVGAGYPGGVPG 313

Query: 316 ALSNQPP---------------MESNIPQDAT-KATESVTAGNSHREATRNFELDTTISH 359
ALSNQP N PQ +T + S ++ R T N+E+D TI H
Sbjct: 314 ALSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSNYEVDRTIRH 373

Query: 360 TRQQVGAVRRISVSVAVDFKPGAAGENGQVARVARTEQELTNIRRLLEGAVGFSSQRGDV 419
T+ VG + R+SV+V V++K A G+ + T ++ I L A+GFS +RGD
Sbjct: 374 TKMNVGDIERLSVAVVVNYKTLADGKP-----LPLTADQMKQIEDLTREAMGFSDKRGDT 428

Query: 420 LEVVTVPFMDQLVEDLPALELWEQPWFWRAIKLGIGALVILV----LILAVVRPMLKRLI 475
L VV PF + L W+Q F + L++LV L VRP L R +
Sbjct: 429 LNVVNSPF-SAVDNTGGELPFWQQQSFIDQLLAAGRWLLVLVVAWILWRKAVRPQLTRRV 487

Query: 476 --YPDSVSMPEDGRLGNELAEIEDQYAADTLGMLNTQEAEYSYADDGSIHIPNLHKDDDM 533
+ + + E E+ Q + M
Sbjct: 488 EEAKAAQEQAQVRQETEEAVEVRLSKDEQLQQRRANQRLGA----------------EVM 531

Query: 534 IKAIRALVANEPELSTQVVKNWLQDN 559
+ IR + N+P + V++ W+ ++
Sbjct: 532 SQRIREMSDNDPRVVALVIRQWMSND 557


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
Sbal195_3069  FLGHOOKFLIE  57     6e-14    Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature.

Length = 103

Score = 56.6 bits (136), Expect = 6e-14
Identities = 29/86 (33%), Positives = 45/86 (52%)

Query: 26 QPNIMQQVNNTSGADFGQLLSQAVGNVSGLQSTSSNLATRLEMGDTTVTLSDTVIAREKA 85
Q+ F L A+ +S Q+ + A + +G+ V L+D + +KA
Sbjct: 18 MSARAQESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTLGEPGVALNDVMTDMQKA 77

Query: 86 SVAFEATVQVRNKLVEAYKEIMSMPV 111
SV+ + +QVRNKLV AY+E+MSM V
Sbjct: 78 SVSMQMGIQVRNKLVAAYQEVMSMQV 103


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Sbal195_3070  HTHFIS  455    e-160    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 455 bits (1171), Expect = e-160
Identities = 167/483 (34%), Positives = 250/483 (51%), Gaps = 42/483 (8%)

Query: 1 MSEAKLLLVEDDASLREALLDTLMLAQYECIDVASGEDAILALKQHQFDLVISDVQMQGI 60
M+ A +L+ +DDA++R L L A Y+ ++ + DLV++DV M
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 GGLGLLNFLQQHHPKLPVLLMTAYATIGSAVDAIKLGAVDYLAKPFAPEVLLNQVSRYLP 120
LL +++ P LPVL+M+A T +A+ A + GA DYL KPF L+ + R L
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 121 LKQNVDQPVVAD-----------EKSLALLALAQRVAASDASVMILGPSGSGKEVLARYI 169
+ + D + + R+ +D ++MI G SG+GKE++AR +
Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARAL 180

Query: 170 HQHSSRADQAFVAINCAAIPENMLEATLFGYEKGAFTGAYQACPGKFEQAQGGTLLLDEI 229
H + R + FVAIN AAIP +++E+ LFG+EKGAFTGA G+FEQA+GGTL LDEI
Sbjct: 181 HDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEI 240

Query: 230 SEMDLGLQAKLLRVLQEREVERLGGRKIIKLDVRVLATSNRDLKAVVAAGGFREDLYYRI 289
+M + Q +LLRVLQ+ E +GGR I+ DVR++A +N+DLK + G FREDLYYR+
Sbjct: 241 GDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRL 300

Query: 290 NVFPLAWPALSQRPADILPLARHLLVKHAKALNVADVPELDENARRRLLSHRWPGNVREL 349
NV PL P L R DI L RH + + A+ + V D+ A + +H WPGNVREL
Sbjct: 301 NVVPLRLPPLRDRAEDIPDLVRHFV-QQAEKEGLD-VKRFDQEALELMKAHPWPGNVREL 358

Query: 350 DNVIQRALILRAGQVITANDIIIDAQDVILGA--------------------------ED 383
+N+++R L VIT I + + I +
Sbjct: 359 ENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFAS 418

Query: 384 LDQFVAEPDGLGEELKAQEHVIILETLNQCQGSRKLVAEKLGISARTLRYKMARMRDMGI 443
+ L E+ +IL L +G++ A+ LG++ TLR K +R++G+
Sbjct: 419 FGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKK---IRELGV 475

Query: 444 QLP 446
+
Sbjct: 476 SVY 478


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Sbal195_3071  PF06580  34     7e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 34.1 bits (78), Expect = 7e-04
Identities = 19/95 (20%), Positives = 37/95 (38%), Gaps = 19/95 (20%)

Query: 256 LVMNSIEAGAT------EIRIQAKEEGDQLLLNVIDNGKGLDANMQQKVLEPFFTTKSQG 309
LV N I+ G +I ++ ++ + L V + G N + +
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTK------------ES 310

Query: 310 TGLGLA-VVQSVVRNHGGQLQLSCLPNKGCTVSLV 343
TG GL V + + +G + Q+ +G ++V
Sbjct: 311 TGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMV 345


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Sbal195_3072  HTHFIS  433    e-151    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 433 bits (1114), Expect = e-151
Identities = 171/481 (35%), Positives = 262/481 (54%), Gaps = 21/481 (4%)

Query: 7 RILLIGPSSERLNRLCCIFDFLGEQIAQI-DAEKLSASLQDTRFRALVILTDVMDADA-- 63
IL+ + L G + +A L + +++TDV+ D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGD--LVVTDVVMPDENA 62

Query: 64 ---LKNIAGQHPWQPMLLL---GNVDDLQVSNILG---NIEEPLTYPQLTELLHFCQVFG 114
L I P P+L++ ++ G + +P +L ++
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 115 QVKRPQVPTSANQTKLFRSLVGRSDGIANVRHLINQVATSEATVLVLGQSGTGKEVVARN 174
+ + ++ + LVGRS + + ++ ++ ++ T+++ G+SGTGKE+VAR
Sbjct: 123 KRRPSKLEDDSQD---GMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARA 179

Query: 175 IHYLSERRDGPFIPVNCGAIPPELLESELFGHEKGSFTGAICSRKGRFELAEGGTLFLDE 234
+H +RR+GPF+ +N AIP +L+ESELFGHEKG+FTGA GRFE AEGGTLFLDE
Sbjct: 180 LHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDE 239

Query: 235 IGDMPLQMQVKLLRVLQERVFERVGGTKTINADVRVVAATHRDLETMISVNEFREDLYYR 294
IGDMP+ Q +LLRVLQ+ + VGG I +DVR+VAAT++DL+ I+ FREDLYYR
Sbjct: 240 IGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYR 299

Query: 295 LNVFPIEMPALCDRKDDVPLLLQELVSRVYNEGRGKVRFTQRAIESLKEHAWSGNVRELS 354
LNV P+ +P L DR +D+P L++ V + EG RF Q A+E +K H W GNVREL
Sbjct: 300 LNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGNVRELE 359

Query: 355 NLVERLTILYPGGLVDVNDLPVKYRHIDVPEYCVEMSEEQQERDALASIFSDEEPVEIPE 414
NLV RLT LYP ++ + + R ++P+ +E + + +++ EE +
Sbjct: 360 NLVRRLTALYPQDVITREIIENELRS-EIPDSPIEKAAARSGSLSISQAV--EENMRQYF 416

Query: 415 TRFPSELPPEGVNLKDLLAELEIDMIRQALELQDNVVARAAEMLGIRRTTLVEKMRKYGM 474
F LPP G+ +LAE+E +I AL +AA++LG+ R TL +K+R+ G+
Sbjct: 417 ASFGDALPPSGL-YDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475

Query: 475 T 475
+
Sbjct: 476 S 476


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Sbal195_3077  FLAGELLIN  139    2e-40    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 139 bits (352), Expect = 2e-40
Identities = 94/270 (34%), Positives = 129/270 (47%), Gaps = 9/270 (3%)

Query: 2 AITVNTNVTSMKAQKNLNASGNALATSMERLSSGLRINSAKDDAAGLAISNRLNSQVRGL 61
A +NTN S+ Q NLN S ++L++++ERLSSGLRINSAKDDAAG AI+NR S ++GL
Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60

Query: 62 DVGMRNANDGISVAQIAEGAMQEQTNMLQRMRDLSVQAVNGANSTSDRDALQAEIDQLAL 121
RNANDGIS+AQ EGA+ E N LQR+R+LSVQA NG NS SD ++Q EI Q
Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120

Query: 122 EITAISSTTAFGDTKLLDSSFAGKSFQVGHQEGENISISISGTNATALGVNAL------- 174
EI +S+ T F K+L QVG +GE I+I + + +LG++
Sbjct: 121 EIDRVSNQTQFNGVKVLSQDN-QMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKE 179

Query: 175 -AVSTDILASTATGAIDDAIKAIDTQRAKLGATQNRLSHNISNSANTQANVADAKSRIVD 233
V + D + R + + + A D
Sbjct: 180 ATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTD 239

Query: 234 VDFAKETSQMTKNQVLQQTGSAMLAQANQL 263
+ K + A A +
Sbjct: 240 DAENNTAVDLFKTTKSTAGTAEAKAIAGAI 269



Score = 83.2 bits (205), Expect = 3e-20
Identities = 57/265 (21%), Positives = 97/265 (36%)

Query: 7 TNVTSMKAQKNLNASGNALATSMERLSSGLRINSAKDDAAGLAISNRLNSQVRGLDVGMR 66
N T++ K ++ + D G+ + + G
Sbjct: 242 ENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVST 301

Query: 67 NANDGISVAQIAEGAMQEQTNMLQRMRDLSVQAVNGANSTSDRDALQAEIDQLALEITAI 126
N +A+ ++ + N D ++ A
Sbjct: 302 TINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEAN 361

Query: 127 SSTTAFGDTKLLDSSFAGKSFQVGHQEGENISISISGTNATALGVNALAVSTDILASTAT 186
++ + + + + + + +N A + +
Sbjct: 362 NAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPL 421

Query: 187 GAIDDAIKAIDTQRAKLGATQNRLSHNISNSANTQANVADAKSRIVDVDFAKETSQMTKN 246
+ID A+ +D R+ LGA QNR I+N NT N+ A+SRI D D+A E S M+K
Sbjct: 422 ASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKA 481

Query: 247 QVLQQTGSAMLAQANQLPQVALSLL 271
Q+LQQ G+++LAQANQ+PQ LSLL
Sbjct: 482 QILQQAGTSVLAQANQVPQNVLSLL 506


81  Sbal195_3084  Sbal195_3102  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_3084  2       21       -2.410715  flagellin domain-containing protein
Sbal195_3086  1       17       -1.610947  transposase
Sbal195_3088  0       16       -1.404984  flagellin domain-containing protein
Sbal195_3089  -1      14       -0.985498  flagellin domain-containing protein
Sbal195_3090  -2      11       -0.167678  flagellar hook-associated protein FlgL
Sbal195_3091  -2      10       0.584271   flagellar hook-associated protein FlgK
Sbal195_3092  -1      15       -0.338318  flagellar rod assembly protein/muramidase FlgJ
Sbal195_3093  0       16       -0.854686  flagellar basal body P-ring protein
Sbal195_3094  0       17       -1.254930  flagellar basal body L-ring protein
Sbal195_3095  1       19       -1.422000  flagellar basal body rod protein FlgG
Sbal195_3096  -1      22       -2.048390  flagellar basal body rod protein FlgF
Sbal195_3097  0       23       -3.303977  flagellar hook protein FlgE
Sbal195_3098  0       21       -3.718293  flagellar basal body rod modification protein
Sbal195_3099  0       22       -4.074902  flagellar basal body rod protein FlgC
Sbal195_3100  1       21       -4.430760  flagellar basal body rod protein FlgB
Sbal195_3101  1       20       -4.221471  protein-glutamate O-methyltransferase
Sbal195_3102  1       20       -4.241381  response regulator receiver modulated CheW
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Sbal195_3084  FLAGELLIN  144    7e-42    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 144 bits (363), Expect = 7e-42
Identities = 101/270 (37%), Positives = 136/270 (50%), Gaps = 9/270 (3%)

Query: 2 AITVNTNVTSMKAQKNLNASGNALATSMERLSSGLRINSAKDDAAGLAISNRLNSQVRGL 61
A +NTN S+ Q NLN S ++L++++ERLSSGLRINSAKDDAAG AI+NR S ++GL
Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60

Query: 62 DVGMRNANDGISVAQIAEGAMQEQTNMLQRMRDLSVQAVNGANSTSDKDAIQAEIDQLAL 121
RNANDGIS+AQ EGA+ E N LQR+R+LSVQA NG NS SD +IQ EI Q
Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120

Query: 122 EITAISNTTAFGDTKLLSGGFTAKNFQVGHQEGENISISISGTDASTLGVEGLLVSSDGA 181
EI +SN T F K+LS QVG +GE I+I + D +LG++G V+
Sbjct: 121 EIDRVSNQTQFNGVKVLSQDNQ-MKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKE 179

Query: 182 AS--------TSIGLIDTAIKTIDTQRAKLGATQNRLSHNISNSANTQSNVADAKSRIVD 233
A+ ++ DT + R + + + A D
Sbjct: 180 ATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTD 239

Query: 234 VDFAKETSAMTKNQVLQQTGSAMLAQANQL 263
+ K + A A +
Sbjct: 240 DAENNTAVDLFKTTKSTAGTAEAKAIAGAI 269



Score = 87.4 bits (216), Expect = 1e-21
Identities = 57/265 (21%), Positives = 97/265 (36%)

Query: 7 TNVTSMKAQKNLNASGNALATSMERLSSGLRINSAKDDAAGLAISNRLNSQVRGLDVGMR 66
N T++ K ++ + D G+ + + G
Sbjct: 242 ENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVST 301

Query: 67 NANDGISVAQIAEGAMQEQTNMLQRMRDLSVQAVNGANSTSDKDAIQAEIDQLALEITAI 126
N +A+ ++ + N D ++ A
Sbjct: 302 TINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEAN 361

Query: 127 SNTTAFGDTKLLSGGFTAKNFQVGHQEGENISISISGTDASTLGVEGLLVSSDGAASTSI 186
+ + +TA + + ++ + + +
Sbjct: 362 NAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPL 421

Query: 187 GLIDTAIKTIDTQRAKLGATQNRLSHNISNSANTQSNVADAKSRIVDVDFAKETSAMTKN 246
ID+A+ +D R+ LGA QNR I+N NT +N+ A+SRI D D+A E S M+K
Sbjct: 422 ASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKA 481

Query: 247 QVLQQTGSAMLAQANQLPQVALSLL 271
Q+LQQ G+++LAQANQ+PQ LSLL
Sbjct: 482 QILQQAGTSVLAQANQVPQNVLSLL 506


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Sbal195_3088  FLAGELLIN  141    8e-41    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 141 bits (356), Expect = 8e-41
Identities = 99/270 (36%), Positives = 133/270 (49%), Gaps = 9/270 (3%)

Query: 2 AITVNTNVTSMKAQKNLNASGNALATSMERLSSGLRINSAKDDAAGLAISNRLNSQVRGL 61
A +NTN S+ Q NLN S ++L++++ERLSSGLRINSAKDDAAG AI+NR S ++GL
Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60

Query: 62 DVGMRNANDGISVAQIAEGAMQEQTNMLQRMRDLSVQAVNGANSTSDKDAIQAEIDQLAL 121
RNANDGIS+AQ EGA+ E N LQR+R+LSVQA NG NS SD +IQ EI Q
Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120

Query: 122 EITAISNTTAFGDTKLLSGGFSAKSFQVGHQEGENISISISGTDAGTLSVDALLVSSDSA 181
EI +SN T F K+LS QVG +GE I+I + D +L +D V+
Sbjct: 121 EIDRVSNQTQFNGVKVLSQDNQ-MKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKE 179

Query: 182 AS--------TSIGLIDAAIKTIDTQRAKLGATQNRLAHNISNSANTQANVADAKSRIVD 233
A+ ++ D + R + + + A D
Sbjct: 180 ATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTD 239

Query: 234 VDFAKETSQMTKNQVLQQTGSAMLAQANQL 263
+ K + A A +
Sbjct: 240 DAENNTAVDLFKTTKSTAGTAEAKAIAGAI 269



Score = 85.1 bits (210), Expect = 7e-21
Identities = 56/265 (21%), Positives = 98/265 (36%)

Query: 7 TNVTSMKAQKNLNASGNALATSMERLSSGLRINSAKDDAAGLAISNRLNSQVRGLDVGMR 66
N T++ K ++ + D G+ + + G
Sbjct: 242 ENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVST 301

Query: 67 NANDGISVAQIAEGAMQEQTNMLQRMRDLSVQAVNGANSTSDKDAIQAEIDQLALEITAI 126
N +A+ ++ + N D ++ A
Sbjct: 302 TINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEAN 361

Query: 127 SNTTAFGDTKLLSGGFSAKSFQVGHQEGENISISISGTDAGTLSVDALLVSSDSAASTSI 186
+ + ++A + + ++ ++ + + +
Sbjct: 362 NAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPL 421

Query: 187 GLIDAAIKTIDTQRAKLGATQNRLAHNISNSANTQANVADAKSRIVDVDFAKETSQMTKN 246
ID+A+ +D R+ LGA QNR I+N NT N+ A+SRI D D+A E S M+K
Sbjct: 422 ASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKA 481

Query: 247 QVLQQTGSAMLAQANQLPQVALSLL 271
Q+LQQ G+++LAQANQ+PQ LSLL
Sbjct: 482 QILQQAGTSVLAQANQVPQNVLSLL 506


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Sbal195_3089  FLAGELLIN  143    1e-41    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 143 bits (362), Expect = 1e-41
Identities = 96/270 (35%), Positives = 135/270 (50%), Gaps = 9/270 (3%)

Query: 2 AITVNTNVTSMKAQKNLNTSGNALATSMERLSSGLRINSAKDDAAGLAISNRLNSQVRGL 61
A +NTN S+ Q NLN S ++L++++ERLSSGLRINSAKDDAAG AI+NR S ++GL
Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60

Query: 62 EVGMRNANDGISVAQVAEGAMQEQTNMLQRMRDLAVQSVNGANSTSDKEALQAEIDQLTS 121
RNANDGIS+AQ EGA+ E N LQR+R+L+VQ+ NG NS SD +++Q EI Q
Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120

Query: 122 EITAISNSTAFGDTKLLSGGFTGKSFQVGHQEGENISISISGTDATTLGVNALVVSSDTA 181
EI +SN T F K+LS QVG +GE I+I + D +LG++ V+
Sbjct: 121 EIDRVSNQTQFNGVKVLSQDNQ-MKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKE 179

Query: 182 AS--------TAIGAIDSALKLIDTQRATLGAVQNRLAHNISNSANTQSNVADAKSRIVD 233
A+ + D+ + R + + + A D
Sbjct: 180 ATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTD 239

Query: 234 VDFAKETAQMTKNQVLQQTGSSMLAQANQL 263
+ K + A A +
Sbjct: 240 DAENNTAVDLFKTTKSTAGTAEAKAIAGAI 269



Score = 86.3 bits (213), Expect = 3e-21
Identities = 60/265 (22%), Positives = 102/265 (38%)

Query: 7 TNVTSMKAQKNLNTSGNALATSMERLSSGLRINSAKDDAAGLAISNRLNSQVRGLEVGMR 66
N T++ K ++ + D G+ + + G
Sbjct: 242 ENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVST 301

Query: 67 NANDGISVAQVAEGAMQEQTNMLQRMRDLAVQSVNGANSTSDKEALQAEIDQLTSEITAI 126
N VA+ ++ + N + S++ A
Sbjct: 302 TINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEAN 361

Query: 127 SNSTAFGDTKLLSGGFTGKSFQVGHQEGENISISISGTDATTLGVNALVVSSDTAASTAI 186
+ + +T + + +N ++ + + +
Sbjct: 362 NAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPL 421

Query: 187 GAIDSALKLIDTQRATLGAVQNRLAHNISNSANTQSNVADAKSRIVDVDFAKETAQMTKN 246
+IDSAL +D R++LGA+QNR I+N NT +N+ A+SRI D D+A E + M+K
Sbjct: 422 ASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKA 481

Query: 247 QVLQQTGSSMLAQANQLPQVALSLL 271
Q+LQQ G+S+LAQANQ+PQ LSLL
Sbjct: 482 QILQQAGTSVLAQANQVPQNVLSLL 506


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Sbal195_3090  FLAGELLIN  57     5e-11    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 57.4 bits (138), Expect = 5e-11
Identities = 41/242 (16%), Positives = 83/242 (34%), Gaps = 3/242 (1%)

Query: 20 QTATSKILDQLSSGKKVNTAGDDPVASQGIDNLNQKNALVDQFMKNIDYATNRLAVTESK 79
Q++ S +++LSSG ++N+A DD + + Q +N + + TE
Sbjct: 21 QSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQASRNANDGISIAQTTEGA 80

Query: 80 LGSAEDLTGSIREQVMRAINGTLSGTERQMIADEMKGSMEELLSIANSKDESGNYMFSGF 139
L + +RE ++A NGT S ++ + I DE++ +EE+ ++N +G + S
Sbjct: 81 LNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDRVSNQTQFNGVKVLSQ- 139

Query: 140 STDKEPFAFDNSTPPKIVYSGDSGVRNSLVQTGVAMGTNI--PGDSAFMKAPNGLGDYSV 197
+ N + V++ + G GD D
Sbjct: 140 DNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEATVGDLKSSFKNVTGYDTYA 199

Query: 198 NYLASQQGEFSVKTAKIADTATYVADTYTFNFTDNGAGGTNLQVLDSANNPVANVANFDA 257
+ + + TA V D N + + + + + +
Sbjct: 200 VGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAENNTAVDLFKTTKSTAGT 259

Query: 258 TN 259

Sbjct: 260 AE 261


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
Sbal195_3091  FLGHOOKAP1  207    2e-61    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 207 bits (529), Expect = 2e-61
Identities = 122/455 (26%), Positives = 192/455 (42%), Gaps = 19/455 (4%)

Query: 4 DLLNIARTGVLASQSQLGVTSNNIANANTAGYHRQVATQSTLESQRLGNSFYGTGTYVND 63
L+N A +G+ A+Q+ L SNNI++ N AGY RQ + S + G G YV+
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSG 61

Query: 64 VKRIYNDYAARELRIGQTTLSAAEASYGKLSELDQLFSQIGKVVPQSLNNLFSGLNSIAD 123
V+R Y+ + +LR QT S A Y ++S++D + S + + + F+ L ++
Sbjct: 62 VQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLVS 121

Query: 124 LPADLGIRSSTLTNAQQVASSLNQMQSYLNGQLDQTNDQITGMTKRINEIGTELAKLNLE 183
D R + + ++ + + YL Q Q N I +IN ++A LN +
Sbjct: 122 NAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLNDQ 181

Query: 184 LMKSPNQDA-----QLLDKQDALVQELSQYAQVNVIPQENGAKSIMLGGSVMLVSGEIAM 238
+ + A LLD++D LV EL+Q V V Q+ G +I + LV G A
Sbjct: 182 ISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTAR 241

Query: 239 SMGTQAGNPFPKELQLNSSISSQSVTVDPSKL--GGQLGAMFDYRDQTLIPAGHELDQLA 296
+ + P + + P KL G LG + +R Q L + L QLA
Sbjct: 242 QLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQLA 301

Query: 297 LGVADNFNKMQAQGIDLNGQVGANIFKDINDPMMSLGRAAGFSGNTGNATLGVTIDDTSL 356
L A+ FN G D NG G + F + + N G+ +G T+ D S
Sbjct: 302 LAFAEAFNTQHKAGFDANGDAGEDFFA------IGKPAVLQNTKNKGDVAIGATVTDASA 355

Query: 357 LTGGAYELSF--TSPATYELRDTETGTITPLTLTGSILSGGSGFSIDIKAGAMASGDRFA 414
+ Y++SF L T T+TP G G A D F
Sbjct: 356 VLATDYKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFT----GTPAVNDSFT 411

Query: 415 IRPTAGASNGIEVVMKDPKGIAAASPKITADAANS 449
++P + A ++V++ D IA AS + D+ N
Sbjct: 412 LKPVSDAIVNMDVLITDEAKIAMASEEDAGDSDNR 446



Score = 87.7 bits (217), Expect = 3e-20
Identities = 38/104 (36%), Positives = 56/104 (53%)

Query: 535 AEGDNSNAVAMAKLSESKVMNNGKSTLADVFENTKLDIGSKTKAAEVRTGSAEAVYQQAY 594
+ DN N A+ L + G + D + + DIG+KT + + + V Q
Sbjct: 441 GDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGNKTATLKTSSATQGNVVTQLS 500

Query: 595 ARVESESGVNLDEEAANLMRFQQAYQASARIMTTAQQIFDTLLS 638
+ +S SGVNLDEE NL RFQQ Y A+A+++ TA IFD L++
Sbjct: 501 NQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALIN 544


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Sbal195_3092  FLGFLGJ  152    1e-45    Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 152 bits (386), Expect = 1e-45
Identities = 66/151 (43%), Positives = 94/151 (62%), Gaps = 1/151 (0%)

Query: 219 GSREEFLATLYPHAEKAAKALGTQPEVLLAQSALETGWGQKIVRGNNGAPSHNLFNIKAD 278
G + FLA L A+ A++ G ++LAQ+ALE+GWGQ+ +R NG PS+NLF +KA
Sbjct: 147 GDSKAFLAQLSLPAQLASQQSGVPHHLILAQAALESGWGQRQIRRENGEPSYNLFGVKAS 206

Query: 279 RRWQGDKANVSTLEFEHGVAVQQKADFRVYSDFEHSFNDFVSFIAEGDRYQDAKKVAASP 338
W+G ++T E+E+G A + KA FRVYS + + +D+V + RY A AAS
Sbjct: 207 GNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYLEALSDYVGLLTRNPRYA-AVTTAASA 265

Query: 339 TQFIRALQDAGYATDPRYAEKVIKVMQSISE 369
Q +ALQDAGYATDP YA K+ ++Q +
Sbjct: 266 EQGAQALQDAGYATDPHYARKLTNMIQQMKS 296



Score = 87.8 bits (217), Expect = 1e-21
Identities = 39/91 (42%), Positives = 61/91 (67%), Gaps = 3/91 (3%)

Query: 12 DLGGLDSLRAQAQKDEKGALKKVAQQFEGIFVQMLMKSMRDANAVFQSDSPLNSQYTKFY 71
D L+ L+A+A +D ++ VA+Q EG+FVQM++KSMRDA D +S++T+ Y
Sbjct: 14 DAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALP---KDGLFSSEHTRLY 70

Query: 72 EQMRDQQLSVDLSDKGVLGLADMMVQQLSPE 102
M DQQ++ ++ LGLA+MMV+Q++PE
Sbjct: 71 TSMYDQQIAQQMTAGKGLGLAEMMVKQMTPE 101


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3093  FLGPRINGFLGI  369    e-129    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 369 bits (949), Expect = e-129
Identities = 158/367 (43%), Positives = 222/367 (60%), Gaps = 14/367 (3%)

Query: 5 LVLAVAVLVFSLPSQAE--RIKDIANVQGVRSNQLIGYGLVVGLPGTGEKTS---YTEQT 59
LV + + + P+QA+ RIKDIA++Q R NQLIGYGLVVGL GTG+ +TEQ+
Sbjct: 11 LVFSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQS 70

Query: 60 FMTMLKNFGINLPDNVKPKIKNVAVVAVHADMPAFIKPGQDLDVTVSSLGEAKSLRGGTL 119
ML+N GI + KN+A V V A++P F PG +DVTVSSLG+A SLRGG L
Sbjct: 71 MRAMLQNLGITTQGG-QSNAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNL 129

Query: 120 LQTFLKGVDGNVYAIAQGSLVVSGFSADGLDGSKVIQNTPTVGRIPNGAIVERSVATPFS 179
+ T L G DG +YA+AQG+L+V+GFSA G D + + Q T R+PNGAI+ER + + F
Sbjct: 130 IMTSLSGADGQIYAVAQGALIVNGFSAQG-DAATLTQGVTTSARVPNGAIIERELPSKFK 188

Query: 180 TGDYLTFNLRRSDFSTAQRMADAINEL----LGPDMARPLDATSVQVSAPRDVSQRVSFL 235
L LR DFSTA R+AD +N G +A P D+ + V PR V+ +
Sbjct: 189 DSVNLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPR-VADLTRLM 247

Query: 236 ATLENLDVIPAEESAKVIVNSRTGTIVVGQNVRLLPAAITHGGMTVTIAEATQVSQPNAL 295
A +ENL + + AKV++N RTGTIV+G +VR+ A+++G +TV + E+ QV QP
Sbjct: 248 AEIENL-TVETDTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQPAPF 306

Query: 296 ANGQTTVTSNSTITATESDRRMFMFNPGTTLDELVRAVNLVGAAPSDVLAILEALKVAGA 355
+ GQT V + I A + ++ + G L LV +N +G ++AIL+ +K AGA
Sbjct: 307 SRGQTAVQPQTDIMAMQEGSKVAIVE-GPDLRTLVAGLNSIGLKADGIIAILQGIKSAGA 365

Query: 356 LHGELII 362
L EL++
Sbjct: 366 LQAELVL 372


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3094  FLGLRINGFLGH  143    1e-44    Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 143 bits (362), Expect = 1e-44
Identities = 74/227 (32%), Positives = 113/227 (49%), Gaps = 18/227 (7%)

Query: 4 YLVLAVALL-LAACSSTQKKPLADDPFYAPVYPEAPPTKIAATGSIYQDSQ-----ASSL 57
Y + ++ +L L C+ PL A P P A GSI+Q +Q L
Sbjct: 9 YAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTP---VANGSIFQSAQPINYGYQPL 65

Query: 58 YSDIRAHKVGDIITIVLKESTQAKKSAGNQIKKGSDMSLDPIFAGGSNVSI-----GGVP 112
+ D R +GD +TIVL+E+ A KS+ + + F + G
Sbjct: 66 FEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTN----FGFDTVPRYLQGLFGNAR 121

Query: 113 IDLRYKDSMNTKRESDADQSNSLDGSISANVMQVLNNGSLVIRGEKWISINNGDEFIRVT 172
D+ + A+ SN+ G+++ V QVL NG+L + GEK I+IN G EFIR +
Sbjct: 122 ADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRFS 181

Query: 173 GLVRSQDIKPDNTIDSTRMANARIQYSGTGTFADAQKVGWLSQFFMS 219
G+V + I NT+ ST++A+ARI+Y G G +AQ +GWL +FF++
Sbjct: 182 GVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLN 228


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3095  FLGHOOKAP1    43     7e-07    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 43.0 bits (101), Expect = 7e-07
Identities = 18/119 (15%), Positives = 40/119 (33%), Gaps = 4/119 (3%)

Query: 145 DNATSITVSAEGEVSVKTPGTAENQVVGQLTMTDFINPSGLDPMGQNLYTETG---ASGT 201
+ I +++E + + + Q + +L ++ G A+
Sbjct: 427 TDEAKIAMASEEDAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGNKTATLK 486

Query: 202 PIQGTASLDGMGAIRQGALETSNVNVTEELVNLIESQRIYEMNSKVISAVDQMLSYVTQ 260
T + + S VN+ EE NL Q+ Y N++V+ + + +
Sbjct: 487 TSSATQGNV-VTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALIN 544



Score = 35.7 bits (82), Expect = 2e-04
Identities = 9/36 (25%), Positives = 20/36 (55%)

Query: 5 LWISKTGLDAQQTDIAVISNNVANASTVGYKKSRAV 40
+ + +GL+A Q + SNN+++ + GY + +
Sbjct: 4 INNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTI 39


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3097  FLGHOOKAP1    40     2e-05    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 39.9 bits (93), Expect = 2e-05
Identities = 15/35 (42%), Positives = 22/35 (62%)

Query: 2 SFNIALSGISAAQKDLNTTANNIANANTIGFKESR 36
N A+SG++AAQ LNT +NNI++ N G+
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQT 37



Score = 37.6 bits (87), Expect = 1e-04
Identities = 12/49 (24%), Positives = 25/49 (51%)

Query: 405 SISSSALEQSNIDLTTELVDLISAQRNFQANSRTLEVNNTLQQTVLQIR 453
+S+ S ++L E +L Q+ + AN++ L+ N + ++ IR
Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3099  FLGHOOKAP1    33     3e-04    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 33.0 bits (75), Expect = 3e-04
Identities = 9/38 (23%), Positives = 18/38 (47%)

Query: 99 NVNVMEEMADMISASRSYQMNVQVAEAAKSMLQQTLGM 136
VN+ EE ++ + Y N QV + A ++ + +
Sbjct: 508 GVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545



Score = 29.9 bits (67), Expect = 0.003
Identities = 16/67 (23%), Positives = 29/67 (43%), Gaps = 6/67 (8%)

Query: 5 SIFDVAGSGMSAQSVRLNTTASNIANADSVSSSVDKTYRSRHPIFEAEMAKAQSQQQASQ 64
S+ + A SG++A LNT ++NI++ + Y + I + +
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYN------VAGYTRQTTIMAQANSTLGAGGWVGN 55

Query: 65 GVAVKGI 71
GV V G+
Sbjct: 56 GVYVSGV 62


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3102  HTHFIS        61     1e-12    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 61.4 bits (149), Expect = 1e-12
Identities = 23/128 (17%), Positives = 52/128 (40%), Gaps = 12/128 (9%)

Query: 180 HIMVIDDSAVARKQIIRSLESLNLQIDTAKDGREALDKLKEIAKEMDNVADEIPLIISDI 239
I+V DD A R + ++L + + + A + L+++D+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIA---------AGDGDLVVTDV 55

Query: 240 EMPEMDGYTLTAEIRDDPKLKHIKVVLHTSLSGVFNQAMVQKVGANDFIAK-FNPDELAA 298
MP+ + + L I+ + V++ ++ + + GA D++ K F+ EL
Sbjct: 56 VMPDENAFDLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIG 113

Query: 299 AVNKHLSL 306
+ + L+
Sbjct: 114 IIGRALAE 121


82  Sbal195_3235  Sbal195_3241  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_3235  -2      11       1.905558   FKBP-type peptidylprolyl isomerase
Sbal195_3236  -1      11       2.125291   glycoside hydrolase
Sbal195_3237  -2      10       2.234240   ROK family protein
Sbal195_3238  -2      11       3.093195   peptidase M24
Sbal195_3239  -2      13       2.842526   metal dependent phosphohydrolase
Sbal195_3240  -1      13       3.028515   methyl-accepting chemotaxis sensory transducer
Sbal195_3241  1       12       2.437410   DEAD/DEAH box helicase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3235  INFPOTNTIATR  134    4e-42    Macrophage infectivity potentiator signature.
		>INFPOTNTIATR#Macrophage infectivity potentiator signature.

Length = 233

Score = 134 bits (338), Expect = 4e-42
Identities = 64/132 (48%), Positives = 84/132 (63%), Gaps = 2/132 (1%)

Query: 25 KAAQENIRLGNEFLTQNKTKEGVITTASGLQYQVLTKGDGTVHPKASDTVTVHYHGTLID 84
K A+EN G+ FL+ NK+K G++ SGLQY+++ G G P SDTVTV Y GTLID
Sbjct: 99 KKAEENKAKGDAFLSANKSKPGIVVLPSGLQYKIIDAGTGA-KPGKSDTVTVEYTGTLID 157

Query: 85 GTVFDSSVDRGEPIAFPLNRVIKGWTEGVQLMVVGDKVRFFIPSVLAYGNSST-GKIGGG 143
GTVFDS+ G+P F +++VI GWTE +QLM G F+P+ LAYG S G IG
Sbjct: 158 GTVFDSTEKAGKPATFQVSQVIPGWTEALQLMPAGSTWEVFVPADLAYGPRSVGGPIGPN 217

Query: 144 SVLIFDVELLKI 155
LIF + L+ +
Sbjct: 218 ETLIFKIHLISV 229


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3236  MICOLLPTASE   48     2e-07    Microbial collagenase metalloprotease (M9) signature.
		>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature.

Length = 1104

Score = 47.8 bits (113), Expect = 2e-07
Identities = 35/168 (20%), Positives = 62/168 (36%), Gaps = 11/168 (6%)

Query: 542 WEIDADNGDILNAMHEGLGHGEGTTPPVNKAPIANAGADVNVTGPTDVVLNGSGSRDPEN 601
++D + + + + G+ T VNK P A +D +V ++ +G+ S+D +
Sbjct: 744 HKVDGNGNYVYDVVFHGMNTDTNTDVHVNKEPKAVIKSDSSVIVEEEINFDGTESKDEDG 803

Query: 602 EALTYLWTQVSGPTIAITNADMANAAIQLAATQTDVAYSFSLKVTDPEGLSATDSVTVTN 661
E Y W G ++ A A + T Y L VTD G T+S +
Sbjct: 804 EIKAYEWDFGDGEK-----SNEAKATHKYNKTGE---YEVKLTVTDNNGGINTESKKIKV 855

Query: 662 KADTPNQAPVVSVAAT---ATVEAGKTVSIVASASDADGDALTYAWTV 706
D P + S + K+ +V + + Y + V
Sbjct: 856 VEDKPVEVINESEPNNDFEKANQIAKSNMLVKGTLSEEDYSDKYYFDV 903


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3240  FLAGELLIN     30     0.021    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 30.4 bits (68), Expect = 0.021
Identities = 14/87 (16%), Positives = 35/87 (40%), Gaps = 4/87 (4%)

Query: 293 QLSGAMEEMSSTITEVAQNTHLTSTSINTAYDLCLKSSANMKANTQKVEQLAKSVADAAN 352
++ +T+ ++N + + T + + N Q+V +L+ + N
Sbjct: 48 AIANRFTSNIKGLTQASRNANDGISIAQTTE----GALNEINNNLQRVRELSVQATNGTN 103

Query: 353 NAHQLNKEAEQVANAMGEIDSIAEQTN 379
+ L +++ + EID ++ QT
Sbjct: 104 SDSDLKSIQDEIQQRLEEIDRVSNQTQ 130


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3241  SECA          33     0.003    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 32.9 bits (75), Expect = 0.003
Identities = 37/175 (21%), Positives = 65/175 (37%), Gaps = 46/175 (26%)

Query: 220 SQVVYPVEQRRKRELLSELIGK-KNWQQVLVFTATRDAADTLVKELNLDGIPSEVVHGDK 278
+VY E + + ++ ++ + Q VLV T + + ++ + EL GI V++
Sbjct: 424 PDLVYMTEAEKIQAIIEDIKERTAKGQPVLVGTISIEKSELVSNELTKAGIKHNVLNA-- 481

Query: 279 GQGSRRRALREFVAGDVR---VLVATEVAARGLDI---------------PSLEYVVNYD 320
+ A VA V +AT +A RG DI P+ E +
Sbjct: 482 -KFHANEA--AIVAQAGYPAAVTIATNMAGRGTDIVLGGSWQAEVAALENPTAEQIEKIK 538

Query: 321 LPFLAED---------YV-----H---RI-----GRTGRAGKTGVAISFVSREEE 353
+ ++ H RI GR+GR G G + ++S E+
Sbjct: 539 ADWQVRHDAVLEAGGLHIIGTERHESRRIDNQLRGRSGRQGDAGSSRFYLSMEDA 593


83  Sbal195_3291  Sbal195_3298  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_3291  -2      15       0.411967   hybrid sensory histidine kinase BarA
Sbal195_3292  -1      14       -0.118752  hypothetical protein
Sbal195_3293  -1      12       1.053706   hypothetical protein
Sbal195_3294  -1      12       1.187207   LysR family transcriptional regulator
Sbal195_3295  -1      11       1.085208   auxin efflux carrier
Sbal195_3296  -1      12       1.235409   recombination and repair protein
Sbal195_3297  0       14       1.964646   phosphatidylglycerophosphatase A
Sbal195_3298  0       13       2.538978   thiamine-monophosphate kinase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3291  HTHFIS        64     9e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 64.5 bits (157), Expect = 9e-13
Identities = 28/124 (22%), Positives = 48/124 (38%), Gaps = 2/124 (1%)

Query: 678 QSLTVLAVDDNFANLKLIDTLLSELVTTVIAVNSGDEAVKQAKTRTFDLIFMDIQMPGTD 737
T+L DD+ A +++ LS V ++ + DL+ D+ MP +
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 738 GISATKQIRQGSMNRNTPIIAVTAHAIAEERELILGSGMDGYLPKPIDEEALKDVIHRWI 797
+I+ + P++ ++A G YLPKP D L +I R +
Sbjct: 62 AFDLLPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119

Query: 798 TRPK 801
PK
Sbjct: 120 AEPK 123


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3292  BACINVASINB   28     0.024    Salmonella/Shigella invasin protein B signature.
		>BACINVASINB#Salmonella/Shigella invasin protein B signature.

Length = 593

Score = 28.2 bits (62), Expect = 0.024
Identities = 17/41 (41%), Positives = 25/41 (60%)

Query: 142 EALDDFVFAHEVMEEEKELQNSLLEIIEENPKITAELVKGL 182
EAL DF+ A M++ ++ +EI EN K+TAEL K +
Sbjct: 533 EALADFMLARFAMDQIQQWLKQSVEIFGENQKVTAELQKAM 573


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3296  GPOSANCHOR    37     2e-04    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 37.0 bits (85), Expect = 2e-04
Identities = 55/328 (16%), Positives = 105/328 (32%), Gaps = 28/328 (8%)

Query: 59 ANKTEVSAR--FSLDDIPLAKRWLEDNDLELDDECILRRTIGSDGRSRAYINGNPVPLTQ 116
T + L K + E+++ + + ++A +
Sbjct: 34 VVNTNEVSAVATRSQTDTLEKVQERADKFEIENNTLKLKNSDLSFNNKALKD-------H 86

Query: 117 LKLLGQLLIGIHGQHAHHAMLKSEHQLTLLDSYANHRLLIDTVAASFQRCKQIEADLKQL 176
L + L + + SE + + A L + + A +K L
Sbjct: 87 NDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTL 146

Query: 177 EASQHERIARKQLVQYQVEELDEFDLKVDEFDEIEQEHKRLANGTELIDTCQASLDILTE 236
EA + ARK ++ +E F + K L ++ QA L E
Sbjct: 147 EAEKAALAARKADLEKALEGAMNFST------ADSAKIKTLEAEKAALEARQAEL----E 196

Query: 237 GEENNIESLLNRVVSLAEDLQSYDPALSNINTMLNDALIQVQESAGELQHYLSKLELDPT 296
+ + + L++ AL+ L AL + + LE +
Sbjct: 197 KALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAE-- 254

Query: 297 HFAYLEERLSKAMQLARKHHVSPNKLAEHHLALKAELSTLDSDESKLEEIQLQVDASRAA 356
A LE R ++ + + L+AE + L+++++ LE ++A+R
Sbjct: 255 -KAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQ- 312

Query: 357 YLSNAQKLSQSRARYAK---ELDKLVTQ 381
S + L SR + E KL Q
Sbjct: 313 --SLRRDLDASREAKKQLEAEHQKLEEQ 338



Score = 36.2 bits (83), Expect = 4e-04
Identities = 39/217 (17%), Positives = 71/217 (32%), Gaps = 14/217 (6%)

Query: 167 KQIEADLKQLEASQHERIARKQLVQYQVEELD-EFDLKVDEFDEIEQEHKRLANGTELID 225
+ A A K ++ + EL+ + ++ + K L +
Sbjct: 165 EGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALA 224

Query: 226 TCQASLDILTEGEENNIESLLNRVVSLAEDLQSYDPALSNINTMLNDALIQVQESAGELQ 285
+A L+ EG N + ++ +L + + L L AL +
Sbjct: 225 ARKADLEKALEGAMNFSTADSAKIKTLEAEKAA----LEARQAELEKALEGAMNFSTADS 280

Query: 286 HYLSKLELDPTHFAYLEERLSKAMQLARKHHVSPNKLAEHHLALKAELSTLDSDESKLEE 345
+ LE A LE + ++ + + L A + L+++ KLEE
Sbjct: 281 AKIKTLE---AEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEE 337

Query: 346 IQLQVDASRAAYLSNAQKLSQSRARYAK---ELDKLV 379
Q S A+ S + L SR + E KL
Sbjct: 338 ---QNKISEASRQSLRRDLDASREAKKQLEAEHQKLE 371


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3298  TYPE3IMQPROT  27     0.025    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 27.4 bits (61), Expect = 0.025
Identities = 9/39 (23%), Positives = 16/39 (41%)

Query: 76 LSDLAAMGAEPAWMTLALTLPEVDETWLSGFSEGLFEAA 114
+ DL G + ++ L L+ + G GLF+
Sbjct: 1 MDDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTV 39


84  Sbal195_3417  Sbal195_3428  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_3417  4       26       -0.130347  translation initiation factor IF-2
Sbal195_3418  1       15       -0.177872  transcription elongation factor NusA
Sbal195_3419  0       15       0.089996   hypothetical protein
Sbal195_3420  1       15       0.266846   **preprotein translocase subunit SecG
Sbal195_3421  0       14       0.201102   triosephosphate isomerase
Sbal195_3422  0       16       0.065939   phosphoglucosamine mutase
Sbal195_3423  0       16       0.340817   dihydropteroate synthase
Sbal195_3424  1       16       0.177618   ATP-dependent metalloprotease FtsH
Sbal195_3425  0       14       -0.243170  23S rRNA methyltransferase J
Sbal195_3426  -1      14       -0.404312  hypothetical protein
Sbal195_3427  -1      13       0.189325   preprotein translocase subunit SecF
Sbal195_3428  -2      11       0.362845   preprotein translocase subunit SecD
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3417  TCRTETOQM     72     5e-15    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 72.2 bits (177), Expect = 5e-15
Identities = 51/202 (25%), Positives = 78/202 (38%), Gaps = 30/202 (14%)

Query: 387 IMGHVDHGKTSLLDYIRRAKVAAGEAG------------------GITQHIGAYHVETEN 428
++ HVD GKT+L + + A E G GIT G + EN
Sbjct: 8 VLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQWEN 67

Query: 429 GMITFLDTPGHAAFTAMRARGAKATDIVVLVVAADDGVMPQTIEAIQHAKAGNVPLIVAV 488
+ +DTPGH F A R D +L+++A DGV QT + +P I +
Sbjct: 68 TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIPTIFFI 127

Query: 489 NKMDKPEADIDRV----KSELSQHGVMS-------EDWGGDNMFAFVSAKTGEGVDELLE 537
NK+D+ D+ V K +LS V+ + + EG D+LLE
Sbjct: 128 NKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEGNDDLLE 187

Query: 538 GILLQAEVLELKAVRDGMAAGV 559
+ + LE + +
Sbjct: 188 K-YMSGKSLEALELEQEESIRF 208


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3420  SECGEXPORT    118    4e-38    Protein-export SecG membrane protein signature.
		>SECGEXPORT#Protein-export SecG membrane protein signature.

Length = 110

Score = 118 bits (297), Expect = 4e-38
Identities = 61/111 (54%), Positives = 81/111 (72%), Gaps = 1/111 (0%)

Query: 1 MYEVLVVIYLLVALGLIGLVLIQQGKGADMGASFGAGASGTLFGSSGSGNFLTRTTAILA 60
MYE L+V++L+VA+GL+GL+++QQGKGADMGASFGAGAS TLFGSSGSGNF+TR TA+LA
Sbjct: 1 MYEALLVVFLIVAIGLVGLIMLQQGKGADMGASFGAGASATLFGSSGSGNFMTRMTALLA 60

Query: 61 IAFFTLSLLIGNLSANHAKNEDSWNNLGSDTEQVTQPVEQGTQKSETKIPD 111
FF +SL++GN+++N W NL S + Q K + IP+
Sbjct: 61 TLFFIISLVLGNINSNKTNKGSEWENL-SAPAKTEQTQPAAPAKPTSDIPN 110


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3421  adhesinb      31     0.003    Adhesin B signature.
		>adhesinb#Adhesin B signature.

Length = 310

Score = 31.4 bits (71), Expect = 0.003
Identities = 18/95 (18%), Positives = 34/95 (35%), Gaps = 16/95 (16%)

Query: 142 REARRTFEVIAEELDIVIQKNGTMAFDNAIIAY----EPLWAVGTGKSATPEQAQEVHAF 197
+EA+ F I E +++ G F AY +W + T + TP+Q + +
Sbjct: 186 KEAKEKFNNIPGEKKMIVTSEG--CFKYFSKAYNVPSAYIWEINTEEEGTPDQIKTLVEK 243

Query: 198 IRKRLSEVSPFIGENIRILYGGSVTPSNAADLFAQ 232
+RK + L+ S ++
Sbjct: 244 LRKT----------KVPSLFVESSVDDRPMKTVSK 268


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3424  HTHFIS        34     0.002    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 34.0 bits (78), Expect = 0.002
Identities = 23/82 (28%), Positives = 32/82 (39%), Gaps = 18/82 (21%)

Query: 198 VLMVGPPGTGKTLLAKAIAGESK---VPFFT-----ISGSDFVEMFVGV------GASRV 243
+++ G GTGK L+A+A+ K PF I G GA
Sbjct: 163 LMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTR 222

Query: 244 RD-MFEQAKKSAPCIIFIDEID 264
FEQA+ +F+DEI
Sbjct: 223 STGRFEQAEGGT---LFLDEIG 241


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3427  SECFTRNLCASE  246    1e-82    Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 246 bits (630), Expect = 1e-82
Identities = 91/306 (29%), Positives = 161/306 (52%), Gaps = 14/306 (4%)

Query: 2 KNINLTKWRYVSSAISILLMITSLAIIGVKGFNWGLDFTGGVVTEVQLDRKITSSELQPL 61
N + +W++ + +I++MI S+ + V G N+G+DF GG + I +
Sbjct: 12 TNFDFFRWQWATFGAAIVMMIASVILPLVIGLNFGIDFKGGTTIRTESTTAIDVGVYRAA 71

Query: 62 LNAAYQQEVSVILASEPG----------RWVLRYGDIKSADTEQSNVDIQ----QALAPL 107
L +V + +P R ++ + ++ AL +
Sbjct: 72 LEPLELGDVIISEVRDPSFREDQHVAMIRIQMQEDGQGAEGQGAQGQELVNKVETALTAV 131

Query: 108 NSEVQVLNSSVVGPQIGQELAEQGGLALLVAMLCILGYLSYRFEWRLASGALFALVHDVV 167
+ +++ + VGP++ EL +LL A + I+ Y+ RFEW+ A GA+ ALVHDV+
Sbjct: 132 DPALKITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVVALVHDVL 191

Query: 168 FVLAFFALTQMEFNLTVLAAVLAILGYSLNDSIIIADRIRELLIAKPKLAIQEINNQAIV 227
+ FA+ Q++F+LT +AA+L I GYS+ND++++ DR+RE LI + ++++ N ++
Sbjct: 192 LTVGLFAVLQLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRDVMNLSVN 251

Query: 228 ATFSRTMVTSGTTLMTVGALWIMGGGPLEGFSIAMFIGILTGTFSSISVGTSLPELLGLT 287
T SRT++T TTL+ + + I GG + GF AM G+ TGT+SS+ V ++ +GL
Sbjct: 252 ETLSRTVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAKNIVLFIGLD 311

Query: 288 PEHYKE 293
K+
Sbjct: 312 RNKEKK 317


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3428  SECFTRNLCASE  78     1e-17    Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 78.0 bits (192), Expect = 1e-17
Identities = 30/172 (17%), Positives = 82/172 (47%), Gaps = 4/172 (2%)

Query: 422 VTIVEERTIGPTLGAENIENGFAALGLGMGITLLFMALWYR-RLGWVANIALISNMVILF 480
+ I ++GP + E + +L + + ++ + + + A +AL+ ++++
Sbjct: 135 LKITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVVALVHDVLLTV 194

Query: 481 GLLALIPGAVLTLPGIAGLVLTVGMAVDTNVLIFERIKDKLKEGRSFALA--IDTGFDSA 538
GL A++ L +A L+ G +++ V++F+R+++ L + ++ L ++ +
Sbjct: 195 GLFAVL-QLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRDVMNLSVNET 253

Query: 539 FSTIFDANFTTMITAVVLYSIGNGPIQGFALTLGLGLLTSMFTGIFASRALI 590
S TT++ V + G I+GF + G+ T ++ ++ ++ ++
Sbjct: 254 LSRTVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAKNIV 305


85  Sbal195_3686  Sbal195_3690  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_3686  -2      13       0.485492   aldo/keto reductase
Sbal195_3687  -1      13       -0.314512  MltA-interacting MipA family protein
Sbal195_3688  0       16       0.047917   hypothetical protein
Sbal195_3689  -1      13       2.081724   integral membrane sensor signal transduction
Sbal195_3690  -1      12       0.785821   two component transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3686  HELNAPAPROT   32     0.001    Helicobacter neutrophil-activating protein A family ...
		>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature.

Length = 153

Score = 32.2 bits (73), Expect = 0.001
Identities = 19/88 (21%), Positives = 36/88 (40%), Gaps = 16/88 (18%)

Query: 109 IHQAVDASLARLQIDTIDLYQIHWPDRNTNFFG--ELFYDQQDQEHQTPILETLEALAEV 166
+ +++ L+ + L++ HW + +FF E F +E ET++ +AE
Sbjct: 13 VENSLNTQLSNWFLLYSKLHRFHWYVKGPHFFTLHEKF-----EELYDHAAETVDTIAER 67

Query: 167 IRQGKVRYIGVSNETPWGLMK-YLQLAE 193
+ IG P +K Y + A
Sbjct: 68 LLA-----IGGQ---PVATVKEYTEHAS 87


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3687  IGASERPTASE   30     0.015    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 29.6 bits (66), Expect = 0.015
Identities = 28/115 (24%), Positives = 43/115 (37%), Gaps = 6/115 (5%)

Query: 127 IHLGTGTLSTKFQ--HDVTNVYDGFQADITYYHPINLGFGDLVPYAGVHYFSKDFANYYT 184
I LG G +K Q H+ Q +T NLG + P GV Y A++
Sbjct: 1377 IDLGYGKFQSKLQTNHNAKFARHTAQFGLTAGKAFNLGNFGITPIVGVRYSYLSNADFAL 1436

Query: 185 G---VTSSEATAQRPAYQADGTFAYKLGYALVIP-LTKHLDITQATGYSRIAANM 235
+ + + + Q D ++ Y LG V P L+ D Q +G +
Sbjct: 1437 DQARIKVNPISVKTAFAQVDLSYTYHLGEFSVTPILSARYDANQGSGKINVNGYD 1491


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3689  PF06580       29     0.031    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.1 bits (65), Expect = 0.031
Identities = 23/156 (14%), Positives = 46/156 (29%), Gaps = 36/156 (23%)

Query: 258 RDLDTMEDLVMTLLSYARLDEANIQPDWQSIELNAWLLEKYQGQVYPDFSVELVSYPTAL 317
D +++ +L R S+ +++ Y + + + L
Sbjct: 188 EDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSY-------LQLASIQFEDRL 240

Query: 318 K--IKTDPKYLSMQVNNLL-----NNALRFG------KAKIRLTLAVEEGATWLHVDDDG 364
+ + +P + +QV +L N ++ G KI L + G L V++ G
Sbjct: 241 QFENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTG 300

Query: 365 PGIDELESAQVIKPFVRGQHSRGNSGHGMGLAIVDR 400
+ G GL V
Sbjct: 301 SLALK----------------NTKESTGTGLQNVRE 320


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3690  HTHFIS        76     6e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 75.6 bits (186), Expect = 6e-18
Identities = 37/126 (29%), Positives = 56/126 (44%), Gaps = 1/126 (0%)

Query: 6 HILVVEDDISLAEWISDYLLDHGYEVTVASQGDFALEMIAEEIPDLVLLDVMLPVKNGFD 65
ILV +DD ++ ++ L GY+V + S IA DLV+ DV++P +N FD
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 66 VCKEARAFYAG-PILFMTACVEDGDEIRGLDVGADDYLTKPIRPQVLLARIKALLRRVGD 124
+ + P+L M+A I+ + GA DYL KP L+ I L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 125 EEQKQQ 130
K +
Sbjct: 125 RPSKLE 130


86  Sbal195_3806  Sbal195_3811  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_3806  -1      17       1.991245   5-formyltetrahydrofolate cyclo-ligase
Sbal195_3807  0       16       2.303104   short chain dehydrogenase
Sbal195_3808  -1      15       1.464488   putative thiol-disulfide oxidoreductase DCC
Sbal195_3809  -1      11       1.524531   malate dehydrogenase
Sbal195_3810  0       12       1.984818   arginine repressor
Sbal195_3811  2       14       2.392978   NAD-dependent epimerase/dehydratase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3806  OMS28PORIN    29     0.019    OMS28 porin signature.
		>OMS28PORIN#OMS28 porin signature.

Length = 257

Score = 28.6 bits (63), Expect = 0.019
Identities = 14/44 (31%), Positives = 25/44 (56%), Gaps = 1/44 (2%)

Query: 46 NRNQLRKSIRTARKSLSETEQIQASLSASQRMLDALQAQNAQHV 89
N++ K + ++ ++ EQ++ +L AS+R LD Q AQ V
Sbjct: 166 NKSPNNKELELTKEEFAKVEQVKETLMASERALDE-TVQEAQKV 208


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3807  DHBDHDRGNASE  49     3e-09    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 49.3 bits (117), Expect = 3e-09
Identities = 53/264 (20%), Positives = 97/264 (36%), Gaps = 36/264 (13%)

Query: 5 IIITGVGKRIGYALAKHFLAQGQQVIG-----TYRSHYDSIDELNALGATLYPCDFYDDA 59
ITG + IG A+A+ +QG + S + A A +P D D A
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSA 70

Query: 60 QVQSLIDEL-TQLPQIRAIIHNASDWLPDPVLIKNEPLKSTTFTPSQVLQRMMQVHVSVP 118
+ + + ++ I +++ A P + ++ TF+ V+ +
Sbjct: 71 AIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFS----------VNSTGV 120

Query: 119 YQLNLALEAQLRAAAGDEIGGSDVIHITDYVAEKGSQKHIAYAASKAALHNMTLSFAAKF 178
+ + ++ + D GS V + A AYA+SKAA T +
Sbjct: 121 FNASRSVSKYMM----DRRSGSIVT-VGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLEL 175

Query: 179 APE-VKVNSIAPAMILFNM-----GDDAAYQQKTLAKAL-------LPKEAGNAEIIELV 225
A ++ N ++P +M D+ +Q L K A ++I + V
Sbjct: 176 AEYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAV 235

Query: 226 DYLL--NSRYVTGRCHNVDGGRQL 247
+L+ + ++T VDGG L
Sbjct: 236 LFLVSGQAGHITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3810  ARGREPRESSOR  145    1e-47    Bacterial arginine repressor signature.
		>ARGREPRESSOR#Bacterial arginine repressor signature.

Length = 149

Score = 145 bits (367), Expect = 1e-47
Identities = 43/150 (28%), Positives = 71/150 (47%), Gaps = 5/150 (3%)

Query: 6 NQDDLVRIFKSILKEERFGSQSEIVTALQAEGFGNINQSKVSRMLSKFGAVRTRNAKQEM 65
N+ + I+ +Q E+V L+ +G+ N+ Q+ VSR + + V+
Sbjct: 2 NKGQRHIKIREIITANEIETQDELVDILKKDGY-NVTQATVSRDIKELHLVKVPTNNGSY 60

Query: 66 VYCLPAELGVPTAGSPLKNLV---LDVDHNQAMIVVRTSPGAAQLIARLLDSIGKPEGIL 122
Y LPA+ ++L+ + +D +IV++T PG AQ I L+D++ E I+
Sbjct: 61 KYSLPADQRFNPLSKLKRSLMDAFVKIDSASHLIVLKTMPGNAQAIGALMDNLDWEE-IM 119

Query: 123 GTIAGDDTIFICPSSIQDIADTLETIKSLF 152
GTI GDDTI I + D + I L
Sbjct: 120 GTICGDDTILIICRTHDDTKVVQKKILELL 149


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3811  NUCEPIMERASE  39     6e-06    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 39.4 bits (92), Expect = 6e-06
Identities = 30/123 (24%), Positives = 47/123 (38%), Gaps = 23/123 (18%)

Query: 1 MKIAILGATGWIGGAILKEALSRGHQVTAL-----VRDPS-------KLSATDVAVHAVD 48
MK + GA G+IG + K L GHQV + D S L+ H +D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 49 LE-QPLVAQTFA--GVDVVI-----AAVGGRAQQNHDLVASTV---QHLLDVLPNAKVPR 97
L + + FA + V AV + H S + ++L+ + K+
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 98 LLW 100
LL+
Sbjct: 121 LLY 123


87  Sbal195_3829  Sbal195_3842  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_3829  -2      14       -0.601456  hypothetical protein
Sbal195_3830  -2      13       -0.850636  hypothetical protein
Sbal195_3831  -2      13       -0.740055  ammonium transporter
Sbal195_3832  1       14       -0.853774  nitrogen regulatory protein P-II
Sbal195_3833  1       15       -0.453162  isochorismatase hydrolase
Sbal195_3834  1       13       0.082954   cytochrome c552
Sbal195_3835  0       15       0.613770   nitrate/nitrite sensor protein NarQ
Sbal195_3836  -2      17       1.464374   two component LuxR family transcriptional
Sbal195_3837  -2      13       1.217136   hypothetical protein
Sbal195_3838  -2      11       1.229902   Mg2 transporter protein CorA family protein
Sbal195_3839  -2      12       1.160121   succinylglutamate desuccinylase/aspartoacylase
Sbal195_3840  -1      11       -0.296660  aspartate kinase III
Sbal195_3841  -3      11       -3.094928  hypothetical protein
Sbal195_3842  -3      10       -3.239604  two-component response regulator
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3829  SYCDCHAPRONE  32     0.002    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 32.2 bits (73), Expect = 0.002
Identities = 22/118 (18%), Positives = 43/118 (36%), Gaps = 4/118 (3%)

Query: 280 LTTLYNLALILGDQGRLDEWAEINKVLELARIRNPYYYYDMAQQAFDEHQYDEALAWYQR 339
L LY+LA G+ ++ ++ + L + + ++ + QYD A+ Y
Sbjct: 36 LEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSY 95

Query: 340 A--LAKADYRHEFFFGLSKTYWALGDEKRAKLNMEKALALSRDDSERHRYQNKLQVML 395
+ + R F G+ A+ + A L D +E ++ ML
Sbjct: 96 GAIMDIKEPRFPFHAAEC--LLQKGELAEAESGLFLAQELIADKTEFKELSTRVSSML 151


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3833  ISCHRISMTASE  43     2e-07    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 43.1 bits (101), Expect = 2e-07
Identities = 39/190 (20%), Positives = 68/190 (35%), Gaps = 31/190 (16%)

Query: 2 LKPEECVLVIVDVQGKLAQIMDNS----DKLHQQLQTLIQGAQLFEIPILWLEQLPDKLG 57
P VL+I D+Q +L ++ L IP+++ Q P G
Sbjct: 26 PDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANIRKLKNQCVQLGIPVVYTAQ-P---G 81

Query: 58 ATSPELQTLLEK------TGSP-----------------IAKQHFSGWHCEEFAQALTKT 94
+ +P+ + LL P + K +S + + + K
Sbjct: 82 SQNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLVLTKWRYSAFKRTNLLEMMRKE 141

Query: 95 DRKHVILAGIETHVCVYQTCCDLIEQQYSVHLVADGVSSRSADNKQLGIQMMTARGALLT 154
R +I+ GI H+ T C+ + V D V+ S + Q+ ++ R A
Sbjct: 142 GRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVADFSLEKHQMALEYAAGRCAFTV 201

Query: 155 NVESLLFELQ 164
+SLL +LQ
Sbjct: 202 MTDSLLDQLQ 211


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3835  PF06580       42     4e-06    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 42.2 bits (99), Expect = 4e-06
Identities = 27/147 (18%), Positives = 55/147 (37%), Gaps = 17/147 (11%)

Query: 415 INEGVSTAYVQLRELLSTFRLTIK-EPDLKSALEAMLEQLRAKTNI-------KITLDYK 466
I E + A L L R +++ + +L L + + + ++ + +
Sbjct: 186 ILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQ 245

Query: 467 LAPQWLEAKQHIHILQITREATLNAIKHA-----EASLINIHCYKDDKGMVNIDVCDNGI 521
+ P ++ + ++Q E N IKH + I + KD+ G V ++V + G
Sbjct: 246 INPAIMDVQVPPMLVQTLVE---NGIKHGIAQLPQGGKILLKGTKDN-GTVTLEVENTGS 301

Query: 522 GIGHLKERDQHFGIGIMHERASKLSGK 548
+ G+ + ER L G
Sbjct: 302 LALKNTKESTGTGLQNVRERLQMLYGT 328


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3836  HTHFIS        61     2e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 61.4 bits (149), Expect = 2e-13
Identities = 27/159 (16%), Positives = 61/159 (38%), Gaps = 9/159 (5%)

Query: 6 SVLVVDDHPLLRKGICQLIASDPDFSLFGEAGGGLDALTAVATDEPDIILLDLNMKGMTG 65
++LV DD +R + Q S + + +A + D+++ D+ M
Sbjct: 5 TILVADDDAAIRTVLNQ-ALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 66 LDTLNAMRQEGVTSRIVILTVSDAKQDVIRLLRAGADGYLLKDTEPDLLLEKLKNAMLGH 125
D L +++ +++++ + I+ GA YL K + L+ + A+
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL--- 119

Query: 126 RVISDEVEEYLYELKDATDEQEWISSLTPRELQILEQLA 164
E + +L+D + + + + +I LA
Sbjct: 120 ----AEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLA 154


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3840  DHBDHDRGNASE  29     0.026    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 29.2 bits (65), Expect = 0.026
Identities = 24/86 (27%), Positives = 42/86 (48%), Gaps = 5/86 (5%)

Query: 10 GTSVADYNAMNRCADIVLANPHCRLVVVSASSGVTNLLVELTQESINDDGRLQRLK-QIA 68
+S A +C + LA + R +VS S T++ L + ++G Q +K +
Sbjct: 158 ASSKAAAVMFTKCLGLELAEYNIRCNIVSPGSTETDMQWSLWAD---ENGAEQVIKGSLE 214

Query: 69 QIQYAI-LDKLGRPNDVAAALDKLLS 93
+ I L KL +P+D+A A+ L+S
Sbjct: 215 TFKTGIPLKKLAKPSDIADAVLFLVS 240


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3842  HTHFIS        83     1e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 83.3 bits (206), Expect = 1e-20
Identities = 31/131 (23%), Positives = 61/131 (46%), Gaps = 1/131 (0%)

Query: 1 MQNPHILIVEDEAVTRNTLRSIFEAEGYVVTEANDGAEMHKAMQENKINLVVMDINLPGK 60
M IL+ +D+A R L GY V ++ A + + + +LVV D+ +P +
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 NGLLLARELREIN-NIGLIFLTGRDNEVDKILGLEIGADDYITKPFNPRELTIRARNLLT 119
N L +++ ++ ++ ++ ++ + I E GA DY+ KPF+ EL L
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 120 RVNSAGAEVEE 130
+++E+
Sbjct: 121 EPKRRPSKLED 131


88  Sbal195_3942  Sbal195_3948  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_3942  1       15       -1.427498  response regulator receiver protein
Sbal195_3943  0       13       -2.007581  histone family protein DNA-binding protein
Sbal195_3944  0       14       -1.998649  hypothetical protein
Sbal195_3945  0       17       -2.414630  alpha-L-glutamate ligase
Sbal195_3946  1       14       -1.920196  response regulator receiver modulated
Sbal195_3947  0       12       -0.802211  response regulator receiver protein
Sbal195_3948  0       11       -0.017344  multi-sensor signal transduction histidine
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3942  HTHFIS        61     8e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 61.0 bits (148), Expect = 8e-13
Identities = 25/107 (23%), Positives = 44/107 (41%), Gaps = 3/107 (2%)

Query: 146 RVLVVDDSRMARNVIKRTIGNLGMKLITEAEDGAQAIELMKNNMFDLVITDYNMPSVDGL 205
+LV DD R V+ + + G + + A + DLV+TD MP +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 206 ALTQYIRNESQQSHIPILMVSSEANDTHLSNVSQAGVNALCDKPFEP 252
L I+ + +P+L++S++ S+ G KPF+
Sbjct: 64 DLLPRIKKA--RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDL 108



Score = 47.1 bits (112), Expect = 3e-08
Identities = 34/155 (21%), Positives = 60/155 (38%), Gaps = 6/155 (3%)

Query: 10 SILLVEPSDIQRRIIIQRLQQEGILSIQTAENIEAAKEIIARHKPDLIASAMHFDDGTAI 69
+IL+ + R ++ Q L + G ++ N IA DL+ + + D A
Sbjct: 5 TILVADDDAAIRTVLNQALSRAG-YDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 70 DLLGYLRTSADCKDIQFMLVSSECRREQLEIFRQSGVVAILPKPFSADHLATALNATIDL 129
DLL ++ + D+ +++S++ + G LPKPF L + +
Sbjct: 64 DLLPRIKKAR--PDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 130 LSHDELDLSNFDVQDVRVLVVDDSRM--ARNVIKR 162
L + D QD LV + M V+ R
Sbjct: 122 PKRRPSKLED-DSQDGMPLVGRSAAMQEIYRVLAR 155


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3943  DNABINDINGHU  109    2e-35    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 109 bits (275), Expect = 2e-35
Identities = 45/88 (51%), Positives = 66/88 (75%)

Query: 2 NKTELIAKIAENADLTKVEAARALKSFEAAITESMKNGDKISIVGFGSFETATRAARTGR 61
NK +LIAK+AE +LTK ++A A+ + +A++ + G+K+ ++GFG+FE RAAR GR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 62 NPQTGKEIQIAEATVPKFKAGKTLRDSV 89
NPQTG+EI+I + VP FKAGK L+D+V
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAV 90


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3946  HTHFIS        61     7e-12    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 60.6 bits (147), Expect = 7e-12
Identities = 29/102 (28%), Positives = 48/102 (47%), Gaps = 3/102 (2%)

Query: 3 LLLIDDDEVDRTAIIRALRQSKLTFNVIEANCAFDGLNLALERHFDGILLDYLLPDANGL 62
+L+ DDD RT + +AL S+ ++V + A D ++ D ++PD N
Sbjct: 6 ILVADDDAAIRTVLNQAL--SRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 63 EVLIKLNAMTQDQTVVVMLSRYEDEKLAQRCIELGAQDFLLK 104
++L ++ D V+VM S A + E GA D+L K
Sbjct: 64 DLLPRIKKARPDLPVLVM-SAQNTFMTAIKASEKGAYDYLPK 104


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3947  HTHFIS        48     1e-09    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 48.3 bits (115), Expect = 1e-09
Identities = 24/112 (21%), Positives = 44/112 (39%), Gaps = 10/112 (8%)

Query: 8 QQVTILLVDDDDVDYMAVQRAMRQLRLLNPLVRARDGIEALAILTSLDTIKGPYLILLDL 67
TIL+ DDD + +A+ + + + + + L++ D+
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGY--DVRITSNAATLWRWIAA----GDGDLVVTDV 55

Query: 68 NMPRMNGFEFLERIRS-DPSLSSSVVFMLTTSSTDEDRMKAYSHHVAGYMVK 118
MP N F+ L RI+ P L V +++ +T +KA Y+ K
Sbjct: 56 VMPDENAFDLLPRIKKARPDL---PVLVMSAQNTFMTAIKASEKGAYDYLPK 104


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3948  PF06580       30     0.038    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.8 bits (67), Expect = 0.038
Identities = 20/107 (18%), Positives = 37/107 (34%), Gaps = 22/107 (20%)

Query: 608 LVIRNLISNAIKH---HDLGTGVITVLCESTSKHYLFSVLDDGPGISSAYQNKVFEMFQT 664
++++ L+ N IKH G I + + V + G
Sbjct: 258 MLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGS---------------- 301

Query: 665 LKPRDEVEGSGLGLSLVKKTVESLGGN---IQLKSQGRGCCFYFTWP 708
L ++ E +G GL V++ ++ L G I+L + P
Sbjct: 302 LALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348


89  Sbal195_3964  Sbal195_3974  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_3964  -1      16       2.920103   isochorismatase hydrolase
Sbal195_3965  1       17       2.595551   radical SAM domain-containing protein
Sbal195_3966  1       17       2.312148   antibiotic biosynthesis monooxygenase
Sbal195_3967  1       17       2.156475   N-acetyltransferase GCN5
Sbal195_3968  -2      20       4.078803   EmrB/QacA family drug resistance transporter
Sbal195_3969  -2      20       4.566124   secretion protein HlyD family protein
Sbal195_3970  0       20       4.906576   LysR family transcriptional regulator
Sbal195_3971  0       21       5.085254   large-conductance mechanosensitive channel
Sbal195_3972  1       22       5.233429   antibiotic biosynthesis monooxygenase
Sbal195_3973  0       22       5.288625   CzcA family heavy metal efflux protein
Sbal195_3974  3       20       3.852770   RND family efflux transporter MFP subunit
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3964  ISCHRISMTASE  54     6e-11    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 54.3 bits (130), Expect = 6e-11
Identities = 47/209 (22%), Positives = 78/209 (37%), Gaps = 25/209 (11%)

Query: 30 PTIRTMTQAQAPTELNANTTAVLVIDFQNEYFTGSMP--IPNGKQALGKAKQVVKFAHQN 87
PT M Q + + N +L+ D Q YF + + +++ Q
Sbjct: 12 PTASDMPQNKVSWVPDPNRAVLLIHDMQ-NYFVDAFTAGASPVTELSANIRKLKNQCVQL 70

Query: 88 AMPVYFVRHLGPAA-----------GPLFAEGSVNAEFHQDLQPLEIDFVINKATPSSFV 136
+PV + G GP G + +L P + D V+ K S+F
Sbjct: 71 GIPVVYTAQPGSQNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLVLTKWRYSAFK 130

Query: 137 GTNLDQQLKEKGIKTLVITGLMTHMCVSSTARDAVPMGYDVIIAEDATATRDLATWDGSI 196
TNL + ++++G L+ITG+ H+ TA +A DA A D S+
Sbjct: 131 RTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVA-------DFSL 183

Query: 197 VDHATLQRAAIAGVADVFAEIKTTQAVLN 225
H + A+ A A T ++L+
Sbjct: 184 EKH----QMALEYAAGRCAFTVMTDSLLD 208


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3967  SACTRNSFRASE  38     6e-06    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 37.6 bits (87), Expect = 6e-06
Identities = 21/72 (29%), Positives = 31/72 (43%), Gaps = 5/72 (6%)

Query: 75 ASIGRVVVSPAGRGKGLAMPLMQHAIESALTTWPDAGIQIGAQDY-LKA--FYQKLGFVA 131
A I + V+ R KG+ L+ AIE A G+ + QD + A FY K F+
Sbjct: 90 ALIEDIAVAKDYRKKGVGTALLHKAIEWAKEN-HFCGLMLETQDINISACHFYAKHHFII 148

Query: 132 CS-EMYLEDGIP 142
+ + L P
Sbjct: 149 GAVDTMLYSNFP 160


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3968  TCRTETB       130    7e-35    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 130 bits (328), Expect = 7e-35
Identities = 90/421 (21%), Positives = 176/421 (41%), Gaps = 19/421 (4%)

Query: 25 TDYERGSRRSWIAVFGGLIGAFMAILDIQITNASMKEIQGSLGATLEEGSWISTAYLVAE 84
T Y + + R + I +F ++L+ + N S+ +I +W++TA+++
Sbjct: 3 TSYSQSNLRHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTF 62

Query: 85 MIAIPLSGWLSTGLSVRRYLLWTTAAFIFASVLCSMAWN-LEAMIAFRALQGFFGGALIP 143
I + G LS L ++R LL+ F SV+ + + +I R +QG A
Sbjct: 63 SIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPA 122

Query: 144 LAFRLILEFLPDNKRAVGMALFGVTATFAPSIGPTLGGWLTEQFSWHYLFYINVPPGLLV 203
L ++ ++P R L G +GP +GG + W YL +P ++
Sbjct: 123 LVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLL--LIPMITII 180

Query: 204 MAMLAYGLEKQSVVWDKLKNVDLAGIVTMALGMGCLEVVLEEGNRKDWFGSELIRNLAII 263
L K+ V + D+ GI+ M++G+ + F + + I+
Sbjct: 181 TVPFLMKLLKKEVRIKG--HFDIKGIILMSVGIVFFML----------FTTSYSISFLIV 228

Query: 264 AVVNLVLFVWIQLRRKEPLVNLRLLGKRDFVLSTVAYFLLGMALFGAIYLIPLYLSQVHD 323
+V++ ++FV + +P V+ L F++ + ++ + G + ++P + VH
Sbjct: 229 SVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQ 288

Query: 324 YTPLEIGGVIMWMGFPQLLVL-PLVPKLMERFDSRYLAAFGFLMFAISYYMNSQMTADYA 382
+ EIG VI++ G +++ + L++R Y+ G ++S+ S +
Sbjct: 289 LSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFL--LET 346

Query: 383 GPQMIASQVVRALG-QPFILVPIGMLATMHLKPHENASASTVLNVMRNLGGAFGIALVAT 441
+ +V LG F I + + LK E + ++LN L GIA+V
Sbjct: 347 TSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGG 406

Query: 442 L 442
L
Sbjct: 407 L 407


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3969  RTXTOXIND     99     6e-25    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 98.7 bits (246), Expect = 6e-25
Identities = 42/294 (14%), Positives = 96/294 (32%), Gaps = 28/294 (9%)

Query: 71 LAQLEDNQFSAKVSQAEASLASSKADLQTLAAKVELQRALITQASAGVVAAESDKIRAQQ 130
+ + + S + ++ + ++ +RA A + E+ +
Sbjct: 176 FQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKS 235

Query: 131 QLSRSKKLKVSNYSSQDDVDQLQAGFDSAAARLDEAKA--------VLVAKQRELAVFN- 181
+L L ++ V + + + A L K+ +L AK+ V
Sbjct: 236 RLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQL 295

Query: 182 ------AQLDQAGSVVEQADATLELAKIQLNDTRVTAPFSGVIGKRGAM-VGQYVQPGQA 234
+L Q + L + + + + AP S + + G V +
Sbjct: 296 FKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET 355

Query: 235 LYSLVPDGAV-WITANFKETQIQHMQPGQSVQVSLDAFPDKTFIGVIDSLSPASGAKFSL 293
L +VP+ +TA + I + GQ+ + ++AFP + G + K
Sbjct: 356 LMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRY-GYLV-------GKVKN 407

Query: 294 LPAENATGNFTKIVQRIPVRIRLDLSEAEAHML---PGLSAVVKVDTASGTAIS 344
+ + +V + + I + + G++ ++ T + IS
Sbjct: 408 INLDAIEDQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVIS 461


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3971  MECHCHANNEL   170    8e-58    Bacterial mechano-sensitive ion channel signature.
		>MECHCHANNEL#Bacterial mechano-sensitive ion channel signature.

Length = 136

Score = 170 bits (431), Expect = 8e-58
Identities = 85/136 (62%), Positives = 110/136 (80%), Gaps = 1/136 (0%)

Query: 1 MSLIKEFKAFASRGNVIDMAVGIIIGAAFGKIVSSFVADIIMPPIGIILGGVNFSDLSIV 60
MS+IKEF+ FA RGNV+D+AVG+IIGAAFGKIVSS VADIIMPP+G+++GG++F ++
Sbjct: 1 MSIIKEFREFAMRGNVVDLAVGVIIGAAFGKIVSSLVADIIMPPLGLLIGGIDFKQFAVT 60

Query: 61 LQAAQGDAPSVVIAYGKFIQTIIDFTIIAFAIFMGVKAINRLKRKEEVAPKAPAAPTKDQ 120
L+ AQGD P+VV+ YG FIQ + DF I+AFAIFM +K IN+L RK+E P A APTK++
Sbjct: 61 LRDAQGDIPAVVMHYGVFIQNVFDFLIVAFAIFMAIKLINKLNRKKE-EPAAAPAPTKEE 119

Query: 121 ELLSEIRDLLKAQQEK 136
LL+EIRDLLK Q +
Sbjct: 120 VLLTEIRDLLKEQNNR 135


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3973  ACRIFLAVINRP  662    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 662 bits (1709), Expect = 0.0
Identities = 225/1075 (20%), Positives = 434/1075 (40%), Gaps = 73/1075 (6%)

Query: 9 AIKNRLLVVLALLAMIVASVVMLPKLNLDAFPDVTNVQVTINTAAEGLAAEEVEKLISYP 68
I+ + + + +++A + + +L + +P + V+++ G A+ V+ ++
Sbjct: 5 FIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQV 64

Query: 69 VESAMYALPAVTEVRSLS-RTGLSIVTVVFAEGTDIYFARQQVFEQLQAAREMIPSGVGV 127
+E M + + + S S G +T+ F GTD A+ QV +LQ A ++P V
Sbjct: 65 IEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQQ 124

Query: 128 PEIGPNTSGLGQIYQYILRAEPNSGIDAAELRSLNDYLVKLIMMPVGGVTEVLSFGGDVR 187
I S + ++ N G ++ VK + + GV +V FG
Sbjct: 125 QGISVEKSSSSYLMVAGFVSD-NPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQ-Y 182

Query: 188 QYQVQVDPNKLRAYGLSMAQVTEALESNNRNAGGWFMDQGQE------QLVVRGYGMLPA 241
++ +D + L Y L+ V L+ N + +
Sbjct: 183 AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKN 242

Query: 242 GEEGLAAIAQIPLTEDK-GTPVRVGDIAQVDFGSEIRVGAVTMTRRDEAGNVQNLGEVVA 300
EE ++ L + G+ VR+ D+A+V+ G E + N
Sbjct: 243 PEE----FGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARI----------NGKPAAG 288

Query: 301 GVVLKRMGANTKATIDDIGARVSLIEQALPDGVSFEVFYDQAELVDKAVTTVRDALLMAF 360
+ GAN T I A+++ ++ P G+ YD V ++ V L A
Sbjct: 289 LGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAI 348

Query: 361 VFIVVILALFLVNIRATLLVLLSIPVSIGLALMVMSYYGLSANLMSLGGLAVAIGMLVDG 420
+ + +++ LFL N+RATL+ +++PV + +++ +G S N +++ G+ +AIG+LVD
Sbjct: 349 MLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDD 408

Query: 421 SVVMVENIFKHLTQPDRRHLLEARTRADGEADPYHSDEDGGQQANMAVRIMLAAKEVCSP 480
++V+VEN+ + + ED + M ++
Sbjct: 409 AIVVVENVERVMM------------------------EDKLPPKEATEKSM---SQIQGA 441

Query: 481 IFFATAIIIVVFAPLFALEGVEGKLFQPMAVSIILAMISALLVALIAVPALAVYLFK--- 537
+ ++ VF P+ G G +++ +++I+ AM ++LVALI PAL L K
Sbjct: 442 LVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVS 501

Query: 538 -------RGVVLKQSVVLAPLDAAYRKLLTATLARPKVVMLSALLMFALSLLLLPRLGTE 590
G + Y + L +L L+ A ++L RL +
Sbjct: 502 AEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSS 561

Query: 591 FVPELEEGTINLRVTLAPTASLGTSLAVAPKLEAILLEFPEVEYALSRIGAPELGGDPEP 650
F+PE ++G + L A+ + V ++ L+ + S +
Sbjct: 562 FLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNEKANVE-SVFTVNGFSFSGQA 620

Query: 651 VSNIEVYIGLKPISEWQSASSRLE--LQRLMEEKLSVFPGLLLTFSQPIATRVDELLSGV 708
+ ++ LKP E + E + R E + G ++ F+ P + EL +
Sbjct: 621 QNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGFVIPFNMP---AIVELGTAT 677

Query: 709 KAQLA-IKIFGPDLAVLSERGQALTDLVAKIPGAV-DVSLEQVSGEAQLVVRPKRELLAR 766
I G L++ L + A+ P ++ V + AQ + +E
Sbjct: 678 GFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQA 737

Query: 767 YGISVDQVMSLVSQGIGGASAGQVIDGNARYDINVRLAAEFRTSPDAIKDLLLSGTNGAT 826
G+S+ + +S +GG ID + V+ A+FR P+ + L + NG
Sbjct: 738 LGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEM 797

Query: 827 VRLGEVASVEVEMAPPNIRRDDVQRRVVVQANVA-GRDMGSVVKDIYALVPQADLPAGYT 885
V + P + R + + +Q A G G + + L + LPAG
Sbjct: 798 VPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAMALMENLASK--LPAGIG 855

Query: 886 VIIGGQYENQQRAQQKLMLVVPISIALIALLLYFSFGSFKQVLLIMANVPLALIGGIVAL 945
G ++ + + +V IS ++ L L + S+ + +M VPL ++G ++A
Sbjct: 856 YDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAA 915

Query: 946 YVSGTYLSVPSSIGFITLFGVAVLNGVVLVDSINQ-RRQSGEALYDCVYEGTVGRLRPVL 1004
+ V +G +T G++ N +++V+ + G+ + + RLRP+L
Sbjct: 916 TLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPIL 975

Query: 1005 MTALTSALGLIPILLSSGVGSEIQKPLAVVIIGGLFSSTALTLLVLPTLYRWLYR 1059
MT+L LG++P+ +S+G GS Q + + ++GG+ S+T L + +P + + R
Sbjct: 976 MTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRR 1030



Score = 113 bits (284), Expect = 2e-27
Identities = 81/544 (14%), Positives = 185/544 (34%), Gaps = 61/544 (11%)

Query: 10 IKNRLLVVLALLAMIVASVVMLPKLNLDAFPDVTNVQVTIN-TAAEGLAAEEVEKLI--- 65
+ + +L ++ VV+ +L P+ G E +K++
Sbjct: 534 LGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQV 593

Query: 66 -SYPVESAMYALPAVTEVRSLSRTGLS----IVTVVFAEGTDIYFARQQVFEQLQAAREM 120
Y +++ + +V V S +G + + V + + A+
Sbjct: 594 TDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKME 653

Query: 121 ---IPSGVGVPEIGPNTSGLGQIYQYILRAEPNSGIDAAELRSLNDYLVKLIMMPVGGVT 177
I G +P P LG + +G+ L + L+ + +
Sbjct: 654 LGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLV 713

Query: 178 EV-LSFGGDVRQYQVQVDPNKLRAYGLSMAQVTEALES--NNRNAGGWFMDQGQEQLVVR 234
V + D Q++++VD K +A G+S++ + + + + + ++L V+
Sbjct: 714 SVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQ 773

Query: 235 GYGMLPAGEEGLAAIAQIPLTEDKGTPVRVGDIAQVDFGSEIRVGAVTMTRRDEAGNVQN 294
+ ++ + G V + G+ + R + +++
Sbjct: 774 AD---AKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVY----GSPRLERYNGLPSMEI 826

Query: 295 LGEVVAGVVLKRMGANTKATIDDIGARVSLIEQALPDGVSFEVFYDQAELVDKAVTTVRD 354
GE G D A + + LP G+ ++ + + +
Sbjct: 827 QGEAAPGTSS-----------GDAMALMENLASKLPAGIGYD-WTGMSYQERLSGNQAPA 874

Query: 355 ALLMAFVFIVVILALFLVNIRATLLVLLSIPVSIGLALMVMSYYGLSANLMSLGGLAVAI 414
+ ++FV + + LA + + V+L +P+ I L+ + + ++ + GL I
Sbjct: 875 LVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTI 934

Query: 415 GMLVDGSVVMVENIFKHLTQPDRRHLLEARTRADGEADPYHSDEDGGQQANMAVRIMLAA 474
G+ ++++VE + L+E + EA ++A
Sbjct: 935 GLSAKNAILIVEFA---------KDLMEKEGKGVVEA------------------TLMAV 967

Query: 475 KEVCSPIFFATAIIIVVFAPLFALEGVEGKLFQPMAVSIILAMISALLVALIAVPALAVY 534
+ PI + I+ PL G + + ++ M+SA L+A+ VP V
Sbjct: 968 RMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVV 1027

Query: 535 LFKR 538
+ +
Sbjct: 1028 IRRC 1031



Score = 102 bits (257), Expect = 2e-24
Identities = 89/515 (17%), Positives = 190/515 (36%), Gaps = 36/515 (6%)

Query: 565 RPKVVMLSALLMFALSLLLLPRLGTEFVPELEEGTINLRVTLAPTASLGT-SLAVAPKLE 623
RP + A+++ L + +L P + +++ P A T V +E
Sbjct: 8 RPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSAN-YPGADAQTVQDTVTQVIE 66

Query: 624 AILLEFPEVEYALSRIGAPELGGDPEPVSNIEVYIGLKPISEWQSASSRLELQRLMEEKL 683
+ + Y S + ++ + + + ++ A Q ++ KL
Sbjct: 67 QNMNGIDNLMYMSST---------SDSAGSVTITLTFQSGTDPDIA------QVQVQNKL 111

Query: 684 SVFPGLLLTFSQPIATRVDELLSGVKAQLAIKIFGPDLAVLSERGQALT---DLVAKIPG 740
+ LL Q V++ S P + D ++++ G
Sbjct: 112 QLATPLLPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNG 171

Query: 741 AVDVSLEQVSGEAQLVVRPKRELLARYGISVDQVMSLVSQGIGGASAGQVIDGNA----R 796
DV L + + + +LL +Y ++ V++ + +AGQ+ A +
Sbjct: 172 VGDVQL--FGAQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQ 229

Query: 797 YDINVRLAAEFRTSPDAIKDLLLSGTNGATVRLGEVASVEVEMAPPNIR-RDDVQRRVVV 855
+ ++ F+ + K L ++G+ VRL +VA VE+ N+ R + + +
Sbjct: 230 LNASIIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGL 289

Query: 856 QANVA-GRDMGSVVKDIYALVP--QADLPAGYTVIIGGQYENQQRAQQKLMLVVP---IS 909
+A G + K I A + Q P G V+ Y+ Q + VV +
Sbjct: 290 GIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLY--PYDTTPFVQLSIHEVVKTLFEA 347

Query: 910 IALIALLLYFSFGSFKQVLLIMANVPLALIGGIVALYVSGTYLSVPSSIGFITLFGVAVL 969
I L+ L++Y + + L+ VP+ L+G L G ++ + G + G+ V
Sbjct: 348 IMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVD 407

Query: 970 NGVVLVDSINQRRQS-GEALYDCVYEGTVGRLRPVLMTALTSALGLIPILLSSGVGSEIQ 1028
+ +V+V+++ + + + ++ A+ + IP+ G I
Sbjct: 408 DAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIY 467

Query: 1029 KPLAVVIIGGLFSSTALTLLVLPTLYRWLYRGDKR 1063
+ ++ I+ + S + L++ P L L +
Sbjct: 468 RQFSITIVSAMALSVLVALILTPALCATLLKPVSA 502


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3974  RTXTOXIND     53     1e-09    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 52.9 bits (127), Expect = 1e-09
Identities = 36/182 (19%), Positives = 64/182 (35%), Gaps = 22/182 (12%)

Query: 126 RATATLVVDRDRTATLAPQLDARVLARHVVPGQEVKKGEPLLTLGGAAVAQAQADYINAA 185
R V++ R + L + +A+H V QE + +N
Sbjct: 225 RYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQE----------------NKYVEAVNEL 268

Query: 186 AEWSRVKRMSEGAVSVSRRMQAQVDAELKRAILEAIKMTPTQIRALE----STPEAIGSY 241
+ E + ++ V K IL+ ++ T I L E +
Sbjct: 269 RVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQAS 328

Query: 242 QLLAPIDGRVQQ-DIAMLGQVFSAGTPLMQLT-DESYLWVEAQLTPTQTAHITVGSAALV 299
+ AP+ +VQQ + G V + LM + ++ L V A + I VG A++
Sbjct: 329 VIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAII 388

Query: 300 QV 301
+V
Sbjct: 389 KV 390



Score = 41.4 bits (97), Expect = 5e-06
Identities = 26/148 (17%), Positives = 55/148 (37%), Gaps = 5/148 (3%)

Query: 118 IANLNLDIRATATLVVDRDRTATLAPQLDARVLARHVVPGQEVKKGEPLLTLGG----AA 173
+ + + A L R+ + P ++ V V G+ V+KG+ LL L A
Sbjct: 77 LGQVEIVATANGKLTHS-GRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEAD 135

Query: 174 VAQAQADYINAAAEWSRVKRMSEGAVSVSRRMQAQVDAELKRAILEAIKMTPTQIRALES 233
+ Q+ + A E +R + +S D + + E + T + +
Sbjct: 136 TLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQF 195

Query: 234 TPEAIGSYQLLAPIDGRVQQDIAMLGQV 261
+ YQ +D + + + +L ++
Sbjct: 196 STWQNQKYQKELNLDKKRAERLTVLARI 223


90  Sbal195_3981  Sbal195_3987  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_3981  0       18       4.080504   acetyl-CoA carboxylase, biotin carboxyl carrier
Sbal195_3982  0       15       2.292194   short-chain dehydrogenase/reductase SDR
Sbal195_3983  0       13       1.243756   major facilitator superfamily transporter
Sbal195_3984  0       14       0.530793   hypothetical protein
Sbal195_3985  0       13       0.252306   UbiD family decarboxylase
Sbal195_3986  -1      11       0.900115   FMN reductase
Sbal195_3987  -1      11       0.215786   amidohydrolase 3
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3981  RTXTOXIND     28     0.015    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 27.9 bits (62), Expect = 0.015
Identities = 8/29 (27%), Positives = 13/29 (44%)

Query: 120 IQAERDGVVSAIWAKDGDEVAFDQPLFTL 148
I+ + +V I K+G+ V L L
Sbjct: 99 IKPIENSIVKEIIVKEGESVRKGDVLLKL 127


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3982  DHBDHDRGNASE  51     6e-10    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 51.2 bits (122), Expect = 6e-10
Identities = 41/183 (22%), Positives = 80/183 (43%), Gaps = 5/183 (2%)

Query: 2 ILITGASSGLGAALASLYAKENEPLTLTGRNAERLQTVANALTPFSNKPIAAITADLASE 61
ITGA+ G+G A+A A + + N E+L+ V ++L + A AD+
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHA-EAFPADVRDS 69

Query: 62 SSLEALFDGL---TQAPKTVIHCAGSGYFGAIETQGASDIHSLLNNNVTSTILLVRELVK 118
++++ + + +++ AG G I + + + + N T R + K
Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSK 129

Query: 119 RYKDQ-AVTVVVVMSTAALAAKAGESTYCAAKWAVRGFIESVRLELKQSPMKLIAVYPGG 177
D+ + ++V V S A + + Y ++K A F + + LEL + ++ V PG
Sbjct: 130 YMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGS 189

Query: 178 MDT 180
+T
Sbjct: 190 TET 192


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3983  TCRTETA       54     3e-10    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 54.4 bits (131), Expect = 3e-10
Identities = 62/338 (18%), Positives = 113/338 (33%), Gaps = 23/338 (6%)

Query: 40 MTLVPYIASDLGVD---VAHVSYAISAYALGVVVGSPIIMVLAVRVRRRTLLIALAALMA 96
M ++P + DL AH ++ YAL +P++ L+ R RR +L+ A A
Sbjct: 25 MPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAA 84

Query: 97 VANGLSALAPSLNWLIFFRFLSGLPHGAYFGVAMLLAASLVPPEMKARAVSRVIIGLTLA 156
V + A AP L L R ++G+ GA VA A + + +AR +
Sbjct: 85 VDYAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAYIADITDGDERARHFGFMSACFGFG 143

Query: 157 TIIGVPFATWMGQTVGWRSGIGIVAILATITAVMVYFLAPDQAVAADASPRKELQ----- 211
+ G MG + A L + + FL P+ + + P +
Sbjct: 144 MVAGPVLGGLMGG-FSPHAPFFAAAALNGLNFLTGCFLLPE-SHKGERRPLRREALNPLA 201

Query: 212 ------TLKNREVWLTLGIAAIGFGGIFCVYTYLAETLIQVTQVEPFKIPIMMAVFGI-G 264
+ + + G + + I I +A FGI
Sbjct: 202 SFRWARGMTVVAALMAVFFIMQLVGQVPA--ALWVIFGEDRFHWDATTIGISLAAFGILH 259

Query: 265 ATLGTLVCGWAADK-SALAAAFWSLVLSTVVLAIYPSLTGHYWALMPV-VFFVGCGLGLA 322
+ ++ G A + A ++ I + W P+ V G+G+
Sbjct: 260 SLAQAMITGPVAARLGERRALMLGMIADGTGY-ILLAFATRGWMAFPIMVLLASGGIGMP 318

Query: 323 TIVQARLMDVAPDGQAMTGALVQCAFNLANAIGPWVGS 360
+ V + Q + +L + +GP + +
Sbjct: 319 ALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFT 356


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_3987  UREASE        45     6e-07    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 45.1 bits (107), Expect = 6e-07
Identities = 18/33 (54%), Positives = 22/33 (66%)

Query: 499 IAAYTINPANALGISDITGSIVLGKSADFVVLE 531
IA YTINPA A G+S GS+ +GK AD V+
Sbjct: 406 IAKYTINPAIAHGLSHEIGSLEVGKRADLVLWN 438



Score = 38.6 bits (90), Expect = 6e-05
Identities = 21/75 (28%), Positives = 34/75 (45%), Gaps = 8/75 (10%)

Query: 27 NDELADTLLTNTHVYGHDQ--ATSLAIKDGKIVYIGNSIN--AMDHVS----NQTKVIDL 78
DT++TN + H + +KDG+I IG + N V+ T+VI
Sbjct: 64 EGGAVDTVITNALILDHWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAG 123

Query: 79 KGHYLLPGFIDNHNH 93
+G + G +D+H H
Sbjct: 124 EGKIVTAGGMDSHIH 138


91  Sbal195_4096  Sbal195_4102  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_4096  9       31       3.782683   hypothetical protein
Sbal195_4097  9       31       3.741876   HemY domain-containing protein
Sbal195_4098  9       32       3.629986   outer membrane adhesin-like protein
Sbal195_4099  -1      14       -2.520691  type I secretion system ATPase
Sbal195_4100  0       13       -3.087276  HlyD family type I secretion membrane fusion
Sbal195_4101  0       11       -2.391841  TolC family type I secretion outer membrane
Sbal195_4102  0       11       -2.641776  OmpA/MotB domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_4096  RTXTOXIND     29     0.036    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 29.0 bits (65), Expect = 0.036
Identities = 13/77 (16%), Positives = 32/77 (41%), Gaps = 6/77 (7%)

Query: 81 FMLYQQMQQQLLVQDAKNIALQDQLQQALLQPNQRIGQLEQQQLNDAKT-----YQELTK 135
F +Q + Q + K A + A + + + ++E+ +L+D +
Sbjct: 195 FSTWQNQKYQKELNLDKKRA-ERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHA 253

Query: 136 LAEDQNQLQDRINKLAQ 152
+ E +N+ + +N+L
Sbjct: 254 VLEQENKYVEAVNELRV 270


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_4098  CABNDNGRPT    79     2e-16    NodO calcium binding signature.
		>CABNDNGRPT#NodO calcium binding signature.

Length = 479

Score = 79.2 bits (195), Expect = 2e-16
Identities = 40/172 (23%), Positives = 64/172 (37%), Gaps = 6/172 (3%)

Query: 6410 GSDTINGGNGDDILFGDAIN--FNGISGQGYVAIKDYVADQLGIAAVTDAQVHRYITEHA 6467
+ T G+ + + + + A + ++ I +
Sbjct: 260 ANMTTRTGDSVYGFNSNTDRDFYTATDSSKALIFSVWDAGGTDTFDFSGYSNNQRINLNE 319

Query: 6468 SDFDQSGASDKADVLIGGQGNDILYGQGGNDQLYGGNGNDLIFGGAGNDTIIGGLGNDKL 6527
F G + G + G GND L G + ++++ GGAGND + GG G D L
Sbjct: 320 GSFSDVGGLKGNVSIAHGVTIENAIGGSGNDILVGNSADNILQGGAGNDVLYGGAGADTL 379

Query: 6528 TGGTGADTFVWQAG----ESGTDHITDFNIHEDKLDLRDLLQGENTNTLDSY 6575
GG G DTFV+ +G + D I DF DK+DL + +
Sbjct: 380 YGGAGRDTFVYGSGQDSTVAAYDWIADFQKGIDKIDLSAFRNEGQLSFVQDQ 431



Score = 48.4 bits (115), Expect = 8e-07
Identities = 32/89 (35%), Positives = 41/89 (46%), Gaps = 3/89 (3%)

Query: 5831 GDFTTAPFNTGTRTIDNTSGQDQLLGTGGNDHLVSANGGGDLLYGMDGDDILVGSDAVQG 5890
F + ++ R N + G GN + + G G+DILVG+ A
Sbjct: 302 DTFDFSGYSNNQRINLNEGSFSDVGGLKGNVSIAHGVTI-ENAIGGSGNDILVGNSA--D 358

Query: 5891 DSLYGGTGNDVLVAGLGNDGLYGGAGTDI 5919
+ L GG GNDVL G G D LYGGAG D
Sbjct: 359 NILQGGAGNDVLYGGAGADTLYGGAGRDT 387



Score = 44.2 bits (104), Expect = 2e-05
Identities = 31/120 (25%), Positives = 46/120 (38%), Gaps = 3/120 (2%)

Query: 5798 ADKPVVNVILTDNGIPLYSNFKTSGITTEQFRTGDFTTAPFNTGTRTIDNTSGQDQLLGT 5857
+ K ++ + G + S G F+ G +I + + +G
Sbjct: 287 SSKALIFSVWDAGGTDTFDFSGYSNNQRINLNEGSFSDVGGLKGNVSIAHGVTIENAIGG 346

Query: 5858 GGNDHLVSANGGGDLLYGMDGDDILVGSDAVQGDSLYGGTGNDVLVAGLGNDGLYGGAGT 5917
GND LV N ++L G G+D+L G D+LYGG G D V G G D
Sbjct: 347 SGNDILV-GNSADNILQGGAGNDVLYGGAG--ADTLYGGAGRDTFVYGSGQDSTVAAYDW 403



Score = 34.6 bits (79), Expect = 0.014
Identities = 30/135 (22%), Positives = 45/135 (33%), Gaps = 25/135 (18%)

Query: 5825 TEQFRTGDFTTAPFNTGTRTIDNTSGQDQLLGTGGNDHLVSANGGGDLLYGMDGDDILVG 5884
T G + AP I G + TG + + ++N D D L+
Sbjct: 234 TGADYNGHYGGAPMIDDIAAIQRLYGANMTTRTGDSVYGFNSNTDRDFYTATDSSKALIF 293

Query: 5885 SDAVQG-----------------------DSLYGGTGNDVLVAGLGNDGLYGGAGTDIAV 5921
S G + G GN + G+ + GG+G DI
Sbjct: 294 SVWDAGGTDTFDFSGYSNNQRINLNEGSFSDVGGLKGNVSIAHGVTIENAIGGSGNDI-- 351

Query: 5922 LLGNRADYIIEKSTG 5936
L+GN AD I++ G
Sbjct: 352 LVGNSADNILQGGAG 366


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_4100  RTXTOXIND     310    e-103    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 310 bits (796), Expect = e-103
Identities = 86/431 (19%), Positives = 193/431 (44%), Gaps = 11/431 (2%)

Query: 29 RLIIWALAAMVVCFLLWAGFAKLDKVTTGTGKVIPSSQVQVIQSLDGGIMQELYVQEGEM 88
RL+ + + +V + + +++ V T GK+ S + + I+ ++ I++E+ V+EGE
Sbjct: 58 RLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGES 117

Query: 89 VTKGQPLVRIDDTRFRSDYAQQEQEVFGLKTNAIRMRAELDSILISDMTSDWREQVLITK 148
V KG L+++ +D + + + + R + SI E + +
Sbjct: 118 VRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSI----------ELNKLPE 167

Query: 149 KALVFPENIIAAEPALVKRQQEEYNGRLDNLSNQLEILVRQIQQRQQEIDDLASKTTTLT 208
L V R + NQ + +++ E + ++
Sbjct: 168 LKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYE 227

Query: 209 TSMQLISRELELTRPLAKKGIVPEVELLKLERTVNDLQGELNSMRLLRPKVKAAMDEAIL 268
++ L+ L K + + +L+ E + EL + ++++ + A
Sbjct: 228 NLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKE 287

Query: 269 KRREAVFVYAADLRAQLNETQTRLSRMNEAQVGAQDKVSKAIITSPVNGTIKTTHINTLG 328
+ + ++ ++ +L +T + + +++ ++I +PV+ ++ ++T G
Sbjct: 288 EYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEG 347

Query: 329 GVVQPGVDIIEIVPSEDQLLIETKILPKDIAFLHPGLPAVVKITAYDFTRYGGLKGTVEH 388
GVV ++ IVP +D L + + KDI F++ G A++K+ A+ +TRYG L G V++
Sbjct: 348 GVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKN 407

Query: 389 ISADTSQDEEGNSYYLIRVRTAESSLTKNDGTQMPIIPGMLTSVDVITGQRSILEYILNP 448
I+ D +D+ + + + E+ L+ +P+ GM + ++ TG RS++ Y+L+P
Sbjct: 408 INLDAIEDQRLGLVFNVIISIEENCLST-GNKNIPLSSGMAVTAEIKTGMRSVISYLLSP 466

Query: 449 ILRAKDTALRE 459
+ + +LRE
Sbjct: 467 LEESVTESLRE 477


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_4102  OMPADOMAIN    86     3e-22    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 86.1 bits (213), Expect = 3e-22
Identities = 33/118 (27%), Positives = 53/118 (44%), Gaps = 12/118 (10%)

Query: 77 NILFPNDSAYIAPEYYPQIEEVAIFLRQY--PTTKVTIEGHTSRTGTDERNAVLSQDRAN 134
++LF + A + PE ++++ L V + G+T R G+D N LS+ RA
Sbjct: 220 DVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQ 279

Query: 135 AVTAVLAERFSIDRSRLTAIGYGSSRPVVLEQTPDAEIR---------NRRVVAEVTG 183
+V L + I +++A G G S PV + + R +RRV EV G
Sbjct: 280 SVVDYLISK-GIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEVKG 336


92  Sbal195_4332  Sbal195_4346  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_4332  -2      14       -0.086557  hypothetical protein
Sbal195_4333  -1      14       0.340164   sulfatase
Sbal195_4334  -1      11       0.910086   RNA-binding S1 domain-containing protein
Sbal195_4335  0       12       0.861033   transcription elongation factor GreB
Sbal195_4336  0       11       0.836444   response regulator receiver modulated
Sbal195_4337  0       10       1.402638   multi-sensor hybrid histidine kinase
Sbal195_4338  1       18       3.795642   osmolarity response regulator
Sbal195_4339  1       17       2.958333   osmolarity sensor protein
Sbal195_4340  2       17       2.529200   methyl-accepting chemotaxis sensory transducer
Sbal195_4341  0       17       2.595422   redoxin domain-containing protein
Sbal195_4342  0       19       2.907412   hypothetical protein
Sbal195_4343  -1      13       1.843784   peptidase
Sbal195_4344  -1      14       1.559600   two component transcriptional regulator
Sbal195_4345  -1      12       1.629674   integral membrane sensor signal transduction
Sbal195_4346  0       13       1.536986   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_4332  IGASERPTASE   28     0.025    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 28.1 bits (62), Expect = 0.025
Identities = 21/125 (16%), Positives = 46/125 (36%), Gaps = 8/125 (6%)

Query: 36 QAATKGHEERAFNPQNERTADQTQQQTKTLENNQQQVQEKQQQQQSSQQQSQQQQEKKAP 95
+ A + N Q A Q ++T E + +E ++ + + + ++ ++ P
Sbjct: 1067 EVAKEAKSNVKANTQTNEVA---QSGSETKETQTTETKETATVEKEEKAKVETEKTQEVP 1123

Query: 96 LVVAERVLPKTLKIAARGQAALQRKD-----IRLKVSQGAANYASSNAKANTSSAARQSL 150
V ++ + + QA R++ I+ SQ + TSS Q +
Sbjct: 1124 KVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPV 1183

Query: 151 QGEST 155
+T
Sbjct: 1184 TESTT 1188


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_4336  HTHFIS        80     6e-19    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 80.3 bits (198), Expect = 6e-19
Identities = 26/123 (21%), Positives = 50/123 (40%), Gaps = 3/123 (2%)

Query: 14 KGKILIVDDQPLNIKILHQLFN-EEYELFMATNGEQAIAICQKVQPDLVLLDIEMPGMSG 72
IL+ DD +L+Q + Y++ + +N DLV+ D+ MP +
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 73 FDVCQHIKADPETATIGVIFVTAHFDEVQEVKGFQLGAVDFIHKPINPIITTARVKNQFT 132
FD+ IK + V+ ++A + +K + GA D++ KP + +
Sbjct: 63 FDLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 133 LKR 135
+
Sbjct: 121 EPK 123


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Sbal195_4337  HTHFIS  73     6e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 72.6 bits (178), Expect = 6e-15
Identities = 33/135 (24%), Positives = 61/135 (45%), Gaps = 4/135 (2%)

Query: 1287 TILVVEDNQLNRQVIDELLSYEGANVVLAEGGMEGVSLVLESGDLFDIVIMDMQMPDMDG 1346
TILV +D+ R V+++ LS G +V + + + D+V+ D+ MPD +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWI--AAGDGDLVVTDVVMPDENA 62

Query: 1347 LEATRRIRADERFAALPILAMTANASQSDRQECLNAGMNDHVGKPIDMPLLLPSILRLVG 1406
+ RI+ + LP+L M+A + + G D++ KP D+ L+ I R +
Sbjct: 63 FDLLPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 1407 REDMTSAEFEGHWPE 1421
++ E +
Sbjct: 121 EPKRRPSKLEDDSQD 135



Score = 61.0 bits (148), Expect = 2e-11
Identities = 19/93 (20%), Positives = 38/93 (40%), Gaps = 8/93 (8%)

Query: 1138 LSGYRVLVVDDNELTTEILEKILTGFGCEVETALGGYAALAKVKQSHEKSTPFDVVLMDW 1197
++G +LV DD+ +L + L+ G +V + D+V+ D
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD-----GDLVVTDV 55

Query: 1198 RMPDLDGLQTAEMLRNSDANNQTPLVVMLTAYG 1230
MPD + ++ + + P++VM +A
Sbjct: 56 VMPDENAFDLLPRIKKARPD--LPVLVM-SAQN 85


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Sbal195_4338  HTHFIS  98     9e-26    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 98.0 bits (244), Expect = 9e-26
Identities = 38/129 (29%), Positives = 67/129 (51%)

Query: 6 SKILVVDDDMRLRALLERYLMEQGYQVRSAANAEQMDRLLERENFHLLVLDLMLPGEDGL 65
+ ILV DDD +R +L + L GY VR +NA + R + + L+V D+++P E+
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 66 SICRRLRQQGNPIPIVMLTAKGDEVDRIIGLELGADDYLPKPFNPRELLARIKAVMRRQT 125
+ R+++ +P+++++A+ + I E GA DYLPKPF+ EL+ I +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 126 QDVPGAPAQ 134
+
Sbjct: 124 RRPSKLEDD 132


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Sbal195_4339  PF06580  49     1e-08    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 49.5 bits (118), Expect = 1e-08
Identities = 26/179 (14%), Positives = 56/179 (31%), Gaps = 28/179 (15%)

Query: 261 IVNDIEDMDAIISQFIAYIRQDQETSRE----LGQINKLIQDVAQAEANRAGEIEVVLTD 316
I+ D +++ +R S L ++ Q + + +
Sbjct: 186 ILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQ 245

Query: 317 CPEAQFQAIAIKRVLSNLVENAFRYG------SGWIRISSQFDGKRIGFTVEDNGPGIDE 370
A ++ LVEN ++G G I + D + VE+ G +
Sbjct: 246 INPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALK 305

Query: 371 SQITKLFQPFTQGDIARGSVGSGLGLA-IIKRIIDRHQGQVTLS-NRAEGGLRAQVWLP 427
+ +G GL + +R+ + + + + +G + A V +P
Sbjct: 306 NTKE----------------STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Sbal195_4344  HTHFIS  83     2e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 82.6 bits (204), Expect = 2e-20
Identities = 30/125 (24%), Positives = 61/125 (48%)

Query: 2 RLLLVEDDLELQANLKQHLLDAHYSIDVASDGEEGLFQALEYNYDAAIIDVGLPKLDGIA 61
+L+ +DD ++ L Q L A Y + + S+ + D + DV +P +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 LIRSVREQERDFPILILTARDSWQDKVEGLDAGADDYLTKPFHPQELVARLKALIRRSAG 121
L+ +++ D P+L+++A++++ ++ + GA DYL KPF EL+ + +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 122 KASPL 126
+ S L
Sbjct: 125 RPSKL 129


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Sbal195_4345  PF06580  29     0.031    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.4 bits (66), Expect = 0.031
Identities = 15/80 (18%), Positives = 24/80 (30%), Gaps = 15/80 (18%)

Query: 365 KAAKSTVKLTVTGDAYQLLICIEDDGPGISEALQNQIFERGIRADSYHQGNGIGLAIVRD 424
+ L T D + + +E+ G + + G GL VR+
Sbjct: 275 LPQGGKILLKGTKDNGTVTLEVENTGSLA--------------LKNTKESTGTGLQNVRE 320

Query: 425 -LVDSYNGRISVSRSETLGG 443
L Y + SE G
Sbjct: 321 RLQMLYGTEAQIKLSEKQGK 340


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
Sbal195_4346  IGASERPTASE  33     0.003    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 33.1 bits (75), Expect = 0.003
Identities = 23/153 (15%), Positives = 50/153 (32%), Gaps = 13/153 (8%)

Query: 352 ERDNRAPSYQKTQAELKERRSATMVQTPTHKDRNDAQSRPVQSKPMPSKEPQQRQYQTRE 411
E K +++ E+ +T +++ + E Q +T+E
Sbjct: 1035 ETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKE 1094

Query: 412 SQPRNVDQQRTPPQRQENPRTATPRVETPR----PEIRRAEPQRVEQPRQAAPRQREEVR 467
+Q + E A +VET + P++ + EQ P+
Sbjct: 1095 TQT----TETKETATVEKEEKA--KVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPA-- 1146

Query: 468 VRQSEPRQNAQTARSVEHNQGRSTQSQERRHRE 500
R+++P N + +S + + Q +
Sbjct: 1147 -RENDPTVNIKEPQSQTNTTADTEQPAKETSSN 1178



Score = 32.7 bits (74), Expect = 0.004
Identities = 26/145 (17%), Positives = 46/145 (31%), Gaps = 18/145 (12%)

Query: 361 QKTQAELKERRSATMVQTPTHKDRNDAQSRPVQSKPMP-------SKEPQQRQ-YQTRES 412
+K + E ++ + V + + +++ Q++P KEPQ +
Sbjct: 1110 EKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTE 1169

Query: 413 QPRNVDQQRTPPQRQENPRTATPRVETPRPEIRRAEPQRVEQPRQAAPRQREEVRVRQSE 472
QP E+ T PE P P E +
Sbjct: 1170 QPAKETSSNVEQPVTESTTVNTGNSVVENPE--------NTTPATTQPTVNSESSNKPKN 1221

Query: 473 PRQNAQTARSVEHNQGRSTQSQERR 497
++ ++ RSV HN +T S R
Sbjct: 1222 --RHRRSVRSVPHNVEPATTSSNDR 1244
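
The per-alignment statistics in this report follow the usual BLAST text layout: a `Score = ... bits (...), Expect = ...` line followed by an `Identities = ...` line. As a rough illustration only, the Python sketch below extracts those numbers from a few lines copied from this report and compares each E-value with the 0.05 RPSBlast value listed under the input parameters. The names `parse_evalue` and `parse_hsps`, the dictionary keys, and the `<=` comparison are hypothetical and not part of PredictBias; reading `e-157` as `1e-157` is also an assumption.

```python
import re

# Matches e.g. "Score = 28.1 bits (62), Expect = 0.025"
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
# Matches e.g. "Identities = 21/125 (16%), Positives = 46/125 (36%), Gaps = 8/125 (6%)"
# The Gaps field is missing when an alignment has no gaps.
IDENT_RE = re.compile(
    r"Identities\s*=\s*(\d+)/(\d+)\s*\(\d+%\),\s*"
    r"Positives\s*=\s*(\d+)/\d+\s*\(\d+%\)"
    r"(?:,\s*Gaps\s*=\s*(\d+)/\d+\s*\(\d+%\))?"
)

# Value taken from the report's input parameters; using it as an
# inclusive upper bound is an assumption of this sketch.
E_VALUE_CUTOFF = 0.05


def parse_evalue(text):
    """Convert an Expect field to float; 'e-157' is assumed to mean 1e-157."""
    return float("1" + text if text.startswith("e") else text)


def parse_hsps(report):
    """Return one dict per Score/Identities line pair found in the text."""
    hsps = []
    for line in report.splitlines():
        score = SCORE_RE.search(line)
        if score:
            hsps.append({
                "bits": float(score.group(1)),
                "raw_score": int(score.group(2)),
                "evalue": parse_evalue(score.group(3)),
            })
            continue
        ident = IDENT_RE.search(line)
        if ident and hsps:
            matches, length = int(ident.group(1)), int(ident.group(2))
            hsps[-1].update({
                "identity": matches / length,
                "positives": int(ident.group(3)) / length,
                "gaps": int(ident.group(4) or 0),
            })
    return hsps


# Statistics lines copied from alignments in this report.
SAMPLE = """\
Score = 28.1 bits (62), Expect = 0.025
Identities = 21/125 (16%), Positives = 46/125 (36%), Gaps = 8/125 (6%)
Score = 98.0 bits (244), Expect = 9e-26
Identities = 38/129 (29%), Positives = 67/129 (51%)
Score = 449 bits (1157), Expect = e-157
Identities = 156/476 (32%), Positives = 239/476 (50%), Gaps = 43/476 (9%)
"""

hsps = parse_hsps(SAMPLE)
for hsp in hsps:
    passes = hsp["evalue"] <= E_VALUE_CUTOFF
    print(f'{hsp["bits"]:7.1f} bits  E={hsp["evalue"]:.1e}  '
          f'identity={hsp["identity"]:.1%}  gaps={hsp["gaps"]}  passes={passes}')
```

Weak hits such as the 0.025 and 0.031 entries sit close to the cutoff and cover short, low-identity stretches, so a parsed view like this can help when deciding which reported similarities deserve a closer look.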


93  Sbal195_4384  Sbal195_4388  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_4384  -1      18       0.440487   hydrophobe/amphiphile efflux-1 (HAE1) family
Sbal195_4385  -1      13       -1.780867  RND family efflux transporter MFP subunit
Sbal195_4386  -2      12       -2.570072  hypothetical protein
Sbal195_4387  -1      12       -0.496202  hypothetical protein
Sbal195_4388  -3      13       0.191995   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_4384  ACRIFLAVINRP  1244   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1244 bits (3221), Expect = 0.0
Identities = 655/1032 (63%), Positives = 807/1032 (78%), Gaps = 4/1032 (0%)

Query: 1 MARFFIDRPIFAWVIALIIMLAGILSIRSLPVSQYPNIAPPTVVISANYPGASAKIVEDS 60
MA FFI RPIFAWV+A+I+M+AG L+I LPV+QYP IAPP V +SANYPGA A+ V+D+
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTQVIEQRMTGIDHLRYISSTSDSFGNASITLTFNAEADPDIAQVQVQNKLQGAMTLLPQ 120
VTQVIEQ M GID+L Y+SSTSDS G+ +ITLTF + DPDIAQVQVQNKLQ A LLPQ
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 121 EVQSQGVNVNKSSSGFLMVLGFVSTDGSLDKGDIADYVGANIQDPMSRVPGVGEIQLFGA 180
EVQ QG++V KSSS +LMV GFVS + + DI+DYV +N++D +SR+ GVG++QLFGA
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 181 QYAMRIWLDPLKLTQYNLTSLDVIAAIRSQNAQVSAGQLGGAPSVAGQELNATVSAQSRL 240
QYAMRIWLD L +Y LT +DVI ++ QN Q++AGQLGG P++ GQ+LNA++ AQ+R
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 241 QTAEEFRKIILKSDISGANVFLGDVARVELGSESYAVVSLYNGQPATGLAIKLATGANAL 300
+ EEF K+ L+ + G+ V L DVARVELG E+Y V++ NG+PA GL IKLATGANAL
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 301 DTASAVREKIAEMKPFFPQGLEVVYPYDTTPFVEQSIEGVVHTLLEAIVLVFVIMYLFLQ 360
DTA A++ K+AE++PFFPQG++V+YPYDTTPFV+ SI VV TL EAI+LVF++MYLFLQ
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 361 NFRATLIPTIAVPVVLLGTFAILSMAGFSINTLTMFAMVLAIGLLVDDAIVVVENVERVM 420
N RATLIPTIAVPVVLLGTFAIL+ G+SINTLTMF MVLAIGLLVDDAIVVVENVERVM
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 421 SEEGLSPLEATRKSMDQITGALVGIGLTLSAVFVPMAFMSGSTGVIYRQFSITIVSAMAL 480
E+ L P EAT KSM QI GALVGI + LSAVF+PMAF GSTG IYRQFSITIVSAMAL
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 481 SVLVALILTPALCATMLKPVKKGHGHIETGFFGWFNRTFDKLTNRYESSVAGIIKRSFRV 540
SVLVALILTPALCAT+LKPV H + GFFGWFN TFD N Y +SV I+ + R
Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540

Query: 541 MTIYVVLVIAVGWIFIRMPTAFLPDEDQGILFTQAILPTNSTQESTLKVLEKVSDHFMA- 599
+ IY ++V + +F+R+P++FLP+EDQG+ T LP +TQE T KVL++V+D+++
Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600

Query: 600 -EEGVRSVFSVAGFSFAGQGQNMGIAFVGLKDWSEREAPGMDVKSIAGRAMGVFGQMKDA 658
+ V SVF+V GFSF+GQ QN G+AFV LK W ER +++ RA G+++D
Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660

Query: 659 LVFAFVPPAVIELGTANGFDMYLQDKNGQGHEKLVAARNQLLGMAAQNP-NLVGVRPNGQ 717
V F PA++ELGTA GFD L D+ G GH+ L ARNQLLGMAAQ+P +LV VRPNG
Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720

Query: 718 EDAPIYQLHVDHAKLSALGIEITNVNSVLATAWGGSYVNDFIDRGRVKKVYVQGDAQYRM 777
ED ++L VD K ALG+ ++++N ++TA GG+YVNDFIDRGRVKK+YVQ DA++RM
Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780

Query: 778 QPGDLDTWYVRNNKGDMVPFSAFATGTWEYGSPRLERFNGLPSMNIQGATAPGFSTGAAM 837
P D+D YVR+ G+MVPFSAF T W YGSPRLER+NGLPSM IQG APG S+G AM
Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840

Query: 838 EIMEDLAKQLPPGFGVEWNGLSYEERLSGNQAPALYALSILVVFLVLAALYESWSVPFAV 897
+ME+LA +LP G G +W G+SY+ERLSGNQAPAL A+S +VVFL LAALYESWS+P +V
Sbjct: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900

Query: 898 VLVVPLGIIGALIAMNGRGLPNDVFFQVGLLTTVGLATKNAILIVEFAKEFYEK-GSGLV 956
+LVVPLGI+G L+A NDV+F VGLLTT+GL+ KNAILIVEFAK+ EK G G+V
Sbjct: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960

Query: 957 EATLHAVRVRLRPILMTSLAFGLGVVPLAISSGVGSGSQNAIGTAVLGGMMSSTFLGIFF 1016
EATL AVR+RLRPILMTSLAF LGV+PLAIS+G GSG+QNA+G V+GGM+S+T L IFF
Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020

Query: 1017 VPLFFVIVERIF 1028
VP+FFV++ R F
Sbjct: 1021 VPVFFVVIRRCF 1032



Score = 78.3 bits (193), Expect = 1e-16
Identities = 75/523 (14%), Positives = 170/523 (32%), Gaps = 40/523 (7%)

Query: 534 IKRSFRVMTIYVVLVIAVGWIFIRMPTAFLPDEDQGILFTQAILPTNSTQESTLKVLEKV 593
I+R + ++L++A +++P A P + A P Q V + +
Sbjct: 6 IRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQVI 65

Query: 594 SDHFMAEEGVRSVFSVAGFSFAGQGQNMGIAFVGLKDWSEREAPGMDVKSIAGRAMGVFG 653
+ + + + S S + + + F G D +
Sbjct: 66 EQNMNGIDNLMYMSST---SDSAGSVTITLTF----------QSGTDPDIAQVQVQNKLQ 112

Query: 654 QMKDALVFAFVPPAVIELGTANGFDMYL----QDKNGQGHEKLVAARNQLLGMAAQNPNL 709
L + +++ + M + + + + ++ +
Sbjct: 113 LATPLLPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGV 172

Query: 710 VGVRPNGQEDAPIYQLHVDHAKLSALGIEITNVNSVLATA----WGGSYVNDFIDRGRVK 765
V+ G + A ++ +D L+ + +V + L G G+
Sbjct: 173 GDVQLFGAQYA--MRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQL 230

Query: 766 KVYVQGDAQYRMQPGDLDTWYVRNNK-GDMVPFSAFAT---GTWEYGSPRLERFNGLPSM 821
+ +++ P + +R N G +V A G Y + R NG P+
Sbjct: 231 NASIIAQTRFK-NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNV--IARINGKPAA 287

Query: 822 NIQGATAPGFST----GAAMEIMEDLAKQLPPGFGVEWNGL---SYEERLSGNQAPALYA 874
+ A G + A + +L P G ++ + +LS ++
Sbjct: 288 GLGIKLATGANALDTAKAIKAKLAELQPFFPQG--MKVLYPYDTTPFVQLSIHEVVKTLF 345

Query: 875 LSILVVFLVLAALYESWSVPFAVVLVVPLGIIGALIAMNGRGLPNDVFFQVGLLTTVGLA 934
+I++VFLV+ ++ + VP+ ++G + G + G++ +GL
Sbjct: 346 EAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLL 405

Query: 935 TKNAILIVE-FAKEFYEKGSGLVEATLHAVRVRLRPILMTSLAFGLGVVPLAISSGVGSG 993
+AI++VE + E EAT ++ ++ ++ +P+A G
Sbjct: 406 VDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGA 465

Query: 994 SQNAIGTAVLGGMMSSTFLGIFFVPLFFVIVERIFSKREKKAK 1036
++ M S + + P + + S + K
Sbjct: 466 IYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENK 508


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
Sbal195_4385  RTXTOXIND  41     6e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 41.0 bits (96), Expect = 6e-06
Identities = 19/102 (18%), Positives = 39/102 (38%), Gaps = 10/102 (9%)

Query: 97 ATYKAALVSANADLARANAGLASAKAKAARYQQLVKTNAISKQEFDEAEAAYKEALANVT 156
L + L + + + SAK + QL K + K ++ N+
Sbjct: 262 VEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDK---------LRQTTDNIG 312

Query: 157 VAEAAINTAKINLQYTEVLAPISGRIGKSSV-TAGALVTANQ 197
+ + + Q + + AP+S ++ + V T G +VT +
Sbjct: 313 LLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE 354



Score = 40.6 bits (95), Expect = 8e-06
Identities = 40/211 (18%), Positives = 77/211 (36%), Gaps = 52/211 (24%)

Query: 43 VVAQPQVIQVELPGRSKAFLEAEVRPQVSGIITKRSFV-EGGNVKQGESLYQIDSATYKA 101
A ++ GRSK E++P + I+ K V EG +V++G+ L ++ +
Sbjct: 84 ATANGKLT---HSGRSK-----EIKPIENSIV-KEIIVKEGESVRKGDVLLKLTA----- 129

Query: 102 ALVSANADLARANAGLASAKAKAARYQQLVKTNAISKQEFDEAEAAYKEALANVTVAEAA 161
+ A AD + + L A+ + RYQ L + +I + E + + NV+ E
Sbjct: 130 --LGAEADTLKTQSSLLQARLEQTRYQILSR--SIELNKLPELKLPDEPYFQNVSEEEVL 185

Query: 162 INTAKINLQYTEVLAPISGRIGKSSVTAGALVTANQSQTLATIQQLDPINVDIAQSSAQL 221
T+ I + Q Q +++ + A+
Sbjct: 186 RLTSLI-----------------------------KEQFSTWQNQKYQKELNLDKKRAER 216

Query: 222 LRLKAKL----KQGKLQASENADVQLLLEDG 248
L + A++ +++ S D LL
Sbjct: 217 LTVLARINRYENLSRVEKSRLDDFSSLLHKQ 247


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_4387  SECFTRNLCASE  28     0.003    Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 28.3 bits (63), Expect = 0.003
Identities = 8/34 (23%), Positives = 17/34 (50%)

Query: 1 MVSHRRSKPEIQFNTPLNETQTPTINTSLMTNIA 34
++ ++ N +NET + T+ T + T +A
Sbjct: 234 LIKYKTMPLRDVMNLSVNETLSRTVMTGMTTLLA 267


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_4388  ECOLNEIPORIN  29     0.022    E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 28.6 bits (64), Expect = 0.022
Identities = 21/95 (22%), Positives = 31/95 (32%), Gaps = 11/95 (11%)

Query: 97 FIGKAGEFG--DSGFTYDVMLFSYLYQGASYSNYTEL--WLKVGKQFGRANLQLEVTPTV 152
FIG G FG G V L + + +L V K + V
Sbjct: 97 FIGLKGGFGKLRVGRLNSV-----LKDTGDINPWDSKSDYLGVNKIAEPEARLISVRYDS 151

Query: 153 DDWFGVDGWHGVNYALHPSYNFDNGVKISGSVGYQ 187
++ G+ G V YAL+ + N Y+
Sbjct: 152 PEFAGLSG--SVQYALNDNAGRHNSESYHAGFNYK 184


94  Sbal195_4449  Sbal195_4456  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
Sbal195_4449  0       16       1.348974   ABC transporter-like protein
Sbal195_4450  1       16       0.900133   TonB-dependent receptor plug
Sbal195_4451  -1      11       0.375002   isochorismate synthase
Sbal195_4452  0       9        -0.172659  hypothetical protein
Sbal195_4453  0       11       0.799359   hypothetical protein
Sbal195_4454  0       12       0.966591   N-acetyltransferase GCN5
Sbal195_4455  -1      13       1.456234   integral membrane sensor signal transduction
Sbal195_4456  0       17       2.550339   Fis family two component sigma54 specific
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_4449  CARBMTKINASE  29     0.025    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 29.4 bits (66), Expect = 0.025
Identities = 12/39 (30%), Positives = 17/39 (43%)

Query: 78 PQTQLLRQTVGAEVAFAIENLGIPAEQMLPKVQLALRRV 116
+ Q LR+ E+ E A M PKV A+R +
Sbjct: 247 EKEQWLREVKVEELRKYYEEGHFKAGSMGPKVLAAIRFI 285


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
Sbal195_4450  cloacin  59     6e-11    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 58.6 bits (141), Expect = 6e-11
Identities = 36/100 (36%), Positives = 42/100 (42%), Gaps = 3/100 (3%)

Query: 383 GGTGTGGGTGTGGGTGTGGGTGTGGGTGTGGGTGTGGGTGT---GGGTGTGGGTGTGGGT 439
GG G G TG +G G TG G G G G+G + GGG+G+G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 440 GTGGGTGTGGGTGTGGGTGTGGGTGTGGGTGTGGGTGTGG 479
G GGG G GG GG + G G GG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 58.6 bits (141), Expect = 6e-11
Identities = 36/100 (36%), Positives = 42/100 (42%), Gaps = 3/100 (3%)

Query: 389 GGTGTGGGTGTGGGTGTGGGTGTGGGTGTGGGTGTGGGTGT---GGGTGTGGGTGTGGGT 445
GG G G TG +G G TG G G G G+G + GGG+G+G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 446 GTGGGTGTGGGTGTGGGTGTGGGTGTGGGTGTGGGTGTGG 485
G GGG G GG GG + G G GG
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 57.4 bits (138), Expect = 1e-10
Identities = 33/83 (39%), Positives = 39/83 (46%), Gaps = 3/83 (3%)

Query: 407 GGTGTGGGTGTGGGTGTGGGTGTGGGTGTGGGTGTGGGTGT---GGGTGTGGGTGTGGGT 463
GG G G TG +G G TG G G G G+G + GGG+G+G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 464 GTGGGTGTGGGTGTGGGTGTGGA 486
G GGG G GG GG + A
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVA 85



Score = 55.1 bits (132), Expect = 6e-10
Identities = 35/99 (35%), Positives = 41/99 (41%), Gaps = 5/99 (5%)

Query: 380 GSGGGTGTGGGTGTGGGTGTGGGTGTGGGTGTGGGTGT-----GGGTGTGGGTGTGGGTG 434
G G G TG + +G G G G GGG G G + GGG+G+G G G G G
Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63

Query: 435 TGGGTGTGGGTGTGGGTGTGGGTGTGGGTGTGGGTGTGG 473
GGG G GG GG + G G GG
Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 53.2 bits (127), Expect = 2e-09
Identities = 34/96 (35%), Positives = 38/96 (39%), Gaps = 1/96 (1%)

Query: 372 GSNNGGTIGSGGGTGTGGGTGTGGGTGTGGGTGTGGGTGTGGGTGTGGGTGTGGGTGTGG 431
G N G SG G G G GGG G G + GGG+G+G G G G G GG
Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSEN-NPWGGGSGSGIHWGGGSGHGNGG 66

Query: 432 GTGTGGGTGTGGGTGTGGGTGTGGGTGTGGGTGTGG 467
G G GG GG + G G GG
Sbjct: 67 GNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 52.0 bits (124), Expect = 7e-09
Identities = 33/93 (35%), Positives = 41/93 (44%), Gaps = 1/93 (1%)

Query: 370 NTGSNNGGTIGSGGGTGTGGGTGTGGGTGTGG-GTGTGGGTGTGGGTGTGGGTGTGGGTG 428
NTG+++ +GG TG G G G G+G GGG+G+G G G G G GGG G
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69

Query: 429 TGGGTGTGGGTGTGGGTGTGGGTGTGGGTGTGG 461
GG GG + G G GG
Sbjct: 70 NSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102



Score = 51.3 bits (122), Expect = 1e-08
Identities = 32/79 (40%), Positives = 39/79 (49%), Gaps = 6/79 (7%)

Query: 377 GTIGSGGGTGTGGGTGTGGGTGTGGGTGTGGGTGTGGGT-----GTGGGTGTGGGTGTGG 431
G G G TG +G G TG G G G G+G + G G G+G G G+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 432 GTGTGGGTGTGGGTGTGGG 450
G G GG +GGG+GTGG
Sbjct: 63 GNG-GGNGNSGGGSGTGGN 80



Score = 50.9 bits (121), Expect = 2e-08
Identities = 29/73 (39%), Positives = 35/73 (47%), Gaps = 3/73 (4%)

Query: 419 GGTGTGGGTGTGGGTGTGGGTGTGGGTGTGGGTGTGGGTGT---GGGTGTGGGTGTGGGT 475
GG G G TG +G G TG G G G G+G + GGG+G+G G G G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 476 GTGGGTGTGGAEN 488
G GGG G G +
Sbjct: 63 GNGGGNGNSGGGS 75



Score = 37.8 bits (87), Expect = 2e-04
Identities = 25/69 (36%), Positives = 32/69 (46%), Gaps = 7/69 (10%)

Query: 431 GGTGTGGGTGTGGGTGTGGGTGTGGGTGTGGGTGTGGGTGT---GGGTGTG----GGTGT 483
GG G G TG +G G TG G G G G+G + GGG+G+G GG+G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 484 GGAENLNDS 492
G +S
Sbjct: 63 GNGGGNGNS 71



Score = 36.6 bits (84), Expect = 3e-04
Identities = 21/63 (33%), Positives = 25/63 (39%)

Query: 369 ANTGSNNGGTIGSGGGTGTGGGTGTGGGTGTGGGTGTGGGTGTGGGTGTGGGTGTGGGTG 428
++ + GG GSG G G G G GGG G GG GG + G G
Sbjct: 40 SSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPG 99

Query: 429 TGG 431
GG
Sbjct: 100 AGG 102



Score = 32.0 bits (72), Expect = 0.010
Identities = 18/43 (41%), Positives = 23/43 (53%), Gaps = 1/43 (2%)

Query: 360 WKSVVDTYTANTGSNNGGTIGSGGGTGTGGGTGTGGGTGTGGG 402
W S + + +GS GSG G G GG +GGG+GTGG
Sbjct: 39 WSSENNPWGGGSGSGIHWGGGSGHGNG-GGNGNSGGGSGTGGN 80



Score = 30.8 bits (69), Expect = 0.020
Identities = 17/41 (41%), Positives = 21/41 (51%), Gaps = 1/41 (2%)

Query: 368 TANTGSNNGGTIGSGGGTGTGGGTGTGGGTGTGGGTGTGGG 408
+ N G G G G+G G G GG +GGG+GTGG
Sbjct: 41 SENNPWGGGSGSGIHWGGGSGHGNG-GGNGNSGGGSGTGGN 80
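
The Sbal195_4450 hit against the cloacin signature is listed as twelve separate alignments whose query coordinates overlap heavily; the query region appears to be a glycine/threonine repeat, so the same stretch of the signature aligns at many offsets. A minimal sketch, assuming nothing about how PredictBias itself treats such hits, is to merge the query ranges and see how much of the ORF is actually covered. The function name `merge_ranges` and the constant `CLOACIN_QUERY_RANGES` are hypothetical; the coordinates are the first and last `Query:` positions of each alignment above.

```python
def merge_ranges(ranges):
    """Merge overlapping or adjacent (start, end) ranges, inclusive coordinates."""
    merged = []
    for start, end in sorted(ranges):
        if merged and start <= merged[-1][1] + 1:
            # Extend the previous range instead of starting a new one.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# Query start/end of each cloacin alignment reported for Sbal195_4450.
CLOACIN_QUERY_RANGES = [
    (383, 479), (389, 485), (407, 486), (380, 473),
    (372, 467), (370, 461), (377, 450), (419, 488),
    (431, 492), (369, 431), (360, 402), (368, 408),
]

covered = merge_ranges(CLOACIN_QUERY_RANGES)
print(covered)  # [(360, 492)]
print(sum(end - start + 1 for start, end in covered))  # 133
```

All twelve alignments collapse into a single query region of 133 residues (360 to 492), which suggests they describe one repetitive segment and not twelve independent similarities.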


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
Sbal195_4454  SACTRNSFRASE  30     0.002    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 30.3 bits (68), Expect = 0.002
Identities = 10/36 (27%), Positives = 20/36 (55%)

Query: 80 VASDFRRLGLAQSLLEYQETWARRQGYNHIQVKTMN 115
VA D+R+ G+ +LL WA+ + + ++T +
Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQD 132


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
Sbal195_4456  HTHFIS  449    e-157    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 449 bits (1157), Expect = e-157
Identities = 156/476 (32%), Positives = 239/476 (50%), Gaps = 43/476 (9%)

Query: 13 SAVSVLIVDDEPGMRSFLNKALSKKFALVETAGSIEDAEQLRSRCHFDLLIVDIRLPGRS 72
+ ++L+ DD+ +R+ LN+ALS+ V + + + DL++ D+ +P +
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 73 GIEWHEALDEQGRRSDIIFMTGYADMEVAITALRAGASDFIMKPFHLEQMMTAVDRCIER 132
+ + + ++ M+ AI A GA D++ KPF L +++ + R +
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 133 RLLKRENLMLRREVSIGYSSTIIGSSEAMKSVKHIIERVAPTNAVVLIQGESGTGKELVA 192
+ L + + ++G S AM+ + ++ R+ T+ ++I GESGTGKELVA
Sbjct: 122 PKRRPSKLEDDSQDGMP----LVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVA 177

Query: 193 RQLHLLSGR-QGPFVPVNCGSIAPELLESELFGHTAGAFTGAKGNREGLFSFASGGTIFL 251
R LH R GPFV +N +I +L+ESELFGH GAFTGA+ G F A GGT+FL
Sbjct: 178 RALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFL 237

Query: 252 DEIGEMPLKMQTALLRVLEQKAIRPVGSEKEVNIDVRVIAATNRTLIDEVDAGNFRRDLY 311
DEIG+MP+ QT LLRVL+Q VG + DVR++AATN+ L ++ G FR DLY
Sbjct: 238 DEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLY 297

Query: 312 YRLNVLDILIPPLRDRPEDVVELTHHFTRQLAAELGVREVVWSHEDMVKLQQHEWPGNIR 371
YRLNV+ + +PPLRDR ED+ +L HF +Q E G+ + E + ++ H WPGN+R
Sbjct: 298 YRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKE-GLDVKRFDQEALELMKAHPWPGNVR 356

Query: 372 ELRNMIERCILL------------------------------------GKPPAEYWKQQL 395
EL N++ R L + E +Q
Sbjct: 357 ELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYF 416

Query: 396 KS-ESLSTSGYPLDWPLKEVEKHHVTSVVDLHSGNKSAAARDLGVSRKTLDRKYKE 450
S D L E+E + + + GN+ AA LG++R TL +K +E
Sbjct: 417 ASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRE 472



 
Contact Sachin Pundhir for Bugs/Comments.