PredictBias

identification of genomic and pathogenicity islands in prokaryotic genome
Home | Help | Analyzed genomes
 
A) Input parameters
Genome2261.gbkThreshold dinucleotide bias2
Threshold codon bias4Threshold %GC bias3
E-value (RPSBlast)0.05Genome (non-pathogenic)
 
B) Compare a potential GI or PAI in related non-pathogenic sp. (phylogenetic tree)
Potential GI or PAI start    end  
Select Organism     
 
C) Potential GIs and PAIs in NC_007613 (download)
S.NoStartEndBiasVirulenceInsertion elementsPrediction
1SBO_0045SBO_0051Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_0045-3173.26087623S rRNA/tRNA pseudouridine synthase A
SBO_0046-3173.093730ATP-dependent helicase HepA
SBO_0047-1153.138188DNA polymerase II
SBO_0048-1153.380813L-ribulose-5-phosphate 4-epimerase
SBO_0049-1163.982982L-arabinose isomerase
SBO_00501163.747184ribulokinase
SBO_00510163.022916DNA-binding transcriptional regulator AraC
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0050TCRTETOQM290.040 Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 29.4 bits (66), Expect = 0.040
Identities = 19/103 (18%), Positives = 39/103 (37%), Gaps = 18/103 (17%)

Query: 300 ILIADKQSVGERAVKGICGQVDGSVV------PGFIGLEAGQS-AFGDIYAWFGRVLGWP 352
+ I++K+ + + + ++G + G I + + + G P
Sbjct: 281 VRISEKEKIK---ITEMYTSINGELCKIDKAYSGEIVILQNEFLKLNSV---LGDTKLLP 334

Query: 353 L-EQLAAQHPELKAQINASQKQ----LLPALTEAWAKNPSLDH 390
E++ P L+ + S+ Q LL AL E +P L +
Sbjct: 335 QRERIENPLPLLQTTVEPSKPQQREMLLDALLEISDSDPLLRY 377


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0051PF05616290.022 Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 28.9 bits (64), Expect = 0.022
Identities = 26/118 (22%), Positives = 47/118 (39%), Gaps = 21/118 (17%)

Query: 82 YGRHPEAREWYHQWVYFRPRAYWHEWLNWPSIFANTGFFRPDEAHQPHFSDLFGQ-IINA 140
Y R PE +E + R YW + N P ++ +F+ + +F G ++
Sbjct: 158 YSRFPEVKELMESQMERLARPYWEKLRNRPDMY----YFKNYNFKRCYFGLNGGDCLVAK 213

Query: 141 G-----------QGEGRYSELLAINLLEQLLLRRMEA-----INESLHPPMDNRVREA 182
G QG +Y E + LE++L +++A I + +P +V A
Sbjct: 214 GDDGRTFISFSLQGNSKYKEEMDAKKLEEILSLKVDANPDKYIKATGYPGYSEKVEVA 271


2SBO_0196SBO_0235Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_0196-123-3.384862***2,5-diketo-D-gluconate reductase B
SBO_0197-125-3.050098LysR family transcriptional regulator
SBO_0198-124-2.979315hypothetical protein
SBO_0199-224-3.342386biotin synthesis protein
SBO_0200-126-3.625845membrane-bound lytic murein transglycosylase D
SBO_0201-127-4.315031hydroxyacylglutathione hydrolase
SBO_0202-221-1.072554hypothetical protein
SBO_0203-121-0.813216ribonuclease H
SBO_0204124-0.748379DNA polymerase III subunit epsilon
SBO_0206427-0.196280*IS1 encoded protein
SBO_02070200.025875IS1 ORF2
SBO_0208-1160.200907hypothetical protein
SBO_0209-212-1.068285C-lysozyme inhibitor
SBO_0210-215-0.622515IS1 encoded protein
SBO_0211-214-0.788401IS1 ORF2
SBO_0213013-0.374275phosphoheptose isomerase
SBO_02141160.121306amidotransferase
SBO_0215218-0.555542hypothetical protein
SBO_0216523-0.303934IS1 ORF2
SBO_0217422-3.531769IS1 encoded protein
SBO_0219419-2.648020IS629 ORF2
SBO_0220420-2.762721IS629 ORF1
SBO_0221422-1.969208hypothetical protein
SBO_0223524-1.53784750S ribosomal protein L31
SBO_0225423-0.161365regulator
SBO_0226524-4.571834hypothetical protein
SBO_0227525-4.687085hypothetical protein
SBO_0230525-5.091730IS911 ORF2
SBO_0231525-5.116126IS600 ORF2
SBO_0232524-4.597478hypothetical protein
SBO_0233424-5.167433serine protease
SBO_02343181.186314IS1 ORF2
SBO_02353192.439420IS629 ORF1
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0221INTIMIN1679e-52 Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 167 bits (424), Expect = 9e-52
Identities = 50/106 (47%), Positives = 73/106 (68%)

Query: 1 MYEQYYGDEVGLFGKDKRQKDPHAISAEVNYTPVPLLTLSAGHKQGKSGENDTRFGLEVN 60
MYEQYYGD V LF DK Q +P A + VNYTP+PL+T+ ++ G END + ++
Sbjct: 349 MYEQYYGDNVALFNSDKLQSNPGAATVGVNYTPIPLVTMGIDYRHGTGNENDLLYSMQFR 408

Query: 61 YRIGEPLEKQLDTDSIRERRMLAGSRYDLVERNNNIVLEYRKSEVI 106
Y+ +P +Q++ + E R L+GSRYDLV+RNNNI+LEY+K +++
Sbjct: 409 YQFDKPWSQQIEPQYVNELRTLSGSRYDLVQRNNNIILEYKKQDIL 454


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0233IGASERPTASE2851e-80 IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 285 bits (729), Expect = 1e-80
Identities = 136/571 (23%), Positives = 223/571 (39%), Gaps = 122/571 (21%)

Query: 33 KKVILGIILSSIYGSYGETAFA-AMLDINNIWTRDYLDLAQNRGEFRPGATNVQLMMKDG 91
KK L I ++ +Y T + A L +++ + + D A+N+G+F GATNV + K+
Sbjct: 4 KKFKLNFIALTV--AYALTPYTEAALVRDDVDYQIFRDFAENKGKFSVGATNVLVKDKNN 61

Query: 92 KIFH--FPE-LPVPDFSAVS-NKGATTSIGGAYSVTATH--------------------N 127
K P +P+ DFS V +K T I Y V H N
Sbjct: 62 KDLGTALPNGIPMIDFSVVDVDKRIATLINPQYVVGVKHVSNGVSELHFGNLNGNMNNGN 121

Query: 128 GTQHHAITTQSWDQTAYKASNRVSS----------------GDFSVHRLNKFVVETTGVT 171
H ++++ + + + + D+ + RL+KFV T
Sbjct: 122 AKAHRDVSSEENRYFSVEKNEYPTKLNGKTVTTEDQTQKRREDYYMPRLDKFV------T 175

Query: 172 ESADFSLSPEDAMKRYGVNYNGKEQ-IIGFRAGAGTTSTILNGKQY-------------- 216
E A S + YN + + R G+G+ G Y
Sbjct: 176 EVAPIEASTASS---DAGTYNDQNKYPAFVRLGSGSQFIYKKGDNYSLILNNHEVGGNNL 232

Query: 217 -LFGQNYNPDLLSASLFNLDWKNKSYIYT--------------NRTPFKNSPIFGDSGSG 261
L G Y ++ + + ++ +N I ++ P N + GDSGS
Sbjct: 233 KLVGDAYTY-GIAGTPYKVNHENNGLIGFGNSKEEHSDPKGILSQDPLTNYAVLGDSGSP 291

Query: 262 SYLYDKEQQKWVFHGVTSTVGFISSTNIAWTNYSLFNNILVNNLKKNFTNTMQLDGKKQE 321
++YD+E+ KW+F G + +W ++++ + ++ + + K
Sbjct: 292 LFVYDREKGKWLFLGSYD--FWAGYNKKSWQEWNIYKSQFTKDVLNKDSAGSLIGSKTDY 349

Query: 322 LSSIIKD-------------------------KDLSVSGGGELTLKQDTDLGIGGLIFDK 356
S K ++ G G LTL + D G GGL F+
Sbjct: 350 SWSSNGKTSTITGGEKSLNVDLADGKDKPNHGKSVTFEGSGTLTLNNNIDQGAGGLFFEG 409

Query: 357 NQTYKVYGKDKSYKGAGIDIDNNTTVEWNVKGVAGDNLHKIGSGTLDVKIAQGN--NLKI 414
+ K + ++KGAG+ + TV W V D L KIG GTL V+ N +LK+
Sbjct: 410 DYEVKGTSDNTTWKGAGVSVAEGKTVTWKVHNPQYDRLAKIGKGTLIVEGTGDNKGSLKV 469

Query: 415 GNGTVIL------SAEKAFNKIYMAGGKGTVKINAKDALSESGNGEIYFTRNGGTLDLNG 468
G+GTVIL S + AF + + G+ T+ +N + + IYF GG LDLNG
Sbjct: 470 GDGTVILKQQTNGSGQHAFASVGIVSGRSTLVLNDDKQVDPNS---IYFGFRGGRLDLNG 526

Query: 469 YDQSFQKIAATDAGTTVTNSNVKQ-STLSLT 498
+F I D G + N N+ S +++T
Sbjct: 527 NSLTFDHIRNIDDGARLVNHNMTNASNITIT 557


3SBO_0251SBO_0266Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_0251324-3.706383*phage integrase
SBO_0252323-3.079354IS629 ORF2
SBO_0253324-3.851523IS629 ORF1
SBO_0254018-2.462560hypothetical protein
SBO_0256014-0.935982hypothetical protein
SBO_02550161.502513phage repressor
SBO_0257-1203.104289IS1 encoded protein
SBO_0258-1183.169180IS1 ORF2
SBO_0259-1183.169180taurine transporter substrate binding subunit
SBO_0260-1192.578914taurine transporter ATP-binding subunit
SBO_0261-1182.069673taurine transporter subunit
SBO_02620181.391146taurine dioxygenase
SBO_02632241.454658delta-aminolevulinic acid dehydratase
SBO_02647290.028635IS1 ORF2
SBO_0265429-1.904325IS1 encoded protein
SBO_0266221-0.774333insertion sequence 2 OrfA protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0263BINARYTOXINB300.019 Binary toxin B family signature.
		>BINARYTOXINB#Binary toxin B family signature.

Length = 764

Score = 29.7 bits (66), Expect = 0.019
Identities = 19/69 (27%), Positives = 30/69 (43%)

Query: 265 DIVRELRERTELPIGAYQVSGEYAMIKFAALAGAIDEEKVVLESLGSIKRAGADLIFSYF 324
+ EL + +L + QV G A F +D E L I+ A +IF+
Sbjct: 466 NQFLELEKTKQLRLDTDQVYGNIATYNFENGRVRVDTGSNWSEVLPQIQETTARIIFNGK 525

Query: 325 ALDLAEKKI 333
L+L E++I
Sbjct: 526 DLNLVERRI 534


4SBO_0355SBO_0371Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_0355227-4.374241insertion sequence 2 OrfA protein
SBO_0356116-1.345223IS600 ORF1
SBO_0357216-0.620907IS600 ORF2
SBO_0359114-1.351792maltose O-acetyltransferase
SBO_0360014-0.864989hemolysin expression-modulating protein
SBO_0361014-0.636199hypothetical protein
SBO_03620140.269103acridine efflux pump
SBO_0363112-0.265257acridine efflux pump
SBO_0364114-0.436526DNA-binding transcriptional repressor AcrR
SBO_03652141.673363potassium efflux protein KefA
SBO_03664153.489041hypothetical protein
SBO_03673153.913189primosomal replication protein N''
SBO_03683222.527016hypothetical protein
SBO_03693262.365283adenine phosphoribosyltransferase
SBO_03702212.303898DNA polymerase III subunits gamma and tau
SBO_03712210.805542hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0362ACRIFLAVINRP13690.0 Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1369 bits (3546), Expect = 0.0
Identities = 802/1033 (77%), Positives = 915/1033 (88%), Gaps = 1/1033 (0%)

Query: 1 MPNFFIDRPIFAWVIAIIIMLAGGLAILKLPVAQYPTIAPPAVTISASYPGADAKTVQDT 60
M NFFI RPIFAWV+AII+M+AG LAIL+LPVAQYPTIAPPAV++SA+YPGADA+TVQDT
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTQVIEQNMNGIDNLMYMSSNSDSTGTVQITLTFESGTDADIAQVQVQNKLQLAMPLLPQ 120
VTQVIEQNMNGIDNLMYMSS SDS G+V ITLTF+SGTD DIAQVQVQNKLQLA PLLPQ
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 121 EVQQQGVSVEKSSSSFLMVVGVINTDGTMTQEDISDYVAANMKDAISRTSGVGDVQLFGS 180
EVQQQG+SVEKSSSS+LMV G ++ + TQ+DISDYVA+N+KD +SR +GVGDVQLFG+
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 181 QYAMRIWMNPNELNKFQLTPVDVITAIKAQNAQVAAGQLGGTPPVKGQQLNASIIAQTRL 240
QYAMRIW++ + LNK++LTPVDVI +K QN Q+AAGQLGGTP + GQQLNASIIAQTR
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 241 TSTEEFGKILLKVNQDGSRVLLRDVAKIELGGENYDIIAEFNGQPASGLGIKLATGANAL 300
+ EEFGK+ L+VN DGS V L+DVA++ELGGENY++IA NG+PA+GLGIKLATGANAL
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 301 DTAAAIRAELAKMEPFFPSGLKIVYPYDTTPFVKISIHEVVKTLVEAIILVFLVMYLFLQ 360
DTA AI+A+LA+++PFFP G+K++YPYDTTPFV++SIHEVVKTL EAI+LVFLVMYLFLQ
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 361 NFRATLIPTIAVPVVLLGTFAVLAAFGFSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420
N RATLIPTIAVPVVLLGTFA+LAAFG+SINTLTMFGMVLAIGLLVDDAIVVVENVERVM
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 421 AEEGLPPKEATRKSMGQIQGALVGIAMVLSAVFVPMAFFGGSTGAIYRQFSITIVSAMAL 480
E+ LPPKEAT KSM QIQGALVGIAMVLSAVF+PMAFFGGSTGAIYRQFSITIVSAMAL
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 481 SVLVALILTPALCATMLKPIAKGDHGEGKKGFFGWFNRMFEKSTHHYTDSVGGILRSTGR 540
SVLVALILTPALCAT+LKP++ H E K GFFGWFN F+ S +HYT+SVG IL STGR
Sbjct: 481 SVLVALILTPALCATLLKPVSAE-HHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 541 YLVLYLIIVVGMAYLFVRLPSSFLPDEDQGVFMTMVQLPAGATQERTQKVLNEVTHYYLT 600
YL++Y +IV GM LF+RLPSSFLP+EDQGVF+TM+QLPAGATQERTQKVL++VT YYL
Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 601 KEKNNVESVFAVNGFGFAGRGQNTGIAFVSLKDWADRPGEENKVEAITMRATRAFSQIKD 660
EK NVESVF VNGF F+G+ QN G+AFVSLK W +R G+EN EA+ RA +I+D
Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659

Query: 661 AMVFAFNLPAIVELGTATGFDFELIDQAGLGHEKLTQARNQLLAEAAKHPDMLTSVRPNG 720
V FN+PAIVELGTATGFDFELIDQAGLGH+ LTQARNQLL AA+HP L SVRPNG
Sbjct: 660 GFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNG 719

Query: 721 LEDTPQFKIDIDQEKAQALGVSINDINTTLGAAWGGSYVNDFIDRGRVKKVYVMSEAKYR 780
LEDT QFK+++DQEKAQALGVS++DIN T+ A GG+YVNDFIDRGRVKK+YV ++AK+R
Sbjct: 720 LEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFR 779

Query: 781 MLPDDIGDWYVRAADGQMVPFSAFSSSRWEYGSPRLERYNGLPSMEILGQAAPGKSTGEA 840
MLP+D+ YVR+A+G+MVPFSAF++S W YGSPRLERYNGLPSMEI G+AAPG S+G+A
Sbjct: 780 MLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDA 839

Query: 841 MELMEQLASKLPTGVGYDWTGMSYQERLSGNQAPSLYAISLIVVFLCLAALYESWSIPFS 900
M LME LASKLP G+GYDWTGMSYQERLSGNQAP+L AIS +VVFLCLAALYESWSIP S
Sbjct: 840 MALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVS 899

Query: 901 VMLVVPLGVIGALLAATFRGLTNDVYFQVGLLTTIGLSAKNAILIVEFAKDLMDKEGKGL 960
VMLVVPLG++G LLAAT NDVYF VGLLTTIGLSAKNAILIVEFAKDLM+KEGKG+
Sbjct: 900 VMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGV 959

Query: 961 IEATLDAVRMRLRPILMTSLAFILGVMPLVISTGAGSGAQNAVGTGVMGGMVTATVLAIF 1020
+EATL AVRMRLRPILMTSLAFILGV+PL IS GAGSGAQNAVG GVMGGMV+AT+LAIF
Sbjct: 960 VEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIF 1019

Query: 1021 FVPVFFVVVRRRF 1033
FVPVFFVV+RR F
Sbjct: 1020 FVPVFFVVIRRCF 1032


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0363RTXTOXIND445e-07 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 44.4 bits (105), Expect = 5e-07
Identities = 34/212 (16%), Positives = 71/212 (33%), Gaps = 23/212 (10%)

Query: 100 TYQATYDSAKGDLAKAQAAANIAQLTVNRYQKLLGTQYISKQEYDQALADAQQANAAVTA 159
+ Y A +L + + Q+ Q +++ ++ L +Q +
Sbjct: 256 EQENKYVEAVNELR--VYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGL 313

Query: 160 AKAAVETARINLAYTKVTAPISGRIGKSNV-TEGALVQNGQATALATVQQLDPIYVDVTQ 218
+ + + AP+S ++ + V TEG +V + T + V + D + V
Sbjct: 314 LTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE-TLMVIVPEDDTLEVTALV 372

Query: 219 SSNDFLRLKQELA----------NGTLKQENGKAKVSLITSDGIKFPQDGTLEFSDVTVD 268
+ D + KV I D I+ + G + ++++
Sbjct: 373 QNKDIGFINVGQNAIIKVEAFPYTRYGYLV---GKVKNINLDAIEDQRLGLVFNVIISIE 429

Query: 269 QTTGSITLRAIFPNPDHTLLPGMFVRARLEEG 300
+ S + I L GM V A ++ G
Sbjct: 430 ENCLSTGNKNIP------LSSGMAVTAEIKTG 455



Score = 34.0 bits (78), Expect = 8e-04
Identities = 24/125 (19%), Positives = 43/125 (34%), Gaps = 13/125 (10%)

Query: 49 PLQITTELPGR-TSAYRIAEVRPQVSGIILKRNFKEGSDIEAGVSLYQIDPATYQATYDS 107
++I G+ T + R E++P + I+ + KEG + G L ++ +A
Sbjct: 79 QVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEA---- 134

Query: 108 AKGDLAKAQAAANIAQLTVNRYQKLLGTQYISKQEYDQALADAQQANAAVTAAKAAVETA 167
D K Q++ A+L RYQ L E ++
Sbjct: 135 ---DTLKTQSSLLQARLEQTRYQILS-----RSIELNKLPELKLPDEPYFQNVSEEEVLR 186

Query: 168 RINLA 172
+L
Sbjct: 187 LTSLI 191


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0364HTHTETR2225e-76 TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 222 bits (567), Expect = 5e-76
Identities = 215/215 (100%), Positives = 215/215 (100%)

Query: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60
MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120
EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV
Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120

Query: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180
GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF
Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180

Query: 181 APQSFDLKKEARDYVAILLEMYLLCPTLRNPATNE 215
APQSFDLKKEARDYVAILLEMYLLCPTLRNPATNE
Sbjct: 181 APQSFDLKKEARDYVAILLEMYLLCPTLRNPATNE 215


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0370IGASERPTASE372e-04 IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 37.0 bits (85), Expect = 2e-04
Identities = 38/245 (15%), Positives = 78/245 (31%), Gaps = 32/245 (13%)

Query: 386 PTQVPPQPQSAPQQAPTVPLPETTSQVLAARQQLQRVQGATKAKKSEPAAATRARPVPSA 445
T + P + P+VP + +++ RV A + + V
Sbjct: 994 TTNITT-PNNIQADVPSVP---------SNNEEIARVDEAPVPPPAPATPSETTETVAEN 1043

Query: 446 LEKAPAKKEAYRWKATTPVMQQKE--------VVATPKALKKA---LEHEKTPELAAKLA 494
++ E AT Q +E V A + + A E ++T K
Sbjct: 1044 SKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKET 1103

Query: 495 A----------EAIERDPWAARVSQLSLPKLVEQVALNAWKEESDNAVCLHLRSSQRHLN 544
A E + SQ+S + + + +N ++++ Q N
Sbjct: 1104 ATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTN 1163

Query: 545 NRGAQQKLAEALST-LKGSTVELTIVEDDNPAVRTPLEWRQAIYEEKLAQARESIIADNN 603
++ A+ S+ ++ E T V N V P A + + + + +
Sbjct: 1164 TTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRH 1223

Query: 604 IQTLR 608
+++R
Sbjct: 1224 RRSVR 1228


5SBO_0386SBO_0401Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_0386-1163.250898copper exporting ATPase
SBO_0387-2150.449960glutaminase
SBO_0388019-0.744724amino acid/amine transport protein
SBO_0389017-0.788083DNA-binding transcriptional regulator CueR
SBO_0390-117-1.251560hypothetical protein
SBO_0391-117-1.053354protease
SBO_0392-119-0.351736ABC transporter ATP-binding protein
SBO_0393-1212.119912metal resistance protein
SBO_03952317.312519short chain dehydrogenase
SBO_03974337.538959multifunctional acyl-CoA thioesterase I/protease
SBO_03964356.212451ABC transporter ATP-binding protein
SBO_03995355.765444hypothetical protein
SBO_04003294.907090protein in rhs element
SBO_04012284.131533protein in rhs element
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0387BLACTAMASEA310.007 Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 30.5 bits (69), Expect = 0.007
Identities = 11/43 (25%), Positives = 19/43 (44%)

Query: 38 GQLAVVAIVTCDGNVYSAGDSDYRFALESISKVCTLALALEDV 80
G++ ++ + G +A +D RF + S KV L V
Sbjct: 38 GRVGMIEMDLASGRTLTAWRADERFPMMSTFKVVLCGAVLARV 80


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0395DHBDHDRGNASE771e-18 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 76.6 bits (188), Expect = 1e-18
Identities = 48/212 (22%), Positives = 81/212 (38%), Gaps = 7/212 (3%)

Query: 16 KSVLITGCSSGIGLESALELKRQGFHVLAGCRKPDDVERMNN----MGFT--GVLIDLDS 69
K ITG + GIG A L QG H+ A P+ +E++ + D+
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 70 PESVDRAADEVIALTDNCLYGIFNNAGFGMYGPLSTISRAQMEQQFSANFFGAHQLTMRL 129
++D + + + N AG G + ++S + E FS N G + +
Sbjct: 69 SAAIDEITARIEREMGP-IDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 130 LPAMLPHGEGRIVMTSSVMGLISTPGRGAYAASKYALEAWSDALRMELRHSGIKVSLIEP 189
M+ G IV S + AYA+SK A ++ L +EL I+ +++ P
Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187

Query: 190 GPIRTRFTDNVNQTQSDKPVENPGIAARFTLG 221
G T ++ ++ G F G
Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGSLETFKTG 219


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0396PF05272290.014 Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.3 bits (65), Expect = 0.014
Identities = 12/20 (60%), Positives = 13/20 (65%)

Query: 41 LVGESGSGKSTLLAILAGLD 60
L G G GKSTL+ L GLD
Sbjct: 601 LEGTGGIGKSTLINTLVGLD 620


6SBO_0413SBO_0426Y        NYGenomic Island
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_04133160.880617IS1 ORF2
SBO_04143181.036042bifunctional 5,10-methylene-tetrahydrofolate
SBO_04153201.606289hypothetical protein
SBO_04162182.182213hypothetical protein
SBO_04171172.642234cysteinyl-tRNA synthetase
SBO_04181153.354442peptidyl-prolyl cis-trans isomerase B
SBO_04190144.072309UDP-2,3-diacylglucosamine hydrolase
SBO_0420-1154.166824phosphoribosylaminoimidazole carboxylase
SBO_0421-1163.510255phosphoribosylaminoimidazole carboxylase ATPase
SBO_0423-1200.771905carboxylase
SBO_0424-119-0.505952hypothetical protein
SBO_0425124-2.197321insertion element IS2 transposase InsD
SBO_0426125-3.288362insertion sequence 2 OrfA protein
7SBO_0447SBO_0472Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_04472153.088823enterobactin synthase subunit F
SBO_04482172.641404ferric enterobactin transport protein FepE
SBO_04491174.753463iron-enterobactin transporter ATP-binding
SBO_04501164.997456iron-enterobactin transporter permease
SBO_04510164.566035iron-enterobactin transporter membrane protein
SBO_0452-1164.033187enterobactin exporter EntS
SBO_0453-2163.979230iron-enterobactin transporter periplasmic
SBO_0454-2184.280556isochorismate hydroxymutase
SBO_0455-2204.243546enterobactin synthase subunit E
SBO_0456-1184.0660772,3-dihydro-2,3-dihydroxybenzoate synthetase
SBO_0457-1152.1339772,3-dihydroxybenzoate-2,3-dehydrogenase
SBO_04580151.591609hypothetical protein
SBO_0459-1140.622436carbon starvation protein
SBO_0460-218-1.441398hypothetical protein
SBO_0461-118-0.678934hypothetical protein
SBO_0463-221-1.548446hypothetical protein
SBO_0464322-0.312300IS1 ORF2
SBO_0465118-0.174782IS1 encoded protein
SBO_04662190.323357IS1 ORF2
SBO_0467-116-0.384817insertion element IS2 transposase InsD
SBO_0468-117-1.056941insertion sequence 2 OrfA protein
SBO_0470-119-0.888464alkyl hydroperoxide reductase subunit C
SBO_0471-114-0.747705alkyl hydroperoxide reductase F52a subunit
SBO_0472216-1.472783hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0452TCRTETA363e-04 Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 36.0 bits (83), Expect = 3e-04
Identities = 82/394 (20%), Positives = 146/394 (37%), Gaps = 38/394 (9%)

Query: 27 FISIVSLGLLGVAVPVQIQMMTHSTWLV---GLSVTLTGGAMFVGLMVGGVLADRYERKK 83
+ V +GL+ +P ++ + HS + G+ + L F V G L+DR+ R+
Sbjct: 15 ALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRP 74

Query: 84 VILLARGTCGIGFIGLCLNALL--PEPSLLAIYLLGLWDGFFASLGVTALLAATPALVGR 141
V+L + G ++ + P L +Y+ + G + G A A +
Sbjct: 75 VLL-------VSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVA-GAYIADITDG 126

Query: 142 ENLMQAGAITMLTVRLGSVISPMIGGLLLATGGVAWNYGLAAAGTFITLLPLLSLPALPP 201
+ + G V P++GGL+ GG + + AA L L LP
Sbjct: 127 DERARHFGFMSACFGFGMVAGPVLGGLM---GGFSPHAPFFAAAALNGLNFLTGCFLLPE 183

Query: 202 PPQPREHPLK----SLLAGFRFLLASPLVGGIALLGGLLTMAS----AVRVLYPALADNW 253
+ PL+ + LA FR+ +V + + ++ + A+ V++ D +
Sbjct: 184 SHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFG--EDRF 241

Query: 254 QMSAAEIGFLYAAIP-LGAAIGALTSGKLAHSARPGLLMLLSTLGS---FLAIGLFGLMP 309
A IG AA L + A+ +G +A ++L + ++ +
Sbjct: 242 HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGW 301

Query: 310 MWILGVVCLALFGWLSAVSSLLQYTMLQTQTPEAMLGRINGLWTAQNVTGDAIGAALLGG 369
M +V LA G ML Q E G++ G A +G L
Sbjct: 302 MAFPIMVLLASGGIGMPALQ----AMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTA 357

Query: 370 LGAMMTPVASASASGFGLLIIGVLLLLVLVELRR 403
+ A + + +G+ + L LL L LRR
Sbjct: 358 IYA----ASITTWNGWAWIAGAALYLLCLPALRR 387


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0453FERRIBNDNGPP632e-13 Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 62.7 bits (152), Expect = 2e-13
Identities = 62/285 (21%), Positives = 103/285 (36%), Gaps = 35/285 (12%)

Query: 40 HTLESQPQRIVSTSVTLTGSLLAIDAPVIASGATTPNNRVADDQGFLRQWSKVAKERKLQ 99
H P RIV+ LLA+ VAD + R W E L
Sbjct: 29 HAAAIDPNRIVALEWLPVELLLALGIVPYG---------VADTINY-RLW---VSEPPLP 75

Query: 100 RLYIG-----EPSAEAVAAQMPDLILISATGGDSALALYDQLSTIAPTLIINYDDKS--- 151
I EP+ E + P ++ SA G S + L+ IAP N+ D
Sbjct: 76 DSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPS----PEMLARIAPGRGFNFSDGKQPL 131

Query: 152 --WQSLLTQLGEITGHEKQAAERIAQFDKQLAAAKEQIKLPPQPVTAIVYTAAAHSANLW 209
+ LT++ ++ + A +AQ++ + + K + + ++
Sbjct: 132 AMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVF 191

Query: 210 TPESAQGQMLEQLGFTLAKLPTGLNASQSQGKRHDIIQLSGENLAAGLNGESLFLFAGDQ 269
P S ++L++ G NA Q + +S + LAA + + L +
Sbjct: 192 GPNSLFQEILDEYGIP--------NAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNS 243

Query: 270 KDADAIYANPLLAHLPAVQNKQVYALGTETFRLDYYSAMQVLERL 314
KD DA+ A PL +P V+ + + F SAM + L
Sbjct: 244 KDMDALMATPLWQAMPFVRAGRFQRVPAVWFYGATLSAMHFVRVL 288


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0456ISCHRISMTASE440e-159 Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 440 bits (1132), Expect = e-159
Identities = 146/299 (48%), Positives = 194/299 (64%), Gaps = 18/299 (6%)

Query: 1 MAIPKLQAYALPESHDIPQNKVDWAFELQRAALLIHDMQDYFVSFWGENCPMMEQVIANI 60
MAIP +Q Y +P + D+PQNKV W + RA LLIHDMQ+YFV + + ++ ANI
Sbjct: 1 MAIPAIQPYQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANI 60

Query: 61 AALRDYCKQHNIPVYYTAQPKEQSDEDRALLNDMWGPGLTRSPEQQKVVDRLTPDADDTV 120
L++ C Q IPV YTAQP Q+ +DRALL D WGPGL P ++K++ L P+ DD V
Sbjct: 61 RKLKNQCVQLGIPVVYTAQPGSQNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLV 120

Query: 121 LVKWRYSAFHRSPLEQMLKESGRNQLIITGVYAHIGCMTTATDAFMRDIKPFMVADALAD 180
L KWRYSAF R+ L +M+++ GR+QLIITG+YAHIGC+ TA +AFM DIK F V DA+AD
Sbjct: 121 LTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVAD 180

Query: 181 FSRDEHLMSLKYVAGRSGRVVMTEELL------PAPVPASKA-----------ALREVIL 223
FS ++H M+L+Y AGR VMT+ LL PA V + A +R+ I
Sbjct: 181 FSLEKHQMALEYAAGRCAFTVMTDSLLDQLQNAPADVQKTSANTGKKNVFTCENIRKQIA 240

Query: 224 PLLDESDEPFDDD-NLIDYGLDSVRMMALAARWRKVHGDIDFVMLAKNPTIDAWWKLLS 281
LL E+ E D +L+D GLDSVR+M L +WR+ ++ FV LA+ PTI+ W KLL+
Sbjct: 241 ELLQETPEDITDQEDLLDRGLDSVRIMTLVEQWRREGAEVTFVELAERPTIEEWQKLLT 299


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0457DHBDHDRGNASE363e-130 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 363 bits (932), Expect = e-130
Identities = 108/258 (41%), Positives = 149/258 (57%), Gaps = 20/258 (7%)

Query: 5 GKNVWVTGAGKGIGYATALAFVEAGAKVTGFD---------------QAFTQEQYPFATE 49
GK ++TGA +GIG A A GA + D +A E +P
Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP---- 63

Query: 50 VMDVADAGQVAQVCQRLLAETERLDVLINAAGILRMGATDQLSKEDWQQTFAVNVGGAFN 109
DV D+ + ++ R+ E +D+L+N AG+LR G LS E+W+ TF+VN G FN
Sbjct: 64 -ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122

Query: 110 LFQQTMNQFRRQRGGAIVTVASDAAHTPRIGMSAYGASKAALKSLALSVGLELAGSGVRC 169
+ +R G+IVTV S+ A PR M+AY +SKAA +GLELA +RC
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 170 NVVSPGSTDTDMQRTLWVSDDAEEQRIRGFGEQFKLGIPLGKIARPQEIANTILFLASDL 229
N+VSPGST+TDMQ +LW ++ EQ I+G E FK GIPL K+A+P +IA+ +LFL S
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 230 ASHITLQDIVVDGGSTLG 247
A HIT+ ++ VDGG+TLG
Sbjct: 243 AGHITMHNLCVDGGATLG 260


8SBO_0555SBO_0561Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_0555-2163.626958hypothetical protein
SBO_0556-1163.891701DNA-binding transcriptional activator KdpE
SBO_0557-1185.147261sensor protein KdpD
SBO_05590163.416994potassium-transporting ATPase subunit B
SBO_05601193.088725potassium-transporting ATPase subunit A
SBO_05612222.875786potassium-transporting ATPase subunit F
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0556HTHFIS921e-23 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 91.8 bits (228), Expect = 1e-23
Identities = 35/125 (28%), Positives = 58/125 (46%), Gaps = 1/125 (0%)

Query: 2 TNVLIVEDEQAIRRFLRTALEGDGMRVYEAETLQRGLLEAATRKPDLIILDLGLPDGDGI 61
+L+ +D+ AIR L AL G V A DL++ D+ +PD +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 EFIRDLRQWSA-VPVIVLSARSEESDKIAALDAGADDYLSKPFGIGELQARLRVALRRHS 120
+ + +++ +PV+V+SA++ I A + GA DYL KPF + EL + AL
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 121 ATAAP 125
+
Sbjct: 124 RRPSK 128


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0557PF06580320.012 Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 31.8 bits (72), Expect = 0.012
Identities = 10/48 (20%), Positives = 21/48 (43%), Gaps = 4/48 (8%)

Query: 785 LLENAVKYAGAQAE----IGIDAHVEGENLQLDVWDNGPGLPPGQEQT 828
L+EN +K+ AQ I + + + L+V + G +++
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKES 310


9SBO_0593SBO_0598Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_0593318-0.501680cytochrome d terminal oxidase polypeptide
SBO_0594218-0.623838hypothetical protein
SBO_0595322-0.556770acyl-CoA thioester hydrolase YbgC
SBO_0596321-0.746325colicin uptake protein TolQ
SBO_0597320-0.823617colicin uptake protein TolR
SBO_0598318-1.267746cell envelope integrity inner membrane protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0598IGASERPTASE647e-13 IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 63.5 bits (154), Expect = 7e-13
Identities = 39/210 (18%), Positives = 70/210 (33%), Gaps = 9/210 (4%)

Query: 79 EQRKMKEQQAAEELREKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAELKQKQAEVAA 138
E+R A+ + +E E A +E + AE +
Sbjct: 986 EKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSK 1045

Query: 139 AKAAADAKAAEEAAKKAAADAKKKAEAEAAKAAAEAQKKAEVAAAALKKKAEAAEAAAAE 198
++ K E+ A + A ++ A+ + A Q EVA + +E E E
Sbjct: 1046 QESKTVEKN-EQDATETTAQNREVAKEAKSNVKANTQT-NEVA----QSGSETKETQTTE 1099

Query: 199 ARKKAATEAAEKAKAEAEKKAAAEKAAADKKAAAEKAAADKKAAEKAAADKAAADKKAAA 258
++ A E EKAK E EK K + E++ + AE A +
Sbjct: 1100 TKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTV---NIK 1156

Query: 259 EKAAADKKAAAAKAAAEKAAAAKAAAEADD 288
E + A + A++ ++ +
Sbjct: 1157 EPQSQTNTTADTEQPAKETSSNVEQPVTES 1186



Score = 55.1 bits (132), Expect = 3e-10
Identities = 30/244 (12%), Positives = 81/244 (33%), Gaps = 20/244 (8%)

Query: 51 DAVMVDSGAVVEQYKRMQSQESSAKRSDEQRKMKEQQAAE-ELREKQAAEQER------L 103
D V A + ++ ++K+ + + EQ A E + ++ A++ +
Sbjct: 1021 DEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANT 1080

Query: 104 KQLEKERLAAQEQKKQAEEAAKQAELKQKQAEVAAAKAAADAKAAEEAAKKAAADAKKKA 163
+ E + ++ ++ Q K+ +K+ KA + + +E K + + K+
Sbjct: 1081 QTNEVAQSGSETKETQ-TTETKETATVEKE-----EKAKVETEKTQEVPKVTSQVSPKQE 1134

Query: 164 EAEAAKAAAEAQKKAEVAAAALKKKAEAAEAAAAEARKKAATEAAEKAKAEAEKKAAAEK 223
++E + AE ++ + E + + A++ + E+
Sbjct: 1135 QSETVQPQAEPARENDPTVN-------IKEPQSQTNTTADTEQPAKETSSNVEQPVTEST 1187

Query: 224 AAADKKAAAEKAAADKKAAEKAAADKAAADKKAAAEKAAADKKAAAAKAAAEKAAAAKAA 283
+ E A + + +++K + + + A +
Sbjct: 1188 TVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTV 1247

Query: 284 AEAD 287
A D
Sbjct: 1248 ALCD 1251



Score = 54.7 bits (131), Expect = 5e-10
Identities = 33/221 (14%), Positives = 76/221 (34%), Gaps = 8/221 (3%)

Query: 66 RMQSQESSAKRSDEQRKMKEQQAAEELREKQAAEQERLKQLEKERLAAQEQKKQAEEAAK 125
R ++E+ + + + Q+ E +E Q E + +EKE A E +K E
Sbjct: 1066 REVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKV 1125

Query: 126 QAELKQKQAEVAAAKAAADAKAAEEAAKKAAADAKKKAEAEAAKAAAEAQKKAEVAAAAL 185
+++ KQ ++ AE A + K+ +++ A Q E ++
Sbjct: 1126 TSQVSPKQ-----EQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVE 1180

Query: 186 KKKAEAAEAAAAEARKKAATEAAEKAKAEAEKKAAAEKAAADKKAAAEKAAADKKAAEKA 245
+ E+ + + ++ K + + + + A +
Sbjct: 1181 QPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTS 1240

Query: 246 AADKAAADKKAAAEKAAADKKAAAAKAAAEKAAAAKAAAEA 286
+ D++ A + + + A + A A+ A +A
Sbjct: 1241 SNDRSTV---ALCDLTSTNTNAVLSDARAKAQFVALNVGKA 1278



Score = 52.4 bits (125), Expect = 3e-09
Identities = 32/184 (17%), Positives = 67/184 (36%), Gaps = 12/184 (6%)

Query: 99 EQERLKQLEKERLAAQEQKKQAEEAAKQAELKQK----QAEVAAAKAAADAKAAEEAAK- 153
E E+ Q QA+ + + ++ +A V A ++ E A+
Sbjct: 984 EVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAEN 1043

Query: 154 -KAAADAKKKAEAEAAKAAAEAQKKAEVAAAALKKKAEAAEAAAAEARKKAATEAAEKAK 212
K + +K E +A + A+ ++ A+ A + +K + E A ++ +E E
Sbjct: 1044 SKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVA------QSGSETKETQT 1097

Query: 213 AEAEKKAAAEKAAADKKAAAEKAAADKKAAEKAAADKAAADKKAAAEKAAADKKAAAAKA 272
E ++ A EK K + K ++ + + + + AE A + K
Sbjct: 1098 TETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKE 1157

Query: 273 AAEK 276
+
Sbjct: 1158 PQSQ 1161



Score = 47.4 bits (112), Expect = 9e-08
Identities = 29/213 (13%), Positives = 68/213 (31%), Gaps = 6/213 (2%)

Query: 71 ESSAKRSDEQRKMKEQQAAEELREKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAELK 130
+S ++ + Q ++ A E EK E E+ +++ K +++Q+E QAE
Sbjct: 1087 QSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPA 1146

Query: 131 QKQAEVAAAKAAADAKAAEEAAKKAAADAKKKAEAEAAKAAAEAQKKAEVAAAALKKKAE 190
++ K ++ A + E ++ + V A
Sbjct: 1147 RENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPAT 1206

Query: 191 AAEAAAAEARKKAATEAAEKAKAEAEKKAAAEKAAADKKAAAEKAAADKKAAEKAAADKA 250
+E+ K ++ A ++ D+ A +
Sbjct: 1207 TQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTN------TNAV 1260

Query: 251 AADKKAAAEKAAADKKAAAAKAAAEKAAAAKAA 283
+D +A A+ A + A ++ ++ +
Sbjct: 1261 LSDARAKAQFVALNVGKAVSQHISQLEMNNEGQ 1293



Score = 40.4 bits (94), Expect = 1e-05
Identities = 22/156 (14%), Positives = 44/156 (28%), Gaps = 11/156 (7%)

Query: 126 QAELKQKQAEVAAAKAAADAKAAEEAAK-KAAADAKKKAEAEAAKAAAEAQKKAEVAAAA 184
+ E + + + + +A + A+ A A + E A
Sbjct: 984 EVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAEN 1043

Query: 185 LKKKAEAAEAAAAEARKKAATEAAEKAKAEAEKKAAAEKAAADKKAAAEKAAADKKAAEK 244
K++++ E K +A E E A+ E A + + E
Sbjct: 1044 SKQESKTVE--------KNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKET 1095

Query: 245 AAADKAAADKKAAAEKAAA--DKKAAAAKAAAEKAA 278
+ EKA +K K ++ +
Sbjct: 1096 QTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSP 1131


10SBO_0621SBO_0661Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_06213181.044230phosphotransferase
SBO_06223230.861445IS911 ORF2
SBO_06234230.695049IS629 ORF1
SBO_06244230.727570IS629 ORF2
SBO_06254230.261825IS1 ORF2
SBO_06264230.261825IS1 encoded protein
SBO_0631224-0.534449****IS600 ORF1
SBO_06324270.065082IS629 ORF2
SBO_0633329-0.077878IS629 ORF1
SBO_06344270.065082IS911 ORF2
SBO_06352250.813440IS1 ORF2
SBO_06363260.451512IS1 encoded protein
SBO_06374251.195907IS600 ORF2
SBO_06384241.582570IS629 ORF1
SBO_06394211.342541IS629 ORF2
SBO_06403221.526698transglycosylase
SBO_06414211.068054hypothetical protein
SBO_06422200.382109protein encoded within IS
SBO_0643118-0.256528protein encoded within IS
SBO_0644221-0.432686protein encoded within IS
SBO_0645222-1.231188IS2 ORF2
SBO_0646225-0.836829insertion sequence 2 OrfA protein
SBO_0647123-0.654442tail fiber protein
SBO_0648527-2.324450tail fiber assembly protein
SBO_0649423-1.244790IS600 ORF2
SBO_0650425-1.661840IS600 ORF1
SBO_0651424-1.126589phage tail fiber protein
SBO_0652015-1.704263prophage tail protein
SBO_0653014-1.475113invasion plasmid antigen
SBO_06540120.1478376-phosphogluconolactonase
SBO_06560140.558376IS1 encoded protein
SBO_0657-1151.122353IS1 ORF2
SBO_06580151.909576hypothetical protein
SBO_06591133.193296pectinesterase
SBO_06602143.500239kinase inhibitor protein
SBO_06610133.013085adenosylmethionine-8-amino-7-oxononanoate
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0651FLAGELLIN280.045 Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 28.5 bits (63), Expect = 0.045
Identities = 13/89 (14%), Positives = 29/89 (32%)

Query: 4 QQSAGAAAGNAQQTAQDVAAAATARDDAQRFAENARQDATVTAEDRKATAEDVTSTGANA 63
Q + N D+ A + +++ A A + + + +
Sbjct: 341 QFTFDDKTKNESAKLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTAS 400

Query: 64 AAAGQSAQDAAGYARAAEQAKNDIDAALT 92
+ +DAA ++ ID+AL+
Sbjct: 401 GVSTLINEDAAAAKKSTANPLASIDSALS 429


11SBO_0718SBO_0775Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_0718217-0.291787L-asparaginase
SBO_0719017-0.927976glutathione transporter ATP-binding protein
SBO_0720118-2.541219transporter
SBO_0721016-5.160778transport system permease
SBO_0722-112-4.861147transport system permease
SBO_0723012-5.850518hypothetical protein
SBO_0724013-2.868678IS1 ORF2
SBO_0725014-2.947248hypothetical protein
SBO_0726115-1.540592hypothetical protein
SBO_07273191.04672930S ribosomal protein S12 methylthiotransferase
SBO_07280240.764819biofilm formation regulatory protein BssR
SBO_07290221.244332insertion element IS2 transposase InsD
SBO_07300180.240164insertion sequence 2 OrfA protein
SBO_0731-1150.132040IS2 ORF2
SBO_0732115-0.378459insertion sequence 2 OrfA protein
SBO_0733115-0.398406dehydrogenase
SBO_0734116-1.397284transferase
SBO_0736215-0.757317DNA-binding transcriptional repressor DeoR
SBO_0737214-0.573011undecaprenyl pyrophosphate phosphatase
SBO_0738013-0.117217proton motive force efflux pump
SBO_07390140.226958hypothetical protein
SBO_07400140.397294hypothetical protein
SBO_07411171.650638DEOR-type transcriptional regulator
SBO_07423231.000124DEOR-type transcriptional regulator
SBO_0743423-0.980989IS911 ORF2
SBO_0744223-2.667205IS629 ORF2
SBO_0745223-2.596083IS629 ORF1
SBO_0746123-3.531096IS1 ORF2
SBO_0747018-1.937566IS1 encoded protein
SBO_0748018-2.162002repressor protein
SBO_0749124-6.207186hypothetical protein
SBO_0750230-7.946446IS911 ORF2
SBO_0751233-9.336954DNA adenine methylase
SBO_0752332-8.716007endonuclease
SBO_0753329-8.702989damage-inducible protein
SBO_0754227-6.976606hypothetical protein
SBO_0755022-0.991777hypothetical protein
SBO_07562303.336866capsid portal protein
SBO_07573314.308038capsid portal protein
SBO_07583314.802993terminase, ATPase subunit
SBO_07594315.047225capsid scaffolding protein
SBO_07603275.312057major capsid protein
SBO_07613265.050361terminase, endonuclease subunit
SBO_07623264.389093capsid completion protein
SBO_07632233.317382phage tail protein
SBO_07643211.758877secretory protein
SBO_07653221.173152lysozyme protein R of prophage CP-933K
SBO_07664220.192470hypothetical protein
SBO_0767422-1.190485regulatory protein
SBO_0768323-2.657857phage tail protein
SBO_0769220-0.816688phage tail protein
SBO_0770319-0.167823IS1 encoded protein
SBO_07713180.153685IS1 ORF2
SBO_07723162.386909hypothetical protein
SBO_07733153.280726inversion of adjacent DNA; at locus of e14
SBO_07742173.550448major tail sheath protein
SBO_07752173.006561major tail tube protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0738TCRTETA401e-05 Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 39.8 bits (93), Expect = 1e-05
Identities = 57/269 (21%), Positives = 106/269 (39%), Gaps = 23/269 (8%)

Query: 63 LLGPLSDRIGRRPVMLAGVVWFIITCLAILLAQNIEQFTLLRFLQGISLCFIGAVGYAAI 122
+LG LSDR GRRPV+L + + + A + + R + GI+ GAV A I
Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGA-TGAVAGAYI 120

Query: 123 QESFEEAVCIKITALMANVALIAPLLGPLVG---AAWIHVLPWEGMFVLFAALAAISFFG 179
+ + + M+ + GP++G + P F AAL ++F
Sbjct: 121 ADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAP----FFAAAALNGLNFLT 176

Query: 180 LQRAMPETATRIGEKLSLKELGRDYKLVLKNG-RFVAGALALGFVSLPLLAWIAQSP--I 236
+PE+ L + L G VA +A+ F ++ + Q P +
Sbjct: 177 GCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFF----IMQLVGQVPAAL 232

Query: 237 IIITGEQLSSYEYGLLQVPIFGALIAGNL----LLARLTSRRTVRSLIIMGGWPIMIGLL 292
+I GE ++ + + + I +L + + +R R +++G G +
Sbjct: 233 WVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYI 292

Query: 293 VAAAATVISSHAYLWMTAGLSIYAFGIGL 321
+ A AT ++ + + + GIG+
Sbjct: 293 LLAFAT----RGWMAFPIMVLLASGGIGM 317


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0741TCRTETB340.001 Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 33.7 bits (77), Expect = 0.001
Identities = 34/150 (22%), Positives = 65/150 (43%), Gaps = 6/150 (4%)

Query: 218 LLIGVVVLAMAFAEGSANDWL-PLLMVDGHGFSP-TSGSLIYAGFTLGMTVGRFTGGWFI 275
+IGV+ + F + + P +M D H S GS+I T+ + + + GG +
Sbjct: 258 FMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILV 317

Query: 276 DRYSRVAVVR-ASALM--GALGIGLIIFVDSAWVA-GVSVVLWGLGASLGFPLTISAASD 331
DR + V+ + L ++ S ++ + VL GL + TI ++S
Sbjct: 318 DRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSL 377

Query: 332 TGPDAPTRVSVVATTGYLAFLVGPPLLGYL 361
+A +S++ T +L+ G ++G L
Sbjct: 378 KQQEAGAGMSLLNFTSFLSEGTGIAIVGGL 407


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0742HTHTETR506e-10 TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 49.6 bits (118), Expect = 6e-10
Identities = 14/83 (16%), Positives = 30/83 (36%), Gaps = 4/83 (4%)

Query: 2 RRANDPQRREKIIQATLEAVKLYGIHAVTHRKIATLAGVPLGSMTYYFSGIDELLLEAFS 61
+ + R+ I+ L G+ + + +IA AGV G++ ++F +L E +
Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIW- 63

Query: 62 SFTEIMSRQYQAFFSDVSDAQGA 84
E+ +
Sbjct: 64 ---ELSESNIGELELEYQAKFPG 83


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0752DNABINDNGFIS320.002 DNA-binding protein FIS signature.
		>DNABINDNGFIS#DNA-binding protein FIS signature.

Length = 98

Score = 31.9 bits (72), Expect = 0.002
Identities = 22/76 (28%), Positives = 39/76 (51%), Gaps = 9/76 (11%)

Query: 100 QQRFNPDMVILADVNAQPSHISKPLMQRIE-----YFSSL-GRP--KAYSRYLRETIKPC 151
+QR N D++ ++ VN+Q KPL ++ YF+ L G+ Y L E +P
Sbjct: 3 EQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQPL 62

Query: 152 LER-LEHVRDSQLSAS 166
L+ +++ R +Q A+
Sbjct: 63 LDMVMQYTRGNQTRAA 78


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_076760KDINNERMP280.017 60kDa inner membrane protein signature.
		>60KDINNERMP#60kDa inner membrane protein signature.

Length = 548

Score = 27.6 bits (61), Expect = 0.017
Identities = 22/80 (27%), Positives = 32/80 (40%), Gaps = 14/80 (17%)

Query: 3 RLLLVVLALLLAALGWQTWR-------LADASQTISTQADELQSKSQALAKSNSQLIS-- 53
R LLV+ L ++ + WQ W A + +T A + A +LIS
Sbjct: 5 RNLLVIALLFVSFMIWQAWEQDKNPQPQAQQTTQTTTTAAGSAADQGVPASGQGKLISVK 64

Query: 54 -----LSILTETNNREQARL 68
L+I T + EQA L
Sbjct: 65 TDVLDLTINTRGGDVEQALL 84


12SBO_0845SBO_0866Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_0845-1223.110066ATP phosphoribosyltransferase
SBO_08460222.897642histidinol dehydrogenase
SBO_08470251.988350histidinol-phosphate aminotransferase
SBO_0848-2170.174623imidazole glycerol-phosphate
SBO_0849-1140.272369imidazole glycerol phosphate synthase subunit
SBO_0850-114-0.0339971-(5-phosphoribosyl)-5-[(5-
SBO_0851115-2.146893imidazole glycerol phosphate synthase subunit
SBO_0852018-4.564804bifunctional phosphoribosyl-AMP
SBO_0853021-4.585275regulator of length of O-antigen component of
SBO_0854025-7.663931IS629 ORF2
SBO_0855137-12.945173IS629 ORF1
SBO_0856241-15.178855nucleotide sugar epimerase
SBO_0857346-16.824636UDP-glucose 6-dehydrogenase
SBO_0858551-19.3231346-phosphogluconate dehydrogenase
SBO_0859765-23.626247glycosyl transferase family protein
SBO_0860764-23.203687O-antigen polymerase
SBO_0861656-20.730298glycosyl transferase family protein
SBO_0862552-17.874056O-antigen flippase
SBO_0863346-14.801546glycosyl transferase family protein
SBO_0864135-10.098888glycosyl transferase family protein
SBO_0865-222-6.590884dTDP-4-dehydrorhamnose 3,5-epimerase
SBO_0866-217-4.361287glucose-1-phosphate thymidylyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0856NUCEPIMERASE2384e-82 Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 238 bits (610), Expect = 4e-82
Identities = 126/145 (86%), Positives = 135/145 (93%)

Query: 1 MALFKFTKAMLEGKSIDVYNFGKMKRDFTYIDDIAEAIIRLQDVIPEKNPQWAVETGSPA 60
MALFKFTKAMLEGKSIDVYN+GKMKRDFTYIDDIAEAIIRLQDVIP + QW VETG+PA
Sbjct: 190 MALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHADTQWTVETGTPA 249

Query: 61 TSSAPYRVYNIGNSSPVELMDYINALEEALGIEANKNMMPLQPGDVLETSADTKALYDVI 120
S APYRVYNIGNSSPVELMDYI ALE+ALGIEA KNM+PLQPGDVLETSADTKALY+VI
Sbjct: 250 ASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQPGDVLETSADTKALYEVI 309

Query: 121 GFKPETSVKEGVKNFVEWYRNFYKV 145
GF PET+VK+GVKNFV WYR+FYKV
Sbjct: 310 GFTPETTVKDGVKNFVNWYRDFYKV 334


13SBO_0899SBO_0919Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_0899-2153.139805hypothetical protein
SBO_0900-2153.185694multidrug efflux system subunit MdtA
SBO_0901-1162.962821multidrug efflux system subunit MdtB
SBO_0902-1120.903799multidrug efflux system subunit MdtC
SBO_0903-210-0.292461multidrug efflux system protein MdtE
SBO_0904-211-2.101963signal transduction histidine-protein kinase
SBO_0905-114-3.759064DNA-binding transcriptional regulator BaeR
SBO_0906017-4.513700hypothetical protein
SBO_0907216-3.816899hypothetical protein
SBO_0908215-3.227897hypothetical protein
SBO_0909324-5.500027hypothetical protein
SBO_0911320-3.708284galactitol utilization operon repressor
SBO_0912219-2.811593galactitol-1-phosphate dehydrogenase
SBO_0913214-2.516056PTS system galactitol-specific transporter
SBO_0914114-1.544221PTS system galactitol-specific transporter
SBO_0915113-0.063961PTS system galactitol-specific transporter
SBO_09172130.567678tagatose-bisphosphate aldolase
SBO_0918312-0.193142fructose-bisphosphate aldolase
SBO_09192130.401066nucleoside permease
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0900RTXTOXIND493e-08 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 48.7 bits (116), Expect = 3e-08
Identities = 46/369 (12%), Positives = 105/369 (28%), Gaps = 87/369 (23%)

Query: 4 SYKSRWVIVIVVVIAAIAAFWFWQGRNDSRSAAPG-----ATKQAQQSPAGGRRG---MR 55
S + R V ++ IA G+ + + A G + + ++
Sbjct: 54 SRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVK 113

Query: 56 SG-------PLA---PVQAATAVEQAVPRYLTGLGTIIAANTVTVRSRVDG--QLMALHF 103
G L + A + L ++ ++ +L
Sbjct: 114 EGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDE 173

Query: 104 QEGQQVKAGDLLAEI------------DPSQFKVALAQTQGQLA-------KDKATLANA 144
Q V ++L Q ++ L + + + + +
Sbjct: 174 PYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVE 233

Query: 145 RRDLARYQQLAKTNLVSRQELDAQQALVSETEGTIKADEASVA----------------- 187
+ L + L +++ + Q+ E ++ ++ +
Sbjct: 234 KSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVT 293

Query: 188 --------------------------SAQLQLDWSRITAPVDGRV-GLKQVDVGNQISSG 220
+ + S I APV +V LK G +++
Sbjct: 294 QLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTA 353

Query: 221 DTTGIVVITQTHPIDLVFTLPESDIATVVQAQKAGKPLVVEAWDRTNSKKL-SEGTLLSL 279
+T +V++ + +++ + DI + Q A + VEA+ T L + ++L
Sbjct: 354 ETL-MVIVPEDDTLEVTALVQNKDIGFINVGQNA--IIKVEAFPYTRYGYLVGKVKNINL 410

Query: 280 DNQIDATTG 288
D D G
Sbjct: 411 DAIEDQRLG 419


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0901ACRIFLAVINRP9140.0 Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 914 bits (2364), Expect = 0.0
Identities = 299/1036 (28%), Positives = 514/1036 (49%), Gaps = 29/1036 (2%)

Query: 13 SRLFIMRPVATTLLMVAILLAGIIGYRALPVSALPEVDYPTIQVVTLYPGASPDVMTSAV 72
+ FI RP+ +L + +++AG + LPV+ P + P + V YPGA + V
Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61

Query: 73 TAPLERQFGQMSGLKQMSSQS-SGGASVITLQFQLTLPLDVAEQEVQAAINAATNLLPSD 131
T +E+ + L MSS S S G+ ITL FQ D+A+ +VQ + AT LLP +
Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121

Query: 132 LPNPPVYSKVNPADPPIMTLAVTSTAMPMTQVE--DMVETRVAQKISQISGVGLVTLSGG 189
+ + S + +M S TQ + D V + V +S+++GVG V L G
Sbjct: 122 VQQQGI-SVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 190 QRPAVRVKLNAQAIAALGLTSETVRTAITGANVNSAKGSLDGP------SRAVTLSANDQ 243
Q A+R+ L+A + LT V + N A G L G ++ A +
Sbjct: 181 QY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 244 MQSAEEYRQLII-AYQNGAPIRLGDVATVEQGAENSWLGAWANKEQAIVMNVQRQPGANI 302
++ EE+ ++ + +G+ +RL DVA VE G EN + A N + A + ++ GAN
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 303 ISTADSIRQMLPQLTESLPKSVKVTVLSDRTTNIRASVDDTQFELMMAIALVVMIIYLFL 362
+ TA +I+ L +L P+ +KV D T ++ S+ + L AI LV +++YLFL
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 363 RNIPATIIPGVAVPLSLIGTFAVMVFLDFSINNLTLMALTIATGFVVDDAIVVIENISRY 422
+N+ AT+IP +AVP+ L+GTFA++ +SIN LT+ + +A G +VDDAIVV+EN+ R
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 423 I-EKGEKPLAAALKGAGEIGFTIISLIFSLIAVLIPLLFMGDIVGRLFREFAITLAVAIL 481
+ E P A K +I ++ + L AV IP+ F G G ++R+F+IT+ A+
Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479

Query: 482 ISAVVSLTLTPMMCARML---SQESLRKQNRFSRASEKMFDRIIAAYGRGLAKVLNHPWL 538
+S +V+L LTP +CA +L S E + F FD + Y + K+L
Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 539 TLSVALSTLLLSVLLWVFIPKGFFPVQDNGIIQGTLQAPQSSSFANMAQRQRQVADVILQ 598
L + + V+L++ +P F P +D G+ +Q P ++ + QV D L+
Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 599 DPA--VQSLTSFVGVDGTNPSLNSARLQINLKPLDERDDR---VQKVIARLQTAVDKVPG 653
+ V+S+ + G + + N+ ++LKP +ER+ + VI R + + K+
Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIR- 658

Query: 654 VDLFLQPTQDLTIDTQVSRTQYQFTLQ---ATSLDALSTWVPQLMEKLQQLP-QLSDVSS 709
D F+ P I + T + F L DAL+ QL+ Q P L V
Sbjct: 659 -DGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRP 717

Query: 710 DWQDKGLVAYVNVDRDSASRLGISMADVDNALYSAFGQRLISTIYTQANQYRVVLEHNTE 769
+ + + VD++ A LG+S++D++ + +A G ++ + ++ ++ + +
Sbjct: 718 NGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAK 777

Query: 770 ITPGLAALDTIRLTSSDGGVVPLSSIAKVEQRFAPLSINHLDQFPVTTISFNVPDNYSLG 829
+D + + S++G +VP S+ + + + P I S G
Sbjct: 778 FRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSG 837

Query: 830 DAVQAIMDTEKTLNLPVDITTQFQGSTLAFQSALGSTVWLIVAAVVAMYIVLGILYESFI 889
DA A+M+ + LP I + G + + + L+ + V +++ L LYES+
Sbjct: 838 DA-MALMENLAS-KLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWS 895

Query: 890 HPITILSTLPTAGVGALLALMIAGSELDVIAIIGIILLIGIVKKNAIMMIDFALAAEREQ 949
P++++ +P VG LLA + + DV ++G++ IG+ KNAI++++FA ++
Sbjct: 896 IPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKE 955

Query: 950 GMSPRDAIYQACLLRFRPILMTTLAALLGALPLMLSTGVGAELRRPLGIGMVGGLIVSQV 1009
G +A A +R RPILMT+LA +LG LPL +S G G+ + +GIG++GG++ + +
Sbjct: 956 GKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATL 1015

Query: 1010 LTLFTTPVIYLLFDRL 1025
L +F PV +++ R
Sbjct: 1016 LAIFFVPVFFVVIRRC 1031


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0902ACRIFLAVINRP9210.0 Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 921 bits (2382), Expect = 0.0
Identities = 289/1035 (27%), Positives = 506/1035 (48%), Gaps = 36/1035 (3%)

Query: 6 LFIYRPVATILLSVAITLCGILGFRMLPVAPLPQVDFPVIMVSASLPGASPETMASSVAT 65
FI RP+ +L++ + + G L LPVA P + P + VSA+ PGA +T+ +V
Sbjct: 4 FFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQ 63

Query: 66 PLERSLGRIAGVSEMTSSS-SLGSTRIILQFDFDRDINGAARDVQAAINAAQSLLPSGMP 124
+E+++ I + M+S+S S GS I L F D + A VQ + A LLP +
Sbjct: 64 VIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQ 123

Query: 125 SRPTYRKANPSDAPIMILTLTSDT--YSQGKLYDFASTQLAPTISQIDGVGDVDVGGSSL 182
+ S + +M+ SD +Q + D+ ++ + T+S+++GVGDV + G+
Sbjct: 124 -QQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY 182

Query: 183 PAVRVGLNPQALFNQGVSLDDVRTAISNANVRKPQG------ALEDGTHRWQIQTNDELK 236
A+R+ L+ L ++ DV + N + G AL I K
Sbjct: 183 -AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241

Query: 237 TAAEYQPLIIHYN-NGGAVRLGDVATVTDSVQDVRNAGMTNAKPAILLMIRKLPEANIIQ 295
E+ + + N +G VRL DVA V ++ N KPA L I+ AN +
Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301

Query: 296 TVDSIRAKLPELQETIPAAIDLQIAQDRSPTIRASLEEVEQTLIISVALVILVVFLFLRS 355
T +I+AKL ELQ P + + D +P ++ S+ EV +TL ++ LV LV++LFL++
Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361

Query: 356 GRATIIPAVAVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVVDDAIVVLENIARHL- 414
RAT+IP +AVPV L+GTFA + G+S+N L++ + +A G +VDDAIVV+EN+ R +
Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421

Query: 415 EAGMKPLQAALQGTREVGFTVLSMSLSLVAVFLPLLLMGGLPGRLLREFAVTLSVAIGIS 474
E + P +A + ++ ++ +++ L AVF+P+ GG G + R+F++T+ A+ +S
Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481

Query: 475 LLVSLTLTPMMCGWMLKASKPREQKRLRGFG----RMLVALQQGYGKSLKWVLNHTRLVG 530
+LV+L LTP +C +LK + GF Y S+ +L T
Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541

Query: 531 VVLLGTIALNIWLYISIPKTFFPEQDTGVLMGGIQADQSISFQ----AMRGKLQDFMKII 586
++ +A + L++ +P +F PE+D GV + IQ + + + ++K
Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601

Query: 587 RD-DPAVDNVTGFT-GGSRVNSGMMFITLKPRDERS---ETAQQIIDRLRVKLAKEPGAN 641
+ +V V GF+ G N+GM F++LKP +ER+ +A+ +I R +++L K
Sbjct: 602 KANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGF 661

Query: 642 LFLMAVQDIRVGGRQSNASYQYTLLSDDLAALREWEPKIRKKLATL-----PELADVNSD 696
+ + I G + ++ L D + + R +L + L V +
Sbjct: 662 VIPFNMPAIVELGTATGFDFE---LIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPN 718

Query: 697 QQDNGAEMNLVYDRDTMARLGIDVQAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRY 756
++ A+ L D++ LG+ + N ++ A G ++ K+ ++ D ++
Sbjct: 719 GLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKF 778

Query: 757 TQDISALEKMFVINNEGKAIPLSYFAKWQPANAPLSVNHQGLSAASTISFNLPTGKSLSD 816
++K++V + G+ +P S F + + I G S D
Sbjct: 779 RMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGD 838

Query: 817 ASAAIDRAMTQLGVPSTVRGSFAGTAQVFQETMNSQVILIIAAIATVYIVLGILYESYVH 876
A A ++ ++L P+ + + G + + + N L+ + V++ L LYES+
Sbjct: 839 AMALMENLASKL--PAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSI 896

Query: 877 PLTILSTLPSAGVGALLALELFNAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRHGN 936
P++++ +P VG LLA LFN + ++G++ IG+ KNAI++V+FA +
Sbjct: 897 PVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEG 956

Query: 937 LTPQEAIFQACLLRFRPIMMTTLAALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQLL 996
EA A +R RPI+MT+LA + G LPL +S G GS + +GI ++GG+V + LL
Sbjct: 957 KGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLL 1016

Query: 997 TLYTTPVVYLFFDRL 1011
++ PV ++ R
Sbjct: 1017 AIFFVPVFFVVIRRC 1031



Score = 79.9 bits (197), Expect = 3e-17
Identities = 77/446 (17%), Positives = 162/446 (36%), Gaps = 26/446 (5%)

Query: 592 VDNVTGFTGGS-RVNSGMMFITLKPRDERSETAQQIIDRLRVKLAKEPGANLFLMAVQDI 650
+DN+ + S S + +T + + Q+ ++L++ P + Q I
Sbjct: 72 IDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE----VQQQGI 127

Query: 651 RVGGRQSNASYQYTLLSDDLAALREW-----EPKIRKKLATLPELADVNSDQQDNGAE-- 703
V S+ +SD+ ++ ++ L+ L + DV GA+
Sbjct: 128 SVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQL----FGAQYA 183

Query: 704 MNLVYDRDTMARLGID----VQAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRYTQD 759
M + D D + + + + + + T P Q + R+
Sbjct: 184 MRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKNP 243

Query: 760 ISALEKMFVINNEGKAIPLSYFAK--WQPANAPLSVNHQGLSAASTISFNLPTGKSLSDA 817
+ +N++G + L A+ N + G AA +L D
Sbjct: 244 EEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL-DT 302

Query: 818 SAAIDRAMTQL--GVPSTVRGSFA-GTAQVFQETMNSQVILIIAAIATVYIVLGILYESY 874
+ AI + +L P ++ + T Q +++ V + AI V++V+ + ++
Sbjct: 303 AKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNM 362

Query: 875 VHPLTILSTLPSAGVGALLALELFNAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRH 934
L +P +G L F + + + G++L IG++ +AI++V+
Sbjct: 363 RATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMME 422

Query: 935 GNLTPQEAIFQACLLRFRPIMMTTLAALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQ 994
L P+EA ++ ++ + +P+ GG + + ITIV + +S
Sbjct: 423 DKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSV 482

Query: 995 LLTLYTTPVVYLFFDRLRLRFSRKPK 1020
L+ L TP + + + K
Sbjct: 483 LVALILTPALCATLLKPVSAEHHENK 508


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0903TCRTETB1221e-32 Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 122 bits (308), Expect = 1e-32
Identities = 97/435 (22%), Positives = 190/435 (43%), Gaps = 25/435 (5%)

Query: 20 FMQSLDTTIVNTALPSMAQSLGESPLHMHMVIVSYVLTVAVMLPASGWLADKVGVRNIFF 79
F L+ ++N +LP +A + P + V +++LT ++ G L+D++G++ +
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 80 TAIVLFTLGSLFCALSGTLNELL-LARALQGVGGAMMVPVGRLTVMKIVPREQYMAAMTF 138
I++ GS+ + + LL +AR +QG G A + + V + +P+E A
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 139 VTLPGQVGPLLGPALGGLLVEYASWHWIFLINIPVGIIGAIATLL-LMPNYTMQTWRFDL 197
+ +G +GPA+GG++ Y HW +L+ IP+ I + L+ L+ FD+
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHY--IHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDI 201

Query: 198 SGFLLLAVGMAVLTLALDGSKGTGLSPLAIAGLVAVGVVALVLYLLHARNNNRALFSLKL 257
G +L++VG+ L + + V V++ ++++ H R L
Sbjct: 202 KGIILMSVGIVFFMLF---------TTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGL 252

Query: 258 FRTRTFSLGLAGSFAGRIGSGMLPFMTPVFLQIGLGFSPFHAG-LMMIPMVLGSMGMKRI 316
+ F +G+ M P ++ S G +++ P + + I
Sbjct: 253 GKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYI 312

Query: 317 VVQVVNRFGYRRVLVATTLGLSLVTLLFMTTALL----GWYYVLPFVLFLQGMVNSTRFS 372
+V+R G VL +G++ +++ F+T + L W+ + V L G+ S +
Sbjct: 313 GGILVDRRGPLYVL---NIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGL--SFTKT 367

Query: 373 SMNTLTLKDLPDNLASSGNSLLSMIMQLSMSIGVTIAGLLLGLFGSQHVSVDSGTTQTVF 432
++T+ L A +G SLL+ LS G+ I G LL + + Q+ +
Sbjct: 368 VISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQRLLPMEVDQSTY 427

Query: 433 MYT--WLSMALIIAL 445
+Y+ L + II +
Sbjct: 428 LYSNLLLLFSGIIVI 442


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0904BCTERIALGSPF310.009 Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F

signature.
Length = 408

Score = 31.0 bits (70), Expect = 0.009
Identities = 27/95 (28%), Positives = 35/95 (36%), Gaps = 20/95 (21%)

Query: 164 RQTSWLIVALATLLAALATFLLA------RGLLAPVKRLVDGTHKLAAGDFTTRVTPTSE 217
RQ + L+ A L AL L+A V+ V H LA + P S
Sbjct: 75 RQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLAD---AMKCFPGSF 131

Query: 218 DEL-----------GKLAQDFNQLASTLEKNQQMR 241
+ L G L N+LA E+ QQMR
Sbjct: 132 ERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMR 166


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0905HTHFIS766e-18 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 75.6 bits (186), Expect = 6e-18
Identities = 28/136 (20%), Positives = 65/136 (47%), Gaps = 1/136 (0%)

Query: 11 PRILIVEDEPKLGQLLIDYLRAASYAPTLISHGDQVLPYVRQTPPDLILLDLMLPGTDGL 70
IL+ +D+ + +L L A Y + S+ + ++ DL++ D+++P +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 71 TLCREIR-RFSDIPIVMVTAKIEEIDRLLGLEIGADDYICKPYSPREVVARVKTILRRCK 129
L I+ D+P+++++A+ + + E GA DY+ KP+ E++ + L K
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 130 PQRELQQQDAESPLII 145
+ + D++ + +
Sbjct: 124 RRPSKLEDDSQDGMPL 139


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0909LIPOLPP20260.024 LPP20 lipoprotein precursor signature.
		>LIPOLPP20#LPP20 lipoprotein precursor signature.

Length = 175

Score = 26.3 bits (57), Expect = 0.024
Identities = 13/38 (34%), Positives = 24/38 (63%), Gaps = 1/38 (2%)

Query: 3 KGEMKKIAAISLISIFIMSGCAVHNDETSIGKFGLAYK 40
K ++KKI +S+++ ++ GC+ H ++ I K AYK
Sbjct: 2 KNQVKKILGMSVVAAMVIVGCS-HAPKSGISKSNKAYK 38


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0912DHBDHDRGNASE353e-04 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 35.0 bits (80), Expect = 3e-04
Identities = 22/92 (23%), Positives = 36/92 (39%), Gaps = 2/92 (2%)

Query: 156 AQGCENKNVIIIGAGT-IGLLAIQCAVALGAKSVTAIDISSEKLALAKSFGSMQTFNSSE 214
A+G E K I GA IG + + GA + A+D + EKL S + ++
Sbjct: 3 AKGIEGKIAFITGAAQGIGEAVARTLASQGAH-IAAVDYNPEKLEKVVSSLKAEARHAEA 61

Query: 215 MSAPQMQSVLRELRFNQLILETAGVPQTVELA 246
A S + ++ E + V +A
Sbjct: 62 FPADVRDSAAIDEITARIEREMGPIDILVNVA 93


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0919TCRTETA356e-04 Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 34.8 bits (80), Expect = 6e-04
Identities = 52/268 (19%), Positives = 88/268 (32%), Gaps = 17/268 (6%)

Query: 29 LSKSGFSAGEIGWSYACTAIAAILSPILVGSITDRFFSAQKVLAVLMFAGALLMYFAAQQ 88
L S G A A+ ++G+++DRF ++ + ++ AGA + Y
Sbjct: 35 LVHSNDVTAHYGILLALYALMQFACAPVLGALSDRF--GRRPVLLVSLAGAAVDYAI--- 89

Query: 89 TTFAGFFPLLLAYSLTYMPTIALTNSIAFANVPDVERDFPRIRVMGTIG-WIASGLACGF 147
A F +L + T A T ++A A + D+ R R G + G+ G
Sbjct: 90 MATAPFLWVLYIGRIVAGITGA-TGAVAGAYIADITDGDERARHFGFMSACFGFGMVAG- 147

Query: 148 LPQILGY-ADISPTNIPLLITAGSSALLGVFAFFLPDTPPKSTGKMDIKVMLGLDALILL 206
P + G SP + P A + L + FL K + + L A
Sbjct: 148 -PVLGGLMGGFSP-HAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRW 205

Query: 207 RDKN------FLVFFFCSFLFAMPLVFYYIFANGYLTEVGMKNATGWMTLGQFSEIFFML 260
VFF + +P + IF G + +
Sbjct: 206 ARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAM 265

Query: 261 ALPFFTKRFGIKKVLLLGLVIAAIRYGF 288
R G ++ L+LG++ Y
Sbjct: 266 ITGPVAARLGERRALMLGMIADGTGYIL 293



Score = 33.3 bits (76), Expect = 0.002
Identities = 27/110 (24%), Positives = 43/110 (39%), Gaps = 7/110 (6%)

Query: 253 FSEIFFMLALPFFTKRFGIKKVLLLGLVIAAIRYGFFIYGSADEYFTYALLFLGILLHGV 312
+ L + RFG + VLL+ L AA+ Y +L++G ++ G+
Sbjct: 54 LMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAP-----FLWVLYIGRIVAGI 108

Query: 313 SYDFYYVTAYIYVDKKAPVHMRTAAQGLITLCCQGFGSLLGYRLGGVMME 362
+ V D R G ++ C GFG + G LGG+M
Sbjct: 109 TGATGAVAGAYIAD-ITDGDERARHFGFMS-ACFGFGMVAGPVLGGLMGG 156


14SBO_0929SBO_0987Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_09292210.626150methionyl-tRNA synthetase
SBO_09311221.182214IS600 ORF2
SBO_09322170.067800IS600 ORF1
SBO_0934117-0.388957IS1 ORF2
SBO_0935319-0.686276IS1 encoded protein
SBO_0936217-0.632986IS911 ORF2
SBO_0937217-0.879460hypothetical protein
SBO_0938118-0.461211regulator
SBO_09390180.661318hypothetical protein
SBO_09401191.922440IS1 encoded protein
SBO_09411181.401580IS1 ORF2
SBO_09422191.144940hypothetical protein
SBO_09432180.838394hypothetical protein
SBO_0944016-0.592833hypothetical protein
SBO_0945016-1.473094hypothetical protein
SBO_0946-215-3.722692hypothetical protein
SBO_0947-217-4.263005hypothetical protein
SBO_0948121-4.358576two-component response-regulatory protein YehT
SBO_0949221-4.0024062-component sensor protein
SBO_0950628-3.283939DNA damage-inducible protein
SBO_0951528-2.984612hypothetical protein
SBO_0952327-3.239714hypothetical protein
SBO_0953324-2.145039invasion plasmid antigen
SBO_0954221-0.858187prophage tail protein
SBO_0955122-1.583074tail fiber protein
SBO_0956222-2.618825tail fiber assembly protein
SBO_0957124-1.247797hypothetical protein
SBO_0958225-0.511170tail fiber protein
SBO_0959127-0.959751tail protein
SBO_0960028-2.306660IS600 ORF1
SBO_0961030-1.731350IS600 ORF2
SBO_0962125-1.673558hypothetical protein
SBO_0963030-5.326420RelF
SBO_0964028-4.810218hypothetical protein
SBO_0965-127-3.477425hypothetical protein
SBO_0966026-0.918012IS1 encoded protein
SBO_0967-122-0.856374IS1 ORF2
SBO_0968225-2.606018hypothetical protein
SBO_0969124-1.159704hypothetical protein
SBO_0970125-1.513084bacteriophage protein
SBO_0971120-1.980790hypothetical protein
SBO_0972224-3.889870hypothetical protein
SBO_0973225-4.666806transcriptional regulator
SBO_0974123-3.574579IS1 ORF2
SBO_0975122-3.122184IS1 encoded protein
SBO_0976018-2.283840exodeoxyribonuclease VIII
SBO_0977-119-2.467791integrase for prophage
SBO_0979117-0.654442*hypothetical protein
SBO_09823170.441109*IS629 ORF1
SBO_09833210.657609IS629 ORF2
SBO_09852180.260932anaerobic dimethyl sulfoxide reductase subunit
SBO_09861220.387518hypothetical protein
SBO_09872241.216997IS911 ORF2
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0946BINARYTOXINA270.030 Clostridial binary toxin A signature.
		>BINARYTOXINA#Clostridial binary toxin A signature.

Length = 454

Score = 27.3 bits (60), Expect = 0.030
Identities = 17/65 (26%), Positives = 30/65 (46%), Gaps = 3/65 (4%)

Query: 29 DKEESKKFSANLNGTEIAITYVYKGDKV-LKQSSETKIQFAS--ISATTKEDAAKTLEPL 85
+K+E+++ NL+ E +YK D + S+T+ F I + +E K L
Sbjct: 61 EKKEAERVEKNLDTLEKEALELYKKDSEQISNYSQTRQYFYDYQIESNPREKEYKNLRNA 120

Query: 86 SAKYK 90
+K K
Sbjct: 121 ISKNK 125


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0948HTHFIS712e-16 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 71.4 bits (175), Expect = 2e-16
Identities = 41/177 (23%), Positives = 77/177 (43%), Gaps = 12/177 (6%)

Query: 7 IKVLIVDDEPLARENLRVFLQEQSDIEIVGECSNAVEGIGAVHKLRPDVLFLDIQMPRIS 66
+L+ DD+ R L L ++ ++ SNA + D++ D+ MP +
Sbjct: 4 ATILVADDDAAIRTVLNQAL-SRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 67 GLEMVGMLDPEHRPYI--VFLTAFD--EYAIKAFEEHAFDYLLKPIDEARLEKTLARLRQ 122
+++ + + RP + + ++A + AIKA E+ A+DYL KP D L + R
Sbjct: 62 AFDLLPRIK-KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 123 ERSKQDVSLLPENQQALKFIPCTGHSRIYLLQMKDVAFVSSRMSGVYVT--SHEGKE 177
E ++ L ++Q + + G S + +A + + +T S GKE
Sbjct: 121 EPKRRPSKLEDDSQDGMPLV---GRSAAMQEIYRVLARLMQTDLTLMITGESGTGKE 174


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0949PF065802207e-69 Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 220 bits (561), Expect = 7e-69
Identities = 63/216 (29%), Positives = 115/216 (53%), Gaps = 3/216 (1%)

Query: 343 LGEGIAQLLSAQILAGQYERQKAMLTQSEIKLLHAQVNPHFLFNALNTIKAVIRRDSEQA 402
L G + + + +M ++++ L AQ+NPHF+FNALN I+A+I D +A
Sbjct: 134 LYFGWHFFKNYKQAEIDQWKMASMAQEAQLMALKAQINPHFMFNALNNIRALILEDPTKA 193

Query: 403 SQLVQYLSTFFRKNLKR-PSEFVTLADEIEHVNAYLQIEKARFQSRLQVNIAIPQELSQQ 461
+++ LS R +L+ + V+LADE+ V++YLQ+ +F+ RLQ I +
Sbjct: 194 REMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDV 253

Query: 462 QLPAFTLQPIVENAIKHGTSQLLDTGRVAISARREGQHLMLEIEDNAGL-YQPVTNASGL 520
Q+P +Q +VEN IKHG +QL G++ + ++ + LE+E+ L + ++G
Sbjct: 254 QVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGT 313

Query: 521 GMNLVDKRLRERFGDDYGISVACEPDSYTRITLQLP 556
G+ V +RL+ +G + I ++ + + +P
Sbjct: 314 GLQNVRERLQMLYGTEAQIKLSEKQGKVN-AMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0952LUXSPROTEIN310.001 Bacterial autoinducer-2 (AI-2) production protein Lu...
		>LUXSPROTEIN#Bacterial autoinducer-2 (AI-2) production protein LuxS

signature.
Length = 171

Score = 31.4 bits (71), Expect = 0.001
Identities = 18/66 (27%), Positives = 30/66 (45%), Gaps = 7/66 (10%)

Query: 37 TKEHLLPHFL-EHLGNNHLDI------GVGTGFYLTHVPESSLISLMDLNEASLNAASTR 89
T EHL F+ HL + ++I G TGFY++ + S + D A++
Sbjct: 54 TLEHLYAGFMRNHLNGDSVEIIDISPMGCRTGFYMSLIGTPSEQQVADAWIAAMEDVLKV 113

Query: 90 AGESKI 95
++KI
Sbjct: 114 ENQNKI 119


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0963HOKGEFTOXIC615e-17 Hok/Gef cell toxic protein family signature.
		>HOKGEFTOXIC#Hok/Gef cell toxic protein family signature.

Length = 52

Score = 60.6 bits (147), Expect = 5e-17
Identities = 19/46 (41%), Positives = 32/46 (69%)

Query: 4 QKAMLIALIVICLTVIVTALVTRKDLCEVRIRTGQTEVAVFTAYEP 49
+ +++ ++++CLT+++ +TRK LCE+R R G EVA F AYE
Sbjct: 5 RSSLVWCVLIVCLTLLIFTYLTRKSLCEIRYRDGYREVAAFMAYES 50


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0986ISCHRISMTASE365e-05 Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 36.1 bits (83), Expect = 5e-05
Identities = 15/57 (26%), Positives = 26/57 (45%)

Query: 74 NAWDNEDFVKAVKATGKKQLIIAGVVTEVCVAFPALSAIEEGFDVFVVTDASGTFNE 130
+A+ + ++ ++ G+ QLII G+ + A A E F V DA F+
Sbjct: 127 SAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVADFSL 183


15SBO_1013SBO_1055Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_10133182.776038transport system permease
SBO_10141172.518299transport system permease
SBO_10150171.594308ABC transporter ATP-binding protein
SBO_10162190.858908transport system permease
SBO_1017323-0.554967transcriptional regulator
SBO_10182210.081194IS600 ORF1
SBO_10194251.509086IS600 ORF2
SBO_10204241.749962hypothetical protein
SBO_1021424-0.226957IS1 ORF2
SBO_1022425-1.362800IS1 encoded protein
SBO_1023425-1.384456membrane protein
SBO_1024426-1.803256tail fiber protein
SBO_1025426-3.724047prophage tail protein
SBO_1026427-3.146757invasion plasmid antigen
SBO_1027227-4.034294hypothetical protein
SBO_1028124-5.114884IS1 encoded protein
SBO_1029121-4.827246IS1 ORF2
SBO_1030226-6.277871hypothetical protein
SBO_1031232-8.247275IS911 ORF2
SBO_1032136-9.247267cytochrome
SBO_1033238-8.719255hypothetical protein
SBO_1034135-8.126851sulfite oxidase subunit YedZ
SBO_1037134-7.873722transcriptional regulatory protein YedW
SBO_1038228-5.6789342-component sensor protein
SBO_1039525-1.333140IS1 ORF2
SBO_1040426-1.937104IS1 encoded protein
SBO_1041020-0.983341outer membrane porin protein C precurser
SBO_1042-2160.479258IS2 ORF2
SBO_1043-1160.839924IS911 ORF2
SBO_10442170.519729IS2 ORF2
SBO_10452180.721565hypothetical protein
SBO_10463180.974711hypothetical protein
SBO_10472170.657783DNA cytosine methylase
SBO_10480180.999167DNA mismatch endonuclease
SBO_1049119-1.948450hypothetical protein
SBO_1050020-3.511699hypothetical protein
SBO_1051022-4.611989hypothetical protein
SBO_1053020-3.968020mannosyl-3-phosphoglycerate phosphatase
SBO_1055119-4.214231hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1023ENTEROVIROMP1467e-47 Enterobacterial virulence outer membrane protein si...
		>ENTEROVIROMP#Enterobacterial virulence outer membrane protein

signature.
Length = 171

Score = 146 bits (369), Expect = 7e-47
Identities = 66/201 (32%), Positives = 102/201 (50%), Gaps = 32/201 (15%)

Query: 1 MRK-VCAAILSAAICLAVSGAPAWASEHQSTLSAGYLQPHTDMPGSDDLKGINVKYRYEF 59
M+K C + L+A LA + + A+ ST++ GY Q + + G N+KYRYE
Sbjct: 1 MKKIACLSALAA--VLAFTAGTSVAA--TSTVTGGYAQSDAQGQMNK-MGGFNLKYRYEE 55

Query: 60 TDT-LGLVTSFSYANAEDEQKTHYSDTRWHEDSVRNRWFSVMAGPSVRVNEWFSAYAMAG 118
++ LG++ SF+Y T S T D +N+++ + AGP+ R+N+W S Y + G
Sbjct: 56 DNSPLGVIGSFTY--------TEKSRTASSGDYNKNQYYGITAGPAYRINDWASIYGVVG 107

Query: 119 VAYSRVSTFSGDYLRVTDNKGKTHDVLTGSDDNRHSNTSLAWGAGVQFNPTESVAIDLAY 178
V Y + T T+ HD S+ ++GAG+QFNP E+VA+D +Y
Sbjct: 108 VGYGKFQT--------TEYPTYKHD---------TSDYGFSYGAGLQFNPMENVALDFSY 150

Query: 179 EGSGSGDWRTDGFIVGVGYKF 199
E S +I GVGY+F
Sbjct: 151 EQSRIRSVDVGTWIAGVGYRF 171


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1024FLAGELLIN320.007 Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 31.6 bits (71), Expect = 0.007
Identities = 18/107 (16%), Positives = 36/107 (33%), Gaps = 3/107 (2%)

Query: 110 VVQAQQSAGAAAGNAQQTAQDVAAAATARDDAQRFAENARQDATVTAEDRKATAEDVTST 169
VV Q + N D+ A + +++ A A + + +
Sbjct: 337 VVNGQFTFDDKTKNESAKLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFID 396

Query: 170 GANAAAAGQSAQDAAGYARAAEQAKNDIDAALTGTLKMANHLSEIAA 216
+ + +DAA ++ ID+AL+ K+ S + A
Sbjct: 397 KTASGVSTLINEDAAAAKKSTANPLASIDSALS---KVDAVRSSLGA 440


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1037HTHFIS822e-20 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 82.2 bits (203), Expect = 2e-20
Identities = 30/117 (25%), Positives = 60/117 (51%), Gaps = 1/117 (0%)

Query: 2 KILLIEDNQRTQEWVTQGLSEAGYVIDAVSDGRDGLYLALKDDYALIILDIMLPGMDGWQ 61
IL+ +D+ + + Q LS AGY + S+ D L++ D+++P + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 ILQTLRTA-KQTPVICLTARDSVDDRVRGLDSGANDYLVKPFSFSELLARVRAQLRQ 117
+L ++ A PV+ ++A+++ ++ + GA DYL KPF +EL+ + L +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1038PF06580330.002 Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 32.9 bits (75), Expect = 0.002
Identities = 36/181 (19%), Positives = 62/181 (34%), Gaps = 37/181 (20%)

Query: 290 ENILFLARADKNNVLVKLDSLS----------------LNKEVENLLDYL--EYLSDEKE 331
NI L D L SLS L E+ + YL + E
Sbjct: 180 NNIRALILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDR 239

Query: 332 ICFKVECNQQIFADKI---LLQRMLSNLIVNAIRYSPEKSRIHITSFLDTNGYLNIDIAS 388
+ F+ + N I ++ L+Q ++ N I + I P+ +I + D NG + +++ +
Sbjct: 240 LQFENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKD-NGTVTLEVEN 298

Query: 389 LGTKIHEPEKLFRRFWRGDNSRHSVGQGLGLSLVKA-IAELHGGSASYHYLNKHNVFRIT 447
G+ + K G GL V+ + L+G A K
Sbjct: 299 TGSLALKNTKE--------------STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAM 344

Query: 448 L 448
+
Sbjct: 345 V 345


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1041ECOLIPORIN1911e-62 E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 191 bits (486), Expect = 1e-62
Identities = 90/178 (50%), Positives = 108/178 (60%), Gaps = 14/178 (7%)

Query: 1 MSVDYNI-EGFGFVGAYSKSDRTNEQAG----DGYGDNAEVWSLAAKYDANNIYAAMMYG 55
+S Y+I GF AY+ SDRTNEQ GD A+ W+ KYDANNIY A MY
Sbjct: 207 ISTTYDIGMGFSAGAAYTTSDRTNEQVNAGGTIAGGDKADAWTAGLKYDANNIYLATMYS 266

Query: 56 ETRNMTVLA------NDHFANKTQNFEAVVQYQFDFGLRPSLGYVYSKGKDLYARDGHKG 109
ETRNMT + ANKTQNFE QYQFDFGLRP++ ++ SKGKDL + +
Sbjct: 267 ETRNMTPYGKTDKGYDGGVANKTQNFEVTAQYQFDFGLRPAVSFLMSKGKDLTYNNVNGD 326

Query: 110 VDADRVNYIEVGTWYYFNKNMNVYTAYKFNLLDKDDAAITDA--ATDDQFAVGIIYQF 165
D D V Y +VG YYFNKN + Y YK NLLD DD DA +TDD A+G++YQF
Sbjct: 327 -DKDLVKYADVGATYYFNKNFSTYVDYKINLLDDDDPFYKDAGISTDDIVALGMVYQF 383


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1046CARBMTKINASE343e-04 Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 34.4 bits (79), Expect = 3e-04
Identities = 22/92 (23%), Positives = 36/92 (39%), Gaps = 9/92 (9%)

Query: 37 AQKLAADDDVDMLVILTACYFHDIVSLAKNHPQRQRSSILAAEETRRLLREEFEQFPA-- 94
+KLA + + D+ +ILT + +L + Q + EE R+ E F A
Sbjct: 219 GEKLAEEVNADIFMILTDV---NGAALYYGTEKEQWLREVKVEELRKYYEE--GHFKAGS 273

Query: 95 --EKIEAVCHVIAAHSFSAQIAPLTTEAKIVQ 124
K+ A I A IA L + ++
Sbjct: 274 MGPKVLAAIRFIEWGGERAIIAHLEKAVEALE 305


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1047PF05272290.045 Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.3 bits (65), Expect = 0.045
Identities = 20/62 (32%), Positives = 29/62 (46%), Gaps = 15/62 (24%)

Query: 320 AKYILTPVLWKYLYRYAKKHQARGNGFGYGMVYPNNPQSVTRTLSARYYKDGAEILIDRG 379
A+Y + PVLW Y+ R+ K + G+ VY +R +DG+E RG
Sbjct: 166 ARYQVGPVLWGYVVRFIK---SDGDKLTLPYVY------------SRSQRDGSEAWKWRG 210

Query: 380 WD 381
WD
Sbjct: 211 WD 212


16SBO_1132SBO_1150Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_1132222-7.965436hypothetical protein
SBO_1133328-7.662208hypothetical protein
SBO_1134429-7.385799IS911 ORF2
SBO_1135428-6.675592IS911 ORF2
SBO_1136430-6.543596hypothetical protein
SBO_1137225-4.257746hypothetical protein
SBO_11380212.353287hypothetical protein
SBO_1139-1221.384558transcriptional regulator
SBO_1140-1200.445280integrase
SBO_1141020-0.738523IS629 ORF1
SBO_1142118-1.014803IS629 ORF2
SBO_1143118-1.014803hypothetical protein
SBO_1145217-0.015504hypothetical protein
SBO_1146118-0.324395DNA polymerase III subunit theta
SBO_1147216-0.490255hypothetical protein
SBO_1148012-0.444878exodeoxyribonuclease X
SBO_1149-115-0.839890IS629 ORF2
SBO_1150217-2.080349IS629 ORF1
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1143ARGDEIMINASE260.036 Bacterial arginine deiminase signature.
		>ARGDEIMINASE#Bacterial arginine deiminase signature.

Length = 409

Score = 25.9 bits (57), Expect = 0.036
Identities = 10/37 (27%), Positives = 18/37 (48%), Gaps = 5/37 (13%)

Query: 43 EVMLTCRPGNALYVINPSTLVQYPLNDI-----AQKE 74
+ +L RPG L + P + + +DI A++E
Sbjct: 18 KKVLLHRPGEELENLTPFIMKNFLFDDIPYLEVARQE 54


17SBO_1163SBO_1203Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_1163018-3.825352IS1 ORF2
SBO_1164018-3.903233hypothetical protein
SBO_1165121-2.548179hypothetical protein
SBO_1166-125-7.324598hypothetical protein
SBO_1167-124-7.014103hypothetical protein
SBO_1168-220-5.220482IS1 encoded protein
SBO_1169-221-4.915576IS1 ORF2
SBO_1170-221-5.216920IS1 ORF2
SBO_1171-223-5.168746alpha-mannosidase
SBO_1172-216-1.229877lipid A biosynthesis (KDO)2-(lauroyl)-lipid IVA
SBO_1173016-0.989763hypothetical protein
SBO_1174-118-1.561975high-affinity zinc transporter periplasmic
SBO_1175-119-0.862322high-affinity zinc transporter ATPase
SBO_1176-1160.219656high-affinity zinc transporter membrane protein
SBO_1177-2150.710230Holliday junction DNA helicase RuvB
SBO_11782190.554726Holliday junction DNA helicase RuvA
SBO_11794280.835061IS1 encoded protein
SBO_11805281.485363insertion sequence 2 OrfA protein
SBO_11814241.895470IS2 ORF2
SBO_11824241.296131IS2 ORF2
SBO_11833221.047315IS1 ORF2
SBO_11841251.482918IS911 ORF2
SBO_11850231.215446IS911 ORF2
SBO_11861192.017085Holliday junction resolvase
SBO_11872201.768852hypothetical protein
SBO_11882201.471904dATP pyrophosphohydrolase
SBO_11892201.549863aspartyl-tRNA synthetase
SBO_11901280.945935hypothetical protein
SBO_11913262.075574ISSfl2 ORF
SBO_11921240.945494hypothetical protein
SBO_11935253.182079hypothetical protein
SBO_11945274.076779hypothetical protein
SBO_11954284.293731tail fiber protein
SBO_11965294.890310tail component encoded by cryptic prophage
SBO_11975284.778959membrane protein
SBO_11985284.610366host specificity protein
SBO_11994264.082022tail component
SBO_12004273.854673tail assembly protein
SBO_12014273.660798minor tail protein
SBO_12024273.244879tail length tape measure protein
SBO_12032180.128533hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1174ADHESNFAMILY2733e-93 Adhesin family signature.
		>ADHESNFAMILY#Adhesin family signature.

Length = 309

Score = 273 bits (699), Expect = 3e-93
Identities = 63/304 (20%), Positives = 116/304 (38%), Gaps = 25/304 (8%)

Query: 4 KKTLLFAALSAALWGGATQA---------ADAAVVASLKPVGFIASAIADGVTETEVLLP 54
KK L + A VVA+ + I IA + ++P
Sbjct: 2 KKLGTLLVLFLSAIILVACASGKKDTTSGQKLKVVATNSIIADITKNIAGDKIDLHSIVP 61

Query: 55 DGASEHDYSLRPSDVKRLQNADLVVWVGPEMEAFMQKPVSKLLGAKQVTIAQLEDVKLLL 114
G H+Y P DVK+ ADL+ + G +E +KL+ + T E+
Sbjct: 62 IGQDPHEYEPLPEDVKKTSEADLIFYNGINLETGGNAWFTKLVENAKKT----ENKDYFA 117

Query: 115 MKSIHGDDDDHDHAEKSEEDHHHGDFNMHLWLSPEIARATAVAIHGKLVELMPQSRAKLD 174
+ EK +ED H WL+ E A I +L P ++ +
Sbjct: 118 VSDGVDVIYLEGQNEKGKEDPH-------AWLNLENGIIFAKNIAKQLSAKDPNNKEFYE 170

Query: 175 ANLKDFEAQLASTETQVGNELA--PLKGKGYFVFHDAYGYFEKQFGLTPLGHFTVNPEIQ 232
NLK++ +L + + ++ P + K A+ YF K +G+ + +N E +
Sbjct: 171 KNLKEYTDKLDKLDKESKDKFNKIPAEKKLIVTSEGAFKYFSKAYGVPSAYIWEINTEEE 230

Query: 233 PGAQRLHEIRTQLVEQKATCVFAEPQFRPAVVESVARGTSVRMGT---LDPLGTNIKLGK 289
+++ + +L + K +F E +++V++ T++ + D + K G
Sbjct: 231 GTPEQIKTLVEKLRQTKVPSLFVESSVDDRPMKTVSQDTNIPIYAQIFTDSIAEQGKEGD 290

Query: 290 TSYS 293
+ YS
Sbjct: 291 SYYS 294


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1175PF05272300.007 Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.4 bits (68), Expect = 0.007
Identities = 12/31 (38%), Positives = 18/31 (58%), Gaps = 4/31 (12%)

Query: 27 LKPG----KILTLLGPNGAGKSTLVRVVLGL 53
++PG + L G G GKSTL+ ++GL
Sbjct: 589 MEPGCKFDYSVVLEGTGGIGKSTLINTLVGL 619


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1190SECFTRNLCASE260.031 Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 25.6 bits (56), Expect = 0.031
Identities = 11/37 (29%), Positives = 17/37 (45%), Gaps = 1/37 (2%)

Query: 1 MRHDFVFSGVSMLELNAKTTALVVIDLQEGILPFAGG 37
R + G +++ + A +VI L GI F GG
Sbjct: 17 FRWQWATFGAAIVMMIASVILPLVIGLNFGI-DFKGG 52


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1192ISCHRISMTASE583e-13 Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 57.7 bits (139), Expect = 3e-13
Identities = 27/101 (26%), Positives = 46/101 (45%)

Query: 11 AALGATDSDIEIIKRQWGAFYGTDLELQLRRRGIDTIVLCGISTNIGVESTARNAWELGF 70
L D D+ + K ++ AF T+L +R+ G D +++ GI +IG TA A+
Sbjct: 110 TELAPEDDDLVLTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDI 169

Query: 71 NLVIAEDACSAASAEQHNNSINHIYPRIARVRSVEEILHAL 111
DA + S E+H ++ + R A + +L L
Sbjct: 170 KAFFVGDAVADFSLEKHQMALEYAAGRCAFTVMTDSLLDQL 210


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1197ENTEROVIROMP1439e-46 Enterobacterial virulence outer membrane protein si...
		>ENTEROVIROMP#Enterobacterial virulence outer membrane protein

signature.
Length = 171

Score = 143 bits (361), Expect = 9e-46
Identities = 66/201 (32%), Positives = 102/201 (50%), Gaps = 32/201 (15%)

Query: 1 MRK-VCAAILSAAICLAVSGAPAWASEHQSTLSAGYLHARTNAPGSDNLNGINVKYRYEF 59
M+K C + L+A LA + + A+ ST++ GY + + + G N+KYRYE
Sbjct: 1 MKKIACLSALAA--VLAFTAGTSVAA--TSTVTGGYAQSDAQGQMNK-MGGFNLKYRYEE 55

Query: 60 TDA-LGLITSFSYANAEDEKKTHYSDTRWHEDSVRNRWFSVMAGPSVRVNEWFSAYAMAG 118
++ LG+I SF+Y T S T D +N+++ + AGP+ R+N+W S Y + G
Sbjct: 56 DNSPLGVIGSFTY--------TEKSRTASSGDYNKNQYYGITAGPAYRINDWASIYGVVG 107

Query: 119 VAYSRVSTFSGDYLRVTDNKGKTHDVLTGSDDGRHSNTSLAWGAGVQFNPTESVAIDIAY 178
V Y + T T+ HD S+ ++GAG+QFNP E+VA+D +Y
Sbjct: 108 VGYGKFQT--------TEYPTYKHD---------TSDYGFSYGAGLQFNPMENVALDFSY 150

Query: 179 EGSGSGDWRTDGFIVGVGYKF 199
E S +I GVGY+F
Sbjct: 151 EQSRIRSVDVGTWIAGVGYRF 171


18SBO_1251SBO_1264Y        NYGenomic Island
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_1251526-3.488174hypothetical protein
SBO_1252528-3.650377hypothetical protein
SBO_1253527-3.051356serine/threonine protein phosphatase 1
SBO_1254732-2.905825IS600 ORF2
SBO_1255529-2.276730IS600 ORF1
SBO_1256628-1.683501invasion plasmid antigen
SBO_1257524-0.471637IS600 ORF1
SBO_12585230.361509IS600 ORF2
SBO_12595200.882627ISSfl4 ORF1
SBO_12604201.354261IS629 ORF2
SBO_12614211.044230IS629 ORF1
SBO_12625250.348002protein encoded within IS
SBO_1263422-0.298817protein encoded within IS
SBO_1264321-1.160482protein encoded within IS
19SBO_1290SBO_1314Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_1290225-0.964815IS4 orf
SBO_1291119-1.423615IS4 orf
SBO_1292123-4.235054hypothetical protein
SBO_1293428-2.135296IS1 ORF2
SBO_1294324-3.844072hypothetical protein
SBO_1295225-2.859818hypothetical protein
SBO_1296022-2.206480hypothetical protein
SBO_1297021-0.726960hypothetical protein
SBO_1298024-0.690755IS1 ORF2
SBO_1299123-0.376860IS1 encoded protein
SBO_1300024-0.110017hypothetical protein
SBO_1301023-0.455567hypothetical protein
SBO_1302126-4.878760amino acid/amine transport protein
SBO_1303-121-5.866728ARAC-type regulatory protein
SBO_1304016-5.078831hypothetical protein
SBO_1305014-5.108349hypothetical protein
SBO_1306-113-4.705548hypothetical protein
SBO_1308-111-4.601605hypothetical protein
SBO_1309-111-1.506728hypothetical protein
SBO_1310113-1.004809hypothetical protein
SBO_1311120-0.853700hypothetical protein
SBO_1312019-1.278843aldehyde reductase
SBO_1313121-1.665329hypothetical protein
SBO_1314221-1.108951IS4 orf
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1297HTHTETR306e-04 TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 30.0 bits (67), Expect = 6e-04
Identities = 9/37 (24%), Positives = 17/37 (45%), Gaps = 5/37 (13%)

Query: 4 LSWIIFGLIAGILAKWIMPG-----KDGGGFFMTILL 35
+ I+ G I+G++ W+ K ++ ILL
Sbjct: 163 AAIIMRGYISGLMENWLFAPQSFDLKKEARDYVAILL 199


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1304PRTACTNFAMLY280.022 Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 27.7 bits (61), Expect = 0.022
Identities = 18/61 (29%), Positives = 26/61 (42%)

Query: 49 QGLSIGIIILTIGVMAPIASGTLPPSTLIHSFLNWKSLVAIAVGVIVSWLGGRGVTLMGS 108
Q +I L IG + + LPPS ++ N ++ A VS LG +TL G
Sbjct: 174 QRSAIVDGGLHIGALQSLQPEDLPPSRVVLRDTNVTAVPASGAPAAVSVLGASELTLDGG 233

Query: 109 Q 109

Sbjct: 234 H 234


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1313INVEPROTEIN290.022 Salmonella/Shigella invasion protein E (InvE) signat...
		>INVEPROTEIN#Salmonella/Shigella invasion protein E (InvE)

signature.
Length = 372

Score = 28.9 bits (64), Expect = 0.022
Identities = 18/81 (22%), Positives = 34/81 (41%), Gaps = 13/81 (16%)

Query: 165 ETTSALHTYFNVGDIAKVSVSGLGDRFIDKVNDAKED-----------VLTDGIQTFPDR 213
E ++AL + N D K S S L + F ++V + + V ++ F +
Sbjct: 57 EMSAALAQFRNRRDYEKKS-SNLSNSF-ERVLEDEALPKAKQILKLISVHGGALEDFLRQ 114

Query: 214 TDRVYLNPQDCSVINDEALNR 234
++ +P D ++ E L R
Sbjct: 115 ARSLFPDPSDLVLVLRELLRR 135


20SBO_1343SBO_1357Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_13432122.597973arginine succinyltransferase
SBO_13442141.834019succinylglutamic semialdehyde dehydrogenase
SBO_13451130.154885succinylarginine dihydrolase
SBO_1346014-0.966791succinylglutamate desuccinylase
SBO_1347115-2.202170hypothetical protein
SBO_1348-115-1.819883hypothetical protein
SBO_1349-217-2.083649nucleotide excision repair endonuclease
SBO_1350-217-1.936101NAD synthetase
SBO_1351219-0.587008DNA-binding transcriptional activator OsmE
SBO_1352224-0.548823PTS system N,N'-diacetylchitobiose-specific
SBO_13535290.649791IS1 ORF2
SBO_13543170.460814IS1 encoded protein
SBO_13562170.327717IS629 ORF1
SBO_1357217-0.044346IS629 ORF2
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1344DNABINDINGHU300.005 Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 29.7 bits (67), Expect = 0.005
Identities = 14/61 (22%), Positives = 28/61 (45%), Gaps = 5/61 (8%)

Query: 74 SNKAELTAIIARETGKLRWEAATEVTAMINKIAISIKAYHVRTGEQRSEMPDGAASLRHR 133
+NK +L A +A T + ++A V A+ + ++ + GE+ + G +R R
Sbjct: 2 ANKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAK-----GEKVQLIGFGNFEVRER 56

Query: 134 P 134

Sbjct: 57 A 57


21SBO_1368SBO_1412Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_1368425-1.770888hypothetical protein
SBO_1369221-1.905135hypothetical protein
SBO_1370221-2.064477IS1 ORF2
SBO_1371121-2.505998IS1 encoded protein
SBO_1372224-2.541023IS1 encoded protein
SBO_1373324-1.512838IS1 ORF2
SBO_1375326-0.550484threonyl-tRNA synthetase
SBO_13764270.177038translation initiation factor IF-3
SBO_13773250.66381050S ribosomal protein L35
SBO_13782191.18860450S ribosomal protein L20
SBO_1379022-6.623875phenylalanyl-tRNA synthetase subunit alpha
SBO_1380-123-6.801294phenylalanyl-tRNA synthetase subunit beta
SBO_1381028-8.303236integration host factor subunit alpha
SBO_1382-127-8.101577insertion sequence 2 OrfA protein
SBO_1383-126-8.274514insertion element IS2 transposase InsD
SBO_1384-122-5.369560hypothetical protein
SBO_13850192.254701bacteriophage late gene regulator
SBO_1386-1191.852061tail sheath protein
SBO_13871200.667412tail tube protein
SBO_1388122-0.436475tail protein
SBO_1389020-1.016186tail protein
SBO_1390-123-3.629232tail protein
SBO_1391024-2.013973DNA-invertase
SBO_1392022-1.376957tail fiber assembly protein
SBO_13931200.551224tail fiber assembly protein
SBO_13941202.251426bacteriophage tail protein
SBO_13954265.076388phage tail protein
SBO_13962275.700376phage baseplate assembly protein
SBO_13972284.970602phage baseplate assembly protein
SBO_13981295.119292phage baseplate assembly protein
SBO_13991295.133518phage tail completion protein
SBO_14001265.432177phage tail protein
SBO_14011285.178209hypothetical protein
SBO_14020284.681502endolysin
SBO_14030243.932244phage tail protein
SBO_1404-1223.401226capsid completion protein
SBO_14050201.972853phage terminase
SBO_14060191.438702major capsid protein
SBO_1407-1181.328801capsid scaffolding protein
SBO_1408-1200.681174terminase subunit
SBO_1409022-0.463216capsid portal protein
SBO_1410225-0.449049hypothetical protein
SBO_14113241.145710stability/partitioning protein
SBO_14123241.214683IS911 ORF2
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1369cloacin270.008 Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 26.6 bits (58), Expect = 0.008
Identities = 10/28 (35%), Positives = 11/28 (39%)

Query: 5 TTPPNGCNNKKSLDARPFCYTPLSRTRD 32
T G N D RP +T TRD
Sbjct: 242 QTLSPGVTNNTDKDVRPAGFTQGGNTRD 269


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1381DNABINDINGHU1193e-39 Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 119 bits (301), Expect = 3e-39
Identities = 34/89 (38%), Positives = 55/89 (61%)

Query: 4 TKAEMSEYLFDKLGLSKRDAKELVELFFEEIRRALENGEQVKLSGFGNFDLRDKNQRPGR 63
K ++ + + L+K+D+ V+ F + L GE+V+L GFGNF++R++ R GR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 64 NPKTGEDIPITARRVVTFRPGQKLKSRVE 92
NP+TGE+I I A +V F+ G+ LK V+
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAVK 91


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1389PHPHTRNFRASE300.047 Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase

signature.
Length = 572

Score = 30.1 bits (68), Expect = 0.047
Identities = 24/106 (22%), Positives = 45/106 (42%), Gaps = 13/106 (12%)

Query: 286 IGSNILTQFGLSADQMD-RVG---DTLTAAFTRTNTDLRALGETMKYAGPVAGKLGISLE 341
IG+N L Q+ ++AD+M+ RV A LR + +K A +G+ E
Sbjct: 452 IGTNDLIQYTMAADRMNERVSYLYQPYHPAI------LRLVDMVIKAAHSEGKWVGMCGE 505

Query: 342 QAA--AMAGVLANMGIRG-SDAGTAMRASLARLASPPKAAAEALKE 384
A +L +G+ S + T++ + ++L K + +
Sbjct: 506 MAGDEVAIPLLLGLGLDEFSMSATSILPARSQLLKLSKEELKPFAQ 551


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1394FLAGELLIN320.012 Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 31.6 bits (71), Expect = 0.012
Identities = 33/240 (13%), Positives = 67/240 (27%), Gaps = 6/240 (2%)

Query: 121 GSGRAQTFRTILTVSSTATVALTVDNTMVMATVDYVDDKLKEHEQSRRHPDASLTAKGFV 180
SG T T TV V + L + +S + G +
Sbjct: 210 NSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAI 269

Query: 181 QLSSATNSVSETQAATPKAVKAAYDLANGKYTAQDASTTRKGLVQLSSATNSTSETQAAT 240
+ ++ K D T + + +++ + +
Sbjct: 270 KGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTTINGEKVTLTVADITAGAANVDAATLQS 329

Query: 241 PKAVKAAYDLANAKYTAQDATTAQKGIVQLSSATNSTSETLAATSKAVKAVMDETNKKAP 300
K V + N ++T D T + + A N+ T + + K
Sbjct: 330 SKNVYTSVV--NGQFTFDDKTKNESAKLSDLEANNAVKGESKITVNGAEYTANAAGDKVT 387

Query: 301 LNSPALTGTPTTPAARQGTNNTQIASTAFVMAAIAALVDSSPDALNTLNELAAALGNAPN 360
L + T N A+ +A++ AL+ ++ + ++LG N
Sbjct: 388 LAGKTMFIDKTASGVSTLINEDAAAAKKSTANPLASI----DSALSKVDAVRSSLGAIQN 443


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1400ANTHRAXTOXNA280.017 Anthrax toxin LF subunit signature.
		>ANTHRAXTOXNA#Anthrax toxin LF subunit signature.

Length = 800

Score = 28.2 bits (62), Expect = 0.017
Identities = 12/44 (27%), Positives = 27/44 (61%), Gaps = 4/44 (9%)

Query: 79 LLNPERNQDIKFSAVI----NDDDSADLLFTLPLRERVRITRSS 118
+++ +++ D +F +I +D DS+DLLF+ +E++ + S
Sbjct: 181 IISKDKSLDPEFLNLIKSLSDDSDSSDLLFSQKFKEKLELNNKS 224


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1405PF03944300.007 delta endotoxin
		>PF03944#delta endotoxin

Length = 633

Score = 30.4 bits (68), Expect = 0.007
Identities = 26/101 (25%), Positives = 37/101 (36%), Gaps = 18/101 (17%)

Query: 77 DGGQQDEVIATLMVWAIDCGDLPLALRIGAYVVRHNL-IMPDNFGRTAATVLTEEICNPV 135
D G E +AT+ W + + L LR GA+ R PD F R + V
Sbjct: 379 DSGSDREGVATVTNWQTESFETTLGLRSGAFTARGISNYFPDYFIRNISGV--------- 429

Query: 136 LTQAGTDADADLSAFIEPL--DTLREIVTDQDMPDEVRAKL 174
+ DL PL + +R I + P RA +
Sbjct: 430 ---PLVVRNEDLR---RPLHYNEIRNIASPSGTPGGARAYM 464


22SBO_1547SBO_1564Y        NYGenomic Island
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_1547220-0.791010insertion element IS2 transposase InsD
SBO_1548117-1.715440oxidoreductase, major subunit
SBO_1549120-2.452576IS629 ORF1
SBO_1550022-4.264206IS629 ORF2
SBO_1552025-6.384224hypothetical protein
SBO_1553-127-7.964817hypothetical protein
SBO_1555-124-7.617770oxidoreductase
SBO_1556-224-8.416831hypothetical protein
SBO_1557-219-6.958872transcriptional regulator YdeO
SBO_1558-117-6.140529sulfatase
SBO_1559-115-5.058121hypothetical protein
SBO_1561-113-4.301459hypothetical protein
SBO_1562-216-3.803414peptidase
SBO_1563012-2.588410glutamate decarboxylase
SBO_1564013-3.066622acid sensitivity protein
23SBO_1603SBO_1633Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_1603226-0.807955hypothetical protein
SBO_1604428-0.728161IS1 encoded protein
SBO_1605529-0.088156insertion element IS2 transposase InsD
SBO_1606426-1.201994IS1 ORF2
SBO_1607-121-1.805695IS600 ORF1
SBO_1608-119-1.594278IS600 ORF2
SBO_1609018-2.222600hypothetical protein
SBO_1610-117-1.909735IS1 ORF2
SBO_1611018-2.371575competence damage-inducible protein A
SBO_1612017-2.199778dipeptidyl carboxypeptidase II
SBO_1613-118-2.5325873-hydroxy acid dehydrogenase
SBO_1614121-3.681327hypothetical protein
SBO_1615321-2.844857hypothetical protein
SBO_1616322-1.915434oxidoreductase
SBO_1617529-1.440309IS1 ORF2
SBO_1618527-0.960497IS1 encoded protein
SBO_16193201.429216invasion plasmid antigen
SBO_16203223.518597prophage tail protein
SBO_16214233.822378phage tail fiber protein
SBO_16224243.991503tail component encoded by cryptic prophage
SBO_16234244.031992membrane protein
SBO_16241253.789834host specificity protein
SBO_16252273.655906prophage tail protein
SBO_16263283.301136tail assembly protein
SBO_16272262.914580minor tail protein
SBO_16283262.670012minor tail protein
SBO_16293272.894087tail length tape measure protein
SBO_16305261.531211prophage tail protein
SBO_16315241.568293prophage tail protein
SBO_16324181.640794prophage tail protein
SBO_16332200.981365prophage tail protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1613DHBDHDRGNASE1002e-27 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 100 bits (249), Expect = 2e-27
Identities = 70/244 (28%), Positives = 114/244 (46%), Gaps = 16/244 (6%)

Query: 2 IVLVTGATAGFGECITRRFIQQGHKVIATGRRQERLQELKDELGDNLYIAQ---LDVRNR 58
I +TGA G GE + R QG + A E+L+++ L A+ DVR+
Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDS 69

Query: 59 AAIEEMLASLPAEWCNIDILVNNAGLALGMEPAHKASVEDWETMIDTNNKGLVYMTRAVL 118
AAI+E+ A + E IDILVN AG+ L H S E+WE N+ G+ +R+V
Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGV-LRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 119 PGMVERNHGHIINIGSTAGSWPYAGGNVYGATKAFVRQFSLNLRTDLHGTAVRVTDIEPG 178
M++R G I+ +GS P Y ++KA F+ L +L +R + PG
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 179 LVGGTEFSNVRFKGDDGKAE------KTYQNTVALT----PEDVSEAV-WWVSTLPAHVN 227
T+ + ++G + +T++ + L P D+++AV + VS H+
Sbjct: 189 ST-ETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHIT 247

Query: 228 INTL 231
++ L
Sbjct: 248 MHNL 251


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1621FLAGELLIN280.046 Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 28.5 bits (63), Expect = 0.046
Identities = 13/89 (14%), Positives = 29/89 (32%)

Query: 4 QQSAGAAAGNAQQTAQDVAAAATARDDAQRFAENARQDATVTAEDRKATAEDVTSTGANA 63
Q + N D+ A + +++ A A + + + +
Sbjct: 341 QFTFDDKTKNESAKLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTAS 400

Query: 64 AAAGQSAQDAAGYARAAKQAKNDIDAALT 92
+ +DAA ++ ID+AL+
Sbjct: 401 GVSTLINEDAAAAKKSTANPLASIDSALS 429


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1623ENTEROVIROMP1467e-47 Enterobacterial virulence outer membrane protein si...
		>ENTEROVIROMP#Enterobacterial virulence outer membrane protein

signature.
Length = 171

Score = 146 bits (369), Expect = 7e-47
Identities = 66/201 (32%), Positives = 102/201 (50%), Gaps = 32/201 (15%)

Query: 1 MRK-VCAAILSAAICLAVSGAPAWASEHQSTLSAGYLQPHTDMPGSDDLKGINVKYRYEF 59
M+K C + L+A LA + + A+ ST++ GY Q + + G N+KYRYE
Sbjct: 1 MKKIACLSALAA--VLAFTAGTSVAA--TSTVTGGYAQSDAQGQMNK-MGGFNLKYRYEE 55

Query: 60 TDT-LGLVTSFSYANAEDEQKTHYSDTRWHEDSVRNRWFSVMAGPSVRVNEWFSAYAMAG 118
++ LG++ SF+Y T S T D +N+++ + AGP+ R+N+W S Y + G
Sbjct: 56 DNSPLGVIGSFTY--------TEKSRTASSGDYNKNQYYGITAGPAYRINDWASIYGVVG 107

Query: 119 VAYSRVSTFSGDYLRVTDNKGKTHDVLTGSDDNRHSNTSLAWGAGVQFNPTESVAIDLAY 178
V Y + T T+ HD S+ ++GAG+QFNP E+VA+D +Y
Sbjct: 108 VGYGKFQT--------TEYPTYKHD---------TSDYGFSYGAGLQFNPMENVALDFSY 150

Query: 179 EGSGSGDWRTDGFIVGVGYKF 199
E S +I GVGY+F
Sbjct: 151 EQSRIRSVDVGTWIAGVGYRF 171


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1629RTXTOXIND391e-04 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 38.7 bits (90), Expect = 1e-04
Identities = 25/224 (11%), Positives = 63/224 (28%), Gaps = 30/224 (13%)

Query: 545 NYQEQQKRRNAENAALNRMNETEAARHQREIARINAMQYADQAVRDAAIQRENERYEKAI 604
+ + + A L + + E+ ++ ++ D+ + E R I
Sbjct: 133 EADTLKTQSSLLQARLEQ-TRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLI 191

Query: 605 KKNTRATRNDEATRLLLQYSQQQAQVEGQIAAARQSAGIATERMTEAHKQLLALQQRISD 664
K+ + Q+ Q E + R R+ + R+ D
Sbjct: 192 KEQ------------FSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDD 239

Query: 665 LDGKKLTADEKSVLARKNELIQALTLLDVKQQELQKQTALNDLRKKTVQLTSQLADKERA 724
L + I +L+ + + ++ L + + Q+ S++ +
Sbjct: 240 F--SSL---------LHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEE 288

Query: 725 LREQHNLDIATAGMGDKQRQRYQAQLRIRQEYRQQLQQLENDSR 768
+ + + Q I +L + E +
Sbjct: 289 YQ-LVTQLFKN----EILDKLRQTTDNIGL-LTLELAKNEERQQ 326



Score = 37.1 bits (86), Expect = 4e-04
Identities = 14/188 (7%), Positives = 40/188 (21%), Gaps = 12/188 (6%)

Query: 35 DAERSSARMQRFMERQTQAARQTMHAASSAATAASAHAQTVEKNARAHERMAREVEQTRL 94
+A+ + R Q Q S + + EV +
Sbjct: 133 EADTLKTQSSLLQARLEQTRYQ---ILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTS 189

Query: 95 RVDALNQKMREEQAQARALAEAQDKAAAAFYRQIDSVKQAGAGLQELQRIQQQIRQARNS 154
+ + ++ Q + + +I+ + + +
Sbjct: 190 LIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDD---FSSLLHK 246

Query: 155 GGVGQQDYLALISEITAKTRALTQAE------EQATRQKAAFIRQLKEQATRQNLSSSEL 208
+ + L ++ L + E + + + + L
Sbjct: 247 QAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQ 306

Query: 209 LRARAAQL 216
L
Sbjct: 307 TTDNIGLL 314



Score = 32.9 bits (75), Expect = 0.007
Identities = 31/238 (13%), Positives = 69/238 (28%), Gaps = 42/238 (17%)

Query: 402 DPVNAAKALDNALHFLTATQLEQIRVLGEQGRSSDAARIAMSALAEETGKRTSDIDNNLN 461
+ A L +LEQ R RS + ++ L +E + + L
Sbjct: 128 TALGAEADTLKTQSSLLQARLEQTRYQILS-RSIELNKLPELKLPDEPYFQNVSEEEVLR 186

Query: 462 ALGSTLQTLSDWWKQFWDAAMNIGREDSLDAQIDALQEKIQRAKKYPWTNASTQVEYDQQ 521
+ S W Q + +N+ D A+ + +I R + ++
Sbjct: 187 LTSLIKEQFSTWQNQKYQKELNL---DKKRAERLTVLARINRYEN--------LSRVEKS 235

Query: 522 RLNDLQEKKRRKDLQDAKAQAERNYQEQQKRRNAENAALNRMNETEAARHQREIARINAM 581
RL+D L +A A+ EQ+ + L +++ +
Sbjct: 236 RLDDFSS------LLHKQAIAKHAVLEQENKYVEAVNELRVY-----------KSQLEQI 278

Query: 582 QYADQAVRDAAIQRENERYEKAIKKNTRATRNDEATRLLLQYSQQQAQVEGQIAAARQ 639
+ + ++ + + K + T + ++A +
Sbjct: 279 ESEILSAKEEYQLVTQLFKNEILDKLRQTT-------------DNIGLLTLELAKNEE 323



Score = 30.6 bits (69), Expect = 0.036
Identities = 26/185 (14%), Positives = 57/185 (30%), Gaps = 13/185 (7%)

Query: 13 IDAAEFKNEIPRIKNLLNGAAGDAERSSARMQRFMERQTQAARQTMHAASSAATAASAHA 72
+ E +E R+ ++ Q + + A
Sbjct: 157 SRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAER 216

Query: 73 QTVEKNARAHERMAREVEQTRLRVDALNQKMREEQAQARALAEAQDKAAAAFYRQIDSVK 132
TV +E ++R VE++RL + +QA A+ Q+ ++
Sbjct: 217 LTVLARINRYENLSR-VEKSRL---DDFSSLLHKQAIAKHAVLEQENKYVEAVNEL---- 268

Query: 133 QAGAGLQELQRIQQQIRQARNSGGVGQQDYLALISEITAKTRA-LTQAEEQATRQKAAFI 191
+L++I+ +I A+ + Q + I + +T + + K
Sbjct: 269 --RVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLE--LAKNEER 324

Query: 192 RQLKE 196
+Q
Sbjct: 325 QQASV 329


24SBO_1645SBO_1657Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_1645020-3.858229hypothetical protein
SBO_1646017-2.128345sugar efflux transporter
SBO_1647017-1.472014multiple drug resistance protein MarC
SBO_1648215-1.745524DNA-binding transcriptional repressor MarR
SBO_1649218-0.912720DNA-binding transcriptional activator MarA
SBO_1650322-0.305197hypothetical protein
SBO_1652319-1.538583IS629 ORF1
SBO_1653316-1.350152IS629 ORF2
SBO_1654119-2.743479hypothetical protein
SBO_1655017-3.456712IS1 ORF2
SBO_1656-119-4.398297IS1 encoded protein
SBO_1657-118-4.354187tellurite resistance protein TehB
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1646TCRTETB537e-10 Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 53.3 bits (128), Expect = 7e-10
Identities = 41/192 (21%), Positives = 83/192 (43%), Gaps = 8/192 (4%)

Query: 36 LSDIAQSFHMQTAQVGIMLTIYAWVVALMSLPFMLMTSQVERRKLLICLFVVFIASHVLS 95
L DIA F+ A + T + ++ + + ++ Q+ ++LL+ ++ V+
Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96

Query: 96 FLSWS-FTVLVISRIGVAFAHAIFWSITASLAIRMAPAGKRAQALSLIATGTALAMVLGL 154
F+ S F++L+++R A F ++ + R P R +A LI + A+ +G
Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156

Query: 155 PLGRIVGQYFGWRMTFFAIGIGALITLLCLIKLLPLLPSEHSGSLKSLPLLFRRPALMSI 214
+G ++ Y W I + +IT+ L+KLL + LMS+
Sbjct: 157 AIGGMIAHYIHWSY-LLLIPMITIITVPFLMKLLK------KEVRIKGHFDIKGIILMSV 209

Query: 215 YLLTVVVVTAHY 226
++ ++ T Y
Sbjct: 210 GIVFFMLFTTSY 221


25SBO_1689SBO_1716Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_1689018-3.325993outer membrane protein
SBO_1690219-2.235226filament protein
SBO_1691218-1.256210Iron transport protein, periplasmic-binding
SBO_1692218-0.262268Iron transport protein, ATP-binding component
SBO_1693321-0.310244iron transport protein, inner membrane
SBO_1694425-0.267170Iron transport protein, inner membrane
SBO_16955270.898423ISSfl4 ORF3
SBO_16964240.050397IS629 ORF2
SBO_1697325-0.358846IS629 ORF1
SBO_1698325-1.126757ISSfl4 ORF3
SBO_1699225-0.717997transposase
SBO_1700324-0.136429IS911 ORF2
SBO_1704123-1.483784***hypothetical protein
SBO_17053230.113415IS600 ORF2
SBO_17064220.571408IS600 ORF1
SBO_17075221.402670IS629 ORF1
SBO_17084230.460521IS629 ORF2
SBO_1709424-0.091746hypothetical protein
SBO_17104240.294830IS911 ORF2
SBO_1711224-3.567823IS629 ORF2
SBO_1712220-4.867531IS629 ORF1
SBO_1713-121-4.164739hypothetical protein
SBO_1714-220-3.210466IS600 ORF2
SBO_1715-121-3.908357IS600 ORF1
SBO_1716019-3.308494transposase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1689ECOLIPORIN5810.0 E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 581 bits (1499), Expect = 0.0
Identities = 312/388 (80%), Positives = 336/388 (86%), Gaps = 16/388 (4%)

Query: 1 MKSKVLALLIPALLAAGAAHAAEVYNKDGNKLDLYGKVDGLHYFSDNSAKDGDQSYARLG 60
MK KVLAL+IPALLAAGAAHAAE+YNKDGNKLDLYGKVDGLHYFSD+S+KDGDQ+Y R+G
Sbjct: 1 MKRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSKDGDQTYMRVG 60

Query: 61 FKGETQINDQLTGYGQWEYNIQANNTESSKNQSWTRLAFAGLKFADYGSFDYGRNYGVMY 120
FKGETQINDQLTGYGQWEYN+QAN TE SWTRLAFAGLKF DYGSFDYGRNYGV+Y
Sbjct: 61 FKGETQINDQLTGYGQWEYNVQANTTEGEGANSWTRLAFAGLKFGDYGSFDYGRNYGVLY 120

Query: 121 DIEGWTDMLPEFGGDSYTNADNFMTGRANGVATYRNTDFFGLVNGLNFAVQYQGNNEGAS 180
D+EGWTDMLPEFGGDSYT ADN+MTGRANGVATYRNTDFFGLV+GLNFA+QYQG NE S
Sbjct: 121 DVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNESQS 180

Query: 181 N-----GQEGTNNGRDVRHENGDGWGFSTTYDLGMGFSAGAAYTSSDRTNDQVNH--TAA 233
G NNG D+R++NGDG+G STTYD+GMGFSAGAAYT+SDRTN+QVN T A
Sbjct: 181 ADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVNAGGTIA 240

Query: 234 GGDKADAWTAGLKYDANNIYLATMYSETRNMTPFGDS----DYAVANKTQNFEVTAQYQF 289
GGDKADAWTAGLKYDANNIYLATMYSETRNMTP+G + D VANKTQNFEVTAQYQF
Sbjct: 241 GGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQNFEVTAQYQF 300

Query: 290 DFGLRPAVSFLMSKGRDLHAAGGADNPAGVDDKDLVKYADVGATYYFNKNMSTYVDYKIN 349
DFGLRPAVSFLMSKG+DL N DDKDLVKYADVGATYYFNKN STYVDYKIN
Sbjct: 301 DFGLRPAVSFLMSKGKDL-----TYNNVNGDDKDLVKYADVGATYYFNKNFSTYVDYKIN 355

Query: 350 LLDEDDSFYAANGISTDDIVALGLVYQF 377
LLD+DD FY GISTDDIVALG+VYQF
Sbjct: 356 LLDDDDPFYKDAGISTDDIVALGMVYQF 383


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1691adhesinb331e-116 Adhesin B signature.
		>adhesinb#Adhesin B signature.

Length = 310

Score = 331 bits (849), Expect = e-116
Identities = 90/296 (30%), Positives = 163/296 (55%), Gaps = 7/296 (2%)

Query: 9 MLLGCLALTCSIAFQASATEKFKVITTFTIIADMAKNVAGDAAEVSSITKPGAEIHEYQP 68
+G A + + + + K V+ T +IIAD+ KN+AGD + SI G + HEY+P
Sbjct: 13 AFVGLAACSSQKSSTETGSSKLNVVATNSIIADITKNIAGDKINLHSIVPVGQDPHEYEP 72

Query: 69 TPGDIKRAQGAQLILANGMNLEL----WFQRFYQHLNGVPE---VIVSSGVTPVGITEGP 121
P D+K+ A LI NG+NLE WF + ++ VS GV + +
Sbjct: 73 LPEDVKKTSQADLIFYNGINLETGGNAWFTKLVENAKKKENKDYYAVSEGVDVIYLEGQS 132

Query: 122 YEGKPNPHAWMSPDNALIYVDNIRDALIKYDPANAQTYQRNADTYKAKITQTLAPLRKQI 181
+GK +PHAW++ +N +IY NI L + DPAN +TY++N Y K++ +++
Sbjct: 133 EKGKEDPHAWLNLENGIIYAQNIAKRLSEKDPANKETYEKNLKAYVEKLSALDKEAKEKF 192

Query: 182 TELPENQRWMVTSEGAFSYLARDLGLKELYLWPINADQQGTPQQVRKVVDIVKKNHIPAV 241
+P ++ +VTSEG F Y ++ + Y+W IN +++GTP Q++ +V+ ++K +P++
Sbjct: 193 NNIPGEKKMIVTSEGCFKYFSKAYNVPSAYIWEINTEEEGTPDQIKTLVEKLRKTKVPSL 252

Query: 242 FSESTISDKPARQVARETGAHYGGVLYVDSLSTENGPVPTYIDLLKVTTSTLVQGI 297
F ES++ D+P + V+++T ++ DS++ + +Y ++K + +G+
Sbjct: 253 FVESSVDDRPMKTVSKDTNIPIYAKIFTDSVAEKGEEGDSYYSMMKYNLEKIAEGL 308


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1698RTXTOXIND414e-06 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 41.0 bits (96), Expect = 4e-06
Identities = 9/74 (12%), Positives = 33/74 (44%), Gaps = 1/74 (1%)

Query: 16 LRKQQSRLRQYACQVAGYEQEIERLKAQLDRLRRMLFGQSSEKKRHKLENQIRQAEKRLS 75
+ +Q+++ + ++ Y+ ++E++++++ + + K L+ ++RQ +
Sbjct: 254 VLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILD-KLRQTTDNIG 312

Query: 76 ELENRLNTARNLLE 89
L L +
Sbjct: 313 LLTLELAKNEERQQ 326


26SBO_1727SBO_1738Y        NYGenomic Island
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_1727-114-3.791318fumarate/nitrate reduction transcriptional
SBO_1728-116-4.302505universal stress protein UspE
SBO_1729-122-5.047402hypothetical protein
SBO_1730-221-3.617277IS1 ORF2
SBO_1731-122-3.731191IS1 encoded protein
SBO_1732-121-3.578478hypothetical protein
SBO_1733-124-2.688708transport periplasmic protein
SBO_1734225-1.611506LysR family transcriptional regulator
SBO_1735122-0.951769insertion element IS2 transposase InsD
SBO_1736122-1.905349insertion sequence 2 OrfA protein
SBO_1737121-2.474550hypothetical protein
SBO_1738224-3.899412dienelactone hydrolase and related enzyme
27SBO_1750SBO_1757Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_1750431-0.437970IS1 encoded protein
SBO_1751326-0.483272IS1 ORF2
SBO_1752120-1.368402IS1 ORF2
SBO_1753316-2.470165IS1 encoded protein
SBO_1754213-1.533033thiosulfate:cyanide sulfurtransferase
SBO_17553140.718915peripheral inner membrane phage-shock protein
SBO_17562172.124913DNA-binding transcriptional activator PspC
SBO_17572193.073412phage shock protein B
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1757MPTASEINHBTR250.030 Metalloprotease inhibitor signature.
		>MPTASEINHBTR#Metalloprotease inhibitor signature.

Length = 122

Score = 24.6 bits (53), Expect = 0.030
Identities = 7/43 (16%), Positives = 17/43 (39%)

Query: 30 SGRSELSQSEQQRLAQLADEAKRMRERIQALESILDAEHPNWR 72
+G+ + + A A++A + + E L + +W
Sbjct: 37 AGQLGIEATGSGVCAGPAEQANALAGDVACAEQWLGDKPVSWS 79


28SBO_1807SBO_1833Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_1807217-2.999196IS1 ORF2
SBO_1808218-2.421705IS1 encoded protein
SBO_1809118-2.345601outer membrane protein W
SBO_1810016-1.863568hypothetical protein
SBO_1811120-2.043790intracellular septation protein A
SBO_1812-124-4.212484acyl-CoA thioester hydrolase
SBO_1813-222-3.228325transporter
SBO_1814-120-4.215296YciI-like protein
SBO_1815018-3.460659IS600 ORF2
SBO_1816017-3.435595IS600 ORF1
SBO_1817015-2.772522voltage-gated potassium channel
SBO_1818014-1.742764cardiolipin synthetase
SBO_1819-113-2.818841dsDNA-mimic protein
SBO_1820011-2.454206hypothetical protein
SBO_1821012-2.977785oligopeptide ABC transporter ATP-binding
SBO_1822113-3.581695hypothetical protein
SBO_1823024-3.304671oligopeptide transporter permease
SBO_1824026-2.202286oligopeptide transport periplasmic binding
SBO_1825128-0.888688IS1 ORF2
SBO_1826027-2.194178IS1 encoded protein
SBO_1827-129-2.355353hypothetical protein
SBO_1828-130-2.586878bifunctional acetaldehyde-CoA/alcohol
SBO_1829-121-3.445977hypothetical protein
SBO_1830023-3.989745hypothetical protein
SBO_1831028-4.926468thymidine kinase
SBO_1832-123-4.243507global DNA-binding transcriptional dual
SBO_1833017-3.005667UTP--glucose-1-phosphate uridylyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1813TONBPROTEIN2562e-88 Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 256 bits (654), Expect = 2e-88
Identities = 236/239 (98%), Positives = 237/239 (99%)

Query: 4 ITLDLPRRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELPAPAQPISVTMVAPADLEPPQA 63
+TLDLPRRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELPAPAQPISVTMV PADLEPPQA
Sbjct: 1 MTLDLPRRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELPAPAQPISVTMVTPADLEPPQA 60

Query: 64 VQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESR 123
VQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESR
Sbjct: 61 VQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESR 120

Query: 124 PASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIEGQVKVKF 183
PASPFENTAPAR TSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIEGQVKVKF
Sbjct: 121 PASPFENTAPARLTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIEGQVKVKF 180

Query: 184 DVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSGIVVNILFKINGTTEIQ 242
DVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSGIVVNILFKINGTTEIQ
Sbjct: 181 DVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSGIVVNILFKINGTTEIQ 239


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1814adhesinmafb314e-04 Neisseria meningitidis: adhesin MafB signature.
		>adhesinmafb#Neisseria meningitidis: adhesin MafB signature.

Length = 467

Score = 31.2 bits (70), Expect = 4e-04
Identities = 16/57 (28%), Positives = 20/57 (35%), Gaps = 2/57 (3%)

Query: 41 GPMPAVDSNDPGAAGFTGSTVIAEFESLEAAQAWADADPYVAAGVYEHVSVKPFKKV 97
P+PA G GS E + EA W +P A V +V KV
Sbjct: 268 APLPA--EGKFAVIGGLGSVAGFEKNTREAVDRWIQENPNAAETVEAVFNVAAAAKV 322


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1820HTHFIS310.008 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.6 bits (69), Expect = 0.008
Identities = 9/16 (56%), Positives = 11/16 (68%)

Query: 55 VVGESGCGKSTFARAI 70
+ GESG GK ARA+
Sbjct: 165 ITGESGTGKELVARAL 180


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1827ACRIFLAVINRP280.034 Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 27.9 bits (62), Expect = 0.034
Identities = 14/44 (31%), Positives = 22/44 (50%)

Query: 47 NLTANLSVAIILWISLFLGDTILQLFGISINSFRIAGGILVVTI 90
N+ A L I + + L IL FG SIN+ + G +L + +
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGL 404


29SBO_1889SBO_1898Y        NYGenomic Island
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_1889-219-3.055875DNA polymerase V subunit UmuD
SBO_1891-320-3.459032IS1 ORF2
SBO_1892-122-3.833360IS1 encoded protein
SBO_1893022-3.843515hypothetical protein
SBO_1894120-3.065298septum formation inhibitor
SBO_1895-123-4.341706cell division inhibitor MinD
SBO_1896-124-3.968978cell division topological specificity factor
SBO_1897026-3.481541hypothetical protein
SBO_1898020-3.372717hypothetical protein
30SBO_1913SBO_1935Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_1913217-0.801007IS1 encoded protein
SBO_1914215-0.007293IS1 ORF2
SBO_19151130.420001putrescine/spermidine ABC transporter ATPase
SBO_19162160.124928spermidine/putrescine ABC transporter membrane
SBO_19172180.947982protein encoded within IS
SBO_19182180.651151IS629 ORF2
SBO_19190200.836067IS629 ORF1
SBO_1920019-0.088992protein encoded within IS
SBO_1921126-1.067260membrane protein, energy transducer
SBO_1922325-0.878934endopeptidase
SBO_1923325-1.052700hypothetical protein
SBO_1924322-0.076934lysozyme
SBO_19253220.270244hypothetical protein
SBO_19263240.959838hypothetical protein
SBO_19283230.728565hypothetical protein
SBO_19273261.596898IS629 ORF1
SBO_19293260.839462IS629 ORF2
SBO_1930224-1.450393hypothetical protein
SBO_1934124-2.986079***DNA adenine methyltransferase encoded by
SBO_1935021-3.112897DNA adenine methyltransferase encoded by
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1915PF05272300.017 Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.0 bits (67), Expect = 0.017
Identities = 10/36 (27%), Positives = 19/36 (52%), Gaps = 1/36 (2%)

Query: 46 LTLLGPSGCGKTTVLRLIAGLE-TVDSGRIMLDNED 80
+ L G G GK+T++ + GL+ D+ + +D
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKD 634


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1921TONBPROTEIN729e-19 Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 71.9 bits (176), Expect = 9e-19
Identities = 31/78 (39%), Positives = 49/78 (62%)

Query: 28 PRQLVKALPQYPAYAAANYIKGRVDVKFDIGADGTVTRIEFIRSEPHHLFDEQVVKAMAK 87
PR L + PQYPA A A I+G+V VKFD+ DG V ++ + ++P ++F+ +V AM +
Sbjct: 153 PRALSRNQPQYPARAQALRIEGQVKVKFDVTPDGRVDNVQILSAKPANMFEREVKNAMRR 212

Query: 88 WRFEKDRPCKGVKKTFIF 105
WR+E +P G+ +F
Sbjct: 213 WRYEPGKPGSGIVVNILF 230


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1934FbpA_PF05833290.010 Fibronectin-binding protein
		>FbpA_PF05833#Fibronectin-binding protein

Length = 577

Score = 29.5 bits (66), Expect = 0.010
Identities = 13/56 (23%), Positives = 24/56 (42%), Gaps = 4/56 (7%)

Query: 46 ESDYLKLQA--LFARVAEEKHR-RGELEKLHHQLVDTYTSLN-RQYAELLSEYKHL 97
+SD LK ++ L V +R + + L++ L + Y ELL+ +
Sbjct: 293 KSDRLKSKSSDLQKIVMNNINRCTKKDKILNNTLKKCEDKDIFKLYGELLTANIYA 348


31SBO_1974SBO_1992Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_1974220-1.80887850S ribosomal protein L32
SBO_19752170.645852hypothetical protein
SBO_19761150.632635Maf-like protein
SBO_19770130.64457723S rRNA pseudouridylate synthase C
SBO_1978-1110.973064transposase
SBO_19790121.088855hypothetical protein
SBO_19800111.516188ribonuclease E
SBO_1981-1101.142771flagellar hook-associated protein FlgL
SBO_19820101.120225flagellar hook-associated protein FlgK
SBO_19842111.689888flagellar basal body P-ring protein
SBO_19853131.546220flagellar basal body L-ring protein
SBO_19862141.566818flagellar basal body rod protein FlgG
SBO_19871151.401934flagellar basal body rod protein FlgF
SBO_19882170.271479flagellar hook protein FlgE
SBO_19891220.050453flagellar basal body rod modification protein
SBO_19901150.155758flagellar basal body rod protein FlgC
SBO_19912170.528795cell-proximal portion of basal-body rod
SBO_19922180.413484flagellar basal body P-ring biosynthesis protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1980IGASERPTASE675e-13 IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 66.6 bits (162), Expect = 5e-13
Identities = 48/288 (16%), Positives = 85/288 (29%), Gaps = 36/288 (12%)

Query: 513 PSEEEFAERKRPEQPALATFAMPDVPPAPT-PAEPAAPVVAPAPKAAPATPAAPAQPGLL 571
P E+ + DVP P+ E A AP P APATP+
Sbjct: 983 PEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETT----- 1037

Query: 572 SRFFGALKALFSGSEETKPTEQPAPKAEAKPERQQDRRKPRQNNRRDRNERRDTRSER-- 629
ET + Q QN + + + ++
Sbjct: 1038 ---------------ETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQT 1082

Query: 630 TEGSDNREENRRNRRQAQQQTAETRESRQQAEVTEKARTTDEQQAPRRERSRRRNDDKRQ 689
E + + E + + ++TA + + TEK + + + + + + Q
Sbjct: 1083 NEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQ 1142

Query: 690 AQ---QEAKALNVEEQSVQETEQEERVRPVQPRRKQRQLNQKVRYEQSV--AEEAVVAPV 744
A+ + +N++E Q + +P + + Q V +V V P
Sbjct: 1143 AEPARENDPTVNIKEPQSQTNTTADTEQPA--KETSSNVEQPVTESTTVNTGNSVVENPE 1200

Query: 745 VEETAAAEPIVQEAPA------PRTELVKVPLPVVAQTAPEQQEENNA 786
A +P V + R + VP V T A
Sbjct: 1201 NTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVA 1248



Score = 66.2 bits (161), Expect = 7e-13
Identities = 47/261 (18%), Positives = 84/261 (32%), Gaps = 26/261 (9%)

Query: 551 VAPAPKAAPATPAAPAQPGLLSRFFGALKALFSGSEETKPTEQP-APKAEAKPERQQDRR 609
P + S E + E P P A A P
Sbjct: 991 TVDTTNITTPNNIQADVPSVPSN----------NEEIARVDEAPVPPPAPATPSETT--- 1037

Query: 610 KPRQNNRRDRNERRDTRSERTEGSDNREENRRNRRQAQQQTAETRESRQQAEV------T 663
N ++++++ D E +NR A++ + + + Q EV T
Sbjct: 1038 -----ETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSET 1092

Query: 664 EKARTTDEQQAPRRERSRRRNDDKRQAQQEAKALNVEEQSVQETEQEERVRPVQPRRKQR 723
++ +TT+ ++ E+ + + + Q+ K + + QE + + + R
Sbjct: 1093 KETQTTETKETATVEKEEKAKVETEKTQEVPK-VTSQVSPKQEQSETVQPQAEPARENDP 1151

Query: 724 QLNQKVRYEQSVAEEAVVAPVVEETAAAEPIVQEAPAPRTELVKVPLPVVAQTAPEQQEE 783
+N K Q+ P E ++ E V E+ T V P A Q
Sbjct: 1152 TVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTV 1211

Query: 784 NNADNRDNGGMPRRSRRSPRH 804
N+ + RRS RS H
Sbjct: 1212 NSESSNKPKNRHRRSVRSVPH 1232


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1981FLAGELLIN474e-08 Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 47.3 bits (112), Expect = 4e-08
Identities = 31/132 (23%), Positives = 61/132 (46%), Gaps = 3/132 (2%)

Query: 7 MMYQQNMRGITNSQAEWMKYGEQMSTGKRVVNPSDDPIAASQAVVLSQAQAQNSQYTLAR 66
++ Q N+ +S + + E++S+G R+ + DD + A + +Q +
Sbjct: 11 LLTQNNLNKSQSSLSSAI---ERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQASRNA 67

Query: 67 TFAIQKVSLEESVLSQVTTAIQNAQEKIVYASNGTLSDDDRASLAMDIQGLRDQLLNLAN 126
I E L+++ +Q +E V A+NGT SD D S+ +IQ +++ ++N
Sbjct: 68 NDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDRVSN 127

Query: 127 TTDGNGRYIFAG 138
T NG + +
Sbjct: 128 QTQFNGVKVLSQ 139


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1982FLGHOOKAP16820.0 Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 682 bits (1762), Expect = 0.0
Identities = 545/546 (99%), Positives = 546/546 (100%)

Query: 2 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS 61
SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS
Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS 60

Query: 62 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV 121
GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV
Sbjct: 61 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV 120

Query: 122 SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND 181
SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND
Sbjct: 121 SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND 180

Query: 182 QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA 241
QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA
Sbjct: 181 QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA 240

Query: 242 RQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQL 301
RQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQL
Sbjct: 241 RQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQL 300

Query: 302 ALAFAEAFNTQHKAGFDANGDAGEDFFAIGKPAVLQNTKNKGDVAIGATVTDASAVLATD 361
ALAFAEAFNTQHKAGFDANGDAGEDFFAIGKPAVLQNTKNKGDVAIGATVTDASAVLATD
Sbjct: 301 ALAFAEAFNTQHKAGFDANGDAGEDFFAIGKPAVLQNTKNKGDVAIGATVTDASAVLATD 360

Query: 362 YKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAIV 421
YKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAIV
Sbjct: 361 YKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAIV 420

Query: 422 NMDVLITDEAKIAMASEKDAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGN 481
NMDVLITDEAKIAMASE+DAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGN
Sbjct: 421 NMDVLITDEAKIAMASEEDAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGN 480

Query: 482 KTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFD 541
KTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFD
Sbjct: 481 KTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFD 540

Query: 542 ALINIR 547
ALINIR
Sbjct: 541 ALINIR 546


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1984FLGPRINGFLGI427e-152 Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 427 bits (1099), Expect = e-152
Identities = 157/363 (43%), Positives = 213/363 (58%), Gaps = 9/363 (2%)

Query: 4 FLSALILLLVTTAAQAERIRDLTSVQGVRQNSLIGYGLVVGLDGTGDQTTQTPFTTQTLN 63
F + L A RI+D+ S+Q R N LIGYGLVVGL GTGD +PFT Q++
Sbjct: 13 FSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQSMR 72

Query: 64 NMLSQLGITVPTGTNMQLKNVAAVMVTASLPPFGRQGQTIDVVVSSMGNAKSLRGGTLLM 123
ML LGIT G + KN+AAVMVTA+LPPF G +DV VSS+G+A SLRGG L+M
Sbjct: 73 AMLQNLGITTQGGQS-NAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNLIM 131

Query: 124 TPLKGVDSQVYALAQGNILVGGAGASAGGSSVQVNQLNGGRITNGAVIERELPSQFGVGN 183
T L G D Q+YA+AQG ++V G A +++ R+ NGA+IERELPS+F
Sbjct: 132 TSLSGADGQIYAVAQGALIVNGFSAQGDAATLTQGVTTSARVPNGAIIERELPSKFKDSV 191

Query: 184 TLNLQLNDEDFSMAQQIADTINRVR----GYGSATALDARTIQVRVPSGNSSQVRFLADI 239
L LQL + DFS A ++AD +N G A D++ I V+ P + R +A+I
Sbjct: 192 NLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPRV-ADLTRLMAEI 250

Query: 240 QNMQVNVTPQDAKVVINSRTGSVVMNREVTLDSCAVAQGNLSVTVNRQANVSQPDTPFGG 299
+N+ V T AKVVIN RTG++V+ +V + AV+ G L+V V V QP PF
Sbjct: 251 ENLTVE-TDTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQP-APFSR 308

Query: 300 GQTVVTPQTQIDLRQSGGSLQSVRSSARLNNVVRALNALGATPMDLMSILQSMQSAGCLR 359
GQT V PQT I Q G + ++ L +V LN++G +++ILQ ++SAG L+
Sbjct: 309 GQTAVQPQTDIMAMQEGSKV-AIVEGPDLRTLVAGLNSIGLKADGIIAILQGIKSAGALQ 367

Query: 360 AKL 362
A+L
Sbjct: 368 AEL 370


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1985FLGLRINGFLGH346e-124 Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 346 bits (890), Expect = e-124
Identities = 230/232 (99%), Positives = 230/232 (99%)

Query: 1 MQKNAAHTYAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINY 60
MQKNAAHTYAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINY
Sbjct: 1 MQKNAAHTYAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINY 60

Query: 61 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNA 120
GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNA
Sbjct: 61 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNA 120

Query: 121 RADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF 180
RADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF
Sbjct: 121 RADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF 180

Query: 181 SGVVNPRTISGSNTVPSTQVVDARIEYVGNGYINEAQNMGWLQHFFLNLSPM 232
SGVVNPRTISGSNTVPSTQV DARIEYVGNGYINEAQNMGWLQ FFLNLSPM
Sbjct: 181 SGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM 232


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1986FLGHOOKAP1434e-07 Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 43.4 bits (102), Expect = 4e-07
Identities = 18/81 (22%), Positives = 36/81 (44%), Gaps = 14/81 (17%)

Query: 3 SSLWIAKTGLDAQQTNMDVIANNLANVSTNGFKRQRAVFEDLLYQTIRQPGAQSSEQTTL 62
S + A +GL+A Q ++ +NN+++ + G+ RQ + + +TL
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTI--------------MAQANSTL 47

Query: 63 PSGLQIGTGVRPVATERLHSQ 83
+G +G GV +R +
Sbjct: 48 GAGGWVGNGVYVSGVQREYDA 68



Score = 38.8 bits (90), Expect = 1e-05
Identities = 11/41 (26%), Positives = 21/41 (51%)

Query: 220 ETSNVNVAEELVNMIQVQRAYEINSKAVSTTDQMQQKLTQL 260
S VN+ EE N+ + Q+ Y N++ + T + + L +
Sbjct: 505 SISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1988FLGHOOKAP1415e-06 Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 41.5 bits (97), Expect = 5e-06
Identities = 17/49 (34%), Positives = 29/49 (59%)

Query: 354 TLTNGALEASNVDLSKELVNMIVAQRNYQSNAQTIKTQDQILNTLVNLR 402
L+N S V+L +E N+ Q+ Y +NAQ ++T + I + L+N+R
Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546



Score = 37.2 bits (86), Expect = 1e-04
Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 4/56 (7%)

Query: 6 AVSGLNAAATNLDVIGNNIANSATYGFKSGTASFAD----MFAGSKVGLGVKVAGI 57
A+SGLNAA L+ NNI++ G+ T A + AG VG GV V+G+
Sbjct: 7 AMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSGV 62


32SBO_2018SBO_2026Y        NYGenomic Island
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_2018016-3.073181glucan biosynthesis protein G
SBO_2019120-2.904741glucans biosynthesis protein
SBO_2020019-1.255258synthase
SBO_2021318-0.965559hypothetical protein
SBO_2022119-3.178085hypothetical protein
SBO_2023223-2.589674IS629 ORF2
SBO_2024124-3.352042IS629 ORF1
SBO_2025121-3.000623IS600 ORF2
SBO_2026219-2.056780cryptic curlin major subunit
33SBO_2056SBO_2082Y        NYGenomic Island
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_2056221-0.140203hypothetical protein
SBO_20580200.5422372Fe-2S ferredoxin
SBO_20593230.797026ribonucleotide-diphosphate reductase subunit
SBO_20603210.946306ribonucleotide-diphosphate reductase subunit
SBO_20613160.292590IS1 encoded protein
SBO_20622160.741499IS1 ORF2
SBO_20642150.8298203-demethylubiquinone-9 3-methyltransferase
SBO_20653170.861315DNA gyrase subunit A
SBO_2066321-0.149992hypothetical protein
SBO_20673210.524588IS600 ORF1
SBO_20684240.826604IS911 ORF2
SBO_20694240.385323IS629 ORF2
SBO_20705240.359190IS629 ORF1
SBO_20714210.303779IS911 ORF1
SBO_20723190.606163IS1 ORF2
SBO_2073323-0.106318IS1 encoded protein
SBO_20743220.877669protein encoded within IS
SBO_20753220.799792protein encoded within IS
SBO_20762220.039116protein encoded within IS
SBO_20771250.423934tail fiber assembly protein
SBO_20782230.587360IS600 ORF2
SBO_20794250.035351insertion element IS2 transposase InsD
SBO_20803230.208162insertion sequence 2 OrfA protein
SBO_20812220.150833IS600 ORF1
SBO_20823200.707277phage tail fiber protein
34SBO_2098SBO_2122Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_20980183.272907ecotin
SBO_2099-1193.837808ferredoxin-type protein
SBO_21000203.452311assembly protein for periplasmic nitrate
SBO_2101-1183.530579nitrate reductase catalytic subunit
SBO_2102-1173.432519quinol dehydrogenase periplasmic component
SBO_21030142.786513quinol dehydrogenase membrane component
SBO_21041162.559944citrate reductase cytochrome c-type subunit
SBO_21050162.364544cytochrome c-type protein NapC
SBO_21060183.790225cytochrome c biogenesis protein CcmA
SBO_21071203.630885heme exporter protein B
SBO_21080213.263470heme exporter protein C
SBO_21090192.794395heme exporter protein C
SBO_21100192.580175cytochrome c-type biogenesis protein CcmE
SBO_21110182.314554cytochrome c-type biogenesis protein
SBO_2112-115-1.101257disulfide oxidoreductase
SBO_2113016-1.516761subunit of heme lyase
SBO_2114120-2.231758transcriptional regulator NarP
SBO_2115221-2.246247IS1 ORF2
SBO_2116323-2.785560IS1 encoded protein
SBO_2117322-2.772326ABC transporter ATP-binding protein
SBO_2118322-1.864170hypothetical protein
SBO_2119124-1.769360protein encoded by prophage
SBO_2120119-2.345601hypothetical protein
SBO_2121118-2.910573hypothetical protein
SBO_2122121-3.358492hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_2114HTHFIS613e-13 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 61.0 bits (148), Expect = 3e-13
Identities = 21/113 (18%), Positives = 46/113 (40%), Gaps = 2/113 (1%)

Query: 9 VMIVDDHPLMRRGVRQLLELDPSFEVVAEAGDGASAIDLANRLDIDVILLDLNMKGMSGL 68
+++ DD +R + Q L ++V + A+ D D+++ D+ M +
Sbjct: 6 ILVADDDAAIRTVLNQALS-RAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 69 DTLNALRRDGVTAQIIILTVSDASSDVFALIDAGADGYLLKDSDPEVLLEAIR 121
D L +++ +++++ + + GA YL K D L+ I
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_2117PRTACTNFAMLY2021e-56 Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 202 bits (515), Expect = 1e-56
Identities = 139/526 (26%), Positives = 220/526 (41%), Gaps = 59/526 (11%)

Query: 291 APAMLVGKVVVSEGASFRTHGAVDTSKADVSLENSVWAIIADITTTNQNTLLNLANLAMS 350
P +G + V+ + R GA + +S++N+ W + + N+ L ++
Sbjct: 405 IPGTSIGPLDVALASQARWTGATRAVDS-LSIDNATWVMTDNS---------NVGALRLA 454

Query: 351 DANVIMMDEPVTRSSVTASAENFITLTTNTLSGNGNFYMRTDMANHQSDQLNVTGQATGD 410
+ +P A A F LT NTL+G+G F M SD+L V A+G
Sbjct: 455 SDGSVDFQQP-------AEAGRFKVLTVNTLAGSGLFRMNVFADLGLSDKLVVMQDASGQ 507

Query: 411 FKIFVTDTGASPAAGDSLTLVTT-GGGDAAFTLGNAGGVVDIGTYEYTLLDNGNHSWSLA 469
+++V ++G+ PA+ ++L LV T G A FTL N G VDIGTY Y L NGN WSL
Sbjct: 508 HRLWVRNSGSEPASANTLLLVQTPLGSAATFTLANKDGKVDIGTYRYRLAANGNGQWSLV 567

Query: 470 ENRAQITPSTTDVLNMAAAQPL-----------------------------------VFD 494
+A P QP ++
Sbjct: 568 GAKAPPAPKPAPQPGPQPPQPPQPQPEAPAPQPPAGRELSAAANAAVNTGGVGLASTLWY 627

Query: 495 AELDTVRERLGSVKSVSYDTAMWSSAINTRNNVTTDAGAGFEQTLTGLTLGIDSRFSREE 554
AE + + +RLG ++ W R + AG F+Q + G LG D +
Sbjct: 628 AESNALSKRLGELRLNPDAGGAWGRGFAQRQQLDNRAGRRFDQKVAGFELGADHAVAVAG 687

Query: 555 SSTIRGLFFGYSHSDIGFDRGGKGNIDSYTLGAYAGWEHQNGAYVDGVVKVDRFANTIHG 614
G GY+ D GF G G+ DS +G YA + +G Y+D ++ R N
Sbjct: 688 GRWHLGGLAGYTRGDRGFTGDGGGHTDSVHVGGYATYIADSGFYLDATLRASRLENDFKV 747

Query: 615 KMSNGATAFGDYNSNGAGAHVESGFRW-VDGLWSVRPYLAFTGFTTDGQDYTLSNGMR-- 671
S+G G Y ++G GA +E+G R+ W + P F G Y +NG+R
Sbjct: 748 AGSDGYAVKGKYRTHGVGASLEAGRRFTHADGWFLEPQAELAVFRAGGGAYRAANGLRVR 807

Query: 672 ADVGNTRILRAEAGTAVSYHMDLQNGTTLEPWLKAAVRQEYADSNHVKVNDDGKFNNDVA 731
+ G++ + R G V ++L G ++P++KA+V QE+ + V N ++
Sbjct: 808 DEGGSSVLGR--LGLEVGKRIELAGGRQVQPYIKASVLQEFDGAGTVHTNGIAH-RTELR 864

Query: 732 GTPGVYQAGIRSSFTPTLSGHLSVSYGNGAGVESPWNTQAGVVWTF 777
GT G+ ++ S + S Y G + PW AG +++
Sbjct: 865 GTRAELGLGMAAALGRGHSLYASYEYSKGPKLAMPWTFHAGYRYSW 910


35SBO_2178SBO_2187Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_2178220-3.872750galactose/methyl galaxtoside transporter
SBO_2179123-2.746604beta-methylgalactoside transporter inner
SBO_2181227-2.417378insertion element IS2 transposase InsD
SBO_2182121-3.731978insertion sequence 2 OrfA protein
SBO_2184427-2.909116LysR family transcriptional regulator
SBO_2185431-2.538502hypothetical protein
SBO_2186330-1.551950IS1 encoded protein
SBO_2187226-1.798076IS1 ORF2
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_2178PF05272320.007 Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 32.0 bits (72), Expect = 0.007
Identities = 21/74 (28%), Positives = 28/74 (37%), Gaps = 17/74 (22%)

Query: 24 PGVKALDNVNLKVRPHSIHALMGENGAGKSTLLKCLFGIYQKDSGTILFQGKEIDFHSAK 83
PG K D + L G G GKSTL+ L G+ F D + K
Sbjct: 591 PGCKF-DYSVV---------LEGTGGIGKSTLINTLVGLD-------FFSDTHFDIGTGK 633

Query: 84 EALENGISMVHQEL 97
++ E +V EL
Sbjct: 634 DSYEQIAGIVAYEL 647


36SBO_2309SBO_2316Y        NNGenomic Island
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_23090273.407890NADH dehydrogenase subunit N
SBO_23100282.792636NADH dehydrogenase subunit M
SBO_2311-1293.377724NADH dehydrogenase subunit L
SBO_2312-1283.096259NADH dehydrogenase subunit K
SBO_23130283.159814NADH dehydrogenase subunit J
SBO_23141263.366152NADH dehydrogenase subunit I
SBO_23150253.155896NADH dehydrogenase subunit H
SBO_23160243.201044NADH dehydrogenase subunit G
37SBO_2381SBO_2399Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_23812191.024924hypothetical protein
SBO_23823201.354284long-chain fatty acid outer membrane
SBO_23833242.423874ISSfl2 ORF
SBO_2384-118-0.379789IS1 ORF2
SBO_2385-1190.235463insertion element IS2 transposase InsD
SBO_2386-117-0.717249insertion sequence 2 OrfA protein
SBO_2387-120-3.682299IS1 ORF2
SBO_2388024-5.398850lipoprotein
SBO_2389027-6.745601transport
SBO_2391032-8.874870*IS911 ORF2
SBO_2392035-9.241783D-serine dehydratase
SBO_2393037-10.131489multidrug resistance protein Y
SBO_2394036-9.260148multidrug resistance protein K
SBO_2395034-8.554540DNA-binding transcriptional activator EvgA
SBO_2396034-8.166304hybrid sensory histidine kinase in two-component
SBO_2397233-6.614607hypothetical protein
SBO_2398233-6.440386transporter YfdV
SBO_2399128-5.284433oxalyl-CoA decarboxylase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_2388VACJLIPOPROT407e-148 VacJ lipoprotein signature.
		>VACJLIPOPROT#VacJ lipoprotein signature.

Length = 251

Score = 407 bits (1048), Expect = e-148
Identities = 250/251 (99%), Positives = 250/251 (99%)

Query: 1 MKLRLSALALGTTLLVGCASSGTDQQGRSDPLEGFNRTMYNFNFNVLDPYIVRPVAVAWR 60
MKLRLSALALGTTLLVGCASSGTDQQGRSDPLEGFNRTMYNFNFNVLDPYIVRPVAVAWR
Sbjct: 1 MKLRLSALALGTTLLVGCASSGTDQQGRSDPLEGFNRTMYNFNFNVLDPYIVRPVAVAWR 60

Query: 61 DYVPQPARNGLSNFTGNLEEPAVMVNYFLQGDPYQGMVHFTRFFLNTILGMGGFIDVAGM 120
DYVPQPARNGLSNFTGNLEEPAVMVNYFLQGDPYQGMVHFTRFFLNTILGMGGFIDVAGM
Sbjct: 61 DYVPQPARNGLSNFTGNLEEPAVMVNYFLQGDPYQGMVHFTRFFLNTILGMGGFIDVAGM 120

Query: 121 ANPKLQRTEPHRFGSTLGHYGVGYGPYVQLPFYGSFTLRDDGGDMADGLYPVLSWLTWPM 180
ANPKLQRTEPHRFGSTLGHYGVGYGPYVQLPFYGSFTLRDDGGDMAD LYPVLSWLTWPM
Sbjct: 121 ANPKLQRTEPHRFGSTLGHYGVGYGPYVQLPFYGSFTLRDDGGDMADALYPVLSWLTWPM 180

Query: 181 SVGKWTLEGIETRAQLLDSDGLLRQSSDPYIMVREAYFQRHDFIANGGELKPQENPNAQA 240
SVGKWTLEGIETRAQLLDSDGLLRQSSDPYIMVREAYFQRHDFIANGGELKPQENPNAQA
Sbjct: 181 SVGKWTLEGIETRAQLLDSDGLLRQSSDPYIMVREAYFQRHDFIANGGELKPQENPNAQA 240

Query: 241 IQDDLKDIDSE 251
IQDDLKDIDSE
Sbjct: 241 IQDDLKDIDSE 251


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_2393TCRTETB1214e-32 Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 121 bits (306), Expect = 4e-32
Identities = 92/404 (22%), Positives = 167/404 (41%), Gaps = 17/404 (4%)

Query: 19 VTIALSLATFMQMLDSTISNVAIPTISGFLGASTDEGTWVITSFGVANAIAIPVTGRLAQ 78
+ I L + +F +L+ + NV++P I+ WV T+F + +I V G+L+
Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74

Query: 79 RIGELRLFLLSVTFFSLSSLMCSLS-TNLDVLIFFRVVQGLMAGPLIPLSQSLLLRNYPP 137
++G RL L + S++ + + +LI R +QG A L ++ R P
Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134

Query: 138 EKRTFALALWSMTVIIAPICGPILGGYICDNFSWGWIFLINVPMGIIVLTLCLTLLKGRE 197
E R A L V + GP +GG I W +L+ +PM I+ L L +E
Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKE 192

Query: 198 TETSPVKMNLPGLTLLVLGVGGLQIMLDKGRDLDWFNSSTIIILTVVSVISLISLVIWES 257
++ G+ L+ +G+ + ML F +S I +VSV+S + V
Sbjct: 193 VRIKG-HFDIKGIILMSVGI--VFFML--------FTTSYSISFLIVSVLSFLIFVKHIR 241

Query: 258 TSENPILDLSLFKSRNFTIGIVSITCAYLFYSGAIVLMPQLLQETMGYNAIWAGLAYAPI 317
+P +D L K+ F IG++ + +G + ++P ++++ + G
Sbjct: 242 KVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFP 301

Query: 318 GIMPLLIS-PLIGRYGNKIDMRLLVTFSFLMYAVCYYWRSVTFMPTIDFTGIILPQFFQG 376
G M ++I + G ++ ++ +V + S T F II+ G
Sbjct: 302 GTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGG 361

Query: 377 FAVACFFLPLTTISFSGLPDNKFANASSMSNFFRTLSGSVGTSL 420
+ ++TI S L + S+ NF LS G ++
Sbjct: 362 LSFTK--TVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAI 403


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_2394RTXTOXIND765e-17 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 75.6 bits (186), Expect = 5e-17
Identities = 62/412 (15%), Positives = 121/412 (29%), Gaps = 96/412 (23%)

Query: 13 RRKYFSLLAVVLFIAFSGAYVYWSMELEDMISTDDAYVT-GNADPISAQVSGSVTVVNHK 71
RR ++ F+ + ++E + + + G + I + V + K
Sbjct: 55 RRPRLVAYFIMGFLVIAFILSVLG-QVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVK 113

Query: 72 DTNYVRQGDILVSLDKTDATIALNKA---------------------------------- 97
+ VR+GD+L+ L A K
Sbjct: 114 EGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDE 173

Query: 98 ------------------KNNLANIVRQTNKLYLQDKQYSAEVASARIQ---YQQSLEDY 136
K + Q + L + AE + + Y+
Sbjct: 174 PYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVE 233

Query: 137 NRRV----PLAKQGVISKE----------TLEHTKDTLISSKAALNAAIQAYKANKALVM 182
R+ L + I+K + S + + I + K LV
Sbjct: 234 KSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVT 293

Query: 183 N-------TPLNR-QPQVVEAADATKEAWLALKRTDIKSPVTGYIAQRSVQ-VGETVSPG 233
L + + + + + I++PV+ + Q V G V+
Sbjct: 294 QLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTA 353

Query: 234 QSLMAVVPARQ-MWVNANFKETQLTDVRIGQSVNIISDLYGENVVFHGRVTGINMGTGNA 292
++LM +VP + V A + + + +GQ+ I + F G +G
Sbjct: 354 ETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVE------AFPYTRYGYLVGK--- 404

Query: 293 FSLLPAQNATGNWIKIVQRVPVEVSLDPKELMEH----PLRIGLSMTATIDT 340
+ + +V V +S++ L PL G+++TA I T
Sbjct: 405 VKNINLDAIEDQRLGLVFNVI--ISIEENCLSTGNKNIPLSSGMAVTAEIKT 454


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_2395HTHFIS493e-09 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 49.1 bits (117), Expect = 3e-09
Identities = 22/148 (14%), Positives = 53/148 (35%), Gaps = 31/148 (20%)

Query: 4 IIIDDHPLAIAAIRNLLIKNDIEILAELTEGGSAVQRVETLKPDIVIIDVDIPGVNGIQV 63
++ DD + L + ++ + + + + D+V+ DV +P N +
Sbjct: 7 LVADDDAAIRTVLNQALSRAGYDVRIT-SNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 64 LETLRKRQYSGIIIIVSAKNDHFYGKHCADAGANGFVSKKEGMNNIIAAIEAAKNGYCYF 123
L ++K + ++++SA+N + AI+A++ G +
Sbjct: 66 LPRIKKARPDLPVLVMSAQNT------------------------FMTAIKASEKGAYDY 101

Query: 124 ---PFSLNRFVGSLTSDQQKLDSLSKQE 148
PF L + + L ++
Sbjct: 102 LPKPFDLTE---LIGIIGRALAEPKRRP 126


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_2396HTHFIS792e-17 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.5 bits (196), Expect = 2e-17
Identities = 30/105 (28%), Positives = 51/105 (48%)

Query: 960 SILIADDHPTNRLLLKRQLNLLGYDVDEATDGVQALHKVSMQHYDLLITDVNMPNMDGFE 1019
+IL+ADD R +L + L+ GYDV ++ ++ DL++TDV MP+ + F+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 1020 LTRKLREQNSSLPIWGLTANAQANEREKGLSCGMNLCLFKPLTLD 1064
L ++++ LP+ ++A K G L KP L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLT 109


38SBO_2437SBO_2446Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_2437219-2.520175sulfate transport protein CysZ
SBO_2438320-1.558032cysteine synthase A
SBO_2439322-1.698470PTS system phosphohistidinoprotein-hexose
SBO_2440117-0.713678phosphoenolpyruvate-protein phosphotransferase
SBO_24410181.987975PTS system glucose-specific transporter
SBO_24420223.128201IS4 orf
SBO_2443-1223.658673IS4 orf
SBO_2445-1243.040220hypothetical protein
SBO_2446-1233.541540cysteine synthase B
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_2440PHPHTRNFRASE7480.0 Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase

signature.
Length = 572

Score = 748 bits (1933), Expect = 0.0
Identities = 276/571 (48%), Positives = 386/571 (67%), Gaps = 2/571 (0%)

Query: 1 MISGILASPGIAFGKALLLKEDEIVIDRKKISADQVDQEVERFLSGRAKASAQLETIKTK 60
I+GI AS G+A KA + E + I++ I V E+E+ + K+ +L IK +
Sbjct: 4 KITGIAASSGVAIAKAFIHLEPNVDIEKTSI--TDVSTEIEKLTAALEKSKEELRAIKDQ 61

Query: 61 AGETFGEEKEAIFEGHIMLLEDEELEQEIIALIKDKHMTADAAAHEVIEGQASALEELDD 120
+ G +K IF H+++L+D EL I I+++ M A+ A EV + S E +D+
Sbjct: 62 TEASMGADKAEIFAAHLLVLDDPELVDGIKGKIENEQMNAEYALKEVSDMFVSMFESMDN 121

Query: 121 EYLKERAADVRDIGKRLLRNILGLKIIDLSAIQDEVILVAADLTPSETAQLNLKKVLGFI 180
EY+KERAAD+RD+ KR+L +++G++ L+ I +E +++A DLTPS+TAQLN + V GF
Sbjct: 122 EYMKERAADIRDVSKRVLGHLIGVETGSLATIAEETVIIAEDLTPSDTAQLNKQFVKGFA 181

Query: 181 TDAGGRTSHTSIMARSLELPAIVGTGSVTSQVKNDDYLILDAVNNQVYVNPTNEVIDKMR 240
TD GGRTSH++IM+RSLE+PA+VGT VT ++++ D +I+D + V VNPT E +
Sbjct: 182 TDIGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVNPTEEEVKAYE 241

Query: 241 AVQEQVASEKAELAKLKDLPAITLDGHQVEVCANIGTVRDVEGAERNGAEGVGLYRTEFL 300
+ +K E AKL P+ T DG VE+ ANIGT +DV+G NG EG+GLYRTEFL
Sbjct: 242 EKRAAFEKQKQEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTEFL 301

Query: 301 FMDRDALPTEEEQFAAYKAVAEACGSQAVIVRTMDIGGDKELPYMNFPKEENPFLGWRAI 360
+MDRD LPTEEEQF AYK V + + V++RT+DIGGDKEL Y+ PKE NPFLG+RAI
Sbjct: 302 YMDRDQLPTEEEQFEAYKEVVQRMDGKPVVIRTLDIGGDKELSYLQLPKELNPFLGFRAI 361

Query: 361 RIAMDRKEILRDQLRAILRASAFGKLRIMFPMIISVEEVRALRKEIEIYKQELRDEGKAF 420
R+ +++++I R QLRA+LRAS +G L++MFPMI ++EE+R + ++ K +L EG
Sbjct: 362 RLCLEKQDIFRTQLRALLRASTYGNLKVMFPMIATLEELRQAKAIMQEEKDKLLSEGVDV 421

Query: 421 DESIEIGVMVETPAAATIARHLAKEVDFFSIGTNDLTQYTLAVDRGNDMISHLYQPMSPS 480
+SIE+G+MVE P+ A A AKEVDFFSIGTNDL QYT+A DR N+ +S+LYQP P+
Sbjct: 422 SDSIEVGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAADRMNERVSYLYQPYHPA 481

Query: 481 VLNLIKQVIDASHAEGKWTGMCGELAGDERATLLLLGMGLDEFSMSAISIPRIKKIIRNT 540
+L L+ VI A+H+EGKW GMCGE+AGDE A LLLG+GLDEFSMSA SI + +
Sbjct: 482 ILRLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLDEFSMSATSILPARSQLLKL 541

Query: 541 NFEDAKVLAEQALAQPTTDELMTLVNKFIEE 571
+ E+ K A++AL T +E+ LV K +
Sbjct: 542 SKEELKPFAQKALMLDTAEEVEQLVKKTYLK 572


39SBO_2463SBO_2473Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_24630183.666470hypothetical protein
SBO_24640173.928655hypothetical protein
SBO_24650184.361943ethanolamine ammonia-lyase small subunit
SBO_24661184.081949IS1 ORF2
SBO_24671194.541010IS1 encoded protein
SBO_24682205.131975ethanolamine ammonia-lyase heavy subunit
SBO_24692185.439345ethanolamine utilization
SBO_24703204.952295ethanolamine utilization
SBO_24712183.766902detox protein
SBO_24722193.191990detox protein
SBO_24733162.630702phosphotransacetylase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_2469SHAPEPROTEIN496e-09 Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein

signature.
Length = 347

Score = 49.4 bits (118), Expect = 6e-09
Identities = 31/116 (26%), Positives = 50/116 (43%), Gaps = 9/116 (7%)

Query: 63 VRDGIVWDFFGAVTIVRRHLD-TLEQQFGRRFSHAATSFPPGTDP---RISINVLESAGL 118
++DG++ DFF +++ + F R P G R + AG
Sbjct: 76 MKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGA 135

Query: 119 EVSHVLDEPTAVA-----ELLQLDNAGVVDIGGGTTGIAIVKKGKVTYSADEATGG 169
+++EP A A + + + VVDIGGGTT +A++ V YS+ GG
Sbjct: 136 REVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNGVVYSSSVRIGG 191


40SBO_2523SBO_2530Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_25232170.377863exopolyphosphatase
SBO_25242200.325923IS911 ORF2
SBO_25252170.580986IS629 ORF2
SBO_25262260.853271IS629 ORF1
SBO_25282200.791448IS1 ORF2
SBO_25292210.496350IS1 encoded protein
SBO_25302210.625511hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_2530cloacin300.007 Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 29.7 bits (66), Expect = 0.007
Identities = 40/157 (25%), Positives = 66/157 (42%), Gaps = 18/157 (11%)

Query: 34 QQGKNEEQRQHDEW-------VAERNREIQQEKQRRANAQVAANK-RAATAAANKKARQD 85
+Q ++EE R+ EW AERN E + + +AN VA N+ R A A +R+
Sbjct: 297 KQRQDEENRRQQEWDATHPVEAAERNYERARAELNQANEDVARNQERQAKAVQVYNSRKS 356

Query: 86 KLD------AEASADKKRDQSYEDELRS---LEIQKQKLALAKEEARVKRENEFIDQELK 136
+LD A+A A+ K+ + + + Q L + + V + D K
Sbjct: 357 ELDAANKTLADAIAEIKQFNRFAHDPMAGGHRMWQMAGLKAQRAQTDVNNKQAAFDAAAK 416

Query: 137 HKAAQTDVVQSEADANRNMTEGGRDLMKSVGKAEENK 173
K + D S A +R E + ++ E+NK
Sbjct: 417 EK-SDADAALSSAMESRKKKEDKKRSAENNLNDEKNK 452


41SBO_2652SBO_2658Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_2652218-1.389416anti-terminator regulatory protein
SBO_2653221-0.761136flavoprotein
SBO_2654120-0.676366transporter
SBO_2655323-1.722442IS1 ORF2
SBO_2656222-1.597449hypothetical protein
SBO_2657224-1.683875hypothetical protein
SBO_2658226-0.760097IS4 orf
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_2657cloacin346e-04 Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 33.5 bits (76), Expect = 6e-04
Identities = 12/34 (35%), Positives = 15/34 (44%)

Query: 198 SGKSYHSDNSGSGGGSSGGGFSGGGGSSGGGGAS 231
SG H G G G SGGG +GG ++
Sbjct: 50 SGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSA 83



Score = 33.5 bits (76), Expect = 6e-04
Identities = 15/36 (41%), Positives = 20/36 (55%)

Query: 197 ASGKSYHSDNSGSGGGSSGGGFSGGGGSSGGGGASG 232
+ G + S+N+ GGGS G GGG G GG +G
Sbjct: 34 SDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69



Score = 32.0 bits (72), Expect = 0.002
Identities = 12/30 (40%), Positives = 12/30 (40%)

Query: 203 HSDNSGSGGGSSGGGFSGGGGSSGGGGASG 232
GGGS G G G S GG G G
Sbjct: 50 SGSGIHWGGGSGHGNGGGNGNSGGGSGTGG 79



Score = 31.2 bits (70), Expect = 0.003
Identities = 12/31 (38%), Positives = 13/31 (41%)

Query: 199 GKSYHSDNSGSGGGSSGGGFSGGGGSSGGGG 229
G G G +GGG GG SG GG
Sbjct: 49 GSGSGIHWGGGSGHGNGGGNGNSGGGSGTGG 79



Score = 30.5 bits (68), Expect = 0.007
Identities = 11/27 (40%), Positives = 12/27 (44%)

Query: 205 DNSGSGGGSSGGGFSGGGGSSGGGGAS 231
SGSG GG G GG +G G
Sbjct: 48 GGSGSGIHWGGGSGHGNGGGNGNSGGG 74



Score = 29.3 bits (65), Expect = 0.016
Identities = 11/34 (32%), Positives = 14/34 (41%)

Query: 199 GKSYHSDNSGSGGGSSGGGFSGGGGSSGGGGASG 232
G S + G G G GG +G G G G +
Sbjct: 48 GGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNL 81


42SBO_2790SBO_2796Y        NNGenomic Island
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_27901183.375792hypothetical protein
SBO_27911213.778897hypothetical protein
SBO_2792-1254.245635hydrogenase assembly chaperone
SBO_2793-1254.180607hydrogenase nickel incorporation protein HypB
SBO_2794-2254.396070hydrogenase nickel incorporation protein
SBO_2795-2254.412555membrane-spanning protein of hydrogenase 3
SBO_2796-1264.024815membrane-spanning protein of hydrogenase 3
43SBO_2846SBO_2868Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_2846-122-3.565113IS1 encoded protein
SBO_2847-120-3.156198IS1 ORF2
SBO_2848125-3.948165hypothetical protein
SBO_2849-120-3.721748hypothetical protein
SBO_2850019-2.585409DNA binding protein
SBO_2851318-1.637304hypothetical protein
SBO_28523190.898719hypothetical protein
SBO_28530150.919004hypothetical protein
SBO_28541140.114387LysM domain/BON superfamily protein
SBO_28551150.324403DNA-binding transcriptional regulator CsiR
SBO_28560140.297333gamma-aminobutyrate transporter
SBO_2859-1150.167738hydroxyglutarate oxidase
SBO_2860-313-1.104943hypothetical protein
SBO_2861015-0.646138hypothetical protein
SBO_28622190.823413IS1 encoded protein
SBO_28631220.949677IS1 ORF2
SBO_28650221.067427hypothetical protein
SBO_28661231.530173hydrogenase 2 small subunit
SBO_28672231.599742hydrogenase 2 protein HybA
SBO_28682211.355610hydrogenase 2 b cytochrome subunit
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_2861IGASERPTASE270.015 IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 26.6 bits (58), Expect = 0.015
Identities = 23/88 (26%), Positives = 35/88 (39%), Gaps = 20/88 (22%)

Query: 2 NINHSPHDGLVIINKGNEEVEGTWPNK-------------LQPGIYKNMGSNSVNI---- 44
+++ +D L I KG VEGT NK Q SV I
Sbjct: 438 KVHNPQYDRLAKIGKGTLIVEGTGDNKGSLKVGDGTVILKQQTNGSGQHAFASVGIVSGR 497

Query: 45 ---IINNTRKIIPPGKVFTLRGGTLNIN 69
++N+ +++ P F RGG L++N
Sbjct: 498 STLVLNDDKQVDPNSIYFGFRGGRLDLN 525


44SBO_2978SBO_2987Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_2978012-3.244633hypothetical protein
SBO_2979014-4.283259formate acetyltransferase 3
SBO_2980020-6.800466propionate/acetate kinase
SBO_2981017-6.560552threonine/serine transporter TdcC
SBO_2982130-11.112760threonine dehydratase
SBO_2983-135-12.259962DNA-binding transcriptional activator TdcA
SBO_2984127-8.065613DNA-binding transcriptional activator TdcR
SBO_2985020-4.892447IS1 encoded protein
SBO_2986018-3.762795IS1 ORF2
SBO_2987018-3.739370hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_2980ACETATEKNASE5360.0 Acetate kinase family signature.
		>ACETATEKNASE#Acetate kinase family signature.

Length = 400

Score = 536 bits (1383), Expect = 0.0
Identities = 173/397 (43%), Positives = 254/397 (63%), Gaps = 11/397 (2%)

Query: 11 VLVINCGSSSIKFSVLDASDCEVLMSGIADGINSENAFLSVN-GGEPAP--LAHHSYEGA 67
+LVINCGSSS+K+ ++++ D VL G+A+ I ++ L+ N GE ++ A
Sbjct: 3 ILVINCGSSSLKYQLIESKDGNVLAKGLAERIGINDSLLTHNANGEKIKIKKDMKDHKDA 62

Query: 68 LKAIAFELEKRNLN-----DSVALIGHRIAHGGSIFTESAIITDEVIDNIRRVSPLAPLH 122
+K + L + + +GHR+ HGG FT S +ITD+V+ I LAPLH
Sbjct: 63 IKLVLDALVNSDYGVIKDMSEIDAVGHRVVHGGEYFTSSVLITDDVLKAITDCIELAPLH 122

Query: 123 NYANLSGIESAQQLFPGVTQVAVFDTSFHQTMAPEAYLYGLPWKYYEELGVRRYGFHGTS 182
N AN+ GI++ Q+ P V VAVFDT+FHQTM AYLY +P++YY + +R+YGFHGTS
Sbjct: 123 NPANIEGIKACTQIMPDVPMVAVFDTAFHQTMPDYAYLYPIPYEYYTKYKIRKYGFHGTS 182

Query: 183 HRYVSQRAHSLLNLAEDDSGLVVAHLGNGASICAVRNGQSVDTSMGMTPLEGLMMGTRSG 242
H+YVSQRA +LN + ++ HLGNG+SI AV+NG+S+DTSMG TPLEGL MGTRSG
Sbjct: 183 HKYVSQRAAEILNKPIESLKIITCHLGNGSSIAAVKNGKSIDTSMGFTPLEGLAMGTRSG 242

Query: 243 DVDFGAMSWVASQTNQSLGDLERVVNKESGLLGISGLSSDLR-VLEKAWHEGHERAQLAI 301
+D +S++ + N S ++ ++NK+SG+ GISG+SSD R + + A+ G +RAQLA+
Sbjct: 243 SIDPSIISYLMEKENISAEEVVNILNKKSGVYGISGISSDFRDLEDAAFKNGDKRAQLAL 302

Query: 302 KTFVHRIARHIAGHAASLRRLDGIIFTGGIGENSSLIRRLVMEHLAVLGVEIDTEMNNRS 361
F +R+ + I +AA++ +D I+FT GIGEN IR +++ L LG ++D E N
Sbjct: 303 NVFAYRVKKTIGSYAAAMGGVDVIVFTAGIGENGPEIREFILDGLEFLGFKLDKEKNKVR 362

Query: 362 NSCGERIVSSENARVICAVIPTNEEKMIALDAIHLGK 398
E I+S+ +++V V+PTNEE MIA D + +
Sbjct: 363 GE--EAIISTADSKVNVMVVPTNEEYMIAKDTEKIVE 397


45SBO_3008SBO_3027Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_30081193.406611outer membrane lipoprotein
SBO_30090203.755381IS1 encoded protein
SBO_30100214.451292IS1 ORF2
SBO_30130205.058050type II secretion protein
SBO_30141184.135880type II secretion protein
SBO_30150153.861390type II secretion protein
SBO_30161194.062584type II secretion protein
SBO_3017-1173.294450type II secretion protein
SBO_3018-2171.642912type II secretion protein
SBO_3019-2140.116729type II secretion protein
SBO_3020-113-0.751088GspL-like protein
SBO_3021014-1.987606general secretion pathway protein YghD
SBO_3023-213-1.353538*transporter
SBO_3025-213-1.186571transposase
SBO_3026-114-0.939220protein NupG
SBO_3027217-0.132153murein transglycosylase C
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3014BCTERIALGSPF451e-160 Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F

signature.
Length = 408

Score = 451 bits (1162), Expect = e-160
Identities = 224/406 (55%), Positives = 301/406 (74%), Gaps = 1/406 (0%)

Query: 1 MALFYYQALERNGRKTKGMIEADSARHARQLLRGKELIPVHI-EARMNASAGGMLQRRRH 59
MA ++YQAL+ G+K +G EADSAR ARQLLR + L+P+ + E R + G
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 60 AHRRVAAADLALFTRQLATLVQAAMPLETCLQAVSEQSEKLHVKSLGMALRSRIQEGYTL 119
R++ +DLAL TRQLATLV A+MPLE L AV++QSEK H+ L A+RS++ EG++L
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 120 SDSLREHPRVFDSLFCSMVAAGEKSGHLDVVLNRLADYTEQRQRLKSRLLQAMLYPLVML 179
+D+++ P F+ L+C+MVAAGE SGHLD VLNRLADYTEQRQ+++SR+ QAM+YP V+
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180

Query: 180 VVATGVVTILLTAVVPKIIEQFDHLGHALPSSTRALIAMSDALQASGVYWLAGLLALLVL 239
VVA VV+ILL+ VVPK++EQF H+ ALP STR L+ MSDA++ G + L LLA +
Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240

Query: 240 GQWLLKNPTMRLRWDKTLLRLPVTGRVARGLNTARFSRTLSILTASSVPLLEGIQTAAAV 299
+ +L+ R+ + + LL LP+ GR+ARGLNTAR++RTLSIL AS+VPLL+ ++ + V
Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300

Query: 300 SANRYVEQQLLLAADRVREGSSLRAALAELRLFPPMMLYIIASGEQSGELETMLEQAAVN 359
+N Y +L LA D VREG SL AL + LFPPMM ++IASGE+SGEL++MLE+AA N
Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360

Query: 360 QEREFDTQVGLALGLFEPALVVMMAGVVLFIVIAILEPMLQLNNMV 405
Q+REF +Q+ LALGLFEP LVV MA VVLFIV+AIL+P+LQLN ++
Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3015BCTERIALGSPG2176e-76 Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 217 bits (553), Expect = 6e-76
Identities = 89/146 (60%), Positives = 109/146 (74%), Gaps = 3/146 (2%)

Query: 6 RTHKPRAGFTLLEVMVVIVILGVLASLVVPNLLGNKEKANRQKAISDIVALENALDMYRL 65
R + GFTLLE+MVVIVI+GVLASLVVPNL+GNKEKA++QKA+SDIVALENALDMY+L
Sbjct: 2 RATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKL 61

Query: 66 DNGRYPTTEQGLEALIQQPANMADSRNYRTGGYIKRLPKDPWGNDYQYLSPGEKGLFDVY 125
DN YPTT QGLE+L++ P + NY GYIKRLP DPWGNDY ++PGE G +D+
Sbjct: 62 DNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDLL 121

Query: 126 TLGADGQENGEGAGADIGNWNLQEFQ 151
+ G DG+ E DI NW L + +
Sbjct: 122 SAGPDGEMGTED---DITNWGLSKKK 144


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3016BCTERIALGSPH552e-12 Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H

signature.
Length = 170

Score = 55.0 bits (132), Expect = 2e-12
Identities = 31/159 (19%), Positives = 54/159 (33%), Gaps = 28/159 (17%)

Query: 1 MLVIFLIGLASAGVVQTFATDSEPPAKKAAQDFLTRFAQFKDRAVIEGQTLGVLIDPPGY 60
ML++ L+G+++ V+ F + A + F + + R + GQ GV + P +
Sbjct: 12 MLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTGQFFGVSVHPDRW 71

Query: 61 QFMQRRQGQWLPVSATRLSAQVTVPKQVQMLLQPGSDIWQKEYALELQRRRL----TLHD 116
QF+ + P D W L L+ R+ ++
Sbjct: 72 QFLVLEARDGADPA-------------------PADDGWSGYRWLPLRAGRVATSGSIAG 112

Query: 117 IELEL-----QKEAKKKTPQIRFSPFEPATPFTLRFYSA 150
+L L + P + P TPF L A
Sbjct: 113 GKLNLAFAQGEAWTPGDNPDVLIFPGGEMTPFRLTLGEA 151


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3017BCTERIALGSPH368e-06 Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H

signature.
Length = 170

Score = 36.5 bits (84), Expect = 8e-06
Identities = 13/24 (54%), Positives = 19/24 (79%)

Query: 2 KRGFTLLEVMLVLAIFALAATAVL 25
+RGFTLLE+ML+L + ++A VL
Sbjct: 3 QRGFTLLEMMLILLLMGVSAGMVL 26


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3026TCRTETA300.016 Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 30.2 bits (68), Expect = 0.016
Identities = 36/239 (15%), Positives = 76/239 (31%), Gaps = 18/239 (7%)

Query: 174 SHMQLYIGAALSAILVLFTLTLPHIPVAKQQANQSWTTLLGLDAFALFKNKRMAIFFIFS 233
H + AAL+ + L L + + L+ A F+ R
Sbjct: 159 PHAPFFAAAALNGLNFLTGCFLLPES---HKGERRPLRREALNPLASFRWARGMTVVAAL 215

Query: 234 MLLGAELQITNMFGNTFLHSFDKDPMFASSFIVQHASIIMSISQISETLF-ILTIPFFLS 292
M + +Q+ F +D + + I ++ I +L + +
Sbjct: 216 MAVFFIMQLVGQVPAALWVIFGEDRFHWDATTI---GISLAAFGILHSLAQAMITGPVAA 272

Query: 293 RYGIKNVMMISIVAWILRFALFAYGDPTPFGTVLLVLSMIVYGCAFDFFNISGSVFVEKE 352
R G + +M+ ++A + L A+ + ++V + + + ++
Sbjct: 273 RLGERRALMLGMIADGTGYILLAF-----ATRGWMAFPIMVLLASGGIGMPALQAMLSRQ 327

Query: 353 VSPAIRASAQGMFLMMTNGFGCILGGIVSGKVVEMYTQNGITDWQ-TVWLIFAGYSVVL 410
V + QG +T+ L IV + IT W W+ A ++
Sbjct: 328 VDEERQGQLQGSLAALTS-----LTSIVGPLLFTAIYAASITTWNGWAWIAGAALYLLC 381


46SBO_3121SBO_3127Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_3121225-5.646949lipoprotein
SBO_3123226-6.489392*IS1 encoded protein
SBO_3124224-5.960320IS1 ORF2
SBO_3125325-6.062173ribose ABC transporter
SBO_3126225-5.974544ribose ABC transporter ATP-binding protein
SBO_3127115-3.367884ribose ABC transporter permease
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3121RTXTOXIND353e-04 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 34.8 bits (80), Expect = 3e-04
Identities = 18/82 (21%), Positives = 31/82 (37%), Gaps = 12/82 (14%)

Query: 168 AAGAGKVVYVGNQLRGYGNLIMIKHSEDYITAYAHNDTMLVNNGQSVKTGQKIATMGSTD 227
A GK+ + G IK E+ I ++V G+SV+ G + + +
Sbjct: 84 ATANGKLTHSGRSK-------EIKPIENSIV-----KEIIVKEGESVRKGDVLLKLTALG 131

Query: 228 AASVRLHFQIRYRATAIDPLRY 249
A + L Q ++ RY
Sbjct: 132 AEADTLKTQSSLLQARLEQTRY 153


47SBO_3158SBO_3177Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_31582240.24699250S ribosomal protein L13
SBO_3159-1201.21292930S ribosomal protein S9
SBO_3160-1161.891010stringent starvation protein A
SBO_31610122.701146ClpXP protease specificity-enhancing factor
SBO_3163-1122.013078transcriptional regulator NanR
SBO_3164-2122.581563N-acetylneuraminate lyase
SBO_3165-1183.989312sialic acid transporter
SBO_3166-1184.146452N-acetylmannosamine-6-phosphate 2-epimerase
SBO_3167-2162.772063N-acetylmannosamine kinase
SBO_3168-2162.303319hypothetical protein
SBO_3169-1162.432428glutamate synthase subunit beta
SBO_3170-2142.072437glutamate synthase subunit alpha
SBO_3171-313-0.449824hypothetical protein
SBO_3172-113-1.052341aerobic respiration control sensor protein ArcB
SBO_3173-114-0.548619isoprenoid biosynthesis protein
SBO_3174214-0.307558monofunctional biosynthetic peptidoglycan
SBO_3175315-0.022918hypothetical protein
SBO_3176216-0.178704PTS system phosphohistidinoprotein-hexose
SBO_3177215-0.740344hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3165TCRTETB613e-12 Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 61.4 bits (149), Expect = 3e-12
Identities = 63/408 (15%), Positives = 138/408 (33%), Gaps = 36/408 (8%)

Query: 30 LLDGFDFVLIALVLTEVQGEFGLTTVQAASLISAAFISRWFGGLMLGAMGDRYGRRLAMV 89
+ +++ + L ++ +F + +A ++ G + G + D+ G + ++
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 90 TSIVLFSAGTLACGFAPGYITMFI-ARLVIGMGMAGEYGSSATYVIESWPKHLRNKASGF 148
I++ G++ + ++ I AR + G G A V PK R KA G
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 149 LISGFSVGAVVAAQVYSLVVPVWGWRALFFIGILPIIFALWLRKNIPEAEDWKEKHAGKA 208
+ S ++G V + ++ W L I ++ II +L K + +
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVR--------- 194

Query: 209 PVRTMVDILYRGEHRIANIVMTLAAATALWFCFAGNLQNAAIVAVLGLLCAAIFISFMVQ 268
+G I I++ + IV+VL L IF+ + +
Sbjct: 195 ---------IKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFL---IFVKHIRK 242

Query: 269 STGK----RWPTGVMLMVVVLFAFLYSWPIQA---LLPTYLKTDLAYNPHTVANVLFFSG 321
T + M+ VL + + ++P +K + + +V+ F G
Sbjct: 243 VTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPG 302

Query: 322 FGAAVGCCVSGFLGDWLGTRKAYVCSLLASQLLIIPVFAIGG----ANVWVLGLLLFFQQ 377
+ + + G++G L R+ + L + F W + +++ F
Sbjct: 303 TMSVI---IFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVL 359

Query: 378 MLGQGIAGILPKLIGGYFDTDQRAAGLGFTYNVGALGGALAPIIGALI 425
++ ++ + AG+ L I +
Sbjct: 360 GGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGL 407


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3172HTHFIS656e-13 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 64.9 bits (158), Expect = 6e-13
Identities = 26/115 (22%), Positives = 45/115 (39%), Gaps = 4/115 (3%)

Query: 528 VLLVEDIELNVIVARSVLEKLGNSVDVAMTGKAALEMFKPGEYDLVLLDIQLPDMTGLDI 587
+L+ +D V L + G V + G+ DLV+ D+ +PD D+
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 588 SRELTKRYPREDLPPLVALTA-NVLKDKQEYLNAGMDDVLSKPLSVPALTAMIKK 641
+ K P P++ ++A N + G D L KP + L +I +
Sbjct: 66 LPRIKKARPD---LPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3175PF05043280.025 Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 28.4 bits (63), Expect = 0.025
Identities = 11/61 (18%), Positives = 19/61 (31%), Gaps = 2/61 (3%)

Query: 86 FDGKPSITLTEFAEQCRYEEDIAQLRQLLKQLKRYLQDNRIVTMSLKPQNILCHHISESE 145
F+ K +E AE E ++ L +K D + + + I
Sbjct: 20 FEHKRWFHRSELAELLNCTE--RAVKDDLSHVKSAFPDLIFHSSTNGIRIINTDDSDIEM 77

Query: 146 V 146
V
Sbjct: 78 V 78


48SBO_3434SBO_3446Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_3434115-4.377135gluconate kinase
SBO_3435115-4.027896regulator of gluconate operon
SBO_3436017-4.882492hypothetical protein
SBO_3437023-5.811385dehydrogenase
SBO_3438020-4.163435acetyltransferase YhhY
SBO_3439019-4.238224hypothetical protein
SBO_3440-2130.534535transposase
SBO_3441-2162.001770IS1 ORF2
SBO_34420172.601218IS1 encoded protein
SBO_34430183.082031gamma-glutamyltranspeptidase
SBO_34440203.774814hypothetical protein
SBO_3445-1214.033897glycerophosphodiester phosphodiesterase
SBO_3446-1213.584869glycerol-3-phosphate transporter ATP-binding
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3438SACTRNSFRASE384e-06 Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 38.4 bits (89), Expect = 4e-06
Identities = 21/92 (22%), Positives = 34/92 (36%), Gaps = 16/92 (17%)

Query: 55 VACIDGDVVGHLTIDVQQRPRRSHVADFGICVDSRWKNRGVASALMREMIE------MCD 108
+ ++ + +G + I + + D + D R K GV +AL+ + IE C
Sbjct: 69 LYYLENNCIGRIKIR-SNWNGYALIEDIAVAKDYRKK--GVGTALLHKAIEWAKENHFCG 125

Query: 109 NWLRVDRIELTVFVDNAPAIKVYKKYGFEIEG 140
L I N A Y K+ F I
Sbjct: 126 LMLETQDI-------NISACHFYAKHHFIIGA 150


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3443NAFLGMOTY320.007 Sodium-type flagellar protein MotY precursor signature.
		>NAFLGMOTY#Sodium-type flagellar protein MotY precursor signature.

Length = 293

Score = 31.6 bits (71), Expect = 0.007
Identities = 27/82 (32%), Positives = 37/82 (45%), Gaps = 17/82 (20%)

Query: 272 RTPISGDYRGYQVYSMPPPSSGGIHIVQILNI--LENFDMKKYGF-GSADAMQIMAEAEK 328
R P+ G+ R + SMPPP G H +I N+ + FD G+ G A I++E EK
Sbjct: 77 RRPM-GETRNVSLISMPPPWRPGEHADRITNLKFFKQFD----GYVGGQTAWGILSELEK 131

Query: 329 YAYADRSEYLGDPDFVKVPWQA 350
Y P F WQ+
Sbjct: 132 GRY---------PTFSYQDWQS 144


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3446PF05272320.003 Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 32.4 bits (73), Expect = 0.003
Identities = 13/43 (30%), Positives = 20/43 (46%), Gaps = 7/43 (16%)

Query: 33 IVMVGPSGCGKSTLLRMVAGLERVTEGDIWINDQRVTEMEPKD 75
+V+ G G GKSTL+ + GL+ + +D KD
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLD-------FFSDTHFDIGTGKD 634


49SBO_3458SBO_3486Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_34582151.092910RNA polymerase factor sigma-32
SBO_34593130.812412cell division protein FtsX
SBO_34603111.176928cell division protein FtsE
SBO_34612122.765270cell division protein FtsY
SBO_34620163.24872316S rRNA m(2)G966-methyltransferase
SBO_34630142.874179hypothetical protein
SBO_3464-1143.066770receptor
SBO_3465-1143.566622hypothetical protein
SBO_34660142.624253zinc/cadmium/mercury/lead-transporting ATPase
SBO_34671171.207093sulfur transfer protein SirA
SBO_34680151.139824hypothetical protein
SBO_34691162.152848hypothetical protein
SBO_34701173.231892major facilitator superfamily transporter
SBO_34711203.565510hypothetical protein
SBO_34720234.564083holo-(acyl carrier protein) synthase 2
SBO_34730234.523768periplasmic binding protein for nickel
SBO_34742244.741683nickel transporter permease NikB
SBO_34752223.799650nickel transporter permease NikC
SBO_34760203.372636nickel transporter ATP-binding protein NikD
SBO_34770193.514864nickel transporter ATP-binding protein NikE
SBO_34780183.275027nickel responsive regulator
SBO_34790172.891056hypothetical protein
SBO_3480-1150.588458transporter
SBO_3481-117-2.607220ABC transporter ATP-binding protein
SBO_3482032-7.215079hypothetical protein
SBO_3483137-10.321687IS1 ORF2
SBO_3484238-10.616089IS1 encoded protein
SBO_3485027-7.639482hypothetical protein
SBO_3486015-3.629063hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3461IGASERPTASE502e-08 IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 50.1 bits (119), Expect = 2e-08
Identities = 37/181 (20%), Positives = 62/181 (34%), Gaps = 14/181 (7%)

Query: 19 EQTPEKETEVQNEQPVVEEI---VQAQEPAKASEQAVEEQPQAHTEAEAETFAADVVEVT 75
TP + TE E E Q+ + + Q E +A + +A T EV
Sbjct: 1030 PATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTN---EVA 1086

Query: 76 EQVAESEKAQPEAEVVAQPEPVVEETPEPVAIEREELPLPEDVNAEEVSPEEWQAEAETV 135
+ +E+++ Q + V +E V E+ + +VSP++ Q+E
Sbjct: 1087 QSGSETKETQTTE--TKETATVEKEEKAKVETEKTQ---EVPKVTSQVSPKQEQSETVQP 1141

Query: 136 EIVEAAEEEAAK--EEITDEELEAQALAAEAAEEAVMVVPPVEE-QPVEEIAQEQEKPTK 192
+ A E + +E + A E + V PV E V E P
Sbjct: 1142 QAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPEN 1201

Query: 193 E 193

Sbjct: 1202 T 1202



Score = 48.5 bits (115), Expect = 5e-08
Identities = 43/210 (20%), Positives = 70/210 (33%), Gaps = 26/210 (12%)

Query: 20 QTPEK-ETEVQNEQPVVEEIVQAQE----------PAKASEQAVEEQPQAHTEAE----- 63
TP + +V + EEI + E P++ +E E Q E
Sbjct: 998 TTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQD 1057

Query: 64 AETFAADVVEVTEQVAESEKAQPEAEVVAQPEPVVEETPEPVAIEREELPLPEDVNAEEV 123
A A EV ++ + KA + VAQ +ET + E V EE
Sbjct: 1058 ATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKET------QTTETKETATVEKEEK 1111

Query: 124 SPEEW--QAEAETVEIVEAAEEEAAKEEITDEELEAQALAAEAAEEAVMV--VPPVEEQP 179
+ E E V + ++E ++ E + +E EQP
Sbjct: 1112 AKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQP 1171

Query: 180 VEEIAQEQEKPTKEGFFARLKRSLLKTKEN 209
+E + E+P E S+++ EN
Sbjct: 1172 AKETSSNVEQPVTESTTVNTGNSVVENPEN 1201



Score = 45.4 bits (107), Expect = 5e-07
Identities = 26/159 (16%), Positives = 48/159 (30%), Gaps = 7/159 (4%)

Query: 17 QKEQTPEKETEVQNEQPVVEEIVQAQEPAKASE------QAVEEQPQAHTEAEAETFAAD 70
Q +T E T + E+ VE + P S+ Q+ QPQA E +
Sbjct: 1096 QTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNI 1155

Query: 71 VVEVTEQVAESEKAQPEAEVVAQPEPVVEETPEPVAIEREELPLPEDVNAEEVSPEEWQA 130
++ ++ QP E + E V E+ V + PE+ P
Sbjct: 1156 KEPQSQTNTTADTEQPAKETSSNVEQPVTES-TTVNTGNSVVENPENTTPATTQPTVNSE 1214

Query: 131 EAETVEIVEAAEEEAAKEEITDEELEAQALAAEAAEEAV 169
+ + + + + + A +
Sbjct: 1215 SSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLT 1253



Score = 40.0 bits (93), Expect = 2e-05
Identities = 29/182 (15%), Positives = 52/182 (28%), Gaps = 12/182 (6%)

Query: 17 QKEQTPEKETEVQNEQPVVEEIVQAQEPAKASEQAVEEQPQAHTEAE-AETFAADVVEVT 75
+E E ++ V+ E E + +E + E A+ EV
Sbjct: 1065 NREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKE-TATVEKEEKAKVETEKTQEVP 1123

Query: 76 EQVAE-------SEKAQPEAEVVAQPEPVV--EETPEPVAIEREELPLPEDVNAEEVSPE 126
+ ++ SE QP+AE + +P V +E + ++ ++ P
Sbjct: 1124 KVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPV 1183

Query: 127 EWQAEAETV-EIVEAAEEEAAKEEITDEELEAQALAAEAAEEAVMVVPPVEEQPVEEIAQ 185
T +VE E E+ +V VP E
Sbjct: 1184 TESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSND 1243

Query: 186 EQ 187

Sbjct: 1244 RS 1245


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3464SHIGARICIN260.039 Ribosome inactivating protein family signature.
		>SHIGARICIN#Ribosome inactivating protein family signature.

Length = 289

Score = 25.9 bits (57), Expect = 0.039
Identities = 6/21 (28%), Positives = 13/21 (61%)

Query: 7 FFIVIIGLIVVAASFRFMQQR 27
+V+I AA ++F++Q+
Sbjct: 173 ALMVLIQSTSEAARYKFIEQQ 193


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3467PF012061053e-34 SirA family protein
		>PF01206#SirA family protein

Length = 76

Score = 105 bits (265), Expect = 3e-34
Identities = 24/72 (33%), Positives = 41/72 (56%)

Query: 9 DHTLDALGLRCPEPVMMVRKTVRNMQPGETLLIIADDPATTRDIPGFCTFMEHELVAKET 68
D +LDA GL CP P++ +KT+ M GE L ++A DP + +D F HEL+ ++
Sbjct: 5 DQSLDATGLNCPLPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHELLEQKE 64

Query: 69 DGLPYRYLIRKG 80
+ Y + +++
Sbjct: 65 EDGTYHFRLKRA 76


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3470TCRTETA531e-09 Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 52.9 bits (127), Expect = 1e-09
Identities = 80/398 (20%), Positives = 147/398 (36%), Gaps = 32/398 (8%)

Query: 13 LRLNLRIVSIVMFNFASYLTIGLPLAVLPGYVHDVM--GFSAFWAGLVISLQYFATLLSR 70
++ N ++ I+ + IGL + VLPG + D++ G++++L
Sbjct: 1 MKPNRPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACA 60

Query: 71 PHAGRYADLLGPKKIVVFGLCGCFLSGLGYLTAGLTASLPVISLLLLCLGRVILGI-GQS 129
P G +D G + +++ L G + + Y L V L +GR++ GI G +
Sbjct: 61 PVLGALSDRFGRRPVLLVSLAG---AAVDYAIMATAPFLWV-----LYIGRIVAGITGAT 112

Query: 130 FAGTGSTLWGVGVVGSL--HIGRVISWNGIVTYGAMAMGAPLGVVFYHWGGLQALALIIM 187
A G+ + + H G + + G +G +G H A AL +
Sbjct: 113 GAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGL 172

Query: 188 GVALVAILLAIPRPTVK--ASKGKPLPFRAVLGRVWLYGMALALA-----SAGFGVIATF 240
LL + + P + + +A +A V A
Sbjct: 173 NFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAAL 232

Query: 241 ITLFYDAK-SWDGAAFALTLFSCAFVGT---RLLFPNGINRIGGLNVAMICFSVEIIGLL 296
+F + + WD ++L + + + ++ R+G M+ + G +
Sbjct: 233 WVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYI 292

Query: 297 LVGVATMPWMAKIG-VLLAGAGFSLVFPALGVVAVKAVPQQNQGAALATYTVFMDLSLGV 355
L+ AT WMA VLLA G + PAL + + V ++ QG + L+ +
Sbjct: 293 LLAFATRGWMAFPIMVLLASGGIGM--PALQAMLSRQVDEERQGQLQGSLAALTSLT-SI 349

Query: 356 TGPLAGLVMSWAGVPV----IYLAAAGLVAIALLLTWR 389
GPL + A + ++A A L + L R
Sbjct: 350 VGPLLFTAIYAASITTWNGWAWIAGAALYLLCLPALRR 387


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3477HTHFIS290.020 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.0 bits (65), Expect = 0.020
Identities = 10/34 (29%), Positives = 19/34 (55%)

Query: 25 QAVLNNVSLTLKSGETVALLGRSGCGKSTLARLL 58
Q + ++ +++ T+ + G SG GK +AR L
Sbjct: 147 QEIYRVLARLMQTDLTLMITGESGTGKELVARAL 180


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3480ABC2TRNSPORT505e-09 ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein

signature.
Length = 262

Score = 49.5 bits (118), Expect = 5e-09
Identities = 41/171 (23%), Positives = 73/171 (42%), Gaps = 7/171 (4%)

Query: 201 REREHGTVEHLLVMPITPFEIMMAKI-WSMGLVVLVVSGLSLVLMVKGVLGVPIEGSIPL 259
R T E +L + +I++ ++ W+ L +G+ +V G + L
Sbjct: 93 RMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAGIGVVAAALGY----TQWLSLL 148

Query: 260 FMLGV-ALSLFATTSIGIFMGTIARSMPQLGLLVILVLLPLQMLSGGSTPRESMPQMVQD 318
+ L V AL+ A S+G+ + +A S LV+ P+ LSG P + +P + Q
Sbjct: 149 YALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQT 208

Query: 319 IMLTMPTTHFVSLAQAILYRGAGFEIVWPQFLTLMAIGGAFF-TIALLRFR 368
+P +H + L + I+ ++ + I FF + ALLR R
Sbjct: 209 AARFLPLSHSIDLIRPIMLGHPVVDVCQHVGALCIYIVIPFFLSTALLRRR 259


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3481PF05272300.044 Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.4 bits (68), Expect = 0.044
Identities = 9/26 (34%), Positives = 14/26 (53%)

Query: 20 ARCMVGLIGPDGVGKSSLLSLISGAR 45
V L G G+GKS+L++ + G
Sbjct: 595 FDYSVVLEGTGGIGKSTLINTLVGLD 620


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3482RTXTOXIND848e-20 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 83.7 bits (207), Expect = 8e-20
Identities = 71/408 (17%), Positives = 139/408 (34%), Gaps = 81/408 (19%)

Query: 6 RHLAWWVVGALAVAAVVAWWLLRPAGVP-EGFAVSNGRIEATEVDIASKIAGRIDTILVK 64
R +A++++G L +A +++ G +GR + I + I+VK
Sbjct: 58 RLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKE----IKPIENSIVKEIIVK 113

Query: 65 EGQFVREGEVLAKMDTRV----------------LQEQRLEAI----------------- 91
EG+ VR+G+VL K+ L++ R + +
Sbjct: 114 EGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDE 173

Query: 92 -------------------AQIKEAQSAVAAAQALLEQRQSETRAAQSLVNQRQAELDSV 132
Q Q+ + L+++++E + +N+ +
Sbjct: 174 PYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVE 233

Query: 133 AKRHMRSRSLAQRGAISAQQLDDDRAAAESARAALESAKAQVSASKAAIEAARTNIIQ-- 190
R SL + AI+ + + A L K+Q+ ++ I +A+
Sbjct: 234 KSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVT 293

Query: 191 -----------AQTRVEAAQATERRIAADID--DSELKAPRDGRV-QYRVAEPGEVLAAG 236
QT T + S ++AP +V Q +V G V+
Sbjct: 294 QLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTA 353

Query: 237 GRVLNMVDLSDVY-MTFFLPTEQAGTLKLGGEARLILDAAPDLRIPATISFVASVAQFTP 295
++ +V D +T + + G + +G A + ++A P R V V
Sbjct: 354 ETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYG---YLVGKVKNINL 410

Query: 296 KTVETSDERLKLMFRVKARIPPELLQQHLEYV--KTGLPGVAWVRVNE 341
+E D+RL L+F V I L + + +G+ A ++
Sbjct: 411 DAIE--DQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGM 456


50SBO_3499SBO_3514Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_3499018-3.128599DNA-binding transcriptional repressor ArsR
SBO_3500118-3.833069arsenical pump membrane protein
SBO_3501222-7.031757arsenate reductase
SBO_3502326-8.400863hypothetical protein
SBO_3503327-8.919190IS911 ORF2
SBO_3504228-11.254463hypothetical protein
SBO_3505224-11.092157hypothetical protein
SBO_3506-121-7.169402hypothetical protein
SBO_3508-114-2.701267acid-resistance protein
SBO_3509-214-3.099292acid-resistance protein
SBO_3510-216-3.782009acid-resistance membrane protein
SBO_3511-213-2.687316hypothetical protein
SBO_3512-111-1.806471multidrug efflux system protein MdtE
SBO_3513010-1.747817transport system permease
SBO_3514112-3.072213ARAC-type regulatory protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3512RTXTOXIND514e-09 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 51.0 bits (122), Expect = 4e-09
Identities = 41/218 (18%), Positives = 70/218 (32%), Gaps = 33/218 (15%)

Query: 97 LQAELNSAKGSLAKALSTASNARITFNRQASLLKTNYVSR-QDYDT-ARTQLNEAEANVT 154
+ + A L S + K Y Q + +L + N+
Sbjct: 257 QENKYVEAVNELRVYKSQLEQIE----SEILSAKEEYQLVTQLFKNEILDKLRQTTDNIG 312

Query: 155 VAKAAVEQATINLQYANVTSPITGVSGKSSV-TVGALVTANQADSLVTVQRLDPIYVDLT 213
+ + + Q + + +P++ + V T G +VT + +V V D + V
Sbjct: 313 LLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET-LMVIVPEDDTLEVTAL 371

Query: 214 QSVQDFLRMKEEVASGQIKQVQGSTPVQLNLE--NGKRY-SQTGTLK--FSDPTVDETTG 268
+D I + + +E RY G +K D D+ G
Sbjct: 372 VQNKD------------IGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLG 419

Query: 269 SVT--LRAI------FPNPNGDLLPGMYVTALVDEGSR 298
V + +I N N L GM VTA + G R
Sbjct: 420 LVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMR 457



Score = 32.1 bits (73), Expect = 0.004
Identities = 25/118 (21%), Positives = 47/118 (39%), Gaps = 7/118 (5%)

Query: 53 PGRTVPY-EVAEIRPQVGGIIIKRNFI-EGDKVNQGDSLYQIDPAPLQAELNSAKGSLAK 110
G+ EI+P I+ K + EG+ V +GD L ++ +A+ + SL +
Sbjct: 87 NGKLTHSGRSKEIKPIENSIV-KEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQ 145

Query: 111 ALSTASNARITFNRQASLLKTNYVSRQDYDTARTQLNEAEANVTVAKAAVEQATINLQ 168
A + +I +R L K + D + N +E V + +++ Q
Sbjct: 146 ARLEQTRYQIL-SRSIELNKLPELKLPDEPYFQ---NVSEEEVLRLTSLIKEQFSTWQ 199


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3513ACRIFLAVINRP12890.0 Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1289 bits (3337), Expect = 0.0
Identities = 723/1032 (70%), Positives = 845/1032 (81%), Gaps = 1/1032 (0%)

Query: 1 MANYFIDRPVFAWVLAIIMMLAGGLAIMNLPVAQYPQIAPPTITVSATYPGADAQTVEDS 60
MAN+FI RP+FAWVLAII+M+AG LAI+ LPVAQYP IAPP ++VSA YPGADAQTV+D+
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTQVIEQNMNGLDGLMYMSSTSDAAGNASITLTFETGTSPDIAQVQVQNKLQLAMPSLPE 120
VTQVIEQNMNG+D LMYMSSTSD+AG+ +ITLTF++GT PDIAQVQVQNKLQLA P LP+
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 121 AVQQQGISVDKSSSNILMVAAFISDNGSLNQYDIADYVASNIKDPLSRTAGVGSVQLFGS 180
VQQQGISV+KSSS+ LMVA F+SDN Q DI+DYVASN+KD LSR GVG VQLFG+
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 181 EYAMRIWLDPQKLNKYNLVPSDVISQIKVQNNQISGGQLGGMPQAADQQLNASIIVQTRL 240
+YAMRIWLD LNKY L P DVI+Q+KVQN+QI+ GQLGG P QQLNASII QTR
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 241 QTPEEFGKILLKVQQDGSQVLLRDVARVELGAEDYSTVARYNGKPAAGIAIKLATGANAL 300
+ PEEFGK+ L+V DGS V L+DVARVELG E+Y+ +AR NGKPAAG+ IKLATGANAL
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 301 DTSRAVKEELNRLSAYFPASLKTVYPYDTTPFIEISIQEVFKTLVEAIILVFLVMYLFLQ 360
DT++A+K +L L +FP +K +YPYDTTPF+++SI EV KTL EAI+LVFLVMYLFLQ
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 361 NFRATIIPTIAVPVVILGTFAILSAVGFTINTLTMFGMVLAIGLLVDDAIVVVENVERVI 420
N RAT+IPTIAVPVV+LGTFAIL+A G++INTLTMFGMVLAIGLLVDDAIVVVENVERV+
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 421 AEDKLPPKEATHKSMGQIQRALVGIAVVLSAVFMPMAFMSGATGEIYRQFSITLISSMLL 480
EDKLPPKEAT KSM QIQ ALVGIA+VLSAVF+PMAF G+TG IYRQFSIT++S+M L
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 481 SVFVAMSLTPALCATILKAAPEGGHK-PNALFARFNTLFEKSTQHYTDSTRSLLRCTGRY 539
SV VA+ LTPALCAT+LK H+ F FNT F+ S HYT+S +L TGRY
Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540

Query: 540 MVVYLLICAGMAVLFLRTPTSFLPEEDQGVFMTTAQLPSGATMVNTTKVLQQVTDYYLTK 599
+++Y LI AGM VLFLR P+SFLPEEDQGVF+T QLP+GAT T KVL QVTDYYL
Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600

Query: 600 EKDNVQSVFTVGGFGFSGQGQNNGLAFISLKPWSERVGEENSVTAIIQRAMIALSSINKA 659
EK NV+SVFTV GF FSGQ QN G+AF+SLKPW ER G+ENS A+I RA + L I
Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660

Query: 660 VVFPFNLPAVAELGTASGFDMELLDNGNLGHEKLTQARNELLSLAAQSPDQVTGVRPNGL 719
V PFN+PA+ ELGTA+GFD EL+D LGH+ LTQARN+LL +AAQ P + VRPNGL
Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720

Query: 720 EDTPMFKVNVNAAKAEAMGVALSDINQTISTAFGSSYVNDFLNQGRVKKVYVQAGTPFRM 779
EDT FK+ V+ KA+A+GV+LSDINQTISTA G +YVNDF+++GRVKK+YVQA FRM
Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780

Query: 780 LPDNINQWYVRNASGTMAPLSAYSSTEWTYGSPRLERYNGIPSMEILGEAAAGKSTGDAM 839
LP+++++ YVR+A+G M P SA++++ W YGSPRLERYNG+PSMEI GEAA G S+GDAM
Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840

Query: 840 KFMADLVAKLPAGVGYSWTGLSYQEALSSNQAPALYAISLVVVFLALAALYESWSIPFSV 899
M +L +KLPAG+GY WTG+SYQE LS NQAPAL AIS VVVFL LAALYESWSIP SV
Sbjct: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900

Query: 900 MLVVPLGVVGALLATDLRGLSNDVYFQVGLLTTIGLSAKNAILIVEFAVEMMQKEGKTPI 959
MLVVPLG+VG LLA L NDVYF VGLLTTIGLSAKNAILIVEFA ++M+KEGK +
Sbjct: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960

Query: 960 GAIIEAARMRLRPILMTSLAFILGVLPLVISHGAGSGAQNAVGTGVMGGMFAATVLAIYF 1019
A + A RMRLRPILMTSLAFILGVLPL IS+GAGSGAQNAVG GVMGGM +AT+LAI+F
Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020

Query: 1020 VPVFFVVVEHLF 1031
VPVFFVV+ F
Sbjct: 1021 VPVFFVVIRRCF 1032


51SBO_3555SBO_3573Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_3555022-3.403802dehydrogenase
SBO_3556120-3.931808hypothetical protein
SBO_3557129-5.212646transcriptional regulator
SBO_3558430-2.956712major cold shock protein
SBO_35591210.457243small toxic polypeptide
SBO_35602241.662551IS1 encoded protein
SBO_35611231.503947IS1 ORF2
SBO_35621201.818533IS1 encoded protein
SBO_35632212.215600IS1 ORF2
SBO_3564-1180.959603glycyl-tRNA synthetase subunit beta
SBO_3565014-1.323182glycyl-tRNA synthetase subunit alpha
SBO_3566-120-3.616666lipoprotein
SBO_3567020-1.958518insertion element IS2 transposase InsD
SBO_3568016-2.682817IS1 ORF2
SBO_3569116-4.033719hypothetical protein
SBO_3570-118-4.063901hypothetical protein
SBO_3571-117-2.676528hypothetical protein
SBO_3572-117-2.392436xylulokinase
SBO_3573-117-3.477731xylose isomerase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3559HOKGEFTOXIC671e-19 Hok/Gef cell toxic protein family signature.
		>HOKGEFTOXIC#Hok/Gef cell toxic protein family signature.

Length = 52

Score = 67.2 bits (164), Expect = 1e-19
Identities = 18/46 (39%), Positives = 31/46 (67%)

Query: 1 MPQKYGLLSLIVICFTLLFFTWMVRDSLCELHIKQGRYELAAFLAC 46
+P+ + ++++C TLL FT++ R SLCE+ + G E+AAF+A
Sbjct: 3 LPRSSLVWCVLIVCLTLLIFTYLTRKSLCEIRYRDGYREVAAFMAY 48


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3570FLGBIOSNFLIP270.021 Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP

signature.
Length = 245

Score = 27.5 bits (61), Expect = 0.021
Identities = 19/66 (28%), Positives = 26/66 (39%), Gaps = 1/66 (1%)

Query: 78 MTCLTVFIISVALLMVGLWNATLLLSEKGFYGLAFFLSLFGAVAVQKNIRDAGINPPKET 137
MT T II LL L + + GLA FL+ F V I P E
Sbjct: 61 MTSFTRIIIVFGLLRNALGTPSAP-PNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEE 119

Query: 138 QVTQEE 143
+++ +E
Sbjct: 120 KISMQE 125


52SBO_3599SBO_3610Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_3599124-5.393441mannitol repressor protein
SBO_3600220-3.599726hypothetical protein
SBO_3601220-2.745246hypothetical protein
SBO_3602321-2.670501IS1 encoded protein
SBO_3603218-1.443450IS1 ORF2
SBO_3604117-1.297277lipoprotein
SBO_36052140.253593adhesin
SBO_36060152.671911insertion element IS2 transposase InsD
SBO_3607-1152.650324adhesin
SBO_3608-2132.975635L-lactate permease
SBO_3609-1113.246444DNA-binding transcriptional repressor LldR
SBO_3610-2113.087500L-lactate dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3604TYPE4SSCAGA280.030 Type IV secretion system CagA exotoxin signature.
		>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature.

Length = 1147

Score = 28.1 bits (62), Expect = 0.030
Identities = 24/106 (22%), Positives = 43/106 (40%), Gaps = 14/106 (13%)

Query: 38 QEEHNNKIDLLE--------------KQQAQLKSQLETIQKQQTGIISSTKTLTHVIKSV 83
QEE NKID +E K++ + +++++ QK + + S
Sbjct: 390 QEEIQNKIDFMEFLAQNNAKLDNLSEKEKEKFRTEIKDFQKDSKAYLDALGNDRIAFVSK 449

Query: 84 KDQQNTFIFTEFNPAKTKYFILNNGSVALAGRVLSIDATENGSVIH 129
KD +++ + TEF Y + + G A + T GS+ H
Sbjct: 450 KDTKHSALITEFGNGDLSYTLKDYGKKADKALDREKNVTLQGSLKH 495


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3605OMADHESIN466e-07 Yersinia outer membrane adhesin signature.
		>OMADHESIN#Yersinia outer membrane adhesin signature.

Length = 455

Score = 46.4 bits (109), Expect = 6e-07
Identities = 51/156 (32%), Positives = 76/156 (48%), Gaps = 28/156 (17%)

Query: 1196 DAVTVRQLQNAIGAVATTPTKYFHANSTEEDSLAVGTDSLAMGAKTIVNGDKGIGIGYGA 1255
++V + L A+G A T Y A++ ++D +A+G + D G+ +G+ +
Sbjct: 99 NSVAIGPLSKALGDSAVT---YGAASTAQKDGVAIGARAST--------SDTGVAVGFNS 147

Query: 1256 YVDANALNGIAIGSNAQVIHVNSIAIGNGSTTTRGAQTNYTAYNMDAPQNSVGEFSVGSA 1315
DA I S+ H SIAIG+ S T R +NSV S+G
Sbjct: 148 KADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDR--------------ENSV---SIGHE 190

Query: 1316 DGQRQITNVAAGSADTDAVNVGQLKVTDERVAQNTQ 1351
RQ+T++AAG+ DTDAVNV QLK E+ +NT
Sbjct: 191 SLNRQLTHLAAGTKDTDAVNVAQLKKEIEKTQENTN 226



Score = 45.3 bits (106), Expect = 1e-06
Identities = 83/329 (25%), Positives = 139/329 (42%), Gaps = 21/329 (6%)

Query: 111 GSKTHAIGGASMAFGVSAISEGDRSIALGASSYSLGQYSMALGRYSKALGKLSIAMGDSS 170
G A G S+A G +A + ++A+GA S + G S+A+G SKALG ++ G +S
Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAAS 121

Query: 171 KAE------GANA------IALGNATKATEIMSIALGDTAN--ASKAYSMALGASSVASE 216
A+ GA A +A+G +KA S+A+G +++ A+ YS+A+G S
Sbjct: 122 TAQKDGVAIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDR 181

Query: 217 ENAIALGRSSVASGTDSLAFGRQSLASAANAIAIGAETEAAENATAIGNNAKAKGTNSMA 276
EN++++G S+ LA G + A N + E E + T + N+ A
Sbjct: 182 ENSVSIGHESLNRQLTHLAAGTKD-TDAVNVAQLKKEIEKTQENTNKRSAELLANANAYA 240

Query: 277 MGFGSLADKVNTIALGNGSQALADNAIAIGQGNKADGVDAIALGNGSQSRGLNTIALGTA 336
S + + S +NA D ++ + S +R A A
Sbjct: 241 DNKSSSVLGIANNYTDSKSAETLENARKEAFAQSKDVLNMAKAHSNSVARTTLETAEEHA 300

Query: 337 SNA---TGDKSLALGSNSSANGINSVALGADSIADLDNTVSVGNSSLKRKIVNVKNGAIK 393
++ T + + + SA + S + ADS + +T+ NS + N AI+
Sbjct: 301 NSVARTTLETAEEHANKKSAEALASANVYADSKS--SHTLKTANSYTDVTVSNSTKKAIR 358

Query: 394 -SDSYDAINGSQLYAISDSVAKRLGGGAA 421
S+ Y QL D + R+ G A
Sbjct: 359 ESNQYTDHKFRQLDNRLDKLDTRVDKGLA 387



Score = 37.2 bits (85), Expect = 4e-04
Identities = 45/156 (28%), Positives = 70/156 (44%), Gaps = 17/156 (10%)

Query: 1099 AQGVGATAIGYNSVAKGDSSVAIGQGSYSDVDTGIALGS-SSVSSRVIAKGSRDTSITEN 1157
A GV + AIG S A GDS+V G S + D G+A+G+ +S S +A G + +N
Sbjct: 95 ATGVNSVAIGPLSKALGDSAVTYGAASTAQKD-GVAIGARASTSDTGVAVGFNSKADAKN 153

Query: 1158 GVVIGY---------------DTTDGELLGALSIGDDGKYRQIINVADGSEAHDAVTVRQ 1202
V IG+ D + + ++SIG + RQ+ ++A G++ DAV V Q
Sbjct: 154 SVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQ 213

Query: 1203 LQNAIGAVATTPTKYFHANSTEEDSLAVGTDSLAMG 1238
L+ I K ++ A S +G
Sbjct: 214 LKKEIEKTQENTNKRSAELLANANAYADNKSSSVLG 249


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3607PF03895662e-16 Serum resistance protein DsrA.
		>PF03895#Serum resistance protein DsrA.

Length = 79

Score = 66.0 bits (161), Expect = 2e-16
Identities = 19/79 (24%), Positives = 36/79 (45%), Gaps = 2/79 (2%)

Query: 202 ESKLSGGIASAMAMTGLPQAYTPGASMASIGGGTYNGESAVALGV-SMVSANGRWVYKLQ 260
+L G+A+ A++ L Q G + S G Y ++A+A+GV S ++ +
Sbjct: 2 SKELQTGLANQSALSMLVQPNGVGKTSVSAAVGGYRDKTALAIGVGSRITDRFTAKAGVA 61

Query: 261 GSTNSQGEYSAALGAGIQW 279
+T + G S G ++
Sbjct: 62 FNTYN-GGMSYGASVGYEF 79


53SBO_3622SBO_3636Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_3622-115-3.432895L-threonine 3-dehydrogenase
SBO_3623119-5.9802702-amino-3-ketobutyrate coenzyme A ligase
SBO_3624224-8.867340ADP-L-glycero-D-manno-heptose-6-epimerase
SBO_3625334-11.441816ADP-heptose--LPS heptosyltransferase
SBO_3626341-14.185722ADP-heptose--LPS heptosyltransferase
SBO_3627340-15.053070lipid A-core surface polymer ligase
SBO_3628331-11.991932UDP-galactose:(galactosyl) LPS
SBO_3629329-10.133927lipopolysaccharide core biosynthesis protein
SBO_3630324-6.976559lipopolysaccharide 1,2-glucosyltransferase
SBO_3631321-4.629777UDP-D-galactose:(glucosyl)lipopolysaccharide-
SBO_3632216-1.985241RfaP protein
SBO_3633115-1.857796glucosyltransferase I
SBO_3634013-1.055918lipopolysaccharide core biosynthesis protein
SBO_3635112-0.3993293-deoxy-D-manno-octulosonic-acid transferase
SBO_3636216-0.549444phosphopantetheine adenylyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3624NUCEPIMERASE1047e-28 Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 104 bits (260), Expect = 7e-28
Identities = 77/348 (22%), Positives = 127/348 (36%), Gaps = 67/348 (19%)

Query: 2 IIVTGGAGFIGSNIVKALNDKGITDILVVDNLKD--------------GTKFVNLVDLDI 47
+VTG AGFIG ++ K L + G ++ +DNL D +D+
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAG-HQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 48 ADYMDKEDFLIQIMAGEEFGDVEAIFHEGACSSTTEWDGKYMMDNNYQYSK-------EL 100
AD + + + A F E +F + +Y ++N + Y+ +
Sbjct: 62 ADR----EGMTDLFASGHF---ERVFISPHRLAV-----RYSLENPHAYADSNLTGFLNI 109

Query: 101 LHYCLEREIP-FLYASSAATYGGRTSD-FIESREYEKPLNVYGYSKFLFDEYVRQILPEA 158
L C +I LYASS++ YG F + P+++Y +K +
Sbjct: 110 LEGCRHNKIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLY 169

Query: 159 NSQIVGFRYFNVYGPREGHKGSMASVAFHLNTQLNNGESPKLFEGSENFKRDFVYVGDVA 218
G R+F VYGP + MA F + G+S ++ KRDF Y+ D+A
Sbjct: 170 GLPATGLRFFTVYGPWG--RPDMA--LFKFTKAMLEGKSIDVY-NYGKMKRDFTYIDDIA 224

Query: 219 DVNL------------WFLENGVSG-------IFNLGTGRAESFQAVADATLAY-HKKGQ 258
+ + W +E G ++N+G A + +
Sbjct: 225 EAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAK 284

Query: 259 IEYIPFPDKLKGRYQAFTQADLTNLRAA-GYDKPFKTVAEGVTEYMAW 305
+P G T AD L G+ P TV +GV ++ W
Sbjct: 285 KNMLPLQ---PGDVL-ETSADTKALYEVIGF-TPETTVKDGVKNFVNW 327


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3636LPSBIOSNTHSS2472e-87 Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein

signature.
Length = 166

Score = 247 bits (631), Expect = 2e-87
Identities = 77/154 (50%), Positives = 111/154 (72%)

Query: 5 AIYPGTFDPITNGHIDIVTRATQMFDHVILAIAASPSKKPMFTLEERVALAQQATAHLGN 64
AIYPG+FDPIT GH+DI+ R ++FD V +A+ +P+K+PMF+++ER+ +A AHL N
Sbjct: 3 AIYPGSFDPITFGHLDIIERGCRLFDQVYVAVLRNPNKQPMFSVQERLEQIAKAIAHLPN 62

Query: 65 VEVVGFSDLMANFARNQHATVLIRGLRAVADFEYEMQLAHMNRHLMPELESVFLMPSKEW 124
+V F L N+AR + A ++RGLR ++DFE E+Q+A+ N+ L +LE+VFL S E+
Sbjct: 63 AQVDSFEGLTVNYARQRQAGAILRGLRVLSDFELELQMANTNKTLASDLETVFLTTSTEY 122

Query: 125 SFISSSLVKEVARHQGDVTHFLPENVHQALMAKL 158
SF+SSSLVKEVAR G+V HF+P +V AL +
Sbjct: 123 SFLSSSLVKEVARFGGNVEHFVPSHVAAALYDQF 156


54SBO_3693SBO_3711Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_36932162.907503hypothetical protein
SBO_36942172.653189hypothetical protein
SBO_36952193.561717transcriptional regulator
SBO_36962194.129152multidrug resistance protein D
SBO_36971183.737323ilvB operon leader peptide
SBO_36980173.057815acetolactate synthase catalytic subunit
SBO_3699-1142.661677acetolactate synthase 1 regulatory subunit
SBO_3700-1132.828760DNA-binding transcriptional activator UhpA
SBO_3701-1122.093409sensory histidine kinase UhpB
SBO_3702-1121.124072regulatory protein UhpC
SBO_3703-1120.669250sugar phosphate antiporter
SBO_3704-1160.070342cryptic adenine deaminase
SBO_3706-120-2.331489IS1 ORF2
SBO_3707121-3.094522IS1 encoded protein
SBO_3708121-3.010261hypothetical protein
SBO_3709020-3.203635ribonucleoside transporter
SBO_3710021-4.556822transporter
SBO_3711022-3.945188hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3696TCRTETB612e-12 Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 61.0 bits (148), Expect = 2e-12
Identities = 41/184 (22%), Positives = 82/184 (44%), Gaps = 1/184 (0%)

Query: 7 RNVNLLLMLVLLVAVGQMAQTIYIPAIADMARDLNVREGAVQSVMGAYLLTYGVSQLFYG 66
R+ +L+ L +L + + + ++ D+A D N + V A++LT+ + YG
Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70

Query: 67 PISDRVGRRPVILVGMSIFMLATLVA-VTTSSLTVLIAASAMQGMGTGVGGVMARTLPRD 125
+SD++G + ++L G+ I +++ V S ++LI A +QG G + +
Sbjct: 71 KLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVAR 130

Query: 126 LYERTQLRHANSLLNMGILVSPLLAPLIGGLLDTMWNWRACYLFLLVLCVGVTFSMARWM 185
+ A L+ + + + P IGG++ +W L ++ + V F M
Sbjct: 131 YIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLK 190

Query: 186 PETR 189
E R
Sbjct: 191 KEVR 194


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3700HTHFIS612e-13 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 61.4 bits (149), Expect = 2e-13
Identities = 29/174 (16%), Positives = 59/174 (33%), Gaps = 20/174 (11%)

Query: 2 ITVALIDDHLIVRSGFAQLLGLEPDLQVVAEFGSGREALAGLPGRGVQVCICDISMPDIS 61
T+ + DD +R+ Q L V + + + + D+ MPD +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRA-GYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 62 GLELLSQLPK---GMATIMLSVHDSPALVEQALNAGARGFLSKRCSPDELIAAVHTVATG 118
+LL ++ K + +++S ++ +A GA +L K ELI +
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR---- 117

Query: 119 GCYLTPDIAIKLASGRQDPLTKRERQVAEKLAQG---MAVKEIAAELGLSPKTV 169
A+ R L + + + + + A L + T+
Sbjct: 118 --------ALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTL 163


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3701PF06580402e-05 Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 39.8 bits (93), Expect = 2e-05
Identities = 28/142 (19%), Positives = 56/142 (39%), Gaps = 11/142 (7%)

Query: 366 LRPRQLDDLTLEQAIRSLMREMELEGRGIVSHLEWRIDESALSENQRVTLFRVCQEGLNN 425
LR ++L + + ++L L++ + + +V + Q + N
Sbjct: 208 LRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPM-LVQTLVEN 266

Query: 426 IVKHA-----DASAVTLQGWQQDERLMLVIEDDGSGLPPGSGQ-QGFGLTGMRERVTALG 479
+KH + L+G + + + L +E+ GS + + G GL +RER+ L
Sbjct: 267 GIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQMLY 326

Query: 480 G---TLHISCLHG-TRVSVSLP 497
G + +S G V +P
Sbjct: 327 GTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3702TCRTETB411e-05 Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 40.6 bits (95), Expect = 1e-05
Identities = 65/408 (15%), Positives = 137/408 (33%), Gaps = 60/408 (14%)

Query: 30 RHILLTIWLGYALFY--FTRKSFNAAVPEILANGVLSRSDIGLLATLFYITYGVSKFVSG 87
RH + IWL F+ N ++P+I + + + T F +T+ + V G
Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70

Query: 88 IVSDRSNARYFMGIGLIATGIINILFGFSTSLWAFAVLWVLNAFFQGWGS---PVCARLL 144
+SD+ + + G+I +++ S F L ++ F QG G+ P ++
Sbjct: 71 KLSDQLGIKRLLLFGIIINCFGSVIGFVGHS---FFSLLIMARFIQGAGAAAFPALVMVV 127

Query: 145 TAWY-SRTERGGWWALWNTAHNVGGALIPIVMAAAALHYGWRAGMMIAGCMAIVVGIFLC 203
A Y + RG + L + +G + P + A + W ++ M ++ +
Sbjct: 128 VARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHW--SYLLLIPMITIITVPF- 184

Query: 204 WRLRDRPQALGLPAVGEWRHDALEIAQQQEGAGLTRKEILTKYVLLNPYIWLLSFCYVLV 263
+ L +I G L I+ + Y VL
Sbjct: 185 --------LMKLLKKEVRIKGHFDIK----GIILMSVGIVFFMLFTTSYSISFLIVSVLS 232

Query: 264 YVV-----RAAINDWGNLYMSETLGVDLVTANTAVTMFELGGFI-----------GALVA 307
+++ R + + + + + + + + + GF+ A
Sbjct: 233 FLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTA 292

Query: 308 GWGSDKLFNGNRGPMNLIFAAGILL-SVGSLWLMPFASYVMQATCFFTIGFFVFGPQMLI 366
GS +F G + + GIL+ G L+++ + + F T F + +
Sbjct: 293 EIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFL-SVSFLTASFLLETTSWFM 351

Query: 367 ---------GMAAAECS---------HKEAAGAATGFVGLFAYLGASL 396
G++ + ++ AGA + ++L
Sbjct: 352 TIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGT 399


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3703TCRTETB340.001 Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 34.1 bits (78), Expect = 0.001
Identities = 28/168 (16%), Positives = 61/168 (36%), Gaps = 17/168 (10%)

Query: 49 FNIAQNDMISTYGLSMTQLGMIGLGFSITYGVGKTLVSYYADGKNTKQFLPFMLILSAIC 108
N++ D+ + + + F +T+ +G + +D K+ L F +I++ C
Sbjct: 33 LNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIIN--C 90

Query: 109 MLGFSASMGSGSVSLFLMIAFYALSGFFQSTGGSCSYSTI----TKWTPRRKRGTFLGFW 164
+G SL +M + F Q G + + + ++ P+ RG G
Sbjct: 91 FGSVIGFVGHSFFSLLIM------ARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLI 144

Query: 165 NISHNLGGAGAAGVALFGANYLFDGHVIGMFIFPSIIALIVGFIGLRY 212
+G + A+Y+ + + + P + I+ L
Sbjct: 145 GSIVAMGEGVGPAIGGMIAHYIHWSY---LLLIP--MITIITVPFLMK 187


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3704UREASE381e-04 Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 37.8 bits (88), Expect = 1e-04
Identities = 30/105 (28%), Positives = 43/105 (40%), Gaps = 17/105 (16%)

Query: 22 AVSRGDAVADYIIDNVSILDLINGGEISGPIVIKGRYIAGVG-AEYAD---------APA 71
V+R D +I N ILD + G + I +K IA +G A D P
Sbjct: 60 QVTREGGAVDTVITNALILD--HWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPG 117

Query: 72 LQRIDARGATAVPGFIDAHLHIESSMMTPVTFETATLPRGLTTVI 116
+ I G G +D+H+H + P E A L GLT ++
Sbjct: 118 TEVIAGEGKIVTAGGMDSHIH----FICPQQIEEA-LMSGLTCML 157


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3709TCRTETB362e-04 Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 36.0 bits (83), Expect = 2e-04
Identities = 29/163 (17%), Positives = 61/163 (37%), Gaps = 18/163 (11%)

Query: 43 LTPMAQDLGISEG-----VAGQSVTVTAFVAMFASLFITQTIQATDRCYVVILFAVLLTL 97
L +A D +T + A++ L +D+ + L + +
Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKL--------SDQLGIKRLLLFGIII 88

Query: 98 SCL--LVSFAN--SFSLLLIGRACLGLALGGFWAMSASLTMRLVPPRTVPKALSVIFGAV 153
+C ++ F FSLL++ R G F A+ + R +P KA +I V
Sbjct: 89 NCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIV 148

Query: 154 SIALVIAAPLGSFLGELIGWRNVFNAAAVMGVLCIFWIIKSLP 196
++ + +G + I W + + ++ + +++K L
Sbjct: 149 AMGEGVGPAIGGMIAHYIHWSYLLLIPMIT-IITVPFLMKLLK 190


55SBO_3747SBO_3755Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_3747231-0.18315416S rRNA methyltransferase GidB
SBO_37481320.580986F0F1 ATP synthase subunit I
SBO_37504401.190355F0F1 ATP synthase subunit C
SBO_37514391.298434F0F1 ATP synthase subunit B
SBO_37523351.187274F0F1 ATP synthase subunit delta
SBO_37533361.365600F0F1 ATP synthase subunit alpha
SBO_37542261.242842F0F1 ATP synthase subunit gamma
SBO_37552241.648624F0F1 ATP synthase subunit beta
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3751IGASERPTASE270.028 IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 27.3 bits (60), Expect = 0.028
Identities = 20/101 (19%), Positives = 37/101 (36%), Gaps = 18/101 (17%)

Query: 31 AAIEKRQKEIADGLASAERAHKDLDLAKASATDQLKKAKAEAQVIIEQ--ANKRRSQILD 88
+EK +++ + A K+ + T + A++ ++ Q K + +
Sbjct: 1049 KTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEK 1108

Query: 89 EAKAEAEQERTKIVA----------------QAQAEIEAER 113
E KA+ E E+T+ V Q QAE E
Sbjct: 1109 EEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPAREN 1149


56SBO_3778SBO_3783Y        NNGenomic Island
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_3778-1203.494045ilvG operon leader peptide
SBO_3779-1193.785151acetolactate synthase 2 catalytic subunit
SBO_37800253.928655acetolactate synthase 2 regulatory subunit
SBO_37810263.998485branched-chain amino acid aminotransferase
SBO_3782-1224.063549dihydroxy-acid dehydratase
SBO_37831203.214635threonine dehydratase
57SBO_3949SBO_3960Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_3949-1185.336235ATP-dependent protease peptidase subunit
SBO_3950-1185.408758cell division protein FtsN
SBO_3951-1194.734359DNA-binding transcriptional regulator CytR
SBO_39520225.142322primosome assembly protein PriA
SBO_39531254.60147950S ribosomal protein L31
SBO_39540214.480967protein in rhs element
SBO_39550152.860392IS1 ORF2
SBO_3956-1152.517329iso-IS1 ORF2
SBO_3957-1173.245075peptidoglycan peptidase
SBO_3958-2173.264648transcriptional repressor protein MetJ
SBO_3959-2163.193740cystathionine gamma-synthase
SBO_3960-2173.103823bifunctional aspartate kinase II/homoserine
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3950IGASERPTASE422e-06 IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 42.0 bits (98), Expect = 2e-06
Identities = 32/155 (20%), Positives = 64/155 (41%), Gaps = 5/155 (3%)

Query: 114 LTPEQRQLLEQMQADMRQQPTQLVEVPWNEQTPEQRQQTLQRQRQAQQLAEQQRLAQQSR 173
+ +QAD+ P+ E+ ++ P + +AE + Q+S+
Sbjct: 992 VDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSK--QESK 1049

Query: 174 TTEQSWQQQT-RTSQAAPVQAQPRQSKPASTQQPYQDLLQTPAHTTAQSKPQQAAPVARA 232
T E++ Q T T+Q V + + + A+TQ + T ++ ++ A V +
Sbjct: 1050 TVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKE 1109

Query: 233 ADAPKPTAEKKDERRWMVQCGSFRGAEQAETVRAQ 267
A T + ++ + Q + EQ+ETV+ Q
Sbjct: 1110 EKAKVETEKTQEVPKVTSQVSPKQ--EQSETVQPQ 1142


58SBO_4065SBO_4089Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_4065114-3.142328replicative DNA helicase
SBO_4066121-4.375516quinone oxidoreductase
SBO_4067125-6.284995phage shock protein G
SBO_4068121-3.394207tRNA-dihydrouridine synthase A
SBO_4069023-4.194387hypothetical protein
SBO_4070220-1.295373hypothetical protein
SBO_40710141.504784zinc uptake transcriptional repressor
SBO_40720131.225213stress-response protein
SBO_40730131.515688DNA-damage-inducible SOS response protein
SBO_4074-2100.306492LexA repressor
SBO_4075-114-3.402211diacylglycerol kinase
SBO_4076-115-3.634999glycerol-3-phosphate acyltransferase
SBO_4077-128-6.6097004-hydroxybenzoate octaprenyltransferase
SBO_4078035-9.871271chorismate pyruvate lyase
SBO_4079034-9.912719transposase
SBO_4080032-11.093457hypothetical protein
SBO_4081024-5.799054IS1 encoded protein
SBO_4082-113-1.076565IS1 ORF2
SBO_4083-113-0.589248hypothetical protein
SBO_4084-1131.717659acid phosphatase/phosphotransferase
SBO_40851142.428664hypothetical protein
SBO_40860142.538636hypothetical protein
SBO_40870152.733764excinuclease ABC subunit A
SBO_40882200.285978single-stranded DNA-binding protein
SBO_4089325-0.684126IS4 orf
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_4088PERTACTIN270.048 Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 27.0 bits (59), Expect = 0.048
Identities = 15/50 (30%), Positives = 19/50 (38%), Gaps = 4/50 (8%)

Query: 119 GGAPAGGNIGGGQPQGGWGQPQQPQGGNQFSG----GAQSRPQQSAPAAP 164
G APAGG + GG GG + + G + QS AP
Sbjct: 261 GDAPAGGAVPGGAVPGGAVPGGFGPLLDGWYGVDVSDSTVDLAQSIVEAP 310


59SBO_4116SBO_4129Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_41160296.578949ribose 5-phosphate isomerase B
SBO_41170337.424514hypothetical protein
SBO_4118-1347.990495carbon-phosphorus lyase complex accessory
SBO_4119-1377.663426aminoalkylphosphonic acid N-acetyltransferase
SBO_41200367.912336ribose 1,5-bisphosphokinase
SBO_41210378.143840protein PhnM
SBO_41220358.183717phosphonate transport ATP-binding protein
SBO_41230368.517091phosphonate C-P lyase system protein PhnK
SBO_41241378.364671protein PhnJ
SBO_41252367.437949protein PhnI
SBO_41262376.918767carbon-phosphorus lyase complex subunit
SBO_41272355.791724protein PhnG
SBO_41282344.350391phosphonate metabolism transcriptional regulator
SBO_41292322.311057Pn transporter membrane channel protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_4119SACTRNSFRASE280.007 Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 28.4 bits (63), Expect = 0.007
Identities = 19/84 (22%), Positives = 31/84 (36%), Gaps = 5/84 (5%)

Query: 55 HLALLDGEVVGMIGLHLQFHLHHVNWIGEIQELVVMPQACGLNVGSKLLAWAEEEARQAG 114
L L+ +G I + + N I+++ V VG+ LL A E A++
Sbjct: 68 FLYYLENNCIGRIKIRSNW-----NGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENH 122

Query: 115 AEMTELSTNVKRHDAHRFYLREGY 138
L T A FY + +
Sbjct: 123 FCGLMLETQDINISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_4122PF05272290.013 Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.3 bits (65), Expect = 0.013
Identities = 17/70 (24%), Positives = 25/70 (35%), Gaps = 8/70 (11%)

Query: 36 CVVLHGHSGSGKSTLLRSLYANYLPDEGQIQIKHGDEWVDLVTAPARKVVEI------RK 89
VVL G G GKSTL+ +L + I G + + + E+ R+
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIV--AYELSEMTAFRR 655

Query: 90 TTVGWVSQFL 99
V F
Sbjct: 656 ADAEAVKAFF 665


60SBO_4145SBO_4172Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_4145218-4.508180DNA-binding transcriptional regulator MelR
SBO_4146321-4.399015alpha-galactosidase
SBO_4147525-5.061076IS911 ORF2
SBO_4148525-5.648283IS600 ORF2
SBO_4149524-5.308730hypothetical protein
SBO_4150425-5.231762serine protease
SBO_41515251.425689protein encoded within IS
SBO_41524240.769794hypothetical protein
SBO_4153-114-0.110017IS629 ORF1
SBO_4154-113-0.516410IS629 ORF2
SBO_4155-111-2.931052protein in IS element
SBO_4156013-5.021123IS600 ORF2
SBO_4157-113-5.094139IS600 ORF1
SBO_4158-114-5.694511type I restriction enzyme R protein
SBO_4159225-7.469598type I restriction enzyme M protein
SBO_4160121-3.032970type I restriction-modification system
SBO_41610210.314071hypothetical protein
SBO_41620243.236068IS1 ORF2
SBO_41630243.709762hypothetical protein
SBO_41640275.253007prophage P4 integrase
SBO_41652337.000887DNA primase
SBO_41662304.531093protein encoded in prophage
SBO_4167126-2.035711hypothetical protein
SBO_4168332-6.372686bacteriophage protein
SBO_4169232-7.443037hypothetical protein
SBO_4170128-6.278582hypothetical protein
SBO_4171124-5.964737phage polarity suppression protein
SBO_4172020-5.258851hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_4150IGASERPTASE2851e-80 IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 285 bits (729), Expect = 1e-80
Identities = 136/571 (23%), Positives = 223/571 (39%), Gaps = 122/571 (21%)

Query: 33 KKVILGIILSSIYGSYGETAFA-AMLDINNIWTRDYLDLAQNRGEFRPGATNVQLMMKDG 91
KK L I ++ +Y T + A L +++ + + D A+N+G+F GATNV + K+
Sbjct: 4 KKFKLNFIALTV--AYALTPYTEAALVRDDVDYQIFRDFAENKGKFSVGATNVLVKDKNN 61

Query: 92 KIFH--FPE-LPVPDFSAVS-NKGATTSIGGAYSVTATH--------------------N 127
K P +P+ DFS V +K T I Y V H N
Sbjct: 62 KDLGTALPNGIPMIDFSVVDVDKRIATLINPQYVVGVKHVSNGVSELHFGNLNGNMNNGN 121

Query: 128 GTQHHAITTQSWDQTAYKASNRVSS----------------GDFSVHRLNKFVVETTGVT 171
H ++++ + + + + D+ + RL+KFV T
Sbjct: 122 AKAHRDVSSEENRYFSVEKNEYPTKLNGKTVTTEDQTQKRREDYYMPRLDKFV------T 175

Query: 172 ESADFSLSPEDAMKRYGVNYNGKEQ-IIGFRAGAGTTSTILNGKQY-------------- 216
E A S + YN + + R G+G+ G Y
Sbjct: 176 EVAPIEASTASS---DAGTYNDQNKYPAFVRLGSGSQFIYKKGDNYSLILNNHEVGGNNL 232

Query: 217 -LFGQNYNPDLLSASLFNLDWKNKSYIYT--------------NRTPFKNSPIFGDSGSG 261
L G Y ++ + + ++ +N I ++ P N + GDSGS
Sbjct: 233 KLVGDAYTY-GIAGTPYKVNHENNGLIGFGNSKEEHSDPKGILSQDPLTNYAVLGDSGSP 291

Query: 262 SYLYDKEQQKWVFHGVTSTVGFISSTNIAWTNYSLFNNILVNNLKKNFTNTMQLDGKKQE 321
++YD+E+ KW+F G + +W ++++ + ++ + + K
Sbjct: 292 LFVYDREKGKWLFLGSYD--FWAGYNKKSWQEWNIYKSQFTKDVLNKDSAGSLIGSKTDY 349

Query: 322 LSSIIKD-------------------------KDLSVSGGGELTLKQDTDLGIGGLIFDK 356
S K ++ G G LTL + D G GGL F+
Sbjct: 350 SWSSNGKTSTITGGEKSLNVDLADGKDKPNHGKSVTFEGSGTLTLNNNIDQGAGGLFFEG 409

Query: 357 NQTYKVYGKDKSYKGAGIDIDNNTTVEWNVKGVAGDNLHKIGSGTLDVKIAQGN--NLKI 414
+ K + ++KGAG+ + TV W V D L KIG GTL V+ N +LK+
Sbjct: 410 DYEVKGTSDNTTWKGAGVSVAEGKTVTWKVHNPQYDRLAKIGKGTLIVEGTGDNKGSLKV 469

Query: 415 GNGTVIL------SAEKAFNKIYMAGGKGTVKINAKDALSESGNGEIYFTRNGGTLDLNG 468
G+GTVIL S + AF + + G+ T+ +N + + IYF GG LDLNG
Sbjct: 470 GDGTVILKQQTNGSGQHAFASVGIVSGRSTLVLNDDKQVDPNS---IYFGFRGGRLDLNG 526

Query: 469 YDQSFQKIAATDAGTTVTNSNVKQ-STLSLT 498
+F I D G + N N+ S +++T
Sbjct: 527 NSLTFDHIRNIDDGARLVNHNMTNASNITIT 557


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_41662FE2SRDCTASE260.036 Ferric iron reductase signature.
		>2FE2SRDCTASE#Ferric iron reductase signature.

Length = 262

Score = 25.8 bits (56), Expect = 0.036
Identities = 12/31 (38%), Positives = 17/31 (54%)

Query: 18 VACAWLTVCERQHRYPHLTLESLEAAIAAEL 48
VAC W+ VCE ++ PH +E I+ L
Sbjct: 135 VACFWVDVCEDKNATPHSPQHRMETLISQAL 165


61SBO_4182SBO_4194Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_4182-219-3.143427valyl-tRNA synthetase
SBO_4183025-6.810814IS1 encoded protein
SBO_4184-223-6.548925IS1 ORF2
SBO_4185-122-6.938974hypothetical protein
SBO_4186-120-5.523171IS1 ORF2
SBO_4187022-6.801383hypothetical protein
SBO_4188-216-2.773867ornithine carbamoyltransferase subunit I
SBO_4189129-9.015943hypothetical protein
SBO_4190131-8.600120acetyltransferase
SBO_4191135-9.221791hypothetical protein
SBO_4192134-8.123903IS1 encoded protein
SBO_4193230-5.784754IS1 ORF2
SBO_4194334-6.838056hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_4190SACTRNSFRASE270.021 Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 26.8 bits (59), Expect = 0.021
Identities = 12/33 (36%), Positives = 15/33 (45%)

Query: 97 PAIRGKGLAKKLALMAMEQAREMGFKRCYLETT 129
R KG+ L A+E A+E F LET
Sbjct: 99 KDYRKKGVGTALLHKAIEWAKENHFCGLMLETQ 131


62SBO_4270SBO_4280Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_4270119-3.393220hypothetical protein
SBO_4271015-1.929320hypothetical protein
SBO_42723130.492083hypothetical protein
SBO_42733130.920432alpha helical protein
SBO_42742181.334512IS1 encoded protein
SBO_42752181.430510IS1 ORF2
SBO_42763211.34171523S rRNA (guanosine-2'-O-)-methyltransferase
SBO_42773221.606134exoribonuclease R
SBO_42784221.338802transcriptional repressor NsrR
SBO_42794241.428689adenylosuccinate synthetase
SBO_42802181.331457hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_4273PHPHTRNFRASE320.001 Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase

signature.
Length = 572

Score = 32.4 bits (74), Expect = 0.001
Identities = 22/122 (18%), Positives = 47/122 (38%), Gaps = 12/122 (9%)

Query: 85 VNPSLINEVAEEIARLENLITAEEQVLSNLEVSRDGVEKAVTATAQRIAQFEQQMEVVKA 144
+ + I +V+ EI +L A E+ L +D E ++ A I F + V+
Sbjct: 29 IEKTSITDVSTEIEKLT---AALEKSKEELRAIKDQTEASMGADKAEI--FAAHLLVLDD 83

Query: 145 TEAMQRAQQAVTTSTVGASSSVSTAAESLKRLQTRQAERQARLDAAAQLEKVADGRDLDE 204
E + + + + A ++ ++ +D E+ AD RD+ +
Sbjct: 84 PELVDGIKGKIENEQMNAEYALKEVSD-------MFVSMFESMDNEYMKERAADIRDVSK 136

Query: 205 KL 206
++
Sbjct: 137 RV 138


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_4277RTXTOXIND310.026 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 30.6 bits (69), Expect = 0.026
Identities = 12/55 (21%), Positives = 24/55 (43%), Gaps = 1/55 (1%)

Query: 165 VVPDDSRLSFDILIPPDQIMGARMGFVVVVELTQRPTRRTKAV-GKIVEVLGDNM 218
+VP+D L L+ I +G ++++ P R + GK+ + D +
Sbjct: 359 IVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAI 413


63SBO_4330SBO_4365Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_4330225-2.826015hypothetical protein
SBO_4331428-0.396841hypothetical protein
SBO_43324240.686458IS1 encoded protein
SBO_43333231.449946IS1 ORF2
SBO_43344201.679793insertion element IS2 transposase InsD
SBO_43354190.227106hypothetical protein
SBO_43364180.892122membrane transport protein
SBO_43373150.452534siderophore biosynthesis protein
SBO_43383150.345130siderophore biosynthesis protein
SBO_4339415-0.118768siderophore biosynthesis protein
SBO_4340417-0.688478lysine:N6-hydroxylase
SBO_43414180.938796ferric siderophore receptor
SBO_43423211.114086protein encoded within IS
SBO_43444221.079540protein encoded within IS
SBO_43433260.683092hypothetical protein
SBO_43455261.427293IS629 ORF1
SBO_43465281.200498IS629 ORF2
SBO_43476290.276820IS1 encoded protein
SBO_43487280.551873IS1 ORF2
SBO_43497270.855131IS2 ORF2
SBO_43506240.095309IS2 ORF2
SBO_4351624-0.576113insertion sequence 2 OrfA protein
SBO_4352624-0.563194bifunctional enterobactin receptor/adhesin
SBO_4353020-1.539669hypothetical protein
SBO_4354-121-2.345601IS911 ORF2
SBO_4355-125-4.956043hypothetical protein
SBO_4356027-5.518076IS1-encoded protein
SBO_4357127-5.976772IS1 ORF2
SBO_4358230-6.559815hypothetical protein
SBO_4359131-6.423957hypothetical protein
SBO_4360231-6.468955N-acetylneuraminic acid mutarotase
SBO_4361232-6.225809hypothetical protein
SBO_4362232-5.020498tyrosine recombinase
SBO_4363129-3.699239tyrosine recombinase
SBO_4364225-3.120174major type 1 subunit fimbrin
SBO_4365324-2.406830fimbrial protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_4331SACTRNSFRASE260.012 Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 26.4 bits (58), Expect = 0.012
Identities = 9/28 (32%), Positives = 16/28 (57%)

Query: 32 LAIIEHTDVDESLKGQGIGKQLVAKVVE 59
A+IE V + + +G+G L+ K +E
Sbjct: 89 YALIEDIAVAKDYRKKGVGTALLHKAIE 116


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_4336TCRTETA461e-07 Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 46.4 bits (110), Expect = 1e-07
Identities = 79/375 (21%), Positives = 131/375 (34%), Gaps = 41/375 (10%)

Query: 20 FSAGLLGIGQNGLLVVLPVLVIQTNLSLSV---WAALLMLGSMLFLPSSPWWGKQISRTG 76
+ L +G ++ VLP L+ S V + LL L +++ +P G R G
Sbjct: 12 STVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFG 71

Query: 77 SKPVVLWALGGYGISFTLLGLGSVLMATSAITTAVGLGILIIARIAYGLTVSAMVPACQV 136
+PV+L +L G + + ++ L +L I RI G+T + A
Sbjct: 72 RRPVLLVSLAGAAVDYAIMATAPFLW------------VLYIGRIVAGITGATGAVAGAY 119

Query: 137 WALQRAGEGNRMAALATISSGLSCGRLFGPLCAAAMLAIHPLAPLGLLMAAPVLALLMLL 196
A R +S+ G + GP+ M P AP A L L
Sbjct: 120 IA-DITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGC 178

Query: 197 RL------PGTPPQPTPECKSVSLKRDCLPYLLCAILLAAAVSMMQLGLSPA-LTRQFVT 249
L P ++ R + A L+A M +G PA L F
Sbjct: 179 FLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGE 238

Query: 250 DTTAISQQVAWLLGLSAVAALIAQFGVVRPQRLTP-----VALLLSAGVLMSGGLAIMLS 304
D + L+A L + + + AL+L +G + + +
Sbjct: 239 DRFHWDATTIG-ISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFA 297

Query: 305 EQLWLFYPGCAVLSFGAALATPAYQLLLNDKLADGAGAGWLATSHTLGYGLCALLVPLVS 364
+ W+ +P +L+ G + PA Q +L+ + D G L L L S
Sbjct: 298 TRGWMAFPIMVLLASG-GIGMPALQAMLS-RQVDEERQGQLQ----------GSLAALTS 345

Query: 365 KTGVAIALIMAALFA 379
T + L+ A++A
Sbjct: 346 LTSIVGPLLFTAIYA 360


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_4337PF04183339e-111 IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 339 bits (872), Expect = e-111
Identities = 104/480 (21%), Positives = 178/480 (37%), Gaps = 46/480 (9%)

Query: 56 ELLIPLDEQKSLHFRVAYFSPTQHHRF-----AFPARLVTASGSYPVDFTTLSRLIIDKL 110
E + + Q + + P RF + + A D L++ ++ +L
Sbjct: 24 EQVFHAESQGDDRYCIN--LPGAQWRFIAERGIWGWLWIDAQTLRCADEPVLAQTLLMQL 81

Query: 111 RHQLFLPVPLCETFHQRVLESHVHTQQAIDARHDWAALREKALNFGEAEQALLTGHAFHP 170
+ L + Q + + + Q + AR +A LN + Q LL+GH
Sbjct: 82 KQVLSMSDATVAEHMQDLYATLLGDLQLLKARRGLSASDLINLNA-DRLQCLLSGHPKFV 140

Query: 171 APKSHEPFNRREAERYLPDMAPHFPLRWFSVDKTQIAGES-LHLNLQQRLTRFAAENAPQ 229
K + + ERY P+ A F L W +V + + +++ Q LT A PQ
Sbjct: 141 FNKGRRGWGKEALERYAPEYANTFRLHWLAVKREHMIWRCDNEMDIHQLLT---AAMDPQ 197

Query: 230 LLNELS--------DNQWLF-PLHPWQGEYLLQQGWCQALVAKGLIKDLGEAGTSWLPTT 280
S D+ WL P+HPWQ + + + A+G + LGE G WL
Sbjct: 198 EFARFSQVWQENGLDHNWLPLPVHPWQWQQKIATDFIADF-AEGRMVSLGEFGDQWLAQQ 256

Query: 281 SSRSLYCATSRD--MIKFSLSVRLTNSIRTLSVKEVKRGMRLARLAQ----TDGWQMLQ- 333
S R+L A+ R IK L++ T+ R + + + G +R Q TD +
Sbjct: 257 SLRTLTNASRRGGLDIKLPLTIYNTSCYRGIPGRYIAAGPLASRWLQQVFATDATLVQSG 316

Query: 334 ---VRFPTFRVMQEDGWAGLLDLNGNIMQESLFALRENLLVDQPKSQTNVLVSLTQAAPD 390
+ P + +G+A L + REN ++ VL++ +
Sbjct: 317 AVILGEPAAGYVSHEGYAALARAPYRYQEMLGVIWRENPCRWLKPDESPVLMATLMECDE 376

Query: 391 GGDSLLVSAVKRLSDRLGITVQQAAHAWVDAYCQQVLKPLFTAEADYGLVLLAHQQNILV 450
L + + DR G+ A W+ + V+ PL+ YG+ L+AH QNI +
Sbjct: 377 NNQPLAGAYI----DRSGLD----AETWLTQLFRVVVVPLYHLLCRYGVALIAHGQNITL 428

Query: 451 QMLGDLPVGFIYRDCQGSAFMPHATDWLDSIGEAQAENIFTHEQLLRYFPYYLLVNSTFA 510
M +P + +D QG M + + E + L++
Sbjct: 429 AMKEGVPQRVLLKDFQGD--MRLVKEEFPEMDSLPQE----VRDVTSRLSADYLIHDLQT 482


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_4339PF041838130.0 IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 813 bits (2101), Expect = 0.0
Identities = 563/580 (97%), Positives = 569/580 (98%)

Query: 1 MNHKDWDFVNRRLVAKMLSEMEYEQVFHAESQGDDHYCINLPGAQWRFIAERGIWGWLWI 60
MNHKDWD VNRRLVAKMLSE+EYEQVFHAESQGDD YCINLPGAQWRFIAERGIWGWLWI
Sbjct: 1 MNHKDWDLVNRRLVAKMLSELEYEQVFHAESQGDDRYCINLPGAQWRFIAERGIWGWLWI 60

Query: 61 DAQTLRCTDEPVLAQTLLMQLKPVLSMSDATVAEHMQDLYATLLGDLQLLKARRGLSASD 120
DAQTLRC DEPVLAQTLLMQLK VLSMSDATVAEHMQDLYATLLGDLQLLKARRGLSASD
Sbjct: 61 DAQTLRCADEPVLAQTLLMQLKQVLSMSDATVAEHMQDLYATLLGDLQLLKARRGLSASD 120

Query: 121 LINLDADRLQCLLSGHPKFVFNKGRRGWGKEALERYAPEYTNTFRLHWLAVKREHMIWRC 180
LINL+ADRLQCLLSGHPKFVFNKGRRGWGKEALERYAPEY NTFRLHWLAVKREHMIWRC
Sbjct: 121 LINLNADRLQCLLSGHPKFVFNKGRRGWGKEALERYAPEYANTFRLHWLAVKREHMIWRC 180

Query: 181 DNDLDIQQLLTAAMDPQEFTRFSQVWQENGLDHNWLPLPVHPWQWQQKIATDFIADFVEG 240
DN++DI QLLTAAMDPQEF RFSQVWQENGLDHNWLPLPVHPWQWQQKIATDFIADF EG
Sbjct: 181 DNEMDIHQLLTAAMDPQEFARFSQVWQENGLDHNWLPLPVHPWQWQQKIATDFIADFAEG 240

Query: 241 RMVSLGEFGDQWLAQQSLRTLTNASRRGGLDIKLPLTIYNTSCYRGIPGRYIAAGPLASR 300
RMVSLGEFGDQWLAQQSLRTLTNASRRGGLDIKLPLTIYNTSCYRGIPGRYIAAGPLASR
Sbjct: 241 RMVSLGEFGDQWLAQQSLRTLTNASRRGGLDIKLPLTIYNTSCYRGIPGRYIAAGPLASR 300

Query: 301 WLQQVFATDATLVQSGAVILGEPAAGYVSHEGYAALARAPYRYQEMLGVIWRENPCRWLK 360
WLQQVFATDATLVQSGAVILGEPAAGYVSHEGYAALARAPYRYQEMLGVIWRENPCRWLK
Sbjct: 301 WLQQVFATDATLVQSGAVILGEPAAGYVSHEGYAALARAPYRYQEMLGVIWRENPCRWLK 360

Query: 361 PDESPVLMATLMECDENNQPLAGAYIDRSGLDAETWLTQLFRVVVVPLYHLLCRYGVALI 420
PDESPVLMATLMECDENNQPLAGAYIDRSGLDAETWLTQLFRVVVVPLYHLLCRYGVALI
Sbjct: 361 PDESPVLMATLMECDENNQPLAGAYIDRSGLDAETWLTQLFRVVVVPLYHLLCRYGVALI 420

Query: 421 AHGQNITLAMKEGVPQRVLLKDFQGDMRLVKEAFPEMDSLPQEVRDATSRLSADYLIHDL 480
AHGQNITLAMKEGVPQRVLLKDFQGDMRLVKE FPEMDSLPQEVRD TSRLSADYLIHDL
Sbjct: 421 AHGQNITLAMKEGVPQRVLLKDFQGDMRLVKEEFPEMDSLPQEVRDVTSRLSADYLIHDL 480

Query: 481 QTGHFVTVLRFISPLMVRLGVPERRFYQLLAAVLSDYMNKHPQMAERFALFSLFRPQIIR 540
QTGHFVTVLRFISPLMVRLGVPERRFYQLLAAVLSDYM KHPQM+ERFALFSLFRPQIIR
Sbjct: 481 QTGHFVTVLRFISPLMVRLGVPERRFYQLLAAVLSDYMKKHPQMSERFALFSLFRPQIIR 540

Query: 541 VVLNPVKLTWPDLDGGSRMLPNYLENLQNPLWLVTQEYES 580
VVLNPVKLTWPDLDGGSRMLPNYLE+LQNPLWLVTQEYES
Sbjct: 541 VVLNPVKLTWPDLDGGSRMLPNYLEDLQNPLWLVTQEYES 580


64SBO_4379SBO_4410Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_4379219-0.987085isoaspartyl dipeptidase
SBO_4380323-2.484028hypothetical protein
SBO_4381222-1.988370hypothetical protein
SBO_4382319-1.704575hypothetical protein
SBO_4383219-0.852295transporter
SBO_4384218-1.603289hypothetical protein
SBO_43852180.314617hypothetical protein
SBO_4387-1160.605043hypothetical protein
SBO_4388-117-0.753561hypothetical protein
SBO_43890190.378920transporter
SBO_43902220.216441hypothetical protein
SBO_43912221.041099IS629 ORF1
SBO_43922211.028259IS629 ORF2
SBO_43931190.896517penicillin G acylase
SBO_43943212.241766ISSfl2 ORF
SBO_43951151.458203IS629 ORF2
SBO_43960141.156814IS629 ORF1
SBO_43971212.037329protein encoded within IS
SBO_43980211.969875protein encoded within IS
SBO_43990242.071360hypothetical protein
SBO_44000242.416304GTP-binding protein YjiA
SBO_44010243.008301hypothetical protein
SBO_44020253.413005carbon starvation protein
SBO_44030233.2857984-hydroxyphenylacetate 3-monooxygenase coupling
SBO_4404-1253.3287264-hydroxyphenylacetate 3-monooxygenase
SBO_4406-2274.4425084-hydroxyphenylacetate permease
SBO_4407-1284.8367192,4-dihydroxyhept-2-ene-1,7-dioic acid aldolase
SBO_44080263.7275122-oxo-hepta-3-ene-1,7-dioic acid hydratase
SBO_4409-1203.1443055-carboxymethyl-2-hydroxymuconate
SBO_4410-2193.011542homoprotocatechuate dyoxygenase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_4379UREASE354e-04 Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 35.1 bits (81), Expect = 4e-04
Identities = 21/85 (24%), Positives = 37/85 (43%), Gaps = 20/85 (23%)

Query: 26 CDVLVANGKIIAVASNIPSDIVPNCT--------VVDLSGQILCPGFIDQHVHLIGGGGE 77
D+ + +G+I A+ D+ P T V+ G+I+ G +D H+H I
Sbjct: 86 ADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGKIVTAGGMDSHIHFI----- 140

Query: 78 AGPTTRTPEVALSRLTEAGVTSVVG 102
+ E AL +G+T ++G
Sbjct: 141 ---CPQQIEEALM----SGLTCMLG 158


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_4383TCRTETA290.033 Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 29.0 bits (65), Expect = 0.033
Identities = 62/290 (21%), Positives = 106/290 (36%), Gaps = 22/290 (7%)

Query: 82 RPFLLVSALARGLLILAMAWLPPFILVLLIRFLAGV-----ASAGMLIFGSTLIMQHTRH 136
RP LLVS + MA P ++ + R +AG+ A AG I T + RH
Sbjct: 73 RPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARH 132

Query: 137 PFVLAALFSGVGIALGNEYVLAGLHFAFSSQTLWQGAGALSGMMLIALTLLMP-SKKHAI 195
++A F G G+ G VL GL FS + A AL+G+ + L+P S K
Sbjct: 133 FGFMSACF-GFGMVAGP--VLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGER 189

Query: 196 APMPLAKTEQQIMSWW---------LLAILYGLAGFGYIIVATYLPLMAKDAGSPLLTAH 246
P+ W L+A+ + + G + A ++ T
Sbjct: 190 RPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIG 249

Query: 247 LWTLVGLSIVPGCFGWLWA---AKRWGALPCLTANLLVQAISVLLTLASDSPLLLIISSI 303
+ +L I+ + A R G L ++ +L + + +
Sbjct: 250 I-SLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMV 308

Query: 304 GFGGTFMGTTSLVMTIARQLSVPGNLNLLGFVTLIYGIGQILGPALTSML 353
+G +L ++RQ+ L G + + + I+GP L + +
Sbjct: 309 LLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAI 358


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_4384ADHESNFAMILY290.022 Adhesin family signature.
		>ADHESNFAMILY#Adhesin family signature.

Length = 309

Score = 29.1 bits (65), Expect = 0.022
Identities = 10/45 (22%), Positives = 16/45 (35%)

Query: 54 LFAIVAVCTFFVQSCARKSNHAASFQNYHATIDGKEIAGITNNIS 98
++ + + +CA S Q IA IT NI+
Sbjct: 6 TLLVLFLSAIILVACASGKKDTTSGQKLKVVATNSIIADITKNIA 50


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_4389TCRTETB507e-09 Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 50.3 bits (120), Expect = 7e-09
Identities = 46/189 (24%), Positives = 76/189 (40%), Gaps = 5/189 (2%)

Query: 7 RHAATLFFPMALILYDFAAYLSTDLIQPGIINVVRDFNADVSLAPAAVSLYLAGGMALQW 66
RH L + L + + ++ P I N A + A L + G A+
Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAV-- 68

Query: 67 LLGPLSDRIGRRPVLITGALIFTLACAATMFTTSMTQFLI-ARAIQGTSICFIATVGYVT 125
G LSD++G + +L+ G +I S LI AR IQG + V
Sbjct: 69 -YGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVV 127

Query: 126 VQEAFGQTKGIKLMAIITSIVLIAPIIGPLSGAALMHFVHWKVLFAIIAVMGFISFVGLL 185
V + K +I SIV + +GP G + H++HW L +I ++ I+ L+
Sbjct: 128 VARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLL-LIPMITIITVPFLM 186

Query: 186 LAMPETVKR 194
+ + V+
Sbjct: 187 KLLKKEVRI 195


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_4406TCRTETA290.027 Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 29.4 bits (66), Expect = 0.027
Identities = 69/407 (16%), Positives = 120/407 (29%), Gaps = 45/407 (11%)

Query: 40 LFVLFIFSFLDRINIGFAGL---TMGRDLGLS---ATMFGLATTLFYAAYVIFGIPSNIM 93
L V+ LD + IG + RDL S +G+ L+ +
Sbjct: 7 LIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66

Query: 94 LSIVGARRWIATIMVLWGIASTATMFATGPT--SLYVLRILVGITEAGFLPGILLYLTFW 151
G R ++ L G A + AT P LY+ RI+ GIT A Y+
Sbjct: 67 SDRFGRR--PVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATG-AVAGAYIADI 123

Query: 152 FPAYFRARANALFMVAMPVTTALGSIVSGYILSLDGVMALKGWQWLFLLEGFPSVLLGVM 211
RAR G ++ G M F + L +
Sbjct: 124 TDGDERARHFGFMSACFGFGMVAGPVLGGL-------MGGFSPHAPFFAAAALNGLNFLT 176

Query: 212 VWFWLDDSPDKAKWLTKEDKKCLQEMMDNDRLTLVQPEGAISHHAMQQRSMWREIFTPVV 271
F L +S + R + P + W T V
Sbjct: 177 GCFLLPESHKGER--------------RPLRREALNPLASFR---------WARGMTVVA 213

Query: 272 MMYTLAYFCLTNTLSAISIWTPQILQSFNQGSSNITIGLLAAVPQICTILGMVYWSRHSD 331
+ + + ++W F+ TIG+ A I L +
Sbjct: 214 ALMAVFFIMQLVGQVPAALWVIFGEDRFHW--DATTIGISLAAFGILHSLAQAMITGPVA 271

Query: 332 RRQERRHHTALPYLFAAAGWLLASATDHNMIQMLGIIMASTGSFSAMAIFWTTPDQSISL 391
R R L + G++L + + +++ ++G A+ Q
Sbjct: 272 ARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQAMLSRQVDEE 331

Query: 392 RARAIGIAVINATGNIGSALSPFMIGWLKDLT-GSFNSGLWFVAALL 437
R + ++ T ++ S + P + + + ++N W A L
Sbjct: 332 RQGQLQGSLAALT-SLTSIVGPLLFTAIYAASITTWNGWAWIAGAAL 377


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_4407PHPHTRNFRASE300.010 Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase

signature.
Length = 572

Score = 30.1 bits (68), Expect = 0.010
Identities = 32/182 (17%), Positives = 59/182 (32%), Gaps = 34/182 (18%)

Query: 78 QIKQLLDVGTQT---LLVPMVQNADEAREAVRATRYPPAGIRGVGSALARASRWNRIPDY 134
Q++ LL T ++ PM+ +E R+A + + G ++ +
Sbjct: 374 QLRALLRASTYGNLKVMFPMIATLEELRQAKAIMQEEKDKLLSEGVDVSDSIEVG----- 428

Query: 135 LQKANDQMCVLVQIETREAMKNLPQILDVEGVDGVFIGPADL----------SADMGYAG 184
+ +E + VD IG DL + + Y
Sbjct: 429 -----------IMVEIPSTAVAANLFA--KEVDFFSIGTNDLIQYTMAADRMNERVSYLY 475

Query: 185 NPQHPEVQAAIEQAIVQIRESGKAPGI---LIANEQLAKRYLELGALFVAVGVDTTLLAR 241
P HP + ++ I GK G+ + +E L LG ++ + L AR
Sbjct: 476 QPYHPAILRLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLDEFSMSATSILPAR 535

Query: 242 AA 243
+
Sbjct: 536 SQ 537


65SBO_0290SBO_0294N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_02901121.286442MFS transport protein AraJ
SBO_02910121.570429exonuclease subunit SbcC
SBO_0292-1111.328026exonuclease subunit SbcD
SBO_02931131.467758transcriptional regulator PhoB
SBO_02941130.984779phosphate regulon sensor protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0290TCRTETA491e-08 Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 49.4 bits (118), Expect = 1e-08
Identities = 73/356 (20%), Positives = 125/356 (35%), Gaps = 35/356 (9%)

Query: 5 ILSLALGTFGLGMAEFGIMGVLTELAHNVGISIPAAGH---MISYYALGVVVGAPIIALF 61
+ ++AL G+G+ IM VL L ++ S H +++ YAL AP++
Sbjct: 11 LSTVALDAVGIGL----IMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66

Query: 62 SSRYSLKHILLFLVALCVIGNAMFTLSSSYLMLAIGRLVSGFPHGAFFGVGAIVLSKIIK 121
S R+ + +LL +A + A+ + +L IGR+V+G GA + I
Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIAD--IT 124

Query: 122 PGKVTAAVAGMVSGMTVANLLGIPLGTYLSQEFSWRYTFLLIAVFNIAVMASVYFWVPDI 181
G A G +S ++ P+ L FS F A N + F +P+
Sbjct: 125 DGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPES 184

Query: 182 RDEAKGKLREQ----------FHFLRSPAPWLI--FAATMFGNAGVFAWFSYVKPYMMFI 229
+ LR + + A + F + G W +
Sbjct: 185 HKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFG------E 238

Query: 230 SGFSETAMTFIMMLVGLGM---VLGNMLSGRISGRYSPLRIAAVTDFIIVLALLMFFFCG 286
F A T + L G+ + M++G ++ R R + ++ F
Sbjct: 239 DRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFAT 298

Query: 287 GMKTTSLIFAFICCAGLFALSAPLQILLLQNAKGGELLGAAGGQIAF--NLGSAVG 340
I + G+ LQ +L + E G G +A +L S VG
Sbjct: 299 RGWMAFPIMVLLASGGIG--MPALQAMLSRQV-DEERQGQLQGSLAALTSLTSIVG 351


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0291IGASERPTASE405e-05 IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 40.0 bits (93), Expect = 5e-05
Identities = 40/264 (15%), Positives = 81/264 (30%), Gaps = 11/264 (4%)

Query: 162 LNAKPKERAELLEELTGTEIYGQISAMVFEQHKSARTELEKLQAQASGVALLTPEQVQSL 221
A P E E + E + Q S V + + A + + A Q+
Sbjct: 1029 APATPSETTETVAENSK-----QESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTN 1083

Query: 222 TASLQVLTDEEKQLLTAQQQEQQSLNWLTRLD-ELQQEASRRQQALQQALAEEEKAQPQL 280
+ +E Q ++ +++ E QE + + + E QPQ
Sbjct: 1084 EVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQA 1143

Query: 281 AALSLAQPARNLRPHWE---RIAEHSAALAHTRQQIEEVNTRLQSTMALRASIRHHAAKQ 337
P N++ A+ T +E+ T + + + +
Sbjct: 1144 EPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTT 1203

Query: 338 SAELQQQQQSLNTWLQEHDRFRQWNNELAGWRAQFSQQTSDREHLRQWQQQLTHAEQKLN 397
A Q S ++ ++ R + + ++DR + T+ L+
Sbjct: 1204 PATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPA-TTSSNDRSTVALCDLTSTNTNAVLS 1262

Query: 398 ALAAITLTLTADEVATALAQHAEQ 421
A + + V A++QH Q
Sbjct: 1263 DARAKAQFVALN-VGKAVSQHISQ 1285



Score = 35.4 bits (81), Expect = 0.001
Identities = 29/139 (20%), Positives = 51/139 (36%), Gaps = 13/139 (9%)

Query: 738 QQDVLAAQSLQKAQAQFDTALQASVFDDQQAFLAALMDEQTLTQLEQLKQNLENQRRQAQ 797
Q DV + S + A+ D A A E T T E KQ + + Q
Sbjct: 1004 QADVPSVPSNNEEIARVDEAPVPPP-------APATPSETTETVAENSKQESKTVEKNEQ 1056

Query: 798 TLVTQTAETLAQHQQHRPDGLALTVTVEQIQQELAQTHQKLRENTTSQGEIRQQLKQDAD 857
TA+ ++ + + A T T E Q ++T + T + ++ K +
Sbjct: 1057 DATETTAQNREVAKEAKSNVKANTQTNEVAQSG-SETKETQTTETKETATVEKEEKAKVE 1115

Query: 858 NRQQQQTLMQQIAQMTQQV 876
+ Q++ ++T QV
Sbjct: 1116 TEKT-----QEVPKVTSQV 1129


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0292FRAGILYSIN300.022 Fragilysin metallopeptidase (M10C) enterotoxin signat...
		>FRAGILYSIN#Fragilysin metallopeptidase (M10C) enterotoxin

signature.
Length = 405

Score = 29.7 bits (66), Expect = 0.022
Identities = 13/70 (18%), Positives = 23/70 (32%), Gaps = 4/70 (5%)

Query: 149 KQQHLLAAITDYYQQHYADACKLRGDQPLPIIATGHLTTVGASKSDAVRDIYIGTLDAFP 208
K+ ++ I ++Y + + + I T D + + I A
Sbjct: 135 KEAQMMNEIAEFYAAPFKKTRAINEKEAFECI-YDSRTRSA--GKD-IVSVKINIDKAKK 190

Query: 209 AQNFPPADYI 218
N P DYI
Sbjct: 191 ILNLPECDYI 200


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0293HTHFIS951e-24 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 94.9 bits (236), Expect = 1e-24
Identities = 33/149 (22%), Positives = 62/149 (41%), Gaps = 9/149 (6%)

Query: 4 RILVVEDEAPIREMVCFVLEQNGFQPVEAEDYDSAVNQLNEPWPDLILLDWMLPGGSGIQ 63
ILV +D+A IR ++ L + G+ + + + DL++ D ++P +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 64 FIKHLKRESMTRDIPVVMLTARGEEEDRVRGLETGADDYITKPFSPKELVARIKAVMRRI 123
+ +K+ D+PV++++A+ ++ E GA DY+ KPF EL+ I +
Sbjct: 65 LLPRIKKARP--DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA-- 120

Query: 124 SPMAVEEVIEMQGLSLDPTSHRVMAGEEP 152
E L D + G
Sbjct: 121 -----EPKRRPSKLEDDSQDGMPLVGRSA 144


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0294PF06580340.001 Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 34.1 bits (78), Expect = 0.001
Identities = 19/105 (18%), Positives = 33/105 (31%), Gaps = 26/105 (24%)

Query: 325 LVYNAVNH----TPEGTHITVRWQRVPHGAEFSVEDNGPGIAPEHIPRLTERFYRVDKAR 380
LV N + H P+G I ++ + VE+ G
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKN---------------- 306

Query: 381 SRQTGGSGLGLAIVKHAVNH---HESRLNIESTVGKGTRFSFVIP 422
+G GL V+ + E+++ + GK +IP
Sbjct: 307 --TKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAM-VLIP 348


66SBO_0327SBO_0334N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_0327020-0.505716muropeptide transporter
SBO_0328226-0.857229hypothetical protein
SBO_0329326-0.913309transcriptional regulator BolA
SBO_0330226-0.630870trigger factor
SBO_0331019-0.329651ATP-dependent Clp protease proteolytic subunit
SBO_0332019-0.604079ATP-dependent protease ATP-binding subunit ClpX
SBO_0333017-0.627820DNA-binding ATP-dependent protease La
SBO_0334012-0.472836transcriptional regulator HU subunit beta
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0327TCRTETA393e-05 Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 39.0 bits (91), Expect = 3e-05
Identities = 71/347 (20%), Positives = 135/347 (38%), Gaps = 20/347 (5%)

Query: 62 KFLWSPLMDRYTPPFFGRRRGWLLATQILLLVAIAAMGFLEPGTQLRWMAALAVVIAFCS 121
+F +P++ + F RR LL + V A M W+ + ++A +
Sbjct: 56 QFACAPVLGALSDRF--GRRPVLLVSLAGAAVDYAIMAT----APFLWVLYIGRIVAGIT 109

Query: 122 ASQDIVFDAWKTDVLPAEERGAGAAISVLGYRLGMLVSGGLALWLADKWLGWQGMYWLMA 181
+ V A+ D+ +ER + GM+ L + ++ A
Sbjct: 110 GATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGG--FSPHAPFFAAA 167

Query: 182 AL-LIPCIIATLLAPEP--TDTIPVPKTLEQAVVAPLRDFFGRNNAWLILLLIVLYKLGD 238
AL + + L PE + P+ + + + A L+ + ++ +G
Sbjct: 168 ALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQ 227

Query: 239 AFAMSLTTTFLIRGVGFDAGEVGVVNKTLGLLATIVGALYGGILMQRLSLFRALLIFGIL 298
A +L F +DA +G+ G+L ++ A+ G + RL RAL+ G++
Sbjct: 228 VPA-ALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALM-LGMI 285

Query: 299 QGASNAGYWLLSITDKHLYSMGAAVFFENLCGGMGTSAFVALLMTLCNKSFSATQFALLS 358
A GY LL+ + + V GG+G A A+L ++ L+
Sbjct: 286 --ADGTGYILLAFATRGWMAFPIMVLL--ASGGIGMPALQAMLSRQVDEERQGQLQGSLA 341

Query: 359 ALSAVGRVYVGPVAGWFVEAHGWSTF--YLFSVAAAVPGLILLLVCR 403
AL+++ + VGP+ + A +T+ + + AA+ L L + R
Sbjct: 342 ALTSLTSI-VGPLLFTAIYAASITTWNGWAWIAGAALYLLCLPALRR 387


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0328PF06291270.026 Lambda prophage Bor protein
		>PF06291#Lambda prophage Bor protein

Length = 102

Score = 26.9 bits (59), Expect = 0.026
Identities = 11/34 (32%), Positives = 18/34 (52%)

Query: 3 KKILFPLVALFMLAGCAKPPTTIEVSPTITLPQQ 36
KK+LF ++ GCA+ T+ PT P++
Sbjct: 7 KKMLFSAALAMLITGCAQQTFTVGNKPTAVTPKE 40


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0332HTHFIS290.043 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.0 bits (65), Expect = 0.043
Identities = 16/73 (21%), Positives = 29/73 (39%), Gaps = 13/73 (17%)

Query: 60 ERSALPTPHEIRNHLDDYVIGQEQAKKVLAVAVYNHYKRLRNGDTSNGVELGKSNILLIG 119
E P+ E + ++G+ A + +Y RL D +++ G
Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQ----EIYRVLARLMQTD---------LTLMITG 167

Query: 120 PTGSGKTLLAETL 132
+G+GK L+A L
Sbjct: 168 ESGTGKELVARAL 180


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0333GPOSANCHOR350.002 Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 34.7 bits (79), Expect = 0.002
Identities = 34/133 (25%), Positives = 68/133 (51%), Gaps = 15/133 (11%)

Query: 191 ERLEYLMAMMESEIDLLQVEKRIRNRVKKQMEKSQREYYLNEQMKAIQKELGEMDDAPD- 249
LE A +E + +L R +++ ++ S+ +Q++A ++L E + +
Sbjct: 291 AALEAEKADLEHQSQVLNAN---RQSLRRDLDASREAK---KQLEAEHQKLEEQNKISEA 344

Query: 250 ENEALKRKIDAAKMPKEAKEKAEAELQKLKMMSPMS-AEATVVRGYIDWMVQVPWNARSK 308
++L+R +DA++ EAK++ EAE QKL+ + +S A +R +D + A+ +
Sbjct: 345 SRQSLRRDLDASR---EAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASRE----AKKQ 397

Query: 309 VKKDLRQAQEILD 321
V+K L +A L
Sbjct: 398 VEKALEEANSKLA 410


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0334DNABINDINGHU1173e-38 Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 117 bits (294), Expect = 3e-38
Identities = 49/88 (55%), Positives = 67/88 (76%)

Query: 2 NKSQLIDKIAAGADISKAAAGRALDAIIASVTESLKEGDDVALVGFGTFAVKERAARTGR 61
NK LI K+A +++K + A+DA+ ++V+ L +G+ V L+GFG F V+ERAAR GR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 62 NPQTGKEITIAAAKVPSFRAGKALKDAV 89
NPQTG+EI I A+KVP+F+AGKALKDAV
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAV 90


67SBO_0452SBO_0457N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_0452-1164.033187enterobactin exporter EntS
SBO_0453-2163.979230iron-enterobactin transporter periplasmic
SBO_0454-2184.280556isochorismate hydroxymutase
SBO_0455-2204.243546enterobactin synthase subunit E
SBO_0456-1184.0660772,3-dihydro-2,3-dihydroxybenzoate synthetase
SBO_0457-1152.1339772,3-dihydroxybenzoate-2,3-dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0452TCRTETA363e-04 Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 36.0 bits (83), Expect = 3e-04
Identities = 82/394 (20%), Positives = 146/394 (37%), Gaps = 38/394 (9%)

Query: 27 FISIVSLGLLGVAVPVQIQMMTHSTWLV---GLSVTLTGGAMFVGLMVGGVLADRYERKK 83
+ V +GL+ +P ++ + HS + G+ + L F V G L+DR+ R+
Sbjct: 15 ALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRP 74

Query: 84 VILLARGTCGIGFIGLCLNALL--PEPSLLAIYLLGLWDGFFASLGVTALLAATPALVGR 141
V+L + G ++ + P L +Y+ + G + G A A +
Sbjct: 75 VLL-------VSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVA-GAYIADITDG 126

Query: 142 ENLMQAGAITMLTVRLGSVISPMIGGLLLATGGVAWNYGLAAAGTFITLLPLLSLPALPP 201
+ + G V P++GGL+ GG + + AA L L LP
Sbjct: 127 DERARHFGFMSACFGFGMVAGPVLGGLM---GGFSPHAPFFAAAALNGLNFLTGCFLLPE 183

Query: 202 PPQPREHPLK----SLLAGFRFLLASPLVGGIALLGGLLTMAS----AVRVLYPALADNW 253
+ PL+ + LA FR+ +V + + ++ + A+ V++ D +
Sbjct: 184 SHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFG--EDRF 241

Query: 254 QMSAAEIGFLYAAIP-LGAAIGALTSGKLAHSARPGLLMLLSTLGS---FLAIGLFGLMP 309
A IG AA L + A+ +G +A ++L + ++ +
Sbjct: 242 HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGW 301

Query: 310 MWILGVVCLALFGWLSAVSSLLQYTMLQTQTPEAMLGRINGLWTAQNVTGDAIGAALLGG 369
M +V LA G ML Q E G++ G A +G L
Sbjct: 302 MAFPIMVLLASGGIGMPALQ----AMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTA 357

Query: 370 LGAMMTPVASASASGFGLLIIGVLLLLVLVELRR 403
+ A + + +G+ + L LL L LRR
Sbjct: 358 IYA----ASITTWNGWAWIAGAALYLLCLPALRR 387


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0453FERRIBNDNGPP632e-13 Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 62.7 bits (152), Expect = 2e-13
Identities = 62/285 (21%), Positives = 103/285 (36%), Gaps = 35/285 (12%)

Query: 40 HTLESQPQRIVSTSVTLTGSLLAIDAPVIASGATTPNNRVADDQGFLRQWSKVAKERKLQ 99
H P RIV+ LLA+ VAD + R W E L
Sbjct: 29 HAAAIDPNRIVALEWLPVELLLALGIVPYG---------VADTINY-RLW---VSEPPLP 75

Query: 100 RLYIG-----EPSAEAVAAQMPDLILISATGGDSALALYDQLSTIAPTLIINYDDKS--- 151
I EP+ E + P ++ SA G S + L+ IAP N+ D
Sbjct: 76 DSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPS----PEMLARIAPGRGFNFSDGKQPL 131

Query: 152 --WQSLLTQLGEITGHEKQAAERIAQFDKQLAAAKEQIKLPPQPVTAIVYTAAAHSANLW 209
+ LT++ ++ + A +AQ++ + + K + + ++
Sbjct: 132 AMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVF 191

Query: 210 TPESAQGQMLEQLGFTLAKLPTGLNASQSQGKRHDIIQLSGENLAAGLNGESLFLFAGDQ 269
P S ++L++ G NA Q + +S + LAA + + L +
Sbjct: 192 GPNSLFQEILDEYGIP--------NAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNS 243

Query: 270 KDADAIYANPLLAHLPAVQNKQVYALGTETFRLDYYSAMQVLERL 314
KD DA+ A PL +P V+ + + F SAM + L
Sbjct: 244 KDMDALMATPLWQAMPFVRAGRFQRVPAVWFYGATLSAMHFVRVL 288


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0456ISCHRISMTASE440e-159 Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 440 bits (1132), Expect = e-159
Identities = 146/299 (48%), Positives = 194/299 (64%), Gaps = 18/299 (6%)

Query: 1 MAIPKLQAYALPESHDIPQNKVDWAFELQRAALLIHDMQDYFVSFWGENCPMMEQVIANI 60
MAIP +Q Y +P + D+PQNKV W + RA LLIHDMQ+YFV + + ++ ANI
Sbjct: 1 MAIPAIQPYQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANI 60

Query: 61 AALRDYCKQHNIPVYYTAQPKEQSDEDRALLNDMWGPGLTRSPEQQKVVDRLTPDADDTV 120
L++ C Q IPV YTAQP Q+ +DRALL D WGPGL P ++K++ L P+ DD V
Sbjct: 61 RKLKNQCVQLGIPVVYTAQPGSQNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLV 120

Query: 121 LVKWRYSAFHRSPLEQMLKESGRNQLIITGVYAHIGCMTTATDAFMRDIKPFMVADALAD 180
L KWRYSAF R+ L +M+++ GR+QLIITG+YAHIGC+ TA +AFM DIK F V DA+AD
Sbjct: 121 LTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVAD 180

Query: 181 FSRDEHLMSLKYVAGRSGRVVMTEELL------PAPVPASKA-----------ALREVIL 223
FS ++H M+L+Y AGR VMT+ LL PA V + A +R+ I
Sbjct: 181 FSLEKHQMALEYAAGRCAFTVMTDSLLDQLQNAPADVQKTSANTGKKNVFTCENIRKQIA 240

Query: 224 PLLDESDEPFDDD-NLIDYGLDSVRMMALAARWRKVHGDIDFVMLAKNPTIDAWWKLLS 281
LL E+ E D +L+D GLDSVR+M L +WR+ ++ FV LA+ PTI+ W KLL+
Sbjct: 241 ELLQETPEDITDQEDLLDRGLDSVRIMTLVEQWRREGAEVTFVELAERPTIEEWQKLLT 299


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0457DHBDHDRGNASE363e-130 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 363 bits (932), Expect = e-130
Identities = 108/258 (41%), Positives = 149/258 (57%), Gaps = 20/258 (7%)

Query: 5 GKNVWVTGAGKGIGYATALAFVEAGAKVTGFD---------------QAFTQEQYPFATE 49
GK ++TGA +GIG A A GA + D +A E +P
Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP---- 63

Query: 50 VMDVADAGQVAQVCQRLLAETERLDVLINAAGILRMGATDQLSKEDWQQTFAVNVGGAFN 109
DV D+ + ++ R+ E +D+L+N AG+LR G LS E+W+ TF+VN G FN
Sbjct: 64 -ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122

Query: 110 LFQQTMNQFRRQRGGAIVTVASDAAHTPRIGMSAYGASKAALKSLALSVGLELAGSGVRC 169
+ +R G+IVTV S+ A PR M+AY +SKAA +GLELA +RC
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 170 NVVSPGSTDTDMQRTLWVSDDAEEQRIRGFGEQFKLGIPLGKIARPQEIANTILFLASDL 229
N+VSPGST+TDMQ +LW ++ EQ I+G E FK GIPL K+A+P +IA+ +LFL S
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 230 ASHITLQDIVVDGGSTLG 247
A HIT+ ++ VDGG+TLG
Sbjct: 243 AGHITMHNLCVDGGATLG 260


68SBO_0680SBO_0685N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_0680-1182.749759hypothetical protein
SBO_0681-1162.809038hypothetical protein
SBO_0682-1152.594418ABC transporter ATP-binding protein
SBO_0683-1132.748455hypothetical protein
SBO_0684-1132.482088DNA-binding transcriptional regulator
SBO_06850132.322334ATP-dependent RNA helicase RhlE
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0680ABC2TRNSPORT473e-08 ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein

signature.
Length = 262

Score = 47.2 bits (112), Expect = 3e-08
Identities = 36/146 (24%), Positives = 63/146 (43%), Gaps = 5/146 (3%)

Query: 197 AREREQGTLDQLLVSPLTTWQIFIGKAVPALIVATFQATIVLAIGIWAYQIPFAGSLALF 256
R Q T + +L + L I +G+ A A IG+ A + + L+L
Sbjct: 92 GRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGA---GIGVVAAALGYTQWLSLL 148

Query: 257 YFTMVI--YGLSLVGFGLLISSLCSTQQQAFIGVFVFMMPAILLSGYVSPVENMPVWLQN 314
Y VI GL+ G+++++L + + + P + LSG V PV+ +P+ Q
Sbjct: 149 YALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQT 208

Query: 315 LTWINPIRHFTDITKQIYLKDASLDI 340
P+ H D+ + I L +D+
Sbjct: 209 AARFLPLSHSIDLIRPIMLGHPVVDV 234


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0682PF05272320.011 Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 31.6 bits (71), Expect = 0.011
Identities = 20/90 (22%), Positives = 28/90 (31%), Gaps = 21/90 (23%)

Query: 293 TPRFEDAFIDLLGGAGTSESPLGAILHTVEGTPGETVIEAKELTKKFGDFAATDHVNFAV 352
PR E + +LG P + + + K HV +
Sbjct: 547 VPRLEKWLVHVLGKTPDDYKP-------------RRLRYLQLVGKYI----LMGHVARVM 589

Query: 353 KRGEIFG----LLGPNGAGKSTTFKMMCGL 378
+ G F L G G GKST + GL
Sbjct: 590 EPGCKFDYSVVLEGTGGIGKSTLINTLVGL 619



Score = 29.7 bits (66), Expect = 0.045
Identities = 11/23 (47%), Positives = 13/23 (56%)

Query: 34 YVTGLVGPDGAGKTTLMRMLAGL 56
Y L G G GK+TL+ L GL
Sbjct: 597 YSVVLEGTGGIGKSTLINTLVGL 619


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0683RTXTOXIND626e-13 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 62.2 bits (151), Expect = 6e-13
Identities = 41/248 (16%), Positives = 88/248 (35%), Gaps = 25/248 (10%)

Query: 94 AQAQYDLMLAGYRDEEIAQAAAAVKQAQAAYDYAQNFYNRQQGLWKSRTISA--NDLENA 151
+A+ +LA E + + + + L + N+L
Sbjct: 212 KRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVY 271

Query: 152 RSSRNQAQATLKSAQDKLRQYRSGNREQ---DIAQAKASLEQAQAQLAQAELNLQDSTLI 208
+S Q ++ + SA+++ + + + + Q ++ +LA+ E Q S +
Sbjct: 272 KSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIR 331

Query: 209 APSDGTLLTRAV-EPGTVLNEGGTVFTVSLT-RPVWVRAYVDERNLDQAQPGRKVLLYTD 266
AP + V G V+ T+ + + V A V +++ G+ ++ +
Sbjct: 332 APVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVE 391

Query: 267 GRPDKPYH---GQIGFVSPTAEFTPKTVETPDLRTDLVYRLRIVVT-------DADDALR 316
P Y G++ ++ A D R LV+ + I + + + L
Sbjct: 392 AFPYTRYGYLVGKVKNINLDA--------IEDQRLGLVFNVIISIEENCLSTGNKNIPLS 443

Query: 317 QGMPVTVQ 324
GM VT +
Sbjct: 444 SGMAVTAE 451


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0684HTHTETR736e-18 TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 73.1 bits (179), Expect = 6e-18
Identities = 33/214 (15%), Positives = 77/214 (35%), Gaps = 17/214 (7%)

Query: 13 KGEQAKKQLIAAALAQFGEYGMNATT-REIAAQAGQNIAAITYYFGSKEDLYLACAQWIA 71
+ ++ ++ ++ AL F + G+++T+ EIA AG AI ++F K DL+ +
Sbjct: 8 EAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSE 67

Query: 72 DFIGEQFRPHAEEAERLFAQPQPDRAAIRELILRACRNMIKLLTQDDTVNLSKFISREQL 131
IGE E + P + +RE+++ + + + + + F E +
Sbjct: 68 SNIGELEL---EYQAKFPGDP---LSVLREILIHVLESTVTEERRRLLMEII-FHKCEFV 120

Query: 132 SPTAAYHLVHEQVISPLHSHLTRLIAAWTGCDANDTRMILHTHALIGEILAFRLGKETIL 191
A + + + + + +A L T + + G
Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKH--CIEAKMLPADLMTRRAAIIMRGYISG----- 173

Query: 192 LRTGWTAFDEEKTELINQTVTCHIDLILQGLSQR 225
L W + + + ++ ++L+
Sbjct: 174 LMENWLFAPQSFD--LKKEARDYVAILLEMYLLC 205


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0685SECA310.014 SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 30.6 bits (69), Expect = 0.014
Identities = 20/67 (29%), Positives = 35/67 (52%), Gaps = 4/67 (5%)

Query: 246 QQVLVFTRTKHGANHLAEQLNKDGIRSAAIHG-NKSQGARTRALADFKSSDIRVLVATDI 304
Q VLV T + + ++ +L K GI+ ++ + A A A + ++ V +AT++
Sbjct: 450 QPVLVGTISIEKSELVSNELTKAGIKHNVLNAKFHANEAAIVAQAGYPAA---VTIATNM 506

Query: 305 AARGLDI 311
A RG DI
Sbjct: 507 AGRGTDI 513


69SBO_0794SBO_0802N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_0794-216-1.128588arginine 3rd transport system periplasmic
SBO_0795-2180.902775arginine transporter permease subunit ArtM
SBO_0796-2151.179382arginine transporter permease subunit ArtQ
SBO_0797-1131.431130arginine 3rd transport system periplasmic
SBO_07980132.396340arginine transporter ATP-binding subunit
SBO_07990152.428664hypothetical protein
SBO_0800-1142.495873regulator
SBO_0801-3142.268407nucleotide di-P-sugar epimerase or dehydratase
SBO_0802-3121.604153dTDP-glucose enzyme
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0794ECOLNEIPORIN280.038 E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 27.8 bits (62), Expect = 0.038
Identities = 17/74 (22%), Positives = 28/74 (37%), Gaps = 17/74 (22%)

Query: 159 AFIDLKN-------GRIDGVFGDTAVVNEWLKTNPQLGVATEKVTD----------PQYF 201
+FI LK GR++ V DT +N W + LGV + P++
Sbjct: 96 SFIGLKGGFGKLRVGRLNSVLKDTGDINPWDSKSDYLGVNKIAEPEARLISVRYDSPEFA 155

Query: 202 GTGLGIAVRPDNKA 215
G + ++ A
Sbjct: 156 GLSGSVQYALNDNA 169


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0798PF05272300.010 Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.0 bits (67), Expect = 0.010
Identities = 9/18 (50%), Positives = 12/18 (66%)

Query: 31 LVLLGPSGAGKSSLLRVL 48
+VL G G GKS+L+ L
Sbjct: 599 VVLEGTGGIGKSTLINTL 616


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0800ECOLIPORIN300.010 E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 29.9 bits (67), Expect = 0.010
Identities = 21/54 (38%), Positives = 27/54 (50%), Gaps = 9/54 (16%)

Query: 2 RRVFWLVAAALLLAGCAGEKGIVEKEGYQLDTRRQAQAAYPRIKVLVIHYTADD 55
R+V LV ALL AG A I K+G +LD Y ++ L HY +DD
Sbjct: 3 RKVLALVIPALLAAGAAHAAEIYNKDGNKLDL-------YGKVDGL--HYFSDD 47


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0801NUCEPIMERASE784e-18 Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 77.5 bits (191), Expect = 4e-18
Identities = 70/363 (19%), Positives = 124/363 (34%), Gaps = 65/363 (17%)

Query: 13 MKVLVTGATSGLGRNAVEFLCQKGISVRA---------TGRNEAMGKLLEKMGAEFVPAD 63
MK LVTGA +G + + L + G V +A +LL + G +F D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 64 LTELVSSQAKVMLAGIDTLWHCS-------SFTSPWGTQQAFDLANVRATRRLGEWAVAW 116
L + + ++ S +P A+ +N+ + E
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPH----AYADSNLTGFLNILEGCRHN 116

Query: 117 GVRNFIHISSPSLYFDYHHHRDIKEDFRPHRFANEFARSKAASEEVINMFSQANPQTRFT 176
+++ ++ SS S+Y + D + +A +K A+E + + +S T
Sbjct: 117 KIQHLLYASSSSVYGL-NRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSH-LYGLPAT 174

Query: 177 ILRPQSLFGPHDK--VFIPRLAHMMHHYGSILLPHGGSALVDMTYYENAVHAMWLASQEA 234
LR +++GP + + + + M SI + + G D TY ++ A+
Sbjct: 175 GLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVI 234

Query: 235 CDKLPS--------------GRVYNITNGEHRTLCSIVQKLIDELNIDCRIRSVPYPMLD 280
RVYNI N L +Q L D L I+ + +P D
Sbjct: 235 PHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQPGD 294

Query: 281 MIARSMERLGRKSAKEPPLTHYGASKLNFDFTLDITRAQEELGYQPVITLDEGIEKTAAW 340
+ T D E +G+ P T+ +G++ W
Sbjct: 295 V----------------LETS-----------ADTKALYEVIGFTPETTVKDGVKNFVNW 327

Query: 341 LRD 343
RD
Sbjct: 328 YRD 330


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0802NUCEPIMERASE561e-10 Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 55.6 bits (134), Expect = 1e-10
Identities = 29/125 (23%), Positives = 52/125 (41%), Gaps = 17/125 (13%)

Query: 4 RILVLGASGYIGQHLVRTLSQQGHQILA---------AARHVDRLAKLQLANVSCHKVDL 54
+ LV GA+G+IG H+ + L + GHQ++ + RL L HK+DL
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 55 SWPDNLPALLQD--IDTVYFLVH------SMGEGGDFIAQERQVALNVRDALREVPVKQL 106
+ + + L + V+ H S+ + LN+ + R ++ L
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121

Query: 107 IFLSS 111
++ SS
Sbjct: 122 LYASS 126


70SBO_0896SBO_0912N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_0896-3112.037462chaperone
SBO_0897-2122.314395chaperonin
SBO_0898-2142.966929hypothetical protein
SBO_0899-2153.139805hypothetical protein
SBO_0900-2153.185694multidrug efflux system subunit MdtA
SBO_0901-1162.962821multidrug efflux system subunit MdtB
SBO_0902-1120.903799multidrug efflux system subunit MdtC
SBO_0903-210-0.292461multidrug efflux system protein MdtE
SBO_0904-211-2.101963signal transduction histidine-protein kinase
SBO_0905-114-3.759064DNA-binding transcriptional regulator BaeR
SBO_0906017-4.513700hypothetical protein
SBO_0907216-3.816899hypothetical protein
SBO_0908215-3.227897hypothetical protein
SBO_0909324-5.500027hypothetical protein
SBO_0911320-3.708284galactitol utilization operon repressor
SBO_0912219-2.811593galactitol-1-phosphate dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0896SHAPEPROTEIN514e-09 Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein

signature.
Length = 347

Score = 50.9 bits (122), Expect = 4e-09
Identities = 33/129 (25%), Positives = 57/129 (44%), Gaps = 20/129 (15%)

Query: 132 AMMLH-IRQQAQAQLPEAITQAVIGRPINFQGLGGDEANAQAQGILERAAKRAGFRDVVF 190
M+ H I+Q + ++ P+ + E A + +A+ AG R+V
Sbjct: 89 KMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQV---ERRA-----IRESAQGAGAREVFL 140

Query: 191 QYEPVAAGLDYEATLQEEKRVLVVDIGGGTTDCSLLLMGPQWRSRLDREASLLGHSGCRI 250
EP+AA + + E +VVDIGGGTT+ +++ + ++ S RI
Sbjct: 141 IEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLN-----------GVVYSSSVRI 189

Query: 251 GGNDLDIAL 259
GG+ D A+
Sbjct: 190 GGDRFDEAI 198



Score = 35.5 bits (82), Expect = 3e-04
Identities = 32/137 (23%), Positives = 57/137 (41%), Gaps = 23/137 (16%)

Query: 332 RLSYRLV---RSAEESKIALSNA--AETRASLPFISDELAT------LISQQGLESALSQ 380
R +Y + +AE K + +A + + LA ++ + AL +
Sbjct: 203 RRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVPRGFTLNSNEILEALQE 262

Query: 381 PLARILEQVQLALDNAQEKPDV--------IYLTGGSARSPLIKKALAEQLPGIPIAGGD 432
PL I+ V +AL+ Q P++ + LTGG A + + L E+ GIP+ +
Sbjct: 263 PLTGIVSAVMVALE--QCPPELASDISERGMVLTGGGALLRNLDRLLMEET-GIPVVVAE 319

Query: 433 D-FGSVTAGLARWAEVV 448
D V G + E++
Sbjct: 320 DPLTCVARGGGKALEMI 336


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0900RTXTOXIND493e-08 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 48.7 bits (116), Expect = 3e-08
Identities = 46/369 (12%), Positives = 105/369 (28%), Gaps = 87/369 (23%)

Query: 4 SYKSRWVIVIVVVIAAIAAFWFWQGRNDSRSAAPG-----ATKQAQQSPAGGRRG---MR 55
S + R V ++ IA G+ + + A G + + ++
Sbjct: 54 SRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVK 113

Query: 56 SG-------PLA---PVQAATAVEQAVPRYLTGLGTIIAANTVTVRSRVDG--QLMALHF 103
G L + A + L ++ ++ +L
Sbjct: 114 EGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDE 173

Query: 104 QEGQQVKAGDLLAEI------------DPSQFKVALAQTQGQLA-------KDKATLANA 144
Q V ++L Q ++ L + + + + +
Sbjct: 174 PYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVE 233

Query: 145 RRDLARYQQLAKTNLVSRQELDAQQALVSETEGTIKADEASVA----------------- 187
+ L + L +++ + Q+ E ++ ++ +
Sbjct: 234 KSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVT 293

Query: 188 --------------------------SAQLQLDWSRITAPVDGRV-GLKQVDVGNQISSG 220
+ + S I APV +V LK G +++
Sbjct: 294 QLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTA 353

Query: 221 DTTGIVVITQTHPIDLVFTLPESDIATVVQAQKAGKPLVVEAWDRTNSKKL-SEGTLLSL 279
+T +V++ + +++ + DI + Q A + VEA+ T L + ++L
Sbjct: 354 ETL-MVIVPEDDTLEVTALVQNKDIGFINVGQNA--IIKVEAFPYTRYGYLVGKVKNINL 410

Query: 280 DNQIDATTG 288
D D G
Sbjct: 411 DAIEDQRLG 419


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0901ACRIFLAVINRP9140.0 Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 914 bits (2364), Expect = 0.0
Identities = 299/1036 (28%), Positives = 514/1036 (49%), Gaps = 29/1036 (2%)

Query: 13 SRLFIMRPVATTLLMVAILLAGIIGYRALPVSALPEVDYPTIQVVTLYPGASPDVMTSAV 72
+ FI RP+ +L + +++AG + LPV+ P + P + V YPGA + V
Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61

Query: 73 TAPLERQFGQMSGLKQMSSQS-SGGASVITLQFQLTLPLDVAEQEVQAAINAATNLLPSD 131
T +E+ + L MSS S S G+ ITL FQ D+A+ +VQ + AT LLP +
Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121

Query: 132 LPNPPVYSKVNPADPPIMTLAVTSTAMPMTQVE--DMVETRVAQKISQISGVGLVTLSGG 189
+ + S + +M S TQ + D V + V +S+++GVG V L G
Sbjct: 122 VQQQGI-SVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 190 QRPAVRVKLNAQAIAALGLTSETVRTAITGANVNSAKGSLDGP------SRAVTLSANDQ 243
Q A+R+ L+A + LT V + N A G L G ++ A +
Sbjct: 181 QY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 244 MQSAEEYRQLII-AYQNGAPIRLGDVATVEQGAENSWLGAWANKEQAIVMNVQRQPGANI 302
++ EE+ ++ + +G+ +RL DVA VE G EN + A N + A + ++ GAN
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 303 ISTADSIRQMLPQLTESLPKSVKVTVLSDRTTNIRASVDDTQFELMMAIALVVMIIYLFL 362
+ TA +I+ L +L P+ +KV D T ++ S+ + L AI LV +++YLFL
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 363 RNIPATIIPGVAVPLSLIGTFAVMVFLDFSINNLTLMALTIATGFVVDDAIVVIENISRY 422
+N+ AT+IP +AVP+ L+GTFA++ +SIN LT+ + +A G +VDDAIVV+EN+ R
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 423 I-EKGEKPLAAALKGAGEIGFTIISLIFSLIAVLIPLLFMGDIVGRLFREFAITLAVAIL 481
+ E P A K +I ++ + L AV IP+ F G G ++R+F+IT+ A+
Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479

Query: 482 ISAVVSLTLTPMMCARML---SQESLRKQNRFSRASEKMFDRIIAAYGRGLAKVLNHPWL 538
+S +V+L LTP +CA +L S E + F FD + Y + K+L
Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 539 TLSVALSTLLLSVLLWVFIPKGFFPVQDNGIIQGTLQAPQSSSFANMAQRQRQVADVILQ 598
L + + V+L++ +P F P +D G+ +Q P ++ + QV D L+
Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 599 DPA--VQSLTSFVGVDGTNPSLNSARLQINLKPLDERDDR---VQKVIARLQTAVDKVPG 653
+ V+S+ + G + + N+ ++LKP +ER+ + VI R + + K+
Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIR- 658

Query: 654 VDLFLQPTQDLTIDTQVSRTQYQFTLQ---ATSLDALSTWVPQLMEKLQQLP-QLSDVSS 709
D F+ P I + T + F L DAL+ QL+ Q P L V
Sbjct: 659 -DGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRP 717

Query: 710 DWQDKGLVAYVNVDRDSASRLGISMADVDNALYSAFGQRLISTIYTQANQYRVVLEHNTE 769
+ + + VD++ A LG+S++D++ + +A G ++ + ++ ++ + +
Sbjct: 718 NGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAK 777

Query: 770 ITPGLAALDTIRLTSSDGGVVPLSSIAKVEQRFAPLSINHLDQFPVTTISFNVPDNYSLG 829
+D + + S++G +VP S+ + + + P I S G
Sbjct: 778 FRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSG 837

Query: 830 DAVQAIMDTEKTLNLPVDITTQFQGSTLAFQSALGSTVWLIVAAVVAMYIVLGILYESFI 889
DA A+M+ + LP I + G + + + L+ + V +++ L LYES+
Sbjct: 838 DA-MALMENLAS-KLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWS 895

Query: 890 HPITILSTLPTAGVGALLALMIAGSELDVIAIIGIILLIGIVKKNAIMMIDFALAAEREQ 949
P++++ +P VG LLA + + DV ++G++ IG+ KNAI++++FA ++
Sbjct: 896 IPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKE 955

Query: 950 GMSPRDAIYQACLLRFRPILMTTLAALLGALPLMLSTGVGAELRRPLGIGMVGGLIVSQV 1009
G +A A +R RPILMT+LA +LG LPL +S G G+ + +GIG++GG++ + +
Sbjct: 956 GKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATL 1015

Query: 1010 LTLFTTPVIYLLFDRL 1025
L +F PV +++ R
Sbjct: 1016 LAIFFVPVFFVVIRRC 1031


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0902ACRIFLAVINRP9210.0 Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 921 bits (2382), Expect = 0.0
Identities = 289/1035 (27%), Positives = 506/1035 (48%), Gaps = 36/1035 (3%)

Query: 6 LFIYRPVATILLSVAITLCGILGFRMLPVAPLPQVDFPVIMVSASLPGASPETMASSVAT 65
FI RP+ +L++ + + G L LPVA P + P + VSA+ PGA +T+ +V
Sbjct: 4 FFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQ 63

Query: 66 PLERSLGRIAGVSEMTSSS-SLGSTRIILQFDFDRDINGAARDVQAAINAAQSLLPSGMP 124
+E+++ I + M+S+S S GS I L F D + A VQ + A LLP +
Sbjct: 64 VIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQ 123

Query: 125 SRPTYRKANPSDAPIMILTLTSDT--YSQGKLYDFASTQLAPTISQIDGVGDVDVGGSSL 182
+ S + +M+ SD +Q + D+ ++ + T+S+++GVGDV + G+
Sbjct: 124 -QQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY 182

Query: 183 PAVRVGLNPQALFNQGVSLDDVRTAISNANVRKPQG------ALEDGTHRWQIQTNDELK 236
A+R+ L+ L ++ DV + N + G AL I K
Sbjct: 183 -AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241

Query: 237 TAAEYQPLIIHYN-NGGAVRLGDVATVTDSVQDVRNAGMTNAKPAILLMIRKLPEANIIQ 295
E+ + + N +G VRL DVA V ++ N KPA L I+ AN +
Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301

Query: 296 TVDSIRAKLPELQETIPAAIDLQIAQDRSPTIRASLEEVEQTLIISVALVILVVFLFLRS 355
T +I+AKL ELQ P + + D +P ++ S+ EV +TL ++ LV LV++LFL++
Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361

Query: 356 GRATIIPAVAVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVVDDAIVVLENIARHL- 414
RAT+IP +AVPV L+GTFA + G+S+N L++ + +A G +VDDAIVV+EN+ R +
Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421

Query: 415 EAGMKPLQAALQGTREVGFTVLSMSLSLVAVFLPLLLMGGLPGRLLREFAVTLSVAIGIS 474
E + P +A + ++ ++ +++ L AVF+P+ GG G + R+F++T+ A+ +S
Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481

Query: 475 LLVSLTLTPMMCGWMLKASKPREQKRLRGFG----RMLVALQQGYGKSLKWVLNHTRLVG 530
+LV+L LTP +C +LK + GF Y S+ +L T
Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541

Query: 531 VVLLGTIALNIWLYISIPKTFFPEQDTGVLMGGIQADQSISFQ----AMRGKLQDFMKII 586
++ +A + L++ +P +F PE+D GV + IQ + + + ++K
Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601

Query: 587 RD-DPAVDNVTGFT-GGSRVNSGMMFITLKPRDERS---ETAQQIIDRLRVKLAKEPGAN 641
+ +V V GF+ G N+GM F++LKP +ER+ +A+ +I R +++L K
Sbjct: 602 KANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGF 661

Query: 642 LFLMAVQDIRVGGRQSNASYQYTLLSDDLAALREWEPKIRKKLATL-----PELADVNSD 696
+ + I G + ++ L D + + R +L + L V +
Sbjct: 662 VIPFNMPAIVELGTATGFDFE---LIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPN 718

Query: 697 QQDNGAEMNLVYDRDTMARLGIDVQAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRY 756
++ A+ L D++ LG+ + N ++ A G ++ K+ ++ D ++
Sbjct: 719 GLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKF 778

Query: 757 TQDISALEKMFVINNEGKAIPLSYFAKWQPANAPLSVNHQGLSAASTISFNLPTGKSLSD 816
++K++V + G+ +P S F + + I G S D
Sbjct: 779 RMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGD 838

Query: 817 ASAAIDRAMTQLGVPSTVRGSFAGTAQVFQETMNSQVILIIAAIATVYIVLGILYESYVH 876
A A ++ ++L P+ + + G + + + N L+ + V++ L LYES+
Sbjct: 839 AMALMENLASKL--PAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSI 896

Query: 877 PLTILSTLPSAGVGALLALELFNAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRHGN 936
P++++ +P VG LLA LFN + ++G++ IG+ KNAI++V+FA +
Sbjct: 897 PVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEG 956

Query: 937 LTPQEAIFQACLLRFRPIMMTTLAALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQLL 996
EA A +R RPI+MT+LA + G LPL +S G GS + +GI ++GG+V + LL
Sbjct: 957 KGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLL 1016

Query: 997 TLYTTPVVYLFFDRL 1011
++ PV ++ R
Sbjct: 1017 AIFFVPVFFVVIRRC 1031



Score = 79.9 bits (197), Expect = 3e-17
Identities = 77/446 (17%), Positives = 162/446 (36%), Gaps = 26/446 (5%)

Query: 592 VDNVTGFTGGS-RVNSGMMFITLKPRDERSETAQQIIDRLRVKLAKEPGANLFLMAVQDI 650
+DN+ + S S + +T + + Q+ ++L++ P + Q I
Sbjct: 72 IDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE----VQQQGI 127

Query: 651 RVGGRQSNASYQYTLLSDDLAALREW-----EPKIRKKLATLPELADVNSDQQDNGAE-- 703
V S+ +SD+ ++ ++ L+ L + DV GA+
Sbjct: 128 SVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQL----FGAQYA 183

Query: 704 MNLVYDRDTMARLGID----VQAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRYTQD 759
M + D D + + + + + + T P Q + R+
Sbjct: 184 MRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKNP 243

Query: 760 ISALEKMFVINNEGKAIPLSYFAK--WQPANAPLSVNHQGLSAASTISFNLPTGKSLSDA 817
+ +N++G + L A+ N + G AA +L D
Sbjct: 244 EEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL-DT 302

Query: 818 SAAIDRAMTQL--GVPSTVRGSFA-GTAQVFQETMNSQVILIIAAIATVYIVLGILYESY 874
+ AI + +L P ++ + T Q +++ V + AI V++V+ + ++
Sbjct: 303 AKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNM 362

Query: 875 VHPLTILSTLPSAGVGALLALELFNAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRH 934
L +P +G L F + + + G++L IG++ +AI++V+
Sbjct: 363 RATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMME 422

Query: 935 GNLTPQEAIFQACLLRFRPIMMTTLAALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQ 994
L P+EA ++ ++ + +P+ GG + + ITIV + +S
Sbjct: 423 DKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSV 482

Query: 995 LLTLYTTPVVYLFFDRLRLRFSRKPK 1020
L+ L TP + + + K
Sbjct: 483 LVALILTPALCATLLKPVSAEHHENK 508


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0903TCRTETB1221e-32 Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 122 bits (308), Expect = 1e-32
Identities = 97/435 (22%), Positives = 190/435 (43%), Gaps = 25/435 (5%)

Query: 20 FMQSLDTTIVNTALPSMAQSLGESPLHMHMVIVSYVLTVAVMLPASGWLADKVGVRNIFF 79
F L+ ++N +LP +A + P + V +++LT ++ G L+D++G++ +
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 80 TAIVLFTLGSLFCALSGTLNELL-LARALQGVGGAMMVPVGRLTVMKIVPREQYMAAMTF 138
I++ GS+ + + LL +AR +QG G A + + V + +P+E A
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 139 VTLPGQVGPLLGPALGGLLVEYASWHWIFLINIPVGIIGAIATLL-LMPNYTMQTWRFDL 197
+ +G +GPA+GG++ Y HW +L+ IP+ I + L+ L+ FD+
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHY--IHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDI 201

Query: 198 SGFLLLAVGMAVLTLALDGSKGTGLSPLAIAGLVAVGVVALVLYLLHARNNNRALFSLKL 257
G +L++VG+ L + + V V++ ++++ H R L
Sbjct: 202 KGIILMSVGIVFFMLF---------TTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGL 252

Query: 258 FRTRTFSLGLAGSFAGRIGSGMLPFMTPVFLQIGLGFSPFHAG-LMMIPMVLGSMGMKRI 316
+ F +G+ M P ++ S G +++ P + + I
Sbjct: 253 GKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYI 312

Query: 317 VVQVVNRFGYRRVLVATTLGLSLVTLLFMTTALL----GWYYVLPFVLFLQGMVNSTRFS 372
+V+R G VL +G++ +++ F+T + L W+ + V L G+ S +
Sbjct: 313 GGILVDRRGPLYVL---NIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGL--SFTKT 367

Query: 373 SMNTLTLKDLPDNLASSGNSLLSMIMQLSMSIGVTIAGLLLGLFGSQHVSVDSGTTQTVF 432
++T+ L A +G SLL+ LS G+ I G LL + + Q+ +
Sbjct: 368 VISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQRLLPMEVDQSTY 427

Query: 433 MYT--WLSMALIIAL 445
+Y+ L + II +
Sbjct: 428 LYSNLLLLFSGIIVI 442


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0904BCTERIALGSPF310.009 Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F

signature.
Length = 408

Score = 31.0 bits (70), Expect = 0.009
Identities = 27/95 (28%), Positives = 35/95 (36%), Gaps = 20/95 (21%)

Query: 164 RQTSWLIVALATLLAALATFLLA------RGLLAPVKRLVDGTHKLAAGDFTTRVTPTSE 217
RQ + L+ A L AL L+A V+ V H LA + P S
Sbjct: 75 RQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLAD---AMKCFPGSF 131

Query: 218 DEL-----------GKLAQDFNQLASTLEKNQQMR 241
+ L G L N+LA E+ QQMR
Sbjct: 132 ERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMR 166


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0905HTHFIS766e-18 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 75.6 bits (186), Expect = 6e-18
Identities = 28/136 (20%), Positives = 65/136 (47%), Gaps = 1/136 (0%)

Query: 11 PRILIVEDEPKLGQLLIDYLRAASYAPTLISHGDQVLPYVRQTPPDLILLDLMLPGTDGL 70
IL+ +D+ + +L L A Y + S+ + ++ DL++ D+++P +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 71 TLCREIR-RFSDIPIVMVTAKIEEIDRLLGLEIGADDYICKPYSPREVVARVKTILRRCK 129
L I+ D+P+++++A+ + + E GA DY+ KP+ E++ + L K
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 130 PQRELQQQDAESPLII 145
+ + D++ + +
Sbjct: 124 RRPSKLEDDSQDGMPL 139


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0909LIPOLPP20260.024 LPP20 lipoprotein precursor signature.
		>LIPOLPP20#LPP20 lipoprotein precursor signature.

Length = 175

Score = 26.3 bits (57), Expect = 0.024
Identities = 13/38 (34%), Positives = 24/38 (63%), Gaps = 1/38 (2%)

Query: 3 KGEMKKIAAISLISIFIMSGCAVHNDETSIGKFGLAYK 40
K ++KKI +S+++ ++ GC+ H ++ I K AYK
Sbjct: 2 KNQVKKILGMSVVAAMVIVGCS-HAPKSGISKSNKAYK 38


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0912DHBDHDRGNASE353e-04 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 35.0 bits (80), Expect = 3e-04
Identities = 22/92 (23%), Positives = 36/92 (39%), Gaps = 2/92 (2%)

Query: 156 AQGCENKNVIIIGAGT-IGLLAIQCAVALGAKSVTAIDISSEKLALAKSFGSMQTFNSSE 214
A+G E K I GA IG + + GA + A+D + EKL S + ++
Sbjct: 3 AKGIEGKIAFITGAAQGIGEAVARTLASQGAH-IAAVDYNPEKLEKVVSSLKAEARHAEA 61

Query: 215 MSAPQMQSVLRELRFNQLILETAGVPQTVELA 246
A S + ++ E + V +A
Sbjct: 62 FPADVRDSAAIDEITARIEREMGPIDILVNVA 93


71SBO_0946SBO_0952N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_0946-215-3.722692hypothetical protein
SBO_0947-217-4.263005hypothetical protein
SBO_0948121-4.358576two-component response-regulatory protein YehT
SBO_0949221-4.0024062-component sensor protein
SBO_0950628-3.283939DNA damage-inducible protein
SBO_0951528-2.984612hypothetical protein
SBO_0952327-3.239714hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0946BINARYTOXINA270.030 Clostridial binary toxin A signature.
		>BINARYTOXINA#Clostridial binary toxin A signature.

Length = 454

Score = 27.3 bits (60), Expect = 0.030
Identities = 17/65 (26%), Positives = 30/65 (46%), Gaps = 3/65 (4%)

Query: 29 DKEESKKFSANLNGTEIAITYVYKGDKV-LKQSSETKIQFAS--ISATTKEDAAKTLEPL 85
+K+E+++ NL+ E +YK D + S+T+ F I + +E K L
Sbjct: 61 EKKEAERVEKNLDTLEKEALELYKKDSEQISNYSQTRQYFYDYQIESNPREKEYKNLRNA 120

Query: 86 SAKYK 90
+K K
Sbjct: 121 ISKNK 125


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0948HTHFIS712e-16 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 71.4 bits (175), Expect = 2e-16
Identities = 41/177 (23%), Positives = 77/177 (43%), Gaps = 12/177 (6%)

Query: 7 IKVLIVDDEPLARENLRVFLQEQSDIEIVGECSNAVEGIGAVHKLRPDVLFLDIQMPRIS 66
+L+ DD+ R L L ++ ++ SNA + D++ D+ MP +
Sbjct: 4 ATILVADDDAAIRTVLNQAL-SRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 67 GLEMVGMLDPEHRPYI--VFLTAFD--EYAIKAFEEHAFDYLLKPIDEARLEKTLARLRQ 122
+++ + + RP + + ++A + AIKA E+ A+DYL KP D L + R
Sbjct: 62 AFDLLPRIK-KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 123 ERSKQDVSLLPENQQALKFIPCTGHSRIYLLQMKDVAFVSSRMSGVYVT--SHEGKE 177
E ++ L ++Q + + G S + +A + + +T S GKE
Sbjct: 121 EPKRRPSKLEDDSQDGMPLV---GRSAAMQEIYRVLARLMQTDLTLMITGESGTGKE 174


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0949PF065802207e-69 Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 220 bits (561), Expect = 7e-69
Identities = 63/216 (29%), Positives = 115/216 (53%), Gaps = 3/216 (1%)

Query: 343 LGEGIAQLLSAQILAGQYERQKAMLTQSEIKLLHAQVNPHFLFNALNTIKAVIRRDSEQA 402
L G + + + +M ++++ L AQ+NPHF+FNALN I+A+I D +A
Sbjct: 134 LYFGWHFFKNYKQAEIDQWKMASMAQEAQLMALKAQINPHFMFNALNNIRALILEDPTKA 193

Query: 403 SQLVQYLSTFFRKNLKR-PSEFVTLADEIEHVNAYLQIEKARFQSRLQVNIAIPQELSQQ 461
+++ LS R +L+ + V+LADE+ V++YLQ+ +F+ RLQ I +
Sbjct: 194 REMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDV 253

Query: 462 QLPAFTLQPIVENAIKHGTSQLLDTGRVAISARREGQHLMLEIEDNAGL-YQPVTNASGL 520
Q+P +Q +VEN IKHG +QL G++ + ++ + LE+E+ L + ++G
Sbjct: 254 QVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGT 313

Query: 521 GMNLVDKRLRERFGDDYGISVACEPDSYTRITLQLP 556
G+ V +RL+ +G + I ++ + + +P
Sbjct: 314 GLQNVRERLQMLYGTEAQIKLSEKQGKVN-AMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_0952LUXSPROTEIN310.001 Bacterial autoinducer-2 (AI-2) production protein Lu...
		>LUXSPROTEIN#Bacterial autoinducer-2 (AI-2) production protein LuxS

signature.
Length = 171

Score = 31.4 bits (71), Expect = 0.001
Identities = 18/66 (27%), Positives = 30/66 (45%), Gaps = 7/66 (10%)

Query: 37 TKEHLLPHFL-EHLGNNHLDI------GVGTGFYLTHVPESSLISLMDLNEASLNAASTR 89
T EHL F+ HL + ++I G TGFY++ + S + D A++
Sbjct: 54 TLEHLYAGFMRNHLNGDSVEIIDISPMGCRTGFYMSLIGTPSEQQVADAWIAAMEDVLKV 113

Query: 90 AGESKI 95
++KI
Sbjct: 114 ENQNKI 119


72SBO_1057SBO_1083N        Y        YPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_1057018-0.167743flagellar biosynthesis protein FliR
SBO_1058-1221.163171flagellar biosynthesis protein FliQ
SBO_1060-3201.559323flagellar biosynthesis protein FliO
SBO_1061-3151.704534flagellar motor switch protein FliN
SBO_1062-3151.694162flagellar motor switch protein FliM
SBO_1063-2161.763284flagellar basal body-associated protein FliL
SBO_1065-1151.684402flagellar biosynthesis chaperone
SBO_1067-1161.233528IS911 ORF2
SBO_1070-1150.719276flagellar MS-ring protein
SBO_1071019-0.555782flagellar hook-basal body protein FliE
SBO_10721150.068017IS1 encoded protein
SBO_10730130.609651IS1 ORF2
SBO_1074011-0.282847virulence protein
SBO_1075114-0.390799hypothetical protein
SBO_1076-116-0.271958hypothetical protein
SBO_1077-112-0.770145inner membrane protein
SBO_1078111-1.084823hypothetical protein
SBO_1079011-1.863082alpha-amylase
SBO_1081313-1.795370flagellar protein FliS
SBO_1082112-1.170078flagellar capping protein
SBO_1083015-1.139461flagellin
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1057TYPE3IMRPROT2003e-66 Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein

family signature.
Length = 261

Score = 200 bits (511), Expect = 3e-66
Identities = 255/261 (97%), Positives = 259/261 (99%)

Query: 1 MMQVTSDQWLSWLSLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFVIAPSLPA 60
M+QVTS+QWLSWL+LYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITF IAPSLPA
Sbjct: 1 MLQVTSEQWLSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA 60

Query: 61 NDVPVFSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHL 120
NDVPVFSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHL
Sbjct: 61 NDVPVFSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHL 120

Query: 121 NMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIF 180
NMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIF
Sbjct: 121 NMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIF 180

Query: 181 LNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFC 240
LNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFC
Sbjct: 181 LNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFC 240

Query: 241 EHLFSEMFNLLTDIISELPLI 261
EHLFSE+FNLL DIISELPLI
Sbjct: 241 EHLFSEIFNLLADIISELPLI 261


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1058TYPE3IMQPROT671e-18 Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein

family signature.
Length = 86

Score = 67.1 bits (164), Expect = 1e-18
Identities = 22/78 (28%), Positives = 42/78 (53%)

Query: 4 ESVMMMGTEAMKVALALAAPLLLVALVTGLIISILQAATQINEMTLSFIPKIIAVFIAII 63
+ ++ G +A+ + L L+ +VA + GL++ + Q TQ+ E TL F K++ V + +
Sbjct: 2 DDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLF 61

Query: 64 IAGPWMLNLLLDYVRTLF 81
+ W +LL Y R +
Sbjct: 62 LLSGWYGEVLLSYGRQVI 79


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1061FLGMOTORFLIN2121e-74 Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 212 bits (542), Expect = 1e-74
Identities = 124/137 (90%), Positives = 133/137 (97%)

Query: 1 MSDMNNPADDNNGAMDDLWAEALSEQKSTSSKSAADAVFQQFGGGDVSGTLQDIDLIMDI 60
MSDMNNP+D+N GA+DDLWA+AL+EQK+T++KSAADAVFQQ GGGDVSG +QDIDLIMDI
Sbjct: 1 MSDMNNPSDENTGALDDLWADALNEQKATTTKSAADAVFQQLGGGDVSGAMQDIDLIMDI 60

Query: 61 PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGV 120
PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGV
Sbjct: 61 PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGV 120

Query: 121 RITDIITPSERMHRLSR 137
RITDIITPSERM RLSR
Sbjct: 121 RITDIITPSERMRRLSR 137


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1062FLGMOTORFLIM381e-135 Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 381 bits (979), Expect = e-135
Identities = 85/324 (26%), Positives = 148/324 (45%), Gaps = 10/324 (3%)

Query: 5 ILSQAEIDALLNGDS--EVKDEPTASVSGESDIRPYDPNTQRRVVRERLQALEIINERFA 62
+LSQ EID LL S + E +S I YD + +E+++ L +++E FA
Sbjct: 4 VLSQDEIDQLLTAISSGDASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLMHETFA 63

Query: 63 RHFRMGLFNLLRRSPDITVGAIRIQPYHEFARNLPVPTNLNLIHLKPLRGTGLVVFSPSL 122
R L LR + V ++ Y EF R++P P+ L +I + PL+G ++ PS+
Sbjct: 64 RLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVLEVDPSI 123

Query: 123 VFIAVDNLFGGDGRFPTKVEGREFTHTEQRVINRMLKLALEGYSDAWKAINPLEVEYVRS 182
F +D LFGG G+ KV+ R+ T E V+ ++ L ++W + L +
Sbjct: 124 TFSIIDRLFGGTGQ-AAKVQ-RDLTDIENSVMEGVIVRILANVRESWTQVIDLRPRLGQI 181

Query: 183 EMQVKFTNITTSPNDIVVNTPFHVEIGNLTGEFNICLPFSMIEPLRELLVNPPLENS--R 240
E +F I P+++VV ++G G N C+P+ IEP+ L + +S R
Sbjct: 182 ETNPQFAQI-VPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFSSVRR 240

Query: 241 NEDQNWRDNLVRQVQHSQLELVANFADISLRLSQILKLKPGDVLPIEKP---DRIIAHVD 297
+ + L ++ +++VA + L + IL L+ GD++ + D + +
Sbjct: 241 SSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFVLSIG 300

Query: 298 GVPVLTSQYGTLNGQYALRIEHLI 321
Q G + + A +I I
Sbjct: 301 NRKKFLCQPGVVGKKIAAQILERI 324


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1065FLGFLIJ2031e-70 Flagellar FliJ protein signature.
		>FLGFLIJ#Flagellar FliJ protein signature.

Length = 147

Score = 203 bits (517), Expect = 1e-70
Identities = 145/147 (98%), Positives = 146/147 (99%)

Query: 1 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG 60
MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG
Sbjct: 1 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG 60

Query: 61 MTSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERLST 120
+TSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQER ST
Sbjct: 61 ITSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQST 120

Query: 121 AALLAENRLDQKKMDEFAQRAAMRKPE 147
AALLAENRLDQKKMDEFAQRAAMRKPE
Sbjct: 121 AALLAENRLDQKKMDEFAQRAAMRKPE 147


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1070FLGMRINGFLIF7520.0 Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 752 bits (1944), Expect = 0.0
Identities = 477/555 (85%), Positives = 513/555 (92%), Gaps = 5/555 (0%)

Query: 3 ATAAQTKSLEWLNRLRANPKIPLIVAGSAAVAVMVALILWAKAPDYRTLFSNLSDQDGGA 62
+TA Q K LEWLNRLRANP+IPLIVAGSAAVA++VA++LWAK PDYRTLFSNLSDQDGGA
Sbjct: 5 STATQPKPLEWLNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGA 64

Query: 63 IVSQLTQMNIPYRFSEASGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQ 122
IV+QLTQMNIPYRF+ SGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQ
Sbjct: 65 IVAQLTQMNIPYRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQ 124

Query: 123 FSEQVNYQRALEGELSRTIETIGPVKGARVHLAMPKPSLFVREQKSPSASVTVNLLPGRA 182
FSEQVNYQRALEGEL+RTIET+GPVK ARVHLAMPKPSLFVREQKSPSASVTV L PGRA
Sbjct: 125 FSEQVNYQRALEGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRA 184

Query: 183 LDEGQISAIVHLVSSAVAGLPPGNVTLVDQGGHLLTLSNTSGRDLNDAQLKYASDVEGRI 242
LDEGQISA+VHLVSSAVAGLPPGNVTLVDQ GHLLT SNTSGRDLNDAQLK+A+DVE RI
Sbjct: 185 LDEGQISAVVHLVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDLNDAQLKFANDVESRI 244

Query: 243 QRRIEAILSPIVGNGNIHAQVTAQLDFASKEQTEEQYRPNGDESHAALRSRQLNESEQSG 302
QRRIEAILSPIVGNGN+HAQVTAQLDFA+KEQTEE Y PNGD S A LRSRQLN SEQ G
Sbjct: 245 QRRIEAILSPIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVG 304

Query: 303 SGYPGGVPGALSNQPAPANNAPISTPPANQNNRQQ--QASTTSNS---GPRSTQRNETSN 357
+GYPGGVPGALSNQPAP N API+TPP NQ N Q Q ST++NS GPRSTQRNETSN
Sbjct: 305 AGYPGGVPGALSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSN 364

Query: 358 YEVDRTIRHTKMNVGDVQRLSVAVVVNYKTLPDGKPLPLSNEQMKQIEDLTREAMGFSEK 417
YEVDRTIRHTKMNVGD++RLSVAVVVNYKTL DGKPLPL+ +QMKQIEDLTREAMGFS+K
Sbjct: 365 YEVDRTIRHTKMNVGDIERLSVAVVVNYKTLADGKPLPLTADQMKQIEDLTREAMGFSDK 424

Query: 418 RGDSLNVVNSPFNSSDESGGELPFWQQQAFIDQLLAAGRWLLVLLVAWLLWRKAVRPQLT 477
RGD+LNVVNSPF++ D +GGELPFWQQQ+FIDQLLAAGRWLLVL+VAW+LWRKAVRPQLT
Sbjct: 425 RGDTLNVVNSPFSAVDNTGGELPFWQQQSFIDQLLAAGRWLLVLVVAWILWRKAVRPQLT 484

Query: 478 SRAEAMKAVQQQAQAREEVEDAVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSDNDPR 537
R E KA Q+QAQ R+E E+AVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSDNDPR
Sbjct: 485 RRVEEAKAAQEQAQVRQETEEAVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSDNDPR 544

Query: 538 VVALVIRQWINNDHE 552
VVALVIRQW++NDHE
Sbjct: 545 VVALVIRQWMSNDHE 559


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1071FLGHOOKFLIE1183e-38 Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE

signature.
Length = 103

Score = 118 bits (296), Expect = 3e-38
Identities = 102/103 (99%), Positives = 103/103 (100%)

Query: 2 SAIQGIEGVISQLQATAMSARAQDSLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTL 61
SAIQGIEGVISQLQATAMSARAQ+SLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTL
Sbjct: 1 SAIQGIEGVISQLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTL 60

Query: 62 GEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 104
GEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV
Sbjct: 61 GEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 103


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1076PF01206912e-28 SirA family protein
		>PF01206#SirA family protein

Length = 76

Score = 91.0 bits (226), Expect = 2e-28
Identities = 15/71 (21%), Positives = 36/71 (50%)

Query: 7 DYRLDMVGEPCPYPAVATLEAMPQLKKGEILEVVSDCPQSINNIPLDARNHGYTVRDIQQ 66
D LD G CP P + + + + GE+L V++ P S+ + ++ G+ + + ++
Sbjct: 5 DQSLDATGLNCPLPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHELLEQKE 64

Query: 67 DGPTIRYLIQK 77
+ T + +++
Sbjct: 65 EDGTYHFRLKR 75


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1077RTXTOXIND300.017 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 30.2 bits (68), Expect = 0.017
Identities = 10/57 (17%), Positives = 17/57 (29%), Gaps = 2/57 (3%)

Query: 164 RFTLLPIFRIPVKMQKVSAASPLTQKPDQARRRF--RLGMLVFFGMLGWALLTAMNQ 218
R L R + + + A L + P R R M ++L +
Sbjct: 26 RKQLDTPVREKDENEFLPAHLELIETPVSRRPRLVAYFIMGFLVIAFILSVLGQVEI 82


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1082TYPE3OMBPROT330.003 Type III secretion system outer membrane B protein ...
		>TYPE3OMBPROT#Type III secretion system outer membrane B protein

family signature.
Length = 538

Score = 32.7 bits (74), Expect = 0.003
Identities = 24/72 (33%), Positives = 37/72 (51%), Gaps = 2/72 (2%)

Query: 214 NGMEVSVAAKNAQLTVNNVAIENSSNTISDALENITLNLNDVTTGNQTLTITQDTSKAQT 273
N E +VAA+N + + A+ + +S AL T++L V+T LT T T ++
Sbjct: 236 NSSERAVAARNKAEELVSAALYSRPELLSQALSGKTVDLKIVSTS--LLTPTSLTGGEES 293

Query: 274 AIKDWVNAYNSL 285
+KD VNA L
Sbjct: 294 MLKDQVNALKGL 305


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1083FLAGELLIN2039e-61 Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 203 bits (517), Expect = 9e-61
Identities = 252/567 (44%), Positives = 315/567 (55%), Gaps = 61/567 (10%)

Query: 2 AQVINTNSLSLITQNNINKNQSALSSSIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 61
AQVINTNSLSL+TQNN+NK+QS+LSS+IERLSSGLRINSAKDDAAGQAIANRFTSNIKGL
Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60

Query: 62 TQAARNANDGISVAQTTEGALSEINNNLQRILELTVQASTGTNSDSDLDSIQDEIKSRLD 121
TQA+RNANDGIS+AQTTEGAL+EINNNLQR+ EL+VQA+ GTNSDSDL SIQDEI+ RL+
Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120

Query: 122 EIDRVSGQTQFNGVNVLAKDGSMKIQVGANDGQTITIDLKKIDSDTLGLNGFNVNGKGTI 181
EIDRVS QTQFNGV VL++D MKIQVGANDG+TITIDL+KID +LGL+GFNVNG
Sbjct: 121 EIDRVSNQTQFNGVKVLSQDNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNG---- 176

Query: 182 ANKAATVSDLTAAGATGTGPYAVTTNNTALSASDALSRLKTGDTVTTTGSSAAIYTYDAA 241
+ + ++ T + T TT + +AA
Sbjct: 177 ----PKEATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAA 232

Query: 242 KGNFTTQATVADGDVVNFANTLKPAAGTTASGV-YTRSTGDVKFDVDANGDVTIGGKAAY 300
G TT + V F T A A + G D G
Sbjct: 233 NGQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTG 292

Query: 301 LDATGNLSTNNAGIASSAKLSDLFASGSTLATTGSIQLSGTTYNFGAAATSGVTYTKTVS 360
D G +ST G + ++D+ +
Sbjct: 293 NDGNGKVSTTINGEKVTLTVADI---------------------------------TAGA 319

Query: 361 ADTVLSTVQSAATANTAVTGATIKYNTGIQSATASFGGANTNGAGNSNDTYTDADKELTT 420
A+ +T+QS+ T+V ++ ++ +A N A T
Sbjct: 320 ANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESKITVN------ 373

Query: 421 TASYTINYNVDKDTGTVTVASNGAGATGKFAATVGAQAYVNSTGKLTTETTSAGTATKDP 480
T + G T + + + + +A +T +P
Sbjct: 374 -------------GAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANP 420

Query: 481 LAALDEAISSIDKFRSSLGAIQNRLDSAVTNLNNTTTNLSEAQSRIQDADYATEVSNMSK 540
LA++D A+S +D RSSLGAIQNR DSA+TNL NT TNL+ A+SRI+DADYATEVSNMSK
Sbjct: 421 LASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSK 480

Query: 541 AQIIQQAGNSVLAKANQVPQQVLSLLQ 567
AQI+QQAG SVLA+ANQVPQ VLSLL+
Sbjct: 481 AQILQQAGTSVLAQANQVPQNVLSLLR 507


73SBO_1843SBO_1846N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_1843-2140.436232nitrite extrusion protein
SBO_1844-117-0.087408nitrate/nitrite sensor protein NarX
SBO_1845-217-1.393883transcriptional regulator NarL
SBO_1846-119-1.493834hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1843ACRIFLAVINRP310.011 Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 31.0 bits (70), Expect = 0.011
Identities = 35/166 (21%), Positives = 60/166 (36%), Gaps = 22/166 (13%)

Query: 258 IMSLLYLATFGSFIGFSAGFAMLSKTQFPDVQILQYAFFGPFIGALARSA---GGALSDR 314
I+S + L+ + I A A L K + + FFG F S ++
Sbjct: 474 IVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKI 533

Query: 315 LGGTRVTLVNFILMAIFSGLLFLTLPTD----GQGGSFMAFFAVFLALFLTAGLGSGSTF 370
LG T L+ + L+ +LFL LP+ G F+ L +G+T
Sbjct: 534 LGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTM----------IQLPAGATQ 583

Query: 371 QMISVIFRKLTMDRVKAEGGSDER-----AMREAATDTAAALGFIS 411
+ + ++T +K E + E + A + F+S
Sbjct: 584 ERTQKVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVS 629


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1844PF06580532e-09 Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 52.6 bits (126), Expect = 2e-09
Identities = 29/123 (23%), Positives = 55/123 (44%), Gaps = 17/123 (13%)

Query: 473 SAKFGFPVKLDYQLPPRL----VPSHQAIHLLQIAREALSNALKH-----SQASEVVVTV 523
S +F ++ + Q+ P + VP L+Q E N +KH Q ++++
Sbjct: 233 SIQFEDRLQFENQINPAIMDVQVPPM----LVQTLVE---NGIKHGIAQLPQGGKILLKG 285

Query: 524 AQNDNQVKLTVQDNGCGVPENAIRSNHYGMIIMRDRAQSLRG-DCRVRRRESGGTEVVVT 582
+++ V L V++ G +N S G+ +R+R Q L G + +++ E G +
Sbjct: 286 TKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMV 345

Query: 583 FIP 585
IP
Sbjct: 346 LIP 348


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1845HTHFIS742e-17 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 73.7 bits (181), Expect = 2e-17
Identities = 32/117 (27%), Positives = 56/117 (47%), Gaps = 2/117 (1%)

Query: 7 ATILLIDDHPMLRTGVKQLISMAPDITVVGEASNGEQGIELAESLDPDLILLDLNMPGMN 66
ATIL+ DD +RT + Q +S A + SN + D DL++ D+ MP N
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRI--TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 67 GLETLDKLREKSLSGRIVVFSVSNHEEDVVTALKRGADGYLLKDMEPEDLLKALHQA 123
+ L ++++ ++V S N + A ++GA YL K + +L+ + +A
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1846INTIMIN2534e-78 Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 253 bits (647), Expect = 4e-78
Identities = 118/378 (31%), Positives = 195/378 (51%), Gaps = 21/378 (5%)

Query: 32 GEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDN 91
G+ AK ALG + Q + +++WL +G A V+++ N F GS + +P D+
Sbjct: 184 GDYAKDTALGIAGN----QASSQLQAWLQHYGTAEVNLQSGNN--FDGSSLDFLLPFYDS 237

Query: 92 DRYFTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWG 151
++ + Q+G D+ +N+G GQR+ ++GYN F D + R G G E W
Sbjct: 238 EKMLAFGQVGARYIDSRFTANLGAGQRFFLPENMLGYNVFIDQDFSGDNTRLGIGGEYWR 297

Query: 152 EYLRLLANFYQPFAAWHE--QTATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDR 209
+Y + N Y + WHE ++R A G+D+ +P Y L + EQY+GD
Sbjct: 298 DYFKSSVNGYFRMSGWHESYNKKDYDERPANGFDIRFNGYLPSYPALGAKLMYEQYYGDN 357

Query: 210 VDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKK 269
V LFNS NP A ++G+NYTP+PLVT+ ++ G EN + Y+F P +
Sbjct: 358 VALFNSDKLQSNPGAATVGVNYTPIPLVTMGIDYRHGTGNENDLLYSMQFRYQFDKPWSQ 417

Query: 270 QLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQI 329
Q+ V E ++L GSRYD QRNN LEY+++ L++ + + T ++L +
Sbjct: 418 QIEPQYVNELRTLSGSRYDLVQRNNNIILEYKKQDILSLNI-PHDINGTERSTQKIQLIV 476

Query: 330 RSRYGIRQLIWQGDTQILS-----LTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVV 384
+S+YG+ +++W D+ + S G+Q SA+ + I+P + +G SN ++++
Sbjct: 477 KSKYGLDRIVWD-DSALRSQGGQIQHSGSQ--SAQDYQAILPAYV--QGGSNVYKVTARA 531

Query: 385 EDNQGQRVSSNEITLTLV 402
D G SSN + LT+
Sbjct: 532 YDRNGN--SSNNVLLTIT 547


74SBO_1980SBO_1988N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_19800111.516188ribonuclease E
SBO_1981-1101.142771flagellar hook-associated protein FlgL
SBO_19820101.120225flagellar hook-associated protein FlgK
SBO_19842111.689888flagellar basal body P-ring protein
SBO_19853131.546220flagellar basal body L-ring protein
SBO_19862141.566818flagellar basal body rod protein FlgG
SBO_19871151.401934flagellar basal body rod protein FlgF
SBO_19882170.271479flagellar hook protein FlgE
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1980IGASERPTASE675e-13 IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 66.6 bits (162), Expect = 5e-13
Identities = 48/288 (16%), Positives = 85/288 (29%), Gaps = 36/288 (12%)

Query: 513 PSEEEFAERKRPEQPALATFAMPDVPPAPT-PAEPAAPVVAPAPKAAPATPAAPAQPGLL 571
P E+ + DVP P+ E A AP P APATP+
Sbjct: 983 PEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETT----- 1037

Query: 572 SRFFGALKALFSGSEETKPTEQPAPKAEAKPERQQDRRKPRQNNRRDRNERRDTRSER-- 629
ET + Q QN + + + ++
Sbjct: 1038 ---------------ETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQT 1082

Query: 630 TEGSDNREENRRNRRQAQQQTAETRESRQQAEVTEKARTTDEQQAPRRERSRRRNDDKRQ 689
E + + E + + ++TA + + TEK + + + + + + Q
Sbjct: 1083 NEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQ 1142

Query: 690 AQ---QEAKALNVEEQSVQETEQEERVRPVQPRRKQRQLNQKVRYEQSV--AEEAVVAPV 744
A+ + +N++E Q + +P + + Q V +V V P
Sbjct: 1143 AEPARENDPTVNIKEPQSQTNTTADTEQPA--KETSSNVEQPVTESTTVNTGNSVVENPE 1200

Query: 745 VEETAAAEPIVQEAPA------PRTELVKVPLPVVAQTAPEQQEENNA 786
A +P V + R + VP V T A
Sbjct: 1201 NTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVA 1248



Score = 66.2 bits (161), Expect = 7e-13
Identities = 47/261 (18%), Positives = 84/261 (32%), Gaps = 26/261 (9%)

Query: 551 VAPAPKAAPATPAAPAQPGLLSRFFGALKALFSGSEETKPTEQP-APKAEAKPERQQDRR 609
P + S E + E P P A A P
Sbjct: 991 TVDTTNITTPNNIQADVPSVPSN----------NEEIARVDEAPVPPPAPATPSETT--- 1037

Query: 610 KPRQNNRRDRNERRDTRSERTEGSDNREENRRNRRQAQQQTAETRESRQQAEV------T 663
N ++++++ D E +NR A++ + + + Q EV T
Sbjct: 1038 -----ETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSET 1092

Query: 664 EKARTTDEQQAPRRERSRRRNDDKRQAQQEAKALNVEEQSVQETEQEERVRPVQPRRKQR 723
++ +TT+ ++ E+ + + + Q+ K + + QE + + + R
Sbjct: 1093 KETQTTETKETATVEKEEKAKVETEKTQEVPK-VTSQVSPKQEQSETVQPQAEPARENDP 1151

Query: 724 QLNQKVRYEQSVAEEAVVAPVVEETAAAEPIVQEAPAPRTELVKVPLPVVAQTAPEQQEE 783
+N K Q+ P E ++ E V E+ T V P A Q
Sbjct: 1152 TVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTV 1211

Query: 784 NNADNRDNGGMPRRSRRSPRH 804
N+ + RRS RS H
Sbjct: 1212 NSESSNKPKNRHRRSVRSVPH 1232


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1981FLAGELLIN474e-08 Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 47.3 bits (112), Expect = 4e-08
Identities = 31/132 (23%), Positives = 61/132 (46%), Gaps = 3/132 (2%)

Query: 7 MMYQQNMRGITNSQAEWMKYGEQMSTGKRVVNPSDDPIAASQAVVLSQAQAQNSQYTLAR 66
++ Q N+ +S + + E++S+G R+ + DD + A + +Q +
Sbjct: 11 LLTQNNLNKSQSSLSSAI---ERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQASRNA 67

Query: 67 TFAIQKVSLEESVLSQVTTAIQNAQEKIVYASNGTLSDDDRASLAMDIQGLRDQLLNLAN 126
I E L+++ +Q +E V A+NGT SD D S+ +IQ +++ ++N
Sbjct: 68 NDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDRVSN 127

Query: 127 TTDGNGRYIFAG 138
T NG + +
Sbjct: 128 QTQFNGVKVLSQ 139


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1982FLGHOOKAP16820.0 Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 682 bits (1762), Expect = 0.0
Identities = 545/546 (99%), Positives = 546/546 (100%)

Query: 2 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS 61
SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS
Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS 60

Query: 62 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV 121
GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV
Sbjct: 61 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV 120

Query: 122 SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND 181
SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND
Sbjct: 121 SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND 180

Query: 182 QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA 241
QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA
Sbjct: 181 QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA 240

Query: 242 RQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQL 301
RQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQL
Sbjct: 241 RQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQL 300

Query: 302 ALAFAEAFNTQHKAGFDANGDAGEDFFAIGKPAVLQNTKNKGDVAIGATVTDASAVLATD 361
ALAFAEAFNTQHKAGFDANGDAGEDFFAIGKPAVLQNTKNKGDVAIGATVTDASAVLATD
Sbjct: 301 ALAFAEAFNTQHKAGFDANGDAGEDFFAIGKPAVLQNTKNKGDVAIGATVTDASAVLATD 360

Query: 362 YKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAIV 421
YKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAIV
Sbjct: 361 YKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAIV 420

Query: 422 NMDVLITDEAKIAMASEKDAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGN 481
NMDVLITDEAKIAMASE+DAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGN
Sbjct: 421 NMDVLITDEAKIAMASEEDAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGN 480

Query: 482 KTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFD 541
KTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFD
Sbjct: 481 KTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFD 540

Query: 542 ALINIR 547
ALINIR
Sbjct: 541 ALINIR 546


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1984FLGPRINGFLGI427e-152 Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 427 bits (1099), Expect = e-152
Identities = 157/363 (43%), Positives = 213/363 (58%), Gaps = 9/363 (2%)

Query: 4 FLSALILLLVTTAAQAERIRDLTSVQGVRQNSLIGYGLVVGLDGTGDQTTQTPFTTQTLN 63
F + L A RI+D+ S+Q R N LIGYGLVVGL GTGD +PFT Q++
Sbjct: 13 FSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQSMR 72

Query: 64 NMLSQLGITVPTGTNMQLKNVAAVMVTASLPPFGRQGQTIDVVVSSMGNAKSLRGGTLLM 123
ML LGIT G + KN+AAVMVTA+LPPF G +DV VSS+G+A SLRGG L+M
Sbjct: 73 AMLQNLGITTQGGQS-NAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNLIM 131

Query: 124 TPLKGVDSQVYALAQGNILVGGAGASAGGSSVQVNQLNGGRITNGAVIERELPSQFGVGN 183
T L G D Q+YA+AQG ++V G A +++ R+ NGA+IERELPS+F
Sbjct: 132 TSLSGADGQIYAVAQGALIVNGFSAQGDAATLTQGVTTSARVPNGAIIERELPSKFKDSV 191

Query: 184 TLNLQLNDEDFSMAQQIADTINRVR----GYGSATALDARTIQVRVPSGNSSQVRFLADI 239
L LQL + DFS A ++AD +N G A D++ I V+ P + R +A+I
Sbjct: 192 NLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPRV-ADLTRLMAEI 250

Query: 240 QNMQVNVTPQDAKVVINSRTGSVVMNREVTLDSCAVAQGNLSVTVNRQANVSQPDTPFGG 299
+N+ V T AKVVIN RTG++V+ +V + AV+ G L+V V V QP PF
Sbjct: 251 ENLTVE-TDTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQP-APFSR 308

Query: 300 GQTVVTPQTQIDLRQSGGSLQSVRSSARLNNVVRALNALGATPMDLMSILQSMQSAGCLR 359
GQT V PQT I Q G + ++ L +V LN++G +++ILQ ++SAG L+
Sbjct: 309 GQTAVQPQTDIMAMQEGSKV-AIVEGPDLRTLVAGLNSIGLKADGIIAILQGIKSAGALQ 367

Query: 360 AKL 362
A+L
Sbjct: 368 AEL 370


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1985FLGLRINGFLGH346e-124 Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 346 bits (890), Expect = e-124
Identities = 230/232 (99%), Positives = 230/232 (99%)

Query: 1 MQKNAAHTYAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINY 60
MQKNAAHTYAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINY
Sbjct: 1 MQKNAAHTYAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINY 60

Query: 61 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNA 120
GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNA
Sbjct: 61 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNA 120

Query: 121 RADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF 180
RADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF
Sbjct: 121 RADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF 180

Query: 181 SGVVNPRTISGSNTVPSTQVVDARIEYVGNGYINEAQNMGWLQHFFLNLSPM 232
SGVVNPRTISGSNTVPSTQV DARIEYVGNGYINEAQNMGWLQ FFLNLSPM
Sbjct: 181 SGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM 232


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1986FLGHOOKAP1434e-07 Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 43.4 bits (102), Expect = 4e-07
Identities = 18/81 (22%), Positives = 36/81 (44%), Gaps = 14/81 (17%)

Query: 3 SSLWIAKTGLDAQQTNMDVIANNLANVSTNGFKRQRAVFEDLLYQTIRQPGAQSSEQTTL 62
S + A +GL+A Q ++ +NN+++ + G+ RQ + + +TL
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTI--------------MAQANSTL 47

Query: 63 PSGLQIGTGVRPVATERLHSQ 83
+G +G GV +R +
Sbjct: 48 GAGGWVGNGVYVSGVQREYDA 68



Score = 38.8 bits (90), Expect = 1e-05
Identities = 11/41 (26%), Positives = 21/41 (51%)

Query: 220 ETSNVNVAEELVNMIQVQRAYEINSKAVSTTDQMQQKLTQL 260
S VN+ EE N+ + Q+ Y N++ + T + + L +
Sbjct: 505 SISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_1988FLGHOOKAP1415e-06 Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 41.5 bits (97), Expect = 5e-06
Identities = 17/49 (34%), Positives = 29/49 (59%)

Query: 354 TLTNGALEASNVDLSKELVNMIVAQRNYQSNAQTIKTQDQILNTLVNLR 402
L+N S V+L +E N+ Q+ Y +NAQ ++T + I + L+N+R
Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546



Score = 37.2 bits (86), Expect = 1e-04
Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 4/56 (7%)

Query: 6 AVSGLNAAATNLDVIGNNIANSATYGFKSGTASFAD----MFAGSKVGLGVKVAGI 57
A+SGLNAA L+ NNI++ G+ T A + AG VG GV V+G+
Sbjct: 7 AMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSGV 62


75SBO_2388SBO_2396N        Y        YPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_2388024-5.398850lipoprotein
SBO_2389027-6.745601transport
SBO_2391032-8.874870*IS911 ORF2
SBO_2392035-9.241783D-serine dehydratase
SBO_2393037-10.131489multidrug resistance protein Y
SBO_2394036-9.260148multidrug resistance protein K
SBO_2395034-8.554540DNA-binding transcriptional activator EvgA
SBO_2396034-8.166304hybrid sensory histidine kinase in two-component
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_2388VACJLIPOPROT407e-148 VacJ lipoprotein signature.
		>VACJLIPOPROT#VacJ lipoprotein signature.

Length = 251

Score = 407 bits (1048), Expect = e-148
Identities = 250/251 (99%), Positives = 250/251 (99%)

Query: 1 MKLRLSALALGTTLLVGCASSGTDQQGRSDPLEGFNRTMYNFNFNVLDPYIVRPVAVAWR 60
MKLRLSALALGTTLLVGCASSGTDQQGRSDPLEGFNRTMYNFNFNVLDPYIVRPVAVAWR
Sbjct: 1 MKLRLSALALGTTLLVGCASSGTDQQGRSDPLEGFNRTMYNFNFNVLDPYIVRPVAVAWR 60

Query: 61 DYVPQPARNGLSNFTGNLEEPAVMVNYFLQGDPYQGMVHFTRFFLNTILGMGGFIDVAGM 120
DYVPQPARNGLSNFTGNLEEPAVMVNYFLQGDPYQGMVHFTRFFLNTILGMGGFIDVAGM
Sbjct: 61 DYVPQPARNGLSNFTGNLEEPAVMVNYFLQGDPYQGMVHFTRFFLNTILGMGGFIDVAGM 120

Query: 121 ANPKLQRTEPHRFGSTLGHYGVGYGPYVQLPFYGSFTLRDDGGDMADGLYPVLSWLTWPM 180
ANPKLQRTEPHRFGSTLGHYGVGYGPYVQLPFYGSFTLRDDGGDMAD LYPVLSWLTWPM
Sbjct: 121 ANPKLQRTEPHRFGSTLGHYGVGYGPYVQLPFYGSFTLRDDGGDMADALYPVLSWLTWPM 180

Query: 181 SVGKWTLEGIETRAQLLDSDGLLRQSSDPYIMVREAYFQRHDFIANGGELKPQENPNAQA 240
SVGKWTLEGIETRAQLLDSDGLLRQSSDPYIMVREAYFQRHDFIANGGELKPQENPNAQA
Sbjct: 181 SVGKWTLEGIETRAQLLDSDGLLRQSSDPYIMVREAYFQRHDFIANGGELKPQENPNAQA 240

Query: 241 IQDDLKDIDSE 251
IQDDLKDIDSE
Sbjct: 241 IQDDLKDIDSE 251


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_2393TCRTETB1214e-32 Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 121 bits (306), Expect = 4e-32
Identities = 92/404 (22%), Positives = 167/404 (41%), Gaps = 17/404 (4%)

Query: 19 VTIALSLATFMQMLDSTISNVAIPTISGFLGASTDEGTWVITSFGVANAIAIPVTGRLAQ 78
+ I L + +F +L+ + NV++P I+ WV T+F + +I V G+L+
Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74

Query: 79 RIGELRLFLLSVTFFSLSSLMCSLS-TNLDVLIFFRVVQGLMAGPLIPLSQSLLLRNYPP 137
++G RL L + S++ + + +LI R +QG A L ++ R P
Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134

Query: 138 EKRTFALALWSMTVIIAPICGPILGGYICDNFSWGWIFLINVPMGIIVLTLCLTLLKGRE 197
E R A L V + GP +GG I W +L+ +PM I+ L L +E
Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKE 192

Query: 198 TETSPVKMNLPGLTLLVLGVGGLQIMLDKGRDLDWFNSSTIIILTVVSVISLISLVIWES 257
++ G+ L+ +G+ + ML F +S I +VSV+S + V
Sbjct: 193 VRIKG-HFDIKGIILMSVGI--VFFML--------FTTSYSISFLIVSVLSFLIFVKHIR 241

Query: 258 TSENPILDLSLFKSRNFTIGIVSITCAYLFYSGAIVLMPQLLQETMGYNAIWAGLAYAPI 317
+P +D L K+ F IG++ + +G + ++P ++++ + G
Sbjct: 242 KVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFP 301

Query: 318 GIMPLLIS-PLIGRYGNKIDMRLLVTFSFLMYAVCYYWRSVTFMPTIDFTGIILPQFFQG 376
G M ++I + G ++ ++ +V + S T F II+ G
Sbjct: 302 GTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGG 361

Query: 377 FAVACFFLPLTTISFSGLPDNKFANASSMSNFFRTLSGSVGTSL 420
+ ++TI S L + S+ NF LS G ++
Sbjct: 362 LSFTK--TVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAI 403


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_2394RTXTOXIND765e-17 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 75.6 bits (186), Expect = 5e-17
Identities = 62/412 (15%), Positives = 121/412 (29%), Gaps = 96/412 (23%)

Query: 13 RRKYFSLLAVVLFIAFSGAYVYWSMELEDMISTDDAYVT-GNADPISAQVSGSVTVVNHK 71
RR ++ F+ + ++E + + + G + I + V + K
Sbjct: 55 RRPRLVAYFIMGFLVIAFILSVLG-QVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVK 113

Query: 72 DTNYVRQGDILVSLDKTDATIALNKA---------------------------------- 97
+ VR+GD+L+ L A K
Sbjct: 114 EGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDE 173

Query: 98 ------------------KNNLANIVRQTNKLYLQDKQYSAEVASARIQ---YQQSLEDY 136
K + Q + L + AE + + Y+
Sbjct: 174 PYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVE 233

Query: 137 NRRV----PLAKQGVISKE----------TLEHTKDTLISSKAALNAAIQAYKANKALVM 182
R+ L + I+K + S + + I + K LV
Sbjct: 234 KSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVT 293

Query: 183 N-------TPLNR-QPQVVEAADATKEAWLALKRTDIKSPVTGYIAQRSVQ-VGETVSPG 233
L + + + + + I++PV+ + Q V G V+
Sbjct: 294 QLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTA 353

Query: 234 QSLMAVVPARQ-MWVNANFKETQLTDVRIGQSVNIISDLYGENVVFHGRVTGINMGTGNA 292
++LM +VP + V A + + + +GQ+ I + F G +G
Sbjct: 354 ETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVE------AFPYTRYGYLVGK--- 404

Query: 293 FSLLPAQNATGNWIKIVQRVPVEVSLDPKELMEH----PLRIGLSMTATIDT 340
+ + +V V +S++ L PL G+++TA I T
Sbjct: 405 VKNINLDAIEDQRLGLVFNVI--ISIEENCLSTGNKNIPLSSGMAVTAEIKT 454


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_2395HTHFIS493e-09 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 49.1 bits (117), Expect = 3e-09
Identities = 22/148 (14%), Positives = 53/148 (35%), Gaps = 31/148 (20%)

Query: 4 IIIDDHPLAIAAIRNLLIKNDIEILAELTEGGSAVQRVETLKPDIVIIDVDIPGVNGIQV 63
++ DD + L + ++ + + + + D+V+ DV +P N +
Sbjct: 7 LVADDDAAIRTVLNQALSRAGYDVRIT-SNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 64 LETLRKRQYSGIIIIVSAKNDHFYGKHCADAGANGFVSKKEGMNNIIAAIEAAKNGYCYF 123
L ++K + ++++SA+N + AI+A++ G +
Sbjct: 66 LPRIKKARPDLPVLVMSAQNT------------------------FMTAIKASEKGAYDY 101

Query: 124 ---PFSLNRFVGSLTSDQQKLDSLSKQE 148
PF L + + L ++
Sbjct: 102 LPKPFDLTE---LIGIIGRALAEPKRRP 126


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_2396HTHFIS792e-17 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.5 bits (196), Expect = 2e-17
Identities = 30/105 (28%), Positives = 51/105 (48%)

Query: 960 SILIADDHPTNRLLLKRQLNLLGYDVDEATDGVQALHKVSMQHYDLLITDVNMPNMDGFE 1019
+IL+ADD R +L + L+ GYDV ++ ++ DL++TDV MP+ + F+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 1020 LTRKLREQNSSLPIWGLTANAQANEREKGLSCGMNLCLFKPLTLD 1064
L ++++ LP+ ++A K G L KP L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLT 109


76SBO_2830SBO_2836N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_2830-1131.120155S-ribosylhomocysteinase
SBO_2831-1121.580244multidrug resistance
SBO_28320150.660856multidrug resistance secretion protein
SBO_2833-1131.862320transcriptional repressor MprA
SBO_28341151.187008hypothetical protein
SBO_2835-1111.821066hypothetical protein
SBO_2836-2111.121858permease
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_2830LUXSPROTEIN290e-104 Bacterial autoinducer-2 (AI-2) production protein Lu...
		>LUXSPROTEIN#Bacterial autoinducer-2 (AI-2) production protein LuxS

signature.
Length = 171

Score = 290 bits (744), Expect = e-104
Identities = 130/170 (76%), Positives = 147/170 (86%)

Query: 2 PLLDSFTVDHTRMEAPAVRVAKTMNTPHGDAITVFDLRFCVPNKEVMPERGSHTLEHLFA 61
PLLDSFTVDHTRM APAVRVAKTM TP GD ITVFDLRF PNK+++ E+G HTLEHL+A
Sbjct: 1 PLLDSFTVDHTRMNAPAVRVAKTMQTPKGDTITVFDLRFTAPNKDILSEKGIHTLEHLYA 60

Query: 62 GFMRNHLNGNGVEIIDISPMGCRTGFYMSLIGTPDEQRVADAWKAAMEDVLKVQDQNQIP 121
GFMRNHLNG+ VEIIDISPMGCRTGFYMSLIGTP EQ+VADAW AAMEDVLKV++QN+IP
Sbjct: 61 GFMRNHLNGDSVEIIDISPMGCRTGFYMSLIGTPSEQQVADAWIAAMEDVLKVENQNKIP 120

Query: 122 ELNVYQCGTYQMHSLQEAQDIARSILERDVRINSNEELALPKEKLQELHI 171
ELN YQCGT MHSL EA+ IA++ILE V +N N+ELALP+ L+EL I
Sbjct: 121 ELNEYQCGTAAMHSLDEAKQIAKNILEVGVAVNKNDELALPESMLRELRI 170


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_2831TCRTETB1313e-35 Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 131 bits (330), Expect = 3e-35
Identities = 97/405 (23%), Positives = 169/405 (41%), Gaps = 23/405 (5%)

Query: 17 IALSLATFMQVLDSTIANVAIPTIAGNLGSSLSQGTWVITSFGVANAISIPLTGWLAKRV 76
I L + +F VL+ + NV++P IA + + WV T+F + +I + G L+ ++
Sbjct: 17 IWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQL 76

Query: 77 GEVKLFLWSTIAFAIASWECGVS-SSLNMLIFFRVIQGIVAGPLIPLSQSLLLNNYPPAK 135
G +L L+ I S V S ++LI R IQG A L ++ P
Sbjct: 77 GIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKEN 136

Query: 136 RSIALALWSMTVIVAPICGPILGGYISDNYHWGWIFFINVPIGVAVVLMTLQTLRGRETR 195
R A L V + GP +GG I+ HW + + +P+ + + L L +E R
Sbjct: 137 RGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKEVR 194

Query: 196 TERRRIDAVGLALLVIGIGSLQIMLDRGKELDWFSSQEIIILTVVAVVAICFLIVWELTD 255
+ D G+ L+ +GI + ML F++ I +V+V++ +
Sbjct: 195 I-KGHFDIKGIILMSVGI--VFFML--------FTTSYSISFLIVSVLSFLIFVKHIRKV 243

Query: 256 DNPIVDLSLFKSRNFTIGCLCISLAYMLYFGAIVLLPQLLQEVYGYTATWAGLASAPVGI 315
+P VD L K+ F IG LC + + G + ++P ++++V+ + G G
Sbjct: 244 TDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGT 303

Query: 316 IPVILS-PIIGRFAHKLDMRRLVTFSFIMYAVCFYWRAYTFEPGMDFGASAWPQFIQGF- 373
+ VI+ I G + ++ +V F ++ S + I F
Sbjct: 304 MSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFL-----LETTSWFMTIIIVFV 358

Query: 374 --AVACFFMPLTTITLSGLPPERLAAASSLSNFTRTLAGSIGTSI 416
++ ++TI S L + A SL NFT L+ G +I
Sbjct: 359 LGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAI 403


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_2832RTXTOXIND795e-18 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 78.7 bits (194), Expect = 5e-18
Identities = 64/412 (15%), Positives = 120/412 (29%), Gaps = 97/412 (23%)

Query: 25 LLLTLLFIIIAVAIGIYWFLVLRHFEETDDA----YVAGNQIQIMSQVSGSVTKVWADNT 80
L FI+ + I VL E A +G +I + V ++
Sbjct: 57 PRLVAYFIMGFLVIAFILS-VLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEG 115

Query: 81 DFVKEGDVLVTLDPTDARQAFEKA------------------------------------ 104
+ V++GDVL+ L A K
Sbjct: 116 ESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPY 175

Query: 105 ----------------KTALASSVRQTHQLMINSKQLQANIEVQKIALAKA-------QS 141
K ++ Q +Q +N + +A + + +S
Sbjct: 176 FQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKS 235

Query: 142 DYNRRVPLGNANLIGREELQHARDAVTSAQAQLDVAIQQYNANQAMILGTKLEDQPAVQQ 201
+ L + I + + + A +L V Q ++ IL K E Q Q
Sbjct: 236 RLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQL 295

Query: 202 AATEVRN------------------AWLALERTRIVSPMTGYVSRRAVQ-PGAQISPTTP 242
E+ + + + I +P++ V + V G ++
Sbjct: 296 FKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET 355

Query: 243 LMAVVPA-TNMWVDANFKETQIANMRIGQPVTITTDIYGDDVKY---TGKVVGLDMGTGS 298
LM +VP + V A + I + +GQ I + + +Y GKV + +
Sbjct: 356 LMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAF-PYTRYGYLVGKVKNI-----N 409

Query: 299 AFSLLPAQNATGNWIKVVQRLPVRIELDQKQLEQYPLRIGLSTLVSVNTTNR 350
++ G V+ + + PL G++ + T R
Sbjct: 410 LDAIE--DQRLGLVFNVIISIEENCLST--GNKNIPLSSGMAVTAEIKTGMR 457


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_2833PF05272280.018 Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.5 bits (63), Expect = 0.018
Identities = 23/94 (24%), Positives = 36/94 (38%), Gaps = 12/94 (12%)

Query: 23 PYQEILLTRLCMHMQSKLLENRNKMLKAQGINETLFMALITLESQENHSIQPSELSCALG 82
P QE+ L + + L R A+G + + T + ++L ALG
Sbjct: 756 PEQELRLVETGVQGRLWALLTREGAPAAEGAAQKGYSVNTTFVTI-------ADLVQALG 808

Query: 83 -----SSRTNATRIADELEKRGWIERRESDNDRR 111
SS ++ D L + GW RE+ RR
Sbjct: 809 ADPGKSSPMLEGQVRDWLNENGWEYLRETSGQRR 842


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_2836TCRTETB453e-07 Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 44.9 bits (106), Expect = 3e-07
Identities = 32/165 (19%), Positives = 70/165 (42%), Gaps = 2/165 (1%)

Query: 34 LDTIARNFSLSASSAGFIVTAAQLGYAAGLLFLVPLGDMFERRRLIVSMTLLAAGGMLIT 93
L IA +F+ +S ++ TA L ++ G L D +RL++ ++ G +I
Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96

Query: 94 ASSQSLA-MMILGTALTGLFSVVAQILVPLA-ATLASPDKRGKVVGTIMSGLLLGILLAR 151
S ++I+ + G + LV + A + RGK G I S + +G +
Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156

Query: 152 TVAGLLANLGGWRTVFWVASVLMALMALALWRGLPQMKSETHLNY 196
+ G++A+ W + + + + + + +++ + H +
Sbjct: 157 AIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDI 201


77SBO_3014SBO_3017N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_30141184.135880type II secretion protein
SBO_30150153.861390type II secretion protein
SBO_30161194.062584type II secretion protein
SBO_3017-1173.294450type II secretion protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3014BCTERIALGSPF451e-160 Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F

signature.
Length = 408

Score = 451 bits (1162), Expect = e-160
Identities = 224/406 (55%), Positives = 301/406 (74%), Gaps = 1/406 (0%)

Query: 1 MALFYYQALERNGRKTKGMIEADSARHARQLLRGKELIPVHI-EARMNASAGGMLQRRRH 59
MA ++YQAL+ G+K +G EADSAR ARQLLR + L+P+ + E R + G
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 60 AHRRVAAADLALFTRQLATLVQAAMPLETCLQAVSEQSEKLHVKSLGMALRSRIQEGYTL 119
R++ +DLAL TRQLATLV A+MPLE L AV++QSEK H+ L A+RS++ EG++L
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 120 SDSLREHPRVFDSLFCSMVAAGEKSGHLDVVLNRLADYTEQRQRLKSRLLQAMLYPLVML 179
+D+++ P F+ L+C+MVAAGE SGHLD VLNRLADYTEQRQ+++SR+ QAM+YP V+
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180

Query: 180 VVATGVVTILLTAVVPKIIEQFDHLGHALPSSTRALIAMSDALQASGVYWLAGLLALLVL 239
VVA VV+ILL+ VVPK++EQF H+ ALP STR L+ MSDA++ G + L LLA +
Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240

Query: 240 GQWLLKNPTMRLRWDKTLLRLPVTGRVARGLNTARFSRTLSILTASSVPLLEGIQTAAAV 299
+ +L+ R+ + + LL LP+ GR+ARGLNTAR++RTLSIL AS+VPLL+ ++ + V
Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300

Query: 300 SANRYVEQQLLLAADRVREGSSLRAALAELRLFPPMMLYIIASGEQSGELETMLEQAAVN 359
+N Y +L LA D VREG SL AL + LFPPMM ++IASGE+SGEL++MLE+AA N
Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360

Query: 360 QEREFDTQVGLALGLFEPALVVMMAGVVLFIVIAILEPMLQLNNMV 405
Q+REF +Q+ LALGLFEP LVV MA VVLFIV+AIL+P+LQLN ++
Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3015BCTERIALGSPG2176e-76 Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 217 bits (553), Expect = 6e-76
Identities = 89/146 (60%), Positives = 109/146 (74%), Gaps = 3/146 (2%)

Query: 6 RTHKPRAGFTLLEVMVVIVILGVLASLVVPNLLGNKEKANRQKAISDIVALENALDMYRL 65
R + GFTLLE+MVVIVI+GVLASLVVPNL+GNKEKA++QKA+SDIVALENALDMY+L
Sbjct: 2 RATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKL 61

Query: 66 DNGRYPTTEQGLEALIQQPANMADSRNYRTGGYIKRLPKDPWGNDYQYLSPGEKGLFDVY 125
DN YPTT QGLE+L++ P + NY GYIKRLP DPWGNDY ++PGE G +D+
Sbjct: 62 DNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDLL 121

Query: 126 TLGADGQENGEGAGADIGNWNLQEFQ 151
+ G DG+ E DI NW L + +
Sbjct: 122 SAGPDGEMGTED---DITNWGLSKKK 144


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3016BCTERIALGSPH552e-12 Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H

signature.
Length = 170

Score = 55.0 bits (132), Expect = 2e-12
Identities = 31/159 (19%), Positives = 54/159 (33%), Gaps = 28/159 (17%)

Query: 1 MLVIFLIGLASAGVVQTFATDSEPPAKKAAQDFLTRFAQFKDRAVIEGQTLGVLIDPPGY 60
ML++ L+G+++ V+ F + A + F + + R + GQ GV + P +
Sbjct: 12 MLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTGQFFGVSVHPDRW 71

Query: 61 QFMQRRQGQWLPVSATRLSAQVTVPKQVQMLLQPGSDIWQKEYALELQRRRL----TLHD 116
QF+ + P D W L L+ R+ ++
Sbjct: 72 QFLVLEARDGADPA-------------------PADDGWSGYRWLPLRAGRVATSGSIAG 112

Query: 117 IELEL-----QKEAKKKTPQIRFSPFEPATPFTLRFYSA 150
+L L + P + P TPF L A
Sbjct: 113 GKLNLAFAQGEAWTPGDNPDVLIFPGGEMTPFRLTLGEA 151


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3017BCTERIALGSPH368e-06 Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H

signature.
Length = 170

Score = 36.5 bits (84), Expect = 8e-06
Identities = 13/24 (54%), Positives = 19/24 (79%)

Query: 2 KRGFTLLEVMLVLAIFALAATAVL 25
+RGFTLLE+ML+L + ++A VL
Sbjct: 3 QRGFTLLEMMLILLLMGVSAGMVL 26


78SBO_3152SBO_3155N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_3152-113-0.440042arginine repressor
SBO_3153-114-0.293887malate dehydrogenase
SBO_3154-213-0.268790serine endoprotease
SBO_3155-115-0.144776serine endoprotease
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3152ARGREPRESSOR1694e-57 Bacterial arginine repressor signature.
		>ARGREPRESSOR#Bacterial arginine repressor signature.

Length = 149

Score = 169 bits (430), Expect = 4e-57
Identities = 44/141 (31%), Positives = 71/141 (50%), Gaps = 5/141 (3%)

Query: 15 KALLKEEKFSSQGEIVAALQEQGFDNINQSKVSRMLTKFGAVRTRNAKMEMVYCLPAELG 74
+ ++ + +Q E+V L++ G+ N+ Q+ VSR + + V+ Y LPA+
Sbjct: 11 REIITANEIETQDELVDILKKDGY-NVTQATVSRDIKELHLVKVPTNNGSYKYSLPADQR 69

Query: 75 VPTTSSPLKNLV---LDIDYNDAVVVIHTSPGAAQLIARLLDSLGKAEGILGTIAGDDTI 131
S ++L+ + ID ++V+ T PG AQ I L+D+L E I+GTI GDDTI
Sbjct: 70 FNPLSKLKRSLMDAFVKIDSASHLIVLKTMPGNAQAIGALMDNLDWEE-IMGTICGDDTI 128

Query: 132 FTTPANGFTVKDLYEAILELF 152
K + + ILEL
Sbjct: 129 LIICRTHDDTKVVQKKILELL 149


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3153DHBDHDRGNASE280.045 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 28.1 bits (62), Expect = 0.045
Identities = 37/167 (22%), Positives = 61/167 (36%), Gaps = 27/167 (16%)

Query: 3 VAVLGAAGGIGQALALLLKTQLPSGSELSLYDIAPVTPGVAVDLSHIPTAVKIKGFSGED 62
+ GAA GIG+A+A L G+ ++ D P V S A + F +
Sbjct: 11 AFITGAAQGIGEAVARTL---ASQGAHIAAVDYNP-EKLEKVVSSLKAEARHAEAFPADV 66

Query: 63 ATPA------------LEGADVVLISAGVARK------PGMDRSDLFNVNAGIVKNLVQQ 104
A + D+++ AGV R + F+VN+ V N +
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126

Query: 105 VAKTCPK----ACIGIITNPVNTT-VAIAAEVLKKAGVYDKNKLFGV 146
V+K + + + +NP ++AA KA K G+
Sbjct: 127 VSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGL 173


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3154V8PROTEASE529e-10 V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 52.3 bits (125), Expect = 9e-10
Identities = 31/160 (19%), Positives = 59/160 (36%), Gaps = 26/160 (16%)

Query: 77 RTLGSGVIMDQRGYIITNKHVINDADQIIVALQ------------DGRVFEALLVGSDSL 124
+ SGV++ + ++TNKHV++ AL+ +G +
Sbjct: 101 TFIASGVVVG-KDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGE 159

Query: 125 TDLAVLKI-------NATGGLPTIPINARRVPHIGDVVLAIGNPYNLGQTITQGIISATG 177
DLA++K + + ++ + + G P + T + G
Sbjct: 160 GDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGD-KPVATMW--ESKG 216

Query: 178 RIGLNPTGRQNFLQTDASINHGNSGGALVNSLGELMGINT 217
+I + +Q D S GNSG + N E++GI+
Sbjct: 217 KI---TYLKGEAMQYDLSTTGGNSGSPVFNEKNEVIGIHW 253


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3155V8PROTEASE724e-16 V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 72.3 bits (177), Expect = 4e-16
Identities = 32/184 (17%), Positives = 63/184 (34%), Gaps = 38/184 (20%)

Query: 90 GLGSGVIINASKGYVLTNNHVINQAQKISIQL------------NDGREFDAKLIGSDDQ 137
+ SGV++ K +LTN HV++ L +G ++ +
Sbjct: 102 FIASGVVVG--KDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGE 159

Query: 138 SDIALLQIQN-------PSKLTQIAIADSDKLRVGDFAVAVGNPFGLGQTATSGIVSALG 190
D+A+++ + ++++ + +V G P V+ +
Sbjct: 160 GDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGDKP-------VATMW 212

Query: 191 RSGLNLEGLEN-FIQTDASINRGNSGGALLNLNGELIGINTAILAPGGGSVGIGFAIPSN 249
S + L+ +Q D S GNSG + N E+IGI+ G+
Sbjct: 213 ESKGKITYLKGEAMQYDLSTTGGNSGSPVFNEKNEVIGIHWG---------GVPNEFNGA 263

Query: 250 MART 253
+
Sbjct: 264 VFIN 267


79SBO_3255SBO_3261N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_3255-111-2.625646DNA-binding protein Fis
SBO_3256-112-2.329404methyltransferase
SBO_3257-214-2.273659hypothetical protein
SBO_3258-212-1.719035DNA-binding transcriptional regulator EnvR
SBO_3259-212-1.534939AcrE protein
SBO_3260014-2.228949integral transmembrane protein
SBO_3261117-3.230667hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3255DNABINDNGFIS1573e-54 DNA-binding protein FIS signature.
		>DNABINDNGFIS#DNA-binding protein FIS signature.

Length = 98

Score = 157 bits (399), Expect = 3e-54
Identities = 98/98 (100%), Positives = 98/98 (100%)

Query: 1 MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQ 60
MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQ
Sbjct: 1 MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQ 60

Query: 61 PLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN 98
PLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN
Sbjct: 61 PLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN 98


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3258HTHTETR1291e-39 TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 129 bits (325), Expect = 1e-39
Identities = 78/209 (37%), Positives = 122/209 (58%), Gaps = 3/209 (1%)

Query: 1 MAKRTKAEALKTRQELIETAIAQFAQHGVSKTTLNDIADAANVTRGAIYWHFENKTQLFN 60
MA++TK EA +TRQ +++ A+ F+Q GVS T+L +IA AA VTRGAIYWHF++K+ LF+
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 61 EMW-LQQPSLRELIQEHLTAGLEHDPFQQLREKLIVGLQYIAKIPRQQALLKILYHKCEF 119
E+W L + ++ EL E A DP LRE LI L+ R++ L++I++HKCEF
Sbjct: 61 EIWELSESNIGELELE-YQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEF 119

Query: 120 NDEM-LAEGVIREKMGFNPQTLREVLQACQQQGCVANNLDLDVVMIIINGAFSGIVQNWL 178
EM + + R + + + L+ C + + +L II+ G SG+++NWL
Sbjct: 120 VGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWL 179

Query: 179 MNMAGYDLYKQAPALVDNVLRMFMPDENI 207
+DL K+A V +L M++ +
Sbjct: 180 FAPQSFDLKKEARDYVAILLEMYLLCPTL 208


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3259RTXTOXIND432e-06 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 42.5 bits (100), Expect = 2e-06
Identities = 38/217 (17%), Positives = 70/217 (32%), Gaps = 38/217 (17%)

Query: 98 ATYQANYNSAKGELAKSEAAAAIAHLTVKRYVPLVGTKYISQQEYDQAIADA-RQADAAV 156
K +L + E+ A + Q + I D RQ +
Sbjct: 262 VEAVNELRVYKSQLEQIESEILSAKEEYQLV----------TQLFKNEILDKLRQTTDNI 311

Query: 157 IAAKGTVESARINLAYTKVTAPISGRIGK-STVTEGALVTNGQTTELATVQQLDPIYVDV 215
+ + + AP+S ++ + TEG +VT +T + V + D + V
Sbjct: 312 GLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETL-MVIVPEDDTLEVTA 370

Query: 216 TQSSND--FMRLKQSVEQGNLHKENATSNVELVMENGQTYP-LKGTLQ--FSDVTVDEST 270
+ D F+ + Q+ +++ Y L G ++ D D+
Sbjct: 371 LVQNKDIGFINVGQNAI------------IKVEAFPYTRYGYLVGKVKNINLDAIEDQRL 418

Query: 271 GSIT--LRAV------FPNPQHTLLPGMYVRARIDEG 299
G + + ++ N L GM V A I G
Sbjct: 419 GLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTG 455



Score = 34.0 bits (78), Expect = 8e-04
Identities = 22/127 (17%), Positives = 43/127 (33%), Gaps = 13/127 (10%)

Query: 46 TAPLEVKTELPGR-TNAYRIAEVRPQVSGIVLNRNFTEGSDVQAGQSLYQIDPATYQANY 104
+E+ G+ T++ R E++P + IV EG V+ G L ++ +A
Sbjct: 77 LGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEA-- 134

Query: 105 NSAKGELAKSEAAAAIAHLTVKRYVPLVGTKYISQQEYDQAIADARQADAAVIAAKGTVE 164
+ K++++ A L RY L E ++ +
Sbjct: 135 -----DTLKTQSSLLQARLEQTRYQIL-----SRSIELNKLPELKLPDEPYFQNVSEEEV 184

Query: 165 SARINLA 171
+L
Sbjct: 185 LRLTSLI 191



Score = 29.4 bits (66), Expect = 0.026
Identities = 11/34 (32%), Positives = 15/34 (44%), Gaps = 1/34 (2%)

Query: 65 AEVRPQVSGIVLNRN-FTEGSDVQAGQSLYQIDP 97
+ +R VS V TEG V ++L I P
Sbjct: 328 SVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVP 361


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3260ACRIFLAVINRP14030.0 Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1403 bits (3633), Expect = 0.0
Identities = 1030/1034 (99%), Positives = 1032/1034 (99%)

Query: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60
MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120
VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180
EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRL 240
QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300
KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360
DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420
NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480
MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540
SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY
Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540

Query: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600
LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN
Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600

Query: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660
EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG
Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660

Query: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720
FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL
Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720

Query: 721 EDTAQFKLEVDQEKAQALSVSLSDINQTISTALGGTYVNDFIDRGRVKKVYVQADAKFRM 780
EDTAQFKLEVDQEKAQAL VSLSDINQTISTALGGTYVNDFIDRGRVKK+YVQADAKFRM
Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780

Query: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840
LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM
Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840

Query: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900
ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV
Sbjct: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900

Query: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960
MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV
Sbjct: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960

Query: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020
EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF
Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020

Query: 1021 LPVFFVVIRRCFKG 1034
+PVFFVVIRRCFKG
Sbjct: 1021 VPVFFVVIRRCFKG 1034


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3261adhesinb270.008 Adhesin B signature.
		>adhesinb#Adhesin B signature.

Length = 310

Score = 26.7 bits (59), Expect = 0.008
Identities = 14/68 (20%), Positives = 26/68 (38%), Gaps = 10/68 (14%)

Query: 1 MKR---LIPVALLTALLAGCAHDSPCVPVYDDQGCLVHTNTCMKGTTQDNWETAGAIAGG 57
MK+ L+ + L LA C+ + +V TN+ + T++ IAG
Sbjct: 1 MKKCRFLVLLLLAFVGLAACSSQKSSTETGSSKLNVVATNSIIADITKN-------IAGD 53

Query: 58 AAAVAGLT 65
+ +
Sbjct: 54 KINLHSIV 61


80SBO_3319SBO_3335N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_3319551-1.894066bacterioferritin
SBO_3320652-0.659519elongation factor Tu
SBO_3321440-1.861694elongation factor G
SBO_3322428-2.61621530S ribosomal protein S7
SBO_3323422-2.32829430S ribosomal protein S12
SBO_3324422-1.749788sulfur relay protein TusC
SBO_3325013-0.008537sulfur transfer complex subunit TusD
SBO_33260130.162336hypothetical protein
SBO_33271121.821066FKBP-type peptidyl-prolyl cis-trans isomerase
SBO_3328-1142.577526hypothetical protein
SBO_3329-2142.446679FKBP-type peptidyl-prolyl cis-trans isomerase
SBO_3330-1142.151505glutathione-regulated potassium-efflux system
SBO_3331-1181.844343glutathione-regulated potassium-efflux system
SBO_33320152.430655ABC transporter ATP-binding protein
SBO_33330152.254802hydrolase
SBO_33341161.689229hypothetical protein
SBO_3335-1131.714724phosphoribulokinase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3319HELNAPAPROT415e-07 Helicobacter neutrophil-activating protein A family ...
		>HELNAPAPROT#Helicobacter neutrophil-activating protein A family

signature.
Length = 153

Score = 40.6 bits (95), Expect = 5e-07
Identities = 19/103 (18%), Positives = 44/103 (42%), Gaps = 10/103 (9%)

Query: 44 EYHESIDEMKHADKYIERILFLEGTPN--LQDLGKL------GIGEDVEEMLQSDLRLEQ 95
E ++ E D ER+L + G P +++ + G EM+Q+ + +
Sbjct: 52 ELYDHAAE--TVDTIAERLLAIGGQPVATVKEYTEHASITDGGNETSASEMVQALVNDYK 109

Query: 96 EGAKDLREAIAYADSVHDYVSRDMMIEILADEEGHIDWLETEL 138
+ + + + I A+ D + D+ + ++ + E + L + L
Sbjct: 110 QISSESKFVIGLAEENQDNATADLFVGLIEEVEKQVWMLSSYL 152


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3320TCRTETOQM803e-18 Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 79.5 bits (196), Expect = 3e-18
Identities = 57/198 (28%), Positives = 87/198 (43%), Gaps = 13/198 (6%)

Query: 13 VNVGTIGHVDHGKTTLTAAI------TTVLAKTYGGAARAFDQIDNAPEEKARGITINTS 66
+N+G + HVD GKTTLT ++ T L G R DN E+ RGITI T
Sbjct: 4 INIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRT----DNTLLERQRGITIQTG 59

Query: 67 HVEYDTPTRHYAHVDCPGHADYVKNMITGAAQMDGAILVVAATDGPMPQTREHILLGRQV 126
+ +D PGH D++ + + +DGAIL+++A DG QTR R++
Sbjct: 60 ITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKM 119

Query: 127 GVPYIIVFLNKCDMVDDEELLELVEMEVRELLSQYDFPGDDTPIVRGSALKALEGDAEWE 186
G+P I F+NK D + L V +++E LS + + +W+
Sbjct: 120 GIP-TIFFINKIDQNGID--LSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWD 176

Query: 187 AKILELAGFLDSYIPEPE 204
I L+ Y+
Sbjct: 177 TVIEGNDDLLEKYMSGKS 194


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3321TCRTETOQM6130.0 Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 613 bits (1583), Expect = 0.0
Identities = 178/698 (25%), Positives = 304/698 (43%), Gaps = 81/698 (11%)

Query: 9 RYRNIGISAHIDAGKTTTTERILFYTGVNHKIGEVHDGAATMDWMEQEQERGITITSAAT 68
+ NIG+ AH+DAGKTT TE +L+ +G ++G V G D E++RGITI + T
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61

Query: 69 TAFWSGMAKQYEPHRINIIDTPGHVDFTIEVERSMRVLDGAVMVYCAVGGVQPQSETVWR 128
+ W ++NIIDTPGH+DF EV RS+ VLDGA+++ A GVQ Q+ ++
Sbjct: 62 SFQWEN-------TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFH 114

Query: 129 QANKYKVPRIAFVNKMDRMGANFLKVVNQIKTRLGANPVPLQLAIGAEEHFTGVVDLVKM 188
K +P I F+NK+D+ G + V IK +L A V Q V M
Sbjct: 115 ALRKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQ----------KVELYPNM 164

Query: 189 KAINWNDADQGVTFEYEDIPADMVELANEWHQNLIESAAEASEELMEKYLGGEELTEAEI 248
N+ +++Q ++ E +++L+EKY+ G+ L E+
Sbjct: 165 CVTNFTESEQ------------------------WDTVIEGNDDLLEKYMSGKSLEALEL 200

Query: 249 KGALRQRVLNNEIILVTCGSAFKNKGVQAMLDAVIDYLPSPVDVPAINGILDDGKDTPAE 308
+ R N + V GSA N G+ +++ + + S
Sbjct: 201 EQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITNKFYSSTH----------------- 243

Query: 309 RHASDDEPFSALAFKIATDPFVGNLTFFRVYSGVVNSGDTVLNSVKAARERFGRIVQMHA 368
FKI L + R+YSGV++ D+V S K + + +
Sbjct: 244 ---RGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEK-EKIKITEMYTSIN 299

Query: 369 NKREEIKEVRAGDIAAAIG----LKDVTTGDTLCDPDAPIILERMEFPEPVISIAVEPKT 424
+ +I + +G+I L V GDT P ER+E P P++ VEP
Sbjct: 300 GELCKIDKAYSGEIVILQNEFLKLNSV-LGDTKLLPQR----ERIENPLPLLQTTVEPSK 354

Query: 425 KADQEKMGLALGRLAKEDPSFRVWTDEESNQTIIAGMGELHLDIIVDRMKREFNVEANVG 484
+E + AL ++ DP R + D +++ I++ +G++ +++ ++ +++VE +
Sbjct: 355 PQQREMLLDALLEISDSDPLLRYYVDSATHEIILSFLGKVQMEVTCALLQEKYHVEIEIK 414

Query: 485 KPQVAYRETIRQKVTDVEGKHAKQSGGRGQYGHVVIDMYPLEPGSNPKGYEFINDIKGGV 544
+P V Y E +K E + + + + + PL GS G ++ + + G
Sbjct: 415 EPTVIYMERPLKK---AEYTIHIEVPPNPFWASIGLSVSPLPLGS---GMQYESSVSLGY 468

Query: 545 IPGEYIPAVDKGIQEQLKAGPLAGYPVVDMGIRLHFGSYHDVDSSELAFKLAASIAFKEG 604
+ + AV +GI+ + G L G+ V D I +G Y+ S+ F++ A I ++
Sbjct: 469 LNQSFQNAVMEGIRYGCEQG-LYGWNVTDCKICFKYGLYYSPVSTPADFRMLAPIVLEQV 527

Query: 605 FKKAKPVLLEPIMKVEVETPEENTGDVIGDLSRRRGMLKGQESEVTGVKIHAEVPLSEMF 664
KKA LLEP + ++ P+E D + + + + V + E+P +
Sbjct: 528 LKKAGTELLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKNNEVILSGEIPARCIQ 587

Query: 665 GYATQLRSLTKGRASYTMEFLKYDEAPSNVAQAVIEAR 702
Y + L T GR+ E Y + V + R
Sbjct: 588 EYRSDLTFFTNGRSVCLTELKGYHVT---TGEPVCQPR 622


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3326ACRIFLAVINRP290.023 Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 28.7 bits (64), Expect = 0.023
Identities = 14/62 (22%), Positives = 29/62 (46%), Gaps = 1/62 (1%)

Query: 164 ASSVEDLVTQTLEFTIEEVNADRNV-SNNAKNRQIVLNLYEKGIFDIKDAINQVADRLNI 222
A +V+D VTQ +E + ++ + S + + + L + D A QV ++L +
Sbjct: 54 AQTVQDTVTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQL 113

Query: 223 SK 224
+
Sbjct: 114 AT 115


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3327INFPOTNTIATR1325e-40 Macrophage infectivity potentiator signature.
		>INFPOTNTIATR#Macrophage infectivity potentiator signature.

Length = 233

Score = 132 bits (334), Expect = 5e-40
Identities = 79/226 (34%), Positives = 124/226 (54%), Gaps = 9/226 (3%)

Query: 28 AAKPATTADSKAAFKNDDQKSAYALGASLGRYMENSLKEQEKLGIKLDKDQLIAGVQDAF 87
A A A + D K +Y++GA LG K + GI ++ D L G+QD
Sbjct: 14 AMSTAMAATDATSLTTDKDKLSYSIGADLG-------KNFKNQGIDINPDVLAKGMQDGM 66

Query: 88 A-DKSKLSDQEIEQTLQAFEARVKSSAQAKMEKDAADNEAKGKEYREKFAKEKGVKTSST 146
+ + L++++++ L F+ + + A+ K A +N+AKG + + G+ +
Sbjct: 67 SGAQLILTEEQMKDVLSKFQKDLMAKRSAEFNKKAEENKAKGDAFLSANKSKPGIVVLPS 126

Query: 147 GLVYQVVEAGKGEAPKDSDTVVVNYKGTLIDGKEFDNSYTRGEPLSFRLDGVIPGWTEGL 206
GL Y++++AG G P SDTV V Y GTLIDG FD++ G+P +F++ VIPGWTE L
Sbjct: 127 GLQYKIIDAGTGAKPGKSDTVTVEYTGTLIDGTVFDSTEKAGKPATFQVSQVIPGWTEAL 186

Query: 207 KNIKKGGKIKLVIPPELAYGKAGVPG-IPPNSTLVFDVELLDVKPA 251
+ + G ++ +P +LAYG V G I PN TL+F + L+ VK A
Sbjct: 187 QLMPAGSTWEVFVPADLAYGPRSVGGPIGPNETLIFKIHLISVKKA 232


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_333060KDINNERMP310.021 60kDa inner membrane protein signature.
		>60KDINNERMP#60kDa inner membrane protein signature.

Length = 548

Score = 30.7 bits (69), Expect = 0.021
Identities = 13/69 (18%), Positives = 29/69 (42%), Gaps = 6/69 (8%)

Query: 261 TAIDPFKGLLLG---LFFISVGMSLNLGVLYTHL-LWVVISVVVLVAVKILVLYLLARLY 316
A+ P L + L+FIS + L +++ + W +++ V+ ++ L
Sbjct: 318 AAVAPHLDLTVDYGWLWFISQPLFKLLKWIHSFVGNWGFSIIIITFIVRGIMYPLTKA-- 375

Query: 317 GVRSSERMQ 325
S +M+
Sbjct: 376 QYTSMAKMR 384


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3331ISCHRISMTASE320.001 Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 31.9 bits (72), Expect = 0.001
Identities = 32/135 (23%), Positives = 51/135 (37%), Gaps = 16/135 (11%)

Query: 12 YAHPESQDSVANRVLLKPATQLSNVTVHDLYAHYPDFFIDIPREQALLREHEVIVFQH-- 69
Y P + D N+V P + + +HD+ ++ D F L + +
Sbjct: 9 YQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANIRKLKNQCV 68

Query: 70 ----PLYTYSCPALLKEWLDRVLSRGFASGPGGNQLAGKYWRNVITTGEPESA------Y 119
P+ + P DR L F GPG N +G Y +IT PE +
Sbjct: 69 QLGIPVVYTAQPGSQNP-DDRALLTDFW-GPGLN--SGPYEEKIITELAPEDDDLVLTKW 124

Query: 120 RYDALNRYPMSDVLR 134
RY A R + +++R
Sbjct: 125 RYSAFKRTNLLEMMR 139


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3332GPOSANCHOR330.005 Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 32.7 bits (74), Expect = 0.005
Identities = 28/152 (18%), Positives = 54/152 (35%), Gaps = 22/152 (14%)

Query: 504 KVEPFDGDLEDYQQWLSDVQKQENQTDEAPKENANSAQARKDQKRREAELRAQTQPLRKE 563
+ D + ++ E + + ++ R+ +R R + L E
Sbjct: 272 AMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAE 331

Query: 564 IARLEKEME---------------------KLNAQLAQAEEKLGDSELYDQSRKAELTAC 602
+LE++ + +L A+ + EE+ SE QS + +L A
Sbjct: 332 HQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDAS 391

Query: 603 LQQQASAKSGLEECEMAWLEAQEQLEQMLLEG 634
+ + + LEE L A E+L + L E
Sbjct: 392 REAKKQVEKALEEANSK-LAALEKLNKELEES 422



Score = 32.0 bits (72), Expect = 0.009
Identities = 13/125 (10%), Positives = 39/125 (31%), Gaps = 7/125 (5%)

Query: 513 EDYQQWLSDVQKQENQTDEAPKENANSAQARKDQKRREAELRAQTQPLRKEIARLEKEME 572
+ + ++ + E A A + D ++ + +++
Sbjct: 127 KALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFST-------ADSAKIK 179

Query: 573 KLNAQLAQAEEKLGDSELYDQSRKAELTACLQQQASAKSGLEECEMAWLEAQEQLEQMLL 632
L A+ A E + + E + TA + + ++ + ++ LE +
Sbjct: 180 TLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMN 239

Query: 633 EGQSN 637
++
Sbjct: 240 FSTAD 244


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3335PF07299320.002 Fibronectin-binding protein (FBP)
		>PF07299#Fibronectin-binding protein (FBP)

Length = 219

Score = 31.8 bits (72), Expect = 0.002
Identities = 10/46 (21%), Positives = 21/46 (45%), Gaps = 2/46 (4%)

Query: 71 PEANDFGLLEQTFIEYGQSGKGKSRKYLHTYDEAVPWNQVPGTFTP 116
P+ + + E ++ KG SRK++ ++ + + GTF
Sbjct: 112 PDMEELDMKELSY--LSWIDKGSSRKFIIAKNDKNKFVGLQGTFQS 155


81SBO_3477SBO_3482N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_34770193.514864nickel transporter ATP-binding protein NikE
SBO_34780183.275027nickel responsive regulator
SBO_34790172.891056hypothetical protein
SBO_3480-1150.588458transporter
SBO_3481-117-2.607220ABC transporter ATP-binding protein
SBO_3482032-7.215079hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3477HTHFIS290.020 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.0 bits (65), Expect = 0.020
Identities = 10/34 (29%), Positives = 19/34 (55%)

Query: 25 QAVLNNVSLTLKSGETVALLGRSGCGKSTLARLL 58
Q + ++ +++ T+ + G SG GK +AR L
Sbjct: 147 QEIYRVLARLMQTDLTLMITGESGTGKELVARAL 180


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3480ABC2TRNSPORT505e-09 ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein

signature.
Length = 262

Score = 49.5 bits (118), Expect = 5e-09
Identities = 41/171 (23%), Positives = 73/171 (42%), Gaps = 7/171 (4%)

Query: 201 REREHGTVEHLLVMPITPFEIMMAKI-WSMGLVVLVVSGLSLVLMVKGVLGVPIEGSIPL 259
R T E +L + +I++ ++ W+ L +G+ +V G + L
Sbjct: 93 RMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAGIGVVAAALGY----TQWLSLL 148

Query: 260 FMLGV-ALSLFATTSIGIFMGTIARSMPQLGLLVILVLLPLQMLSGGSTPRESMPQMVQD 318
+ L V AL+ A S+G+ + +A S LV+ P+ LSG P + +P + Q
Sbjct: 149 YALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQT 208

Query: 319 IMLTMPTTHFVSLAQAILYRGAGFEIVWPQFLTLMAIGGAFF-TIALLRFR 368
+P +H + L + I+ ++ + I FF + ALLR R
Sbjct: 209 AARFLPLSHSIDLIRPIMLGHPVVDVCQHVGALCIYIVIPFFLSTALLRRR 259


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3481PF05272300.044 Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.4 bits (68), Expect = 0.044
Identities = 9/26 (34%), Positives = 14/26 (53%)

Query: 20 ARCMVGLIGPDGVGKSSLLSLISGAR 45
V L G G+GKS+L++ + G
Sbjct: 595 FDYSVVLEGTGGIGKSTLINTLVGLD 620


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3482RTXTOXIND848e-20 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 83.7 bits (207), Expect = 8e-20
Identities = 71/408 (17%), Positives = 139/408 (34%), Gaps = 81/408 (19%)

Query: 6 RHLAWWVVGALAVAAVVAWWLLRPAGVP-EGFAVSNGRIEATEVDIASKIAGRIDTILVK 64
R +A++++G L +A +++ G +GR + I + I+VK
Sbjct: 58 RLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKE----IKPIENSIVKEIIVK 113

Query: 65 EGQFVREGEVLAKMDTRV----------------LQEQRLEAI----------------- 91
EG+ VR+G+VL K+ L++ R + +
Sbjct: 114 EGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDE 173

Query: 92 -------------------AQIKEAQSAVAAAQALLEQRQSETRAAQSLVNQRQAELDSV 132
Q Q+ + L+++++E + +N+ +
Sbjct: 174 PYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVE 233

Query: 133 AKRHMRSRSLAQRGAISAQQLDDDRAAAESARAALESAKAQVSASKAAIEAARTNIIQ-- 190
R SL + AI+ + + A L K+Q+ ++ I +A+
Sbjct: 234 KSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVT 293

Query: 191 -----------AQTRVEAAQATERRIAADID--DSELKAPRDGRV-QYRVAEPGEVLAAG 236
QT T + S ++AP +V Q +V G V+
Sbjct: 294 QLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTA 353

Query: 237 GRVLNMVDLSDVY-MTFFLPTEQAGTLKLGGEARLILDAAPDLRIPATISFVASVAQFTP 295
++ +V D +T + + G + +G A + ++A P R V V
Sbjct: 354 ETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYG---YLVGKVKNINL 410

Query: 296 KTVETSDERLKLMFRVKARIPPELLQQHLEYV--KTGLPGVAWVRVNE 341
+E D+RL L+F V I L + + +G+ A ++
Sbjct: 411 DAIE--DQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGM 456


82SBO_3549SBO_3554N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_35491120.338673resistance protein
SBO_35501130.718191lipase
SBO_35511100.5625003-methyladenine DNA glycosylase
SBO_35521120.349120hypothetical protein
SBO_35530110.274823biotin sulfoxide reductase
SBO_3554-117-1.368976outer membrane lipoprotein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3549TCRTETA424e-06 Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 41.7 bits (98), Expect = 4e-06
Identities = 47/275 (17%), Positives = 94/275 (34%), Gaps = 32/275 (11%)

Query: 44 PVSQVAFSFGLLSLGLAIS----SSVAGKLQERFGVKRVTMASGILLGLGFFLTAHSDNL 99
+ V +G+L A+ + V G L +RFG + V + S + + + A + L
Sbjct: 37 HSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFL 96

Query: 100 MMLWLS---AGVLVGLADGAGYLL----TLSNCVKWFPERKGLISAFAIGSYGLGSLGFK 152
+L++ AG+ AG + + F + LG
Sbjct: 97 WVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGG---- 152

Query: 153 FIDTQLLETVGLEKTFVIWGAIALVMIVFGATLMKDAPKQEVKTSNGVVEKDYTLAESMR 212
L+ F A+ + + G L+ ++ K E + R
Sbjct: 153 -----LMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWAR 207

Query: 213 --KPQYWMLAVMFLTACMSG----LYVIGVAKDIAQSLAHLDVVSAANAVTVISIAN-LS 265
++AV F+ + L+VI + H D + ++ I + L+
Sbjct: 208 GMTVVAALMAVFFIMQLVGQVPAALWVI-----FGEDRFHWDATTIGISLAAFGILHSLA 262

Query: 266 GRLVLGILSDKIARIRVITIGQVISLVGMAALLFA 300
++ G ++ ++ R + +G + G L FA
Sbjct: 263 QAMITGPVAARLGERRALMLGMIADGTGYILLAFA 297



Score = 36.0 bits (83), Expect = 2e-04
Identities = 37/155 (23%), Positives = 64/155 (41%), Gaps = 9/155 (5%)

Query: 241 AQSLAHLDVVSAANAVTVISIANLSGRLVLGILSDKIARIRVITIGQVISLVGMAALLFA 300
AH ++ A A+ + A + G L SD+ R V+ + + V A + A
Sbjct: 39 NDVTAHYGILLALYALMQFACAPVLGAL-----SDRFGRRPVLLVSLAGAAVDYAIMATA 93

Query: 301 PLNAVTFFAAIACVAFNFGGTITVFPSLVSEFFGLNNLAKNYGVIYLGFGIGSIFGSIIA 360
P V + I VA G T V + +++ + A+++G + FG G + G ++
Sbjct: 94 PFLWVLYIGRI--VAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLG 151

Query: 361 SLFGGF--YVTFYVIFALLILSLALSTTIRQPEQK 393
L GGF + F+ AL L+ + K
Sbjct: 152 GLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHK 186


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3550ECOLNEIPORIN280.035 E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 27.8 bits (62), Expect = 0.035
Identities = 19/90 (21%), Positives = 37/90 (41%), Gaps = 13/90 (14%)

Query: 121 SMYNEFGDSTTTLTDPLWHASVSSLGWRVDSRLGDLRPWAQISYNQQFGENIWKAQSGLS 180
S+ + D+ + H S + + + R G++ P ++SY F +
Sbjct: 228 SVAVQQQDAKLV-EENYSHNSQTEVAATLAYRFGNVTP--RVSYAHGFKGSF-------- 276

Query: 181 RMTATNQNGNWLDVTVGADMLLNQNIAAYA 210
ATN N ++ V VGA+ ++ +A
Sbjct: 277 --DATNYNNDYDQVVVGAEYDFSKRTSALV 304


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3552SACTRNSFRASE362e-05 Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 36.1 bits (83), Expect = 2e-05
Identities = 17/52 (32%), Positives = 26/52 (50%), Gaps = 5/52 (9%)

Query: 76 VAPKAVRRGIGKALM----QYVQQRHP-HLMLEVYQKNQPAINFYQAQGFHI 122
VA ++G+G AL+ ++ ++ H LMLE N A +FY F I
Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFII 148


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3554OMPADOMAIN1132e-32 OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 113 bits (285), Expect = 2e-32
Identities = 41/122 (33%), Positives = 62/122 (50%), Gaps = 11/122 (9%)

Query: 108 LNMPNNVTFDSSSATLKPAGANTLTGVAMVLKEY--PKTAVNVIGYTDSTGGHDLNMRLS 165
+ ++V F+ + ATLKP G L + L +V V+GYTD G N LS
Sbjct: 215 FTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLS 274

Query: 166 QQRADSVASALITQGVDASRIRTQGLGPANPIASNSTAEGK---------AQNRRVEITL 216
++RA SV LI++G+ A +I +G+G +NP+ N+ K A +RRVEI +
Sbjct: 275 ERRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEV 334

Query: 217 SP 218

Sbjct: 335 KG 336


83SBO_3696SBO_3709N        Y        YPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_36962194.129152multidrug resistance protein D
SBO_36971183.737323ilvB operon leader peptide
SBO_36980173.057815acetolactate synthase catalytic subunit
SBO_3699-1142.661677acetolactate synthase 1 regulatory subunit
SBO_3700-1132.828760DNA-binding transcriptional activator UhpA
SBO_3701-1122.093409sensory histidine kinase UhpB
SBO_3702-1121.124072regulatory protein UhpC
SBO_3703-1120.669250sugar phosphate antiporter
SBO_3704-1160.070342cryptic adenine deaminase
SBO_3706-120-2.331489IS1 ORF2
SBO_3707121-3.094522IS1 encoded protein
SBO_3708121-3.010261hypothetical protein
SBO_3709020-3.203635ribonucleoside transporter
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3696TCRTETB612e-12 Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 61.0 bits (148), Expect = 2e-12
Identities = 41/184 (22%), Positives = 82/184 (44%), Gaps = 1/184 (0%)

Query: 7 RNVNLLLMLVLLVAVGQMAQTIYIPAIADMARDLNVREGAVQSVMGAYLLTYGVSQLFYG 66
R+ +L+ L +L + + + ++ D+A D N + V A++LT+ + YG
Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70

Query: 67 PISDRVGRRPVILVGMSIFMLATLVA-VTTSSLTVLIAASAMQGMGTGVGGVMARTLPRD 125
+SD++G + ++L G+ I +++ V S ++LI A +QG G + +
Sbjct: 71 KLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVAR 130

Query: 126 LYERTQLRHANSLLNMGILVSPLLAPLIGGLLDTMWNWRACYLFLLVLCVGVTFSMARWM 185
+ A L+ + + + P IGG++ +W L ++ + V F M
Sbjct: 131 YIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLK 190

Query: 186 PETR 189
E R
Sbjct: 191 KEVR 194


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3700HTHFIS612e-13 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 61.4 bits (149), Expect = 2e-13
Identities = 29/174 (16%), Positives = 59/174 (33%), Gaps = 20/174 (11%)

Query: 2 ITVALIDDHLIVRSGFAQLLGLEPDLQVVAEFGSGREALAGLPGRGVQVCICDISMPDIS 61
T+ + DD +R+ Q L V + + + + D+ MPD +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRA-GYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 62 GLELLSQLPK---GMATIMLSVHDSPALVEQALNAGARGFLSKRCSPDELIAAVHTVATG 118
+LL ++ K + +++S ++ +A GA +L K ELI +
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR---- 117

Query: 119 GCYLTPDIAIKLASGRQDPLTKRERQVAEKLAQG---MAVKEIAAELGLSPKTV 169
A+ R L + + + + + A L + T+
Sbjct: 118 --------ALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTL 163


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3701PF06580402e-05 Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 39.8 bits (93), Expect = 2e-05
Identities = 28/142 (19%), Positives = 56/142 (39%), Gaps = 11/142 (7%)

Query: 366 LRPRQLDDLTLEQAIRSLMREMELEGRGIVSHLEWRIDESALSENQRVTLFRVCQEGLNN 425
LR ++L + + ++L L++ + + +V + Q + N
Sbjct: 208 LRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPM-LVQTLVEN 266

Query: 426 IVKHA-----DASAVTLQGWQQDERLMLVIEDDGSGLPPGSGQ-QGFGLTGMRERVTALG 479
+KH + L+G + + + L +E+ GS + + G GL +RER+ L
Sbjct: 267 GIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQMLY 326

Query: 480 G---TLHISCLHG-TRVSVSLP 497
G + +S G V +P
Sbjct: 327 GTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3702TCRTETB411e-05 Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 40.6 bits (95), Expect = 1e-05
Identities = 65/408 (15%), Positives = 137/408 (33%), Gaps = 60/408 (14%)

Query: 30 RHILLTIWLGYALFY--FTRKSFNAAVPEILANGVLSRSDIGLLATLFYITYGVSKFVSG 87
RH + IWL F+ N ++P+I + + + T F +T+ + V G
Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70

Query: 88 IVSDRSNARYFMGIGLIATGIINILFGFSTSLWAFAVLWVLNAFFQGWGS---PVCARLL 144
+SD+ + + G+I +++ S F L ++ F QG G+ P ++
Sbjct: 71 KLSDQLGIKRLLLFGIIINCFGSVIGFVGHS---FFSLLIMARFIQGAGAAAFPALVMVV 127

Query: 145 TAWY-SRTERGGWWALWNTAHNVGGALIPIVMAAAALHYGWRAGMMIAGCMAIVVGIFLC 203
A Y + RG + L + +G + P + A + W ++ M ++ +
Sbjct: 128 VARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHW--SYLLLIPMITIITVPF- 184

Query: 204 WRLRDRPQALGLPAVGEWRHDALEIAQQQEGAGLTRKEILTKYVLLNPYIWLLSFCYVLV 263
+ L +I G L I+ + Y VL
Sbjct: 185 --------LMKLLKKEVRIKGHFDIK----GIILMSVGIVFFMLFTTSYSISFLIVSVLS 232

Query: 264 YVV-----RAAINDWGNLYMSETLGVDLVTANTAVTMFELGGFI-----------GALVA 307
+++ R + + + + + + + + + GF+ A
Sbjct: 233 FLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTA 292

Query: 308 GWGSDKLFNGNRGPMNLIFAAGILL-SVGSLWLMPFASYVMQATCFFTIGFFVFGPQMLI 366
GS +F G + + GIL+ G L+++ + + F T F + +
Sbjct: 293 EIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFL-SVSFLTASFLLETTSWFM 351

Query: 367 ---------GMAAAECS---------HKEAAGAATGFVGLFAYLGASL 396
G++ + ++ AGA + ++L
Sbjct: 352 TIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGT 399


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3703TCRTETB340.001 Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 34.1 bits (78), Expect = 0.001
Identities = 28/168 (16%), Positives = 61/168 (36%), Gaps = 17/168 (10%)

Query: 49 FNIAQNDMISTYGLSMTQLGMIGLGFSITYGVGKTLVSYYADGKNTKQFLPFMLILSAIC 108
N++ D+ + + + F +T+ +G + +D K+ L F +I++ C
Sbjct: 33 LNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIIN--C 90

Query: 109 MLGFSASMGSGSVSLFLMIAFYALSGFFQSTGGSCSYSTI----TKWTPRRKRGTFLGFW 164
+G SL +M + F Q G + + + ++ P+ RG G
Sbjct: 91 FGSVIGFVGHSFFSLLIM------ARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLI 144

Query: 165 NISHNLGGAGAAGVALFGANYLFDGHVIGMFIFPSIIALIVGFIGLRY 212
+G + A+Y+ + + + P + I+ L
Sbjct: 145 GSIVAMGEGVGPAIGGMIAHYIHWSY---LLLIP--MITIITVPFLMK 187


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3704UREASE381e-04 Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 37.8 bits (88), Expect = 1e-04
Identities = 30/105 (28%), Positives = 43/105 (40%), Gaps = 17/105 (16%)

Query: 22 AVSRGDAVADYIIDNVSILDLINGGEISGPIVIKGRYIAGVG-AEYAD---------APA 71
V+R D +I N ILD + G + I +K IA +G A D P
Sbjct: 60 QVTREGGAVDTVITNALILD--HWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPG 117

Query: 72 LQRIDARGATAVPGFIDAHLHIESSMMTPVTFETATLPRGLTTVI 116
+ I G G +D+H+H + P E A L GLT ++
Sbjct: 118 TEVIAGEGKIVTAGGMDSHIH----FICPQQIEEA-LMSGLTCML 157


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3709TCRTETB362e-04 Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 36.0 bits (83), Expect = 2e-04
Identities = 29/163 (17%), Positives = 61/163 (37%), Gaps = 18/163 (11%)

Query: 43 LTPMAQDLGISEG-----VAGQSVTVTAFVAMFASLFITQTIQATDRCYVVILFAVLLTL 97
L +A D +T + A++ L +D+ + L + +
Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKL--------SDQLGIKRLLLFGIII 88

Query: 98 SCL--LVSFAN--SFSLLLIGRACLGLALGGFWAMSASLTMRLVPPRTVPKALSVIFGAV 153
+C ++ F FSLL++ R G F A+ + R +P KA +I V
Sbjct: 89 NCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIV 148

Query: 154 SIALVIAAPLGSFLGELIGWRNVFNAAAVMGVLCIFWIIKSLP 196
++ + +G + I W + + ++ + +++K L
Sbjct: 149 AMGEGVGPAIGGMIAHYIHWSYLLLIPMIT-IITVPFLMKLLK 190


84SBO_3878SBO_3883N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_38782221.666012hypothetical protein
SBO_38791181.712628coproporphyrinogen III oxidase
SBO_38801151.447502nitrogen regulation protein NR(I)
SBO_38811150.531261nitrogen regulation protein NR(II)
SBO_38821150.019358glutamine synthetase
SBO_3883014-0.223565GTP-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3878SECA300.004 SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 30.2 bits (68), Expect = 0.004
Identities = 11/71 (15%), Positives = 30/71 (42%)

Query: 14 AKARRKTREELDQEARDRKRQKKRRGHAPGSRAAGGNTTSGSKGQNAPKDPRIGSKTPIP 73
+K + + EE+++ + R+ + +R ++ + + + ++G P P
Sbjct: 827 SKVQVRMPEEVEELEQQRRMEAERLAQMQQLSHQDDDSAAAAALAAQTGERKVGRNDPCP 886

Query: 74 LGVTEKVTKQH 84
G +K + H
Sbjct: 887 CGSGKKYKQCH 897


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3880HTHFIS6010.0 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 601 bits (1550), Expect = 0.0
Identities = 206/478 (43%), Positives = 300/478 (62%), Gaps = 11/478 (2%)

Query: 1 MQRGIVWVVDDDSSIRWVLERALAGAGLTCTTFENGAEVLEALASKTPDVLLSDIRMPGM 60
M + V DDD++IR VL +AL+ AG N A + +A+ D++++D+ MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 DGLALLKQIKQRHPMLPVIIMTAHSDLDAAVSAYQQGAFDYLPKPFDIDEAVALVERAIS 120
+ LL +IK+ P LPV++M+A + A+ A ++GA+DYLPKPFD+ E + ++ RA++
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 121 HYQEQQQPRNIQLNGPTTDIIGEAPAMQDVFRIIGRLSRSSISVLINGESGTGKELVAHA 180
+ + ++G + AMQ+++R++ RL ++ ++++I GESGTGKELVA A
Sbjct: 121 EPKRRPSKLEDDSQDGM-PLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARA 179

Query: 181 LHRHSPRAKAPFIALNMAAIPKDLIESELFGHEKGAFTGANTIRQGRFEQADGGTLFLDE 240
LH + R PF+A+NMAAIP+DLIESELFGHEKGAFTGA T GRFEQA+GGTLFLDE
Sbjct: 180 LHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDE 239

Query: 241 IGDMPLDVQTRLLRVLADGQFYRVGGYAPVKVDVRIIAATHQNLEQRVQEGKFREDLFHR 300
IGDMP+D QTRLLRVL G++ VGG P++ DVRI+AAT+++L+Q + +G FREDL++R
Sbjct: 240 IGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYR 299

Query: 301 LNVIRVHLPPLRERREDIPRLARHFLQVAARELGVEAKLLHPETEAALTRLAWPGNVRQL 360
LNV+ + LPPLR+R EDIP L RHF+Q A +E G++ K E + WPGNVR+L
Sbjct: 300 LNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKE-GLDVKRFDQEALELMKAHPWPGNVREL 358

Query: 361 ENTCRWLTVMAAGQEVLIQDLPGELFESTVAESTSQMQPDSWATLLAQWADRALRS---- 416
EN R LT + + + + EL + S + ++Q + +R
Sbjct: 359 ENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFAS 418

Query: 417 -----GHQNLLSEAQPELERTLLTTALRHTQGHKQEAARLLGWGRNTLTRKLKELGME 469
L E+E L+ AL T+G++ +AA LLG RNTL +K++ELG+
Sbjct: 419 FGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3881PF06580280.042 Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 28.3 bits (63), Expect = 0.042
Identities = 34/190 (17%), Positives = 72/190 (37%), Gaps = 41/190 (21%)

Query: 171 IIEQADRLRNLVDRL---LGPQLPGTRVTE-SIHKVAERV---VTLVSMELPDNVRLIRD 223
I+E + R ++ L + L + + S+ V + L S++ D ++
Sbjct: 186 ILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQ 245

Query: 224 YDPSLPELAHDPDQIEQVLLN-IVRNALQ---ALGPEGGEIILRTRTAFQLTLHGERYRL 279
+P++ ++ Q+ +L+ +V N ++ A P+GG+I+L+
Sbjct: 246 INPAIMDV-----QVPPMLVQTLVENGIKHGIAQLPQGGKILLKGT------KDNGTVT- 293

Query: 280 AARIDVEDNGPGIPPHLQDTLFYPMVSGREGGTGLGLSIARNLIDQHSGK---IEFTSWP 336
++VE+ G + ++ TG GL R + G I+ +
Sbjct: 294 ---LEVENTGSLALKNTKE------------STGTGLQNVRERLQMLYGTEAQIKLSEKQ 338

Query: 337 GHTEFSVYLP 346
G V +P
Sbjct: 339 GKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_3883TCRTETOQM1804e-51 Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 180 bits (458), Expect = 4e-51
Identities = 97/445 (21%), Positives = 170/445 (38%), Gaps = 81/445 (18%)

Query: 4 KLRNIAIIAHVDHGKTTLVDKLLQQSGTFDSRAETQE--RVMDSNDLEKERGITILAKNT 61
K+ NI ++AHVD GKTTL + LL SG + D+ LE++RGITI T
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61

Query: 62 AIKWNDYRINIVDTPGHADFGGEVERVMSMVDSVLLVVDAFDGPMPQTRFVTKKAFAYGL 121
+ +W + ++NI+DTPGH DF EV R +S++D +L++ A DG QTR + G+
Sbjct: 62 SFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGI 121

Query: 122 KPIVVINKVDRPGARPDWVVDQVFD-------------LFVNLDATDEQLD--------- 159
I INK+D+ G V + + L+ N+ T+
Sbjct: 122 PTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEG 181

Query: 160 --------------------------------FPIVYASALNGIAGLDHEDMAEDMTPLY 187
FP+ + SA N I G+D+ L
Sbjct: 182 NDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNI-GIDN---------LI 231

Query: 188 QAIVDHVPAPDVDLDGPFQMQISQLDYNSYVGVIGIGRIKRGKVKPNQQVTIIDSEGKTR 247
+ I + + ++ +++Y+ + R+ G + V I + E
Sbjct: 232 EVITNKFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKI-- 289

Query: 248 NAKVGKVLGHLGLERIETDLAEAGDIVAITGLGELNISDTVCDTQNVEALPALSVDEPTV 307
K+ ++ + E + D A +G+IV + L ++ + DT+ + + P +
Sbjct: 290 --KITEMYTSINGELCKIDKAYSGEIVILQNEF-LKLNSVLGDTKLLPQRERIENPLPLL 346

Query: 308 SMFFCVNTSPFCGKEGKFVTSRQILDRLNKELVHNVALRVEETEDADAFRVSGRGELHLS 367
+ + D L LR +S G++ +
Sbjct: 347 QTTVEPSKPQQREMLLDALLEISDSDPL---------LRYYVDSATHEIILSFLGKVQME 397

Query: 368 VLIENMRRE-GFELAVSRPKVIFRE 391
V ++ + E+ + P VI+ E
Sbjct: 398 VTCALLQEKYHVEIEIKEPTVIYME 422



Score = 32.5 bits (74), Expect = 0.005
Identities = 13/75 (17%), Positives = 29/75 (38%), Gaps = 1/75 (1%)

Query: 398 EPYENVTLDVEEQHQGSVMQALGERKGDLKNMNPDGKGRVRLDYVIPSRGLIGFRSEFMT 457
EPY + + +++ + ++ + V L IP+R + +RS+
Sbjct: 537 EPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKN-NEVILSGEIPARCIQEYRSDLTF 595

Query: 458 MTSGTGLLYSTFSHY 472
T+G + + Y
Sbjct: 596 FTNGRSVCLTELKGY 610


85SBO_4021SBO_4032N        Y        YPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_40210132.901138transcriptional regulator HU subunit alpha
SBO_40220143.391608hypothetical protein
SBO_40230163.500230zinc resistance protein
SBO_40240152.881841sensor protein ZraS
SBO_40250152.185424response regulator of hydrogenase 3 activity
SBO_40260131.477770phosphoribosylamine--glycine ligase
SBO_4027-1110.575206bifunctional
SBO_4032-210-0.193142*hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_4021DNABINDINGHU1202e-39 Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 120 bits (302), Expect = 2e-39
Identities = 50/89 (56%), Positives = 66/89 (74%)

Query: 2 NKTQLIDVIAEKAELSKTQAKAALESTLAAITESLKEGDAVQLVGFGTFKVNHRAERTGR 61
NK LI +AE EL+K + AA+++ +A++ L +G+ VQL+GFG F+V RA R GR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 62 NPQTGKEIKIAAANVPAFVSGKALKDAVK 90
NPQTG+EIKI A+ VPAF +GKALKDAVK
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAVK 91


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_4024PF06580362e-04 Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 36.4 bits (84), Expect = 2e-04
Identities = 49/262 (18%), Positives = 104/262 (39%), Gaps = 43/262 (16%)

Query: 197 ILFALATVLLA-SVLSFFW-YRRYLRSRQLLQDEMKRKEKLVALGHLAAGV-AHEIRNPL 253
I+F + V S+L F W + + + ++ Q +M + L L A + H + N L
Sbjct: 120 IIFNVVVVTFMWSLLYFGWHFFKNYKQAEIDQWKMASMAQEAQLMALKAQINPHFMFNAL 179

Query: 254 SSIKGLAKYFAERAPAGGEAHQLAQVM---AKEADRLNRVVSELLELVKPTHLALQAVDL 310
++I+ L +A L+++M + ++ +++ L +V ++L L ++
Sbjct: 180 NNIRALILEDPTKAREM--LTSLSELMRYSLRYSNARQVSLADELTVVD-SYLQLASIQF 236

Query: 311 NTLINHSLQLVSQDANSREIQLRFTANDTLPEIQADPDRLTQVLL-NLYLNAIQAIVQHG 369
+ Q+ + ++Q+ P L Q L+ N + I + Q G
Sbjct: 237 EDRLQFENQI---NPAIMDVQV--------------PPMLVQTLVENGIKHGIAQLPQGG 279

Query: 370 VISVTVSESGAGVKISVTDSGKGIAADQLEAIFTPYFTTKAEGTGLGLAVVHNIVEQHGG 429
I + ++ V + V ++G + E TG GL V ++ G
Sbjct: 280 KILLKGTKDNGTVTLEVENTGSLALKNT------------KESTGTGLQNVRERLQMLYG 327

Query: 430 ---TIQVASQEGKGSTFTLWLP 448
I+++ ++GK + +P
Sbjct: 328 TEAQIKLSEKQGKV-NAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_4025HTHFIS498e-178 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 498 bits (1284), Expect = e-178
Identities = 163/369 (44%), Positives = 224/369 (60%), Gaps = 1/369 (0%)

Query: 8 ILVVDDDISHCTILQALLRGWGYNVALANSGRQALEQVREQVFDLVLCDVRMAEMDGIAT 67
ILV DDD + T+L L GY+V + ++ + DLV+ DV M + +
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 68 LKEIKALNPAIPVLIMTAYSSVETAVEALKTGALDYLIKPLDFDNLQATLEKALAHTHSI 127
L IK P +PVL+M+A ++ TA++A + GA DYL KP D L + +ALA
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRR 125

Query: 128 DAETPAVTASQFGMVGKSPAMQHLLSEIALVAPSEATVLIHGDSGTGKELVARAIHASSA 187
++ + +VG+S AMQ + +A + ++ T++I G+SGTGKELVARA+H
Sbjct: 126 PSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGK 185

Query: 188 RSEKPLVTLNCAALNESLLESELFGHEKGAFTGADKRREGRFVEADGGTLFLDEIGDISP 247
R P V +N AA+ L+ESELFGHEKGAFTGA R GRF +A+GGTLFLDEIGD+
Sbjct: 186 RRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPM 245

Query: 248 MMQVRLLRAIQEREVQRVGSNQTISVDVRLIAATHRDLAAEVNAGRFRQDLYYRLNVVAI 307
Q RLLR +Q+ E VG I DVR++AAT++DL +N G FR+DLYYRLNVV +
Sbjct: 246 DAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPL 305

Query: 308 EVPSLRQRREDIPLLAGHFLQRFAERNRKAVKGFTPQAMDLLIHYDWPGNIRELENAVER 367
+P LR R EDIP L HF+Q+ + VK F +A++L+ + WPGN+RELEN V R
Sbjct: 306 RLPPLRDRAEDIPDLVRHFVQQAEKEGLD-VKRFDQEALELMKAHPWPGNVRELENLVRR 364

Query: 368 AVVLLPGNI 376
L P ++
Sbjct: 365 LTALYPQDV 373


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_4032SACTRNSFRASE314e-04 Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 30.7 bits (69), Expect = 4e-04
Identities = 15/54 (27%), Positives = 20/54 (37%), Gaps = 5/54 (9%)

Query: 27 IDPDVCGCGVGRMLVEHALSMAPE-----LTTNVNEQNEQAVGFYKKVGFKVTG 75
+ D GVG L+ A+ A E L + N A FY K F +
Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGA 150


86SBO_4131SBO_4140N        Y        YPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SBO_4131-214-0.914028phosphonate/organophosphate ester transporter
SBO_4132-315-1.483741hypothetical protein
SBO_4133-214-1.217608hypothetical protein
SBO_4134-213-1.377632IS1 encoded protein
SBO_4135013-0.555941IS1 ORF2
SBO_4138013-1.756698proline/glycine betaine transporter
SBO_4139-116-1.034021sensor protein BasS/PmrB
SBO_4140015-1.591305DNA-binding transcriptional regulator BasR
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_4131PF05272290.020 Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.3 bits (65), Expect = 0.020
Identities = 12/22 (54%), Positives = 13/22 (59%)

Query: 32 MVALLGPSGSGKSTLLRHLSGL 53
V L G G GKSTL+ L GL
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGL 619


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_4138TCRTETA432e-06 Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 42.9 bits (101), Expect = 2e-06
Identities = 57/290 (19%), Positives = 105/290 (36%), Gaps = 55/290 (18%)

Query: 85 FFGMLGDKYGRQKILAITIVIMSISTFCIGLIPSYDTIGIWAPILLLICKMAQGFSVGGE 144
G L D++GR+ +L +++ ++ + P +W +L I ++ G + G
Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPF-----LW---VLYIGRIVAGIT-GAT 112

Query: 145 YTGASIFVAEYSPDRKR----GFMGSWLDFGSIAGFVLGAGVVVLISTIVGEANFLDWGW 200
A ++A+ + +R GFM + FG +AG VLG G++ S
Sbjct: 113 GAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLG-GLMGGFSP------------ 159

Query: 201 RIPFFIALPLGIIGLYLRHALEETPAFQQHVDKLEQGDREGLQDGPKVSFKEIATKYWRS 260
PFF A L + L K E+ P SF+ W
Sbjct: 160 HAPFFAAAALNGLNFLTGCFLLPESH------KGERRPLRREALNPLASFR------WAR 207

Query: 261 LLTCIGLVIATNVTYYML----LTYMPSYLSHNLHYS-EDHGVLIIIAIMIGMLFVQPVM 315
+T + ++A ++ + H+ G+ + ++ L +
Sbjct: 208 GMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMIT 267

Query: 316 GLLSDRFGRRPFVLLG----SVALFVLA--------IPAFILINSNVIGL 353
G ++ R G R ++LG +LA P +L+ S IG+
Sbjct: 268 GPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGM 317



Score = 39.4 bits (92), Expect = 3e-05
Identities = 39/164 (23%), Positives = 73/164 (44%), Gaps = 16/164 (9%)

Query: 286 LSHNLHYSEDHGVLI-IIAIMIGMLFVQPVMGLLSDRFGRRPFVLLGSVALFVLAIPAFI 344
L H+ + +G+L+ + A+M PV+G LSDRFGRRP +L+ L A+ I
Sbjct: 35 LVHSNDVTAHYGILLALYALM--QFACAPVLGALSDRFGRRPVLLVS---LAGAAVDYAI 89

Query: 345 LINSNVIGLIFAGLLMLAVILNCFTGVMASTLPAMFPTHIR---YSALAAAFNISVLVAG 401
+ + + +++ G ++A I V + + + R + ++A F +VAG
Sbjct: 90 MATAPFLWVLYIG-RIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFG-MVAG 147

Query: 402 LTPTLAAWLVESSQNLMMPAYYLMVVAVIGLITG-VTMKETANR 444
P L + S + P + + + +TG + E+
Sbjct: 148 --PVLGGLMGGFSPH--APFFAAAALNGLNFLTGCFLLPESHKG 187


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_4139PF06580371e-04 Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 36.8 bits (85), Expect = 1e-04
Identities = 40/182 (21%), Positives = 80/182 (43%), Gaps = 34/182 (18%)

Query: 184 ARLDQMMESVSQLLQLARAGQSFSSGNYQHVKLLEDV-ILPSYDELSTML--DQRQQTLL 240
+ +M+ S+S+L++ S N + V L +++ ++ SY +L+++ D+ Q
Sbjct: 191 TKAREMLTSLSELMR-----YSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQ 245

Query: 241 LPESAADITVQGDATLLRMLLRNLVENAHRY----SPQGSNIMIKLQEDGGAV-MAVEDE 295
+ + D+ V ML++ LVEN ++ PQG I++K +D G V + VE+
Sbjct: 246 INPAIMDVQV------PPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENT 299

Query: 296 GPGIDESKCGELSKAFVRMDSRYGGIGLGLSIV-SRITQLHHGQFFLQNRQETSGTRAWV 354
G + + G GL V R+ L+ + ++ ++ A V
Sbjct: 300 GSLA--------------LKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMV 345

Query: 355 RL 356
+
Sbjct: 346 LI 347


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SBO_4140HTHFIS912e-23 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 91.0 bits (226), Expect = 2e-23
Identities = 41/121 (33%), Positives = 60/121 (49%)

Query: 2 KILIVEDDTLLLQGLILAAQTEGYACDGVTTARMAEQSLEAGHYSLVVLDLGLPDEDGLH 61
IL+ +DD + L A GY + A + + AG LVV D+ +PDE+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 FLARIRQKKYTLPVLILTARDTLTDKIAGLDVGADDYLVKPFALEELHARIRALLRRHNN 121
L RI++ + LPVL+++A++T I + GA DYL KPF L EL I L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 122 Q 122
+
Sbjct: 125 R 125



 
Contact Sachin Pundhir for Bugs/Comments.
For best view 1024 x 768 resolution & IE 6.0 or above recommended.