PredictBias

Identification of genomic and pathogenicity islands in prokaryotic genomes
 
A) Input parameters
Genome: 2473.gbk
Threshold dinucleotide bias: 2
Threshold codon bias: 4
Threshold %GC bias: 3
E-value (RPSBlast): 0.05
Genome (non-pathogenic): (blank)
 
 
C) Potential GIs and PAIs in NC_003485
S.No  Start  End  Bias  Virulence  Insertion elements  Prediction
1  spyM18_0123  spyM18_0129  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
spyM18_0123  1  16  -3.581511  heat shock protein 33
spyM18_0125  3  19  -4.551479  hypothetical protein
spyM18_0126  1  17  -4.287048  collagen binding protein
spyM18_0127  1  16  -4.204914  hypothetical protein
spyM18_0128  1  15  -4.352361  hypothetical protein
spyM18_0129  0  12  -3.075573  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_0125  PF08280  284  2e-96  M protein trans-acting positive regulator
		>PF08280#M protein trans-acting positive regulator

Length = 530

Score = 284 bits (729), Expect = 2e-96
Identities = 122/213 (57%), Positives = 163/213 (76%)

Query: 5 KKKKDSFLVETYLEQSIRDKSELVLLLFKSPTIIFSHVAKQTGLTAVQLKYYCKELDDFF 64
+K+ L+E YLE SI K +LV+L FK+ ++ + VA++TGLT +QL +YC+EL+ FF
Sbjct: 27 RKRGPLSLIEKYLESSIESKCQLVVLFFKTSSLPITEVAEKTGLTFLQLNHYCEELNAFF 86

Query: 65 GNNLDITIKKGKIICCFVKPVKEFYLHQLYDTSTILKLLVFFIKNGTTSQPLIKFSKKYF 124
++L +TI+K I C F P KE YL+QLY +S +L+LL F IKNG+ S+PL F++ +F
Sbjct: 87 PDSLSMTIQKRMISCQFTHPSKETYLYQLYASSNVLQLLAFLIKNGSHSRPLTDFARSHF 146

Query: 125 LSSSSAYRLRESLIKLLREFGLRVSKNTIVGEEYRIRYLIAMLYSKFGIVIYPLDHLDNQ 184
LS+SSAYR+RE+LI LLR F L++SKN IVGEEYRIRYLIA+LYSKFGI +Y L D
Sbjct: 147 LSNSSAYRMREALIPLLRNFELKLSKNKIVGEEYRIRYLIALLYSKFGIKVYDLTQQDKN 206

Query: 185 IIYRFLSQSATNLRTSPWLEEPFSFYNMLLALS 217
II+ FLS S+T+L+TSPWL E FSFY++LLALS
Sbjct: 207 IIHSFLSHSSTHLKTSPWLSESFSFYDILLALS 239
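
The statistics lines in each alignment block ("Score = ...", "Expect = ...", "Identities = ...") can be read programmatically and compared against the E-value (RPSBlast) setting of 0.05 listed under the input parameters. The following is a minimal sketch, not PredictBias code: the function names `parse_hit_stats` and `passes_cutoff` are hypothetical, and treating the cutoff as inclusive is an assumption.

```python
import re

# Statistics lines as they appear in the alignment blocks of this report.
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*([-+.\deE]+)"
)
IDENT_RE = re.compile(r"Identities\s*=\s*(\d+)/(\d+)")

# E-value (RPSBlast) from the input parameters. Whether the tool applies it
# inclusively or exclusively is not stated in the report (assumption here).
E_VALUE_CUTOFF = 0.05


def parse_hit_stats(block):
    """Return bit score, raw score, E-value and identity of the first HSP in a block."""
    score = SCORE_RE.search(block)
    ident = IDENT_RE.search(block)
    if score is None or ident is None:
        raise ValueError("no Score/Identities lines found in block")
    expect = score.group(3)
    if expect.startswith("e"):
        # Defensive: very small E-values are sometimes printed as "e-100".
        expect = "1" + expect
    matched, aligned = int(ident.group(1)), int(ident.group(2))
    return {
        "bits": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "evalue": float(expect),
        "identity": matched / aligned,
    }


def passes_cutoff(stats, cutoff=E_VALUE_CUTOFF):
    """True if the hit's E-value is at or below the cutoff (inclusive by assumption)."""
    return stats["evalue"] <= cutoff


if __name__ == "__main__":
    example = (
        "Score = 284 bits (729), Expect = 2e-96\n"
        "Identities = 122/213 (57%), Positives = 163/213 (76%)\n"
    )
    stats = parse_hit_stats(example)
    print(stats, passes_cutoff(stats))
```

When a block reports several HSPs for the same hit, this sketch reads only the first Score/Identities pair; iterating with `SCORE_RE.finditer` would be one way to collect all of them.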


2  spyM18_0140  spyM18_0150  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
spyM18_0140  2  14  -1.139565  hypothetical protein
spyM18_0141  3  14  -0.814919  hypothetical protein
spyM18_0142  2  13  -0.846299  regulatory protein
spyM18_0143  3  15  -1.127117  hypothetical protein
spyM18_0144  0  11  1.220916  V-type ATP synthase subunit I
spyM18_0145  -3  12  3.017626  V-type ATP synthase subunit K
spyM18_0146  -3  11  2.720725  V-type Na+ -ATPase subunit E
spyM18_0147  -3  12  2.893775  V-type Na+ -ATPase subunit C
spyM18_0148  -3  11  2.869248  V-type ATP synthase subunit F
spyM18_0150  -2  10  3.193464  V-type ATP synthase subunit A
3  spyM18_0165  spyM18_0173  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
spyM18_0165  0  17  3.127293  streptolysin O
spyM18_0166  0  25  4.535760  hypothetical protein
spyM18_0167  0  25  4.557714  hypothetical protein
spyM18_0168  0  24  4.557578  hypothetical protein
spyM18_0170  0  24  5.094107  cystathionine beta-lyase
spyM18_0171  1  27  5.257022  leucyl-tRNA synthetase
spyM18_0173  1  22  4.555937  PTS system ascorbate-specific transporter
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_0165  TACYTOLYSIN  888  0.0  Bacterial thiol-activated pore-forming cytolysin sig...
		>TACYTOLYSIN#Bacterial thiol-activated pore-forming cytolysin

signature.
Length = 574

Score = 888 bits (2297), Expect = 0.0
Identities = 565/574 (98%), Positives = 570/574 (99%)

Query: 1 MKDMSNKKTFKKYSRVAGLLTAALIIGNLVTANAESNKQNTASTETTTTNEQPKPESSEL 60
MKDMSNKK FKKYSRVAGLLTAALI+GNLVTANA+SNKQNTA+TETTTTNEQPKPESSEL
Sbjct: 1 MKDMSNKKIFKKYSRVAGLLTAALIVGNLVTANADSNKQNTANTETTTTNEQPKPESSEL 60

Query: 61 TTEKAGQKTDDMLNSNDMIKLAPKEMPLESAEKEEKKSEDKKKSEEDHTEEINDKIYSLN 120
TTEKAGQK DDMLNSNDMIKLAPKEMPLESAEKEEKKSED KKSEEDHTEEINDKIYSLN
Sbjct: 61 TTEKAGQKMDDMLNSNDMIKLAPKEMPLESAEKEEKKSEDNKKSEEDHTEEINDKIYSLN 120

Query: 121 YNELEVLAKNGETIENFVPKEGVKKADKFIVIERKKKNINTTPVDISIIDSVTDRTYPAA 180
YNELEVLAKNGETIENFVPKEGVKKADKFIVIERKKKNINTTPVDISIIDSVTDRTYPAA
Sbjct: 121 YNELEVLAKNGETIENFVPKEGVKKADKFIVIERKKKNINTTPVDISIIDSVTDRTYPAA 180

Query: 181 LQLANKGFTENKPDAVVTKRNPQKIHIDLPGMGDKATVEVNDPTYANVSTAIDNLVNQWH 240
LQLANKGFTENKPDAVVTKRNPQKIHIDLPGMGDKATVEVNDPTYANVSTAIDNLVNQWH
Sbjct: 181 LQLANKGFTENKPDAVVTKRNPQKIHIDLPGMGDKATVEVNDPTYANVSTAIDNLVNQWH 240

Query: 241 DNYSGGNTLPARTQYTESMVYSKSQIEAALNVNSKILDGTLGIDFKSISKGEKKVMIAAY 300
DNYSGGNTLPARTQYTESMVYSKSQIEAALNVNSKILDGTLGIDFKSISKGEKKVMIAAY
Sbjct: 241 DNYSGGNTLPARTQYTESMVYSKSQIEAALNVNSKILDGTLGIDFKSISKGEKKVMIAAY 300

Query: 301 KQIFYTVSANLPNNPADVFDKSVTFKELQRKGVSNEAPPLFVSNVAYGRTVFVKLETSSK 360
KQIFYTVSANLPNNPADVFDKSVT KELQRKGVSNEAPPLFVSNVAYGRTVFVKLETSSK
Sbjct: 301 KQIFYTVSANLPNNPADVFDKSVTLKELQRKGVSNEAPPLFVSNVAYGRTVFVKLETSSK 360

Query: 361 SNDVEAAFSAALKGTDVKTNGKYSDILENSSFTAVVLGGDAAEHNKVVTKDFDVIRNVIK 420
SNDVEAAFSAALKGTDVKTNGKYSDILENSSFTAVVLGGDAAEHNKVVTKDFDVIRNVIK
Sbjct: 361 SNDVEAAFSAALKGTDVKTNGKYSDILENSSFTAVVLGGDAAEHNKVVTKDFDVIRNVIK 420

Query: 421 DNATFSRKNPAYPISYTSVFLKNNKIAGVNNRTEYVETTSTEYTSGKINLSHRGAYVAQY 480
DNATFSRKNPAYPISYTSVFLKNNKIAGVNNR+EYVETTSTEYTSGKINLSH+GAYVAQY
Sbjct: 421 DNATFSRKNPAYPISYTSVFLKNNKIAGVNNRSEYVETTSTEYTSGKINLSHQGAYVAQY 480

Query: 481 EILWDEINYDDKGKEVITKRRWDNNWYSKTSPFSTVIPLGANSRNIRIMARECTGLAWEW 540
EILWDEINYDDKGKEVITKRRWDNNWYSKTSPFSTVIPLGANSRNIRIMARECTGLAWEW
Sbjct: 481 EILWDEINYDDKGKEVITKRRWDNNWYSKTSPFSTVIPLGANSRNIRIMARECTGLAWEW 540

Query: 541 WRKVIDERDVKLSKEINVNISGSTLSPYGSITYK 574
WRKVIDERDVKLSKEINVNISGSTLSPYGSITYK
Sbjct: 541 WRKVIDERDVKLSKEINVNISGSTLSPYGSITYK 574


4  spyM18_0324  spyM18_0386  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
spyM18_0324  3  15  -1.521867  cytoplasmic membrane protein
spyM18_0325  2  15  -1.457172  heat shock protein HtpX
spyM18_0326  1  15  -1.293163  hypothetical protein
spyM18_0328  2  15  -0.123540  CovR
spyM18_0329  0  13  -1.060347  CovS
spyM18_0330  -1  10  -1.004304  NrdR family transcriptional regulator
spyM18_0331  -1  9  -1.181052  hypothetical protein
spyM18_0332  -1  8  -1.294693  primosomal protein DnaI
spyM18_0334  -1  10  -2.098172  GTP-binding protein EngA
spyM18_0335  -1  12  -3.427141  SNF helicase
spyM18_0336  1  19  -4.056247  integrase
spyM18_0338  1  24  -4.399777  hypothetical protein
spyM18_0339  0  19  -3.954436  phage protein
spyM18_0340  0  21  -4.083553  repressor protein
spyM18_0341  1  20  -3.610817  hypothetical protein
spyM18_0342  3  16  -1.574222  hypothetical protein
spyM18_0343  2  20  -1.987188  hypothetical protein
spyM18_0344  2  23  -0.395334  antirepressor
spyM18_0345  3  27  -0.407978  hypothetical protein
spyM18_0346  4  26  -0.089950  hypothetical protein
spyM18_0348  4  27  0.138721  DNA polymerase III delta prime subunit
spyM18_0349  3  31  0.020445  hypothetical protein
spyM18_0350  3  33  0.679903  phage DNA replication protein
spyM18_0351  3  28  -2.363132  hypothetical protein
spyM18_0352  2  27  -2.980574  hypothetical protein
spyM18_0353  1  27  -2.163715  hypothetical protein
spyM18_0355  0  25  -2.510150  hypothetical protein
spyM18_0356  0  19  -2.732653  hypothetical protein
spyM18_0357  1  16  -3.500154  hypothetical protein
spyM18_0358  2  20  -3.104860  hypothetical protein
spyM18_0359  1  21  -3.068014  single strand binding protein
spyM18_0360  1  21  -3.741283  hypothetical protein
spyM18_0361  3  24  -3.629758  hypothetical protein
spyM18_0362  4  32  -3.424834  hypothetical protein
spyM18_0363  3  31  -3.184362  hypothetical protein
spyM18_0364  0  21  -2.519783  hypothetical protein
spyM18_0365  1  18  -2.411511  hypothetical protein
spyM18_0366  0  17  -1.853495  hypothetical protein
spyM18_0367  0  16  -1.719407  hypothetical protein
spyM18_0368  0  15  -1.935854  hypothetical protein
spyM18_0369  0  16  -1.742147  hypothetical protein
spyM18_0370  1  16  -1.746780  hypothetical protein
spyM18_0371  2  19  -0.611957  hypothetical protein
spyM18_0372  5  20  -1.401962  hypothetical protein
spyM18_0373  4  21  -1.073269  hypothetical protein
spyM18_0374  2  23  -1.335072  hypothetical protein
spyM18_0375  1  16  0.064898  hypothetical protein
spyM18_0376  1  15  0.003436  hypothetical protein
spyM18_0378  0  16  0.561246  hypothetical protein
spyM18_0379  0  16  0.617658  major tail protein
spyM18_0380  2  18  1.327650  hypothetical protein
spyM18_0382  2  18  1.356786  hypothetical protein
spyM18_0383  3  25  1.900608  hypothetical protein
spyM18_0384  3  26  2.257113  hypothetical protein
spyM18_0385  4  26  1.907865  phage hyaluronidase
spyM18_0386  5  28  3.546080  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_0328  HTHFIS  90  5e-23  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 89.9 bits (223), Expect = 5e-23
Identities = 29/116 (25%), Positives = 62/116 (53%), Gaps = 2/116 (1%)

Query: 1 MTK-KILIIEDEKNLARFVSLELQHEGYEVIVEVNGREGLETALEKEFDLILLDLMLPEM 59
MT IL+ +D+ + ++ L GY+V + N + DL++ D+++P+
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 60 DGFEVTRRLQTE-KTTYIMMMTARDSIMDVVAGLDRGADDYIVKPFAIEELLARIR 114
+ F++ R++ +++M+A+++ M + ++GA DY+ KPF + EL+ I
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_0329  PF06580  31  0.007  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 31.4 bits (71), Expect = 0.007
Identities = 18/110 (16%), Positives = 41/110 (37%), Gaps = 27/110 (24%)

Query: 390 LMILIDNAVKYSRKEKKIAINLSVTGKQE---AIVRVQDKGEGISKEDIEHIFERFYRTD 446
+ L++N +K+ + + + G ++ + V++ G K E
Sbjct: 260 VQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE---------- 309

Query: 447 KSRNRTSTQAGLGIGLSILK---QIVDGYHLQMKVESELNEGSVFILHIP 493
G GL ++ Q++ G Q+K+ + + + +L IP
Sbjct: 310 ----------STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVL-IP 348


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_0334  TCRTETOQM  37  1e-04  Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 37.1 bits (86), Expect = 1e-04
Identities = 21/87 (24%), Positives = 40/87 (45%), Gaps = 8/87 (9%)

Query: 36 GVTRDRIYATGEWLNRQFSLIDTGGIDDVDAPFMEQIKHQAQIAMEEADVIVFVVSGKEG 95
G+T + +W N + ++IDT G D F+ ++ ++ D + ++S K+G
Sbjct: 53 GITIQTGITSFQWENTKVNIIDTPGHMD----FLAEVYR----SLSVLDGAILLISAKDG 104

Query: 96 VTDADEYVSKILYRTNTPVILAVNKVD 122
V + L + P I +NK+D
Sbjct: 105 VQAQTRILFHALRKMGIPTIFFINKID 131


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_0361  IGASERPTASE  33  0.002  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 32.7 bits (74), Expect = 0.002
Identities = 30/150 (20%), Positives = 51/150 (34%), Gaps = 21/150 (14%)

Query: 122 KAAVQRAVEQVTVNYDIYEALGSKRNELYAEIEKSLSERLAKESIELVSVTLTDQDAGDE 181
A V A A S+ E AE K S+ + K + T +++ E
Sbjct: 1017 IARVDEAPVP-----PPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKE 1071

Query: 182 -----------IEKAIKDESVKQKQVDSAKQ-----DKEKAKIEAETKQIQAQAEADAQV 225
E A K+ Q K+ +EKAK+E E Q + +
Sbjct: 1072 AKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSP 1131

Query: 226 IKAKGEAESNNTKAASITDNLIKMKEAEAR 255
+ + E + A D + +KE +++
Sbjct: 1132 KQEQSETVQPQAEPARENDPTVNIKEPQSQ 1161


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_0379  PF06872  31  0.002  EspG protein
		>PF06872#EspG protein

Length = 398

Score = 31.2 bits (70), Expect = 0.002
Identities = 15/35 (42%), Positives = 22/35 (62%), Gaps = 3/35 (8%)

Query: 59 RGVGDVKMETEAIDIPFD---VLKKILGYKDGSSS 90
RG+G+ K+ +DIP D +L+ LG KD +SS
Sbjct: 208 RGLGNSKLSLNGVDIPADAQKLLRNTLGLKDTNSS 242


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_0385  PF07212  560  0.0  Hyaluronoglucosaminidase
		>PF07212#Hyaluronoglucosaminidase

Length = 336

Score = 560 bits (1444), Expect = 0.0
Identities = 254/334 (76%), Positives = 285/334 (85%), Gaps = 2/334 (0%)

Query: 1 MTETIPLRVQFKRMTAEEWARSTVILLEGEIGLETDTGYAKFGDGKNRFSKLKYLNKPDL 60
MTETIPLRVQFKRMTAEEW RS VILLE EIG ETDTGYAKFGDGKN+FSKLKYLNKPDL
Sbjct: 1 MTETIPLRVQFKRMTAEEWTRSDVILLESEIGFETDTGYAKFGDGKNQFSKLKYLNKPDL 60

Query: 61 DAFAQKKETDNKIAKLESIKADKDTVYLKAESKKELDKKMNLTGGTMTGQLQFKPN-SHI 119
AFAQK+ET++KI KLES KADK+ VYLKAESK ELDKK+NL GG MTGQLQFKPN S I
Sbjct: 61 GAFAQKEETNSKITKLESSKADKNAVYLKAESKIELDKKLNLKGGVMTGQLQFKPNKSGI 120

Query: 120 KHSSSTGGAINIDMSKSAGAAMVMYTNKDTTDGPLMILRSDKDTFDQSAQFVDYSGKTNA 179
K SSS GGAINIDMSKS GA +V+Y+N DT+DGPLM LR+ K+TF+QSA FVDYSGKTNA
Sbjct: 121 KPSSSVGGAINIDMSKSEGAGVVVYSNNDTSDGPLMSLRTGKETFNQSALFVDYSGKTNA 180

Query: 180 VNIVMRQPSTPNFSSALNITSANEGGSAMQIRGIERALGTLKITHENPNVDAKYDENAAA 239
VNI MRQP+TPNFSSALNITS NE GSAMQIRG+E+ALGTLKITHENPNV+A YDENAAA
Sbjct: 181 VNIAMRQPTTPNFSSALNITSGNENGSAMQIRGVEKALGTLKITHENPNVEANYDENAAA 240

Query: 240 LSIDIVGKRGASGNGTAAQGIFINSSAGTTGKMLRIRNKNKDKFYVNPDGGFHSYADSIV 299
LSIDIV K+ G GTAAQGI+INS++GTTGK+LRIRN DKFYV DGGF++ S +
Sbjct: 241 LSIDIV-KKQKGGKGTAAQGIYINSTSGTTGKLLRIRNLGDDKFYVKHDGGFYAKKTSQI 299

Query: 300 DGNLTVKDPTSGKHAATKDYVDKKFDELKKLIQK 333
DGNL +K+PT+ HAATK YVD + +LK L+
Sbjct: 300 DGNLKLKNPTADDHAATKAYVDSEVKKLKALLMD 333


5  spyM18_0450  spyM18_0482  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
spyM18_0450  2  17  -0.448174  DNA polymerase III subunit delta'
spyM18_0451  1  17  1.098164  hypothetical protein
spyM18_0453  0  19  1.241766  hypothetical protein
spyM18_0455  -1  18  2.187171  DNA replication initiation control protein YabA
spyM18_0456  -1  16  3.181039  hypothetical protein
spyM18_0457  -1  16  2.625073  hypothetical protein
spyM18_0458  -1  16  2.745213  hypothetical protein
spyM18_0459  -1  17  3.022712  arsenate reductase
spyM18_0461  -1  17  3.180934  3'-exo-deoxyribonuclease
spyM18_0462  -1  18  3.098890  L-lactate oxidase
spyM18_0464  -1  21  3.706600  cell envelope proteinase
spyM18_0467  -2  27  5.490283  hypothetical protein
spyM18_0468  -2  26  5.747507  methionyl-tRNA synthetase
spyM18_0470  -1  30  6.066837  ribonucleotide-diphosphate reductase subunit
spyM18_0471  0  29  5.711082  ribonucleotide reductase stimulatory protein
spyM18_0472  -1  28  5.770937  ribonucleotide-diphosphate reductase subunit
spyM18_0476  -1  26  4.694682  hypothetical protein
spyM18_0477  -1  18  2.601572  hypothetical protein
spyM18_0479  -1  19  2.995433  hypothetical protein
spyM18_0480  -1  16  2.903752  hypothetical protein
spyM18_0482  -1  16  3.139560  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_0461  BINARYTOXINB  30  0.012  Binary toxin B family signature.
		>BINARYTOXINB#Binary toxin B family signature.

Length = 764

Score = 30.0 bits (67), Expect = 0.012
Identities = 26/127 (20%), Positives = 40/127 (31%), Gaps = 28/127 (22%)

Query: 91 NTLNPVSTFPEIGAP----TTMDA-EGRIITLEFEDFF-----------VTQVY----TP 130
L P + +P D IT+ + F QVY T
Sbjct: 432 QILAPNNYYPSKNLAPIALNAQDDFSSTPITMNYNQFLELEKTKQLRLDTDQVYGNIATY 491

Query: 131 NAGDGLRRLDDRQIWDHKYADYLTELDAQKP--VLAAGDYNVAHKEIDLANPSS--NRRS 186
N +G R+D W ++ L ++ + D N+ + I NPS
Sbjct: 492 NFENGRVRVDTGSNW----SEVLPQIQETTARIIFNGKDLNLVERRIAAVNPSDPLETTK 547

Query: 187 PGFTDEE 193
P T +E
Sbjct: 548 PDMTLKE 554


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_0464  SUBTILISIN  93  4e-22  Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 92.6 bits (230), Expect = 4e-22
Identities = 42/160 (26%), Positives = 64/160 (40%), Gaps = 24/160 (15%)

Query: 264 DIDWTQTDDDTKYESHGMHVTGIVAGNSKEAAATGERFLGIAPEAQVMFMRVFANDVMGS 323
+ D D HG HV G +A +G+APEA ++ ++V G
Sbjct: 74 EGDPEIFKDY---NGHGTHVAGTIAAT-----ENENGVVGVAPEADLLIIKVLNKQGSGQ 125

Query: 324 AESLFIKAIEDAVALGADVINLSLGTANGAQLSGSKPLMEAIEKAKKAGVSVVVAAGNER 383
+ + I+ I A+ D+I++SLG L EA++KA + + V+ AAGNE
Sbjct: 126 YDWI-IQGIYYAIEQKVDIISMSLGGP-----EDVPELHEAVKKAVASQILVMCAAGNEG 179

Query: 384 VYGSDHDDPLAINPDYGLVGSPSTGRTPTSVAAINSKWVI 423
D+ +G P SV AIN
Sbjct: 180 DGDDRTDE----------LGYPGCYNEVISVGAINFDRHA 209



Score = 79.1 bits (195), Expect = 1e-17
Identities = 36/147 (24%), Positives = 58/147 (39%), Gaps = 18/147 (12%)

Query: 561 FDSVVSKAPSQKGNEMNHFSNWGLTSDGYLKPDITAPGGDIYSTYNDNHYGSQTGTSMAS 620
++ V+S + FSN + D+ APG DI ST Y + +GTSMA+
Sbjct: 194 YNEVISVGAINFDRHASEFSNSNN------EVDLVAPGEDILSTVPGGKYATFSGTSMAT 247

Query: 621 PQIAGASLLVKQ-YLEKTQPNLPKEKIADIVKNLLMSNAQIHVNPETKTTTSPRQQGAGL 679
P +AGA L+KQ + +L + L+ SP+ +G GL
Sbjct: 248 PHVAGALALIKQLANASFERDL----TEPELYAQLIKRT-------IPLGNSPKMEGNGL 296

Query: 680 LNIDGAVTSGLYVTGKDNYGSISLGNI 706
L + + G +S ++
Sbjct: 297 LYLTAVEELSRIFDTQRVAGILSTASL 323



Score = 40.6 bits (95), Expect = 4e-05
Identities = 11/34 (32%), Positives = 18/34 (52%), Gaps = 1/34 (2%)

Query: 127 HDWVKTKGAWDKGYKGQGKVVAVIDTGIDPAHQS 160
+ ++ W++ G+G VAV+DTG D H
Sbjct: 26 VEMIQAPAVWNQTR-GRGVKVAVLDTGCDADHPD 58


6  spyM18_0518  spyM18_0544  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
spyM18_0518  2  20  -3.749463  hypothetical protein
spyM18_0520  1  17  -4.999497  hypothetical protein
spyM18_0536  1  16  -4.099515  IS1562 transposase
spyM18_0538  1  17  -5.745974  response regulator
spyM18_0539  0  17  -5.417566  histidine kinase
spyM18_0540  0  17  -3.185393  hypothetical protein
spyM18_0542  0  15  -1.260373  hypothetical protein
spyM18_0543  2  20  0.094357  transport ATP-binding protein
spyM18_0544  2  23  2.770525  BlpM-like protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_0538  HTHFIS  38  4e-05  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 37.5 bits (87), Expect = 4e-05
Identities = 23/137 (16%), Positives = 49/137 (35%), Gaps = 10/137 (7%)

Query: 3 IFVLEDDFLHQTRIEKIIYKILTDNKLEVNHLEVYGKPNQLLEDISERGRHQLFFLDIDI 62
I V +DD I ++ + L+ +V + L I+ G L D+ +
Sbjct: 6 ILVADDD----AAIRTVLNQALSRAGYDV---RITSNAATLWRWIA-AGDGDLVVTDVVM 57

Query: 63 KGEDKKGMEIAVEIRNRDPHAVIVFVTTHSEFMPVSFQYQVSALDFIDKELPEELFSHRI 122
E+ ++ I+ P ++ ++ + FM + A D++ K I
Sbjct: 58 PDEN--AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGII 115

Query: 123 EKAITYVQDNQGKTLAE 139
+A+ + K +
Sbjct: 116 GRALAEPKRRPSKLEDD 132


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_0542  RTXTOXIND  42  5e-07  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 42.1 bits (99), Expect = 5e-07
Identities = 24/103 (23%), Positives = 50/103 (48%), Gaps = 9/103 (8%)

Query: 28 VLLVSFLVLF---SLFAKKEITITSQGEMTP---TKVIASVQSTSDHTIVVNNLKNNKFI 81
++ FLV+ S+ + EI T+ G++T +K I ++++ I+V K + +
Sbjct: 62 YFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIV---KEGESV 118

Query: 82 KKGDVIIQYSKTMENSQKKALEKQLATLNKQKNGLQILKTSLE 124
+KGDV+++ + + + L ++ QIL S+E
Sbjct: 119 RKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIE 161


7  spyM18_0583  spyM18_0632  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
spyM18_0583  2  13  2.576887  DhaKLM operon coactivator DhaQ
spyM18_0584  1  14  2.637445  hypothetical protein
spyM18_0585  2  14  3.923802  dihydroxyacetone kinase subunit DhaK
spyM18_0586  2  16  2.775718  dihydroxyacetone kinase
spyM18_0587  2  14  1.763409  phosphotransferase mannose-specific family
spyM18_0588  0  13  1.420088  glycerol uptake facilitator
spyM18_0589  -1  12  0.366565  hypothetical protein
spyM18_0590  -1  12  -0.109790  acetyl-CoA c-acetyltransferase
spyM18_0592  -1  11  -1.365187  hypothetical protein
spyM18_0593  1  13  -1.786709  hypothetical protein
spyM18_0594  0  12  -2.417301  two-component response regulator
spyM18_0595  0  14  -3.681579  two-component sensor histidine kinase
spyM18_0596  1  16  -5.091086  hypothetical protein
spyM18_0597  1  18  -5.492610  ribonuclease III
spyM18_0598  1  19  -6.215753  chromosome segregation SMC
spyM18_0599  4  25  -9.253093  positive regulator
spyM18_0600  4  29  -9.701104  shikimate 5-dehydrogenase
spyM18_0601  3  28  -9.535687  hypothetical protein
spyM18_0602  4  26  -8.906829  hypothetical protein
spyM18_0603  4  26  -9.383191  hypothetical protein
spyM18_0604  3  27  -9.402499  S-adenosylmethionine synthetase
spyM18_0605  2  27  -9.390628  hypothetical protein
spyM18_0606  3  22  -7.968550  hypothetical protein
spyM18_0607  4  20  -6.588802  hypothetical protein
spyM18_0608  3  19  -6.439113  UDP-glucose 6-dehydrogenase
spyM18_0609  2  17  -4.199628  efflux protein
spyM18_0610  3  21  -2.271782  hypothetical protein
spyM18_0611  3  21  -1.712663  hypothetical protein
spyM18_0612  2  22  -1.526902  hypothetical protein
spyM18_0613  3  25  0.120291  hypothetical protein
spyM18_0615  3  27  1.199266  hypothetical protein
spyM18_0616  4  26  0.094283  hypothetical protein
spyM18_0617  5  30  -1.721723  hypothetical protein
spyM18_0618  6  30  -3.685500  hypothetical protein
spyM18_0620  8  29  -4.628656  phage portal protein
spyM18_0621  9  30  -6.419173  hypothetical protein
spyM18_0622  7  33  -8.490117  hypothetical protein
spyM18_0624  7  34  -8.868532  asparagine synthetase A
spyM18_0626  6  27  -8.703608  hypothetical protein
spyM18_0630  4  26  -8.007021  immunity protein
spyM18_0631  0  21  -6.191132  hypothetical protein
spyM18_0632  0  18  -3.631459  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_0584  HTHTETR  40  2e-06  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 40.0 bits (93), Expect = 2e-06
Identities = 19/87 (21%), Positives = 32/87 (36%), Gaps = 6/87 (6%)

Query: 7 TKKKIAKAFKKQLAVKSFEKISVVDIMDQAQIRRQTFYNHFLDKYELLDWIFETE---LQ 63
T++ I + + + S+ +I A + R Y HF DK +L I+E +
Sbjct: 12 TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIG 71

Query: 64 EQVTHNLNYISGS---QLLDELLFYFE 87
E G L + L+ E
Sbjct: 72 ELELEYQAKFPGDPLSVLREILIHVLE 98


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_0594  HTHFIS  92  1e-23  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 91.8 bits (228), Expect = 1e-23
Identities = 29/133 (21%), Positives = 65/133 (48%), Gaps = 1/133 (0%)

Query: 3 KILIVDDEKPISDIIKFNLTKEGYDIVTAFDGREAVTIFEEEKPDLIILDLMLPELDGLE 62
IL+ DD+ I ++ L++ GYD+ + DL++ D+++P+ + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 63 VAKEIRKT-SHVPIIMLSAKDSEFDKVIGLEIGADDYVTKPFSNRELLARVKAHLRRTET 121
+ I+K +P++++SA+++ + E GA DY+ KPF EL+ + L +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 122 IETAVAEENASSG 134
+ + +++
Sbjct: 125 RPSKLEDDSQDGM 137


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_0595  PF06580  44  5e-07  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 44.5 bits (105), Expect = 5e-07
Identities = 30/187 (16%), Positives = 72/187 (38%), Gaps = 34/187 (18%)

Query: 253 DETNRMMRMISDLL--NLSRIDNQVTQLAVEMTNFTAFITSILNRFDLVKNQHTGTGKVY 310
+ M+ +S+L+ +L + + LA E+T +++ +F +++ ++
Sbjct: 191 TKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQF---EDRLQFENQIN 247

Query: 311 EIVRDYPITSVWLEIDNDKMTQVIENILNNAIKYSPDGGKITVRMKTTDTQLIISISDQG 370
+ D + + ++ ++EN + + I P GGKI ++ + + + + + G
Sbjct: 248 PAIMDVQVPPMLVQT-------LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTG 300

Query: 371 LGIPKTDLPLIFDRFYRVDKARSRAQGGTGLGLAIAKEIIKQHHGF---IWAKSDYGKGS 427
K + TG GL +E ++ +G I GK
Sbjct: 301 SLALKNT------------------KESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKV- 341

Query: 428 TFTIVLP 434
+++P
Sbjct: 342 NAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_0598  GPOSANCHOR  52  9e-09  Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 52.4 bits (125), Expect = 9e-09
Identities = 52/351 (14%), Positives = 117/351 (33%), Gaps = 13/351 (3%)

Query: 151 NSKPEERRAIFEEAAGVLKYKTRKKETQI-----KLNQTQDNLDRLEDIIYELDTQLAPL 205
N+ + + + LK + ++ KL + +L I EL+ + A L
Sbjct: 66 NNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADL 125

Query: 206 EKQAKVAKQFLELDANRKQLQLDILVKDIDIAQERQTKDTEALAALQQDLASYYAKRQSM 265
EK + + ++ L + R+ +AL + AK +++
Sbjct: 126 EKA----LEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTL 181

Query: 266 EEDYQKFKQKKQVLSQESDQTQTTLLELTKLIADLEKQIELVKLESGQEAEKKAEAKKHL 325
E + + ++ L + + + I LE + + + A
Sbjct: 182 EAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFS 241

Query: 326 EQLQEQLDGFQAEEKQRTEQLLHIDQQLCDVKQQLNELSNALERFSSDPDQLMETLREEF 385
++ +AE+ + +++ L S ++ ++ L +
Sbjct: 242 TADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKAD-- 299

Query: 386 VLLMQKEAALSNQLTALKAHLDKEKQARQHKAQEYQLLVTKLDQLNDESQKAQAHYKAQK 445
L + L+ +L+ LD ++A++ E+Q L + Q + A +
Sbjct: 300 --LEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASR 357

Query: 446 EQVEMLLQNYQKGDKRVQELERDYQLNQERLFDLLDQKKGKEARKASLESI 496
E + L +QK +++ + E Q + L + KK E S
Sbjct: 358 EAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQVEKALEEANSK 408


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_0609  TCRTETA  39  4e-05  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 38.7 bits (90), Expect = 4e-05
Identities = 28/141 (19%), Positives = 59/141 (41%), Gaps = 13/141 (9%)

Query: 52 SVIGVLFNLFGGVIADSFKR----KKIIITTNILCGTACLVLSFLTKEQWLVYAIVLTNV 107
+ G+L +L +I ++ ++ I GT ++L+F T W+ + I+ V
Sbjct: 253 AAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFAT-RGWMAFPIM---V 308

Query: 108 ILAFMSAFSSPSYKAFTKEIVKKDSISQLNSLLETTSTVIKVTVPMVAIFLYKLLGIHGV 167
+LA P+ +A V ++ QL L +++ + P++ +Y +
Sbjct: 309 LLAS-GGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYA----ASI 363

Query: 168 LLLDGLSFLIAALLISFILPV 188
+G +++ A L LP
Sbjct: 364 TTWNGWAWIAGAALYLLCLPA 384


8  spyM18_0715  spyM18_0771  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
spyM18_0715  -1  18  -4.798442  hypothetical protein
spyM18_0716  1  20  -6.253658  bacteriophage 370.1 integrase
spyM18_0717  0  18  -3.165504  hypothetical protein
spyM18_0718  0  18  -2.696075  repressor protein
spyM18_0719  1  21  -1.438822  hypothetical protein
spyM18_0721  0  22  -1.872599  hypothetical protein
spyM18_0722  1  26  -0.495247  Cro protein
spyM18_0723  0  28  0.454805  antirepressor
spyM18_0724  2  35  -1.104541  hypothetical protein
spyM18_0725  3  35  -2.327628  hypothetical protein
spyM18_0726  3  32  -1.502880  hypothetical protein
spyM18_0727  3  29  -1.738681  hypothetical protein
spyM18_0728  3  27  -0.578806  hypothetical protein
spyM18_0729  3  27  -1.050395  hypothetical protein
spyM18_0730  3  26  -1.132293  hypothetical protein
spyM18_0731  4  26  -0.365315  recombinase
spyM18_0732  3  23  -0.516304  hypothetical protein
spyM18_0733  2  22  -1.291911  hypothetical protein
spyM18_0734  3  24  -2.360333  hypothetical protein
spyM18_0735  5  26  -1.801895  hypothetical protein
spyM18_0736  2  26  -2.514758  hypothetical protein
spyM18_0737  2  24  -2.655189  hypothetical protein
spyM18_0738  2  21  -2.586660  hypothetical protein
spyM18_0739  1  20  -1.484134  hypothetical protein
spyM18_0740  3  20  -2.069855  hypothetical protein
spyM18_0741  1  20  -2.243192  methyltransferase
spyM18_0742  2  20  -2.050856  hypothetical protein
spyM18_0743  4  23  -1.784881  hypothetical protein
spyM18_0744  5  25  -2.134774  hypothetical protein
spyM18_0745  4  23  -2.702188  hypothetical protein
spyM18_0746  1  18  -1.525027  hypothetical protein
spyM18_0747  2  18  -1.878231  hypothetical protein
spyM18_0748  2  19  -1.863722  hypothetical protein
spyM18_0749  0  18  -1.517209  hypothetical protein
spyM18_0750  0  16  -1.248008  hypothetical protein
spyM18_0751  0  14  -0.937194  hypothetical protein
spyM18_0752  3  17  -0.766696  hypothetical protein
spyM18_0753  2  17  -0.329218  hypothetical protein
spyM18_0754  1  17  -0.634184  OrfH-like protein
spyM18_0755  2  17  -0.642969  ClpP protease
spyM18_0756  3  18  -1.637547  hypothetical protein
spyM18_0757  1  23  -1.444687  hypothetical protein
spyM18_0758  2  20  -1.765372  hypothetical protein
spyM18_0759  2  19  -2.098847  hypothetical protein
spyM18_0760  3  15  1.276514  hypothetical protein
spyM18_0761  4  14  1.438155  hypothetical protein
spyM18_0762  3  17  2.127161  hypothetical protein
spyM18_0763  3  16  2.090888  hypothetical protein
spyM18_0764  3  19  2.858626  hypothetical protein
spyM18_0765  3  19  2.650136  hypothetical protein
spyM18_0766  3  24  2.621996  hypothetical protein
spyM18_0769  3  24  2.360144  hypothetical protein
spyM18_0770  5  23  2.142545  hyaluronidase
spyM18_0771  4  24  3.207284  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_0740  PF06580  26  0.021  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 26.0 bits (57), Expect = 0.021
Identities = 7/45 (15%), Positives = 19/45 (42%)

Query: 29 LFLAIAIFGMMVTVSYFSYRDARQYYESQITGLRTQLSRTQKQLK 73
+ + + M ++ YF + + Y +++I + + QL
Sbjct: 120 IIFNVVVVTFMWSLLYFGWHFFKNYKQAEIDQWKMASMAQEAQLM 164


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_0745  PRPHPHLPASEC  31  0.006  Prokaryotic zinc-dependent phospholipase C signature.
		>PRPHPHLPASEC#Prokaryotic zinc-dependent phospholipase C signature.

Length = 398

Score = 31.1 bits (70), Expect = 0.006
Identities = 32/173 (18%), Positives = 63/173 (36%), Gaps = 12/173 (6%)

Query: 100 NKIRQKEPKSVDDNLKLLVNSFGNELSSYLYGADWSDTKYDLAVEIINKHDVFSGYKQTE 159
N + + EP+SV NL++L +EL D+ YDL + D + + +
Sbjct: 52 NDLSKNEPESVRKNLEIL-KENMHELQLGSTYPDYDKNAYDLYQDHFWDPDTDNNFSKDN 110

Query: 160 TYKKADKPYDEGELEKNKKLTQLEQLQQLGRLQQLEQLQQLGRLQQLGRLEQ-LEPTNYS 218
++ A D GE + K Q G +Q + G ++ P N +
Sbjct: 111 SWYLAYSIPDTGESQIRKFSALARYEWQRGNYKQATFYLGEA-MHYFGDIDTPYHPANVT 169

Query: 219 ---------YEAFSDIEDAIFYLDPPYENTTQKSYKGDFNSQAFYDWAFGMSK 262
+E F++ + ++ T + Y ++ F W+ ++
Sbjct: 170 AVDSAGHVKFETFAEERKEQYKINTAGCKTNEDFYADILKNKDFNAWSKEYAR 222


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_0769  SSPAMPROTEIN  29  0.036  Salmonella surface presentation of antigen gene typ...
		>SSPAMPROTEIN#Salmonella surface presentation of antigen gene type M

signature.
Length = 147

Score = 28.9 bits (64), Expect = 0.036
Identities = 23/65 (35%), Positives = 29/65 (44%), Gaps = 6/65 (9%)

Query: 331 ERINALENNQKVITNNQKQFELNLPKYLNDINGKRVWYEKPDDNIEHKIGDYWFEKNGKY 390
E I AL Q ++ K EL + + I KR EK + + K YW K G Y
Sbjct: 66 EEIYALLRKQSIVRRQIKDLELQIIQ----IQEKRSELEKKREEFQEK-SKYWLRKEGNY 120

Query: 391 QRTWI 395
QR WI
Sbjct: 121 QR-WI 124


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_0770  PF07212  563  0.0  Hyaluronoglucosaminidase
		>PF07212#Hyaluronoglucosaminidase

Length = 336

Score = 563 bits (1452), Expect = 0.0
Identities = 336/336 (100%), Positives = 336/336 (100%)

Query: 1 MTETIPLRVQFKRMTAEEWTRSDVILLESEIGFETDTGYAKFGDGKNQFSKLKYLNKPDL 60
MTETIPLRVQFKRMTAEEWTRSDVILLESEIGFETDTGYAKFGDGKNQFSKLKYLNKPDL
Sbjct: 1 MTETIPLRVQFKRMTAEEWTRSDVILLESEIGFETDTGYAKFGDGKNQFSKLKYLNKPDL 60

Query: 61 GAFAQKEETNSKITKLESSKADKNAVYLKAESKIELDKKLNLKGGVMTGQLQFKPNKSGI 120
GAFAQKEETNSKITKLESSKADKNAVYLKAESKIELDKKLNLKGGVMTGQLQFKPNKSGI
Sbjct: 61 GAFAQKEETNSKITKLESSKADKNAVYLKAESKIELDKKLNLKGGVMTGQLQFKPNKSGI 120

Query: 121 KPSSSVGGAINIDMSKSEGAGVVVYSNNDTSDGPLMSLRTGKETFNQSALFVDYSGKTNA 180
KPSSSVGGAINIDMSKSEGAGVVVYSNNDTSDGPLMSLRTGKETFNQSALFVDYSGKTNA
Sbjct: 121 KPSSSVGGAINIDMSKSEGAGVVVYSNNDTSDGPLMSLRTGKETFNQSALFVDYSGKTNA 180

Query: 181 VNIAMRQPTTPNFSSALNITSGNENGSAMQIRGVEKALGTLKITHENPNVEANYDENAAA 240
VNIAMRQPTTPNFSSALNITSGNENGSAMQIRGVEKALGTLKITHENPNVEANYDENAAA
Sbjct: 181 VNIAMRQPTTPNFSSALNITSGNENGSAMQIRGVEKALGTLKITHENPNVEANYDENAAA 240

Query: 241 LSIDIVKKQKGGKGTAAQGIYINSTSGTTGKLLRIRNLGDDKFYVKHDGGFYAKKTSQID 300
LSIDIVKKQKGGKGTAAQGIYINSTSGTTGKLLRIRNLGDDKFYVKHDGGFYAKKTSQID
Sbjct: 241 LSIDIVKKQKGGKGTAAQGIYINSTSGTTGKLLRIRNLGDDKFYVKHDGGFYAKKTSQID 300

Query: 301 GNLKLKNPTADDHAATKAYVDSEVKKLKALLMDKQV 336
GNLKLKNPTADDHAATKAYVDSEVKKLKALLMDKQV
Sbjct: 301 GNLKLKNPTADDHAATKAYVDSEVKKLKALLMDKQV 336


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_0771  RTXTOXIND  34  0.002  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 33.6 bits (77), Expect = 0.002
Identities = 27/145 (18%), Positives = 51/145 (35%), Gaps = 21/145 (14%)

Query: 158 RLSSSYQSGINGLKAQLANDKI---GLQAEIQATAQGLSQKYDNELRQLSAKITTTSSGT 214
RL+S + + + Q ++ +AE T +Y+N R +++ SS
Sbjct: 186 RLTSLIKEQFSTWQNQKYQKELNLDKKRAERL-TVLARINRYENLSRVEKSRLDDFSSLL 244

Query: 215 TEAYESKLAGLRAEFTRSNQGMRIELESQISGLRAVQQSTTSQISQEIRDRTGAVSRVQQ 274
+ +K A L E E++ + SQ+ Q + A Q
Sbjct: 245 HKQAIAKHAVL-------------EQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQL 291

Query: 275 DLESYQR----RLQDAEDNYSSLTH 295
+ ++ +L+ DN LT
Sbjct: 292 VTQLFKNEILDKLRQTTDNIGLLTL 316


9  spyM18_0796  spyM18_0804  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
spyM18_0796  2  15  -2.061658  septation ring formation regulator EzrA
spyM18_0797  1  18  -2.273743  hypothetical protein
spyM18_0798  0  17  -2.519766  phosphopyruvate hydratase
spyM18_0799  1  20  -4.040925  streptolysin S associated protein
spyM18_0800  0  19  -4.068798  hypothetical protein
spyM18_0801  -1  21  -4.379696  SagC
spyM18_0802  0  20  -4.853358  hypothetical protein
spyM18_0803  -1  15  -3.769062  hypothetical protein
spyM18_0804  -1  14  -3.775963  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_0804  TYPE3IMSPROT  31  0.003  Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 31.3 bits (71), Expect = 0.003
Identities = 17/76 (22%), Positives = 32/76 (42%), Gaps = 1/76 (1%)

Query: 36 SYQDFLDVLLSLFQFVVIIFVLFFYSATINLGEVLTFLTQTSWHWQILCYLVLYLMAIIE 95
S + ++ L S+ + VV++ +L + NL +L T L +L + +I
Sbjct: 133 SIKSLVEFLKSILK-VVLLSILIWIIIKGNLVTLLQLPTCGIECITPLLGQILRQLMVIC 191

Query: 96 MTLLVLILIFDVLLQK 111
V+I I D +
Sbjct: 192 TVGFVVISIADYAFEY 207


10  spyM18_0845  spyM18_0863  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
spyM18_0845  -3  15  -4.353823  hypothetical protein
spyM18_0846  -3  18  -5.859152  alpha-L-Rha alpha-1,3-L-rhamnosyltransferase
spyM18_0848  -2  20  -6.164327  ABC transporter permease
spyM18_0849  -1  20  -6.531472  ABC transporter ATP-binding protein
spyM18_0852  -1  22  -7.357862  glycosyltransferase
spyM18_0853  0  21  -7.749393  hypothetical protein
spyM18_0855  1  20  -6.566834  hypothetical protein
spyM18_0856  1  18  -6.382324  glycosyl transferase
spyM18_0858  2  18  -6.616375  hypothetical protein
spyM18_0859  2  18  -5.706242  hypothetical protein
spyM18_0860  2  16  -3.719755  hypothetical protein
spyM18_0861  -1  15  -1.183363  peptidase T
spyM18_0862  -1  20  -1.354497  pore-forming peptide
spyM18_0863  2  23  0.353617  ferredoxin
11  spyM18_0895  spyM18_0903  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
spyM18_0895  -2  18  -3.383227  carbamoyl phosphate synthase large subunit
spyM18_0896  -1  19  -4.698242  hypothetical protein
spyM18_0897  0  17  -4.429882  ABC transporter ATP-binding protein
spyM18_0898  -1  17  -4.430342  ABC transporter permease
spyM18_0899  0  17  -4.708088  glycerophosphodiester phosphodiesterase
spyM18_0900  0  16  -4.174312  30S ribosomal protein S16
spyM18_0901  0  14  -3.942552  RNA binding protein
spyM18_0903  -1  13  -3.316482  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_0896  RTXTOXIND  44  6e-07  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 44.4 bits (105), Expect = 6e-07
Identities = 21/112 (18%), Positives = 45/112 (40%), Gaps = 13/112 (11%)

Query: 170 QQLQDLNDAYADAQAEVNKAQIALNDTVVISSVSGTVVE-----VNNDIDPSSKNSQTLV 224
+L+ D E+ K + +V+ + VS V + + ++TL+
Sbjct: 302 DKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTT----AETLM 357

Query: 225 HVATEGQ-LQVKGTLTEYDLANVKVGQSVKIKSKVYSNQEW---TGKISYVS 272
+ E L+V + D+ + VGQ+ IK + + + GK+ ++
Sbjct: 358 VIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNIN 409



Score = 37.1 bits (86), Expect = 1e-04
Identities = 24/185 (12%), Positives = 53/185 (28%), Gaps = 29/185 (15%)

Query: 21 ITLVLIITGVVLWKQQQNTLTADIAKEPYSTVSVTEGSIASSTLLSGTVKALSEEYIYFD 80
++ + + + + V+ G + S S +K + +
Sbjct: 62 YFIMGFLVIAFIL--------SVLG--QVEIVATANGKLTHSGR-SKEIKPIENSIV--- 107

Query: 81 ANKGNDATVTVKIGDQVTQGQQLVQYNTTTA-------QSAYDTAVRSLNKIGRQINHLK 133
+ VK G+ V +G L++ A QS+ A + ++
Sbjct: 108 ------KEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIE 161

Query: 134 TYGVPAV--STETNKDEATGEETTTTVQPSAQQNANYKQQLQDLNDAYADAQAEVNKAQI 191
+P + E + EE +Q + ++ Q +AE
Sbjct: 162 LNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLA 221

Query: 192 ALNDT 196
+N
Sbjct: 222 RINRY 226


12  spyM18_1061  spyM18_1078  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
spyM18_1061  2  15  0.321627  dihydroneopterin aldolase
spyM18_1062  1  14  -0.292387  2-amino-4-hydroxy-6-
spyM18_1063  2  15  -0.537471  UDP-N-acetylenolpyruvoylglucosamine reductase
spyM18_1064  1  17  -0.829740  spermidine/putrescine ABC transporter
spyM18_1065  1  17  -0.180480  spermidine/putrescine ABC transporter permease
spyM18_1066  1  16  0.261920  spermidine/putrescine ABC transporter permease
spyM18_1067  1  15  0.384746  spermidine/putrescine ABC transporter
spyM18_1068  1  16  0.419080  two-component response regulator
spyM18_1069  1  15  -0.031061  two-component sensor histidine kinase
spyM18_1071  2  17  -0.767033  L-malate permease
spyM18_1072  2  19  -1.945836  NAD-dependent malic enzyme
spyM18_1073  1  20  -4.012123  zinc-containing alcohol dehydrogenase
spyM18_1074  2  23  -5.138172  acid phosphatase/phosphotransferase
spyM18_1075  0  21  -4.757089  hypothetical protein
spyM18_1076  0  20  -4.822252  hypothetical protein
spyM18_1077  0  16  -4.831335  hypothetical protein
spyM18_1078  -1  16  -3.261460  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_1067  MYCMG045  35  3e-04  Hypothetical mycoplasma lipoprotein (MG045) signature.
		>MYCMG045#Hypothetical mycoplasma lipoprotein (MG045) signature.

Length = 483

Score = 35.5 bits (81), Expect = 3e-04
Identities = 23/82 (28%), Positives = 42/82 (51%), Gaps = 4/82 (4%)

Query: 31 SGSQTDKLVIYNWGDYIDPALLKKFTKETGVEVQYETFDSNEAMYTKIKQGGTTYDIAVP 90
S + V+ N+ YI P LL++ + + + T+ SNE + TY +AV
Sbjct: 21 SSCGSTTFVLANFESYISPLLLER--VQEKHPLTFLTYPSNEKLINGF--ANNTYSVAVA 76

Query: 91 SDYTIDKMIKENLLNKLDKSKL 112
S Y + ++I+ +LL+ +D S+
Sbjct: 77 STYAVSELIERDLLSPIDWSQF 98


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_1068  HTHFIS  69  5e-16  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 69.5 bits (170), Expect = 5e-16
Identities = 23/131 (17%), Positives = 51/131 (38%), Gaps = 2/131 (1%)

Query: 3 VLIIEDDPMVDFIHRNYLEKLNLFDRIISSDSMKAVQSILTDYAIDLILLDIHITDGNGI 62
+L+ +DD + + L + + + + + + DL++ D+ + D N
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGY--DVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 63 QFLEKLRAQHIPCEVIIISAANDGNIIRDGFHLGIIDYLIKPFTFERFQESIQQFVTHRE 122
L +++ V+++SA N G DYL KPF I + + +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 123 HLANQQLEQAQ 133
++ + +Q
Sbjct: 124 RRPSKLEDDSQ 134


13  spyM18_1180  spyM18_1186  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
spyM18_1180  1  23  -4.689673  lipoprotein
spyM18_1181  1  22  -6.210130  cytidine deaminase
spyM18_1182  0  18  -4.913596  16S rRNA m(2)G 1207 methyltransferase
spyM18_1183  1  18  -4.764301  pantothenate kinase
spyM18_1184  0  16  -3.812853  30S ribosomal protein S20
spyM18_1186  -1  14  -3.522890  histidine kinase protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_1180  LIPPROTEIN48  66  5e-14  Mycoplasma P48 major surface lipoprotein signature.
		>LIPPROTEIN48#Mycoplasma P48 major surface lipoprotein signature.

Length = 428

Score = 65.8 bits (160), Expect = 5e-14
Identities = 76/299 (25%), Positives = 120/299 (40%), Gaps = 45/299 (15%)

Query: 36 DLKVAMVTDTGGVDDKSFNQSAWEGLQSWGKEMGLQKGTGFDYFQSTSESEYATNLDTAV 95
LK ++TD G +DDKSFNQSA+E L++ + K TG + S + + ++A+
Sbjct: 61 KLKPVLITDEGKIDDKSFNQSAFEALKA------INKQTGIEINNVEPSSNFESAYNSAL 114

Query: 96 SGGYQLIYGIGFALKDAIAKAAGD------NEGVKFVIIDDIIEGKDNV-ASVTFADHEA 148
S G+++ GF + +I + +K + ID IE + S+ F E+
Sbjct: 115 SAGHKIWVLNGFKHQQSIKQYIDAHREELERNQIKIIGIDFDIETEYKWFYSLQFNIKES 174

Query: 149 AYLAGIAAAKTTKTK-----TVGFVGGMEGTVITRFEKGFEAGVKS---------VDDTI 194
A+ G A A + V GG +T F +GF G+ + T
Sbjct: 175 AFTTGYAIASWLSEQDESKRVVASFGGGAFPGVTTFNEGFAKGILYYNQKHKSSKIYHTS 234

Query: 195 QVKVDYAGSFGDAAKGKTIAAAQYAAGADVIYQAAGG---TGAGVFNEAKAINEKRSEAD 251
VK+D +G I + ADV Y G F + N+ +
Sbjct: 235 PVKLD-SGFTAGEKMNTVINNVLSSTPADVKYNPHVILSVAGPATFETVRLANKGQ---- 289

Query: 252 KVWVIGVDRDQKDEGKYTSKDGKEANFVLASSIKEVGKAVQLINKQVADKKFPGGKTTV 310
+VIGVD DQ +D +L S +K + +AV + +K G K V
Sbjct: 290 --YVIGVDSDQG-----MIQDKDR---ILTSVLKHIKQAVYETLLDLILEKEEGYKPYV 338


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_1186  PF06580  39  2e-05  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 39.1 bits (91), Expect = 2e-05
Identities = 15/75 (20%), Positives = 31/75 (41%), Gaps = 5/75 (6%)

Query: 312 YGKIFYFQNQVNRSLRMDKALLKQLITILFDNAIKY----TDKNGIIEIIVKTTDKNLLI 367
+ F+NQ+N ++ D + L+ L +N IK+ + G I + + + +
Sbjct: 236 FEDRLQFENQINPAIM-DVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTL 294

Query: 368 SVIDNGPGITDEEKK 382
V + G K+
Sbjct: 295 EVENTGSLALKNTKE 309


14  spyM18_1202  spyM18_1220  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
spyM18_1202  3  16  -0.589277  hypothetical protein
spyM18_1203  3  16  -0.751027  hypothetical protein
spyM18_1205  1  16  -1.304121  hypothetical protein
spyM18_1207  0  20  -2.955659  ABC transporter ATP-binding protein
spyM18_1208  2  21  -4.454530  transcriptional regulator
spyM18_1209  1  23  -5.232683  hypothetical protein
spyM18_1210  0  23  -5.335907  transcriptional regulator
spyM18_1211  1  25  -4.594181  hypothetical protein
spyM18_1212  -2  14  -0.907410  hypothetical protein
spyM18_1213  -1  14  0.561540  hypothetical protein
spyM18_1214  -1  14  0.318099  hypothetical protein
spyM18_1215  -2  12  0.119711  hypothetical protein
spyM18_1216  -2  12  -0.182596  hypothetical protein
spyM18_1217  -2  13  -0.259673  DNA helicase II
spyM18_1219  -1  15  -1.503986  Na(+)-linked D-alanine glycine permease
spyM18_1220  -4  15  -3.128649  cation efflux system protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_1205  GPOSANCHOR  30  0.042  Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 30.0 bits (67), Expect = 0.042
Identities = 23/127 (18%), Positives = 40/127 (31%), Gaps = 27/127 (21%)

Query: 216 AFSKDYQKRVTQNQAHLDNLLKDNGQ-----KRYDDLQNQYDLALKNGRAALAKETVKLA 270
FS ++ +A L + + + +K A A + A
Sbjct: 239 NFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKA 298

Query: 271 ASEENLTFLEVS---------ALQEAKHQIEQGKQALAKEEKQ------------LEQVQ 309
E L + A +EAK Q+E Q L +E+ + L+ +
Sbjct: 299 DLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKL-EEQNKISEASRQSLRRDLDASR 357

Query: 310 ATKDKLE 316
K +LE
Sbjct: 358 EAKKQLE 364


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_1207  PF05272  34  6e-04  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 33.5 bits (76), Expect = 6e-04
Identities = 18/41 (43%), Positives = 23/41 (56%), Gaps = 2/41 (4%)

Query: 32 KGELVVIL-GASGAGKSTVLNILGGMD-TVDAGQVIIDGKD 70
K + V+L G G GKST++N L G+D D I GKD
Sbjct: 594 KFDYSVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKD 634


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_1208  HTHTETR  41  8e-07  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 40.8 bits (95), Expect = 8e-07
Identities = 13/48 (27%), Positives = 26/48 (54%)

Query: 4 RHTETKAYVKTALITLLTEQSFETLTVSDLTKKAGINRGTFYLHYTDK 51
ET+ ++ + L ++Q + ++ ++ K AG+ RG Y H+ DK
Sbjct: 8 EAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDK 55


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_1215  PF06580  29  0.015  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 28.7 bits (64), Expect = 0.015
Identities = 19/109 (17%), Positives = 32/109 (29%), Gaps = 15/109 (13%)

Query: 19 LVGLVLLSVFGWVVGITGGYIYLPYSYRWLSWGMDSFPNLLDSALSYYYFWTALVLFVIT 78
++ + +S+ G V +T Y WL M A V+
Sbjct: 42 MIFNIAISLMGLV--LTHAYRSFIKRQGWLKLNMGQI---------ILRVLPACVVIG-- 88

Query: 79 FLALLVIILYPRIYTEVQLRHKNKKGTLLLKKSAIESYVATAIQTAGLM 127
+ + R+ + K TL L S I + V + L
Sbjct: 89 MVWFVANTSIWRLLAFIN--TKPVAFTLPLALSIIFNVVVVTFMWSLLY 135


15  spyM18_1232  spyM18_1306  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
spyM18_1232  -2  11  -3.066007  DNA polymerase III DnaE
spyM18_1233  2  18  -5.678697  transcriptional regulator
spyM18_1234  2  19  -6.815716  ABC transporter ATP-binding protein
spyM18_1235  1  20  -7.374969  ABC transporter permease
spyM18_1236  2  21  -7.054835  hypothetical protein
spyM18_1237  3  21  -5.378177  hypothetical protein
spyM18_1238  2  21  -5.134966  exotoxin SpeL
spyM18_1239  0  18  -3.220118  exotoxin SpeM
spyM18_1240  0  18  -1.592284  hypothetical protein
spyM18_1241  0  19  -0.110032  hypothetical protein
spyM18_1242  3  23  1.293066  hypothetical protein
spyM18_1243  1  20  -0.246925  hypothetical protein
spyM18_1244  4  27  2.356308  hypothetical protein
spyM18_1247  5  25  1.243275  holin
spyM18_1248  3  24  1.733054  hypothetical protein
spyM18_1249  3  24  1.200701  hypothetical protein
spyM18_1251  3  21  2.447824  hypothetical protein
spyM18_1253  2  19  2.745943  hypothetical protein
spyM18_1254  1  17  2.069831  hyaluronidase
spyM18_1255  1  17  2.308833  hypothetical protein
spyM18_1256  1  18  1.720078  hypothetical protein
spyM18_1257  1  18  2.314087  minor tail protein
spyM18_1258  3  17  -0.144286  minor tail protein
spyM18_1259  0  16  -2.179727  hypothetical protein
spyM18_1260  1  20  -1.755166  hypothetical protein
spyM18_1261  2  24  -1.765814  structural phage protein
spyM18_1262  0  25  -0.132974  hypothetical protein
spyM18_1263  2  25  0.141890  hypothetical protein
spyM18_1264  4  26  0.830627  hypothetical protein
spyM18_1265  5  24  0.652568  hypothetical protein
spyM18_1266  4  23  0.615384  hypothetical protein
spyM18_1267  4  22  0.585577  hypothetical protein
spyM18_1268  5  23  -0.139470  structural phage protein
spyM18_1269  4  24  -0.537133  hypothetical protein
spyM18_1270  1  22  -1.297783  antirepressor
spyM18_1271  2  21  -0.033829  hypothetical protein
spyM18_1272  1  19  -0.065575  hypothetical protein
spyM18_1273  1  18  0.000817  hypothetical protein
spyM18_1274  1  18  0.037759  hypothetical protein
spyM18_1275  1  19  0.079444  hypothetical protein
spyM18_1276  4  21  -0.033328  phage protein
spyM18_1277  2  23  -0.947639  terminase, large subunit
spyM18_1278  3  28  -1.325552  terminase, small subunit
spyM18_1279  5  31  -1.745819  hypothetical protein
spyM18_1280  5  29  -2.143778  hypothetical protein
spyM18_1281  5  30  -2.207228  hypothetical protein
spyM18_1282  4  31  -1.613497  hypothetical protein
spyM18_1283  5  28  -3.444279  hypothetical protein
spyM18_1284  4  23  -3.864580  hypothetical protein
spyM18_1285  4  24  -3.691828  hypothetical protein
spyM18_1286  4  22  -3.489121  hypothetical protein
spyM18_1287  3  21  -3.272784  hypothetical protein
spyM18_1288  4  21  -3.767457  hypothetical protein
spyM18_1289  3  20  -3.655476  bacteriophage resistance protein
spyM18_1290  4  20  -3.490413  hypothetical protein
spyM18_1291  4  18  -3.974133  hypothetical protein
spyM18_1292  5  22  -4.734050  hypothetical protein
spyM18_1293  5  30  -6.043002  hypothetical protein
spyM18_1294  4  29  -5.502507  hypothetical protein
spyM18_1295  5  28  -5.348318  hypothetical protein
spyM18_1296  3  23  -5.822252  hypothetical protein
spyM18_1297  4  22  -5.044042  hypothetical protein
spyM18_1298  5  20  -5.374395  hypothetical protein
spyM18_1299  6  19  -3.822252  hypothetical protein
spyM18_1300  4  18  -3.182500  hypothetical protein
spyM18_1301  3  18  -2.995397  hypothetical protein
spyM18_1302  2  19  -2.555655  hypothetical protein
spyM18_1303  1  22  -2.524761  hypothetical protein
spyM18_1304  -1  18  -3.295370  hypothetical protein
spyM18_1305  0  19  -4.017494  hypothetical protein
spyM18_1306  -3  18  -3.167384  repressor protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_1238  BACTRLTOXIN  51  8e-10  Bacterial toxin signature.
		>BACTRLTOXIN#Bacterial toxin signature.

Length = 266

Score = 51.1 bits (122), Expect = 8e-10
Identities = 38/185 (20%), Positives = 76/185 (41%), Gaps = 35/185 (18%)

Query: 100 YNFIISSNLHPSVEGKFNVGDNVDVFGL--ALSAEVFSKDQIHSINGGLV---------- 147
Y+ + + L+ + K+ + VDV+G ++ SKD + + GG
Sbjct: 88 YDKVKTELLNEDLAKKYK-DEVVDVYGSNYYVNCYFSSKDNVGKVTGGKTCMYGGITKHE 146

Query: 148 --KVNERKGAGKTIYMNVFIDGHKKDDTSKYKITFEKSPVTFQEVDVRLRKSFMLNDEIK 205
+ + + + V+ + K +T +++ +K VT QE+D++ R L ++
Sbjct: 147 GNHFDNGNL--QNVLVRVYEN---KRNTISFEVQTDKKSVTAQELDIKARN--FLINKKN 199

Query: 206 LYQYD-SKVLSGNWEFHGSGEKEEGADLFKYPD-----------YRYNNLIDIDKKSHID 253
LY+++ S +G +F + D+ P Y N +D K I+
Sbjct: 200 LYEFNSSPYETGYIKFIENNGNTFWYDMMPAPGDKFDQSKYLMMYNDNKTVD-SKSVKIE 258

Query: 254 VYLFT 258
V+L T
Sbjct: 259 VHLTT 263


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_1239  BACTRLTOXIN  55  3e-11  Bacterial toxin signature.
		>BACTRLTOXIN#Bacterial toxin signature.

Length = 266

Score = 54.9 bits (132), Expect = 3e-11
Identities = 52/218 (23%), Positives = 92/218 (42%), Gaps = 29/218 (13%)

Query: 35 LKNIYTKDVINRTNMKITKK-IGTQLIFNTNEKTRVWDDDNYNKVISSNVSPAQERRFKE 93
+K +Y ++ T +K K + LI+N ++K NY+KV + ++ +++K+
Sbjct: 51 MKYLYDDHYVSATKVKSVDKFLAHDLIYNISDK----KLKNYDKVKTELLNEDLAKKYKD 106

Query: 94 EEVDIYALIKSYSVICKEQYNYVDG--------GLIRTSDREKLDSTIYMNIFGEQIPLK 145
E VD+Y + + N G I + D+ N+ K
Sbjct: 107 EVVDVYGSNYYVNCYFSSKDNVGKVTGGKTCMYGGITKHEGNHFDNGNLQNVLVRVYENK 166

Query: 146 EQS-KYKITFQNKFVTFQEIDVRLRKSLMSDNRIKLYEHN-SICKKGYWGIHYKDNTTKF 203
+ +++ K VT QE+D++ R L+ N+ LYE N S + GY + T +
Sbjct: 167 RNTISFEVQTDKKSVTAQELDIKARNFLI--NKKNLYEFNSSPYETGYIKFIENNGNTFW 224

Query: 204 TDLFTHPN-----------YTDNETIDMSKVSHFDVYL 230
D+ P Y DN+T+D SK +V+L
Sbjct: 225 YDMMPAPGDKFDQSKYLMMYNDNKTVD-SKSVKIEVHL 261


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_1244  FLGFLGJ  96  3e-26  Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 95.9 bits (238), Expect = 3e-26
Identities = 46/123 (37%), Positives = 64/123 (52%), Gaps = 8/123 (6%)

Query: 23 SLTAAQAILESGWGKHA-------PHNALFGIKADSSWTGKSFDTKTQEEYQPGVVTDIV 75
L AQA LESGWG+ P LFG+KA +W G + T E Y+ G +
Sbjct: 172 HLILAQAALESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTE-YENGEAKKVK 230

Query: 76 DRFRAYGSWDESILDHGKFLNDNPRYKAVVGETDYKKACHAIKDAGYATASGYAELLIQI 135
+FR Y S+ E++ D+ L NPRY AV ++ A++DAGYAT YA L +
Sbjct: 231 AKFRVYSSYLEALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNM 290

Query: 136 IKE 138
I++
Sbjct: 291 IQQ 293


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_1254  PF07212  553  0.0  Hyaluronoglucosaminidase
		>PF07212#Hyaluronoglucosaminidase

Length = 336

Score = 553 bits (1427), Expect = 0.0
Identities = 260/334 (77%), Positives = 294/334 (88%), Gaps = 2/334 (0%)

Query: 1 MTETIPLRVQFKRMTAKEWASSAVILLEGEIGFETDTGYAKFGDGKSRFSELKYLNKPDL 60
MTETIPLRVQFKRMTA+EW S VILLE EIGFETDTGYAKFGDGK++FS+LKYLNKPDL
Sbjct: 1 MTETIPLRVQFKRMTAEEWTRSDVILLESEIGFETDTGYAKFGDGKNQFSKLKYLNKPDL 60

Query: 61 GAFAQKEETNSKITKLESSKADKNAVYLKAESKIELDKKLSLTGGIVTGQLRLKPN-SGI 119
GAFAQKEETNSKITKLESSKADKNAVYLKAESKIELDKKL+L GG++TGQL+ KPN SGI
Sbjct: 61 GAFAQKEETNSKITKLESSKADKNAVYLKAESKIELDKKLNLKGGVMTGQLQFKPNKSGI 120

Query: 120 EKSSSTGGAINIDMSKSKGAAMVMYTNKDTTDGPLMILRSNKDTFDQSVQFVDYRGKTNA 179
+ SSS GGAINIDMSKS+GA +V+Y+N DT+DGPLM LR+ K+TF+QS FVDY GKTNA
Sbjct: 121 KPSSSVGGAINIDMSKSEGAGVVVYSNNDTSDGPLMSLRTGKETFNQSALFVDYSGKTNA 180

Query: 180 VNIVMRQPSTPNFSSALNITSANEGGSAMQLRGSEEALGTLKITHENPSLEANYDKNAAA 239
VNI MRQP+TPNFSSALNITS NE GSAMQ+RG E+ALGTLKITHENP++EANYD+NAAA
Sbjct: 181 VNIAMRQPTTPNFSSALNITSGNENGSAMQIRGVEKALGTLKITHENPNVEANYDENAAA 240

Query: 240 LSIDIVKKTNGA-GTAAQGIYINSTSGTTGKLLRIRNKNKDKFYVNPDGGFHSYADSIVD 298
LSIDIVKK G GTAAQGIYINSTSGTTGKLLRIRN DKFYV DGGF++ S +D
Sbjct: 241 LSIDIVKKQKGGKGTAAQGIYINSTSGTTGKLLRIRNLGDDKFYVKHDGGFYAKKTSQID 300

Query: 299 GNLTVKNPTSNEHAATKKYVDEKIAELKKLIPKK 332
GNL +KNPT+++HAATK YVD ++ +LK L+ K
Sbjct: 301 GNLKLKNPTADDHAATKAYVDSEVKKLKALLMDK 334


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_1255  FLGFLIJ  30  0.019  Flagellar FliJ protein signature.
		>FLGFLIJ#Flagellar FliJ protein signature.

Length = 147

Score = 29.8 bits (66), Expect = 0.019
Identities = 25/100 (25%), Positives = 55/100 (55%), Gaps = 14/100 (14%)

Query: 389 QNMIDESLETITGLGMT------FQEFLQDIEKRIETGKKEMEDNWRKVNLEFDNFKKKV 442
QN +L + G+T +Q+F+Q +EK I ++++ +KV++ +++++K
Sbjct: 46 QNEYRNNLNSDMSAGITSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREK- 104

Query: 443 EQEGLQFNTLKEQIKE----VDERID-KELEEF--RATLK 475
+Q + TL+E+ + R+D K+++EF RA ++
Sbjct: 105 KQRLQAWQTLQERQSTAALLAENRLDQKKMDEFAQRAAMR 144


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_1257  RTXTOXINA  32  0.009  Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 32.2 bits (73), Expect = 0.009
Identities = 63/330 (19%), Positives = 115/330 (34%), Gaps = 44/330 (13%)

Query: 192 ISAVIQSLTGVITAVFNGIATVISSVGSAIKDVLTGLGI--AFEGFGNGVK-SALEGVGA 248
++ I ++S+ + + L+ + I + +G S+ E A
Sbjct: 124 AGNILGGGAENIGDNLGKAGGILSTFQNFLGTALSSMKIDELIKKQKSGGNVSSSELAKA 183

Query: 249 VIESFGSAVRNVLDGVANILDSMGTAALNAGRGVKEMAKGIKMLVDLSLGDLVATLAAVA 308
IE V V N+ +S G + K + +G+ + L
Sbjct: 184 SIELINQLVDTVASLNNNV-NSFSQQLNTLGSVL-SNTKHLN-----GVGNKLQNL---- 232

Query: 309 SGLGKMASSAGEMTTLGSAMSKVANGMTRLATSATIAITGLTVFATTMATIKTAVATLPP 368
L + + ++ + SA+S A + T A G+ + + + ++
Sbjct: 233 PNLDNIGAGLDTVSGILSAISASFILSNADADTRTKAAAGVELTTKVLGNVGKGISQYII 292

Query: 369 VLTMAASGFTTFTTQAVAAVTGLAAINAPITMFKAQLMAITPALAQAGAGFAAFVAQSST 428
A T+ AA GL A A +AI+P L+ +A
Sbjct: 293 AQRAAQGLSTS------AAAAGLIA--------SAVTLAISP-LSFLS------IADKFK 331

Query: 429 FSTGLASAGPTIAAFNANLMSLSAT----TGALVASIVGLSAVLSVVLADFSQIGASATA 484
+ + + SL A TGA+ AS+ +S VL+ V + S A+ T+
Sbjct: 332 RANKIEEYSQRFKKLGYDGDSLLAAFHKETGAIDASLTTISTVLASVSSGIS--AAATTS 389

Query: 485 TVGQ-IQAFASSTTVVSSAF--ASMQSMIQ 511
VG + A + T + S AS Q+M +
Sbjct: 390 LVGAPVSALVGAVTGIISGILEASKQAMFE 419


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_1275  TYPE4SSCAGX  33  0.003  Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein signature.

Length = 522

Score = 33.2 bits (75), Expect = 0.003
Identities = 48/199 (24%), Positives = 80/199 (40%), Gaps = 10/199 (5%)

Query: 63 EARKRASELDISAYQKKAKELVAKAEKLRKEGRTVTRDDFTHQENADMSIYNLAMKTNAL 122
E +K+A E + A ++ K K EK RKE R R + + NA + NL+ N
Sbjct: 142 EEQKKALEKEKEAKEQAQKAQKDKREK-RKEERAKNRANLENLTNAMSNPQNLSNNKNLS 200

Query: 123 ELLRLNIDLE---------MQELANGEHKLTKKFLDEGYRKETEFQAGLLGLSVASQASV 173
EL++ + E MQE A + L++ +E Q +S+ + S
Sbjct: 201 ELIKQQRENELDQMERLEDMQEQAQANALKQIEELNKKQAEEAVRQRAKDKISIKTDKSQ 260

Query: 174 KSLADAVINANFKGAKWSDNIWDRQDKLRSIISQSVQSAILRGKNGLTIARDIRREFDVS 233
KS D I + + W N+ R +K + LT+ + + +VS
Sbjct: 261 KSPEDNSIELSPSDSAWRTNLVVRTNKALYQFILRIAQKDNFASAYLTVKLEYPQRHEVS 320

Query: 234 ASYAKRLAITEHARVQMEV 252
+ + L E A+ Q E+
Sbjct: 321 SVIEEELKKREEAKRQREL 339


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_1284  OMPTIN  28  0.013  Omptin serine protease signature.
		>OMPTIN#Omptin serine protease signature.

Length = 317

Score = 27.6 bits (61), Expect = 0.013
Identities = 13/75 (17%), Positives = 30/75 (40%), Gaps = 6/75 (8%)

Query: 51 DNLDDYVLMQSTGLKDKNGVKIFEGDVVKLQYTITSDLEFFKVNQFRGGSW-RIDNRRRG 109
DN + Y + + K + + V Y +T + + + G+W R+ N++
Sbjct: 228 DNDEHYDPGKRITYRSKVKDQNYYSVAVNAGYYVTPNAKVYV-----EGAWNRVTNKKGN 282

Query: 110 SELWLRNDDCEVVGN 124
+ L+ N++
Sbjct: 283 TSLYDHNNNTSDYSK 297


16  spyM18_1448  spyM18_1461  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
spyM18_1448  3  23  1.790510  hypothetical protein
spyM18_1450  4  22  0.420550  holin
spyM18_1451  3  20  0.518328  hypothetical protein
spyM18_1452  2  20  0.539424  hypothetical protein
spyM18_1453  1  19  -0.445406  hypothetical protein
spyM18_1454  1  18  -0.441860  hypothetical protein
spyM18_1455  -2  18  -1.495923  hyaluronidase
spyM18_1456  -3  18  -1.683700  hypothetical protein
spyM18_1457  0  22  -3.484313  hypothetical protein
spyM18_1461  -2  21  -3.749494  endonuclease
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_1448  FLGFLGJ  92  5e-23  Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 92.1 bits (228), Expect = 5e-23
Identities = 44/125 (35%), Positives = 63/125 (50%), Gaps = 8/125 (6%)

Query: 23 SLTAAQTILESGWGKHA-------PHNALFGIKADSSWTGKSFDTKTQEEYQAGVVTDIV 75
L AQ LESGWG+ P LFG+KA +W G + T E Y+ G +
Sbjct: 172 HLILAQAALESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTE-YENGEAKKVK 230

Query: 76 DRFRAYDSWTDSIIDHGKFLNDNPRYQAVIGETDYKKACHAIKDAGYATASGYAELLIQL 135
+FR Y S+ +++ D+ L NPRY AV ++ A++DAGYAT YA L +
Sbjct: 231 AKFRVYSSYLEALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNM 290

Query: 136 IEEND 140
I++
Sbjct: 291 IQQMK 295


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_1455  PF07212  558  0.0  Hyaluronoglucosaminidase
		>PF07212#Hyaluronoglucosaminidase

Length = 336

Score = 558 bits (1438), Expect = 0.0
Identities = 282/334 (84%), Positives = 307/334 (91%), Gaps = 1/334 (0%)

Query: 1 MTETIPLRVQFKRMTAEEWARSDVILLESEIGFETDTGYAKFGDGKNQFSKLKYLNKLDL 60
MTETIPLRVQFKRMTAEEW RSDVILLESEIGFETDTGYAKFGDGKNQFSKLKYLNK DL
Sbjct: 1 MTETIPLRVQFKRMTAEEWTRSDVILLESEIGFETDTGYAKFGDGKNQFSKLKYLNKPDL 60

Query: 61 DAFAQKKETNSKITKLESNKADKNAVYLKAESNAKLDEKLSLTGGIVTGQLQFKPN-SGI 119
AFAQK+ETNSKITKLES+KADKNAVYLKAES +LD+KL+L GG++TGQLQFKPN SGI
Sbjct: 61 GAFAQKEETNSKITKLESSKADKNAVYLKAESKIELDKKLNLKGGVMTGQLQFKPNKSGI 120

Query: 120 KPSSSVGGAINIDMSKSKGAGIVVYSNNDTSDGPLMSLRTGKETFNKSALFVDYSGKTNA 179
KPSSSVGGAINIDMSKS+GAG+VVYSNNDTSDGPLMSLRTGKETFN+SALFVDYSGKTNA
Sbjct: 121 KPSSSVGGAINIDMSKSEGAGVVVYSNNDTSDGPLMSLRTGKETFNQSALFVDYSGKTNA 180

Query: 180 VNIAMRQPSTPNFSSALNITSGNENGSAMQLRGSEKALGTLKITHENPNVEAKYDENAAA 239
VNIAMRQP+TPNFSSALNITSGNENGSAMQ+RG EKALGTLKITHENPNVEA YDENAAA
Sbjct: 181 VNIAMRQPTTPNFSSALNITSGNENGSAMQIRGVEKALGTLKITHENPNVEANYDENAAA 240

Query: 240 LSIDIVKKQKGGKGTAAQGIYINSTSGTAGKMLRIRNKNKDKFYVGPDGDFWSCASSIVD 299
LSIDIVKKQKGGKGTAAQGIYINSTSGT GK+LRIRN DKFYV DG F++ +S +D
Sbjct: 241 LSIDIVKKQKGGKGTAAQGIYINSTSGTTGKLLRIRNLGDDKFYVKHDGGFYAKKTSQID 300

Query: 300 GNLTVKDPTSGKHAATKDYVDEKIAELKKLILKK 333
GNL +K+PT+ HAATK YVD ++ +LK L++ K
Sbjct: 301 GNLKLKNPTADDHAATKAYVDSEVKKLKALLMDK 334


17  spyM18_1482  spyM18_1506  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
spyM18_1482  2  24  -2.656516  hypothetical protein
spyM18_1484  2  26  -1.621552  hypothetical protein
spyM18_1485  4  25  -1.966530  hypothetical protein
spyM18_1486  3  24  -2.189517  hypothetical protein
spyM18_1487  2  25  -1.435300  hypothetical protein
spyM18_1488  2  27  -1.787164  hypothetical protein
spyM18_1489  5  28  -0.783492  hypothetical protein
spyM18_1490  3  22  -1.003375  hypothetical protein
spyM18_1491  2  20  -1.794342  hypothetical protein
spyM18_1492  2  21  -1.326291  hypothetical protein
spyM18_1493  2  21  -0.965997  hypothetical protein
spyM18_1494  3  21  -1.011646  hypothetical protein
spyM18_1495  2  23  -1.784567  recombinase
spyM18_1496  3  31  -1.983868  hypothetical protein
spyM18_1497  2  31  -1.391440  hypothetical protein
spyM18_1498  2  26  -2.767262  hypothetical protein
spyM18_1499  1  21  -2.900796  hypothetical protein
spyM18_1500  1  22  -3.630933  hypothetical protein
spyM18_1501  2  20  -3.776870  hypothetical protein
spyM18_1502  2  20  -3.564491  excisionase
spyM18_1503  1  18  -4.119510  Cro-like repressor
spyM18_1504  1  15  -4.460715  repressor
spyM18_1505  -1  17  -4.130567  hypothetical protein
spyM18_1506  0  18  -3.595706  integrase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_1504  SACTRNSFRASE  28  0.026  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 28.0 bits (62), Expect = 0.026
Identities = 13/37 (35%), Positives = 18/37 (48%)

Query: 119 KSEETEDYITDYVEGLVAAGLGAYQEDNLHMKVKLRS 155
K E +D YVE A Y E+N ++K+RS
Sbjct: 48 KQYEDDDMDVSYVEEEGKAAFLYYLENNCIGRIKIRS 84


18  spyM18_1547  spyM18_1562  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
spyM18_1547  2  16  -2.738551  hypothetical protein
spyM18_1548  2  17  -2.285666  peroxide resistance protein
spyM18_1549  2  20  -2.853374  hypothetical protein
spyM18_1550  1  18  -2.197697  ribosomal RNA large subunit methyltransferase N
spyM18_1551  -1  15  -1.554219  hypothetical protein
spyM18_1552  -2  14  -0.053021  ribose transport operon repressor
spyM18_1553  -1  13  1.080105  hypothetical protein
spyM18_1554  2  15  1.451102  phosphopantetheine adenylyltransferase
spyM18_1555  3  18  2.115762  hypothetical protein
spyM18_1556  4  19  2.209295  asparagine synthetase AsnA
spyM18_1558  4  25  2.077377  carbamate kinase
spyM18_1560  2  21  1.258253  hypothetical protein
spyM18_1561  3  23  0.765586  hypothetical protein
spyM18_1562  3  23  0.945492  ornithine carbamoyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_1548  HELNAPAPROT  151  1e-49  Helicobacter neutrophil-activating protein A family ...
		>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature.

Length = 153

Score = 151 bits (383), Expect = 1e-49
Identities = 49/154 (31%), Positives = 85/154 (55%), Gaps = 4/154 (2%)

Query: 19 KKEASKNEKT--KAVLNQAVADLSVAASIVHQVHWYMRGPGFLYLHPKMDELLDSLNANL 76
K E +K +T + LN +++ + S +H+ HWY++GP F LH K +EL D +
Sbjct: 2 KTENAKTNQTLVENSLNTQLSNWFLLYSKLHRFHWYVKGPHFFTLHEKFEELYDHAAETV 61

Query: 77 DEMSERLITIGGAPYSTLAEFSKHSKLDEAKGTYDKTVAQHLARLVEVYLYLSSLYQVGL 136
D ++ERL+ IGG P +T+ E+++H+ + + + + ++ + LV Y +SS + +
Sbjct: 62 DTIAERLLAIGGQPVATVKEYTEHASITDGGN--ETSASEMVQALVNDYKQISSESKFVI 119

Query: 137 DITDEEGDAGTNDLFTAAKTEAEKTIWMLQAERG 170
+ +E D T DLF E EK +WML + G
Sbjct: 120 GLAEENQDNATADLFVGLIEEVEKQVWMLSSYLG 153


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_1549  PREPILNPTASE  31  0.003  Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 30.9 bits (70), Expect = 0.003
Identities = 42/160 (26%), Positives = 59/160 (36%), Gaps = 25/160 (15%)

Query: 78 SLIIILWASMVHWVSASYCYLLLFSLLFSLF--DWRSQ------EYPFILWLFSFVSLLL 129
+L+ + A + + LLL +L +L D P + F L
Sbjct: 118 ALLSVAVAMTLAPGWGTLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGG 177

Query: 130 FYSIN---------YLSLILLLLGLLAHLRPFSIGAGDFFYLASLALVLDLTSLIWLIQL 180
F S+ YL L L +G GDF LA+L L +L ++ L
Sbjct: 178 FVSLGDAVIGAMAGYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLL 237

Query: 181 ASLAGITACLLL-------GIKRIPFIPYLSFGLFWIVLL 213
+SL G + L K IPF PYL+ WI LL
Sbjct: 238 SSLVGAFMGIGLILLRNHHQSKPIPFGPYLAIA-GWIALL 276


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_1552  NUCEPIMERASE  32  0.004  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 31.7 bits (72), Expect = 0.004
Identities = 13/76 (17%), Positives = 34/76 (44%), Gaps = 9/76 (11%)

Query: 50 LAQSLKTKKNQLVGLLLPDISNPFF-PRLARGAEEYLKEKGYRVMLGNISDSEALEE--- 105
+++ L +Q+VG+ D N ++ L + E L + G++ +++D E + +
Sbjct: 16 VSKRLLEAGHQVVGI---DNLNDYYDVSLKQARLELLAQPGFQFHKIDLADREGMTDLFA 72

Query: 106 --EYVHVLLQSNAAGI 119
+ V + + +
Sbjct: 73 SGHFERVFISPHRLAV 88


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_1554  LPSBIOSNTHSS  153  2e-50  Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature.

Length = 166

Score = 153 bits (388), Expect = 2e-50
Identities = 58/157 (36%), Positives = 94/157 (59%), Gaps = 2/157 (1%)

Query: 5 IGLYTGSFDPVTNGHLDIVKRASGLFDQIYVGIFDNPTKKSYFKLEVRKAMLTQALADFT 64
+Y GSFDP+T GHLDI++R LFDQ+YV + NP K+ F ++ R + +A+A
Sbjct: 2 NAIYPGSFDPITFGHLDIIERGCRLFDQVYVAVLRNPNKQPMFSVQERLEQIAKAIAHLP 61

Query: 65 NVIVVTSHERLAIDVAKELRVTHLIRGLRNATDFEYEENLEYFNHLLAPNIETVYLISRN 124
N V + E L ++ A++ + ++RGLR +DFE E + N LA ++ETV+L +
Sbjct: 62 NAQVDSF-EGLTVNYARQRQAGAILRGLRVLSDFELELQMANTNKTLASDLETVFLTTST 120

Query: 125 KWQALSSSRVRELIHFQSSLEGLVPQSVIAQV-EKMN 160
++ LSSS V+E+ F ++E VP V A + ++ +
Sbjct: 121 EYSFLSSSLVKEVARFGGNVEHFVPSHVAAALYDQFH 157


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
spyM18_1558  CARBMTKINASE  404  e-144  Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 404 bits (1039), Expect = e-144
Identities = 140/315 (44%), Positives = 203/315 (64%), Gaps = 6/315 (1%)

Query: 3 KQKIVVALGGNAIL--STDASAKAQQEALMSTSKSLVKLIKEGHEVIVTHGNGPQVGNLL 60
+++V+ALGGNA+ S + + + T++ + ++I G+EV++THGNGPQVG+LL
Sbjct: 2 GKRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSLL 61

Query: 61 LQQAAADSEKN-PAMPLDTCVAMTEGSIGFWLVNALDNELQAQGIQKEVAAVVTQVIVDA 119
L A + PA P+D AM++G IG+ + AL NEL+ +G++K+V ++TQ IVD
Sbjct: 62 LHMDAGQATYGIPAQPMDVAGAMSQGWIGYMIQQALKNELRKRGMEKKVVTIITQTIVDK 121

Query: 120 KDPAFENPTKPIGPFLTEEDAKKQMAESGASFKEDAGRGWRKVVPSPKPVGIKEANVIRS 179
DPAF+NPTKP+GPF EE AK+ E G KED+GRGWR+VVPSP P G EA I+
Sbjct: 122 NDPAFQNPTKPVGPFYDEETAKRLAREKGWIVKEDSGRGWRRVVPSPDPKGHVEAETIKK 181

Query: 180 LVDSGVVVVSAGGGGVPVVEDATSKSLTGVEAVIDKDFASQTLSGLVDADLFIVLTGVDN 239
LV+ GV+V+++GGGGVPV+ + + GVEAVIDKD A + L+ V+AD+F++LT V+
Sbjct: 182 LVERGVIVIASGGGGVPVILED--GEIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVNG 239

Query: 240 VYINFNKPDQAKLEEVTVSQMKEYITQDQFAPGSMLPKVEAAIAFVENKPNAKAIITSLE 299
+ + + L EV V ++++Y + F GSM PKV AAI F+E +AII LE
Sbjct: 240 AALYYGTEKEQWLREVKVEELRKYYEEGHFKAGSMGPKVLAAIRFIEW-GGERAIIAHLE 298

Query: 300 NIDNVLSANAGTQII 314
L GTQ++
Sbjct: 299 KAVEALEGKTGTQVL 313
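
The "Score = ..., Expect = ..." lines can be read programmatically and compared with the E-value cutoff listed under the input parameters (0.05). The following is a minimal sketch under assumptions: `parse_score_line` and `E_VALUE_CUTOFF` are hypothetical names, and the regular expression is inferred from the lines on this page. E-values printed without a mantissa, such as `e-144`, are treated as `1e-144`.

```python
import re

# Cutoff taken from the input parameters shown on this page (E-value (RPSBlast) 0.05).
E_VALUE_CUTOFF = 0.05

# Matches e.g. "Score = 404 bits (1039), Expect = e-144"; layout assumed from this page.
SCORE_RE = re.compile(r"Score = ([\d.]+) bits \((\d+)\), Expect = (\S+)")

def parse_score_line(line):
    """Return (bit_score, raw_score, e_value), or None if the line does not match."""
    m = SCORE_RE.search(line)
    if m is None:
        return None
    bits, raw, expect = m.groups()
    if expect.startswith("e"):
        # "e-144" has no mantissa; float() needs "1e-144".
        expect = "1" + expect
    return float(bits), int(raw), float(expect)

if __name__ == "__main__":
    for text in ("Score = 404 bits (1039), Expect = e-144",
                 "Score = 31.7 bits (72), Expect = 0.004"):
        bits, raw, e_value = parse_score_line(text)
        # Prints the parsed values and whether the hit passes the cutoff.
        print(bits, raw, e_value, e_value <= E_VALUE_CUTOFF)
```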


19  spyM18_1575  spyM18_1623  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
spyM18_1575  -1      19       3.393047   hypothetical protein
spyM18_1577  -1      17       3.401839   oxidoreductase
spyM18_1578  -1      17       2.096488   hypothetical protein
spyM18_1579  1       19       3.808768   valyl-tRNA synthetase
spyM18_1581  -1      20       1.688611   hypothetical protein
spyM18_1582  -1      19       1.802580   hypothetical protein
spyM18_1583  -1      17       1.350339   hypothetical protein
spyM18_1584  -2      17       1.585156   hypothetical protein
spyM18_1585  -2      18       2.001489   *3-deoxy-7-phosphoheptulonate synthase
spyM18_1586  -2      18       3.189880   3-dehydroquinate synthase
spyM18_1588  1       19       4.152303   acetate kinase
spyM18_1589  2       18       3.338054   hypothetical protein
spyM18_1590  0       18       2.960254   SAM-dependent methyltransferase
spyM18_1591  1       19       2.625402   hypothetical protein
spyM18_1592  1       16       2.334879   shikimate 5-dehydrogenase
spyM18_1594  1       16       1.768507   beta-galactosidase
spyM18_1595  1       17       0.032090   two-component sensor response regulator
spyM18_1596  0       18       0.941694   two-component sensor histidine kinase
spyM18_1597  2       19       1.927470   hypothetical protein
spyM18_1598  0       17       2.531390   hypothetical protein
spyM18_1599  0       19       3.030348   sugar ABC transporter substrate-binding protein
spyM18_1601  0       18       3.664719   sugar ABC transporter substrate-binding protein
spyM18_1602  -1      18       4.426570   transcriptional regulator
spyM18_1605  -2      16       3.860119   beta-glucosidase
spyM18_1606  -2      17       3.364100   hyaluronidase
spyM18_1609  -2      16       2.658456   transcription regulator
spyM18_1610  -2      17       2.198447   hypothetical protein
spyM18_1611  -2      16       1.596712   hypothetical protein
spyM18_1614  -1      17       -1.474426  hypothetical protein
spyM18_1615  0       13       -1.209755  RNA methyltransferase
spyM18_1616  2       17       -4.111162  recombination regulator RecX
spyM18_1617  0       18       -3.787692  hypothetical protein
spyM18_1618  0       19       -3.896567  hypothetical protein
spyM18_1619  0       19       -3.201776  hypothetical protein
spyM18_1623  -1      19       -3.040765  ********hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
spyM18_1579  RTXTOXIND  35     0.002    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 34.8 bits (80), Expect = 0.002
Identities = 11/73 (15%), Positives = 27/73 (36%), Gaps = 6/73 (8%)

Query: 805 YLPLADLLNVEEELARLDKELAKWQKELDMVGKKLGNERFVANAKPEVVQKEKDKQADYQ 864
+ +L E + EL ++ +L+ + ++ +AK E + + +
Sbjct: 248 AIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEI------LSAKEEYQLVTQLFKNEIL 301

Query: 865 AKYDATQERIAEM 877
K T + I +
Sbjct: 302 DKLRQTTDNIGLL 314


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
spyM18_1595  HTHFIS  84     2e-19    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 84.1 bits (208), Expect = 2e-19
Identities = 31/133 (23%), Positives = 50/133 (37%), Gaps = 6/133 (4%)

Query: 3 KVLLVDDEYMILQGLTMIIDWQALGFEVVQTARSGKEALTYLTQYPVDVMISDVTMPGMT 62
+L+ DD+ I L + G++V + ++ D++++DV MP
Sbjct: 5 TILVADDDAAIRTVLNQAL--SRAGYDVR-ITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 63 GLDLIEAAKTYHPQLQTLILSGYQEFSYVQKAMELETKGYLLKPVDKAELQAKMKQFKDC 122
DL+ K P L L++S F KA E YL KP D L +
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFD---LTELIGIIGRA 118

Query: 123 LDAQQAESIRQEA 135
L + + E
Sbjct: 119 LAEPKRRPSKLED 131


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
spyM18_1596  PF06580  181    2e-54    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 181 bits (462), Expect = 2e-54
Identities = 71/324 (21%), Positives = 133/324 (41%), Gaps = 34/324 (10%)

Query: 250 LSKAYRMQYNRSGDLLAYVAVRKSYLLAEAVRTVFVYGLVSLLLAWLLLQLL-FRVFRNY 308
L+ AYR R G L + + A + + V+ W LL + +
Sbjct: 55 LTHAYRSFIKRQG-WLKLNMGQIILRVLPACVVIGMVWFVANTSIWRLLAFINTKPVAFT 113

Query: 309 IQQVSEITDTVEMVAAGDLSLTIDNSHMELELYHISEAINQMLASIKAYIDEVYVLEVEQ 368
+ I V +V + M LY + +A ID+ +
Sbjct: 114 LPLALSIIFNVVVV-----------TFMWSLLYF---GWHFFKNYKQAEIDQWK-MASMA 158

Query: 369 RDAQMRALQSQINPHFLYNTLEYIRMYALSCQQEELADVIYAFASLLRNNI--SQDKMTT 426
++AQ+ AL++QINPHF++N L IR L + +++ + + L+R ++ S + +
Sbjct: 159 QEAQLMALKAQINPHFMFNALNNIRALILE-DPTKAREMLTSLSELMRYSLRYSNARQVS 217

Query: 427 LKEELAFCEKYIYLYQMRYPDSFAYHVKIDESIADLAIPKFVIQPLVENYFVHGIDYSRH 486
L +EL + Y+ L +++ D + +I+ +I D+ +P ++Q LVEN HGI
Sbjct: 218 LADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQ 277

Query: 487 DNALSIKALDETDHLLIQVLDNGRGISQERLADMEKRLQEHQTTGNSSIGLQNVYLRLFH 546
+ +K + + ++V + G L T ++ GLQNV RL
Sbjct: 278 GGKILLKGTKDNGTVTLEVENTG-------------SLALKNTKESTGTGLQNVRERLQM 324

Query: 547 HFRDRVSWSMAKEPNGGFIIQIRI 570
+ ++++ G + I
Sbjct: 325 LYGTEAQIKLSEKQ-GKVNAMVLI 347


20  spyM18_1686  spyM18_1707  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
spyM18_1686  0       14       3.344803   hypothetical protein
spyM18_1687  -1      14       3.543006   transketolase
spyM18_1689  0       13       2.788859   transaldolase
spyM18_1692  0       13       2.524654   trans-acting positive regulator
spyM18_1693  0       12       2.911016   NADH peroxidase
spyM18_1694  -1      14       3.445969   glycerol uptake facilitator
spyM18_1695  -2      14       3.189831   alpha-glycerophosphate oxidase
spyM18_1696  0       15       2.467452   glycerol kinase
spyM18_1698  -1      14       2.367318   hypothetical protein
spyM18_1699  -1      13       3.255985   hypothetical protein
spyM18_1700  -1      11       2.444303   glycyl-tRNA synthetase subunit beta
spyM18_1701  -1      9        1.544898   glycyl-tRNA synthetase subunit alpha
spyM18_1703  -1      10       0.750026   hypothetical protein
spyM18_1704  -1      10       0.809602   reductase/dehydrogenase
spyM18_1705  -2      9        0.931675   N-acetylglucosamine-6-phosphate deacetylase
spyM18_1706  0       12       0.501249   hypothetical protein
spyM18_1707  2       10       0.950305   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
spyM18_1692  PF05043  55     4e-10    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 54.9 bits (132), Expect = 4e-10
Identities = 30/162 (18%), Positives = 71/162 (43%), Gaps = 7/162 (4%)

Query: 3 IEDLMDKERRAQYRLLVTLYHAKETLRLKDLMRLSNLSKVTLLKYIDNLNHLCREQGLAC 62
+ DL+ K+ Q LL L+ K +L L N ++ + + ++ +
Sbjct: 1 MRDLLSKKSHRQLELLELLFEHKRWFHRSELAELLNCTERAVKDDLSHVKSAFPDLIF-- 58

Query: 63 QLLLEKDSLSLKENGQFHWEDLVALLLKESVAYQILTYMYCHEHFNITNLSVELMVSEAT 122
+ + E + K S + IL +++ +E ++ E +S ++
Sbjct: 59 -HSSTNGIRIINTDDS-DIEMVYHHFFKHSTHFSILEFIFFNEGCQAESICKEFYISSSS 116

Query: 123 LNRQLAHLNQLLS---EFDLALSQGRQLGSELQWRYFYFELF 161
L R ++ +N+++ +F+++L+ + +G+E RYF+ + F
Sbjct: 117 LYRIISQINKVIKRQFQFEVSLTPVQIIGNERDIRYFFAQYF 158


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
spyM18_1698  THERMOLYSIN  40     1e-06    Thermolysin metalloprotease (M4) family signature.
		>THERMOLYSIN#Thermolysin metalloprotease (M4) family signature.

Length = 544

Score = 40.0 bits (93), Expect = 1e-06
Identities = 15/78 (19%), Positives = 29/78 (37%), Gaps = 3/78 (3%)

Query: 69 NQPKTSQTSKKVKLSEDKAKSIALKDASVTEADAQMLSVTQDNEDGKAVYEIEFQNKDQE 128
+ S ++ +D A + + + E L + D E + YE+ +
Sbjct: 134 TEAAISIQQAEMIAKQDVADRVTKERPAAEEGKPTRLVIYPDEETPRLAYEVNVRFLTPV 193

Query: 129 ---YSYTIDANSGDIVEK 143
+ Y IDA G ++ K
Sbjct: 194 PGNWIYMIDAADGKVLNK 211


21  spyM18_1750  spyM18_1756  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
spyM18_1750  4       26       3.353778   hypothetical protein
spyM18_1752  3       25       1.948449   hypothetical protein
spyM18_1753  4       25       2.252014   hypothetical protein
spyM18_1754  3       24       2.159527   hypothetical protein
spyM18_1755  2       17       2.667088   hypothetical protein
spyM18_1756  2       17       2.527015   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
spyM18_1750  FLGFLGJ  94     1e-23    Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 93.6 bits (232), Expect = 1e-23
Identities = 46/123 (37%), Positives = 65/123 (52%), Gaps = 8/123 (6%)

Query: 23 SLTAAQAILESGWGKYA-------PHNALFGIKADSSWTGKSFNTKTQEEYQPGIVTDIV 75
L AQA LESGWG+ P LFG+KA +W G T E Y+ G +
Sbjct: 172 HLILAQAALESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTE-YENGEAKKVK 230

Query: 76 DRFRAYDSWEDSIADHGQFLVDNPRYKAVIGEADYKKACHAIKDAGYATASGYAELLIQL 135
+FR Y S+ ++++D+ L NPRY AV A ++ A++DAGYAT YA L +
Sbjct: 231 AKFRVYSSYLEALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNM 290

Query: 136 IEE 138
I++
Sbjct: 291 IQQ 293


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
spyM18_1756  RTXTOXIND  41     2e-05    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 40.6 bits (95), Expect = 2e-05
Identities = 28/186 (15%), Positives = 61/186 (32%), Gaps = 12/186 (6%)

Query: 154 KSLLEQKTAQLGLTVDGLKLDLNKANKQTASLQASIDGLRQDYRDADRQLSANYQAGLNG 213
SLL+ + Q + ++LNK + + + ++ L +
Sbjct: 141 SSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQN 200

Query: 214 LKAQLTNDKIGLQAEIQATAQGLSQKYDNELRQLSAKITTTSSGTTEAY--ENKLEGLRA 271
K Q + +AE T +Y+N R +++ SS + ++ +
Sbjct: 201 QKYQKELNLDKKRAERL-TVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQEN 259

Query: 272 EFTRSNQGMRT------ELESQISGLRAVQQSTASQISQEIRNREGAVSRVQQNLASYQR 325
++ + +R ++ES+I + Q EI + + N+
Sbjct: 260 KYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEIL---DKLRQTTDNIGLLTL 316

Query: 326 RLQSAE 331
L E
Sbjct: 317 ELAKNE 322


22  spyM18_1771  spyM18_1811  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
spyM18_1771  2       19       0.583998   hypothetical protein
spyM18_1772  3       20       0.904116   major head protein
spyM18_1774  3       21       0.637697   hypothetical protein
spyM18_1775  4       21       0.620803   hypothetical protein
spyM18_1776  4       22       0.024074   hypothetical protein
spyM18_1777  4       21       -0.113228  minor capsid protein
spyM18_1778  2       22       -0.859096  minor capsid protein
spyM18_1779  2       22       -2.508743  hypothetical protein
spyM18_1780  4       24       -4.046889  hypothetical protein
spyM18_1781  2       26       -5.743912  hypothetical protein
spyM18_1782  1       23       -4.251777  hypothetical protein
spyM18_1783  2       25       -3.451118  hypothetical protein
spyM18_1784  4       26       -2.763054  hypothetical protein
spyM18_1785  5       29       -2.226124  hypothetical protein
spyM18_1786  7       28       -2.728018  hypothetical protein
spyM18_1787  5       28       -1.126761  hypothetical protein
spyM18_1788  7       31       -1.374686  hypothetical protein
spyM18_1789  5       28       -0.523898  hypothetical protein
spyM18_1790  5       24       -0.146446  hypothetical protein
spyM18_1791  4       25       -0.479627  hypothetical protein
spyM18_1792  4       23       -1.124770  hypothetical protein
spyM18_1793  5       25       -1.109215  hypothetical protein
spyM18_1794  4       22       -0.767140  hypothetical protein
spyM18_1795  5       23       -0.925344  single strand binding protein
spyM18_1796  4       21       -2.332612  hypothetical protein
spyM18_1797  5       20       -3.699532  hypothetical protein
spyM18_1798  5       23       -3.554433  hypothetical protein
spyM18_1799  3       21       -2.892865  hypothetical protein
spyM18_1800  3       18       -3.475739  hypothetical protein
spyM18_1801  4       17       -4.548076  phage DNA replication protein
spyM18_1803  3       18       -4.571987  hypothetical protein
spyM18_1804  0       18       -3.296669  hypothetical protein
spyM18_1802  -2      16       -2.226746  hypothetical protein
spyM18_1805  -2      23       -1.365530  phage repressor protein
spyM18_1806  -2      25       -0.411102  hypothetical protein
spyM18_1808  -1      30       1.062021   phage integrase
spyM18_1809  0       38       2.714334   hypothetical protein
spyM18_1810  -1      37       3.096562   mannose-specific phosphotransferase system
spyM18_1811  -1      28       3.181350   PTS system mannose-specific transporter subunit
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
spyM18_1789  PF06580  26     0.021    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 26.0 bits (57), Expect = 0.021
Identities = 7/45 (15%), Positives = 19/45 (42%)

Query: 29 LFLAIAIFGMMVTVSYFSYRDARQYYESQITGLRTQLSRTQKQLK 73
+ + + M ++ YF + + Y +++I + + QL
Sbjct: 120 IIFNVVVVTFMWSLLYFGWHFFKNYKQAEIDQWKMASMAQEAQLM 164


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
spyM18_1797  ANTHRAXTOXNA  28     0.026    Anthrax toxin LF subunit signature.
		>ANTHRAXTOXNA#Anthrax toxin LF subunit signature.

Length = 800

Score = 27.8 bits (61), Expect = 0.026
Identities = 23/89 (25%), Positives = 40/89 (44%), Gaps = 1/89 (1%)

Query: 71 QAEAKVEKYKETIRRAMELSQKKKVDAGMFKVSLRKSKKVEILDETKIPLDYMQEKIEYK 130
+ A E Y E+ + ++K K + FK S+ K E +ET + Q+ ++
Sbjct: 30 EVNAMNEHYTESDIKRNHKTEKNKTEKEKFKDSINNLVKTEFTNETLDKIQQTQDLLKKI 89

Query: 131 PMKS-EISKALKSGIDISGVELIETESLQ 158
P EI L I + ++L+E + LQ
Sbjct: 90 PKDVLEIYSELGGEIYFTDIDLVEHKELQ 118


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
spyM18_1806  FbpA_PF05833  36     3e-04    Fibronectin-binding protein
		>FbpA_PF05833#Fibronectin-binding protein

Length = 577

Score = 36.4 bits (84), Expect = 3e-04
Identities = 33/140 (23%), Positives = 53/140 (37%), Gaps = 11/140 (7%)

Query: 207 LSNITIKNVDTYRNKLAKTFETLNKLFEIDGVKISKELLTSKLKQL-------DILYKYQ 259
L I + N++ K TL K + D K+ ELLT+ + L ++ Y
Sbjct: 304 LQKIVMNNINRCTKKDKILNNTLKKCEDKDIFKLYGELLTANIYALKKGLSHIELANYYS 363

Query: 260 KQIEIEKELLKAQKEEIREQQKAEKEIQQAKAKLKKEERQFNNEMSKLLKYLNGAQNEIE 319
+ + K L K + Q K+ + K + Q + + L YL I
Sbjct: 364 ENYDTVKITLDENKTPSQNVQSYYKKYNKLKKSEEAANEQL-LQNEEELNYLYSVLTNIN 422

Query: 320 QQIYADKIKELEDKIKELEK 339
AD E+E+ KEL +
Sbjct: 423 N---ADNYDEIEEIKKELIE 439


23  spyM18_1974  spyM18_1993  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
spyM18_1974  1       16       -3.765095  type I site-specific deoxyribonuclease
spyM18_1975  4       20       -6.574345  specificity determinant HsdS
spyM18_1976  3       19       -7.156191  type I site-specific deoxyribonuclease
spyM18_1977  5       23       -8.199301  hypothetical protein
spyM18_1978  5       24       -8.200259  response regulator of salivaricin regulon
spyM18_1979  4       19       -6.986194  SalK-like protein
spyM18_1980  1       16       -5.127045  ABC transporter permease
spyM18_1981  1       15       -3.946022  ABC transporter ATP-binding protein
spyM18_1982  1       16       -3.196431  salivaricin A modification enzyme
spyM18_1983  1       24       -1.534713  lantibiotic
spyM18_1984  1       25       -1.473681  6-phospho-beta-galactosidase
spyM18_1985  1       27       -1.483366  PTS system lactose-specific transporter subunit
spyM18_1986  2       22       -2.506635  PTS system lactose-specific transporter subunit
spyM18_1987  1       22       -2.787312  tagatose 1,6-diphosphate aldolase
spyM18_1989  1       20       -3.591882  tagatose-6-phosphate kinase
spyM18_1990  3       27       -2.423129  galactose-6-phosphate isomerase subunit LacB
spyM18_1991  4       28       -1.697767  galactose-6-phosphate isomerase subunit LacA
spyM18_1993  4       27       -1.445974  lactose phosphotransferase system repressor
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
spyM18_1978  HTHFIS  46     3e-08    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 46.0 bits (109), Expect = 3e-08
Identities = 21/118 (17%), Positives = 51/118 (43%), Gaps = 6/118 (5%)

Query: 2 KILLIDDHRLFAKSIQLLFQQYD-EVDVIDTITSHFNDVTIDLSKYDIILLDINLTNISK 60
IL+ DD + + +V + + + + D+++ D+ +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWI--AAGDGDLVVTDVVM---PD 59

Query: 61 ENGLEIAKELIQSTPHLKVVMLTGYVKSIYRERAKKVGAYGFVDKNIDPKQLISILKK 118
EN ++ + ++ P L V++++ + +A + GAY ++ K D +LI I+ +
Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
spyM18_1993  ARGREPRESSOR  30     0.006    Bacterial arginine repressor signature.
		>ARGREPRESSOR#Bacterial arginine repressor signature.

Length = 149

Score = 29.8 bits (67), Expect = 0.006
Identities = 21/85 (24%), Positives = 38/85 (44%), Gaps = 11/85 (12%)

Query: 1 MKKKERHEKILDILKVDGFIKVKDIIDEM-----NISDMTARRDLDTLADKGLL-IRTHG 54
M K +RH KI +I+ + +++D + N++ T RD+ L L+ + T+
Sbjct: 1 MNKGQRHIKIREIITANEIETQDELVDILKKDGYNVTQATVSRDIKEL---HLVKVPTNN 57

Query: 55 GAQYLDYSSAKDEGHEKTHTEKKVL 79
G+ YS D+ K+ L
Sbjct: 58 GSYK--YSLPADQRFNPLSKLKRSL 80


24  spyM18_2008  spyM18_2060  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
spyM18_2008  0       15       3.032508   cysteinyl-tRNA synthetase
spyM18_2009  0       16       3.186757   hypothetical protein
spyM18_2010  0       19       4.089984   hypothetical protein
spyM18_2012  0       19       3.930519   serine acetyltransferase
spyM18_2013  -1      17       3.273609   hypothetical protein
spyM18_2014  -1      18       3.190369   polynucleotide phosphorylase
spyM18_2015  -1      17       2.367404   transaldolase
spyM18_2016  -1      19       2.378624   PTS system ascorbate-specific transporter
spyM18_2017  -2      22       1.172487   hypothetical protein
spyM18_2018  -2      19       1.422904   hypothetical protein
spyM18_2020  -1      21       1.470945   hypothetical protein
spyM18_2021  -1      18       1.651975   30S ribosomal protein S15
spyM18_2023  -2      18       3.610834   hypothetical protein
spyM18_2024  -2      17       3.842443   hypothetical protein
spyM18_2025  -2      16       3.604248   peptide deformylase
spyM18_2026  -1      15       3.442993   hypothetical protein
spyM18_2027  0       16       3.314644   transcriptional regulator
spyM18_2028  0       16       3.333276   DNA polymerase III PolC
spyM18_2030  -2      14       2.302864   prolyl-tRNA synthetase
spyM18_2031  -2      13       2.525278   hypothetical protein
spyM18_2032  -1      14       3.031493   phosphatidate cytidylyltransferase
spyM18_2033  -2      14       3.457831   undecaprenyl pyrophosphate synthase
spyM18_2035  -1      15       3.914859   preprotein translocase subunit YajC
spyM18_2037  -2      14       3.469230   hypothetical protein
spyM18_2038  -2      15       3.464591   pullulanase
spyM18_2039  -1      19       3.658234   dextran glucosidase
spyM18_2040  0       12       2.143387   sugar ABC transporter ATP-binding protein
spyM18_2041  0       14       1.927587   leucine-rich protein
spyM18_2042  0       15       1.754123   streptokinase
spyM18_2044  2       19       2.613063   D-tyrosyl-tRNA(Tyr) deacylase
spyM18_2045  0       17       3.228521   (p)ppGpp synthetase
spyM18_2046  2       18       2.878883   protective antigen
spyM18_2047  1       24       4.855077   transcriptional regulator
spyM18_2048  1       20       5.040764   flavoprotein NrdI
spyM18_2049  1       19       5.759196   hypothetical protein
spyM18_2050  0       19       5.680096   PTS system glucose-specific transporter subunit
spyM18_2051  1       24       5.932459   16S ribosomal RNA methyltransferase RsmE
spyM18_2052  -1      23       5.660660   ribosomal protein L11 methyltransferase
spyM18_2053  -1      23       5.026514   hypothetical protein
spyM18_2055  -1      24       4.828454   amidase
spyM18_2056  -2      23       2.718220   para-aminobenzoate synthetase
spyM18_2057  1       19       -1.083824  anthranilate synthase component II
spyM18_2058  1       18       -3.026352  recombination factor protein RarA
spyM18_2060  -1      15       -3.586990  *pai1 protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
spyM18_2031  PF04605  30     0.008    Virulence-associated protein D (VapD)
		>PF04605#Virulence-associated protein D (VapD)

Length = 125

Score = 29.8 bits (67), Expect = 0.008
Identities = 8/44 (18%), Positives = 17/44 (38%), Gaps = 2/44 (4%)

Query: 227 INGYKVTSWNDLTEAV-DLATRD-LGPSQTIKVTYKSHQRLKTV 268
+ ++ L E + DL +D + +Q+LK +
Sbjct: 80 FDITEIGEQYSLKETIQDLCAKDFHQKLKEFTEKTPKNQKLKDL 123


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
spyM18_2040  PF05272  35     6e-04    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 35.0 bits (80), Expect = 6e-04
Identities = 14/56 (25%), Positives = 20/56 (35%), Gaps = 9/56 (16%)

Query: 34 IVFVGPSGCGKSTTLRMIAGLEDISEGELKIGGEVVNDKSPKDRDIAMVFQNYALY 89
+V G G GKST + + GL+ S+ IG +D Y
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIG---------TGKDSYEQIAGIVAY 645


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
spyM18_2041  HTHFIS  34     7e-04    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 33.7 bits (77), Expect = 7e-04
Identities = 10/30 (33%), Positives = 19/30 (63%)

Query: 229 ALWSEHGNLVQTAQRLYIHRNSLQYKLDKF 258
AL + GN ++ A L ++RN+L+ K+ +
Sbjct: 444 ALTATRGNQIKAADLLGLNRNTLRKKIREL 473


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
spyM18_2042  STREPKINASE  793    0.0      Streptococcus streptokinase protein signature.
		>STREPKINASE#Streptococcus streptokinase protein signature.

Length = 440

Score = 793 bits (2049), Expect = 0.0
Identities = 388/440 (88%), Positives = 413/440 (93%)

Query: 1 MKNYLSFGMFALLFALTFGTVKPVQAIAGYEWLLDRPSVNNSQLVVSMAGIVEGTDKKVF 60
MKNYLSFGMFALLFALTFGTV VQAIAG EWLLDRPSVNNSQLVVS+AG VEGT++ +
Sbjct: 1 MKNYLSFGMFALLFALTFGTVNSVQAIAGPEWLLDRPSVNNSQLVVSVAGTVEGTNQDIS 60

Query: 61 INFFEIDLTSQPAHGGKTEQGLSPKSKPFATDNGAMPHKLEKADLLKAIQERLIANVHSN 120
+ FFEIDLTS+PAHGGKTEQGLSPKSKPFATD+GAM HKLEKADLLKAIQE+LIANVHSN
Sbjct: 61 LKFFEIDLTSRPAHGGKTEQGLSPKSKPFATDSGAMSHKLEKADLLKAIQEQLIANVHSN 120

Query: 121 DGYFEVIDFASDATITDRDDNIYFANQDGSVTLPTQPIQQFLLRGHVRVRPYKEKPIQTP 180
D YFEVIDFASDATITDR+ +YFA++DGSVTLPTQP+Q+FLL GHVRVRPYKEKPIQ
Sbjct: 121 DDYFEVIDFASDATITDRNGKVYFADKDGSVTLPTQPVQEFLLSGHVRVRPYKEKPIQNQ 180

Query: 181 AKSVDIRYTVQFTPLNPDDDFKPVLKDTKLLKTLAIGDTITSQELLAQAQSILNESHSDY 240
AKSVD+ YTVQFTPLNPDDDF+P LKDTKLLKTLAIGDTITSQELLAQAQSILN++H Y
Sbjct: 181 AKSVDVEYTVQFTPLNPDDDFRPGLKDTKLLKTLAIGDTITSQELLAQAQSILNKNHPGY 240

Query: 241 TIYERDSSIVTHDNDIFRTILPMDQEFTYHIKDREQAYGINKKSGQEEKTNNTDLISEKY 300
TIYERDSSIVTHDNDIFRTILPMDQEFTY +K+REQAY INKKSG E+ NNTDLISEKY
Sbjct: 241 TIYERDSSIVTHDNDIFRTILPMDQEFTYRVKNREQAYRINKKSGLNEEINNTDLISEKY 300

Query: 301 YVLKKGEKPYDPFDRSHLKLFTINYVDVNTNKLLKSEQLLTASERNLDFRDLYDPRDKAK 360
YVLKKGEKPYDPFDRSHLKLFTI YVDV+TN+LLKSEQLLTASERNLDFRDLYDPRDKAK
Sbjct: 301 YVLKKGEKPYDPFDRSHLKLFTIKYVDVDTNELLKSEQLLTASERNLDFRDLYDPRDKAK 360

Query: 361 LLYNNLDAFGIMDYTLTGKVEDNHDKNNRVVTVYMGKRPEGENASYHLAYDKDRYTEEER 420
LLYNNLDAFGIMDYTLTGKVEDNHD NR++TVYMGKRPEGENASYHLAYDKDRYTEEER
Sbjct: 361 LLYNNLDAFGIMDYTLTGKVEDNHDDTNRIITVYMGKRPEGENASYHLAYDKDRYTEEER 420

Query: 421 EVYSYLRYTGTPIPDNPKDK 440
EVYSYLRYTGTPIPDNP DK
Sbjct: 421 EVYSYLRYTGTPIPDNPNDK 440


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
spyM18_2046  GPOSANCHOR  126    5e-33    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 126 bits (317), Expect = 5e-33
Identities = 107/457 (23%), Positives = 184/457 (40%), Gaps = 40/457 (8%)

Query: 143 IEAIKYRLDSESHLKEELLKQTAELEQRKNAEVDLKSEKKRLEAQIEKVGYDIANKQQEL 202
+ K +L E + ELE RK + ++ L
Sbjct: 94 LSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAAL 153

Query: 203 EKARSDQKELSESIQKLTSRFKKESDAKQKELDEAKAANKSLSESATKTLARSSKITNEL 262
++D ++ E F AK K L+ KAA ++ K L + +
Sbjct: 154 AARKADLEKALEGA----MNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTAD 209

Query: 263 KDKLAASEKDKNRAFQVSSELANKLHETETSRDKALAESKELADKLAVKTAEAEKLMENV 322
K+ E +K LA + + E + + A+ S + K+ AE L
Sbjct: 210 SAKIKTLEAEKA-------ALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQ 262

Query: 323 GSLDRLVESAKREMAQKLAEIDQLTADKAKADAELAAANDTIASLQTELEKVKTELAVSE 382
L++ +E A A+I L A+KA +AE A L + ++ +L S
Sbjct: 263 AELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASR 322

Query: 383 RLIESGKREIAELEKQKDASDKALAESQANVAELEKQKAASDAKVAELEKEVEAAKAEVA 442
+ + E +LE+Q S+ + + ++ + K +A+ +LE++ + ++A
Sbjct: 323 EAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQ 382

Query: 443 DLKAQLAKKEEELEAVKKEKEALEAKIEELKKAHAEELSKLKEMLEKKDHANADLQAEIN 502
L+ L E + V+K E +K+ L+K + E K ++K A L+AE
Sbjct: 383 SLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEAK 442

Query: 503 RLKQELADRIKSLSQGGRASQTNPGSTTAKAG---------------------------- 534
LK++LA + + L++ ++ + AK G
Sbjct: 443 ALKEKLAKQAEELAKLRAGKASDSQTPDAKPGNKAVPGKGQAPQAGTKPNQNKAPMKETK 502

Query: 535 -QLPSTGESANPFFTIAALTVIAGAGMAVVSPKRKEN 570
QLPSTGE+ANPFFT AALTV+A AG+A V +++EN
Sbjct: 503 RQLPSTGETANPFFTAAALTVMATAGVAAVVKRKEEN 539


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
spyM18_2058  HTHFIS  34     0.001    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 33.7 bits (77), Expect = 0.001
Identities = 28/155 (18%), Positives = 46/155 (29%), Gaps = 38/155 (24%)

Query: 8 RMRPKTISEVIGQKHLVGEGKIIRRMVE-----ANRLSSMILYGPPGIGKTSIASAIAGT 62
R K + LVG ++ + ++++ G G GK +A A+
Sbjct: 124 RRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDY 183

Query: 63 TRYAFRTF--------------------------NATIDSKKRLQEIAEEAKFSGGLVLL 96
+ F A S R ++ + G L
Sbjct: 184 GKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQ-------AEGGTLF 236

Query: 97 LDEIHRLDKTKQDFLLPLLENGTIIMIGATTENPF 131
LDEI + Q LL +L+ G +G T
Sbjct: 237 LDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRS 271


25  spyM18_2104  spyM18_2111  Y  N  N  Genomic Island
LocusTag     DNBias  CDNBias  %GCBias    Product
spyM18_2104  1       21       4.270300   mitogenic factor
spyM18_2106  3       23       4.495209   low temperature requirement C protein
spyM18_2108  2       23       4.307023   glycerol dehydrogenase
spyM18_2109  1       20       3.822926   fructose-6-phosphate aldolase
spyM18_2110  0       22       3.666319   pyruvate formate-lyase 2
spyM18_2111  2       19       2.561792   PTS system cellobiose-specific transporter
26  spyM18_2135  spyM18_2157  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
spyM18_2135  -1      21       3.407758   transcriptional regulator
spyM18_2136  0       24       3.876950   cold shock protein
spyM18_2137  -1      25       4.092722   *alkyl hydroperoxide reductase
spyM18_2138  -1      25       5.168806   NADH oxidase/alkyl hydroperoxidase reductase
spyM18_2139  -1      22       5.377861   imidazolonepropionase
spyM18_2141  0       25       5.748918   urocanate hydratase
spyM18_2142  -1      27       5.997018   glutamate formiminotransferase
spyM18_2143  0       28       6.110993   formiminotetrahydrofolate cyclodeaminase
spyM18_2144  0       23       4.873052   formate--tetrahydrofolate ligase
spyM18_2145  -2      22       4.162110   hypothetical protein
spyM18_2146  -1      23       3.918527   cationic amino acid transporter
spyM18_2147  -1      19       3.577254   histidine ammonia-lyase
spyM18_2148  -2      16       2.990009   formimidoylglutamase
spyM18_2149  -2      15       2.573378   regulatory protein
spyM18_2151  -2      16       2.538952   30S ribosomal protein S2
spyM18_2152  -2      14       2.498000   elongation factor Ts
spyM18_2153  -1      15       2.300754   endopeptidase O
spyM18_2154  0       13       2.397644   dextran glucosidase
spyM18_2155  1       13       2.023380   PTS system enzyme II
spyM18_2157  2       18       1.800366   transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
spyM18_2138  PF07212  30     0.021    Hyaluronoglucosaminidase
		>PF07212#Hyaluronoglucosaminidase

Length = 336

Score = 30.0 bits (67), Expect = 0.021
Identities = 34/145 (23%), Positives = 58/145 (40%), Gaps = 22/145 (15%)

Query: 242 GGQVMETVGIENMIGTLYT--EGPKLMAEVEAHTKSYDVDIIKAQLATSIEKKENIEVTL 299
G M+ G+E +GTL E P + A + + + +DI+K K++ + T
Sbjct: 205 NGSAMQIRGVEKALGTLKITHENPNVEANYDENAAALSIDIVK--------KQKGGKGTA 256

Query: 300 ANGAVLQAKTAILALGAKWRNINVPGEDEFRNKGVTYCPHCDGPLFEGKDVAVIGGGNSG 359
A G + + + + RN+ +D+F K DG + K + GN
Sbjct: 257 AQGIYINSTSGTTGKLLRIRNLG---DDKFYVKH-------DGGFYAKKTSQI--DGNLK 304

Query: 360 LEAALDLAGLAKHVYVLEFLPELKA 384
L+ A YV + +LKA
Sbjct: 305 LKNPTADDHAATKAYVDSEVKKLKA 329


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
spyM18_2139  UREASE  47     7e-08    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 47.4 bits (113), Expect = 7e-08
Identities = 22/53 (41%), Positives = 32/53 (60%), Gaps = 6/53 (11%)

Query: 46 IAIKDGLIVALG-SGEPDAE-----LVGPQTIMRSYKGKIATPGIIDCHTHLV 92
I +KDG I A+G +G PD + +VGP T + + +GKI T G +D H H +
Sbjct: 88 IGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGKIVTAGGMDSHIHFI 140


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
spyM18_2153  IGASERPTASE  31     0.019    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 30.8 bits (69), Expect = 0.019
Identities = 20/118 (16%), Positives = 35/118 (29%), Gaps = 17/118 (14%)

Query: 438 NAYYDPQQNQIVFPAAILQEPFYSLDQSSSANYGGIGAVIAHEISHAFDT---------N 488
+ + A + S+ N+ +G + + N
Sbjct: 594 YLNLENYTYYALRKGASTRSELPKNSGESNENWLYMGKTSDEAKRNVMNHINNERMNGFN 653

Query: 489 GASFDEHGSLNDWWTQEDYAAFKERTDKIVAQFDGLESHGAKVNGKLTVSENVADLGG 546
G +E G N FK ++++ G G +NG LTV + L G
Sbjct: 654 GYFGEEEGKNNGNLN----VTFKGKSEQNRFLLTG----GTNLNGDLTVEKGTLFLSG 703


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
spyM18_2155  RTXTOXIND  32     0.007    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 32.1 bits (73), Expect = 0.007
Identities = 6/18 (33%), Positives = 12/18 (66%)

Query: 610 LVKQGDQVKAGQTLIQFD 627
+VK+G+ V+ G L++
Sbjct: 111 IVKEGESVRKGDVLLKLT 128


27  spyM18_2196  spyM18_2209  Y  N  N  Genomic Island
LocusTag     DNBias  CDNBias  %GCBias    Product
spyM18_2196  2       22       -2.912981  50S ribosomal protein L32
spyM18_2197  3       20       -2.840226  50S ribosomal protein L33
spyM18_2198  5       19       -2.352019  cadmium resistance protein
spyM18_2199  7       22       -1.814761  cadmium efflux system accessory
spyM18_2201  6       23       -1.232334  hypothetical protein
spyM18_2202  5       20       -0.315644  hypothetical protein
spyM18_2203  4       18       -0.294474  hypothetical protein
spyM18_2207  3       16       0.115900   hypothetical protein
spyM18_2209  2       15       -0.219672  hypothetical protein
28  spyM18_0103  spyM18_0111  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
spyM18_0103  2       20       -3.925518  competence protein, ABC transporter subunit
spyM18_0105  1       21       -2.758390  competence protein
spyM18_0106  -2      14       -1.742900  competence protein
spyM18_0107  -2      15       -1.485738  hypothetical protein
spyM18_0108  -2      15       -0.362697  competence protein
spyM18_0109  -2      15       1.354575   hypothetical protein
spyM18_0110  -3      15       2.063053   hypothetical protein
spyM18_0111  -2      16       2.381548   acetate kinase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
spyM18_0103  BCTERIALGSPF  90     3e-22    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 90.3 bits (224), Expect = 3e-22
Identities = 65/341 (19%), Positives = 135/341 (39%), Gaps = 22/341 (6%)

Query: 37 KKLSSKHQHKFIQLLANLLSTGFSFAEVIAFLKRS--QLLQIDYVLKMEESLLKGQGLAD 94
+LS+ + LA L++ E + + + + + + +++G LAD
Sbjct: 63 IRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLAD 122

Query: 95 MLSGLG--FSDAILTQISLADRHGNIETTLVAIQHYLNQMARIRRKTVEVITYPLILLLF 152
+ F ++ + G+++ L + Y Q ++R + + + YP +L +
Sbjct: 123 AMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLTVV 182

Query: 153 LFVMMLGLRRYLVPQLETQNQ---------------ITYFLNHFPAFFIGFCSGLILLFG 197
++ L +VP++ Q ++ + F + + + F
Sbjct: 183 AIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMAFR 242

Query: 198 MVWLRWRSQSRLKLYSRLSRYPFLGKLLKQYLTSYYAREWGTLIGQGLDLMTILDIMAIE 257
+ LR + + R+ + RL P +G++ + T+ YAR L + L+ + I
Sbjct: 243 V-MLR-QEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300

Query: 258 KSSL-MKELAEDIRMSLLEGQAFHIKVATYPFFKKELSLMIEYGEIKSKLGAELEIYAQE 316
S+ + ++ EG + H + F + MI GE +L + LE A
Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360

Query: 317 SWEQFFSQLYQVTQLIQPAIFLVVAVTIVMIYAAILLPIYQ 357
+F SQ+ L +P + + +A ++ I AIL PI Q
Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQ 401



Score = 34.8 bits (80), Expect = 4e-04
Identities = 32/129 (24%), Positives = 60/129 (46%), Gaps = 6/129 (4%)

Query: 235 REWGTLIGQGLDLMTILDIMAIE-KSSLMKELAEDIRMSLLEGQAFHIKVATYP-FFKKE 292
R+ TL+ + L LD +A + + + +L +R ++EG + + +P F++
Sbjct: 75 RQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLADAMKCFPGSFERL 134

Query: 293 LSLMIEYGEIKSKLGAELEIYA--QESWEQFFSQLYQVTQLIQPAIFLVVAVTIVMIYAA 350
M+ GE L A L A E +Q S++ Q +I P + VVA+ +V I +
Sbjct: 135 YCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQA--MIYPCVLTVVAIAVVSILLS 192

Query: 351 ILLPIYQNM 359
+++P
Sbjct: 193 VVVPKVVEQ 201


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
spyM18_0105  BCTERIALGSPG  53     4e-12    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 52.6 bits (126), Expect = 4e-12
Identities = 28/94 (29%), Positives = 50/94 (53%), Gaps = 4/94 (4%)

Query: 9 RHKKLKGFTLLEMLLVILVISVLMLLFVPNLSKQKDRVTETGNAAVVKLVENQAELYELS 68
K +GFTLLE+++VI++I VL L VPNL K++ + + + +EN ++Y+L
Sbjct: 3 ATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLD 62

Query: 69 QGSKPSLSQ-LKA--DGSITEKQEKAY-QDYYDK 98
P+ +Q L++ + Y ++ Y K
Sbjct: 63 NHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIK 96


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
spyM18_0108  OMPTIN  26     0.037    Omptin serine protease signature.
		>OMPTIN#Omptin serine protease signature.

Length = 317

Score = 26.5 bits (58), Expect = 0.037
Identities = 17/71 (23%), Positives = 25/71 (35%), Gaps = 9/71 (12%)

Query: 37 LLKRSHYLARHDQDNWLLFSHQL--REELSGARFYKVADNK-LYVEKGKKVLAFGQFKSH 93
K S ++ D D ++ R ++ +Y VA N YV KV G +
Sbjct: 217 TFKYSGWVESSDNDEHYDPGKRITYRSKVKDQNYYSVAVNAGYYVTPNAKVYVEGAWNRV 276

Query: 94 DFRKSASNGKG 104
N KG
Sbjct: 277 T------NKKG 281


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
spyM18_0111  ACETATEKNASE  500    e-180    Acetate kinase family signature.
		>ACETATEKNASE#Acetate kinase family signature.

Length = 400

Score = 500 bits (1290), Expect = e-180
Identities = 209/401 (52%), Positives = 281/401 (70%), Gaps = 7/401 (1%)

Query: 3 KTIAINAGSSSLKWQLYQMPEEEVLAQGIIERIGLKDSISTVKYDGKKEEQILDIHDHTE 62
K + IN GSSSLK+QL + + VLA+G+ ERIG+ DS+ T +G+K + D+ DH +
Sbjct: 2 KILVINCGSSSLKYQLIESKDGNVLAKGLAERIGINDSLLTHNANGEKIKIKKDMKDHKD 61

Query: 63 AVKILLNDLI--HFGIIAAYDEITGVGHRVVAGGELFKESVVVNDKVLEHIEELSVLAPL 120
A+K++L+ L+ +G+I EI VGHRVV GGE F SV++ D VL+ I + LAPL
Sbjct: 62 AIKLVLDALVNSDYGVIKDMSEIDAVGHRVVHGGEYFTSSVLITDDVLKAITDCIELAPL 121

Query: 121 HNPGAAAGIRAFRDILPDITSVCVFDTSFHTSMAKHTYLYPIPQKYYTDYKVRKYGAHGT 180
HNP GI+A I+PD+ V VFDT+FH +M + YLYPIP +YYT YK+RKYG HGT
Sbjct: 122 HNPANIEGIKACTQIMPDVPMVAVFDTAFHQTMPDYAYLYPIPYEYYTKYKIRKYGFHGT 181

Query: 181 SHKYVAQEAAKMLGRPLEELKLITAHIGNGVSITANYHGKSVDTSMGFTPLAGPMMGTRS 240
SHKYV+Q AA++L +P+E LK+IT H+GNG SI A +GKS+DTSMGFTPL G MGTRS
Sbjct: 182 SHKYVSQRAAEILNKPIESLKIITCHLGNGSSIAAVKNGKSIDTSMGFTPLEGLAMGTRS 241

Query: 241 GDIDPAIIPYLIEQDPELKDAADVVNMLNKKSGLSGVSGISSDMRDI-EAGLQEDNPDAV 299
G IDP+II YL+E+ E A +VVN+LNKKSG+ G+SGISSD RD+ +A + + A
Sbjct: 242 GSIDPSIISYLMEK--ENISAEEVVNILNKKSGVYGISGISSDFRDLEDAAFKNGDKRAQ 299

Query: 300 LAYNIFIDRIKKCIGQYFAVLNGADALVFTAGMGENAPLMRQDVIGGLTWFGMDIDPEKN 359
LA N+F R+KK IG Y A + G D +VFTAG+GEN P +R+ ++ GL + G +D EKN
Sbjct: 300 LALNVFAYRVKKTIGSYAAAMGGVDVIVFTAGIGENGPEIREFILDGLEFLGFKLDKEKN 359

Query: 360 -VFGYRGDISTPESKVKVLVISTDEELCIARDVERL-KNTK 398
V G IST +SKV V+V+ T+EE IA+D E++ ++ K
Sbjct: 360 KVRGEEAIISTADSKVNVMVVPTNEEYMIAKDTEKIVESLK 400


29  spyM18_0227  spyM18_0234  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
spyM18_0227  -2      14       1.072428   response regulator
spyM18_0228  0       16       1.848403   ribonuclease P
spyM18_0229  0       16       1.566602   hypothetical protein
spyM18_0230  2       17       1.833151   hypothetical protein
spyM18_0232  3       17       1.467252   50S ribosomal protein L34
spyM18_0233  3       17       1.025373   N-acetylmannosamine-6-phosphate 2-epimerase
spyM18_0234  2       18       1.057307   N-acetylneuraminate-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
spyM18_0227  HTHFIS  30     0.007    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.2 bits (68), Expect = 0.007
Identities = 11/63 (17%), Positives = 27/63 (42%), Gaps = 2/63 (3%)

Query: 50 ERGDHQLYFLDIEIGEYTRCGLELAAAIRQKDPNAVIVFVTTHSEFVPISFKYKVSALDF 109
GD L D+ + + +L I++ P+ ++ ++ + F+ + A D+
Sbjct: 44 AAGDGDLVVTDVVMPD--ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDY 101

Query: 110 IDK 112
+ K
Sbjct: 102 LPK 104


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
spyM18_0229  60KDINNERMP  162    1e-48    60kDa inner membrane protein signature.
		>60KDINNERMP#60kDa inner membrane protein signature.

Length = 548

Score = 162 bits (412), Expect = 1e-48
Identities = 66/237 (27%), Positives = 116/237 (48%), Gaps = 20/237 (8%)

Query: 31 VTAQSSSGWDQLVYLFARAIQWL-----SFDGSIGVGIILFTLTIRLMLMPLFNMQIKSS 85
+ GW + ++ + L SF G+ G II+ T +R ++ PL Q S
Sbjct: 324 LDLTVDYGWLWFI---SQPLFKLLKWIHSFVGNWGFSIIIITFIVRGIMYPLTKAQYTSM 380

Query: 86 QKMQDIQPELRELQKKYAGKDTQTRMKLAEESQALYKKYGVNPYASLLPLLIQMPVMIAL 145
KM+ +QP+++ ++++ + ++++E ALYK VNP PLLIQMP+ +AL
Sbjct: 381 AKMRMLQPKIQAMRERLGDD----KQRISQEMMALYKAEKVNPLGGCFPLLIQMPIFLAL 436

Query: 146 FQALTRVSFLKTGTF-LWV-ELAQHDHLYLLPVLAAVFTFLSTWLTNLAAKEKNVMMTVM 203
+ L L+ F LW+ +L+ D Y+LP+L V F ++ + M +
Sbjct: 437 YYMLMGSVELRQAPFALWIHDLSAQDPYYILPILMGVTMFFIQKMS--PTTVTDPMQQKI 494

Query: 204 IYVMPLMIFFMGFNLASGVVLYWTVSNAFQVVQLLLLNNPFKIIAERQRLANEEKER 260
+ MP++ SG+VLY+ VSN ++Q L+ E++ L + EK++
Sbjct: 495 MTFMPVIFTVFFLWFPSGLVLYYIVSNLVTIIQQQLIYR----GLEKRGLHSREKKK 547


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
spyM18_0230  IGASERPTASE  28     0.047    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 28.5 bits (63), Expect = 0.047
Identities = 29/156 (18%), Positives = 49/156 (31%), Gaps = 20/156 (12%)

Query: 57 KTVYKADKKATRGVPEN----------------INQKHAPAVNSADVEPEEIKATQKLEA 100
KTV K ++ AT +N N+ + + + E K T +E
Sbjct: 1049 KTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEK 1108

Query: 101 EDTKVVPLMPEDSPAQTPSNLAETVTETKAQQPSIPVEESEVPQDAGNDGFSKDIEKAAQ 160
E+ V + S ++ +++ QP P + S+ A
Sbjct: 1109 EEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADT 1168

Query: 161 EVSDYVTKIIYEMDIEATVETSNNRRQINLQIETPE 196
E T ++E V S N +E PE
Sbjct: 1169 EQPAKETS----SNVEQPVTESTTVNTGNSVVENPE 1200


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits      Score  E-value  Comments
spyM18_0234  adhesinb  31     0.011    Adhesin B signature.
		>adhesinb#Adhesin B signature.

Length = 310

Score = 30.6 bits (69), Expect = 0.011
Identities = 14/34 (41%), Positives = 19/34 (55%), Gaps = 2/34 (5%)

Query: 3 MKKLASLAMLGASVLGLAACGGKSQKEAGASKSD 36
MKK L +L + +GLAAC SQK + + S
Sbjct: 1 MKKCRFLVLLLLAFVGLAACS--SQKSSTETGSS 32


30  spyM18_0393  spyM18_0404  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
spyM18_0393  -1      14       -2.958144  exotoxin type A
spyM18_0394  -1      14       -1.926977  hypothetical protein
spyM18_0396  -1      11       -1.501604  hypothetical protein
spyM18_0398  -3      11       -1.306123  UDP-N-acetylmuramate--L-alanine ligase
spyM18_0399  -1      12       -0.583525  arylalkylamine n-acetyltransferase
spyM18_0401  -1      11       -0.560253  aminodeoxychorismate lyase
spyM18_0402  -2      14       -0.225746  transcription elongation factor GreA
spyM18_0404  0       13       -0.044474  OxaA-like protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
spyM18_0393  BACTRLTOXIN  279    4e-97    Bacterial toxin signature.
		>BACTRLTOXIN#Bacterial toxin signature.

Length = 266

Score = 279 bits (714), Expect = 4e-97
Identities = 115/257 (44%), Positives = 161/257 (62%), Gaps = 19/257 (7%)

Query: 11 MVFFVLVTFLGLTISQEVFA--QQDPDPSQLHRSS-LVKNLQNIYFLYEGDPVTHENVKS 67
++ F L+ + + V A Q DP P LH+SS + N+ +LY+ V+ VKS
Sbjct: 11 ILIFALIL---VISTPNVLAESQPDPMPDDLHKSSEFTGTMGNMKYLYDDHYVSATKVKS 67

Query: 68 VDQLLSHDLIYNVSGP---NYDKLKTELKNQEMATLFKDKNVDIYGVEYYHLCYLCE--- 121
VD+ L+HDLIYN+S NYDK+KTEL N+++A +KD+ VD+YG YY CY
Sbjct: 68 VDKFLAHDLIYNISDKKLKNYDKVKTELLNEDLAKKYKDEVVDVYGSNYYVNCYFSSKDN 127

Query: 122 ---NAERSACIYGGVTNHEGNHLEIP--KKIVVKVSIDGIQSLSFDIETNKKMVTAQELD 176
C+YGG+T HEGNH + + ++V+V + ++SF+++T+KK VTAQELD
Sbjct: 128 VGKVTGGKTCMYGGITKHEGNHFDNGNLQNVLVRVYENKRNTISFEVQTDKKSVTAQELD 187

Query: 177 YKVRKYLTDNKQLYTNGPSKYETGYIKFIPKNKESFWFDFFPEP--EFTQSKYLMIYKDN 234
K R +L + K LY S YETGYIKFI N +FW+D P P +F QSKYLM+Y DN
Sbjct: 188 IKARNFLINKKNLYEFNSSPYETGYIKFIENNGNTFWYDMMPAPGDKFDQSKYLMMYNDN 247

Query: 235 ETLDSNTSQIEVYLTTK 251
+T+DS + +IEV+LTTK
Sbjct: 248 KTVDSKSVKIEVHLTTK 264


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
spyM18_0398  ACETATEKNASE  31     0.009    Acetate kinase family signature.
		>ACETATEKNASE#Acetate kinase family signature.

Length = 400

Score = 30.9 bits (70), Expect = 0.009
Identities = 15/55 (27%), Positives = 24/55 (43%), Gaps = 9/55 (16%)

Query: 304 IVNDTII--IDDFA-----HHPTEIVATIDAARQKYPSKEIVAIFQPHTFTRTIA 351
++ D ++ I D H+P I I A Q P +VA+F F +T+
Sbjct: 103 LITDDVLKAITDCIELAPLHNPANIEG-IKACTQIMPDVPMVAVFDT-AFHQTMP 155


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
spyM18_0399  SACTRNSFRASE  32     7e-04    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 31.8 bits (72), Expect = 7e-04
Identities = 26/120 (21%), Positives = 45/120 (37%), Gaps = 29/120 (24%)

Query: 46 VALIDQEIVGYIEGPVVTTPILEDSLFHGVTKNPKTGGYIAITSLSIAKHFQQQGVGTAL 105
+ ++ +G I+ + N GY I +++AK ++++GVGTAL
Sbjct: 69 LYYLENNCIGRIK----------------IRSN--WNGYALIEDIAVAKDYRKKGVGTAL 110

Query: 106 LAALKDLVVAQQRTGLILTCHDYLIS---YYEMNGFINQGISESQHGGT--------LWY 154
L + GL+L D IS +Y + FI + + WY
Sbjct: 111 LHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGAVDTMLYSNFPTANEIAIFWY 170


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
spyM18_0404  60KDINNERMP  136    1e-38    60kDa inner membrane protein signature.
		>60KDINNERMP#60kDa inner membrane protein signature.

Length = 548

Score = 136 bits (344), Expect = 1e-38
Identities = 70/231 (30%), Positives = 116/231 (50%), Gaps = 22/231 (9%)

Query: 38 WEFLGKPMSYFIDYFANNAGLGYGLAIIIVTIIVRTLILPLGLYQSWKASYQS-EKMTFL 96
F+ +P+ + + + G +G +III+T IVR ++ PL KA Y S KM L
Sbjct: 333 LWFISQPLFKLLKWIHSFVG-NWGFSIIIITFIVRGIMYPLT-----KAQYTSMAKMRML 386

Query: 97 KPVFEPINKRIKQASSQEEKMAAQTELMAAQRAHGINPLGGIGCLPLLIQMPFFSAMYFA 156
+P + + +R+ ++K E+MA +A +NPLGG C PLLIQMP F A+Y+
Sbjct: 387 QPKIQAMRERLG-----DDKQRISQEMMALYKAEKVNPLGG--CFPLLIQMPIFLALYYM 439

Query: 157 AQYTKGVSTSTFMG--IDLGSR--SLVLTAIIAALYFFQSWLSMMAVSEEQREQMKTMMY 212
+ + + F DL ++ +L ++ FF +S V++ + +M
Sbjct: 440 LMGSVELRQAPFALWIHDLSAQDPYYILPILMGVTMFFIQKMSPTTVTDPM---QQKIMT 496

Query: 213 TMPIMMIFMSFSLPAGVGLYWLVGGFFSIIQQ-LITTYLLKPRLHKQIKEE 262
MP++ P+G+ LY++V +IIQQ LI L K LH + K++
Sbjct: 497 FMPVIFTVFFLWFPSGLVLYYIVSNLVTIIQQQLIYRGLEKRGLHSREKKK 547


31  spyM18_0769  spyM18_0778  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
spyM18_0769  3       24       2.360144   hypothetical protein
spyM18_0770  5       23       2.142545   hyaluronidase
spyM18_0771  4       24       3.207284   hypothetical protein
spyM18_0773  0       21       -0.514754  hypothetical protein
spyM18_0774  0       20       -0.552159  hypothetical protein
spyM18_0775  0       14       0.095938   hypothetical protein
spyM18_0776  -1      14       0.176385   holin
spyM18_0777  -2      12       0.149774   hypothetical protein
spyM18_0778  -4      9        -0.316510  exotoxin C
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
spyM18_0769  SSPAMPROTEIN  29     0.036    Salmonella surface presentation of antigen gene typ...
		>SSPAMPROTEIN#Salmonella surface presentation of antigen gene type M signature.

Length = 147

Score = 28.9 bits (64), Expect = 0.036
Identities = 23/65 (35%), Positives = 29/65 (44%), Gaps = 6/65 (9%)

Query: 331 ERINALENNQKVITNNQKQFELNLPKYLNDINGKRVWYEKPDDNIEHKIGDYWFEKNGKY 390
E I AL Q ++ K EL + + I KR EK + + K YW K G Y
Sbjct: 66 EEIYALLRKQSIVRRQIKDLELQIIQ----IQEKRSELEKKREEFQEK-SKYWLRKEGNY 120

Query: 391 QRTWI 395
QR WI
Sbjct: 121 QR-WI 124


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
spyM18_0770  PF07212  563    0.0      Hyaluronoglucosaminidase
		>PF07212#Hyaluronoglucosaminidase

Length = 336

Score = 563 bits (1452), Expect = 0.0
Identities = 336/336 (100%), Positives = 336/336 (100%)

Query: 1 MTETIPLRVQFKRMTAEEWTRSDVILLESEIGFETDTGYAKFGDGKNQFSKLKYLNKPDL 60
MTETIPLRVQFKRMTAEEWTRSDVILLESEIGFETDTGYAKFGDGKNQFSKLKYLNKPDL
Sbjct: 1 MTETIPLRVQFKRMTAEEWTRSDVILLESEIGFETDTGYAKFGDGKNQFSKLKYLNKPDL 60

Query: 61 GAFAQKEETNSKITKLESSKADKNAVYLKAESKIELDKKLNLKGGVMTGQLQFKPNKSGI 120
GAFAQKEETNSKITKLESSKADKNAVYLKAESKIELDKKLNLKGGVMTGQLQFKPNKSGI
Sbjct: 61 GAFAQKEETNSKITKLESSKADKNAVYLKAESKIELDKKLNLKGGVMTGQLQFKPNKSGI 120

Query: 121 KPSSSVGGAINIDMSKSEGAGVVVYSNNDTSDGPLMSLRTGKETFNQSALFVDYSGKTNA 180
KPSSSVGGAINIDMSKSEGAGVVVYSNNDTSDGPLMSLRTGKETFNQSALFVDYSGKTNA
Sbjct: 121 KPSSSVGGAINIDMSKSEGAGVVVYSNNDTSDGPLMSLRTGKETFNQSALFVDYSGKTNA 180

Query: 181 VNIAMRQPTTPNFSSALNITSGNENGSAMQIRGVEKALGTLKITHENPNVEANYDENAAA 240
VNIAMRQPTTPNFSSALNITSGNENGSAMQIRGVEKALGTLKITHENPNVEANYDENAAA
Sbjct: 181 VNIAMRQPTTPNFSSALNITSGNENGSAMQIRGVEKALGTLKITHENPNVEANYDENAAA 240

Query: 241 LSIDIVKKQKGGKGTAAQGIYINSTSGTTGKLLRIRNLGDDKFYVKHDGGFYAKKTSQID 300
LSIDIVKKQKGGKGTAAQGIYINSTSGTTGKLLRIRNLGDDKFYVKHDGGFYAKKTSQID
Sbjct: 241 LSIDIVKKQKGGKGTAAQGIYINSTSGTTGKLLRIRNLGDDKFYVKHDGGFYAKKTSQID 300

Query: 301 GNLKLKNPTADDHAATKAYVDSEVKKLKALLMDKQV 336
GNLKLKNPTADDHAATKAYVDSEVKKLKALLMDKQV
Sbjct: 301 GNLKLKNPTADDHAATKAYVDSEVKKLKALLMDKQV 336


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
spyM18_0771  RTXTOXIND  34     0.002    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 33.6 bits (77), Expect = 0.002
Identities = 27/145 (18%), Positives = 51/145 (35%), Gaps = 21/145 (14%)

Query: 158 RLSSSYQSGINGLKAQLANDKI---GLQAEIQATAQGLSQKYDNELRQLSAKITTTSSGT 214
RL+S + + + Q ++ +AE T +Y+N R +++ SS
Sbjct: 186 RLTSLIKEQFSTWQNQKYQKELNLDKKRAERL-TVLARINRYENLSRVEKSRLDDFSSLL 244

Query: 215 TEAYESKLAGLRAEFTRSNQGMRIELESQISGLRAVQQSTTSQISQEIRDRTGAVSRVQQ 274
+ +K A L E E++ + SQ+ Q + A Q
Sbjct: 245 HKQAIAKHAVL-------------EQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQL 291

Query: 275 DLESYQR----RLQDAEDNYSSLTH 295
+ ++ +L+ DN LT
Sbjct: 292 VTQLFKNEILDKLRQTTDNIGLLTL 316


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
spyM18_0777  FLGFLGJ  97     7e-25    Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 97.1 bits (241), Expect = 7e-25
Identities = 46/123 (37%), Positives = 65/123 (52%), Gaps = 8/123 (6%)

Query: 23 SLTAAQAILESGWGKHA-------PHNALFGIKADASWTGKSFNTKTQEEYQAGVITDIV 75
L AQA LESGWG+ P LFG+KA +W G T E Y+ G +
Sbjct: 172 HLILAQAALESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTE-YENGEAKKVK 230

Query: 76 DRFRAYDSWDESIADHGQFLVDNPRYQSVIGEADYKKACHAIKDAGYATASGYAELLIQI 135
+FR Y S+ E+++D+ L NPRY +V A ++ A++DAGYAT YA L +
Sbjct: 231 AKFRVYSSYLEALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNM 290

Query: 136 IEE 138
I++
Sbjct: 291 IQQ 293


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
spyM18_0778  BACTRLTOXIN  171    4e-55    Bacterial toxin signature.
		>BACTRLTOXIN#Bacterial toxin signature.

Length = 266

Score = 171 bits (435), Expect = 4e-55
Identities = 61/254 (24%), Positives = 117/254 (46%), Gaps = 28/254 (11%)

Query: 6 IIKIVFIITVILISTISPIIKSDSKKDISNVKSDLLYAYTITPYDYKDCRVNFSTT---- 61
+I I +I VI + + D D + S+ Y Y D V+ +
Sbjct: 10 VILIFALILVISTPNVLAESQPDPMPDDLHKSSEFTGTMGNMKYLYDDHYVSATKVKSVD 69

Query: 62 ----HTL--NIDTQKYRGKDYYISSEMSYEASQKFKRDDHVDVFGLFYILNSHTGEY--- 112
H L NI +K + D + ++ + ++K+K D+ VDV+G Y +N +
Sbjct: 70 KFLAHDLIYNISDKKLKNYDKVKTELLNEDLAKKYK-DEVVDVYGSNYYVNCYFSSKDNV 128

Query: 113 ---------IYGGITPAQNNKVNHKLLGNLFIS-GESQQN-LNNKIILEKDIVTFQEIDF 161
+YGGIT + N ++ L N+ + E+++N ++ ++ +K VT QE+D
Sbjct: 129 GKVTGGKTCMYGGITKHEGNHFDNGNLQNVLVRVYENKRNTISFEVQTDKKSVTAQELDI 188

Query: 162 KIRKYLMDNYKIYD-ATSPYVSGRIEIGTKDGKHEQIDLFDSPNEG-TRSDIFAKYKDNR 219
K R +L++ +Y+ +SPY +G I+ +G D+ +P + +S Y DN+
Sbjct: 189 KARNFLINKKNLYEFNSSPYETGYIKFIENNGNTFWYDMMPAPGDKFDQSKYLMMYNDNK 248

Query: 220 IINMKNFSHFDIYL 233
++ K+ +++L
Sbjct: 249 TVDSKS-VKIEVHL 261


32  spyM18_1538  spyM18_1554  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
spyM18_1538  -2      14       0.895327   cell division protein
spyM18_1540  -2      15       1.001895   cell division protein
spyM18_1541  -1      15       1.966226   undecaprenyldiphospho-muramoylpentapeptide
spyM18_1542  -1      17       1.689205   UDP-N-acetylmuramoyl-L-alanyl-D-glutamate
spyM18_1543  1       23       1.770872   hypothetical protein
spyM18_1544  -1      18       0.826974   GTP-binding protein
spyM18_1545  0       12       -0.619637  hypothetical protein
spyM18_1546  -1      13       -1.410487  glucose kinase
spyM18_1547  2       16       -2.738551  hypothetical protein
spyM18_1548  2       17       -2.285666  peroxide resistance protein
spyM18_1549  2       20       -2.853374  hypothetical protein
spyM18_1550  1       18       -2.197697  ribosomal RNA large subunit methyltransferase N
spyM18_1551  -1      15       -1.554219  hypothetical protein
spyM18_1552  -2      14       -0.053021  ribose transport operon repressor
spyM18_1553  -1      13       1.080105   hypothetical protein
spyM18_1554  2       15       1.451102   phosphopantetheine adenylyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
spyM18_1538  SHAPEPROTEIN  47     5e-08    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 47.4 bits (113), Expect = 5e-08
Identities = 42/191 (21%), Positives = 79/191 (41%), Gaps = 16/191 (8%)

Query: 170 RKTVERAGIKVENIIISPLAMAKTILNEGEREFGATVIDMGGGQTTVASMRAQELQYTNI 229
R++ + AG + +I P+A A G+ V+D+GGG T VA + + Y++
Sbjct: 127 RESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNGVVYSSS 186

Query: 230 YAEGGEYITKDISKVLKTSLAI------AEALKFNFGQAEISEASITETVK-VDVV-GSE 281
GG+ + I ++ + AE +K G A + V+ ++ G
Sbjct: 187 VRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVP 246

Query: 282 EPVEVTERYLSEIISARIRHILDRVKQDLER------GRLLDLPGGIVLIGGGAIMPGVV 335
+ + E + + I+ V LE+ + + G+VL GGGA++ +
Sbjct: 247 RGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISE--RGMVLTGGGALLRNLD 304

Query: 336 EIAQEIFGVTV 346
+ E G+ V
Sbjct: 305 RLLMEETGIPV 315


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
spyM18_1541  LIPPROTEIN48  31     0.010    Mycoplasma P48 major surface lipoprotein signature.
		>LIPPROTEIN48#Mycoplasma P48 major surface lipoprotein signature.

Length = 428

Score = 30.7 bits (69), Expect = 0.010
Identities = 19/99 (19%), Positives = 31/99 (31%), Gaps = 10/99 (10%)

Query: 154 FEQEDQLSKVKHLGAVTKVFKDANQMPESTQLE-AVKEYFSRDLKTLLFIGGSAGAHVFN 212
FE ++K + + N + S+ E A S K + G
Sbjct: 83 FEALKAINKQTGI--------EINNVEPSSNFESAYNSALSAGHKIWVLNGFKHQQS-IK 133

Query: 213 QFISDHPELKQRYNIINITGDPHLNELSSHLYRVDYVTD 251
Q+I H E +R I I D + Y + +
Sbjct: 134 QYIDAHREELERNQIKIIGIDFDIETEYKWFYSLQFNIK 172


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
spyM18_1544  TCRTETOQM  186    3e-53    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 186 bits (474), Expect = 3e-53
Identities = 102/477 (21%), Positives = 187/477 (39%), Gaps = 97/477 (20%)

Query: 8 IRNVAIIAHVDHGKTTLVDELLKQSHTLDERKELQE--RAMDSNDLEKERGITILAKNTA 65
I N+ ++AHVD GKTTL + LL S + E + + D+ LE++RGITI T+
Sbjct: 3 IINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITS 62

Query: 66 VAYNDVRINIMDTPGHADFGGEVERIMKMVDGVVLVVDAYEGTMPQTRFVLKKALEQNLI 125
+ + ++NI+DTPGH DF EV R + ++DG +L++ A +G QTR + + +
Sbjct: 63 FQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIP 122

Query: 126 PIVVVNKIDKPSARP-------------------------------------AEVVDEVL 148
I +NKID+ + V E
Sbjct: 123 TIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEGN 182

Query: 149 ELFIELGADDEQLE-----------------FPVVYASAINGTSSLSDDPADQEHTMAPI 191
+ +E + LE FPV + SA N + +
Sbjct: 183 DDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIG------------IDNL 230

Query: 192 FDTIIDHIPAPVDNSDEPLQFQVSLLDYNDFVGRIGIGRVFRGTVKVGDQVTLSKLDGTT 251
+ I + + L +V ++Y++ R+ R++ G + + D V +S
Sbjct: 231 IEVITNKFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRIS----EK 286

Query: 252 KNFRVTKLFGFFGLERREIQEAKAGDLIAVSGMEDIFVGETITPTDCVEALPILRIDEPT 311
+ ++T+++ E +I +A +G+++ + E + + + T + + P
Sbjct: 287 EKIKITEMYTSINGELCKIDKAYSGEIVILQN-EFLKLNSVLGDTKLLPQRERIENPLPL 345

Query: 312 LQMTFLVNNSPFAGREGKWITSRKVEER--LLAELQT----DVSLRVDPTDSPDKWTISG 365
LQ T + K ++R LL L D LR + + +S
Sbjct: 346 LQTT---------------VEPSKPQQREMLLDALLEISDSDPLLRYYVDSATHEIILSF 390

Query: 366 RGELHLSILIETMRRE-GYELQVSRPEVIIKEIDGVKCEPFERVQIDTPEEYQGAII 421
G++ + + ++ + E+++ P VI E K E + I+ P A I
Sbjct: 391 LGKVQMEVTCALLQEKYHVEIEIKEPTVIYMERPLKKAE--YTIHIEVPPNPFWASI 445



Score = 42.5 bits (100), Expect = 4e-06
Identities = 18/79 (22%), Positives = 31/79 (39%), Gaps = 1/79 (1%)

Query: 403 EPFERVQIDTPEEYQGAIIQSLSERKGDMLDMQMVGNGQTRLIFLIPARGLIGYSTEFLS 462
EP+ +I P+EY + +++D Q + N + L IPAR + Y ++
Sbjct: 537 EPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQ-LKNNEVILSGEIPARCIQEYRSDLTF 595

Query: 463 MTRGYGIMNHTFDQYLPVV 481
T G + Y
Sbjct: 596 FTNGRSVCLTELKGYHVTT 614


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
spyM18_1546  PF03309  31     0.004    Bvg accessory factor
		>PF03309#Bvg accessory factor

Length = 271

Score = 31.3 bits (71), Expect = 0.004
Identities = 29/126 (23%), Positives = 43/126 (34%), Gaps = 14/126 (11%)

Query: 5 LLGIDLGGTTIKFGILTAAGEVQE---KWAIETNILEGGKHIVPDIVASIKHRLDLYGLS 61
LL ID+ T G+++ +G+ + +W I T + D +A L G
Sbjct: 2 LLAIDVRNTHTVVGLISGSGDHAKVVQQWRIRTE-----PEVTADELALTIDG--LIGDD 54

Query: 62 SADFVGIGMGSPGAVDRDTNTVTGAFNLNWKETQEVGSVVEKELGIPFAIDNDANVAALG 121
+ G S V + V W V GIP +DN V A
Sbjct: 55 AERLTGASGLS--TVPSVLHEVRVMLEQYWPNVPHVLIEPGVRTGIPLLVDNPKEVGA-- 110

Query: 122 ERWVGA 127
+R V
Sbjct: 111 DRIVNC 116


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
spyM18_1548  HELNAPAPROT  151    1e-49    Helicobacter neutrophil-activating protein A family ...
		>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature.

Length = 153

Score = 151 bits (383), Expect = 1e-49
Identities = 49/154 (31%), Positives = 85/154 (55%), Gaps = 4/154 (2%)

Query: 19 KKEASKNEKT--KAVLNQAVADLSVAASIVHQVHWYMRGPGFLYLHPKMDELLDSLNANL 76
K E +K +T + LN +++ + S +H+ HWY++GP F LH K +EL D +
Sbjct: 2 KTENAKTNQTLVENSLNTQLSNWFLLYSKLHRFHWYVKGPHFFTLHEKFEELYDHAAETV 61

Query: 77 DEMSERLITIGGAPYSTLAEFSKHSKLDEAKGTYDKTVAQHLARLVEVYLYLSSLYQVGL 136
D ++ERL+ IGG P +T+ E+++H+ + + + + ++ + LV Y +SS + +
Sbjct: 62 DTIAERLLAIGGQPVATVKEYTEHASITDGGN--ETSASEMVQALVNDYKQISSESKFVI 119

Query: 137 DITDEEGDAGTNDLFTAAKTEAEKTIWMLQAERG 170
+ +E D T DLF E EK +WML + G
Sbjct: 120 GLAEENQDNATADLFVGLIEEVEKQVWMLSSYLG 153


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
spyM18_1549  PREPILNPTASE  31     0.003    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 30.9 bits (70), Expect = 0.003
Identities = 42/160 (26%), Positives = 59/160 (36%), Gaps = 25/160 (15%)

Query: 78 SLIIILWASMVHWVSASYCYLLLFSLLFSLF--DWRSQ------EYPFILWLFSFVSLLL 129
+L+ + A + + LLL +L +L D P + F L
Sbjct: 118 ALLSVAVAMTLAPGWGTLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGG 177

Query: 130 FYSIN---------YLSLILLLLGLLAHLRPFSIGAGDFFYLASLALVLDLTSLIWLIQL 180
F S+ YL L L +G GDF LA+L L +L ++ L
Sbjct: 178 FVSLGDAVIGAMAGYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLL 237

Query: 181 ASLAGITACLLL-------GIKRIPFIPYLSFGLFWIVLL 213
+SL G + L K IPF PYL+ WI LL
Sbjct: 238 SSLVGAFMGIGLILLRNHHQSKPIPFGPYLAIA-GWIALL 276


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
spyM18_1552  NUCEPIMERASE  32     0.004    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 31.7 bits (72), Expect = 0.004
Identities = 13/76 (17%), Positives = 34/76 (44%), Gaps = 9/76 (11%)

Query: 50 LAQSLKTKKNQLVGLLLPDISNPFF-PRLARGAEEYLKEKGYRVMLGNISDSEALEE--- 105
+++ L +Q+VG+ D N ++ L + E L + G++ +++D E + +
Sbjct: 16 VSKRLLEAGHQVVGI---DNLNDYYDVSLKQARLELLAQPGFQFHKIDLADREGMTDLFA 72

Query: 106 --EYVHVLLQSNAAGI 119
+ V + + +
Sbjct: 73 SGHFERVFISPHRLAV 88


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
spyM18_1554  LPSBIOSNTHSS  153    2e-50    Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature.

Length = 166

Score = 153 bits (388), Expect = 2e-50
Identities = 58/157 (36%), Positives = 94/157 (59%), Gaps = 2/157 (1%)

Query: 5 IGLYTGSFDPVTNGHLDIVKRASGLFDQIYVGIFDNPTKKSYFKLEVRKAMLTQALADFT 64
+Y GSFDP+T GHLDI++R LFDQ+YV + NP K+ F ++ R + +A+A
Sbjct: 2 NAIYPGSFDPITFGHLDIIERGCRLFDQVYVAVLRNPNKQPMFSVQERLEQIAKAIAHLP 61

Query: 65 NVIVVTSHERLAIDVAKELRVTHLIRGLRNATDFEYEENLEYFNHLLAPNIETVYLISRN 124
N V + E L ++ A++ + ++RGLR +DFE E + N LA ++ETV+L +
Sbjct: 62 NAQVDSF-EGLTVNYARQRQAGAILRGLRVLSDFELELQMANTNKTLASDLETVFLTTST 120

Query: 125 KWQALSSSRVRELIHFQSSLEGLVPQSVIAQV-EKMN 160
++ LSSS V+E+ F ++E VP V A + ++ +
Sbjct: 121 EYSFLSSSLVKEVARFGGNVEHFVPSHVAAALYDQFH 157


33  spyM18_1564  spyM18_1570  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
spyM18_1564  -2      17       1.607671   arginine deiminase
spyM18_1565  -2      18       1.658861   hypothetical protein
spyM18_1566  -2      19       2.293245   arginine repressor ArgR
spyM18_1567  -1      18       2.157092   hypothetical protein
spyM18_1568  1       18       1.881600   hypothetical protein
spyM18_1569  -1      21       1.972620   histidine kinase
spyM18_1570  0       21       2.609584   two-component response regulator
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
spyM18_1564  ARGDEIMINASE  578    0.0      Bacterial arginine deiminase signature.
		>ARGDEIMINASE#Bacterial arginine deiminase signature.

Length = 409

Score = 578 bits (1492), Expect = 0.0
Identities = 191/410 (46%), Positives = 276/410 (67%), Gaps = 9/410 (2%)

Query: 5 TPIHVYSEIGKLKKVLLHRPGKEIENLMPDYLERLLFDDIPFLEDAQKEHDAFAQALRDE 64
PI+++SEIG+LKKVLLHRPG+E+ENL P ++ LFDDIP+LE A++EH+ FA L++
Sbjct: 6 NPINIFSEIGRLKKVLLHRPGEELENLTPFIMKNFLFDDIPYLEVARQEHEVFASILKNN 65

Query: 65 GIEVLYLETLAAESLVTP-EIREAFIDEYLSEANIRGRATKKAIRELLMAIEDNQELIEK 123
+E+ Y+E L +E LV+ + FI +++ EA I+ T +++ ++ +I K
Sbjct: 66 LVEIEYIEDLISEVLVSSVALENKFISQFILEAEIKTDFTINLLKDYFSSL-TIDNMISK 124

Query: 124 TMAGVQKSELPEIPASEKGLTDLVESNYPFAIDPMPNLYFTRDPFATIGTGVSLNHMFSE 183
++GV EL +S L DLV F IDPMPN+ FTRDPFA+IG GV++N MF++
Sbjct: 125 MISGVVTEELKNYTSS---LDDLVNGANLFIIDPMPNVLFTRDPFASIGNGVTINKMFTK 181

Query: 184 TRNRETLYGKYIFTHHPIYGGGKVPMVYDRNETTRIEGGDELVLSKDVLAVGISQRTDAA 243
R RET++ +YIF +HP+Y VP+ +R E +EGGDELVL+K +L +GIS+RT+A
Sbjct: 182 VRQRETIFAEYIFKYHPVYKE-NVPIWLNRWEEASLEGGDELVLNKGLLVIGISERTEAK 240

Query: 244 SIEKLLVNIFKQNLGFKKVLAFEFANNRKFMHLDTVFTMVDYDKFTIHPEIEGDLRVYSV 303
S+EKL +++FK F +LAF+ NR +MHLDTVFT +DY FT + +Y +
Sbjct: 241 SVEKLAISLFKNKTSFDTILAFQIPKNRSYMHLDTVFTQIDYSVFTSFTSDDMYFSIYVL 300

Query: 304 TYDNE--ELHIVEEKGDLAELLAANLGVEKVDLIRCGGDNLVAAGREQWNDGSNTLTIAP 361
TY+ ++HI +EK + ++L+ LG K+D+I+C G +L+ REQWNDG+N L IAP
Sbjct: 301 TYNPSSSKIHIKKEKARIKDVLSFYLG-RKIDIIKCAGGDLIHGAREQWNDGANVLAIAP 359

Query: 362 GVVVVYNRNTITNAILESKGLKLIKIHGSELVRGRGGPRCMSMPFEREDI 411
G ++ Y+RN +TN + E G+K+ +I SEL RGRGGPRCMSMP REDI
Sbjct: 360 GEIIAYSRNHVTNKLFEENGIKVHRIPSSELSRGRGGPRCMSMPLIREDI 409


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
spyM18_1566  ARGREPRESSOR  123    4e-39    Bacterial arginine repressor signature.
		>ARGREPRESSOR#Bacterial arginine repressor signature.

Length = 149

Score = 123 bits (311), Expect = 4e-39
Identities = 60/146 (41%), Positives = 92/146 (63%), Gaps = 2/146 (1%)

Query: 1 MNKKETRHQLIRSLISETTIHTQQELQERLQKNGITITQATLSRDMKELNLVKVTSGNDT 60
MNK + RH IR +I+ I TQ EL + L+K+G +TQAT+SRD+KEL+LVKV + N +
Sbjct: 1 MNKGQ-RHIKIREIITANEIETQDELVDILKKDGYNVTQATVSRDIKELHLVKVPTNNGS 59

Query: 61 HYEALAISQTRWEH-RLRFYMEDALVMLKIVQHQIILKTLPGLAQSFGSILDAMQIPEIV 119
+ +L Q +L+ + DA V + H I+LKT+PG AQ+ G+++D + EI+
Sbjct: 60 YKYSLPADQRFNPLSKLKRSLMDAFVKIDSASHLIVLKTMPGNAQAIGALMDNLDWEEIM 119

Query: 120 ATVCGDDTCLIVCEDNEQAKACYETL 145
T+CGDDT LI+C ++ K + +
Sbjct: 120 GTICGDDTILIICRTHDDTKVVQKKI 145


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
spyM18_1569  PF06580  180    5e-54    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 180 bits (459), Expect = 5e-54
Identities = 58/203 (28%), Positives = 102/203 (50%), Gaps = 10/203 (4%)

Query: 362 EKEIGQYRLQALASQINPHFLYNTLDTIIWMAEFNDSKRVVEVTKSLAKYFRLALNQGN- 420
+ +L AL +QINPHF++N L+ I + D + E+ SL++ R +L N
Sbjct: 155 ASMAQEAQLMALKAQINPHFMFNALNNIRALIL-EDPTKAREMLTSLSELMRYSLRYSNA 213

Query: 421 DYIRLADELDHVSQYLFIQKQRYGDKLTYDVQGLDAYADFIIPKLILQPLVENAIYHGIK 480
+ LADEL V YL + ++ D+L ++ Q A D +P +++Q LVEN I HGI
Sbjct: 214 RQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVENGIKHGIA 273

Query: 481 EVDRKGMIKVTVSETAQHLILTVWDNGKGIEASALTNSQSLLTRGGVGLKNVDQRLKLQY 540
++ + G I + ++ + L V + G + ++ G GL+NV +RL++ Y
Sbjct: 274 QLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKEST-------GTGLQNVRERLQMLY 326

Query: 541 GEAYQMTIHSQSDHFTEIQLSLP 563
G Q+ + + + + +P
Sbjct: 327 GTEAQIKLSEKQGKVNAM-VLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
spyM18_1570  HTHFIS  96     9e-25    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 95.7 bits (238), Expect = 9e-25
Identities = 43/165 (26%), Positives = 75/165 (45%), Gaps = 12/165 (7%)

Query: 3 SLLIVEDEYLIRQGVRSLVDFSQFKIDRVNEAENGQLAWDLFQKEPYDIVLTDINMPKLN 62
++L+ +D+ IR + + + + V N W D+V+TD+ MP N
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGY---DVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 63 GIQLAELIKQESPQTHLVFLTGYDDFNYALSALKLGADDYLLKPFSKADVEDMLGKLRKK 122
L IK+ P ++ ++ + F A+ A + GA DYL KPF D+ +++G + +
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPF---DLTELIGIIGRA 118

Query: 123 LELSKKTETIQELVEQPQKEVSAIAMAIHE------RLADSDLTL 161
L K+ + E Q + + A+ E RL +DLTL
Sbjct: 119 LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTL 163


34  spyM18_1866  spyM18_1873  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
spyM18_1866  -1      11       -1.041002  ABC transporter permease
spyM18_1867  -2      9        -0.514511  ABC transporter
spyM18_1868  -2      9        -0.036088  hypothetical protein
spyM18_1870  -2      11       1.978311   hypothetical protein
spyM18_1871  -2      11       2.665539   alanine racemase
spyM18_1872  -4      9        1.319777   4'-phosphopantetheinyl transferase
spyM18_1873  -3      10       2.008718   preprotein translocase subunit SecA
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
spyM18_1866  TYPE3IMSPROT  28     0.046    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 28.2 bits (63), Expect = 0.046
Identities = 19/76 (25%), Positives = 32/76 (42%), Gaps = 5/76 (6%)

Query: 255 LASVATSIVGVVSFLGL---IVPHMSRLLVGSKHQILIPFSALLGAFVFLLADTLGRSLA 311
+ S A + +GL H S+L++ Q +PFS L V + L
Sbjct: 29 VVSTALIVALSAMLMGLSDYYFEHFSKLMLIPAEQSYLPFSQALSYVVDNVLLEFF-YLC 87

Query: 312 YPLEISPAIIMSIVGG 327
+PL ++ A +M+I
Sbjct: 88 FPL-LTVAALMAIASH 102


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
spyM18_1867  FERRIBNDNGPP  71     1e-15    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 71.1 bits (174), Expect = 1e-15
Identities = 55/265 (20%), Positives = 104/265 (39%), Gaps = 24/265 (9%)

Query: 304 VACVNQHPKTAKETEQQRIVATSVAVVDICDRLNLDLVGVCDSKLYTL----PKRYDAVK 359
+ A + RIVA V++ L + GV D+ Y L P D+V
Sbjct: 20 PLLWQMNTAHAAAIDPNRIVALEWLPVELLLALGIVPYGVADTINYRLWVSEPPLPDSVI 79

Query: 360 RVGLPMNPDIELIASLKPTWILSPNSLQEDLEPKYQKLDTEYGFLNLRSVEG------MY 413
VGL P++EL+ +KP++++ P + L +G
Sbjct: 80 DVGLRTEPNLELLTEMKPSFMV----WSAGYGPSPEMLARIAPGRGFNFSDGKQPLAMAR 135

Query: 414 QSIDDLGNLFQRQQEAKELRQQYQDYYRAFQAKRKGK-KKPKVLILMGLPGSYLVATNQS 472
+S+ ++ +L Q A+ QY+D+ R+ + + + +P +L + P LV S
Sbjct: 136 KSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVFGPNS 195

Query: 473 YVGNLLDLAGGENVYQ--SDEKEFLSVNPEDMLA-KEPDLILRTAHAIPDKVKVMFDKEF 529
+LD G N +Q ++ +V+ + + A K+ D++ D +M
Sbjct: 196 LFQEILDEYGIPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNSKDMDALM----- 250

Query: 530 AENDIWKHFTAVKEGKVYDLDNTLF 554
+W+ V+ G+ + F
Sbjct: 251 -ATPLWQAMPFVRAGRFQRVPAVWF 274


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
spyM18_1870  TONBPROTEIN  34     0.001    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 33.8 bits (77), Expect = 0.001
Identities = 18/89 (20%), Positives = 29/89 (32%)

Query: 76 SNSLVNADDKKRSDSSQSVVGSSDNKAEAENQVDDKSTDHSKPTDHSKPTDHSKPTDQPK 135
S ++V D + + Q + + + + KP KP K
Sbjct: 46 SVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKK 105

Query: 136 PTDQPKPSPSKVDTAPASSLSRQLPEVRT 164
+QPK V++ PAS P T
Sbjct: 106 VQEQPKRDVKPVESRPASPFENTAPARLT 134


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
spyM18_1871  ALARACEMASE  344    e-119    Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 344 bits (883), Expect = e-119
Identities = 122/367 (33%), Positives = 195/367 (53%), Gaps = 21/367 (5%)

Query: 7 RPTVARVNLQAIKENVASVQKHIPLGVKTYAVVKADAYGHGAVQVSKALLPQVDGYCVSN 66
RP A ++LQA+K+N++ V++ + ++VVKA+AYGHG ++ A+ DG+ + N
Sbjct: 3 RPIQASLDLQALKQNLSIVRQAAT-HARVWSVVKANAYGHGIERIWSAI-GATDGFALLN 60

Query: 67 LDEALQLRQAGIDKEILIL-GVLLPNELKLAITRQVTVTVASLEWLAMAKQEWPDLKG-L 124
L+EA+ LR+ G IL+L G +L++ ++T V S L + LK L
Sbjct: 61 LEEAITLRERGWKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKALQNA--RLKAPL 118

Query: 125 KVHIKIDSGMGRIGLRSVTEVDNLIAGLKSMGAD-VEGIFTHFATADEADDTKFNQQLQF 183
+++K++SGM R+G + V + L++M + +HFA A+ D +
Sbjct: 119 DIYLKVNSGMNRLGFQP-DRVLTVWQQLRAMANVGEMTLMSHFAEAEHPDGIS--GAMAR 175

Query: 184 FKKLIAGLEDKPRLVHASNSATSIWHSDTIFNAVRLGIVSYGLNPSGS-DLSLPFPLQEA 242
++ GL SNSA ++WH + F+ VR GI+ YG +PSG L+
Sbjct: 176 IEQAAEGL---ECRRSLSNSAATLWHPEAHFDWVRPGIILYGASPSGQWRDIANTGLRPV 232

Query: 243 LSLESSLVHVKMISAGDTVGYGATYTAKKSEYVGTVPIGYADGWTRNM-QGFSVLVDGQF 301
++L S ++ V+ + AG+ VGYG YTA+ + +G V GYADG+ R+ G VLVDG
Sbjct: 233 MTLSSEIIGVQTLKAGERVGYGGRYTARDEQRIGIVAAGYADGYPRHAPTGTPVLVDGVR 292

Query: 302 CEIIGRVSMDQLTIRLSKA--YPLGTKVTLIGSNQQKNISTTDIANYRNTINYEVLCLLS 359
+G VSMD L + L+ +GT V L G K I D+A T+ YE++C L+
Sbjct: 293 TMTVGTVSMDMLAVDLTPCPQAGIGTPVELWG----KEIKIDDVAAAAGTVGYELMCALA 348

Query: 360 DRIPRIY 366
R+P +
Sbjct: 349 LRVPVVT 355


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits  Score  E-value  Comments
spyM18_1873  SECA  1052   0.0      SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 1052 bits (2723), Expect = 0.0
Identities = 394/903 (43%), Positives = 560/903 (62%), Gaps = 73/903 (8%)

Query: 1 MANILRKVIENDKG-ELRKLEKIAKKVESYADQMASLSDRDLQGKTLEFKERYQKGETLE 59
+ +L KV + LR++ K+ + + +M LSD +L+GKT EF+ R +KGE LE
Sbjct: 2 LIKLLTKVFGSRNDRTLRRMRKVVNIINAMEPEMEKLSDEELKGKTAEFRARLEKGEVLE 61

Query: 60 QLLPEAFAVVREAAKRVLGLFPYRVQIMGGIVLHNGDVPEMRTGEGKTLTATMPVYLNAI 119
L+PEAFAVVREA+KRV G+ + VQ++GG+VL+ + EMRTGEGKTLTAT+P YLNA+
Sbjct: 62 NLIPEAFAVVREASKRVFGMRHFDVQLLGGMVLNERCIAEMRTGEGKTLTATLPAYLNAL 121

Query: 120 AGEGVHVITVNEYLSTRDATEMGEVYSWLGLSVGINLAAKSPAEKREAYNCDITYSTNSE 179
G+GVHV+TVN+YL+ RDA ++ +LGL+VGINL KREAY DITY TN+E
Sbjct: 122 TGKGVHVVTVNDYLAQRDAENNRPLFEFLGLTVGINLPGMPAPAKREAYAADITYGTNNE 181

Query: 180 VGFDYLRDNMVVRQEDMVQRPLNFALVDEVDSVLIDEARTPLIVSGAVSSETNQLYIRAD 239
GFDYLRDNM E+ VQR L++ALVDEVDS+LIDEARTPLI+SG + ++Y R +
Sbjct: 182 YGFDYLRDNMAFSPEERVQRKLHYALVDEVDSILIDEARTPLIISGPAEDSS-EMYKRVN 240

Query: 240 MFVKTLT------------SVDYVIDVPTKTIGLSDSGIDKAESYFNLS-------NLYD 280
+ L + +D ++ + L++ G+ E +LY
Sbjct: 241 KIIPHLIRQEKEDSETFQGEGHFSVDEKSRQVNLTERGLVLIEELLVKEGIMDEGESLYS 300

Query: 281 IENVALTHFIDNALRANYIMLLDIDYVVSEDGEILIVDQFTGRTMEGRRFSDGLHQAIEA 340
N+ L H + ALRA+ + D+DY+V +DGE++IVD+ TGRTM+GRR+SDGLHQA+EA
Sbjct: 301 PANIMLMHHVTAALRAHALFTRDVDYIV-KDGEVIIVDEHTGRTMQGRRWSDGLHQAVEA 359

Query: 341 KEGVRIQEESKTSASITYQNMFRMYKKLAGMTGTAKTEEEEFREVYNMRIIPIPTNRPIA 400
KEGV+IQ E++T ASIT+QN FR+Y+KLAGMTGTA TE EF +Y + + +PTNRP+
Sbjct: 360 KEGVQIQNENQTLASITFQNYFRLYEKLAGMTGTADTEAFEFSSIYKLDTVVVPTNRPMI 419

Query: 401 RIDHTDLLYPTLESKFRAVVEDVKTRHAKGQPILVGTVAVETSDLISRKLVEAGIPHEVL 460
R D DL+Y T K +A++ED+K R AKGQP+LVGT+++E S+L+S +L +AGI H VL
Sbjct: 420 RKDLPDLVYMTEAEKIQAIIEDIKERTAKGQPVLVGTISIEKSELVSNELTKAGIKHNVL 479

Query: 461 NAKNHFKEAQIIMNAGQRGAVTIATNMAGRGTDIKLG----------------------- 497
NAK H EA I+ AG AVTIATNMAGRGTDI LG
Sbjct: 480 NAKFHANEAAIVAQAGYPAAVTIATNMAGRGTDIVLGGSWQAEVAALENPTAEQIEKIKA 539

Query: 498 ------EGVRELGGLCVIGTERHESRRIDNQLRGRSGRQGDPGESQFYLSLEDDLMRRFG 551
+ V E GGL +IGTERHESRRIDNQLRGRSGRQGD G S+FYLS+ED LMR F
Sbjct: 540 DWQVRHDAVLEAGGLHIIGTERHESRRIDNQLRGRSGRQGDAGSSRFYLSMEDALMRIFA 599

Query: 552 SDRIKAFLDRMKLDEEDTVIKSGMLGRQVESAQKRVEGNNYDTRKQVLQYDDVMREQREI 611
SDR+ + ++ + + I+ + + + +AQ++VE N+D RKQ+L+YDDV +QR
Sbjct: 600 SDRVSGMMRKLGM-KPGEAIEHPWVTKAIANAQRKVESRNFDIRKQLLEYDDVANDQRRA 658

Query: 612 IYANRRDVITANRDLGPEIKAMIKRTIDRAVDAHARSNR---KDAIDAIVTFARTSLVPE 668
IY+ R +++ + D+ I ++ + +DA+ I + + +
Sbjct: 659 IYSQRNELLDVS-DVSETINSIREDVFKATIDAYIPPQSLEEMWDIPGLQERLKNDFDLD 717

Query: 669 ESIS--AKELRGLKDDQIKEKLYQRALAIYDQQLSKLRDQEAIIEFQKVLILMIVDNKWT 726
I+ + L ++ ++E++ +++ +Y ++ + E + F+K ++L +D+ W
Sbjct: 718 LPIAEWLDKEPELHEETLRERILAQSIEVYQRKEEVVG-AEMMRHFEKGVMLQTLDSLWK 776

Query: 727 EHIDALDQLRNAVGLRGYAQNNPVVEYQAEGFKMFQDMIGAIEFDVTRTMMKAQIH-EQE 785
EH+ A+D LR + LRGYAQ +P EY+ E F MF M+ +++++V T+ K Q+ +E
Sbjct: 777 EHLAAMDYLRQGIHLRGYAQKDPKQEYKRESFSMFAAMLESLKYEVISTLSKVQVRMPEE 836

Query: 786 RERASQRATTAAPQNIQSQQSANTDD-------------LPKVERNEACPCGSGKKFKNC 832
E Q+ A + Q QQ ++ DD KV RN+ CPCGSGKK+K C
Sbjct: 837 VEELEQQRRMEAERLAQMQQLSHQDDDSAAAAALAAQTGERKVGRNDPCPCGSGKKYKQC 896

Query: 833 HGR 835
HGR
Sbjct: 897 HGR 899


35  spyM18_2040  spyM18_2046  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
spyM18_2040  0       12       2.143387   sugar ABC transporter ATP-binding protein
spyM18_2041  0       14       1.927587   leucine-rich protein
spyM18_2042  0       15       1.754123   streptokinase
spyM18_2044  2       19       2.613063   D-tyrosyl-tRNA(Tyr) deacylase
spyM18_2045  0       17       3.228521   (p)ppGpp synthetase
spyM18_2046  2       18       2.878883   protective antigen
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
spyM18_2040  PF05272  35     6e-04    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 35.0 bits (80), Expect = 6e-04
Identities = 14/56 (25%), Positives = 20/56 (35%), Gaps = 9/56 (16%)

Query: 34 IVFVGPSGCGKSTTLRMIAGLEDISEGELKIGGEVVNDKSPKDRDIAMVFQNYALY 89
+V G G GKST + + GL+ S+ IG +D Y
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIG---------TGKDSYEQIAGIVAY 645


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
spyM18_2041  HTHFIS  34     7e-04    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 33.7 bits (77), Expect = 7e-04
Identities = 10/30 (33%), Positives = 19/30 (63%)

Query: 229 ALWSEHGNLVQTAQRLYIHRNSLQYKLDKF 258
AL + GN ++ A L ++RN+L+ K+ +
Sbjct: 444 ALTATRGNQIKAADLLGLNRNTLRKKIREL 473


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
spyM18_2042  STREPKINASE  793    0.0      Streptococcus streptokinase protein signature.
		>STREPKINASE#Streptococcus streptokinase protein signature.

Length = 440

Score = 793 bits (2049), Expect = 0.0
Identities = 388/440 (88%), Positives = 413/440 (93%)

Query: 1 MKNYLSFGMFALLFALTFGTVKPVQAIAGYEWLLDRPSVNNSQLVVSMAGIVEGTDKKVF 60
MKNYLSFGMFALLFALTFGTV VQAIAG EWLLDRPSVNNSQLVVS+AG VEGT++ +
Sbjct: 1 MKNYLSFGMFALLFALTFGTVNSVQAIAGPEWLLDRPSVNNSQLVVSVAGTVEGTNQDIS 60

Query: 61 INFFEIDLTSQPAHGGKTEQGLSPKSKPFATDNGAMPHKLEKADLLKAIQERLIANVHSN 120
+ FFEIDLTS+PAHGGKTEQGLSPKSKPFATD+GAM HKLEKADLLKAIQE+LIANVHSN
Sbjct: 61 LKFFEIDLTSRPAHGGKTEQGLSPKSKPFATDSGAMSHKLEKADLLKAIQEQLIANVHSN 120

Query: 121 DGYFEVIDFASDATITDRDDNIYFANQDGSVTLPTQPIQQFLLRGHVRVRPYKEKPIQTP 180
D YFEVIDFASDATITDR+ +YFA++DGSVTLPTQP+Q+FLL GHVRVRPYKEKPIQ
Sbjct: 121 DDYFEVIDFASDATITDRNGKVYFADKDGSVTLPTQPVQEFLLSGHVRVRPYKEKPIQNQ 180

Query: 181 AKSVDIRYTVQFTPLNPDDDFKPVLKDTKLLKTLAIGDTITSQELLAQAQSILNESHSDY 240
AKSVD+ YTVQFTPLNPDDDF+P LKDTKLLKTLAIGDTITSQELLAQAQSILN++H Y
Sbjct: 181 AKSVDVEYTVQFTPLNPDDDFRPGLKDTKLLKTLAIGDTITSQELLAQAQSILNKNHPGY 240

Query: 241 TIYERDSSIVTHDNDIFRTILPMDQEFTYHIKDREQAYGINKKSGQEEKTNNTDLISEKY 300
TIYERDSSIVTHDNDIFRTILPMDQEFTY +K+REQAY INKKSG E+ NNTDLISEKY
Sbjct: 241 TIYERDSSIVTHDNDIFRTILPMDQEFTYRVKNREQAYRINKKSGLNEEINNTDLISEKY 300

Query: 301 YVLKKGEKPYDPFDRSHLKLFTINYVDVNTNKLLKSEQLLTASERNLDFRDLYDPRDKAK 360
YVLKKGEKPYDPFDRSHLKLFTI YVDV+TN+LLKSEQLLTASERNLDFRDLYDPRDKAK
Sbjct: 301 YVLKKGEKPYDPFDRSHLKLFTIKYVDVDTNELLKSEQLLTASERNLDFRDLYDPRDKAK 360

Query: 361 LLYNNLDAFGIMDYTLTGKVEDNHDKNNRVVTVYMGKRPEGENASYHLAYDKDRYTEEER 420
LLYNNLDAFGIMDYTLTGKVEDNHD NR++TVYMGKRPEGENASYHLAYDKDRYTEEER
Sbjct: 361 LLYNNLDAFGIMDYTLTGKVEDNHDDTNRIITVYMGKRPEGENASYHLAYDKDRYTEEER 420

Query: 421 EVYSYLRYTGTPIPDNPKDK 440
EVYSYLRYTGTPIPDNP DK
Sbjct: 421 EVYSYLRYTGTPIPDNPNDK 440


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
spyM18_2046  GPOSANCHOR  126    5e-33    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 126 bits (317), Expect = 5e-33
Identities = 107/457 (23%), Positives = 184/457 (40%), Gaps = 40/457 (8%)

Query: 143 IEAIKYRLDSESHLKEELLKQTAELEQRKNAEVDLKSEKKRLEAQIEKVGYDIANKQQEL 202
+ K +L E + ELE RK + ++ L
Sbjct: 94 LSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAAL 153

Query: 203 EKARSDQKELSESIQKLTSRFKKESDAKQKELDEAKAANKSLSESATKTLARSSKITNEL 262
++D ++ E F AK K L+ KAA ++ K L + +
Sbjct: 154 AARKADLEKALEGA----MNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTAD 209

Query: 263 KDKLAASEKDKNRAFQVSSELANKLHETETSRDKALAESKELADKLAVKTAEAEKLMENV 322
K+ E +K LA + + E + + A+ S + K+ AE L
Sbjct: 210 SAKIKTLEAEKA-------ALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQ 262

Query: 323 GSLDRLVESAKREMAQKLAEIDQLTADKAKADAELAAANDTIASLQTELEKVKTELAVSE 382
L++ +E A A+I L A+KA +AE A L + ++ +L S
Sbjct: 263 AELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASR 322

Query: 383 RLIESGKREIAELEKQKDASDKALAESQANVAELEKQKAASDAKVAELEKEVEAAKAEVA 442
+ + E +LE+Q S+ + + ++ + K +A+ +LE++ + ++A
Sbjct: 323 EAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQ 382

Query: 443 DLKAQLAKKEEELEAVKKEKEALEAKIEELKKAHAEELSKLKEMLEKKDHANADLQAEIN 502
L+ L E + V+K E +K+ L+K + E K ++K A L+AE
Sbjct: 383 SLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEAK 442

Query: 503 RLKQELADRIKSLSQGGRASQTNPGSTTAKAG---------------------------- 534
LK++LA + + L++ ++ + AK G
Sbjct: 443 ALKEKLAKQAEELAKLRAGKASDSQTPDAKPGNKAVPGKGQAPQAGTKPNQNKAPMKETK 502

Query: 535 -QLPSTGESANPFFTIAALTVIAGAGMAVVSPKRKEN 570
QLPSTGE+ANPFFT AALTV+A AG+A V +++EN
Sbjct: 503 RQLPSTGETANPFFTAAALTVMATAGVAAVVKRKEEN 539


36  spyM18_2069  spyM18_2095  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
spyM18_2069  -1      10       1.431818   ATPase
spyM18_2070  1       12       1.094499   ATPase
spyM18_2071  1       12       0.582539   hypothetical protein
spyM18_2072  1       12       0.630338   hypothetical protein
spyM18_2073  2       16       1.350604   laminin adhesion
spyM18_2074  1       17       1.039875   C5A peptidase
spyM18_2076  2       19       -0.045863  M18 protein
spyM18_2077  0       22       0.119519   positive regulatory protein Mga
spyM18_2080  -1      22       0.930548   hypothetical protein
spyM18_2082  -1      22       0.905981   hypothetical protein
spyM18_2083  -1      22       -0.594565  histidine kinase
spyM18_2084  -2      22       -0.274275  two-component response regulator
spyM18_2087  -2      23       0.507816   ABC transporter permease
spyM18_2089  1       29       1.624269   ABC transporter ATP-binding protein
spyM18_2090  2       30       1.484984   ABC transporter
spyM18_2091  1       32       1.813295   hypothetical protein
spyM18_2092  1       32       1.498936   hypothetical protein
spyM18_2095  0       24       -0.580086  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
spyM18_2069  HTHFIS  29     0.020    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.0 bits (65), Expect = 0.020
Identities = 9/16 (56%), Positives = 12/16 (75%)

Query: 45 IIGASGSGKSLLAHAI 60
I G SG+GK L+A A+
Sbjct: 165 ITGESGTGKELVARAL 180


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
spyM18_2072  PF05616  34     0.002    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 34.0 bits (77), Expect = 0.002
Identities = 24/87 (27%), Positives = 35/87 (40%), Gaps = 2/87 (2%)

Query: 226 IPKKDLSPSELAAAQAYWSQKQGRGARPSDY-RPTPAPGRRKAPIPDVTPNPRQGHQPD- 283
IP+ DL+P A A + P++ P PG R P PD NP D
Sbjct: 310 IPRPDLTPGSAEAPNAQPLPEVSPAENPANNPAPNENPGTRPNPEPDPDLNPDANPDTDG 369

Query: 284 NGGYHPAPPRPNDASQNKHQRDEFKGK 310
G P P D +H+++ +G+
Sbjct: 370 QPGTRPDSPAVPDRPNGRHRKERKEGE 396


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
spyM18_2073  ADHESNFAMILY  250    1e-84    Adhesin family signature.
		>ADHESNFAMILY#Adhesin family signature.

Length = 309

Score = 250 bits (641), Expect = 1e-84
Identities = 84/323 (26%), Positives = 145/323 (44%), Gaps = 34/323 (10%)

Query: 1 MKKGFFLMVMVVSLVMIAGCDKSANPKQPTQGMSVVTSFYPMYAMTKEVSGDLNDVR-MI 59
MKK L+V+ +S +++ C Q + VV + + +TK ++GD D+ ++
Sbjct: 1 MKKLGTLLVLFLSAIILVACASGKKDTTSGQKLKVVATNSIIADITKNIAGDKIDLHSIV 60

Query: 60 QSGAGIHSFEPSVNDVAAIYDADLFVYHSHTLE----AWARDLDPNLKKSKVDVFEASKP 115
G H +EP DV +ADL Y+ LE AW L N KK++ + A
Sbjct: 61 PIGQDPHEYEPLPEDVKKTSEADLIFYNGINLETGGNAWFTKLVENAKKTENKDYFA--- 117

Query: 116 LTLDRVKGLEDMEVTQGIDPATLY--------DPHTWTDPVLAGEEAVNIAKELGRLDPK 167
V+ G+D L DPH W + A NIAK+L DP
Sbjct: 118 -------------VSDGVDVIYLEGQNEKGKEDPHAWLNLENGIIFAKNIAKQLSAKDPN 164

Query: 168 HKDSYTKNAKAFKKEAEQLTEEYTQKFKKVR--SKTFVTQHTAFSYLAKRFGLKQLGISG 225
+K+ Y KN K + + ++L +E KF K+ K VT AF Y +K +G+ I
Sbjct: 165 NKEFYEKNLKEYTDKLDKLDKESKDKFNKIPAEKKLIVTSEGAFKYFSKAYGVPSAYIWE 224

Query: 226 ISPEQEPSPRQLKEIQDFVKEYNVKTIFAEDNVNPKIAHAIAKSTGAKVKT---LSPLEA 282
I+ E+E +P Q+K + + +++ V ++F E +V+ + +++ T + +
Sbjct: 225 INTEEEGTPEQIKTLVEKLRQTKVPSLFVESSVDDRPMKTVSQDTNIPIYAQIFTDSIAE 284

Query: 283 APSGNKTYLENLRANLEVLYQQL 305
+Y ++ NL+ + + L
Sbjct: 285 QGKEGDSYYSMMKYNLDKIAEGL 307


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
spyM18_2074  SUBTILISIN  107    3e-27    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 107 bits (268), Expect = 3e-27
Identities = 50/226 (22%), Positives = 85/226 (37%), Gaps = 47/226 (20%)

Query: 117 KAGKGAGTVVAVIDAGFDKNHEAWRLTDKTKARYQSKEDLEKAKKEHGITYGEWVNDKVA 176
+G G VAV+D G D +H DL KA+ G + +
Sbjct: 36 NQTRGRGVKVAVLDTGCDADHP----------------DL-KARIIGGRNFTDDDEGDPE 78

Query: 177 YYHDYSKDGKTAVDQEHGTHVSGILSGNAPSETKEPYRLEGAMPEAQLLLMRVEIVNGLA 236
+ DY+ HGTHV+G ++ + G PEA LL+++V G
Sbjct: 79 IFKDYNG---------HGTHVAGTIAATENE-----NGVVGVAPEADLLIIKVLNKQGSG 124

Query: 237 DYARNYAQAIRDAVNLGAKVINMSFGNAALAYANLPDETKKAFDYAKSKGVSIVTSAGND 296
Y Q I A+ +I+MS G E +A A + + ++ +AGN+
Sbjct: 125 QYD-WIIQGIYYAIEQKVDIISMSLGGPED-----VPELHEAVKKAVASQILVMCAAGNE 178

Query: 297 SSFGGKTRLPLADHPDYGVVGTPAAADSTLTVASYSPDKQLTETAT 342
+T +G P + ++V + + D+ +E +
Sbjct: 179 GDGDDRT----------DELGYPGCYNEVISVGAINFDRHASEFSN 214



Score = 80.3 bits (198), Expect = 4e-18
Identities = 37/139 (26%), Positives = 57/139 (41%), Gaps = 22/139 (15%)

Query: 457 NATPKVLPTASGTK---LSRFSSWGLTADGNIKPDIAAPGQDILSSVANNKYAKLSGTSM 513
+V+ + S FS+ + D+ APG+DILS+V KYA SGTSM
Sbjct: 192 GCYNEVISVGAINFDRHASEFSNSNN------EVDLVAPGEDILSTVPGGKYATFSGTSM 245

Query: 514 SAPLVAGIMGL-LQKQYEIQYPDMTPSERLDLAKKVLMSSATALYDEDEKAYFSPRQQGA 572
+ P VAG + L Q D+T E L+ L + SP+ +G
Sbjct: 246 ATPHVAGALALIKQLANASFERDLTEPE----LYAQLIKRTIPLGN-------SPKMEGN 294

Query: 573 GAVDAKKASA-ATMYVTDK 590
G + + ++ T +
Sbjct: 295 GLLYLTAVEELSRIFDTQR 313


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
spyM18_2076  GPOSANCHOR  127    1e-34    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 127 bits (321), Expect = 1e-34
Identities = 200/332 (60%), Positives = 232/332 (69%), Gaps = 8/332 (2%)

Query: 68 HQLTVENKKLKIDKEQLTKENDDLKTEKDQLEQRSEKLATQKENLEKEVAEAKHKNETLN 127
L E L K L K + + + L +K LE AE + E
Sbjct: 214 KTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAM 273

Query: 128 INNDDLTKKLNETRQELANKQQESKENEKTLNELLEKTVKDKIAREQKSKQDFGALKQEL 187
+ + K+ E A + E K + + +++L + S++ L+ E
Sbjct: 274 NFSTADSAKIKTLEAEKAALEAE-KADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEH 332

Query: 188 AKKEEQNKISEASRKGLRRDLDASREAKKQVEKDLANLTAELDKVKEEKQISDASRQGLR 247
K EEQNKISEASR+ LRRDLDASREAKKQ+E AE K++E+ +IS+ASRQ LR
Sbjct: 333 QKLEEQNKISEASRQSLRRDLDASREAKKQLE-------AEHQKLEEQNKISEASRQSLR 385

Query: 248 RDLDASREAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEAKALK 307
RDLDASREAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEAKALK
Sbjct: 386 RDLDASREAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEAKALK 445

Query: 308 EQLAKQAEELAKLRAEKASDSQTPDAKPGNKAVPGKGQAPQAGTKPNQNKAPMKETKRQL 367
E+LAKQAEELAKLRA KASDSQTPDAKPGNKAVPGKGQAPQAGTKPNQNKAPMKETKRQL
Sbjct: 446 EKLAKQAEELAKLRAGKASDSQTPDAKPGNKAVPGKGQAPQAGTKPNQNKAPMKETKRQL 505

Query: 368 PSTGEAANPFFTAAAATVMVSAGMLALKRKEE 399
PSTGE ANPFFTAAA TVM +AG+ A+ +++E
Sbjct: 506 PSTGETANPFFTAAALTVMATAGVAAVVKRKE 537



Score = 48.9 bits (116), Expect = 2e-08
Identities = 71/301 (23%), Positives = 119/301 (39%), Gaps = 2/301 (0%)

Query: 1 MVRKDANRQYSLRKLKKSTASVAVALSALGVGLAVNQTEVSAAPLTRATADNKDELIKRA 60
M + + NR YSLRKLK TASVAVAL+ LG GL T +A TR+ D +++ +RA
Sbjct: 1 MTKNNTNRHYSLRKLKTGTASVAVALTVLGAGLV-VNTNEVSAVATRSQTDTLEKVQERA 59

Query: 61 NGYEIQNHQLTVENKKLKIDKEQLTKENDDLKTEKDQLEQRSEKLATQKENLEKEVAEAK 120
+ +EI+N+ L ++N L + + L ND+L E +++ K ++ E +
Sbjct: 60 DKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELE 119

Query: 121 HKNETLNINNDDLTKKLNETRQELANKQQESKENEKTLNELLEKTVKDKIAREQKSKQDF 180
+ L + ++ + E K LEK ++ +
Sbjct: 120 ARKADLEKALEGAMNFSTADSAKIKTLEAE-KAALAARKADLEKALEGAMNFSTADSAKI 178

Query: 181 GALKQELAKKEEQNKISEASRKGLRRDLDASREAKKQVEKDLANLTAELDKVKEEKQISD 240
L+ E A E + E + +G A K +E + A L A +++ + +
Sbjct: 179 KTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAM 238

Query: 241 ASRQGLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLE 300
+ K +E E L + K E EKA L+A+
Sbjct: 239 NFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKA 298

Query: 301 A 301

Sbjct: 299 D 299



Score = 34.7 bits (79), Expect = 6e-04
Identities = 56/272 (20%), Positives = 99/272 (36%), Gaps = 27/272 (9%)

Query: 55 ELIKRANGYEIQNHQLTVENKKLKIDKEQLTKENDDLKTEKDQLEQRSEKLATQKENLEK 114
++ + + + ++L+ K L K + + + L +K L
Sbjct: 96 NAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAA 155

Query: 115 EVAEAKHKNETLNINNDDLTKKLNETRQELANKQQESKENEKTLNEL------------- 161
A+ + E + + K+ E A + E EK L
Sbjct: 156 RKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKT 215

Query: 162 --------------LEKTVKDKIAREQKSKQDFGALKQELAKKEEQNKISEASRKGLRRD 207
LEK ++ + L+ E A E + E + +G
Sbjct: 216 LEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNF 275

Query: 208 LDASREAKKQVEKDLANLTAELDKVKEEKQISDASRQGLRRDLDASREAKKQVEKALEEA 267
A K +E + A L AE ++ + Q+ +A+RQ LRRDLDASREAKKQ+E ++
Sbjct: 276 STADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKL 335

Query: 268 NSKLAALEKLNKELEESKKLTEKEKAELQAKL 299
+ E + L + + K +L+A+
Sbjct: 336 EEQNKISEASRQSLRRDLDASREAKKQLEAEH 367


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
spyM18_2077  PF05043  537    0.0      Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 537 bits (1385), Expect = 0.0
Identities = 108/491 (21%), Positives = 217/491 (44%), Gaps = 16/491 (3%)

Query: 13 RELKLISYLTENSNAIGVKDKELSKALNISMLTLQSCLTNMQFMKEVGGITYKDGYINIW 72
R+L+L+ L E+ EL++ LN + ++ L++++ I I
Sbjct: 11 RQLELLELLFEHKRWFHRS--ELAELLNCTERAVKDDLSHVKSAFPDLIFHSSTNGIRII 68

Query: 73 YHQCCGLQEVYQKALRESPSLKLLELLFFRDFSSLEELAEELFVSLSTLKRLIKKTNTYL 132
++ VY + S +LE +FF + E + +E ++S S+L R+I + N +
Sbjct: 69 NTDDSDIEMVYHHFFKHSTHFSILEFIFFNEGCQAESICKEFYISSSSLYRIISQINKVI 128

Query: 133 SHTFAISIVTSPVQVSGDERQIRLFYLKYFSEAYKISEWPFGDILNLKNCERLLSLLIKE 192
F + +PVQ+ G+ER IR F+ +YFSE Y EWPF + + + +LL L+ KE
Sbjct: 129 KRQFQFEVSLTPVQIIGNERDIRYFFAQYFSEKYYFLEWPFEN-FSSEPLSQLLELVYKE 187

Query: 193 VDVKVHFTLFQHLKILSGVNLIRYYKGYSCSYNNKKTSHRFSQLIQHYSEIQDLSRLFYL 252
++ + + LK+L NL R G+ + + + + I+ +++ F
Sbjct: 188 TSFPMNLSTHRMLKLLLVTNLYRIKFGHFMEVDKDSFNDQSLDFLMQAEGIEGVAQSFES 247

Query: 253 KFGLHLDEYTIAEMFSNHLNDKLEIGCAFEIINQDPTSGGRQVTNWIHLL----DEMEIK 308
++ + LDE + ++F ++ I E + V HLL D++ +K
Sbjct: 248 EYNISLDEEVVCQLFVSYFQKMFFID---ESLFMKCVKKDSYVEKSYHLLSDFIDQISVK 304

Query: 309 LNLSITNKYEVAVTLHNASVLNEEDITANYLLFDYKKSYLNFYQKEHPRIYEAFVTSVEK 368
+ I NK + LHN + L +++ ++LFD K + + +Q P+ +
Sbjct: 305 YQIEIENKDNLIWHLHNTAHLYRQELFTEFILFDQKGNTIRNFQNIFPKFVSDVKKELSH 364

Query: 369 LMQADNAQVSKELINQLTYCFFITWENSFLKVNQKDEKVRLLVI----ERSYNSVGNFLK 424
++ S ++N L+Y F ++ + + Q K+++LV+ + V L
Sbjct: 365 YLETLEVCSSSMMVNHLSYTFITHTKHLVINLLQNQPKLKVLVMSNFDQYHAKFVAETLS 424

Query: 425 KYIGEFFSITNFDELDCLTIDLVEIEKQYDVIVTDVMVGKSEELEIFFFYKMIPEAIIDR 484
Y F + + EL+ L + YD+I+++ ++ E + + + ++I
Sbjct: 425 YYCSNNFELEVWTELELSKESLE--DSPYDIIISNFIIPPIENKRLIYSNNINTVSLIYL 482

Query: 485 LNEFLNVSFTD 495
LN + + +
Sbjct: 483 LNAMMFIRLDE 493


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
spyM18_2082  IGASERPTASE  42     7e-06    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 42.0 bits (98), Expect = 7e-06
Identities = 29/151 (19%), Positives = 53/151 (35%), Gaps = 10/151 (6%)

Query: 42 TADTATDAESETAKKDKKSKETASQHDTQKDHKPSHTHPTPPSNDTKQTDQASSEATDKP 101
T +T T ETA +K+ K TQ+ P T P + +T Q +E +
Sbjct: 1092 TKETQTTETKETATVEKEEKAKVETEKTQEV--PKVTSQVSPKQEQSETVQPQAEP-ARE 1148

Query: 102 NKDKNDTKQPDSSDQSTSSPKDQSSQKESQNKDGRPTPSPDQQKDPTPD--KTPEK--SA 157
N + K+P S +T+ D + + + + + + PE A
Sbjct: 1149 NDPTVNIKEPQSQTNTTA---DTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPA 1205

Query: 158 DKIPEKATEKTPEPNRDAPKPIQPPLAAAAP 188
P +E + +P + ++ P
Sbjct: 1206 TTQPTVNSESSNKPKNRHRRSVRSVPHNVEP 1236


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
spyM18_2083  MECHCHANNEL  32     0.002    Bacterial mechano-sensitive ion channel signature.
		>MECHCHANNEL#Bacterial mechano-sensitive ion channel signature.

Length = 136

Score = 32.1 bits (73), Expect = 0.002
Identities = 14/62 (22%), Positives = 28/62 (45%), Gaps = 8/62 (12%)

Query: 10 VINGLIIVVVTSILLVLYFAMPIYYTKVKDKEVKREFDQTSKQIKGKTVTEIRDILTKKI 69
V + LI+ ++ A+ + + KE +K+ +TEIRD+L ++
Sbjct: 82 VFDFLIVA------FAIFMAIKLINKLNRKKEEPAAAPAPTKEEV--LLTEIRDLLKEQN 133

Query: 70 NK 71
N+
Sbjct: 134 NR 135


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
spyM18_2084  HTHFIS  83     1e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 83.0 bits (205), Expect = 1e-20
Identities = 31/128 (24%), Positives = 55/128 (42%), Gaps = 1/128 (0%)

Query: 3 KILVVEDDDTISQVICEFLKANNYDPDCVFDGQAALDKWQTTSYDLIILDIMLPSLSGLE 62
ILV +DD I V+ + L YD + DL++ D+++P + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 63 VLKTIRKT-SDVPIIMLTALDDEYTQLVSFNHLISDYVTKPFSPLILIKRIENVLRVSTP 121
+L I+K D+P+++++A + T + + DY+ KPF LI I L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 122 DEKRQIGD 129
+ D
Sbjct: 125 RPSKLEDD 132


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
spyM18_2090  RTXTOXIND  54     4e-10    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 54.4 bits (131), Expect = 4e-10
Identities = 34/144 (23%), Positives = 55/144 (38%), Gaps = 10/144 (6%)

Query: 60 DISLTLAGEVTANNSSKVKIDSSKGEVKDVFVKKGDVVKVGQPLFSYETSQRLTAQSSEF 119
+I T G++T + SK VK++ VK+G+ V+ G L +LTA +E
Sbjct: 81 EIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLL------KLTALGAEA 134

Query: 120 DVQTKANQLQVAKTNAALKWETYNRKVNEINTLKSRYNTAPDESLLEQIRSAEDSVSQAL 179
D + Q + A L+ Y I K PDE + + E +L
Sbjct: 135 DTL----KTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSL 190

Query: 180 SDAKTADSDVKTAQIELDKANATA 203
+ + + Q EL+ A
Sbjct: 191 IKEQFSTWQNQKYQKELNLDKKRA 214



Score = 37.1 bits (86), Expect = 1e-04
Identities = 27/180 (15%), Positives = 60/180 (33%), Gaps = 16/180 (8%)

Query: 120 DVQTKANQLQVAKTNAALKWETYNRKVNEINTLKSRYNTAPDESL---LEQIRSAEDSVS 176
D + ++ +AK + Y VNE+ KS+ E L E + +
Sbjct: 239 DFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKN 298

Query: 177 QALSDAKTADSDVKTAQIELDKANATAATEKGKLEYDTVKSDTAGTIVSLNTDLPNQSKS 236
+ L + ++ +EL K + + +++ + + L
Sbjct: 299 EILDKLRQTTDNIGLLTLELAKNEE-------RQQASVIRAPVSVKVQQLKVHTEGGVV- 350

Query: 237 KKENETFMEII-DKSKMLVKGNISEFDRDKLKIDQKVEV-IDRKDNSK--KWTGKVTQVG 292
ET M I+ + + V + D + + Q + ++ ++ GKV +
Sbjct: 351 -TTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNIN 409


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
spyM18_2095  60KDINNERMP  27     0.006    60kDa inner membrane protein signature.
		>60KDINNERMP#60kDa inner membrane protein signature.

Length = 548

Score = 27.2 bits (60), Expect = 0.006
Identities = 6/24 (25%), Positives = 9/24 (37%)

Query: 22 YSKKVLADEPTSYQPPAAHSPCDD 45
+ + A + T AA S D
Sbjct: 27 KNPQPQAQQTTQTTTTAAGSAADQ 50



 
Contact Sachin Pundhir for Bugs/Comments.