PredictBias

Identification of genomic and pathogenicity islands in prokaryotic genomes
 
A) Input parameters
Genome: NC_002737.gbk
Threshold dinucleotide bias: 2
Threshold codon bias: 4
Threshold %GC bias: 3
E-value (RPSBlast): 0.05
Genome (non-pathogenic):
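
The E-value (RPSBlast) setting appears to act as a cutoff on the virulence-factor hits reported in section C: every hit listed there has an E-value at or below 0.05. A minimal Python sketch of such a filter follows. The comparison rule, the function name `passes_cutoff`, and the tuple layout are assumptions for illustration, not PredictBias code; the three hits are copied from this report.

```python
E_VALUE_CUTOFF = 0.05  # "E-value (RPSBlast)" from the input parameters

# (locus tag, profile, E-value) copied from three hits in this report
hits = [
    ("SPy_2216", "V8PROTEASE", 8e-12),
    ("SPy_2210", "PF05272", 0.009),
    ("SPy_2082", "TCRTETA", 0.047),
]


def passes_cutoff(e_value: float, cutoff: float = E_VALUE_CUTOFF) -> bool:
    """Assumed rule: keep a hit whose E-value is at or below the cutoff."""
    return e_value <= cutoff


# Locus tags of the hits that survive the filter
kept = [tag for tag, _profile, e in hits if passes_cutoff(e)]
print(kept)
```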
 
 
C) Potential GIs and PAIs in NC_002737
S.No  Start     End       Bias  Virulence  Insertion elements  Prediction
1     SPy_2217  SPy_2197  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
SPy_2217       0       19  -5.015523  putative chromosome segregation protein
SPy_2216       0       18  -4.909560  putative serine protease
SPy_2215      -2       16  -3.535871  23S rRNA
SPy_2211      -2       16  -3.275274  ***ABC transporter permease
SPy_2210       0       17  -1.352664  ABC-F family ATPase
SPy_2209      -1       15  -1.354791  hypothetical protein
SPy_2207      -2       12  -0.209777  putative tryptophanyl-tRNA synthetase
SPy_2206      -3       12  -1.357828  inosine monophosphate dehydrogenase
SPy_2205      -3       13  -3.537539  hypothetical protein
SPy_2204      -1       14  -4.649435  RecF protein
SPy_2203       0       17  -5.646473  hypothetical protein
SPy_2202       0       17  -5.704976  UDP-glucose pyrophosphorylase
SPy_2201       1       20  -6.702564  UDP-glucose 6-dehydrogenase
SPy_2200       0       20  -6.028000  hyaluronate synthase
SPy_2199       0       20  -5.409067  hypothetical protein
SPy_2198       0       18  -4.596585  hypothetical protein
SPy_2197      -1       18  -3.150109  hypothetical protein
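
Rows of the per-ORF table can be split into fields with a regular expression that tolerates both fused and whitespace-separated columns. The sketch below is not part of PredictBias. It assumes, from the rows in this report, that locus tags look like `SPy_NNNN`, DNBias is a single (possibly negative) digit, CDNBias has one or two digits, and %GCBias has one integer digit and six decimals. The names `OrfRow`, `ROW_RE`, and `parse_orf_row` are hypothetical.

```python
import re
from typing import NamedTuple, Optional


class OrfRow(NamedTuple):
    locus_tag: str
    dn_bias: int
    cdn_bias: int
    gc_bias: float
    product: str


# Assumed field shapes, inferred from the rows shown in this report.
ROW_RE = re.compile(
    r"^(SPy_\d{4})\s*"    # locus tag
    r"(-?\d)\s*"          # dinucleotide bias: one signed digit
    r"(\d{1,2})\s*"       # codon bias: one or two digits
    r"(-?\d\.\d{6})\s*"   # %GC bias: six decimal places
    r"(.*)$"              # product (may start with '*' markers)
)


def parse_orf_row(line: str) -> Optional[OrfRow]:
    """Return the parsed row, or None if the line is not a table row."""
    m = ROW_RE.match(line.strip())
    if m is None:
        return None
    tag, dn, cdn, gc, product = m.groups()
    return OrfRow(tag, int(dn), int(cdn), float(gc), product.strip())


print(parse_orf_row("SPy_2215-216-3.53587123S rRNA"))
```

Header lines such as `LocusTagDNBiasCDNBias%GCBiasProduct` do not match the pattern, so the function returns `None` for them.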
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_2216  V8PROTEASE    59     8e-12    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 59.2 bits (143), Expect = 8e-12
Identities = 32/169 (18%), Positives = 59/169 (34%), Gaps = 39/169 (23%)

Query: 123 VVTNNHVIDGAKRIEILMA------------DGSKVVGELVGADTYSDLAVVKISSDKIK 170
++TN HV+D + +G ++ DLA+VK S ++
Sbjct: 114 LLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGEGDLAIVKFSPNEQN 173

Query: 171 -------TVAEFADSTKLNVGEVAIAIGSPLG-TQYANSVTQGIVSSLSRTVTLKNENGE 222
A +++ + V + G P ++G ++ L
Sbjct: 174 KHIGEVVKPATMSNNAETQVNQNITVTGYPGDKPVATMWESKGKITYLK----------- 222

Query: 223 TVSTNAIQTDAAINPGNSGGPLINIEGQVIGINSSKISSTPTGSNGNSG 271
A+Q D + GNSG P+ N + +VIGI+ + + N
Sbjct: 223 ---GEAMQYDLSTTGGNSGSPVFNEKNEVIGIHWGGV-----PNEFNGA 263
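
The percentages on the "Identities = ..." line follow from the counts beside them. In this report they are consistent with integer truncation (32/169 is about 18.9% but is shown as 18%). A short Python sketch that parses such a line and recomputes the percentages; the names `STAT_RE`, `parse_alignment_stats`, and `truncated_percent` are illustrative, and the truncation rule is inferred from the values shown here:

```python
import re

# Matches e.g. "Identities = 32/169 (18%)"
STAT_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_alignment_stats(line: str) -> dict:
    """Map each statistic name to (count, alignment length, reported percent)."""
    return {
        name: (int(num), int(den), int(pct))
        for name, num, den, pct in STAT_RE.findall(line)
    }


def truncated_percent(count: int, length: int) -> int:
    """Percentage rounded down, which matches the values in this report."""
    return 100 * count // length


line = "Identities = 32/169 (18%), Positives = 59/169 (34%), Gaps = 39/169 (23%)"
for name, (count, length, reported) in parse_alignment_stats(line).items():
    print(name, truncated_percent(count, length), reported)
```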


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_2210  PF05272       32     0.009    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 31.6 bits (71), Expect = 0.009
Identities = 24/87 (27%), Positives = 31/87 (35%), Gaps = 13/87 (14%)

Query: 32 LIGANGAGKSTFLKILAGDIEPSTGHISLGPDERLSVLRQNHFDYEEERAIDVVIMGNEQ 91
L G G GKST + L G S H +G YE+ I +
Sbjct: 601 LEGTGGIGKSTLINTLVGLDFFSDTHFDIG---------TGKDSYEQIAGIVAYELSE-- 649

Query: 92 LYNIMKEKDAIYMKADFS-EEDGVRAA 117
+ DA +KA FS +D R A
Sbjct: 650 -MTAFRRADAEAVKAFFSSRKDRYRGA 675


2     SPy_2177  SPy_2165  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
SPy_2177       3       15   0.130247  putative transcriptional regulator (TetR/AcrR
SPy_2176       4       18   0.488488  hypothetical protein
SPy_2174       5       19  -0.309513  hypothetical protein
SPy_2173       5       21  -0.308985  hypothetical protein
SPy_2172       6       20  -2.805336  hypothetical protein
SPy_2170       6       20  -3.056530  hypothetical protein
SPy_2169       5       17  -3.745984  hypothetical protein
SPy_2166       3       18  -4.351738  hypothetical protein
SPy_2165       3       18  -4.580636  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_2177  HTHTETR       47     4e-09    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 47.3 bits (112), Expect = 4e-09
Identities = 20/134 (14%), Positives = 46/134 (34%), Gaps = 11/134 (8%)

Query: 4 RKENTKQAILKAMVMLLKTESFDDITTVKLSKRAGISRSSFYTHYKDKYEMID------- 56
+ T+Q IL + L + + +++K AG++R + Y H+KDK ++
Sbjct: 8 EAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSE 67

Query: 57 -YYQQTFFHKLEYIFEKKYQNKEQAFLEVFEFLQREQLLSSLLSANGTKEIQA---FIIN 112
+ + + V E E+ L+ K ++
Sbjct: 68 SNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQ 127

Query: 113 KVRLLITTDLQDKF 126
+ + + + D+
Sbjct: 128 QAQRNLCLESYDRI 141


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_2176  RTXTOXIND     35     0.001    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 35.2 bits (81), Expect = 0.001
Identities = 24/161 (14%), Positives = 57/161 (35%), Gaps = 16/161 (9%)

Query: 266 GLSQLTQATTLSDEKAKGIQSLIVGLPVLNQGIQQLNTELSTLQPPNLNADELGNSLGAI 325
L +S+E+ + SLI +Q +T + LN D+ +
Sbjct: 169 KLPDEPYFQNVSEEEVLRLTSLIK---------EQFSTWQNQKYQKELNLDKKR-AERLT 218

Query: 326 AQAAKQVIAEETAAQNEELSALQA----TSVYQSLTAEQQGELAAALSQSDKSQTVSAAQ 381
A + + L + ++ + EQ+ + A ++ S +
Sbjct: 219 VLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEA--VNELRVYKSQLE 276

Query: 382 TILSSVQTLSTSLQSLSQEDQSKQLEQLKEAVAQIANQSNQ 422
I S + + Q ++Q +++ L++L++ I + +
Sbjct: 277 QIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLE 317


3     SPy_2147  SPy_2129  Y     N          N                   Genomic Island
LocusTag  DNBias  CDNBias    %GCBias  Product
SPy_2147       4       21  -1.193377  hypothetical protein
SPy_2145       3       20   0.045356  hypothetical protein
SPy_2144       3       19   0.131340  hypothetical protein
SPy_2142       3       20   0.500821  hypothetical protein
SPy_2140       3       19   0.414845  hypothetical protein
SPy_2136       3       18   0.128206  putative DNA primase - phage associated
SPy_2135       6       24   0.494673  putative replication protein
SPy_2134       2       23  -1.031953  hypothetical protein
SPy_2133       1       21  -1.375722  hypothetical protein
SPy_2132       2       18  -1.601804  hypothetical protein
SPy_2131       2       18  -1.874421  hypothetical protein
SPy_2130       1       19  -2.026947  hypothetical protein
SPy_2129       3       19  -2.304438  hypothetical protein
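
In the islands listed in this report, the Prediction column tracks the Bias and Virulence flags: every island with Bias = Y and Virulence = Y is labelled "Pathogenicity Island (biased-composition)", and every island with Bias = Y and Virulence = N is labelled "Genomic Island". The Python sketch below encodes only that observed pattern. It is an inference from these rows, not the tool's documented rule, and the function name `predict_label` and the fallback string are hypothetical.

```python
def predict_label(bias: bool, virulence: bool) -> str:
    """Label an island from its flags, following the pattern seen in this report."""
    if bias and virulence:
        return "Pathogenicity Island (biased-composition)"
    if bias:
        return "Genomic Island"
    # No island without compositional bias appears in this report.
    return "no call (case not shown in this report)"


# (S.No, Bias, Virulence) for three islands copied from the summary rows
for number, bias, virulence in [(1, True, True), (3, True, False), (8, True, False)]:
    print(number, predict_label(bias, virulence))
```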
4     SPy_2095  SPy_2082  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
SPy_2095      -1       19   3.600824  putative endopeptidase O
SPy_2093      -1       24   4.057645  putative elongation factor TS
SPy_2092      -2       23   4.332606  30S ribosomal protein S2
SPy_2091       0       24   5.016150  putative regulatory protein
SPy_2090       0       29   6.305073  putative formiminoglutamate hydrolase
SPy_2089      -1       28   6.133753  putative histidine ammonia-lyase
SPy_2088       0       26   5.820709  putative cationic amino acid transporter
SPy_2087       0       23   5.365893  hypothetical protein
SPy_2085       0       25   5.158472  putative formate-tetrahydrofolate ligase
SPy_2084      -1       24   4.065984  putative serine cycle enzyme
SPy_2083       0       23   3.848342  putative formiminotransferase cyclodeaminase
SPy_2082       0       20   3.393390  putative urocanate hydratase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_2082  TCRTETA       29     0.047    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 29.4 bits (66), Expect = 0.047
Identities = 15/45 (33%), Positives = 24/45 (53%), Gaps = 6/45 (13%)

Query: 251 LFISSGLGGMSGAQGKAAEIAKAVAIIAEVDQSRIKTRHSQGWIS 295
L+I + G++GA G A A A IA++ + RH G++S
Sbjct: 99 LYIGRIVAGITGATG-----AVAGAYIADITDGDERARHF-GFMS 137


5     SPy_2055  SPy_2018  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
SPy_2055       2       18   2.383575  putative pyruvate formate-lyase activating
SPy_2054       1       22   3.438179  putative transcriptional regulator
SPy_2053       1       21   3.525161  putative transcriptional regulator
SPy_2052       3       24   4.129105  putative PTS system, enzyme III
SPy_2051       3       24   4.417929  putative PTS system, enzyme IIB
SPy_2050       1       21   4.272385  putative PTS system, enzyme IIC component
SPy_2049      -1       19   2.942639  putative pyruvate formate-lyase 2
SPy_2048       1       14   1.004174  putative transaldolase-like protein
SPy_2047       0       16   0.666775  putative glycerol dehydrogenase
SPy_2045      -1       18  -0.995964  protein low temperature requirement C
SPy_2043       0       24  -0.733348  mitogenic factor
SPy_2042      -1       26  -1.058262  putative transcription regulator
SPy_2041       2       34   0.246245  hypothetical protein
SPy_2040       3       32   0.809020  hypothetical protein
SPy_2039       2       28   0.737554  pyrogenic exotoxin B
SPy_2037       0       28   0.291640  hypothetical protein
SPy_2034      -2       22  -0.419547  conserved hypothetical protein
SPy_2033      -1       22  -0.668652  hypothetical protein
SPy_2032      -1       22   0.957758  putative ATP-binding cassette transporter-like
SPy_2031      -1       23   0.990979  ABC transporter ATP-binding protein
SPy_2029       1       24   0.614668  putative ABC transporter (ATP-binding protein)
SPy_2027       3       22   0.840758  putative two-component response regulator
SPy_2026       3       20   1.556543  putative histidine kinase
SPy_2025       2       18   2.612373  immunogenic secreted protein precursor
SPy_2023       1       14   2.078675  hypothetical protein (mga-associated)
SPy_2019       2       13   2.503029  M protein trans-acting positive regulator
SPy_2018       2       13   2.978093  M protein type 1
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_2039  STREPTOPAIN   709    0.0      Streptopain (C10) cysteine protease family signature.
		>STREPTOPAIN#Streptopain (C10) cysteine protease family signature.

Length = 398

Score = 709 bits (1831), Expect = 0.0
Identities = 398/398 (100%), Positives = 398/398 (100%)

Query: 1 MNKKKLGVRLLSLLALGGFVLANPVFADQNFARNEKEAKDSAITFIQKSAAIKAGARSAE 60
MNKKKLGVRLLSLLALGGFVLANPVFADQNFARNEKEAKDSAITFIQKSAAIKAGARSAE
Sbjct: 1 MNKKKLGVRLLSLLALGGFVLANPVFADQNFARNEKEAKDSAITFIQKSAAIKAGARSAE 60

Query: 61 DIKLDKVNLGGELSGSNMYVYNISTGGFVIVSGDKRSPEILGYSTSGSFDANGKENIASF 120
DIKLDKVNLGGELSGSNMYVYNISTGGFVIVSGDKRSPEILGYSTSGSFDANGKENIASF
Sbjct: 61 DIKLDKVNLGGELSGSNMYVYNISTGGFVIVSGDKRSPEILGYSTSGSFDANGKENIASF 120

Query: 121 MESYVEQIKENKKLDTTYAGTAEIKQPVVKSLLDSKGIHYNQGNPYNLLTPVIEKVKPGE 180
MESYVEQIKENKKLDTTYAGTAEIKQPVVKSLLDSKGIHYNQGNPYNLLTPVIEKVKPGE
Sbjct: 121 MESYVEQIKENKKLDTTYAGTAEIKQPVVKSLLDSKGIHYNQGNPYNLLTPVIEKVKPGE 180

Query: 181 QSFVGQHAATGCVATATAQIMKYHNYPNKGLKDYTYTLSSNNPYFNHPKNLFAAISTRQY 240
QSFVGQHAATGCVATATAQIMKYHNYPNKGLKDYTYTLSSNNPYFNHPKNLFAAISTRQY
Sbjct: 181 QSFVGQHAATGCVATATAQIMKYHNYPNKGLKDYTYTLSSNNPYFNHPKNLFAAISTRQY 240

Query: 241 NWNNILPTYSGRESNVQKMAISELMADVGISVDMDYGPSSGSAGSSRVQRALKENFGYNQ 300
NWNNILPTYSGRESNVQKMAISELMADVGISVDMDYGPSSGSAGSSRVQRALKENFGYNQ
Sbjct: 241 NWNNILPTYSGRESNVQKMAISELMADVGISVDMDYGPSSGSAGSSRVQRALKENFGYNQ 300

Query: 301 SVHQINRGDFSKQDWEAQIDKELSQNQPVYYQGVGKVGGHAFVIDGADGRNFYHVNWGWG 360
SVHQINRGDFSKQDWEAQIDKELSQNQPVYYQGVGKVGGHAFVIDGADGRNFYHVNWGWG
Sbjct: 301 SVHQINRGDFSKQDWEAQIDKELSQNQPVYYQGVGKVGGHAFVIDGADGRNFYHVNWGWG 360

Query: 361 GVSDGFFRLDALNPSALGTGGGAGGFNGYQSAVVGIKP 398
GVSDGFFRLDALNPSALGTGGGAGGFNGYQSAVVGIKP
Sbjct: 361 GVSDGFFRLDALNPSALGTGGGAGGFNGYQSAVVGIKP 398


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_2032  RTXTOXIND     55     3e-10    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 54.8 bits (132), Expect = 3e-10
Identities = 35/144 (24%), Positives = 55/144 (38%), Gaps = 10/144 (6%)

Query: 60 DISLTLAGEVTANNSSKVKIDSSKGEVKEVFVKKGDVVKVGQPLFSYETSQRLTAQSSEF 119
+I T G++T + SK VKE+ VK+G+ V+ G L +LTA +E
Sbjct: 81 EIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLL------KLTALGAEA 134

Query: 120 DVQTKANQLQVAKTNAALKWETYNRKVNEINTLKSRYNTAPDESLLEQIRSAEDSVSQAL 179
D + Q + A L+ Y I K PDE + + E +L
Sbjct: 135 DTL----KTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSL 190

Query: 180 SDAKTADSDVKTAQIELDKANATA 203
+ + + Q EL+ A
Sbjct: 191 IKEQFSTWQNQKYQKELNLDKKRA 214



Score = 39.4 bits (92), Expect = 2e-05
Identities = 28/180 (15%), Positives = 61/180 (33%), Gaps = 16/180 (8%)

Query: 120 DVQTKANQLQVAKTNAALKWETYNRKVNEINTLKSRYNTAPDESL---LEQIRSAEDSVS 176
D + ++ +AK + Y VNE+ KS+ E L E + +
Sbjct: 239 DFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKN 298

Query: 177 QALSDAKTADSDVKTAQIELDKANATATTEKGKLEYDTVKSDTAGTIVSLNTDLPNQSKS 236
+ L + ++ +EL K + + +++ + + L
Sbjct: 299 EILDKLRQTTDNIGLLTLELAKNEE-------RQQASVIRAPVSVKVQQLKVHTEGGVV- 350

Query: 237 KKENETFMEII-DKSKMLVKGNISEFDRDKLKIGQKVEV-IDRKDNSK--KWTGKVTQVG 292
ET M I+ + + V + D + +GQ + ++ ++ GKV +
Sbjct: 351 -TTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNIN 409


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_2027  HTHFIS        83     1e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 83.0 bits (205), Expect = 1e-20
Identities = 31/128 (24%), Positives = 55/128 (42%), Gaps = 1/128 (0%)

Query: 3 KILVVEDDDTISQVICEFLKANNYDPDCVFDGQAALDKWQTTSYDLIILDIMLPSLSGLE 62
ILV +DD I V+ + L YD + DL++ D+++P + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 63 VLKTIRKT-SDVPIIMLTALDDEYTQLVSFNHLISDYVTKPFSPLILIKRIENVLRVSTP 121
+L I+K D+P+++++A + T + + DY+ KPF LI I L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 122 DEKRQIGD 129
+ D
Sbjct: 125 RPSKLEDD 132


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_2026  MECHCHANNEL   32     0.002    Bacterial mechano-sensitive ion channel signature.
		>MECHCHANNEL#Bacterial mechano-sensitive ion channel signature.

Length = 136

Score = 31.7 bits (72), Expect = 0.002
Identities = 14/62 (22%), Positives = 28/62 (45%), Gaps = 8/62 (12%)

Query: 10 VINGLIIVVVTSILLVLYFAMPIYYTKVKDKEVKCEFDQTSKQIKGKTVTEIRDILTKKI 69
V + LI+ ++ A+ + + KE +K+ +TEIRD+L ++
Sbjct: 82 VFDFLIVA------FAIFMAIKLINKLNRKKEEPAAAPAPTKEEV--LLTEIRDLLKEQN 133

Query: 70 NK 71
N+
Sbjct: 134 NR 135


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_2025  IGASERPTASE   42     9e-06    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 41.6 bits (97), Expect = 9e-06
Identities = 28/157 (17%), Positives = 54/157 (34%), Gaps = 13/157 (8%)

Query: 42 TADTDTDDESETPKKDKKSKETASQHDTQKDHKPSHTHPTPPSNDTKQTDQASSEATDKP 101
T +T T + ET +K+ K TQ+ P T P + +T Q +E +
Sbjct: 1092 TKETQTTETKETATVEKEEKAKVETEKTQEV--PKVTSQVSPKQEQSETVQPQAEP-ARE 1148

Query: 102 NKDKNDTKQPDSSDQSTPSPKDQSSQKESQNKDGRPTPSPDQQKDQTPD--KTPEKSADK 159
N + K+P S +T D + + + + + + PE +
Sbjct: 1149 NDPTVNIKEPQSQTNTTA---DTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPA 1205

Query: 160 TPEKGPEKATDKTPEPN-----RDAPKPIQPPLAAAP 191
T + + P+ R P ++P ++
Sbjct: 1206 TTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSN 1242


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_2019  PF05043       519    0.0      Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 519 bits (1339), Expect = 0.0
Identities = 109/473 (23%), Positives = 217/473 (45%), Gaps = 20/473 (4%)

Query: 34 ELSKALNISMLTLQTCLTNMQ-FMKEVGGITYKNGYITIWYHQHCGLQEVYQKALRHSQS 92
EL++ LN + ++ L++++ ++ + NG I ++ VY +HS
Sbjct: 30 ELAELLNCTERAVKDDLSHVKSAFPDLIFHSSTNGIRIINT-DDSDIEMVYHHFFKHSTH 88

Query: 93 FKLLETLFFRDFNSLEELAEELFVSLSTLKRLIKKTNAYLMHTFGITILTSPVQVSGDEH 152
F +LE +FF + E + +E ++S S+L R+I + N + F + +PVQ+ G+E
Sbjct: 89 FSILEFIFFNEGCQAESICKEFYISSSSLYRIISQINKVIKRQFQFEVSLTPVQIIGNER 148

Query: 153 QIRLFYLKYFSEAYKISEWPFGEILNLKNCERLLSLMIKEVDVRVNFTLFQHLKILSSVN 212
IR F+ +YFSE Y EWPF + + +LL L+ KE +N + + LK+L N
Sbjct: 149 DIRYFFAQYFSEKYYFLEWPFEN-FSSEPLSQLLELVYKETSFPMNLSTHRMLKLLLVTN 207

Query: 213 LIRYYKGHSAVYDNKKTSQRFSQLIQSSLEFQDLSRLFHLKFGLYLDETTIAEMFSNHVN 272
L R GH D + + + + + +++ F ++ + LDE + ++F ++
Sbjct: 208 LYRIKFGHFMEVDKDSFNDQSLDFLMQAEGIEGVAQSFESEYNISLDEEVVCQLFVSYFQ 267

Query: 273 DQLEIGYAF--DSIKQDSPTGCRKVTNWVHLL----DELEIRLNLSVTNKYEVAVILHNT 326
I + +K+DS V HLL D++ ++ + + NK + LHNT
Sbjct: 268 KMFFIDESLFMKCVKKDS-----YVEKSYHLLSDFIDQISVKYQIEIENKDNLIWHLHNT 322

Query: 327 TVLKEEDITANYLFFDYKKSYLNFYKQEHPHLYKAFVAGVEKLMRSEKEPISTELTNQLI 386
L +++ ++ FD K + + ++ P + + + + S+ + N L
Sbjct: 323 AHLYRQELFTEFILFDQKGNTIRNFQNIFPKFVSDVKKELSHYLETLEVCSSSMMVNHLS 382

Query: 387 YAFFITWENSFLKVNQKDEKIRLLVI----ERSFNSVGNFLKKYVGEFFSITNFNELDAL 442
Y F ++ + + Q K+++LV+ + V L Y F + + EL+
Sbjct: 383 YTFITHTKHLVINLLQNQPKLKVLVMSNFDQYHAKFVAETLSYYCSNNFELEVWTELELS 442

Query: 443 TIDLEEIEKQYDVIVTDVMVGKSEELEIFFFHKMIPEAIIDKLNAFLNISFAD 495
LE + YD+I+++ ++ E + + + + ++I LNA + I +
Sbjct: 443 KESLE--DSPYDIIISNFIIPPIENKRLIYSNNINTVSLIYLLNAMMFIRLDE 493


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_2018  GPOSANCHOR    182    1e-53    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 182 bits (463), Expect = 1e-53
Identities = 246/450 (54%), Positives = 281/450 (62%), Gaps = 32/450 (7%)

Query: 35 NQTEVKANGDGNPREVIEDLAANNPAIQNIRLRYENKDLKARLENAMEVAGRDFKRAEEL 94
KA + + L DL+ LE AM + D + + L
Sbjct: 122 KADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTL 181

Query: 95 EKAKQALEDQRKDLETKLKELQQDYDLAKESTSWDRQRLEKELEEKKEALELAIDQASRD 154
E K ALE ++ +LE L+ K + LE +
Sbjct: 182 EAEKAALEARQAELEKALEGAMNF---------------STADSAKIKTLEAEKAALAAR 226

Query: 155 YHRATALEKELEEKKKALELAIDQASQDYNRANVLEKELETITREQEINRNLLGNAKLEL 214
+ A I + LE + + E N ++
Sbjct: 227 KADLEKALEGAMNFSTADSAKIKTLEAEKAA---LEARQAELEKALEGAMNFSTADSAKI 283

Query: 215 DQLSSEKEQLTIEKAKLEEEKQISDASRQSLRRDLDASREAKKQVEKDLANLTAELDKVK 274
L +EK L EKA LE + Q+ +A+RQSLRRDLDASREAKKQ+E AE K++
Sbjct: 284 KTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLE-------AEHQKLE 336

Query: 275 EDKQISDASRQGLRRDLDASREAKKQVEKDLANLTAELDKVKEEKQISDASRQGLRRDLD 334
E +IS+ASRQ LRRDLDASREAKKQ+E AE K++E+ +IS+ASRQ LRRDLD
Sbjct: 337 EQNKISEASRQSLRRDLDASREAKKQLE-------AEHQKLEEQNKISEASRQSLRRDLD 389

Query: 335 ASREAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEAKALKEQLA 394
ASREAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEAKALKE+LA
Sbjct: 390 ASREAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEAKALKEKLA 449

Query: 395 KQAEELAKLRAGKASDSQTPDTKPGNKAVPGKGQAPQAGTKPNQNKAPMKETKRQLPSTG 454
KQAEELAKLRAGKASDSQTPD KPGNKAVPGKGQAPQAGTKPNQNKAPMKETKRQLPSTG
Sbjct: 450 KQAEELAKLRAGKASDSQTPDAKPGNKAVPGKGQAPQAGTKPNQNKAPMKETKRQLPSTG 509

Query: 455 ETANPFFTAAALTVMATAGVAAVVKRKEEN 484
ETANPFFTAAALTVMATAGVAAVVKRKEEN
Sbjct: 510 ETANPFFTAAALTVMATAGVAAVVKRKEEN 539



Score = 51.2 bits (122), Expect = 6e-09
Identities = 86/413 (20%), Positives = 143/413 (34%), Gaps = 50/413 (12%)

Query: 1 MAKNNTNRHYSLRKLKTGTASVAVALTVLGAGFANQTEVKANGDGNPREVIEDLAANNPA 60
M KNNTNRHYSLRKLKTGTASVAVALTVLGAG T + + +
Sbjct: 1 MTKNNTNRHYSLRKLKTGTASVAVALTVLGAGLVVNTNEVSAVATRSQTDTLEKVQER-- 58

Query: 61 IQNIRLRYENKDLKARLENAMEVAGRDFKRAEELEKAKQALEDQRKDLETKLKELQQDYD 120
+ EN LK + + +EL + +++ + + L E
Sbjct: 59 --ADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKAS--- 113

Query: 121 LAKESTSWDRQRLEKELEEKKEALELAIDQASRDYHRATALEKELEEKKKALELAIDQAS 180
+ELE +K LE A++ A +A K LE +K AL
Sbjct: 114 ------------KIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLE 161

Query: 181 QDYNRANVLEKELETITREQEINRNLLGNAKLELDQLSSEKEQLTIEKAKLEEEKQISDA 240
+ L+ + + + LE EK +A
Sbjct: 162 KA-------------------------------LEGAMNFSTADSAKIKTLEAEKAALEA 190

Query: 241 SRQSLRRDLDASREAKKQVEKDLANLTAELDKVKEDKQISDASRQGLRRDLDASREAKKQ 300
+ L + L+ + + L AE + K + + +G A K
Sbjct: 191 RQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKT 250

Query: 301 VEKDLANLTAELDKVKEEKQISDASRQGLRRDLDASREAKKQVEKALEEANSKLAALEKL 360
+E + A L A ++++ + + + K +E + + L
Sbjct: 251 LEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNAN 310

Query: 361 NKELEESKKLTEKEKAELQAKLEAEAKALKEQLAKQAEELAKLRAGKASDSQT 413
+ L + + K +L+A+ + + K A + L A + + Q
Sbjct: 311 RQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQL 363


6     SPy_1995  SPy_1949  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
SPy_1995      -2       23   3.025833  topology modulator
SPy_1994      -1       22   3.938181  pai1 protein (theoretical repressor)
SPy_1992       0       21   4.476517  *hypothetical protein
SPy_1991       0       18   4.859737  anthranilate synthase component II
SPy_1990       0       18   4.629109  putative para-aminobenzoate synthetase
SPy_1989       0       20   4.899833  hypothetical protein
SPy_1988       0       20   6.199064  putative methyltransferase
SPy_1987      -1       21   5.964883  16S rRNA (uracil(1498)-N(3))-methyltransferase
SPy_1986      -1       21   6.007343  putative PTS system, enzyme II, A component
SPy_1985      -3       22   5.065152  hypothetical protein
SPy_1984      -2       22   5.233455  putative ribonucleotide reductase (NrdI
SPy_1983      -2       19   4.750327  collagen-like surface protein
SPy_1981      -1       21   3.974462  (p)ppGpp synthetase
SPy_1980      -2       16   3.647222  hypothetical protein
SPy_1979      -2       16   3.653278  streptokinase A precursor
SPy_1978      -1       16   4.019037  leucine-rich protein
SPy_1976      -2       16   3.653790  multiple sugar-binding ABC transport system
SPy_1973      -2       15   3.290404  dextran glucosidase
SPy_1972      -2       13   2.555756  putative pullulanase
SPy_1971      -2       14   2.286441  hypothetical protein
SPy_1968       0       15   3.155008  hypothetical protein
SPy_1965       0       15   3.169189  putative undecaprenyl pyrophosphate synthetase
SPy_1964      -1       14   3.305022  putative phosphatidate cytidylyltransferase
SPy_1963      -2       14   3.452521  may be involved in production of a peptide sex
SPy_1962      -2       16   3.721253  putative prolyl-tRNA synthetase
SPy_1961      -2       17   3.554346  DNA polymerase III (alpha subunit)
SPy_1960       0       17   1.845186  putative transcriptional regulator (MarR
SPy_1959      -1       19   1.893754  hypothetical protein
SPy_1958      -2       18   1.426210  putative polypeptide deformylase
SPy_1957      -1       20   2.641731  hypothetical protein
SPy_1956      -1       17   2.534264  hypothetical protein
SPy_1955      -1       17   3.320819  30S ribosomal protein S15
SPy_1952      -1       17   3.138167  hypothetical protein
SPy_1950      -1       17   3.793537  hypothetical protein
SPy_1949      -1       17   3.549870  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_1992  HTHFIS        34     0.001    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 33.7 bits (77), Expect = 0.001
Identities = 28/155 (18%), Positives = 46/155 (29%), Gaps = 38/155 (24%)

Query: 8 RMRPKTISEVIGQKHLVGEGKIIRRMVE-----ANRLSSMILYGPPGIGKTSIASAIAGT 62
R K + LVG ++ + ++++ G G GK +A A+
Sbjct: 124 RRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDY 183

Query: 63 TRYAFRTF--------------------------NATIDSKKRLQEIAEEAKFSGGLVLL 96
+ F A S R ++ + G L
Sbjct: 184 GKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQ-------AEGGTLF 236

Query: 97 LDEIHRLDKTKQDFLLPLLENGTIIMIGATTENPF 131
LDEI + Q LL +L+ G +G T
Sbjct: 237 LDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRS 271


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_1983  GPOSANCHOR    60     6e-12    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 59.7 bits (144), Expect = 6e-12
Identities = 37/90 (41%), Positives = 43/90 (47%)

Query: 259 KSPEGEAGQPGEKAPEKSKEVTPAAEKPADKEANQTPERRNGNMAKTPVANNHRRLPATG 318
K E A KA + K + N K P+ R+LP+TG
Sbjct: 450 KQAEELAKLRAGKASDSQTPDAKPGNKAVPGKGQAPQAGTKPNQNKAPMKETKRQLPSTG 509

Query: 319 EQANPFFTAAAVAVMTTAGVLAVTKRKENN 348
E ANPFFTAAA+ VM TAGV AV KRKE N
Sbjct: 510 ETANPFFTAAALTVMATAGVAAVVKRKEEN 539


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_1979  STREPKINASE   815    0.0      Streptococcus streptokinase protein signature.
		>STREPKINASE#Streptococcus streptokinase protein signature.

Length = 440

Score = 815 bits (2106), Expect = 0.0
Identities = 389/440 (88%), Positives = 410/440 (93%)

Query: 1 MKNYLSIGVIALLFALTFGTVKSVQAIAGYGWLPDRPPINNSQLVVSMAGIVEGTDKKVF 60
MKNYLS G+ ALLFALTFGTV SVQAIAG WL DRP +NNSQLVVS+AG VEGT++ +
Sbjct: 1 MKNYLSFGMFALLFALTFGTVNSVQAIAGPEWLLDRPSVNNSQLVVSVAGTVEGTNQDIS 60

Query: 61 INFFEIDLTSQPAHGGKTEQGLSPKSKPFATDNGAMPHKLEKADLLKAIQKQLIANVHSN 120
+ FFEIDLTS+PAHGGKTEQGLSPKSKPFATD+GAM HKLEKADLLKAIQ+QLIANVHSN
Sbjct: 61 LKFFEIDLTSRPAHGGKTEQGLSPKSKPFATDSGAMSHKLEKADLLKAIQEQLIANVHSN 120

Query: 121 DGYFEVIDFASDATITDRNGKVYFADKDGSVTLPTQPVQEFLLKGHVRVRPYKEKPVQNQ 180
D YFEVIDFASDATITDRNGKVYFADKDGSVTLPTQPVQEFLL GHVRVRPYKEKP+QNQ
Sbjct: 121 DDYFEVIDFASDATITDRNGKVYFADKDGSVTLPTQPVQEFLLSGHVRVRPYKEKPIQNQ 180

Query: 181 AKSVDVEYTVQFTPLNPDDDFRPGLKDTKLLKTLAIGDTITSQELLAQAQSILNKTHPGY 240
AKSVDVEYTVQFTPLNPDDDFRPGLKDTKLLKTLAIGDTITSQELLAQAQSILNK HPGY
Sbjct: 181 AKSVDVEYTVQFTPLNPDDDFRPGLKDTKLLKTLAIGDTITSQELLAQAQSILNKNHPGY 240

Query: 241 TIYERDSSIVTHDNDIFRTILPMDQEFTYHVKNREQAYEINPKTGIKEKTNNTDLVSEKY 300
TIYERDSSIVTHDNDIFRTILPMDQEFTY VKNREQAY IN K+G+ E+ NNTDL+SEKY
Sbjct: 241 TIYERDSSIVTHDNDIFRTILPMDQEFTYRVKNREQAYRINKKSGLNEEINNTDLISEKY 300

Query: 301 YVLKQGEKPYDPFDRSHLKLFTIKYVDVNTNELLKSEQLLTASERNLDFRDLYDPRDKAK 360
YVLK+GEKPYDPFDRSHLKLFTIKYVDV+TNELLKSEQLLTASERNLDFRDLYDPRDKAK
Sbjct: 301 YVLKKGEKPYDPFDRSHLKLFTIKYVDVDTNELLKSEQLLTASERNLDFRDLYDPRDKAK 360

Query: 361 LLYNNLDAFDIMDYTLTGKVEDNHDKNNRVVTVYMGKRPKGAKGSYHLAYDKDLYTEEER 420
LLYNNLDAF IMDYTLTGKVEDNHD NR++TVYMGKRP+G SYHLAYDKD YTEEER
Sbjct: 361 LLYNNLDAFGIMDYTLTGKVEDNHDDTNRIITVYMGKRPEGENASYHLAYDKDRYTEEER 420

Query: 421 KAYSYLRDTGTPIPDNPKDK 440
+ YSYLR TGTPIPDNP DK
Sbjct: 421 EVYSYLRYTGTPIPDNPNDK 440


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_1978  HTHFIS        34     7e-04    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 33.7 bits (77), Expect = 7e-04
Identities = 10/30 (33%), Positives = 19/30 (63%)

Query: 229 ALWSEHGNLVQTAQRLYIHRNSLQYKLDKF 258
AL + GN ++ A L ++RN+L+ K+ +
Sbjct: 444 ALTATRGNQIKAADLLGLNRNTLRKKIREL 473


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_1976  PF05272       35     6e-04    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 34.7 bits (79), Expect = 6e-04
Identities = 14/56 (25%), Positives = 20/56 (35%), Gaps = 9/56 (16%)

Query: 34 IVFVGPSGCGKSTTLRMIAGLEDISEGELKIGGEVVNDKSPKDRDIAMVFQNYALY 89
+V G G GKST + + GL+ S+ IG +D Y
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIG---------TGKDSYEQIAGIVAY 645


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_1963  PF04605       30     0.009    Virulence-associated protein D (VapD)
		>PF04605#Virulence-associated protein D (VapD)

Length = 125

Score = 29.8 bits (67), Expect = 0.009
Identities = 8/44 (18%), Positives = 17/44 (38%), Gaps = 2/44 (4%)

Query: 227 INGYKVTSWNDLTEAV-DLATRD-LGPSQTIKVTYKSHQRLKTV 268
+ ++ L E + DL +D + +Q+LK +
Sbjct: 80 FDITEIGEQYSLKETIQDLCAKDFHQKLKEFTEKTPKNQKLKDL 123


7     SPy_1935  SPy_1911  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
SPy_1935       2       28   0.240439  hypothetical protein
SPy_1934       4       27  -1.566588  putative transcription regulator
SPy_1932       4       29  -1.808168  50S ribosomal protein L13
SPy_1931       3       27  -2.530693  ribosomal protein S9
SPy_1927       1       20  -3.620350  hypothetical protein
SPy_1926       2       21  -2.785227  hypothetical protein
SPy_1924       2       21  -2.580481  putative lactose phosphotransferase system
SPy_1923       1       26  -1.532987  galactosidase acetyltransferase
SPy_1922       1       24  -1.395086  putative galactose-6-phosphate isomerase (B
SPy_1921       1       23  -1.432765  putative tagatose 6-phosphate kinase
SPy_1919       1       18  -2.976412  putative tagatose 1,6-diphosphate aldolase
SPy_1918       0       14  -3.750505  putative PTS system, lactose-specific component
SPy_1917       0       15  -4.864372  putative PTS system, lactose-specific component
SPy_1916       3       15  -6.285933  putative phospho-beta-D-galactosidase
SPy_1915       3       15  -6.470204  lantibiotic precursor
SPy_1914       3       16  -6.355485  putative salivaricin A modification enzyme;
SPy_1912       1       15  -4.000371  ABC transporter ATP-binding protein
SPy_1911       0       16  -3.199285  putative ABC transporter (permease) associated
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_1924  ARGREPRESSOR  30     0.006    Bacterial arginine repressor signature.
		>ARGREPRESSOR#Bacterial arginine repressor signature.

Length = 149

Score = 29.8 bits (67), Expect = 0.006
Identities = 21/85 (24%), Positives = 38/85 (44%), Gaps = 11/85 (12%)

Query: 1 MKKKERHEKILDILKVDGFIKVKDIIDEM-----NISDMTARRDLDTLADKGLL-IRTHG 54
M K +RH KI +I+ + +++D + N++ T RD+ L L+ + T+
Sbjct: 1 MNKGQRHIKIREIITANEIETQDELVDILKKDGYNVTQATVSRDIKEL---HLVKVPTNN 57

Query: 55 GAQYLDYSSAKDEGHEKTHTEKKVL 79
G+ YS D+ K+ L
Sbjct: 58 GSYK--YSLPADQRFNPLSKLKRSL 80


8     SPy_1842  SPy_1832  Y     N          N                   Genomic Island
LocusTag  DNBias  CDNBias    %GCBias  Product
SPy_1842      -1       14  -3.318242  putative signal peptidase I
SPy_1841      -2       15  -3.481342  putative ribonuclease HIII
SPy_1840      -2       15  -3.951414  hypothetical protein
SPy_1839      -2       15  -3.125445  hypothetical protein
SPy_1837      -2       16  -3.129497  putative DNA mismatch repair protein
SPy_1836       0       17  -3.349843  hypothetical protein
SPy_1835       0       19  -1.697783  putative thioredoxin
SPy_1834       1       22  -1.166701  hypothetical protein
SPy_1833       1       22   0.278575  A/G-specific adenine glycosylase
SPy_1832       3       26   0.551879  hypothetical protein
9     SPy_1748  SPy_1725  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
SPy_1748       1       14   3.022184  putative beta-ketoacyl-ACP synthase II
SPy_1747       0       16   3.118703  putative biotin carboxyl carrier protein
SPy_1746       0       15   2.695937  putative beta-hydroxyacyl-ACP dehydratase
SPy_1745       0       18   2.823810  acetyl-CoA carboxylase biotin carboxylase
SPy_1744       0       27   3.283475  acetyl-CoA carboxylase carboxyl transferase
SPy_1743       0       36   3.118163  putative acetyl-CoA carboxylase alpha subunit
SPy_1742       0       37   2.697061  putative seryl-tRNA synthetase
SPy_1741       0       27   1.601781  hypothetical protein
SPy_1740      -1       27   1.701389  putative mannose-specific phosphotransferase
SPy_1739      -2       21   0.834652  putative mannose-specific phosphotransferase
SPy_1738      -3       16  -0.339971  mannose-specific phosphotransferase system
SPy_1737      -1       12  -0.915200  hypothetical protein
SPy_1736      -1       14  -0.864611  hypothetical protein
SPy_1735       2       17  -1.198653  hypothetical protein
SPy_1734       2       17  -2.477612  hypothetical protein
SPy_1733       0       13  -2.345682  putative transcription regulator
SPy_1731      -2       13  -2.315376  hypothetical protein
SPy_1730      -1       13  -2.238899  putative cell-cycle regulation histidine triad
SPy_1729      -1       11  -1.888651  ABC transporter ATP-binding protein
SPy_1728      -1       10  -2.031231  ABC transporter permease
SPy_1727       0       12  -0.750139  hypothetical protein
SPy_1726       1       14   0.580543  hypothetical protein
SPy_1725       2       15   0.484627  *hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_1736  TYPE3IMSPROT  32     0.005    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 32.0 bits (73), Expect = 0.005
Identities = 19/123 (15%), Positives = 43/123 (34%), Gaps = 8/123 (6%)

Query: 372 LTAVSTAVCFLLSILLLPLVGIVPAAATAPALIIVGVMMVSSFLDVNWSKF--ADALPAF 429
L+ V V L PL+ + A A ++ G ++ + + K +
Sbjct: 72 LSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIKKINPIEGAKRI 131

Query: 430 FAA-FFMALCYSISYGIAAAFIFYCLVK-----VVEGKTKDIHPIIWGATFLFIVNFIIL 483
F+ + SI + + + + ++K +++ T I I + +I
Sbjct: 132 FSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLLGQILRQLMVIC 191

Query: 484 TIL 486
T+
Sbjct: 192 TVG 194


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_1734  SACTRNSFRASE  62     1e-14    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 61.9 bits (150), Expect = 1e-14
Identities = 27/86 (31%), Positives = 40/86 (46%), Gaps = 4/86 (4%)

Query: 62 CLLARLDEKVVGLLNLSGEVLSQGQAEADVFMLVAKTYRGYGIGQLLLEIALDWAEENPY 121
L L+ +G + + E + VAK YR G+G LL A++WA+EN +
Sbjct: 67 AFLYYLENNCIGRIKIRSNWNGYALIED---IAVAKDYRKKGVGTALLHKAIEWAKENHF 123

Query: 122 IESLKLDVQVRNTKAIYLYKKYGFRI 147
L L+ Q N A + Y K+ F I
Sbjct: 124 C-GLMLETQDINISACHFYAKHHFII 148


10    SPy_1710  SPy_1682  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
SPy_1710       2       13   2.262057  putative PTS system, enzyme IIB component
SPy_1709       2       12   2.325667  putative PTS system, enzyme IIC component
SPy_1708       0       11   1.317209  putative galactose-6-phosphate isomerase
SPy_1707       0       11   0.698557  putative galactose-6-phosphate isomerase
SPy_1704       2       10   0.886209  putative tagatose 1,6-diphosphate aldolase
SPy_1701       0       10   0.387661  hypothetical protein
SPy_1700      -1        9   0.829048  hypothetical protein
SPy_1699      -1       10   0.724655  putative transcription regulator
SPy_1698      -1       10   0.658412  hypothetical protein
SPy_1697      -1        9   1.382859  hypothetical protein
SPy_1695      -1       11   2.323932  hypothetical protein
SPy_1694      -1       13   3.120806  putative N-acetylglucosamine-6-phosphate
SPy_1693      -1       15   2.236705  putative reductase / dehydrogenase
SPy_1691       0       15   2.404490  hypothetical protein
SPy_1689      -1       14   3.248774  putative glycyl-tRNA synthetase (alpha subunit)
SPy_1688      -1       14   3.594704  putative glycyl-tRNA synthetase (beta subunit)
SPy_1687       0       12   3.060016  hypothetical protein
SPy_1686       0       13   2.581028  hypothetical protein
SPy_1684       0       13   2.857082  putative glycerol kinase
SPy_1683      -1       13   3.336401  putative alpha-glycerophosphate oxidase
SPy_1682       0       14   3.017184  putative glycerol uptake facilitator
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
SPy_1699  HTHTETR  42     3e-07    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 41.9 bits (98), Expect = 3e-07
Identities = 13/65 (20%), Positives = 28/65 (43%)

Query: 19 KETRRIARESMEIALLNLLETKPLGDITISELVTKAGVSRNAFYRNYTSKEAIIEQLLVG 78
K+ + R+ + L L + + ++ E+ AGV+R A Y ++ K + ++
Sbjct: 6 KQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWEL 65

Query: 79 VIRRI 83
I
Sbjct: 66 SESNI 70


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
SPy_1686  THERMOLYSIN  40     1e-06    Thermolysin metalloprotease (M4) family signature.
		>THERMOLYSIN#Thermolysin metalloprotease (M4) family signature.

Length = 544

Score = 40.0 bits (93), Expect = 1e-06
Identities = 15/78 (19%), Positives = 29/78 (37%), Gaps = 3/78 (3%)

Query: 69 NQPKTSQTSKKVKLSEDKAKSIALKDASVTEADAQMLSVTQDNEDGKAVYEIEFQNKDQE 128
+ S ++ +D A + + + E L + D E + YE+ +
Sbjct: 134 TEAAISIQQAEMIAKQDVADRVTKERPAAEEGKPTRLVIYPDEETPRLAYEVNVRFLTPV 193

Query: 129 ---YSYTIDANSGDIVEK 143
+ Y IDA G ++ K
Sbjct: 194 PGNWIYMIDAADGKVLNK 211


11  SPy_1617  SPy_1584  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SPy_1617  1       18       -3.688238  hypothetical protein
SPy_1616  0       17       -3.665725  putative late competence protein required for
SPy_1615  -1      14       -1.782676  putative late competence protein
SPy_1613  -2      13       -2.542492  conserved protein - function unknown
SPy_1610  -3      14       0.387025   ********hypothetical protein
SPy_1608  -3      15       1.179709   hypothetical protein
SPy_1607  -3      14       1.559950   hypothetical protein
SPy_1606  -3      15       2.227996   putative RNA methyltransferase
SPy_1605  -2      14       2.583810   putative two-component responsible histidine
SPy_1604  -1      18       4.264235   hypothetical protein
SPy_1603  0       17       3.395360   hypothetical protein
SPy_1602  0       19       2.774591   putative transcription regulator
SPy_1600  1       16       2.371006   putative hyaluronidase
SPy_1599  3       18       1.770750   putative beta-glucosidase
SPy_1596  0       17       0.928579   putative transcriptional regulator
SPy_1595  0       17       -0.049859  putative sugar-binding transport protein
SPy_1593  1       17       1.904668   putative sugar-binding transport protein
SPy_1592  1       17       2.461342   putative ABC transporter substrate binding
SPy_1589  1       19       2.820155   hypothetical protein
SPy_1588  0       19       3.143129   putative two-component sensor histidine kinase
SPy_1587  1       18       3.352001   putative two-component sensor response
SPy_1586  1       19       4.265827   putative beta-galactosidase
SPy_1584  -3      17       3.091448   putative shikimate 5-dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
SPy_1596  PF03309  30     0.012    Bvg accessory factor
		>PF03309#Bvg accessory factor

Length = 271

Score = 29.7 bits (67), Expect = 0.012
Identities = 13/65 (20%), Positives = 23/65 (35%), Gaps = 7/65 (10%)

Query: 18 LLCIDIGGTSLKFALCHN----GQLSQQSSFPT--PSSLEKFYQLLDQEVARYSAYHFSG 71
LL ID+ T L ++ QQ T + ++ +D + A +G
Sbjct: 2 LLAIDVRNTHTVVGLISGSGDHAKVVQQWRIRTEPEVTADELALTIDG-LIGDDAERLTG 60

Query: 72 IAISS 76
+ S
Sbjct: 61 ASGLS 65


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
SPy_1588  PF06580  182    2e-54    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 182 bits (463), Expect = 2e-54
Identities = 70/324 (21%), Positives = 133/324 (41%), Gaps = 34/324 (10%)

Query: 250 LSKAYRMQYNRSGDLLAYVAVRKSYLLAEAVRTVFVYGLVSLLLAWLLLQLL-FRVFRNY 308
L+ AYR R G L + + A + + V+ W LL + +
Sbjct: 55 LTHAYRSFIKRQG-WLKLNMGQIILRVLPACVVIGMVWFVANTSIWRLLAFINTKPVAFT 113

Query: 309 IQQVSEITDTVEMVAAGDLSLTIDNSHMELELYHISEAINQMLASIKAYIDEVYVLEVEQ 368
+ I V +V + M LY + +A ID+ +
Sbjct: 114 LPLALSIIFNVVVV-----------TFMWSLLYF---GWHFFKNYKQAEIDQWK-MASMA 158

Query: 369 RDAQMRALQSQINPHFLYNTLEYIRMYALSCQQEELADVIYAFASLLRNNI--SQDKMTT 426
++AQ+ AL++QINPHF++N L IR L + +++ + + L+R ++ S + +
Sbjct: 159 QEAQLMALKAQINPHFMFNALNNIRALILE-DPTKAREMLTSLSELMRYSLRYSNARQVS 217

Query: 427 LKEELAFCEKYIYLYQMRYPDSFAYHVKIDESVADLAIPKFVIQPLVENYFVHGIDYSRH 486
L +EL + Y+ L +++ D + +I+ ++ D+ +P ++Q LVEN HGI
Sbjct: 218 LADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQ 277

Query: 487 DNALSIKALDETDHLLIQVLDNGRGISQERLADMEKRLQEHQTTGNSSIGLQNVYLRLFH 546
+ +K + + ++V + G L T ++ GLQNV RL
Sbjct: 278 GGKILLKGTKDNGTVTLEVENTG-------------SLALKNTKESTGTGLQNVRERLQM 324

Query: 547 HFRDRVSWSMAKEPNGGFIIQIRI 570
+ ++++ G + I
Sbjct: 325 LYGTEAQIKLSEKQ-GKVNAMVLI 347


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
SPy_1587  HTHFIS  84     2e-19    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 84.1 bits (208), Expect = 2e-19
Identities = 35/170 (20%), Positives = 62/170 (36%), Gaps = 10/170 (5%)

Query: 3 KVLLVDDEYMILQGLTMIIDWQALGFEVVQTARSGKEALAYLTQYPVDVMISDVTMPGMT 62
+L+ DD+ I L + G++V + ++ D++++DV MP
Sbjct: 5 TILVADDDAAIRTVLNQAL--SRAGYDVR-ITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 63 GLDLIEAAKTYHPQLQTLILSGYQEFSYVQKAMELETKGYLLKPVDKAELQAKMKQFKDW 122
DL+ K P L L++S F KA E YL KP D EL + +
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 123 LDAQQAESIRQEAYHDSLLTLWLTDELSEKEFQQLSQGLPAAALTGFTVL 172
+ ++ L+ Q++ + L T T++
Sbjct: 122 PKRRPSKLEDDSQDGMPLVG-------RSAAMQEIYRVLARLMQTDLTLM 164


12  SPy_1551  SPy_1531  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SPy_1551  3       23       1.145335   conserved protein - function unknown
SPy_1549  2       22       0.917821   putative arginine repressor
SPy_1548  1       19       1.244172   hypothetical protein
SPy_1547  3       23       2.063972   streptococcal antitumor protein
SPy_1546  3       18       2.211380   hypothetical protein
SPy_1544  2       17       2.038783   putative ornithine transcarbamylase
SPy_1543  1       14       1.384246   hypothetical protein
SPy_1542  -1      13       1.007368   putative Xaa-His dipeptidase
SPy_1541  -2      14       0.049224   putative carbamate kinase
SPy_1539  -1      16       -1.381413  putative asparagine synthetase A
SPy_1538  1       17       -2.132758  hypothetical protein
SPy_1537  3       19       -2.692595  putative 3-deoxy-D-manno-octulosonic-acid
SPy_1536  3       16       -2.102846  hypothetical protein
SPy_1535  2       15       -2.545918  putative ribose transport operon repressor
SPy_1534  0       12       -1.061784  hypothetical protein
SPy_1533  0       12       -0.321614  23S rRNA (adenine(2503)-C(2))-methyltransferase
SPy_1532  0       18       1.105102   hypothetical protein
SPy_1531  2       23       1.940101   putative peroxide resistance protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_1549  ARGREPRESSOR  123    4e-39    Bacterial arginine repressor signature.
		>ARGREPRESSOR#Bacterial arginine repressor signature.

Length = 149

Score = 123 bits (311), Expect = 4e-39
Identities = 60/146 (41%), Positives = 92/146 (63%), Gaps = 2/146 (1%)

Query: 1 MNKKETRHQLIRSLISETTIHTQQELQERLQKNGITITQATLSRDMKELNLVKVTSGNDT 60
MNK + RH IR +I+ I TQ EL + L+K+G +TQAT+SRD+KEL+LVKV + N +
Sbjct: 1 MNKGQ-RHIKIREIITANEIETQDELVDILKKDGYNVTQATVSRDIKELHLVKVPTNNGS 59

Query: 61 HYEALAISQTRWEH-RLRFYMEDALVMLKIVQHQIILKTLPGLAQSFGSILDAMQIPEIV 119
+ +L Q +L+ + DA V + H I+LKT+PG AQ+ G+++D + EI+
Sbjct: 60 YKYSLPADQRFNPLSKLKRSLMDAFVKIDSASHLIVLKTMPGNAQAIGALMDNLDWEEIM 119

Query: 120 ATVCGDDTCLIVCEDNEQAKACYETL 145
T+CGDDT LI+C ++ K + +
Sbjct: 120 GTICGDDTILIICRTHDDTKVVQKKI 145


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_1547  ARGDEIMINASE  578    0.0      Bacterial arginine deiminase signature.
		>ARGDEIMINASE#Bacterial arginine deiminase signature.

Length = 409

Score = 578 bits (1492), Expect = 0.0
Identities = 191/410 (46%), Positives = 276/410 (67%), Gaps = 9/410 (2%)

Query: 5 TPIHVYSEIGKLKKVLLHRPGKEIENLMPDYLERLLFDDIPFLEDAQKEHDAFAQALRDE 64
PI+++SEIG+LKKVLLHRPG+E+ENL P ++ LFDDIP+LE A++EH+ FA L++
Sbjct: 6 NPINIFSEIGRLKKVLLHRPGEELENLTPFIMKNFLFDDIPYLEVARQEHEVFASILKNN 65

Query: 65 GIEVLYLETLAAESLVTP-EIREAFIDEYLSEANIRGRATKKAIRELLMAIEDNQELIEK 123
+E+ Y+E L +E LV+ + FI +++ EA I+ T +++ ++ +I K
Sbjct: 66 LVEIEYIEDLISEVLVSSVALENKFISQFILEAEIKTDFTINLLKDYFSSL-TIDNMISK 124

Query: 124 TMAGVQKSELPEIPASEKGLTDLVESNYPFAIDPMPNLYFTRDPFATIGTGVSLNHMFSE 183
++GV EL +S L DLV F IDPMPN+ FTRDPFA+IG GV++N MF++
Sbjct: 125 MISGVVTEELKNYTSS---LDDLVNGANLFIIDPMPNVLFTRDPFASIGNGVTINKMFTK 181

Query: 184 TRNRETLYGKYIFTHHPIYGGGKVPMVYDRNETTRIEGGDELVLSKDVLAVGISQRTDAA 243
R RET++ +YIF +HP+Y VP+ +R E +EGGDELVL+K +L +GIS+RT+A
Sbjct: 182 VRQRETIFAEYIFKYHPVYKE-NVPIWLNRWEEASLEGGDELVLNKGLLVIGISERTEAK 240

Query: 244 SIEKLLVNIFKQNLGFKKVLAFEFANNRKFMHLDTVFTMVDYDKFTIHPEIEGDLRVYSV 303
S+EKL +++FK F +LAF+ NR +MHLDTVFT +DY FT + +Y +
Sbjct: 241 SVEKLAISLFKNKTSFDTILAFQIPKNRSYMHLDTVFTQIDYSVFTSFTSDDMYFSIYVL 300

Query: 304 TYDNE--ELHIVEEKGDLAELLAANLGVEKVDLIRCGGDNLVAAGREQWNDGSNTLTIAP 361
TY+ ++HI +EK + ++L+ LG K+D+I+C G +L+ REQWNDG+N L IAP
Sbjct: 301 TYNPSSSKIHIKKEKARIKDVLSFYLG-RKIDIIKCAGGDLIHGAREQWNDGANVLAIAP 359

Query: 362 GVVVVYNRNTITNAILESKGLKLIKIHGSELVRGRGGPRCMSMPFEREDI 411
G ++ Y+RN +TN + E G+K+ +I SEL RGRGGPRCMSMP REDI
Sbjct: 360 GEIIAYSRNHVTNKLFEENGIKVHRIPSSELSRGRGGPRCMSMPLIREDI 409


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_1541  CARBMTKINASE  405    e-145    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 405 bits (1043), Expect = e-145
Identities = 141/315 (44%), Positives = 204/315 (64%), Gaps = 6/315 (1%)

Query: 3 KQKIVVALGGNAIL--STDASAKAQQEALISTSKSLVKLIKEGHEVIVTHGNGPQVGNLL 60
+++V+ALGGNA+ S + + + T++ + ++I G+EV++THGNGPQVG+LL
Sbjct: 2 GKRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSLL 61

Query: 61 LQQAAADSEKN-PAMPLDTCVAMTEGSIGFWLVNALDNELQAQGIQKEVAAVVTQVIVDA 119
L A + PA P+D AM++G IG+ + AL NEL+ +G++K+V ++TQ IVD
Sbjct: 62 LHMDAGQATYGIPAQPMDVAGAMSQGWIGYMIQQALKNELRKRGMEKKVVTIITQTIVDK 121

Query: 120 KDPAFENPTKPIGPFLTEEDAKKQMAESGASFKEDAGRGWRKVVPSPKPVGIKEANVIRS 179
DPAF+NPTKP+GPF EE AK+ E G KED+GRGWR+VVPSP P G EA I+
Sbjct: 122 NDPAFQNPTKPVGPFYDEETAKRLAREKGWIVKEDSGRGWRRVVPSPDPKGHVEAETIKK 181

Query: 180 LVDSGVVVVSAGGGGVPVVEDATSKTLTGVEAVIDKDFASQTLSELVDADLFIVLTGVDN 239
LV+ GV+V+++GGGGVPV+ + + GVEAVIDKD A + L+E V+AD+F++LT V+
Sbjct: 182 LVERGVIVIASGGGGVPVILED--GEIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVNG 239

Query: 240 VYVNFNKPDQAKLEEVTVSQMKEYITQDQFAPGSMLPKVEAAIAFVENKPNAKAIITSLE 299
+ + + L EV V ++++Y + F GSM PKV AAI F+E +AII LE
Sbjct: 240 AALYYGTEKEQWLREVKVEELRKYYEEGHFKAGSMGPKVLAAIRFIEW-GGERAIIAHLE 298

Query: 300 NIDNVLSANAGTQII 314
L GTQ++
Sbjct: 299 KAVEALEGKTGTQVL 313


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_1537  LPSBIOSNTHSS  153    2e-50    Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature.

Length = 166

Score = 153 bits (388), Expect = 2e-50
Identities = 58/157 (36%), Positives = 94/157 (59%), Gaps = 2/157 (1%)

Query: 5 IGLYTGSFDPVTNGHLDIVKRASGLFDQIYVGIFDNPTKKSYFKLEVRKAMLTQALADFT 64
+Y GSFDP+T GHLDI++R LFDQ+YV + NP K+ F ++ R + +A+A
Sbjct: 2 NAIYPGSFDPITFGHLDIIERGCRLFDQVYVAVLRNPNKQPMFSVQERLEQIAKAIAHLP 61

Query: 65 NVIVVTSHERLAIDVAKELRVTHLIRGLRNATDFEYEENLEYFNHLLAPNIETVYLISRN 124
N V + E L ++ A++ + ++RGLR +DFE E + N LA ++ETV+L +
Sbjct: 62 NAQVDSF-EGLTVNYARQRQAGAILRGLRVLSDFELELQMANTNKTLASDLETVFLTTST 120

Query: 125 KWQALSSSRVRELIHFQSSLEGLVPQSVIAQV-EKMN 160
++ LSSS V+E+ F ++E VP V A + ++ +
Sbjct: 121 EYSFLSSSLVKEVARFGGNVEHFVPSHVAAALYDQFH 157


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_1535  NUCEPIMERASE  32     0.004    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 31.7 bits (72), Expect = 0.004
Identities = 13/76 (17%), Positives = 34/76 (44%), Gaps = 9/76 (11%)

Query: 50 LAQSLKTKKNQLVGLLLPDISNPFF-PRLARGAEEYLKEKGYRVMLGNISDSEALEE--- 105
+++ L +Q+VG+ D N ++ L + E L + G++ +++D E + +
Sbjct: 16 VSKRLLEAGHQVVGI---DNLNDYYDVSLKQARLELLAQPGFQFHKIDLADREGMTDLFA 72

Query: 106 --EYVHVLLQSNAAGI 119
+ V + + +
Sbjct: 73 SGHFERVFISPHRLAV 88


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_1532  PREPILNPTASE  30     0.005    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 30.2 bits (68), Expect = 0.005
Identities = 42/160 (26%), Positives = 58/160 (36%), Gaps = 25/160 (15%)

Query: 70 GLIIILWASMVHWVSASYCYLLLFSLLFSLF--DWRSQ------EYPFILWLFSFVSLLL 121
L+ + A + + LLL +L +L D P + F L
Sbjct: 118 ALLSVAVAMTLAPGWGTLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGG 177

Query: 122 FYSIN---------YLSLILLLLGLLAHLRPFSIGAGDFFYLASLALVLDLTSLIWLIQL 172
F S+ YL L L +G GDF LA+L L +L ++ L
Sbjct: 178 FVSLGDAVIGAMAGYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLL 237

Query: 173 ASLAGITACLLL-------GIKRIPFIPYLSFGLFWIVLL 205
+SL G + L K IPF PYL+ WI LL
Sbjct: 238 SSLVGAFMGIGLILLRNHHQSKPIPFGPYLAIA-GWIALL 276


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
SPy_1531  HELNAPAPROT  151    1e-49    Helicobacter neutrophil-activating protein A family ...
		>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature.

Length = 153

Score = 151 bits (383), Expect = 1e-49
Identities = 49/154 (31%), Positives = 85/154 (55%), Gaps = 4/154 (2%)

Query: 19 KKEASKNEKT--KAVLNQAVADLSVAASIVHQVHWYMRGPGFLYLHPKMDELLDSLNANL 76
K E +K +T + LN +++ + S +H+ HWY++GP F LH K +EL D +
Sbjct: 2 KTENAKTNQTLVENSLNTQLSNWFLLYSKLHRFHWYVKGPHFFTLHEKFEELYDHAAETV 61

Query: 77 DEMSERLITIGGAPYSTLAEFSKHSKLDEAKGTYDKTVAQHLARLVEVYLYLSSLYQVGL 136
D ++ERL+ IGG P +T+ E+++H+ + + + + ++ + LV Y +SS + +
Sbjct: 62 DTIAERLLAIGGQPVATVKEYTEHASITDGGN--ETSASEMVQALVNDYKQISSESKFVI 119

Query: 137 DITDEEGDAGTNDLFTAAKTEAEKTIWMLQAERG 170
+ +E D T DLF E EK +WML + G
Sbjct: 120 GLAEENQDNATADLFVGLIEEVEKQVWMLSSYLG 153


13  SPy_1494  SPy_1473  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SPy_1494  0       18       -3.626618  hypothetical protein
SPy_1493  -1      17       -4.258776  hypothetical protein
SPy_1492  1       17       -4.540900  hypothetical protein
SPy_1491  2       20       -4.103428  hypothetical protein
SPy_1489  2       23       -3.521316  histone-like DNA-binding protein
SPy_1488  2       21       -3.774785  putative integrase - phage associated
SPy_1487  1       23       -3.586707  hypothetical protein
SPy_1486  1       22       -2.850818  putative repressor - phage associated
SPy_1485  1       29       -2.506327  putative Cro-like repressor protein - phage
SPy_1484  2       31       -3.167967  putative excisionase
SPy_1483  3       33       -1.412807  hypothetical protein
SPy_1482  3       31       -1.951119  hypothetical protein
SPy_1481  3       27       -1.187521  hypothetical protein
SPy_1479  3       27       -1.165053  hypothetical protein
SPy_1478  2       27       -1.357255  hypothetical protein
SPy_1477  4       29       -0.609872  putative recombinase - phage associated
SPy_1476  4       27       -1.688811  hypothetical protein
SPy_1475  5       27       -1.526492  hypothetical protein phage associated
SPy_1474  4       30       -1.432176  hypothetical protein
SPy_1473  4       28       -2.199545  conserved hypothetical protein - phage
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_1489  DNABINDINGHU  124    5e-41    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 124 bits (312), Expect = 5e-41
Identities = 82/91 (90%), Positives = 87/91 (95%)

Query: 1 MANKQDLIAKVAEATELTKKDSAAAVDAVFSTIEAFLAEGEKVQLIGFGNFEVRERAARK 60
MANKQDLIAKVAEATELTKKDSAAAVDAVFS + ++LA+GEKVQLIGFGNFEVRERAARK
Sbjct: 1 MANKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARK 60

Query: 61 GRNPQTGAEIEIAASKVPAFKAGKALKDAVK 91
GRNPQTG EI+I ASKVPAFKAGKALKDAVK
Sbjct: 61 GRNPQTGEEIKIKASKVPAFKAGKALKDAVK 91


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_1486  SACTRNSFRASE  28     0.026    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 28.0 bits (62), Expect = 0.026
Identities = 13/37 (35%), Positives = 18/37 (48%)

Query: 119 KSEETEDYITDYVEGLVAAGLGAYQEDNLHMKVKLRS 155
K E +D YVE A Y E+N ++K+RS
Sbjct: 48 KQYEDDDMDVSYVEEEGKAAFLYYLENNCIGRIKIRS 84


14  SPy_1265  SPy_1259  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SPy_1265  1       25       -4.506106  hypothetical protein
SPy_1264  0       22       -5.296998  hypothetical protein
SPy_1263  0       21       -4.631060  hypothetical protein
SPy_1262  0       19       -3.191962  hypothetical protein
SPy_1261  2       16       -1.392335  hypothetical protein
SPy_1260  2       15       -1.009905  hypothetical protein
SPy_1259  2       16       -0.933691  putative transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
SPy_1264  PF06580  29     0.015    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 28.7 bits (64), Expect = 0.015
Identities = 19/109 (17%), Positives = 32/109 (29%), Gaps = 15/109 (13%)

Query: 19 LVGLVLLSVFGWVVGITGGYIYLPYSYRWLSWGMDSFPNLLDSALSYYYFWTALVLFVIT 78
++ + +S+ G V +T Y WL M A V+
Sbjct: 42 MIFNIAISLMGLV--LTHAYRSFIKRQGWLKLNMGQI---------ILRVLPACVVIG-- 88

Query: 79 FLALLVIILYPRIYTEVQLRHKNKKGTLLLKKSAIESYVATAIQTAGLM 127
+ + R+ + K TL L S I + V + L
Sbjct: 89 MVWFVANTSIWRLLAFIN--TKPVAFTLPLALSIIFNVVVVTFMWSLLY 135


15  SPy_1242  SPy_1236  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SPy_1242  -1      14       -3.477343  putative phosphate ABC transporter (ATP-binding
SPy_1241  0       16       -3.811744  phosphate ABC transporter (ATP-binding protein)
SPy_1240  1       17       -4.762216  putative phosphate uptake regulatory protein
SPy_1239  0       18       -4.895569  putative lysyl-aminopeptidase; aminopeptidase N
SPy_1237  1       22       -6.329258  putative response regulator
SPy_1236  1       22       -4.709796  putative histidine kinase protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
SPy_1237  HTHFIS  85     2e-21    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 85.3 bits (211), Expect = 2e-21
Identities = 34/119 (28%), Positives = 56/119 (47%), Gaps = 1/119 (0%)

Query: 2 IKILLVEDDLSLSNSIFDFLDD-FADVMQVFDGDEGLYEAESGIYDLILLDLMLPEKNGF 60
IL+ +DD ++ + L DV + +G DL++ D+++P++N F
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 61 QVLKELREKDIKIPVLIMTAKEGLDDKGHGFELGADDYLTKPFYLEELKMRIQALLKRT 119
+L +++ +PVL+M+A+ E GA DYL KPF L EL I L
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
SPy_1236  PF06580  39     2e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 39.1 bits (91), Expect = 2e-05
Identities = 15/75 (20%), Positives = 31/75 (41%), Gaps = 5/75 (6%)

Query: 312 YGKIFYFQNQVNRSLRMDKALLKQLITILFDNAIKY----TDKNGIIEIIVKTTDKNLLI 367
+ F+NQ+N ++ D + L+ L +N IK+ + G I + + + +
Sbjct: 236 FEDRLQFENQINPAIM-DVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTL 294

Query: 368 SVIDNGPGITDEEKK 382
V + G K+
Sbjct: 295 EVENTGSLALKNTKE 309


16  SPy_1122  SPy_1080  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SPy_1122  -1      15       -3.280894  putative iron-sulfur cofactor synthesis protein
SPy_1121  -1      19       -4.997719  hypothetical protein
SPy_1120  0       19       -3.996516  hypothetical protein
SPy_1119  1       21       -4.284836  hypothetical protein
SPy_1118  1       21       -4.084056  putative DNA repair protein
SPy_1117  2       20       -2.442005  hypothetical protein
SPy_1115  3       16       -0.656066  hypothetical protein
SPy_1114  2       14       0.093373   hypothetical protein
SPy_1113  1       15       0.498218   acid phosphatase/phosphotransferase
SPy_1111  1       15       0.489305   putative zinc-containing alcohol dehydrogenase
SPy_1110  2       15       0.355497   putative malic enzyme ((S)-malate:NAD+
SPy_1109  2       16       -0.049071  putative L-malate permease
SPy_1107  1       17       -0.677880  putative two-component sensor histidine kinase
SPy_1106  2       15       -0.442015  putative two-component response regulator
SPy_1105  2       15       -0.116891  putative spermidine/putrescine ABC transporter
SPy_1104  3       16       0.457760   putative spermidine/putrescine ABC transporter
SPy_1103  2       15       1.779833   putative spermidine / putrescine ABC transporter
SPy_1102  0       14       1.816788   putative spermidine / putrescine ABC transporter
SPy_1101  0       12       2.123864   putative UDP-N-acetylenolpyruvoylglucosamine
SPy_1100  -2      11       1.081742   hydroxymethylpterin pyrophosphokinase
SPy_1099  -1      11       1.125073   dihydroneopterin aldolase
SPy_1098  -2      11       0.748461   dihydropteroate synthase
SPy_1097  0       13       -1.658478  GTP cyclohydrolase
SPy_1096  0       14       -3.305173  putative folyl-polyglutamate synthetase
SPy_1094  2       18       -5.165288  hypothetical protein
SPy_1093  2       24       -6.079846  putative D,D-carboxypeptidase,
SPy_1088  4       27       -8.940192  putative repressor protein - phage associated
SPy_1087  3       27       -9.427569  conserved hypothetical protein - lantibiotics
SPy_1086  5       28       -8.938522  conserved hypothetical protein - lantibiotic
SPy_1085  6       26       -7.877780  ABC transporter (ATP-binding) - lantibiotic
SPy_1084  5       26       -7.095067  ABC transporter (ATP binding) - lantibiotic
SPy_1083  5       24       -6.047698  lantibiotic precursor
SPy_1082  6       21       -5.232089  putative histidine kinase - lantibiotic
SPy_1081  7       21       -1.778716  putative DNA binding regulatory protein -
SPy_1080  3       17       1.348592   protein involved in lantibiotic (srt)
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SPy_1119  SECA  33     0.001    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 32.5 bits (74), Expect = 0.001
Identities = 25/131 (19%), Positives = 43/131 (32%), Gaps = 16/131 (12%)

Query: 59 VDKIILIGGQNVDPKYYQEEKAAFDDDFSPERDTFE--LAIIKEAITLKKPILGICRGTQ 116
+ + + D E + + E E LA E K+ ++G +
Sbjct: 703 IPGLQERLKNDFDLDLPIAEWLDKEPELHEE-TLRERILAQSIEVYQRKEEVVG----AE 757

Query: 117 LMNVALGGNLNQHIDSHWQEAPSDFLSH--EMIIEPDSILYPIYGHKTLINSFHRQSLKT 174
+M G + Q +DS W+E H M I Y K + R+S
Sbjct: 758 MMRHFEKGVMLQTLDSLWKE-------HLAAMDYLRQGIHLRGYAQKDPKQEYKRESFSM 810

Query: 175 VAKDLKVIARD 185
A L+ + +
Sbjct: 811 FAAMLESLKYE 821


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
SPy_1106  HTHFIS  67     5e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 66.8 bits (163), Expect = 5e-15
Identities = 23/131 (17%), Positives = 50/131 (38%), Gaps = 2/131 (1%)

Query: 3 VLIIEDDPMVDFIHRNYLEKLNLFDRIISSDSMKAVQSILTDYAIDLILLDIHITDGNGI 62
+L+ +DD + + L + + + + + + DL++ D+ + D N
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGY--DVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 63 QFLEKWRTQHIPCEVIIISAANDGNIIRDGFHLGIIDYLIKPFTFERFQESIQQFVTHRE 122
L + + V+++SA N G DYL KPF I + + +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 123 HLANQQLEQAQ 133
++ + +Q
Sbjct: 124 RRPSKLEDDSQ 134


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits      Score  E-value  Comments
SPy_1105  MYCMG045  37     1e-04    Hypothetical mycoplasma lipoprotein (MG045) signature.
		>MYCMG045#Hypothetical mycoplasma lipoprotein (MG045) signature.

Length = 483

Score = 36.6 bits (84), Expect = 1e-04
Identities = 24/82 (29%), Positives = 42/82 (51%), Gaps = 4/82 (4%)

Query: 31 SGSQSDKLVIYNWGDYIDPALLKKFTKETGIEVQYETFDSNEAMYTKIKQGGTTYDIAVP 90
S S V+ N+ YI P LL++ + + + T+ SNE + TY +AV
Sbjct: 21 SSCGSTTFVLANFESYISPLLLER--VQEKHPLTFLTYPSNEKLINGF--ANNTYSVAVA 76

Query: 91 SDYTIDKMIKENLLNKLDKSKL 112
S Y + ++I+ +LL+ +D S+
Sbjct: 77 STYAVSELIERDLLSPIDWSQF 98


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_1097  LPSBIOSNTHSS  32     8e-04    Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature.

Length = 166

Score = 32.1 bits (73), Expect = 8e-04
Identities = 30/141 (21%), Positives = 55/141 (39%), Gaps = 13/141 (9%)

Query: 18 EKAEAAIYQFLEAIGENPNREGLLDTPKRVAKMYAEMFLGLGK---DPKEEFTAVFKEQH 74
E+ Q A+ NPN++ + +R+ + A+ L D E T + Q
Sbjct: 21 ERGCRLFDQVYVAVLRNPNKQPMFSVQERL-EQIAKAIAHLPNAQVDSFEGLTVNYARQR 79

Query: 75 EDVVIVKDISFYSICEHHLVPFYGKAHIA------YLPSDGRVTGL-SKLARAVEVASKR 127
+ I++ + S E L +A +L + + L S L + EVA
Sbjct: 80 QAGAILRGLRVLSDFELELQMANTNKTLASDLETVFLTTSTEYSFLSSSLVK--EVARFG 137

Query: 128 PQLQERLTSQIADALVEALNP 148
++ + S +A AL + +P
Sbjct: 138 GNVEHFVPSHVAAALYDQFHP 158


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
SPy_1093  BLACTAMASEA  33     0.001    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 33.2 bits (76), Expect = 0.001
Identities = 22/85 (25%), Positives = 34/85 (40%), Gaps = 11/85 (12%)

Query: 55 AIASLTKLVTAYLVLDKVKSGQLQLSDQVNLSDYAFELTKDRSLSNVPFDKK----TYSV 110
+ S K+V VL +V +G QL ++ + + P +K +V
Sbjct: 63 PMMSTFKVVLCGAVLARVDAGDEQLERKI-------HYRQQDLVDYSPVSEKHLADGMTV 115

Query: 111 QDLLTATLVASSNSAAIALAEKVAG 135
+L A + S NSAA L V G
Sbjct: 116 GELCAAAITMSDNSAANLLLATVGG 140


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_1086  ANTHRAXTOXNA  31     0.003    Anthrax toxin LF subunit signature.
		>ANTHRAXTOXNA#Anthrax toxin LF subunit signature.

Length = 800

Score = 31.3 bits (70), Expect = 0.003
Identities = 17/47 (36%), Positives = 29/47 (61%), Gaps = 7/47 (14%)

Query: 184 NKWYLFPYDWSLKLLEPMTRMRINSIPFGAEFVPDYSQIFISLFLGI 230
NK Y+ +W+ +P+T+ +IN+IP AEF+ + S I S +G+
Sbjct: 639 NKAYI---EWT----DPITKAKINTIPTSAEFIKNLSSIRRSSNVGV 678


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits   Score  E-value  Comments
SPy_1083  NISIN  27     0.001    Nisin signature.
		>NISIN#Nisin signature.

Length = 57

Score = 26.7 bits (58), Expect = 0.001
Identities = 17/32 (53%), Positives = 23/32 (71%), Gaps = 2/32 (6%)

Query: 4 TIKDFDLDL-KTNKKDT-ATPYVGSRYLCTPG 33
+ KDF+LDL +KKD+ A+P + S LCTPG
Sbjct: 2 STKDFNLDLVSVSKKDSGASPRITSISLCTPG 33


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
SPy_1081  HTHFIS  74     2e-17    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 73.7 bits (181), Expect = 2e-17
Identities = 30/130 (23%), Positives = 57/130 (43%), Gaps = 1/130 (0%)

Query: 3 KILAIDDDKEILKLMKTALEIENYHVITCQEIELPIVFDDFKGYDLILLDIMMPNISGTE 62
IL DDD I ++ AL Y V + DL++ D++MP+ + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 63 FCYKIREE-VHSPIIFVSALDGDNEIVQALNIGGDDFIVKPFSLKQFVAKVNSHLKREER 121
+I++ P++ +SA + ++A G D++ KPF L + + + L +R
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 122 AKIKNEAEER 131
K E + +
Sbjct: 125 RPSKLEDDSQ 134


17  SPy_1011  SPy_0942  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SPy_1011  2       14       -3.277432  hypothetical protein
SPy_1010  1       15       -3.095106  putative
SPy_1008  3       17       -3.659429  streptococcal exotoxin H precursor
SPy_1007  5       18       -0.114461  streptococcal exotoxin I
SPy_1006  4       18       1.611769   putative lysin - phage associated
SPy_1002  2       23       2.586337   hypothetical protein
SPy_1001  2       22       2.019423   hypothetical protein
SPy_0999  1       19       2.888542   hypothetical protein
SPy_0998  1       19       2.800698   hypothetical protein
SPy_0997  0       17       2.452693   hyaluronidase - phage associated
SPy_0996  1       18       2.189165   hypothetical protein
SPy_0995  0       19       1.658783   hypothetical protein
SPy_0994  2       20       2.260641   putative minor tail protein - phage associated
SPy_0993  0       19       -1.502377  hypothetical protein
SPy_0992  1       22       -1.033842  hypothetical protein
SPy_0991  -1      23       -1.070943  putative structural protein - phage associated
SPy_0989  0       23       0.247085   hypothetical protein
SPy_0988  0       23       0.193768   hypothetical protein
SPy_0987  3       25       0.777415   conserved hypothetical protein - phage 370.2
SPy_0986  4       25       0.540469   hypothetical protein
SPy_0985  3       25       0.375620   hypothetical protein
SPy_0984  4       25       0.310102   hypothetical protein
SPy_0982  5       24       -1.015745  putative structural protein - phage associated
SPy_0981  6       26       -1.031696  hypothetical protein
SPy_0980  2       21       -1.657354  putative antirepressor - phage associated
SPy_0979  0       20       -1.541244  hypothetical protein
SPy_0978  0       19       -1.168425  hypothetical protein
SPy_0977  0       19       -2.294303  hypothetical protein
SPy_0976  0       19       -3.782233  hypothetical protein
SPy_0975  -1      18       -3.366649  hypothetical protein
SPy_0972  1       18       -4.019032  putative terminase, large subunit - phage
SPy_0971  -1      23       -5.255195  putative terminase, small subunit - phage
SPy_0970  1       25       -5.392394  hypothetical protein
SPy_0968  0       23       -4.071195  hypothetical protein
SPy_0967  2       25       -0.695220  hypothetical protein
SPy_0965  2       23       -0.561079  hypothetical protein
SPy_0963  1       21       -0.196024  hypothetical protein
SPy_0962  1       19       -0.686833  hypothetical protein
SPy_0961  2       20       -0.252238  hypothetical protein
SPy_0960  2       20       -0.103674  hypothetical protein
SPy_0959  2       21       -0.697843  hypothetical protein
SPy_0958  2       25       -1.278968  hypothetical protein
SPy_0957  3       28       -2.720862  hypothetical protein
SPy_0956  4       23       -2.030079  hypothetical protein
SPy_0954  4       20       -2.845662  hypothetical protein
SPy_0953  3       20       -1.725257  hypothetical protein
SPy_0952  1       18       -1.545609  hypothetical protein
SPy_0949  1       17       -1.779763  hypothetical protein
SPy_0948  1       17       -1.364513  hypothetical protein
SPy_0947  1       18       -3.995954  hypothetical protein
SPy_0946  1       18       -3.997250  putative P1-type antirepressor - phage
SPy_0945  1       19       -4.097514  hypothetical protein
SPy_0944  1       19       -4.486833  hypothetical protein
SPy_0943  1       21       -3.240026  hypothetical protein
SPy_0942  2       20       -2.358060  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
SPy_1008  BACTRLTOXIN  93     7e-25    Bacterial toxin signature.
		>BACTRLTOXIN#Bacterial toxin signature.

Length = 266

Score = 93.1 bits (231), Expect = 7e-25
Identities = 56/218 (25%), Positives = 96/218 (44%), Gaps = 24/218 (11%)

Query: 37 TTNRHNLESLYKHDSNLIEADSIKNSPDIVTSHMLKYSVKDKNLSVF------FEKDWIS 90
T N++ LY D + + A +K S D +H L Y++ DK L + + ++
Sbjct: 45 TGTMGNMKYLY--DDHYVSATKVK-SVDKFLAHDLIYNISDKKLKNYDKVKTELLNEDLA 101

Query: 91 QEFKDKEVDIYAL---------SAQEVCECPGKRYEAFGGITLTN----SEKKEIKVPVN 137
+++KD+ VD+Y S V + G + +GGIT V V
Sbjct: 102 KKYKDEVVDVYGSNYYVNCYFSSKDNVGKVTGGKTCMYGGITKHEGNHFDNGNLQNVLVR 161

Query: 138 VWDKSKQQPPMFITVNKPKVTAQEVDIKVRKLLIKKYDIYNNREQKYSKGTVTLDLNSGK 197
V++ + + +K VTAQE+DIK R LI K ++Y Y G + N+G
Sbjct: 162 VYENKRNTISFEVQTDKKSVTAQELDIKARNFLINKKNLYEFNSSPYETGYIKFIENNGN 221

Query: 198 DIVFDLYYFGNGDF--NSMLKIYSNNERIDSTQFHVDV 233
+D+ F + L +Y++N+ +DS ++V
Sbjct: 222 TFWYDMMPAPGDKFDQSKYLMMYNDNKTVDSKSVKIEV 259


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
SPy_1007  BACTRLTOXIN  111    3e-32    Bacterial toxin signature.
		>BACTRLTOXIN#Bacterial toxin signature.

Length = 266

Score = 111 bits (280), Expect = 3e-32
Identities = 65/234 (27%), Positives = 101/234 (43%), Gaps = 35/234 (14%)

Query: 8 NLRNLYSTYDPTEVKGKINEGPPFSGSLFYK--NIPYGNSSIELKVELNSVEKANFFSGK 65
N++ LY + + K K + + L Y + N ++K EL + + A + +
Sbjct: 50 NMKYLYDDHYVSATKVK-SVDKFLAHDLIYNISDKKLKNYD-KVKTELLNEDLAKKYKDE 107

Query: 66 RVDIFTLEYSPPCNSNIKKNS----------YGGITLSDGNRID---KKNIPVNIFIDGV 112
VD++ Y C + K N YGGIT +GN D +N+ V ++ +
Sbjct: 108 VVDVYGSNYYVNCYFSSKDNVGKVTGGKTCMYGGITKHEGNHFDNGNLQNVLVRVYENKR 167

Query: 113 QQKYSYTDISTVSTDKKEVTIQELDVKSRYYLQKHFNIYGFGDVKDFGRSSRFQSGFEEG 172
T V TDKK VT QELD+K+R +L N+Y F S +E G
Sbjct: 168 N-----TISFEVQTDKKSVTAQELDIKARNFLINKKNLYEFNS-----------SPYETG 211

Query: 173 NIIFHLNSGERISYNLFDT--GHGDRESMLKKYSDNKTAYSDQLHIDIYLVKFN 224
I F N+G Y++ D+ L Y+DNKT S + I+++L N
Sbjct: 212 YIKFIENNGNTFWYDMMPAPGDKFDQSKYLMMYNDNKTVDSKSVKIEVHLTTKN 265


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
SPy_0997  PF07212  549    0.0      Hyaluronoglucosaminidase
		>PF07212#Hyaluronoglucosaminidase

Length = 336

Score = 549 bits (1416), Expect = 0.0
Identities = 286/373 (76%), Positives = 306/373 (82%), Gaps = 38/373 (10%)

Query: 1 MAENIPLRVQFKRMKAAEWASSDVVLLEGEIGFETDTGFAKFGDGQNTFSKLKYLTGPKG 60
M E IPLRVQFKRM A EW SDV+LLE EIGFETDTG+AKFGDG+N FSKLKYL
Sbjct: 1 MTETIPLRVQFKRMTAEEWTRSDVILLESEIGFETDTGYAKFGDGKNQFSKLKYL----- 55

Query: 61 PKGDTGLQGKTGGTGSRGPAGKPGTTDYDQLQNKPDLGAFAQKEETNSKITKLESSKADK 120
NKPDLGAFAQKEETNSKITKLESSKADK
Sbjct: 56 --------------------------------NKPDLGAFAQKEETNSKITKLESSKADK 83

Query: 121 NAVYLKAESNAKLDEKLNLKGGVMTGQLQFKPN-SGIKPSSSVGGAINIDMSKSEGAAMV 179
NAVYLKAES +LD+KLNLKGGVMTGQLQFKPN SGIKPSSSVGGAINIDMSKSEGA +V
Sbjct: 84 NAVYLKAESKIELDKKLNLKGGVMTGQLQFKPNKSGIKPSSSVGGAINIDMSKSEGAGVV 143

Query: 180 MYTNKDTTDGPLMILRSNKDTFDQSVQFVDYKGTTNAVNIVMRQPTTPNFSSALNITSAN 239
+Y+N DT+DGPLM LR+ K+TF+QS FVDY G TNAVNI MRQPTTPNFSSALNITS N
Sbjct: 144 VYSNNDTSDGPLMSLRTGKETFNQSALFVDYSGKTNAVNIAMRQPTTPNFSSALNITSGN 203

Query: 240 EGGSAMQIRGVEKALGTLKITHENPSVDKEYDKNAAALSIDIVKKQKGGKGTAAQGIYIN 299
E GSAMQIRGVEKALGTLKITHENP+V+ YD+NAAALSIDIVKKQKGGKGTAAQGIYIN
Sbjct: 204 ENGSAMQIRGVEKALGTLKITHENPNVEANYDENAAALSIDIVKKQKGGKGTAAQGIYIN 263

Query: 300 STSGTTGKLLRIRNLNDDKFYVKPDGGFYAKETSQIDGNLKLKDPIANDHAATKAYVDGE 359
STSGTTGKLLRIRNL DDKFYVK DGGFYAK+TSQIDGNLKLK+P A+DHAATKAYVD E
Sbjct: 264 STSGTTGKLLRIRNLGDDKFYVKHDGGFYAKKTSQIDGNLKLKNPTADDHAATKAYVDSE 323

Query: 360 VEKLKALLAAKQM 372
V+KLKALL KQ+
Sbjct: 324 VKKLKALLMDKQV 336


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_0994  RTXTOXINA     33     0.006    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 33.4 bits (76), Expect = 0.006
Identities = 58/305 (19%), Positives = 108/305 (35%), Gaps = 30/305 (9%)

Query: 679 LGTAFEGFGNGVKSALEGVGAVIESFGSAVRNVLDGVANILDSMGTAALNAGRGVK-EMA 737
G G + L G ++ +F + + L + +D + + G E+A
Sbjct: 124 AGNILGGGAENIGDNLGKAGGILSTFQNFLGTALSSMK--IDELIKKQKSGGNVSSSELA 181

Query: 738 K-GIKMLVDL--SLGDLVATLAAVASGLGKMASSAGEMTTLGSAMSKVAN--GMTRLATS 792
K I+++ L ++ L + + + L + S L +K+ N + +
Sbjct: 182 KASIELINQLVDTVASLNNNVNSFSQQLNTLGSVLSNTKHLNGVGNKLQNLPNLDNIGAG 241

Query: 793 ATIAITGLTVFATTMATIKTAVATLPPVLTMAASGFTTFTTQAVAAVTGLAAINAPITMF 852
++G+ + + A A T AA+G TT+ + V +
Sbjct: 242 LDT-VSGILSAISASFILSNADA---DTRTKAAAGVE-LTTKVLGNVGKGISQYIIAQRA 296

Query: 853 KAQLMTITPALAQAGAGFAAF--------VAQSSTFSTGLASAGPTIAAFNANLMSLSAT 904
L T A + +A + + + SL A
Sbjct: 297 AQGLSTSAAAAGLIASAVTLAISPLSFLSIADKFKRANKIEEYSQRFKKLGYDGDSLLAA 356

Query: 905 ----TGVLVASIAGLSAVLSVVSAGFSQIGASATATVGQ-IQAFASSTTVVSSAF--ASM 957
TG + AS+ +S VL+ VS+G S A+ T+ VG + A + T + S AS
Sbjct: 357 FHKETGAIDASLTTISTVLASVSSGIS--AAATTSLVGAPVSALVGAVTGIISGILEASK 414

Query: 958 QSMIQ 962
Q+M +
Sbjct: 415 QAMFE 419



Score = 30.7 bits (69), Expect = 0.040
Identities = 39/241 (16%), Positives = 90/241 (37%), Gaps = 29/241 (12%)

Query: 275 IEAIGKQLDKVD-FSKFASNLGKFLEGINIDKIVSNISSAISSVTSKVKEFWGGFKQTGA 333
++ + + V+ FS+ + LG L ++ V +K++
Sbjct: 192 VDTVASLNNNVNSFSQQLNTLGSVLSNT----------KHLNGVGNKLQNLPNLDNIGAG 241

Query: 334 ISAFSGALKSVWGAL----KNVASAMSGGSWKNFGS-IVGGIVKHVSNFAKAIADVVGKM 388
+ SG L ++ + + + + + ++G + K +S + A G
Sbjct: 242 LDTVSGILSAISASFILSNADADTRTKAAAGVELTTKVLGNVGKGISQYIIAQRAAQGLS 301

Query: 389 EPGRLQSWIATFAAVGGGLKLFEKLTGQSVVGSFLDKISTKFGLFGKKAKEGTDQAANGS 448
IA+ + F + + + +++ S +F G +G A
Sbjct: 302 TSAAAAGLIASAVTLAISPLSFLSIADKFKRANKIEEYSQRFKKLGY---DGDSLLAAFH 358

Query: 449 RKSGGIISQIFNGLGNIVKSAGTAISTAAKGIGTGIKTALSGAPPIISSLGTAISTVAQG 508
+++G I + + + T +++ + GI T+L GAP +S+L A++ + G
Sbjct: 359 KETGAIDAS--------LTTISTVLASVSSGISAAATTSLVGAP--VSALVGAVTGIISG 408

Query: 509 I 509
I
Sbjct: 409 I 409


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_0981  IGASERPTASE   27     0.038    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 27.3 bits (60), Expect = 0.038
Identities = 33/162 (20%), Positives = 62/162 (38%), Gaps = 10/162 (6%)

Query: 2 AEETQTVETVEEQVVPEAKQPQ-DEKKYTDA-------DVDAIIDKKFAKWKSEQEAEKS 53
A ++T ETV E E+K + +E+ T+ +A + K +E S
Sbjct: 1031 ATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGS 1090

Query: 54 EAKKMAKMNEKEKADYEKQKLLDELQELKNDKTRNELTAVARQMFAESEINVNDDVLGLV 113
E K+ KE A EK++ E + + +Q +E+ +
Sbjct: 1091 ETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPAREND 1150

Query: 114 VTLDAE--QTKANVTTLANAFAKVIADDRKALVRQTTPSTGG 153
T++ + Q++ N T AK + + + V ++T G
Sbjct: 1151 PTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTG 1192


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_0975  TYPE4SSCAGX   31     0.014    Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein signature.

Length = 522

Score = 30.9 bits (69), Expect = 0.014
Identities = 51/219 (23%), Positives = 90/219 (41%), Gaps = 17/219 (7%)

Query: 50 YQRYADKEK--IDLSEARKRASELDISAYQKKAKELVAKAEK----LRREGKIVTRDDFT 103
YQ + +K +D + ++ + +K+AKE KA+K R+E + R +
Sbjct: 122 YQEFLKTKKLIVDAPDPKELEEQKKALEKEKEAKEQAQKAQKDKREKRKEERAKNRANLE 181

Query: 104 HQENADMSIYNLAMKTNALELLRLNIDLE---------MQELANGEHKLTKKFLDEGYRK 154
+ NA + NL+ N EL++ + E MQE A + L++ +
Sbjct: 182 NLTNAMSNPQNLSNNKNLSELIKQQRENELDQMERLEDMQEQAQANALKQIEELNKKQAE 241

Query: 155 ETEFQAGLLGLSVASQASVKSLADAVINANFKGAKWSDNIWDRQDK-LRSIISQSVQSAI 213
E Q +S+ + S KS D I + + W N+ R +K L I + Q
Sbjct: 242 EAVRQRAKDKISIKTDKSQKSPEDNSIELSPSDSAWRTNLVVRTNKALYQFILRIAQKDN 301

Query: 214 LKGKNGLTIARDIRREFDVSASYAKRLAITEHARVQMEV 252
LT+ + + +VS+ + L E A+ Q E+
Sbjct: 302 FASAY-LTVKLEYPQRHEVSSVIEEELKKREEAKRQREL 339


18  SPy_0918  SPy_0904  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
SPy_0918  5       19       0.328701    putative exfoliative toxin
SPy_0917  8       25       -0.279586   hypothetical protein
SPy_0916  6       22       -0.010933   hypothetical protein
SPy_0915  4       17       0.363408    hypothetical protein
SPy_0914  2       14       1.175374    hypothetical protein
SPy_0913  1       12       0.846500*   30S ribosomal protein S1
SPy_0912  -3      9        0.817469**  hypothetical protein
SPy_0911  -3      9        1.253949    putative branched-chain-amino-acid
SPy_0910  -2      11       0.982822    putative DNA topoisomerase IV (subunit C)
SPy_0909  -2      12       0.523547    putative DNA topoisomerase IV (subunit B)
SPy_0908  1       13       0.480318    hypothetical protein
SPy_0907  1       11       1.195484    putative dihydroorotase
SPy_0905  3       13       1.203715    putative uracil DNA glycosylase
SPy_0904  2       12       0.530018    ABC transporter permease
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_0907  UREASE        37     1e-04    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 37.4 bits (87), Expect = 1e-04
Identities = 21/81 (25%), Positives = 30/81 (37%), Gaps = 20/81 (24%)

Query: 20 ADVLIDGKQIVKIASA-----------IECQEAQVIDASGLIVAPGLVDIHVHFREPGQT 68
AD+ + +I I A I +VI G IV G +D H+HF P Q
Sbjct: 86 ADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGKIVTAGGMDSHIHFICPQQ- 144

Query: 69 HKEDIHTGALAAAAGGVTTVV 89
A G+T ++
Sbjct: 145 --------IEEALMSGLTCML 157


19  SPy_0850  SPy_0840  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
SPy_0850  -2      15       -3.223727   putative thioredoxin reductase
SPy_0849  -1      14       -3.516777   putative tRNA (guanine-N1)-methyltransferase
SPy_0847  0       15       -4.173192   putative 16S rRNA processing protein
SPy_0846  0       16       -4.153500   hypothetical protein
SPy_0845  0       17       -4.660216   putative cation-efflux system membrane protein
SPy_0844  0       17       -4.370261   hypothetical protein
SPy_0843  0       17       -4.441721   hypothetical protein
SPy_0841  0       19       -4.659304   hypothetical protein
SPy_0840  -1      18       -3.345274   30S ribosomal protein S16
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_0846  HTHTETR       51     2e-10    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 51.2 bits (122), Expect = 2e-10
Identities = 13/64 (20%), Positives = 30/64 (46%)

Query: 5 RQIKKTKTAIYSAFIALLQKKEYSKITVRDMITLANVGRSTFYAHYESKEMLLKELCEEL 64
++ ++T+ I + L ++ S ++ ++ A V R Y H++ K L E+ E
Sbjct: 7 QEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELS 66

Query: 65 FHHL 68
++
Sbjct: 67 ESNI 70


20  SPy_0806  SPy_0792  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias     Product
SPy_0806  3       24       0.273903    50S ribosomal protein L20
SPy_0805  0       20       -1.391262   50S ribosomal protein L35
SPy_0804  -1      16       -1.264611   putative translation initiation factor 3 (IF3)
SPy_0803  2       16       -3.827175   putative cytidylate kinase
SPy_0802  2       18       -5.800832   hypothetical protein
SPy_0801  2       17       -6.713556   ferredoxin
SPy_0800  1       18       -6.452506   putative pore-forming peptide
SPy_0799  1       20       -6.604637   putative tripeptidase
SPy_0798  0       21       -7.772175   hypothetical protein
SPy_0797  -1      22       -7.382253   hypothetical protein
SPy_0796  -1      20       -6.529387   hypothetical protein
SPy_0794  -1      19       -6.098895   putative glycosyl transferase
SPy_0793  -2      17       -5.820167   hypothetical protein
SPy_0792  -3      15       -4.395791   conserved hypothetical protein - possibly
21  SPy_0749  SPy_0731  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
SPy_0749  -1      15       -3.831170   hypothetical protein
SPy_0747  -1      16       -3.795373   hypothetical protein
SPy_0746  0       20       -4.800799   putative ABC transporter (ATP-binding protein)
SPy_0745  0       20       -4.292532   ABC transporter (ATP-binding protein) -
SPy_0744  0       19       -3.926026   putative ABC transporter (ATP-binding protein)
SPy_0743  1       20       -3.876234   hypothetical protein
SPy_0742  5       16       -2.077782   hypothetical protein
SPy_0741  5       16       -1.798395   hypothetical protein
SPy_0740  6       18       -1.471866   streptolysin S associated ORF
SPy_0739  6       25       -0.640418   streptolysin S associated ORF
SPy_0738  8       27       -0.399291   streptolysin S associated protein
SPy_0737  7       21       -0.711793   putative extracellular matrix binding protein
SPy_0733  3       15       -0.110879   hypothetical protein
SPy_0732  2       13       -0.166081   hypothetical protein
SPy_0731  3       16       0.018675    putative enolase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_0746  LIPPROTEIN48  30     0.013    Mycoplasma P48 major surface lipoprotein signature.
		>LIPPROTEIN48#Mycoplasma P48 major surface lipoprotein signature.

Length = 428

Score = 30.4 bits (68), Expect = 0.013
Identities = 28/122 (22%), Positives = 50/122 (40%), Gaps = 11/122 (9%)

Query: 15 KKTSYVTFFLMPILTTLLALSLSFSNNNQAKIGILDKDNSQISKQFIAQLKQNKKYDIFT 74
KK+ + L PI L A+++S NN+++ I +KD S+ + + K ++
Sbjct: 2 KKSKKILLGLSPIAAILPAVAVSCGNNDESNISFKEKDISKYTTTNANGKQVVKNAELLK 61

Query: 75 KIKKEHI--DHYLQDKSL-----EAVLTIDKGFS-DKVLQGKSQKL--NIRSIANSEITE 124
+K I + + DKS EA+ I+K + S S ++
Sbjct: 62 -LKPVLITDEGKIDDKSFNQSAFEALKAINKQTGIEINNVEPSSNFESAYNSALSAGHKI 120

Query: 125 WV 126
WV
Sbjct: 121 WV 122


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_0743  TYPE3IMSPROT  31     0.003    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 30.9 bits (70), Expect = 0.003
Identities = 15/76 (19%), Positives = 32/76 (42%), Gaps = 1/76 (1%)

Query: 37 SYQDFLDVLLSLFQFVVIILVLFFYSATINLGEVLTFLTQTSWHWQILCYLVLYLMAIIE 96
S + ++ L S+ + V++ ++++ NL +L T L +L + +I
Sbjct: 133 SIKSLVEFLKSILKVVLLSILIWII-IKGNLVTLLQLPTCGIECITPLLGQILRQLMVIC 191

Query: 97 MTLLVLILIFDVLLQK 112
V+I I D +
Sbjct: 192 TVGFVVISIADYAFEY 207


22  SPy_0710  SPy_0682  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
SPy_0710  4       26       3.675957    conserved hypothetical protein, phage
SPy_0707  4       28       3.405423    putative holin, phage associated
SPy_0706  3       29       3.423399    hypothetical protein
SPy_0705  3       28       3.353472    hypothetical protein
SPy_0703  2       23       3.528395    hypothetical protein
SPy_0702  2       23       3.548978    hypothetical protein
SPy_0701  0       21       2.850424    hyaluronidase, phage associated
SPy_0700  0       20       2.641271    hypothetical protein
SPy_0698  1       18       2.108222    hypothetical protein
SPy_0697  1       19       2.590369    hypothetical protein
SPy_0696  2       26       0.543822    conserved hypothetical protein, phage
SPy_0695  1       26       1.020629    hypothetical protein
SPy_0694  1       22       2.119747    putative major tail shaft protein, phage
SPy_0693  0       25       1.662827    putative minor capsid protein, phage associated
SPy_0691  0       23       1.872141    hypothetical protein
SPy_0690  0       22       0.562812    hypothetical protein
SPy_0689  2       20       0.534000    hypothetical protein
SPy_0688  4       22       0.919582    putative major head protein, phage associated
SPy_0686  4       21       0.647214    hypothetical protein
SPy_0685  5       21       1.089696    hypothetical protein
SPy_0684  3       19       0.691920    hypothetical protein
SPy_0683  3       20       1.177289    putative minor capsid protein, phage associated
SPy_0682  3       21       0.864916    putative minor capsid protein, phage associated
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_0710  FLGFLGJ       91     9e-23    Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 91.3 bits (226), Expect = 9e-23
Identities = 44/125 (35%), Positives = 63/125 (50%), Gaps = 8/125 (6%)

Query: 23 SLTAAQAILESGWGKHA-------PHNALFGIKADSSWTGKSFDTKTQEEYQAGVVTDIV 75
L AQA LESGWG+ P LFG+KA +W G + T E Y+ G +
Sbjct: 172 HLILAQAALESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTE-YENGEAKKVK 230

Query: 76 DRFRAYDSWDESIADHGQFLVDNPRYEAVIGETDYKKACYAIKAAGYATASSYVELLIQL 135
+FR Y S+ E+++D+ L NPRY AV ++ A++ AGYAT Y L +
Sbjct: 231 AKFRVYSSYLEALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNM 290

Query: 136 IEEND 140
I++
Sbjct: 291 IQQMK 295


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_0702  CHANLCOLICIN  29     0.048    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 29.3 bits (65), Expect = 0.048
Identities = 30/157 (19%), Positives = 64/157 (40%), Gaps = 17/157 (10%)

Query: 182 QAEIKASAQGLSQKYDDELRKLSAKITTTSSGTTEAYESKLAGLRAEFTR-----SNQGT 236
QA+ KA+ L+Q+ D + + + + TE + A ++AE R + +
Sbjct: 80 QAKAKANRDALTQRLKDIVNEALRHNASRTPSATELAHANNAAMQAEDERLRLAKAEEKA 139

Query: 237 RTELESQISGLR----------AVQQSTASQI--SQEIRDREGAVSRVQQSLESYQRRMQ 284
R E E+ + + T Q+ ++ R A+S +++E Q+++
Sbjct: 140 RKEAEAAEKAFQEAEQRRKEIEREKAETERQLKLAEAEEKRLAALSEEAKAVEIAQKKLS 199

Query: 285 DAEENYSSLTHTVRGLQSDVGSPTGKIQSRLTQLAGQ 321
A+ + ++ L S + S + + LAG+
Sbjct: 200 AAQSEVVKMDGEIKTLNSRLSSSIHARDAEMKTLAGK 236


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_0701  PF07212       506    0.0      Hyaluronoglucosaminidase
		>PF07212#Hyaluronoglucosaminidase

Length = 336

Score = 506 bits (1304), Expect = 0.0
Identities = 259/343 (75%), Positives = 287/343 (83%), Gaps = 15/343 (4%)

Query: 1 MSENIPLRVQFKRMKAAEWARSDVILLESEIGFETDTGFARAGDGHNRFSDLGYISPLDY 60
M+E IPLRVQFKRM A EW RSDVILLESEIGFETDTG+A+ GDG N+FS L Y+
Sbjct: 1 MTETIPLRVQFKRMTAEEWTRSDVILLESEIGFETDTGYAKFGDGKNQFSKLKYL----- 55

Query: 61 NLLTNKPNIDGLATKVETAQKLQQ----KADKETVYTKAESKQELDKKLNLKGGVMTGQL 116
NKP++ A K ET K+ + KADK VY KAESK ELDKKLNLKGGVMTGQL
Sbjct: 56 ----NKPDLGAFAQKEETNSKITKLESSKADKNAVYLKAESKIELDKKLNLKGGVMTGQL 111

Query: 117 KFKPAAT-VAYSSSTGGAVNIDLSSTRGAGVVVYSDNDTSDGPLMSLRTGKETFNQSALF 175
+FKP + + SSS GGA+NID+S + GAGVVVYS+NDTSDGPLMSLRTGKETFNQSALF
Sbjct: 112 QFKPNKSGIKPSSSVGGAINIDMSKSEGAGVVVYSNNDTSDGPLMSLRTGKETFNQSALF 171

Query: 176 VDYKGTTNAVNIAMRQPTTPNFSSALNITSGNENGSAMQLRGSEKALGTLKITHENPSIG 235
VDY G TNAVNIAMRQPTTPNFSSALNITSGNENGSAMQ+RG EKALGTLKITHENP++
Sbjct: 172 VDYSGKTNAVNIAMRQPTTPNFSSALNITSGNENGSAMQIRGVEKALGTLKITHENPNVE 231

Query: 236 ADYDKNAAALSIDIVKKTNGA-GTAAQGIYINSTSGTTGKLLRIRNLSDDKFYVKSDGGF 294
A+YD+NAAALSIDIVKK G GTAAQGIYINSTSGTTGKLLRIRNL DDKFYVK DGGF
Sbjct: 232 ANYDENAAALSIDIVKKQKGGKGTAAQGIYINSTSGTTGKLLRIRNLGDDKFYVKHDGGF 291

Query: 295 YAKETSQIDGNLKLKDPTANDHAATKAYVDKAISELKKLILKK 337
YAK+TSQIDGNLKLK+PTA+DHAATKAYVD + +LK L++ K
Sbjct: 292 YAKKTSQIDGNLKLKNPTADDHAATKAYVDSEVKKLKALLMDK 334


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_0697  GPOSANCHOR    49     7e-08    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 49.3 bits (117), Expect = 7e-08
Identities = 49/287 (17%), Positives = 98/287 (34%), Gaps = 29/287 (10%)

Query: 454 LTKESDETKKLKKEQEGLVESNKQLRDSVREGVQERKKGLESVKESTAAHQKLADEIIKL 513
T +S + K L+ E+ L L + + + +A + L E L
Sbjct: 136 STADSAKIKTLEAEKAALAARKADLE-------KALEGAMNFSTADSAKIKTLEAEKAAL 188

Query: 514 AAKENKTAGEKQNLKNKIDQLNGSIDGLNLAYDKNSNSLSHNADQIKSRISAMEAESTWQ 573
A++ + + N + I L + + ++ + S
Sbjct: 189 EARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKA----DLEKALEGAMNFS--- 241

Query: 574 TAQQNLLNIEQKRSEVSKKLAENAELRKKWNEEANVSDSVRKEKIAELTEEEGKLKNMQT 633
+ I+ +E + A AEL K E A + KI L E+ L+ +
Sbjct: 242 --TADSAKIKTLEAEKAALEARQAELEKA-LEGAMNFSTADSAKIKTLEAEKAALEAEKA 298

Query: 634 QLQEEYNKTSATQQAAADAMAAAEESGSARQVIAYENMSEAQRTAIDNMRTKYSELLETT 693
L+ + +A +Q+ + A+ E + +Q+ A E Q + R L+ +
Sbjct: 299 DLEHQSQVLNANRQSLRRDLDASRE--AKKQLEAEHQKLEEQNKISEASRQSLRRDLDAS 356

Query: 694 TSIFDAIE----------QKTALSVEQMNANLEKNRAATEQWATNLE 730
+E + + S + + +L+ +R A +Q LE
Sbjct: 357 REAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQVEKALE 403


23  SPy_0667  SPy_0658  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
SPy_0667  4       16       0.246590    hypothetical protein
SPy_0666  3       16       -0.315580   hypothetical protein
SPy_0665  3       19       0.251369    hypothetical protein
SPy_0664  3       21       0.240439    hypothetical protein
SPy_0663  4       24       -3.502876   hypothetical protein
SPy_0661  3       32       -5.982241   hypothetical protein
SPy_0660  2       27       -6.379561   hypothetical protein
SPy_0659  0       22       -4.669840   hypothetical protein
SPy_0658  -2      15       -3.033457   putative Cro-like protein, phage associated
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_0664  PF03544       33     0.001    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 33.0 bits (75), Expect = 0.001
Identities = 22/107 (20%), Positives = 27/107 (25%), Gaps = 8/107 (7%)

Query: 233 VIAHLFAQVPTQPVP-----QTPPVQETPASQTAHESVHEQAEKAPEQPPMQPTSAPVAY 287
H ++P P P E P + + E PE P P APV
Sbjct: 35 TSVHQVIELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVI 94

Query: 288 PPSMPKALTDLMSA---EQVTPDELVAVANIRGHFPPMTPIENFPSD 331
PK EQ D + F P S
Sbjct: 95 EKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSST 141


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_0658  TYPE3OMGPROT  25     0.030    Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature.

Length = 607

Score = 25.2 bits (55), Expect = 0.030
Identities = 9/24 (37%), Positives = 16/24 (66%)

Query: 6 KRLKAERIASGMTQCEVAQSMGWK 29
K L +S +TQC++ +S+GW+
Sbjct: 562 KWLSQNNKSSYLTQCKMDKSLGWR 585


24  SPy_0568  SPy_0527  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
SPy_0568  5       24       -3.372852   hypothetical protein
SPy_0567  6       26       -3.840459   hypothetical protein
SPy_0560  7       30       -4.568745   hypothetical protein
SPy_0559  5       29       -1.643650   hypothetical protein
SPy_0558  4       27       0.347234    hypothetical protein
SPy_0556  3       26       1.368675    hypothetical protein
SPy_0555  3       24       0.220102    putative portal protein - phage associated
SPy_0553  2       22       -1.357692   hypothetical protein
SPy_0552  4       21       -1.770322   hypothetical protein
SPy_0550  3       21       -2.355829   hypothetical protein
SPy_0549  2       17       -4.243903   hypothetical protein
SPy_0547  3       19       -6.482721   hypothetical protein
SPy_0546  3       20       -7.356534   hypothetical protein
SPy_0545  2       22       -8.384269   hypothetical protein
SPy_0544  3       27       -9.446387   hypothetical protein
SPy_0543  3       26       -9.351565   putative efflux protein
SPy_0542  4       26       -8.966383   putative nucleotide sugar dehydrogenase
SPy_0540  3       28       -9.580223   hypothetical protein
SPy_0539  4       29       -9.657479   hypothetical protein
SPy_0538  4       25       -9.196837   putative S-adenosylmethionine synthetase
SPy_0537  2       19       -6.188280   hypothetical protein
SPy_0536  2       18       -5.490525   hypothetical protein
SPy_0535  1       16       -5.075944   hypothetical protein
SPy_0534  0       14       -3.667261   putative shikimate 5-dehydrogenase
SPy_0533  0       12       -2.364959   putative positive regulator
SPy_0532  1       13       -1.735260   putative chromosome segregation SMC protein
SPy_0531  -2      11       -1.225433   putative ribonuclease III
SPy_0530  -1      12       0.003992    hypothetical protein
SPy_0529  0       12       0.266651    two-component sensor histidine kinase
SPy_0528  1       14       0.520699    two-component response regulator
SPy_0527  2       15       0.606139    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_0543  TCRTETA       38     4e-05    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 38.3 bits (89), Expect = 4e-05
Identities = 28/141 (19%), Positives = 59/141 (41%), Gaps = 13/141 (9%)

Query: 52 SVIGVLFNLFGGVIADSFKR----KKIIITTNILCGTACLVLSFLTKEQWLVYAIVLTNV 107
+ G+L +L +I ++ ++ I GT ++L+F T W+ + I+ V
Sbjct: 253 AAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFAT-RGWMAFPIM---V 308

Query: 108 ILAFMSAFSSPSYKAFTKEIVKKDSISQLNSLLETTSTVIKVTVPMVAIFLYKLLGIHGV 167
+LA P+ +A V ++ QL L +++ + P++ +Y +
Sbjct: 309 LLAS-GGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYA----ASI 363

Query: 168 LLLDGLSFLIAALLISFILPV 188
+G +++ A L LP
Sbjct: 364 TTWNGWAWIAGAALYLLCLPA 384


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_0532  GPOSANCHOR    47     3e-07    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 47.4 bits (112), Expect = 3e-07
Identities = 48/313 (15%), Positives = 94/313 (30%), Gaps = 10/313 (3%)

Query: 209 AKVAKQFLELDANRKQLQLDILVKDIDIAQERQTKDTEALAALQQDLASYYAKRQSMEED 268
+ VA + + Q + D + + + + + + AL+ + + +E
Sbjct: 41 SAVATRSQTDTLEKVQERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEK 100

Query: 269 YQKFKQKKQVLSQESDQTQTTLLELTKLIADLEKQIELVKLESGQ---EAEKKAEAKKHL 325
+K + + + + + +L K + + E A K L
Sbjct: 101 LRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADL 160

Query: 326 EQLQEQLDGFQAEEKQCTEQLLHIDQQLCDVKQQLNELSNALERFSSDPDQLMETLREEF 385
E+ E F + + L L + +L + FS+ ++TL E
Sbjct: 161 EKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEK 220

Query: 386 VLLMQKEAALSNQLTALKAHLDKEKQARQHKAQEYQLLVTKLDQLNDESQKAQAHYKAQK 445
L ++A L L + + E L + +L + A A
Sbjct: 221 AALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADS 280

Query: 446 EQVEMLLQNYQEGDKRVQELERDYQLNQERLFDLLDQ-------KKGKEARKASLESIQK 498
+++ L + +LE Q+ L KK EA LE K
Sbjct: 281 AKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNK 340

Query: 499 SHSQFYAGVRAVL 511
+R L
Sbjct: 341 ISEASRQSLRRDL 353



Score = 30.8 bits (69), Expect = 0.034
Identities = 38/243 (15%), Positives = 88/243 (36%), Gaps = 18/243 (7%)

Query: 169 KYKTRKKETQIKLNQTQDNLDRLEDIIYELDTQLAPLEKQAKVAKQFLELDANRKQLQLD 228
+ + + LE L+ + A LEK + A F ++
Sbjct: 229 DLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTA----DSAKIK 284

Query: 229 ILVKDIDIAQERQTKDTEALAALQ-------QDLASYYAKRQSMEEDYQKFKQKKQVLSQ 281
L + + + L +DL + ++ +E ++QK +++ ++
Sbjct: 285 TLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEA 344

Query: 282 ESDQTQTTLLELTKLIADLEKQIELVKLESGQEAEKKAEAKKHLEQLQEQLDGFQAEEKQ 341
+ L + LE + + ++ E+ ++ + L+ LD + +KQ
Sbjct: 345 SRQSLRRDLDASREAKKQLEAEHQKLE-------EQNKISEASRQSLRRDLDASREAKKQ 397

Query: 342 CTEQLLHIDQQLCDVKQQLNELSNALERFSSDPDQLMETLREEFVLLMQKEAALSNQLTA 401
+ L + +L +++ EL + + + +L L E L +K A + +L
Sbjct: 398 VEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEAKALKEKLAKQAEELAK 457

Query: 402 LKA 404
L+A
Sbjct: 458 LRA 460



Score = 30.4 bits (68), Expect = 0.044
Identities = 30/163 (18%), Positives = 54/163 (33%), Gaps = 8/163 (4%)

Query: 676 ELEQISEELTRLVEQLKITEKEVAALQSDLIAKKEELTQLKLAGDQARLAEQRAQMAYQQ 735
LE L L+ + + AK + L K A E R +
Sbjct: 145 TLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAA------LEARQAELEKA 198

Query: 736 LQEKQEDSKALLAALDQSQTTHSDESLLAEQARIEEALTAIAKKKNALTCDIDDIKENKD 795
L+ S A A + + +L A +A +E+AL A + I ++ K
Sbjct: 199 LEGAMNFSTADSAKIKTLE--AEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKA 256

Query: 796 LIRQKTQNIHQALSQARLQERDLLNEKKFEQANQSRLRTQLKQ 838
+ + + +AL A + K +A ++ L +
Sbjct: 257 ALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKAD 299


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_0529  PF06580       44     7e-07    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 44.1 bits (104), Expect = 7e-07
Identities = 30/187 (16%), Positives = 72/187 (38%), Gaps = 34/187 (18%)

Query: 253 DETNRMMRMISDLL--NLSRIDNQVTQLAVEMTNFTAFITSILNRFDLVKNQHTGTGKVY 310
+ M+ +S+L+ +L + + LA E+T +++ +F +++ ++
Sbjct: 191 TKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQF---EDRLQFENQIN 247

Query: 311 EIVRDYPITSVWIEIDNDKMTQVIENILNNAIKYSPDGGKITVRMKTTDTQLIISISDQG 370
+ D + + ++ ++EN + + I P GGKI ++ + + + + + G
Sbjct: 248 PAIMDVQVPPMLVQT-------LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTG 300

Query: 371 LGIPKTDLPLIFDRFYRVDKARSRAQGGTGLGLAIAKEIIKQHHGF---IWAKSDYGKGS 427
K + TG GL +E ++ +G I GK
Sbjct: 301 SLALKNT------------------KESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKV- 341

Query: 428 TFTIVLP 434
+++P
Sbjct: 342 NAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_0528  HTHFIS        92     1e-23    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 91.8 bits (228), Expect = 1e-23
Identities = 29/133 (21%), Positives = 65/133 (48%), Gaps = 1/133 (0%)

Query: 3 KILIVDDEKPISDIIKFNLTKEGYDIVTAFDGREAVTIFEEEKPDLIILDLMLPELDGLE 62
IL+ DD+ I ++ L++ GYD+ + DL++ D+++P+ + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 63 VAKEIRKT-SHVPIIMLSAKDSEFDKVIGLEIGADDYVTKPFSNRELLARVKAHLRRTET 121
+ I+K +P++++SA+++ + E GA DY+ KPF EL+ + L +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 122 IETAVAEENASSG 134
+ + +++
Sbjct: 125 RPSKLEDDSQDGM 137


25  SPy_0447  SPy_0425  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
SPy_0447  1       16       3.053290    5'-methylthioadenosine/S-adenosylhomocysteine
SPy_0446  0       17       3.221669    hypothetical protein
SPy_0444  0       17       3.518580    hypothetical protein
SPy_0443  -2      16       2.971882    putative UDP-N-acetylglucosamine
SPy_0442  -1      15       0.248435    putative glycerol-3-phosphate transporter
SPy_0441  -1      18       -1.742002   hypothetical protein
SPy_0440  1       22       -3.697820   putative dehydrogenease / oxidoreductase
SPy_0439  4       25       -4.768299   hypothetical protein
SPy_0437  5       26       -5.089973   hypothetical protein
SPy_0436  4       27       -4.401727   putative exotoxin (superantigen)
SPy_0435  3       28       2.083236    hypothetical protein
SPy_0433  0       27       4.826470    hypothetical protein
SPy_0432  0       29       5.822899    hypothetical protein
SPy_0431  -2      26       5.386895    hypothetical protein
SPy_0430  -2      24       5.433945    hypothetical protein
SPy_0428  -2      27       5.374908    hypothetical protein
SPy_0427  -1      22       3.600315    ribonucleoside-diphosphate reductase, large
SPy_0426  0       18       2.948201    putative ribonucleotide reductase (NrdI
SPy_0425  -1      18       3.016302    putative ribonucleotide reductase 2
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_0442  TCRTETB       41     5e-06    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 41.4 bits (97), Expect = 5e-06
Identities = 32/153 (20%), Positives = 56/153 (36%), Gaps = 9/153 (5%)

Query: 17 LRRQKVVF---FVAFFGYVCAYLVRNNFKLMSNTIMVQNGWDKAQIAILLSCLTVSYGLA 73
LR +++ ++FF + ++ + ++N LT S G A
Sbjct: 10 LRHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPA--STNWVNTAFMLTFSIGTA 67

Query: 74 KFYMGALGDRVSLRKLFSISLGASALICILIGFFNSSMVVLGILLVLCGVVQGALAPA-S 132
G L D++ +++L + + + IGF S L I+ A PA
Sbjct: 68 --VYGKLSDQLGIKRLLLFGIIINCFGSV-IGFVGHSFFSLLIMARFIQGAGAAAFPALV 124

Query: 133 QAMIANYFPNKTRGGAIAGWNISQNMGSALLPL 165
++A Y P + RG A MG + P
Sbjct: 125 MVVVARYIPKENRGKAFGLIGSIVAMGEGVGPA 157


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_0440  DHBDHDRGNASE  100    1e-27    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 99.7 bits (248), Expect = 1e-27
Identities = 68/252 (26%), Positives = 109/252 (43%), Gaps = 24/252 (9%)

Query: 3 KVVLVTGCASGIGYAQARYFLKQGHHVYGVDKSDKPDLSGNFHFIKLDLSSELAPL---- 58
K+ +TG A GIG A AR QG H+ VD + + +E P
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 59 -----------FKVVPSVDILCNTAGILDAYKPLLDVSDEEVEHLFDINFFATVKLTRHY 107
+ + +DIL N AG+L + +SDEE E F +N +R
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRP-GLIHSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 108 LRRMVEKQSGVIINMCSIASFIAGGGGVAYTSSKHALAGFTRQLALDYAKDQIHIFGIAP 167
+ M++++SG I+ + S + + AY SSK A FT+ L L+ A+ I ++P
Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187

Query: 168 GAVKTAM-----TANDFEP---GGLADWVARETPIGRWTKPDEVAELTGFLASGKARSMQ 219
G+ +T M + G + P+ + KP ++A+ FL SG+A +
Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHIT 247

Query: 220 GEIVKIDGGWTL 231
+ +DGG TL
Sbjct: 248 MHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_0436  BACTRLTOXIN   98     5e-27    Bacterial toxin signature.
		>BACTRLTOXIN#Bacterial toxin signature.

Length = 266

Score = 98.5 bits (245), Expect = 5e-27
Identities = 55/216 (25%), Positives = 96/216 (44%), Gaps = 20/216 (9%)

Query: 35 LNYAYEIIPVDYTNC-NIDYLTTHDFYIDISSYKKKNF-SVDSEVESYITTKFTKNQKVN 92
+ Y Y+ V T ++D HD +IS K KN+ V +E+ + K K++ V+
Sbjct: 51 MKYLYDDHYVSATKVKSVDKFLAHDLIYNISDKKLKNYDKVKTELLNEDLAKKYKDEVVD 110

Query: 93 IFGLPYIFTRYDVYY------------IYGGVTPSVNSNSENSKIVGNLLID--GVQQKT 138
++G Y Y +YGG+T N ++ + N+L+ ++ T
Sbjct: 111 VYGSNYYVNCYFSSKDNVGKVTGGKTCMYGGITKHEG-NHFDNGNLQNVLVRVYENKRNT 169

Query: 139 LINPIKIDKPIFTIQEFDFKIRQYLMQTYKIYDPN-SPYIKGQLEIAINGNKHESFNLYD 197
+ ++ DK T QE D K R +L+ +Y+ N SPY G ++ N +++
Sbjct: 170 ISFEVQTDKKSVTAQELDIKARNFLINKKNLYEFNSSPYETGYIKFIENNGNTFWYDMMP 229

Query: 198 ATSS-STRSDIFKKYKDNKTINMKDFSHFDIYLWTK 232
A +S Y DNKT++ K +++L TK
Sbjct: 230 APGDKFDQSKYLMMYNDNKTVDSKS-VKIEVHLTTK 264


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_0428  BINARYTOXINA  38     2e-05    Clostridial binary toxin A signature.
		>BINARYTOXINA#Clostridial binary toxin A signature.

Length = 454

Score = 38.5 bits (89), Expect = 2e-05
Identities = 42/170 (24%), Positives = 70/170 (41%), Gaps = 27/170 (15%)

Query: 88 INTSLDKAKGELSQLTPELRDQVAQLDAATHRLVIPWNIVVYRYVYETFLRDIGVSHADL 147
IN L + G L+ PEL +V ++ A IP N++VYR G L
Sbjct: 295 INNYL-ISNGPLNNPNPELDSKVNNIENALKLTPIPSNLIVYRRS--------GPQEFGL 345

Query: 148 TSYYRNHQFDPHILCKIK---------LGTRYTKHSFMSTT--ALKNGAMTHRPVEVRIC 196
T + F+ KI+ G T +F+ST+ ++ A R + +RI
Sbjct: 346 TLTSPEYDFN-----KIENIDAFKEKWEGKVITYPNFISTSIGSVNMSAFAKRKIILRIN 400

Query: 197 VKKGAKAAFVEPYSAVPSEVELLFPRGCQLEV--VGAYVSQDQKKLHIEA 244
+ K + A++ E E+L G + ++ V +Y KL ++A
Sbjct: 401 IPKDSPGAYLSAIPGYAGEYEVLLNHGSKFKINKVDSYKDGTVTKLILDA 450


26  SPy_0180  SPy_0173  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias     Product
SPy_0180  1       21       4.739630    hypothetical protein
SPy_0179  1       23       4.417121    putative L-ribulose 5-phosphate 4-epimerase
SPy_0178  1       27       5.076651    putative hexulose-6-phosphate isomerase
SPy_0177  1       24       5.029325    putative hexulose-6-phosphate synthase
SPy_0176  1       23       4.452472    hypothetical protein
SPy_0175  1       24       4.463885    hypothetical protein
SPy_0174  1       24       4.533631    hypothetical protein
SPy_0173  2       21       4.794476    putative leucyl-tRNA synthetase
27  SPy_2082  SPy_2073  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
SPy_2082  0       20       3.393390    putative urocanate hydratase
SPy_2081  -2      10       1.804602    putative imidazolonepropionase
SPy_2080  -2      11       1.730779    putative NADH oxidase/alkyl hydroperoxidase
SPy_2079  0       13       1.543551    putative alkyl hydroperoxidase
SPy_2077  -1      12       1.376281*   putative cold shock protein
SPy_2074  -1      10       1.084950    putative transcriptional regulator
SPy_2073  -2      10       1.089097    putative endopeptidase Clp ATP-binding chain C
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_2082  TCRTETA       29     0.047    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 29.4 bits (66), Expect = 0.047
Identities = 15/45 (33%), Positives = 24/45 (53%), Gaps = 6/45 (13%)

Query: 251 LFISSGLGGMSGAQGKAAEIAKAVAIIAEVDQSRIKTRHSQGWIS 295
L+I + G++GA G A A A IA++ + RH G++S
Sbjct: 99 LYIGRIVAGITGATG-----AVAGAYIADITDGDERARHF-GFMS 137


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_2081  UREASE        46     2e-07    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 46.3 bits (110), Expect = 2e-07
Identities = 21/53 (39%), Positives = 31/53 (58%), Gaps = 6/53 (11%)

Query: 46 IAIKDGLIVALG-SGEPDAE-----LVGTQTIMRSYKGKIATPGIIDCHTHLV 92
I +KDG I A+G +G PD + +VG T + + +GKI T G +D H H +
Sbjct: 88 IGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGKIVTAGGMDSHIHFI 140


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_2080  PF07212       30     0.021    Hyaluronoglucosaminidase
		>PF07212#Hyaluronoglucosaminidase

Length = 336

Score = 30.0 bits (67), Expect = 0.021
Identities = 34/145 (23%), Positives = 58/145 (40%), Gaps = 22/145 (15%)

Query: 242 GGQVMETVGIENMIGTLYT--EGPKLMAEVEAHTKSYDVDIIKAQLATSIEKKENIEVTL 299
G M+ G+E +GTL E P + A + + + +DI+K K++ + T
Sbjct: 205 NGSAMQIRGVEKALGTLKITHENPNVEANYDENAAALSIDIVK--------KQKGGKGTA 256

Query: 300 ANGAVLQAKTAILALGAKWRNINVPGEDEFRNKGVTYCPHCDGPLFEGKDVAVIGGGNSG 359
A G + + + + RN+ +D+F K DG + K + GN
Sbjct: 257 AQGIYINSTSGTTGKLLRIRNLG---DDKFYVKH-------DGGFYAKKTSQI--DGNLK 304

Query: 360 LEAALDLAGLAKHVYVLEFLPELKA 384
L+ A YV + +LKA
Sbjct: 305 LKNPTADDHAATKAYVDSEVKKLKA 329


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
SPy_2073  HTHFIS  36     6e-04    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 36.0 bits (83), Expect = 6e-04
Identities = 37/202 (18%), Positives = 66/202 (32%), Gaps = 34/202 (16%)

Query: 476 PTPVTEDDILATLSKLSGIPLEKLTQADSKKYLNLEKELHKRVIGQDAAVTAISRAIRRN 535
P P +++ + + P + ++ + + ++G+ AA+ I R + R
Sbjct: 103 PKPFDLTELIGIIGRALAEPKRRPSKLEDDSQDGMP------LVGRSAAMQEIYRVLAR- 155

Query: 536 QSGIRTGKRPIGSFMFLGPTGVGKTELAKALAEVLFDDEAALIRFDMSEYMEKFAASRLN 595
+ + M G +G GK +A+AL + + +M+ S L
Sbjct: 156 ---LMQTDLTL---MITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELF 209

Query: 596 GAPPGYVGYDEGGELTQKVRNKPYSV-------LLFDEVEKAHPDIFNVLLQVLDDGILT 648
G E G T L DE+ D LL+VL G T
Sbjct: 210 GH--------EKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYT 261

Query: 649 ---DSRGRKVDFSNTIIIMTSN 667
+ D I+ +N
Sbjct: 262 TVGGRTPIRSDVR---IVAATN 280


28  SPy_2032  SPy_2003  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
SPy_2032   -1      22        0.957758  putative ATP-binding cassette transporter-like
SPy_2031   -1      23        0.990979  ABC transporter ATP-binding protein
SPy_2029    1      24        0.614668  putative ABC transporter (ATP-binding protein)
SPy_2027    3      22        0.840758  putative two-component response regulator
SPy_2026    3      20        1.556543  putative histidine kinase
SPy_2025    2      18        2.612373  immunogenic secreted protein precursor
SPy_2023    1      14        2.078675  hypothetical protein (mga-associated)
SPy_2019    2      13        2.503029  M protein trans-acting positive regulator
SPy_2018    2      13        2.978093  M protein type 1
SPy_2016    1      10        2.987819  inhibitor of complement-mediated lysis
SPy_2013    0      10        2.650124  transposase - IS1562
SPy_2010    1      11        2.100848  C5A peptidase precursor
SPy_2009    0      12        1.585440  hypothetical protein
SPy_2007   -2      11        0.757657  putative laminin adhesion
SPy_2006   -1      12        0.711136  hypothetical protein
SPy_2005   -1      13        0.403900  hypothetical protein
SPy_2004   -1      14        0.245759  ATPase protein
SPy_2003    1      16       -1.091216  ATPase protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
SPy_2032  RTXTOXIND  55     3e-10    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 54.8 bits (132), Expect = 3e-10
Identities = 35/144 (24%), Positives = 55/144 (38%), Gaps = 10/144 (6%)

Query: 60 DISLTLAGEVTANNSSKVKIDSSKGEVKEVFVKKGDVVKVGQPLFSYETSQRLTAQSSEF 119
+I T G++T + SK VKE+ VK+G+ V+ G L +LTA +E
Sbjct: 81 EIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLL------KLTALGAEA 134

Query: 120 DVQTKANQLQVAKTNAALKWETYNRKVNEINTLKSRYNTAPDESLLEQIRSAEDSVSQAL 179
D + Q + A L+ Y I K PDE + + E +L
Sbjct: 135 DTL----KTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSL 190

Query: 180 SDAKTADSDVKTAQIELDKANATA 203
+ + + Q EL+ A
Sbjct: 191 IKEQFSTWQNQKYQKELNLDKKRA 214



Score = 39.4 bits (92), Expect = 2e-05
Identities = 28/180 (15%), Positives = 61/180 (33%), Gaps = 16/180 (8%)

Query: 120 DVQTKANQLQVAKTNAALKWETYNRKVNEINTLKSRYNTAPDESL---LEQIRSAEDSVS 176
D + ++ +AK + Y VNE+ KS+ E L E + +
Sbjct: 239 DFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKN 298

Query: 177 QALSDAKTADSDVKTAQIELDKANATATTEKGKLEYDTVKSDTAGTIVSLNTDLPNQSKS 236
+ L + ++ +EL K + + +++ + + L
Sbjct: 299 EILDKLRQTTDNIGLLTLELAKNEE-------RQQASVIRAPVSVKVQQLKVHTEGGVV- 350

Query: 237 KKENETFMEII-DKSKMLVKGNISEFDRDKLKIGQKVEV-IDRKDNSK--KWTGKVTQVG 292
ET M I+ + + V + D + +GQ + ++ ++ GKV +
Sbjct: 351 -TTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNIN 409
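
The `Identities`, `Positives` and `Gaps` fields give a count over the alignment length followed by a percentage. On the lines in this section those percentages match integer division rather than rounding (for example, `Gaps = 10/144 (6%)`). The sketch below, again illustrative and using hypothetical names, recomputes them so that a parsed copy of the report can be sanity-checked.

```python
import re

# Matches each "Name = count/length (NN%)" field on an Identities line.
FIELD = re.compile(r"(Identities|Positives|Gaps)\s*=\s*(\d+)/(\d+)\s*\((\d+)%\)")


def parse_alignment_counts(line):
    """Map field name -> (count, alignment_length, reported_percent)."""
    return {
        name: (int(count), int(length), int(percent))
        for name, count, length, percent in FIELD.findall(line)
    }


def truncated_percent(count, length):
    """Percentage with the fractional part dropped (integer division)."""
    return count * 100 // length


def percentages_consistent(line):
    """True when every field found on the line agrees with truncated_percent."""
    fields = parse_alignment_counts(line)
    return all(truncated_percent(c, n) == p for c, n, p in fields.values())


if __name__ == "__main__":
    sample = "Identities = 35/144 (24%), Positives = 55/144 (38%), Gaps = 10/144 (6%)"
    print(parse_alignment_counts(sample))
    print(percentages_consistent(sample))
```

Lines without a `Gaps` field, which appear for ungapped alignments, are handled the same way because only the fields actually present are checked.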


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
SPy_2027  HTHFIS  83     1e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 83.0 bits (205), Expect = 1e-20
Identities = 31/128 (24%), Positives = 55/128 (42%), Gaps = 1/128 (0%)

Query: 3 KILVVEDDDTISQVICEFLKANNYDPDCVFDGQAALDKWQTTSYDLIILDIMLPSLSGLE 62
ILV +DD I V+ + L YD + DL++ D+++P + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 63 VLKTIRKT-SDVPIIMLTALDDEYTQLVSFNHLISDYVTKPFSPLILIKRIENVLRVSTP 121
+L I+K D+P+++++A + T + + DY+ KPF LI I L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 122 DEKRQIGD 129
+ D
Sbjct: 125 RPSKLEDD 132


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
SPy_2026  MECHCHANNEL  32     0.002    Bacterial mechano-sensitive ion channel signature.
		>MECHCHANNEL#Bacterial mechano-sensitive ion channel signature.

Length = 136

Score = 31.7 bits (72), Expect = 0.002
Identities = 14/62 (22%), Positives = 28/62 (45%), Gaps = 8/62 (12%)

Query: 10 VINGLIIVVVTSILLVLYFAMPIYYTKVKDKEVKCEFDQTSKQIKGKTVTEIRDILTKKI 69
V + LI+ ++ A+ + + KE +K+ +TEIRD+L ++
Sbjct: 82 VFDFLIVA------FAIFMAIKLINKLNRKKEEPAAAPAPTKEEV--LLTEIRDLLKEQN 133

Query: 70 NK 71
N+
Sbjct: 134 NR 135


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
SPy_2025  IGASERPTASE  42     9e-06    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 41.6 bits (97), Expect = 9e-06
Identities = 28/157 (17%), Positives = 54/157 (34%), Gaps = 13/157 (8%)

Query: 42 TADTDTDDESETPKKDKKSKETASQHDTQKDHKPSHTHPTPPSNDTKQTDQASSEATDKP 101
T +T T + ET +K+ K TQ+ P T P + +T Q +E +
Sbjct: 1092 TKETQTTETKETATVEKEEKAKVETEKTQEV--PKVTSQVSPKQEQSETVQPQAEP-ARE 1148

Query: 102 NKDKNDTKQPDSSDQSTPSPKDQSSQKESQNKDGRPTPSPDQQKDQTPD--KTPEKSADK 159
N + K+P S +T D + + + + + + PE +
Sbjct: 1149 NDPTVNIKEPQSQTNTTA---DTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPA 1205

Query: 160 TPEKGPEKATDKTPEPN-----RDAPKPIQPPLAAAP 191
T + + P+ R P ++P ++
Sbjct: 1206 TTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSN 1242


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
SPy_2019  PF05043  519    0.0      Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 519 bits (1339), Expect = 0.0
Identities = 109/473 (23%), Positives = 217/473 (45%), Gaps = 20/473 (4%)

Query: 34 ELSKALNISMLTLQTCLTNMQ-FMKEVGGITYKNGYITIWYHQHCGLQEVYQKALRHSQS 92
EL++ LN + ++ L++++ ++ + NG I ++ VY +HS
Sbjct: 30 ELAELLNCTERAVKDDLSHVKSAFPDLIFHSSTNGIRIINT-DDSDIEMVYHHFFKHSTH 88

Query: 93 FKLLETLFFRDFNSLEELAEELFVSLSTLKRLIKKTNAYLMHTFGITILTSPVQVSGDEH 152
F +LE +FF + E + +E ++S S+L R+I + N + F + +PVQ+ G+E
Sbjct: 89 FSILEFIFFNEGCQAESICKEFYISSSSLYRIISQINKVIKRQFQFEVSLTPVQIIGNER 148

Query: 153 QIRLFYLKYFSEAYKISEWPFGEILNLKNCERLLSLMIKEVDVRVNFTLFQHLKILSSVN 212
IR F+ +YFSE Y EWPF + + +LL L+ KE +N + + LK+L N
Sbjct: 149 DIRYFFAQYFSEKYYFLEWPFEN-FSSEPLSQLLELVYKETSFPMNLSTHRMLKLLLVTN 207

Query: 213 LIRYYKGHSAVYDNKKTSQRFSQLIQSSLEFQDLSRLFHLKFGLYLDETTIAEMFSNHVN 272
L R GH D + + + + + +++ F ++ + LDE + ++F ++
Sbjct: 208 LYRIKFGHFMEVDKDSFNDQSLDFLMQAEGIEGVAQSFESEYNISLDEEVVCQLFVSYFQ 267

Query: 273 DQLEIGYAF--DSIKQDSPTGCRKVTNWVHLL----DELEIRLNLSVTNKYEVAVILHNT 326
I + +K+DS V HLL D++ ++ + + NK + LHNT
Sbjct: 268 KMFFIDESLFMKCVKKDS-----YVEKSYHLLSDFIDQISVKYQIEIENKDNLIWHLHNT 322

Query: 327 TVLKEEDITANYLFFDYKKSYLNFYKQEHPHLYKAFVAGVEKLMRSEKEPISTELTNQLI 386
L +++ ++ FD K + + ++ P + + + + S+ + N L
Sbjct: 323 AHLYRQELFTEFILFDQKGNTIRNFQNIFPKFVSDVKKELSHYLETLEVCSSSMMVNHLS 382

Query: 387 YAFFITWENSFLKVNQKDEKIRLLVI----ERSFNSVGNFLKKYVGEFFSITNFNELDAL 442
Y F ++ + + Q K+++LV+ + V L Y F + + EL+
Sbjct: 383 YTFITHTKHLVINLLQNQPKLKVLVMSNFDQYHAKFVAETLSYYCSNNFELEVWTELELS 442

Query: 443 TIDLEEIEKQYDVIVTDVMVGKSEELEIFFFHKMIPEAIIDKLNAFLNISFAD 495
LE + YD+I+++ ++ E + + + + ++I LNA + I +
Sbjct: 443 KESLE--DSPYDIIISNFIIPPIENKRLIYSNNINTVSLIYLLNAMMFIRLDE 493


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
SPy_2018  GPOSANCHOR  182    1e-53    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 182 bits (463), Expect = 1e-53
Identities = 246/450 (54%), Positives = 281/450 (62%), Gaps = 32/450 (7%)

Query: 35 NQTEVKANGDGNPREVIEDLAANNPAIQNIRLRYENKDLKARLENAMEVAGRDFKRAEEL 94
KA + + L DL+ LE AM + D + + L
Sbjct: 122 KADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTL 181

Query: 95 EKAKQALEDQRKDLETKLKELQQDYDLAKESTSWDRQRLEKELEEKKEALELAIDQASRD 154
E K ALE ++ +LE L+ K + LE +
Sbjct: 182 EAEKAALEARQAELEKALEGAMNF---------------STADSAKIKTLEAEKAALAAR 226

Query: 155 YHRATALEKELEEKKKALELAIDQASQDYNRANVLEKELETITREQEINRNLLGNAKLEL 214
+ A I + LE + + E N ++
Sbjct: 227 KADLEKALEGAMNFSTADSAKIKTLEAEKAA---LEARQAELEKALEGAMNFSTADSAKI 283

Query: 215 DQLSSEKEQLTIEKAKLEEEKQISDASRQSLRRDLDASREAKKQVEKDLANLTAELDKVK 274
L +EK L EKA LE + Q+ +A+RQSLRRDLDASREAKKQ+E AE K++
Sbjct: 284 KTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLE-------AEHQKLE 336

Query: 275 EDKQISDASRQGLRRDLDASREAKKQVEKDLANLTAELDKVKEEKQISDASRQGLRRDLD 334
E +IS+ASRQ LRRDLDASREAKKQ+E AE K++E+ +IS+ASRQ LRRDLD
Sbjct: 337 EQNKISEASRQSLRRDLDASREAKKQLE-------AEHQKLEEQNKISEASRQSLRRDLD 389

Query: 335 ASREAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEAKALKEQLA 394
ASREAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEAKALKE+LA
Sbjct: 390 ASREAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEAKALKEKLA 449

Query: 395 KQAEELAKLRAGKASDSQTPDTKPGNKAVPGKGQAPQAGTKPNQNKAPMKETKRQLPSTG 454
KQAEELAKLRAGKASDSQTPD KPGNKAVPGKGQAPQAGTKPNQNKAPMKETKRQLPSTG
Sbjct: 450 KQAEELAKLRAGKASDSQTPDAKPGNKAVPGKGQAPQAGTKPNQNKAPMKETKRQLPSTG 509

Query: 455 ETANPFFTAAALTVMATAGVAAVVKRKEEN 484
ETANPFFTAAALTVMATAGVAAVVKRKEEN
Sbjct: 510 ETANPFFTAAALTVMATAGVAAVVKRKEEN 539



Score = 51.2 bits (122), Expect = 6e-09
Identities = 86/413 (20%), Positives = 143/413 (34%), Gaps = 50/413 (12%)

Query: 1 MAKNNTNRHYSLRKLKTGTASVAVALTVLGAGFANQTEVKANGDGNPREVIEDLAANNPA 60
M KNNTNRHYSLRKLKTGTASVAVALTVLGAG T + + +
Sbjct: 1 MTKNNTNRHYSLRKLKTGTASVAVALTVLGAGLVVNTNEVSAVATRSQTDTLEKVQER-- 58

Query: 61 IQNIRLRYENKDLKARLENAMEVAGRDFKRAEELEKAKQALEDQRKDLETKLKELQQDYD 120
+ EN LK + + +EL + +++ + + L E
Sbjct: 59 --ADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKAS--- 113

Query: 121 LAKESTSWDRQRLEKELEEKKEALELAIDQASRDYHRATALEKELEEKKKALELAIDQAS 180
+ELE +K LE A++ A +A K LE +K AL
Sbjct: 114 ------------KIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLE 161

Query: 181 QDYNRANVLEKELETITREQEINRNLLGNAKLELDQLSSEKEQLTIEKAKLEEEKQISDA 240
+ L+ + + + LE EK +A
Sbjct: 162 KA-------------------------------LEGAMNFSTADSAKIKTLEAEKAALEA 190

Query: 241 SRQSLRRDLDASREAKKQVEKDLANLTAELDKVKEDKQISDASRQGLRRDLDASREAKKQ 300
+ L + L+ + + L AE + K + + +G A K
Sbjct: 191 RQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKT 250

Query: 301 VEKDLANLTAELDKVKEEKQISDASRQGLRRDLDASREAKKQVEKALEEANSKLAALEKL 360
+E + A L A ++++ + + + K +E + + L
Sbjct: 251 LEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNAN 310

Query: 361 NKELEESKKLTEKEKAELQAKLEAEAKALKEQLAKQAEELAKLRAGKASDSQT 413
+ L + + K +L+A+ + + K A + L A + + Q
Sbjct: 311 RQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQL 363


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
SPy_2010  SUBTILISIN  106    5e-27    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 106 bits (266), Expect = 5e-27
Identities = 50/226 (22%), Positives = 85/226 (37%), Gaps = 47/226 (20%)

Query: 117 KAGKGAGTVVAVIDAGFDKNHEAWRLTDKTKARYQSKEDLEKAKKEHGITYGEWVNDKVA 176
+G G VAV+D G D +H DL KA+ G + +
Sbjct: 36 NQTRGRGVKVAVLDTGCDADHP----------------DL-KARIIGGRNFTDDDEGDPE 78

Query: 177 YYHDYSKDGKTAVDQEHGTHVSGILSGNAPSETKEPYRLEGAMPEAQLLLMRVEIVNGLA 236
+ DY+ HGTHV+G ++ + G PEA LL+++V G
Sbjct: 79 IFKDYNG---------HGTHVAGTIAATENE-----NGVVGVAPEADLLIIKVLNKQGSG 124

Query: 237 DYARNYAQAIIDAVNLGAKVINMSFGNAALAYANLPDETKKAFDYAKSKGVSIVTSAGND 296
Y Q I A+ +I+MS G E +A A + + ++ +AGN+
Sbjct: 125 QYD-WIIQGIYYAIEQKVDIISMSLGGPED-----VPELHEAVKKAVASQILVMCAAGNE 178

Query: 297 SSFGGKTRLPLADHPDYGVVGTPAAADSTLTVASYSPDKQLTETAT 342
+T +G P + ++V + + D+ +E +
Sbjct: 179 GDGDDRT----------DELGYPGCYNEVISVGAINFDRHASEFSN 214



Score = 80.3 bits (198), Expect = 4e-18
Identities = 37/139 (26%), Positives = 58/139 (41%), Gaps = 22/139 (15%)

Query: 457 NATPKVLPTASGTK---LSRFSSWGLTADGNIKPDIAAPGQDILSSVANNKYAKLSGTSM 513
+V+ + S FS+ + D+ APG+DILS+V KYA SGTSM
Sbjct: 192 GCYNEVISVGAINFDRHASEFSNSNN------EVDLVAPGEDILSTVPGGKYATFSGTSM 245

Query: 514 SAPLVAGIMGL-LQKQYETQYPDMTPSERLDLAKKVLMSSATALYDEDEKAYFSPRQQGA 572
+ P VAG + L Q + D+T E L+ L + SP+ +G
Sbjct: 246 ATPHVAGALALIKQLANASFERDLTEPE----LYAQLIKRTIPLGN-------SPKMEGN 294

Query: 573 GAVDAKKASA-ATMYVTDK 590
G + + ++ T +
Sbjct: 295 GLLYLTAVEELSRIFDTQR 313


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
SPy_2009  IGASERPTASE  55     2e-10    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 55.5 bits (133), Expect = 2e-10
Identities = 52/304 (17%), Positives = 103/304 (33%), Gaps = 23/304 (7%)

Query: 44 ISLTQKTTATTSENWHHIDKDGLIPLGISLEAAKEEFKKEVEESRLSEAQKETYKQKIKT 103
I + + +E +D+ + P + + E E + +K T
Sbjct: 1003 IQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETT 1062

Query: 104 APDKDKLLFTYHSEYMTAVKDLPASTESTTQPVEA-PVQETQASASDSMVTGDSTSVTTD 162
A +++ E + VK + E E Q T+ + ++ + V T+
Sbjct: 1063 AQNREVA-----KEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETE 1117

Query: 163 SPEETPSSESPVAPALSEA-----PAQPAESEEPSVAASSEETPSPSTPAAPETPEEPAA 217
+E P S V+P ++ A+PA +P+V ++ + +T + +E ++
Sbjct: 1118 KTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSS 1177

Query: 218 PSPSPESEEPSVAAPSEETPS-PETPE-EPAAPSQPAESEESSVAATTSPSPSTPAESET 275
P +E S E PE A +QP + ESS S +
Sbjct: 1178 NVEQPVTES----TTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHN 1233

Query: 276 QTPPAVTKDSDKPSSAAEKPAASSLVSEQTVQQPTSKRSSDKKEEQEQSYSPNRSLSRQV 335
P + S+ A L S T + R+ + + ++ +S+
Sbjct: 1234 VEPATTS------SNDRSTVALCDLTSTNTNAVLSDARAKAQFVALNVGKAVSQHISQLE 1287

Query: 336 RAHE 339
+E
Sbjct: 1288 MNNE 1291



Score = 41.6 bits (97), Expect = 5e-06
Identities = 20/119 (16%), Positives = 37/119 (31%)

Query: 206 PAAPETPEEPAAPSPSPESEEPSVAAPSEETPSPETPEEPAAPSQPAESEESSVAATTSP 265
TP A PS S +A E P P P+ ++ + T
Sbjct: 994 TTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEK 1053

Query: 266 SPSTPAESETQTPPAVTKDSDKPSSAAEKPAASSLVSEQTVQQPTSKRSSDKKEEQEQS 324
+ E+ Q + + + + SE Q T + + E++E++
Sbjct: 1054 NEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKA 1112



Score = 39.3 bits (91), Expect = 3e-05
Identities = 21/109 (19%), Positives = 43/109 (39%), Gaps = 4/109 (3%)

Query: 219 SPSPESEEPSVAAPSEETPSPETPEEPAAPSQPAESEESSVAATTSPSPSTPAESE---T 275
+P E +V + TP+ + P+ PS E A P+P+TP+E+
Sbjct: 982 NPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVA 1041

Query: 276 QTPPAVTKDSDKPSS-AAEKPAASSLVSEQTVQQPTSKRSSDKKEEQEQ 323
+ +K +K A E A + V+++ + +++ +
Sbjct: 1042 ENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGS 1090



Score = 38.9 bits (90), Expect = 4e-05
Identities = 26/179 (14%), Positives = 58/179 (32%), Gaps = 2/179 (1%)

Query: 163 SPEETPSSESPVAPALSEAPAQPAESEEPSVAASSEETPSPSTPAAPETPEEPAAPSPSP 222
+ TP++ P++ + A +E V + TPS +T E ++ +
Sbjct: 995 TNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKN 1054

Query: 223 ESEEPSVAAPSEETPSPETPEEPAAPSQP--AESEESSVAATTSPSPSTPAESETQTPPA 280
E + A + E A A+S + T+ + T + +
Sbjct: 1055 EQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKV 1114

Query: 281 VTKDSDKPSSAAEKPAASSLVSEQTVQQPTSKRSSDKKEEQEQSYSPNRSLSRQVRAHE 339
T+ + + + + SE Q R +D ++ S + + + +
Sbjct: 1115 ETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAK 1173


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_2007  ADHESNFAMILY  250    2e-84    Adhesin family signature.
		>ADHESNFAMILY#Adhesin family signature.

Length = 309

Score = 250 bits (640), Expect = 2e-84
Identities = 83/323 (25%), Positives = 144/323 (44%), Gaps = 34/323 (10%)

Query: 1 MKKGFFLMAMVVSLVMIAGCDKSANPKQPTQGMSVVTSFYPMYAMTKEVSGDLNDVR-MI 59
MKK L+ + +S +++ C Q + VV + + +TK ++GD D+ ++
Sbjct: 1 MKKLGTLLVLFLSAIILVACASGKKDTTSGQKLKVVATNSIIADITKNIAGDKIDLHSIV 60

Query: 60 QSGAGIHSFEPSVNDVAAIYDADLFVYHSHTLE----AWARDLDPNLKKSKVDVFEASKP 115
G H +EP DV +ADL Y+ LE AW L N KK++ + A
Sbjct: 61 PIGQDPHEYEPLPEDVKKTSEADLIFYNGINLETGGNAWFTKLVENAKKTENKDYFA--- 117

Query: 116 LTLDRVKGLEDMEVTQGIDPATLY--------DPHTWTDPVLAGEEAVNIAKELGRLDPK 167
V+ G+D L DPH W + A NIAK+L DP
Sbjct: 118 -------------VSDGVDVIYLEGQNEKGKEDPHAWLNLENGIIFAKNIAKQLSAKDPN 164

Query: 168 HKDSYTKNAKAFKKEAEQLTEEYTQKFKKVR--SKTFVTQHTAFSYLAKRFGLKQLGISG 225
+K+ Y KN K + + ++L +E KF K+ K VT AF Y +K +G+ I
Sbjct: 165 NKEFYEKNLKEYTDKLDKLDKESKDKFNKIPAEKKLIVTSEGAFKYFSKAYGVPSAYIWE 224

Query: 226 ISPEQEPSPRQLKEIQDFVKEYNVKTIFAEDNVNPKIAHAIAKSTGAKVKT---LSPLEA 282
I+ E+E +P Q+K + + +++ V ++F E +V+ + +++ T + +
Sbjct: 225 INTEEEGTPEQIKTLVEKLRQTKVPSLFVESSVDDRPMKTVSQDTNIPIYAQIFTDSIAE 284

Query: 283 APSGNKTYLENLRANLEVLYQQL 305
+Y ++ NL+ + + L
Sbjct: 285 QGKEGDSYYSMMKYNLDKIAEGL 307


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
SPy_2006  PF05616  37     2e-04    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 37.4 bits (86), Expect = 2e-04
Identities = 25/88 (28%), Positives = 36/88 (40%), Gaps = 2/88 (2%)

Query: 226 IPKKDLSPSELAAAQAYWSQKQGRGARPSDYRPTPAPAPGRRKAPIPDVTPNPGQGHQPD 285
IP+ DL+P A A + P++ P P PG R P PD NP D
Sbjct: 310 IPRPDLTPGSAEAPNAQPLPEVSPAENPAN-NPAPNENPGTRPNPEPDPDLNPDANPDTD 368

Query: 286 -NGGYHPAPPRPNDASQNKHQRDEFKGK 312
G P P D +H+++ +G+
Sbjct: 369 GQPGTRPDSPAVPDRPNGRHRKERKEGE 396


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
SPy_2003  HTHFIS  29     0.022    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.0 bits (65), Expect = 0.022
Identities = 9/16 (56%), Positives = 12/16 (75%)

Query: 45 IIGASGSGKSLLAHAI 60
I G SG+GK L+A A+
Sbjct: 165 ITGESGTGKELVARAL 180


29  SPy_1983  SPy_1976  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
SPy_1983   -2      19        4.750327  collagen-like surface protei
SPy_1981   -1      21        3.974462  (p)ppGpp synthetase
SPy_1980   -2      16        3.647222  hypothetical protein
SPy_1979   -2      16        3.653278  streptokinase A precursor
SPy_1978   -1      16        4.019037  leucine-rich protein
SPy_1976   -2      16        3.653790  multiple sugar-binding ABC transport system
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
SPy_1983  GPOSANCHOR  60     6e-12    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 59.7 bits (144), Expect = 6e-12
Identities = 37/90 (41%), Positives = 43/90 (47%)

Query: 259 KSPEGEAGQPGEKAPEKSKEVTPAAEKPADKEANQTPERRNGNMAKTPVANNHRRLPATG 318
K E A KA + K + N K P+ R+LP+TG
Sbjct: 450 KQAEELAKLRAGKASDSQTPDAKPGNKAVPGKGQAPQAGTKPNQNKAPMKETKRQLPSTG 509

Query: 319 EQANPFFTAAAVAVMTTAGVLAVTKRKENN 348
E ANPFFTAAA+ VM TAGV AV KRKE N
Sbjct: 510 ETANPFFTAAALTVMATAGVAAVVKRKEEN 539


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
SPy_1979  STREPKINASE  815    0.0      Streptococcus streptokinase protein signature.
		>STREPKINASE#Streptococcus streptokinase protein signature.

Length = 440

Score = 815 bits (2106), Expect = 0.0
Identities = 389/440 (88%), Positives = 410/440 (93%)

Query: 1 MKNYLSIGVIALLFALTFGTVKSVQAIAGYGWLPDRPPINNSQLVVSMAGIVEGTDKKVF 60
MKNYLS G+ ALLFALTFGTV SVQAIAG WL DRP +NNSQLVVS+AG VEGT++ +
Sbjct: 1 MKNYLSFGMFALLFALTFGTVNSVQAIAGPEWLLDRPSVNNSQLVVSVAGTVEGTNQDIS 60

Query: 61 INFFEIDLTSQPAHGGKTEQGLSPKSKPFATDNGAMPHKLEKADLLKAIQKQLIANVHSN 120
+ FFEIDLTS+PAHGGKTEQGLSPKSKPFATD+GAM HKLEKADLLKAIQ+QLIANVHSN
Sbjct: 61 LKFFEIDLTSRPAHGGKTEQGLSPKSKPFATDSGAMSHKLEKADLLKAIQEQLIANVHSN 120

Query: 121 DGYFEVIDFASDATITDRNGKVYFADKDGSVTLPTQPVQEFLLKGHVRVRPYKEKPVQNQ 180
D YFEVIDFASDATITDRNGKVYFADKDGSVTLPTQPVQEFLL GHVRVRPYKEKP+QNQ
Sbjct: 121 DDYFEVIDFASDATITDRNGKVYFADKDGSVTLPTQPVQEFLLSGHVRVRPYKEKPIQNQ 180

Query: 181 AKSVDVEYTVQFTPLNPDDDFRPGLKDTKLLKTLAIGDTITSQELLAQAQSILNKTHPGY 240
AKSVDVEYTVQFTPLNPDDDFRPGLKDTKLLKTLAIGDTITSQELLAQAQSILNK HPGY
Sbjct: 181 AKSVDVEYTVQFTPLNPDDDFRPGLKDTKLLKTLAIGDTITSQELLAQAQSILNKNHPGY 240

Query: 241 TIYERDSSIVTHDNDIFRTILPMDQEFTYHVKNREQAYEINPKTGIKEKTNNTDLVSEKY 300
TIYERDSSIVTHDNDIFRTILPMDQEFTY VKNREQAY IN K+G+ E+ NNTDL+SEKY
Sbjct: 241 TIYERDSSIVTHDNDIFRTILPMDQEFTYRVKNREQAYRINKKSGLNEEINNTDLISEKY 300

Query: 301 YVLKQGEKPYDPFDRSHLKLFTIKYVDVNTNELLKSEQLLTASERNLDFRDLYDPRDKAK 360
YVLK+GEKPYDPFDRSHLKLFTIKYVDV+TNELLKSEQLLTASERNLDFRDLYDPRDKAK
Sbjct: 301 YVLKKGEKPYDPFDRSHLKLFTIKYVDVDTNELLKSEQLLTASERNLDFRDLYDPRDKAK 360

Query: 361 LLYNNLDAFDIMDYTLTGKVEDNHDKNNRVVTVYMGKRPKGAKGSYHLAYDKDLYTEEER 420
LLYNNLDAF IMDYTLTGKVEDNHD NR++TVYMGKRP+G SYHLAYDKD YTEEER
Sbjct: 361 LLYNNLDAFGIMDYTLTGKVEDNHDDTNRIITVYMGKRPEGENASYHLAYDKDRYTEEER 420

Query: 421 KAYSYLRDTGTPIPDNPKDK 440
+ YSYLR TGTPIPDNP DK
Sbjct: 421 EVYSYLRYTGTPIPDNPNDK 440


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
SPy_1978  HTHFIS  34     7e-04    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 33.7 bits (77), Expect = 7e-04
Identities = 10/30 (33%), Positives = 19/30 (63%)

Query: 229 ALWSEHGNLVQTAQRLYIHRNSLQYKLDKF 258
AL + GN ++ A L ++RN+L+ K+ +
Sbjct: 444 ALTATRGNQIKAADLLGLNRNTLRKKIREL 473


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
SPy_1976  PF05272  35     6e-04    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 34.7 bits (79), Expect = 6e-04
Identities = 14/56 (25%), Positives = 20/56 (35%), Gaps = 9/56 (16%)

Query: 34 IVFVGPSGCGKSTTLRMIAGLEDISEGELKIGGEVVNDKSPKDRDIAMVFQNYALY 89
+V G G GKST + + GL+ S+ IG +D Y
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIG---------TGKDSYEQIAGIVAY 645


30  SPy_1866  SPy_1856  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
SPy_1866    1      13        0.878146  hypothetical protein
SPy_1865    0      13        0.092168  hypothetical protein
SPy_1864   -2      13       -0.902459  putative DNA polymerase III epsilon subunit
SPy_1863   -2      11       -0.988234  putative transcriptional activator regulator
SPy_1862   -2      13       -1.193351  hypothetical protein
SPy_1861   -1      11       -1.181150  putative repressor - phage associated
SPy_1858   -2      10       -1.305068  putative X-Pro dipeptidyl-peptidase IV
SPy_1857   -1      11       -2.184458  putative transcriptional regulatory protein
SPy_1856    0      21       -0.097107  putative antibiotic resistance protein NorA
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_1866  DHBDHDRGNASE  29     0.035    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 28.9 bits (64), Expect = 0.035
Identities = 15/56 (26%), Positives = 27/56 (48%), Gaps = 1/56 (1%)

Query: 7 IIIGGGPAGMMAAISSSYYGYKTLLIEKNRRLGKKLAGTGGGRCNVTNSGNLDVLM 62
+ +G PAG+ ++Y K + + LG +LA RCN+ + G+ + M
Sbjct: 140 VTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEY-NIRCNIVSPGSTETDM 194


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
SPy_1865  IGASERPTASE  31     0.003    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 30.8 bits (69), Expect = 0.003
Identities = 24/102 (23%), Positives = 45/102 (44%), Gaps = 10/102 (9%)

Query: 84 EETKQRELLEILVDEKNTEITRLYEQLKAKDAQLASKDEQMRVKDVQIAEKDKQLDQQQQ 143
E ++E + +E++ T + AK+A+ K + Q E + +
Sbjct: 1041 AENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKA------NTQTNEVAQS--GSET 1092

Query: 144 LTAKAMADKETLKLELEE-AKAEANQARLQVEEVQAEVGPKK 184
+ KET +E EE AK E + + +V +V ++V PK+
Sbjct: 1093 KETQTTETKETATVEKEEKAKVETEKTQ-EVPKVTSQVSPKQ 1133


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_1863  BCTERIALGSPF  28     0.032    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 28.3 bits (63), Expect = 0.032
Identities = 14/57 (24%), Positives = 27/57 (47%), Gaps = 1/57 (1%)

Query: 131 KNQKAWKKLQWKMGISIFLAIVSY-VGLILLSSYLQKFWLVYVAMGLFLPGFSWLVI 186
+ Q+ ++Q M L +V+ V ILLS + K ++ M LP + +++
Sbjct: 161 QRQQMRSRIQQAMIYPCVLTVVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLM 217


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
SPy_1862  TYPE4SSCAGX  27     0.014    Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein signature.

Length = 522

Score = 27.1 bits (59), Expect = 0.014
Identities = 25/85 (29%), Positives = 43/85 (50%), Gaps = 7/85 (8%)

Query: 9 KQAQKLQKQMEQKQADLAAMQFTGKSAQDLVTA-----TFTGDKKLVGIDFKEAVVDPED 63
+QAQK QK +K+ + A + ++L A + +K L + ++ + +
Sbjct: 156 EQAQKAQKDKREKRKEERAKNRA--NLENLTNAMSNPQNLSNNKNLSELIKQQRENELDQ 213

Query: 64 VETLQDMTTQAINDALTQIDETTKK 88
+E L+DM QA +AL QI+E KK
Sbjct: 214 MERLEDMQEQAQANALKQIEELNKK 238


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
SPy_1856  TCRTETB  52     3e-09    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 51.8 bits (124), Expect = 3e-09
Identities = 67/411 (16%), Positives = 148/411 (36%), Gaps = 46/411 (11%)

Query: 30 SSFSMEEKLFNKHFVAITVINFIVYMVYYLFTVIIAFVATRELGAQTSQAGLATGIYILG 89
+S+S N+ + + +++F + + V + +A + + ++L
Sbjct: 3 TSYSQSNLRHNQILIWLCILSFFSVLNEMVLNVSLPDIAN-DFNKPPASTNWVNTAFMLT 61

Query: 90 TLLARLIFGKQLEVFG-RRLVLRGGAIFYLLTTLAYFYMPTISMMYLVRFLNGFGYGVVS 148
+ ++GK + G +RL+L G I + + + S++ + RF+ G G
Sbjct: 62 FSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFP 121

Query: 149 TATNTIVTAYIPARKRGEGINFYGLSTSLAAAIGPFVGTFMLDNLHIDFRMI-------- 200
+V YIP RG+ G ++ +GP +G + +H + ++
Sbjct: 122 ALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIIT 181

Query: 201 ----------------------IVLCSVLIGCVVVGAFAFPVKNMSLNAEQL---AKTKS 235
I+L SV I ++ ++ + + ++ K
Sbjct: 182 VPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLIFVKHIR 241

Query: 236 WTVDSFIEK---KALFITAIAFLMGIAYASVLGFQKLYTSEI----HLTT--VGAYFFVV 286
D F++ K + GI + +V GF + + L+T +G+
Sbjct: 242 KVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFP 301

Query: 287 YALIITITRPAMGRLMDAKGDKWVLYPSYLFLAMGLFLLGSVSSGGSYLLSGALIG-FGY 345
+ + I G L+D +G +VL FL++ + S+ ++ ++ G
Sbjct: 302 GTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGG 361

Query: 346 GTFMSCGQAASI-QGVDEHRFNTAMSTYMIGLDLGLGAGPYLLGLIKDLAL 395
+F + + + + MS L G G ++G + + L
Sbjct: 362 LSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPL 412



Score = 34.1 bits (78), Expect = 0.001
Identities = 35/196 (17%), Positives = 76/196 (38%), Gaps = 12/196 (6%)

Query: 12 LKYIIFCFFCKMFMKIERSSFSMEEKLF-NKHFVAITVINFIVYMVYYLFTVIIAFVATR 70
+ + F F K K+ L N F+ + I++ F ++ ++
Sbjct: 228 VSVLSFLIFVKHIRKVTDPFVDPG--LGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKD 285

Query: 71 ELGAQTSQAGLATGIYILGTLLARLIF----GKQLEVFGRRLVLRGGAIFYLLT--TLAY 124
T++ G + I ++ +IF G ++ G VL G F ++ T ++
Sbjct: 286 VHQLSTAEIG---SVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASF 342

Query: 125 FYMPTISMMYLVRFLNGFGYGVVSTATNTIVTAYIPARKRGEGINFYGLSTSLAAAIGPF 184
T M ++ G T +TIV++ + ++ G G++ ++ L+ G
Sbjct: 343 LLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIA 402

Query: 185 VGTFMLDNLHIDFRMI 200
+ +L +D R++
Sbjct: 403 IVGGLLSIPLLDQRLL 418


31  SPy_1805  SPy_1794  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
SPy_1805   -1      10       -0.974179  putative preprotein translocase binding subunit
SPy_1804   -1      11       -1.567333  putative holo-(acyl carrier protein) synthase
SPy_1802   -1      11       -1.158920  putative alanine racemase
SPy_1801   -1      10       -1.043914  immunogenic secreted precursor-like protein
SPy_1798    0       9       -1.435059  hypothetical protein sharing similarity with
SPy_1796    2      14       -0.817332  hypothetical protein
SPy_1795    3      14       -0.800976  putative ABC transporter (periplasmic binding
SPy_1794    3      13       -1.238736  putative ABC transporter (permease)
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SPy_1805  SECA  1053   0.0      SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 1053 bits (2725), Expect = 0.0
Identities = 394/903 (43%), Positives = 560/903 (62%), Gaps = 73/903 (8%)

Query: 1 MANILRKVIENDKG-ELRKLEKIAKKVESYADQMASLSDRDLQGKTLEFKERYQKGETLE 59
+ +L KV + LR++ K+ + + +M LSD +L+GKT EF+ R +KGE LE
Sbjct: 2 LIKLLTKVFGSRNDRTLRRMRKVVNIINAMEPEMEKLSDEELKGKTAEFRARLEKGEVLE 61

Query: 60 QLLPEAFAVVREAAKRVLGLFPYRVQIMGGIVLHNGDVPEMRTGEGKTLTATMPVYLNAI 119
L+PEAFAVVREA+KRV G+ + VQ++GG+VL+ + EMRTGEGKTLTAT+P YLNA+
Sbjct: 62 NLIPEAFAVVREASKRVFGMRHFDVQLLGGMVLNERCIAEMRTGEGKTLTATLPAYLNAL 121

Query: 120 AGEGVHVITVNEYLSTRDATEMGEVYSWLGLSVGINLAAKSPAEKREAYNCDITYSTNSE 179
G+GVHV+TVN+YL+ RDA ++ +LGL+VGINL KREAY DITY TN+E
Sbjct: 122 TGKGVHVVTVNDYLAQRDAENNRPLFEFLGLTVGINLPGMPAPAKREAYAADITYGTNNE 181

Query: 180 VGFDYLRDNMVVRQEDMVQRPLNFALVDEVDSVLIDEARTPLIVSGAVSSETNQLYIRAD 239
GFDYLRDNM E+ VQR L++ALVDEVDS+LIDEARTPLI+SG + ++Y R +
Sbjct: 182 YGFDYLRDNMAFSPEERVQRKLHYALVDEVDSILIDEARTPLIISGPAEDSS-EMYKRVN 240

Query: 240 MFVKTLT------------SVDYVIDVPTKTIGLSDSGIDKAESYFNLS-------NLYD 280
+ L + +D ++ + L++ G+ E +LY
Sbjct: 241 KIIPHLIRQEKEDSETFQGEGHFSVDEKSRQVNLTERGLVLIEELLVKEGIMDEGESLYS 300

Query: 281 IENVALTHFIDNALRANYIMLLDIDYVVSEDGEILIVDQFTGRTMEGRRFSDGLHQAIEA 340
N+ L H + ALRA+ + D+DY+V +DGE++IVD+ TGRTM+GRR+SDGLHQA+EA
Sbjct: 301 PANIMLMHHVTAALRAHALFTRDVDYIV-KDGEVIIVDEHTGRTMQGRRWSDGLHQAVEA 359

Query: 341 KEGVRIQEESKTSASITYQNMFRMYKKLAGMTGTAKTEEEEFREVYNMRIIPIPTNRPIA 400
KEGV+IQ E++T ASIT+QN FR+Y+KLAGMTGTA TE EF +Y + + +PTNRP+
Sbjct: 360 KEGVQIQNENQTLASITFQNYFRLYEKLAGMTGTADTEAFEFSSIYKLDTVVVPTNRPMI 419

Query: 401 RIDHTDLLYPTLESKFRAVVEDVKTRHAKGQPILVGTVAVETSDLISRKLVEAGIPHEVL 460
R D DL+Y T K +A++ED+K R AKGQP+LVGT+++E S+L+S +L +AGI H VL
Sbjct: 420 RKDLPDLVYMTEAEKIQAIIEDIKERTAKGQPVLVGTISIEKSELVSNELTKAGIKHNVL 479

Query: 461 NAKNHFKEAQIIMNAGQRGAVTIATNMAGRGTDIKLG----------------------- 497
NAK H EA I+ AG AVTIATNMAGRGTDI LG
Sbjct: 480 NAKFHANEAAIVAQAGYPAAVTIATNMAGRGTDIVLGGSWQAEVAALENPTAEQIEKIKA 539

Query: 498 ------EGVRELGGLCVIGTERHESRRIDNQLRGRSGRQGDPGESQFYLSLEDDLMRRFG 551
+ V E GGL +IGTERHESRRIDNQLRGRSGRQGD G S+FYLS+ED LMR F
Sbjct: 540 DWQVRHDAVLEAGGLHIIGTERHESRRIDNQLRGRSGRQGDAGSSRFYLSMEDALMRIFA 599

Query: 552 SDRIKAFLDRMKLDEEDTVIKSGMLGRQVESAQKRVEGNNYDTRKQVLQYDDVMREQREI 611
SDR+ + ++ + + I+ + + + +AQ++VE N+D RKQ+L+YDDV +QR
Sbjct: 600 SDRVSGMMRKLGM-KPGEAIEHPWVTKAIANAQRKVESRNFDIRKQLLEYDDVANDQRRA 658

Query: 612 IYANRRDVITANRDLGPEIKAMIKRTIDRAVDAHARSNR---KDAIDAIVTFARTSLVPE 668
IY+ R +++ + D+ I ++ + +DA+ I + + +
Sbjct: 659 IYSQRNELLDVS-DVSETINSIREDVFKATIDAYIPPQSLEEMWDIPGLQERLKNDFDLD 717

Query: 669 EFIS--AKELRGLKDDQIKEKLYQRALAIYDQQLSKLRDQEAIIEFQKVLILMIVDNKWT 726
I+ + L ++ ++E++ +++ +Y ++ + E + F+K ++L +D+ W
Sbjct: 718 LPIAEWLDKEPELHEETLRERILAQSIEVYQRKEEVVG-AEMMRHFEKGVMLQTLDSLWK 776

Query: 727 EHIDALDQLRNAVGLRGYAQNNPVVEYQAEGFKMFQDMIGAIEFDVTRTMMKAQIH-EQE 785
EH+ A+D LR + LRGYAQ +P EY+ E F MF M+ +++++V T+ K Q+ +E
Sbjct: 777 EHLAAMDYLRQGIHLRGYAQKDPKQEYKRESFSMFAAMLESLKYEVISTLSKVQVRMPEE 836

Query: 786 RERASQRATTAAPQNIQSQQSANTDD-------------LPKVERNEACPCGSGKKFKNC 832
E Q+ A + Q QQ ++ DD KV RN+ CPCGSGKK+K C
Sbjct: 837 VEELEQQRRMEAERLAQMQQLSHQDDDSAAAAALAAQTGERKVGRNDPCPCGSGKKYKQC 896

Query: 833 HGR 835
HGR
Sbjct: 897 HGR 899


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
SPy_1802  ALARACEMASE  347    e-120    Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 347 bits (891), Expect = e-120
Identities = 121/368 (32%), Positives = 193/368 (52%), Gaps = 23/368 (6%)

Query: 7 RPTVARVNLQAIKENVASVQKHIPLGVKTYAVVKADAYGHGAVQVSKALLPQVDGYCVSN 66
RP A ++LQA+K+N++ V++ + ++VVKA+AYGHG ++ A+ DG+ + N
Sbjct: 3 RPIQASLDLQALKQNLSIVRQAAT-HARVWSVVKANAYGHGIERIWSAI-GATDGFALLN 60

Query: 67 LDEALQLRQAGIDKEILIL-GVLLPNELELAVANAITVTIAS---LDWIALARLEKKECQ 122
L+EA+ LR+ G IL+L G +LE+ + +T + S L + ARL+
Sbjct: 61 LEEAITLRERGWKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKALQNARLKAP--- 117

Query: 123 GLKVHVKVDSGMGRIGLRSSKEVNLLIDSLKELGADVEGIFTHFATADEADDTKFNQQLQ 182
L +++KV+SGM R+G + + + + + +HFA A+ D +
Sbjct: 118 -LDIYLKVNSGMNRLGFQPDRVLTVWQQLRAMANVGEMTLMSHFAEAEHPDGIS--GAMA 174

Query: 183 FFKKLIAGLEDKPRLVHASNSATSIWHSDTIFNAVRLGIVSYGLNPSGS-DLSLPFPLQE 241
++ GL SNSA ++WH + F+ VR GI+ YG +PSG L+
Sbjct: 175 RIEQAAEGL---ECRRSLSNSAATLWHPEAHFDWVRPGIILYGASPSGQWRDIANTGLRP 231

Query: 242 ALSLESSLVHVKMISAGDTVGYGATYTAKKSEYVGTVPIGYADGWTRNM-QGFSVLVDGQ 300
++L S ++ V+ + AG+ VGYG YTA+ + +G V GYADG+ R+ G VLVDG
Sbjct: 232 VMTLSSEIIGVQTLKAGERVGYGGRYTARDEQRIGIVAAGYADGYPRHAPTGTPVLVDGV 291

Query: 301 FCEIIGRVSMDQLTIRLPKA--YPLGTKVTLIGSNQQKNISTTDIANYRNTINYEVLCLL 358
+G VSMD L + L +GT V L G K I D+A T+ YE++C L
Sbjct: 292 RTMTVGTVSMDMLAVDLTPCPQAGIGTPVELWG----KEIKIDDVAAAAGTVGYELMCAL 347

Query: 359 SDRIPRIY 366
+ R+P +
Sbjct: 348 ALRVPVVT 355


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score   E-value   Comments
SPy_1795   FERRIBNDNGPP   68      3e-15     Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 67.7 bits (165), Expect = 3e-15
Identities = 55/265 (20%), Positives = 103/265 (38%), Gaps = 24/265 (9%)

Query: 18 VACVNQHPKTAKETEQQRIVATSVAVVDICDRLNLDLVGVCDSKLYTL----PKRYDAVK 73
+ A + RIVA V++ L + GV D+ Y L P D+V
Sbjct: 20 PLLWQMNTAHAAAIDPNRIVALEWLPVELLLALGIVPYGVADTINYRLWVSEPPLPDSVI 79

Query: 74 RVGLPMNPDIELIASLKPTWILSPNSLQEDLEPKYQKLDTEYGFLNLRSVEG------MY 127
VGL P++EL+ +KP++++ P + L +G
Sbjct: 80 DVGLRTEPNLELLTEMKPSFMV----WSAGYGPSPEMLARIAPGRGFNFSDGKQPLAMAR 135

Query: 128 QSIDDLGNLFQRQQEAKELRQQYQDYYRAFQAKRKGK-KKPKVLILMGLPGSYLVATNQS 186
+S+ ++ +L Q A+ QY+D+ R+ + + + +P +L + P LV S
Sbjct: 136 KSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVFGPNS 195

Query: 187 YVGNLLDLAGGENVYQSDEKEFLSA--NPEDMLA-KEPDLILRTAHAIPDKVKVMFDKEF 243
+LD G N +Q + + S + + + A K+ D++ D +M
Sbjct: 196 LFQEILDEYGIPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNSKDMDALM----- 250

Query: 244 AENDIWKHFTAVKEGKVYDLDNTLF 268
+W+ V+ G+ + F
Sbjct: 251 -ATPLWQAMPFVRAGRFQRVPAVWF 274


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score   E-value   Comments
SPy_1794   TYPE3IMSPROT   28      0.043     Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 28.2 bits (63), Expect = 0.043
Identities = 19/76 (25%), Positives = 32/76 (42%), Gaps = 5/76 (6%)

Query: 255 LASVATSIVGVVSFLGL---IVPHMSRLLVGSKHQILIPFSALLGAFVFLLADTLGRSLA 311
+ S A + +GL H S+L++ Q +PFS L V + L
Sbjct: 29 VVSTALIVALSAMLMGLSDYYFEHFSKLMLIPAEQSYLPFSQALSYVVDNVLLEFF-YLC 87

Query: 312 YPLEISPAIIMSIVGG 327
+PL ++ A +M+I
Sbjct: 88 FPL-LTVAALMAIASH 102


32   SPy_1556   SPy_1547   N   Y   N   Pathogenicity Island (unbiased-composition)
LocusTag   DNBias   CDNBias   %GCBias     Product
SPy_1556   -2       20        1.974195    putative two-component response regulator
SPy_1553   -2       19        1.913707    putative two-component sensor kinase
SPy_1552   0        17        1.045764    hypothetical protein
SPy_1551   3        23        1.145335    conserved protein - function unknown
SPy_1549   2        22        0.917821    putative arginine repressor
SPy_1548   1        19        1.244172    hypothetical protein
SPy_1547   3        23        2.063972    streptococcal antitumor protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score   E-value   Comments
SPy_1556   HTHFIS         94      3e-24     FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 94.1 bits (234), Expect = 3e-24
Identities = 42/165 (25%), Positives = 75/165 (45%), Gaps = 12/165 (7%)

Query: 3 SLLIVEDEYLVRQGIRSLVDFSQFKIDRVNEAENGQLAWDLFQKEPYDIVLTDINMPKLN 62
++L+ +D+ +R + + + + V N W D+V+TD+ MP N
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGY---DVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 63 GIQLAELIKQESPQTHLVFLTGYDDFNYALSALKLGADDYLLKPFSKADVEDMLGKLRKK 122
L IK+ P ++ ++ + F A+ A + GA DYL KPF D+ +++G + +
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPF---DLTELIGIIGRA 118

Query: 123 LELSKKTETIQELVEQPQKEVSAIAMAIHE------RLADSDLTL 161
L K+ + E Q + + A+ E RL +DLTL
Sbjct: 119 LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTL 163


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score   E-value   Comments
SPy_1553   PF06580        182     1e-54     Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 182 bits (464), Expect = 1e-54
Identities = 57/203 (28%), Positives = 101/203 (49%), Gaps = 10/203 (4%)

Query: 362 EKAIGQYRLQALASQINPHFLYNTLDTIIWMAEFNDSKRVVEVTKSLAKYFRLALNQGN- 420
+ +L AL +QINPHF++N L+ I + D + E+ SL++ R +L N
Sbjct: 155 ASMAQEAQLMALKAQINPHFMFNALNNIRALIL-EDPTKAREMLTSLSELMRYSLRYSNA 213

Query: 421 EYIRLADELDHVSQYLFIQKQRYGDKLSYEVQGLDVYADFVIPKLILQPLVENAIYHGIK 480
+ LADEL V YL + ++ D+L +E Q D +P +++Q LVEN I HGI
Sbjct: 214 RQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVENGIKHGIA 273

Query: 481 EVDRKGMIKVTVSDTAQHLMLTVWDNGKGIEDSSLTNSQSLLARGGVGLKNVDQRLKLHY 540
++ + G I + + + L V + G ++ ++ G GL+NV +RL++ Y
Sbjct: 274 QLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKEST-------GTGLQNVRERLQMLY 326

Query: 541 GEGYHMTIHSQSDQFTEIQLSLP 563
G + + + + + + +P
Sbjct: 327 GTEAQIKLSEKQGKVNAM-VLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score   E-value   Comments
SPy_1549   ARGREPRESSOR   123     4e-39     Bacterial arginine repressor signature.
		>ARGREPRESSOR#Bacterial arginine repressor signature.

Length = 149

Score = 123 bits (311), Expect = 4e-39
Identities = 60/146 (41%), Positives = 92/146 (63%), Gaps = 2/146 (1%)

Query: 1 MNKKETRHQLIRSLISETTIHTQQELQERLQKNGITITQATLSRDMKELNLVKVTSGNDT 60
MNK + RH IR +I+ I TQ EL + L+K+G +TQAT+SRD+KEL+LVKV + N +
Sbjct: 1 MNKGQ-RHIKIREIITANEIETQDELVDILKKDGYNVTQATVSRDIKELHLVKVPTNNGS 59

Query: 61 HYEALAISQTRWEH-RLRFYMEDALVMLKIVQHQIILKTLPGLAQSFGSILDAMQIPEIV 119
+ +L Q +L+ + DA V + H I+LKT+PG AQ+ G+++D + EI+
Sbjct: 60 YKYSLPADQRFNPLSKLKRSLMDAFVKIDSASHLIVLKTMPGNAQAIGALMDNLDWEEIM 119

Query: 120 ATVCGDDTCLIVCEDNEQAKACYETL 145
T+CGDDT LI+C ++ K + +
Sbjct: 120 GTICGDDTILIICRTHDDTKVVQKKI 145


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score   E-value   Comments
SPy_1547   ARGDEIMINASE   578     0.0       Bacterial arginine deiminase signature.
		>ARGDEIMINASE#Bacterial arginine deiminase signature.

Length = 409

Score = 578 bits (1492), Expect = 0.0
Identities = 191/410 (46%), Positives = 276/410 (67%), Gaps = 9/410 (2%)

Query: 5 TPIHVYSEIGKLKKVLLHRPGKEIENLMPDYLERLLFDDIPFLEDAQKEHDAFAQALRDE 64
PI+++SEIG+LKKVLLHRPG+E+ENL P ++ LFDDIP+LE A++EH+ FA L++
Sbjct: 6 NPINIFSEIGRLKKVLLHRPGEELENLTPFIMKNFLFDDIPYLEVARQEHEVFASILKNN 65

Query: 65 GIEVLYLETLAAESLVTP-EIREAFIDEYLSEANIRGRATKKAIRELLMAIEDNQELIEK 123
+E+ Y+E L +E LV+ + FI +++ EA I+ T +++ ++ +I K
Sbjct: 66 LVEIEYIEDLISEVLVSSVALENKFISQFILEAEIKTDFTINLLKDYFSSL-TIDNMISK 124

Query: 124 TMAGVQKSELPEIPASEKGLTDLVESNYPFAIDPMPNLYFTRDPFATIGTGVSLNHMFSE 183
++GV EL +S L DLV F IDPMPN+ FTRDPFA+IG GV++N MF++
Sbjct: 125 MISGVVTEELKNYTSS---LDDLVNGANLFIIDPMPNVLFTRDPFASIGNGVTINKMFTK 181

Query: 184 TRNRETLYGKYIFTHHPIYGGGKVPMVYDRNETTRIEGGDELVLSKDVLAVGISQRTDAA 243
R RET++ +YIF +HP+Y VP+ +R E +EGGDELVL+K +L +GIS+RT+A
Sbjct: 182 VRQRETIFAEYIFKYHPVYKE-NVPIWLNRWEEASLEGGDELVLNKGLLVIGISERTEAK 240

Query: 244 SIEKLLVNIFKQNLGFKKVLAFEFANNRKFMHLDTVFTMVDYDKFTIHPEIEGDLRVYSV 303
S+EKL +++FK F +LAF+ NR +MHLDTVFT +DY FT + +Y +
Sbjct: 241 SVEKLAISLFKNKTSFDTILAFQIPKNRSYMHLDTVFTQIDYSVFTSFTSDDMYFSIYVL 300

Query: 304 TYDNE--ELHIVEEKGDLAELLAANLGVEKVDLIRCGGDNLVAAGREQWNDGSNTLTIAP 361
TY+ ++HI +EK + ++L+ LG K+D+I+C G +L+ REQWNDG+N L IAP
Sbjct: 301 TYNPSSSKIHIKKEKARIKDVLSFYLG-RKIDIIKCAGGDLIHGAREQWNDGANVLAIAP 359

Query: 362 GVVVVYNRNTITNAILESKGLKLIKIHGSELVRGRGGPRCMSMPFEREDI 411
G ++ Y+RN +TN + E G+K+ +I SEL RGRGGPRCMSMP REDI
Sbjct: 360 GEIIAYSRNHVTNKLFEENGIKVHRIPSSELSRGRGGPRCMSMPLIREDI 409


33   SPy_1537   SPy_1521   N   Y   N   Pathogenicity Island (unbiased-composition)
LocusTag   DNBias   CDNBias   %GCBias     Product
SPy_1537   3        19        -2.692595   putative 3-deoxy-D-manno-octulosonic-acid
SPy_1536   3        16        -2.102846   hypothetical protein
SPy_1535   2        15        -2.545918   putative ribose transport operon repressor
SPy_1534   0        12        -1.061784   hypothetical protein
SPy_1533   0        12        -0.321614   23S rRNA (adenine(2503)-C(2))-methyltransferase
SPy_1532   0        18        1.105102    hypothetical protein
SPy_1531   2        23        1.940101    putative peroxide resistance protein
SPy_1530   0        17        1.830760    hypothetical protein
SPy_1529   -1       15        2.114727    glucose kinase
SPy_1528   -1       15        1.126335    hypothetical protein
SPy_1527   -1       13        1.045082    putative GTP-binding protein TypA/BipA (tyrosine
SPy_1526   -3       13        1.103853    hypothetical protein
SPy_1525   -3       13        0.598947    putative UDP-N-acetylmuramoylalanine-D-glutamate
SPy_1524   -2       15        0.231707    putative
SPy_1523   -1       15        -0.541466   cell division protein
SPy_1521   -1       19        -1.611512   cell division protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score   E-value   Comments
SPy_1537   LPSBIOSNTHSS   153     2e-50     Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature.

Length = 166

Score = 153 bits (388), Expect = 2e-50
Identities = 58/157 (36%), Positives = 94/157 (59%), Gaps = 2/157 (1%)

Query: 5 IGLYTGSFDPVTNGHLDIVKRASGLFDQIYVGIFDNPTKKSYFKLEVRKAMLTQALADFT 64
+Y GSFDP+T GHLDI++R LFDQ+YV + NP K+ F ++ R + +A+A
Sbjct: 2 NAIYPGSFDPITFGHLDIIERGCRLFDQVYVAVLRNPNKQPMFSVQERLEQIAKAIAHLP 61

Query: 65 NVIVVTSHERLAIDVAKELRVTHLIRGLRNATDFEYEENLEYFNHLLAPNIETVYLISRN 124
N V + E L ++ A++ + ++RGLR +DFE E + N LA ++ETV+L +
Sbjct: 62 NAQVDSF-EGLTVNYARQRQAGAILRGLRVLSDFELELQMANTNKTLASDLETVFLTTST 120

Query: 125 KWQALSSSRVRELIHFQSSLEGLVPQSVIAQV-EKMN 160
++ LSSS V+E+ F ++E VP V A + ++ +
Sbjct: 121 EYSFLSSSLVKEVARFGGNVEHFVPSHVAAALYDQFH 157


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score   E-value   Comments
SPy_1535   NUCEPIMERASE   32      0.004     Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 31.7 bits (72), Expect = 0.004
Identities = 13/76 (17%), Positives = 34/76 (44%), Gaps = 9/76 (11%)

Query: 50 LAQSLKTKKNQLVGLLLPDISNPFF-PRLARGAEEYLKEKGYRVMLGNISDSEALEE--- 105
+++ L +Q+VG+ D N ++ L + E L + G++ +++D E + +
Sbjct: 16 VSKRLLEAGHQVVGI---DNLNDYYDVSLKQARLELLAQPGFQFHKIDLADREGMTDLFA 72

Query: 106 --EYVHVLLQSNAAGI 119
+ V + + +
Sbjct: 73 SGHFERVFISPHRLAV 88


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score   E-value   Comments
SPy_1532   PREPILNPTASE   30      0.005     Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 30.2 bits (68), Expect = 0.005
Identities = 42/160 (26%), Positives = 58/160 (36%), Gaps = 25/160 (15%)

Query: 70 GLIIILWASMVHWVSASYCYLLLFSLLFSLF--DWRSQ------EYPFILWLFSFVSLLL 121
L+ + A + + LLL +L +L D P + F L
Sbjct: 118 ALLSVAVAMTLAPGWGTLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGG 177

Query: 122 FYSIN---------YLSLILLLLGLLAHLRPFSIGAGDFFYLASLALVLDLTSLIWLIQL 172
F S+ YL L L +G GDF LA+L L +L ++ L
Sbjct: 178 FVSLGDAVIGAMAGYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLL 237

Query: 173 ASLAGITACLLL-------GIKRIPFIPYLSFGLFWIVLL 205
+SL G + L K IPF PYL+ WI LL
Sbjct: 238 SSLVGAFMGIGLILLRNHHQSKPIPFGPYLAIA-GWIALL 276


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score   E-value   Comments
SPy_1531   HELNAPAPROT    151     1e-49     Helicobacter neutrophil-activating protein A family ...
		>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature.

Length = 153

Score = 151 bits (383), Expect = 1e-49
Identities = 49/154 (31%), Positives = 85/154 (55%), Gaps = 4/154 (2%)

Query: 19 KKEASKNEKT--KAVLNQAVADLSVAASIVHQVHWYMRGPGFLYLHPKMDELLDSLNANL 76
K E +K +T + LN +++ + S +H+ HWY++GP F LH K +EL D +
Sbjct: 2 KTENAKTNQTLVENSLNTQLSNWFLLYSKLHRFHWYVKGPHFFTLHEKFEELYDHAAETV 61

Query: 77 DEMSERLITIGGAPYSTLAEFSKHSKLDEAKGTYDKTVAQHLARLVEVYLYLSSLYQVGL 136
D ++ERL+ IGG P +T+ E+++H+ + + + + ++ + LV Y +SS + +
Sbjct: 62 DTIAERLLAIGGQPVATVKEYTEHASITDGGN--ETSASEMVQALVNDYKQISSESKFVI 119

Query: 137 DITDEEGDAGTNDLFTAAKTEAEKTIWMLQAERG 170
+ +E D T DLF E EK +WML + G
Sbjct: 120 GLAEENQDNATADLFVGLIEEVEKQVWMLSSYLG 153


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score   E-value   Comments
SPy_1529   PF03309        32      0.003     Bvg accessory factor
		>PF03309#Bvg accessory factor

Length = 271

Score = 31.7 bits (72), Expect = 0.003
Identities = 29/126 (23%), Positives = 43/126 (34%), Gaps = 14/126 (11%)

Query: 5 LLGIDLGGTTIKFGILTAAGEVQE---KWAIETNILEGGKHIVPDIIASIKHRLDLYGLS 61
LL ID+ T G+++ +G+ + +W I T + D +A L G
Sbjct: 2 LLAIDVRNTHTVVGLISGSGDHAKVVQQWRIRTE-----PEVTADELALTIDG--LIGDD 54

Query: 62 SADFVGIGMGSPGAVDRDTNTVTGAFNLNWKETQEVGSVVEKELGIPFAIDNDANVAALG 121
+ G S V + V W V GIP +DN V A
Sbjct: 55 AERLTGASGLS--TVPSVLHEVRVMLEQYWPNVPHVLIEPGVRTGIPLLVDNPKEVGA-- 110

Query: 122 ERWVGA 127
+R V
Sbjct: 111 DRIVNC 116


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score   E-value   Comments
SPy_1527   TCRTETOQM      186     4e-53     Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 186 bits (473), Expect = 4e-53
Identities = 102/477 (21%), Positives = 187/477 (39%), Gaps = 97/477 (20%)

Query: 8 IRNVAIIAHVDHGKTTLVDELLKQSHTLDERKELQE--RAMDSNDLEKERGITILAKNTA 65
I N+ ++AHVD GKTTL + LL S + E + + D+ LE++RGITI T+
Sbjct: 3 IINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITS 62

Query: 66 VAYNDVRINIMDTPGHADFGGEVERIMKMVDGVVLVVDAYEGTMPQTRFVLKKALEQNLI 125
+ + ++NI+DTPGH DF EV R + ++DG +L++ A +G QTR + + +
Sbjct: 63 FQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIP 122

Query: 126 PIVVVNKIDKPSARP-------------------------------------AEVVDEVL 148
I +NKID+ + V E
Sbjct: 123 TIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEGN 182

Query: 149 ELFIELGADDEQLE-----------------FPVVYASAINGTSSLSDDPADQEHTMAPI 191
+ +E + LE FPV + SA N + +
Sbjct: 183 DDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIG------------IDNL 230

Query: 192 FDTIIDHIPAPVDNSDEPLQFQVSLLDYNDFVGRIGIGRVFRGTVKVGDQVTLSKLDGTT 251
+ I + + L +V ++Y++ R+ R++ G + + D V +S
Sbjct: 231 IEVITNKFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRIS----EK 286

Query: 252 KNFRVTKLFGFFGLERREIQEAKAGDLIAVSGMEDIFVGETITPTDCVEALPILRIDEPT 311
+ ++T+++ E +I +A +G+++ + E + + + T + + P
Sbjct: 287 EKIKITEMYTSINGELCKIDKAYSGEIVILQN-EFLKLNSVLGDTKLLPQRERIENPLPL 345

Query: 312 LQMTFLVNNSPFAGREGKWITSRKVEER--LLAELQT----DVSLRVDPTDSPDKWTVSG 365
LQ T + K ++R LL L D LR + + +S
Sbjct: 346 LQTT---------------VEPSKPQQREMLLDALLEISDSDPLLRYYVDSATHEIILSF 390

Query: 366 RGELHLSILIETMRRE-GYELQVSRPEVIIKEIDGVKCEPFERVQIDTPEEYQGAII 421
G++ + + ++ + E+++ P VI E K E + I+ P A I
Sbjct: 391 LGKVQMEVTCALLQEKYHVEIEIKEPTVIYMERPLKKAE--YTIHIEVPPNPFWASI 445



Score = 42.5 bits (100), Expect = 4e-06
Identities = 18/79 (22%), Positives = 31/79 (39%), Gaps = 1/79 (1%)

Query: 403 EPFERVQIDTPEEYQGAIIQSLSERKGDMLDMQMVGNGQTRLIFLIPARGLIGYSTEFLS 462
EP+ +I P+EY + +++D Q + N + L IPAR + Y ++
Sbjct: 537 EPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQ-LKNNEVILSGEIPARCIQEYRSDLTF 595

Query: 463 MTRGYGIMNHTFDQYLPVV 481
T G + Y
Sbjct: 596 FTNGRSVCLTELKGYHVTT 614


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score   E-value   Comments
SPy_1524   LIPPROTEIN48   30      0.012     Mycoplasma P48 major surface lipoprotein signature.
		>LIPPROTEIN48#Mycoplasma P48 major surface lipoprotein signature.

Length = 428

Score = 30.4 bits (68), Expect = 0.012
Identities = 19/99 (19%), Positives = 31/99 (31%), Gaps = 10/99 (10%)

Query: 147 FEQEDQLSKVKHLGAVTKVFKDANQMPESTQLE-AVKEYFSRDLKTLLFIGGSAGAHVFN 205
FE ++K + + N + S+ E A S K + G
Sbjct: 83 FEALKAINKQTGI--------EINNVEPSSNFESAYNSALSAGHKIWVLNGFKHQQS-IK 133

Query: 206 QFISDHPELKQRYNIINITGDPHLNELSSHLYRVDYVTD 244
Q+I H E +R I I D + Y + +
Sbjct: 134 QYIDAHREELERNQIKIIGIDFDIETEYKWFYSLQFNIK 172


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score   E-value   Comments
SPy_1521   SHAPEPROTEIN   47      5e-08     Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 47.4 bits (113), Expect = 5e-08
Identities = 42/191 (21%), Positives = 79/191 (41%), Gaps = 16/191 (8%)

Query: 170 RKTVERAGIKVENIIISPLAMAKTILNEGEREFGATVIDMGGGQTTVASMRAQELQYTNI 229
R++ + AG + +I P+A A G+ V+D+GGG T VA + + Y++
Sbjct: 127 RESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNGVVYSSS 186

Query: 230 YAEGGEYITKDISKVLKTSLAI------AEALKFNFGQAEISEASITETVK-VDVV-GSE 281
GG+ + I ++ + AE +K G A + V+ ++ G
Sbjct: 187 VRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVP 246

Query: 282 EPVEVTERYLSEIISARIRHILDRVKQDLER------GRLLDLPGGIVLIGGGAIMPGVV 335
+ + E + + I+ V LE+ + + G+VL GGGA++ +
Sbjct: 247 RGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISE--RGMVLTGGGALLRNLD 304

Query: 336 EIAQEIFGVTV 346
+ E G+ V
Sbjct: 305 RLLMEETGIPV 315


34   SPy_1013   SPy_1007   N   Y   N   Pathogenicity Island (unbiased-composition)
LocusTag   DNBias   CDNBias   %GCBias     Product
SPy_1013   -1       15        -1.886732   putative fibronectin-binding protein-like
SPy_1012   1        14        -2.534782   hypothetical protein
SPy_1011   2        14        -3.277432   hypothetical protein
SPy_1010   1        15        -3.095106   putative
SPy_1008   3        17        -3.659429   streptococcal exotoxin H precursor
SPy_1007   5        18        -0.114461   streptococcal exotoxin I
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score   E-value   Comments
SPy_1013   FbpA_PF05833   707     0.0       Fibronectin-binding protein
		>FbpA_PF05833#Fibronectin-binding protein

Length = 577

Score = 707 bits (1827), Expect = 0.0
Identities = 196/577 (33%), Positives = 325/577 (56%), Gaps = 32/577 (5%)

Query: 1 MSFDGFFLHHLTNELKENLLYGRIQKVNQPFERELVLTIRNHRKNYKLLLSAHPVFGRVQ 60
M+ DG FL+ + +ELK ++ G+I KVNQP + E++L IR R ++KLL+S+ + R+
Sbjct: 1 MALDGIFLYSIIDELKNTIINGKIDKVNQPEKDEIILNIRKGRLSFKLLISSSSNYPRIH 60

Query: 61 ITQADFQNPQVPNTFTMIMRKYLQGAVIEQLEQIDNDRIIEIKVSNKNEIGDAIQATLII 120
+T NP F M++RKY+ A I + QI+ DRI+ I + +E+G +LII
Sbjct: 61 LTDLTKPNPIKAPMFCMVLRKYISNAKIVDIHQINQDRIVVIDFESTDELGFNSIYSLII 120

Query: 121 EIMGKHSNIILVDRAENKIIESIKHVGFSQNSYRTILPGSTYIEPPKTAAVNPFTITD-- 178
EIMG+HSN+ L+ + +N I++SIKH+ N+YR+I PG Y+ PPK+ +NPF +
Sbjct: 121 EIMGRHSNMTLIRKRDNIIMDSIKHITPDINTYRSIYPGIEYVYPPKSPKLNPFDFSYDM 180

Query: 179 VPLFEILQTQELTVKSLQQHFQGLGRDTAKELAELLTTDKLKR---------------FR 223
+ F + +L + F G+ + + E+ L + + F+
Sbjct: 181 IENFTKENSLQLNDNIFSKIFTGVSKTLSSEICFRLKNNSIDLSLSNLKEIVEVCKDLFK 240

Query: 224 EFFARPTQANLTTASFAPVLF---------SDSHATFETLSDMLDHFYQDKAERDRINQQ 274
E + + N T + + V F +++ S +L++FY K + DR+ +
Sbjct: 241 EIQSNKFEFNCYTKNNSFVGFYCLNLMSKEDYKKIQYDSSSKLLENFYYAKDKSDRLKSK 300

Query: 275 ASDLIHRVQTELDKNRNKLSKQEAELLATENAELFRQKGELLTTYLSLVPNNQDSVILDN 334
+SDL V +++ K L E+ ++F+ GELLT + + + L N
Sbjct: 301 SSDLQKIVMNNINRCTKKDKILNNTLKKCEDKDIFKLYGELLTANIYALKKGLSHIELAN 360

Query: 335 YYT--GEKIEIALDKALTPNQNAQRYFKKYQKLKEAVKHLSGLIADTKQSITYFESVDYN 392
YY+ + ++I LD+ TP+QN Q Y+KKY KLK++ + + + ++ + Y SV N
Sbjct: 361 YYSENYDTVKITLDENKTPSQNVQSYYKKYNKLKKSEEAANEQLLQNEEELNYLYSVLTN 420

Query: 393 LSQA-SIDDIEDIREELYQAGFLKSRQ--RDKRHKRKKPEQYLASDGTTILMVGRNNLQN 449
++ A + D+IE+I++EL + G++K ++ + K+ K KP +++ DG I VG+NN+QN
Sbjct: 421 INNADNYDEIEEIKKELIETGYIKFKKIYKSKKSKTSKPMHFISKDGIDIY-VGKNNIQN 479

Query: 450 EELTFKMAKKGELWFHAKDIPGSHVIIKDNLDPSDEVKTDAAELAAYYSKARLSNLVQVD 509
+ LT K A K ++WFH K+IPGSHVI+K+ +D + +AA LAAYYSK++ S+ V VD
Sbjct: 480 DYLTLKFANKHDIWFHTKNIPGSHVIVKNIMDIPESTLLEAANLAAYYSKSQNSSNVPVD 539

Query: 510 MIEAKKLHKPSGAKPGFVTYTGQKTLRVTPDQAKILS 546
E K + KP+GAKPG V Y+ +T+ VTP + +
Sbjct: 540 YTEVKNVKKPNGAKPGMVIYSTNQTIYVTPTNPNLKN 576


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score   E-value   Comments
SPy_1012   ANTHRAXTOXNA   31      0.007     Anthrax toxin LF subunit signature.
		>ANTHRAXTOXNA#Anthrax toxin LF subunit signature.

Length = 800

Score = 31.3 bits (70), Expect = 0.007
Identities = 43/178 (24%), Positives = 81/178 (45%), Gaps = 20/178 (11%)

Query: 25 KALKEDDADSLIALGEYLESIGFLPHAKRIYLQLADDYPELNINLAQIAAEDDAIEEAF- 83
+ L E++ +S+ + GE + P A R + + P+L IN+ A + +E +
Sbjct: 118 QDLSEEEKNSMNSRGEKV------PFASRFVFEKKRETPKLIINIKDYAINSEQSKEVYY 171

Query: 84 -----LYLDKVSKDS---PNYLSALLVMADLYDMEGLTEVAREKLLQAVGISPEPLVIFG 135
+ LD +SKD P +L+ + ++D D + + +K + + ++ + + I
Sbjct: 172 EIGKGISLDIISKDKSLDPEFLNLIKSLSD--DSDSSDLLFSQKFKEKLELNNKSIDINF 229

Query: 136 LAEIDMSLQH-FKEAIDYYAQLDNRQILELTGISTYQRIGRAYASLGKFEAAIEFLEK 192
+ E QH F A YY D+R +LEL ++ + + G FE E L+K
Sbjct: 230 IKENLTEFQHAFSLAFSYYFAPDHRTVLELYAPDMFEYMNK--LEKGGFEKISESLKK 285


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score   E-value   Comments
SPy_1008   BACTRLTOXIN    93      7e-25     Bacterial toxin signature.
		>BACTRLTOXIN#Bacterial toxin signature.

Length = 266

Score = 93.1 bits (231), Expect = 7e-25
Identities = 56/218 (25%), Positives = 96/218 (44%), Gaps = 24/218 (11%)

Query: 37 TTNRHNLESLYKHDSNLIEADSIKNSPDIVTSHMLKYSVKDKNLSVF------FEKDWIS 90
T N++ LY D + + A +K S D +H L Y++ DK L + + ++
Sbjct: 45 TGTMGNMKYLY--DDHYVSATKVK-SVDKFLAHDLIYNISDKKLKNYDKVKTELLNEDLA 101

Query: 91 QEFKDKEVDIYAL---------SAQEVCECPGKRYEAFGGITLTN----SEKKEIKVPVN 137
+++KD+ VD+Y S V + G + +GGIT V V
Sbjct: 102 KKYKDEVVDVYGSNYYVNCYFSSKDNVGKVTGGKTCMYGGITKHEGNHFDNGNLQNVLVR 161

Query: 138 VWDKSKQQPPMFITVNKPKVTAQEVDIKVRKLLIKKYDIYNNREQKYSKGTVTLDLNSGK 197
V++ + + +K VTAQE+DIK R LI K ++Y Y G + N+G
Sbjct: 162 VYENKRNTISFEVQTDKKSVTAQELDIKARNFLINKKNLYEFNSSPYETGYIKFIENNGN 221

Query: 198 DIVFDLYYFGNGDF--NSMLKIYSNNERIDSTQFHVDV 233
+D+ F + L +Y++N+ +DS ++V
Sbjct: 222 TFWYDMMPAPGDKFDQSKYLMMYNDNKTVDSKSVKIEV 259


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score   E-value   Comments
SPy_1007   BACTRLTOXIN    111     3e-32     Bacterial toxin signature.
		>BACTRLTOXIN#Bacterial toxin signature.

Length = 266

Score = 111 bits (280), Expect = 3e-32
Identities = 65/234 (27%), Positives = 101/234 (43%), Gaps = 35/234 (14%)

Query: 8 NLRNLYSTYDPTEVKGKINEGPPFSGSLFYK--NIPYGNSSIELKVELNSVEKANFFSGK 65
N++ LY + + K K + + L Y + N ++K EL + + A + +
Sbjct: 50 NMKYLYDDHYVSATKVK-SVDKFLAHDLIYNISDKKLKNYD-KVKTELLNEDLAKKYKDE 107

Query: 66 RVDIFTLEYSPPCNSNIKKNS----------YGGITLSDGNRID---KKNIPVNIFIDGV 112
VD++ Y C + K N YGGIT +GN D +N+ V ++ +
Sbjct: 108 VVDVYGSNYYVNCYFSSKDNVGKVTGGKTCMYGGITKHEGNHFDNGNLQNVLVRVYENKR 167

Query: 113 QQKYSYTDISTVSTDKKEVTIQELDVKSRYYLQKHFNIYGFGDVKDFGRSSRFQSGFEEG 172
T V TDKK VT QELD+K+R +L N+Y F S +E G
Sbjct: 168 N-----TISFEVQTDKKSVTAQELDIKARNFLINKKNLYEFNS-----------SPYETG 211

Query: 173 NIIFHLNSGERISYNLFDT--GHGDRESMLKKYSDNKTAYSDQLHIDIYLVKFN 224
I F N+G Y++ D+ L Y+DNKT S + I+++L N
Sbjct: 212 YIKFIENNGNTFWYDMMPAPGDKFDQSKYLMMYNDNKTVDSKSVKIEVHLTTKN 265


35   SPy_0711   SPy_0701   N   Y   N   Pathogenicity Island (unbiased-composition)
LocusTag   DNBias   CDNBias   %GCBias     Product
SPy_0711   1        23        -0.423341   pyrogenic exotoxin C precursor, phage
SPy_0710   4        26        3.675957    conserved hypothetical protein, phage
SPy_0707   4        28        3.405423    putative holin, phage associated
SPy_0706   3        29        3.423399    hypothetical protein
SPy_0705   3        28        3.353472    hypothetical protein
SPy_0703   2        23        3.528395    hypothetical protein
SPy_0702   2        23        3.548978    hypothetical protein
SPy_0701   0        21        2.850424    hyaluronidase, phage associated
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score   E-value   Comments
SPy_0711   BACTRLTOXIN    171     4e-55     Bacterial toxin signature.
		>BACTRLTOXIN#Bacterial toxin signature.

Length = 266

Score = 171 bits (435), Expect = 4e-55
Identities = 60/263 (22%), Positives = 122/263 (46%), Gaps = 32/263 (12%)

Query: 1 MKKINIIKIVFIITVILISTISPIIKSDSKKD-----------ISNVKSDLLYAYTITPY 49
M K I V +I +++ +P + ++S+ D + ++ Y Y
Sbjct: 1 MYKRLFISRVILIFALILVISTPNVLAESQPDPMPDDLHKSSEFTGTMGNMKYLYDDHYV 60

Query: 50 DYKNCR-VNFSTTHTL--NIDTQKYRGKDYYISSEMSYEASQKFKRDDHVDVFGLFYILN 106
+ V+ H L NI +K + D + ++ + ++K+K D+ VDV+G Y +N
Sbjct: 61 SATKVKSVDKFLAHDLIYNISDKKLKNYDKVKTELLNEDLAKKYK-DEVVDVYGSNYYVN 119

Query: 107 SHTGEY------------IYGGITPAQNNKVNHKLLGNLFIS-GESQQN-LNNKIILEKD 152
+ +YGGIT + N ++ L N+ + E+++N ++ ++ +K
Sbjct: 120 CYFSSKDNVGKVTGGKTCMYGGITKHEGNHFDNGNLQNVLVRVYENKRNTISFEVQTDKK 179

Query: 153 IVTFQEIDFKIRKYLMDNYKIYD-ATSPYVSGRIEIGTKDGKHEQIDLFDSPNEG-TRSD 210
VT QE+D K R +L++ +Y+ +SPY +G I+ +G D+ +P + +S
Sbjct: 180 SVTAQELDIKARNFLINKKNLYEFNSSPYETGYIKFIENNGNTFWYDMMPAPGDKFDQSK 239

Query: 211 IFAKYKDNRIINMKNFSHFDIYL 233
Y DN+ ++ K+ +++L
Sbjct: 240 YLMMYNDNKTVDSKS-VKIEVHL 261


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score   E-value   Comments
SPy_0710   FLGFLGJ        91      9e-23     Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 91.3 bits (226), Expect = 9e-23
Identities = 44/125 (35%), Positives = 63/125 (50%), Gaps = 8/125 (6%)

Query: 23 SLTAAQAILESGWGKHA-------PHNALFGIKADSSWTGKSFDTKTQEEYQAGVVTDIV 75
L AQA LESGWG+ P LFG+KA +W G + T E Y+ G +
Sbjct: 172 HLILAQAALESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTE-YENGEAKKVK 230

Query: 76 DRFRAYDSWDESIADHGQFLVDNPRYEAVIGETDYKKACYAIKAAGYATASSYVELLIQL 135
+FR Y S+ E+++D+ L NPRY AV ++ A++ AGYAT Y L +
Sbjct: 231 AKFRVYSSYLEALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNM 290

Query: 136 IEEND 140
I++
Sbjct: 291 IQQMK 295


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score   E-value   Comments
SPy_0702   CHANLCOLICIN   29      0.048     Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 29.3 bits (65), Expect = 0.048
Identities = 30/157 (19%), Positives = 64/157 (40%), Gaps = 17/157 (10%)

Query: 182 QAEIKASAQGLSQKYDDELRKLSAKITTTSSGTTEAYESKLAGLRAEFTR-----SNQGT 236
QA+ KA+ L+Q+ D + + + + TE + A ++AE R + +
Sbjct: 80 QAKAKANRDALTQRLKDIVNEALRHNASRTPSATELAHANNAAMQAEDERLRLAKAEEKA 139

Query: 237 RTELESQISGLR----------AVQQSTASQI--SQEIRDREGAVSRVQQSLESYQRRMQ 284
R E E+ + + T Q+ ++ R A+S +++E Q+++
Sbjct: 140 RKEAEAAEKAFQEAEQRRKEIEREKAETERQLKLAEAEEKRLAALSEEAKAVEIAQKKLS 199

Query: 285 DAEENYSSLTHTVRGLQSDVGSPTGKIQSRLTQLAGQ 321
A+ + ++ L S + S + + LAG+
Sbjct: 200 AAQSEVVKMDGEIKTLNSRLSSSIHARDAEMKTLAGK 236


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score   E-value   Comments
SPy_0701   PF07212        506     0.0       Hyaluronoglucosaminidase
		>PF07212#Hyaluronoglucosaminidase

Length = 336

Score = 506 bits (1304), Expect = 0.0
Identities = 259/343 (75%), Positives = 287/343 (83%), Gaps = 15/343 (4%)

Query: 1 MSENIPLRVQFKRMKAAEWARSDVILLESEIGFETDTGFARAGDGHNRFSDLGYISPLDY 60
M+E IPLRVQFKRM A EW RSDVILLESEIGFETDTG+A+ GDG N+FS L Y+
Sbjct: 1 MTETIPLRVQFKRMTAEEWTRSDVILLESEIGFETDTGYAKFGDGKNQFSKLKYL----- 55

Query: 61 NLLTNKPNIDGLATKVETAQKLQQ----KADKETVYTKAESKQELDKKLNLKGGVMTGQL 116
NKP++ A K ET K+ + KADK VY KAESK ELDKKLNLKGGVMTGQL
Sbjct: 56 ----NKPDLGAFAQKEETNSKITKLESSKADKNAVYLKAESKIELDKKLNLKGGVMTGQL 111

Query: 117 KFKPAAT-VAYSSSTGGAVNIDLSSTRGAGVVVYSDNDTSDGPLMSLRTGKETFNQSALF 175
+FKP + + SSS GGA+NID+S + GAGVVVYS+NDTSDGPLMSLRTGKETFNQSALF
Sbjct: 112 QFKPNKSGIKPSSSVGGAINIDMSKSEGAGVVVYSNNDTSDGPLMSLRTGKETFNQSALF 171

Query: 176 VDYKGTTNAVNIAMRQPTTPNFSSALNITSGNENGSAMQLRGSEKALGTLKITHENPSIG 235
VDY G TNAVNIAMRQPTTPNFSSALNITSGNENGSAMQ+RG EKALGTLKITHENP++
Sbjct: 172 VDYSGKTNAVNIAMRQPTTPNFSSALNITSGNENGSAMQIRGVEKALGTLKITHENPNVE 231

Query: 236 ADYDKNAAALSIDIVKKTNGA-GTAAQGIYINSTSGTTGKLLRIRNLSDDKFYVKSDGGF 294
A+YD+NAAALSIDIVKK G GTAAQGIYINSTSGTTGKLLRIRNL DDKFYVK DGGF
Sbjct: 232 ANYDENAAALSIDIVKKQKGGKGTAAQGIYINSTSGTTGKLLRIRNLGDDKFYVKHDGGF 291

Query: 295 YAKETSQIDGNLKLKDPTANDHAATKAYVDKAISELKKLILKK 337
YAK+TSQIDGNLKLK+PTA+DHAATKAYVD + +LK L++ K
Sbjct: 292 YAKKTSQIDGNLKLKNPTADDHAATKAYVDSEVKKLKALLMDK 334


36   SPy_0351   SPy_0341   N   Y   N   Pathogenicity Island (unbiased-composition)
LocusTag   DNBias   CDNBias   %GCBias     Product
SPy_0351   -1       11        -1.577742   hypothetical protein
SPy_0349   1        12        -2.316230   putative transcription elongation factor
SPy_0348   -1       10        -1.379690   putative aminodeoxychorismate lyase
SPy_0346   -1       10        -0.747703   putative arylalkylamine n-acetyltransferase
SPy_0345   0        11        -0.963261   putative UDP-N-acetyl muramate-alanine ligase
SPy_0343   1        12        -0.798419   hypothetical protein
SPy_0342   1        13        -1.130642   putative SNF helicase
SPy_0341   1        15        -0.385568   putative phosphoglycerate dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score   E-value   Comments
SPy_0351   60KDINNERMP    136     1e-38     60kDa inner membrane protein signature.
		>60KDINNERMP#60kDa inner membrane protein signature.

Length = 548

Score = 136 bits (344), Expect = 1e-38
Identities = 70/231 (30%), Positives = 116/231 (50%), Gaps = 22/231 (9%)

Query: 38 WEFLGKPMSYFIDYFANNAGLGYGLAIIIVTIIVRTLILPLGLYQSWKASYQS-EKMAFL 96
F+ +P+ + + + G +G +III+T IVR ++ PL KA Y S KM L
Sbjct: 333 LWFISQPLFKLLKWIHSFVG-NWGFSIIIITFIVRGIMYPLT-----KAQYTSMAKMRML 386

Query: 97 KPVFEPINKRIKQANSQEEKMAAQTELMAAQRAHGINPLGGIGCLPLLIQMPFFSAMYFA 156
+P + + +R+ ++K E+MA +A +NPLGG C PLLIQMP F A+Y+
Sbjct: 387 QPKIQAMRERLG-----DDKQRISQEMMALYKAEKVNPLGG--CFPLLIQMPIFLALYYM 439

Query: 157 AQYTKGVSTSTFMG--IDLGSR--SLVLTAIIAALYFFQSWLSMMAVSEEQREQMKTMMY 212
+ + + F DL ++ +L ++ FF +S V++ + +M
Sbjct: 440 LMGSVELRQAPFALWIHDLSAQDPYYILPILMGVTMFFIQKMSPTTVTDPM---QQKIMT 496

Query: 213 TMPIMMIFMSFSLPAGVGLYWLVGGFFSIIQQ-LITTYLLKPRLHKQIKEE 262
MP++ P+G+ LY++V +IIQQ LI L K LH + K++
Sbjct: 497 FMPVIFTVFFLWFPSGLVLYYIVSNLVTIIQQQLIYRGLEKRGLHSREKKK 547


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score   E-value   Comments
SPy_0346   SACTRNSFRASE   32      5e-04     Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 32.2 bits (73), Expect = 5e-04
Identities = 26/131 (19%), Positives = 47/131 (35%), Gaps = 29/131 (22%)

Query: 35 EHIRLIPDTFLVALIDQEIVGYIEGPVVTTPILEDSLFHGVTKNPKTGGYIAITSLSIAK 94
++ + ++ +G I+ + N GY I +++AK
Sbjct: 58 SYVEEEGKAAFLYYLENNCIGRIK----------------IRSN--WNGYALIEDIAVAK 99

Query: 95 HFQQQGVGTALLAALKDLVVAQQRTGLILTCHDYLIS---YYEMNGFINQGISESQHGGT 151
++++GVGTALL + GL+L D IS +Y + FI + +
Sbjct: 100 DYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGAVDTMLYSNF 159

Query: 152 --------LWY 154
WY
Sbjct: 160 PTANEIAIFWY 170


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score   E-value   Comments
SPy_0345   ACETATEKNASE   31      0.008     Acetate kinase family signature.
		>ACETATEKNASE#Acetate kinase family signature.

Length = 400

Score = 31.3 bits (71), Expect = 0.008
Identities = 16/55 (29%), Positives = 24/55 (43%), Gaps = 9/55 (16%)

Query: 304 IINDTII--IDDFA-----HHPTEIVATIDAARQKYPSKEIVAIFQPHTFTRTIA 351
+I D ++ I D H+P I I A Q P +VA+F F +T+
Sbjct: 103 LITDDVLKAITDCIELAPLHNPANIEG-IKACTQIMPDVPMVAVFDT-AFHQTMP 155


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score   E-value   Comments
SPy_0341   TCRTETOQM      37      1e-04     Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 37.1 bits (86), Expect = 1e-04
Identities = 21/87 (24%), Positives = 40/87 (45%), Gaps = 8/87 (9%)

Query: 36 GVTRDRIYATGEWLNRQFSLIDTGGIDDVDAPFMEQIKHQAQIAMEEADVIVFVVSGKEG 95
G+T + +W N + ++IDT G D F+ ++ ++ D + ++S K+G
Sbjct: 53 GITIQTGITSFQWENTKVNIIDTPGHMD----FLAEVYR----SLSVLDGAILLISAKDG 104

Query: 96 VTDADEYVSKILYRTNTPVILAVNKVD 122
V + L + P I +NK+D
Sbjct: 105 VQAQTRILFHALRKMGIPTIFFINKID 131


37   SPy_0109   SPy_0102   N   Y   N   Pathogenicity Island (unbiased-composition)
LocusTag   DNBias   CDNBias   %GCBias     Product
SPy_0109   -1       14        -1.740815   acetate kinase
SPy_0108   1        21        -2.830133   hypothetical protein
SPy_0107   1        21        -3.784836   hypothetical protein
SPy_0106   1        19        -3.208226   putative competence protein
SPy_0105   1        19        -3.246287   hypothetical protein
SPy_0104   1        18        1.017440    putative competence protein
SPy_0103   5        27        2.363528    putative competence protein
SPy_0102   3        22        2.324746    putative competence protein, ABC transporter
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits           Score   E-value   Comments
SPy_0109   ACETATEKNASE   502     e-180     Acetate kinase family signature.
		>ACETATEKNASE#Acetate kinase family signature.

Length = 400

Score = 502 bits (1293), Expect = e-180
Identities = 209/401 (52%), Positives = 281/401 (70%), Gaps = 7/401 (1%)

Query: 3 KTIAINAGSSSLKWQLYQMPEEAVLAQGIIERIGLKDSISTVKYDGKKEEQILDIHDHTE 62
K + IN GSSSLK+QL + + VLA+G+ ERIG+ DS+ T +G+K + D+ DH +
Sbjct: 2 KILVINCGSSSLKYQLIESKDGNVLAKGLAERIGINDSLLTHNANGEKIKIKKDMKDHKD 61

Query: 63 AVKILLNDLI--HFGIIAAYDEITGVGHRVVAGGELFKESVVVNDKVLEQIEELSVLAPL 120
A+K++L+ L+ +G+I EI VGHRVV GGE F SV++ D VL+ I + LAPL
Sbjct: 62 AIKLVLDALVNSDYGVIKDMSEIDAVGHRVVHGGEYFTSSVLITDDVLKAITDCIELAPL 121

Query: 121 HNPGAAAGIRAFRDILPDITSVCVFDTSFHTSMAKHTYLYPIPQKYYTDYKVRKYGAHGT 180
HNP GI+A I+PD+ V VFDT+FH +M + YLYPIP +YYT YK+RKYG HGT
Sbjct: 122 HNPANIEGIKACTQIMPDVPMVAVFDTAFHQTMPDYAYLYPIPYEYYTKYKIRKYGFHGT 181

Query: 181 SHKYVAQEAAKMLGRPLEELKLITAHIGNGVSITANYHGKSVDTSMGFTPLAGPMMGTRS 240
SHKYV+Q AA++L +P+E LK+IT H+GNG SI A +GKS+DTSMGFTPL G MGTRS
Sbjct: 182 SHKYVSQRAAEILNKPIESLKIITCHLGNGSSIAAVKNGKSIDTSMGFTPLEGLAMGTRS 241

Query: 241 GDIDPAIIPYLIEQDPELKDAADVVNMLNKKSGLSGVSGISSDMRDI-EAGLQEDNPDAV 299
G IDP+II YL+E+ E A +VVN+LNKKSG+ G+SGISSD RD+ +A + + A
Sbjct: 242 GSIDPSIISYLMEK--ENISAEEVVNILNKKSGVYGISGISSDFRDLEDAAFKNGDKRAQ 299

Query: 300 LAYNIFIDRIKKCIGQYFAVLNGADALVFTAGMGENAPLMRQDVIGGLTWFGMDIDPEKN 359
LA N+F R+KK IG Y A + G D +VFTAG+GEN P +R+ ++ GL + G +D EKN
Sbjct: 300 LALNVFAYRVKKTIGSYAAAMGGVDVIVFTAGIGENGPEIREFILDGLEFLGFKLDKEKN 359

Query: 360 -VFGYRGDISTPESKVKVLVISTDEELCIARDVERL-KNTK 398
V G IST +SKV V+V+ T+EE IA+D E++ ++ K
Sbjct: 360 KVRGEEAIISTADSKVNVMVVPTNEEYMIAKDTEKIVESLK 400
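
The E-values reported in the hit tables of this part of the report can be checked against the RPSBlast E-value cutoff of 0.05 listed under Input parameters. The following is a minimal Python sketch, not PredictBias code. It assumes that the cutoff is inclusive and that a bare exponent such as e-180 stands for 1e-180; `parse_evalue` and `passes_cutoff` are hypothetical helper names.

```python
# Hit rows (locus tag, signature, E-value text) as listed in this part of the report.
HITS = [
    ("SPy_0341", "TCRTETOQM", "1e-04"),
    ("SPy_0109", "ACETATEKNASE", "e-180"),
    ("SPy_0106", "OMPTIN", "0.012"),
    ("SPy_0103", "BCTERIALGSPG", "4e-12"),
    ("SPy_0102", "BCTERIALGSPF", "5e-22"),
]

# Cutoff taken from the "E-value (RPSBlast)" input parameter of the report.
EVALUE_CUTOFF = 0.05


def parse_evalue(text):
    """Convert an E-value string to float.

    Assumption: a value printed without a mantissa ("e-180") means "1e-180".
    """
    text = text.strip()
    if text.startswith("e"):
        text = "1" + text
    return float(text)


def passes_cutoff(text, cutoff=EVALUE_CUTOFF):
    """True if the E-value is at or below the cutoff (inclusive by assumption)."""
    return parse_evalue(text) <= cutoff


for locus, signature, evalue in HITS:
    print(locus, signature, evalue, passes_cutoff(evalue))
```

All five hits listed here fall at or below the 0.05 cutoff under these assumptions.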


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_0106  OMPTIN        28     0.012    Omptin serine protease signature.
		>OMPTIN#Omptin serine protease signature.

Length = 317

Score = 28.0 bits (62), Expect = 0.012
Identities = 17/71 (23%), Positives = 26/71 (36%), Gaps = 9/71 (12%)

Query: 37  LLKHSHYLARHDQDNWLLFSHQL--REELSGARFYKVADNK-LYVEKGKKVLAFGQFKSH 93
             K+S ++   D D       ++  R ++    +Y VA N   YV    KV   G +
Sbjct: 217 TFKYSGWVESSDNDEHYDPGKRITYRSKVKDQNYYSVAVNAGYYVTPNAKVYVEGAWNRV 276

Query: 94  DFRKSASNGKG 104
                  N KG
Sbjct: 277 T------NKKG 281


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_0103  BCTERIALGSPG  53     4e-12    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 52.6 bits (126), Expect = 4e-12
Identities = 28/94 (29%), Positives = 50/94 (53%), Gaps = 4/94 (4%)

Query: 9 RHKKLKGFTLLEMLLVILVISVLMLLFVPNLSKQKDRVTETGNAAVVKLVENQAELYELS 68
K +GFTLLE+++VI++I VL L VPNL K++ + + + +EN ++Y+L
Sbjct: 3 ATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLD 62

Query: 69 QGSKPSLSQ-LKA--DGSITEKQEKAY-QDYYDK 98
P+ +Q L++ + Y ++ Y K
Sbjct: 63 NHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIK 96


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SPy_0102  BCTERIALGSPF  88     5e-22    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 88.0 bits (218), Expect = 5e-22
Identities = 59/291 (20%), Positives = 118/291 (40%), Gaps = 20/291 (6%)

Query: 4 SLLKGQGLADMLSGLG--FSDAILTQISLADRHGNIETTLVAIQHYLNQMARIRRKTVEV 61
+++G LAD + F ++ + G+++ L + Y Q ++R + +
Sbjct: 113 KVMEGHSLADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQA 172

Query: 62 ITYPLILLLFLFVMMLGLRRYLVPQLETQNQ---------------ITYFLNHFPAFFIG 106
+ YP +L + ++ L +VP++ Q ++ + F + +
Sbjct: 173 MIYPCVLTVVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLL 232

Query: 107 FCSGLILLFGMVWLRWRSQSRLKLYSRLSRYPFLGKLLKQYLTSYYAREWGTLIGQGLDL 166
+ F + LR + + R+ + RL P +G++ + T+ YAR L + L
Sbjct: 233 ALLAGFMAFRV-MLR-QEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPL 290

Query: 167 MTILDIMAIEKSSL-MKELAEDIRMSLLEGQAFHIKVATYPFFKKELSLMIEYGEIKSKL 225
+ + I S+ + ++ EG + H + F + MI GE +L
Sbjct: 291 LQAMRISGDVMSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGEL 350

Query: 226 GAELEIYAQESWEQFFSQLYQVTQLIQPAIFLVVAVTIVMIYAAILLPIYQ 276
+ LE A +F SQ+ L +P + + +A ++ I AIL PI Q
Sbjct: 351 DSMLERAADNQDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQ 401



Score = 34.8 bits (80), Expect = 3e-04
Identities = 32/129 (24%), Positives = 60/129 (46%), Gaps = 6/129 (4%)

Query: 154 REWGTLIGQGLDLMTILDIMAIE-KSSLMKELAEDIRMSLLEGQAFHIKVATYP-FFKKE 211
R+ TL+ + L LD +A + + + +L +R ++EG + + +P F++
Sbjct: 75 RQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLADAMKCFPGSFERL 134

Query: 212 LSLMIEYGEIKSKLGAELEIYA--QESWEQFFSQLYQVTQLIQPAIFLVVAVTIVMIYAA 269
M+ GE L A L A E +Q S++ Q +I P + VVA+ +V I +
Sbjct: 135 YCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQA--MIYPCVLTVVAIAVVSILLS 192

Query: 270 ILLPIYQNM 278
+++P
Sbjct: 193 VVVPKVVEQ 201
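
The percentages on the "Identities / Positives / Gaps" summary lines in this report appear to be the count divided by the alignment length, rounded down to a whole percent (for example 40/87 is reported as 45%, not 46%). The Python sketch below re-derives them from three summary lines copied from the report. The rounding-down rule is an assumption inferred from these numbers, and `parse_summary`, `truncated_percent` and `check` are hypothetical helper names.

```python
import re

# Summary lines copied verbatim from the report.
SUMMARY_LINES = [
    "Identities = 16/55 (29%), Positives = 24/55 (43%), Gaps = 9/55 (16%)",
    "Identities = 21/87 (24%), Positives = 40/87 (45%), Gaps = 8/87 (9%)",
    "Identities = 32/129 (24%), Positives = 60/129 (46%), Gaps = 6/129 (4%)",
]

# Matches e.g. "Positives = 24/55 (43%)".
FIELD = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_summary(line):
    """Return {field: (count, alignment_length, reported_percent)}."""
    return {
        name: (int(count), int(length), int(pct))
        for name, count, length, pct in FIELD.findall(line)
    }


def truncated_percent(count, length):
    """Whole-number percentage, rounded down (assumed rule)."""
    return 100 * count // length


def check(line):
    """True if every reported percentage matches the rounded-down value."""
    return all(
        truncated_percent(count, length) == pct
        for count, length, pct in parse_summary(line).values()
    )


for line in SUMMARY_LINES:
    print(check(line), line)
```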



 
Contact Sachin Pundhir for Bugs/Comments.