PredictBias

Identification of genomic and pathogenicity islands in prokaryotic genomes
 
A) Input parameters
Genome: 2441.gbk
Threshold dinucleotide bias: 2
Threshold codon bias: 4
Threshold %GC bias: 3
E-value (RPSBlast): 0.05
Genome (non-pathogenic):
 
 
C) Potential GIs and PAIs in NC_010582
S.No  Start      End        Bias  Virulence  Insertion elements  Prediction
1     SPCG_0001  SPCG_0020  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
SPCG_0001  -1      15       -4.451660   chromosomal replication initiation protein
SPCG_0002  0       17       -4.606237   DNA polymerase III subunit beta
SPCG_0003  0       18       -4.408705   hypothetical protein
SPCG_0004  0       19       -5.003728   GTP-dependent nucleic acid-binding protein EngD
SPCG_0005  1       22       -5.944586   peptidyl-tRNA hydrolase
SPCG_0006  1       22       -5.954545   transcription-repair coupling factor
SPCG_0007  0       22       -4.850976   S4 domain-containing protein
SPCG_0008  -1      21       -4.691417   hypothetical protein
SPCG_0009  -1      22       -4.616155   hypothetical protein
SPCG_0010  -1      20       -3.772234   mesJ/ycf62 family protein
SPCG_0011  0       22       -2.923634   hypoxanthine-guanine phosphoribosyltransferase
SPCG_0012  0       19       -2.319940   cell division protein FtsH
SPCG_0013  4       21       -5.112967   transcriptional regulator ComX1
SPCG_0014  2       25       -2.481287   ***IS630-Spn1, transposase Orf1
SPCG_0015  2       27       -1.090407   hypothetical protein
SPCG_0016  2       28       -0.086908   hypothetical protein
SPCG_0017  2       28       0.402249    hypothetical protein
SPCG_0018  0       26       2.639635    hypothetical protein
SPCG_0019  0       24       3.177917    adenylosuccinate synthetase
SPCG_0020  -3      27       3.471379    cytidine/deoxycytidylate deaminase family
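The per-gene rows of this table can be split into fields mechanically. The following is a minimal sketch, not part of PredictBias itself. It assumes that DNBias is a single signed digit, CDNBias is a two-digit integer, and %GCBias is a signed number with six decimal places; the page does not state these widths, they are inferred from the values listed here. `parse_gene_row` is a hypothetical helper name. Whitespace between fields is optional in the pattern, so rows with or without column separators are accepted.

```python
import re

# Assumed field widths (inferred from this page, not documented by the tool):
#   DNBias  : one digit, optional minus sign
#   CDNBias : exactly two digits
#   %GCBias : optional minus sign, digits, a dot, exactly six decimals
ROW_RE = re.compile(
    r"^(?P<locus>SPCG_\d{4})\s*"
    r"(?P<dn>-?\d)\s*"
    r"(?P<cdn>\d{2})\s*"
    r"(?P<gc>-?\d+\.\d{6})\s*"
    r"(?P<product>.*)$"
)


def parse_gene_row(line):
    """Split one gene row into locus tag, three bias values, and product."""
    match = ROW_RE.match(line.strip())
    if match is None:
        raise ValueError(f"row does not fit the assumed layout: {line!r}")
    return {
        "locus_tag": match.group("locus"),
        "dn_bias": int(match.group("dn")),
        "cdn_bias": int(match.group("cdn")),
        "gc_bias": float(match.group("gc")),
        "product": match.group("product"),
    }


if __name__ == "__main__":
    print(parse_gene_row("SPCG_0013421-5.112967transcriptional regulator ComX1"))
```

The six-decimal assumption is what keeps product names that begin with a digit (for example "30S ribosomal protein S12") from being absorbed into the %GCBias value.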
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_0012  HTHFIS        36     6e-04    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 35.6 bits (82), Expect = 6e-04
Identities = 21/88 (23%), Positives = 33/88 (37%), Gaps = 18/88 (20%)

Query: 217 ARIPAGVLLEGPPGTGKTLLAKAV---AGEAGVPFFS-----ISGSDFVEMFVGV----- 263
+ +++ G GTGK L+A+A+ PF + I G
Sbjct: 157 MQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAF 216

Query: 264 -GASRVRS-LFEDAKKAAPAIIFIDEID 289
GA + FE A+ +F+DEI
Sbjct: 217 TGAQTRSTGRFEQAEGGT---LFLDEIG 241
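
Each alignment block carries its statistics in two fixed-format lines (`Score = ...` and `Identities = ...`). The following is a minimal sketch for extracting them and checking a hit against the RPSBlast E-value cutoff of 0.05 listed under the input parameters. The function names are hypothetical, and the regular expressions assume the BLAST text layout shown on this page.

```python
import re

SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
IDENT_RE = re.compile(r"Identities\s*=\s*(\d+)/(\d+)")


def parse_hsp_stats(score_line, identities_line):
    """Extract bit score, raw score, E-value, and identity fraction."""
    s = SCORE_RE.search(score_line)
    i = IDENT_RE.search(identities_line)
    if s is None or i is None:
        raise ValueError("lines do not look like BLAST statistics")
    expect = s.group(3).rstrip(",")
    # Some BLAST versions print very small values as "e-100" with no
    # leading digit; prepend "1" so float() accepts it.
    if expect.startswith("e"):
        expect = "1" + expect
    matches, length = int(i.group(1)), int(i.group(2))
    return {
        "bits": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "evalue": float(expect),
        "identity": matches / length,
    }


def passes_cutoff(stats, max_evalue=0.05):
    """True if the hit is at or below the E-value threshold."""
    return stats["evalue"] <= max_evalue


if __name__ == "__main__":
    stats = parse_hsp_stats(
        "Score = 35.6 bits (82), Expect = 6e-04",
        "Identities = 21/88 (23%), Positives = 33/88 (37%), Gaps = 18/88 (20%)",
    )
    print(stats, passes_cutoff(stats))
```

The default of 0.05 for `max_evalue` mirrors the input parameter on this page; a stricter cutoff can be passed explicitly when only strong matches are of interest.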


2     SPCG_0109  SPCG_0151  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
SPCG_0109  2       20       -4.666578   argininosuccinate synthase, truncation
SPCG_0110  2       25       -7.059513   hypothetical protein
SPCG_0111  6       31       -7.628500   transporter truncation
SPCG_0112  5       30       -8.370068   hypothetical protein
SPCG_0113  4       29       -8.792579   hypothetical protein
SPCG_0114  2       30       -9.769176   hypothetical protein
SPCG_0115  4       24       -5.514598   hypothetical protein
SPCG_0116  1       18       -1.610914   hypothetical protein
SPCG_0117  1       18       -1.226110   hypothetical protein
SPCG_0118  -2      15       2.436458    hypothetical protein
SPCG_0119  -3      17       3.477152    hypothetical protein
SPCG_0120  -2      21       5.195927    surface protein A
SPCG_0121  1       27       6.039884    tRNA-specific 2-thiouridylase MnmA
SPCG_0122  1       25       5.488837    mutT/nudix family protein
SPCG_0123  1       26       5.325603    tRNA uridine 5-carboxymethylaminomethyl
SPCG_0124  1       26       3.589144    metallo-beta-lactamase superfamily protein
SPCG_0125  2       22       1.821516    hypothetical protein
SPCG_0126  0       24       2.520432    hypothetical protein
SPCG_0127  0       24       5.308858    competence-induced protein Ccs1, truncated
SPCG_0128  0       24       4.658591    hypothetical protein
SPCG_0129  -1      25       4.719838    hypothetical protein
SPCG_0130  -2      22       2.555420    hypothetical protein
SPCG_0131  -1      20       0.752957    ribosomal-protein-alanine acetyltransferase
SPCG_0132  1       22       -2.766494   DNA-binding/iron metalloprotein/AP endonuclease
SPCG_0133  2       26       -7.657357   degenerate transposase
SPCG_0134  2       28       -9.051114   degenerate transposase
SPCG_0135  2       34       -12.004864  hypothetical protein
SPCG_0136  1       33       -11.610958  glycosyl transferase family protein
SPCG_0137  1       33       -11.795574  glycosyl transferase family protein
SPCG_0138  1       29       -10.193195  ABC transporter ATP-binding protein
SPCG_0139  1       28       -10.161825  hypothetical protein
SPCG_0140  1       29       -10.630530  hypothetical protein
SPCG_0141  1       29       -9.686101   hypothetical protein
SPCG_0142  2       30       -9.978213   UDP-glucose dehydrogenase
SPCG_0143  1       27       -9.263814   transcriptional regulator
SPCG_0144  2       29       -9.699129   bacteriocin
SPCG_0145  2       26       -8.611182   lantibiotic biosynthesis protein
SPCG_0146  4       21       -5.171890   lantibiotic biosynthesis protein
SPCG_0147  1       13       -3.277529   lantibiotic efflux protein
SPCG_0148  0       15       -0.574373   hypothetical protein
SPCG_0149  -2      13       1.554486    hypothetical protein
SPCG_0150  -1      14       2.213225    hypothetical protein
SPCG_0151  -2      19       3.653874    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_0112  PRTACTNFAMLY  27     0.023    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 26.9 bits (59), Expect = 0.023
Identities = 18/65 (27%), Positives = 23/65 (35%)

Query: 34 GAITGAAYAALAAAGGGGLQLVLASYGLRSALVAGIVKGLGVLGIHIGNAFANTVIRSIA 93
G ITG A +AA G + L A+ A G V G V G + F +
Sbjct: 233 GHITGGRAAGVAAMQGAVVHLQRATIRRGDAPAGGAVPGGAVPGGAVPGGFGPGGFGPVL 292

Query: 94 SAGIG 98
G
Sbjct: 293 DGWYG 297


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_0120  GPOSANCHOR    67     7e-14    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 67.4 bits (164), Expect = 7e-14
Identities = 53/298 (17%), Positives = 98/298 (32%), Gaps = 17/298 (5%)

Query: 11 LASVAILGAGFVASQPTVVRAEEAPVASQSKAEKDYDAAMEKYKAAEEDLKKAEAAQRKY 70
++ +LGAG V + T + A + EK E+ E + +
Sbjct: 23 AVALTVLGAGLVVN--TNEVSAVATRSQTDTLEKVQ----ERADKFEIENNTLKLKNSDL 76

Query: 71 DEDQKKTEEKAKETEEASKRQQAANLKYQLKLREYLKYIQEKNKEK--IAKAEKEMNEAK 128
+ K ++ E E + K L E IQE K + KA +
Sbjct: 77 SFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFS 136

Query: 129 QEEDKEKANLNKVLAKVIPSDRELEKTRQEAEKAKKNIPELKKKVEEAKQKVDAAKQKVD 188
+ + L A + +LEK + A K +E K ++A + +++
Sbjct: 137 TADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELE 196

Query: 189 AEHAKEVAPQAKIAELENQVHRLEQDLKDINESDSEDYVKEGLRAPLQSELDTKKAKLLK 248
+ + + + L L L+ ++ A K
Sbjct: 197 KALEGAMNFSTADSAKIKTLEAEKAALAARKAD---------LEKALEGAMNFSTADSAK 247

Query: 249 LEELSGKIEELDAEIAELEVQLKDAEGNNNVEAYFKEGLEKTTAEKKAELEKAEADLK 306
++ L + L+A AELE L+ A + ++ + LE A +AE E +
Sbjct: 248 IKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQ 305



Score = 67.0 bits (163), Expect = 9e-14
Identities = 65/360 (18%), Positives = 119/360 (33%), Gaps = 49/360 (13%)

Query: 37 ASQSKAEKDYDAAMEKYKAAEEDLKKAEAAQRKYDEDQKKTEEKAKETEEASKRQQAANL 96
A ++ E + + A A + + ++ + + E+A + +
Sbjct: 183 AEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFST 242

Query: 97 KYQLKLREYLKYIQEKNKEKIAKAEKEMNEAKQEEDKEKANLNKVLAKVIPSDRELEKTR 156
K++ + + A+ EK + A + A + + A+ + E
Sbjct: 243 ADSAKIKTLEAEKAAL-EARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLE 301

Query: 157 QEAEKAKKNIPEL----------KKKVEEAKQKVDAAKQKVDAE----HAKEVAPQAKIA 202
+++ N L KK++E QK++ + +A A +
Sbjct: 302 HQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKK 361

Query: 203 ELENQVHRLEQDLKDINESDSEDYVKEGLRAPLQSELDTKKAKLLKLEELSGKIEELDAE 262
+LE + +LE+ K + ++ LR L + + KK LEE + K+ L+
Sbjct: 362 QLEAEHQKLEEQNK------ISEASRQSLRRDLDASREAKKQVEKALEEANSKLAALEKL 415

Query: 263 IAELEVQLKDAEGNNNVEAYFKEGLEKTTAEKKAELEKAEADLKKAVDEPETPAPAPAPA 322
ELE K EK AE +A+LE LK+ + +
Sbjct: 416 NKELEESKKLT--------------EKEKAELQAKLEAEAKALKEKLAKQA--------- 452

Query: 323 PAPTPEAPAPAPAPAPAPKPAPAPKPAPAPKPAPAPKPAPAPKPAPAPAPKPEKPAEKPA 382
E A A + P KP P P KP AP E + P+
Sbjct: 453 -----EELAKLRAGKASDSQTPDAKPGNKAVPGKGQAPQAGTKPNQNKAPMKETKRQLPS 507



Score = 67.0 bits (163), Expect = 1e-13
Identities = 68/383 (17%), Positives = 119/383 (31%), Gaps = 23/383 (6%)

Query: 37 ASQSKAEKDYDAAMEKYKAAEEDLKKAEAAQRKYDEDQKKTEEKAKETEEASKRQQAANL 96
A ++ EK + AM A +K EA + + E+ + S A
Sbjct: 120 ARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIK 179

Query: 97 KYQLKLR-------------EYLKYIQEKNKEKIAKAEKEMNEAKQEEDKEKANLNKVLA 143
+ + E + KI E E + + L +
Sbjct: 180 TLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMN 239

Query: 144 KVIPSDRELEKTRQEAEKAKKNIPELKKKVEEAKQKVDAAKQKVDAEHAKEVAPQAKIAE 203
+++ E + EL+K +E A A K+ A++ A +A+ A+
Sbjct: 240 FSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKAD 299

Query: 204 LENQVHRLEQDLKDINES-DSEDYVKEGLRAPLQSELDTKKAKLLKLEELSGKIEELDAE 262
LE+Q L + + + D+ K+ L A Q + K + L ++
Sbjct: 300 LEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREA 359

Query: 263 IAELEVQLKDAEGNNNVEAYFKEG----LEKTTAEKK---AELEKAEADLKKAVDEPETP 315
+LE + + E N + ++ L+ + KK LE+A + L +
Sbjct: 360 KKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQVEKALEEANSKLAALEKLNKEL 419

Query: 316 APAPAPAPAPTPEAPAPAPAPAPAPK-PAPAPKPAPAPKPAPAPKPAPAPKPAPAPAPKP 374
+ E A A A A K A A + P P P
Sbjct: 420 EESKKLTEKEKAELQAKLEAEAKALKEKLAKQAEELAKLRAGKASDSQTPDAKPGNKAVP 479

Query: 375 EKPAEKPAPAPKPETPKTGWKQE 397
+ P KP K K+
Sbjct: 480 -GKGQAPQAGTKPNQNKAPMKET 501


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_0125  BACINVASINB   25     0.047    Salmonella/Shigella invasin protein B signature.
		>BACINVASINB#Salmonella/Shigella invasin protein B signature.

Length = 593

Score = 24.7 bits (53), Expect = 0.047
Identities = 11/43 (25%), Positives = 22/43 (51%)

Query: 31 SELEGRITARQLVEENRPEYNIEYIELLSDKLLDYEKETGAFE 73
S+LE R+ Q + E++ E I+ + L + ++ T +E
Sbjct: 102 SQLESRLAVWQAMIESQKEMGIQVSKEFQTALGEAQEATDLYE 144


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_0131  SACTRNSFRASE  30     0.002    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 30.3 bits (68), Expect = 0.002
Identities = 21/75 (28%), Positives = 33/75 (44%), Gaps = 7/75 (9%)

Query: 48 LAYDGAEVIGFLAVQENLFE-AEVLQIAVKGAYQGQGIASAL------FAQLPTDKEIFL 100
L Y IG + ++ N A + IAV Y+ +G+ +AL +A+ + L
Sbjct: 69 LYYLENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLML 128

Query: 101 EVRQSNQRAQAFYKK 115
E + N A FY K
Sbjct: 129 ETQDINISACHFYAK 143


3     SPCG_0167  SPCG_0176  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
SPCG_0167  2       21       -1.383762   hypothetical protein
SPCG_0168  3       23       -3.032292   hypothetical protein
SPCG_0169  0       24       -4.750234   hypothetical protein
SPCG_0170  0       25       -4.914936   tetracycline resistance determinant leader
SPCG_0171  -1      24       -4.739460   tetracycline resistance protein TetM
SPCG_0172  0       24       -5.213880   rRNA methylase
SPCG_0173  1       25       -4.858248   resolvase
SPCG_0174  1       21       -4.341351   transposase
SPCG_0175  0       18       -4.098287   hypothetical protein
SPCG_0176  0       17       -3.986584   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_0167  IGASERPTASE   39     7e-05    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 39.3 bits (91), Expect = 7e-05
Identities = 34/197 (17%), Positives = 72/197 (36%), Gaps = 18/197 (9%)

Query: 526 DTKDRMVDTASGLKEQVKDLPTNARYA-VYQGKSKVKENVRDLTSSISQTKADRASG--R 582
D + KE ++ N + V Q S+ KE T + + + +
Sbjct: 1057 DATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVET 1116

Query: 583 KEQQEQRRKT--IAKRRSEMEQVKQKKQPASSVHERPTTRQEQYHDEQTSKQSNIQTSYK 640
++ QE + T ++ ++ + E V+ + +PA PT ++ QT+ ++
Sbjct: 1117 EKTQEVPKVTSQVSPKQEQSETVQPQAEPARE--NDPTVNIKEP-QSQTNTTAD------ 1167

Query: 641 ESQQAKQERPAVKSDFSSPKVERQGNTVQEKTVQKPATSTTTADRTSQRPITKERPSTVQ 700
Q AK+ V+ + GN+V E P +T + + + +P
Sbjct: 1168 TEQPAKETSSNVEQPVTESTTVNTGNSVVE----NPENTTPATTQPTVNSESSNKPKNRH 1223

Query: 701 RVPLQNTRSRPPIKTAT 717
R +++ T +
Sbjct: 1224 RRSVRSVPHNVEPATTS 1240


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_0171  TCRTETOQM     1111   0.0      Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 1111 bits (2876), Expect = 0.0
Identities = 633/639 (99%), Positives = 636/639 (99%)

Query: 6 MKIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGI 65
MKIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGI
Sbjct: 1 MKIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGI 60

Query: 66 TSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMG 125
TSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMG
Sbjct: 61 TSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMG 120

Query: 126 IPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIE 185
IPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIE
Sbjct: 121 IPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIE 180

Query: 186 GNDDLLEKYMSGKSLEALELEQEESIRFQNCSLFPLYHGSAKSNIGIDNLIEVITNKFYS 245
GNDDLLEKYMSGKSLEALELEQEESIRF NCSLFP+YHGSAK+NIGIDNLIEVITNKFYS
Sbjct: 181 GNDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITNKFYS 240

Query: 246 STHRGPSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKIKITEMYTSING 305
STHRG SELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKIKITEMYTSING
Sbjct: 241 STHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKIKITEMYTSING 300

Query: 306 ELCKIDKAYSGEIVILQNEFLKLNSVLGDTKLLPQRERIENPLPLLQTTVEPSKPQQREM 365
ELCKIDKAYSGEIVILQNEFLKLNSVLGDTKLLPQRERIENPLPLLQTTVEPSKPQQREM
Sbjct: 301 ELCKIDKAYSGEIVILQNEFLKLNSVLGDTKLLPQRERIENPLPLLQTTVEPSKPQQREM 360

Query: 366 LLDALLEISDSDPLLRYYVDSATHEIILSFLGKVQMEVTCALLQEKYHVEIEIKEPTVIY 425
LLDALLEISDSDPLLRYYVDSATHEIILSFLGKVQMEVTCALLQEKYHVEIEIKEPTVIY
Sbjct: 361 LLDALLEISDSDPLLRYYVDSATHEIILSFLGKVQMEVTCALLQEKYHVEIEIKEPTVIY 420

Query: 426 MERPLKKAEYTIHIEVPPNPFWASIGLSVAQLPLGSGMQYESSVSLGYLNQSFQNAVMEG 485
MERPLKKAEYTIHIEVPPNPFWASIGLSV+ LPLGSGMQYESSVSLGYLNQSFQNAVMEG
Sbjct: 421 MERPLKKAEYTIHIEVPPNPFWASIGLSVSPLPLGSGMQYESSVSLGYLNQSFQNAVMEG 480

Query: 486 IRYGCEQGLYGWNVTDCKICFKYGLYYSPVSTPADFRMLAPIVLEQVLKKAGTELLEPYL 545
IRYGCEQGLYGWNVTDCKICFKYGLYYSPVSTPADFRMLAPIVLEQVLKKAGTELLEPYL
Sbjct: 481 IRYGCEQGLYGWNVTDCKICFKYGLYYSPVSTPADFRMLAPIVLEQVLKKAGTELLEPYL 540

Query: 546 SFKIYAPQEYLSRAYTDAPKYCANIVDTQLKNNEVILSGEIPARCIQEYRSDLTFFTNGR 605
SFKIYAPQEYLSRAYTDAPKYCANIVDTQLKNNEVILSGEIPARCIQEYRSDLTFFTNGR
Sbjct: 541 SFKIYAPQEYLSRAYTDAPKYCANIVDTQLKNNEVILSGEIPARCIQEYRSDLTFFTNGR 600

Query: 606 SVCLTELKGYHVTTGEPVCQPRRPNSRIDKVRYMFNKIT 644
SVCLTELKGYHVTTGEPVCQPRRPNSRIDKVRYMFNKIT
Sbjct: 601 SVCLTELKGYHVTTGEPVCQPRRPNSRIDKVRYMFNKIT 639


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_0173  HTHTETR       27     0.032    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 27.3 bits (60), Expect = 0.032
Identities = 8/55 (14%), Positives = 29/55 (52%), Gaps = 4/55 (7%)

Query: 133 RGKKGGRPSKGKLSIDLALKMYDSKEY---SIRQILDASKLSKTTFYRYLNKRNA 184
+ K+ + ++ + +D+AL+++ + S+ +I A+ +++ Y + ++
Sbjct: 4 KTKQEAQETRQHI-LDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSD 57


4     SPCG_0196  SPCG_0201  Y     N          N                   Genomic Island
LocusTag   DNBias  CDNBias  %GCBias     Product
SPCG_0196  -1      22       3.598267    hypothetical protein
SPCG_0197  -1      24       4.276564    mccC family protein
SPCG_0198  -1      24       4.360369    hypothetical protein
SPCG_0199  -1      22       3.877181    hypothetical protein
SPCG_0200  0       22       3.753408    magnesium transporter CorA family protein
SPCG_0201  -2      20       3.936342    excinuclease ABC subunit A

5     SPCG_0251  SPCG_0290  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
SPCG_0251  -1      24       -4.208349   hypothetical protein
SPCG_0252  -1      23       -5.200963   hypothetical protein
SPCG_0253  -2      25       -6.807982   hypothetical protein
SPCG_0254  -1      31       -8.204838   hypothetical protein
SPCG_0255  -3      32       -7.959950   hypothetical protein
SPCG_0256  -1      31       -7.397713   pyruvate formate-lyase-activating enzyme
SPCG_0257  0       30       -5.755868   DeoR family transcriptional regulator
SPCG_0258  1       29       -5.135979   transcriptional regulator
SPCG_0259  3       29       -4.096254   PTS system transporter subunit IIA
SPCG_0260  1       18       -1.378321   PTS system transporter subunit IIB
SPCG_0261  0       16       -0.834919   PTS system transporter subunit IIC
SPCG_0262  -1      14       -0.266736   formate acetyltransferase
SPCG_0263  -1      11       1.352030    fructose-6-phosphate aldolase
SPCG_0264  -1      12       2.690722    glycerol dehydrogenase
SPCG_0265  -2      14       3.452183    leucyl-tRNA synthetase
SPCG_0266  1       22       2.710005    acetyltransferase
SPCG_0267  2       21       2.045596    acetyltransferase
SPCG_0268  2       19       2.358238    hypothetical protein
SPCG_0269  0       14       2.549482    Holliday junction DNA helicase RuvB
SPCG_0270  0       14       2.171158    hypothetical protein
SPCG_0271  0       16       3.066434    undecaprenyl pyrophosphate synthase
SPCG_0272  0       15       3.393081    phosphatidate cytidylyltransferase
SPCG_0273  -2      16       3.350148    eep protein
SPCG_0274  -2      16       3.021681    prolyl-tRNA synthetase
SPCG_0275  -2      17       2.901660    glycosyl hydrolase family protein
SPCG_0276  -2      19       2.780758    glucosamine--fructose-6-phosphate
SPCG_0277  -2      20       2.292715    oxidoreductase
SPCG_0278  0       27       1.865551    alkaline amylopullulanase
SPCG_0279  7       39       1.596956    IS1380-Spn1 transposase
SPCG_0280  2       24       3.485991    hypothetical protein
SPCG_0281  2       24       3.556767    hypothetical protein
SPCG_0282  2       23       3.397949    30S ribosomal protein S12
SPCG_0283  1       22       3.280402    30S ribosomal protein S7
SPCG_0284  0       21       3.602487    elongation factor G
SPCG_0285  -2      18       3.776211    DNA polymerase III PolC
SPCG_0286  -1      18       3.142958    hypothetical protein
SPCG_0287  0       20       3.366604    hypothetical protein
SPCG_0288  -2      23       3.337697    hypothetical protein
SPCG_0289  -2      25       4.176309    aminopeptidase PepS
SPCG_0290  -1      24       3.337906    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_0253  PF05272       29     0.011    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.3 bits (65), Expect = 0.011
Identities = 11/34 (32%), Positives = 15/34 (44%)

Query: 12 TLLGPSGCGKTTLLRMIAGFNSIKDGEFYFDDTK 45
L G G GK+TL+ + G + D F K
Sbjct: 600 VLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGK 633


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_0267  SACTRNSFRASE  42     8e-08    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 42.2 bits (99), Expect = 8e-08
Identities = 21/108 (19%), Positives = 46/108 (42%), Gaps = 3/108 (2%)

Query: 18 LYQAVGWTNYTHQPEMLEQALSHSLVIYLALDGDAVVGLIRLVGDGFSSVLVQDLIVLPI 77
+ + Y + +L + +G I++ + L++D+ V
Sbjct: 41 RFSKPYFKQYEDDDMDVSYVEEEGKAAFLYYLENNCIGRIKIRSNWNGYALIEDIAVAKD 100

Query: 78 YQRQGIGSALMKEALEDYKDAYQVQLVTEETERTLG---FYRSMGFEI 122
Y+++G+G+AL+ +A+E K+ + L+ E + + FY F I
Sbjct: 101 YRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFII 148


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_0284  TCRTETOQM     619    0.0      Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 619 bits (1597), Expect = 0.0
Identities = 181/667 (27%), Positives = 296/667 (44%), Gaps = 57/667 (8%)

Query: 9 KTRNIGIMAHVDAGKTTTTERILYYTGKIHKIGETHEGASQMDWMEQEQERGITITSAAT 68
K NIG++AHVDAGKTT TE +LY +G I ++G +G ++ D E++RGITI + T
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61

Query: 69 TAQWNNHRVNIIDTPGHVDFTIEVQRSLRVLDGAVTVLDSQSGVEPQTETVWRQATEYGV 128
+ QW N +VNIIDTPGH+DF EV RSL VLDGA+ ++ ++ GV+ QT ++ + G+
Sbjct: 62 SFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGI 121

Query: 129 PRIVFANKMDKIGADFLYSVSTLHDRLQANAHPIQLPIGSEDDFRGIIDLIKMKAEIYTN 188
P I F NK+D+ G D + ++L A +IK K E+Y N
Sbjct: 122 PTIFFINKIDQNGIDLSTVYQDIKEKLSAEI------------------VIKQKVELYPN 163

Query: 189 DLGTDILEEDIPAEYLDQAQEYREKLIEAVAETDEELMMKYLEGEEITNEELKAGIRKAT 248
T+ E + + V E +++L+ KY+ G+ + EL+
Sbjct: 164 MCVTNFTESE---------------QWDTVIEGNDDLLEKYMSGKSLEALELEQEESIRF 208

Query: 249 INVEFFPVLCGSAFKNKGVQLMLDAVIDYLPSPLDIPAIKGINPDTDAEETRPASDEEPF 308
N FPV GSA N G+ +++ + + S +
Sbjct: 209 HNCSLFPVYHGSAKNNIGIDNLIEVITNKFYSSTH-------------------RGQSEL 249

Query: 309 AALAFKIMTDPFVGRLTFFRVYSGVLQSGSYVLNTSKGKRERIGRILQMHANSRQEIDTV 368
FKI RL + R+YSGVL V + K K +I + +ID
Sbjct: 250 CGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEK-IKITEMYTSINGELCKIDKA 308

Query: 369 YSGDIAAAVGLKDTTTGDSLTDEKSKIILESINVPEPVIQLMVEPKSKADQDKMGIALQK 428
YSG+I + L D K E I P P++Q VEP ++ + AL +
Sbjct: 309 YSGEIVILQN-EFLKLNSVLGDTKLLPQRERIENPLPLLQTTVEPSKPQQREMLLDALLE 367

Query: 429 LAEEDPTFRVETNVETGETVISGMGELHLDVLVDRMRREFKVEANVGAPQVSYRETFRAS 488
+++ DP R + T E ++S +G++ ++V ++ ++ VE + P V Y E R
Sbjct: 368 ISDSDPLLRYYVDSATHEIILSFLGKVQMEVTCALLQEKYHVEIEIKEPTVIYME--RPL 425

Query: 489 TQARGFFKRQSGGKGQFGDVWIEFTPNEEGKGFEFENAIVGGVVPREFIPAVEKGLVESM 548
+A + + + + +P G G ++E+++ G + + F AV +G+
Sbjct: 426 KKAEYTIHIEVPPNPFWASIGLSVSPLPLGSGMQYESSVSLGYLNQSFQNAVMEGIRYGC 485

Query: 549 ANGVLAGYPMVDVKAKLYDGSYHDVDSSETAFKIAASLSLKEAAKSAQPAILEPMMLVTI 608
G L G+ + D K G Y+ S+ F++ A + L++ K A +LEP + I
Sbjct: 486 EQG-LYGWNVTDCKICFKYGLYYSPVSTPADFRMLAPIVLEQVLKKAGTELLEPYLSFKI 544

Query: 609 TVPEENLGDVMGHVTARRGRVDGMEAHGNSQIVRAYVPLAEMFGYATVLRSASQGRGTFM 668
P+E L + + N I+ +P + Y + L + GR +
Sbjct: 545 YAPQEYLSRAYTDAPKYCANIVDTQLKNNEVILSGEIPARCIQEYRSDLTFFTNGRSVCL 604

Query: 669 MVFDHYE 675
Y
Sbjct: 605 TELKGYH 611


6     SPCG_0301  SPCG_0316  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
SPCG_0301  -2      18       3.119746    hypothetical protein
SPCG_0302  -1      20       3.747513    dihydropteroate synthase
SPCG_0303  -1      17       2.921658    dihydrofolate synthetase
SPCG_0304  2       21       1.835711    GTP cyclohydrolase I
SPCG_0305  2       22       0.781906    bifunctional folate synthesis protein
SPCG_0306  3       30       -2.520363   hypothetical protein
SPCG_0307  1       28       -3.579628   hypothetical protein
SPCG_0308  3       29       -3.695470   50S ribosomal protein L13
SPCG_0309  0       28       -4.356020   30S ribosomal protein S9
SPCG_0310  -2      26       -7.041753   hypothetical protein
SPCG_0311  0       27       -6.953861   hypothetical protein
SPCG_0312  -2      27       -7.324003   6-phospho-beta-glucosidase
SPCG_0313  0       29       -6.784763   hypothetical protein
SPCG_0314  -1      29       -6.589679   PTS system cellobiose transporter subunit IIB
SPCG_0315  -1      26       -5.944686   PTS system transporter subunit IIB
SPCG_0316  1       24       -3.995751   cellobiose phosphotransferase system IIA
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_0311  TYPE4SSCAGA   29     0.005    Type IV secretion system CagA exotoxin signature.
		>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature.

Length = 1147

Score = 28.9 bits (64), Expect = 0.005
Identities = 28/87 (32%), Positives = 43/87 (49%), Gaps = 6/87 (6%)

Query: 19 FAREMLESGLVAE-IRCQKGNLKYEYFLPIEKE--GTILLIDQWINQ-KALDEHHQSKTM 74
F R LE L + + Q+ N + FL KE G L ++ + K + + K
Sbjct: 551 FVRRNLEDKLTTKGLSPQEANKLIKDFLSSNKELVGKTLNFNKAVADAKNTGNYDEVKKA 610

Query: 75 QKILD--LRKKYHLQMQVERYIEDDSG 99
QK L+ LRK+ HL+ +VE+ +E SG
Sbjct: 611 QKDLEKSLRKREHLEKEVEKKLESKSG 637


7     SPCG_0346  SPCG_0357  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
SPCG_0346  1       21       -4.575563   capsular polysaccharide biosynthesis protein
SPCG_0347  2       26       -6.231676   capsular polysaccharide biosynthesis protein
SPCG_0348  3       30       -8.349591   capsular polysaccharide biosynthesis protein
SPCG_0349  2       31       -9.377404   capsular polysaccharide biosynthesis protein
SPCG_0350  1       31       -9.926491   glucosyl-1-phosphate transferase
SPCG_0351  2       34       -10.440611  glycosyltransferase activity enhancer
SPCG_0352  2       34       -10.545387  ss-1,4-galactosyltransferase
SPCG_0353  2       34       -10.257004  polysaccharide polymerase
SPCG_0354  2       34       -9.466423   ss-1,3-N-acetylglucosaminyltransferase
SPCG_0355  0       28       -6.430985   ss-1,4-galactosyltransferase
SPCG_0356  -1      24       -5.554640   capsular polysaccharide synthesis protein
SPCG_0357  -1      20       -4.479829   transporter
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_0356  PF05704       252    1e-85    Capsular polysaccharide synthesis protein
		>PF05704#Capsular polysaccharide synthesis protein

Length = 307

Score = 252 bits (646), Expect = 1e-85
Identities = 68/238 (28%), Positives = 126/238 (52%), Gaps = 14/238 (5%)

Query: 52 ISNKVWICWFQGEERPPELIRTCIQSMRTHFLGREIIVLTEENISDYIDIPDYITDKYKK 111
++ICW QG E+ P +++ C+ S++ + ++I++ N +++DIPD++ ++++
Sbjct: 67 RQKYIFICWLQGIEKAPYIVQQCVASVKKNSGDFKVIIIDGNNYKEWVDIPDFLIKRWQE 126

Query: 112 GSISRAHYSDILRVELLCRYGGLWVDVTVLNTGGDFSNLELPLFVYKS----LDLSRKDS 167
G + A +SDILR+ LLC+YGGLW+D TV + ++P ++ +S S +S
Sbjct: 127 GKMLDAWFSDILRLFLLCKYGGLWIDATV------YMFDKVPNYIVESNRFMFQSSFLES 180

Query: 168 QAIVASSWLISSYS-NHPILLYARKLLWEYWRRKNSLCNYFLFHIFFTIATEL--YPIEW 224
+ S+WLI S N P L+ + + Y ++K +Y++FH F ++ Y W
Sbjct: 181 ETTHISNWLIFVKSKNDPFLVGLKNSMVTYLKKKEKPADYYIFHDFVSVMAVSKEYSKYW 240

Query: 225 SAVLTFNNHSPHMFNFELNNQFSEKRWEQLKQISVFHKLNHHIDY-SIGVNNFYKFIV 281
+ NN +PHM + N + + +K S KL + +DY ++ N +Y I
Sbjct: 241 KEIPYVNNVNPHMLQYLGNLPYDNSMFNYIKSTSPVQKLTYKLDYNNLKRNTYYDHIF 298


8     SPCG_0376  SPCG_0431  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
SPCG_0376  0       23       3.195770    mevalonate kinase
SPCG_0377  1       23       3.100846    diphosphomevalonate decarboxylase
SPCG_0378  1       23       2.806488    phosphomevalonate kinase
SPCG_0379  1       24       2.610903    isopentenyl pyrophosphate isomerase
SPCG_0380  3       24       1.694090    hypothetical protein
SPCG_0381  2       20       1.682503    sensor histidine kinase
SPCG_0382  1       14       -1.502880   DNA-binding response regulator
SPCG_0383  3       20       -3.368713   hypothetical protein
SPCG_0384  6       19       -2.263938   DNA alkylation repair enzyme, truncation
SPCG_0385  7       21       -3.170507   DNA alkylation repair enzyme, truncation
SPCG_0386  6       21       -3.106149   hypothetical protein
SPCG_0387  6       20       -3.036994   choline binding protein G
SPCG_0388  6       22       -3.054288   hypothetical protein
SPCG_0389  4       24       -1.356846   hypothetical protein
SPCG_0390  0       18       -3.245421   degenerate transposase
SPCG_0391  0       19       -3.131045   degenerate transposase
SPCG_0392  -1      18       -3.227365   degenerate transposase
SPCG_0393  -1      18       -3.240472   degenerate transposase
SPCG_0394  -2      16       -2.134762   PTS system mannitol-specific transporter subunit
SPCG_0395  -2      11       -0.810283   transcriptional regulator
SPCG_0396  -3      18       0.946139    PTS system mannitol-specific transporter subunit
SPCG_0397  -2      20       1.795168    mannitol-1-phosphate 5-dehydrogenase
SPCG_0398  0       23       2.671808    hypothetical protein
SPCG_0399  -1      23       2.896139    trigger factor
SPCG_0400  -1      24       3.991836    helicase
SPCG_0401  0       25       4.031937    signal peptidase I
SPCG_0402  0       21       4.308876    ribonuclease HIII
SPCG_0403  -1      20       4.125026    hypothetical protein
SPCG_0404  0       20       4.082175    hypothetical protein
SPCG_0405  -2      17       4.243344    mutS2 family protein
SPCG_0406  0       19       3.659091    hypothetical protein
SPCG_0407  -2      19       3.250221    sodium:alanine symporter family protein
SPCG_0408  -2      18       2.918512    hypothetical protein
SPCG_0409  -3      16       2.597348    exfoliative toxin
SPCG_0410  -2      16       2.748622    seryl-tRNA synthetase
SPCG_0411  -1      13       1.880317    hypothetical protein
SPCG_0412  -1      11       2.317104    aspartate kinase
SPCG_0413  1       12       3.041490    enoyl-CoA hydratase
SPCG_0414  1       14       2.663779    MarR family transcriptional regulator
SPCG_0415  1       14       2.823391    3-oxoacyl-ACP synthase
SPCG_0416  0       15       2.841078    acyl carrier protein
SPCG_0417  -1      13       3.251075    enoyl-(acyl-carrier-protein) reductase
SPCG_0418  0       13       2.997830    acyl-carrier-protein S-malonyltransferase
SPCG_0419  -1      13       2.673165    3-ketoacyl-ACP reductase
SPCG_0420  -2      13       3.129521    3-oxoacyl-(acyl carrier protein) synthase II
SPCG_0421  -1      15       3.060549    acetyl-CoA carboxylase biotin carboxyl carrier
SPCG_0422  -1      15       3.048567    (3R)-hydroxymyristoyl-ACP dehydratase
SPCG_0423  -1      14       2.909086    acetyl-CoA carboxylase biotin carboxylase
SPCG_0424  -1      18       2.713721    acetyl-CoA carboxylase subunit beta
SPCG_0425  0       26       3.603857    acetyl-CoA carboxylase subunit alpha
SPCG_0426  1       32       3.849828    hypothetical protein
SPCG_0427  1       32       4.128669    transcription antitermination protein NusB
SPCG_0428  0       29       4.590601    hypothetical protein
SPCG_0429  0       28       4.288743    elongation factor P
SPCG_0430  -1      26       4.349817    aspartyl/glutamyl-tRNA amidotransferase subunit
SPCG_0431  0       24       3.581808    aspartyl/glutamyl-tRNA amidotransferase subunit
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_0381  PF06580       36     2e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 35.6 bits (82), Expect = 2e-04
Identities = 18/91 (19%), Positives = 43/91 (47%), Gaps = 9/91 (9%)

Query: 244 ILQELISNTLRHA-----QASCLDVYLYQTDVELQLKVVDNGIGFQLGSLDDLSYGLRNI 298
++Q L+ N ++H Q + + + + + L+V + G + + GL+N+
Sbjct: 259 LVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNV 318

Query: 299 KERVEDMAG---TVQLLTAPKQGLAVDIRIP 326
+ER++ + G ++L + A+ + IP
Sbjct: 319 RERLQMLYGTEAQIKLSEKQGKVNAM-VLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_0382  HTHFIS        65     1e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 65.2 bits (159), Expect = 1e-14
Identities = 27/116 (23%), Positives = 47/116 (40%), Gaps = 4/116 (3%)

Query: 2 KILLVDDHEMVRLGLKSYFDLQD-DVEVVGEASNGSQGIDLALELRPDVIVMDIVMPEMN 60
IL+ DD +R L DV + N + D++V D+VMP+ N
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITS---NAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 61 GIDATLAILKEWPEAKILIVTSYLDNEKIMPVLDAGAKGYMLKTSSADELLHAVSK 116
D I K P+ +L++++ + + GA Y+ K EL+ + +
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_0387  V8PROTEASE    74     3e-17    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 74.3 bits (182), Expect = 3e-17
Identities = 38/212 (17%), Positives = 78/212 (36%), Gaps = 16/212 (7%)

Query: 3 KIDNTLQYPYSTSAMVLSKYYGVADGMNVEGRGSANF-IKDNVLITAAHNYYRHDYGKEA 61
+I +T Y+ + + ++ + + L+T H +G
Sbjct: 78 QITDTTNGHYAPVTYIQVEAPT-------GTFIASGVVVGKDTLLTNKH-VVDATHGDPH 129

Query: 62 DDIYVLPAVSPSQELFGKIKVKEVRYLKEFRNLNSKDAREYDLALLILEEPIGAKLGTLG 121
A++ G +++ A + + IG +
Sbjct: 130 ALKAFPSAINQDNYPNGGFTAEQITKYSG----EGDLAI-VKFSPNEQNKHIGEVVKPAT 184

Query: 122 LPTSQKNLTGITVTITGYPSYNFKIHQMYTDKKQVLSDDGMFLDYQVDTLEGSSGSTVYD 181
+ + + +T+TGYP + + M+ K ++ G + Y + T G+SGS V++
Sbjct: 185 MSNNAETQVNQNITVTGYP-GDKPVATMWESKGKITYLKGEAMQYDLSTTGGNSGSPVFN 243

Query: 182 ASHRVVGVHTLGDGANQINSAVKLNERNLPFI 213
+ V+G+H G N+ N AV +NE F+
Sbjct: 244 EKNEVIGIHW-GGVPNEFNGAVFINENVRNFL 274


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_0419  DHBDHDRGNASE  127    8e-38    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 127 bits (319), Expect = 8e-38
Identities = 78/254 (30%), Positives = 134/254 (52%), Gaps = 13/254 (5%)

Query: 3 LEHKNIFITGSSRGIGLAIAHKFAQAGANIV-LNSRGAISEELLAEFSNYGIKVVPISGD 61
+E K FITG+++GIG A+A A GA+I ++ E++++ D
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 62 VSDFADAKRMIDQAIAELGSVDVLVNNAGITQDTLMLKMTEADFEKVLKVNLTGAFNMTQ 121
V D A + + E+G +D+LVN AG+ + L+ +++ ++E VN TG FN ++
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 122 SVLKPMMKAREGAIINMSSVVGLMGNIGQANYAASKAGLIGFTKSVAREVASRNIRVNVI 181
SV K MM R G+I+ + S + A YA+SKA + FTK + E+A NIR N++
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 182 APGMIESDMTAIL------SDKIKEATLAQ----IPMKEFGQAEQVADLTVFLAGQD--Y 229
+PG E+DM L ++++ + +L IP+K+ + +AD +FL +
Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGH 245

Query: 230 LTGQVIAIDGGLSM 243
+T + +DGG ++
Sbjct: 246 ITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_0421  RTXTOXIND     27     0.023    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 27.5 bits (61), Expect = 0.023
Identities = 14/56 (25%), Positives = 23/56 (41%), Gaps = 9/56 (16%)

Query: 72 EVPAPAEASVATEGN--LVESPLVGVVYLAAGPDKPAFVTVGDSVKKGQTLVIIEA 125
E+ A A + G ++ +V K V G+SV+KG L+ + A
Sbjct: 81 EIVATANGKLTHSGRSKEIKPIENSIV-------KEIIVKEGESVRKGDVLLKLTA 129


9     SPCG_0624  SPCG_0635  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias     Product
SPCG_0624  -3      17       3.866263    glucokinase
SPCG_0625  -1      21       3.498752    thymidylate synthase
SPCG_0626  0       20       3.928996    hypothetical protein
SPCG_0627  -1      21       3.572455    hypothetical protein
SPCG_0628  -2      23       3.549422    tRNA delta(2)-isopentenylpyrophosphate
SPCG_0629  -2      23       2.946513    GTP-binding protein HflX
SPCG_0630  -1      21       2.263311    hypothetical protein
SPCG_0631  -3      21       2.044442    ribonuclease Z
SPCG_0632  0       23       1.424518    short chain dehydrogenase/reductase family
SPCG_0633  2       25       2.083777    transcriptional regulator
SPCG_0634  2       26       1.169616    hypothetical protein
SPCG_0635  2       27       1.690207    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_0624  PF03309       35     2e-04    Bvg accessory factor
		>PF03309#Bvg accessory factor

Length = 271

Score = 35.1 bits (81), Expect = 2e-04
Identities = 25/126 (19%), Positives = 45/126 (35%), Gaps = 14/126 (11%)

Query: 11 IIGIDLGGTSIKFAILTTAGEIQ---GKWSIKTNILDEGSHIVDDMIESIQHRLDLLGLA 67
++ ID+ T +++ +G+ +W I+T D++ +I L+G
Sbjct: 2 LLAIDVRNTHTVVGLISGSGDHAKVVQQWRIRTEPEVTA----DELALTI---DGLIGDD 54

Query: 68 AADFQGIGMGSPGVVDRDKGTVIGAYNLNWKTLQPIKQKIEKALGIPFFIDNDANVAALG 127
A G S V V W + + + GIP +DN V A
Sbjct: 55 AERLTGASGLS--TVPSVLHEVRVMLEQYWPNVPHVLIEPGVRTGIPLLVDNPKEVGA-- 110

Query: 128 ERWMGA 133
+R +
Sbjct: 111 DRIVNC 116


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SPCG_0632  DHBDHDRGNASE  80  5e-20  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 80.1 bits (197), Expect = 5e-20
Identities = 48/182 (26%), Positives = 87/182 (47%), Gaps = 6/182 (3%)

Query: 4 ILITGASGGLAQEMVKLLPND--QLILLGRNKEKLAQLYGNYS----HAELIEIDITDDS 57
ITGA+ G+ + + + L + + + N EKL ++ + HAE D+ D +
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSA 70

Query: 58 ALEALVTDLYLRYGKIDVLINNAGYGIFEGFDQIADKDIHQMFEVNTFALMNLSRHLAAR 117
A++ + + G ID+L+N AG ++D++ F VN+ + N SR ++
Sbjct: 71 AIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSKY 130

Query: 118 MKESSKGHIINIVSMAGLIATGKSSLYSATKFAAIGFSNALRLELMPYGVYVTTVNPGPI 177
M + G I+ + S + + Y+++K AA+ F+ L LEL Y + V+PG
Sbjct: 131 MMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGST 190

Query: 178 RT 179
T
Sbjct: 191 ET 192


10  SPCG_0644  SPCG_0652  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
SPCG_0644  0  16  -4.579830  undecaprenyldiphospho-muramoylpentapeptide
SPCG_0645  3  25  -7.055255  cell division protein DivIB
SPCG_0646  4  32  -9.642499  hypothetical protein
SPCG_0647  4  32  -9.586963  hypothetical protein
SPCG_0648  3  28  -7.127731  hypothetical protein
SPCG_0649  1  21  -3.541787  HesA/MoeB/ThiF family protein
SPCG_0650  2  23  -2.832838  ABC transporter ATP-binding protein
SPCG_0651  1  22  -2.858857  hypothetical protein
SPCG_0652  2  21  -2.867122  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SPCG_0645  IGASERPTASE  41  7e-06  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 41.2 bits (96), Expect = 7e-06
Identities = 21/122 (17%), Positives = 47/122 (38%), Gaps = 2/122 (1%)

Query: 2 SKDKKNEDKETLEELKELSEWQKRNQEYLKKKAE-EEAALAEEKEKERQARMGEESEKSE 60
+ + +++E +E K + + E + +E +E E KE + + ++E
Sbjct: 1058 ATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETE 1117

Query: 61 DKQDQESETDQEDSESAKEESEEKVASSEADKEKEEK-EEPESKEKEEQDKKLAKKATKE 119
Q+ T Q + + E+ + A + + +EP+S+ D + K T
Sbjct: 1118 KTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSS 1177

Query: 120 KP 121

Sbjct: 1178 NV 1179



Score = 37.4 bits (86), Expect = 1e-04
Identities = 30/124 (24%), Positives = 48/124 (38%), Gaps = 21/124 (16%)

Query: 12 TLEELKELSEWQKRNQEYLKKKAEEEAALAEEKEKERQARMGEESEKSEDKQD-QESETD 70
T E E + + +K E++A E Q R + KS K + Q +E
Sbjct: 1032 TPSETTETVAENSKQESKTVEKNEQDA-----TETTAQNREVAKEAKSNVKANTQTNEVA 1086

Query: 71 QEDSESAKEESEEKVASSEADKEKEEK---------EEP----ESKEKEEQDKKLAKKAT 117
Q SE+ +E++ A EKEEK E P + K+EQ + + +A
Sbjct: 1087 QSGSET--KETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAE 1144

Query: 118 KEKP 121
+
Sbjct: 1145 PARE 1148



Score = 36.6 bits (84), Expect = 2e-04
Identities = 22/80 (27%), Positives = 41/80 (51%), Gaps = 7/80 (8%)

Query: 54 EESEKSEDKQDQESETDQEDSESAKEE-------SEEKVASSEADKEKEEKEEPESKEKE 106
E +E + QES+T +++ + A E ++E ++ +A+ + E + S+ KE
Sbjct: 1035 ETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKE 1094

Query: 107 EQDKKLAKKATKEKPAKAKI 126
Q + + AT EK KAK+
Sbjct: 1095 TQTTETKETATVEKEEKAKV 1114



Score = 29.3 bits (65), Expect = 0.040
Identities = 10/33 (30%), Positives = 15/33 (45%)

Query: 354 ADKLIMEAEEKAKQEAKEAEKKQEEEQKKQEEE 386
+ E +E E KE ++EE+ K E E
Sbjct: 1085 VAQSGSETKETQTTETKETATVEKEEKAKVETE 1117


11  SPCG_0833  SPCG_0879  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
SPCG_0833  -1  14  -3.230028  hypothetical protein
SPCG_0834  -2  17  -4.300061  hypothetical protein
SPCG_0835  -2  21  -5.717741  type I restriction-modification system, M
SPCG_0836  -2  25  -6.750763  type I restriction-modification system subunit
SPCG_0837  -2  21  -4.656543  type I restriction-modification system subunit
SPCG_0838  -2  15  -3.208542  hypothetical protein
SPCG_0839  -2  15  -2.952745  hypothetical protein
SPCG_0840  -2  15  -2.547322  phage integrase family integrase/recombinase
SPCG_0841  -2  16  -2.254038  type I restriction-modification system subunit
SPCG_0842  -2  15  -1.987243  type I restriction-modification system, R
SPCG_0843  -2  15  -2.370527  IS1380-Spn1 transposase
SPCG_0844  -1  23  -4.785540  hypothetical protein
SPCG_0845  2  26  -5.179573  degenerate transposase
SPCG_0846  3  26  -5.040595  hypothetical protein
SPCG_0847  3  27  -5.499388  hypothetical protein
SPCG_0848  3  29  -5.270663  phosphosugar-binding transcriptional regulator
SPCG_0849  4  29  -4.705865  N-acetylmannosamine-6-phosphate 2-epimerase
SPCG_0850  3  28  -4.862685  N-acetylneuraminate lyase
SPCG_0851  2  26  -5.221075  sodium:solute symporter family protein
SPCG_0852  1  28  -6.484377  hypothetical protein
SPCG_0853  1  28  -5.889578  neuraminidase
SPCG_0854  -1  30  -6.805877  Gfo/Idh/MocA family oxidoreductase
SPCG_0855  1  34  -8.508658  ROK family protein
SPCG_0856  2  35  -8.824609  hypothetical protein
SPCG_0857  1  32  -6.355155  V-type ATP synthase subunit I
SPCG_0858  1  31  -4.944532  V-type ATP synthase subunit K
SPCG_0859  2  32  -5.867856  v-type sodium ATP synthase subunit E
SPCG_0860  2  28  -5.097097  v-type sodium ATP synthase subunit C
SPCG_0861  1  24  -3.721185  V-type ATP synthase subunit F
SPCG_0862  1  24  -3.822003  V-type ATP synthase subunit A
SPCG_0863  3  22  -5.744880  V-type ATP synthase subunit B
SPCG_0864  3  16  -5.312595  V-type ATP synthase subunit D
SPCG_0865  -1  20  -1.315386  degenerate transposase
SPCG_0866  -2  14  -1.002527  degenerate transposase
SPCG_0867  -3  13  -0.210597  hypothetical protein
SPCG_0868  -2  16  0.295532  hypothetical protein
SPCG_0869  -1  15  0.528421  transcriptional repressor
SPCG_0870  -2  14  0.338847  x-prolyl-dipeptidyl aminopeptidase
SPCG_0871  -2  15  0.017154  DNA polymerase III DnaE
SPCG_0872  3  32  0.784235  6-phosphofructokinase
SPCG_0873  3  27  -0.441515  pyruvate kinase
SPCG_0874  -2  19  -2.212903  hypothetical protein
SPCG_0875  2  20  -3.376817  hypothetical protein
SPCG_0876  1  19  -3.013370  hypothetical protein
SPCG_0877  1  19  -1.563526  hypothetical protein
SPCG_0878  0  19  -1.719701  IS1381 transposase protein A
SPCG_0879  3  21  -4.930807  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SPCG_0857  PF06580  30  0.035  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.8 bits (67), Expect = 0.035
Identities = 19/145 (13%), Positives = 46/145 (31%), Gaps = 7/145 (4%)

Query: 415 SETKRFLKFFNILGVAVAIWGGIYGSFFGYELP-FHLISTTSDVMIILVVSVVFGFITVF 473
+ ++ + +G V G + +I + ++ LV++ +
Sbjct: 6 RQANKYYWYCQGIGWGVYTLTGFGFASLYGSPKLHSMIFNIAISLMGLVLTHAYRSFIKR 65

Query: 474 AGLLASGLQKVRMKKYAEAYNSGFVWCVILLGLLFIAVGMLMPDMRLLFVLGKWVSIFNA 533
G L + ++ ++ G VW V + + + + + + +
Sbjct: 66 QGWLKLNMGQIILRVLPACVVIGMVWFVANTSIWRLLAFINTKPVAF------TLPLALS 119

Query: 534 VGILVVSIIQAKSLSGIGAGLFNLY 558
+ VV + SL G F Y
Sbjct: 120 IIFNVVVVTFMWSLLYFGWHFFKNY 144


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SPCG_0869  ARGREPRESSOR  115  9e-36  Bacterial arginine repressor signature.
		>ARGREPRESSOR#Bacterial arginine repressor signature.

Length = 149

Score = 115 bits (289), Expect = 9e-36
Identities = 56/146 (38%), Positives = 85/146 (58%), Gaps = 2/146 (1%)

Query: 1 MNKSEHRHQLIRALITKNKIHTQAELQTLLAENDIQVTQATLSRDIKNMNLSKVR-EKDS 59
MNK + RH IR +IT N+I TQ EL +L ++ VTQAT+SRDIK ++L KV S
Sbjct: 1 MNKGQ-RHIKIREIITANEIETQDELVDILKKDGYNVTQATVSRDIKELHLVKVPTNNGS 59

Query: 60 AYYVLNNGSISKWEKRLELYMEDALVWMRPVQHQVLLKTLPGLAQSFGSIIDTLSFPDAI 119
Y L +L+ + DA V + H ++LKT+PG AQ+ G+++D L + + +
Sbjct: 60 YKYSLPADQRFNPLSKLKRSLMDAFVKIDSASHLIVLKTMPGNAQAIGALMDNLDWEEIM 119

Query: 120 ATLCGNDVCLIICEDADTAQKCFEEL 145
T+CG+D LIIC D + +++
Sbjct: 120 GTICGDDTILIICRTHDDTKVVQKKI 145


12  SPCG_0906  SPCG_0919  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
SPCG_0906  1  16  3.940159  gamma-glutamyl kinase
SPCG_0907  2  17  3.782719  gamma-glutamyl phosphate reductase
SPCG_0908  3  17  1.850248  pyrroline-5-carboxylate reductase
SPCG_0909  3  19  -0.083222  thymidylate kinase
SPCG_0910  2  17  -1.345578  DNA polymerase III subunit delta'
SPCG_0911  0  17  -2.101116  DNA replication intiation control protein YabA
SPCG_0912  -2  12  -0.040280  tetrapyrrole methylase family protein
SPCG_0913  -2  15  -0.073127  hypothetical protein
SPCG_0914  0  18  1.277847  hypothetical protein
SPCG_0915  1  21  3.080269  IS1381 transposase protein B
SPCG_0916  2  21  3.375445  IS1381 transposase protein A
SPCG_0917  1  21  4.115479  tRNA (uracil-5-)-methyltransferase Gid
SPCG_0918  2  21  3.698257  uridylate kinase
SPCG_0919  2  22  3.123659  ribosome recycling factor
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SPCG_0906  CARBMTKINASE  49  1e-08  Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 48.7 bits (116), Expect = 1e-08
Identities = 28/101 (27%), Positives = 44/101 (43%), Gaps = 7/101 (6%)

Query: 134 GAIPIINENDSVVIDELKVGDNDTLSAQVAAMVQADLLVFLTDVDGLYTGNPNSDPRAKR 193
G +P+I E+ + E V D D ++A V AD+ + LTDV+G + R
Sbjct: 195 GGVPVILEDGEIKGVE-AVIDKDLAGEKLAEEVNADIFMILTDVNGAALYYGTEKEQWLR 253

Query: 194 LERIETINREIIDMAGGAGSSNGTGGMLTKIKAATIATESG 234
++E + + + AGS M K+ AA E G
Sbjct: 254 EVKVEELRKYYEEGHFKAGS------MGPKVLAAIRFIEWG 288


13  SPCG_0963  SPCG_0993  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
SPCG_0963  2  17  -1.770661  mutT/nudix family protein
SPCG_0964  1  15  -2.325530  hypothetical protein
SPCG_0965  1  14  -2.090191  5'-methylthioadenosine/S-adenosylhomocysteine
SPCG_0966  2  15  -2.539220  hypothetical protein
SPCG_0967  3  18  -1.335581  DNA polymerase III subunit epsilon
SPCG_0968  3  17  -1.981485  hypothetical protein
SPCG_0969  4  19  -0.432955  IS630-Spn1 transposase
SPCG_0970  3  16  0.389847  hypothetical protein
SPCG_0971  3  14  0.022362  hypothetical protein
SPCG_0972  0  11  0.336882  hypothetical protein
SPCG_0973  -1  14  -1.385691  cytochrome c-type biogenesis protein CcdA
SPCG_0974  -1  14  -1.217855  thioredoxin
SPCG_0975  0  14  -1.074247  amino acid permease family protein
SPCG_0976  0  15  -1.445558  adhesion lipoprotein
SPCG_0977  0  15  -1.408243  hypothetical protein
SPCG_0978  -1  23  -3.206065  hypothetical protein
SPCG_0979  1  19  0.015836  hypothetical protein
SPCG_0980  0  13  -0.526482  histidine triad protein E, truncation
SPCG_0981  -1  14  -0.805296  hypothetical protein
SPCG_0982  -1  14  -1.384918  hypothetical protein
SPCG_0983  -2  14  -1.439518  hypothetical protein
SPCG_0984  -3  14  -1.377925  peptidase T
SPCG_0985  -1  18  -3.738817  ferrochelatase
SPCG_0986  0  19  -4.152137  hypothetical protein
SPCG_0987  1  17  -3.729149  degenerate transposase
SPCG_0988  -1  14  -1.785665  degenerate transposase
SPCG_0989  -1  14  -0.533096  large conductance mechanosensitive channel
SPCG_0990  -1  15  1.008867  gtrA family protein
SPCG_0991  -1  17  2.126981  hypothetical protein
SPCG_0992  0  16  3.015193  *hypothetical protein
SPCG_0993  1  18  3.552776  aspartate-semialdehyde dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SPCG_0974  ADHESNFAMILY  27  0.046  Adhesin family signature.
		>ADHESNFAMILY#Adhesin family signature.

Length = 309

Score = 27.1 bits (60), Expect = 0.046
Identities = 17/66 (25%), Positives = 28/66 (42%), Gaps = 6/66 (9%)

Query: 7 MKKVMFAGLSLLSLVVLMACGEEETKKTQAAQQPKQQTTVQQIS-----VGKDVPDFTLQ 61
MKK+ + LS ++L+AC K T + Q+ K T I+ + D D
Sbjct: 1 MKKLGTLLVLFLSAIILVACA-SGKKDTTSGQKLKVVATNSIIADITKNIAGDKIDLHSI 59

Query: 62 SMDGKE 67
G++
Sbjct: 60 VPIGQD 65


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SPCG_0976  ADHESNFAMILY  228  1e-75  Adhesin family signature.
		>ADHESNFAMILY#Adhesin family signature.

Length = 309

Score = 228 bits (582), Expect = 1e-75
Identities = 86/315 (27%), Positives = 152/315 (48%), Gaps = 19/315 (6%)

Query: 7 MKKQNLFLVLLSVFLLCLGAC-GQKESQTGKGMKIVTSFYPIYAMVKEVSGDLNDVR-MI 64
MKK LVL ++ + G+K++ +G+ +K+V + I + K ++GD D+ ++
Sbjct: 1 MKKLGTLLVLFLSAIILVACASGKKDTTSGQKLKVVATNSIIADITKNIAGDKIDLHSIV 60

Query: 65 QSSSGIHSFEPSANDIAAIYDADVFVYHSHTLES----WAGSLDPNLKKSKVKVLEASEG 120
H +EP D+ +AD+ Y+ LE+ W L N KK++ K A
Sbjct: 61 PIGQDPHEYEPLPEDVKKTSEADLIFYNGINLETGGNAWFTKLVENAKKTENKDYFAVS- 119

Query: 121 MTLERVPGLEDVEAGDGVDEKTLYDPHTWLDPEKAGEEAQIIADKLSEVDSEHKETYQKN 180
G++ + +EK DPH WL+ E A+ IA +LS D +KE Y+KN
Sbjct: 120 ------DGVDVIYLEGQ-NEKGKEDPHAWLNLENGIIFAKNIAKQLSAKDPNNKEFYEKN 172

Query: 181 AQAFIKKAQELTKKFQPKFEK--ATQKTFVTQHTAFSYLAKRFGLNQLGIAGISPEQEPS 238
+ + K +L K+ + KF K A +K VT AF Y +K +G+ I I+ E+E +
Sbjct: 173 LKEYTDKLDKLDKESKDKFNKIPAEKKLIVTSEGAFKYFSKAYGVPSAYIWEINTEEEGT 232

Query: 239 PRQLTEIQEFVKTYKVKTIFTESNASSKVAETLVKSTGV---GLKTLNPLESDPQNDKTY 295
P Q+ + E ++ KV ++F ES+ + +T+ + T + + + + +Y
Sbjct: 233 PEQIKTLVEKLRQTKVPSLFVESSVDDRPMKTVSQDTNIPIYAQIFTDSIAEQGKEGDSY 292

Query: 296 LENLEENMSILAEEL 310
++ N+ +AE L
Sbjct: 293 YSMMKYNLDKIAEGL 307


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SPCG_0977  PRTACTNFAMLY  32  0.013  Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 31.9 bits (72), Expect = 0.013
Identities = 18/44 (40%), Positives = 19/44 (43%), Gaps = 7/44 (15%)

Query: 340 RYR-----SNHW--VPDSRPEEPSPQPSPSPQPAPNPQPAPSNP 376
RYR + W V P P P P P PQP PQP P P
Sbjct: 553 RYRLAANGNGQWSLVGAKAPPAPKPAPQPGPQPPQPPQPQPEAP 596


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SPCG_0989  MECHCHANNEL  93  1e-27  Bacterial mechano-sensitive ion channel signature.
		>MECHCHANNEL#Bacterial mechano-sensitive ion channel signature.

Length = 136

Score = 93.0 bits (231), Expect = 1e-27
Identities = 48/133 (36%), Positives = 72/133 (54%), Gaps = 11/133 (8%)

Query: 1 MLKNLKSFLLRGNVIDLAVGVVIASAFGAIVTSLVNDIITPLILN-------PALKAAKV 53
++K + F +RGNV+DLAVGV+I +AFG IV+SLV DII P +
Sbjct: 3 IIKEFREFAMRGNVVDLAVGVIIGAAFGKIVSSLVADIIMPPLGLLIGGIDFKQFAVTLR 62

Query: 54 ERIAQLSWHGVGYGNFLSAIINFIFVGTALFFIIKGIEKAQKLTGIKKEKTAEKKPTELE 113
+ + + YG F+ + +F+ V A+F IK I K + K+E A PT+ E
Sbjct: 63 DAQGDIPAVVMHYGVFIQNVFDFLIVAFAIFMAIKLINKLNRK---KEEPAAAPAPTKEE 119

Query: 114 V-LQEIKALLEKK 125
V L EI+ LL+++
Sbjct: 120 VLLTEIRDLLKEQ 132


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SPCG_0991  ACRIFLAVINRP  30  0.008  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 29.8 bits (67), Expect = 0.008
Identities = 16/85 (18%), Positives = 33/85 (38%), Gaps = 6/85 (7%)

Query: 36 IAIVAAIYVVLTVTPPLNAISYGAYQFRISEMMN-FMAFYNPKY-----IIGVTIGCMIA 89
A+ ++ V L +TP L A E F ++N + ++G ++
Sbjct: 476 SAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILG 535

Query: 90 NFFSFGLLDVFVGGGSTLVFLSLGV 114
+ + L+ + G ++FL L
Sbjct: 536 STGRYLLIYALIVAGMVVLFLRLPS 560


14  SPCG_1005  SPCG_1034  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
SPCG_1005  0  16  -3.549703  hypothetical protein
SPCG_1006  2  25  -5.099496  hypothetical protein
SPCG_1007  3  29  -6.035742  RNA methyltransferase
SPCG_1008  10  44  -8.507345  hypothetical protein
SPCG_1009  9  45  -8.859833  iron-compound ABC transporter iron
SPCG_1010  8  45  -8.972001  hypothetical protein
SPCG_1011  8  45  -10.327001  iron-compound ABC transporter permease
SPCG_1012  7  45  -11.138565  iron-compound ABC transporter permease
SPCG_1013  8  46  -11.104415  iron-compound ABC transporter ATP-binding
SPCG_1014  8  50  -11.883004  hypothetical protein
SPCG_1015  5  39  -9.731638  hypothetical protein
SPCG_1016  5  34  -9.173124  hypothetical protein
SPCG_1017  4  31  -7.587775  hypothetical protein
SPCG_1018  4  32  -7.074634  phage-associated protein
SPCG_1019  1  25  -5.336420  hypothetical protein
SPCG_1020  1  22  -5.108623  resolvase family site-specific recombinase
SPCG_1021  2  16  -3.109208  hypothetical protein
SPCG_1022  1  18  -2.958001  hypothetical protein
SPCG_1023  0  18  -3.209457  hypothetical protein
SPCG_1024  1  19  -3.285701  hydrolase
SPCG_1025  0  20  -4.103556  hypothetical protein
SPCG_1026  0  25  -5.872049  neopullulanase
SPCG_1027  1  32  -8.953205  hypothetical protein
SPCG_1028  1  28  -8.699738  hypothetical protein
SPCG_1029  2  26  -8.055529  transcriptional regulator
SPCG_1030  1  21  -7.643669  hypothetical protein
SPCG_1031  2  17  -7.293765  phosphoesterase
SPCG_1032  2  15  -5.366559  hypothetical protein
SPCG_1033  2  17  -4.281009  Tn5252, ORF 10 protein
SPCG_1034  0  16  -4.478328  Tn5252, ORF 9 protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SPCG_1005  THERMOLYSIN  29  0.033  Thermolysin metalloprotease (M4) family signature.
		>THERMOLYSIN#Thermolysin metalloprotease (M4) family signature.

Length = 544

Score = 28.8 bits (64), Expect = 0.033
Identities = 23/180 (12%), Positives = 51/180 (28%), Gaps = 14/180 (7%)

Query: 1 MRKKLFLTSAAVLWAVTAMNSVHAATDVQKVIDETYVQPEYVLGSSLSEDQ--------- 51
M K+ L + + + + A +A V +E + P +V GS L
Sbjct: 1 MNKRAMLGAIGLAFGLMAWPFGASAKGKSMVWNEQWKTPSFVSGSLLGRCSQELVYRYLD 60

Query: 52 KNQTLKKLGYNASTDTKELKTMTPDVYSKIMNVANDSS-LQLYSSAKIQKLGDKSPLEVK 110
+ + +LG A + ++ +M + + + + D +
Sbjct: 61 QEKNTFQLGGQARERLSLIGNKLDELGHTVMRFEQAIAASLCMGAVLVAHVNDGELSSLS 120

Query: 111 IETPENIT----KVTQDMYRNAAVTLGMEHAKITVAAPIPVTGESALAGIYYSLEANGAK 166
N+ K + A + + V P E + + +
Sbjct: 121 GTLIPNLDKRTLKTEAAISIQQAEMIAKQDVADRVTKERPAAEEGKPTRLVIYPDEETPR 180


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SPCG_1009  FERRIBNDNGPP  60  2e-12  Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 60.3 bits (146), Expect = 2e-12
Identities = 50/263 (19%), Positives = 102/263 (38%), Gaps = 36/263 (13%)

Query: 55 PERVATIAWGNHDVALALGIVPVGFSK-ANYGVSADKGVLPWTEEKIKELNGKANLFDDL 113
P R+ + W ++ LALGIVP G + NY + + LP + + ++ +
Sbjct: 35 PNRIVALEWLPVELLLALGIVPYGVADTINYRLWVSEPPLP---DSVIDVGLRTE----- 86

Query: 114 DGLNFEAISNSKPDVIL--AGYSGITKEDYDTLSKIAPVAAYK----SKPWQTLWRDMIK 167
N E ++ KP ++ AGY + L++IAP + +P + + +
Sbjct: 87 --PNLELLTEMKPSFMVWSAGY----GPSPEMLARIAPGRGFNFSDGKQPLAMARKSLTE 140

Query: 168 IDSKALGMEKEGDELIKNTEARISKELEKHPEIKGKIKGKKVLFTMINAADTSKFWIYTS 227
+ + L ++ + + E I P + +L T+I D ++
Sbjct: 141 M-ADLLNLQSAAETHLAQYEDFI---RSMKPRFVKRGARPLLLTTLI---DPRHMLVFGP 193

Query: 228 KDPRANYLTDLGLVFPESLKEFESEDSF--AKEISAEEANKINDADVI-ITYGDDKTLEA 284
L + G+ ++ E +F + +S + D DV+ + + K ++A
Sbjct: 194 NSLFQEILDEYGIP-----NAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNSKDMDA 248

Query: 285 LQKDPLLGKINAIKNGAVAVIPD 307
L PL + ++ G +P
Sbjct: 249 LMATPLWQAMPFVRAGRFQRVPA 271


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SPCG_1020  FLGHOOKAP1  35  9e-04  Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 34.9 bits (80), Expect = 9e-04
Identities = 19/127 (14%), Positives = 44/127 (34%), Gaps = 6/127 (4%)

Query: 390 QEKINMKVDTSEIEKEIDNY-QKELRKSHSTKFKLIEEIDNLDVEDKHYKRRKQDLDDRL 448
+ V S +++E D + +LR + + L + + D L ++
Sbjct: 50 GGWVGNGVYVSGVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQM 109

Query: 449 YRMYDKIDELESSLIDAKAKKQTIEAEKLTGDNIYKVLIYFDKLYKVMNDVERRQLISAL 508
+ + L S+ D A++ I + + D+ + + + I A
Sbjct: 110 QDFFTSLQTLVSNAEDPAARQALIGKSEGLVNQFKT----TDQYLRDQDK-QVNIAIGAS 164

Query: 509 ISEIQVY 515
+ +I Y
Sbjct: 165 VDQINNY 171


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SPCG_1034  PF01540  28  0.013  Adhesin lipoprotein
		>PF01540#Adhesin lipoprotein

Length = 475

Score = 27.8 bits (61), Expect = 0.013
Identities = 16/61 (26%), Positives = 33/61 (54%), Gaps = 8/61 (13%)

Query: 52 INTDTYDQLVFELRRIGNNINQIARAINQSHLISQDQLQELSKGVGELIKEVDKEFQVEV 111
I + +L E ++I N + ++ + N++ ELSK V + I E++K+F+++V
Sbjct: 351 IKAEDDKKLAEENQKIKNGVEELKKINNEA--------FELSKTVNKTIAELEKKFKIDV 402

Query: 112 K 112

Sbjct: 403 S 403


15  SPCG_1081  SPCG_1091  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
SPCG_1081  2  19  0.515216  L-lactate dehydrogenase
SPCG_1082  -1  13  0.429913  DNA gyrase subunit A
SPCG_1083  -3  15  -6.553419  hypothetical protein
SPCG_1084  -2  14  -5.096737  FNT family protein
SPCG_1085  -2  16  -5.710415  O-acetylhomoserine sulfhydrylase, truncation
SPCG_1086  -2  15  -5.480484  O-acetylhomoserine sulfhydrylase, truncation
SPCG_1087  -1  17  -5.840659  hypothetical protein
SPCG_1088  0  19  -5.310450  hypothetical protein
SPCG_1089  -1  24  3.026007  tRNA pseudouridine synthase B
SPCG_1090  -2  22  2.861914  hypothetical protein
SPCG_1091  -2  24  3.394256  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SPCG_1087  RTXTOXIND  34  7e-04  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 34.4 bits (79), Expect = 7e-04
Identities = 24/167 (14%), Positives = 61/167 (36%), Gaps = 26/167 (15%)

Query: 38 DRMRQELALAEQKAMNEQQTKLAQKDQEIAQLQSQIQNFDTEKELAKKEVEQTSHQALLA 97
+ +R + EQ + + Q QK+ + + +++ T +
Sbjct: 183 EVLRLTSLIKEQFSTWQNQ--KYQKELNLDKKRAER---------------LTVLARINR 225

Query: 98 KDKEVQLLENQLATLR-LEHENQLQKT-LSDLEKERDQVKNQLLLQEKENELSLASVKQN 155
+ ++ +++L L H+ + K + + E + + N+L + + L ++
Sbjct: 226 YENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNEL----RVYKSQLEQIESE 281

Query: 156 Y-EAQLKAASEQVEFYKNFKAQ--QSTKAIGESLEQYAESEFNKVRS 199
A+ + F + Q+T IG + A++E + S
Sbjct: 282 ILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQAS 328


16  SPCG_1102  SPCG_1127  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
SPCG_1102  2  25  -0.527744  hypothetical protein
SPCG_1103  2  29  -0.209156  IS1381 transposase protein B
SPCG_1104  1  25  -0.956339  galactose mutarotase
SPCG_1105  1  25  -0.541685  galactose-6-phosphate isomerase subunit LacA
SPCG_1106  -1  20  -2.130043  galactose-6-phosphate isomerase subunit LacB
SPCG_1107  -1  18  -1.862834  tagatose-6-phosphate kinase
SPCG_1108  -1  19  -2.653951  tagatose 1,6-diphosphate aldolase
SPCG_1109  -2  18  -3.680723  transcription antiterminator LacT
SPCG_1110  -2  18  -3.352420  PTS system lactose-specific transporter subunit
SPCG_1111  -2  15  -3.574444  PTS system lactose-specific transporter subunit
SPCG_1112  0  18  -2.050577  6-phospho-beta-galactosidase
SPCG_1113  1  25  -1.154246  hypothetical protein
SPCG_1114  1  26  -0.724416  lactose phosphotransferase system repressor
SPCG_1115  1  31  0.175234  degenerate transposase
SPCG_1116  2  34  1.252052  degenerate transposase
SPCG_1117  0  25  1.357368  ribonucleotide-diphosphate reductase subunit
SPCG_1118  0  23  0.850751  ribonucleotide-diphosphate reductase subunit
SPCG_1119  0  18  0.366838  nrdH-redoxin
SPCG_1120  -2  17  -0.790153  phosphocarrier protein HPr
SPCG_1121  -2  13  -0.640153  phosphoenolpyruvate-protein phosphotransferase
SPCG_1122  0  16  -3.388995  hypothetical protein
SPCG_1123  2  22  -5.039446  hypothetical protein
SPCG_1124  2  20  -3.329478  degenerate transposase
SPCG_1125  0  14  -1.334448  hypothetical protein
SPCG_1126  1  13  0.700894  HAD superfamily hydrolase
SPCG_1127  2  14  0.257570  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SPCG_1102  BCTERIALGSPF  26  0.026  Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 25.9 bits (57), Expect = 0.026
Identities = 9/26 (34%), Positives = 14/26 (53%)

Query: 64 FEPGIPVIEAGPILFCIPAMSVPVFD 89
FEP + V A +LF + A+ P+
Sbjct: 376 FEPLLVVSMAAVVLFIVLAILQPILQ 401


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SPCG_1107  BINARYTOXINA  28  0.037  Clostridial binary toxin A signature.
		>BINARYTOXINA#Clostridial binary toxin A signature.

Length = 454

Score = 28.5 bits (63), Expect = 0.037
Identities = 36/141 (25%), Positives = 56/141 (39%), Gaps = 24/141 (17%)

Query: 95 GDNQTEVLEKGPEVLEQEGQDFLEHFKKLLESVEVVAISGSLPAGLPV------DYYASL 148
+ E +EK + LE+E LE +KK E + + + + Y +L
Sbjct: 61 EKKEAERVEKNLDTLEKEA---LELYKKDSEQISNYSQTRQYFYDYQIESNPREKEYKNL 117

Query: 149 VE--LANQAGKPVVLDCSGAALQAVLESPHKPTVIKPNNEELSQLLGREVS-EDLDELKE 205
N+ KP+ + ESP K N+E+ E+S E +ELKE
Sbjct: 118 RNAISKNKIDKPINV--------YYFESPEKFAF----NKEIRTENQNEISLEKFNELKE 165

Query: 206 VLQEPLFAGIEWIIVSLGANG 226
+Q+ LF + VSL G
Sbjct: 166 TIQDKLFKQDGFKDVSLYEPG 186


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SPCG_1115  PREPILNPTASE  30  0.004  Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 30.2 bits (68), Expect = 0.004
Identities = 18/80 (22%), Positives = 26/80 (32%), Gaps = 16/80 (20%)

Query: 19 QILDIINKDTHKEIIAKLDYDAP--SCPECGNQLKKYDFQKPSKIPYLETTGMPTRILLR 76
+ N D + P CP C + + + IP L + + LR
Sbjct: 48 EYRSYFNPDDEGVDEPPYNLMVPRSCCPHCNHPITALE-----NIPLL------SWLWLR 96

Query: 77 KRRFKCYHCSKMMVAETPLV 96
R C C + A PLV
Sbjct: 97 GR---CRGCQAPISARYPLV 113


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SPCG_1121  PHPHTRNFRASE  813  0.0  Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 813 bits (2102), Expect = 0.0
Identities = 336/573 (58%), Positives = 446/573 (77%), Gaps = 4/573 (0%)

Query: 1 MTEMLKGIAASDGVAVAKAYLLVQPDLSFETITVEDTNAEEARLDAALQASQDELSVIRE 60
M + GIAAS GVA+AKA++ ++P++ E ++ D + E +L AAL+ S++EL I++
Sbjct: 1 MHHKITGIAASSGVAIAKAFIHLEPNVDIEKTSITDVSTEIEKLTAALEKSKEELRAIKD 60

Query: 61 KAVGTLGEEAAQVFDAHLMVLADPEMISQIKETIRAKKVNAEAGLKEVTDMFITIFEGME 120
+ ++G + A++F AHL+VL DPE++ IK I +++NAE LKEV+DMF+++FE M
Sbjct: 61 QTEASMGADKAEIFAAHLLVLDDPELVDGIKGKIENEQMNAEYALKEVSDMFVSMFESM- 119

Query: 121 DNPYMQERAADIRDVTKRVLANLLGKKLPNPASINEEVIVIAHDLTPSDTAQLDKNFVKA 180
DN YM+ERAADIRDV+KRVL +L+G + + A+I EE ++IA DLTPSDTAQL+K FVK
Sbjct: 120 DNEYMKERAADIRDVSKRVLGHLIGVETGSLATIAEETVIIAEDLTPSDTAQLNKQFVKG 179

Query: 181 FVTNIGGRTSHSAIMARTLEIAAVLGTNNITEIVKDGDILAVNGITGEVIINPTDEQAAE 240
F T+IGGRTSHSAIM+R+LEI AV+GT +TE ++ GD++ V+GI G VI+NPT+E+
Sbjct: 180 FATDIGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVNPTEEEVKA 239

Query: 241 FKAAGEAYAKQKAEWALLKDAQTVTADGKHFELAANIGTPKDVEGVNNNGAEAVGLYRTE 300
++ A+ KQK EWA L + T DG H ELAANIGTPKDV+GV NG E +GLYRTE
Sbjct: 240 YEEKRAAFEKQKQEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTE 299

Query: 301 FLYMDSQDFPTEDEQYEAYKAVLEGMNGKPVVVRTMDIGGDKELPYFDMPHEMNPFLGFR 360
FLYMD PTE+EQ+EAYK V++ M+GKPVV+RT+DIGGDKEL Y +P E+NPFLGFR
Sbjct: 300 FLYMDRDQLPTEEEQFEAYKEVVQRMDGKPVVIRTLDIGGDKELSYLQLPKELNPFLGFR 359

Query: 361 ALRISISETGDAMFRTQIRALLRASVHGQLRIMFPMVALLKEFRAAKAVFDEEKANLLAE 420
A+R+ + + +FRTQ+RALLRAS +G L++MFPM+A L+E R AKA+ EEK LL+E
Sbjct: 360 AIRLCLEKQD--IFRTQLRALLRASTYGNLKVMFPMIATLEELRQAKAIMQEEKDKLLSE 417

Query: 421 GVAVADNIQVGIMIEIPAAAMLADQFAKEVDFFSIGTNDLIQYTMAADRMNEQVSYLYQP 480
GV V+D+I+VGIM+EIP+ A+ A+ FAKEVDFFSIGTNDLIQYTMAADRMNE+VSYLYQP
Sbjct: 418 GVDVSDSIEVGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAADRMNERVSYLYQP 477

Query: 481 YNPSILRLINNVIKAAHAEGKWAGMCGEMAGDQQAVPLLVGMGLDEFSMSATSVLRTRSL 540
Y+P+ILRL++ VIKAAH+EGKW GMCGEMAGD+ A+PLL+G+GLDEFSMSATS+L RS
Sbjct: 478 YHPAILRLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLDEFSMSATSILPARSQ 537

Query: 541 MKKLDTAKMEEYANRALTECSTMEEVLELQKEY 573
+ KL +++ +A +AL T EEV +L K+
Sbjct: 538 LLKLSKEELKPFAQKALM-LDTAEEVEQLVKKT 569


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SPCG_1122  TONBPROTEIN  39  6e-05  Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 38.8 bits (90), Expect = 6e-05
Identities = 16/88 (18%), Positives = 35/88 (39%), Gaps = 3/88 (3%)

Query: 742 PQTEKPEEETPREEKPQSEKPESPKPTEEPEEESPEESPEESEEPQVETEKVKEKLREAE 801
P +P + +P E P+P EP +E+P + +P+ + + VK+ + +
Sbjct: 52 PADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPK 111

Query: 802 DLLGKIQDPI---IKSNAKETLTGLKNN 826
+ ++ ++ A LT
Sbjct: 112 RDVKPVESRPASPFENTAPARLTSSTAT 139



Score = 35.0 bits (80), Expect = 0.001
Identities = 16/74 (21%), Positives = 26/74 (35%), Gaps = 3/74 (4%)

Query: 740 EKPQTEKPEEET---PREEKPQSEKPESPKPTEEPEEESPEESPEESEEPQVETEKVKEK 796
E P +P T P + +P P+P EPE E E P V + +
Sbjct: 37 ELPAPAQPISVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKP 96

Query: 797 LREAEDLLGKIQDP 810
+ + + + P
Sbjct: 97 KPKPKPVKKVQEQP 110



Score = 33.8 bits (77), Expect = 0.002
Identities = 13/56 (23%), Positives = 22/56 (39%), Gaps = 3/56 (5%)

Query: 733 QTEKPNEEKPQTEKPEEETPREEKPQSEKPESPKPTEEPEEESPEESPEESEEPQV 788
Q +P+ E P +E P + PKP +P+ P + +E + V
Sbjct: 62 QPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPK---PVKKVQEQPKRDV 114



Score = 31.1 bits (70), Expect = 0.016
Identities = 19/69 (27%), Positives = 28/69 (40%), Gaps = 10/69 (14%)

Query: 735 EKPNEEKPQTEKPEEETPR-EEKPQSEKPESPKPTEEPEE---------ESPEESPEESE 784
+P E +P +E P EKP+ + PKP ++ +E ES SP E+
Sbjct: 69 VEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESRPASPFENT 128

Query: 785 EPQVETEKV 793
P T
Sbjct: 129 APARLTSST 137


17  SPCG_1212  SPCG_1239  Y  N  Y  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
SPCG_1212  -1  12  -3.687602  phosphoenolpyruvate carboxylase
SPCG_1213  -2  21  -5.075281  cell division protein FtsW
SPCG_1214  -1  25  -6.237657  IS1381 transposase protein A, truncation
SPCG_1215  0  22  -5.821037  hypothetical protein
SPCG_1216  -1  20  -4.850991  hypothetical protein
SPCG_1217  -2  20  -4.762302  resolvase family site-specific recombinase
SPCG_1218  -2  19  -4.089899  hypothetical protein
SPCG_1219  -1  18  -3.292375  degenerate transposase
SPCG_1220  -1  19  -3.215543  degenerate transposase
SPCG_1221  1  20  -5.925428  degenerate transposase
SPCG_1222  2  25  -7.451373  hypothetical protein
SPCG_1223  4  28  -7.751733  daunorubicin resistance transmembrane protein
SPCG_1224  4  25  -6.668088  hypothetical protein
SPCG_1225  1  23  -5.891059  hypothetical protein
SPCG_1226  1  20  -5.998954  hypothetical protein
SPCG_1227  0  16  -4.155807  methionyl-tRNA synthetase
SPCG_1228  -2  12  -2.942629  chloramphenicol acetyltransferase
SPCG_1229  -3  12  -1.591379  IS1239 transposase
SPCG_1230  -1  13  -1.599188  DNA processing protein DprA
SPCG_1231  -1  12  -2.505900  licC protein
SPCG_1232  0  12  -2.817356  licB protein
SPCG_1233  0  13  -3.304226  choline kinase
SPCG_1234  -2  14  -0.192913  zinc-containing alcohol dehydrogenase
SPCG_1235  -1  16  0.330090  2-C-methyl-D-erythritol 4-phosphate
SPCG_1236  -2  15  0.742147  polysaccharide biosynthesis protein
SPCG_1237  0  16  1.658384  licD1 protein
SPCG_1238  1  17  2.786942  licD2 protein
SPCG_1239  1  18  3.124673  carbamoyl phosphate synthase large subunit
18  SPCG_1253  SPCG_1259  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
SPCG_1253  -1  17  -4.545541  hypothetical protein
SPCG_1254  0  22  -6.725438  Cof family protein
SPCG_1255  0  26  -7.818058  hypothetical protein
SPCG_1256  1  25  -6.196285  choline binding protein PcpA
SPCG_1257  1  23  -6.631941  surface protein A
SPCG_1258  3  22  -7.156865  rgg protein
SPCG_1259  2  17  -1.012270  hypothetical protein
19  SPCG_1268  SPCG_1390  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
SPCG_1268  -2  17  -6.558166  hypothetical protein
SPCG_1269  -2  23  -8.507468  hypothetical protein
SPCG_1270  -2  24  -8.778296  hypothetical protein
SPCG_1271  -1  26  -9.662045  hypothetical protein
SPCG_1272  -1  26  -9.785576  glutamate dehydrogenase
SPCG_1273  1  32  -12.004761  glutamate dehydrogenase
SPCG_1274  2  33  -11.204732  site-specific recombinase
SPCG_1275  3  29  -10.014799  hypothetical protein
SPCG_1276  3  29  -9.926083  Tn5252, relaxase
SPCG_1277  3  28  -8.994677  Tn5252, ORF 9 protein
SPCG_1278  2  29  -9.427709  Tn5252, ORF 10 protein
SPCG_1279  1  30  -9.982038  hypothetical protein
SPCG_1280  1  31  -9.950246  DNA-dependent ATPase I and helicase II
SPCG_1281  0  33  -10.819694  hypothetical protein
SPCG_1282  0  33  -11.058680  hypothetical protein
SPCG_1283  1  34  -11.815412  hypothetical protein
SPCG_1284  0  35  -11.540149  hypothetical protein
SPCG_1285  1  33  -10.675388  hypothetical protein
SPCG_1286  0  35  -11.458365  phosphoserine phosphatase
SPCG_1287  2  35  -11.658162  hypothetical protein
SPCG_1288  3  35  -11.598188  hypothetical protein
SPCG_1289  1  32  -11.215081  hypothetical protein
SPCG_1290  0  29  -9.705261  hypothetical protein
SPCG_1291  0  30  -9.517963  replication initiation protein Rep(RC)
SPCG_1292  1  27  -8.079564  chloramphenicol acetyltransferase
SPCG_1293  1  28  -7.567986  hypothetical protein
SPCG_1294  2  27  -6.679512  hypothetical protein
SPCG_1295  4  29  -5.584424  hypothetical protein
SPCG_1296  4  28  -5.984868  transcriptional regulator
SPCG_1297  3  27  -6.208387  hypothetical protein
SPCG_1298  2  27  -6.045405  hypothetical protein
SPCG_1299  3  25  -5.469966  hypothetical protein
SPCG_1300  3  21  -4.215073  hypothetical protein
SPCG_1301  3  20  -3.875226  parvulin-like peptidyl-prolyl isomerase
SPCG_1302  3  21  -3.883362  hypothetical protein
SPCG_1303  3  20  -3.021903  hypothetical protein
SPCG_1304  3  19  -2.947903  hypothetical protein
SPCG_1305  2  18  -3.057959  SNF2 family protein
SPCG_1306  1  18  -2.433405  hypothetical protein
SPCG_1307  1  18  -2.342910  hypothetical protein
SPCG_1308  2  18  -2.397676  hypothetical protein
SPCG_1309  1  19  -3.030385  hypothetical protein
SPCG_1310  0  20  -2.931703  hypothetical protein
SPCG_1311  1  20  -3.754552  hypothetical protein
SPCG_1312  2  23  -5.119622  hypothetical protein
SPCG_1313  3  22  -5.532943  hypothetical protein
SPCG_1314  1  26  -6.257124  hypothetical protein
SPCG_1315  -1  23  -5.648177  hypothetical protein
SPCG_1316  -2  17  -3.864430  hypothetical protein
SPCG_1317  0  20  -1.593374  hypothetical protein
SPCG_1318  1  22  -1.379422  hypothetical protein
SPCG_1319  1  23  -1.370734  hypothetical protein
SPCG_1320  0  25  -0.558074  hypothetical protein
SPCG_1321  0  25  -0.956569  hypothetical protein
SPCG_1322  3  26  -2.198276  hypothetical protein
SPCG_1323  4  29  -3.617793  erythromycin ribosome methylase
SPCG_1324  2  23  -4.052306  streptothricin acetyltransferase
SPCG_1325  1  24  -4.110777  aminoglycoside phosphotransferase type III
SPCG_1326  0  24  -5.334032  omega2 protein
SPCG_1327  0  24  -5.194330  rRNA methylase
SPCG_1328  1  25  -4.858248  resolvase
SPCG_1329  1  22  -4.488437  transposase
SPCG_1330  2  21  -3.044926  hypothetical protein
SPCG_1331  1  22  -3.777478  hypothetical protein
SPCG_1332  0  21  -3.846516  hypothetical protein
SPCG_1333  -1  20  -4.395462  hypothetical protein
SPCG_1334  -1  22  -3.345346  hypothetical protein
SPCG_1335  1  26  -3.764981  hypothetical protein
SPCG_1336  2  26  -3.969202  hypothetical protein
SPCG_1337  3  27  -4.027652  hypothetical protein
SPCG_1338  1  27  -3.753378  hypothetical protein
SPCG_1339226-2.371257type II DNA modification methyltransferase
SPCG_1340320-0.558841replication initiator A, N-terminal
SPCG_13411141.544598hypothetical protein
SPCG_13421152.040067hypothetical protein
SPCG_13431142.58862350S ribosomal protein L7/L12
SPCG_13440153.11005950S ribosomal protein L10
SPCG_13450143.022709chlorohydrolase
SPCG_13460152.728357ABC transporter ATP-binding protein/permease
SPCG_13470131.191276ABC transporter ATP-binding protein/permease
SPCG_1348-113-0.482439bifunctional methionine sulfoxide reductase A/B
SPCG_1349-111-1.562393homoserine kinase
SPCG_1350-112-2.446096homoserine dehydrogenase
SPCG_1351016-2.959706adaptor protein
SPCG_1352-117-3.346828hypothetical protein
SPCG_1353017-1.655962hypothetical protein
SPCG_1354018-0.283327glycosyl transferase family protein
SPCG_13550160.596547group 1 glycosyl transferase
SPCG_1356-2152.110828licD3 protein
SPCG_1357-2152.063853hypothetical protein
SPCG_1358-2142.577975psr protein
SPCG_1359-1153.845069prephenate dehydratase
SPCG_1360-1154.255092shikimate kinase
SPCG_1361-1174.3345253-phosphoshikimate 1-carboxyvinyltransferase
SPCG_1362-1184.195631hypothetical protein
SPCG_1363-1183.864998prephenate dehydrogenase
SPCG_13640192.799282chorismate synthase
SPCG_13650212.0032733-dehydroquinate synthase
SPCG_1366-1192.354770shikimate 5-dehydrogenase
SPCG_1367-1162.7807583-dehydroquinate dehydratase
SPCG_1368-1162.568212hypothetical protein
SPCG_1369-1162.650868hypothetical protein
SPCG_1370-2193.349170ABC transporter ATP-binding protein
SPCG_1371-1243.497184alpha-amylase
SPCG_1372-1233.254025alanyl-tRNA synthetase
SPCG_1373-2253.118355hypothetical protein
SPCG_1374-1263.163351spermidine/putrescine ABC transporter
SPCG_1375-1263.475182spermidine/putrescine ABC transporter permease
SPCG_13760253.859365spermidine/putrescine ABC transporter permease
SPCG_1377-2243.818497spermidine/putrescine ABC transporter
SPCG_1378-1182.810228UDP-N-acetylenolpyruvoylglucosamine reductase
SPCG_1379-1172.246324hypothetical protein
SPCG_1380-1172.532702hypothetical protein
SPCG_1381-1172.377090alpha-acetolactate decarboxylase
SPCG_1382-1162.352498hypothetical protein
SPCG_1383-2152.268637amino acid ABC transporter amino acid-binding
SPCG_1384-2183.068593phosphate transport system regulatory protein
SPCG_1385-1193.748842phosphate transporter ATP-binding protein
SPCG_13861203.899688phosphate transporter ATP-binding protein
SPCG_13873194.117568phosphate ABC transporter permease
SPCG_13882223.337998phosphate ABC transporter permease
SPCG_13893222.766419phosphate ABC transporter substrate-binding
SPCG_13902233.129771NOL1/NOP2/sun family protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SPCG_1290BCTERIALGSPC290.016 Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C signature.

Length = 272

Score = 28.8 bits (64), Expect = 0.016
Identities = 13/52 (25%), Positives = 25/52 (48%)

Query: 2 AESALINLINFSKENEELTNLVSGHASKREKATISKDGLIQARSIENFIDNY 53
+++ ++ + S N LT +++G R A ISKD +R + + Y
Sbjct: 80 LDASQMSNLPPSTLNLSLTGVMAGDDDSRSIAIISKDNEQFSRGVNEEVPGY 131


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SPCG_1299PYOCINKILLER260.017 Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 25.9 bits (56), Expect = 0.017
Identities = 6/47 (12%), Positives = 17/47 (36%)

Query: 1 MTIIERLEEKVTRQESKVARETEKLAAYKEQLETAMFATFKRRQSIS 47
++ ++ +T ++ + A + E A + RQ +
Sbjct: 197 ISSLQIRMNTLTAAKASIEAAAANKAREQAAAEAKRKAEEQARQQAA 243


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SPCG_1308GPOSANCHOR300.035 Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 30.4 bits (68), Expect = 0.035
Identities = 14/131 (10%), Positives = 37/131 (28%), Gaps = 14/131 (10%)

Query: 12 KAFRRSLKDEKKFLKKGKKEVKKQKKDSAVLDEKAWK-----KEIKKKLEEMREASKARV 66
+A + +L+ + L+K + + + K LE+ E +
Sbjct: 182 EAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFS 241

Query: 67 KQANEDYNHI------LQNSPPSLLNRKELRDRRLPHARKRLKIAKKQYREAKVE---AK 117
+ + L+ L E ++K + + + E +
Sbjct: 242 TADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLE 301

Query: 118 EERKESRKERK 128
+ + R+
Sbjct: 302 HQSQVLNANRQ 312


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SPCG_1324SACTRNSFRASE289e-103 Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 289 bits (740), Expect = e-103
Identities = 87/180 (48%), Positives = 124/180 (68%), Gaps = 7/180 (3%)

Query: 1 MITEMKAGHLKDIDKPSEPFEVIGKIIPRYENENWTFTELLYEAPYLKSYQDEEDEEDEE 60
MI +M ++KD +KP+EPF V G++IP +EN WT+TE + PY K Y+D++ +
Sbjct: 1 MIMKMTHLNMKDFNKPNEPFVVFGRMIPAFENGVWTYTEERFSKPYFKQYEDDDMD---- 56

Query: 61 ADCLEYIDNTDKIIYLYYQDDKCVGKVKLRKNWNRYAYIEDIAVCKDFRGQGIGSALINI 120
+ Y++ K +LYY ++ C+G++K+R NWN YA IEDIAV KD+R +G+G+AL++
Sbjct: 57 ---VSYVEEEGKAAFLYYLENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHK 113

Query: 121 SIEWAKHKNLHGLMLETQDNNLIACKFYHNCGFKIGSVDTMLYANFENNFEKAVFWYLRF 180
+IEWAK + GLMLETQD N+ AC FY F IG+VDTMLY+NF E A+FWY +F
Sbjct: 114 AIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGAVDTMLYSNFPTANEIAIFWYYKF 173


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SPCG_1328HTHTETR270.032 TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 27.3 bits (60), Expect = 0.032
Identities = 8/55 (14%), Positives = 29/55 (52%), Gaps = 4/55 (7%)

Query: 133 RGKKGGRPSKGKLSIDLALKMYDSKEY---SIRQILDASKLSKTTFYRYLNKRNA 184
+ K+ + ++ + +D+AL+++ + S+ +I A+ +++ Y + ++
Sbjct: 4 KTKQEAQETRQHI-LDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSD 57


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SPCG_1345UREASE371e-04 Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 37.4 bits (87), Expect = 1e-04
Identities = 24/65 (36%), Positives = 34/65 (52%), Gaps = 9/65 (13%)

Query: 387 RTAALLQKMK---------SGDASQFPIETALKVLTIEGAKALGMENQIGSLEVGKQADF 437
RT KMK +GD F ++ + TI A A G+ ++IGSLEVGK+AD
Sbjct: 375 RTWQTADKMKRQRGRLKEETGDNDNFRVKRYIAKYTINPAIAHGLSHEIGSLEVGKRADL 434

Query: 438 LVIQP 442
++ P
Sbjct: 435 VLWNP 439


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SPCG_1351BACINVASINB280.038 Salmonella/Shigella invasin protein B signature.
		>BACINVASINB#Salmonella/Shigella invasin protein B signature.

Length = 593

Score = 28.2 bits (62), Expect = 0.038
Identities = 15/63 (23%), Positives = 35/63 (55%), Gaps = 4/63 (6%)

Query: 92 EDLSDLPDMEELAQMSPDEFIKTLEKSIADKTKDDIEAIQSLEQVEAKEEEQEQAEQEAE 151
++LS++ + L M FI+ + K+ + ++D+ +L++ E E++ AE + E
Sbjct: 248 DNLSNVARLTMLMAM----FIEIVGKNTEESLQNDLALFNALQEGRQAEMEKKSAEFQEE 303

Query: 152 SKK 154
++K
Sbjct: 304 TRK 306


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SPCG_1356INTIMIN300.012 Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 30.0 bits (67), Expect = 0.012
Identities = 16/48 (33%), Positives = 24/48 (50%), Gaps = 5/48 (10%)

Query: 69 ELWPRYADERYFLSKSHKDFVDRNLFITIRDKKTTCIKPYQQDLDLPH 116
++ P+Y +E LS S D V RN I + KK + L++PH
Sbjct: 418 QIEPQYVNELRTLSGSRYDLVQRNNNIILEYKKQDILS-----LNIPH 460


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SPCG_1374MYCMG045453e-07 Hypothetical mycoplasma lipoprotein (MG045) signature.
		>MYCMG045#Hypothetical mycoplasma lipoprotein (MG045) signature.

Length = 483

Score = 44.7 bits (105), Expect = 3e-07
Identities = 77/331 (23%), Positives = 137/331 (41%), Gaps = 65/331 (19%)

Query: 25 LDSKINSRDSQKLVIYNWGDYIDPELLTQFTEETGIQVQYETFDSNEAMYTKIKQGGTTY 84
L S ++S S V+ N+ YI P LL + E+ + + T+ SNE + TY
Sbjct: 16 LSSILSSCGSTTFVLANFESYISPLLLERVQEKH--PLTFLTYPSNEKLINGF--ANNTY 71

Query: 85 DIAIPSEYMINKMKDEDLLVPLDYSK-----------------------LEGLENIGPEF 121
+A+ S Y ++++ + DLL P+D+S+ ++ ++ I +
Sbjct: 72 SVAVASTYAVSELIERDLLSPIDWSQFNLKKSSSSSDKVNNASDAKDLFIDSIKEISQQT 131

Query: 122 LNQSFDPGNKFSIPYFWGTLGIVY-NETMVEAAPEH--WDDLWKPEYK-------NSIML 171
+ + +++PYF L VY E + E E+ W D+ K K N ++
Sbjct: 132 KDSKNNELLHWAVPYFLQNLVFVYRGEKISELEQENVSWTDVIKAIVKHKDRFNDNRLVF 191

Query: 172 FDGAREVLGLG---------------LNSLGYSLNSKDT-QQLEETVDKLYKLTPNIKA- 214
D AR + L + +GY N ++ Q+L T L + N +
Sbjct: 192 IDDARTIFSLANIVNTNNNSADVNPKEDGIGYFTNVYESFQRLGLTKSNLDSIFVNSDSN 251

Query: 215 IVADEM-----KGYMIQNNAAIGVTFSGEASQMLEKNE----NLRYVVPTEASNLWFDNM 265
IV +E+ +G ++ N A+ G+ L + + N ++V + S + D +
Sbjct: 252 IVINELASGRRQGGIVYNGDAVYAALGGDLRDELSEEQIPDGNNFHIVQPKISPVALDLL 311

Query: 266 VIPKTVKN-QDAAYAFINFMLKPENALKNAE 295
VI K N Q A+ I F L + A + E
Sbjct: 312 VINKQQSNFQKEAHEII-FDLALDGADQTKE 341


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SPCG_1383ADHESNFAMILY300.008 Adhesin family signature.
		>ADHESNFAMILY#Adhesin family signature.

Length = 309

Score = 30.2 bits (68), Expect = 0.008
Identities = 17/45 (37%), Positives = 22/45 (48%), Gaps = 5/45 (11%)

Query: 4 KKWIFVLCNFLASFFLVACQSGSNGSQSAVDAIKQKGKLVVATSP 48
KK +L FL++ LVAC SG + S QK K+V S
Sbjct: 2 KKLGTLLVLFLSAIILVACASGKKDTTS-----GQKLKVVATNSI 41


20SPCG_1401SPCG_1439Y        NYGenomic Island
LocusTagDNBiasCDNBias%GCBiasProduct
SPCG_1401-2163.025403HPr kinase/phosphorylase
SPCG_1402-2183.87143430S ribosomal protein S21
SPCG_1403-2174.564569glucosamine-6-phosphate isomerase
SPCG_1404-2183.954084S-adenosylmethionine--tRNA
SPCG_14050151.470514choline binding protein
SPCG_14060171.068914acetyltransferase
SPCG_14070170.065132NAD synthetase
SPCG_1408019-3.028444nicotinate phosphoribosyltransferase
SPCG_1409019-6.437065hypothetical protein
SPCG_1410123-6.753961hypothetical protein
SPCG_1411019-1.203172hypothetical protein
SPCG_1412019-1.228928hypothetical protein
SPCG_1413119-0.748187hypothetical protein
SPCG_1414117-3.604578U32 family peptidase
SPCG_1415-119-5.225931hypothetical protein
SPCG_1416-122-6.579609U32 family peptidase
SPCG_1417334-10.202069hypothetical protein
SPCG_1418335-9.658357hypothetical protein
SPCG_1419435-9.535803type II DNA modification methyltransferase
SPCG_1420534-8.681872hypothetical protein
SPCG_1421535-9.038086AraC family transcriptional regulator
SPCG_1422634-7.968285ABC transporter ATP-binding protein/permease
SPCG_1423631-7.409064ABC transporter ATP-binding protein
SPCG_1424629-6.377054hypothetical protein
SPCG_1425428-5.589747hypothetical protein
SPCG_1426425-4.633675ABC transporter ATP-binding protein
SPCG_14272260.506783hypothetical protein
SPCG_14282210.636898hypothetical protein
SPCG_1429-1210.823917hypothetical protein
SPCG_1430-2141.251268hypothetical protein
SPCG_1431-291.169389hypothetical protein
SPCG_1432-290.315333IS66 family Orf1
SPCG_1433-290.427478GMP synthase
SPCG_14341170.536776GntR family transcriptional regulator
SPCG_14352190.353311membrane protein
SPCG_1436320-0.212006hypothetical protein
SPCG_1437422-0.280899cppA protein
SPCG_14384211.346664platelet activating factor
SPCG_14393192.119863cof family protein
21SPCG_1448SPCG_1460Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SPCG_14482171.096338amino acid ABC transporter permease
SPCG_1449420-3.993247hypothetical protein
SPCG_1450421-4.611774methylated-DNA--protein-cysteine
SPCG_1451319-4.488171acetyltransferase
SPCG_1452115-3.505585hypothetical protein
SPCG_1453-116-2.758305hemolysin
SPCG_1454-316-2.124171hypothetical protein
SPCG_1455-2180.387180ABC transporter ATP-binding protein
SPCG_14560271.814332glutamine amidotransferase subunit PdxT
SPCG_14571311.695094pyridoxal biosynthesis lyase PdxS
SPCG_14583301.278697NADH oxidase
SPCG_14591261.863775thiamine biosynthesis protein ApbE
SPCG_14602250.558117oxidoreductase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SPCG_1448HTHFIS290.027 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 28.6 bits (64), Expect = 0.027
Identities = 9/32 (28%), Positives = 18/32 (56%)

Query: 140 ETIRAAILSVNPGEIEAARSLGMTRAQVYRRV 171
I AA+ + +I+AA LG+ R + +++
Sbjct: 439 PLILAALTATRGNQIKAADLLGLNRNTLRKKI 470


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SPCG_1451SACTRNSFRASE300.003 Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 29.9 bits (67), Expect = 0.003
Identities = 24/124 (19%), Positives = 35/124 (28%), Gaps = 26/124 (20%)

Query: 50 GQAYVALEEGELLAYAAVTKSPEEAYEAIYEGNWQAGESEYLVFHRIAVAADVQGKGVAQ 109
A++ E + + NW + Y + IAVA D + KGV
Sbjct: 65 KAAFLYYLENNCIGRIKIRS------------NW----NGYALIEDIAVAKDYRKKGVGT 108

Query: 110 TFLEGLIE---GFDYLDFRSDTHAENKVMQHIFEKLGFKQVG-------KVPVDGERLAY 159
L IE + +T N H + K F P E +
Sbjct: 109 ALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGAVDTMLYSNFPTANEIAIF 168

Query: 160 QKLK 163
K
Sbjct: 169 WYYK 172


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SPCG_1458NUCEPIMERASE340.001 Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 34.0 bits (78), Expect = 0.001
Identities = 19/94 (20%), Positives = 33/94 (35%), Gaps = 20/94 (21%)

Query: 164 RIAVVGG-GYIGVELAEAFERLGKEVVLVDIVDTVLNGYYDKDFTQMMAKNLEDHNIRLA 222
+ V G G+IG +++ G +VV +D LN YYD +L+ + L
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGID----NLNDYYD--------VSLKQARLELL 49

Query: 223 LGQTVKAIEGD----GKVERLITDKESFDVDMVI 252
+ + D + L + V
Sbjct: 50 AQPGFQFHKIDLADREGMTDLFASGH---FERVF 80


22SPCG_1469SPCG_1480Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SPCG_14692310.967950Gfo/Idh/MocA family oxidoreductase
SPCG_14703351.671990DEAD/DEAH box helicase
SPCG_14714432.051223hypothetical protein
SPCG_14723351.073917hypothetical protein
SPCG_14733350.933542hypothetical protein
SPCG_14743320.953488elongation factor Tu
SPCG_1475319-0.425025glycerol uptake facilitator protein
SPCG_14762250.256345cell wall surface anchor family protein
SPCG_1477222-0.181454hypothetical protein
SPCG_14782250.298952hypothetical protein
SPCG_14793270.484156transposase, IS630-Spn1 related, Orf2
SPCG_14802230.029581hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SPCG_1474TCRTETOQM812e-18 Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 80.7 bits (199), Expect = 2e-18
Identities = 53/153 (34%), Positives = 81/153 (52%), Gaps = 10/153 (6%)

Query: 19 VNIGTIGHVDHGKTTLTAAI---TTVLARRLPSSVNQPKDYASIDAAPEERERGITINTA 75
+NIG + HVD GKTTLT ++ + + SV+ K D ER+RGITI T
Sbjct: 4 INIGVLAHVDAGKTTLTESLLYNSGAITE--LGSVD--KGTTRTDNTLLERQRGITIQTG 59

Query: 76 HVEYETEKRHYAHIDAPGHADYVKNMITGAAQMDGAILVVASTDGPMPQTREHILLSRQV 135
++ E ID PGH D++ + + +DGAIL++++ DG QTR R++
Sbjct: 60 ITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKM 119

Query: 136 GVKHLIVFMNKVDLVDDEELLELVEMEIRDLLS 168
G+ I F+NK+D + L V +I++ LS
Sbjct: 120 GIP-TIFFINKIDQNGID--LSTVYQDIKEKLS 149


23SPCG_1490SPCG_1504Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SPCG_14903232.503019hypothetical protein
SPCG_14912251.970624F0F1 ATP synthase subunit epsilon
SPCG_14921231.095228F0F1 ATP synthase subunit beta
SPCG_1493118-0.333330F0F1 ATP synthase subunit gamma
SPCG_1494-115-0.699702F0F1 ATP synthase subunit alpha
SPCG_1495015-3.425461F0F1 ATP synthase subunit delta
SPCG_1496015-2.772876F0F1 ATP synthase subunit B
SPCG_1497-116-2.081404F0F1 ATP synthase subunit A
SPCG_1498114-0.630464F0F1 ATP synthase subunit C
SPCG_1499114-0.240604IS1239 transposase
SPCG_1500217-0.059811acetyltransferase
SPCG_15012160.827928hypothetical protein
SPCG_15020140.929957transcription elongation factor GreA
SPCG_15031152.057616hypothetical protein
SPCG_15042182.253550acetyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SPCG_1500SACTRNSFRASE371e-05 Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 36.8 bits (85), Expect = 1e-05
Identities = 20/76 (26%), Positives = 35/76 (46%), Gaps = 3/76 (3%)

Query: 76 IAETFGNWLEIEYLFVKEELRGQGIGSKLLQQAESEAKNRNCCFAFVNTYQFQAP--DFY 133
I + + IE + V ++ R +G+G+ LL +A AK + C + T FY
Sbjct: 82 IRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFY 141

Query: 134 QKHGYKEVFSLQDYLY 149
KH + + ++ LY
Sbjct: 142 AKHHFI-IGAVDTMLY 156


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SPCG_1503PF03544300.029 Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 29.6 bits (66), Expect = 0.029
Identities = 29/160 (18%), Positives = 52/160 (32%), Gaps = 15/160 (9%)

Query: 50 LMADSLSTVEEIMRKAPTVPTHPSQGVPASPADEIQRETPGVPSHP-------SQDVPSS 102
++A L T + + P P P +PAD + P P + +P
Sbjct: 28 VVAGLLYTSVHQVIELPA-PAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEP 86

Query: 103 PAEE------SGSRPGPGPVRPKKLEREYNETPTRVAVSYTTGEKKAEQAGPETPTPATE 156
P E +P P P KK+E+ + + + E A + A
Sbjct: 87 PKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAAT 146

Query: 157 TVDIIRDTSRRSRREGAKPVKPKKEKKSHVKAFV-ISFLV 195
+ + S +P P + + ++ V + F V
Sbjct: 147 SKPVTSVASGPRALSRNQPQYPARAQALRIEGQVKVKFDV 186


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SPCG_1504SACTRNSFRASE327e-04 Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 31.8 bits (72), Expect = 7e-04
Identities = 19/71 (26%), Positives = 34/71 (47%), Gaps = 3/71 (4%)

Query: 85 GYISVTCLSIAKEAQGLGLGQKLLTALKEFALEDERDGINLTCHDYLIA---YYEKHGFV 141
GY + +++AK+ + G+G LL E+A E+ G+ L D I+ +Y KH F+
Sbjct: 88 GYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFI 147

Query: 142 NEGQSQSTFAG 152
++
Sbjct: 148 IGAVDTMLYSN 158


24SPCG_1523SPCG_1551Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SPCG_15233212.165636hypothetical protein
SPCG_15242211.907513cof family protein/peptidyl-prolyl cis-trans
SPCG_15251223.51720930S ribosomal protein S18
SPCG_1526-1173.106308single-stranded DNA-binding protein
SPCG_15270173.01368730S ribosomal protein S6
SPCG_1528-1153.168193asparaginyl-tRNA synthetase
SPCG_15291112.757866hypothetical protein
SPCG_1530-2152.823083aspartate aminotransferase
SPCG_1531-1171.976488hypothetical protein
SPCG_1532-1161.905728hypothetical protein
SPCG_1533-2172.738590hypothetical protein
SPCG_1534-3182.692208hypothetical protein
SPCG_1535-3182.928711hypothetical protein
SPCG_1536-1173.174110peptide deformylase
SPCG_1537-1173.265714glutathione S-transferase
SPCG_1538-1173.383766cation transporter E1-E2 family ATPase
SPCG_15391192.693161cation efflux family protein
SPCG_15400163.063249ABC transporter ATP-binding protein
SPCG_1541-2152.339750tRNA CCA-pyrophosphorylase
SPCG_1542-1151.479154dihydrodipicolinate reductase
SPCG_15431151.465779degV family protein
SPCG_1544-1142.161167hypothetical protein
SPCG_15450142.998423phosphoglucosamine mutase
SPCG_15460172.887057hypothetical protein
SPCG_15470183.531460hypothetical protein
SPCG_15480214.524327pyridine nucleotide-disulfide oxidoreductase
SPCG_15490214.130525hypothetical protein
SPCG_15501193.686441hypothetical protein
SPCG_15510193.080558hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SPCG_1530FLGPRINGFLGI290.028 Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 29.1 bits (65), Expect = 0.028
Identities = 8/21 (38%), Positives = 10/21 (47%)

Query: 31 DILSLTLGEPDFTTPKNIQDA 51
L L L PDF+T + D
Sbjct: 191 VNLVLQLRNPDFSTAVRVADV 211


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SPCG_1550FERRIBNDNGPP290.027 Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 28.8 bits (64), Expect = 0.027
Identities = 17/65 (26%), Positives = 25/65 (38%), Gaps = 3/65 (4%)

Query: 141 GTEVAGESHIVDHRGIIDNVYVTNALNDDTPLASRRVVQTILESDMIVLGPGSLFTSILP 200
+ A E+H+ + I ++ PL + I M+V GP SLF IL
Sbjct: 146 NLQSAAETHLAQYEDFIRSMKPRFVKRGARPLL---LTTLIDPRHMLVFGPNSLFQEILD 202

Query: 201 NIVIK 205
I
Sbjct: 203 EYGIP 207


25SPCG_1587SPCG_1618Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SPCG_15870163.448329glycosyl transferase family protein
SPCG_15880163.605146UDP-glucose 4-epimerase
SPCG_15892160.134413hypothetical protein
SPCG_1590218-2.495012oxidoreductase, DadA family protein
SPCG_1591220-4.500343hypothetical protein
SPCG_1592319-3.900728bcl-2 family protein
SPCG_1593317-1.902370hypothetical protein
SPCG_1594217-1.557996hypothetical protein
SPCG_1595116-0.742356hypothetical protein
SPCG_15960170.971880hypothetical protein
SPCG_15970191.671810snf2 family protein
SPCG_1598-2172.578968cation transporter E1-E2 family ATPase
SPCG_1599123-1.318940acyltransferase family protein
SPCG_1600123-2.273010cadmium resistance transporter
SPCG_16014331.80295830S ribosomal protein S15
SPCG_16021241.570495hypothetical protein
SPCG_16030211.704148hypothetical protein
SPCG_1604-2242.470124hypothetical protein
SPCG_1605-2232.694327hypothetical protein
SPCG_1606-2223.346780threonyl-tRNA synthetase
SPCG_16071242.825319sensor histidine kinase
SPCG_16080231.622673DNA-binding response regulator
SPCG_16090221.587933hypothetical protein
SPCG_16100210.821568rrf2 family protein
SPCG_1611-1220.337026hypothetical protein
SPCG_1612-123-0.926083iron-dependent transcriptional regulator
SPCG_1613-1211.166789IS1167, transposase
SPCG_1614-1233.294181hypothetical protein
SPCG_1615-3223.739242hypothetical protein
SPCG_1616-3224.114536hypothetical protein
SPCG_1617-3214.022102D-tyrosyl-tRNA(Tyr) deacylase
SPCG_1618-4203.270342GTP pyrophosphokinase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SPCG_1588NUCEPIMERASE1815e-57 Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 181 bits (462), Expect = 5e-57
Identities = 88/350 (25%), Positives = 149/350 (42%), Gaps = 48/350 (13%)

Query: 4 KILVTGGAGFIGTHTVIELIQAGHQVVVVDNLVNSNRKSLEV--VERITGVEIPFYEADI 61
K LVTG AGFIG H L++AGHQVV +DNL + SL+ +E + F++ D+
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 62 RDTATLRDIFKQEEPTGVIHFAGLKAVGESTRIPLAYYDNNIAGTVSLLKAMEENNCKNI 121
D + D+F V AV S P AY D+N+ G +++L+ N +++
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121

Query: 122 IFSSSATVYGDPHTVPILE----DFPLSVTNPYGRTKLMLEEI---LTDIYKADSEWNVV 174
+++SS++VYG +P D P+S Y TK E + + +Y
Sbjct: 122 LYASSSSVYGLNRKMPFSTDDSVDHPVS---LYAATKKANELMAHTYSHLYGLP----AT 174

Query: 175 LLRYFNPIGAHESGDLGENPNGIPNNLLPYVTQVAVGKLEQVQVFGDDYDTEDGTGVRDY 234
LR+F G P G P+ L T+ A+ + + + V+ G RD+
Sbjct: 175 GLRFFTVYG----------PWGRPDMALFKFTK-AMLEGKSIDVYN------YGKMKRDF 217

Query: 235 IHVVDLAKGHVAALKKIQKGSG---------------LNVYNLGTGKGYSVLEIIQNMEK 279
++ D+A+ + I VYN+G +++ IQ +E
Sbjct: 218 TYIDDIAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALED 277

Query: 280 AVGRPIPYRIVERRPGDIAACYSDPAKAKAELGWEAELDITQMCEDAWRW 329
A+G ++ +PGD+ +D +G+ E + ++ W
Sbjct: 278 ALGIEAKKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNW 327


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SPCG_1596TCRTETA300.014 Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 30.2 bits (68), Expect = 0.014
Identities = 48/277 (17%), Positives = 92/277 (33%), Gaps = 24/277 (8%)

Query: 90 VTLLLTSTDFSLFSVFFICSMNLISDTIGFLAGYMLTPIYIRLIND-----DMTEAMGFR 144
V+L + D+++ + + I + + G + I D + GF
Sbjct: 78 VSLAGAAVDYAIMATAPFLWVLYIGRIVAGITG-ATGAVAGAYIADITDGDERARHFGFM 136

Query: 145 QSTSSIVRLIGNLSGGVFLGLFSISTLAFVNVLTFLFAFLGSLLIRNRLKKEEEKIEVPP 204
+ + G + GG +G FS F FL + K E +
Sbjct: 137 SACFGFGMVAGPVLGG-LMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERR----- 190

Query: 205 YVGMSSFFQHLKESMKLLMTMEDVMVLLWILSISQAVLMMVEPVSAILLIHHPFMGLSTG 264
+ + S + M V L+ + I Q V + + I
Sbjct: 191 --PLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDR--FHWDAT 246

Query: 265 QSLAILIMISLLHVILGGLLSGFLSKKISIRLNIYWSLL--MESLIVIDFLRGS--FLLI 320
L +LH + +++G ++ ++ R + ++ I++ F I
Sbjct: 247 TIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPI 306

Query: 321 LLGSAGDAFSAGVLSPRLQAMIFGIIPEELMGSVQSS 357
++ A G+ P LQAM+ + EE G +Q S
Sbjct: 307 MVLLAS----GGIGMPALQAMLSRQVDEERQGQLQGS 339


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SPCG_1608HTHFIS796e-19 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 78.7 bits (194), Expect = 6e-19
Identities = 31/142 (21%), Positives = 66/142 (46%), Gaps = 3/142 (2%)

Query: 33 KILLIEDDQVIRQQIGKMLSEWGFEVVLVEDFMEVLSLFVQSEPHLVLMDIGLPLFNGYH 92
IL+ +DD IR + + LS G++V + + + + LV+ D+ +P N +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 93 WCQEIRKI-SKVPIMFLSSRDQAMDIVMAINMGADDFVTKPFDQQVLLAKVQGLL--RRS 149
I+K +P++ +S+++ M + A GA D++ KPFD L+ + L +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 150 YEFGRDESLLEYAGVILNTKSM 171
++ + ++ + +M
Sbjct: 125 RPSKLEDDSQDGMPLVGRSAAM 146


26SPCG_1650SPCG_1676Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SPCG_1650319-1.177040hypothetical protein
SPCG_1651226-0.580946hypothetical protein
SPCG_1652223-1.459144hypothetical protein
SPCG_1653122-0.734818hypothetical protein
SPCG_1654022-0.487132sugar ABC transporter permease
SPCG_1655-120-2.539023sugar ABC transporter permease
SPCG_1656-123-2.976513sugar ABC transporter sugar-binding protein
SPCG_1657024-3.610736PTS system transporter subunit IIBC
SPCG_1658-123-3.761453N-acetylmannosamine-6-phosphate 2-epimerase
SPCG_1659025-4.553741Gfo/Idh/MocA family oxidoreductase
SPCG_1660-121-3.163057neuraminidase B
SPCG_1661-217-2.164159ABC transporter permease
SPCG_1662-116-1.316380ABC transporter permease
SPCG_1663-216-0.157009ABC transporter substrate-binding protein
SPCG_1664-1172.450313hypothetical protein
SPCG_1665-1183.340257sialidase A (neuraminidase A)
SPCG_16660254.912435hypothetical protein
SPCG_1667-1235.358306hypothetical protein
SPCG_16680204.848518acetyl xylan esterase
SPCG_1669-2164.268485ATP-dependent DNA helicase RecG
SPCG_1670-1173.277162alanine racemase
SPCG_16710191.6723924'-phosphopantetheinyl transferase
SPCG_1672-1170.338814phospho-2-dehydro-3-deoxyheptonate aldolase
SPCG_1673-114-1.434434phospho-2-dehydro-3-deoxyheptonate aldolase
SPCG_1674-113-2.221583preprotein translocase subunit SecA
SPCG_1675220-6.730789hypothetical protein
SPCG_1676-113-3.973808ABC transporter ATP-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SPCG_1656MALTOSEBP300.026 Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 29.7 bits (66), Expect = 0.026
Identities = 31/104 (29%), Positives = 45/104 (43%), Gaps = 6/104 (5%)

Query: 71 FEKANPDIKVKLETIDFKSGPEKITTAIEAGTAPDVLFDAPGRIIQYGKNGKLAELNDLF 130
FEK + IKV +E D EK G PD++F A R Y ++G LAE+
Sbjct: 53 FEK-DTGIKVTVEHPD--KLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEIT--- 106

Query: 131 TDEFVKDVNNENIVQASKAGDKAYMYPISSAPFYMAMNKKMLED 174
D+ +D A + K YPI+ + NK +L +
Sbjct: 107 PDKAFQDKLYPFTWDAVRYNGKLIAYPIAVEALSLIYNKDLLPN 150


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SPCG_1657PREPILNPTASE320.003 Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 32.5 bits (74), Expect = 0.003
Identities = 38/146 (26%), Positives = 63/146 (43%), Gaps = 14/146 (9%)

Query: 72 LSLLLCVGLCIGLAKRDKGTAAL-AGVTGYLVMTATIKALVKLFMAEGSAIDTGVIGALV 130
L+ LL + + L D L +T L+ + L+ F++ G A+ + G LV
Sbjct: 135 LAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGDAVIGAMAGYLV 194

Query: 131 VGIV--AVYLHNR-----YNNIQLPSALGFFGGSRFVPIVTSFSSILIGFVFFVIWPPFQ 183
+ + A L Y + +L +ALG + G + +PIV SS L+G + +
Sbjct: 195 LWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSS-LVGAFMGIGLILLR 253

Query: 184 QLLVST----GGYISQAGPIGTFLYG 205
S G Y++ AG I L+G
Sbjct: 254 NHHQSKPIPFGPYLAIAGWIA-LLWG 278


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SPCG_1663MALTOSEBP330.003 Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 32.8 bits (74), Expect = 0.003
Identities = 60/268 (22%), Positives = 105/268 (39%), Gaps = 21/268 (7%)

Query: 72 TKIKIETFSWNDFYTKWTTGLANGNVPDISTALPNQVMEMVNSDALVPLNDSIKRIGQDK 131
T IK+ + K+ A G+ PDI ++ S L + + QDK
Sbjct: 57 TGIKVTVEHPDKLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEITPD--KAFQDK 114

Query: 132 FNETALNEAKIGDDYYSVPLYSHAQVMWVRTDLLKEHNIEVPKTWDQLYEASKKLKEAG- 190
+ + + P+ A + DLL PKTW+++ K+LK G
Sbjct: 115 LYPFTWDAVRYNGKLIAYPIAVEALSLIYNKDLLPNP----PKTWEEIPALDKELKAKGK 170

Query: 191 ---IYGLSVPFGTNDLMATRFLNFYVRSGGGSLLTKDLKADLTSQLAQDGIKYWVKLYKE 247
++ L P+ T L+A + + G KD+ D + A+ G+ + V L K
Sbjct: 171 SALMFNLQEPYFTWPLIAADG-GYAFKYENGKYDIKDVGVD--NAGAKAGLTFLVDLIKN 227

Query: 248 ISPQDSLNFNVLQQATLFYQGKTAFDFNSGFHIGGINANSPQLIDSIDAYPIPKIKESDK 307
++++ + A F +G+TA N + I+ + + P K + S
Sbjct: 228 KHMNADTDYSIAEAA--FNKGETAMTINGPWAWSNIDTSKVNY--GVTVLPTFKGQPSKP 283

Query: 308 DQGIETSNIPMVVWKNSKHPEVAKAFLE 335
G+ ++ I S + E+AK FLE
Sbjct: 284 FVGVLSAGINAA----SPNKELAKEFLE 307


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_1665  GPOSANCHOR    38     2e-04    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 38.1 bits (88), Expect = 2e-04
Identities = 19/133 (14%), Positives = 42/133 (31%), Gaps = 15/133 (11%)

Query: 21 QERKCRYSIRKLSVGAVSMIVGAVVFGTSPVLAQEGASEQPLANETQLSGESSTLTDTEK 80
YS+RKL G S+ V V G L T +T + T+
Sbjct: 4 NNTNRHYSLRKLKTGTASVAVALTVLGAG------------LVVNTNEVSAVATRSQTDT 51

Query: 81 SQPSSETELSGNKQEQERKDKQEEKIPRDYYARD--LENVETVIEKEDVETNASNGQRVD 138
+ E + E + + + A + + + + ++ +
Sbjct: 52 LEKVQE-RADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSE 110

Query: 139 LSSELDKLKKLEN 151
+S++ +L+ +
Sbjct: 111 KASKIQELEARKA 123


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_1670  ALARACEMASE   351    e-122    Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 351 bits (902), Expect = e-122
Identities = 129/365 (35%), Positives = 186/365 (50%), Gaps = 17/365 (4%)

Query: 14 RPTKALIHLGAIRQNIQQMGAHIPQGTLKWAVVKANAYGHGAVAVAKAIQDDVDGFCVSN 73
RP +A + L A++QN+ + + W+VVKANAYGHG + AI DGF + N
Sbjct: 3 RPIQASLDLQALKQNLSIVRQAATHARV-WSVVKANAYGHGIERIWSAI-GATDGFALLN 60

Query: 74 IDEAIELRQAGLSKPILIL-GVSEIEAVALAKEYDFTLTVAGLEWIQALLDKEVDLTGLT 132
++EAI LR+ G PIL+L G + + + ++ T V ++AL + + L
Sbjct: 61 LEEAITLRERGWKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKALQNARLKAP-LD 119

Query: 133 VHLKIDSGMGRIGFREVSEVEQAQDLLQKHGVCVEGIFTHFATADEESDDYFNAQLERFK 192
++LK++SGM R+GF+ + Q L V + +HFA A+ D + + R +
Sbjct: 120 IYLKVNSGMNRLGFQPDRVLTVWQQLRAMANVGEMTLMSHFAEAEHP--DGISGAMARIE 177

Query: 193 TILASMKEVPELVHASNSATTLWHVETIFNAVRMGDAMYGLNPSGAVLDL-PYDLIPALT 251
+ + SNSA TLWH E F+ VR G +YG +PSG D+ L P +T
Sbjct: 178 QAA---EGLECRRSLSNSAATLWHPEAHFDWVRPGIILYGASPSGQWRDIANTGLRPVMT 234

Query: 252 LESALVHVKTVPAGACMGYGATYQADSEQVIATVPIGYADGWTRDMQN-FSVLVDGQACP 310
L S ++ V+T+ AG +GYG Y A EQ I V GYADG+ R VLVDG
Sbjct: 235 LSSEIIGVQTLKAGERVGYGGRYTARDEQRIGIVAAGYADGYPRHAPTGTPVLVDGVRTM 294

Query: 311 IVGRVSMDQITIRLPKL--YPLGTKVTLIGSNGDKEITATQVATYRVTINYEVVCLLSDR 368
VG VSMD + + L +GT V L G KEI VA T+ YE++C L+ R
Sbjct: 295 TVGTVSMDMLAVDLTPCPQAGIGTPVELWG----KEIKIDDVAAAAGTVGYELMCALALR 350

Query: 369 IPREY 373
+P
Sbjct: 351 VPVVT 355


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_1671  ENTSNTHTASED  27     0.017    Enterobactin synthetase component D signature.
		>ENTSNTHTASED#Enterobactin synthetase component D signature.

Length = 234

Score = 26.9 bits (59), Expect = 0.017
Identities = 18/73 (24%), Positives = 31/73 (42%), Gaps = 5/73 (6%)

Query: 9 GIDIEELASIESAVTRHEGFAKRVLTAQEMERFTSLKGRRQIEYLAGRWSAKEAFSKAMG 68
GIDIE++ S + A ++ + E + + L +SAKE+ KA
Sbjct: 105 GIDIEKIMSQHT----ATELAPSIIDSDERQILQA-SLLPFPLALTLAFSAKESVYKAFS 159

Query: 69 TGISKLGFQDLEV 81
++ GF +V
Sbjct: 160 DRVTLPGFNSAKV 172


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_1674  SECA          1055   0.0      SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 1055 bits (2729), Expect = 0.0
Identities = 391/904 (43%), Positives = 561/904 (62%), Gaps = 71/904 (7%)

Query: 1 MANILKTIIENDKG-EIRRLEKMADKVFKYEDQMAALTDDQLKAKTVEFKERYQNGESLD 59
+ +L + + +RR+ K+ + + E +M L+D++LK KT EF+ R + GE L+
Sbjct: 2 LIKLLTKVFGSRNDRTLRRMRKVVNIINAMEPEMEKLSDEELKGKTAEFRARLEKGEVLE 61

Query: 60 SLLYEAFAVVREGAKRVLGLFPYKVQVMGGIVLHHGDVPEMRTGEGKTLTATMPVYLNAL 119
+L+ EAFAVVRE +KRV G+ + VQ++GG+VL+ + EMRTGEGKTLTAT+P YLNAL
Sbjct: 62 NLIPEAFAVVREASKRVFGMRHFDVQLLGGMVLNERCIAEMRTGEGKTLTATLPAYLNAL 121

Query: 120 SGKGVHVVTVNEYLSERDATEMGELYSWLGLSVGINLATKSPMEKKEAYECDITYSTNSE 179
+GKGVHVVTVN+YL++RDA L+ +LGL+VGINL K+EAY DITY TN+E
Sbjct: 122 TGKGVHVVTVNDYLAQRDAENNRPLFEFLGLTVGINLPGMPAPAKREAYAADITYGTNNE 181

Query: 180 IGFDYLRDNMVVRAENMVQRPLNYALVDEVDSILIDEARTPLIVSGANAVETSQLYHMAD 239
GFDYLRDNM E VQR L+YALVDEVDSILIDEARTPLI+SG + ++Y +
Sbjct: 182 YGFDYLRDNMAFSPEERVQRKLHYALVDEVDSILIDEARTPLIISGPAEDSS-EMYKRVN 240

Query: 240 HYVKSLNKD------------DYIIDVQSKTIGLSDSGIDRAESYF-------KLENLYD 280
+ L + + +D +S+ + L++ G+ E + E+LY
Sbjct: 241 KIIPHLIRQEKEDSETFQGEGHFSVDEKSRQVNLTERGLVLIEELLVKEGIMDEGESLYS 300

Query: 281 IENVALTHFIDNALRANYIMLLDIDYVVSEEQEILIVDQFTGRTMEGRRYSDGLHQAIEA 340
N+ L H + ALRA+ + D+DY+V ++ E++IVD+ TGRTM+GRR+SDGLHQA+EA
Sbjct: 301 PANIMLMHHVTAALRAHALFTRDVDYIV-KDGEVIIVDEHTGRTMQGRRWSDGLHQAVEA 359

Query: 341 KEGVPIQDETKTSASITYQNLFRMYKKLSGMTGTGKTEEEEFREIYNIRVIPIPTNRPVQ 400
KEGV IQ+E +T ASIT+QN FR+Y+KL+GMTGT TE EF IY + + +PTNRP+
Sbjct: 360 KEGVQIQNENQTLASITFQNYFRLYEKLAGMTGTADTEAFEFSSIYKLDTVVVPTNRPMI 419

Query: 401 RIDHSDLLYASIESKFKAVVEDVKARYQKGQPVLVGTVAVETSDYISKKLVAAGVPHEVL 460
R D DL+Y + K +A++ED+K R KGQPVLVGT+++E S+ +S +L AG+ H VL
Sbjct: 420 RKDLPDLVYMTEAEKIQAIIEDIKERTAKGQPVLVGTISIEKSELVSNELTKAGIKHNVL 479

Query: 461 NAKNHYREAQIIMNAGQRGAVTIATNMAGRGTDIKLG----------------------- 497
NAK H EA I+ AG AVTIATNMAGRGTDI LG
Sbjct: 480 NAKFHANEAAIVAQAGYPAAVTIATNMAGRGTDIVLGGSWQAEVAALENPTAEQIEKIKA 539

Query: 498 ------EGVRELGGLCVIGTERHESRRIDNQLRGRSGRQGDPGESQFYLSLEDDLMKRFG 551
+ V E GGL +IGTERHESRRIDNQLRGRSGRQGD G S+FYLS+ED LM+ F
Sbjct: 540 DWQVRHDAVLEAGGLHIIGTERHESRRIDNQLRGRSGRQGDAGSSRFYLSMEDALMRIFA 599

Query: 552 SERLKGIFERLNMSE-EAIESRMLTRQVEAAQKRVEGNNYDTRKQVLQYDDVMREQREII 610
S+R+ G+ +L M EAIE +T+ + AQ++VE N+D RKQ+L+YDDV +QR I
Sbjct: 600 SDRVSGMMRKLGMKPGEAIEHPWVTKAIANAQRKVESRNFDIRKQLLEYDDVANDQRRAI 659

Query: 611 YAQRYDVITADRDLAPEIQSMIKRTIERVVDGHARAKQDEK---LEAILNFAKYNLLPED 667
Y+QR +++ D++ I S+ + + +D + + E+ + + K + +
Sbjct: 660 YSQRNELLDVS-DVSETINSIREDVFKATIDAYIPPQSLEEMWDIPGLQERLKNDFDLDL 718

Query: 668 SIT--MEDLSGLSDKAIKEELFQRALKVYDSQVSKLRDEEAVKEFQKVLILRVVDNKWTD 725
I ++ L ++ ++E + ++++VY + + E ++ F+K ++L+ +D+ W +
Sbjct: 719 PIAEWLDKEPELHEETLRERILAQSIEVYQRKEEVVG-AEMMRHFEKGVMLQTLDSLWKE 777

Query: 726 HIDALDQLRNAVGLRGYAQNNPVVEYQAEGFRMFNDMIGSIEFDVTRLMMKAQIH----- 780
H+ A+D LR + LRGYAQ +P EY+ E F MF M+ S++++V + K Q+
Sbjct: 778 HLAAMDYLRQGIHLRGYAQKDPKQEYKRESFSMFAAMLESLKYEVISTLSKVQVRMPEEV 837

Query: 781 ----EQERPQAERHISTTATRNIAAHQASMP---EDLDLSQIGRNELCPCGSGKKFKNCH 833
+Q R +AER + A+ ++GRN+ CPCGSGKK+K CH
Sbjct: 838 EELEQQRRMEAERLAQMQQLSHQDDDSAAAAALAAQTGERKVGRNDPCPCGSGKKYKQCH 897

Query: 834 GKRQ 837
G+ Q
Sbjct: 898 GRLQ 901


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_1676  PF05272       31     0.004    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.8 bits (69), Expect = 0.004
Identities = 22/81 (27%), Positives = 35/81 (43%), Gaps = 6/81 (7%)

Query: 33 LKGDNGSGKTVLLKVLAG-YIKLDKGKVLQDGKVYGIKNHYIQDAGILIEKVEFLSHLSL 91
L+G G GK+ L+ L G D G K+ Y Q AGI+ ++ ++
Sbjct: 601 LEGTGGIGKSTLINTLVGLDFFSD--THFDIGTG---KDSYEQIAGIVAYELSEMTAFRR 655

Query: 92 RENLELLRYFSSKVTEKRIAY 112
+ + +FSS+ R AY
Sbjct: 656 ADAEAVKAFFSSRKDRYRGAY 676


27  SPCG_1696  SPCG_1720  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
SPCG_1696  -3      19       3.259746   sucrose-6-phosphate hydrolase
SPCG_1697  -2      20       3.446881   sucrose operon repressor
SPCG_1698  0       23       3.438141   3-hydroxy-3-methylglutaryl-CoA reductase
SPCG_1699  -1      20       1.539290   hydroxymethylglutaryl-CoA synthase
SPCG_1700  -3      18       2.559414   hypothetical protein
SPCG_1701  -3      16       2.645785   IS1381 transposase protein B
SPCG_1702  -3      17       3.233364   IS1381 transposase protein A, truncation
SPCG_1703  -2      17       3.752916   hypothetical protein
SPCG_1704  -2      18       4.271542   hypothetical protein
SPCG_1705  -1      18       4.622661   serine/threonine protein kinase
SPCG_1706  -1      16       4.618268   phosphatase
SPCG_1707  0       17       4.943245   rRNA methyltransferase RsmB
SPCG_1708  0       15       4.988463   methionyl-tRNA formyltransferase
SPCG_1709  0       15       4.515261   primosome assembly protein PriA
SPCG_1710  1       13       3.767516   primosome assembly protein PriA
SPCG_1711  -1      14       3.229342   DNA-directed RNA polymerase subunit omega
SPCG_1712  0       18       3.705054   guanylate kinase
SPCG_1713  0       20       3.753071   hypothetical protein
SPCG_1714  0       22       2.162161   hypothetical protein
SPCG_1715  0       24       2.971077   hypothetical protein
SPCG_1716  -1      25       3.902992   hypothetical protein
SPCG_1717  -2      24       2.960343   hypothetical protein
SPCG_1718  -2      21       3.051857   iojap-like protein
SPCG_1719  -2      21       2.872110   isochorismatase family protein
SPCG_1720  -1      24       3.545777   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_1713  RTXTOXIND     42     5e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 41.7 bits (98), Expect = 5e-06
Identities = 21/192 (10%), Positives = 56/192 (29%), Gaps = 18/192 (9%)

Query: 41 NAEQEATNLRGQAEREADLLVNEAKRESKSLKKEALLEAKEEARKYREEVDAEFKSERQE 100
+ R Q + L + + + +E R + +F + + +
Sbjct: 143 LLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLR-LTSLIKEQFSTWQNQ 201

Query: 101 LKQIESRLTERATSLDRKDDNLTSKEQTLEQKEQSISDRAK----------NLDAREEQL 150
Q E L ++ + E ++ + D + + +E +
Sbjct: 202 KYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKY 261

Query: 151 EEVERQKEAELERIG----ALSQAEARDIILAQTEENLTREIASRIREAEQEVKERSDKM 206
E + ++ + A+ + EI ++R+ + + ++
Sbjct: 262 VEAVNELRVYKSQLEQIESEILSAKEE---YQLVTQLFKNEILDKLRQTTDNIGLLTLEL 318

Query: 207 AKDILVQAMQRI 218
AK+ Q I
Sbjct: 319 AKNEERQQASVI 330


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_1720  ENTSNTHTASED  28     0.019    Enterobactin synthetase component D signature.
		>ENTSNTHTASED#Enterobactin synthetase component D signature.

Length = 234

Score = 28.1 bits (62), Expect = 0.019
Identities = 11/42 (26%), Positives = 19/42 (45%), Gaps = 1/42 (2%)

Query: 27 LTHCLGVERAAMELAQRFGVDVEKASLAGLLHDYAKKLSDQE 68
++HC A+ QR G+D+EK + A + D +
Sbjct: 88 ISHCATT-ALAVISRQRIGIDIEKIMSQHTATELAPSIIDSD 128


28  SPCG_1872  SPCG_1883  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
SPCG_1872  1       22       -3.331610  alpha-galactosidase
SPCG_1873  0       19       -3.387795  msm operon regulatory protein
SPCG_1874  1       18       -4.001357  biotin--protein ligase
SPCG_1875  1       16       -3.773506  *******************RNA methyltransferase
SPCG_1876  0       13       -1.681654  aminoglycoside phosphotransferase
SPCG_1877  0       15       -0.918637  recombination regulator RecX
SPCG_1878  2       17       0.016900   hypothetical protein
SPCG_1879  1       14       1.151847   hypothetical protein
SPCG_1880  2       14       2.369723   IS1167, transposase
SPCG_1881  3       17       3.966109   chaperonin GroEL
SPCG_1882  1       17       3.946437   co-chaperonin GroES
SPCG_1883  1       18       3.279053   single-stranded DNA-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_1876  CARBMTKINASE  31     0.006    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 30.6 bits (69), Expect = 0.006
Identities = 15/47 (31%), Positives = 24/47 (51%), Gaps = 13/47 (27%)

Query: 162 KEKALKNYQK--GTFYNKTLLPQFHIHSREEAYQLIQEKGYILKADA 206
+ A +N K G FY++ E A +L +EKG+I+K D+
Sbjct: 122 NDPAFQNPTKPVGPFYDE-----------ETAKRLAREKGWIVKEDS 157


29  SPCG_1899  SPCG_1906  Y  N  Y  Genomic Island
LocusTag   DNBias  CDNBias  %GCBias    Product
SPCG_1899  3       19       -1.104278  hypothetical protein
SPCG_1900  2       19       -2.080057  hypothetical protein
SPCG_1901  0       22       -2.815158  hypothetical protein
SPCG_1902  1       24       -4.258825  degenerate transposase
SPCG_1903  -1      22       -4.484725  degenerate transposase
SPCG_1904  1       26       -5.225466  hypothetical protein
SPCG_1905  3       29       -6.059322  hypothetical protein
SPCG_1906  4       33       -6.780550  hypothetical protein
30  SPCG_1915  SPCG_1958  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
SPCG_1915  1       17       -3.262310  transcriptional regulator
SPCG_1916  -1      14       -2.314685  acetyltransferase
SPCG_1917  -1      14       -2.341701  hypothetical protein
SPCG_1918  -2      13       -2.164831  hypothetical protein
SPCG_1919  0       16       1.829576   transcriptional regulator PlcR
SPCG_1920  3       24       3.613946   hypothetical protein
SPCG_1921  4       25       3.577540   hypothetical protein
SPCG_1922  6       33       4.429698   ABC transporter ATP-binding protein
SPCG_1923  5       30       4.062440   nucleoside diphosphate kinase
SPCG_1924  4       29       4.059540   DNA-directed RNA polymerase subunit beta'
SPCG_1925  2       21       3.607749   hypothetical protein
SPCG_1926  -1      16       2.432610   hypothetical protein
SPCG_1927  -3      17       2.403901   *hypothetical protein
SPCG_1928  -2      16       2.632196   hypothetical protein
SPCG_1929  -3      16       3.454342   DNA-entry nuclease
SPCG_1930  -2      14       3.026179   UDP-N-acetylglucosamine
SPCG_1931  -2      14       1.716366   hypothetical protein
SPCG_1932  -2      13       1.529993   phosphopantetheine adenylyltransferase
SPCG_1933  -1      13       1.785977   type II DNA modification methyltransferase
SPCG_1934  -2      15       1.973312   asparagine synthetase AsnA
SPCG_1935  -1      15       1.936863   hypothetical protein
SPCG_1936  -2      14       3.217786   membrane protein
SPCG_1937  -2      16       3.089537   spoU rRNA methylase family protein
SPCG_1938  -2      19       3.586946   acylphosphatase
SPCG_1939  -1      19       4.004370   OxaA-like protein precursor
SPCG_1940  -1      20       4.083328   pyruvate formate-lyase-activating enzyme
SPCG_1941  -2      20       4.053570   diaminopimelate decarboxylase
SPCG_1942  -2      19       3.849463   pur operon repressor
SPCG_1943  -3      22       4.621231   cmp-binding-factor 1
SPCG_1944  -3      23       4.471716   competence-induced protein Ccs50
SPCG_1945  -1      22       3.306290   hypothetical protein
SPCG_1946  -2      19       3.079034   ribulose-phosphate 3-epimerase
SPCG_1947  -2      18       3.067926   ribosome-associated GTPase
SPCG_1948  -1      17       1.054029   dimethyladenosine transferase
SPCG_1949  1       17       0.654378   hypothetical protein
SPCG_1950  0       18       1.403433   ABC transporter ATP-binding protein
SPCG_1951  1       17       1.679387   immunity protein
SPCG_1952  1       20       0.932664   hypothetical protein
SPCG_1953  0       20       0.985479   transcriptional regulator PlcR
SPCG_1954  0       17       4.166313   primase-like protein
SPCG_1955  -1      16       2.941061   hydrolase
SPCG_1956  -2      16       2.547402   hypothetical protein
SPCG_1957  -3      12       2.836972   cell wall surface anchor family protein
SPCG_1958  -1      17       3.579385   50S ribosomal protein L34
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_1916  SACTRNSFRASE  50     4e-10    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 49.6 bits (118), Expect = 4e-10
Identities = 29/90 (32%), Positives = 42/90 (46%), Gaps = 4/90 (4%)

Query: 59 QITLLAFLNGKIAGIVNITADQRKRVRHIGDLFIVIGKRYWNNGLGSLLLEEAIEWAQAS 118
+ L +L G + I ++ I D I + K Y G+G+ LL +AIEWA+ +
Sbjct: 65 KAAFLYYLENNCIGRIKIRSNWNGYA-LIED--IAVAKDYRKKGVGTALLHKAIEWAKEN 121

Query: 119 GILRRLQLTVQTRNQAAVHLYQKHGFVIEG 148
L L Q N +A H Y KH F+I
Sbjct: 122 HFCG-LMLETQDINISACHFYAKHHFIIGA 150


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_1921  TYPE3IMSPROT  31     0.014    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 31.3 bits (71), Expect = 0.014
Identities = 30/156 (19%), Positives = 49/156 (31%), Gaps = 22/156 (14%)

Query: 122 LLREELSQLGLTNMHLTIPSKLSTLMAIFSNGFQLISLLIFILTFVAL--TLISQISQ-- 177
E S+L L + L + N L F L VA + S + Q
Sbjct: 48 YYFEHFSKLMLIPAEQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYG 107

Query: 178 ---------LRSSGIRLISGEKR------WSIFLRPVGEDLKAIAVGFSLAGVLAILMQK 222
I I G KR FL+ + LK + + + ++ +
Sbjct: 108 FLISGEAIKPDIKKINPIEGAKRIFSIKSLVEFLKSI---LKVVLLSILIWIIIKGNLVT 164

Query: 223 ILSLPTQSLMTIGAGLLSYNLILLSISLFFAQLFAV 258
+L LPT + I L L+ I + ++
Sbjct: 165 LLQLPTCGIECITPLLGQILRQLMVICTVGFVVISI 200


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_1932  LPSBIOSNTHSS  156    2e-51    Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature.

Length = 166

Score = 156 bits (395), Expect = 2e-51
Identities = 59/155 (38%), Positives = 95/155 (61%), Gaps = 1/155 (0%)

Query: 5 IGLFTGSFDPMTNGHLDIIERASRLFDKLYVGIFFNPHKQGFLPIENRKRGLEKAVKHLG 64
++ GSFDP+T GHLDIIER RLFD++YV + NP+KQ ++ R + KA+ HL
Sbjct: 2 NAIYPGSFDPITFGHLDIIERGCRLFDQVYVAVLRNPNKQPMFSVQERLEQIAKAIAHLP 61

Query: 65 NVKVVSSHDELVVDVAKRLGATCLVRGLRNASDLQYEASFDYYNHQLSPDIETIYLHSRP 124
N +V S L V+ A++ A ++RGLR SD + E N L+ D+ET++L +
Sbjct: 62 NAQVDSFEG-LTVNYARQRQAGAILRGLRVLSDFELELQMANTNKTLASDLETVFLTTST 120

Query: 125 EHLYISSSGVRELLKFGQDIACYVPESILEEIRNE 159
E+ ++SSS V+E+ +FG ++ +VP + + ++
Sbjct: 121 EYSFLSSSLVKEVARFGGNVEHFVPSHVAAALYDQ 155


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_1939  60KDINNERMP   117    1e-31    60kDa inner membrane protein signature.
		>60KDINNERMP#60kDa inner membrane protein signature.

Length = 548

Score = 117 bits (294), Expect = 1e-31
Identities = 61/225 (27%), Positives = 109/225 (48%), Gaps = 21/225 (9%)

Query: 35 GFIWNTIGAPMAEAIKYFATDKGLGFGVAIIIVTIIVRLIILPLGIYQSWKATLHSEKMN 94
G++W I P+ + +K+ + G +G +III+T IVR I+ PL + + KM
Sbjct: 331 GWLW-FISQPLFKLLKWIHSFVG-NWGFSIIIITFIVRGIMYPLT-KAQYTSMA---KMR 384

Query: 95 ALKHVLEPHQTRLKEATTQEEKLEAQQALFAAQKEHGISMFGGVGCFPILLQMPFFSAIY 154
L +P ++E +++ Q + A K ++ GG CFP+L+QMP F A+Y
Sbjct: 385 ML----QPKIQAMRERLGDDKQ-RISQEMMALYKAEKVNPLGG--CFPLLIQMPIFLALY 437

Query: 155 FAAQHTEGVAQASYLG----IPLGSPSMILVACAGVLYYLQSLLSLHGVEDEMQREQIKK 210
+ + + QA + + P IL GV + +S V D MQ +K
Sbjct: 438 YMLMGSVELRQAPFALWIHDLSAQDPYYILPILMGVTMFFIQKMSPTTVTDPMQ----QK 493

Query: 211 MIYMSPLMIVVFSLFSPASVTLYWVVGGFMMILQQFIVNYIVRPK 255
++ P++ VF L+ P+ + LY++V + I+QQ ++ + +
Sbjct: 494 IMTFMPVIFTVFFLWFPSGLVLYYIVSNLVTIIQQQLIYRGLEKR 538


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_1946  FLGHOOKAP1    28     0.036    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 28.0 bits (62), Expect = 0.036
Identities = 8/31 (25%), Positives = 14/31 (45%)

Query: 32 LAADYANFEREIKRLEATGAEYAHIDIMDSH 62
A A+ +I RL GA + +++D
Sbjct: 171 YAKQIASLNDQISRLTGVGAGASPNNLLDQR 201


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_1957  GPOSANCHOR    35     4e-04    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 34.7 bits (79), Expect = 4e-04
Identities = 36/153 (23%), Positives = 65/153 (42%), Gaps = 7/153 (4%)

Query: 145 AQSQASKQLATEKESAKNAIEKAAKDKQDEIKGAPLSDKEKAELLARVEAEKQAALKEI- 203
A +A KQ+ E A + + K ++ + L++KEKAEL A++EAE +A +++
Sbjct: 390 ASREAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEAKALKEKLA 449

Query: 204 ENAKTMEDVKEAETIGVQAIVMVTVPKRPVTPNAAPKTTSTPQATAGTMQDVTYQSPAGK 263
+ A+ + ++ + Q P A P PQA Q+ +
Sbjct: 450 KQAEELAKLRAGKASDSQT------PDAKPGNKAVPGKGQAPQAGTKPNQNKAPMKETKR 503

Query: 264 QLPNTGSASSAALASLGLVVATSGFALLGRKTR 296
QLP+TG ++ + L V + K +
Sbjct: 504 QLPSTGETANPFFTAAALTVMATAGVAAVVKRK 536


31  SPCG_2038  SPCG_2069  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
SPCG_2038  -1      19       3.314119   hypothetical protein
SPCG_2039  0       20       3.282256   glutamine amidotransferase, class-I
SPCG_2040  -1      16       3.169162   ABC transporter ATP-binding protein/permease
SPCG_2041  -2      17       3.323097   degenerate transposase
SPCG_2042  -3      15       3.199396   ABC transporter ATP-binding protein/permease
SPCG_2043  -3      14       2.707746   DNA mismatch repair protein MutS
SPCG_2044  -2      13       1.074475   transcriptional repressor
SPCG_2045  -1      16       1.029634   arginyl-tRNA synthetase
SPCG_2046  0       18       0.547193   IS1381 transposase protein B
SPCG_2047  2       19       0.432064   hypothetical protein
SPCG_2048  2       20       0.266823   response regulator
SPCG_2049  2       21       0.067615   sensor histidine kinase PnpS
SPCG_2050  2       21       -0.592881  phosphate ABC transporter substrate-binding
SPCG_2051  3       20       -1.083364  phosphate ABC transporter permease
SPCG_2052  1       18       -0.524943  phosphate ABC transporter permease
SPCG_2053  -2      16       0.440338   phosphate transporter ATP-binding protein
SPCG_2054  -2      15       1.028723   phosphate transport system regulatory protein
SPCG_2055  -1      13       1.311218   degenerate transposase
SPCG_2056  -3      13       2.826452   transcriptional regulator
SPCG_2057  -3      14       4.143847   NAD(P)H-dependent glycerol-3-phosphate
SPCG_2058  -3      13       3.674733   UTP-glucose-1-phosphate uridylyltransferase
SPCG_2059  -1      16       3.271181   hypothetical protein
SPCG_2060  -3      12       3.328201   5-formyltetrahydrofolate cyclo-ligase
SPCG_2061  -3      11       2.021380   M20/M25/M40 family peptidase
SPCG_2062  -2      10       1.053729   2,3,4,5-tetrahydropyridine-2,6-carboxylate
SPCG_2063  -2      10       0.631490   hypothetical protein
SPCG_2064  -2      10       0.219303   penicillin-binding protein 1B
SPCG_2065  1       25       -0.385514  tyrosyl-tRNA synthetase
SPCG_2066  1       26       -0.295105  cation transporter E1-E2 family ATPase
SPCG_2067  3       39       0.519916   hypothetical protein
SPCG_2068  2       40       0.832399   rRNA (guanine-N1-)-methyltransferase
SPCG_2069  2       42       0.982647   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_2044  ARGREPRESSOR  177    2e-60    Bacterial arginine repressor signature.
		>ARGREPRESSOR#Bacterial arginine repressor signature.

Length = 149

Score = 177 bits (451), Expect = 2e-60
Identities = 46/149 (30%), Positives = 81/149 (54%), Gaps = 4/149 (2%)

Query: 1 MRKRDRHQLIKKMITEEKLSTQKEIQDRLEAHNVCVTQTTLSRDLREIGLTKVKKNDMVY 60
M K RH I+++IT ++ TQ E+ D L+ VTQ T+SRD++E+ L KV N+ Y
Sbjct: 1 MNKGQRHIKIREIITANEIETQDELVDILKKDGYNVTQATVSRDIKELHLVKVPTNNGSY 60

Query: 61 YVLVNETEKIDLVEFLSHHLEG----VARAEFTLVLHTKLGEASVLANIVDVNKDKWILG 116
+ ++ + + L L + A +VL T G A + ++D + I+G
Sbjct: 61 KYSLPADQRFNPLSKLKRSLMDAFVKIDSASHLIVLKTMPGNAQAIGALMDNLDWEEIMG 120

Query: 117 TVAGANTLLVICRDQHVAKLMEDRLLDLM 145
T+ G +T+L+ICR K+++ ++L+L+
Sbjct: 121 TICGDDTILIICRTHDDTKVVQKKILELL 149


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_2048  HTHFIS        96     4e-25    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 96.4 bits (240), Expect = 4e-25
Identities = 32/127 (25%), Positives = 61/127 (48%), Gaps = 1/127 (0%)

Query: 1 MTKQ-VLLVDDEEHILKLLDYHLSKEGFSTQLVTNGRKALALAETEPFDFILLDIMLPQL 59
MT +L+ DD+ I +L+ LS+ G+ ++ +N D ++ D+++P
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 60 DGMEVCKRLRAKGVKTPIMMVSAKSDEFDKVLALELGADDYLTKPFSPRELLARVKAVLR 119
+ ++ R++ P++++SA++ + A E GA DYL KPF EL+ + L
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 120 RTKGEQE 126
K
Sbjct: 121 EPKRRPS 127


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_2049  PF06580       43     1e-06    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 43.3 bits (102), Expect = 1e-06
Identities = 38/186 (20%), Positives = 76/186 (40%), Gaps = 33/186 (17%)

Query: 261 IYKESLRLEHIVEHLLTLSKA--QQMPIQWTTLSL-AEFVQDLTQSLQPQLKKKDLQLKV 317
I ++ + ++ L L + + + +L+ V Q Q + + LQ +
Sbjct: 186 ILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDR-LQFEN 244

Query: 318 QVPDDVTLVSDSQLLSQILLNLLSNAIRY----TEQGGKIEVKTQKVNEGIKISVSDTGI 373
Q+ + D Q+ ++ L+ N I++ QGGKI +K K N + + V +TG
Sbjct: 245 QINPAI---MDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGS 301

Query: 374 GISQLEQDRIFERFYRVNKGRSRQTGGTGLGLAIVKELSQLLGG---QVTVTSQLGRGSC 430
+ ++ TG GL V+E Q+L G Q+ ++ + G+ +
Sbjct: 302 LALKNTKE------------------STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNA 343

Query: 431 FTIFLP 436
+ +P
Sbjct: 344 -MVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_2058  PF04605       29     0.009    Virulence-associated protein D (VapD)
		>PF04605#Virulence-associated protein D (VapD)

Length = 125

Score = 29.1 bits (65), Expect = 0.009
Identities = 13/70 (18%), Positives = 24/70 (34%), Gaps = 11/70 (15%)

Query: 231 NEIQLTDAIDTLNKTQRVFAREFKGAR-YDVGDKFGFMKTSIDYALKHPQVKDDLKNYLI 289
NE ++ ++ L K K ++G+++ ++D
Sbjct: 55 NERRVIRIVNKLTKKFTWLGECVKEFDITEIGEQYSLK----------ETIQDLCAKDFH 104

Query: 290 QLGKELTEKE 299
Q KE TEK
Sbjct: 105 QKLKEFTEKT 114


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_2063  TCRTETA       31     0.005    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 31.3 bits (71), Expect = 0.005
Identities = 31/159 (19%), Positives = 51/159 (32%), Gaps = 9/159 (5%)

Query: 136 LPFLAYAILGIFSVQYFFYLCVEYSNATTATILQFISPVFILFYNRLVYQKRASKSAVFY 195
PF A A L + +L E + + F A+ AVF+
Sbjct: 161 APFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFF 220

Query: 196 V--LVAMLGVCLMATKG-DLSQLSMTPLALITGLLSAMGVMFNVILPQPFAKRYGFVPTV 252
+ LV + L G D T + + + + ++ P A R G +
Sbjct: 221 IMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRAL 280

Query: 253 GWGMILAGLFSNVLSPVYQLSFTLDIWSILICLIIAFFG 291
GMI G +L L+F W +++ G
Sbjct: 281 MLGMIADGT-GYIL-----LAFATRGWMAFPIMVLLASG 313


32  SPCG_2096  SPCG_2132  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
SPCG_2096  0       26       -4.617123  transketolase, C-terminal subunit
SPCG_2097  0       25       -4.405868  transketolase
SPCG_2098  0       23       -4.006284  PTS system ascorbate-specific transporter
SPCG_2099  -1      21       -3.973042  PTS system transporter subunit IIB
SPCG_2100  1       25       -5.208943  BglG family transcriptional regulator
SPCG_2101  3       29       -4.043594  hypothetical protein
SPCG_2102  5       34       -5.497389  hypothetical protein
SPCG_2103  5       32       -5.656844  50S ribosomal protein L32
SPCG_2104  4       29       -5.714703  50S ribosomal protein L33
SPCG_2105  3       29       -5.725127  choline binding protein PcpA
SPCG_2106  -2      17       2.658391   degenerate transposase
SPCG_2107  0       18       4.006861   degenerate transposase
SPCG_2108  1       20       4.435437   degenerate transposase
SPCG_2109  0       21       4.847034   hypothetical protein
SPCG_2110  0       22       4.980831   hypothetical protein
SPCG_2111  0       23       5.403042   glycosyl hydrolase-like protein
SPCG_2112  -1      21       5.390975   ROK family protein
SPCG_2113  -2      19       4.777983   hypothetical protein
SPCG_2114  -2      19       4.509065   hypothetical protein
SPCG_2115  -2      18       3.918754   hypothetical protein
SPCG_2116  -3      19       3.790521   cell wall antigen
SPCG_2117  -3      19       3.147538   hypothetical protein
SPCG_2118  -1      26       2.227871   arginine deiminase
SPCG_2119  -1      20       2.299821   ornithine carbamoyltransferase
SPCG_2120  -1      15       2.144487   carbamate kinase
SPCG_2121  0       13       2.079328   hypothetical protein
SPCG_2122  0       12       1.830912   hypothetical protein
SPCG_2123  1       19       -2.322122  hypothetical protein
SPCG_2124  1       21       -3.462184  hypothetical protein
SPCG_2125  2       24       -4.468643  iron-containing alcohol dehydrogenase
SPCG_2126  3       26       -5.179843  L-fucose isomerase
SPCG_2127  3       29       -6.646046  L-fuculose phosphate aldolase
SPCG_2128  4       30       -7.341120  hypothetical protein
SPCG_2129  2       30       -6.824980  alpha-galactosidase
SPCG_2130  1       28       -6.310111  hypothetical protein
SPCG_2131  0       28       -6.174134  glycosyl hydrolase
SPCG_2132  0       19       -4.444743  sugar ABC transporter permease
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_2100  LPSBIOSNTHSS  29     0.045    Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature.

Length = 166

Score = 28.6 bits (64), Expect = 0.045
Identities = 17/94 (18%), Positives = 36/94 (38%), Gaps = 16/94 (17%)

Query: 433 DLQVKASSDYDMVFSTIKVETEKPNYLVSVMMTEEQAIQLVELVLKDFPNLEYGDFEIEQ 492
D+ + +D V+ + K M + ++ ++ + + PN + FE
Sbjct: 18 DIIERGCRLFDQVYVAVLRNPNK-----QPMFSVQERLEQIAKAIAHLPNAQVDSFE-GL 71

Query: 493 ILNIVKRYGI--ITQ--------ELELRLALKNY 516
+N ++ I + ELEL++A N
Sbjct: 72 TVNYARQRQAGAILRGLRVLSDFELELQMANTNK 105


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_2118  ARGDEIMINASE  552    0.0      Bacterial arginine deiminase signature.
		>ARGDEIMINASE#Bacterial arginine deiminase signature.

Length = 409

Score = 552 bits (1425), Expect = 0.0
Identities = 190/408 (46%), Positives = 270/408 (66%), Gaps = 8/408 (1%)

Query: 5 PIQVFSEIGKLKKVMLHRPGKELENLLPDYLERLLFDDIPFLEDAQKEHDAFAQALRDEG 64
PI +FSEIG+LKKV+LHRPG+ELENL P ++ LFDDIP+LE A++EH+ FA L++
Sbjct: 7 PINIFSEIGRLKKVLLHRPGEELENLTPFIMKNFLFDDIPYLEVARQEHEVFASILKNNL 66

Query: 65 IEVLYLEQLAAESLTSP-EIRDQFIEEYLDEANIRDRQTKVAIRELLHGIKDNQELVEKT 123
+E+ Y+E L +E L S + ++FI +++ EA I+ T +++ ++ K
Sbjct: 67 VEIEYIEDLISEVLVSSVALENKFISQFILEAEIKTDFTINLLKDYFSS-LTIDNMISKM 125

Query: 124 MAGIQKVELPEIPDEAKDLTDLVESDYPFAIDPMPNLYFTRDPFATIGNAVSLNHMFADT 183
++G+ EL L DLV F IDPMPN+ FTRDPFA+IGN V++N MF
Sbjct: 126 ISGVVTEELKNYT---SSLDDLVNGANLFIIDPMPNVLFTRDPFASIGNGVTINKMFTKV 182

Query: 184 RNRETLYGKYIFKYHPIYGGKVDLVYNREEDTRIEGGDELVLSKDVLAVGISQRTDAASI 243
R RET++ +YIFKYHP+Y V + NR E+ +EGGDELVL+K +L +GIS+RT+A S+
Sbjct: 183 RQRETIFAEYIFKYHPVYKENVPIWLNRWEEASLEGGDELVLNKGLLVIGISERTEAKSV 242

Query: 244 EKLLVNIFKKNVGFKKVLAFEFANNRKFMHLDTVFTMVDYDKFTIHPEIEGDLHVYSVTY 303
EKL +++FK F +LAF+ NR +MHLDTVFT +DY FT + +Y +TY
Sbjct: 243 EKLAISLFKNKTSFDTILAFQIPKNRSYMHLDTVFTQIDYSVFTSFTSDDMYFSIYVLTY 302

Query: 304 ENEK--LKIVEEKGDLAELLAQNLGVEKVHLIRCGGGNIVAAAREQWNDGSNTLTIAPGV 361
+ I +EK + ++L+ LG K+ +I+C GG+++ AREQWNDG+N L IAPG
Sbjct: 303 NPSSSKIHIKKEKARIKDVLSFYLG-RKIDIIKCAGGDLIHGAREQWNDGANVLAIAPGE 361

Query: 362 VVVYDRNTVTNKILEEYGLRLIKIRGSELVRGRGGPRCMSMPFEREEV 409
++ Y RN VTNK+ EE G+++ +I SEL RGRGGPRCMSMP RE++
Sbjct: 362 IIAYSRNHVTNKLFEENGIKVHRIPSSELSRGRGGPRCMSMPLIREDI 409


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_2120  CARBMTKINASE  406    e-146    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 406 bits (1046), Expect = e-146
Identities = 139/312 (44%), Positives = 204/312 (65%), Gaps = 5/312 (1%)

Query: 4 RKIVVALGGNAIL--SSDPSAKAQQEALVETAKHLVKLIKNGDDLIITHGNGPQVGNLLL 61
+++V+ALGGNA+ S + + + +TA+ + ++I G +++ITHGNGPQVG+LLL
Sbjct: 3 KRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSLLL 62

Query: 62 QHLASDSEKN-PAFPLDSLVAMTEGSIGFWLKNALQNALLDEGIEKNVASVVTQVVVDKN 120
A + PA P+D AM++G IG+ ++ AL+N L G+EK V +++TQ +VDKN
Sbjct: 63 HMDAGQATYGIPAQPMDVAGAMSQGWIGYMIQQALKNELRKRGMEKKVVTIITQTIVDKN 122

Query: 121 DPAFVNLSKPIGPFYSEEEAKAEAEKSGATFKEDAGRGWRKVVASPKPVDIKEIETIRTL 180
DPAF N +KP+GPFY EE AK A + G KED+GRGWR+VV SP P E ETI+ L
Sbjct: 123 DPAFQNPTKPVGPFYDEETAKRLAREKGWIVKEDSGRGWRRVVPSPDPKGHVEAETIKKL 182

Query: 181 LNNGQVVVAAGGGGIPVVKENNGHLTGVEAVIDKDFASQRLAELVDADLFIVLTGVDYVF 240
+ G +V+A+GGGG+PV+ E +G + GVEAVIDKD A ++LAE V+AD+F++LT V+
Sbjct: 183 VERGVIVIASGGGGVPVILE-DGEIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVNGAA 241

Query: 241 VNYNKPNQEKLEHVNVAQLEEYIKQDQFAPGSMLPKVEAAIAFVNGRPEGKAVITSLENL 300
+ Y ++ L V V +L +Y ++ F GSM PKV AAI F+ +A+I LE
Sbjct: 242 LYYGTEKEQWLREVKVEELRKYYEEGHFKAGSMGPKVLAAIRFIE-WGGERAIIAHLEKA 300

Query: 301 GALIESESGTII 312
+E ++GT +
Sbjct: 301 VEALEGKTGTQV 312


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_2121  RTXTOXINA     37     2e-04    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 37.3 bits (86), Expect = 2e-04
Identities = 28/134 (20%), Positives = 50/134 (37%), Gaps = 19/134 (14%)

Query: 279 FIPWTDLGVTIF-DDFNAWLTGLPVIGNIVGSSTSALGTWYFPEGAMLFAFMGILIGVIY 337
I T+ GVTIF + L GNI+G +G G +L F L +
Sbjct: 99 LIGLTERGVTIFAPQLDKLLQKYQKAGNILGGGAENIGDNLGKAGGILSTFQNFLGTALS 158

Query: 338 GLKEDKIISSFMNG----------AADLLSVALIVAIARGIQVIMNDGMITDTILNWGK- 386
+K D++I +G A+ L L+ +A + + + G
Sbjct: 159 SMKIDELIKKQKSGGNVSSSELAKASIELINQLVDTVASLNNNV---NSFSQQLNTLGSV 215

Query: 387 ----EGLSGLSSQV 396
+ L+G+ +++
Sbjct: 216 LSNTKHLNGVGNKL 229


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_2123  GPOSANCHOR    33     1e-04    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 32.7 bits (74), Expect = 1e-04
Identities = 15/72 (20%), Positives = 32/72 (44%)

Query: 12 ERKQRFSLRKYAIGACSVLLGTSLFFAGMGAQPVQDTETSSALISSHYLDEQDLSEKLKS 71
+ +SLRK G SV + ++ AG+ + + ++ + Q+ ++K +
Sbjct: 5 NTNRHYSLRKLKTGTASVAVALTVLGAGLVVNTNEVSAVATRSQTDTLEKVQERADKFEI 64

Query: 72 ELQWFELENKLL 83
E +L+N L
Sbjct: 65 ENNTLKLKNSDL 76


33  SPCG_2145  SPCG_2153  Y  N  N  Genomic Island
LocusTag   DNBias  CDNBias  %GCBias    Product
SPCG_2145  0       18       -3.333820  hypothetical protein
SPCG_2146  0       19       -3.386609  hypothetical protein
SPCG_2147  -1      15       -0.120527  hypothetical protein
SPCG_2148  -1      20       2.253234   hypothetical protein
SPCG_2149  0       22       3.063922   hypothetical protein
SPCG_2150  -1      18       3.219411   hypothetical protein
SPCG_2151  -1      19       3.510898   hypothetical protein
SPCG_2152  -2      20       4.278757   glycerol uptake facilitator protein
SPCG_2153  -1      18       3.408187   hypothetical protein
34  SPCG_0040  SPCG_0053  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
SPCG_0040  -1      28       3.758717   acyl carrier protein
SPCG_0041  -2      27       3.327110   degenerate transposase
SPCG_0042  -2      21       4.241244   degenerate transposase
SPCG_0043  -3      21       4.704545   bacteriocin BlpU
SPCG_0044  -2      22       4.873173   competence factor transporting ATP-binding
SPCG_0045  0       21       5.137532   competence factor transport protein ComB
SPCG_0046  1       20       4.269736   phosphoribosylaminoimidazole-succinocarboxamide
SPCG_0047  1       21       5.030266   phosphoribosylformylglycinamidine synthase
SPCG_0048  0       20       5.384824   amidophosphoribosyltransferase
SPCG_0049  0       21       5.038979   phosphoribosylaminoimidazole synthetase
SPCG_0050  -1      20       5.040838   phosphoribosylglycinamide formyltransferase
SPCG_0051  0       21       5.330959   vanZ protein
SPCG_0052  1       24       6.607216   bifunctional
SPCG_0053  1       25       6.576841   phosphoribosylamine--glycine ligase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_0040  NUCEPIMERASE  26     0.016    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 25.9 bits (57), Expect = 0.016
Identities = 14/32 (43%), Positives = 19/32 (59%), Gaps = 2/32 (6%)

Query: 39 VDLMEFILTLEDEFSIEISDEEIDQLQSVGDV 70
V+LM++I LED IE + + LQ GDV
Sbjct: 266 VELMDYIQALEDALGIEA-KKNMLPLQP-GDV 295


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_0044  ANTHRAXTOXNA  30     0.027    Anthrax toxin LF subunit signature.
		>ANTHRAXTOXNA#Anthrax toxin LF subunit signature.

Length = 800

Score = 30.5 bits (68), Expect = 0.027
Identities = 34/209 (16%), Positives = 76/209 (36%), Gaps = 26/209 (12%)

Query: 315 NLFFMTLLALPIYTVIIFAFMKPFEKMNRDTMEANAVLSSSIIEDINGIETIKSLTSESQ 374
N F ++ ++V++FA + +E NA+ DI + +E +
Sbjct: 4 NKFIPNKFSIISFSVLLFAIS------SSQAIEVNAMNEHYTESDIKRNHKTEKNKTEKE 57

Query: 375 RYQKIDKEFVDYLKKSFTYSRAESQQKALKKVAHLLLNVGILWMGAVLVMDGKMSLGQLI 434
+++ V + T + + Q LKK+ +L + G + D +
Sbjct: 58 KFKDSINNLVKTEFTNETLDKIQQTQDLLKKIPKDVLEIYSELGGEIYFTDID------L 111

Query: 435 TYNTLLVYFTNPLENIINLQTKLQTAQVANNRLNEVYLVASEFEEKKTV---EDLSLMKG 491
+ L + +N +N + + ++ + E K + +D ++
Sbjct: 112 VEHKELQDLSEEEKNSMNSRGE-------KVPFASRFVFEKKRETPKLIINIKDYAI--N 162

Query: 492 DMTFKQVHYKYGYG--RDVLSDINLTVPQ 518
K+V+Y+ G G D++S P+
Sbjct: 163 SEQSKEVYYEIGKGISLDIISKDKSLDPE 191


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
SPCG_0045  RTXTOXIND  60     7e-12    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 59.8 bits (145), Expect = 7e-12
Identities = 66/444 (14%), Positives = 145/444 (32%), Gaps = 60/444 (13%)

Query: 27 MALLLVFLLGFATVAEKEMSLSTRATVEPSRILANIQSTSN---NRILVNHLEENKLVKK 83
M L++ + + + + E+ + + S I+ N I+V +E + V+K
Sbjct: 65 MGFLVIAFI-LSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIV---KEGESVRK 120

Query: 84 GDLLVQYQEGAEGVQAESYASQLDMLKDQKKQLEYLQKSLQEGENHFPEEDKFGYQATFR 143
GD+L++ A G +A++ +Q +L+ + +Q Y S N PE
Sbjct: 121 GDVLLKLT--ALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQN 178

Query: 144 DYISQAGSLRASTSQQNETIASQNAAASQT----QAEIGNLISQTEAKIRDYQTAKSAIE 199
+ L + +Q T +Q +AE ++++ + KS ++
Sbjct: 179 VSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLD 238

Query: 200 TGASLAGQNLAYSLYQSYKSQGEENPQTKVQAVAQVEAQISQLESSLATYRVQYAGSGTQ 259
+SL + + + + ++ +S L + +
Sbjct: 239 DFSSLLHKQAI----------AKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSA--- 285

Query: 260 QAYASGLSSQLESLKSQHLAKVGQELTLLAQKILEAESGKKVQGNLLDKGKVTASEDGVL 319
+ ++ ++ L + + LL ++ + E + A +
Sbjct: 286 KEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNE-------ERQQASVIRAPVSVKV 338

Query: 320 HLNPETSDSSMVAEGALLAQLYPS---LEREGKAKLTAYLSSKYVARIKVGDSVR----- 371
++ +V L + P LE + +K + I VG +
Sbjct: 339 QQLKVHTEGGVVTTAETLMVIVPEDDTLEVTAL------VQNKDIGFINVGQNAIIKVEA 392

Query: 372 --YTTTHDAGNQLFLDSTITSIDATATKTEKGNFF-----KIEAETNLTSEQAEKLRYGV 424
YT L + +I+ A + ++ IE T + L G+
Sbjct: 393 FPYTRYGY------LVGKVKNINLDAIEDQRLGLVFNVIISIEENCLSTGNKNIPLSSGM 446

Query: 425 EGRLQMITGKKSYLRYYLDQFLNK 448
++ TG +S + Y L
Sbjct: 447 AVTAEIKTGMRSVISYLLSPLEES 470


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_0047  FLGMRINGFLIF  31     0.036    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 31.1 bits (70), Expect = 0.036
Identities = 29/127 (22%), Positives = 41/127 (32%), Gaps = 35/127 (27%)

Query: 1101 ANSTSPTLFYNDANQHVAKMVETRIANTNSPWLAGVQVGDIHAIPVSHGEGKFV--VTAE 1158
S + NDA A VE+RI L+ + G G VTA+
Sbjct: 220 TQSNTSGRDLNDAQLKFANDVESRIQRRIEAILSPIV-----------GNGNVHAQVTAQ 268

Query: 1159 EFAELRDNGQIFSQYVDFNGKPSMDSKYNPNGSVHAIEGITSKNGQIIGKMGHSERYEDG 1218
+DF K + Y+PNG SK ++ SE+ G
Sbjct: 269 ---------------LDFANKEQTEEHYSPNGDA-------SKATLRSRQLNISEQVGAG 306

Query: 1219 LFQNIPG 1225
+PG
Sbjct: 307 YPGGVPG 313


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_0049  BINARYTOXINA  30     0.014    Clostridial binary toxin A signature.
		>BINARYTOXINA#Clostridial binary toxin A signature.

Length = 454

Score = 30.0 bits (67), Expect = 0.014
Identities = 11/33 (33%), Positives = 15/33 (45%)

Query: 193 YSLVRRVFADYTGEEVLPELEGKKLKEVLLEPT 225
YS R+ F DY E E E K L+ + +
Sbjct: 93 YSQTRQYFYDYQIESNPREKEYKNLRNAISKNK 125


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
SPCG_0050  SUBTILISIN  33     7e-04    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 32.5 bits (74), Expect = 7e-04
Identities = 10/37 (27%), Positives = 15/37 (40%), Gaps = 1/37 (2%)

Query: 105 YLPEFPGAHGIEDAWNAGVGQSGVTIHWVDSGVDTGH 141
+P WN G+ GV + +D+G D H
Sbjct: 21 EIPRGVEMIQAPAVWNQTRGR-GVKVAVLDTGCDADH 56


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_0053  ARGDEIMINASE  31     0.008    Bacterial arginine deiminase signature.
		>ARGDEIMINASE#Bacterial arginine deiminase signature.

Length = 409

Score = 31.3 bits (71), Expect = 0.008
Identities = 14/90 (15%), Positives = 35/90 (38%), Gaps = 6/90 (6%)

Query: 146 DGLALGKGVVVADTVEQAVEAAHEMLLDNKFGDSGA--RVVIEEFLEGEEF----SLFAF 199
D L L KG++V E+ + E L + F + + ++ + + + ++F
Sbjct: 220 DELVLNKGLLVIGISERTEAKSVEKLAISLFKNKTSFDTILAFQIPKNRSYMHLDTVFTQ 279

Query: 200 VNGDKFYIMPTAQDHKRAYDGDKGPNTGGM 229
++ F + + Y P++ +
Sbjct: 280 IDYSVFTSFTSDDMYFSIYVLTYNPSSSKI 309


35  SPCG_0558  SPCG_0571  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
SPCG_0558   2      15        0.768147  leucine-rich protein
SPCG_0559   1      15        0.804371  hypothetical protein
SPCG_0560   3      20        2.052639  transmembrane protein Vexp1
SPCG_0561   0      21        2.907835  ABC transporter ATP-binding protein
SPCG_0562   0      22        2.461724  transmembrane protein Vexp3
SPCG_0563   1      22        2.042823  pep27 protein
SPCG_0564   1      23        2.326968  DNA-binding response regulator VncR
SPCG_0565  -1      23        2.240318  sensor histidine kinase VncS
SPCG_0566  -2      28        2.187195  fructose-bisphosphate aldolase
SPCG_0567  -2      19        2.347741  oxidoreductase
SPCG_0568  -2      17        3.063561  amino acid ABC transporter permease
SPCG_0569  -2      17        3.077532  amino acid ABC transporter permease
SPCG_0570  -2      16        2.025083  amino acid ABC transporter amino acid-binding
SPCG_0571  -2      15        0.816324  amino acid ABC transporter ATP-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
SPCG_0558  HTHFIS  34     4e-04    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 34.4 bits (79), Expect = 4e-04
Identities = 14/56 (25%), Positives = 26/56 (46%), Gaps = 4/56 (7%)

Query: 218 LHQMILDQDQIQEIILSLWENSAVLTKTAQQLYLHRNSLQYKIDKWEELTGLQLKE 273
L+ +L + + I+ +L K A L L+RN+L+ KI + G+ +
Sbjct: 428 LYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIREL----GVSVYR 479


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
SPCG_0561  PF05272  32     0.003    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 31.6 bits (71), Expect = 0.003
Identities = 18/41 (43%), Positives = 23/41 (56%), Gaps = 4/41 (9%)

Query: 28 FEPG-KF-YSII--GESGAGKSTLLSLLAGLDSPVEGSILF 64
EPG KF YS++ G G GKSTL++ L GLD +
Sbjct: 589 MEPGCKFDYSVVLEGTGGIGKSTLINTLVGLDFFSDTHFDI 629


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
SPCG_0564  HTHFIS  85     2e-21    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 85.3 bits (211), Expect = 2e-21
Identities = 29/104 (27%), Positives = 51/104 (49%), Gaps = 1/104 (0%)

Query: 2 KILIVEDEEMIREGVSDYLTDCGYETIEAADGQEALEQFSSYEVALVLLDIQMPKLNGLE 61
IL+ +D+ IR ++ L+ GY+ ++ ++ + LV+ D+ MP N +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VLAEIRKT-SQVPVLMLTAFQDEEYKMSAFASLADGYLEKPFSL 104
+L I+K +PVL+++A + A A YL KPF L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDL 108


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
SPCG_0565  PF06580  31     0.012    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 30.6 bits (69), Expect = 0.012
Identities = 29/166 (17%), Positives = 61/166 (36%), Gaps = 30/166 (18%)

Query: 288 ILSLSSV--QELRDDRETIDLLQMTQNLVKDYALLAKER-------ELQIDNSLTHQQAY 338
+ SLS + LR L +V Y LA + E QI+ ++ Q
Sbjct: 197 LTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQ-- 254

Query: 339 LNPSVMKLILSNLISNAIKHSVPGGLVRIGEREGELFIENSCSSEEQEKLAQSFSDNASR 398
V +++ L+ N IKH + + G++ + +++ + + S
Sbjct: 255 ----VPPMLVQTLVENGIKHGIAQ-----LPQGGKILL---KGTKDNGTVTLEVENTGSL 302

Query: 399 KVK----GSGMGLFVVKSLLEH---EKLAYRFEMEENRLTFFIDFP 437
+K +G GL V+ L+ + + ++ ++ + P
Sbjct: 303 ALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_0567  ACRIFLAVINRP  31     0.008    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 31.3 bits (71), Expect = 0.008
Identities = 20/72 (27%), Positives = 31/72 (43%), Gaps = 2/72 (2%)

Query: 100 GNLAIYIFASIILVAYLGKYIQYEAWRWIHRLVYLAYILGLFHIYMIMGNRLLTFNLLSF 159
GN A + A +V +L YE+W I V L LG+ + + + + F
Sbjct: 869 GNQAPALVAISFVVVFLCLAALYESWS-IPVSVMLVVPLGIVGVLLAATLFNQKND-VYF 926

Query: 160 LVGSYALLGLLA 171
+VG +GL A
Sbjct: 927 MVGLLTTIGLSA 938


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
SPCG_0571  PF05272  29     0.018    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.3 bits (65), Expect = 0.018
Identities = 14/23 (60%), Positives = 16/23 (69%)

Query: 35 VVVLLGPSGSGKSTLIRTINGLE 57
VVL G G GKSTLI T+ GL+
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLD 620


36  SPCG_0615  SPCG_0624  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
SPCG_0615   1      11       -0.916231  thioredoxin
SPCG_0616   1      11       -0.222847  bifunctional methionine sulfoxide reductase A/B
SPCG_0617   1      14       -0.453463  DNA-binding response regulator
SPCG_0618   0      14       -0.685462  sensor histidine kinase
SPCG_0619  -1      14       -0.946109  hypothetical protein
SPCG_0620  -2      12       -0.985057  zinc metalloprotease ZmpB
SPCG_0621  -2      13        2.258397  chorismate binding protein
SPCG_0622  -1      13        1.181374  hypothetical protein
SPCG_0623  -1      13        2.114946  surface protein
SPCG_0624  -3      17        3.866263  glucokinase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits      Score  E-value  Comments
SPCG_0615  adhesinb  27     0.049    Adhesin B signature.
		>adhesinb#Adhesin B signature.

Length = 310

Score = 27.1 bits (60), Expect = 0.049
Identities = 13/33 (39%), Positives = 17/33 (51%), Gaps = 1/33 (3%)

Query: 10 MKKWQTCVLGAGSLLCLTACS-GKSVTSEHQTK 41
MKK + VL + + L ACS KS T +K
Sbjct: 1 MKKCRFLVLLLLAFVGLAACSSQKSSTETGSSK 33


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
SPCG_0617  HTHFIS  95     1e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 95.3 bits (237), Expect = 1e-24
Identities = 35/129 (27%), Positives = 65/129 (50%), Gaps = 6/129 (4%)

Query: 10 TILIVEDEYLVRQGLTKLVNVAAYDMEIIGQAENGRQAWELIQKQVPDIILTDINMPHLN 69
TIL+ +D+ +R L + ++ A YD+ N W I D+++TD+ MP N
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVR---ITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 70 GIQLASLVRETYPQVHLVFLTGYDDFDYALSAVKLGVDDYLLKPFSRQDIEEMLGKIKQK 129
L +++ P + ++ ++ + F A+ A + G DYL KPF D+ E++G I +
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPF---DLTELIGIIGRA 118

Query: 130 LDKEEKEEQ 138
L + ++
Sbjct: 119 LAEPKRRPS 127


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
SPCG_0618  PF06580  199    3e-61    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 199 bits (508), Expect = 3e-61
Identities = 58/202 (28%), Positives = 100/202 (49%), Gaps = 9/202 (4%)

Query: 357 QEETTRQYQLQALSSQINPHFLYNTLDTIIWMAEFHDSQRVVQVTKSLATYFRLAL-NQG 415
++ QL AL +QINPHF++N L+ I + D + ++ SL+ R +L
Sbjct: 154 MASMAQEAQLMALKAQINPHFMFNALNNIRALIL-EDPTKAREMLTSLSELMRYSLRYSN 212

Query: 416 KDLICLSDEINHVRQYLFIQKQRYGDKLEYEINENVAFDNLVLPKLVLQPLVENALYHGI 475
+ L+DE+ V YL + ++ D+L++E N A ++ +P +++Q LVEN + HGI
Sbjct: 213 ARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVENGIKHGI 272

Query: 476 KEKEGQGHIKLSVQKQDSGLVIRIEDDGVGFQDAGDSSQSQLKRGGVGLQNVDQRLKLHF 535
+ G I L K + + + +E+ G S G GLQNV +RL++ +
Sbjct: 273 AQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKES------TGTGLQNVRERLQMLY 326

Query: 536 GANYQMKIDSRPQKGTKVEIYI 557
G Q+K+ + K + I
Sbjct: 327 GTEAQIKLSEKQGKVN-AMVLI 347


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
SPCG_0620  IGASERPTASE  72     2e-14    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 72.4 bits (177), Expect = 2e-14
Identities = 70/389 (17%), Positives = 133/389 (34%), Gaps = 43/389 (11%)

Query: 158 GLDTVLEETSAKPGEVTVVEVETPQSTTNQEQARTENQVVETEEAPKEEAPKTEESPKEE 217
+ ++ T+ +V + S N+E AR + V AP + +T E+ E
Sbjct: 987 KRNQTVDTTNITTPNNIQADVPSVPSN-NEEIARVDEAPV-PPPAPATPS-ETTETVAEN 1043

Query: 218 PKSEVKPTDDTLPKVEEGKEDSAEPAPVEEVGGEVESKSEEKVAVKPESQPSDKPAEESK 277
K E K + E + E A + + +++ E E +E++
Sbjct: 1044 SKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSE-------TKETQ 1096

Query: 278 VEQAGEPVAPREDEKAPVEPEKQPEAPEEEKAVEETPKQEDTQPEVVETKDEAANQPVEE 337
+ E ++EKA VE EK E P+ + +PKQE Q E V+ + E A +
Sbjct: 1097 TTETKETATVEKEEKAKVETEKTQEVPK--VTSQVSPKQE--QSETVQPQAEPARENDPT 1152

Query: 338 PKVETPAVEKQTEPTEEPKVEQVGEPVEPREDEKAPVSPEKQPEAPEEEKTAEETPKQED 397
++ P + T E ++ VE E V+ E P+
Sbjct: 1153 VNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVV---------ENPENTT 1203

Query: 398 KIKGIGTKEPVDKSELNNQIDKASSVS----PTDYSTASYNALGPVLETAKGVYASEPVK 453
T +P SE +N+ S P + A+ ++ V
Sbjct: 1204 P----ATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDR-----------STVA 1248

Query: 454 QPEVNSETKAEKVAANTDAKQSEVNSETASLKTAISGLNTDKVELENQLKIAQGKTETDF 513
++ S ++ Q + ++ IS L + E + + ++ ++
Sbjct: 1249 LCDLTSTNTNAVLSDARAKAQFVALNVGKAVSQHISQLEMNN-EGQYNVWVSNTSMNKNY 1307

Query: 514 SMESWTVLSTAKNKAQEVKDNGTATQEQI 542
S + S+ + Q D + Q+
Sbjct: 1308 SSSQYRRFSSKSTQTQLGWDQTISNNVQL 1336



Score = 63.2 bits (153), Expect = 1e-11
Identities = 74/309 (23%), Positives = 119/309 (38%), Gaps = 25/309 (8%)

Query: 255 KSEEKVAVKPESQPSDKPAEESKVEQAGEPVAPREDEKAPVEPEKQPEAPEEEKAVEETP 314
K + V + P++ A+ V E +A R DE APV P E + V E
Sbjct: 987 KRNQTVDTTNITTPNNIQADVPSVPSNNEEIA-RVDE-APVPPPAPATPSETTETVAENS 1044

Query: 315 KQEDTQPEVVETKDEAA----NQPVEEPKVETPA------VEKQTEPTEEPKVEQVGEPV 364
KQE E E + +E K A V + T+E + + E
Sbjct: 1045 KQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETA 1104

Query: 365 EPREDEKAPVSPEKQPEAPEEEKTAEETPKQEDKIKGIGTKEPVDKSELNNQIDKASSVS 424
++EKA V EK E P + T++ +PKQE EP +++ I + S +
Sbjct: 1105 TVEKEEKAKVETEKTQEVP--KVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQT 1162

Query: 425 PTDYSTA------SYNALGPVLETAKGVYASEPVKQPEVNSETKAEKVAANTDAKQSEVN 478
T T S N PV E+ + V+ PE N+ + N+++ N
Sbjct: 1163 NTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPE-NTTPATTQPTVNSESSNKPKN 1221

Query: 479 SETASLKTAISGLNTDKVELENQLKIAQGKTETDFSMESWTVLSTAKNKAQEVK-DNGTA 537
S+++ + ++ +A S + VLS A+ KAQ V + G A
Sbjct: 1222 RHRRSVRSVPHNVEPATTSSNDRSTVALCDL---TSTNTNAVLSDARAKAQFVALNVGKA 1278

Query: 538 TQEQINEAE 546
+ I++ E
Sbjct: 1279 VSQHISQLE 1287


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
SPCG_0624  PF03309  35     2e-04    Bvg accessory factor
		>PF03309#Bvg accessory factor

Length = 271

Score = 35.1 bits (81), Expect = 2e-04
Identities = 25/126 (19%), Positives = 45/126 (35%), Gaps = 14/126 (11%)

Query: 11 IIGIDLGGTSIKFAILTTAGEIQ---GKWSIKTNILDEGSHIVDDMIESIQHRLDLLGLA 67
++ ID+ T +++ +G+ +W I+T D++ +I L+G
Sbjct: 2 LLAIDVRNTHTVVGLISGSGDHAKVVQQWRIRTEPEVTA----DELALTI---DGLIGDD 54

Query: 68 AADFQGIGMGSPGVVDRDKGTVIGAYNLNWKTLQPIKQKIEKALGIPFFIDNDANVAALG 127
A G S V V W + + + GIP +DN V A
Sbjct: 55 AERLTGASGLS--TVPSVLHEVRVMLEQYWPNVPHVLIEPGVRTGIPLLVDNPKEVGA-- 110

Query: 128 ERWMGA 133
+R +
Sbjct: 111 DRIVNC 116


37  SPCG_0742  SPCG_0748  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
SPCG_0742  -1      10        1.491503  3-ketoacyl-ACP reductase
SPCG_0743  -1      11        0.455273  mutT/nudix family protein
SPCG_0744  -2      10        0.226862  PEP-utilizing enzyme family protein
SPCG_0745  -2      12        0.768388  hypothetical protein
SPCG_0746  -1      10        0.398209  aminopeptidase N
SPCG_0747   0      13       -1.055024  DNA-binding response regulator CiaR
SPCG_0748  -1      12       -0.479972  sensor histidine kinase CiaH
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_0742  DHBDHDRGNASE  97     2e-26    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 97.0 bits (241), Expect = 2e-26
Identities = 67/252 (26%), Positives = 107/252 (42%), Gaps = 24/252 (9%)

Query: 3 KRVLITGVSSGIGLAQARLFLEKGYQVYGVDQGEKSLL-----EGDFHFLQRDLTLDL-- 55
K ITG + GIG A AR +G + VD + L D+
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 56 -----EPIFDWCPQV---DVLCNTAGVLDDYKPLLEQTAQDIQAIFEINYIIPVELTRYY 107
E ++ D+L N AGVL + + ++ +A F +N +R
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLR-PGLIHSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 108 LTQMLENKKGIIINMCSIASSLAGGGGHAYTSSKHALAGFTKQLALDYAEAGIQVFGIAP 167
M++ + G I+ + S + + AY SSK A FTK L L+ AE I+ ++P
Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187

Query: 168 GAVKTAMT--------AADFEPGGLADWVASETPIKRWIEPEEIAELSLFLASGKASAMQ 219
G+ +T M A+ G + + P+K+ +P +IA+ LFL SG+A +
Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHIT 247

Query: 220 GQILTIDGGWSL 231
L +DGG +L
Sbjct: 248 MHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_0744  PHPHTRNFRASE  69     2e-15    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 69.4 bits (170), Expect = 2e-15
Identities = 49/224 (21%), Positives = 90/224 (40%), Gaps = 30/224 (13%)

Query: 25 VGMIRGEYLLRELNQNILLQSCQEFVKDYLETICSLYSDEEVWYRFTEL-TNTEANCLVG 83
+G+ R E+L + +Q L + +E + Y E + + V R ++ + E + L
Sbjct: 293 IGLYRTEFLYMDRDQ---LPTEEEQFEAYKEVVQRMDGKP-VVIRTLDIGGDKELSYL-- 346

Query: 84 TKEFFDEGHPLFGYRGTRRLLACLDEF--QAEAHVVTEVYQTNPNLSVIFPFVNDADQLK 141
+ E +P G+R R L D F Q A + Y NL V+FP + ++L+
Sbjct: 347 --QLPKELNPFLGFRAIRLCLEKQDIFRTQLRALLRASTY---GNLKVMFPMIATLEELR 401

Query: 142 QAITVLRQYGFTG-----------KVGTMIELPSAYFDLSSILETGISKIVVGMNDLTSF 190
QA ++++ +VG M+E+PS + + + +G NDL +
Sbjct: 402 QAKAIMQEEKDKLLSEGVDVSDSIEVGIMVEIPSTAVAANLFAKE-VDFFSIGTNDLIQY 460

Query: 191 VFATMRN----SQWHDLESPIMLDMLRDMQDKARKNKINFAVAG 230
A R S + P +L ++ + A + G
Sbjct: 461 TMAADRMNERVSYLYQPYHPAILRLVDMVIKAAHSEGKWVGMCG 504


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
SPCG_0747  HTHFIS  86     2e-21    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 85.7 bits (212), Expect = 2e-21
Identities = 34/118 (28%), Positives = 55/118 (46%), Gaps = 1/118 (0%)

Query: 24 IKILLVEDDLGLSNSVFDFLDD-FADVMQVFDGEEGLYEAESGVYDLILLDLMLPEKNGF 82
IL+ +DD + + L DV + +G DL++ D+++P++N F
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 83 QVLKELREKGITTPVLIMTAKESLDDKGHGFELGADDYLTKPFYLEELKMRIQALLKR 140
+L +++ PVL+M+A+ + E GA DYL KPF L EL I L
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
SPCG_0748  PF06580  35     6e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 34.8 bits (80), Expect = 6e-04
Identities = 13/76 (17%), Positives = 30/76 (39%), Gaps = 9/76 (11%)

Query: 314 FRFENRIHRTIVTDQLLLKQL---MTI--LFDNAVKY----TEEDGEIDFLISATDRNLY 364
+FE+R+ + ++ M + L +N +K+ + G+I + + +
Sbjct: 234 IQFEDRLQFENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVT 293

Query: 365 LLVSDNGIGISTEDKK 380
L V + G K+
Sbjct: 294 LEVENTGSLALKNTKE 309


38  SPCG_0941  SPCG_0949  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
SPCG_0941  -1      13        1.111165  endo-beta-N-acetylglucosaminidase
SPCG_0942   0      20        2.894164  adherence and virulence protein A
SPCG_0943  -1      19        3.131494  metalloprotease
SPCG_0944  -1      19        3.027037  diacylglycerol kinase
SPCG_0945  -2      18        2.839534  GTP-binding protein Era
SPCG_0946  -2      18        2.774923  formamidopyrimidine-DNA glycosylase
SPCG_0947  -2      18        2.495505  dephospho-CoA kinase
SPCG_0948  -2      16        1.618038  multi-drug resistance efflux pump
SPCG_0949  -3      14        1.383215  preprotein translocase subunit SecG
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
SPCG_0941  FLGFLGJ  30     0.024    Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 30.5 bits (68), Expect = 0.024
Identities = 24/123 (19%), Positives = 48/123 (39%), Gaps = 27/123 (21%)

Query: 576 LLAHSALESNWGRSKIAKDK----NNFFGI----------TAYDTTPYLSA--------- 612
+LA +ALES WG+ +I ++ N FG+ T TT Y +
Sbjct: 174 ILAQAALESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKF 233

Query: 613 KTFDDVDKGILGATKWIKENYIDRGRTFLGNKASGM----NVEYASDPYWGEKIASVMMK 668
+ + + + + N T + G + YA+DP++ K+ +++ +
Sbjct: 234 RVYSSYLEALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQ 293

Query: 669 INE 671
+
Sbjct: 294 MKS 296


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_0942  FbpA_PF05833  684    0.0      Fibronectin-binding protein
		>FbpA_PF05833#Fibronectin-binding protein

Length = 577

Score = 684 bits (1766), Expect = 0.0
Identities = 197/577 (34%), Positives = 323/577 (55%), Gaps = 31/577 (5%)

Query: 10 MSFDGFFLHHIVEELRSELVNGRIQKINQPFEQELVLQIRSNRQSHRLLLSAHPVFGRIQ 69
M+ DG FL+ I++EL++ ++NG+I K+NQP + E++L IR R S +LL+S+ + RI
Sbjct: 1 MALDGIFLYSIIDELKNTIINGKIDKVNQPEKDEIILNIRKGRLSFKLLISSSSNYPRIH 60

Query: 70 LTQTTFENPAQPSTFIMVLRKYLQGALIESIEQVENDRIVEITVSNKNEIGDHIQATLII 129
LT T NP + F MVLRKY+ A I I Q+ DRIV I + +E+G + +LII
Sbjct: 61 LTDLTKPNPIKAPMFCMVLRKYISNAKIVDIHQINQDRIVVIDFESTDELGFNSIYSLII 120

Query: 130 EIMGKHSNILLVDKSSHKILEVIKHVGFSQNSYRTLLPGSTYIAPPSTKSLNPFTIKDEK 189
EIMG+HSN+ L+ K + I++ IKH+ N+YR++ PG Y+ PP + LNPF +
Sbjct: 121 EIMGRHSNMTLIRKRDNIIMDSIKHITPDINTYRSIYPGIEYVYPPKSPKLNPFDFSYDM 180

Query: 190 LFEILQ--TQELTAKNLQSLFQGLGRDTANELERILVSEKL---------------SAFR 232
+ + + +L +F G+ + ++E+ L + + F+
Sbjct: 181 IENFTKENSLQLNDNIFSKIFTGVSKTLSSEICFRLKNNSIDLSLSNLKEIVEVCKDLFK 240

Query: 233 NFFNQETKPCLTETSFSPVPFA--------NQVGEPFANLSDLLDTYYKDKAERDRVKQQ 284
+ + + + S V F + + + S LL+ +Y K + DR+K +
Sbjct: 241 EIQSNKFEFNCYTKNNSFVGFYCLNLMSKEDYKKIQYDSSSKLLENFYYAKDKSDRLKSK 300

Query: 285 ASELIRRVENELQKNRHKLKKQEKELLATDNAEEFRQKGELLTTFLHQVPNDQDQVILDN 344
+S+L + V N + + K K L ++ + F+ GELLT ++ + + L N
Sbjct: 301 SSDLQKIVMNNINRCTKKDKILNNTLKKCEDKDIFKLYGELLTANIYALKKGLSHIELAN 360

Query: 345 YYTNQ--PIMIALDKALTPNQNAQRYFKRYQKLKEAVKYLTDLIEETKATILYLESVETV 402
YY+ + I LD+ TP+QN Q Y+K+Y KLK++ + + + + + + YL SV T
Sbjct: 361 YYSENYDTVKITLDENKTPSQNVQSYYKKYNKLKKSEEAANEQLLQNEEELNYLYSVLTN 420

Query: 403 LNQA-GLEEIAEIREELIQTGFIRRRQ--REKIQKRKKLEQYLASDGKTIIYVGRNNLQN 459
+N A +EI EI++ELI+TG+I+ ++ + K K K +++ DG IYVG+NN+QN
Sbjct: 421 INNADNYDEIEEIKKELIETGYIKFKKIYKSKKSKTSKPMHFISKDGID-IYVGKNNIQN 479

Query: 460 EELTFKMARKEELWFHAKDIPGSHVVISGNLDPSDAVKTDAAELAAYFSQGRLSNLVQVD 519
+ LT K A K ++WFH K+IPGSHV++ +D ++ +AA LAAY+S+ + S+ V VD
Sbjct: 480 DYLTLKFANKHDIWFHTKNIPGSHVIVKNIMDIPESTLLEAANLAAYYSKSQNSSNVPVD 539

Query: 520 MIEVKKLNKPTGGKPGFVTYTGQKTLRVTPDSKKIAS 556
EVK + KP G KPG V Y+ +T+ VTP + + +
Sbjct: 540 YTEVKNVKKPNGAKPGMVIYSTNQTIYVTPTNPNLKN 576


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
SPCG_0945  TCRTETOQM  36     1e-04    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 36.4 bits (84), Expect = 1e-04
Identities = 43/207 (20%), Positives = 80/207 (38%), Gaps = 34/207 (16%)

Query: 3 FKSGFVAILGRPNVGKSTFLNHVMGQKIAIMSDKAQTTRNKIMGIYTTDKEQIVFIDTPG 62
+ SG + LG + G + N ++ ++ I T+ + + ++ IDTPG
Sbjct: 25 YNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITS-------FQWENTKVNIIDTPG 77

Query: 63 IHKPKTALGDFMVESAYSTLREVDTVLFMVPADEARGKGDDMIIERLKAAKVPVILVVNK 122
H DF+ E Y +L +D + ++ A + ++ L+ +P I +NK
Sbjct: 78 -HM------DFLAE-VYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIPTIFFINK 129

Query: 123 IDKVHPDQLLSQIDDFRNQMDFKEIVPISALQGNNVSRLVDILSENLDEGFQYFPSDQIT 182
ID+ + ID D KE + + V ++ N E Q+ D +
Sbjct: 130 IDQ-------NGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQW---DTVI 179

Query: 183 DHPERFLVSEMVREKVL---HLTREEI 206
+ + L EK + L E+
Sbjct: 180 EGNDDLL------EKYMSGKSLEALEL 200


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
SPCG_0948  TCRTETA  106    2e-27    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 106 bits (265), Expect = 2e-27
Identities = 69/357 (19%), Positives = 142/357 (39%), Gaps = 9/357 (2%)

Query: 10 LRIAWFGNFLTGASISLVVPFMPIFVENLGVGSQQVAFYAGLAISVSAISAALFSPIWGI 69
L + L I L++P +P + +L V S V + G+ +++ A+ +P+ G
Sbjct: 7 LIVILSTVALDAVGIGLIMPVLPGLLRDL-VHSNDVTAHYGILLALYALMQFACAPVLGA 65

Query: 70 LADKYGRKPMMIRAGLAMTITMGGLAFVPNIYWLIFLRLLNGVFAGFVPNATALIASQVP 129
L+D++GR+P+++ + + +A P ++ L R++ G+ A A IA
Sbjct: 66 LSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITD 125

Query: 130 KEKSGSALGTLSTGVVAGTLTGPFIGGFIAELFGIRTVFLLVGSFLFLAAILTICFIKED 189
++ G +S G + GP +GG + F F + L + + E
Sbjct: 126 GDERARHFGFMSACFGFGMVAGPVLGGLMGG-FSPHAPFFAAAALNGLNFLTGCFLLPES 184

Query: 190 FQPVAKEKAIPTKELFTSVKYPYL---LLNLFLTSFVIQFSAQSIGPILALYVRDLGQTE 246
+ + S ++ + L F++Q Q + ++ D +
Sbjct: 185 HKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWD 244

Query: 247 NLLFVSGLIVSSMG-FSSMMSAGVMGKLGDKVGNHRLLVVAQFYSVIIYLLCANASSPLQ 305
G+ +++ G S+ A + G + ++G R L++ Y+L A A+
Sbjct: 245 ATTI--GISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWM 302

Query: 306 LGLYRFLFGLGTGALIPGVNALLSKMTPKAGISRVFAFNQVFFYLGGVVGPMAGSAV 362
L G G +P + A+LS+ + ++ L +VGP+ +A+
Sbjct: 303 AFPIMVLLASG-GIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAI 358



Score = 57.5 bits (139), Expect = 3e-11
Identities = 44/178 (24%), Positives = 76/178 (42%), Gaps = 2/178 (1%)

Query: 214 LLNLFLTSFVIQFSAQSIGPILALYVRDLGQTENLLFVSGLIVSSMGFSSMMSAGVMGKL 273
L+ + T + I P+L +RDL + ++ G++++ A V+G L
Sbjct: 7 LIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66

Query: 274 GDKVGNHRLLVVAQFYSVIIYLLCANASSPLQLGLYRFLFGLGTGALIPGVNALLSKMTP 333
D+ G +L+V+ + + Y + A A L + R + G+ TGA A ++ +T
Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGI-TGATGAVAGAYIADITD 125

Query: 334 KAGISRVFAFNQVFFYLGGVVGPMAGSAVAGQFGYHAVFYATSLCVAFSCLFNLIQFR 391
+R F F F G V GP+ G + G F HA F+A + + L
Sbjct: 126 GDERARHFGFMSACFGFGMVAGPVLG-GLMGGFSPHAPFFAAAALNGLNFLTGCFLLP 182


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
SPCG_0949  SECGEXPORT  30     3e-04    Protein-export SecG membrane protein signature.
		>SECGEXPORT#Protein-export SecG membrane protein signature.

Length = 110

Score = 29.9 bits (67), Expect = 3e-04
Identities = 22/78 (28%), Positives = 40/78 (51%), Gaps = 5/78 (6%)

Query: 1 MYNLLLTILLVLSVVIVIAIFMQPTK--NQSSNVFDASSGDLFERSKARGFEAVMQRLTG 58
MY LL + L++++ +V I +Q K + ++ +S LF S + F M R+T
Sbjct: 1 MYEALLVVFLIVAIGLVGLIMLQQGKGADMGASFGAGASATLFGSSGSGNF---MTRMTA 57

Query: 59 ILVFFWLAIALALTVLSS 76
+L + I+L L ++S
Sbjct: 58 LLATLFFIISLVLGNINS 75


39  SPCG_1136  SPCG_1148  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
SPCG_1136  -3      19        0.758332  acetoin dehydrogenase complex, E3 component,
SPCG_1137  -1      13       -2.113500  lipoate-protein ligase
SPCG_1138  -1      20       -4.124403  site-specific tyrosine recombinase XerS
SPCG_1139  -2      18       -3.842493  voltage-gated chloride channel family protein
SPCG_1140  -2      13       -2.874191  ribonuclease HII
SPCG_1141  -3      11       -2.086560  ribosomal biogenesis GTPase
SPCG_1142  -2      11       -2.266678  zinc metalloprotease ZmpD
SPCG_1143  -1      13       -1.419808  immunoglobulin A1 protease
SPCG_1144   3      20        2.019691  hypothetical protein
SPCG_1145   1      20        2.192607  exonuclease RexA
SPCG_1146   0      18        1.185674  exonuclease RexB
SPCG_1147   1      31       -0.597807  IS630-Spn1, transposase Orf1
SPCG_1148  -1      27       -0.034669  degenerate transposase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
SPCG_1136  MICOLLPTASE  34     0.002    Microbial collagenase metalloprotease (M9) signature.
		>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature.

Length = 1104

Score = 33.9 bits (77), Expect = 0.002
Identities = 24/88 (27%), Positives = 35/88 (39%), Gaps = 2/88 (2%)

Query: 29 FVKEGEILLEIMTDKVSMELEAEEDGYLIAILKGDGETVPVTEVIGYLGEERENIPTAGA 88
V +GE LE +S+ + G +KG+ + V E +E EN
Sbjct: 944 TVLKGEKTLEPGRYYLSVYTYDNQSGTYTVNVKGNLKN-EVKETAKDAIKEVENNNDFDK 1002

Query: 89 ASPEASSVPVAST-SNDDDKSDDAFDIV 115
A S+ + T SNDD K + DI
Sbjct: 1003 AMKVDSNSKIVGTLSNDDLKDIYSIDIQ 1030


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
SPCG_1141  PF05272  29     0.020    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.3 bits (65), Expect = 0.020
Identities = 16/87 (18%), Positives = 34/87 (39%), Gaps = 14/87 (16%)

Query: 73 RQYFESQ------EIQTLAI-------NSKEQVTVKVVTDAAKKLMADKIARQKERGIQI 119
R + ++Q ++ + + + ++ + K ++ +AR E G +
Sbjct: 536 RDWVKAQQWDEVPRLEKWLVHVLGKTPDDYKPRRLRYLQLVGKYILMGHVARVMEPGCKF 595

Query: 120 ETLRTMIIGIPNAGKSTLMNRLAGKKI 146
+ + G GKSTL+N L G
Sbjct: 596 DYSVV-LEGTGGIGKSTLINTLVGLDF 621


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
SPCG_1142  TONBPROTEIN  33     0.008    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 33.0 bits (75), Expect = 0.008
Identities = 15/69 (21%), Positives = 22/69 (31%), Gaps = 10/69 (14%)

Query: 168 ASTVSPVEQPK--------VVTEKGEPEVQPALPEAVVTDKGEPEVQPT--LPEAVVTDK 217
S +E P +VT Q P + EPE +P P+
Sbjct: 30 TSVHQVIELPAPAQPISVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVI 89

Query: 218 GEPEVHEKP 226
+P+ KP
Sbjct: 90 EKPKPKPKP 98


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
SPCG_1143  PF03544  35     0.001    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 35.3 bits (81), Expect = 0.001
Identities = 19/112 (16%), Positives = 37/112 (33%), Gaps = 13/112 (11%)

Query: 473 PELSEAVVTDKGEPAVQPELPEAVVSDKGEPAVQPELPEAVVTD---KGETEVQPESPDT 529
P ++ + PA P+AV EP V+PE + + + ++ P
Sbjct: 44 PAPAQPISVTMVAPADLEP-PQAVQPPP-EPVVEPEPEPEPIPEPPKEAPVVIEKPKPKP 101

Query: 530 VVSDKGEPKQVAPLP----EYTGPQASAIVEPEQVAPLPEYTGVQAGSIVEP 577
+PK V + + ++ E AP + + +P
Sbjct: 102 KP----KPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKP 149


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SPCG_1148PREPILNPTASE300.004 Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 30.2 bits (68), Expect = 0.004
Identities = 19/80 (23%), Positives = 25/80 (31%), Gaps = 16/80 (20%)

Query: 19 QILDIINKDTHKEIIAKLDYDAP--SCPECGSQMKKYDFQKPSKIPYLETTGMPSRILLR 76
+ N D + P CP C + + IP L S + LR
Sbjct: 48 EYRSYFNPDDEGVDEPPYNLMVPRSCCPHCNHPITALE-----NIPLL------SWLWLR 96

Query: 77 KRRFKCYHCSKMMVAETPLV 96
R C C + A PLV
Sbjct: 97 GR---CRGCQAPISARYPLV 113


40  SPCG_1500  SPCG_1507  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
SPCG_1500  2       17       -0.059811  acetyltransferase
SPCG_1501  2       16       0.827928   hypothetical protein
SPCG_1502  0       14       0.929957   transcription elongation factor GreA
SPCG_1503  1       15       2.057616   hypothetical protein
SPCG_1504  2       18       2.253550   acetyltransferase
SPCG_1505  0       18       2.623206   acetyltransferase
SPCG_1506  -1      16       2.429043   UDP-N-acetylmuramate--L-alanine ligase
SPCG_1507  1       18       2.128102   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_1500  SACTRNSFRASE  37     1e-05    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 36.8 bits (85), Expect = 1e-05
Identities = 20/76 (26%), Positives = 35/76 (46%), Gaps = 3/76 (3%)

Query: 76 IAETFGNWLEIEYLFVKEELRGQGIGSKLLQQAESEAKNRNCCFAFVNTYQFQAP--DFY 133
I + + IE + V ++ R +G+G+ LL +A AK + C + T FY
Sbjct: 82 IRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFY 141

Query: 134 QKHGYKEVFSLQDYLY 149
KH + + ++ LY
Sbjct: 142 AKHHFI-IGAVDTMLY 156


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_1503  PF03544       30     0.029    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 29.6 bits (66), Expect = 0.029
Identities = 29/160 (18%), Positives = 52/160 (32%), Gaps = 15/160 (9%)

Query: 50 LMADSLSTVEEIMRKAPTVPTHPSQGVPASPADEIQRETPGVPSHP-------SQDVPSS 102
++A L T + + P P P +PAD + P P + +P
Sbjct: 28 VVAGLLYTSVHQVIELPA-PAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEP 86

Query: 103 PAEE------SGSRPGPGPVRPKKLEREYNETPTRVAVSYTTGEKKAEQAGPETPTPATE 156
P E +P P P KK+E+ + + + E A + A
Sbjct: 87 PKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAAT 146

Query: 157 TVDIIRDTSRRSRREGAKPVKPKKEKKSHVKAFV-ISFLV 195
+ + S +P P + + ++ V + F V
Sbjct: 147 SKPVTSVASGPRALSRNQPQYPARAQALRIEGQVKVKFDV 186


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_1504  SACTRNSFRASE  32     7e-04    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 31.8 bits (72), Expect = 7e-04
Identities = 19/71 (26%), Positives = 34/71 (47%), Gaps = 3/71 (4%)

Query: 85 GYISVTCLSIAKEAQGLGLGQKLLTALKEFALEDERDGINLTCHDYLIA---YYEKHGFV 141
GY + +++AK+ + G+G LL E+A E+ G+ L D I+ +Y KH F+
Sbjct: 88 GYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFI 147

Query: 142 NEGQSQSTFAG 152
++
Sbjct: 148 IGAVDTMLYSN 158


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_1505  SACTRNSFRASE  36     2e-05    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 36.5 bits (84), Expect = 2e-05
Identities = 22/92 (23%), Positives = 41/92 (44%), Gaps = 9/92 (9%)

Query: 25 SFPAEKQQLSHILEESIRKCADTFLLARDENQLLGYI-LSSPQSDNPQCLKVHSLVIESD 83
+ + +S++ EE L EN +G I + S + + + + D
Sbjct: 49 QYEDDDMDVSYVEEE-----GKAAFLYYLENNCIGRIKIRSNWNGY---ALIEDIAVAKD 100

Query: 84 HQRQGLGTLLLAALKEVAVELDYKGIRLESPD 115
++++G+GT LL E A E + G+ LE+ D
Sbjct: 101 YRKKGVGTALLHKAIEWAKENHFCGLMLETQD 132


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_1506  ACETATEKNASE  32     0.006    Acetate kinase family signature.
		>ACETATEKNASE#Acetate kinase family signature.

Length = 400

Score = 31.7 bits (72), Expect = 0.006
Identities = 16/55 (29%), Positives = 24/55 (43%), Gaps = 9/55 (16%)

Query: 306 IVNDTVI--IDDFA-----HHPTEIIATLDAARQKYPSKEIVAVFQPHTFTRTIA 353
++ D V+ I D H+P I + A Q P +VAVF F +T+
Sbjct: 103 LITDDVLKAITDCIELAPLHNPANIEG-IKACTQIMPDVPMVAVFDT-AFHQTMP 155


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_1507  BLACTAMASEA   30     0.005    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 30.1 bits (68), Expect = 0.005
Identities = 15/49 (30%), Positives = 29/49 (59%), Gaps = 1/49 (2%)

Query: 4 ERFPLVSDDEVMLTEMPVMNLYDESDLISNIKGEYRDKNYLEWAPITEE 52
ERFP++S +V+L V+ D D K YR ++ ++++P++E+
Sbjct: 60 ERFPMMSTFKVVLC-GAVLARVDAGDEQLERKIHYRQQDLVDYSPVSEK 107


41  SPCG_1670  SPCG_1676  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
SPCG_1670  -1      17       3.277162   alanine racemase
SPCG_1671  0       19       1.672392   4'-phosphopantetheinyl transferase
SPCG_1672  -1      17       0.338814   phospho-2-dehydro-3-deoxyheptonate aldolase
SPCG_1673  -1      14       -1.434434  phospho-2-dehydro-3-deoxyheptonate aldolase
SPCG_1674  -1      13       -2.221583  preprotein translocase subunit SecA
SPCG_1675  2       20       -6.730789  hypothetical protein
SPCG_1676  -1      13       -3.973808  ABC transporter ATP-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_1670  ALARACEMASE   351    e-122    Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 351 bits (902), Expect = e-122
Identities = 129/365 (35%), Positives = 186/365 (50%), Gaps = 17/365 (4%)

Query: 14 RPTKALIHLGAIRQNIQQMGAHIPQGTLKWAVVKANAYGHGAVAVAKAIQDDVDGFCVSN 73
RP +A + L A++QN+ + + W+VVKANAYGHG + AI DGF + N
Sbjct: 3 RPIQASLDLQALKQNLSIVRQAATHARV-WSVVKANAYGHGIERIWSAI-GATDGFALLN 60

Query: 74 IDEAIELRQAGLSKPILIL-GVSEIEAVALAKEYDFTLTVAGLEWIQALLDKEVDLTGLT 132
++EAI LR+ G PIL+L G + + + ++ T V ++AL + + L
Sbjct: 61 LEEAITLRERGWKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKALQNARLKAP-LD 119

Query: 133 VHLKIDSGMGRIGFREVSEVEQAQDLLQKHGVCVEGIFTHFATADEESDDYFNAQLERFK 192
++LK++SGM R+GF+ + Q L V + +HFA A+ D + + R +
Sbjct: 120 IYLKVNSGMNRLGFQPDRVLTVWQQLRAMANVGEMTLMSHFAEAEHP--DGISGAMARIE 177

Query: 193 TILASMKEVPELVHASNSATTLWHVETIFNAVRMGDAMYGLNPSGAVLDL-PYDLIPALT 251
+ + SNSA TLWH E F+ VR G +YG +PSG D+ L P +T
Sbjct: 178 QAA---EGLECRRSLSNSAATLWHPEAHFDWVRPGIILYGASPSGQWRDIANTGLRPVMT 234

Query: 252 LESALVHVKTVPAGACMGYGATYQADSEQVIATVPIGYADGWTRDMQN-FSVLVDGQACP 310
L S ++ V+T+ AG +GYG Y A EQ I V GYADG+ R VLVDG
Sbjct: 235 LSSEIIGVQTLKAGERVGYGGRYTARDEQRIGIVAAGYADGYPRHAPTGTPVLVDGVRTM 294

Query: 311 IVGRVSMDQITIRLPKL--YPLGTKVTLIGSNGDKEITATQVATYRVTINYEVVCLLSDR 368
VG VSMD + + L +GT V L G KEI VA T+ YE++C L+ R
Sbjct: 295 TVGTVSMDMLAVDLTPCPQAGIGTPVELWG----KEIKIDDVAAAAGTVGYELMCALALR 350

Query: 369 IPREY 373
+P
Sbjct: 351 VPVVT 355


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_1671  ENTSNTHTASED  27     0.017    Enterobactin synthetase component D signature.
		>ENTSNTHTASED#Enterobactin synthetase component D signature.

Length = 234

Score = 26.9 bits (59), Expect = 0.017
Identities = 18/73 (24%), Positives = 31/73 (42%), Gaps = 5/73 (6%)

Query: 9 GIDIEELASIESAVTRHEGFAKRVLTAQEMERFTSLKGRRQIEYLAGRWSAKEAFSKAMG 68
GIDIE++ S + A ++ + E + + L +SAKE+ KA
Sbjct: 105 GIDIEKIMSQHT----ATELAPSIIDSDERQILQA-SLLPFPLALTLAFSAKESVYKAFS 159

Query: 69 TGISKLGFQDLEV 81
++ GF +V
Sbjct: 160 DRVTLPGFNSAKV 172


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_1674  SECA          1055   0.0      SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 1055 bits (2729), Expect = 0.0
Identities = 391/904 (43%), Positives = 561/904 (62%), Gaps = 71/904 (7%)

Query: 1 MANILKTIIENDKG-EIRRLEKMADKVFKYEDQMAALTDDQLKAKTVEFKERYQNGESLD 59
+ +L + + +RR+ K+ + + E +M L+D++LK KT EF+ R + GE L+
Sbjct: 2 LIKLLTKVFGSRNDRTLRRMRKVVNIINAMEPEMEKLSDEELKGKTAEFRARLEKGEVLE 61

Query: 60 SLLYEAFAVVREGAKRVLGLFPYKVQVMGGIVLHHGDVPEMRTGEGKTLTATMPVYLNAL 119
+L+ EAFAVVRE +KRV G+ + VQ++GG+VL+ + EMRTGEGKTLTAT+P YLNAL
Sbjct: 62 NLIPEAFAVVREASKRVFGMRHFDVQLLGGMVLNERCIAEMRTGEGKTLTATLPAYLNAL 121

Query: 120 SGKGVHVVTVNEYLSERDATEMGELYSWLGLSVGINLATKSPMEKKEAYECDITYSTNSE 179
+GKGVHVVTVN+YL++RDA L+ +LGL+VGINL K+EAY DITY TN+E
Sbjct: 122 TGKGVHVVTVNDYLAQRDAENNRPLFEFLGLTVGINLPGMPAPAKREAYAADITYGTNNE 181

Query: 180 IGFDYLRDNMVVRAENMVQRPLNYALVDEVDSILIDEARTPLIVSGANAVETSQLYHMAD 239
GFDYLRDNM E VQR L+YALVDEVDSILIDEARTPLI+SG + ++Y +
Sbjct: 182 YGFDYLRDNMAFSPEERVQRKLHYALVDEVDSILIDEARTPLIISGPAEDSS-EMYKRVN 240

Query: 240 HYVKSLNKD------------DYIIDVQSKTIGLSDSGIDRAESYF-------KLENLYD 280
+ L + + +D +S+ + L++ G+ E + E+LY
Sbjct: 241 KIIPHLIRQEKEDSETFQGEGHFSVDEKSRQVNLTERGLVLIEELLVKEGIMDEGESLYS 300

Query: 281 IENVALTHFIDNALRANYIMLLDIDYVVSEEQEILIVDQFTGRTMEGRRYSDGLHQAIEA 340
N+ L H + ALRA+ + D+DY+V ++ E++IVD+ TGRTM+GRR+SDGLHQA+EA
Sbjct: 301 PANIMLMHHVTAALRAHALFTRDVDYIV-KDGEVIIVDEHTGRTMQGRRWSDGLHQAVEA 359

Query: 341 KEGVPIQDETKTSASITYQNLFRMYKKLSGMTGTGKTEEEEFREIYNIRVIPIPTNRPVQ 400
KEGV IQ+E +T ASIT+QN FR+Y+KL+GMTGT TE EF IY + + +PTNRP+
Sbjct: 360 KEGVQIQNENQTLASITFQNYFRLYEKLAGMTGTADTEAFEFSSIYKLDTVVVPTNRPMI 419

Query: 401 RIDHSDLLYASIESKFKAVVEDVKARYQKGQPVLVGTVAVETSDYISKKLVAAGVPHEVL 460
R D DL+Y + K +A++ED+K R KGQPVLVGT+++E S+ +S +L AG+ H VL
Sbjct: 420 RKDLPDLVYMTEAEKIQAIIEDIKERTAKGQPVLVGTISIEKSELVSNELTKAGIKHNVL 479

Query: 461 NAKNHYREAQIIMNAGQRGAVTIATNMAGRGTDIKLG----------------------- 497
NAK H EA I+ AG AVTIATNMAGRGTDI LG
Sbjct: 480 NAKFHANEAAIVAQAGYPAAVTIATNMAGRGTDIVLGGSWQAEVAALENPTAEQIEKIKA 539

Query: 498 ------EGVRELGGLCVIGTERHESRRIDNQLRGRSGRQGDPGESQFYLSLEDDLMKRFG 551
+ V E GGL +IGTERHESRRIDNQLRGRSGRQGD G S+FYLS+ED LM+ F
Sbjct: 540 DWQVRHDAVLEAGGLHIIGTERHESRRIDNQLRGRSGRQGDAGSSRFYLSMEDALMRIFA 599

Query: 552 SERLKGIFERLNMSE-EAIESRMLTRQVEAAQKRVEGNNYDTRKQVLQYDDVMREQREII 610
S+R+ G+ +L M EAIE +T+ + AQ++VE N+D RKQ+L+YDDV +QR I
Sbjct: 600 SDRVSGMMRKLGMKPGEAIEHPWVTKAIANAQRKVESRNFDIRKQLLEYDDVANDQRRAI 659

Query: 611 YAQRYDVITADRDLAPEIQSMIKRTIERVVDGHARAKQDEK---LEAILNFAKYNLLPED 667
Y+QR +++ D++ I S+ + + +D + + E+ + + K + +
Sbjct: 660 YSQRNELLDVS-DVSETINSIREDVFKATIDAYIPPQSLEEMWDIPGLQERLKNDFDLDL 718

Query: 668 SIT--MEDLSGLSDKAIKEELFQRALKVYDSQVSKLRDEEAVKEFQKVLILRVVDNKWTD 725
I ++ L ++ ++E + ++++VY + + E ++ F+K ++L+ +D+ W +
Sbjct: 719 PIAEWLDKEPELHEETLRERILAQSIEVYQRKEEVVG-AEMMRHFEKGVMLQTLDSLWKE 777

Query: 726 HIDALDQLRNAVGLRGYAQNNPVVEYQAEGFRMFNDMIGSIEFDVTRLMMKAQIH----- 780
H+ A+D LR + LRGYAQ +P EY+ E F MF M+ S++++V + K Q+
Sbjct: 778 HLAAMDYLRQGIHLRGYAQKDPKQEYKRESFSMFAAMLESLKYEVISTLSKVQVRMPEEV 837

Query: 781 ----EQERPQAERHISTTATRNIAAHQASMP---EDLDLSQIGRNELCPCGSGKKFKNCH 833
+Q R +AER + A+ ++GRN+ CPCGSGKK+K CH
Sbjct: 838 EELEQQRRMEAERLAQMQQLSHQDDDSAAAAALAAQTGERKVGRNDPCPCGSGKKYKQCH 897

Query: 834 GKRQ 837
G+ Q
Sbjct: 898 GRLQ 901


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_1676  PF05272       31     0.004    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.8 bits (69), Expect = 0.004
Identities = 22/81 (27%), Positives = 35/81 (43%), Gaps = 6/81 (7%)

Query: 33 LKGDNGSGKTVLLKVLAG-YIKLDKGKVLQDGKVYGIKNHYIQDAGILIEKVEFLSHLSL 91
L+G G GK+ L+ L G D G K+ Y Q AGI+ ++ ++
Sbjct: 601 LEGTGGIGKSTLINTLVGLDFFSD--THFDIGTG---KDSYEQIAGIVAYELSEMTAFRR 655

Query: 92 RENLELLRYFSSKVTEKRIAY 112
+ + +FSS+ R AY
Sbjct: 656 ADAEAVKAFFSSRKDRYRGAY 676


42  SPCG_1964  SPCG_1969  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
SPCG_1964  -3      13       0.487195   catabolite control protein A
SPCG_1966  -3      16       0.399478   DNA-binding response regulator
SPCG_1967  -1      17       -0.778951  sensor histidine kinase
SPCG_1968  -1      14       -1.876380  hypothetical protein
SPCG_1969  1       16       -2.880710  ABC transporter ATP-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_1964  MALTOSEBP     29     0.025    Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 29.3 bits (65), Expect = 0.025
Identities = 21/79 (26%), Positives = 36/79 (45%), Gaps = 2/79 (2%)

Query: 205 NGK--VRLVGYKETLKKAGITYSEGLVFESKYSYDDGYALAERLISSNATAAVVTGDELA 262
NGK ++ VG KAG+T+ L+ + D Y++AE + TA + G
Sbjct: 199 NGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDYSIAEAAFNKGETAMTINGPWAW 258

Query: 263 AGVLNGLADKGVSVPEDFE 281
+ + + GV+V F+
Sbjct: 259 SNIDTSKVNYGVTVLPTFK 277


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_1966  HTHFIS        74     9e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 74.1 bits (182), Expect = 9e-18
Identities = 25/122 (20%), Positives = 51/122 (41%), Gaps = 2/122 (1%)

Query: 2 KVLVAEDQSMLRDAMCQLLAFQADVESVLQAKNGQEAIQLLEKESVDIAILDVEMPVKTG 61
+LVA+D + +R + Q L+ V N + + D+ + DV MP +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAG--YDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 62 LEVLEWIRAEKLETKVVVVTTFKRPGYFERAVKAGVDAYVLKERSIADLMQTLHTVLEGR 121
++L I+ + + V+V++ +A + G Y+ K + +L+ + L
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 122 KE 123
K
Sbjct: 123 KR 124


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_1967  PF06580       39     1e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 39.5 bits (92), Expect = 1e-05
Identities = 67/376 (17%), Positives = 127/376 (33%), Gaps = 67/376 (17%)

Query: 1 MLERLKSIHYMFWASLIFMLFPILPVVTGWLSAWHLLIDILFVVAYLGVLTTKSQRLSWL 60
L L M+F I + G + AY + +R WL
Sbjct: 24 TLTGFGFASLYGSPKLHSMIFNIAISLMGLV----------LTHAYRSFI----KRQGWL 69

Query: 61 YWGLMLTYVVGNTAFVAVNYIWFFFFLSNLLSYHFSVGGLKSLHVWTFLLAQVLVVGQLL 120
+ + A V + +WF S F + T +A L + +
Sbjct: 70 KLNMGQIILRVLPACVVIGMVWFVANTSIWRLLAF---------INTKPVAFTLPLALSI 120

Query: 121 IFQRIEVEFLFYLLVILAFVDLMTFGLVRIRIVEDLKEAQAKQNAQINLLLAENERSRIG 180
IF + V F++ LL + F + ++ K A Q AQ+ L +++I
Sbjct: 121 IFNVVVVTFMWSLL----YFGWHFFKNYKQAEIDQWKMASMAQEAQLMAL-----KAQIN 171

Query: 181 QDLHDSLGHTFAMLSVKTDLALQLFQMEAYPQVEKELKEIHQISKDSMNEVRTIVENLKS 240
+ + + +E + + L + ++ + S+ +
Sbjct: 172 PHF---MFNALNNIRALI--------LEDPTKAREMLTSLSELMRYSLRYSNA-----RQ 215

Query: 241 RTLTSELETVKKMLEIAGI----EVETDNQLDTASLTQELESTASMILLELVTNIIKHAK 296
+L EL V L++A I ++ +NQ++ A + ++ M++ LV N IKH
Sbjct: 216 VSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQV---PPMLVQTLVENGIKHGI 272

Query: 297 ASKA-----YLKLERAEKELILTVSDDGCGFAFLKGDE----LHTVRDRVSPFSGE---V 344
A LK + + L V + G + L VR+R+ G +
Sbjct: 273 AQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQMLYGTEAQI 332

Query: 345 SVISQKHPTEVQVRLP 360
+ ++ V +P
Sbjct: 333 KLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_1969  PF05272       30     0.017    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.7 bits (66), Expect = 0.017
Identities = 11/32 (34%), Positives = 16/32 (50%)

Query: 31 CVALIGPNGAGKTTLLDCLLGDKLVTSGQVSI 62
V L G G GK+TL++ L+G + I
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHFDI 629


43  SPCG_2118  SPCG_2123  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
SPCG_2118  -1      26       2.227871   arginine deiminase
SPCG_2119  -1      20       2.299821   ornithine carbamoyltransferase
SPCG_2120  -1      15       2.144487   carbamate kinase
SPCG_2121  0       13       2.079328   hypothetical protein
SPCG_2122  0       12       1.830912   hypothetical protein
SPCG_2123  1       19       -2.322122  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_2118  ARGDEIMINASE  552    0.0      Bacterial arginine deiminase signature.
		>ARGDEIMINASE#Bacterial arginine deiminase signature.

Length = 409

Score = 552 bits (1425), Expect = 0.0
Identities = 190/408 (46%), Positives = 270/408 (66%), Gaps = 8/408 (1%)

Query: 5 PIQVFSEIGKLKKVMLHRPGKELENLLPDYLERLLFDDIPFLEDAQKEHDAFAQALRDEG 64
PI +FSEIG+LKKV+LHRPG+ELENL P ++ LFDDIP+LE A++EH+ FA L++
Sbjct: 7 PINIFSEIGRLKKVLLHRPGEELENLTPFIMKNFLFDDIPYLEVARQEHEVFASILKNNL 66

Query: 65 IEVLYLEQLAAESLTSP-EIRDQFIEEYLDEANIRDRQTKVAIRELLHGIKDNQELVEKT 123
+E+ Y+E L +E L S + ++FI +++ EA I+ T +++ ++ K
Sbjct: 67 VEIEYIEDLISEVLVSSVALENKFISQFILEAEIKTDFTINLLKDYFSS-LTIDNMISKM 125

Query: 124 MAGIQKVELPEIPDEAKDLTDLVESDYPFAIDPMPNLYFTRDPFATIGNAVSLNHMFADT 183
++G+ EL L DLV F IDPMPN+ FTRDPFA+IGN V++N MF
Sbjct: 126 ISGVVTEELKNYT---SSLDDLVNGANLFIIDPMPNVLFTRDPFASIGNGVTINKMFTKV 182

Query: 184 RNRETLYGKYIFKYHPIYGGKVDLVYNREEDTRIEGGDELVLSKDVLAVGISQRTDAASI 243
R RET++ +YIFKYHP+Y V + NR E+ +EGGDELVL+K +L +GIS+RT+A S+
Sbjct: 183 RQRETIFAEYIFKYHPVYKENVPIWLNRWEEASLEGGDELVLNKGLLVIGISERTEAKSV 242

Query: 244 EKLLVNIFKKNVGFKKVLAFEFANNRKFMHLDTVFTMVDYDKFTIHPEIEGDLHVYSVTY 303
EKL +++FK F +LAF+ NR +MHLDTVFT +DY FT + +Y +TY
Sbjct: 243 EKLAISLFKNKTSFDTILAFQIPKNRSYMHLDTVFTQIDYSVFTSFTSDDMYFSIYVLTY 302

Query: 304 ENEK--LKIVEEKGDLAELLAQNLGVEKVHLIRCGGGNIVAAAREQWNDGSNTLTIAPGV 361
+ I +EK + ++L+ LG K+ +I+C GG+++ AREQWNDG+N L IAPG
Sbjct: 303 NPSSSKIHIKKEKARIKDVLSFYLG-RKIDIIKCAGGDLIHGAREQWNDGANVLAIAPGE 361

Query: 362 VVVYDRNTVTNKILEEYGLRLIKIRGSELVRGRGGPRCMSMPFEREEV 409
++ Y RN VTNK+ EE G+++ +I SEL RGRGGPRCMSMP RE++
Sbjct: 362 IIAYSRNHVTNKLFEENGIKVHRIPSSELSRGRGGPRCMSMPLIREDI 409


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_2120  CARBMTKINASE  406    e-146    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 406 bits (1046), Expect = e-146
Identities = 139/312 (44%), Positives = 204/312 (65%), Gaps = 5/312 (1%)

Query: 4 RKIVVALGGNAIL--SSDPSAKAQQEALVETAKHLVKLIKNGDDLIITHGNGPQVGNLLL 61
+++V+ALGGNA+ S + + + +TA+ + ++I G +++ITHGNGPQVG+LLL
Sbjct: 3 KRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSLLL 62

Query: 62 QHLASDSEKN-PAFPLDSLVAMTEGSIGFWLKNALQNALLDEGIEKNVASVVTQVVVDKN 120
A + PA P+D AM++G IG+ ++ AL+N L G+EK V +++TQ +VDKN
Sbjct: 63 HMDAGQATYGIPAQPMDVAGAMSQGWIGYMIQQALKNELRKRGMEKKVVTIITQTIVDKN 122

Query: 121 DPAFVNLSKPIGPFYSEEEAKAEAEKSGATFKEDAGRGWRKVVASPKPVDIKEIETIRTL 180
DPAF N +KP+GPFY EE AK A + G KED+GRGWR+VV SP P E ETI+ L
Sbjct: 123 DPAFQNPTKPVGPFYDEETAKRLAREKGWIVKEDSGRGWRRVVPSPDPKGHVEAETIKKL 182

Query: 181 LNNGQVVVAAGGGGIPVVKENNGHLTGVEAVIDKDFASQRLAELVDADLFIVLTGVDYVF 240
+ G +V+A+GGGG+PV+ E +G + GVEAVIDKD A ++LAE V+AD+F++LT V+
Sbjct: 183 VERGVIVIASGGGGVPVILE-DGEIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVNGAA 241

Query: 241 VNYNKPNQEKLEHVNVAQLEEYIKQDQFAPGSMLPKVEAAIAFVNGRPEGKAVITSLENL 300
+ Y ++ L V V +L +Y ++ F GSM PKV AAI F+ +A+I LE
Sbjct: 242 LYYGTEKEQWLREVKVEELRKYYEEGHFKAGSMGPKVLAAIRFIE-WGGERAIIAHLEKA 300

Query: 301 GALIESESGTII 312
+E ++GT +
Sbjct: 301 VEALEGKTGTQV 312


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_2121  RTXTOXINA     37     2e-04    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 37.3 bits (86), Expect = 2e-04
Identities = 28/134 (20%), Positives = 50/134 (37%), Gaps = 19/134 (14%)

Query: 279 FIPWTDLGVTIF-DDFNAWLTGLPVIGNIVGSSTSALGTWYFPEGAMLFAFMGILIGVIY 337
I T+ GVTIF + L GNI+G +G G +L F L +
Sbjct: 99 LIGLTERGVTIFAPQLDKLLQKYQKAGNILGGGAENIGDNLGKAGGILSTFQNFLGTALS 158

Query: 338 GLKEDKIISSFMNG----------AADLLSVALIVAIARGIQVIMNDGMITDTILNWGK- 386
+K D++I +G A+ L L+ +A + + + G
Sbjct: 159 SMKIDELIKKQKSGGNVSSSELAKASIELINQLVDTVASLNNNV---NSFSQQLNTLGSV 215

Query: 387 ----EGLSGLSSQV 396
+ L+G+ +++
Sbjct: 216 LSNTKHLNGVGNKL 229


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_2123  GPOSANCHOR    33     1e-04    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 32.7 bits (74), Expect = 1e-04
Identities = 15/72 (20%), Positives = 32/72 (44%)

Query: 12 ERKQRFSLRKYAIGACSVLLGTSLFFAGMGAQPVQDTETSSALISSHYLDEQDLSEKLKS 71
+ +SLRK G SV + ++ AG+ + + ++ + Q+ ++K +
Sbjct: 5 NTNRHYSLRKLKTGTASVAVALTVLGAGLVVNTNEVSAVATRSQTDTLEKVQERADKFEI 64

Query: 72 ELQWFELENKLL 83
E +L+N L
Sbjct: 65 ENNTLKLKNSDL 76


44  SPCG_2155  SPCG_2162  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
SPCG_2155  -2      12       0.639868   hypothetical protein
SPCG_2156  -2      14       0.087749   Hsp33-like chaperonin
SPCG_2157  -2      11       1.050080   NifR3 family TIM-barrel protein
SPCG_2158  -2      13       0.160919   choline binding protein A
SPCG_2159  -1      19       0.532425   hypothetical protein
SPCG_2160  -2      14       0.952698   sensor histidine kinase
SPCG_2161  -1      16       2.675404   DNA-binding response regulator
SPCG_2162  -2      18       3.389764   ATP-dependent Clp protease, ATP-binding subunit
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_2155  PF05043       43     1e-06    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 43.4 bits (102), Expect = 1e-06
Identities = 31/169 (18%), Positives = 70/169 (41%), Gaps = 8/169 (4%)

Query: 5 DLMEKAECGQFSILSFLLQE-SQTTVKAVMEETGFSKATLTKYVTLLNDKALDSGLELAI 63
DL+ K Q +L L + + E ++ + ++ + D +
Sbjct: 3 DLLSKKSHRQLELLELLFEHKRWFHRSELAELLNCTERAVKDDLSHVKSAFPDL---IFH 59

Query: 64 HSEDENLRLSIGAATKGRDIRSLFLESAVKYQILVYLLYHQQFLAHQLAQELVISEATLG 123
S + ++ + F S + IL ++ +++ A + +E IS ++L
Sbjct: 60 SSTNGIRIINTDDSDIEMVYHHFFKHSTH-FSILEFIFFNEGCQAESICKEFYISSSSLY 118

Query: 124 RHLAGLNQILS---EFDLSIQNGRWRGPEHQIRYFYFCLFRKVWSSQEW 169
R ++ +N+++ +F++S+ + G E IRYF+ F + + EW
Sbjct: 119 RIISQINKVIKRQFQFEVSLTPVQIIGNERDIRYFFAQYFSEKYYFLEW 167


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_2158  IGASERPTASE   63     3e-12    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 62.8 bits (152), Expect = 3e-12
Identities = 48/314 (15%), Positives = 95/314 (30%), Gaps = 52/314 (16%)

Query: 178 TLELEIAEFDVKVKEAELELVKKEADESRNEGTINQAKAKVESEKAEATRLKKIKTDR-- 235
L +D+ E E + I V S E R+ +
Sbjct: 970 KLRNVNGRYDLYNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPA 1029

Query: 236 --------EKAEEEAKRRADAKEQDESKRRKSRVKRGDLGEQATPDKKENDAKSSDSSVG 287
E E +K+ + E++E ++ + ++ ++A + K N + + G
Sbjct: 1030 PATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSG 1089

Query: 288 EETLPSPSLKPGKKVAEAQKKVEEAKKKAKDQKEEDRRNYPTNTYKTLELEIAESDVKVK 347
ET K+ + K +K + K E +
Sbjct: 1090 SET---------KETQTTETKETATVEKEEKAKVETEK---------------------- 1118

Query: 348 EAELELVKEEAKESQNEEKIKQAKAKVESKKAEATRLENIKTDRKKAEEEAKRKAAEEDK 407
+E + ++ KQ +++ +AE R + + K+ + + A E
Sbjct: 1119 -------TQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQP 1171

Query: 408 VKEKPAEQPQPAPAPQPEKPAPKPENPAEQPKAEKPADQQAEEDYA-RRSEEEYNRLTQQ 466
KE + QP E P+ PA Q + + +R + +
Sbjct: 1172 AKETSSNVEQPVTESTTVNTGNSVV---ENPENTTPATTQPTVNSESSNKPKNRHRRSVR 1228

Query: 467 QPPKTEKPAQPSTP 480
P +PA S+
Sbjct: 1229 SVPHNVEPATTSSN 1242



Score = 58.5 bits (141), Expect = 7e-11
Identities = 43/223 (19%), Positives = 83/223 (37%), Gaps = 12/223 (5%)

Query: 270 ATPDKKENDAKSSDSSVGE----ETLPSPSLKPGKKVAEAQKKVEEAKKKAKDQKEEDRR 325
TP+ + D S S+ E + P P P + E +K+++K ++ ++
Sbjct: 998 TTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQD 1057

Query: 326 NYPTNTYKTLELEIAESDVKVKEAELELVKEEAKESQNEEKIKQAKAKVESKKAEATRLE 385
T + A+S+VK E+ + ++ + + + A VE + E
Sbjct: 1058 ATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKE-------E 1110

Query: 386 NIKTDRKKAEEEAKRKAAEEDKVKEKPAEQPQPAPAPQPEKPAPKPENPAEQPKAEKPAD 445
K + +K +E K + K ++ QPQ PA + + P + P Q +
Sbjct: 1111 KAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPAREND-PTVNIKEPQSQTNTTADTE 1169

Query: 446 QQAEEDYARRSEEEYNRLTQQQPPKTEKPAQPSTPKTGWKQEN 488
Q A+E + + T + + +TP T N
Sbjct: 1170 QPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVN 1212



Score = 56.6 bits (136), Expect = 3e-10
Identities = 39/207 (18%), Positives = 73/207 (35%), Gaps = 11/207 (5%)

Query: 288 EETLPSPSLKPGKKVAEAQKKVE-EAKKKAKDQKEEDRRNYPTNTYKTLELEIAESDVKV 346
+T+ + ++ + V ++ A+ + P +T E S +
Sbjct: 989 NQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQES 1048

Query: 347 KEAELELVKEEAKESQNEEKIKQAKAKVES--KKAEATRL--ENIKTDRKKAEEEAKRKA 402
K E +QN E K+AK+ V++ + E + E +T + +E A +
Sbjct: 1049 KTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEK 1108

Query: 403 AEEDKV-KEKPAEQPQPAPAPQPEKPAPKPENPAEQPKAEKPADQQAEEDYARRSEEEYN 461
E+ KV EK E P+ P++ + P +P E +E + + N
Sbjct: 1109 EEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKE-----PQSQTN 1163

Query: 462 RLTQQQPPKTEKPAQPSTPKTGWKQEN 488
+ P E + P T N
Sbjct: 1164 TTADTEQPAKETSSNVEQPVTESTTVN 1190


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_2161  HTHFIS        81     7e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 80.6 bits (199), Expect = 7e-20
Identities = 36/119 (30%), Positives = 62/119 (52%), Gaps = 4/119 (3%)

Query: 3 ILVADDEEMIREGIAAFLTEEGYHVIMAKDGQEVLEKFQDLPIHLMVLDLMMPRKSGFEV 62
ILVADD+ IR + L+ GY V + + + L+V D++MP ++ F++
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 63 LKEINQ-KHDIPVIVLSALGDETTQSQVFDLYADDHVTKPFSL---VLLVKRIKALIRR 117
L I + + D+PV+V+SA T + + A D++ KPF L + ++ R A +R
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
SPCG_2162  HTHFIS        34     0.003    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 34.0 bits (78), Expect = 0.003
Identities = 34/172 (19%), Positives = 56/172 (32%), Gaps = 28/172 (16%)

Query: 502 KYLNLEAELHKRVIGQDQAVSSISRAIRRNQSGIRSHKRPIGSFMFLGPTGVGKTELAKA 561
L +++ ++G+ A+ I R + R + + M G +G GK +A+A
Sbjct: 127 SKLEDDSQDGMPLVGRSAAMQEIYRVLAR----LMQTDLTL---MITGESGTGKELVARA 179

Query: 562 LAEVLFDDESALIRFDMSEYMEKFAASRLNGAPPGYVGYEEGGELTEKVRNKPYSV---- 617
L + + +M+ S L G E G T
Sbjct: 180 LHDYGKRRNGPFVAINMAAIPRDLIESELFGH--------EKGAFTGAQTRSTGRFEQAE 231

Query: 618 ---LLFDEVEKAHPDIFNVLLQVLDDGVLT---DSKGRKVDFSNTIIIMTSN 663
L DE+ D LL+VL G T + D I+ +N
Sbjct: 232 GGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVR---IVAATN 280
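

The "Score = ... bits (...), Expect = ..." lines in the alignment blocks follow the usual BLAST text layout, so they can be extracted mechanically, for example to re-check hits against the E-value cutoff listed in the input parameters (0.05). The following is a minimal Python sketch and is not part of PredictBias; the names parse_expect, parse_score_line and passes_cutoff are hypothetical, and the shorthand "e-122" is assumed to stand for roughly 1e-122 (the mantissa is not printed).

```python
import re

# Matches lines such as "Score = 29.3 bits (65), Expect = 0.020".
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)


def parse_expect(text):
    """Convert an Expect field to float; 'e-122' is read as 1e-122 (assumption)."""
    if text.startswith("e"):
        text = "1" + text
    return float(text)


def parse_score_line(line):
    """Return (bit_score, raw_score, e_value), or None if the line does not match."""
    match = SCORE_RE.search(line)
    if match is None:
        return None
    return float(match.group(1)), int(match.group(2)), parse_expect(match.group(3))


def passes_cutoff(line, cutoff=0.05):
    """True if the line is a score line whose E-value is at or below the cutoff."""
    parsed = parse_score_line(line)
    return parsed is not None and parsed[2] <= cutoff
```

Applied to every line of the report, parse_score_line returns None for alignment rows and a tuple for each reported hit (including each additional segment listed under a multi-segment hit such as IGASERPTASE).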



 
Contact Sachin Pundhir for Bugs/Comments.