PredictBias

Identification of genomic and pathogenicity islands in prokaryotic genomes
 
A) Input parameters
Genome                        SJM180.gbk
Threshold dinucleotide bias   2
Threshold codon bias          4
Threshold %GC bias            3
E-value (RPSBlast)            0.05
Genome (non-pathogenic)
 
C) Potential GIs and PAIs in CP002073
S.No  Start        End          Bias  Virulence  Insertion elements  Prediction
1     HPSJM_00305  HPSJM_00500  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
HPSJM_00305  2       16       0.124795   Proline/pyrroline-5-carboxylate dehydrogenase
HPSJM_00310  6       24       -1.002913  hypothetical protein
HPSJM_00315  6       24       -0.952937  hypothetical protein
HPSJM_00320  5       22       -1.326233  hypothetical protein
HPSJM_00325  5       19       -0.630342  hypothetical protein
HPSJM_00330  3       18       0.861499   hypothetical protein
HPSJM_00350  3       18       1.637999   hypothetical protein
HPSJM_00355  2       16       1.907013   hypothetical protein
HPSJM_00370  2       15       1.831994   hypothetical protein
HPSJM_00375  1       13       2.700655   hypothetical protein
HPSJM_00390  4       21       3.729958   urease accessory protein UreH
HPSJM_00395  4       23       3.527495   Urease accessory protein UreG
HPSJM_00400  3       20       2.548572   urease accessory protein (ureF)
HPSJM_00405  2       18       2.651857   urease accessory protein UreE
HPSJM_00410  2       20       2.645200   urease accessory protein / pH-dependent
HPSJM_00415  1       17       2.699925   urease subunit beta
HPSJM_00420  -3      11       1.828378   urease subunit alpha
HPSJM_00430  -1      13       2.567611   *lipoprotein signal peptidase
HPSJM_00435  1       15       3.122164   phosphoglucosamine mutase
HPSJM_00440  2       16       3.293690   30S ribosomal protein S20
HPSJM_00445  1       13       2.194817   peptide chain release factor 1
HPSJM_00450  3       14       1.938265   hypothetical protein
HPSJM_00455  2       13       1.844795   outer membrane protein HorA
HPSJM_00460  2       13       0.875063   hypothetical protein
HPSJM_00465  -1      13       0.103047   hypothetical protein
HPSJM_00470  -2      12       0.150929   methyl-accepting chemotaxis protein (MCP)
HPSJM_00475  1       14       0.390806   30S ribosomal protein S9
HPSJM_00480  1       12       0.549450   50S ribosomal protein L13
HPSJM_00485  1       12       0.817914   hypothetical protein
HPSJM_00490  1       10       0.288513   Malate:quinone oxidoreductase (Malate
HPSJM_00495  1       12       0.227160   hypothetical protein
HPSJM_00500  2       14       0.056407   RNA polymerase sigma factor RpoD
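
The thresholds in section A can be compared against the per-ORF bias values of an island. The Python sketch below is only an illustration: the rule it applies (absolute value strictly greater than the threshold) is an assumption and is not stated anywhere in this report, and the names `OrfBias` and `exceeded` are hypothetical. The three sample rows are taken from island 1, read as DNBias, CDNBias and %GCBias.

```python
from dataclasses import dataclass

# Thresholds copied from section A of this report.
DN_THRESHOLD = 2    # dinucleotide bias
CDN_THRESHOLD = 4   # codon bias
GC_THRESHOLD = 3    # %GC bias


@dataclass
class OrfBias:
    """One row of a per-ORF bias table (hypothetical container)."""
    locus_tag: str
    dn_bias: int
    cdn_bias: int
    gc_bias: float


def exceeded(orf: OrfBias) -> list[str]:
    """Return the bias measures whose absolute value is above its threshold.

    ASSUMPTION: the comparison rule (abs(value) > threshold) is a guess made
    for illustration; the tool may apply its thresholds differently.
    """
    flags = []
    if abs(orf.dn_bias) > DN_THRESHOLD:
        flags.append("DN")
    if abs(orf.cdn_bias) > CDN_THRESHOLD:
        flags.append("CDN")
    if abs(orf.gc_bias) > GC_THRESHOLD:
        flags.append("GC")
    return flags


rows = [
    OrfBias("HPSJM_00305", 2, 16, 0.124795),
    OrfBias("HPSJM_00390", 4, 21, 3.729958),
    OrfBias("HPSJM_00420", -3, 11, 1.828378),
]

for orf in rows:
    print(orf.locus_tag, exceeded(orf))
```

Under this assumed rule, HPSJM_00390 is above all three thresholds, while HPSJM_00305 is above the codon bias threshold only.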
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPSJM_00305  ANTHRAXTOXNA  31     0.034    Anthrax toxin LF subunit signature.
		>ANTHRAXTOXNA#Anthrax toxin LF subunit signature.

Length = 800

Score = 30.9 bits (69), Expect = 0.034
Identities = 36/173 (20%), Positives = 71/173 (41%), Gaps = 19/173 (10%)

Query: 121 QEESQLKERILKRKNEKIILNVNFIGEEVLGEEEANARFEKY---SQALKSNYIQYISIK 177
Q+ S+ ++ + + EK+ F+ E+ + + Y S+ K Y +
Sbjct: 118 QDLSEEEKNSMNSRGEKVPFASRFVFEKKRETPKLIINIKDYAINSEQSKEVYYEIGKGI 177

Query: 178 ITTIFSQINILDFEY-----SKKEIVKRLDALYALALEEEKKQGMPKFINLDMEEFRDLE 232
I S+ LD E+ S + D L++ +E K + K I+++ ++
Sbjct: 178 SLDIISKDKSLDPEFLNLIKSLSDDSDSSDLLFSQKFKE-KLELNNKSIDINF-----IK 231

Query: 233 LTVESFMESIAK-----FDLNAGIVLQAYIPDSYEYLKKLHAFSKERVLKGLK 280
+ F + + F + VL+ Y PD +EY+ KL E++ + LK
Sbjct: 232 ENLTEFQHAFSLAFSYYFAPDHRTVLELYAPDMFEYMNKLEKGGFEKISESLK 284
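
The statistics line of each alignment (Identities, Positives, Gaps) can be read programmatically. The sketch below is a minimal Python example, not part of the tool: the helper names `parse_stats` and `floored_percent` are hypothetical. It parses the statistics line of the alignment above and recomputes each percentage with integer division, which reproduces the printed values for this line (for example, 19/173 is printed as 10%, not 11%).

```python
import re

# Matches "Name = count/length (percent%)" for the three BLAST statistics.
STAT_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(line: str) -> dict[str, tuple[int, int, int]]:
    """Map each statistic name to (count, alignment_length, reported_percent)."""
    return {
        name: (int(count), int(length), int(percent))
        for name, count, length, percent in STAT_RE.findall(line)
    }


def floored_percent(count: int, length: int) -> int:
    """Percentage rounded down to a whole number."""
    return count * 100 // length


# Statistics line copied from the ANTHRAXTOXNA alignment above.
line = "Identities = 36/173 (20%), Positives = 71/173 (41%), Gaps = 19/173 (10%)"
stats = parse_stats(line)

for name, (count, length, reported) in stats.items():
    print(name, floored_percent(count, length), reported)
```

Alignments without gaps omit the Gaps field; the pattern then simply returns the two fields that are present.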


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPSJM_00325  GPOSANCHOR    39     2e-05    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 38.9 bits (90), Expect = 2e-05
Identities = 43/234 (18%), Positives = 77/234 (32%), Gaps = 14/234 (5%)

Query: 16 EELEARIGELEDENAELLREKNDLFVETS--------------GLKDANNQLRQKNDKLF 61
+ L+ EL +E + + S L+ A +
Sbjct: 81 KALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADS 140

Query: 62 ITKDKLTKANTELYRERNDLAREKENLNNQLNASQKQVKELEQSQQVLKNEKAELSKEKE 121
L L + DL + E N A ++K LE + L+ +AEL K E
Sbjct: 141 AKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALE 200

Query: 122 NLTKANTDLKTENDKLNHQVIALNKEQGSLKQERVRLQDEHGFLEKRCTNLEKENQRLTE 181
+T + L + AL + L++ + + LE E L
Sbjct: 201 GAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEA 260

Query: 182 KLKQLESAQKSLENTNNQLRQALENSNAQLAQAEEKIATEKTELEREIARLKSL 235
+ +LE A + N + ++ A+ A E + A + + + A +SL
Sbjct: 261 RQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSL 314



Score = 35.4 bits (81), Expect = 2e-04
Identities = 49/220 (22%), Positives = 75/220 (34%), Gaps = 7/220 (3%)

Query: 17 ELEARIGELEDENAELLREKNDLFVETSGLKDANNQLRQKNDKLFITKDKLTKANTELYR 76
A+I LE E A L K DL G + + K L L
Sbjct: 138 ADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAK-------IKTLEAEKAALEA 190

Query: 77 ERNDLAREKENLNNQLNASQKQVKELEQSQQVLKNEKAELSKEKENLTKANTDLKTENDK 136
+ +L + E N A ++K LE + L KA+L K E +T +
Sbjct: 191 RQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKT 250

Query: 137 LNHQVIALNKEQGSLKQERVRLQDEHGFLEKRCTNLEKENQRLTEKLKQLESAQKSLENT 196
L + AL Q L++ + + LE E L + LE + L
Sbjct: 251 LEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNAN 310

Query: 197 NNQLRQALENSNAQLAQAEEKIATEKTELEREIARLKSLE 236
LR+ L+ S Q E + + + + A +SL
Sbjct: 311 RQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLR 350



Score = 33.5 bits (76), Expect = 8e-04
Identities = 43/226 (19%), Positives = 82/226 (36%)

Query: 15 REELEARIGELEDENAELLREKNDLFVETSGLKDANNQLRQKNDKLFITKDKLTKANTEL 74
+ ELE + + + + L E + L L + + + L
Sbjct: 192 QAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTL 251

Query: 75 YRERNDLAREKENLNNQLNASQKQVKELEQSQQVLKNEKAELSKEKENLTKANTDLKTEN 134
E+ L + L L + + L+ EKA L EK +L + L
Sbjct: 252 EAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANR 311

Query: 135 DKLNHQVIALNKEQGSLKQERVRLQDEHGFLEKRCTNLEKENQRLTEKLKQLESAQKSLE 194
L + A + + L+ E +L++++ E +L ++ E KQLE+ + LE
Sbjct: 312 QSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLE 371

Query: 195 NTNNQLRQALENSNAQLAQAEEKIATEKTELEREIARLKSLEGMEA 240
N + ++ L + E + LE ++L +LE +
Sbjct: 372 EQNKISEASRQSLRRDLDASREAKKQVEKALEEANSKLAALEKLNK 417



Score = 33.1 bits (75), Expect = 0.001
Identities = 38/203 (18%), Positives = 75/203 (36%)

Query: 15 REELEARIGELEDENAELLREKNDLFVETSGLKDANNQLRQKNDKLFITKDKLTKANTEL 74
+ +LE + + + + L E + L+ +L + + + L
Sbjct: 157 KADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTL 216

Query: 75 YRERNDLAREKENLNNQLNASQKQVKELEQSQQVLKNEKAELSKEKENLTKANTDLKTEN 134
E+ LA K +L L + + L+ EKA L + L KA +
Sbjct: 217 EAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFS 276

Query: 135 DKLNHQVIALNKEQGSLKQERVRLQDEHGFLEKRCTNLEKENQRLTEKLKQLESAQKSLE 194
+ ++ L E+ +L+ E+ L+ + L +L ++ E KQLE+ + LE
Sbjct: 277 TADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLE 336

Query: 195 NTNNQLRQALENSNAQLAQAEEK 217
N + ++ L + E
Sbjct: 337 EQNKISEASRQSLRRDLDASREA 359



Score = 28.1 bits (62), Expect = 0.048
Identities = 24/163 (14%), Positives = 55/163 (33%)

Query: 84 EKENLNNQLNASQKQVKELEQSQQVLKNEKAELSKEKENLTKANTDLKTENDKLNHQVIA 143
+ + N + + +L + + LK+ EL++E N + + ++
Sbjct: 58 RADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQE 117

Query: 144 LNKEQGSLKQERVRLQDEHGFLEKRCTNLEKENQRLTEKLKQLESAQKSLENTNNQLRQA 203
L + L++ + + LE E L + LE A + N +
Sbjct: 118 LEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAK 177

Query: 204 LENSNAQLAQAEEKIATEKTELEREIARLKSLEGMEAKSDLDL 246
++ A+ A E + A + LE + + + +
Sbjct: 178 IKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEK 220


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPSJM_00415  UREASE        1045   0.0      Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 1045 bits (2703), Expect = 0.0
Identities = 354/569 (62%), Positives = 443/569 (77%), Gaps = 4/569 (0%)

Query: 3 KISRKEYVSMYGPTTGDKVRLGDTDLIAEVEHDYTIYGEELKFGGGKTLREGMSQSN-NP 61
++SR Y +M+GPT GDKVRL DT+L EVE D+T +GEE+KFGGGK +R+GM QS
Sbjct: 4 RMSRAAYANMFGPTVGDKVRLADTELFIEVEKDFTTHGEEVKFGGGKVIRDGMGQSQVTR 63

Query: 62 SKEELDLIITNALIVDYTGIYKADIGIKDGKIAGIGKGGNKDMQDGVKNNLSVGPATEAL 121
+D +ITNALI+D+ GI KADIG+KDG+IA IGK GN DMQ GV + VGP TE +
Sbjct: 64 EGGAVDTVITNALILDHWGIVKADIGLKDGRIAAIGKAGNPDMQPGV--TIIVGPGTEVI 121

Query: 122 AGEGLIVTAGGIDTHIHFISPQQIPTAFASGVTTMIGGGTGPADGTNATTITPGRRNLKW 181
AGEG IVTAGG+D+HIHFI PQQI A SG+T M+GGGTGPA GT ATT TPG ++
Sbjct: 122 AGEGKIVTAGGMDSHIHFICPQQIEEALMSGLTCMLGGGTGPAHGTLATTCTPGPWHIAR 181

Query: 182 MLRAAEEYSMNLGFLAKGNASNDASLADQIEAGAIGFKIHEDWGTTPSAINHALDVADKY 241
M+ AA+ + MNL F KGNAS +L + + GA K+HEDWGTTP+AI+ L VAD+Y
Sbjct: 182 MIEAADAFPMNLAFAGKGNASLPGALVEMVLGGATSLKLHEDWGTTPAAIDCCLSVADEY 241

Query: 242 DVQVAIHTDTLNEAGCVEDTMAAIAGRTMHTFHTEGAGGGHAPDIIKVAGEHNILPASTN 301
DVQV IHTDTLNE+G VEDT+AAI GRT+H +HTEGAGGGHAPDII++ G+ N++P+STN
Sbjct: 242 DVQVMIHTDTLNESGFVEDTIAAIKGRTIHAYHTEGAGGGHAPDIIRICGQPNVIPSSTN 301

Query: 302 PTIPFTVNTEAEHMDMLMVCHHLDKSIKEDVQFADSRIRPQTIAAEDTLHDMGIFSITSS 361
PT P+TVNT AEH+DMLMVCHHL +I ED+ FA+SRIR +TIAAED LHD+G FSI SS
Sbjct: 302 PTRPYTVNTLAEHLDMLMVCHHLSPTIPEDIAFAESRIRKETIAAEDILHDIGAFSIISS 361

Query: 362 DSQAMGRVGEVITRTWQTADKNKKEFGRLKEEKGDNDNFRIKRYLSKYTINPAIAHGISE 421
DSQAMGRVGEV RTWQTADK K++ GRLKEE GDNDNFR+KRY++KYTINPAIAHG+S
Sbjct: 362 DSQAMGRVGEVAIRTWQTADKMKRQRGRLKEETGDNDNFRVKRYIAKYTINPAIAHGLSH 421

Query: 422 YVGSVEVGKVADLVLWSPAFFGVKPNMIIKGGFIALSQMGDANASIPTPQPVYYREMFAH 481
+GS+EVGK ADLVLW+PAFFGVKP+M++ GG IA + MGD NASIPTPQPV+YR MF
Sbjct: 422 EIGSLEVGKRADLVLWNPAFFGVKPDMVLLGGTIAAAPMGDPNASIPTPQPVHYRPMFGA 481

Query: 482 HGKAKYDANITFVSQAAYDKGIKEELGLERQVLPVKNCR-NITKKDMQFNDTTAHIEVNP 540
+G+++ ++++TFVSQA+ D G+ LG+ ++++ V+N R I K M N T HIEV+P
Sbjct: 482 YGRSRTNSSVTFVSQASLDAGLAGRLGVAKELVAVQNTRGGIGKASMIHNSLTPHIEVDP 541

Query: 541 ETYHVFVDGKEVTSKPANKVSLAQLFSIF 569
ETY V DG+ +T +PA + +AQ + +F
Sbjct: 542 ETYEVRADGELLTCEPATVLPMAQRYFLF 570


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPSJM_00445  TYPE4SSCAGX   31     0.008    Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein signature.
Length = 522

Score = 30.9 bits (69), Expect = 0.008
Identities = 26/92 (28%), Positives = 48/92 (52%), Gaps = 6/92 (6%)

Query: 16 DELTALLSNAEVISDIKKLTELSKEQSSIEEISIASKEYLSVLEDIKENKELLEDKELSE 75
+ LT +SN + +S+ K L+EL K+Q E + + LED++E + K++ E
Sbjct: 181 ENLTNAMSNPQNLSNNKNLSELIKQQRENE------LDQMERLEDMQEQAQANALKQIEE 234

Query: 76 LAKEELKILEIQKSDLETAIKQLLIPKDPNDD 107
L K++ + Q++ + +IK K P D+
Sbjct: 235 LNKKQAEEAVRQRAKDKISIKTDKSQKSPEDN 266


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPSJM_00455  FLAGELLIN     33     0.004    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 32.7 bits (74), Expect = 0.004
Identities = 25/205 (12%), Positives = 57/205 (27%), Gaps = 6/205 (2%)

Query: 87 TASSGTPAPSTPPAKKDETSGTPSASGSSVASQLTKDTTMVNNLKSVSVSAMNTTLSGVT 146
A +G D T + + K +T +N K A T +
Sbjct: 263 KAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTTINGEKVTLTVADITAGAANV 322

Query: 147 QLSQQTAAISNLLS---GNPNLGSVISNAQGLSSAFSALESAQNTLKGYLDSSSATIGQL 203
+ ++ + S G N S A + + K ++ + T
Sbjct: 323 DAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESKITVNGAEYTANAA 382

Query: 204 TNGSNAVVGALNKAINQVDMALADLATADTQKT---QAVTLATASATTTTDAINFLNALK 260
+ + ++ A K + ++ + + L A++
Sbjct: 383 GDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPLASIDSALSKVDAVRSSLGAIQ 442

Query: 261 NNLTAQKDAFMNVHKNIQTAVAQAQ 285
N + N N+ +A ++ +
Sbjct: 443 NRFDSAITNLGNTVTNLNSARSRIE 467


2     HPSJM_00570  HPSJM_00595  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
HPSJM_00570  2       12       1.767750   hypothetical protein
HPSJM_00575  1       13       3.242744   hypothetical protein
HPSJM_00580  1       13       3.611146   Methyl-accepting chemotaxis protein tlpB;
HPSJM_00585  0       13       3.815519   2', 3'-cyclic-nucleotide 2'-phosphodiesterase
HPSJM_00590  -2      11       4.812977   S-ribosylhomocysteinase
HPSJM_00595  -2      12       4.030872   cystathionine gamma-synthase/cystathionine
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPSJM_00590  LUXSPROTEIN   224    2e-78    Bacterial autoinducer-2 (AI-2) production protein Lu...
		>LUXSPROTEIN#Bacterial autoinducer-2 (AI-2) production protein LuxS signature.
Length = 171

Score = 224 bits (571), Expect = 2e-78
Identities = 56/145 (38%), Positives = 91/145 (62%), Gaps = 7/145 (4%)

Query: 5 VESFNLDHTKVKAPYVRIADRKKGANGDLIVKYDVRFKQPNQDHMDMPSLHSLEHLVAEI 64
++SF +DHT++ AP VR+A + GD I +D+RF PN+D + +H+LEHL A
Sbjct: 3 LDSFTVDHTRMNAPAVRVAKTMQTPKGDTITVFDLRFTAPNKDILSEKGIHTLEHLYAGF 62

Query: 65 IRNHA----SYVVDWSPMGCQTGFYLTVLNHDNYTEILEVLEKTMQDVLKAK---EVPAS 117
+RNH ++D SPMGC+TGFY++++ + ++ + M+DVLK + ++P
Sbjct: 63 MRNHLNGDSVEIIDISPMGCRTGFYMSLIGTPSEQQVADAWIAAMEDVLKVENQNKIPEL 122

Query: 118 NEKQCGWAANHTLEGAQNLARAFLD 142
NE QCG AA H+L+ A+ +A+ L+
Sbjct: 123 NEYQCGTAAMHSLDEAKQIAKNILE 147


3     HPSJM_00950  HPSJM_01075  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
HPSJM_00950  2       23       -4.481585  fructose-bisphosphate aldolase
HPSJM_00955  1       23       -4.803566  elongation factor P
HPSJM_00960  1       24       -5.077811  DNA-cytosine methyltransferase
HPSJM_00965  -1      20       -4.430499  hypothetical protein
HPSJM_00970  -1      17       -4.233015  putative restriction enzyme
HPSJM_00975  -2      12       -2.446645  hypothetical protein
HPSJM_00980  -1      13       0.029959   sialic acid synthase
HPSJM_00985  -1      11       0.005950   ABC transporter, ATP-binding protein
HPSJM_00990  -1      10       -0.249186  apolipoprotein N-acyltransferase
HPSJM_00995  1       11       0.381587   hypothetical protein
HPSJM_01000  2       14       0.566345   lysyl-tRNA synthetase
HPSJM_01005  2       16       0.842502   serine hydroxymethyltransferase
HPSJM_01010  1       16       -0.085530  hypothetical protein
HPSJM_01015  2       15       0.517980   hypothetical protein
HPSJM_01020  1       13       2.621861   hypothetical protein
HPSJM_01025  -2      11       2.820031   hypothetical protein
HPSJM_01030  -2      10       1.926779   hypothetical protein
HPSJM_01035  -2      9        2.127311   phospholipase D-family protein
HPSJM_01040  -1      12       3.171962   fumarate reductase iron-sulfur subunit
HPSJM_01045  -1      12       3.238075   fumarate reductase flavoprotein subunit
HPSJM_01050  -1      15       1.877059   fumarate reductase cytochrome b-556 subunit
HPSJM_01055  -1      15       1.931793   triosephosphate isomerase
HPSJM_01060  -2      16       3.352893   enoyl-(acyl carrier protein) reductase
HPSJM_01065  -2      16       3.477469   UDP-3-O-[3-hydroxymyristoyl] glucosamine
HPSJM_01070  -1      17       3.691533   S-adenosylmethionine synthetase
HPSJM_01075  -1      18       3.014437   nucleoside diphosphate kinase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPSJM_00985  PF05272       30     0.006    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.4 bits (68), Expect = 0.006
Identities = 12/53 (22%), Positives = 22/53 (41%), Gaps = 1/53 (1%)

Query: 29 LAILGVSGSGKSTLLSHLATMLKPNSGTISLLEHQDIY-ALNSKKLLELRRLK 80
+ + G G GKSTL++ L + + + +D Y + EL +
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIVAYELSEMT 651


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPSJM_01015  IGASERPTASE   33     0.001    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 33.5 bits (76), Expect = 0.001
Identities = 34/149 (22%), Positives = 58/149 (38%), Gaps = 6/149 (4%)

Query: 50 PKETFLQTDSGMQKIGNTKDEKKDDEFESLNMDSPKQEDKLDKVADNVKKQENDAFNMPT 109
P ET ++ T ++ + D E+ + ++ V N Q N+ +
Sbjct: 1033 PSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKAN--TQTNEVAQSGS 1090

Query: 110 QTDQTQTEMKTAEETQEAQKELKV-VEHTPIIAQKESQAVAKKEISHK-KPKATPKDKEA 167
+T +TQT T E +++ KV E T + + SQ K+E S +P+A P +
Sbjct: 1091 ETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPAREND 1150

Query: 168 HKDKDKHAVKELKVKKEAHKEVPKKANSK 196
K + + A E P K S
Sbjct: 1151 PTVNIKEP--QSQTNTTADTEQPAKETSS 1177


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPSJM_01060  DHBDHDRGNASE  60     7e-13    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.
Length = 261

Score = 60.4 bits (146), Expect = 7e-13
Identities = 60/263 (22%), Positives = 109/263 (41%), Gaps = 29/263 (11%)

Query: 4 LKGKKGLIVGVANNKSIAYGIAQSCFNQGATL-AFTYLNESLEKRVRPIAQELNSPYVYE 62
++GK I G A + I +A++ +QGA + A Y E LEK V + E +
Sbjct: 6 IEGKIAFITGAA--QGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP 63

Query: 63 LDVSKEEHFKSLYNSVKKDLGSLDFIVHSVAF--------APKEALEGSLLETSKSAFNT 114
DV + +++++G +D +V+ E E + S FN
Sbjct: 64 ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNA 123

Query: 115 AMEISVYSLIELTNTLKPLLNNGASVLTLSYLGSTKYMAHYNVMGLAKAALESAVRYLAV 174
+ +S Y + + ++ + +N A V S MA Y +KAA + L +
Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPAGVPRTS-------MAAY---ASSKAAAVMFTKCLGL 173

Query: 175 DLGKHHIRVNALSAGPIRT-----LASSGIADFRMILKWNE---INAPLRKNVSLEEVGN 226
+L +++IR N +S G T L + ++I E PL+K ++ +
Sbjct: 174 ELAEYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIAD 233

Query: 227 AGMYLLSSLSSGVSGEVHFVDAG 249
A ++L+S + ++ VD G
Sbjct: 234 AVLFLVSGQAGHITMHNLCVDGG 256


4     HPSJM_01595  HPSJM_01790  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
HPSJM_01595  2       16       3.578361   hypothetical protein
HPSJM_01600  0       15       3.800512   50S ribosomal protein L21
HPSJM_01605  0       15       3.809546   50S ribosomal protein L27
HPSJM_01610  0       15       3.774700   periplasmic dipeptide-binding protein
HPSJM_01615  0       15       4.362666   dipeptide transport system permease protein
HPSJM_01620  -1      15       3.507781   dipeptide permease protein
HPSJM_01625  -3      15       3.124152   ABC-type transport system, ATP-binding protein;
HPSJM_01630  -2      16       2.734608   ABC-type transport system, ATP binding protein;
HPSJM_01635  -2      15       2.158138   GTPase ObgE
HPSJM_01640  -2      14       1.662613   hypothetical protein
HPSJM_01645  0       18       2.188046   hypothetical protein
HPSJM_01650  1       20       2.897377   glutamate-1-semialdehyde aminotransferase
HPSJM_01655  4       19       2.552990   hypothetical protein
HPSJM_01660  4       17       2.360908   hypothetical protein
HPSJM_01665  3       17       2.130818   N-carbamoyl-D-amino acid amidohydrolase
HPSJM_01670  4       16       1.772579   hypothetical protein
HPSJM_01675  1       15       3.237527   hypothetical protein
HPSJM_01680  2       16       2.865824   ATP-binding protein
HPSJM_01685  -1      16       1.818328   nitrite extrusion protein
HPSJM_01690  0       16       1.969819   hypothetical protein
HPSJM_01695  -1      15       2.018479   hypothetical protein
HPSJM_01700  0       16       1.358709   outer membrane protein BabA
HPSJM_01705  2       14       -1.334464  putative heme iron utilization protein
HPSJM_01710  1       13       -1.370276  arginyl-tRNA synthetase
HPSJM_01715  1       12       -1.050733  Sec-independent protein translocase protein
HPSJM_01720  0       11       -1.244994  guanylate kinase
HPSJM_01725  0       11       -1.540451  poly E-rich protein
HPSJM_01730  -1      11       -2.052755  nuclease NucT
HPSJM_01735  0       10       -2.067593  outer membrane protein HorC
HPSJM_01740  1       13       -2.359884  flagellar basal body L-ring protein
HPSJM_01745  2       13       -1.879862  CMP-N-acetylneuraminic acid synthetase
HPSJM_01750  2       11       -1.138469  CMP-N-acetylneuraminic acid synthetase (neuA)
HPSJM_01755  2       11       -0.772352  flagellar biosynthesis protein G
HPSJM_01760  1       13       0.518776   tetraacyldisaccharide 4'-kinase
HPSJM_01765  1       15       1.244835   NAD synthetase
HPSJM_01770  1       17       1.686940   *ketol-acid reductoisomerase
HPSJM_01775  1       18       0.940316   MinD cell division inhibitor protein
HPSJM_01780  3       18       0.041724   cell division topological specificity factor
HPSJM_01785  2       16       -0.577599  hypothetical protein
HPSJM_01790  2       17       -2.331424  Holliday junction resolvase-like protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPSJM_01685  TCRTETA       45     3e-07    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 45.2 bits (107), Expect = 3e-07
Identities = 53/271 (19%), Positives = 101/271 (37%), Gaps = 16/271 (5%)

Query: 28 LILSGSLTPHQSFQLGIAVLMGYVFGSFLIQFLSPLMSLESIAKISFGLIALSFLVCYFD 87
L+ S +T H L + LM + L LS + +S A+ + +
Sbjct: 35 LVHSNDVTAHYGILLALYALMQFACAPVLGA-LSDRFGRRPVLLVSLAGAAVDYAI--MA 91

Query: 88 SIPFFW-LWIWRFIAGVASSALMILVAPLSLPYVKEHKKALVGGLIFSAVGIGSVFSGFV 146
+ PF W L+I R +AG+ + A + ++A G + + G G V +
Sbjct: 92 TAPFLWVLYIGRIVAGI-TGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVL 150

Query: 147 LPWISSYNIKWAWIFLGGSCLIAFILSLVGLK-----TRSLRKKSVKKEESAFKIPFHL- 200
+ ++ + + F+ L R ++ ++F+ +
Sbjct: 151 GGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMT 210

Query: 201 ---WLLLVSCALNAIGFLPHTLFWVDYLIRHLNISPTIAGTSWAFFG-FGATLGSLISGP 256
L+ V + +G +P L WV + + T G S A FG + ++I+GP
Sbjct: 211 VVAALMAVFFIMQLVGQVPAAL-WVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGP 269

Query: 257 MAQKLGAKNANIFILILKSIACFLPIFFHQI 287
+A +LG + A + +I L F +
Sbjct: 270 VAARLGERRALMLGMIADGTGYILLAFATRG 300


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPSJM_01690  SECA          26     0.040    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 26.4 bits (58), Expect = 0.040
Identities = 12/69 (17%), Positives = 34/69 (49%), Gaps = 12/69 (17%)

Query: 4 TTAKKDYTKYSEKQLVNLIHQLERKIKKMQNDRVSFKEKMAKELEKRDQNFKDKIDALNE 63
+ + + +++VN+I+ +E +++K+ ++ EL+ + F+ +++
Sbjct: 12 SRNDRTLRRM--RKVVNIINAMEPEMEKLSDE----------ELKGKTAEFRARLEKGEV 59

Query: 64 LLQKISQAF 72
L I +AF
Sbjct: 60 LENLIPEAF 68


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPSJM_01720  PF05272       29     0.011    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.3 bits (65), Expect = 0.011
Identities = 9/18 (50%), Positives = 11/18 (61%)

Query: 8 LILSGPSGAGKSTLTKYL 25
++L G G GKSTL L
Sbjct: 599 VVLEGTGGIGKSTLINTL 616


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPSJM_01725  IGASERPTASE   71     4e-15    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 70.9 bits (173), Expect = 4e-15
Identities = 61/282 (21%), Positives = 108/282 (38%), Gaps = 19/282 (6%)

Query: 168 QEEKEEVKETPQEEKPKDDETQESETPKDEEV---SKELETQEKLEIPKEETQEEVKEEI 224
E++ + +T P + + P + E E ET E V E
Sbjct: 985 VEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENS 1044

Query: 225 KEEAQEEVKEETQENKEEKQEKTQDSPSAQELEAMQELVKEIQENSNDQENKEKTQESTE 284
K+E++ K E + Q + + ++A + + Q S +E + + T
Sbjct: 1045 KQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETA 1104

Query: 285 IPQDKEIQEVVTEKTQAQELEIPKEKTQESAEALQ-ETQAHELEKQEIAETPQDVEIPQS 343
+ +E +V TEKTQ E+PK +Q S + Q ET + E + +++ PQS
Sbjct: 1105 TVEKEEKAKVETEKTQ----EVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQS 1160

Query: 344 QEKETQETQEDHYESIEDIPEPVMAKAMGEELPFLNEAVAKTPNNENDTETPKESVIKTP 403
Q T +T++ E+ ++ +PV +V + P N T P
Sbjct: 1161 QTNTTADTEQPAKETSSNVEQPVTESTTVNTGN----SVVENPENTTPATT-------QP 1209

Query: 404 QEKEESAKTPKSDKTSSPLELHLNLQDLLKSLNQESLKSLLE 445
ES+ PK+ S + N++ S N S +L +
Sbjct: 1210 TVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCD 1251



Score = 62.4 bits (151), Expect = 2e-12
Identities = 42/212 (19%), Positives = 85/212 (40%), Gaps = 20/212 (9%)

Query: 159 EQLLPTLDVQEEKEEVKETPQEEKPKDDETQESETPKDEEVSKELETQE----KLEIPKE 214
E + +++ + E E+ + Q E K+ + + + TQ + +
Sbjct: 1035 ETTETVAENSKQESKTVEK-NEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETK 1093

Query: 215 ETQE-EVKEEIKEEAQEEVKEETQENKEEKQEKTQDSPSAQELEAMQ------------E 261
ETQ E KE E +E+ K ET++ +E + +Q SP ++ E +Q
Sbjct: 1094 ETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTV 1153

Query: 262 LVKEIQENSNDQENKEKTQESTEIPQDKEIQEVVTEKTQAQELEIPKEKTQESAEALQET 321
+KE Q +N + E+ + T ++ + E T T +E P+ T + + +
Sbjct: 1154 NIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNS 1213

Query: 322 QAHELEKQEIAETPQDVEIPQSQEKETQETQE 353
++ K + + V P + E T + +
Sbjct: 1214 ESSNKPKNRHRRSVRSV--PHNVEPATTSSND 1243



Score = 56.6 bits (136), Expect = 1e-10
Identities = 44/239 (18%), Positives = 77/239 (32%), Gaps = 21/239 (8%)

Query: 152 KEEPNNEEQLLPTLDVQEEKEEVKETPQEEKPKDDETQESETPKDEE--------VSKEL 203
NNEE + E ++ QES+T + E ++E+
Sbjct: 1009 SVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREV 1068

Query: 204 ETQEKLEIPKEETQEEVKEEIKEEAQEEVKEETQENKEEKQEKTQDSPSAQELEAMQELV 263
+ K + EV + E + + E + EK+EK + E E QE+
Sbjct: 1069 AKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAK-----VETEKTQEVP 1123

Query: 264 KEIQENSNDQENKEKTQESTEIPQDKEIQEVVTEKTQAQ----ELEIPKEKTQESAEALQ 319
K + S QE E Q E ++ + + E + E P ++T + E
Sbjct: 1124 KVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPV 1183

Query: 320 ETQAHELEKQEIAETPQDVEIPQSQEKETQETQE----DHYESIEDIPEPVMAKAMGEE 374
+ E P++ +Q E+ H S+ +P V
Sbjct: 1184 TESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSN 1242



Score = 42.0 bits (98), Expect = 5e-06
Identities = 36/229 (15%), Positives = 64/229 (27%), Gaps = 11/229 (4%)

Query: 142 ENLGDLEALAKEEPNNEEQLLPTLDVQEEKEEVKETPQEEKPKDDETQESETPKDEEVSK 201
E +AKE +N + T +V + E KET Q + K+ T E E E K
Sbjct: 1060 ETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKET-QTTETKETATVEKEEKAKVETEK 1118

Query: 202 ELET----------QEKLEIPKEETQEEVKEEIKEEAQEEVKEETQENKEEKQEKTQDSP 251
E QE+ E + + + + + +E + E+ K S
Sbjct: 1119 TQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSN 1178

Query: 252 SAQELEAMQELVKEIQENSNDQENKEKTQESTEIPQDKEIQEVVTEKTQAQELEIPKEKT 311
Q + + N + T + T + + ++ + T
Sbjct: 1179 VEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPAT 1238

Query: 312 QESAEALQETQAHELEKQEIAETPQDVEIPQSQEKETQETQEDHYESIE 360
S + A Q + H +E
Sbjct: 1239 TSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVALNVGKAVSQHISQLE 1287



Score = 42.0 bits (98), Expect = 5e-06
Identities = 39/236 (16%), Positives = 78/236 (33%), Gaps = 12/236 (5%)

Query: 111 QKKLGSNASELEPRQNLDPTQEILETNWDELENLGDLEALAKEEPNNEEQLLPTLDVQEE 170
+ + E + N+ + E E KE E++ ++ ++
Sbjct: 1060 ETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKT 1119

Query: 171 KEEVKET----PQEEKPKDDETQ-ESETPKD-----EEVSKELETQEKLEIPKEETQEEV 220
+E K T P++E+ + + Q E D +E + T E P +ET V
Sbjct: 1120 QEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNV 1179

Query: 221 KEEIKEEAQEEVKEETQENKEEKQEKTQDSPSAQELEAMQELVKEIQENSNDQENKEKTQ 280
++ + E EN E T E + + + + N E
Sbjct: 1180 EQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPK-NRHRRSVRSVPHNVEPAT 1238

Query: 281 ESTEIPQDKEIQEVVTEKTQAQELEIPKEKTQESAEALQETQAHELEKQEIAETPQ 336
S+ + ++ + T A L + K Q A + + + + + E+ Q
Sbjct: 1239 TSSNDRSTVALCDLTSTNTNA-VLSDARAKAQFVALNVGKAVSQHISQLEMNNEGQ 1293


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPSJM_01740  FLGLRINGFLGH  191    2e-63    Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 191 bits (487), Expect = 2e-63
Identities = 52/172 (30%), Positives = 85/172 (49%), Gaps = 18/172 (10%)

Query: 56 GERPLFADRRAMKPNDLITIIVSEKASANYSSS----KDYKSASGGNSTPPRLTYNGLDE 111
G +PLF DRR D +TI++ E SA+ SSS +D K+ G ++ P L GL
Sbjct: 61 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYL--QGLFG 118

Query: 112 RKKQEVQYLDDKNNYNFTKSSNNTNFKGGGSQKKSEDLEIVLSARIIKVLENGNYFIYGN 171
+ +V+ S F G G S L+ + +VL NGN + G
Sbjct: 119 NARADVEA------------SGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGE 166

Query: 172 KEVLVDGEKQILKVSGVIRPYDIERNNTIQSKFLADAKIEYTNLGHLSDSNK 223
K++ ++ + ++ SGV+ P I +NT+ S +ADA+IEY G+++++
Sbjct: 167 KQIAINQGTEFIRFSGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQN 218


5     HPSJM_02375  HPSJM_02440  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
HPSJM_02375  0       11       -3.182459  molybdenum ABC transporter ModB
HPSJM_02380  0       9        -1.688388  molybdenum ABC transporter ModD
HPSJM_02385  -1      9        -2.077521  glutamyl-tRNA synthetase
HPSJM_02390  -2      12       -2.703948  outer membrane protein HopK
HPSJM_02395  -2      12       -2.792709  type II adenine specific methyltransferase
HPSJM_02400  -2      11       -1.407462  putative non-functional type II restriction
HPSJM_02405  3       13       0.783313   hypothetical protein
HPSJM_02410  7       17       -0.270607  type II adenine specific DNA methyltransferase
HPSJM_02415  2       16       0.168393   type II restriction endonuclease
HPSJM_02420  2       17       0.155868   type II DNA modification enzyme
HPSJM_02425  2       17       0.069104   catalase-like protein
HPSJM_02430  3       17       0.220546   outer membrane protein HofC
HPSJM_02435  3       16       -0.750557  outer membrane protein HofD
HPSJM_02440  4       19       -1.497665  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPSJM_02380  PF05272       30     0.009    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.4 bits (68), Expect = 0.009
Identities = 11/23 (47%), Positives = 14/23 (60%)

Query: 30 VVALLGESGAGKSTILRILAGLE 52
V L G G GKST++ L GL+
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLD 620


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPSJM_02405  TCRTETOQM     199    5e-58    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.
Length = 639

Score = 199 bits (507), Expect = 5e-58
Identities = 116/461 (25%), Positives = 190/461 (41%), Gaps = 67/461 (14%)

Query: 3 NIRNIAVIAHVDHGKTTLVDGLLSQSGTFSEREKVDE--RVMDSNDLERERGITILSKNT 60
I NI V+AHVD GKTTL + LL SG +E VD+ D+ LER+RGITI + T
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61

Query: 61 AIYYKDTKINIIDTPGHADFGGEVERVLKMVDGVLLLVDAQEGVMPQTKFVVKKALSFGI 120
+ +++TK+NIIDTPGH DF EV R L ++DG +LL+ A++GV QT+ + GI
Sbjct: 62 SFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGI 121

Query: 121 CPIVVVNKIDKPAAEPDRVVDEVFDLF---------VAMGASDKQLDFPV-----VYAAA 166
I +NKID+ + V ++ + V + + +F
Sbjct: 122 PTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEG 181

Query: 167 RDGYAMKSLDDE----------------------------KKNL--EPLFETILEHVPSP 196
D K + + K N+ + L E I S
Sbjct: 182 NDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITNKFYSS 241

Query: 197 SGSVDEPLQMQIFTLDYDNYVGKIGIARVFNGSVKKNESVLLMKSDGSKENGRITKLIGF 256
+ L ++F ++Y ++ R+++G + +SV + KE +IT++
Sbjct: 242 THRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRI----SEKEKIKITEMYTS 297

Query: 257 LGLARTEIENAYAGDIVALAG--FNAMDV-GDSVVDPTNPMPLDPMHLEEPTMSVYFAVN 313
+ +I+ AY+G+IV L V GD+ + P +P P + +
Sbjct: 298 INGELCKIDKAYSGEIVILQNEFLKLNSVLGDTKLLPQRERIENP----LPLLQTTVEPS 353

Query: 314 DSPLAGLEGKHVTANKLKDRLLKEMQTNIAMKCEEMGEGKFKVSGRGELQITILAENLRR 373
+ + D LL+ + + +S G++Q+ + L+
Sbjct: 354 KPQQREMLLDALLEISDSDPLLRYYVDSAT--------HEIILSFLGKVQMEVTCALLQE 405

Query: 374 E-GFEFSISRPEVIIKEENGVKCEPFEHLVIDTPQDFSGAI 413
+ E I P VI E K E H+ + P F +I
Sbjct: 406 KYHVEIEIKEPTVIYMERPLKKAEYTIHIEVP-PNPFWASI 445



Score = 41.8 bits (98), Expect = 8e-06
Identities = 20/80 (25%), Positives = 30/80 (37%), Gaps = 1/80 (1%)

Query: 396 EPFEHLVIDTPQDFSGAIIERLGKRKAEMKAMNPMSDGYTRLEFEIPARGLIGYRSEFLT 455
EP+ I PQ++ K A + + + L EIPAR + YRS+
Sbjct: 537 EPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQ-LKNNEVILSGEIPARCIQEYRSDLTF 595

Query: 456 DTKGEGVMNHSFLEFRPFSG 475
T G V + +G
Sbjct: 596 FTNGRSVCLTELKGYHVTTG 615


6     HPSJM_02530  HPSJM_02675  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
HPSJM_02530  2       13       -0.344045  UDP-sugar diphosphatase
HPSJM_02535  2       14       0.563282   hypothetical protein
HPSJM_02540  -1      15       0.381756   glycolate oxidase, subunit GlcD
HPSJM_02545  3       18       -2.510026  dihydrodipicolinate reductase
HPSJM_02550  7       20       -4.127257  hypothetical protein
HPSJM_02555  7       20       -4.175205  cag pathogenicity island protein B
HPSJM_02560  6       21       -3.773607  cag pathogenicity island protein C
HPSJM_02565  6       19       -3.565183  cag pathogenicity island protein D
HPSJM_02570  5       20       -2.688915  cag pathogenicity island protein E
HPSJM_02575  6       21       -2.547783  cag island protein
HPSJM_02580  6       19       -2.092454  cag pathogenicity island protein (cag21)
HPSJM_02585  6       18       -2.672867  hypothetical protein
HPSJM_02590  7       18       -3.104710  cag pathogenicity island protein H
HPSJM_02595  7       20       -4.238702  cag pathogenicity island protein I
HPSJM_02600  12      22       -4.715160  cag pathogenicity island protein L
HPSJM_02605  11      24       -4.739830  cag island protein
HPSJM_02610  10      24       -4.820683  cag island protein
HPSJM_02615  9       29       -4.178667  cag pathogenicity island protein Q
HPSJM_02620  10      26       -4.272940  CAG pathogenicity island protein 13
HPSJM_02625  10      20       -2.926835  CAG pathogenicity island protein 12
HPSJM_02630  9       20       -3.169507  cag island protein
HPSJM_02635  9       19       -3.000440  cag pathogenicity island protein V
HPSJM_02640  9       18       -2.524506  cag pathogenicity island protein W
HPSJM_02645  10      16       -2.506626  cag pathogenicity island protein X
HPSJM_02650  10      16       -2.085358  cag island protein
HPSJM_02655  8       18       -2.520338  cag pathogenicity island protein (cag6)
HPSJM_02660  8       18       -2.241644  cag island protein, DNA transfer protein
HPSJM_02665  6       16       -2.249494  cag island protein, DNA transfer protein
HPSJM_02670  1       12       -1.217610  cag pathogenicity island protein Gamma
HPSJM_02675  2       12       -1.039849  cag pathogenicity island protein 3
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPSJM_02535  IGASERPTASE   47     1e-07    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 47.4 bits (112), Expect = 1e-07
Identities = 30/190 (15%), Positives = 63/190 (33%), Gaps = 17/190 (8%)

Query: 260 EPKKANQGAENAPTLE---EKNYQKAE-----RKLDSKEERRYLRDERKKAKATKKAMEL 311
+ AEN+ EKN Q A + +KE + ++ + + + E
Sbjct: 1033 PSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSET 1092

Query: 312 EE------REKEHDERDERETEGRRKALEMDKGNEKVNTKENEQEIKQ---EAIKEPDNG 362
+E +E E++E+ K E+ K +V+ K+ + E Q E +E D
Sbjct: 1093 KETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPT 1152

Query: 363 NNATQQGEKQNAPKENKASKEENKPNSKEEKRRLKEEKKKAKAEQRAREFEQRAREHQER 422
N + + N + + +E N ++ + +
Sbjct: 1153 VNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVN 1212

Query: 423 DEKELEERRK 432
E + + +
Sbjct: 1213 SESSNKPKNR 1222



Score = 46.2 bits (109), Expect = 2e-07
Identities = 32/213 (15%), Positives = 69/213 (32%), Gaps = 12/213 (5%)

Query: 236 RIEKKEERIDTRE----NKREIKQEAIKEPKKANQGAENAPTLEEKNYQKAERKLDSKEE 291
+EK+ + +DT N + ++ + + AP +E E
Sbjct: 984 EVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAEN 1043

Query: 292 RRYLRDERKKAKATKKAMELEEREKEHDERDERETEGRRKALEMDKGNEKV------NTK 345
+ ++ + K + A E + +E + + + + E+ + + TK
Sbjct: 1044 SK--QESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETK 1101

Query: 346 ENEQEIKQEAIKEPDNGNNATQQGEKQNAPKENKASKEENKPNSKEEKRRLKEEKKKAKA 405
E K+E K + Q +PK+ ++ + + E K+
Sbjct: 1102 ETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQ 1161

Query: 406 EQRAREFEQRAREHQERDEKELEERRKALEMNK 438
+ EQ A+E E+ + E N
Sbjct: 1162 TNTTADTEQPAKETSSNVEQPVTESTTVNTGNS 1194


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPSJM_02570  SECETRNLCASE     33  0.003    Bacterial translocase SecE signature.
		>SECETRNLCASE#Bacterial translocase SecE signature.

Length = 127

Score = 32.6 bits (74), Expect = 0.003
Identities = 13/72 (18%), Positives = 24/72 (33%), Gaps = 12/72 (16%)

Query: 48 DGGNRLFGFPETFIYSSIFILFVTIVLSVILFQAYELVLIVAIVIVLVALGF-------- 99
G L E + + L + ++ L++ L L V++L+A
Sbjct: 9 GSGRGL----EAMKWVVVVALLLVAIVGNYLYRDIMLPLRALAVVILIAAAGGVALLTTK 64

Query: 100 KKDYRLYQRMER 111
K + R R
Sbjct: 65 GKATVAFAREAR 76


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
HPSJM_02590  TCRTETA     29  0.035    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 28.6 bits (64), Expect = 0.035
Identities = 20/92 (21%), Positives = 36/92 (39%), Gaps = 16/92 (17%)

Query: 35 MAVGNN--ILNISKLTGEFNAQGNTQGAQIGAVNSQIASILASNTTPKNPSAIEA-YATN 91
MA +L I ++ G T GA + IA I + ++ + A +
Sbjct: 90 MATAPFLWVLYIGRIVA-----GIT-GATGAVAGAYIADITDGDERARHFGFMSACFGFG 143

Query: 92 QIAVPSVPTTVEMISGILGNITSAAPKYALAL 123
+A P ++ G++G + AP +A A
Sbjct: 144 MVAGP-------VLGGLMGGFSPHAPFFAAAA 168


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
HPSJM_02605  TYPE4SSCAGX     32  0.003    Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein signature.

Length = 522

Score = 31.7 bits (71), Expect = 0.003
Identities = 30/119 (25%), Positives = 56/119 (47%), Gaps = 16/119 (13%)

Query: 24 AINTALLPSEYKELVALGFKKIKTLYQRHDDKEITKEEKEFATNALREKLRNDRARAEQI 83
A+N AL+ +Y+E + K K + D KE+ +++K EK + + +A++
Sbjct: 112 AVNFALMTRDYQEFL----KTKKLIVDAPDPKELEEQKKAL------EKEKEAKEQAQKA 161

Query: 84 QKNIEAFEKKNNSSVQKKATKHRGLQELNEINANPLNDNPNGNSSTETKSNKDDNFDEM 142
QK+ K +++A L+ L +NP N + N N S K +++ D+M
Sbjct: 162 QKD------KREKRKEERAKNRANLENLTNAMSNPQNLSNNKNLSELIKQQRENELDQM 214


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
HPSJM_02635  PF04335    118  6e-35    VirB8 type IV secretion protein
		>PF04335#VirB8 type IV secretion protein

Length = 227

Score = 118 bits (298), Expect = 6e-35
Identities = 44/205 (21%), Positives = 74/205 (36%), Gaps = 10/205 (4%)

Query: 27 KLNKANRTFKRAFYL---SMALNVAAVTSIVMMMPLKKTDIFVYGIDRYTGEFKIVKRSD 83
KL A R+ K A+ + + AL A V ++ + PLK + +V +DR TGE I +
Sbjct: 24 KLAAAERSKKLAWVVAGVAGALATAGVVAVAALTPLKTVEPYVITVDRNTGEASIAAKLH 83

Query: 84 A-RQIVNSEAVVDSATSKFVSLLFGYSKNSLRDRKDQLMQYCDVSFQTQAMRMFNENIRQ 142
I EAV + +V G+ + + D +M Q + R + + Q
Sbjct: 84 GDATITYDEAVRKYFLATYVRYREGWIAAAREEYFDAVMVMSARPEQDRWSRFYKTDNPQ 143

Query: 143 FVDKVRA-EAIISSNIQREKVKNSPLTRLTFFITIKITPDTMENYEYITKKQVTIYYDFA 201
+ A + I + +F +T T TI Y
Sbjct: 144 SPQNILANRTDVFVEI-KRVSFLGGNVAQVYFTKESVTGSNS----TKTDAVATIKYKVD 198

Query: 202 RGNSSQENLIINPFGFKVFDIQITD 226
S + + NP G++V +
Sbjct: 199 GTPSKEVDRFKNPLGYQVESYRADV 223


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
HPSJM_02645  TYPE4SSCAGX    862  0.0      Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein signature.

Length = 522

Score = 862 bits (2227), Expect = 0.0
Identities = 512/522 (98%), Positives = 516/522 (98%), Gaps = 1/522 (0%)

Query: 1 MGQAFFKKIVNCFCLGYLFLSSVIEAAP-DIKNFNRGRVKVVNKKIAYLGDEKPITIWTS 59
MGQAFFKKIV CFCLGYLFLSS IEA DIKNFNRGRVKVVNKKIAYLGDEKPITIWTS
Sbjct: 1 MGQAFFKKIVGCFCLGYLFLSSAIEAVALDIKNFNRGRVKVVNKKIAYLGDEKPITIWTS 60

Query: 60 LDNVTVIQLEKDETISYITTGFNKGWNIVPNSNHIFIQPKSVKSNLMFEKEAVNFALMTR 119
LDNVTVIQLEKDETISYITTGFNKGW+IVPNSNHIFIQPKSVKSNLMFEKEAVNFALMTR
Sbjct: 61 LDNVTVIQLEKDETISYITTGFNKGWSIVPNSNHIFIQPKSVKSNLMFEKEAVNFALMTR 120

Query: 120 DYQEFLKTKKLIVDAPDPKELEEQKKALEKEKEAKEQAQKAQKDKREKRKEERAKNRANL 179
DYQEFLKTKKLIVDAPDPKELEEQKKALEKEKEAKEQAQKAQKDKREKRKEERAKNRANL
Sbjct: 121 DYQEFLKTKKLIVDAPDPKELEEQKKALEKEKEAKEQAQKAQKDKREKRKEERAKNRANL 180

Query: 180 ENLTNAMSNPQNLSNNKNLSELIKQQRENELDQMERLEDMQEQAQANALKQIEELNKKQA 239
ENLTNAMSNPQNLSNNKNLSELIKQQRENELDQMERLEDMQEQAQANALKQIEELNKKQA
Sbjct: 181 ENLTNAMSNPQNLSNNKNLSELIKQQRENELDQMERLEDMQEQAQANALKQIEELNKKQA 240

Query: 240 EETIKQRAKDKISIKTDKSQKSPEDNSIELSPSDSAWRTNLVVRTNKALYQFILRIAQKD 299
EE ++QRAKDKISIKTDKSQKSPEDNSIELSPSDSAWRTNLVVRTNKALYQFILRIAQKD
Sbjct: 241 EEAVRQRAKDKISIKTDKSQKSPEDNSIELSPSDSAWRTNLVVRTNKALYQFILRIAQKD 300

Query: 300 NFASAYLTVKLEYPQRHEVSSVIEEELKKREEAKRQKELIKQENLNTTAYINRVMMASNE 359
NFASAYLTVKLEYPQRHEVSSVIEEELKKREEAKRQ+ELIKQENLNTTAYINRVMMASNE
Sbjct: 301 NFASAYLTVKLEYPQRHEVSSVIEEELKKREEAKRQRELIKQENLNTTAYINRVMMASNE 360

Query: 360 QIINKEKIREEKQKIILDQAKALETQYVHNALKRNPVPRNYNYYQAPEKRSKHIMPSEIF 419
QIINKEKIREEKQKIILDQAKALETQYVHNALKRNPVPRNYNYYQAPEKRSKHIMPSEIF
Sbjct: 361 QIINKEKIREEKQKIILDQAKALETQYVHNALKRNPVPRNYNYYQAPEKRSKHIMPSEIF 420

Query: 420 DDGTFTYFGFKNITLQPAIFVVQPDGKLSMTDAAIDPNMTNSGLRWYRVNEIAEKFKLIK 479
DDGTFTYFGFKNITLQPAIFVVQPDGKLSMTDAAIDPNMTNSGLRWYRVNEIAEKFKLIK
Sbjct: 421 DDGTFTYFGFKNITLQPAIFVVQPDGKLSMTDAAIDPNMTNSGLRWYRVNEIAEKFKLIK 480

Query: 480 DKALVTVINKGYGKNPLTKNYNIKNYGELERVIKKLPLVRDK 521
DKALVTVINKGYGKNPLTKNYNIKNYGELERVIKKLPLVRDK
Sbjct: 481 DKALVTVINKGYGKNPLTKNYNIKNYGELERVIKKLPLVRDK 522


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
HPSJM_02650  IGASERPTASE     37  0.001    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 36.6 bits (84), Expect = 0.001
Identities = 41/221 (18%), Positives = 86/221 (38%), Gaps = 8/221 (3%)

Query: 813 QAKNEAERKECEKLLTPEARKLLEEAKESVKAYKDCVSRARNEKEKKECEKLLTPEARKL 872
+ NE + E + P A E E+V S+ + E+ E T + R++
Sbjct: 1011 PSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATET--TAQNREV 1068

Query: 873 LEESKKSVKAYLDC--VSQAKNEAERKECEKLLTPEARKLLEEAKKSVKAYLDCVSRARN 930
+E+K +VKA V+Q+ +E + + + + E+AK + +
Sbjct: 1069 AKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQ 1128

Query: 931 EKEKKECEKLLTPEARKLLENQALDCLKNAK----TEAEKKRCVKDLPKDLQKKVLAKES 986
K+E + + P+A EN +K + T A+ ++ K+ ++++ V +
Sbjct: 1129 VSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTT 1188

Query: 987 VRVYLDCVSKAKNEAERKECEKLLTPEARKLLEEAKESVKA 1027
V V +N + + + K + SV++
Sbjct: 1189 VNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRS 1229



Score = 35.8 bits (82), Expect = 0.002
Identities = 38/229 (16%), Positives = 83/229 (36%), Gaps = 10/229 (4%)

Query: 606 TPEAKKLLEEEAKESVKAYLDCVSQAKTEDEKKECEKLLTPEAKKKLEEAKKSVKAYLDC 665
P ++ + + ++KT ++ ++ T + ++ +EAK +VKA
Sbjct: 1023 APVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQT 1082

Query: 666 VSQAKTEDEKKECEKLLTPEAKKLLEQQALDCLKNAKTDEERKECLKDLPKDLQKKVL-- 723
A++ E KE + T E + +++ KT E K + PK Q + +
Sbjct: 1083 NEVAQSGSETKETQTTETKETATVEKEEKAKVET-EKTQEVPKVTSQVSPKQEQSETVQP 1141

Query: 724 ----AKESVRVYL--DCVSKAKNEAERKECEKLLTPEARKLLEEAKESVKAYLDCVSRAR 777
A+E+ + S+ A+ ++ K + + + E +V V
Sbjct: 1142 QAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTE-STTVNTGNSVVENPE 1200

Query: 778 NEKEKKECEKLLTPEARKLLEESKKSVKAYLDCVSQAKNEAERKECEKL 826
N + + + K ++SV++ V A + + L
Sbjct: 1201 NTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVAL 1249



Score = 35.4 bits (81), Expect = 0.002
Identities = 34/183 (18%), Positives = 70/183 (38%), Gaps = 6/183 (3%)

Query: 737 KAKNEAERKECEKLLTPEARKLLEEAKESVKAYLDCVSRARNEKEKKECEKLLTPEARKL 796
+ NE + E + P A E E+V S+ + E+ E T + R++
Sbjct: 1011 PSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATET--TAQNREV 1068

Query: 797 LEESKKSVKAYLDC--VSQAKNEAERKECEKLLTPEARKLLEEAKESVKAYKDCVSRARN 854
+E+K +VKA V+Q+ +E + + + + E+AK + ++
Sbjct: 1069 AKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQ 1128

Query: 855 EKEKKECEKLLTPEARKLLEESKKSVKAYLDCVSQAKNEAERKECEKLLTPEARKLLEEA 914
K+E + + P+A E + SQ A+ ++ K + + + E+
Sbjct: 1129 VSPKQEQSETVQPQAEPAREND--PTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTES 1186

Query: 915 KKS 917

Sbjct: 1187 TTV 1189



Score = 34.7 bits (79), Expect = 0.005
Identities = 38/222 (17%), Positives = 86/222 (38%), Gaps = 6/222 (2%)

Query: 958 KNAKTEAEKKRCVKDLP-KDLQKKVLAKESVRVYLDCVSKAKNEAERKECEKLLTPEARK 1016
+ + E+ V + P ++ + V + ++K + ++ T + R+
Sbjct: 1008 PSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNRE 1067

Query: 1017 LLEEAKESVKAYKDCVSRARNEKEKQECEKLLTPEARKLLEQEVKKSVKAYLDCVSR-AR 1075
+ +EAK +VKA A++ E +E + T E + ++E K V +
Sbjct: 1068 VAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTS 1127

Query: 1076 NEKEKKECEKLLTPKARKLLENQALDCLKNAK----TEAEKKRCVKDLPKDLQKKVLAKK 1131
K+E + + P+A EN +K + T A+ ++ K+ ++++ V
Sbjct: 1128 QVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTEST 1187

Query: 1132 SVKAYLDCVSRARNEKEKQECEKLLTPEARKLLEEAKKSVKA 1173
+V V N + + + K ++SV++
Sbjct: 1188 TVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRS 1229



Score = 33.1 bits (75), Expect = 0.013
Identities = 33/194 (17%), Positives = 69/194 (35%), Gaps = 20/194 (10%)

Query: 1 MNEENDKLETSKKAQQHSPQDLSNEEATEANHFEDLLKEESSDNHLDNSTETQTHFDEDK 60
EN K E+ + +ATE + +E+ N N+ + +
Sbjct: 1039 TVAENSKQESKTVEKNEQ-------DATETTAQNREVAKEAKSNVKANTQTNEVAQSGSE 1091

Query: 61 LEETQTQMDSEGNETSESSNGSLADKLFKKARKLVDNKRPFTQQKNLDEETQELNEEDDQ 120
+ETQT E + +KA+ + + + + QE +E
Sbjct: 1092 TKETQTTETKETATVEKE----------EKAKVETEKTQEVPKVTSQVSPKQEQSETVQP 1141

Query: 121 ENNEYQEETQTGLIDDETSKKAQQHSPQDLSNEEATEVNHFEDLLKEESSDNHLDNPTES 180
+ +E T I + S+ + + E ++ V E + E ++ N ++ E+
Sbjct: 1142 QAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNV---EQPVTESTTVNTGNSVVEN 1198

Query: 181 SDNHLDNSTESSDN 194
+N +T+ + N
Sbjct: 1199 PENTTPATTQPTVN 1212



Score = 32.7 bits (74), Expect = 0.016
Identities = 43/260 (16%), Positives = 88/260 (33%), Gaps = 25/260 (9%)

Query: 1142 RARNEKEKQECEKLLTPEARKLLEEAKKSVKAYLDCVSRARNEKEKQECEKLLTPEARKL 1201
+ NE+ + E + P A E ++V S+ + E+ E T + R++
Sbjct: 1011 PSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATET--TAQNREV 1068

Query: 1202 LEEAKESLKAYKDCLSQARNETERRACEKLLTPEARKLLEQEVKKSVKAYLDCVSRARNE 1261
+EAK ++KA A++ +E + + T E + ++E +A+ E
Sbjct: 1069 AKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEE-------------KAKVE 1115

Query: 1262 KEKQECEKLLTPEARKFLEKQRQQKDKAIKDCLKNADPNDRAAIMKCLDGL-----SDEE 1316
EK + +T + +Q++ + ++ + A ND +K E+
Sbjct: 1116 TEKTQEVPKVTSQV-----SPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQ 1170

Query: 1317 KLKYLQEAREKAVLDCLKTARTDEEKRKCQNLYSDLIQEIQNKRAQNKQNQLSKTERLHQ 1376
K E+ V + + +N Q N + NK +
Sbjct: 1171 PAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSV 1230

Query: 1377 ASECLDNLDDPTDQEAIEQC 1396
D+ + C
Sbjct: 1231 PHNVEPATTSSNDRSTVALC 1250



Score = 32.3 bits (73), Expect = 0.023
Identities = 41/246 (16%), Positives = 73/246 (29%), Gaps = 16/246 (6%)

Query: 1073 RARNEKEKKECEKLLTPKARKLLENQALDCLKNAKTEA---EKKRCVKDLPKDLQKKVL- 1128
+ NE+ + E + P A +N+K E+ EK ++V
Sbjct: 1011 PSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAK 1070

Query: 1129 -AKKSVKAYLDCVSRARNEKEKQECEKLLTPEARKLLEEAKKSVKA--YLDCVSRARNEK 1185
AK +VKA A++ E +E + T E + +E K V+ +
Sbjct: 1071 EAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVS 1130

Query: 1186 EKQECEKLLTPEARKLLEEAKESLKAYKDCLSQARNETE---RRACEKLLTPEARKLLEQ 1242
KQE + + P+A E + +TE + + P
Sbjct: 1131 PKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVN 1190

Query: 1243 EVKKSVKAYLDCVSRARNEKEKQECEKLLTPEARKFLEKQRQQKDKAIKDCLKNADPNDR 1302
V+ + E R+ + + A NDR
Sbjct: 1191 TGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPA------TTSSNDR 1244

Query: 1303 AAIMKC 1308
+ + C
Sbjct: 1245 STVALC 1250



Score = 32.3 bits (73), Expect = 0.023
Identities = 42/259 (16%), Positives = 85/259 (32%), Gaps = 13/259 (5%)

Query: 522 RARNEKEKKECEKLLTPEAKKKLEQQVLDCLKNAKTDEERKKCLKDLPKD--LQSDILAK 579
+ NE+ + E + P A + +N+K + + + + + Q+ +AK
Sbjct: 1011 PSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAK 1070

Query: 580 ESLKAYKDCTSQAKTEDEKKECEKLLTPEAKKLLEEEAKESVKAYLDCVSQAKTEDEKKE 639
E+ K T + E ++ T E K+ E +E K E EK +
Sbjct: 1071 EAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKV----------ETEKTQ 1120

Query: 640 CEKLLTPEAKKKLEEAKKSVKAYLDCVSQAKTEDEKKECEKLLTPEAKKLLEQQALDCLK 699
+T + K E+++ T + K+ + T + ++ ++
Sbjct: 1121 EVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVE 1180

Query: 700 NAKTDEERKECLKDLPKDLQKKVLAKESVRVYLDCVSKAKNEAERKECEKLLTPEARKLL 759
T+ + ++ + A V + +K KN R E
Sbjct: 1181 QPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTS 1240

Query: 760 EEAKESVKAYLDCVSRARN 778
+ +V A D S N
Sbjct: 1241 SNDRSTV-ALCDLTSTNTN 1258


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
HPSJM_02670  TACYTOLYSIN     27  0.032    Bacterial thiol-activated pore-forming cytolysin sig...
		>TACYTOLYSIN#Bacterial thiol-activated pore-forming cytolysin signature.

Length = 574

Score = 27.3 bits (60), Expect = 0.032
Identities = 12/43 (27%), Positives = 22/43 (51%), Gaps = 3/43 (6%)

Query: 128 NKSVYQLVEMAIGAYNGG-MKHDPNGAYVKKFRCIYSQVRYNE 169
N+S Y VE Y G + GAYV ++ ++ ++ Y++
Sbjct: 451 NRSEY--VETTSTEYTSGKINLSHQGAYVAQYEILWDEINYDD 491


7   HPSJM_02720   HPSJM_02760   Y   Y   N   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias    %GCBias  Product
HPSJM_02720       3       18  -1.968667  GTP-binding protein Era
HPSJM_02725       3       21  -1.637916  conserved hypothetical secreted protein
HPSJM_02730       6       23  -1.584900  hypothetical protein
HPSJM_02735       6       24  -1.444596  hypothetical protein
HPSJM_02740       4       20  -1.208856  hypothetical protein
HPSJM_02755       2       16  -0.236176  hypothetical protein
HPSJM_02760       2       15  -0.128795  urease-enhancing factor
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
HPSJM_02720  PF03944     33  0.002    delta endotoxin
		>PF03944#delta endotoxin

Length = 633

Score = 32.7 bits (74), Expect = 0.002
Identities = 25/94 (26%), Positives = 47/94 (50%), Gaps = 3/94 (3%)

Query: 68 LHHQEKLLNQCMLSQALKAMSDAELCVFLASVHDDLKGYEEFLSLCQKPHILALSKIDTA 127
L E+ LNQ + + + A +AEL A+V + + + FL+ + L+++
Sbjct: 94 LRETERFLNQRLNTDTV-ARVNAELTGLQANVEEFNRQVDNFLNPNRNAVPLSITSSVNT 152

Query: 128 THKQVLQKLQEYQKYASQFLDLVPLSAKKSQNLN 161
+ L +L ++Q Q L L+PL A+ + NL+
Sbjct: 153 MQQLFLNRLPQFQMQGYQLL-LLPLFAQAA-NLH 184


8   HPSJM_03400   HPSJM_03530   Y   Y   Y   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias    %GCBias  Product
HPSJM_03400       2       12  -1.843217  hypothetical protein
HPSJM_03405       3       18  -3.490821  hypothetical protein
HPSJM_03410       2       22  -4.252256  outer membrane protein (omp14)
HPSJM_03415       0       17  -4.622340  aspartate aminotransferase
HPSJM_03420       2       21  -6.394803  hypothetical protein
HPSJM_03425       0       17  -4.820040  hypothetical protein
HPSJM_03430      -1       13  -2.962508  hypothetical protein
HPSJM_03435      -1       10  -1.374837  hypothetical protein
HPSJM_03440      -1       10  -0.149809  integrase-recombinase protein
HPSJM_03445       0       11   0.400975  methylated-DNA--protein-cysteine
HPSJM_03450       0       10   1.061354  hypothetical protein
HPSJM_03455      -2       10   1.327355  putative lipopolysaccharide biosynthesis
HPSJM_03460       1       10   1.237675  ribonucleotide-diphosphate reductase subunit
HPSJM_03465       2       10   0.919180  hypothetical protein
HPSJM_03470       1       10   0.668692  hypothetical protein
HPSJM_03475       0        9   1.341795  bifunctional N-acetylglucosamine-1-phosphate
HPSJM_03480       0       10   1.442101  flagellar biosynthesis protein FliP
HPSJM_03485       1       11   2.101154  Iron(III) dicitrate transport protein FecA;
HPSJM_03490      -1       12   2.714687  iron(II) transport protein (feoB)
HPSJM_03495       0       12   3.214089  hypothetical protein
HPSJM_03500       1       12   4.483332  acetyl-CoA acetyltransferase
HPSJM_03505       2       13   4.021818  succinyl-CoA-transferase subunit A
HPSJM_03510       3       13   3.900447  succinyl-CoA-transferase subunit B
HPSJM_03515       1       13   2.820194  short-chain fatty acids transporter
HPSJM_03520       2       13   2.790659  putative Outer membrane protein
HPSJM_03525       3       12   2.757076  hydantoin utilization protein A
HPSJM_03530       3       11   1.833237  N-methyl hydantoinase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPSJM_03480  FLGBIOSNFLIP    280  3e-97    Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature.

Length = 245

Score = 280 bits (719), Expect = 3e-97
Identities = 115/246 (46%), Positives = 164/246 (66%), Gaps = 3/246 (1%)

Query: 53 ILRFFIFLILICPLICPLMSADSALPSVNLSLNAPSDPKQLVTTLNVIALLTLLVLAPSL 112
+ R ++ LI PL A + LP + S P + + + +T L P++
Sbjct: 1 MRRLLSVAPVLLWLITPL--AFAQLPGIT-SQPLPGGGQSWSLPVQTLVFITSLTFIPAI 57

Query: 113 ILVMTSFTRLIVVFSFLRTALGTQQTPPTQILVSLSLILTFFIMEPSLKKAYDTGIKPYM 172
+L+MTSFTR+I+VF LR ALGT PP Q+L+ L+L LTFFIM P + K Y +P+
Sbjct: 58 LLMMTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFS 117

Query: 173 DKKISYTEAFEKSALPFKEFMLKNTREKDLALFFRIRNLPNPKTPDEVSLSVLIPAFMIS 232
++KIS EA EK A P +EFML+ TRE DL LF R+ N + P+ V + +L+PA++ S
Sbjct: 118 EEKISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTS 177

Query: 233 ELKTAFQIGFLLYLPFLVIDMVISSILMAMGMMMLPPVMISLPFKILVFILVDGFNLLTE 292
ELKTAFQIGF +++PFL+ID+VI+S+LMA+GMMM+PP I+LPFK+++F+LVDG+ LL
Sbjct: 178 ELKTAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVG 237

Query: 293 NLVASF 298
+L SF
Sbjct: 238 SLAQSF 243


9   HPSJM_03610   HPSJM_03685   Y   Y   N   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias    %GCBias  Product
HPSJM_03610       4       13  -0.834749  RNA polymerase factor sigma-54
HPSJM_03615       1       12   0.173531  ABC-type transport system, ATP binding protein
HPSJM_03620       0       11   0.079463  hypothetical protein
HPSJM_03625       0       10   1.125475  DNA polymerase III subunits gamma and tau
HPSJM_03630       1        9   2.519172  hypothetical protein
HPSJM_03635       2       14   2.661996  hypothetical protein
HPSJM_03640       2       13   2.699574  hypothetical protein
HPSJM_03645       1       12   2.549831  outer membrane protein SabB/HopO
HPSJM_03650       0       11   2.216802  L-asparaginase II
HPSJM_03655      -1       11   0.544641  anaerobic C4-dicarboxylate transporter
HPSJM_03660      -2       13  -0.655867  outer membrane protein SabA
HPSJM_03665       0       12  -1.891530  outer membrane protein
HPSJM_03670       1       14  -2.884530  putative transcriptional regulator
HPSJM_03675       1       15  -3.685836  tRNA(Ile)-lysidine synthase
HPSJM_03680       2       18  -4.088088  hypothetical protein
HPSJM_03685       2       11  -1.808876  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits  Score  E-value  Comments
HPSJM_03635  SECA     28  0.014    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 27.9 bits (62), Expect = 0.014
Identities = 12/43 (27%), Positives = 23/43 (53%), Gaps = 2/43 (4%)

Query: 71 RIARKNLSKMSEEDFKKMREEVRK--ELEEKTKGLSDEEIKAK 111
++ K ++ ++MR+ V +E + + LSDEE+K K
Sbjct: 4 KLLTKVFGSRNDRTLRRMRKVVNIINAMEPEMEKLSDEELKGK 46


10  HPSJM_04990   HPSJM_05045   Y   Y   N   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias    %GCBias  Product
HPSJM_04990       2       16   0.558310  putative pedidyl-prolyl cis-trans ismerase
HPSJM_04995       4       19   1.528262  cell division protein FtsA
HPSJM_05000       4       20   0.727247  cell division protein FtsZ
HPSJM_05005       5       23  -1.512322  hypothetical protein
HPSJM_05010       3       21  -2.337510  hypothetical protein
HPSJM_05015       2       21  -3.762001  hypothetical protein
HPSJM_05020       1       14  -4.416982  hypothetical protein
HPSJM_05025       0       14  -4.547323  mechanosensitive channel MscS
HPSJM_05030       1       17  -5.658434  hypothetical protein
HPSJM_05035       1       17  -4.815673  hypothetical protein
HPSJM_05040       1       17  -4.400299  hypothetical protein
HPSJM_05045       1       15  -3.866239  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPSJM_04995  SHAPEPROTEIN     41  1e-05    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 40.5 bits (95), Expect = 1e-05
Identities = 38/176 (21%), Positives = 66/176 (37%), Gaps = 12/176 (6%)

Query: 211 AASIATLSNDERELGVACVDMGGETCNLTIYSGNSIRYNKYLPVGSHHLTTDL------S 264
AA+I G VD+GG T + + S N + Y+ + +G + +
Sbjct: 146 AAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNGVVYSSSVRIGGDRFDEAIINYVRRN 205

Query: 265 HMLNTPFPYAEEVKIKYGDLSFESGAETPSQSVQIPTTGSDGHESHIVPLSEIQTIMRER 324
+ AE +K + G S G E V+ + +EI ++E
Sbjct: 206 YGSLIGEATAERIKHEIG--SAYPGDEVREIEVRGRNLAEGVPRGFTLNSNEILEALQEP 263

Query: 325 ALETFKIIHRSIQDSGLE---EHLGGGVVLTGGMALMKGIKELARTHFTNYPVRLA 377
+ +++ E + G+VLTGG AL++ + L T PV +A
Sbjct: 264 LTGIVSAVMVALEQCPPELASDISERGMVLTGGGALLRNLDRLLM-EETGIPVVVA 318


11  HPSJM_05450   HPSJM_05590   Y   Y   N   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias    %GCBias  Product
HPSJM_05450       3       10  -1.647451  NADP-dependent alcohol dehydrogenase
HPSJM_05455       2       11  -2.075493  putative lipopolysaccharide biosynthesis
HPSJM_05460       2       12  -1.081504  hypothetical protein
HPSJM_05465       0       11   0.864823  putative lipopolysaccharide biosynthesis
HPSJM_05470       1       11   2.473767  hypothetical protein
HPSJM_05475       0       13   3.074464  outer membrane protein (omp23)
HPSJM_05480       0       11   2.696333  pyruvate flavodoxin oxidoreductase subunit
HPSJM_05485      -1       11   2.250544  pyruvate flavodoxin oxidoreductase subunit
HPSJM_05490      -1        9   1.752184  pyruvate flavodoxin oxidoreductase subunit
HPSJM_05495      -2       11   0.332131  pyruvate ferredoxin oxidoreductase, beta
HPSJM_05500       1       15  -0.574543  adenylosuccinate lyase
HPSJM_05505       2       17  -1.206228  putative outer membrane protein; putative signal
HPSJM_05510       3       18  -1.368011  excinuclease ABC subunit B
HPSJM_05515       2       19  -0.578667  hypothetical protein
HPSJM_05520       1       14  -0.381311  hypothetical protein
HPSJM_05525       1       13  -0.167627  hypothetical protein
HPSJM_05530      -1       13  -0.423576  cysteine-rich protein X
HPSJM_05535       0       13  -0.102376  hypothetical protein
HPSJM_05540       0       13  -0.199688  gamma-glutamyltranspeptidase (ggt)
HPSJM_05545       0       12  -1.288304  flagellar hook-associated protein FlgK
HPSJM_05550       2       15  -1.614994  hypothetical protein
HPSJM_05555       3       18  -1.207119  type II DNA modification enzyme
HPSJM_05560       4       14  -0.755217  hypothetical protein
HPSJM_05565       4       12  -1.618323  hypothetical protein
HPSJM_05570       5       12  -1.399514  FKBP-type peptidyl-prolyl cis-trans isomerase
HPSJM_05575       4       13  -2.070647  hypothetical protein
HPSJM_05580       4       14  -1.761619  peptidoglycan-associated lipoprotein precursor
HPSJM_05585       1       14   0.379682  translocation protein TolB
HPSJM_05590       2       18   0.472006  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPSJM_05480  YERSSTKINASE     29  0.016    Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 28.6 bits (63), Expect = 0.016
Identities = 18/63 (28%), Positives = 32/63 (50%), Gaps = 9/63 (14%)

Query: 50 YNRVDDEPILNHERFMQPDYVLVIDPGLVFIENIFANEKEDTTYIITSYLSKEELFEKKP 109
++R ++P E F P+ + + N+ A+EK D ++++ L E FEK P
Sbjct: 293 HSRSGEQPKGFTESFKAPE---------LGVGNLGASEKSDVFLVVSTLLHCIEGFEKNP 343

Query: 110 ELK 112
E+K
Sbjct: 344 EIK 346


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
HPSJM_05515  IGASERPTASE     28  0.032    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 28.5 bits (63), Expect = 0.032
Identities = 25/117 (21%), Positives = 40/117 (34%), Gaps = 5/117 (4%)

Query: 114 TATLNANTENIKSEVKKLENQIIETTTKLLTSYQIFLNQARDNANNQITENKTQSLEAIK 173
+ T EN K E K +E + T + ++ + N T QS K
Sbjct: 1034 SETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETK 1093

Query: 174 EAKTNANNEINEKQTQAITNINEAKTTANHEINTSKTQSLEALKQEKNQATSEINEA 230
E +T K+T + +AK K S + KQE+++ E
Sbjct: 1094 ETQTTE-----TKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEP 1145


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
HPSJM_05545  FLGHOOKAP1    561  0.0      Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 561 bits (1447), Expect = 0.0
Identities = 129/610 (21%), Positives = 229/610 (37%), Gaps = 75/610 (12%)

Query: 6 SSLNTSYTGLQAHQSMVDVTGNNISNASDEFYSRQRVIAKPQAAYMYGTKNVNMGVDVEA 65
S +N + +GL A Q+ ++ NNIS+ + Y+RQ I + + V GV V
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSG 61

Query: 66 IERVHDEFVFARYTKANYENTYYDTEFSHLKEASAYFPDIDEASLFTDLQDYFNSWKELS 125
++R +D F+ + A +++ + + + +SL T +QD+F S + L
Sbjct: 62 VQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNML-STSTSSLATQMQDFFTSLQTLV 120

Query: 126 KNAKDSAQKQALAQKTEALTHNIKDTRERLTTLQHKASEELKSVIKEVNSLGSQIAQINK 185
NA+D A +QAL K+E L + K T + L + + + + + ++N+ QIA +N
Sbjct: 121 SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND 180

Query: 186 RIKEVENNKSLKHANELRDKRDELEFHLRELLGGNVFKSSIKTHSLTDKDSADFDESYNL 245
+I + + N L D+RD+L L +++G V S +YN+
Sbjct: 181 QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEV--------------SVQDGGTYNI 226

Query: 246 NIGHGFNIIDGSIFHPLVIKESENKGGLNQIYFQSDDFKLTNITDK-LNQGKVGALLNVY 304
+ +G++++ GS L S + + I +K LN G +G +L
Sbjct: 227 TMANGYSLVQGSTARQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFR 286

Query: 305 NDGSNGTLKGKLQDYIDLLDSFARGLIESTNAIYAQSANHHIEGEPVEFNSDEAFKDTNY 364
+ L + L A E+ N + +A D N
Sbjct: 287 SQ--------DLDQTRNTLGQLALAFAEAFNTQH------------------KAGFDANG 320

Query: 365 NIKNGSFDL----IAYNTDGKEIARKTIAITPITTMNDIIQAINANTDDNQ-----DNNT 415
+ F + + NT K +T + + I+ + + Q N T
Sbjct: 321 DAGEDFFAIGKPAVLQNTKNKGDVAIGATVTDASAVLATDYKISFDNNQWQVTRLASNTT 380

Query: 416 ENDFDDYFTASFNNETKKFVIQPKNASQGLFVSMKDDGTNFMGALKLNPFFQGDDASNIS 475
D + + + + D M L D + I+
Sbjct: 381 FTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAIVNMDVLI-------TDEAKIA 433

Query: 476 LNKEYKKEPTTIRPWLAPINGNFDVANMMQQLQYDSVDFYNDKFDIKPMKISEFYQFLTG 535
+ E E G+ D N L S N K ++ Y L
Sbjct: 434 MASE---EDA----------GDSDNRNGQALLDLQS----NSKTVGGAKSFNDAYASLVS 476

Query: 536 KINTDAEKSGRILDTKKSMLETIKKEQLSISQVSVDEEMLNLIKFQSGYAANAKVISAID 595
I T+ +++ + +Q SIS V++DEE NL +FQ Y ANA+V+ +
Sbjct: 477 DIGNKTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTAN 536

Query: 596 RMIDTLLGIK 605
+ D L+ I+
Sbjct: 537 AIFDALINIR 546


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
HPSJM_05580  OMPADOMAIN    147  7e-46    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 147 bits (373), Expect = 7e-46
Identities = 48/169 (28%), Positives = 75/169 (44%), Gaps = 24/169 (14%)

Query: 22 KMDNKTVAGDVSAKTVQTAPV-TTEPAPEKEEPKQEPAPVVEEKPAVESGTIIASIYFDF 80
+ DN ++ VS + Q PAP PAP V+ K T+ + + F+F
Sbjct: 177 RPDNGMLSLGVSYRFGQGEAAPVVAPAPA-------PAPEVQTK----HFTLKSDVLFNF 225

Query: 81 DKYEIKESDQETLDEIVQKAKE---NHMQVLLEGNTDEFGSSEYNQALGVKRTLSVKNAL 137
+K +K Q LD++ + V++ G TD GS YNQ L +R SV + L
Sbjct: 226 NKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQSVVDYL 285

Query: 138 VIKGVEKDMIKTISFGETKPKCTQ-----KTR----ECYKENRRVDVKL 177
+ KG+ D I GE+ P K R +C +RRV++++
Sbjct: 286 ISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEV 334


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
HPSJM_05590  TYPE4SSCAGA     32  0.003    Type IV secretion system CagA exotoxin signature.
		>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature.

Length = 1147

Score = 31.6 bits (71), Expect = 0.003
Identities = 34/139 (24%), Positives = 65/139 (46%), Gaps = 11/139 (7%)

Query: 32 KEAEKILLDLGKKNEQVID--LNLEDLPSEKKNE-KIEKVTEKQGDF---LEPKEEPKEE 85
+EA K++ D N++++ LN ++ KN ++V + Q D L +E ++E
Sbjct: 568 QEANKLIKDFLSSNKELVGKTLNFNKAVADAKNTGNYDEVKKAQKDLEKSLRKREHLEKE 627

Query: 86 PEESLEDIFSSLNDFQEKTDTNAQKDE----QKNEQEEEQRRLKEQQRLRKNQKN-QEML 140
E+ LE + N + K N+QKDE E + R + Q L+ ++ + L
Sbjct: 628 VEKKLESKSGNKNKMEAKAQANSQKDEIFALINKEANRDARAIAYAQNLKGIKRELSDKL 687

Query: 141 KGLQQNLDQFAQKLESVKS 159
+ + +NL F + + K+
Sbjct: 688 ENVNKNLKDFDKSFDEFKN 706


12  HPSJM_05635   HPSJM_05695   Y   Y   N   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias    %GCBias  Product
HPSJM_05635      -1       18  -3.489565  F0F1 ATP synthase subunit B'
HPSJM_05640       0       17  -3.053896  plasmid replication-partition related protein
HPSJM_05645       1       18  -3.285810  chromosome partitioning protein
HPSJM_05650       0       19  -3.897195  biotin--protein ligase
HPSJM_05655       1       21  -4.213370  methionyl-tRNA formyltransferase
HPSJM_05660       1       22  -5.162113  ATPase
HPSJM_05665       3       22   0.341773  hypothetical protein
HPSJM_05670       3       20  -0.128172  hypothetical protein
HPSJM_05675       2       16  -0.714700  hypothetical protein
HPSJM_05680       3       17   0.177909  hypothetical protein
HPSJM_05695       2       16   0.447677  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
HPSJM_05645  PF07675     31  0.004    Cleaved Adhesin
		>PF07675#Cleaved Adhesin

Length = 1358

Score = 31.2 bits (70), Expect = 0.004
Identities = 30/105 (28%), Positives = 40/105 (38%), Gaps = 7/105 (6%)

Query: 70 QISQVILKTQMPFLDLVPSNLGLAGFEKTFYDSQDENKRGELMLKNALESVV---GLYDY 126
VI T F SNL A FE + D + ++ VV G+YDY
Sbjct: 414 TFGSVIPATGPLFTGTASSNLYSANFEYLTPANADPVVTTQNIIVTGQGEVVIPGGVYDY 473

Query: 127 IIIDSPPALGPLTINSLSAAHSVIIPIQCEFFALEGTKLLLNTIR 171
I + PA G + I A P + + FA E K T+R
Sbjct: 474 CITNPEPASGKMWI----AGDGGNQPARYDDFAFEAGKKYTFTMR 514


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPSJM_05655  FERRIBNDNGPP     32  0.002    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 32.2 bits (73), Expect = 0.002
Identities = 12/33 (36%), Positives = 20/33 (60%)

Query: 70 EPEVQILKDLKPDFIVVVAYGKILPKEVLTIAP 102
EP +++L ++KP F+V A P+ + IAP
Sbjct: 86 EPNLELLTEMKPSFMVWSAGYGPSPEMLARIAP 118


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
HPSJM_05665  RTXTOXIND     43  2e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 42.9 bits (101), Expect = 2e-06
Identities = 24/170 (14%), Positives = 60/170 (35%), Gaps = 18/170 (10%)

Query: 51 RAQYHTHLKMLEQKEEALKERAKEQQAQFDDAVKQASVLALQDERAKIIEEARKNAFLEQ 110
+ Q+ T QKE L ++ E+ + ++ ++ R + +
Sbjct: 192 KEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAK 251

Query: 111 QKGLELLQKELDEKSKQVQELHQKEAEIERLKRENNEAESRLKAENEKKLNEKLDLEREK 170
LE + + E EL ++++E+++ E A+ + + + E
Sbjct: 252 HAVLEQ-ENKYVEAV---NELRVYKSQLEQIESEILSAKEEYQLVTQ-------LFKNEI 300

Query: 171 IEKALHEKNELKFKQQEEQLEMLRNELKNAQRKAELSSQQFQGEVQELAI 220
++K + +L + + +A +S +VQ+L +
Sbjct: 301 LDK--LRQTTDNIGLLTLELAKNEERQQASVIRAPVS-----VKVQQLKV 343


13  HPSJM_06140   HPSJM_06275   Y   Y   N   Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias    %GCBias  Product
HPSJM_06140       2       11  -0.140789  hypothetical protein
HPSJM_06145       1       11  -0.615701  DNA polymerase III subunit delta'
HPSJM_06150       1       12   0.570412  dihydropteroate synthase
HPSJM_06155       0       12   1.674507  hypothetical protein
HPSJM_06160       0       10   1.391061  hypothetical protein
HPSJM_06165      -2       10   1.588241  hypothetical protein
HPSJM_06170      -2       10   2.789723  hypothetical protein
HPSJM_06175      -2        9   3.127197  carbamoyl phosphate synthase small subunit
HPSJM_06180      -2       10   2.589337  formamidase
HPSJM_06185       0       12   3.172889  hypothetical protein
HPSJM_06190       0       12   3.397379  Maf-like protein
HPSJM_06195       0       13   3.441779  alanyl-tRNA synthetase
HPSJM_06200       3       18   3.315014  hypothetical protein
HPSJM_06205       2       18   3.242467  hypothetical protein
HPSJM_06210       0       15   2.224841  outer membrane protein HopU
HPSJM_06215       1       14  -1.234452  30S ribosomal protein S18
HPSJM_06220       2       13  -1.088177  single-stranded DNA-binding protein
HPSJM_06225       2       12  -1.383952  30S ribosomal protein S6
HPSJM_06230       3       13  -1.043510  hypothetical protein
HPSJM_06235       2       11  -0.517766  DNA polymerase III subunit delta
HPSJM_06240       2        9   0.106620  ribonuclease R
HPSJM_06245      -1       11   0.211968  shikimate 5-dehydrogenase
HPSJM_06250       0       10   0.516476  hypothetical protein
HPSJM_06255      -1       10   0.485283  oligopeptide ABC transporter, permease protein
HPSJM_06260      -1       11   0.727553  hypothetical protein
HPSJM_06265       1       12   0.602234  tryptophanyl-tRNA synthetase
HPSJM_06270       1       13   0.929687  Biotin biosynthesis protein BioC
HPSJM_06275       2       15   1.657301  preprotein translocase subunit SecG
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
HPSJM_06200  PF05844  25     0.035    YopD protein
		>PF05844#YopD protein

Length = 295

Score = 25.0 bits (54), Expect = 0.035
Identities = 13/65 (20%), Positives = 28/65 (43%), Gaps = 1/65 (1%)

Query: 10 SVLKANNPHFDKIFEKHNQLDDDIKTAEQQNASDAEVSHMKKQKLKLKDEIHSMIIEYRE 69
L+A F+ + I++ Q + +V + Q ++E+++ I + +
Sbjct: 197 VALRAAGRAFESRNGALQVANTVIQSFVQMANASVQVRQGESQASAREEEVNATIGQ-SQ 255

Query: 70 KQKSE 74
KQK E
Sbjct: 256 KQKVE 260


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
HPSJM_06275  SECGEXPORT  49     4e-10    Protein-export SecG membrane protein signature.
		>SECGEXPORT#Protein-export SecG membrane protein signature.

Length = 110

Score = 48.8 bits (116), Expect = 4e-10
Identities = 25/84 (29%), Positives = 47/84 (55%), Gaps = 3/84 (3%)

Query: 1 MTSALLGLQIVLAVLIVVVVLLQ--KSSSIGLGAYSGSNDSLFGAKGPASFMAKLTMFLG 58
M ALL + +++A+ +V +++LQ K + +G +G++ +LFG+ G +FM ++T L
Sbjct: 1 MYEALLVVFLIVAIGLVGLIMLQQGKGADMGASFGAGASATLFGSSGSGNFMTRMTALLA 60

Query: 59 LLFVINTIALGYFYNKEYGKSILD 82
LF I ++ LG N +
Sbjct: 61 TLFFIISLVLGNI-NSNKTNKGSE 83


14  HPSJM_06840  HPSJM_07050  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
HPSJM_06840  0       13       -3.597296  hypothetical protein
HPSJM_06845  0       14       -5.393514  putative histidine kinase sensor protein
HPSJM_06850  -1      14       -6.353200  response regulator
HPSJM_06855  2       20       -6.301969  type IIS restriction enzyme R protein (MBOIIR)
HPSJM_06860  4       24       -6.546172  hypothetical protein
HPSJM_06865  4       20       -4.860727  integrase/recombinase XercD family protein
HPSJM_06870  3       18       -2.563900  integrase/recombinase XercD family protein
HPSJM_06875  3       21       -4.428807  hypothetical protein
HPSJM_06880  3       22       -4.702001  hypothetical protein
HPSJM_06885  3       17       -3.678074  hypothetical protein
HPSJM_06890  3       17       -3.715184  hypothetical protein
HPSJM_06895  3       16       -4.163597  periplasmic competence protein-like protein
HPSJM_06900  4       17       -4.784603  hypothetical protein
HPSJM_06905  5       17       -3.994479  hypothetical protein
HPSJM_06910  6       18       -3.951770  adenine specific DNA methyltransferase
HPSJM_06915  11      29       -6.587098  hypothetical protein
HPSJM_06920  11      29       -6.064528  topoisomerase I
HPSJM_06925  11      29       -6.262001  hypothetical protein
HPSJM_06930  10      29       -6.697020  hypothetical protein
HPSJM_06935  12      26       -5.952786  hypothetical protein
HPSJM_06940  10      26       -5.888269  hypothetical protein
HPSJM_06945  8       27       -5.138059  hypothetical protein
HPSJM_06950  8       27       -5.276640  hypothetical protein
HPSJM_06955  7       26       -5.567556  hypothetical protein
HPSJM_06960  7       26       -5.630688  hypothetical protein
HPSJM_06965  6       28       -6.518527  hypothetical protein
HPSJM_06970  6       29       -6.353828  hypothetical protein
HPSJM_06975  6       27       -7.351973  hypothetical protein
HPSJM_06980  8       28       -8.102365  hypothetical protein
HPSJM_06985  11      24       -7.484619  hypothetical protein
HPSJM_06990  9       22       -7.088199  hypothetical protein
HPSJM_06995  9       23       -6.913725  hypothetical protein
HPSJM_07000  9       21       -7.073821  possible cell division protein
HPSJM_07005  7       19       -6.440775  integrase/recombinase (xerD)
HPSJM_07010  3       16       -5.976286  relaxase
HPSJM_07015  -1      13       -5.460251  VirD4 coupling protein
HPSJM_07020  -1      11       -5.141669  hypothetical protein
HPSJM_07025  -2      11       -3.245409  hypothetical protein
HPSJM_07030  -2      10       -2.319934  type IIS restriction enzyme M2 protein (mod)
HPSJM_07035  -2      10       -1.837997  adenine-specific DNA methylase
HPSJM_07040  -2      13       -0.341217  type III restriction enzyme R protein
HPSJM_07045  -1      13       2.346172   rod shape-determining protein MreC
HPSJM_07050  0       15       3.152575   rod shape-determining protein MreB
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
HPSJM_06850  HTHFIS  90     3e-23    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 90.3 bits (224), Expect = 3e-23
Identities = 37/118 (31%), Positives = 56/118 (47%), Gaps = 2/118 (1%)

Query: 1 MQK-KIFLLEDDYLLSESIKEFLEHLGYEVFCAFNGKEAYERLSVERFNLLLLDVQVPEM 59
M I + +DD + + + L GY+V N + ++ +L++ DV +P+
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 60 NSLELFKRIKNDFLISTPVIFITALQDNATLKNAFNLGASDYLKKPFDLDELEARIKR 117
N+ +L RIK PV+ ++A T A GA DYL KPFDL EL I R
Sbjct: 61 NAFDLLPRIKKA-RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPSJM_06895  VACCYTOTOXIN  37     3e-04    Helicobacter pylori vacuolating cytotoxin signature.
		>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature.

Length = 1291

Score = 37.3 bits (86), Expect = 3e-04
Identities = 48/193 (24%), Positives = 69/193 (35%), Gaps = 30/193 (15%)

Query: 139 NTAQTKAANDPMYASTPFSNGSDSSAYDNNPNSPSNNAINGKDGANGGNGYGINGNDEIN 198
N+AQ PF+ G ++ N N+ ++ I G ++
Sbjct: 368 NSAQKTEIQPTQVIDGPFAGGKNTVVNINRINTNADGTIR----VGGFKASLTTNAAHLH 423

Query: 199 GSSGSNGNNSNNNAIGSGIDTDGVLG---VDGVNGSNSSSGGS-VGGYENNFT-NHGSTN 253
G G N +N A G + + + G VDG N+ GG + G NF G+
Sbjct: 424 --IGKGGINLSNQASGRSLLVENLTGNITVDGPLRVNNQVGGYALAGSSANFEFKAGTDT 481

Query: 254 NNTGGYDNFNNGSSSGGSL----------------GNGGLFPIPFGNGDTNNSNNPANTT 297
N G FNN S G + GNGG + F +G TN N T
Sbjct: 482 KN--GTATFNNDISLGRFVNLKVDAHTANFKGIDTGNGGFNTLDF-SGVTNKVNINKLIT 538

Query: 298 SPTNGSSSNNATN 310
+ TN + N N
Sbjct: 539 ASTNVAVKNFNIN 551


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
HPSJM_06960  IGASERPTASE  35     0.001    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 35.4 bits (81), Expect = 0.001
Identities = 33/193 (17%), Positives = 72/193 (37%), Gaps = 17/193 (8%)

Query: 173 SAKLEQQELERKEAELESKALQDYEENQIQRAEEISLTEHENELAKQELE-----KYAIE 227
++K E + +E+ E + Q+ E + ++ + T+ NE+A+ E +
Sbjct: 1043 NSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQ-TNEVAQSGSETKETQTTETK 1101

Query: 228 SENKALEQNMMKAKTPDTLYEQQVAQEQNKEEERSDKQDLELEALRDNNTKQGLDDENLE 287
++ K +T T +V + + ++E+S+ + E R+N+ + +
Sbjct: 1102 ETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEP--- 1158

Query: 288 LLESLQVIRDETSKVTNFLRANETIELLRPTGENTTSKIASLALEENNKLNEKETNTQEQ 347
+S +T + +N + T NT + + N N TQ
Sbjct: 1159 --QSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSV------VENPENTTPATTQPT 1210

Query: 348 NTEKEQEKPKIGN 360
+ KPK +
Sbjct: 1211 VNSESSNKPKNRH 1223


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPSJM_07050  SHAPEPROTEIN  474    e-171    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein

signature.
Length = 347

Score = 474 bits (1221), Expect = e-171
Identities = 179/347 (51%), Positives = 248/347 (71%), Gaps = 2/347 (0%)

Query: 2 IFSKLIGLFSHDIAIDLGTANTIVLVKGQGIIINEPSIVAVRMGLFDSKAYDILAVGSEA 61
+ K G+FS+D++IDLGTANT++ VKGQGI++NEPS+VA+R S + AVG +A
Sbjct: 1 MLKKFRGMFSNDLSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGSPKS-VAAVGHDA 59

Query: 62 KEMLGKTPNSIRAIRPMKDGVIADYDITAKMIRYFIEKAHKRKTW-IRPRIMVCVPYGLT 120
K+MLG+TP +I AIRPMKDGVIAD+ +T KM+++FI++ H PR++VCVP G T
Sbjct: 60 KQMLGRTPGNIAAIRPMKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGAT 119

Query: 121 SVERNAVKESALSAGAREVFLIEEPMAAAIGAGLPVKEPQGSLIVDIGGGTTEIGVISLG 180
VER A++ESA AGAREVFLIEEPMAAAIGAGLPV E GS++VDIGGGTTE+ VISL
Sbjct: 120 QVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLN 179

Query: 181 GLVISKSIRVAGDKLDQSIVEYIRKKFNLLIGERTGEEIKIEIGCAIKLDPPLTMEVSGR 240
G+V S S+R+ GD+ D++I+ Y+R+ + LIGE T E IK EIG A D +EV GR
Sbjct: 180 GVVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGR 239

Query: 241 DQVSGLLHTIELSSDDVFEAIKDQVREISSALRSVLEEVKPDLAKDIVQNGVVLTGGGAL 300
+ G+ L+S+++ EA+++ + I SA+ LE+ P+LA DI + G+VLTGGGAL
Sbjct: 240 NLAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGAL 299

Query: 301 IKGLDKYLSDMVKLPVYVGDEPLLAVAKGTGEAIQDLDLLSRVGFSE 347
++ LD+ L + +PV V ++PL VA+G G+A++ +D+ FSE
Sbjct: 300 LRNLDRLLMEETGIPVVVAEDPLTCVARGGGKALEMIDMHGGDLFSE 346
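
The Expect values in these reports follow the BLAST convention, so very small values drop the leading 1 (`e-171` stands for 1e-171). The following is a minimal sketch, assuming one wants to read the `Score = ..., Expect = ...` lines programmatically and compare them with the RPSBlast E-value threshold of 0.05 listed in the input parameters. The function names and the `<=` comparison are illustrative assumptions and are not part of PredictBias.

```python
import re

# Matches a BLAST-style statistics line such as
# "Score = 474 bits (1221), Expect = e-171"
STATS_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)


def parse_expect(text):
    """Convert an Expect field to float, handling the 'e-171' shorthand."""
    if text.startswith("e"):
        text = "1" + text  # 'e-171' is shorthand for '1e-171'
    return float(text)


def parse_stats(line):
    """Return (bit_score, raw_score, e_value), or None if the line does not match."""
    m = STATS_RE.search(line)
    if m is None:
        return None
    return float(m.group(1)), int(m.group(2)), parse_expect(m.group(3))


def passes_threshold(line, max_evalue=0.05):
    """Assumed rule: keep a hit when its E-value is at or below the cutoff."""
    stats = parse_stats(line)
    return stats is not None and stats[2] <= max_evalue


if __name__ == "__main__":
    for line in (
        "Score = 474 bits (1221), Expect = e-171",
        "Score = 25.0 bits (54), Expect = 0.035",
    ):
        print(parse_stats(line), passes_threshold(line))
```

Whether the tool applies the cutoff inclusively is not stated in the report, so the comparison operator may need adjusting.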


15  HPSJM_07100  HPSJM_07130  Y  N  N  Genomic Island
LocusTag     DNBias  CDNBias  %GCBias    Product
HPSJM_07100  2       13       -3.501216  putative type III restriction enzyme M protein
HPSJM_07105  0       12       -4.873817  putative type III restriction enzyme R protein
HPSJM_07110  0       17       -4.562356  biotin synthase
HPSJM_07115  2       21       -6.404939  putative ribonuclease N
HPSJM_07120  4       27       -7.805211  hypothetical protein
HPSJM_07125  3       23       -6.515969  hypothetical protein
HPSJM_07130  2       21       -5.455799  hypothetical protein
16  HPSJM_07425  HPSJM_07560  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
HPSJM_07425  2       20       -5.466094  cytochrome c551 peroxidase
HPSJM_07430  3       21       -6.364000  hypothetical protein
HPSJM_07435  0       19       -4.286692  DNA (cytosine-5-)-methyltransferase
HPSJM_07440  0       16       -2.086587  DNA-cytosine methyltransferase
HPSJM_07445  -2      13       -0.096029  addiction module antidote protein
HPSJM_07450  -1      13       0.225533   putative secreted motility protein
HPSJM_07455  0       12       0.744369   hypothetical protein
HPSJM_07460  1       12       1.874998   ABC transport system substrate binding protein
HPSJM_07465  2       13       2.486615   ABC transporter ATP-binding protein
HPSJM_07470  3       14       0.899762   putative ABC transport system permease protein
HPSJM_07475  3       12       0.103313   hypothetical protein
HPSJM_07480  2       10       0.119461   putative outer membrane protein; putative signal
HPSJM_07485  2       10       0.139446   branched-chain amino acid aminotransferase
HPSJM_07490  2       10       -0.617250  outer membrane protein
HPSJM_07495  2       11       -0.849459  DNA polymerase I
HPSJM_07500  1       16       -0.091202  type II restriction enzyme
HPSJM_07505  3       19       0.520006   restriction enzyme BcgI alpha chain-like
HPSJM_07510  4       22       1.359951   hypothetical protein
HPSJM_07515  3       14       0.590102   thymidylate kinase
HPSJM_07520  3       13       0.304368   phosphopantetheine adenylyltransferase
HPSJM_07525  3       13       0.502500   3-octaprenyl-4-hydroxybenzoate carboxy-lyase
HPSJM_07530  3       13       0.207075   hypothetical protein
HPSJM_07535  3       13       0.232366   flagellar basal body P-ring biosynthesis protein
HPSJM_07540  2       12       0.278830   putative ATP-dependent DNA helicase
HPSJM_07545  1       14       0.437833   hypothetical protein
HPSJM_07550  0       14       0.842227   seryl-tRNA synthetase
HPSJM_07555  -1      14       -0.035729  hypothetical protein
HPSJM_07560  2       11       -0.145793  exodeoxyribonuclease VII small subunit
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
HPSJM_07460  FLAGELLIN  30     0.007    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 30.4 bits (68), Expect = 0.007
Identities = 13/34 (38%), Positives = 23/34 (67%)

Query: 177 LASVDDLIANLDSRKTQFDSLINNANNLVSNVNN 210
L+ VD + ++L + + +FDS I N N V+N+N+
Sbjct: 428 LSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNS 461


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPSJM_07520  LPSBIOSNTHSS  223    5e-78    Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein

signature.
Length = 166

Score = 223 bits (569), Expect = 5e-78
Identities = 62/147 (42%), Positives = 94/147 (63%)

Query: 4 IGIYPGTFDPVTNGHIDIIHRSSELFEKLIVAVAHSSAKNPMFSLKERLKMMQLATKSFT 63
IYPG+FDP+T GH+DII R LF+++ VAV + K PMFS++ERL+ + A
Sbjct: 2 NAIYPGSFDPITFGHLDIIERGCRLFDQVYVAVLRNPNKQPMFSVQERLEQIAKAIAHLP 61

Query: 64 NVECVAFEGLLADLAKEYHCKVLVRGLRVVSDFEYELQMGYANKSLNHELETLYFMPTLQ 123
N + +FEGL + A++ ++RGLRV+SDFE ELQM NK+L +LET++ + +
Sbjct: 62 NAQVDSFEGLTVNYARQRQAGAILRGLRVLSDFELELQMANTNKTLASDLETVFLTTSTE 121

Query: 124 NAFISSSIVRSIIAHKGDASHLVPKEI 150
+F+SSS+V+ + G+ H VP +
Sbjct: 122 YSFLSSSLVKEVARFGGNVEHFVPSHV 148


17  HPSJM_07690  HPSJM_07745  Y  N  N  Genomic Island
LocusTag     DNBias  CDNBias  %GCBias    Product
HPSJM_07690  3       14       2.400131   saccharopine dehydrogenase
HPSJM_07695  1       12       1.493474   ferrodoxin-like protein
HPSJM_07700  -1      11       1.237608   putative glycerol-3-phosphate acyltransferase
HPSJM_07705  -2      10       0.080393   dihydroneopterin aldolase
HPSJM_07710  -3      10       -0.207099  hypothetical protein
HPSJM_07715  -1      10       -2.054755  iron-regulated outer membrane protein
HPSJM_07720  1       17       -5.896757  selenocysteine synthase
HPSJM_07725  0       13       -4.646616  transcription elongation factor NusA
HPSJM_07730  0       15       -5.013041  hypothetical protein
HPSJM_07735  1       14       -4.728997  phage-associated protein
HPSJM_07740  -1      11       -4.042638  hypothetical protein
HPSJM_07745  0       10       -3.005724  hypothetical protein
18  HPSJM_00215  HPSJM_00240  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
HPSJM_00215  -2      13       0.332271   comB8 competence protein
HPSJM_00220  -2      14       0.142886   ComB9 competence protein
HPSJM_00225  -2      15       0.911778   ComB10 competence protein
HPSJM_00230  0       15       0.952088   mannose-6-phosphate isomerase
HPSJM_00235  -1      13       2.039818   GDP-D-mannose dehydratase
HPSJM_00240  -2      13       2.219481   nodulation protein (nolK)
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
HPSJM_00215  PF04335  131    5e-40    VirB8 type IV secretion protein
		>PF04335#VirB8 type IV secretion protein

Length = 227

Score = 131 bits (332), Expect = 5e-40
Identities = 37/202 (18%), Positives = 73/202 (36%), Gaps = 4/202 (1%)

Query: 40 QSVFRLERNRLKIAYKLLGLMSFIALVLAIVLISVLPLQKTEHHF--VDFLNQDKHYAII 97
+ K+A+ + G+ +A + + ++ PL+ E + VD + A
Sbjct: 22 RDKLAAAERSKKLAWVVAGVAGALATAGVVAVAALTPLKTVEPYVITVDRNTGEASIAAK 81

Query: 98 QRADKSISSNEALARSLIGAYVLNRESINRIDDKSRYELVRLQSSSKVWQRFEDLIKTQN 157
D +I+ +EA+ + + YV RE + ++ V + S+ R+ KT N
Sbjct: 82 LHGDATITYDEAVRKYFLATYVRYREGWIAAAREEYFDAVMVMSARPEQDRWSRFYKTDN 141

Query: 158 SIYAQSHLEREVHI-VNIAIYQQDNNPIASVSIAAKLMNENKLVYEKRYKIVLSYLFDTP 216
Q+ L + V I +A V + + + + + Y D
Sbjct: 142 PQSPQNILANRTDVFVEIKRVSFLGGNVAQVYFTKESVTGSNST-KTDAVATIKYKVDGT 200

Query: 217 DFDYASMPKNPTGFKITRYSIT 238
KNP G+++ Y
Sbjct: 201 PSKEVDRFKNPLGYQVESYRAD 222


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
HPSJM_00220  TYPE4SSCAGX  29     0.026    Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein

signature.
Length = 522

Score = 29.0 bits (64), Expect = 0.026
Identities = 22/80 (27%), Positives = 40/80 (50%), Gaps = 13/80 (16%)

Query: 186 NNKPLKEEKEETKEKEEETITIGDNTNAMKIIKKDIQKGYKALKSSQRKWYCLGICSKKS 245
N + ++EEK++ + + + NA+K + + + Y ++ + K+S
Sbjct: 364 NKEKIREEKQKIILDQAKALETQYVHNALK--RNPVPRNYNYYQAPE----------KRS 411

Query: 246 KLSLMPKEIFNDKQFTYFKF 265
K +MP EIF+D FTYF F
Sbjct: 412 K-HIMPSEIFDDGTFTYFGF 430


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPSJM_00230  FLGMRINGFLIF  30     0.018    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 30.3 bits (68), Expect = 0.018
Identities = 18/80 (22%), Positives = 29/80 (36%), Gaps = 7/80 (8%)

Query: 272 ALFEEAANEPKENVSLNQTPVFAKESANNLVFSHKVSAL---LGVENLAIIDTKDALLIA 328
+LF P +V++ P A + H VS+ L N+ ++D LL
Sbjct: 162 SLFVREQKSPSASVTVTLEPGRALDEGQISAVVHLVSSAVAGLPPGNVTLVDQSGHLLTQ 221

Query: 329 HKDKANDLKA----LVSEVE 344
DL ++VE
Sbjct: 222 SNTSGRDLNDAQLKFANDVE 241


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPSJM_00235  NUCEPIMERASE  86     5e-21    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 86.4 bits (214), Expect = 5e-21
Identities = 46/180 (25%), Positives = 72/180 (40%), Gaps = 19/180 (10%)

Query: 7 LITGVTGQDGSYLAEYLLNLGYEVHGLKRRSSSINTSRIDHLYEDLHSDHKRRFFLHYGD 66
L+TG G G ++++ LL G++V G+ + + S E L F H D
Sbjct: 4 LVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQP---GFQFHKID 60

Query: 67 MTDSSNLIHLIATTKPTEIYNLAAQSHVKVSFETPEYTANADGIGTLRILEAMRILGLEK 126
+ D + L A+ ++ + V+ S E P A+++ G L ILE R ++
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQ- 119

Query: 127 KTRFYQASTSELYGEVLETPQNENTPF-------NPRSPYAVAKMYAFYITKNYRETYSL 179
AS+S +YG N PF +P S YA K + Y Y L
Sbjct: 120 --HLLYASSSSVYGL------NRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGL 171


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPSJM_00240  NUCEPIMERASE  51     2e-09    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 50.9 bits (122), Expect = 2e-09
Identities = 51/346 (14%), Positives = 107/346 (30%), Gaps = 54/346 (15%)

Query: 5 ILITGAYGMVGQNTALYFKKNKPDV-----------TLLTPKKSELY-----------LL 42
L+TGA G +G + + + V L + EL L
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDLA 62

Query: 43 DKDNVQAYLKEYKPTGIIHCAGRVGGIVANMNDLSTYMVENLLMGLYLFSSALDSGVKKA 102
D++ + + R + ++ + Y NL L + + ++
Sbjct: 63 DREGMTDLFASGHFERVFISPHR-LAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121

Query: 103 INLASSCAYPKYAPNPLKESDLLNGSLEPTNEGYALAKLSVMKYCEYVSAEKGVFYKTLV 162
+ +SS Y P D ++ + YA K + S G+ L
Sbjct: 122 LYASSSSVYGLNRKMPFSTDDSVDHPVSL----YAATKKANELMAHTYSHLYGLPATGLR 177

Query: 163 PCNLYGEFDKFEEKIAHMIPGLIARMHTAKLKNEKEFAMWGDGTARREYLNAKDLARFIS 222
+YG + + P + T + K ++ G +R++ D+A I
Sbjct: 178 FFTVYGPWGR---------PDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAII 228

Query: 223 LAYENIASIPS-----------------VMNVGSGVDYSIEEYYEKVAQVLDYKGVFVKD 265
+ I + V N+G+ + +Y + + L +
Sbjct: 229 RLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNML 288

Query: 266 LSKPVGMQQKLMDISK-QKALKWELEIPLEQGIKEAYEYYLKLLEV 310
+P + + D + + + E ++ G+K +Y +V
Sbjct: 289 PLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDFYKV 334


19  HPSJM_01340  HPSJM_01375  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
HPSJM_01340  -2      13       1.034215   Neutrophil activating protein NapA
HPSJM_01345  -2      12       0.950997   histidine kinase sensor protein
HPSJM_01350  -2      12       1.763307   hypothetical protein
HPSJM_01355  -2      11       2.261319   flagellar basal body P-ring protein
HPSJM_01360  -1      13       2.084320   ATP-dependent RNA helicase
HPSJM_01365  -1      12       1.836713   hypothetical protein
HPSJM_01370  -2      9        1.291366   hypothetical protein
HPSJM_01375  -3      9        2.211327   oligopeptide permease ATPase protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
HPSJM_01340  HELNAPAPROT  149    3e-49    Helicobacter neutrophil-activating protein A family ...
		>HELNAPAPROT#Helicobacter neutrophil-activating protein A family

signature.
Length = 153

Score = 149 bits (377), Expect = 3e-49
Identities = 39/140 (27%), Positives = 74/140 (52%), Gaps = 1/140 (0%)

Query: 5 EILKHLQADAIVLFMKVHNFHWNVKGTDFFNVHKATEEIYEEFADMFDDLAERIAQLGHH 64
L ++ +L+ K+H FHW VKG FF +H+ EE+Y+ A+ D +AER+ +G
Sbjct: 15 NSLNTQLSNWFLLYSKLHRFHWYVKGPHFFTLHEKFEELYDHAAETVDTIAERLLAIGGQ 74

Query: 65 PLVTLSEALKLTRVKEETKTSFHSKDIFKEILGDYKHLEKEFKELSNTAEKEGDKVTVTY 124
P+ T+ E + + + + + ++ + ++ DYK + E K + AE+ D T
Sbjct: 75 PVATVKEYTEHASITDGGNET-SASEMVQALVNDYKQISSESKFVIGLAEENQDNATADL 133

Query: 125 ADDQLAKLQKSIWMLEAHLA 144
+ +++K +WML ++L
Sbjct: 134 FVGLIEEVEKQVWMLSSYLG 153


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
HPSJM_01345  PF06580  30     0.015    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.8 bits (67), Expect = 0.015
Identities = 10/71 (14%), Positives = 25/71 (35%), Gaps = 13/71 (18%)

Query: 281 IVLQNFLYNAIDAIEALEESEQ-GQVKIEAFIQNEFIVFTIIDNGKEVENKSALFEPFET 339
+++Q + N I + + Q G++ ++ N + + + G +
Sbjct: 258 MLVQTLVENGI--KHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTK------- 308

Query: 340 TKLKGNGLGLA 350
+ G GL
Sbjct: 309 ---ESTGTGLQ 316


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPSJM_01355  FLGPRINGFLGI  359    e-126    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 359 bits (923), Expect = e-126
Identities = 118/345 (34%), Positives = 191/345 (55%), Gaps = 26/345 (7%)

Query: 19 AEKIGDIASVVGVRDNQLIGYGLVIGLNGTGDK-SGSKFTMQSISNMLESVNVKISADDI 77
+I DIAS+ RDNQLIGYGLV+GL GTGD S FT QS+ ML+++ +
Sbjct: 28 TSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQSMRAMLQNLGITTQGGQS 87

Query: 78 KSKNVAAVMITASLPPFARQGDKIDIQISSIGDAKSIQGGTLVMTPLNAVDGNIYALAQG 137
+KN+AAVM+TA+LPPFA G ++D+ +SS+GDA S++GG L+MT L+ DG IYA+AQG
Sbjct: 88 NAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNLIMTSLSGADGQIYAVAQG 147

Query: 138 AITSGN-----------SNNLLSANIINGATIEREVSYDLFHKNAMVLSLKNPNFKNAIQ 186
A+ SA + NGA IERE+ +VL L+NP+F A++
Sbjct: 148 ALIVNGFSAQGDAATLTQGVTTSARVPNGAIIERELPSKFKDSVNLVLQLRNPDFSTAVR 207

Query: 187 VQNTLNKV----FGNKVATALDPKTIQITRPERFSMVEFLALVQEIPINYSAKNKIIVDE 242
V + +N +G+ +A D + I + +P + +A ++ + + K++++E
Sbjct: 208 VADVVNAFARARYGDPIAEPRDSQEIAVQKPRVADLTRLMAEIENLTVETDTPAKVVINE 267

Query: 243 KSGTIVSGVDIMVHPIVVTSQDITLKITKEPLDN--------SKNAQDLDNNMSLDTAHN 294
++GTIV G D+ + + V+ +T+++T+ P Q + M++
Sbjct: 268 RTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQPAPFSRGQTAVQPQTDIMAMQEGSK 327

Query: 295 TLSSNGKNITIAGVVKALQKIGVSAKGMVSILQALKKSGAISAEM 339
G ++ +V L IG+ A G+++ILQ +K +GA+ AE+
Sbjct: 328 VAIVEGPDLR--TLVAGLNSIGLKADGIIAILQGIKSAGALQAEL 370


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits  Score  E-value  Comments
HPSJM_01360  SECA  30     0.027    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 29.8 bits (67), Expect = 0.027
Identities = 17/63 (26%), Positives = 31/63 (49%), Gaps = 2/63 (3%)

Query: 261 IVFTRTKKEADELHQFLASKNYKSTALHGDMDQRDRRASIMAFKKNDADVLVATDVASRG 320
+V T + ++++ + L K L+ + A+I+A A V +AT++A RG
Sbjct: 453 LVGTISIEKSELVSNELTKAGIKHNVLNAKFHANE--AAIVAQAGYPAAVTIATNMAGRG 510

Query: 321 LDI 323
DI
Sbjct: 511 TDI 513


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
HPSJM_01375  HTHFIS  31     0.009    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 31.3 bits (71), Expect = 0.009
Identities = 16/50 (32%), Positives = 21/50 (42%), Gaps = 7/50 (14%)

Query: 30 VAIVGESGSGKSSIANIIMRLNPR----FKPHNGEVLFETTNLLKESEEF 75
+ I GESG+GK +A + R F N + L ESE F
Sbjct: 163 LMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRD---LIESELF 209


20  HPSJM_01860  HPSJM_01895  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
HPSJM_01860  0       11       0.217312   GTP-binding protein LepA
HPSJM_01865  1       13       -1.380429  hypothetical protein
HPSJM_01870  2       15       0.549263   hypothetical protein
HPSJM_01875  0       14       -1.150527  hypothetical protein
HPSJM_01880  0       12       0.005240   flagellar basal-body rod protein
HPSJM_01885  1       12       -0.479239  General substrate transporter, MFS superfamily;
HPSJM_01890  1       13       -1.060639  hypothetical protein
HPSJM_01895  0       13       -0.856595  cell division protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
HPSJM_01860  TCRTETOQM  151    5e-41    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 151 bits (383), Expect = 5e-41
Identities = 103/437 (23%), Positives = 177/437 (40%), Gaps = 85/437 (19%)

Query: 11 NIRNFSIIAHIDHGKSTLADCLISECNAIS---NREMTSQVMDTMDIEKERGITIKAQSV 67
I N ++AH+D GK+TL + L+ AI+ + + + D +E++RGITI+
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61

Query: 68 HLNYTFKGEDYVLNLIDTPGHVDFSYEVSRSLCSCEGALLVVDATQGVEAQTIANTYIAL 127
+F+ E+ +N+IDTPGH+DF EV RSL +GA+L++ A GV+AQT +
Sbjct: 62 ----SFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALR 117

Query: 128 DNNLEILPVINKIDLPNANVLEVKQDIEDTIGIDCSNANEVS----------------AK 171
+ + INKID ++ V QDI++ + + +V
Sbjct: 118 KMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDT 177

Query: 172 VKLGIKDLLEKIITTIP------------------------------------------- 188
V G DLLEK ++
Sbjct: 178 VIEGNDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITNK 237

Query: 189 --APSGDPNAPLKALIYDSWFDNYLGALALVRIMDGSINTEQEILVMGTGKKHGVLGLYY 246
+ + + L ++ + LA +R+ G ++ + + K + +Y
Sbjct: 238 FYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKI-KITEMYT 296

Query: 247 PNPLKKIPTKSLECGEIGIV---SLGLKSVTDIAVGDTLTDAKNPTPKPIEGFMPAKPFV 303
+ GEI I+ L L SV +GDT P + IE P +
Sbjct: 297 SINGELCKIDKAYSGEIVILQNEFLKLNSV----LGDTKLL---PQRERIEN---PLPLL 346

Query: 304 FAGLYPIETDRFEDLREALLKLQLNDCALNFEPESSVALGFGFRVGFLGLLHMEVIKERL 363
+ P + + E L +ALL++ +D L + +S+ + FLG + MEV L
Sbjct: 347 QTTVEPSKPQQREMLLDALLEISDSDPLLRYYVDSATH---EIILSFLGKVQMEVTCALL 403

Query: 364 EREFGLNLIATAPTVVY 380
+ ++ + + PTV+Y
Sbjct: 404 QEKYHVEIEIKEPTVIY 420



Score = 31.0 bits (70), Expect = 0.015
Identities = 15/75 (20%), Positives = 28/75 (37%), Gaps = 2/75 (2%)

Query: 407 IKEPFVRATIITPSEFLGNLMQLLNNKRGIQEKMEYLNQSRVMLTYSLPSNEIVMDFYDK 466
+ EP++ I P E+L + L + V+L+ +P+ I ++
Sbjct: 535 LLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQ-LKNNEVILSGEIPARCI-QEYRSD 592

Query: 467 LKSCTKGYASFDYEP 481
L T G + E
Sbjct: 593 LTFFTNGRSVCLTEL 607


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
HPSJM_01880  FLGHOOKAP1  30     0.010    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 30.3 bits (68), Expect = 0.010
Identities = 9/40 (22%), Positives = 16/40 (40%)

Query: 3 NGYYAATGAMATQFNRLDLTSNNLANLNTNGFKRDDAITG 42
+ A + L+ SNN+++ N G+ R I
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMA 41


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
HPSJM_01885  TCRTETB  39     3e-05    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 39.1 bits (91), Expect = 3e-05
Identities = 42/182 (23%), Positives = 71/182 (39%), Gaps = 33/182 (18%)

Query: 37 APYFAKEFTHTNDPTLALISAFLVFMLGFFMRPLGSLFFGKLGDKKGRKTSMVYSIILMA 96
P A +F T + +AF++ G+ +GKL D+ G K +++ II+
Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSI------GTAVYGKLSDQLGIKRLLLFGIIINC 90

Query: 97 LGSFMLALLPTKEIVGEWAFLFLLLARLLQGFSVGGE------YGVVATYLSELGKNGKK 150
GS + VG F L++AR +QG G VVA Y+ + +
Sbjct: 91 FGSVIGF-------VGHSFFSLLIMARFIQG--AGAAAFPALVMVVVARYIPKENRGKAF 141

Query: 151 GFYGSFQYVTLVGGQLLAIFSLFIVENIYTHEQISAFAWRYLFALGGILALLSLFLRNIM 210
G GS + +G + I I+ W YL + I + FL ++
Sbjct: 142 GLIGS---IVAMGEGVGPAIGGMIAHYIH---------WSYLLLIPMITIITVPFLMKLL 189

Query: 211 EE 212
++
Sbjct: 190 KK 191


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
HPSJM_01895  IGASERPTASE  33     0.007    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 32.7 bits (74), Expect = 0.007
Identities = 43/197 (21%), Positives = 71/197 (36%), Gaps = 26/197 (13%)

Query: 169 FSPKKEGFENTPSDAQKKETNNDKEKENLKENPIDENHKPPNE--ESFLAIPTPYNTTLN 226
++P+ E N D T N+ + + +E +E A TP TT
Sbjct: 981 YNPEVEK-RNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTET 1039

Query: 227 DSE---PQEGLVQISSHPPTHYTI----YPKKNRFDDLTNPTNPPLKEPKQETKEREPTL 279
+E + V+ + T T K+ + + N + + ETKE + T
Sbjct: 1040 VAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTE 1099

Query: 280 TKETPTTPKSIMPAPAPNTENDNKTQNHKTPNHPKKEESPQENAQEEMIEETIKENLKEE 339
TKET T K E K + KT +E P+ +Q +E + +
Sbjct: 1100 TKETATVEK----------EEKAKVETEKT------QEVPKVTSQVSPKQEQSETVQPQA 1143

Query: 340 EKEIQNAPSFSPLTPTS 356
E +N P+ + P S
Sbjct: 1144 EPARENDPTVNIKEPQS 1160


21  HPSJM_02635  HPSJM_02670  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
HPSJM_02635  9       19       -3.000440  cag pathogenicity island protein V
HPSJM_02640  9       18       -2.524506  cag pathogenicity island protein W
HPSJM_02645  10      16       -2.506626  cag pathogenicity island protein X
HPSJM_02650  10      16       -2.085358  cag island protein
HPSJM_02655  8       18       -2.520338  cag pathogenicity island protein (cag6)
HPSJM_02660  8       18       -2.241644  cag island protein, DNA transfer protein
HPSJM_02665  6       16       -2.249494  cag island protein, DNA transfer protein
HPSJM_02670  1       12       -1.217610  cag pathogenicity island protein Gamma
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
HPSJM_02635  PF04335  118    6e-35    VirB8 type IV secretion protein
		>PF04335#VirB8 type IV secretion protein

Length = 227

Score = 118 bits (298), Expect = 6e-35
Identities = 44/205 (21%), Positives = 74/205 (36%), Gaps = 10/205 (4%)

Query: 27 KLNKANRTFKRAFYL---SMALNVAAVTSIVMMMPLKKTDIFVYGIDRYTGEFKIVKRSD 83
KL A R+ K A+ + + AL A V ++ + PLK + +V +DR TGE I +
Sbjct: 24 KLAAAERSKKLAWVVAGVAGALATAGVVAVAALTPLKTVEPYVITVDRNTGEASIAAKLH 83

Query: 84 A-RQIVNSEAVVDSATSKFVSLLFGYSKNSLRDRKDQLMQYCDVSFQTQAMRMFNENIRQ 142
I EAV + +V G+ + + D +M Q + R + + Q
Sbjct: 84 GDATITYDEAVRKYFLATYVRYREGWIAAAREEYFDAVMVMSARPEQDRWSRFYKTDNPQ 143

Query: 143 FVDKVRA-EAIISSNIQREKVKNSPLTRLTFFITIKITPDTMENYEYITKKQVTIYYDFA 201
+ A + I + +F +T T TI Y
Sbjct: 144 SPQNILANRTDVFVEI-KRVSFLGGNVAQVYFTKESVTGSNS----TKTDAVATIKYKVD 198

Query: 202 RGNSSQENLIINPFGFKVFDIQITD 226
S + + NP G++V +
Sbjct: 199 GTPSKEVDRFKNPLGYQVESYRADV 223


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
HPSJM_02645  TYPE4SSCAGX  862    0.0      Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein

signature.
Length = 522

Score = 862 bits (2227), Expect = 0.0
Identities = 512/522 (98%), Positives = 516/522 (98%), Gaps = 1/522 (0%)

Query: 1 MGQAFFKKIVNCFCLGYLFLSSVIEAAP-DIKNFNRGRVKVVNKKIAYLGDEKPITIWTS 59
MGQAFFKKIV CFCLGYLFLSS IEA DIKNFNRGRVKVVNKKIAYLGDEKPITIWTS
Sbjct: 1 MGQAFFKKIVGCFCLGYLFLSSAIEAVALDIKNFNRGRVKVVNKKIAYLGDEKPITIWTS 60

Query: 60 LDNVTVIQLEKDETISYITTGFNKGWNIVPNSNHIFIQPKSVKSNLMFEKEAVNFALMTR 119
LDNVTVIQLEKDETISYITTGFNKGW+IVPNSNHIFIQPKSVKSNLMFEKEAVNFALMTR
Sbjct: 61 LDNVTVIQLEKDETISYITTGFNKGWSIVPNSNHIFIQPKSVKSNLMFEKEAVNFALMTR 120

Query: 120 DYQEFLKTKKLIVDAPDPKELEEQKKALEKEKEAKEQAQKAQKDKREKRKEERAKNRANL 179
DYQEFLKTKKLIVDAPDPKELEEQKKALEKEKEAKEQAQKAQKDKREKRKEERAKNRANL
Sbjct: 121 DYQEFLKTKKLIVDAPDPKELEEQKKALEKEKEAKEQAQKAQKDKREKRKEERAKNRANL 180

Query: 180 ENLTNAMSNPQNLSNNKNLSELIKQQRENELDQMERLEDMQEQAQANALKQIEELNKKQA 239
ENLTNAMSNPQNLSNNKNLSELIKQQRENELDQMERLEDMQEQAQANALKQIEELNKKQA
Sbjct: 181 ENLTNAMSNPQNLSNNKNLSELIKQQRENELDQMERLEDMQEQAQANALKQIEELNKKQA 240

Query: 240 EETIKQRAKDKISIKTDKSQKSPEDNSIELSPSDSAWRTNLVVRTNKALYQFILRIAQKD 299
EE ++QRAKDKISIKTDKSQKSPEDNSIELSPSDSAWRTNLVVRTNKALYQFILRIAQKD
Sbjct: 241 EEAVRQRAKDKISIKTDKSQKSPEDNSIELSPSDSAWRTNLVVRTNKALYQFILRIAQKD 300

Query: 300 NFASAYLTVKLEYPQRHEVSSVIEEELKKREEAKRQKELIKQENLNTTAYINRVMMASNE 359
NFASAYLTVKLEYPQRHEVSSVIEEELKKREEAKRQ+ELIKQENLNTTAYINRVMMASNE
Sbjct: 301 NFASAYLTVKLEYPQRHEVSSVIEEELKKREEAKRQRELIKQENLNTTAYINRVMMASNE 360

Query: 360 QIINKEKIREEKQKIILDQAKALETQYVHNALKRNPVPRNYNYYQAPEKRSKHIMPSEIF 419
QIINKEKIREEKQKIILDQAKALETQYVHNALKRNPVPRNYNYYQAPEKRSKHIMPSEIF
Sbjct: 361 QIINKEKIREEKQKIILDQAKALETQYVHNALKRNPVPRNYNYYQAPEKRSKHIMPSEIF 420

Query: 420 DDGTFTYFGFKNITLQPAIFVVQPDGKLSMTDAAIDPNMTNSGLRWYRVNEIAEKFKLIK 479
DDGTFTYFGFKNITLQPAIFVVQPDGKLSMTDAAIDPNMTNSGLRWYRVNEIAEKFKLIK
Sbjct: 421 DDGTFTYFGFKNITLQPAIFVVQPDGKLSMTDAAIDPNMTNSGLRWYRVNEIAEKFKLIK 480

Query: 480 DKALVTVINKGYGKNPLTKNYNIKNYGELERVIKKLPLVRDK 521
DKALVTVINKGYGKNPLTKNYNIKNYGELERVIKKLPLVRDK
Sbjct: 481 DKALVTVINKGYGKNPLTKNYNIKNYGELERVIKKLPLVRDK 522


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
HPSJM_02650  IGASERPTASE  37     0.001    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 36.6 bits (84), Expect = 0.001
Identities = 41/221 (18%), Positives = 86/221 (38%), Gaps = 8/221 (3%)

Query: 813 QAKNEAERKECEKLLTPEARKLLEEAKESVKAYKDCVSRARNEKEKKECEKLLTPEARKL 872
+ NE + E + P A E E+V S+ + E+ E T + R++
Sbjct: 1011 PSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATET--TAQNREV 1068

Query: 873 LEESKKSVKAYLDC--VSQAKNEAERKECEKLLTPEARKLLEEAKKSVKAYLDCVSRARN 930
+E+K +VKA V+Q+ +E + + + + E+AK + +
Sbjct: 1069 AKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQ 1128

Query: 931 EKEKKECEKLLTPEARKLLENQALDCLKNAK----TEAEKKRCVKDLPKDLQKKVLAKES 986
K+E + + P+A EN +K + T A+ ++ K+ ++++ V +
Sbjct: 1129 VSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTT 1188

Query: 987 VRVYLDCVSKAKNEAERKECEKLLTPEARKLLEEAKESVKA 1027
V V +N + + + K + SV++
Sbjct: 1189 VNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRS 1229



Score = 35.8 bits (82), Expect = 0.002
Identities = 38/229 (16%), Positives = 83/229 (36%), Gaps = 10/229 (4%)

Query: 606 TPEAKKLLEEEAKESVKAYLDCVSQAKTEDEKKECEKLLTPEAKKKLEEAKKSVKAYLDC 665
P ++ + + ++KT ++ ++ T + ++ +EAK +VKA
Sbjct: 1023 APVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQT 1082

Query: 666 VSQAKTEDEKKECEKLLTPEAKKLLEQQALDCLKNAKTDEERKECLKDLPKDLQKKVL-- 723
A++ E KE + T E + +++ KT E K + PK Q + +
Sbjct: 1083 NEVAQSGSETKETQTTETKETATVEKEEKAKVET-EKTQEVPKVTSQVSPKQEQSETVQP 1141

Query: 724 ----AKESVRVYL--DCVSKAKNEAERKECEKLLTPEARKLLEEAKESVKAYLDCVSRAR 777
A+E+ + S+ A+ ++ K + + + E +V V
Sbjct: 1142 QAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTE-STTVNTGNSVVENPE 1200

Query: 778 NEKEKKECEKLLTPEARKLLEESKKSVKAYLDCVSQAKNEAERKECEKL 826
N + + + K ++SV++ V A + + L
Sbjct: 1201 NTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVAL 1249



Score = 35.4 bits (81), Expect = 0.002
Identities = 34/183 (18%), Positives = 70/183 (38%), Gaps = 6/183 (3%)

Query: 737 KAKNEAERKECEKLLTPEARKLLEEAKESVKAYLDCVSRARNEKEKKECEKLLTPEARKL 796
+ NE + E + P A E E+V S+ + E+ E T + R++
Sbjct: 1011 PSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATET--TAQNREV 1068

Query: 797 LEESKKSVKAYLDC--VSQAKNEAERKECEKLLTPEARKLLEEAKESVKAYKDCVSRARN 854
+E+K +VKA V+Q+ +E + + + + E+AK + ++
Sbjct: 1069 AKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQ 1128

Query: 855 EKEKKECEKLLTPEARKLLEESKKSVKAYLDCVSQAKNEAERKECEKLLTPEARKLLEEA 914
K+E + + P+A E + SQ A+ ++ K + + + E+
Sbjct: 1129 VSPKQEQSETVQPQAEPAREND--PTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTES 1186

Query: 915 KKS 917

Sbjct: 1187 TTV 1189



Score = 34.7 bits (79), Expect = 0.005
Identities = 38/222 (17%), Positives = 86/222 (38%), Gaps = 6/222 (2%)

Query: 958 KNAKTEAEKKRCVKDLP-KDLQKKVLAKESVRVYLDCVSKAKNEAERKECEKLLTPEARK 1016
+ + E+ V + P ++ + V + ++K + ++ T + R+
Sbjct: 1008 PSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNRE 1067

Query: 1017 LLEEAKESVKAYKDCVSRARNEKEKQECEKLLTPEARKLLEQEVKKSVKAYLDCVSR-AR 1075
+ +EAK +VKA A++ E +E + T E + ++E K V +
Sbjct: 1068 VAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTS 1127

Query: 1076 NEKEKKECEKLLTPKARKLLENQALDCLKNAK----TEAEKKRCVKDLPKDLQKKVLAKK 1131
K+E + + P+A EN +K + T A+ ++ K+ ++++ V
Sbjct: 1128 QVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTEST 1187

Query: 1132 SVKAYLDCVSRARNEKEKQECEKLLTPEARKLLEEAKKSVKA 1173
+V V N + + + K ++SV++
Sbjct: 1188 TVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRS 1229



Score = 33.1 bits (75), Expect = 0.013
Identities = 33/194 (17%), Positives = 69/194 (35%), Gaps = 20/194 (10%)

Query: 1 MNEENDKLETSKKAQQHSPQDLSNEEATEANHFEDLLKEESSDNHLDNSTETQTHFDEDK 60
EN K E+ + +ATE + +E+ N N+ + +
Sbjct: 1039 TVAENSKQESKTVEKNEQ-------DATETTAQNREVAKEAKSNVKANTQTNEVAQSGSE 1091

Query: 61 LEETQTQMDSEGNETSESSNGSLADKLFKKARKLVDNKRPFTQQKNLDEETQELNEEDDQ 120
+ETQT E + +KA+ + + + + QE +E
Sbjct: 1092 TKETQTTETKETATVEKE----------EKAKVETEKTQEVPKVTSQVSPKQEQSETVQP 1141

Query: 121 ENNEYQEETQTGLIDDETSKKAQQHSPQDLSNEEATEVNHFEDLLKEESSDNHLDNPTES 180
+ +E T I + S+ + + E ++ V E + E ++ N ++ E+
Sbjct: 1142 QAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNV---EQPVTESTTVNTGNSVVEN 1198

Query: 181 SDNHLDNSTESSDN 194
+N +T+ + N
Sbjct: 1199 PENTTPATTQPTVN 1212



Score = 32.7 bits (74), Expect = 0.016
Identities = 43/260 (16%), Positives = 88/260 (33%), Gaps = 25/260 (9%)

Query: 1142 RARNEKEKQECEKLLTPEARKLLEEAKKSVKAYLDCVSRARNEKEKQECEKLLTPEARKL 1201
+ NE+ + E + P A E ++V S+ + E+ E T + R++
Sbjct: 1011 PSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATET--TAQNREV 1068

Query: 1202 LEEAKESLKAYKDCLSQARNETERRACEKLLTPEARKLLEQEVKKSVKAYLDCVSRARNE 1261
+EAK ++KA A++ +E + + T E + ++E +A+ E
Sbjct: 1069 AKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEE-------------KAKVE 1115

Query: 1262 KEKQECEKLLTPEARKFLEKQRQQKDKAIKDCLKNADPNDRAAIMKCLDGL-----SDEE 1316
EK + +T + +Q++ + ++ + A ND +K E+
Sbjct: 1116 TEKTQEVPKVTSQV-----SPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQ 1170

Query: 1317 KLKYLQEAREKAVLDCLKTARTDEEKRKCQNLYSDLIQEIQNKRAQNKQNQLSKTERLHQ 1376
K E+ V + + +N Q N + NK +
Sbjct: 1171 PAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSV 1230

Query: 1377 ASECLDNLDDPTDQEAIEQC 1396
D+ + C
Sbjct: 1231 PHNVEPATTSSNDRSTVALC 1250



Score = 32.3 bits (73), Expect = 0.023
Identities = 41/246 (16%), Positives = 73/246 (29%), Gaps = 16/246 (6%)

Query: 1073 RARNEKEKKECEKLLTPKARKLLENQALDCLKNAKTEA---EKKRCVKDLPKDLQKKVL- 1128
+ NE+ + E + P A +N+K E+ EK ++V
Sbjct: 1011 PSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAK 1070

Query: 1129 -AKKSVKAYLDCVSRARNEKEKQECEKLLTPEARKLLEEAKKSVKA--YLDCVSRARNEK 1185
AK +VKA A++ E +E + T E + +E K V+ +
Sbjct: 1071 EAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVS 1130

Query: 1186 EKQECEKLLTPEARKLLEEAKESLKAYKDCLSQARNETE---RRACEKLLTPEARKLLEQ 1242
KQE + + P+A E + +TE + + P
Sbjct: 1131 PKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVN 1190

Query: 1243 EVKKSVKAYLDCVSRARNEKEKQECEKLLTPEARKFLEKQRQQKDKAIKDCLKNADPNDR 1302
V+ + E R+ + + A NDR
Sbjct: 1191 TGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPA------TTSSNDR 1244

Query: 1303 AAIMKC 1308
+ + C
Sbjct: 1245 STVALC 1250



Score = 32.3 bits (73), Expect = 0.023
Identities = 42/259 (16%), Positives = 85/259 (32%), Gaps = 13/259 (5%)

Query: 522 RARNEKEKKECEKLLTPEAKKKLEQQVLDCLKNAKTDEERKKCLKDLPKD--LQSDILAK 579
+ NE+ + E + P A + +N+K + + + + + Q+ +AK
Sbjct: 1011 PSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAK 1070

Query: 580 ESLKAYKDCTSQAKTEDEKKECEKLLTPEAKKLLEEEAKESVKAYLDCVSQAKTEDEKKE 639
E+ K T + E ++ T E K+ E +E K E EK +
Sbjct: 1071 EAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKV----------ETEKTQ 1120

Query: 640 CEKLLTPEAKKKLEEAKKSVKAYLDCVSQAKTEDEKKECEKLLTPEAKKLLEQQALDCLK 699
+T + K E+++ T + K+ + T + ++ ++
Sbjct: 1121 EVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVE 1180

Query: 700 NAKTDEERKECLKDLPKDLQKKVLAKESVRVYLDCVSKAKNEAERKECEKLLTPEARKLL 759
T+ + ++ + A V + +K KN R E
Sbjct: 1181 QPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTS 1240

Query: 760 EEAKESVKAYLDCVSRARN 778
+ +V A D S N
Sbjct: 1241 SNDRSTV-ALCDLTSTNTN 1258


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
HPSJM_02670  TACYTOLYSIN  27     0.032    Bacterial thiol-activated pore-forming cytolysin sig...
		>TACYTOLYSIN#Bacterial thiol-activated pore-forming cytolysin signature.

Length = 574

Score = 27.3 bits (60), Expect = 0.032
Identities = 12/43 (27%), Positives = 22/43 (51%), Gaps = 3/43 (6%)

Query: 128 NKSVYQLVEMAIGAYNGG-MKHDPNGAYVKKFRCIYSQVRYNE 169
N+S Y VE Y G + GAYV ++ ++ ++ Y++
Sbjct: 451 NRSEY--VETTSTEYTSGKINLSHQGAYVAQYEILWDEINYDD 491


22  HPSJM_02930  HPSJM_02960  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
HPSJM_02930   2      14       -0.779135  hypothetical protein
HPSJM_02935   1      15       -0.223649  hypothetical protein
HPSJM_02940   1      16       -0.309620  dihydroorotase
HPSJM_02945   0      16       -2.613576  putative siderophore-mediated iron transport
HPSJM_02950  -2      15       -2.893997  hypothetical protein
HPSJM_02955  -2      15       -2.582465  flagellar motor switch protein
HPSJM_02960  -1      14       -1.336128  endonuclease III
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPSJM_02930  TYPE3IMSPROT  29     0.008    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 29.3 bits (66), Expect = 0.008
Identities = 18/64 (28%), Positives = 30/64 (46%), Gaps = 4/64 (6%)

Query: 88 LQSYSVMLFFNLLLLTDILGFLPFSIYHHFMASLIFSALFCGSLFLSSPLLGMIALVALS 147
L Y F L+L+ +LPFS S + + +L PLL + AL+A++
Sbjct: 45 LSDYYFEHFSKLMLIPAEQSYLPFSQ----ALSYVVDNVLLEFFYLCFPLLTVAALMAIA 100

Query: 148 SSLL 151
S ++
Sbjct: 101 SHVV 104


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
HPSJM_02945  TONBPROTEIN  52     4e-10    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 52.3 bits (125), Expect = 4e-10
Identities = 24/52 (46%), Positives = 27/52 (51%)

Query: 91 PQKPPTPPTPPTPPTPPTPPKPIEKPKPEPKPKPKPEPKKPNHKHKALKKVE 142
P P P P P P P IEKPKP+PKPKPKP K + +K VE
Sbjct: 67 PVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVE 118



Score = 46.1 bits (109), Expect = 5e-08
Identities = 27/74 (36%), Positives = 32/74 (43%), Gaps = 8/74 (10%)

Query: 83 APKPTLAGPQKPPTPPTPPTPPTPPTPPKPIEKPKPEPKPKPKPEPKKPNHKHKALKKVE 142
A Q PP P P P P P P E P KPKPKP+PK K V+
Sbjct: 53 ADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKP--------KPVK 104

Query: 143 KVEEKKVVEEKKEE 156
KV+E+ + K E
Sbjct: 105 KVQEQPKRDVKPVE 118



Score = 45.4 bits (107), Expect = 1e-07
Identities = 21/65 (32%), Positives = 27/65 (41%), Gaps = 1/65 (1%)

Query: 83 APKPTLAGPQKPPTPPTPPTPPTPPTPPKPIEKPKPEPKPKPKPEPK-KPNHKHKALKKV 141
P P P P P P PPK +PKPKPKP+PK + + + V
Sbjct: 55 LEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDV 114

Query: 142 EKVEE 146
+ VE
Sbjct: 115 KPVES 119



Score = 39.6 bits (92), Expect = 8e-06
Identities = 17/50 (34%), Positives = 22/50 (44%)

Query: 81 PGAPKPTLAGPQKPPTPPTPPTPPTPPTPPKPIEKPKPEPKPKPKPEPKK 130
P P+ P P P P PKP KPKP+P K + +PK+
Sbjct: 63 PPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKR 112



Score = 38.4 bits (89), Expect = 2e-05
Identities = 43/214 (20%), Positives = 79/214 (36%), Gaps = 34/214 (15%)

Query: 101 PTPPTPPTPPKPIEKPKPEPKPKPKPEPKKPNHKHKALKKVEKVEEKKVVEEKKEEKKVV 160
P PP +P +P EP+P+P+P P+ P K+ V EK + K + K V
Sbjct: 52 PADLEPPQAVQPPPEPVVEPEPEPEPIPEPP-------KEAPVVIEKPKPKPKPKPKPVK 104

Query: 161 EQKVEQKKIEEKKPVKKEFDPNQLSFLPKEVAPPRKENNKGLDNQTRRDIDELYGEEFGD 220
KV+++ + KPV E P N T +
Sbjct: 105 --KVQEQPKRDVKPV--------------ESRPASPFENTAPARLTSSTATAATSKPVTS 148

Query: 221 LGTAEKDFIRNNLRDIGRITQKYLEYPQVAAYLGQDGTNAVEFYLHPNGDITDLKIIIGS 280
+ + + RN + YP A L +G V+F + P+G + +++I+
Sbjct: 149 VASGPRALSRNQPQ-----------YPARAQALRIEGQVKVKFDVTPDGRVDNVQILSAK 197

Query: 281 EYKMLDDNTLKTIQIAYKDYPRPKTKTLIRIRVR 314
M + ++ + +P + ++ I +
Sbjct: 198 PANMFEREVKNAMRRWRYEPGKPGSGIVVNILFK 231



Score = 35.0 bits (80), Expect = 3e-04
Identities = 12/42 (28%), Positives = 17/42 (40%)

Query: 91 PQKPPTPPTPPTPPTPPTPPKPIEKPKPEPKPKPKPEPKKPN 132
P+ P P P P PKP K + +PK +P +
Sbjct: 79 PEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESR 120


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPSJM_02955  FLGMOTORFLIN  100    1e-30    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 100 bits (250), Expect = 1e-30
Identities = 25/77 (32%), Positives = 47/77 (61%)

Query: 34 LICDYKNLLDMEIVFSAELGSTQIPLLQILRFEKGSVIDLQKPAGESVDTFVNGRVIGKG 93
+ D ++D+ + + ELG T++ + ++LR +GSV+ L AGE +D +NG +I +G
Sbjct: 50 AMQDIDLIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQG 109

Query: 94 EVMVFERNLAIRLNEIL 110
EV+V +R+ +I+
Sbjct: 110 EVVVVADKYGVRITDII 126


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
HPSJM_02960  OMS28PORIN  27     0.043    OMS28 porin signature.
		>OMS28PORIN#OMS28 porin signature.

Length = 257

Score = 27.4 bits (60), Expect = 0.043
Identities = 28/112 (25%), Positives = 53/112 (47%), Gaps = 11/112 (9%)

Query: 22 NQTTELRHKNPYELLVATILSAQCTDARVNQITPKLFEKYPSVNDLAL-----ASLEEVK 76
N+ E+ K E A ++ + T QI + K P+ +L L A +E+VK
Sbjct: 132 NKVVEMSKKAVQETQKAVSVAGEATFLIEKQI---MLNKSPNNKELELTKEEFAKVEQVK 188

Query: 77 EIIKSVSYFNNKSKHLISMAQKVVRDFKGVIPSTQKELMSLDGVGQKTANVV 128
E + + +++ + AQKV+ G+ PS + ++++ V + +NVV
Sbjct: 189 ETLMASERALDET---VQEAQKVLNMVNGLNPSNKDQVLAKKDVAKAISNVV 237


23  HPSJM_03045  HPSJM_03085  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
HPSJM_03045  -3      11        0.609124  flagellin A
HPSJM_03050  -3      10        0.670800  3-methyladenine DNA glycosylase
HPSJM_03055  -1      11        1.088565  hypothetical protein
HPSJM_03060   1      10        0.564646  uroporphyrinogen decarboxylase
HPSJM_03065   1      11        0.370295  outer-membrane protein of the hefABC efflux
HPSJM_03070   2      11        0.282051  membrane fusion protein of the hefABC efflux
HPSJM_03075   2      10        0.095606  cytoplasmic pump protein of the hefABC efflux
HPSJM_03080   3      10       -0.776880  hypothetical protein
HPSJM_03085   3      10       -0.830834  putative vacuolating cytotoxin (VacA)-like
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
HPSJM_03045  FLAGELLIN  244    6e-77    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 244 bits (624), Expect = 6e-77
Identities = 126/518 (24%), Positives = 209/518 (40%), Gaps = 22/518 (4%)

Query: 2 AFQVNTNINAMNAHVQSALTQNALKTSLERLSSGLRINKAADDASGMTVADSLRSQASSL 61
A +NTN ++ +Q++L +++ERLSSGLRIN A DDA+G +A+ S L
Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60

Query: 62 GQAIANTNDGMGIIQVADKAMDEQLKILDTVKVKATQAAQDGQTTESRKAIQSDIVRLIQ 121
QA N NDG+ I Q + A++E L V+ + QA + K+IQ +I + ++
Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120

Query: 122 GLDNIGNTTTYNGQALLSGQFTNKEFQVGAYSNQSIKASIGSTTSDKIGQVRI-ATGALI 180
+D + N T +NG +LS + QVGA ++I + +G G
Sbjct: 121 EIDRVSNQTQFNGVKVLSQDN-QMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKE 179

Query: 181 TASGDISLTFKQVDGVNDVTLESVKVSSSAGTGIGVLAEVINKNSNRTGVKAYASVITTS 240
GD+ +FK V G + + + K +G V ++ V A +TT
Sbjct: 180 ATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTD 239

Query: 241 DVAVQSGSLSNLTLNGIHLGNIADIKKNDSDGRLVAAINAVTSETGVEAYTDQKGRLNLR 300
D N + K A A+ + + + +
Sbjct: 240 DAE-----------NNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTID 288

Query: 301 SIDGRGIEIKTDSVSNGPSALTMVNGGQDLTKGSTNYGRLSLTRLDAKSINV------VS 354
+ G K + NG V S + +N +
Sbjct: 289 TKTGNDGNGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKT 348

Query: 355 ASDSQHLGFTAIGFGESQVAETTVNLRDVTGNFNANVKSASGANYNAVIASGNQSL---G 411
++S L ++ TVN + T N + + +G + S
Sbjct: 349 KNESAKLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINE 408

Query: 412 SGVTTLRGAMVVIDIAESAMKMLDKVRSDLGSVQNQMISTVNNISITQVNVKAAESQIRD 471
+ + +SA+ +D VRS LG++QN+ S + N+ T N+ +A S+I D
Sbjct: 409 DAAAAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIED 468

Query: 472 VDFAEESANFNKNNILAQSGSYAMSQANTVQQNILRLL 509
D+A E +N +K IL Q+G+ ++QAN V QN+L LL
Sbjct: 469 ADYATEVSNMSKAQILQQAGTSVLAQANQVPQNVLSLL 506


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
HPSJM_03065  RTXTOXIND  30     0.019    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 30.2 bits (68), Expect = 0.019
Identities = 16/113 (14%), Positives = 41/113 (36%), Gaps = 16/113 (14%)

Query: 203 LARMIALQKKLEQIKTDIKRVTKLYDEGLTTIDDL-----QSLKAQGNLSEY--DILDIQ 255
LAR+ + K+ + + L + + + ++A L Y + I+
Sbjct: 220 LARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIE 279

Query: 256 FALEQNRLTLEYLTNLSVKNLKKTTIDAPNLQLRERQD-LVSLREQISALKYQ 307
+ + + +T K +D +LR+ D + L +++ + +
Sbjct: 280 SEILSAKEEYQLVTQ----LFKNEILD----KLRQTTDNIGLLTLELAKNEER 324


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
HPSJM_03070  RTXTOXIND  52     7e-10    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 51.8 bits (124), Expect = 7e-10
Identities = 23/82 (28%), Positives = 36/82 (43%), Gaps = 5/82 (6%)

Query: 27 NVKAVQDSKLTLDSTGIVDSIKVTEGSVVKKGDVLLLLYNQEKQAQSDSTEQQLIFAKKQ 86
K ++ IV I V EG V+KGDVLL L +A + T+ L+ A+ +
Sbjct: 95 RSKEIKPI-----ENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLE 149

Query: 87 YQRYSKTGGAVDKNTLESYEFN 108
RY +++ N L +
Sbjct: 150 QTRYQILSRSIELNKLPELKLP 171



Score = 29.8 bits (67), Expect = 0.010
Identities = 21/152 (13%), Positives = 47/152 (30%), Gaps = 25/152 (16%)

Query: 70 QAQSDSTEQQLIFAKKQYQR--YSKTGGAVDKNTLESYEFNYRRLESDYAYSIAVLNKTI 127
+++ S +++ + ++ K D + + E ++
Sbjct: 279 ESEILSAKEEYQLVTQLFKNEILDKLRQTTDN--IGLLTLELAKNEER-------QQASV 329

Query: 128 LRAPFDGVIASKNIQVGEGVSANNTVLLRLVSHARKLVIE--FDSKYINAVKVG------ 179
+RAP + + GV L+ +V L + +K I + VG
Sbjct: 330 IRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIK 389

Query: 180 -DTYTYSIDGDSNQHEAKITKIYP--TVDENT 208
+ + Y+ G K+ I D+
Sbjct: 390 VEAFPYTRYGYL---VGKVKNINLDAIEDQRL 418


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPSJM_03075  ACRIFLAVINRP  901    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 901 bits (2329), Expect = 0.0
Identities = 285/1038 (27%), Positives = 518/1038 (49%), Gaps = 40/1038 (3%)

Query: 1 MYKTAINRPITTLMFALAIVFFGTMGFKKLSVALFPKIDLPTVVVTTTYPGASAEIIESK 60
M I RPI + A+ ++ G + +L VA +P I P V V+ YPGA A+ ++
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTDKIEEAVMGIDGIKKVTSTSSKNVSIVV-IEFELEKPNEEALNDVVNKISSVR-FDDS 118
VT IE+ + GID + ++STS S+ + + F+ + A V NK+
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 119 NIKKPSINKFDTDSQAIISLFVSSSSVPVT--TLNDYAKNTIKPMLQKINGVGGVQLNGF 176
+++ I+ + S ++ S + T ++DY + +K L ++NGVG VQL G
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG- 179

Query: 177 RERQIRIYADPTLMNKYNLTYADLFSTLKAENVEIDGGRIVNS------QRELSILINAN 230
+ +RI+ D L+NKY LT D+ + LK +N +I G++ + Q SI+
Sbjct: 180 AQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 231 SYSVADVEKIQV-----GNHVRLGDIAKIEIGLEEDNTFASFKDKPGVILEIQKIAGANE 285
+ + K+ + G+ VRL D+A++E+G E N A KP L I+ GAN
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 286 IEIVDRVYEALKHIQAISP-NYEIRPFLDTTGYIRTSIEDVKFDLVLGAILAVLVVFAFL 344
++ + L +Q P ++ DTT +++ SI +V L +L LV++ FL
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 345 RNGTITLVSAISIPISIMGTFALIQWMGFSLNMLTMVALTLAIGIIIDDAIVVIENIHK- 403
+N TL+ I++P+ ++GTFA++ G+S+N LTM + LAIG+++DDAIVV+EN+ +
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 404 KLEMGMDKRKASYEGVREIGFALVAISAMLLSVFVPIGNMKGIIGRFFQSFGITVALAIA 463
+E + ++A+ + + +I ALV I+ +L +VF+P+ G G ++ F IT+ A+A
Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479

Query: 464 LSYVVVVTIIPMVSSVVVNPRHS-------RFYVWSEPFFKALESHYTRLLQWVLNHKLI 516
LS +V + + P + + ++ P + F+ W F +HYT + +L
Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 517 IFIAVVLVFVGSLFVASKIGMEFMLKEDRGRFLVWLKAKPGVSIDY----MTQKSKIFQK 572
+ L+ G + + ++ F+ +ED+G FL ++ G + + + Q + + K
Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 573 AIEKHAEVEFTTLQVGY-GTTQNPFKAKIFVQLKPLKERKKEHQLGQFELMSALRKELKS 631
+ + E FT + G QN FV LKP +ER + ++ + EL
Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNA--GMAFVSLKPWEERNGDENS-AEAVIHRAKMELGK 656

Query: 632 MPEAKGLDTINLSEVSLLGGGGDSSPFQTFVFSHSQEAVDKSVENLKKFLLENPELKGKI 691
+ + + N+ + L G ++ F + + D + + L + +
Sbjct: 657 IRDGFVI-PFNMPAIVEL---GTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASL 712

Query: 692 EGYHTSTSESQPQLQLKILRQNANKYGVSAQTIGSVVSSAFSGTSQASVFKEDGKEYDMI 751
+ E Q +L++ ++ A GVS I +S+A G + + F + G+ +
Sbjct: 713 VSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGG-TYVNDFIDRGRVKKLY 771

Query: 752 IRVPDNKRVSVEDIKRLQVRNKYDKLMFLDALVEITETKSPSSISRYNRQRSVTVLAQPK 811
++ R+ ED+ +L VR+ +++ A + RYN S+ + +
Sbjct: 772 VQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAA 831

Query: 812 AGISLGEILTQVSKNTKEWLVEGANYRFTGEADNAKETNGEFLVALATAFVLIYMILAAL 871
G S G+ + + +N L G Y +TG + + + + +A +FV++++ LAAL
Sbjct: 832 PGTSSGDAMALM-ENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAAL 890

Query: 872 YESILEPFIIMVTMPLSFSGAFFALGLVHQPLSMFSMIGLILLIGMVGKNATLLIDVANE 931
YES P +M+ +PL G A L +Q ++ M+GL+ IG+ KNA L+++ A +
Sbjct: 891 YESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKD 950

Query: 932 -ERKKGLNIQEAILFAGKTRLRPILMTTIAMVCGMLPLALASGDGAAMKSPIGIAMSGGL 990
K+G + EA L A + RLRPILMT++A + G+LPLA+++G G+ ++ +GI + GG+
Sbjct: 951 LMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGM 1010

Query: 991 MISMVLSLLIVPVFYRLL 1008
+ + +L++ VPVF+ ++
Sbjct: 1011 VSATLLAIFFVPVFFVVI 1028


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPSJM_03085  VACCYTOTOXIN  283    3e-79    Helicobacter pylori vacuolating cytotoxin signature.
		>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature.

Length = 1291

Score = 283 bits (725), Expect = 3e-79
Identities = 109/397 (27%), Positives = 182/397 (45%), Gaps = 14/397 (3%)

Query: 2797 AGNNSILWLNELFVAKGGNPLFAPYYLQDTPTEHIVTLMKDITSALGMLSKPNLKNNSTD 2856
+G L L + + +A + T I + T+ L ++ K +
Sbjct: 904 SGAQGRDLLQTLLI-DSHDAGYARTMIDATSANEITKQLNTATTTLNNIASLEHKTSGLQ 962

Query: 2857 ALQLSTYTQQMSRLAKLSNFASFDSTDFSERLSSLKNQRFADAIPNAMDVILKYSQRDKL 2916
L LS SRL LS + F++RL +LK+QRFA + +A +V+ +++ + +
Sbjct: 963 TLSLSNAMILNSRLVNLSRRHTNHIDSFAKRLQALKDQRFAS-LESAAEVLYQFAPKYEK 1021

Query: 2917 KNNLWATGVGGVSFVENGTGTLYGVNVGYDRFIKG---VIVGGYAAYGYSGFYER--ITN 2971
N+WA +GG S G +LYG + G D ++ G IVGG+ +YGYS F + N
Sbjct: 1022 PTNVWANAIGGTSLNSGGNASLYGTSAGVDAYLNGEVEAIVGGFGSYGYSSFSNQANSLN 1081

Query: 2972 SRSNNVDMGLYARAFIKKSELTFSVNETWGANKTQISSNDTLLSMINQSYKYSTWTTNAR 3031
S +NN + G+Y+R F + E F G++++ ++ LL +NQSY Y ++ R
Sbjct: 1082 SGANNTNFGVYSRIFANQHEFDFEAQGALGSDQSSLNFKSALLRDLNQSYNYLAYSAATR 1141

Query: 3032 VNYGYDFMFKNKSIILKPQIGLRYYYIGMTGLDGVMNNALYNQFKANADPSKKSVLTIDL 3091
+YGYDF F +++LKP +G+ Y ++G T + S + +
Sbjct: 1142 ASYGYDFAFFRNALVLKPSVGVSYNHLGSTNFKS----NSNQKVALKNGASSQHLFNASA 1197

Query: 3092 ALENRHYFNTNSYFYAIGGFGRDLLIRSMGDKLVRFIGNNTLSYRKGELYNTFASITTGG 3151
+E R+Y+ SYFY G ++ + V + R NT A + GG
Sbjct: 1198 NVEARYYYGDTSYFYMNAGVLQEFANFGSSNA-VSLNTFKVNATRNP--LNTHARVMMGG 1254

Query: 3152 EVRLFKSFYANAGVGARFGLDYKMINITGNIGMRLAF 3188
E++L K + N G L + + N+GMR +F
Sbjct: 1255 ELKLAKEVFLNLGFVYLHNLISNIGHFASNLGMRYSF 1291



Score = 40.0 bits (93), Expect = 2e-04
Identities = 85/475 (17%), Positives = 156/475 (32%), Gaps = 101/475 (21%)

Query: 82 SVNENNNNKSYYISPLRTWAGGNRSFTQNYNNSQLYIGTKNASSTPNHSSVWFGEKGYIG 141
V+ N +Y +S L + GG+ N + L +G N +S ++ K
Sbjct: 133 EVDMQNAVGTYNLSGLINFTGGD--LDVNMQKATLRLGQFNGNSFTSY-------KDSAD 183

Query: 142 FITGV-FKAKDIFITGAVGSGNEWKTGGG-----AILVFESSNGLSANGAYFQNNRAGTQ 195
T V F AK+I I + N +G G +L ++S G+++ +
Sbjct: 184 RTTRVDFNAKNILIDNFLEINNRVGSGAGRKASSTVLTLQASEGITSRE---NAEISLYD 240

Query: 196 NSWINLISNHSVNLTNTDFGNQTPNGGF-----------NVMGRKITYNGGTINGGNFGF 244
+ +NL SN + N G G + V G ++ +N T+ N
Sbjct: 241 GATLNLASNSVKLMGNVWMGRLQYVGAYLAPSYSTINTSKVTG-EVNFNHLTVGDHNAAQ 299

Query: 245 DNVDSNGATTISGVTFNNNGALTY----KGGNGIGGSITFTNSNINHYKLNLNANSVTFN 300
+ ++ T I + + L +GG + +N+ N+ K + +S +
Sbjct: 300 AGIIASNKTHIGTLDLWQSAGLNIIAPPEGGYKDKPNDKPSNTTQNNAKNDKQESSQNNS 359

Query: 301 NSALGSMPN------------------GNANTIGNAYILNAN------NITFNNLTFNGG 336
N+ + + PN G NT+ N +N N F
Sbjct: 360 NTQVINPPNSAQKTEIQPTQVIDGPFAGGKNTVVNINRINTNADGTIRVGGFKASLTTNA 419

Query: 337 WFVFNRSDAHVNFQGTTTINNPTSPFVNMTGKVTINPNAIFNIQ--NYTPSIGSAYTLFS 394
+ +N + + N+TG +T++ N Q Y + SA F
Sbjct: 420 AHLHIGKGG-INLSNQAS--GRSLLVENLTGNITVDGPLRVNNQVGGYALAGSSANFEFK 476

Query: 395 M----KNGSITYND------------------------VDNLWNIIRL----------KN 416
KNG+ T+N+ + +N + K
Sbjct: 477 AGTDTKNGTATFNNDISLGRFVNLKVDAHTANFKGIDTGNGGFNTLDFSGVTNKVNINKL 536

Query: 417 TQATKDNSKNATSNNNTHTYYVTYNLGGTLYNFRQIFSPNSIVLQSVYYGTNNIY 471
A+ + + + N ++G + I S + I + GT +IY
Sbjct: 537 ITASTNVAVKNFNINELVVKTNGVSVGEYTHFSEDIGSQSRINTVRLETGTRSIY 591



Score = 33.5 bits (76), Expect = 0.019
Identities = 40/266 (15%), Positives = 85/266 (31%), Gaps = 22/266 (8%)

Query: 546 GGYEGVNWGKTGYITGTFTADRVYITGNMMSGNGAQTGGGATLNFVGAT-EINIAGADFK 604
GG++ N + ++ + +M + G G NF G ++N+ A
Sbjct: 111 GGWDWGNAARHYWVKDGQWNK---LEVDMQNAVGTYNLSGLI-NFTGGDLDVNMQKATL- 165

Query: 605 NLKTTSQNSYMTFMALGD-----SFGSGKINVSQS-DFYDWTGGGYDFTGNGVFDSVNFN 658
L + NS+ ++ D F + I + + + G G + ++ +
Sbjct: 166 RLGQFNGNSFTSYKDSADRTTRVDFNAKNILIDNFLEINNRVGSGAGRKASSTVLTLQAS 225

Query: 659 KAYYKFQGAE-NSYHFKNTNFLAGNFKFQGKTTIER----SVLSDASYTFDGANNTFNED 713
+ + AE + Y N + + K G + R SY+ + E
Sbjct: 226 EGITSRENAEISLYDGATLNLASNSVKLMGNVWMGRLQYVGAYLAPSYSTINTSKVTGEV 285

Query: 714 KFNGGSFNFNHAEQTDAFNNNSFNGGSFSFNAKQVDFNYNSFNGGVFNF---NNTPKVSF 770
FN + ++A Q +N + G+ + N + G + + +
Sbjct: 286 NFNHLTVGDHNAAQAGIIASNKTHIGTLDLW-QSAGLNIIAPPEGGYKDKPNDKPSNTTQ 344

Query: 771 TDDTFNVNNQFKING-AQTTFTFNKG 795
+ + + N Q N
Sbjct: 345 NNAKNDKQESSQNNSNTQVINPPNSA 370


24  HPSJM_04585  HPSJM_04630  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
HPSJM_04585   1      16        1.766561  acetate kinase
HPSJM_04590   2      15        0.896279  acetate kinase
HPSJM_04615   3      16        0.168259  hypothetical protein
HPSJM_04620   1      16        0.438717  hypothetical protein
HPSJM_04625   0      14       -0.251072  flagellar basal body rod modification protein
HPSJM_04630   0      13        1.075180  flagellar hook protein FlgE
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPSJM_04585  ACETATEKNASE  122    6e-37    Acetate kinase family signature.
		>ACETATEKNASE#Acetate kinase family signature.

Length = 400

Score = 122 bits (309), Expect = 6e-37
Identities = 48/117 (41%), Positives = 72/117 (61%), Gaps = 2/117 (1%)

Query: 1 MRNIEARK-EKGDKEAKLAFEMCAYRIKKYIGAYMVVLKKVDAIIFTGGLGENYSALRES 59
R++E + GDK A+LA + AYR+KK IG+Y + VD I+FT G+GEN +RE
Sbjct: 283 FRDLEDAAFKNGDKRAQLALNVFAYRVKKTIGSYAAAMGGVDVIVFTAGIGENGPEIREF 342

Query: 60 VCEGLENLGIALHKPTNDNPGNALVNLSQPNTKIQVLRIPTDEELEIALQTKKVLEK 116
+ +GLE LG L K N G + +S ++K+ V+ +PT+EE IA T+K++E
Sbjct: 343 ILDGLEFLGFKLDKEKNKVRGEEAI-ISTADSKVNVMVVPTNEEYMIAKDTEKIVES 398


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPSJM_04590  ACETATEKNASE  353    e-124    Acetate kinase family signature.
		>ACETATEKNASE#Acetate kinase family signature.

Length = 400

Score = 353 bits (908), Expect = e-124
Identities = 139/274 (50%), Positives = 190/274 (69%), Gaps = 6/274 (2%)

Query: 1 MEILVLNLGSSSIKFKLFDMQENKPLASGLAERIGEEIGQLKIKSHLHHNEQELKEKLVI 60
M+ILV+N GSSS+K++L + ++ LA GLAERIG L N +++K K +
Sbjct: 1 MKILVINCGSSSLKYQLIESKDGNVLAKGLAERIGINDSLLTHN----ANGEKIKIKKDM 56

Query: 61 KDHASGLLMIRENLT--KMGIIKDFNQIDAIGHRVVQGGDKFHAPVLVDEKVMREIGNLS 118
KDH + ++ + L G+IKD ++IDA+GHRVV GG+ F + VL+ + V++ I +
Sbjct: 57 KDHKDAIKLVLDALVNSDYGVIKDMSEIDAVGHRVVHGGEYFTSSVLITDDVLKAITDCI 116

Query: 119 ILAPLHNPANLAGIEFVQKAHPHIPQIAVFDTAFHATMPSYAYMYALPYELYEKYQIRRY 178
LAPLHNPAN+ GI+ + P +P +AVFDTAFH TMP YAY+Y +PYE Y KY+IR+Y
Sbjct: 117 ELAPLHNPANIEGIKACTQIMPDVPMVAVFDTAFHQTMPDYAYLYPIPYEYYTKYKIRKY 176

Query: 179 GFHGTSHHYVAKEAAKFLNIAYEEFNAISLHLGNGSSVAAIQNGKSVDTSMGLTPLEGLI 238
GFHGTSH YV++ AA+ LN E I+ HLGNGSS+AA++NGKS+DTSMG TPLEGL
Sbjct: 177 GFHGTSHKYVSQRAAEILNKPIESLKIITCHLGNGSSIAAVKNGKSIDTSMGFTPLEGLA 236

Query: 239 MGTRCGDIDPTVVEYTAQCANKSLEEVMKMLNHE 272
MGTR G IDP+++ Y + N S EEV+ +LN +
Sbjct: 237 MGTRSGSIDPSIISYLMEKENISAEEVVNILNKK 270


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
HPSJM_04620  IGASERPTASE  45     9e-07    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 44.7 bits (105), Expect = 9e-07
Identities = 38/224 (16%), Positives = 69/224 (30%), Gaps = 8/224 (3%)

Query: 287 KKPEKTPIHAKTQTTAPSATPENAPKLALKTPPLMPLIGANPPNDNIPTPLEKEEKTKEV 346
Q PS N + P+ P A TP E E E
Sbjct: 992 VDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPA--------TPSETTETVAEN 1043

Query: 347 SDNKEKTKESSNSAQSAQNTQASDKTSDNKSIAPKETIKHFTQQLKQEIQEYKPPMSKIS 406
S + KT E + + Q + + KS T + Q E +E + +K +
Sbjct: 1044 SKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKET 1103

Query: 407 MDLFPKELGKVEVTIQKVGKNLKVSVISHNNSLQTFLDNQQDLKNSLNALGFEGVDLSFS 466
+ +E KVE + + V +T + + + + + +
Sbjct: 1104 ATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTN 1163

Query: 467 QDSSKEQPKEQLRELFKEQESSPLKENALKSYQENTNHENQETS 510
+ EQP ++ ++ + N S EN + T+
Sbjct: 1164 TTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATT 1207



Score = 36.2 bits (83), Expect = 4e-04
Identities = 38/218 (17%), Positives = 71/218 (32%), Gaps = 7/218 (3%)

Query: 42 KISKDKTAPKESLNHNALKATPKDAKEDAKALEKTPTPNHQHAQNLAKNQQAPTLKDWLN 101
++++ + KE+ + + +E AK + + ++ Q+
Sbjct: 1084 EVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQA 1143

Query: 102 HPKTHPTAKHEAQHETHEANETNPKT-PNETLSKNEKKPNEVTSNAHQINLPNKNPITPN 160
P + + N T P + S N ++P ++ + N +NP
Sbjct: 1144 EPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTT 1203

Query: 161 HANKTPTTPTHSAKEPKTLKDIQTLSQKHDLNASNIQATAPLEKKETPLSASDQLALKTT 220
A PT + S+ +PK S H++ AT + T A L T
Sbjct: 1204 PATTQPTVNSESSNKPKNRHRRSVRSVPHNVEP----ATTSSNDRST--VALCDLTSTNT 1257

Query: 221 QTPTSHTLAKNDAKNTANLSSVLQSLEKKESQNKEHAN 258
S AK +V Q + + E N+ N
Sbjct: 1258 NAVLSDARAKAQFVALNVGKAVSQHISQLEMNNEGQYN 1295



Score = 33.1 bits (75), Expect = 0.003
Identities = 42/237 (17%), Positives = 84/237 (35%), Gaps = 12/237 (5%)

Query: 176 PKTLKDIQTLSQKHDLNASNIQATAPLEKKET-----PLSASDQLALKTTQTPTSHTLAK 230
P+ K QT+ + +NIQA P A T + T+ T+A+
Sbjct: 983 PEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAE 1042

Query: 231 NDAKNTANLSSVLQSLEKKESQNKEHANPPNNEKKTPPLKEALQMNAIKRDKTLSKKKPE 290
N + + + Q + +QN+E A + K + + + +T + + E
Sbjct: 1043 NSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKE 1102

Query: 291 KTPIHAKTQTTAPSATPENAPKLALKTPP-------LMPLIGANPPNDNIPTPLEKEEKT 343
+ + + + + PK+ + P + P ND E + +T
Sbjct: 1103 TATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQT 1162

Query: 344 KEVSDNKEKTKESSNSAQSAQNTQASDKTSDNKSIAPKETIKHFTQQLKQEIQEYKP 400
+D ++ KE+S++ + + T ++ P+ T TQ KP
Sbjct: 1163 NTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKP 1219



Score = 30.4 bits (68), Expect = 0.022
Identities = 41/248 (16%), Positives = 84/248 (33%), Gaps = 12/248 (4%)

Query: 40 NQKISKDKTAPKESLNHNALKATPKDAKEDAKALEKTPTPNHQHA------QNLAKNQQA 93
N++I++ AP T + E++K KT N Q A +
Sbjct: 1014 NEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAK 1073

Query: 94 PTLKDWLNHPKTHPTAKHEAQHETHEANETNPKTPNETLSKNEKKPNEVTSNAHQINLPN 153
+K + + + +T E ET E +K EV Q++
Sbjct: 1074 SNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQ 1133

Query: 154 KNP--ITPNHANKTPTTPTHSAKEPKTLKDIQ-TLSQKHDLNASNIQATAPLEKKETPLS 210
+ + P PT + KEP++ + Q +SN++ + T ++
Sbjct: 1134 EQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVE---QPVTESTTVN 1190

Query: 211 ASDQLALKTTQTPTSHTLAKNDAKNTANLSSVLQSLEKKESQNKEHANPPNNEKKTPPLK 270
+ + T + T +++++ + + + N E A +N++ T L
Sbjct: 1191 TGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALC 1250

Query: 271 EALQMNAI 278
+ N
Sbjct: 1251 DLTSTNTN 1258


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
HPSJM_04630  FLGHOOKAP1  35     7e-04    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 35.3 bits (81), Expect = 7e-04
Identities = 12/33 (36%), Positives = 20/33 (60%)

Query: 2 NDTLLNAYSGIKTHQFGIDSLSNNIANVNTLGY 34
+ + NA SG+ Q +++ SNNI++ N GY
Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGY 33



Score = 33.0 bits (75), Expect = 0.004
Identities = 10/48 (20%), Positives = 20/48 (41%)

Query: 557 IRHKYLETSNVNAGNALTNLILMQRGYSMNARAFGAGDDMIKEAISLK 604
+ ++ S VN NL Q+ Y NA+ + + I+++
Sbjct: 499 LSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546


25  HPSJM_05210  HPSJM_05245  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
HPSJM_05210  -1      14        0.930254  hypothetical protein
HPSJM_05215  -1      14        1.085576  UDP-2, 3-diacylglucosamine hydrolase
HPSJM_05220  -1      14        1.205645  CheA-MCP interaction modulator
HPSJM_05225  -2      13        0.934627  auto phosphorylating histidine kinase
HPSJM_05230   0      12       -1.057873  purine-binding chemotaxis protein (cheW)
HPSJM_05235   0      14       -1.271552  thiol peroxidase
HPSJM_05240   0      12       -1.796507  superoxide dismutase
HPSJM_05245   1      13       -2.141630  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
HPSJM_05210  ALARACEMASE  33     7e-04    Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 32.8 bits (75), Expect = 7e-04
Identities = 9/44 (20%), Positives = 16/44 (36%), Gaps = 1/44 (2%)

Query: 136 GVMPEETLEIYSQISETCKRLKLKGLMCIGAHADDEKKIEKSFT 179
G P+ L ++ Q+ + LM A A+ I +
Sbjct: 132 GFQPDRVLTVWQQL-RAMANVGEMTLMSHFAEAEHPDGISGAMA 174


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
HPSJM_05220  HTHFIS  61     1e-12    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 61.4 bits (149), Expect = 1e-12
Identities = 29/129 (22%), Positives = 50/129 (38%), Gaps = 13/129 (10%)

Query: 181 GEVLFLDDSRTARKTLKNHLSKLGFSITEAVDGEDGLNKLEMLFKKYGDNLRKHLKFIIS 240
+L DD R L LS+ G+ + + + +++
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIA----------AGDGDLVVT 53

Query: 241 DVEMPKMDGYHFLFKLQKDPRFAYIPVIFNSSICDNYSAERAKEMGAVAYLVK-FDAEKF 299
DV MP + + L +++K +PV+ S+ +A +A E GA YL K FD +
Sbjct: 54 DVVMPDENAFDLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTEL 111

Query: 300 TEEISKILD 308
I + L
Sbjct: 112 IGIIGRALA 120


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits            Score   E-value   Comments
HPSJM_05225   HTHFIS          55      5e-10     FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 55.2 bits (133), Expect = 5e-10
Identities = 24/121 (19%), Positives = 55/121 (45%), Gaps = 4/121 (3%)

Query: 682 VLAIDDSSTDRAIIRKCLKPLGITLLEATNGLEGLEMLKNGDKIPDAILVDIEMPKMDGY 741
+L DD + R ++ + L G + +N + GD D ++ D+ MP + +
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD--GDLVVTDVVMPDENAF 63

Query: 742 TFASEVRKYNKFKNLPLIAVTSRVTKTDRMRGVESGMTEYITKPYSGEYLTTVVKRSIKL 801
++K +LP++ ++++ T ++ E G +Y+ KP+ L ++ R++
Sbjct: 64 DLLPRIKKAR--PDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 802 E 802

Sbjct: 122 P 122


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits            Score   E-value   Comments
HPSJM_05245   LUXSPROTEIN     29      0.013     Bacterial autoinducer-2 (AI-2) production protein Lu...
		>LUXSPROTEIN#Bacterial autoinducer-2 (AI-2) production protein LuxS signature.

Length = 171

Score = 28.7 bits (64), Expect = 0.013
Identities = 28/119 (23%), Positives = 47/119 (39%), Gaps = 22/119 (18%)

Query: 13 RFCFDEKVAHVFDDMLERSIPYYHEMLDLGAYFIAQNLKENTNAKPLIYDLGCSTGNFFI 72
RF K D + E+ I H + L A F+ +L ++ I +GC TG +
Sbjct: 38 RFTAPNK-----DILSEKGI---HTLEHLYAGFMRNHLNGDSVEIIDISPMGCRTGFYMS 89

Query: 73 VLN----QQI-------QQDIELVGIDNSMPMLKKAQ---EKLKDFKNARFECMDFLEV 117
++ QQ+ +D+ V N +P L + Q + A+ + LEV
Sbjct: 90 LIGTPSEQQVADAWIAAMEDVLKVENQNKIPELNEYQCGTAAMHSLDEAKQIAKNILEV 148


26   HPSJM_07365   HPSJM_07400   N   Y   N   Pathogenicity Island (unbiased-composition)
LocusTag      DNBias   CDNBias   %GCBias     Product
HPSJM_07365   -1       15        0.556181    putative inner membrane protein translocase
HPSJM_07370   -1       14        0.607311    hypothetical protein
HPSJM_07375   0        12        1.139404    tRNA modification GTPase TrmE
HPSJM_07380   1        11        1.683375    outer membrane protein HomD
HPSJM_07385   1        15        0.626196    hypothetical protein
HPSJM_07390   -2       14        1.043935    hypothetical protein
HPSJM_07395   -1       13        2.071333    hypothetical protein
HPSJM_07400   -2       12        2.312722    membrane-associated lipoprotein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits            Score   E-value   Comments
HPSJM_07365   60KDINNERMP     427     e-147     60kDa inner membrane protein signature.
		>60KDINNERMP#60kDa inner membrane protein signature.

Length = 548

Score = 427 bits (1100), Expect = e-147
Identities = 165/581 (28%), Positives = 278/581 (47%), Gaps = 81/581 (13%)

Query: 10 RLILAIALSFLFITLYSYFFQKPNKTTTETTKQETTNNHTTTSPNAPNAQHFSVTQTIPQ 69
R +L IAL F+ ++ Q + + + T TTT+ + Q Q
Sbjct: 5 RNLLVIALLFVSFMIW----QAWEQDKNPQPQAQQTTQTTTTAAGSAADQG---VPASGQ 57

Query: 70 ENLLSTISFEHARIEIDFLG-RIKQVYLKDKKYLTPKQKGFLEHVG--HLFSSKEN---- 122
L+ ++ + + I+ G ++Q L P L L +
Sbjct: 58 GKLI-SVKTDVLDLTINTRGGDVEQALL-------PAYPKELNSTQPFQLLETSPQFIYQ 109

Query: 123 AQPPL--KELPLLAADKLKPLEVRFLDPTLNNKAFNTPYSASKTTLGPNEQLV--LTQDL 178
AQ L ++ P A+ +PL +N A G NE V D
Sbjct: 110 AQSGLTGRDGPDNPANGPRPL-------------YNVEKDAYVLAEGQNELQVPMTYTDA 156

Query: 179 GALSIIKTLTFYDDLHYDLKIAFKSPNN------------------LIPSYVITNGYRPV 220
+ KT Y + + + N L P + +
Sbjct: 157 AGNTFTKTFVLKRG-DYAVNVNYNVQNAGEKPLEISSFGQLKQSITLPPHLDTGSSNFAL 215

Query: 221 ADLDSYTFSGVLLENNDKKIEKIE---DKDAKEIKRFSNTLFLSSVDRYFTTLLFTKDSQ 277
+TF G D+K EK + D + + S +++ + +YF T +
Sbjct: 216 -----HTFRGAAYSTPDEKYEKYKFDTIADNENLNISSKGGWVAMLQQYFATAWIPHN-D 269

Query: 278 GFEALIDSEIGTKNPLGFISLKNEA-----------NLHGYIGPKDYRSLKAISPMLTDV 326
G + +G N + I K++ N ++GP+ + A++P L
Sbjct: 270 GTNNFYTANLG--NGIAAIGYKSQPVLVQPGQTGAMNSTLWVGPEIQDKMAAVAPHLDLT 327

Query: 327 IEYGLITFFAKGVFVLLDYLYQFVGNWGWAIIFLTIIVRIILYPLSYKGMVSMQKLKELA 386
++YG + F ++ +F LL +++ FVGNWG++II +T IVR I+YPL+ SM K++ L
Sbjct: 328 VDYGWLWFISQPLFKLLKWIHSFVGNWGFSIIIITFIVRGIMYPLTKAQYTSMAKMRMLQ 387

Query: 387 PKMKELQEKYKGEPQKLQAHMMQLYKKHGANPLGGCLPLILQIPVFFAIYRVLYNAVELK 446
PK++ ++E+ + Q++ MM LYK NPLGGC PL++Q+P+F A+Y +L +VEL+
Sbjct: 388 PKIQAMRERLGDDKQRISQEMMALYKAEKVNPLGGCFPLLIQMPIFLALYYMLMGSVELR 447

Query: 447 SSEWILWIHDLSIMDPYFILPLLMGASMYWHQSVTPNTMTDPMQAKIFKFLPLLFTIFLI 506
+ + LWIHDLS DPY+ILP+LMG +M++ Q ++P T+TDPMQ KI F+P++FT+F +
Sbjct: 448 QAPFALWIHDLSAQDPYYILPILMGVTMFFIQKMSPTTVTDPMQQKIMTFMPVIFTVFFL 507

Query: 507 TFPAGLVLYWTTNNILSVLQQLIINKVLENKKRMHAQNKKE 547
FP+GLVLY+ +N+++++QQ +I + LE K+ +H++ KK+
Sbjct: 508 WFPSGLVLYYIVSNLVTIIQQQLIYRGLE-KRGLHSREKKK 547


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits            Score   E-value   Comments
HPSJM_07370   IGASERPTASE     30      0.009     IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 30.4 bits (68), Expect = 0.009
Identities = 20/57 (35%), Positives = 28/57 (49%), Gaps = 7/57 (12%)

Query: 56 VKESVKEVKEESVKETNTKEIHQNNIEEKKQKLETETPQEE-KITPKPSKKNPKEES 111
KE + KET T E +E+K K+ETE QE K+T + S K + E+
Sbjct: 1088 SGSETKETQTTETKETATVE------KEEKAKVETEKTQEVPKVTSQVSPKQEQSET 1138


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits            Score   E-value   Comments
HPSJM_07375   TCRTETOQM       34      0.001     Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 34.1 bits (78), Expect = 0.001
Identities = 34/134 (25%), Positives = 54/134 (40%), Gaps = 25/134 (18%)

Query: 216 LSIVGKPNAGKSSLLNAMLLEERA---LVSDIKGTTR-DTIEE-------------VIEL 258
+ ++ +AGK++L ++L A L S KGTTR D +
Sbjct: 6 IGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQW 65

Query: 259 QGHKVRLIDTAGIRESADEIERLGIEKSLKSLENCDIILGVFDLSKPLEKEDFNLIDALN 318
+ KV +IDT G + E+ R SL L D + + ++ + L AL
Sbjct: 66 ENTKVNIIDTPGHMDFLAEVYR-----SLSVL---DGAILLISAKDGVQAQTRILFHALR 117

Query: 319 RAKKPCIVVLNKND 332
+ P I +NK D
Sbjct: 118 KMGIPTIFFINKID 131


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits            Score   E-value   Comments
HPSJM_07380   TONBPROTEIN     37      2e-04     Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 36.5 bits (84), Expect = 2e-04
Identities = 14/25 (56%), Positives = 15/25 (60%)

Query: 49 EQTIATTQEKPKPKPKPKPKPKPKP 73
+ EKPKPKPKPKPKP K
Sbjct: 82 PKEAPVVIEKPKPKPKPKPKPVKKV 106



Score = 36.1 bits (83), Expect = 3e-04
Identities = 12/21 (57%), Positives = 13/21 (61%)

Query: 57 EKPKPKPKPKPKPKPKPITPQ 77
KPKPKPKPKPKP +
Sbjct: 88 VIEKPKPKPKPKPKPVKKVQE 108



Score = 35.3 bits (81), Expect = 6e-04
Identities = 12/28 (42%), Positives = 12/28 (42%)

Query: 46 QTQEQTIATTQEKPKPKPKPKPKPKPKP 73
KPKPKPKPKPKP
Sbjct: 75 PEPIPEPPKEAPVVIEKPKPKPKPKPKP 102



Score = 32.7 bits (74), Expect = 0.004
Identities = 11/17 (64%), Positives = 13/17 (76%)

Query: 56 QEKPKPKPKPKPKPKPK 72
+ KPKPKPKPKP K +
Sbjct: 91 KPKPKPKPKPKPVKKVQ 107



Score = 31.1 bits (70), Expect = 0.013
Identities = 10/16 (62%), Positives = 12/16 (75%)

Query: 58 KPKPKPKPKPKPKPKP 73
KPKPKPKP K + +P
Sbjct: 95 KPKPKPKPVKKVQEQP 110


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits            Score   E-value   Comments
HPSJM_07390   BINARYTOXINB    34      0.001     Binary toxin B family signature.
		>BINARYTOXINB#Binary toxin B family signature.

Length = 764

Score = 33.5 bits (76), Expect = 0.001
Identities = 24/97 (24%), Positives = 37/97 (38%), Gaps = 6/97 (6%)

Query: 155 SKNMGDLLAKAMPIERILKAYSVPVGSLENYEKIYYQNAFKPKVQITFDNNSDAEIKNAL 214
+ N D L P + +A + G E + YQ + FD + IKN L
Sbjct: 536 AVNPSDPLETTKPDMTLKEALKIAFGFNEPNGNLQYQGKDITEFDFNFDQQTSQNIKNQL 595

Query: 215 ISAYAR-VLTPSDEEKLYQ-----IKNEVFTENANGI 245
A + T D+ KL I+++ F + N I
Sbjct: 596 AELNATNIYTVLDKIKLNAKMNILIRDKRFHYDRNNI 632


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits            Score   E-value   Comments
HPSJM_07400   LIPOLPP20       293     e-105     LPP20 lipoprotein precursor signature.
		>LIPOLPP20#LPP20 lipoprotein precursor signature.

Length = 175

Score = 293 bits (752), Expect = e-105
Identities = 174/175 (99%), Positives = 175/175 (100%)

Query: 1 MKNQVKKILGMSVIAAMVIVGCSHAPKSGISKSNKAYKEATKGAPDWVVGDLEKVAKYEK 60
MKNQVKKILGMSV+AAMVIVGCSHAPKSGISKSNKAYKEATKGAPDWVVGDLEKVAKYEK
Sbjct: 1 MKNQVKKILGMSVVAAMVIVGCSHAPKSGISKSNKAYKEATKGAPDWVVGDLEKVAKYEK 60

Query: 61 YSGVFLGRAEDLITNNDVDYSTNQATAKARANLAANLKSTLQKDLENEKTRTVDASGKRS 120
YSGVFLGRAEDLITNNDVDYSTNQATAKARANLAANLKSTLQKDLENEKTRTVDASGKRS
Sbjct: 61 YSGVFLGRAEDLITNNDVDYSTNQATAKARANLAANLKSTLQKDLENEKTRTVDASGKRS 120

Query: 121 ISGTDTEKISQLVDKELIASKMLARYVGKDRVFVLVGLDKQIVDKVREELGMVKK 175
ISGTDTEKISQLVDKELIASKMLARYVGKDRVFVLVGLDKQIVDKVREELGMVKK
Sbjct: 121 ISGTDTEKISQLVDKELIASKMLARYVGKDRVFVLVGLDKQIVDKVREELGMVKK 175



 
Contact Sachin Pundhir for Bugs/Comments.