PredictBias

identification of genomic and pathogenicity islands in prokaryotic genome
 
A) Input parameters
Genome: HUP_B14.gbk
Threshold dinucleotide bias: 2
Threshold codon bias: 4
Threshold %GC bias: 3
E-value (RPSBlast): 0.05
Genome (non-pathogenic):
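As a rough illustration of how the E-value setting above might act on the virulence-factor hits reported further down, the following Python sketch keeps only hits at or below a cutoff. It assumes the 0.05 value is an inclusive upper bound, which is consistent with the hits listed on this page but is not confirmed by it. `Hit`, `filter_hits`, and the last example row are hypothetical names and data made up for illustration.

```python
from typing import List, NamedTuple


class Hit(NamedTuple):
    """One row of a virulence-factor hit summary (hypothetical container)."""
    locus_tag: str
    signature: str
    score_bits: float
    e_value: float


def filter_hits(hits: List[Hit], max_e_value: float = 0.05) -> List[Hit]:
    """Keep hits whose E-value is at or below the cutoff (assumed inclusive)."""
    return [h for h in hits if h.e_value <= max_e_value]


hits = [
    Hit("HPB14_00355", "UREASE", 1043.0, 0.0),
    Hit("HPB14_00400", "BACINVASINB", 30.1, 0.036),
    Hit("HPB14_00430", "cloacin", 29.7, 0.048),
    # Made-up row, not from this report, to show a hit being rejected.
    Hit("EXAMPLE_0001", "EXAMPLE", 25.0, 0.2),
]

kept = filter_hits(hits)
for h in kept:
    print(h.locus_tag, h.signature, h.e_value)
```

The first three rows are taken from the hit tables on this page; only the made-up fourth row is dropped under the assumed cutoff.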
 
 
C) Potential GIs and PAIs in CP003486
S.No  Start        End          Bias  Virulence  Insertion elements  Prediction
1     HPB14_00100  HPB14_00145  Y     N          N                   Genomic Island

LocusTag     DNBias  CDNBias    %GCBias  Product
HPB14_00100       0       15   3.005042  lipid A 1-phosphatase
HPB14_00105       0       15   2.827354  lipid A phosphoethanolamine transferase
HPB14_00110       2       15   3.295563  hypothetical protein
HPB14_00115       2       16   3.298559  outer membrane protein HopD
HPB14_00120       2       14   0.698673  hypothetical protein
HPB14_00125       2       14   0.652450  type II citrate synthase
HPB14_00130       3       16  -0.192873  isocitrate dehydrogenase
HPB14_00135       0       12  -0.761147  hypothetical protein
HPB14_00140       1       13  -0.622337  dethiobiotin synthetase
HPB14_00145       2       14  -0.554159  hypothetical protein
2     HPB14_00330  HPB14_00430  Y     Y          Y                   Pathogenicity Island (biased-composition)

LocusTag     DNBias  CDNBias    %GCBias  Product
HPB14_00330       4       21   2.949879  urease accessory protein UreG
HPB14_00335       4       23   2.538042  urease accessory protein
HPB14_00340       5       25   2.652655  urease accessory protein UreE
HPB14_00345       5       26   2.284297  urease accessory protein / pH-dependent
HPB14_00350       2       18   1.840534  hypothetical protein
HPB14_00355       1       18   1.972830  urease subunit beta
HPB14_00360      -2        9   0.979740  urease subunit alpha
HPB14_00365      -2       12   1.235419  hypothetical protein
HPB14_00370       1       12   1.517979  *lipoprotein signal peptidase
HPB14_00375       1       12   1.248879  phosphoglucosamine mutase
HPB14_00380       2       14   1.249106  30S ribosomal protein S20
HPB14_00385       2       13   1.389184  peptide chain release factor 1
HPB14_00390       3       15   1.050572  hypothetical protein
HPB14_00395       2       14   0.677002  hypothetical protein
HPB14_00400      -2       13   0.098928  methyl-accepting chemotaxis protein
HPB14_00405       1       13  -0.115373  30S ribosomal protein S9
HPB14_00410       1       11  -0.039617  50S ribosomal protein L13
HPB14_00415       1       10   0.304747  hypothetical protein
HPB14_00420       1       10  -0.779866  malate:quinone oxidoreductase
HPB14_00425       1       10  -0.981956  hypothetical protein
HPB14_00430       3       11  -1.337946  RNA polymerase sigma factor RpoD
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
HPB14_00355  UREASE  1043   0.0      Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 1043 bits (2698), Expect = 0.0
Identities = 353/569 (62%), Positives = 442/569 (77%), Gaps = 4/569 (0%)

Query: 3 KISRKEYVSMYGPTTGDKVRLGDTDLIAEVEHDYTIYGEELKFGGGKTLREGMSQSN-NP 61
++SR Y +M+GPT GDKVRL DT+L EVE D+T +GEE+KFGGGK +R+GM QS
Sbjct: 4 RMSRAAYANMFGPTVGDKVRLADTELFIEVEKDFTTHGEEVKFGGGKVIRDGMGQSQVTR 63

Query: 62 SKEELDLIITNALIVDYTGIYKADIGIKDGKIAGIGKGGNKDMQDGVKNNLSVGPATEAL 121
+D +ITNALI+D+ GI KADIG+KDG+IA IGK GN DMQ GV + VGP TE +
Sbjct: 64 EGGAVDTVITNALILDHWGIVKADIGLKDGRIAAIGKAGNPDMQPGV--TIIVGPGTEVI 121

Query: 122 AGEGLIVTAGGIDTHIHFISPQQIPTAFASGVTTMIGGGTGPADGTNATTITPGRRNLKW 181
AGEG IVTAGG+D+HIHFI PQQI A SG+T M+GGGTGPA GT ATT TPG ++
Sbjct: 122 AGEGKIVTAGGMDSHIHFICPQQIEEALMSGLTCMLGGGTGPAHGTLATTCTPGPWHIAR 181

Query: 182 MLRAAEEYSMNLGFLAKGNASNDASLADQIEAGAIGFKIHEDWGTTPSAINHALDVADKY 241
M+ AA+ + MNL F KGNAS +L + + GA K+HEDWGTTP+AI+ L VAD+Y
Sbjct: 182 MIEAADAFPMNLAFAGKGNASLPGALVEMVLGGATSLKLHEDWGTTPAAIDCCLSVADEY 241

Query: 242 DVQVAIHTDTLNEAGCVEDTMAAIAGRTMHTFHTEGAGGGHAPDIIKVAGEHNILPASTN 301
DVQV IHTDTLNE+G VEDT+AAI GRT+H +HTEGAGGGHAPDII++ G+ N++P+STN
Sbjct: 242 DVQVMIHTDTLNESGFVEDTIAAIKGRTIHAYHTEGAGGGHAPDIIRICGQPNVIPSSTN 301

Query: 302 PTIPFTVNTEAEHMDMLMVCHHLDKSIKEDVQFADSRIRPQTIAAEDTLHDMGIFSITSS 361
PT P+TVNT AEH+DMLMVCHHL +I ED+ FA+SRIR +TIAAED LHD+G FSI SS
Sbjct: 302 PTRPYTVNTLAEHLDMLMVCHHLSPTIPEDIAFAESRIRKETIAAEDILHDIGAFSIISS 361

Query: 362 DSQAMGRVGEVITRTWQTADKNKKEFGRLKEEKGDNDNFRIKRYLSKYTINPAIAHGISE 421
DSQAMGRVGEV RTWQTADK K++ GRLKEE GDNDNFR+KRY++KYTINPAIAHG+S
Sbjct: 362 DSQAMGRVGEVAIRTWQTADKMKRQRGRLKEETGDNDNFRVKRYIAKYTINPAIAHGLSH 421

Query: 422 YVGSVEVGKVADLVLWSPAFFGVKPNMIIKGGFIALSQMGDANASIPTPQPVYYREMFAH 481
+GS+EVGK ADLVLW+PAFFGVKP+M++ GG IA + MGD NASIPTPQPV+YR MF
Sbjct: 422 EIGSLEVGKRADLVLWNPAFFGVKPDMVLLGGTIAAAPMGDPNASIPTPQPVHYRPMFGA 481

Query: 482 HGKAKYDANITFVSQVAYDKGIKEELGLERQVLPVKNCR-NITKKDMQFNDTTAHIEVNP 540
+G+++ ++++TFVSQ + D G+ LG+ ++++ V+N R I K M N T HIEV+P
Sbjct: 482 YGRSRTNSSVTFVSQASLDAGLAGRLGVAKELVAVQNTRGGIGKASMIHNSLTPHIEVDP 541

Query: 541 ETYHVFVDGKEVTSKPANKVSLAQLFSIF 569
ETY V DG+ +T +PA + +AQ + +F
Sbjct: 542 ETYEVRADGELLTCEPATVLPMAQRYFLF 570
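
The percentages in each alignment statistics line appear to be the raw counts divided by the alignment length and truncated to a whole number rather than rounded (for HPB14_00355 above, 442/569 is about 77.7% and is reported as 77%). The Python sketch below parses such a line and recomputes the percentages under that assumption. `STATS_RE` and `parse_stats` are hypothetical names introduced for illustration, not part of PredictBias or BLAST.

```python
import re

# Matches fields such as "Identities = 353/569 (62%)" in a statistics line.
STATS_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(line: str) -> dict:
    """Return count, length, reported and recomputed percentage per field."""
    stats = {}
    for name, count, length, pct in STATS_RE.findall(line):
        count, length = int(count), int(length)
        stats[name] = {
            "count": count,
            "length": length,
            "reported_pct": int(pct),
            # Integer division truncates, matching the values seen on this page.
            "truncated_pct": 100 * count // length,
        }
    return stats


line = "Identities = 353/569 (62%), Positives = 442/569 (77%), Gaps = 4/569 (0%)"
stats = parse_stats(line)
for name, field in stats.items():
    print(name, field["count"], field["length"], field["truncated_pct"])
```

Lines without a Gaps field, such as the cloacin hit, simply yield two entries instead of three.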


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
HPB14_00400  BACINVASINB  30     0.036    Salmonella/Shigella invasin protein B signature.
		>BACINVASINB#Salmonella/Shigella invasin protein B signature.

Length = 593

Score = 30.1 bits (67), Expect = 0.036
Identities = 41/181 (22%), Positives = 67/181 (37%), Gaps = 40/181 (22%)

Query: 436 ANIEALNNALEHYKDLDFTHHIQNPKANMEKALNTLGQEISSMLKASLGFANA------L 489
A+I+ + A Y K L ++ S+ A G+A A
Sbjct: 145 ASIKKTDTAKSVYDAA-------------TKKLTQAQNKLQSLDPADPGYAQAEAAVEQA 191

Query: 490 NHESKDLKTCVDNLTKTAHKQERSLKNTTQSLEEITNIIT----TIDSKSQEMISQGED- 544
E+ + K +D T K + + E+ NI+T T ++ SQ +SQGE
Sbjct: 192 GKEATEAKEALDKATDATVK---AGTDAKAKAEKADNILTKFQGTANAASQNQVSQGEQD 248

Query: 545 -------IKSVVDMIRDIADQT------NLLALNAAIEAARAGEHGRGFAVVADEVRKLA 591
+ ++ M +I + N LAL A++ R E + A +E RK
Sbjct: 249 NLSNVARLTMLMAMFIEIVGKNTEESLQNDLALFNALQEGRQAEMEKKSAEFQEETRKAE 308

Query: 592 E 592
E
Sbjct: 309 E 309


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
HPB14_00430  cloacin  30     0.048    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 29.7 bits (66), Expect = 0.048
Identities = 12/44 (27%), Positives = 23/44 (52%)

Query: 14 AKQEAKAEATQEVAQENKTKENNKAKESKIKENKTKESKIKEAK 57
AK+++ A+A A E++ K+ +K + ++ N K K K
Sbjct: 415 AKEKSDADAALSSAMESRKKKEDKKRSAENNLNDEKNKPRKGFK 458


3     HPB14_01465  HPB14_01610  Y     Y          N                   Pathogenicity Island (biased-composition)

LocusTag     DNBias  CDNBias    %GCBias  Product
HPB14_01465       1       16   3.710304  50S ribosomal protein L21
HPB14_01470       1       16   3.600166  50S ribosomal protein L27
HPB14_01475       0       15   3.599621  peptide ABC transporter substrate-binding
HPB14_01480       1       14   3.951383  dipeptide permease
HPB14_01485       0       12   3.018202  dipeptide transport system permease protein
HPB14_01490      -2       13   2.746679  peptide ABC transporter ATP-binding protein
HPB14_01495      -2       14   2.470534  dipeptide ABC transporter ATP-binding protein
HPB14_01500      -2       13   2.119636  GTPase CgtA
HPB14_01505      -1       14   1.743686  hypothetical protein
HPB14_01510       0       18   2.265992  hypothetical protein
HPB14_01515       1       18   2.963023  glutamate-1-semialdehyde aminotransferase
HPB14_01520       3       18   2.296634  hypothetical protein
HPB14_01525       3       16   1.949965  hypothetical protein
HPB14_01530       3       14   2.059724  hypothetical protein
HPB14_01535       0       12   0.886891  polysaccharide deacetylase
HPB14_01540       0       13   0.166765  hypothetical protein
HPB14_01545      -1       15  -0.421290  ATP-binding protein
HPB14_01550       1       16  -1.239847  nitrite extrusion protein
HPB14_01555       2       16  -1.948531  putative heme iron utilization protein
HPB14_01560       3       15  -2.330229  arginyl-tRNA synthetase
HPB14_01565       2       15  -1.877872  Sec-independent protein translocase protein
HPB14_01570       2       14  -1.412032  guanylate kinase
HPB14_01575       1       14  -1.392451  polyE-rich protein
HPB14_01580      -1       13  -1.640638  hypothetical protein
HPB14_01585      -1       13  -1.910911  nuclease NucT
HPB14_01590       1       12  -2.066485  hypothetical protein
HPB14_01595       3       13  -2.352507  flagellar basal body L-ring protein
HPB14_01600       3       11  -2.050128  putative acylneuraminate cytidylyltransferase
HPB14_01605       2       11  -1.329504  CMP-N-acetyl neuraminic acid synthetase
HPB14_01610       2       12  -0.940126  flagellar biosynthesis protein G
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
HPB14_01550  TCRTETB  32     0.003    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 32.2 bits (73), Expect = 0.003
Identities = 37/207 (17%), Positives = 84/207 (40%), Gaps = 1/207 (0%)

Query: 23 VLIPLLILSGSLTPHQSFQLGIAVLMGYVFGSFLIQFLSPLMSLESIAKISFGLIALSFL 82
V +P + + P + + A ++ + G+ + LS + ++ + + +
Sbjct: 35 VSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSV 94

Query: 83 VCYFDSIPFFWLWIWRFIAGVASSALMILVAPLSLPYVKEHKKALVGGLIFSAVGIGSVF 142
+ + F L + RFI G ++A LV + Y+ + + GLI S V +G
Sbjct: 95 IGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGV 154

Query: 143 SGFVLPWISSYNIKWAWIFLGGSCLIAFILSLVGLKTRSLRKKSVKKEESAFKITFHLWL 202
+ I+ Y I W+++ L I + L+ L + +R K + ++ +
Sbjct: 155 GPAIGGMIAHY-IHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVF 213

Query: 203 LLISCALNAIGFLPHTLFWVDYLIRHL 229
++ +I FL ++ ++H+
Sbjct: 214 FMLFTTSYSISFLIVSVLSFLIFVKHI 240


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
HPB14_01570  PF05272  29     0.011    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.3 bits (65), Expect = 0.011
Identities = 9/18 (50%), Positives = 11/18 (61%)

Query: 8 LILSGPSGAGKSTLTKYL 25
++L G G GKSTL L
Sbjct: 599 VVLEGTGGIGKSTLINTL 616


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
HPB14_01575  IGASERPTASE  65     3e-13    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 65.5 bits (159), Expect = 3e-13
Identities = 63/292 (21%), Positives = 99/292 (33%), Gaps = 29/292 (9%)

Query: 176 ETPQKEKQEVKETPQEEKE-----EVKETPQEEEKPKDDETQESETPKDEEVSKELEMQE 230
TP + +V P +E E P P + +E K E + E Q+
Sbjct: 998 TTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQD 1057

Query: 231 KLEIPKEETQEEVKEEMKEETQDSPSAQELEAMQELVKEIQENSNDQEDKKETQESAEAL 290
E EV +E K + + E+ KE Q + E +E A+
Sbjct: 1058 ATE--TTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVE 1115

Query: 291 QETQAHELEKQEIAE-TPQEKEKQEDTETPQDVEIPQSQEKETQETQEVVTEKTQVQEKE 349
E + QE+ + T Q KQE +ET Q P + T V ++ Q Q
Sbjct: 1116 TE------KTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPT-----VNIKEPQSQTNT 1164

Query: 350 TPKTQEDHYENIEDIPEPVMAKAMGEELPFLNEAVAKTPNNENDTETPKESVTEIS---- 405
T T++ E ++ +PV +V + P N T +E S
Sbjct: 1165 TADTEQPAKETSSNVEQPVTESTTVNTGN----SVVENPENTTPATTQPTVNSESSNKPK 1220

Query: 406 -KNENATEFPQEKEESDKTSSPLELRLNLQDLLKSLNQESLKSLLENKTLSI 456
++ + E TSS + L DL S N ++ S K +
Sbjct: 1221 NRHRRSVRSVPHNVEPATTSSNDRSTVALCDLT-STNTNAVLSDARAKAQFV 1271



Score = 65.5 bits (159), Expect = 3e-13
Identities = 44/221 (19%), Positives = 78/221 (35%), Gaps = 17/221 (7%)

Query: 155 PNNEEQLLPTLDVQEEKEEIKETPQKEKQEVKETPQEEKEEVKETPQEEEKPKDDETQES 214
P T V E ++ +T +K +Q+ ET + +E KE + TQ +
Sbjct: 1028 PAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVK----ANTQTN 1083

Query: 215 ETPKDEEVSKELEMQEKLEIPKEETQEEVKEEMKEETQDSPS-AQELEAMQELVKEIQEN 273
E + +KE + E E E +E+ K E E+TQ+ P ++ QE + +Q
Sbjct: 1084 EVAQSGSETKETQTTETKETATVEKEEKAKVE-TEKTQEVPKVTSQVSPKQEQSETVQPQ 1142

Query: 274 SNDQEDKKETQESAEALQETQAHELEKQEIAETPQEKEKQEDT-----------ETPQDV 322
+ + T E +T +Q ET E+ E P++
Sbjct: 1143 AEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENT 1202

Query: 323 EIPQSQEKETQETQEVVTEKTQVQEKETPKTQEDHYENIED 363
+Q E+ + + + P E + D
Sbjct: 1203 TPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSND 1243



Score = 44.7 bits (105), Expect = 8e-07
Identities = 34/207 (16%), Positives = 70/207 (33%), Gaps = 15/207 (7%)

Query: 111 QKKLGSNMSELEPSQNLDPTQEVLETNWDELENLGDLEALAKEEPNNEEQLLPTLDVQEE 170
+ + E N+ + E E KE E++ ++
Sbjct: 1060 ETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKE-------EKA 1112

Query: 171 KEEIKETPQKEKQEVKETPQEEKEEVKETPQEEEKPKDDETQESETPKDEEVSKELEMQE 230
K E ++T + K + +P++E+ E PQ E ++D T + P+ + + Q
Sbjct: 1113 KVETEKTQEVPKVTSQVSPKQEQSE-TVQPQAEPARENDPTVNIKEPQSQTNTTADTEQ- 1170

Query: 231 KLEIPKEETQEEVKEEMKEETQDSPSAQELE-AMQELVKEIQENSNDQEDKKETQESAEA 289
P +ET V++ + E T + +E Q N + K +
Sbjct: 1171 ----PAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRS 1226

Query: 290 LQETQAHELEKQEIAETPQEKEKQEDT 316
++ + H +E + + D
Sbjct: 1227 VR-SVPHNVEPATTSSNDRSTVALCDL 1252


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPB14_01595  FLGLRINGFLGH  190    7e-63    Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 190 bits (483), Expect = 7e-63
Identities = 51/172 (29%), Positives = 84/172 (48%), Gaps = 18/172 (10%)

Query: 56 GERPLFADRRAMKPNDLITIIVSEKASANYSSS----KDYKSTSGGNSTPPRLTYNGLDE 111
G +PLF DRR D +TI++ E SA+ SSS +D K+ G ++ P L GL
Sbjct: 61 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYL--QGLFG 118

Query: 112 RKKQEAQYLDDKNNYNFTKSSNNTNFKGGGSQKKSEDLEIVLSARIIKVLENGNYFIYGN 171
+ + + S F G G S L+ + +VL NGN + G
Sbjct: 119 NARADVEA------------SGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGE 166

Query: 172 KEVLVDGEKQILKVSGVIRPYDIERNNTIQSKFLADAKIEYTNLGHLSDSNK 223
K++ ++ + ++ SGV+ P I +NT+ S +ADA+IEY G+++++
Sbjct: 167 KQIAINQGTEFIRFSGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQN 218


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPB14_01610  SACTRNSFRASE  28     0.016    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 28.0 bits (62), Expect = 0.016
Identities = 14/49 (28%), Positives = 21/49 (42%), Gaps = 3/49 (6%)

Query: 102 KGRIILKALERIAFE---EFQLHSLHLEVMENNFKAIAFYEKNHYELEG 147
+ + + AL A E E L LE + N A FY K+H+ +
Sbjct: 102 RKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGA 150


4     HPB14_02125  HPB14_02150  Y     Y          Y                   Pathogenicity Island (biased-composition)

LocusTag     DNBias  CDNBias    %GCBias  Product
HPB14_02125      -2       13  -3.352260  dihydroorotate dehydrogenase 2
HPB14_02130      -2       13  -3.550414  polyphosphate kinase
HPB14_02135      -2       16  -4.154185  *type I restriction enzyme S protein
HPB14_02140      -1       17  -3.251977  type I restriction enzyme M protein
HPB14_02145       0       16  -3.378602  type I restriction enzyme R protein
HPB14_02150       2       16  -2.676440  type I restriction enzyme R protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
HPB14_02145  FLGHOOKAP1  32     0.011    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 31.9 bits (72), Expect = 0.011
Identities = 19/145 (13%), Positives = 52/145 (35%), Gaps = 24/145 (16%)

Query: 417 YDKTTDDYLKEL-NQFNQSDSNIKDNLKDMFADRKVLEKDIKNAYDDLFNYPIDDVEAMT 475
+ + ++ N + S S++ ++D F + L + D A
Sbjct: 83 GLTARYEQMSKIDNMLSTSTSSLATQMQDFF-----------TSLQTLVSNAED--PAAR 129

Query: 476 SAIVSISEMNELLKVSHAINTLKERYNLIRTSNDEKILSLKEKMDIEKISKISSMLHKKA 535
A++ SE + + T + R + + +++ +++I+ + +
Sbjct: 130 QALIGKSEG-----LVNQFKTTDQYL---RDQDKQ--VNIAIGASVDQINNYAKQIASLN 179

Query: 536 KHLHALKNINEPKNPNDLMILEDLI 560
+ L + +PN+L+ D +
Sbjct: 180 DQISRLTGVGAGASPNNLLDQRDQL 204


5     HPB14_02195  HPB14_02265  Y     Y          N                   Pathogenicity Island (biased-composition)

LocusTag     DNBias  CDNBias    %GCBias  Product
HPB14_02195      -1       14  -3.366907  ABC transporter substrate-binding protein
HPB14_02200      -1       12  -3.511404  molybdenum ABC transporter ModB
HPB14_02205      -1        9  -1.864428  molybdenum ABC transporter ModD
HPB14_02210      -1       11  -2.215089  glutamyl-tRNA synthetase
HPB14_02215      -1       12  -2.749332  hypothetical protein
HPB14_02220      -2       12  -2.402513  type II DNA modification (methyltransferase)
HPB14_02225      -1       12  -1.013165  type II adenine specific methyltransferase
HPB14_02230       3       14   0.758850  GTP-binding protein TypA
HPB14_02235       6       17  -0.359022  type II DNA modification enzyme
HPB14_02240       5       18   0.539850  type II restriction endonuclease
HPB14_02245       4       17   0.280465  type II DNA modification (methyltransferase)
HPB14_02260       5       20   0.451587  catalase-like protein
HPB14_02265       5       22   0.262471  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
HPB14_02205  PF05272  30     0.009    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.0 bits (67), Expect = 0.009
Identities = 11/23 (47%), Positives = 14/23 (60%)

Query: 30 VVALLGESGAGKSTILRILAGLE 52
V L G G GKST++ L GL+
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLD 620


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
HPB14_02230  TCRTETOQM  196    3e-57    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 196 bits (501), Expect = 3e-57
Identities = 115/461 (24%), Positives = 190/461 (41%), Gaps = 67/461 (14%)

Query: 3 NIRNIAVIAHVDHGKTTLVDGLLSQSGTFSEREKVDE--RVMDSNDLERERGITILSKNT 60
I NI V+AHVD GKTTL + LL SG +E VD+ D+ LER+RGITI + T
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61

Query: 61 AIYYKDTKINIIDTPGHADFGGEVERVLKMVDGVLLLVDAQEGVMPQTKFVVKKALSFGI 120
+ +++TK+NIIDTPGH DF EV R L ++DG +LL+ A++GV QT+ + GI
Sbjct: 62 SFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGI 121

Query: 121 CPIVVVNKIDKPAAEPDRVVDEVFDLF---------VAMGASDKQLDFPV-----VYAAA 166
I +NKID+ + V ++ + V + + +F
Sbjct: 122 PTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEG 181

Query: 167 RDGYAMKSLDDE----------------------------KKNL--EPLFETILEHVPSP 196
D K + + K N+ + L E I S
Sbjct: 182 NDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITNKFYSS 241

Query: 197 SGSVDEPLQMQIFTLDYDNYVGKIGIARVFNGSVKKNESVLLMKSDGSKENGRITKLIGF 256
+ L ++F ++Y ++ R+++G + +SV + KE +IT++
Sbjct: 242 THRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRI----SEKEKIKITEMYTS 297

Query: 257 LGLARTEIENAYAGDIVAIAG--FNAMDV-GDSVVDPANPMPLDPMHLEEPTMSVYFAVN 313
+ +I+ AY+G+IV + V GD+ + P +P P + +
Sbjct: 298 INGELCKIDKAYSGEIVILQNEFLKLNSVLGDTKLLPQRERIENP----LPLLQTTVEPS 353

Query: 314 DSPLAGLEGKHVTANKLKDRLLKEMQTNIAMKCEEMGEGKFKVSGRGELQITILAENLRR 373
+ + D LL+ + + +S G++Q+ + L+
Sbjct: 354 KPQQREMLLDALLEISDSDPLLRYYVDSAT--------HEIILSFLGKVQMEVTCALLQE 405

Query: 374 E-GFEFSISRPEVIIKEENGVKCEPFEHLVIDTPQDFSGAI 413
+ E I P VI E K E H+ + P F +I
Sbjct: 406 KYHVEIEIKEPTVIYMERPLKKAEYTIHIEVP-PNPFWASI 445



Score = 41.8 bits (98), Expect = 8e-06
Identities = 20/80 (25%), Positives = 30/80 (37%), Gaps = 1/80 (1%)

Query: 396 EPFEHLVIDTPQDFSGAIIERLGKRKAEMKAMNPMSDGYTRLEFEIPARGLIGYRSEFLT 455
EP+ I PQ++ K A + + + L EIPAR + YRS+
Sbjct: 537 EPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQ-LKNNEVILSGEIPARCIQEYRSDLTF 595

Query: 456 DTKGEGVMNHSFLEFRPFSG 475
T G V + +G
Sbjct: 596 FTNGRSVCLTELKGYHVTTG 615


6     HPB14_03145  HPB14_03285  Y     Y          N                   Pathogenicity Island (biased-composition)

LocusTag     DNBias  CDNBias    %GCBias  Product
HPB14_03145       2       17   0.059883  outer membrane protein
HPB14_03150      -2       12  -0.101927  excinuclease ABC subunit A
HPB14_03155       0       18  -1.330625  hypothetical protein
HPB14_03160       0       13   0.181071  hypothetical protein
HPB14_03165       0       12   0.297263  hypothetical protein
HPB14_03170       0       10  -0.266423  hypothetical protein
HPB14_03175       0       11   0.033355  transcriptional activator of flagellar proteins
HPB14_03180       0       10   0.149797  hypothetical protein
HPB14_03185       2       11   1.489044  DNA gyrase subunit A
HPB14_03190       3       14   2.811353  diacylglycerol kinase
HPB14_03195       3       13   2.448958  hypothetical protein
HPB14_03200       4       15   3.066943  hypothetical protein
HPB14_03205       4       15   3.306440  hypothetical protein
HPB14_03210       3       14   3.413041  N-methylhydantoinase
HPB14_03215       1       14   3.722253  hydantoin utilization protein A
HPB14_03220       1       13   2.056106  putative outer membrane protein
HPB14_03225      -1       11   1.887867  short-chain fatty acids transporter
HPB14_03230      -2       12   1.596397  succinyl-CoA-transferase subunit B
HPB14_03235      -2       11   1.497759  succinyl-CoA-transferase subunit A
HPB14_03240      -2       12   1.622484  acetyl-CoA acetyltransferase
HPB14_03245      -1       12   0.632740  hypothetical protein
HPB14_03250       1       13   1.443744  ferrous iron transport protein B
HPB14_03265       4       21   1.866703  flagellar biosynthesis protein FliP
HPB14_03270       3       15   1.908610  bifunctional N-acetylglucosamine-1-phosphate
HPB14_03275       3       15   1.516220  hypothetical protein
HPB14_03280       2       12   0.689821  hypothetical protein
HPB14_03285       2       10   0.369794  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
HPB14_03175  HTHFIS  396    e-138    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 396 bits (1018), Expect = e-138
Identities = 128/384 (33%), Positives = 197/384 (51%), Gaps = 9/384 (2%)

Query: 2 KIAIVEDDINMRKSLELFFELQDDLEIVSFKNPKDALAKL-DESFDLVITDINMPHMDGL 60
I + +DD +R L + ++ N + DLV+TD+ MP +
Sbjct: 5 TILVADDDAAIRTVLNQALS-RAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 61 EFLRLLEGKYES---IVITGNATLNKAIDSIRLGVKDFFQKPFKPELLLESIYRTKKVLE 117
+ L ++ +V++ T AI + G D+ KPF E I + L
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLT---ELIGIIGRALA 120

Query: 118 FQKKHPLEKPLKKPHKHSFLAASKALEESKRQALKVASTDANVMLLGESGVGKEVFAHFI 177
K+ P + + S A++E R ++ TD +M+ GESG GKE+ A +
Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARAL 180

Query: 178 HQHSQRSKHPFIAINMSAIPEHLLESELFGYQKGAFTDATAPKMGLFESANKGTIFLDEI 237
H + +R PF+AINM+AIP L+ESELFG++KGAFT A G FE A GT+FLDEI
Sbjct: 181 HDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEI 240

Query: 238 AEMPFQLQSKLLRVVQEKEITRLGDNKSVKIDVRFISATNANMKEKIASKEFREDLFFRL 297
+MP Q++LLRV+Q+ E T +G ++ DVR ++ATN ++K+ I FREDL++RL
Sbjct: 241 GDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRL 300

Query: 298 QIVPITIAPLRERVEEILPIAEIKLKEVCDAYHLGPKSFSKNAAKRLLEYSWHGNVRELL 357
+VP+ + PLR+R E+I + +++ L K F + A + + + W GNVREL
Sbjct: 301 NVVPLRLPPLRDRAEDIPDLVRHFVQQAE-KEGLDVKRFDQEALELMKAHPWPGNVRELE 359

Query: 358 GVVERAAILSEGAEIQEKDLFLER 381
+V R L I + + E
Sbjct: 360 NLVRRLTALYPQDVITREIIENEL 383


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPB14_03265  FLGBIOSNFLIP  276    2e-96    Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature.

Length = 245

Score = 276 bits (708), Expect = 2e-96
Identities = 113/245 (46%), Positives = 161/245 (65%), Gaps = 2/245 (0%)

Query: 1 MRFFIFFILICPLICPLMSADSALPSVNLSLNAPNDPKQLVTTLNVIALLTLLVLAPSLI 60
MR + + L A + LP + S P + + + +T L P+++
Sbjct: 1 MRRLLSVAPVL-LWLITPLAFAQLPGIT-SQPLPGGGQSWSLPVQTLVFITSLTFIPAIL 58

Query: 61 LVMTSFTRLIVVFSFLRTALGTQQTPPTQILVSLSLILTFFIMEPSLKKAYDTGIKPYMD 120
L+MTSFTR+I+VF LR ALGT PP Q+L+ L+L LTFFIM P + K Y +P+ +
Sbjct: 59 LMMTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSE 118

Query: 121 KKISYTEAFEKSALPFKEFMLKNTREKDLALFFRIRNLPNPKTPDDVSLSVLIPAFMISE 180
+KIS EA EK A P +EFML+ TRE DL LF R+ N + P+ V + +L+PA++ SE
Sbjct: 119 EKISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSE 178

Query: 181 LKTAFQIGFLLYLPFLVIDMVISSILMAMGMMMLPPVMISLPFKILVFILVDGFNLLTEN 240
LKTAFQIGF +++PFL+ID+VI+S+LMA+GMMM+PP I+LPFK+++F+LVDG+ LL +
Sbjct: 179 LKTAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGS 238

Query: 241 LVASF 245
L SF
Sbjct: 239 LAQSF 243


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
HPB14_03275  PF07132  33     1e-04    Harpin protein (HrpN)
		>PF07132#Harpin protein (HrpN)

Length = 356

Score = 33.1 bits (75), Expect = 1e-04
Identities = 19/45 (42%), Positives = 31/45 (68%)

Query: 37 IGEGVGAGMGGAMGGMIGALGGPWGTVFGAGIGGGIGAYSGAEIG 81
+G +G G+GG +GG+ +LGG G + G G+GGG+G+ G+ +G
Sbjct: 61 MGSMMGGGLGGGLGGLGSSLGGLGGGLLGGGLGGGLGSSLGSGLG 105



Score = 30.0 bits (67), Expect = 0.001
Identities = 17/50 (34%), Positives = 27/50 (54%)

Query: 33 LGRDIGEGVGAGMGGAMGGMIGALGGPWGTVFGAGIGGGIGAYSGAEIGD 82
+G +G G+G G+GG + G GG G G G+G +G+ G+ +G
Sbjct: 61 MGSMMGGGLGGGLGGLGSSLGGLGGGLLGGGLGGGLGSSLGSGLGSALGG 110


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
HPB14_03285  PHAGEIV  26     0.025    Gene IV protein signature.
		>PHAGEIV#Gene IV protein signature.

Length = 426

Score = 26.4 bits (58), Expect = 0.025
Identities = 12/36 (33%), Positives = 15/36 (41%)

Query: 34 GKLIGGGVGGFVGDKIGGAIGVPGGPVGIGLGRFLG 69
G G GG D++ + GG GI G LG
Sbjct: 220 GSQRGTVAGGVNTDRLTSVLSSAGGSFGIFNGDVLG 255


7     HPB14_03935  HPB14_04155  Y     Y          Y                   Pathogenicity Island (biased-composition)

LocusTag     DNBias  CDNBias    %GCBias  Product
HPB14_03935       2       15  -1.383721  50S ribosomal protein L31
HPB14_03940       2       14  -1.685479  transcription termination factor Rho
HPB14_03945       4       17  -3.261321  glutamate racemase
HPB14_03950       5       17  -3.844697  regulator of nonsense transcripts 1
HPB14_03965       3       13  -3.365991  regulator of nonsense transcripts 1
HPB14_03970       0       10  -1.994300  hypothetical protein
HPB14_03975      -2        9  -1.518754  hypothetical protein
HPB14_03990       0       10  -2.772151  conserved hypothetical secreted protein
HPB14_03995       0        9  -1.720616  GTPase Era
HPB14_04000       1        9  -1.788746  ATP-dependent protease ATP-binding subunit HslU
HPB14_04005       0       10  -2.062878  ATP-dependent protease subunit HslV
HPB14_04010       1       12  -2.620986  50S ribosomal protein L9
HPB14_04015       2       14  -3.323145  hypothetical protein
HPB14_04020       4       16  -1.573067  glutamine synthetase
HPB14_04025       9       19  -3.124526  hypothetical protein
HPB14_04030       9       18  -2.570768  IS606 transposase
HPB14_04035       9       18  -2.336043  cag pathogenicity island protein 1
HPB14_04040       9       18  -2.586308  HP0521B-like protein
HPB14_04045       9       17  -2.204174  cag pathogenicity island protein 3
HPB14_04050       8       19  -3.331133  cag pathogenicity island protein Gamma
HPB14_04055       8       23  -3.223170  CAG pathogenicity island protein 5
HPB14_04060       9       25  -4.034250  cag island protein, DNA transfer protein
HPB14_04065       8       27  -4.576159  cag pathogenicity island protein (cag6)
HPB14_04070      10       27  -4.173373  hypothetical protein
HPB14_04085      11       27  -4.398695  cag island protein
HPB14_04090      10       26  -4.363004  cag pathogenicity island protein W
HPB14_04095      12       24  -5.007302  cag pathogenicity island protein V
HPB14_04100      12       25  -4.870627  cag pathogenicity island protein (cag11)
HPB14_04105      11       21  -4.006724  CAG pathogenicity island protein T
HPB14_04110       8       20  -3.146369  CAG pathogenicity island protein S
HPB14_04115       6       18  -2.712662  cag island protein
HPB14_04120       7       20  -2.983250  cag pathogenicity island protein CagN
HPB14_04125       5       20  -2.878116  cag pathogenicity island protein L
HPB14_04130       5       20  -3.045379  cag pathogenicity island protein I
HPB14_04135       6       20  -3.026540  cag pathogenicity island protein H
HPB14_04140       6       21  -4.011773  cag pathogenicity island protein (cag21)
HPB14_04145       4       20  -3.219979  cag island protein
HPB14_04150       0       16  -1.198985  cag pathogenicity island protein E VirB4-like
HPB14_04155       2       16  -0.108815  cag pathogenicity island protein D
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
HPB14_03935  PF01206  27     0.004    SirA family protein
		>PF01206#SirA family protein

Length = 76

Score = 26.6 bits (59), Expect = 0.004
Identities = 7/22 (31%), Positives = 12/22 (54%)

Query: 19 SGKEIEVLSTKPEMRIDISSFC 40
+G+ + V++T P D SF
Sbjct: 31 AGEVLYVMATDPGSVKDFESFS 52


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
HPB14_03965  GPOSANCHOR  35     0.001    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 34.7 bits (79), Expect = 0.001
Identities = 19/162 (11%), Positives = 52/162 (32%), Gaps = 7/162 (4%)

Query: 25 ETEKEKERQNTLKKDIKDYTYKVQQAKKRHKHQQTLKIADHCLNIIQRYLQEIERLQQEN 84
++ E + L+ + T + K + +++ L+ +
Sbjct: 188 LEARQAELEKALEGAMNFST---ADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTAD 244

Query: 85 KETPTQLEAVLEQIQQEMKSLPIQQKHALQIYYNRFEDVQRRTLAYLDYLSRLKTQLQEK 144
LEA ++ L + AL+ N + + L+ + +
Sbjct: 245 SAKIKTLEAEKAALEARQAEL----EKALEGAMNFSTADSAKIKTLEAEKAALEAEKADL 300

Query: 145 EKRLKSDLGEIKKLKQKVKENEKDIKKHQEALEQFEEWRDES 186
E + + + L++ + + + K+ + ++ EE S
Sbjct: 301 EHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKIS 342


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPB14_03975  ACRIFLAVINRP  32     7e-04    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 31.7 bits (72), Expect = 7e-04
Identities = 23/101 (22%), Positives = 37/101 (36%), Gaps = 10/101 (9%)

Query: 20 SYGQYRAAKEQTKQLEIICNTL---RKQYENLLIELRLDKQKLRL------QLAQDLEKI 70
G A Q II T +++ + + + D +RL +L + +
Sbjct: 218 QLGGTPALPGQQLNASIIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNV 277

Query: 71 DARIR-KNADKMHLLKLAYENSFMVLKCIKEHLDEYEKKFP 110
ARI K A + + N+ K IK L E + FP
Sbjct: 278 IARINGKPAAGLGIKLATGANALDTAKAIKAKLAELQPFFP 318


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
HPB14_03995  PF03944  32     0.003    delta endotoxin
		>PF03944#delta endotoxin

Length = 633

Score = 32.0 bits (72), Expect = 0.003
Identities = 25/94 (26%), Positives = 47/94 (50%), Gaps = 3/94 (3%)

Query: 68 LHHQEKLLNQCMLSQALKAMGDAELCVFLASVHDDLKGYEEFLNLCQKPHILALSKIDMA 127
L E+ LNQ + + + A +AEL A+V + + + FLN + L+++
Sbjct: 94 LRETERFLNQRLNTDTV-ARVNAELTGLQANVEEFNRQVDNFLNPNRNAVPLSITSSVNT 152

Query: 128 THKQVLQKLQEYQKYASQFVDLVPLSAKKSQNLN 161
+ L +L ++Q Q + L+PL A+ + NL+
Sbjct: 153 MQQLFLNRLPQFQMQGYQLL-LLPLFAQAA-NLH 184


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
HPB14_04000  HTHFIS  29     0.047    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 28.6 bits (64), Expect = 0.047
Identities = 11/36 (30%), Positives = 18/36 (50%), Gaps = 3/36 (8%)

Query: 48 TPKNILMIGSTGVGKTEIARRI---AKIMELPFVKV 80
T +++ G +G GK +AR + K PFV +
Sbjct: 159 TDLTLMITGESGTGKELVARALHDYGKRRNGPFVAI 194


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
HPB14_04045  IGASERPTASE  31     0.016    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 30.8 bits (69), Expect = 0.016
Identities = 37/183 (20%), Positives = 64/183 (34%), Gaps = 9/183 (4%)

Query: 224 DEVCSPLRDEMVAMPTNDSVTQKPNIIAPYSLYRLKETNNANEAQPSPYATQTAPENSKE 283
D P +E +A + P P N+ E++ Q A E + +
Sbjct: 1006 DVPSVPSNNEEIARVDE-APVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQ 1064

Query: 284 KLIEELIANSQLIANEEEREKKLLAEKEKQEAELAKYKLKDLENQKKLKALEAELKKKNA 343
A S + AN + E + K+ + +E ++K K +K
Sbjct: 1065 NREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAK----VETEKTQ 1120

Query: 344 KKPRVVEVPVSPQTSNSDETMRVVKEKENYNGLLVDKETTIKRSYEGTLISENSYSKKTP 403
+ P+V VSP+ S ET++ E N V+ + +S T +K+T
Sbjct: 1121 EVPKVTS-QVSPKQEQS-ETVQPQAEPARENDPTVNIKE--PQSQTNTTADTEQPAKETS 1176

Query: 404 LNP 406
N
Sbjct: 1177 SNV 1179


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
HPB14_04085  TYPE4SSCAGX  873    0.0      Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein signature.

Length = 522

Score = 873 bits (2256), Expect = 0.0
Identities = 516/522 (98%), Positives = 518/522 (99%)

Query: 1 MEQAFFKKIVGCFCLGYLFLSSVIEAAALDIKNFNRGRVKVVNKKIAYLGDEKPITIWTS 60
M QAFFKKIVGCFCLGYLFLSS IEA ALDIKNFNRGRVKVVNKKIAYLGDEKPITIWTS
Sbjct: 1 MGQAFFKKIVGCFCLGYLFLSSAIEAVALDIKNFNRGRVKVVNKKIAYLGDEKPITIWTS 60

Query: 61 LDNVTVIQLEKDETISYITTGFNKGWSIVPNSNHIFIQPKSVKSNLMFEKEAVNFALMTR 120
LDNVTVIQLEKDETISYITTGFNKGWSIVPNSNHIFIQPKSVKSNLMFEKEAVNFALMTR
Sbjct: 61 LDNVTVIQLEKDETISYITTGFNKGWSIVPNSNHIFIQPKSVKSNLMFEKEAVNFALMTR 120

Query: 121 DYQEFLKTKKLIVDAPDPKELEEQKKALEKEKEAKEQAQKAQKDKREKRKEERAKNRANL 180
DYQEFLKTKKLIVDAPDPKELEEQKKALEKEKEAKEQAQKAQKDKREKRKEERAKNRANL
Sbjct: 121 DYQEFLKTKKLIVDAPDPKELEEQKKALEKEKEAKEQAQKAQKDKREKRKEERAKNRANL 180

Query: 181 ENLTNAMSNPQNLSNNKNLSELIKQQRENELDQMERLEDMQEQAQANALKQIEELNKKQA 240
ENLTNAMSNPQNLSNNKNLSELIKQQRENELDQMERLEDMQEQAQANALKQIEELNKKQA
Sbjct: 181 ENLTNAMSNPQNLSNNKNLSELIKQQRENELDQMERLEDMQEQAQANALKQIEELNKKQA 240

Query: 241 EETIKQRAKDKISIKTDKSQKSPEDNSIELSPSDSAWRTNLVVRTNKALYQFILRIAQKD 300
EE ++QRAKDKISIKTDKSQKSPEDNSIELSPSDSAWRTNLVVRTNKALYQFILRIAQKD
Sbjct: 241 EEAVRQRAKDKISIKTDKSQKSPEDNSIELSPSDSAWRTNLVVRTNKALYQFILRIAQKD 300

Query: 301 NFASAYLTVKLEYPQRHEVSSVIEEELKKREEAKRQRELIKQENLNTTAYINRVMMASNE 360
NFASAYLTVKLEYPQRHEVSSVIEEELKKREEAKRQRELIKQENLNTTAYINRVMMASNE
Sbjct: 301 NFASAYLTVKLEYPQRHEVSSVIEEELKKREEAKRQRELIKQENLNTTAYINRVMMASNE 360

Query: 361 QIINKEKIREEKQKIILDQAKALETQYVHNALKRNPVPRNYNYYQAPEKRSKHIMPSEIF 420
QIINKEKIREEKQKIILDQAKALETQYVHNALKRNPVPRNYNYYQAPEKRSKHIMPSEIF
Sbjct: 361 QIINKEKIREEKQKIILDQAKALETQYVHNALKRNPVPRNYNYYQAPEKRSKHIMPSEIF 420

Query: 421 DDGTFTYFGFKNITLQPAIFVVQPDGKLSMTDAAIDPNMTNSGLRWYRVNEIAEKFKLIK 480
DDGTFTYFGFKNITLQPAIFVVQPDGKLSMTDAAIDPNMTNSGLRWYRVNEIAEKFKLIK
Sbjct: 421 DDGTFTYFGFKNITLQPAIFVVQPDGKLSMTDAAIDPNMTNSGLRWYRVNEIAEKFKLIK 480

Query: 481 DKALVTVINKGYGKNPLTKNYNIKNYGELERVIKKLPLVRDK 522
DKALVTVINKGYGKNPLTKNYNIKNYGELERVIKKLPLVRDK
Sbjct: 481 DKALVTVINKGYGKNPLTKNYNIKNYGELERVIKKLPLVRDK 522


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HPB14_04095  PF04335  118  6e-35  VirB8 type IV secretion protein
		>PF04335#VirB8 type IV secretion protein

Length = 227

Score = 118 bits (298), Expect = 6e-35
Identities = 44/205 (21%), Positives = 74/205 (36%), Gaps = 10/205 (4%)

Query: 27 KLNKANRTFKRAFYL---SMALNVAAVTSIVMMMPLKKTDIFVYGIDRYTGEFKIVKRSD 83
KL A R+ K A+ + + AL A V ++ + PLK + +V +DR TGE I +
Sbjct: 24 KLAAAERSKKLAWVVAGVAGALATAGVVAVAALTPLKTVEPYVITVDRNTGEASIAAKLH 83

Query: 84 A-RQIVNSEAVVDSATSKFVSLLFGYSKNSLRDRKDQLMQYCDVSFQTQAMRMFNENIRQ 142
I EAV + +V G+ + + D +M Q + R + + Q
Sbjct: 84 GDATITYDEAVRKYFLATYVRYREGWIAAAREEYFDAVMVMSARPEQDRWSRFYKTDNPQ 143

Query: 143 FVDKVRA-EAIISSNIQREKVKNSPLTRLTFFITIKITPDTMENYEYITKKQVTIYYDFA 201
+ A + I + +F +T T TI Y
Sbjct: 144 SPQNILANRTDVFVEI-KRVSFLGGNVAQVYFTKESVTGSNS----TKTDAVATIKYKVD 198

Query: 202 RGNSSQENLIINPFGFKVFDIQITD 226
S + + NP G++V +
Sbjct: 199 GTPSKEVDRFKNPLGYQVESYRADV 223


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HPB14_04120  TYPE4SSCAGX  31  0.005  Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein signature.

Length = 522

Score = 31.3 bits (70), Expect = 0.005
Identities = 33/119 (27%), Positives = 52/119 (43%), Gaps = 16/119 (13%)

Query: 24 AINTALLPSEYKELVALGFKKIKTFHQRHDDEEVTEEEKKFATNALREKLRNDRARAEQI 83
A+N AL+ +Y+E F K K D + EE+KK L ++ EQ
Sbjct: 112 AVNFALMTRDYQE-----FLKTKKLIVDAPDPKELEEQKK--------ALEKEKEAKEQA 158

Query: 84 QKNIEAFEKKNNSSVQKKAAKHRGLQELNETNANPLNDNPNGNSSTETKSNKDDNFDEM 142
QK A + K +++A L+ L +NP N + N N S K +++ D+M
Sbjct: 159 QK---AQKDKREKRKEERAKNRANLENLTNAMSNPQNLSNNKNLSELIKQQRENELDQM 214


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HPB14_04150  ACRIFLAVINRP  33  0.008  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 32.9 bits (75), Expect = 0.008
Identities = 20/88 (22%), Positives = 32/88 (36%), Gaps = 18/88 (20%)

Query: 19 EVQKRQFQKIEELKADMQKGVNPFFKVLFDGGNRLFGFPETFIYSSI-------FILFVT 71
+ K K+ EL+ +G+ +D F+ SI F +
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMK--VLYPYD--------TTPFVQLSIHEVVKTLFEAIML 350

Query: 72 IVLSVILF-QAYEPVLIVAIVIVLVALG 98
+ L + LF Q LI I + +V LG
Sbjct: 351 VFLVMYLFLQNMRATLIPTIAVPVVLLG 378


8  HPB14_04245  HPB14_04280  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
HPB14_04245315-1.324831phospho-N-acetylmuramoyl-pentapeptide-
HPB14_04250516-1.838260neuraminyllactose-binding hemagglutinin
HPB14_04255717-1.92312150S ribosomal protein L28
HPB14_04260716-2.395607putative conserved potassium channel protein
HPB14_04265917-2.544009hypothetical protein
HPB14_04270511-2.621842hypothetical protein
HPB14_04275412-2.258094hypothetical protein
HPB14_04280310-1.998656hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HPB14_04250  PF05211  291  e-102  Neuraminyllactose-binding hemagglutinin
		>PF05211#Neuraminyllactose-binding hemagglutinin

Length = 260

Score = 291 bits (747), Expect = e-102
Identities = 65/283 (22%), Positives = 124/283 (43%), Gaps = 41/283 (14%)

Query: 1 MERSLIFKKVRVYSKMLVALGLSSVLIGCAMNPSAETKTPNDAKNQVQTHERIQTSSEYV 60
M+ + FK + K L+ + ++L+GC S N+ ++ H +SE V
Sbjct: 1 MKANNHFKDF-AWKKCLLGASVVALLVGC----SPHIIETNEVALKLNYH----PASEKV 51

Query: 61 TPLDFNYPIHIAQAPQNHHVVGILMPRIQVSDNL-KPYIDKFQDALANQIQTIFEKRGYQ 119
LD + +L P Q SDN+ K Y +KF++ +++ I + +GY+
Sbjct: 52 QALD--------------EKILLLRPAFQYSDNIAKEYENKFKNQTTLKVEQILQNQGYK 97

Query: 120 VLRF--QDEKALNAQDKRKIFCVLDLKGWVGILEDLKMNLKDPNNPNL--DTLVDQ---- 171
V+ D+ + K++ + + + G + + D K ++ + P L T +D+
Sbjct: 98 VINVDSSDKDDFSFAQKKEGYLAVAMNGEIVLRPDPKRTIQKKSEPGLLFSTGLDKMEGV 157

Query: 172 --SSGSVWFNFYEPESNRVVHDFAVEVGTF---QAMTYTYKHSNSGGFDSSDSIIHEDLE 226
+G V EP S + F +++ + T S+SGG S+
Sbjct: 158 LIPAGFVKVTILEPMSGESLDSFTMDLSELDIQEKFLKTTHSSHSGGLVSTMV----KGT 213

Query: 227 KNKEDAIHKILNRMYAVVMKKAVMELTEENIAKYRDAIDRMKG 269
N DAI LN+++A +M++ +LT++N+ Y+ +KG
Sbjct: 214 DNSNDAIKSALNKIFANIMQEIDKKLTQKNLESYQKDAKELKG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HPB14_04265  FLAGELLIN  28  0.038  Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 28.5 bits (63), Expect = 0.038
Identities = 29/98 (29%), Positives = 43/98 (43%), Gaps = 13/98 (13%)

Query: 172 INQAKE-SANNEISTNKTQAIANINEAKNNANNEIS---------NNQTQAITNINEAKE 221
IN AK+ +A I+ T I + +A NAN+ IS N + + E
Sbjct: 37 INSAKDDAAGQAIANRFTSNIKGLTQASRNANDGISIAQTTEGALNEINNNLQRVRELSV 96

Query: 222 SATNQINTNKQEVLNNIKKEKTQATSEITE-AKKTAFN 258
ATN N++ L +I+ E Q EI + +T FN
Sbjct: 97 QATNGTNSDSD--LKSIQDEIQQRLEEIDRVSNQTQFN 132


9  HPB14_04325  HPB14_04350  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
HPB14_043251173.170966iron(III) dicitrate ABC transporter permease
HPB14_043302172.627580short-chain oxidoreductase
HPB14_043350193.245102hypothetical protein
HPB14_043400213.409978hypothetical protein
HPB14_043451213.863420hypothetical protein
HPB14_043502214.200769outer membrane protein BabA
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HPB14_04330  DHBDHDRGNASE  93  2e-24  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 92.8 bits (230), Expect = 2e-24
Identities = 58/233 (24%), Positives = 102/233 (43%), Gaps = 10/233 (4%)

Query: 14 KVAVITGASSGIGLECALMLLDQGYKVYALSRHATLCVALNHALC------ECVDIDVSD 67
K+A ITGA+ GIG A L QG + A+ + + +L E DV D
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 68 SNALKEVFLNISAKEDHCDVLINSAGYGVFGSVEDTPIEEVKKQFGVNFFALCEVVQFCL 127
S A+ E+ I + D+L+N AG G + EE + F VN + +
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 128 PLLKNKPHSKIFNLSSIAGRVSMLFLGHYSASKHALEAYSDALRLEFKPFNIQVCLIEPG 187
+ ++ I + S V + Y++SK A ++ L LE +NI+ ++ PG
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 188 PVKSNWEKTAFENDERKDSVYALDLEKTKNFYSGVYQNALS-PKAVAQKIVFL 239
+++ + + + ++ + V LE F +G+ L+ P +A ++FL
Sbjct: 189 STETDMQWSLWADENGAEQVIKGSLET---FKTGIPLKKLAKPSDIADAVLFL 238


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HPB14_04340  PF01206  26  0.017  SirA family protein
		>PF01206#SirA family protein

Length = 76

Score = 25.9 bits (57), Expect = 0.017
Identities = 9/43 (20%), Positives = 21/43 (48%), Gaps = 7/43 (16%)

Query: 83 IPNLETQQAMREALNGENLEVI-------EDFSAWANEIKKEV 118
+P L+ ++ + GE L V+ +DF +++ + E+
Sbjct: 17 LPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHEL 59


10  HPB14_05155  HPB14_05225  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
HPB14_0515529-0.415182glucokinase
HPB14_05160311-1.539692NADP-dependent alcohol dehydrogenase
HPB14_05165112-1.858594lipopolysaccharide biosynthesis protein
HPB14_051701120.430616lipopolysaccharide biosynthesis protein
HPB14_051752132.223839hypothetical protein
HPB14_051800152.883429hypothetical protein
HPB14_051850122.391661pyruvate flavodoxin oxidoreductase subunit
HPB14_05190-1112.029641pyruvate flavodoxin oxidoreductase subunit
HPB14_05195-1101.198160pyruvate flavodoxin oxidoreductase subunit
HPB14_05200-210-0.369018pyruvate ferredoxin oxidoreductase, beta
HPB14_05205011-0.988555adenylosuccinate lyase
HPB14_05210014-1.717531outer membrane protein Horl
HPB14_05215213-1.883143excinuclease ABC subunit B
HPB14_05220623-3.827737hypothetical protein
HPB14_05225332-3.726269hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HPB14_05185  YERSSTKINASE  29  0.011  Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 29.3 bits (65), Expect = 0.011
Identities = 18/63 (28%), Positives = 33/63 (52%), Gaps = 9/63 (14%)

Query: 50 YNRVDDEPILNHERFMQPDYVLVIDPGLVFIENIFANEKEDTTYIITSYLNKEELFEKKP 109
++R ++P E F P+ + + N+ A+EK D ++++ L+ E FEK P
Sbjct: 293 HSRSGEQPKGFTESFKAPE---------LGVGNLGASEKSDVFLVVSTLLHCIEGFEKNP 343

Query: 110 ELK 112
E+K
Sbjct: 344 EIK 346


11  HPB14_05280  HPB14_05315  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
HPB14_05280221-1.487879type II DNA modification enzyme
HPB14_05285217-1.105869Anti-sigma 28 factor
HPB14_05290314-2.092305hypothetical protein
HPB14_05295415-1.928446FKBP-type peptidyl-prolyl cis-trans isomerase
HPB14_05300316-2.594304hypothetical protein
HPB14_05305316-2.163356peptidoglycan-associated lipoprotein precursor
HPB14_05310113-0.234130translocation protein TolB
HPB14_05315217-0.264474hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HPB14_05305  OMPADOMAIN  147  1e-45  OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 147 bits (372), Expect = 1e-45
Identities = 48/169 (28%), Positives = 75/169 (44%), Gaps = 24/169 (14%)

Query: 22 KMDNKTVAGDVSAKTVQTAPV-TTEPAPEKEEPKQEPAPVVEEKPAVESGTIIASIYFDF 80
+ DN ++ VS + Q PAP PAP V+ K T+ + + F+F
Sbjct: 177 RPDNGMLSLGVSYRFGQGEAAPVVAPAPA-------PAPEVQTK----HFTLKSDVLFNF 225

Query: 81 DKYEIKESDQETLDEIVQKAKE---NHMQVLLEGNTDEFGSSEYNQALGVKRTLSVKNAL 137
+K +K Q LD++ + V++ G TD GS YNQ L +R SV + L
Sbjct: 226 NKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQSVVDYL 285

Query: 138 VIKGVEKDMIKTISFGETKPKC-----AQKTR----ECYKENRRVDVKL 177
+ KG+ D I GE+ P K R +C +RRV++++
Sbjct: 286 ISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEV 334


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HPB14_05315  IGASERPTASE  30  0.013  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 30.0 bits (67), Expect = 0.013
Identities = 34/160 (21%), Positives = 55/160 (34%), Gaps = 25/160 (15%)

Query: 30 HNKEAEKILLDLGKKNEQVIDLNLEDLPSDEKKDEKIAEKAEEKKDEKVVEKNATDKEGD 89
+N E EK + N + D+PS +E+IA + DE V
Sbjct: 981 YNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIA-----RVDEAPVP--------- 1026

Query: 90 FIDPKEQEESLEDIFSSLNDFQEKTDTNAQKDEQKNEQEEEQRRLKEQQRLRKNQKNQEM 149
P S + N QE K +KNEQ+ + Q R + +
Sbjct: 1027 --PPAPATPSETTETVAENSKQE------SKTVEKNEQDATET--TAQNREVAKEAKSNV 1076

Query: 150 LKDLQQNLDQFAQKLESVKNKTLDLQIPKQDGVDEKAYQE 189
+ Q N + E+ + +T + + +EKA E
Sbjct: 1077 KANTQTN-EVAQSGSETKETQTTETKETATVEKEEKAKVE 1115


12  HPB14_06525  HPB14_06630  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
HPB14_06525-210-3.719242hypothetical protein
HPB14_06530013-4.960299putative histidine kinase sensor protein
HPB14_06535-114-5.293615transcriptional regulator
HPB14_06540016-5.935875adenine-specific DNA methylase
HPB14_06545318-7.092013type II restriction endonuclease
HPB14_06550418-6.599234DNA adenine methylase
HPB14_06555618-6.958489type III restriction enzyme M protein
HPB14_065601030-8.332735type III R-M system methyltransferase
HPB14_06565826-5.902790hypothetical protein
HPB14_06570728-5.623848hypothetical protein
HPB14_06575627-4.482081hypothetical protein
HPB14_06580627-4.109559hypothetical protein
HPB14_06585527-3.640794hypothetical protein
HPB14_06590424-4.629418Transposase, OrfB, ISHp609
HPB14_06595725-6.493489Serine recombinase-transposase, OrfA, ISHp609
HPB14_06600625-6.453036hypothetical protein
HPB14_06605728-5.564529hypothetical protein
HPB14_06610626-6.112900hypothetical protein
HPB14_06615725-5.968723putative transposase
HPB14_06620729-5.591642ISHa1152 transposase A
HPB14_06625525-4.913116hypothetical protein
HPB14_06630218-2.104305hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HPB14_06535  HTHFIS  89  7e-23  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 89.1 bits (221), Expect = 7e-23
Identities = 38/118 (32%), Positives = 57/118 (48%), Gaps = 2/118 (1%)

Query: 1 MQK-KIFLLEDDYLLSESIKEFLEHLGYEVFCTFNGKEAYERLSVERFNLLLLDVQVPEM 59
M I + +DD + + + L GY+V T N + ++ +L++ DV +P+
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 60 NSLELFRRIKNDFLISTPVIFITALQDNATLKNAFNLGASDYLKKPFDLDELEARIKR 117
N+ +L RIK PV+ ++A T A GA DYL KPFDL EL I R
Sbjct: 61 NAFDLLPRIKKA-RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


13  HPB14_06695  HPB14_06795  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
HPB14_06695-111-3.436976prephenate dehydrogenase
HPB14_06700-313-4.441893endonuclease
HPB14_06705-215-4.408399adenine-specific DNA-methyltransferase
HPB14_06710-113-3.178182putative type III restriction enzyme R protein
HPB14_06715014-2.153361biotin synthase
HPB14_06720217-2.905013putative ribonuclease N
HPB14_06725419-2.959941hypothetical protein
HPB14_06730417-1.663445hypothetical protein
HPB14_06735417-1.943903hypothetical protein
HPB14_06740620-1.133887hypothetical protein
HPB14_06745218-1.198798hypothetical protein
HPB14_06750014-0.975084hypothetical protein
HPB14_06755-115-0.8577037-cyano-7-deazaguanine reductase
HPB14_067600121.043184iojap-related protein
HPB14_06765-2111.601559tRNA delta(2)-isopentenylpyrophosphate
HPB14_06770-2111.817219lipopolysaccharide biosynthesis protein, LPS
HPB14_06785-2132.377513UDP-N-acetylenolpyruvoylglucosamine reductase
HPB14_06790-2132.557738flagellar biosynthesis protein FliQ
HPB14_06795-2123.014028flagellum-specific ATP synthase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HPB14_06790  TYPE3IMQPROT  67  2e-18  Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 66.7 bits (163), Expect = 2e-18
Identities = 26/81 (32%), Positives = 40/81 (49%)

Query: 3 SQLMKLAIETYKITLMISLPVLLAGLVVGLLVSIFQATTQINEMTLSFVPKILAVIGVLI 62
L+ + + L++S + ++GLLV +FQ TQ+ E TL F K+L V L
Sbjct: 2 DDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLF 61

Query: 63 LTMPWMTNMLLDYTKTLIKLI 83
L W +LL Y + +I L
Sbjct: 62 LLSGWYGEVLLSYGRQVIFLA 82


14  HPB14_07040  HPB14_07105  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
HPB14_070402110.439783putative ABC transport system permease protein
HPB14_070453130.523109outer membrane protein
HPB14_070503150.475583branched-chain amino acid aminotransferase
HPB14_07055214-0.288555outer membrane protein HorJ
HPB14_07060216-0.284487DNA polymerase I
HPB14_070753180.259767restriction enzyme BcgI alpha chain-like
HPB14_070804221.067290competence protein ComFC
HPB14_070854130.240768thymidylate kinase
HPB14_07090312-0.012039phosphopantetheine adenylyltransferase
HPB14_070952110.1856793-octaprenyl-4-hydroxybenzoate carboxy-lyase
HPB14_07100211-0.078695hypothetical protein
HPB14_07105212-0.038191flagellar basal body P-ring biosynthesis protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HPB14_07090  LPSBIOSNTHSS  223  5e-78  Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature.

Length = 166

Score = 223 bits (569), Expect = 5e-78
Identities = 63/147 (42%), Positives = 94/147 (63%)

Query: 4 IGIYPGTFDPVTNGHIDIIHRSSELFEKLIVAVAHSSAKNPMFSLKERLKMMQLATKSFK 63
IYPG+FDP+T GH+DII R LF+++ VAV + K PMFS++ERL+ + A
Sbjct: 2 NAIYPGSFDPITFGHLDIIERGCRLFDQVYVAVLRNPNKQPMFSVQERLEQIAKAIAHLP 61

Query: 64 NVECVAFEGLLANLAKEYHCKVLVRGLRVVSDFEYELQMGYANKSLNHELETLYFMPTLQ 123
N + +FEGL N A++ ++RGLRV+SDFE ELQM NK+L +LET++ + +
Sbjct: 62 NAQVDSFEGLTVNYARQRQAGAILRGLRVLSDFELELQMANTNKTLASDLETVFLTTSTE 121

Query: 124 NAFISSSIVRSIIAHKGDASHLVPKEI 150
+F+SSS+V+ + G+ H VP +
Sbjct: 122 YSFLSSSLVKEVARFGGNVEHFVPSHV 148


15  HPB14_07190  HPB14_07355  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
HPB14_07190014-4.520889UDP-N-acetylmuramoylalanyl-D-glutamate--2,
HPB14_07195018-5.529763transaldolase
HPB14_07200217-4.48319950S ribosomal protein L25/general stress protein
HPB14_07205218-4.608998peptidyl-tRNA hydrolase
HPB14_07210114-3.430488hypothetical protein
HPB14_07215114-3.371803putative site-specific DNA-methyltransferase
HPB14_0722009-0.845835putative endonuclease
HPB14_072251110.912811outer membrane protein (omp32)
HPB14_072300110.763459hypothetical protein
HPB14_072350111.018882putative cation-transporting ATPase CopA
HPB14_072401101.313486hypothetical protein
HPB14_072452111.094779riboflavin biosynthesis protein
HPB14_072501121.530647sodium/glutamate symport carrier
HPB14_072552122.661429saccharopine dehydrogenase
HPB14_072601132.636208ferrodoxin-like protein
HPB14_072650111.961204putative glycerol-3-phosphate acyltransferase
HPB14_07270-2101.750495dihydroneopterin aldolase
HPB14_07275-191.349106hypothetical protein
HPB14_07280-211-0.225220iron-regulated outer membrane protein
HPB14_07285-113-4.926577hypothetical protein
HPB14_07290-110-4.557342selenocysteine synthase
HPB14_07295-110-4.565562transcription elongation factor NusA
HPB14_07300-110-4.658360hypothetical protein
HPB14_07305010-4.538215putative type IIS restriction-modification
HPB14_07310112-5.083173hypothetical protein
HPB14_07315111-2.471046type III restriction enzyme
HPB14_07320111-1.539074type III R-M system modification enzyme
HPB14_07325113-1.341613ATP-dependent DNA helicase RecG
HPB14_07330113-1.166086hypothetical protein
HPB14_07335013-1.151831hypothetical protein
HPB14_07340012-0.425014exodeoxyribonuclease III
HPB14_07345114-0.099077*hypothetical protein
HPB14_073503170.063577chromosomal replication initiation protein
HPB14_07355219-1.939637hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HPB14_07350  HTHFIS  35  5e-04  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 35.2 bits (81), Expect = 5e-04
Identities = 9/51 (17%), Positives = 24/51 (47%), Gaps = 4/51 (7%)

Query: 125 TVYEIAKKVAQSDTPPYNPVLFYGGTGLGKTHILNAIGNHALEKHKKVVLV 175
+Y + ++ Q+D ++ G +G GK + A+ ++ ++ V +
Sbjct: 148 EIYRVLARLMQTDLT----LMITGESGTGKELVARALHDYGKRRNGPFVAI 194


16  HPB14_00190  HPB14_00215  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
HPB14_00190-211-0.133624type IV secretion system protein VirB8
HPB14_00195-212-0.521118ComB9 competence protein
HPB14_00200-2130.633046ComB10 competence protein
HPB14_00205-1120.745712mannose-1-phosphate guanyltransferase
HPB14_00210-2111.021620GDP-D-mannose dehydratase
HPB14_00215-1120.833560sugar nucleotide biosynthesis
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HPB14_00190  PF04335  133  1e-40  VirB8 type IV secretion protein
		>PF04335#VirB8 type IV secretion protein

Length = 227

Score = 133 bits (337), Expect = 1e-40
Identities = 38/202 (18%), Positives = 72/202 (35%), Gaps = 4/202 (1%)

Query: 40 QSVFRLERNRLKIAYKLLGLMSFIALILAIVLISLLPLQKTEHHF--VDFLNQDKHYAII 97
+ K+A+ + G+ +A + + +L PL+ E + VD + A
Sbjct: 22 RDKLAAAERSKKLAWVVAGVAGALATAGVVAVAALTPLKTVEPYVITVDRNTGEASIAAK 81

Query: 98 QRADKSISSNEALARSLIGAYVLNRESINRIDDKSRYELVRLQSSSKVWQRFEDLIKTQN 157
D +I+ +EA+ + + YV RE + ++ V + S+ R+ KT N
Sbjct: 82 LHGDATITYDEAVRKYFLATYVRYREGWIAAAREEYFDAVMVMSARPEQDRWSRFYKTDN 141

Query: 158 SIYAQSHLEREVHI-VNIAIYQQDNNPIASVSIAAKLTNENKLVYEKRYKIVLSYLFDTP 216
Q+ L + V I +A V + + + + Y D
Sbjct: 142 PQSPQNILANRTDVFVEIKRVSFLGGNVAQVYFTKESVTGSNST-KTDAVATIKYKVDGT 200

Query: 217 DFDYASMPKNPTGFKITRYSIT 238
KNP G+++ Y
Sbjct: 201 PSKEVDRFKNPLGYQVESYRAD 222


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HPB14_00195  TYPE4SSCAGX  29  0.024  Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein signature.

Length = 522

Score = 29.4 bits (65), Expect = 0.024
Identities = 34/112 (30%), Positives = 52/112 (46%), Gaps = 13/112 (11%)

Query: 174 KENKENKENKENKE-NKENTLENAPTNNKPLKEKKEET----KEKEEETITIGDNTNAMK 228
+E K+ +E K +E K+ L N+ + E+ K +EE+ I D A+
Sbjct: 325 EELKKREEAKRQRELIKQENLNTTAYINRVMMASNEQIINKEKIREEKQKIILDQAKAL- 383

Query: 229 IVKKDIQKGYKALKSSQ--RKWYCLGICSKKSKLSLMPKEIFNDKQFTYFKF 278
+ Q + ALK + R + K+SK +MP EIF+D FTYF F
Sbjct: 384 ----ETQYVHNALKRNPVPRNYNYYQAPEKRSK-HIMPSEIFDDGTFTYFGF 430


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HPB14_00205  FLGMRINGFLIF  30  0.024  Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 29.9 bits (67), Expect = 0.024
Identities = 15/70 (21%), Positives = 26/70 (37%), Gaps = 3/70 (4%)

Query: 272 ALFEEAANEPKENVSLNQTPVFAKESANNLVFSHKVSAL---LGVEDLAVIDTKDALLVA 328
+LF P +V++ P A + H VS+ L ++ ++D LL
Sbjct: 162 SLFVREQKSPSASVTVTLEPGRALDEGQISAVVHLVSSAVAGLPPGNVTLVDQSGHLLTQ 221

Query: 329 HKDKAKDLKA 338
+DL
Sbjct: 222 SNTSGRDLND 231


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HPB14_00210  NUCEPIMERASE  89  6e-22  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 89.1 bits (221), Expect = 6e-22
Identities = 46/180 (25%), Positives = 72/180 (40%), Gaps = 19/180 (10%)

Query: 7 LITGVTGQDGSYLAEYLLNLGYEVHGLKRRSSSINTSRIDHLYEDLHSDHKRRFFLHYGD 66
L+TG G G ++++ LL G++V G+ + + S E L F H D
Sbjct: 4 LVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQP---GFQFHKID 60

Query: 67 MTDSSNLIHLIATNKPTEIYNLAAQSHVKVSFETPEYTANADGIGTLRILEAMRILGLEN 126
+ D + L A+ ++ + V+ S E P A+++ G L ILE R ++
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQ- 119

Query: 127 KTRFYQASTSELYGEVLETPQNENTPF-------NPRSPYAVAKMYAFYITKNYREAYNL 179
AS+S +YG N PF +P S YA K + Y Y L
Sbjct: 120 --HLLYASSSSVYGL------NRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGL 171


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HPB14_00215  NUCEPIMERASE  51  3e-09  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 50.6 bits (121), Expect = 3e-09
Identities = 52/346 (15%), Positives = 107/346 (30%), Gaps = 54/346 (15%)

Query: 5 ILITGAYGMVGQNTALYFKKNKPDV-----------TLLTPKKSELY-----------LL 42
L+TGA G +G + + + V L + EL L
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDLA 62

Query: 43 DKDSVQAYLKEYKPTGIIHCAGRVGGIVANMNDLSTYMVENLLMGLYLFSSALDLGVKKA 102
D++ + + R + ++ + Y NL L + ++
Sbjct: 63 DREGMTDLFASGHFERVFISPHR-LAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121

Query: 103 INLASSCAYPKYAPNPLKESDLLNGSLEPTNEGYALAKLSVMKYCEYVSAEKGVFYKTLV 162
+ +SS Y P D ++ + YA K + S G+ L
Sbjct: 122 LYASSSSVYGLNRKMPFSTDDSVDHPVSL----YAATKKANELMAHTYSHLYGLPATGLR 177

Query: 163 PCNLYGEFDKFEEKIAHMIPGLIARMHTAKLKNEKEFAMWGDGTARREYLNAKDLARFIA 222
+YG + + P + T + K ++ G +R++ D+A I
Sbjct: 178 FFTVYGPWGR---------PDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAII 228

Query: 223 LAYENIAQ----------MPS-------VMNVGSGVDYSIEEYYEMVAQVLDYKGVFVKD 265
+ I P+ V N+G+ + +Y + + L +
Sbjct: 229 RLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNML 288

Query: 266 LSKPVGMQQKLMDISK-QKALKWELEIPLEQGIKEAYEYYLKLLEV 310
+P + + D + + + E ++ G+K +Y +V
Sbjct: 289 PLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDFYKV 334


17  HPB14_01200  HPB14_01235  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
HPB14_01200-2121.061520Neutrophil activating protein NapA
HPB14_01205-3120.849232histidine kinase sensor protein
HPB14_01210-2111.638872hypothetical protein
HPB14_01215-2112.234194flagellar basal body P-ring protein
HPB14_01220-1111.945233ATP-dependent RNA helicase
HPB14_01225-2101.631997hypothetical protein
HPB14_01230-291.199941hypothetical protein
HPB14_01235-392.290861oligopeptide permease ATPase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HPB14_01200  HELNAPAPROT  149  3e-49  Helicobacter neutrophil-activating protein A family ...
		>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature.

Length = 153

Score = 149 bits (377), Expect = 3e-49
Identities = 39/140 (27%), Positives = 74/140 (52%), Gaps = 1/140 (0%)

Query: 5 EILKHLQADAIVLFMKVHNFHWNVKGTDFFNVHKATEEIYEEFADMFDDLAERIAQLGHH 64
L ++ +L+ K+H FHW VKG FF +H+ EE+Y+ A+ D +AER+ +G
Sbjct: 15 NSLNTQLSNWFLLYSKLHRFHWYVKGPHFFTLHEKFEELYDHAAETVDTIAERLLAIGGQ 74

Query: 65 PLVTLSEALKLTRVKEETKTSFHSKDIFKEILGDYKHLEKEFKELSNTAEKEGDKVTVTY 124
P+ T+ E + + + + + ++ + ++ DYK + E K + AE+ D T
Sbjct: 75 PVATVKEYTEHASITDGGNET-SASEMVQALVNDYKQISSESKFVIGLAEENQDNATADL 133

Query: 125 ADDQLAKLQKSIWMLEAHLA 144
+ +++K +WML ++L
Sbjct: 134 FVGLIEEVEKQVWMLSSYLG 153


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HPB14_01205  PF06580  30  0.015  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 30.2 bits (68), Expect = 0.015
Identities = 10/71 (14%), Positives = 25/71 (35%), Gaps = 13/71 (18%)

Query: 281 IVLQNFLYNAIDAIEALEESEQ-GQVKIEAFIQNEFIVFTIIDNGKEVENKSALFEPFET 339
+++Q + N I + + Q G++ ++ N + + + G +
Sbjct: 258 MLVQTLVENGI--KHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTK------- 308

Query: 340 TKLKGNGLGLA 350
+ G GL
Sbjct: 309 ---ESTGTGLQ 316


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HPB14_01215  FLGPRINGFLGI  362  e-127  Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 362 bits (930), Expect = e-127
Identities = 117/345 (33%), Positives = 191/345 (55%), Gaps = 26/345 (7%)

Query: 19 AEKIGDIASVVGVRDNQLIGYGLVIGLNGTGDK-SGSKFTMQSISNMLESVNVKISADDI 77
+I DIAS+ RDNQLIGYGLV+GL GTGD S FT QS+ ML+++ +
Sbjct: 28 TSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQSMRAMLQNLGITTQGGQS 87

Query: 78 KSKNVAAVMITASLPPFARQGDKIDIHISSIGDAKSIQGGTLVMTPLNAVDGNIYALAQG 137
+KN+AAVM+TA+LPPFA G ++D+ +SS+GDA S++GG L+MT L+ DG IYA+AQG
Sbjct: 88 NAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNLIMTSLSGADGQIYAVAQG 147

Query: 138 AIVSGN-----------SNNLLSANIINGATIEREVSYDLFHKNAMVLSLKSPNFKNAIQ 186
A++ SA + NGA IERE+ +VL L++P+F A++
Sbjct: 148 ALIVNGFSAQGDAATLTQGVTTSARVPNGAIIERELPSKFKDSVNLVLQLRNPDFSTAVR 207

Query: 187 VQNTLNKV----FGNKVAIALDPKTIQITRPERLSMVEFLALVQEIPINYSAKNKIIVDE 242
V + +N +G+ +A D + I + +P + +A ++ + + K++++E
Sbjct: 208 VADVVNAFARARYGDPIAEPRDSQEIAVQKPRVADLTRLMAEIENLTVETDTPAKVVINE 267

Query: 243 KSGTIVSGVDIIVHPIVVTSQDITLKITKEP--------LNDSKNTQDLDNNMSLDTAHN 294
++GTIV G D+ + + V+ +T+++T+ P Q + M++
Sbjct: 268 RTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQPAPFSRGQTAVQPQTDIMAMQEGSK 327

Query: 295 TLSSNGKSITIAGVVKALQKIGVSAKGMVSILQALKKSGAISAEM 339
G + +V L IG+ A G+++ILQ +K +GA+ AE+
Sbjct: 328 VAIVEGPDLR--TLVAGLNSIGLKADGIIAILQGIKSAGALQAEL 370


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HPB14_01220  SECA  30  0.026  SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 29.8 bits (67), Expect = 0.026
Identities = 17/63 (26%), Positives = 31/63 (49%), Gaps = 2/63 (3%)

Query: 261 IVFTRTKKEADELHQFLASKNYKSTALHGDMDQRDRRASIMAFKKNDADVLVATDVASRG 320
+V T + ++++ + L K L+ + A+I+A A V +AT++A RG
Sbjct: 453 LVGTISIEKSELVSNELTKAGIKHNVLNAKFHANE--AAIVAQAGYPAAVTIATNMAGRG 510

Query: 321 LDI 323
DI
Sbjct: 511 TDI 513


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HPB14_01235  HTHFIS  32  0.005  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 32.5 bits (74), Expect = 0.005
Identities = 16/50 (32%), Positives = 21/50 (42%), Gaps = 7/50 (14%)

Query: 30 VAIVGESGSGKSSIANLIMRLNPR----FKPHNGEVLFETTNLLKESEAF 75
+ I GESG+GK +A + R F N + L ESE F
Sbjct: 163 LMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRD---LIESELF 209


18  HPB14_01685  HPB14_01730  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
HPB14_01685-2100.623032flagellar MS-ring protein
HPB14_01690-2120.876235flagellar motor switch protein G
HPB14_01695-1110.929189flagellar assembly protein H
HPB14_01700-171.4287381-deoxy-D-xylulose-5-phosphate synthase
HPB14_017050100.770532GTP-binding protein LepA
HPB14_01710-114-1.390379hypothetical protein
HPB14_01715-1130.019032hypothetical protein
HPB14_01720-113-0.229181flagellar basal-body rod protein
HPB14_01725012-0.607072alpha-ketoglutarate permease
HPB14_01730013-0.901219cell division protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HPB14_01685  FLGMRINGFLIF  552  0.0  Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 552 bits (1423), Expect = 0.0
Identities = 179/582 (30%), Positives = 293/582 (50%), Gaps = 66/582 (11%)

Query: 11 VDFFIKLNKKQKIALIAAGVLITALLVFLLLYPFKEKDYAQGGYGVLFERLDSSDNALIL 70
+++ +L +I LI AG A++V ++L+ K DY LF L D I+
Sbjct: 13 LEWLNRLRANPRIPLIVAGSAAVAIVVAMVLWA-KTPDYR-----TLFSNLSDQDGGAIV 66

Query: 71 QHLQQNQIPYKVLKDD-TILVPKDKVYEERITLASQGIPKTSKVGFEIFDTKDFGATDFD 129
L Q IPY+ I VP DKV+E R+ LA QG+PK VGFE+ D + FG + F
Sbjct: 67 AQLTQMNIPYRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQFS 126

Query: 130 QNIKLIRAIEGELSRTIESLNPILKANVHIAIPKDSVFVAKEVPPSASVMLKLKPDMKLS 189
+ + RA+EGEL+RTIE+L P+ A VH+A+PK S+FV ++ PSASV + L+P L
Sbjct: 127 EQVNYQRALEGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALD 186

Query: 190 PTQILGIKNLIAAAVPKLTIENVKIVNENGESIGEGDILENSKELALEQLHYKQNFENIL 249
QI + +L+++AV L NV +V+++G + + + + ++L QL + + E+ +
Sbjct: 187 EGQISAVVHLVSSAVAGLPPGNVTLVDQSGHLLTQSNT--SGRDLNDAQLKFANDVESRI 244

Query: 250 ENKIVNILAPIVGGKNKVVARVNAEFDFSQKKSTKETFDPNN-----VVRSEQNLEEKKE 304
+ +I IL+PIVG N V A+V A+ DF+ K+ T+E + PN +RS Q ++
Sbjct: 245 QRRIEAILSPIVGNGN-VHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQV 303

Query: 305 GAPKKQVGGVPGVVSN-IGPVQGLKDNKEPEKYEKSQN---------------------- 341
GA GGVPG +SN P P + +QN
Sbjct: 304 GAGYP--GGVPGALSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNE 361

Query: 342 TTNYEVGKTISEIKGEFGTLVRLNAAVVVDGRYKIALKDGANALEYEPLSDESLKKINAL 401
T+NYEV +TI K G + RL+ AVVV+ + L DG + PL+ + +K+I L
Sbjct: 362 TSNYEVDRTIRHTKMNVGDIERLSVAVVVNYK---TLADG----KPLPLTADQMKQIEDL 414

Query: 402 VKQAIGYNQNRGDDVAVSNFEFNPMAPVIDNATLSEKIMHKTQKILGSFTPLIKYILVFI 461
++A+G++ RGD + V N F+ + T E + Q + +++LV +
Sbjct: 415 TREAMGFSDKRGDTLNVVNSPFSAVDN-----TGGELPFWQQQSFIDQLLAAGRWLLVLV 469

Query: 462 VLFIFYKKVIVPFSERMLEVVPDEDKEVKSMFEEMDEEEDELNKLGDLRKKVEDQLGLNA 521
V +I ++K + P R +E ++ + E + E L+K L+++ +Q
Sbjct: 470 VAWILWRKAVRPQLTRRVEEAKAAQEQAQVRQETEEAVEVRLSKDEQLQQRRANQ----- 524

Query: 522 TFSEEEVRYEIILEKIRGTLKERPDEIATLFKLLIKDEISSD 563
+ E++ ++IR E D + L+I+ +S+D
Sbjct: 525 -----RLGAEVMSQRIR----EMSDNDPRVVALVIRQWMSND 557


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HPB14_01690  FLGMOTORFLIG  351  e-123  Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 351 bits (902), Expect = e-123
Identities = 122/338 (36%), Positives = 209/338 (61%), Gaps = 4/338 (1%)

Query: 8 KQKAQLDELSMSEKIAILLIQVGEDTTGEILRHLDIDSITEISKQIVQLNGTDKQIGAAV 67
K+ + L+ +K AILL+ +G + + ++ ++L + I ++ +I +L ++ V
Sbjct: 7 KEILDVSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNV 66

Query: 68 LEEFFAIFQSNQYINTGGLEYARELLTRTLGSEEAKKVMDKLTKSLQTQKNFAYLGKIKP 127
L EF + + ++I GG++YARELL ++LG+++A +++ L +LQ+ + F ++ + P
Sbjct: 67 LLEFKELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQS-RPFEFVRRADP 125

Query: 128 QQLADFIINEHPQTIALILAHMEAPNAAETLSYFPDEMKAEISIRMANLGEISPQVVKRV 187
+ +FI EHPQTIALIL++++ A+ LS P E++ ++ R+A + SP+VV+ V
Sbjct: 126 ANILNFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREV 185

Query: 188 STVLENKLESLTSYK-IEVGGLRAVAEIFNRLGQKSAKTTLARIESVDNKLAGAIKEMMF 246
VLE KL SL+S GG+ V EI N +K+ K + +E D +LA IK+ MF
Sbjct: 186 ERVLEKKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMF 245

Query: 247 TFEDIVKLDNFAIREILKVADKKDLSLALKTSTKDLTDKFLNNMSSRAAEQFVEEMQYLG 306
FEDIV LD+ +I+ +L+ D ++L+ ALK+ + +K NMS RAA E+M++LG
Sbjct: 246 VFEDIVLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLG 305

Query: 307 AVKIKDVDVAQRKIIEIVQSLQEKG--VIQTGEEEDVI 342
+ KDV+ +Q+KI+ +++ L+E+G VI G EEDV+
Sbjct: 306 PTRRKDVEESQQKIVSLIRKLEEQGEIVISRGGEEDVL 343



Score = 30.2 bits (68), Expect = 0.010
Identities = 20/102 (19%), Positives = 41/102 (40%), Gaps = 3/102 (2%)

Query: 4 KLTPKQKAQLDELSMSEKIAILLIQVGEDTTGEILRHLDIDSITEISKQIVQLNGTDKQI 63
+ P + + IA++L + IL L + T ++++I ++ T ++
Sbjct: 122 RADPANILNFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEV 181

Query: 64 GAA---VLEEFFAIFQSNQYINTGGLEYARELLTRTLGSEEA 102
VLE+ A S Y + GG++ E++ E
Sbjct: 182 VREVERVLEKKLASLSSEDYTSAGGVDNVVEIINMADRKTEK 223
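
Note on reading the HSP header lines: each alignment block above opens with a line such as "Score = 351 bits (902), Expect = e-123", and the smallest E-values are printed without a leading digit ("e-123" rather than "1e-123"). The following is a minimal Python sketch, not part of PredictBias itself, of how such lines could be parsed and compared with the 0.05 E-value threshold listed in the input parameters. The function names, the regular expression, and the choice of "<=" for the comparison are illustrative assumptions.

```python
import re

# Matches lines like: "Score = 30.2 bits (68), Expect = 0.010"
HSP_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)

# Threshold taken from the report's input parameters; how the tool applies it
# (strict or non-strict comparison) is an assumption here.
E_VALUE_CUTOFF = 0.05


def parse_evalue(text):
    """Convert a BLAST E-value string to float, accepting the 'e-123' form."""
    text = text.strip()
    if text.startswith("e"):
        # float("e-123") raises ValueError, so restore the implied mantissa.
        text = "1" + text
    return float(text)


def parse_hsp_header(line):
    """Return (bit_score, raw_score, e_value), or None if the line does not match."""
    match = HSP_RE.search(line)
    if match is None:
        return None
    return float(match.group(1)), int(match.group(2)), parse_evalue(match.group(3))


if __name__ == "__main__":
    for line in (
        "Score = 351 bits (902), Expect = e-123",
        "Score = 30.2 bits (68), Expect = 0.010",
    ):
        bits, raw, evalue = parse_hsp_header(line)
        print(bits, raw, evalue, evalue <= E_VALUE_CUTOFF)
```

Restoring the implied "1" before calling float() is the only non-obvious step; every other E-value format in this report ("8e-05", "0.010", "0.0") is accepted by float() unchanged.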


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPB14_01695  FLGFLIH       36     8e-05    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 35.9 bits (82), Expect = 8e-05
Identities = 44/207 (21%), Positives = 90/207 (43%), Gaps = 14/207 (6%)

Query: 50 PLEKKAIENDLIDCLLKKTDELSSHLVKLQMQFEKAQEES-KALIENAKNDGYKIGFKEG 108
E I + + L L +LQMQ A E+ +A I + G+K G++EG
Sbjct: 19 QAEFVPIVEPEETIIEEAEPSLEQQLAQLQMQ---AHEQGYQAGIAEGRQQGHKQGYQEG 75

Query: 109 EEKMRNELTHSVNEEKNQLLHAITALDEKMKKSEDHLMALE----KELSAIAIDIAKEVI 164
+ L + E K+Q + + + + + L AL+ L +A++ A++VI
Sbjct: 76 ---LAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRLMQMALEAARQVI 132

Query: 165 LKEVEDSSQKVALALAEELLKNVLDATDIHLKVNPLDYPYLNERLQNASKI---KLESNE 221
+ + + + + L + L + L+V+P D +++ L + +L +
Sbjct: 133 GQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGATLSLHGWRLRGDP 192

Query: 222 AISKGGVMITSSNGSLDGNLMERFKTL 248
+ GG +++ G LD ++ R++ L
Sbjct: 193 TLHPGGCKVSADEGDLDASVATRWQEL 219


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPB14_01705  TCRTETOQM     114    6e-29    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 114 bits (288), Expect = 6e-29
Identities = 54/162 (33%), Positives = 89/162 (54%), Gaps = 7/162 (4%)

Query: 3 NIRNFSIIAHIDHGKSTLADCLISECNAIS---NREMTSQVMDTMDIEKERGITIKAQSV 59
I N ++AH+D GK+TL + L+ AI+ + + + D +E++RGITI+
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61

Query: 60 RLNYTFKGEDYVLNLIDTPGHVDFSYEVSRSLCSCEGALLVVDATQGVEAQTIANTYIAL 119
+F+ E+ +N+IDTPGH+DF EV RSL +GA+L++ A GV+AQT +
Sbjct: 62 ----SFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALR 117

Query: 120 DNNLEILPVINKIDLPNANVLEVKQDIEDTIGIDCSSANEVS 161
+ + INKID ++ V QDI++ + + +V
Sbjct: 118 KMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVE 159



Score = 83.4 bits (206), Expect = 7e-19
Identities = 50/215 (23%), Positives = 90/215 (41%), Gaps = 17/215 (7%)

Query: 161 SAKAKLGIKDLLEKIITTIPAPSGDPNAPLKALIYDSWFDNYLGALALVRIMDGSINTEQ 220
SAK +GI +L+E I + + + L ++ + LA +R+ G ++
Sbjct: 220 SAKNNIGIDNLIEVITNKFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRD 279

Query: 221 EILVMGTGKKHGVLGLYYPNPLKKIPTKSLECGEIGIV---SLGLKSVTDIAVGDTLTDA 277
+ + K + +Y + GEI I+ L L SV +GDT
Sbjct: 280 SVRISEKEKI-KITEMYTSINGELCKIDKAYSGEIVILQNEFLKLNSV----LGDTKLL- 333

Query: 278 KNPTPKPIEGFIPAKPFVFAGLYPIETDRFEDLREALLKLQLNDCALNFEPESSVALGFG 337
P + IE P + + P + + E L +ALL++ +D L + +S+
Sbjct: 334 --PQRERIEN---PLPLLQTTVEPSKPQQREMLLDALLEISDSDPLLRYYVDSATH---E 385

Query: 338 FRVGFLGLLHMEVIKERLEREFGLNLIATAPTVVY 372
+ FLG + MEV L+ ++ + + PTV+Y
Sbjct: 386 IILSFLGKVQMEVTCALLQEKYHVEIEIKEPTVIY 420



Score = 32.5 bits (74), Expect = 0.005
Identities = 16/75 (21%), Positives = 28/75 (37%), Gaps = 2/75 (2%)

Query: 399 IKEPFVRATIITPSEFLGNLMQLLNNKRGIQEKMEYLNQSRVILTYSLPSNEIVMDFYDK 458
+ EP++ I P E+L + L + VIL+ +P+ I ++
Sbjct: 535 LLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQ-LKNNEVILSGEIPARCI-QEYRSD 592

Query: 459 LKSCTKGYASFDYEP 473
L T G + E
Sbjct: 593 LTFFTNGRSVCLTEL 607


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPB14_01720  FLGHOOKAP1    30     0.008    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 30.3 bits (68), Expect = 0.008
Identities = 9/40 (22%), Positives = 16/40 (40%)

Query: 3 NGYYAATGAMATQFNRLDLTSNNLANLNTNGFKRDDAITG 42
+ A + L+ SNN+++ N G+ R I
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMA 41


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPB14_01725  TCRTETB       41     6e-06    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 41.0 bits (96), Expect = 6e-06
Identities = 58/312 (18%), Positives = 106/312 (33%), Gaps = 57/312 (18%)

Query: 37 APYFAKEFTHTNDPTLALISAFLVFMLGFFMRPLGSLFFGKLGDKKGRKTSMVYSIILMA 96
P A +F T + +AF++ G+ +GKL D+ G K +++ II+
Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSI------GTAVYGKLSDQLGIKRLLLFGIIINC 90

Query: 97 LGSFLLALLPTKEIVGEWAFLFLLLARLLQGFSVGGE------YGVVATYLSELGKNGKK 150
GS + VG F L++AR +QG G VVA Y+ + +
Sbjct: 91 FGSVIGF-------VGHSFFSLLIMARFIQG--AGAAAFPALVMVVVARYIPKENRGKAF 141

Query: 151 GFYGSFQYVTLVGGQLLAIFSLFIVENIYTHEQISAFAWRYLFALGGILALLSLFLRNIM 210
G GS + +G + I I+ W YL + I + FL ++
Sbjct: 142 GLIGS---IVAMGEGVGPAIGGMIAHYIH---------WSYLLLIPMITIITVPFLMKLL 189

Query: 211 EETMDNEATPQKETNVNHVKETQRGSLKELLHHKKALMIVFGLTMGGSLCFYTFTVYLKI 270
++ E + ++ + +L + + T + I
Sbjct: 190 KK----EVRIKGHFDIKGI----------ILMSVGIVFFMLFTTSYSISFLIVSVLSFLI 235

Query: 271 FLTNSSSFSPK-------ESSFIMLLALSYFIFLQPLCG---MLADKIKRTQMLMVFAIT 320
F+ + + ++ M+ L I + G M+ +K L I
Sbjct: 236 FVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIG 295

Query: 321 GLIVTPVVFYGI 332
+I+ P I
Sbjct: 296 SVIIFPGTMSVI 307


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPB14_01730  IGASERPTASE   33     0.008    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 32.7 bits (74), Expect = 0.008
Identities = 30/182 (16%), Positives = 61/182 (33%), Gaps = 13/182 (7%)

Query: 202 KENLIDENHNTPNEESFLAIPTPYNTTLNDLEPQEGLVQISPHPPTHYTIYPKRNRFDDL 261
N + +N E+ + T TT N+++ P+ + + R D+
Sbjct: 973 NVNGRYDLYNPEVEKRNQTVDTTNITTPNNIQADV---------PSVPSNNEEIARVDEA 1023

Query: 262 TNPTNPPLKEPKQETKEREPMPTKETLTPATPKPATLKPIISAPVMPASAPIIENDNKTQ 321
P P + E + + AT + V + ++ N
Sbjct: 1024 PVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVK-ANTQT 1082

Query: 322 NHKAPNHPKKEESPQENTQEEMIKE---NLKEEEKETQDAPNFSPVTPTSTKKPVMVKEL 378
N A + + +E+ T+E E K E ++TQ+ P + ++ V+
Sbjct: 1083 NEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQ 1142

Query: 379 SE 380
+E
Sbjct: 1143 AE 1144


19  HPB14_03765  HPB14_03795  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
HPB14_03765  2       13       -0.139145  endonuclease III
HPB14_03770  2       13       -0.897480  flagellar motor switch protein
HPB14_03775  2       13       -1.567191  hypothetical protein
HPB14_03780  1       12       -0.902549  putative siderophore-mediated iron transport
HPB14_03785  -1      12       -0.807501  dihydroorotase
HPB14_03790  0       12       -0.821747  hypothetical protein
HPB14_03795  -1      11       -0.852017  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPB14_03765  OMS28PORIN    28     0.032    OMS28 porin signature.
		>OMS28PORIN#OMS28 porin signature.

Length = 257

Score = 27.8 bits (61), Expect = 0.032
Identities = 28/112 (25%), Positives = 53/112 (47%), Gaps = 11/112 (9%)

Query: 23 NQTTELHHKNPYELLVATILSAQCTDARVNQITPKLFEKYPSVNDLAL-----ASLEEVK 77
N+ E+ K E A ++ + T QI + K P+ +L L A +E+VK
Sbjct: 132 NKVVEMSKKAVQETQKAVSVAGEATFLIEKQI---MLNKSPNNKELELTKEEFAKVEQVK 188

Query: 78 EIIKSVSYFNNKSKHLISMAQKVVRDFKGVIPSTQKELMSLDGVGQKTANVV 129
E + + +++ + AQKV+ G+ PS + ++++ V + +NVV
Sbjct: 189 ETLMASERALDET---VQEAQKVLNMVNGLNPSNKDQVLAKKDVAKAISNVV 237


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPB14_03770  FLGMOTORFLIN  100    1e-30    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 100 bits (250), Expect = 1e-30
Identities = 25/77 (32%), Positives = 47/77 (61%)

Query: 34 LICDYKNLLDMEIVFSAELGSTQIPLLQILRFEKGSVIDLQKPAGESVDTFVNGRVIGKG 93
+ D ++D+ + + ELG T++ + ++LR +GSV+ L AGE +D +NG +I +G
Sbjct: 50 AMQDIDLIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQG 109

Query: 94 EVMVFERNLAIRLNEIL 110
EV+V +R+ +I+
Sbjct: 110 EVVVVADKYGVRITDII 126


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPB14_03780  TONBPROTEIN   49     6e-09    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 49.2 bits (117), Expect = 6e-09
Identities = 24/57 (42%), Positives = 28/57 (49%)

Query: 83 APKPTLAGPQKPPTPPTPPTPPTPPKPIEKPKPEPKPKPKPEPKKPNHKHKALKKVE 139
P P +P P P P P IEKPKP+PKPKPKP K + +K VE
Sbjct: 62 QPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVE 118



Score = 45.0 bits (106), Expect = 1e-07
Identities = 25/70 (35%), Positives = 32/70 (45%), Gaps = 8/70 (11%)

Query: 84 PKPTLAGPQKPPTPPTPPTPPTPPKPIEKPKPEPKPKPKPEPKKPNHKHKALKKVEKVEE 143
P + P +P P P P P P E P KPKPKP+PK K V+KV+E
Sbjct: 57 PPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKP--------KPVKKVQE 108

Query: 144 KKVVEEKKEE 153
+ + K E
Sbjct: 109 QPKRDVKPVE 118



Score = 39.2 bits (91), Expect = 1e-05
Identities = 25/72 (34%), Positives = 31/72 (43%), Gaps = 1/72 (1%)

Query: 87 TLAGPQKPPTPPTPPTPPTPPKPIEKPKPEPKPKPKPEPKKPNHKHKALKKVEKVEEKKV 146
T+ P P PP P E P+PEP P+P E K K K + KKV
Sbjct: 48 TMVTPADLEPPQAVQPPPEPVVEPE-PEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKV 106

Query: 147 VEEKKEEKKIVE 158
E+ K + K VE
Sbjct: 107 QEQPKRDVKPVE 118



Score = 38.0 bits (88), Expect = 2e-05
Identities = 16/54 (29%), Positives = 21/54 (38%)

Query: 74 QDPNKNTPGAPKPTLAGPQKPPTPPTPPTPPTPPKPIEKPKPEPKPKPKPEPKK 127
Q +P P P P PKP KPKP+P K + +PK+
Sbjct: 59 QAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKR 112



Score = 37.3 bits (86), Expect = 4e-05
Identities = 42/218 (19%), Positives = 75/218 (34%), Gaps = 40/218 (18%)

Query: 98 PTPPTPPTPPKPIEKPKPEPKPKPKPEPKKPNHKHKALKKVEKVEEKKVVEEKKEEKKIV 157
P P P +PEP+P+P PEP K VV EK + K
Sbjct: 54 DLEPPQAVQPPPEPVVEPEPEPEPIPEPPKE---------------APVVIEKPKPKPKP 98

Query: 158 EQKVEQKVEQKKIEEKKPVKKEFDPNQLSFLPKEVAPPRQENNKGLDNQTRRDIDELYGE 217
+ K +KV+++ + KPV E P N T +
Sbjct: 99 KPKPVKKVQEQPKRDVKPV--------------ESRPASPFENTAPARLTSSTATAATSK 144

Query: 218 EFGDLGTAEKDFIRNNLRDIGRITQKYLEYPQVAAYLGQDGTNAVEFYLHPNGDITDLKI 277
+ + + RN + YP A L +G V+F + P+G + +++I
Sbjct: 145 PVTSVASGPRALSRNQPQ-----------YPARAQALRIEGQVKVKFDVTPDGRVDNVQI 193

Query: 278 IIGSEYKMLDDNTLKTIQIAYKDYPRPKTKTLIRIRVR 315
+ M + ++ + +P + ++ I +
Sbjct: 194 LSAKPANMFEREVKNAMRRWRYEPGKPGSGIVVNILFK 231



Score = 33.0 bits (75), Expect = 0.001
Identities = 14/56 (25%), Positives = 22/56 (39%)

Query: 74 QDPNKNTPGAPKPTLAGPQKPPTPPTPPTPPTPPKPIEKPKPEPKPKPKPEPKKPN 129
+P P+P P++ P P P PKP K + +PK +P +
Sbjct: 65 PEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESR 120



Score = 31.1 bits (70), Expect = 0.005
Identities = 12/52 (23%), Positives = 16/52 (30%)

Query: 75 DPNKNTPGAPKPTLAGPQKPPTPPTPPTPPTPPKPIEKPKPEPKPKPKPEPK 126
+P P + P P P P K E+PK + KP
Sbjct: 72 EPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESRPAS 123


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPB14_03795  TYPE3IMSPROT  31     0.002    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 31.3 bits (71), Expect = 0.002
Identities = 19/64 (29%), Positives = 29/64 (45%), Gaps = 4/64 (6%)

Query: 88 LQSYSVMLFFNLLLLTDILGFLPFSIYHHFMASLIFSALFCSSLFLSSPLLGVIALVALS 147
L Y F L+L+ +LPFS S + + +L PLL V AL+A++
Sbjct: 45 LSDYYFEHFSKLMLIPAEQSYLPFSQ----ALSYVVDNVLLEFFYLCFPLLTVAALMAIA 100

Query: 148 SSFL 151
S +
Sbjct: 101 SHVV 104


20  HPB14_03965  HPB14_04000  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
HPB14_03965  3       13       -3.365991  regulator of nonsense transcripts 1
HPB14_03970  0       10       -1.994300  hypothetical protein
HPB14_03975  -2      9        -1.518754  hypothetical protein
HPB14_03990  0       10       -2.772151  conserved hypothetical secreted protein
HPB14_03995  0       9        -1.720616  GTPase Era
HPB14_04000  1       9        -1.788746  ATP-dependent protease ATP-binding subunit HslU
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPB14_03965  GPOSANCHOR    35     0.001    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 34.7 bits (79), Expect = 0.001
Identities = 19/162 (11%), Positives = 52/162 (32%), Gaps = 7/162 (4%)

Query: 25 ETEKEKERQNTLKKDIKDYTYKVQQAKKRHKHQQTLKIADHCLNIIQRYLQEIERLQQEN 84
++ E + L+ + T + K + +++ L+ +
Sbjct: 188 LEARQAELEKALEGAMNFST---ADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTAD 244

Query: 85 KETPTQLEAVLEQIQQEMKSLPIQQKHALQIYYNRFEDVQRRTLAYLDYLSRLKTQLQEK 144
LEA ++ L + AL+ N + + L+ + +
Sbjct: 245 SAKIKTLEAEKAALEARQAEL----EKALEGAMNFSTADSAKIKTLEAEKAALEAEKADL 300

Query: 145 EKRLKSDLGEIKKLKQKVKENEKDIKKHQEALEQFEEWRDES 186
E + + + L++ + + + K+ + ++ EE S
Sbjct: 301 EHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKIS 342


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPB14_03975  ACRIFLAVINRP  32     7e-04    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 31.7 bits (72), Expect = 7e-04
Identities = 23/101 (22%), Positives = 37/101 (36%), Gaps = 10/101 (9%)

Query: 20 SYGQYRAAKEQTKQLEIICNTL---RKQYENLLIELRLDKQKLRL------QLAQDLEKI 70
G A Q II T +++ + + + D +RL +L + +
Sbjct: 218 QLGGTPALPGQQLNASIIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNV 277

Query: 71 DARIR-KNADKMHLLKLAYENSFMVLKCIKEHLDEYEKKFP 110
ARI K A + + N+ K IK L E + FP
Sbjct: 278 IARINGKPAAGLGIKLATGANALDTAKAIKAKLAELQPFFP 318


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPB14_03995  PF03944       32     0.003    delta endotoxin
		>PF03944#delta endotoxin

Length = 633

Score = 32.0 bits (72), Expect = 0.003
Identities = 25/94 (26%), Positives = 47/94 (50%), Gaps = 3/94 (3%)

Query: 68 LHHQEKLLNQCMLSQALKAMGDAELCVFLASVHDDLKGYEEFLNLCQKPHILALSKIDMA 127
L E+ LNQ + + + A +AEL A+V + + + FLN + L+++
Sbjct: 94 LRETERFLNQRLNTDTV-ARVNAELTGLQANVEEFNRQVDNFLNPNRNAVPLSITSSVNT 152

Query: 128 THKQVLQKLQEYQKYASQFVDLVPLSAKKSQNLN 161
+ L +L ++Q Q + L+PL A+ + NL+
Sbjct: 153 MQQLFLNRLPQFQMQGYQLL-LLPLFAQAA-NLH 184


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPB14_04000  HTHFIS        29     0.047    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 28.6 bits (64), Expect = 0.047
Identities = 11/36 (30%), Positives = 18/36 (50%), Gaps = 3/36 (8%)

Query: 48 TPKNILMIGSTGVGKTEIARRI---AKIMELPFVKV 80
T +++ G +G GK +AR + K PFV +
Sbjct: 159 TDLTLMITGESGTGKELVARALHDYGKRRNGPFVAI 194


21  HPB14_04305  HPB14_04340  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
HPB14_04305  0       11       1.157886   cysteinyl-tRNA synthetase
HPB14_04310  1       11       1.605572   vacuolating cytotoxin
HPB14_04315  0       15       0.461858   lipopolysaccharide 1,2-glycosyltransferase
HPB14_04320  0       16       1.131310   IRON(III) dicitrate transport system ATP-binding
HPB14_04325  1       17       3.170966   iron(III) dicitrate ABC transporter permease
HPB14_04330  2       17       2.627580   short-chain oxidoreductase
HPB14_04335  0       19       3.245102   hypothetical protein
HPB14_04340  0       21       3.409978   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPB14_04305  OMS28PORIN    30     0.019    OMS28 porin signature.
		>OMS28PORIN#OMS28 porin signature.

Length = 257

Score = 29.8 bits (66), Expect = 0.019
Identities = 13/37 (35%), Positives = 25/37 (67%)

Query: 309 EEDLLVSKKRLDKIYRLKQRVLGTLGGINPNFKKEIL 345
+E L+ S++ LD+ + Q+VL + G+NP+ K ++L
Sbjct: 188 KETLMASERALDETVQEAQKVLNMVNGLNPSNKDQVL 224


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPB14_04310  VACCYTOTOXIN  1954   0.0      Helicobacter pylori vacuolating cytotoxin signature.
		>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature.

Length = 1291

Score = 1954 bits (5063), Expect = 0.0
Identities = 1155/1258 (91%), Positives = 1199/1258 (95%), Gaps = 6/1258 (0%)

Query: 1 MIIPAIVGGIATGTAVGTVSGLLGWGLKQAEEANKTPDKPDKVWRIQAGRGFNEFPNKEY 60
+IIPAIVGGIATG AVGTVSGLLGWGLKQAEEANKTPDKPDKVWRIQAG+GFNEFPNKEY
Sbjct: 39 VIIPAIVGGIATGAAVGTVSGLLGWGLKQAEEANKTPDKPDKVWRIQAGKGFNEFPNKEY 98

Query: 61 DLYQSLLSSKIDGGWDWGNAARHYWVKGGQQNKLEVDMKDAVGTYKLSGLRNFTGGDLDV 120
DLY+SLLSSKIDGGWDWGNAARHYWVK GQ NKLEVDM++AVGTY LSGL NFTGGDLDV
Sbjct: 99 DLYKSLLSSKIDGGWDWGNAARHYWVKDGQWNKLEVDMQNAVGTYNLSGLINFTGGDLDV 158

Query: 121 NMQKATLRLGQFNGNSFTSYKDSADRTTRVDFNAKNISIDNFVEINNRVGSGAGRKASST 180
NMQKATLRLGQFNGNSFTSYKDSADRTTRVDFNAKNI IDNF+EINNRVGSGAGRKASST
Sbjct: 159 NMQKATLRLGQFNGNSFTSYKDSADRTTRVDFNAKNILIDNFLEINNRVGSGAGRKASST 218

Query: 181 VLTLQASEGITSSKNAEISLYDGATLNLASNSVKLMGNVWMGRLQYVGAYLAPSYSTINT 240
VLTLQASEGITS +NAEISLYDGATLNLASNSVKLMGNVWMGRLQYVGAYLAPSYSTINT
Sbjct: 219 VLTLQASEGITSRENAEISLYDGATLNLASNSVKLMGNVWMGRLQYVGAYLAPSYSTINT 278

Query: 241 SKVTGEVNFNHLTVGDKNAAQAGIIASNKTHIGTLDLWQSAGLNIIAPPEGGYKNQTNNT 300
SKVTGEVNFNHLTVGD NAAQAGIIASNKTHIGTLDLWQSAGLNIIAPPEGGYK++ N+
Sbjct: 279 SKVTGEVNFNHLTVGDHNAAQAGIIASNKTHIGTLDLWQSAGLNIIAPPEGGYKDKPNDK 338

Query: 301 PSQSGAKNDKNESAKNDKQESSQNNSNTQVINPPNSTQKTEIQPTQVIDGPFAGGKNTVV 360
PS + N AKNDKQESSQNNSNTQVINPPNS QKTEIQPTQVIDGPFAGGKNTVV
Sbjct: 339 PSNTTQNN-----AKNDKQESSQNNSNTQVINPPNSAQKTEIQPTQVIDGPFAGGKNTVV 393

Query: 361 NIDRINTNADGTIKVGGYKASLTTNAAHLHIGKGGVNLSNQASGRSLLVENLTGNITVDG 420
NI+RINTNADGTI+VGG+KASLTTNAAHLHIGKGG+NLSNQASGRSLLVENLTGNITVDG
Sbjct: 394 NINRINTNADGTIRVGGFKASLTTNAAHLHIGKGGINLSNQASGRSLLVENLTGNITVDG 453

Query: 421 PLRVNNQVGGYALAGSSANFEFKAGTDTKNGTATFNNDISLGRFVNLKVDAHTANFKGID 480
PLRVNNQVGGYALAGSSANFEFKAGTDTKNGTATFNNDISLGRFVNLKVDAHTANFKGID
Sbjct: 454 PLRVNNQVGGYALAGSSANFEFKAGTDTKNGTATFNNDISLGRFVNLKVDAHTANFKGID 513

Query: 481 TGNGGFNTLDFSGVTGKVNINKLITASTNVAVKNFNINELIVKTNGVSVGEYTHFSEDIG 540
TGNGGFNTLDFSGVT KVNINKLITASTNVAVKNFNINEL+VKTNGVSVGEYTHFSEDIG
Sbjct: 514 TGNGGFNTLDFSGVTNKVNINKLITASTNVAVKNFNINELVVKTNGVSVGEYTHFSEDIG 573

Query: 541 SQSRINTVRLETGTRSIFSGGVKFKSGEKLVIDEFYYSPWNYFDARNVKNVEITKKFASS 600
SQSRINTVRLETGTRSI+SGGVKFK GEKLVI++FYY+PWNYFDARN+KNVEIT K A
Sbjct: 574 SQSRINTVRLETGTRSIYSGGVKFKGGEKLVINDFYYAPWNYFDARNIKNVEITNKLAFG 633

Query: 601 TPENPWGTSKLMFNNLTLGQNAVMDYSQFSNLTIQGDFINNQGTINYLVRGGKVATLNVG 660
+PWGT+KLMFNNLTLGQNAVMDYSQFSNLTIQGDF+NNQGTINYLVRGG+VATLNVG
Sbjct: 634 PQGSPWGTAKLMFNNLTLGQNAVMDYSQFSNLTIQGDFVNNQGTINYLVRGGQVATLNVG 693

Query: 661 NAAAMMFNNDIDSTTGFYKPLIKINSAQDLIKNTEHVLLKAKIIGYGNVSTGTNGISNVN 720
NAAAM F+N++DS TGFY+PL+KINSAQDLIKN EHVLLKAKIIGYGNVS GT+ I+NVN
Sbjct: 694 NAAAMFFSNNVDSATGFYQPLMKINSAQDLIKNKEHVLLKAKIIGYGNVSAGTDSIANVN 753

Query: 721 LEEQFKERLALYNNNNRMDTCVVRNTDDIKACGMAIGNQSMVNNPDNYKYLIGKAWKNIG 780
L EQFKERLALYNNNNRMD CVVRNTDDIKACG AIGNQSMVNNP+NYKYL GKAWKNIG
Sbjct: 754 LIEQFKERLALYNNNNRMDICVVRNTDDIKACGTAIGNQSMVNNPENYKYLEGKAWKNIG 813

Query: 781 ISKTANGSKISVYYLGNSTPSENGGNTTNLPTNTTNNARSANYALVKNAPFA-HSATPNL 839
ISKTANGSKISV+YLGNSTP+ENGGNTTNLPTNTTN R A+YAL+KNAPFA +SATPNL
Sbjct: 814 ISKTANGSKISVHYLGNSTPTENGGNTTNLPTNTTNKVRFASYALIKNAPFARYSATPNL 873

Query: 840 VAINQHDFGTIESVFELANRSKDIDTLYTHSGAQGRDLLQTLLIDSHDAGYARQMIDNTS 899
VAINQHDFGTIESVFELANRS DIDTLY +SGAQGRDLLQTLLIDSHDAGYAR MID TS
Sbjct: 874 VAINQHDFGTIESVFELANRSNDIDTLYANSGAQGRDLLQTLLIDSHDAGYARTMIDATS 933

Query: 900 TGEITKQLNAATTTLNNIASLEHKTSGLQTLSLSNAMILNSRLVNLSRRHTNNINSFAQR 959
EITKQLN ATTTLNNIASLEHKTSGLQTLSLSNAMILNSRLVNLSRRHTN+I+SFA+R
Sbjct: 934 ANEITKQLNTATTTLNNIASLEHKTSGLQTLSLSNAMILNSRLVNLSRRHTNHIDSFAKR 993

Query: 960 LQALKGQRFASLESAAEVLYQFAPKYEKPTNVWANAIGGASLNNGSNASLYGTSAGVDAY 1019
LQALK QRFASLESAAEVLYQFAPKYEKPTNVWANAIGG SLN+G NASLYGTSAGVDAY
Sbjct: 994 LQALKDQRFASLESAAEVLYQFAPKYEKPTNVWANAIGGTSLNSGGNASLYGTSAGVDAY 1053

Query: 1020 LNGNVEAIVGGFGSYGYSSFSNQANSLNSGANNTNFGVYSRIFANQHEFDFEAQGALGSD 1079
LNG VEAIVGGFGSYGYSSFSNQANSLNSGANNTNFGVYSRIFANQHEFDFEAQGALGSD
Sbjct: 1054 LNGEVEAIVGGFGSYGYSSFSNQANSLNSGANNTNFGVYSRIFANQHEFDFEAQGALGSD 1113

Query: 1080 QSSLNFKSALLRDLNQSYNYLAYSAATRASYGYDFAFFRNALVLKPSVGVSYNHLGSTNF 1139
QSSLNFKSALLRDLNQSYNYLAYSAATRASYGYDFAFFRNALVLKPSVGVSYNHLGSTNF
Sbjct: 1114 QSSLNFKSALLRDLNQSYNYLAYSAATRASYGYDFAFFRNALVLKPSVGVSYNHLGSTNF 1173

Query: 1140 KSNSTNKVALSNGSSSQHLFNASANVEARYYYGDTSYFYMNAGVLQEFANFGSSNAVSLN 1199
KSNS KVAL NG+SSQHLFNASANVEARYYYGDTSYFYMNAGVLQEFANFGSSNAVSLN
Sbjct: 1174 KSNSNQKVALKNGASSQHLFNASANVEARYYYGDTSYFYMNAGVLQEFANFGSSNAVSLN 1233

Query: 1200 TFKVNATHNPLNTHARVMMGGELKLAKEVFLNLGFVYLHNLISNASHFASNLGMRYSF 1257
TFKVNAT NPLNTHARVMMGGELKLAKEVFLNLGFVYLHNLISN HFASNLGMRYSF
Sbjct: 1234 TFKVNATRNPLNTHARVMMGGELKLAKEVFLNLGFVYLHNLISNIGHFASNLGMRYSF 1291
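
Note on the percentages in the alignment summaries: in the line "Identities = 1155/1258 (91%), Positives = 1199/1258 (95%), Gaps = 6/1258 (0%)" the identity fraction is about 91.8%, yet 91% is printed. The values shown throughout this report are consistent with integer truncation rather than rounding. The Python sketch below, offered as an illustration rather than as the tool's own code, reproduces the printed figures with floor division; the function name and the regular expression are assumptions introduced for the example.

```python
import re

# Captures "<label> = <count>/<alignment length>" for the three summary fields.
COUNT_RE = re.compile(r"(Identities|Positives|Gaps)\s*=\s*(\d+)/(\d+)")


def parse_counts(line):
    """Return {label: (count, length, truncated_percent)} for a summary line."""
    result = {}
    for label, count, length in COUNT_RE.findall(line):
        n, d = int(count), int(length)
        # Floor division mirrors the truncated percentages seen in the report.
        result[label] = (n, d, n * 100 // d)
    return result


if __name__ == "__main__":
    summary = (
        "Identities = 1155/1258 (91%), Positives = 1199/1258 (95%), "
        "Gaps = 6/1258 (0%)"
    )
    for label, (n, d, pct) in parse_counts(summary).items():
        print(f"{label}: {n}/{d} = {pct}%")
```

When an exact identity fraction is needed, for example to rank hits, it is safer to recompute it from the raw counts than to rely on the truncated percentage.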


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPB14_04330  DHBDHDRGNASE  93     2e-24    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 92.8 bits (230), Expect = 2e-24
Identities = 58/233 (24%), Positives = 102/233 (43%), Gaps = 10/233 (4%)

Query: 14 KVAVITGASSGIGLECALMLLDQGYKVYALSRHATLCVALNHALC------ECVDIDVSD 67
K+A ITGA+ GIG A L QG + A+ + + +L E DV D
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 68 SNALKEVFLNISAKEDHCDVLINSAGYGVFGSVEDTPIEEVKKQFGVNFFALCEVVQFCL 127
S A+ E+ I + D+L+N AG G + EE + F VN + +
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 128 PLLKNKPHSKIFNLSSIAGRVSMLFLGHYSASKHALEAYSDALRLEFKPFNIQVCLIEPG 187
+ ++ I + S V + Y++SK A ++ L LE +NI+ ++ PG
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 188 PVKSNWEKTAFENDERKDSVYALDLEKTKNFYSGVYQNALS-PKAVAQKIVFL 239
+++ + + + ++ + V LE F +G+ L+ P +A ++FL
Sbjct: 189 STETDMQWSLWADENGAEQVIKGSLET---FKTGIPLKKLAKPSDIADAVLFL 238


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPB14_04340  PF01206       26     0.017    SirA family protein
		>PF01206#SirA family protein

Length = 76

Score = 25.9 bits (57), Expect = 0.017
Identities = 9/43 (20%), Positives = 21/43 (48%), Gaps = 7/43 (16%)

Query: 83 IPNLETQQAMREALNGENLEVI-------EDFSAWANEIKKEV 118
+P L+ ++ + GE L V+ +DF +++ + E+
Sbjct: 17 LPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHEL 59


22  HPB14_04385  HPB14_04420  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
HPB14_04385  0       12       1.641321   acetate kinase
HPB14_04390  0       12       1.071923   acetate kinase A/propionate kinase 2
HPB14_04405  1       14       0.382371   phosphotransacetylase
HPB14_04410  0       15       0.322839   hypothetical protein
HPB14_04415  -1      15       1.297460   flagellar basal body rod modification protein
HPB14_04420  -1      14       1.794269   flagellar hook protein FlgE
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPB14_04385  ACETATEKNASE  121    3e-36    Acetate kinase family signature.
		>ACETATEKNASE#Acetate kinase family signature.

Length = 400

Score = 121 bits (304), Expect = 3e-36
Identities = 46/117 (39%), Positives = 70/117 (59%), Gaps = 2/117 (1%)

Query: 1 MRNIEARK-EKGDKEAKLAFEMCTYRIKKYIGAYMVVLKKVDAIIFTGGLGENYSALRES 59
R++E + GDK A+LA + YR+KK IG+Y + VD I+FT G+GEN +RE
Sbjct: 283 FRDLEDAAFKNGDKRAQLALNVFAYRVKKTIGSYAAAMGGVDVIVFTAGIGENGPEIREF 342

Query: 60 VCEGLENLGIALHKPTNDNPGNGLVDLSQPNTKVQILRIPTDKELEIALQTKKVLEK 116
+ +GLE LG L K N G +S ++KV ++ +PT++E IA T+K++E
Sbjct: 343 ILDGLEFLGFKLDKEKNKVRGEE-AIISTADSKVNVMVVPTNEEYMIAKDTEKIVES 398


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPB14_04390  ACETATEKNASE  361    e-127    Acetate kinase family signature.
		>ACETATEKNASE#Acetate kinase family signature.

Length = 400

Score = 361 bits (929), Expect = e-127
Identities = 145/282 (51%), Positives = 196/282 (69%), Gaps = 6/282 (2%)

Query: 1 MEILVLNLGSSSIKFKLFDMKENKPLASGLAEKIGEEIGQLKIKSHLHHNDQELKEKLVI 60
M+ILV+N GSSS+K++L + K+ LA GLAE+IG L N +++K K +
Sbjct: 1 MKILVINCGSSSLKYQLIESKDGNVLAKGLAERIGINDSLLTHN----ANGEKIKIKKDM 56

Query: 61 KDHASGLLMIRENLT--KMGIIKDFNQIDAIGHRVVQGGDKFHAPVLVDEKVMQEIGNLS 118
KDH + ++ + L G+IKD ++IDA+GHRVV GG+ F + VL+ + V++ I +
Sbjct: 57 KDHKDAIKLVLDALVNSDYGVIKDMSEIDAVGHRVVHGGEYFTSSVLITDDVLKAITDCI 116

Query: 119 ILAPLHNPANLAGIEFVKKAHPHIPQIAVFDTAFHATMPSYAYMYALPYELYEKYQIRRY 178
LAPLHNPAN+ GI+ + P +P +AVFDTAFH TMP YAY+Y +PYE Y KY+IR+Y
Sbjct: 117 ELAPLHNPANIEGIKACTQIMPDVPMVAVFDTAFHQTMPDYAYLYPIPYEYYTKYKIRKY 176

Query: 179 GFHGTSHHYVAKEAAKFLNIPYEEFNAISLHLGNGSSAAAIQKGKSVDTSMGLTPLEGLI 238
GFHGTSH YV++ AA+ LN P E I+ HLGNGSS AA++ GKS+DTSMG TPLEGL
Sbjct: 177 GFHGTSHKYVSQRAAEILNKPIESLKIITCHLGNGSSIAAVKNGKSIDTSMGFTPLEGLA 236

Query: 239 MGTRCGDIDPTVVEYIAQCANKSLEEVIKILNHESGLKGICG 280
MGTR G IDP+++ Y+ + N S EEV+ ILN +SG+ GI G
Sbjct: 237 MGTRSGSIDPSIISYLMEKENISAEEVVNILNKKSGVYGISG 278


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPB14_04410  IGASERPTASE   37     2e-04    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 36.6 bits (84), Expect = 2e-04
Identities = 36/225 (16%), Positives = 67/225 (29%), Gaps = 9/225 (4%)

Query: 284 KKSEKTPIHAKTQTTAQATTPENAPKIPLKTPPLMPLIGANPPPNDNIPTPLEKEEKTQE 343
+ Q + N + P+ P A TP E E E
Sbjct: 992 VDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPA---------TPSETTETVAE 1042

Query: 344 ISENKEKTKETSNSAQSAQNTQASDKTSENKSITPKETIKHFTQQLKQEIQEYKPPMSKI 403
S+ + KT E + + Q + E KS T + Q E +E + +K
Sbjct: 1043 NSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKE 1102

Query: 404 SMDLFPKELGKVEVIIQKVGKNLKVSVISHNNSLQTFLDNQQDLKNSLNALGFEGVDLSF 463
+ + +E KVE + + V +T + + + + +
Sbjct: 1103 TATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQT 1162

Query: 464 SQDSSKEQEKEPFKEPFKDQELTPLKENALKSYQENTDHENQETS 508
+ + EQ + + N S EN ++ T+
Sbjct: 1163 NTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATT 1207



Score = 32.0 bits (72), Expect = 0.008
Identities = 44/273 (16%), Positives = 85/273 (31%), Gaps = 30/273 (10%)

Query: 2 PSPINPI-QTNANALNGGAKNEDTKNAPKSASKDFSKILNQKISKDKTASKESPNPNALK 60
P+P P T A N +++ + + A++ + + S N +
Sbjct: 1028 PAPATPSETTETVAENSKQESKTVEKNEQDATE---TTAQNREVAKEAKSNVKANTQTNE 1084

Query: 61 ATPKDAKALEKTPTPHHQHAKDLVKDQQAPTLKDWLNHPK-THPTAPHETQHETHEHETN 119
++ E T + A +++ + PK T +P + Q ET + +
Sbjct: 1085 VAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAE 1144

Query: 120 PKTPNETLNKNEKKPNGVTSNAHQTNLASKNP--------------------ITPNHANN 159
P N+ ++ + + A A + P +
Sbjct: 1145 PARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTP 1204

Query: 160 IIKNPTAPTDTKKEPKTLKDIQTLSQKHDLNASNIQAATTPENKNPLNASDHLALKTTQT 219
PT +++ +PK S H N++ ATT N A L T
Sbjct: 1205 ATTQPTVNSESSNKPKNRHRRSVRSVPH-----NVEPATTSSNDRSTVALCDLTSTNTNA 1259

Query: 220 PTNHTLAKNDAKNTANLSSVLQSLEKKESHNKE 252
+ AK +V Q + + E +N+
Sbjct: 1260 VLSDARAKAQFVALNVGKAVSQHISQLEMNNEG 1292


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPB14_04420  FLGHOOKAP1    35     7e-04    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 35.3 bits (81), Expect = 7e-04
Identities = 12/33 (36%), Positives = 20/33 (60%)

Query: 2 NDTLLNAYSGIKTHQFGIDSLSNNIANVNTLGY 34
+ + NA SG+ Q +++ SNNI++ N GY
Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGY 33



Score = 33.0 bits (75), Expect = 0.004
Identities = 10/48 (20%), Positives = 20/48 (41%)

Query: 557 IRHKYLETSNVNAGNALTNLILMQRGYSMNARAFGAGDDMIKEAISLK 604
+ ++ S VN NL Q+ Y NA+ + + I+++
Sbjct: 499 LSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546


23  HPB14_04915  HPB14_04945  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
HPB14_04915  -1      13       -0.477003  hypothetical protein
HPB14_04920  -1      12       0.725514   D-3-phosphoglycerate dehydrogenase
HPB14_04925  -1      12       0.407234   3-octaprenyl-4-hydroxybenzoate carboxy-lyase
HPB14_04930  -1      13       0.469775   hypothetical protein
HPB14_04935  -1      13       0.554441   UDP-2,3-diacylglucosamine hydrolase
HPB14_04940  0       14       0.762198   CheA-MCP interaction modulator
HPB14_04945  -2      12       0.601718   autophosphorylating histidine kinase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPB14_04915  V8PROTEASE    27     0.033    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 27.3 bits (60), Expect = 0.033
Identities = 13/52 (25%), Positives = 24/52 (46%)

Query: 46 TNEGLSQTDAKSHEINLEESPNNPNTPNDEKASHNEEDRNNALSQNLDAQDS 97
N+ N ++PNNP+ PN+ +N ++ +N + N D D+
Sbjct: 284 ANDDQPNNPDNPDNPNNPDNPNNPDEPNNPDNPNNPDNPDNGDNNNSDNPDA 335


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPB14_04930  ALARACEMASE   32     0.001    Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 32.1 bits (73), Expect = 0.001
Identities = 9/43 (20%), Positives = 16/43 (37%), Gaps = 1/43 (2%)

Query: 136 GVVPEEALEIYSQISETCKRLKLKGLMCIGAHADDEKKIEKSF 178
G P+ L ++ Q+ + LM A A+ I +
Sbjct: 132 GFQPDRVLTVWQQL-RAMANVGEMTLMSHFAEAEHPDGISGAM 173


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPB14_04940  HTHFIS        60     4e-12    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 59.8 bits (145), Expect = 4e-12
Identities = 29/129 (22%), Positives = 50/129 (38%), Gaps = 13/129 (10%)

Query: 181 GEVLFLDDSRTARKTLKNHLSKLGFSITEAVDGEDGLDKLEMLFKKYGDDLRKHLKFIIS 240
+L DD R L LS+ G+ + + + +++
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIA----------AGDGDLVVT 53

Query: 241 DVEMPKMDGYHFLFKLQKDPRFAYIPVIFNSSICDNYSAERAKEMGAVAYLVK-FDAEKF 299
DV MP + + L +++K +PV+ S+ +A +A E GA YL K FD +
Sbjct: 54 DVVMPDENAFDLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTEL 111

Query: 300 TEEISKILD 308
I + L
Sbjct: 112 IGIIGRALA 120


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPB14_04945  HTHFIS        56     4e-10    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 55.6 bits (134), Expect = 4e-10
Identities = 24/121 (19%), Positives = 55/121 (45%), Gaps = 4/121 (3%)

Query: 680 VLAIDDSSTDRAIIRKCLKPLGITLLEATNGLEGLEMLKNGDKIPDAILVDIEMPKMDGY 739
+L DD + R ++ + L G + +N + GD D ++ D+ MP + +
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD--GDLVVTDVVMPDENAF 63

Query: 740 TFASEVRKYNKFKNLPLIAVTSRVTKTDRMRGVESGMTEYITKPYSGEYLTTVVKRSIKL 799
++K +LP++ ++++ T ++ E G +Y+ KP+ L ++ R++
Sbjct: 64 DLLPRIKKAR--PDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 800 E 800

Sbjct: 122 P 122


24  HPB14_06955  HPB14_06990  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
HPB14_06955  0       16       0.387447   membrane protein insertase
HPB14_06960  0       14       0.357738   spoIIIJ-associated protein
HPB14_06965  0       13       0.814458   tRNA modification GTPase TrmE
HPB14_06970  2       11       1.441157   hypothetical protein
HPB14_06975  1       15       0.687369   hypothetical protein
HPB14_06980  -2      14       0.881439   hypothetical protein
HPB14_06985  -1      13       1.832128   hypothetical protein
HPB14_06990  -2      11       2.006397   membrane-associated lipoprotein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPB14_06955  60KDINNERMP   430    e-148    60kDa inner membrane protein signature.
		>60KDINNERMP#60kDa inner membrane protein signature.

Length = 548

Score = 430 bits (1106), Expect = e-148
Identities = 166/575 (28%), Positives = 287/575 (49%), Gaps = 69/575 (12%)

Query: 10 RLILAIALSFLFIALYSYFFQKPNKTTTQTTKQETTNNHTTTSPNAPNAQHFGTTQTTPQ 69
R +L IAL F+ ++ Q + + + T TTT+ + Q + Q
Sbjct: 5 RNLLVIALLFVSFMIW----QAWEQDKNPQPQAQQTTQTTTTAAGSAADQG---VPASGQ 57

Query: 70 ENLLSTISFEHARIEIDSLG-RIKQVYLKDKKYLTPKQKGFLEHVG--HLFSSKEN---- 122
L+ ++ + + I++ G ++Q L P L L +
Sbjct: 58 GKLI-SVKTDVLDLTINTRGGDVEQALL-------PAYPKELNSTQPFQLLETSPQFIYQ 109

Query: 123 AQPPL--KEFPLLAADKLKPLEVRFLDPTLNNKAFNTPYSASKTTLGPNEQLV--LTQDL 178
AQ L ++ P A+ +PL +N A G NE V D
Sbjct: 110 AQSGLTGRDGPDNPANGPRPL-------------YNVEKDAYVLAEGQNELQVPMTYTDA 156

Query: 179 GALSIIKTLTF----YD-DLHYDLKIAFKSPDNLIPSYVITNGYRPVADLDS-------Y 226
+ KT Y +++Y+++ A + P + + LD+ +
Sbjct: 157 AGNTFTKTFVLKRGDYAVNVNYNVQNAGEKPLEISSFGQLKQSITLPPHLDTGSSNFALH 216

Query: 227 TFSGVLLENNDKKIEKIE---DKDAKEIKRFSNTLFLSSVDRYFTTLLFTKDPQGFEALI 283
TF G D+K EK + D + + S +++ + +YF T + G
Sbjct: 217 TFRGAAYSTPDEKYEKYKFDTIADNENLNISSKGGWVAMLQQYFATAWIPHN-DGTNNFY 275

Query: 284 DSEIGTKNPLGFISLKNEA-----------NLHGYIGPKDYRSLKAISPMLTDVIEYGLI 332
+ +G N + I K++ N ++GP+ + A++P L ++YG +
Sbjct: 276 TANLG--NGIAAIGYKSQPVLVQPGQTGAMNSTLWVGPEIQDKMAAVAPHLDLTVDYGWL 333

Query: 333 TFFAKGVFVLLDYLYQFVGNWGWAIILLTIIVRIILYPLSYKGMVSMQKLKELAPKMKEL 392
F ++ +F LL +++ FVGNWG++II++T IVR I+YPL+ SM K++ L PK++ +
Sbjct: 334 WFISQPLFKLLKWIHSFVGNWGFSIIIITFIVRGIMYPLTKAQYTSMAKMRMLQPKIQAM 393

Query: 393 QEKYKGEPQKLQAHMMQLYKKHGANPLGGCLPLILQIPVFFAIYRVLYNAVELKNSEWIL 452
+E+ + Q++ MM LYK NPLGGC PL++Q+P+F A+Y +L +VEL+ + + L
Sbjct: 394 RERLGDDKQRISQEMMALYKAEKVNPLGGCFPLLIQMPIFLALYYMLMGSVELRQAPFAL 453

Query: 453 WIHDLSIMDPYFILPLLMGASMYWHQSVTPNTMTDPMQAKIFKLLPLLFTIFLITFPAGL 512
WIHDLS DPY+ILP+LMG +M++ Q ++P T+TDPMQ KI +P++FT+F + FP+GL
Sbjct: 454 WIHDLSAQDPYYILPILMGVTMFFIQKMSPTTVTDPMQQKIMTFMPVIFTVFFLWFPSGL 513

Query: 513 VLYWTTNNILSVLQQLIINKVLENKKRMHAQNKKE 547
VLY+ +N+++++QQ +I + LE K+ +H++ KK+
Sbjct: 514 VLYYIVSNLVTIIQQQLIYRGLE-KRGLHSREKKK 547


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
HPB14_06960  IGASERPTASE  30     0.009    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 30.4 bits (68), Expect = 0.009
Identities = 20/59 (33%), Positives = 28/59 (47%), Gaps = 7/59 (11%)

Query: 54 AGVKESVKEVKEESVKETNTKENHQNNIEEKKQKLETETPQEE-KITPKPPKKNLKEES 111
A KE + KET T E +E+K K+ETE QE K+T + K + E+
Sbjct: 1086 AQSGSETKETQTTETKETATVE------KEEKAKVETEKTQEVPKVTSQVSPKQEQSET 1138


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
HPB14_06965  TCRTETOQM  33     0.004    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 32.5 bits (74), Expect = 0.004
Identities = 34/134 (25%), Positives = 55/134 (41%), Gaps = 25/134 (18%)

Query: 216 LSIVGKPNAGKSSLLNAMLLEERA---LVSDIKGTTR-DTIEE-------------VIEL 258
+ ++ +AGK++L ++L A L S KGTTR D +
Sbjct: 6 IGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQW 65

Query: 259 QGHKVRLIDTAGIRESADKIERLGIEKSLKSLENCDLILGVFDLSKPLEKEDFNLIDALN 318
+ KV +IDT G + ++ R SL L+ L++ D + + L AL
Sbjct: 66 ENTKVNIIDTPGHMDFLAEVYR-----SLSVLDGAILLISAKDGVQAQTR---ILFHALR 117

Query: 319 RAKKPCIVVLNKND 332
+ P I +NK D
Sbjct: 118 KMGIPTIFFINKID 131


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPB14_06980  BINARYTOXINB  30     0.009    Binary toxin B family signature.
		>BINARYTOXINB#Binary toxin B family signature.

Length = 764

Score = 30.4 bits (68), Expect = 0.009
Identities = 14/60 (23%), Positives = 22/60 (36%)

Query: 155 SKSMGDLLAKAMPIERILKAYSVPVGSLENYEKIYYQNAFKPKVQITFDNNSDTEIKNAL 214
+ + D L P + +A + G E + YQ + FD + IKN L
Sbjct: 536 AVNPSDPLETTKPDMTLKEALKIAFGFNEPNGNLQYQGKDITEFDFNFDQQTSQNIKNQL 595


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
HPB14_06990  LIPOLPP20  293    e-105    LPP20 lipoprotein precursor signature.
		>LIPOLPP20#LPP20 lipoprotein precursor signature.

Length = 175

Score = 293 bits (752), Expect = e-105
Identities = 174/175 (99%), Positives = 175/175 (100%)

Query: 1 MKNQVKKILGMSVIAAMVIVGCSHAPKSGISKSNKAYKEATKGAPDWVVGDLEKVAKYEK 60
MKNQVKKILGMSV+AAMVIVGCSHAPKSGISKSNKAYKEATKGAPDWVVGDLEKVAKYEK
Sbjct: 1 MKNQVKKILGMSVVAAMVIVGCSHAPKSGISKSNKAYKEATKGAPDWVVGDLEKVAKYEK 60

Query: 61 YSGVFLGRAEDLITNNDVDYSTNQATAKARANLAANLKSTLQKDLENEKTRTVDASGKRS 120
YSGVFLGRAEDLITNNDVDYSTNQATAKARANLAANLKSTLQKDLENEKTRTVDASGKRS
Sbjct: 61 YSGVFLGRAEDLITNNDVDYSTNQATAKARANLAANLKSTLQKDLENEKTRTVDASGKRS 120

Query: 121 ISGTDTEKISQLVDKELIASKMLARYVGKDRVFVLVGLDKQIVDKVREELGMVKK 175
ISGTDTEKISQLVDKELIASKMLARYVGKDRVFVLVGLDKQIVDKVREELGMVKK
Sbjct: 121 ISGTDTEKISQLVDKELIASKMLARYVGKDRVFVLVGLDKQIVDKVREELGMVKK 175


25  HPB14_07590  HPB14_07625  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
HPB14_07590  -1      14        1.986415  flagellar hook-basal body protein FliE
HPB14_07595  -1      13        1.941531  flagellar basal body rod protein FlgC
HPB14_07600   1      14        1.360547  flagellar basal body rod protein FlgB
HPB14_07605   1      14        0.848880  cell division protein FtsW
HPB14_07610   1      16       -0.208985  iron(III) ABC transporter periplasmic
HPB14_07615   2      16        0.076388  alkyl hydroperoxide reductase
HPB14_07620   1      13       -0.460526  outer membrane protein
HPB14_07625   0      13       -0.633638  penicillin-binding protein 2
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
HPB14_07590  FLGHOOKFLIE  77     6e-22    Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature.

Length = 103

Score = 77.0 bits (189), Expect = 6e-22
Identities = 19/77 (24%), Positives = 40/77 (51%), Gaps = 1/77 (1%)

Query: 34 EQKGGEFSKLLKQSINELNNTQEQSDKALADMATGQIK-DLHQAAIAIGKAETSMKLMLE 92
Q F+ L +++ +++TQ + G+ L+ + KA SM++ ++
Sbjct: 27 PQPTISFAGQLHAALDRISDTQTAARTQAEKFTLGEPGVALNDVMTDMQKASVSMQMGIQ 86

Query: 93 VRNKAISAYKELLRTQI 109
VRNK ++AY+E++ Q+
Sbjct: 87 VRNKLVAAYQEVMSMQV 103


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
HPB14_07595  FLGHOOKAP1  28     0.012    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 28.4 bits (63), Expect = 0.012
Identities = 10/38 (26%), Positives = 15/38 (39%)

Query: 121 NVNAVVEMADLVEATRAYQANVAAFQSAKNMAQNAIGM 158
VN E +L + Y AN Q+A + I +
Sbjct: 508 GVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPB14_07610  FERRIBNDNGPP  33     0.001    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 33.4 bits (76), Expect = 0.001
Identities = 29/184 (15%), Positives = 75/184 (40%), Gaps = 12/184 (6%)

Query: 106 NVELLKKLSPDLVVTFVGNPKAVEHAKKF--GILFLSFQEKTIAEVMEDID---AQAKAL 160
N+ELL ++ P +V G + E + G F K + A L
Sbjct: 88 NLELLTEMKPSFMVWSAGYGPSPEMLARIAPGRGFNFSDGKQPLAMARKSLTEMADLLNL 147

Query: 161 EIDASKKLAKMQETLDFIAERLKGVKKKKGVELFHKAN----KISGHQALDSDILEKGGI 216
+ A LA+ ++ + + R + + + L + + G +L +IL++ GI
Sbjct: 148 QSAAETHLAQYEDFIRSMKPRFVK-RGARPLLLTTLIDPRHMLVFGPNSLFQEILDEYGI 206

Query: 217 DN-FGLKYVKFGRADISVEKIVK-ENPEIIFIWWISPLSPEDVLNNPKFATIKAIKNKQV 274
N + + +G +S++++ ++ +++ + + ++ P + + ++ +
Sbjct: 207 PNAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNSKDMDALMATPLWQAMPFVRAGRF 266

Query: 275 YKLP 278
++P
Sbjct: 267 QRVP 270


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
HPB14_07625  TYPE3IMPPROT  29     0.029    Type III secretion system inner membrane P protein ...
		>TYPE3IMPPROT#Type III secretion system inner membrane P protein family signature.

Length = 224

Score = 29.4 bits (66), Expect = 0.029
Identities = 9/23 (39%), Positives = 12/23 (52%)

Query: 4 LRYKLLLFVFIGFWGLLVLNLFI 26
KL+LFV + W LL L +
Sbjct: 195 TPIKLVLFVALDGWTLLSKGLIL 217



 
Contact Sachin Pundhir for Bugs/Comments.