PredictBias

Identification of genomic and pathogenicity islands in prokaryotic genomes
 
A) Input parameters
Genome: Aklavik86.gbk
Threshold dinucleotide bias: 2
Threshold codon bias: 4
Threshold %GC bias: 3
E-value (RPSBlast): 0.05
Genome (non-pathogenic):
 
B) Compare a potential GI or PAI in related non-pathogenic sp. (phylogenetic tree)
 
C) Potential GIs and PAIs in CP003476
S.No  Start          End            Bias  Virulence  Insertion elements  Prediction
1     HPAKL86_00190  HPAKL86_00285  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HPAKL86_00190  2       13       -0.928938  iojap-related protein
HPAKL86_00195  3       14       -0.968245  tRNA delta(2)-isopentenylpyrophosphate
HPAKL86_00200  2       11       -0.617799  lipopolysaccharide 1,2-glucosyltransferase
HPAKL86_00215  1       10       -1.012299  FAD-dependent thymidylate synthase
HPAKL86_00220  0       10       -1.258108  glucosamine--fructose-6-phosphate
HPAKL86_00225  0       14       -2.085981  hypothetical protein
HPAKL86_00230  1       15       -2.127960  purine nucleoside phosphorylase
HPAKL86_00235  1       13       -2.412332  chromosomal replication initiation protein
HPAKL86_00240  2       12       -2.897140  hypothetical protein
HPAKL86_00245  0       11       -3.421846  *exodeoxyribonuclease III
HPAKL86_00250  0       12       -4.232402  hypothetical protein
HPAKL86_00255  0       13       -3.887841  hypothetical protein
HPAKL86_00260  0       13       -2.808778  ATP-dependent DNA helicase RecG
HPAKL86_00265  0       12       -2.779979  type III R-M system methyltransferase
HPAKL86_00270  0       11       -2.359264  type III restriction enzyme
HPAKL86_00285  2       12       -1.323578  hypothetical protein
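
The locus tags listed for this island are not consecutive. A minimal sketch for finding which tags between the island's start and end are absent from the table, assuming (from the tags visible on this page, not from any PredictBias documentation) that HPAKL86 locus tags advance in steps of 5. The function name `missing_tags` is hypothetical.

```python
def missing_tags(listed, prefix="HPAKL86_", step=5):
    """Return tags between the lowest and highest listed tag that are not listed.

    Assumption: each tag is `prefix` plus a zero-padded integer, and
    consecutive tags differ by `step` (inferred from this page only).
    """
    width = len(listed[0]) - len(prefix)
    numbers = sorted(int(tag[len(prefix):]) for tag in listed)
    present = set(numbers)
    return [
        f"{prefix}{n:0{width}d}"
        for n in range(numbers[0], numbers[-1] + 1, step)
        if n not in present
    ]


# Tags listed for island 1 (HPAKL86_00190 to HPAKL86_00285).
island_1 = [
    f"HPAKL86_{n:05d}"
    for n in (190, 195, 200, 215, 220, 225, 230, 235,
              240, 245, 250, 255, 260, 265, 270, 285)
]

print(missing_tags(island_1))
```

For island 1 this reports four tags (00205, 00210, 00275 and 00280). The page does not say why they are omitted from the table.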
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
HPAKL86_00235  HTHFIS  35     4e-04    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 35.2 bits (81), Expect = 4e-04
Identities = 9/51 (17%), Positives = 25/51 (49%), Gaps = 4/51 (7%)

Query: 125 TVYEIAKKVAQSDTPPYNPVLFYGGTGLGKTHILNAIGNHAIQKHKKVVLV 175
+Y + ++ Q+D ++ G +G GK + A+ ++ +++ V +
Sbjct: 148 EIYRVLARLMQTDLT----LMITGESGTGKELVARALHDYGKRRNGPFVAI 194
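
Each alignment block on this page carries the same two statistics lines (`Score = ...` and `Identities = ...`). A minimal sketch for pulling those numbers out of the text with regular expressions; the names `parse_hsp`, `SCORE_RE` and `COUNT_RE` are hypothetical, and the patterns are written only against the line shapes seen on this page.

```python
import re

# "Score = 35.2 bits (81), Expect = 4e-04"
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*([\d.eE+-]+)"
)
# "Identities = 9/51 (17%)", "Positives = 25/51 (49%)", "Gaps = 4/51 (7%)"
COUNT_RE = re.compile(r"(Identities|Positives|Gaps)\s*=\s*(\d+)/(\d+)")


def parse_hsp(text):
    """Extract score, E-value and match counts from one alignment header."""
    match = SCORE_RE.search(text)
    if match is None:
        raise ValueError("no 'Score = ... Expect = ...' line found")
    stats = {
        "bits": float(match.group(1)),
        "raw_score": int(match.group(2)),
        "evalue": float(match.group(3)),
    }
    # Gaps is optional: ungapped alignments on this page omit it.
    for name, matched, total in COUNT_RE.findall(text):
        stats[name.lower()] = (int(matched), int(total))
    return stats


sample = """Score = 35.2 bits (81), Expect = 4e-04
Identities = 9/51 (17%), Positives = 25/51 (49%), Gaps = 4/51 (7%)"""

print(parse_hsp(sample))
```

The counts are kept as (matched, aligned length) pairs rather than percentages, so the printed percentages can be recomputed under whatever rounding convention is wanted.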


2     HPAKL86_00340  HPAKL86_00380  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HPAKL86_00340  1       20       -3.541878  ribulose-phosphate 3-epimerase
HPAKL86_00345  1       26       -5.334554  DNA polymerase III subunit epsilon
HPAKL86_00350  6       30       -8.068757  hypothetical protein
HPAKL86_00355  4       29       -6.182847  hypothetical protein
HPAKL86_00360  2       16       -6.266824  hypothetical protein
HPAKL86_00365  5       11       -3.418728  hypothetical protein
HPAKL86_00370  4       11       -2.265012  hypothetical protein
HPAKL86_00375  1       10       -1.375649  hypothetical protein
HPAKL86_00380  2       10       -1.324285  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HPAKL86_00355  FbpA_PF05833  24     0.044    Fibronectin-binding protein
		>FbpA_PF05833#Fibronectin-binding protein

Length = 577

Score = 24.1 bits (52), Expect = 0.044
Identities = 5/42 (11%), Positives = 17/42 (40%), Gaps = 1/42 (2%)

Query: 17 KINNENIQQSNADIY-TITAKVVLLENRLKKLQKEVAKLKQA 57
K ++ ++ ++D+ + + + K L + K +
Sbjct: 291 KDKSDRLKSKSSDLQKIVMNNINRCTKKDKILNNTLKKCEDK 332


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HPAKL86_00370  TCRTETB  25     0.032    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 25.2 bits (55), Expect = 0.032
Identities = 9/42 (21%), Positives = 18/42 (42%)

Query: 25 ILGVMEELCDTLNDSLNFKKVVCMGIKVSIAFKFLIFCSQSF 66
+ + L+D L K+++ GI ++ + F SF
Sbjct: 61 TFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSF 102


3     HPAKL86_00520  HPAKL86_00595  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HPAKL86_00520  2       11       -0.190438  ubiquinone/menaquinone biosynthesis
HPAKL86_00525  3       13       -0.238635  exodeoxyribonuclease VII small subunit
HPAKL86_00530  4       13       -0.294291  hypothetical protein
HPAKL86_00535  4       12       -0.088580  seryl-tRNA synthetase
HPAKL86_00540  4       11       -0.257430  hypothetical protein
HPAKL86_00545  3       13       0.101255   DNA helicase II
HPAKL86_00550  5       19       0.331570   flagellar basal body P-ring biosynthesis protein
HPAKL86_00555  3       18       -0.858122  hypothetical protein
HPAKL86_00560  2       16       -0.042279  3-octaprenyl-4-hydroxybenzoate carboxy-lyase
HPAKL86_00565  3       15       -1.091420  phosphopantetheine adenylyltransferase
HPAKL86_00570  2       14       -0.747465  thymidylate kinase
HPAKL86_00575  1       11       -0.200196  hypothetical protein
HPAKL86_00580  0       10       0.014929   type II restriction modification enzyme
HPAKL86_00595  2       13       1.048452   biotin synthase
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HPAKL86_00565  LPSBIOSNTHSS  223    5e-78    Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature.
Length = 166

Score = 223 bits (569), Expect = 5e-78
Identities = 65/148 (43%), Positives = 95/148 (64%)

Query: 4 IGIYPGTFDPVTNGHIDIIHRSSELFEKLIVAVAHSSAKNPMFSLKERLEMIQLATKNFK 63
IYPG+FDP+T GH+DII R LF+++ VAV + K PMFS++ERLE I A +
Sbjct: 2 NAIYPGSFDPITFGHLDIIERGCRLFDQVYVAVLRNPNKQPMFSVQERLEQIAKAIAHLP 61

Query: 64 NVECVAFEGLLANLAKEYHCKVLVRGLRVVSDFEYELQMGYANKSLNHELETLYFMPTLQ 123
N + +FEGL N A++ ++RGLRV+SDFE ELQM NK+L +LET++ + +
Sbjct: 62 NAQVDSFEGLTVNYARQRQAGAILRGLRVLSDFELELQMANTNKTLASDLETVFLTTSTE 121

Query: 124 NAFISSSIVRSIIAHKGDASHLVPKEIH 151
+F+SSS+V+ + G+ H VP +
Sbjct: 122 YSFLSSSLVKEVARFGGNVEHFVPSHVA 149


4     HPAKL86_01435  HPAKL86_01530  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HPAKL86_01435  3       14       -1.023443  Proline/pyrroline-5-carboxylate dehydrogenase
HPAKL86_01440  8       23       -3.074326  hypothetical protein
HPAKL86_01445  9       22       -3.100405  hypothetical protein
HPAKL86_01450  9       18       -2.139252  hypothetical protein
HPAKL86_01455  8       17       -2.173001  hypothetical protein
HPAKL86_01460  6       17       -1.593819  hypothetical protein
HPAKL86_01480  3       14       -0.742688  hypothetical protein
HPAKL86_01485  3       11       -0.299281  hypothetical protein
HPAKL86_01490  2       11       0.177758   hypothetical protein
HPAKL86_01495  1       11       0.677868   hypothetical protein
HPAKL86_01500  0       10       0.891321   hypothetical protein
HPAKL86_01505  -1      12       1.649293   ATP-binding protein
HPAKL86_01510  0       16       2.226033   urease accessory protein UreH
HPAKL86_01515  3       24       2.850980   urease accessory protein
HPAKL86_01520  4       25       2.462097   urease accessory protein UreF
HPAKL86_01525  4       24       2.064780   urease accessory protein UreE
HPAKL86_01530  3       21       1.896265   urease accessory protein UreI
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HPAKL86_01435  ANTHRAXTOXNA  31     0.035    Anthrax toxin LF subunit signature.
		>ANTHRAXTOXNA#Anthrax toxin LF subunit signature.

Length = 800

Score = 30.9 bits (69), Expect = 0.035
Identities = 36/173 (20%), Positives = 71/173 (41%), Gaps = 19/173 (10%)

Query: 121 QEESQLKERILKRKNEKIILNVNFIGEEVLGEEEANARFEKY---SQALKSNYIQYISIK 177
Q+ S+ ++ + + EK+ F+ E+ + + Y S+ K Y +
Sbjct: 118 QDLSEEEKNSMNSRGEKVPFASRFVFEKKRETPKLIINIKDYAINSEQSKEVYYEIGKGI 177

Query: 178 ITTIFSQINILDFEY-----SKKEIVKRLDALYALALEEEKKQGMPKFINLDMEEFRDLE 232
I S+ LD E+ S + D L++ +E K + K I+++ ++
Sbjct: 178 SLDIISKDKSLDPEFLNLIKSLSDDSDSSDLLFSQKFKE-KLELNNKSIDINF-----IK 231

Query: 233 LTVESFMESIAK-----FDLNAGIVLQAYIPDSYEYLKKLHAFSKERVLKGLK 280
+ F + + F + VL+ Y PD +EY+ KL E++ + LK
Sbjct: 232 ENLTEFQHAFSLAFSYYFAPDHRTVLELYAPDMFEYMNKLEKGGFEKISESLK 284


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits        Score  E-value  Comments
HPAKL86_01455  GPOSANCHOR  47     7e-08    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 46.6 bits (110), Expect = 7e-08
Identities = 46/266 (17%), Positives = 86/266 (32%)

Query: 13 QVREELEARISELEDENENLTDENTELKTEKTELLREKNNLTNENTRLLASKERLTTEKT 72
+L L+D N+ LT+E + K + + + + ++ L A K L
Sbjct: 71 LKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALE 130

Query: 73 ELTKEKTELTKEKTELTEKNQNLAKANTELTKEKIDLTEKNQNLIKENTELKIEKENLND 132
T + + L + LA +L K + + L+ EK L
Sbjct: 131 GAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEA 190

Query: 133 QLNASQKQIKSLEQSQQILENEKADLTKEKAELTDKNKTLTTEKDNLTKANADLKKENDK 192
+ +K ++ + L EKA L + L + + +
Sbjct: 191 RQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKT 250

Query: 193 LNHQVIALTKERDSLEYERVQLQDEHGFLEELCANLEKDNQHLNDKLKKLESAQKNLENS 252
L + AL + LE + LE + L + LE + L +
Sbjct: 251 LEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNAN 310

Query: 253 NNQLLQAKEKIAEEKIEMEREVARLK 278
L + + E K ++E E +L+
Sbjct: 311 RQSLRRDLDASREAKKQLEAEHQKLE 336



Score = 44.7 bits (105), Expect = 3e-07
Identities = 49/258 (18%), Positives = 88/258 (34%)

Query: 21 RISELEDENENLTDENTELKTEKTELLREKNNLTNENTRLLASKERLTTEKTELTKEKTE 80
R + E EN L +N++L L + LT E + + +E + E
Sbjct: 58 RADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQE 117

Query: 81 LTKEKTELTEKNQNLAKANTELTKEKIDLTEKNQNLIKENTELKIEKENLNDQLNASQKQ 140
L K +L + + +T + + L + L +L+ E + A +
Sbjct: 118 LEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAK 177

Query: 141 IKSLEQSQQILENEKADLTKEKAELTDKNKTLTTEKDNLTKANADLKKENDKLNHQVIAL 200
IK+LE + LE +A+L K + + + + L A L L +
Sbjct: 178 IKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGA 237

Query: 201 TKERDSLEYERVQLQDEHGFLEELCANLEKDNQHLNDKLKKLESAQKNLENSNNQLLQAK 260
+ + L+ E LE A LEK + + + K LE L K
Sbjct: 238 MNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEK 297

Query: 261 EKIAEEKIEMEREVARLK 278
+ + + L+
Sbjct: 298 ADLEHQSQVLNANRQSLR 315



Score = 35.4 bits (81), Expect = 3e-04
Identities = 61/304 (20%), Positives = 114/304 (37%)

Query: 15 REELEARISELEDENENLTDENTELKTEKTELLREKNNLTNENTRLLASKERLTTEKTEL 74
+ +LE + + + + + L+ EK L + L + + + L
Sbjct: 157 KADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTL 216

Query: 75 TKEKTELTKEKTELTEKNQNLAKANTELTKEKIDLTEKNQNLIKENTELKIEKENLNDQL 134
EK L K +L + + +T + + L + L EL+ E +
Sbjct: 217 EAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFS 276

Query: 135 NASQKQIKSLEQSQQILENEKADLTKEKAELTDKNKTLTTEKDNLTKANADLKKENDKLN 194
A +IK+LE + LE EKADL + L ++L + D +A L+ E+ KL
Sbjct: 277 TADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLE 336

Query: 195 HQVIALTKERDSLEYERVQLQDEHGFLEELCANLEKDNQHLNDKLKKLESAQKNLENSNN 254
Q R SL + ++ LE LE+ N+ + L +
Sbjct: 337 EQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKK 396

Query: 255 QLLQAKEKIAEEKIEMEREVARLKSLESTDKSELDLQNRRFKSAIEDLKRQNRKLEEENM 314
Q+ +A E+ + +E+ L+ + + E + ++ + LK + K EE
Sbjct: 397 QVEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEAKALKEKLAKQAEELA 456

Query: 315 ALKE 318
L+
Sbjct: 457 KLRA 460


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HPAKL86_01460  PHPHTRNFRASE  26     0.024    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.
Length = 572

Score = 26.3 bits (58), Expect = 0.024
Identities = 7/44 (15%), Positives = 20/44 (45%)

Query: 38 KALSTERDELIASHTDKSELDLQNRRFKSAIEDLKCQNRKLEEE 81
KA + T +++ + + +A+E K + R ++++
Sbjct: 18 KAFIHLEPNVDIEKTSITDVSTEIEKLTAALEKSKEELRAIKDQ 61


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
HPAKL86_01490  BACINVASINB  32     0.011    Salmonella/Shigella invasin protein B signature.
		>BACINVASINB#Salmonella/Shigella invasin protein B signature.

Length = 593

Score = 31.6 bits (71), Expect = 0.011
Identities = 29/106 (27%), Positives = 52/106 (49%), Gaps = 2/106 (1%)

Query: 50 KIERRLQEAKRELRKAKQNKDNLGWVTSGLQFVTGAVSFVYPPARAAGALAVAAIGLASK 109
++E++ E + E RKA++ +G + L + VS V +LA+AA+GLA
Sbjct: 292 EMEKKSAEFQEETRKAEETNRIMGCIGKVLGALLTIVSVVAAVFTGGASLALAAVGLAV- 350

Query: 110 FIEEDTKKYEKNVKLLEQALE-IYSTQAKASKELVEEALERVKKAL 154
+ ++ K V ++QAL I K EL+ +A+ + + L
Sbjct: 351 MVADEIVKAATGVSFIQQALNPIMEHVLKPLMELIGKAITKALEGL 396
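
Several of the hits reported for this island sit close to the 0.05 E-value cutoff given in the input parameters. A minimal sketch for re-filtering and ranking the island 4 hits by E-value; the values are copied from the hit lines above (best alignment per locus tag), while the function name `filter_hits` and the stricter cutoff of 1e-3 are illustrative assumptions, not PredictBias settings.

```python
# (locus_tag, profile, score_bits, e_value) for the island 4 hits.
hits = [
    ("HPAKL86_01435", "ANTHRAXTOXNA", 30.9, 0.035),
    ("HPAKL86_01455", "GPOSANCHOR", 46.6, 7e-08),
    ("HPAKL86_01460", "PHPHTRNFRASE", 26.3, 0.024),
    ("HPAKL86_01490", "BACINVASINB", 31.6, 0.011),
]


def filter_hits(hits, max_evalue):
    """Keep hits with E-value <= max_evalue, most significant first."""
    kept = [hit for hit in hits if hit[3] <= max_evalue]
    return sorted(kept, key=lambda hit: hit[3])


# Page cutoff (0.05) versus a hypothetical stricter cutoff (1e-3).
for cutoff in (0.05, 1e-3):
    names = [profile for _, profile, _, _ in filter_hits(hits, cutoff)]
    print(cutoff, names)
```

At the page's cutoff all four hits survive; at the hypothetical 1e-3 cutoff only the GPOSANCHOR hit remains.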


5     HPAKL86_03255  HPAKL86_03285  Y     N          Y                   Genomic Island
LocusTag       DNBias  CDNBias  %GCBias    Product
HPAKL86_03255  -3      14       -3.537592  dihydroorotate dehydrogenase 2
HPAKL86_03260  -2      15       -3.761737  polyphosphate kinase
HPAKL86_03265  -2      16       -4.651162  *hypothetical protein
HPAKL86_03270  -2      16       -4.563160  type I restriction enzyme S protein (hsdS)
HPAKL86_03275  -1      16       -3.773653  type I restriction enzyme M protein
HPAKL86_03280  0       15       -3.638219  type I restriction enzyme R protein
HPAKL86_03285  1       14       -3.100120  type I restriction enzyme R protein HsdR
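
The %GCBias column for this island can be summarised against the "Threshold %GC bias" value of 3 from the input parameters. A minimal sketch follows; comparing the absolute value of each entry to the threshold is an assumption about how the cutoff is meant to be read, not a documented PredictBias rule, and `summarize_gc_bias` is a hypothetical helper.

```python
from statistics import mean

GC_THRESHOLD = 3.0  # "Threshold %GC bias" from the input parameters

# %GCBias values for island 5, copied from the table above.
island_5_gc_bias = {
    "HPAKL86_03255": -3.537592,
    "HPAKL86_03260": -3.761737,
    "HPAKL86_03265": -4.651162,
    "HPAKL86_03270": -4.563160,
    "HPAKL86_03275": -3.773653,
    "HPAKL86_03280": -3.638219,
    "HPAKL86_03285": -3.100120,
}


def summarize_gc_bias(values, threshold):
    """Return the mean bias and how many entries exceed the threshold.

    Assumption: an entry counts as biased when abs(value) > threshold.
    """
    vals = list(values)
    n_over = sum(1 for v in vals if abs(v) > threshold)
    return {"n": len(vals), "mean": mean(vals), "n_over": n_over}


print(summarize_gc_bias(island_5_gc_bias.values(), GC_THRESHOLD))
```

Under that reading, every ORF in island 5 lies beyond the threshold, which is consistent with the Y in the Bias column of the island summary.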
6     HPAKL86_03340  HPAKL86_03525  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HPAKL86_03340  1       15       -3.112763  outer membrane protein (omp22)
HPAKL86_03345  1       12       -2.327379  type II adenine specific methyltransferase
HPAKL86_03350  2       13       -1.263774  GTP-binding protein TypA
HPAKL86_03355  4       14       -3.083064  type II adenine specific DNA methyltransferase
HPAKL86_03360  2       14       -2.572902  type II restriction endonuclease
HPAKL86_03365  1       15       -1.647905  hypothetical protein
HPAKL86_03370  2       12       -0.462080  hypothetical protein
HPAKL86_03375  -1      11       -0.940524  hypothetical protein
HPAKL86_03380  -1      10       -1.342907  hypothetical protein
HPAKL86_03385  1       12       -1.214240  6-carboxy-5,6,7,8-tetrahydropterin synthase
HPAKL86_03390  -1      11       -1.797086  hypothetical protein
HPAKL86_03395  -1      10       -2.290381  5'(3')-nucleotidase/polyphosphatase
HPAKL86_03400  -1      12       -2.947261  geranyltranstransferase
HPAKL86_03405  -1      11       -3.052718  GTP cyclohydrolase I
HPAKL86_03410  -1      11       -2.593742  heat shock protein HtpX
HPAKL86_03415  1       12       -1.980509  tRNA pseudouridine synthase D
HPAKL86_03420  1       15       -0.676459  recombination protein RecR
HPAKL86_03425  2       11       -0.044345  4-oxalocrotonate tautomerase
HPAKL86_03430  0       11       -0.526014  outer membrane protein (omp22)
HPAKL86_03435  1       11       0.119004   hypothetical protein
HPAKL86_03440  0       13       -0.394395  ATP-binding protein
HPAKL86_03445  2       16       -0.930463  nitrite extrusion protein NarK
HPAKL86_03450  3       16       -1.318917  putative heme iron utilization protein
HPAKL86_03455  2       15       -1.610298  arginyl-tRNA synthetase
HPAKL86_03460  3       13       -1.150427  Sec-independent protein translocase protein
HPAKL86_03465  1       13       -1.288676  guanylate kinase
HPAKL86_03470  2       11       -1.417840  poly E-rich protein
HPAKL86_03475  -1      13       -2.357427  nuclease NucT
HPAKL86_03480  1       12       -2.484651  outer membrane protein 13
HPAKL86_03485  2       14       -2.682037  flagellar basal body L-ring protein
HPAKL86_03490  3       12       -2.077858  CMP-N-acetylneuraminic acid synthetase
HPAKL86_03495  3       11       -1.414965  CMP-N-acetylneuraminic acid synthetase
HPAKL86_03500  2       13       -1.415007  flagellar biosynthesis protein G
HPAKL86_03505  2       14       0.201778   tetraacyldisaccharide 4'-kinase
HPAKL86_03510  1       14       1.252476   NAD synthetase
HPAKL86_03515  0       15       1.684307   *ketol-acid reductoisomerase
HPAKL86_03520  2       18       1.239931   cell division inhibitor
HPAKL86_03525  3       20       1.509790   cell division topological specificity factor
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
HPAKL86_03350  TCRTETOQM  196    4e-57    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.
Length = 639

Score = 196 bits (501), Expect = 4e-57
Identities = 115/461 (24%), Positives = 190/461 (41%), Gaps = 67/461 (14%)

Query: 3 NIRNIAVIAHVDHGKTTLVDGLLSQSGTFSEREKVDE--RVMDSNDLERERGITILSKNT 60
I NI V+AHVD GKTTL + LL SG +E VD+ D+ LER+RGITI + T
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61

Query: 61 AIYYKDTKINIIDTPGHADFGGEVERVLKMVDGVLLLVDAQEGVMPQTKFVVKKALSFGI 120
+ +++TK+NIIDTPGH DF EV R L ++DG +LL+ A++GV QT+ + GI
Sbjct: 62 SFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGI 121

Query: 121 CPIVVVNKIDKPAAEPDRVVDEVFDLF---------VAMGASDKQLDFPV-----VYAAA 166
I +NKID+ + V ++ + V + + +F
Sbjct: 122 PTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEG 181

Query: 167 RDGYAMKSLDDE----------------------------KKNL--EPLFETILEHVPSP 196
D K + + K N+ + L E I S
Sbjct: 182 NDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITNKFYSS 241

Query: 197 SGSVDEPLQMQIFTLDYDNYVGKIGIARVFNGSVKKNESVLLMKSDGSKENGRITKLIGF 256
+ L ++F ++Y ++ R+++G + +SV + KE +IT++
Sbjct: 242 THRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRI----SEKEKIKITEMYTS 297

Query: 257 LGLARTEIENAYAGDIVAIAG--FNAMDV-GDSVVDPANPMPLDPMHLEEPTMSVYFAVN 313
+ +I+ AY+G+IV + V GD+ + P +P P + +
Sbjct: 298 INGELCKIDKAYSGEIVILQNEFLKLNSVLGDTKLLPQRERIENP----LPLLQTTVEPS 353

Query: 314 DSPLAGLEGKHVTANKLKDRLLKEMQTNIAMKCEEMGEGKFKVSGRGELQITILAENLRR 373
+ + D LL+ + + +S G++Q+ + L+
Sbjct: 354 KPQQREMLLDALLEISDSDPLLRYYVDSAT--------HEIILSFLGKVQMEVTCALLQE 405

Query: 374 E-GFEFSISRPEVIIKEENGVKCEPFEHLVIDTPQDFSGAI 413
+ E I P VI E K E H+ + P F +I
Sbjct: 406 KYHVEIEIKEPTVIYMERPLKKAEYTIHIEVP-PNPFWASI 445



Score = 41.8 bits (98), Expect = 8e-06
Identities = 20/80 (25%), Positives = 30/80 (37%), Gaps = 1/80 (1%)

Query: 396 EPFEHLVIDTPQDFSGAIIERLGKRKAEMKAMNPMSDGYTRLEFEIPARGLIGYRSEFLT 455
EP+ I PQ++ K A + + + L EIPAR + YRS+
Sbjct: 537 EPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQ-LKNNEVILSGEIPARCIQEYRSDLTF 595

Query: 456 DTKGEGVMNHSFLEFRPFSG 475
T G V + +G
Sbjct: 596 FTNGRSVCLTELKGYHVTTG 615


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HPAKL86_03375  SACTRNSFRASE  31     0.001    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 31.5 bits (71), Expect = 0.001
Identities = 17/90 (18%), Positives = 30/90 (33%)

Query: 43 VQKLRERGGEFWGMRDNEKLIGICGLNLINKTEAELCKFHINSAYQSQGLGQKLYESVER 102
V + E G + IG + A + + Y+ +G+G L
Sbjct: 57 VSYVEEEGKAAFLYYLENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIE 116

Query: 103 YSFIKGYTKISLHVSKSQIKACNLYQKLGF 132
++ + + L I AC+ Y K F
Sbjct: 117 WAKENHFCGLMLETQDINISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HPAKL86_03415  BONTOXILYSIN  29     0.034    Bontoxilysin signature.
		>BONTOXILYSIN#Bontoxilysin signature.

Length = 1196

Score = 29.1 bits (65), Expect = 0.034
Identities = 30/194 (15%), Positives = 69/194 (35%), Gaps = 15/194 (7%)

Query: 133 RFFMRFKKMT-PLNAQKTEQVLEQIAQFGMPNYFGPQRFGKFNDNHQEGLKI----LQNQ 187
++F + AQ E +++QI Q + +E + L N+
Sbjct: 679 QYFELICMAKQSILAQ--ESLVKQIVQNKFTDLSKASIPPDTLKLIRETTEKTFIDLSNE 736

Query: 188 TKFAHQKLNAFLISSYQSYLFNALLSKRL-EISKIISAFSVKENLEFFKQKNLNINPNTL 246
++ + +++ FL + + K + + K I+ ++ F Q+ NIN N
Sbjct: 737 SQISMNRVDNFLNKASICVFVEDIYPKFISYMEKYIN--NINIKTREFIQRCTNINDNEK 794

Query: 247 KALKNQAHPFKILEGDVMCHYPYGKFFDALELEKESERFLKKEVVPTGLLDGKKA----L 302
L N + FK ++ + FF++ + E +++ +
Sbjct: 795 SILINS-YTFKTIDFKFLDIQSIKNFFNSQVEQVMKEILSPYQLLLFASKGPNSNIIEDI 853

Query: 303 YAKNLSLEIEKEFQ 316
KN ++ + +
Sbjct: 854 SGKNTLIQYTESIE 867


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HPAKL86_03445  TCRTETA  46     2e-07    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 45.6 bits (108), Expect = 2e-07
Identities = 54/271 (19%), Positives = 102/271 (37%), Gaps = 16/271 (5%)

Query: 28 LILSGSLTPHQSFQLGIAVLMGYIFGSFLIQFLNPLMSLESIAKISFGLIALSFLICYFD 87
L+ S +T H L + LM + L L+ + +S A+ + I
Sbjct: 35 LVHSNDVTAHYGILLALYALMQFACAPVLGA-LSDRFGRRPVLLVSLAGAAVDYAI--MA 91

Query: 88 SIPFFW-LWIWRFIAGVASSALMILVAPLSLPYVKENKRALVGGFIFSAVGVGSVFSGFV 146
+ PF W L+I R +AG+ + A + ++RA GF+ + G G V +
Sbjct: 92 TAPFLWVLYIGRIVAGI-TGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVL 150

Query: 147 LPWISSYNIKWAWIFLGGSCLIAFILSLIGLKN-HSLRKKSVKKEESAFKIPFHL----- 200
+ ++ + + F+ L H ++ +++E F
Sbjct: 151 GGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMT 210

Query: 201 ---WLLLISCALNAIGFLPHTLFWVDYLVRHLNISPAIAGTSWAVFG-FGATLGSLISGP 256
L+ + + +G +P L WV + + G S A FG + ++I+GP
Sbjct: 211 VVAALMAVFFIMQLVGQVPAAL-WVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGP 269

Query: 257 MAQKLGAKNANIFILILKSIACFLPIFFHQI 287
+A +LG + A + +I L F +
Sbjct: 270 VAARLGERRALMLGMIADGTGYILLAFATRG 300


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HPAKL86_03465  PF05272  29     0.011    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.3 bits (65), Expect = 0.011
Identities = 9/18 (50%), Positives = 11/18 (61%)

Query: 8 LILSGPSGAGKSTLTKYL 25
++L G G GKSTL L
Sbjct: 599 VVLEGTGGIGKSTLINTL 616


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
HPAKL86_03470  IGASERPTASE  68     4e-14    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 68.2 bits (166), Expect = 4e-14
Identities = 50/246 (20%), Positives = 91/246 (36%), Gaps = 23/246 (9%)

Query: 117 QPSESDPNPTDPLEPAQETLETNWDELENLGDLETLAQEEPNNEEQLLPTLNDQEEKEEA 176
+ + D P P PA + T + + +T+ + E + E T ++E +EA
Sbjct: 1016 EIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATE---TTAQNREVAKEA 1072

Query: 177 KEEIKETPQEEEKPKEEMQEQAKEQEPIKEETQEEIKEETQEEIKEETQEEIKEETQEEL 236
K +K Q E + ET+E ET+E E +E+ K ET++
Sbjct: 1073 KSNVKANTQTNEVAQ------------SGSETKETQTTETKETATVEKEEKAKVETEKTQ 1120

Query: 237 EIPKEETQEQAKEQELEAMQELVKEIQENSNNQADKEKTQENAKAFQETQAQELEKQELE 296
E+PK +Q K+++ E +Q + +EN KE + Q + +E
Sbjct: 1121 EVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVE 1180

Query: 297 IPQESTETPQEKEKQALEIPQEEKQENAETPQESTEIPQEKTQKLETQEDHYESIEDIPE 356
+ T +E P+ + P ++E + + H S+ +P
Sbjct: 1181 -QPVTESTTVNTGNSVVENPENTTPATTQ-PTVNSESSNKPKNR------HRRSVRSVPH 1232

Query: 357 PVMAQA 362
V
Sbjct: 1233 NVEPAT 1238



Score = 58.5 bits (141), Expect = 3e-11
Identities = 54/277 (19%), Positives = 99/277 (35%), Gaps = 21/277 (7%)

Query: 142 ELENLGDLETLAQEEPNNEEQLLPTLNDQEEKEEAK-EEIKETPQEEEKPKEEMQ---EQ 197
E+E N Q +E A+ +E P P E + E
Sbjct: 984 EVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAEN 1043

Query: 198 AKEQEPIKEETQEEIKEET--QEEIKEETQEEIKEETQEELEIPKEETQEQAKEQELEAM 255
+K++ E+ +++ E T E+ +E + +K TQ E+ + E + Q E
Sbjct: 1044 SKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQT-NEVAQSG-SETKETQTTETK 1101

Query: 256 QELVKEIQENSNNQADKEKTQENAKAFQETQAQELEKQELEIPQESTETPQEKEKQALEI 315
+ E +E + + + EKTQE K + ++ E+ E PQ + E
Sbjct: 1102 ETATVEKEEKA--KVETEKTQEVPKVTSQVSPKQ-EQSETVQPQAEPARENDPTVNIKE- 1157

Query: 316 PQEEKQENAETPQESTEIPQEKT----QKLETQEDHYESIEDIPEPVMAQAMGEALPFLN 371
PQ + A+T Q P ++T ++ T+ + + E P +N
Sbjct: 1158 PQSQTNTTADTEQ-----PAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVN 1212

Query: 372 ESVAKTPNNENDTEIPKESVIKTPQEKEGNDKTSSPL 408
+ P N + + P ND+++ L
Sbjct: 1213 SESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVAL 1249



Score = 51.6 bits (123), Expect = 5e-09
Identities = 34/191 (17%), Positives = 61/191 (31%), Gaps = 10/191 (5%)

Query: 149 LETLAQEEPNNEEQLLPTLNDQEEKEEAKEEIKETPQ--EEEKPKEEMQEQAKE-----Q 201
+ N + + E KE E KET +EEK K E ++ + Q
Sbjct: 1069 AKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQ 1128

Query: 202 EPIKEETQEEIKEETQEEIKEETQEEIKEETQEELEIPKEETQEQAKEQELEAMQELVKE 261
K+E E ++ + + + + IKE + +T++ AKE Q + +
Sbjct: 1129 VSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTA--DTEQPAKETSSNVEQPVTES 1186

Query: 262 IQENSNNQADKEKTQENAKAFQETQAQELEKQELEIPQESTET-PQEKEKQALEIPQEEK 320
N+ N + Q T E + + S + P E
Sbjct: 1187 TTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRST 1246

Query: 321 QENAETPQEST 331
+ +T
Sbjct: 1247 VALCDLTSTNT 1257



Score = 34.7 bits (79), Expect = 9e-04
Identities = 41/271 (15%), Positives = 83/271 (30%), Gaps = 25/271 (9%)

Query: 67 KEIVSQNKNSVCMYKKGNE-AQPFLEGFEMKIKKPFLPTEMLKVLQKKLGFQPSESDPNP 125
+E+ + K++V + NE AQ E E + + + K + K+ + ++ P
Sbjct: 1066 REVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKV 1125

Query: 126 TDPLEPAQETLETNWDELENLGDLETLAQEEPNNEEQLLPTLN-----DQEEKEEAKEEI 180
T + P QE ET Q EP E PT+N Q E+
Sbjct: 1126 TSQVSPKQEQSET------------VQPQAEPAREND--PTVNIKEPQSQTNTTADTEQP 1171

Query: 181 KETPQEEEKPKEEMQEQAKEQEPIKEETQEEIKEETQEEIKEETQEEIKEETQEELEIPK 240
+ + + E + TQ + E+ + K + +
Sbjct: 1172 AKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVP 1231

Query: 241 EETQEQAKEQELEAMQELVKEIQENSNNQAD--KEKTQENAKAFQETQAQELEKQELEIP 298
+ + L N+N + K Q A + +Q + + E+
Sbjct: 1232 HNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVALNVGKAVSQHISQLEMNNE 1291

Query: 299 QES---TETPQEKEKQALEIPQEEKQENAET 326
+ + + + ++ +T
Sbjct: 1292 GQYNVWVSNTSMNKNYSSSQYRRFSSKSTQT 1322


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HPAKL86_03485  FLGLRINGFLGH  193    3e-64    Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 193 bits (492), Expect = 3e-64
Identities = 52/172 (30%), Positives = 84/172 (48%), Gaps = 18/172 (10%)

Query: 56 GERPLFADRRAMKPNDLITIIVSEKASANYSSS----KDYKSASGGNSTPPRLTYNGLDE 111
G +PLF DRR D +TI++ E SA+ SSS +D K+ G ++ P L GL
Sbjct: 61 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYL--QGLFG 118

Query: 112 RKKQEAEYLDDKNNYNFTKSSNNTNFKGGGSQKKSEDLEIVLSARIIKVLENGNYFIYGN 171
+ + E S F G G S L+ + +VL NGN + G
Sbjct: 119 NARADVEA------------SGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGE 166

Query: 172 KEVLVDGEKQILKVSGVIRPYDIERNNTIQSKFLADAKIEYTNLGHLSDSNK 223
K++ ++ + ++ SGV+ P I +NT+ S +ADA+IEY G+++++
Sbjct: 167 KQIAINQGTEFIRFSGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQN 218


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HPAKL86_03500  SACTRNSFRASE  28     0.022    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 27.6 bits (61), Expect = 0.022
Identities = 15/49 (30%), Positives = 22/49 (44%), Gaps = 3/49 (6%)

Query: 102 RGETILKALEYIAFE---EFQLNSLHLEVMENNFKAIAFYEKNHYELEG 147
R + + AL + A E E L LE + N A FY K+H+ +
Sbjct: 102 RKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGA 150


7     HPAKL86_04515  HPAKL86_04590  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HPAKL86_04515  2       12       -1.270302  4-hydroxy-3-methylbut-2-en-1-yl diphosphate
HPAKL86_04520  2       10       -1.539139  hypothetical protein
HPAKL86_04525  1       11       -3.171337  UDP-N-acetylmuramate--L-alanine ligase
HPAKL86_04530  0       9        -3.873938  hypothetical protein
HPAKL86_04535  0       9        -3.680867  recombination and DNA strand exchange inhibitor
HPAKL86_04540  0       8        -3.579336  osmoprotection protein (proV)
HPAKL86_04545  -1      7        -3.158208  osmoprotection protein ProWX
HPAKL86_04550  0       8        -3.499765  adenine-specific DNA methylase
HPAKL86_04555  0       10       -2.115124  type III restriction enzyme R protein
HPAKL86_04560  0       12       0.349460   hypothetical protein
HPAKL86_04565  2       13       0.172995   flagellar motor protein MotB
HPAKL86_04570  5       17       1.295397   flagellar motor protein MotA
HPAKL86_04575  5       16       0.817961   hypothetical protein
HPAKL86_04580  5       16       0.740774   hypothetical protein
HPAKL86_04585  6       16       1.708038   outer membrane protein HopH
HPAKL86_04590  5       15       1.866578   hof family outer membrane protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits        Score  E-value  Comments
HPAKL86_04565  OMPADOMAIN  56     3e-11    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 55.7 bits (134), Expect = 3e-11
Identities = 31/121 (25%), Positives = 56/121 (46%), Gaps = 15/121 (12%)

Query: 120 LPSNLLFENATSDTINQDMMLYLERIA-KIIQKLPKRVHINVRGFTDDTPLIKTRFKSHY 178
L S++LF T+ + L+++ ++ PK + V G+TD I + +
Sbjct: 217 LKSDVLFNFN-KATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDR---IGSD-AYNQ 271

Query: 179 ELAANRAYRVMKVLIQYGVNPNQLSFSSYGSTNPIAPN--DSLENRM-------KNNRVE 229
L+ RA V+ LI G+ +++S G +NP+ N D+++ R + RVE
Sbjct: 272 GLSERRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVE 331

Query: 230 I 230
I
Sbjct: 332 I 332


8     HPAKL86_04830  HPAKL86_04860  Y     N          N                   Genomic Island
LocusTag       DNBias  CDNBias  %GCBias    Product
HPAKL86_04830  2       11       -1.824783  hypothetical protein
HPAKL86_04835  1       11       -2.296034  UTP--glucose-1-phosphate uridylyltransferase
HPAKL86_04840  2       11       -3.308624  soluble lytic murein transglycosylase
HPAKL86_04845  2       12       -3.708435  hypothetical protein
HPAKL86_04850  2       13       -3.849296  glutamylglutaminyl-tRNA synthetase
HPAKL86_04855  2       11       -3.230188  tRNA(Ile)-lysidine synthase
HPAKL86_04860  2       11       -3.098471  hypothetical protein
9     HPAKL86_05250  HPAKL86_05295  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HPAKL86_05250  2       16       -1.594014  hypothetical protein
HPAKL86_05255  2       18       -1.163418  type II DNA modification methyltransferase
HPAKL86_05260  3       17       -0.474965  hypothetical protein
HPAKL86_05265  3       14       -1.119031  hypothetical protein
HPAKL86_05270  4       14       -1.023443  FKBP-type peptidyl-prolyl cis-trans isomerase
HPAKL86_05275  4       15       -1.705195  hypothetical protein
HPAKL86_05280  4       14       -1.443047  peptidoglycan-associated lipoprotein precursor
HPAKL86_05285  3       13       0.316121   translocation protein TolB
HPAKL86_05290  3       18       0.047372   periplasmic protein TonB
HPAKL86_05295  2       16       1.966526   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits        Score  E-value  Comments
HPAKL86_05275  GPOSANCHOR  32     0.003    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 32.3 bits (73), Expect = 0.003
Identities = 23/138 (16%), Positives = 44/138 (31%)

Query: 27 GATKKELKQLQVNSKNFSNVLTKIHSQVEANTQAQEGLRSVYEGQANKIKDLNNAILSQV 86
K + ++ +A EG + + KIK L +
Sbjct: 200 EGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALE 259

Query: 87 ESLRALKASQEVQANTLKQQSQTLDDLRNEIRANQQAIQQLDQQNKQMSELLTKLSQDLV 146
L+ + E N S + L E A + L+ Q++ ++ L +DL
Sbjct: 260 ARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLD 319

Query: 147 SQIALIQKALKEQQEKTE 164
+ ++ E Q+ E
Sbjct: 320 ASREAKKQLEAEHQKLEE 337


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits        Score  E-value  Comments
HPAKL86_05280  OMPADOMAIN  138    2e-42    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 138 bits (350), Expect = 2e-42
Identities = 45/172 (26%), Positives = 69/172 (40%), Gaps = 28/172 (16%)

Query: 24 DNNTVAGDVGTGAKAVQNAPVTTEPVLEKQEPKEEPKQEPAPAVEEKPAIESGTIIASIY 83
DN ++ V + APV P PAP V+ K T+ + +
Sbjct: 179 DNGMLSLGVSYRFGQGEAAPVV------------APAPAPAPEVQTK----HFTLKSDVL 222

Query: 84 FDFDKYEIKESDQETLDAIVQKAKE---HHMQVLLEGNTDEFGSSEYNQALGVKRTLSVK 140
F+F+K +K Q LD + + V++ G TD GS YNQ L +R SV
Sbjct: 223 FNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQSVV 282

Query: 141 NALVIKGVEKDMIKTISFGEIKPKCTQKT---------KECYKENRRVDVKL 183
+ L+ KG+ D I GE P +C +RRV++++
Sbjct: 283 DYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEV 334


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
HPAKL86_05290  TYPE4SSCAGA  35     4e-04    Type IV secretion system CagA exotoxin signature.
		>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature.

Length = 1147

Score = 34.7 bits (79), Expect = 4e-04
Identities = 38/139 (27%), Positives = 66/139 (47%), Gaps = 11/139 (7%)

Query: 32 EEAEKILLDLSKKDEQVID--LNLEDAPSENKKE-KIEKVTEKQGDFLKP--KEEPKEEE 86
+EA K++ D +++++ LN A ++ K ++V + Q D K K E E+E
Sbjct: 568 QEANKLIKDFLSSNKELVGKTLNFNKAVADAKNTGNYDEVKKAQKDLEKSLRKREHLEKE 627

Query: 87 PEESLEDIFSSLNDFKEKTDKNAQKDE-----QKKEQEEQRRLKEQQRLKQ-NQENQEML 140
E+ LE + N + K N+QKDE K+ + R + Q LK +E + L
Sbjct: 628 VEKKLESKSGNKNKMEAKAQANSQKDEIFALINKEANRDARAIAYAQNLKGIKRELSDKL 687

Query: 141 KGLQQNLDQFAQKLESVKN 159
+ + +NL F + + KN
Sbjct: 688 ENVNKNLKDFDKSFDEFKN 706


10    HPAKL86_05890  HPAKL86_05995  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HPAKL86_05890  -2      10       3.281419   glucose/galactose transporter
HPAKL86_05895  -2      10       2.869025   hypothetical protein
HPAKL86_05900  -2      9        2.551377   outer membrane protein HopZ
HPAKL86_05905  0       11       1.548813   purine-nucleoside phosphorylase
HPAKL86_05910  1       11       1.084159   phosphopentomutase
HPAKL86_05915  1       12       1.722460   nucleoside transporter
HPAKL86_05920  1       13       1.028701   *multidrug-efflux transporter
HPAKL86_05925  0       13       -0.125449  hypothetical protein
HPAKL86_05930  4       17       -1.161684  Na+/H+ antiporter
HPAKL86_05935  4       18       -0.957912  hypothetical protein
HPAKL86_05940  6       19       -0.409016  putative arabinose transporter
HPAKL86_05945  6       19       -0.648139  Alpha-carbonic anhydrase
HPAKL86_05950  8       22       -0.903749  hypothetical protein
HPAKL86_05955  4       18       0.156977   hypothetical protein
HPAKL86_05960  -1      18       2.376225   hypothetical protein
HPAKL86_05965  -2      15       2.976393   3-oxoacyl-(acyl carrier protein) synthase III
HPAKL86_05970  3       19       0.241745   putative phosphate acyltransferase
HPAKL86_05975  6       23       -0.656305  50S ribosomal protein L32
HPAKL86_05980  6       22       -0.906989  hypothetical protein
HPAKL86_05985  4       19       -0.401462  mulitfunctional nucleoside diphosphate
HPAKL86_05990  3       17       -0.192076  S-adenosylmethionine synthetase
HPAKL86_05995  4       17       -1.744115  hypothetical protein
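
Across the ten island summary rows on this page, the Prediction column tracks the Virulence flag: every row with Virulence = Y is labelled a pathogenicity island, and every row with Virulence = N a genomic island. A minimal sketch of that observed pattern follows. The rule is inferred from these ten rows only and is not taken from PredictBias documentation; `predict_label` is a hypothetical name, and rows with Bias = N do not occur here, so the sketch declines to label them.

```python
def predict_label(bias, virulence, insertion):
    """Reproduce the Prediction column from the three Y/N flags.

    Assumption: inferred from the ten rows on this page. The insertion
    flag does not change the label in any of them, so it is unused.
    """
    if bias != "Y":
        return None  # no example on this page; behaviour unknown
    if virulence == "Y":
        return "Pathogenicity Island (biased-composition)"
    return "Genomic Island"


# (S.No, Bias, Virulence, Insertion elements, Prediction) from the page.
PAI = "Pathogenicity Island (biased-composition)"
GI = "Genomic Island"
rows = [
    (1, "Y", "Y", "Y", PAI), (2, "Y", "Y", "N", PAI),
    (3, "Y", "Y", "N", PAI), (4, "Y", "Y", "N", PAI),
    (5, "Y", "N", "Y", GI), (6, "Y", "Y", "Y", PAI),
    (7, "Y", "Y", "N", PAI), (8, "Y", "N", "N", GI),
    (9, "Y", "Y", "N", PAI), (10, "Y", "Y", "Y", PAI),
]

mismatches = [r[0] for r in rows if predict_label(r[1], r[2], r[3]) != r[4]]
print("rows not explained by the inferred rule:", mismatches)
```

Because a single weak virulence hit is enough to set the flag under this reading, the E-values of the individual hits are worth checking before treating a region as a pathogenicity island.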
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HPAKL86_05900  PF03944  30     0.025    delta endotoxin
		>PF03944#delta endotoxin

Length = 633

Score = 30.4 bits (68), Expect = 0.025
Identities = 26/113 (23%), Positives = 51/113 (45%), Gaps = 1/113 (0%)

Query: 129 YTGSSPTPHQSETFDNQPGKSSTTKECGNGVGSLRAEKNNSLSIEQFKQINRAYQILQKV 188
YTG + +P + +NQ ++ +++ GN SLR E+NN+ + + +Y + +V
Sbjct: 491 YTGFTISPIHATQVNNQT-RTFISEKFGNQGDSLRFEQNNTTARYTLRGNGNSYNLYLRV 549

Query: 189 LNEAGGVPALNENGTDVTVSVKSTTKSTSGSTTSGGQSGSSGSGNSETISSKN 241
+ + NG T + +TT + G +G + GN S+ +
Sbjct: 550 SSIGNSTIRVTINGRVYTATNVNTTTNNDGVNDNGARFSDINIGNVVASSNSD 602


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HPAKL86_05920  TCRTETA  97     3e-24    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 97.2 bits (242), Expect = 3e-24
Identities = 76/386 (19%), Positives = 145/386 (37%), Gaps = 30/386 (7%)

Query: 3 KKIFPLALVSSLRFLGLFIVLPIISLYADSFHSSSPLL--VGLAVGGAYLTQIIFQTPMG 60
+ + + +L +G+ +++P++ S+ + G+ + L Q +G
Sbjct: 5 RPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLG 64

Query: 61 ILSDKIGRKVVVMVCLLLFLIGSLVCFIANDIVWLVIGRFIQGM-GALGGVVSAMVADEV 119
LSD+ GR+ V++V L + + A + L IGR + G+ GA G V A +AD
Sbjct: 65 ALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADIT 124

Query: 120 KEEERTKAMAIMGAFIFISFTISMAIGPGVVAFFGG--AKWLFLLTAILTLLSLLM-LLK 176
+ER + M A F M GP + GG F A L L+ L
Sbjct: 125 DGDERARHFGFMSA----CFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFL 180

Query: 177 VKDAPKISYQIKNIKAYQPNSKALYLLYISSFFEKAFMTLIFVL-----IPLAL-----V 226
+ ++ K + +A P + + ++ A M + F++ +P AL
Sbjct: 181 LPESHKGERRPLRREALNPLASFRWARGMTVV--AALMAVFFIMQLVGQVPAALWVIFGE 238

Query: 227 NEFHKDESFLILVYVPGALLGVLSMGIASVMAEKYNKPKGVMLSGVLLFIVSYLCLFLAD 286
+ FH D + + + +L L+ + + + ++ G++ Y+ L A
Sbjct: 239 DRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFAT 298

Query: 287 SSFLGKYLWLFIVGVAFFFIGFATLEPIMQSLASKFARANEKGKVLGQFTTFGYLGSFVG 346
W+ + P +Q++ S+ +G++ G L S VG
Sbjct: 299 RG------WMAFPIM-VLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVG 351

Query: 347 GVSGGLSY-HHLGVSNTSLIVVGLGL 371
+ Y + N + G L
Sbjct: 352 PLLFTAIYAASITTWNGWAWIAGAAL 377


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HPAKL86_05940  TCRTETA  48     5e-08    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 47.5 bits (113), Expect = 5e-08
Identities = 54/292 (18%), Positives = 102/292 (34%), Gaps = 22/292 (7%)

Query: 35 ALLSDIAKSFEMESASVGLMITLYAWLVSLGSLPLMLLSAKIERKRLLLFLFALFILSHI 94
LL D+ S +A G+++ LYA + + L LS + R+ +LL A + +
Sbjct: 30 GLLRDLVHS-NDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYA 88

Query: 95 LSALAWNFWVLLISR--AGIALAHSIFWSITASLVIRVAPIGRKQQALGLLALGSSLAMI 152
+ A A WVL I R AGI A ++ + + + + + G ++ M+
Sbjct: 89 IMATAPFLWVLYIGRIVAGITGATG---AVAGAYIADITDGDERARHFGFMSACFGFGMV 145

Query: 153 LGLPLGRIIGQMLDWRSTFGVIGGVATLIALLMYKLLP----------HLPSKNAGTLSS 202
G LG ++G + F + L L LLP + N
Sbjct: 146 AGPVLGGLMGG-FSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFR 204

Query: 203 LPVLVKRPLLMGIYLLVIMVISGHFTTYSYIEPFIIQISQFSPEVATLMLFMFGLAG-VA 261
+ + ++ ++ I F + + L FG+ +A
Sbjct: 205 WARGMTVVAALMAVFFIMQLVGQVPAALWVI--FGEDRFHWDATTIGISLAAFGILHSLA 262

Query: 262 GSFLFGRFYEK-NPKKFITCAIILIICPQLLLFVFKNSEWVVFLQIFLWGIG 312
+ + G + ++ + +I +L F W+ F + L G
Sbjct: 263 QAMITGPVAARLGERRALMLGMIADGT-GYILLAFATRGWMAFPIMVLLASG 313


11  HPAKL86_06615  HPAKL86_06670  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HPAKL86_06615  2       11       -1.837186  30S ribosomal protein S18
HPAKL86_06620  2       12       -1.879112  single-stranded DNA-binding protein
HPAKL86_06625  2       11       -1.974757  30S ribosomal protein S6
HPAKL86_06630  3       10       -1.354076  DNA polymerase III subunit delta
HPAKL86_06635  2       8        -0.704912  ribonuclease R
HPAKL86_06640  0       11       -0.530680  shikimate 5-dehydrogenase
HPAKL86_06645  0       10       -0.180812  hypothetical protein
HPAKL86_06650  -1      9        -0.104808  oligopeptide ABC transporter, permease protein
HPAKL86_06655  0       10       0.227979   hypothetical protein
HPAKL86_06660  1       11       -0.200867  tryptophanyl-tRNA synthetase
HPAKL86_06665  2       12       0.360892   biotin biosynthesis protein BioC
HPAKL86_06670  2       14       1.122566   preprotein translocase subunit SecG
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
HPAKL86_06645  IGASERPTASE  29     0.014    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 28.9 bits (64), Expect = 0.014
Identities = 14/85 (16%), Positives = 26/85 (30%), Gaps = 1/85 (1%)

Query: 47 EKTEIERQNSALSPKQANMATTATEENPTKDPPLPLETDTQKQEDKQENKQEQEKENKPK 106
+ E+ + S +SPKQ T + P + P + Q ++ +
Sbjct: 1118 KTQEVPKVTSQVSPKQEQSETVQPQAEPAR-ENDPTVNIKEPQSQTNTTADTEQPAKETS 1176

Query: 107 QNSASPTQNHQKPLTTPIIGKKPLE 131
N P T + + P
Sbjct: 1177 SNVEQPVTESTTVNTGNSVVENPEN 1201


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits        Score  E-value  Comments
HPAKL86_06670  SECGEXPORT  48     1e-09    Protein-export SecG membrane protein signature.
		>SECGEXPORT#Protein-export SecG membrane protein signature.

Length = 110

Score = 47.6 bits (113), Expect = 1e-09
Identities = 24/84 (28%), Positives = 46/84 (54%), Gaps = 3/84 (3%)

Query: 1 MTSALLGLQIVLAVLIVVVVLLQ--KSSSIGLGAYSGSNDSLFGAKGPASFMAKLTMFLG 58
M ALL + +++A+ +V +++LQ K + +G +G++ +LFG+ G +FM ++T L
Sbjct: 1 MYEALLVVFLIVAIGLVGLIMLQQGKGADMGASFGAGASATLFGSSGSGNFMTRMTALLA 60

Query: 59 LLFVTNTIALGYFYNKEYGKSILD 82
LF ++ LG N +
Sbjct: 61 TLFFIISLVLGNI-NSNKTNKGSE 83


12  HPAKL86_00035  HPAKL86_00060  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HPAKL86_00035  -1      11       -0.205754  *****nodulation protein (nolK)
HPAKL86_00040  -1      10       -0.378709  GDP-D-mannose dehydratase
HPAKL86_00045  -1      10       -0.680432  mannose-1-phosphate guanyltransferase
HPAKL86_00050  -1      11       -0.569048  comB10 competence protein
HPAKL86_00055  0       12       -1.007502  type IV secretion system protein VirB9
HPAKL86_00060  0       14       0.205414   comB8 competence protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HPAKL86_00035  NUCEPIMERASE  49     1e-08    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 48.6 bits (116), Expect = 1e-08
Identities = 51/346 (14%), Positives = 106/346 (30%), Gaps = 54/346 (15%)

Query: 5 ILITGAYGMVGQNTALYFKKNKPDV-----------TLLTPKKSELY-----------LL 42
L+TGA G +G + + + V L + EL L
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDLA 62

Query: 43 DKDNVQAYLKEYKPTGIIHCAGRVGGIVANMNDLSTYMVENLLMGLYLFSSALDLGVKKA 102
D++ + + R + ++ + Y NL L + ++
Sbjct: 63 DREGMTDLFASGHFERVFISPHR-LAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121

Query: 103 INLASSCAYPKFAPNPLKESDLLNGSLEPTNEGYALAKLSVMKYCEYVSAEKGVFYKTLV 162
+ +SS Y P D ++ + YA K + S G+ L
Sbjct: 122 LYASSSSVYGLNRKMPFSTDDSVDHPVSL----YAATKKANELMAHTYSHLYGLPATGLR 177

Query: 163 PCNLYGEFDKFEEKIAHMIPGLIARMHTAKLKNEKEFVMWGDGTARREYLNAKDLARFIS 222
+YG + + P + T + K ++ G +R++ D+A I
Sbjct: 178 FFTVYGPWGR---------PDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAII 228

Query: 223 LAYENVASIPS-----------------VMNVGSGVDYSIEEYYKMVAQVLDYKGAFVKD 265
+ + + V N+G+ + +Y + + L +
Sbjct: 229 RLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNML 288

Query: 266 LSKPVGMQQKLMDISK-QEALKWELEIPLEQGIKEAYEYYLKLLEV 310
+P + + D E + + E ++ G+K +Y +V
Sbjct: 289 PLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDFYKV 334


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HPAKL86_00040  NUCEPIMERASE  88     2e-21    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 87.9 bits (218), Expect = 2e-21
Identities = 46/180 (25%), Positives = 72/180 (40%), Gaps = 19/180 (10%)

Query: 7 LITGVTGQDGSYLAEYLLNLGYEVHGLKRRSSSINTSRIDHLYEDLHSEHKRRFFLHYGD 66
L+TG G G ++++ LL G++V G+ + + S E L F H D
Sbjct: 4 LVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQP---GFQFHKID 60

Query: 67 MTDSSNLIHLIATTKPTEIYNLAAQSHVKVSFETPEYTANADGIGTLRILEAMRILGLEK 126
+ D + L A+ ++ + V+ S E P A+++ G L ILE R ++
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQ- 119

Query: 127 KTRFYQASTSELYGEVLETPQNENTPF-------NPRSPYAVAKMYAFYITKNYREAYNL 179
AS+S +YG N PF +P S YA K + Y Y L
Sbjct: 120 --HLLYASSSSVYGL------NRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGL 171


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
HPAKL86_00055  TYPE4SSCAGX  30     0.012    Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein signature.

Length = 522

Score = 30.1 bits (67), Expect = 0.012
Identities = 23/84 (27%), Positives = 41/84 (48%), Gaps = 13/84 (15%)

Query: 177 ENTTNNKPLKEEKEETKEKEEETITIGDNTNAMKIVKKDIQKGYRALKSSQRKWYCLWIC 236
E N + ++EEK++ + + + NA+K + + + Y ++ +
Sbjct: 360 EQIINKEKIREEKQKIILDQAKALETQYVHNALK--RNPVPRNYNYYQAPE--------- 408

Query: 237 SKKSKLSLMPEEIFNDKQFTYFKF 260
K+SK +MP EIF+D FTYF F
Sbjct: 409 -KRSK-HIMPSEIFDDGTFTYFGF 430


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HPAKL86_00060  PF04335  133    2e-40    VirB8 type IV secretion protein
		>PF04335#VirB8 type IV secretion protein

Length = 227

Score = 133 bits (335), Expect = 2e-40
Identities = 37/202 (18%), Positives = 73/202 (36%), Gaps = 4/202 (1%)

Query: 40 QSVFRLERNRLKIAYKLLGLMSFIALVLAIVLISVLPLQKTEHHF--VDFLNQDKHYAII 97
+ K+A+ + G+ +A + + ++ PL+ E + VD + A
Sbjct: 22 RDKLAAAERSKKLAWVVAGVAGALATAGVVAVAALTPLKTVEPYVITVDRNTGEASIAAK 81

Query: 98 QRADKSISSNEALARSLIGAYVLNRESINRIDDKSRYELVRLQSSSKVWQRFEDLIKTQN 157
D +I+ +EA+ + + YV RE + ++ V + S+ R+ KT N
Sbjct: 82 LHGDATITYDEAVRKYFLATYVRYREGWIAAAREEYFDAVMVMSARPEQDRWSRFYKTDN 141

Query: 158 SIYAQSHLEREVHI-VNIAIYQQDNNPIASVSIVAKLMNENKLVYEKRYKIVMSYLFDTP 216
Q+ L + V I +A V + + + + + Y D
Sbjct: 142 PQSPQNILANRTDVFVEIKRVSFLGGNVAQVYFTKESVTGSNST-KTDAVATIKYKVDGT 200

Query: 217 DFDYASMPKNPTGFKITRYSIT 238
KNP G+++ Y
Sbjct: 201 PSKEVDRFKNPLGYQVESYRAD 222


13  HPAKL86_00670  HPAKL86_00705  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HPAKL86_00670  -2      11       1.097192   ATP-dependent protease ATP-binding subunit ClpX
HPAKL86_00675  -2      13       1.146019   rod shape-determining protein MreB
HPAKL86_00680  -1      14       0.223722   rod shape-determining protein MreC
HPAKL86_00685  0       15       0.642726   hypothetical protein
HPAKL86_00690  0       12       1.970342   hypothetical protein
HPAKL86_00695  -1      12       1.287668   cell division protein
HPAKL86_00700  -1      12       1.660702   flagellar hook-basal body protein FliE
HPAKL86_00705  -1      13       1.659701   flagellar basal body rod protein FlgC
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
HPAKL86_00670  HTHFIS  41     8e-06    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 41.0 bits (96), Expect = 8e-06
Identities = 24/116 (20%), Positives = 45/116 (38%), Gaps = 20/116 (17%)

Query: 134 LEHLEEVELSKSNILLIGPTGSGKTLMAQTLAKHLD------IPI---AISDATSLTEAG 184
L + + +++ G +G+GK L+A+ L + + I AI L E+
Sbjct: 150 YRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPR--DLIESE 207

Query: 185 YVGEDVENILTRLLQASDWNVQKAQKGIVFIDEIDKIS--------RLSENRSITR 232
G + T S ++A+ G +F+DEI + R+ + T
Sbjct: 208 LFGH-EKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTT 262


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HPAKL86_00675  SHAPEPROTEIN  474    e-171    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 474 bits (1221), Expect = e-171
Identities = 179/347 (51%), Positives = 248/347 (71%), Gaps = 2/347 (0%)

Query: 2 IFSKLIGLFSHDIAIDLGTANTIVLVKGQGIIINEPSIVAVRMGLFDSKAYDILAVGSEA 61
+ K G+FS+D++IDLGTANT++ VKGQGI++NEPS+VA+R S + AVG +A
Sbjct: 1 MLKKFRGMFSNDLSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGSPKS-VAAVGHDA 59

Query: 62 KEMLGKTPNSIRAIRPMKDGVIADYDITAKMIRYFIEKAHKRKTW-IRPRIMVCVPYGLT 120
K+MLG+TP +I AIRPMKDGVIAD+ +T KM+++FI++ H PR++VCVP G T
Sbjct: 60 KQMLGRTPGNIAAIRPMKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGAT 119

Query: 121 SVERNAVKESALSAGAREVFLIEEPMAAAIGAGLPVKEPQGSLIVDIGGGTTEIGVISLG 180
VER A++ESA AGAREVFLIEEPMAAAIGAGLPV E GS++VDIGGGTTE+ VISL
Sbjct: 120 QVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLN 179

Query: 181 GLVISKSIRVAGDKLDQSIVEYIRKKFNLLIGERTGEEIKIEIGCAIKLDPPLTMEVSGR 240
G+V S S+R+ GD+ D++I+ Y+R+ + LIGE T E IK EIG A D +EV GR
Sbjct: 180 GVVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGR 239

Query: 241 DQVSGLLHTIELSSDDVFEAIKDQVREISSALRSVLEEVKPDLAKDIVQNGVVLTGGGAL 300
+ G+ L+S+++ EA+++ + I SA+ LE+ P+LA DI + G+VLTGGGAL
Sbjct: 240 NLAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGAL 299

Query: 301 IKGLDKYLSDMVKLPVYVGDEPLLAVAKGTGEAIQDLDLLSRVGFSE 347
++ LD+ L + +PV V ++PL VA+G G+A++ +D+ FSE
Sbjct: 300 LRNLDRLLMEETGIPVVVAEDPLTCVARGGGKALEMIDMHGGDLFSE 346


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
HPAKL86_00700  FLGHOOKFLIE  76     1e-21    Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature.

Length = 103

Score = 76.2 bits (187), Expect = 1e-21
Identities = 19/77 (24%), Positives = 40/77 (51%), Gaps = 1/77 (1%)

Query: 34 EQKGGEFSKLLKQSINELNNTQEQSDKALADMATGQIK-DLHQAAIAIGKAETSMKLMLE 92
Q F+ L +++ +++TQ + G+ L+ + KA SM++ ++
Sbjct: 27 PQPTISFAGQLHAALDRISDTQTAARTQAEKFTLGEPGVALNDVMTDMQKASVSMQMGIQ 86

Query: 93 VRNKAISAYKELLRTQI 109
VRNK ++AY+E++ Q+
Sbjct: 87 VRNKLVAAYQEVMSMQV 103


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits        Score  E-value  Comments
HPAKL86_00705  FLGHOOKAP1  28     0.016    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 28.0 bits (62), Expect = 0.016
Identities = 10/38 (26%), Positives = 15/38 (39%)

Query: 121 NVNAVIEMADLVEATRAYQANVAAFQSAKNMAQNAIGM 158
VN E +L + Y AN Q+A + I +
Sbjct: 508 GVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


14  HPAKL86_01020  HPAKL86_01070  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HPAKL86_01020  5       18       2.886303   DNA-directed RNA polymerase subunit alpha
HPAKL86_01025  5       17       2.469290   50S ribosomal protein L17
HPAKL86_01030  4       15       2.011239   hypothetical protein
HPAKL86_01035  1       12       0.663113   collagen-binding surface adhesin SpaP
HPAKL86_01040  0       12       -0.027385  membrane-associated lipoprotein
HPAKL86_01045  0       13       0.217908   hypothetical protein
HPAKL86_01050  0       14       0.182906   hypothetical protein
HPAKL86_01055  1       16       0.133764   putative Outer membrane protein
HPAKL86_01060  2       18       -0.449696  tRNA modification GTPase TrmE
HPAKL86_01065  2       18       -0.274822  hypothetical protein
HPAKL86_01070  1       17       0.369035   membrane protein insertase
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
HPAKL86_01020  BACINVASINC  32     0.004    Salmonella/Shigella invasin protein C signature.
		>BACINVASINC#Salmonella/Shigella invasin protein C signature.

Length = 409

Score = 31.8 bits (71), Expect = 0.004
Identities = 33/133 (24%), Positives = 50/133 (37%), Gaps = 13/133 (9%)

Query: 219 AFLSAVKVMSKQLGVFGERPIANTEYSGDYAQRDDAKDLSAKIESMNL-SARCFNCLDKI 277
A ++ + QLG+ G A EY G +R K +AKI+ + S N L+
Sbjct: 173 ALSGSISQSALQLGITGVG--AKLEYKGLQNERGALKHNAAKIDKLTTESHSIKNVLNGQ 230

Query: 278 GIKYVGELVLMSEEELKGVK---------NMGKKSYDEIAEKLNDLGY-PVGTELSPEQR 327
+G + S + L K N + LG ++SPE +
Sbjct: 231 NSVKLGAEGVDSLKSLNMKKTGTDATKNLNDATLKSNAGTSATESLGIKNSNKQISPEHQ 290

Query: 328 ESLKKRLEKLEDK 340
L KRLE +E
Sbjct: 291 AILSKRLESVESD 303


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
HPAKL86_01040  LIPOLPP20  265    4e-94    LPP20 lipoprotein precursor signature.
		>LIPOLPP20#LPP20 lipoprotein precursor signature.

Length = 175

Score = 265 bits (677), Expect = 4e-94
Identities = 154/175 (88%), Positives = 167/175 (95%)

Query: 1 MKNQVKKILGMSVIAAMVIVGCSHAPKSGVSKSNTEYKEATKGAPEWVMGDLEKVAKYEK 60
MKNQVKKILGMSV+AAMVIVGCSHAPKSG+SKSN YKEATKGAP+WV+GDLEKVAKYEK
Sbjct: 1 MKNQVKKILGMSVVAAMVIVGCSHAPKSGISKSNKAYKEATKGAPDWVVGDLEKVAKYEK 60

Query: 61 YSGVFLGRAEDLITSGDVDYSTSQATTKARANLAANLKSTLQKELQNEKTRTTDSSGKSS 120
YSGVFLGRAEDLIT+ DVDYST+QAT KARANLAANLKSTLQK+L+NEKTRT D+SGK S
Sbjct: 61 YSGVFLGRAEDLITNNDVDYSTNQATAKARANLAANLKSTLQKDLENEKTRTVDASGKRS 120

Query: 121 ISGSDSEKISQLVDNELVASKMLARYVGKDRVFVLVGLDKEIVDKVRSELGMVKK 175
ISG+D+EKISQLVD EL+ASKMLARYVGKDRVFVLVGLDK+IVDKVR ELGMVKK
Sbjct: 121 ISGTDTEKISQLVDKELIASKMLARYVGKDRVFVLVGLDKQIVDKVREELGMVKK 175


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
HPAKL86_01045  LIPOLPP20  29     0.004    LPP20 lipoprotein precursor signature.
		>LIPOLPP20#LPP20 lipoprotein precursor signature.

Length = 175

Score = 29.0 bits (64), Expect = 0.004
Identities = 12/34 (35%), Positives = 25/34 (73%)

Query: 91 EITASQLKATFIGADKVYVLIEVDKKNIALINKE 124
E+ AS++ A ++G D+V+VL+ +DK+ + + +E
Sbjct: 136 ELIASKMLARYVGKDRVFVLVGLDKQIVDKVREE 169


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits        Score  E-value  Comments
HPAKL86_01055  CABNDNGRPT  31     0.021    NodO calcium binding signature.
		>CABNDNGRPT#NodO calcium binding signature.

Length = 479

Score = 30.7 bits (69), Expect = 0.021
Identities = 13/65 (20%), Positives = 21/65 (32%), Gaps = 7/65 (10%)

Query: 435 IIHEYGHTLGYTH-------NGNMTYQRVRVCEESGKYELCKGGHVVEKDGKEEQVFSNG 487
HE GH LG H G+ +Y E+S ++ + E +
Sbjct: 186 FTHEIGHALGLAHPGEYNAGEGDPSYNDAVYAEDSYQFSIMSYWGENETGADYNGHYGGA 245

Query: 488 KQVLD 492
+ D
Sbjct: 246 PMIDD 250


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
HPAKL86_01060  TCRTETOQM  34     0.001    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 34.4 bits (79), Expect = 0.001
Identities = 32/134 (23%), Positives = 53/134 (39%), Gaps = 25/134 (18%)

Query: 216 LSIVGKPNAGKSSLLNAMLLEERA---LVSDIKGTTR-DTIEE-------------VIEL 258
+ ++ +AGK++L ++L A L S KGTTR D +
Sbjct: 6 IGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQW 65

Query: 259 QGHKVRLIDTAGIRESTDKIERLGIEKSLKSLENCDIVLGVFDLSKPLEKEDFNLIDTLN 318
+ KV +IDT G + ++ R SL L D + + ++ + L L
Sbjct: 66 ENTKVNIIDTPGHMDFLAEVYR-----SLSVL---DGAILLISAKDGVQAQTRILFHALR 117

Query: 319 RAKKPCIVVLNKND 332
+ P I +NK D
Sbjct: 118 KMGIPTIFFINKID 131


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
HPAKL86_01070  60KDINNERMP  430    e-148    60kDa inner membrane protein signature.
		>60KDINNERMP#60kDa inner membrane protein signature.

Length = 548

Score = 430 bits (1107), Expect = e-148
Identities = 163/575 (28%), Positives = 277/575 (48%), Gaps = 68/575 (11%)

Query: 10 RLILAIALSFLFIALYSYFFQKPNKTTTPT-TKQETTNNPTATSPNNTPNAFSATQIIPQ 68
R +L IAL F+ ++ + Q N T Q TT + + P + I +
Sbjct: 5 RNLLVIALLFVSFMIWQAWEQDKNPQPQAQQTTQTTTTAAGSAADQGVPASGQGKLISVK 64

Query: 69 ENLLSTISFEHARIEIDSLGR--IKQVYLKDKKYLTPKQKGFLEHVGHLFSSKENSQTPL 126
++L + I++ G + + K L Q L F + S
Sbjct: 65 TDVL--------DLTINTRGGDVEQALLPAYPKELNSTQPFQLLETSPQFIYQAQSGLTG 116

Query: 127 KELPLLATDKLKPLEVRFLDPTLNNKAFNTPYSASKTTLGPNEQLV--LTQDLGALTIIK 184
++ P + +PL +N A G NE V D T K
Sbjct: 117 RDGPDNPANGPRPL-------------YNVEKDAYVLAEGQNELQVPMTYTDAAGNTFTK 163

Query: 185 TLTFYDDLHYDLQIAFKSPNN------------------LIPSYVITNGYRPVADLDSYT 226
T Y + + + N L P + + +T
Sbjct: 164 TFVLKRG-DYAVNVNYNVQNAGEKPLEISSFGQLKQSITLPPHLDTGSSNFAL-----HT 217

Query: 227 FSGVLLENTDKKIEKIE---DKDAKEIKRFSNTLFLSSVDRYFTTLLFTKDPQGFEALID 283
F G D+K EK + D + + S +++ + +YF T + G
Sbjct: 218 FRGAAYSTPDEKYEKYKFDTIADNENLNISSKGGWVAMLQQYFATAWIPHN-DGTNNFYT 276

Query: 284 SEIGTKNPLGFISLKNEA-----------DLHGYIGPKDYRSLKAISPMLTDVIEYGLIT 332
+ +G N + I K++ + ++GP+ + A++P L ++YG +
Sbjct: 277 ANLG--NGIAAIGYKSQPVLVQPGQTGAMNSTLWVGPEIQDKMAAVAPHLDLTVDYGWLW 334

Query: 333 FFAKGVFVLLDYLYQFVGNWGWAIILLTIIVRLILYPLSYKGMVSMQKLKEIAPKMKELQ 392
F ++ +F LL +++ FVGNWG++II++T IVR I+YPL+ SM K++ + PK++ ++
Sbjct: 335 FISQPLFKLLKWIHSFVGNWGFSIIIITFIVRGIMYPLTKAQYTSMAKMRMLQPKIQAMR 394

Query: 393 EKYKGEPQKLQAHMMQLYKKHGANPLGGCLPLILQIPVFFAIYRVLYNAVELKSSEWILW 452
E+ + Q++ MM LYK NPLGGC PL++Q+P+F A+Y +L +VEL+ + + LW
Sbjct: 395 ERLGDDKQRISQEMMALYKAEKVNPLGGCFPLLIQMPIFLALYYMLMGSVELRQAPFALW 454

Query: 453 IHDLSIMDPYFILPLLMGASMYWHQSVTPSSVTDPMQAKIFKFLPLLFTIFLITFPAGLV 512
IHDLS DPY+ILP+LMG +M++ Q ++P++VTDPMQ KI F+P++FT+F + FP+GLV
Sbjct: 455 IHDLSAQDPYYILPILMGVTMFFIQKMSPTTVTDPMQQKIMTFMPVIFTVFFLWFPSGLV 514

Query: 513 LYWTTNNILSVLQQLIINKVLENKKRAHAQSKKES 547
LY+ +N+++++QQ +I + LE K+ H++ KK+S
Sbjct: 515 LYYIVSNLVTIIQQQLIYRGLE-KRGLHSREKKKS 548


15  HPAKL86_01940  HPAKL86_01985  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HPAKL86_01940  0       13       -1.350847  signal peptidase I (lepB)
HPAKL86_01945  1       13       -1.253646  bifunctional 5,10-methylene-tetrahydrofolate
HPAKL86_01950  1       14       -2.272485  hypothetical protein
HPAKL86_01955  2       14       -1.711469  hypothetical protein
HPAKL86_01960  2       16       -1.043572  hypothetical protein
HPAKL86_01965  1       16       -0.794685  dihydroorotase
HPAKL86_01970  1       15       -2.243498  hypothetical protein
HPAKL86_01975  0       13       -2.670043  hypothetical protein
HPAKL86_01980  0       13       -1.798988  flagellar motor switch protein
HPAKL86_01985  1       13       -0.670414  endonuclease III
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HPAKL86_01940  PREPILNPTASE  30     0.010    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 29.8 bits (67), Expect = 0.010
Identities = 11/59 (18%), Positives = 24/59 (40%), Gaps = 10/59 (16%)

Query: 8 YAFFSSWVGTIVVVLLVIFFVAQAFIIPSRSMVGTLYEGDMLFVKKFSYGIPIPKIPWI 66
A +W+G L ++ ++ S+VG ++ ++ PIP P++
Sbjct: 219 LAALGAWLG--WQALPIVLLLS--------SLVGAFMGIGLILLRNHHQSKPIPFGPYL 267


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HPAKL86_01955  TYPE3IMSPROT  30     0.006    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 29.7 bits (67), Expect = 0.006
Identities = 18/64 (28%), Positives = 30/64 (46%), Gaps = 4/64 (6%)

Query: 87 LQSYSVMLFFNLLLLIDILGFLPFSIYHHFMASLIFSALFCGSLFLSSPLLGMIALVALS 146
L Y F L+L+ +LPFS S + + +L PLL + AL+A++
Sbjct: 45 LSDYYFEHFSKLMLIPAEQSYLPFSQ----ALSYVVDNVLLEFFYLCFPLLTVAALMAIA 100

Query: 147 SSLL 150
S ++
Sbjct: 101 SHVV 104


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HPAKL86_01965  PF05043  28     0.046    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 28.4 bits (63), Expect = 0.046
Identities = 9/39 (23%), Positives = 17/39 (43%)

Query: 46 TNTPTTLEYEKEILNHSSNFKPLMSLYFNDGLTLEELQR 84
T+ HS++F L ++FN+G E + +
Sbjct: 70 TDDSDIEMVYHHFFKHSTHFSILEFIFFNEGCQAESICK 108


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HPAKL86_01970  PF03544  50     2e-09    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 50.0 bits (119), Expect = 2e-09
Identities = 45/283 (15%), Positives = 91/283 (32%), Gaps = 63/283 (22%)

Query: 23 RNFFFSLILSILLHLLIY----FLYEYRESLFPSKPKLVKVDPKNLLILKRGHSQDPSKN 78
R F + +LS+ +H + + ++ P+ + + V L+ + P
Sbjct: 12 RRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELPAPAQPISVTMVAPADLEPPQAVQPP-- 69

Query: 79 TQGAPKPTLAGPQKPPTPPTPPTPPTPPKPIEKPKPKPKPKPKPKKSDHKHKALKKVEKV 138
P +P P P P P IEKPKPKPKPKPKP K
Sbjct: 70 ------PEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKV------------ 111

Query: 139 EEKKIVEKKVEEKKVVEKKPAQKEFDPNQLSFLPKEVVPPTPPRTDNNKGLDNQTRRDID 198
+ +++ P + T +
Sbjct: 112 ------------------EQPKRDVKPVESRPASPFENTAPARPTSST------------ 141

Query: 199 ELYGEEFGDLGTAEKDFIRNNLRDIGRITQKYLEYPQVAAYLGQDGTNAVEFYLHPNGDI 258
+ + + R +++ +YP A L +G V+F + P+G +
Sbjct: 142 ------ATAATSKPVTSVASGPR---ALSRNQPQYPARAQALRIEGQVKVKFDVTPDGRV 192

Query: 259 SDLKIIIGSEYKILDDNTLKTIQIAYKDYPRPKTKTLIRIRVR 301
+++I+ + + ++ + +P + ++ I +
Sbjct: 193 DNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSGIVVNILFK 235


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HPAKL86_01980  FLGMOTORFLIN  99     2e-30    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 99 bits (249), Expect = 2e-30
Identities = 25/77 (32%), Positives = 47/77 (61%)

Query: 34 LICDYKNLLDMEIVFSAELGSTQIPLLQILRFEKGSVIDLQKPAGESVDTFVNGRVIGKG 93
+ D ++D+ + + ELG T++ + ++LR +GSV+ L AGE +D +NG +I +G
Sbjct: 50 AMQDIDLIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQG 109

Query: 94 EVMVFERNLAIRLNEIL 110
EV+V +R+ +I+
Sbjct: 110 EVVVVADKYGVRITDII 126


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits        Score  E-value  Comments
HPAKL86_01985  OMS28PORIN  30     0.006    OMS28 porin signature.
		>OMS28PORIN#OMS28 porin signature.

Length = 257

Score = 30.1 bits (67), Expect = 0.006
Identities = 22/77 (28%), Positives = 42/77 (54%), Gaps = 8/77 (10%)

Query: 60 LFEKYPSVKDLAL-----ASLEEVKETIKSVSYFNNKSKHLINMAQKVVRDFKGVIPSTQ 114
+ K P+ K+L L A +E+VKET+ + +++ + AQKV+ G+ PS +
Sbjct: 164 MLNKSPNNKELELTKEEFAKVEQVKETLMASERALDET---VQEAQKVLNMVNGLNPSNK 220

Query: 115 KELMSLDGVGQKTANVV 131
++++ V + +NVV
Sbjct: 221 DQVLAKKDVAKAISNVV 237


16  HPAKL86_02080  HPAKL86_02120  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HPAKL86_02080  -2      11       0.681229   flagellin A
HPAKL86_02085  -1      12       0.769244   3-methyladenine DNA glycosylase
HPAKL86_02090  0       13       1.171372   hypothetical protein
HPAKL86_02095  1       9        0.719915   uroporphyrinogen decarboxylase
HPAKL86_02100  2       9        0.219165   outer-membrane protein of the hefABC efflux
HPAKL86_02105  2       9        0.171628   efflux transporter
HPAKL86_02110  1       8        0.403997   cytoplasmic pump protein of the hefABC efflux
HPAKL86_02115  1       9        -0.021035  hypothetical protein
HPAKL86_02120  1       10       -0.000815  vacuolating cytotoxin VacA
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
HPAKL86_02080  FLAGELLIN  244    5e-77    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 244 bits (625), Expect = 5e-77
Identities = 127/518 (24%), Positives = 210/518 (40%), Gaps = 22/518 (4%)

Query: 2 AFQVNTNINAMNAHVQSALTQNALKTSLERLSSGLRINKAADDASGMTVADSLRSQASSL 61
A +NTN ++ +Q++L +++ERLSSGLRIN A DDA+G +A+ S L
Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60

Query: 62 GQAIANTNDGMGIIQVADKAMDEQLKILDTVKVKATQAAQDGQTTESRKAIQSDIVRLIQ 121
QA N NDG+ I Q + A++E L V+ + QA + K+IQ +I + ++
Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120

Query: 122 GLDNIGNTTTYNGQALLSGQFTNKEFQVGAYSNQSIKASIGSTTSDKIGQVRI-ATGALI 180
+D + N T +NG +LS + QVGA ++I + +G G
Sbjct: 121 EIDRVSNQTQFNGVKVLSQDN-QMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKE 179

Query: 181 TASGDISLTFKQVDGVNDVTLESVKISSSAGTGIGVLAEVINKNSNRTGVKAYASVITTS 240
GD+ +FK V G + + + K +G V ++ V A +TT
Sbjct: 180 ATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTD 239

Query: 241 DMAVQSGSVSNLTLNGIHLGNIVDIKKNDSDGRLVAAINAVTSETGVEAYTDQNGRLNLR 300
D N + K A A+ + + + +
Sbjct: 240 DAE-----------NNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTID 288

Query: 301 SLDGRGIEIKTDSTSNGPSALTMVNGGQDLTQGSTNYGRLSLTRLDAKSINV------VS 354
+ G K +T NG V S + +N +
Sbjct: 289 TKTGNDGNGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKT 348

Query: 355 ASDSQHLGFSAIGFGESQVAETTVNLRDVTGNFNANVKSASGANYNAVIASGNQSL---G 411
++S L ++ TVN + T N + + +G + S
Sbjct: 349 KNESAKLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINE 408

Query: 412 SGVTTLRGAMVVIDIAESAMKMLDKVRSDLGSVQNQMISTVNNISITQVNVKAAESQIRD 471
+ + +SA+ +D VRS LG++QN+ S + N+ T N+ +A S+I D
Sbjct: 409 DAAAAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIED 468

Query: 472 VDFAEESANFNKNNILAQSGSYAMSQANTVQQNILRLL 509
D+A E +N +K IL Q+G+ ++QAN V QN+L LL
Sbjct: 469 ADYATEVSNMSKAQILQQAGTSVLAQANQVPQNVLSLL 506


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HPAKL86_02085  PF05272  30     0.007    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.0 bits (67), Expect = 0.007
Identities = 13/95 (13%), Positives = 26/95 (27%), Gaps = 20/95 (21%)

Query: 60 ILENDDEINLKKIAYIEFSKLAECVRPSGFYNQKAKRLIDLSENILKDFQSFENFKQEVT 119
L + + +A+ E + VR + +KA E+
Sbjct: 458 ALRSAPALA-GCVAFDELREQPVAVRAFPW--RKAPGP-------------LEDADVLRL 501

Query: 120 REWLLDQKGIGKESADAILCYVCAKEVMVVDKYSY 154
+++ G G+ SA + D
Sbjct: 502 ADYVETTYGTGEASAQTTEQAINV----AADMNRV 532


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
HPAKL86_02105  RTXTOXIND  53     2e-10    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 52.9 bits (127), Expect = 2e-10
Identities = 24/82 (29%), Positives = 36/82 (43%), Gaps = 5/82 (6%)

Query: 27 NVKAIQDSKLTLDSTGIVDSIKVTEGSVVKKGDVLLLLYNQDKQAQSDSTEQQLIFAKKQ 86
K I+ IV I V EG V+KGDVLL L +A + T+ L+ A+ +
Sbjct: 95 RSKEIKPI-----ENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLE 149

Query: 87 YQRYSKTGGAVDKNTLESYEFN 108
RY +++ N L +
Sbjct: 150 QTRYQILSRSIELNKLPELKLP 171



Score = 33.3 bits (76), Expect = 7e-04
Identities = 17/115 (14%), Positives = 37/115 (32%), Gaps = 13/115 (11%)

Query: 70 QAQSDSTEQQLIFAKKQYQR--YSKTGGAVDKNTLESYEFNYRRLESDYAYSIAVLNKTI 127
+++ S +++ + ++ K D + + E ++
Sbjct: 279 ESEILSAKEEYQLVTQLFKNEILDKLRQTTDN--IGLLTLELAKNEER-------QQASV 329

Query: 128 LRAPFDGVVANKNIQVGEGVSANNTVLLRLVSHARKLVIE--FDSKYINAVKVGD 180
+RAP V + GV L+ +V L + +K I + VG
Sbjct: 330 IRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQ 384


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HPAKL86_02110  ACRIFLAVINRP  893    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 893 bits (2308), Expect = 0.0
Identities = 283/1037 (27%), Positives = 517/1037 (49%), Gaps = 38/1037 (3%)

Query: 1 MYKTAINRPITTLMFAFAIIFFGVMGFKKLSVALFPNIDIPTVVVTTTYPGASAEIIESK 60
M I RPI + A ++ G + +L VA +P I P V V+ YPGA A+ ++
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTDKIEEAVMGIDGIKKVTSTSSKNVSIVV-IEFELEKPNEEALNDVVNKISSVR-FDDS 118
VT IE+ + GID + ++STS S+ + + F+ + A V NK+
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 119 NIQKPSINKFDT-NSQAIISLFVSSS-SVPATTLNDYAKNTIKPMLQKISGVGGVQLNGF 176
+Q+ I+ + +S +++ FVS + ++DY + +K L +++GVG VQL G
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG- 179

Query: 177 RERQIRIYADPTLMNKYNLTYADLFSTLKAENVEIDGGRIVNS------QRELSILINAN 230
+ +RI+ D L+NKY LT D+ + LK +N +I G++ + Q SI+
Sbjct: 180 AQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 231 SYSVADVEKIQV-----GNHVRLGDIAKIEIGLEEDNTFASFKDKPGVILEIQKVAGSNE 285
+ + K+ + G+ VRL D+A++E+G E N A KP L I+ G+N
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 286 IEIVDRVYEALKHIQAISP-SYEIRPFLDTTSYIRTSIEDVKFDLVLGAILAVLVVFAFL 344
++ + L +Q P ++ DTT +++ SI +V L +L LV++ FL
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 345 RNGTITLVSAISIPISIMGTFALIQWMGFSLNMLTMVALTLAIGIIIDDAIVVIENIHK- 403
+N TL+ I++P+ ++GTFA++ G+S+N LTM + LAIG+++DDAIVV+EN+ +
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 404 KLEMGMDKRKASYEGVKEIGFALVAISAMLLSVFVPIGNMKGIIGRFFQSFGITVALAIA 463
+E + ++A+ + + +I ALV I+ +L +VF+P+ G G ++ F IT+ A+A
Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479

Query: 464 LSYVVVVTIIPMVSSVVVNPRHS-------RFYVWSEPFFKALESYYTRLLQWVLNHKLI 516
LS +V + + P + + ++ P + F+ W F ++YT + +L
Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 517 IFIAVVLVFVGSLFVASKIGMEFMLKEDRGRFLVWLRAKPGVSIDYMTKKAKGFQEAI-- 574
+ L+ G + + ++ F+ +ED+G FL ++ G + + K +
Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 575 --EKHAEVEFTTLQVGYGTTQSPFKAKIFVQLKPLKERKKERKLGQFELMRALRSELKSM 632
+ + E FT + Q+ FV LKP +ER + + + RA + EL +
Sbjct: 600 NEKANVESVFTVNGFSFS-GQAQNAGMAFVSLKPWEERNGDENSAEAVIHRA-KMELGKI 657

Query: 633 PEAKDLESINLSEVPLLGGGGDSSPFQTFVFAHSQEAVDKSVANLKKFLLESPELKGKIE 692
+ + N+ + L G ++ F + + D + L + + +
Sbjct: 658 RDGF-VIPFNMPAIVEL---GTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLV 713

Query: 693 GFHTSTSESQPQLQLKILRQNANKYGVSAQTIGSVVSSAFSGTSQASVFKQDGKEYDMII 752
+ E Q +L++ ++ A GVS I +S+A G + + F G+ + +
Sbjct: 714 SVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGG-TYVNDFIDRGRVKKLYV 772

Query: 753 RVPDNKRVSVEDIKRLQVRNKYNKLMFLDALVEITETKSPSSISRYNRQRSVTVLAQPKA 812
+ R+ ED+ +L VR+ +++ A + RYN S+ + +
Sbjct: 773 QADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAP 832

Query: 813 GISLGEILTQVSKNTKEWLVEGANYRFTGEADNAKETNGEFLIAIATAFVLIYMILAALY 872
G S G+ + + +N L G Y +TG + + + + +A +FV++++ LAALY
Sbjct: 833 GTSSGDAMALM-ENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALY 891

Query: 873 ESILEPFIIMVTMPLSFSGAFFALGLVHQPLSMFSMIGLILLIGMVGKNATLLIDVANE- 931
ES P +M+ +PL G A L +Q ++ M+GL+ IG+ KNA L+++ A +
Sbjct: 892 ESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDL 951

Query: 932 ERKKGLNIQEAILFAGKTRLRPILMTTIAMVCGMLPLALASGDGAAMKSPIGIAMSGGLM 991
K+G + EA L A + RLRPILMT++A + G+LPLA+++G G+ ++ +GI + GG++
Sbjct: 952 MEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMV 1011

Query: 992 ISMVLSLLIVPVFYRLL 1008
+ +L++ VPVF+ ++
Sbjct: 1012 SATLLAIFFVPVFFVVI 1028


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HPAKL86_02120  VACCYTOTOXIN  278    1e-77    Helicobacter pylori vacuolating cytotoxin signature.
		>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature.

Length = 1291

Score = 278 bits (712), Expect = 1e-77
Identities = 103/394 (26%), Positives = 186/394 (47%), Gaps = 14/394 (3%)

Query: 2806 NAVNWLNALFVAKGGNPLFAPYYLQDTPTKHIVTLMEDVSSALGMLTKPSLKNNSTDVLQ 2865
+ L L + + +A + T I + ++ L + K + L
Sbjct: 907 QGRDLLQTLLI-DSHDAGYARTMIDATSANEITKQLNTATTTLNNIASLEHKTSGLQTLS 965

Query: 2866 LNTYTQQMGRLAKLSSFASFDSTNFSERLSSLKNQRFADAIPNAMDVILKYSQRDKLKNN 2925
L+ RL LS + +F++RL +LK+QRFA + +A +V+ +++ + + N
Sbjct: 966 LSNAMILNSRLVNLSRRHTNHIDSFAKRLQALKDQRFAS-LESAAEVLYQFAPKYEKPTN 1024

Query: 2926 LWATGVGGISFVENGSGTLYGINLGYDRFVRG---VIVGGYAAYGYSGFNSR--ITGSRS 2980
+WA +GG S G+ +LYG + G D ++ G IVGG+ +YGYS F+++ S +
Sbjct: 1025 VWANAIGGTSLNSGGNASLYGTSAGVDAYLNGEVEAIVGGFGSYGYSSFSNQANSLNSGA 1084

Query: 2981 DNVNVGLYARAFIKKSELTFSVNETWGANKTQISSNDALLSMINQSYSYNTWTTNARVNY 3040
+N N G+Y+R F + E F G++++ ++ ALL +NQSY+Y ++ R +Y
Sbjct: 1085 NNTNFGVYSRIFANQHEFDFEAQGALGSDQSSLNFKSALLRDLNQSYNYLAYSAATRASY 1144

Query: 3041 GYDFMFKNKSVIIKPQISLGYYYIGMTGLDGVMNNALYNQFKANADPSKKSVLTINFAIE 3100
GYDF F ++++KP + + Y ++G T + S + + + +E
Sbjct: 1145 GYDFAFFRNALVLKPSVGVSYNHLGSTNFKS----NSNQKVALKNGASSQHLFNASANVE 1200

Query: 3101 NRHYFNKNSYFYAISGISRDLLVRSMGDKLVRFIGDNTLSYRKGELYNTFASITTGGEVR 3160
R+Y+ SYFY +G+ ++ + V + R NT A + GGE++
Sbjct: 1201 ARYYYGDTSYFYMNAGVLQEFANFGSSNA-VSLNTFKVNATRNP--LNTHARVMMGGELK 1257

Query: 3161 LFKSFYANAGVGARFGLDYKMINITGNIGMRLAF 3194
L K + N G L + + N+GMR +F
Sbjct: 1258 LAKEVFLNLGFVYLHNLISNIGHFASNLGMRYSF 1291



Score = 35.4 bits (81), Expect = 0.005
Identities = 15/100 (15%), Positives = 30/100 (30%), Gaps = 5/100 (5%)

Query: 708 SYTFDGVNNAFNEDKFNGGSFNFNHAEQTDAFNNNSFNGGSFNFNAKQVDFNHNLFNGGV 767
SY+ + E FN + ++A Q +N + G+ + + N G
Sbjct: 272 SYSTINTSKVTGEVNFNHLTVGDHNAAQAGIIASNKTHIGTLDLW-QSAGLNIIAPPEGG 330

Query: 768 FNF---NNTPKVSFTNDTFNVNNQFKING-AQTTFTFNKG 803
+ + + N + + N Q N
Sbjct: 331 YKDKPNDKPSNTTQNNAKNDKQESSQNNSNTQVINPPNSA 370



Score = 33.5 bits (76), Expect = 0.018
Identities = 53/283 (18%), Positives = 84/283 (29%), Gaps = 49/283 (17%)

Query: 670 YKFQGPKNAYTFKNTNFLA-GNFKFQGKTTIEKSVLNDASYTF---DGVNN-AFNE---- 720
Y N N L G+ I + + + G+N A E
Sbjct: 273 YSTINTSKVTGEVNFNHLTVGDHNAAQAGIIASNKTHIGTLDLWQSAGLNIIAPPEGGYK 332

Query: 721 ----DKFNGGSFNFNHAEQTDAFNNNSFNGGSFNFNAKQ-VDFNHNLFNGGVFNFNNTPK 775
DK + + N ++ ++ NNS N+ Q + G F
Sbjct: 333 DKPNDKPSNTTQNNAKNDKQESSQNNSNTQVINPPNSAQKTEIQPTQVIDGPFAGGKNTV 392

Query: 776 VSFTNDTFNVNNQFKI-NGAQTTFTFNKGVVFNMQGLLNSLSVGTTYQLLNAKSVDYKDN 834
V+ + N N I G ++ +LS + + L +++
Sbjct: 393 VNI--NRINTNADGTIRVGGFKASLTTNAAHLHIGKGGINLSNQASGRSLLVENL----- 445

Query: 835 HNTLYQMLRWTSGENPSGKLVDENQS-----APSSAKIYNVQFID--NGLTYYIKESFNN 887
T G L NQ A SSA D NG +FNN
Sbjct: 446 ----------TGNITVDGPLRVNNQVGGYALAGSSANFEFKAGTDTKNGTA-----TFNN 490

Query: 888 GITLTRLCTLGYTHCVSVHDNAFNLKNVNNSASDTVFYLNGMT 930
I+L R V H F + N +T+ + +G+T
Sbjct: 491 DISLGRFV----NLKVDAHTANFKGIDTGNGGFNTLDF-SGVT 528


17  HPAKL86_02190  HPAKL86_02225  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HPAKL86_02190  -2      13       0.498876   neutrophil activating protein NapA
HPAKL86_02195  -2      12       0.366190   histidine kinase sensor protein
HPAKL86_02200  -2      11       1.246509   hypothetical protein
HPAKL86_02205  -2      12       1.676105   flagellar basal body P-ring protein
HPAKL86_02210  -2      10       1.965127   ATP-dependent RNA helicase
HPAKL86_02215  -2      10       2.257881   hypothetical protein
HPAKL86_02220  -2      11       2.244122   hypothetical protein
HPAKL86_02225  -2      12       2.212986   oligopeptide permease ATPase
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
HPAKL86_02190  HELNAPAPROT  148    4e-49    Helicobacter neutrophil-activating protein A family ...
		>HELNAPAPROT#Helicobacter neutrophil-activating protein A family

signature.
Length = 153

Score = 148 bits (376), Expect = 4e-49
Identities = 40/140 (28%), Positives = 75/140 (53%), Gaps = 1/140 (0%)

Query: 5 EILKHLQADAIVLFMKVHNFHWNVKGTDFFNVHKATEEIYEEFADMFDDLAERIVQLGHH 64
L ++ +L+ K+H FHW VKG FF +H+ EE+Y+ A+ D +AER++ +G
Sbjct: 15 NSLNTQLSNWFLLYSKLHRFHWYVKGPHFFTLHEKFEELYDHAAETVDTIAERLLAIGGQ 74

Query: 65 PLVTLSEAIKLTRIKEETKTSFHSKDIFKEILEDYKHLEKEFKTLSNTAEKEGDKVTVTY 124
P+ T+ E + I + + + ++ + ++ DYK + E K + AE+ D T
Sbjct: 75 PVATVKEYTEHASITDGGNET-SASEMVQALVNDYKQISSESKFVIGLAEENQDNATADL 133

Query: 125 ADDQLAKLQKSIWMLQAHLA 144
+ +++K +WML ++L
Sbjct: 134 FVGLIEEVEKQVWMLSSYLG 153


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HPAKL86_02195  PF06580  30     0.015    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.8 bits (67), Expect = 0.015
Identities = 10/71 (14%), Positives = 25/71 (35%), Gaps = 13/71 (18%)

Query: 281 IVLQNFLYNAIDAIEALEESEQ-GQVKIEAFIQNEFIVFTIIDNGKEVENKSALFEPFET 339
+++Q + N I + + Q G++ ++ N + + + G +
Sbjct: 258 MLVQTLVENGI--KHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTK------- 308

Query: 340 TKLKGNGLGLA 350
+ G GL
Sbjct: 309 ---ESTGTGLQ 316


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HPAKL86_02205  FLGPRINGFLGI  356    e-124    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 356 bits (915), Expect = e-124
Identities = 118/345 (34%), Positives = 193/345 (55%), Gaps = 26/345 (7%)

Query: 19 AEKIGDIASVVGVRDNQLIGYGLVIGLNGTGDK-SGSKFTMQSISNMLESVNVKISADDI 77
+I DIAS+ RDNQLIGYGLV+GL GTGD S FT QS+ ML+++ +
Sbjct: 28 TSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQSMRAMLQNLGITTQGGQS 87

Query: 78 KSKNVAAVMITASLPPFARQGDKIDIQISSIGDAKSIQGGTLVMTPLNAVDGNIYALAQG 137
+KN+AAVM+TA+LPPFA G ++D+ +SS+GDA S++GG L+MT L+ DG IYA+AQG
Sbjct: 88 NAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNLIMTSLSGADGQIYAVAQG 147

Query: 138 AITSGNSS-----------NLLSANIINGATIEREVSYDLFHKNAMVLSLKSPNFKNAIQ 186
A+ S SA + NGA IERE+ +VL L++P+F A++
Sbjct: 148 ALIVNGFSAQGDAATLTQGVTTSARVPNGAIIERELPSKFKDSVNLVLQLRNPDFSTAVR 207

Query: 187 VQNTLNKV----FGNKVAIALDPKTIQITRPERFSMVEFLALVQEIPINYSAKNKIIVDE 242
V + +N +G+ +A D + I + +P + +A ++ + + K++++E
Sbjct: 208 VADVVNAFARARYGDPIAEPRDSQEIAVQKPRVADLTRLMAEIENLTVETDTPAKVVINE 267

Query: 243 KSGTIVSGVDIMVHPIVVTSQDITLKITKEPLDN--------SKSAQDLDNNMSLDTAHN 294
++GTIV G D+ + + V+ +T+++T+ P + Q + M++
Sbjct: 268 RTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQPAPFSRGQTAVQPQTDIMAMQEGSK 327

Query: 295 TLSSNGKNITIAGVVKALQKIGVSAKGMVSILQALKKSGAISAEM 339
G ++ +V L IG+ A G+++ILQ +K +GA+ AE+
Sbjct: 328 VAIVEGPDLR--TLVAGLNSIGLKADGIIAILQGIKSAGALQAEL 370


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
HPAKL86_02225  HTHFIS  31     0.009    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 31.3 bits (71), Expect = 0.009
Identities = 9/24 (37%), Positives = 13/24 (54%)

Query: 30 VAIVGESGSGKSSIANIIMRLNPR 53
+ I GESG+GK +A + R
Sbjct: 163 LMITGESGTGKELVARALHDYGKR 186


18  HPAKL86_03465  HPAKL86_03500  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HPAKL86_03465  1       13       -1.288676  guanylate kinase
HPAKL86_03470  2       11       -1.417840  poly E-rich protein
HPAKL86_03475  -1      13       -2.357427  nuclease NucT
HPAKL86_03480  1       12       -2.484651  outer membrane protein 13
HPAKL86_03485  2       14       -2.682037  flagellar basal body L-ring protein
HPAKL86_03490  3       12       -2.077858  CMP-N-acetylneuraminic acid synthetase
HPAKL86_03495  3       11       -1.414965  CMP-N-acetylneuraminic acid synthetase
HPAKL86_03500  2       13       -1.415007  flagellar biosynthesis protein G
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HPAKL86_03465  PF05272  29     0.011    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.3 bits (65), Expect = 0.011
Identities = 9/18 (50%), Positives = 11/18 (61%)

Query: 8 LILSGPSGAGKSTLTKYL 25
++L G G GKSTL L
Sbjct: 599 VVLEGTGGIGKSTLINTL 616


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
HPAKL86_03470  IGASERPTASE  68     4e-14    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 68.2 bits (166), Expect = 4e-14
Identities = 50/246 (20%), Positives = 91/246 (36%), Gaps = 23/246 (9%)

Query: 117 QPSESDPNPTDPLEPAQETLETNWDELENLGDLETLAQEEPNNEEQLLPTLNDQEEKEEA 176
+ + D P P PA + T + + +T+ + E + E T ++E +EA
Sbjct: 1016 EIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATE---TTAQNREVAKEA 1072

Query: 177 KEEIKETPQEEEKPKEEMQEQAKEQEPIKEETQEEIKEETQEEIKEETQEEIKEETQEEL 236
K +K Q E + ET+E ET+E E +E+ K ET++
Sbjct: 1073 KSNVKANTQTNEVAQ------------SGSETKETQTTETKETATVEKEEKAKVETEKTQ 1120

Query: 237 EIPKEETQEQAKEQELEAMQELVKEIQENSNNQADKEKTQENAKAFQETQAQELEKQELE 296
E+PK +Q K+++ E +Q + +EN KE + Q + +E
Sbjct: 1121 EVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVE 1180

Query: 297 IPQESTETPQEKEKQALEIPQEEKQENAETPQESTEIPQEKTQKLETQEDHYESIEDIPE 356
+ T +E P+ + P ++E + + H S+ +P
Sbjct: 1181 -QPVTESTTVNTGNSVVENPENTTPATTQ-PTVNSESSNKPKNR------HRRSVRSVPH 1232

Query: 357 PVMAQA 362
V
Sbjct: 1233 NVEPAT 1238



Score = 58.5 bits (141), Expect = 3e-11
Identities = 54/277 (19%), Positives = 99/277 (35%), Gaps = 21/277 (7%)

Query: 142 ELENLGDLETLAQEEPNNEEQLLPTLNDQEEKEEAK-EEIKETPQEEEKPKEEMQ---EQ 197
E+E N Q +E A+ +E P P E + E
Sbjct: 984 EVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAEN 1043

Query: 198 AKEQEPIKEETQEEIKEET--QEEIKEETQEEIKEETQEELEIPKEETQEQAKEQELEAM 255
+K++ E+ +++ E T E+ +E + +K TQ E+ + E + Q E
Sbjct: 1044 SKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQT-NEVAQSG-SETKETQTTETK 1101

Query: 256 QELVKEIQENSNNQADKEKTQENAKAFQETQAQELEKQELEIPQESTETPQEKEKQALEI 315
+ E +E + + + EKTQE K + ++ E+ E PQ + E
Sbjct: 1102 ETATVEKEEKA--KVETEKTQEVPKVTSQVSPKQ-EQSETVQPQAEPARENDPTVNIKE- 1157

Query: 316 PQEEKQENAETPQESTEIPQEKT----QKLETQEDHYESIEDIPEPVMAQAMGEALPFLN 371
PQ + A+T Q P ++T ++ T+ + + E P +N
Sbjct: 1158 PQSQTNTTADTEQ-----PAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVN 1212

Query: 372 ESVAKTPNNENDTEIPKESVIKTPQEKEGNDKTSSPL 408
+ P N + + P ND+++ L
Sbjct: 1213 SESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVAL 1249



Score = 51.6 bits (123), Expect = 5e-09
Identities = 34/191 (17%), Positives = 61/191 (31%), Gaps = 10/191 (5%)

Query: 149 LETLAQEEPNNEEQLLPTLNDQEEKEEAKEEIKETPQ--EEEKPKEEMQEQAKE-----Q 201
+ N + + E KE E KET +EEK K E ++ + Q
Sbjct: 1069 AKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQ 1128

Query: 202 EPIKEETQEEIKEETQEEIKEETQEEIKEETQEELEIPKEETQEQAKEQELEAMQELVKE 261
K+E E ++ + + + + IKE + +T++ AKE Q + +
Sbjct: 1129 VSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTA--DTEQPAKETSSNVEQPVTES 1186

Query: 262 IQENSNNQADKEKTQENAKAFQETQAQELEKQELEIPQESTET-PQEKEKQALEIPQEEK 320
N+ N + Q T E + + S + P E
Sbjct: 1187 TTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRST 1246

Query: 321 QENAETPQEST 331
+ +T
Sbjct: 1247 VALCDLTSTNT 1257



Score = 34.7 bits (79), Expect = 9e-04
Identities = 41/271 (15%), Positives = 83/271 (30%), Gaps = 25/271 (9%)

Query: 67 KEIVSQNKNSVCMYKKGNE-AQPFLEGFEMKIKKPFLPTEMLKVLQKKLGFQPSESDPNP 125
+E+ + K++V + NE AQ E E + + + K + K+ + ++ P
Sbjct: 1066 REVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKV 1125

Query: 126 TDPLEPAQETLETNWDELENLGDLETLAQEEPNNEEQLLPTLN-----DQEEKEEAKEEI 180
T + P QE ET Q EP E PT+N Q E+
Sbjct: 1126 TSQVSPKQEQSET------------VQPQAEPAREND--PTVNIKEPQSQTNTTADTEQP 1171

Query: 181 KETPQEEEKPKEEMQEQAKEQEPIKEETQEEIKEETQEEIKEETQEEIKEETQEELEIPK 240
+ + + E + TQ + E+ + K + +
Sbjct: 1172 AKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVP 1231

Query: 241 EETQEQAKEQELEAMQELVKEIQENSNNQAD--KEKTQENAKAFQETQAQELEKQELEIP 298
+ + L N+N + K Q A + +Q + + E+
Sbjct: 1232 HNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVALNVGKAVSQHISQLEMNNE 1291

Query: 299 QES---TETPQEKEKQALEIPQEEKQENAET 326
+ + + + ++ +T
Sbjct: 1292 GQYNVWVSNTSMNKNYSSSQYRRFSSKSTQT 1322


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HPAKL86_03485  FLGLRINGFLGH  193    3e-64    Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 193 bits (492), Expect = 3e-64
Identities = 52/172 (30%), Positives = 84/172 (48%), Gaps = 18/172 (10%)

Query: 56 GERPLFADRRAMKPNDLITIIVSEKASANYSSS----KDYKSASGGNSTPPRLTYNGLDE 111
G +PLF DRR D +TI++ E SA+ SSS +D K+ G ++ P L GL
Sbjct: 61 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYL--QGLFG 118

Query: 112 RKKQEAEYLDDKNNYNFTKSSNNTNFKGGGSQKKSEDLEIVLSARIIKVLENGNYFIYGN 171
+ + E S F G G S L+ + +VL NGN + G
Sbjct: 119 NARADVEA------------SGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGE 166

Query: 172 KEVLVDGEKQILKVSGVIRPYDIERNNTIQSKFLADAKIEYTNLGHLSDSNK 223
K++ ++ + ++ SGV+ P I +NT+ S +ADA+IEY G+++++
Sbjct: 167 KQIAINQGTEFIRFSGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQN 218


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HPAKL86_03500  SACTRNSFRASE  28     0.022    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 27.6 bits (61), Expect = 0.022
Identities = 15/49 (30%), Positives = 22/49 (44%), Gaps = 3/49 (6%)

Query: 102 RGETILKALEYIAFE---EFQLNSLHLEVMENNFKAIAFYEKNHYELEG 147
R + + AL + A E E L LE + N A FY K+H+ +
Sbjct: 102 RKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGA 150


19  HPAKL86_03550  HPAKL86_03610  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HPAKL86_03550  -2      11       0.906634   hypothetical protein
HPAKL86_03555  -2      9        0.850801   ribosomal large subunit pseudouridine synthase
HPAKL86_03560  -2      9        0.289358   single-stranded-DNA-specific exonuclease
HPAKL86_03565  -2      9        1.027705   CTP synthetase
HPAKL86_03570  -2      9        1.127095   hypothetical protein
HPAKL86_03575  -2      9        0.594780   flagellar MS-ring protein
HPAKL86_03580  -3      9        0.718141   flagellar motor switch protein G
HPAKL86_03585  -3      10       0.354679   flagellar assembly protein H
HPAKL86_03590  -2      9        0.564930   1-deoxy-D-xylulose-5-phosphate synthase
HPAKL86_03595  0       8        0.134250   GTP-binding protein LepA
HPAKL86_03600  -1      9        -0.573910  DNA-cytosine methyltransferase
HPAKL86_03605  0       8        -0.356109  hypothetical protein
HPAKL86_03610  0       9        0.660888   transcriptional activator of flagella proteins
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HPAKL86_03550  PREPILNPTASE  27     0.029    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family

signature.
Length = 290

Score = 27.5 bits (61), Expect = 0.029
Identities = 17/59 (28%), Positives = 25/59 (42%), Gaps = 8/59 (13%)

Query: 32 FVIVAWLFRF--KSIAFSILITLLVILVDIWVYSDVHQFLL-DTASSPILLLVALLIKW 87
V VA ++A +L +LV L I D+ + LL D + P+L LL
Sbjct: 121 SVAVAMTLAPGWGTLAALLLTWVLVALTFI----DLDKMLLPDQLTLPLLWG-GLLFNL 174


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HPAKL86_03575  FLGMRINGFLIF  551    0.0      Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 551 bits (1422), Expect = 0.0
Identities = 178/582 (30%), Positives = 291/582 (50%), Gaps = 66/582 (11%)

Query: 11 VDFFIKLNKKQKIALIAAGVLITALLVFLLLYPFKEKDYAQGGYGVLFEGLDSSDNALIL 70
+++ +L +I LI AG A++V ++L+ K DY LF L D I+
Sbjct: 13 LEWLNRLRANPRIPLIVAGSAAVAIVVAMVLWA-KTPDYR-----TLFSNLSDQDGGAIV 66

Query: 71 QHLQQNQIPYKVSRDD-TILIPKDKVYEERITLASQGIPKTSKVGFEIFDTKDFGATDFD 129
L Q IPY+ + I +P DKV+E R+ LA QG+PK VGFE+ D + FG + F
Sbjct: 67 AQLTQMNIPYRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQFS 126

Query: 130 QNIKLIRAIEGELSRTIESLNPILKANVHIAIPKDSVFVAKEVPPSASVMLKLKPNMKLL 189
+ + RA+EGEL+RTIE+L P+ A VH+A+PK S+FV ++ PSASV + L+P L
Sbjct: 127 EQVNYQRALEGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALD 186

Query: 190 PAQILGIKNLIAAAVPKLTTENVKIVNENGEPLGEGDMLDNAKELALEQLHYKQNFENIL 249
QI + +L+++AV L NV +V+++G L + + + ++L QL + + E+ +
Sbjct: 187 EGQISAVVHLVSSAVAGLPPGNVTLVDQSGHLLTQSN--TSGRDLNDAQLKFANDVESRI 244

Query: 250 ENKIVNILAPIVGGKNKVVARVNAEFDFSQKKSTKETFDPNN-----VVRSEQNLEEKKE 304
+ +I IL+PIVG N V A+V A+ DF+ K+ T+E + PN +RS Q ++
Sbjct: 245 QRRIEAILSPIVGNGN-VHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQV 303

Query: 305 GAPKKQVGGVPGVVSN-IGPVQGLKDNKEQEKYEKSQN---------------------- 341
GA GGVPG +SN P + +QN
Sbjct: 304 GAGYP--GGVPGALSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNE 361

Query: 342 TTNYEVGKTISEIKGEFGTLVRLNAAVVVDGKYKIALKDGTNALEYEPLSDESLKKINAL 401
T+NYEV +TI K G + RL+ AVVV+ K L DG PL+ + +K+I L
Sbjct: 362 TSNYEVDRTIRHTKMNVGDIERLSVAVVVNYK---TLADGK----PLPLTADQMKQIEDL 414

Query: 402 VKQAIGYNQNRGDDVAVSNFEFNPITPMLDNATWSEKIMHKTQKILGSFTPLIKYILVFV 461
++A+G++ RGD + V N F+ + E + Q + +++LV V
Sbjct: 415 TREAMGFSDKRGDTLNVVNSPFSAVDNTGG-----ELPFWQQQSFIDQLLAAGRWLLVLV 469

Query: 462 VLFIFYKKVIVPFSERMLEVVPDEDKEVKSMFEEMDEEEDEMNKLGDLKKKVEDQLGLNA 521
V +I ++K + P R +E ++ + E + E ++K L+++ +Q
Sbjct: 470 VAWILWRKAVRPQLTRRVEEAKAAQEQAQVRQETEEAVEVRLSKDEQLQQRRANQ----- 524

Query: 522 TFSEEEVRYEIVLEKIRGTLKERPDEIAMLFKLLIKDEISSD 563
+ E++ ++IR E D + L+I+ +S+D
Sbjct: 525 -----RLGAEVMSQRIR----EMSDNDPRVVALVIRQWMSND 557


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HPAKL86_03580  FLGMOTORFLIG  350    e-122    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 350 bits (900), Expect = e-122
Identities = 122/338 (36%), Positives = 209/338 (61%), Gaps = 4/338 (1%)

Query: 8 KQKAQLDELSMSEKIAILLIQVGEDTTGEILRHLDIDSITEISKQIVQLNGTDKQIGAAV 67
K+ + L+ +K AILL+ +G + + ++ ++L + I ++ +I +L ++ V
Sbjct: 7 KEILDVSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNV 66

Query: 68 LEEFFAIFQSNQYINTGGLEYARELLTRTLGSEEARKVMDKLTKSLQTQKNFAYLGKIKP 127
L EF + + ++I GG++YARELL ++LG+++A +++ L +LQ+ + F ++ + P
Sbjct: 67 LLEFKELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQS-RPFEFVRRADP 125

Query: 128 QQLADFIINEHPQTIALILAHMEAPNAAETLSYFPDEMKAEISIRMANLGDISPQVVKRV 187
+ +FI EHPQTIALIL++++ A+ LS P E++ ++ R+A + SP+VV+ V
Sbjct: 126 ANILNFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREV 185

Query: 188 STVLENKLESLTSYK-IEVGGLRAVAEIFNRLGQKSAKTTLARIESVDNKLAGAIKEMMF 246
VLE KL SL+S GG+ V EI N +K+ K + +E D +LA IK+ MF
Sbjct: 186 ERVLEKKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMF 245

Query: 247 TFEDIVKLDNFAIREILKVADKKDLSLALKTSTQDLTDKFLNNMSSRAAEQFVEEMQYLG 306
FEDIV LD+ +I+ +L+ D ++L+ ALK+ + +K NMS RAA E+M++LG
Sbjct: 246 VFEDIVLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLG 305

Query: 307 AVKIKDVDVAQRKIIEIVQSLQEKG--VIQTGEEEDVI 342
+ KDV+ +Q+KI+ +++ L+E+G VI G EEDV+
Sbjct: 306 PTRRKDVEESQQKIVSLIRKLEEQGEIVISRGGEEDVL 343



Score = 31.3 bits (71), Expect = 0.005
Identities = 20/103 (19%), Positives = 41/103 (39%), Gaps = 3/103 (2%)

Query: 4 KLTPKQKAQLDELSMSEKIAILLIQVGEDTTGEILRHLDIDSITEISKQIVQLNGTDKQI 63
+ P + + IA++L + IL L + T ++++I ++ T ++
Sbjct: 122 RADPANILNFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEV 181

Query: 64 GAA---VLEEFFAIFQSNQYINTGGLEYARELLTRTLGSEEAR 103
VLE+ A S Y + GG++ E++ E
Sbjct: 182 VREVERVLEKKLASLSSEDYTSAGGVDNVVEIINMADRKTEKF 224


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HPAKL86_03585  FLGFLIH  37     5e-05    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 36.7 bits (84), Expect = 5e-05
Identities = 46/212 (21%), Positives = 94/212 (44%), Gaps = 17/212 (8%)

Query: 45 PNPEEPLEKKAIENDLIDCLLKKTDELSSHLVKLQMQFEKAQEES-KALIENAKNDGYKI 103
P E + E +I+ + L L +LQMQ A E+ +A I + G+K
Sbjct: 17 PPQAEFVPIVEPEETIIE---EAEPSLEQQLAQLQMQ---AHEQGYQAGIAEGRQQGHKQ 70

Query: 104 GFKEGEEKMRNELTHSVNEEKNQLLHAITALDEKMKKSQDHLMTLE----KELSAIAIDI 159
G++EG + L + E K+Q + + + + Q L L+ L +A++
Sbjct: 71 GYQEG---LAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRLMQMALEA 127

Query: 160 AKEVILKEVEDNSQKVALALAEELLKNVLDATDIHLKVNPLDYPYLNERLQNASKI---K 216
A++VI + ++ + + + L + L + L+V+P D +++ L + +
Sbjct: 128 ARQVIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGATLSLHGWR 187

Query: 217 LESNEAISKGGVMITSSNGSLDGNLMERFKTL 248
L + + GG +++ G LD ++ R++ L
Sbjct: 188 LRGDPTLHPGGCKVSADEGDLDASVATRWQEL 219


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
HPAKL86_03595  TCRTETOQM  141    1e-37    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 141 bits (356), Expect = 1e-37
Identities = 99/437 (22%), Positives = 174/437 (39%), Gaps = 85/437 (19%)

Query: 3 NIRNFSIIAHIDHGKSTLADCLIAECNAIS---NREMTSQVMDTMDIEKERGITIKAQSV 59
I N ++AH+D GK+TL + L+ AI+ + + + D +E++RGITI+
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61

Query: 60 RLNYTLKGEDYVLNLIDTPGHVDFSYEVSRSLCSCEGALLVVDATQGVEAQTIANVYIAL 119
+ E+ +N+IDTPGH+DF EV RSL +GA+L++ A GV+AQT +
Sbjct: 62 SFQW----ENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALR 117

Query: 120 DNNLEILPVINKIDLPNANVVEVKQDIEDTIGIDCSGANEVSAKARLGIKD--------- 170
+ + INKID ++ V QDI++ + + +V + + +
Sbjct: 118 KMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDT 177

Query: 171 -------LLEKIITTIPAPSGDFNAPLKALIYD-------------------------SW 198
LLEK ++ + + ++ +
Sbjct: 178 VIEGNDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITNK 237

Query: 199 F--------------------DNYLGALALVRIMDGSINTEQEILVMGTGKKHGVLGLYY 238
F LA +R+ G ++ + + K + +Y
Sbjct: 238 FYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKI-KITEMYT 296

Query: 239 PNPLKKIPTKSLECGEIGIV---SLGLKSVTDIAVGDTLTDAKNPTPKPIEGFMPAKPFV 295
+ GEI I+ L L SV +GDT P + IE P +
Sbjct: 297 SINGELCKIDKAYSGEIVILQNEFLKLNSV----LGDTKLL---PQRERIEN---PLPLL 346

Query: 296 FAGLYPIETDRFEDLREALLKLQLNDCALNFEPESSVALGFGFRVGFLGLLHMEVIKERL 355
+ P + + E L +ALL++ +D L + +S+ + FLG + MEV L
Sbjct: 347 QTTVEPSKPQQREMLLDALLEISDSDPLLRYYVDSATH---EIILSFLGKVQMEVTCALL 403

Query: 356 EREFSLNLIATAPTVVY 372
+ ++ + + PTV+Y
Sbjct: 404 QEKYHVEIEIKEPTVIY 420



Score = 31.0 bits (70), Expect = 0.015
Identities = 15/75 (20%), Positives = 28/75 (37%), Gaps = 2/75 (2%)

Query: 399 IKEPFVRATIITPSEFLGNLMQLLNNKRGIQEKMEYLNQSRVMLTYSLPSNEIVMDFYDK 458
+ EP++ I P E+L + L + V+L+ +P+ I ++
Sbjct: 535 LLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQ-LKNNEVILSGEIPARCI-QEYRSD 592

Query: 459 LKSCTKGYASFDYEP 473
L T G + E
Sbjct: 593 LTFFTNGRSVCLTEL 607


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
HPAKL86_03610  HTHFIS  394    e-137    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 394 bits (1013), Expect = e-137
Identities = 128/384 (33%), Positives = 198/384 (51%), Gaps = 9/384 (2%)

Query: 2 KIAIVEDDINMRKSLELFFELQDDLEIVSFKNPKDALDKL-DESFDLVITDINMPHMDGL 60
I + +DD +R L + ++ N + DLV+TD+ MP +
Sbjct: 5 TILVADDDAAIRTVLNQALS-RAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 61 EFLRLLEGKYES---IVITGNATLNKAIDSIRLGVKDFFQKPFKPELLLESIYRTKKVLE 117
+ L ++ +V++ T AI + G D+ KPF E I + L
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLT---ELIGIIGRALA 120

Query: 118 FQKKHPLEKPLKKPHKHSFLAASKALEESKRQALKVASTDANVMLLGESGVGKEVFAHFI 177
K+ P + + S A++E R ++ TD +M+ GESG GKE+ A +
Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARAL 180

Query: 178 HQHSQRSKHPFIAINMSAIPEHLLESELFGYQKGAFTDATAPKMGLFESAHKGTIFLDEI 237
H + +R PF+AINM+AIP L+ESELFG++KGAFT A G FE A GT+FLDEI
Sbjct: 181 HDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEI 240

Query: 238 AEMPIQLQSKLLRVVQEKEITRLGDNKSVKIDVRFISATNANMKEKIAAKEFREDLFFRL 297
+MP+ Q++LLRV+Q+ E T +G ++ DVR ++ATN ++K+ I FREDL++RL
Sbjct: 241 GDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRL 300

Query: 298 QIVPIVIAPLRERVEEILPIAEIKLKEVCDAYHLGPKSFSKNATKRLLEYSWHGNVRELL 357
+VP+ + PLR+R E+I + +++ L K F + A + + + W GNVREL
Sbjct: 301 NVVPLRLPPLRDRAEDIPDLVRHFVQQAE-KEGLDVKRFDQEALELMKAHPWPGNVRELE 359

Query: 358 GVVERAAILSEGSEIQEKDLFLER 381
+V R L I + + E
Sbjct: 360 NLVRRLTALYPQDVITREIIENEL 383


20  HPAKL86_03900  HPAKL86_03935  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HPAKL86_03900  -1      10       -0.725158  ATP-dependent protease subunit HslV
HPAKL86_03905  -2      9        -0.225137  ATP-dependent protease ATP-binding subunit HslU
HPAKL86_03910  -1      12       -0.063757  GTPase Era
HPAKL86_03915  -2      11       0.413165   hypothetical protein
HPAKL86_03920  -2      11       0.747479   hypothetical protein
HPAKL86_03925  -2      11       0.445108   glutamate racemase
HPAKL86_03930  -2      12       -0.413771  transcription termination factor Rho
HPAKL86_03935  1       16       -1.907386  50S ribosomal protein L31
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HPAKL86_03900  PF07520  29     0.010    Virulence protein SrfB
		>PF07520#Virulence protein SrfB

Length = 1041

Score = 29.2 bits (65), Expect = 0.010
Identities = 14/49 (28%), Positives = 23/49 (46%), Gaps = 4/49 (8%)

Query: 121 LEAEDNKIAAIGSGG---NFALSAARALDNFAHLEPRKLVEESLKIAGD 166
E+ ++A I GG + ++ R DN L P + E ++AGD
Sbjct: 590 GESPSLRLACIDVGGGTTDLMVTTYRGEDNRV-LHPEQTFREGFRVAGD 637


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
HPAKL86_03905  HTHFIS  29     0.045    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.0 bits (65), Expect = 0.045
Identities = 11/36 (30%), Positives = 18/36 (50%), Gaps = 3/36 (8%)

Query: 48 TPKNILMIGSTGVGKTEIARRI---AKIMKLPFVKV 80
T +++ G +G GK +AR + K PFV +
Sbjct: 159 TDLTLMITGESGTGKELVARALHDYGKRRNGPFVAI 194


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HPAKL86_03910  PF03944  32     0.002    delta endotoxin
		>PF03944#delta endotoxin

Length = 633

Score = 32.3 bits (73), Expect = 0.002
Identities = 25/94 (26%), Positives = 47/94 (50%), Gaps = 3/94 (3%)

Query: 68 LHHQEKLLNQCMLSQALKAMGDAELCVFLASVHDDLKGYEEFLSLCQKPHILAVSKIDTA 127
L E+ LNQ + + + A +AEL A+V + + + FL+ + L+++
Sbjct: 94 LRETERFLNQRLNTDTV-ARVNAELTGLQANVEEFNRQVDNFLNPNRNAVPLSITSSVNT 152

Query: 128 THKQVLQKLQEYQQYASQFLALVPLSAKKSQNLN 161
+ L +L ++Q Q L L+PL A+ + NL+
Sbjct: 153 MQQLFLNRLPQFQMQGYQLL-LLPLFAQAA-NLH 184


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HPAKL86_03935  PF01206  27     0.004    SirA family protein
		>PF01206#SirA family protein

Length = 76

Score = 26.6 bits (59), Expect = 0.004
Identities = 7/22 (31%), Positives = 12/22 (54%)

Query: 19 SGKEIEVLSTKPEMRIDISSFC 40
+G+ + V++T P D SF
Sbjct: 31 AGEVLYVMATDPGSVKDFESFS 52


21  HPAKL86_05645  HPAKL86_05685  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HPAKL86_05645  -1      12       0.084198   hypothetical protein
HPAKL86_05650  1       13       0.805900   UDP-2,3-diacylglucosamine hydrolase
HPAKL86_05655  1       14       1.674970   CheA-MCP interaction modulator
HPAKL86_05660  4       14       3.161579   autophosphorylating histidine kinase
HPAKL86_05665  3       14       1.674535   CheA-MCP coupling protein
HPAKL86_05670  2       17       0.331799   hypothetical protein
HPAKL86_05675  0       15       -0.451432  hypothetical protein
HPAKL86_05680  0       14       -0.518393  urease subunit alpha
HPAKL86_05685  -2      13       -0.310214  urease subunit beta
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
HPAKL86_05645  ALARACEMASE  32     0.001    Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 32.1 bits (73), Expect = 0.001
Identities = 9/43 (20%), Positives = 16/43 (37%), Gaps = 1/43 (2%)

Query: 136 GVMPEETLEIYSQISETCKRLKLKGLMCIGAHADDEKEIEKSF 178
G P+ L ++ Q+ + LM A A+ I +
Sbjct: 132 GFQPDRVLTVWQQL-RAMANVGEMTLMSHFAEAEHPDGISGAM 173


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
HPAKL86_05655  HTHFIS  61     2e-12    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 60.6 bits (147), Expect = 2e-12
Identities = 30/129 (23%), Positives = 50/129 (38%), Gaps = 13/129 (10%)

Query: 181 GEVLFLDDSKTARKTLKNHLSKLGFSITEAVDGEDGLNKLEMLFKKYGDNLRKHLKFIIS 240
+L DD R L LS+ G+ + + + +++
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIA----------AGDGDLVVT 53

Query: 241 DVEMPKMDGYHFLFKLQKDPRFAYIPVIFNSSICDNYSAEKAKEMGAVAYLVK-FDAEKF 299
DV MP + + L +++K +PV+ S+ +A KA E GA YL K FD +
Sbjct: 54 DVVMPDENAFDLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTEL 111

Query: 300 TEEISKILD 308
I + L
Sbjct: 112 IGIIGRALA 120


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
HPAKL86_05660  HTHFIS  58     1e-10    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 57.5 bits (139), Expect = 1e-10
Identities = 22/121 (18%), Positives = 55/121 (45%), Gaps = 4/121 (3%)

Query: 683 VLAIDDSSTDRAIIRKCLKPLGITLLEATNGLEGLEMLKNGDKTPDAILVDIEMPKMDGY 742
+L DD + R ++ + L G + +N + G D ++ D+ MP + +
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAG--DGDLVVTDVVMPDENAF 63

Query: 743 TFASEVRKYNKFKNMPLIAVTSRVTKTDRMRGVESGMTEYITKPYSSEYLMSVVKRSIKL 802
++K ++P++ ++++ T ++ E G +Y+ KP+ L+ ++ R++
Sbjct: 64 DLLPRIKKAR--PDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 803 E 803

Sbjct: 122 P 122


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HPAKL86_05675  PF07132  34     6e-05    Harpin protein (HrpN)
		>PF07132#Harpin protein (HrpN)

Length = 356

Score = 33.9 bits (77), Expect = 6e-05
Identities = 19/46 (41%), Positives = 32/46 (69%)

Query: 36 FLGGAVGAGMGGAMGGMIGALGGPWNAVVGAGIGGGIGAYSGADIG 81
F+G +G G+GG +GG+ +LGG ++G G+GGG+G+ G+ +G
Sbjct: 60 FMGSMMGGGLGGGLGGLGSSLGGLGGGLLGGGLGGGLGSSLGSGLG 105



Score = 27.0 bits (59), Expect = 0.015
Identities = 16/49 (32%), Positives = 26/49 (53%)

Query: 34 GRFLGGAVGAGMGGAMGGMIGALGGPWNAVVGAGIGGGIGAYSGADIGD 82
G +GG +G G+GG + G GG +G G+G +G+ G+ +G
Sbjct: 62 GSMMGGGLGGGLGGLGSSLGGLGGGLLGGGLGGGLGSSLGSGLGSALGG 110


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
HPAKL86_05685  UREASE  988    0.0      Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 988 bits (2556), Expect = 0.0
Identities = 340/570 (59%), Positives = 435/570 (76%), Gaps = 5/570 (0%)

Query: 2 KMKKLDYVNTYGPTKGDKVRLGDTEIWAEVEHDYTIYGEELKFGAGKTIREGMGQSN-SH 60
+M + Y N +GPT GDKVRL DTE++ EVE D+T +GEE+KFG GK IR+GMGQS +
Sbjct: 4 RMSRAAYANMFGPTVGDKVRLADTELFIEVEKDFTTHGEEVKFGGGKVIRDGMGQSQVTR 63

Query: 61 DENTLDLVITNALIIDYTGIYKADIGIKNGKIAGIGKAGNKDMQDGVSPNLVVGVGTEAL 120
+ +D VITNALI+D+ GI KADIG+K+G+IA IGKAGN DMQ GV ++VG GTE +
Sbjct: 64 EGGAVDTVITNALILDHWGIVKADIGLKDGRIAAIGKAGNPDMQPGV--TIIVGPGTEVI 121

Query: 121 AGEGMIVTAGGIDSHTHFLSPQQFPTALANGVTTMFGGGTGPVDGTNATTITPGEWNIHR 180
AGEG IVTAGG+DSH HF+ PQQ AL +G+T M GGGTGP GT ATT TPG W+I R
Sbjct: 122 AGEGKIVTAGGMDSHIHFICPQQIEEALMSGLTCMLGGGTGPAHGTLATTCTPGPWHIAR 181

Query: 181 MLRAAEEYAMNVGFLGKGNSSSKTQLVEQIEAGVVGFKLHEDWGTTPSAIDTCLSVADEY 240
M+ AA+ + MN+ F GKGN+S LVE + G KLHEDWGTTP+AID CLSVADEY
Sbjct: 182 MIEAADAFPMNLAFAGKGNASLPGALVEMVLGGATSLKLHEDWGTTPAAIDCCLSVADEY 241

Query: 241 DVQVCIHTDTVNEAGYVEDTLNAMNGRAIHAYHIEGAGGGHSPDVITMAGEENILPSSTT 300
DVQV IHTDT+NE+G+VEDT+ A+ GR IHAYH EGAGGGH+PD+I + G+ N++PSST
Sbjct: 242 DVQVMIHTDTLNESGFVEDTIAAIKGRTIHAYHTEGAGGGHAPDIIRICGQPNVIPSSTN 301

Query: 301 PTIPYTINTVAEHLDMLMTCHHLDKKIREDLQFSQSRIRPGSIAAEDVLHDNGMIAMTSS 360
PT PYT+NT+AEHLDMLM CHHL I ED+ F++SRIR +IAAED+LHD G ++ SS
Sbjct: 302 PTRPYTVNTLAEHLDMLMVCHHLSPTIPEDIAFAESRIRKETIAAEDILHDIGAFSIISS 361

Query: 361 DSQAMGRAGEVVPRTWQTADKNKKEFGPLKEDAQNGNDNFRIKRYISKYTINPAITHGVS 420
DSQAMGR GEV RTWQTADK K++ G LKE+ NDNFR+KRYI+KYTINPAI HG+S
Sbjct: 362 DSQAMGRVGEVAIRTWQTADKMKRQRGRLKEET-GDNDNFRVKRYIAKYTINPAIAHGLS 420

Query: 421 EYIGSVEAGKIADLVVWNPAFFGVKPKIIIKGGLVVFSEMGDSNASVPTPQPVYYREMFG 480
IGS+E GK ADLV+WNPAFFGVKP +++ GG + + MGD NAS+PTPQPV+YR MFG
Sbjct: 421 HEIGSLEVGKRADLVLWNPAFFGVKPDMVLLGGTIAAAPMGDPNASIPTPQPVHYRPMFG 480

Query: 481 HHGKAKFDTSITFVNKLAYEKGIKEKLGLERQVLPIKNVR-NITKKDFKFNNTTGKLTVD 539
+G+++ ++S+TFV++ + + G+ +LG+ ++++ ++N R I K N+ T + VD
Sbjct: 481 AYGRSRTNSSVTFVSQASLDAGLAGRLGVAKELVAVQNTRGGIGKASMIHNSLTPHIEVD 540

Query: 540 PKTFEVFLDGKLCTSKPASELPLAQRYTFF 569
P+T+EV DG+L T +PA+ LP+AQRY F
Sbjct: 541 PETYEVRADGELLTCEPATVLPMAQRYFLF 570
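The percentages printed next to the Identities, Positives and Gaps counts appear to be truncated rather than rounded: for the urease hit above, 340/570 is about 59.6% but is reported as 59%, and 5/570 is reported as 0%. A small sketch of that arithmetic, assuming integer truncation (the helper name blast_percent is hypothetical):

```python
def blast_percent(matched, length):
    """Integer percentage, truncated toward zero as the report appears to do."""
    if length <= 0:
        raise ValueError("alignment length must be positive")
    return (100 * matched) // length


if __name__ == "__main__":
    # Counts taken from the urease alignment statistics line.
    for label, matched in (("Identities", 340), ("Positives", 435), ("Gaps", 5)):
        print("%s = %d/570 (%d%%)" % (label, matched, blast_percent(matched, 570)))
```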


22 | HPAKL86_07090 | HPAKL86_07125 | N | Y | N | Pathogenicity Island (unbiased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
HPAKL86_07090 | 1 | 17 | 1.942305 | putative nicotinate-nucleotide
HPAKL86_07095 | 2 | 14 | 3.083123 | nickel responsive regulator
HPAKL86_07100 |  |  |  | biopolymer transport protein
HPAKL86_07105 |  |  |  | biopolymer transport protein ExbD
HPAKL86_07110 |  |  |  | siderophore-mediated iron transport protein
HPAKL86_07115 |  |  |  | hypothetical protein
HPAKL86_07120 |  |  |  | flagellar basal body rod protein FlgG
HPAKL86_07125 |  |  |  | hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
HPAKL86_07090 | LPSBIOSNTHSS | 46 | 7e-09 | Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature.

Length = 166

Score = 46.0 bits (109), Expect = 7e-09
Identities = 23/71 (32%), Positives = 39/71 (54%), Gaps = 4/71 (5%)

Query: 11 ALYGGSFDPLHKAHLAIIDQTLELLPFAKLIVLPAYQNPFKKPCFLDAKTRFKELERALK 70
A+Y GSFDP+ HL II++ L F ++ V +NP K+P F + R +++ +A+
Sbjct: 3 AIYPGSFDPITFGHLDIIERGCRL--FDQVYVAVL-RNPNKQPMF-SVQERLEQIAKAIA 58

Query: 71 GMDRVLLSDFE 81
+ + FE
Sbjct: 59 HLPNAQVDSFE 69


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
HPAKL86_07110 | TONBPROTEIN | 82 | 2e-20 | Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 81.6 bits (201), Expect = 2e-20
Identities = 47/189 (24%), Positives = 82/189 (43%), Gaps = 29/189 (15%)

Query: 74 DEPKKEAPKKEKKEITKPKPKPKPKPKPKPKPKKEVTKPKPDPKITPKPEPKPEPPKEEP 133
D +A + + + +P+P+P+P P+P + + KPKP PK PKP K +
Sbjct: 54 DLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQ------ 107

Query: 134 KEEPKEEVEKEEPKKEEKAKEESTPKEVTTKDAVKDPNKQEESNKTSEGATSEAQAYNPG 193
E+PK +V+ E + E + P +T+ A + TS+ TS A
Sbjct: 108 -EQPKRDVKPVESR-PASPFENTAPARLTSSTA---------TAATSKPVTSVASGPRAL 156

Query: 194 VSNEFLMQIQTAISAKNRYPKMAQVRGIEGEVLVSFIINTDGSVTDIKVVKSNTTDILNH 253
N+ +YP AQ IEG+V V F + DG V +++++ + ++
Sbjct: 157 SRNQ------------PQYPARAQALRIEGQVKVKFDVTPDGRVDNVQILSAKPANMFER 204

Query: 254 AALEAIKNA 262
A++
Sbjct: 205 EVKNAMRRW 213


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
HPAKL86_07120 | FLGHOOKAP1 | 58 | 7e-12 | Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 58.0 bits (140), Expect = 7e-12
Identities = 14/43 (32%), Positives = 26/43 (60%)

Query: 219 LELSNVRLVEEMTDLITAQRAYEANSKSIQTADAMLQTVNSLK 261
+S V L EE +L Q+ Y AN++ +QTA+A+ + +++
Sbjct: 504 QSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546



Score = 50.3 bits (120), Expect = 3e-09
Identities = 12/35 (34%), Positives = 21/35 (60%)

Query: 4 SLYSATSGMLAQQTHIDTTSNNIANVNTTGFKKSR 38
+ +A SG+ A Q ++T SNNI++ N G+ +
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQT 37


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
HPAKL86_07125 | PREPILNPTASE | 29 | 0.015 | Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 28.6 bits (64), Expect = 0.015
Identities = 15/40 (37%), Positives = 22/40 (55%), Gaps = 2/40 (5%)

Query: 1 MIYMLAVFFPWLAFLLRGRIFSAIFSFILWVILC-LPIIL 39
++ LA PWL F L +FS + L V++ LPI+L
Sbjct: 3 LLLELAHGLPWLYFSLVF-LFSLMIGSFLNVVIHRLPIML 41


Database: VIFASCDB
Posted date: Jun 1, 2014 9:04 PM
Number of letters in database: 79,683
Number of sequences in database: 213

Lambda K H
0.321 0.139 0.412

Gapped
Lambda K H
0.267 0.0533 0.140


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 213
Number of Hits to DB: 62,522,964
Number of extensions: 2847047
Number of successful extensions: 11565
Number of sequences better than 5.0e-02: 355
Number of HSP's gapped: 10807
Number of HSP's successfully gapped: 810
Length of database: 79,683
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 17 ( 7.9 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
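The Lambda and K values listed above relate the raw scores (shown in parentheses in each "Score =" line) to the reported bit scores. The sketch below assumes the standard Karlin-Altschul normalization, bits = (lambda * raw - ln K) / ln 2, and assumes the gapped parameters (0.267, 0.0533) are the ones that apply to these alignments; the function and constant names are illustrative. Because the printed Lambda is rounded to three digits, small discrepancies against the reported bit scores are to be expected, especially for large raw scores.

```python
import math

# Gapped Karlin-Altschul parameters as printed in the statistics footer.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0533


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Normalize a raw alignment score to bits: (lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


if __name__ == "__main__":
    # Raw scores taken from the harpin (HrpN) alignments in this report.
    for raw in (77, 59):
        print("raw %d -> %.1f bits" % (raw, bit_score(raw)))
```

With these parameters the raw scores 77 and 59 come out close to the 33.9 and 27.0 bits reported for the HPAKL86_05675 hit, which supports, but does not prove, the assumption that the gapped parameters were used.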

 
Contact Sachin Pundhir for Bugs/Comments.