PredictBias

Identification of genomic and pathogenicity islands in prokaryotic genomes
A) Input parameters
Genome: CDN129.gb
Threshold dinucleotide bias: 2
Threshold codon bias: 4
Threshold %GC bias: 3
E-value (RPSBlast): 0.05
Genome (non-pathogenic):
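The RPSBlast E-value cutoff listed above presumably controls which virulence-factor hits are reported further down. A minimal Python sketch of that kind of filter follows. The names `Hit` and `filter_hits`, the inclusive comparison at the cutoff, and the third example hit are illustrative assumptions, not PredictBias internals.

```python
from typing import List, NamedTuple


class Hit(NamedTuple):
    locus_tag: str
    signature: str
    e_value: float


def filter_hits(hits: List[Hit], cutoff: float = 0.05) -> List[Hit]:
    """Keep hits whose E-value does not exceed the cutoff (assumed inclusive)."""
    return [h for h in hits if h.e_value <= cutoff]


hits = [
    Hit("HWH78_RS00185", "NUCEPIMERASE", 2e-05),  # reported on this page
    Hit("HWH78_RS00710", "PF06704", 0.032),       # reported on this page
    Hit("EXAMPLE_0001", "HYPOTHETICAL", 0.2),     # made-up hit above the cutoff
]
kept = filter_hits(hits)
print([h.locus_tag for h in kept])
```

Every hit reported on this page has an E-value at or below 0.05, which is consistent with such a filter but does not confirm how the tool applies it.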
 
C) Potential GIs and PAIs in NZ_CP056774
S.No  Start          End            Bias  Virulence  Insertion elements  Prediction
1     HWH78_RS00180  HWH78_RS00225  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS00180  2       11       1.637901   NAD(P)/FAD-dependent oxidoreductase
HWH78_RS00185  2       10       1.816007   TIGR01777 family oxidoreductase
HWH78_RS00190  3       12       1.729039   ferrochelatase
HWH78_RS00195  4       14       1.752381   MFS transporter
HWH78_RS00200  7       16       1.501094   spore coat U domain-containing protein
HWH78_RS00205  4       14       1.268943   fimbrial biogenesis outer membrane usher
HWH78_RS00210  4       12       0.468452   molecular chaperone
HWH78_RS00215  4       13       -0.843497  spore coat U domain-containing protein
HWH78_RS00220  4       11       -0.585231  spore coat U domain-containing protein
HWH78_RS00225  2       10       -0.792585  spore coat U domain-containing protein
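Each row of the per-ORF table above carries five fields: locus tag, dinucleotide bias, codon bias, %GC bias, and product. The Python sketch below splits such a row whether the columns are separated by whitespace or run together, as they are in a raw page scrape. It rests on assumptions read off the rows on this page rather than on documented PredictBias behaviour: the locus tag ends in `_RS` plus five digits, the dinucleotide bias is a single signed digit, and the %GC bias has one integer digit and six decimals. `parse_row` is a hypothetical helper name.

```python
import re

# Field layout assumed from the rows shown on this page (see the note above).
ROW = re.compile(
    r"^(?P<locus>[A-Z0-9]+_RS\d{5})\s*"   # locus tag, e.g. HWH78_RS00180
    r"(?P<dn>-?\d)\s*"                    # dinucleotide bias: one signed digit
    r"(?P<cdn>\d+)\s*"                    # codon bias: unsigned integer
    r"(?P<gc>-?\d\.\d{6})\s*"             # %GC bias: d.dddddd, optional sign
    r"(?P<product>.*)$"                   # product description (may be empty)
)


def parse_row(line: str):
    """Split one table row into (locus, dn_bias, cdn_bias, gc_bias, product)."""
    m = ROW.match(line.strip())
    if m is None:
        raise ValueError(f"unrecognised row: {line!r}")
    return (m["locus"], int(m["dn"]), int(m["cdn"]), float(m["gc"]), m["product"])


print(parse_row("HWH78_RS001802111.637901NAD(P)/FAD-dependent oxidoreductase"))
print(parse_row("HWH78_RS00215  4  13  -0.843497  spore coat U domain-containing protein"))
```

The fixed six-decimal %GC field is what makes the run-together form splittable: the regex engine backtracks on the codon-bias digits until the remaining text matches that field.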
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS00185  NUCEPIMERASE  39     2e-05    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 39.0 bits (91), Expect = 2e-05
Identities = 53/303 (17%), Positives = 97/303 (32%), Gaps = 73/303 (24%)

Query: 1 MHILLTGGTGLMGRRLCARWRQAGHRLTVF---------SRRPQQVESLCGVGVRGI--- 48
M L+TG G +G + R +AGH++ S + ++E L G +
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 49 ----DRFDE-YGDQPLDAVVNLAGEPIADKPWSHKRKALLWESRVRLTERLVEWLDAREQ 103
+ + + + V +S + +S + ++E R
Sbjct: 61 LADREGMTDLFASGHFERVFISPHR--LAVRYSLENPHAYADSNLTGFLNILE--GCRHN 116

Query: 104 RPDLLLSGSAVGWYGDSGERPVTEEGA--------AAGEDFASELCLAWEQIAREAEKLG 155
+ LL S+ YG + + P + + + AA + + + + G
Sbjct: 117 KIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHL------YG 170

Query: 156 TRVVLLRTGLVLAPEG-------GFL-----GRLLPLYRLGLGGPLGDGRQWMPWIHIED 203
LR V P G F G+ + +Y G+ + +I+D
Sbjct: 171 LPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVY--------NYGKMKRDFTYIDD 222

Query: 204 QIELIDFLLRRP---------DASGPYNACAP---------NPVRNRDFAKALGRALHKP 245
E I L + P + AP +PV D+ +AL AL
Sbjct: 223 IAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIE 282

Query: 246 ARI 248
A+
Sbjct: 283 AKK 285


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS00195  TCRTETA  44     1e-06    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 43.7 bits (103), Expect = 1e-06
Identities = 60/303 (19%), Positives = 92/303 (30%), Gaps = 33/303 (10%)

Query: 61 FALFYTLCGIPLGRMADNRSRRGLILFGVLVWSAMTAACGLARSYWQFLTFRVGVGVGEA 120
+AL C LG ++D RR ++L + + A A W R+ G+
Sbjct: 52 YALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGI-TG 110

Query: 121 ALSPAAYSLIADSFPRERRATAISVYSMGIYLGSGLAFLLGGLVIKFASAQGDVHLPLFG 180
A A + IAD + RA S G +LGGL+ H P
Sbjct: 111 ATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLM-----GGFSPHAP--- 162

Query: 181 EVRPWQLIFLILGA-AGVLFCLLLLAIREP--ARRGVGAGVAVPLGEVGAYLRANRKTVL 237
F A G+ F + E R A+ + R
Sbjct: 163 --------FFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAA 214

Query: 238 CHNFGFACLSFAGYGSGAWVPTFFVRTHGWDAGHVGVVYGSIVAVFGCLGIVFGGRLADY 297
F + WV F WDA +G+ +A FG L + +
Sbjct: 215 LMAVFFIMQLVGQVPAALWV-IFGEDRFHWDATTIGIS----LAAFGILHSLAQAMITGP 269

Query: 298 WAKRGRSDANMRVGLLAAWAVIPFTLVYPLLDNANWAAALMAPTVFFLSMPFGVAPAAIQ 357
A R + +G++A W MA + L G+ A+Q
Sbjct: 270 VAARLGERRALMLGMIADGTGYILLAFA----TRGW----MAFPIMVLLASGGIGMPALQ 321

Query: 358 EIM 360
++
Sbjct: 322 AML 324


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS00205  PF00577  402    e-130    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 402 bits (1035), Expect = e-130
Identities = 129/800 (16%), Positives = 248/800 (31%), Gaps = 85/800 (10%)

Query: 49 YYLELVING--RDSGQVVPVNAADGHYL---LDAAALREAGVRLPGNPAGQVAVD----- 98
Y +++ +N + V + L A L G+ + D
Sbjct: 78 YRVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVP 137

Query: 99 ---ALPEVRADYDSASQQLHLQVPPDWLPEQRFDDPGLVARA-PARSSLGALFNYDLYYS 154
+ + A D Q+L+L +P ++ + G + L NY+ +
Sbjct: 138 LTSMIHDATAQLDVGQQRLNLTIPQAFMSNR---ARGYIPPELWDPGINAGLLNYNFSGN 194

Query: 155 DPAD-GATPWLSALLEQRLFDGFGV--ISNTGVYTRYFGDADNLDSRYLRYDTYWLYNDE 211
+ A L + G + + ++ D+ + ++ WL D
Sbjct: 195 SVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDI 254

Query: 212 RNMHS-YQLGDYVNGALNWTTPVRMGGFRFARNFGVRPDLVTYPLLRFDGQAAVPSTVDL 270
+ S LGD + + G + A + + PD G A + V +
Sbjct: 255 IPLRSRLTLGDGYTQGDIFDG-INFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTI 313

Query: 271 FINGYKASSADLQPGPFAISNVPYINGAGEATVVTTDAQGRQVVTSLPFYVSNTLLARGL 330
NGY ++ + PGPF I+++ +G+ V +A G + ++P+ L G
Sbjct: 314 KQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGH 373

Query: 331 SDFDLSVGRLRDDYGLRNFSYADNAASGIYRYGVSDRLTLSTHAEAASDLRLLGIGGDIA 390
+ + ++ G R + +G+ T+ + A R G
Sbjct: 374 TRYSITAGEYRSGNAQQ---EKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKN 430

Query: 391 VATFGTLSLAASGSYGQGDSGQQY---------------------LLGYSYYSRR-LGLS 428
+ G LS+ + + Q+ L+GY Y + +
Sbjct: 431 MGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFA 490

Query: 429 LQHIERSAGYGDLGTLDGEYQLSRRTD------------QATASLTFDEQGTIGTGYFDI 476
R GY + TD Q T + T+
Sbjct: 491 DTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQ 550

Query: 477 RARDGS-RTRLANLSYSRPIGSRS-SFYLALNKDLDGDGYSALMQLVIPFDI-------- 526
S + + + +L K+ G ++ L +
Sbjct: 551 TYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDS 610

Query: 527 -----NGLLNIGVTRDSDRRYSERVIWSRSTPSQGGLGWNL------GYGGGASRYQQAD 575
+ + ++ D + R + + L +++ G G + A
Sbjct: 611 KSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYAT 670

Query: 576 LTWRMQNVQLQGGLYGETGNYTRWADLSGSLVWMDNAVFASNRINDAFVLVSTKGYPQVP 635
L +R G + +SG ++ N V +ND VLV G
Sbjct: 671 LNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAK 730

Query: 636 IRYENQLMGSTDDNGHLLVPWVAAYYPAKFQIEPLDLPANVSAPEVEQRVAVRQGSGLLL 695
+ ENQ TD G+ ++P+ Y + ++ L NV V +G+ +
Sbjct: 731 V--ENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRA 788

Query: 696 DFPIRAVVAASINLVDERGEPLPLGSQAEETGSGQRASVGWDGQVYFEGLQSDNQLRVV- 754
+F R + + L +PLP G+ S V +GQVY G+ +++V
Sbjct: 789 EFKARVGIKLLMTLTHN-NKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKW 847

Query: 755 -RPDGRACQARFRLDTRKPT 773
+ C A ++L
Sbjct: 848 GEEENAHCVANYQLPPESQQ 867
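
The statistics lines that head each alignment (`Score = ...`, `Expect = ...`, `Identities = ...`) can be read mechanically. The Python sketch below is illustrative only: `parse_expect` and `parse_stats` are hypothetical helper names, and it assumes that an E-value printed without a leading digit, such as `e-130`, stands for 1e-130.

```python
import re

STATS = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
IDENT = re.compile(r"Identities\s*=\s*(\d+)/(\d+)")


def parse_expect(text: str) -> float:
    """Convert an E-value string to float; 'e-130' is read as 1e-130 (assumption)."""
    return float("1" + text if text.startswith("e") else text)


def parse_stats(score_line: str, identities_line: str) -> dict:
    """Extract bit score, raw score, E-value and fractional identity."""
    s = STATS.search(score_line)
    i = IDENT.search(identities_line)
    if s is None or i is None:
        raise ValueError("unrecognised statistics lines")
    matches, length = int(i.group(1)), int(i.group(2))
    return {
        "bits": float(s.group(1)),
        "raw": int(s.group(2)),
        "expect": parse_expect(s.group(3)),
        "identity": matches / length,
    }


stats = parse_stats(
    "Score = 402 bits (1035), Expect = e-130",
    "Identities = 129/800 (16%), Positives = 248/800 (31%), Gaps = 85/800 (10%)",
)
print(stats)
```

Identity is computed from the raw counts rather than the printed percentage, so it is not limited to whole-percent precision.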


2     HWH78_RS00400  HWH78_RS00450  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS00400  1       10       -3.147345  DctP family TRAP transporter solute-binding
HWH78_RS00405  2       12       -3.461278  ferredoxin--NADP reductase
HWH78_RS00410  2       13       -2.812063  large-conductance mechanosensitive channel
HWH78_RS00415  2       11       -1.098755  hypothetical protein
HWH78_RS00420  0       10       0.205401   catalase KatB
HWH78_RS00425  0       11       0.901526   ankyrin repeat domain-containing protein
HWH78_RS00430  3       11       0.164214   YdcH family protein
HWH78_RS00435  4       13       0.468912   CopD family protein
HWH78_RS00440  3       14       -0.033748  DNA repair protein RadA
HWH78_RS00445  4       14       -0.345893  cyclic di-GMP-binding protein MapZ
HWH78_RS00450  2       14       -0.513981  DUF3015 domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
HWH78_RS00410  MECHCHANNEL  181    3e-62    Bacterial mechano-sensitive ion channel signature.
		>MECHCHANNEL#Bacterial mechano-sensitive ion channel signature.

Length = 136

Score = 181 bits (461), Expect = 3e-62
Identities = 88/137 (64%), Positives = 111/137 (81%), Gaps = 1/137 (0%)

Query: 1 MGLLSEFKAFAVKGNVVDMAVGIIIGAAFGKIVSSFVGDVIMPPIGLLIGGVDFSDLAIT 60
M ++ EF+ FA++GNVVD+AVG+IIGAAFGKIVSS V D+IMPP+GLLIGG+DF A+T
Sbjct: 1 MSIIKEFREFAMRGNVVDLAVGVIIGAAFGKIVSSLVADIIMPPLGLLIGGIDFKQFAVT 60

Query: 61 LKAAEGDVPAVVLAYGKFIQTVLDFVIVAFAIFMGVKAINRLKREEAVAPSEPPVPSAEE 120
L+ A+GD+PAVV+ YG FIQ V DF+IVAFAIFM +K IN+L R++ P+ P P+ EE
Sbjct: 61 LRDAQGDIPAVVMHYGVFIQNVFDFLIVAFAIFMAIKLINKLNRKKE-EPAAAPAPTKEE 119

Query: 121 TLLTEIRDLLKAQQNKS 137
LLTEIRDLLK Q N+S
Sbjct: 120 VLLTEIRDLLKEQNNRS 136


3     HWH78_RS00700  HWH78_RS00725  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS00700  2       11       -3.310027  signal peptidase II
HWH78_RS00705  2       11       -3.172694  FKBP-type peptidyl-prolyl cis-trans isomerase
HWH78_RS00710  2       11       -3.306705  4-hydroxy-3-methylbut-2-enyl diphosphate
HWH78_RS00715  2       12       -3.462812  type 4a pilus minor pilin PilE
HWH78_RS00720  1       13       -3.513585  type 4a fimbrial biogenesis protein PilY2
HWH78_RS00725  1       12       -3.379840  type 4a pilus biogenesis protein PilY1
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS00705  INFPOTNTIATR  33     2e-04    Macrophage infectivity potentiator signature.
		>INFPOTNTIATR#Macrophage infectivity potentiator signature.

Length = 233

Score = 33.4 bits (76), Expect = 2e-04
Identities = 21/54 (38%), Positives = 33/54 (61%), Gaps = 5/54 (9%)

Query: 8 GEESRVTLHFALKLEDGNVVDSTFDK--QPASFKVGDGNLLPGFEQALFGLKAG 59
G+ VT+ + L DG V DST +K +PA+F+V ++PG+ +AL + AG
Sbjct: 142 GKSDTVTVEYTGTLIDGTVFDST-EKAGKPATFQV--SQVIPGWTEALQLMPAG 192


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS00710  PF06704  28     0.032    DspF/AvrF protein
		>PF06704#DspF/AvrF protein

Length = 129

Score = 27.5 bits (61), Expect = 0.032
Identities = 10/28 (35%), Positives = 18/28 (64%), Gaps = 2/28 (7%)

Query: 194 KNDIC--YATQNRQDAVKELADQCDMVL 219
+N +C Y +Q+ + AV E+ D +MV+
Sbjct: 26 QNGVCALYDSQDNEAAVIEMPDHSEMVI 53


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS00715  BCTERIALGSPG  42     1e-07    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 41.8 bits (98), Expect = 1e-07
Identities = 18/49 (36%), Positives = 31/49 (63%), Gaps = 3/49 (6%)

Query: 4 RQKGFTLLEMVVVVAVIGILLGIAIPSYQNYVIRSNRTEGQALLSDAAA 52
+Q+GFTLLE++VV+ +IG+L + +P N + + + Q +SD A
Sbjct: 6 KQRGFTLLEIMVVIVIIGVLASLVVP---NLMGNKEKADKQKAVSDIVA 51


4     HWH78_RS00850  HWH78_RS00985  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS00850  2       15       -0.996997  hypothetical protein
HWH78_RS00855  0       11       -2.434149  energy-coupling factor ABC transporter permease
HWH78_RS00860  0       10       -3.567044  hypothetical protein
HWH78_RS00865  0       11       -4.293190  DNA gyrase inhibitor YacG
HWH78_RS00870  0       16       -4.955690  dephospho-CoA kinase
HWH78_RS00875  1       15       -4.713732  type IV prepilin peptidase/methyltransferase
HWH78_RS00880  0       15       -3.973024  type II secretion system F family protein
HWH78_RS00885  0       18       -3.720882  type IV-A pilus assembly ATPase PilB
HWH78_RS00890  -1      20       -2.574914  pilin
HWH78_RS00895  -2      14       -1.504977  O-antigen ligase family protein
HWH78_RS00905  -2      9        -1.196847  *carboxylating nicotinate-nucleotide
HWH78_RS00910  -1      10       -1.471486  DUF1631 domain-containing protein
HWH78_RS00915  -2      9        -2.039399  1,6-anhydro-N-acetylmuramyl-L-alanine amidase
HWH78_RS00920  -2      8        -1.345580  regulatory signaling modulator protein AmpE
HWH78_RS00925  -2      7        -1.795156  methyl-accepting chemotaxis protein
HWH78_RS00930  0       14       -4.479556  type III PLP-dependent enzyme
HWH78_RS00935  -1      12       -2.334990  VOC family protein
HWH78_RS00940  -1      12       -2.611180  phosphoethanolamine transferase CptA
HWH78_RS00945  -1      12       -1.926470  sel1 repeat family protein
HWH78_RS00950  -2      12       -1.883732  Fe2+-dependent dioxygenase
HWH78_RS00955  -1      13       -1.235155  TonB-dependent siderophore receptor PiuD
HWH78_RS00960  0       9        1.860053   sulfite reductase flavoprotein subunit alpha
HWH78_RS00965  -2      10       2.066261   lipid A hydroxylase LpxO
HWH78_RS00970  -2      9        3.337948   LamB/YcsF family protein
HWH78_RS00975  0       9        3.201861   5-oxoprolinase subunit PxpB
HWH78_RS00980  1       11       2.884684   biotin-dependent carboxyltransferase family
HWH78_RS00985  2       15       1.712306   Lrp/AsnC family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS00870  DHBDHDRGNASE  30     0.005    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 30.0 bits (67), Expect = 0.005
Identities = 23/88 (26%), Positives = 32/88 (36%), Gaps = 11/88 (12%)

Query: 5 WILGLTGGIGSGKSAAAEHFISLGVHLVDADHAARW--VVEPGRPALAKIVERFGDGILL 62
+I G GIG A A S G H+ D+ V A A+ E F
Sbjct: 12 FITGAAQGIGE---AVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAF------ 62

Query: 63 PDGQLDRAALRERIFQAPEERRWLEQLL 90
P D AA+ E + E ++ L+
Sbjct: 63 PADVRDSAAIDEITARIEREMGPIDILV 90


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS00875  PREPILNPTASE  352    e-125    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family

signature.
Length = 290

Score = 352 bits (906), Expect = e-125
Identities = 164/283 (57%), Positives = 194/283 (68%), Gaps = 1/283 (0%)

Query: 3 LLDYLASHPLAFVLCAILLGLLVGSFLNVVVHRLPKMMERNWKAEAREALGLEPE-PKQA 61
LL+ P + L L++GSFLNVV+HRLP M+ER W+AE R + E +
Sbjct: 4 LLELAHGLPWLYFSLVFLFSLMIGSFLNVVIHRLPIMLEREWQAEYRSYFNPDDEGVDEP 63

Query: 62 TYNLVLPNSACPRCGHEIRPWENIPLVSYLALGGKCSSCKAAIGKRYPLVELATALLSGY 121
YNL++P S CP C H I ENIPL+S+L L G+C C+A I RYPLVEL TALLS
Sbjct: 64 PYNLMVPRSCCPHCNHPITALENIPLLSWLWLRGRCRGCQAPISARYPLVELLTALLSVA 123

Query: 122 VAWHFGFTWQAGAMLLLTWGLLAMSLIDADHQLLPDVLVLPLLWLGLIANHFGLFASLDD 181
VA W A LLLTW L+A++ ID D LLPD L LPLLW GL+ N G F SL D
Sbjct: 124 VAMTLAPGWGTLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGD 183

Query: 182 ALFGTVFGYLSLWSVFWLFKLVTGKEGMGYGDFKLLAMLGAWGGWQILPLTILLSSLVGA 241
A+ G + GYL LWS++W FKL+TGKEGMGYGDFKLLA LGAW GWQ LP+ +LLSSLVGA
Sbjct: 184 AVIGAMAGYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGA 243

Query: 242 ILGVIMLRLRNAESGTPIPFGPYLAIAGWIALLWGDQITRTYL 284
+G+ ++ LRN PIPFGPYLAIAGWIALLWGD ITR YL
Sbjct: 244 FMGIGLILLRNHHQSKPIPFGPYLAIAGWIALLWGDSITRWYL 286


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS00880  BCTERIALGSPF  456    e-162    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F

signature.
Length = 408

Score = 456 bits (1174), Expect = e-162
Identities = 127/406 (31%), Positives = 226/406 (55%), Gaps = 14/406 (3%)

Query: 11 FVWEGTDKKGTKVKGELSSQNPTLVKAQLRKQGITPVKVR-------KKGISLLGA--GK 61
+ ++ D +G K +G + + + LR++G+ P+ V K G + L
Sbjct: 4 YHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLRRKI 63

Query: 62 KIKPMDIALFTRQMSTMMAAGVPLLQSFDIISEGFDNPNMRKLVEEIKQEVAGGNSLANS 121
++ D+AL TRQ++T++AA +PL ++ D +++ + P++ +L+ ++ +V G+SLA++
Sbjct: 64 RLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLADA 123

Query: 122 LRKKPQYFDSLYCNLVDAGEQSGALETLLDRVATYKEKTEALKAKIKKAMTYPIAVIVVA 181
++ P F+ LYC +V AGE SG L+ +L+R+A Y E+ + ++++I++AM YP + VVA
Sbjct: 124 MKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLTVVA 183

Query: 182 IIVSAILLIKVVPQFQSVFEGFGAELPAFTQMVINISNVLQEW--WLLVLLMMGGAGFLL 239
I V +ILL VVP+ F LP T++++ +S+ ++ + W+L+ L+ G F +
Sbjct: 184 IAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMAFRV 243

Query: 240 NHAYKRSEKFRDATDRTVLKLPIVGAILYKSAVARYARTLSTTFAAGVPLVEALDSVSGA 299
R EK R + R +L LP++G I ARYARTLS A+ VPL++A+
Sbjct: 244 ---MLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300

Query: 300 TGNVVFRDAVGKIKQDVSTGMQLNFSMRTTNIFPSMAIQMTAIGEESGALDDMLAKVAGF 359
N R + V G+ L+ ++ T +FP M M A GE SG LD ML + A
Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360

Query: 360 YEQEVDNAVDNLTALMEPMIMAVLGVLVGGLIIAMYLPIFQLGNVV 405
++E + + L EP+++ + +V +++A+ PI QL ++
Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS00890  BCTERIALGSPG  55     3e-12    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 54.5 bits (131), Expect = 3e-12
Identities = 21/54 (38%), Positives = 35/54 (64%)

Query: 1 MKAQKGFTLIELMIVVAIIGILAAIAIPQYQDYTARTQVTRAVSEISALKTAAE 54
Q+GFTL+E+M+V+ IIG+LA++ +P + +AVS+I AL+ A +
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALD 57


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
HWH78_RS00905  RTXTOXIND  30     0.013    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 29.8 bits (67), Expect = 0.013
Identities = 23/141 (16%), Positives = 45/141 (31%), Gaps = 4/141 (2%)

Query: 75 QVEDGQRVEPNQMLFQLKGP-ARALLTGERSALNFLQLLSGTATRSQHYADLVAGTAVKL 133
V++G+ V +L +L A A +S+L +L TR Q + + +
Sbjct: 111 IVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARL---EQTRYQILSRSIELNKLPE 167

Query: 134 LDTRKTLPGLRLAQKYAVTCGGCHNHRIGLYDAFLIKENHIAACGGIDRAIAQARRIAPG 193
L ++++ + + + ++ +R AR
Sbjct: 168 LKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYE 227

Query: 194 KPVEVEVENLDELRQALEAGA 214
VE LD+ L A
Sbjct: 228 NLSRVEKSRLDDFSSLLHKQA 248


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS00940  SECYTRNLCASE  32     0.006    Preprotein translocase SecY subunit signature.
		>SECYTRNLCASE#Preprotein translocase SecY subunit signature.

Length = 437

Score = 32.0 bits (73), Expect = 0.006
Identities = 15/63 (23%), Positives = 25/63 (39%)

Query: 114 SEYFSQYFNPWMTLGLVLYSLVAILLWRRLRPVYLPRFSALPVAVLLIVATIGYPFYKQL 173
+EY S N G + L+A++ L + +LI+ +G KQ+
Sbjct: 364 AEYLSYVLNRITWPGSLYLGLIALVPTMALVGFGASQNFPFGGTSILIIVGVGLETVKQI 423

Query: 174 VSQ 176
SQ
Sbjct: 424 ESQ 426


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS00970  PHPHTRNFRASE  30     0.012    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase

signature.
Length = 572

Score = 29.8 bits (67), Expect = 0.012
Identities = 16/64 (25%), Positives = 27/64 (42%), Gaps = 11/64 (17%)

Query: 40 CGFHAGDPLTMRRAVELAVR----HGVSIG------AHPAYPDLSGFGRRSLAC-SAEEV 88
CG AGD + + + L + SI + +L F +++L +AEEV
Sbjct: 503 CGEMAGDEVAIPLLLGLGLDEFSMSATSILPARSQLLKLSKEELKPFAQKALMLDTAEEV 562

Query: 89 HAMV 92
+V
Sbjct: 563 EQLV 566


5     HWH78_RS01170  HWH78_RS01285  Y     N          N                   Genomic Island
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS01170  2       12       -0.659047  hypothetical protein
HWH78_RS01175  1       12       -1.080889  class II fumarate hydratase FumC
HWH78_RS01180  -1      12       -1.267729  hypothetical protein
HWH78_RS01185  -2      13       -2.158706  superoxide dismutase
HWH78_RS01190  0       15       -3.106761  ZIP family metal transporter
HWH78_RS01195  0       18       -4.473007  HPr family phosphocarrier protein
HWH78_RS01200  0       18       -4.510281  RNase adapter RapZ
HWH78_RS01205  0       19       -5.168267  PTS IIA-like nitrogen regulatory protein PtsN
HWH78_RS01210  1       20       -4.763844  ribosome-associated translation inhibitor RaiA
HWH78_RS01215  1       18       -3.650504  RNA polymerase factor sigma-54
HWH78_RS01220  4       15       -2.877595  LPS export ABC transporter ATP-binding protein
HWH78_RS01225  2       13       -2.781600  lipopolysaccharide transport periplasmic protein
HWH78_RS01230  2       12       -2.769474  LPS export ABC transporter periplasmic protein
HWH78_RS01235  1       13       -2.485649  HAD family hydrolase
HWH78_RS01240  0       12       -2.552956  arabinose-5-phosphate isomerase KdsD
HWH78_RS01245  0       12       -3.217838  ABC transporter ATP-binding protein
HWH78_RS01250  -1      13       -2.880435  lipid asymmetry maintenance ABC transporter
HWH78_RS01255  -1      14       -2.534534  outer membrane lipid asymmetry maintenance
HWH78_RS01260  2       14       -1.259423  phospholipid-binding protein MlaC
HWH78_RS01265  4       13       -0.885208  STAS domain-containing protein
HWH78_RS01270  4       13       -0.577934  BolA family transcriptional regulator
HWH78_RS01275  4       14       -0.337704  UDP-N-acetylglucosamine
HWH78_RS01280  3       11       -0.379399  ATP phosphoribosyltransferase
HWH78_RS01285  2       10       -1.105690  histidinol dehydrogenase
6     HWH78_RS01350  HWH78_RS01405  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS01350  1       13       -3.964105  acyl-CoA dehydrogenase family protein
HWH78_RS01355  1       16       -4.421127  NADP(H)-dependent aldo-keto reductase
HWH78_RS01360  2       20       -5.600155  50S ribosomal protein L13
HWH78_RS01365  2       21       -5.255163  30S ribosomal protein S9
HWH78_RS01370  2       16       -4.808203  ubiquinol-cytochrome c reductase iron-sulfur
HWH78_RS01375  1       15       -4.951265  cytochrome bc complex cytochrome b subunit
HWH78_RS01380  0       17       -4.093023  cytochrome c1
HWH78_RS01385  1       14       -1.677755  glutathione S-transferase N-terminal
HWH78_RS01390  0       10       -0.880881  ClpXP protease specificity-enhancing factor
HWH78_RS01395  1       11       -1.086077  BON domain-containing protein
HWH78_RS01400  2       12       -1.048618  phosphoheptose isomerase
HWH78_RS01405  2       14       -1.050708  YraN family protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
HWH78_RS01355  HELNAPAPROT  29     0.016    Helicobacter neutrophil-activating protein A family ...
		>HELNAPAPROT#Helicobacter neutrophil-activating protein A family

signature.
Length = 153

Score = 29.1 bits (65), Expect = 0.016
Identities = 25/106 (23%), Positives = 45/106 (42%), Gaps = 17/106 (16%)

Query: 102 NLKFNRQHIVAALDASLERLQTDWLDLYQLHWPERRTNFFGQLGYQHQ--EESFTPLEET 159
N K N+ + +L+ L + L++ HW + +FF L H+ EE + ET
Sbjct: 5 NAKTNQTLVENSLNTQLSNWFLLYSKLHRFHWYVKGPHFF-TL---HEKFEELYDHAAET 60

Query: 160 LEVLDEQVRAGKIRHIGLSNETPWGTMT-FLRLA--EERGWPRAVS 202
++ + E++ A IG P T+ + A + G + S
Sbjct: 61 VDTIAERLLA-----IGGQ---PVATVKEYTEHASITDGGNETSAS 98


7     HWH78_RS02090  HWH78_RS02240  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS02090  2       9        2.991325   hypothetical protein
HWH78_RS02095  2       9        3.333750   methyl-accepting chemotaxis protein
HWH78_RS02100  0       8        3.222664   chromate efflux transporter
HWH78_RS02105  0       9        3.213445   helix-turn-helix transcriptional regulator
HWH78_RS02110  3       8        2.622592   DMT family transporter
HWH78_RS02115  2       7        2.449288   lipoate--protein ligase family protein
HWH78_RS02120  1       7        1.992182   exodeoxyribonuclease V subunit gamma
HWH78_RS02125  1       9        1.875471   exodeoxyribonuclease V subunit beta
HWH78_RS02130  2       12       1.657998   exodeoxyribonuclease V subunit alpha
HWH78_RS02135  3       14       0.869480   AAA family ATPase
HWH78_RS02140  -1      14       1.093187   exonuclease SbcCD subunit D C-terminal
HWH78_RS02170  -1      13       1.633356   **error-prone DNA polymerase
HWH78_RS02175  2       13       2.200294   DNA polymerase Y family protein
HWH78_RS02180  0       13       1.780449   translesion DNA synthesis-associated protein
HWH78_RS02185  0       13       1.966911   biliverdin-producing heme oxygenase PigA
HWH78_RS02190  -1      16       2.679627   hypothetical protein
HWH78_RS02195  0       16       3.133544   TonB-dependent outer membrane receptor
HWH78_RS02200  0       14       2.979952   RNA polymerase sigma factor VreI
HWH78_RS02205  1       15       3.399287   anti-sigma factor VreR
HWH78_RS02210  1       16       3.362940   type II secretion system protein J
HWH78_RS02215  1       15       3.819622   type II secretion system minor pseudopilin GspH
HWH78_RS02220  2       18       3.583266   hypothetical protein
HWH78_RS02225  -1      11       3.379265   type II secretion system minor pseudopilin GspI
HWH78_RS02230  1       11       3.561870   type II secretion system major pseudopilin GspG
HWH78_RS02235  1       12       4.133416   type II secretion system minor pseudopilin GspK
HWH78_RS02240  0       10       3.380584   type II secretion system protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS02095  PREPILNPTASE  31     0.010    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family

signature.
Length = 290

Score = 30.9 bits (70), Expect = 0.010
Identities = 14/53 (26%), Positives = 18/53 (33%)

Query: 9 LFVCLLQSLALVLLLVGARHGWLPLYLALPLGVALLWLPWLLPGNRTAVATVG 61
L LL + + L + LP L LPL L L A +G
Sbjct: 135 LAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGDAVIG 187


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
HWH78_RS02135  RTXTOXIND  45     1e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 45.2 bits (107), Expect = 1e-06
Identities = 41/234 (17%), Positives = 79/234 (33%), Gaps = 22/234 (9%)

Query: 628 DQEQVRAEQSLERLRQTLVGLREGYSSQRERLSQSRQEQQELTGQLAALDR-QLDQWTLP 686
+ E VR L +L G + L Q+R EQ +++ +L + LP
Sbjct: 114 EGESVRKGDVLLKLTAL--GAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLP 171

Query: 687 EELRLLQPSAQLEWLAQRLDDLAGQRQQCQRDFDRLIARQRQTQQLQQELRAAETILQQR 746
+E S + L ++ Q Q Q + L +
Sbjct: 172 DEPYFQNVSEEEVLRLTSLIK------------EQFSTWQNQKYQKELNLDK----KRAE 215

Query: 747 QQALTEQRQRYEHLQQQVEEDSQQLRPLLSDEHWQRWQADPLRTFQALGESIEQRRQQQA 806
+ + + RYE+L + + LL + + L E++ + R ++
Sbjct: 216 RLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAV--LEQENKYVEAVNELRVYKS 273

Query: 807 RLQQIEQRLQELKQRCDESSWQLKQS-DEQRNEARQAEERVQAELAELNGRLGA 859
+L+QIE + K+ + K ++ + + ELA+ R A
Sbjct: 274 QLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQA 327



Score = 37.1 bits (86), Expect = 4e-04
Identities = 24/178 (13%), Positives = 59/178 (33%), Gaps = 13/178 (7%)

Query: 878 AQAAQSAVETLQAPLDSLREEQLRLAEALEHLQQQRQRQQDEFQRLQADWQAWRERQDNL 937
Q ++E + P L +E + E + + +++F Q ++ Q L
Sbjct: 153 YQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQN-----QKYQKEL 207

Query: 938 DDSRLDALLGLSEEQATQWREQLQRLQEEITRQQTLEAER---QAQLLQHRRQRPETDRE 994
+ + A + ++ + + + +L ++ + +L+ + E E
Sbjct: 208 NLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNE 267

Query: 995 -----ALEDNLRQQRERLAASEQAYLDTYSQLQADNQRREQSQALLAELERARAEFRR 1047
+ + + + Q + D R+ L LE A+ E R+
Sbjct: 268 LRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQ 325



Score = 36.3 bits (84), Expect = 7e-04
Identities = 24/164 (14%), Positives = 63/164 (38%), Gaps = 11/164 (6%)

Query: 881 AQSAVETLQAPLDSLREEQLRLAEALEHLQQQRQRQQDEFQRLQADWQAWRERQDNLDDS 940
A++ Q+ L R EQ R R + ++ L+ + + + +
Sbjct: 132 AEADTLKTQSSLLQARLEQTRYQILS------RSIELNKLPELKLPDEPYFQNVSEEEVL 185

Query: 941 RLDALLGLSEEQATQWREQLQRLQEEITRQQTLEAERQAQLLQHRRQRPETDREALEDNL 1000
RL +L+ +EQ + W+ Q + + + +++ A++ ++ ++ L+D
Sbjct: 186 RLTSLI---KEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLS-RVEKSRLDD-F 240

Query: 1001 RQQRERLAASEQAYLDTYSQLQADNQRREQSQALLAELERARAE 1044
+ A ++ A L+ ++ ++ L ++E
Sbjct: 241 SSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILS 284



Score = 33.3 bits (76), Expect = 0.007
Identities = 28/214 (13%), Positives = 58/214 (27%), Gaps = 10/214 (4%)

Query: 253 QALQRLEGQQQWFTEEQRLLQSCEHAQGQLAEARQAWDALATERETLQWLERLAPVRGLI 312
L +L E L Q +L + R + + E L L+
Sbjct: 122 DVLLKLTALG---AEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQN 178

Query: 313 ERLKQLEQELRHSEQQQRQRTEQQAAGAERLQGLQARLQEARERQAQADNHLRQAQAPLR 372
+++ + ++Q Q+ L +A R N +
Sbjct: 179 VSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARI----NRYENLSRVEK 234

Query: 373 EAFQLESEARRLERTLAERQELHRQSNQRHAQQSDAARQL-DMEQQRHVAEQAQLQAALR 431
+L+ + L + + + Q N+ ++ +EQ A+ + L
Sbjct: 235 S--RLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLV 292

Query: 432 DSQALAALGDAWVTHQGQLATLVQRRQRALESQA 465
+ D + L + E Q
Sbjct: 293 TQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQ 326


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS02170  DHBDHDRGNASE  31     0.022    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 30.8 bits (69), Expect = 0.022
Identities = 30/122 (24%), Positives = 48/122 (39%), Gaps = 14/122 (11%)

Query: 527 VLALGMLSALRRSFDLIHALRGGKRLSIASIPSEDPATYEMISRADTIGVFQIESRAQMA 586
V + G+ +A R + R G +++ S P+ P T ++ + S+A
Sbjct: 115 VNSTGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRT--------SMAAYA-SSKAAAV 165

Query: 587 MLPRLRPQKFYDLVIQVAIVRPGPIQGDMVHPYLRRRNGEEPVAYPSAELEKVFERTLGV 646
M + + + I+ IV PG + DM NG E V S E K G+
Sbjct: 166 MFTKCLGLELAEYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFK-----TGI 220

Query: 647 PL 648
PL
Sbjct: 221 PL 222


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS02210  BCTERIALGSPG  32     6e-04    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 32.2 bits (73), Expect = 6e-04
Identities = 18/60 (30%), Positives = 32/60 (53%), Gaps = 3/60 (5%)

Query: 12 RRQAGFTLIEVMVAIMLMAIV-SLMAWRGLDSIARASAHLEDSTEQGAALLRALNQLERD 70
+Q GFTL+E+MV I+++ ++ SL+ + + +A S AL AL+ + D
Sbjct: 5 DKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVS--DIVALENALDMYKLD 62


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS02215  BCTERIALGSPH  49     3e-10    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H

signature.
Length = 170

Score = 49.2 bits (117), Expect = 3e-10
Identities = 30/129 (23%), Positives = 46/129 (35%), Gaps = 7/129 (5%)

Query: 5 RQGGFTLIELMVVLVIVGIATAAISLSARPDPTGLLRQDAARLARLLEIAQGEARVRGTP 64
RQ GFTL+E+M++L+++G++ + L+ Q AR L Q G
Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTGQF 61

Query: 65 ILWQPSAKGYRFSPQAYRGKTDALAADTELRARDWQAAPLRVSVRPPRPVLLDAEWIGAP 124
++F R D AD W PLR V G
Sbjct: 62 FGVSVHPDRWQFLVLEARDGADPAPADDGWSGYRWL--PLR-----AGRVATSGSIAGGK 114

Query: 125 LRITLSDGQ 133
L + + G+
Sbjct: 115 LNLAFAQGE 123


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS02225  BCTERIALGSPG  31     6e-04    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 31.0 bits (70), Expect = 6e-04
Identities = 19/62 (30%), Positives = 34/62 (54%), Gaps = 3/62 (4%)

Query: 8 RGFTLIEVLVALAIVAIALAAAIRAVGLMTDGNGLLRDKSLA-LLAAESRLAELRLGVGA 66
RGFTL+E++V IV I + A++ LM + + K+++ ++A E+ L +L
Sbjct: 8 RGFTLLEIMV--VIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDNHH 65

Query: 67 AP 68
P
Sbjct: 66 YP 67


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS02230  BCTERIALGSPG  167    1e-56    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 167 bits (425), Expect = 1e-56
Identities = 63/142 (44%), Positives = 87/142 (61%), Gaps = 6/142 (4%)

Query: 11 KGHRGQRGFTLIEIMVVVVILGILAAMVVPKVLDRPDQARATAARQDISGLMQALKLYRL 70
+ QRGFTL+EIMVV+VI+G+LA++VVP ++ ++A A DI L AL +Y+L
Sbjct: 2 RATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKL 61

Query: 71 DQGRYPSQAQGLKVLAERP-ADASASNWRS--YLERLPNDPWGKPYQYLNPGVNGEIDVF 127
D YP+ QGL+ L E P A+N+ Y++RLP DPWG Y +NPG +G D+
Sbjct: 62 DNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDLL 121

Query: 128 SLGADGQPGGEGINADIGSWQL 149
S G DG+ G E DI +W L
Sbjct: 122 SAGPDGEMGTED---DITNWGL 140


8     HWH78_RS02375  HWH78_RS02490  Y     N          Y                   Genomic Island
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS02375  2       17       -2.483117  antibiotic biosynthesis monooxygenase
HWH78_RS02380  0       22       -3.364925  lactoylglutathione lyase
HWH78_RS02385  0       26       -4.161072  N-acetyltransferase
HWH78_RS02390  1       27       -4.935428  YjhX family toxin
HWH78_RS02395  2       28       -5.485540  DUF2845 domain-containing protein
HWH78_RS02400  -1      18       -2.065861  hypothetical protein
HWH78_RS02405  -2      18       -1.772633  integrase family protein
HWH78_RS02410  -1      19       -0.412105  AlpA family phage regulatory protein
HWH78_RS02415  -1      17       0.136704   ASCH domain-containing protein
HWH78_RS02420  1       19       -1.141397  hypothetical protein
HWH78_RS02425  1       19       -1.132816  DNA cytosine methyltransferase
HWH78_RS02430  4       32       -5.528321  hypothetical protein
HWH78_RS02435  1       16       -3.909280  hypothetical protein
HWH78_RS02440  1       17       -3.989545  TraR/DksA family transcriptional regulator
HWH78_RS02445  1       16       -4.264008  hypothetical protein
HWH78_RS02450  0       15       -3.476939  hypothetical protein
HWH78_RS02455  -1      15       -3.002869  hypothetical protein
HWH78_RS02460  -1      16       -2.069797  toprim domain-containing protein
HWH78_RS02465  0       24       -3.585196  hypothetical protein
HWH78_RS02470  1       26       -4.466681  hypothetical protein
HWH78_RS02475  0       31       -4.491682  ogr/Delta-like zinc finger family protein
HWH78_RS02480  -2      33       -6.604790  hypothetical protein
HWH78_RS02485  2       22       -4.199743  DNA-binding protein
HWH78_RS02490  3       21       -3.471715  helix-turn-helix domain-containing protein
9     HWH78_RS02545  HWH78_RS02710  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS02545  2       20       -1.975262  tail assembly chaperone
HWH78_RS02550  2       20       -1.857375  phage tail protein
HWH78_RS02555  3       20       -0.740911  phage tail protein I
HWH78_RS02560  3       20       -0.435539  baseplate assembly protein
HWH78_RS02565  2       21       -0.317150  GPW/gp25 family protein
HWH78_RS02570  1       21       -0.118629  phage baseplate assembly protein V
HWH78_RS02575  1       24       -0.089881  phage virion morphogenesis protein
HWH78_RS02580  2       21       0.188857   phage tail protein
HWH78_RS31400  -1      22       -0.110609  Rz1-like lysis system protein LysC
HWH78_RS02590  -1      20       0.071163   N-acetylmuramidase family protein
HWH78_RS02595  0       21       -0.106593  phage holin family protein
HWH78_RS02600  -1      18       -1.204905  hypothetical protein
HWH78_RS02605  0       16       -1.716023  tail protein X
HWH78_RS02610  0       12       -2.674253  hypothetical protein
HWH78_RS02615  -1      10       -2.437075  head completion/stabilization protein
HWH78_RS02620  0       11       -3.041005  terminase endonuclease subunit
HWH78_RS02625  1       14       -4.759510  phage major capsid protein, P2 family
HWH78_RS02630  0       12       -4.822272  GPO family capsid scaffolding protein
HWH78_RS02635  -1      14       -4.467569  terminase ATPase subunit family protein
HWH78_RS02640  -1      15       -3.949927  phage portal protein
HWH78_RS02645  0       15       -3.628122  hypothetical protein
HWH78_RS02650  -1      14       -2.620166  zeta toxin family protein
HWH78_RS02660  -1      14       -0.127574  *(R)-3-hydroxydecanoyl-ACP:CoA transacylase
HWH78_RS02665  0       13       0.523035   murein L,D-transpeptidase catalytic domain
HWH78_RS02670  -1      13       0.327243   L,D-transpeptidase
HWH78_RS02675  2       13       0.699665   16S rRNA pseudouridine(516) synthase
HWH78_RS02680  3       13       0.673334   cysteine-rich CWC family protein
HWH78_RS02685  3       12       0.355478   DUF4824 family protein
HWH78_RS02690  4       11       1.388210   DUF2157 domain-containing protein
HWH78_RS02695  3       10       1.649763   hypothetical protein
HWH78_RS02700  2       11       2.244245   hypothetical protein
HWH78_RS02705  3       12       2.043309   DUF1145 domain-containing protein
HWH78_RS02710  2       13       2.396513   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
HWH78_RS02560 FLGMOTORFLIG 28 0.045 Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 28.2 bits (63), Expect = 0.045
Identities = 12/47 (25%), Positives = 20/47 (42%), Gaps = 14/47 (29%)

Query: 6 VAIDLSQLPPPHAVEQLDYEQILAERKAYAISLWPEDQQAEIAARLA 52
+A+ LS L P A ++ +S P + Q +A R+A
Sbjct: 140 IALILSYLDPQKA--------------SFILSSLPTEVQTNVARRIA 172


ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
HWH78_RS02630 PF07201 33 0.001 Hypersensitivity response secretion protein HrpJ
		>PF07201#Hypersensitivity response secretion protein HrpJ

Length = 293

Score = 32.5 bits (74), Expect = 0.001
Identities = 31/133 (23%), Positives = 48/133 (36%), Gaps = 21/133 (15%)

Query: 127 DSPASLGTEALSFSAKNGTLASRKTNPDTLFSAAEEGTLEFEEYEEKPSVGAALFTKVKE 186
S + F ++ + S + AEE T F E K
Sbjct: 22 ASSQIVNQTLGQFRGESVQIVSGTLQS--IADMAEEVTFVFSE------------RKELS 67

Query: 187 LLKGKEARTQAEFGQVGEAVEAIAEHSRDLGEQLGEQKKQTQQLASQL-DKVTKELADLK 245
L K K + +QA V E V +L EQK+ +L S L + L+ LK
Sbjct: 68 LDKRKLSDSQARVSDVEEQVNQYLSKVPEL-----EQKQNVSELLSLLSNSPNISLSQLK 122

Query: 246 STLDS-TRDHSQQ 257
+ L+ + + S+Q
Sbjct: 123 AYLEGKSEEPSEQ 135


ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
HWH78_RS02640 ACRIFLAVINRP 29 0.026 Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 29.4 bits (66), Expect = 0.026
Identities = 11/55 (20%), Positives = 23/55 (41%), Gaps = 7/55 (12%)

Query: 239 AKGPGNFRNLFVYAPNGKKEGIQLIPVSEVA-AKDEFGSIKNISRDDQLAGLRVY 292
P + L+V + NG +++P S + +GS + R + L + +
Sbjct: 779 RMLPEDVDKLYVRSANG-----EMVPFSAFTTSHWVYGS-PRLERYNGLPSMEIQ 827


10 HWH78_RS03015 HWH78_RS03140 Y Y N Pathogenicity Island (biased-composition)
LocusTag DNBias CDNBias %GCBias Product
HWH78_RS030152122.998156GntR family transcriptional regulator
HWH78_RS030202123.122513methyltransferase domain-containing protein
HWH78_RS030252132.986025DEAD/DEAH box helicase
HWH78_RS030303111.773513DUF3649 domain-containing protein
HWH78_RS030352111.672461PepSY domain-containing protein
HWH78_RS03040-1110.420746DUF3325 domain-containing protein
HWH78_RS03045-213-0.207397VOC family protein
HWH78_RS03050-2111.132922aldo/keto reductase
HWH78_RS030550101.451539hypothetical protein
HWH78_RS030601122.838588hypothetical protein
HWH78_RS030651112.361619N-acetylmuramoyl-L-alanine amidase AmpDh3
HWH78_RS030702103.309736SRPBCC family protein
HWH78_RS030753134.066971Nramp family divalent metal transporter
HWH78_RS030802133.825935haloacid dehalogenase type II
HWH78_RS030851144.089204MFS transporter
HWH78_RS030901153.7410323-keto-5-aminohexanoate cleavage protein
HWH78_RS030951154.390896aminotransferase class V-fold PLP-dependent
HWH78_RS031000163.373661RidA family protein
HWH78_RS03105115-0.655598LysR family transcriptional regulator
HWH78_RS03110126-3.984535LysR family transcriptional regulator
HWH78_RS03115336-7.122241VOC family protein
HWH78_RS03120140-6.830701hypothetical protein
HWH78_RS03125241-7.498084hypothetical protein
HWH78_RS03130241-8.824818HNH endonuclease
HWH78_RS03135229-6.547823DUF3396 domain-containing protein
HWH78_RS03140120-4.298696PAAR domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
HWH78_RS03085 TCRTETA 33 0.002 Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 32.9 bits (75), Expect = 0.002
Identities = 30/155 (19%), Positives = 58/155 (37%), Gaps = 7/155 (4%)

Query: 5 LAANPTQRYRWVILLIATFAQACACFFVQGIGAI-----AVFIQNDLQLSSLQIGLLVSA 59
A NP +RW + A F +Q +G + +F ++ + IG+ ++A
Sbjct: 195 EALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAA 254

Query: 60 AQLVPIVG-LLVAGELLDRYSERLVVGLGTLIVALALCASLWATDYLTILLFLVVVGAGY 118
++ + ++ G + R ER + LG + +AT +V++ +G
Sbjct: 255 FGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASG- 313

Query: 119 STAQPGGSKSVSRWFAKTQLGFAMGIRQAGLPLGG 153
P +SR + + G G A L
Sbjct: 314 GIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTS 348


11 HWH78_RS03730 HWH78_RS03765 Y Y N Pathogenicity Island (biased-composition)
LocusTag DNBias CDNBias %GCBias Product
HWH78_RS03730114-3.604120GTP diphosphokinase
HWH78_RS03735418-4.392747nucleoside triphosphate pyrophosphohydrolase
HWH78_RS03740319-4.957016lipid A hydroxylase LpxO2
HWH78_RS03745224-5.507474DUF2058 family protein
HWH78_RS03750128-4.812054LPS O-antigen chain length determinant protein
HWH78_RS03755-123-5.352475hypothetical protein
HWH78_RS03760-123-4.322555hypothetical protein
HWH78_RS03765-216-3.360411DUF2024 family protein
ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
HWH78_RS03745 IGASERPTASE 39 7e-06 IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 38.9 bits (90), Expect = 7e-06
Identities = 15/79 (18%), Positives = 33/79 (41%), Gaps = 2/79 (2%)

Query: 19 QAKQATKQKQKQQRLEHKNQVDKDDSQRQAAEQAKAEKLARDQ--ELNRQQQEKAEKKAK 76
+ +KQ+ K ++ + R+ A++AK+ A Q E+ + E E +
Sbjct: 1039 TVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTT 1098

Query: 77 AAQIKQLIEGTRLPKLESD 95
+ +E K+E++
Sbjct: 1099 ETKETATVEKEEKAKVETE 1117


12 HWH78_RS03855 HWH78_RS04440 Y Y Y Pathogenicity Island (biased-composition)
LocusTag DNBias CDNBias %GCBias Product
HWH78_RS03855-113-3.137932PaaI family thioesterase
HWH78_RS03860-111-2.688110OprD family porin
HWH78_RS03865010-2.793420HIT domain-containing protein
HWH78_RS03870111-2.709012protein SlyX
HWH78_RS03875012-2.905938cold-shock protein
HWH78_RS03880113-3.179223DNA starvation/stationary phase protection
HWH78_RS03885216-3.329596aspartate--tRNA ligase
HWH78_RS03890120-3.937211YebC/PmpR family DNA-binding transcriptional
HWH78_RS03895123-4.255176crossover junction endodeoxyribonuclease RuvC
HWH78_RS03900122-4.907326Holliday junction branch migration protein RuvA
HWH78_RS03905118-4.568390Holliday junction branch migration DNA helicase
HWH78_RS03910020-4.774325tol-pal system-associated acyl-CoA thioesterase
HWH78_RS03915020-4.317388protein TolQ
HWH78_RS03920118-4.528657protein TolR
HWH78_RS03925018-4.077267cell envelope integrity protein TolA
HWH78_RS03930-120-4.313637Tol-Pal system beta propeller repeat protein
HWH78_RS03935029-5.565506peptidoglycan-associated lipoprotein Pal
HWH78_RS03940-130-5.734941tol-pal system protein YbgF
HWH78_RS03945-131-6.2704997-carboxy-7-deazaguanine synthase QueE
HWH78_RS03950039-7.7714187-cyano-7-deazaguanine synthase QueC
HWH78_RS03960139-8.671838*site-specific integrase
HWH78_RS03965140-8.958732helicase/relaxase domain-containing protein
HWH78_RS03970030-7.198741type II toxin-antitoxin system RelE/ParE family
HWH78_RS03975028-6.141880type II toxin-antitoxin system ParD family
HWH78_RS03980030-6.067170hypothetical protein
HWH78_RS03985127-4.874191conjugal transfer protein TraG N-terminal
HWH78_RS03990225-3.230915hypothetical protein
HWH78_RS03995226-3.563371integrating conjugative element protein
HWH78_RS04000020-3.520571TIGR03756 family integrating conjugative element
HWH78_RS04005020-3.828606TIGR03757 family integrating conjugative element
HWH78_RS04010017-3.565254hypothetical protein
HWH78_RS04015-116-2.901259DsbA family protein
HWH78_RS04020016-3.020253hypothetical protein
HWH78_RS04025-116-2.667657conjugative transfer ATPase
HWH78_RS04030023-3.176765TIGR03751 family conjugal transfer lipoprotein
HWH78_RS04035025-3.336584TIGR03752 family integrating conjugative element
HWH78_RS04040030-2.874778TIGR03749 family integrating conjugative element
HWH78_RS04045130-4.908609TIGR03746 family integrating conjugative element
HWH78_RS04050123-5.383691TIGR03750 family conjugal transfer protein
HWH78_RS04055220-5.777671TIGR03745 family integrating conjugative element
HWH78_RS04060116-3.663229TIGR03758 family integrating conjugative element
HWH78_RS04065116-3.735880RAQPRD family integrative conjugative element
HWH78_RS04070117-3.608997hypothetical protein
HWH78_RS04075217-3.163217UvrD-helicase domain-containing protein
HWH78_RS04080219-1.863127TIGR03747 family integrating conjugative element
HWH78_RS04085121-1.584971type IV conjugative transfer system coupling
HWH78_RS04090228-2.617268hypothetical protein
HWH78_RS04095132-4.549726integrating conjugative element protein
HWH78_RS04100-135-5.662273hypothetical protein
HWH78_RS04105041-6.779367TIGR03759 family integrating conjugative element
HWH78_RS04110-150-8.589031methyl-accepting chemotaxis protein
HWH78_RS04115055-10.346554hypothetical protein
HWH78_RS04120-236-6.884329hypothetical protein
HWH78_RS04125030-4.881292VWA domain-containing protein
HWH78_RS04130128-4.492713hypothetical protein
HWH78_RS04135225-4.141341hypothetical protein
HWH78_RS04140223-4.034966hypothetical protein
HWH78_RS04145322-3.749284DEAD/DEAH box helicase
HWH78_RS04150525-4.039113class I SAM-dependent methyltransferase
HWH78_RS04155529-4.725787hypothetical protein
HWH78_RS04160334-7.710925hypothetical protein
HWH78_RS04165239-8.866217hypothetical protein
HWH78_RS04170140-9.381696hypothetical protein
HWH78_RS04175241-10.067189DUF3275 family protein
HWH78_RS04180345-11.187804hypothetical protein
HWH78_RS04185251-12.503205hypothetical protein
HWH78_RS04190449-11.126753hypothetical protein
HWH78_RS04195437-6.986937hypothetical protein
HWH78_RS04200122-4.525861hypothetical protein
HWH78_RS04205-218-3.219177DUF3577 domain-containing protein
HWH78_RS04210-117-3.127845hypothetical protein
HWH78_RS04215015-2.665664hypothetical protein
HWH78_RS04220113-2.625369type IV pilus biogenesis protein PilM
HWH78_RS04225112-2.316793shufflon system plasmid conjugative transfer
HWH78_RS04230216-2.317945Flp pilus assembly complex ATPase component
HWH78_RS04235216-2.766308pilus assembly protein PilX
HWH78_RS04240315-2.218825type II secretion system F family protein
HWH78_RS04245215-2.101249Flp pilus assembly complex ATPase component
HWH78_RS04250220-2.777815type IV pilus biogenesis protein PilP
HWH78_RS04255220-3.479892type 4b pilus protein PilO2
HWH78_RS04260121-3.756795PilN family type IVB pilus formation outer
HWH78_RS04265125-3.942806TcpQ domain-containing protein
HWH78_RS04270025-4.416475DEAD/DEAH box helicase
HWH78_RS04275231-5.482914hypothetical protein
HWH78_RS04280225-5.067612type I DNA topoisomerase
HWH78_RS04285129-5.936967hypothetical protein
HWH78_RS04290-135-6.113203CrpP family protein
HWH78_RS04295035-6.979707single-stranded DNA-binding protein
HWH78_RS04300240-7.818673DUF3158 family protein
HWH78_RS04305241-7.943570TIGR03761 family integrating conjugative element
HWH78_RS04310233-7.341026hypothetical protein
HWH78_RS04315133-6.934236hypothetical protein
HWH78_RS04320330-6.677853DUF2857 domain-containing protein
HWH78_RS04325227-5.909798ParB family protein
HWH78_RS04330218-3.971054Arc family DNA-binding protein
HWH78_RS04335218-4.217756nucleoid-associated protein YejK
HWH78_RS04340223-5.176304hypothetical protein
HWH78_RS04345124-6.139803hypothetical protein
HWH78_RS04350123-5.513924hypothetical protein
HWH78_RS04355124-5.471921hypothetical protein
HWH78_RS04360024-5.315130replicative DNA helicase
HWH78_RS04365127-5.232556hypothetical protein
HWH78_RS04370227-5.302833HNH endonuclease
HWH78_RS04375032-5.133112DUF2786 domain-containing protein
HWH78_RS04380034-5.737781Lar family restriction alleviation protein
HWH78_RS04385-131-5.772517hypothetical protein
HWH78_RS04390-133-6.029666hypothetical protein
HWH78_RS04395137-6.955442hypothetical protein
HWH78_RS04400137-7.179132ParA family protein
HWH78_RS04410342-8.688958*acyl-CoA thioesterase
HWH78_RS04415336-8.201483hypothetical protein
HWH78_RS04420440-8.297908NUDIX hydrolase
HWH78_RS04425335-6.682309Hpt domain-containing protein
HWH78_RS04430231-5.566277type 1 fimbrial protein
HWH78_RS04435127-4.842942molecular chaperone
HWH78_RS04440021-3.265076fimbrial biogenesis outer membrane usher
ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
HWH78_RS03880 HELNAPAPROT 157 3e-52 Helicobacter neutrophil-activating protein A family ...
		>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature.

Length = 153

Score = 157 bits (398), Expect = 3e-52
Identities = 50/145 (34%), Positives = 72/145 (49%)

Query: 11 DRAAIAEGLSRLLADTYTLYLKTHNFHWNVTGPMFNTLHLMFEGQYTELAVAVDDIAERI 70
++ + L+ L++ + LY K H FHW V GP F TLH FE Y A VD IAER+
Sbjct: 9 NQTLVENSLNTQLSNWFLLYSKLHRFHWYVKGPHFFTLHEKFEELYDHAAETVDTIAERL 68

Query: 71 RALGFPAPGTYAAYARLSSIKEEEGVPEAEEMIRQLVQGQEAVVRTARSIFPLLDKVSDE 130
A+G T Y +SI + A EM++ LV + + ++ + L ++ D
Sbjct: 69 LAIGGQPVATVKEYTEHASITDGGNETSASEMVQALVNDYKQISSESKFVIGLAEENQDN 128

Query: 131 PTADLLTQRMQVHEKTAWMLRSLLA 155
TADL ++ EK WML S L
Sbjct: 129 ATADLFVGLIEEVEKQVWMLSSYLG 153


ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
HWH78_RS03885 ANTHRAXTOXNA 32 0.009 Anthrax toxin LF subunit signature.
		>ANTHRAXTOXNA#Anthrax toxin LF subunit signature.

Length = 800

Score = 31.6 bits (71), Expect = 0.009
Identities = 31/117 (26%), Positives = 51/117 (43%), Gaps = 23/117 (19%)

Query: 212 YYQIAKCFRDEDLRADRQPEFTQIDIETSFLDESDIIGITEKMVRQLFKEVL-------D 264
YY+I K + + D+ + +++ S D+SD ++ + Q FKE L D
Sbjct: 170 YYEIGKGISLDIISKDKSLDPEFLNLIKSLSDDSD---SSDLLFSQKFKEKLELNNKSID 226

Query: 265 VEF-----DEFPHMPFEEAMRRYGSDKPDLRIPLEL-----VDVADQLKEVEFKVFS 311
+ F EF H F A Y + PD R LEL + ++L++ F+ S
Sbjct: 227 INFIKENLTEFQHA-FSLAFSYYFA--PDHRTVLELYAPDMFEYMNKLEKGGFEKIS 280


ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
HWH78_RS03915 60KDINNERMP 29 0.017 60kDa inner membrane protein signature.
		>60KDINNERMP#60kDa inner membrane protein signature.

Length = 548

Score = 29.1 bits (65), Expect = 0.017
Identities = 17/72 (23%), Positives = 28/72 (38%), Gaps = 13/72 (18%)

Query: 12 WSLISNASIVVQLVMLTLVAASVTSWIMIFQRGNAMRAAKKALDAFEERFWS-----GID 66
+S+I + +V+ +M L A TS MR + + A ER +
Sbjct: 356 FSIII-ITFIVRGIMYPLTKAQYTSM-------AKMRMLQPKIQAMRERLGDDKQRISQE 407

Query: 67 LSKLYRQAGSNP 78
+ LY+ NP
Sbjct: 408 MMALYKAEKVNP 419


ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
HWH78_RS03925 IGASERPTASE 49 1e-08 IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 49.3 bits (117), Expect = 1e-08
Identities = 36/204 (17%), Positives = 71/204 (34%), Gaps = 21/204 (10%)

Query: 54 QLKSKSQATTQTNQKIAGEAKKTASKQYE-----VEQLEQKKLEQQKLEQQKLEQQQVAA 108
Q + TT N + + + +++ + E +Q +
Sbjct: 990 QTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESK 1049

Query: 109 AKAAEQKKADEARKAEAQKAAEAKKADEAKKAAEAKAAEQKKQADIAKKRAEDEAKKKAA 168
++ A E + A EAK +A + ++A ++ E K+
Sbjct: 1050 TVEKNEQDATETTAQNREVAKEAKSNVKA----------NTQTNEVA--QSGSETKETQT 1097

Query: 169 EDAKKKAAEDAKKKAAEEAKKKAAAEAAKKKAAVEAAKKKAAAAAAAARKAAEDKKAQAL 228
+ K+ A + ++KA E +K E K + V ++++ A A E+ +
Sbjct: 1098 TETKETATVEKEEKAKVETEKT--QEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNI 1155

Query: 229 AELLS--DTTERQQALADEVGSEV 250
E S +TT + A E S V
Sbjct: 1156 KEPQSQTNTTADTEQPAKETSSNV 1179


ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
HWH78_RS03935 OMPADOMAIN 116 6e-34 OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 116 bits (291), Expect = 6e-34
Identities = 34/112 (30%), Positives = 53/112 (47%), Gaps = 12/112 (10%)

Query: 68 YFEYDSSDLKPEAMRALDVHA---KDLKGSGQRVVLEGHTDERGTREYNMALGERRAKAV 124
F ++ + LKPE ALD +L VV+ G+TD G+ YN L ERRA++V
Sbjct: 222 LFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQSV 281

Query: 125 QRYLVLQGVSPAQLELVSYGKERPVATGHDEQS---------WAQNRRVELK 167
YL+ +G+ ++ G+ PV + A +RRVE++
Sbjct: 282 VDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIE 333


ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
HWH78_RS03940 RTXTOXIND 32 0.002 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 32.1 bits (73), Expect = 0.002
Identities = 10/53 (18%), Positives = 19/53 (35%)

Query: 69 QLQQMQDELARLRGTLEEQQNQIQQLKQESLERYQDLDRRISGGGAPAAQNSA 121
+ + +EL + LE+ +++I K+E Q I N
Sbjct: 260 KYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIG 312


ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
HWH78_RS04160 FbpA_PF05833 26 0.018 Fibronectin-binding protein
		>FbpA_PF05833#Fibronectin-binding protein

Length = 577

Score = 26.0 bits (57), Expect = 0.018
Identities = 10/32 (31%), Positives = 17/32 (53%)

Query: 8 LTQETLAYLEDQLSNNDVAGDDELIDLFIEEL 39
+E L YL L+N + A + + I+ +EL
Sbjct: 406 QNEEELNYLYSVLTNINNADNYDEIEEIKKEL 437


ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
HWH78_RS04175 TONBPROTEIN 28 0.015 Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 28.4 bits (63), Expect = 0.015
Identities = 12/68 (17%), Positives = 20/68 (29%), Gaps = 13/68 (19%)

Query: 104 EVPAIQQPTVAPAAPPKSPQKPKP-------------LSPAATGDDAPFGMDPPAPAEQA 150
+P + PK KPKP + P + +PF PA +
Sbjct: 77 PIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESRPASPFENTAPARLTSS 136

Query: 151 ASLDTDAD 158
+ +
Sbjct: 137 TATAATSK 144


ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
HWH78_RS04225 BCTERIALGSPG 37 3e-05 Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 37.2 bits (86), Expect = 3e-05
Identities = 15/62 (24%), Positives = 29/62 (46%)

Query: 2 RSKRSSGFISIELMIALVVIAIATAGGISVLMSYLDGLDEQHAAQQQQQVAKAAEKYLKD 61
+ + GF +E+M+ +V+I + + + LM + D+Q A + A + Y D
Sbjct: 3 ATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLD 62

Query: 62 NF 63
N
Sbjct: 63 NH 64


ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
HWH78_RS04235 PilS_PF08805 117 7e-36 PilS N terminal
		>PilS_PF08805#PilS N terminal

Length = 185

Score = 117 bits (295), Expect = 7e-36
Identities = 46/179 (25%), Positives = 91/179 (50%), Gaps = 12/179 (6%)

Query: 2 STTQRTSRPTQGGFVSIEMIIVLIIIAIGVGLGLAAAAGMFSSSNANEEQRNISVIAANA 61
S + R + G +E+++V+ +I + + + S+ ++ EQ N+ + AN
Sbjct: 15 SLSARRKKEQDKGATLMEVLLVVGVIVVLAASAYKLYSMVQSNIQSSNEQNNVLTVIANM 74

Query: 62 RALKTSSGYGSSGTNLIPSLIAINGVPKNM--SVSSGVVYNVYGGSVTV--SSTGMGFSI 117
++LK Y + +N I +L A +P +M + N +GGSVT+ SS F++
Sbjct: 75 KSLKFQGRY--TDSNYIKTLYAQGLLPSDMIADTTGASAKNPWGGSVTITTSSDKYSFNV 132

Query: 118 TTSKLPQDACITLATKIAKNTFEQTKINSGSAITGEVTTAAATQACSSDSNSITWTYSS 176
+ +PQ C+ + + +++ +KIN+ S +T +A C+SDSN++T++ S
Sbjct: 133 VEANVPQKNCMAMVNAL-RSSSAISKINNTS-----TSTVSAATVCASDSNTLTFSTDS 185


ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
HWH78_RS04240 BCTERIALGSPF 72 5e-16 Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 71.8 bits (176), Expect = 5e-16
Identities = 74/346 (21%), Positives = 141/346 (40%), Gaps = 20/346 (5%)

Query: 14 SKQFGRKERLQFYESMSTLLENGVPLKDAVAEVHKIFAHEGQHPFHPVAIASREALMGLS 73
+ + ++TL+ +PL++A+ V K E H + A R +M
Sbjct: 62 KIRLSTSDLALLTRQLATLVAASMPLEEALDAVAK--QSEKPH-LSQLMAAVRSKVME-- 116

Query: 74 NGKRLATAMALYLPVQE---RALIEAGEMSGNLVQAMGDAVSLVEAQARIRATIWQALLY 130
G LA AM + E A++ AGE SG+L + E + ++R+ I QA++Y
Sbjct: 117 -GHSLADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIY 175

Query: 131 PSALSAMMVFLLCIVAYRMVPSLARLSDPVTWTGPLAT--LNAIASFVTGPGIYVLVAVI 188
P L+ + + ++ I+ +VP + + PL+T L ++ V G ++L+A++
Sbjct: 176 PCVLTVVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALL 235

Query: 189 TLTVVVIVTLPTYRWKGRVWLDRTLPPW----SIYRMLQGTTFLLNMAVMLNAGIRPYDS 244
+ V L + RV R L I R L + ++++ + + +
Sbjct: 236 AGFMAFRVMLRQEKR--RVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQA 293

Query: 245 LASMIK-ISPPWLKQRLEAARYGVGLGQNLGVALRSAGHDFPDRQAIQYLCILANRGGFS 303
+ +S + + RL A V G +L AL FP + G
Sbjct: 294 MRISGDVMSNDYARHRLSLATDAVREGVSLHKALE-QTALFP-PMMRHMIASGERSGELD 351

Query: 304 EALVKFSRRWQETSLKQIELAAGLVKNFALIFIGALMILVLLGAYQ 349
L + + Q+ LA GL + ++ + A+++ ++L Q
Sbjct: 352 SMLERAADNQDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQ 397


ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
HWH78_RS04260 BCTERIALGSPD 87 4e-20 Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 87.3 bits (216), Expect = 4e-20
Identities = 70/318 (22%), Positives = 132/318 (41%), Gaps = 26/318 (8%)

Query: 269 SELKTSILSDIENSINSMLTPSMGRMSLSRATGTLTVTDRPEVLNRVQQLVNRENESITK 328
+ + +++ S+ + + + T L VT P+V+N +++++ + +
Sbjct: 287 TGISSTMQSEKQAAKPVAALDKNIIIKAHGQTNALIVTAAPDVMNDLERVIA-QLDIRRP 345

Query: 329 QVLLNVNVLSVALTDKDQLGIDW---NLVYKSLNNKWGIGLKNTMPGIDQSAISGSV--- 382
QVL+ + V D LGI W N N G+ + + G +Q G+V
Sbjct: 346 QVLVEAIIAEVQDADGLNLGIQWANKNAGMTQFTNS-GLPISTAIAGANQYNKDGTVSSS 404

Query: 383 --SILDTANSAWAGS-----KAMVQALAQQGRVSTVRSPSVTTLNLQSAPIQIGRYDSYL 435
S L + N AG ++ AL+ + + +PS+ TL+ A +G+ L
Sbjct: 405 LASALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVL 464

Query: 436 ASSQISNVAQVGSTTSLIPGAVTSGYNMSLLPFVMESGEMLLKININMTSRPTFEMQTSG 495
SQ ++ + +T T G + + P + E +LL+I ++S TS
Sbjct: 465 TGSQTTSGDNIFNTVERK----TVGIKLKVKPQINEGDSVLLEIEQEVSSVADAASSTSS 520

Query: 496 DSKAQFPSYDIQLFDQKVRLRSGETLVLSGF--DQTTEDTNKV-GTGDAGFFG-LGGGLT 551
D A F + + V + SGET+V+ G ++ +KV GD G L +
Sbjct: 521 DLGATFNTRTVN---NAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTS 577

Query: 552 RNTKREVIVVLITPVVLG 569
+ + +++ I P V+
Sbjct: 578 KKVSKRNLMLFIRPTVIR 595


ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
HWH78_RS04265 PF03544 31 0.007 Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 30.7 bits (69), Expect = 0.007
Identities = 25/128 (19%), Positives = 39/128 (30%), Gaps = 4/128 (3%)

Query: 166 QLPPAPRP-KPVQQLYAKPAAPTPAAVAQPSSTEKVSTLESPVVVASVPTPTPITTSPAP 224
Q+ P P +P+ PA P QP V P + P P+
Sbjct: 39 QVIELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPK 98

Query: 225 TKKPESTTVFPPAAPAKDGHPSSPPAASAPIKPLASAVKSMPPTPAVASAPPVKVLTPAE 284
K P + P S P P + + P + +A V + A
Sbjct: 99 PKPKPKP---KPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVAS 155

Query: 285 PSRQLAQS 292
R L+++
Sbjct: 156 GPRALSRN 163


ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
HWH78_RS04315 IGASERPTASE 31 0.013 IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 30.8 bits (69), Expect = 0.013
Identities = 35/223 (15%), Positives = 72/223 (32%), Gaps = 11/223 (4%)

Query: 221 DPAAEFGIRTLPDLPHSTPSSDAELSEISGKQCALPLSSD--AEPRQNPPSTPLVRMPNS 278
+P E +T+ +TP++ I ++P +++ A + P P P+
Sbjct: 982 NPEVEKRNQTVDTTNITTPNN------IQADVPSVPSNNEEIARVDEAPVPPPAPATPSE 1035

Query: 279 YSTYTYKQDSVCKKPVQPQTREEAHPNWQGLLHTLEAEQRIQAVSALRRVSEDLRLPIIE 338
+ + K V+ ++ Q EA+ ++A + V++
Sbjct: 1036 TTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKET 1095

Query: 339 QWQHRCAGGTVGNPFGYLMTLIQRAVQGKFNASWAPEEPAERTIPAAERPSRAPAPSSPI 398
Q TV + + K + +P++ T+ P+R P+ I
Sbjct: 1096 QTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNI 1155

Query: 399 APTQPQVQPRGDTRTGSEVLSRLKDLIRPRHGSSVPSERGDEP 441
Q Q DT ++ S + S G+
Sbjct: 1156 KEPQSQTNTTADTEQPAKETSS---NVEQPVTESTTVNTGNSV 1195


ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
HWH78_RS04330 FbpA_PF05833 26 0.028 Fibronectin-binding protein
		>FbpA_PF05833#Fibronectin-binding protein

Length = 577

Score = 25.6 bits (56), Expect = 0.028
Identities = 12/62 (19%), Positives = 22/62 (35%), Gaps = 6/62 (9%)

Query: 18 RDQLKQKAAHNHRSANSEIVYRLERSNELEEELARANRMVDELFAKNQRLQAELAAANTP 77
D+LK K++ + + I ++ L L + +L EL AN
Sbjct: 294 SDRLKSKSSDLQKIVMNNINRCTKKDKILNNTLKKCED------KDIFKLYGELLTANIY 347

Query: 78 QV 79
+
Sbjct: 348 AL 349


ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
HWH78_RS04440 PF00577 791 0.0 Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 791 bits (2045), Expect = 0.0
Identities = 282/863 (32%), Positives = 443/863 (51%), Gaps = 47/863 (5%)

Query: 12 LSVYSRSSCLMALGLALPAVTFAVEFNAEFLNNEGGAPVELKYFENGNSVSPGTYSVDIH 71
+ R A P + + FN FL ++ A +L FENG + PGTY VDI+
Sbjct: 24 AGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDIY 83

Query: 72 LNQIMIRREDVVFSADPDTGSVRPVVRVGLLKEIGVDIARLTRDKLIPDNLENNTPLNVA 131
LN + DV F+ + P + L +G++ A ++ L+ D+ + +
Sbjct: 84 LNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADD----ACVPLT 139

Query: 132 ELIPGASIEFDVNSLSLLVSIPQLYVQRHSRGYVDPSLWDDGVTALFSNYQANFTRNTN- 190
+I A+ + DV L ++IPQ ++ +RGY+ P LWD G+ A NY + N
Sbjct: 140 SMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNR 199

Query: 191 FGQNSDYRYLGLRNGFNLFGWRLRNDSSLS-----GGTGMRNKFSSNRTYVERDIRALKG 245
G NS Y YL L++G N+ WRLR++++ S +G +NK+ T++ERDI L+
Sbjct: 200 IGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRS 259

Query: 246 TLSLGELYTSAQGDAFESVRMRGVQLQSDIGMLPDNEISYTPVVRGIAETNATVEVSQNG 305
L+LG+ YT GD F+ + RG QL SD MLPD++ + PV+ GIA A V + QNG
Sbjct: 260 RLTLGDGYTQ--GDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNG 317

Query: 306 FVIYSTNVPPGAFEITDIYPSGSNGDLEVKIIEADGRQRSFKQSYSYLPVMTRKGNLRYG 365
+ IY++ VPPG F I DIY +G++GDL+V I EADG + F YS +P++ R+G+ RY
Sbjct: 318 YDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYS 377

Query: 366 LAAGEYHNDG--QPSVNLLQGSAVYGLSDRVTGFGGLLAAEKYNATNLGLGFNT-PLGGF 422
+ AGEY + Q Q + ++GL T +GG A++Y A N G+G N LG
Sbjct: 378 ITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGAL 437

Query: 423 SADVTHSQSRTRRGGRNQGQSLRLLYSKTINATETSFTVVGYRYSTEGYRTLSQH----- 477
S D+T + S ++ GQS+R LY+K++N + T+ +VGYRYST GY +
Sbjct: 438 SVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRM 497

Query: 478 ----------IDDMSEESYLYGSSSSRQKSRIDLTVNQTLFRRSSLYLTAGETTYWNRPG 527
+ + + Y + + ++ ++ LTV Q L R S+LYL+ TYW
Sbjct: 498 NGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSN 557

Query: 528 SSRRVQFGFSSGIKRASYSLAVSRTQETGSFGRSDTQFTASVSIPLGG--------SARS 579
+ Q G ++ + +++L+ S T+ GR D +V+IP R
Sbjct: 558 VDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGR-DQMLALNVNIPFSHWLRSDSKSQWRH 616

Query: 580 SQVYANAVSSQHGDSSLNTGISGYLDEANAFNYSAQANYSKDG----GNSGSVGLGWDTS 635
+ + +G + G+ G L E N +YS Q Y+ G G++G L +
Sbjct: 617 ASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGG 676

Query: 636 KAKLSANYSQGRDNKQINLGASGSVVVHSGGVTFGQPVGETFGLVEVPEVGGVGLDGYSS 695
+ YS D KQ+ G SG V+ H+ GVT GQP+ +T LV+ P ++ +
Sbjct: 677 YGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTG 736

Query: 696 VRTDGRGYAVLPYMQPYRYNWVNLDTNTLGSDTEISDSTQMAVPTRGAVIAKRFSAESGR 755
VRTD RGYAVLPY YR N V LDTNTL + ++ ++ VPTRGA++ F A G
Sbjct: 737 VRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGI 796

Query: 756 RVQFDLSMDSGGKIPFGAQAYDKEERVVGMVDNLSRLLVFGIEDQGRLSIRWSDG---SC 812
++ L+ + +PFGA + + G+V + ++ + G+ G++ ++W + C
Sbjct: 797 KLLMTLTHN-NKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENAHC 855

Query: 813 SVDYQLPPRNKDLTYERVALSCR 835
+YQLPP ++ +++ CR
Sbjct: 856 VANYQLPPESQQQLLTQLSAECR 878


13 HWH78_RS04735 HWH78_RS04775 Y Y N Pathogenicity Island (biased-composition)
LocusTag DNBias CDNBias %GCBias Product
HWH78_RS047352122.758785helix-turn-helix domain-containing protein
HWH78_RS047402111.850239GntP family permease
HWH78_RS047501121.955104glycerate kinase
HWH78_RS047552120.631276hypothetical protein
HWH78_RS047602120.696101glycine zipper 2TM domain-containing protein
HWH78_RS047653140.416584monovalent cation/H+ antiporter subunit A
HWH78_RS047703110.653082Na+/H+ antiporter subunit C
HWH78_RS047752110.672974monovalent cation/H+ antiporter subunit D
ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
HWH78_RS04740 HTHFIS 30 0.016 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.2 bits (68), Expect = 0.016
Identities = 9/27 (33%), Positives = 16/27 (59%)

Query: 315 AHDGHAQACAQALGIHRNSLRYRLERI 341
A G+ A LG++RN+LR ++ +
Sbjct: 447 ATRGNQIKAADLLGLNRNTLRKKIREL 473


ORFs having significant similarity with Known Virulence factors
LocusTag Hits Score E-value Comments
HWH78_RS04745 BACINVASINB 30 0.026 Salmonella/Shigella invasin protein B signature.
		>BACINVASINB#Salmonella/Shigella invasin protein B signature.

Length = 593

Score = 29.7 bits (66), Expect = 0.026
Identities = 23/123 (18%), Positives = 53/123 (43%), Gaps = 10/123 (8%)

Query: 23 LHPFLALLAAAL---IAGFAYQVPTAEIVKTVTGGFGSILGYIGIVIVLGTIIGVILERS 79
L P + L+ A+ + G TAE+ ++ G + + + +++V+ +
Sbjct: 378 LKPLMELIGKAITKALEGLGVDKKTAEMAGSIVGAIVAAIAMVAVIVVVAVV------GK 431

Query: 80 GAAITMAESVIRLLGERFPTLTMSIIGYLVS-IPVFCDSGFVILNSLKNALAARMKISTI 138
GAA + ++ +++GE L +++ L G + S + ++M + T
Sbjct: 432 GAAAKLGNALSKMMGETIKKLVPNVLKQLAQNGSKLFTQGMQRITSGLGNVGSKMGLQTN 491

Query: 139 AMS 141
A+S
Sbjct: 492 ALS 494


14 HWH78_RS05240 HWH78_RS05340 Y Y Y Pathogenicity Island (biased-composition)
LocusTag DNBias CDNBias %GCBias Product
HWH78_RS05240-1103.022644LysR family transcriptional regulator
HWH78_RS05245-1113.327242UTRA domain-containing protein
HWH78_RS05250-1113.249445HAD-IB family hydrolase
HWH78_RS05255-382.471527MFS transporter
HWH78_RS05260-382.230702LysR family transcriptional regulator
HWH78_RS05265016-1.228006iron-containing alcohol dehydrogenase
HWH78_RS05270122-3.320175APC family permease
HWH78_RS05275234-5.396618exotoxin A
HWH78_RS05280671-16.492473ATP-grasp domain-containing protein
HWH78_RS05285661-15.780177DUF2195 family protein
HWH78_RS05290440-12.184901S-type pyocin domain-containing protein
HWH78_RS05295331-10.801297hypothetical protein
HWH78_RS05300124-8.291323IS3 family transposase
HWH78_RS05305123-8.679036SIR2 family protein
HWH78_RS05310018-0.596214ribonucleotide-diphosphate reductase subunit
HWH78_RS05315-1170.770094ribonucleoside-diphosphate reductase subunit
HWH78_RS05320-1142.608384winged helix-turn-helix domain-containing
HWH78_RS05325-1132.433819HAMP domain-containing protein
HWH78_RS05330-1152.478917cold-shock protein
HWH78_RS05335-1163.249915hypothetical protein
HWH78_RS05340-2173.101655methyltransferase domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS05255  TCRTETA  43  2e-06  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 42.5 bits (100), Expect = 2e-06
Identities = 76/334 (22%), Positives = 120/334 (35%), Gaps = 35/334 (10%)

Query: 68 LAAYAVGQF----MWGVFADRFGTRVVVLGGLLTSALAALVMGTLATLPIFAACMVVQGL 123
LA YA+ QF + G +DRFG R V+L L +A+ +M T L + +V G+
Sbjct: 49 LALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGI 108

Query: 124 AQSTGWSGLCKNIGAFFA----TYERGRVLGLWSTCYAFGGLVATPFAGWCAYRLTHDWR 179
+TG GA+ A ER R G S C+ F G+VA P G +
Sbjct: 109 TGATG-----AVAGAYIADITDGDERARHFGFMSACFGF-GMVAGPVLGGLMGGFSP--H 160

Query: 180 MAFLSSAGVVLAVAVLFFFLQRNRPQDVGLPPVEAEAARPANAPRPSH-SGALLKALRNP 238
F ++A + + FL LP RP + +
Sbjct: 161 APFFAAAALNGLNFLTGCFL---------LPESHKGERRPLRREALNPLASFRWARGMTV 211

Query: 239 HVLVLGLAYFLLKPARYAILLWGPVIIYERMPEIGKVASAIVPSAFEVAGLAGPILIGLA 298
++ + + + + LW VI E I +AF + +I
Sbjct: 212 VAALMAVFFIMQLVGQVPAALW--VIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGP 269

Query: 299 SDKLFGARRMPACVLSLAALTVSLALFVPAMQSGSLYLVVGLLFMMGVTLYGPDSMISGA 358
G RR A +L + A L A + + ++ LL G+ + +M+S
Sbjct: 270 VAARLGERR--ALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQAMLSRQ 327

Query: 359 AAIDFGTSEAAGTATGFVNGCGSVGAILGGLLPG 392
E G G + S+ +I+G LL
Sbjct: 328 VD-----EERQGQLQGSLAALTSLTSIVGPLLFT 356


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS05275  DPTHRIATOXIN  32  0.009  Diphtheria toxin signature.
		>DPTHRIATOXIN#Diphtheria toxin signature.

Length = 567

Score = 31.6 bits (71), Expect = 0.009
Identities = 23/72 (31%), Positives = 33/72 (45%), Gaps = 5/72 (6%)

Query: 458 FVGYHGTFLEAAQSIVFGGVRARSQ---DLDAIWRGFYVAGDPALAYGYAQDQEPDARGR 514
F YHGT SI G + +S + D W+GFY + A GY+ D E G+
Sbjct: 49 FSSYHGTKPGYVDSIQKGIQKPKSGTQGNYDDDWKGFYSTDNKYDAAGYSVDNENPLSGK 108

Query: 515 IRNGALLRVYVP 526
G +++V P
Sbjct: 109 A--GGVVKVTYP 118


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS05290  PYOCINKILLER  719  0.0  Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 719 bits (1856), Expect = 0.0
Identities = 459/488 (94%), Positives = 468/488 (95%)

Query: 2 ARPIADLIHFNSTTVTASGDVYYGPGGGTGIGPIARPIEHGLDSSTENGWQEFESYADLG 61
ARPIADLIHFNSTTVTASGDVYYGPGGGTGIGPIARPIEHGLDSSTENGWQEFESYAD+G
Sbjct: 1 ARPIADLIHFNSTTVTASGDVYYGPGGGTGIGPIARPIEHGLDSSTENGWQEFESYADVG 60

Query: 62 VDPRRYVPLQVKEKRREIELQFRDAEKKLEASVQAELDKADAALGPAKNLAPLDVINRSL 121
VDPRRYVPLQVKEKRREIELQFRDAEKKLEASVQAELDKADAALGPAKNLAPLDVINRSL
Sbjct: 61 VDPRRYVPLQVKEKRREIELQFRDAEKKLEASVQAELDKADAALGPAKNLAPLDVINRSL 120

Query: 122 TIVGNALQQKNQKLLLNQKKITSLGAKNFLTRTAEEIGEQAVREGNINGPEAYMRFLDRE 181
TIVGNALQQKNQKLLLNQKKITSLGAKNFLTRTAEEIGEQAVREGNINGPEAYMRFLDRE
Sbjct: 121 TIVGNALQQKNQKLLLNQKKITSLGAKNFLTRTAEEIGEQAVREGNINGPEAYMRFLDRE 180

Query: 182 MEGLTAAYNVKLFTEAISSLQIRMNTLTAAKASIEAAAANKAREQAAAEAKRKAEEQARQ 241
MEGLTAAYNVKLFTEAISSLQIRMNTLTAAKASIEAAAANKAREQAAAEAKRKAEEQARQ
Sbjct: 181 MEGLTAAYNVKLFTEAISSLQIRMNTLTAAKASIEAAAANKAREQAAAEAKRKAEEQARQ 240

Query: 242 QAAIRAANTYAMPANGSVVATAAGRGLIQVAQGAASLAQAISDAIAVLGRVLVSAPSVMA 301
QAAIRAANTYAMPANGSVVATAAGRGLIQVAQGAASLAQAISDAIAVLGRVL SAPSVMA
Sbjct: 241 QAAIRAANTYAMPANGSVVATAAGRGLIQVAQGAASLAQAISDAIAVLGRVLASAPSVMA 300

Query: 302 VGFASLTYSSRTAEQWQDQTPDSVRYALGMDANKLGLPSSVNLNAVAKAGGTVDLPMRLT 361
VGFASLTYSSRTAEQWQDQTPDSVRYALGMDA KLGLP SVNLNAVAKA GTVDLPMRLT
Sbjct: 301 VGFASLTYSSRTAEQWQDQTPDSVRYALGMDAAKLGLPPSVNLNAVAKASGTVDLPMRLT 360

Query: 362 NEARGSTTTLSVVSTDGVSVPKAVPVRMAAYNTATGLYEVTVPSTVAEAPPLILTWTPAS 421
NEARG+TTTLSVVSTDGVSVPKAVPVRMAAYN TGLYEVTVPST AEAPPLILTWTPAS
Sbjct: 361 NEARGNTTTLSVVSTDGVSVPKAVPVRMAAYNATTGLYEVTVPSTTAEAPPLILTWTPAS 420

Query: 422 PPGNQNPSSTTPVVSKPVPVYEGAALTPLKTDPESYPGMLLDLNDLIVIFPADSGVKPVY 481
PPGNQNPSSTTPVV KPVPVYEGA LTP+K PE+YPG++ DLI+ FPADSG+KP+Y
Sbjct: 421 PPGNQNPSSTTPVVPKPVPVYEGATLTPVKATPETYPGVITLPEDLIIGFPADSGIKPIY 480

Query: 482 VMLSSPLD 489
VM P D
Sbjct: 481 VMFRDPRD 488


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS05320  HTHFIS  79  3e-19  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.5 bits (196), Expect = 3e-19
Identities = 37/148 (25%), Positives = 66/148 (44%), Gaps = 3/148 (2%)

Query: 7 RILIVEDDRRLAELTREYLEGNGLKVDIEANGALAAARILAERPDLVVLDLMLPGEDGLS 66
IL+ +DD + + + L G V I +N A I A DLVV D+++P E+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 67 ICRQVR-PQFDGPILMLTARTDDMDEVLGLEMGADDYVCKPVRPRVLLARIRALLRRSEA 125
+ +++ + D P+L+++A+ M + E GA DY+ KP L+ I L +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 126 PEAGAPAADSKRLAFGRLVIDNAMREAW 153
+ + + AM+E +
Sbjct: 125 RPSKLEDD--SQDGMPLVGRSAAMQEIY 150


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS05325  PF06580  41  8e-06  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 41.0 bits (96), Expect = 8e-06
Identities = 21/107 (19%), Positives = 35/107 (32%), Gaps = 25/107 (23%)

Query: 431 LQNLLTNALRHA------DRRVRISYRVSLERCRVDVEDDGPGVPEAQWERLFTPFLRLD 484
+Q L+ N ++H ++ + ++VE+ G + E
Sbjct: 260 VQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE---------- 309

Query: 485 DSRTRASGGHGLGLSIVR-RIVYWHGGRASIGRSETLGGACFTLAWP 530
G GL VR R+ +G A I SE G + P
Sbjct: 310 --------STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348


15  HWH78_RS05645  HWH78_RS05725  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
HWH78_RS056451113.258712murein transglycosylase A
HWH78_RS056502133.415015LysR family transcriptional regulator
HWH78_RS056551103.340062NAD(P)H-dependent oxidoreductase
HWH78_RS056600113.558900NAD(P)H-dependent oxidoreductase
HWH78_RS056650123.556746TetR/AcrR family transcriptional regulator
HWH78_RS056701133.670815aminoglycoside phosphotransferase family
HWH78_RS056750123.249753hypothetical protein
HWH78_RS056800132.910566helix-turn-helix transcriptional regulator
HWH78_RS056850162.968160DUF1656 domain-containing protein
HWH78_RS056900142.848161HlyD family secretion protein
HWH78_RS056950123.568931FUSC family protein
HWH78_RS057001114.508925DUF2790 domain-containing protein
HWH78_RS057050104.624811thioredoxin family protein
HWH78_RS05710-174.611979helix-turn-helix transcriptional regulator
HWH78_RS05715-294.026154multidrug efflux MFS transporter
HWH78_RS05720-174.244209HlyD family secretion protein
HWH78_RS05725-183.403161TolC family protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS05665  HTHTETR  53  5e-11  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 53.5 bits (128), Expect = 5e-11
Identities = 24/144 (16%), Positives = 58/144 (40%), Gaps = 8/144 (5%)

Query: 12 RRRLSRDERQRQLLEVAWRLVREEGTEALTLGRLAEQAGVTKPVVYDHFGTRAGLLAALY 71
+ + E ++ +L+VA RL ++G + +LG +A+ AGVT+ +Y HF ++ L + ++
Sbjct: 4 KTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIW 63

Query: 72 QDYDLRQTALMEAALEASEATLEGRADVIARAYVDCVMQQGREIPGVVAALASSPE---- 127
+ + L I ++ ++ + E
Sbjct: 64 ELSESNIGELELEYQAKFPGDPLSVLREILIHVLES-TVTEERRRLLMEIIFHKCEFVGE 122

Query: 128 ---LERIKRDYEVLFMDKCRAVLE 148
+++ +R+ + D+ L+
Sbjct: 123 MAVVQQAQRNLCLESYDRIEQTLK 146


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS05690  RTXTOXIND  66  4e-14  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 65.6 bits (160), Expect = 4e-14
Identities = 43/214 (20%), Positives = 75/214 (35%), Gaps = 39/214 (18%)

Query: 79 RSYRLAVRQREAELEQARETLRQRDEQWRRRMQLAGAVSREEVANAGRALRIARARAEAA 138
R Y+ + Q E+E+ A+E + + ++ + LR
Sbjct: 269 RVYKSQLEQIESEILSAKEEYQLVTQLFKNEIL--------------DKLRQTTDNIGLL 314

Query: 139 GAALDQARLDLRRSELRSPVDGYVTQLRVQ-PGDYAAAGRTNIFIV-DRRSFWVTGYFEE 196
L + + S +R+PV V QL+V G T + IV + + VT +
Sbjct: 315 TLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQN 374

Query: 197 TKLRNVQVGAPATIKLMGFD----PLLDGHVASIGRGVADLNESRADSGLPQVSPNFSWI 252
+ + VG A IK+ F L G V +I D+ Q
Sbjct: 375 KDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNIN----------LDAIEDQRLGLV--- 421

Query: 253 RLAQRVPVRIELDRVPA---GVVLAAGMTGSVEV 283
V + IE + + + L++GM + E+
Sbjct: 422 ---FNVIISIEENCLSTGNKNIPLSSGMAVTAEI 452



Score = 47.5 bits (113), Expect = 3e-08
Identities = 18/114 (15%), Positives = 41/114 (35%), Gaps = 3/114 (2%)

Query: 41 VSAQVIRIAPEVSGSVEAVFVADNQRVARGDPLYRIDPRSYRLAVRQREAELEQARETLR 100
S + I P + V+ + V + + V +GD L ++ + ++ L QAR +
Sbjct: 92 HSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLE-Q 150

Query: 101 QRDEQWRRRMQLAGAVSREEVANAGRALRIARARAEAAGAALDQARLDLRRSEL 154
R + R ++L E + +L + + +++
Sbjct: 151 TRYQILSRSIELNKL--PELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQK 202


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS05710  HTHFIS  34  5e-04  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 34.0 bits (78), Expect = 5e-04
Identities = 14/103 (13%), Positives = 31/103 (30%), Gaps = 6/103 (5%)

Query: 87 RHDLPQDCRVVDVPPLLRQLIVAAMRIAPDYPPGGRDERVMELILDELRVLPILALHVPQ 146
R + + R + + + ++ + D L + + +
Sbjct: 376 REIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAE 435

Query: 147 PVDPRLAALCRSLRAEPAADWSLGDAARRLGVSPRTLTRAFQR 189
P + L A A + AA LG++ TL + +
Sbjct: 436 MEYPLI------LAALTATRGNQIKAADLLGLNRNTLRKKIRE 472


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS05715  TCRTETB  109  7e-28  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 109 bits (273), Expect = 7e-28
Identities = 79/402 (19%), Positives = 168/402 (41%), Gaps = 17/402 (4%)

Query: 23 FMAGMNVHVTSAALPEIEGALGATFEEGSWISTAYLVAEISMIPLTAWLVEVFSLRRVML 82
F + +N V + +LP+I +W++TA+++ + L + ++R++L
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 83 LGSLVFLLSSLSCALAPN-LSTLILIRVIQGASGAVLIPLSMQLILTELPSSRIPLGMAL 141
G ++ S+ + + S LI+ R IQGA A L M ++ +P L
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 142 FSLSNSVAQAAGPSIGGWLADAYSWRWIFLLQLLPGIALLAAVAWSIRPRDGDRERLRQA 201
++ + GP+IGG +A W ++ L+ ++ I + + + R +
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKL---LKKEVRIK-GHF 199

Query: 202 DWLGIGAMVAGLGALQIVLEEGGRRDWFESGFIRTFAVLAVLALLLFVQRQLWGARPFIN 261
D GI M G+ + F + + +F +++VL+ L+FV+ PF++
Sbjct: 200 DIKGIILMSVGIVFFML----------FTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVD 249

Query: 262 LRLLGSYNFGVSSLAMAVFGAATFGLVFLVPNYLSQLQGFNARQIGDSLILYGLVQLLL- 320
L + F + L + G V +VP + + + +IG +I G + +++
Sbjct: 250 PGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIF 309

Query: 321 APLLPRLMRWLNPKLLVAGGFAIMALGCWMGAHLNADAGRNVIIPSIVVRGIGQPLIMVA 380
+ L+ P ++ G +++ ++ A + + IV G
Sbjct: 310 GYIGGILVDRRGPLYVLNIGVTFLSVS-FLTASFLLETTSWFMTIIIVFVLGGLSFTKTV 368

Query: 381 LSVLAVKGLDKAQAGSASALISMLRNLGGAIGTALLTQLVSL 422
+S + L + +AG+ +L++ L G A++ L+S+
Sbjct: 369 ISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSI 410


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS05720  RTXTOXIND  121  1e-32  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 121 bits (304), Expect = 1e-32
Identities = 61/368 (16%), Positives = 110/368 (29%), Gaps = 68/368 (18%)

Query: 66 AVSAQVSGYVAEVLVADDADVQAGDLLLRLDPRDFR-------QRLRAAEAREAAAQAAL 118
+ + V E++V + V+ GD+LL+L L A + Q
Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILS 157

Query: 119 EAQ-------------------------------RAKLETLDRQLLEQAQTISRARADGE 147
+ + + T Q ++ + + RA+
Sbjct: 158 RSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERL 217

Query: 148 AARAEWRRAETDWR-------RYRQLADEHATSRQRLENADAVHQRARAAARRASAEEGR 200
A R E R + L + A ++ + + + A R ++ +
Sbjct: 218 TVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQ 277

Query: 201 QRAARDVLKSR--------RREAEAALAQRQAELQEAAAARELARHALDDTEIRAPFAGR 252
+ K + E L Q + + IRAP + +
Sbjct: 278 IESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVK 337

Query: 253 VGQRKVRLR-QYVTPGLPLLAVVPLEQAYVV-ANYKETQLERIRPGQPVELEVDTFGRRW 310
V Q KV VT L+ +VP + V A + + I GQ ++V+ F
Sbjct: 338 VQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTR 397

Query: 311 RGRVDSVAPASGAVFALLPPDNATGNFTKIVQRFPVRIRLDADAAERG----RLLPGMSV 366
G + + D +V F V I ++ + G L GM+V
Sbjct: 398 YGYLV-------GKVKNINLDAIEDQRLGLV--FNVIISIEENCLSTGNKNIPLSSGMAV 448

Query: 367 IATVDTRE 374
A + T
Sbjct: 449 TAEIKTGM 456


16  HWH78_RS05775  HWH78_RS06005  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
HWH78_RS05775193.322263alkaline proteinase inhibitor AprI
HWH78_RS057803113.062174methyl-accepting chemotaxis protein
HWH78_RS057851112.794583bifunctional
HWH78_RS057900152.516685aldehyde dehydrogenase (NADP(+))
HWH78_RS057952172.846487dihydrodipicolinate synthase family protein
HWH78_RS058001172.842008proline racemase family protein
HWH78_RS058050152.948968amino acid ABC transporter ATP-binding protein
HWH78_RS058100153.760719amino acid ABC transporter permease
HWH78_RS05815-1143.724423amino acid ABC transporter permease
HWH78_RS058200134.005307aconitase family protein
HWH78_RS058250113.732127transporter substrate-binding domain-containing
HWH78_RS058304104.843842AraC family transcriptional regulator
HWH78_RS058353105.701204MFS transporter
HWH78_RS058403106.042885helix-turn-helix domain-containing protein
HWH78_RS058455157.135073LysR family transcriptional regulator
HWH78_RS058503126.861472EamA family transporter
HWH78_RS058552135.917223NAD(P)/FAD-dependent oxidoreductase
HWH78_RS058601105.814097(2Fe-2S)-binding protein
HWH78_RS05865-193.892203D-hydroxyproline dehydrogenase subunit beta
HWH78_RS05875-293.3244254-hydroxyproline epimerase
HWH78_RS05880-193.207000GntR family transcriptional regulator
HWH78_RS05885-1104.061582FUSC family protein
HWH78_RS05890-2103.819027TonB-dependent vitamin B12 receptor
HWH78_RS058951104.996889cob(I)yrinic acid a,c-diamide
HWH78_RS059001105.531431cobyrinate a,c-diamide synthase
HWH78_RS059052126.0449645,6-dimethylbenzimidazole synthase
HWH78_RS059102126.033541adenosylcobinamide-phosphate synthase CbiB
HWH78_RS059153135.850279threonine-phosphate decarboxylase CobD
HWH78_RS059202125.375514cobyric acid synthase
HWH78_RS059251134.976306bifunctional adenosylcobinamide
HWH78_RS05930-1103.494809nicotinate-nucleotide--dimethylbenzimidazole
HWH78_RS05935-292.528078alpha-ribazole phosphatase family protein
HWH78_RS05940-1102.693089adenosylcobinamide-GDP ribazoletransferase
HWH78_RS05945-2101.921006MFS transporter
HWH78_RS059500130.120830TetR family transcriptional regulator
HWH78_RS059551150.503299acyl-CoA dehydrogenase C-terminal
HWH78_RS059602130.445491MarR family winged helix-turn-helix
HWH78_RS059652100.388888MFS transporter
HWH78_RS05970111-0.146105glutathione peroxidase
HWH78_RS059750120.023527outer membrane protein transport protein
HWH78_RS05980-1111.195011hypothetical protein
HWH78_RS05985-1120.529431TetR family transcriptional regulator
HWH78_RS05990-2110.963023alpha/beta fold hydrolase
HWH78_RS05995091.128522sulfurtransferase
HWH78_RS060001100.820371SMP-30/gluconolactonase/LRE family protein
HWH78_RS060052110.798569ribonuclease D
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS05775  MPTASEINHBTR  129  5e-42  Metalloprotease inhibitor signature.
		>MPTASEINHBTR#Metalloprotease inhibitor signature.

Length = 122

Score = 129 bits (325), Expect = 5e-42
Identities = 40/118 (33%), Positives = 58/118 (49%), Gaps = 9/118 (7%)

Query: 12 CLLCGFFSTGI-SMASSLILLSASDLAGQWTLQQDEAPAICHLELRDSEVAEASGYDLGG 70
F S G +MASS ++ S + +AGQ ++ +C +E A A L G
Sbjct: 11 VWQVLFVSAGAQAMASSFVVPSTAQMAGQLGIEATG-SGVC---AGPAEQANA----LAG 62

Query: 71 DTACLTRWLPSEPRAWRPTPAGIALLERGGLTLMLLGRQGEGDYRVQKGDGGQLVLRR 128
D AC +WL +P +W PTP GI L+ G + L RQ EG+Y + G + L+R
Sbjct: 63 DVACAEQWLGDKPVSWSPTPDGIWLMNAEGTGITHLNRQKEGEYTGRTPSGADVTLQR 120


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS05835  TCRTETB  109  6e-28  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 109 bits (273), Expect = 6e-28
Identities = 80/418 (19%), Positives = 161/418 (38%), Gaps = 30/418 (7%)

Query: 15 LCILLAGQLLPMIDFSIVNVALDALAHSLGASETELELIVAVYGVAFAVCLAMGGRLGDN 74
LCIL +++ ++NV+L +A+ + + + F++ A+ G+L D
Sbjct: 19 LCIL---SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ 75

Query: 75 YGRRRLFDLGVALFAVASLLCGLAGS-VWLLLVARALQGVGAALVVPQIFATLHVSLSGH 133
G +RL G+ + S++ + S LL++AR +QG GAA + + +
Sbjct: 76 LGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKE 135

Query: 134 AHSRALAAYGAIGGLAFVVGQVLGGFLVSADIGGLGWRSVFLINLPICLGILLCSRRWVP 193
+A G+I + VG +GG + + W +L+ +P+ I + +
Sbjct: 136 NRGKAFGLIGSIVAMGEGVGPAIGGMIAHY----IHWS--YLLLIPMITIITVPFLMKLL 189

Query: 194 ETRAEHAARVDAPGTLLLAALILCLLLPLALGPSLHWS-WPCALLLAAAVPLLAWLWRTE 252
+ D G +L++ I+ +L + L+
Sbjct: 190 KKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLIF------------- 236

Query: 253 LRQERRQAWPLLPPSLLRLPSIRFGLLLAILFFACWSGFMFALALALQAGAGLSPVQAGN 312
++ R+ P + P L + G+L + F +GF+ + ++ LS + G+
Sbjct: 237 VKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGS 296

Query: 313 AFIALGA-SYFVSALLTARVAARIGPVRLLLLGCVIQMCGLLGLMLTLQRVWPQPGILNL 371
I G S + + + R GP+ +L +G L + +
Sbjct: 297 VIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFL-----LETTSWFM 351

Query: 372 APATLVIGFGQAFIVSSFFRIGLSEVPAAQAGAGSAMLATVQQASLGLGSALLGAVFA 429
+ + G +F + I S + +AGAG ++L S G G A++G + +
Sbjct: 352 TIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS05870  NUCEPIMERASE  29  0.021  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 29.4 bits (66), Expect = 0.021
Identities = 9/30 (30%), Positives = 15/30 (50%), Gaps = 1/30 (3%)

Query: 5 VIVVG-AGIVGSACAHELARRGLDVLVLDS 33
+V G AG +G + L G V+ +D+
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVGIDN 32


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS05945  TCRTETB  197  1e-59  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 197 bits (502), Expect = 1e-59
Identities = 91/398 (22%), Positives = 176/398 (44%), Gaps = 14/398 (3%)

Query: 18 FLIIIDMTVLYTALPRLTHDLGATAAEKLWIVNAYPLVVAGLLPGAGLLSDRLGHKRLFL 77
F +++ VL +LP + +D A W+ A+ L + G LSD+LG KRL L
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 78 AGLPLFGLASLCAAFAPSAAA-LIAARAGLAVGAALMMPATLSIVRHVFQDERERALAIG 136
G+ + S+ S + LI AR GAA PA + +V + + R A G
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAA-AFPALVMVVVARYIPKENRGKAFG 142

Query: 137 IWASVASAGAALGPVVGGVLLEFFWWGSVFLINVPVVVVALLLALPAIPACGGQSRRPWD 196
+ S+ + G +GP +GG++ + W +L+ +P++ + + L + + + +D
Sbjct: 143 LIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKEVRIKGHFD 200

Query: 197 ALGSLQMMFGLVGVVYAIKELSTRAPDFGLAVLAALGGMLCLYLFVRRQRRAREPMIDFA 256
G + M G+V + L T + +++ +L +FV+ R+ +P +D
Sbjct: 201 IKGIILMSVGIVFFM-----LFTTSYSISFLIVS----VLSFLIFVKHIRKVTDPFVDPG 251

Query: 257 LFRNRRFARGVAVALVATMALVGMELVFSQHLQLVQGLTPLKAG-LFVLPIPLASLVVGP 315
L +N F GV + + G + ++ V L+ + G + + P ++ ++ G
Sbjct: 252 LGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGY 311

Query: 316 LAGWLVPRWGENRVMCASLLLGSAGLLGLALSYQSATGAQLASLVLLGVGFGGAMTAAST 375
+ G LV R G V+ + S L + ++ + +V + G T ST
Sbjct: 312 IGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVIST 371

Query: 376 AVMLNVDEQSSGMAAAIEDVSYELGGVIGVTLLGSLMS 413
V ++ +Q +G ++ + + L G+ ++G L+S
Sbjct: 372 IVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS05950  HTHTETR  56  5e-12  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 55.8 bits (134), Expect = 5e-12
Identities = 24/150 (16%), Positives = 49/150 (32%), Gaps = 7/150 (4%)

Query: 1 MGRRRTIDRDQLLDAAEAVIAREGAAGLTIDAVAKEMGITKGGVQYCFGTKDALIDAIFE 60
+ R +LD A + +++G + ++ +AK G+T+G + + F K L I+E
Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64

Query: 61 RWGKAYDSLFEAVAGKQP-TPLTRVRAHAEATQRSDELSSSKAAALMAALIQAPEHLEGS 119
L K P PL+ +R S + + + E
Sbjct: 65 LSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE-- 122

Query: 120 NQWYRSRLEGLDLSAPEGRRARLAFLAVEG 149
+ ++ + R+
Sbjct: 123 ----MAVVQQAQRNLCLESYDRIEQTLKHC 148


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS05965  TCRTETB  45  4e-07  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 44.9 bits (106), Expect = 4e-07
Identities = 33/133 (24%), Positives = 62/133 (46%), Gaps = 3/133 (2%)

Query: 53 LVWGLAQPFTGALADRYGAARAVLVGGLLYALGLVLMGLSQSATGLSLSAGLLIGLGLSG 112
L + + G L+D+ G R +L G ++ G V+ + S L + A + G G +
Sbjct: 60 LTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAG-AA 118

Query: 113 TSFSVILGAVGRAVPAEQRSMAMGISSAAGSFGQFAMLPGTLGLIG-WLGWSSALLALGL 171
++++ V R +P E R A G+ + + G+ + P G+I ++ WS LL +
Sbjct: 119 AFPALVMVVVARYIPKENRGKAFGLIGSIVAMGE-GVGPAIGGMIAHYIHWSYLLLIPMI 177

Query: 172 LVALIVPLAGLMK 184
+ + L L+K
Sbjct: 178 TIITVPFLMKLLK 190


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS05985  HTHTETR  75  5e-19  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 75.0 bits (184), Expect = 5e-19
Identities = 37/202 (18%), Positives = 72/202 (35%), Gaps = 10/202 (4%)

Query: 5 SRQQENAEATREALLESALSAFIEHGYGGVSIDAIAREARVTKGAFYHHFGSKQELLAEC 64
+ ++ A+ TR+ +L+ AL F + G S+ IA+ A VT+GA Y HF K +L +E
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 65 YERQVRTIAEDLDRVPAHVDKWAEAAALA--EAFIDSVMARGKRQL----SLQEVITVVG 118
+E I E A + ++S + +R+L + V
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 119 WE---RWKRIDSRHTLRYVGRLVDELAASGELK-DYRRETLVGQLYGFLTQAAMSLRDAR 174
+ +R + + + + + L D + G+++ + A
Sbjct: 123 MAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAP 182

Query: 175 SKRQAANEVKAIIRDFLYSLRR 196
E + + L
Sbjct: 183 QSFDLKKEARDYVAILLEMYLL 204


17  HWH78_RS06275  HWH78_RS06315  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
HWH78_RS06275-1113.491820hypothetical protein
HWH78_RS06280-1112.662712YciI family protein
HWH78_RS06285-1113.014661thioredoxin family protein
HWH78_RS062901124.091134RNA polymerase sigma factor
HWH78_RS062950133.697229hypothetical protein
HWH78_RS063000152.473850VOC family protein
HWH78_RS063050132.777150YciI family protein
HWH78_RS06310-1123.617849MoaD/ThiS family protein
HWH78_RS063150103.349936exo-alpha-sialidase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS06275  PF05616  30  0.002  Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 29.7 bits (66), Expect = 0.002
Identities = 19/53 (35%), Positives = 25/53 (47%), Gaps = 3/53 (5%)

Query: 22 AAPLPDELPAALPAAPPVAPLPMGSAPQPQPEAAEPDLPGSDAASADGEATTK 74
A PLP+ PA PA P G+ P P+P +PDL DG+ T+
Sbjct: 325 AQPLPEVSPAENPANNPAPNENPGTRPNPEP---DPDLNPDANPDTDGQPGTR 374


18  HWH78_RS06360  HWH78_RS06580  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
HWH78_RS06360-117-3.712575TonB-dependent receptor
HWH78_RS06365037-9.155018Fic/DOC family protein
HWH78_RS06370-138-9.780185hypothetical protein
HWH78_RS06375028-6.157964PhzF family phenazine biosynthesis protein
HWH78_RS06380028-6.522628His-Xaa-Ser repeat protein HxsA2
HWH78_RS06385021-3.795358His-Xaa-Ser system protein HxsD
HWH78_RS06390-123-4.226015His-Xaa-Ser system radical SAM maturase HxsB
HWH78_RS06395125-3.084376His-Xaa-Ser system radical SAM maturase HxsC
HWH78_RS06400222-1.525832beta-ketoacyl-ACP synthase II
HWH78_RS06405351-9.575202helix-turn-helix transcriptional regulator
HWH78_RS06410364-13.4393424-phosphoerythronate dehydrogenase PdxB
HWH78_RS06415157-13.649566integrase arm-type DNA-binding domain-containing
HWH78_RS06420254-13.478047integrase domain-containing protein
HWH78_RS06425252-13.729455hypothetical protein
HWH78_RS06430244-11.550611ATP-binding protein
HWH78_RS06435023-5.597038Fic family protein
HWH78_RS06440-1150.688583bifunctional isocitrate dehydrogenase
HWH78_RS064451120.993559GNAT family N-acetyltransferase
HWH78_RS064500141.442411hypothetical protein
HWH78_RS064551131.866760oxidoreductase
HWH78_RS06460013-0.802912AraC family transcriptional regulator
HWH78_RS06465014-1.245781hypothetical protein
HWH78_RS06470020-3.098584hypothetical protein
HWH78_RS06475123-3.829390two-component system sensor histidine kinase
HWH78_RS06480433-5.617124response regulator transcription factor
HWH78_RS06485535-6.016395type VI secretion system tip protein VgrG
HWH78_RS06490432-4.757482DUF4123 domain-containing protein
HWH78_RS06495221-1.608253hypothetical protein
HWH78_RS06500016-0.257672hypothetical protein
HWH78_RS06505-2130.984951hypothetical protein
HWH78_RS065100113.356042LysR family transcriptional regulator
HWH78_RS06515-1103.337222carbamoyl phosphate synthase large subunit
HWH78_RS06520-1113.006658ester cyclase
HWH78_RS065252101.943770O-methyltransferase
HWH78_RS065303102.266042TetR family transcriptional regulator
HWH78_RS065400132.345886hypothetical protein
HWH78_RS065450121.958628UvrD-helicase domain-containing protein
HWH78_RS06550-1111.769574nucleotidyltransferase domain-containing
HWH78_RS065551112.364371class I SAM-dependent methyltransferase
HWH78_RS065601122.492319DUF3772 domain-containing protein
HWH78_RS06565-1111.901838acetylpolyamine amidohydrolase AphA
HWH78_RS06570-192.458885extracellular solute-binding protein
HWH78_RS065750103.465607DMT family transporter
HWH78_RS065801133.330238MFS transporter
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS06445  SACTRNSFRASE  38  9e-06  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 37.6 bits (87), Expect = 9e-06
Identities = 15/51 (29%), Positives = 23/51 (45%), Gaps = 1/51 (1%)

Query: 93 VAVAWQGKGVGSRLLGELLDIADNWMNLRRVELTVYTDNAPALALYRKFGF 143
VA ++ KGVG+ LL + ++ A + + L N A Y K F
Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKE-NHFCGLMLETQDINISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS06455  DHBDHDRGNASE  110  3e-31  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 110 bits (277), Expect = 3e-31
Identities = 61/185 (32%), Positives = 86/185 (46%), Gaps = 3/185 (1%)

Query: 5 KTLLITGASSGFGQALAREALDAGHRVVGTVRSEEARSALEAVAPGQAFGR---LLDVTD 61
K ITGA+ G G+A+AR G + + E + + +A DV D
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 62 LAAIEPTVAAIERDIGPLDVLVNSAGYGHEGILEESPLAEMRRQFEVNLFGAVAMIQAVL 121
AAI+ A IER++GP+D+LVN AG G++ E F VN G ++V
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 122 PYMRRRRRGHILNITSMGGYITMPGIAYYCGSKFALEGVSEALGKEVAGLGIAVTAVAPG 181
YM RR G I+ + S + +A Y SK A ++ LG E+A I V+PG
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 182 SFRTD 186
S TD
Sbjct: 189 STETD 193


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS06475  HTHFIS  50  2e-08  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 49.8 bits (119), Expect = 2e-08
Identities = 31/124 (25%), Positives = 51/124 (41%), Gaps = 9/124 (7%)

Query: 416 LTGLRVCLVEDDRNVLRATSALLERWGCTVQ-AETEADGWRTDC----DILVVDYDLGPH 470
+TG + + +DD + + L R G V+ A WR D++V D + P
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVM-PD 59

Query: 471 ASGVECIERVRRQRGEAIPALVISGH-DIERIQASVEDTDIALLSKPVRPTELRATL-RA 528
+ + + R+++ +P LV+S + E L KP TEL + RA
Sbjct: 60 ENAFDLLPRIKK-ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 529 LRER 532
L E
Sbjct: 119 LAEP 122


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS06480  HTHFIS  56  2e-11  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 56.0 bits (135), Expect = 2e-11
Identities = 34/157 (21%), Positives = 58/157 (36%), Gaps = 5/157 (3%)

Query: 3 GRIIVADDHPLFREGMLSILQRLLPEARIEEAGDLAGVLRLAGEGEQPDSLILDLRFPGL 62
I+VADD R + L R + + A + R G D ++ D+ P
Sbjct: 4 ATILVADDDAAIRTVLNQALSRA--GYDVRITSNAATLWRWIAAG-DGDLVVTDVVMPDE 60

Query: 63 TRIEMLADLRRQFPRTTLIVVSMVDDPQLIGEVMNAGADGFLGKSIAPEELGQAILAIRA 122
++L +++ P ++V+S + + GA +L K EL I RA
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG--RA 118

Query: 123 GEVLVRYEPSGLLPLQPSPRLEGLTERQLDVLRLLAQ 159
R Q L G + ++ R+LA+
Sbjct: 119 LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLAR 155


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS06485  ICENUCLEATIN  30  0.040  Ice nucleation protein signature.
		>ICENUCLEATIN#Ice nucleation protein signature.

Length = 1258

Score = 30.1 bits (67), Expect = 0.040
Identities = 27/92 (29%), Positives = 41/92 (44%), Gaps = 2/92 (2%)

Query: 523 RISRDSRSLVENDRFEQVNMNSSSLIKGDELHTTQGERHTRIGGNELLSISGAGSIAVDG 582
+I+ SL+ Q+ N S LI G T G R T I G + + ++G + G
Sbjct: 1080 QIASHRSSLIAGPESTQITGNRSMLIAGKGSSQTAGYRSTLISGADSVQMAGERGKLIAG 1139

Query: 583 TWVVQ-AGSQARVTA-TNVLVDAGVNLTLKAG 612
Q AG ++++ A N + AG L AG
Sbjct: 1140 ADSTQTAGDRSKLLAGNNSYLTAGDRSKLTAG 1171


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS06520  RTXTOXIND  33  0.008  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 32.9 bits (75), Expect = 0.008
Identities = 35/161 (21%), Positives = 62/161 (38%), Gaps = 24/161 (14%)

Query: 432 RNLLLHPAVQANRVDTRFVESHLETLLAPIPASHPRLRAECPLAEDAAPARVEAPLGSLP 491
R L P + + F+ +HLE + P+ PRL A + A + + LG +
Sbjct: 26 RKQLDTPVREK--DENEFLPAHLELIETPVSR-RPRLVAYFIMGF-LVIAFILSVLGQVE 81

Query: 492 LPAPSSGVLVALEVSVGERVRAGQRVAILEAMKMEFEVKAPGGGIVRRLAASLGEPLEEG 551
+ A ++G L +G+ E+K IV+ + GE + +G
Sbjct: 82 IVATANGKLTH----------SGRSK----------EIKPIENSIVKEIIVKEGESVRKG 121

Query: 552 ATLLFLEPTEDDDEQAPTEQALDLAHIRADLAEVLERQAAL 592
LL L + + T+ +L A + ++L R L
Sbjct: 122 DVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIEL 162


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS06535  HTHTETR  58  9e-13  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 58.5 bits (141), Expect = 9e-13
Identities = 24/149 (16%), Positives = 55/149 (36%)

Query: 14 QPQQARSSELVASILEAAVQVLASEGAQRFTTARVAERAGVSIGSLYQYFPNKAAILFRL 73
+ + + E IL+ A+++ + +G + +A+ AGV+ G++Y +F +K+ + +
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 74 QSDEWRRTTRLLGEILEDTTRPPLARLRRLVLAFVRSECEEAAIRVALSDAAPLYRDADE 133
L E PL+ LR +++ + S E R+ + E
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 134 AREVKAEGARVFQAFLREALPEVAEAERS 162
V+ + + +
Sbjct: 123 MAVVQQAQRNLCLESYDRIEQTLKHCIEA 151


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
HWH78_RS06560  RTXTOXIND  39     6e-05    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 39.0 bits (91), Expect = 6e-05
Identities = 25/201 (12%), Positives = 63/201 (31%), Gaps = 13/201 (6%)

Query: 3 RASLHMLLCQFAMALGLLLSLGSEAWAARPAPQAAVDLEAPAALAEDASLDQLNAQLDLI 62
A A + + + L ++ S +++ LI
Sbjct: 133 EADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYF-QNVSEEEVLRLTSLI 191

Query: 63 RQRVTADASDDLLAELRQSALQVQRQ-ADALLALRVADIERLDDQLKVIGPPQPDEAESL 121
+++ + + EL + +R A + +L + SL
Sbjct: 192 KEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLD--------DFSSL 243

Query: 122 AAQRQALTRQKNALLDDERQATQLGQSSRDLAAQIVNLRRSLFNSQISSRAATPFSPSFW 181
++ K+A+L+ E + + R +Q+ + + +++ + T +
Sbjct: 244 LHKQAI---AKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEI 300

Query: 182 STLIRPTDDDLRRLDKLKAEA 202
+R T D++ L A+
Sbjct: 301 LDKLRQTTDNIGLLTLELAKN 321


19  HWH78_RS06720  HWH78_RS06750  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS06720  3       17       -1.518967  flagellar basal body-associated protein FliL
HWH78_RS06725  5       19       -0.958023  flagellar motor switch protein FliM
HWH78_RS06730  6       20       -0.221881  flagellar motor switch protein FliN
HWH78_RS06735  4       19       0.520263   flagellar biosynthetic protein FliO
HWH78_RS06740  3       20       0.650304   flagellar type III secretion system pore protein
HWH78_RS06745  3       17       0.591997   flagellar biosynthesis protein FliQ
HWH78_RS06750  2       16       1.055966   flagellar type III secretion system protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS06725  FLGMOTORFLIM  259    2e-87    Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 259 bits (664), Expect = 2e-87
Identities = 98/326 (30%), Positives = 167/326 (51%), Gaps = 13/326 (3%)

Query: 5 DLLSQDEIDALLHGVDDGLVETEVEATPG-----SVKSYDLTSQDRIVRGRMPTLEMINE 59
++LSQDEID LL + G + +E + YD D+ + +M TL +++E
Sbjct: 3 EVLSQDEIDQLLTAISSG--DASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLMHE 60

Query: 60 RFARYTRISMFNLLRRSADVAVGGVQVMKFGEYVHSLYVPTSLNLVKMKPLRGTALFILD 119
FAR T S+ LR V V V + + E++ S+ P++L ++ M PL+G A+ +D
Sbjct: 61 TFARLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVLEVD 120

Query: 120 AKLVFKLVDNFFGGDGRHAKIEGREFTPTELRVVRMVLEQAFVDLKEAWQAVLEMNFEYV 179
+ F ++D FGG G+ AK++ R+ T E V+ V+ + +++E+W V+++
Sbjct: 121 PSITFSIIDRLFGGTGQAAKVQ-RDLTDIENSVMEGVIVRILANVRESWTQVIDLRPRLG 179

Query: 180 NSEVNPAMANIVSPSEVVVVSTFHIELDGGGGDLHITMPYSMIEPIREMLDAGF--QSDH 237
E NP A IV PSE+VV+ T ++ G ++ +PY IEPI L + F S
Sbjct: 180 QIETNPQFAQIVPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFSSVR 239

Query: 238 DDQDERWIKALREDVLDVQVPLGATVVRRQLKLRDILHMQPGDVIPVE---MPEHMVMRA 294
+++ LR+ + V + + A V +L +RDIL ++ GD+I + + + V+
Sbjct: 240 RSSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFVLSI 299

Query: 295 NGVPAFKVKLGAHKGNLALQILEAVE 320
F + G +A QILE +E
Sbjct: 300 GNRKKFLCQPGVVGKKIAAQILERIE 325


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS06730  FLGMOTORFLIN  120    8e-38    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 120 bits (301), Expect = 8e-38
Identities = 62/145 (42%), Positives = 90/145 (62%), Gaps = 24/145 (16%)

Query: 13 ALADEWAAALAE-AGDASQDDIDALMAQGGATPVAEPSTPRAPMEEFGASPKAPTISGLE 71
AL D WA AL E ++ DA+ Q G V+
Sbjct: 14 ALDDLWADALNEQKATTTKSAADAVFQQLGGGDVSGAMQ--------------------- 52

Query: 72 GPNLDVILDIPVTISMEVGHTDISIRNLLQLNQGSVIELDRLAGEPLDVLVNGTLIAHGE 131
++D+I+DIPV +++E+G T ++I+ LL+L QGSV+ LD LAGEPLD+L+NG LIA GE
Sbjct: 53 --DIDLIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGE 110

Query: 132 VVVVNEKFGIRLTDVISPSERIKKL 156
VVVV +K+G+R+TD+I+PSER+++L
Sbjct: 111 VVVVADKYGVRITDIITPSERMRRL 135


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS06740  FLGBIOSNFLIP  264    2e-91    Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature.

Length = 245

Score = 264 bits (676), Expect = 2e-91
Identities = 140/242 (57%), Positives = 176/242 (72%), Gaps = 3/242 (1%)

Query: 11 LAALCLLLLAPWPALAADPTSISAITVTTNGQGQQEYSVSLQILLIMTALSFIPAFVMLM 70
L+ +LL P A + IT G Q +S+ +Q L+ +T+L+FIPA +++M
Sbjct: 5 LSVAPVLLWLITPLAFA---QLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLMM 61

Query: 71 TSFTRIIIVFSILRQALGLQSTPSNQVLVGLALFLTMFVMAPVFDKINSQALQPYLNEQI 130
TSFTRIIIVF +LR ALG S P NQVL+GLALFLT F+M+PV DKI A QP+ E+I
Sbjct: 62 TSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEKI 121

Query: 131 PAQEALQKAEVPLKAFMLAQTRTSDLELFVRLSKRTDIGSPEATPLTILVPAFVTSELKT 190
QEAL+K PL+ FML QTR +DL LF RL+ + PEA P+ IL+PA+VTSELKT
Sbjct: 122 SMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELKT 181

Query: 191 AFQIGFMIFIPFLIIDLVVSSVLMAMGMMMLSPLIISLPFKIMLFVLVDGWALIIGTLAG 250
AFQIGF IFIPFLIIDLV++SVLMA+GMMM+ P I+LPFK+MLFVLVDGW L++G+LA
Sbjct: 182 AFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLAQ 241

Query: 251 SF 252
SF
Sbjct: 242 SF 243


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS06745  TYPE3IMQPROT  55     9e-14    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 54.8 bits (132), Expect = 9e-14
Identities = 24/75 (32%), Positives = 43/75 (57%)

Query: 7 LDLFREALWLTAMIVGVLVVPSLLVGLVVAMFQAATQINEQTLSFLPRLMVILLTLIVLG 66
+ +AL+L ++ G + + ++GL+V +FQ TQ+ EQTL F +L+ + L L +L
Sbjct: 5 VFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFLLS 64

Query: 67 PWLLRQLMEYTQTLI 81
W L+ Y + +I
Sbjct: 65 GWYGEVLLSYGRQVI 79


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS06750  TYPE3IMRPROT  135    7e-41    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.

Length = 261

Score = 135 bits (341), Expect = 7e-41
Identities = 96/232 (41%), Positives = 143/232 (61%), Gaps = 2/232 (0%)

Query: 1 MLELTNAQIGGWIASFVLPLFRVAALLMTMPVIGTQLVPVRVRLYLALGVCVVLVPNLPP 60
ML++T+ Q W+ + PL RV AL+ T P++ + VP RV+L LA+ + + P+LP
Sbjct: 1 MLQVTSEQWLSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA 60

Query: 61 MPQVDALSMKAMLLIGEQILVGALLGFSLQLLFHAFVIAGQIISMQMGLGFASMVDPANG 120
S A+ L +QIL+G LGF++Q F A AG+II +QMGL FA+ VDPA+
Sbjct: 61 NDVPV-FSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASH 119

Query: 121 VSVPVLGQFFTMLVTLLFLAMNGHLVVFEVIAESFVTLPVGEGLSGNHFWI-IAGKLGWV 179
+++PVL + ML LLFL NGHL + ++ ++F TLP+G ++ ++ + +
Sbjct: 120 LNMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLI 179

Query: 180 MGAALLLALPAITALLVVNLAFGAMTRAAPQLNIFSIGFPLTLVLGLVILWI 231
L+LALP IT LL +NLA G + R APQL+IF IGFPLTL +G+ ++
Sbjct: 180 FLNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAA 231


20  HWH78_RS06835  HWH78_RS06885  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS06835  -1      14       3.127577   DUF2802 domain-containing protein
HWH78_RS06840  -2      14       2.605792   glutathione S-transferase
HWH78_RS06845  -2      13       2.325406   YafY family transcriptional regulator
HWH78_RS06850  0       14       1.713464   hypothetical protein
HWH78_RS06855  2       10       2.039316   laccase domain-containing protein
HWH78_RS06860  4       11       2.241802   SDR family oxidoreductase
HWH78_RS06865  8       11       1.594899   hypothetical protein
HWH78_RS06870  6       9        1.005899   GNAT family N-acetyltransferase
HWH78_RS06875  6       10       1.196479   EscU/YscU/HrcU family type III secretion system
HWH78_RS06880  6       10       0.849248   flagellar hook-length control protein FliK
HWH78_RS06885  2       11       0.927161   cytochrome c biogenesis heme-transporting ATPase
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS06860  DHBDHDRGNASE  83     4e-21    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 82.8 bits (204), Expect = 4e-21
Identities = 70/247 (28%), Positives = 111/247 (44%), Gaps = 12/247 (4%)

Query: 8 AIVTGASRGIGRAIARRLAADGFAVAVNYAGNQTMADEVVAEIVAAGGTAIAVQGDVASP 67
A +TGA++GIG A+AR LA+ G +A N ++VV+ + A A A DV
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAA-VDYNPEKLEKVVSSLKAEARHAEAFPADVRDS 69

Query: 68 EDMDKLFEATRGAFGRIDVVVNSAGTMPYLKIADGDLEGFDRVIRTNLRGAFIVLGLAAR 127
+D++ G ID++VN AG + I E ++ N G F ++
Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSK 129

Query: 128 HV--ERGGRIIALSTSVIARALPSYGPYIASKSGVEGLVHVLANELRGQDIRVNAVAPGP 185
++ R G I+ + ++ S Y +SK+ L EL +IR N V+PG
Sbjct: 130 YMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGS 189

Query: 186 VATE----LFFNGKSAEQI-----DQIARLAPLERLGEPDEIAAAVSFLAGPDGAWVNSQ 236
T+ L+ + AEQ+ + PL++L +P +IA AV FL +
Sbjct: 190 TETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHITMH 249

Query: 237 VLRVNGG 243
L V+GG
Sbjct: 250 NLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS06865  cloacin  31     8e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 30.8 bits (69), Expect = 8e-04
Identities = 18/46 (39%), Positives = 24/46 (52%)

Query: 30 GSAYAKGGNGGGNGGGHGGGHGGGKGGSHGGNLGGHSSKGHGSATS 75
S G G G+G GGG G G GG +G + GG + G+ SA +
Sbjct: 40 SSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVA 85



Score = 29.3 bits (65), Expect = 0.003
Identities = 13/43 (30%), Positives = 16/43 (37%)

Query: 25 ELSPVGSAYAKGGNGGGNGGGHGGGHGGGKGGSHGGNLGGHSS 67
E +P G G + GG G GG G GG G +
Sbjct: 42 ENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAV 84


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS06875  TYPE3IMSPROT  61     2e-14    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 61.3 bits (149), Expect = 2e-14
Identities = 17/73 (23%), Positives = 28/73 (38%), Gaps = 3/73 (4%)

Query: 12 AIALSYDGQ--AAPTLSAKGDAELAEAILAIARDYEVPIYENAELVR-LLARLELGDAIP 68
AI + Y P ++ K + + IA + VPI + L R L + IP
Sbjct: 268 AIGILYKRGETPLPLVTFKYTDAQVQTVRKIAEEEGVPILQRIPLARALYWDALVDHYIP 327

Query: 69 EALYRTIAEIIAF 81
AE++ +
Sbjct: 328 AEQIEATAEVLRW 340


21  HWH78_RS07005  HWH78_RS07065  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS07005  2       16       1.139022   pyruvate kinase
HWH78_RS07010  0       17       0.553421   glycerate kinase
HWH78_RS07015  0       16       0.129049   2-hydroxy-3-oxopropionate reductase
HWH78_RS07020  -1      13       0.225904   hydroxypyruvate isomerase
HWH78_RS07025  2       13       0.524606   glyoxylate carboligase
HWH78_RS07030  3       9        0.355829   heme-binding protein
HWH78_RS07035  0       14       -3.319514  TetR/AcrR family transcriptional regulator
HWH78_RS07040  2       27       -5.714678  GTP 3',8-cyclase MoaA
HWH78_RS07045  3       28       -5.719462  low molecular weight protein tyrosine
HWH78_RS07050  0       25       -5.311728  purine permease
HWH78_RS07055  1       30       -6.307143  PAAR domain-containing protein
HWH78_RS07060  0       23       -5.426806  hypothetical protein
HWH78_RS07065  1       15       -3.950069  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS07035  HTHTETR  71     2e-17    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 71.2 bits (174), Expect = 2e-17
Identities = 24/166 (14%), Positives = 53/166 (31%), Gaps = 16/166 (9%)

Query: 5 RERNKRLILRAASEEFADKGFAATKTSDIAARAGLPKPNVYYYFQSKENLYRCVLESIVE 64
+ ++ IL A F+ +G ++T +IA AG+ + +Y++F+ K +L+ + E
Sbjct: 9 AQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSES 68

Query: 65 PLLQASA--PFRVEDDPLLALPAYIRSKIRISRELPH----ASKVFASEIMHGAPHLPKE 118
+ + + DPL L + + + +F G
Sbjct: 69 NIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE----MA 124

Query: 119 YLDELNAQAQRNVTCLQTW-----IDRGQL-APVDPHHLLFAIWAA 158
+ + I+ L A + +
Sbjct: 125 VVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGY 170


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
HWH78_RS07055  OMADHESIN  25     0.048    Yersinia outer membrane adhesin signature.
		>OMADHESIN#Yersinia outer membrane adhesin signature.

Length = 455

Score = 24.9 bits (53), Expect = 0.048
Identities = 20/72 (27%), Positives = 26/72 (36%), Gaps = 9/72 (12%)

Query: 22 QTDLNGKPMAGVGHQVVCP---------LCKGTFPITEGSALLDVNGVPVALHGMKTACG 72
Q N P G+ + V P KG I G+ G VA+ A G
Sbjct: 38 QISPNADPALGLEYPVRPPVPGAGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATG 97

Query: 73 ASLIASGPLGAA 84
+ +A GPL A
Sbjct: 98 VNSVAIGPLSKA 109


22  HWH78_RS07280  HWH78_RS07305  Y  N  N  Genomic Island
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS07280  2       17       -3.498810  FixH family protein
HWH78_RS07285  2       18       -3.490127  cytochrome c oxidase accessory protein CcoG
HWH78_RS07290  3       19       -3.809870  cytochrome-c oxidase, cbb3-type subunit III
HWH78_RS07295  3       18       -4.321620  CcoQ/FixQ family Cbb3-type cytochrome c oxidase
HWH78_RS07300  5       19       -3.618725  cytochrome-c oxidase, cbb3-type subunit II
HWH78_RS07305  4       16       -2.356844  cytochrome-c oxidase, cbb3-type subunit I
23  HWH78_RS07435  HWH78_RS07465  Y  N  N  Genomic Island
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS07435  1       17       -3.819061  YkgJ family cysteine cluster protein
HWH78_RS07440  1       17       -4.262671  START domain-containing protein
HWH78_RS07445  2       19       -3.917219  citrate (Si)-synthase
HWH78_RS07450  2       22       -3.111957  succinate dehydrogenase, cytochrome b556
HWH78_RS07455  2       24       -2.686621  succinate dehydrogenase, hydrophobic membrane
HWH78_RS07460  2       25       -2.650386  succinate dehydrogenase flavoprotein subunit
HWH78_RS07465  2       25       -2.823679  succinate dehydrogenase iron-sulfur subunit
24  HWH78_RS08005  HWH78_RS08100  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS08005  2       12       -0.706131  SctU family type III secretion system export
HWH78_RS08010  3       18       0.838524   SctT family type III secretion system export
HWH78_RS08015  3       18       1.900610   SctS family type III secretion system export
HWH78_RS08020  0       15       2.378813   SctR family type III secretion system export
HWH78_RS08025  0       15       2.689077   SctQ family type III secretion system
HWH78_RS08030  1       15       3.113118   type III secretion system needle length
HWH78_RS08040  2       11       2.671654   type III secretion system central stalk protein
HWH78_RS08045  3       11       2.726795   SctN family type III secretion system ATPase
HWH78_RS08050  5       15       1.114692   SctW family type III secretion system gatekeeper
HWH78_RS08055  5       15       0.983589   Acr1 family type III secretion system gatekeeper
HWH78_RS08060  5       16       1.489076   type III secretion chaperone SycN
HWH78_RS08065  4       15       0.457971   type III secretion system protein PscX
HWH78_RS08070  3       14       -0.146790  type III secretion system chaperone PscY
HWH78_RS08075  3       13       -0.624723  SctV family type III secretion system export
HWH78_RS08080  2       14       -0.206801  LcrR family type III secretion system chaperone
HWH78_RS08085  2       16       -1.114019  LcrG family type III secretion system chaperone
HWH78_RS08090  2       18       -1.103387  type III secretion protein PcrV
HWH78_RS08095  1       21       -0.615751  SycD/LcrH family type III secretion system
HWH78_RS08100  2       23       -2.292920  type III secretion system translocon subunit
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS08010  TYPE3IMSPROT  422    e-150    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 422 bits (1087), Expect = e-150
Identities = 232/349 (66%), Positives = 294/349 (84%)

Query: 1 MSAEKTEQPTAKKLRDARRQGQVVKSKEIVSSALILSLVALLMGFSDYYLEHLGKLLLLP 60
MS EKTEQPT KK+RDAR++GQV KSKE+VS+ALI++L A+LMG SDYY EH KL+L+P
Sbjct: 1 MSGEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIP 60

Query: 61 AEYIDLPFRQALETILENLLQELLYLLAPVLLVAALVVVLSHVGQYGFLLSLDSVKPDLK 120
AE LPF QAL +++N+L E YL P+L VAAL+ + SHV QYGFL+S +++KPD+K
Sbjct: 61 AEQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIK 120

Query: 121 KINPVEGAKKIFSIRSLVEFLKSTLKVALLSLLVWLTLQGNLASLLRIPACGLDCVAPVS 180
KINP+EGAK+IFSI+SLVEFLKS LKV LLS+L+W+ ++GNL +LL++P CG++C+ P+
Sbjct: 121 KINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLL 180

Query: 181 GLMLRQLMLVCAVGFLAIAVADYAFERHQHYKQLRMSKDEVKREYKEMEGSPEIKSKRRQ 240
G +LRQLM++C VGF+ I++ADYAFE +Q+ K+L+MSKDE+KREYKEMEGSPEIKSKRRQ
Sbjct: 181 GQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQ 240

Query: 241 FHQELQSSNLRADVRRSSVIVANPTHVAIGIRYRRGETPLPLVTLKHTDALALRVRRIAE 300
FHQE+QS N+R +V+RSSV+VANPTH+AIGI Y+RGETPLPLVT K+TDA VR+IAE
Sbjct: 241 FHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAE 300

Query: 301 EEGIPVLQRIPLARALLRDGNVDQYIPADLIQATAEVLRWLESQQTDTP 349
EEG+P+LQRIPLARAL D VD YIPA+ I+ATAEVLRWLE Q +
Sbjct: 301 EEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQNIEKQ 349


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS08015  TYPE3IMRPROT  142    1e-43    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.

Length = 261

Score = 142 bits (360), Expect = 1e-43
Identities = 47/245 (19%), Positives = 100/245 (40%), Gaps = 4/245 (1%)

Query: 9 LLLTYSLLLPRIISCFVVLPVLAKQTLGGGLVRNGVACSLALFAYPIVAGSLPPALDALD 68
L Y L R+++ P+L+++++ V+ G+A + P + + P
Sbjct: 12 WLNLYFWPLLRVLALISTAPILSERSVPK-RVKLGLAMMITFAIAPSLPANDVPVFSFFA 70

Query: 69 IALLIGKEVLLGLLIGFVATIPFWAMEATGFIIDNQRGAALASTFNPSLGSQTSPTGLLL 128
+ L + +++L+G+ +GF F A+ G II Q G + A+ +P+ ++
Sbjct: 71 LWLAV-QQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLARIM 129

Query: 129 TQTLITLFFSGGAFLALVGSLFRSYASWPVSSFFPQLGSQWVAFFYAQFSQMLMLCALFA 188
+ LF + L L+ L ++ + P+ S S + + + A
Sbjct: 130 DMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPL--NSNAFLALTKAGSLIFLNGLMLA 187

Query: 189 APLLIAMFLAEFGLALVSRFAPSLNVFILAMPIKSLVASLLLVLYLGILMEHAYDALLLA 248
PL+ + L L++R AP L++F++ P+ V L+ + ++
Sbjct: 188 LPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFCEHLFSEI 247

Query: 249 VDPLR 253
+ L
Sbjct: 248 FNLLA 252


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS08020  TYPE3IMQPROT  68     4e-19    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 68.3 bits (167), Expect = 4e-19
Identities = 35/78 (44%), Positives = 48/78 (61%)

Query: 5 DILHFTNQTLWLVLVLSLPPVLVAALIGTLVSLVQALTQIQEQTLGFVAKLVAVVVVLFA 64
D++ N+ L+LVL+LS P +VA +IG LV L Q +TQ+QEQTL F KL+ V + LF
Sbjct: 3 DLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFL 62

Query: 65 TSGWLGGELYRFAEMTLL 82
SGW G L + +
Sbjct: 63 LSGWYGEVLLSYGRQVIF 80


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS08025  TYPE3IMPPROT  246    3e-85    Type III secretion system inner membrane P protein ...
		>TYPE3IMPPROT#Type III secretion system inner membrane P protein family signature.

Length = 224

Score = 246 bits (629), Expect = 3e-85
Identities = 92/217 (42%), Positives = 142/217 (65%), Gaps = 7/217 (3%)

Query: 6 DELGLILGLALLALVPFIAVMATSFIKMTVVFSLLRNALGVQQIPPNMAMYGLAIILSLY 65
+++ LI LA L+PFI T F+K ++VF ++RNALG+QQIP NM + G+A++LS++
Sbjct: 3 NDISLIALLAFSTLLPFIIASGTCFVKFSIVFVMVRNALGLQQIPSNMTLNGVALLLSMF 62

Query: 66 VMAPVGFATRDYLRNHDVSLSDSASVERFLDEGMAPYRNFLKRQIQEREHTFFMESTRQV 125
VM P+ Y + DV+ +D +S+ + +DEG+ YR++L + FF + +
Sbjct: 63 VMWPIMHDAYVYFEDEDVTFNDISSLSKHVDEGLDGYRDYLIKYSDRELVQFFENAQLKR 122

Query: 126 WPSEYAERLDPD-------SLLILLPAFTVSELTRAFEIGFLIYLPFIAIDLIISNILLA 178
E E + D S+ LLPA+ +SE+ AF+IGF +YLPF+ +DL++S++LLA
Sbjct: 123 QYGEETETVKRDKDEIEKPSIFALLPAYALSEIKSAFKIGFYLYLPFVVVDLVVSSVLLA 182

Query: 179 MGMMMVSPMTISLPFKLLLFVLLDGWARLTHGLVISY 215
+GMMM+SP+TIS P KL+LFV LDGW L+ GL++ Y
Sbjct: 183 LGMMMMSPVTISTPIKLVLFVALDGWTLLSKGLILQY 219


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS08030  TYPE3OMOPROT  84     1e-20    Type III secretion system outer membrane O protein ...
		>TYPE3OMOPROT#Type III secretion system outer membrane O protein family signature.

Length = 303

Score = 83.5 bits (206), Expect = 1e-20
Identities = 46/177 (25%), Positives = 73/177 (41%), Gaps = 14/177 (7%)

Query: 130 RLALWLDGDPATLLARLPPRPSAQRLAIPLRLSLQWPGLPLDASELRTLEPGDLLLLPAG 189
R LW + P L A RP R + + L L + GD+LL+
Sbjct: 126 RGGLWFEHLPE-LPAVGGGRPKMLRWPLRFVIGSSDTQRSL----LGRIGIGDVLLIRTS 180

Query: 190 HRPDAALLGVLEGRPWARCQLHSTQL-ELLDMH----DTPSLADGEDLHELDQLPIPVSF 244
A + + ++ + E LD+ + + E L L+QLP+ + F
Sbjct: 181 R----AEVYCYAKKLGHFNRVEGGIIVETLDIQHIEEENNTTETAETLPGLNQLPVKLEF 236

Query: 245 EVGRRTLDLHTLSTLQPGSLLDLDSALDGEVRILANQRCLGIGELVRLQDRLGVRVT 301
+ R+ + L L + LL L + + V I+AN LG GELV++ D LGV +
Sbjct: 237 VLYRKNVTLAELEAMGQQQLLSLPTNAELNVEIMANGVLLGNGELVQMNDTLGVEIH 293


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
HWH78_RS08035  IGASERPTASE  39     3e-05    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 38.9 bits (90), Expect = 3e-05
Identities = 25/133 (18%), Positives = 39/133 (29%), Gaps = 18/133 (13%)

Query: 19 APLPPLRAQQIAFEQALPAHRPPAPRPPFDKGDETTEAAATADAPTSTPLADQPAAPAAD 78
+ + P + Q + R P T + T+T + A
Sbjct: 1127 SQVSPKQEQSETVQPQAEPARENDP----------TVNIKEPQSQTNTTADTEQPAKE-- 1174

Query: 79 RPPTTRQAPMPVAADATPTPTPTPTPTPTPTPTPTPTPTV-SPSGSVARQAPAVTARVAA 137
T+ PV T + P T T PTV S S + + + R
Sbjct: 1175 ---TSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVP 1231

Query: 138 STQGREPASVSAP 150
EPA+ S+
Sbjct: 1232 HNV--EPATTSSN 1242


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS08050  PF07201  283    6e-98    Hypersensitivity response secretion protein HrpJ
		>PF07201#Hypersensitivity response secretion protein HrpJ

Length = 293

Score = 283 bits (726), Expect = 6e-98
Identities = 134/294 (45%), Positives = 181/294 (61%), Gaps = 7/294 (2%)

Query: 1 MDILQSSSAAPLA-----PREAANAPAQQAGGSFQGERVHYVSVS-QSLADAAEELTFAF 54
M L + S P A++ Q G F+GE V VS + QS+AD AEE+TF F
Sbjct: 1 MTTLHNLSYGNTPLHNERPEIASSQIVNQTLGQFRGESVQIVSGTLQSIADMAEEVTFVF 60

Query: 55 SERAEKSLAKRRLSDAHARLSEVQTMLQEYWKRIPDLESQQKLEALIAHLGSGQLSSLAQ 114
SER E SL KR+LSD+ AR+S+V+ + +Y ++P+LE +Q + L++ L + SL+Q
Sbjct: 61 SERKELSLDKRKLSDSQARVSDVEEQVNQYLSKVPELEQKQNVSELLSLLSNSPNISLSQ 120

Query: 115 LSAYLEGFSSEISQRFLALSRARDVLAGRPEARAMLALVDQALLRMADEQGLEIELGLRI 174
L AYLEG S E S++F L RD L GRPE + LV+QAL+ MA+EQG I LG RI
Sbjct: 121 LKAYLEGKSEEPSEQFKMLCGLRDALKGRPELAHLSHLVEQALVSMAEEQGETIVLGARI 180

Query: 175 EPLAAEASAAGVGDIQALRDTYRDAVLDYRGLSAAWQDIQARFAATPLERVVAFLQKALS 234
P A S +GV +Q LRDTYRDAV+ Y+G+ A W D+Q RF ++ V+ FLQKALS
Sbjct: 181 TPEAYRESQSGVNPLQPLRDTYRDAVMGYQGIYAIWSDLQKRFPNGDIDSVILFLQKALS 240

Query: 235 ADLDSQSSRLDPVKLERVMSDMHKLRVLGGLAEQVGALWQVLVTGERGHGIRAF 288
ADL SQ S KL V+SD+ KL+ G +++QV WQ G + +G+R F
Sbjct: 241 ADLQSQQSGSGREKLGIVISDLQKLKEFGSVSDQVKGFWQFFSEG-KTNGVRPF 293


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
HWH78_RS08090  LCRVANTIGEN  344    e-121    Low calcium response V antigen signature.
		>LCRVANTIGEN#Low calcium response V antigen signature.

Length = 326

Score = 344 bits (884), Expect = e-121
Identities = 115/296 (38%), Positives = 171/296 (57%), Gaps = 32/296 (10%)

Query: 25 ASAEQEELLALLRSERIVLAHAGQPLSEAQVL-------------KALAWLLAANPSAPP 71
S+ EEL+ L++ + I ++ P +++V K LA+ L +
Sbjct: 28 GSSVLEELVQLVKDKNIDISIKYDPRKDSEVFANRVITDDIELLKKILAYFLPEDAILKG 87

Query: 72 GQ-------GLEVLREVLQARRQPGAQWDLREFLVSAYFSLHG-RLDEDVIGVYKDVLQT 123
G G++ ++E L++ P QW+LR F+ +FSL R+D+D++ V D +
Sbjct: 88 GHYDNQLQNGIKRVKEFLES--SPNTQWELRAFMAVMHFSLTADRIDDDILKVIVDSMNH 145

Query: 124 QDGKRKALLDELKALTAELKVYSVIQSQINAALSAKQGIRIDAGGIDLVDPTLYGYAVGD 183
R L +EL LTAELK+YSVIQ++IN LS+ I I I+L+D LYGY +
Sbjct: 146 HGDARSKLREELAELTAELKIYSVIQAEINKHLSSSGTINIHDKSINLMDKNLYGYT-DE 204

Query: 184 PRWKDSPEYALLSNLDTFSGKL--------SIKDFLSGSPKQSGELKGLSDEYPFEKDNN 235
+K S EY +L + + ++ SIKDFL K++G L L + Y + KDNN
Sbjct: 205 EIFKASAEYKILEKMPQTTIQVDGSEKKIVSIKDFLGSENKRTGALGNLKNSYSYNKDNN 264

Query: 236 PVGNFATTVSDRSRPLNDKVNEKTTLLNDTSSRYNSAVEALNRFIQKYDSVLRDIL 291
+ +FATT SD+SRPLND V++KTT L+D +SR+NSA+EALNRFIQKYDSV++ +L
Sbjct: 265 ELSHFATTCSDKSRPLNDLVSQKTTQLSDITSRFNSAIEALNRFIQKYDSVMQRLL 320


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS08095  SYCDCHAPRONE  202    2e-69    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 202 bits (514), Expect = 2e-69
Identities = 95/166 (57%), Positives = 126/166 (75%)

Query: 3 QQATPSDTDQQQALEAFLRDGGTLAMLRGLSEDTLEQLYALGFNQYQAGKWDDAQKIFQA 62
QQ T + Q A+E+FL+ GGT+AML +S DTLEQLY+L FNQYQ+GK++DA K+FQA
Sbjct: 2 QQETTDTQEYQLAMESFLKGGGTIAMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQA 61

Query: 63 LCMLDHYDARYFLGLGACRQSLGLYEQALQSYSYGALMDINEPRFPFHAAECHLQLGDLD 122
LC+LDHYD+R+FLGLGACRQ++G Y+ A+ SYSYGA+MDI EPRFPFHAAEC LQ G+L
Sbjct: 62 LCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELA 121

Query: 123 GAESGFYSARALAAAQPAHEALAARAGAMLEAVTARKDRTYESDNA 168
AESG + A+ L A + + L+ R +MLEA+ +K+ +E +
Sbjct: 122 EAESGLFLAQELIADKTEFKELSTRVSSMLEAIKLKKEMEHECVDN 167


25  HWH78_RS08235  HWH78_RS08305  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS08235  2       9        0.658042   AEC family transporter
HWH78_RS08240  3       7        0.619855   acetyl-CoA C-acetyltransferase
HWH78_RS08245  2       9        0.359807   enoyl-CoA hydratase/isomerase family protein
HWH78_RS08250  2       12       0.932000   LysR family transcriptional regulator
HWH78_RS08255  0       12       0.948897   aldo/keto reductase
HWH78_RS08260  -1      11       0.871932   hypothetical protein
HWH78_RS08265  -2      13       -0.525446  glutathione S-transferase N-terminal
HWH78_RS08270  -1      13       -0.522613  amidotransferase
HWH78_RS08275  0       15       -0.701161  hypothetical protein
HWH78_RS08280  0       14       -1.024470  hypothetical protein
HWH78_RS08285  1       12       -2.147246  hypothetical protein
HWH78_RS08290  3       12       -2.318905  macro domain-containing protein
HWH78_RS08295  2       11       -2.006111  hypothetical protein
HWH78_RS08300  2       12       -1.950627  crotonase/enoyl-CoA hydratase family protein
HWH78_RS08305  2       12       -2.154551  N-acetyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS08270  PF05704  29     0.012    Capsular polysaccharide synthesis protein
		>PF05704#Capsular polysaccharide synthesis protein

Length = 307

Score = 29.5 bits (66), Expect = 0.012
Identities = 14/95 (14%), Positives = 34/95 (35%), Gaps = 20/95 (21%)

Query: 10 TDILRPELIERYEGY---------GRMFQQLFAKQPIAAEFVIYNVVEGRYPADDERFDA 60
+DILR L+ +Y G ++ + F+ + ++
Sbjct: 135 SDILRLFLLCKYGGLWIDATVYMFDKVPNYIVESN----RFMFQSS---FLESETTHISN 187

Query: 61 YLVTGSKADSFGPDPWIQTLKTFLLDRYERGDKLL 95
+L+ + DP++ LK ++ ++ +K
Sbjct: 188 WLIFVKSKN----DPFLVGLKNSMVTYLKKKEKPA 218


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS08290  CHLAMIDIAOM6  29     0.015    Chlamydia cysteine-rich outer membrane protein 6 si...
		>CHLAMIDIAOM6#Chlamydia cysteine-rich outer membrane protein 6 signature.

Length = 547

Score = 28.5 bits (63), Expect = 0.015
Identities = 16/40 (40%), Positives = 21/40 (52%), Gaps = 9/40 (22%)

Query: 122 PCVATGVGGLDWSEV-KP----LVVRHLGDLEIPVILYEV 156
PCV + G DWS V KP + V + GDL +L +V
Sbjct: 315 PCVQVSIAGADWSYVCKPVEYVISVSNPGDL----VLRDV 350


26  HWH78_RS08370  HWH78_RS08425  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS08370  0       10       -3.148581  C39 family peptidase
HWH78_RS08375  0       10       -3.162495  transporter
HWH78_RS08380  -1      11       -3.650913  outer membrane protein transport protein
HWH78_RS08385  1       13       -3.392021  DUF4124 domain-containing protein
HWH78_RS08390  1       11       -3.295234  alpha-L-glutamate ligase-like protein
HWH78_RS08395  1       13       -3.283906  inactive transglutaminase family protein
HWH78_RS08400  2       12       -2.694500  ATP-dependent zinc protease
HWH78_RS08405  2       12       -2.469836  kinase/pyrophosphorylase
HWH78_RS08410  0       12       -1.745281  phosphoenolpyruvate synthase
HWH78_RS08415  -1      21       -2.509647  alpha/beta fold hydrolase
HWH78_RS08420  2       19       -4.550305  ribonuclease E activity regulator RraA
HWH78_RS08425  1       21       -3.759198  zinc transporter ZntB
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS08380  PF03895  28     0.022    Serum resistance protein DsrA.
		>PF03895#Serum resistance protein DsrA.

Length = 79

Score = 27.9 bits (62), Expect = 0.022
Identities = 16/58 (27%), Positives = 25/58 (43%), Gaps = 13/58 (22%)

Query: 413 GYRDTWSWAMGVQYDVNDRLQLRAGYEYRPSAIPKGQADILVPIGDANLYGLGLGYQW 470
GYRD + A+GV + DR +AG + G + YG +GY++
Sbjct: 35 GYRDKTALAIGVGSRITDRFTAKAGVAFNTYN------------GGMS-YGASVGYEF 79


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS08410  PHPHTRNFRASE  317    e-101    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 317 bits (815), Expect = e-101
Identities = 113/446 (25%), Positives = 191/446 (42%), Gaps = 68/446 (15%)

Query: 360 RAIGQRI-GAGPVKVINDVSEMDKVQPGDVLVSDMTDPDWEPVMK-RASAIVTNRGGRTC 417
R + +R+ G ++ + + ++ D+T D + K T+ GGRT
Sbjct: 132 RDVSKRVLGHLIGVETGSLATIA--EETVIIAEDLTPSDTAQLNKQFVKGFATDIGGRTS 189

Query: 418 HAAIIARELGIPAVVGCGNATQILQDGQGVTVSCAEG---------DTGFIFEGELGFDV 468
H+AI++R L IPAVVG T+ +Q G V V EG + E F+
Sbjct: 190 HSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVNPTEEEVKAYEEKRAAFEK 249

Query: 469 RKNSVDAMPDLP--------FKIMMNVGNPDRAFDFAQLPNEGVGLARLEFIINRMIGVH 520
+K + P ++ N+G P EG+GL R EF+
Sbjct: 250 QKQEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTEFLY------- 302

Query: 521 PKALLNFAGLPADIKESVEKRIAGYPDPVGFYVEKLVEGISTLAAAFWPKKVIVRLSDFK 580
++ LP + E++ Y + + K V++R D
Sbjct: 303 ----MDRDQLP-----TEEEQFEAYKE---------------VVQRMDGKPVVIRTLDIG 338

Query: 581 SNEYANLIGGKLYEPEEENPMLGFRGASRYISESFRDCFELECRALKKVRNEMGLTNVEI 640
++ + L P+E NP LGFR + + +D F + RAL + N+++
Sbjct: 339 GDKELSY----LQLPKELNPFLGFRAIRLCLEK--QDIFRTQLRALLRAS---TYGNLKV 389

Query: 641 MVPFVRTLGEASQVVELLAGNGLKRGENG------LKVIMMCELPSNALLADEFLEFFDG 694
M P + TL E Q ++ K G ++V +M E+PS A+ A+ F + D
Sbjct: 390 MFPMIATLEELRQAKAIMQEEKDKLLSEGVDVSDSIEVGIMVEIPSTAVAANLFAKEVDF 449

Query: 695 FSIGSNDLTQLTLGLDRDSGIVAHLFDERNPAVKKLLANAIAACNKAGKYIGICGQGPSD 754
FSIG+NDL Q T+ DR + V++L+ +PA+ +L+ I A + GK++G+CG+ D
Sbjct: 450 FSIGTNDLIQYTMAADRMNERVSYLYQPYHPAILRLVDMVIKAAHSEGKWVGMCGEMAGD 509

Query: 755 HPDLARWLMEQGIESVSLNPDSVLDT 780
L+ G++ S++ S+L
Sbjct: 510 -EVAIPLLLGLGLDEFSMSATSILPA 534


27  HWH78_RS08520  HWH78_RS08595  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS08520  2       11       -1.568205  UDP-2,3-diacylglucosamine diphosphatase
HWH78_RS08525  0       9        -1.208331  peptidyl-prolyl cis-trans isomerase
HWH78_RS08530  -1      9        -0.788589  glutamine--tRNA ligase/YqeY domain fusion
HWH78_RS08535  -2      9        -0.509116  cysteine--tRNA ligase
HWH78_RS08540  0       16       -1.754773  bifunctional methylenetetrahydrofolate
HWH78_RS08565  0       16       -2.633618  ****hypothetical protein
HWH78_RS08570  0       16       -3.442434  serine hydrolase
HWH78_RS08575  1       13       -4.854352  sensor histidine kinase ParS
HWH78_RS08580  1       16       -5.747645  response regulator transcription factor ParR
HWH78_RS08585  1       15       -5.511138  trigger factor
HWH78_RS08590  1       12       -4.673214  ATP-dependent Clp endopeptidase proteolytic
HWH78_RS08595  1       10       -3.796338  ATP-dependent Clp protease ATP-binding subunit
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS08565  FLGFLGJ  25     0.030    Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 25.1 bits (54), Expect = 0.030
Identities = 15/46 (32%), Positives = 23/46 (50%), Gaps = 1/46 (2%)

Query: 9 RKALKQMANNHESAAVEVMRAVEWANDPKLAQRLLEVIEQMHQDAD 54
R A A + E A + A +A DP A++L +I+QM +D
Sbjct: 255 RYAAVTTAASAEQGAQALQDA-GYATDPHYARKLTNMIQQMKSISD 299


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS08575  PF06580  29     0.039    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.1 bits (65), Expect = 0.039
Identities = 20/123 (16%), Positives = 38/123 (30%), Gaps = 31/123 (25%)

Query: 315 QIRIEPRFMARAVINLL-----RNAIRHAHS------RVEIALLDQGDSCQIRVNDDGPG 363
+ +I P M V +L N I+H + ++ + + + V + G
Sbjct: 243 ENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSL 302

Query: 364 IPADARQKIFEPFSRLDDSRDRSTGGFGLGLAIVR-RVAQWHGG-YAEALETPQGGASFR 421
+ ++ G GL VR R+ +G L QG +
Sbjct: 303 ALKNTKE------------------STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAM 344

Query: 422 LTW 424
+
Sbjct: 345 VLI 347


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS08580  HTHFIS        77     2e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 77.2 bits (190), Expect = 2e-18
Identities = 31/132 (23%), Positives = 63/132 (47%), Gaps = 5/132 (3%)

Query: 7 SKVLLVEDDQKLARLIASFLSQHGFEVRQVHRGDAAFAAFLDFKPQVVVLDLMLPGQNGL 66
+ +L+ +DD + ++ LS+ G++VR + +VV D+++P +N
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 67 QVCREIRRV-ANLPILILTAQEDDLDHILGLESGADDYVIKPIEPPVLLARLRALM---- 121
+ I++ +LP+L+++AQ + I E GA DY+ KP + L+ + +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 122 RRHAPLPASPES 133
RR + L +
Sbjct: 124 RRPSKLEDDSQD 135


28  HWH78_RS08835  HWH78_RS08970  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias     Product
HWH78_RS08835  2       9        2.293342    MFS transporter
HWH78_RS08840  2       8        1.941428    DUF1272 domain-containing protein
HWH78_RS08845  1       8        2.034664    GlxA family transcriptional regulator
HWH78_RS08850  4       10       0.409972    GGDEF domain-containing protein
HWH78_RS08855  3       11       0.393607    hypothetical protein
HWH78_RS08860  2       10       0.848487    LysR family transcriptional regulator
HWH78_RS08865  3       11       0.753166    HPP family protein
HWH78_RS08870  3       11       0.607775    hypothetical protein
HWH78_RS08875  2       9        0.634683    cytochrome-c oxidase, cbb3-type subunit I
HWH78_RS08880  0       9        1.510784    DUF808 domain-containing protein
HWH78_RS08885  -1      10       1.364212    aminoglycoside phosphotransferase family
HWH78_RS08890  -1      11       1.516220    LysR family transcriptional regulator
HWH78_RS08895  -1      9        1.425989    class I SAM-dependent methyltransferase
HWH78_RS08900  1       8        1.749805    molybdenum ABC transporter ATP-binding protein
HWH78_RS08905  2       8        1.725294    molybdate ABC transporter permease subunit
HWH78_RS08910  4       8        1.983702    molybdate ABC transporter substrate-binding
HWH78_RS08915  0       8        2.628620    TetR/AcrR family transcriptional regulator
HWH78_RS08920  -1      9        2.310259    nuclease Fan1
HWH78_RS08925  -2      11       2.695602    ATP-dependent DNA helicase
HWH78_RS08930  0       13       2.087723    hypothetical protein
HWH78_RS08935  0       11       1.832223    type VI pilus biosynthesis protein
HWH78_RS08940  -1      9        1.170715    type II secretion system secretin GspD
HWH78_RS08945  5       15       1.953557    acyl carrier protein
HWH78_RS08950  4       13       1.973368    hypothetical protein
HWH78_RS08955  4       12       2.097946    protease LasA
HWH78_RS08960  4       12       2.101086    EcsC family protein
HWH78_RS08965  4       12       2.132429    transporter
HWH78_RS08970  5       13       2.391486    BapA prefix-like domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS08915  HTHTETR       67     1e-15    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 66.6 bits (162), Expect = 1e-15
Identities = 25/152 (16%), Positives = 55/152 (36%), Gaps = 8/152 (5%)

Query: 5 RQRNLQLILDAACEVFADCGFSAARLSDVAERAGVAKANVLYYYRSKVQLYEAVLDSIVE 64
Q Q ILD A +F+ G S+ L ++A+ AGV + + ++++ K L+ + +
Sbjct: 9 AQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSES 68

Query: 65 PLLEASRPFAGDQP--PAEALRAYVDNKMRIGAERPHAARVFSCEIMRGAPRMPAPLLER 122
+ E + P P LR + + + + + ++++
Sbjct: 69 NIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQ 128

Query: 123 LDAQAERN-----AERIRQWIDEG-LLAPLDP 148
+ ++ I+ L A L
Sbjct: 129 AQRNLCLESYDRIEQTLKHCIEAKMLPADLMT 160


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS08940  BCTERIALGSPD  592    0.0      Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 592 bits (1528), Expect = 0.0
Identities = 197/591 (33%), Positives = 324/591 (54%), Gaps = 26/591 (4%)

Query: 44 EQWTINMKDAEIGDFIEQVSSISGQTFVVDPRVKGRVTVVSQARLSLAEVYQLFLSVLAT 103
E+++ + K +I +FI VS +T ++DP V+G +TV S L+ + YQ FLSVL
Sbjct: 28 EEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDMLNEEQYYQFFLSVLDV 87

Query: 104 HGYAVLPQGDQA-RIVPNMEARQDAAQKTVRDGPG---SLETRVVQAQQTSVAELIPMIR 159
+G+AV+ + ++V + +A+ A PG + TRVV + +L P++R
Sbjct: 88 YGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGIGDEVVTRVVPLTNVAARDLAPLLR 147

Query: 160 PLVPAHGHLAAV--PSANALIVSDRRANIERIEAIVRSLDRAGEHDYSIYDMRHAWVAEI 217
L G + V +N L+++ R A I+R+ IV +D AG+ + A A++
Sbjct: 148 QLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDNAGDRSVVTVPLSWASAADV 207

Query: 218 AEV---LDRSVTPAAGKSAATVQVLADSRSNRLVLLGPPQARARLLRLAQSLDVPSSRSA 274
++ L++ + +A + V+AD R+N +++ G P +R R++ + + LD +
Sbjct: 208 VKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNSRQRIIAMIKQLDRQQATQG 267

Query: 275 NSRVIRLRHGDAKTLAATLGEIGESLHGER-GQDGRGSGKRGLLVRADESLNALVILADP 333
N++VI L++ A L L I ++ E+ + + ++++A NAL++ A P
Sbjct: 268 NTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAKPVAALDKNIIIKAHGQTNALIVTAAP 327

Query: 334 EDVGLLEDIVRQLDVPRAQLLVEAAIVELSGEIGDALGVQWALRSGHVAGGAGFADSGLS 393
+ + LE ++ QLD+ R Q+LVEA I E+ G LG+QWA AG F +SGL
Sbjct: 328 DVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWA---NKNAGMTQFTNSGLP 384

Query: 394 IGTLLGAL----QAGKPPAELP------DGAIVGLGSRDFGALVTALSRNSRSNLLSTPS 443
I T + + G + L +G G ++ L+TALS ++++++L+TPS
Sbjct: 385 ISTAIAGANQYNKDGTVSSSLASALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPS 444

Query: 444 LLTLDNQKAEILVGQNVPFQTGSYTTSASGSSNPFTTVERKDIGVTLKVTPHIGEDRMLR 503
++TLDN +A VGQ VP TGS TTS N F TVERK +G+ LKV P I E +
Sbjct: 445 IVTLDNMEATFNVGQEVPVLTGSQTTS---GDNIFNTVERKTVGIKLKVKPQINEGDSVL 501

Query: 504 LEIEQEISSIAPTATLAAKAVDLVTNKRSIKSTVLADDGQVIVLGGLIQDDLLRSDSRVP 563
LEIEQE+SS+A A+ + + N R++ + VL G+ +V+GGL+ + + +VP
Sbjct: 502 LEIEQEVSSVADAASSTSSDLGATFNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVP 561

Query: 564 LLGDIPGVGRLFRSSRETRVKRNLMVFLRPSIVRDAAGLERISHGRYRSIQ 614
LLGDIP +G LFRS+ + KRNLM+F+RP+++RD + S G+Y +
Sbjct: 562 LLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRDEYRQASSGQYTAFN 612


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS08970  CABNDNGRPT    48     3e-07    NodO calcium binding signature.
		>CABNDNGRPT#NodO calcium binding signature.

Length = 479

Score = 48.4 bits (115), Expect = 3e-07
Identities = 35/163 (21%), Positives = 65/163 (39%), Gaps = 8/163 (4%)

Query: 2519 TDSNGNDSAAYGITLTPNGLSLNIGQI-DVNGTSGDDVLSGANGSSEHINGGDGSDLIFN 2577
D+ G D+ + ++LN G DV G G+ ++ E+ GG G+D++
Sbjct: 296 WDAGGTDTFDFSGYSNNQRINLNEGSFSDVGGLKGNVSIAHGVTI-ENAIGGSGNDILVG 354

Query: 2578 VGTGDHVVAGNGNDTIQITATDFVSIDGGAGFDTLVLANGIDLDYNAVGVGT--LSNLER 2635
+ + G GND + A ++ GGAG DT V +G D A +++
Sbjct: 355 NSADNILQGGAGNDVLYGGAGA-DTLYGGAGRDTFVYGSGQDSTVAAYDWIADFQKGIDK 413

Query: 2636 IDLGKGDSGSVLTLTAAEVDAITDANNTLQITGENNDTLNVVG 2678
IDL + + D T + + + +++ +
Sbjct: 414 IDL---SAFRNEGQLSFVQDQFTGKGQEVMLQWDAANSITNLW 453


29  HWH78_RS09070  HWH78_RS09140  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias     Product
HWH78_RS09070  0       13       -3.793290   2OG-Fe(II) oxygenase
HWH78_RS09075  -1      13       -4.358150   fatty acid desaturase
HWH78_RS09080  0       11       -2.408382   hypothetical protein
HWH78_RS09085  -1      11       -1.968223   sterol desaturase family protein
HWH78_RS09090  0       11       0.279670    quorum-sensing transcriptional repressor QscR
HWH78_RS09095  -1      14       2.312457    phenazine biosynthesis protein PhzA
HWH78_RS09100  -1      17       3.354553    phenazine biosynthesis protein phzB 2
HWH78_RS09105  0       17       4.109684    phenazine biosynthesis protein PhzC
HWH78_RS09110  -2      14       3.599792    phenazine biosynthesis protein PhzD
HWH78_RS09115  -2      12       4.202830    phenazine biosynthesis protein PhzE
HWH78_RS09120  -2      11       4.449878    phenazine biosynthesis protein PhzF
HWH78_RS09125  -2      11       3.499983    phenazine biosynthesis FMN-dependent oxidase
HWH78_RS09130  -1      10       3.230929    HIT family protein
HWH78_RS09135  0       11       4.018749    dienelactone hydrolase
HWH78_RS09140  1       13       3.843679    MFS transporter
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS09110  ISCHRISMTASE  351    e-125    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 351 bits (901), Expect = e-125
Identities = 102/207 (49%), Positives = 136/207 (65%), Gaps = 2/207 (0%)

Query: 3 GIPEITAYPLPTAQQLPANLARWSLEPRRAVLLVHDMQRYFLRPLPESLRA--GLVANAA 60
IP I Y +PTA +P N W +P RAVLL+HDMQ YF+ L AN
Sbjct: 2 AIPAIQPYQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANIR 61

Query: 61 RLRRWCVEQGVQIAYTAQPGSMTEEQRGLLKDFWGPGMRASPADREVVEELAPGPDDWLL 120
+L+ CV+ G+ + YTAQPGS + R LL DFWGPG+ + P + +++ ELAP DD +L
Sbjct: 62 KLKNQCVQLGIPVVYTAQPGSQNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLVL 121

Query: 121 TKWRYSAFFHSDLLQRMRAAGRDQLVLCGVYAHVGVLISTVDAYSNDIQPFLVADAIADF 180
TKWRYSAF ++LL+ MR GRDQL++ G+YAH+G L++ +A+ DI+ F V DA+ADF
Sbjct: 122 TKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVADF 181

Query: 181 SEAHHRMALEYAASRCAMVVTTDEVLE 207
S H+MALEYAA RCA V TD +L+
Sbjct: 182 SLEKHQMALEYAAGRCAFTVMTDSLLD 208


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS09140  TCRTETA       99     5e-25    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 99.1 bits (247), Expect = 5e-25
Identities = 86/335 (25%), Positives = 125/335 (37%), Gaps = 37/335 (11%)

Query: 49 GAAVTVGGIAWMLAARPWGIASDRHGRRRILLGGLAGFALSYGSLCLFIVLALHWTLPTL 108
G + + + A G SDR GRR +LL LAG A+ Y +
Sbjct: 46 GILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDY----------------AI 89

Query: 109 FAFAG---IVLLRGLAGGFYAAVPACTAALVADHVEAQRRAAALAGLGAASAIGMVIGPG 165
A A ++ + + G A A A +AD + RA + A GMV GP
Sbjct: 90 MATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPV 149

Query: 166 LAGLLATHGLVLPLLVTGALPLVALLALWRWLP----------REERRQPNRGAALAIGD 215
L GL+ P AL + L LP R E P A G
Sbjct: 150 LGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGM 209

Query: 216 RRLRRPLAVGFVAMFSVTVAQITVGFFALDRLRLDSADAARVAGIALTAVGVALILAQLL 275
+ +AV F+ V F DR D+ GI+L A G+ LAQ +
Sbjct: 210 TVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATT----IGISLAAFGILHSLAQAM 265

Query: 276 VRRL---DWPPPRLIRVGGLVAAIGFAAVCFADSPPLLWLAFFVAAAGMGWVFPAVSALN 332
+ R + +G + G+ + FA + + + A+G G PA+ A+
Sbjct: 266 ITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASG-GIGMPALQAML 324

Query: 333 ANAVRAEEQGAAAGTLVAVHGFGLISGPLLGTLLH 367
+ V E QG G+L A+ I GPLL T ++
Sbjct: 325 SRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIY 359



Score = 37.5 bits (87), Expect = 8e-05
Identities = 36/140 (25%), Positives = 56/140 (40%), Gaps = 7/140 (5%)

Query: 251 SADAARVAGIALTAVGVA-LILAQLLVRRLDWPPPRLIRVGGLV-AAIGFAAVCFADSPP 308
S D GI L + A +L D R + + L AA+ +A + A P
Sbjct: 38 SNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATA---P 94

Query: 309 LLWLAFF--VAAAGMGWVFPAVSALNANAVRAEEQGAAAGTLVAVHGFGLISGPLLGTLL 366
LW+ + + A G A A+ +E+ G + A GFG+++GP+LG L+
Sbjct: 95 FLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLM 154

Query: 367 HQLDSRAPYALVGLLLALAA 386
AP+ L L
Sbjct: 155 GGFSPHAPFFAAAALNGLNF 174


30  HWH78_RS09265  HWH78_RS09295  Y  N  N  Genomic Island
LocusTag       DNBias  CDNBias  %GCBias     Product
HWH78_RS09265  3       27       -5.016213   (2Fe-2S)-binding protein
HWH78_RS09270  5       30       -6.340275   xanthine dehydrogenase family protein subunit M
HWH78_RS09275  5       41       -10.405415  xanthine dehydrogenase family protein
HWH78_RS09280  5       49       -14.649599  DUF4354 family protein
HWH78_RS09285  6       37       -10.530454  DUF2326 domain-containing protein
HWH78_RS09290  6       31       -8.323107   hypothetical protein
HWH78_RS09295  2       17       -3.578444   HNH endonuclease
31  HWH78_RS09750  HWH78_RS09905  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias     Product
HWH78_RS09750  3       12       0.343375    Lrp/AsnC family transcriptional regulator
HWH78_RS09755  1       13       2.182160    winged helix-turn-helix domain-containing
HWH78_RS09760  1       10       2.667237    hypothetical protein
HWH78_RS09765  1       11       0.884266    DUF1127 domain-containing protein
HWH78_RS09770  4       26       -3.639158   PLP-dependent aminotransferase family protein
HWH78_RS09775  5       32       -4.960598   siderophore-interacting protein
HWH78_RS09780  4       33       -5.357879   class I SAM-dependent methyltransferase
HWH78_RS09785  3       28       -5.956886   thiamine pyrophosphate-binding protein
HWH78_RS09790  3       29       -8.198244   methyltransferase domain-containing protein
HWH78_RS09795  2       20       -6.049728   hypothetical protein
HWH78_RS09800  1       12       -1.392636   AzlD domain-containing protein
HWH78_RS09805  -1      11       -0.760819   AzlC family ABC transporter permease
HWH78_RS09810  -1      12       -0.827955   glutamine synthetase family protein
HWH78_RS09815  0       12       0.756558    APC family permease
HWH78_RS09820  1       12       1.980728    serine/threonine transporter SstT
HWH78_RS09825  2       11       2.661387    RluA family pseudouridine synthase
HWH78_RS09830  0       11       2.440039    transglutaminase family protein
HWH78_RS09835  1       11       2.886062    membrane protein insertion efficiency factor
HWH78_RS09840  1       10       3.206526    hypothetical protein
HWH78_RS09845  0       10       2.425672    AraC family transcriptional regulator CmrA
HWH78_RS09850  -1      11       2.785478    antibiotic biosynthesis monooxygenase
HWH78_RS09855  0       10       2.978151    alkaline phosphatase family protein
HWH78_RS09860  1       11       4.071439    sigma-70 family RNA polymerase sigma factor
HWH78_RS09865  0       12       4.200882    FecR family protein
HWH78_RS09870  2       11       3.024666    cyanase
HWH78_RS09875  1       10       2.842496    carbonic anhydrase CynT
HWH78_RS09880  1       12       2.690057    transcriptional regulator CynR
HWH78_RS09885  1       11       2.531641    MFS transporter
HWH78_RS09890  1       11       2.289095    LysR family transcriptional regulator
HWH78_RS09895  1       11       2.481508    TonB-dependent receptor
HWH78_RS09900  -2      11       2.914761    extracellular solute-binding protein
HWH78_RS09905  -2      12       3.486467    microcin C ABC transporter permease YejB
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS09855  SALVRPPROT    30     0.018    Salmonella virulence-associated 28kDa protein signature.
		>SALVRPPROT#Salmonella virulence-associated 28kDa protein signature.

Length = 241

Score = 30.5 bits (68), Expect = 0.018
Identities = 14/44 (31%), Positives = 25/44 (56%), Gaps = 4/44 (9%)

Query: 299 RFAGERRHIEAFRDGLPRVARALAHISSLMIFDDHDITDDWNLS 342
+FAG++ HI RD +P+ +AL S ++F + D W ++
Sbjct: 99 KFAGDKFHISVLRDMVPQAFQAL----SGLLFSEDSPVDKWKVT 138


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS09885  TCRTETB       126    4e-34    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 126 bits (319), Expect = 4e-34
Identities = 82/393 (20%), Positives = 159/393 (40%), Gaps = 21/393 (5%)

Query: 14 LDATIVFVALPEISRALDFSAQRLQWVVSAYTVAFGGFLLLGGRATDLLGRRRMYVLGQS 73
L+ ++ V+LP+I+ + WV +A+ + F + G+ +D LG +R+ + G
Sbjct: 28 LNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGII 87

Query: 74 LYALASLAGGLAQSELP-LILARAVQGLGGALLFPATLALIGNHFAEGPARNRALAIWSI 132
+ S+ G + S LI+AR +QG G A FPA + ++ + R +A +
Sbjct: 88 INCFGSVIGFVGHSFFSLLIMARFIQGAGAAA-FPALVMVVVARYIPKENRGKAFGLIGS 146

Query: 133 ASAFGLALGSALGGALTELFGWASIFLVNVPLAGAAALLALRLIPADARRQRGRRFDLAG 192
A G +G A+GG + W+ +L+ +P+ + L + R +G FD+ G
Sbjct: 147 IVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKEVRIKG-HFDIKG 203

Query: 193 ALTVTAGATLLVFTLVQGPESGWDAPSVRFGLYLSVPLLLAFLAIEHYSR--DPLMPLRL 250
+ ++ G + S +L V +L + ++H + DP + L
Sbjct: 204 IILMSVGIVFFMLF----------TTSYSIS-FLIVSVLSFLIFVKHIRKVTDPFVDPGL 252

Query: 251 LGNRNLQVAMLLTAIFMSSYGVQYYFLAIYFQSVYGYSVLQTGLAFL-PATLLCTLGIRV 309
N + +L I + + + V+ S + G + P T+ + +
Sbjct: 253 GKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYI 312

Query: 310 AERLLARHGARATLVAGLLLGALGLGLLAACLPLGRGFLALLPAIVILSVGQGMTWTAMW 369
L+ R G L G+ ++ L A+ L + + IV + G T T +
Sbjct: 313 GGILVDRRGPLYVLNIGVTFLSVSF-LTASFLLETTSWFMTI-IIVFVLGGLSFTKTVIS 370

Query: 370 VSAASGVDPAEQGVASGMASMTQQIGGALGLAL 402
+S + E G + + T + G+A+
Sbjct: 371 TIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAI 403


32  HWH78_RS10025  HWH78_RS10305  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias     Product
HWH78_RS10025  -1      10       3.120246    asparagine synthase (glutamine-hydrolyzing)
HWH78_RS10030  1       9        3.374417    aromatic-ring-hydroxylating dioxygenase subunit
HWH78_RS10035  1       9        3.262060    alpha/beta hydrolase
HWH78_RS10045  0       10       3.984358    hypothetical protein
HWH78_RS10050  0       10       3.600087    class I SAM-dependent methyltransferase
HWH78_RS10055  1       10       3.939601    TonB-dependent receptor
HWH78_RS10060  1       13       4.712249    LLM class flavin-dependent oxidoreductase
HWH78_RS10065  3       12       4.508779    MFS transporter
HWH78_RS10070  -1      9        3.523740    MFS transporter
HWH78_RS10075  -1      9        3.859922    sigma-70 family RNA polymerase sigma factor
HWH78_RS10080  -1      9        4.284358    FecR family protein
HWH78_RS10085  -1      11       1.574600    hypothetical protein
HWH78_RS10090  -1      15       -0.138304   AraC family transcriptional regulator
HWH78_RS10095  0       19       -1.596537   NAD(P)/FAD-dependent oxidoreductase
HWH78_RS10100  0       29       -3.802363   alpha/beta hydrolase
HWH78_RS10105  -1      36       -6.804820   SDR family oxidoreductase
HWH78_RS10110  0       42       -8.387655   transcriptional regulator MdrR2
HWH78_RS10115  1       43       -8.690906   EamA family transporter
HWH78_RS10120  1       43       -7.300554   Mov34/MPN/PAD-1 family protein
HWH78_RS10125  1       29       -4.291737   molybdopterin-synthase adenylyltransferase MoeB
HWH78_RS10130  2       24       -2.536651   PLP-dependent cysteine synthase family protein
HWH78_RS10135  2       15       0.741028    serine O-acetyltransferase
HWH78_RS10140  1       9        2.742294    alanyl-tRNA editing protein
HWH78_RS10145  2       9        4.500103    four-helix bundle copper-binding protein
HWH78_RS10150  -1      11       3.980956    thiamine pyrophosphate-requiring protein
HWH78_RS10155  -1      16       3.758293    DUF4142 domain-containing protein
HWH78_RS10160  -2      15       3.433811    biotin-dependent carboxyltransferase family
HWH78_RS10165  -1      14       2.680436    allophanate hydrolase subunit 1
HWH78_RS10170  0       15       2.158634    LamB/YcsF family protein
HWH78_RS10175  1       13       2.249884    OprD family porin
HWH78_RS10180  0       14       2.088830    MFS transporter
HWH78_RS10185  1       12       1.615109    LysR family transcriptional regulator
HWH78_RS10190  3       10       1.648863    putative hydro-lyase
HWH78_RS10195  1       10       1.632013    hypothetical protein
HWH78_RS10200  1       10       1.886198    bifunctional DNA-binding transcriptional
HWH78_RS10205  -1      10       1.311791    DUF2790 domain-containing protein
HWH78_RS10210  0       10       2.393212    NAD(P)-dependent alcohol dehydrogenase
HWH78_RS10215  -1      10       3.450162    nuclear transport factor 2 family protein
HWH78_RS10220  -1      9        3.372923    LysR family transcriptional regulator
HWH78_RS10225  -1      9        3.409667    SRPBCC family protein
HWH78_RS10230  -1      9        3.064408    LysR family transcriptional regulator
HWH78_RS10235  1       11       2.955203    GMC family oxidoreductase N-terminal
HWH78_RS10240  1       15       2.046951    aldehyde dehydrogenase family protein
HWH78_RS10245  1       10       1.599909    ParB/RepB/Spo0J family partition protein
HWH78_RS10250  2       10       1.888713    N-acetyltransferase
HWH78_RS10255  2       10       2.098854    phosphoadenosine phosphosulfate reductase
HWH78_RS10260  2       12       2.592249    molecular chaperone
HWH78_RS10265  0       9        2.314028    fimbrial biogenesis outer membrane usher
HWH78_RS10270  1       8        2.752059    fimbrial protein
HWH78_RS10275  0       9        2.619928    molecular chaperone
HWH78_RS10280  0       11       2.531123    EAL domain-containing protein
HWH78_RS10285  2       12       1.744921    DUF4142 domain-containing protein
HWH78_RS10290  1       11       2.026552    sodium:proton antiporter
HWH78_RS10295  3       10       2.441623    DUF421 domain-containing protein
HWH78_RS10300  2       12       1.854765    hypothetical protein
HWH78_RS10305  2       14       2.572077    response regulator
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS10045  PHPHLIPASEA1  32     0.001    Bacterial phospholipase A1 protein signature.
		>PHPHLIPASEA1#Bacterial phospholipase A1 protein signature.

Length = 289

Score = 32.2 bits (73), Expect = 0.001
Identities = 17/59 (28%), Positives = 22/59 (37%), Gaps = 2/59 (3%)

Query: 69 FRAQALIEQFIVARPADYRLDDWARATARIYRALEPAGRGDPASAA-DRL-ARQAALYG 125
FR Q + DYR W + + GR DP S + +RL R A G
Sbjct: 129 FRETNYEPQLFLGFATDYRFAGWTLRDVEMGYNHDSNGRSDPTSRSWNRLYTRLMAENG 187


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS10070  TCRTETA       55     2e-10    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 54.8 bits (132), Expect = 2e-10
Identities = 84/365 (23%), Positives = 139/365 (38%), Gaps = 20/365 (5%)

Query: 23 VGTVELVVAGVLDELAASFAVSQGRAGLLMSLYALVYALLGPLLVYLSAGIERRRLLAGA 82
+G + V+ G+L +L S V G+L++LYAL+ P+L LS RR +L +
Sbjct: 21 IGLIMPVLPGLLRDLVHSNDV-TAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVS 79

Query: 83 LLAFIGANLASAAAPSFALLLASRLLVAASASVIVVVAITLAVAIVAPERRGRAIGLVFA 142
L A AP +L R++ + + V +A I + R R G + A
Sbjct: 80 LAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIA-DITDGDERARHFGFMSA 138

Query: 143 GIVASLVLGVPLGTLIGEFWGWRSLFLLLAGVALLGLPLLLRLL---------PAIPGAP 193
+V G LG L+G F + F A + L LL P A
Sbjct: 139 CFGFGMVAGPVLGGLMGGF-SPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREAL 197

Query: 194 GIAPAEQLRALARGRVPFAHLASLLQMTGQFTVYTYIVPFLVGSMALDKPTISLVLLVYG 253
+ + + ++Q+ GQ +++ F D TI + L +G
Sbjct: 198 NPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVI-FGEDRFHWDATTIGISLAAFG 256

Query: 254 GGGILG-ALLGGRAADRWPGPATFVAFLLLHALALVLLPFATGGLPLLLGAVVFWCVFNM 312
L A++ G A R + ++ +LL FAT G V+
Sbjct: 257 ILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLL--ASGG 314

Query: 313 APGPAIQKYLVELSPDTAAIQISLNTSAIQLGVALGAFIGAILVDQVAVRALPWW-GAAL 371
PA+Q L + Q+ + +A+ +L + +G +L + ++ W G A
Sbjct: 315 IGMPALQAMLSRQVDEERQGQLQGSLAALT---SLTSIVGPLLFTAIYAASITTWNGWAW 371

Query: 372 ILGAA 376
I GAA
Sbjct: 372 IAGAA 376


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS10105  DHBDHDRGNASE  87     2e-22    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 86.6 bits (214), Expect = 2e-22
Identities = 56/188 (29%), Positives = 84/188 (44%), Gaps = 9/188 (4%)

Query: 2 QNILITGAASGIGAASARLFHRRGWRVGLLDIDAEALRGLAVQLPGAWHRA----VDVSE 57
+ ITGAA GIG A AR +G + +D + E L + L A DV +
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 58 PDAVGEALAQFCAD-GRLRLLFNCAGVLRFGRFEEVALEDHARLLAINLHGVLNCCHAAF 116
A+ E A+ + G + +L N AGVLR G ++ E+ ++N GV N +
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 117 PFLRATPQAQVLNMGSASGLYGVPE--MAVYSASKFAVRGLTEALELEWRRHGIRVADLM 174
++ ++ +GS GVP MA Y++SK A T+ L LE + IR +
Sbjct: 129 KYMMDRRSGSIVTVGSNPA--GVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186

Query: 175 PPFVRTPM 182
P T M
Sbjct: 187 PGSTETDM 194


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS10180  TCRTETB       41     8e-06    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 40.6 bits (95), Expect = 8e-06
Identities = 37/182 (20%), Positives = 82/182 (45%), Gaps = 6/182 (3%)

Query: 29 FWSCKIGYGLDGMDTQMLSFVIPTLIALWGISTGEAGFIHTMTLLASAAGGWIAGILSDR 88
W C + + ++ +L+ +P + + +++T +L + G + G LSD+
Sbjct: 17 IWLCILSF-FSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ 75

Query: 89 IGRVLTLQLTVLWFAFFTFLCGLAQNYEQLLV-ARTLMGFGFGGEWTAGAVLIGEVIKAR 147
+G L ++ F + + + ++ LL+ AR + G G V++ I
Sbjct: 76 LGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKE 135

Query: 148 DRGKAVGLVQSGWAIGWGLTAILYSLMFSLLPPEQAWRALFMLGLLPALFVLVVRRLVKE 207
+RGKA GL+ S A+G G+ + ++ + W L ++ ++ + V + +L+K+
Sbjct: 136 NRGKAFGLIGSIVAMGEGVGPAIGGMIAHYI----HWSYLLLIPMITIITVPFLMKLLKK 191

Query: 208 PA 209

Sbjct: 192 EV 193


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS10270  PF00577       772    0.0      Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 772 bits (1995), Expect = 0.0
Identities = 271/881 (30%), Positives = 411/881 (46%), Gaps = 50/881 (5%)

Query: 7 RRCRTGTALMAGGMALAASAFGHAQPGYEFDDRLLLGSSLGGGDLSRFNQDGRIDPGRYH 66
R + A AA A + F+ R L DLSRF + PG Y
Sbjct: 21 HRLAGFFVRLFVACAFAAQA-PLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYR 79

Query: 67 VDVYLNERFASRSEVSFRANPASGAVEPCLDEDFLRQRLGAKPGDDPRKSGDGRHCAFLG 126
VD+YLN + + +V+F + + PCL L C L
Sbjct: 80 VDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPLT 139

Query: 127 ARLPGSRFSLDIARLRLDLSVPQALLDLKPRGYVSPEEWDAGDSMGFVNYDTNLYRSDYR 186
+ + + LD+ + RL+L++PQA + + RGY+ PE WD G + G +NY+ + R
Sbjct: 140 SMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNR 199

Query: 187 GGESGRSDYAYVGLNSGINLGLWRLRHQSNYTYSRYNGQA--RRKWNSIRTYAQRALPAW 244
G G S YAY+ L SG+N+G WRLR + ++Y+ + + + KW I T+ +R +
Sbjct: 200 IG--GNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPL 257

Query: 245 RSELTAGESYTAGNLLGSIGYRGLSLATDDRMLPESLRRYAPQVRGTAATAARVVISQNG 304
RS LT G+ YT G++ I +RG LA+DD MLP+S R +AP + G A A+V I QNG
Sbjct: 258 RSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNG 317

Query: 305 RKIREVNVAPGPFVIDDLYDSAYAGDLDVQVFEADGSVSSFSVPFASVPESMRPGLSRYS 364
I V PGPF I+D+Y + +GDL V + EADGS F+VP++SVP R G +RYS
Sbjct: 318 YDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYS 377

Query: 365 FTLGQARQYGDGDD--LFADFTYQRGMSNALTANLGLRVADDYLA-MLGGGVLATRFGAF 421
T G+ R + F T G+ T G ++AD Y A G G GA
Sbjct: 378 ITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGAL 437

Query: 422 GLNSTYSSARVEDGARKQGWRIGLDYSRTFQPTGTTLTLAGYRYSTEGYRELGDVLGSRD 481
++ T +++ + D ++ G + Y+++ +GT + L GYRYST GY D SR
Sbjct: 438 SVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRM 497

Query: 482 ALRHGDTWD-------------SGSYKQRNQFNLLVSQALGGYGNLYLSGSSSDFYDGKS 528
+ +T D + +Y +R + L V+Q LG LYLSGS ++ +
Sbjct: 498 NGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSN 557

Query: 529 RDTQLQFGYSNTWGQLSYNLAWSRQTTTYYQEQGDQDPGVELLRRDRRSGQRNDTLTLSV 588
D Q Q G + + +++ L++S ++ R+ L L+V
Sbjct: 558 VDEQFQAGLNTAFEDINWTLSYSLT-------------------KNAWQKGRDQMLALNV 598

Query: 589 SMPLGSSSRAPTLSA-----MATRRSGDSRGG-SLQTGLNGTLGDERTWSYALSA---NR 639
++P R+ + S + S D G + G+ GTL ++ SY++
Sbjct: 599 NIPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGG 658

Query: 640 DSEVADTTWNGTLQKQAALATVNAGYAQGDRYRQYSGGIRGALVAHRDGLTLGPSVGDTF 699
+ +T TL + N GY+ D +Q G+ G ++AH +G+TLG + DT
Sbjct: 659 GDGNSGSTGYATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTV 718

Query: 700 ALVEAKGASGAAIRGGQGARIDGNGYALAPSLSPYRYNPISLDPVGIDPDAELLETERKV 759
LV+A GA A + G R D GYA+ P + YR N ++LD + + +L V
Sbjct: 719 VLVKAPGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANV 778

Query: 760 APYAGASVRVTFRTLTGHPLLIQARREDGSVLPLGAVVVDDGGAAIGMVGQGGQVYARAE 819
P GA VR F+ G LL+ + LP GA+V + + G+V GQVY
Sbjct: 779 VPTRGAIVRAEFKARVGIKLLMTLTH-NNKPLPFGAMVTSESSQSSGIVADNGQVYLSGM 837

Query: 820 NQRGRLLVQWGTARKERCELPYDLAGVSRDQALIRLRGTCR 860
G++ V+WG C Y L S+ Q L +L CR
Sbjct: 838 PLAGKVQVKWGEEENAHCVANYQLPPESQQQLLTQLSAECR 878


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS10310  HTHFIS        34     2e-04    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 34.0 bits (78), Expect = 2e-04
Identities = 15/80 (18%), Positives = 32/80 (40%), Gaps = 2/80 (2%)

Query: 58 VLVLEEHAEQLWRIEEFLLDRGYAVLSAASRDEALDHLASDAVIDLFLLSEQLEGPLSGS 117
+LV ++ A + + L GY V ++ +A+ DL + + +
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDG-DLVVTDVVMPD-ENAF 63

Query: 118 MLIETSLPVRPRMRVILLSD 137
L+ RP + V+++S
Sbjct: 64 DLLPRIKKARPDLPVLVMSA 83


33  HWH78_RS10420  HWH78_RS10445  Y  N  N  Genomic Island
LocusTag       DNBias  CDNBias  %GCBias     Product
HWH78_RS10420  -3      11       3.880317    glutathione-dependent formaldehyde
HWH78_RS10425  -2      11       3.863030    CBS domain-containing protein
HWH78_RS10430  -2      10       3.460319    glycogen debranching protein GlgX
HWH78_RS10435  -1      9        3.733506    DUF2934 domain-containing protein
HWH78_RS10440  -1      9        3.653317    malto-oligosyltrehalose synthase
HWH78_RS10445  -2      10       3.278165    4-alpha-glucanotransferase
34  HWH78_RS10705  HWH78_RS11180  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias     Product
HWH78_RS10705  2       13       0.122028    hypothetical protein
HWH78_RS10710  3       12       0.976164    sigma-54-dependent Fis family transcriptional
HWH78_RS10715  5       12       1.654778    phosphotransferase family protein
HWH78_RS10720  6       14       0.759944    DUF6285 domain-containing protein
HWH78_RS10725  5       12       0.538630    acyl-CoA/acyl-ACP dehydrogenase
HWH78_RS10730  3       11       0.090805    SDR family oxidoreductase
HWH78_RS10735  2       12       -0.554838   DUF485 domain-containing protein
HWH78_RS10740  1       13       -1.375006   cation acetate symporter
HWH78_RS10745  1       12       -1.641659   NAD(P)-binding domain-containing protein
HWH78_RS10750  0       13       -1.752651   aspartate aminotransferase family protein
HWH78_RS10755  0       13       -2.828638   gamma-aminobutyraldehyde dehydrogenase
HWH78_RS10760  -1      17       -3.551874   APC family permease
HWH78_RS10765  -1      23       -4.698968   iron-containing alcohol dehydrogenase
HWH78_RS10770  0       29       -5.328973   alpha/beta hydrolase
HWH78_RS10775  0       36       -6.638159   NAD(P)/FAD-dependent oxidoreductase
HWH78_RS10780  0       34       -6.803465   TetR/AcrR family transcriptional regulator
HWH78_RS10785  0       28       -6.094436   phytanoyl-CoA dioxygenase family protein
HWH78_RS10790  0       23       -5.304740   FAD-binding oxidoreductase
HWH78_RS10795  2       22       -4.514036   hypothetical protein
HWH78_RS10800  1       20       -4.546141   DUF1329 domain-containing protein
HWH78_RS10805  1       18       -3.482967   DUF1302 domain-containing protein
HWH78_RS10815  0       24       -3.937211   MMPL family transporter
HWH78_RS10820  0       34       -4.525446   hypothetical protein
HWH78_RS10825  0       34       -5.578705   phytanoyl-CoA dioxygenase family protein
HWH78_RS10830  1       38       -5.267743   helix-turn-helix domain-containing protein
HWH78_RS10835  0       42       -6.302058   hypothetical protein
HWH78_RS10840  3       41       -6.048362   hypothetical protein
HWH78_RS10845  3       40       -6.830888   zonular occludens toxin
HWH78_RS10850  7       41       -6.962097   DUF2523 domain-containing protein
HWH78_RS10855  5       37       -6.296310   hypothetical protein
HWH78_RS10860  3       34       -7.147242   hypothetical protein
HWH78_RS10865  3       34       -7.101687   hypothetical protein
HWH78_RS10870  3       31       -6.162733   hypothetical protein
HWH78_RS10875  0       33       -5.275931   hypothetical protein
HWH78_RS10880  -1      33       -5.548815   phage/plasmid replication protein, II/X family
HWH78_RS10885  1       36       -5.401238   hypothetical protein
HWH78_RS10890  1       37       -5.083316   hypothetical protein
HWH78_RS10895  3       41       -6.048362   hypothetical protein
HWH78_RS10900  3       40       -6.830888   zonular occludens toxin
HWH78_RS10905  7       41       -6.962097   DUF2523 domain-containing protein
HWH78_RS10910  5       37       -6.296310   hypothetical protein
HWH78_RS10915  3       34       -7.147242   hypothetical protein
HWH78_RS10920  3       34       -7.101687   hypothetical protein
HWH78_RS10925  1       33       -6.045400   hypothetical protein
HWH78_RS10930  0       31       -5.334231   hypothetical protein
HWH78_RS10935  1       34       -5.031704   phage/plasmid replication protein, II/X family
HWH78_RS10940  0       35       -4.878151   hypothetical protein
HWH78_RS10945  3       39       -5.604465   hypothetical protein
HWH78_RS10950  4       41       -6.550912   zonular occludens toxin
HWH78_RS10955  7       41       -6.275674   DUF2523 domain-containing protein
HWH78_RS10960  6       37       -5.678725   hypothetical protein
HWH78_RS10965  5       35       -7.098976   hypothetical protein
HWH78_RS10970  4       35       -7.128613   hypothetical protein
HWH78_RS10975  1       33       -6.092429   hypothetical protein
HWH78_RS10980  1       33       -5.425899   phage/plasmid replication protein, II/X family
HWH78_RS10985  1       36       -5.149410   helix-turn-helix domain-containing protein
HWH78_RS10990  1       36       -5.163402   hypothetical protein
HWH78_RS10995  3       39       -5.604465   zonular occludens toxin
HWH78_RS11000  4       41       -6.550912   DUF2523 domain-containing protein
HWH78_RS11005  7       41       -6.275674   hypothetical protein
HWH78_RS11010  6       37       -5.678725   hypothetical protein
HWH78_RS11015  5       35       -7.098976   hypothetical protein
HWH78_RS11020  4       35       -7.128613   hypothetical protein
HWH78_RS11025  3       45       -9.071068   phage/plasmid replication protein, II/X family
HWH78_RS11030  2       58       -12.349571  helix-turn-helix domain-containing protein
HWH78_RS11035  4       71       -15.861012  hypothetical protein
HWH78_RS11045  4       84       -18.800536  tyrosine-type recombinase/integrase
HWH78_RS11050  4       90       -20.313320  site-specific integrase
HWH78_RS11055  5       89       -20.831969  hypothetical protein
HWH78_RS11060  7       75       -17.116903  hypothetical protein
HWH78_RS11065  4       53       -10.869735  hypothetical protein
HWH78_RS11070  4       40       -8.410704   hypothetical protein
HWH78_RS11075  3       36       -8.987605   hypothetical protein
HWH78_RS11080  3       44       -10.621703  DNA-binding protein
HWH78_RS11085  2       48       -12.385622  site-specific integrase
HWH78_RS11090  2       56       -14.723915  phage Gp37/Gp68 family protein
HWH78_RS11100  1       57       -15.611154  three-Cys-motif partner protein TcmP
HWH78_RS11105  2       63       -15.659582  saccharopine dehydrogenase NADP-binding
HWH78_RS11110  0       57       -13.886166  hypothetical protein
HWH78_RS11115  0       56       -12.405023  EamA family transporter
HWH78_RS11120  0       53       -11.247386  GNAT family N-acetyltransferase
HWH78_RS11125  0       54       -10.830553  Tn3 family transposase
HWH78_RS11130  1       57       -10.743475  hypothetical protein
HWH78_RS11135  0       55       -10.647196  TatD family hydrolase
HWH78_RS11140  0       55       -10.894465  7-cyano-7-deazaguanine synthase
HWH78_RS11145  0       53       -10.953593  hypothetical protein
HWH78_RS11150  0       54       -11.217045  KAP family P-loop domain protein
HWH78_RS11155  -2      48       -10.250323  hypothetical protein
HWH78_RS11160  -1      48       -9.256922   hypothetical protein
HWH78_RS11165  0       37       -6.019326   helix-turn-helix transcriptional regulator
HWH78_RS11170  0       33       -4.258720   hypothetical protein
HWH78_RS11175  2       28       -2.096943   hypothetical protein
HWH78_RS11180  2       23       -0.487471   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS10710  HTHFIS        333    e-110    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 333 bits (856), Expect = e-110
Identities = 136/364 (37%), Positives = 191/364 (52%), Gaps = 24/364 (6%)

Query: 304 ITAVQRADQRIRSTRRPGAFTARYRLEQLNGASKANREMLQLAKRFAASHSTILITGESG 363
I + RA + L G S A +E+ ++ R + T++ITGESG
Sbjct: 112 IGIIGRALAEPKRRPSKLE-DDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESG 170

Query: 364 TGKELLAQGIHNESPRRHGPFVAINCAAFPESLLESELFGYEEGAFSGSRKGGKPGLFEA 423
TGKEL+A+ +H+ RR+GPFVAIN AA P L+ESELFG+E+GAF+G+ + G FE
Sbjct: 171 TGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGA-QTRSTGRFEQ 229

Query: 424 AHRGTLFLDEIGDMPVSLQTRLLRVLQEREVLRLGGTEPISIDVRIVAATHKDLGAAMKD 483
A GTLFLDEIGDMP+ QTRLLRVLQ+ E +GG PI DVRIVAAT+KDL ++
Sbjct: 230 AEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQ 289

Query: 484 REFRTDLYYRLNILRLQTTPLRERPEDIPLICQGISQRLLVQGQPPGAAGITTLLLPYLV 543
FR DLYYRLN++ L+ PLR+R EDIP + + Q+ +G L +
Sbjct: 290 GLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDV--KRFDQEALELMK 347

Query: 544 RHSWPGNVRELENVIERAM-LSASEIFHEHGVDEHYLARVLPELFEGQPQPRSRKEPSRT 602
H WPGNVRELEN++ R L ++ ++ + + E S+
Sbjct: 348 AHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQA 407

Query: 603 KDLHAIGKTAQL-------------------RYIQETLDSCQGSLDEAARRLGISRTTLW 643
+ + A I L + +G+ +AA LG++R TL
Sbjct: 408 VEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLR 467

Query: 644 RRLR 647
+++R
Sbjct: 468 KKIR 471


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS10730  DHBDHDRGNASE  112    3e-32    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 112 bits (282), Expect = 3e-32
Identities = 71/253 (28%), Positives = 123/253 (48%), Gaps = 14/253 (5%)

Query: 10 ALDGRRALVTGASSGLGRHFAMTLAAAGAEVVVTARRQAPLQALVEAIEVAGGRAQAFAL 69
++G+ A +TGA+ G+G A TLA+ GA + L+ +V +++ A+AF
Sbjct: 5 GIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPA 64

Query: 70 DV----TCREDICRVLDAAGPLDVLVNNAGVSDSQPLLACDDQSWDRVLDTNLKGAWAVA 125
DV E R+ GP+D+LVN AGV + + D+ W+ N G + +
Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNAS 124

Query: 126 QESARRMVAAGKGGSLINVTSILASRVAGAVGPYLAAKAGLAHLTRAMALELARHGIRVN 185
+ ++ M+ + GS++ V S A ++ Y ++KA T+ + LELA + IR N
Sbjct: 125 RSVSKYMMDR-RSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183

Query: 186 ALAPGYVMTDLNEAFLASEAGDKLRSR---------IPSRRFSVPADLDGALLLLASDAG 236
++PG TD+ + A E G + + IP ++ + P+D+ A+L L S
Sbjct: 184 IVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQA 243

Query: 237 RAMSGAEIVVDGG 249
++ + VDGG
Sbjct: 244 GHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS10740  RTXTOXINA     26     0.026    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 26.5 bits (58), Expect = 0.026
Identities = 14/46 (30%), Positives = 22/46 (47%), Gaps = 1/46 (2%)

Query: 50 LIARRLAQGSNMTFGVAAGVFLFVFFCALSALYVYRANGEFDRLTQ 95
+IA+R AQG + + AAG+ A+S L +F R +
Sbjct: 291 IIAQRAAQGLSTS-AAAAGLIASAVTLAISPLSFLSIADKFKRANK 335


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS10785  HTHTETR       50     5e-10    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 50.4 bits (120), Expect = 5e-10
Identities = 30/190 (15%), Positives = 68/190 (35%), Gaps = 11/190 (5%)

Query: 3 RVGAEVRRQDFIEAAVKVIAEYGVANATTRRIAAAANSPLASLHYVFHTKDELFDAVYES 62
+ A+ RQ ++ A+++ ++ GV++ + IA AA ++++ F K +LF ++E
Sbjct: 6 KQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWEL 65

Query: 63 LIDKPQQSLLHVTA--GATAADSVAEILRQLVGWFTTHPE-----LATTQFELFFWNLRN 115
+ L A + EIL ++ T F +
Sbjct: 66 SESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAV 125

Query: 116 NPAMASKIYTDSVEATKQAIEQV--AGSVLDQEALATVSRLLINLFDGLLLAWSAHGDQE 173
+ +S + +Q ++ A + + ++ GL+ W +
Sbjct: 126 VQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF--APQ 183

Query: 174 RLNAETEAAC 183
+ + EA
Sbjct: 184 SFDLKKEARD 193


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS10815  ACRIFLAVINRP  73     2e-15    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 73.3 bits (180), Expect = 2e-15
Identities = 33/209 (15%), Positives = 80/209 (38%), Gaps = 9/209 (4%)

Query: 586 TTINRVVDAAKAFRSEYPMSGISIRLASGNAGVLAAINEEVEKSETPMLLYVYAAIALLV 645
T + + +P G+ + + EV K+ L + L++
Sbjct: 301 DTAKAIKAKLAELQPFFP-QGMKVLYPYDTTPFVQLSIHEVVKT----LFEAIMLVFLVM 355

Query: 646 FVVYRDLRAVLVCCLPLTIGTFIGYWFMKELQIGLTIATLPVMVLAVGIGVDYAFYIYNR 705
++ +++RA L+ + + + + + + T+ MVLA+G+ VD A +
Sbjct: 356 YLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVEN 415

Query: 706 LQLHLAHGQTITK-AVEYALLEVGVATIFTAITLAVGVATWAF---SELKFQADMGKLLA 761
++ + + K A E ++ ++ A + A+ L+ AF S +
Sbjct: 416 VERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIV 475

Query: 762 FMFVVNMVMAMTVLPAFAVWLERAFPRKR 790
+++++A+ + PA L + +
Sbjct: 476 SAMALSVLVALILTPALCATLLKPVSAEH 504



Score = 41.0 bits (96), Expect = 2e-05
Identities = 37/216 (17%), Positives = 75/216 (34%), Gaps = 11/216 (5%)

Query: 233 IADGASAVLEFCLLALLLTAGAVYWYCHSLRFTLLALVCSLASLVWQFGSLRLLGYGLDP 292
+ V++ A++L +Y + ++R TL+ + L+ F L GY ++
Sbjct: 333 VQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINT 392

Query: 293 LAVLVPFLVFAIGVSHGVQQINFIVREIAIGKS----AEEAARSSFTGLLVPGTLALVTA 348
L + L + V + + + R + K A E + S G LV + L
Sbjct: 393 LTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAV 452

Query: 349 LVSFVTLLLIPIPMVRELAITASLGVAYKIVTNLVMLPLMASLLRVDDKYAAAQEVSRQR 408
+ + R+ +IT +A ++ L++ P + + L K +A+ +
Sbjct: 453 FIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLL---KPVSAEHHENKG 509

Query: 409 R-SRWL-RGLARLAE--PRKAQWVLGAALAVFLAAI 440
W +LG+ L
Sbjct: 510 GFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLLIYA 545



Score = 35.6 bits (82), Expect = 0.001
Identities = 30/175 (17%), Positives = 60/175 (34%), Gaps = 18/175 (10%)

Query: 625 EVEKSETPMLLYVYAAIALLVFVV----YRDLRAVLVCCL--PLT-IGTFIGYWFMKELQ 677
E+ + A ++VF+ Y + L PL +G +
Sbjct: 863 YQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLF---N 919

Query: 678 IGLTIATLPVMVLAVGIGVDYAFYIYNRL-QLHLAHGQTITKAVEYALLEVGVATIFTAI 736
+ + ++ +G+ A I L G+ + +A A+ + T++
Sbjct: 920 QKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSL 979

Query: 737 TLAVGV-----ATWAFSELKFQADMGKLLAFMFVVNMVMAMTVLPAFAVWLERAF 786
+GV + A S Q +G + V ++A+ +P F V + R F
Sbjct: 980 AFILGVLPLAISNGAGSGA--QNAVGIGVMGGMVSATLLAIFFVPVFFVVIRRCF 1032


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS10830  HTHFIS        32     0.005    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 31.7 bits (72), Expect = 0.005
Identities = 21/113 (18%), Positives = 36/113 (31%), Gaps = 9/113 (7%)

Query: 276 RAAIGSPGTGVNGFRRSHLEALTTQRLMGRLAGAPAVATIDQVRMVSLMTQDDRAARQFV 335
R P + R +E + + A A + + + ++ R
Sbjct: 364 RLTALYPQDVI---TREIIE-NELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASF 419

Query: 336 LSTLGRLATEPSVL-----QRSLHAFLANGCNVTQTAEALGTHRNTLLRRLER 383
L VL L A A N + A+ LG +RNTL +++
Sbjct: 420 GDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRE 472


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS10855  cloacin       47     2e-08    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 47.0 bits (111), Expect = 2e-08
Identities = 37/115 (32%), Positives = 46/115 (40%), Gaps = 15/115 (13%)

Query: 17 GGDGGGTGGG--------DGGGTGGGTGGGGDGGTGGGD------GGTGGGDGNGGTGGG 62
GGDG G G +GG TG G GGG G+G GG+G G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 63 DGDGGGTGGGGDGGGDGQCDPAKDPNKCGSGSSISGDGDCKVAIQCNGDAIQCAI 117
GG GG G G P G ++S G +A+ + A+ AI
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGF-PALSTPGAGGLAVSISAGALSAAI 116



Score = 40.1 bits (93), Expect = 5e-06
Identities = 24/58 (41%), Positives = 25/58 (43%), Gaps = 6/58 (10%)

Query: 16 PGGDGGGTGGGDGGGTG------GGTGGGGDGGTGGGDGGTGGGDGNGGTGGGDGDGG 67
P G G G G DG G GG G G GG G GGG+GN G G G G
Sbjct: 24 PTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNL 81



Score = 38.2 bits (88), Expect = 2e-05
Identities = 23/58 (39%), Positives = 27/58 (46%), Gaps = 10/58 (17%)

Query: 13 PKDPGGDGGGTGG----------GDGGGTGGGTGGGGDGGTGGGDGGTGGGDGNGGTG 60
P G GG + G G G G+G GGG G GGG+G +GGG G GG
Sbjct: 24 PTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNL 81



Score = 31.6 bits (71), Expect = 0.002
Identities = 17/39 (43%), Positives = 22/39 (56%), Gaps = 3/39 (7%)

Query: 15 DPGGDGGGTGGGDGGGTGGGTGGGGD---GGTGGGDGGT 50
+P G G G+G GGG+G G GGG GG+G G +
Sbjct: 44 NPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLS 82



Score = 27.8 bits (61), Expect = 0.048
Identities = 14/31 (45%), Positives = 18/31 (58%), Gaps = 1/31 (3%)

Query: 11 TDPKDPGGDGGGTGGGDGGGTGGGTGGGGDG 41
+ GG G G GGG G +GGG+G GG+
Sbjct: 52 SGIHWGGGSGHGNGGG-NGNSGGGSGTGGNL 81


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS10890  IGASERPTASE   27     0.044    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 27.0 bits (59), Expect = 0.044
Identities = 18/129 (13%), Positives = 38/129 (29%), Gaps = 23/129 (17%)

Query: 14 LEQPKRGRGRPATGKALTDAERARRYRANKKKRDDQPSRKDAPSIPADGVKEILDGWQRT 73
Q D + D+ P AP+ P++ + + + ++
Sbjct: 988 RNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQE 1047

Query: 74 QDELDQALQRIAELE-----------------------AELASRVTKKEEAEAKTWAIQE 110
+++ Q E A+ S + + E K A E
Sbjct: 1048 SKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVE 1107

Query: 111 RKGKARWQT 119
++ KA+ +T
Sbjct: 1108 KEEKAKVET 1116


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS10910  cloacin       47     2e-08    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 47.0 bits (111), Expect = 2e-08
Identities = 37/115 (32%), Positives = 46/115 (40%), Gaps = 15/115 (13%)

Query: 17 GGDGGGTGGG--------DGGGTGGGTGGGGDGGTGGGD------GGTGGGDGNGGTGGG 62
GGDG G G +GG TG G GGG G+G GG+G G GG G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 63 DGDGGGTGGGGDGGGDGQCDPAKDPNKCGSGSSISGDGDCKVAIQCNGDAIQCAI 117
GG GG G G P G ++S G +A+ + A+ AI
Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFGF-PALSTPGAGGLAVSISAGALSAAI 116



Score = 40.1 bits (93), Expect = 5e-06
Identities = 24/58 (41%), Positives = 25/58 (43%), Gaps = 6/58 (10%)

Query: 16 PGGDGGGTGGGDGGGTG------GGTGGGGDGGTGGGDGGTGGGDGNGGTGGGDGDGG 67
P G G G G DG G GG G G GG G GGG+GN G G G G
Sbjct: 24 PTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNL 81



Score = 38.2 bits (88), Expect = 2e-05
Identities = 23/58 (39%), Positives = 27/58 (46%), Gaps = 10/58 (17%)

Query: 13 PKDPGGDGGGTGG----------GDGGGTGGGTGGGGDGGTGGGDGGTGGGDGNGGTG 60
P G GG + G G G G+G GGG G GGG+G +GGG G GG
Sbjct: 24 PTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNL 81



Score = 31.6 bits (71), Expect = 0.002
Identities = 17/39 (43%), Positives = 22/39 (56%), Gaps = 3/39 (7%)

Query: 15 DPGGDGGGTGGGDGGGTGGGTGGGGD---GGTGGGDGGT 50
+P G G G+G GGG+G G GGG GG+G G +
Sbjct: 44 NPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLS 82



Score = 27.8 bits (61), Expect = 0.048
Identities = 14/31 (45%), Positives = 18/31 (58%), Gaps = 1/31 (3%)

Query: 11 TDPKDPGGDGGGTGGGDGGGTGGGTGGGGDG 41
+ GG G G GGG G +GGG+G GG+
Sbjct: 52 SGIHWGGGSGHGNGGG-NGNSGGGSGTGGNL 81


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS10945  IGASERPTASE   27     0.044    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 27.0 bits (59), Expect = 0.044
Identities = 18/129 (13%), Positives = 38/129 (29%), Gaps = 23/129 (17%)

Query: 14 LEQPKRGRGRPATGKALTDAERARRYRANKKKRDDQPSRKDAPSIPADGVKEILDGWQRT 73
Q D + D+ P AP+ P++ + + + ++
Sbjct: 988 RNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQE 1047

Query: 74 QDELDQALQRIAELE-----------------------AELASRVTKKEEAEAKTWAIQE 110
+++ Q E A+ S + + E K A E
Sbjct: 1048 SKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVE 1107

Query: 111 RKGKARWQT 119
++ KA+ +T
Sbjct: 1108 KEEKAKVET 1116


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS10995  IGASERPTASE   27     0.044    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 27.0 bits (59), Expect = 0.044
Identities = 18/129 (13%), Positives = 38/129 (29%), Gaps = 23/129 (17%)

Query: 14 LEQPKRGRGRPATGKALTDAERARRYRANKKKRDDQPSRKDAPSIPADGVKEILDGWQRT 73
Q D + D+ P AP+ P++ + + + ++
Sbjct: 988 RNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQE 1047

Query: 74 QDELDQALQRIAELE-----------------------AELASRVTKKEEAEAKTWAIQE 110
+++ Q E A+ S + + E K A E
Sbjct: 1048 SKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVE 1107

Query: 111 RKGKARWQT 119
++ KA+ +T
Sbjct: 1108 KEEKAKVET 1116


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS11045  IGASERPTASE   27     0.044    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 27.0 bits (59), Expect = 0.044
Identities = 18/129 (13%), Positives = 38/129 (29%), Gaps = 23/129 (17%)

Query: 14 LEQPKRGRGRPATGKALTDAERARRYRANKKKRDDQPSRKDAPSIPADGVKEILDGWQRT 73
Q D + D+ P AP+ P++ + + + ++
Sbjct: 988 RNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQE 1047

Query: 74 QDELDQALQRIAELE-----------------------AELASRVTKKEEAEAKTWAIQE 110
+++ Q E A+ S + + E K A E
Sbjct: 1048 SKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVE 1107

Query: 111 RKGKARWQT 119
++ KA+ +T
Sbjct: 1108 KEEKAKVET 1116


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS11085  RTXTOXIND     39     2e-05    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 39.4 bits (92), Expect = 2e-05
Identities = 15/150 (10%), Positives = 51/150 (34%), Gaps = 5/150 (3%)

Query: 83 AQADVAQEREQLARERLDYQNQIRQAESRIQQLEGQCAGLTEQFQAAQQALLQEQQ---- 138
++ +V + + + +QNQ Q E + + + + + + E+
Sbjct: 180 SEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDD 239

Query: 139 LRQQAEVENARLQQANHDQEARLQDRDGQIRSLEDKHQHARDALEHYRQASKEQREQEQR 198
+ A + A +QE + + ++R + + + + ++ + + +
Sbjct: 240 FSSLLH-KQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKN 298

Query: 199 RHESQVQQLQLELRQLQQTLIVKQDELTHL 228
+++Q + L L ++
Sbjct: 299 EILDKLRQTTDNIGLLTLELAKNEERQQAS 328



Score = 36.0 bits (83), Expect = 2e-04
Identities = 28/198 (14%), Positives = 65/198 (32%), Gaps = 7/198 (3%)

Query: 125 QFQAAQQALLQEQQLRQQAEVENARLQQANHDQEARLQDRDGQIRSLEDKHQHARDALEH 184
Q +LLQ + + + ++ + ++ + + Q S E+ + E
Sbjct: 135 DTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQ 194

Query: 185 Y--RQASKEQREQEQRRHESQVQQLQLELRQLQQTLIVKQDELTHLNRDNARLLAEARQQ 242
+ Q K Q+E + ++ + + + + V++ L D + LL +
Sbjct: 195 FSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLD----DFSSLLHKQAIA 250

Query: 243 QKDQHAQQKLLTQKAQALEVAQNTLTSIERTNEALEQRCHALQDEVTRLGEASSIQAQ-Q 301
+ Q+ + L V ++ L IE + ++ + Q
Sbjct: 251 KHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDN 310

Query: 302 TQSLQERLAKATAQLKLL 319
L LAK + +
Sbjct: 311 IGLLTLELAKNEERQQAS 328



Score = 34.4 bits (79), Expect = 6e-04
Identities = 28/199 (14%), Positives = 65/199 (32%), Gaps = 4/199 (2%)

Query: 125 QFQAAQQALLQEQQLRQQAEVENARLQQANHDQEARLQDRDGQIRSLEDKHQHARDALEH 184
+ LL+ L +A+ + E + L + +
Sbjct: 116 ESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPY 175

Query: 185 YRQASKEQREQEQRRHESQVQQLQLELRQLQQTLIVKQDELTHLNRDNARLLAEARQQQK 244
++ S+E+ + + Q Q + Q + L K+ E + R +R ++
Sbjct: 176 FQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKS 235

Query: 245 DQHAQQKLLTQKA---QALEVAQNTLTSIERTNEALEQRCHALQDEVTRLGEASSIQAQQ 301
LL ++A A+ +N + + ++ E+ E + Q
Sbjct: 236 RLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQL 295

Query: 302 TQSLQ-ERLAKATAQLKLL 319
++ ++L + T + LL
Sbjct: 296 FKNEILDKLRQTTDNIGLL 314


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS11120  SACTRNSFRASE  43     1e-07    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 42.6 bits (100), Expect = 1e-07
Identities = 15/55 (27%), Positives = 25/55 (45%)

Query: 86 IDEFYIDEQCRGQGFGSEILEKVKSFLKSQGAAIVHLEVDENNPKAVSFYKKSGF 140
I++ + + R +G G+ +L K + K + LE + N A FY K F
Sbjct: 92 IEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS11175  OUTRSURFACE   25     0.016    Outer surface protein signature.
		>OUTRSURFACE#Outer surface protein signature.

Length = 273

Score = 25.3 bits (55), Expect = 0.016
Identities = 15/44 (34%), Positives = 26/44 (59%), Gaps = 2/44 (4%)

Query: 16 DGSGDAIIELPSALLDGLGWVVGDELILEHIEGVLSLKRKLSPS 59
DG+G A L + L+G V D++ LE EG ++L ++++ S
Sbjct: 153 DGTGKAKEVLKNFTLEGK--VANDKVTLEVKEGTVTLSKEIAKS 194


35  HWH78_RS11240  HWH78_RS11340  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias     Product
HWH78_RS11240  2       9        3.266890    glycosyltransferase family 4 protein
HWH78_RS11245  2       10       2.366255    cellulase family glycosylhydrolase
HWH78_RS11250  1       10       3.488502    glycosyltransferase family 4 protein
HWH78_RS11255  1       10       3.465982    glycosyltransferase family 4 protein
HWH78_RS11260  2       11       3.382486    membrane protein
HWH78_RS11265  2       10       3.468445    murein biosynthesis integral membrane protein
HWH78_RS11270  -2      12       2.228445    acyltransferase family protein
HWH78_RS11275  1       12       2.788096    FAD-dependent oxidoreductase
HWH78_RS11280  1       11       1.752604    DNA topoisomerase IB
HWH78_RS11285  2       11       1.823559    inner centromere protein
HWH78_RS11290  1       10       1.299725    Lrp/AsnC family transcriptional regulator
HWH78_RS11295  0       12       1.300512    3-methyl-2-oxobutanoate dehydrogenase
HWH78_RS11300  -1      13       2.164507    alpha-ketoacid dehydrogenase subunit beta
HWH78_RS11305  -2      12       2.098461    2-oxo acid dehydrogenase subunit E2
HWH78_RS11310  -2      14       2.088812    dihydrolipoyl dehydrogenase
HWH78_RS11315  -1      15       1.948697    transcriptional regulator
HWH78_RS11320  0       17       2.900443    asparaginase
HWH78_RS11325  1       14       3.233516    paerucumarin biosynthesis protein PvcA
HWH78_RS11330  -1      13       2.645000    paerucumarin biosynthesis oxygenase PvcB
HWH78_RS11335  -1      12       3.137511    paerucumarin biosynthesis protein PvcC
HWH78_RS11340  -2      13       3.387812    paerucumarin biosynthesis heme-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS11240  TCRTETOQM     29     0.035    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 29.1 bits (65), Expect = 0.035
Identities = 8/28 (28%), Positives = 15/28 (53%)

Query: 193 LYFGFIYRGKGIEDLLEALADLFASAPE 220
+Y G GI++L+E + + F S+
Sbjct: 216 VYHGSAKNNIGIDNLIEVITNKFYSSTH 243


36  HWH78_RS11475  HWH78_RS11635  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias     Product
HWH78_RS11475  0       12       3.056725    hypothetical protein
HWH78_RS11480  1       12       3.595152    M48 family metallopeptidase
HWH78_RS11485  0       10       2.887311    hypothetical protein
HWH78_RS11490  1       10       2.645806    hypothetical protein
HWH78_RS11495  2       11       2.316615    M48 family metallopeptidase
HWH78_RS11500  2       14       1.886302    hypothetical protein
HWH78_RS11505  2       14       2.528784    hypothetical protein
HWH78_RS11510  1       13       2.610999    TonB-dependent receptor
HWH78_RS11515  0       14       3.101864    glucose/quinate/shikimate family membrane-bound
HWH78_RS11520  -2      13       2.719491    carbohydrate porin OprB
HWH78_RS11525  0       11       3.585587    gamma-butyrobetaine hydroxylase-like
HWH78_RS11530  0       10       2.518958    HEAT repeat domain-containing protein
HWH78_RS11535  0       11       1.368201    ABC transporter ATP-binding protein
HWH78_RS11540  0       11       0.878055    ABC transporter permease
HWH78_RS11545  -2      12       0.866278    ABC transporter substrate-binding protein
HWH78_RS11550  -2      11       3.535827    ferredoxin family protein
HWH78_RS11555  -1      11       3.556990    fumarate reductase/succinate dehydrogenase
HWH78_RS11560  -1      11       4.053090    GntR family transcriptional regulator
HWH78_RS11565  -1      10       4.785600    chitinase
HWH78_RS11570  -1      11       4.875947    YbaK/EbsC family protein
HWH78_RS11575  -1      11       4.686621    amino acid adenylation domain-containing
HWH78_RS11580  -1      11       3.824830    TauD/TfdA family dioxygenase
HWH78_RS11585  -2      12       3.688061    TauD/TfdA family dioxygenase
HWH78_RS11590  -2      12       3.466568    non-ribosomal peptide synthetase
HWH78_RS11595  -3      15       1.187809    LysE family translocator
HWH78_RS11600  -2      13       1.792966    ABC transporter permease
HWH78_RS11605  4       11       0.508702    ABC transporter ATP-binding protein
HWH78_RS11610  5       12       1.821092    ABC transporter substrate-binding protein
HWH78_RS11615  6       11       2.548460    TauD/TfdA family dioxygenase
HWH78_RS11620  5       12       3.026599    hypothetical protein
HWH78_RS11625  3       10       2.051270    XRE family transcriptional regulator
HWH78_RS11630  2       9        1.932291    hypothetical protein
HWH78_RS11635  -2      9        3.156588    MFS transporter
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS11635  TCRTETB       46     2e-07    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 45.6 bits (108), Expect = 2e-07
Identities = 35/170 (20%), Positives = 73/170 (42%), Gaps = 7/170 (4%)

Query: 35 FVAILSETLPAGLLPQIGAGLAVSEALAGQLVSVYALGSLLAALPAASLTQGWRRRRVLL 94
F ++L+E + LP I A + + + L + L+ +R+LL
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 95 LALLIFFVCNSLTAIS-SDYRLTLLARFGSGVAAGLAWGLLAGYARRLVPPEQQGRALAV 153
++I + + + S + L ++ARF G A L+ R +P E +G+A
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAF-- 141

Query: 154 AMLGAPLALSLGVPLGTWLGGLLG--WRWAFGLLSLTALLLVGWVLRSVP 201
++G+ +++G +G +GG++ W++ LL ++ L +
Sbjct: 142 GLIGS--IVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLL 189


37  HWH78_RS11685  HWH78_RS11710  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias     Product
HWH78_RS11685  0       13       3.055318    acyl-CoA dehydrogenase family protein
HWH78_RS11690  -2      12       3.452743    SfnB family sulfur acquisition oxidoreductase
HWH78_RS11695  -2      11       3.288586    LLM class flavin-dependent oxidoreductase
HWH78_RS11700  0       10       3.975364    ABC transporter permease
HWH78_RS11705  -1      10       3.797575    ABC transporter substrate-binding protein
HWH78_RS11710  -2      11       3.882002    ABC transporter ATP-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS11710  PF05272       29     0.025    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.9 bits (64), Expect = 0.025
Identities = 10/23 (43%), Positives = 14/23 (60%)

Query: 37 VVSILGPSGVGKSSLLRVLAGLQ 59
V + G G+GKS+L+ L GL
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLD 620


38  HWH78_RS11760  HWH78_RS11790  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias     Product
HWH78_RS11760  -1      13       3.308865    sugar ABC transporter permease
HWH78_RS11765  -2      13       3.772557    carbohydrate ABC transporter permease
HWH78_RS11770  -2      13       4.395926    sn-glycerol-3-phosphate ABC transporter
HWH78_RS11775  0       12       4.847659    mannitol dehydrogenase family protein
HWH78_RS11780  0       13       4.704475    xylulokinase
HWH78_RS11785  1       13       3.568903    carbohydrate kinase
HWH78_RS11790  0       13       3.392147    NAD(P)/FAD-dependent oxidoreductase
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS11770  PF05272       31     0.009    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 31.2 bits (70), Expect = 0.009
Identities = 27/129 (20%), Positives = 42/129 (32%), Gaps = 26/129 (20%)

Query: 32 VVFVGPSGCGKSTLLRLIAGLEEASGGSIALDGTDITDTPPAKRDLAMVFQTYALYPHMT 91
VV G G GKSTL+ + GL+ S + +D Y
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDI---------GTGKDSYEQIAGIVAY---- 645

Query: 92 VRRNLSFALDLAGVDKREVAA-KVDAAARILELQALLERKPRQLSGGQRQRVAIGRAIVR 150
LS ++ + + A K ++R + R + RQ V
Sbjct: 646 ---ELS---EMTAFRRADAEAVKAFFSSRKDRYRGAYGRYVQDHP---RQVVIWCTT--- 693

Query: 151 NPKIFLFDE 159
N + +LFD
Sbjct: 694 NKRQYLFDI 702


39  HWH78_RS11845  HWH78_RS11870  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias     Product
HWH78_RS11845  2       12       2.403814    FMNH2-dependent alkanesulfonate monooxygenase
HWH78_RS11850  3       13       2.448126    FMN reductase
HWH78_RS11855  4       12       2.854828    hypothetical protein
HWH78_RS11860  4       12       2.632823    sigma-54-dependent transcriptional regulator
HWH78_RS11865  4       11       2.474708    type VI secretion system protein TssA
HWH78_RS11870  2       10       1.659937    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS11860  HTHFIS        362    e-125    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 362 bits (932), Expect = e-125
Identities = 132/350 (37%), Positives = 184/350 (52%), Gaps = 13/350 (3%)

Query: 8 HARELTKSVRATVLVFNDPRSRELLERIERLAPSEANALVIGETGTGKELVARHIHALSG 67
++ S LV +E+ + RL ++ ++ GE+GTGKELVAR +H
Sbjct: 126 PSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGK 185

Query: 68 RNGGPFVAVNCGAFAESLVESELFGHEKGAFTGALQSKAGWFEAANGGTLFLDEIGDLPP 127
R GPFVA+N A L+ESELFGHEKGAFTGA G FE A GGTLFLDEIGD+P
Sbjct: 186 RRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPM 245

Query: 128 SIQVKLLRVLQEREVVRLGSRRPIPIDVRLVAATNVDLADAVVAGHFREDLFYRLHVATI 187
Q +LLRVLQ+ E +G R PI DVR+VAATN DL ++ G FREDL+YRL+V +
Sbjct: 246 DAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPL 305

Query: 188 QLPPLRERRGDILPLAEYFIVEHCRRLGYTSASLSPEAERKLLGHSWAGNIRELENAIHH 247
+LPPLR+R DI L +F+ + + G EA + H W GN+RELEN +
Sbjct: 306 RLPPLRDRAEDIPDLVRHFV-QQAEKEGLDVKRFDQEALELMKAHPWPGNVRELENLVRR 364

Query: 248 ALLVCRNRLIQPADLH-----------LIDMRARQEPSGLRRAPESAAGSALEAALQALF 296
+ +I + + AR + +A E + AL
Sbjct: 365 LTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALP 424

Query: 297 EEN-REDLYEHIEETVFRAAYRFCHGNQLQTGRLLGISRNIVRARLEKIG 345
+ + +E + AA GNQ++ LLG++RN +R ++ ++G
Sbjct: 425 PSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELG 474


40  HWH78_RS12260  HWH78_RS12520  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias     Product
HWH78_RS12260  4       14       3.462474    DUF1656 domain-containing protein
HWH78_RS12265  3       14       3.272776    FUSC family protein
HWH78_RS12270  3       12       3.984358    LysR family transcriptional regulator
HWH78_RS12275  3       13       4.718879    hypothetical protein
HWH78_RS12280  3       12       5.583091    DUF4946 domain-containing protein
HWH78_RS12285  2       11       5.403539    cadmium-translocating P-type ATPase
HWH78_RS12290  1       9        4.993809    hypothetical protein
HWH78_RS12295  1       8        5.215346    protease modulator HflK
HWH78_RS12300  0       8        4.434229    protease modulator HflC
HWH78_RS12305  -1      7        3.710003    protease modulator HflK
HWH78_RS12310  -2      11       2.367838    polysaccharide deacetylase family protein
HWH78_RS12315  -2      13       2.162676    hypothetical protein
HWH78_RS12320  -2      13       2.225042    glycine cleavage system aminomethyltransferase
HWH78_RS12325  -2      13       2.131132    L-serine ammonia-lyase
HWH78_RS12330  -2      14       2.263854    serine hydroxymethyltransferase
HWH78_RS12335  -2      14       2.781977    aminomethyl-transferring glycine dehydrogenase
HWH78_RS12340  0       11       3.406411    glycine cleavage system protein GcvH
HWH78_RS12345  0       10       3.313368    LysR family transcriptional regulator
HWH78_RS12350  -1      11       3.188086    amidohydrolase
HWH78_RS12355  0       12       2.778345    sigma-54-dependent transcriptional regulator
HWH78_RS12360  2       13       1.841007    hypothetical protein
HWH78_RS12365  3       16       -1.148728   esterase family protein
HWH78_RS12370  4       21       -2.297388   YgdI/YgdR family lipoprotein
HWH78_RS12375  4       25       -3.170318   class I SAM-dependent methyltransferase
HWH78_RS12380  5       30       -5.922595   N-acetyltransferase
HWH78_RS12385  5       35       -7.888426   DUF6531 domain-containing protein
HWH78_RS12390  4       42       -9.271985   RHS repeat protein
HWH78_RS12395  5       38       -8.010142   ankyrin repeat domain-containing protein
HWH78_RS12400  7       45       -10.867013  hypothetical protein
HWH78_RS12405  4       17       -5.700537   hypothetical protein
HWH78_RS12410  1       11       -2.275805   hypothetical protein
HWH78_RS12415  0       9        -0.146425   hypothetical protein
HWH78_RS12420  1       12       2.027664    PepSY domain-containing protein
HWH78_RS12425  1       12       2.253561    ferrioxamine receptor FoxA
HWH78_RS12430  1       14       2.482477    anti-sigma factor FoxR
HWH78_RS12435  1       14       4.361279    sigma-70 family RNA polymerase sigma factor
HWH78_RS12440  0       14       3.897988    LysR family transcriptional regulator
HWH78_RS12445  -1      14       3.604429    gentisate 1,2-dioxygenase
HWH78_RS12450  -1      12       3.417691    fumarylacetoacetate hydrolase family protein
HWH78_RS12455  0       11       3.084726    aromatic acid/H+ symport family MFS transporter
HWH78_RS12460  0       8        3.045757    maleylacetoacetate isomerase
HWH78_RS12465  1       9        3.049492    transporter
HWH78_RS12470  0       9        3.174834    cytochrome P450
HWH78_RS12475  2       10       3.858594    thiol:disulfide interchange protein DsbG
HWH78_RS12485  3       8        3.798539    TlpA family protein disulfide reductase
HWH78_RS12490  1       10       3.476421    protein-disulfide reductase DsbD
HWH78_RS12495  0       8        2.954655    response regulator
HWH78_RS12500  1       9        2.599577    two-component sensor histidine kinase
HWH78_RS12505  0       11       1.671771    c-type cytochrome
HWH78_RS12510  1       11       2.609928    cytochrome c4
HWH78_RS12515  1       11       3.121292    LLM class flavin-dependent oxidoreductase
HWH78_RS12520  0       13       3.242008    TetR/AcrR family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS12350  UREASE        37     2e-04    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 37.0 bits (86), Expect = 2e-04
Identities = 18/36 (50%), Positives = 22/36 (61%)

Query: 522 YTRNAARTIGLEHRIGSLEPGKQADFIVLDRDVFEV 557
YT N A GL H IGSLE GK+AD ++ + F V
Sbjct: 409 YTINPAIAHGLSHEIGSLEVGKRADLVLWNPAFFGV 444



Score = 30.9 bits (70), Expect = 0.018
Identities = 23/79 (29%), Positives = 37/79 (46%), Gaps = 10/79 (12%)

Query: 19 HALGAADLLVVNARIFTANPQQPFAEALAVEDGRILAVGDEAGLRALADGDSQVVDLG-- 76
GA D ++ NA I + + ++DGRI A+G +AG + G + +V G
Sbjct: 63 REGGAVDTVITNALIL--DHWGIVKADIGLKDGRIAAIG-KAGNPDMQPGVTIIVGPGTE 119

Query: 77 -----GKRLMPGLIDTHSH 90
GK + G +D+H H
Sbjct: 120 VIAGEGKIVTAGGMDSHIH 138


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS12355  HTHFIS        331    e-111    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 331 bits (851), Expect = e-111
Identities = 120/355 (33%), Positives = 179/355 (50%), Gaps = 35/355 (9%)

Query: 186 ERLAALHHDHAEGFEMLLGDSQPIRTLKTRAQRVAALDAPLLIHGETGTGKELVARGCHA 245
+R + D ++ L+G S ++ + R+ D L+I GE+GTGKELVAR H
Sbjct: 123 KRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHD 182

Query: 246 LSARHNSPFLALNCAALPENLAESELFGYAPGAFTGAQRGGKPGLLELAHQGTVFLDEIG 305
R N PF+A+N AA+P +L ESELFG+ GAFTGAQ G E A GT+FLDEIG
Sbjct: 183 YGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRST-GRFEQAEGGTLFLDEIG 241

Query: 306 EMSPYLQAKLLRFLSDGSFRRVGGDREVRVDVRILSATHRNLEKMVAEGSFREDLFYRLN 365
+M Q +LLR L G + VGG +R DVRI++AT+++L++ + +G FREDL+YRLN
Sbjct: 242 DMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLN 301

Query: 366 VLSLEVPPLRERGHDILLLARHFMQQACAQIQRPVCRLAPGTYPALLNNRWPGNVRQLQN 425
V+ L +PPLR+R DI L RHF+QQA + V R + + WPGNVR+L+N
Sbjct: 302 VVPLRLPPLRDRAEDIPDLVRHFVQQAE-KEGLDVKRFDQEALELMKAHPWPGNVRELEN 360

Query: 426 VIFRAAAICESSLVDIGDLEIAGTAVARQND----------------------------- 456
++ R A+ ++ +E + +
Sbjct: 361 LVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFG 420

Query: 457 ---GEVGSLEEAVEGFEKALLEKLYVSYPSTRQLAAR-LQTSHTAIAHRLRKYGI 507
G + + E L+ + + AA L + + ++R+ G+
Sbjct: 421 DALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS12380  SACTRNSFRASE  33     4e-04    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 32.6 bits (74), Expect = 4e-04
Identities = 18/68 (26%), Positives = 27/68 (39%), Gaps = 4/68 (5%)

Query: 103 LSHAYRRRGLGAHLFHLAATQARHLGASGLYVSATPSQN--TVDFYTRLGCRLCMEPDEE 160
++ YR++G+G L H A A+ GL + T N FY + + D
Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKENHFCGLMLE-TQDINISACHFYAKHHFIIG-AVDTM 154

Query: 161 LYRLEPED 168
LY P
Sbjct: 155 LYSNFPTA 162


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
HWH78_RS12415	PF01540	27	0.030	Adhesin lipoprotein
		>PF01540#Adhesin lipoprotein

Length = 475

Score = 27.4 bits (60), Expect = 0.030
Identities = 11/30 (36%), Positives = 20/30 (66%)

Query: 60 EYPAIISRLRVAIESYQGRVKWVLDEHKRV 89
+YPAIIS+L A+E+ + + V +K++
Sbjct: 87 DYPAIISKLSAAVENAKSEQQKVDQANKKI 116


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
HWH78_RS12430	TYPE3OMGPROT	34	0.002	Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature.

Length = 607

Score = 34.1 bits (78), Expect = 0.002
Identities = 19/81 (23%), Positives = 26/81 (32%), Gaps = 10/81 (12%)

Query: 15 LDFPRASRLSRSVRAALLSLAMAAGAAPLCASAAEAAAEQARPYAIPAGQ--LGDVLNRF 72
+ FP S R + LL L+ + A L PY A L D+L F
Sbjct: 1 MAFPLHSFFKRVLTGTLLLLSSYSWAQEL--------DWLPIPYVYVAKGESLRDLLTDF 52

Query: 73 AREAGITLSATPAQTGGYSSQ 93
T+ + S Q
Sbjct: 53 GANYDATVVVSDKINDKVSGQ 73


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
HWH78_RS12445	MPTASEINHBTR	28	0.030	Metalloprotease inhibitor signature.
		>MPTASEINHBTR#Metalloprotease inhibitor signature.

Length = 122

Score = 27.7 bits (61), Expect = 0.030
Identities = 13/43 (30%), Positives = 20/43 (46%)

Query: 59 RGLRPTPYGMTLFNHAQRVLTEMERARQNLEAMRSGSGSRVLL 101
PTP G+ L N +T + R ++ R+ SG+ V L
Sbjct: 76 VSWSPTPDGIWLMNAEGTGITHLNRQKEGEYTGRTPSGADVTL 118


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
HWH78_RS12460	TCRTETB	49	3e-08	Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 48.7 bits (116), Expect = 3e-08
Identities = 39/178 (21%), Positives = 72/178 (40%), Gaps = 3/178 (1%)

Query: 31 LCFLIVAMDGFDTAAIGFIAPALAHDWQLSPAQLSPILGAALAGLALGAFAAGPLADRFG 90
LC L + + P +A+D+ PA + + A + ++G G L+D+ G
Sbjct: 19 LCIL-SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLG 77

Query: 91 RKSVLLLSVLFFGGWSLASAYAGS-VETLALLRFFTGLGLGGAMPNAITLTSEYCPRRHR 149
K +LL ++ S+ S L + RF G G + + + Y P+ +R
Sbjct: 78 IKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENR 137

Query: 150 ALMVTAMFCGFTLGSALGGLLAARMVPALGWESVLLLGGGLPLASLPLLWACLPESVR 207
+ +G +G + + + W S LLL + + ++P L L + VR
Sbjct: 138 GKAFGLIGSIVAMGEGVGPAIGGMIAHYIHW-SYLLLIPMITIITVPFLMKLLKKEVR 194



Score = 30.6 bits (69), Expect = 0.011
Identities = 36/196 (18%), Positives = 71/196 (36%), Gaps = 13/196 (6%)

Query: 256 AELRGGTLLLWATF--FMGLLIIYLLTNWLPTLIGGTGFSLGEAATISAMFQLGGTLGAL 313
+ LR +L+W F +L +L LP + ++ F L ++G
Sbjct: 8 SNLRHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTA 67

Query: 314 LLGSAMDRFDAHRVLSLAYVGGALFILG--IASLYHSFA---LLALCVAGVGFCISGSQV 368
+ G D+ R+L G + G I + HSF ++A + G G + V
Sbjct: 68 VYGKLSDQLGIKRLLLF---GIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALV 124

Query: 369 GANALAADFYPTRSRATGVSWALGLGRIGSIVGSLSGGALLG-LGLGFSGILALLVIPAL 427
+ A + P +R + +G VG GG + + + ++ ++ I +
Sbjct: 125 M--VVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITV 182

Query: 428 LAAVAVHRLGRRRARP 443
+ + + R
Sbjct: 183 PFLMKLLKKEVRIKGH 198


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
HWH78_RS12495	HTHFIS	83	2e-20	FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 82.6 bits (204), Expect = 2e-20
Identities = 38/122 (31%), Positives = 62/122 (50%), Gaps = 1/122 (0%)

Query: 2 HVLLTEDDDLIASGIVAGLNAQGLTVDRVASAADTQALLQVARFDVLVLDLGLPDEDGLR 61
+L+ +DD I + + L+ G V ++AA + D++V D+ +PDE+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 LLQRLRQQGVDLPVLVLTARDAVTDRVAGLQAGADDYLLKPFDLRELGARLHT-LQRRSA 120
LL R+++ DLPVLV++A++ + + GA DYL KPFDL EL + L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 121 GR 122

Sbjct: 125 RP 126


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
HWH78_RS12520	HTHTETR	51	4e-10	TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 50.8 bits (121), Expect = 4e-10
Identities = 37/201 (18%), Positives = 66/201 (32%), Gaps = 11/201 (5%)

Query: 1 MAKRGRPCGFD-REQALRRALDVFWEAGYEGATMAALKEAMGGICAPSMYAAYGSKEALF 59
MA++ + + R+ L AL +F + G ++ + +A G+ ++Y + K LF
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAA-GVTRGAIYWHFKDKSDLF 59

Query: 60 RSAVELYLSQECQLSKGAFA------LPTARESIAALLESAAVSYTTEGKPRGCLVDLST 113
EL S +L A L RE + +LES T E + +
Sbjct: 60 SEIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLEST---VTEERRRLLMEIIFHK 116

Query: 114 TNFSPANKGVEDYLRDHRRRAARLLRERFARGVADGDVPAGADLDALTSFYSSVLQGLSI 173
F V+ R+ + + + + +PA + GL
Sbjct: 117 CEFVGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLME 176

Query: 174 QARDGASRQQLLAIGRCAMAA 194
L R +A
Sbjct: 177 NWLFAPQSFDLKKEARDYVAI 197


41	HWH78_RS12935	HWH78_RS13275	Y	Y	Y	Pathogenicity Island (biased-composition)
LocusTag	DNBias	CDNBias	%GCBias	Product
HWH78_RS12935	-2	20	-3.227933	hypothetical protein
HWH78_RS12940	-2	20	-3.416090	sensor domain-containing phosphodiesterase
HWH78_RS12945	2	23	-4.934220	hypothetical protein
HWH78_RS12950	1	25	-4.723245	hypothetical protein
HWH78_RS12955	0	33	-6.431715	PA-I galactophilic lectin
HWH78_RS12960	0	32	-6.826085	site-specific integrase
HWH78_RS12965	1	40	-8.133193	type II toxin-antitoxin system RelE/ParE family
HWH78_RS12970	1	42	-9.042077	DNA-binding transcriptional regulator
HWH78_RS12975	1	44	-9.235155	TIGR02391 family protein
HWH78_RS12980	1	46	-10.348336	hypothetical protein
HWH78_RS12985	2	43	-10.479570	DUF4917 family protein
HWH78_RS12990	3	40	-9.274902	hypothetical protein
HWH78_RS12995	4	34	-8.013825	DUF2326 domain-containing protein
HWH78_RS13000	1	26	-5.038473	hypothetical protein
HWH78_RS13005	4	21	-2.260502	hypothetical protein
HWH78_RS13010	4	18	-0.522169	type II toxin-antitoxin system RelB/DinJ family
HWH78_RS13015	4	17	-0.636760	DNA repair protein RadC
HWH78_RS13020	4	17	-0.751670	hypothetical protein
HWH78_RS13025	5	18	-0.389125	DUF945 domain-containing protein
HWH78_RS13030	4	19	-0.101076	ParB/RepB/Spo0J family partition protein
HWH78_RS13035	1	20	0.036686	hypothetical protein
HWH78_RS13040	1	20	0.554483	DNA helicase
HWH78_RS13045	3	19	3.239842	DUF736 domain-containing protein
HWH78_RS13050	5	21	3.980552	helix-turn-helix domain-containing protein
HWH78_RS13055	6	22	3.872184	DUF2958 domain-containing protein
HWH78_RS13060	6	22	3.854745	DUF2285 domain-containing protein
HWH78_RS13065	8	20	3.006076	helix-turn-helix domain-containing protein
HWH78_RS13070	5	20	1.983266	replication initiator protein A
HWH78_RS13075	5	23	0.380738	AAA family ATPase
HWH78_RS13080	4	24	-1.052922	hypothetical protein
HWH78_RS13085	3	27	-2.840054	DUF2840 domain-containing protein
HWH78_RS13090	4	30	-3.352839	S26 family signal peptidase
HWH78_RS13095	2	25	-3.574273	hypothetical protein
HWH78_RS13100	2	28	-4.278944	hypothetical protein
HWH78_RS13105	2	29	-4.090703	TonB-dependent siderophore receptor
HWH78_RS13110	2	31	-4.413938	efflux RND transporter periplasmic adaptor
HWH78_RS13115	1	32	-4.355336	efflux RND transporter permease subunit
HWH78_RS13120	1	33	-4.659648	efflux transporter outer membrane subunit
HWH78_RS13125	2	38	-4.308196	transcriptional repressor
HWH78_RS13130	1	36	-4.382114	helix-turn-helix transcriptional regulator
HWH78_RS13135	3	29	-2.522857	LysR family transcriptional regulator
HWH78_RS13140	3	22	-1.076963	EexN family lipoprotein
HWH78_RS13145	4	19	0.324262	conjugal transfer protein TraG
HWH78_RS13150	3	14	1.166176	ribbon-helix-helix protein, CopG family
HWH78_RS13155	2	13	1.748622	P-type conjugative transfer ATPase TrbB
HWH78_RS13160	3	13	1.962252	TrbC/VirB2 family protein
HWH78_RS13165	2	14	2.052948	VirB3 family type IV secretion system protein
HWH78_RS13170	3	14	2.059118	conjugal transfer protein TrbE
HWH78_RS13175	3	14	1.983747	P-type conjugative transfer protein TrbJ
HWH78_RS13180	4	14	2.263402	hypothetical protein
HWH78_RS13185	3	15	2.378464	P-type conjugative transfer protein TrbL
HWH78_RS13190	2	14	0.803874	conjugal transfer protein TrbF
HWH78_RS13195	3	18	-1.087967	P-type conjugative transfer protein TrbG
HWH78_RS13205	-1	20	-2.439885	TrbI/VirB10 family protein
HWH78_RS13210	0	19	-3.159641	DUF2274 domain-containing protein
HWH78_RS13215	-1	18	-3.275699	hypothetical protein
HWH78_RS13220	-1	15	-2.863744	hypothetical protein
HWH78_RS13225	-1	12	-1.982295	two-component sensor histidine kinase
HWH78_RS13230	-1	9	-1.935536	two-component system response regulator
HWH78_RS13235	-1	11	-1.517829	methyl-accepting chemotaxis protein
HWH78_RS13240	-1	14	-2.074075	alkane 1-monooxygenase AlkB1
HWH78_RS13245	-2	14	-1.783319	nitroreductase family protein
HWH78_RS13250	-2	15	-2.539517	DMT family transporter
HWH78_RS13255	-1	18	-3.066238	Lrp/AsnC family transcriptional regulator
HWH78_RS13260	0	24	-3.364408	GNAT family N-acetyltransferase
HWH78_RS13265	0	23	-3.734778	tryptophan 2,3-dioxygenase
HWH78_RS13270	-1	22	-3.241449	NAD(P)H-dependent oxidoreductase
HWH78_RS13275	-1	20	-3.218204	hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
HWH78_RS12985	ANTHRAXTOXNA	34	7e-04	Anthrax toxin LF subunit signature.
		>ANTHRAXTOXNA#Anthrax toxin LF subunit signature.

Length = 800

Score = 34.3 bits (78), Expect = 7e-04
Identities = 14/64 (21%), Positives = 28/64 (43%), Gaps = 1/64 (1%)

Query: 237 QKVASIQNSYYLSTVYREVLKSQRDTLVVYGWGFAEQDIHL-LQRMKDTGINRVAVSVFR 295
Q S+ SYY + +R VL+ + Y + + +K G+ + + V +
Sbjct: 238 QHAFSLAFSYYFAPDHRTVLELYAPDMFEYMNKLEKGGFEKISESLKKEGVEKDRIDVLK 297

Query: 296 GDQA 299
G++A
Sbjct: 298 GEKA 301


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
HWH78_RS13040	GPOSANCHOR	39	1e-04	Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 38.9 bits (90), Expect = 1e-04
Identities = 39/251 (15%), Positives = 77/251 (30%), Gaps = 24/251 (9%)

Query: 451 NGEASEEAETDDENVAEALFASPATAANTDGEQALSQQQKKAPEGLMAWLGRHNQLNKER 510
A E+ +D++++E + + +A ++ + + E
Sbjct: 94 LSNAKEKLRKNDKSLSEKA----SKIQELEARKADLEKALEGAMNFSTADSAKIK-TLEA 148

Query: 511 TPDQRQALWQQAVAEYQAAKAQVRTVCADAHRIRALIQTLLTARKKIAE----------- 559
A + A A + A L + ++ +
Sbjct: 149 EKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTA 208

Query: 560 ATATVQALEQNLAHTAEQQARLDTEEGRPANAAFKHALETLTQHQVRRPGFWANLRSLWG 619
+A ++ LE A A ++A L+ A + + + A L
Sbjct: 209 DSAKIKTLEAEKAALAARKADLEKALEG-AMNFSTADSAKIKTLEAEKAALEARQAELEK 267

Query: 620 ASRAWNTALKLLQGQHDLAKAEFDRV----ARLAKQ---FEASSQQLERQITGARQALQQ 672
A + +AE + A L Q A+ Q L R + +R+A +Q
Sbjct: 268 ALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQ 327

Query: 673 WQAEDQSLTQQ 683
+AE Q L +Q
Sbjct: 328 LEAEHQKLEEQ 338



Score = 33.9 bits (77), Expect = 0.004
Identities = 44/260 (16%), Positives = 77/260 (29%), Gaps = 21/260 (8%)

Query: 449 DSNGEASEEAETDDENVAEALFASPATAANTDGEQALSQQQKKAPEGLMAWLGRHNQLNK 508
+ + E A A +KA EG M + + K
Sbjct: 120 ARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIK 179

Query: 509 ERTPDQRQ--ALWQQAVAEYQAAKAQVRTVCADAHRIRALIQTLLTARKKIAEATATVQA 566
++ A + + A A + A L + + +A
Sbjct: 180 TLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMN 239

Query: 567 LEQNLAHTA-----------EQQARLDTEEGRPAN--AAFKHALETLTQHQVRRPGFWAN 613
+ +QA L+ N A ++TL + A+
Sbjct: 240 FSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKAD 299

Query: 614 LRSLWGASRAWNTALKL-LQGQHDLAKAEFDRVARLAKQF---EASSQQLERQITGARQA 669
L A +L+ L + K +L +Q EAS Q L R + +R+A
Sbjct: 300 LEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREA 359

Query: 670 LQQWQAEDQSLTQ--QASDL 687
+Q +AE Q L + + S+
Sbjct: 360 KKQLEAEHQKLEEQNKISEA 379


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
HWH78_RS13100	cloacin	28	0.011	Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 28.1 bits (62), Expect = 0.011
Identities = 15/52 (28%), Positives = 22/52 (42%)

Query: 40 FVLLSDTAPPAEEHEHSPEEKKRLALWMKKHKVELRKLVLAMIQIEPNQAKR 91
+V +SD P + + EE +R W H VE + + E NQA
Sbjct: 284 YVSVSDVLSPDQVKQRQDEENRRQQEWDATHPVEAAERNYERARAELNQANE 335


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
HWH78_RS13115	RTXTOXIND	47	8e-08	Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 46.7 bits (111), Expect = 8e-08
Identities = 21/167 (12%), Positives = 61/167 (36%), Gaps = 15/167 (8%)

Query: 114 ASAKADFAQLQLKRSQELAPTGAEPRETLEQRKAEAAQAVAAVRQLDARIQQKAIRAPFT 173
+++ + + E + L Q + + + R Q IRAP +
Sbjct: 276 EQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVS 335

Query: 174 GQLGIRRIN-PGQYLNAGDAIATLT-QLDPLYINFTLPQQDLSKLTPGAPVQVTLDAAPG 231
++ +++ G + + + + + D L + + +D+ + G + ++A P
Sbjct: 336 VKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPY 395

Query: 232 QVFDAKVSAIEPRIDGETRNVAVQALLPNAGRLLKSGMYATVRLTLP 278
+ + G+ +N+ + A+ + G+ V +++
Sbjct: 396 TRY--------GYLVGKVKNINLDAIEDQ-----RLGLVFNVIISIE 429



Score = 36.0 bits (83), Expect = 2e-04
Identities = 19/100 (19%), Positives = 39/100 (39%), Gaps = 2/100 (2%)

Query: 74 LAPDTSGRVTAIYFDAGQTVKEGTVLVQLYDAPEQSDRAAASAKADFAQLQLKRSQELAP 133
+ P + V I G++V++G VL++L ++D + A+L+ R Q
Sbjct: 99 IKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQ--IL 156

Query: 134 TGAEPRETLEQRKAEAAQAVAAVRQLDARIQQKAIRAPFT 173
+ + L + K V + + I+ F+
Sbjct: 157 SRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFS 196


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
HWH78_RS13120	ACRIFLAVINRP	779	0.0	Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 779 bits (2013), Expect = 0.0
Identities = 310/1029 (30%), Positives = 518/1029 (50%), Gaps = 28/1029 (2%)

Query: 5 DIFIRRPILALVVSLLILLMGATAVFLLPVRQYPYLENATITISTSLPGATQDVMQGFVT 64
+ FIRRPI A V+++++++ GA A+ LPV QYP + +++S + PGA +Q VT
Sbjct: 3 NFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVT 62

Query: 65 TPIAQSIATASGIEYLSSTT-KQGKSEIKARLVLNANADRTMTEILAKVQQVKYQLPAGV 123
I Q++ + Y+SST+ G I + D ++ K+Q LP V
Sbjct: 63 QVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEV 122

Query: 124 TDPVISKSTEGGTAVQYIAFYSKTLSIPQ--VTDFASRVAQPLFTSIPGVASADVFGGQS 181
IS + + F S Q ++D+ + + + + GV +FG Q
Sbjct: 123 QQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQ- 181

Query: 182 LAMRIWIDPVRLAAHGLSAGEISAALRANNVQAAPGQLKSSLTVT------NISAATDLR 235
AMRIW+D L + L+ ++ L+ N Q A GQL + + +I A T +
Sbjct: 182 YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241

Query: 236 GVEDFRQMVVKSSPGGGVVRLSDVATVEVGGQNYNNLSFATGVPAIFVALQPTPDGNPLE 295
E+F ++ ++ + G VVRL DVA VE+GG+NYN ++ G PA + ++ N L+
Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301

Query: 296 IVKHANELLPKIRAMAPPGLTVAPNYDVARFVNASIEEVKHTLIEAIVIVIVVIFLFLGT 355
K L +++ P G+ V YD FV SI EV TL EAI++V +V++LFL
Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361

Query: 356 FRAVIIPVVTIPLSLVGTAALMLACGFSINLLTLLAMVLAIGLVVDDAIVVVENIHRHI- 414
RA +IP + +P+ L+GT A++ A G+SIN LT+ MVLAIGL+VDDAIVVVEN+ R +
Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421

Query: 415 EEGLTPVRAALLGAREIVGPVIAMTITLAAVYAPIGMMGGLTGALFKEFAFTLAGSVIVS 474
E+ L P A +I G ++ + + L+AV+ P+ GG TGA++++F+ T+ ++ +S
Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481

Query: 475 GIVALTLSPVMSSMLL-----SSKQGEGRLAKRVEHAMEGLTALYGRLLAHTLAARGAVL 529
+VAL L+P + + LL + +G + Y + L + G L
Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541

Query: 530 LVGAIVLVAIVGLFAGTRHELAPTEDQGAIIVVTKAPQYAGVGYTSRYAQQIEK-LFESI 588
L+ A+++ +V LF P EDQG + + + P A T + Q+ ++
Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601

Query: 589 PEFDSSFMNIGDYA---GGQNMMIGGAILKDWSERKR---SATDIQGQIQTAGGAIDGQT 642
S + ++ QN + LK W ER SA + + + G I
Sbjct: 602 KANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGF 661

Query: 643 LTAVQLPP--LPGSSGGLPVQMVLRSPDDFKSLYETGEKI-KMAAYASGLFLYVQNDLSF 699
+ +P G++ G +++ ++ +L + ++ MAA + V+ +
Sbjct: 662 VIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGLE 721

Query: 700 DSPQAHITIDNAKASEMGVTMQSIADTLAVLVGENYVNRFNFHDRSYDVIPQVRGGERMT 759
D+ Q + +D KA +GV++ I T++ +G YVN F R + Q RM
Sbjct: 722 DTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRML 781

Query: 760 PDDLGRFYVKATSGALVPLSTVARVEMRPQANQLTQFGQMNSATLEMLPAPGVSMGEAVA 819
P+D+ + YV++ +G +VP S + +L ++ + S ++ APG S G+A+A
Sbjct: 782 PEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAMA 841

Query: 820 FLQSQP--LPPGTSVDWLSDSRQFVQEGNRLLVSFGFALVVIFLVLAAQFESLRDPLVIL 877
+++ LP G DW S Q GN+ + VV+FL LAA +ES P+ ++
Sbjct: 842 LMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVM 901

Query: 878 VTVPLAVCGALVPLWLGYATLNIYTQIGLVTLIGLISKHGILMVTFANHIQHHENMSRIE 937
+ VPL + G L+ L ++Y +GL+T IGL +K+ IL+V FA + E +E
Sbjct: 902 LVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVE 961

Query: 938 AIEKAAAVRMRPVLMTTAAMVAGLVPLLFADGAGSASRFSIGIVVVMGMMVGTFFTLFVL 997
A A +R+RP+LMT+ A + G++PL ++GAGS ++ ++GI V+ GM+ T +F +
Sbjct: 962 ATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFV 1021

Query: 998 PTIYSFIAK 1006
P + I +
Sbjct: 1022 PVFFVVIRR 1030


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
HWH78_RS13190	PRTACTNFAMLY	29	0.042	Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 29.3 bits (65), Expect = 0.042
Identities = 35/134 (26%), Positives = 43/134 (32%), Gaps = 7/134 (5%)

Query: 280 ALGAAGTAVAVGAAATGVGGAVAAGARMAPVAAKMAASGARTAASTAGSARSAFQAGSAA 339
A GA +GA+ + G G R A VAA GA A R AG A
Sbjct: 213 ASGAPAAVSVLGASELTLDGGHITGGRAAGVAAM---QGAVVHLQRATIRRGDAPAGGAV 269

Query: 340 AGGGAKGAMAGLGNVAKSGAQAVGQKAA--AGARSLKERAAAAFRSEGAGSASS-GSGGA 396
GG G A G G V + S E A + + G+A G G
Sbjct: 270 PGGAVPGG-AVPGGFGPGGFGPVLDGWYGVDVSGSSVELAQSIVEAPELGAAIRVGRGAR 328

Query: 397 AAGAATPGSEATAN 410
+ S N
Sbjct: 329 VTVSGGSLSAPHGN 342


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
HWH78_RS13195	PF04335	56	8e-12	VirB8 type IV secretion protein
		>PF04335#VirB8 type IV secretion protein

Length = 227

Score = 56.4 bits (136), Expect = 8e-12
Identities = 42/207 (20%), Positives = 69/207 (33%), Gaps = 12/207 (5%)

Query: 20 YQAAAQVWD-ERIGSARVQAKNWRLMAFGCLVLALLMAGGLVWRSAQSIVTPYVVEVDKS 78
Y A W+ +++ +A K ++A LA + + V PYV+ VD++
Sbjct: 13 YFEEAASWERDKLAAAERSKKLAWVVAGVAGALATAGVVAVAALTPLKTVEPYVITVDRN 72

Query: 79 GQVRAVGE---AATPYRPADAQIAYHLAHFVTLVRSLSIDPIVVRQNWLDAYDYATDRGA 135
++ +A Y LA +V + + +
Sbjct: 73 TGEASIAAKLHGDATITYDEAVRKYFLATYVRYRE--GWIAAAREEYFDAVMVMSARPEQ 130

Query: 136 A-VLNDYASKN--DPFARIGKE-SVTVQITSVVRASESSFNVRWTEQRFVNGAPAGTERW 191
Y + N P + V V+I V + V +T + V G+ +
Sbjct: 131 DRWSRFYKTDNPQSPQNILANRTDVFVEIKRVSFLGGNVAQVYFT-KESVTGSNSTKTDA 189

Query: 192 NAVISI-VLQTPRTEQRLRKNPLGIYV 217
A I V TP E KNPLG V
Sbjct: 190 VATIKYKVDGTPSKEVDRFKNPLGYQV 216


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
HWH78_RS13200	PF03544	31	0.006	Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 30.7 bits (69), Expect = 0.006
Identities = 19/81 (23%), Positives = 31/81 (38%), Gaps = 3/81 (3%)

Query: 23 QGKPPPRISLDEPVQAQPLPEPPKPVEVV---AVPKVLPMPAQMKPLPEADDAKPTPEPA 79
+PPP ++ + +P+PEPPK VV PK P P +K + + E
Sbjct: 65 AVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESR 124

Query: 80 DETVRVSRANAEARIAPTREG 100
+ + A A +
Sbjct: 125 PASPFENTAPARPTSSTATAA 145



Score = 29.6 bits (66), Expect = 0.014
Identities = 15/73 (20%), Positives = 22/73 (30%)

Query: 26 PPPRISLDEPVQAQPLPEPPKPVEVVAVPKVLPMPAQMKPLPEADDAKPTPEPADETVRV 85
PP + +P PEP E V+ + KP P+ K +P + V
Sbjct: 62 PPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPV 121

Query: 86 SRANAEARIAPTR 98
A
Sbjct: 122 ESRPASPFENTAP 134


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
HWH78_RS13225	PF06580	35	5e-04	Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 34.8 bits (80), Expect = 5e-04
Identities = 29/188 (15%), Positives = 63/188 (33%), Gaps = 44/188 (23%)

Query: 288 REGIGRVRKIVQDLKNFSR-VDAEDDWQWTDLHQGIESTLNIVASE-------LKYRADV 339
E + R+++ L R + + L + + + L++ +
Sbjct: 187 LEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQI 246

Query: 340 VREYGDLPEVKCLPSQINQVVMNLVMNAAQ-AMGPER--GRIVIRTGHTVEHAWIEVEDS 396
D+ +P + Q LV N + + G+I+++ +EVE++
Sbjct: 247 NPAIMDVQ----VPPMLVQT---LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENT 299

Query: 397 GQGISPEILPRIFDPFFTTKPVGKGTGLGLS-------LSYGIVQKHGGTIEVRSQPGVG 449
G K + TG GL + YG + I++ + G
Sbjct: 300 GSLA--------------LKNTKESTGTGLQNVRERLQMLYG--TEAQ--IKLSEKQGKV 341

Query: 450 SAFRIVLP 457
+A +++P
Sbjct: 342 NA-MVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
HWH78_RS13230	HTHFIS	106	6e-27	FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 106 bits (265), Expect = 6e-27
Identities = 32/154 (20%), Positives = 69/154 (44%), Gaps = 5/154 (3%)

Query: 14 RFSVLLVDDEPLILSSLRRLLRNQPYDLLLAESGEQALQLLESRPVDLVVSDARMPNMDG 73
++L+ DD+ I + L + L YD+ + + + + + DLVV+D MP+ +
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 74 AALLAEIHRRSPETIRILLTGHADLPTIAKAINEGRIHHYLSKPWNDDELLLTLRQSLEY 133
LL I + P+ ++++ T KA +G + YL KP++ EL+ + ++L
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKG-AYDYLPKPFDLTELIGIIGRALAE 121

Query: 134 LHSERERRRLERLTQE----QNDRLQQLNATLEK 163
+ + ++ +Q++ L +
Sbjct: 122 PKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLAR 155


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
HWH78_RS13260	SACTRNSFRASE	36	2e-05	Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 36.5 bits (84), Expect = 2e-05
Identities = 20/82 (24%), Positives = 35/82 (42%), Gaps = 4/82 (4%)

Query: 74 DDQVIGHCQLLFDRRNGVVRLARIALAPSARGQGLGLPMLEALLAEAFA-DADIERVELN 132
++ IG ++ NG + IA+A R +G+G +L A +A + + L
Sbjct: 73 ENNCIGRIKIR-SNWNGYALIEDIAVAKDYRKKGVGTALLH--KAIEWAKENHFCGLMLE 129

Query: 133 VYDWNAAARHLYRRAGFREEGL 154
D N +A H Y + F +
Sbjct: 130 TQDINISACHFYAKHHFIIGAV 151


42	HWH78_RS14090	HWH78_RS14340	Y	Y	Y	Pathogenicity Island (biased-composition)
LocusTag	DNBias	CDNBias	%GCBias	Product
HWH78_RS14090	2	14	1.997166	DUF6348 family protein
HWH78_RS14095	2	15	2.327417	VOC family protein
HWH78_RS14100	1	15	2.824417	GFA family protein
HWH78_RS14105	1	15	3.583111	hypothetical protein
HWH78_RS14110	2	18	3.784323	hypothetical protein
HWH78_RS14115	1	14	2.733862	Dyp-type peroxidase
HWH78_RS14120	2	13	0.767962	AAA family ATPase
HWH78_RS14125	1	17	-1.167445	4Fe-4S cluster-binding domain-containing
HWH78_RS14130	0	20	-2.177239	DUF4011 domain-containing protein
HWH78_RS14135	-1	46	-7.787471	hypothetical protein
HWH78_RS14140	0	56	-10.197652	IS3 family transposase
HWH78_RS14145	0	59	-11.199673	FRG domain-containing protein
HWH78_RS14150	1	59	-11.269787	DEAD/DEAH box helicase
HWH78_RS14155	3	65	-13.438681	ATP-binding protein
HWH78_RS14160	4	68	-14.251753	TerB N-terminal domain-containing protein
HWH78_RS14165	4	68	-15.017170	hypothetical protein
HWH78_RS14170	4	60	-14.414287	site-specific integrase
HWH78_RS14175	4	53	-13.176774	hypothetical protein
HWH78_RS14180	3	47	-11.232978	inovirus-type Gp2 protein
HWH78_RS14185	3	34	-7.325810	AlpA family transcriptional regulator
HWH78_RS14190	4	32	-6.815233	DUF932 domain-containing protein
HWH78_RS14195	2	37	-7.143711	YqaJ viral recombinase family protein
HWH78_RS14200	1	43	-8.657618	hypothetical protein
HWH78_RS14205	1	54	-9.306548	DNA repair protein RadC
HWH78_RS14210	3	60	-10.140032	type II toxin-antitoxin system RelE/ParE family
HWH78_RS14215	3	61	-10.082941	type II toxin-antitoxin system Phd/YefM family
HWH78_RS14220	2	59	-10.304317	hypothetical protein
HWH78_RS14225	2	60	-10.497348	DEAD/DEAH box helicase
HWH78_RS14230	2	58	-10.263365	phospholipase D family protein
HWH78_RS14235	2	56	-10.249715	DUF6361 family protein
HWH78_RS14240	1	54	-9.968957	RecQ family ATP-dependent DNA helicase
HWH78_RS14245	0	43	-9.562182	AlpA family transcriptional regulator
HWH78_RS14250	0	43	-11.229111	Hsp70 family protein
HWH78_RS14255	1	42	-10.989040	hypothetical protein
HWH78_RS14260	1	35	-10.945324	hypothetical protein
HWH78_RS14265	0	33	-11.066419	DEAD/DEAH box helicase family protein
HWH78_RS14270	0	30	-11.154256	site-specific DNA-methyltransferase
HWH78_RS14275	0	26	-9.001536	DUF4391 domain-containing protein
HWH78_RS14280	-1	13	-5.463162	helicase
HWH78_RS14285	-2	12	-5.298612	*MerR family transcriptional regulator
HWH78_RS14295	1	14	-2.185781	integration host factor subunit alpha
HWH78_RS14300	2	15	-3.159511	phenylalanine--tRNA ligase subunit beta
HWH78_RS14305	0	11	-3.036038	phenylalanine--tRNA ligase subunit alpha
HWH78_RS14310	-1	13	-3.621999	50S ribosomal protein L20
HWH78_RS14315	-1	13	-3.358459	50S ribosomal protein L35
HWH78_RS14320	-2	12	-2.764528	translation initiation factor IF-3
HWH78_RS14325	-2	11	-2.203826	threonine--tRNA ligase
HWH78_RS14330	-2	10	-0.595352	alpha/beta hydrolase
HWH78_RS14335	1	11	1.716938	hypothetical protein
HWH78_RS14340	3	10	0.481129	hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
HWH78_RS14120	HTHFIS	31	0.014	FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 31.0 bits (70), Expect = 0.014
Identities = 42/257 (16%), Positives = 78/257 (30%), Gaps = 39/257 (15%)

Query: 225 DQAARRALAPALLRGLGGAGVAEEALQQAAATFVENTEGLLLLDL-----NAIVQLARVE 279
D AA R + L G L++ D+ NA L R++
Sbjct: 11 DDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLPRIK 70

Query: 280 GLAMER----------IADAVRRYKVGVTE---DPWLKID-RQRIRQADEIVRRRVKGQQ 325
+ A++ + G + P+ + I +A +RR +
Sbjct: 71 KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRRPSKLE 130

Query: 326 HAVTHMLDIVKR--AMTGV--GASRKGNRPRGVAFLAGPTGVGKTELAKTVTSLLFGDES 381
+ +V R AM + +R + + G +G GK +A+ +
Sbjct: 131 DDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTL-MITGESGTGKELVARALHDYGKRRNG 189

Query: 382 AYIRFDMSEFSAEHADQRLIGAPPGYVGYDVGGELTNAIREKP--FS-----VVLFDEIE 434
++ +M+ + + L G G T A F + DEI
Sbjct: 190 PFVAINMAAIPRDLIESELFGH--------EKGAFTGAQTRSTGRFEQAEGGTLFLDEIG 241

Query: 435 KAHPRILDKFLQILDDG 451
+ L++L G
Sbjct: 242 DMPMDAQTRLLRVLQQG 258


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
HWH78_RS14135	RTXTOXIND	31	0.012	Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 30.6 bits (69), Expect = 0.012
Identities = 27/215 (12%), Positives = 64/215 (29%), Gaps = 19/215 (8%)

Query: 98 ETRQQHRRLQENASALLQALDARPDAASAALRQTLQALADGALRDDAEALLAQGFAALAS 157
+ L+ +S L L+ + + + + +++ +
Sbjct: 129 ALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLT 188

Query: 158 APAEERLSEAQRELAQRLKTDETPITLEQWRARQQQDAPREQRLARIDRHIAELQLLQGE 217
+ +E+ S Q + Q E ++ A R LARI+R+ ++ +
Sbjct: 189 SLIKEQFSTWQNQKYQ----------KEL--NLDKKRAERLTVLARINRYENLSRVEKSR 236

Query: 218 AS------AQAFLERLARAEAEQRPERRNLLLDSLVLDLAQAAREHQQQRQRLE-HLQDL 270
+ + + A E E + L L Q E ++ + Q
Sbjct: 237 LDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLF 296

Query: 271 ASEVAALGAAEHAELLQRAAACQADSDPQQLAELT 305
+E+ + + + QQ + +
Sbjct: 297 KNEILDKLRQTTDNIGLLTLELAKNEERQQASVIR 331



Score = 29.4 bits (66), Expect = 0.032
Identities = 20/122 (16%), Positives = 43/122 (35%), Gaps = 4/122 (3%)

Query: 12 REEAIATCERDLQRLDKALARWENQASRLAQLSDAERAAAHARRASLHALLEQERWLDVQ 71
+E + + + + R+EN + D + H + + HA+LEQE V+
Sbjct: 205 KELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKY-VE 263

Query: 72 LQVKIESEFLKRDLAEREERAIRQAAETRQQHRR---LQENASALLQALDARPDAASAAL 128
++ + + E E + ++ + Q + L + + A
Sbjct: 264 AVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEE 323

Query: 129 RQ 130
RQ
Sbjct: 324 RQ 325


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
HWH78_RS14170	CHANLCOLICIN	30	0.018	Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 30.4 bits (68), Expect = 0.018
Identities = 23/74 (31%), Positives = 37/74 (50%), Gaps = 6/74 (8%)

Query: 55 VTLPFGTYSRDGSEGTMTLAEAGQRAKELAGIH---KSGTKDIKEHLEAEEASRIAARDA 111
+TL GT GS G + G +++ A IH K T +K+ +AE+A+R A
Sbjct: 22 ITLLNGTPDGSGSGGGG--GKGGSKSESSAAIHATAKWSTAQLKKT-QAEQAARAKAAAE 78

Query: 112 ELARAAAEKAAIAE 125
A+A A + A+ +
Sbjct: 79 AQAKAKANRDALTQ 92


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
HWH78_RS14255	SHAPEPROTEIN	57	5e-11	Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 57.5 bits (139), Expect = 5e-11
Identities = 58/232 (25%), Positives = 90/232 (38%), Gaps = 49/232 (21%)

Query: 5 GFDFGTTNSLISV-----VEGDRCIAYVDEDNGGSPHPSVVCYDGDEVVVGHEAKGRLSS 59
D GT N+LI V V + + + +D GSP VGH+AK L
Sbjct: 14 SIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGSPK--------SVAAVGHDAKQML-- 63

Query: 60 SGLGVIGNVVRSPKTLLG----KHSVHVDGVSRSPRQMVADIVRYVTGHAR-DERPGDYS 114
R+P + K V D +M+ ++ V ++ P
Sbjct: 64 ---------GRTPGNIAAIRPMKDGVIAD--FFVTEKMLQHFIKQVHSNSFMRPSP---- 108

Query: 115 RAVVTIPVDMNGERRRDLREAFRLAGVTIDQFVHEPLAALYGHLRDLPDFASEIRRLDRQ 174
R +V +PV RR +RE+ + AG + EP+AA G LP +
Sbjct: 109 RVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGA--GLPVSEATGS----- 161

Query: 175 YMLVFDWGGGTLDLTLCQLVDGYLVQITNHGCSHLGGDVIDESIMHEVVRRH 226
+V D GGGT ++ + L + +GGD DE+I++ V R +
Sbjct: 162 --MVVDIGGGTTEVAVISLNG-----VVYSSSVRIGGDRFDEAIINYVRRNY 206


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
HWH78_RS14300	DNABINDINGHU	113	1e-36	Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 113 bits (284), Expect = 1e-36
Identities = 34/89 (38%), Positives = 54/89 (60%)

Query: 5 TKAEIAERLYEELGLNKREAKELVELFFEEIRQALEHNEQVKLSGFGNFDLRDKRQRPGR 64
K ++ ++ E L K+++ V+ F + L E+V+L GFGNF++R++ R GR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 65 NPKTGEEIPITARRVVTFRPGQKLKARVE 93
NP+TGEEI I A +V F+ G+ LK V+
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAVK 91


43	HWH78_RS14630	HWH78_RS14675	Y	Y	N	Pathogenicity Island (biased-composition)
LocusTag	DNBias	CDNBias	%GCBias	Product
HWH78_RS14630	2	28	-4.220267	hypothetical protein
HWH78_RS14635	1	29	-4.686410	hypothetical protein
HWH78_RS14640	0	23	-3.449035	hypothetical protein
HWH78_RS14645	0	20	-3.131424	hypothetical protein
HWH78_RS14650	1	24	-3.956529	GntR family transcriptional regulator
HWH78_RS14655	-1	18	-3.737865	MFS transporter
HWH78_RS14660	-2	15	-3.092565	cysteine hydrolase
HWH78_RS14665	-1	10	-2.648366	tRNA dihydrouridine(20/20a) synthase DusA
HWH78_RS14670	0	14	-3.488906	transaldolase
HWH78_RS14675	-1	14	-3.254189	STAS domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
HWH78_RS14655	TCRTETA	103	1e-26	Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 103 bits (259), Expect = 1e-26
Identities = 85/362 (23%), Positives = 148/362 (40%), Gaps = 13/362 (3%)

Query: 15 RNLAVCLFGAFTTVFAMTLILPFLPVYIGQLGVSGHAAIVQWSGIAYAATFVTAGLVAPL 74
R L V L + LI+P LP + L S GI A + AP+
Sbjct: 5 RPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVT--AHYGILLALYALMQFACAPV 62

Query: 75 WGMLGDRYGRKPMLVRASLGMAITMLLMGLASDIWQFIGLRLLAGIAGGYSSGATILVAV 134
G L DR+GR+P+L+ + G A+ +M A +W R++AGI G + A +A
Sbjct: 63 LGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIAD 122

Query: 135 QAPKSRSAWALGLLSSGVMAGNLLGPLVGGFLPPLIGIRATFWGASGLIFIAFVFTTFML 194
A G +S+ G + GP++GG + A F+ A+ L + F+ F+L
Sbjct: 123 ITDGDERARHFGFMSACFGFGMVAGPVLGGLMGG-FSPHAPFFAAAALNGLNFLTGCFLL 181

Query: 195 RETA---RPSVPAVEEPPKVGWSNMPNKAVILAMLATGLLLMIANMSVEPIITVYIETLL 251
E+ R + P + V+ A++A ++ + + ++ E
Sbjct: 182 PESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRF 241

Query: 252 EDSRRVTSVAGLAMSA-AALGSIISATYLGKVADRIGYGVIMIAALSVAAVLLIPQAFVY 310
+ G++++A L S+ A G VA R+G ++ + I AF
Sbjct: 242 HWD---ATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFAT 298

Query: 311 AGWQLIALRFLMGMALGGL-LPCMAAVIRHSVPERFVGSAMGWSLSAQFAGQVIGPVIGG 369
GW + L +A GG+ +P + A++ V E G G + ++GP++
Sbjct: 299 RGWMAFPIMVL--LASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFT 356

Query: 370 FV 371
+
Sbjct: 357 AI 358



Score = 51.4 bits (123), Expect = 3e-09
Identities = 43/177 (24%), Positives = 73/177 (41%), Gaps = 4/177 (2%)

Query: 217 PNKAVILAMLATGLLLMIANMSVEPIITVYIETLLEDSRRVTSVAGLAMSAAALGSIISA 276
PN+ +I+ + L + + + P++ + L+ S VT+ G+ ++ AL A
Sbjct: 3 PNRPLIVILSTVALDAVGIGL-IMPVLPGLLRDLVH-SNDVTAHYGILLALYALMQFACA 60

Query: 277 TYLGKVADRIGYGVIMIAALSVAAVLLIPQAFVYAGWQLIALRFLMGMALGGLLPCMAAV 336
LG ++DR G +++ +L+ AAV A W L R + G+ G A
Sbjct: 61 PVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAY 119

Query: 337 IRHSVPERFVGSAMGWSLSAQFAGQVIGPVIGGFVGGQFGMRSVFLVTSLLMLAGAL 393
I G+ + G V GPV+GG + G F + F + L L
Sbjct: 120 IADITDGDERARHFGFMSACFGFGMVAGPVLGGLM-GGFSPHAPFFAAAALNGLNFL 175


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS14660  ISCHRISMTASE  52  3e-10  Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 51.6 bits (123), Expect = 3e-10
Identities = 47/189 (24%), Positives = 72/189 (38%), Gaps = 6/189 (3%)

Query: 9 PFNSALLLMDFQEFVLNNFVP-APRAAEVVARTSRFLSRVRQTDMLVVHVTVGCPPDGPP 67
P + LL+ D Q + ++ F A E+ A + ++ Q + VV+ P P
Sbjct: 28 PNRAVLLIHDMQNYFVDAFTAGASPVTELSANIRKLKNQCVQLGIPVVYTAQ--PGSQNP 85

Query: 68 MDRRNRLFRELRESGLIEPGSPGLAITAALQPIESEPVITKSRVGAFTGTELDELLWAHD 127
DR L + GL G I L P + + V+TK R AF T L E++
Sbjct: 86 DDRA--LLTDFWGPGLNS-GPYEEKIITELAPEDDDLVLTKWRYSAFKRTNLLEMMRKEG 142

Query: 128 IETLLIAGATTSGVVLTTVRQAFDLDYQLVVLRDGCVDGDAELHDYLMARVISDHATITE 187
+ L+I G L T +AF D + + D D E H + A
Sbjct: 143 RDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVADFSLEKHQMALEYAAGRCAFTVM 202

Query: 188 IDEVAKTLR 196
D + L+
Sbjct: 203 TDSLLDQLQ 211


44  HWH78_RS15090  HWH78_RS15155  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
HWH78_RS15090  2  11  2.533083  DUF962 domain-containing protein
HWH78_RS15095  2  11  3.106989  hypothetical protein
HWH78_RS15100  2  12  3.357543  GGDEF domain-containing protein
HWH78_RS15105  1  14  3.644680  acyl-CoA thioesterase II
HWH78_RS15110  2  14  3.588394  CHAD domain-containing protein
HWH78_RS15115  1  11  3.042912  protein-glutamine gamma-glutamyltransferase
HWH78_RS15120  0  10  3.325694  DUF58 domain-containing protein
HWH78_RS15125  -1  9  2.398533  AAA family ATPase
HWH78_RS15130  0  11  2.468279  orotidine-5'-phosphate decarboxylase
HWH78_RS15135  0  11  2.273090  LysR family transcriptional regulator
HWH78_RS15140  1  12  2.013541  DUF924 family protein
HWH78_RS15145  2  13  1.891654  LysR family transcriptional regulator
HWH78_RS15150  4  13  1.068752  multidrug/biocide efflux PACE transporter
HWH78_RS15155  2  12  1.832223  response regulator
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS15125  HTHFIS  32  0.003  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 31.7 bits (72), Expect = 0.003
Identities = 12/43 (27%), Positives = 21/43 (48%)

Query: 103 DEINRATPKSQSALLEAMEEGQVTIEGATRPLPEPFFVIATQN 145
DEI +Q+ LL +++G+ T G P+ ++A N
Sbjct: 238 DEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATN 280


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS15155  HTHFIS  98  5e-25  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 97.6 bits (243), Expect = 5e-25
Identities = 29/136 (21%), Positives = 57/136 (41%), Gaps = 2/136 (1%)

Query: 7 RILFVDDEERILRSLAMQF-RRHYEVLTESDPRRALERLKTERIQVLVSDQRMPQMSGAE 65
IL DD+ I L R Y+V S+ + ++V+D MP + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 66 LLAQARERYPETLRILLTGYSDLDAAVDALNDGGIFRYLTKPWNPQEMAFTLRQAAEIAS 125
LL + ++ P+ ++++ + A+ A + G + YL KP++ E+ + +A
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKAS-EKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 126 RQGLPAPPAATLAAPL 141
R+ + PL
Sbjct: 124 RRPSKLEDDSQDGMPL 139



Score = 54.8 bits (132), Expect = 1e-10
Identities = 27/139 (19%), Positives = 55/139 (39%), Gaps = 5/139 (3%)

Query: 142 SVLLLDDDPETLDCVGAFCHAGGHRLLRARNLAEALVWLNTEPVEVLVSDLKLAGEHTAP 201
++L+ DDD + G+ + N A W+ +++V+D+ + E+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 202 LLKSLAQAHPRLLSLVVTPFRDTQALLELINQAQIFRYLPKPIRRGLFEKGLKAAAEQAL 261
LL + +A P L LV++ ++ + + YLPKP L +AL
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKAS-EKGAYDYLPKP----FDLTELIGIIGRAL 119

Query: 262 LWRGRSLPEVDRLAEVPRD 280
R +++ ++
Sbjct: 120 AEPKRRPSKLEDDSQDGMP 138


45  HWH78_RS15200  HWH78_RS15510  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
HWH78_RS15200  1  12  3.307119  isohexenylglutaconyl-CoA hydratase
HWH78_RS15205  0  11  2.792032  geranyl-CoA carboxylase subunit alpha
HWH78_RS15210  -1  11  1.570035  NAD(P)-dependent oxidoreductase
HWH78_RS15215  0  10  1.555813  long-chain-acyl-CoA synthetase
HWH78_RS15220  0  13  2.116554  hypothetical protein
HWH78_RS15225  0  12  2.024288  anti-sigma factor SbrR
HWH78_RS15230  1  13  0.888474  ECF family RNA polymerase sigma factor SbrI
HWH78_RS15235  0  12  0.790268  PLP-dependent aminotransferase family protein
HWH78_RS15240  0  11  3.298571  hypothetical protein
HWH78_RS15245  0  11  2.930734  response regulator transcription factor
HWH78_RS15250  -2  10  3.528362  OmpA family protein
HWH78_RS15255  -2  11  4.591688  DUF4398 domain-containing protein
HWH78_RS15260  -2  10  5.424068  transporter substrate-binding domain-containing
HWH78_RS15265  1  9  5.952334  precorrin-3B C(17)-methyltransferase
HWH78_RS15270  0  9  5.931308  precorrin-2 C(20)-methyltransferase
HWH78_RS15275  1  9  5.999002  precorrin-8X methylmutase
HWH78_RS15280  -2  11  4.663686  precorrin-3B synthase
HWH78_RS15285  -1  14  4.331523  bifunctional cobalt-precorrin-7
HWH78_RS15290  -2  15  3.585817  cobalt-precorrin-5B (C(1))-methyltransferase
HWH78_RS15295  0  14  3.721926  cobalt-precorrin-6A reductase
HWH78_RS15300  0  15  2.966153  manganese efflux pump MntP
HWH78_RS15305  1  15  3.330199  TonB-dependent receptor
HWH78_RS15310  1  13  4.160875  ABC transporter ATP-binding protein
HWH78_RS15315  1  13  3.294050  ABC transporter substrate-binding protein
HWH78_RS15320  1  13  3.216820  iron ABC transporter permease
HWH78_RS15325  -1  11  2.701749  MBL fold metallo-hydrolase
HWH78_RS15330  0  9  2.714666  LysE family translocator
HWH78_RS15335  -1  11  2.790075  AraC family transcriptional regulator
HWH78_RS15340  -1  10  2.744329  SDR family oxidoreductase
HWH78_RS15345  -1  9  3.003157  hypothetical protein
HWH78_RS15350  -2  10  2.472668  methyl-accepting chemotaxis protein
HWH78_RS15355  -3  12  1.762396  LysR family transcriptional regulator
HWH78_RS15360  -3  15  1.696132  amidohydrolase
HWH78_RS15365  -1  17  1.411336  ABC transporter substrate-binding protein
HWH78_RS15370  0  10  2.110865  ABC transporter permease
HWH78_RS15375  0  8  2.040612  ABC transporter permease
HWH78_RS15380  1  8  3.021248  ABC transporter ATP-binding protein
HWH78_RS15385  1  10  3.725227  hypothetical protein
HWH78_RS15390  0  8  3.910999  hypothetical protein
HWH78_RS15395  -1  9  4.531154  LysE family transporter
HWH78_RS15400  -1  7  3.347420  LysR family transcriptional regulator
HWH78_RS15405  -2  9  3.461643  TetR/AcrR family transcriptional regulator
HWH78_RS15410  -1  9  3.070386  alkene reductase
HWH78_RS15415  0  10  2.047850  MFS transporter
HWH78_RS15420  -1  14  0.697717  CFTR inhibitory factor Cif
HWH78_RS15425  -1  12  0.033305  cytochrome b
HWH78_RS15430  0  11  0.827151  hypothetical protein
HWH78_RS15435  0  10  1.440591  purine permease
HWH78_RS15440  -1  10  1.860690  aminopeptidase PaaP
HWH78_RS15445  -3  8  2.978305  thiolase family protein
HWH78_RS15450  -2  6  3.518329  VWA domain-containing protein
HWH78_RS15455  -2  6  3.618395  AAA family ATPase
HWH78_RS15460  -2  8  3.808037  3-deoxy-7-phosphoheptulonate synthase
HWH78_RS15465  -2  8  3.691112  cobaltochelatase subunit CobN
HWH78_RS15470  -2  8  3.499195  cobalamin biosynthesis protein CobW
HWH78_RS15475  -2  12  2.639439  CbtB-domain containing protein
HWH78_RS15480  1  14  1.646276  CbtA family protein
HWH78_RS15485  1  16  1.315271  cobalamin biosynthesis protein
HWH78_RS15490  2  16  -0.135475  precorrin-4 C(11)-methyltransferase
HWH78_RS15495  2  17  -1.202681  alpha/beta fold hydrolase
HWH78_RS15500  3  17  -1.569105  enoyl-ACP reductase FabV
HWH78_RS15505  3  17  -1.759018  FAD-binding protein
HWH78_RS15510  2  15  -1.634480  electron transfer flavoprotein subunit beta/FixA
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS15205  RTXTOXIND  38  2e-04  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 37.5 bits (87), Expect = 2e-04
Identities = 25/85 (29%), Positives = 42/85 (49%), Gaps = 6/85 (7%)

Query: 579 AGASAQVGASSGTLK-APMDGAIV-EVLVGEGERVGKGQLLLVLEAMKMEHPLKAGVDGV 636
A A+ ++ S + + P++ +IV E++V EGE V KG +LL L A+ E A
Sbjct: 84 ATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAE----ADTLKT 139

Query: 637 VRRVQVGRGEQVRNRQVLVEVEADA 661
+ R EQ R + + +E +
Sbjct: 140 QSSLLQARLEQTRYQILSRSIELNK 164


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS15210  DHBDHDRGNASE  81  3e-20  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 81.3 bits (200), Expect = 3e-20
Identities = 54/191 (28%), Positives = 85/191 (44%), Gaps = 9/191 (4%)

Query: 3 LHGKTLFITGASRGIGREIALRAARDGANLVIAAKSAEPHPKLEGTIFSVAAEVEAAGGQ 62
+ GK FITGA++GIG +A A GA++ + E K+ ++ + A EA
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEA---- 61

Query: 63 ALPLQLDVRDEQAVAAAMARAAERFGGIDALVNNAGAIRLVGVEKLEPKRFDLMYQINTR 122
DVRD A+ AR G ID LVN AG +R + L + ++ + +N+
Sbjct: 62 ---FPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNST 118

Query: 123 AVLVCSQAALPYLRRSANGHILSLSPPINLAGRWFAQHGPYTVTKYGMSMLTLGMHEEFG 182
V S++ Y+ +G I+++ N AG Y +K M T + E
Sbjct: 119 GVFNASRSVSKYMMDRRSGSIVTVGS--NPAGVPRTSMAAYASSKAAAVMFTKCLGLELA 176

Query: 183 KYAISVNALWP 193
+Y I N + P
Sbjct: 177 EYNIRCNIVSP 187


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS15220  IGASERPTASE  33  3e-04  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 33.1 bits (75), Expect = 3e-04
Identities = 19/100 (19%), Positives = 29/100 (29%), Gaps = 9/100 (9%)

Query: 4 APKTASKKVAPAAEQVAEPKPPAKPKPAAAPPKPASRPVAKDKPAPAKRASTARLDPEVR 63
PK S+ V+P EQ +P A+P P P ++ V
Sbjct: 1122 VPKVTSQ-VSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVE 1180

Query: 64 KPLPSAKLDLRLPK-------ELVQKMAPPGTEETH-KPK 95
+P+ + P E+ KPK
Sbjct: 1181 QPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPK 1220


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS15225  IGASERPTASE  36  1e-04  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 36.2 bits (83), Expect = 1e-04
Identities = 23/120 (19%), Positives = 41/120 (34%), Gaps = 8/120 (6%)

Query: 112 AAAAKRAMRAPAAPAPLSSEMSEP--PALLASYASSGEAPQLMAEAAPAAPAALADRPPA 169
+ + P + + P P+ A EAP + APA P+ +
Sbjct: 985 VEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAP--VPPPAPATPSETTETVAE 1042

Query: 170 QAAQQAK---VQAALAGDFVAQARGKAVAVKPEVLDEALGAVLALREQGKTEQAATQLAE 226
+ Q++K A + AQ R A K V +A + +T++ T +
Sbjct: 1043 NSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVA-QSGSETKETQTTETK 1101


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS15245  HTHFIS  50  1e-09  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 50.2 bits (120), Expect = 1e-09
Identities = 35/157 (22%), Positives = 61/157 (38%), Gaps = 11/157 (7%)

Query: 10 LVIADSFPVMQWALQRYLSEECGRQVLAVVGDSDSLVERLADLPPESILITELGLPGQRS 69
+++AD ++ L + LS G V + +A +++T++ +P
Sbjct: 6 ILVADDDAAIRTVLNQALSRA-GYDVRITSNAATLW-RWIAAGDG-DLVVTDVVMPD--- 59

Query: 70 RDGIHLVEWLTRHCPQMKVMVYSVFSAPLLAKAVLRSGASAYISKRSPLETLKAALECMA 129
+ L+ + + P + V+V S + + A GA Y+ K L L + A
Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG-RA 118

Query: 130 LGQTFLDPG-LHPQRHTGKPL---SPTEVDILRRLAR 162
L + P L G PL S +I R LAR
Sbjct: 119 LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLAR 155


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS15250  OMPADOMAIN  102  2e-27  OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 102 bits (255), Expect = 2e-27
Identities = 44/126 (34%), Positives = 63/126 (50%), Gaps = 11/126 (8%)

Query: 155 DVLFDFNRAELKPAANRTALKLVQFL-QLNPRRV-IRIEGYTDSVGDRQANLDLSRERAQ 212
DVLF+FN+A LKP +L L L+P+ + + GYTD +G N LS RAQ
Sbjct: 220 DVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQ 279

Query: 213 AVADVLADLGVDPARMQVVGYGEAFPVTDNASNRGR---------AQNRRVEIVFSNDKG 263
+V D L G+ ++ G GE+ PVT N + + A +RRVEI K
Sbjct: 280 SVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEVKGIKD 339

Query: 264 QLSAPR 269
++ P+
Sbjct: 340 VVTQPQ 345


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS15315  FERRIBNDNGPP  38  2e-05  Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 38.4 bits (89), Expect = 2e-05
Identities = 49/266 (18%), Positives = 96/266 (36%), Gaps = 43/266 (16%)

Query: 43 PSRAVSHDINLTEMMVALGLQTRMVGYTGISGW--WKNADPGLIAALKPLPELV-----A 95
P+R V+ + E+++ALG+ G + W + +P PLP+ V
Sbjct: 35 PNRIVALEWLPVELLLALGIVP--YGVADTINYRLWVS-EP-------PLPDSVIDVGLR 84

Query: 96 RYPTAETLLDVDADFFFAGWGYGMRVGGDLTPASLEPLG-VKVYELSESCAQIGEPRRAS 154
P E L ++ F GYG +P L + + + S+ + R++
Sbjct: 85 TEPNLELLTEMKPSFMVWSAGYGP------SPEMLARIAPGRGFNFSDGKQPLAMARKS- 137

Query: 155 LDELYRDLRNLGRIFDVEPRAERLVASLQARIERARAGIPANAEAPRVF--LYDSGEDRP 212
L + + +++ AE +A + I + P + L D
Sbjct: 138 -------LTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLV 190

Query: 213 FTSGRLGMPQALIEAAGGRSVTDDVAASW--TQVNWESVVA-RDPQVIVIVDYGETSAAQ 269
F + Q +++ G + W T V+ + + A +D V+ D+ +
Sbjct: 191 FGPN--SLFQEILDEYGIPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCF-DHDNSKDMD 247

Query: 270 KQRFLEENPALRSLTAIRERRFIVLP 295
L P +++ +R RF +P
Sbjct: 248 A---LMATPLWQAMPFVRAGRFQRVP 270


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS15325  PF05932  27  0.047  Tir chaperone protein (CesT)
		>PF05932#Tir chaperone protein (CesT)

Length = 127

Score = 27.1 bits (60), Expect = 0.047
Identities = 9/53 (16%), Positives = 18/53 (33%), Gaps = 8/53 (15%)

Query: 74 ADHLSAAIFLQRELGGCLAIGARITQVQAKFSGLFNLGEAFPVDGRQFEHLFE 126
L+ A+ G L + + SGL++ ++ P + L
Sbjct: 66 QCLLAGALNPLLNAGPGLGLDEK--------SGLYHAYQSIPREKLSVPTLKR 110


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS15335  PF05272  28  0.034  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.5 bits (63), Expect = 0.034
Identities = 6/16 (37%), Positives = 8/16 (50%)

Query: 258 FRRAYGMTPAAYRRQC 273
+R AYG + RQ
Sbjct: 672 YRGAYGRYVQDHPRQV 687


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS15340  DHBDHDRGNASE  71  1e-16  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 70.9 bits (173), Expect = 1e-16
Identities = 71/260 (27%), Positives = 118/260 (45%), Gaps = 17/260 (6%)

Query: 6 IKGKTVLVTGGAKNLGGLIARDLAAHGAKAIAIHYNSAASKADADATVAALQAAGAKAVA 65
I+GK +TG A+ +G +AR LA+ GA A+ YN + V++L+A A A
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKL----EKVVSSLKAEARHAEA 61

Query: 66 LQGDLTSAAAMEKLFADAIAAVGKPDIAINTVGKVLKKPFTEISEAEYDEMSAVNAKSAF 125
D+ +AA++++ A +G DI +N G + +S+ E++ +VN+ F
Sbjct: 62 FPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVF 121

Query: 126 FFLREAGKHVND--NGKICTLVTSLLGAYTPYYAAYAGTKAPVEHFTRAASKEFGARGIS 183
R K++ D +G I T+ ++ G AAYA +KA FT+ E I
Sbjct: 122 NASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIR 181

Query: 184 VTAVGPGPMDTPFFYPAEGADAVAYHKTAAALSPFSRTGL--------TDIEDVVPFIRH 235
V PG +T + + A +L F +TG+ +DI D V F+
Sbjct: 182 CNIVSPGSTETDMQWSLWADENGAEQVIKGSLETF-KTGIPLKKLAKPSDIADAVLFL-- 238

Query: 236 LVSEGWWITGQTILINGGYT 255
+ + IT + ++GG T
Sbjct: 239 VSGQAGHITMHNLCVDGGAT 258


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS15410  HTHTETR  57  3e-12  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 56.6 bits (136), Expect = 3e-12
Identities = 35/174 (20%), Positives = 60/174 (34%), Gaps = 3/174 (1%)

Query: 1 MATRGRPRAFD-RDTALQRAMDVFWVRGYEGASLAALTEAMEIRPPSLYAAFGSKEGLFR 59
MA + + A + R L A+ +F +G SL + +A + ++Y F K LF
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 60 EALAHYLGQHGRYRRDVLDGAPSA-REGVAELLRETVARFCSDEFPRGCLVVL-AALTGT 117
E G + P + E+L + ++E R + ++
Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120

Query: 118 PESEAVRDALSAERGESIRLFRERMRRGIADGDLAADTDVEELATFYATVLFGL 171
E V+ A ES + ++ I L AD A + GL
Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGL 174


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS15420  TCRTETA  42  3e-06  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 41.7 bits (98), Expect = 3e-06
Identities = 54/261 (20%), Positives = 88/261 (33%), Gaps = 19/261 (7%)

Query: 1 MLLPILLLAAAGFTILTTEFVIVGLLPALAADLQVSVAQA---GLLVSLFAFSVAAFGPF 57
++L + L A G + I+ +LP L DL S G+L++L+A A P
Sbjct: 9 VILSTVALDAVGIGL------IMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPV 62

Query: 58 LTAALAGVERKRLFVACLLLFAAANALAAVAGDIWTMAVARFVPALALPVFWAMASETAA 117
L A R+ + + L A A+ A A +W + + R V + A+A A
Sbjct: 63 LGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATG-AVAGAYIA 121

Query: 118 HLAGPSREGRAVALVFFGIVAATVLGIPIGTLIADAWGWRLAFAALAALALAKALLLAAW 177
+ R + V G +G L+ + F A AAL L
Sbjct: 122 DITDGDERARHFGFMSACFGFGMVAGPVLGGLMGG-FSPHAPFFAAAALNGLNFLTGCFL 180

Query: 178 LPRIPGRPGVSLRSQASVLRQPLVLGHLLLSLLVFTGMF--------TPYTYLADILQRL 229
LP LR +A + + +F P +
Sbjct: 181 LPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDR 240

Query: 230 AGFSGSLVGWTLMGFGAVGLL 250
+ + +G +L FG + L
Sbjct: 241 FHWDATTIGISLAAFGILHSL 261


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS15460  HTHFIS  51  4e-09  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 50.6 bits (121), Expect = 4e-09
Identities = 60/285 (21%), Positives = 98/285 (34%), Gaps = 24/285 (8%)

Query: 34 VLIEGPRGMAKSTLARGVAELLP--AGEFVTLPLGASEERIVGSLDLDAALGE--GRARF 89
++I G G K +AR + + G FV + + A ++ S G G
Sbjct: 163 LMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTR 222

Query: 90 SPGVLAKADGGVLYVDEVNLLPDHLVDLLLDVAASGVNLVERDGISHRHPARFVLIGTMN 149
S G +A+GG L++DE+ +P LL V G G + ++ N
Sbjct: 223 STGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGE--YTTVGGRTPIRSDVRIVAATN 280

Query: 150 P------EEGELRPQLLDRFGLNVRLDTQPPPAERAEIIRRRLAFDADPQAFVERWEGQQ 203
+G R L R LNV PP +RAE I L + FV++ E +
Sbjct: 281 KDLKQSINQGLFREDLYYR--LNVVPLRLPPLRDRAEDI-PDLV-----RHFVQQAEKEG 332

Query: 204 DTLRRRCAEARRRLARI--PLDDAALDSIARRCFEAAVDGLRADLVWLRAARAHAAWRGG 261
++R EA + P + L+++ RR + + R+
Sbjct: 333 LDVKRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPI 392

Query: 262 ERIEAEDIDAVEHFALLHRRRQSNPPSSAANPPPPPAAAPVALPE 306
E+ A + +S + PP L E
Sbjct: 393 EK--AAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAE 435


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS15515  ALARACEMASE  28  0.045  Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 27.8 bits (62), Expect = 0.045
Identities = 22/85 (25%), Positives = 39/85 (45%), Gaps = 7/85 (8%)

Query: 19 VKADNSGVDLANVKM---SMNPFCEIAVEEAVRLKEKGVATEIVAVSVGPTAAQEQLRTA 75
VKA+ G + + + + F + +EEA+ L+E+G I+ + G AQ+
Sbjct: 34 VKANAYGHGIERIWSAIGATDGFALLNLEEAITLRERGWKGPIL-MLEGFFHAQD---LE 89

Query: 76 LALGADRAILVESNDELNSLAVAKL 100
+ V SN +L +L A+L
Sbjct: 90 IYDQHRLTTCVHSNWQLKALQNARL 114


46  HWH78_RS15610  HWH78_RS15890  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
HWH78_RS15610  3  15  -0.593593  DUF177 domain-containing protein
HWH78_RS15615  3  14  -0.137198  Maf-like protein
HWH78_RS15620  3  15  -0.276273  signal peptide peptidase SppA
HWH78_RS15625  4  16  0.366037  HAD-IA family hydrolase
HWH78_RS15630  3  17  0.365049  23S rRNA pseudouridine(955/2504/2580) synthase
HWH78_RS15635  3  16  0.786294  ribonuclease E
HWH78_RS15640  0  16  0.268155  UDP-N-acetylmuramate dehydrogenase
HWH78_RS15645  2  12  0.012995  low molecular weight phosphotyrosine protein
HWH78_RS15650  2  16  2.211656  3-deoxy-manno-octulosonate cytidylyltransferase
HWH78_RS15655  3  17  1.816297  Trm112 family protein
HWH78_RS15660  2  19  0.752014  tetraacyldisaccharide 4'-kinase
HWH78_RS15665  2  15  -0.097502  biopolymer transporter ExbD
HWH78_RS15670  3  15  0.141813  MotA/TolQ/ExbB proton channel family protein
HWH78_RS15675  1  15  -0.173552  DNA internalization-related competence protein
HWH78_RS15680  1  11  -2.431373  DUF2062 domain-containing protein
HWH78_RS15685  0  11  -2.185730  IS110 family transposase
HWH78_RS15690  1  11  -1.013149  lipoprotein-releasing ABC transporter permease
HWH78_RS15695  0  10  -1.426364  lipoprotein-releasing ABC transporter
HWH78_RS15700  1  10  -2.079557  lipoprotein-releasing ABC transporter permease
HWH78_RS15705  2  10  -1.229461  PilZ domain-containing protein
HWH78_RS15710  2  11  -1.543760  hypothetical protein
HWH78_RS15715  1  12  -2.031771  glycerophosphodiester phosphodiesterase
HWH78_RS15720  1  14  -2.748200  Si-specific NAD(P)(+) transhydrogenase
HWH78_RS15725  0  15  -1.993230  (Na+)-NQR maturation NqrM
HWH78_RS15730  0  14  -1.786193  FAD:protein FMN transferase
HWH78_RS15735  0  12  -2.034807  NADH:ubiquinone reductase (Na(+)-transporting)
HWH78_RS15740  2  11  -2.320783  NADH:ubiquinone reductase (Na(+)-transporting)
HWH78_RS15745  2  11  -2.469185  NADH:ubiquinone reductase (Na(+)-transporting)
HWH78_RS15750  -1  9  -1.392454  Na(+)-translocating NADH-quinone reductase
HWH78_RS15755  -1  10  -1.330878  NADH:ubiquinone reductase (Na(+)-transporting)
HWH78_RS15760  -1  8  -1.089839  Na(+)-translocating NADH-quinone reductase
HWH78_RS15765  -1  8  -0.786555  amino acid permease
HWH78_RS15770  -1  9  -0.353861  glyceraldehyde-3-phosphate dehydrogenase
HWH78_RS15775  0  9  0.236777  transcription-repair coupling factor
HWH78_RS15780  2  13  0.296108  hypothetical protein
HWH78_RS15785  3  15  -0.227619  S-methyl-5'-thioinosine phosphorylase
HWH78_RS15790  4  15  -0.915783  beta-N-acetylhexosaminidase
HWH78_RS15795  2  9  -2.339872  transcriptional regulator PsrA
HWH78_RS15800  2  10  -2.449505  transcriptional repressor LexA
HWH78_RS15805  0  10  -2.550640  cell division inhibitor SulA
HWH78_RS15810  2  12  -3.066880  hypothetical protein
HWH78_RS15815  1  10  -2.909576  hypothetical protein
HWH78_RS15820  0  10  -2.512622  type I DNA topoisomerase
HWH78_RS15825  0  11  -2.212612  DUF1653 domain-containing protein
HWH78_RS15830  0  11  -2.277610  acetyl-CoA C-acyltransferase FadA
HWH78_RS15835  0  9  -1.659053  fatty acid oxidation complex subunit alpha FadB
HWH78_RS15840  1  9  -0.416862  hypothetical protein
HWH78_RS15845  2  10  -0.084620  hypothetical protein
HWH78_RS15850  1  9  0.016767  universal stress protein
HWH78_RS15855  2  9  0.511307  hypothetical protein
HWH78_RS15860  0  8  1.062202  ATP-binding cassette domain-containing protein
HWH78_RS15865  0  8  2.424062  lytic transglycosylase Slt
HWH78_RS15870  -1  12  2.996054  lysozyme inhibitor LprI family protein
HWH78_RS15875  -2  11  2.883489  MOSC domain-containing protein
HWH78_RS15880  -1  12  3.061865  lipid kinase YegS
HWH78_RS15885  -1  10  2.721172  FGGY-family carbohydrate kinase
HWH78_RS15890  0  9  3.092219  glycerol-3-phosphate dehydrogenase/oxidase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS15635  IGASERPTASE  60  5e-11  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 59.7 bits (144), Expect = 5e-11
Identities = 56/316 (17%), Positives = 96/316 (30%), Gaps = 41/316 (12%)

Query: 760 RRSRGQRRRSNRRERQREVSGELEGSEATDNA-----AAPLNTVAAAAAAGIAVA--SEA 812
R G+ N +R + + +N + P N A V + A
Sbjct: 972 RNVNGRYDLYNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPA 1031

Query: 813 VEANVEQAPATTSEAASETTASDETDASTSEAVETQDA-----DSEANT---------GE 858
+ + A S+ S+T +E DA+ + A + A + +ANT E
Sbjct: 1032 TPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSE 1091

Query: 859 TADIEAPVTVSVVRDEADQSTLLVAQATEEAPFASESVESREDAESAVQPATEAAEEVAA 918
T + + T E ++ + + T+E P + V +++ VQP E A E
Sbjct: 1092 TKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARE--- 1148

Query: 919 PVPVEAAAPSEPATTEEPTPAIAAVPANATGRALNDPREKRRLQREAERLAR--EAAAAA 976
P + ++ T A PA T + P + + E A
Sbjct: 1149 NDPTVNI---KEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPA 1205

Query: 977 EAAAQAAPAVEEIPAVASEEA----SAQEEPAA---PQAEEIAQADVPSQT-----DEAQ 1024
P + EPA +A D+ S +A+
Sbjct: 1206 TTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDAR 1265

Query: 1025 EAVQAEPEASGKDAAD 1040
Q GK +
Sbjct: 1266 AKAQFVALNVGKAVSQ 1281



Score = 57.4 bits (138), Expect = 3e-10
Identities = 51/349 (14%), Positives = 104/349 (29%), Gaps = 36/349 (10%)

Query: 508 EAQPVSSTRTLVRQEAAVKTVAPQQPAPQHTEAPVEPAKPMPEPSLFQGLVKSLVGLFAG 567
+ +++ + +V + + EAPV P P PS V A
Sbjct: 993 DTTNITTPNNIQADVPSVPSNNEEIARVD--EAPVPPPAP-ATPSETTETV-------AE 1042

Query: 568 KDQPAAKPAETSKPAAERQTRQDERRNGRQQNRRRDGRDGNRRDEERKPREERAERQPRE 627
+ +K E ++ A T Q+ ++ + N + +E + +E
Sbjct: 1043 NSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKE 1102

Query: 628 ERAERPNREERSERRREERAERPAREERQPREGREERAERTPREERQPREGREGREERSE 687
+ + E + + + + P++ + E + R+ +E +S+
Sbjct: 1103 TATVEKEEKAKVETEKTQEVPKVTSQV-SPKQEQSETVQPQAEPARENDPTVNIKEPQSQ 1161

Query: 688 RRREERAERPAREERQPREGREERAERPAREERQPREDRQARDAAALEAEALPNDESLEQ 747
E+PA+E E + +E
Sbjct: 1162 TNTTADTEQPAKETS--------SNVEQPVTESTTVN---------------TGNSVVEN 1198

Query: 748 DEQDDTDGERPRRRSRGQRRRSNRRERQ-REVSGELEGSEATDNAAAPLNTVAAAAAAGI 806
E +P S + NR R R V +E + + N + + +
Sbjct: 1199 PENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTN 1258

Query: 807 AVASEAVEANVEQAPATTSEAASETTASDETDASTSEAVETQDADSEAN 855
AV S+A A + +A S+ + E + V + N
Sbjct: 1259 AVLSDAR-AKAQFVALNVGKAVSQHISQLEMNNEGQYNVWVSNTSMNKN 1306



Score = 53.5 bits (128), Expect = 4e-09
Identities = 45/253 (17%), Positives = 77/253 (30%), Gaps = 35/253 (13%)

Query: 838 DASTSEAVETQDADSEANTGETADIEAP------VTVSVVRDEADQSTLLVAQATEEAPF 891
A+ + ++ D E N E +A + VS+V + D +
Sbjct: 919 SATGNFTLQVADKTGEPNHNELTLFDASKAQRDHLNVSLVGNTVDLGAWKYKLRNVNGRY 978

Query: 892 A--SESVESREDAESAVQPATEAAEEVAAP-VPVEAAAPSEPATTEEPTPAIAAVPANAT 948
+ VE R T + P VP + P PA A
Sbjct: 979 DLYNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTE 1038

Query: 949 GRALNDPREKRRLQR------EAERLAREAAAAAEAAAQAAPAVEEIPAVASEEASAQ-- 1000
A N +E + +++ E RE A A++ +A E+ SE Q
Sbjct: 1039 TVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTT 1098

Query: 1001 --EEPAAPQAEEIAQADVPSQTDEAQEA--------------VQAEPEASGKDA--ADTE 1042
+E A + EE A+ + + + QAEP
Sbjct: 1099 ETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEP 1158

Query: 1043 HAKKAEESETSRP 1055
++ ++T +P
Sbjct: 1159 QSQTNTTADTEQP 1171



Score = 50.1 bits (119), Expect = 5e-08
Identities = 53/338 (15%), Positives = 95/338 (28%), Gaps = 62/338 (18%)

Query: 616 PREERAERQPREERAERPNREERSERRREERAERPAREERQPREGREERAERTPREERQP 675
P E+ + PN + E AR + P A TP E +
Sbjct: 983 PEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVP---PPAPATPSETTET 1039

Query: 676 REGREGREERSERRREERAERPAREERQPREGREERAERPAREERQPREDRQARDAAALE 735
+E ++ + E+ A + R+ + E ++ A + Q + A
Sbjct: 1040 VAENSKQESKTVEKNEQDATETTAQNREVAK--EAKSNVKA--------NTQTNEVAQSG 1089

Query: 736 AEALPNDESLEQDEQDDTDGERPRRRSRGQRRRSNRRERQREVSGELEGSEATDNAAAPL 795
+E E+ + ++ E+ + + + +VS + E SE A P
Sbjct: 1090 SET---KETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPA 1146

Query: 796 NTVAAAAAAGIAVASEAVEANVEQAPATTSEAASETTASDETDASTSEAVETQDADSEAN 855
+ A+ EQ TS + T + + VE + + A
Sbjct: 1147 RENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPAT 1206

Query: 856 TGETADIEAPVTVSVVRDEADQSTLLVAQATEEAPFASESVESREDAESAVQPATEAAEE 915
T T + E+ S ++R P
Sbjct: 1207 TQPTVNSES----------------------------SNKPKNRHRRSVRSVPHNV---- 1234

Query: 916 VAAPVPVEAAAPSEPATTEEPTPAIAAVPANATGRALN 953
EPATT + A+ + T N
Sbjct: 1235 -------------EPATTSSNDRSTVAL-CDLTSTNTN 1258



Score = 44.3 bits (104), Expect = 3e-06
Identities = 47/237 (19%), Positives = 74/237 (31%), Gaps = 25/237 (10%)

Query: 427 EALKDRTAEVRARVPFQVAAFLLNEKRNAITKIELRTRARIFILPDDHLETPHFEVQRLR 486
A T E A Q + + +++A + T EV +
Sbjct: 1030 PATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSG 1089

Query: 487 DDSPELVAGQTSYEMATVEHEEAQPVSSTRTLVRQEAAVKT--VAPQQPAPQHTEAPVEP 544
++ E +T E ATVE EE V + +T QE T V+P+Q + + EP
Sbjct: 1090 SETKETQTTETK-ETATVEKEEKAKVETEKT---QEVPKVTSQVSPKQEQSETVQPQAEP 1145

Query: 545 AKPMPEPSLFQGLVKSLVGLFAGKDQPAAKPAETSKPAAERQTRQDERRNGRQQNRRRDG 604
A +P+ K+ ++T+ A Q ++ N Q
Sbjct: 1146 A-RENDPT------------VNIKE----PQSQTNTTADTEQPAKETSSNVEQPVTESTT 1188

Query: 605 RDGNRRDEERKPREERAERQP--REERAERPNREERSERRREERAERPAREERQPRE 659
+ E A QP E + +P R R PA R
Sbjct: 1189 VNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRS 1245


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS15660  ENTSNTHTASED  29  0.022  Enterobactin synthetase component D signature.
		>ENTSNTHTASED#Enterobactin synthetase component D signature.

Length = 234

Score = 28.8 bits (64), Expect = 0.022
Identities = 29/128 (22%), Positives = 45/128 (35%), Gaps = 22/128 (17%)

Query: 15 HPALALLRPLEALYRRVANGRRADFLSGRKPAYRAPLPVLVVGNITVGGTGKTPM----I 70
L P R R+A+ L+GR A A L + V + G + P+ +
Sbjct: 26 REHDLLWLPHHDRLRSAGRKRKAEHLAGRIAAVHA-LREVGVRTVPGMGDKRQPLWPDGL 84

Query: 71 LWMIEHCRARGLRVGVISRGYGARPPTTPWRVRAEQDAAEAGDEPLMIVRRSGVPLMIDP 130
I HC L V + + G + E+ ++ L P +ID
Sbjct: 85 FGSISHCATTALAV-ISRQRIG---------IDIEKIMSQHTATEL-------APSIIDS 127

Query: 131 DRPRALQA 138
D + LQA
Sbjct: 128 DERQILQA 135


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS15795  HTHTETR  69  2e-16  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 68.9 bits (168), Expect = 2e-16
Identities = 25/93 (26%), Positives = 40/93 (43%), Gaps = 1/93 (1%)

Query: 4 SETVERILDAAEQLFAEKGFAETSLRLITSKAGVNLAAVNYHFGSKKALIQAVFSRFLGP 63
ET + ILD A +LF+++G + TSL I AGV A+ +HF K L ++
Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESN 69

Query: 64 FCASLEKELDRRQAKPEAQ-HATLEDLLHLLVS 95
+ + P + L +L V+
Sbjct: 70 IGELELEYQAKFPGDPLSVLREILIHVLESTVT 102


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS15850  SHAPEPROTEIN  27  0.019  Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 27.4 bits (61), Expect = 0.019
Identities = 12/39 (30%), Positives = 18/39 (46%), Gaps = 2/39 (5%)

Query: 17 DPVMKRAAALATSNQARLSVVHVV-EPMAMAFGGDVPMD 54
V +RA + A V ++ EPMA A G +P+
Sbjct: 119 TQVERRAIRESAQG-AGAREVFLIEEPMAAAIGAGLPVS 156


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS15860  RTXTOXIND  33  0.003  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 33.3 bits (76), Expect = 0.003
Identities = 16/62 (25%), Positives = 26/62 (41%), Gaps = 2/62 (3%)

Query: 575 KLQRELEALPGQIDAVEAELAGVQETIAQ--QDFYLRPQDEQRETLARLDALQQELDALL 632
+ EL Q++ +E+E+ +E Q F D+ R+T + L EL
Sbjct: 263 EAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNE 322

Query: 633 ER 634
ER
Sbjct: 323 ER 324


47    HWH78_RS16050    HWH78_RS16275    Y    Y    N    Pathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
HWH78_RS16050224-1.498800helix-turn-helix transcriptional regulator
HWH78_RS16055223-1.268466hypothetical protein
HWH78_RS16060123-0.381496DUF2274 domain-containing protein
HWH78_RS16065221-0.513981TrbI/VirB10 family protein
HWH78_RS16070220-0.820135P-type conjugative transfer protein TrbG
HWH78_RS16075220-0.782906conjugal transfer protein TrbF
HWH78_RS16080221-0.826892P-type conjugative transfer protein TrbL
HWH78_RS16085321-0.396595hypothetical protein
HWH78_RS16090320-0.811939P-type conjugative transfer protein TrbJ
HWH78_RS16095221-1.278041conjugal transfer protein TrbE
HWH78_RS16100127-2.540598VirB3 family type IV secretion system protein
HWH78_RS16105027-2.864804TrbC/VirB2 family protein
HWH78_RS16110026-3.017773P-type conjugative transfer ATPase TrbB
HWH78_RS16115033-5.026980ribbon-helix-helix protein, CopG family
HWH78_RS16120136-5.924625conjugal transfer protein TraG
HWH78_RS16125048-8.324821LysR family transcriptional regulator
HWH78_RS16130149-5.814975hypothetical protein
HWH78_RS16135152-5.878159multidrug efflux SMR transporter
HWH78_RS16140053-5.467935EexN family lipoprotein
HWH78_RS16145043-5.258793LysR family transcriptional regulator
HWH78_RS16150043-4.864599TolC family protein
HWH78_RS16155142-4.714081TetR family transcriptional regulator
HWH78_RS16160139-5.517965efflux RND transporter periplasmic adaptor
HWH78_RS16165039-5.555041efflux RND transporter permease subunit
HWH78_RS16170138-6.287120hypothetical protein
HWH78_RS16175046-6.441872DUF3313 family protein
HWH78_RS16180035-3.979693ABC transporter permease
HWH78_RS16185032-3.000476ATP-binding cassette domain-containing protein
HWH78_RS16190329-1.551905MlaD family protein
HWH78_RS16195327-0.749769ABC-type transport auxiliary lipoprotein family
HWH78_RS162003201.437295relaxase/mobilization nuclease and DUF3363
HWH78_RS162053162.188889S26 family signal peptidase
HWH78_RS162105192.594421DUF2840 domain-containing protein
HWH78_RS162156231.917156hypothetical protein
HWH78_RS162204240.967548AAA family ATPase
HWH78_RS162254250.141442replication initiator protein A
HWH78_RS16230227-0.820119helix-turn-helix domain-containing protein
HWH78_RS16235331-0.780375DUF2285 domain-containing protein
HWH78_RS16240325-0.240193DUF2958 domain-containing protein
HWH78_RS16245523-0.901251helix-turn-helix domain-containing protein
HWH78_RS16250433-7.638961DUF736 domain-containing protein
HWH78_RS16255432-7.455116hypothetical protein
HWH78_RS16265429-6.647208ParB N-terminal domain-containing protein
HWH78_RS16270126-8.300886DUF945 domain-containing protein
HWH78_RS16275023-7.138793AAA family ATPase
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS16070    PF03544    29    0.022    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 29.2 bits (65), Expect = 0.022
Identities = 18/81 (22%), Positives = 29/81 (35%), Gaps = 3/81 (3%)

Query: 23 QGKPPPSISLDETVLAQPLPEPPKPVEVV---AVPEPLALPAQLKPLPELDEAPVAPEPA 79
+PPP ++ +P+PEPPK VV P+P P +K + + E
Sbjct: 65 AVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESR 124

Query: 80 DEKVRVSRANAEARIAPTREG 100
+ A A +
Sbjct: 125 PASPFENTAPARPTSSTATAA 145


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS16080    adhesinmafb    32    0.005    Neisseria meningitidis: adhesin MafB signature.
		>adhesinmafb#Neisseria meningitidis: adhesin MafB signature.

Length = 467

Score = 32.0 bits (72), Expect = 0.005
Identities = 29/118 (24%), Positives = 39/118 (33%), Gaps = 11/118 (9%)

Query: 272 GAGAMAGAAVGAVGTGVAIGAAVTGVGGAVMAGARMAPAAAKLAGAG-----ARAATSAA 326
GA +A A+G G + + A M PA K A G A +
Sbjct: 234 GALNPFISAGEALGIGDILYGTRYAIDKAAMRNIAPLPAEGKFAVIGGLGSVAGFEKNTR 293

Query: 327 GSARSAFQAG-SAAAGGGAKGAAAGLGNVAKTGAQAAGRSVTSGASAVGQKVADSFRA 383
+ Q +AA A A VAK A G +AV ADS++
Sbjct: 294 EAVDRWIQENPNAAETVEAVFNVAAAAKVAKLAKAA-----KPGKAAVSGDFADSYKK 346


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS16125    PF05043    35    2e-04    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 35.3 bits (81), Expect = 2e-04
Identities = 34/213 (15%), Positives = 70/213 (32%), Gaps = 32/213 (15%)

Query: 1 MKNIKSMD-LNLLKALDALLDER---NVTRAAARLGLTQPALSGMLTRLRESFGDPLFAR 56
M+++ S L+ L+ L + + + + A L T+ A+ L+ ++ +F D +F
Sbjct: 1 MRDLLSKKSHRQLELLELLFEHKRWFHRSELAELLNCTERAVKDDLSHVKSAFPDLIFHS 60

Query: 57 SQRGIVPTQ-RALDLGMPVKQVLAEIDALLQPPSFNPATAQLTFSIAATDYALRAVAVP- 114
S GI D+ M + L F ++
Sbjct: 61 STNGIRIINTDDSDIEMVYHHFFKHSTHF----------SILEFIFFNEGCQAESICKEF 110

Query: 115 FLSALKRHAPRVRVSLVPVESGQLQNQLERGQIDLALLTPEITPPNLHAR----ELFKEH 170
++S R+ + V ++ Q + +++L +I R + F E
Sbjct: 111 YIS--SSSLYRIISQINKV----IKRQFQ---FEVSLTPVQIIGNERDIRYFFAQYFSEK 161

Query: 171 YVCVLREDHPAAMGRKLTVKQFCALDHALVSYD 203
Y + P + Q L + S+
Sbjct: 162 YYFLE---WPFENFSSEPLSQLLELVYKETSFP 191


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS16160    HTHTETR    44    5e-08    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 43.8 bits (103), Expect = 5e-08
Identities = 28/156 (17%), Positives = 58/156 (37%), Gaps = 8/156 (5%)

Query: 2 TLDAVVVRAGLSKGAFLYHFKTKRDLFVTLIDEMIRAFDAVQANHERRFAGDPDPWLSSQ 61
+L + AG+++GA +HFK K DLF + + ++ ++ +F GDP L
Sbjct: 33 SLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQAKFPGDPLSVLREI 92

Query: 62 VEAMPD----DEMQKMGAALLAAAAEDPTLLDPLREWYRVQYERVRRSPRGTETAAL--- 114
+ + + +E +++ ++ E + +++ R T +
Sbjct: 93 LIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAK 152

Query: 115 -IMLALDGALFADLLGLPILAPAERRHFFRALQDLA 149
+ L A ++ I E F DL
Sbjct: 153 MLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLK 188


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS16165    RTXTOXIND    41    9e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 40.6 bits (95), Expect = 9e-06
Identities = 18/141 (12%), Positives = 43/141 (30%), Gaps = 10/141 (7%)

Query: 103 LDAQPDRLRVTQAQASLAAAEAGLMDRRVQTDQQRRLLESEVISPAAFESAKAQLAVAEG 162
L+ + + + + + ++ +L+ + + + +
Sbjct: 255 LEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLL 314

Query: 163 QARTAKAALGLAERAQRGTMIVAPFDGVVAEKLALAFTD---IAAGAPVFQVDGVRSGTE 219
AK E Q+ ++I AP V + T+ + + + E
Sbjct: 315 TLELAKN-----EERQQASVIRAPVSVKVQQ--LKVHTEGGVVTTAETLMVIVPEDDTLE 367

Query: 220 IIANASTTQAPHIDVGQRAEL 240
+ A I+VGQ A +
Sbjct: 368 VTALVQNKDIGFINVGQNAII 388



Score = 40.2 bits (94), Expect = 9e-06
Identities = 16/103 (15%), Positives = 37/103 (35%), Gaps = 2/103 (1%)

Query: 79 TGGRIAKLNVDVGERFSRGQVLAELDAQPDRLRVTQAQASLAAAEAGLMDRRVQTDQQRR 138
+ ++ V GE +G VL +L A + Q+SL A ++ +
Sbjct: 103 ENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIEL 162

Query: 139 --LLESEVISPAAFESAKAQLAVAEGQARTAKAALGLAERAQR 179
L E ++ F++ + + + + ++ Q+
Sbjct: 163 NKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQK 205


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS16170    ACRIFLAVINRP    470    e-151    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 470 bits (1210), Expect = e-151
Identities = 230/1055 (21%), Positives = 437/1055 (41%), Gaps = 71/1055 (6%)

Query: 3 ITEMALRASRLTYFVALIIFVAGIATFLNFPSQEEPTVTVRDAMVTALNPGLPAERVEQL 62
+ +R + +A+I+ +AG L P + PT+ V+A PG A+ V+
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 63 IARPIEERLRELAEVKRVTST-VRAGSAMIQVTIWDRYTDLAPIWQRVRAKVADSKDALP 121
+ + IE+ + + + ++ST AGS I +T + TD +V+ K+ + LP
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLT-FQSGTDPDIAQVQVQNKLQLATPLLP 119

Query: 122 ---QSTMGPFVDEDFGRVAVASIAVTAPGYSMSEMRV-ALKQMRDRLYTVPGIERITFYG 177
Q + VA PG + ++ ++D L + G+ + +G
Sbjct: 120 QEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG 179

Query: 178 LQEE-RVYLEFDRPRLARLELTPQGVIDQLVKQNVVASGGQIVVG------GINATLAVS 230
Q R++L+ D L + +LTP VI+QL QN + GQ+ +NA++
Sbjct: 180 AQYAMRIWLDADL--LNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQ 237

Query: 231 GEVRDAPSLRAMPIALPRPQSSTAPVPTIALGELAQVSVRPADPPESAAIYKGQPAVVMA 290
++ + + R S + V L ++A+V + A G+PA +
Sbjct: 238 TRFKNPEEFGKVTL---RVNSDGSVVR---LKDVARV-ELGGENYNVIARINGKPAAGLG 290

Query: 291 VSMASGQNVEQFGKALKARVADQEKLLPAGFDLSYVTFQADVVKHEMGKMNHVMMETIIV 350
+ +A+G N KA+KA++A+ + P G + Y V+ + ++ + E I++
Sbjct: 291 IKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIML 350

Query: 351 VLGVVVLFLG-WRTGIIVGMIVPLTILSALIVMRAMNIELQNVSMGAIIIALGLLVDNGI 409
V V+ LFL R +I + VP+ +L ++ A + ++M +++A+GLLVD+ I
Sbjct: 351 VFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAI 410

Query: 410 VIAEDIERRLA-GGEDRKHACLEAGRTLAIPLLTSSLVIVIVFSPFFFGQNATSEYLHNL 468
V+ E++ER + K A ++ + L+ ++V+ VF P F +T
Sbjct: 411 VVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQF 470

Query: 469 VVVLALTLFASWLLCLTVTPLLCYHFAKP----HHKQEQG--DAYDTRFYR---GYRRVL 519
+ + + S L+ L +TP LC KP HH+ + G ++T F Y +
Sbjct: 471 SITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSV 530

Query: 520 EWVLHHKAVYVASMIAALAIALYGFTTLPYDFMPKSDRLQFQIPVQLAPGTDSRETLARV 579
+L Y+ +A + F LP F+P+ D+ F +QL G T +
Sbjct: 531 GKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVL 590

Query: 580 KQISGWLGDTNI-NPEVSDHIGYVADGGPRFILSLNPPLPASNIAYFVVTLKPKSD---- 634
Q++ + N E + + G A N V+LKP +
Sbjct: 591 DQVTDYYLKNEKANVESVFTVNGFSFSGQ-----------AQNAGMAFVSLKPWEERNGD 639

Query: 635 ---IDAVLARTRSYFAQAHGDVRA--EPKRFSLGATESGTAIYRV--SGPDEEVLLGAAS 687
+AV+ R + + T +G + +G + L A +
Sbjct: 640 ENSAEAVIHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARN 699

Query: 688 KIEAALRKLPGT-INVKNDWDTRVGRIDVRVDQDRARRAGVTTEDIAGGLDVRYSGRSIS 746
++ + P + ++V+ + + + VDQ++A+ GV+ DI + G ++
Sbjct: 700 QLLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVN 759

Query: 747 VIRDGDTSVPIVLRSIVSERRSTADVGATLIYPTNGGPAVTLAQVADVSLASEPSVIQRR 806
D + +++ R DV + + G V + ++R
Sbjct: 760 DFIDRGRVKKLYVQADAKFRMLPEDVDKLYV-RSANGEMVPFSAFTTSHWVYGSPRLERY 818

Query: 807 NLIRTITVQGQNT----SYTAQEIINRLAPSVAALDLPAGYSVELGGEIEEAAESNAALS 862
N + ++ +QG+ S A ++ LA LPAG + G + S
Sbjct: 819 NGLPSMEIQGEAAPGTSSGDAMALMENLAS-----KLPAGIGYDWTGMSYQERLSGNQAP 873

Query: 863 TYMPLAFLAMLMLFVWQFNSFRKLGVILATIPFTLIGVVLALKLTGTPFSFMATFGVLAL 922
+ ++F+ + + + S+ ++ +P ++GV+LA L G+L
Sbjct: 874 ALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTT 933

Query: 923 FGIIVNNAVLLLERIDQ-GLAEGLPRHEALVGAAIQRLRPIVMTKVTCISGLVPLMLFSG 981
G+ NA+L++E EG EA + A RLRPI+MT + I G++PL + +G
Sbjct: 934 IGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNG 993

Query: 982 P---LWKGMAIAMIGGLALGTLVTLGLIPLLYEVL 1013
+ I ++GG+ TL+ + +P+ + V+
Sbjct: 994 AGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVI 1028



Score = 79.5 bits (196), Expect = 4e-17
Identities = 99/526 (18%), Positives = 193/526 (36%), Gaps = 61/526 (11%)

Query: 516 RRVLEWVLHHKAVYVASMIAALAIALYGFTTLPYDFMPKSDRLQFQIPVQLAPGTDSRET 575
R + WVL I + LP P + PG D++
Sbjct: 8 RPIFAWVL---------AIILMMAGALAILQLPVAQYPTIAPPAVSVSAN-YPGADAQTV 57

Query: 576 LARVKQISGWLGDTNINPEVS--DHIGYVADGGPRFILSLNPPLPASNIAYFVVTLKPKS 633
V Q+ I ++ D++ Y++ +T + +
Sbjct: 58 QDTVTQV--------IEQNMNGIDNLMYMSSTSDSAGSV-----------TITLTFQSGT 98

Query: 634 DIDAVLARTRSYFAQAHG----DVRAE--PKRFSLGATESGTAIYRVSGPDEEVLLG--A 685
D D + ++ A +V+ + S + + + +
Sbjct: 99 DPDIAQVQVQNKLQLATPLLPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYV 158

Query: 686 ASKIEAALRKLPGTINVKNDWDTRVGRIDVRVDQDRARRAGVTTEDIAGGLDVRYSGRSI 745
AS ++ L +L G +V+ RI + D D + +T D+ L V+ +
Sbjct: 159 ASNVKDTLSRLNGVGDVQLFGAQYAMRIWL--DADLLNKYKLTPVDVINQLKVQNDQIAA 216

Query: 746 SVIRDGDTSVPI--VLRSIVSERR--STADVGATLIYPTNGGPAVTLAQVADVSLASEP- 800
+ G ++P + SI+++ R + + G + + G V L VA V L E
Sbjct: 217 GQL-GGTPALPGQQLNASIIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENY 275

Query: 801 SVIQRRNLIRTITVQ-----GQNTSYTAQEIINRLAPSVAALDLPAGYSVELGGEIEEAA 855
+VI R N + G N TA+ I +LA + P G V +
Sbjct: 276 NVIARINGKPAAGLGIKLATGANALDTAKAIKAKLA-ELQP-FFPQGMKVLYPYDTTPFV 333

Query: 856 ES--NAALSTYMPLAFLAMLMLFVWQFNSFRKLGVILATIPFTLIGVVLALKLTGTPFSF 913
+ + + T L L+++++ + R + +P L+G L G +
Sbjct: 334 QLSIHEVVKTLFEAIMLVFLVMYLF-LQNMRATLIPTIAVPVVLLGTFAILAAFGYSINT 392

Query: 914 MATFGVLALFGIIVNNAVLLLERIDQGLAE-GLPRHEALVGAAIQRLRPIVMTKVTCISG 972
+ FG++ G++V++A++++E +++ + E LP EA + Q +V + +
Sbjct: 393 LTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAV 452

Query: 973 LVPLMLFSG---PLWKGMAIAMIGGLALGTLVTLGLIPLLYEVLFG 1015
+P+ F G +++ +I ++ +AL LV L L P L L
Sbjct: 453 FIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLK 498


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS16265    FbpA_PF05833    30    0.029    Fibronectin-binding protein
		>FbpA_PF05833#Fibronectin-binding protein

Length = 577

Score = 30.2 bits (68), Expect = 0.029
Identities = 15/71 (21%), Positives = 24/71 (33%), Gaps = 11/71 (15%)

Query: 310 FQRAPRERRSPNKRDAQR-----IEKLQTKLHELAEAVDAALDDEDEEKADALQEEGERL 364
+ + +R D Q+ I + K L + E D + GE L
Sbjct: 289 YAKDKSDRLKSKSSDLQKIVMNNINRCTKKDKILNNTLKKC------EDKDIFKLYGELL 342

Query: 365 GEQLQALEDGL 375
+ AL+ GL
Sbjct: 343 TANIYALKKGL 353


48    HWH78_RS16710    HWH78_RS16830    Y    Y    Y    Pathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
HWH78_RS16710015-3.296970multidrug efflux MFS transporter
HWH78_RS16715022-3.928628excinuclease ABC subunit B
HWH78_RS16720134-5.558680aspartate/tyrosine/aromatic aminotransferase
HWH78_RS16730249-7.834759*ComEA family DNA-binding protein
HWH78_RS16735254-9.469475polysaccharide biosynthesis protein
HWH78_RS16740368-11.556259glycosyltransferase family 4 protein
HWH78_RS16745473-13.447288SDR family oxidoreductase
HWH78_RS16750477-15.094066glycosyltransferase family 4 protein
HWH78_RS16755478-16.287487glycosyltransferase
HWH78_RS16760479-16.590662asparagine synthase (glutamine-hydrolyzing)
HWH78_RS16765580-17.206523glycosyltransferase family 4 protein
HWH78_RS16770578-17.170847oligosaccharide flippase family protein
HWH78_RS16775455-13.714764Vi polysaccharide biosynthesis
HWH78_RS16780444-10.675444Vi polysaccharide biosynthesis
HWH78_RS16785326-5.227998LPS O-antigen chain length determinant protein
HWH78_RS16790318-2.427398lipopolysaccharide assembly protein LapA
HWH78_RS16795215-2.113052integration host factor subunit beta
HWH78_RS16800113-2.07762830S ribosomal protein S1
HWH78_RS16805112-0.804794(d)CMP kinase
HWH78_RS16810112-0.718967bifunctional prephenate
HWH78_RS16815111-0.546428histidinol-phosphate transaminase
HWH78_RS16820211-0.660244prephenate dehydratase
HWH78_RS16825212-0.0940743-phosphoserine/phosphohydroxythreonine
HWH78_RS168302130.299211DNA gyrase subunit A
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS16710    TCRTETB    104    3e-26    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 104 bits (261), Expect = 3e-26
Identities = 85/402 (21%), Positives = 165/402 (41%), Gaps = 19/402 (4%)

Query: 8 AFMAVLDIQITNSSLKDIQGALAATLEEGSWISTSYLVAEIIMIPMTAWLVQLLSARRLA 67
+F +VL+ + N SL DI +W++T++++ I + L L +RL
Sbjct: 23 SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLL 82

Query: 68 VMISVGFLVSSLLCSFAWNLESMIVF-RAMQGFTGGALIPLAFTLALVKLPEHHRPKGMA 126
+ + S++ + S+++ R +QG A L + +P+ +R K
Sbjct: 83 LFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFG 142

Query: 127 LFAITATFAPSIGPTLGGWLTENFGWEYIFYINVPPGLLMIAGLLYGLEKKAPHWELLKS 186
L +GP +GG + W Y+ I + ++ + L+ L+K+
Sbjct: 143 LIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMIT-IITVPFLMKLLKKEVRIKGHF-- 199

Query: 187 TDYAGIVTLGIGLGCLQVFLEEGHRKDWLESQLIVSLGSVALFSLVLFVILQLSRPNPLI 246
D GI+ + +G+ +F + S LIVS + S ++FV +P +
Sbjct: 200 -DIKGIILMSVGIVFFMLFT-----TSYSISFLIVS-----VLSFLIFVKHIRKVTDPFV 248

Query: 247 DLGILRNRNFGLASISSIGLGMGLYGSIYVLPLYLAQIQGYNAMQIGEVIMWMG-IPQLF 305
D G+ +N F + + + + G + ++P + + + +IG VI++ G + +
Sbjct: 249 DPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVII 308

Query: 306 LIPLVPKLMRLVSPRLLCAAGFGLFGLASFFSGVLNPDFAGPQFNQIQLLRALG-QPMIM 364
+ L+ P + G ++ + L F I ++ LG
Sbjct: 309 FGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFL--LETTSWFMTIIIVFVLGGLSFTK 366

Query: 365 VTISLIATAYLQPQDAGSASSLFNILRNLGGAIGIALLATLL 406
IS I ++ L+ Q+AG+ SL N L GIA++ LL
Sbjct: 367 TVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLL 408


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS16735    NUCEPIMERASE    57    8e-11    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 57.1 bits (138), Expect = 8e-11
Identities = 46/292 (15%), Positives = 103/292 (35%), Gaps = 56/292 (19%)

Query: 301 VMVTGAGGSIGSELCRQIMSCSPSVLILFEHSEYNLYSIHQELERRIKRESLSVNLLPIL 360
+VTGA G IG + ++++ V+ + ++Y S+ Q + +
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQP----GFQFHK 58

Query: 361 GSVRNPERLVDVMRTWKVNTVYHAAAYKHVPIVEHNIAEGVLNNVIGTLHAVQAAVQVGV 420
+ + E + D+ + V+ + V N +N+ G L+ ++ +
Sbjct: 59 IDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKI 118

Query: 421 QNFVLIST---------------DKAVRPTNVMGSTKRLAEMVLQALSNESAPVLFGDRK 465
Q+ + S+ D P ++ +TK+ E++ S
Sbjct: 119 QHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYS------------ 166

Query: 466 DVHHVNKTRFTMVRFGNVLGSSGS---VIPLFREQIKRGGPVTV-THPSITRYFMTIPEA 521
H+ T +RF V G G + F + + G + V + + R F I +
Sbjct: 167 ---HLYGLPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDI 223

Query: 522 AQLVIQA----------GSMGQGGD--------VFVLDMGPPVKILELAEKM 555
A+ +I+ ++ G V+ + PV++++ + +
Sbjct: 224 AEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQAL 275


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS16745    NUCEPIMERASE    66    3e-14    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 66.0 bits (161), Expect = 3e-14
Identities = 67/356 (18%), Positives = 120/356 (33%), Gaps = 69/356 (19%)

Query: 5 NVLVTGATGFIGAALVNSLCSSGQ-----------YKVWAGCRRRGGAWPRGVTP----L 49
LVTGA GFIG + L +G Y V R G L
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 50 LLGELGSSVVWDAESAIDTVVHCAARVHV-MSETASDPLVEFRKANVQGT---LDLAREA 105
E + + + V R+ V S + +N+ G L+ R
Sbjct: 62 ADREGMTDLFASGH--FERVFISPHRLAVRYSLENPHAYAD---SNLTGFLNILEGCRHN 116

Query: 106 VSRGVRRFIFISSIKVNGEGTEPGRPY-TADSPPNPVDPYGVSKREAEQALLDLAEETGL 164
++ ++ SS V G + P+ T DS +PV Y +K+ E + GL
Sbjct: 117 ---KIQHLLYASSSSVYGLNRKM--PFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGL 171

Query: 165 EVVIIRPVLVYGPGVKAN--VQTMMRWLKRGVPLPL-GAIHNRRSLVSLDNLVDLIITCI 221
+R VYGP + + + + + G + + +R +D++ + II
Sbjct: 172 PATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQ 231

Query: 222 EHPA-----------------AVGQVFLVSDGEDLSTTELLRRMGRALGAPAR--LLPVP 262
+ A +V+ + + + + ++ + ALG A+ +LP+
Sbjct: 232 DVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQ 291

Query: 263 ASWIGAAAKVLNRQAFARRLCGSLQVDIMKTRQVLGWTPPVGVDQALEKTARSFLD 318
V + A D +V+G+TP V ++ + D
Sbjct: 292 ------PGDV--LETSA---------DTKALYEVIGFTPETTVKDGVKNFVNWYRD 330


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS16765    PF07520    29    0.034    Virulence protein SrfB
		>PF07520#Virulence protein SrfB

Length = 1041

Score = 28.8 bits (64), Expect = 0.034
Identities = 24/119 (20%), Positives = 46/119 (38%), Gaps = 10/119 (8%)

Query: 15 MHMSEALVSAYPPARRLDRPLWLLRELLHRLPQVIGSYGSDVVILQRELLSTIPTLEFLT 74
+ ++EA++SA A DR + ++L +P +G G + T L++L
Sbjct: 696 VPLAEAILSACEDAEEADRIDIPVADVLGLVPTPVGEEGDEEGHEDASPQVTDEILDYLE 755

Query: 75 K--------APRILDVDDAIWLHRRGIAANSIARRVDHIVCG--NQYLADYFGQFGRPT 123
K R+ D+ + A + ++V +C + D GRP+
Sbjct: 756 KPATQLGAEGWRLADMVLSASREDLDAIAREVFQKVLGNMCEVIDHLGCDVVLLTGRPS 814


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS16775    NUCEPIMERASE    257    2e-86    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 257 bits (659), Expect = 2e-86
Identities = 101/341 (29%), Positives = 159/341 (46%), Gaps = 30/341 (8%)

Query: 19 LITGVAGFIGSNLLETLLKLDQKVVGLDNFATGHQRNLDEVRSLVSEKQWSNFKFIQGDI 78
L+TG AGFIG ++ + LL+ +VVG+DN + +L + R + + F+F + D+
Sbjct: 4 LVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQ--PGFQFHKIDL 61

Query: 79 RNLDDCNNACA--GVDYVLHQAALGSVPRSINDPITSNATNIDGFLNMLIAARDAKVQSF 136
+ + + A + V +V S+ +P +N+ GFLN+L R K+Q
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121

Query: 137 TYAASSSTYGDHPGLP-KVEDTIGKPLSPYAVTKYVNELYADVFSRCYGFSTIGLRYFNV 195
YA+SSS YG + +P +D++ P+S YA TK NEL A +S YG GLR+F V
Sbjct: 122 LYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFFTV 181

Query: 196 FGRRQDPNGAYAAVIPKWTSSMIQGDDVYINGDGETSRDFCYIENTVQANLLAATAGLDA 255
+G P+ A K+T +M++G + + G+ RDF YI++ +A + A
Sbjct: 182 YGPWGRPDMAL----FKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHA 237

Query: 256 RNQ----------------VYNIAVGGRTSLNQLFFALRDGLAENGVSYHREPVYRDFRE 299
Q VYNI L AL D L G+ + +
Sbjct: 238 DTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDAL---GIEAKKN--MLPLQP 292

Query: 300 GDVRHSLADISKAAKLLGYAPKYDVSAGVALAMPWYIMFLK 340
GDV + AD +++G+ P+ V GV + WY F K
Sbjct: 293 GDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDFYK 333


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS16795    DNABINDINGHU    118    1e-38    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 118 bits (297), Expect = 1e-38
Identities = 35/89 (39%), Positives = 54/89 (60%), Gaps = 1/89 (1%)

Query: 2 TKSELIERIVTHQGQLSAKDVELAIKTMLEQMSQALATGDRIEIRGFGSFSLHYRAPRVG 61
K +LI + V +L+ KD A+ + +S LA G+++++ GFG+F + RA R G
Sbjct: 3 NKQDLIAK-VAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKG 61

Query: 62 RNPKTGESVRLDGKFVPHFKPGKELRDRV 90
RNP+TGE +++ VP FK GK L+D V
Sbjct: 62 RNPQTGEEIKIKASKVPAFKAGKALKDAV 90


49    HWH78_RS16970    HWH78_RS17030    Y    Y    N    Pathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
HWH78_RS169702130.155048DUF1289 domain-containing protein
HWH78_RS169753140.636968hypothetical protein
HWH78_RS16980312-0.129563SMC-Scp complex subunit ScpB
HWH78_RS169853120.417258segregation/condensation protein A
HWH78_RS16990114-0.242786threonylcarbamoyl-AMP synthase
HWH78_RS16995-1140.622594PHP domain-containing protein
HWH78_RS170000130.659619septation protein A
HWH78_RS170051161.090075YciI family protein
HWH78_RS170102152.928802hypothetical protein
HWH78_RS170153143.454961response regulator transcription factor
HWH78_RS170202143.472730Spy/CpxP family protein refolding chaperone
HWH78_RS170252131.835014HAMP domain-containing histidine kinase
HWH78_RS170302111.871503hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS16980    OUTRMMBRANEA    28    0.028    Outer membrane protein A signature.
		>OUTRMMBRANEA#Outer membrane protein A signature.

Length = 346

Score = 27.6 bits (61), Expect = 0.028
Identities = 16/48 (33%), Positives = 28/48 (58%), Gaps = 1/48 (2%)

Query: 86 SFGKDVAFNIKPKQLLKTEPLVALNRIQVQLSSMYPEKGGYVLVDFSE 133
+ DV FN K LK E AL+++ QLS++ P+ G V++ +++
Sbjct: 216 TLKSDVLFNFN-KATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTD 262


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS17010    adhesinmafb    30    9e-04    Neisseria meningitidis: adhesin MafB signature.
		>adhesinmafb#Neisseria meningitidis: adhesin MafB signature.

Length = 467

Score = 30.4 bits (68), Expect = 9e-04
Identities = 13/45 (28%), Positives = 18/45 (40%)

Query: 53 AAGFTGSLIVAEFDSLAAAQSWAEADPYRAAGVYAEVVVKPFKKV 97
G GS+ E ++ A W + +P A V A V KV
Sbjct: 278 VIGGLGSVAGFEKNTREAVDRWIQENPNAAETVEAVFNVAAAAKV 322


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS17015    IGASERPTASE    28    0.010    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 28.1 bits (62), Expect = 0.010
Identities = 16/76 (21%), Positives = 31/76 (40%), Gaps = 8/76 (10%)

Query: 22 EEPAPAPIPAAQPSITQATAELERRLVETERQRDELVSRMRQENRQLREQ--------LQ 73
E P P P PA T+ AE ++ +T + ++ + +NR++ ++ Q
Sbjct: 1022 EAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQ 1081

Query: 74 AAQAQRQPPLLTEEQT 89
+ + E QT
Sbjct: 1082 TNEVAQSGSETKETQT 1097


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS17020    HTHFIS    103    9e-28    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 103 bits (258), Expect = 9e-28
Identities = 42/117 (35%), Positives = 63/117 (53%)

Query: 4 LLLIDDDRELCELLGTWLVQEGFSVRASHDGAQARRALAEQTPDAVVLDVMLPDGSGLEL 63
+L+ DDD + +L L + G+ VR + + A R +A D VV DV++PD + +L
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 64 LKQLRGDHPDLPVLMLSARGEPLDRILGLELGADDYLAKPCDPRELTARLRAVLRRT 120
L +++ PDLPVL++SA+ + I E GA DYL KP D EL + L
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS17030    PF06580    29    0.026    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.4 bits (66), Expect = 0.026
Identities = 16/100 (16%), Positives = 35/100 (35%), Gaps = 17/100 (17%)

Query: 341 VDNLLRNAVRFNPVGQPLEVRASSAGDYLRLSVRDHGPGIAAELQEQLGEPFFRAPNQSS 400
V+N +++ + P G + ++ + + L V + G +E
Sbjct: 264 VENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE-------------- 309

Query: 401 PGHGLGLA-IARRAIERHGGHLRLG-NHPDGGFIATLSLP 438
G GL + R +G ++ + G A + +P
Sbjct: 310 -STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348


50    HWH78_RS17380    HWH78_RS17485    Y    Y    N    Pathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
HWH78_RS173800123.021563DEAD/DEAH box helicase
HWH78_RS17385115-0.409653hypothetical protein
HWH78_RS17390-111-1.626226hypothetical protein
HWH78_RS17395213-2.146135YnfA family protein
HWH78_RS17400212-2.218286hypothetical protein
HWH78_RS17405113-1.964845SDR family oxidoreductase
HWH78_RS17410213-1.773112hypothetical protein
HWH78_RS17415213-1.845860outer membrane porin OprP
HWH78_RS17420013-1.422008porin
HWH78_RS174251150.103832DoxX family protein
HWH78_RS17430-113-0.273441DUF2063 domain-containing protein
HWH78_RS17435-212-0.124932DUF692 domain-containing protein
HWH78_RS17440-1120.286564hypothetical protein
HWH78_RS17445-2110.659240zf-HC2 domain-containing protein
HWH78_RS17450217-2.611482RNA polymerase sigma factor
HWH78_RS17455324-4.685781beta-ketoacyl-ACP synthase III
HWH78_RS17460637-6.232082ankyrin repeat domain-containing protein
HWH78_RS17465645-8.021751cysteine hydrolase family protein
HWH78_RS17470851-9.728811hypothetical protein
HWH78_RS17475848-8.648736DUF2235 domain-containing protein
HWH78_RS17480329-5.035995DUF3304 domain-containing protein
HWH78_RS17485221-2.965700DUF3304 domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS17405    DHBDHDRGNASE    79    2e-19    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 78.9 bits (194), Expect = 2e-19
Identities = 54/186 (29%), Positives = 88/186 (47%), Gaps = 7/186 (3%)

Query: 6 MITGAGSGLGREIALRWARDGWRLALADVNEAGLAESLKLVREAGGDGFTQ---RCDVRD 62
ITGA G+G +A A G +A D N L K+V + DVRD
Sbjct: 12 FITGAAQGIGEAVARTLASQGAHIAAVDYNPEKL---EKVVSSLKAEARHAEAFPADVRD 68

Query: 63 YSQLTALAQSCEEKFGGIDVIVNNAGVASGGFFGELSLEDWDWQIAINLMGVVKGCKAFL 122
+ + + E + G ID++VN AGV G LS E+W+ ++N GV ++
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 123 P-LLERSKGKIVNIASMAALMQGPAMSNYNVAKAGVVALSESLLVELALVEVGVHVVCPS 181
+++R G IV + S A + +M+ Y +KA V ++ L +ELA + ++V P
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 182 FFQTNL 187
+T++
Sbjct: 189 STETDM 194


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS17465    ISCHRISMTASE    28    0.013    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 28.4 bits (63), Expect = 0.013
Identities = 18/93 (19%), Positives = 35/93 (37%), Gaps = 3/93 (3%)

Query: 75 AADRVFVKHGY--LPTAELVDHLRALRAERVLVCGIQADTCVLAAGFALFDAGLQPTLIG 132
D V K Y L++ +R +++++ GI A L F ++ +G
Sbjct: 116 DDDLVLTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVG 175

Query: 133 DLVLGSSLDRSGELGVRLWKHHFGQVVSLAEVL 165
D V SL++ ++ + V +L
Sbjct: 176 DAVADFSLEKH-QMALEYAAGRCAFTVMTDSLL 207


51    HWH78_RS17665    HWH78_RS17715    Y    Y    N    Pathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
HWH78_RS176652113.149309SDR family oxidoreductase
HWH78_RS176702103.361155metal-dependent hydrolase
HWH78_RS176753103.487155ATP-dependent Clp protease proteolytic subunit
HWH78_RS176802103.728289non-ribosomal peptide synthetase
HWH78_RS176852113.845044FAD-dependent monooxygenase
HWH78_RS176901123.162652hypothetical protein
HWH78_RS176951102.514899SDR family NAD(P)-dependent oxidoreductase
HWH78_RS177000102.321000cytochrome P450
HWH78_RS17705-1112.103187nuclear transport factor 2 family protein
HWH78_RS17710-1102.212582ketoacyl-ACP synthase III
HWH78_RS177152131.048754acyl carrier protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS17665    DHBDHDRGNASE    115    6e-31    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 115 bits (288), Expect = 6e-31
Identities = 65/209 (31%), Positives = 103/209 (49%), Gaps = 4/209 (1%)

Query: 319 DARSMSGKLVVVTGAGGGIGRSTLLSFAERGASLLAADLDLEAAERSAELARALGATAHV 378
+A+ + GK+ +TGA GIG + + A +GA + A D + E E+ +A A
Sbjct: 2 NAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEA 61

Query: 379 YQVDVGDTQAM-ETFAEWVRDTLGVPDVVVSNAGIGMAGPMLDTSPAEWERLLRVNLWSV 437
+ DV D+ A+ E A R+ + D++V+ AG+ G + S EWE VN V
Sbjct: 62 FPADVRDSAAIDEITARIEREMGPI-DILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGV 120

Query: 438 IDGCRLFGRQMIAANKPGHLVNVASGVAFAPSRNYPAYATSKAAVLMLSECLRAELAGRS 497
+ R + M+ + G +V V S A P + AYA+SKAA +M ++CL ELA +
Sbjct: 121 FNASRSVSKYMM-DRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYN 179

Query: 498 IGVTAVCPGFVDTGIVQATRFVGMDAERQ 526
I V PG +T + Q + + + Q
Sbjct: 180 IRCNIVSPGSTETDM-QWSLWADENGAEQ 207


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS17695    DHBDHDRGNASE    77    2e-18    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 77.0 bits (189), Expect = 2e-18
Identities = 49/188 (26%), Positives = 86/188 (45%), Gaps = 4/188 (2%)

Query: 11 QGRHVLITGASSGLGRETALHLAEQGFQVIAGVRRQEDGERLANACPS-GRISTLL-IDV 68
+G+ ITGA+ G+G A LA QG + A E E++ ++ + R + DV
Sbjct: 7 EGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADV 66

Query: 69 TDEESIGRAAAQVAEKVGDTGLWGLVNNAGICISAPLECVSSDLLRRQLEVNLIGQLAVT 128
D +I A++ ++G + LVN AG+ + +S + VN G +
Sbjct: 67 RDSAAIDEITARIEREMG--PIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNAS 124

Query: 129 RAILPLLRRGGAARLVNVTSGLGSVAIPYLGAYSAAQFAKEGVSDALRRELAPMGIQVSV 188
R++ + + +V V S V + AY++++ A + L ELA I+ ++
Sbjct: 125 RSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184

Query: 189 VSPGAIWT 196
VSPG+ T
Sbjct: 185 VSPGSTET 192


52    HWH78_RS17835    HWH78_RS17910    Y    Y    Y    Pathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
HWH78_RS178353100.801240DMT family transporter
HWH78_RS17840110-0.079663DUF2955 domain-containing protein
HWH78_RS17845-190.902746HlyD family secretion protein
HWH78_RS17850-111-0.581324fucose-binding lectin LecB
HWH78_RS17855-119-3.073196AmiS/UreI family transporter
HWH78_RS17860430-6.211357transcriptional antitermination factor AmiR
HWH78_RS17865531-5.837560aliphatic amidase expression-regulating protein
HWH78_RS17870536-7.372279AAA family ATPase
HWH78_RS17875745-10.168224aliphatic amidase
HWH78_RS178801064-12.589085hypothetical protein
HWH78_RS17885737-5.297974hypothetical protein
HWH78_RS178950172.285854*GNAT family N-acetyltransferase
HWH78_RS179002132.786290hypothetical protein
HWH78_RS179051113.089071hypothetical protein
HWH78_RS179101103.379419hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS17845  RTXTOXIND     68     9e-15    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 68.3 bits (167), Expect = 9e-15
Identities = 27/208 (12%), Positives = 73/208 (35%), Gaps = 24/208 (11%)

Query: 81 RLAVRQAELALEEAERTNRELDAAIASAKADLLAARSSAGELDSEARRTAQLVQRHHVS- 139
+ + Q + +E + A + A + + + S + L+ + ++
Sbjct: 194 QFSTWQNQKYQKELNL--DKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAK 251

Query: 140 ------QQMHEQVSAQAQAARARVAAAQARIGELTARRGTAGED------------NLRL 181
+ + + + + ++++ ++ I + +
Sbjct: 252 HAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNI 311

Query: 182 RQARNALAQARLQLQYSSVRADRAGTLSNLQL-TPGTYVPAGTPVAALV--DDRIDIVAD 238
LA+ + Q S +RA + + L++ T G V + +V DD +++ A
Sbjct: 312 GLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTAL 371

Query: 239 FREKSLRYVRPGDRAAVVFDARPGEVFG 266
+ K + ++ G A + +A P +G
Sbjct: 372 VQNKDIGFINVGQNAIIKVEAFPYTRYG 399



Score = 67.9 bits (166), Expect = 1e-14
Identities = 32/226 (14%), Positives = 70/226 (30%), Gaps = 15/226 (6%)

Query: 1 MTPDQRFARWVQVAI-AVFVLLFVYFLVADLWMPLTPQAQLT--RPVVRVAPRVSGQVAE 57
TP R R V I V+ F+ ++ + + T +LT + P + V E
Sbjct: 50 ETPVSRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKE 109

Query: 58 VLVSNNGHVQPGEVLFRLDPEPFRLAVRQAELALEEAERTNRELDAAIASAKADLLAARS 117
++V V+ G+VL +L + + +L +A S + + L
Sbjct: 110 IIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELK 169

Query: 118 SAGELDSEARRTAQLVQRHHVSQQMHEQVSAQAQAARARVAAAQARIGELTARRGTAGED 177
E + ++++ + ++ Q + +A + AR
Sbjct: 170 LPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENL 229

Query: 178 NLRLRQARNALAQAR------------LQLQYSSVRADRAGTLSNL 211
+ + + + + +Y + S L
Sbjct: 230 SRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQL 275


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS17850  PF07472       149    1e-48    Fucose-binding lectin II
		>PF07472#Fucose-binding lectin II

Length = 245

Score = 149 bits (377), Expect = 1e-48
Identities = 61/113 (53%), Positives = 77/113 (68%), Gaps = 3/113 (2%)

Query: 5 GVFTLPANTRFGVTAFANSSGTQTVNVLV--NNETAATFSGQSTNNAVIGTQVLNSGSSG 62
G+F LP N FGVTA NSS QT+ V V N + AATF G T +A + TQ++NSG G
Sbjct: 134 GIFNLPPNIAFGVTALVNSSAQQTIEVYVDDNPKPAATFQGAGTQDANLNTQIVNSGK-G 192

Query: 63 KVQVQVSVNGRPSDLVSAQVILTNELNFALVGSEDGTDNDYNDAVVVINWPLG 115
KV+V V+ NG+PS + S QV + + F LVGSEDGTD DYND + ++NWPLG
Sbjct: 193 KVRVVVTANGKPSKIGSRQVDIFKKTYFGLVGSEDGTDGDYNDGIAILNWPLG 245


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS17870  HTHFIS        39     3e-05    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 38.7 bits (90), Expect = 3e-05
Identities = 36/177 (20%), Positives = 61/177 (34%), Gaps = 21/177 (11%)

Query: 31 LLQAHLSHRSALHSRFRFDPAAVMDCLRAEVLGQEPALQAVEDMLKVVRADIADPRRPLF 90
++ L+ S+ D M ++G+ A+Q + +L + L
Sbjct: 114 IIGRALAEPKRRPSKLEDDSQDGMP-----LVGRSAAMQEIYRVLARL----MQTDLTL- 163

Query: 91 SALFLGPTGVGKTEIVRALARALHGDAEGFCRVDMNTLSQEHYAAALTGAPPG-YVGA-K 148
+ G +G GK + RAL F ++M + ++ + L G G + GA
Sbjct: 164 --MITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQT 221

Query: 149 EGTTLLEQDKLDGSPGRPGIVLFDELEKASPEVVHALLNVLDNGLLRVASGERTYHF 205
T EQ +G G + DE+ + LL VL G G
Sbjct: 222 RSTGRFEQ--AEG-----GTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRS 271


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS17895  SACTRNSFRASE  56     4e-12    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 56.1 bits (135), Expect = 4e-12
Identities = 21/71 (29%), Positives = 33/71 (46%)

Query: 151 RSAILEDMVVDRHARGQGVGRELIGRAVERARSWGCYKLALSSHQDRETAQRFYAALGFT 210
A++ED+ V + R +GVG L+ +A+E A+ L L + +A FYA F
Sbjct: 88 GYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFI 147

Query: 211 SHGVSLALHLG 221
V L+
Sbjct: 148 IGAVDTMLYSN 158


53  HWH78_RS18020  HWH78_RS18145  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias     Product
HWH78_RS18020  1       18       3.228524    nitrous oxide reductase family maturation
HWH78_RS18025  2       16       3.081907    ABC transporter ATP-binding protein
HWH78_RS18030  2       18       1.392498    ABC transporter permease subunit
HWH78_RS18035  2       18       0.638754    nitrous oxide reductase accessory protein NosL
HWH78_RS18040  1       17       0.491400    ferredoxin-NADP reductase
HWH78_RS18045  1       16       0.798819    LysR family transcriptional regulator
HWH78_RS18050  1       15       0.211889    YceK/YidQ family lipoprotein
HWH78_RS18055  2       15       1.223034    ABC transporter permease
HWH78_RS18060  1       14       2.534741    ABC transporter permease
HWH78_RS18065  1       13       3.791587    efflux RND transporter periplasmic adaptor
HWH78_RS18070  2       14       3.816754    phosphate-starvation-inducible protein PsiE
HWH78_RS18075  2       12       3.333818    DUF3509 domain-containing protein
HWH78_RS18080  2       13       3.991207    TolC family outer membrane protein
HWH78_RS18085  0       12       3.633348    HlyD family type I secretion periplasmic adaptor
HWH78_RS18090  0       12       3.421995    type I secretion system permease/ATPase
HWH78_RS18095  1       12       2.443684    heme acquisition protein HasA
HWH78_RS18100  -1      12       2.112017    heme uptake receptor HasR
HWH78_RS18105  -1      12       3.746664    FecR family protein
HWH78_RS18110  2       14       2.377975    RNA polymerase sigma factor
HWH78_RS18115  -1      13       3.945101    hypothetical protein
HWH78_RS18120  -2      11       3.944754    DUF2790 domain-containing protein
HWH78_RS18125  -2      12       4.285433    YebG family protein
HWH78_RS18130  -1      13       4.779097    hypothetical protein
HWH78_RS18135  -1      11       4.505065    DUF2809 domain-containing protein
HWH78_RS18140  -1      11       4.740574    2-oxo acid dehydrogenase subunit E2
HWH78_RS18145  -2      12       3.787404    alpha-ketoacid dehydrogenase subunit beta
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS18055  ABC2TRNSPORT  28     0.039    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.
Length = 262

Score = 28.4 bits (63), Expect = 0.039
Identities = 27/122 (22%), Positives = 50/122 (40%), Gaps = 1/122 (0%)

Query: 246 LGYRQSASFFMLLGIVLPFLIAVIALSEFIAELLPTEESVYLTMTFITLPLFYMAGYSWP 305
LGY Q S L ++ +A +L + L P+ + T + P+ +++G +P
Sbjct: 139 LGYTQWLSLLYALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFP 198

Query: 306 EQAMPDWVRWLADAIPSTWAIRAIAEMNQMDLPLREVSDHALVLLGMAATYALLGTLLYQ 365
+P + A +P + +I I + + P+ +V H L L T L +
Sbjct: 199 VDQLPIVFQTAARFLPLSHSIDLIRPI-MLGHPVVDVCQHVGALCIYIVIPFFLSTALLR 257

Query: 366 YR 367
R
Sbjct: 258 RR 259


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS18065  RTXTOXIND     56     1e-10    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 55.6 bits (134), Expect = 1e-10
Identities = 26/161 (16%), Positives = 62/161 (38%), Gaps = 17/161 (10%)

Query: 41 IVSSKAKGRVQVLHVRRGDEVKQGDLLISLDSPELEAQLDALHAARNQVQAQLDESLHGT 100
+ V+ + V+ G+ V++GD+L+ L + EA ++ +QA+L+++ +
Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSS--LLQARLEQTRYQI 155

Query: 101 REESIRALKASLAQAEAELRNAESDFQRNQQMVERGFLSRTQFDLSRRERDVARDRVAEA 160
SI K + E ++++ L + QF + ++ + +
Sbjct: 156 LSRSIELNKLPELKLPDEPYFQNV---SEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKK 212

Query: 161 RANLDEGLKGDREERRQALQAAVRRADAQIAELQAQIDDLQ 201
RA + A + R + ++++DD
Sbjct: 213 RAER------------LTVLARINRYENLSRVEKSRLDDFS 241



Score = 51.8 bits (124), Expect = 1e-09
Identities = 29/205 (14%), Positives = 77/205 (37%), Gaps = 24/205 (11%)

Query: 75 LEAQLDALHAARNQVQAQLDESLHGTREESIRALKASLAQAEAELRNAESDFQRNQQMVE 134
++ Q + Q + LD+ + + A + + E R +S ++
Sbjct: 191 IKEQFSTWQNQKYQKELNLDK-----KRAERLTVLARINRYENLSRVEKSRLDDFSSLLH 245

Query: 135 RGFLSRTQFDLSRRERDVARDRVAEARANLDE------GLKGDREERRQALQAAV----R 184
+ +++ + A + + ++ L++ K + + Q + + R
Sbjct: 246 KQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLR 305

Query: 185 RADAQIAELQAQI----DDLQ---VRAPVNGEVGPIPA-EQGELINAYSPLLTLVRLDDS 236
+ I L ++ + Q +RAPV+ +V + +G ++ L+ +V DD+
Sbjct: 306 QTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDT 365

Query: 237 YFV-FNLREDILAKVRKGDRIVMQV 260
V ++ + + G +++V
Sbjct: 366 LEVTALVQNKDIGFINVGQNAIIKV 390


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS18080  RTXTOXIND     32     0.006    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 31.7 bits (72), Expect = 0.006
Identities = 23/170 (13%), Positives = 47/170 (27%), Gaps = 9/170 (5%)

Query: 60 LPSLRYDYNKARNDSTVSQGDARVERDYRSYASTLSLEQPLFDYEAYARYRQ-GEAQAL- 117
L +L + + + S++ Q R S + P ++ E + L
Sbjct: 127 LTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLR 186

Query: 118 ---FADEQFRGRSQELA---VRLFAAYSETLFAREQVLLAEAQRRALETQLAFNQRAFEE 171
EQF + + L +E L ++ E R +++L +
Sbjct: 187 LTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHK 246

Query: 172 GEGTRTDLLET-RARLSLTRAEEIAAGDRAAAARRTLEAMLGQALEDREL 220
+ +LE + + L A L +
Sbjct: 247 QAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLF 296


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS18085  RTXTOXIND     416    e-145    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 416 bits (1072), Expect = e-145
Identities = 96/435 (22%), Positives = 170/435 (39%), Gaps = 8/435 (1%)

Query: 15 AALELDEK---RFSRLGWGLVLLGFVGFLLWAGLAPLDKGVGVSGTVMVAGSRKAVQHPT 71
A LEL E R RL ++ V + + L ++ +G + +G K ++
Sbjct: 44 AHLELIETPVSRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIE 103

Query: 72 GGLVRHIRVHEGERVEAGQVLLEMDATQARAQADGLFAQYLAALASLARLSAERDEKARI 131
+V+ I V EGE V G VLL++ A A A + L A R
Sbjct: 104 NSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELN 163

Query: 132 EFPAELLALDDPRLPTLLEQQ----RQLHDSRRRALRLELDGLAETVAGSQAQLDGLQAA 187
+ P EL D+P + E++ L + + + + +A+ + A
Sbjct: 164 KLP-ELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLAR 222

Query: 188 LRSKEQQRAALEEQLRGLRQLASEGYVPRNRLLDSERLLAQVNGEIAGDLGSLGSTRRQI 247
+ E + +L L + + ++ +L+ E + E+ L +I
Sbjct: 223 INRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEI 282

Query: 248 LELRLRMAQRREKFQEEVRGSLADAQVRAEELRNRLASARFDLANSEVRAPVAGLVVGQE 307
L + + F+ E+ L L LA S +RAPV+ V +
Sbjct: 283 LSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLK 342

Query: 308 VFTEGGVIAPGQQLMEILPERQPLLVDARLPVEMVDKVRVGLPVELMFSAFSQSTTPRVE 367
V TEGGV+ + LM I+PE L V A + + + + VG + AF + +
Sbjct: 343 VHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLV 402

Query: 368 GEVTLVSADRLLDERSEAPYYRVRIRVGEEGVRRLAGLEIRPGMPVEAFVRSGERSLLNY 427
G+V ++ D + D+R + + + + GM V A +++G RS+++Y
Sbjct: 403 GKVKNINLDAIEDQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVISY 462

Query: 428 LFKPLADRTHLALGE 442
L PL + +L E
Sbjct: 463 LLSPLEESVTESLRE 477


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS18095  PF06438       276    1e-97    Heme acquisition protein HasAp
		>PF06438#Heme acquisition protein HasAp

Length = 205

Score = 276 bits (706), Expect = 1e-97
Identities = 204/205 (99%), Positives = 205/205 (100%)

Query: 1 MSISISYSTTYSGWTVADYLADWSAYFGDVNHRPGQVVDGSNTGGFNPGPFDGSQYALKS 60
MSISISYSTTYSGWTVADYLADWSAYFGDVNHRPGQVVDGSNTGGFNPGPFDGSQYALKS
Sbjct: 1 MSISISYSTTYSGWTVADYLADWSAYFGDVNHRPGQVVDGSNTGGFNPGPFDGSQYALKS 60

Query: 61 TASDAAFIAGGDLHYTLFSNPSHTLWGKLDSIALGDTLTGGASSGGYALDSQEVSFSNLG 120
TASDAAFIAGGDLHYTLFSNPSHTLWGKLDSIALGDTLTGGASSGGYALDSQEVSFSNLG
Sbjct: 61 TASDAAFIAGGDLHYTLFSNPSHTLWGKLDSIALGDTLTGGASSGGYALDSQEVSFSNLG 120

Query: 121 LDSPIAQGRDGTVHKVVYGLMSGDSSALQGQIDALLKAVDPSLSINSTFDQLAAAGVAHA 180
LDSPIAQGRDGTVHKVVYGLMSGDSSALQGQIDALLKAVDPSLSINSTFDQLAAAGVAHA
Sbjct: 121 LDSPIAQGRDGTVHKVVYGLMSGDSSALQGQIDALLKAVDPSLSINSTFDQLAAAGVAHA 180

Query: 181 TPAAAAAEIGVVGVQELPHDLALAA 205
TPAAAAAE+GVVGVQELPHDLALAA
Sbjct: 181 TPAAAAAEVGVVGVQELPHDLALAA 205


54  HWH78_RS18785  HWH78_RS18885  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias     Product
HWH78_RS18785  2       19       0.288953    mannuronan 5-epimerase AlgG
HWH78_RS18790  2       19       0.283068    alginate biosynthesis protein AlgX
HWH78_RS18795  0       19       0.760208    alginate biosynthesis protein AlgL
HWH78_RS18800  -1      19       1.055021    MBOAT family protein
HWH78_RS18805  -1      17       1.854900    alginate O-acetylase
HWH78_RS18810  -1      18       2.420309    alginate O-acetyltransferase AlgF
HWH78_RS18815  0       17       2.637268    mannose-1-phosphate
HWH78_RS18820  0       16       3.377316    UDP-4-amino-4-deoxy-L-arabinose
HWH78_RS18825  2       15       3.598243    undecaprenyl-phosphate
HWH78_RS18830  1       14       3.006500    bifunctional UDP-4-amino-4-deoxy-L-arabinose
HWH78_RS18835  2       13       3.920511    4-deoxy-4-formamido-L-arabinose-
HWH78_RS18840  2       12       3.686355    lipid IV(A)
HWH78_RS18845  0       11       4.615065    EamA family transporter
HWH78_RS18850  0       11       4.427210    4-amino-4-deoxy-L-arabinose-phosphoundecaprenol
HWH78_RS18855  -1      11       4.515769    UDP-glucose 6-dehydrogenase
HWH78_RS18860  0       12       5.254398    PTS fructose-like transporter subunit IIB
HWH78_RS18865  1       13       5.167911    1-phosphofructokinase
HWH78_RS18870  0       13       4.968933    phosphoenolpyruvate--protein phosphotransferase
HWH78_RS18875  -1      13       3.587333    catabolite repressor/activator
HWH78_RS18880  -1      13       3.542734    TatD family hydrolase
HWH78_RS18885  -2      11       3.238684    barstar family protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS18830  NUCEPIMERASE  109    2e-28    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 109 bits (275), Expect = 2e-28
Identities = 79/362 (21%), Positives = 138/362 (38%), Gaps = 61/362 (16%)

Query: 319 RVLILGVNGFIGNHLSERLLRDGRYEVHGMDIGSDAIE-RLK-------ADPHFHFVEGD 370
+ L+ G GFIG H+S+RLL G ++V G+D +D + LK A P F F + D
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAG-HQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 371 IGIHSEWLE--YHVKKCDVVLPLVAIATPIEYT-RNPLRVFELDFEENLRIVRYCVKYG- 426
+ E + + + V + Y+ NP + + L I+ C
Sbjct: 61 LADR-EGMTDLFASGHFERVFISPHRLA-VRYSLENPHAYADSNLTGFLNILEGCRHNKI 118

Query: 427 KRVVFPSTSEVYGMCQDPDFDEDRSNLVVGPINKQRWIYSVSKQLLDRVIWAYGQ-QGLR 485
+ +++ S+S VYG+ + F D S V P++ +Y+ +K+ + + Y GL
Sbjct: 119 QHLLYASSSSVYGLNRKMPFSTDDS--VDHPVS----LYAATKKANELMAHTYSHLYGLP 172

Query: 486 FTLFRPFNWMGPRLDRLDSARIGSSRAITQLILHLVEGTPIRLVDGGAQKRCFTDVDDGI 545
T R F GP R D A ++A+ +EG I + + G KR FT +DD
Sbjct: 173 ATGLRFFTVYGPW-GRPDMALFKFTKAM-------LEGKSIDVYNYGKMKRDFTYIDDIA 224

Query: 546 EALARIIDN---------------RDGRCDGQIVNIGNPDNEASIRQLGEELLRQFEAHP 590
EA+ R+ D ++ NIGN + + L
Sbjct: 225 EAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVE-LMDYIQALEDALGIEA 283

Query: 591 LRAQFPPFAGFREVESRSFYGDGYQDVAHRKPSIDNARRLLDWQPTIELRETIGKTLDFF 650
+ P G DV ++ + P +++ + ++++
Sbjct: 284 KKNMLPLQPG---------------DVLETSADTKALYEVIGFTPETTVKDGVKNFVNWY 328

Query: 651 LH 652

Sbjct: 329 RD 330


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS18870  PHPHTRNFRASE  609    0.0      Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.
Length = 572

Score = 609 bits (1573), Expect = 0.0
Identities = 219/565 (38%), Positives = 340/565 (60%), Gaps = 13/565 (2%)

Query: 401 ERLQAIAASPGIASGPAHVQVAQRFEFQPR-GESPAHERERLLRAKRAVDEEIVGLVERS 459
++ IAAS G+A A + + + + + E E+L A EE+ + +++
Sbjct: 3 HKITGIAASSGVAIAKAFIHLEPNVDIEKTSITDVSTEIEKLTAALEKSKEELRAIKDQT 62

Query: 460 TVKA---IREIFVTHREMLDDPELAEQVQLRL-NRGESAEAAWSRVVEDSAAQQEALHDA 515
EIF H +LDDPEL + ++ ++ N +AE A V + + E++ +
Sbjct: 63 EASMGADKAEIFAAHLLVLDDPELVDGIKGKIENEQMNAEYALKEVSDMFVSMFESMDNE 122

Query: 516 LLAERAADLRDLGRRVLARLCGVEAPREPE--QPYILVMDEVGPSDVARLDAQRVAGILT 573
+ ERAAD+RD+ +RVL L GVE + +++ +++ PSD A+L+ Q V G T
Sbjct: 123 YMKERAADIRDVSKRVLGHLIGVETGSLATIAEETVIIAEDLTPSDTAQLNKQFVKGFAT 182

Query: 574 ARGGATSHSAIIARALGIPALVGAGAAVLGLEPGTALLLDGEHGWLQVAPSTEQLQQAAA 633
GG TSHSAI++R+L IPA+VG ++ G +++DG G + V P+ E+++
Sbjct: 183 DIGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVNPTEEEVKAYEE 242

Query: 634 ERDARQQRQARADAQRLEPARTRDGHAVEVCANLGDTAGAARAVELGAEGVGLLRTEFVF 693
+R A ++++ EP+ T+DG VE+ AN+G + G EG+GL RTEF++
Sbjct: 243 KRAAFEKQKQEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTEFLY 302

Query: 694 MNNARAPDLATQEAEYRRVLDALDGRPLVARTLDVGGDKPLPYWPIPHEENPYLGLRGIR 753
M+ + P Q Y+ V+ +DG+P+V RTLD+GGDK L Y +P E NP+LG R IR
Sbjct: 303 MDRDQLPTEEEQFEAYKEVVQRMDGKPVVIRTLDIGGDKELSYLQLPKELNPFLGFRAIR 362

Query: 754 LTLQRPQILETQLRALFRAAGERPLRVMFPMVGSLDEWRQARDLALRLREEI------PL 807
L L++ I TQLRAL RA+ L+VMFPM+ +L+E RQA+ + ++++
Sbjct: 363 LCLEKQDIFRTQLRALLRASTYGNLKVMFPMIATLEELRQAKAIMQEEKDKLLSEGVDVS 422

Query: 808 ADLQLGIMVEVPSAALLAPVLAREVDFFSVGTNDLTQYTLAIDRGHPSLSAQADGLHPAV 867
+++GIMVE+PS A+ A + A+EVDFFS+GTNDL QYT+A DR + +S HPA+
Sbjct: 423 DSIEVGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAADRMNERVSYLYQPYHPAI 482

Query: 868 LQLIDMTVRAAHAEGKWVGVCGELAADPLALPLLVGLGVDELSVSARSIALVKAGVRELQ 927
L+L+DM ++AAH+EGKWVG+CGE+A D +A+PLL+GLG+DE S+SA SI ++ + +L
Sbjct: 483 LRLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLDEFSMSATSILPARSQLLKLS 542

Query: 928 LVAARGLARKALGLASAAEVRALVE 952
+ A+KAL L +A EV LV+
Sbjct: 543 KEELKPFAQKALMLDTAEEVEQLVK 567


55  HWH78_RS19265  HWH78_RS19395  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias     Product
HWH78_RS19265  2       10       -1.573497   acetyl-CoA carboxylase carboxyl transferase
HWH78_RS19270  2       10       -1.788019   DNA polymerase III subunit alpha
HWH78_RS19275  3       11       -1.523272   alanine:cation symporter family protein
HWH78_RS19280  3       12       -2.046547   ribonuclease HII
HWH78_RS19285  0       11       -2.959612   lipid-A-disaccharide synthase
HWH78_RS19290  1       10       -3.367341   acyl-ACP--UDP-N-acetylglucosamine
HWH78_RS19295  0       8        -2.606316   3-hydroxyacyl-ACP dehydratase FabZ
HWH78_RS19300  -1      8        -1.749904   UDP-3-O-(3-hydroxymyristoyl)glucosamine
HWH78_RS19305  -1      8        -1.640268   OmpH family outer membrane protein
HWH78_RS19310  -2      9        -1.402173   outer membrane protein assembly factor BamA
HWH78_RS19315  0       11       -0.496056   sigma E protease regulator RseP
HWH78_RS19320  1       10       -1.030047   1-deoxy-D-xylulose-5-phosphate reductoisomerase
HWH78_RS19325  0       14       -2.516200   phosphatidate cytidylyltransferase
HWH78_RS19330  2       16       -3.701864   di-trans,poly-cis-decaprenylcistransferase
HWH78_RS19335  0       14       -3.439657   ribosome recycling factor
HWH78_RS19340  0       14       -2.413826   UMP kinase
HWH78_RS19345  -1      12       -1.753693   translation elongation factor Ts
HWH78_RS19350  0       11       -1.314516   30S ribosomal protein S2
HWH78_RS19355  0       11       -0.889128   type I methionyl aminopeptidase
HWH78_RS19360  0       10       -0.330674   [protein-PII] uridylyltransferase
HWH78_RS19365  -2      9        0.633859    succinyldiaminopimelate transaminase
HWH78_RS19370  2       10       0.628480    Na+/H+ antiporter
HWH78_RS19375  1       13       0.581373    hypothetical protein
HWH78_RS19380  1       13       1.745559    hypothetical protein
HWH78_RS19385  3       15       2.164809    YkgJ family cysteine cluster protein
HWH78_RS19390  3       12       2.366857    ArsC family reductase
HWH78_RS19395  2       7        1.978799    LysE family translocator
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS19340  CARBMTKINASE  37     3e-05    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 37.5 bits (87), Expect = 3e-05
Identities = 17/79 (21%), Positives = 28/79 (35%), Gaps = 15/79 (18%)

Query: 132 GEVVIFSAGTGNPFFTT-------------DSAACLRAIEIDADVVLKATKVDGVYTADP 178
G +VI S G G P D A A E++AD+ + T V+G
Sbjct: 186 GVIVIASGGGGVPVILEDGEIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVNGAALY-- 243

Query: 179 FKDPNAEKFERLTYDEVLD 197
+ + + +E+
Sbjct: 244 YGTEKEQWLREVKVEELRK 262


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS19360  YERSSTKINASE  32     0.014    Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 32.0 bits (72), Expect = 0.014
Identities = 22/89 (24%), Positives = 43/89 (48%), Gaps = 1/89 (1%)

Query: 63 ILQQAWQRFDWGDDADIALVAVGGYGRGELHPYSDVDLLILLDSEDQESFREPIEGFLTL 122
I++ + QR D + +G R H + +++L+ L + Q E GFL
Sbjct: 538 IVEPSLQRIQKHLDQTHSFSDIGSLVRAHKHLETLLEVLVTLSQQGQPVSSETY-GFLNR 596

Query: 123 LWDIGLEVGQSVRSVQQCAEEARADLTVI 151
L + + + Q + ++QQ E A+A L+++
Sbjct: 597 LTEAKITLSQQLNTLQQQQESAKAQLSIL 625


56  HWH78_RS19925  HWH78_RS20065  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias     Product
HWH78_RS19925  3       28       -4.706047   multicopper oxidase family protein
HWH78_RS19930  3       24       -4.807396   NADP-dependent oxidoreductase
HWH78_RS19935  3       18       -4.314610   AraC family transcriptional regulator
HWH78_RS19940  2       16       -3.968468   MFS transporter
HWH78_RS19945  1       12       -3.311462   hypothetical protein
HWH78_RS19950  0       12       -2.522497   glutamine-hydrolyzing GMP synthase
HWH78_RS19955  -1      15       1.354790    IMP dehydrogenase
HWH78_RS19960  -2      12       1.928672    LuxR C-terminal-related transcriptional
HWH78_RS19965  0       11       2.659886    transporter
HWH78_RS19970  1       11       3.413385    MFS transporter
HWH78_RS19975  0       10       3.835998    class II histone deacetylase
HWH78_RS19980  0       11       3.246632    sulfite exporter TauE/SafE family protein
HWH78_RS19985  1       11       3.158961    LysR family transcriptional regulator
HWH78_RS19990  0       12       2.516632    exodeoxyribonuclease VII large subunit
HWH78_RS19995  0       12       2.528887    LysR family transcriptional regulator
HWH78_RS20000  2       12       2.361392    TRAP transporter substrate-binding protein DctP
HWH78_RS20005  0       11       1.998690    TRAP transporter small permease
HWH78_RS20010  2       10       2.007787    TRAP transporter large permease
HWH78_RS20015  1       11       1.865906    M15 family metallopeptidase
HWH78_RS20020  2       11       2.942625    helix-turn-helix domain-containing protein
HWH78_RS20025  2       11       2.101579    cysteine hydrolase
HWH78_RS20030  2       14       1.703815    putative natural product biosynthesis protein
HWH78_RS20035  3       9        2.203329    copper chaperone PCu(A)C
HWH78_RS20040  -1      9        1.598393    DUF2946 family protein
HWH78_RS20045  -3      9        1.531108    M23 family metallopeptidase
HWH78_RS20050  -2      10       0.325509    DUF4345 domain-containing protein
HWH78_RS20055  -1      9        0.183775    PepSY domain-containing protein
HWH78_RS20060  -1      9        -0.228441   TonB-dependent copper receptor
HWH78_RS20065  2       10       -1.127844   DUF2946 domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS19940  TCRTETB       36     2e-04    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 36.4 bits (84), Expect = 2e-04
Identities = 32/149 (21%), Positives = 61/149 (40%), Gaps = 11/149 (7%)

Query: 12 ISGMFFMLGLCFGSLSSRMATIKDGLQLSDGVFGSA-LFAMSAGVVLSLPVSGWMIAKLG 70
+ G + G +S +KD QLS GS +F + V++ + G ++ + G
Sbjct: 263 LCGGIIFGTVA-GFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRG 321

Query: 71 SRNV---GVTAILANAVLLSLVPLASSVYQLAALLFVSGF-SYSAVNVSNNTQASLSEAL 126
V GVT + + + S + +S + ++FV G S++ +S +SL +
Sbjct: 322 PLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQE 381

Query: 127 IGKTELPFFHGIWGLAGFVGAGFGALMIG 155
G + F+ G G ++G
Sbjct: 382 AGAGM-----SLLNFTSFLSEGTGIAIVG 405



Score = 29.8 bits (67), Expect = 0.019
Identities = 22/127 (17%), Positives = 48/127 (37%), Gaps = 2/127 (1%)

Query: 245 FMGAMTVGRLLLNRVADRFGTRSTLQWSGGLALIG-MVTTIAYPSLLASIIGFCLVGLGI 303
FM ++G + +++D+ G + L + + G ++ + + I+ + G G
Sbjct: 58 FMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGA 117

Query: 304 CTVIPLVAGAAARSSSMAPSS-AIAAVLTIGFLGTLIGPPLIGFLSEAFGLRYAFGACVV 362
LV AR A + +I +G +GP + G ++ Y ++
Sbjct: 118 AAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMI 177

Query: 363 LAIGIIL 369
I +
Sbjct: 178 TIITVPF 184


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS20015  RTXTOXINA     33     0.004    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.
Length = 1024

Score = 32.6 bits (74), Expect = 0.004
Identities = 25/94 (26%), Positives = 38/94 (40%), Gaps = 12/94 (12%)

Query: 161 SSLTNTSVGELFLAGVIPGLL--LAAAFMLLNAVYAYRNGLQARHAAPAWGEILAALSGA 218
+L N G + G+L ++A+F+L N + AA A E+ + G
Sbjct: 233 PNLDNIGAG----LDTVSGILSAISASFILSN-----ADADTRTKAA-AGVELTTKVLGN 282

Query: 219 LTALIAPVIIVAGIVLGLVTPTESGALIALYVAL 252
+ I+ II GL T + LIA V L
Sbjct: 283 VGKGISQYIIAQRAAQGLSTSAAAAGLIASAVTL 316


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS20030  ISCHRISMTASE  30     0.004    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 30.4 bits (68), Expect = 0.004
Identities = 25/124 (20%), Positives = 40/124 (32%), Gaps = 11/124 (8%)

Query: 5 QPKRALLVIDVQNEYVSGNLRIEFPAIQSSLERIGAAMDAAHAAGIPIVVVQHLA---PA 61
P RA+L+I Y + I + GIP+V P
Sbjct: 27 DPNRAVLLIHDMQNYFVDAFTAGASPVTELSANIRKLKNQCVQLGIPVVYTAQPGSQNPD 86

Query: 62 D--------SPLFARGSRQAELHEVVASRPYQHKVEKQLASSFVGTGLADWLRERDIDTL 113
D P G + ++ +A + K S+F T L + +R+ D L
Sbjct: 87 DRALLTDFWGPGLNSGPYEEKIITELAPEDDDLVLTKWRYSAFKRTNLLEMMRKEGRDQL 146

Query: 114 AVVG 117
+ G
Sbjct: 147 IITG 150


57  HWH78_RS20130  HWH78_RS20255  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias     Product
HWH78_RS20130  2       16       -2.506064   flavodoxin-dependent
HWH78_RS20135  1       17       -2.923064   helix-turn-helix domain-containing protein
HWH78_RS20140  1       16       -2.530967   type 4a pilus biogenesis protein PilF
HWH78_RS20145  2       17       -2.174076   23S rRNA (adenine(2503)-C(2))-methyltransferase
HWH78_RS20150  2       16       -1.592586   nucleoside-diphosphate kinase
HWH78_RS20155  1       13       -1.502017   Fe-S cluster assembly protein IscX
HWH78_RS20160  1       13       -1.565029   ISC system 2Fe-2S type ferredoxin
HWH78_RS20165  1       14       -1.473976   Fe-S protein assembly chaperone HscA
HWH78_RS20170  1       18       -2.463853   co-chaperone HscB
HWH78_RS20175  1       16       -2.585559   iron-sulfur cluster assembly protein IscA
HWH78_RS20180  1       15       -2.055544   Fe-S cluster assembly scaffold IscU
HWH78_RS20185  1       14       -1.551442   IscS subfamily cysteine desulfurase
HWH78_RS20190  2       13       -1.655744   Fe-S cluster assembly transcriptional regulator
HWH78_RS20195  2       12       -1.612134   serine O-acetyltransferase
HWH78_RS20200  0       13       -2.049090   tRNA
HWH78_RS20205  1       14       -2.084485   inositol-phosphate phosphatase
HWH78_RS20210  1       11       -1.660887   glycine zipper 2TM domain-containing protein
HWH78_RS20215  0       9        -3.157719   protein translocase subunit SecF
HWH78_RS20220  1       20       -5.941568   protein translocase subunit SecD
HWH78_RS20225  3       40       -9.203350   preprotein translocase subunit YajC
HWH78_RS20230  3       41       -9.293050   tRNA guanosine(34) transglycosylase Tgt
HWH78_RS20235  1       42       -8.486766   tRNA preQ1(34) S-adenosylmethionine
HWH78_RS20245  1       45       -9.966486   *hypothetical protein
HWH78_RS20250  -1      34       -7.726544   lectin MOA-related protein
HWH78_RS20255  -3      22       -4.728490   cold shock domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS20135  IGASERPTASE   32     0.003    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 32.3 bits (73), Expect = 0.003
Identities = 29/144 (20%), Positives = 44/144 (30%), Gaps = 9/144 (6%)

Query: 141 RTPETAGLSLEHVEVESADGTTEIHTLDEPEDQAVIEAQKEGEQAPAEVSP------EVA 194
+T E A E E ++ + E + + E +E + ++VSP V
Sbjct: 1081 QTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQ 1140

Query: 195 AQAEGAPPVASEGAQVLAAPAQGTPAAPQQPVTAAPAGTAPAPAPGTAPAVPATASPAAP 254
QAE A T A +QP + + V S
Sbjct: 1141 PQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSS--NVEQPVTESTTVNTGNSVVEN 1198

Query: 255 PAPSEPAAA-PVVAGEGQGVVKVQ 277
P + PA P V E K +
Sbjct: 1199 PENTTPATTQPTVNSESSNKPKNR 1222


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS20165  SHAPEPROTEIN  107    2e-27    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.
Length = 347

Score = 107 bits (269), Expect = 2e-27
Identities = 79/363 (21%), Positives = 139/363 (38%), Gaps = 56/363 (15%)

Query: 22 VGIDLGTTNSLVAAVRSGVAEPLPDAQGRLILPSAVRYHAERAEVGESARAAAAEDPFNT 81
+ IDLGT N+L+ G+ L PS V +RA +S A +
Sbjct: 13 LSIDLGTANTLIYVKGQGIV---------LNEPSVVAIRQDRAGSPKSVAAVGHD----- 58

Query: 82 VISVKRLMGRGLEDVKQLGEQLPYRFRQGESHMPFIETVQGLKSPV----EVSADILRE- 136
K+++GR + I ++ +K V V+ +L+
Sbjct: 59 ---AKQMLGRTPGN---------------------IAAIRPMKDGVIADFFVTEKMLQHF 94

Query: 137 LRQRAETTLGGELVGAVITVPAYFDDAQRQATKDAARLAGLNVLRLLNEPTAAAVAYGLD 196
++Q + ++ VP +R+A +++A+ AG + L+ EP AAA+ GL
Sbjct: 95 IKQVHSNSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLP 154

Query: 197 KGAEGLVAIYDLGGGTFDISILRLTRGVFEVLATGGDTALGGDDFDHAIAGWVIEQAGLS 256
+ D+GGGT +++++ L V +GGD FD AI +V G
Sbjct: 155 VSEATGSMVVDIGGGTTEVAVISLNGVV-----YSSSVRIGGDRFDEAIINYVRRNYGSL 209

Query: 257 ADLDPGSQRQLLQIACAAKERLTDEASVR---VAYG-DWSGELSRATLDELIEPFVARSL 312
+ ++R +I A E VR +A G L+ + E ++ + +
Sbjct: 210 IG-EATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVPRGFTLNSNEILEALQEPLTGIV 268

Query: 313 KSCRRAVRDSGVDLEEI---RSVVMVGGSTRVPRVRTAVGELFGCEPLTDIDPDQVVAIG 369
+ A+ +L R +V+ GG + + + E G + DP VA G
Sbjct: 269 SAVMVALEQCPPELASDISERGMVLTGGGALLRNLDRLLMEETGIPVVVAEDPLTCVARG 328

Query: 370 AAI 372

Sbjct: 329 GGK 331


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS20170  CHANLCOLICIN  27     0.045    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 27.0 bits (59), Expect = 0.045
Identities = 20/76 (26%), Positives = 30/76 (39%), Gaps = 5/76 (6%)

Query: 103 ELEELQDSADLAGVATFKRRLKAAQAELEREFAACWDDA-----QRREEAERLVRRMQFL 157
EL + + A A +R A E R+ A + A QRR+E ER +
Sbjct: 111 SATELAHANNAAMQAEDERLRLAKAEEKARKEAEAAEKAFQEAEQRRKEIEREKAETERQ 170

Query: 158 DKLAQEVRQLEERLDD 173
KLA+ + L +
Sbjct: 171 LKLAEAEEKRLAALSE 186


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS20200  PHAGEIV       29     0.021    Gene IV protein signature.
		>PHAGEIV#Gene IV protein signature.

Length = 426

Score = 28.7 bits (64), Expect = 0.021
Identities = 12/57 (21%), Positives = 30/57 (52%), Gaps = 10/57 (17%)

Query: 45 DAVARASGATDILDAARVVDTLEEALSGCSVVLG------TSARDRRIPW----PLL 91
D+++ ++ A+D++ R + T G +++LG +++D +P+ PL+
Sbjct: 342 DSLSSSTQASDVITNQRSIATTVNLRDGQTLLLGGLTDYKNTSQDSGVPFLSKIPLI 398


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS20215  SECFTRNLCASE  303    e-105    Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 303 bits (778), Expect = e-105
Identities = 101/299 (33%), Positives = 163/299 (54%), Gaps = 20/299 (6%)

Query: 8 INFMGIRNVAFAVTLILTVIALGSWFTKGINFGLDFTGGTLIELTYEQPADLGKVRGQLV 67
+F + F +++ + ++ G+NFG+DF GGT I D+G R L
Sbjct: 14 FDFFRWQWATFGAAIVMMIASVILPLVIGLNFGIDFKGGTTIRTESTTAIDVGVYRAALE 73

Query: 68 GAGYEDAVVQSFGDAR------DVLVRMPSED------------PELGKKVATALQQADA 109
D ++ D ++R+ ++ EL KV TAL D
Sbjct: 74 PLELGDVIISEVRDPSFREDQHVAMIRIQMQEDGQGAEGQGAQGQELVNKVETALTAVDP 133

Query: 110 GNPANLKRVEYVGPQVGEELRDQGGLGMLLALGGILLYVGFRFQWKFALGAILSLVHDAI 169
+ E VGP+V EL +L A I+ Y+ RF+W+FALGA+++LVHD +
Sbjct: 134 A--LKITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVVALVHDVL 191

Query: 170 IVMGVLSFFQVTFDLTVLAAVLAVVGYSLNDTIVIFDRVRENFRVLRKADLVENLNISTS 229
+ +G+ + Q+ FDLT +AA+L + GYS+NDT+V+FDR+REN + L + +N+S +
Sbjct: 192 LTVGLFAVLQLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRDVMNLSVN 251

Query: 230 QTLLRTIATSVSTLLAIAALLFFGGDNLFGFSIALFVGVMAGTYSSIYIANVVLIWLNL 288
+TL RT+ T ++TLLA+ +L +GGD + GF A+ GV GTYSS+Y+A +++++ L
Sbjct: 252 ETLSRTVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAKNIVLFIGL 310


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS20220  SECFTRNLCASE  81     1e-18    Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 81.0 bits (200), Expect = 1e-18
Identities = 41/180 (22%), Positives = 85/180 (47%), Gaps = 15/180 (8%)

Query: 446 TIGPSLGADNIAKGIDASLWGMLFVSLFIIVIY---RF---FGVIATVALAFNMVMLVAL 499
++GP + + + + + L + +I+ Y RF F + A VAL ++++ V L
Sbjct: 142 SVGPKVSGELVWTAVWS-----LLAATVVIMFYIWVRFEWQFALGAVVALVHDVLLTVGL 196

Query: 500 MSILGATLTLPGIAGIVLTMGMAVDANVLIFSRIREEL--ANGMSVQRAIHEGFNRAFTA 557
++L L +A ++ G +++ V++F R+RE L M ++ ++ N +
Sbjct: 197 FAVLQLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRDVMNLSVNETLSR 256

Query: 558 ILDANLTSLLVGGILYAMGTGPVKGFAVTMSLGIITSMFTAIMVTRAMVNLIFGGRDFKK 617
+ +T+LL + G ++GF M G+ T ++++ V A ++F G D K
Sbjct: 257 TVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYV--AKNIVLFIGLDRNK 314


58  HWH78_RS20300  HWH78_RS20360  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
HWH78_RS20300  2  16  -4.655915  hypothetical protein
HWH78_RS20305  1  16  -5.082993  valine--tRNA ligase
HWH78_RS20310  0  15  -4.801870  hypothetical protein
HWH78_RS20315  1  22  -6.604090  hypothetical protein
HWH78_RS20320  0  17  -4.275493  hypothetical protein
HWH78_RS20325  0  8  -1.110743  hypothetical protein
HWH78_RS20330  0  9  0.254811  ABC transporter substrate-binding protein
HWH78_RS20335  0  7  -0.146647  ABC transporter permease
HWH78_RS20340  1  8  -0.207858  ABC transporter ATP-binding protein
HWH78_RS20345  1  9  -0.142042  SLC13 family permease
HWH78_RS20350  0  12  -0.482496  23S rRNA (adenine(1618)-N(6))-methyltransferase
HWH78_RS20355  1  13  -0.038272  T3SS effector bifunctional cytotoxin exoenzyme
HWH78_RS20360  2  13  0.237458  CesT family type III secretion system chaperone
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS20345  BINARYTOXINB  30  0.009  Binary toxin B family signature.
		>BINARYTOXINB#Binary toxin B family signature.

Length = 764

Score = 30.4 bits (68), Expect = 0.009
Identities = 23/107 (21%), Positives = 41/107 (38%), Gaps = 7/107 (6%)

Query: 3 SAKNLKITFNPGTPIETRALRGLSLDIPAGQFVTVIGSNGAGKSTFLNAVSGDLP-IDS- 60
S+ + + +N +E L D G T NG + + S LP I
Sbjct: 457 SSTPITMNYNQFLELEKTKQLRLDTDQVYGNIATYNFENGRVRVDTGSNWSEVLPQIQET 516

Query: 61 -GQILIDDEDVTRKPVWARANRVARVFQDPMAGTCEDLTIEENMALA 106
+I+ + +D+ A DP+ T D+T++E + +A
Sbjct: 517 TARIIFNGKDLN----LVERRIAAVNPSDPLETTKPDMTLKEALKIA 559


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS20360  YERSINIAYOPE  217  1e-70  Yersinia virulence determinant YopE protein signature.
		>YERSINIAYOPE#Yersinia virulence determinant YopE protein signature.

Length = 219

Score = 217 bits (553), Expect = 1e-70
Identities = 55/220 (25%), Positives = 99/220 (45%), Gaps = 23/220 (10%)

Query: 9 SPSFAVELHQAASGRLGQIEARQVATPSE---AQQLAQRQDAPKGEGLLARLGAALVRPF 65
S S + + S +G++ R V+ + A LA R ++P+G L +R+ L
Sbjct: 8 STSLPLPTSVSGSSSVGEMSGRSVSQQTSDQYANNLAGRTESPQGSSLASRIIERLSSVA 67

Query: 66 VAIMDWLGKLL--GSHA---RTGPQPSQDAQPAVMSSAVVFKQMVLQQALPMTLKGLDKA 120
+++ ++ ++ GSH P P+Q P S ++ + + + LP ++
Sbjct: 68 HSVIGFIQRMFSEGSHKPVVTPAPTPAQMPSPTSFSDSI---KQLAAETLPKYMQ----- 119

Query: 121 SELATLTPEGLAREHSRLASGDGALRSLSTALAGIRAGSQVEESRIQAGRLLERSIGGIA 180
+L +L E L + H + A+G G LR T G+ E + +A +L + GI
Sbjct: 120 -QLNSLDAEMLQKNHDQFATGSGPLRGSITQCQGLMQFCG-GELQAEASAILNTPVCGIP 177

Query: 181 LQQWGTTGGAASQLV-----LDASPELRREITDQLHQVMS 215
QWGT GGAAS V L + + + Q+ +++S
Sbjct: 178 FSQWGTIGGAASAYVASGVDLTQAANEIKGLAQQMQKLLS 217


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS20365  SYCECHAPRONE  170  2e-58  Gram-negative bacterial type III secretion SycE cha...
		>SYCECHAPRONE#Gram-negative bacterial type III secretion SycE chaperone signature.

Length = 130

Score = 170 bits (432), Expect = 2e-58
Identities = 51/115 (44%), Positives = 65/115 (56%), Gaps = 3/115 (2%)

Query: 5 YRAAIHQLFLALDLPTPNDEESVLSLQVGPHLCHLAEHPTDHLLMFT--RLEGQGDA-TA 61
+ AI QLF L L P+ E V+ ++VG CH+ EHP +LMFT L+ + T
Sbjct: 4 FEQAITQLFQQLSLSIPDTIEPVIGVKVGEFACHITEHPVGQILMFTLPSLDNNDEKETL 63

Query: 62 SEQNLFSQDPCKPILGRDPESGERLLWNRQPLQLLDRAQIHHQLEQLVAAAEELR 116
N+FSQD KPIL D G +LWNRQPL LD ++ QLE LV AE L+
Sbjct: 64 LSHNIFSQDILKPILSWDEVGGHPVLWNRQPLNSLDNNSLYTQLEMLVQGAERLQ 118


59  HWH78_RS20530  HWH78_RS20575  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
HWH78_RS20530  0  15  3.366416  two-component system response regulator NarL
HWH78_RS20535  1  14  3.885677  anaerobic/virulence modulator AnvM
HWH78_RS20540  0  15  3.994383  hypothetical protein
HWH78_RS20545  1  14  3.882528  class I SAM-dependent methyltransferase
HWH78_RS20550  1  10  2.987151  SDR family oxidoreductase
HWH78_RS20555  2  11  2.204988  hypothetical protein
HWH78_RS20560  0  11  1.715411  tyrosine phosphatase TpbA
HWH78_RS20565  1  12  1.354798  TIGR01459 family HAD-type hydrolase
HWH78_RS20570  2  14  0.463687  hypothetical protein
HWH78_RS20575  2  14  0.390001  sodium:proton antiporter
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS20530  HTHFIS  80  1e-19  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.9 bits (197), Expect = 1e-19
Identities = 44/197 (22%), Positives = 76/197 (38%), Gaps = 18/197 (9%)

Query: 13 RLLLVDDHPMMRKGVAQLLELEDDLSVVGEAGSGEEALRLAAELDPDMILLDLNMKGMNG 72
+L+ DD +R + Q L V + R A D D+++ D+ M N
Sbjct: 5 TILVADDDAAIRTVLNQALSRAG-YDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 73 LDTLRALREAGVDARIVVFTVSDDKGDVVNVLRAGADGYLLKDMEPERLLEHIRQAATGQ 132
D L +++A D ++V + + + GA YL K + L+ I +A
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA-- 120

Query: 133 MTLSPQLTQILAQALRGDD---RSKSLDELTERERQILRQIAHGYSNKMIARKLDITE-G 188
+ +++ + G RS ++ E+ ++L ++ MI E G
Sbjct: 121 -EPKRRPSKLEDDSQDGMPLVGRSAAMQEI----YRVLARLMQTDLTLMI-----TGESG 170

Query: 189 TVKVHVKRVLHKLGMRS 205
T K V R LH G R
Sbjct: 171 TGKELVARALHDYGKRR 187


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS20550  DHBDHDRGNASE  85  1e-21  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 85.1 bits (210), Expect = 1e-21
Identities = 53/180 (29%), Positives = 81/180 (45%), Gaps = 7/180 (3%)

Query: 5 VAFVTGCSSGIGRALADAFQRAGYRVWA----SARKEDDVRALAEAGFQAVQ--LDVNDA 58
+AF+TG + GIG A+A G + A + E V +L A DV D+
Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDS 69

Query: 59 AALARLAEELGVEAAGLDVLVNNAGYGAMGPLLDGGVDAMRRQFETNVFAVVGVTRALFP 118
AA+ + + E +D+LVN AG G + + F N V +R++
Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSK 129

Query: 119 -LLRRKSGLVVNVGSVSGVLVTPFAGAYCASKAAVHALSDALRLELAPFGVEVLEVQPGA 177
++ R+SG +V VGS + AY +SKAA + L LELA + + V PG+
Sbjct: 130 YMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGS 189


60  HWH78_RS20690  HWH78_RS20725  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
HWH78_RS20690  -1  11  3.090690  SpnA family nuclease
HWH78_RS20695  -1  10  3.307253  alkaline phosphatase
HWH78_RS20700  0  12  3.757536  phosphatidic acid-binding protein
HWH78_RS20705  0  12  3.253676  U32 family peptidase
HWH78_RS20710  -2  11  3.564791  U32 family peptidase
HWH78_RS20715  0  11  2.047631  molybdopterin molybdotransferase MoeA
HWH78_RS20720  0  13  2.503843  molybdenum cofactor biosynthesis protein B
HWH78_RS20725  1  12  3.018212  molybdopterin synthase catalytic subunit MoaE
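
Note on reading the per-gene rows: each row carries five fields in order: locus tag, DNBias, CDNBias, %GCBias and product. The Python sketch below is one way to split such a row, whether or not the fields are separated by whitespace (copies of this page often run them together, as in "HWH78_RS20690-1113.090690SpnA family nuclease"). It rests on assumptions inferred from the rows in this listing, not on documented PredictBias behaviour: DNBias is a single digit with an optional minus sign, CDNBias has one or two digits, and %GCBias always has six decimals, with a single integer digit when positive. The names ROW and parse_row are hypothetical.

```python
import re

# Assumptions (inferred from this listing, not from PredictBias documentation):
#   DNBias  : one digit, optionally negative
#   CDNBias : one or two digits, never negative
#   %GCBias : six decimals; one integer digit when positive, one or two when negative
ROW = re.compile(
    r"^(?P<locus>[A-Z0-9]+_RS\d{5})\s*"
    r"(?P<dn>-?\d)\s*"
    r"(?P<cdn>\d{1,2})\s*"
    r"(?P<gc>-\d{1,2}\.\d{6}|\d\.\d{6})\s*"
    r"(?P<product>.*)$"
)


def parse_row(line):
    """Split one per-gene row into typed fields; return None if it does not match."""
    m = ROW.match(line.strip())
    if m is None:
        return None
    return {
        "locus": m.group("locus"),
        "dn_bias": int(m.group("dn")),
        "cdn_bias": int(m.group("cdn")),
        "gc_bias": float(m.group("gc")),
        "product": m.group("product"),
    }


print(parse_row("HWH78_RS20690-1113.090690SpnA family nuclease"))
```

The fixed six-decimal width of %GCBias is what keeps a product name that starts with a digit (for example "2,3-butanediol dehydrogenase") from being absorbed into the number. A row that breaks any of the assumptions above would be split wrongly or rejected, so the result is worth checking against neighbouring rows.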
61  HWH78_RS20930  HWH78_RS21020  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
HWH78_RS20930  2  25  -7.614141  SDR family oxidoreductase
HWH78_RS20935  4  40  -11.534936  endonuclease/exonuclease/phosphatase family
HWH78_RS20940  4  48  -13.898775  hypothetical protein
HWH78_RS20945  3  54  -14.393278  hypothetical protein
HWH78_RS20950  3  55  -12.961189  SAVED domain-containing protein
HWH78_RS20955  2  52  -11.943349  DEAD/DEAH box helicase family protein
HWH78_RS20960  1  50  -9.371020  SEC-C domain-containing protein
HWH78_RS20965  0  37  -5.510847  hypothetical protein
HWH78_RS20970  1  31  -2.512928  IS3 family transposase
HWH78_RS20975  2  23  -2.257304  ATP-dependent helicase HrpB
HWH78_RS20980  -1  16  0.488453  ATP-dependent helicase HrpB
HWH78_RS20985  1  16  0.305798  hypothetical protein
HWH78_RS20990  1  13  0.934113  CDF family cation-efflux transporter FieF
HWH78_RS20995  0  12  1.142771  DUF6515 family protein
HWH78_RS21000  -1  12  0.673806  Lrp/AsnC family transcriptional regulator
HWH78_RS21005  0  12  2.018569  DUF2788 domain-containing protein
HWH78_RS21010  1  13  2.929384  globin
HWH78_RS21015  1  14  3.168744  pseudouridine synthase
HWH78_RS21020  0  13  3.413991  MBL fold metallo-hydrolase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS20930  DHBDHDRGNASE  110  7e-31  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 110 bits (275), Expect = 7e-31
Identities = 61/229 (26%), Positives = 102/229 (44%), Gaps = 10/229 (4%)

Query: 14 KVVLVSGGCSGIGRALALRFARAGARLAILDLDQAALDSLVQHLRDHLGGEALGLRCDVA 73
K+ ++G GIG A+A A GA +A +D + L+ +V L+ A DV
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLK-AEARHAEAFPADVR 67

Query: 74 DADAVERAVALAVERFGGIDVLVNNAGITHRGTFAETGLGVFRKVMAVNFFGAVHCTRAA 133
D+ A++ A G ID+LVN AG+ G + +VN G + +R+
Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 134 LPSLLERR-GQIVVLGSLTGFAPLLYRSAYNASKHALHGLFDTLRMELEGTGVSVTLACP 192
+++RR G IV +GS P +AY +SK A L +EL + + P
Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187

Query: 193 GFTATDLRKNALVGD--------GSVTRQPVQVLGSQVASPVEVAEAIF 233
G T TD++ + + GS+ + ++A P ++A+A+
Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVL 236


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS20960  SECA  47  2e-07  SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 46.8 bits (111), Expect = 2e-07
Identities = 16/35 (45%), Positives = 20/35 (57%)

Query: 448 DNISQRATPGNGATKNVSRNAPCPCGSNKKYKKCC 482
D+ + A + V RN PCPCGS KKYK+C
Sbjct: 863 DSAAAAALAAQTGERKVGRNDPCPCGSGKKYKQCH 897


62  HWH78_RS21900  HWH78_RS22030  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
HWH78_RS21900  0  10  3.039269  peptidase domain-containing ABC transporter
HWH78_RS21905  -1  10  4.382610  TolC family protein
HWH78_RS21910  1  9  3.759769  LysR family transcriptional regulator
HWH78_RS21915  0  8  3.666988  LysE family translocator
HWH78_RS21920  1  9  3.572524  sigma-54-dependent Fis family transcriptional
HWH78_RS21925  1  12  2.764894  SDR family oxidoreductase
HWH78_RS21930  -1  13  2.488588  ATP-NAD kinase family protein
HWH78_RS21935  -2  12  1.894670  thiamine pyrophosphate-dependent dehydrogenase
HWH78_RS21940  -1  12  1.329218  alpha-ketoacid dehydrogenase subunit beta
HWH78_RS21945  -2  13  1.628349  acetoin dehydrogenase
HWH78_RS21950  -2  12  1.853517  2,3-butanediol dehydrogenase
HWH78_RS21955  -1  13  2.409987  SH3 domain-containing protein
HWH78_RS21960  -2  13  3.327823  NtaA/DmoA family FMN-dependent monooxygenase
HWH78_RS21965  -2  14  3.919229  TonB-dependent receptor
HWH78_RS21970  2  11  5.511423  IclR family transcriptional regulator
HWH78_RS21975  1  10  4.775331  ABC transporter ATP-binding protein
HWH78_RS21980  0  9  4.438009  Fe2+-enterobactin ABC transporter
HWH78_RS21985  1  10  4.157491  iron ABC transporter permease
HWH78_RS21990  0  11  2.894719  iron chelate uptake ABC transporter family
HWH78_RS21995  -1  11  2.098274  SDR family oxidoreductase
HWH78_RS22000  -1  12  1.640799  LysR family transcriptional regulator
HWH78_RS22005  1  10  2.823864  MFS transporter
HWH78_RS22010  2  10  2.852188  LysR family transcriptional regulator
HWH78_RS22015  1  11  3.731343  SgcJ/EcaC family oxidoreductase
HWH78_RS22020  1  12  3.566229  zinc-binding alcohol dehydrogenase family
HWH78_RS22025  2  13  3.227193  SgcJ/EcaC family oxidoreductase
HWH78_RS22030  2  11  2.229680  amidase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS21920  HTHFIS  338  e-112  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 338 bits (869), Expect = e-112
Identities = 135/390 (34%), Positives = 192/390 (49%), Gaps = 59/390 (15%)

Query: 273 FDLDALHAAADQAPCLLRGQAGELHVRLSAPRAKARRLEREVPDDAAL---DPRIAESLR 329
FDL L +A L+ P+ + +LE + D L + E R
Sbjct: 106 FDLTELIGIIGRA--------------LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYR 151

Query: 330 LAVRVKDRNLPVLIQGETGAGKEVFARQLHQASARRDKPFVALNCAAIPESLIESELFGY 389
+ R+ +L ++I GE+G GKE+ AR LH RR+ PFVA+N AAIP LIESELFG+
Sbjct: 152 VLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGH 211

Query: 390 VGGAFTGAAAKGMRGLLQQADGGTLFLDEIGDMPLGLQTRLLRVLAEGEVAPLGAARRQA 449
GAFTGA + G +QA+GGTLFLDEIGDMP+ QTRLLRVL +GE +G
Sbjct: 212 EKGAFTGAQTRST-GRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIR 270

Query: 450 VDIQVVCATHRDLAALVAAGGFREDLYFRLGGARFELPPLRERSDRLALIRRILDEETAH 509
D+++V AT++DL + G FREDLY+RL LPPLR+R++ + + R ++
Sbjct: 271 SDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEK 330

Query: 510 CGVRI-ELGEAALECLLGYRWPGNVRQLRHVLRYACALCGGATLQLADLPAELRGEGRTP 568
G+ + + ALE + + WPGNVR+L +++R AL + + ELR E P
Sbjct: 331 EGLDVKRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSE--IP 388

Query: 569 ASACESGGGP--------------------------------------ERDALLDALVRH 590
S E E +L AL
Sbjct: 389 DSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTAT 448

Query: 591 RWKPMAAARELGISRATLYRRVRRHGIRMP 620
R + AA LG++R TL +++R G+ +
Sbjct: 449 RGNQIKAADLLGLNRNTLRKKIRELGVSVY 478


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS21925  DHBDHDRGNASE  127  2e-37  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 127 bits (319), Expect = 2e-37
Identities = 75/262 (28%), Positives = 126/262 (48%), Gaps = 14/262 (5%)

Query: 11 LSSRVALVTGAGRGIGRGIALALARAGADVAVADLDPQVAEETAAAIRSLGRRSLALGVD 70
+ ++A +TGA +GIG +A LA GA +A D +P+ E+ +++++ R + A D
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 71 VSDGDSVRAMVERVATEFGRLDVAVNNAGVISIRKVAELSLADWDRVMNVNARGVFLCCQ 130
V D ++ + R+ E G +D+ VN AGV+ + LS +W+ +VN+ GVF +
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 131 AELPLMQAQRWGRIVNLSSIAGKVGLPDLAHYCASKFAVIGFSNALAKEVARDGVTVNAL 190
+ M +R G IV + S V +A Y +SK A + F+ L E+A + N +
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 191 CPGIVGTGM----WRGEDGLSGRWRQAGESEAQSWERHQASLLPQGEAQTVEDMGQLVVY 246
PG T M W E+G + + E+ +P + D+ V++
Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTG--------IPLKKLAKPSDIADAVLF 237

Query: 247 LAC--APHVTGQAIAVDGGFSL 266
L A H+T + VDGG +L
Sbjct: 238 LVSGQAGHITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS21955  DPTHRIATOXIN  31  0.004  Diphtheria toxin signature.
		>DPTHRIATOXIN#Diphtheria toxin signature.

Length = 567

Score = 30.9 bits (69), Expect = 0.004
Identities = 24/73 (32%), Positives = 33/73 (45%), Gaps = 11/73 (15%)

Query: 17 RVIGACLLGGLLAAGAPAQAEEATGNARWVSDSLTTFVRS------GPTDGYRIVGTLTS 70
++ + L+G LL GAP A + V DS +FV G GY V ++
Sbjct: 11 KLFASILIGALLGIGAPPSAHAGADD---VVDSSKSFVMENFSSYHGTKPGY--VDSIQK 65

Query: 71 GQKVELLGTQGNY 83
G + GTQGNY
Sbjct: 66 GIQKPKSGTQGNY 78


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS21980  FERRIBNDNGPP  38  4e-05  Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 37.6 bits (87), Expect = 4e-05
Identities = 53/289 (18%), Positives = 97/289 (33%), Gaps = 28/289 (9%)

Query: 2 PTRRRSALPLLALALSLFA-TLAAAGEPKPARIVSTTPSVTGILLAMDAPLVASAATTPS 60
RR L +AL+ L+ A A P RIV+ +LLA+ A T
Sbjct: 6 LISRRRLLTAMALSPLLWQMNTAHAAAIDPNRIVALEWLPVELLLALGIVPYGVADTINY 65

Query: 61 RLTDAKGFFSQWAKVADQRGVEVLYRNLRFD--IEAVIAQDPDLLVASA---TGADSAAP 115
RL + S+ V+ LR + +E + P +V SA + A
Sbjct: 66 RL-----WVSEPPLPDS-----VIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSPEMLAR 115

Query: 116 Y-RAELEAQGVPTLVVDYSKHSWQELATELGRHTGLERQAQAAIQRFDAYTAEVAA-AIA 173
+ ++ S E+A L + A+ + +++ + + +
Sbjct: 116 IAPGRGFNFSDGKQPLAMARKSLTEMADLLNL----QSAAETHLAQYEDFIRSMKPRFVK 171

Query: 174 PPQGPVSVVGYNIAGNYSIGRQASPQARLLEALGFQVAELPEALAGKVTRASDFQFISRE 233
P+ + + + S +L+ G +P A G+ +S +
Sbjct: 172 RGARPLLLTTLIDPRHMLVFGPNSLFQEILDEYG-----IPNAWQGETNFWG-STAVSID 225

Query: 234 NLPAAIAGDSVFLLGASDDDVQAFLADPVLANLSAVREKRVYALGPSSF 282
L A D + + D+ A +A P+ + VR R + F
Sbjct: 226 RLAAYKDVDVLCFDHDNSKDMDALMATPLWQAMPFVRAGRFQRVPAVWF 274


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS21985  PF04335  30  0.011  VirB8 type IV secretion protein
		>PF04335#VirB8 type IV secretion protein

Length = 227

Score = 29.8 bits (67), Expect = 0.011
Identities = 13/43 (30%), Positives = 19/43 (44%), Gaps = 3/43 (6%)

Query: 7 RRRRLRAWGLLAGALLLALA---ALASLALGSRPVPLAVTLDA 46
R + AW + A LA A A+A+L P +T+D
Sbjct: 29 ERSKKLAWVVAGVAGALATAGVVAVAALTPLKTVEPYVITVDR 71


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS21995  DHBDHDRGNASE  119  6e-35  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 119 bits (299), Expect = 6e-35
Identities = 75/258 (29%), Positives = 117/258 (45%), Gaps = 32/258 (12%)

Query: 5 RTALVTGATRGIGLALARRLAASGWSVVGI-----------------ARHASDDFPGRLL 47
+ A +TGA +GIG A+AR LA+ G + + ARHA + FP
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHA-EAFP---- 63

Query: 48 CCDLADPAQTAETLRGLLSESA-VDALVNNAGIALPQSLENLDLAALQQVFDLNVRVAVQ 106
D+ D A E + E +D LVN AG+ P + +L + F +N
Sbjct: 64 -ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122

Query: 107 LAQACLPGLKRSPAGRIVNLCSRAIHGAR-ERTAYAAAKSALVGVTRTWALELAPLGITV 165
+++ + +G IV + S R AYA++K+A V T+ LELA I
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 166 NAVAPGPIETELFRQTRPVGGEEERRILST-------IPMQRLGRPDEVAALIEFLLSEG 218
N V+PG ET++ E+ I + IP+++L +P ++A + FL+S
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 219 ASFVTGQVIGVDGGGSLG 236
A +T + VDGG +LG
Sbjct: 243 AGHITMHNLCVDGGATLG 260


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS22005  TCRTETA  50  6e-09  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 50.2 bits (120), Expect = 6e-09
Identities = 90/396 (22%), Positives = 138/396 (34%), Gaps = 30/396 (7%)

Query: 14 RSTLSVAILAFSAFLIVTTEFLIVGLLPSLARDLQISISAA---GRLVTLFAFTVMLFGP 70
+ + ++ + L LI+ +LP L RDL S G L+ L+A P
Sbjct: 2 KPNRPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAP 61

Query: 71 PLTAVLAHLPRKPLFVAILLAFALSNGLAALSTDLRLLAVARFVPALMLPVFWGTASETA 130
L A+ R+P+ + L A+ + A + L +L + R V + + A
Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIA 121

Query: 131 AQLAGTERAGQAIARVYLGISCALLLGIPLGTLAANGIGWRGAFWILAGLSLLMAVALVL 190
G ERA + + ++ G LG L G F+ A L+ L +
Sbjct: 122 DITDGDERA-RHFGFMSACFGFGMVAGPVLGGLMG-GFSPHAPFFAAAALNGLNFLTGCF 179

Query: 191 FMPAVDRGERLDLRRQARIFGEPLFLANVALSVLVFSAMFVSYTYLADILERIAGI---- 246
+P +GER LRR+A A V A+F + + + I
Sbjct: 180 LLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGED 239

Query: 247 ----TPARIGWWLMGFGAV---------GLFGNWLGGRLVDRSPLRATALFLVLLALGMA 293
IG L FG + G LG R + A +LLA A
Sbjct: 240 RFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAF--A 297

Query: 294 ASVPLAGQPLLFCLALALWGVANTALYPVCQVRVMRSVQHAQALAATSNVAAANAGIGLG 353
+A P++ LA G+ AL + +V + Q S A + +G
Sbjct: 298 TRGWMA-FPIMVLLASG--GIGMPALQAMLSRQVD---EERQGQLQGSLAALTSLTSIVG 351

Query: 354 ALLGGETIATLGLERIGFVAAALAVLGLSLLPVVAR 389
LL A G+ A A L L LP + R
Sbjct: 352 PLLFTAIYAASITTWNGWAWIAGAALYLLCLPALRR 387


63  HWH78_RS22195  HWH78_RS22220  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
HWH78_RS22195  -2  12  3.214473  two-component system response regulator BfiR
HWH78_RS22200  -2  11  3.708623  two-component system sensor histidine kinase
HWH78_RS22205  -2  13  3.098976  acyl-CoA synthetase
HWH78_RS22210  -2  13  3.530457  acyl-CoA dehydrogenase C-terminal
HWH78_RS22215  -2  11  3.683971  MBL fold metallo-hydrolase
HWH78_RS22220  -2  10  3.629366  D-alanine--D-alanine ligase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS22195  HTHFIS  90  4e-23  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 89.5 bits (222), Expect = 4e-23
Identities = 32/149 (21%), Positives = 57/149 (38%)

Query: 5 DEALVYVVDDDQGMLESTVWLLESVGLKARPFTQGRDFLDACEGGRHACVLLDVRMPGMG 64
A + V DDD + L G R + G V+ DV MP
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 65 GLNVQDELLARGIDLPVIFVSGHADVPIVVRAFKAGAVDFIEKPYNEQLLLDSVQQALDR 124
++ + DLPV+ +S ++A + GA D++ KP++ L+ + +AL
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 125 HARRRRHRDLDAGLRERLDSLTPRERDVL 153
RR + D+ L + +++
Sbjct: 122 PKRRPSKLEDDSQDGMPLVGRSAAMQEIY 150


64  HWH78_RS22725  HWH78_RS22925  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
HWH78_RS22725  2  9  -3.080465  histidine triad nucleotide-binding protein
HWH78_RS22730  2  7  -2.791524  2-polyprenyl-3-methyl-6-methoxy-1,4-benzoquinone
HWH78_RS22735  3  8  -3.208842  adenosylmethionine decarboxylase
HWH78_RS22740  2  9  -2.971567  OsmC family protein
HWH78_RS22745  2  12  -3.695119  cAMP-activated global transcriptional regulator
HWH78_RS22750  2  23  -4.724987  indole-3-glycerol phosphate synthase TrpC
HWH78_RS22755  1  19  -3.192318  anthranilate phosphoribosyltransferase
HWH78_RS22760  2  21  -3.395487  aminodeoxychorismate/anthranilate synthase
HWH78_RS22765  2  21  -2.782082  hypothetical protein
HWH78_RS22770  2  19  -1.978919  hypothetical protein
HWH78_RS22775  1  18  -1.659132  pyocin knob domain-containing protein
HWH78_RS22780  -2  15  -0.635503  phage tail protein
HWH78_RS22785  -1  15  -0.348532  tail assembly protein
HWH78_RS22790  -2  15  -0.180309  C40 family peptidase
HWH78_RS22795  -1  15  -1.122561  phage minor tail protein L
HWH78_RS22800  0  18  -1.400997  phage tail protein
HWH78_RS22805  0  19  -0.662073  phage tail tape measure protein
HWH78_RS22810  2  21  0.708070  hypothetical protein
HWH78_RS22815  1  20  0.276186  phage tail assembly chaperone family protein,
HWH78_RS22820  0  20  0.392105  hypothetical protein
HWH78_RS22825  2  22  0.947923  hypothetical protein
HWH78_RS22830  1  24  0.342781  hypothetical protein
HWH78_RS22835  0  24  -0.034571  glycoside hydrolase family 19 protein
HWH78_RS22840  0  23  -0.752505  phage late control D family protein
HWH78_RS22845  -1  21  -1.066693  tail protein X
HWH78_RS22850  -1  23  -1.471953  phage tail protein
HWH78_RS22855  -1  30  -3.282751  phage tail length tape measure protein
HWH78_RS22860  0  28  -4.415564  phage tail assembly protein
HWH78_RS22865  0  28  -4.239994  phage major tail tube protein
HWH78_RS22870  0  29  -3.992186  phage tail sheath family protein
HWH78_RS22875  0  33  -3.847534  hypothetical protein
HWH78_RS22880  1  32  -3.140008  phage tail protein
HWH78_RS22885  -2  23  0.045464  phage tail protein I
HWH78_RS22890  -1  23  -0.080179  bacteriophage protein
HWH78_RS22895  -1  20  1.103761  GPW/gp25 family protein
HWH78_RS22900  0  15  0.719652  phage baseplate assembly protein V
HWH78_RS22905  -1  15  0.252249  hypothetical protein
HWH78_RS22910  -1  11  -0.826604  hypothetical protein
HWH78_RS22915  0  9  -1.257721  hypothetical protein
HWH78_RS22920  2  9  -1.353671  repressor PtrB
HWH78_RS22925  2  11  -1.651634  transcriptional regulator PrtR
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS22830  PYOCINKILLER  32  5e-04  Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 31.7 bits (71), Expect = 5e-04
Identities = 14/49 (28%), Positives = 25/49 (51%), Gaps = 2/49 (4%)

Query: 51 LEALLDEQQRALAAVRASAERRAKDVEQALGEARAQAAEQYAAAVRLLQ 99
L+ ++ A A++ A+A +A+ EQA EA+ +A EQ +
Sbjct: 200 LQIRMNTLTAAKASIEAAAANKAR--EQAAAEAKRKAEEQARQQAAIRA 246


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS22855  PF07132  32  0.010  Harpin protein (HrpN)
		>PF07132#Harpin protein (HrpN)

Length = 356

Score = 31.6 bits (71), Expect = 0.010
Identities = 23/57 (40%), Positives = 30/57 (52%)

Query: 621 GSLAGAALGASIGSVVPVVGTLIGGLVGGAIGAWGGSELGGRLGRSLAGDPPAASDN 677
GS+ G LG +G + +G L GGL+GG +G GS LG LG +L G A
Sbjct: 62 GSMMGGGLGGGLGGLGSSLGGLGGGLLGGGLGGGLGSSLGSGLGSALGGGLGGALGA 118


65  HWH78_RS23025  HWH78_RS23160  Y  N  Y  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
HWH78_RS23025  0  15  -3.499411  Co2+/Mg2+ efflux protein ApaG
HWH78_RS23030  0  15  -3.322582  symmetrical bis(5'-nucleosyl)-tetraphosphatase
HWH78_RS23035  1  13  -2.918017  thiosulfate sulfurtransferase GlpE
HWH78_RS23040  0  11  -3.001990  PrkA family serine protein kinase
HWH78_RS23045  0  9  -1.589869  YeaH/YhbH family protein
HWH78_RS23050  0  9  -0.603754  SpoVR family protein
HWH78_RS23055  0  11  1.543536  DUF4136 domain-containing protein
HWH78_RS23060  1  11  1.049807  multifunctional CCA addition/repair protein
HWH78_RS23065  0  12  0.649125  2-amino-4-hydroxy-6-
HWH78_RS23070  1  10  0.092794  dihydroneopterin aldolase
HWH78_RS23075  2  11  -1.095662  glycerol-3-phosphate 1-O-acyltransferase PlsY
HWH78_RS23080  0  13  -0.974797  tRNA
HWH78_RS23085  0  14  -1.409993  30S ribosomal protein S21
HWH78_RS23090  0  13  -1.747384  GatB/YqeY domain-containing protein
HWH78_RS23095  -1  12  -2.019345  DNA primase
HWH78_RS23100  -1  14  -2.593384  RNA polymerase sigma factor RpoD
HWH78_RS23105  -1  12  -1.252006  bifunctional diguanylate
HWH78_RS23115  2  13  -1.595514  *hypothetical protein
HWH78_RS23120  2  13  -1.345978  Fic family protein
HWH78_RS23125  1  11  -0.079663  NERD domain-containing protein
HWH78_RS23130  0  10  0.317067  hypothetical protein
HWH78_RS23135  0  9  0.875734  ImpA family metalloprotease
HWH78_RS23140  -1  13  1.775635  DMP19 family protein
HWH78_RS23145  0  14  0.856958  SMI1/KNR4 family protein
HWH78_RS23150  0  13  1.700044  hypothetical protein
HWH78_RS23155  0  14  0.568512  hypothetical protein
HWH78_RS23160  2  16  1.698067  YqaE/Pmp3 family membrane protein
66  HWH78_RS23365  HWH78_RS23425  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
HWH78_RS23365  3  14  2.066350  hypothetical protein
HWH78_RS23370  2  15  2.404788  nitric oxide reductase activation protein NorD
HWH78_RS23375  1  15  -0.445750  nitric-oxide reductase subunit NorB
HWH78_RS23380  1  14  0.083564  nitric oxide reductase subunit NorC
HWH78_RS23385  1  13  0.895519  cytochrome C oxidase subunit IV family protein
HWH78_RS23390  0  13  1.051681  cytochrome C oxidase subunit
HWH78_RS23395  0  14  1.214293  transcriptional regulator NirQ
HWH78_RS23400  0  17  1.085922  nitrite reductase
HWH78_RS23405  0  18  3.097911  cytochrome C-551
HWH78_RS23410  0  18  3.205525  cytochrome c55X
HWH78_RS23415  1  17  2.904236  heme d1 biosynthesis protein NirF
HWH78_RS23420  0  17  3.561222  AsnC family transcriptional regulator
HWH78_RS23425  0  15  3.363959  Lrp/AsnC family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS23395  HTHFIS  32  0.002  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 32.1 bits (73), Expect = 0.002
Identities = 23/94 (24%), Positives = 37/94 (39%), Gaps = 7/94 (7%)

Query: 15 IEVFERAWRHGLPVLLKGPTGCGKT---RFVQYMARRLELPLYSVACH---DDLGAADLL 68
V R + L +++ G +G GK R + +R P ++ DL ++L
Sbjct: 150 YRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELF 209

Query: 69 GRHLIGADGTWWQDGPLTRAVREGGICYLDEVVE 102
G H GA EGG +LDE+ +
Sbjct: 210 G-HEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGD 242


67  HWH78_RS24485  HWH78_RS24575  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
HWH78_RS24485  1  22  -3.049924  AraC family transcriptional regulator
HWH78_RS24490  1  27  -4.188519  Tn3 family transposase
HWH78_RS24495  3  52  -9.060444  recombinase family protein
HWH78_RS24500  3  56  -9.262597  class 1 integron integrase IntI1
HWH78_RS24505  2  54  -8.611030  aminoglycoside N-acetyltransferase AAC(6')-Il
HWH78_RS24510  2  44  -7.094873  subclass B1 metallo-beta-lactamase VIM-2
HWH78_RS24515  1  35  -4.835856  trimethoprim-resistant dihydrofolate reductase
HWH78_RS24520  2  35  -4.654007  aminoglycoside N-acetyltransferase AAC(3)-Id
HWH78_RS24525  0  29  -3.345255  recombinase family protein
HWH78_RS24530  -1  17  -1.789812  TniQ family protein
HWH78_RS24535  -2  12  -1.395937  TniB family NTP-binding protein
HWH78_RS24540  -2  10  -1.927688  DDE-type integrase/transposase/recombinase
HWH78_RS24545  0  9  -1.806635  DUF3330 domain-containing protein
HWH78_RS24550  0  10  -2.069923  IS6-like element IS6100 family transposase
HWH78_RS24555  1  13  -2.313017  ABC transporter permease subunit
HWH78_RS24560  1  15  -3.549593  ABC transporter permease subunit
HWH78_RS24565  0  14  -3.717752  polyamine ABC transporter ATP-binding protein
HWH78_RS24570  0  13  -3.422075  spermidine-binding protein SpuE
HWH78_RS24575  1  13  -3.612357  putrescine-binding protein SpuD
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS24505  SACTRNSFRASE  40  8e-07  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 39.9 bits (93), Expect = 8e-07
Identities = 18/77 (23%), Positives = 27/77 (35%), Gaps = 9/77 (11%)

Query: 73 YAEECYSGNVAFLEGW---------YVVPSARRQGVGVALVKAAEHWARGRGCTEFASDT 123
Y E G + W V R++GVG AL+ A WA+ +T
Sbjct: 71 YLENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLET 130

Query: 124 QLTNSASTSAHLAAGFT 140
Q N ++ + F
Sbjct: 131 QDINISACHFYAKHHFI 147


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS24520  SACTRNSFRASE  30  0.002  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 30.3 bits (68), Expect = 0.002
Identities = 14/57 (24%), Positives = 23/57 (40%), Gaps = 2/57 (3%)

Query: 83 YIYDLAVAATRRREGIATALIKKLKAIGAARGAYVIYVQADKGVEDQPAIELYKKLG 139
I D+AVA R++G+ TAL+ K + ++ + A Y K
Sbjct: 91 LIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQD--INISACHFYAKHH 145


68  HWH78_RS24685  HWH78_RS24885  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
HWH78_RS24685  3  11  3.759746  sulfite exporter TauE/SafE family protein
HWH78_RS24690  4  11  3.940963  M48 family metallopeptidase
HWH78_RS24695  3  11  3.221399  DUF962 domain-containing protein
HWH78_RS24700  5  12  3.274578  Crp/Fnr family transcriptional regulator
HWH78_RS24705  5  11  3.124623  hypothetical protein
HWH78_RS24710  4  12  2.972012  CynX/NimT family MFS transporter
HWH78_RS24720  1  8  2.587857  LysR family transcriptional regulator
HWH78_RS24725  -1  9  2.288688  antibiotic biosynthesis monooxygenase
HWH78_RS24730  -1  15  1.822771  cupin domain-containing protein
HWH78_RS24735  -1  15  1.382844  carboxymuconolactone decarboxylase family
HWH78_RS24740  0  10  -0.530273  PLP-dependent aminotransferase family protein
HWH78_RS24745  -1  14  -2.242615  response regulator
HWH78_RS24750  1  16  -4.274181  4-aminobutyrate--2-oxoglutarate transaminase
HWH78_RS24755  -2  35  -8.290428  NADP-dependent succinate-semialdehyde
HWH78_RS24765  -1  52  -10.896907  *hypothetical protein
HWH78_RS24770  0  55  -10.963595  site-specific integrase
HWH78_RS24775  2  58  -11.463272  hypothetical protein
HWH78_RS24780  2  60  -11.256346  helix-turn-helix transcriptional regulator
HWH78_RS24785  2  60  -11.361377  hypothetical protein
HWH78_RS24790  4  61  -11.957390  hypothetical protein
HWH78_RS24795  5  60  -11.911397  hypothetical protein
HWH78_RS24800  4  57  -12.133195  hypothetical protein
HWH78_RS24805  2  39  -11.097590  DUF6088 family protein
HWH78_RS24810  6  45  -13.929113  hypothetical protein
HWH78_RS24815  7  52  -15.190662  hypothetical protein
HWH78_RS24820  6  47  -14.198950  hypothetical protein
HWH78_RS24825  5  43  -13.412231  restriction endonuclease subunit S
HWH78_RS24830  5  44  -13.191879  type I restriction-modification system subunit
HWH78_RS24835  5  47  -11.680004  restriction endonuclease subunit S
HWH78_RS24840  4  34  -8.486677  hypothetical protein
HWH78_RS24845  3  32  -8.082165  virulence RhuM family protein
HWH78_RS24850  3  33  -8.081605  type I restriction endonuclease subunit R
HWH78_RS24855  4  32  -6.940895  M48 family metallopeptidase
HWH78_RS24860  3  33  -6.871292  STY4851/ECs_5259 family protein
HWH78_RS24865  2  31  -6.797363  DEAD/DEAH box helicase
HWH78_RS24870  0  45  -8.159433  hypothetical protein
HWH78_RS24875  0  42  -7.271689  helix-turn-helix domain-containing protein
HWH78_RS24880  -1  40  -6.831809  hypothetical protein
HWH78_RS24885  -2  36  -5.863123  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS24715  TCRTETB  29  0.038  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 29.1 bits (65), Expect = 0.038
Identities = 28/131 (21%), Positives = 47/131 (35%), Gaps = 6/131 (4%)

Query: 228 LAPYYL--EQGWSAQESGLLLGFLTAMEV-LSGLLAPALASRSRDRRPVLVGLTALMLAG 284
+ PY + S E G ++ F M V + G + L R R VL +
Sbjct: 278 MVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDR-RGPLYVLNIGVTFLSVS 336

Query: 285 FLGLAWAPASLPLLWALCLGLGIGGLFPMGLIVC--LDHFDAPQRAGQLAALVQGAGYLI 342
FL ++ + + + +GGL ++ + Q AG +L+ +L
Sbjct: 337 FLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLS 396

Query: 343 AGVSPWIAGLL 353
G I G L
Sbjct: 397 EGTGIAIVGGL 407


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS24745    HTHFIS    54    4e-10    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 54.1 bits (130), Expect = 4e-10
Identities = 26/124 (20%), Positives = 49/124 (39%), Gaps = 8/124 (6%)

Query: 1 MSAAKKAPVILIADPDPWSRDLLGQLVLGVRCDARLVLCGDGGEALAHCRRRRFALILAE 60
M+ A IL+AD D R +L Q + R + + + L++ +
Sbjct: 1 MTGAT----ILVADDDAAIRTVLNQALS--RAGYDVRITSNAATLWRWIAAGDGDLVVTD 54

Query: 61 LNLPQVDGFELLREARLRRSVAEQPFILISDRADQASVRAAVSLAPTAYLVKPFQAENLM 120
+ +P + F+LL + R + P +++S + + A YL KPF L+
Sbjct: 55 VVMPDENAFDLLPRIKKAR--PDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELI 112

Query: 121 QRLR 124
+
Sbjct: 113 GIIG 116


69    HWH78_RS25105    HWH78_RS25195    Y    Y    N    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
HWH78_RS251050123.911117LysR family transcriptional regulator
HWH78_RS251100124.954136malonate transporter subunit MadM
HWH78_RS251151125.731484malonate transporter subunit MadL
HWH78_RS251202116.548323malonate decarboxylase subunit epsilon
HWH78_RS251253104.992993malonate decarboxylase holo-ACP synthase
HWH78_RS251302104.622199biotin-independent malonate decarboxylase
HWH78_RS251351103.552039biotin-independent malonate decarboxylase
HWH78_RS251401103.239769malonate decarboxylase subunit delta
HWH78_RS25145192.721032triphosphoribosyl-dephospho-CoA synthase
HWH78_RS25150-1101.459971malonate decarboxylase subunit alpha
HWH78_RS25155-3101.557139LysR family transcriptional regulator
HWH78_RS25160-3101.420411ABC transporter ATP-binding protein
HWH78_RS25165-2121.319285ABC transporter permease
HWH78_RS25170-3120.939913ABC transporter permease
HWH78_RS25175-2131.526308extracellular solute-binding protein
HWH78_RS25180-1113.088465amidase
HWH78_RS251851141.928202alpha/beta hydrolase
HWH78_RS251902141.114301DUF3079 domain-containing protein
HWH78_RS251952161.880191biopolymer transporter ExbD
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS25160    PF05272    30    0.014    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.4 bits (68), Expect = 0.014
Identities = 9/30 (30%), Positives = 16/30 (53%)

Query: 45 TLLGPSGCGKTTLLRMIAGFEFPTEGEILL 74
L G G GK+TL+ + G +F ++ +
Sbjct: 600 VLEGTGGIGKSTLINTLVGLDFFSDTHFDI 629


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS25175    MYCMG045    37    8e-05    Hypothetical mycoplasma lipoprotein (MG045) signature.
		>MYCMG045#Hypothetical mycoplasma lipoprotein (MG045) signature.

Length = 483

Score = 37.4 bits (86), Expect = 8e-05
Identities = 24/87 (27%), Positives = 42/87 (48%), Gaps = 4/87 (4%)

Query: 10 LGLTLALGGSAQAAGQLNVVSWSGYFSPQLLEKFEKESGIRVTVDSYDSNETLLAKLKQG 69
L ++L+ S+ + + ++ Y SP LLE+ +++ + T +Y SNE L+
Sbjct: 12 LFVSLSSILSSCGSTTFVLANFESYISPLLLERVQEKHPL--TFLTYPSNEKLINGFANN 69

Query: 70 GAGYDVAIPSQQFVPILVKEALLERFD 96
Y VA+ S V L++ LL D
Sbjct: 70 --TYSVAVASTYAVSELIERDLLSPID 94


70    HWH78_RS25605    HWH78_RS25660    Y    Y    N    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
HWH78_RS256052121.502695SDR family oxidoreductase
HWH78_RS256105131.412583helix-turn-helix transcriptional regulator
HWH78_RS256156122.253001GNAT family N-acetyltransferase
HWH78_RS256206122.193909cytochrome c oxidase copper chaperone SenC
HWH78_RS256254131.777678heme o synthase
HWH78_RS256303141.740027COX15/CtaA family protein
HWH78_RS25635217-0.513478hypothetical protein
HWH78_RS25640317-1.196440hypothetical protein
HWH78_RS25645418-1.951770twin transmembrane helix small protein
HWH78_RS25650314-0.549227cytochrome c oxidase subunit 3
HWH78_RS25655412-0.337548cytochrome c oxidase assembly protein
HWH78_RS256603110.564825cytochrome c oxidase subunit I
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS25605    DHBDHDRGNASE    75    4e-18    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 74.7 bits (183), Expect = 4e-18
Identities = 47/192 (24%), Positives = 86/192 (44%), Gaps = 5/192 (2%)

Query: 5 KAVLVMGAGDATGGAIARRFAREGYVACVARRNAEKLEPLVQAIRDQGGEALACGCDARQ 64
K + GA G A+AR A +G N EKLE +V +++ + A A D R
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 65 EQQVIDLFARIEGEVGALEAVIFNVGANVWFPITETTERVYRKVWEMAAFGGFLTGREAA 124
+ ++ ARIE E+G ++ ++ G I ++ + + + + G F R +
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 125 RVMLPRQRGTIIFTGATASLRGRAHFAAFSGAKFALRALAQSMARELGPKGI--HVAHPI 182
+ M+ R+ G+I+ G+ + R AA++ +K A + + EL I ++ P
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP- 187

Query: 183 IDGAIDTDFIRE 194
G+ +TD
Sbjct: 188 --GSTETDMQWS 197


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS25615    SACTRNSFRASE    29    0.006    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 28.8 bits (64), Expect = 0.006
Identities = 13/48 (27%), Positives = 23/48 (47%), Gaps = 8/48 (16%)

Query: 88 RGQGLGHQLMERALQ-AAER----LWLDTPVYLSAQAHLQAYYGRYGF 130
R +G+G L+ +A++ A E L L+T + H Y ++ F
Sbjct: 102 RKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHF---YAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS25620    PF06057    28    0.026    Type IV secretory pathway VirJ component
		>PF06057#Type IV secretory pathway VirJ component

Length = 243

Score = 27.9 bits (62), Expect = 0.026
Identities = 18/81 (22%), Positives = 33/81 (40%), Gaps = 10/81 (12%)

Query: 72 GRWHLLFFGYTFCPDVCPTTLAQLRELQGKLPQEVRDDL-QVVFVSVDPNRDTPQQIKQY 130
G ++ GY+F +V P L + +P R ++ V +S + D + +
Sbjct: 115 GTQKVILIGYSFGAEVIPFVLNE-------MPARYRKNVLGAVLLSPSQSSDFEIHVSEM 167

Query: 131 LGYFNAGFQGLTGTPENIQKL 151
+ N + LT PE + K
Sbjct: 168 VTSDNQSARYLTL-PE-VNKQ 186


71    HWH78_RS25775    HWH78_RS25895    Y    Y    N    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
HWH78_RS25775-1133.859196type VI secretion system contractile sheath
HWH78_RS25780-1163.817528type VI secretion system protein TssA
HWH78_RS25785-1173.925914type VI secretion system-associated FHA domain
HWH78_RS257900173.764936type VI secretion system lipoprotein TssJ
HWH78_RS25795-1173.738783type VI secretion system baseplate subunit TssK
HWH78_RS258000174.065789DotU family type VI secretion system protein
HWH78_RS258050164.197418type VI secretion system membrane subunit TssM
HWH78_RS25810-1123.911702type VI secretion system-associated protein
HWH78_RS258150133.503804serine/threonine phosphatase PppA
HWH78_RS25820-1113.404927serine/threonine protein kinase PpkA
HWH78_RS258251103.010025type IV secretion associated ABC transporter
HWH78_RS258300112.009604type IV secretion associated ABC transporter
HWH78_RS25835-1110.945896type VI secretion system-associated regulator
HWH78_RS25840-1141.183268type VI secretion system-associated lipoprotein
HWH78_RS25845-1141.491354PA0069 family radical SAM protein
HWH78_RS258500121.667859YheV family putative metal-binding protein
HWH78_RS258551121.782421oligopeptidase A
HWH78_RS258602122.283208gamma carbonic anhydrase family protein
HWH78_RS258652122.130724HAD family hydrolase
HWH78_RS258701112.035219hypothetical protein
HWH78_RS258751112.500639aminopeptidase
HWH78_RS258800151.896404hypothetical protein
HWH78_RS25885-1153.032134hypothetical protein
HWH78_RS25890-1142.409580DUF1161 domain-containing protein
HWH78_RS25895-1133.064338OsmC family protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS25785    PF05616    38    6e-05    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 38.2 bits (88), Expect = 6e-05
Identities = 30/96 (31%), Positives = 35/96 (36%), Gaps = 25/96 (26%)

Query: 180 PRPDHVPAEQHDFRPPEPVIPPPPATTPAPPPAGGAPLIPADWDPFAELLGNTPAPSATP 239
PRPD P P P P +PA PA N PAP+ P
Sbjct: 311 PRPDLTPGSAE-----APNAQPLPEVSPAENPA------------------NNPAPNENP 347

Query: 240 VAQPLPTAEPTPLAMPFADPGTTQQPQPQPQPQPQP 275
+P P EP P P A+P T QP +P P
Sbjct: 348 GTRPNP--EPDPDLNPDANPDTDGQPGTRPDSPAVP 381



Score = 35.5 bits (81), Expect = 4e-04
Identities = 21/67 (31%), Positives = 28/67 (41%), Gaps = 12/67 (17%)

Query: 230 GNTPAPSATPVAQPLPTAEPTPLAMPFADPGTTQQPQPQP------------QPQPQPAS 277
G+ AP+A P+ + P P P +PGT P+P P QP +P S
Sbjct: 318 GSAEAPNAQPLPEVSPAENPANNPAPNENPGTRPNPEPDPDLNPDANPDTDGQPGTRPDS 377

Query: 278 VAAPTPP 284
A P P
Sbjct: 378 PAVPDRP 384


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS25800    OMPADOMAIN    74    1e-16    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 74.2 bits (182), Expect = 1e-16
Identities = 40/138 (28%), Positives = 60/138 (43%), Gaps = 16/138 (11%)

Query: 318 AQRVAVEDAVDRSVVTIRGDELFASASASVRDEFQPLLLRIADALRKVK---GQVLVTGH 374
A A V T++ D LF A+++ E Q L ++ L + G V+V G+
Sbjct: 201 APAPAPAPEVQTKHFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGY 260

Query: 375 SDNRPIATLRYPSNWKLSQARAQEVADLLGATTGDAGRFTAEGRSDTEPVATNASAEGRA 434
+D + Y N LS+ RAQ V D L + A + +A G ++ PV N +
Sbjct: 261 TDRI--GSDAY--NQGLSERRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQ 316

Query: 435 R---------NRRVEITV 443
R +RRVEI V
Sbjct: 317 RAALIDCLAPDRRVEIEV 334


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS25820    PF03544    39    4e-05    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 39.2 bits (91), Expect = 4e-05
Identities = 23/111 (20%), Positives = 36/111 (32%), Gaps = 4/111 (3%)

Query: 260 DRLAPSALEATQIRPLATPQGSPRASNPPPAEPAPMPPADLGGLQPVSIQLPPVTPSAGG 319
+AP+ LE Q P+ P P EP P PP + + P P
Sbjct: 53 TMVAPADLEPPQ-AVQPPPE--PVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVK 109

Query: 320 ATPPPPPPSQAA-KPPSPPPPPLPPAKPRAGGSRTPLIAAAAAAAAVLLAI 369
P + P+ P PA+P + + + A+ A+
Sbjct: 110 KVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRAL 160



Score = 30.3 bits (68), Expect = 0.029
Identities = 22/112 (19%), Positives = 30/112 (26%), Gaps = 9/112 (8%)

Query: 261 RLAPSALEATQIRPLATPQGSP---------RASNPPPAEPAPMPPADLGGLQPVSIQLP 311
A + P P+ P P +P P P + + +
Sbjct: 63 PQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVE 122

Query: 312 PVTPSAGGATPPPPPPSQAAKPPSPPPPPLPPAKPRAGGSRTPLIAAAAAAA 363
S T P P S A + P + PRA P A A A
Sbjct: 123 SRPASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQAL 174


72    HWH78_RS26145    HWH78_RS26295    Y    Y    N    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
HWH78_RS26145-212-3.751731tetratricopeptide repeat protein
HWH78_RS26150-120-4.724314hypothetical protein
HWH78_RS26155012-4.3149167-cyano-7-deazaguanine/7-aminomethyl-7-
HWH78_RS2616507-1.698672PilZ domain-containing protein
HWH78_RS2617008-1.600808lysophospholipid acyltransferase
HWH78_RS26175-18-0.639406DNA-3-methyladenine glycosylase I
HWH78_RS26180-110-1.602944glycine--tRNA ligase subunit alpha
HWH78_RS26185-28-2.107693glycine--tRNA ligase subunit beta
HWH78_RS26190-110-3.112467hypothetical protein
HWH78_RS26195013-3.993749D-glycero-beta-D-manno-heptose 1,7-bisphosphate
HWH78_RS26200014-4.0641681-acyl-sn-glycerol-3-phosphate acyltransferase
HWH78_RS26205114-4.448187DNA topoisomerase (ATP-hydrolyzing) subunit B
HWH78_RS26210114-4.436904DNA replication/repair protein RecF
HWH78_RS26215013-3.652295DNA polymerase III subunit beta
HWH78_RS26220013-3.448643chromosomal replication initiator protein DnaA
HWH78_RS26225014-3.13739150S ribosomal protein L34
HWH78_RS26230-112-2.947446ribonuclease P protein component
HWH78_RS26235-211-2.764628membrane protein insertase YidC
HWH78_RS26240-213-1.914486tRNA uridine-5-carboxymethylaminomethyl(34)
HWH78_RS26245-214-2.512521hypothetical protein
HWH78_RS26250-214-3.134336tRNA uridine-5-carboxymethylaminomethyl(34)
HWH78_RS26255-116-3.63199516S rRNA (guanine(527)-N(7))-methyltransferase
HWH78_RS26260-120-4.381375chromosome partitioning protein Soj
HWH78_RS26265127-5.236881ParB/RepB/Spo0J family partition protein
HWH78_RS26270026-5.997917F0F1 ATP synthase subunit I
HWH78_RS26275122-6.048227F0F1 ATP synthase subunit A
HWH78_RS26280225-5.600907F0F1 ATP synthase subunit C
HWH78_RS26285223-4.961013F0F1 ATP synthase subunit B
HWH78_RS26290119-3.730255F0F1 ATP synthase subunit delta
HWH78_RS26295216-3.187897F0F1 ATP synthase subunit alpha
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS26220    PF03544    39    2e-05    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 39.2 bits (91), Expect = 2e-05
Identities = 21/119 (17%), Positives = 36/119 (30%), Gaps = 3/119 (2%)

Query: 85 TPRAAIVPSQTHVAPPPPVAPPPAPVQPVSAAPVVVPREELPPVTTAPSVSSDPYEPEEP 144
P+A P + V P P P P P + APVV+ + + P V +
Sbjct: 62 PPQAVQPPPEPVVEPEPEPEPIPEPPKE---APVVIEKPKPKPKPKPKPVKKVEQPKRDV 118

Query: 145 SIDPLAAAMPAGAAPAVRTERNVQVEGALKHTSYLNRTFTFENFVEGKSNQLARAAAWQ 203
A P R + K + + + + + A+A +
Sbjct: 119 KPVESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIE 177



Score = 34.6 bits (79), Expect = 6e-04
Identities = 17/91 (18%), Positives = 22/91 (24%), Gaps = 5/91 (5%)

Query: 82 RSRTPRAAIVPSQTHVAPPPPVAPPPAPVQPVSAAPVVVPREELPPVTTAPSVSSDPYEP 141
P A P + P + PP A P P V E P P
Sbjct: 40 VIELPAPA-QPISVTMVAPADLEPPQAVQPP----PEPVVEPEPEPEPIPEPPKEAPVVI 94

Query: 142 EEPSIDPLAAAMPAGAAPAVRTERNVQVEGA 172
E+P P P + +
Sbjct: 95 EKPKPKPKPKPKPVKKVEQPKRDVKPVESRP 125


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS26235    60KDINNERMP    677    0.0    60kDa inner membrane protein signature.
		>60KDINNERMP#60kDa inner membrane protein signature.

Length = 548

Score = 677 bits (1748), Expect = 0.0
Identities = 245/581 (42%), Positives = 344/581 (59%), Gaps = 54/581 (9%)

Query: 1 MDIQRSILIVALAVVSYLLVLQWNKDYGQPELPAASASMNTTQGLPDTPSASGTSSDVPT 60
MD QR++L++AL VS+++ W +D T
Sbjct: 1 MDSQRNLLVIALLFVSFMIWQAWEQDKNPQPQA-----------------------QQTT 37

Query: 61 AQSSAAGSEAADK--PVAVSDKLIQVKTDVLDLAIDPRGGDIVQLGLLQYPRRLDRPDVP 118
++ A AAD+ P + KLI VKTDVLDL I+ RGGD+ Q L YP+ L+ P
Sbjct: 38 QTTTTAAGSAADQGVPASGQGKLISVKTDVLDLTINTRGGDVEQALLPAYPKELNSTQ-P 96

Query: 119 FPLFDNGRERTYLAQSGLTGADGPDASSAG-RPLFHSAQSSYQLADGQNELVVDLSFS-H 176
F L + + Y AQSGLTG DGPD + G RPL++ + +Y LA+GQNEL V ++++
Sbjct: 97 FQLLETSPQFIYQAQSGLTGRDGPDNPANGPRPLYNVEKDAYVLAEGQNELQVPMTYTDA 156

Query: 177 DGVNYIKRFTFHRGLKADCSDKEKAQKKIECINENAYQVGVSYLIDNQSGKTWSGNLFAQ 236
G + K F RG Y V V+Y + N K + F Q
Sbjct: 157 AGNTFTKTFVLKRG---------------------DYAVNVNYNVQNAGEKPLEISSFGQ 195

Query: 237 LKRDGSADPSSTTATG---VSTYLGAAVWTPDSPYKKISTKDM-DKEQFKESVQGGWVAW 292
LK+ + P T + + T+ GAA TPD Y+K + D E S +GGWVA
Sbjct: 196 LKQSITLPPHLDTGSSNFALHTFRGAAYSTPDEKYEKYKFDTIADNENLNISSKGGWVAM 255

Query: 293 LQHYFVTAWVPTKGEQHQVMTRKDGQGNYIVGFTGPTLSVPAGSKVETDLTLYAGPKLQK 352
LQ YF TAW+P + T G G +G+ + V G + TL+ GP++Q
Sbjct: 256 LQQYFATAWIPHNDGTNNFYTANLGNGIAAIGYKSQPVLVQPGQTGAMNSTLWVGPEIQD 315

Query: 353 HLKELSPGLELTVDYGFLWFIAQPIFWLLQHIHSLIGNWGWSIIALTVLIKLAFFPLSAA 412
+ ++P L+LTVDYG+LWFI+QP+F LL+ IHS +GNWG+SII +T +++ +PL+ A
Sbjct: 316 KMAAVAPHLDLTVDYGWLWFISQPLFKLLKWIHSFVGNWGFSIIIITFIVRGIMYPLTKA 375

Query: 413 SYRSMARMRAVSPKMQAIKEQHGDDRQKMSQAMMELYKKEKINPLGGCLPILVQMPVFLS 472
Y SMA+MR + PK+QA++E+ GDD+Q++SQ MM LYK EK+NPLGGC P+L+QMP+FL+
Sbjct: 376 QYTSMAKMRMLQPKIQAMRERLGDDKQRISQEMMALYKAEKVNPLGGCFPLLIQMPIFLA 435

Query: 473 LYWVLLESVEMRQAPWLGWITDLSVKDPFFILPIVMGGTMLIQQMLNPTP-PDPMQAKVM 531
LY++L+ SVE+RQAP+ WI DLS +DP++ILPI+MG TM Q ++PT DPMQ K+M
Sbjct: 436 LYYMLMGSVELRQAPFALWIHDLSAQDPYYILPILMGVTMFFIQKMSPTTVTDPMQQKIM 495

Query: 532 KLMPIIFTFFFLWFPAGLVLYWVVNNCLSIAQQWYITRKIE 572
MP+IFT FFLWFP+GLVLY++V+N ++I QQ I R +E
Sbjct: 496 TFMPVIFTVFFLWFPSGLVLYYIVSNLVTIIQQQLIYRGLE 536


73    HWH78_RS26710    HWH78_RS26745    Y    N    N    Genomic Island
LocusTag    DNBias    CDNBias    %GCBias    Product
HWH78_RS267103100.800044Na/Pi cotransporter family protein
HWH78_RS267152111.026651ABC transporter substrate-binding protein
HWH78_RS267203111.503937RNA ligase RtcB family protein
HWH78_RS267253111.525260peptide chain release factor H
HWH78_RS267302121.721373TerC family protein
HWH78_RS267351122.461835CitMHS family transporter
HWH78_RS267402132.926215DUF2388 domain-containing protein
HWH78_RS267452132.606081AEC family transporter
74    HWH78_RS27045    HWH78_RS27170    Y    Y    N    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
HWH78_RS270453190.134283HdeD family acid-resistance protein
HWH78_RS27050525-0.987106hypothetical protein
HWH78_RS27055524-2.564662cell division protein ZapA
HWH78_RS27060522-1.006526hypothetical protein
HWH78_RS270650181.707992hypothetical protein
HWH78_RS27070-1152.178106hypothetical protein
HWH78_RS27075-2141.629723helix-turn-helix transcriptional regulator
HWH78_RS27080-2141.611058hypothetical protein
HWH78_RS270850161.403644electron transfer flavoprotein subunit beta/FixA
HWH78_RS270901170.993202electron transfer flavoprotein subunit
HWH78_RS270950150.345513dimethylglycine demethylation protein DgcB
HWH78_RS271002140.115419dimethylglycine demethylation protein DgcA
HWH78_RS271052130.175599DUF5943 domain-containing protein
HWH78_RS271102121.045218dipeptidase
HWH78_RS271152122.578952DUF1348 family protein
HWH78_RS271200122.693717cardiolipin synthase
HWH78_RS27125-1102.630146NAD(P)/FAD-dependent oxidoreductase
HWH78_RS27130-1112.519735RidA family protein
HWH78_RS27135-1113.251983DUF1028 domain-containing protein
HWH78_RS27140-1122.725344acetylornithine deacetylase
HWH78_RS271451123.035334HTH-type transcriptional regulator CdhR
HWH78_RS271502113.152712choline ABC transporter substrate-binding
HWH78_RS271552113.6901623-keto-5-aminohexanoate cleavage protein
HWH78_RS271603124.024762L-carnitine dehydrogenase
HWH78_RS271653133.177845thioesterase family protein
HWH78_RS271702122.990687acylcarnitine hydrolase
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS27095    TCRTETA    33    0.004    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 32.9 bits (75), Expect = 0.004
Identities = 35/155 (22%), Positives = 52/155 (33%), Gaps = 16/155 (10%)

Query: 5 LLPVLLFAALALAVLGAAKRFLMWRRGRPAKVDWIGGL----MQMPRRYLVDLHHVVERD 60
LL L AA+ A++ A + GR + G+ + Y+ D+ ER
Sbjct: 76 LLVSLAGAAVDYAIMATAPFLWVLYIGR-----IVAGITGATGAVAGAYIADITDGDERA 130

Query: 61 RYMSRTHVATAGGFVLAVLLAILVHGFGLHGRILGFALLAATALMFVGALF--VARRRLD 118
R+ G V +L L+ GF H A L + L +
Sbjct: 131 RHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERR 190

Query: 119 PPSRLSKGP-----WMRLPKSLLAFAASFFLATLP 148
P R + P W R + A A FF+ L
Sbjct: 191 PLRREALNPLASFRWARGMTVVAALMAVFFIMQLV 225


75    HWH78_RS27645    HWH78_RS27820    Y    Y    Y    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
HWH78_RS27645122-5.439668NorM family multidrug efflux MATE transporter
HWH78_RS27650344-11.075659LysR family transcriptional regulator
HWH78_RS27655353-13.368795phosphorylcholine phosphatase
HWH78_RS27660466-15.606251choline BCCT transporter BetT
HWH78_RS27665689-19.599904hypothetical protein
HWH78_RS27670691-20.097494integrase
HWH78_RS27675688-18.634089hypothetical protein
HWH78_RS27680363-14.514895site-specific integrase
HWH78_RS27685148-12.533283hypothetical protein
HWH78_RS27690136-10.926160UvrD-helicase domain-containing protein
HWH78_RS27695136-10.560111hypothetical protein
HWH78_RS27700132-10.056118dynamin family protein
HWH78_RS27705133-10.207853dynamin family protein
HWH78_RS27710032-9.696232WYL domain-containing protein
HWH78_RS27715236-9.374702hypothetical protein
HWH78_RS27720-127-8.324765hypothetical protein
HWH78_RS27725-126-7.995202DEAD/DEAH box helicase family protein
HWH78_RS27730021-7.562812HNH endonuclease
HWH78_RS27735129-8.304812type I restriction-modification system subunit
HWH78_RS27740125-7.133954restriction endonuclease subunit S
HWH78_RS27745226-7.203759endonuclease NucS
HWH78_RS27750332-7.317823anti-phage defense ZorAB system protein ZorA
HWH78_RS27755336-7.920216OmpA family protein
HWH78_RS27760344-8.920779hypothetical protein
HWH78_RS27765238-8.092903DEAD/DEAH box helicase
HWH78_RS27770244-9.004153IS256-like element ISPa1328 family transposase
HWH78_RS27775046-9.189666IS3 family transposase
HWH78_RS27780045-8.638805IS3 family transposase
HWH78_RS27785-142-8.263718IS3 family transposase
HWH78_RS27790-127-3.977815hypothetical protein
HWH78_RS27795032-3.104779accessory factor UbiK family protein
HWH78_RS27800018-1.327978P-II family nitrogen regulator
HWH78_RS27805-210-0.578954ammonium transporter
HWH78_RS27810-1110.258932secondary thiamine-phosphate synthase enzyme
HWH78_RS278150110.875768transcriptional regulator SutA
HWH78_RS27820210-1.743624type 1 fimbrial protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS27670    FLGLRINGFLGH    31    0.012    Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 31.1 bits (70), Expect = 0.012
Identities = 13/36 (36%), Positives = 19/36 (52%), Gaps = 1/36 (2%)

Query: 230 NLLKSKQAEGDALIKNTGCGYLTDMKENPWLMRYFD 265
N + S Q DA I+ G GY+ + + WL R+F
Sbjct: 193 NTVPSTQV-ADARIEYVGNGYINEAQNMGWLQRFFL 227


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS27705    GPOSANCHOR    50    3e-08    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 49.7 bits (118), Expect = 3e-08
Identities = 31/210 (14%), Positives = 69/210 (32%), Gaps = 10/210 (4%)

Query: 25 KRKKLAADLLGYLNDAQQ---QMDELYAANQQVSAQRA-------LQQDEIAQLQSRSNS 74
++ L L G +N + ++ L A + A++A + ++ +
Sbjct: 156 RKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKT 215

Query: 75 LAEQLQAESVRLASTAKALDLEKSKLHEALQALAQANTQFHDARLRESQALTRIAELEQQ 134
L + A + R A KAL+ + + + R+++ +
Sbjct: 216 LEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNF 275

Query: 135 LTDQQTQLSEKAQFIDELQQTLTEVRQQLDALRLYKRQLTEQCTDQATQIQTLEVRSQEL 194
T ++ L+ ++ Q L ++ L + LE Q+L
Sbjct: 276 STADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKL 335

Query: 195 ASQLQVEQANTAILEERKSQALAQITELEA 224
Q ++ +A+ L + +LEA
Sbjct: 336 EEQNKISEASRQSLRRDLDASREAKKQLEA 365



Score = 49.3 bits (117), Expect = 4e-08
Identities = 36/239 (15%), Positives = 80/239 (33%), Gaps = 11/239 (4%)

Query: 25 KRKKLAADLLGYLNDAQQQMDELYAANQQVSAQRALQQDEIAQLQSRSNSLAEQLQAESV 84
K A + L + +++ S + + + ++ + L+
Sbjct: 106 KSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALE 165

Query: 85 RLASTAKALDLEKSKLHEALQAL----AQANTQFHDARLRESQALTRIAELEQQLTDQQT 140
+ + A + L AL A+ A + +I LE +
Sbjct: 166 GAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAA 225

Query: 141 QLSEKAQFIDELQQTLTEVRQQLDALRLYKRQLTEQCTDQATQIQTLEVRSQELASQLQV 200
+ ++ + ++ T ++ L K L + + ++ S +++++
Sbjct: 226 RKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKT 285

Query: 201 EQANTAILEERKSQALAQITELEAFLEDNRNQLLSKEQSVCLLESMTEQLKASHQHLEN 259
+A A LE K+ Q L A + R L + ++ +QL+A HQ LE
Sbjct: 286 LEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAK-------KQLEAEHQKLEE 337


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS27720    ARGREPRESSOR    28    0.036    Bacterial arginine repressor signature.
		>ARGREPRESSOR#Bacterial arginine repressor signature.

Length = 149

Score = 27.9 bits (62), Expect = 0.036
Identities = 11/24 (45%), Positives = 18/24 (75%)

Query: 30 LLEKLKEDGFSVSLRTVQRDLDRL 53
L++ LK+DG++V+ TV RD+ L
Sbjct: 25 LVDILKKDGYNVTQATVSRDIKEL 48


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS27770    GPOSANCHOR    41    1e-05    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 41.2 bits (96), Expect = 1e-05
Identities = 44/332 (13%), Positives = 106/332 (31%), Gaps = 12/332 (3%)

Query: 333 GMSERLNQLFSSLNEQQGRQMEVAQQQSAAFETQLQRISGSAEERQAQMEQRFAELMSGL 392
+ N + ++ + + +++Q + + + +E +
Sbjct: 81 KALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTA-D 139

Query: 393 TNQLQTQLGTAQQRDEERQVLFERLLGQASSSQTAMLEQFSSSTREQMQAMAEAGNERHS 452
+ +++T + L + L G + S + + + +A E+
Sbjct: 140 SAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKAL 199

Query: 453 NLEKVFSRLMMNLNTQLDSQMGAAEQREQARQQRFQEQLDQVSTHQQELLSGLASAVQAT 512
FS L+++ A R+ ++ + ++ + ++ + A
Sbjct: 200 EGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALE 259

Query: 513 QQQSRLMADQH--QQLLGQLKQVSDATAQSSKHMDSSANQLGLLSANLRQAADSLGQRLE 570
+Q+ L +++ L S L SL + L+
Sbjct: 260 ARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLD 319

Query: 571 AVSHSIERAGQQNAELATQLQSQAATLAQLQATLLEGAQRFEQAAGEARNGFGEMKSAQQ 630
A + ++ ++ +L Q + A+ L+ L A R + EA E + ++
Sbjct: 320 ASREAKKQLEAEHQKLEEQNKISEASRQSLRRDL--DASREAKKQLEA-----EHQKLEE 372

Query: 631 EFLSS--VRHEFTALGEKLREQVEAVEKQAEE 660
+ S R + RE + VEK EE
Sbjct: 373 QNKISEASRQSLRRDLDASREAKKQVEKALEE 404


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS27775    OMPADOMAIN    38    2e-05    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 38.4 bits (89), Expect = 2e-05
Identities = 29/143 (20%), Positives = 57/143 (39%), Gaps = 33/143 (23%)

Query: 98 LSIPNDLLGFDTGAYDIHAAYQARALEIGKVINEVISREDRVEFLD-TIFIEGHTDNRPL 156
++ +D+L F+ + QA ++++ S+ ++ D ++ + G+TD
Sbjct: 215 FTLKSDVL-FNFNKATLKPEGQA-------ALDQLYSQLSNLDPKDGSVVVLGYTDRIGS 266

Query: 157 QGFMGKGNWGLSTFRAISLWQFWGSALSPDEQLARLKNKDGKPLFSVSGYGETRPVLVD- 215
+ N GLS RA S+ + S P +++ S G GE+ PV +
Sbjct: 267 DAY----NQGLSERRAQSVVDYLISKGIPADKI------------SARGMGESNPVTGNT 310

Query: 216 -------QQTEDDFKRNRRIDIR 231
D +RR++I
Sbjct: 311 CDNVKQRAALIDCLAPDRRVEIE 333


76    HWH78_RS27915    HWH78_RS28015    Y    Y    N    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
HWH78_RS27915011-3.096057hypothetical protein
HWH78_RS27925-19-1.344071magnesium transporter CorA family protein
HWH78_RS27930-113-2.336939hypothetical protein
HWH78_RS27935020-3.063608Hcp family type VI secretion system effector
HWH78_RS27940-114-2.134735type VI secretion system tip protein VgrG
HWH78_RS27945-115-2.196515DUF4123 domain-containing protein
HWH78_RS27950020-3.027137hypothetical protein
HWH78_RS27955-118-2.664044hypothetical protein
HWH78_RS27960-212-1.267520argininosuccinate lyase
HWH78_RS27965090.761474alginate biosynthesis two-component system
HWH78_RS279703140.697189alginate biosynthesis regulator AlgR
HWH78_RS279752131.114940hydroxymethylbilane synthase
HWH78_RS279803130.856854uroporphyrinogen-III synthase
HWH78_RS279854160.280278heme biosynthesis operon protein HemX
HWH78_RS279901021-0.035507heme biosynthesis protein HemY
HWH78_RS279951024-0.449135disulfide bond formation protein B
HWH78_RS28000823-0.786241sigma D regulator
HWH78_RS28005823-1.025743FKBP-type peptidyl-prolyl cis-trans isomerase
HWH78_RS28015619-0.771542alginate regulator AlgP
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS27970    PF06580    182    1e-56    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 182 bits (463), Expect = 1e-56
Identities = 76/308 (24%), Positives = 137/308 (44%), Gaps = 23/308 (7%)

Query: 64 LFVQWIVLLSAALFCRLRPLLARLPVALAGSACCLLVVALT------LGCTAVAEHYQLG 117
+F I L+ L R + R +L V + A ++L
Sbjct: 43 IFNIAISLMGLVLTHAYRSFIKRQGWLKLNMGQIILRVLPACVVIGMVWFVANTSIWRLL 102

Query: 118 GELTRAGE-------VNLYLRHALIALIMSALVLRYFYLQS-------QWRRQQQAELQA 163
+ +++ ++ + S L + + ++ QW+ A+ +A
Sbjct: 103 AFINTKPVAFTLPLALSIIFNVVVVTFMWSLLYFGWHFFKNYKQAEIDQWKMASMAQ-EA 161

Query: 164 RLESLQARIRPHFLFNSLNSIASLIELDPLKAEHAVLDLSDLFRASLAK-PGTLVSWEEE 222
+L +L+A+I PHF+FN+LN+I +LI DP KA + LS+L R SL VS +E
Sbjct: 162 QLMALKAQINPHFMFNALNNIRALILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADE 221

Query: 223 LALARRYLSIEQYRLGDRLQLDWQVHGVPANLPIPQLTLQPLLENALIYGIQPRVEGGLV 282
L + YL + + DRLQ + Q++ ++ +P + +Q L+EN + +GI +GG +
Sbjct: 222 LTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKI 281

Query: 283 QVEAVYREGVFQLCVSNPYDEALESPPSKGTRQALHNIDARLGALFGPKASLSVERRDGR 342
++ G L V N AL++ + T L N+ RL L+G +A + + + G+
Sbjct: 282 LLKGTKDNGTVTLEVENTGSLALKNTK-ESTGTGLQNVRERLQMLYGTEAQIKLSEKQGK 340

Query: 343 HYTCLRYP 350
+ P
Sbjct: 341 VNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS27975    HTHFIS    79    4e-19    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.5 bits (196), Expect = 4e-19
Identities = 31/136 (22%), Positives = 57/136 (41%), Gaps = 5/136 (3%)

Query: 3 VLIVDDEPLARERLARLVGQLDGYRVLEPSASNGEEALTLIDSLKPDIVLLDIRMPGLDG 62
+L+ DD+ R L + + + GY V SN I + D+V+ D+ MP +
Sbjct: 6 ILVADDDAAIRTVLNQALSRA-GYDVR--ITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 63 LQVAARLCEREAPPAVIFCTAHDEF--ALEAFQVSAVGYLVKPVRSEDLAEALKKASRPN 120
+ R+ + V+ +A + F A++A + A YL KP +L + +A
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 121 RVQLAALTKPPASGGS 136
+ + + L G
Sbjct: 123 KRRPSKLEDDSQDGMP 138


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS28010    INFPOTNTIATR    99    1e-27    Macrophage infectivity potentiator signature.
		>INFPOTNTIATR#Macrophage infectivity potentiator signature.

Length = 233

Score = 98.5 bits (245), Expect = 1e-27
Identities = 62/202 (30%), Positives = 98/202 (48%), Gaps = 16/202 (7%)

Query: 22 KDELAYAVGARLGMRLQQEMPGLELSELLLGLRQAYRGEALEIPPERIEQLLLQHE---- 77
KD+L+Y++GA LG + + + L G++ G L + E+++ +L + +
Sbjct: 31 KDKLSYSIGADLGKNFKNQGIDINPDVLAKGMQDGMSGAQLILTEEQMKDVLSKFQKDLM 90

Query: 78 -------NATTETPRTTPAEARFLANEKARFGVRELTGGVLVSELRRGQGNGIGAATQVH 130
N E + +A FL+ K++ G+ L G+ + G G G + V
Sbjct: 91 AKRSAEFNKKAEENKAK-GDA-FLSANKSKPGIVVLPSGLQYKIIDAGTGAKPGKSDTVT 148

Query: 131 VRYRGLLADGQVFDQSESA---EWFALDSVIEGWRTALRAMPVGACWRVVIPSAQAYGHE 187
V Y G L DG VFD +E A F + VI GW AL+ MP G+ W V +P+ AYG
Sbjct: 149 VEYTGTLIDGTVFDSTEKAGKPATFQVSQVIPGWTEALQLMPAGSTWEVFVPADLAYGPR 208

Query: 188 GAGDLIPPDAPLVFEIDLLGVR 209
G I P+ L+F+I L+ V+
Sbjct: 209 SVGGPIGPNETLIFKIHLISVK 230


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS28015    IGASERPTASE    62    1e-12    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 62.4 bits (151), Expect = 1e-12
Identities = 44/220 (20%), Positives = 67/220 (30%), Gaps = 7/220 (3%)

Query: 134 KAKPATKPAAKAAAKPAVKTVAAKPAAKPAAKPAAKPA-AKPAAKTAAAKPAAKPTAKPA 192
T P A P+V + + A+ P PA A P+ T +K +K
Sbjct: 993 DTTNITTPNNIQADVPSVPSNNEE-IARVDEAPVPPPAPATPSETTETVAENSKQESKTV 1051

Query: 193 AKPAAKPAAKTAAAKPAAKPAAKPVAKPAAKPAAKTAAAKPAAKPAAKPVAKPTAKPAAK 252
K TA + AK A V A + A + K K TA +
Sbjct: 1052 EKNEQDATETTAQNREVAKEAKSNV--KANTQTNEVAQSGSETKETQTTETKETATVEKE 1109

Query: 253 TAAAKPAAKPAAKPAAKPAAKPVAKSAAAKPAAKPAAKPAAKPAAKPAAKPVAAKPAATK 312
A K P P +P A+PA + K ++ T
Sbjct: 1110 EKAKVETEKTQEVPKVTSQVSP---KQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTA 1166

Query: 313 PATAPAAKPAATPSAPAAASSAASATPAAGSNGAAPTSAS 352
PA + ++ P S+ + + N T A+
Sbjct: 1167 DTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPAT 1206



Score = 45.1 bits (106), Expect = 3e-07
Identities = 40/246 (16%), Positives = 80/246 (32%), Gaps = 19/246 (7%)

Query: 45 EKQRGKAQEKLHKARTKLQDAAKAGKTKAQAK--ARETISDLEEALDTLKARQADTRTYI 102
E A+ +++T ++ A +T AQ + A+E S+++ T + Q
Sbjct: 1035 ETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQ------- 1087

Query: 103 VGLKRDVQESLKLAQGVGKVKEAAGKA-LESRKAKPATKPAAKAAAKPAVKTVAAKPAAK 161
+ +E+ E KA +E+ K + K ++ + K ++ +P A+
Sbjct: 1088 --SGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQE-QSETVQPQAE 1144

Query: 162 PAAKPA----AKPAAKPAAKTAAAKPAAKPTAKPAAKPAAKPAAKTAAAKPAAKPAAKPV 217
PA + K TA + AK T+ +P + P +
Sbjct: 1145 PARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENP--ENT 1202

Query: 218 AKPAAKPAAKTAAAKPAAKPAAKPVAKPTAKPAAKTAAAKPAAKPAAKPAAKPAAKPVAK 277
+P + ++ + V T ++ + A V
Sbjct: 1203 TPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLS 1262

Query: 278 SAAAKP 283
A AK
Sbjct: 1263 DARAKA 1268



Score = 33.9 bits (77), Expect = 0.001
Identities = 17/119 (14%), Positives = 29/119 (24%), Gaps = 1/119 (0%)

Query: 233 PAAKPAAKPVAKPTAKPAAKTAAAKPAAKPAAKPAAKPAAKPVAKSAAAKPAAKPAAKPA 292
P + + V A P+ + A+ PV A A P+ A
Sbjct: 983 PEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSET-TETVA 1041

Query: 293 AKPAAKPAAKPVAAKPAATKPATAPAAKPAATPSAPAAASSAASATPAAGSNGAAPTSA 351
+ + A A A + A + A + + T
Sbjct: 1042 ENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTET 1100


77  HWH78_RS28505  HWH78_RS28700  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
HWH78_RS28505217-1.544932integrase
HWH78_RS28510218-2.216900hypothetical protein
HWH78_RS28515014-2.585475hypothetical protein
HWH78_RS28520014-2.387330DNA cytosine methyltransferase
HWH78_RS28525116-1.804018hypothetical protein
HWH78_RS28530114-1.935879hypothetical protein
HWH78_RS28535015-1.782427hypothetical protein
HWH78_RS28540015-1.979144toprim domain-containing protein
HWH78_RS28545030-3.407115hypothetical protein
HWH78_RS28550341-6.943747hypothetical protein
HWH78_RS28555657-11.838092ogr/Delta-like zinc finger family protein
HWH78_RS28560567-14.136680hypothetical protein
HWH78_RS28565781-17.492605BcepMu gp16 family phage-associated protein
HWH78_RS28570557-12.994570helix-turn-helix domain-containing protein
HWH78_RS28575350-11.534721hypothetical protein
HWH78_RS28580227-6.167288hypothetical protein
HWH78_RS28585023-4.594397hypothetical protein
HWH78_RS28590016-2.006541hypothetical protein
HWH78_RS28595-115-0.129732phage late control D family protein
HWH78_RS28600-2160.158372phage tail protein
HWH78_RS28605-214-0.091143phage tail tape measure protein
HWH78_RS28610117-1.669422GpE family phage tail protein
HWH78_RS28615217-1.704635phage tail assembly protein
HWH78_RS28620117-1.567246phage major tail tube protein
HWH78_RS28625118-1.343235phage tail sheath protein
HWH78_RS28630222-1.847365tail fiber assembly protein
HWH78_RS28635119-1.562672phage tail protein
HWH78_RS28640019-0.803044phage tail protein I
HWH78_RS28645222-0.697904baseplate assembly protein
HWH78_RS28650324-0.416283GPW/gp25 family protein
HWH78_RS28655123-0.015642phage baseplate assembly protein V
HWH78_RS286602210.423918phage virion morphogenesis protein
HWH78_RS286652230.546607phage tail protein
HWH78_RS314052221.055639Rz1-like lysis system protein LysC
HWH78_RS314102190.788835hypothetical protein
HWH78_RS28675-217-0.608360N-acetylmuramidase family protein
HWH78_RS28680-117-1.774508phage holin, lambda family
HWH78_RS28685-114-2.330457tail protein X
HWH78_RS28690-111-2.696055head completion/stabilization protein
HWH78_RS28695-113-3.582742hypothetical protein
HWH78_RS28700-115-3.772765phage major capsid protein, P2 family
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS28605  GPOSANCHOR  33  0.006  Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 32.7 bits (74), Expect = 0.006
Identities = 18/118 (15%), Positives = 42/118 (35%), Gaps = 1/118 (0%)

Query: 12 QAIDRATAPIRAVTRSSTGMGRALKESRDQLKALQAQQKDIS-SLRTQREAVRQTSEKLA 70
+A I+ + + + Q + L A ++ + L REA +Q +
Sbjct: 274 NFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQ 333

Query: 71 GAQQRLRQYREQLQGMDAPSARFQKSFAAAAAQVDKLKAKHGEQRAELQRLVGQLGKA 128
+++ + Q + +++ A+ KL+ ++ A Q L L +
Sbjct: 334 KLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDAS 391


78  HWH78_RS28765  HWH78_RS28830  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
HWH78_RS28765-116-3.703323ABC transporter permease
HWH78_RS28770-124-5.737656ABC transporter substrate-binding protein
HWH78_RS28775035-7.247767ABC transporter ATP-binding protein
HWH78_RS28780144-8.493351DUF502 domain-containing protein
HWH78_RS28785044-9.314423SDR family oxidoreductase
HWH78_RS28795-154-11.406008*XRE family transcriptional regulator
HWH78_RS28800-152-11.486176hypothetical protein
HWH78_RS28805050-11.085525hypothetical protein
HWH78_RS28810151-10.433411hypothetical protein
HWH78_RS28815147-9.772111DotA/TraY family protein
HWH78_RS28820344-7.680824hypothetical protein
HWH78_RS28825235-5.838153hypothetical protein
HWH78_RS28830128-3.930568hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS28785  DHBDHDRGNASE  85  4e-22  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 85.5 bits (211), Expect = 4e-22
Identities = 57/259 (22%), Positives = 104/259 (40%), Gaps = 28/259 (10%)

Query: 3 ERKTLLLTGASRGIGHATVKYFNAAGWRVFTASRQSWASECPWADGEENHIH-----LDL 57
E K +TGA++GIG A + + G + E + + H D+
Sbjct: 7 EGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADV 66

Query: 58 EDIPGVEASLELIREKLGDGRLHALVDNAGISPKGEGGERLGVL-ETDYATWLRVFNVNL 116
D ++ I ++G + LV+ AG+ R G++ W F+VN
Sbjct: 67 RDSAAIDEITARIEREMG--PIDILVNVAGVL-------RPGLIHSLSDEEWEATFSVNS 117

Query: 117 FSTALLARGLFAELKAAQGTVINVTSIAGSKVHPFAGVAYATSKAALSALTREMAHDFGP 176
+R + + + I + V + AYA+SKAA T+ + +
Sbjct: 118 TGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAE 177

Query: 177 HGVRVNAIAPGEIDTSI-------------LSPGTEEIVERSIPLHRLGRPEEIASLIYF 223
+ +R N ++PG +T + + G+ E + IPL +L +P +IA + F
Sbjct: 178 YNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLF 237

Query: 224 LCTSGASYVNGAEIHVNGG 242
L + A ++ + V+GG
Sbjct: 238 LVSGQAGHITMHNLCVDGG 256


79  HWH78_RS29115  HWH78_RS29355  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
HWH78_RS29115234-3.044235sulfonamide-resistant dihydropteroate synthase
HWH78_RS29120236-3.690762phage integrase N-terminal SAM-like
HWH78_RS29130334-3.393590IS91 family transposase
HWH78_RS29135342-4.649342LysR family transcriptional regulator
HWH78_RS29140249-6.323052tetracycline efflux MFS transporter Tet(G)
HWH78_RS29145352-7.355710tetracycline resistance transcriptional
HWH78_RS29150253-8.057061chloramphenicol/florfenicol efflux MFS
HWH78_RS29155356-9.263091dihydropteroate synthase
HWH78_RS29160565-12.935539quaternary ammonium compound efflux SMR
HWH78_RS29165564-12.647423chloramphenicol efflux MFS transporter CmlA6
HWH78_RS29170764-13.021243ANT(3'')-Ia family aminoglycoside
HWH78_RS29175759-12.179457OXA-1 family oxacillin-hydrolyzing class D
HWH78_RS29180754-11.843599class 1 integron integrase IntI1
HWH78_RS29185748-10.897066mercury resistance transcriptional regulator
HWH78_RS29190424-3.150728mercuric transport protein MerT
HWH78_RS29195421-2.183664mercury resistance system periplasmic binding
HWH78_RS29205424-2.999826mercury(II) reductase
HWH78_RS29210525-2.959831mercury resistance co-regulator MerD
HWH78_RS29215326-2.632643broad-spectrum mercury transporter MerE
HWH78_RS29220323-2.128798DUF3330 domain-containing protein
HWH78_RS29225329-2.793420DDE-type integrase/transposase/recombinase
HWH78_RS29230230-3.020238TniB family NTP-binding protein
HWH78_RS29235233-4.491736TniQ family protein
HWH78_RS29240135-4.835856recombinase family protein
HWH78_RS29245340-6.208303aminoglycoside N-acetyltransferase AAC(3)-Id
HWH78_RS29250246-7.188638trimethoprim-resistant dihydrofolate reductase
HWH78_RS29255229-4.917603aminoglycoside N-acetyltransferase AAC(6')-Il
HWH78_RS29260118-3.525993class 1 integron integrase IntI1
HWH78_RS29265-210-1.614175urocanate hydratase
HWH78_RS29270-29-0.598275cytosine permease
HWH78_RS29275-19-0.137223histidine ammonia-lyase
HWH78_RS29280-2100.533976amino acid permease
HWH78_RS292852131.088699ABC transporter substrate-binding protein
HWH78_RS292903120.653016proline/glycine betaine ABC transporter
HWH78_RS292953121.403096glycine betaine/L-proline ABC transporter
HWH78_RS293002121.514219histidine ammonia-lyase
HWH78_RS29305-1111.872978imidazolonepropionase
HWH78_RS29310-271.695109N-formylglutamate deformylase
HWH78_RS29320-110-0.292589type VI secretion system tip protein VgrG
HWH78_RS29325117-2.492795type VI secrection system-dependent
HWH78_RS29330427-4.791874sel1 repeat family protein
HWH78_RS29335428-4.491236sel1 repeat family protein
HWH78_RS29340631-4.622688sel1 repeat family protein
HWH78_RS29345939-5.538671LysR family transcriptional regulator
HWH78_RS29350624-3.354800D-amino acid dehydrogenase
HWH78_RS29355415-0.972779RidA family protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS29145  TCRTETA  483  e-173  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 483 bits (1245), Expect = e-173
Identities = 236/383 (61%), Positives = 288/383 (75%)

Query: 3 SSAIIALLIVGLDAMGLGLIMPVLPTLLRELVPAEQVAGHYGALLSLYALMQVVFAPMLG 62
I+ L V LDA+G+GLIMPVLP LLR+LV + V HYG LL+LYALMQ AP+LG
Sbjct: 5 RPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLG 64

Query: 63 QLSDSYGRRPVLLASLAGAAVDYTIMASAPVLWVLYIGRLVSGVTGATGAVAASTIADST 122
LSD +GRRPVLL SLAGAAVDY IMA+AP LWVLYIGR+V+G+TGATGAVA + IAD T
Sbjct: 65 ALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADIT 124

Query: 123 GEGSRARWFGYMGACYGAGMIAGPALGGMLGGISAHAPFIAAALLNGFAFLLACIFLKET 182
RAR FG+M AC+G GM+AGP LGG++GG S HAPF AAA LNG FL C L E+
Sbjct: 125 DGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPES 184

Query: 183 HHSHGGTGKPVRIKPFVLLRLDDALRGLGALFAVFFIIQLIGQVPAALWVIYGEDRFQWN 242
H + + P R + + AL AVFFI+QL+GQVPAALWVI+GEDRF W+
Sbjct: 185 HKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWD 244

Query: 243 TATVGLSLAAFGATHAIFQAFVTGPLSSRLGERRTLLFGMAADATGFVLLAFATQGWMVF 302
T+G+SLAAFG H++ QA +TGP+++RLGERR L+ GM AD TG++LLAFAT+GWM F
Sbjct: 245 ATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAF 304

Query: 303 PILLLLAAGGVGMPALQAMLSNNVSSNKQGALQGTLTSLTNLSSIAGPLGFTALYSATAG 362
PI++LLA+GG+GMPALQAMLS V +QG LQG+L +LT+L+SI GPL FTA+Y+A+
Sbjct: 305 PIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASIT 364

Query: 363 AWNGWVWIVGAILYLICLPILRR 385
WNGW WI GA LYL+CLP LRR
Sbjct: 365 TWNGWAWIAGAALYLLCLPALRR 387


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS29150  TETREPRESSOR  312  e-111  Tetracycline repressor protein signature.
		>TETREPRESSOR#Tetracycline repressor protein signature.

Length = 218

Score = 312 bits (800), Expect = e-111
Identities = 102/205 (49%), Positives = 138/205 (67%), Gaps = 2/205 (0%)

Query: 1 MTKLDKGTVIAAALELLNEVGMDSLTTRKLAERLKVQQPALYWHFQNKRALLDALAEAML 60
M +L++ +VI AALELLNE G+D LTTRKLA++L ++QP LYWH +NKRALLDALA +L
Sbjct: 1 MARLNRESVIDAALELLNETGIDGLTTRKLAQKLGIEQPTLYWHVKNKRALLDALAVEIL 60

Query: 61 AERHTRSLPEENEDWRVFLKENALSFRTALLSYRDGARIHAGTRPTEPNFGTAETQIRFL 120
A H SLP E W+ FL+ NA+SFR ALL YRDGA++H GTRP E + T ETQ+RF+
Sbjct: 61 ARHHDYSLPAAGESWQSFLRNNAMSFRRALLRYRDGAKVHLGTRPDEKQYDTVETQLRFM 120

Query: 121 CAEGFCPKRAVWALRAVSHYVVGSVLEQQASDADERVPDRPDVSEQAPSSFLHDLFHELE 180
GF + ++A+ AVSH+ +G+VLEQQ A DRP ++ L + ++
Sbjct: 121 TENGFSLRDGLYAISAVSHFTLGAVLEQQEHTAAL--TDRPAAPDENLPPLLREALQIMD 178

Query: 181 TDGMDAAFNFGLDSLIAGFERLRSS 205
+D + AF GL+SLI GFE ++
Sbjct: 179 SDDGEQAFLHGLESLIRGFEVQLTA 203


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS29160  TCRTETB  63  5e-13  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 63.4 bits (154), Expect = 5e-13
Identities = 31/138 (22%), Positives = 58/138 (42%), Gaps = 2/138 (1%)

Query: 37 VPAMPGVLNTTPSIIQLTLSLYMVMLGVGQVIFGPLSDRVGRRPILLVGATAFVAASLGA 96
+P + N P+ + +M+ +G ++G LSD++G + +LL G S+
Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96

Query: 97 ACSSTALAFVAF-RLVQAVGASAMLVATFATVRDVYANRPEGAVIYGLFSSMLAFVPALG 155
+ + + R +Q GA A A V Y + +GL S++A +G
Sbjct: 97 FVGHSFFSLLIMARFIQGAGA-AAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVG 155

Query: 156 PIAGALIGEFWGWQAIFI 173
P G +I + W + +
Sbjct: 156 PAIGGMIAHYIHWSYLLL 173


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS29175  TCRTETB  58  2e-11  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 58.4 bits (141), Expect = 2e-11
Identities = 34/146 (23%), Positives = 69/146 (47%), Gaps = 2/146 (1%)

Query: 36 AVPFMPNALGTTASTIQLTLTTYLVMIGAGQLLFGPLSDRLGRRPVLLGGGLAYVVASM- 94
++P + N ++ T +++ G ++G LSD+LG + +LL G + S+
Sbjct: 36 SLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVI 95

Query: 95 GLALTSSAEVFLGLRILQACGASACLVSTFATVRDIYAGREESNVIYGILGSMLAIVPAV 154
G S + + R +Q GA+A + V Y +E +G++GS++A+ V
Sbjct: 96 GFVGHSFFSLLIMARFIQGAGAAA-FPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGV 154

Query: 155 GPLLGALVDMWLGWRAIFAFLGLGMI 180
GP +G ++ ++ W + + +I
Sbjct: 155 GPAIGGMIAHYIHWSYLLLIPMITII 180


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS29225  PYOCINKILLER  27  0.017  Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 27.5 bits (60), Expect = 0.017
Identities = 15/89 (16%), Positives = 32/89 (35%), Gaps = 15/89 (16%)

Query: 38 GYGLFDDAALQRLCFVRAAFEAGIGLDALARLCRALDAADCDETAAQLAVLSQFVER--- 94
Y F D ++ L AA+ + +A++ L ++ + + + A ++ E+
Sbjct: 172 AYMRFLDREMEGL---TAAYNVKLFTEAISSLQIRMNTLTAAKASIEAAAANKAREQAAA 228

Query: 95 ---------RREALADLEVQLAAMPTAPA 114
R+ A AMP +
Sbjct: 229 EAKRKAEEQARQQAAIRAANTYAMPANGS 257


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS29260  SACTRNSFRASE  30  0.002  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 30.3 bits (68), Expect = 0.002
Identities = 14/57 (24%), Positives = 23/57 (40%), Gaps = 2/57 (3%)

Query: 83 YIYDLAVAATRRREGIATALIKKLKAIGAARGAYVIYVQADKGVEDQPAIELYKKLG 139
I D+AVA R++G+ TAL+ K + ++ + A Y K
Sbjct: 91 LIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQD--INISACHFYAKHH 145


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS29270  SACTRNSFRASE  40  8e-07  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 39.9 bits (93), Expect = 8e-07
Identities = 18/77 (23%), Positives = 27/77 (35%), Gaps = 9/77 (11%)

Query: 73 YAEECYSGNVAFLEGW---------YVVPSARRQGVGVALVKAAEHWARGRGCTEFASDT 123
Y E G + W V R++GVG AL+ A WA+ +T
Sbjct: 71 YLENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLET 130

Query: 124 QLTNSASTSAHLAAGFT 140
Q N ++ + F
Sbjct: 131 QDINISACHFYAKHHFI 147


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS29325  UREASE  36  2e-04  Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 36.3 bits (84), Expect = 2e-04
Identities = 17/33 (51%), Positives = 21/33 (63%)

Query: 341 LAGVTLHAARALGLEASHGSLEVGKLADFVAWD 373
+A T++ A A GL GSLEVGK AD V W+
Sbjct: 406 IAKYTINPAIAHGLSHEIGSLEVGKRADLVLWN 438


80  HWH78_RS29460  HWH78_RS29485  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
HWH78_RS29460615-0.430127SCP2 domain-containing protein
HWH78_RS29465616-0.904531bifunctional demethylmenaquinone
HWH78_RS29470314-1.368735polyhydroxyalkanoic acid system family protein
HWH78_RS29475313-0.901158phasin family protein
HWH78_RS29480311-1.284554phasin family protein
HWH78_RS29485212-0.654777TetR/AcrR family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS29485  IGASERPTASE  45  3e-07  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 44.7 bits (105), Expect = 3e-07
Identities = 35/188 (18%), Positives = 61/188 (32%), Gaps = 8/188 (4%)

Query: 110 VPSRNEVKELHSK--VDTLTKQIEKLTGVSVKPAAKAAAKPAAKPAAKPAAKTAAAKPAA 167
VPS NE + V T +V +K +K K TA + A
Sbjct: 1010 VPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVA 1069

Query: 168 KPA-----AKAAAKPAAKPAAKKTAAKTAAAKPAAKPTAKAAAKPATKPAAKAAAKPAAK 222
K A A A+ ++ +T K A + AK T+ + ++
Sbjct: 1070 KEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKV-TSQ 1128

Query: 223 PAAKPAAATAAKPAAKPAAKPAAKKPAAKKPAAKPAAAKPAAPAASSSAPAAPAATPAAS 282
+ K + +P A+PA + + + A PA +S+ T + +
Sbjct: 1129 VSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTT 1188

Query: 283 APAANAPA 290
N+
Sbjct: 1189 VNTGNSVV 1196



Score = 42.7 bits (100), Expect = 1e-06
Identities = 38/197 (19%), Positives = 55/197 (27%), Gaps = 12/197 (6%)

Query: 100 RLNSAISRLGVPSRNEVKELHSKVDTLTKQIEKLTGVSV-KPAAKAAAKPAAKPAAKPAA 158
R + L P EV++ + VDT I + P+ + + A+ P
Sbjct: 972 RNVNGRYDLYNP---EVEKRNQTVDT--TNITTPNNIQADVPSVPSNNEEIARVDEAPVP 1026

Query: 159 KTAAAKPAAKPAAKAAAKPAAKPAAKKTAAKTAAAKPAAKPTAKAAAKPATKPAAKAAAK 218
A A P+ A +K + AK A K K +
Sbjct: 1027 PPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEA-KSNVKANTQTNEV 1085

Query: 219 PAAKPAAKPAAATAAKPAAKPAAKPAAKKPAAKKPAAKPAAAKPAAPAASSSAPAAPAAT 278
+ K T K A + AK K P +P S P A
Sbjct: 1086 AQSGSETKETQTTETKETATVEKEEKAKVETEKTQEV-PKVTSQVSPKQEQSETVQPQAE 1144

Query: 279 PAASAPAANAPATPSSQ 295
PA N P +
Sbjct: 1145 PA----RENDPTVNIKE 1157


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS29490  HTHTETR  60  2e-13  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 60.0 bits (145), Expect = 2e-13
Identities = 30/148 (20%), Positives = 57/148 (38%), Gaps = 8/148 (5%)

Query: 1 MKTRDRILECSLLLFNEQGEPNVSTLEIANELGISPGNLYYHFHGKEPLVMALFERFQAE 60
+TR IL+ +L LF++QG + S EIA G++ G +Y+HF K L ++E ++
Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESN 69

Query: 61 LAPLL-----DPPEEVRLGAEDYWLFLHLIVERLAHYRFLFQDL---SNLTGRLPRLARG 112
+ L P + + + + R L + + G + + +
Sbjct: 70 IGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQA 129

Query: 113 IRTWLGALKRTLATLLARLKADRQLRSD 140
R + L + L +D
Sbjct: 130 QRNLCLESYDRIEQTLKHCIEAKMLPAD 157


81  HWH78_RS29775  HWH78_RS29895  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
HWH78_RS29775-113-3.032821PIG-L family deacetylase
HWH78_RS29780-210-2.785006glycosyltransferase
HWH78_RS29785-217-4.458404alpha-1,3-rhamnosyltransferase
HWH78_RS29790-213-2.981541O-antigen ligase WaaL
HWH78_RS29795-212-2.620372hypothetical protein
HWH78_RS29800-211-1.945650lipid A export permease/ATP-binding protein
HWH78_RS29805-211-1.252159bifunctional
HWH78_RS29810012-1.089596asparagine synthase (glutamine-hydrolyzing)
HWH78_RS29815-2112.091637acyl-CoA dehydrogenase
HWH78_RS29820-192.270286acyl-CoA dehydrogenase family protein
HWH78_RS298250103.739445hypothetical protein
HWH78_RS29830-1113.327524aldo/keto reductase
HWH78_RS298350113.779180FAD-dependent oxidoreductase
HWH78_RS298400113.733339multidrug efflux SMR transporter
HWH78_RS298450113.844823LysR family transcriptional regulator
HWH78_RS298500113.571659lipid IV(A) 3-deoxy-D-manno-octulosonic acid
HWH78_RS29855-1113.191807cupin domain-containing protein
HWH78_RS29860-2123.661223FAD-dependent oxidoreductase
HWH78_RS29865-2132.803091ABC transporter substrate-binding protein
HWH78_RS29870-1122.870426TetR family transcriptional regulator C-terminal
HWH78_RS29875-2102.424408two-component system response regulator AruR
HWH78_RS29880-192.788647transporter substrate-binding domain-containing
HWH78_RS29885-183.179246response regulator
HWH78_RS29890-173.109733amino acid permease
HWH78_RS29895-173.400466enoyl-CoA hydratase/isomerase family protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS29775  FLGMRINGFLIF  29  0.039  Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 29.2 bits (65), Expect = 0.039
Identities = 12/30 (40%), Positives = 17/30 (56%)

Query: 9 LKRHRRNKRIGLLVALLALLAVGLLVSPWL 38
L R R N RI L+VA A +A+ + + W
Sbjct: 16 LNRLRANPRIPLIVAGSAAVAIVVAMVLWA 45


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS29800  ACRIFLAVINRP  31  0.012  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 31.3 bits (71), Expect = 0.012
Identities = 12/50 (24%), Positives = 23/50 (46%)

Query: 144 ITFNVTMVTGAATDAIKVVIREGLTVVFLFLYLLWMNWKLTLVMLAILPV 193
++ T + + + E + +VFL +YL N + TL+ +PV
Sbjct: 325 YPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPV 374


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS29805  LPSBIOSNTHSS  28  0.045  Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature.

Length = 166

Score = 28.2 bits (63), Expect = 0.045
Identities = 14/57 (24%), Positives = 26/57 (45%), Gaps = 7/57 (12%)

Query: 346 GCFDILHAGHVTYLEQARAQGDRLIVGVNDDASVTRLKGVGRPINSVDRRMAVLAGL 402
G FD + GH+ +E+ D++ V V + + +P+ SV R+ +A
Sbjct: 7 GSFDPITFGHLDIIERGCRLFDQVYVAVLRNPN-------KQPMFSVQERLEQIAKA 56


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS29870  HTHTETR  75  1e-18  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 74.7 bits (183), Expect = 1e-18
Identities = 34/184 (18%), Positives = 65/184 (35%), Gaps = 4/184 (2%)

Query: 6 RFSRLEPEQRKALLIEATLACLKRHGFQGASVRKICAEAGVSVGLINHHYDGKDALVAEA 65
R ++ E ++ + +++ L + G S+ +I AGV+ G I H+ K L +E
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 66 YLAVTGRVMRLLRGAIDTAPGGARPRLSAFFEASFSAELLDPQ---LLDAWLAFWGAVGS 122
+ + L PG L + + + + L++ VG
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 123 IEAIGRVHDHSYGEYRALLVGVLRQLAEEGGW-ADFDAELAAISLSALLDGLWLESGLNP 181
+ + + + E + L+ E AD AAI + + GL P
Sbjct: 123 MAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAP 182

Query: 182 ATFT 185
+F
Sbjct: 183 QSFD 186


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS29875  HTHFIS  92  1e-23  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 92.2 bits (229), Expect = 1e-23
Identities = 34/136 (25%), Positives = 60/136 (44%), Gaps = 6/136 (4%)

Query: 5 PRVLVVDDDPVIRELLQAYLGEEGYDVLCAGNAEQAEACLAECAHLGQPVELVLLDIRLP 64
+LV DDD IR +L L GYDV NA +A +LV+ D+ +P
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAA-----GDGDLVVTDVVMP 58

Query: 65 GKDGLTLTRELR-VRSEVGIILITGRNDEIDRIVGLECGADDYVIKPLNPRELVSRAKNL 123
++ L ++ R ++ +++++ +N + I E GA DY+ KP + EL+
Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 124 IRRVRHAQASAGPARQ 139
+ + + Q
Sbjct: 119 LAEPKRRPSKLEDDSQ 134


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS29885  HTHFIS  54  1e-09  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 54.1 bits (130), Expect = 1e-09
Identities = 29/157 (18%), Positives = 50/157 (31%), Gaps = 16/157 (10%)

Query: 500 SALEVLLVEDVALNREVAQGLLERDGHRVMLAEDAGPALALCRQRRFDLILLDMHLPGMA 559
+ +L+ +D A R V L R G+ V + +A DL++ D+ +P
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 560 GLELCAGIRRQLDGLNRATPIFAFTASIQPDMVRRYFAAGMQGVLGKPLRMDELRRALGE 619
+L I++ L P+ +A + G L KP + EL
Sbjct: 62 AFDLLPRIKKARPDL----PVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIG---- 113

Query: 620 VGTSVPALAVEAALDRQMLETHRRLLGRHKLAGLLGN 656
+ AL + L+G
Sbjct: 114 --------IIGRALAEPKRRPSKLEDDSQDGMPLVGR 142


82  HWH78_RS30245  HWH78_RS30365  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
HWH78_RS302452160.258180branched-chain amino acid ABC transporter
HWH78_RS302501131.216118high-affinity branched-chain amino acid ABC
HWH78_RS302550111.902387ATP-binding cassette domain-containing protein
HWH78_RS302600112.573922ABC transporter ATP-binding protein
HWH78_RS302650103.668751ornithine cyclodeaminase family protein
HWH78_RS30270-1103.114823SDR family oxidoreductase
HWH78_RS30275-193.606549GntR family transcriptional regulator
HWH78_RS30280-1103.819783PDR/VanB family oxidoreductase
HWH78_RS30285-1113.666377aromatic ring-hydroxylating dioxygenase subunit
HWH78_RS30290-1113.180705MFS transporter
HWH78_RS30295-1101.959355LysR family transcriptional regulator
HWH78_RS30300-181.884128benzoylformate decarboxylase
HWH78_RS30305-192.183311aromatic acid/H+ symport family MFS transporter
HWH78_RS30310-1102.284089aldehyde dehydrogenase family protein
HWH78_RS303150101.906331OprD family porin
HWH78_RS303200102.707362TonB-dependent receptor
HWH78_RS303252134.851840sigma-70 family RNA polymerase sigma factor
HWH78_RS303301135.237034FecR family protein
HWH78_RS303352145.161403HupE/UreJ family protein
HWH78_RS303401113.738403urease accessory protein UreG
HWH78_RS303452123.394513urease accessory protein UreF
HWH78_RS303502123.484440urease accessory protein UreE
HWH78_RS303552112.819533TetR family transcriptional regulator DesT
HWH78_RS303603112.838294ferredoxin reductase
HWH78_RS303652111.844904fatty acid desaturase DesB
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS30270  DHBDHDRGNASE  81  2e-20  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 80.9 bits (199), Expect = 2e-20
Identities = 61/244 (25%), Positives = 100/244 (40%), Gaps = 14/244 (5%)

Query: 6 FITGATSGFGEACARRFAEAGWSLVLTGRREERLQALAGELSAKTRVL-PLTLDVRDRAA 64
FITGA G GEA AR A G + E+L+ + L A+ R DVRD AA
Sbjct: 12 FITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSAA 71

Query: 65 MSAVVDNLPEEFATLRGLINNAGLALGTDPAQSCDLDDWDTMVDTNIKGLLYSTRLLLPR 124
+ + + E + L+N AG+ L S ++W+ N G+ ++R +
Sbjct: 72 IDEITARIEREMGPIDILVNVAGV-LRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSKY 130

Query: 125 LIAHGAGASIVNLGSVAGKWPYPGSHVYGGTKAFVEQFSLNLRCDLQGTGVRVTNLEPGL 184
++ +G SIV +GS P Y +KA F+ L +L +R + PG
Sbjct: 131 MMDRRSG-SIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGS 189

Query: 185 CESEFSLV----------RFGGDQARYDKTYAGAHPIQPEDIAETI-FWIMNQPAHLNIN 233
E++ G + +P DIA+ + F + Q H+ ++
Sbjct: 190 TETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHITMH 249

Query: 234 SLEI 237
+L +
Sbjct: 250 NLCV 253


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS30290  TCRTETA  50  9e-09  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 50.2 bits (120), Expect = 9e-09
Identities = 40/147 (27%), Positives = 57/147 (38%), Gaps = 8/147 (5%)

Query: 55 AEIGLLLSAGLFGMAAGSLFIAPWADRWGRRPLILACLALSGLGMLASALSQAAWQLALL 114
A G+LL+ A + + +DR+GRRP++L LA + + A + W L +
Sbjct: 43 AHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIG 102

Query: 115 R---GLTGLGIGGILASSNVIASEYASRRWRGLAVSLQSTGYALGATLGGLLAVWLIGAW 171
R G+TG A I R G + G G LGGL+ G +
Sbjct: 103 RIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLM-----GGF 157

Query: 172 GWRSVFVFGAGLTLAVIPLVCLCLPES 198
+ F A L C LPES
Sbjct: 158 SPHAPFFAAAALNGLNFLTGCFLLPES 184



Score = 36.7 bits (85), Expect = 1e-04
Identities = 34/146 (23%), Positives = 59/146 (40%), Gaps = 7/146 (4%)

Query: 51 NLGGAEIGLLLSA-GLFGMAAGSLFIAPWADRWGRRPLILACLALSGLGMLASALSQAAW 109
+ IG+ L+A G+ A ++ P A R G R ++ + G G + A + W
Sbjct: 242 HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGW 301

Query: 110 QLALLRGLTGLGIGGILASSNVIASEYASRRW---RGLAVSLQSTGYALGATLGGLLAVW 166
+ L G G+ A +++ + R +G +L S +G L +
Sbjct: 302 MAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAA 361

Query: 167 LIGAW-GWRSVFVFGAGLTLAVIPLV 191
I W GW ++ GA L L +P +
Sbjct: 362 SITTWNGW--AWIAGAALYLLCLPAL 385



Score = 34.4 bits (79), Expect = 8e-04
Identities = 41/189 (21%), Positives = 66/189 (34%), Gaps = 5/189 (2%)

Query: 253 RTTLLLWALFFLVMFGFYFIMSWTPKLLVAAGLSTAQGITGGTLLSIGGI---FGAALLG 309
R +++ + L G IM P LL S G LL++ + A +LG
Sbjct: 5 RPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLG 64

Query: 310 GLAARFRLERVLALFMLLTAALLALFSLSAGLPGAALPLGLLIGLCANACVAGLYALAPS 369
L+ RF VL + + A A+ + + L L +G ++ A A A
Sbjct: 65 ALSDRFGRRPVLLVSLAGAAVDYAIMATAPFL--WVLYIGRIVAGITGATGAVAGAYIAD 122

Query: 370 LYDASVRATGVGWGIGVGRGGAILSPLVAGLLLDDGWQPLSLYGAFAAVFVAAAAVLPLL 429
+ D RA G+ G + P++ GL+ A L
Sbjct: 123 ITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLP 182

Query: 430 GARRRERSP 438
+ + ER P
Sbjct: 183 ESHKGERRP 191


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS30305  TCRTETB  46  1e-07  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 46.4 bits (110), Expect = 1e-07
Identities = 33/187 (17%), Positives = 75/187 (40%), Gaps = 5/187 (2%)

Query: 16 NRTHWLILGWGCFIMLFDGYDMVIYGSVVPRLMQEWQLSPVQAGTLGSCALFGMLFGGTL 75
N H IL W C + F + ++ +P + ++ P + + + G +
Sbjct: 9 NLRHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAV 68

Query: 76 LAPLADRFGRRRLV---IATTLLASLAAFLTGHARDPLELGAGRFFTGLALGALVPSAIN 132
L+D+ G +RL+ I S+ F+ GH+ L L RF G A +
Sbjct: 69 YGKLSDQLGIKRLLLFGIIINCFGSVIGFV-GHSFFSL-LIMARFIQGAGAAAFPALVMV 126

Query: 133 LISEFAPAGRRSTLVTVMSAFYSVGAVLSALLAIAMIPAWGWQSVFYVAVLPVLAVPLML 192
+++ + P R ++ + ++G + + + W + + ++ ++ VP ++
Sbjct: 127 VVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLM 186

Query: 193 RWLPESA 199
+ L +
Sbjct: 187 KLLKKEV 193



Score = 32.2 bits (73), Expect = 0.005
Identities = 28/152 (18%), Positives = 54/152 (35%), Gaps = 13/152 (8%)

Query: 258 VAFAMCMLMSYG------LNTWLPKLMAGGGYALGSSLAFLVTLNVGATLGALFGGWLAD 311
+ +C+L + LN LP + S+ + ++G G L+D
Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74

Query: 312 RLGAGRTLVLFFAL--AAASLAALGLGPGPWLLNGLLVVA--GATTIGTLAVIHAYAAQF 367
+LG R L+ + + + +G L+ + A + V+ A++
Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVV---VARY 131

Query: 368 YPAWVRSTGVGWAAGVGRLGAIAGPMLGGSLL 399
P R G + +G GP +GG +
Sbjct: 132 IPKENRGKAFGLIGSIVAMGEGVGPAIGGMIA 163


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS30355  HTHTETR  64  6e-15  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 64.3 bits (156), Expect = 6e-15
Identities = 32/179 (17%), Positives = 57/179 (31%), Gaps = 10/179 (5%)

Query: 1 MSSPRAEQKQQTRHALMSAARHLMESGRGFGSLSLREVTRAAGIVPAGFYRHFSDMDQLG 60
M+ ++ Q+TR ++ A L +G S SL E+ +AAG+ Y HF D L
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQ-QGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLF 59

Query: 61 LALVAEVDETFRATLR--AVRRNEFELGGLIDASVRIF-LDAVGANRSQF---LFLAREQ 114
+ + + L L + + + R +F E
Sbjct: 60 SEIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEF 119

Query: 115 YGGSLPIRQAIASLRQRITDDLAADLALLNKMPHLDGAALDVFADLVVKTVFATLPELI 173
G ++QA +L D + L D+ + + L+
Sbjct: 120 VGEMAVVQQAQRNLCLESYDRIE---QTLKHCIEAKMLPADLMTRRAAIIMRGYISGLM 175


83  HWH78_RS30445  HWH78_RS30485  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
HWH78_RS304452111.234114oxaloacetate decarboxylase
HWH78_RS304501131.533771hypothetical protein
HWH78_RS304553140.723605DksA/TraR family C4-type zinc finger protein
HWH78_RS304602171.889120hypothetical protein
HWH78_RS304653213.038903urease subunit alpha
HWH78_RS304700183.570653urease subunit beta
HWH78_RS304750173.031977L-methionine sulfoximine N-acetyltransferase
HWH78_RS30480-1173.029683urease subunit gamma
HWH78_RS30485-1163.189138urease accessory protein UreD
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
HWH78_RS30465  UREASE  1096   0.0      Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 1096 bits (2837), Expect = 0.0
Identities = 422/567 (74%), Positives = 480/567 (84%), Gaps = 2/567 (0%)

Query: 2 KISRQAYADMFGPTVGDRVRLADTDLWLEVERDFTVYGEEVKFGGGKVIRDGMGQSQL-G 60
++SR AYA+MFGPTVGD+VRLADT+L++EVE+DFT +GEEVKFGGGKVIRDGMGQSQ+
Sbjct: 4 RMSRAAYANMFGPTVGDKVRLADTELFIEVEKDFTTHGEEVKFGGGKVIRDGMGQSQVTR 63

Query: 61 AAQVVDTVITNALILDHWGVVKADVGLKDGRIQAIGKAGNPDIQPGVNIAIGAGTEVIAG 120
VDTVITNALILDHWG+VKAD+GLKDGRI AIGKAGNPD+QPGV I +G GTEVIAG
Sbjct: 64 EGGAVDTVITNALILDHWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAG 123

Query: 121 EGMILTAGGIDTHIHFICPQQIEEALMSGVTTMIGGGTGPAAGTNATTCTSGPWHMARML 180
EG I+TAGG+D+HIHFICPQQIEEALMSG+T M+GGGTGPA GT ATTCT GPWH+ARM+
Sbjct: 124 EGKIVTAGGMDSHIHFICPQQIEEALMSGLTCMLGGGTGPAHGTLATTCTPGPWHIARMI 183

Query: 181 QAADAFPMNIGFTGKGNASLPLPLEEQVLAGAIGLKLHEDWGSTPAAIDNCLEVAERHDI 240
+AADAFPMN+ F GKGNASLP L E VL GA LKLHEDWG+TPAAID CL VA+ +D+
Sbjct: 184 EAADAFPMNLAFAGKGNASLPGALVEMVLGGATSLKLHEDWGTTPAAIDCCLSVADEYDV 243

Query: 241 QVAIHTDTLNESGFVETTLGAFKGRTIHTYHTEGAGGGHAPDIIKACGFANVLPSSTNPT 300
QV IHTDTLNESGFVE T+ A KGRTIH YHTEGAGGGHAPDII+ CG NV+PSSTNPT
Sbjct: 244 QVMIHTDTLNESGFVEDTIAAIKGRTIHAYHTEGAGGGHAPDIIRICGQPNVIPSSTNPT 303

Query: 301 RPFTRNTIDEHLDMLMVCHHLDPAIAEDVAFAESRIRRETIAAEDILHDLGAFSMISSDS 360
RP+T NT+ EHLDMLMVCHHL P I ED+AFAESRIR+ETIAAEDILHD+GAFS+ISSDS
Sbjct: 304 RPYTVNTLAEHLDMLMVCHHLSPTIPEDIAFAESRIRKETIAAEDILHDIGAFSIISSDS 363

Query: 361 QAMGRVGEVITRTWQTADKMKRQRGRLDGDGARNDNFRARRYIAKYTINPAITHGISHEV 420
QAMGRVGEV RTWQTADKMKRQRGRL + NDNFR +RYIAKYTINPAI HG+SHE+
Sbjct: 364 QAMGRVGEVAIRTWQTADKMKRQRGRLKEETGDNDNFRVKRYIAKYTINPAIAHGLSHEI 423

Query: 421 GSVEAGKWADLVLWRPAFFGVKPSLILKGGAIAASLMGDINGSIPTPQPVHYRPMFASYA 480
GS+E GK ADLVLW PAFFGVKP ++L GG IAA+ MGD N SIPTPQPVHYRPMF +Y
Sbjct: 424 GSLEVGKRADLVLWNPAFFGVKPDMVLLGGTIAAAPMGDPNASIPTPQPVHYRPMFGAYG 483

Query: 481 GSRHATSLTFVSQAAFAAGVPQQLGLRKAIGVVSGCR-GVQKSDLIHNGYLPTIEVDAQN 539
SR +S+TFVSQA+ AG+ +LG+ K + V R G+ K+ +IHN P IEVD +
Sbjct: 484 RSRTNSSVTFVSQASLDAGLAGRLGVAKELVAVQNTRGGIGKASMIHNSLTPHIEVDPET 543

Query: 540 YQVRADGQLLWCEPADVLPMAQRYFLF 566
Y+VRADG+LL CEPA VLPMAQRYFLF
Sbjct: 544 YEVRADGELLTCEPATVLPMAQRYFLF 570


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS30475  SACTRNSFRASE  42     3e-07    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 41.9 bits (98), Expect = 3e-07
Identities = 15/63 (23%), Positives = 26/63 (41%), Gaps = 1/63 (1%)

Query: 81 RGTVEHSVYVRDDQRGKGLGVQLLQALIERARAQGLHVMVAAIESGNAASIGLHRRLGFE 140
+E + V D R KG+G LL IE A+ ++ + N ++ + + F
Sbjct: 89 YALIED-IAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFI 147

Query: 141 ISG 143
I
Sbjct: 148 IGA 150


84  HWH78_RS30680  HWH78_RS30770  Y  N  N  Genomic Island
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS30680  2       9        2.182160   magnesium-translocating P-type ATPase
HWH78_RS30685  5       9        3.985200   hypothetical protein
HWH78_RS30690  4       9        3.238855   hypothetical protein
HWH78_RS30695  4       8        3.823036   Na/Pi cotransporter family protein
HWH78_RS30700  3       10       3.783598   DUF2493 domain-containing protein
HWH78_RS30705  2       9        4.312919   MATE family efflux transporter
HWH78_RS30710  0       9        3.858450   GtrA family protein
HWH78_RS30715  0       9        4.156673   glycosyltransferase family 2 protein
HWH78_RS30720  -1      8        4.003163   phospholipid carrier-dependent
HWH78_RS30725  -2      10       1.817327   YkgJ family cysteine cluster protein
HWH78_RS30730  -1      11       1.598563   DUF1835 domain-containing protein
HWH78_RS30735  0       12       0.517592   DMT family transporter
HWH78_RS30740  0       12       0.873871   NADPH-dependent 2,4-dienoyl-CoA reductase
HWH78_RS30745  0       12       0.806622   lipase LipC
HWH78_RS30750  -1      13       1.860551   formate dehydrogenase-N subunit alpha
HWH78_RS30755  0       13       3.305391   formate dehydrogenase subunit beta
HWH78_RS30760  -2      13       3.279015   formate dehydrogenase subunit gamma
HWH78_RS30765  -2      12       3.402830   formate dehydrogenase accessory protein FdhE
HWH78_RS30770  -1      12       3.250487   L-seryl-tRNA(Sec) selenium transferase
85  HWH78_RS30830  HWH78_RS30870  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS30830  0       12       3.085807   glycine cleavage system protein H
HWH78_RS30835  0       15       3.199792   DsrE family protein
HWH78_RS30840  0       15       2.743921   GNAT family N-acetyltransferase
HWH78_RS30845  1       15       3.429215   DUF4136 domain-containing protein
HWH78_RS30850  0       12       3.561375   hypothetical protein
HWH78_RS30855  1       14       3.908331   DUF4136 domain-containing protein
HWH78_RS30860  0       13       3.137511   methyltransferase domain-containing protein
HWH78_RS30865  -1      13       2.907668   nucleotide pyrophosphohydrolase
HWH78_RS30870  -1      12       3.264727   MaoC family dehydratase
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS30835  SYCECHAPRONE  26     0.030    Gram-negative bacterial type III secretion SycE cha...
		>SYCECHAPRONE#Gram-negative bacterial type III secretion SycE chaperone signature.

Length = 130

Score = 26.2 bits (57), Expect = 0.030
Identities = 11/44 (25%), Positives = 21/44 (47%)

Query: 68 EPLRHFIRQAREAGARLLMCRQPGVNIDESELIEELDEISSGGE 111
+ L+ + G +L RQP ++D + L +L+ + G E
Sbjct: 72 DILKPILSWDEVGGHPVLWNRQPLNSLDNNSLYTQLEMLVQGAE 115


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS30840  SACTRNSFRASE  29     0.005    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 29.1 bits (65), Expect = 0.005
Identities = 12/51 (23%), Positives = 20/51 (39%), Gaps = 1/51 (1%)

Query: 83 VAPAARGLGVARYLIGVMENLAREQYKARLMKISCFNANAAGLLLYTQLGY 133
VA R GV L+ A+E + LM + + N + Y + +
Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKENHFCGLM-LETQDINISACHFYAKHHF 146


86  HWH78_RS00485  HWH78_RS00525  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS00485  -2      12       0.655307   efflux pump transcriptional repressor NfxB
HWH78_RS00490  -2      11       0.613521   multidrug efflux RND transporter periplasmic
HWH78_RS00495  -1      9        0.233407   multidrug efflux RND transporter permease
HWH78_RS00500  -1      11       1.618517   multidrug efflux transporter outer membrane
HWH78_RS00505  0       9        0.856501   TetR/AcrR family transcriptional regulator EscR
HWH78_RS00510  -1      9        1.278258   hypothetical protein
HWH78_RS00515  -1      9        0.867360   energy-dependent translational throttle protein
HWH78_RS00520  -1      10       1.011650   ABC transporter ATP-binding protein
HWH78_RS00525  -1      10       0.653534   ABC transporter permease
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS00485  HTHTETR  38     8e-06    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 38.1 bits (88), Expect = 8e-06
Identities = 19/141 (13%), Positives = 52/141 (36%), Gaps = 7/141 (4%)

Query: 24 ATLKELAEAAGVSKATLHRFCGTRDNLVQMLEDHGETVLNQIIQACDLEHAEPLEALQRL 83
+L E+A+AAGV++ ++ + +L + + E+ + ++ + ++ R
Sbjct: 32 TSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQAKFPGDPLSVLRE 91

Query: 84 IKEHL-------THRELLVFLVFQYRPDFLDPHGEGARWQSYLEALDAFFLRGQQKGVFR 136
I H+ R LL+ ++F + ++ + + +
Sbjct: 92 ILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCLESYDRIEQTLKHCIEA 151

Query: 137 IDITAAVFTELFITLVYGMVD 157
+ A + T ++ G +
Sbjct: 152 KMLPADLMTRRAAIIMRGYIS 172


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
HWH78_RS00490  RTXTOXIND  43     1e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 42.9 bits (101), Expect = 1e-06
Identities = 22/104 (21%), Positives = 41/104 (39%), Gaps = 4/104 (3%)

Query: 99 LKAAVSRAEGELARNRAVLFEAQARVRRYEPLVKIQAVSQQDFDTATADLRSAEAATRSA 158
+ A EL ++ L + ++ + + + Q V+Q + LR
Sbjct: 257 QENKYVEAVNELRVYKSQLEQIESEILSAK--EEYQLVTQLFKNEILDKLRQTTDNIGLL 314

Query: 159 QADLETARLNLGYASVTAPISGRIGRALV-TEGALVGQGEATLM 201
+L + + AP+S ++ + V TEG +V E TLM
Sbjct: 315 TLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE-TLM 357



Score = 37.1 bits (86), Expect = 1e-04
Identities = 22/199 (11%), Positives = 67/199 (33%), Gaps = 28/199 (14%)

Query: 55 PGRIEPV-RVAEVRARVAGIVVRKRFEEGADVKAGDLLFQIDP-------APLKAAVSRA 106
G++ R E++ IV +EG V+ GD+L ++ ++++ +A
Sbjct: 87 NGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQA 146

Query: 107 EGELARNRAVLFEAQARVRRY--------------EPLVKIQAVSQQDFDTATADLRSAE 152
E R + + + E ++++ ++ ++ F T E
Sbjct: 147 RLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKE 206

Query: 153 AATRSAQADLETARLNLGYASVTAPISGR---IGRALVTEGALVGQGEATLMARIQQLDP 209
+A+ T + + + +L+ + A+ + ++ + +
Sbjct: 207 LNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAI---AKHAVLEQENKYVE 263

Query: 210 IYADFTQTAAEALRLRDAL 228
+ ++ ++ +
Sbjct: 264 AVNELRVYKSQLEQIESEI 282


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS00495  ACRIFLAVINRP  1167   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1167 bits (3020), Expect = 0.0
Identities = 516/1028 (50%), Positives = 714/1028 (69%), Gaps = 8/1028 (0%)

Query: 1 MSEFFIKRPNFAWVVALFISLAGLLVISKLPVAQYPNVAPPQITITATYPGASAKVLVDS 60
M+ FFI+RP FAWV+A+ + +AG L I +LPVAQYP +APP ++++A YPGA A+ + D+
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTSVLEESLNGAKGLLYFESTNNSNGTAEIVVTFEPGTDPDLAQVDVQNRLKKAEARMPQ 120
VT V+E+++NG L+Y ST++S G+ I +TF+ GTDPD+AQV VQN+L+ A +PQ
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 121 AVLTQGLQVEQTSAGFLLIYALSYKEGAQRSDTTALGDYAARNINNELRRLPGVGKLQFF 180
V QG+ VE++S+ +L++ D + DY A N+ + L RL GVG +Q F
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDD--ISDYVASNVKDTLSRLNGVGDVQLF 178

Query: 181 SSEAAMRVWIDPQKLVGFGLSIDDVSNAIRGQNVQVPAGAFGSAPGSSAQELTATLAVKG 240
++ AMR+W+D L + L+ DV N ++ QN Q+ AG G P Q+L A++ +
Sbjct: 179 GAQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQT 238

Query: 241 TLDDPQEFGQVVLRANEDGSLVRLADVARLELGKESYNISSRLNGTPTVGGAIQLSPGAN 300
+P+EFG+V LR N DGS+VRL DVAR+ELG E+YN+ +R+NG P G I+L+ GAN
Sbjct: 239 RFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGAN 298

Query: 301 AIQAATLVKQRLAELSAFFPEDMQYSVPYDTSRFVDVAIEKVIHTLIEAMVLVFLVMFLF 360
A+ A +K +LAEL FFP+ M+ PYDT+ FV ++I +V+ TL EA++LVFLVM+LF
Sbjct: 299 ALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLF 358

Query: 361 LQNVRYTLIPSIVVPVCLLGTLMVMYLLGFSVNMMTMFGMVLAIGILVDDAIVVVENVER 420
LQN+R TLIP+I VPV LLGT ++ G+S+N +TMFGMVLAIG+LVDDAIVVVENVER
Sbjct: 359 LQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVER 418

Query: 421 IMAEEGISPAEATVKAMKQVSGAIVGITLVLSAVFLPLAFMAGSVGVIYQQFSVSLAVSI 480
+M E+ + P EAT K+M Q+ GA+VGI +VLSAVF+P+AF GS G IY+QFS+++ ++
Sbjct: 419 VMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAM 478

Query: 481 LFSGFLALTFTPALCATLLKPIPEGHHE-KRGFFGAFNRGFARVTERYSLLNSKLVARAG 539
S +AL TPALCATLLKP+ HHE K GFFG FN F Y+ K++ G
Sbjct: 479 ALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTG 538

Query: 540 RFMLVYAGLVAMLGYFYLRLPEAFVPAEDLGYMVVDVQLPPGASRVRTDATGEE-LERFL 598
R++L+YA +VA + +LRLP +F+P ED G + +QLP GA++ RT ++ + +L
Sbjct: 539 RYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYL 598

Query: 599 KSREA-VASVFLISGFSFSGQGDNAALAFPTFKDWSER-GAEQSAAAEIAALNEHFALPD 656
K+ +A V SVF ++GFSFSGQ NA +AF + K W ER G E SA A I
Sbjct: 599 KNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIR 658

Query: 657 DGTVMAVSPPPINGLGNSGGFALRLMDRSGVGREALLQARDTLLGEIQTNPKFLYAMM-E 715
DG V+ + P I LG + GF L+D++G+G +AL QAR+ LLG +P L ++
Sbjct: 659 DGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPN 718

Query: 716 GLAEAPQLRLLIDREKARALGVSFETISGTLSAAFGSEVINDFTNAGRQQRVVIQAEQGN 775
GL + Q +L +D+EKA+ALGVS I+ T+S A G +NDF + GR +++ +QA+
Sbjct: 719 GLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKF 778

Query: 776 RMTPESVLELYVPNAAGNLVPLSAFVSVKWEEGPVQLVRYNGYPSIRIVGDAAPGFSTGE 835
RM PE V +LYV +A G +VP SAF + W G +L RYNG PS+ I G+AAPG S+G+
Sbjct: 779 RMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGD 838

Query: 836 AMAEMERLAAQLPAGIGYEWTGLSYQEKVSAGQATSLFALAILVVFLLLVALYESWSIPL 895
AMA ME LA++LPAGIGY+WTG+SYQE++S QA +L A++ +VVFL L ALYESWSIP+
Sbjct: 839 AMALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPV 898

Query: 896 SVMLIVPIGAIGAVLAVMVSGMSNDVYFKVGLITIIGLSAKNAILIVEFAKELWE-QGHS 954
SVML+VP+G +G +LA + NDVYF VGL+T IGLSAKNAILIVEFAK+L E +G
Sbjct: 899 SVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKG 958

Query: 955 LRDAAIEAARLRFRPIIMTSMAFILGVIPLALASGAGAASQRAIGTGVIGGMLSATFLGV 1014
+ +A + A R+R RPI+MTS+AFILGV+PLA+++GAG+ +Q A+G GV+GGM+SAT L +
Sbjct: 959 VVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAI 1018

Query: 1015 LFVPICFV 1022
FVP+ FV
Sbjct: 1019 FFVPVFFV 1026



Score = 96.1 bits (239), Expect = 4e-22
Identities = 92/506 (18%), Positives = 179/506 (35%), Gaps = 40/506 (7%)

Query: 541 FMLVYAGLVAMLGYF-YLRLPEAFVPAEDLGYMVVDVQLP-PGAS-RVRTDATGEELERF 597
F V A ++ M G L+LP A P + V V PGA + D + +E+
Sbjct: 11 FAWVLAIILMMAGALAILQLPVAQYP--TIAPPAVSVSANYPGADAQTVQDTVTQVIEQN 68

Query: 598 LKSREAVASVFLISGFSFSGQGDNAALAFPTFKDWSERGAEQSAAAEIAALNEHFALPDD 657
+ + ++ +S S S L F + D A+ ++ LP +
Sbjct: 69 MNG---IDNLMYMSSTSDSAGSVTITLTFQSGTD--PDIAQVQVQNKLQLATP--LLPQE 121

Query: 658 GTVMAVSPPPINGLGNSGGFALRLMDRSGVGREALLQARDTLLGEIQTNPKFLYAMMEGL 717
V I+ +S + + S D + +N K + + G+
Sbjct: 122 -----VQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDY----VASNVKDTLSRLNGV 172

Query: 718 AEAP------QLRLLIDREKARALGVSFETISGTLSAA---FGSEVINDFTNAGRQQRVV 768
+ +R+ +D + ++ + L + + QQ
Sbjct: 173 GDVQLFGAQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNA 232

Query: 769 IQAEQGNRMTPESVLELYVP-NAAGNLVPLSAFVSVKW-EEGPVQLVRYNGYPSIRIVGD 826
Q PE ++ + N+ G++V L V+ E + R NG P+ +
Sbjct: 233 SIIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIK 292

Query: 827 AAPGFSTGEA----MAEMERLAAQLPAGIGYEWTGLSYQEKVSAGQATSLFAL--AILVV 880
A G + + A++ L P G+ + V + L AI++V
Sbjct: 293 LATGANALDTAKAIKAKLAELQPFFPQGMKVLYP-YDTTPFVQLSIHEVVKTLFEAIMLV 351

Query: 881 FLLLVALYESWSIPLSVMLIVPIGAIGAVLAVMVSGMSNDVYFKVGLITIIGLSAKNAIL 940
FL++ ++ L + VP+ +G + G S + G++ IGL +AI+
Sbjct: 352 FLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIV 411

Query: 941 IVE-FAKELWEQGHSLRDAAIEAARLRFRPIIMTSMAFILGVIPLALASGAGAASQRAIG 999
+VE + + E ++A ++ ++ +M IP+A G+ A R
Sbjct: 412 VVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFS 471

Query: 1000 TGVIGGMLSATFLGVLFVPICFVWLL 1025
++ M + + ++ P LL
Sbjct: 472 ITIVSAMALSVLVALILTPALCATLL 497


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS00505  HTHTETR  35     8e-05    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 34.6 bits (79), Expect = 8e-05
Identities = 21/141 (14%), Positives = 50/141 (35%), Gaps = 8/141 (5%)

Query: 7 ATMGELAELAGVSRATLNRHCGTREGL-KRRLESHARSTLERLTHSAALQRLEPREALRE 65
++GE+A+ AGV+R + H + L E + E A +P LRE
Sbjct: 32 TSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQAKFPGDPLSVLRE 91

Query: 66 LIREHL-------AQRDLLALLMFEQNPGRQAGHGDASWQSYVEALDAFFLRGQQKRVFR 118
++ L +R L+ ++ + + + ++ + + +
Sbjct: 92 ILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCLESYDRIEQTLKHCIEA 151

Query: 119 IDISAATFSELFIVLIYGMVD 139
+ A + +++ G +
Sbjct: 152 KMLPADLMTRRAAIIMRGYIS 172


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS00525  PF05844  29     0.038    YopD protein
		>PF05844#YopD protein

Length = 295

Score = 28.8 bits (64), Expect = 0.038
Identities = 34/123 (27%), Positives = 51/123 (41%), Gaps = 21/123 (17%)

Query: 226 VSPGADVYSVGAALGAALTARLPGHEAQV---QVSQQVLDGLKRQTRTFTYLLAGLGIIS 282
++PGA SVG AA ++P A +QVLD R + L + + ++
Sbjct: 20 IAPGAAGRSVGTPQAAAELPQVPAARADRVELNAPRQVLDP-VRMEAAGSELDSSVELLL 78

Query: 283 LLGGGVGVMNVMLMSVAERRREIGVRMALGARQRDIRNLFLIEAVTLTAAGALSGAVLGV 342
+L +A++ RE+GV QRD N +I A SGA L +
Sbjct: 79 IL-----------FRIAQKARELGVL------QRDNENQAIIHAQKAQVDEMRSGATLMI 121

Query: 343 AAA 345
A A
Sbjct: 122 AMA 124


87  HWH78_RS00705  HWH78_RS00905  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS00705  2       11       -3.172694  FKBP-type peptidyl-prolyl cis-trans isomerase
HWH78_RS00710  2       11       -3.306705  4-hydroxy-3-methylbut-2-enyl diphosphate
HWH78_RS00715  2       12       -3.462812  type 4a pilus minor pilin PilE
HWH78_RS00720  1       13       -3.513585  type 4a fimbrial biogenesis protein PilY2
HWH78_RS00725  1       12       -3.379840  type 4a pilus biogenesis protein PilY1
HWH78_RS00730  -1      15       -1.999478  type 4a pilus minor pilin PilX
HWH78_RS00735  -1      11       -1.257107  type 4a pilus minor pilin PilW
HWH78_RS00740  0       11       -0.872632  type 4a pilus minor pilin PilV
HWH78_RS00745  -1      12       -0.526136  type 4 fimbrial biogenesis protein FimU
HWH78_RS00750  -1      11       -0.342777  Tfp pilus assembly protein FimT/FimU
HWH78_RS00755  -1      10       -0.175361  glycine oxidase ThiO
HWH78_RS00760  -2      13       0.137029   two-component system response regulator PilR
HWH78_RS00765  0       9        -0.507077  two-component system sensor histidine kinase
HWH78_RS00770  -1      11       -0.506289  hypothetical protein
HWH78_RS00775  -2      11       -0.597298  outer membrane protein assembly factor BamD
HWH78_RS00780  -2      12       -0.665051  23S rRNA pseudouridine(1911/1915/1917) synthase
HWH78_RS00785  -3      11       -0.576664  peptidoglycan editing factor PgeF
HWH78_RS00790  -3      12       -0.922425  ATP-dependent chaperone ClpB
HWH78_RS00810  0       16       -0.632719  ***two-partner secretion system exoprotein TpsA4
HWH78_RS00815  -1      13       -0.412693  ShlB/FhaC/HecB family hemolysin
HWH78_RS00820  -2      14       -0.408088  cell division protein ZapE
HWH78_RS00825  -3      12       0.157969   NAD(P)/FAD-dependent oxidoreductase
HWH78_RS00830  0       15       -0.191823  DUF3094 family protein
HWH78_RS00835  0       17       -0.432743  MOSC domain-containing protein
HWH78_RS00840  1       18       -0.890833  DUF1780 domain-containing protein
HWH78_RS00845  0       17       -0.743612  GNAT family N-acetyltransferase
HWH78_RS00850  2       15       -0.996997  hypothetical protein
HWH78_RS00855  0       11       -2.434149  energy-coupling factor ABC transporter permease
HWH78_RS00860  0       10       -3.567044  hypothetical protein
HWH78_RS00865  0       11       -4.293190  DNA gyrase inhibitor YacG
HWH78_RS00870  0       16       -4.955690  dephospho-CoA kinase
HWH78_RS00875  1       15       -4.713732  type IV prepilin peptidase/methyltransferase
HWH78_RS00880  0       15       -3.973024  type II secretion system F family protein
HWH78_RS00885  0       18       -3.720882  type IV-A pilus assembly ATPase PilB
HWH78_RS00890  -1      20       -2.574914  pilin
HWH78_RS00895  -2      14       -1.504977  O-antigen ligase family protein
HWH78_RS00905  -2      9        -1.196847  *carboxylating nicotinate-nucleotide
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS00705  INFPOTNTIATR  33     2e-04    Macrophage infectivity potentiator signature.
		>INFPOTNTIATR#Macrophage infectivity potentiator signature.

Length = 233

Score = 33.4 bits (76), Expect = 2e-04
Identities = 21/54 (38%), Positives = 33/54 (61%), Gaps = 5/54 (9%)

Query: 8 GEESRVTLHFALKLEDGNVVDSTFDK--QPASFKVGDGNLLPGFEQALFGLKAG 59
G+ VT+ + L DG V DST +K +PA+F+V ++PG+ +AL + AG
Sbjct: 142 GKSDTVTVEYTGTLIDGTVFDST-EKAGKPATFQV--SQVIPGWTEALQLMPAG 192


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS00710  PF06704  28     0.032    DspF/AvrF protein
		>PF06704#DspF/AvrF protein

Length = 129

Score = 27.5 bits (61), Expect = 0.032
Identities = 10/28 (35%), Positives = 18/28 (64%), Gaps = 2/28 (7%)

Query: 194 KNDIC--YATQNRQDAVKELADQCDMVL 219
+N +C Y +Q+ + AV E+ D +MV+
Sbjct: 26 QNGVCALYDSQDNEAAVIEMPDHSEMVI 53


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS00715  BCTERIALGSPG  42     1e-07    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 41.8 bits (98), Expect = 1e-07
Identities = 18/49 (36%), Positives = 31/49 (63%), Gaps = 3/49 (6%)

Query: 4 RQKGFTLLEMVVVVAVIGILLGIAIPSYQNYVIRSNRTEGQALLSDAAA 52
+Q+GFTLLE++VV+ +IG+L + +P N + + + Q +SD A
Sbjct: 6 KQRGFTLLEIMVVIVIIGVLASLVVP---NLMGNKEKADKQKAVSDIVA 51


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS00740  PilS_PF08805  30     0.003    PilS N terminal
		>PilS_PF08805#PilS N terminal

Length = 185

Score = 30.3 bits (68), Expect = 0.003
Identities = 11/58 (18%), Positives = 24/58 (41%)

Query: 3 LKSRHRSLHQSGFSMIEVLVALLLISIGVLGMIAMQGKTIQYTADSVERNKAAMLGSN 60
L +R + G +++EVL+ + +I + + S E+N + +N
Sbjct: 16 LSARRKKEQDKGATLMEVLLVVGVIVVLAASAYKLYSMVQSNIQSSNEQNNVLTVIAN 73


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS00745  BCTERIALGSPG  41     5e-07    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 40.6 bits (95), Expect = 5e-07
Identities = 14/45 (31%), Positives = 30/45 (66%)

Query: 8 TGFTLIELLIIVVLLAIMASFAIPNFKQLTERNELQSAAEELNAM 52
GFTL+E+++++V++ ++AS +PN E+ + Q A ++ A+
Sbjct: 8 RGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVAL 52


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS00750  BCTERIALGSPG  33     2e-04    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 32.9 bits (75), Expect = 2e-04
Identities = 12/47 (25%), Positives = 26/47 (55%)

Query: 4 RSQRALTLTELLFALVLLGILGSLALPGMAAWLDGNRQRSVLHELSA 50
QR TL E++ +V++G+L SL +P + + ++ + ++ A
Sbjct: 5 DKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVA 51


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
HWH78_RS00760  HTHFIS  525    0.0      FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 525 bits (1355), Expect = 0.0
Identities = 176/477 (36%), Positives = 261/477 (54%), Gaps = 33/477 (6%)

Query: 1 MSRQKALIVDDEPDIRELLEITLGRMKLDTRSARNVKEARELLAREPFDLCLTDMRLPDG 60
M+ L+ DD+ IR +L L R D R N +A DL +TD+ +PD
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 SGLDLVQYIQQRHPQTPVAMITAYGSLDTAIQALKAGAFDFLTKPVDLGRLRELVATALR 120
+ DL+ I++ P PV +++A + TAI+A + GA+D+L KP DL L ++ AL
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 121 LRNPEAEEAPVDNR----LLGESPPMRALRNQIGKLARSQAPVYISGESGSGKELVARLI 176
+ D++ L+G S M+ + + +L ++ + I+GESG+GKELVAR +
Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARAL 180

Query: 177 HEQGPRIERPFVPVNCGAIPSELMESEFFGHKKGSFTGAIEDKQGLFQAASGGTLFLDEV 236
H+ G R PFV +N AIP +L+ESE FGH+KG+FTGA G F+ A GGTLFLDE+
Sbjct: 181 HDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEI 240

Query: 237 ADLPMAMQVKLLRAIQEKAVRAVGGQQEVAVDVRILCATHKDLAAEVGAGRFRQDLYYRL 296
D+PM Q +LLR +Q+ VGG+ + DVRI+ AT+KDL + G FR+DLYYRL
Sbjct: 241 GDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRL 300

Query: 297 NVIELRVPPLRERREDIPLLAERILKRLAGDTGLPAARLTGDAQEKLKNYRFPGNVRELE 356
NV+ LR+PPLR+R EDIP L +++ + GL R +A E +K + +PGNVRELE
Sbjct: 301 NVVPLRLPPLRDRAEDIPDLVRHFVQQAEKE-GLDVKRFDQEALELMKAHPWPGNVRELE 359

Query: 357 NMLERAYTLCEDDQIQPHDLRL---------ADAPGASQEGAASLSEI------------ 395
N++ R L D I + A++ G+ S+S+
Sbjct: 360 NLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASF 419

Query: 396 -------DNLEDYLEDIERKLIMQALEETRWNRTAAAQRLGLTFRSMRYRLKKLGID 445
+ L ++E LI+ AL TR N+ AA LGL ++R ++++LG+
Sbjct: 420 GDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
HWH78_RS00770  60KDINNERMP  25     0.027    60kDa inner membrane protein signature.
		>60KDINNERMP#60kDa inner membrane protein signature.

Length = 548

Score = 25.3 bits (55), Expect = 0.027
Identities = 14/43 (32%), Positives = 23/43 (53%), Gaps = 3/43 (6%)

Query: 1 MGLFRLLFWIALIAIAFWLWRRFTR---PTPRQQQRPQDEPSA 40
M R L IAL+ ++F +W+ + + P P+ QQ Q +A
Sbjct: 1 MDSQRNLLVIALLFVSFMIWQAWEQDKNPQPQAQQTTQTTTTA 43


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
HWH78_RS00790  HTHFIS  43     4e-06    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 43.3 bits (102), Expect = 4e-06
Identities = 50/266 (18%), Positives = 94/266 (35%), Gaps = 45/266 (16%)

Query: 551 MLEGEREKLLRMEQELHRRVIGQDEAVVAVSNAVRRSRAGLADPNRPSGSFLFLGPTGVG 610
+ KL Q+ ++G+ A+ + + R L + + G +G G
Sbjct: 121 EPKRRPSKLEDDSQDGMP-LVGRSAAMQEIYRVLAR----LMQTDLT---LMITGESGTG 172

Query: 611 KTELCKALAEFLFDTEEALVRIDMSEFMEKHSVARLIGAPPGYVGFEEGGYLTEAIRRKP 670
K + +AL ++ V I+M+ + L G E G T A R
Sbjct: 173 KELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGH--------EKGAFTGAQTRST 224

Query: 671 YSV-------VLLDEVEKAHPDVFNILLQVLEDG---RLTDSHGRTVDFRNTVVVMTSNL 720
+ LDE+ D LL+VL+ G + D R +V +N
Sbjct: 225 GRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVR---IVAATN- 280

Query: 721 GSAQIQELAGDREAQRAAVMDAVNAHFRPEFINRIDEVVVFEPLAREQIAGIAEIQLGRL 780
++L + ++ FR + R++ V + P R++ I ++ +
Sbjct: 281 -----KDL-------KQSINQ---GLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFV 325

Query: 781 RKRLAERELSLELSQEALDKLIAVGF 806
++ E QEAL+ + A +
Sbjct: 326 QQAEKEGLDVKRFDQEALELMKAHPW 351



Score = 34.4 bits (79), Expect = 0.002
Identities = 25/177 (14%), Positives = 59/177 (33%), Gaps = 32/177 (18%)

Query: 49 LLMQVGFDIAALRSGLNKELDALPKIQSPTGDVNLSQDLARLLNQADRLAQQKGDQFISS 108
L + G+D+ + I + GD+ ++ D+ + + +
Sbjct: 22 ALSRAGYDVRITSNAA----TLWRWIAAGDGDLVVT-DV--------VMPDENAFDLLPR 68

Query: 109 ELVLLAAMDENTRLGKLLLGQGVSRKALENAVANLRGGEA-------VNDPNVEESRQAL 161
+ + + ++ Q A+ G + +AL
Sbjct: 69 ----IKKARPDLPV-LVMSAQN----TFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119

Query: 162 DKYTVDMTKRAEEG-KLDPVIGRDDEIRRTIQVLQRRTKNN-PVLI-GEPGVGKTAI 215
+ +K ++ P++GR ++ +VL R + + ++I GE G GK +
Sbjct: 120 AEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELV 176


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS00810  PF05860  65     1e-14    haemagglutination activity domain.
		>PF05860#haemagglutination activity domain.

Length = 117

Score = 65.2 bits (159), Expect = 1e-14
Identities = 31/93 (33%), Positives = 52/93 (55%), Gaps = 5/93 (5%)

Query: 71 TDGRHMVID---QQSHKLITNWNEFSVRADERVSFHQPGQDAVALNRVIGRNGSDIQGRI 127
T+G +I+ Q L ++ EFSV F+ P ++RV G + S+I G I
Sbjct: 17 TEGNTRIIERGTQAGSNLFHSFQEFSVPTSGTAFFNNPTNIQNIISRVTGGSVSNIDGLI 76

Query: 128 DANGK--VFLVNPNGVVFGKSAQVNVGGLVAST 158
AN +FL+NPNG++FG++A++++GG +
Sbjct: 77 RANATANLFLINPNGIIFGQNARLDIGGSFVGS 109


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS00815  PF00577  37     2e-04    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 37.1 bits (86), Expect = 2e-04
Identities = 32/226 (14%), Positives = 69/226 (30%), Gaps = 19/226 (8%)

Query: 256 QRYYRAAYQLPLGSRGTRIGLAHAETTYRLVRDFSRLDAHGRAITDSLFVSQPLLRSRSL 315
Y+ A G I + A+ + L V+Q L R+ +L
Sbjct: 484 SGYFNFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTL 543

Query: 316 SLS-TQLQYENKRLRDDQERTG-RHSRKEIRLWTASISGNAQDRLFGGGQS-----GFSL 368
LS + Y D+Q + G + ++I ++S + + G+ ++
Sbjct: 544 YLSGSHQTYWGTSNVDEQFQAGLNTAFEDINW---TLSYSLTKNAWQKGRDQMLALNVNI 600

Query: 369 AYAHGQLAIDSGEERLLDRYTIGTAGSFDKIMLNAVRLQHLGDRLQLFAQLNAQWSGGNL 428
++H + + R + ++ A L + L + ++GG
Sbjct: 601 PFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGD 660

Query: 429 DSAEQFDMG-----GPYGVRAFPLGSYKGYGDEGWQASAELRYSLA 469
++ G YG + D+ Q + +
Sbjct: 661 GNSGSTGYATLNYRGGYGN----ANIGYSHSDDIKQLYYGVSGGVL 702


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS00835  2FE2SRDCTASE  27     0.041    Ferric iron reductase signature.
		>2FE2SRDCTASE#Ferric iron reductase signature.

Length = 262

Score = 26.5 bits (58), Expect = 0.041
Identities = 10/24 (41%), Positives = 12/24 (50%)

Query: 36 DHPHPPRQVTLVQWEHIEALGTLL 59
D P P +TL QW L +LL
Sbjct: 47 DEPAPLNAMTLAQWSSPNVLSSLL 70


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS00845  SACTRNSFRASE  28     0.013    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 27.6 bits (61), Expect = 0.013
Identities = 5/25 (20%), Positives = 10/25 (40%)

Query: 66 RRGYLQHLVVDPGYRGLGLARRMLD 90
++ + V YR G+ +L
Sbjct: 88 GYALIEDIAVAKDYRKKGVGTALLH 112


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS00870  DHBDHDRGNASE  30     0.005    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 30.0 bits (67), Expect = 0.005
Identities = 23/88 (26%), Positives = 32/88 (36%), Gaps = 11/88 (12%)

Query: 5 WILGLTGGIGSGKSAAAEHFISLGVHLVDADHAARW--VVEPGRPALAKIVERFGDGILL 62
+I G GIG A A S G H+ D+ V A A+ E F
Sbjct: 12 FITGAAQGIGE---AVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAF------ 62

Query: 63 PDGQLDRAALRERIFQAPEERRWLEQLL 90
P D AA+ E + E ++ L+
Sbjct: 63 PADVRDSAAIDEITARIEREMGPIDILV 90


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS00875  PREPILNPTASE  352    e-125    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 352 bits (906), Expect = e-125
Identities = 164/283 (57%), Positives = 194/283 (68%), Gaps = 1/283 (0%)

Query: 3 LLDYLASHPLAFVLCAILLGLLVGSFLNVVVHRLPKMMERNWKAEAREALGLEPE-PKQA 61
LL+ P + L L++GSFLNVV+HRLP M+ER W+AE R + E +
Sbjct: 4 LLELAHGLPWLYFSLVFLFSLMIGSFLNVVIHRLPIMLEREWQAEYRSYFNPDDEGVDEP 63

Query: 62 TYNLVLPNSACPRCGHEIRPWENIPLVSYLALGGKCSSCKAAIGKRYPLVELATALLSGY 121
YNL++P S CP C H I ENIPL+S+L L G+C C+A I RYPLVEL TALLS
Sbjct: 64 PYNLMVPRSCCPHCNHPITALENIPLLSWLWLRGRCRGCQAPISARYPLVELLTALLSVA 123

Query: 122 VAWHFGFTWQAGAMLLLTWGLLAMSLIDADHQLLPDVLVLPLLWLGLIANHFGLFASLDD 181
VA W A LLLTW L+A++ ID D LLPD L LPLLW GL+ N G F SL D
Sbjct: 124 VAMTLAPGWGTLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGD 183

Query: 182 ALFGTVFGYLSLWSVFWLFKLVTGKEGMGYGDFKLLAMLGAWGGWQILPLTILLSSLVGA 241
A+ G + GYL LWS++W FKL+TGKEGMGYGDFKLLA LGAW GWQ LP+ +LLSSLVGA
Sbjct: 184 AVIGAMAGYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGA 243

Query: 242 ILGVIMLRLRNAESGTPIPFGPYLAIAGWIALLWGDQITRTYL 284
+G+ ++ LRN PIPFGPYLAIAGWIALLWGD ITR YL
Sbjct: 244 FMGIGLILLRNHHQSKPIPFGPYLAIAGWIALLWGDSITRWYL 286


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS00880  BCTERIALGSPF  456    e-162    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 456 bits (1174), Expect = e-162
Identities = 127/406 (31%), Positives = 226/406 (55%), Gaps = 14/406 (3%)

Query: 11 FVWEGTDKKGTKVKGELSSQNPTLVKAQLRKQGITPVKVR-------KKGISLLGA--GK 61
+ ++ D +G K +G + + + LR++G+ P+ V K G + L
Sbjct: 4 YHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLRRKI 63

Query: 62 KIKPMDIALFTRQMSTMMAAGVPLLQSFDIISEGFDNPNMRKLVEEIKQEVAGGNSLANS 121
++ D+AL TRQ++T++AA +PL ++ D +++ + P++ +L+ ++ +V G+SLA++
Sbjct: 64 RLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLADA 123

Query: 122 LRKKPQYFDSLYCNLVDAGEQSGALETLLDRVATYKEKTEALKAKIKKAMTYPIAVIVVA 181
++ P F+ LYC +V AGE SG L+ +L+R+A Y E+ + ++++I++AM YP + VVA
Sbjct: 124 MKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLTVVA 183

Query: 182 IIVSAILLIKVVPQFQSVFEGFGAELPAFTQMVINISNVLQEW--WLLVLLMMGGAGFLL 239
I V +ILL VVP+ F LP T++++ +S+ ++ + W+L+ L+ G F +
Sbjct: 184 IAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMAFRV 243

Query: 240 NHAYKRSEKFRDATDRTVLKLPIVGAILYKSAVARYARTLSTTFAAGVPLVEALDSVSGA 299
R EK R + R +L LP++G I ARYARTLS A+ VPL++A+
Sbjct: 244 ---MLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300

Query: 300 TGNVVFRDAVGKIKQDVSTGMQLNFSMRTTNIFPSMAIQMTAIGEESGALDDMLAKVAGF 359
N R + V G+ L+ ++ T +FP M M A GE SG LD ML + A
Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360

Query: 360 YEQEVDNAVDNLTALMEPMIMAVLGVLVGGLIIAMYLPIFQLGNVV 405
++E + + L EP+++ + +V +++A+ PI QL ++
Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406
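
Each alignment in this report opens with an "Identities / Positives / Gaps" line whose percentages are the counts divided by the alignment length. The values shown here look truncated rather than rounded (226/406 is about 55.7% but is printed as 55%). The Python sketch below parses such a line and recomputes the percentages; the regular expression and the helper names are illustrative assumptions, not part of PredictBias or BLAST.

```python
import re

# Matches BLAST summary lines such as
# "Identities = 127/406 (31%), Positives = 226/406 (55%), Gaps = 14/406 (3%)".
# The Gaps field is optional: ungapped alignments in this report omit it.
STAT_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_alignment_stats(line):
    """Return {field: (count, length, reported_percent)} for one summary line."""
    return {
        name: (int(count), int(length), int(percent))
        for name, count, length, percent in STAT_RE.findall(line)
    }


def truncated_percent(count, length):
    """Integer percentage, truncated toward zero (hypothetical helper)."""
    return 100 * count // length


line = "Identities = 127/406 (31%), Positives = 226/406 (55%), Gaps = 14/406 (3%)"
stats = parse_alignment_stats(line)
for name, (count, length, reported) in stats.items():
    # Print the reported percentage next to the recomputed one.
    print(name, count, length, reported, truncated_percent(count, length))
```

The same function can be applied to any other summary line in the report; a line without a Gaps field simply yields a dictionary with two entries.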


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS00890  BCTERIALGSPG  55     3e-12    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 54.5 bits (131), Expect = 3e-12
Identities = 21/54 (38%), Positives = 35/54 (64%)

Query: 1 MKAQKGFTLIELMIVVAIIGILAAIAIPQYQDYTARTQVTRAVSEISALKTAAE 54
Q+GFTL+E+M+V+ IIG+LA++ +P + +AVS+I AL+ A +
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALD 57


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS00905  RTXTOXIND     30     0.013    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 29.8 bits (67), Expect = 0.013
Identities = 23/141 (16%), Positives = 45/141 (31%), Gaps = 4/141 (2%)

Query: 75 QVEDGQRVEPNQMLFQLKGP-ARALLTGERSALNFLQLLSGTATRSQHYADLVAGTAVKL 133
V++G+ V +L +L A A +S+L +L TR Q + + +
Sbjct: 111 IVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARL---EQTRYQILSRSIELNKLPE 167

Query: 134 LDTRKTLPGLRLAQKYAVTCGGCHNHRIGLYDAFLIKENHIAACGGIDRAIAQARRIAPG 193
L ++++ + + + ++ +R AR
Sbjct: 168 LKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYE 227

Query: 194 KPVEVEVENLDELRQALEAGA 214
VE LD+ L A
Sbjct: 228 NLSRVEKSRLDDFSSLLHKQA 248
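
The input parameters at the top of this report give an RPS-BLAST E-value cutoff of 0.05, and every hit listed has an Expect value at or below it. Very small values are printed without a mantissa (for example "e-162"), which Python's float() cannot parse directly. The sketch below converts Expect strings and compares them with the cutoff. Treating the missing mantissa as 1 and using "less than or equal" as the comparison are assumptions made for illustration; the function names are hypothetical.

```python
E_VALUE_CUTOFF = 0.05  # "E-value (RPSBlast)" from the report's input parameters


def parse_expect(text):
    """Convert a BLAST Expect string to a float.

    Strings such as "e-162" carry only the exponent, so a leading "1" is
    restored before conversion (assumption: order of magnitude only).
    """
    text = text.strip()
    if text.startswith("e"):
        text = "1" + text
    return float(text)


def passes_cutoff(expect_text, cutoff=E_VALUE_CUTOFF):
    # Assumption: a hit is reported when its E-value does not exceed the cutoff.
    return parse_expect(expect_text) <= cutoff


for expect in ["e-162", "3e-12", "0.013"]:
    print(expect, parse_expect(expect), passes_cutoff(expect))
```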


88  HWH78_RS01295  HWH78_RS01325  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS01295  0       10       -1.507706  protease AlgW
HWH78_RS01300  0       10       -1.769115  Nif3-like dinuclear metal center hexameric
HWH78_RS01305  -2      10       -1.307388  lytic murein transglycosylase B
HWH78_RS01310  -1      11       -1.561628  sulfate adenylyltransferase subunit CysD
HWH78_RS01315  -2      12       -1.002484  sulfate adenylyltransferase subunit CysN
HWH78_RS01320  0       11       -0.777441  YhcB family protein
HWH78_RS01325  -1      8        -1.429883  alpha/beta hydrolase
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS01295  V8PROTEASE    61     2e-12    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 60.8 bits (147), Expect = 2e-12
Identities = 33/163 (20%), Positives = 52/163 (31%), Gaps = 35/163 (21%)

Query: 118 LLTNNHVTAGADQIIVALR------------DGRETIAQLVGSDPETDLAVLKIDL---- 161
LLTN HV AL+ +G T Q+ E DLA++K
Sbjct: 114 LLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGEGDLAIVKFSPNEQN 173

Query: 162 ----KNLPAMTLGRSDGIRTGDVCLAIGNPFGVGQTVTMGIISATGRNQLGLNTYEDFIQ 217
+ + T+ + + G P TM + G+ L +Q
Sbjct: 174 KHIGEVVKPATMSNNAETQVNQNITVTGYPGD-KPVATMW--ESKGK-ITYLKGE--AMQ 227

Query: 218 TDAAINPGNSGGALVDAAGNLIGINTAIFSKSGGSQGIGFAIP 260
D + GNSG + + +IGI+ G+
Sbjct: 228 YDLSTTGGNSGSPVFNEKNEVIGIHWG---------GVPNEFN 261


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS01310  TCRTETOQM     28     0.046    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 28.3 bits (63), Expect = 0.046
Identities = 17/90 (18%), Positives = 33/90 (36%), Gaps = 14/90 (15%)

Query: 94 GVAQG-INPFTHGSAKHTDVMKTEGLKQALDKYGFDAAFGGARRDEEKSRAKERVYSFRD 152
+ P HGSAK G+ ++ F + + +++ F
Sbjct: 207 RFHNCSLFPVYHGSAK-----NNIGIDNLIE--VITNKFYSS---THRGQSELCGKVF-- 254

Query: 153 SKHRWDPKNQRPELWNIYNGKVKKGESIRV 182
K + K QR +Y+G + +S+R+
Sbjct: 255 -KIEYSEKRQRLAYIRLYSGVLHLRDSVRI 283


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS01315  TCRTETOQM     71     5e-15    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 71.4 bits (175), Expect = 5e-15
Identities = 54/150 (36%), Positives = 68/150 (45%), Gaps = 17/150 (11%)

Query: 33 VDDGKSTLIGRLLHDSKMIYEDHLEAITRDSKKVGTTGDDVDLALLVDGLQAEREQGITI 92
VD GK+TL LL++S I E K T D+ L ER++GITI
Sbjct: 12 VDAGKTTLTESLLYNSGAITE------LGSVDKGTTRTDNTLL---------ERQRGITI 56

Query: 93 DVAYRYFSTAKRKFIIADTPGHEQYTRNMATGASTCDLAIILIDARYGVQTQTRRHSFIA 152
F K I DTPGH + + S D AI+LI A+ GVQ QTR
Sbjct: 57 QTGITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHAL 116

Query: 153 SLLGIRHIVVAINKMDLKGFD-QGVFEQIK 181
+GI I INK+D G D V++ IK
Sbjct: 117 RKMGIPTIFF-INKIDQNGIDLSTVYQDIK 145


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS01325  FLAGELLIN     31     0.004    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 30.8 bits (69), Expect = 0.004
Identities = 13/42 (30%), Positives = 20/42 (47%), Gaps = 5/42 (11%)

Query: 55 SARD--AGLA---TLRFNFRGVGQSAGSYGEGIGEIDDAEAA 91
SA+D AG A N +G+ Q++ + +GI E A
Sbjct: 39 SAKDDAAGQAIANRFTSNIKGLTQASRNANDGISIAQTTEGA 80


89  HWH78_RS01730  HWH78_RS01765  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS01730  -1      11       0.745205   NAD(P)-dependent oxidoreductase
HWH78_RS01735  0       10       0.877577   hypothetical protein
HWH78_RS01740  -1      11       1.408273   ATPase
HWH78_RS01745  -1      10       1.589646   ferrous iron transporter A
HWH78_RS01750  -2      11       1.651024   Fe(2+) transporter permease subunit FeoB
HWH78_RS01755  -2      13       2.603405   hypothetical protein
HWH78_RS01760  -2      14       3.131802   alkene reductase
HWH78_RS01765  -2      15       3.342483   MFS transporter
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS01730  NUCEPIMERASE  110    5e-30    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 110 bits (276), Expect = 5e-30
Identities = 80/362 (22%), Positives = 129/362 (35%), Gaps = 68/362 (18%)

Query: 1 MRILVTGATGFIGGRFARFALEQGLSVRV---------SGRRADAVEHLVARGAEFVPGD 51
M+ LVTGA GFIG ++ LE G V + +E L G +F D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 52 LADPALVVRLCED--VEAVVHCAGAVGV---WGPRERFLAANVGLAESVVEACMRQKVRR 106
LAD + L E V + V + +N+ +++E C K++
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 107 LVHLSSPSIYFDGRDHLDLNEEYVPRRFSDHYGATKYQAEQLVLSARDL-GLEVLALRPR 165
L++ SS S+Y R + + + Y ATK E + + L GL LR
Sbjct: 121 LLYASSSSVYGLNR-KMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLR-F 178

Query: 166 FVV----GAGDTSIFPRMIQAHRKGR-LRILGNGLNRVDFTSVHNLNDALFSCL------ 214
F V G D ++F + +A +G+ + + G + DFT + ++ +A+
Sbjct: 179 FTVYGPWGRPDMALF-KFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHA 237

Query: 215 ------LAGEPALG----KVYNISNGQPVPFWDAVNYVMRQLDLPPVGGHLPYAVGYGLA 264
G PA +VYNI N PV D + + L + LP
Sbjct: 238 DTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLP-------- 289

Query: 265 ALNEGVCRILPGRPEPALFRLGMAVMAKNFTLDINRAREYLDYDPRVSLWTALDEFCAWW 324
+P A D E + + P ++ + F W+
Sbjct: 290 -------------LQPGDVLETSA--------DTKALYEVIGFTPETTVKDGVKNFVNWY 328

Query: 325 RA 326
R
Sbjct: 329 RD 330


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS01740  GPOSANCHOR    30     0.009    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 30.4 bits (68), Expect = 0.009
Identities = 22/111 (19%), Positives = 40/111 (36%)

Query: 95 EEAAGRLDDIRGKVVASESSVTSEREALRLQVKQLQEKLGSQERQQADVSNQFGGQGKRL 154
E+A + A ++ +E+ AL + +L++ L S +
Sbjct: 231 EKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEK 290

Query: 155 DQLASDLKAQQESAAQLVAQLDGKLQTLAAEQEKLKALQVELGKTNEQLKA 205
L ++ + + L A + L A +E K L+ E K EQ K
Sbjct: 291 AALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKI 341


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS01750  TCRTETOQM     35     0.001    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 35.2 bits (81), Expect = 0.001
Identities = 40/179 (22%), Positives = 69/179 (38%), Gaps = 55/179 (30%)

Query: 1 MTALTLGLIGNPNSGKTTLFNQL---TGSRQRVGNW-AGVTV------ERKEG------- 43
M + +G++ + ++GKTTL L +G+ +G+ G T ER+ G
Sbjct: 1 MKIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGI 60

Query: 44 -AFHTVRHAVRLVDLPGTYSLTSVSAQASLDEQIACRYIASGEVDVLVNVVDAANL---- 98
+F V ++D PG + EV ++V+D A L
Sbjct: 61 TSFQWENTKVNIIDTPGHMDFLA-------------------EVYRSLSVLDGAILLISA 101

Query: 99 -----ERNLYLTVQLREMGIPCIVALNMLDIARSQRIRIDIDGLAR----RLGCPVVPL 148
+ L LR+MGIP I +N +D + ID+ + + +L +V
Sbjct: 102 KDGVQAQTRILFHALRKMGIPTIFFINKID-----QNGIDLSTVYQDIKEKLSAEIVIK 155


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS01765  TCRTETB       50     8e-09    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 49.9 bits (119), Expect = 8e-09
Identities = 32/155 (20%), Positives = 66/155 (42%), Gaps = 2/155 (1%)

Query: 26 LPQVAGDLRVSIPSAGWLISGYAFAVAFGAPLMAMATARLERKKALLALMGIFIVGNLLC 85
LP +A D S W+ + + + G + + +L K+ LL + I G+++
Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96

Query: 86 AVAANY-GLLMLARIVTALCHGAFFGIGSVVAASLVAPNRRASAVALMFTGLTLANVLGV 144
V ++ LL++AR + AF + VV A + R A L+ + + + +G
Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156

Query: 145 PLGTALGQEAGWRATFWVVTLIGVVAFVGLARVLP 179
+G + W ++ +I ++ L ++L
Sbjct: 157 AIGGMIAHYIHWSYLL-LIPMITIITVPFLMKLLK 190


90  HWH78_RS01995  HWH78_RS02030  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS01995  -1      10       -0.377306  methyl-accepting chemotaxis protein PctB
HWH78_RS02000  -1      8        -0.028704  methyl-accepting chemotaxis protein PctA
HWH78_RS02005  -2      7        0.732451   DUF853 domain-containing protein
HWH78_RS02010  0       8        0.564197   methyl-accepting chemotaxis protein PctC
HWH78_RS02015  0       13       1.507467   type 4b pilus Flp major pilin
HWH78_RS02020  1       14       1.700159   type 4b pilus Flp biogenesis protein RcpC
HWH78_RS02025  1       15       1.828193   type 4b pilus Flp secretin RcpA
HWH78_RS02030  2       13       2.147135   type 4b pilus Flp biogenesis protein TadZ
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS01995  GPOSANCHOR    30     0.028    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 30.0 bits (67), Expect = 0.028
Identities = 25/209 (11%), Positives = 63/209 (30%), Gaps = 11/209 (5%)

Query: 342 RFVERIHESIREVAGTARQLHDVAQLVVNASNSSMANSDEQSNRTNSVAAAINELGAAAQ 401
+ +E + + L + + N + + +A I L A
Sbjct: 127 KALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKA 186

Query: 402 EIARNAADASHHASDANHQ-AEDGKQVVEQTIRAMNELSEKISASCANIEALNSRTVNIG 460
+A + + A + I+ + ++A A++E +N
Sbjct: 187 A-----LEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFS 241

Query: 461 QILEVIKGISEQTN--LLALNAAIEAARAGEAGRGFAVVADEVRNLAHRAQESAQQIQKM 518
E L A A +E A A + +++ L + +
Sbjct: 242 TADSAKIKTLEAEKAALEARQAELEKALE-GAMNFSTADSAKIKTLEAEKAALEAEKADL 300

Query: 519 IEELQI--GAQEAVSTMTESQRYSLESVE 545
+ Q+ ++++ ++ R + + +E
Sbjct: 301 EHQSQVLNANRQSLRRDLDASREAKKQLE 329


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS02000  RTXTOXINA     31     0.018    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 31.1 bits (70), Expect = 0.018
Identities = 24/167 (14%), Positives = 58/167 (34%), Gaps = 8/167 (4%)

Query: 308 GRAMQDIAQGEGDLTKRLAVTSRDEFGVLGDAFN---QFVERIHRSIREVAGTAHKLHDV 364
G ++ D+ + +L + ++ + F + + R + A KL
Sbjct: 61 GSSLNDLVRTADELGIEVQYDEKNGTAITKQVFGTAEKLIGLTERGVTIFAPQLDKLLQK 120

Query: 365 SQLVVNASNSSMANSDEQSNRTNSVAAAI-NELGAAAQEIARNAADASHHASDANHQAED 423
Q N N + + + + N LG A + + + +E
Sbjct: 121 YQKAGNILGGGAENIGDNLGKAGGILSTFQNFLGTALSSMKIDELIKKQKSGGNVSSSEL 180

Query: 424 GKQVVEQTIRAMNELSEKISASCANIEALNSRTVNIGQILEVIKGIS 470
K +E N+L + +++ N+ + + + +G +L K ++
Sbjct: 181 AKASIELI----NQLVDTVASLNNNVNSFSQQLNTLGSVLSNTKHLN 223


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS02010  RTXTOXINA     31     0.019    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 31.1 bits (70), Expect = 0.019
Identities = 29/179 (16%), Positives = 65/179 (36%), Gaps = 8/179 (4%)

Query: 299 LIRVLMQPLTDMGRAMQDIAQGEGDLTKRLKVTSNDEFGTLANAFNRFVERIHESIREVA 358
LI ++ + G ++ D+ + +L ++ + F + I + R V
Sbjct: 49 LILLIPKDYKGQGSSLNDLVRTADELGIEVQYDEKNGTAITKQVFGTAEKLIGLTERGVT 108

Query: 359 GTARQLHDVAQLVVNASN---SSMANSDEQSNRTNSVAAAI-NELGAAAQEIARNAADAS 414
A QL + Q A N N + + + + N LG A + +
Sbjct: 109 IFAPQLDKLLQKYQKAGNILGGGAENIGDNLGKAGGILSTFQNFLGTALSSMKIDELIKK 168

Query: 415 HHASDANHQAEDGKQVVEQTIRAMNELSEKISASCANIEALNSRTVNIGQILEVIKGIS 473
+ +E K +E N+L + +++ N+ + + + +G +L K ++
Sbjct: 169 QKSGGNVSSSELAKASIELI----NQLVDTVASLNNNVNSFSQQLNTLGSVLSNTKHLN 223


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS02025  BCTERIALGSPD  146    2e-40    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 146 bits (369), Expect = 2e-40
Identities = 67/253 (26%), Positives = 109/253 (43%), Gaps = 15/253 (5%)

Query: 131 PNQVQTDIRFVEVSRSKLKQASTSFVRRGGNLWVLG------APGSLGDIKVNADGSGLG 184
QV + EV + + + + + G + N DG+ +
Sbjct: 344 RPQVLVEAIIAEVQDADGLNLGIQWANKNAGMTQFTNSGLPISTAIAGANQYNKDGT-VS 402

Query: 185 GTFGTGSSGFNLIFGG---GKWLSFMNALEGSGFAYTLARPSLVAMSGQSASFLAGGEFP 241
+ + S FN I G G W + AL S LA PS+V + A+F G E P
Sbjct: 403 SSLASALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVP 462

Query: 242 IPVP--NGTNDNV--TIEYKEFGIRLTLTPTVMNNRRIALKVAPEVSELDYSAGIQSGGV 297
+ + DN+ T+E K GI+L + P + + L++ EVS + +A S +
Sbjct: 463 VLTGSQTTSGDNIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSVADAASSTSSDL 522

Query: 298 AVPALRVRRTDTSVMLADGESFVISGLTSSNSVSNVDKFPWLGDIPILGAFFRSTKLDKD 357
R + +V++ GE+ V+ GL + DK P LGDIP++GA FRST
Sbjct: 523 GA-TFNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVS 581

Query: 358 DRELLMIVTPHLV 370
R L++ + P ++
Sbjct: 582 KRNLMLFIRPTVI 594


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS02030  HTHFIS        33     0.002    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 32.9 bits (75), Expect = 0.002
Identities = 22/105 (20%), Positives = 40/105 (38%), Gaps = 10/105 (9%)

Query: 18 LQNSLASAG-QVVPAGSASLEELLALLDVTAAGVLFISL---GKSNLVSQGALVEGLVSA 73
L +L+ AG V +A L + ++ + ++ L+ + A
Sbjct: 19 LNQALSRAGYDVRITSNA--ATLWRWIAAGDGDLVVTDVVMPDENAF----DLLPRIKKA 72

Query: 74 RPMLSVVAIGDGLDNQLVLAAMRAGARDFITYGARASELTGLIRR 118
RP L V+ + + A GA D++ +EL G+I R
Sbjct: 73 RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


91  HWH78_RS02210  HWH78_RS02260  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS02210  1       16       3.362940   type II secretion system protein J
HWH78_RS02215  1       15       3.819622   type II secretion system minor pseudopilin GspH
HWH78_RS02220  2       18       3.583266   hypothetical protein
HWH78_RS02225  -1      11       3.379265   type II secretion system minor pseudopilin GspI
HWH78_RS02230  1       11       3.561870   type II secretion system major pseudopilin GspG
HWH78_RS02235  1       12       4.133416   type II secretion system minor pseudopilin GspK
HWH78_RS02240  0       10       3.380584   type II secretion system protein
HWH78_RS02245  0       9        2.025434   type II secretion system protein M
HWH78_RS02250  1       12       1.293232   type II secretion system secretin GspD
HWH78_RS02255  1       14       0.607307   type II secretion system ATPase GspE
HWH78_RS02260  0       14       0.324942   type II secretion system inner membrane protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS02210  BCTERIALGSPG  32     6e-04    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 32.2 bits (73), Expect = 6e-04
Identities = 18/60 (30%), Positives = 32/60 (53%), Gaps = 3/60 (5%)

Query: 12 RRQAGFTLIEVMVAIMLMAIV-SLMAWRGLDSIARASAHLEDSTEQGAALLRALNQLERD 70
+Q GFTL+E+MV I+++ ++ SL+ + + +A S AL AL+ + D
Sbjct: 5 DKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVS--DIVALENALDMYKLD 62


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS02215  BCTERIALGSPH  49     3e-10    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 49.2 bits (117), Expect = 3e-10
Identities = 30/129 (23%), Positives = 46/129 (35%), Gaps = 7/129 (5%)

Query: 5 RQGGFTLIELMVVLVIVGIATAAISLSARPDPTGLLRQDAARLARLLEIAQGEARVRGTP 64
RQ GFTL+E+M++L+++G++ + L+ Q AR L Q G
Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTGQF 61

Query: 65 ILWQPSAKGYRFSPQAYRGKTDALAADTELRARDWQAAPLRVSVRPPRPVLLDAEWIGAP 124
++F R D AD W PLR V G
Sbjct: 62 FGVSVHPDRWQFLVLEARDGADPAPADDGWSGYRWL--PLR-----AGRVATSGSIAGGK 114

Query: 125 LRITLSDGQ 133
L + + G+
Sbjct: 115 LNLAFAQGE 123


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS02225  BCTERIALGSPG  31     6e-04    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 31.0 bits (70), Expect = 6e-04
Identities = 19/62 (30%), Positives = 34/62 (54%), Gaps = 3/62 (4%)

Query: 8 RGFTLIEVLVALAIVAIALAAAIRAVGLMTDGNGLLRDKSLA-LLAAESRLAELRLGVGA 66
RGFTL+E++V IV I + A++ LM + + K+++ ++A E+ L +L
Sbjct: 8 RGFTLLEIMV--VIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDNHH 65

Query: 67 AP 68
P
Sbjct: 66 YP 67


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS02230  BCTERIALGSPG  167    1e-56    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 167 bits (425), Expect = 1e-56
Identities = 63/142 (44%), Positives = 87/142 (61%), Gaps = 6/142 (4%)

Query: 11 KGHRGQRGFTLIEIMVVVVILGILAAMVVPKVLDRPDQARATAARQDISGLMQALKLYRL 70
+ QRGFTL+EIMVV+VI+G+LA++VVP ++ ++A A DI L AL +Y+L
Sbjct: 2 RATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKL 61

Query: 71 DQGRYPSQAQGLKVLAERP-ADASASNWRS--YLERLPNDPWGKPYQYLNPGVNGEIDVF 127
D YP+ QGL+ L E P A+N+ Y++RLP DPWG Y +NPG +G D+
Sbjct: 62 DNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDLL 121

Query: 128 SLGADGQPGGEGINADIGSWQL 149
S G DG+ G E DI +W L
Sbjct: 122 SAGPDGEMGTED---DITNWGL 140


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS02250  BCTERIALGSPD  255    8e-77    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 255 bits (653), Expect = 8e-77
Identities = 151/571 (26%), Positives = 257/571 (45%), Gaps = 50/571 (8%)

Query: 230 PGNNTVVVTDYAENLDRVAGIIASIDIPSASD---TDVVPIQNGIAVDIASTVSELLDSQ 286
NN V+ +++ A +AS P D T VVP+ N A D+A + +L D+
Sbjct: 94 NMNNGVLKVVRSKDAKTAAVPVASDAAPGIGDEVVTRVVPLTNVAARDLAPLLRQLNDNA 153

Query: 287 GSGGAEQGQKTVVLADPRSNSIVIRSPSPERTQLARDLIGKLDSVQSNPGNLHVVYLRNA 346
G G +VV +P SN +++ + +L ++ ++D+ ++ V L A
Sbjct: 154 GVG-------SVVHYEP-SNVLLMTGRAAVIKRL-LTIVERVDNAGDR--SVVTVPLSWA 202

Query: 347 QATRLAQALRGLITGDSGGEGNE--------GDQQRARLSGGG---------MLGGGNSG 389
A + + + L S ++ A L G M+ +
Sbjct: 203 SAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNSRQRIIAMIKQLDRQ 262

Query: 390 TGSQGLGSSGNTTGSGSSGLGGSNRSGGAYGAMGSGQGGAGPGAMGEENSAFSAGGVTVQ 449
+QG + +S L Q A+ + + ++
Sbjct: 263 QATQGNTKVIYLKYAKASDLVEVLTGIS-STMQSEKQAAKPVAALDK--------NIIIK 313

Query: 450 ADATTNTLLISAPEPLYRNLREVIDLLDQRRAQVVIESLIVEVSEDDSSEFGIQWQAGNL 509
A TN L+++A + +L VI LD RR QV++E++I EV + D GIQW N
Sbjct: 314 AHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWANKNA 373

Query: 510 GGNGVFG-GVNFGQSALNTAGKNTIDVLPKGLNIGLVDGTVDIPGIGKILDLKVLARALK 568
G G+ + N + L L G + + +L AL
Sbjct: 374 GMTQFTNSGLPISTAIAGANQYNKDGTVSSSLASALSSFNGIAAGFYQG-NWAMLLTALS 432

Query: 569 SRGGTNVLSTPNLLTLDNESASIMVGQTIPFVSGQYVTDGGGTSNNPFQTIQREDVGLKL 628
S ++L+TP+++TLDN A+ VGQ +P ++G T G N F T++R+ VG+KL
Sbjct: 433 SSTKNDILATPSIVTLDNMEATFNVGQEVPVLTGSQTTSGD----NIFNTVERKTVGIKL 488

Query: 629 NIRPQISEGGTVKLDVYQEVSSVDERASTAA---GVVTNKRAIDTSILLDDGQIMVLGGL 685
++PQI+EG +V L++ QEVSSV + AS+ + G N R ++ ++L+ G+ +V+GGL
Sbjct: 489 KVKPQINEGDSVLLEIEQEVSSVADAASSTSSDLGATFNTRTVNNAVLVGSGETVVVGGL 548

Query: 686 LQDNVQDNTDGVPGLSSLPGVGSLFRYQKRSRTKTNLMVFLRPYIVRDAAAGRSITLNRY 745
L +V D D VP L +P +G+LFR + +K NLM+F+RP ++RD R + +Y
Sbjct: 549 LDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRDEYRQASSGQY 608

Query: 746 DFIRRAQ-QRVQPRHDWSVGDMQAPVLPPAQ 775
AQ ++ ++ ++ + + P Q
Sbjct: 609 TAFNDAQSKQRGKENNDAMLNQDLLEIYPRQ 639



Score = 160 bits (407), Expect = 2e-43
Identities = 75/278 (26%), Positives = 130/278 (46%), Gaps = 10/278 (3%)

Query: 85 GAVAPVSAAAAELGEQPVSLNFVDTEVEAVVRALSRATGRQFLVDPRVKGKLTLVSEGQV 144
A+ AAA E S +F T+++ + +S+ + ++DP V+G +T+ S +
Sbjct: 18 AALLFRPAAAEEF-----SASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDML 72

Query: 145 AARTAYRMLTSALRMQGFSVVDVD-GVSQVVPEADAKLLGGPVYGADRPA-ANGMVTRTF 202
Y+ S L + GF+V++++ GV +VV DAK PV P + +VTR
Sbjct: 73 NEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGIGDEVVTRVV 132

Query: 203 RLRYENAVNLIPVLRPIVAQNNPINA--YPGNNTVVVTDYAENLDRVAGIIASIDIPSAS 260
L A +L P+LR + + Y +N +++T A + R+ I+ +D
Sbjct: 133 PLTNVAARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDNAGDR 192

Query: 261 DTDVVPIQNGIAVDIASTVSELLDSQGSGGAEQGQKTVVLADPRSNSIVIRSPSPERTQL 320
VP+ A D+ V+EL V+AD R+N++++ P Q
Sbjct: 193 SVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGE-PNSRQR 251

Query: 321 ARDLIGKLDSVQSNPGNLHVVYLRNAQATRLAQALRGL 358
+I +LD Q+ GN V+YL+ A+A+ L + L G+
Sbjct: 252 IIAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGI 289



Score = 50.7 bits (121), Expect = 2e-08
Identities = 44/299 (14%), Positives = 103/299 (34%), Gaps = 56/299 (18%)

Query: 194 ANGMVTRTFRLRYENAVNLIPVLRPI----------VAQNNPINAYPGNNTVVVTDYAEN 243
A T L + +A +++ ++ + + + A N V+V+ +
Sbjct: 189 AGDRSVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNS 248

Query: 244 LDRVAGIIASIDIPSAS--DTDVVPIQNGIAVDIASTVSELL-----DSQGSGGAEQGQK 296
R+ +I +D A+ +T V+ ++ A D+ ++ + + Q + K
Sbjct: 249 RQRIIAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAKPVAALDK 308

Query: 297 TV-VLADPRSNSIVIRSPSPERTQLARDLIGKLDSVQSNPGNLHVVYLRNAQATRLAQAL 355
+ + A ++N++++ + P+ +I +LD +R Q +
Sbjct: 309 NIIIKAHGQTNALIVTAA-PDVMNDLERVIAQLD-------------IRRPQV-----LV 349

Query: 356 RGLITGDSGGEGNEGDQQRARLSGGGMLG--GGNSGTGSQGLGSSGNTTGSGSSGLGGSN 413
+I +G LG N G +SG + +G N
Sbjct: 350 EAIIAEVQDADGLN-------------LGIQWANKNAGMTQFTNSGLPISTAIAGANQYN 396

Query: 414 RSGGAYGAMGSGQGGAGPGAMGEENSAFSAGGVTVQA-DATTNTLLISAPEPLYRNLRE 471
+ G ++ S A G + + + A ++T +++ P + + E
Sbjct: 397 KDGTVSSSLASALSSFNGIAAGFYQGNW---AMLLTALSSSTKNDILATPSIVTLDNME 452



Score = 44.1 bits (104), Expect = 2e-06
Identities = 20/84 (23%), Positives = 35/84 (41%), Gaps = 16/84 (19%)

Query: 190 DRPAANGMVTRTFRLRYENAVNLIPVLR----------------PIVAQNNPINAYPGNN 233
DR A T+ L+Y A +L+ VL + +N I A+ N
Sbjct: 260 DRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAKPVAALDKNIIIKAHGQTN 319

Query: 234 TVVVTDYAENLDRVAGIIASIDIP 257
++VT + ++ + +IA +DI
Sbjct: 320 ALIVTAAPDVMNDLERVIAQLDIR 343


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS02260  BCTERIALGSPF  380    e-132    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 380 bits (977), Expect = e-132
Identities = 188/407 (46%), Positives = 253/407 (62%), Gaps = 5/407 (1%)

Query: 1 MQTFRYEAADAQGRIETGTLEADSQRGALGQLRARGLTPLEVREQAGGGAGQGAGALFAP 60
M + Y+A DAQG+ GT EADS R A LR RGL PL V E G G+ L
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 61 R---LSDGDLAWATRQLASLLAASLPLEAALSATLDQAERKHIAQTLSAVRSDVRGGMRL 117
R LS DLA TRQLA+L+AAS+PLE AL A Q+E+ H++Q ++AVRS V G L
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 118 ADALAARPRDFPEIYRALVAAGEESGDLAQVMERLADYIEERNALRGKILTAFIYPAVVG 177
ADA+ P F +Y A+VAAGE SG L V+ RLADY E+R +R +I A IYP V+
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180

Query: 178 VVSIGIVIFLLGYVVPQVVSAFSQARQDLPALTRAMLQASDFVRAWG-WLCAGAIGGAYW 236
VV+I +V LL VVP+VV F +Q LP TR ++ SD VR +G W+ + G
Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFM- 239

Query: 237 GWRLYLRDPQARLGWHRRVLRLPLLGRFVLGVNTARFASTLAILGSAGVPLLRALDAARQ 296
+R+ LR + R+ +HRR+L LPL+GR G+NTAR+A TL+IL ++ VPLL+A+ +
Sbjct: 240 AFRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGD 299

Query: 297 TLANDCLAQAVEEATAQVREGVSLASALRTRQVFPPILTHLIASGEKTGALPPMLDRAAQ 356
++ND + AT VREGVSL AL +FPP++ H+IASGE++G L ML+RAA
Sbjct: 300 VMSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAAD 359

Query: 357 TLSRDIERRAMGMTALLEPLMIVVMGGVVLTIVMAVLMPIIEMNQLV 403
R+ + L EPL++V M VVL IV+A+L PI+++N L+
Sbjct: 360 NQDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


92  HWH78_RS02765  HWH78_RS02800  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS02765  -1      10       0.545038   VanW family protein
HWH78_RS02770  -2      12       0.151275   uracil-DNA glycosylase
HWH78_RS02775  -2      13       0.855817   AbrB family transcriptional regulator
HWH78_RS02780  -3      11       1.109199   tripartite tricarboxylate transporter permease
HWH78_RS02785  -1      13       1.085405   tripartite tricarboxylate transporter TctB
HWH78_RS02790  -1      12       1.049059   tripartite tricarboxylate transporter substrate
HWH78_RS02795  0       10       0.914417   OprD family porin
HWH78_RS02800  3       11       2.118571   response regulator
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS02765  PF05043       30     0.010    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 29.9 bits (67), Expect = 0.010
Identities = 7/27 (25%), Positives = 16/27 (59%)

Query: 47 YRYIKHTSKLIRRLGDSDLALQRNKVV 73
YR I +K+I+R +++L +++
Sbjct: 118 YRIISQINKVIKRQFQFEVSLTPVQII 144


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS02785  ACRIFLAVINRP  27     0.044    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 26.7 bits (59), Expect = 0.044
Identities = 15/58 (25%), Positives = 25/58 (43%), Gaps = 5/58 (8%)

Query: 99 LGFILSAALVGSCMAILYGARPIPAVVTASLL-----GIGLYWLFDRALDVPLPLGVL 151
+S +V C+A LY + IP V + + LF++ DV +G+L
Sbjct: 874 ALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLL 931


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS02790  AEROLYSIN     29     0.028    Aerolysin signature.
		>AEROLYSIN#Aerolysin signature.

Length = 493

Score = 28.8 bits (64), Expect = 0.028
Identities = 16/40 (40%), Positives = 23/40 (57%)

Query: 2 MMKLSFRPLALVAAGLLLAGAAVAEPKRPECIAPASPGGG 41
M K+ L+L+ +GLL+A A AEP P+ + S G G
Sbjct: 1 MQKIKLTGLSLIISGLLMAQAQAAEPVYPDQLRLFSLGQG 40


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS02800  HTHFIS        83     8e-21    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 83.3 bits (206), Expect = 8e-21
Identities = 35/127 (27%), Positives = 62/127 (48%)

Query: 2 RILLVEDHPQLAESVVQALKGAGWTVDLLQDGVAADLALASEEYALAILDVGLPRMDGFE 61
IL+ +D + + QAL AG+ V + + +A+ + L + DV +P + F+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VLARLRGRGKTLPVLMLTARGEVKDRVHGLNLGADDYLAKPFELSELEARVKALLRRSVL 121
+L R++ LPVL+++A+ + GA DYL KPF+L+EL + L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 122 GGEQLQR 128
+L+
Sbjct: 125 RPSKLED 131


93  HWH78_RS03915  HWH78_RS03940  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS03915  0       20       -4.317388  protein TolQ
HWH78_RS03920  1       18       -4.528657  protein TolR
HWH78_RS03925  0       18       -4.077267  cell envelope integrity protein TolA
HWH78_RS03930  -1      20       -4.313637  Tol-Pal system beta propeller repeat protein
HWH78_RS03935  0       29       -5.565506  peptidoglycan-associated lipoprotein Pal
HWH78_RS03940  -1      30       -5.734941  tol-pal system protein YbgF
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS03915  60KDINNERMP   29     0.017    60kDa inner membrane protein signature.
		>60KDINNERMP#60kDa inner membrane protein signature.

Length = 548

Score = 29.1 bits (65), Expect = 0.017
Identities = 17/72 (23%), Positives = 28/72 (38%), Gaps = 13/72 (18%)

Query: 12 WSLISNASIVVQLVMLTLVAASVTSWIMIFQRGNAMRAAKKALDAFEERFWS-----GID 66
+S+I + +V+ +M L A TS MR + + A ER +
Sbjct: 356 FSIII-ITFIVRGIMYPLTKAQYTSM-------AKMRMLQPKIQAMRERLGDDKQRISQE 407

Query: 67 LSKLYRQAGSNP 78
+ LY+ NP
Sbjct: 408 MMALYKAEKVNP 419


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS03925  IGASERPTASE   49     1e-08    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 49.3 bits (117), Expect = 1e-08
Identities = 36/204 (17%), Positives = 71/204 (34%), Gaps = 21/204 (10%)

Query: 54 QLKSKSQATTQTNQKIAGEAKKTASKQYE-----VEQLEQKKLEQQKLEQQKLEQQQVAA 108
Q + TT N + + + +++ + E +Q +
Sbjct: 990 QTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESK 1049

Query: 109 AKAAEQKKADEARKAEAQKAAEAKKADEAKKAAEAKAAEQKKQADIAKKRAEDEAKKKAA 168
++ A E + A EAK +A + ++A ++ E K+
Sbjct: 1050 TVEKNEQDATETTAQNREVAKEAKSNVKA----------NTQTNEVA--QSGSETKETQT 1097

Query: 169 EDAKKKAAEDAKKKAAEEAKKKAAAEAAKKKAAVEAAKKKAAAAAAAARKAAEDKKAQAL 228
+ K+ A + ++KA E +K E K + V ++++ A A E+ +
Sbjct: 1098 TETKETATVEKEEKAKVETEKT--QEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNI 1155

Query: 229 AELLS--DTTERQQALADEVGSEV 250
E S +TT + A E S V
Sbjct: 1156 KEPQSQTNTTADTEQPAKETSSNV 1179


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS03935  OMPADOMAIN    116    6e-34    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 116 bits (291), Expect = 6e-34
Identities = 34/112 (30%), Positives = 53/112 (47%), Gaps = 12/112 (10%)

Query: 68 YFEYDSSDLKPEAMRALDVHA---KDLKGSGQRVVLEGHTDERGTREYNMALGERRAKAV 124
F ++ + LKPE ALD +L VV+ G+TD G+ YN L ERRA++V
Sbjct: 222 LFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQSV 281

Query: 125 QRYLVLQGVSPAQLELVSYGKERPVATGHDEQS---------WAQNRRVELK 167
YL+ +G+ ++ G+ PV + A +RRVE++
Sbjct: 282 VDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIE 333


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS03940  RTXTOXIND     32     0.002    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 32.1 bits (73), Expect = 0.002
Identities = 10/53 (18%), Positives = 19/53 (35%)

Query: 69 QLQQMQDELARLRGTLEEQQNQIQQLKQESLERYQDLDRRISGGGAPAAQNSA 121
+ + +EL + LE+ +++I K+E Q I N
Sbjct: 260 KYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIG 312


94HWH78_RS04225HWH78_RS04265N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
HWH78_RS04225112-2.316793shufflon system plasmid conjugative transfer
HWH78_RS04230216-2.317945Flp pilus assembly complex ATPase component
HWH78_RS04235216-2.766308pilus assembly protein PilX
HWH78_RS04240315-2.218825type II secretion system F family protein
HWH78_RS04245215-2.101249Flp pilus assembly complex ATPase component
HWH78_RS04250220-2.777815type IV pilus biogenesis protein PilP
HWH78_RS04255220-3.479892type 4b pilus protein PilO2
HWH78_RS04260121-3.756795PilN family type IVB pilus formation outer
HWH78_RS04265125-3.942806TcpQ domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS04225  BCTERIALGSPG  37     3e-05    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 37.2 bits (86), Expect = 3e-05
Identities = 15/62 (24%), Positives = 29/62 (46%)

Query: 2 RSKRSSGFISIELMIALVVIAIATAGGISVLMSYLDGLDEQHAAQQQQQVAKAAEKYLKD 61
+ + GF +E+M+ +V+I + + + LM + D+Q A + A + Y D
Sbjct: 3 ATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLD 62

Query: 62 NF 63
N
Sbjct: 63 NH 64


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS04235  PilS_PF08805  117    7e-36    PilS N terminal
		>PilS_PF08805#PilS N terminal

Length = 185

Score = 117 bits (295), Expect = 7e-36
Identities = 46/179 (25%), Positives = 91/179 (50%), Gaps = 12/179 (6%)

Query: 2 STTQRTSRPTQGGFVSIEMIIVLIIIAIGVGLGLAAAAGMFSSSNANEEQRNISVIAANA 61
S + R + G +E+++V+ +I + + + S+ ++ EQ N+ + AN
Sbjct: 15 SLSARRKKEQDKGATLMEVLLVVGVIVVLAASAYKLYSMVQSNIQSSNEQNNVLTVIANM 74

Query: 62 RALKTSSGYGSSGTNLIPSLIAINGVPKNM--SVSSGVVYNVYGGSVTV--SSTGMGFSI 117
++LK Y + +N I +L A +P +M + N +GGSVT+ SS F++
Sbjct: 75 KSLKFQGRY--TDSNYIKTLYAQGLLPSDMIADTTGASAKNPWGGSVTITTSSDKYSFNV 132

Query: 118 TTSKLPQDACITLATKIAKNTFEQTKINSGSAITGEVTTAAATQACSSDSNSITWTYSS 176
+ +PQ C+ + + +++ +KIN+ S +T +A C+SDSN++T++ S
Sbjct: 133 VEANVPQKNCMAMVNAL-RSSSAISKINNTS-----TSTVSAATVCASDSNTLTFSTDS 185


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS04240  BCTERIALGSPF  72     5e-16    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 71.8 bits (176), Expect = 5e-16
Identities = 74/346 (21%), Positives = 141/346 (40%), Gaps = 20/346 (5%)

Query: 14 SKQFGRKERLQFYESMSTLLENGVPLKDAVAEVHKIFAHEGQHPFHPVAIASREALMGLS 73
+ + ++TL+ +PL++A+ V K E H + A R +M
Sbjct: 62 KIRLSTSDLALLTRQLATLVAASMPLEEALDAVAK--QSEKPH-LSQLMAAVRSKVME-- 116

Query: 74 NGKRLATAMALYLPVQE---RALIEAGEMSGNLVQAMGDAVSLVEAQARIRATIWQALLY 130
G LA AM + E A++ AGE SG+L + E + ++R+ I QA++Y
Sbjct: 117 -GHSLADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIY 175

Query: 131 PSALSAMMVFLLCIVAYRMVPSLARLSDPVTWTGPLAT--LNAIASFVTGPGIYVLVAVI 188
P L+ + + ++ I+ +VP + + PL+T L ++ V G ++L+A++
Sbjct: 176 PCVLTVVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALL 235

Query: 189 TLTVVVIVTLPTYRWKGRVWLDRTLPPW----SIYRMLQGTTFLLNMAVMLNAGIRPYDS 244
+ V L + RV R L I R L + ++++ + + +
Sbjct: 236 AGFMAFRVMLRQEKR--RVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQA 293

Query: 245 LASMIK-ISPPWLKQRLEAARYGVGLGQNLGVALRSAGHDFPDRQAIQYLCILANRGGFS 303
+ +S + + RL A V G +L AL FP + G
Sbjct: 294 MRISGDVMSNDYARHRLSLATDAVREGVSLHKALE-QTALFP-PMMRHMIASGERSGELD 351

Query: 304 EALVKFSRRWQETSLKQIELAAGLVKNFALIFIGALMILVLLGAYQ 349
L + + Q+ LA GL + ++ + A+++ ++L Q
Sbjct: 352 SMLERAADNQDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQ 397


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS04260  BCTERIALGSPD  87     4e-20    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 87.3 bits (216), Expect = 4e-20
Identities = 70/318 (22%), Positives = 132/318 (41%), Gaps = 26/318 (8%)

Query: 269 SELKTSILSDIENSINSMLTPSMGRMSLSRATGTLTVTDRPEVLNRVQQLVNRENESITK 328
+ + +++ S+ + + + T L VT P+V+N +++++ + +
Sbjct: 287 TGISSTMQSEKQAAKPVAALDKNIIIKAHGQTNALIVTAAPDVMNDLERVIA-QLDIRRP 345

Query: 329 QVLLNVNVLSVALTDKDQLGIDW---NLVYKSLNNKWGIGLKNTMPGIDQSAISGSV--- 382
QVL+ + V D LGI W N N G+ + + G +Q G+V
Sbjct: 346 QVLVEAIIAEVQDADGLNLGIQWANKNAGMTQFTNS-GLPISTAIAGANQYNKDGTVSSS 404

Query: 383 --SILDTANSAWAGS-----KAMVQALAQQGRVSTVRSPSVTTLNLQSAPIQIGRYDSYL 435
S L + N AG ++ AL+ + + +PS+ TL+ A +G+ L
Sbjct: 405 LASALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVL 464

Query: 436 ASSQISNVAQVGSTTSLIPGAVTSGYNMSLLPFVMESGEMLLKININMTSRPTFEMQTSG 495
SQ ++ + +T T G + + P + E +LL+I ++S TS
Sbjct: 465 TGSQTTSGDNIFNTVERK----TVGIKLKVKPQINEGDSVLLEIEQEVSSVADAASSTSS 520

Query: 496 DSKAQFPSYDIQLFDQKVRLRSGETLVLSGF--DQTTEDTNKV-GTGDAGFFG-LGGGLT 551
D A F + + V + SGET+V+ G ++ +KV GD G L +
Sbjct: 521 DLGATFNTRTVN---NAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTS 577

Query: 552 RNTKREVIVVLITPVVLG 569
+ + +++ I P V+
Sbjct: 578 KKVSKRNLMLFIRPTVIR 595


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS04265  PF03544  31     0.007    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 30.7 bits (69), Expect = 0.007
Identities = 25/128 (19%), Positives = 39/128 (30%), Gaps = 4/128 (3%)

Query: 166 QLPPAPRP-KPVQQLYAKPAAPTPAAVAQPSSTEKVSTLESPVVVASVPTPTPITTSPAP 224
Q+ P P +P+ PA P QP V P + P P+
Sbjct: 39 QVIELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPK 98

Query: 225 TKKPESTTVFPPAAPAKDGHPSSPPAASAPIKPLASAVKSMPPTPAVASAPPVKVLTPAE 284
K P + P S P P + + P + +A V + A
Sbjct: 99 PKPKPKP---KPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVAS 155

Query: 285 PSRQLAQS 292
R L+++
Sbjct: 156 GPRALSRN 163


95  HWH78_RS04885  HWH78_RS04950  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS04885   2      15       -2.088507  flagellar basal body rod protein FlgC
HWH78_RS04890   2      15       -1.264494  flagellar hook assembly protein FlgD
HWH78_RS04895   1      15       -0.538056  flagellar hook protein FlgE
HWH78_RS04900   0      13       -0.607089  flagellar basal-body rod protein FlgF
HWH78_RS04905   0      12       -1.023199  flagellar basal-body rod protein FlgG
HWH78_RS04910  -2      11       -1.109413  flagellar basal body L-ring protein FlgH
HWH78_RS04915  -1      10       -1.467393  flagellar basal body P-ring protein FlgI
HWH78_RS04920  -2       8       -1.798020  flagellar assembly peptidoglycan hydrolase FlgJ
HWH78_RS04925   0       9       -2.499543  flagellar hook-associated protein FlgK
HWH78_RS04930   1      10       -2.542082  flagellar hook-associated protein FlgL
HWH78_RS04935   0      11       -2.345936  DegT/DnrJ/EryC1/StrS family aminotransferase
HWH78_RS04940   0      12       -2.145713  acyl carrier protein
HWH78_RS04945   1      16       -2.420531  ketoacyl-ACP synthase III
HWH78_RS04950   1      19       -0.957412  SDR family oxidoreductase
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits        Score  E-value  Comments
HWH78_RS04885  FLGHOOKAP1  36     3e-05    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 36.1 bits (83), Expect = 3e-05
Identities = 8/38 (21%), Positives = 20/38 (52%)

Query: 107 NVNVVEEMADMISASRAFQTNAEMMNTAKQMMQKVLTL 144
VN+ EE ++ + + NA+++ TA + ++ +
Sbjct: 508 GVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545



Score = 29.5 bits (66), Expect = 0.004
Identities = 15/54 (27%), Positives = 25/54 (46%), Gaps = 2/54 (3%)

Query: 4 ASVFNIAGSGMSAQSTRLNTVASNIANAETVSSSVDKTYRARHPVFSTMFQQAQ 57
+S+ N A SG++A LNT ++NI++ + T A ST+
Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMA--QANSTLGAGGW 52


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits        Score  E-value  Comments
HWH78_RS04895  FLGHOOKAP1  45     5e-07    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 44.9 bits (106), Expect = 5e-07
Identities = 17/49 (34%), Positives = 27/49 (55%)

Query: 414 ALQSGALEASNVDISNELVNLIVHQRNYQANAKTIQTEDAVTQTIINLR 462
L + S V++ E NL Q+ Y ANA+ +QT +A+ +IN+R
Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546



Score = 41.1 bits (96), Expect = 8e-06
Identities = 22/69 (31%), Positives = 34/69 (49%), Gaps = 3/69 (4%)

Query: 2 SFNIGLSGIQAASSGLNVTGNNIANAGTVGFKQSRAEFADVYAASVLGSGSNPQGSGVLL 61
N +SG+ AA + LN NNI++ G+ + A A S LG+G G+GV +
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQ--ANSTLGAGGW-VGNGVYV 59

Query: 62 SDVSQMFKQ 70
S V + +
Sbjct: 60 SGVQREYDA 68


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits        Score  E-value  Comments
HWH78_RS04905  FLGHOOKAP1  45     2e-07    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 44.6 bits (105), Expect = 2e-07
Identities = 13/51 (25%), Positives = 25/51 (49%)

Query: 209 NGLGTVAQNTLENSNVNVVEELVNMITTQRAYEMNSKVISTADQMLSFVTQ 259
N + ++ S VN+ EE N+ Q+ Y N++V+ TA+ + +
Sbjct: 494 NVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALIN 544



Score = 39.6 bits (92), Expect = 9e-06
Identities = 20/79 (25%), Positives = 35/79 (44%), Gaps = 14/79 (17%)

Query: 3 SALWVSKTGLSAQDMNLTTISNNLANVSTTGFKRDRAEFQDLLYQIRRQPGGQSTQDSEL 62
S + + +GL+A L T SNN+++ + G+ R + +S L
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTI--------------MAQANSTL 47

Query: 63 PSGLQLGTGVRVVGTQKIF 81
+G +G GV V G Q+ +
Sbjct: 48 GAGGWVGNGVYVSGVQREY 66


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS04910  FLGLRINGFLGH  180    3e-59    Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 180 bits (459), Expect = 3e-59
Identities = 81/224 (36%), Positives = 112/224 (50%), Gaps = 13/224 (5%)

Query: 12 IATALGGCVNPPPKPNDPYYAPVLPRTPLPAAQNNGAIYQAGF-----EQNLYDDRKAFR 66
+ +L GC P P P P P NG+I+Q+ Q L++DR+
Sbjct: 15 LVLSLTGCAWIPSTPLVQGATSAQP-VPGPTPVANGSIFQSAQPINYGYQPLFEDRRPRN 73

Query: 67 VGDIITITLNEKTQASKKANSDIQKDSKTKMGLTSLFGSGMTTNNPIGGGDLSLSAEYGG 126
+GD +TI L E ASK ++++ +D KT G + G + E G
Sbjct: 74 IGDTLTIVLQENVSASKSSSANASRDGKTNFGFDT---VPRYLQGLFGNARADV--EASG 128

Query: 127 SRDAKGDSQAGQSNSLTGSITVTVAEVLPNGILSVRGEKWMTLNTGNELVRIAGLVRADD 186
G A SN+ +G++TVTV +VL NG L V GEK + +N G E +R +G+V
Sbjct: 129 GNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRFSGVVNPRT 188

Query: 187 IATDNTVSSTRVADARITYSGTGAFADASQPGWLDRFF--LSPL 228
I+ NTV ST+VADARI Y G G +A GWL RFF LSP+
Sbjct: 189 ISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM 232


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS04915  FLGPRINGFLGI  436    e-155    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 436 bits (1122), Expect = e-155
Identities = 168/366 (45%), Positives = 224/366 (61%), Gaps = 10/366 (2%)

Query: 7 LLALAALLLAAGAAQAERLKDIASIQGVRTNQLIGYGLVVGLSGSGDQTTQTPFTLQTFN 66
AL L A R+KDIAS+Q R NQLIGYGLVVGL G+GD +PFT Q+
Sbjct: 13 FSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQSMR 72

Query: 67 NMLAQFGIKVPANVGNVQLKNVAAVSVHADLPPFAKPGQPIDVTVSSIGNAKSLRGGSLL 126
ML GI G KN+AAV V A+LPPFA PG +DVTVSS+G+A SLRGG+L+
Sbjct: 73 AMLQNLGITTQG--GQSNAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNLI 130

Query: 127 MTPLKGIDGQVYAVAQGNLVVGGFDAEGRDGSKITVNVPSAGRIPAGATVERAVPSGFDQ 186
MT L G DGQ+YAVAQG L+V GF A+G D + +T V ++ R+P GA +ER +PS F
Sbjct: 131 MTSLSGADGQIYAVAQGALIVNGFSAQG-DAATLTQGVTTSARVPNGAIIERELPSKFKD 189

Query: 187 GNSLTLNLNRPDFTTAKRIVDRINEL----LGPGVAHAVDGGSVRVSAPLDPNQRVDYLS 242
+L L L PDF+TA R+ D +N G +A D + V P ++
Sbjct: 190 SVNLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKP-RVADLTRLMA 248

Query: 243 ILENLDVQPGEAVAKVIINSRTGTIVIGQNVKVSPAAVTHGSLTVSITEDPIVSQPGAFS 302
+ENL V+ + AKV+IN RTGTIVIG +V++S AV++G+LTV +TE P V QP FS
Sbjct: 249 EIENLTVET-DTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQPAPFS 307

Query: 303 NGQTAVVPRSRVNAEEETKPMFKFGPGTTLDDIVRAVNQVGAAPSDLMAILEALKQAGAL 362
GQTAV P++ + A +E + G L +V +N +G ++AIL+ +K AGAL
Sbjct: 308 RGQTAVQPQTDIMAMQEGSKVA-IVEGPDLRTLVAGLNSIGLKADGIIAILQGIKSAGAL 366

Query: 363 QADLIV 368
QA+L++
Sbjct: 367 QAELVL 372


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS04920  FLGFLGJ  148    1e-43    Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 148 bits (374), Expect = 1e-43
Identities = 79/195 (40%), Positives = 114/195 (58%), Gaps = 7/195 (3%)

Query: 198 LPAQSYPAASRRGFSTDGVDSQGSRRIAQP-----PLARGKSMFASADEFIATMLPMAQK 252
LP +S PAA F + V ++ ++Q P S+ + F+A + AQ
Sbjct: 104 LPEESTPAA-PMKFPLETVVRYQNQALSQLVQKAVPRNYDDSLPGDSKAFLAQLSLPAQL 162

Query: 253 AAERIGVDARYLVAQAALETGWGKSIIRQQDGGSSHNLFGIKTGSRWDGASARALTTEYE 312
A+++ GV ++AQAALE+GWG+ IR+++G S+NLFG+K W G TTEYE
Sbjct: 163 ASQQSGVPHHLILAQAALESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYE 222

Query: 313 GGKAVKEVAAFRSYSSFEQSFHDYVSFLQGNDRYQNALDSAANPERFMQELQRAGYATDP 372
G+A K A FR YSS+ ++ DYV L N RY A+ +AA+ E+ Q LQ AGYATDP
Sbjct: 223 NGEAKKVKAKFRVYSSYLEALSDYVGLLTRNPRYA-AVTTAASAEQGAQALQDAGYATDP 281

Query: 373 QYARKVAQIARQMQT 387
YARK+ + +QM++
Sbjct: 282 HYARKLTNMIQQMKS 296



Score = 68.2 bits (166), Expect = 7e-15
Identities = 46/160 (28%), Positives = 78/160 (48%), Gaps = 10/160 (6%)

Query: 20 DLNRLNQLKVGKDRDGEANIRKVAQEFESLFLNEMLKSMRSANEALGDGNFMNSQTTKQY 79
D LN+LK D ANIR VA++ E +F+ MLKSMR +AL +S+ T+ Y
Sbjct: 14 DAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMR---DALPKDGLFSSEHTRLY 70

Query: 80 QDMYDQQLSVSLSKNAGGIGLADVLVRQLSKMKQGSRGNGENPFARVAENGAGRWPSNPS 139
MYDQQ++ ++ G+GLA+++V+Q++ + + + R+ +
Sbjct: 71 TSMYDQQIAQQMT-AGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLETVVRYQNQAL 129

Query: 140 AQAGKALPMPEAGRDDSKLLNQR----RLALPGKLAERML 175
+Q DDS + + +L+LP +LA +
Sbjct: 130 SQL--VQKAVPRNYDDSLPGDSKAFLAQLSLPAQLASQQS 167


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits        Score  E-value  Comments
HWH78_RS04925  FLGHOOKAP1  244    8e-75    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 244 bits (625), Expect = 8e-75
Identities = 142/469 (30%), Positives = 236/469 (50%), Gaps = 23/469 (4%)

Query: 2 SDLLSIGLSGLGTSQTWLTITGHNITNVKTPGYSRQDAIQQTQVPQFSGAGYMGSGSQIV 61
S L++ +SGL +Q L +NI++ GY+RQ I G++G+G +
Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS 60

Query: 62 DVRRLASDFLTGQLRNATSQNSELSAFRSQIEQLDGLLSNTTTGVSPAMQRFFAALQAAA 121
V+R F+T QLR A +Q+S L+A Q+ ++D +LS +T+ ++ MQ FF +LQ
Sbjct: 61 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV 120

Query: 122 NNPSSTEAREAVLAQAEGLGKTFNTLYDQLDKQNSLINQQLGALASQVNHLSQSVASYND 181
+N AR+A++ ++EGL F T L Q+ +N +GA Q+N+ ++ +AS ND
Sbjct: 121 SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND 180

Query: 182 AIAK--AKSAGAVPNDLMDARDEAVRKLSEMIGVTAVTQDDNSVSLFIGSGQPLVVGNTV 239
I++ AGA PN+L+D RD+ V +L++++GV QD + ++ + +G LV G+T
Sbjct: 181 QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA 240

Query: 240 STLSVVPGLDDPTRYQVQLSNG--NSIQNVTGLVSGGEMGGLLAYRNSALDSSYNKLGQL 297
L+ VP DP+R V +G +I+ L++ G +GG+L +R+ LD + N LGQL
Sbjct: 241 RQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQL 300

Query: 298 AITLADTINKQLGQGLDLAGKAGANLFGDINDPDITALRVLAKNGNTGNVHANLNITDTS 357
A+ A+ N Q G D G AG + F I VL N G+V +TD S
Sbjct: 301 ALAFAEAFNTQHKAGFDANGDAGEDFF------AIGKPAVLQNTKNKGDVAIGATVTDAS 354

Query: 358 KLNSSDFRLDFDGTSFTARRLGDDASMQVTVSGTGPYTLSFKDANGVDQGFNLTLDQLPA 417
+ ++D+++ FD + RL + + VT + G LT PA
Sbjct: 355 AVLATDYKISFDNNQWQVTRLASNTTFTVT---------PDANGKVAFDGLELTFTGTPA 405

Query: 418 AGDRFTLQPTRRGAADIEATLKNASQLAFAGTARTESTTENRGTGKIGA 466
D FTL+P +++ + + +++A +E + A
Sbjct: 406 VNDSFTLKPVSDAIVNMDVLITDEAKIA----MASEEDAGDSDNRNGQA 450



Score = 83.9 bits (207), Expect = 6e-19
Identities = 49/111 (44%), Positives = 65/111 (58%), Gaps = 3/111 (2%)

Query: 569 FNDKGISDNRNALNLLALQTKPTVGGTDNTGSTYNEAYGGLVERVGTLTAQVRASSEASA 628
D G SDNRN LL LQ+ G ++N+AY LV +G TA ++ SS
Sbjct: 437 EEDAGDSDNRNGQALLDLQSNSKTVGGA---KSFNDAYASLVSDIGNKTATLKTSSATQG 493

Query: 629 TVLKQAQDSRDSLSGVSLDEEAANLIQFQQYYGASAQVIQVARTLFDTLIG 679
V+ Q + + S+SGV+LDEE NL +FQQYY A+AQV+Q A +FD LI
Sbjct: 494 NVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALIN 544


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
HWH78_RS04930  FLAGELLIN  55     3e-10    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 55.1 bits (132), Expect = 3e-10
Identities = 62/369 (16%), Positives = 122/369 (33%), Gaps = 14/369 (3%)

Query: 1 MRISTIQAFNNSVNGISRNYADLNRTFEQISTGKRILTPADDPVGSVRLLRLD-QEQGLN 59
I+T + N ++++ + L+ E++S+G RI + DD G R +GL
Sbjct: 2 QVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLT 61

Query: 60 EQYKTGMTEAKNSLSQEETILRSVGNVLQRIREIAGQAGDGALDSNDKKSLASELRQRED 119
Q + + E L + N LQR+RE++ QA +G +D KS+ E++QR +
Sbjct: 62 -QASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120

Query: 120 ELLNLLNSRDASGKYLFSGSQGSVQPFVRNEDGTYSYMGDESQREVQIASSTRIPVSDSG 179
E+ + N +G + S N+ T + + V+ V+
Sbjct: 121 EIDRVSNQTQFNGVKVLSQDNQMKIQVGANDGETITIDLQKID--VKSLGLDGFNVNGPK 178

Query: 180 KVLFEDIVNAARLDTKAAAGNTGDGRISVGLVEDELAFDSQFPASNPPAATDGFNIHFVS 239
+ D+ ++ + T G + V + + D+ P + N +
Sbjct: 179 EATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTT 238

Query: 240 DKEYVVYDPKSLPPGYDWTTYDPNSPPAWQLSKGAIDDDPKTIDKVLYAGVSVTIDGTPK 299
D T + A + K D Y GV+ TID
Sbjct: 239 DDAENNTAVDLF------KTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTG 292

Query: 300 AGDEFNVNYKPGSEKRSLLNVVSDLRKALESSTDNQAGNDAIRDATAVALTNLSAVAAAV 359
V+ + ++ ++ + A + ++ +
Sbjct: 293 NDGNGKVS----TTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKT 348

Query: 360 DGGQGKIGA 368
K+
Sbjct: 349 KNESAKLSD 357



Score = 37.3 bits (86), Expect = 1e-04
Identities = 24/94 (25%), Positives = 40/94 (42%)

Query: 326 KALESSTDNQAGNDAIRDATAVALTNLSAVAAAVDGGQGKIGARLNTVESTETFIDDVKL 385
A ST A + +TA L ++ + + VD + +GA N +S T + +
Sbjct: 398 TASGVSTLINEDAAAAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVT 457

Query: 386 VNASVMSQIQDLDYAEALSRLSLQSTIMDAAQQS 419
S S+I+D DYA +S +S + A
Sbjct: 458 NLNSARSRIEDADYATEVSNMSKAQILQQAGTSV 491


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS04950  DHBDHDRGNASE  108    2e-30    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 108 bits (270), Expect = 2e-30
Identities = 68/260 (26%), Positives = 133/260 (51%), Gaps = 13/260 (5%)

Query: 7 FNPFSLSGRRILVTGASSGLGLAIAQSCARMGAELIVTGRDQTRLDGCLTTLQSISELPH 66
N + G+ +TGA+ G+G A+A++ A GA + + +L+ +++L++ +
Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE 60

Query: 67 RAVQADLRVAEERAAVVAAVDSEIHG---LVHSAGISRLCPVRMMTEAHLHEVQSINVDS 123
A AD+R + + A ++ E+ LV+ AG+ R + +++ S+N
Sbjct: 61 -AFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTG 119

Query: 124 PMLLTQALLKRNLVAPGGSILFIASIAAHIGVAGVGAYSGTKAALIAMSRCLAMEVVKRN 183
++++ K + GSI+ + S A + + AY+ +KAA + ++CL +E+ + N
Sbjct: 120 VFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYN 179

Query: 184 IRVNCLSPSLVETPLL-------DAATQTV-GSMDGQRSNHPMG-FGKPEDIANAAIFML 234
IR N +SP ET + + A Q + GS++ ++ P+ KP DIA+A +F++
Sbjct: 180 IRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLV 239

Query: 235 SDASRWVTGTTLVMDGGLTI 254
S + +T L +DGG T+
Sbjct: 240 SGQAGHITMHNLCVDGGATL 259


96  HWH78_RS05010  HWH78_RS05065  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias   Product
HWH78_RS05010  -1      10       0.412062  transcriptional regulator FleQ
HWH78_RS05015  -1      10       0.744005  sensor histidine kinase FleS
HWH78_RS05020   0       9       0.801715  sigma-54-dependent response regulator
HWH78_RS05025   0      10       0.314735  flagellar hook-basal body complex protein FliE
HWH78_RS05030   1      11       0.522255  flagellar M-ring protein FliF
HWH78_RS05035   0      11       0.291494  flagellar motor switch protein FliG
HWH78_RS05040  -1       8       1.816634  flagellar assembly protein FliH
HWH78_RS05045   0       9       2.356459  flagellar protein export ATPase FliI
HWH78_RS05050   2      10       2.222567  flagella biosynthesis chaperone FliJ
HWH78_RS05055   1       9       2.183432  TorF family putative porin
HWH78_RS05060   0       9       2.369124  diguanylate cyclase RoeA
HWH78_RS05065   0      12       2.740602  multidrug efflux MFS transporter
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
HWH78_RS05010  HTHFIS  510    0.0      FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 510 bits (1314), Expect = 0.0
Identities = 181/489 (37%), Positives = 256/489 (52%), Gaps = 14/489 (2%)

Query: 5 TKLLLIDDNLDRSRDLAVILNFLGEDQLTCNS--EDWREVAAGLSNSREALCVLLGSVES 62
+L+ DD+ L L+ G D ++ WR +AAG + V+ V
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGD-----LVVTDVVMP 58

Query: 63 KGGAVELLKQLASWDEYLPILLI-GEPAPADWPEELRRRVLASLEMPPSYNKLLDSLHRA 121
A +LL ++ LP+L++ + + + L P +L+ + RA
Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 122 QVYREMYDQARERGRSREPNLFRSLVGTSRAIQQVRQMMQQVADTDASVLILGESGTGKE 181
+ R + LVG S A+Q++ +++ ++ TD +++I GESGTGKE
Sbjct: 119 LAEP----KRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKE 174

Query: 182 VVARNLHYHSKRREGPFVPVNCGAIPAELLESELFGHEKGAFTGAITSRAGRFELANGGT 241
+VAR LH + KRR GPFV +N AIP +L+ESELFGHEKGAFTGA T GRFE A GGT
Sbjct: 175 LVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGT 234

Query: 242 LFLDEIGDMPLPMQVKLLRVLQERTFERVGSNKTQNVDVRIIAATHKNLEKMIEDGTFRE 301
LFLDEIGDMP+ Q +LLRVLQ+ + VG DVRI+AAT+K+L++ I G FRE
Sbjct: 235 LFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFRE 294

Query: 302 DLYYRLNVFPIEMAPLRERVEDIALLLNELISRMEHEKRGSIRFNSAAIMSLCRHDWPGN 361
DLYYRLNV P+ + PLR+R EDI L+ + + E E RF+ A+ + H WPGN
Sbjct: 295 DLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGN 354

Query: 362 VRELANLVERLAIMHPYGVIGVGELPKKFR-HVDDEDEQLASSLREELEERAAINAGLPG 420
VREL NLV RL ++P VI + + R + D + A++ L A+ +
Sbjct: 355 VRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQ 414

Query: 421 MDAPAM-LPAEGLDLKDYLANLEQGLIQQALDDAGGVVARAAERLRIRRTTLVEKMRKYG 479
A LA +E LI AL G +AA+ L + R TL +K+R+ G
Sbjct: 415 YFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELG 474

Query: 480 MSRRDDDLS 488
+S S
Sbjct: 475 VSVYRSSRS 483


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS05015  PF06580  37     1e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 36.8 bits (85), Expect = 1e-04
Identities = 20/97 (20%), Positives = 32/97 (32%), Gaps = 19/97 (19%)

Query: 299 LVENA----IQACGPELRLKVHLYARADSLRLSVSDNGPGMDPATLARLGEPFFTTKTTG 354
LVEN I ++ + ++ L V + G T
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNT------------KES 310

Query: 355 TGLGLAVVKAVARAHQG---QLQLRSRPGRGTCATLI 388
TG GL V+ + G Q++L + G+ LI
Sbjct: 311 TGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLI 347


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
HWH78_RS05020  HTHFIS  505    e-179    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 505 bits (1302), Expect = e-179
Identities = 173/482 (35%), Positives = 255/482 (52%), Gaps = 18/482 (3%)

Query: 2 AAKVLLVEDDRALREALSDTLLLGGHEFVAVDSAEAALPVLAREAFSLVISDVNMPGMDG 61
A +L+ +DD A+R L+ L G++ +A +A LV++DV MP +
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 62 HQLLGLIRTRYPHLPVLLMTAYGAVDRAVEAMRQGAADYLVKPF--------EARALLDL 113
LL I+ P LPVL+M+A A++A +GA DYL KPF RAL +
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 114 VARHALGQLPGSEEDGPVALEPASRQLLELAARVARSDSTVLISGESGTGKEVLANYIHQ 173
R + + + V A +++ + AR+ ++D T++I+GESGTGKE++A +H
Sbjct: 123 KRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHD 182

Query: 174 QSPRAGKPFIAINCAAIPDNMLEATLFGHEKGSFTGAIAAQPGKFELADGGTILLDEISE 233
R PF+AIN AAIP +++E+ LFGHEKG+FTGA G+FE A+GGT+ LDEI +
Sbjct: 183 YGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGD 242

Query: 234 MPLGLQAKLLRVLQEREVERVGARKPINLDIRVLATTNRDLAAEVAAGRFREDLYYRLSV 293
MP+ Q +LLRVLQ+ E VG R PI D+R++A TN+DL + G FREDLYYRL+V
Sbjct: 243 MPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNV 302

Query: 294 FPLAWRPLRERPADILPLAERLLRKHSRKMNLGAVALGPEAAQCLVRHAWPGNVRELDNA 353
PL PLR+R DI L +++ + K L EA + + H WPGNVREL+N
Sbjct: 303 VPLRLPPLRDRAEDIPDLVRHFVQQ-AEKEGLDVKRFDQEALELMKAHPWPGNVRELENL 361

Query: 354 IQRALILQQGGLIQPADLCLTAPIGMPLAAPAPVPMPAIPPATPPSVE------IPSPAA 407
++R L +I + +P + + + +VE S
Sbjct: 362 VRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGD 421

Query: 408 GQDASGALGDDLRRREFQVIIDTLRTERGRRKEAAERLGISPRTLRYKLAQMRDAGMDVE 467
SG L E+ +I+ L RG + +AA+ LG++ TLR K +R+ G+ V
Sbjct: 422 ALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKK---IRELGVSVY 478

Query: 468 AY 469

Sbjct: 479 RS 480


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
HWH78_RS05025  FLGHOOKFLIE  92     9e-28    Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature.

Length = 103

Score = 91.6 bits (227), Expect = 9e-28
Identities = 41/92 (44%), Positives = 55/92 (59%)

Query: 18 QMEAMAKAKPVQAPAEAGAPSFSEMLSQAVDKVNETQQASTAMANAFEVGQSGVDLTDVM 77
Q++A A + Q SF+ L A+D++++TQ A+ A F +G+ GV L DVM
Sbjct: 12 QLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTLGEPGVALNDVM 71

Query: 78 IASQKASVSFQAMTQVRNKLVQAYQDIMQMPV 109
QKASVS Q QVRNKLV AYQ++M M V
Sbjct: 72 TDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 103


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS05030  FLGMRINGFLIF  608    0.0      Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 608 bits (1569), Expect = 0.0
Identities = 206/576 (35%), Positives = 311/576 (53%), Gaps = 39/576 (6%)

Query: 30 LDNLSEMTMLRQIGLLVGLAASVAIGFAVVLWSQQPDYKPLYGSLNGVDANRVVEALTAA 89
L+ L+ + +I L+V +A+VAI A+VLW++ PDY+ L+ +L+ D +V LT
Sbjct: 13 LEWLNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQM 72

Query: 90 DIPYKVEPNSGALLVKADDLGRARMKVASAGVAPTDNNVGFEILDKEQALGTSQFMEATN 149
+IPY+ SGA+ V AD + R+++A G+ P VGFE+LD+E G SQF E N
Sbjct: 73 NIPYRFANGSGAIEVPADKVHELRLRLAQQGL-PKGGAVGFELLDQE-KFGISQFSEQVN 130

Query: 150 YRRGLEGELARTVSSLNNVKAARVHLAIPKSSVFVRDDRKPSASVLVELYPGRSLEPSQV 209
Y+R LEGELART+ +L VK+ARVHLA+PK S+FVR+ + PSASV V L PGR+L+ Q+
Sbjct: 131 YQRALEGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALDEGQI 190

Query: 210 MAIVNLVATSVPELDKSQVTVVDQKGNLLSDQQELSELTMAGKQFDFTRRMEGLLTQRVH 269
A+V+LV+++V L VT+VDQ G+LL+ Q S + Q F +E + +R+
Sbjct: 191 SAVVHLVSSAVAGLPPGNVTLVDQSGHLLT-QSNTSGRDLNDAQLKFANDVESRIQRRIE 249

Query: 270 NILQPVLGNGRYKAEVSADVDFSAVESTSEMYNPDQPA----LRSEQRNNEERQNSSGPQ 325
IL P++GNG A+V+A +DF+ E T E Y+P+ A LRS Q N E+ + P
Sbjct: 250 AILSPIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVGAGYPG 309

Query: 326 GVPGALSNQPPGPASAPQQATASAPADYVAPGQPLKDANGQTIIDPKTGKPELAPYPTDK 385
GVPGALSNQP P AP + P P N Q T + P
Sbjct: 310 GVPGALSNQPAPPNEAP----IATP--------PTNQQNAQNTPQTSTSTNSNSAGPRST 357

Query: 386 RDQTTRNYELDRSISYTKQQQGRLRRLSVAVVLDDQMKVDAKTGEVSHQPWSADELARFT 445
+ T NYE+DR+I +TK G + RLSVAVV++ + D K P +AD++ +
Sbjct: 358 QRNETSNYEVDRTIRHTKMNVGDIERLSVAVVVNYKTLADGKP-----LPLTADQMKQIE 412

Query: 446 RLVQDSVGYDASRGDSVSVINAPFAPAQAEEIDSIPFYSQPWFWDIVKQVLGVLFILVLV 505
L ++++G+ RGD+++V+N+PF A +PF+ Q F D + L +LV+
Sbjct: 413 DLTREAMGFSDKRGDTLNVVNSPF-SAVDNTGGELPFWQQQSFIDQLLAAGRWLLVLVVA 471

Query: 506 F----GVLRPVLSNITGGGKGKSLAGGGGRDGDLALGESGLEGSLADDRVSIGGPSSILL 561
+ +RP L+ K ++ E +E L+ D ++ L
Sbjct: 472 WILWRKAVRPQLTRRVEEAKAAQEQAQVRQE-----TEEAVEVRLSKDEQLQQRRANQRL 526

Query: 562 PSPTEGYDAQLNAIKNLVAQDPGRVAQVVKEWINAD 597
G + I+ + DP VA V+++W++ D
Sbjct: 527 -----GAEVMSQRIREMSDNDPRVVALVIRQWMSND 557


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS05035  FLGMOTORFLIG  305    e-105    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 305 bits (784), Expect = e-105
Identities = 109/330 (33%), Positives = 204/330 (61%)

Query: 9 KLTKVDKAAILLLSLGETDAAQVLRHMGPKEVQRVGVAMASMRNVHREQVEQVMGEFVEV 68
LT KAAILL+S+G +++V +++ +E++ + +A + + E + V+ EF E+
Sbjct: 14 ALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFKEL 73

Query: 69 VGDQTSLGVGADGYIRKMLTQALGEDKANNLIDRILLGGSTSGLDSLKWMEPRAVADVIR 128
+ Q + G Y R++L ++LG KA ++I+ + + + ++ +P + + I+
Sbjct: 74 MMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFVRRADPANILNFIQ 133

Query: 129 YEHPQIQAIVVAYLDPDQAAEVLSHFDHKVRLDIVLRVSSLNTVQPSALKELNLILEKQF 188
EHPQ A++++YLDP +A+ +LS +V+ ++ R++ ++ P ++E+ +LEK+
Sbjct: 134 QEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVLEKKL 193

Query: 189 AGNSNATRTTMGGVKRAADIMNYLDSSIEGQLMDSIREVDEDLSGQIEDLMFVFDNLADV 248
A S+ T+ GGV +I+N D E +++S+ E D +L+ +I+ MFVF+++ +
Sbjct: 194 ASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFEDIVLL 253

Query: 249 DDRGIQALLREVSSDVLVLALKGSDEAIREKVFKNMSKRAAELLRDDLEAKGPVRVSEVE 308
DDR IQ +LRE+ L ALK D ++EK+FKNMSKRAA +L++D+E GP R +VE
Sbjct: 254 DDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTRRKDVE 313

Query: 309 GAQKEILTIARRMAESGDIVLGGKGGEEMI 338
+Q++I+++ R++ E G+IV+ G E+++
Sbjct: 314 ESQQKIVSLIRKLEEQGEIVISRGGEEDVL 343


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS05040  FLGFLIH  56     1e-11    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 56.3 bits (135), Expect = 1e-11
Identities = 47/202 (23%), Positives = 93/202 (46%), Gaps = 11/202 (5%)

Query: 40 VAAPQVPAVAEPAPAPPAVEEVELETVKPPTLEEIEAIRQDAYNEGFATGERDGFHAGQL 99
+A PQ V P +EE E + +++A Q Y G A G + G G
Sbjct: 15 LAPPQAEFVPIVEPEETIIEEAEPSLEQQLAQLQMQAHEQ-GYQAGIAEGRQQGHKQGYQ 73

Query: 100 KARQEAEEALKERLQS--------LERLMTQLLEPIAEQDALIEQGMVNLVNHVARQVIQ 151
+ + E +S +++L+++ + D++I ++ + ARQVI
Sbjct: 74 EGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRLMQMALEAARQVIG 133

Query: 152 RELHMDSSHVRQVLREALKLLPMGAANIRIHVNPQDFERVKAL--RERHEESWRILEDDS 209
+ +D+S + + +++ L+ P+ + ++ V+P D +RV + WR+ D +
Sbjct: 134 QTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGATLSLHGWRLRGDPT 193

Query: 210 LLPGGCRIETEHSRIDATIETR 231
L PGGC++ + +DA++ TR
Sbjct: 194 LHPGGCKVSADEGDLDASVATR 215


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS05050  FLGFLIJ  54     2e-12    Flagellar FliJ protein signature.
		>FLGFLIJ#Flagellar FliJ protein signature.

Length = 147

Score = 54.4 bits (130), Expect = 2e-12
Identities = 46/134 (34%), Positives = 74/134 (55%)

Query: 8 LAPVVDMASKAERDAATQLGRCQQQLLAAQQKLAELERYRNDYQQQWISQGQKGVSGQWL 67
LA + D+A K DAA LG ++ A+++L L Y+N+Y+ S G++
Sbjct: 7 LATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAGITSNRW 66

Query: 68 MNYQRFLSQLETAVAQQANSVTWHREAVDKARLNWQERYARLEGLRKLVERYLEEARQAE 127
+NYQ+F+ LE A+ Q + + VD A +W+E+ RL+ + L ER A AE
Sbjct: 67 INYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQSTAALLAE 126

Query: 128 DKREQKQLDELAQR 141
++ +QK++DE AQR
Sbjct: 127 NRLDQKKMDEFAQR 140


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS05065  TCRTETA  57     4e-11    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 57.1 bits (138), Expect = 4e-11
Identities = 82/337 (24%), Positives = 125/337 (37%), Gaps = 34/337 (10%)

Query: 4 RPRPPLLLVLALLALPQVAETILSPALPALASHWRLDDATSQWT------MALFFVGFAP 57
+P PL+++L+ +AL V ++ P LP L + + AL AP
Sbjct: 2 KPNRPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAP 61

Query: 58 GIWLWGWLADRLGRRPALLGGLGLAALATFGAWASTDYSYLLACRLVQGLGLATCSVTVQ 117
+ G L+DR GRRP LL L AA+ + L R+V G+ AT +V
Sbjct: 62 ---VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAV-AG 117

Query: 118 ASLRDVLQGPALMSYFVTLGAVLAWSPAVGPLGGQWLADLGGH-PAVFATLAVLLASLAA 176
A + D+ G +F + A + GP+ G + H P A L L
Sbjct: 118 AYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTG 177

Query: 177 LVV---PAWPETRPLLAGTPEPATLAIFRRVLADRPLQTRALLVAVLNVLVFSFYAAGPF 233
+ E RPL P FR + + V + LV AA
Sbjct: 178 CFLLPESHKGERRPLRREALNPLAS--FRWARGMTVVAA-LMAVFFIMQLVGQVPAALWV 234

Query: 234 MVGDLPGLGFGW----IGLAIAIAGSLGAL----LNRRLPRTWNSARRVRLGLALAAAGA 285
+ G+ F W IG+++A G L +L + + R + LG+ A
Sbjct: 235 IFGEDR---FHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGM---IADG 288

Query: 286 TAQTLLAAVGYAEGLYWALPALPIFIGFGVAIPNLLG 322
T LLA + A P + + G+ +P L
Sbjct: 289 TGYILLAFATRG---WMAFPIMVLLASGGIGMPALQA 322


97  HWH78_RS05320  HWH78_RS05360  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias   Product
HWH78_RS05320  -1      14       2.608384  winged helix-turn-helix domain-containing
HWH78_RS05325  -1      13       2.433819  HAMP domain-containing protein
HWH78_RS05330  -1      15       2.478917  cold-shock protein
HWH78_RS05335  -1      16       3.249915  hypothetical protein
HWH78_RS05340  -2      17       3.101655  methyltransferase domain-containing protein
HWH78_RS05345  -2      15       2.467728  succinyl-diaminopimelate desuccinylase
HWH78_RS05350  -2      13       1.838082  glycosyltransferase NdvB
HWH78_RS05355  -1      11       0.894800  tRNA cyclic N6-threonylcarbamoyladenosine(37)
HWH78_RS05360  -3      10       0.568822  4'-phosphopantetheinyl transferase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS05320  HTHFIS  79  3e-19  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.5 bits (196), Expect = 3e-19
Identities = 37/148 (25%), Positives = 66/148 (44%), Gaps = 3/148 (2%)

Query: 7 RILIVEDDRRLAELTREYLEGNGLKVDIEANGALAAARILAERPDLVVLDLMLPGEDGLS 66
IL+ +DD + + + L G V I +N A I A DLVV D+++P E+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 67 ICRQVR-PQFDGPILMLTARTDDMDEVLGLEMGADDYVCKPVRPRVLLARIRALLRRSEA 125
+ +++ + D P+L+++A+ M + E GA DY+ KP L+ I L +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 126 PEAGAPAADSKRLAFGRLVIDNAMREAW 153
+ + + AM+E +
Sbjct: 125 RPSKLEDD--SQDGMPLVGRSAAMQEIY 150


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS05325  PF06580  41  8e-06  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 41.0 bits (96), Expect = 8e-06
Identities = 21/107 (19%), Positives = 35/107 (32%), Gaps = 25/107 (23%)

Query: 431 LQNLLTNALRHA------DRRVRISYRVSLERCRVDVEDDGPGVPEAQWERLFTPFLRLD 484
+Q L+ N ++H ++ + ++VE+ G + E
Sbjct: 260 VQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE---------- 309

Query: 485 DSRTRASGGHGLGLSIVR-RIVYWHGGRASIGRSETLGGACFTLAWP 530
G GL VR R+ +G A I SE G + P
Sbjct: 310 --------STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS05350  PF05704  31  0.024  Capsular polysaccharide synthesis protein
		>PF05704#Capsular polysaccharide synthesis protein

Length = 307

Score = 30.6 bits (69), Expect = 0.024
Identities = 7/32 (21%), Positives = 20/32 (62%), Gaps = 1/32 (3%)

Query: 430 YNEPPELLKQTLDALARLDYPDYEVLVIDNNT 461
+ P +++Q + ++ + + D++V++ID N
Sbjct: 79 IEKAPYIVQQCVASV-KKNSGDFKVIIIDGNN 109


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS05355  ISCHRISMTASE  30  0.009  Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 30.0 bits (67), Expect = 0.009
Identities = 14/51 (27%), Positives = 18/51 (35%), Gaps = 3/51 (5%)

Query: 109 MAEYIVDF--DYLIDCIDSVAAKAALIAWCKRRKIPVITTGGAGGQVDPTQ 157
M Y VD + A L C + IPV+ T G Q +P
Sbjct: 38 MQNYFVDAFTAGASPVTELSANIRKLKNQCVQLGIPVVYTAQPGSQ-NPDD 87


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS05360  ENTSNTHTASED  89  2e-23  Enterobactin synthetase component D signature.
		>ENTSNTHTASED#Enterobactin synthetase component D signature.

Length = 234

Score = 88.5 bits (219), Expect = 2e-23
Identities = 62/200 (31%), Positives = 93/200 (46%), Gaps = 12/200 (6%)

Query: 15 LDDRWPLPVALPGVQLRSTRFDPALLQPGDFALAGIQPPANILRAVAKRQAEFLAGRLCA 74
L +PLP A G +L FD + + D L + + A KR+AE LAGR+ A
Sbjct: 2 LTSHFPLPFA--GHRLHIVDFDASSFREHD--LLWLPHHDRLRSAGRKRKAEHLAGRIAA 57

Query: 75 RAALFALDGRAQTPAVGEDRAPVWPAAISGSITHGDRWAAALVAARGDWRGLGLDVETLL 134
AL + G P +G+ R P+WP + GSI+H A A+++ + +G+D+E ++
Sbjct: 58 VHALREV-GVRTVPGMGDKRQPLWPDGLFGSISHCATTALAVISR----QRIGIDIEKIM 112

Query: 135 EAERARYLHGEILTEGERLRFADDLERRTGLLVTLAFSLKESLFKALYPLVGKRFYFEHA 194
A L I+ ER L L +TLAFS KES++KA + F A
Sbjct: 113 SQHTATELAPSIIDSDERQILQASL-LPFPLALTLAFSAKESVYKA-FSDRVTLPGFNSA 170

Query: 195 ELLEWRADGQARLRLLTDLS 214
++ A L LL +
Sbjct: 171 KVTSLTA-THISLHLLPAFA 189


98  HWH78_RS05690  HWH78_RS05775  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS05690  0       14       2.848161   HlyD family secretion protein
HWH78_RS05695  0       12       3.568931   FUSC family protein
HWH78_RS05700  1       11       4.508925   DUF2790 domain-containing protein
HWH78_RS05705  0       10       4.624811   thioredoxin family protein
HWH78_RS05710  -1      7        4.611979   helix-turn-helix transcriptional regulator
HWH78_RS05715  -2      9        4.026154   multidrug efflux MFS transporter
HWH78_RS05720  -1      7        4.244209   HlyD family secretion protein
HWH78_RS05725  -1      8        3.403161   TolC family protein
HWH78_RS05730  -1      8        2.793630   hypothetical protein
HWH78_RS05735  -1      10       2.664816   S8/S53 family peptidase
HWH78_RS05740  -2      10       2.304302   PAS domain-containing protein
HWH78_RS05745  -2      10       1.233195   hypothetical protein
HWH78_RS05750  -1      10       1.390772   hypothetical protein
HWH78_RS05755  -2      10       1.724250   alkaline protease secretion ATP-binding protein
HWH78_RS05760  -2      9        1.301607   alkaline protease secretion protein AprE
HWH78_RS05765  -2      7        2.052679   alkaline protease secretion protein AprF
HWH78_RS05770  0       8        1.979764   serralysin family metalloprotease AprA
HWH78_RS05775  1       9        3.322263   alkaline proteinase inhibitor AprI
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS05690  RTXTOXIND  66  4e-14  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 65.6 bits (160), Expect = 4e-14
Identities = 43/214 (20%), Positives = 75/214 (35%), Gaps = 39/214 (18%)

Query: 79 RSYRLAVRQREAELEQARETLRQRDEQWRRRMQLAGAVSREEVANAGRALRIARARAEAA 138
R Y+ + Q E+E+ A+E + + ++ + LR
Sbjct: 269 RVYKSQLEQIESEILSAKEEYQLVTQLFKNEIL--------------DKLRQTTDNIGLL 314

Query: 139 GAALDQARLDLRRSELRSPVDGYVTQLRVQ-PGDYAAAGRTNIFIV-DRRSFWVTGYFEE 196
L + + S +R+PV V QL+V G T + IV + + VT +
Sbjct: 315 TLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQN 374

Query: 197 TKLRNVQVGAPATIKLMGFD----PLLDGHVASIGRGVADLNESRADSGLPQVSPNFSWI 252
+ + VG A IK+ F L G V +I D+ Q
Sbjct: 375 KDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNIN----------LDAIEDQRLGLV--- 421

Query: 253 RLAQRVPVRIELDRVPA---GVVLAAGMTGSVEV 283
V + IE + + + L++GM + E+
Sbjct: 422 ---FNVIISIEENCLSTGNKNIPLSSGMAVTAEI 452



Score = 47.5 bits (113), Expect = 3e-08
Identities = 18/114 (15%), Positives = 41/114 (35%), Gaps = 3/114 (2%)

Query: 41 VSAQVIRIAPEVSGSVEAVFVADNQRVARGDPLYRIDPRSYRLAVRQREAELEQARETLR 100
S + I P + V+ + V + + V +GD L ++ + ++ L QAR +
Sbjct: 92 HSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLE-Q 150

Query: 101 QRDEQWRRRMQLAGAVSREEVANAGRALRIARARAEAAGAALDQARLDLRRSEL 154
R + R ++L E + +L + + +++
Sbjct: 151 TRYQILSRSIELNKL--PELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQK 202


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS05710  HTHFIS  34  5e-04  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 34.0 bits (78), Expect = 5e-04
Identities = 14/103 (13%), Positives = 31/103 (30%), Gaps = 6/103 (5%)

Query: 87 RHDLPQDCRVVDVPPLLRQLIVAAMRIAPDYPPGGRDERVMELILDELRVLPILALHVPQ 146
R + + R + + + ++ + D L + + +
Sbjct: 376 REIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAE 435

Query: 147 PVDPRLAALCRSLRAEPAADWSLGDAARRLGVSPRTLTRAFQR 189
P + L A A + AA LG++ TL + +
Sbjct: 436 MEYPLI------LAALTATRGNQIKAADLLGLNRNTLRKKIRE 472


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS05715  TCRTETB  109  7e-28  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 109 bits (273), Expect = 7e-28
Identities = 79/402 (19%), Positives = 168/402 (41%), Gaps = 17/402 (4%)

Query: 23 FMAGMNVHVTSAALPEIEGALGATFEEGSWISTAYLVAEISMIPLTAWLVEVFSLRRVML 82
F + +N V + +LP+I +W++TA+++ + L + ++R++L
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 83 LGSLVFLLSSLSCALAPN-LSTLILIRVIQGASGAVLIPLSMQLILTELPSSRIPLGMAL 141
G ++ S+ + + S LI+ R IQGA A L M ++ +P L
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 142 FSLSNSVAQAAGPSIGGWLADAYSWRWIFLLQLLPGIALLAAVAWSIRPRDGDRERLRQA 201
++ + GP+IGG +A W ++ L+ ++ I + + + R +
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKL---LKKEVRIK-GHF 199

Query: 202 DWLGIGAMVAGLGALQIVLEEGGRRDWFESGFIRTFAVLAVLALLLFVQRQLWGARPFIN 261
D GI M G+ + F + + +F +++VL+ L+FV+ PF++
Sbjct: 200 DIKGIILMSVGIVFFML----------FTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVD 249

Query: 262 LRLLGSYNFGVSSLAMAVFGAATFGLVFLVPNYLSQLQGFNARQIGDSLILYGLVQLLL- 320
L + F + L + G V +VP + + + +IG +I G + +++
Sbjct: 250 PGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIF 309

Query: 321 APLLPRLMRWLNPKLLVAGGFAIMALGCWMGAHLNADAGRNVIIPSIVVRGIGQPLIMVA 380
+ L+ P ++ G +++ ++ A + + IV G
Sbjct: 310 GYIGGILVDRRGPLYVLNIGVTFLSVS-FLTASFLLETTSWFMTIIIVFVLGGLSFTKTV 368

Query: 381 LSVLAVKGLDKAQAGSASALISMLRNLGGAIGTALLTQLVSL 422
+S + L + +AG+ +L++ L G A++ L+S+
Sbjct: 369 ISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSI 410


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS05720  RTXTOXIND  121  1e-32  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 121 bits (304), Expect = 1e-32
Identities = 61/368 (16%), Positives = 110/368 (29%), Gaps = 68/368 (18%)

Query: 66 AVSAQVSGYVAEVLVADDADVQAGDLLLRLDPRDFR-------QRLRAAEAREAAAQAAL 118
+ + V E++V + V+ GD+LL+L L A + Q
Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILS 157

Query: 119 EAQ-------------------------------RAKLETLDRQLLEQAQTISRARADGE 147
+ + + T Q ++ + + RA+
Sbjct: 158 RSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERL 217

Query: 148 AARAEWRRAETDWR-------RYRQLADEHATSRQRLENADAVHQRARAAARRASAEEGR 200
A R E R + L + A ++ + + + A R ++ +
Sbjct: 218 TVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQ 277

Query: 201 QRAARDVLKSR--------RREAEAALAQRQAELQEAAAARELARHALDDTEIRAPFAGR 252
+ K + E L Q + + IRAP + +
Sbjct: 278 IESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVK 337

Query: 253 VGQRKVRLR-QYVTPGLPLLAVVPLEQAYVV-ANYKETQLERIRPGQPVELEVDTFGRRW 310
V Q KV VT L+ +VP + V A + + I GQ ++V+ F
Sbjct: 338 VQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTR 397

Query: 311 RGRVDSVAPASGAVFALLPPDNATGNFTKIVQRFPVRIRLDADAAERG----RLLPGMSV 366
G + + D +V F V I ++ + G L GM+V
Sbjct: 398 YGYLV-------GKVKNINLDAIEDQRLGLV--FNVIISIEENCLSTGNKNIPLSSGMAV 448

Query: 367 IATVDTRE 374
A + T
Sbjct: 449 TAEIKTGM 456


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS05735  SUBTILISIN  88  3e-21  Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 88.4 bits (219), Expect = 3e-21
Identities = 60/293 (20%), Positives = 104/293 (35%), Gaps = 51/293 (17%)

Query: 256 VRIGVIERDVDFDAPDFADYLGPCKAPAPRTCLYARDAERPDNHGSTVAGILAARWDQGG 315
V++ V++ D D PD + + + + HG+ VAG +AA
Sbjct: 43 VKVAVLDTGCDADHPDLKARIIGGRNFTDDDEGDPEIFKDYNGHGTHVAGTIAA----TE 98

Query: 316 NSGFLRGLDRASQGFEVIVERNSDAGITANVAASVN-LVEDGVRVLNWSWGIHRVGARDV 374
N + G+ + + V +G + + +E V +++ S G
Sbjct: 99 NENGVVGVAPEADLLIIKVLNKQGSGQYDWIIQGIYYAIEQKVDIISMSLG------GPE 152

Query: 375 DGDEVDSLVRSGIAMSGYEELLEEFFLWLRKEHPDVLVVNSAGN-GSSYSGTDEYRLPSS 433
D E+ V+ +A +LV+ +AGN G TDE P
Sbjct: 153 DVPELHEAVKKAVA-------------------SQILVMCAAGNEGDGDDRTDELGYPGC 193

Query: 434 FVTEQLLVVGGHQRSERQGLAVDDPAYAVKRSTSNVDMRVDVTAAACTHASTLERDARGE 493
+ +++ VG A++ +A SN + VD+ A ST+ +
Sbjct: 194 Y--NEVISVG----------AINFDRHAS--EFSNSNNEVDLVAPGEDILSTV-PGGKYA 238

Query: 494 VHCGTSYATPMVAGTVAAMLSLNPRLR-----PEEIRMLLRRSAMTIGGDYDF 541
GTS ATP VAG +A + L E+ L + + +G
Sbjct: 239 TFSGTSMATPHVAGALALIKQLANASFERDLTEPELYAQLIKRTIPLGNSPKM 291


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS05740  HTHFIS  80  8e-18  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 80.3 bits (198), Expect = 8e-18
Identities = 32/119 (26%), Positives = 52/119 (43%), Gaps = 5/119 (4%)

Query: 742 THVLLVDDNRMVRYTTALLLGDLGYQVSEAASAEEALGEVERGLAPDLLVTDHLMADKTG 801
+L+ DD+ +R L GY V ++A + G DL+VTD +M D+
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAG-DGDLVVTDVVMPDENA 62

Query: 802 VQLAEELRQRFPQLPVLVITGYANL----RPEQLNGFEVLTKPFRHNELAERLARLLEA 856
L +++ P LPVLV++ + + ++ L KPF EL + R L
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS05760  RTXTOXIND  436  e-153  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 436 bits (1122), Expect = e-153
Identities = 99/423 (23%), Positives = 182/423 (43%), Gaps = 2/423 (0%)

Query: 11 AYARLGWLLVLFGFGGALLWAAFAPLDQGVAVPATVIISGQRKSVQHPLGGVVKHILVRD 70
RL ++ A + + ++ + SG+ K ++ +VK I+V++
Sbjct: 55 RRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKE 114

Query: 71 GQHVEAGEPLIRMEPTQARANVDSLLNRYANARLNQARLQAEYDGRRTLEMPA-GLAEQA 129
G+ V G+ L+++ A A+ + ARL Q R Q ++P L ++
Sbjct: 115 GESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEP 174

Query: 130 PLPTLGERLEL-QRQLLRSRQTALANELSALRANIEGLRAQLEGLRQTEGNQRLQQRLLN 188
+ E L L++ + + N+ N++ RA+ + R+
Sbjct: 175 YFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEK 234

Query: 189 SQLSGARDLAEEGYMPRNQLLEQERQLAEVNARLSESSGRFGQIRQSIAEAQMRIAQREE 248
S+L L + + ++ +LEQE + E L + QI I A+ +
Sbjct: 235 SRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQ 294

Query: 249 EYRKEVNGQLAETQVNARTLWEELSSARYELRHAEIRAPVSGYVAGLKVFTDGGVIGPGE 308
++ E+ +L +T N L EL+ + + IRAPVS V LKV T+GGV+ E
Sbjct: 295 LFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE 354

Query: 309 LLMYIVPNSDSLEVEGQLAVNLVDRIHSGLPVEMLFTAFNQSKTPRVTGEVTMVSADRLL 368
LM IVP D+LEV + + I+ G + AF ++ + G+V ++ D +
Sbjct: 355 TLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIE 414

Query: 369 DEQNKQPYYALRAQVDAATMGKLKGLQIRPGMAVQVFVRTGERSLLNYLFKPLFDRAHVA 428
D++ + + + + K + + GMAV ++TG RS+++YL PL + +
Sbjct: 415 DQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVISYLLSPLEESVTES 474

Query: 429 LAE 431
L E
Sbjct: 475 LRE 477


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS05770  CABNDNGRPT  418  e-145  NodO calcium binding signature.
		>CABNDNGRPT#NodO calcium binding signature.

Length = 479

Score = 418 bits (1075), Expect = e-145
Identities = 254/480 (52%), Positives = 319/480 (66%), Gaps = 29/480 (6%)

Query: 10 GRSDAYTQVDNFLHAYARGGDELVNGHPSYTVDQAAEQILREQASWQKAPGDSVLTLSYS 69
S AY V +FL + RG VNG SY++DQAA QI RE SW G +V S +
Sbjct: 19 NTSSAYNSVYDFLRYHDRGDGLTVNGKTSYSIDQAAAQITRENVSWN---GTNVFGKSAN 75

Query: 70 FLTKPNDFFSTPWKYVSDIYSLGK----FSAFSAQQQAQAKLSLQSWSDVTNIHFVDAGQ 125
+K++ + S+ F F+A+Q QAKLSLQSWSDV N+ F +
Sbjct: 76 ----------LTFKFLQSVSSIPSGDTGFVKFNAEQIEQAKLSLQSWSDVANLTFTEVTG 125

Query: 126 GDQGDLTFGNFSSSVGG------AAFAFLPDVPDALKGQSWYLINSSYSANVNPANGNYG 179
++TFGN++ G A+A+ P G SWY N S NP + YG
Sbjct: 126 NKSANITFGNYTRDASGNLDYGTQAYAYYPGNYQG-AGSSWYNYNQSN--IRNPGSEEYG 182

Query: 180 RQTLTHEIGHTLGLSHPGDYNAGEGDPTYADATYAEDTRAYSVMSYWEEQNTGQDFKGAY 239
RQT THEIGH LGL+HPG+YNAGEGDP+Y DA YAED+ +S+MSYW E TG D+ G Y
Sbjct: 183 RQTFTHEIGHALGLAHPGEYNAGEGDPSYNDAVYAEDSYQFSIMSYWGENETGADYNGHY 242

Query: 240 SSAPLLDDIAAIQKLYGANLTTRTGDTVYGFNSNTERDFYSATSSSSKLVFSVWDAGGND 299
AP++DDIAAIQ+LYGAN+TTRTGD+VYGFNSNT+RDFY+AT SS L+FSVWDAGG D
Sbjct: 243 GGAPMIDDIAAIQRLYGANMTTRTGDSVYGFNSNTDRDFYTATDSSKALIFSVWDAGGTD 302

Query: 300 TLDFSGFSQNQKINLNEKALSDVGGLKGNVSIAAGVTVENAIGGSGSDLLIGNDVANVLK 359
T DFSG+S NQ+INLNE + SDVGGLKGNVSIA GVT+ENAIGGSG+D+L+GN N+L+
Sbjct: 303 TFDFSGYSNNQRINLNEGSFSDVGGLKGNVSIAHGVTIENAIGGSGNDILVGNSADNILQ 362

Query: 360 GGAGNDILYGGLGADQLWGGAGADTFVYGDIAESSAAAPDTLRDFVSGQDKIDLSGLDAF 419
GGAGND+LYGG GAD L+GGAG DTFVYG +S+ AA D + DF G DKIDLS AF
Sbjct: 363 GGAGNDVLYGGAGADTLYGGAGRDTFVYGSGQDSTVAAYDWIADFQKGIDKIDLS---AF 419

Query: 420 VNGGLVLQYVDAFAGKAGQAILSYDAASKAGSLAIDFSGDAHADFAINLIGQATQADIVV 479
N G + D F GK + +L +DAA+ +L + +G + DF + ++GQA Q+DI+V
Sbjct: 420 RNEGQLSFVQDQFTGKGQEVMLQWDAANSITNLWLHEAGHSSVDFLVRIVGQAAQSDIIV 479


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS05775  MPTASEINHBTR  129  5e-42  Metalloprotease inhibitor signature.
		>MPTASEINHBTR#Metalloprotease inhibitor signature.

Length = 122

Score = 129 bits (325), Expect = 5e-42
Identities = 40/118 (33%), Positives = 58/118 (49%), Gaps = 9/118 (7%)

Query: 12 CLLCGFFSTGI-SMASSLILLSASDLAGQWTLQQDEAPAICHLELRDSEVAEASGYDLGG 70
F S G +MASS ++ S + +AGQ ++ +C +E A A L G
Sbjct: 11 VWQVLFVSAGAQAMASSFVVPSTAQMAGQLGIEATG-SGVC---AGPAEQANA----LAG 62

Query: 71 DTACLTRWLPSEPRAWRPTPAGIALLERGGLTLMLLGRQGEGDYRVQKGDGGQLVLRR 128
D AC +WL +P +W PTP GI L+ G + L RQ EG+Y + G + L+R
Sbjct: 63 DVACAEQWLGDKPVSWSPTPDGIWLMNAEGTGITHLNRQKEGEYTGRTPSGADVTLQR 120


99  HWH78_RS06445  HWH78_RS06485  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS06445  1       12       0.993559   GNAT family N-acetyltransferase
HWH78_RS06450  0       14       1.442411   hypothetical protein
HWH78_RS06455  1       13       1.866760   oxidoreductase
HWH78_RS06460  0       13       -0.802912  AraC family transcriptional regulator
HWH78_RS06465  0       14       -1.245781  hypothetical protein
HWH78_RS06470  0       20       -3.098584  hypothetical protein
HWH78_RS06475  1       23       -3.829390  two-component system sensor histidine kinase
HWH78_RS06480  4       33       -5.617124  response regulator transcription factor
HWH78_RS06485  5       35       -6.016395  type VI secretion system tip protein VgrG
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS06445  SACTRNSFRASE  38  9e-06  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 37.6 bits (87), Expect = 9e-06
Identities = 15/51 (29%), Positives = 23/51 (45%), Gaps = 1/51 (1%)

Query: 93 VAVAWQGKGVGSRLLGELLDIADNWMNLRRVELTVYTDNAPALALYRKFGF 143
VA ++ KGVG+ LL + ++ A + + L N A Y K F
Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKE-NHFCGLMLETQDINISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS06455  DHBDHDRGNASE  110  3e-31  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 110 bits (277), Expect = 3e-31
Identities = 61/185 (32%), Positives = 86/185 (46%), Gaps = 3/185 (1%)

Query: 5 KTLLITGASSGFGQALAREALDAGHRVVGTVRSEEARSALEAVAPGQAFGR---LLDVTD 61
K ITGA+ G G+A+AR G + + E + + +A DV D
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 62 LAAIEPTVAAIERDIGPLDVLVNSAGYGHEGILEESPLAEMRRQFEVNLFGAVAMIQAVL 121
AAI+ A IER++GP+D+LVN AG G++ E F VN G ++V
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 122 PYMRRRRRGHILNITSMGGYITMPGIAYYCGSKFALEGVSEALGKEVAGLGIAVTAVAPG 181
YM RR G I+ + S + +A Y SK A ++ LG E+A I V+PG
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 182 SFRTD 186
S TD
Sbjct: 189 STETD 193


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS06475  HTHFIS  50  2e-08  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 49.8 bits (119), Expect = 2e-08
Identities = 31/124 (25%), Positives = 51/124 (41%), Gaps = 9/124 (7%)

Query: 416 LTGLRVCLVEDDRNVLRATSALLERWGCTVQ-AETEADGWRTDC----DILVVDYDLGPH 470
+TG + + +DD + + L R G V+ A WR D++V D + P
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVM-PD 59

Query: 471 ASGVECIERVRRQRGEAIPALVISGH-DIERIQASVEDTDIALLSKPVRPTELRATL-RA 528
+ + + R+++ +P LV+S + E L KP TEL + RA
Sbjct: 60 ENAFDLLPRIKK-ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 529 LRER 532
L E
Sbjct: 119 LAEP 122


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS06480  HTHFIS  56  2e-11  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 56.0 bits (135), Expect = 2e-11
Identities = 34/157 (21%), Positives = 58/157 (36%), Gaps = 5/157 (3%)

Query: 3 GRIIVADDHPLFREGMLSILQRLLPEARIEEAGDLAGVLRLAGEGEQPDSLILDLRFPGL 62
I+VADD R + L R + + A + R G D ++ D+ P
Sbjct: 4 ATILVADDDAAIRTVLNQALSRA--GYDVRITSNAATLWRWIAAG-DGDLVVTDVVMPDE 60

Query: 63 TRIEMLADLRRQFPRTTLIVVSMVDDPQLIGEVMNAGADGFLGKSIAPEELGQAILAIRA 122
++L +++ P ++V+S + + GA +L K EL I RA
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG--RA 118

Query: 123 GEVLVRYEPSGLLPLQPSPRLEGLTERQLDVLRLLAQ 159
R Q L G + ++ R+LA+
Sbjct: 119 LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLAR 155


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS06485  ICENUCLEATIN  30  0.040  Ice nucleation protein signature.
		>ICENUCLEATIN#Ice nucleation protein signature.

Length = 1258

Score = 30.1 bits (67), Expect = 0.040
Identities = 27/92 (29%), Positives = 41/92 (44%), Gaps = 2/92 (2%)

Query: 523 RISRDSRSLVENDRFEQVNMNSSSLIKGDELHTTQGERHTRIGGNELLSISGAGSIAVDG 582
+I+ SL+ Q+ N S LI G T G R T I G + + ++G + G
Sbjct: 1080 QIASHRSSLIAGPESTQITGNRSMLIAGKGSSQTAGYRSTLISGADSVQMAGERGKLIAG 1139

Query: 583 TWVVQ-AGSQARVTA-TNVLVDAGVNLTLKAG 612
Q AG ++++ A N + AG L AG
Sbjct: 1140 ADSTQTAGDRSKLLAGNNSYLTAGDRSKLTAG 1171


100  HWH78_RS06655  HWH78_RS06815  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS06655  -2      8        0.236671   acetyltransferase
HWH78_RS06660  -2      9        0.722271   cation-transporting P-type ATPase
HWH78_RS06665  -2      9        0.214326   transcriptional regulator LasR
HWH78_RS06670  -1      10       0.834698   acyl-homoserine-lactone synthase LasI
HWH78_RS06675  0       12       1.968485   EAL domain-containing protein
HWH78_RS06680  1       14       1.831043   transglutaminase-like cysteine peptidase
HWH78_RS06685  1       14       1.565225   multidrug efflux RND transporter periplasmic
HWH78_RS06690  0       12       1.485620   multidrug efflux RND transporter permease
HWH78_RS06695  0       10       1.118420   heavy metal response regulator transcription
HWH78_RS06700  0       11       0.693170   sensor histidine kinase
HWH78_RS06705  0       8        -0.462912  YbaN family protein
HWH78_RS06710  0       9        0.124120   YecA family protein
HWH78_RS06715  0       12       -0.194493  flagellar hook-length control protein FliK
HWH78_RS06720  3       17       -1.518967  flagellar basal body-associated protein FliL
HWH78_RS06725  5       19       -0.958023  flagellar motor switch protein FliM
HWH78_RS06730  6       20       -0.221881  flagellar motor switch protein FliN
HWH78_RS06735  4       19       0.520263   flagellar biosynthetic protein FliO
HWH78_RS06740  3       20       0.650304   flagellar type III secretion system pore protein
HWH78_RS06745  3       17       0.591997   flagellar biosynthesis protein FliQ
HWH78_RS06750  2       16       1.055966   flagellar type III secretion system protein
HWH78_RS06755  1       15       0.859576   flagellar type III secretion system protein
HWH78_RS06760  0       15       0.615521   YgcG family protein
HWH78_RS06765  0       12       -0.060285  YgcG family protein
HWH78_RS06770  0       8        -1.183526  flagellar biosynthesis protein FlhA
HWH78_RS06775  -1      10       -0.840646  flagellar biosynthesis protein FlhF
HWH78_RS06780  1       11       -1.143093  flagellar synthesis regulator FleN
HWH78_RS06785  1       12       -0.896334  RNA polymerase sigma factor FliA
HWH78_RS06790  1       13       -0.778011  chemotaxis response regulator CheY
HWH78_RS06795  0       12       -0.395639  protein phosphatase CheZ
HWH78_RS06800  0       12       0.553135   chemotaxis protein CheA
HWH78_RS06805  1       14       0.249718   chemotaxis response regulator protein-glutamate
HWH78_RS06810  0       15       0.484358   flagellar motor protein
HWH78_RS06815  -1      15       0.780306   flagellar motor protein MotD
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS06655  SACTRNSFRASE  40  5e-07  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 40.3 bits (94), Expect = 5e-07
Identities = 20/84 (23%), Positives = 31/84 (36%), Gaps = 12/84 (14%)

Query: 63 GFIGLNQ-----AHVEMLFVEPGLRGRGIGRRLLDHARATWPR------LSVDVNEQNPQ 111
G I + A +E + V R +G+G LL A W + L ++ + N
Sbjct: 78 GRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKA-IEWAKENHFCGLMLETQDINIS 136

Query: 112 ACGFYRHYGFRQTGRSATDSAGRP 135
AC FY + F + P
Sbjct: 137 ACHFYAKHHFIIGAVDTMLYSNFP 160


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS06670  AUTOINDCRSYN  153  5e-49  Autoinducer synthesis protein signature.
		>AUTOINDCRSYN#Autoinducer synthesis protein signature.

Length = 216

Score = 153 bits (387), Expect = 5e-49
Identities = 41/177 (23%), Positives = 74/177 (41%), Gaps = 6/177 (3%)

Query: 14 KLLGEMHKLRAQVFKERKGWDVSVIDEMEIDGYDALSPYYMLIQEDTPEAQVFGCWRILD 73
GE+ LR + FK+R W V D ME D YD + Y+ +D V R ++
Sbjct: 15 TKSGELFTLRKETFKDRLNWAVQCTDGMEFDQYDNNNTTYLFGIKDN---TVICSLRFIE 71

Query: 74 TTGPYMLKNTFPELLHGKEAPCSPHIWELSRFAINSGQKGSLGFSDCTLEAMRALARYSL 133
T P M+ TF P + E SRF ++ + + ++ + +M L+ +
Sbjct: 72 TKYPNMITGTFFPYFKEINIPEGNY-LESSRFFVDKSRAKDILGNEYPISSMLFLSMINY 130

Query: 134 QND--IQTLVTVTTVGVEKMMIRAGLDVSRFGPHLKIGIERAVALRIELNAKTQIAL 188
D + T+ + + ++ R+G + L ER + + ++ + Q AL
Sbjct: 131 SKDKGYDGIYTIVSHPMLTILKRSGWGIRVVEQGLSEKEERVYLVFLPVDDENQEAL 187


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS06685  RTXTOXIND  50  9e-09  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 49.8 bits (119), Expect = 9e-09
Identities = 28/159 (17%), Positives = 56/159 (35%), Gaps = 36/159 (22%)

Query: 56 VEQQVSGIGTVTSLHNV-VIRTQIDGQLTRLLVSEGQMVEAGELLATIDD-------RAV 107
VE + G +T I+ + + ++V EG+ V G++L +
Sbjct: 80 VEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKT 139

Query: 108 VAALEQAQASRASNQAQLKS--------------------AEQDLQRYRSLYAER----- 142
++L QA+ + Q +S +E+++ R SL E+
Sbjct: 140 QSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQ 199

Query: 143 ---AVSRQLLDQQQATVDQLRATLKANDATINAERVRLS 178
LD+++A + A + + E+ RL
Sbjct: 200 NQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLD 238



Score = 40.2 bits (94), Expect = 1e-05
Identities = 42/207 (20%), Positives = 78/207 (37%), Gaps = 16/207 (7%)

Query: 110 ALEQAQASRASNQAQLKSAEQDLQRYRSLYAERAVSRQLLDQ--QQATVDQLR-ATLKAN 166
A+ + + +L+ + L++ S QL+ Q + +D+LR T
Sbjct: 253 AVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIG 312

Query: 167 DAT--INAERVRLSYTRITSPVSGKVGIRNV-DVGNLVRVGDSLGLFSVTQIAPISVVFS 223
T + R + I +PVS KV V G +V ++L + V + + V
Sbjct: 313 LLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETL-MVIVPEDDTLEVTAL 371

Query: 224 LQQEQLSQLQALLGGEAAVRAY-SRDGGSALGEGRLLTIDNQIDSSTGTI-RVRASFD-- 279
+Q + + + V A+ G +G+ + + +D D G + V S +
Sbjct: 372 VQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEEN 431

Query: 280 -----NRQARLWPGQFVAVSLHTGVRR 301
N+ L G V + TG+R
Sbjct: 432 CLSTGNKNIPLSSGMAVTAEIKTGMRS 458


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS06690  ACRIFLAVINRP  762  0.0  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 762 bits (1969), Expect = 0.0
Identities = 290/1040 (27%), Positives = 490/1040 (47%), Gaps = 36/1040 (3%)

Query: 7 ISGWCVRHPIATALLTLASLLLGLLAFLRLGVAPLPEADFPTIQINALLPGGSPETMASS 66
++ + +R PI +L + ++ G LA L+L VA P P + ++A PG +T+ +
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 67 VATPLEVQFSAIPGITEMTSSSA-LGTTTLTLQFSLDKSIDVAAQEVQAAINAAAGRLPV 125
V +E + I + M+S+S G+ T+TL F D+A +VQ + A LP
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 126 DMPNLPTWRKVNPADSPIMILRVNSE--MMPLIELSDYAETILARQLSQVNGVGQIFVVG 183
++ + S +M+ S+ ++SDY + + LS++NGVG + + G
Sbjct: 121 EVQQ-QGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG 179

Query: 184 QQRPAIRIQAQPEKLAAYQLTLADLRQSLQSASVNLAKGALYGEGRVS------TLAAND 237
Q A+RI + L Y+LT D+ L+ + +A G L G + ++ A
Sbjct: 180 AQY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQT 238

Query: 238 QLFNASDYDDLVV-AYRQGAPVFLKDVARIVSAPEDDYVQAWPNGVPGVALVILRQPGAN 296
+ N ++ + + G+ V LKDVAR+ E+ V A NG P L I GAN
Sbjct: 239 RFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGAN 298

Query: 297 IVDTADAIQAALPRLREMLPATIEVDVLNDRTRTIRSSLHEVELTLLLTIGLVVLVMGLF 356
+DTA AI+A L L+ P ++V D T ++ S+HEV TL I LV LVM LF
Sbjct: 299 ALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLF 358

Query: 357 LRQLSATLIVATVLAVSLSASFAAMYVLGFTLNNLTLVALIIAVGFIVDDAIVVVENIHR 416
L+ + ATLI + V L +FA + G+++N LT+ +++A+G +VDDAIVVVEN+ R
Sbjct: 359 LQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVER 418

Query: 417 HL-EAGASKVEAALKGAAEIGFTVISISFSLIAAFIPLLFMGGIVGRLFREFAVSVTVAI 475
+ E EA K ++I ++ I+ L A FIP+ F GG G ++R+F++++ A+
Sbjct: 419 VMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAM 478

Query: 476 LISVLASLTLAPMLASRFM-PALRHADAPRKGFAEW-------LTGGYERGLRWALGHQR 527
+SVL +L L P L + + P + GF W Y + LG
Sbjct: 479 ALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTG 538

Query: 528 LMLVGFAFTVLVAVAGYVGIPKGFFPLQDTAFVFGTSQAAEDISYDDMVAKHRQLAEIIA 587
L+ +A V V ++ +P F P +D Q + + Q+ +
Sbjct: 539 RYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYL 598

Query: 588 SDPA--VQSYNHAVGVTGGSQSLANGRFWIVLKDRGERDV---SVGEFIDRLRPQLAKVP 642
+ V+S G + Q+ G ++ LK ER+ S I R + +L K+
Sbjct: 599 KNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIR 658

Query: 643 GIMLYLRAAQDINLSSGPSRTQYQYAL---RSSDSTQLALWAQRLTERLKQVPG-LMDVS 698
+ I + T + + L L +L Q P L+ V
Sbjct: 659 DGFVIPFNMPAI--VELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVR 716

Query: 699 NDLQVGASVTALDIDRVAAARFGLSAEDVSQTLYDAFGQRQVGEYQTEVNQYKVVLELDA 758
+ + L++D+ A G+S D++QT+ A G V ++ K+ ++ DA
Sbjct: 717 PNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADA 776

Query: 759 RQRGRAESLDWFYLRSPLSGEMVPLSAIAKVAAPRSGPLQINHNGMFPAVNLSFNLAAGV 818
+ R E +D Y+RS +GEMVP SA G ++ P++ + A G
Sbjct: 777 KFRMLPEDVDKLYVRSA-NGEMVPFSAFTTSH-WVYGSPRLERYNGLPSMEIQGEAAPGT 834

Query: 819 SLGEAVQAVQRAQEEIGMPSTIIGVFQGAAQAFQSSLASQPLLILAALIAVYIILGVLYE 878
S G+A+ ++ + +P+ I + G + + S P L+ + + V++ L LYE
Sbjct: 835 SSGDAMALMENLASK--LPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYE 892

Query: 879 SFVHPLTILSTLPSAGIGAVFLLWAWGQDFSIMALIGIVLLIGIVKKNGILMVDFAIVAQ 938
S+ P++++ +P +G + + Q + ++G++ IG+ KN IL+V+FA
Sbjct: 893 SWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLM 952

Query: 939 REQGMSAEQAIYQACLTRFRPIMMTTLAALLGAIPLMIGFGTGSELRQPLGIAVVGGLLV 998
++G +A A R RPI+MT+LA +LG +PL I G GS + +GI V+GG++
Sbjct: 953 EKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVS 1012

Query: 999 SQVLTLFSTPVVYLALERLF 1018
+ +L +F PV ++ + R F
Sbjct: 1013 ATLLAIFFVPVFFVVIRRCF 1032


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS06695  HTHFIS  86  1e-21  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 86.0 bits (213), Expect = 1e-21
Identities = 33/129 (25%), Positives = 62/129 (48%), Gaps = 2/129 (1%)

Query: 2 RVLIVEDEAKTADYLNRGLSEQGFTVDLADNGIDGRHLALHGEYDVIVLDVMLPGVDGYG 61
+L+ +D+A LN+ LS G+ V + N G+ D++V DV++P + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VLRALRERR-QTPVIMLTARERVEDRVRGLREGADDYLIKPFSFLELVARL-QALTRRGG 119
+L +++ R PV++++A+ ++ +GA DYL KPF EL+ + +AL
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 120 NHESHSQMR 128

Sbjct: 125 RPSKLEDDS 133


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS06700  PF06580  29  0.035  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.1 bits (65), Expect = 0.035
Identities = 17/93 (18%), Positives = 30/93 (32%), Gaps = 5/93 (5%)

Query: 316 EETSLAGEIATTVDFLEVI----FDEAGVGIEVRGEAR-ALVERALFQRAVTNLLYNAAQ 370
+ SLA E+ +L++ D ++ V L Q V N + +
Sbjct: 214 RQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVENGIKHGIA 273

Query: 371 HTAAGGTLRVGVERRGDEVRVAVSNPGVPIADE 403
GG + + + V + V N G
Sbjct: 274 QLPQGGKILLKGTKDNGTVTLEVENTGSLALKN 306


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
HWH78_RS06715  FLGHOOKFLIK  52  2e-09  Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 52.1 bits (124), Expect = 2e-09
Identities = 73/300 (24%), Positives = 114/300 (38%), Gaps = 14/300 (4%)

Query: 128 DENTQATLLPPAVPTASSAPASLTEASSDPTLVKLNGVPAVNMALEQGAQDAAQTAKGGP 187
DE + +T L A A +A A + V A AL T K
Sbjct: 90 DEQSTSTPLTTAQTMALAAVADKNTTKDEKADDLNEDVTASLSALFAMLPGFDNTPKVTD 149

Query: 188 AKSADPRQANLGDALAGLTSDSLTKAVDGKALEAQLQQTAEPAVASAASESLLESKAEPR 247
A S LTS+ LT A A Q P VA A S++ + S P
Sbjct: 150 APST-VLPTEKPTLFTKLTSEQLTTAQPDDAPGTPAQPLT-PLVAEAQSKAEVISTPSPV 207

Query: 248 GEPFAAKLNGLTQAMAQQALTNRPVNGTVPGQPVAMQQNGWSEAVVDRVMWMSSQNLKSA 307
AA +T Q T P + + W +++ + + Q +SA
Sbjct: 208 T---AAASPLITPHQTQPLPT-----VAAPVLSAPLGSHEWQQSLSQHISLFTRQGQQSA 259

Query: 308 EIQLDPAELGRLDVRIHMTADQTQVTFASPNAGVRDALESQMHRLRDMFSQQGMNQLDVN 367
E++L P +LG + + + + +Q Q+ SP+ VR ALE+ + LR ++ G+ N
Sbjct: 260 ELRLHPQDLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAALPVLRTQLAESGIQLGQSN 319

Query: 368 VSDQSLARGWQGQQQGEGGSARGRGLAGEASGDEETLAGVSEIRSRPGASAARGLVDYYA 427
+S +S + G Q + S R A D++TL ++ G VD +A
Sbjct: 320 ISGESFS-GQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVSLQ---GRVTGNSGVDIFA 375


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS06725  FLGMOTORFLIM  259    2e-87    Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 259 bits (664), Expect = 2e-87
Identities = 98/326 (30%), Positives = 167/326 (51%), Gaps = 13/326 (3%)

Query: 5 DLLSQDEIDALLHGVDDGLVETEVEATPG-----SVKSYDLTSQDRIVRGRMPTLEMINE 59
++LSQDEID LL + G + +E + YD D+ + +M TL +++E
Sbjct: 3 EVLSQDEIDQLLTAISSG--DASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLMHE 60

Query: 60 RFARYTRISMFNLLRRSADVAVGGVQVMKFGEYVHSLYVPTSLNLVKMKPLRGTALFILD 119
FAR T S+ LR V V V + + E++ S+ P++L ++ M PL+G A+ +D
Sbjct: 61 TFARLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVLEVD 120

Query: 120 AKLVFKLVDNFFGGDGRHAKIEGREFTPTELRVVRMVLEQAFVDLKEAWQAVLEMNFEYV 179
+ F ++D FGG G+ AK++ R+ T E V+ V+ + +++E+W V+++
Sbjct: 121 PSITFSIIDRLFGGTGQAAKVQ-RDLTDIENSVMEGVIVRILANVRESWTQVIDLRPRLG 179

Query: 180 NSEVNPAMANIVSPSEVVVVSTFHIELDGGGGDLHITMPYSMIEPIREMLDAGF--QSDH 237
E NP A IV PSE+VV+ T ++ G ++ +PY IEPI L + F S
Sbjct: 180 QIETNPQFAQIVPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFSSVR 239

Query: 238 DDQDERWIKALREDVLDVQVPLGATVVRRQLKLRDILHMQPGDVIPVE---MPEHMVMRA 294
+++ LR+ + V + + A V +L +RDIL ++ GD+I + + + V+
Sbjct: 240 RSSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFVLSI 299

Query: 295 NGVPAFKVKLGAHKGNLALQILEAVE 320
F + G +A QILE +E
Sbjct: 300 GNRKKFLCQPGVVGKKIAAQILERIE 325


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS06730  FLGMOTORFLIN  120    8e-38    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 120 bits (301), Expect = 8e-38
Identities = 62/145 (42%), Positives = 90/145 (62%), Gaps = 24/145 (16%)

Query: 13 ALADEWAAALAE-AGDASQDDIDALMAQGGATPVAEPSTPRAPMEEFGASPKAPTISGLE 71
AL D WA AL E ++ DA+ Q G V+
Sbjct: 14 ALDDLWADALNEQKATTTKSAADAVFQQLGGGDVSGAMQ--------------------- 52

Query: 72 GPNLDVILDIPVTISMEVGHTDISIRNLLQLNQGSVIELDRLAGEPLDVLVNGTLIAHGE 131
++D+I+DIPV +++E+G T ++I+ LL+L QGSV+ LD LAGEPLD+L+NG LIA GE
Sbjct: 53 --DIDLIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGE 110

Query: 132 VVVVNEKFGIRLTDVISPSERIKKL 156
VVVV +K+G+R+TD+I+PSER+++L
Sbjct: 111 VVVVADKYGVRITDIITPSERMRRL 135


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS06740  FLGBIOSNFLIP  264    2e-91    Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP

signature.
Length = 245

Score = 264 bits (676), Expect = 2e-91
Identities = 140/242 (57%), Positives = 176/242 (72%), Gaps = 3/242 (1%)

Query: 11 LAALCLLLLAPWPALAADPTSISAITVTTNGQGQQEYSVSLQILLIMTALSFIPAFVMLM 70
L+ +LL P A + IT G Q +S+ +Q L+ +T+L+FIPA +++M
Sbjct: 5 LSVAPVLLWLITPLAFA---QLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLMM 61

Query: 71 TSFTRIIIVFSILRQALGLQSTPSNQVLVGLALFLTMFVMAPVFDKINSQALQPYLNEQI 130
TSFTRIIIVF +LR ALG S P NQVL+GLALFLT F+M+PV DKI A QP+ E+I
Sbjct: 62 TSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEKI 121

Query: 131 PAQEALQKAEVPLKAFMLAQTRTSDLELFVRLSKRTDIGSPEATPLTILVPAFVTSELKT 190
QEAL+K PL+ FML QTR +DL LF RL+ + PEA P+ IL+PA+VTSELKT
Sbjct: 122 SMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELKT 181

Query: 191 AFQIGFMIFIPFLIIDLVVSSVLMAMGMMMLSPLIISLPFKIMLFVLVDGWALIIGTLAG 250
AFQIGF IFIPFLIIDLV++SVLMA+GMMM+ P I+LPFK+MLFVLVDGW L++G+LA
Sbjct: 182 AFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLAQ 241

Query: 251 SF 252
SF
Sbjct: 242 SF 243


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS06745  TYPE3IMQPROT  55     9e-14    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein

family signature.
Length = 86

Score = 54.8 bits (132), Expect = 9e-14
Identities = 24/75 (32%), Positives = 43/75 (57%)

Query: 7 LDLFREALWLTAMIVGVLVVPSLLVGLVVAMFQAATQINEQTLSFLPRLMVILLTLIVLG 66
+ +AL+L ++ G + + ++GL+V +FQ TQ+ EQTL F +L+ + L L +L
Sbjct: 5 VFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFLLS 64

Query: 67 PWLLRQLMEYTQTLI 81
W L+ Y + +I
Sbjct: 65 GWYGEVLLSYGRQVI 79


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS06750  TYPE3IMRPROT  135    7e-41    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein

family signature.
Length = 261

Score = 135 bits (341), Expect = 7e-41
Identities = 96/232 (41%), Positives = 143/232 (61%), Gaps = 2/232 (0%)

Query: 1 MLELTNAQIGGWIASFVLPLFRVAALLMTMPVIGTQLVPVRVRLYLALGVCVVLVPNLPP 60
ML++T+ Q W+ + PL RV AL+ T P++ + VP RV+L LA+ + + P+LP
Sbjct: 1 MLQVTSEQWLSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA 60

Query: 61 MPQVDALSMKAMLLIGEQILVGALLGFSLQLLFHAFVIAGQIISMQMGLGFASMVDPANG 120
S A+ L +QIL+G LGF++Q F A AG+II +QMGL FA+ VDPA+
Sbjct: 61 NDVPV-FSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASH 119

Query: 121 VSVPVLGQFFTMLVTLLFLAMNGHLVVFEVIAESFVTLPVGEGLSGNHFWI-IAGKLGWV 179
+++PVL + ML LLFL NGHL + ++ ++F TLP+G ++ ++ + +
Sbjct: 120 LNMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLI 179

Query: 180 MGAALLLALPAITALLVVNLAFGAMTRAAPQLNIFSIGFPLTLVLGLVILWI 231
L+LALP IT LL +NLA G + R APQL+IF IGFPLTL +G+ ++
Sbjct: 180 FLNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAA 231


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS06755  TYPE3IMSPROT  336    e-116    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein

family signature.
Length = 354

Score = 336 bits (864), Expect = e-116
Identities = 98/345 (28%), Positives = 183/345 (53%), Gaps = 2/345 (0%)

Query: 9 DKTEEPTEKRRREAREKGQLPRSRELNTLAILMAGAGGLLIYGADLAGALLRLMRSNFEL 68
+KTE+PT K+ R+AR+KGQ+ +S+E+ + A+++A + L+ +LM E
Sbjct: 4 EKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPAEQ 63

Query: 69 SRETAMNTESMLQLLGASAYLAAQGLWPILLMLLVAAIVGPIALGGWLFSMDALQPKFSR 128
S ++++ ++ +P+L + + AI + G+L S +A++P +
Sbjct: 64 SYLPF--SQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIKK 121

Query: 129 LNPLSGLKRMFSAKSLLELSKALIKFLVVLAVALLVLSADRDALLALAHQPLEQAILHSV 188
+NP+ G KR+FS KSL+E K+++K +++ + +++ + LL L +E
Sbjct: 122 INPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLLG 181

Query: 189 RVVGWSAFWMACSLLLIAAVDVPYQIWDNRQKLLMTKQEVRDEYKDSEGKPEVKSKIRQM 248
+++ ++I+ D ++ + ++L M+K E++ EYK+ EG PE+KSK RQ
Sbjct: 182 QILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQF 241

Query: 249 QREMAQRRMMAAVPEADVVITNPTHFAVALKYDPAGGGAPLLLAKGNDFLALKIREVAQE 308
+E+ R M V + VV+ NPTH A+ + Y PL+ K D +R++A+E
Sbjct: 242 HQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAEE 301

Query: 309 HKVMVMESPALARAVYYSTELDQEIPAGLYLAVAQVLAYVYQLKQ 353
V +++ LARA+Y+ +D IPA A A+VL ++ +
Sbjct: 302 EGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQNI 346


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS06760  cloacin  36     2e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 36.2 bits (83), Expect = 2e-04
Identities = 22/55 (40%), Positives = 24/55 (43%), Gaps = 9/55 (16%)

Query: 373 GQVRLSGGGGGSSGSS--------GGGSSSSSSSSSGGFSGGGGSSG-GGGASGS 418
G L GGG S GS GGGS S G G GG +G GG SG+
Sbjct: 23 GPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGT 77



Score = 35.8 bits (82), Expect = 3e-04
Identities = 15/37 (40%), Positives = 19/37 (51%)

Query: 380 GGGGSSGSSGGGSSSSSSSSSGGFSGGGGSSGGGGAS 416
GGG SG GG S + G SGGG +GG ++
Sbjct: 47 GGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSA 83



Score = 33.5 bits (76), Expect = 0.001
Identities = 15/40 (37%), Positives = 19/40 (47%)

Query: 379 GGGGGSSGSSGGGSSSSSSSSSGGFSGGGGSSGGGGASGS 418
GGG GS GGGS + +G GG G+ G A +
Sbjct: 47 GGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAA 86



Score = 32.4 bits (73), Expect = 0.004
Identities = 16/40 (40%), Positives = 16/40 (40%)

Query: 380 GGGGSSGSSGGGSSSSSSSSSGGFSGGGGSSGGGGASGSW 419
GG G GG S S SS GGG SG GS
Sbjct: 22 GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61



Score = 30.1 bits (67), Expect = 0.017
Identities = 12/36 (33%), Positives = 17/36 (47%)

Query: 382 GGSSGSSGGGSSSSSSSSSGGFSGGGGSSGGGGASG 417
GG G+ S+S + +GG +G G G SG
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSG 38



Score = 30.1 bits (67), Expect = 0.021
Identities = 13/40 (32%), Positives = 18/40 (45%)

Query: 378 SGGGGGSSGSSGGGSSSSSSSSSGGFSGGGGSSGGGGASG 417
GG G S G SG G + S+ ++ F S+ G G
Sbjct: 65 GGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLA 104



Score = 29.3 bits (65), Expect = 0.029
Identities = 13/38 (34%), Positives = 17/38 (44%), Gaps = 3/38 (7%)

Query: 385 SGSSGGGSSSSSSSSSG---GFSGGGGSSGGGGASGSW 419
SG G G ++ + S+SG G G G GG W
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGW 39


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS06765  cloacin  30     0.022    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 30.1 bits (67), Expect = 0.022
Identities = 15/48 (31%), Positives = 20/48 (41%)

Query: 398 SAGGSGGGRRRGGDYASSSGSSSSSSSSSSSDSFSGGGGSSGGGGASG 445
+ G GGG G ++S + S S G G+ GG G SG
Sbjct: 25 TGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSG 72


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
HWH78_RS06790  HTHFIS  90     2e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 89.9 bits (223), Expect = 2e-24
Identities = 32/120 (26%), Positives = 55/120 (45%), Gaps = 3/120 (2%)

Query: 2 KILIVDDFSTMRRIIKNLLRDLGFTNTAEADDGTTALPMLHSGNFDFLVTDWNMPGMTGI 61
IL+ DD + +R ++ L G+ + T + +G+ D +VTD MP
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 DLLRAVRADERLKHLPVLMVTAEAKRDQIIEAAQAGVNGYVVKPFTAQVLKEKIEKIFER 121
DLL ++ + LPVL+++A+ I+A++ G Y+ KPF L I +
Sbjct: 64 DLLPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS06800  PF06580  42     7e-06    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 41.8 bits (98), Expect = 7e-06
Identities = 13/69 (18%), Positives = 30/69 (43%), Gaps = 10/69 (14%)

Query: 462 ETDLDKNLVEALADPLV--HLVRNAVDHGIESPEEREAAGKPRVGQVVLSAEQEGDHILL 519
E ++ +++ P++ LV N + HGI P+ G+++L ++ + L
Sbjct: 243 ENQINPAIMDVQVPPMLVQTLVENGIKHGIA--------QLPQGGKILLKGTKDNGTVTL 294

Query: 520 MITDDGKGM 528
+ + G
Sbjct: 295 EVENTGSLA 303


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
HWH78_RS06805  HTHFIS  59     8e-12    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 59.5 bits (144), Expect = 8e-12
Identities = 35/142 (24%), Positives = 55/142 (38%), Gaps = 6/142 (4%)

Query: 2 AVKVLVVDDSGFFRRRVSEILSADGQIQVVGTATNGREAIEQALALRPDVITMDYEMPLM 61
+LV DD R +++ LS G +N A D++ D MP
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVR--ITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 62 DGITAVRNIMQRCP-TPVLMFSSLTHEGARVTLDALDAGAVDYLPKNFEDISRNPDKVRQ 120
+ + I + P PVL+ S+ + A + GA DYLPK F D++ + +
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAI--KASEKGAYDYLPKPF-DLTELIGIIGR 117

Query: 121 LLCEKVLTIARSNRRSISLPPL 142
L E ++ S PL
Sbjct: 118 ALAEPKRRPSKLEDDSQDGMPL 139


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits        Score  E-value  Comments
HWH78_RS06815  OMPADOMAIN  69     1e-15    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 69.2 bits (169), Expect = 1e-15
Identities = 34/125 (27%), Positives = 52/125 (41%), Gaps = 16/125 (12%)

Query: 128 EITLNSSLLFPSGDALPNDAAFDIVEKVAKILAPYKNP---IHVEGFTDDVPIHSPRYPT 184
TL S +LF A ++++ L+ + V G+TD I S Y
Sbjct: 214 HFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDR--IGSDAY-- 269

Query: 185 NWELSAARAASIVRLLGNDGVEPSRMAAVGYGEFQPVADNASAEGR---------AKNRR 235
N LS RA S+V L + G+ +++A G GE PV N + A +RR
Sbjct: 270 NQGLSERRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRR 329

Query: 236 VVLVI 240
V + +
Sbjct: 330 VEIEV 334


101  HWH78_RS08005  HWH78_RS08050  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS08005  2       12       -0.706131  SctU family type III secretion system export
HWH78_RS08010  3       18       0.838524   SctT family type III secretion system export
HWH78_RS08015  3       18       1.900610   SctS family type III secretion system export
HWH78_RS08020  0       15       2.378813   SctR family type III secretion system export
HWH78_RS08025  0       15       2.689077   SctQ family type III secretion system
HWH78_RS08030  1       15       3.113118   type III secretion system needle length
HWH78_RS08040  2       11       2.671654   type III secretion system central stalk protein
HWH78_RS08045  3       11       2.726795   SctN family type III secretion system ATPase
HWH78_RS08050  5       15       1.114692   SctW family type III secretion system gatekeeper
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS08010  TYPE3IMSPROT  422    e-150    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein

family signature.
Length = 354

Score = 422 bits (1087), Expect = e-150
Identities = 232/349 (66%), Positives = 294/349 (84%)

Query: 1 MSAEKTEQPTAKKLRDARRQGQVVKSKEIVSSALILSLVALLMGFSDYYLEHLGKLLLLP 60
MS EKTEQPT KK+RDAR++GQV KSKE+VS+ALI++L A+LMG SDYY EH KL+L+P
Sbjct: 1 MSGEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIP 60

Query: 61 AEYIDLPFRQALETILENLLQELLYLLAPVLLVAALVVVLSHVGQYGFLLSLDSVKPDLK 120
AE LPF QAL +++N+L E YL P+L VAAL+ + SHV QYGFL+S +++KPD+K
Sbjct: 61 AEQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIK 120

Query: 121 KINPVEGAKKIFSIRSLVEFLKSTLKVALLSLLVWLTLQGNLASLLRIPACGLDCVAPVS 180
KINP+EGAK+IFSI+SLVEFLKS LKV LLS+L+W+ ++GNL +LL++P CG++C+ P+
Sbjct: 121 KINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLL 180

Query: 181 GLMLRQLMLVCAVGFLAIAVADYAFERHQHYKQLRMSKDEVKREYKEMEGSPEIKSKRRQ 240
G +LRQLM++C VGF+ I++ADYAFE +Q+ K+L+MSKDE+KREYKEMEGSPEIKSKRRQ
Sbjct: 181 GQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQ 240

Query: 241 FHQELQSSNLRADVRRSSVIVANPTHVAIGIRYRRGETPLPLVTLKHTDALALRVRRIAE 300
FHQE+QS N+R +V+RSSV+VANPTH+AIGI Y+RGETPLPLVT K+TDA VR+IAE
Sbjct: 241 FHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAE 300

Query: 301 EEGIPVLQRIPLARALLRDGNVDQYIPADLIQATAEVLRWLESQQTDTP 349
EEG+P+LQRIPLARAL D VD YIPA+ I+ATAEVLRWLE Q +
Sbjct: 301 EEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQNIEKQ 349


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS08015  TYPE3IMRPROT  142    1e-43    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein

family signature.
Length = 261

Score = 142 bits (360), Expect = 1e-43
Identities = 47/245 (19%), Positives = 100/245 (40%), Gaps = 4/245 (1%)

Query: 9 LLLTYSLLLPRIISCFVVLPVLAKQTLGGGLVRNGVACSLALFAYPIVAGSLPPALDALD 68
L Y L R+++ P+L+++++ V+ G+A + P + + P
Sbjct: 12 WLNLYFWPLLRVLALISTAPILSERSVPK-RVKLGLAMMITFAIAPSLPANDVPVFSFFA 70

Query: 69 IALLIGKEVLLGLLIGFVATIPFWAMEATGFIIDNQRGAALASTFNPSLGSQTSPTGLLL 128
+ L + +++L+G+ +GF F A+ G II Q G + A+ +P+ ++
Sbjct: 71 LWLAV-QQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLARIM 129

Query: 129 TQTLITLFFSGGAFLALVGSLFRSYASWPVSSFFPQLGSQWVAFFYAQFSQMLMLCALFA 188
+ LF + L L+ L ++ + P+ S S + + + A
Sbjct: 130 DMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPL--NSNAFLALTKAGSLIFLNGLMLA 187

Query: 189 APLLIAMFLAEFGLALVSRFAPSLNVFILAMPIKSLVASLLLVLYLGILMEHAYDALLLA 248
PL+ + L L++R AP L++F++ P+ V L+ + ++
Sbjct: 188 LPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFCEHLFSEI 247

Query: 249 VDPLR 253
+ L
Sbjct: 248 FNLLA 252


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS08020  TYPE3IMQPROT  68     4e-19    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein

family signature.
Length = 86

Score = 68.3 bits (167), Expect = 4e-19
Identities = 35/78 (44%), Positives = 48/78 (61%)

Query: 5 DILHFTNQTLWLVLVLSLPPVLVAALIGTLVSLVQALTQIQEQTLGFVAKLVAVVVVLFA 64
D++ N+ L+LVL+LS P +VA +IG LV L Q +TQ+QEQTL F KL+ V + LF
Sbjct: 3 DLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFL 62

Query: 65 TSGWLGGELYRFAEMTLL 82
SGW G L + +
Sbjct: 63 LSGWYGEVLLSYGRQVIF 80


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS08025  TYPE3IMPPROT  246    3e-85    Type III secretion system inner membrane P protein ...
		>TYPE3IMPPROT#Type III secretion system inner membrane P protein

family signature.
Length = 224

Score = 246 bits (629), Expect = 3e-85
Identities = 92/217 (42%), Positives = 142/217 (65%), Gaps = 7/217 (3%)

Query: 6 DELGLILGLALLALVPFIAVMATSFIKMTVVFSLLRNALGVQQIPPNMAMYGLAIILSLY 65
+++ LI LA L+PFI T F+K ++VF ++RNALG+QQIP NM + G+A++LS++
Sbjct: 3 NDISLIALLAFSTLLPFIIASGTCFVKFSIVFVMVRNALGLQQIPSNMTLNGVALLLSMF 62

Query: 66 VMAPVGFATRDYLRNHDVSLSDSASVERFLDEGMAPYRNFLKRQIQEREHTFFMESTRQV 125
VM P+ Y + DV+ +D +S+ + +DEG+ YR++L + FF + +
Sbjct: 63 VMWPIMHDAYVYFEDEDVTFNDISSLSKHVDEGLDGYRDYLIKYSDRELVQFFENAQLKR 122

Query: 126 WPSEYAERLDPD-------SLLILLPAFTVSELTRAFEIGFLIYLPFIAIDLIISNILLA 178
E E + D S+ LLPA+ +SE+ AF+IGF +YLPF+ +DL++S++LLA
Sbjct: 123 QYGEETETVKRDKDEIEKPSIFALLPAYALSEIKSAFKIGFYLYLPFVVVDLVVSSVLLA 182

Query: 179 MGMMMVSPMTISLPFKLLLFVLLDGWARLTHGLVISY 215
+GMMM+SP+TIS P KL+LFV LDGW L+ GL++ Y
Sbjct: 183 LGMMMMSPVTISTPIKLVLFVALDGWTLLSKGLILQY 219


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS08030  TYPE3OMOPROT  84     1e-20    Type III secretion system outer membrane O protein ...
		>TYPE3OMOPROT#Type III secretion system outer membrane O protein

family signature.
Length = 303

Score = 83.5 bits (206), Expect = 1e-20
Identities = 46/177 (25%), Positives = 73/177 (41%), Gaps = 14/177 (7%)

Query: 130 RLALWLDGDPATLLARLPPRPSAQRLAIPLRLSLQWPGLPLDASELRTLEPGDLLLLPAG 189
R LW + P L A RP R + + L L + GD+LL+
Sbjct: 126 RGGLWFEHLPE-LPAVGGGRPKMLRWPLRFVIGSSDTQRSL----LGRIGIGDVLLIRTS 180

Query: 190 HRPDAALLGVLEGRPWARCQLHSTQL-ELLDMH----DTPSLADGEDLHELDQLPIPVSF 244
A + + ++ + E LD+ + + E L L+QLP+ + F
Sbjct: 181 R----AEVYCYAKKLGHFNRVEGGIIVETLDIQHIEEENNTTETAETLPGLNQLPVKLEF 236

Query: 245 EVGRRTLDLHTLSTLQPGSLLDLDSALDGEVRILANQRCLGIGELVRLQDRLGVRVT 301
+ R+ + L L + LL L + + V I+AN LG GELV++ D LGV +
Sbjct: 237 VLYRKNVTLAELEAMGQQQLLSLPTNAELNVEIMANGVLLGNGELVQMNDTLGVEIH 293


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
HWH78_RS08035  IGASERPTASE  39     3e-05    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 38.9 bits (90), Expect = 3e-05
Identities = 25/133 (18%), Positives = 39/133 (29%), Gaps = 18/133 (13%)

Query: 19 APLPPLRAQQIAFEQALPAHRPPAPRPPFDKGDETTEAAATADAPTSTPLADQPAAPAAD 78
+ + P + Q + R P T + T+T + A
Sbjct: 1127 SQVSPKQEQSETVQPQAEPARENDP----------TVNIKEPQSQTNTTADTEQPAKE-- 1174

Query: 79 RPPTTRQAPMPVAADATPTPTPTPTPTPTPTPTPTPTPTV-SPSGSVARQAPAVTARVAA 137
T+ PV T + P T T PTV S S + + + R
Sbjct: 1175 ---TSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVP 1231

Query: 138 STQGREPASVSAP 150
EPA+ S+
Sbjct: 1232 HNV--EPATTSSN 1242


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS08050  PF07201  283    6e-98    Hypersensitivity response secretion protein HrpJ
		>PF07201#Hypersensitivity response secretion protein HrpJ

Length = 293

Score = 283 bits (726), Expect = 6e-98
Identities = 134/294 (45%), Positives = 181/294 (61%), Gaps = 7/294 (2%)

Query: 1 MDILQSSSAAPLA-----PREAANAPAQQAGGSFQGERVHYVSVS-QSLADAAEELTFAF 54
M L + S P A++ Q G F+GE V VS + QS+AD AEE+TF F
Sbjct: 1 MTTLHNLSYGNTPLHNERPEIASSQIVNQTLGQFRGESVQIVSGTLQSIADMAEEVTFVF 60

Query: 55 SERAEKSLAKRRLSDAHARLSEVQTMLQEYWKRIPDLESQQKLEALIAHLGSGQLSSLAQ 114
SER E SL KR+LSD+ AR+S+V+ + +Y ++P+LE +Q + L++ L + SL+Q
Sbjct: 61 SERKELSLDKRKLSDSQARVSDVEEQVNQYLSKVPELEQKQNVSELLSLLSNSPNISLSQ 120

Query: 115 LSAYLEGFSSEISQRFLALSRARDVLAGRPEARAMLALVDQALLRMADEQGLEIELGLRI 174
L AYLEG S E S++F L RD L GRPE + LV+QAL+ MA+EQG I LG RI
Sbjct: 121 LKAYLEGKSEEPSEQFKMLCGLRDALKGRPELAHLSHLVEQALVSMAEEQGETIVLGARI 180

Query: 175 EPLAAEASAAGVGDIQALRDTYRDAVLDYRGLSAAWQDIQARFAATPLERVVAFLQKALS 234
P A S +GV +Q LRDTYRDAV+ Y+G+ A W D+Q RF ++ V+ FLQKALS
Sbjct: 181 TPEAYRESQSGVNPLQPLRDTYRDAVMGYQGIYAIWSDLQKRFPNGDIDSVILFLQKALS 240

Query: 235 ADLDSQSSRLDPVKLERVMSDMHKLRVLGGLAEQVGALWQVLVTGERGHGIRAF 288
ADL SQ S KL V+SD+ KL+ G +++QV WQ G + +G+R F
Sbjct: 241 ADLQSQQSGSGREKLGIVISDLQKLKEFGSVSDQVKGFWQFFSEG-KTNGVRPF 293


102  HWH78_RS08090  HWH78_RS08140  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS08090  2       18       -1.103387  type III secretion protein PcrV
HWH78_RS08095  1       21       -0.615751  SycD/LcrH family type III secretion system
HWH78_RS08100  2       23       -2.292920  type III secretion system translocon subunit
HWH78_RS08105  1       23       -1.739780  type III secretion system translocon subunit
HWH78_RS08110  0       26       -1.614890  type III secretion system regulatory chaperone
HWH78_RS08115  -3      17       -0.498734  T3SS regulon translocated regulator ExsE
HWH78_RS08120  -1      15       -0.069147  YscW family type III secretion system pilotin
HWH78_RS08125  0       14       0.002886   T3SS regulon transcriptional activator ExsA
HWH78_RS08130  0       14       1.188177   T3SS regulon anti-activator ExsD
HWH78_RS08135  1       15       1.230032   YscB family type III secretion system chaperone
HWH78_RS08140  1       14       0.905168   SctC family type III secretion system outer
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
HWH78_RS08090  LCRVANTIGEN  344    e-121    Low calcium response V antigen signature.
		>LCRVANTIGEN#Low calcium response V antigen signature.

Length = 326

Score = 344 bits (884), Expect = e-121
Identities = 115/296 (38%), Positives = 171/296 (57%), Gaps = 32/296 (10%)

Query: 25 ASAEQEELLALLRSERIVLAHAGQPLSEAQVL-------------KALAWLLAANPSAPP 71
S+ EEL+ L++ + I ++ P +++V K LA+ L +
Sbjct: 28 GSSVLEELVQLVKDKNIDISIKYDPRKDSEVFANRVITDDIELLKKILAYFLPEDAILKG 87

Query: 72 GQ-------GLEVLREVLQARRQPGAQWDLREFLVSAYFSLHG-RLDEDVIGVYKDVLQT 123
G G++ ++E L++ P QW+LR F+ +FSL R+D+D++ V D +
Sbjct: 88 GHYDNQLQNGIKRVKEFLES--SPNTQWELRAFMAVMHFSLTADRIDDDILKVIVDSMNH 145

Query: 124 QDGKRKALLDELKALTAELKVYSVIQSQINAALSAKQGIRIDAGGIDLVDPTLYGYAVGD 183
R L +EL LTAELK+YSVIQ++IN LS+ I I I+L+D LYGY +
Sbjct: 146 HGDARSKLREELAELTAELKIYSVIQAEINKHLSSSGTINIHDKSINLMDKNLYGYT-DE 204

Query: 184 PRWKDSPEYALLSNLDTFSGKL--------SIKDFLSGSPKQSGELKGLSDEYPFEKDNN 235
+K S EY +L + + ++ SIKDFL K++G L L + Y + KDNN
Sbjct: 205 EIFKASAEYKILEKMPQTTIQVDGSEKKIVSIKDFLGSENKRTGALGNLKNSYSYNKDNN 264

Query: 236 PVGNFATTVSDRSRPLNDKVNEKTTLLNDTSSRYNSAVEALNRFIQKYDSVLRDIL 291
+ +FATT SD+SRPLND V++KTT L+D +SR+NSA+EALNRFIQKYDSV++ +L
Sbjct: 265 ELSHFATTCSDKSRPLNDLVSQKTTQLSDITSRFNSAIEALNRFIQKYDSVMQRLL 320


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS08095  SYCDCHAPRONE  202    2e-69    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD

chaperone signature.
Length = 168

Score = 202 bits (514), Expect = 2e-69
Identities = 95/166 (57%), Positives = 126/166 (75%)

Query: 3 QQATPSDTDQQQALEAFLRDGGTLAMLRGLSEDTLEQLYALGFNQYQAGKWDDAQKIFQA 62
QQ T + Q A+E+FL+ GGT+AML +S DTLEQLY+L FNQYQ+GK++DA K+FQA
Sbjct: 2 QQETTDTQEYQLAMESFLKGGGTIAMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQA 61

Query: 63 LCMLDHYDARYFLGLGACRQSLGLYEQALQSYSYGALMDINEPRFPFHAAECHLQLGDLD 122
LC+LDHYD+R+FLGLGACRQ++G Y+ A+ SYSYGA+MDI EPRFPFHAAEC LQ G+L
Sbjct: 62 LCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELA 121

Query: 123 GAESGFYSARALAAAQPAHEALAARAGAMLEAVTARKDRTYESDNA 168
AESG + A+ L A + + L+ R +MLEA+ +K+ +E +
Sbjct: 122 EAESGLFLAQELIADKTEFKELSTRVSSMLEAIKLKKEMEHECVDN 167


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS08105  PF05844  385    e-137    YopD protein
		>PF05844#YopD protein

Length = 295

Score = 385 bits (989), Expect = e-137
Identities = 291/295 (98%), Positives = 293/295 (99%)

Query: 1 MIDTQYSLAATQAAIPSEPIAPGAAGRSVGTPQAAADLPQVPAARADRVELNAPRQVLDP 60
MIDTQYSLAATQAAIPSEPIAPGAAGRSVGTPQAAA+LPQVPAARADRVELNAPRQVLDP
Sbjct: 1 MIDTQYSLAATQAAIPSEPIAPGAAGRSVGTPQAAAELPQVPAARADRVELNAPRQVLDP 60

Query: 61 VRMEAAGSELDSSVELLLILFRIAQKARELGVLQRDNENQSIIHAQKAQVDEMRSGATLM 120
VRMEAAGSELDSSVELLLILFRIAQKARELGVLQRDNENQ+IIHAQKAQVDEMRSGATLM
Sbjct: 61 VRMEAAGSELDSSVELLLILFRIAQKARELGVLQRDNENQAIIHAQKAQVDEMRSGATLM 120

Query: 121 IAMAVIAGVGALASAVVGSLGALKNGKAISQEKTLQKNIDGRNELIDAKMQALGKTSDED 180
IAMAVIAGVGALASAVVGSLGALKNGKAISQEKTLQKNIDGRNELIDAKMQALGKTSDED
Sbjct: 121 IAMAVIAGVGALASAVVGSLGALKNGKAISQEKTLQKNIDGRNELIDAKMQALGKTSDED 180

Query: 181 RKIVGKVWAADQVQDSVALRAAGRAFESRNGALQVANTVIQSFVQMANASVQVRQGESQA 240
RKIVGKVWAADQ QDSVALRAAGRAFESRNGALQVANTVIQSFVQMANASVQVRQGESQA
Sbjct: 181 RKIVGKVWAADQAQDSVALRAAGRAFESRNGALQVANTVIQSFVQMANASVQVRQGESQA 240

Query: 241 SAREGEVNATIGQSQKQKVEDQMSFDAGFMKDVLQLIQQYTQSHNQAWRAAAGVV 295
SARE EVNATIGQSQKQKVEDQMSFDAGFMKDVLQLIQQYTQSHNQAWRAAAGVV
Sbjct: 241 SAREEEVNATIGQSQKQKVEDQMSFDAGFMKDVLQLIQQYTQSHNQAWRAAAGVV 295


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS08110  PF05932  46     3e-09    Tir chaperone protein (CesT)
		>PF05932#Tir chaperone protein (CesT)

Length = 127

Score = 45.9 bits (109), Expect = 3e-09
Identities = 27/118 (22%), Positives = 49/118 (41%), Gaps = 4/118 (3%)

Query: 10 LLAEFAGRIGLPSLSLDEEGMASLLFDEQVGVTLLLLAERERLLLEVDVAGIDVLGEGIF 69
LL +F+ + + L D+ G +++ D +TL RERLLL + +
Sbjct: 9 LLDDFSRSLEMQPLVFDDHGTCNMIIDNTFALTLSCDYARERLLLIGLLEP---HKDIPQ 65

Query: 70 RQLASFNRHWHCFDLH-FGFDELTGKVQLYAQILAAQLTLECFEATLANLLDHAEFWQ 126
+ L + + G DE +G Y I +L++ + +A LL+ W+
Sbjct: 66 QCLLAGALNPLLNAGPGLGLDEKSGLYHAYQSIPREKLSVPTLKREMAGLLEWMRGWR 123


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS08135  PF05932  93     2e-27    Tir chaperone protein (CesT)
		>PF05932#Tir chaperone protein (CesT)

Length = 127

Score = 92.9 bits (231), Expect = 2e-27
Identities = 25/120 (20%), Positives = 41/120 (34%), Gaps = 5/120 (4%)

Query: 2 DHLLSGLATRLGQGPFVADRTGSYHLRIDGQSVLLLRQGDDLLLESPLEHAPLDPQRDQQ 61
LL + L P V D G+ ++ ID L L D E L L+P +
Sbjct: 7 KTLLDDFSRSLEMQPLVFDDHGTCNMIIDNTFALTLS--CDYARERLLLIGLLEP--HKD 62

Query: 62 GLLRALLSRVASWSRRYPQAIVLDADGRLLLQA-RLGLDGLDPERLERALAAQVGLLEAL 120
+ LL+ + + LD L + + L L+R +A + +
Sbjct: 63 IPQQCLLAGALNPLLNAGPGLGLDEKSGLYHAYQSIPREKLSVPTLKREMAGLLEWMRGW 122


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS08140  TYPE3OMGPROT  817    0.0      Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein

family signature.
Length = 607

Score = 817 bits (2113), Expect = 0.0
Identities = 376/600 (62%), Positives = 473/600 (78%), Gaps = 7/600 (1%)

Query: 1 MRRLLIGGLLALLPGAVLRAQPLDWPSLPYDYVAQGESLRDVLANFGANYDASVIVSDKV 60
+R+L G LL L + AQ LDW +PY YVA+GESLRD+L +FGANYDA+V+VSDK+
Sbjct: 9 FKRVLTGTLLLLSSYSW--AQELDWLPIPYVYVAKGESLRDLLTDFGANYDATVVVSDKI 66

Query: 61 NDQVSGRFDLESPQAFLQLMASLYNLGWYYDGTVLYVFKTTEMQSRLVRLEQVGEAELKR 120
ND+VSG+F+ ++PQ FLQ +ASLYNL WYYDG VLY+FK +E+ SRL+RL++ AELK+
Sbjct: 67 NDKVSGQFEHDNPQDFLQHIASLYNLVWYYDGNVLYIFKNSEVASRLIRLQESEAAELKQ 126

Query: 121 ALTAAGIWEPRFGWRADPSGRLVHVSGPGRYLELVEQTAQVLEQQYTLRSEKTGDLSVEI 180
AL +GIWEPRFGWR D S RLV+VSGP RYLELVEQTA LEQQ +RSEKTG L++EI
Sbjct: 127 ALQRSGIWEPRFGWRPDASNRLVYVSGPPRYLELVEQTAAALEQQTQIRSEKTGALAIEI 186

Query: 181 FPLRYAVAEDRKIEYRDDEIEAPGIASILSRVLSDANVVAVGDEPGKLRPGP--QSSHAV 238
FPL+YA A DR I YRDDE+ APG+A+IL RVLSDA + V + ++ S+ A
Sbjct: 187 FPLKYASASDRTIHYRDDEVAAPGVATILQRVLSDATIQQVTVDNQRIPQAATRASAQAR 246

Query: 239 VQAEPSLNAVVVRDHKDRLPMYRRLIEALDRPSARIEVGLSIIDINAENLAQLGVDWSAG 298
V+A+PSLNA++VRD +R+PMY+RLI ALD+PSARIEV LSI+DINA+ L +LGVDW G
Sbjct: 247 VEADPSLNAIIVRDSPERMPMYQRLIHALDKPSARIEVALSIVDINADQLTELGVDWRVG 306

Query: 299 IRLGNNKSIQIRTTGQDSEEGGGAGNGAVGSLVDSRGLDFLLAKVTLLQSQGQAQIGSRP 358
IR GNN + I+TTG S A NGA+GSLVD+RGLD+LLA+V LL+++G AQ+ SRP
Sbjct: 307 IRTGNNHQVVIKTTGDQS---NIASNGALGSLVDARGLDYLLARVNLLENEGSAQVVSRP 363

Query: 359 TLLTQENTQAVLDQSETYYVRVTGERVAELKAITYGTMLKMTPRVVTLGDTPEISLSLHI 418
TLLTQEN QAV+D SETYYV+VTG+ VAELK ITYGTML+MTPRV+T GD EISL+LHI
Sbjct: 364 TLLTQENAQAVIDHSETYYVKVTGKEVAELKGITYGTMLRMTPRVLTQGDKSEISLNLHI 423

Query: 419 EDGSQKPNSAGLDKIPTINRTVIDTIARVGHGQSLLIGGIYRDELSQSQRKVPWLGDIPY 478
EDG+QKPNS+G++ IPTI+RTV+DT+ARVGHGQSL+IGGIYRDELS + KVP LGDIPY
Sbjct: 424 EDGNQKPNSSGIEGIPTISRTVVDTVARVGHGQSLIIGGIYRDELSVALSKVPLLGDIPY 483

Query: 479 LGALFRTTADTVRRSVRLFLIEPRLIDDGVGHYLALNNRRDLRGGLLEIDELSNQSLSLR 538
+GALFR ++ RR+VRLF+IEPR+ID+G+ H+LAL N +DLR G+L +DE+SNQS +L
Sbjct: 484 IGALFRRKSELTRRTVRLFIIEPRIIDEGIAHHLALGNGQDLRTGILTVDEISNQSTTLN 543

Query: 539 KLLGSARCQALAPARAEQEHLRQAGQGSFLTPCRMGAQEGWRVTDGACPKDGAWCVGAER 598
KLLG ++CQ L A+ Q+ L Q + S+LT C+M GWRV +GAC +WCV A +
Sbjct: 544 KLLGGSQCQPLNKAQEVQKWLSQNNKSSYLTQCKMDKSLGWRVVEGACTPAQSWCVSAPK 603


103  HWH78_RS08165  HWH78_RS08200  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias   Product
HWH78_RS08165  -1      9        0.496269  YopR family T3SS polymerization control protein
HWH78_RS08170  -1      8        0.446699  SctI family type III secretion system inner rod
HWH78_RS08175  -1      8        0.416677  SctJ family type III secretion inner membrane
HWH78_RS08180  0       9        0.765138  SctK family type III secretion system sorting
HWH78_RS08185  -1      8        0.480869  SctL family type III secretion system stator
HWH78_RS08190  -1      10       0.473985  beta-glucosidase BglX
HWH78_RS08195  1       9        0.526882  bifunctional diguanylate
HWH78_RS08200  0       10       0.530806  ribonuclease E inhibitor RraB
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS08165  PF09025  205    2e-71    YopR Core
		>PF09025#YopR Core

Length = 143

Score = 205 bits (522), Expect = 2e-71
Identities = 143/143 (100%), Positives = 143/143 (100%)

Query: 1 MSRIDTPPGFAVYPSASPKAANLPAVDQVLAFEQALGGEPPAAGRRLAGLENGALGERLL 60
MSRIDTPPGFAVYPSASPKAANLPAVDQVLAFEQALGGEPPAAGRRLAGLENGALGERLL
Sbjct: 1 MSRIDTPPGFAVYPSASPKAANLPAVDQVLAFEQALGGEPPAAGRRLAGLENGALGERLL 60

Query: 61 QRFAQPLQGLEADRLELKAMLRAELPLGRQQQTFLLQLLGAVEHAPGGEYLAQLARRELQ 120
QRFAQPLQGLEADRLELKAMLRAELPLGRQQQTFLLQLLGAVEHAPGGEYLAQLARRELQ
Sbjct: 61 QRFAQPLQGLEADRLELKAMLRAELPLGRQQQTFLLQLLGAVEHAPGGEYLAQLARRELQ 120

Query: 121 VLIPLNGMLDNLVRNSHKLDLES 143
VLIPLNGMLDNLVRNSHKLDLES
Sbjct: 121 VLIPLNGMLDNLVRNSHKLDLES 143


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS08175  FLGMRINGFLIF  75     1e-17    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 75.0 bits (184), Expect = 1e-17
Identities = 33/165 (20%), Positives = 69/165 (41%), Gaps = 6/165 (3%)

Query: 27 LYTGISQKEGNEMLALLRSEGVSADKQADKDGTVRLLVEESDIAEAVEVLKRKGYPRENF 86
L++ +S ++G ++A L + + V + E L ++G P+
Sbjct: 53 LFSNLSDQDGGAIVAQLTQMNIPYR--FANGSGA-IEVPADKVHELRLRLAQQGLPKGG- 108

Query: 87 STLKDVFPKDGLISSPIEERARLNYAKAQEISHTLSEIDGVLVARVHVVLPEERDGLGRK 146
+ ++ ++ S E+ A E++ T+ + V ARVH+ +P + R+
Sbjct: 109 AVGFELLDQEKFGISQFSEQVNYQRALEGELARTIETLGPVKSARVHLAMP-KPSLFVRE 167

Query: 147 SSPASASVFIKHAADVQLD-AYVPQIKQLVNNGIEGLSYDRISVV 190
SASV + LD + + LV++ + GL +++V
Sbjct: 168 QKSPSASVTVTLEPGRALDEGQISAVVHLVSSAVAGLPPGNVTLV 212


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
HWH78_RS08185  TYPE4SSCAGX  30     0.008    Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein signature.

Length = 522

Score = 29.8 bits (66), Expect = 0.008
Identities = 27/102 (26%), Positives = 45/102 (44%), Gaps = 8/102 (7%)

Query: 21 LRARDYQDYLSANRLVEAA--------RERAAEIEREAHEVYQEQKRLGWEAGLEEARLR 72
L RDYQ++L +L+ A +++A E E+EA E Q+ ++ E EE
Sbjct: 117 LMTRDYQEFLKTKKLIVDAPDPKELEEQKKALEKEKEAKEQAQKAQKDKREKRKEERAKN 176

Query: 73 QAGLIQETLLRCNRYYRQVDRQLGEVVLQAVRKVLRHYDAVE 114
+A L T N ++ L E++ Q L + +E
Sbjct: 177 RANLENLTNAMSNPQNLSNNKNLSELIKQQRENELDQMERLE 218


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS08200  SECBCHAPRONE  26     0.025    Bacterial protein-transport SecB chaperone protein ...
		>SECBCHAPRONE#Bacterial protein-transport SecB chaperone protein signature.

Length = 170

Score = 26.4 bits (58), Expect = 0.025
Identities = 8/30 (26%), Positives = 15/30 (50%)

Query: 19 EGGFDFARIHPIDFFAIFPSEREARQAAGQ 48
G F + P++F A+F + ++ A Q
Sbjct: 131 RGTFPALNLSPVNFDALFMDYLQRQEQAEQ 160


104  HWH78_RS09695  HWH78_RS09715  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS09695  0       11       2.259771   hypothetical protein
HWH78_RS09700  0       12       1.351254   multidrug efflux RND transporter permease
HWH78_RS09705  -1      8        1.182475   multidrug efflux RND transporter periplasmic
HWH78_RS09710  -1      8        -0.284293  TetR family transcriptional regulator AmrR
HWH78_RS09715  -2      9        -0.226856  DUF3203 family protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS09695  VACJLIPOPROT  26     0.009    VacJ lipoprotein signature.
		>VACJLIPOPROT#VacJ lipoprotein signature.

Length = 251

Score = 26.4 bits (58), Expect = 0.009
Identities = 14/35 (40%), Positives = 18/35 (51%), Gaps = 2/35 (5%)

Query: 5 KLSLPTLALCVGLLGACS--PTPRQPRAAPIVPAN 37
KL L LAL LL C+ T +Q R+ P+ N
Sbjct: 2 KLRLSALALGTTLLVGCASSGTDQQGRSDPLEGFN 36


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS09700  ACRIFLAVINRP  1092   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1092 bits (2826), Expect = 0.0
Identities = 508/1033 (49%), Positives = 710/1033 (68%), Gaps = 6/1033 (0%)

Query: 1 MARFFIDRPVFAWVISLLIVLAGVLAIRFLPVAQYPDIAPPVVNVSASYPGASAKVVEEA 60
MA FFI RP+FAWV+++++++AG LAI LPVAQYP IAPP V+VSA+YPGA A+ V++
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTAIIEREMNGAPGLLYTKATS-STGQASLTLTFRQGVNADLAAVEVQNRLKIVESRLPE 119
VT +IE+ MNG L+Y +TS S G ++TLTF+ G + D+A V+VQN+L++ LP+
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 120 SVRRDGIYVEKAADSIQLIVTLTSSSGRYDAMELGEIASSNVLQALRRVEGVGKVETWGA 179
V++ GI VEK++ S ++ S + ++ + +SNV L R+ GVG V+ +GA
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 180 EYAMRIWPDPAKLTSMNLSASDLVNAVRRHNARLTVGDIGNLGVPDSAPISATVKVDDTL 239
+YAMRIW D L L+ D++N ++ N ++ G +G ++A++
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 240 VTPEQFGEIPLRIRADGGAIRLRDVARVEFGQSEYGFVSRVNQMTATGLAVKMAPGSNAV 299
PE+FG++ LR+ +DG +RL+DVARVE G Y ++R+N A GL +K+A G+NA+
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 300 ATAKRIRATLDELSRYFPEGVSYNIPYDTSAFVEISIRKVVSTLLEAMLLVFAVMYLFMQ 359
TAK I+A L EL +FP+G+ PYDT+ FV++SI +VV TL EA++LVF VMYLF+Q
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 360 NFRATLIPTLVVPVALLGTFTVMLGLGFSINVLTMFGMVLAIGILVDDAIIVVENVERLM 419
N RATLIPT+ VPV LLGTF ++ G+SIN LTMFGMVLAIG+LVDDAI+VVENVER+M
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 420 AEEGLSPHDATVKAMRQISGAIVGITVVLVSVFVPMAFFSGAVGNIYRQFAVTLAVSIGF 479
E+ L P +AT K+M QI GA+VGI +VL +VF+PMAFF G+ G IYRQF++T+ ++
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 480 SAFLALSLTPALCATLLRPIDADHHE-KRGFFGWFNRAFLRLTGRYRNAVAGILARPIRW 538
S +AL LTPALCATLL+P+ A+HHE K GFFGWFN F Y N+V IL R+
Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540

Query: 539 MLVYALVIGVVALLFVRLPQAFLPEEDQGDFMIMVMQPEGTPMAETMANVGDVERYLAEH 598
+L+YAL++ + +LF+RLP +FLPEEDQG F+ M+ P G T + V Y ++
Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600

Query: 599 EP--VAYAYAVGGFSLYGDGTSSAMIFATLKDWSERREASQHVGAIVERINQRFAGLPNR 656
E V + V GFS G ++ M F +LK W ER A++ R + +
Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660

Query: 657 TVYAMNSPPLPDLGSTSGFDFRLQDRGGVGYEALVKARDQLLARAAEDP-RLANVMFAGQ 715
V N P + +LG+ +GFDF L D+ G+G++AL +AR+QLL AA+ P L +V G
Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720

Query: 716 GEAPQIRLDIDRRKAETLGVSMDEINTTLAVMFGSDYIGDFMHGSQVRKVVVQADGAKRL 775
+ Q +L++D+ KA+ LGVS+ +IN T++ G Y+ DF+ +V+K+ VQAD R+
Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780

Query: 776 GIDDIGRLHVRNEQGEMVPLATFAKAAWTLGPPQLTRYNGYPSFNLEGQAAPGYSSGEAM 835
+D+ +L+VR+ GEMVP + F + W G P+L RYNG PS ++G+AAPG SSG+AM
Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840

Query: 836 QAMEQLMQGLPEGIAHEWSGQSFEERLSGAQAPALFALSVLIVFLALAALYESWSIPLAV 895
ME L LP GI ++W+G S++ERLSG QAPAL A+S ++VFL LAALYESWSIP++V
Sbjct: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900

Query: 896 ILVVPLGVLGALLGVSLRGLPNDIYFKVGLITIIGLSAKNAILIIEVAKD-HYQEGMSLL 954
+LVVPLG++G LL +L ND+YF VGL+T IGLSAKNAILI+E AKD +EG ++
Sbjct: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960

Query: 955 QATLEAARLRLRPIVMTSLAFGFGVVPLALSSGAGSGAQVAIGTGVLGGIVTATVLAVFL 1014
+ATL A R+RLRPI+MTSLAF GV+PLA+S+GAGSGAQ A+G GV+GG+V+AT+LA+F
Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020

Query: 1015 VPLFFLVVGRLFR 1027
VP+FF+V+ R F+
Sbjct: 1021 VPVFFVVIRRCFK 1033


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
HWH78_RS09705  RTXTOXIND  42     3e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 41.7 bits (98), Expect = 3e-06
Identities = 14/83 (16%), Positives = 30/83 (36%), Gaps = 3/83 (3%)

Query: 117 ASHAAAADKLKRYADLIKDRAISERE--YTEAQTDARQALAQIASAKAELEQARLRLGYA 174
+ + ++++ K+ + E RQ I EL + R +
Sbjct: 269 RVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQAS 328

Query: 175 TVTAPIDGR-ARRALVTEGALVG 196
+ AP+ + + + TEG +V
Sbjct: 329 VIRAPVSVKVQQLKVHTEGGVVT 351



Score = 41.4 bits (97), Expect = 4e-06
Identities = 24/137 (17%), Positives = 47/137 (34%), Gaps = 7/137 (5%)

Query: 67 EVRARVAGIVTRRLYEEGQDVRAGTVLFQIDPAPLKAALDISRGALARAEASHAAAADKL 126
E++ IV + +EG+ VR G VL ++ +A ++ +L +A L
Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLE-QTRYQIL 156

Query: 127 KRYADLIKDRAISEREYT------EAQTDARQALAQIASAKAELEQARLRLGYATVTAPI 180
R +L K + + E + +L + + + ++ + L A
Sbjct: 157 SRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAER 216

Query: 181 DGRARRALVTEGALVGE 197
R E E
Sbjct: 217 LTVLARINRYENLSRVE 233


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS09710  HTHTETR  97     3e-27    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 97.0 bits (241), Expect = 3e-27
Identities = 57/209 (27%), Positives = 100/209 (47%), Gaps = 5/209 (2%)

Query: 1 MARKTKEESQKTRDGILDAAERVFLEKGVGTTAMADLADAAGVSRGAVYGHYKNKIEVCL 60
MARKTK+E+Q+TR ILD A R+F ++GV +T++ ++A AAGV+RGA+Y H+K+K ++
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 61 AMCDRAFGQI-EVPDENA--RVPALDILLRAGM-GFLRQCCEPGSVQRVLEILYLKCERS 116
+ + + I E+ E +LR + L + ++EI++ KCE
Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120

Query: 117 DENEPLLRRRELLEKQGQRFGLRQIRRAVERGELPARLDVELASIYLQSLWDGICGTLAW 176
E + + + L + + ++ +E LPA L A+I ++ G+ +
Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180

Query: 177 TERLRDDPWNRAERMFRAGLDSLRSSPYL 205
+ D A L+ P L
Sbjct: 181 APQSFDLK-KEARDYVAILLEMYLLCPTL 208


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS09715  PF05272  28     0.002    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.1 bits (62), Expect = 0.002
Identities = 18/68 (26%), Positives = 26/68 (38%), Gaps = 11/68 (16%)

Query: 14 VEIEGSRHRAPVDSLRIGTDAEARLSVLYIDGKRLHISEED---------AQRLVVAGAE 64
V + G + + R AEA LY+ G+R S ED RLV G +
Sbjct: 711 VLVPGRANLVWLQKFRGQLFAEAL--HLYLAGERYFPSPEDEEIYFRPEQELRLVETGVQ 768

Query: 65 DQRRHLMA 72
+ L+
Sbjct: 769 GRLWALLT 776


105  HWH78_RS12695  HWH78_RS12740  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias   Product
HWH78_RS12695  0       13       1.922639  efflux RND transporter permease subunit
HWH78_RS12700  1       13       2.863219  efflux RND transporter periplasmic adaptor
HWH78_RS12705  1       14       2.173340  TolC family protein
HWH78_RS12710  2       17       1.553686  heavy metal response regulator transcription
HWH78_RS12715  2       17       1.726910  sensor histidine kinase
HWH78_RS12720  1       16       1.556067  hypothetical protein
HWH78_RS12725  1       15       1.715959  multidrug efflux RND transporter outer membrane
HWH78_RS12730  0       14       1.711065  multidrug efflux RND transporter permease
HWH78_RS12735  -1      13       1.882948  multidrug efflux RND transporter permease
HWH78_RS12740  -3      11       1.809279  multidrug efflux RND transporter periplasmic
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS12695  ACRIFLAVINRP  811    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 811 bits (2096), Expect = 0.0
Identities = 237/1055 (22%), Positives = 435/1055 (41%), Gaps = 56/1055 (5%)

Query: 5 IIQFAIEQRWLVLLAVLGMAGVGIGSYQKLSIDAVPDITNVQVQINTAAPGYSPLEVEQR 64
+ F I + + + + G + +L + P I V ++ PG V+
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 65 ITYPVETVMAGLPGLQETRSLS-RPGISQVTVIFEEGTDIYFARQQVNERLSTAREQLPE 123
+T +E M G+ L S S G +T+ F+ GTD A+ QV +L A LP+
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 124 DISPTLGPISTGLGEIYLWTVEAEEGATKEDGSAYTPTDLRTIQDWIIRPQLRNVKGVAE 183
++ + YL D T D+ ++ L + GV +
Sbjct: 121 EVQQQGISVEKSSS-SYLMVA-----GFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGD 174

Query: 184 INTIGGYAKQFLIAPDPKKLAAYKLTLGDLQNAVLRNNENVGAGYI------ERRGEQLL 237
+ G I D L YKLT D+ N + N+ + AG + +
Sbjct: 175 VQLFGA-QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNAS 233

Query: 238 IRAPGQVKDMDDIRGIIV-SNVDGVPIRIRDVAEVGLGKELRTGAATENGREVVLGTVFM 296
I A + K+ ++ + + N DG +R++DVA V LG E A NG+ + +
Sbjct: 234 IIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKL 293

Query: 297 LIGENSREVAQAVGQRLEEINRTLPKGVKAITVYDRTTLVDKAVATVKKNLVEGAALVIA 356
G N+ + A+A+ +L E+ P+G+K + YD T V ++ V K L E LV
Sbjct: 294 ATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFL 353

Query: 357 VLFLFLGNIRAALITATIIPLSMLFTFTGMVGNRVSANLMSLG--ALDFGIIVDGAVVIV 414
V++LFL N+RA LI +P+ +L TF + S N +++ L G++VD A+V+V
Sbjct: 354 VMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVV 413

Query: 415 ENAIRRLAHAQAHHGRQLTRAERFHEVFAASREARRALVFGQIIIMVVYLPIFALTGVEG 474
EN + + + + ALV +++ V++P+ G G
Sbjct: 414 EN---------VERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTG 464

Query: 475 KMFHPMAFTVVTALLGAMILSVTFVPAAIALFITGKVKEEE----------NFVMRRARL 524
++ + T+V+A+ ++++++ PA A + E N +
Sbjct: 465 AIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVN 524

Query: 525 AYEPALRWVLGHRALVVGGALGAILLTGLVASRMGSEFIPSLSEGDFAMQGLRVPGTSL- 583
Y ++ +LG + + ++ R+ S F+P +G F G +
Sbjct: 525 HYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQE 584

Query: 584 -TQSVEMQQTLEKKLMGKFPEIERVFARTGTAEIASDLMPPNASDSYVMLKPQSQWPDPK 642
TQ V + Q + L + +E VF G + NA ++V LKP + +
Sbjct: 585 RTQKV-LDQVTDYYLKNEKANVESVFTVNG---FSFSGQAQNAGMAFVSLKPWEERNGDE 640

Query: 643 KSREALLEELQAAALEVP-GSVYEFSQPIQLRFNELISGVRSDVA-VKVFGDDMQVLNDT 700
S EA++ + ++ G V F+ P EL + D + G L
Sbjct: 641 NSAEAVIHRAKMELGKIRDGFVIPFNMP---AIVELGTATGFDFELIDQAGLGHDALTQA 697

Query: 701 AEKI-SKVLQSIDGASEVKVEQTTGLPVLTVDIDRDKAARFGLNVGDIQDTVATALGGRN 759
++ Q V+ +++D++KA G+++ DI T++TALGG
Sbjct: 698 RNQLLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTY 757

Query: 760 AGTLFEGDRRFDIVIRLPETLRADLPALSNLLIPLPPNNLARIDFIPLSDVARLDLSPGP 819
+ R + ++ R + L + + +P S G
Sbjct: 758 VNDFIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANG-----EMVPFSAFTTSHWVYGS 812

Query: 820 NQISRENGKRRIVVSANVRGRDIGSFVLEAQQKLQDGVKIPAGYWTTWGGQFEQLQSAAK 879
++ R NG + + + + L K+PAG W G Q + +
Sbjct: 813 PRLERYNGLPSMEIQGEAAPGTSSGDAMALMENLAS--KLPAGIGYDWTGMSYQERLSGN 870

Query: 880 RLQVVVPVALLLVFTLLFAMFNNVKDGLLVFTGIPFALTGGVLALWLRGIPLSISAAVGF 939
+ +V ++ ++VF L A++ + + V +P + G +LA L + VG
Sbjct: 871 QAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGL 930

Query: 940 IALSGVAVLNGLVMISFIRNLL-QEGRSLDQAVWEGAITRLRPVLMTALVASLGFVPMAL 998
+ G++ N ++++ F ++L+ +EG+ + +A RLRP+LMT+L LG +P+A+
Sbjct: 931 LTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAI 990

Query: 999 ATGTGAEVQRPLATVVIGGILSSTMLTLLVLPVLY 1033
+ G G+ Q + V+GG++S+T+L + +PV +
Sbjct: 991 SNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFF 1025



Score = 70.6 bits (173), Expect = 2e-14
Identities = 70/527 (13%), Positives = 160/527 (30%), Gaps = 46/527 (8%)

Query: 2 FERIIQFAIEQRWLVLLAVLGMAGVGIGSYQKLSIDAVPDITNVQVQINTAAPGYSPLEV 61
+ + + LL + + + +L +P+ P + E
Sbjct: 526 YTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQER 585

Query: 62 EQRI---------TYPVETVMA--GLPGLQETRSLSRPGISQVTV-IFEEGTDIYFARQQ 109
Q++ V + + G + G++ V++ +EE + +
Sbjct: 586 TQKVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEA 645

Query: 110 VNERLSTAREQLPED-ISPTLGPISTGLGEIYLWTVEAEEGATKEDGSAYTPTDLRTIQD 168
V R ++ + + P P LG D + L ++
Sbjct: 646 VIHRAKMELGKIRDGFVIPFNMPAIVELG------TATGFDFELIDQAGLGHDALTQARN 699

Query: 169 WIIRPQLRNVKGVAEINTIGGY-AKQFLIAPDPKKLAAYKLTLGDLQNAVLRNNENVGAG 227
++ ++ + + G QF + D +K A ++L D+ +
Sbjct: 700 QLLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVN 759

Query: 228 YIERRGEQ--LLIRAPGQ-VKDMDDIRGIIVSNVDGVPIRIRDVAEVGLGKELRTGAAT- 283
RG L ++A + +D+ + V + +G + G+
Sbjct: 760 DFIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTS----HWVYGSPRL 815

Query: 284 --ENGREVVLGTVFMLIGENSREVAQAVGQRLEEINRTLPKGVKAITVYDRTTLVDKAVA 341
NG + G +S + + E + LP G+ + +
Sbjct: 816 ERYNGLPSMEIQGEAAPGTSSGDAMALM----ENLASKLPAGI-GYDWTGMSYQERLSGN 870

Query: 342 TVKKNLVEGAALVIAVLFLFLGNIRAALITATIIPLSMLFTFTGMVGNRVSANLMSLGAL 401
+ +V L + + ++PL ++ ++ + L
Sbjct: 871 QAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGL 930

Query: 402 --DFGIIVDGAVVIVENAIRRLAHAQAHHGRQLTRAERFHEVFAASREARRALVFGQIII 459
G+ A++IVE A G+ + A A R R ++ +
Sbjct: 931 LTTIGLSAKNAILIVEFAK----DLMEKEGKGVVEA-----TLMAVRMRLRPILMTSLAF 981

Query: 460 MVVYLPIFALTGVEGKMFHPMAFTVVTALLGAMILSVTFVPAAIALF 506
++ LP+ G + + V+ ++ A +L++ FVP +
Sbjct: 982 ILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVI 1028


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
HWH78_RS12700  RTXTOXIND  45     3e-07    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 44.8 bits (106), Expect = 3e-07
Identities = 40/212 (18%), Positives = 82/212 (38%), Gaps = 22/212 (10%)

Query: 88 ISSPQLSDQRSEFAAAQRRLSLAQSTYKREQQLWKEGISAEQEFLLARQGLQ-EAEIALN 146
I+ + +Q +++ A L + +S + +Q+ E +SA++E+ L Q + E L
Sbjct: 249 IAKHAVLEQENKYVEAVNELRVYKS---QLEQIESEILSAKEEYQLVTQLFKNEILDKLR 305

Query: 147 NARAKIAALGG--NPSLQGGNRYELRAPFAGVLVE-KHLTQGEPVDGTANVFTLS-DLSS 202
I L + + +RAP + + + K T+G V + + + +
Sbjct: 306 QTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDT 365

Query: 203 VWATFNVPAQLLGQVRVGSKVKVLAQALDS----EVEGTVSYIG-DLLGEQTRAATARVT 257
+ T V + +G + VG + +A + G V I D + +Q V
Sbjct: 366 LEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVI 425

Query: 258 LSNPEST---------WRPGLFVSVQVAEATR 280
+S E+ G+ V+ ++ R
Sbjct: 426 ISIEENCLSTGNKNIPLSSGMAVTAEIKTGMR 457



Score = 30.6 bits (69), Expect = 0.010
Identities = 19/119 (15%), Positives = 42/119 (35%), Gaps = 13/119 (10%)

Query: 40 LAQVVSLPGEIRFNEDRTAHIVPRLPGIVDSVPANLGQAVKQGELLAVISSPQLSDQRSE 99
+ V + G++ + I P IV + G++V++G++L +++ ++
Sbjct: 80 VEIVATANGKLTHSGRSKE-IKPIENSIVKEIIVKEGESVRKGDVLLKLTALGA---EAD 135

Query: 100 FAAAQRRLSLAQSTYKREQQLWKE---------GISAEQEFLLARQGLQEAEIALNNAR 149
Q L A+ R Q L + + E F + +L +
Sbjct: 136 TLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQ 194


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
HWH78_RS12710  HTHFIS  81     7e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 80.6 bits (199), Expect = 7e-20
Identities = 34/145 (23%), Positives = 63/145 (43%), Gaps = 8/145 (5%)

Query: 2 RILIIEDEVKTADYLHQGLTESGYIVDRANDGIDGLHMALQHPYELVILDVNLPGIDGWD 61
IL+ +D+ L+Q L+ +GY V ++ +LV+ DV +P + +D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 LLRRLRER-SSARVMMLTGHGRLTDKVRGLDLGADDFMVKPFQFPELLARVRSLLRRHDQ 120
LL R+++ V++++ ++ + GA D++ KPF EL+ + L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE--- 121

Query: 121 APMQDVLRVADLELDASRHRAFRGR 145
R + LE D+ GR
Sbjct: 122 ----PKRRPSKLEDDSQDGMPLVGR 142


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS12715  PF06580  36     3e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 35.6 bits (82), Expect = 3e-04
Identities = 25/105 (23%), Positives = 40/105 (38%), Gaps = 24/105 (22%)

Query: 370 LVSNAVRH----TPQGGRIDVRIGERAGHTEVRVSNDGPGIPPEYLPHLFERFYRRAGRQ 425
LV N ++H PQGG+I ++ + G + V N G
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKN---------------- 306

Query: 426 TGAQAGTGLGLAIV-QSIMAYHGGRAEAE-SVPQQKTHLRLLFPS 468
+ TG GL V + + +G A+ + S Q K + +L P
Sbjct: 307 --TKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIPG 349


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
HWH78_RS12725  RTXTOXIND  38     6e-05    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 38.3 bits (89), Expect = 6e-05
Identities = 25/216 (11%), Positives = 62/216 (28%), Gaps = 30/216 (13%)

Query: 229 RADVAQARTQLKSTQAQAIDLKYQ--RAQLEHAIAVLVGLPPAQFNLPPVASVPKLPDLP 286
+ A TQ+ + + + R Q+ L LP + P ++
Sbjct: 126 KLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVL 185

Query: 287 AVVP----------SQLLERRPDIASAERKVISANAQIGVAKAAY------FPDLTLSAA 330
+ +Q ++ ++ + ++ A+I + D +
Sbjct: 186 RLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLH 245

Query: 331 GGYRSGSLSNWISTPNRFWSIGPQFAMTLFDGGLIGSQVDQAEATYDQTVATYRQTVLDG 390
+ + N++ + + I S++ A+ Y ++ +LD
Sbjct: 246 KQA--IAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDK 303

Query: 391 FREVEDYLVQLSVLDEESGVQREALESAREALRLAE 426
R+ D + L L E + +
Sbjct: 304 LRQTTDNIGLL----------TLELAKNEERQQASV 329



Score = 31.3 bits (71), Expect = 0.009
Identities = 18/150 (12%), Positives = 43/150 (28%), Gaps = 18/150 (12%)

Query: 171 ASAADLAAVRLSQQSQLAQNYLQLRVMDEQIRLLNDTVTAYERSLKVAENK-------YR 223
+ + + Q+Q Q L L + + + YE +V +++
Sbjct: 186 RLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLH 245

Query: 224 AGIVTRADVAQARTQLKSTQAQAIDLKYQRAQLEHAIAVLVGLPPAQFNLPPVASVPKLP 283
+ + V + + + K Q Q+E I A+ V
Sbjct: 246 KQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILS------AKEEYQLVTQ----- 294

Query: 284 DLPAVVPSQLLERRPDIASAERKVISANAQ 313
+ +L + +I ++ +
Sbjct: 295 LFKNEILDKLRQTTDNIGLLTLELAKNEER 324


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS12730  ACRIFLAVINRP  816    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 816 bits (2110), Expect = 0.0
Identities = 290/1034 (28%), Positives = 512/1034 (49%), Gaps = 31/1034 (2%)

Query: 7 FIRRPVATTLLTLALLLAGTLSFGLLPVAPLPNVDFPAIVVSASLPGASPETMASSVATP 66
FIRRP+ +L + L++AG L+ LPVA P + PA+ VSA+ PGA +T+ +V
Sbjct: 5 FIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQV 64

Query: 67 LERSLGRIAGISEMTSSS-SLGSTTVVLVFDLEKDIDGAAREVQAAINGAMSLLPSGMPN 125
+E+++ I + M+S+S S GS T+ L F D D A +VQ + A LLP +
Sbjct: 65 IEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQQ 124

Query: 126 NPSYRKANPSDMPIMVLTLTSET--QSRGEMYDLASTVLAPKLSQVQGVGQVSIGGSSLP 183
S +MV S+ ++ ++ D ++ + LS++ GVG V + G+
Sbjct: 125 -QGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY- 182

Query: 184 AVRVDLNPDAMSQYGLSLDSVRTAIAAANSNGPKG------AVEKDDKHWQVDANDQLRK 237
A+R+ L+ D +++Y L+ V + N G A+ + + A + +
Sbjct: 183 AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKN 242

Query: 238 AREYEPLVIHYNADNGAAVRLGDVAKVSDSVEDVRNAGFSDDLPAVLLIVTRQPGANIIE 297
E+ + + N+D G+ VRL DVA+V E+ + PA L + GAN ++
Sbjct: 243 PEEFGKVTLRVNSD-GSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301

Query: 298 ATDAIHAQLPVLQELLGPQVKLNVMDDRSPSIRASLEEAELTLLISVALVILVVFLFLRN 357
AI A+L LQ +K+ D +P ++ S+ E TL ++ LV LV++LFL+N
Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361

Query: 358 GRATLIPSLAVPVSLIGTFAVMYLCDFSLNNLSLMALIIATGFVVDDAIVVVENIARRI- 416
RATLIP++AVPV L+GTFA++ +S+N L++ +++A G +VDDAIVVVEN+ R +
Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421

Query: 417 EEGDPPIQAAITGARQVGFTVLSMTLSLVAVFIPLLLMGGLTGRLFREFAVTLSAAILVS 476
E+ PP +A Q+ ++ + + L AVFIP+ GG TG ++R+F++T+ +A+ +S
Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481

Query: 477 LVVSLTLTPMLCARLLRPLKRPEG---ASLARRSDRFFAAFMLRYRASLGWALEHSRLMV 533
++V+L LTP LCA LL+P+ + F + Y S+G L + +
Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541

Query: 534 VIMLACIAMNLWLFVVVPKGFLPQQDSGRLRGYAVADQSISFQSLSAKMGEYRKILSSDP 593
+I +A + LF+ +P FLP++D G + + + + +
Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601

Query: 594 AVE-----NVVGFIGGGRWQSSNTGSFFVTLKPIGERDP----VEKVLTRLRERIAKVPG 644
V GF G Q+ N G FV+LKP ER+ E V+ R + + K+
Sbjct: 602 KANVESVFTVNGFSFSG--QAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659

Query: 645 AALYLNAGQDVRLGGRDSNAQYEFTLRS-DDLTLLREWAPKVEAAMRKLP-QLVDVNSDS 702
+ + G + +E ++ L + ++ + P LV V +
Sbjct: 660 GFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNG 719

Query: 703 QDKGVQTRLVIDRDRAATLGINVEMVDAVLNDSFGQRQVSTIFNPLNQYRVVMEVDQQYQ 762
+ Q +L +D+++A LG+++ ++ ++ + G V+ + ++ ++ D +++
Sbjct: 720 LEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFR 779

Query: 763 QSPEILRQVQVIGNDGQRVPLSAFSRYEPSRAPLEVNHQGQFAATTLSFNLAPGAQIGPT 822
PE + ++ V +G+ VP SAF+ + + + APG G
Sbjct: 780 MLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDA 839

Query: 823 REAIMQALEPLHIPVDVQTSFEGNAGAVQDTQNQMPWLILLALLAVYIVLGILYESYVHP 882
+ L P + + G + + + NQ P L+ ++ + V++ L LYES+ P
Sbjct: 840 MALMENLASKL--PAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIP 897

Query: 883 LTILSTLPSAGVGALLALILCRSELSLIALIGIILLIGIVKKNAIMMIDFALEAERNHGL 942
++++ +P VG LLA L + + ++G++ IG+ KNAI++++FA + G
Sbjct: 898 VSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGK 957

Query: 943 SPREAILEACMMRFRPIMMTTLAALLGALPLIFGIGGDAALRRPLGITIVGGLIGSQLLT 1002
EA L A MR RPI+MT+LA +LG LPL G + + +GI ++GG++ + LL
Sbjct: 958 GVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLA 1017

Query: 1003 LYTTPVVYLYLDRL 1016
++ PV ++ + R
Sbjct: 1018 IFFVPVFFVVIRRC 1031



Score = 83.0 bits (205), Expect = 4e-18
Identities = 73/366 (19%), Positives = 136/366 (37%), Gaps = 15/366 (4%)

Query: 665 QYEFTLRSDDLTL--LREWAPK-VEAAMRKLPQLVDVNSDSQDKGVQTRLVIDRDRAATL 721
F + T + ++ V+ + +L + DV R+ +D D
Sbjct: 139 VAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY--AMRIWLDADLLNKY 196

Query: 722 GINVEMVDAVL---NDSFGQRQVSTIFNPLNQYRVVMEVDQQYQQSPEILRQVQVIGN-D 777
+ V L ND Q+ Q + Q ++PE +V + N D
Sbjct: 197 KLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKNPEEFGKVTLRVNSD 256

Query: 778 GQRVPLSAFSRYEPSRAPLE--VNHQGQFAATTLSFNLAPGAQIGPTREAIMQALEPLH- 834
G V L +R E G+ A L LA GA T +AI L L
Sbjct: 257 GSVVRLKDVARVELGGENYNVIARINGK-PAAGLGIKLATGANALDTAKAIKAKLAELQP 315

Query: 835 -IPVDVQ-TSFEGNAGAVQDTQNQMPWLILLALLAVYIVLGILYESYVHPLTILSTLPSA 892
P ++ VQ + +++ + A++ V++V+ + ++ L +P
Sbjct: 316 FFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVV 375

Query: 893 GVGALLALILCRSELSLIALIGIILLIGIVKKNAIMMIDFALEAERNHGLSPREAILEAC 952
+G L ++ + + G++L IG++ +AI++++ L P+EA ++
Sbjct: 376 LLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSM 435

Query: 953 MMRFRPIMMTTLAALLGALPLIFGIGGDAALRRPLGITIVGGLIGSQLLTLYTTPVVYLY 1012
++ + +P+ F G A+ R ITIV + S L+ L TP +
Sbjct: 436 SQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCAT 495

Query: 1013 LDRLRH 1018
L +
Sbjct: 496 LLKPVS 501


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS12735  ACRIFLAVINRP  840    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 840 bits (2171), Expect = 0.0
Identities = 301/1036 (29%), Positives = 514/1036 (49%), Gaps = 29/1036 (2%)

Query: 4 SRPFILRPVATTLLMVAILLSGLIAYRFLPISALPEVDYPTIQVVTLYPGASPEIMTSSI 63
+ FI RP+ +L + ++++G +A LP++ P + P + V YPGA + + ++
Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61

Query: 64 TAPLENQLGQIPGLNEMSSSS-SGGASVITLQFSLQSNLDVAEQEVQAAINAAQSLLPND 122
T +E + I L MSS+S S G+ ITL F ++ D+A+ +VQ + A LLP +
Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121

Query: 123 LPNQPVFSKVNPADAPILTLAVMSDG--MPLPQIQDLVDTRLAQKISQISGVGLVSISGG 180
+ Q + S + + ++ +SD I D V + + +S+++GVG V + G
Sbjct: 122 VQQQGI-SVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 181 QRPAVRVRANPTALAAAGLSLEDLRSTVTSNNLNGPKGSFDGPTRAS------TLDANDQ 234
Q A+R+ + L L+ D+ + + N G G ++ A +
Sbjct: 181 QY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 235 LRSADAYRDLII-AYKNGSPLRIRDVASVEDDAENVRLAAWANNLPAVVLNIQRQPGANV 293
++ + + + + +GS +R++DVA VE EN + A N PA L I+ GAN
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 294 IEVVDRIKALLPQLQSTLPGNLDVQVLTDRTTTIRASVKDVQFELALAVALVVMVTFLFL 353
++ IKA L +LQ P + V D T ++ S+ +V L A+ LV +V +LFL
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 354 RNVYATLIPSFAVPLSLIGTFGVMYLSGFSINNLTLMALTIATGFVVDDAIVMVENIARY 413
+N+ ATLIP+ AVP+ L+GTF ++ G+SIN LT+ + +A G +VDDAIV+VEN+ R
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 414 L-EQGDSPLEAALKGSKQIGFTIISLTFSLIAVLIPLLFMGDVAGRLFREFAITLAVAIL 472
+ E P EA K QI ++ + L AV IP+ F G G ++R+F+IT+ A+
Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479

Query: 473 ISGFVSLTLTPMLSAKLLRHIDEDQQ---GRFARAAGRVIDGLIAQYAKALRVVLRHQPL 529
+S V+L LTP L A LL+ + + G F D + Y ++ +L
Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 530 TLLVAIATLALTALLYLAMPKGFFPVQDTGVIQGVAEAPQSISFQAMSERQRALAEVVLK 589
LL+ +A +L+L +P F P +D GV + + P + + + + + LK
Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 590 DPA--VASLSSYIGVDGSNPTLNTGRLLINLKPHSERDV---TASEVIQRLQPELDHLPG 644
+ V S+ + G S N G ++LKP ER+ +A VI R + EL +
Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659

Query: 645 IKLYMQPVQDLTIEDRVARTQYQFTLQD---ADPDVLAEWVPKLVARLQELP-QLADVAS 700
+ P I + T + F L D D L + +L+ + P L V
Sbjct: 660 GFVI--PFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRP 717

Query: 701 DWQDKGLQAYLNIDRDTASRLGVKLSDIDSVLYNAFGQRLISTIFTQATQYRVVLEVAPQ 760
+ + Q L +D++ A LGV LSDI+ + A G ++ + ++ ++ +
Sbjct: 718 NGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAK 777

Query: 761 FQLGPQALEQLYVPSSDGTQVRLSSLAKVEERHTLLAINHIAQFPSATLSFNLAKGYSLG 820
F++ P+ +++LYV S++G V S+ + + PS + A G S G
Sbjct: 778 FRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSG 837

Query: 821 EAVEAIRGVEASLELPLSMQGSFRGAALAFEASLSNTLLLILASVVTMYIVLGILYESFI 880
+A+ + + + +LP + + G + S + L+ S V +++ L LYES+
Sbjct: 838 DAMALMENLAS--KLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWS 895

Query: 881 HPVTILSTLPSAGVGALLALMLAGQEIGIVAIIGIILLIGIVKKNAIMMIDFALDAERNE 940
PV+++ +P VG LLA L Q+ + ++G++ IG+ KNAI++++FA D E
Sbjct: 896 IPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKE 955

Query: 941 GKPPHEAIYQACLLRFRPILMTTMAALLGALPLMLAGGAGAELRQPLGITMVGGLLLSQV 1000
GK EA A +R RPILMT++A +LG LPL ++ GAG+ + +GI ++GG++ + +
Sbjct: 956 GKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATL 1015

Query: 1001 LTLFTTPVIYLYFDRL 1016
L +F PV ++ R
Sbjct: 1016 LAIFFVPVFFVVIRRC 1031


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
HWH78_RS12740  RTXTOXIND  44     5e-07    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 44.4 bits (105), Expect = 5e-07
Identities = 31/172 (18%), Positives = 65/172 (37%), Gaps = 16/172 (9%)

Query: 124 TYKAALAQAEGTLMQNQAQLKNAEIDLQRYKGLYAEDSIAKQTLDTQEAQVRQLQGTIRT 183
L + L Q ++++ +A+ + Q L+ + + ++RQ I
Sbjct: 263 EAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILD---------KLRQTTDNIGL 313

Query: 184 NQGQVDDARLNLTFTEVRAPISGR-LGLRQVDIGNLVTSGDTTPLVVITQVKPISVVFSL 242
++ + +RAP+S + L+ G +VT+ +T +V++ + + V +
Sbjct: 314 LTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETL-MVIVPEDDTLEVTALV 372

Query: 243 PQQQIGTVVEQMNGPGQLAVTALDRNQDKVLAEGTLT--TLDNQIDSTTGTV 292
+ IG + + V A + L G + LD D G V
Sbjct: 373 QNKDIGFINVGQ--NAIIKVEAFPYTRYGYL-VGKVKNINLDAIEDQRLGLV 421



Score = 41.4 bits (97), Expect = 5e-06
Identities = 26/125 (20%), Positives = 49/125 (39%), Gaps = 8/125 (6%)

Query: 80 ALGTVTAF-NTVNVKPRVNGELVKVLFQEGQEVKAGDLLAVVDPRTYKAALAQAEGTLMQ 138
A G +T + +KP N + +++ +EG+ V+ GD+L + +A + + +L+Q
Sbjct: 86 ANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQ 145

Query: 139 NQAQL--KNAEIDLQRYKGLYAEDSIAKQTLDT-QEAQVRQLQGTIR----TNQGQVDDA 191
+ + L + E +V +L I+ T Q Q
Sbjct: 146 ARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQK 205

Query: 192 RLNLT 196
LNL
Sbjct: 206 ELNLD 210


106  HWH78_RS13185  HWH78_RS13230  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS13185  3       15       2.378464   P-type conjugative transfer protein TrbL
HWH78_RS13190  2       14       0.803874   conjugal transfer protein TrbF
HWH78_RS13195  3       18       -1.087967  P-type conjugative transfer protein TrbG
HWH78_RS13205  -1      20       -2.439885  TrbI/VirB10 family protein
HWH78_RS13210  0       19       -3.159641  DUF2274 domain-containing protein
HWH78_RS13215  -1      18       -3.275699  hypothetical protein
HWH78_RS13220  -1      15       -2.863744  hypothetical protein
HWH78_RS13225  -1      12       -1.982295  two-component sensor histidine kinase
HWH78_RS13230  -1      9        -1.935536  two-component system response regulator
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS13190  PRTACTNFAMLY  29     0.042    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 29.3 bits (65), Expect = 0.042
Identities = 35/134 (26%), Positives = 43/134 (32%), Gaps = 7/134 (5%)

Query: 280 ALGAAGTAVAVGAAATGVGGAVAAGARMAPVAAKMAASGARTAASTAGSARSAFQAGSAA 339
A GA +GA+ + G G R A VAA GA A R AG A
Sbjct: 213 ASGAPAAVSVLGASELTLDGGHITGGRAAGVAAM---QGAVVHLQRATIRRGDAPAGGAV 269

Query: 340 AGGGAKGAMAGLGNVAKSGAQAVGQKAA--AGARSLKERAAAAFRSEGAGSASS-GSGGA 396
GG G A G G V + S E A + + G+A G G
Sbjct: 270 PGGAVPGG-AVPGGFGPGGFGPVLDGWYGVDVSGSSVELAQSIVEAPELGAAIRVGRGAR 328

Query: 397 AAGAATPGSEATAN 410
+ S N
Sbjct: 329 VTVSGGSLSAPHGN 342


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS13195  PF04335  56     8e-12    VirB8 type IV secretion protein
		>PF04335#VirB8 type IV secretion protein

Length = 227

Score = 56.4 bits (136), Expect = 8e-12
Identities = 42/207 (20%), Positives = 69/207 (33%), Gaps = 12/207 (5%)

Query: 20 YQAAAQVWD-ERIGSARVQAKNWRLMAFGCLVLALLMAGGLVWRSAQSIVTPYVVEVDKS 78
Y A W+ +++ +A K ++A LA + + V PYV+ VD++
Sbjct: 13 YFEEAASWERDKLAAAERSKKLAWVVAGVAGALATAGVVAVAALTPLKTVEPYVITVDRN 72

Query: 79 GQVRAVGE---AATPYRPADAQIAYHLAHFVTLVRSLSIDPIVVRQNWLDAYDYATDRGA 135
++ +A Y LA +V + + +
Sbjct: 73 TGEASIAAKLHGDATITYDEAVRKYFLATYVRYRE--GWIAAAREEYFDAVMVMSARPEQ 130

Query: 136 A-VLNDYASKN--DPFARIGKE-SVTVQITSVVRASESSFNVRWTEQRFVNGAPAGTERW 191
Y + N P + V V+I V + V +T + V G+ +
Sbjct: 131 DRWSRFYKTDNPQSPQNILANRTDVFVEIKRVSFLGGNVAQVYFT-KESVTGSNSTKTDA 189

Query: 192 NAVISI-VLQTPRTEQRLRKNPLGIYV 217
A I V TP E KNPLG V
Sbjct: 190 VATIKYKVDGTPSKEVDRFKNPLGYQV 216


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS13200  PF03544  31     0.006    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 30.7 bits (69), Expect = 0.006
Identities = 19/81 (23%), Positives = 31/81 (38%), Gaps = 3/81 (3%)

Query: 23 QGKPPPRISLDEPVQAQPLPEPPKPVEVV---AVPKVLPMPAQMKPLPEADDAKPTPEPA 79
+PPP ++ + +P+PEPPK VV PK P P +K + + E
Sbjct: 65 AVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESR 124

Query: 80 DETVRVSRANAEARIAPTREG 100
+ + A A +
Sbjct: 125 PASPFENTAPARPTSSTATAA 145



Score = 29.6 bits (66), Expect = 0.014
Identities = 15/73 (20%), Positives = 22/73 (30%)

Query: 26 PPPRISLDEPVQAQPLPEPPKPVEVVAVPKVLPMPAQMKPLPEADDAKPTPEPADETVRV 85
PP + +P PEP E V+ + KP P+ K +P + V
Sbjct: 62 PPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPV 121

Query: 86 SRANAEARIAPTR 98
A
Sbjct: 122 ESRPASPFENTAP 134


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS13225  PF06580  35     5e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 34.8 bits (80), Expect = 5e-04
Identities = 29/188 (15%), Positives = 63/188 (33%), Gaps = 44/188 (23%)

Query: 288 REGIGRVRKIVQDLKNFSR-VDAEDDWQWTDLHQGIESTLNIVASE-------LKYRADV 339
E + R+++ L R + + L + + + L++ +
Sbjct: 187 LEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQI 246

Query: 340 VREYGDLPEVKCLPSQINQVVMNLVMNAAQ-AMGPER--GRIVIRTGHTVEHAWIEVEDS 396
D+ +P + Q LV N + + G+I+++ +EVE++
Sbjct: 247 NPAIMDVQ----VPPMLVQT---LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENT 299

Query: 397 GQGISPEILPRIFDPFFTTKPVGKGTGLGLS-------LSYGIVQKHGGTIEVRSQPGVG 449
G K + TG GL + YG + I++ + G
Sbjct: 300 GSLA--------------LKNTKESTGTGLQNVRERLQMLYG--TEAQ--IKLSEKQGKV 341

Query: 450 SAFRIVLP 457
+A +++P
Sbjct: 342 NA-MVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
HWH78_RS13230  HTHFIS  106    6e-27    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 106 bits (265), Expect = 6e-27
Identities = 32/154 (20%), Positives = 69/154 (44%), Gaps = 5/154 (3%)

Query: 14 RFSVLLVDDEPLILSSLRRLLRNQPYDLLLAESGEQALQLLESRPVDLVVSDARMPNMDG 73
++L+ DD+ I + L + L YD+ + + + + + DLVV+D MP+ +
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 74 AALLAEIHRRSPETIRILLTGHADLPTIAKAINEGRIHHYLSKPWNDDELLLTLRQSLEY 133
LL I + P+ ++++ T KA +G + YL KP++ EL+ + ++L
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKG-AYDYLPKPFDLTELIGIIGRALAE 121

Query: 134 LHSERERRRLERLTQE----QNDRLQQLNATLEK 163
+ + ++ +Q++ L +
Sbjct: 122 PKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLAR 155


107  HWH78_RS13340  HWH78_RS13380  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS13340  0       13       -2.506448  polyamine ABC transporter substrate-binding
HWH78_RS13345  1       14       -1.489770  quorum threshold expression protein QteE
HWH78_RS13350  1       16       -1.144629  transporter
HWH78_RS13355  0       13       -0.899109  efflux transporter outer membrane subunit
HWH78_RS13360  0       17       -1.219346  ABC transporter permease
HWH78_RS13365  0       19       -1.182674  ABC transporter permease
HWH78_RS13370  -1      19       -0.963855  ATP-binding cassette domain-containing protein
HWH78_RS13375  1       23       -2.008800  HlyD family efflux transporter periplasmic
HWH78_RS13380  -1      22       -1.223107  CerR family C-terminal domain-containing
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
HWH78_RS13340  MALTOSEBP  33     0.002    Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 33.2 bits (75), Expect = 0.002
Identities = 31/103 (30%), Positives = 50/103 (48%), Gaps = 9/103 (8%)

Query: 7 LALSILTTIGATAADSAWAAQTSVHLYNW------YDFIAPETPKAFQKETGTRVVLDTF 60
LALS LTT+ +A SA A L W Y+ +A E K F+K+TG +V ++
Sbjct: 10 LALSALTTMMFSA--SALAKIEEGKLVIWINGDKGYNGLA-EVGKKFEKDTGIKVTVEHP 66

Query: 61 DSAETAQGKLMVGRSGYDVVVITSNILPGLIKAGVLQELDRDR 103
D E ++ G D++ + G ++G+L E+ D+
Sbjct: 67 DKLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEITPDK 109


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS13360  ABC2TRNSPORT  60     2e-12    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 59.9 bits (145), Expect = 2e-12
Identities = 43/192 (22%), Positives = 81/192 (42%), Gaps = 3/192 (1%)

Query: 182 GLIAALSMI-QTLMLAALSVAREREQGTFDQLLVTPLTPLEILIGKAVPSVLIGLLQSTL 240
G++A +M T + R Q T++ +L T L +I++G+ + L
Sbjct: 72 GMVATSAMTAATFETIYAAFGRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAG 131

Query: 241 ILAIGLFWFRIPMAGSLLDLYLGLLVFTTACVGIGLSVSALSANMQQAMVYTFVIMMPLI 300
I + SLL + + A +G+ V+AL+ + + Y +++ P++
Sbjct: 132 IGVVAAALGYTQWL-SLLYALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPIL 190

Query: 301 LLSGLITPIHSMPETLQILTYADPLRFAIDLVRRVYLEGASLADISGNFIPMLSVAAVTL 360
LSG + P+ +P Q PL +IDL+R + L D+ + + +
Sbjct: 191 FLSGAVFPVDQLPIVFQTAARFLPLSHSIDLIRPIMLGHPV-VDVCQHVGALCIYIVIPF 249

Query: 361 PLAAWLFRNRLV 372
L+ L R RL+
Sbjct: 250 FLSTALLRRRLL 261


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS13365  ABC2TRNSPORT  40     9e-06    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 39.9 bits (93), Expect = 9e-06
Identities = 39/175 (22%), Positives = 77/175 (44%), Gaps = 13/175 (7%)

Query: 205 AREWERGTLESLFVTPVRSAEILLAKIIPYFLVGLL-GLSMCLVSARLLFRVPIQGSLVL 263
R + T E++ T +R +I+L ++ L G + +V+A L + Q +L
Sbjct: 92 GRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAGIGVVAAALGY---TQWLSLL 148

Query: 264 LLFSSMLY---LFVTLGIGLLISARTRNQFLASQVAILSSFLPALMLSGFLFDLRNVPTF 320
+ F +LG+ + A + + F+ Q +++ P L LSG +F + +P
Sbjct: 149 YALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVIT---PILFLSGAVFPVDQLPIV 205

Query: 321 IRLVGSILPATYFMELVKTLFLAGDNWHLALKNLLILAGYAV--FLLNAARLCTR 373
+ LP ++ ++L++ + L + ++ L Y V F L+ A L R
Sbjct: 206 FQTAARFLPLSHSIDLIRPIMLGHPVVDVCQ-HVGALCIYIVIPFFLSTALLRRR 259


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
HWH78_RS13375  RTXTOXIND  63     3e-13    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 63.3 bits (154), Expect = 3e-13
Identities = 47/296 (15%), Positives = 100/296 (33%), Gaps = 38/296 (12%)

Query: 45 SLAFENSERIVALHAEEGDKVRAGQVLAELDTRTLRLRIETARARIGVQEQVLLRLR--- 101
+++ E R+ +L E+ + + EL+ R T ARI E + +
Sbjct: 178 NVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRL 237

Query: 102 NGTRPQEVDQSRARLQAAQAEAELARLELVRLQRIAGGTDGKGVSRQSLDAAAARLKVAR 161
+ Q+ A+ + E + + + +A ++
Sbjct: 238 DDFSSLLHKQAIAKHAVLEQENKYVEAV----NELRVYKSQLEQIESEILSAKEEYQLVT 293

Query: 162 AEVENQRKAWQLADIGPRDEDIAQAQAELEVARADLRLLEHYLSQAQLKAPRDATVRT-R 220
+N+ L + ++I EL + ++AP V+ +
Sbjct: 294 QLFKNE----ILDKLRQTTDNIGLLTLELAKNEERQQASV-------IRAPVSVKVQQLK 342

Query: 221 LLEPGDMASPSRPVFALALTD-PKWVRAYVNERQLGRVHPGQKARVVTDSVPER---PVD 276
+ G + + + + + D V A V + +G ++ GQ A + ++ P +
Sbjct: 343 VHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLV 402

Query: 277 GRVGYISSVAEFTPKTVETEDLRTSLVYEIRVLLDDPE-------DRLRLGMPATV 325
G+V I+ A ED R LV+ + + +++ L GM T
Sbjct: 403 GKVKNINLDA--------IEDQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTA 450


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS13380  HTHTETR  64     8e-15    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 64.3 bits (156), Expect = 8e-15
Identities = 33/140 (23%), Positives = 55/140 (39%), Gaps = 9/140 (6%)

Query: 6 RTPRSDGESTRARILEVAGRLFAQHGYANTASKAICEEAGADLAAINYHFGSRDALYKAV 65
R + + + TR IL+VA RLF+Q G ++T+ I + AG AI +HF + L+ +
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 66 LVEGHKQFVSLHDLRELADSALPPETKLERFINAIVSRLLDDRSWQSKVCAREILAPTAH 125
L EL A P L ++ L + + + EI+
Sbjct: 63 WELSESNIGEL----ELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEII----- 113

Query: 126 FASLIREEVMPKFEALERII 145
F M + +R +
Sbjct: 114 FHKCEFVGEMAVVQQAQRNL 133


108  HWH78_RS13750  HWH78_RS13775  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias   Product
HWH78_RS13750  0       11       1.497722  HAMP domain-containing protein
HWH78_RS13755  1       13       2.815952  hypothetical protein
HWH78_RS13760  0       13       2.278644  two-component system sensor histidine kinase
HWH78_RS13765  0       16       2.305387  two-component system response regulator CarR
HWH78_RS13770  1       11       2.400170  PepSY domain-containing protein
HWH78_RS13775  1       10       2.458993  PepSY domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
HWH78_RS13750  FLAGELLIN  30     0.041    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 30.0 bits (67), Expect = 0.041
Identities = 17/98 (17%), Positives = 39/98 (39%), Gaps = 3/98 (3%)

Query: 438 DVKVSVRDARSTADQSAAISSQTSAGMQQQFREIDQVATASHEMTATAQDVARSAAQAAD 497
K+S +A + + I+ + + +A + + TA V+ + A
Sbjct: 352 SAKLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAA 411

Query: 498 AARGADQATRDGLALIDRTTQSIDSLAANLTSAMGQVE 535
AA+ + LA ID +D++ ++L + + +
Sbjct: 412 AAKKSTANP---LASIDSALSKVDAVRSSLGAIQNRFD 446


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS13760  PF06580  38     5e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 38.3 bits (89), Expect = 5e-05
Identities = 44/263 (16%), Positives = 87/263 (33%), Gaps = 71/263 (26%)

Query: 187 RLQIAQLQQGQRSQLDNQAPEELEPLVEQIN-HLLAHTEETLKRSRNALGNLGHALKTPL 245
+ A++ Q + + + +A +L L QIN H + + + L + A +
Sbjct: 143 NYKQAEIDQWKMASMAQEA--QLMALKAQINPHFMFNALNNI--RALILEDPTKARE--- 195

Query: 246 AVLVSLAE--REEMARQPELQQVLREQLEQIQQRLGRELGKARLVGEALPGAHFDCAEEL 303
+L SL+E R + Q L ++L + L +L +
Sbjct: 196 -MLTSLSELMRYSLRYSNARQVSLADELTVVDSYL--QLASIQ----------------- 235

Query: 304 PSLCDTLRLIHGPHLQVSWSAPPGL---RLPWDREDLLEMLGNLLDNACKWA------DS 354
LQ P + ++P +L + L++N K
Sbjct: 236 ----------FEDRLQFENQINPAIMDVQVP----PML--VQTLVENGIKHGIAQLPQGG 279

Query: 355 EVRLTVAQGEGMVRLKVDDDGPGILPDQRQAVLERGTRLDEQVSGHGLGLGIARD-IAEA 413
++ L + G V L+V++ G L + ++ G GL R+ +
Sbjct: 280 KILLKGTKDNGTVTLEVENTGSLALKNTKE--------------STGTGLQNVRERLQML 325

Query: 414 CGGRLSLE-DSPLGGLRVSVELP 435
G ++ G + V +P
Sbjct: 326 YGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
HWH78_RS13765  HTHFIS  78     9e-19    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 77.6 bits (191), Expect = 9e-19
Identities = 31/117 (26%), Positives = 54/117 (46%)

Query: 2 RLLLVEDHVPLADELMASLTRQGYAVDWLADGRDAAVQGASEPYDLIILDLGLPGRPGLE 61
+L+ +D + L +L+R GY V ++ A+ DL++ D+ +P +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 ILQEWRGLGLATPVLILTARGSWAERIDGLKAGADDYLTKPFHPEELALRIQALLRR 118
+L + PVL+++A+ ++ I + GA DYL KPF EL I L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
HWH78_RS13775  THERMOLYSIN  27     0.010    Thermolysin metalloprotease (M4) family signature.
		>THERMOLYSIN#Thermolysin metalloprotease (M4) family signature.

Length = 544

Score = 27.3 bits (60), Expect = 0.010
Identities = 21/83 (25%), Positives = 32/83 (38%), Gaps = 6/83 (7%)

Query: 21 QARDLGPDEALKLRDAGTIKSFEELNKNAIAKHPGSSVHDTELE----EEYGRYIYQVEL 76
R L + A+ ++ A I + ++ + T L EE R Y+V +
Sbjct: 128 DKRTLKTEAAISIQQAEMIAKQDVADRVTKERPAAEEGKPTRLVIYPDEETPRLAYEVNV 187

Query: 77 R--DPQGVKWDLELDAATGAVLK 97
R P W +DAA G VL
Sbjct: 188 RFLTPVPGNWIYMIDAADGKVLN 210


109  HWH78_RS13815  HWH78_RS13870  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS13815  1       30       -4.791566  DNA binding protein
HWH78_RS13820  1       32       -3.722722  hypothetical protein
HWH78_RS13825  1       37       -2.453941  hypothetical protein
HWH78_RS13830  2       40       -1.798789  hypothetical protein
HWH78_RS13835  1       38       -1.278041  general secretion pathway protein GspK
HWH78_RS13840  1       35       -1.250210  type II secretion system GspH family protein
HWH78_RS13845  1       30       -1.773332  prepilin-type N-terminal cleavage/methylation
HWH78_RS13850  -1      29       -2.860194  prepilin-type N-terminal cleavage/methylation
HWH78_RS13855  -1      25       -2.309307  type II secretion system major pseudopilin GspG
HWH78_RS13860  -1      20       -1.052528  type II secretion system F family protein
HWH78_RS13865  -1      17       -0.031321  GspE/PulE family protein
HWH78_RS13870  -1      12       0.512965   ABC transporter permease
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS13820  PYOCINKILLER  29     0.005    Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 28.6 bits (63), Expect = 0.005
Identities = 24/84 (28%), Positives = 33/84 (39%), Gaps = 10/84 (11%)

Query: 21 LEKLKSDSSLKQELEFKDKLQALMDKYGMTLHNIIAILDPKAPVTVSAAPQRRA------ 74
+E L + ++K E LQ M+ +I A KA +A +R+A
Sbjct: 181 MEGLTAAYNVKLFTEAISSLQIRMNTLTAAKASIEAAAANKAREQAAAEAKRKAEEQARQ 240

Query: 75 ----RALKVYKNPNNGEVVETKGG 94
RA Y P NG VV T G
Sbjct: 241 QAAIRAANTYAMPANGSVVATAAG 264


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS13845  BCTERIALGSPH  30     0.004    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 29.9 bits (67), Expect = 0.004
Identities = 14/48 (29%), Positives = 30/48 (62%), Gaps = 2/48 (4%)

Query: 7 KQGAFTLLEMIVVLLVVSFIGTLLMQGLSYASKANQSLHQSLGRGQVR 54
+Q FTLLEM+++LL++ +++ L++ + + S Q+L R + +
Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGMVL--LAFPASRDDSAAQTLARFEAQ 47


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS13850  PilS_PF08805  30     0.003    PilS N terminal
		>PilS_PF08805#PilS N terminal

Length = 185

Score = 29.5 bits (66), Expect = 0.003
Identities = 5/31 (16%), Positives = 16/31 (51%)

Query: 7 GFTLLEAVVALTLLAVVGGALFAWLNSAFRS 37
G TL+E ++ + ++ V+ + + + +
Sbjct: 27 GATLMEVLLVVGVIVVLAASAYKLYSMVQSN 57


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS13855  BCTERIALGSPG  39     1e-06    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 38.7 bits (90), Expect = 1e-06
Identities = 17/43 (39%), Positives = 31/43 (72%)

Query: 12 QAAFTLLELLVVLVIVGAIAAVALPGLVRMQETWARRTALDDL 54
Q FTLLE++VV+VI+G +A++ +P L+ +E ++ A+ D+
Sbjct: 7 QRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDI 49


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS13860  BCTERIALGSPG  118    3e-37    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 118 bits (297), Expect = 3e-37
Identities = 47/128 (36%), Positives = 67/128 (52%), Gaps = 1/128 (0%)

Query: 6 QQGFTLLEMIVVLVIIGMLMGLVGPRLFNQADKAKAQTADTQVKMLKGALLTMRLDIGRL 65
Q+GFTLLE++VV+VIIG+L LV P L +KA Q A + + L+ AL +LD
Sbjct: 7 QRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDNHHY 66

Query: 66 PTEEEGLALLNTPPSDERLGAFWHGPYLEGGVPLDPWNRPYLYSDRPSAEQPFTLYSQGA 125
PT +GL L P+ L A ++ +P DPW Y+ + P + L S G
Sbjct: 67 PTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVN-PGEHGAYDLLSAGP 125

Query: 126 DGQPGGKG 133
DG+ G +
Sbjct: 126 DGEMGTED 133


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS13865  BCTERIALGSPF  185    9e-57    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 185 bits (471), Expect = 9e-57
Identities = 104/404 (25%), Positives = 187/404 (46%), Gaps = 11/404 (2%)

Query: 2 NFIYQAVDRKGRRVRGELCLPTRQDALRQLQRQGLTPLSLEVKR----------RNLGSR 51
+ YQA+D +G++ RG + + A + L+ +GL PLS++ R +L +
Sbjct: 3 QYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLRRK 62

Query: 52 RRLKAEELNMAIHELATMLAAGVSMADAVEAQERGARHPKLITALQAMANGLRQGQSFPV 111
RL +L + +LAT++AA + + +A++A + + P L + A+ + + +G S
Sbjct: 63 IRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLAD 122

Query: 112 VLESAGLDLPRYVYQLVAAGEMTGNLAGALRDCATQMEYERRTRAELQGALIYPAILVLS 171
++ R +VAAGE +G+L L A E ++ R+ +Q A+IYP +L +
Sbjct: 123 AMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLTVV 182

Query: 172 GVLAVATLFVFVVPKFANLLNET-AQLPWLAWAVLSIGVWSNESSGLLAFAVLLLAGGIA 230
+ V+ L VVPK LP ++ + + A+L
Sbjct: 183 AIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMAFR 242

Query: 231 VALRNPALRAHALDQLVRLPVVGEWLMQAEIAQWSKVLGTLLGNRVPLVEALLLSAAGVR 290
V LR R +L+ LP++G A++++ L L + VPL++A+ +S +
Sbjct: 243 VMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDVMS 302

Query: 291 IARQRRTLERVTQDVRAGIALSAALEERQAVTSIGSSLVRVGEASGQLAEMLQSLATLYG 350
R L T VR G++L ALE+ + ++ GE SG+L ML+ A
Sbjct: 303 NDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADNQD 362

Query: 351 EAGQARMKKALVLIEPLAILLIGSVFGLIITGVVLAITSANDMV 394
++M AL L EPL ++ + +V I+ ++ I N ++
Sbjct: 363 REFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS13875  ABC2TRNSPORT  33     0.001    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 32.6 bits (74), Expect = 0.001
Identities = 22/121 (18%), Positives = 42/121 (34%)

Query: 116 LTPLLAAFFNAMLGYLVLCIFLLFSGVEPGWQLVLLPLALLPFLLCVTGLAWFLAGLGVY 175
L + A A L + + G L+ + L L + L
Sbjct: 115 LGEMAWAATKAALAGAGIGVVAAALGYTQWLSLLYALPVIALTGLAFASLGMVVTALAPS 174

Query: 176 VRDIGQFVQFLLVLLLFISPVFYPLSFLPPVMQPYLYLNPLTIPVEMVRAILFDAPYPTL 235
+ ++ +LF+S +P+ LP V Q PL+ ++++R I+ P +
Sbjct: 175 YDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQTAARFLPLSHSIDLIRPIMLGHPVVDV 234

Query: 236 G 236

Sbjct: 235 C 235


110  HWH78_RS14655  HWH78_RS14690  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS14655  -1      18       -3.737865  MFS transporter
HWH78_RS14660  -2      15       -3.092565  cysteine hydrolase
HWH78_RS14665  -1      10       -2.648366  tRNA dihydrouridine(20/20a) synthase DusA
HWH78_RS14670  0       14       -3.488906  transaldolase
HWH78_RS14675  -1      14       -3.254189  STAS domain-containing protein
HWH78_RS14680  -1      13       -1.419929  SpoIIE family protein phosphatase
HWH78_RS14685  -1      14       0.042228   cyclic di-GMP-binding protein PilZ
HWH78_RS14690  -1      15       -0.395759  VacJ family lipoprotein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS14655  TCRTETA  103    1e-26    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 103 bits (259), Expect = 1e-26
Identities = 85/362 (23%), Positives = 148/362 (40%), Gaps = 13/362 (3%)

Query: 15 RNLAVCLFGAFTTVFAMTLILPFLPVYIGQLGVSGHAAIVQWSGIAYAATFVTAGLVAPL 74
R L V L + LI+P LP + L S GI A + AP+
Sbjct: 5 RPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVT--AHYGILLALYALMQFACAPV 62

Query: 75 WGMLGDRYGRKPMLVRASLGMAITMLLMGLASDIWQFIGLRLLAGIAGGYSSGATILVAV 134
G L DR+GR+P+L+ + G A+ +M A +W R++AGI G + A +A
Sbjct: 63 LGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIAD 122

Query: 135 QAPKSRSAWALGLLSSGVMAGNLLGPLVGGFLPPLIGIRATFWGASGLIFIAFVFTTFML 194
A G +S+ G + GP++GG + A F+ A+ L + F+ F+L
Sbjct: 123 ITDGDERARHFGFMSACFGFGMVAGPVLGGLMGG-FSPHAPFFAAAALNGLNFLTGCFLL 181

Query: 195 RETA---RPSVPAVEEPPKVGWSNMPNKAVILAMLATGLLLMIANMSVEPIITVYIETLL 251
E+ R + P + V+ A++A ++ + + ++ E
Sbjct: 182 PESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRF 241

Query: 252 EDSRRVTSVAGLAMSA-AALGSIISATYLGKVADRIGYGVIMIAALSVAAVLLIPQAFVY 310
+ G++++A L S+ A G VA R+G ++ + I AF
Sbjct: 242 HWD---ATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFAT 298

Query: 311 AGWQLIALRFLMGMALGGL-LPCMAAVIRHSVPERFVGSAMGWSLSAQFAGQVIGPVIGG 369
GW + L +A GG+ +P + A++ V E G G + ++GP++
Sbjct: 299 RGWMAFPIMVL--LASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFT 356

Query: 370 FV 371
+
Sbjct: 357 AI 358



Score = 51.4 bits (123), Expect = 3e-09
Identities = 43/177 (24%), Positives = 73/177 (41%), Gaps = 4/177 (2%)

Query: 217 PNKAVILAMLATGLLLMIANMSVEPIITVYIETLLEDSRRVTSVAGLAMSAAALGSIISA 276
PN+ +I+ + L + + + P++ + L+ S VT+ G+ ++ AL A
Sbjct: 3 PNRPLIVILSTVALDAVGIGL-IMPVLPGLLRDLVH-SNDVTAHYGILLALYALMQFACA 60

Query: 277 TYLGKVADRIGYGVIMIAALSVAAVLLIPQAFVYAGWQLIALRFLMGMALGGLLPCMAAV 336
LG ++DR G +++ +L+ AAV A W L R + G+ G A
Sbjct: 61 PVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAY 119

Query: 337 IRHSVPERFVGSAMGWSLSAQFAGQVIGPVIGGFVGGQFGMRSVFLVTSLLMLAGAL 393
I G+ + G V GPV+GG + G F + F + L L
Sbjct: 120 IADITDGDERARHFGFMSACFGFGMVAGPVLGGLM-GGFSPHAPFFAAAALNGLNFL 175


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS14660  ISCHRISMTASE  52     3e-10    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 51.6 bits (123), Expect = 3e-10
Identities = 47/189 (24%), Positives = 72/189 (38%), Gaps = 6/189 (3%)

Query: 9 PFNSALLLMDFQEFVLNNFVP-APRAAEVVARTSRFLSRVRQTDMLVVHVTVGCPPDGPP 67
P + LL+ D Q + ++ F A E+ A + ++ Q + VV+ P P
Sbjct: 28 PNRAVLLIHDMQNYFVDAFTAGASPVTELSANIRKLKNQCVQLGIPVVYTAQ--PGSQNP 85

Query: 68 MDRRNRLFRELRESGLIEPGSPGLAITAALQPIESEPVITKSRVGAFTGTELDELLWAHD 127
DR L + GL G I L P + + V+TK R AF T L E++
Sbjct: 86 DDRA--LLTDFWGPGLNS-GPYEEKIITELAPEDDDLVLTKWRYSAFKRTNLLEMMRKEG 142

Query: 128 IETLLIAGATTSGVVLTTVRQAFDLDYQLVVLRDGCVDGDAELHDYLMARVISDHATITE 187
+ L+I G L T +AF D + + D D E H + A
Sbjct: 143 RDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVADFSLEKHQMALEYAAGRCAFTVM 202

Query: 188 IDEVAKTLR 196
D + L+
Sbjct: 203 TDSLLDQLQ 211


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
HWH78_RS14680  HTHFIS  110    7e-29    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 110 bits (277), Expect = 7e-29
Identities = 39/128 (30%), Positives = 60/128 (46%), Gaps = 1/128 (0%)

Query: 6 ATLLIIDDDEVVRESLAAYLEDSNFKVLQALNGLQGLQIFESEQPDLVICDLRMPQIDGL 65
AT+L+ DDD +R L L + + V N + + DLV+ D+ MP +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 66 ELIRRIRQTASETPIIVLSGAGVMSDAVEALRLGAADYLIKPLEDLAVLEHSVRRALDRA 125
+L+ RI++ + P++V+S A++A GA DYL KP DL L + RAL
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPF-DLTELIGIIGRALAEP 122

Query: 126 YLRVENQR 133
R
Sbjct: 123 KRRPSKLE 130


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS14690  VACJLIPOPROT  272    3e-95    VacJ lipoprotein signature.
		>VACJLIPOPROT#VacJ lipoprotein signature.

Length = 251

Score = 272 bits (698), Expect = 3e-95
Identities = 75/231 (32%), Positives = 113/231 (48%), Gaps = 15/231 (6%)

Query: 16 LACASLALAPTLSLAAS--------EEDPWESINRPIFTFN-DTLDTYALKPLAQGYQKV 66
L ++LAL TL + + DP E NR ++ FN + LD Y ++P+A ++
Sbjct: 3 LRLSALALGTTLLVGCASSGTDQQGRSDPLEGFNRTMYNFNFNVLDPYIVRPVAVAWRDY 62

Query: 67 TPNFVQDGVHNFFNNLGDVKNLANNLLQAKFHNAGVDTSRLLFNSTFGLAGLIDVATPMG 126
P ++G+ NF NL + + N LQ + V +R N+ G+ G IDVA
Sbjct: 63 VPQPARNGLSNFTGNLEEPAVMVNYFLQGDPYQGMVHFTRFFLNTILGMGGFIDVAGMAN 122

Query: 127 LQ---RNDEDFGQTLGYWGVGSGPYVMLPFLGPSTLRDAPAKIPDIYVSPYHYMDDVRAR 183
+ FG TLG++GVG GPYV LPF G TLRD + D ++ +
Sbjct: 123 PKLQRTEPHRFGSTLGHYGVGYGPYVQLPFYGSFTLRDDGGDMADALYPVLSWLTWPMSV 182

Query: 184 NVMFGINTVDTRANLLKSEKLI--SGDKYIFIRNAYLQNREFKVKDGEVED 232
+ + ++TRA LL S+ L+ S D YI +R AY Q +F GE++
Sbjct: 183 G-KWTLEGIETRAQLLDSDGLLRQSSDPYIMVREAYFQRHDFIANGGELKP 232


111  HWH78_RS15155  HWH78_RS15250  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias   Product
HWH78_RS15155  2       12       1.832223  response regulator
HWH78_RS15160  0       12       1.608613  CBS domain-containing protein
HWH78_RS15165  -1      13       1.692008  DUF2897 family protein
HWH78_RS15170  -1      14       1.196213  CPBP family intramembrane metalloprotease
HWH78_RS15175  -1      15       1.859630  atu genes transcriptional repressor AtuR
HWH78_RS15180  -1      16       2.581130  DUF1446 domain-containing protein
HWH78_RS15185  -1      17       2.572820  SDR family oxidoreductase
HWH78_RS15190  -2      17       2.054751  geranyl-CoA carboxylase subunit beta
HWH78_RS15195  -2      15       2.008015  citronellyl-CoA dehydrogenase
HWH78_RS15200  1       12       3.307119  isohexenylglutaconyl-CoA hydratase
HWH78_RS15205  0       11       2.792032  geranyl-CoA carboxylase subunit alpha
HWH78_RS15210  -1      11       1.570035  NAD(P)-dependent oxidoreductase
HWH78_RS15215  0       10       1.555813  long-chain-acyl-CoA synthetase
HWH78_RS15220  0       13       2.116554  hypothetical protein
HWH78_RS15225  0       12       2.024288  anti-sigma factor SbrR
HWH78_RS15230  1       13       0.888474  ECF family RNA polymerase sigma factor SbrI
HWH78_RS15235  0       12       0.790268  PLP-dependent aminotransferase family protein
HWH78_RS15240  0       11       3.298571  hypothetical protein
HWH78_RS15245  0       11       2.930734  response regulator transcription factor
HWH78_RS15250  -2      10       3.528362  OmpA family protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
HWH78_RS15155  HTHFIS  98     5e-25    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 97.6 bits (243), Expect = 5e-25
Identities = 29/136 (21%), Positives = 57/136 (41%), Gaps = 2/136 (1%)

Query: 7 RILFVDDEERILRSLAMQF-RRHYEVLTESDPRRALERLKTERIQVLVSDQRMPQMSGAE 65
IL DD+ I L R Y+V S+ + ++V+D MP + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 66 LLAQARERYPETLRILLTGYSDLDAAVDALNDGGIFRYLTKPWNPQEMAFTLRQAAEIAS 125
LL + ++ P+ ++++ + A+ A + G + YL KP++ E+ + +A
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKAS-EKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 126 RQGLPAPPAATLAAPL 141
R+ + PL
Sbjct: 124 RRPSKLEDDSQDGMPL 139



Score = 54.8 bits (132), Expect = 1e-10
Identities = 27/139 (19%), Positives = 55/139 (39%), Gaps = 5/139 (3%)

Query: 142 SVLLLDDDPETLDCVGAFCHAGGHRLLRARNLAEALVWLNTEPVEVLVSDLKLAGEHTAP 201
++L+ DDD + G+ + N A W+ +++V+D+ + E+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 202 LLKSLAQAHPRLLSLVVTPFRDTQALLELINQAQIFRYLPKPIRRGLFEKGLKAAAEQAL 261
LL + +A P L LV++ ++ + + YLPKP L +AL
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKAS-EKGAYDYLPKP----FDLTELIGIIGRAL 119

Query: 262 LWRGRSLPEVDRLAEVPRD 280
R +++ ++
Sbjct: 120 AEPKRRPSKLEDDSQDGMP 138


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS15160  PF06580  46     2e-07    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 45.6 bits (108), Expect = 2e-07
Identities = 35/172 (20%), Positives = 72/172 (41%), Gaps = 24/172 (13%)

Query: 260 QIGELVSGLKDFAR--LDRAFSEEVDLND---CVRNAVLIARTAIKDKAEISSQLGELPL 314
+ E+++ L + R L + + +V L D V + + +A +D+ + +Q+ +
Sbjct: 192 KAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIM 251

Query: 315 IACAPSQINQVLL-NLLTNAAQAMERFGRILLKSWADERQVFLSVQDNGKGMPAEVLGRI 373
P + Q L+ N + + + + G+ILLK D V L V++ G
Sbjct: 252 DVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLA-------- 303

Query: 374 FDPFFTTKPVGQGTGLGLSISYKIIQQHGG---TIRVASEPGRGTRFLISLP 422
K + TG GL + +Q G I+++ + G+ ++ +P
Sbjct: 304 ------LKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVN-AMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS15175  HTHTETR  69     9e-17    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 69.3 bits (169), Expect = 9e-17
Identities = 32/190 (16%), Positives = 65/190 (34%), Gaps = 8/190 (4%)

Query: 23 ESARGKLLQTAAHLFRSKGYERTTVRDLASAVGIQSGSIFHHFKSKDEILRSVMEETILY 82
+ R +L A LF +G T++ ++A A G+ G+I+ HFK K ++ + E +
Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESN 69

Query: 83 NTALMRAALAD-AEDLRERVLGLIRCELQSIMGGTGEAMAVLVYEWRSLSAEGQAYILGL 141
L A D + ++ L+S + + + + + A +
Sbjct: 70 IGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQA 129

Query: 142 RDIYEQMWLD----VLGEARLAGYCQG--DPFILRRFLTGALSWT-TTWFRPEGPMSLDQ 194
+ D L A + G +S W L +
Sbjct: 130 QRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKK 189

Query: 195 LAEEALALVI 204
A + +A+++
Sbjct: 190 EARDYVAILL 199


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS15185  DHBDHDRGNASE  119    3e-34    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 119 bits (299), Expect = 3e-34
Identities = 74/255 (29%), Positives = 121/255 (47%), Gaps = 10/255 (3%)

Query: 13 DGQTIIVTGGGSGIGRCTAHELAALGAHVVLVGRKAEKLEKTAGEIVEDGGSVSWHACDI 72
+G+ +TG GIG A LA+ GAH+ V EKLEK + + D+
Sbjct: 7 EGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADV 66

Query: 73 REEEAVKTLVANILAERGTIHHLVNNAGGQYPSPLASISQKGFETVLRTNLVGGFLVARE 132
R+ A+ + A I E G I LVN AG P + S+S + +E N G F +R
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126

Query: 133 VFNQSMSKTGGSIVNMLADMWGGMP--GMGHSGAARSGMENFTRTAAVEWGHAGVRVNAV 190
V M + GSIV + ++ G+P M ++++ FT+ +E +R N V
Sbjct: 127 VSKYMMDRRSGSIVTVGSNP-AGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 191 APG-------WIASSGMDTYEGAFKAVIPTLREHVPLKRIGSESEVAAAIVFLLSPGAAF 243
+PG W + + E K + T + +PLK++ S++A A++FL+S A
Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGH 245

Query: 244 VSGNTIRIDGAASQG 258
++ + + +DG A+ G
Sbjct: 246 ITMHNLCVDGGATLG 260


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
HWH78_RS15205  RTXTOXIND  38     2e-04    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 37.5 bits (87), Expect = 2e-04
Identities = 25/85 (29%), Positives = 42/85 (49%), Gaps = 6/85 (7%)

Query: 579 AGASAQVGASSGTLK-APMDGAIV-EVLVGEGERVGKGQLLLVLEAMKMEHPLKAGVDGV 636
A A+ ++ S + + P++ +IV E++V EGE V KG +LL L A+ E A
Sbjct: 84 ATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAE----ADTLKT 139

Query: 637 VRRVQVGRGEQVRNRQVLVEVEADA 661
+ R EQ R + + +E +
Sbjct: 140 QSSLLQARLEQTRYQILSRSIELNK 164


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS15210  DHBDHDRGNASE  81     3e-20    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 81.3 bits (200), Expect = 3e-20
Identities = 54/191 (28%), Positives = 85/191 (44%), Gaps = 9/191 (4%)

Query: 3 LHGKTLFITGASRGIGREIALRAARDGANLVIAAKSAEPHPKLEGTIFSVAAEVEAAGGQ 62
+ GK FITGA++GIG +A A GA++ + E K+ ++ + A EA
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEA---- 61

Query: 63 ALPLQLDVRDEQAVAAAMARAAERFGGIDALVNNAGAIRLVGVEKLEPKRFDLMYQINTR 122
DVRD A+ AR G ID LVN AG +R + L + ++ + +N+
Sbjct: 62 ---FPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNST 118

Query: 123 AVLVCSQAALPYLRRSANGHILSLSPPINLAGRWFAQHGPYTVTKYGMSMLTLGMHEEFG 182
V S++ Y+ +G I+++ N AG Y +K M T + E
Sbjct: 119 GVFNASRSVSKYMMDRRSGSIVTVGS--NPAGVPRTSMAAYASSKAAAVMFTKCLGLELA 176

Query: 183 KYAISVNALWP 193
+Y I N + P
Sbjct: 177 EYNIRCNIVSP 187


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
HWH78_RS15220  IGASERPTASE  33     3e-04    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 33.1 bits (75), Expect = 3e-04
Identities = 19/100 (19%), Positives = 29/100 (29%), Gaps = 9/100 (9%)

Query: 4 APKTASKKVAPAAEQVAEPKPPAKPKPAAAPPKPASRPVAKDKPAPAKRASTARLDPEVR 63
PK S+ V+P EQ +P A+P P P ++ V
Sbjct: 1122 VPKVTSQ-VSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVE 1180

Query: 64 KPLPSAKLDLRLPK-------ELVQKMAPPGTEETH-KPK 95
+P+ + P E+ KPK
Sbjct: 1181 QPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPK 1220


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS15225    IGASERPTASE    36    1e-04    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 36.2 bits (83), Expect = 1e-04
Identities = 23/120 (19%), Positives = 41/120 (34%), Gaps = 8/120 (6%)

Query: 112 AAAAKRAMRAPAAPAPLSSEMSEP--PALLASYASSGEAPQLMAEAAPAAPAALADRPPA 169
+ + P + + P P+ A EAP + APA P+ +
Sbjct: 985 VEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAP--VPPPAPATPSETTETVAE 1042

Query: 170 QAAQQAK---VQAALAGDFVAQARGKAVAVKPEVLDEALGAVLALREQGKTEQAATQLAE 226
+ Q++K A + AQ R A K V +A + +T++ T +
Sbjct: 1043 NSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVA-QSGSETKETQTTETK 1101


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS15245    HTHFIS    50    1e-09    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 50.2 bits (120), Expect = 1e-09
Identities = 35/157 (22%), Positives = 61/157 (38%), Gaps = 11/157 (7%)

Query: 10 LVIADSFPVMQWALQRYLSEECGRQVLAVVGDSDSLVERLADLPPESILITELGLPGQRS 69
+++AD ++ L + LS G V + +A +++T++ +P
Sbjct: 6 ILVADDDAAIRTVLNQALSRA-GYDVRITSNAATLW-RWIAAGDG-DLVVTDVVMPD--- 59

Query: 70 RDGIHLVEWLTRHCPQMKVMVYSVFSAPLLAKAVLRSGASAYISKRSPLETLKAALECMA 129
+ L+ + + P + V+V S + + A GA Y+ K L L + A
Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG-RA 118

Query: 130 LGQTFLDPG-LHPQRHTGKPL---SPTEVDILRRLAR 162
L + P L G PL S +I R LAR
Sbjct: 119 LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLAR 155


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS15250    OMPADOMAIN    102    2e-27    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 102 bits (255), Expect = 2e-27
Identities = 44/126 (34%), Positives = 63/126 (50%), Gaps = 11/126 (8%)

Query: 155 DVLFDFNRAELKPAANRTALKLVQFL-QLNPRRV-IRIEGYTDSVGDRQANLDLSRERAQ 212
DVLF+FN+A LKP +L L L+P+ + + GYTD +G N LS RAQ
Sbjct: 220 DVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQ 279

Query: 213 AVADVLADLGVDPARMQVVGYGEAFPVTDNASNRGR---------AQNRRVEIVFSNDKG 263
+V D L G+ ++ G GE+ PVT N + + A +RRVEI K
Sbjct: 280 SVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEVKGIKD 339

Query: 264 QLSAPR 269
++ P+
Sbjct: 340 VVTQPQ 345


112    HWH78_RS15315    HWH78_RS15340    N    Y    N    Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias     Product
HWH78_RS15315  1       13       3.294050    ABC transporter substrate-binding protein
HWH78_RS15320  1       13       3.216820    iron ABC transporter permease
HWH78_RS15325  -1      11       2.701749    MBL fold metallo-hydrolase
HWH78_RS15330  0       9        2.714666    LysE family translocator
HWH78_RS15335  -1      11       2.790075    AraC family transcriptional regulator
HWH78_RS15340  -1      10       2.744329    SDR family oxidoreductase
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS15315    FERRIBNDNGPP    38    2e-05    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 38.4 bits (89), Expect = 2e-05
Identities = 49/266 (18%), Positives = 96/266 (36%), Gaps = 43/266 (16%)

Query: 43 PSRAVSHDINLTEMMVALGLQTRMVGYTGISGW--WKNADPGLIAALKPLPELV-----A 95
P+R V+ + E+++ALG+ G + W + +P PLP+ V
Sbjct: 35 PNRIVALEWLPVELLLALGIVP--YGVADTINYRLWVS-EP-------PLPDSVIDVGLR 84

Query: 96 RYPTAETLLDVDADFFFAGWGYGMRVGGDLTPASLEPLG-VKVYELSESCAQIGEPRRAS 154
P E L ++ F GYG +P L + + + S+ + R++
Sbjct: 85 TEPNLELLTEMKPSFMVWSAGYGP------SPEMLARIAPGRGFNFSDGKQPLAMARKS- 137

Query: 155 LDELYRDLRNLGRIFDVEPRAERLVASLQARIERARAGIPANAEAPRVF--LYDSGEDRP 212
L + + +++ AE +A + I + P + L D
Sbjct: 138 -------LTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLV 190

Query: 213 FTSGRLGMPQALIEAAGGRSVTDDVAASW--TQVNWESVVA-RDPQVIVIVDYGETSAAQ 269
F + Q +++ G + W T V+ + + A +D V+ D+ +
Sbjct: 191 FGPN--SLFQEILDEYGIPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCF-DHDNSKDMD 247

Query: 270 KQRFLEENPALRSLTAIRERRFIVLP 295
L P +++ +R RF +P
Sbjct: 248 A---LMATPLWQAMPFVRAGRFQRVP 270


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS15325    PF05932    27    0.047    Tir chaperone protein (CesT)
		>PF05932#Tir chaperone protein (CesT)

Length = 127

Score = 27.1 bits (60), Expect = 0.047
Identities = 9/53 (16%), Positives = 18/53 (33%), Gaps = 8/53 (15%)

Query: 74 ADHLSAAIFLQRELGGCLAIGARITQVQAKFSGLFNLGEAFPVDGRQFEHLFE 126
L+ A+ G L + + SGL++ ++ P + L
Sbjct: 66 QCLLAGALNPLLNAGPGLGLDEK--------SGLYHAYQSIPREKLSVPTLKR 110


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS15335    PF05272    28    0.034    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.5 bits (63), Expect = 0.034
Identities = 6/16 (37%), Positives = 8/16 (50%)

Query: 258 FRRAYGMTPAAYRRQC 273
+R AYG + RQ
Sbjct: 672 YRGAYGRYVQDHPRQV 687


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS15340    DHBDHDRGNASE    71    1e-16    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 70.9 bits (173), Expect = 1e-16
Identities = 71/260 (27%), Positives = 118/260 (45%), Gaps = 17/260 (6%)

Query: 6 IKGKTVLVTGGAKNLGGLIARDLAAHGAKAIAIHYNSAASKADADATVAALQAAGAKAVA 65
I+GK +TG A+ +G +AR LA+ GA A+ YN + V++L+A A A
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKL----EKVVSSLKAEARHAEA 61

Query: 66 LQGDLTSAAAMEKLFADAIAAVGKPDIAINTVGKVLKKPFTEISEAEYDEMSAVNAKSAF 125
D+ +AA++++ A +G DI +N G + +S+ E++ +VN+ F
Sbjct: 62 FPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVF 121

Query: 126 FFLREAGKHVND--NGKICTLVTSLLGAYTPYYAAYAGTKAPVEHFTRAASKEFGARGIS 183
R K++ D +G I T+ ++ G AAYA +KA FT+ E I
Sbjct: 122 NASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIR 181

Query: 184 VTAVGPGPMDTPFFYPAEGADAVAYHKTAAALSPFSRTGL--------TDIEDVVPFIRH 235
V PG +T + + A +L F +TG+ +DI D V F+
Sbjct: 182 CNIVSPGSTETDMQWSLWADENGAEQVIKGSLETF-KTGIPLKKLAKPSDIADAVLFL-- 238

Query: 236 LVSEGWWITGQTILINGGYT 255
+ + IT + ++GG T
Sbjct: 239 VSGQAGHITMHNLCVDGGAT 258


113    HWH78_RS16360    HWH78_RS16385    N    Y    N    Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias     Product
HWH78_RS16360  1       12       1.512890    VWA domain-containing protein
HWH78_RS16365  0       12       1.049360    BatD family protein
HWH78_RS16370  0       13       0.275313    alpha/beta hydrolase fold domain-containing
HWH78_RS16375  1       14       -0.733100   cationic peptide response regulator
HWH78_RS16380  0       14       -0.842242   cationic peptide sensor histidine kinase CprS
HWH78_RS16385  0       14       -1.131605   RND family transporter
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS16360    TYPE4SSCAGX    37    2e-04    Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein signature.

Length = 522

Score = 37.1 bits (85), Expect = 2e-04
Identities = 39/158 (24%), Positives = 73/158 (46%), Gaps = 13/158 (8%)

Query: 340 LMLSLPQPAMAFQFEDLWLRPDQQGQRLLQRGQADEAAKRFEDFRWKGLSLYQARDYAAA 399
L++ P P + + L +++ + Q+ Q D+ KR E+ R K + +
Sbjct: 131 LIVDAPDPK-ELEEQKKALEKEKEAKEQAQKAQKDKREKRKEE-RAKNRA-----NLENL 183

Query: 400 AQAFAQGDQADDHYNRGNALARQGELEAAVDAYEQALERQPQLVAAQRNK-ALVEELLRQ 458
A + ++ N + +Q E E +D E+ + Q Q AQ N +EEL ++
Sbjct: 184 TNAMSNPQNLSNNKNLSELIKQQRENE--LDQMERLEDMQEQ---AQANALKQIEELNKK 238

Query: 459 RQEQAAQQQAGENKEQRQEASQQSPPSGSSQRPPRDAA 496
+ E+A +Q+A + + + SQ+SP S + P D+A
Sbjct: 239 QAEEAVRQRAKDKISIKTDKSQKSPEDNSIELSPSDSA 276


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS16375    HTHFIS    84    5e-21    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 84.1 bits (208), Expect = 5e-21
Identities = 30/129 (23%), Positives = 59/129 (45%)

Query: 3 IHVLVVEDNFDLAGTVIDYLEAAGVVCDHARDGQAGLNLARANRYDVILLDIMLPRINGR 62
+LV +D+ + + L AG + A D+++ D+++P N
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 63 QVCRQLREAGLQTPVLMLTALDTLQDKLDGFDAGADDYLLKPFELPELLVRLQALSRRRS 122
+ ++++A PVL+++A +T + + GA DYL KPF+L EL+ +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 123 GQAQRLQVD 131
+ +L+ D
Sbjct: 124 RRPSKLEDD 132


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS16380    PF07675    32    0.005    Cleaved Adhesin
		>PF07675#Cleaved Adhesin

Length = 1358

Score = 32.4 bits (73), Expect = 0.005
Identities = 23/88 (26%), Positives = 36/88 (40%), Gaps = 6/88 (6%)

Query: 64 PAPDSYYFKGSVGTAGLPPKLREMLDTPPYKSIGAMQLLGNWDDDDEEEDDDAPSDDAYV 123
PA + G G P + + K M+ G D D E +DD+P+ Y
Sbjct: 480 PASGKMWIAGDGGNQ--PARYDDFAFEAGKKYTFTMRRAGMGDGTDMEVEDDSPASYTYT 537

Query: 124 VVR--QPLADGKTLYLYDND--AAGSID 147
V R + +G T ++ D AAG+ +
Sbjct: 538 VYRDGTKIKEGLTATTFEEDGVAAGNHE 565


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS16385    ACRIFLAVINRP    71    1e-14    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 71.0 bits (174), Expect = 1e-14
Identities = 36/175 (20%), Positives = 77/175 (44%), Gaps = 11/175 (6%)

Query: 613 IEAATNEVIKQSELII-LVLVYICVAAMCMITFRSFAATLCIVLPLILTSVLGNALMAAL 671
++ + +EV+K L ++LV++ + + ++ ATL + + + + A++AA
Sbjct: 333 VQLSIHEVVK--TLFEAIMLVFLVMY----LFLQNMRATLIPTIAVPVVLLGTFAILAAF 386

Query: 672 GIGVKVATLPVIALGVGIGVDYGIYIYTRLESFLRM-GLPLQEAYYETLRSTGKAVLFTG 730
G + T+ + L +G+ VD I + +E + LP +EA +++ A++
Sbjct: 387 GYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIA 446

Query: 731 LCLAIGVATWIF---SAIKFQADMGLMLTFMLLWNMFGALWLLPALARFLINPAK 782
+ L+ F S + + + ++ AL L PAL L+ P
Sbjct: 447 MVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVS 501



Score = 42.9 bits (101), Expect = 6e-06
Identities = 35/221 (15%), Positives = 80/221 (36%), Gaps = 15/221 (6%)

Query: 251 LITLVLLYWFTKCIRSTIAVLITTLVAVLWQLGLLNLVGFGLDPYSMLVPFLIFAIGISH 310
++ +++Y F + +R+T+ I V +L +L G+ ++ +M L + +
Sbjct: 349 MLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDD 408

Query: 311 GVQKINGIA-LQSSGADNALMAARLTFRQLFLPGMIAILADAVGFITLLVID--IGVI-R 366
+ + + + A + Q+ + + + FI + G I R
Sbjct: 409 AIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYR 468

Query: 367 ELAIGASIGVAVIVFTNLILLPVAISYI--GISKKAVQRSKDDAVREHPFWRLLSNFASP 424
+ +I +A+ V LIL P + + +S + + + + N +
Sbjct: 469 QFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTN 528

Query: 425 KVAPV------SIAIALLMLGGGLWYGKHLKIG---DLDQG 456
V + + I L++ G + L + DQG
Sbjct: 529 SVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQG 569



Score = 33.3 bits (76), Expect = 0.005
Identities = 23/113 (20%), Positives = 46/113 (40%), Gaps = 5/113 (4%)

Query: 626 LIILVLVYICVAAMCMITFRSFAATLCIVLPLILTSVLGNALMAALGIGVKVATLPVIAL 685
I V+V++C+AA+ + S++ + ++L + L V V + +
Sbjct: 877 AISFVVVFLCLAAL----YESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLT 932

Query: 686 GVGIGVDYGIYIYTRLESFLRM-GLPLQEAYYETLRSTGKAVLFTGLCLAIGV 737
+G+ I I + + G + EA +R + +L T L +GV
Sbjct: 933 TIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGV 985


114    HWH78_RS16425    HWH78_RS16555    N    Y    Y    Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias     Product
HWH78_RS16425  -1      9        2.423720    metallophosphoesterase
HWH78_RS16430  -1      9        2.364339    NAD(+) kinase
HWH78_RS16435  -2      8        2.889862    DUF1853 family protein
HWH78_RS16440  -2      8        2.759775    pyridoxal-phosphate dependent enzyme
HWH78_RS16445  -1      9        2.788352    hypothetical protein
HWH78_RS16450  -2      11       2.472354    NADPH-dependent 2,4-dienoyl-CoA reductase
HWH78_RS16455  1       11       2.994994    carbon-nitrogen hydrolase family protein
HWH78_RS16460  3       12       3.035416    AraC family transcriptional regulator
HWH78_RS16480  3       17       3.101362    ***GspM family type II secretion system protein
HWH78_RS16485  1       15       2.116554    GspL family type II secretion system protein
HWH78_RS16490  -1      15       1.429579    GspK family T2SS minor pseudopilin variant XcpX
HWH78_RS16495  1       14       2.006455    GspJ family T2SS minor pseudopilin variant XcpW
HWH78_RS16500  1       11       1.549793    GspI family T2SS minor pseudopilin variant XcpV
HWH78_RS16505  0       8        0.627192    GspH family T2SS minor pseudopilin variant XcpU
HWH78_RS16510  0       7        0.618129    GspG family T2SS major pseudopilin variant XcpT
HWH78_RS16515  0       6        1.134249    GspF family T2SS innner membrane protein variant
HWH78_RS16520  1       9        0.218733    GspE family T2SS ATPase variant XcpR
HWH78_RS16525  1       8        -0.343316   type II secretion system protein N
HWH78_RS16530  1       9        0.134238    GspD family T2SS secretin variant XcpQ
HWH78_RS16535  -1      11       0.953447    SDR family oxidoreductase
HWH78_RS16540  -1      13       0.296345    O-succinylhomoserine sulfhydrylase
HWH78_RS16545  0       14       0.056900    amidophosphoribosyltransferase
HWH78_RS16550  0       16       0.765287    CvpA family protein
HWH78_RS16555  0       17       0.774983    SPOR domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS16425    ANTHRAXTOXNA    31    0.008    Anthrax toxin LF subunit signature.
		>ANTHRAXTOXNA#Anthrax toxin LF subunit signature.

Length = 800

Score = 30.9 bits (69), Expect = 0.008
Identities = 13/36 (36%), Positives = 19/36 (52%)

Query: 231 GDIVFQPDALPEAIAREPLSEEQKSSLLTYGADEPL 266
G+I F L E + LSEE+K+S+ + G P
Sbjct: 102 GEIYFTDIDLVEHKELQDLSEEEKNSMNSRGEKVPF 137


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS16430    PF06057    29    0.028    Type IV secretory pathway VirJ component
		>PF06057#Type IV secretory pathway VirJ component

Length = 243

Score = 28.7 bits (64), Expect = 0.028
Identities = 13/58 (22%), Positives = 23/58 (39%), Gaps = 7/58 (12%)

Query: 65 LVVVVGGDGSML----GAARALARHKVPVLGINRGSLG-FLTDIRPDELEAKVGEVLD 117
LV+ + GDG L + PV+G + SL + P ++ ++D
Sbjct: 53 LVIFLSGDGGWATLDKAVGGILQQQGWPVVGWS--SLKYYWKQKDPKDVTQDTLAIID 108


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS16435    FLGFLIJ    29    0.020    Flagellar FliJ protein signature.
		>FLGFLIJ#Flagellar FliJ protein signature.

Length = 147

Score = 28.6 bits (63), Expect = 0.020
Identities = 16/37 (43%), Positives = 23/37 (62%), Gaps = 2/37 (5%)

Query: 36 QRHPLAASRWRQEPERLAAWLREQERQPQHLAAWLAQ 72
Q+ +A + WR++ +RL AW QERQ AA LA+
Sbjct: 92 QKVDIALNSWREKKQRLQAWQTLQERQST--AALLAE 126


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS16445    RTXTOXIND    51    4e-09    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 51.4 bits (123), Expect = 4e-09
Identities = 34/206 (16%), Positives = 67/206 (32%), Gaps = 18/206 (8%)

Query: 165 AAVEPQRLQMAAEEQWYAAGPAAPKAPPAEPPRKQEDEQTARLAQLVKQQRQQLAALARQ 224
A +E R Q+ + P K P + +E+ RL L+K+Q Q
Sbjct: 146 ARLEQTRYQILSRSIELNKLPEL-KLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQ 204

Query: 225 QEQRLAGLARQHEEELARREQDARGQLDILRSEVLSLQQALERQARENAELQQRLLEQGE 284
+E L + + R + +S + + A + +LEQ
Sbjct: 205 KELNLDKKRAE-RLTVLARINRYENLSRVEKSRL----DDFSSLLHKQAIAKHAVLEQ-- 257

Query: 285 QFQRNREELTRQLRFIENQGRNETDLLRSEFADELEARVAAAVAGYKEQVSIRDVELAYR 344
+ E +LR +++ + + SE + +K ++ +L
Sbjct: 258 --ENKYVEAVNELR----VYKSQLEQIESEIL-SAKEEYQLVTQLFKNEILD---KLRQT 307

Query: 345 NELDQQLEQELAELRAERDRLAAQGP 370
+ L ELA+ + + P
Sbjct: 308 TDNIGLLTLELAKNEERQQASVIRAP 333


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS16480    PYOCINKILLER    28    0.026    Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 27.8 bits (61), Expect = 0.026
Identities = 23/113 (20%), Positives = 33/113 (29%), Gaps = 5/113 (4%)

Query: 55 AERHLQSARQYFTEQRALHAYIQQQAPNVRQADAAAPQAQIDPAALQGMVTASAAQAGLS 114
A+R + + RA + Y +V A Q+ A S A A L
Sbjct: 230 AKRKAEEQARQQAAIRAANTYAMPANGSVVATAAGRGLIQVAQGAASLAQAISDAIAVLG 289

Query: 115 VERLDNEGEGAVQVALQPAPFAKLLPWLEQLNGQ-----GVQVAEAGLDRQVD 162
AV A W +Q G+ A+ GL V+
Sbjct: 290 RVLASAPSVMAVGFASLTYSSRTAEQWQDQTPDSVRYALGMDAAKLGLPPSVN 342


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS16495    PilS_PF08805    36    7e-05    PilS N terminal
		>PilS_PF08805#PilS N terminal

Length = 185

Score = 35.7 bits (82), Expect = 7e-05
Identities = 11/45 (24%), Positives = 24/45 (53%)

Query: 1 MRLQRGFTLLELLIAIAIFALLALATYRMFDSVMQTDQATRVQEQ 45
+G TL+E+L+ + + +LA + Y+++ V Q++ Q
Sbjct: 22 KEQDKGATLMEVLLVVGVIVVLAASAYKLYSMVQSNIQSSNEQNN 66


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS16500    BCTERIALGSPG    36    8e-06    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 36.4 bits (84), Expect = 8e-06
Identities = 13/68 (19%), Positives = 30/68 (44%), Gaps = 4/68 (5%)

Query: 1 MRRARGFTLLEVLVALAIF----AMVAASVLSASARSLQNASRLEDKTLAMWIADNRLNE 56
+ RGFTLLE++V + I ++V +++ ++ + + + L + +L+
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDN 63

Query: 57 LQLEQTPP 64
T
Sbjct: 64 HHYPTTNQ 71


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS16505    BCTERIALGSPH    143    3e-46    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 143 bits (361), Expect = 3e-46
Identities = 50/183 (27%), Positives = 85/183 (46%), Gaps = 32/183 (17%)

Query: 5 RGFTLIELMVVMVIISVLIGLAVLSTGFASTSRELDSEAERLAGL---IGVLTDEAVLDN 61
RGFTL+E+M++++++ V G+ +L+ SR+ DS A+ LA + + +
Sbjct: 4 RGFTLLEMMLILLLMGVSAGMVLLAFP---ASRD-DSAAQTLARFEAQLRFVQQRGLQTG 59

Query: 62 REYGLRLERDAYQVLRY------DEAKA-------RWLPVARDSHRLPEWAELTFELDGQ 108
+ +G+ + D +Q L D A A RWLP+ + + G
Sbjct: 60 QFFGVSVHPDRWQFLVLEARDGADPAPADDGWSGYRWLPLRAGR------VATSGSIAGG 113

Query: 109 PLVLAGSKGEKEQKKGTDQPQLLILSSGELSPFRLRLAERGPEGRALSLSSDGFRLPRVE 168
L LA ++GE D P +LI GE++PFRL L E ++ ++ G LP +
Sbjct: 114 KLNLAFAQGEAWTPG--DNPDVLIFPGGEMTPFRLTLG----EAPGIAFNARGESLPEPQ 167

Query: 169 VAR 171
A+
Sbjct: 168 EAQ 170


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS16510    BCTERIALGSPG    212    3e-74    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 212 bits (541), Expect = 3e-74
Identities = 75/138 (54%), Positives = 95/138 (68%), Gaps = 3/138 (2%)

Query: 10 RQQSGFTLIEIMVVVVILGILAALVVPQVMSRPDQAKVTVAKGDIKAIAAALDMYKLDNF 69
+Q GFTL+EIMVV+VI+G+LA+LVVP +M ++A A DI A+ ALDMYKLDN
Sbjct: 5 DKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDNH 64

Query: 70 AYPSTQQGLEALVKKPTGNPQPKNWNKDGYLKKLPVDPWGNPYQYLAPGTKGPFDLYSLG 129
YP+T QGLE+LV+ PT P N+NK+GY+K+LP DPWGN Y + PG G +DL S G
Sbjct: 65 HYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDLLSAG 124

Query: 130 ADGKEGGSDNDADIGNWD 147
DG+ G D DI NW
Sbjct: 125 PDGEMGTED---DITNWG 139


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS16515    BCTERIALGSPF    501    e-180    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 501 bits (1292), Expect = e-180
Identities = 213/406 (52%), Positives = 278/406 (68%), Gaps = 2/406 (0%)

Query: 1 MAAFEYLALDPSGRQQKGVLEADSARQVRQLLRERQLAPLDVKPTRTREQSGQGGRLTFA 60
MA + Y ALD G++ +G EADSARQ RQLLRER L PL V R +Q L+
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 61 RG--LSARDLALVTRQLATLVQAALPIEEALRAAAAQSTSQRIQSMLLAVRAKVLEGHSL 118
R LS DLAL+TRQLATLV A++P+EEAL A A QS + ++ AVR+KV+EGHSL
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 119 AGSLREFPTAFPELYRATVAAGEHAGHLGPVLEQLADYTEQRQQSRQKIQLALLYPVILM 178
A +++ FP +F LY A VAAGE +GHL VL +LADYTEQRQQ R +IQ A++YP +L
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180

Query: 179 VASLAIVGFLLGYVVPDVVRVFIDSGQTLPLLTRVLIGVSDWVKAWGALAFVAAIGGVIG 238
V ++A+V LL VVP VV FI Q LPL TRVL+G+SD V+ +G +A + G +
Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240

Query: 239 FRYALRKDAFRERWHGFLLRVPLVGRLVRSTDTARFASTLAILTRSGVPLVEALAIAAEV 298
FR LR++ R +H LL +PL+GR+ R +TAR+A TL+IL S VPL++A+ I+ +V
Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300

Query: 299 IANRIIRNEVVKAAQKVREGASLTRSLEATGQFPPMMLHMIASGERSGELDQMLARTARN 358
++N R+ + A VREG SL ++LE T FPPMM HMIASGERSGELD ML R A N
Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360

Query: 359 QENDLAAQIGLMVGLFEPFMLIFMGAVVLVIVLAILLPILSLNQLV 404
Q+ + ++Q+ L +GLFEP +++ M AVVL IVLAIL PIL LN L+
Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS16525    BCTERIALGSPC    49    3e-09    Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C signature.

Length = 272

Score = 49.2 bits (117), Expect = 3e-09
Identities = 37/148 (25%), Positives = 57/148 (38%), Gaps = 19/148 (12%)

Query: 32 APALLAVALIIAMSISLAWQAAG--WLRLQRSPVAVAASPVSHESIRSDPTRLAR--LFG 87
+P+++ L + + Q A W V++ ++ R P L LFG
Sbjct: 10 SPSVIRRILFYLLMLLFCQQLAMIFWRIGLPDNAPVSSVQITPAQARQQPVTLNDFTLFG 69

Query: 88 TSAQDPNAPP----------PATNLDLVLKGSFVQSDPKLSSAIIQRQGDKPHRYAVGGE 137
S + N P + L+L L G D S AII + ++ V E
Sbjct: 70 VS-PEKNKAGALDASQMSNLPPSTLNLSLTGVMAGDDDSRSIAIISKDNEQ-FSRGVNEE 127

Query: 138 ISDG--VKLHAVYRDRVELQRGGRLESL 163
+ G K+ ++ DRV LQ GR E L
Sbjct: 128 V-PGYNAKIVSIRPDRVVLQYQGRYEVL 154


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS16530    BCTERIALGSPD    594    0.0    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 594 bits (1533), Expect = 0.0
Identities = 217/631 (34%), Positives = 345/631 (54%), Gaps = 35/631 (5%)

Query: 41 AFVPAGNQQEAHWTINLKDADIREFIDQISEITGETFVVDPRVKGQVSVVSKAQLSLSEV 100
F PA ++ ++ + K DI+EFI+ +S+ +T ++DP V+G ++V S L+ +
Sbjct: 21 LFRPAAAEE---FSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDMLNEEQY 77

Query: 101 YQLFLSVMSTHGFTVVAQGDQA-RIVPNAEAKTEAG--GGQSAP---DRLETRVIQVQQS 154
YQ FLSV+ +GF V+ + ++V + +AKT A +AP D + TRV+ +
Sbjct: 78 YQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGIGDEVVTRVVPLTNV 137

Query: 155 PVSELIPLIRPLVPQYGHLAAV--PSANALIISDRSANIARIEDVIRQLDQKGSHDYSVI 212
+L PL+R L G + V +N L+++ R+A I R+ ++ ++D G +
Sbjct: 138 AARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDNAGDRSVVTV 197

Query: 213 NLRYGWVMDAAEV---LNNAMSRGQAKGAAGAQVIADARTNRLIILGPPQARAKLVQLAQ 269
L + D ++ LN S+ G+ A V+AD RTN +++ G P +R +++ + +
Sbjct: 198 PLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNSRQRIIAMIK 257

Query: 270 SLDTPTARSANTRVIRLRHNDAKTLAETLGQISEGMKNNGGQGGEQTGGGRPSNILIRAD 329
LD A NT+VI L++ A L E L IS M++ + NI+I+A
Sbjct: 258 QLDRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAKPVAALDK--NIIIKAH 315

Query: 330 ESTNALVLLADPDTVNALEDIVRQLDVPRAQVLVEAAIVEISGDIQDAVGVQWAINKGGM 389
TNAL++ A PD +N LE ++ QLD+ R QVLVEA I E+ +G+QWA GM
Sbjct: 316 GQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWANKNAGM 375

Query: 390 GGTKTNFANTGLSIGTLLQSLESNKAPESIP----------DGAIVGIGSSSFGALVTAL 439
T F N+GL I T + ++ +G G ++ L+TAL
Sbjct: 376 ----TQFTNSGLPISTAIAGANQYNKDGTVSSSLASALSSFNGIAAGFYQGNWAMLLTAL 431

Query: 440 SANTKSNLLSTPSLLTLDNQKAEILVGQNVPFNTGSYTTNSEGASNPFTTVERKDIGVSL 499
S++TK+++L+TPS++TLDN +A VGQ VP TGS TT+ + N F TVERK +G+ L
Sbjct: 432 SSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTGSQTTSGD---NIFNTVERKTVGIKL 488

Query: 500 KVTPHINDGAALRLEIEQEISALLPNAQQRNNT-DLITSKRSIKSTILAENGQVIVIGGL 558
KV P IN+G ++ LEIEQE+S++ A ++ + R++ + +L +G+ +V+GGL
Sbjct: 489 KVKPQINEGDSVLLEIEQEVSSVADAASSTSSDLGATFNTRTVNNAVLVGSGETVVVGGL 548

Query: 559 IQDDVSQAESKVPLLGDIPLLGRLFRSTKDTHTKRNLMVFLRPTVVRDSAGLAALSGKKY 618
+ VS KVPLLGDIP++G LFRST +KRNLM+F+RPTV+RD S +Y
Sbjct: 549 LDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRDEYRQASSGQY 608

Query: 619 SDIR-VIDGTRGPEGRPSILPTNANQLFDGQ 648
+ RG E ++L + +++ Q
Sbjct: 609 TAFNDAQSKQRGKENNDAMLNQDLLEIYPRQ 639


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS16535    DHBDHDRGNASE    114    6e-33    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 114 bits (287), Expect = 6e-33
Identities = 74/254 (29%), Positives = 110/254 (43%), Gaps = 18/254 (7%)

Query: 10 GKVALVTGAARGIGLGISAWLIAEGWQVVLADNDRERGARVAE---ALGEHAWFVAMDVA 66
GK+A +TGAA+GIG ++ L ++G + D + E+ +V A HA DV
Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVR 67

Query: 67 QEGQVAMSVAEVLGQFGRLDGLVCNAAIANPRNTPLEALSLGEWTRTLAVNLTGPMLLAK 126
+ A + + G +D LV A + P + +LS EW T +VN TG ++
Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRP--GLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 127 YCTPYLRA-HNGAIVNIASTRAHQSEPDSEAYAASKGGLLALTHALAASLGPE-IRVNAL 184
+ Y+ +G+IV + S A AYA+SK + T L L IR N +
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 185 SPG----------WIDTREAAEREAAPLTELDHDQHLVGRVGTVEDVASLVAWLLSEDAG 234
SPG W D A + L L ++ D+A V +L+S AG
Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPL-KKLAKPSDIADAVLFLVSGQAG 244

Query: 235 FVTGQEFLVDGGMT 248
+T VDGG T
Sbjct: 245 HITMHNLCVDGGAT 258


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS16555    PERTACTIN    30    0.006    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 30.5 bits (68), Expect = 0.006
Identities = 28/85 (32%), Positives = 31/85 (36%), Gaps = 1/85 (1%)

Query: 84 AAGQPSQPIGGLPATPPATQPPAQAQAQAPAASLPPSQPQPPAAPPSPPPA-EKRLDANN 142
A P+ P P QPP Q P P Q QP A P PP E AN
Sbjct: 565 AKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAGRELSAAANA 624

Query: 143 LPQSWSVQLASLSNRARAEELQKTL 167
+ V LAS A + L K L
Sbjct: 625 AVNTGGVGLASTLWYAESNALSKRL 649


115    HWH78_RS16705    HWH78_RS16745    N    Y    Y    Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias     Product
HWH78_RS16705  0       8        -2.109040   HlyD family secretion protein
HWH78_RS16710  0       15       -3.296970   multidrug efflux MFS transporter
HWH78_RS16715  0       22       -3.928628   excinuclease ABC subunit B
HWH78_RS16720  1       34       -5.558680   aspartate/tyrosine/aromatic aminotransferase
HWH78_RS16730  2       49       -7.834759   *ComEA family DNA-binding protein
HWH78_RS16735  2       54       -9.469475   polysaccharide biosynthesis protein
HWH78_RS16740  3       68       -11.556259  glycosyltransferase family 4 protein
HWH78_RS16745  4       73       -13.447288  SDR family oxidoreductase
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS16705    RTXTOXIND    183    4e-56    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 183 bits (467), Expect = 4e-56
Identities = 74/417 (17%), Positives = 144/417 (34%), Gaps = 96/417 (23%)

Query: 7 RRLTVFLVAVGLIALAFFLHWWFIGRHVESTDNAYVQGEIT------RVASQLGARVEEV 60
R + F++ +IA + VE A G++T + + V+E+
Sbjct: 58 RLVAYFIMGFLVIAFI-----LSVLGQVEIV--ATANGKLTHSGRSKEIKPIENSIVKEI 110

Query: 61 LVRDNQHVDKGQLLVRLEDADF--------------KLAVERAQA--------------- 91
+V++ + V KG +L++L +L R Q
Sbjct: 111 IVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKL 170

Query: 92 -----------------------ALATREAELAQARSKLVQQGSLIAASAADVNASQATL 128
+T + + Q L ++ + A +N +
Sbjct: 171 PDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLS 230

Query: 129 GRAQIDLNRAEALRKPGYVS-------EERVTTLTADNHVARSQL---------AKARAD 172
+ L+ +L ++ E + + V +SQL AK
Sbjct: 231 RVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQ 290

Query: 173 LEAQRVQRDTLGAEIKRLEAQIASARTELAQAEINLSRTLIHSPISGLVGQRSAR-NGQY 231
L Q + + L ++++ I ELA+ E ++I +P+S V Q G
Sbjct: 291 LVTQLFKNEILD-KLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGV 349

Query: 232 VQVGTHLLSLVPDED-IWVQANFKETQVGRMRDGQKARLTFDAFPDT---PIDGRIDSLF 287
V L+ +VP++D + V A + +G + GQ A + +AFP T + G++ ++
Sbjct: 350 VTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNIN 409

Query: 288 AASGAQFSLLPPDNATGNFTKVVQRIPVKIVFEADNPLHGRIRPGMSVEAEVELRDR 344
+ D G V+ I + + + + GM+V AE++ R
Sbjct: 410 LDA-------IEDQRLGLVFNVIISIEENCLSTGNKNI--PLSSGMAVTAEIKTGMR 457


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS16710    TCRTETB    104    3e-26    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 104 bits (261), Expect = 3e-26
Identities = 85/402 (21%), Positives = 165/402 (41%), Gaps = 19/402 (4%)

Query: 8 AFMAVLDIQITNSSLKDIQGALAATLEEGSWISTSYLVAEIIMIPMTAWLVQLLSARRLA 67
+F +VL+ + N SL DI +W++T++++ I + L L +RL
Sbjct: 23 SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLL 82

Query: 68 VMISVGFLVSSLLCSFAWNLESMIVF-RAMQGFTGGALIPLAFTLALVKLPEHHRPKGMA 126
+ + S++ + S+++ R +QG A L + +P+ +R K
Sbjct: 83 LFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFG 142

Query: 127 LFAITATFAPSIGPTLGGWLTENFGWEYIFYINVPPGLLMIAGLLYGLEKKAPHWELLKS 186
L +GP +GG + W Y+ I + ++ + L+ L+K+
Sbjct: 143 LIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMIT-IITVPFLMKLLKKEVRIKGHF-- 199

Query: 187 TDYAGIVTLGIGLGCLQVFLEEGHRKDWLESQLIVSLGSVALFSLVLFVILQLSRPNPLI 246
D GI+ + +G+ +F + S LIVS + S ++FV +P +
Sbjct: 200 -DIKGIILMSVGIVFFMLFT-----TSYSISFLIVS-----VLSFLIFVKHIRKVTDPFV 248

Query: 247 DLGILRNRNFGLASISSIGLGMGLYGSIYVLPLYLAQIQGYNAMQIGEVIMWMG-IPQLF 305
D G+ +N F + + + + G + ++P + + + +IG VI++ G + +
Sbjct: 249 DPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVII 308

Query: 306 LIPLVPKLMRLVSPRLLCAAGFGLFGLASFFSGVLNPDFAGPQFNQIQLLRALG-QPMIM 364
+ L+ P + G ++ + L F I ++ LG
Sbjct: 309 FGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFL--LETTSWFMTIIIVFVLGGLSFTK 366

Query: 365 VTISLIATAYLQPQDAGSASSLFNILRNLGGAIGIALLATLL 406
IS I ++ L+ Q+AG+ SL N L GIA++ LL
Sbjct: 367 TVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLL 408


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS16735    NUCEPIMERASE    57    8e-11    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 57.1 bits (138), Expect = 8e-11
Identities = 46/292 (15%), Positives = 103/292 (35%), Gaps = 56/292 (19%)

Query: 301 VMVTGAGGSIGSELCRQIMSCSPSVLILFEHSEYNLYSIHQELERRIKRESLSVNLLPIL 360
+VTGA G IG + ++++ V+ + ++Y S+ Q + +
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQP----GFQFHK 58

Query: 361 GSVRNPERLVDVMRTWKVNTVYHAAAYKHVPIVEHNIAEGVLNNVIGTLHAVQAAVQVGV 420
+ + E + D+ + V+ + V N +N+ G L+ ++ +
Sbjct: 59 IDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKI 118

Query: 421 QNFVLIST---------------DKAVRPTNVMGSTKRLAEMVLQALSNESAPVLFGDRK 465
Q+ + S+ D P ++ +TK+ E++ S
Sbjct: 119 QHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYS------------ 166

Query: 466 DVHHVNKTRFTMVRFGNVLGSSGS---VIPLFREQIKRGGPVTV-THPSITRYFMTIPEA 521
H+ T +RF V G G + F + + G + V + + R F I +
Sbjct: 167 ---HLYGLPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDI 223

Query: 522 AQLVIQA----------GSMGQGGD--------VFVLDMGPPVKILELAEKM 555
A+ +I+ ++ G V+ + PV++++ + +
Sbjct: 224 AEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQAL 275


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
HWH78_RS16745    NUCEPIMERASE    66    3e-14    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 66.0 bits (161), Expect = 3e-14
Identities = 67/356 (18%), Positives = 120/356 (33%), Gaps = 69/356 (19%)

Query: 5 NVLVTGATGFIGAALVNSLCSSGQ-----------YKVWAGCRRRGGAWPRGVTP----L 49
LVTGA GFIG + L +G Y V R G L
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 50 LLGELGSSVVWDAESAIDTVVHCAARVHV-MSETASDPLVEFRKANVQGT---LDLAREA 105
E + + + V R+ V S + +N+ G L+ R
Sbjct: 62 ADREGMTDLFASGH--FERVFISPHRLAVRYSLENPHAYAD---SNLTGFLNILEGCRHN 116

Query: 106 VSRGVRRFIFISSIKVNGEGTEPGRPY-TADSPPNPVDPYGVSKREAEQALLDLAEETGL 164
++ ++ SS V G + P+ T DS +PV Y +K+ E + GL
Sbjct: 117 ---KIQHLLYASSSSVYGLNRKM--PFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGL 171

Query: 165 EVVIIRPVLVYGPGVKAN--VQTMMRWLKRGVPLPL-GAIHNRRSLVSLDNLVDLIITCI 221
+R VYGP + + + + + G + + +R +D++ + II
Sbjct: 172 PATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQ 231

Query: 222 EHPA-----------------AVGQVFLVSDGEDLSTTELLRRMGRALGAPAR--LLPVP 262
+ A +V+ + + + + ++ + ALG A+ +LP+
Sbjct: 232 DVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQ 291

Query: 263 ASWIGAAAKVLNRQAFARRLCGSLQVDIMKTRQVLGWTPPVGVDQALEKTARSFLD 318
V + A D +V+G+TP V ++ + D
Sbjct: 292 ------PGDV--LETSA---------DTKALYEVIGFTPETTVKDGVKNFVNWYRD 330


116  HWH78_RS17005  HWH78_RS17025  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS17005  1       16       1.090075   YciI family protein
HWH78_RS17010  2       15       2.928802   hypothetical protein
HWH78_RS17015  3       14       3.454961   response regulator transcription factor
HWH78_RS17020  2       14       3.472730   Spy/CpxP family protein refolding chaperone
HWH78_RS17025  2       13       1.835014   HAMP domain-containing histidine kinase
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS17010  adhesinmafb   30     9e-04    Neisseria meningitidis: adhesin MafB signature.
		>adhesinmafb#Neisseria meningitidis: adhesin MafB signature.

Length = 467

Score = 30.4 bits (68), Expect = 9e-04
Identities = 13/45 (28%), Positives = 18/45 (40%)

Query: 53 AAGFTGSLIVAEFDSLAAAQSWAEADPYRAAGVYAEVVVKPFKKV 97
G GS+ E ++ A W + +P A V A V KV
Sbjct: 278 VIGGLGSVAGFEKNTREAVDRWIQENPNAAETVEAVFNVAAAAKV 322


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS17015  IGASERPTASE   28     0.010    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 28.1 bits (62), Expect = 0.010
Identities = 16/76 (21%), Positives = 31/76 (40%), Gaps = 8/76 (10%)

Query: 22 EEPAPAPIPAAQPSITQATAELERRLVETERQRDELVSRMRQENRQLREQ--------LQ 73
E P P P PA T+ AE ++ +T + ++ + +NR++ ++ Q
Sbjct: 1022 EAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQ 1081

Query: 74 AAQAQRQPPLLTEEQT 89
+ + E QT
Sbjct: 1082 TNEVAQSGSETKETQT 1097


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS17020  HTHFIS        103    9e-28    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 103 bits (258), Expect = 9e-28
Identities = 42/117 (35%), Positives = 63/117 (53%)

Query: 4 LLLIDDDRELCELLGTWLVQEGFSVRASHDGAQARRALAEQTPDAVVLDVMLPDGSGLEL 63
+L+ DDD + +L L + G+ VR + + A R +A D VV DV++PD + +L
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 64 LKQLRGDHPDLPVLMLSARGEPLDRILGLELGADDYLAKPCDPRELTARLRAVLRRT 120
L +++ PDLPVL++SA+ + I E GA DYL KP D EL + L
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS17030  PF06580       29     0.026    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.4 bits (66), Expect = 0.026
Identities = 16/100 (16%), Positives = 35/100 (35%), Gaps = 17/100 (17%)

Query: 341 VDNLLRNAVRFNPVGQPLEVRASSAGDYLRLSVRDHGPGIAAELQEQLGEPFFRAPNQSS 400
V+N +++ + P G + ++ + + L V + G +E
Sbjct: 264 VENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE-------------- 309

Query: 401 PGHGLGLA-IARRAIERHGGHLRLG-NHPDGGFIATLSLP 438
G GL + R +G ++ + G A + +P
Sbjct: 310 -STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348


117  HWH78_RS18055  HWH78_RS18095  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS18055  2       15       1.223034   ABC transporter permease
HWH78_RS18060  1       14       2.534741   ABC transporter permease
HWH78_RS18065  1       13       3.791587   efflux RND transporter periplasmic adaptor
HWH78_RS18070  2       14       3.816754   phosphate-starvation-inducible protein PsiE
HWH78_RS18075  2       12       3.333818   DUF3509 domain-containing protein
HWH78_RS18080  2       13       3.991207   TolC family outer membrane protein
HWH78_RS18085  0       12       3.633348   HlyD family type I secretion periplasmic adaptor
HWH78_RS18090  0       12       3.421995   type I secretion system permease/ATPase
HWH78_RS18095  1       12       2.443684   heme acquisition protein HasA
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS18055  ABC2TRNSPORT  28     0.039    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 28.4 bits (63), Expect = 0.039
Identities = 27/122 (22%), Positives = 50/122 (40%), Gaps = 1/122 (0%)

Query: 246 LGYRQSASFFMLLGIVLPFLIAVIALSEFIAELLPTEESVYLTMTFITLPLFYMAGYSWP 305
LGY Q S L ++ +A +L + L P+ + T + P+ +++G +P
Sbjct: 139 LGYTQWLSLLYALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFP 198

Query: 306 EQAMPDWVRWLADAIPSTWAIRAIAEMNQMDLPLREVSDHALVLLGMAATYALLGTLLYQ 365
+P + A +P + +I I + + P+ +V H L L T L +
Sbjct: 199 VDQLPIVFQTAARFLPLSHSIDLIRPI-MLGHPVVDVCQHVGALCIYIVIPFFLSTALLR 257

Query: 366 YR 367
R
Sbjct: 258 RR 259


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS18065  RTXTOXIND     56     1e-10    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 55.6 bits (134), Expect = 1e-10
Identities = 26/161 (16%), Positives = 62/161 (38%), Gaps = 17/161 (10%)

Query: 41 IVSSKAKGRVQVLHVRRGDEVKQGDLLISLDSPELEAQLDALHAARNQVQAQLDESLHGT 100
+ V+ + V+ G+ V++GD+L+ L + EA ++ +QA+L+++ +
Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSS--LLQARLEQTRYQI 155

Query: 101 REESIRALKASLAQAEAELRNAESDFQRNQQMVERGFLSRTQFDLSRRERDVARDRVAEA 160
SI K + E ++++ L + QF + ++ + +
Sbjct: 156 LSRSIELNKLPELKLPDEPYFQNV---SEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKK 212

Query: 161 RANLDEGLKGDREERRQALQAAVRRADAQIAELQAQIDDLQ 201
RA + A + R + ++++DD
Sbjct: 213 RAER------------LTVLARINRYENLSRVEKSRLDDFS 241



Score = 51.8 bits (124), Expect = 1e-09
Identities = 29/205 (14%), Positives = 77/205 (37%), Gaps = 24/205 (11%)

Query: 75 LEAQLDALHAARNQVQAQLDESLHGTREESIRALKASLAQAEAELRNAESDFQRNQQMVE 134
++ Q + Q + LD+ + + A + + E R +S ++
Sbjct: 191 IKEQFSTWQNQKYQKELNLDK-----KRAERLTVLARINRYENLSRVEKSRLDDFSSLLH 245

Query: 135 RGFLSRTQFDLSRRERDVARDRVAEARANLDE------GLKGDREERRQALQAAV----R 184
+ +++ + A + + ++ L++ K + + Q + + R
Sbjct: 246 KQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLR 305

Query: 185 RADAQIAELQAQI----DDLQ---VRAPVNGEVGPIPA-EQGELINAYSPLLTLVRLDDS 236
+ I L ++ + Q +RAPV+ +V + +G ++ L+ +V DD+
Sbjct: 306 QTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDT 365

Query: 237 YFV-FNLREDILAKVRKGDRIVMQV 260
V ++ + + G +++V
Sbjct: 366 LEVTALVQNKDIGFINVGQNAIIKV 390


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS18080  RTXTOXIND     32     0.006    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 31.7 bits (72), Expect = 0.006
Identities = 23/170 (13%), Positives = 47/170 (27%), Gaps = 9/170 (5%)

Query: 60 LPSLRYDYNKARNDSTVSQGDARVERDYRSYASTLSLEQPLFDYEAYARYRQ-GEAQAL- 117
L +L + + + S++ Q R S + P ++ E + L
Sbjct: 127 LTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLR 186

Query: 118 ---FADEQFRGRSQELA---VRLFAAYSETLFAREQVLLAEAQRRALETQLAFNQRAFEE 171
EQF + + L +E L ++ E R +++L +
Sbjct: 187 LTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHK 246

Query: 172 GEGTRTDLLET-RARLSLTRAEEIAAGDRAAAARRTLEAMLGQALEDREL 220
+ +LE + + L A L +
Sbjct: 247 QAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLF 296


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS18085  RTXTOXIND     416    e-145    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 416 bits (1072), Expect = e-145
Identities = 96/435 (22%), Positives = 170/435 (39%), Gaps = 8/435 (1%)

Query: 15 AALELDEK---RFSRLGWGLVLLGFVGFLLWAGLAPLDKGVGVSGTVMVAGSRKAVQHPT 71
A LEL E R RL ++ V + + L ++ +G + +G K ++
Sbjct: 44 AHLELIETPVSRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIE 103

Query: 72 GGLVRHIRVHEGERVEAGQVLLEMDATQARAQADGLFAQYLAALASLARLSAERDEKARI 131
+V+ I V EGE V G VLL++ A A A + L A R
Sbjct: 104 NSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELN 163

Query: 132 EFPAELLALDDPRLPTLLEQQ----RQLHDSRRRALRLELDGLAETVAGSQAQLDGLQAA 187
+ P EL D+P + E++ L + + + + +A+ + A
Sbjct: 164 KLP-ELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLAR 222

Query: 188 LRSKEQQRAALEEQLRGLRQLASEGYVPRNRLLDSERLLAQVNGEIAGDLGSLGSTRRQI 247
+ E + +L L + + ++ +L+ E + E+ L +I
Sbjct: 223 INRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEI 282

Query: 248 LELRLRMAQRREKFQEEVRGSLADAQVRAEELRNRLASARFDLANSEVRAPVAGLVVGQE 307
L + + F+ E+ L L LA S +RAPV+ V +
Sbjct: 283 LSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLK 342

Query: 308 VFTEGGVIAPGQQLMEILPERQPLLVDARLPVEMVDKVRVGLPVELMFSAFSQSTTPRVE 367
V TEGGV+ + LM I+PE L V A + + + + VG + AF + +
Sbjct: 343 VHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLV 402

Query: 368 GEVTLVSADRLLDERSEAPYYRVRIRVGEEGVRRLAGLEIRPGMPVEAFVRSGERSLLNY 427
G+V ++ D + D+R + + + + GM V A +++G RS+++Y
Sbjct: 403 GKVKNINLDAIEDQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVISY 462

Query: 428 LFKPLADRTHLALGE 442
L PL + +L E
Sbjct: 463 LLSPLEESVTESLRE 477


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS18095  PF06438       276    1e-97    Heme acquisition protein HasAp
		>PF06438#Heme acquisition protein HasAp

Length = 205

Score = 276 bits (706), Expect = 1e-97
Identities = 204/205 (99%), Positives = 205/205 (100%)

Query: 1 MSISISYSTTYSGWTVADYLADWSAYFGDVNHRPGQVVDGSNTGGFNPGPFDGSQYALKS 60
MSISISYSTTYSGWTVADYLADWSAYFGDVNHRPGQVVDGSNTGGFNPGPFDGSQYALKS
Sbjct: 1 MSISISYSTTYSGWTVADYLADWSAYFGDVNHRPGQVVDGSNTGGFNPGPFDGSQYALKS 60

Query: 61 TASDAAFIAGGDLHYTLFSNPSHTLWGKLDSIALGDTLTGGASSGGYALDSQEVSFSNLG 120
TASDAAFIAGGDLHYTLFSNPSHTLWGKLDSIALGDTLTGGASSGGYALDSQEVSFSNLG
Sbjct: 61 TASDAAFIAGGDLHYTLFSNPSHTLWGKLDSIALGDTLTGGASSGGYALDSQEVSFSNLG 120

Query: 121 LDSPIAQGRDGTVHKVVYGLMSGDSSALQGQIDALLKAVDPSLSINSTFDQLAAAGVAHA 180
LDSPIAQGRDGTVHKVVYGLMSGDSSALQGQIDALLKAVDPSLSINSTFDQLAAAGVAHA
Sbjct: 121 LDSPIAQGRDGTVHKVVYGLMSGDSSALQGQIDALLKAVDPSLSINSTFDQLAAAGVAHA 180

Query: 181 TPAAAAAEIGVVGVQELPHDLALAA 205
TPAAAAAE+GVVGVQELPHDLALAA
Sbjct: 181 TPAAAAAEVGVVGVQELPHDLALAA 205


118  HWH78_RS18365  HWH78_RS18405  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS18365  -1      12       2.152739   N-acetylglutaminylglutamine amidotransferase
HWH78_RS18370  0       13       2.488531   N-acetylglutaminylglutamine synthetase
HWH78_RS18375  0       13       1.968485   osmoprotectant NAGGN system M42 family
HWH78_RS18380  1       12       2.101305   hybrid sensor histidine kinase/response
HWH78_RS18385  -1      12       1.707788   YheU family protein
HWH78_RS18390  -1      12       1.820905   hypothetical protein
HWH78_RS18395  -1      13       1.686548   MFS transporter
HWH78_RS18400  -2      13       0.815835   DEAD/DEAH box helicase
HWH78_RS18405  -2      12       0.668718   MFS transporter
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS18365  ANTHRAXTOXNA  33     0.003    Anthrax toxin LF subunit signature.
		>ANTHRAXTOXNA#Anthrax toxin LF subunit signature.

Length = 800

Score = 33.2 bits (75), Expect = 0.003
Identities = 25/68 (36%), Positives = 38/68 (55%), Gaps = 7/68 (10%)

Query: 192 APHTLLEGVKKLPPATW-MSVDLDGSCEQRTWWT---LDYG--PRPDERELTLDDWQERV 245
AP +L E K++P W V+ S E++ T + YG +PD + TL +WQ+++
Sbjct: 498 AP-SLTEIKKQIPQKEWDKVVNTPNSLEKQKGVTNLLIKYGIERKPDSTKGTLSNWQKQM 556

Query: 246 LDGLREAV 253
LD L EAV
Sbjct: 557 LDRLNEAV 564


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS18370  SACTRNSFRASE  35     3e-04    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 35.3 bits (81), Expect = 3e-04
Identities = 15/53 (28%), Positives = 19/53 (35%)

Query: 197 LAVDPQCSRPGVGEALVRHLVEHFMSRELAYLDLSVLHNNQQAKALYRKLGFR 249
+AV + GVG AL+ +E L L N A Y K F
Sbjct: 95 IAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFI 147


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS18380  HTHFIS        70     2e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 69.9 bits (171), Expect = 2e-14
Identities = 34/132 (25%), Positives = 57/132 (43%), Gaps = 7/132 (5%)

Query: 786 LDAPCILVAEDNPVNQLVVRGFLAKRGYAVRLAGNGRLALDEYLRDPNGIQLILMDGEMP 845
+ ILVA+D+ + V+ L++ GY VR+ N L++ D MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGD--LVVTDVVMP 58

Query: 846 EMDGFEATRLIRREERAQGWPRVPIVALTAHILDEHRRAGIEAGMDAYLGKPVDRAELYA 905
+ + F+ I++ P +P++ ++A E G YL KP D EL
Sbjct: 59 DENAFDLLPRIKKA-----RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIG 113

Query: 906 TLERLLGQPSRQ 917
+ R L +P R+
Sbjct: 114 IIGRALAEPKRR 125


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS18390  PRPHPHLPASEC  38     4e-05    Prokaryotic zinc-dependent phospholipase C signature.
		>PRPHPHLPASEC#Prokaryotic zinc-dependent phospholipase C signature.

Length = 398

Score = 38.5 bits (89), Expect = 4e-05
Identities = 13/38 (34%), Positives = 20/38 (52%)

Query: 241 QYFGLSRFAFANGHPYWGYRFLGWGMHYIQDITQPYHS 278
++ L+R+ + G+ +LG MHY DI PYH
Sbjct: 128 KFSALARYEWQRGNYKQATFYLGEAMHYFGDIDTPYHP 165


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS18395  TCRTETA       43     3e-06    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 42.5 bits (100), Expect = 3e-06
Identities = 57/281 (20%), Positives = 94/281 (33%), Gaps = 13/281 (4%)

Query: 79 ALPLVLLSILSGVIADNHDRRKIMLWGLSFEMTGAMFATLLAFLGYLDPVLLIISILWIS 138
AL + + G ++D RR ++L L A + + P L ++ I I
Sbjct: 53 ALMQFACAPVLGALSDRFGRRPVLLVSL-------AGAAVDYAIMATAPFLWVLYIGRIV 105

Query: 139 LGGS-VTIPAWQAAVNEQVPARMVSDAVLLNSVNYNVARAAGPALGGLLLSAVGPAWVFL 197
G + T A + + + S + AGP LGGL+ P F
Sbjct: 106 AGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLM-GGFSPHAPFF 164

Query: 198 FNSFCY-MALIWAIWQWRRDVPKRSLPPEGILEGVTAALRFTQYSTVTRLVMMRSFAFGL 256
+ + + + P A+ R+ + TV +M F L
Sbjct: 165 AAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQL 224

Query: 257 SASAVWALLPLLAHRNPDGDAAIYGYMLGALG-LGAILGSTQVSRLRQRIGSSRLISLAG 315
AL + DA G L A G L ++ + + R+G R + L
Sbjct: 225 VGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGM 284

Query: 316 FTLALILLTLGLVDNLWVLFPVLIL--GGGCWIGALATYNS 354
+ L W+ FP+++L GG + AL S
Sbjct: 285 IADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQAMLS 325



Score = 40.6 bits (95), Expect = 1e-05
Identities = 32/189 (16%), Positives = 63/189 (33%), Gaps = 12/189 (6%)

Query: 12 PLKPEGQAAKPERTGTWAPFSIQAFRIIWICNLFANLGTWA--QSVAAAWVVTDA---HA 66
K E + + E A F + + Q AA WV+ H
Sbjct: 184 SHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHW 243

Query: 67 SPLMVA-MIQVAAALPLVLLSILSGVIADNHDRRKIMLWGLSFEMTGAMFATLLAFLGYL 125
+ + L + ++++G +A R+ ++ G+ + TG L +
Sbjct: 244 DATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTG------YILLAFA 297

Query: 126 DPVLLIISILWISLGGSVTIPAWQAAVNEQVPARMVSDAVLLNSVNYNVARAAGPALGGL 185
+ I+ + G + +PA QA ++ QV + ++ GP L
Sbjct: 298 TRGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTA 357

Query: 186 LLSAVGPAW 194
+ +A W
Sbjct: 358 IYAASITTW 366



Score = 34.4 bits (79), Expect = 0.001
Identities = 32/142 (22%), Positives = 51/142 (35%), Gaps = 8/142 (5%)

Query: 277 AAIYGYMLGALGLGAILGSTQVSRLRQRIGSSR--LISLAGFTLALILLTLGLVDNLWVL 334
A YG +L L + + L R G L+SLAG + ++ LWVL
Sbjct: 42 TAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATA--PFLWVL 99

Query: 335 FPVLILGG-GCWIGALATYNSAVQILVPDWIKARALALYQTALYGGLALGSFLWGHLAET 393
+ I+ G GA+A + + + +AR G+ G L G +
Sbjct: 100 YIGRIVAGITGATGAVAG--AYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGG- 156

Query: 394 MTVHGALLAAGCLLLASVILLY 415
+ H AA L + +
Sbjct: 157 FSPHAPFFAAAALNGLNFLTGC 178


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS18405  TCRTETA       58     3e-11    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 57.9 bits (140), Expect = 3e-11
Identities = 46/209 (22%), Positives = 86/209 (41%), Gaps = 4/209 (1%)

Query: 26 VIIALAFFFDSMDLAMMTFLLGSIKAEFGLDSAQA---GLLASSSFFGMVIGAALSGMLA 82
++I D++ + ++ +L + + + G+L + A + G L+
Sbjct: 8 IVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALS 67

Query: 83 DRFGRKPVFQASIVLWGLASYLCSTAGDLDSLTFYRVLLGIGMGMEFPIAQSLLSEMIPA 142
DRFGR+PV S+ + + +TA L L R++ GI G +A + ++++
Sbjct: 68 DRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGI-TGATGAVAGAYIADITDG 126

Query: 143 SRRGKYIALMDGFWPLGFVAAGCLSYFLLPLTGWRSIFLVLALPAVFVLAIRFLIPESPR 202
R ++ M + G VA L + + F AL + L FL+PES +
Sbjct: 127 DERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHK 186

Query: 203 WLEQAGRREQADRVLRDIEARVMRSLGLT 231
+ RRE + + AR M +
Sbjct: 187 GERRPLRREALNPLASFRWARGMTVVAAL 215



Score = 30.6 bits (69), Expect = 0.012
Identities = 37/167 (22%), Positives = 61/167 (36%), Gaps = 9/167 (5%)

Query: 286 LSALLQQSGFAVTQSVYYTVLISLAGIPGFLCAAWL---VESWGRKPSCVLMLLGGGAMA 342
L LL+ + + +Y +L++L + F CA L + +GR+P ++ L G
Sbjct: 28 LPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDY 87

Query: 343 YAYGQTAVFGGSLALLIGFGLAMQFFLFGMWAVLYTYTPELYPTSARATGSGFASAVGRI 402
L +L G + AV Y ++ RA GF SA
Sbjct: 88 AIMA----TAPFLWVLY-IGRIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGF 142

Query: 403 GSLLGPLVTGLVLPLTGQGGVFTLGALCFGVAALVVWAFGIETRGRT 449
G + GP++ GL+ + F A G+ L E+
Sbjct: 143 GMVAGPVLGGLMGGFSPHAP-FFAAAALNGLNFLTGCFLLPESHKGE 188


119  HWH78_RS19565  HWH78_RS19615  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS19565  1       11       0.118941   TetR/AcrR family transcriptional regulator
HWH78_RS19570  0       10       0.197754   lysine--tRNA ligase
HWH78_RS19575  0       13       1.310141   peptide chain release factor 2
HWH78_RS19580  1       10       2.075918   Wsp signal transduction system regulator
HWH78_RS19585  0       9        1.466438   Wsp signal transduction system protein-glutamate
HWH78_RS19590  -1      9        1.328644   Wsp signal transduction system sensor histidine
HWH78_RS19595  -2      9        1.038697   chemotaxis protein CheW
HWH78_RS19600  -2      9        1.008786   protein-glutamate O-methyltransferase CheR
HWH78_RS19605  -1      10       0.654160   chemotaxis protein CheW
HWH78_RS19610  -2      10       0.546290   Wsp signal transduction system chemoreceptor
HWH78_RS19615  -2      10       0.770585   MHS family MFS transporter
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS19565  HTHTETR       52     4e-10    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 51.6 bits (123), Expect = 4e-10
Identities = 29/137 (21%), Positives = 58/137 (42%), Gaps = 7/137 (5%)

Query: 27 KASRQGSEQRRQAILDAAMRLIVRDGVRAVRHRAVAAEAQVPLSATTYYFKDIDDLITDT 86
+ ++Q +++ RQ ILD A+RL + GV + +A A V A ++FKD DL ++
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 87 FALFVERNAEALSAFWSSVEGDLQEMAAVLADD-------PGARGSLVERIVELAVQYVQ 139
+ L E + + GD + + R L+E I +
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 140 VQLTERREHLLAEQAFR 156
+ + ++ + L +++
Sbjct: 123 MAVVQQAQRNLCLESYD 139


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS19580  HTHFIS        68     1e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 67.5 bits (165), Expect = 1e-14
Identities = 34/129 (26%), Positives = 53/129 (41%), Gaps = 3/129 (2%)

Query: 21 VLLVDDQAMIGEAVRRSLASEAGIDFHFCSDPQQAVAVANQIKPTVILQDLVMPGVDGLT 80
+L+ DD A I + ++L S AG D S+ +++ D+VMP +
Sbjct: 6 ILVADDDAAIRTVLNQAL-SRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 81 LLAAYRGNPATRDIPIIVLSTKEEPTVKSAAFAAGANDYLVKLPDAIELVARIRYHSRSY 140
LL + A D+P++V+S + A GA DYL K D EL+ I
Sbjct: 65 LLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 141 IALQQRDEA 149
+ E
Sbjct: 123 KRRPSKLED 131


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS19585  HTHFIS        52     2e-09    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 51.8 bits (124), Expect = 2e-09
Identities = 31/141 (21%), Positives = 53/141 (37%), Gaps = 13/141 (9%)

Query: 2 RIGIVNDMPLAVEALRRALAFEPQHQIVWVASNGAEAVTQCAADTPDVVLMDLLMPVMDG 61
I + +D L +AL+ V + SN A AA D+V+ D++MP +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGY--DVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 62 VEATRRIMAESPCAIVIVTVDIEQNVHRVFEAMGYGALDAVNTP----------ALGIGN 111
+ RI P V+V + + +A GA D + P +
Sbjct: 63 FDLLPRIKKARPDLPVLV-MSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 112 PQTAAAPLLRKIQNVGWLIGQ 132
P+ + L Q+ L+G+
Sbjct: 122 PKRRPSKLEDDSQDGMPLVGR 142


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS19590  HTHFIS        74     7e-16    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 74.1 bits (182), Expect = 7e-16
Identities = 30/113 (26%), Positives = 52/113 (46%), Gaps = 2/113 (1%)

Query: 644 QRKRILVVDDSLTVRELERKLLLGRGYDVAVAVDGMDGWNALRSEHFDLLITDIDMPRMD 703
ILV DD +R + + L GYDV + + W + + DL++TD+ MP +
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 704 GIELVTLVRRDSRLQSLPVMVVSYKDREEDRRRGLDAGADYYLAKASFHDEAL 756
+L+ ++ LPV+V+S ++ + + GA YL K E +
Sbjct: 62 AFDLLPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELI 112


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS19600  FLGHOOKFLIK   29     0.033    Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 29.0 bits (64), Expect = 0.033
Identities = 18/76 (23%), Positives = 25/76 (32%), Gaps = 3/76 (3%)

Query: 249 QPIGVPLSFVFRRTSEAPRGARPKAVSDGARPVVAAAVERASVRPSPPPPAKPRQRLSSL 308
P P F + + A A+P+ E S P+ S L
Sbjct: 155 LPTEKPTLFTKLTSEQLTTAQPDDAPGTPAQPLTPLVAEAQSKAEVISTPSPVTAAASPL 214

Query: 309 VPPASGQPL---ASPV 321
+ P QPL A+PV
Sbjct: 215 ITPHQTQPLPTVAAPV 230


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS19615  TCRTETA       35     6e-04    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 35.2 bits (81), Expect = 6e-04
Identities = 15/27 (55%), Positives = 17/27 (62%)

Query: 304 VAGWLSDRIGRKPVLLAGLLLATLFYF 330
V G LSDR GR+PVLL L A + Y
Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYA 88



Score = 32.1 bits (73), Expect = 0.005
Identities = 24/113 (21%), Positives = 45/113 (39%), Gaps = 17/113 (15%)

Query: 63 IFALMAFAAGFLVRPFGALVFGRLGDMIGRKYTFLVTILLMGLSTFAVGLLPTYASIGVA 122
++ALM FA A V G L D GR+ LV++ + + P
Sbjct: 51 LYALMQFAC--------APVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLW----- 97

Query: 123 APIILVTLRMLQGLALGGEYGGAAIYVAEHAPANKRGSYTSWIQSTATLGLLL 175
+L R++ G+ G A Y+A+ ++R + ++ + G++
Sbjct: 98 ---VLYIGRIVAGIT-GATGAVAGAYIADITDGDERARHFGFMSACFGFGMVA 146


120  HWH78_RS19640  HWH78_RS19690  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS19640  -2      10       -0.076879  response regulator transcription factor
HWH78_RS19645  -2      11       0.261957   hypothetical protein
HWH78_RS19650  -2      13       -0.059880  WG repeat-containing protein
HWH78_RS19655  1       18       0.553078   FKBP-type peptidyl-prolyl cis-trans isomerase
HWH78_RS19660  0       12       1.256622   MFS transporter
HWH78_RS19665  -2      8        -0.493540  MexR antirepressor ArmR
HWH78_RS19670  -2      9        0.176029   hypothetical protein
HWH78_RS19675  -2      9        0.069784   efflux system transcriptional repressor NalC
HWH78_RS19680  -2      11       0.136797   hypothetical protein
HWH78_RS19685  -2      10       0.030680   NADH:flavin oxidoreductase/NADH oxidase
HWH78_RS19690  0       11       -0.262367  M4 family elastase LasB
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS19640  HTHFIS        61     3e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 61.4 bits (149), Expect = 3e-13
Identities = 28/130 (21%), Positives = 57/130 (43%), Gaps = 11/130 (8%)

Query: 2 KTRVILVDDHALTLIGMRYLLSAYD-DLRIVAQAQDADGLLAQLEAHPCDLLITDLMMPG 60
+++ DD A + LS D+RI + A + A DL++TD++MP
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATL---WRWIAAGDGDLVVTDVVMPD 59

Query: 61 SQQADGLRLVQKVRRRYPDLPIIVVTMLGNPALVSSLLKLGIHGLVSK----RGMLDDLP 116
+ L+ ++++ PDLP++V++ + G + + K ++ +
Sbjct: 60 ---ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116

Query: 117 KAIRHAGRRP 126
+A+ RRP
Sbjct: 117 RALAEPKRRP 126


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS19655  INFPOTNTIATR  80     5e-22    Macrophage infectivity potentiator signature.
		>INFPOTNTIATR#Macrophage infectivity potentiator signature.

Length = 233

Score = 80.4 bits (198), Expect = 5e-22
Identities = 42/104 (40%), Positives = 59/104 (56%), Gaps = 2/104 (1%)

Query: 5 LQIEDLLLGDGKEVVKGALITTQYKGTLEDGTLFDSSYERGRPFQCVIGTGRVIKGWDQG 64
LQ + + G G + K +T +Y GTL DGT+FDS+ + G+P +VI GW +
Sbjct: 128 LQYKIIDAGTGAKPGKSDTVTVEYTGTLIDGTVFDSTEKAGKP--ATFQVSQVIPGWTEA 185

Query: 65 LMGMKVGGKRRLFVPSHLAYGERQVGAHIKPHSNLLFEIELLEV 108
L M G +FVP+ LAYG R VG I P+ L+F+I L+ V
Sbjct: 186 LQLMPAGSTWEVFVPADLAYGPRSVGGPIGPNETLIFKIHLISV 229


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS19660  TCRTETB       65     2e-13    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 64.5 bits (157), Expect = 2e-13
Identities = 68/380 (17%), Positives = 134/380 (35%), Gaps = 57/380 (15%)

Query: 40 IALPSLQRSFGGDLAALSWIMSAFPFVGVFGGIAAGLLVRRWGDRRLLTGGLAILGGASL 99
++LP + F A+ +W+ +AF G G L + G +RLL G+ I S+
Sbjct: 35 VSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSV 94

Query: 100 LGASMQDFA-WLLATRFVEGLGFLIVVVAAPAVLHRITSETRRSVVFGLWSTFMAGGIAL 158
+G F L+ RF++G G V+ R + R FGL + +A G +
Sbjct: 95 IGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGV 154

Query: 159 SMLFGPLLADWRADWQLSALLVLVAALLLPF--SVPADDGRRAAGVRPAGLGTLLKVPAI 216
G ++A + W L+ ++ + +PF + + R G+ +L I
Sbjct: 155 GPAIGGMIAHY-IHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGI--ILMSVGI 211

Query: 217 TLLALGFTTYNLQFFALMTF---------------------------------------- 236
L T+Y++ F +
Sbjct: 212 VFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGT 271

Query: 237 ------LPVFLMQR---LGVALETAGLIGAAIVAANALGNVAAGFILSRGIRPGALLAST 287
+ ++M+ L A + +I ++ G + + RG + T
Sbjct: 272 VAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVT 331

Query: 288 AILMGLTGAAFFHAATPGLLAIALGFVFSAVAGMLPTTVLATAPLASPAPSLTPLAIGWV 347
+ + A+F T + I + FV ++ TV++T +S + +
Sbjct: 332 FLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFT--KTVISTIVSSSLKQQEAGAGMSLL 389

Query: 348 MQGNYLGQVIGPLLIGLIVS 367
++L + G ++G ++S
Sbjct: 390 NFTSFLSEGTGIAIVGGLLS 409


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS19675  HTHTETR       64     6e-15    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 64.3 bits (156), Expect = 6e-15
Identities = 25/96 (26%), Positives = 44/96 (45%), Gaps = 5/96 (5%)

Query: 10 ERGRQRRRAMLDAATQAFLEHGFEGTTLDMVIERAGGSRGTLYSSFGGKEGLFAAVIA-- 67
+ ++ R+ +LD A + F + G T+L + + AG +RG +Y F K LF+ +
Sbjct: 7 QEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELS 66

Query: 68 --HMIEEIFDDSADQPR-PAATLSATLEHFGRRFLT 100
++ E + A P P + L L H +T
Sbjct: 67 ESNIGELELEYQAKFPGDPLSVLREILIHVLESTVT 102


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS19690  THERMOLYSIN   399    e-136    Thermolysin metalloprotease (M4) family signature.
		>THERMOLYSIN#Thermolysin metalloprotease (M4) family signature.

Length = 544

Score = 399 bits (1027), Expect = e-136
Identities = 138/488 (28%), Positives = 206/488 (42%), Gaps = 59/488 (12%)

Query: 51 GAGGADELKAIRSTTLPNGKQVTRYEQFHNGVRVVGEAITEVKGPGKSVAAQRSGHFVAN 110
G + L I + G V R+EQ +G + G+ + SG + N
Sbjct: 69 GGQARERLSLIGNKLDELGHTVMRFEQAIAASLCMGAVLVAHVNDGE--LSSLSGTLIPN 126

Query: 111 IAADLPGSTTAAVSAEQVLAQAKS------LKAQGRKTENDKVELVIRLGENNIAQLVYN 164
+ T AA+S +Q AK K + E LVI E +L Y
Sbjct: 127 LDKRTL-KTEAAISIQQAEMIAKQDVADRVTKERPAAEEGKPTRLVIYPDEET-PRLAYE 184

Query: 165 VSYLIPGEGLSRPHFVIDAKTGEVLDQWEGLAHAEAGGPG---------------GNQKI 209
V+ ++IDA G+VL++W + A+ GG G+QK
Sbjct: 185 VNVRFLTPVPGNWIYMIDAADGKVLNKWNQMDEAKPGGAQPVAGTSTVGVGRGVLGDQKY 244

Query: 210 GKYTYGSDYGPLIVNDRCEMDDGNVITVDMNSSTDDSKTTPFRFACPTNTYKQVNGAYSP 269
TY S YG + D + T D + T + + Q +Y
Sbjct: 245 INTTYSSYYGYYYLQDNTR--GSGIFTYDGRNRTVLPGSLW------ADGDNQFFASYDA 296

Query: 270 -LNDAHFFGGVVFKLYRDWFG---TSPLTHKLYMKVHYGRSVENAYWDGTAMLFGDG-AT 324
DAH++ GVV+ Y++ G + VHYGR NA+W+G+ M++GDG
Sbjct: 297 AAVDAHYYAGVVYDYYKNVHGRLSYDGSNAAIRSTVHYGRGYNNAFWNGSQMVYGDGDGQ 356

Query: 325 MFYPLV-SLDVAAHEVSHGFTEQNSGLIYRGQSGGMNEAFSDMAGEAAEFYMRGKNDFLI 383
F P +DV HE++H T+ +GL+Y+ +SG +NEA SD+ G EFY D+ I
Sbjct: 357 TFLPFSGGIDVVGHELTHAVTDYTAGLVYQNESGAINEAMSDIFGTLVEFYANRNPDWEI 416

Query: 384 GYDIKK---GSGALRYMDQPSRDGRSIDNASQYYNGID----VHHSSGVYNRAFYLLANS 436
G DI ALR M P++ G D+ S+ Y G VH +SG+ N+A YLL+
Sbjct: 417 GEDIYTPGVAGDALRSMSDPAKYGDP-DHYSKRYTGTQDNGGVHTNSGIINKAAYLLSQG 475

Query: 437 --------PGWDTRKAFEVFVDANRYYWTATSNYNSGACGVIRSAQNRNYS----AADVT 484
G K ++F A YY T TSN++ +++A + S V
Sbjct: 476 GVHYGVSVTGIGRDKMGKIFYRALVYYLTPTSNFSQLRAACVQAAADLYGSTSQEVNSVK 535

Query: 485 RAFSTVGV 492
+AF+ VGV
Sbjct: 536 QAFNAVGV 543


121  HWH78_RS20515  HWH78_RS20550  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS20515  -1      15       1.100392   nitrate/nitrite transporter NarK2
HWH78_RS20520  0       15       1.382444   NarK/NasA family nitrate transporter
HWH78_RS20525  0       14       2.700109   two-component system sensor histidine kinase
HWH78_RS20530  0       15       3.366416   two-component system response regulator NarL
HWH78_RS20535  1       14       3.885677   anaerobic/virulence modulator AnvM
HWH78_RS20540  0       15       3.994383   hypothetical protein
HWH78_RS20545  1       14       3.882528   class I SAM-dependent methyltransferase
HWH78_RS20550  1       10       2.987151   SDR family oxidoreductase
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS20515  TCRTETA       30     0.019    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 30.2 bits (68), Expect = 0.019
Identities = 28/128 (21%), Positives = 47/128 (36%), Gaps = 11/128 (8%)

Query: 52 AVWMIWSTVTVRLNSAGFAFSNDQLFLLAALPSISGATLRVFYSFMVPIFGGRRWTALST 111
A+W+I+ ++ S L L S++ A + + G RR L
Sbjct: 231 ALWVIFGEDRFHWDATTIGIS---LAAFGILHSLAQA---MITGPVAARLGERRALMLGM 284

Query: 112 ASMLIPCIWLGFAVQDPSTPYWVFALIALLCGFGGGNFASSMSNISFFYPKSQQGTALGL 171
+ I L FA + W+ I +L GG + + +S + +QG G
Sbjct: 285 IADGTGYILLAFATR-----GWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGS 339

Query: 172 NAGLGNLG 179
A L +L
Sbjct: 340 LAALTSLT 347


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS20520  TCRTETA       39     2e-05    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 39.4 bits (92), Expect = 2e-05
Identities = 60/350 (17%), Positives = 114/350 (32%), Gaps = 30/350 (8%)

Query: 39 ELGLSESQ---FGLMVALPILTGSLVRLPLGLITDRFGGRIVFFIHMLLVAIPIYGLAFA 95
+L S +G+++AL L LG ++DRFG R V + + A+ +A A
Sbjct: 34 DLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATA 93

Query: 96 SQYWHYLVLGLFVGLAGGSFAVGIAYTSAWFEKERQGTAMGIFGAGNAGAAITNLVAPMI 155
W + + G+ G + AV AY + + + + G A + V +
Sbjct: 94 PFLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGL 153

Query: 156 VVAFGWRMVPQVYSVAMLVTAVLFWLFTWTDPAHLKGAAEASQRPSLAKQLAPLAELRVW 215
+ F P + A+ L F + + + + + V
Sbjct: 154 MGGFSPHA-PFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVV 212

Query: 216 RFGLYYFFVFG--GFVALALWLPKYYIAEYGLDLKTASFITMLFTLPSGLIRA-LGGWFS 272
+ FF+ G V ALW+ + + D T F + L +A + G +
Sbjct: 213 AALMAVFFIMQLVGQVPAALWV-IFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVA 271

Query: 273 DHYGARS-VNWGVFWVCLVCLFFLSYPQTTMTIHGIQGDLSLGIGLNVWLFTFLVFVVGI 331
G R + G+ + + W+ F + V+
Sbjct: 272 ARLGERRALMLGMIADGTGYILLAFATR-------------------GWMA-FPIMVLLA 311

Query: 332 AQGFGKASVYRIIHDYYPSN-MGTVGGMVGVIGGLGGFCLPILFGYAADH 380
+ G G ++ ++ G + G + + L P+LF
Sbjct: 312 SGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAA 361


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS20525  PF06580       43     2e-06    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 43.3 bits (102), Expect = 2e-06
Identities = 26/111 (23%), Positives = 48/111 (43%), Gaps = 12/111 (10%)

Query: 495 FGERGEVTIELDNRLQHVPLSPNEEIHVLQIVREALSNVVRHSQAQR---AWVRLSSQAD 551
F +R + +++ + V + P ++Q + E N ++H AQ + L D
Sbjct: 236 FEDRLQFENQINPAIMDVQVPP----MLVQTLVE---NGIKHGIAQLPQGGKILLKGTKD 288

Query: 552 -GQVSIAVEDDGVGFDPQQNRSGHYGLTIMQERGQTL-GSQLRFEARAPHG 600
G V++ VE+ G S GL ++ER Q L G++ + + G
Sbjct: 289 NGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQG 339


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS20530  HTHFIS        80     1e-19    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.9 bits (197), Expect = 1e-19
Identities = 44/197 (22%), Positives = 76/197 (38%), Gaps = 18/197 (9%)

Query: 13 RLLLVDDHPMMRKGVAQLLELEDDLSVVGEAGSGEEALRLAAELDPDMILLDLNMKGMNG 72
+L+ DD +R + Q L V + R A D D+++ D+ M N
Sbjct: 5 TILVADDDAAIRTVLNQALSRAG-YDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 73 LDTLRALREAGVDARIVVFTVSDDKGDVVNVLRAGADGYLLKDMEPERLLEHIRQAATGQ 132
D L +++A D ++V + + + GA YL K + L+ I +A
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA-- 120

Query: 133 MTLSPQLTQILAQALRGDD---RSKSLDELTERERQILRQIAHGYSNKMIARKLDITE-G 188
+ +++ + G RS ++ E+ ++L ++ MI E G
Sbjct: 121 -EPKRRPSKLEDDSQDGMPLVGRSAAMQEI----YRVLARLMQTDLTLMI-----TGESG 170

Query: 189 TVKVHVKRVLHKLGMRS 205
T K V R LH G R
Sbjct: 171 TGKELVARALHDYGKRR 187


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS20550  DHBDHDRGNASE  85     1e-21    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 85.1 bits (210), Expect = 1e-21
Identities = 53/180 (29%), Positives = 81/180 (45%), Gaps = 7/180 (3%)

Query: 5 VAFVTGCSSGIGRALADAFQRAGYRVWA----SARKEDDVRALAEAGFQAVQ--LDVNDA 58
+AF+TG + GIG A+A G + A + E V +L A DV D+
Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDS 69

Query: 59 AALARLAEELGVEAAGLDVLVNNAGYGAMGPLLDGGVDAMRRQFETNVFAVVGVTRALFP 118
AA+ + + E +D+LVN AG G + + F N V +R++
Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSK 129

Query: 119 -LLRRKSGLVVNVGSVSGVLVTPFAGAYCASKAAVHALSDALRLELAPFGVEVLEVQPGA 177
++ R+SG +V VGS + AY +SKAA + L LELA + + V PG+
Sbjct: 130 YMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGS 189


122  HWH78_RS20875  HWH78_RS20910  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS20875  0       11       -0.975380  transporter substrate-binding domain-containing
HWH78_RS20880  -1      11       0.091081   two-component system response regulator cyclic
HWH78_RS20885  -1      10       1.516696   response regulator transcription factor
HWH78_RS20890  -1      8        1.732992   TIGR03862 family flavoprotein
HWH78_RS20895  -1      7        1.044500   DEAD/DEAH box helicase
HWH78_RS20900  -1      8        0.861551   NYN domain-containing protein
HWH78_RS20905  0       9        1.487943   DUF2076 domain-containing protein
HWH78_RS20910  -2      8        1.311791   cysteine hydrolase
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS20875  HTHFIS        64     2e-12    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 64.1 bits (156), Expect = 2e-12
Identities = 31/112 (27%), Positives = 49/112 (43%), Gaps = 5/112 (4%)

Query: 957 RLQVLVVDDHAVNRQILHQQLSFLGHDVEEAENGLSALNLWHGQPFDMVITDCHMPLMSG 1016
+LV DD A R +L+Q LS G+DV N + D+V+TD MP +
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 1017 SDLARSIRQEERENGEEPVVIIGLTADAQPEEIERCIQAGMNECLIKPIGLD 1068
DL I+ + + PV+++ +A + + G + L KP L
Sbjct: 63 FDLLPRIK---KARPDLPVLVM--SAQNTFMTAIKASEKGAYDYLPKPFDLT 109


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS20880  HTHFIS        53     1e-09    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 52.9 bits (127), Expect = 1e-09
Identities = 27/140 (19%), Positives = 51/140 (36%), Gaps = 9/140 (6%)

Query: 1 MNDLNVLVLEDEPFQRLVAVTALKKVVPGSILEAADGKEAVAILESCGHVDIAICDLQMS 60
M +LV +D+ R V AL + + ++ + + G D+ + D+ M
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRA-GYDVRITSNAATLWRWIAA-GDGDLVVTDVVMP 58

Query: 61 GMDGLAFLRHASLSGKVHSVILSSEVDPILRQATI-SMIECLGLNFLGDLGKPFSLERIT 119
+ L + V++ S Q T + I+ L KPF L +
Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSA------QNTFMTAIKASEKGAYDYLPKPFDLTELI 112

Query: 120 ALLTRYNARRQDLPRQIEVA 139
++ R A + P ++E
Sbjct: 113 GIIGRALAEPKRRPSKLEDD 132


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS20885  HTHFIS        72     6e-17    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 71.8 bits (176), Expect = 6e-17
Identities = 29/111 (26%), Positives = 51/111 (45%), Gaps = 1/111 (0%)

Query: 3 TVLIVDDHPVIRLAVRVLLEKHGLQVVAETDNGVDAIQLVREHEPDVVILDIGIPKLDGL 62
T+L+ DD IR + L + G V T N + + + D+V+ D+ +P +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 63 TVISRIKSLELRSQVLVLTSQSAEAFCKRCIQVGARGFVNKEEDLNNLINA 113
++ RIK VLV+++Q+ + + GA ++ K DL LI
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGI 114


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS20895  TONBPROTEIN   29     0.039    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 28.8 bits (64), Expect = 0.039
Identities = 23/104 (22%), Positives = 34/104 (32%), Gaps = 16/104 (15%)

Query: 352 EVELLAAIETLIGQTLQRREEPDFAPEHRVPQTA----PGGVVLKKPKKPKKPKAAESAG 407
V ++ + Q +Q EP PE VV++KPK KPK
Sbjct: 46 SVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKK 105

Query: 408 ---------KPGKIHLGSWFDSSAP---TVKAVRKAPGFGAGAA 439
KP + S F+++AP T A +
Sbjct: 106 VQEQPKRDVKPVESRPASPFENTAPARLTSSTATAATSKPVTSV 149


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS20910  ISCHRISMTASE  46     2e-08    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 45.8 bits (108), Expect = 2e-08
Identities = 47/198 (23%), Positives = 68/198 (34%), Gaps = 33/198 (16%)

Query: 11 SQVALLIVDLQRGMQRHDLPPRNNPGAE--ARIVELLAAWRAAGWPVVHVRHVSRQPGSP 68
++ LLI D+Q +P E A I +L G PVV+ + QPGS
Sbjct: 29 NRAVLLIHDMQNYFVDA-FTAGASPVTELSANIRKLKNQCVQLGIPVVY----TAQPGSQ 83

Query: 69 -----------FAPGQPG----VEFQPALAPRDDEAVFEKNVPDAFINSGLQRWLHVRDI 113
+ PG + LAP DD+ V K AF + L +
Sbjct: 84 NPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLVLTKWRYSAFKRTNLLEMMRKEGR 143

Query: 114 RQVALVGVATENSVEASARSAGNLGFQTWVVADACFTFAKPDFHGTPRSADEVHAMALAN 173
Q+ + G+ +A A + + V DA F+ H MAL
Sbjct: 144 DQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVADFSLEK-----------HQMALEY 192

Query: 174 LHGEYAVVLRAAELLQRL 191
G A + LL +L
Sbjct: 193 AAGRCAFTVMTDSLLDQL 210


123  HWH78_RS21025  HWH78_RS21050  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS21025  0       12       2.560178   SEC-C domain-containing protein
HWH78_RS21030  1       13       2.560847   AMP nucleosidase
HWH78_RS21035  0       11       2.946983   PaaI family thioesterase
HWH78_RS21040  -1      12       2.322828   acyl-CoA dehydrogenase family protein
HWH78_RS21045  -1      11       2.064713   TetR/AcrR family transcriptional regulator
HWH78_RS21050  -2      12       1.204685   hybrid sensor histidine kinase/response
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS21025  SECA          41     1e-06    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 41.0 bits (96), Expect = 1e-06
Identities = 14/22 (63%), Positives = 16/22 (72%), Gaps = 1/22 (4%)

Query: 162 GRGDQACPCGSGKRYRNCCSRL 183
GR D CPCGSGK+Y+ C RL
Sbjct: 880 GRND-PCPCGSGKKYKQCHGRL 900


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS21030  MYCMG045      32     0.007    Hypothetical mycoplasma lipoprotein (MG045) signature.
		>MYCMG045#Hypothetical mycoplasma lipoprotein (MG045) signature.

Length = 483

Score = 31.6 bits (71), Expect = 0.007
Identities = 31/124 (25%), Positives = 50/124 (40%), Gaps = 19/124 (15%)

Query: 130 QDIPYPYVVEQGDELAGSGVTAAELARVFPSTDLSAASDDIADGLYEWERADQLPLALFD 189
Q++ + Y E+ EL V+ ++ + + +R + L D
Sbjct: 149 QNLVFVYRGEKISELEQENVSWTDVIKAI---------------VKHKDRFNDNRLVFID 193

Query: 190 AARVDFSLRRLVHYTGSDWRHVQPWILLTNYHRYV-DQFIRLGLTRLREDPRFVRMVLPG 248
AR FSL +V+ T ++ V P Y V + F RLGLT+ D FV
Sbjct: 194 DARTIFSLANIVN-TNNNSADVNPKEDGIGYFTNVYESFQRLGLTKSNLDSIFVNS--DS 250

Query: 249 NVII 252
N++I
Sbjct: 251 NIVI 254


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS21045  HTHTETR       61     1e-13    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 60.8 bits (147), Expect = 1e-13
Identities = 33/170 (19%), Positives = 64/170 (37%), Gaps = 8/170 (4%)

Query: 11 QRDSALRERILQLGLRRVAEGGFAALTMQALADDAGIATGSLYRHFRGKGELAAEIFRRA 70
Q R+ IL + LR ++ G ++ ++ +A AG+ G++Y HF+ K +L +EI+ +
Sbjct: 7 QEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELS 66

Query: 71 SQREVDALAVVL-RGPGAPAWRLAEGLRRF--AARAWSSQRLAFALI-----AEPVDPEV 122
+ + PG P L E L + +RL +I V
Sbjct: 67 ESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVV 126

Query: 123 DEQRLRYREAYAALFVELLEEGRRSGAFQLSLVPLAAACLVGAIAEALVG 172
+ + + L+ + L+ AA ++ L+
Sbjct: 127 QQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLME 176


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS21050  HTHFIS        73     2e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 72.9 bits (179), Expect = 2e-15
Identities = 32/114 (28%), Positives = 50/114 (43%), Gaps = 2/114 (1%)

Query: 669 TVLVVEDNAINQLVTRGMLLKLGYRVRTADNGSEALELLARERPDGVLLDCQMPVMDGFA 728
T+LV +D+A + V L + GY VR N + +A D V+ D MP + F
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 729 TCRAIRALPGCAELPVLALTAHSHSGDRERCLAAGMSDYMAKPVKFEELQTLLH 782
I+ +LPVL ++A + + G DY+ KP EL ++
Sbjct: 65 LLPRIKKAR--PDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


124  HWH78_RS21565  HWH78_RS21595  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS21565  -1      13       0.846241   non-ribosomal peptide synthetase
HWH78_RS21570  0       17       0.425484   SDR family oxidoreductase
HWH78_RS21575  1       19       -0.498897  response regulator transcription factor
HWH78_RS21580  1       18       -0.128982  type 1 fimbrial protein
HWH78_RS21585  1       15       0.125501   filamentous hemagglutinin N-terminal
HWH78_RS21590  1       11       0.379615   fimbria/pilus periplasmic chaperone
HWH78_RS21595  1       10       0.467874   fimbrial biogenesis outer membrane usher protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS21565  NUCEPIMERASE  46     4e-07    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 45.9 bits (109), Expect = 4e-07
Identities = 70/340 (20%), Positives = 123/340 (36%), Gaps = 72/340 (21%)

Query: 621 ILLTGASGLMGAHLLAELLASREADLHCPVRAQNDAH--ALERLRQAARQHRIELAESDW 678
L+TGA+G +G H+ LL + ND + +L++ R LA+ +
Sbjct: 3 YLVTGAAGFIGFHVSKRLL--EAGHQVVGIDNLNDYYDVSLKQARLEL------LAQPGF 54

Query: 679 RRVRAYAADLAEPGFGLPAETYRELAGSVDQVFHSA--SAVNF-IQ-PYSYMKRDNVEGL 734
+ + AD + + G ++VF S AV + ++ P++Y N+ G
Sbjct: 55 QFHKIDLADRE-----GMTDLFAS--GHFERVFISPHRLAVRYSLENPHAYADS-NLTGF 106

Query: 735 GQVLRFCASGRCKPLMLLSSISVYSWGHLHTGKRLMREDDDIDQNLPAVVTDMGYVRSKW 794
+L C + + L+ SS SVY K DD +D P + Y +K
Sbjct: 107 LNILEGCRHNKIQHLLYASSSSVYGLNR----KMPFSTDDSVDH--PVSL----YAATKK 156

Query: 795 VMEKIADLAAE-RGLPLMTFRLGYATCHSRTGAHADYQWWSR-------LARTCLEYRAV 846
E +A + GLP R + T Y W R + LE +++
Sbjct: 157 ANELMAHTYSHLYGLPATGLR--FFTV---------YGPWGRPDMALFKFTKAMLEGKSI 205

Query: 847 PLLR--ELREGLTTVDYMVEAISVIARQP-----------------SALGKKFNLVPSIP 887
+ +++ T +D + EAI + A + +N+ S P
Sbjct: 206 DVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSP 265

Query: 888 RCLT--LDEFFDRLGRRAGRPLRQMPFDDWVSLWEDNRDA 925
L + D LG A + + + D + D +
Sbjct: 266 VELMDYIQALEDALGIEAKKNMLPLQPGDVLETSADTKAL 305


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS21570  DHBDHDRGNASE  64     2e-14    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 63.9 bits (155), Expect = 2e-14
Identities = 42/190 (22%), Positives = 72/190 (37%), Gaps = 9/190 (4%)

Query: 3 NVLIVGASRGIGLGLADAFLQRGAQVFAVARRPQGSPGLQALAERAGERLQAVTGDLNQR 62
I GA++GIG +A +GA + AV P+ + + + +A D+
Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDS 69

Query: 63 DCAERIGEALGER--RIDRLIVNAGIYGPQQQDVAEIDAEQTAQLFLTNAIAPLRLARAL 120
+ I + ID L+ AG+ P + E+ F N+ +R++
Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGVLRPGLIH--SLSDEEWEATFSVNSTGVFNASRSV 127

Query: 121 SG--RVSRGGVVAFMSSQMASLALGLSATMPLYGASKAALNSLVRSWEGEFEELPFSLLL 178
S R G + + S A + +M Y +SKAA + E E +
Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVP---RTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184

Query: 179 LHPGWVRTEM 188
+ PG T+M
Sbjct: 185 VSPGSTETDM 194


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS21575  HTHFIS        56     1e-11    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 56.4 bits (136), Expect = 1e-11
Identities = 30/126 (23%), Positives = 54/126 (42%), Gaps = 5/126 (3%)

Query: 5 RIRVMVADDHPAISLGISYELSQCGSLEMLGQVSNSTELIGRLNEGDCDVVIVDYTMPGG 64
++VADD AI ++ LS+ G + SN+ L + GD D+V+ D MP
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAG--YDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 65 KYGDGLALLSLLRRRYPHLQLVVFTMLNNPGLIRAILKQGINCILSKSDSTSHLLAAVSA 124
+ LL +++ P L ++V + N ++G L K + L+ +
Sbjct: 61 ---NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117

Query: 125 AYSRNQ 130
A + +
Sbjct: 118 ALAEPK 123


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS21585  PF05860       79     8e-20    haemagglutination activity domain.
		>PF05860#haemagglutination activity domain.

Length = 117

Score = 79.5 bits (196), Expect = 8e-20
Identities = 31/108 (28%), Positives = 49/108 (45%), Gaps = 9/108 (8%)

Query: 54 LPSGGTVVGGSANGEIHLSGGNSLSVNQKVDKLIANWDSFSVAAGERVIFNQPSSSSIAL 113
LP + I Q L ++ FSV FN P++ +
Sbjct: 9 LPINSNITTEGNTRII-------ERGTQAGSNLFHSFQEFSVPTSGTAFFNNPTNIQNII 61

Query: 114 NRVIGTKASDIQGRIDANG--QVFLVNPNGVLFGRGAQVNVGGLVAST 159
+RV G S+I G I AN +FL+NPNG++FG+ A++++GG +
Sbjct: 62 SRVTGGSVSNIDGLIRANATANLFLINPNGIIFGQNARLDIGGSFVGS 109


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS21595  PF00577       741    0.0      Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 741 bits (1914), Expect = 0.0
Identities = 268/868 (30%), Positives = 412/868 (47%), Gaps = 55/868 (6%)

Query: 1 MAVASPAGGLDAPSRRIVFDAQMLALGPGGRSIDTSRFERGDVIEPGRYRLDLLLNSRWR 60
+ A S + F+ + LA D SRFE G + PG YR+D+ LN+ +
Sbjct: 31 FVACAFAAQAPLSSAELYFNPRFLA-DDPQAVADLSRFENGQELPPGTYRVDIYLNNGYM 89

Query: 61 GVEEVELRRQPGRESAVFCYDRGLLERAGIDLEKSARGQDRSSARDPLPEGLHCDPLERY 120
+V + V C R L G++ S + L C PL
Sbjct: 90 ATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTA--------SVSGMNLLADDACVPLTSM 141

Query: 121 VPGARVKLDIAEQSIYVSVPSYYLSLDSSKTYVDPASWDSGISAALLNYNSNL-HVRENH 179
+ A +LD+ +Q + +++P ++S + ++ Y+ P WD GI+A LLNYN + V+
Sbjct: 142 IHDATAQLDVGQQRLNLTIPQAFMS-NRARGYIPPELWDPGINAGLLNYNFSGNSVQNRI 200

Query: 180 GRSATSGYAGMNAGFNFGRARLRHNGTATWSRRMGS-----HYQRSATYVQTDLPAWRAQ 234
G ++ Y + +G N G RLR N T +++ S +Q T+++ D+ R++
Sbjct: 201 GGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSR 260

Query: 235 LLLGENSTSSEFFDAVSFRGVQLSSDDRMLPDSLRYYAPVVRGTASTNARVSVYQRGYLI 294
L LG+ T + FD ++FRG QL+SDD MLPDS R +APV+ G A A+V++ Q GY I
Sbjct: 261 LTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDI 320

Query: 295 YETTVAPGAFALDELQTASYGGDLEVRVTEASGEVRSFIVPFATTVQLLRPGTTRYSLTA 354
Y +TV PG F ++++ A GDL+V + EA G + F VP+++ L R G TRYS+TA
Sbjct: 321 YNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITA 380

Query: 355 GRL-NDPSLERRPNMLQGVYQRGLGNDVTAYAGGAFTGSYMSGLMGAALNT-PVGGFSGD 412
G + + + +P Q GL T Y G Y + G N +G S D
Sbjct: 381 GEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVD 440

Query: 413 VTLARTEVPGDDRLSGSSYRLAYSKNLPNTGTNFSLLAYRYSTGGYLGLRDAAFMQDRVE 472
+T A + +P D + G S R Y+K+L +GTN L+ YRYST GY D + +
Sbjct: 441 MTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGY 500

Query: 473 RGEPLE--------------SFSRLRNRLDANISQQLGNGGNLYLNGSSQRYWSGGGRAV 518
E + R +L ++QQLG LYL+GS Q YW
Sbjct: 501 NIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSNVDE 560

Query: 519 NFSVGYSNQWRDVSYSISAQRLRSHYEGFSSGDKRGETSTLFSLNLSIPLGG-------A 571
F G + + D+++++S ++ + + + +LN++IP +
Sbjct: 561 QFQAGLNTAFEDINWTLSYSLTKNAW--------QKGRDQMLALNVNIPFSHWLRSDSKS 612

Query: 572 GRGSPTLSSYLTRDSNSGTQLTSGVSGMLGKRGEASYSLSASHDRDSRQTSKS---ASLD 628
+ S ++ D N +GV G L + SYS+ + S S A+L+
Sbjct: 613 QWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLN 672

Query: 629 YRLPQVELGSSLSQGPGYRQLSVKAAGGLVAHSGGITAAQTLGETIGLVHAPNARGAA-A 687
YR S +QL +GG++AH+ G+T Q L +T+ LV AP A+ A
Sbjct: 673 YRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVE 732

Query: 688 GYSGSRIDRHGYAVIPNLLPYQLNSVDLDPNGMADEIELRSSSRNVAPTAGAVVRLDYPT 747
+G R D GYAV+P Y+ N V LD N +AD ++L ++ NV PT GA+VR ++
Sbjct: 733 NQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKA 792

Query: 748 RVARPLLVDSRMPSGEPLPFAAEVLDAHSGQSVGAVGQGSRLVLRVEQDRGSVRVRWGNE 807
RV LL+ + +PLPF A V S QS G V ++ L G V+V+WG E
Sbjct: 793 RVGIKLLMTLT-HNNKPLPFGAMVTSE-SSQSSGIVADNGQVYLSGMPLAGKVQVKWGEE 850

Query: 808 PQQQCLVDYALGPRETTPPVLQLA--CR 833
C+ +Y L P + QL+ CR
Sbjct: 851 ENAHCVANYQLPPESQQQLLTQLSAECR 878


125  HWH78_RS21980  HWH78_RS22005  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS21980  0       9        4.438009   Fe2+-enterobactin ABC transporter
HWH78_RS21985  1       10       4.157491   iron ABC transporter permease
HWH78_RS21990  0       11       2.894719   iron chelate uptake ABC transporter family
HWH78_RS21995  -1      11       2.098274   SDR family oxidoreductase
HWH78_RS22000  -1      12       1.640799   LysR family transcriptional regulator
HWH78_RS22005  1       10       2.823864   MFS transporter
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS21980  FERRIBNDNGPP  38     4e-05    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 37.6 bits (87), Expect = 4e-05
Identities = 53/289 (18%), Positives = 97/289 (33%), Gaps = 28/289 (9%)

Query: 2 PTRRRSALPLLALALSLFA-TLAAAGEPKPARIVSTTPSVTGILLAMDAPLVASAATTPS 60
RR L +AL+ L+ A A P RIV+ +LLA+ A T
Sbjct: 6 LISRRRLLTAMALSPLLWQMNTAHAAAIDPNRIVALEWLPVELLLALGIVPYGVADTINY 65

Query: 61 RLTDAKGFFSQWAKVADQRGVEVLYRNLRFD--IEAVIAQDPDLLVASA---TGADSAAP 115
RL + S+ V+ LR + +E + P +V SA + A
Sbjct: 66 RL-----WVSEPPLPDS-----VIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSPEMLAR 115

Query: 116 Y-RAELEAQGVPTLVVDYSKHSWQELATELGRHTGLERQAQAAIQRFDAYTAEVAA-AIA 173
+ ++ S E+A L + A+ + +++ + + +
Sbjct: 116 IAPGRGFNFSDGKQPLAMARKSLTEMADLLNL----QSAAETHLAQYEDFIRSMKPRFVK 171

Query: 174 PPQGPVSVVGYNIAGNYSIGRQASPQARLLEALGFQVAELPEALAGKVTRASDFQFISRE 233
P+ + + + S +L+ G +P A G+ +S +
Sbjct: 172 RGARPLLLTTLIDPRHMLVFGPNSLFQEILDEYG-----IPNAWQGETNFWG-STAVSID 225

Query: 234 NLPAAIAGDSVFLLGASDDDVQAFLADPVLANLSAVREKRVYALGPSSF 282
L A D + + D+ A +A P+ + VR R + F
Sbjct: 226 RLAAYKDVDVLCFDHDNSKDMDALMATPLWQAMPFVRAGRFQRVPAVWF 274


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS21985  PF04335       30     0.011    VirB8 type IV secretion protein
		>PF04335#VirB8 type IV secretion protein

Length = 227

Score = 29.8 bits (67), Expect = 0.011
Identities = 13/43 (30%), Positives = 19/43 (44%), Gaps = 3/43 (6%)

Query: 7 RRRRLRAWGLLAGALLLALA---ALASLALGSRPVPLAVTLDA 46
R + AW + A LA A A+A+L P +T+D
Sbjct: 29 ERSKKLAWVVAGVAGALATAGVVAVAALTPLKTVEPYVITVDR 71


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS21995  DHBDHDRGNASE  119    6e-35    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 119 bits (299), Expect = 6e-35
Identities = 75/258 (29%), Positives = 117/258 (45%), Gaps = 32/258 (12%)

Query: 5 RTALVTGATRGIGLALARRLAASGWSVVGI-----------------ARHASDDFPGRLL 47
+ A +TGA +GIG A+AR LA+ G + + ARHA + FP
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHA-EAFP---- 63

Query: 48 CCDLADPAQTAETLRGLLSESA-VDALVNNAGIALPQSLENLDLAALQQVFDLNVRVAVQ 106
D+ D A E + E +D LVN AG+ P + +L + F +N
Sbjct: 64 -ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122

Query: 107 LAQACLPGLKRSPAGRIVNLCSRAIHGAR-ERTAYAAAKSALVGVTRTWALELAPLGITV 165
+++ + +G IV + S R AYA++K+A V T+ LELA I
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 166 NAVAPGPIETELFRQTRPVGGEEERRILST-------IPMQRLGRPDEVAALIEFLLSEG 218
N V+PG ET++ E+ I + IP+++L +P ++A + FL+S
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 219 ASFVTGQVIGVDGGGSLG 236
A +T + VDGG +LG
Sbjct: 243 AGHITMHNLCVDGGATLG 260


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS22005  TCRTETA       50     6e-09    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 50.2 bits (120), Expect = 6e-09
Identities = 90/396 (22%), Positives = 138/396 (34%), Gaps = 30/396 (7%)

Query: 14 RSTLSVAILAFSAFLIVTTEFLIVGLLPSLARDLQISISAA---GRLVTLFAFTVMLFGP 70
+ + ++ + L LI+ +LP L RDL S G L+ L+A P
Sbjct: 2 KPNRPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAP 61

Query: 71 PLTAVLAHLPRKPLFVAILLAFALSNGLAALSTDLRLLAVARFVPALMLPVFWGTASETA 130
L A+ R+P+ + L A+ + A + L +L + R V + + A
Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIA 121

Query: 131 AQLAGTERAGQAIARVYLGISCALLLGIPLGTLAANGIGWRGAFWILAGLSLLMAVALVL 190
G ERA + + ++ G LG L G F+ A L+ L +
Sbjct: 122 DITDGDERA-RHFGFMSACFGFGMVAGPVLGGLMG-GFSPHAPFFAAAALNGLNFLTGCF 179

Query: 191 FMPAVDRGERLDLRRQARIFGEPLFLANVALSVLVFSAMFVSYTYLADILERIAGI---- 246
+P +GER LRR+A A V A+F + + + I
Sbjct: 180 LLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGED 239

Query: 247 ----TPARIGWWLMGFGAV---------GLFGNWLGGRLVDRSPLRATALFLVLLALGMA 293
IG L FG + G LG R + A +LLA A
Sbjct: 240 RFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAF--A 297

Query: 294 ASVPLAGQPLLFCLALALWGVANTALYPVCQVRVMRSVQHAQALAATSNVAAANAGIGLG 353
+A P++ LA G+ AL + +V + Q S A + +G
Sbjct: 298 TRGWMA-FPIMVLLASG--GIGMPALQAMLSRQVD---EERQGQLQGSLAALTSLTSIVG 351

Query: 354 ALLGGETIATLGLERIGFVAAALAVLGLSLLPVVAR 389
LL A G+ A A L L LP + R
Sbjct: 352 PLLFTAIYAASITTWNGWAWIAGAALYLLCLPALRR 387


126  HWH78_RS22245  HWH78_RS22280  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS22245  -1      8        0.729456   multidrug efflux RND transporter periplasmic
HWH78_RS22250  -1      10       1.273848   multidrug efflux RND transporter permease MexI
HWH78_RS22255  0       10       1.648188   multidrug efflux transporter outer membrane
HWH78_RS22260  0       11       1.035330   phenazine-1-carboxylate N-methyltransferase
HWH78_RS22265  -1      13       1.789086   phenazine biosynthesis protein PhzA
HWH78_RS22270  -1      14       2.792853   phenazine biosynthesis protein PhzB
HWH78_RS22275  1       16       4.082846   phenazine biosynthesis protein PhzC
HWH78_RS22280  0       14       3.724271   phenazine biosynthesis protein PhzD
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS22245  RTXTOXIND     46     1e-07    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 46.4 bits (110), Expect = 1e-07
Identities = 18/106 (16%), Positives = 43/106 (40%), Gaps = 2/106 (1%)

Query: 65 AGRQVQVAAEAAGRITRIAFESGQQVQQGQLLVQLNDAVEQAELIRLKAQLRNAEILHAR 124
+GR ++ + I + G+ V++G +L++L +A+ ++ ++ L A + +
Sbjct: 93 SGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARL--EQ 150

Query: 125 ARKLVERNVASQEQLDNAVAARDMALGAVRQTQALIDQKAIRAPFS 170
R + +L + V + + L I+ FS
Sbjct: 151 TRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFS 196



Score = 40.2 bits (94), Expect = 9e-06
Identities = 25/134 (18%), Positives = 60/134 (44%), Gaps = 6/134 (4%)

Query: 102 AVEQAELIRLKAQLRNAEILHARARKLVERNVASQ-EQLDNAVAARDMALGAVRQTQALI 160
V +++L ++++++ +A+ + +L + + + Q + + + L + Q
Sbjct: 269 RVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQAS 328

Query: 161 DQKAIRAPFSGQLGIRRVH-LGQYLGVAEPVASLV-DARTLKSNFSLDESTSPELKLGQP 218
IRAP S ++ +VH G + AE + +V + TL+ + + +GQ
Sbjct: 329 V---IRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQN 385

Query: 219 LEVLVDAYPGRSFP 232
+ V+A+P +
Sbjct: 386 AIIKVEAFPYTRYG 399


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS22250  ACRIFLAVINRP  802    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 802 bits (2072), Expect = 0.0
Identities = 316/1029 (30%), Positives = 530/1029 (51%), Gaps = 29/1029 (2%)

Query: 5 DLFVRRPVLALVVSTLILLLGLFSLGKLPIRQYPLLESSTITVTTEYPGASADLMQGFVT 64
+ F+RRP+ A V++ ++++ G ++ +LP+ QYP + ++V+ YPGA A +Q VT
Sbjct: 3 NFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVT 62

Query: 65 QPIAQAVSSVEGIDYLSSTSVQ-GRSVVTIRMLLNRDSTQAMTETMAKVNSVRYKLPERA 123
Q I Q ++ ++ + Y+SSTS G +T+ D A + K+ LP+
Sbjct: 63 QVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEV 122

Query: 124 YDSVIERSSGETTAVAYVGFSS--KTLPIPALTDYLSRVVEPMFSSIDGVAKVQTFGGQR 181
I ++ + GF S ++DY++ V+ S ++GV VQ FG Q
Sbjct: 123 QQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY 182

Query: 182 LAMRLWLDADRLAGRGLTASDVAEAIRRNNYQAAPG------MVKGQYVLSNVRVNTDLT 235
AMR+WLDAD L LT DV ++ N Q A G + GQ + +++ T
Sbjct: 183 -AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241

Query: 236 NVDDFREMVIRNDGNG-LVRLRDVGTVELGAAATETSALMDGDPAVHLGLFPTPTGNPLV 294
N ++F ++ +R + +G +VRL+DV VELG A ++G PA LG+ N L
Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301

Query: 295 IVDGIRKLLPEIQKTLPPDVRVDLAYETSRFIQASIDEVVRTLVEALLIVVLVIYLCLGS 354
I+ L E+Q P ++V Y+T+ F+Q SI EVV+TL EA+++V LV+YL L +
Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361

Query: 355 LRSVLIPVATIPLSMLGAAALMLAFGFSVNLLTLLAMVLAIGLVVDDAIVVVENVHRHIE 414
+R+ LIP +P+ +LG A++ AFG+S+N LT+ MVLAIGL+VDDAIVVVENV R +
Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421

Query: 415 EGKS-PVAAALIGAREVAGPVIAMTITLAAVYTPIGLMGGLTGALFREFALTLAGAVIVS 473
E K P A ++ G ++ + + L+AV+ P+ GG TGA++R+F++T+ A+ +S
Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481

Query: 474 GVVALTLSPVMSSLLLQA-----HQNEGRMGRAAEWFFGGLTRRYGQVLEFSLGHRWLTG 528
+VAL L+P + + LL+ H+N+G F Y + LG
Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541

Query: 529 GLALLVCISLPLLYSMPKRELAPTEDQAAVLTAIKAPQHANLDYVELFARKLDQVYTSIP 588
+ L+ + +L+ P EDQ LT I+ P A + + ++ Y
Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601

Query: 589 E------TVSTWIINGTDGPAASFGGINLAAWEKRERD---ASAIQSELQGKVGDVEGSS 639
+ A ++L WE+R D A A+ + ++G +
Sbjct: 602 KANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGF 661

Query: 640 IFAFQLAA--LPGSTGGLPVQMVLRSPQDYPVLYRTMEEIKQKARQSGLFVV-VDSDLDY 696
+ F + A G+ G +++ ++ + L + ++ A Q +V V +
Sbjct: 662 VIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGLE 721

Query: 697 NNPVVQVRIDRAKANSLGIRMQDIGESLAVLVGENYVNRFGMEGRSYDVIPQSLRDQRFT 756
+ ++ +D+ KA +LG+ + DI ++++ +G YVN F GR + Q+ R
Sbjct: 722 DTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRML 781

Query: 757 PQALARQFVRTQDGNLVPLSTVVRVALQVEPNKLIQFDQQNAATLQAIPAPGVSMGQAVA 816
P+ + + +VR+ +G +VP S +L +++ + +Q APG S G A+A
Sbjct: 782 PEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAMA 841

Query: 817 FLDDVARGLPAGFSHDWQSDSRQYTQEGNTLVFAFLAALVVIYLVLAAQYESLADPLIIL 876
++++A LPAG +DW S Q GN + VV++L LAA YES + P+ ++
Sbjct: 842 LMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVM 901

Query: 877 ITVPLSICGALLPLALGYATMNIYTQIGLVTLIGLISKHGILMVEFANELQLHERLDRRA 936
+ VPL I G LL L ++Y +GL+T IGL +K+ IL+VEFA +L E
Sbjct: 902 LVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVE 961

Query: 937 AILRAAQIRLRPVLMTTAAMVFGLVPLLFASGAGAASRFGLGVVIVSGMLVGTLFTLFVL 996
A L A ++RLRP+LMT+ A + G++PL ++GAG+ ++ +G+ ++ GM+ TL +F +
Sbjct: 962 ATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFV 1021

Query: 997 PTVYTLLAR 1005
P + ++ R
Sbjct: 1022 PVFFVVIRR 1030



Score = 92.6 bits (230), Expect = 4e-21
Identities = 69/327 (21%), Positives = 135/327 (41%), Gaps = 13/327 (3%)

Query: 701 VQVRIDRAKANSLGIRMQDIGESLAV----LVGENYVNRFGMEGRSYDVIPQSLRDQRFT 756
+++ +D N + D+ L V + + G+ + + + +
Sbjct: 184 MRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIA-QTRFKN 242

Query: 757 PQALARQFVRT-QDGNLVPLSTVVRVALQVEP-NKLIQFDQQNAATLQAIPAPGVSMGQA 814
P+ + +R DG++V L V RV L E N + + + + AA L A G +
Sbjct: 243 PEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALDT 302

Query: 815 V----AFLDDVARGLPAGFSHDWQSDSRQYTQEG-NTLVFAFLAALVVIYLVLAAQYESL 869
A L ++ P G + D+ + Q + +V A+++++LV+ +++
Sbjct: 303 AKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNM 362

Query: 870 ADPLIILITVPLSICGALLPLALGYATMNIYTQIGLVTLIGLISKHGILMVEFANELQLH 929
LI I VP+ + G LA ++N T G+V IGL+ I++VE + +
Sbjct: 363 RATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMME 422

Query: 930 ERLDRRAAILRAAQIRLRPVLMTTAAMVFGLVPLLFASGAGAASRFGLGVVIVSGMLVGT 989
++L + A ++ ++ + +P+ F G+ A + IVS M +
Sbjct: 423 DKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSV 482

Query: 990 LFTLFVLPTV-YTLLARNHAEVDKSPR 1015
L L + P + TLL AE ++
Sbjct: 483 LVALILTPALCATLLKPVSAEHHENKG 509


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS22255  RTXTOXIND     29     0.032    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 29.4 bits (66), Expect = 0.032
Identities = 18/120 (15%), Positives = 35/120 (29%), Gaps = 4/120 (3%)

Query: 334 LGSASRAFEL--APSVSWPAF-RLGNVRARLRAVEAQ-SDAALARYQRSLLLAQEDVGNA 389
SR+ EL P + P NV + +Q + ++
Sbjct: 153 YQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKK 212

Query: 390 LNQLAEHQRRLVALFQSATHGANALEIANERYRAGAGSYLAVLENQRALYQIREELAQAE 449
+ R+ + + L+ + A + AVLE + + EL +
Sbjct: 213 RAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYK 272


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS22280  ISCHRISMTASE  351    e-125    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 351 bits (901), Expect = e-125
Identities = 102/207 (49%), Positives = 136/207 (65%), Gaps = 2/207 (0%)

Query: 3 GIPEITAYPLPTAQQLPANLARWSLEPRRAVLLVHDMQRYFLRPLPESLRA--GLVANAA 60
IP I Y +PTA +P N W +P RAVLL+HDMQ YF+ L AN
Sbjct: 2 AIPAIQPYQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANIR 61

Query: 61 RLRRWCVEQGVQIAYTAQPGSMTEEQRGLLKDFWGPGMRASPADREVVEELAPGPDDWLL 120
+L+ CV+ G+ + YTAQPGS + R LL DFWGPG+ + P + +++ ELAP DD +L
Sbjct: 62 KLKNQCVQLGIPVVYTAQPGSQNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLVL 121

Query: 121 TKWRYSAFFHSDLLQRMRAAGRDQLVLCGVYAHVGVLISTVDAYSNDIQPFLVADAIADF 180
TKWRYSAF ++LL+ MR GRDQL++ G+YAH+G L++ +A+ DI+ F V DA+ADF
Sbjct: 122 TKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVADF 181

Query: 181 SEAHHRMALEYAASRCAMVVTTDEVLE 207
S H+MALEYAA RCA V TD +L+
Sbjct: 182 SLEKHQMALEYAAGRCAFTVMTDSLLD 208


127  HWH78_RS22965  HWH78_RS22985  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS22965  -1      11       0.642158   polyamine ABC transporter ATP-binding protein
HWH78_RS22970  0       12       1.051756   polyamine ABC transporter substrate-binding
HWH78_RS22975  -1      12       1.115856   response regulator transcription factor
HWH78_RS22980  -2      12       -1.107596  sensor histidine kinase
HWH78_RS22985  -2      12       -1.884309  alpha/beta hydrolase family protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS22965  PF05272  34     0.001    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 33.9 bits (77), Expect = 0.001
Identities = 15/88 (17%), Positives = 26/88 (29%), Gaps = 20/88 (22%)

Query: 40 LTLLGPSGSGKTTSLMMLAGFETPTAGEILLAGRSINNVPPHKRDIGMVFQNYALFPHMT 99
+ L G G GK+T + L G + + + +D
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDI---------GTGKDSYEQIAGIVA---YE 646

Query: 100 VAENLAFPLSVRGMSKTDVKERVKRALS 127
++E + + D E VK S
Sbjct: 647 LSE-------MTAFRRADA-EAVKAFFS 666


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS22970  BICOMPNTOXIN  29     0.028    Staphylococcal bi-component toxin signature.
		>BICOMPNTOXIN#Staphylococcal bi-component toxin signature.

Length = 315

Score = 28.7 bits (64), Expect = 0.028
Identities = 14/61 (22%), Positives = 24/61 (39%), Gaps = 11/61 (18%)

Query: 135 KVAGSPQGWADFWDVKKFPGKRGLRWGAKYSLEFALMADGV-----APK------DVYQT 183
K+ G +++ KK + +RW +Y++ V PK +V QT
Sbjct: 80 KMQGFISSRTTYYNYKKTNHVKAMRWPFQYNIGLKTNDKYVSLINYLPKNKIESTNVSQT 139

Query: 184 L 184
L
Sbjct: 140 L 140


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
HWH78_RS22975  HTHFIS  64     4e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 63.7 bits (155), Expect = 4e-14
Identities = 31/116 (26%), Positives = 51/116 (43%), Gaps = 2/116 (1%)

Query: 3 IRVLVAEDHTIVREGIKQLIGMAKDLQVVGEATNGEQLLETLRGTPCEVVLLDISMPGVN 62
+LVA+D +R + Q + A V +N L + ++V+ D+ MP N
Sbjct: 4 ATILVADDDAAIRTVLNQALSRA-GYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 63 GLEAIPRIRALNEPPAILVLSMHDEAQMAARALKIGAAGYATKDSDPALLLTAIRR 118
+ +PRI+ +LV+S + A +A + GA Y K D L+ I R
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS22985  CHANLCOLICIN  29     0.031    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 28.9 bits (64), Expect = 0.031
Identities = 31/124 (25%), Positives = 47/124 (37%), Gaps = 17/124 (13%)

Query: 120 AGWQTLSLALPDPQSTAPVTRPAESAASASADKDA---------------SAADSASKPD 164
A W T L + A AE+ A A A++DA +A+ + S +
Sbjct: 55 AKWSTAQLKKTQAEQAARAKAAAEAQAKAKANRDALTQRLKDIVNEALRHNASRTPSATE 114

Query: 165 VKGESGNA--PAPESTAEAGSGEPAQSEDQAPPPAIDPVEQRKAHAERVMARLQASIDLA 222
+ + A E A + E A+ E +A A EQR+ ER A + + LA
Sbjct: 115 LAHANNAAMQAEDERLRLAKAEEKARKEAEAAEKAFQEAEQRRKEIEREKAETERQLKLA 174

Query: 223 LQHE 226
E
Sbjct: 175 EAEE 178


128  HWH78_RS23680  HWH78_RS23705  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias   Product
HWH78_RS23680  0       16       0.723488  two-component system response regulator CreB
HWH78_RS23685  1       15       0.500683  ATP-dependent zinc protease
HWH78_RS23690  1       14       0.666176  acyltransferase
HWH78_RS23695  0       13       0.778008  DUF2780 domain-containing protein
HWH78_RS23700  0       15       0.996262  AAA family protein disaggregase ClpG
HWH78_RS23705  2       11       1.078593  multidrug transporter subunit MdtD
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
HWH78_RS23680  HTHFIS  81     5e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 81.4 bits (201), Expect = 5e-20
Identities = 37/134 (27%), Positives = 63/134 (47%), Gaps = 1/134 (0%)

Query: 2 PHILIVEDEAAIADTLLYALQAEGFATTWVTLAGEALALQERQPADLLILDVGLPDISGF 61
IL+ +D+AAI L AL G+ + A DL++ DV +PD + F
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 EACKRLR-RFSEVPVIFLTARDAEIDRVVGLEIGADDYVVKPFSPREVAARVKAILKRMA 120
+ R++ ++PV+ ++A++ + + E GA DY+ KPF E+ + L
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 121 PRPAALEGAAPSGP 134
RP+ LE + G
Sbjct: 124 RRPSKLEDDSQDGM 137


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
HWH78_RS23685  NEISSPPORIN  28     0.027    Neisseria sp. porin signature.
		>NEISSPPORIN#Neisseria sp. porin signature.

Length = 348

Score = 27.6 bits (61), Expect = 0.027
Identities = 14/25 (56%), Positives = 18/25 (72%), Gaps = 1/25 (4%)

Query: 1 MKRALALLSLFALPVLA-AEPNLYG 24
MK++L L+L ALPV A A+ LYG
Sbjct: 1 MKKSLIALTLAALPVAAMADVTLYG 25


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
HWH78_RS23700  HTHFIS  49     7e-08    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 49.1 bits (117), Expect = 7e-08
Identities = 66/351 (18%), Positives = 117/351 (33%), Gaps = 48/351 (13%)

Query: 576 TAEEREKLLQMEERLHQRVIG---QQEAITAVSDAVRLARAGLRQGSRPIATFLFLGPTG 632
+ L +R ++ + S A++ L + + T + G +G
Sbjct: 111 LIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESG 170

Query: 633 VGKTELAKALAEVVFGDEDAMIRIDMSEYMERHAVSRLIGAPPGYVGYDEGGQLTERVRR 692
GK +A+AL + + I+M+ S L G E G T R
Sbjct: 171 TGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGH--------EKGAFTGAQTR 222

Query: 693 RPYSV-------ILLDEIEKAHADVNNILLQVFDDGRLTDGKGRVVDFTNTIIIATSNLG 745
+ LDEI D LL+V G T GR ++ I+A +N
Sbjct: 223 STGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATN-- 280

Query: 746 SELIMKNAQAGEFAQPPEKLKRELMTTLRGHFRPEFLNRLDEVIVFESLSKAQIEDIVRL 805
+ ++ + G FR + RL+ V + + + EDI L
Sbjct: 281 --------------KDLKQSINQ------GLFREDLYYRLNVVPLRLPPLRDRAEDIPDL 320

Query: 806 QLERVKRAAHAQDIYLHIDDSLVGHLAEEAYQPEFGARELKRQIRQQLETRLATAMLKGE 865
V++A D + + +A+ REL+ +R+ TA+ +
Sbjct: 321 VRHFVQQAEKEGLDVKRFDQEALELM--KAHPWPGNVRELENLVRR------LTALYPQD 372

Query: 866 VKEGETVTFFYDAKDGVGYRKGAAPKPAARKKSGAGETPKGRATAARKPAA 916
V E + ++ + AA + + S A E + A+ A
Sbjct: 373 VITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDAL 423


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS23705  TCRTETB  120    9e-32    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 120 bits (302), Expect = 9e-32
Identities = 91/416 (21%), Positives = 177/416 (42%), Gaps = 17/416 (4%)

Query: 6 QLTPRIARQLPWLVAVAFFMQALDGTILNTALPSMASSLNENPLRMQAVVIAYLLTVALL 65
Q R + L WL ++FF L+ +LN +LP +A+ N+ P V A++LT ++
Sbjct: 7 QSNLRHNQILIWLCILSFF-SVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIG 65

Query: 66 IPASGWIADRFGTRRVFLGAVLLFSLGSLLCALSPS-LELLVGARIVQGVGGALMMPVGR 124
G ++D+ G +R+ L +++ GS++ + S LL+ AR +QG G A +
Sbjct: 66 TAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVM 125

Query: 125 LVILRVYPRQDLVRVLSFVTIPGLLGPLAGPTLGGWLVEYASWHWIFLINLP-VGLLGCL 183
+V+ R P+++ + + +G GP +GG + Y HW +L+ +P + ++
Sbjct: 126 VVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHY--IHWSYLLLIPMITIITVP 183

Query: 184 VAMKLMPDLRSPVPSRFDSIGFLLFGGSMVLISIALEGLGELHLSHLRVVLLLIGGLVLL 243
MKL+ + FD G +L +V + L + V+ LI
Sbjct: 184 FLMKLLKKEVR-IKGHFDIKGIILMSVGIVFFMLFTTSYSISFLI-VSVLSFLI------ 235

Query: 244 TAYWLRALRIDKPLFPPSLFKARTFAVGILGNLFARLGSGALPFLTPLLLQVGLGYPPST 303
+ ++ P P L K F +G+L + P +++ +
Sbjct: 236 --FVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAE 293

Query: 304 AG-MTMIPLALFAMVAKPMAKPLLDFFGYRKLLVGNTLILGCLIAGFGLVDQDTPYVWLL 362
G + + P + ++ + L+D G +L L + + T + +
Sbjct: 294 IGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTI 353

Query: 363 LHLSLLGAVNSLQFTAMNTLTLIDLQDSNASSGNSLMSVVVQLSISLGVACAAALL 418
+ + +LG ++ + T ++T+ L+ A +G SL++ LS G+A LL
Sbjct: 354 IIVFVLGGLSFTK-TVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLL 408


129  HWH78_RS23850  HWH78_RS23885  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS23850  0       13       -0.579444  DEAD/DEAH box helicase
HWH78_RS23855  -1      14       -1.214389  multidrug efflux RND transporter outer membrane
HWH78_RS23860  -1      14       -1.588737  multidrug efflux RND transporter permease
HWH78_RS23865  -2      11       -1.080038  multidrug efflux RND transporter periplasmic
HWH78_RS23870  -2      10       -0.457271  MarR family transcriptional repressor MexR
HWH78_RS23875  -3      9        0.922869   YceI family protein
HWH78_RS23880  -3      9        1.641372   cytochrome b
HWH78_RS23885  -2      8        2.282138   flavin monoamine oxidase family protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits  Score  E-value  Comments
HWH78_RS23850  SECA  38     1e-04    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 37.9 bits (88), Expect = 1e-04
Identities = 28/108 (25%), Positives = 49/108 (45%), Gaps = 7/108 (6%)

Query: 212 IEVTPPNTTVERIEQ--RVFRLPAPQKRALLAHLVTVGAWEQ-VLVFTRTKHGANRLAEY 268
V P N + R + V+ A + +A++ + A Q VLV T + + ++
Sbjct: 409 TVVVPTNRPMIRKDLPDLVYMTEAEKIQAIIEDIKERTAKGQPVLVGTISIEKSELVSNE 468

Query: 269 LTKHGLPAAAIHG-NKSQNARTKALADFKANDVRILVATDIAARGLDI 315
LTK G+ ++ + A A A + A + +AT++A RG DI
Sbjct: 469 LTKAGIKHNVLNAKFHANEAAIVAQAGYPAA---VTIATNMAGRGTDI 513


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS23860  ACRIFLAVINRP  1353   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1353 bits (3503), Expect = 0.0
Identities = 692/1034 (66%), Positives = 838/1034 (81%), Gaps = 3/1034 (0%)

Query: 1 MSKFFIDRPIFAWVIALVIMLAGGLSILSLPVNQYPAIAPPAIAVQVSYPGASAETVQDT 60
M+ FFI RPIFAWV+A+++M+AG L+IL LPV QYP IAPPA++V +YPGA A+TVQDT
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VVQVIEQQMNGIDNLRYISSESNSDGSMTITVTFEQGTDPDIAQVQVQNKLQLATPLLPQ 120
V QVIEQ MNGIDNL Y+SS S+S GS+TIT+TF+ GTDPDIAQVQVQNKLQLATPLLPQ
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 121 EVQRQGIRVTKAVKNFLMVVGVVSTDGSMTKEDLSNYIVSNIQDPLSRTKGVGDFQVFGS 180
EVQ+QGI V K+ ++LMV G VS + T++D+S+Y+ SN++D LSR GVGD Q+FG+
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 181 QYSMRIWLDPAKLNSYQLTPGDVSSAIQAQNVQISSGQLGGLPAVKGQQLNATIIGKTRL 240
QY+MRIWLD LN Y+LTP DV + ++ QN QI++GQLGG PA+ GQQLNA+II +TR
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 241 QTAEQFENILLKVNPDGSQVRLKDVADVGLGGQDYSINAQFNGSPASGIAIKLATGANAL 300
+ E+F + L+VN DGS VRLKDVA V LGG++Y++ A+ NG PA+G+ IKLATGANAL
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 301 DTAKAIRQTIANLEPFMPQGMKVVYPYDTTPVVSASIHEVVKTLGEAILLVFLVMYLFLQ 360
DTAKAI+ +A L+PF PQGMKV+YPYDTTP V SIHEVVKTL EAI+LVFLVMYLFLQ
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 361 NFRATLIPTIAVPVVLLGTFGVLAAFGFSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420
N RATLIPTIAVPVVLLGTF +LAAFG+SINTLTMFGMVLAIGLLVDDAIVVVENVERVM
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 421 AEEGLSPREAARKSMGQIQGALVGIAMVLSAVFLPMAFFGGSTGVIYRQFSITIVSAMAL 480
E+ L P+EA KSM QIQGALVGIAMVLSAVF+PMAFFGGSTG IYRQFSITIVSAMAL
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 481 SVIVALILTPALCATMLKPIEKGDHGEHKGGFFGWFNRMFLSTTHGYERGVASILKHRAP 540
SV+VALILTPALCAT+LKP+ H E+KGGFFGWFN F + + Y V IL
Sbjct: 481 SVLVALILTPALCATLLKPVSAE-HHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 541 YLLIYVVIVAGMIWMFTRIPTAFLPDEDQGVLFAQVQTPPGSSAERTQVVVDSMREYLLE 600
YLLIY +IVAGM+ +F R+P++FLP+EDQGV +Q P G++ ERTQ V+D + +Y L+
Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 601 KESSSVSSVFTVTGFNFAGRGQSSGMAFIMLKPWEERPGGENSVFELAKRAQMHFFSFKD 660
E ++V SVFTV GF+F+G+ Q++GMAF+ LKPWEER G ENS + RA+M +D
Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659

Query: 661 AMVFAFAPPSVLELGNATGFDLFLQDQAGVGHEVLLQARNKFLMLAAQNPA-LQRVRPNG 719
V F P+++ELG ATGFD L DQAG+GH+ L QARN+ L +AAQ+PA L VRPNG
Sbjct: 660 GFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNG 719

Query: 720 MSDEPQYKLEIDDEKASALGVSLADINSTVSIAWGSSYVNDFIDRGRVKRVYLQGRPDAR 779
+ D Q+KLE+D EKA ALGVSL+DIN T+S A G +YVNDFIDRGRVK++Y+Q R
Sbjct: 720 LEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFR 779

Query: 780 MNPDDLSKWYVRNDKGEMVPFNAFATGKWEYGSPKLERYNGVPAMEILGEPAPGLSSGDA 839
M P+D+ K YVR+ GEMVPF+AF T W YGSP+LERYNG+P+MEI GE APG SSGDA
Sbjct: 780 MLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDA 839

Query: 840 MAAVEEIVKQLPKGVGYSWTGLSYEERLSGSQAPALYALSLLVVFLCLAALYESWSIPFS 899
MA +E + +LP G+GY WTG+SY+ERLSG+QAPAL A+S +VVFLCLAALYESWSIP S
Sbjct: 840 MALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVS 899

Query: 900 VMLVVPLGVIGALLATSMRGLSNDVFFQVGLLTTIGLSAKNAILIVEFAKELHE-QGKGI 958
VMLVVPLG++G LLA ++ NDV+F VGLLTTIGLSAKNAILIVEFAK+L E +GKG+
Sbjct: 900 VMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGV 959

Query: 959 VEAAIEACRMRLRPIVMTSLAFILGVVPLAISTGAGSGSQHAIGTGVIGGMVTATVLAIF 1018
VEA + A RMRLRPI+MTSLAFILGV+PLAIS GAGSG+Q+A+G GV+GGMV+AT+LAIF
Sbjct: 960 VEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIF 1019

Query: 1019 WVPLFYVAVSTLFK 1032
+VP+F+V + FK
Sbjct: 1020 FVPVFFVVIRRCFK 1033


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
HWH78_RS23865  RTXTOXIND  47     8e-08    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 47.1 bits (112), Expect = 8e-08
Identities = 24/93 (25%), Positives = 43/93 (46%), Gaps = 1/93 (1%)

Query: 62 RIAEVRPQVNGIILKRLFKEGSDVKAGQQLYQIDPATYEADYQSAQANLASTQEQAQRYK 121
R E++P N I+ + + KEG V+ G L ++ EAD Q++L + + RY+
Sbjct: 95 RSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQ 154

Query: 122 LLVADQAVSKQQYADANA-AYLQSKAAVEQARI 153
+L ++K Y Q+ + E R+
Sbjct: 155 ILSRSIELNKLPELKLPDEPYFQNVSEEEVLRL 187



Score = 42.9 bits (101), Expect = 1e-06
Identities = 42/268 (15%), Positives = 93/268 (34%), Gaps = 37/268 (13%)

Query: 37 EVGIVTLEAQTVTLNTELPGRTNAFRIAEVRPQVNGIILKRLFKEGSDVKAGQQLYQIDP 96
E+ + A+ +T+ + N R+ + R L + + K +
Sbjct: 206 ELNLDKKRAERLTVLARINRYENLSRVEKSRLD----DFSSLLHKQAIAKHAVLEQENKY 261

Query: 97 ATYEADYQSAQANLASTQEQAQRYK--LLVADQAVSKQ---QYADANAAYLQSKAAVEQA 151
+ + ++ L + + K + Q + + + +
Sbjct: 262 VEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKN 321

Query: 152 RINLRYTKVLSPISGRIGRSAV-TEGALVTNGQANAMATVQQLDPIYVDVTQPSTALLRL 210
+ + + +P+S ++ + V TEG +VT + M V + D + V + + +
Sbjct: 322 EERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE-TLMVIVPEDDTLEVTALVQNKDIGFI 380

Query: 211 RRELASGQLERAGDNAAKVSLKLE--DGSQYP-LEGRLE--FSEVSVDEGTGSVT--IRA 263
GQ +K+E ++Y L G+++ + D+ G V I +
Sbjct: 381 N----VGQ---------NAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIIS 427

Query: 264 V------FPNPNNELLPGMFVHAQLQEG 285
+ N N L GM V A+++ G
Sbjct: 428 IEENCLSTGNKNIPLSSGMAVTAEIKTG 455


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS23885  FLGFLGJ  30     0.016    Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 30.5 bits (68), Expect = 0.016
Identities = 24/128 (18%), Positives = 43/128 (33%), Gaps = 6/128 (4%)

Query: 171 NLSPTAR----LLVNQRIRSRYDEPSRLSLLYLAQQGRAYRGVDDRDLRAARLPGGSQVL 226
N+ P AR + V ++S D + L+ ++ R Y + D+ + G L
Sbjct: 32 NIRPVARQVEGMFVQMMLKSMRDALPK-DGLFSSEHTRLYTSMYDQQIAQQMTAGKGLGL 90

Query: 227 AEAFVKQIKTIKTKSKVSSIVQAKDGVAVKAGSETYKADYVVLAVPLKALGQIQMTPSLS 286
AE VKQ+ + + S+ A ++ L P S
Sbjct: 91 AEMMVKQMTPEQPLPEEST-PAAPMKFPLETVVRYQNQALSQLVQKAVPRNYDDSLPGDS 149

Query: 287 GTQMSALK 294
++ L
Sbjct: 150 KAFLAQLS 157


130  HWH78_RS23920  HWH78_RS23980  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS23920  1       9        1.426548   chemotaxis protein CheB
HWH78_RS23925  1       9        1.014489   chemotaxis signal transduction system protein
HWH78_RS23930  1       12       -2.077299  type 4 fimbrial methyltransferase PilK
HWH78_RS23935  1       13       -1.935765  chemotaxis chemoreceptor PilJ
HWH78_RS23940  0       13       -1.517385  chemotaxis protein CheW
HWH78_RS23945  0       16       -1.146053  twitching motility response regulator PilH
HWH78_RS23950  0       16       -0.413099  twitching motility response regulator PilG
HWH78_RS23955  -2      14       0.304651   glutathione synthase
HWH78_RS23960  -1      15       1.989049   energy transducer TonB
HWH78_RS23965  -1      14       2.273514   YqgE/AlgH family protein
HWH78_RS23970  0       13       2.145492   Holliday junction resolvase RuvX
HWH78_RS23975  1       14       1.496349   bifunctional pyr operon transcriptional
HWH78_RS23980  1       14       1.303361   aspartate carbamoyltransferase catalytic
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
HWH78_RS23920  HTHFIS  30     0.013    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.2 bits (68), Expect = 0.013
Identities = 21/82 (25%), Positives = 34/82 (41%), Gaps = 3/82 (3%)

Query: 7 PRVAVIADTSLQRHVLQQALLGHGYEVVLNADPARVDDAALECAPDLWLVDLTQQDDS-- 64
+ V D + R VL QAL GY+V + ++ A + DL + D+ D++
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 65 PLLDSLLEQD-RAPVLFGEGHA 85
LL + + PVL
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQN 85


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
HWH78_RS23925  HTHFIS  68     2e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 68.3 bits (167), Expect = 2e-13
Identities = 26/113 (23%), Positives = 54/113 (47%), Gaps = 2/113 (1%)

Query: 2351 VMVVDDSVTVRKVTTRLLERNGMNVLTAKDGVDAIAQLQEHRPDILLLDIEMPRMDGFEV 2410
++V DD +R V + L R G +V + + D+++ D+ MP + F++
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 2411 ATLVRHDERLGNLPIIMITSRTGEKHRERALGIGVNQYLGKPYQETELLEAIQ 2463
++ + +LP+++++++ +A G YL KP+ TEL+ I
Sbjct: 66 LPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
HWH78_RS23945  HTHFIS  80     8e-21    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.9 bits (197), Expect = 8e-21
Identities = 33/119 (27%), Positives = 51/119 (42%), Gaps = 2/119 (1%)

Query: 2 ARILIVDDSPTEMYKLTAMLEKHGHQVLKAENGGDGVALARQEKPDVVLMDIVMPGLNGF 61
A IL+ DD L L + G+ V N D+V+ D+VMP N F
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 QATRQLTKDAETSAIPVIIVTTKDQETDKVWGKRQGARDYLTKPVDEETLLKTINAVLA 120
++ K +PV++++ ++ + +GA DYL KP D L+ I LA
Sbjct: 64 DLLPRIKKARPD--LPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
HWH78_RS23950  HTHFIS  73     3e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 72.9 bits (179), Expect = 3e-18
Identities = 32/117 (27%), Positives = 51/117 (43%), Gaps = 2/117 (1%)

Query: 6 DGLKVMVIDDSKTIRRTAETLLKKVGCDVITAIDGFDALAKIADTHPNIIFVDIMMPRLD 65
G ++V DD IR L + G DV + IA +++ D++MP +
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 66 GYQTCALIKNNSAFKSTPVIMLSSKDGLFDKAKGRIVGSDQYLTKPFSKEELLGAIK 122
+ IK A PV+++S+++ K G+ YL KPF EL+G I
Sbjct: 62 AFDLLPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS23960  PF03544  63     1e-13    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 63.1 bits (153), Expect = 1e-13
Identities = 31/183 (16%), Positives = 58/183 (31%), Gaps = 14/183 (7%)

Query: 95 APFQDNQVKKVAPPAT--------PKQARSEEAPKAAVTTTRQRQQKAPSKTQAQKAEQV 146
AP Q V VAP P + E P+ ++ + K +
Sbjct: 45 APAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPK 104

Query: 147 AKPAPHFDSTQLSAEIASLEADLAKEQQAYAKRPRIHRLSAASTMRDKGAWYKEDWRKKI 206
KP Q ++ + P S A+ K + +
Sbjct: 105 PKPVK--KVEQPKRDVKP--VESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRAL 160

Query: 207 ERIGNLNYPDEARRQKLYGSLRLLVSINRDGTIYEVQVLESSGEPILDQAAQRIVRLAAP 266
R YP A+ ++ G +++ + DG + VQ+L + + ++ + +R
Sbjct: 161 SRN-QPQYPARAQALRIEGQVKVKFDVTPDGRVDNVQILSAKPANMFEREVKNAMR-RWR 218

Query: 267 YAP 269
Y P
Sbjct: 219 YEP 221


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS23980  TYPE3IMPPROT  29     0.032    Type III secretion system inner membrane P protein ...
		>TYPE3IMPPROT#Type III secretion system inner membrane P protein family signature.

Length = 224

Score = 28.6 bits (64), Expect = 0.032
Identities = 10/41 (24%), Positives = 17/41 (41%)

Query: 293 ADGAQSVILNQVTYGIAIRMAVLSMAMSGQNTQRQLEQEDA 333
A G Q + N G+A+ +++ M + E ED
Sbjct: 40 ALGLQQIPSNMTLNGVALLLSMFVMWPIMHDAYVYFEDEDV 80


131  HWH78_RS24130  HWH78_RS24165  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS24130  1       13       1.994643   signal recognition particle-docking protein
HWH78_RS24135  -1      13       1.889120   insulinase family protein
HWH78_RS24140  -1      11       1.873954   insulinase family protein
HWH78_RS24145  0       11       1.897102   16S rRNA (guanine(966)-N(2))-methyltransferase
HWH78_RS24150  -1      11       1.699884   hypothetical protein
HWH78_RS24155  -2      12       1.457556   hydrolase
HWH78_RS24160  -3      11       0.297163   GNAT family N-acetyltransferase
HWH78_RS24165  -2      11       -0.459549  TetR/AcrR family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
HWH78_RS24130  TONBPROTEIN  46     8e-08    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 46.1 bits (109), Expect = 8e-08
Identities = 23/95 (24%), Positives = 32/95 (33%), Gaps = 11/95 (11%)

Query: 59 SLTEQPGRQQP-----SAAEPAEPPAPVAEAPLAGDEPASAAEHSPRPEAPVAQPEPILA 113
+ E P QP EPP V P E P PE P+
Sbjct: 34 QVIELPAPAQPISVTMVTPADLEPPQAVQP------PPEPVVEPEPEPEPIPEPPKEAPV 87

Query: 114 AEPEPEPEPEPEPEPEPVAPLAAAPAVSEPATRPG 148
+P+P+P+P+P+P V +RP
Sbjct: 88 VIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESRPA 122



Score = 34.2 bits (78), Expect = 6e-04
Identities = 28/123 (22%), Positives = 40/123 (32%), Gaps = 16/123 (13%)

Query: 28 QAGEQPA-DQPVEPVSETAAAEQRAPADDVAQSLTEQPGRQQPSAAEPAEPPAPVAEAPL 86
Q E PA QP+ T A + A EP P P+ E P
Sbjct: 34 QVIELPAPAQPISVTMVTPADLEPPQA----------VQPPPEPVVEPEPEPEPIPEPP- 82

Query: 87 AGDEPASAAEHSPRPE-APVAQPEPILAAEPEPEPEPEPEPEPEPVAPLAAAPAVSEPAT 145
+ A P+P+ P +P + +P+ + +P P A A S AT
Sbjct: 83 ---KEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESRPASPFENTAPARLTSSTAT 139

Query: 146 RPG 148

Sbjct: 140 AAT 142


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS24140  PHPHTRNFRASE  34     0.002    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 33.6 bits (77), Expect = 0.002
Identities = 27/109 (24%), Positives = 46/109 (42%), Gaps = 16/109 (14%)

Query: 228 PTISREQLQAFHKKAYAAGN--VVIALVGDLS--RQEAEAIAAEVSKALPQGPALAKTVQ 283
I R QL+A +A GN V+ ++ L RQ + E K L +G ++ +++
Sbjct: 368 QDIFRTQLRAL-LRASTYGNLKVMFPMIATLEELRQAKAIMQEEKDKLLSEGVDVSDSIE 426

Query: 284 P----ETPKPGLT------HIDFPSEQTH-LMLAQLGIDRQDPDYAALY 321
E P + +DF S T+ L+ + DR + + LY
Sbjct: 427 VGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAADRMNERVSYLY 475


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS24155  PF06057  29     0.024    Type IV secretory pathway VirJ component
		>PF06057#Type IV secretory pathway VirJ component

Length = 243

Score = 29.0 bits (65), Expect = 0.024
Identities = 14/73 (19%), Positives = 28/73 (38%), Gaps = 15/73 (20%)

Query: 79 GLQRALLERGWASVALN-----WRGCSGEPNRLPRGYHSGVSDDLAEVVAHLRARRPQAP 133
+ L ++GW V + W+ + P+ V+ D ++ +A
Sbjct: 69 AVGGILQQQGWPVVGWSSLKYYWK------QKDPKD----VTQDTLAIIDKYQAEFGTQK 118

Query: 134 LYAVGYSLGGNVL 146
+ +GYS G V+
Sbjct: 119 VILIGYSFGAEVI 131


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS24165  HTHTETR  59     5e-13    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 59.3 bits (143), Expect = 5e-13
Identities = 30/170 (17%), Positives = 63/170 (37%), Gaps = 11/170 (6%)

Query: 7 TRDRIAQASLELFNAQGERSVTTNHIATHLGISPGNLYYHYPNKQAIIAELFAEYESHVE 66
TR I +L LF+ QG S + IA G++ G +Y+H+ +K + +E++ ES++
Sbjct: 12 TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIG 71

Query: 67 SFLRLPEGRGLTVDDKTF--YLEALLAAMWRYRFLHRDLEHLLESD------PELAARYR 118
+ + L +L + +E + + R
Sbjct: 72 ELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQR 131

Query: 119 AFAQRCLVNAKAIYRGFTEAGILR-MNETQLEALTLNAWI--ILTSWVRF 165
+ + EA +L T+ A+ + +I ++ +W+
Sbjct: 132 NLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFA 181


132  HWH78_RS25360  HWH78_RS25415  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS25360  -3      12       1.614306   TetR/AcrR family transcriptional regulator
HWH78_RS25365  -1      11       0.896066   purine permease
HWH78_RS25370  1       10       -0.015642  membrane protein
HWH78_RS25375  -1      10       0.958447   hypothetical protein
HWH78_RS25380  0       12       0.653296   gamma-glutamyltransferase family protein
HWH78_RS25385  1       11       0.323249   helix-turn-helix transcriptional regulator
HWH78_RS25390  1       11       0.462950   OprD family porin
HWH78_RS25395  2       11       1.062301   hypothetical protein
HWH78_RS25400  2       12       1.172968   LysR family transcriptional regulator
HWH78_RS25405  2       13       1.060060   triclosan efflux RND transporter permease
HWH78_RS25410  2       12       2.660069   triclosan efflux RND transporter adaptor protein
HWH78_RS25415  1       12       2.324334   triclosan efflux RND transporter adaptor protein
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS25360  HTHTETR  66     2e-15    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 66.2 bits (161), Expect = 2e-15
Identities = 30/214 (14%), Positives = 66/214 (30%), Gaps = 30/214 (14%)

Query: 15 KPAGRIRQKNEEAILAAAEEEFARHGFKGTSMNTIAQNVGLPKANLHYYFGNKLGLYTAV 74
+ + Q+ + IL A F++ G TS+ IA+ G+ + ++++F +K L++ +
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 75 LSNILELWDSTFNTLGVD--DDPAEALARYIRAKMEFSRRYPLASRIFA----------- 121
DP L + +E + +
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 122 MEIISGGECLTAHFNQDYRSWFRGRAAVFEAWIAAGRMDP-VDPVHLIFLLWGSTQHYAD 180
M ++ + + + + I A + + ++ G Y
Sbjct: 123 MAVVQQAQ---RNLCLESYDRI---EQTLKHCIEAKMLPADLMTRRAAIIMRG----YIS 172

Query: 181 FASQIGLVTGR-KRMSRQDFAAAADNLVRIILKG 213
GL+ D A + V I+L+
Sbjct: 173 -----GLMENWLFAPQSFDLKKEARDYVAILLEM 201


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits        Score  E-value  Comments
HWH78_RS25370  CHANNELTSX  46     8e-08    Nucleoside-specific channel-forming protein Tsx signa...
		>CHANNELTSX#Nucleoside-specific channel-forming protein Tsx signature.

Length = 294

Score = 45.8 bits (108), Expect = 8e-08
Identities = 38/135 (28%), Positives = 58/135 (42%), Gaps = 9/135 (6%)

Query: 14 LLAAGQAVAEDHDMTPTHETDSGPLL---WHNESLTYLYGKNFKINPPIQQTFTLEHAS- 69
LLAAG VA + P W ++S+ + + + P I+ LE+ +
Sbjct: 5 LLAAGAVVALSTTFAAGAAENDKPQYLSDWWHQSVNVVGSYHTRFGPQIRNDTYLEYEAF 64

Query: 70 -GWTWGDLFIFFDQ-INYNGKEDAS---NGKNTYYGEITPRLSFGKLTGADLSFGPVKDV 124
W D + + D + + G A N + + EI PR S KLT DLSFGP K+
Sbjct: 65 AKKDWFDFYGYIDAPVFFGGNSTAKGIWNKGSPLFMEIEPRFSIDKLTNTDLSFGPFKEW 124

Query: 125 LLAGTYEFGEGDTEA 139
A Y + G ++
Sbjct: 125 YFANNYIYDMGRNDS 139


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS25380  PF09025  30     0.016    YopR Core
		>PF09025#YopR Core

Length = 143

Score = 29.6 bits (66), Expect = 0.016
Identities = 26/90 (28%), Positives = 33/90 (36%), Gaps = 10/90 (11%)

Query: 134 LPFEQLL---RPAIELARDGFPVSPVIARLWQSGLDKFRAALPQRPELRAWFDEFLIDGR 190
L FEQ L PA G + RL Q + R EL+A L GR
Sbjct: 30 LAFEQALGGEPPAAGRRLAGLENGALGERLLQRFAQPLQGLEADRLELKAMLRAELPLGR 89

Query: 191 APRA------GEVFRQPGQADTLDELARSQ 214
+ G V PG + L +LAR +
Sbjct: 90 QQQTFLLQLLGAVEHAPG-GEYLAQLARRE 118


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS25385  PHAGEIV  30     0.009    Gene IV protein signature.
		>PHAGEIV#Gene IV protein signature.

Length = 426

Score = 30.3 bits (68), Expect = 0.009
Identities = 15/76 (19%), Positives = 29/76 (38%), Gaps = 3/76 (3%)

Query: 105 TRCRVLEVTPLARELIKSFCELPVDYPEGDSAESRLVQVLLDQLRLLPEVAFSLPMPREP 164
R ++ + +KS + D + +V D L LP+ ++ +P +
Sbjct: 138 NNVRAKDLIRVVELFVKSNTSKSSNVLSVDGSNLLVVSAPKDILDNLPQFLSTVDLPTDQ 197

Query: 165 RLLRLCQALIDEPTQS 180
L+ + LI E Q
Sbjct: 198 ILI---EGLIFEVQQG 210


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS25405  ACRIFLAVINRP  490    e-159    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 490 bits (1263), Expect = e-159
Identities = 240/1052 (22%), Positives = 444/1052 (42%), Gaps = 69/1052 (6%)

Query: 7 LSDWALRHQSLVWYLMAVSLVMGVFSYLNLGREEDPSFAIKTMVIQTRWPGATVDDTLEQ 66
++++ +R W L + ++ G + L L + P+ A + + +PGA +
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 67 VTDRIEKKLEELDSLDYVKSYT-RPGESTVFVYLKDTTKAGDIPDIWYQVRKKISDIQGE 125
VT IE+ + +D+L Y+ S + G T+ + + T D QV+ K+
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGT---DPDIAQVQVQNKLQLATPL 117

Query: 126 FPQGIQGPG-FNDEFGDVFGSVYAFTADGLDFRQ--LRDYVEKVRLD-IRSVKDLGKVQM 181
PQ +Q G ++ + V F +D Q + DYV D + + +G VQ+
Sbjct: 118 LPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQL 177

Query: 182 IGAQNEV-IYLNFSTRKLAALGLDQRQVVQSLQAQNAVTPSGVVEAGPE------RISVR 234
GAQ + I+L+ L L V+ L+ QN +G + P S+
Sbjct: 178 FGAQYAMRIWLDAD--LLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASII 235

Query: 235 TSGNFRSEKDLQAVNLRVNDRFY--RLSDLASISRDFVDPPTSLFRYKGEPAIGLAVAMK 292
F++ ++ V LRVN RL D+A + + + R G+PA GL + +
Sbjct: 236 AQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARV-ELGGENYNVIARINGKPAAGLGIKLA 294

Query: 293 EGGNILEFGEALNARMQEITGELPVGVGVHQVSNQAQVVKKAVGGFTRALFEAVVIVLIV 352
G N L+ +A+ A++ E+ P G+ V + V+ ++ + LFEA+++V +V
Sbjct: 295 TGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLV 354

Query: 353 SFVSLG-LRAGLVVACSIPLVLAMVFVFMEYTDITMQRVSLGALIIALGLLVDDAMITVE 411
++ L +RA L+ ++P+VL F + ++ +++ +++A+GLLVDDA++ VE
Sbjct: 355 MYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVE 414

Query: 412 MMITRLELGDSLHDSATY-AYTSTAFPMLTGTLVTVAGFVPIGLNASSAGEYTFTLFAVI 470
+ + AT + + ++ +V A F+P+ S G I
Sbjct: 415 NVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITI 474

Query: 471 AVALLLSWIVAVLFAPVIAVHILPKTLKHKSEQKKG---RIAERFDSLLHLA-------M 520
A+ LS +VA++ P + +L E K G FD ++ +
Sbjct: 475 VSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKIL 534

Query: 521 RRRWTTIFLTALLFGVSLFLMKFVQHQFFPSSDRPELLVDLNLPQNSSIHETRAVMDR-L 579
+ + AL+ + L + F P D+ L + LP ++ T+ V+D+
Sbjct: 535 GSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVT 594

Query: 580 EATLKDDEDID-HWSAYVGEGAIRFYLPLDQQLQNNFYGQLVIVTKDLEAR---ERVAAR 635
+ LK+++ G Q QN + K E R E A
Sbjct: 595 DYYLKNEKANVESVFTVNGFS-------FSGQAQNAGMAF--VSLKPWEERNGDENSAEA 645

Query: 636 LRDRLRKDYVGI-STYVQPLEMGPPV--------GRPIQYRVSGPQIDKVREYAMGLAGV 686
+ R + + I +V P M P + +G D + + L G+
Sbjct: 646 VIHRAKMELGKIRDGFVIPFNM-PAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGM 704

Query: 687 LDGNP-NIGDIVYDWNEPGKMLKIDIAQDKARQLGLSSEDVAQIMNSVVTGSAVTQVRDD 745
+P ++ + + E K+++ Q+KA+ LG+S D+ Q +++ + G+ V D
Sbjct: 705 AAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDR 764

Query: 746 IYLVNVIGRAEDSERGSLETLESLQIVTPSGTSIPLKAFAKVSYELEQPLVWRRDRKPTI 805
+ + +A+ R E ++ L + + +G +P AF + P + R + P++
Sbjct: 765 GRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSM 824

Query: 806 TVKASLRGEIQPTDLVARLAPEVKRFADGLPANYRIEVGGTVEESGKAEGPIAKVVPLML 865
+ +GE P ++ A LPA + G + + +V +
Sbjct: 825 EI----QGEAAPGTSSGDAMALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISF 880

Query: 866 FLMATFLMIQLQSVQKLFLVASVAPLGLIGVVAALLPTGTPMGFVAILGILALIGIIIRN 925
++ L +S V V PLG++GV+ A ++G+L IG+ +N
Sbjct: 881 VVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKN 940

Query: 926 SVILVTQI-DAFEKDGKTPWEAVLEATHHRTRPILLTAAAASLGMIPIA------REVFW 978
++++V D EK+GK EA L A R RPIL+T+ A LG++P+A
Sbjct: 941 AILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQN 1000

Query: 979 GPMAYAMIGGIVAATLLTLIFLPALYVAWYRI 1010
+ ++GG+V+ATLL + F+P +V R
Sbjct: 1001 -AVGIGVMGGMVSATLLAIFFVPVFFVVIRRC 1031



Score = 79.1 bits (195), Expect = 5e-17
Identities = 79/517 (15%), Positives = 172/517 (33%), Gaps = 35/517 (6%)

Query: 518 LAMRRRWTTIFLTALLFGVSLFLMKFVQHQFFPSSDRPELLVDLNLPQNSSIHETRAVMD 577
+RR L +L + + +P+ P + V N P + V
Sbjct: 4 FFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQ 63

Query: 578 RLEATLKDDEDIDHWSAYVGEGAIRFYLPLDQQLQNNFYGQLVIVTKDLEARERVAARLR 637
+E + +++ + S+ + A + L Q + V V + L
Sbjct: 64 VIEQNMNGIDNLMYMSS-TSDSAGSVTITLTFQSGTDPDIAQVQVQ---NKLQLATPLLP 119

Query: 638 DRLRKDYVGISTYVQPLEMG----PPVGRPIQYRVSGPQIDKVREYAMGLAGVLDGNPNI 693
+++ + + M Q +S V++ L GV D
Sbjct: 120 QEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG 179

Query: 694 GDIVYDWNEPGKMLKIDIAQDKARQLGLSSEDVAQIMNS----VVTGSAVTQVRDDIYLV 749
++I + D + L+ DV + + G +
Sbjct: 180 AQ---------YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQL 230

Query: 750 NVIGRAEDSERGSLETLESLQI-VTPSGTSIPLKAFAKVSYELE-QPLVWRRDRKPTITV 807
N A+ + E + + V G+ + LK A+V E ++ R + KP +
Sbjct: 231 NASIIAQT-RFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGL 289

Query: 808 KASLRGEIQPTDLVARLAPEVKRFADGLPANYRIEVGGTVEESGKAEGPIAKVVPLML-- 865
L D + ++ P ++ + + + I +VV +
Sbjct: 290 GIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLY--PYDTTPFVQLSIHEVVKTLFEA 347

Query: 866 -FLMATFLMIQLQSVQKLFLVASVAPLGLIGVVAALLPTGTPMGFVAILGILALIGIIIR 924
L+ + + LQ+++ + P+ L+G A L G + + + G++ IG+++
Sbjct: 348 IMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVD 407

Query: 925 NSVILVTQI-DAFEKDGKTPWEAVLEATHHRTRPILLTAAAASLGMIPIA-----REVFW 978
+++++V + +D P EA ++ ++ A S IP+A +
Sbjct: 408 DAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIY 467

Query: 979 GPMAYAMIGGIVAATLLTLIFLPALYVAWYRIPEPGR 1015
+ ++ + + L+ LI PAL +
Sbjct: 468 RQFSITIVSAMALSVLVALILTPALCATLLKPVSAEH 504


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
HWH78_RS25410  RTXTOXIND  40     1e-05    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 39.8 bits (93), Expect = 1e-05
Identities = 16/103 (15%), Positives = 32/103 (31%), Gaps = 3/103 (2%)

Query: 63 TNGRIASRLFDVGDFVGKGALLATLDPTDQQNQLRASQGDLASAEAQLIDAQANARRQEE 122
N + + G+ V KG +L L + +Q L A + Q +R E
Sbjct: 103 ENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIEL 162

Query: 123 --LFARSVTAQARLDDARTR-LKTSQASFDQAKAAVQQARDQL 162
L + + + + + + + Q + Q
Sbjct: 163 NKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQK 205



Score = 39.0 bits (91), Expect = 2e-05
Identities = 23/182 (12%), Positives = 59/182 (32%), Gaps = 31/182 (17%)

Query: 51 IQARYESVLGFRTNGRIASRLFDVGDFVGKGALLATLDPTDQQNQLRASQGDLASAEAQL 110
I ++ S L + K A+L +Q+N+ + +L ++QL
Sbjct: 223 INRYENLSRVEKSRLDDFSSLLHKQ-AIAKHAVL------EQENKYVEAVNELRVYKSQL 275

Query: 111 IDAQANARRQEELFARSVTAQARLDDARTRLKTSQASFDQAKAAVQQARDQLSYTRLVTD 170
++ +E + Q ++ +L+ + + + + ++ + +
Sbjct: 276 EQIESEILSAKE--EYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAP 333

Query: 171 FDGVITTW--HAEAGQVVSAGQAVVTLARPEVREAVFDLPTEVAESLPADARFLVSAQLD 228
+ H E G VV+ + ++ + P D V+A +
Sbjct: 334 VSVKVQQLKVHTE-GGVVTTAETLMVIV-------------------PEDDTLEVTALVQ 373

Query: 229 PQ 230
+
Sbjct: 374 NK 375


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
HWH78_RS25415  RTXTOXIND  49     1e-08    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 49.1 bits (117), Expect = 1e-08
Identities = 23/129 (17%), Positives = 41/129 (31%), Gaps = 9/129 (6%)

Query: 75 VGGKIVERLVDVGDHVAAGQVLARLDP-------QDQRSNVENAQAAVAAQQAQSKLADL 127
+ E +V G+ V G VL +L +S++ A+ Q S+ +L
Sbjct: 103 ENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIEL 162

Query: 128 NYQRQKALLPKGYTSQSEYDQALASVRSAQSSLKAAQAQLANARDLLSYTELRASDAGVI 187
N + L + Y ++ L + Q Q L+ + RA V+
Sbjct: 163 NKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKE--LNLDKKRAERLTVL 220

Query: 188 TARQAEVGQ 196

Sbjct: 221 ARINRYENL 229



Score = 42.9 bits (101), Expect = 2e-06
Identities = 33/216 (15%), Positives = 71/216 (32%), Gaps = 26/216 (12%)

Query: 58 SITGDIQARVQADQSFRVGGKIVERLVDVGDHVAAGQVLARLDPQDQRSNVENAQAAVAA 117
++ I + + L+ A A L+ +++ N +
Sbjct: 218 TVLARINRYENLSRVEKSRLDDFSSLLHKQ----AIAKHAVLEQENKYVEAVNELRVYKS 273

Query: 118 QQAQSKLADLNYQRQKALLPKGYTSQSEYDQALASVRSAQSSLKAAQAQLANARDLLSYT 177
Q Q + L+ + + L+ + + ++ L +R ++ +LA + +
Sbjct: 274 QLEQIESEILSAKEEYQLVTQLFKNE-----ILDKLRQTTDNIGLLTLELAKNEERQQAS 328

Query: 178 ELRASDAGVITARQA-EVGQVVQATVPIFTLARDGERDAVFNVYESLFSHDVDGQRITVS 236
+RA + + + G VV + + + D V + + D+ I V
Sbjct: 329 VIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPE---DDTLEVTALVQNKDIG--FINVG 383

Query: 237 LLGKPEVTA---------SGKVREITP--TVDERSG 261
+V A GKV+ I D+R G
Sbjct: 384 QNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLG 419


133  HWH78_RS26420  HWH78_RS26455  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS26420  1       10       -2.063056  energy transducer TonB1
HWH78_RS26425  1       10       -2.354151  MFS transporter
HWH78_RS26430  1       11       -1.038660  cation:proton antiporter
HWH78_RS26435  -2      13       0.423258   Tim44 domain-containing protein
HWH78_RS26440  -2      12       0.603936   SMI1/KNR4 family protein
HWH78_RS26445  -2      10       0.580626   YgdI/YgdR family lipoprotein
HWH78_RS26450  -2      9        1.118738   GntR family transcriptional regulator
HWH78_RS26455  -1      9        0.689347   SDR family oxidoreductase
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
HWH78_RS26420  TONBPROTEIN  115    2e-32    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 115 bits (288), Expect = 2e-32
Identities = 66/193 (34%), Positives = 93/193 (48%), Gaps = 17/193 (8%)

Query: 137 AEPTPQPPAAAPEPTPPKIEEPKPEPPKPKPVEKPKPKPKPKPKPVENAIPKAKPKPEPK 196
P A P P P EP+PEP P E P KPKPKP KPKP+P
Sbjct: 52 PADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKP--------KPKPKPV 103

Query: 197 PKPEPEPSTEASSQPSPSSAAPPPPAPTVGQSTPGAQTAPSGSQGPAGLPSGSLNDSDIK 256
K + +P + P +P + ++ + + + S + S +
Sbjct: 104 KKVQEQP------KRDVKPVESRPASPFENTAPARLTSSTATAATSKPVTSVA---SGPR 154

Query: 257 PLRMDPPVYPRMAQARGIEGRVKVLFTITSDGRIDDIQVLESVPSRMFDREVRQAMAKWR 316
L + P YP AQA IEG+VKV F +T DGR+D++Q+L + P+ MF+REV+ AM +WR
Sbjct: 155 ALSRNQPQYPARAQALRIEGQVKVKFDVTPDGRVDNVQILSAKPANMFEREVKNAMRRWR 214

Query: 317 FEPRVSGGKIVAR 329
+EP G IV
Sbjct: 215 YEPGKPGSGIVVN 227


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS26425  TCRTETA  35     6e-04    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 34.8 bits (80), Expect = 6e-04
Identities = 16/48 (33%), Positives = 26/48 (54%), Gaps = 1/48 (2%)

Query: 286 AATLFLFMLLQPIVGALSDKIGRRPILIAFGVLGTVFTYPILSTLHSV 333
A + P++GALSD+ GRRP+L+ + G Y I++T +
Sbjct: 50 ALYALMQFACAPVLGALSDRFGRRPVLLV-SLAGAAVDYAIMATAPFL 96



Score = 34.4 bits (79), Expect = 9e-04
Identities = 37/192 (19%), Positives = 74/192 (38%), Gaps = 33/192 (17%)

Query: 49 KAFFPQGDMTAQLLNTAAIFAVGFLMRPIGGWLMGIYADRKGRKAALLASVLLMCFGSLI 108
+ D+TA A++A LM+ ++G +DR GR+ LL S+ I
Sbjct: 33 RDLVHSNDVTAHYGILLALYA---LMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAI 89

Query: 109 IALTPSYETIGVAAPILLVVARLLQGLSVGGEYGTSATYLSEMANKEQR----GFFSSFQ 164
+A P +L + R++ G++ G + Y++++ + ++R GF S+
Sbjct: 90 MATAPFLW--------VLYIGRIVAGIT-GATGAVAGAYIADITDGDERARHFGFMSACF 140

Query: 165 YVTLISGQLIALAVLIVLQQTLTVEQLESWGWRVPFFIGA----LCAVVAMFLRRGMEET 220
+++G ++ + + PFF A L + FL +
Sbjct: 141 GFGMVAGPVLG-------------GLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKG 187

Query: 221 ESFSKKKEEPKE 232
E ++E
Sbjct: 188 ERRPLRREALNP 199


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
HWH78_RS26430  RTXTOXINA  32     0.008    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 31.9 bits (72), Expect = 0.008
Identities = 11/44 (25%), Positives = 23/44 (52%)

Query: 360 PVAVAVSAITTLLTPYLIRAADPLSQHLANAMPQRMARIFGHYG 403
PV+ V A+T +++ L + + +H+A+ M +A +G
Sbjct: 394 PVSALVGAVTGIISGILEASKQAMFEHVASKMADVIAEWEKKHG 437


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS26455  DHBDHDRGNASE  104    4e-29    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 104 bits (261), Expect = 4e-29
Identities = 78/261 (29%), Positives = 119/261 (45%), Gaps = 24/261 (9%)

Query: 9 GQVALISGAGSELGIGFAIARRLAREGVRLL-ITASSERIRQRAEELSACGHDVRAASAD 67
G++A I+GA GIG A+AR LA +G + + + E++ + L A A AD
Sbjct: 8 GKIAFITGAAQ--GIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 68 LTDEAQVQGLLDWAEAQWGRVDILVNNAGMAQLDSAEPFSAVEATSLRDWQLSLSRNLTS 127
+ D A + + E + G +DILVN AG+ + + + S +W+ + S N T
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRP------GLIHSLSDEEWEATFSVNSTG 119

Query: 128 AFLLTRGLLPGMRERGYGRIVNVASTTGTRGSNPGEAAYSAAKAGLVGWSMGLALEVAKS 187
F +R + M +R G IV V S AAY+++KA V ++ L LE+A+
Sbjct: 120 VFNASRSVSKYMMDRRSGSIVTVGSNPAGV-PRTSMAAYASSKAAAVMFTKCLGLELAEY 178

Query: 188 GITVNSVAPG-------WIATASSTAEER-------QAALASPSGRAGRPEEVAAAVAFL 233
I N V+PG W A E+ P + +P ++A AV FL
Sbjct: 179 NIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFL 238

Query: 234 ASPEASFVNGELLVVDGGNCL 254
S +A + L VDGG L
Sbjct: 239 VSGQAGHITMHNLCVDGGATL 259


134  HWH78_RS26655  HWH78_RS26695  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS26655  -1      10       -1.290502  HAMP domain-containing protein
HWH78_RS26660  -1      10       -0.963061  sigma-54-dependent response regulator
HWH78_RS26670  -1      11       -1.463344  DUF1328 domain-containing protein
HWH78_RS26675  0       11       -1.197077  inhibitor of vertebrate lysozyme family protein
HWH78_RS26680  -1      11       -0.366982  glutamate/aspartate:proton symporter GltP
HWH78_RS26685  -1      10       0.550265   membrane protein
HWH78_RS26690  -2      9        0.424499   DUF2914 domain-containing protein
HWH78_RS26695  -1      9        0.880959   MFS transporter
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS26660  PF06580  53     1e-09    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 53.3 bits (128), Expect = 1e-09
Identities = 28/165 (16%), Positives = 63/165 (38%), Gaps = 27/165 (16%)

Query: 430 LINDLLNFSRYQTGMQKLELASC----DLVDLLTQAQQ-RFIPKGEARRVSLQLELGDEL 484
++ L RY S +VD Q +F R+ + ++ +
Sbjct: 196 MLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQF-----EDRLQFENQINPAI 250

Query: 485 PRLQLDRLQIERVIDNLLENALRHSSEGGQIHLQARRQGDRVLIAVEDNGEGIPFSQQGR 544
+Q+ + ++ +++N +++ + +GG+I L+ + V + VE+ G
Sbjct: 251 MDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSL-------- 302

Query: 545 IFEPFVQVGRKKGGAGLGLALCKEIIQLHGG---RIAVRSQPGQG 586
+ K G GL +E +Q+ G +I + + G+
Sbjct: 303 ------ALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKV 341


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
HWH78_RS26665  HTHFIS  448    e-157    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 448 bits (1154), Expect = e-157
Identities = 165/476 (34%), Positives = 250/476 (52%), Gaps = 39/476 (8%)

Query: 9 GRILLVDDEPAILRTFRYCLEDEGYSVATASSAPQAEALLQRQVFDLCFLDLRLGEDNGL 68
IL+ DD+ AI L GY V S+A + DL D+ + ++N
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 69 DVLAQMRVQAPWMRVVIVTAHSAVDTAVDAMQAGAVDYLVKPCSPDQLRLAAAKQLEVRQ 128
D+L +++ P + V++++A + TA+ A + GA DYL KP +L + L +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 129 LTARLEALEDEVRRQGDGLESHSPAMAAVLETARQVAATDANILILGESGSGKGELARAI 188
R + ++ + G L S AM + ++ TD ++I GESG+GK +ARA+
Sbjct: 124 ---RRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARAL 180

Query: 189 HTWSKRAKKPQVTINCPSLTAELMESELFGHSRGAFTGATESTLGRVSQADGGTLFLDEI 248
H + KR P V IN ++ +L+ESELFGH +GAFTGA + GR QA+GGTLFLDEI
Sbjct: 181 HDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEI 240

Query: 249 GDFPLTLQPKLLRFIQDKEYERVGDPVTRRADVRILAATNRDLGAMVAQGQFREDLLYRL 308
GD P+ Q +LLR +Q EY VG R+DVRI+AATN+DL + QG FREDL YRL
Sbjct: 241 GDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRL 300

Query: 309 NVIVLNLPPLRERAEDILGLAERFLARFVKDYGRPARGFSEAAREAMRQYPWPGNVRELR 368
NV+ L LPPLR+RAEDI L F+ + K+ G + F + A E M+ +PWPGNVREL
Sbjct: 301 NVVPLRLPPLRDRAEDIPDLVRHFVQQAEKE-GLDVKRFDQEALELMKAHPWPGNVRELE 359

Query: 369 NVIERASIICNQELVDVDHLGFSAAQSASSAPR---------------IGESLS------ 407
N++ R + + Q+++ + + +P + E++
Sbjct: 360 NLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASF 419

Query: 408 -------------LEDLEKAHITAVMASSA-TLDQAAKTLGIDASTLYRKRKQYGL 449
L ++E I A + ++ +AA LG++ +TL +K ++ G+
Sbjct: 420 GDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits        Score  E-value  Comments
HWH78_RS26680  V8PROTEASE  32     0.007    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 31.5 bits (71), Expect = 0.007
Identities = 7/41 (17%), Positives = 18/41 (43%)

Query: 292 AYGAPKAISSFVVPTGYSFNLDGSTLYQSIAAIFIAQLYGI 332
+ A ++ + TGY + +T+++S I + +
Sbjct: 186 SNNAETQVNQNITVTGYPGDKPVATMWESKGKITYLKGEAM 226


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS26690  PF07520  29     0.041    Virulence protein SrfB
		>PF07520#Virulence protein SrfB

Length = 1041

Score = 28.8 bits (64), Expect = 0.041
Identities = 5/30 (16%), Positives = 15/30 (50%)

Query: 1 MPNQQRPILKARLEQLIALVGGLMARYPRT 30
Q++ ++++R+ + LV ++ T
Sbjct: 506 TSVQEQAMIRSRVSGALTLVKEMLGTKDGT 535


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS26695  TCRTETA  37     1e-04    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 36.7 bits (85), Expect = 1e-04
Identities = 59/273 (21%), Positives = 97/273 (35%), Gaps = 31/273 (11%)

Query: 61 LMRPLGAVFLGAYIDRHGRRQGLIITLGLMAMGTLLIAFVPGYATLGVAAPLLVLF-GRL 119
LM+ A LGA DR GRR L+++ L YA + A L VL+ GR+
Sbjct: 54 LMQFACAPVLGALSDRFGRRPVLLVS---------LAGAAVDYAIMATAPFLWVLYIGRI 104

Query: 120 LQGFSAGVELGGVSVYLSEIATPGRKGFFVSWQSASQQVAVVFAGLLGVLLNQWLSPQDM 179
+ G + G Y+++I + + SA +V +LG L M
Sbjct: 105 VAGIT-GATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGL---------M 154

Query: 180 GEWGWRVPFFIGCLIVPALFVIRRSLEETPEFEARTHRPSLSQVLRSIGQNFGVVLAGTA 239
G + PFF + F+ PE RP L + + +F T
Sbjct: 155 GGFSPHAPFFAAAALNGLNFLT--GCFLLPESHKGERRP-LRREALNPLASFRWARGMTV 211

Query: 240 MVVMTTVSFYLI------TAYTPTFGKNELQLSDLDSLLVTMCIGLSN-FIWLPVMGAFS 292
+ + V F + A FG++ + G+ + + G +
Sbjct: 212 VAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVA 271

Query: 293 DRIGRKPLLLGASALALLTAYPALSWLVREPSF 325
R+G + L+ +A T Y L++ R
Sbjct: 272 ARLGERRALM-LGMIADGTGYILLAFATRGWMA 303



Score = 31.3 bits (71), Expect = 0.006
Identities = 16/45 (35%), Positives = 21/45 (46%), Gaps = 2/45 (4%)

Query: 258 FGKNELQLSDLDSLLVTMCIGLSNFIWLPVMGAFSDRIGRKPLLL 302
+ + LL L F PV+GA SDR GR+P+LL
Sbjct: 35 LVHSNDVTAHYGILLA--LYALMQFACAPVLGALSDRFGRRPVLL 77


135  HWH78_RS27290  HWH78_RS27315  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS27290  0       13       -0.236168  phosphate signaling complex protein PhoU
HWH78_RS27295  -1      12       0.019932   response regulator
HWH78_RS27300  0       14       0.180436   peptidoglycan DD-metalloendopeptidase family
HWH78_RS27305  0       14       0.251177   hemolysin family protein
HWH78_RS27310  -1      12       1.142659   phosphate regulon sensor histidine kinase PhoR
HWH78_RS27315  -2      9        1.350817   phosphate regulon transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits        Score  E-value  Comments
HWH78_RS27290  FLGHOOKAP1  28     0.033    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 28.4 bits (63), Expect = 0.033
Identities = 22/88 (25%), Positives = 41/88 (46%), Gaps = 9/88 (10%)

Query: 11 ISQQFNAELEDVRSHLLAMGGLVEKQVNDAVNALIDADSGLAQQVREIDDQINQMERNID 70
+ QF + +R +KQVN A+ A +D + A+Q+ ++DQI+++
Sbjct: 139 LVNQFKTTDQYLR--------DQDKQVNIAIGASVDQINNYAKQIASLNDQISRLTGVGA 190

Query: 71 EECVR-ILARRQPAASDLRLIISISKSV 97
+L +R S+L I+ + SV
Sbjct: 191 GASPNNLLDQRDQLVSELNQIVGVEVSV 218


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
HWH78_RS27295  HTHFIS  91     8e-23    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 91.4 bits (227), Expect = 8e-23
Identities = 29/139 (20%), Positives = 63/139 (45%), Gaps = 4/139 (2%)

Query: 1 MSKVSALVVDDAPFIRDLMKKGLRDNFPGLHIEEAVNGRKAQQLLSRQNVDLILCDWEMP 60
M+ + LV DD IR ++ + L + G + N + ++ + DL++ D MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQAL--SRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMP 58

Query: 61 EMSGLELLTWCRAQENLKTTPFIMVTSRGDKENVVQAIQAGVSDYIGKPFSNDQLVAKIK 120
+ + +LL + P ++++++ ++A + G DY+ KPF +L+ I
Sbjct: 59 DENAFDLLPRIKKAR--PDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116

Query: 121 KALSRSGKLEALAAHAPRR 139
+AL+ + + +
Sbjct: 117 RALAEPKRRPSKLEDDSQD 135


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS27310  PF06580  38     6e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 37.9 bits (88), Expect = 6e-05
Identities = 22/99 (22%), Positives = 36/99 (36%), Gaps = 25/99 (25%)

Query: 333 LVFNAVKY----TPDEGEIRIRWWADEQGAHLSVQDTGIGVDPKHLPRLTERFYRVDSSR 388
LV N +K+ P G+I ++ D L V++TG
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALK----------------- 305

Query: 389 ASNTGGTGLGLAIVKHVLIR---HRARLEISSVPGKGST 424
+ TG GL V+ L A++++S GK +
Sbjct: 306 -NTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNA 343


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
HWH78_RS27315  HTHFIS  100    2e-26    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 99.5 bits (248), Expect = 2e-26
Identities = 39/124 (31%), Positives = 63/124 (50%), Gaps = 2/124 (1%)

Query: 1 MVGKTILIVDDEAPIREMIAVALEMAGYECLEAENTQQAHAVIVDRKPDLILLDWMLPGT 60
M G TIL+ DD+A IR ++ AL AGY+ N I DL++ D ++P
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 SGIELARRLKRDELTVDIPIIMLTAKGEEDNKIQGLEVGADDYITKPFSPRELVARLKAV 120
+ +L R+K+ D+P+++++A+ I+ E GA DY+ KPF EL+ +
Sbjct: 61 NAFDLLPRIKKARP--DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 121 LRRT 124
L
Sbjct: 119 LAEP 122


136  HWH78_RS27450  HWH78_RS27505  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS27450  -1      12       -1.637926  DUF4870 domain-containing protein
HWH78_RS27455  -3      12       -1.033635  catabolite repression control protein Crc
HWH78_RS27460  -3      10       0.113307   orotate phosphoribosyltransferase
HWH78_RS27465  -1      10       -0.438859  hypothetical protein
HWH78_RS27470  -1      10       -0.588126  acyl-CoA thioesterase
HWH78_RS27475  -1      9        -0.715779  cytochrome c5 family protein
HWH78_RS27480  -2      7        -0.300543  FAD-binding protein
HWH78_RS27485  -1      10       -0.029531  DSD1 family PLP-dependent enzyme
HWH78_RS27490  -1      13       0.192691   transporter
HWH78_RS27495  -1      9        0.850909   AraC family transcriptional regulator SphR
HWH78_RS27500  -2      10       1.249587   acetylglutamate kinase
HWH78_RS27505  -1      10       1.256083   phosphomannomutase/phosphoglucomutase
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS27450  ACRIFLAVINRP  28     0.010    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 27.9 bits (62), Expect = 0.010
Identities = 7/41 (17%), Positives = 18/41 (43%)

Query: 65 FQITVALAMFVSFLLMLVVIGFFLLGLVCLAALVLTIIAGI 105
VA++ V FL + + + + + + + L I+ +
Sbjct: 872 APALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVL 912


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS27460  PF00577  28     0.041    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 27.9 bits (62), Expect = 0.041
Identities = 12/46 (26%), Positives = 23/46 (50%)

Query: 105 HGEGGTLVGAPLSGRVLIIDDVITAGTAIREVMQIIDAQGARAAGV 150
H + + +SG VL + +T G + + + ++ A GA+ A V
Sbjct: 686 HSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKV 731


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits       Score  E-value  Comments
HWH78_RS27465  RTXTOXIND  30     0.008    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 29.8 bits (67), Expect = 0.008
Identities = 17/109 (15%), Positives = 36/109 (33%), Gaps = 1/109 (0%)

Query: 82 LRQRKAAQAQASSDAQLLRLYSSLEDVDRARERRLAELDGLSSVARGNLQSLKLQQANLQ 141
L + + + + L +SSL + + E + A L+ K Q ++
Sbjct: 220 LARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIE 279

Query: 142 GQAAN-QERAGRPVAQALVDQLDDLKQEEKRLQGEIGRFQKAREDAERT 189
+ + +E + LD L+Q + K E + +
Sbjct: 280 SEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQAS 328


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
HWH78_RS27485  ALARACEMASE  29     0.041    Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 28.6 bits (64), Expect = 0.041
Identities = 26/147 (17%), Positives = 49/147 (33%), Gaps = 21/147 (14%)

Query: 33 IDLDRLDHNIDVVMRSVRRGGKHLRL--VEKSLPSPGLLAYIARRAGTRRLMSFHQPFLN 90
+DL L N+ VR+ H R+ V K+ + I G
Sbjct: 9 LDLQALKQNL----SIVRQAATHARVWSVVKANAYGHGIERIWSAIGATDGF-------- 56

Query: 91 HDAVAFADADILL---GKPLPVRSAELFYREHKGAFDPARQLQWLIDTPQRLRQYLALAQ 147
A+ + I L G P+ E F+ +L + + +L+
Sbjct: 57 --ALLNLEEAITLRERGWKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKALQNAR- 113

Query: 148 GLGTRMRVNIELDVGLHRGGVADQAAL 174
L + + ++++ G++R G L
Sbjct: 114 -LKAPLDIYLKVNSGMNRLGFQPDRVL 139


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
HWH78_RS27495  HTHFIS  29     0.039    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 28.6 bits (64), Expect = 0.039
Identities = 19/136 (13%), Positives = 41/136 (30%), Gaps = 18/136 (13%)

Query: 161 RALVGPAFEPLGIELIHAAPPYAGEYLRLLGPQVRFGCLHNRMAIASHWLDMRLPNHNLP 220
R + + E+I + R G L A+ + R +
Sbjct: 364 RLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENM---RQYFASFG 420

Query: 221 ALRQALALLEQESTQVHRKLDLVQAVERAIARDLSLGSQIERISAELNMSSRTLRRRLAE 280
L ++ ++ L ++ A+ + + L ++ TLR+++ E
Sbjct: 421 DALPPSGLYDRVLAEMEYPL-ILAALTAT-------RGNQIKAADLLGLNRNTLRKKIRE 472

Query: 281 HGLTFEALLEQVRRGR 296
G+ V R
Sbjct: 473 LGV-------SVYRSS 481


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS27500  CARBMTKINASE  55     8e-11    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 54.8 bits (132), Expect = 8e-11
Identities = 66/301 (21%), Positives = 116/301 (38%), Gaps = 61/301 (20%)

Query: 26 VGKTLVIKYGGNAMESEELKAGF----------ARDVVLMKAVGINPVVVHGGGPQIGDL 75
+GK +VI GGNA++ K + AR + + A G V+ HG GPQ+G L
Sbjct: 1 MGKRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSL 60

Query: 76 LKRLSIESHFIDGMRVTDAATMDVV-----------------EMVLGGQVNKDIVNLINR 118
L L +++ A MDV + + K +V +I +
Sbjct: 61 L--LHMDAG--QATYGIPAQPMDVAGAMSQGWIGYMIQQALKNELRKRGMEKKVVTIITQ 116

Query: 119 -----------HGGSAIG--LTGKDAELIRAKKLTVTRQ---------TPEMTKPEIIDI 156
+ +G + A+ + +K + ++ P ++
Sbjct: 117 TIVDKNDPAFQNPTKPVGPFYDEETAKRLAREKGWIVKEDSGRGWRRVVPSPDPKGHVEA 176

Query: 157 GHVGEVTGVNVGLLNMLVKGDFIPVIAPIGVGSNGESYNINADLVAGKVAEALKAEKLML 216
+ V G++ + G +PVI G E+ I+ DL K+AE + A+ M+
Sbjct: 177 ETIK--KLVERGVIVIASGGGGVPVILEDGEIKGVEAV-IDKDLAGEKLAEEVNADIFMI 233

Query: 217 LTNIAGLMDKQG----QVLTGLSTEQVNELIADGT-IYGGMLPKIRCALEAVQGGVTSAH 271
LT++ G G Q L + E++ + +G G M PK+ A+ ++ G A
Sbjct: 234 LTDVNGAALYYGTEKEQWLREVKVEELRKYYEEGHFKAGSMGPKVLAAIRFIEWGGERAI 293

Query: 272 I 272
I
Sbjct: 294 I 294


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS27505  PF03544  30     0.042    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 29.6 bits (66), Expect = 0.042
Identities = 16/109 (14%), Positives = 30/109 (27%), Gaps = 4/109 (3%)

Query: 317 QRTAKPPVPSLPGFAPLIQALARQPRR-KPEPTSVPSPAKAAPVAPVAVAKAPPREEPAL 375
Q P + A P+ +P P V P P +AP E
Sbjct: 39 QVIELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPK 98

Query: 376 ADPLFQNTDILDIDILDEDQDLLGLEQT---PIMSTAKAPTLPASIFRA 421
P + + ++ D + + A+ + A+ +
Sbjct: 99 PKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATS 147


137  HWH78_RS29105  HWH78_RS29165  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias     Product
HWH78_RS29105  1       27       -2.033991   GNAT family N-acetyltransferase
HWH78_RS29115  2       34       -3.044235   sulfonamide-resistant dihydropteroate synthase
HWH78_RS29120  2       36       -3.690762   phage integrase N-terminal SAM-like
HWH78_RS29130  3       34       -3.393590   IS91 family transposase
HWH78_RS29135  3       42       -4.649342   LysR family transcriptional regulator
HWH78_RS29140  2       49       -6.323052   tetracycline efflux MFS transporter Tet(G)
HWH78_RS29145  3       52       -7.355710   tetracycline resistance transcriptional
HWH78_RS29150  2       53       -8.057061   chloramphenicol/florfenicol efflux MFS
HWH78_RS29155  3       56       -9.263091   dihydropteroate synthase
HWH78_RS29160  5       65       -12.935539  quaternary ammonium compound efflux SMR
HWH78_RS29165  5       64       -12.647423  chloramphenicol efflux MFS transporter CmlA6
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS29115  SACTRNSFRASE  38     3e-06    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 38.4 bits (89), Expect = 3e-06
Identities = 17/70 (24%), Positives = 27/70 (38%), Gaps = 15/70 (21%)

Query: 90 AYLHKLAVRRTHAGRGVSSALIEACRHAARTQGCAKLRLD--------CHPNLRGLYERL 141
A + +AV + + +GV +AL+ A+ L L+ CH Y +
Sbjct: 90 ALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACH-----FYAKH 144

Query: 142 GFT--HVDTF 149
F VDT
Sbjct: 145 HFIIGAVDTM 154


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS29145  TCRTETA  483    e-173    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 483 bits (1245), Expect = e-173
Identities = 236/383 (61%), Positives = 288/383 (75%)

Query: 3 SSAIIALLIVGLDAMGLGLIMPVLPTLLRELVPAEQVAGHYGALLSLYALMQVVFAPMLG 62
I+ L V LDA+G+GLIMPVLP LLR+LV + V HYG LL+LYALMQ AP+LG
Sbjct: 5 RPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLG 64

Query: 63 QLSDSYGRRPVLLASLAGAAVDYTIMASAPVLWVLYIGRLVSGVTGATGAVAASTIADST 122
LSD +GRRPVLL SLAGAAVDY IMA+AP LWVLYIGR+V+G+TGATGAVA + IAD T
Sbjct: 65 ALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADIT 124

Query: 123 GEGSRARWFGYMGACYGAGMIAGPALGGMLGGISAHAPFIAAALLNGFAFLLACIFLKET 182
RAR FG+M AC+G GM+AGP LGG++GG S HAPF AAA LNG FL C L E+
Sbjct: 125 DGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPES 184

Query: 183 HHSHGGTGKPVRIKPFVLLRLDDALRGLGALFAVFFIIQLIGQVPAALWVIYGEDRFQWN 242
H + + P R + + AL AVFFI+QL+GQVPAALWVI+GEDRF W+
Sbjct: 185 HKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWD 244

Query: 243 TATVGLSLAAFGATHAIFQAFVTGPLSSRLGERRTLLFGMAADATGFVLLAFATQGWMVF 302
T+G+SLAAFG H++ QA +TGP+++RLGERR L+ GM AD TG++LLAFAT+GWM F
Sbjct: 245 ATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAF 304

Query: 303 PILLLLAAGGVGMPALQAMLSNNVSSNKQGALQGTLTSLTNLSSIAGPLGFTALYSATAG 362
PI++LLA+GG+GMPALQAMLS V +QG LQG+L +LT+L+SI GPL FTA+Y+A+
Sbjct: 305 PIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASIT 364

Query: 363 AWNGWVWIVGAILYLICLPILRR 385
WNGW WI GA LYL+CLP LRR
Sbjct: 365 TWNGWAWIAGAALYLLCLPALRR 387


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS29150  TETREPRESSOR  312    e-111    Tetracycline repressor protein signature.
		>TETREPRESSOR#Tetracycline repressor protein signature.

Length = 218

Score = 312 bits (800), Expect = e-111
Identities = 102/205 (49%), Positives = 138/205 (67%), Gaps = 2/205 (0%)

Query: 1 MTKLDKGTVIAAALELLNEVGMDSLTTRKLAERLKVQQPALYWHFQNKRALLDALAEAML 60
M +L++ +VI AALELLNE G+D LTTRKLA++L ++QP LYWH +NKRALLDALA +L
Sbjct: 1 MARLNRESVIDAALELLNETGIDGLTTRKLAQKLGIEQPTLYWHVKNKRALLDALAVEIL 60

Query: 61 AERHTRSLPEENEDWRVFLKENALSFRTALLSYRDGARIHAGTRPTEPNFGTAETQIRFL 120
A H SLP E W+ FL+ NA+SFR ALL YRDGA++H GTRP E + T ETQ+RF+
Sbjct: 61 ARHHDYSLPAAGESWQSFLRNNAMSFRRALLRYRDGAKVHLGTRPDEKQYDTVETQLRFM 120

Query: 121 CAEGFCPKRAVWALRAVSHYVVGSVLEQQASDADERVPDRPDVSEQAPSSFLHDLFHELE 180
GF + ++A+ AVSH+ +G+VLEQQ A DRP ++ L + ++
Sbjct: 121 TENGFSLRDGLYAISAVSHFTLGAVLEQQEHTAAL--TDRPAAPDENLPPLLREALQIMD 178

Query: 181 TDGMDAAFNFGLDSLIAGFERLRSS 205
+D + AF GL+SLI GFE ++
Sbjct: 179 SDDGEQAFLHGLESLIRGFEVQLTA 203


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS29160  TCRTETB  63     5e-13    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 63.4 bits (154), Expect = 5e-13
Identities = 31/138 (22%), Positives = 58/138 (42%), Gaps = 2/138 (1%)

Query: 37 VPAMPGVLNTTPSIIQLTLSLYMVMLGVGQVIFGPLSDRVGRRPILLVGATAFVAASLGA 96
+P + N P+ + +M+ +G ++G LSD++G + +LL G S+
Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96

Query: 97 ACSSTALAFVAF-RLVQAVGASAMLVATFATVRDVYANRPEGAVIYGLFSSMLAFVPALG 155
+ + + R +Q GA A A V Y + +GL S++A +G
Sbjct: 97 FVGHSFFSLLIMARFIQGAGA-AAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVG 155

Query: 156 PIAGALIGEFWGWQAIFI 173
P G +I + W + +
Sbjct: 156 PAIGGMIAHYIHWSYLLL 173


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS29175  TCRTETB  58     2e-11    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 58.4 bits (141), Expect = 2e-11
Identities = 34/146 (23%), Positives = 69/146 (47%), Gaps = 2/146 (1%)

Query: 36 AVPFMPNALGTTASTIQLTLTTYLVMIGAGQLLFGPLSDRLGRRPVLLGGGLAYVVASM- 94
++P + N ++ T +++ G ++G LSD+LG + +LL G + S+
Sbjct: 36 SLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVI 95

Query: 95 GLALTSSAEVFLGLRILQACGASACLVSTFATVRDIYAGREESNVIYGILGSMLAIVPAV 154
G S + + R +Q GA+A + V Y +E +G++GS++A+ V
Sbjct: 96 GFVGHSFFSLLIMARFIQGAGAAA-FPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGV 154

Query: 155 GPLLGALVDMWLGWRAIFAFLGLGMI 180
GP +G ++ ++ W + + +I
Sbjct: 155 GPAIGGMIAHYIHWSYLLLIPMITII 180


138  HWH78_RS29565  HWH78_RS29600  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias    Product
HWH78_RS29565  2       10       -3.394638  type IV pilus assembly protein PilM
HWH78_RS29570  0       10       -3.278285  type 4a pilus biogenesis protein PilN
HWH78_RS29575  -1      11       -1.992528  type 4a pilus biogenesis protein PilO
HWH78_RS29580  -1      10       -1.371672  type 4a pilus biogenesis lipoprotein PilP
HWH78_RS29585  0       11       -1.262657  type 4a pilus secretin PilQ
HWH78_RS29590  -1      12       -0.602153  shikimate kinase AroK
HWH78_RS29595  -1      12       -0.586162  3-dehydroquinate synthase
HWH78_RS29600  -2      11       -0.006261  AAA family ATPase
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS29565  SHAPEPROTEIN  32     0.002    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 32.4 bits (74), Expect = 0.002
Identities = 40/158 (25%), Positives = 63/158 (39%), Gaps = 38/158 (24%)

Query: 197 VVDIGATMTTLSVLHNGRTIYTREQLFGGRQLTEEI----QRRYGLSVEE--AGLAKKQG 250
VVDIG T ++V+ +Y+ GG + E I +R YG + E A K +
Sbjct: 163 VVDIGGGTTEVAVISLNGVVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEI 222

Query: 251 G--LPDDYDSEV-------------------------LRPFKDAVVQQVSRSLQFF---F 280
G P D E+ L+ +V V +L+
Sbjct: 223 GSAYPGDEVREIEVRGRNLAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPEL 282

Query: 281 AAGQFNDVDYIVLAGGTASIQDLDRLIQQKIGTPTLVA 318
A+ +VL GG A +++LDRL+ ++ G P +VA
Sbjct: 283 ASDISER--GMVLTGGGALLRNLDRLLMEETGIPVVVA 318


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS29585  BCTERIALGSPD  312    1e-98    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 312 bits (800), Expect = 1e-98
Identities = 110/419 (26%), Positives = 183/419 (43%), Gaps = 53/419 (12%)

Query: 325 VPWDQALDLVLKTKGLDKRKLGNVLLVAPADEIAARERQEL--------EAQKQIAELAP 376
+ W A D+V L+K + L + + A ER Q+ IA +
Sbjct: 199 LSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNSRQRIIAMIKQ 258

Query: 377 LRRE--------LIQVNYAKAADIAKLFQSVTSDGGQEGKEGGRGS--------ITVDDR 420
L R+ +I + YAKA+D+ ++ + S Q K+ + I +
Sbjct: 259 LDRQQATQGNTKVIYLKYAKASDLVEVLTGI-SSTMQSEKQAAKPVAALDKNIIIKAHGQ 317

Query: 421 TNSIIAYQPQERLDELRRIVSQLDIPVRQVMIEARIVEANVGYDKSLGVRWGGAYHKGNW 480
TN++I + +++L R+++QLDI QV++EA I E +LG++W
Sbjct: 318 TNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWANKN----- 372

Query: 481 NGYGKDGNIGIKDEDGMNCGPIAGNCTFPTTGTSKSPSPFVDIGAKDATSGIGIGFITDN 540
G + N G+ + AG + GT S A + +GI GF N
Sbjct: 373 AGMTQFTNSGLPISTAI-----AGANQYNKDGTVSSSLA----SALSSFNGIAAGFYQGN 423

Query: 541 IILDLQLSAMEKTGNGEIVSQPKVVTSDKETAKILKGQEVPYQEASSSGATSTSF----- 595
+ L+A+ + +I++ P +VT D A GQEVP S + + F
Sbjct: 424 --WAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTGSQTTSGDNIFNTVER 481

Query: 596 KEAALSLEVTPQITPDNRIIVEVK-----VTKDAPDFDRALNGVPPINKNEVNAKILVND 650
K + L+V PQI + +++E++ V A L N VN +LV
Sbjct: 482 KTVGIKLKVKPQINEGDSVLLEIEQEVSSVADAASSTSSDLGAT--FNTRTVNNAVLVGS 539

Query: 651 GETIVIGGVFSNTQSKSVDKVPFLGDLPYLGRLFRRDTVSDVKNELLVFLTPRIMNNQA 709
GET+V+GG+ + S + DKVP LGD+P +G LFR + K L++F+ P ++ ++
Sbjct: 540 GETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRD 598



Score = 52.6 bits (126), Expect = 4e-09
Identities = 31/188 (16%), Positives = 74/188 (39%), Gaps = 13/188 (6%)

Query: 281 GEKLSLNFQDIDVRSVLQLIADFTDLNLVASDTVQGNITLRLQN-VPWDQALDL---VLK 336
E+ S +F+ D++ + ++ + ++ +V+G IT+R + + +Q VL
Sbjct: 27 AEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDMLNEEQYYQFFLSVLD 86

Query: 337 TKGLDKRKLGN-VLLVAPADEIAARERQELEAQKQIAELAPLRRELIQVNYAKAADIAKL 395
G + N VL V + + A + + + ++ + A D+A L
Sbjct: 87 VYGFAVINMNNGVLKVVRSKD-AKTAAVPVASDAAPGIGDEVVTRVVPLTNVAARDLAPL 145

Query: 396 FQSVTSDGGQEGKEGGRGSITVDDRTNSIIAYQPQERLDELRRIVSQLDIPVRQVMIEAR 455
+ + + G GS+ + +N ++ + L IV ++D + ++
Sbjct: 146 LRQLNDN-------AGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDNAGDRSVVTVP 198

Query: 456 IVEANVGY 463
+ A+
Sbjct: 199 LSWASAAD 206


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS29590  PF05272  29     0.011    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.9 bits (64), Expect = 0.011
Identities = 11/33 (33%), Positives = 16/33 (48%), Gaps = 2/33 (6%)

Query: 4 LILVGPMGAGKSTIGRLLAKELHLAFKDSDKEI 36
++L G G GKST+ L F D+ +I
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDF--FSDTHFDI 629


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS29600  PF03544  43     1e-06    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 43.0 bits (101), Expect = 1e-06
Identities = 21/108 (19%), Positives = 31/108 (28%)

Query: 355 LPSAAVPPTVSSSAPPVTPLANNGVTPMHPVPPAPTEPTAPAATPTPAQTPAPAAPVASA 414
+ A + P + PP + P PP P P P P V
Sbjct: 55 VAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQP 114

Query: 415 PASKPAPAPAPAKPAASKPATTAAAKPAPAPAAKPASGVGAGSQWYRN 462
PA P + + A A +KP + V +G +
Sbjct: 115 KRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRALSR 162



Score = 43.0 bits (101), Expect = 1e-06
Identities = 24/104 (23%), Positives = 32/104 (30%), Gaps = 1/104 (0%)

Query: 355 LPSAAVPPTVSSSAPPVTPLANNGVTPMHPVPPAPTEPTAPAATPTPAQTPAPAAPVASA 414
LP+ A P +V+ AP P PV EP P A
Sbjct: 43 LPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPK 102

Query: 415 PASKPA-PAPAPAKPAASKPATTAAAKPAPAPAAKPASGVGAGS 457
P KP P + + A+ APA +S A +
Sbjct: 103 PKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAAT 146



Score = 34.6 bits (79), Expect = 7e-04
Identities = 24/72 (33%), Positives = 26/72 (36%), Gaps = 11/72 (15%)

Query: 383 HPVPPAPTEP-----TAPAATPTPAQTPAPAAPVASAPASKPAPAPAPAKPAASKPATTA 437
PAP +P APA P P PV +P P P P P K A
Sbjct: 40 VIELPAPAQPISVTMVAPADLEPPQAVQPPPEPVV-----EPEPEPEPI-PEPPKEAPVV 93

Query: 438 AAKPAPAPAAKP 449
KP P P KP
Sbjct: 94 IEKPKPKPKPKP 105


139  HWH78_RS30465  HWH78_RS30495  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias   Product
HWH78_RS30465  3       21       3.038903  urease subunit alpha
HWH78_RS30470  0       18       3.570653  urease subunit beta
HWH78_RS30475  0       17       3.031977  L-methionine sulfoximine N-acetyltransferase
HWH78_RS30480  -1      17       3.029683  urease subunit gamma
HWH78_RS30485  -1      16       3.189138  urease accessory protein UreD
HWH78_RS30490  -2      14       1.161620  GNAT family N-acetyltransferase
HWH78_RS30495  -1      14       0.953636  urea ABC transporter ATP-binding subunit UrtE
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
HWH78_RS30465  UREASE  1096   0.0      Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 1096 bits (2837), Expect = 0.0
Identities = 422/567 (74%), Positives = 480/567 (84%), Gaps = 2/567 (0%)

Query: 2 KISRQAYADMFGPTVGDRVRLADTDLWLEVERDFTVYGEEVKFGGGKVIRDGMGQSQL-G 60
++SR AYA+MFGPTVGD+VRLADT+L++EVE+DFT +GEEVKFGGGKVIRDGMGQSQ+
Sbjct: 4 RMSRAAYANMFGPTVGDKVRLADTELFIEVEKDFTTHGEEVKFGGGKVIRDGMGQSQVTR 63

Query: 61 AAQVVDTVITNALILDHWGVVKADVGLKDGRIQAIGKAGNPDIQPGVNIAIGAGTEVIAG 120
VDTVITNALILDHWG+VKAD+GLKDGRI AIGKAGNPD+QPGV I +G GTEVIAG
Sbjct: 64 EGGAVDTVITNALILDHWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAG 123

Query: 121 EGMILTAGGIDTHIHFICPQQIEEALMSGVTTMIGGGTGPAAGTNATTCTSGPWHMARML 180
EG I+TAGG+D+HIHFICPQQIEEALMSG+T M+GGGTGPA GT ATTCT GPWH+ARM+
Sbjct: 124 EGKIVTAGGMDSHIHFICPQQIEEALMSGLTCMLGGGTGPAHGTLATTCTPGPWHIARMI 183

Query: 181 QAADAFPMNIGFTGKGNASLPLPLEEQVLAGAIGLKLHEDWGSTPAAIDNCLEVAERHDI 240
+AADAFPMN+ F GKGNASLP L E VL GA LKLHEDWG+TPAAID CL VA+ +D+
Sbjct: 184 EAADAFPMNLAFAGKGNASLPGALVEMVLGGATSLKLHEDWGTTPAAIDCCLSVADEYDV 243

Query: 241 QVAIHTDTLNESGFVETTLGAFKGRTIHTYHTEGAGGGHAPDIIKACGFANVLPSSTNPT 300
QV IHTDTLNESGFVE T+ A KGRTIH YHTEGAGGGHAPDII+ CG NV+PSSTNPT
Sbjct: 244 QVMIHTDTLNESGFVEDTIAAIKGRTIHAYHTEGAGGGHAPDIIRICGQPNVIPSSTNPT 303

Query: 301 RPFTRNTIDEHLDMLMVCHHLDPAIAEDVAFAESRIRRETIAAEDILHDLGAFSMISSDS 360
RP+T NT+ EHLDMLMVCHHL P I ED+AFAESRIR+ETIAAEDILHD+GAFS+ISSDS
Sbjct: 304 RPYTVNTLAEHLDMLMVCHHLSPTIPEDIAFAESRIRKETIAAEDILHDIGAFSIISSDS 363

Query: 361 QAMGRVGEVITRTWQTADKMKRQRGRLDGDGARNDNFRARRYIAKYTINPAITHGISHEV 420
QAMGRVGEV RTWQTADKMKRQRGRL + NDNFR +RYIAKYTINPAI HG+SHE+
Sbjct: 364 QAMGRVGEVAIRTWQTADKMKRQRGRLKEETGDNDNFRVKRYIAKYTINPAIAHGLSHEI 423

Query: 421 GSVEAGKWADLVLWRPAFFGVKPSLILKGGAIAASLMGDINGSIPTPQPVHYRPMFASYA 480
GS+E GK ADLVLW PAFFGVKP ++L GG IAA+ MGD N SIPTPQPVHYRPMF +Y
Sbjct: 424 GSLEVGKRADLVLWNPAFFGVKPDMVLLGGTIAAAPMGDPNASIPTPQPVHYRPMFGAYG 483

Query: 481 GSRHATSLTFVSQAAFAAGVPQQLGLRKAIGVVSGCR-GVQKSDLIHNGYLPTIEVDAQN 539
SR +S+TFVSQA+ AG+ +LG+ K + V R G+ K+ +IHN P IEVD +
Sbjct: 484 RSRTNSSVTFVSQASLDAGLAGRLGVAKELVAVQNTRGGIGKASMIHNSLTPHIEVDPET 543

Query: 540 YQVRADGQLLWCEPADVLPMAQRYFLF 566
Y+VRADG+LL CEPA VLPMAQRYFLF
Sbjct: 544 YEVRADGELLTCEPATVLPMAQRYFLF 570


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS30475  SACTRNSFRASE  42     3e-07    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 41.9 bits (98), Expect = 3e-07
Identities = 15/63 (23%), Positives = 26/63 (41%), Gaps = 1/63 (1%)

Query: 81 RGTVEHSVYVRDDQRGKGLGVQLLQALIERARAQGLHVMVAAIESGNAASIGLHRRLGFE 140
+E + V D R KG+G LL IE A+ ++ + N ++ + + F
Sbjct: 89 YALIED-IAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFI 147

Query: 141 ISG 143
I
Sbjct: 148 IGA 150


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits          Score  E-value  Comments
HWH78_RS30490  SACTRNSFRASE  37     8e-06    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 36.8 bits (85), Expect = 8e-06
Identities = 16/74 (21%), Positives = 34/74 (45%), Gaps = 1/74 (1%)

Query: 57 DGQPVGLLVTRETADGFL-VDNLAVLPECKGQGIGRQLLERAERDATSLGYRSLYLYTNE 115
+ +G + R +G+ ++++AV + + +G+G LL +A A + L L T +
Sbjct: 73 ENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQD 132

Query: 116 RMTENIELYARVGY 129
YA+ +
Sbjct: 133 INISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS30495  PF05272  28     0.045    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 27.7 bits (61), Expect = 0.045
Identities = 13/37 (35%), Positives = 19/37 (51%)

Query: 14 SHILRGLSFEAKVGEVTCLLGRNGVGKTTLLRCLMGL 50
H+ R + K L G G+GK+TL+ L+GL
Sbjct: 583 GHVARVMEPGCKFDYSVVLEGTGGIGKSTLINTLVGL 619


140  HWH78_RS30905  HWH78_RS30935  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag       DNBias  CDNBias  %GCBias   Product
HWH78_RS30905  -1      11       2.070019  cyclic di-GMP phosphodiesterase
HWH78_RS30910  1       11       2.003311  serine protein kinase RIO
HWH78_RS30915  2       13       2.494210  EamA family transporter
HWH78_RS30920  2       12       2.239551  nucleotidyltransferase family protein
HWH78_RS30925  3       13       1.915559  Cu(I)-responsive transcriptional regulator
HWH78_RS30930  2       14       2.217407  two-component system sensor histidine kinase
HWH78_RS30935  1       13       2.434717  two-component system response regulator PrmA
ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
HWH78_RS30905  HTHFIS  74     2e-16    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 74.1 bits (182), Expect = 2e-16
Identities = 29/120 (24%), Positives = 52/120 (43%), Gaps = 6/120 (5%)

Query: 13 VLVVDDTPDNLLLMRELLE-EQYRVRTAGSGPAGLRAAVEEPRPDLILLDVNMPGMDGYE 71
+LV DD ++ + L Y VR + R + DL++ DV MP + ++
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRW-IAAGDGDLVVTDVVMPDENAFD 64

Query: 72 VCRRLKA-DPLTRDIPLMFLTARADRDDEQQGLALGAVDYLGKPVSPPIVLARVRTHLQL 130
+ R+K P D+P++ ++A+ + GA DYL KP ++ + L
Sbjct: 65 LLPRIKKARP---DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits         Score  E-value  Comments
HWH78_RS30910  IGASERPTASE  30     0.014    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 30.0 bits (67), Expect = 0.014
Identities = 15/75 (20%), Positives = 28/75 (37%), Gaps = 4/75 (5%)

Query: 218 PELRQTRYAKEMWALYEAGELTAETPLSGTFVEAEEAADVRAVLREIEAAQREEARRQAR 277
+ AKE + +A T E SG+ E +E +E ++EE +
Sbjct: 1061 TTAQNREVAKEAKSNVKANTQTNEVAQSGS--ETKETQ--TTETKETATVEKEEKAKVET 1116

Query: 278 RQADDAPRGEREEPP 292
+ + P+ + P
Sbjct: 1117 EKTQEVPKVTSQVSP 1131


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS30925  PF07675  30     0.002    Cleaved Adhesin
		>PF07675#Cleaved Adhesin

Length = 1358

Score = 30.4 bits (68), Expect = 0.002
Identities = 17/82 (20%), Positives = 35/82 (42%), Gaps = 2/82 (2%)

Query: 3 IGEAAKKSGLTPKMIRYYESIELLRPAGRSASGYRHYNENDLHTLAFIRRSRDLGFSLDE 62
G + + +G P+ + +++L PAG +RHYN +DL+ + +G S
Sbjct: 933 FGLSTEANGAKPQSVWIERTVDL--PAGTKYVAFRHYNCSDLNYILLDDIQFTMGGSPTP 990

Query: 63 VGKLLTLWQDRQRASADVKALA 84
T+++D + +
Sbjct: 991 TDYTYTVYRDGTKIKEGLTETT 1012


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits     Score  E-value  Comments
HWH78_RS30930  PF06580  34     0.002    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 33.7 bits (77), Expect = 0.002
Identities = 15/81 (18%), Positives = 31/81 (38%), Gaps = 20/81 (24%)

Query: 360 LVGNALRY----TPAGGQVEIRVENRAQHAVLRVRDNGPGVALEEQQAIFTRFYRSPATS 415
LV N +++ P GG++ ++ L V + G +
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLA----------------LKN 306

Query: 416 SGEGSGLGLPIVKRIVELHFG 436
+ E +G GL V+ +++ +G
Sbjct: 307 TKESTGTGLQNVRERLQMLYG 327


ORFs having significant similarity with Known Virulence factors
LocusTag       Hits    Score  E-value  Comments
HWH78_RS30935  HTHFIS  79     2e-19    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.5 bits (196), Expect = 2e-19
Identities = 34/123 (27%), Positives = 57/123 (46%), Gaps = 1/123 (0%)

Query: 2 RILLAEDDLLLGDGIRAGLRLEGDTVEWVIDGVAAENALVTDEFDLLVLDIGLPRRSGLD 61
IL+A+DD + + L G V + + + DL+V D+ +P + D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 ILRNLRHQGLLTPVLLLTARDKVADRVAGLDSGADDYLTKPFDLDELQARV-RALTRRTT 120
+L ++ PVL+++A++ + + GA DYL KPFDL EL + RAL
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 121 GRA 123
+
Sbjct: 125 RPS 127



 
Contact Sachin Pundhir for Bugs/Comments.