PredictBias

Identification of genomic and pathogenicity islands in prokaryotic genomes
 
A) Input parameters
Genome: 2003.gbk
Threshold dinucleotide bias: 2
Threshold codon bias: 4
Threshold %GC bias: 3
E-value (RPSBlast): 0.05
Genome (non-pathogenic): (blank)
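The E-values in the hit tables of this report use BLAST's shorthand, in which a value such as `e-106` stands for 1e-106. The sketch below shows one way to turn those strings into numbers and compare them with the RPSBlast cutoff of 0.05 from the input parameters. The function names are hypothetical, and the "at or below the cutoff" comparison is an assumption rather than documented PredictBias behavior.

```python
# Hypothetical helper names; only the 0.05 cutoff comes from the input parameters.
RPSBLAST_EVALUE_CUTOFF = 0.05


def parse_evalue(text: str) -> float:
    """Convert a BLAST-style E-value string to a float.

    BLAST reports drop the leading "1" for very small values,
    so "e-106" stands for 1e-106.
    """
    text = text.strip()
    if text.startswith("e"):
        text = "1" + text
    return float(text)


def passes_cutoff(evalue_text: str, cutoff: float = RPSBLAST_EVALUE_CUTOFF) -> bool:
    # Assumption: a hit is kept when its E-value is at or below the cutoff.
    return parse_evalue(evalue_text) <= cutoff


if __name__ == "__main__":
    # E-value strings as they appear in the hit tables of this report.
    for ev in ["0.0", "0.035", "3e-19", "e-106"]:
        print(ev, parse_evalue(ev), passes_cutoff(ev))
```

Every E-value listed in the report should pass this check if the cutoff was applied as assumed.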
 
C) Potential GIs and PAIs in NC_002947
S.No    Start    End    Bias    Virulence    Insertion elements    Prediction
1    PP_0005    PP_0048    Y    Y    Y    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
PP_0005-117-3.014245tRNA modification GTPase TrmE
PP_0006119-4.065934inner membrane protein translocase component
PP_0007223-4.033468hypothetical protein
PP_0008319-4.248546ribonuclease P
PP_0009321-5.32410550S ribosomal protein L34
PP_0010321-5.832255chromosome replication initiator DnaA
PP_0011327-6.427758DNA polymerase III subunit beta
PP_0012429-6.847689recombination protein F
PP_0013432-7.575732DNA gyrase subunit B
PP_0014644-8.585329transposase
PP_0015444-8.468669ATPase AAA
PP_0016442-8.319224hypothetical protein
PP_0017441-8.817052transcriptional regulator MvaT, P16 subunit
PP_0018440-8.851610hypothetical protein
PP_0019335-8.077770hypothetical protein
PP_0020233-8.093853hypothetical protein
PP_0021131-7.221985hypothetical protein
PP_0022132-6.651611hypothetical protein
PP_0023031-7.123042hypothetical protein
PP_0024130-7.159714hypothetical protein
PP_0025327-6.548099hypothetical protein
PP_0026426-6.082972CDF family cobalt/cadmium/zinc transporter
PP_0027327-6.985266hypothetical protein
PP_0028225-6.097522hypothetical protein
PP_0029324-5.378451DNA-binding response regulator CzrR
PP_0030226-5.707108sensor histidine kinase
PP_0031130-6.787447hypothetical protein
PP_0032025-6.563673hypothetical protein
PP_0033024-6.248915sugar transferase
PP_0034027-6.587338ribonuclease III
PP_0035130-6.999636GtrA family protein
PP_0036028-5.430112LysR family transcriptional regulator
PP_0037-126-4.707464phosphate-selective porin O and P
PP_0038025-4.590166hypothetical protein
PP_0039025-4.204158hypothetical protein
PP_0040024-3.847543hypothetical protein
PP_0041123-4.092614heavy metal translocating P-type ATPase
PP_0042224-4.727353hypothetical protein
PP_0043225-4.901261CzcA family cobalt/zinc/cadmium efflux
PP_0044334-5.963012CzcB family cobalt/zinc/cadmium efflux
PP_0045136-6.612485CzcC family cobalt/zinc/cadmium efflux
PP_0046032-6.080910porin
PP_0047134-5.461829DNA-binding heavy metal response regulator
PP_0048-127-3.458940hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_0006    60KDINNERMP    766    0.0    60kDa inner membrane protein signature.
		>60KDINNERMP#60kDa inner membrane protein signature.

Length = 548

Score = 766 bits (1978), Expect = 0.0
Identities = 255/561 (45%), Positives = 348/561 (62%), Gaps = 34/561 (6%)

Query: 1 MDIKRTILIAALAVVSYVMVLKWNDDYGQAALPTQNTAASTVAPG--LPDGVPAGNNGAS 58
MD +R +L+ AL VS+++ W D Q T +T A G GVPA G
Sbjct: 1 MDSQRNLLVIALLFVSFMIWQAWEQDKNPQPQAQQTTQTTTTAAGSAADQGVPASGQG-- 58

Query: 59 ADVPSANAESSPAELAPVALSKDLIRVKTDVLELAIDPVGGDIVQLNLPKYPRRQDHPNI 118
LI VKTDVL+L I+ GGD+ Q LP YP+ +
Sbjct: 59 ----------------------KLISVKTDVLDLTINTRGGDVEQALLPAYPKELNSTQ- 95

Query: 119 PFQLFDNGGERVYLAQSGLTGTDGPDARASG-RPLYAAEQKSYQLADGQEQLVVDLKFSD 177
PFQL + + +Y AQSGLTG DGPD A+G RPLY E+ +Y LA+GQ +L V + ++D
Sbjct: 96 PFQLLETSPQFIYQAQSGLTGRDGPDNPANGPRPLYNVEKDAYVLAEGQNELQVPMTYTD 155

Query: 178 NGVN-YIKRFSFKRGEYDLNVSYLIDNQSGQAWNGNMFAQLKRDASGDPSSSTATGT--- 233
N + K F KRG+Y +NV+Y + N + + F QLK+ + P T +
Sbjct: 156 AAGNTFTKTFVLKRGDYAVNVNYNVQNAGEKPLEISSFGQLKQSITLPPHLDTGSSNFAL 215

Query: 234 ATYLGAALWTASEPYKKVSMKDI-DKGSLKENVSGGWVAWLQHYFVTAWIPAKSDNNVVQ 292
T+ GAA T E Y+K I D +L + GGWVA LQ YF TAWIP N
Sbjct: 216 HTFRGAAYSTPDEKYEKYKFDTIADNENLNISSKGGWVAMLQQYFATAWIPHNDGTNNFY 275

Query: 293 TRKDSQGNYIIGYTGPVISVPAGGKVETSALLYAGPKIQSKLKELSPGLELTVDYGFLWF 352
T G IGY + V G ++ L+ GP+IQ K+ ++P L+LTVDYG+LWF
Sbjct: 276 TANLGNGIAAIGYKSQPVLVQPGQTGAMNSTLWVGPEIQDKMAAVAPHLDLTVDYGWLWF 335

Query: 353 IAQPIFWLLQHIHSLLGNWGWSIIVLTMLIKGLFFPLSAASYRSMARMRAVAPKLAALKE 412
I+QP+F LL+ IHS +GNWG+SII++T +++G+ +PL+ A Y SMA+MR + PK+ A++E
Sbjct: 336 ISQPLFKLLKWIHSFVGNWGFSIIIITFIVRGIMYPLTKAQYTSMAKMRMLQPKIQAMRE 395

Query: 413 RFGDDRQKMSQAMMELYKKEKINPLGGCLPILVQMPVFLALYWVLLESVEMRQAPWILWI 472
R GDD+Q++SQ MM LYK EK+NPLGGC P+L+QMP+FLALY++L+ SVE+RQAP+ LWI
Sbjct: 396 RLGDDKQRISQEMMALYKAEKVNPLGGCFPLLIQMPIFLALYYMLMGSVELRQAPFALWI 455

Query: 473 TDLSIKDPFFILPIIMGATMFIQQRLNPTP-PDPMQAKVMKMMPIIFTFFFLWFPAGLVL 531
DLS +DP++ILPI+MG TMF Q+++PT DPMQ K+M MP+IFT FFLWFP+GLVL
Sbjct: 456 HDLSAQDPYYILPILMGVTMFFIQKMSPTTVTDPMQQKIMTFMPVIFTVFFLWFPSGLVL 515

Query: 532 YWVVNNCLSISQQWYITRRIE 552
Y++V+N ++I QQ I R +E
Sbjct: 516 YYIVSNLVTIIQQQLIYRGLE 536
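
Each alignment in this report carries a statistics line with counts and whole-number percentages. On this page the percentages appear to be truncated rather than rounded (for example, 5/23 is shown as 21% and 234/1064 as 21%). The following sketch, with hypothetical function names, parses such a line and checks the printed percentages against the counts under that assumption.

```python
import re

# Matches e.g. "Identities = 5/23 (21%)" in a BLAST statistics line.
STAT_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(line: str) -> dict:
    """Return {name: (count, alignment_length, reported_percent)}."""
    return {
        name: (int(count), int(length), int(pct))
        for name, count, length, pct in STAT_RE.findall(line)
    }


def floor_percent(count: int, length: int) -> int:
    # Integer (floor) percentage; this reproduces the values printed on this page.
    return count * 100 // length


def is_consistent(line: str) -> bool:
    """True when every reported percentage equals the floored ratio."""
    stats = parse_stats(line)
    return bool(stats) and all(
        floor_percent(count, length) == pct
        for count, length, pct in stats.values()
    )


if __name__ == "__main__":
    line = "Identities = 255/561 (45%), Positives = 348/561 (62%), Gaps = 34/561 (6%)"
    print(parse_stats(line), is_consistent(line))
```

A line that fails the check would suggest a transcription error in the counts or the percentages.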


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_0015    PF05272    29    0.035    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.9 bits (64), Expect = 0.035
Identities = 7/17 (41%), Positives = 13/17 (76%)

Query: 52 LLIQGPSGVGKSTLVKE 68
++++G G+GKSTL+
Sbjct: 599 VVLEGTGGIGKSTLINT 615


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_0018    IGASERPTASE    30    0.017    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 29.6 bits (66), Expect = 0.017
Identities = 5/23 (21%), Positives = 12/23 (52%)

Query: 62 EPNSWAFLNRDADYACVLAMDHM 84
+W ++ + +D A M+H+
Sbjct: 622 SNENWLYMGKTSDEAKRNVMNHI 644


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_0029    HTHFIS    79    3e-19    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.1 bits (195), Expect = 3e-19
Identities = 37/118 (31%), Positives = 58/118 (49%), Gaps = 2/118 (1%)

Query: 2 RVLVVEDEIKTAEYLQQGLSESGYVVDIVHNGVDALHLFNTNVYSLVLLDVNLPGIDGWD 61
+LV +D+ L Q LS +GY V I N LV+ DV +P + +D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 LLETIRKT-SRVRIIMLTARGRINDKLKGLDGGADDYLVKPFEFPELLARI-RSLQRR 117
LL I+K + +++++A+ +K + GA DYL KPF+ EL+ I R+L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_0043    ACRIFLAVINRP    806    0.0    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 806 bits (2084), Expect = 0.0
Identities = 234/1064 (21%), Positives = 433/1064 (40%), Gaps = 59/1064 (5%)

Query: 5 IIRFAIEQRIVVMIAVLIMAGIGIYSYQKLPIDAVPDITNVQVQINTAAPGYSPLETEQR 64
+ F I + I + +I+ G + +LP+ P I V ++ PG +
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 65 ITFPVETAMAGLPGLQQTRSLSRS-GLSQVTVIFKDGTDIFFARQLINERLQVAKEQLPE 123
+T +E M G+ L S S S G +T+ F+ GTD A+ + +LQ+A LP+
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 124 GVEAVMGPVSTGLGEIFLWTVEAEDGAVKEDGTPYTPTDLRVIQDWIIKPQLRNVPGVAE 183
V+ V +L D T D+ +K L + GV +
Sbjct: 121 EVQQQGISVEKSSS-SYLMVA-----GFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGD 174

Query: 184 INTIGGYAKQFLVAPDPKRLATYKLTLNDLVAALESNNANVGAGYI------ERNGEQLL 237
+ G + D L YKLT D++ L+ N + AG +
Sbjct: 175 VQLFGA-QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNAS 233

Query: 238 IRAPGQVGNIEDIANIVI-TSVDGAPIRISSVADVSIGKELRTGAATENGREVVLGTVFM 296
I A + N E+ + + + DG+ +R+ VA V +G E A NG+ + +
Sbjct: 234 IIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKL 293

Query: 297 LIGENSRTVSQAVAAKLADINRTLPKGVVAVTVYDRTNLVEKAIATVKKNLVEGAILVIA 356
G N+ ++A+ AKLA++ P+G+ + YD T V+ +I V K L E +LV
Sbjct: 294 ATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFL 353

Query: 357 ILFLFLGNIRAALITAMVIPLSMLFTFTGMFNNKVSANLMSLG--ALDFGIIVDGAVVIV 414
+++LFL N+RA LI + +P+ +L TF + S N +++ L G++VD A+V+V
Sbjct: 354 VMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVV 413

Query: 415 ENAIRRLAHAQHKHGRMLTKTERFHEVFAAAREARRPLIFGQLIIMVVYLPIFALTGVEG 474
EN R + K + + + L+ +++ V++P+ G G
Sbjct: 414 ENVERVMMED---------KLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTG 464

Query: 475 KMFHPMAFTVVMALLGAMVLSVTFVPAAIAMFVTGKVKEEEGVVMRTARL---------- 524
++ + T+V A+ ++++++ PA A + E
Sbjct: 465 AIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVN 524

Query: 525 RYEPVLQWVLGHRNIAFSAAVALVVLSGLLASRMGSEFIPSLSEGDFAMQAMRVPGTSL- 583
Y + +LG +V +L R+ S F+P +G F G +
Sbjct: 525 HYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQE 584

Query: 584 -TQSVEMQQRLEKAVIAQVPEVERMFARSGTAEIASDPMPPNASDAYIMLKPQDQWPNPK 642
TQ V + Q + + + VE +F +G + NA A++ LKP ++ +
Sbjct: 585 RTQKV-LDQVTDYYLKNEKANVESVFTVNG---FSFSGQAQNAGMAFVSLKPWEERNGDE 640

Query: 643 KPRDELIAEVQKAAAGVPGSNYELSQPIQLRFNELISGVRSDVA-VKVFGDDMDVLNNTA 701
+ +I + + EL + D + G D L
Sbjct: 641 NSAEAVIHRAKMELGKIRDGFVIPFNM--PAIVELGTATGFDFELIDQAGLGHDALTQAR 698

Query: 702 NKIAAALKAVPGS-SEVKVEQTSGLPVLTINIDREKAARYGLNIADVQNSIAIAVGGRQA 760
N++ P S V+ + +D+EKA G++++D+ +I+ A+GG
Sbjct: 699 NQLLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYV 758

Query: 761 GTLYEGDRRFDMVVRLPETVRTDVAGMSSLLIPVPANAAQGANQIGFIPLSQVANLDLQL 820
+ R + V+ R + L V + + +P S
Sbjct: 759 NDFIDRGRVKKLYVQADAKFRMLPEDVDKL--YVRSANGE------MVPFSAFTTSHWVY 810

Query: 821 GPNQISRENGKRLVIVSANVRGRDLGSFVEEATASLDK-KVQIPAGYWTTWGGQFEQLQS 879
G ++ R NG + + G+ +A A ++ ++PAG W G Q +
Sbjct: 811 GSPRLERYNGLPSMEIQGEAAP---GTSSGDAMALMENLASKLPAGIGYDWTGMSYQERL 867

Query: 880 AAKRLQIVVPVALLLVMTLLFLMFNNLKDGMLVFTGIPFALTGGVVALWLRDIPLSISAG 939
+ + +V ++ ++V L ++ + + V +P + G ++A L + +
Sbjct: 868 SGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFM 927

Query: 940 VGFIALSGVAVLNGLVMIAFIRGLRE-EGRTLRQAVDEGALTRLRPVLMTALVASLGFIP 998
VG + G++ N ++++ F + L E EG+ + +A RLRP+LMT+L LG +P
Sbjct: 928 VGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLP 987

Query: 999 MALATGTGAEVQRPLATVVIGGILSSTALTLLVLPALYHWAHRK 1042
+A++ G G+ Q + V+GG++S+T L + +P + R
Sbjct: 988 LAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRRC 1031


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_0044    RTXTOXIND    47    8e-08    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 47.1 bits (112), Expect = 8e-08
Identities = 24/139 (17%), Positives = 53/139 (38%), Gaps = 16/139 (11%)

Query: 149 ASQQISDLRSEQQAAQRRVELARVTFEREKQLWQDKISAEQDYLQARQALQEAEISLANA 208
A ++ +S+ + + + A+ ++ QL++++I Q + + LA
Sbjct: 264 AVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEIL--DKLRQTTDNIGLLTLELAKN 321

Query: 209 KQKVGAIGASVNSVGGNRYELRAPFDAVVVE-KHLTVGEVVSEATNAFILSDLNQV-WAT 266
+++ +RAP V + K T G VV+ A ++ + T
Sbjct: 322 EERQQ------------ASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVT 369

Query: 267 FAVPPTDLGKVTTGRAVKV 285
V D+G + G+ +
Sbjct: 370 ALVQNKDIGFINVGQNAII 388



Score = 39.4 bits (92), Expect = 2e-05
Identities = 21/130 (16%), Positives = 44/130 (33%), Gaps = 13/130 (10%)

Query: 88 AGVALEAAAPRDLGTVVSFPGEIRFDEDRTAHVVPRVPGVVEAVQANLGETVKKGQVLAV 147
+A + + V + G++ + P +V+ + GE+V+KG VL
Sbjct: 68 LVIAFILSVLGQVEIVATANGKLTHSGRSKE-IKPIENSIVKEIIVKEGESVRKGDVLLK 126

Query: 148 IASQQISDLRSEQQAAQRRVELARVTFER---------EKQLWQDKISAEQDYLQARQAL 198
+ + ++ Q + AR+ R +L + K+ E + +
Sbjct: 127 LTALG---AEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEE 183

Query: 199 QEAEISLANA 208
SL
Sbjct: 184 VLRLTSLIKE 193


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_0045    IGASERPTASE    32    0.008    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 31.6 bits (71), Expect = 0.008
Identities = 25/174 (14%), Positives = 54/174 (31%), Gaps = 8/174 (4%)

Query: 170 GRVRAGKSSPVEATRAQVQLAEAQLQVRRAETEKATAYQQLAQITGSSVTVFDRLESPTL 229
V S V+A ++A++ + + +T + + + + V E P +
Sbjct: 1066 REVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKV 1125

Query: 230 SPGLPPRTEDLLAKLDQTAEMRQ--AVVQIDKSDASLGSEKAQRIPNLTVSVGSQYDRSV 287
+ + P+ E Q R+ V I + + + P S +V
Sbjct: 1126 TSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETS------SNV 1179

Query: 288 RERVNTVGLSMPLPLFDRNQGNILSASRRADQARDQRNAVELRLRTETQTALNQ 341
+ V N N A+ + + N + R R ++ +
Sbjct: 1180 EQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHN 1233


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_0047    HTHFIS    77    1e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 76.8 bits (189), Expect = 1e-18
Identities = 30/129 (23%), Positives = 62/129 (48%), Gaps = 1/129 (0%)

Query: 2 RILVIEDEVKTAEYVRQGLTECGYVVDCVHTGSDGLFLAKQHEYELIILDINLPEMDGWQ 61
ILV +D+ + Q L+ GY V + + +L++ D+ +P+ + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VLELLRRKNCPSRIMMLTARSRLADKVRGLENGADDYLIKPFEFPELLARV-RALMRRSD 120
+L +++ +++++A++ ++ E GA DYL KPF+ EL+ + RAL
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 121 HPASVEVIR 129
P+ +E
Sbjct: 125 RPSKLEDDS 133


2    PP_0163    PP_0168    Y    Y    N    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
PP_0163821-1.176190GntR family transcriptional regulator
PP_0164821-1.177597hypothetical protein
PP_0165821-1.115511diguanylate cyclase
PP_0166922-1.179862HlyD family type I secretion membrane fusion
PP_0167922-1.186757toxin secretion ATP-binding protein
PP_0168922-1.331955surface adhesion protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_0166    RTXTOXIND    318    e-106    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 318 bits (816), Expect = e-106
Identities = 107/426 (25%), Positives = 201/426 (47%), Gaps = 9/426 (2%)

Query: 41 PRVVRLTIWGVILFFVFLIVWASVAPIDEVTRGEGKAIPSSKVQKIQNLEGGIVAEIFAK 100
R RL + ++ F V + + + ++ V GK S + ++I+ +E IV EI K
Sbjct: 54 SRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVK 113

Query: 101 EGQIVEVGQPLLRLDETRFASNVGETEADRLAMALRVERLSA-----EVEDRPLII---D 152
EG+ V G LL+L ++ +T++ L L R E+ P + +
Sbjct: 114 EGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDE 173

Query: 153 EKLRKAAPNQAASEESLYQSRRQQLQDEIGGLQQQLVQRQQELREYSSKRTQYANSLELL 212
+ + + SL + + Q++ + L +++ E ++ +Y N +
Sbjct: 174 PYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVE 233

Query: 213 RKEISMSEPLVATGAISQVEVLRLRRAEVENRGQLDSTALAIPRAEAAIREVQSKIEETR 272
+ + L+ AI++ VL VE +L + + E+ I + + +
Sbjct: 234 KSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVT 293

Query: 273 GKFRSEALTQLNEARTELNKATATSKALDDRVHRTLVTSPVRGIVKQLLVNTIGGVIQPG 332
F++E L +L + + T ++R +++ +PV V+QL V+T GGV+
Sbjct: 294 QLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTA 353

Query: 333 SDIIEVVPLDDTLVIEAKILPKDIAFLHPGQEATVKFTAYDYTIYGGLKAKLEQIGADTI 392
++ +VP DDTL + A + KDI F++ GQ A +K A+ YT YG L K++ I D I
Sbjct: 354 ETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAI 413

Query: 393 TDEDKKTTYYLIKLRTDRSHLGTDEKPLLIIPGMVATVDIMTGKKTIMSYLLKPIMKARS 452
D+ + + + + + + L T K + + GM T +I TG ++++SYLL P+ ++ +
Sbjct: 414 EDQ-RLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVISYLLSPLEESVT 472

Query: 453 EALRER 458
E+LRER
Sbjct: 473 ESLRER 478


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_0168    CABNDNGRPT    91    5e-20    NodO calcium binding signature.
		>CABNDNGRPT#NodO calcium binding signature.

Length = 479

Score = 90.8 bits (225), Expect = 5e-20
Identities = 55/214 (25%), Positives = 86/214 (40%), Gaps = 11/214 (5%)

Query: 8451 GADTIDSGNGNDIIFGDLITLNGVVSEGYQALQTYVAQKSGVEVGAVTTSNVHQYITEHY 8510
T +G+ + ++ +AL V G + + + +Q I +
Sbjct: 260 ANMTTRTGDSVYGFNSNTDRDFYTATDSSKALIFSVWDAGGTDTFDFSGYSNNQRINLNE 319

Query: 8511 TEFDISGAKDGNDILSGGNGNDILFGQGGSDTLNGGKGNDILLGGTGNDTLIGGQGDDIL 8570
F G GN ++ G + G G+D L G ++IL GG GND L GG G D L
Sbjct: 320 GSFSDVGGLKGNVSIAHGVTIENAIGGSGNDILVGNSADNILQGGAGNDVLYGGAGADTL 379

Query: 8571 IGGSGADTFVWKAGD----VGNDVIKDFNKAEGDRIDLKDLLQGEKGSTIDNYLKLTTVE 8626
GG+G DTFV+ +G D I DF D+IDL + S + + T
Sbjct: 380 YGGAGRDTFVYGSGQDSTVAAYDWIADFQ-KGIDKIDLSAFRNEGQLSFVQDQ--FTGKG 436

Query: 8627 GTTTLQVSSEGKL----NAEGGIANADVTIKLEG 8656
LQ + + E G ++ D +++ G
Sbjct: 437 QEVMLQWDAANSITNLWLHEAGHSSVDFLVRIVG 470


3    PP_0184    PP_0194    Y    Y    N    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
PP_01843122.100677argininosuccinate lyase
PP_01855122.345756LytTR family two component transcriptional
PP_01866122.219543porphobilinogen deaminase
PP_01878141.343277uroporphyrinogen-III synthase
PP_01887130.666842uroporphyrin-III C-methyltransferase
PP_018912200.489689HemY domain-containing protein
PP_01901328-0.476552disulfide bond formation protein DsbB
PP_01919160.461159anti-RNA polymerase sigma 70 factor
PP_01929151.000042peptidyl-prolyl cis-trans isomerase FklB
PP_01938131.446579hypothetical protein
PP_01946111.157008alginate regulatory protein AlgP
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_0185    HTHFIS    76    3e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 76.4 bits (188), Expect = 3e-18
Identities = 29/152 (19%), Positives = 58/152 (38%), Gaps = 6/152 (3%)

Query: 3 VLIVDDEPQGRERLTRLLGELEGYTVLEPSATNGDEALALIESLKPDVVLLDIGMPGLDG 62
+L+ DD+ R L + L GY V +N I + D+V+ D+ MP +
Sbjct: 6 ILVADDDAAIRTVLNQAL-SRAGYDVR--ITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 63 LQVAARLCEREAPPAVVFCSG--DDEYGAEAFKDSTLSHVTKPFQAHALRDALRKAEKPN 120
+ R+ + V+ S +A + ++ KPF L + +A
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 121 RAQLAALTRPANEGGGPRSHISARTRKGIELI 152
+ + + L + +G SA ++ ++
Sbjct: 123 KRRPSKLEDDSQDGMPLVGR-SAAMQEIYRVL 153


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_0192    INFPOTNTIATR    115    5e-34    Macrophage infectivity potentiator signature.
		>INFPOTNTIATR#Macrophage infectivity potentiator signature.

Length = 233

Score = 115 bits (290), Expect = 5e-34
Identities = 68/220 (30%), Positives = 111/220 (50%), Gaps = 15/220 (6%)

Query: 14 VLGLCLMAPLALAD----NDDHD-LAYSLGASLGERLRQEMPGLQLDALVEGLKQSYQGQ 68
++GL + +A D D D L+YS+GA LG+ + + + D L +G++ G
Sbjct: 10 IMGLAMSTAMAATDATSLTTDKDKLSYSIGADLGKNFKNQGIDINPDVLAKGMQDGMSGA 69

Query: 69 PLKLDKARMQAVLQQHE-------AQEGDAAAQKLQAAETRFMANERGRYGVHELTEGVL 121
L L + +M+ VL + + + E + A++ +A F++ + + G+ L G+
Sbjct: 70 QLILTEEQMKDVLSKFQKDLMAKRSAEFNKKAEENKAKGDAFLSANKSKPGIVVLPSGLQ 129

Query: 122 YSELQAGTGAQPKAGGKVQVRYVGRLPDGSIFDQNQ---TPQWFNLDSVIEGWQVALPKM 178
Y + AGTGA+P V V Y G L DG++FD + P F + VI GW AL M
Sbjct: 130 YKIIDAGTGAKPGKSDTVTVEYTGTLIDGTVFDSTEKAGKPATFQVSQVIPGWTEALQLM 189

Query: 179 HTGAKWRLVIPSAQAYGAEGAGDLIAPYTPLVFEIELLAV 218
G+ W + +P+ AYG G I P L+F+I L++V
Sbjct: 190 PAGSTWEVFVPADLAYGPRSVGGPIGPNETLIFKIHLISV 229


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_0194    IGASERPTASE    52    1e-09    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 52.4 bits (125), Expect = 1e-09
Identities = 29/206 (14%), Positives = 57/206 (27%), Gaps = 7/206 (3%)

Query: 131 DQRAAKPAAKATAAAKPAAKPAARATAAAKPAAKPAAKTTTAAKPAAKPAAKATAAAKPA 190
+ +A P+ + A A ++ +K +K K A
Sbjct: 1002 NIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATET 1061

Query: 191 AKPAAKATAAAKPAAKPAAKATAAAKPAAKPAAKATAAAKPAAKPAA--KATAAAKPAAK 248
+ AK K + A+ ++ T K A KA + +
Sbjct: 1062 TAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQE 1121

Query: 249 PAAKAAAAAKPAAKPAAKAPAAKPAAKPAAT---KAPARTAAKPAA--KPAEAKPATPAA 303
+ + + P A+PA + T K P A +PA+ +
Sbjct: 1122 VPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQ 1181

Query: 304 TTNSVSPAAAAPSAPVSTPSQAPSAS 329
+ S + + P+ +
Sbjct: 1182 PVTESTTVNTGNSVVENPENTTPATT 1207



Score = 48.5 bits (115), Expect = 3e-08
Identities = 46/281 (16%), Positives = 75/281 (26%), Gaps = 15/281 (5%)

Query: 48 RGKAQEKLHNGRLKLQDAAKAGKAKAQSKAQKAIGELEELLESLKERQTQTRSY-IQQLK 106
K E HN L L DA+KA + +L L+ + Y + K
Sbjct: 929 ADKTGEPNHNE-LTLFDASKAQRDHLNVSLVGNTVDLGAWKYKLRNVNGRYDLYNPEVEK 987

Query: 107 RDAQDSLKLAQGVGKVREAAGKALDQRAAKPAAKATAAAKPAAKPAARATAAAKPAAKPA 166
R+ ++ PA + T +K
Sbjct: 988 RNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQE 1047

Query: 167 AKTTTAAKPAAKPAAKATAAAKPAAKPAAKATAAAKPAAKPAAKATAAAKPAAKPAAKAT 226
+KT + A AK KA A+ ++ K A
Sbjct: 1048 SKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVE 1107

Query: 227 AAAKPAAKPAAKATAAAKPAAKPAAKAAAAAKPAAKPAAKAPAAKPAAKPAATKAPARTA 286
K AK + + + ++ +P A+PA P
Sbjct: 1108 KEEK------------AKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNI 1155

Query: 287 AKPAAKP-AEAKPATPAATTNSVSPAAAAPSAPVSTPSQAP 326
+P ++ A PA T+S S V+T +
Sbjct: 1156 KEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVV 1196


4    PP_0295    PP_0300    Y    N    N    Genomic Island
LocusTag    DNBias    CDNBias    %GCBias    Product
PP_0295-115-4.016857glycine betaine/L-proline ABC transporter
PP_0296-118-4.306821glycine/betaine ABC transporter
PP_0297-218-3.543419L-serine dehydratase
PP_0298019-3.592199AraC family transcriptional regulator
PP_0299016-3.496902hypothetical protein
PP_0300015-3.100447metallopeptidase
5    PP_0322    PP_0330    Y    N    N    Genomic Island
LocusTag    DNBias    CDNBias    %GCBias    Product
PP_03222220.750395serine hydroxymethyltransferase
PP_03231220.888532sarcosine oxidase subunit beta
PP_0324216-1.325212sarcosine oxidase subunit delta
PP_0325212-1.420897sarcosine oxidase subunit alpha
PP_0326222-4.908330sarcosine oxidase subunit gamma
PP_0327320-4.954068formyltetrahydrofolate deformylase
PP_0328320-5.035046formaldehyde dehydrogenase
PP_0329328-6.401441hypothetical protein
PP_0330021-3.502453hypothetical protein
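
The island summary rows in this report are consistent with a simple labeling rule: regions flagged for compositional bias that also contain virulence-factor hits are called "Pathogenicity Island (biased-composition)", while biased regions without such hits are called "Genomic Island". The sketch below encodes that rule as inferred from the rows on this page only. It is not taken from PredictBias documentation, the function name is hypothetical, and combinations that do not appear here are returned as `None`.

```python
from typing import Optional


def classify_island(bias: bool, virulence: bool, insertion: bool) -> Optional[str]:
    """Label a candidate region from its Bias / Virulence / Insertion elements flags.

    Rule inferred from the summary rows of this report (an assumption,
    not a documented PredictBias algorithm).
    """
    # In the rows shown here the insertion-element flag does not change the label,
    # so it is accepted but not used.
    _ = insertion
    if bias and virulence:
        return "Pathogenicity Island (biased-composition)"
    if bias:
        return "Genomic Island"
    # Flag combinations without compositional bias do not occur on this page.
    return None


if __name__ == "__main__":
    # (Bias, Virulence, Insertion elements) for islands 1, 2 and 4 of this report.
    for flags in [(True, True, True), (True, True, False), (True, False, False)]:
        print(flags, classify_island(*flags))
```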
6    PP_0628    PP_0650    Y    Y    Y    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
PP_06282241.122032hypothetical protein
PP_06293271.000212hypothetical protein
PP_0630-1230.514195zinc-binding protein
PP_0631-123-0.727133dephospho-CoA kinase
PP_0632-322-1.732367prepilin peptidase
PP_0633-223-2.399547type IV pili biogenesis protein PilC
PP_0634130-6.337837fimbrial protein pilin
PP_0635339-9.319124*group II intron-encoding maturase
PP_0636343-11.409913cold shock DNA-binding domain-containing
PP_0637439-10.048047ISPpu15, transposase Orf2
PP_0638749-12.216381ISPpu15, transposase Orf1
PP_0639751-12.382180hypothetical protein
PP_0640643-9.247254hypothetical protein
PP_0641432-3.045064hypothetical protein
PP_0642432-3.072356hypothetical protein
PP_0643341-4.687761hypothetical protein
PP_0644346-6.271759hypothetical protein
PP_0645248-6.466873hypothetical protein
PP_0646250-6.929252hypothetical protein
PP_0647235-5.421351hypothetical protein
PP_0648233-4.404454hypothetical protein
PP_0649326-2.672307MutT/nudix family protein
PP_0650220-1.784319hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_0632    PREPILNPTASE    330    e-116    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 330 bits (847), Expect = e-116
Identities = 148/283 (52%), Positives = 188/283 (66%), Gaps = 2/283 (0%)

Query: 3 LWALLAEQPAYFLTLATVLGLLVGSFINVLVYRLPIMLERQWQREAQEVLGLPTT--QHA 60
L L P + +L + L++GSF+NV+++RLPIMLER+WQ E +
Sbjct: 4 LLELAHGLPWLYFSLVFLFSLMIGSFLNVVIHRLPIMLEREWQAEYRSYFNPDDEGVDEP 63

Query: 61 RFDLCLPASRCPHCAHRIRAWENIPVISYLALGGRCSSCKNRISLRYPVVEVASALLSLV 120
++L +P S CPHC H I A ENIP++S+L L GRC C+ IS RYP+VE+ +ALLS+
Sbjct: 64 PYNLMVPRSCCPHCNHPITALENIPLLSWLWLRGRCRGCQAPISARYPLVELLTALLSVA 123

Query: 121 VAWRFGASVEALVALPLTWCLLALSLIDADHQLLPDVLVLPTMWLGLIVNAFGIHVPLAD 180
VA L AL LTW L+AL+ ID D LLPD L LP +W GL+ N G V L D
Sbjct: 124 VAMTLAPGWGTLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGD 183

Query: 181 ALWGAVAGYLSLWTVYWVFRLVTGKEGMGYGDFKLMALIGAWGGWQVLPLTLLLSSVVGA 240
A+ GA+AGYL LW++YW F+L+TGKEGMGYGDFKL+A +GAW GWQ LP+ LLLSS+VGA
Sbjct: 184 AVIGAMAGYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGA 243

Query: 241 LFGLCLLRFRRDAMGTAIPFGPYLAIAGWIAVLWGDEIYASYM 283
G+ L+ R IPFGPYLAIAGWIA+LWGD I Y+
Sbjct: 244 FMGIGLILLRNHHQSKPIPFGPYLAIAGWIALLWGDSITRWYL 286


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_0633    BCTERIALGSPF    427    e-151    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 427 bits (1099), Expect = e-151
Identities = 129/405 (31%), Positives = 205/405 (50%), Gaps = 10/405 (2%)

Query: 7 LYHWHGTDANGAPVSGQTPGRSPAYVRAGLIRQGITVASLRPA---------SGLAFSLP 57
YH+ DA G G S R L +G+ S+ +GL+
Sbjct: 3 QYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLRRK 62

Query: 58 KRREKADPAGFSRQLATLLKAGVPLLQAFEVMGRSGCDAAQAALLERLKQDVASGLGLAD 117
R +D A +RQLATL+ A +PL +A + + + + L+ ++ V G LAD
Sbjct: 63 IRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLAD 122

Query: 118 ALQRHPAWFDALYCNLVRVGEQSGTLDRQLEQLAGMLEQRRVLHKKVRKAMIYPLLLLLT 177
A++ P F+ LYC +V GE SG LD L +LA EQR+ + ++++AMIYP +L +
Sbjct: 123 AMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLTVV 182

Query: 178 GLGVSAILLLEVIPKFESMFSGMGAALPAFTQWVINLSTGLSRFAPLLLVMGVVMGVAVR 237
+ V +ILL V+PK F M ALP T+ ++ +S + F P +L+ + +A R
Sbjct: 183 AIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMAFR 242

Query: 238 QLYRQHAPARLWISRRVLGLPVFGKLLGQAALARFARSLATSYGAGVPLLDALGTVARVT 297
+ RQ R+ RR+L LP+ G++ AR+AR+L+ + VPLL A+ V
Sbjct: 243 VMLRQE-KRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDVM 301

Query: 298 GGDLHEQAVLRLRQGMANGQGLNQAMAGEPLFPPLLVQLTAIGESSGTLDQMLEKAASHY 357
D + + G L++A+ LFPP++ + A GE SG LD MLE+AA +
Sbjct: 302 SNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADNQ 361

Query: 358 EEQVSQALDQLTSLLEPAIVLILGLLVGGLVVAMYLPIFQLGSLI 402
+ + S + L EP +V+ + +V +V+A+ PI QL +L+
Sbjct: 362 DREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_0634    BCTERIALGSPG    55    1e-12    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 55.3 bits (133), Expect = 1e-12
Identities = 20/62 (32%), Positives = 40/62 (64%), Gaps = 1/62 (1%)

Query: 1 MKGQRGITLIELMIVVAIIGILATIALPMYTNHQARSKAAAGLLEISALKTAMDL-RLND 59
QRG TL+E+M+V+ IIG+LA++ +P ++ ++ + +I AL+ A+D+ +L++
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDN 63

Query: 60 GK 61

Sbjct: 64 HH 65


7    PP_0681    PP_0693    Y    Y    N    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
PP_0681219-2.242929hypothetical protein
PP_0682221-1.348225hypothetical protein
PP_0683224-1.169634hypothetical protein
PP_0684225-1.433159FKBP-type peptidylprolyl isomerase
PP_0685425-1.283762hypothetical protein
PP_06863240.368486alkylphosphonate utilization protein PhnA
PP_06873210.362454polyprenyl synthetase
PP_06883210.64675250S ribosomal protein L21
PP_06894211.13754850S ribosomal protein L27
PP_06903211.791421GTPase ObgE
PP_06912182.379761gamma-glutamyl kinase
PP_06922191.497463CreA family protein
PP_06932201.944547hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_0684    INFPOTNTIATR    169    4e-55    Macrophage infectivity potentiator signature.
		>INFPOTNTIATR#Macrophage infectivity potentiator signature.

Length = 233

Score = 169 bits (429), Expect = 4e-55
Identities = 88/205 (42%), Positives = 124/205 (60%), Gaps = 6/205 (2%)

Query: 5 NLSTDETRVSYGIGRQLGGQLRDNPPPGVSLEAILAGLTDAFNGADSRVSEADLSASF-K 63
+L+TD+ ++SY IG LG + N ++ + + G+ D +GA ++E + K
Sbjct: 26 SLTTDKDKLSYSIGADLGKNFK-NQGIDINPDVLAKGMQDGMSGAQLILTEEQMKDVLSK 84

Query: 64 VIREVM---QAEAAAKAEAAAAAGKEFLVENAKREGITTLASGLQFEVLTAGEGAKPTRE 120
+++M AE KAE A G FL N + GI L SGLQ++++ AG GAKP +
Sbjct: 85 FQKDLMAKRSAEFNKKAEENKAKGDAFLSANKSKPGIVVLPSGLQYKIIDAGTGAKPGKS 144

Query: 121 DNVRTHYHGTLIDGTVFDSSYERGQPAEFPVGGVIAGWTEALQLMNAGSKWRLYVPSELA 180
D V Y GTLIDGTVFDS+ + G+PA F V VI GWTEALQLM AGS W ++VP++LA
Sbjct: 145 DTVTVEYTGTLIDGTVFDSTEKAGKPATFQVSQVIPGWTEALQLMPAGSTWEVFVPADLA 204

Query: 181 YGAQGVGS-IPPHSVLVFDVELLDV 204
YG + VG I P+ L+F + L+ V
Sbjct: 205 YGPRSVGGPIGPNETLIFKIHLISV 229


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_0690    PF07201    30    0.015    Hypersensitivity response secretion protein HrpJ
		>PF07201#Hypersensitivity response secretion protein HrpJ

Length = 293

Score = 30.2 bits (68), Expect = 0.015
Identities = 32/169 (18%), Positives = 54/169 (31%), Gaps = 34/169 (20%)

Query: 245 VDIAPLDESSPADAAEVIVNELT-----RFSPSLAERE-------RWLVLNKA----DMV 288
V I S AD AE E+T R SL +R+ V + V
Sbjct: 39 VQIVSGTLQSIADMAE----EVTFVFSERKELSLDKRKLSDSQARVSDVEEQVNQYLSKV 94

Query: 289 MDDERDERVQEVIDRLEWEGPVYVISAISK----QGTDKLSHDLMRYLEDRADRLANDPA 344
+ E+ + V E++ L P +S + + + M L D L P
Sbjct: 95 PELEQKQNVSELLSLLS-NSPNISLSQLKAYLEGKSEEPSEQFKM--LCGLRDALKGRPE 151

Query: 345 YAEELADLDQRIED-------EARAQLQALDDARTLRRTGVKSVHDIGD 386
A ++Q + + +A ++GV + + D
Sbjct: 152 LAHLSHLVEQALVSMAEEQGETIVLGARITPEAYRESQSGVNPLQPLRD 200


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_0691    CARBMTKINASE    43    9e-07    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 43.3 bits (102), Expect = 9e-07
Identities = 39/147 (26%), Positives = 60/147 (40%), Gaps = 19/147 (12%)

Query: 124 TLRTLVDLGV---------VPVINENDTVVTDEIRFGDNDTLAALVANLVEADLLVILTD 174
T++ LV+ GV VPVI E+ + E D D +A V AD+ +ILTD
Sbjct: 178 TIKKLVERGVIVIASGGGGVPVILEDGEIKGVEAVI-DKDLAGEKLAEEVNADIFMILTD 236

Query: 175 RDGMFDADPRNNPEAQLIYEARADDPSLDAVAGGTGGALGRGGMQTKLRAARLAARSGAH 234
+G + Q + E + ++ G G M K+ AA G
Sbjct: 237 VNGAALY--YGTEKEQWLREVKVEELRKYYEEGH----FKAGSMGPKVLAAIRFIEWGGE 290

Query: 235 TIIIGGRIERVLDRLKAGERLGTLLSP 261
II +E+ ++ L G+ GT + P
Sbjct: 291 RAII-AHLEKAVEAL-EGKT-GTQVLP 314


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_0693    CHANLCOLICIN    39    9e-05    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 38.9 bits (90), Expect = 9e-05
Identities = 40/254 (15%), Positives = 86/254 (33%), Gaps = 27/254 (10%)

Query: 465 AIDLTHIDPPALQALADRAALRDQKERLEKELKQLKTQQAVAADRSASKAQTETLYQEVL 524
A +L H + A+QA +R L +E+ KE + +A KA +QE
Sbjct: 112 ATELAHANNAAMQAEDERLRLAKAEEKARKEAE------------AAEKA-----FQE-- 152

Query: 525 DAQKALEDFRRSQTLAAEEPEKLEQLSQLEAAQDELKRSSDAFTERVQQLSAKLQL-VGR 583
A++ ++ R + + + E + AA E ++ + Q+ + Q V +
Sbjct: 153 -AEQRRKEIEREKAETERQLKLAEAEEKRLAALSEEAKAVEI----AQKKLSAAQSEVVK 207

Query: 584 QLGDLESKQRTLEDALRRRQLLPADLPYGTPYMEAIDDSMDNLLPLLNDYQDSWQSLQRV 643
G++++ L ++ R L + L L+ +
Sbjct: 208 MDGEIKTLNSRLSSSIHARDAEMKTLAGKRNELAQASAKYKELDELVKKLSPRANDPLQN 267

Query: 644 DNQIEALYAQVRLKGVAKFDSEDD--MERRLQLLVNAYAHRTDEALTLAKARRAAVTDIA 701
EA +V + + + E R+ + ++ R A + +
Sbjct: 268 RPFFEATRRRVGAGKIREEKQKQVTASETRINRINADITQIQKAISQVSNNRNAGIARVH 327

Query: 702 RTLRNIRSDYDSLE 715
N++ ++L
Sbjct: 328 EAEENLKKAQNNLL 341


8    PP_0719    PP_0729    Y    Y    Y    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
PP_0719321-1.585645GTP-dependent nucleic acid-binding protein EngD
PP_0720120-0.291476peptidyl-tRNA hydrolase
PP_0721120-1.24209750S ribosomal protein L25
PP_0722-118-1.275629ribose-phosphate pyrophosphokinase
PP_0723019-2.174751*4-diphosphocytidyl-2-C-methyl-D-erythritol
PP_0724119-2.433109molecular chaperone LolB
PP_0725222-3.641280hypothetical protein
PP_0726335-5.667506transferase
PP_0727226-3.931781hypothetical protein
PP_0728221-3.336508hypothetical protein
PP_0729215-1.592902phosphoenolpyruvate synthase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_0725   SYCDCHAPRONE  35     5e-04    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 34.5 bits (79), Expect = 5e-04
Identities = 18/119 (15%), Positives = 37/119 (31%), Gaps = 4/119 (3%)

Query: 411 LQQAIQRYPDDLNLLYTRAMLAEKRNDLTQMEKDLRAIITREPENAMALNALGYTLADRT 470
+ + D L LY+ A + K +A+ + ++ LG
Sbjct: 25 IAMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACR-QAM 83

Query: 471 TRYTEAKALIDKAHQLTPDDPAVLDSLGWVNYRLGNLDAAETYL---RQAFANFPDHEV 526
+Y A + +P + G L AE+ L ++ A+ + +
Sbjct: 84 GQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTEFKE 142



Score = 33.0 bits (75), Expect = 0.002
Identities = 16/63 (25%), Positives = 24/63 (38%)

Query: 283 PDDDELRYSLALVCLENKDWDEAEGYLQELVERDSYVDAAHLNLGRIHEERHDPAGALRE 342
D E YSLA ++ +++A Q L D Y L LG + A+
Sbjct: 33 SDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHS 92

Query: 343 YAL 345
Y+
Sbjct: 93 YSY 95


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_0728   PHPHTRNFRASE  47     1e-07    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 46.7 bits (111), Expect = 1e-07
Identities = 24/96 (25%), Positives = 41/96 (42%), Gaps = 8/96 (8%)

Query: 298 RVAHVLPGQLAPDI----EGCIVLIENADPGFDWIFTH--RIAGFITAYGGENSHMSIRA 351
RV L G + E +++ E+ P D + + GF T GG SH +I +
Sbjct: 137 RVLGHLIGVETGSLATIAEETVIIAEDLTPS-DTAQLNKQFVKGFATDIGGRTSHSAIMS 195

Query: 352 REFAIPAAIGVGDTRFKTLLAATSLLLDCAERRIQV 387
R IPA +G + + + +++D E + V
Sbjct: 196 RSLEIPAVVGTKEVT-EKIQHGDMVIVDGIEGIVIV 230


9  PP_0779  PP_0791  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias    Product
PP_0779   3       30       -7.439058  methyl-accepting chemotaxis transducer/sensory
PP_0780   4       34       -8.361751  hypothetical protein
PP_0781   1       22       -6.051319  hypothetical protein
PP_0782   0       18       -4.710251  hypothetical protein
PP_0783   0       9        -0.737287  hypothetical protein
PP_0784   -1      8        -0.306639  hypothetical protein
PP_0785   2       12       1.195015   sulfate transport protein CysZ
PP_0786   2       14       1.716573   thioredoxin reductase
PP_0787   2       14       2.193233   nicotinate-nucleotide pyrophosphorylase
PP_0788   2       12       3.058615   hypothetical protein
PP_0789   2       13       3.356316   N-acetyl-anhydromuranmyl-L-alanine amidase
PP_0790   1       12       3.285549   signaling modulator of AmpD, AmpE
PP_0791   0       10       3.343433   TatD family deoxyribonuclease
10  PP_0801  PP_0812  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
PP_0801   6       14       3.823510   hypothetical protein
PP_0802   5       13       3.919242   chemotaxis protein CheV
PP_0803   5       13       3.908313   HlyD family type I secretion membrane fusion
PP_0804   5       13       3.936367   protein secretion ABC efflux system, permease
PP_0805   5       13       3.699038   TolC family type I secretion outer membrane
PP_0806   6       14       3.482045   surface adhesion protein
PP_0807   -1      9        -0.475321  anaerobic nitric oxide reductase transcriptional
PP_0808   -2      16       -3.067610  nitric oxide dioxygenase
PP_0809   0       21       -4.326289  disulfide bond formation protein B
PP_0810   1       25       -5.125013  cyoups1 protein
PP_0811   1       23       -4.269682  cyoups2 protein
PP_0812   1       21       -3.181445  ubiquinol oxidase subunit II
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
PP_0802   HTHFIS  59     6e-12    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 58.7 bits (142), Expect = 6e-12
Identities = 23/109 (21%), Positives = 47/109 (43%), Gaps = 7/109 (6%)

Query: 169 AANILVVDDSQVALQQSVHTLRNLGIECHTARSAKDAINVLLELQGTAQEINIIVSDIEM 228
A ILV DD L G + +A + A + +++V+D+ M
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWI-----AAGDGDLVVTDVVM 57

Query: 229 SEMDGYAFTRTLRETPDFQHLYVLLHTSLDSAMSSEKATQAGANAILTK 277
+ + + +++ L VL+ ++ ++ M++ KA++ GA L K
Sbjct: 58 PDENAFDLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPK 104


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
PP_0803   RTXTOXIND  256    3e-83    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 256 bits (655), Expect = 3e-83
Identities = 92/426 (21%), Positives = 172/426 (40%), Gaps = 58/426 (13%)

Query: 21 RAGRIITLCALMLAAFLAWAAWFEVTEVSTGTGKVIPSSREQVIQSFEGGIVAQMSVAEG 80
I L + +V V+T GK+ S R + I+ E IV ++ V EG
Sbjct: 59 LVAYFIMGF---LVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEG 115

Query: 81 DLVERGQVLAQLDPTKTASSVGESEAKYRAARASQARLQAEVTG---------KPLTFPE 131
+ V +G VL +L + ++++ AR Q R Q K P
Sbjct: 116 ESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPY 175

Query: 132 SLRDSPDLIDAETALYQTRRR---------------------GLEQTLAGIQDSLQLVRS 170
S + + T+L + + + + ++ ++ +S
Sbjct: 176 FQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKS 235

Query: 171 ELKITENLAKMGASSRVEVI---------------------RLNRQRSELELKANEARSD 209
L +L A ++ V+ ++ + + +
Sbjct: 236 RLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQL 295

Query: 210 YLVRAREELAKASAEADALSEVIRGRSDSLTRLTLRSPVRGIVKDIEVNTLGGVVQPGGQ 269
+ ++L + + L+ + + +R+PV V+ ++V+T GGVV
Sbjct: 296 FKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET 355

Query: 270 VMKIVPMDERLLIETRIAPRDIAFIHPDQAAKVKISAYDYSVYGGLDGKVVGISPDTLQD 329
+M IVP D+ L + + +DI FI+ Q A +K+ A+ Y+ YG L GKV I+ D ++D
Sbjct: 356 LMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIED 415

Query: 330 EVKPEIYYYRVFIRTEQDSLQNKAGKRFAIVPGMIATVDIRTGEKTILDYLIKPL-NRAK 388
+ + + + V I E++ L + K + GM T +I+TG ++++ YL+ PL
Sbjct: 416 Q-RLGLVFN-VIISIEENCL-STGNKNIPLSSGMAVTAEIKTGMRSVISYLLSPLEESVT 472

Query: 389 EALRER 394
E+LRER
Sbjct: 473 ESLRER 478


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
PP_0806   RTXTOXINA  50     5e-07    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 49.6 bits (118), Expect = 5e-07
Identities = 24/72 (33%), Positives = 32/72 (44%), Gaps = 10/72 (13%)

Query: 6158 DVIAGTDGNDHLDGSQG--------GHITLQGGAGDDTLVVVDQNFAS--VDGGSGTDTL 6207
D ++G +G+D L G G G+ L GG GDD V + A + GG G D L
Sbjct: 765 DTLSGGNGDDQLYGGDGNDKLIGVAGNNYLNGGDGDDEFQVQGNSLAKNVLFGGKGNDKL 824

Query: 6208 LWGGGDASIDLG 6219
G +D G
Sbjct: 825 YGSEGADLLDGG 836



Score = 40.7 bits (95), Expect = 2e-04
Identities = 24/62 (38%), Positives = 30/62 (48%), Gaps = 11/62 (17%)

Query: 6158 DVIAGTDGNDHLDGSQGGHITLQGGAGDDTLVVVDQNFASVDGGSGTDTLLWGGGDASID 6217
D+I G DGND L G +G L GG GDD L GG G D L+ G+ ++
Sbjct: 747 DLIEGNDGNDRLYGDKGNDT-LSGGNGDDQL----------YGGDGNDKLIGVAGNNYLN 795

Query: 6218 LG 6219
G
Sbjct: 796 GG 797



Score = 34.2 bits (78), Expect = 0.023
Identities = 27/87 (31%), Positives = 36/87 (41%), Gaps = 2/87 (2%)

Query: 6128 DSAAGLTATTSLLADTGDESAALASLAAATDVIAGTDGNDHLDGSQGGHITLQGGAGDDT 6187
D G+ L GD+ + + A +V+ G GND L GS+G + L GG GDD
Sbjct: 783 DKLIGVAGNNYLNGGDGDDEFQVQGNSLAKNVLFGGKGNDKLYGSEGADL-LDGGEGDDL 841

Query: 6188 LVVVDQNFASVDG-GSGTDTLLWGGGD 6213
L N G G + GG
Sbjct: 842 LKGGYGNDIYRYLSGYGHHIIDDDGGK 868


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
PP_0807   HTHFIS  379    e-129    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 379 bits (974), Expect = e-129
Identities = 140/369 (37%), Positives = 195/369 (52%), Gaps = 15/369 (4%)

Query: 164 ERIEHLALRAEDEHHRAELYRQASGQD-RELIGQSPAHKRLVEEIRLVGSSDLTVLITGE 222
+ + RA E R + QD L+G+S A + + + + +DLT++ITGE
Sbjct: 109 TELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGE 168

Query: 223 TGVGKELVAQALHRASSRADKPLVSLNCAALPDTLVESELFGHVRGAFTGAHGERRGKFE 282
+G GKELVA+ALH R + P V++N AA+P L+ESELFGH +GAFTGA G+FE
Sbjct: 169 SGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFE 228

Query: 283 LANGGTLFLDEVGELPLTVQAKLLRVLQSGQLQRLGSDREHRVDVRLIAATNRDLAAEVR 342
A GGTLFLDE+G++P+ Q +LLRVLQ G+ +G R DVR++AATN+DL +
Sbjct: 229 QAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSIN 288

Query: 343 TGNFRADFYHRLSVYPLHVPPLRERGRDVLLLAGYFLEQNRSRLGLNSLRLSHEAQAALI 402
G FR D Y+RL+V PL +PPLR+R D+ L +F++Q + GL+ R EA +
Sbjct: 289 QGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAE-KEGLDVKRFDQEALELMK 347

Query: 403 AYDWPGNVRELEHLIGRSALKALGQHPDRPRILTL-------------EAIDLDLRVSAT 449
A+ WPGNVRELE+L+ R R I A L +S
Sbjct: 348 AHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQA 407

Query: 450 TPGTLPSPAAPLQVVTPPEGGLREAVDSYQRQVIEACLQRHQDNWAAAARELGLDRANLS 509
+ A PP G + + +I A L + N AA LGL+R L
Sbjct: 408 VEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLR 467

Query: 510 RLARRLGLR 518
+ R LG+
Sbjct: 468 KKIRELGVS 476


11  PP_1039  PP_1052  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias   Product
PP_1039   2       13       3.280034  Sec-independent protein translocase subunit
PP_1040   2       13       4.121648  twin-arginine translocation protein subunit
PP_1041   1       13       3.641371  twin-arginine translocation protein TatA
PP_1042   1       13       3.860185  general secretion pathway protein K
PP_1043   1       15       4.036267  hypothetical protein
PP_1044   2       14       4.359673  lipoprotein UxpA
PP_1045   2       16       4.141774  type II secretion pathway protein XcpP
PP_1046   2       15       3.773189  general secretion pathway protein D
PP_1047   5       16       4.470594  type II secretion system protein E
PP_1048   7       14       4.675414  general secretion pathway protein F
PP_1049   7       14       4.257006  general secretion pathway protein G
PP_1050   9       16       4.316477  general secretion pathway protein H
PP_1051   8       17       3.492159  type II secretion system protein I/J
PP_1052   6       16       3.420954  type II secretion pathway protein XcpW
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
PP_1040   TATBPROTEIN  71     4e-19    Bacterial sec-independent translocation TatB protein...
		>TATBPROTEIN#Bacterial sec-independent translocation TatB protein signature.

Length = 171

Score = 70.8 bits (173), Expect = 4e-19
Identities = 27/103 (26%), Positives = 51/103 (49%), Gaps = 8/103 (7%)

Query: 1 MFEVGFSELLLIGVVALLVLGPERLPVAARTLGRGLGQARRAMHALRTQVEREIEMPNLD 60
MF++GFSELLL+ ++ L+VLGP+RLPVA +T+ + R ++ ++ +E+++
Sbjct: 1 MFDIGFSELLLVFIIGLVVLGPQRLPVAVKTVAGWIRALRSLATTVQNELTQELKLQEFQ 60

Query: 61 -------QAPLQRLEQEIRQGISLNAEPANDAATAVLPKENAS 96
+A L L E++ A ++ +
Sbjct: 61 DSLKKVEKASLTNLTPELKA-SMDELRQAAESMKRSYVANDPE 102


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
PP_1041   TATBPROTEIN  34     2e-05    Bacterial sec-independent translocation TatB protein...
		>TATBPROTEIN#Bacterial sec-independent translocation TatB protein signature.

Length = 171

Score = 33.8 bits (77), Expect = 2e-05
Identities = 12/45 (26%), Positives = 20/45 (44%)

Query: 1 MGGIGIWQLVIVLLIVFLLFGTKRLKGLGSDVGEAIQGFRKSMGG 45
M IG +L++V +I ++ G +RL V I+ R
Sbjct: 1 MFDIGFSELLLVFIIGLVVLGPQRLPVAVKTVAGWIRALRSLATT 45


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_1046   BCTERIALGSPD  473    e-162    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 473 bits (1219), Expect = e-162
Identities = 195/629 (31%), Positives = 311/629 (49%), Gaps = 97/629 (15%)

Query: 10 ALSLALSMAYAQEPVFDDNGTPMYEVNFVDTELGEFIDSVSRITGTTFIVDPRVKGKVTV 69
S +L++ +F + +F T++ EFI++VS+ T I+DP V+G +TV
Sbjct: 7 IRSFSLTLLIFAALLFRPAAAEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITV 66

Query: 70 RTVDLHDADAIYDIFLAQLRAQGYATVDLPNGSVKIVPDQAARLEPVPV----------- 118
R+ D+ + + Y FL+ L G+A +++ NG +K+V + A+ VPV
Sbjct: 67 RSYDMLNEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGIGDE 126

Query: 119 ---------------------EAGGQQGEGS----DSVATRVFSVRNAASEQVLGILKPL 153
+ G GS + + + R A +++L I++ +
Sbjct: 127 VVTRVVPLTNVAARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERV 186

Query: 154 IDP--RVGVITPYPAAHQL-------------------------VVTDWRSNL------- 179
+ R V P A VV D R+N
Sbjct: 187 DNAGDRSVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEP 246

Query: 180 ---ERIASLLRQLDRPQEVPGSGSTQVIYLRHANAGEVVKVLRGLSQEGAVPVEGAGEAE 236
+RI ++++QLDR Q G+T+VIYL++A A ++V+VL G+S + A
Sbjct: 247 NSRQRIIAMIKQLDRQQAT--QGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAKPVA 304

Query: 237 GKDRPVVPAAGGSGIRLEYEEGTNAVVMVGPDSELAAYRAIVEQLDIRRAQVVVEAIIAE 296
D+ ++ A G TNA+++ + ++ QLDIRR QV+VEAIIAE
Sbjct: 305 ALDKNIIIKAHGQ---------TNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAE 355

Query: 297 VSDSSAQELGVQWLFADEKFGAGIVNFGSNGVNIASIAGAAASGDNEALGDLLSTTTGAT 356
V D+ LG+QW AG+ F ++G+ I++ A + + G + S+ A
Sbjct: 356 VQDADGLNLGIQWANK----NAGMTQFTNSGLPISTAIAGANQYNKD--GTVSSSLASAL 409

Query: 357 AGIGHFGGGF---NFAMLVNALKGKSGFNLLSTPTLLTLDNAEASILVGQEVPFVTGSVT 413
+ GF N+AML+ AL + ++L+TP+++TLDN EA+ VGQEVP +TGS T
Sbjct: 410 SSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTGSQT 469

Query: 414 QNNANPYQTIERKEVGVKLRIKPQINIDNSVRLDIVQEVSSIADTSSASD----VITNKR 469
+ N + T+ERK VG+KL++KPQIN +SV L+I QEVSS+AD +S++ N R
Sbjct: 470 TSGDNIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSVADAASSTSSDLGATFNTR 529

Query: 470 EIKTKVMVEDNGLVILGGLISDELSTSDQRVPLLGDIPYLGRLFRSDATRNTKQNLMVFI 529
+ V+V V++GGL+ +S + +VPLLGDIP +G LFRS + + +K+NLM+FI
Sbjct: 530 TVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFI 589

Query: 530 RPRILRDGPSLAGLSEDKYRTLQQTTPLQ 558
RP ++RD S +Y Q
Sbjct: 590 RPTVIRDRDEYRQASSGQYTAFNDAQSKQ 618


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_1048   BCTERIALGSPF  452    e-161    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 452 bits (1165), Expect = e-161
Identities = 176/404 (43%), Positives = 248/404 (61%), Gaps = 8/404 (1%)

Query: 1 MPTYRYQAVDLAGKSHKASLQADSERHARQLLREQGLF--------ARQLQRHDAGSRQP 52
M Y YQA+D GK + + +ADS R ARQLLRE+GL Q + G
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 53 RRQRLSRAQLCELTRQLATLTGAGIPLVDALATLERQLRQPALHSVLVALRGSLAEGLGL 112
R+ RLS + L LTRQLATL A +PL +AL + +Q +P L ++ A+R + EG L
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 113 ARSLARQGAPFTGLYCALVEAGERSGHLAQVLTRLADHLEQVQRQQHKARTALIYPAVLM 172
A ++ F LYCA+V AGE SGHL VL RLAD+ EQ Q+ + + + A+IYP VL
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180

Query: 173 GVSLAVVIGLMTFVVPKLTEQFAHAGQSLPLITSLLIGLSQGLVHAGPWLLGVALLLGGL 232
V++AVV L++ VVPK+ EQF H Q+LPL T +L+G+S + GPW+L L
Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240

Query: 233 AGWLLRKPHWCLRRDQLLLRLPRIGNLLQVLESARLARSLAILCGSGVALLEALQVATET 292
+LR+ + + LL LP IG + + L +AR AR+L+IL S V LL+A++++ +
Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300

Query: 293 IGNRRIRLAMEQVRQHVQGGTSLHRALDASQQFPPLLVNMVGSGEASGTLADMLERVADD 352
+ N R + V+ G SLH+AL+ + FPP++ +M+ SGE SG L MLER AD+
Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360

Query: 353 QERGFARQVDTAMALFEPLMILVMGAVVLFIVLAVLLPIMQLNQ 396
Q+R F+ Q+ A+ LFEPL+++ M AVVLFIVLA+L PI+QLN
Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNT 404


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_1049   BCTERIALGSPG  217    3e-76    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 217 bits (555), Expect = 3e-76
Identities = 71/141 (50%), Positives = 96/141 (68%), Gaps = 3/141 (2%)

Query: 11 RRNRQRGFTLMEIMVVIFIIGLLIAVVAPSVLGNQDKAMKQKVMADLATLEQALDMYRLD 70
++QRGFTL+EIMVVI IIG+L ++V P+++GN++KA KQK ++D+ LE ALDMY+LD
Sbjct: 3 ATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLD 62

Query: 71 NLRFPSNEQGLAALVKKPAQEPLPRAWRSDGYVRRLPEDPWGTPYQYRMPGEHGRVDVYS 130
N +P+ QGL +LV+ P PL + +GY++RLP DPWG Y PGEHG D+ S
Sbjct: 63 NHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDLLS 122

Query: 131 LGADGLPGGEGQDADLGNWAL 151
G DG G E D+ NW L
Sbjct: 123 AGPDGEMGTED---DITNWGL 140


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_1050   BCTERIALGSPH  37     6e-06    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 37.2 bits (86), Expect = 6e-06
Identities = 22/89 (24%), Positives = 38/89 (42%), Gaps = 1/89 (1%)

Query: 4 QRGFSLIELLVVLAIAGLMTGLAVAGLGNG-QASVEQALQRLAVKVRGQAALARHAGQLR 62
QRGF+L+E++++L + G+ G+ + S Q L R ++R GQ
Sbjct: 3 QRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTGQFF 62

Query: 63 GLRWNGQRPEFVRREGNAWVVEAVPLGDW 91
G+ + R +F+ E A W
Sbjct: 63 GVSVHPDRWQFLVLEARDGADPAPADDGW 91


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_1051   PilS_PF08805  32     5e-04    PilS N terminal
		>PilS_PF08805#PilS N terminal

Length = 185

Score = 31.8 bits (72), Expect = 5e-04
Identities = 9/39 (23%), Positives = 19/39 (48%)

Query: 15 KQAQRGFTLLEVTVALAIAAVLAVITSQVLHQRLAVQDN 53
K+ +G TL+EV + + + VLA ++ + +
Sbjct: 22 KEQDKGATLMEVLLVVGVIVVLAASAYKLYSMVQSNIQS 60


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_1052   BCTERIALGSPG  29     0.010    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 28.7 bits (64), Expect = 0.010
Identities = 12/28 (42%), Positives = 20/28 (71%), Gaps = 4/28 (14%)

Query: 4 RQTGLTLIELMVALALTAVLGIMLAALV 31
+Q G TL+E+MV + ++G+ LA+LV
Sbjct: 6 KQRGFTLLEIMVVI---VIIGV-LASLV 29


12  PP_1081  PP_1093  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
PP_1081   2       19       -1.487230  glutaredoxin-like protein
PP_1082   1       15       -0.339446  bacterioferritin
PP_1083   0       16       -0.153758  BFD (2Fe-2S)-binding domain-containing protein
PP_1084   1       15       -0.564548  anti-oxidant AhpCTSA family protein
PP_1085   -1      12       -0.847557  ribonuclease T
PP_1086   0       13       -1.302000  dihydroorotase
PP_1087   -1      14       -2.643230  OmpA family outer membrane protein
PP_1088   -2      17       -3.027697  argininosuccinate synthase
PP_1089   1       24       -2.122548  hypothetical protein
PP_1090   4       20       -0.705970  LuxR family transcriptional regulator
PP_1091   5       21       0.275204   hypothetical protein
PP_1092   2       15       0.504817   endonuclease III
PP_1093   3       14       1.010692   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
PP_1082   HELNAPAPROT  31     8e-04    Helicobacter neutrophil-activating protein A family ...
		>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature.

Length = 153

Score = 31.4 bits (71), Expect = 8e-04
Identities = 21/104 (20%), Positives = 41/104 (39%), Gaps = 10/104 (9%)

Query: 44 EYKESIDEMKHADKLIKRILFLEGIPN--VQDLGKLL------IGEHTKEMLECDLKIEQ 95
E + E D + +R+L + G P V++ + EM++ + +
Sbjct: 52 ELYDHAAE--TVDTIAERLLAIGGQPVATVKEYTEHASITDGGNETSASEMVQALVNDYK 109

Query: 96 QGLVDLKAAIAHCETAGDFGSRDVLEDILESEEEHIDWLETQLG 139
Q + K I E D + D+ ++E E+ + L + LG
Sbjct: 110 QISSESKFVIGLAEENQDNATADLFVGLIEEVEKQVWMLSSYLG 153


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
PP_1087   NAFLGMOTY  107    3e-29    Sodium-type flagellar protein MotY precursor signature.
		>NAFLGMOTY#Sodium-type flagellar protein MotY precursor signature.

Length = 293

Score = 107 bits (268), Expect = 3e-29
Identities = 71/252 (28%), Positives = 129/252 (51%), Gaps = 6/252 (2%)

Query: 42 FECRLSQPIEGFGSGTFVRRAGEQPV--FQLRSDSNALGAGTASLLAAAAPWQPGRGDIN 99
EC+L PI FG F RA ++ F+L+ SL++ PW+PG
Sbjct: 44 LECQLVHPIPSFGDAVFSSRASKKINLDFELKMRRPMGETRNVSLISMPPPWRPGEHADR 103

Query: 100 LGNVRMARSGVLFSSSQGQASRLINGLLDGR--STVVRNYTGEAGRPMEVRVLPVSFAKA 157
+ N++ + + Q A +++ L GR + +++ R +EV + V F
Sbjct: 104 ITNLKFFKQFDGYVGGQ-TAWGILSELEKGRYPTFSYQDWQSRDQR-IEVALSSVLFQSK 161

Query: 158 YNDYQVCAGKLLPMNYDQVRQTQVGFPGGGIELDAAAKARLDVILDYMKADPTVNHVELN 217
YN + C LL +++ + T + + G +L A+K RL I DY++ + ++ V +
Sbjct: 162 YNAFSDCIANLLKYSFEDIAFTILHYERQGDQLTKASKKRLAQIADYVRHNQDIDLVLVA 221

Query: 218 GHSDNSGNRLTNRDLSRRRALAVADYFKANGVPEEQITVRFHGERYPLAKNNSASNRARN 277
++D++ + ++ LS RRA ++ YF++ G+PE++I V+ +G+R P+A N + + +N
Sbjct: 222 TYTDSTDGKSESQSLSERRAESLRTYFESLGLPEDRIQVQGYGKRRPIADNGTPIGKDKN 281

Query: 278 RRVNIELDRVAV 289
RRV I L R V
Sbjct: 282 RRVVISLGRTQV 293


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
PP_1090   HTHFIS  81     4e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 81.0 bits (200), Expect = 4e-20
Identities = 40/158 (25%), Positives = 70/158 (44%), Gaps = 7/158 (4%)

Query: 3 KVLIVDDHPVIRLAVRMLMERHGYDVVAETDNGVAALQLTRQHLPDIVVLDIGIPKLDGL 62
+L+ DD IR + + R GYDV T N + D+VV D+ +P +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 63 EVIARMATFSPGSKVLVLTSQAPGNFSMRCMQAGAAGYVCKQQELTELLSAIKAVLSGYS 122
+++ R+ P VLV+++Q +++ + GA Y+ K +LTEL+ I L+
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA--- 120

Query: 123 YFPNQALHASRGRGAGSETDMVNRLSAREMTVLQQLAR 160
+ + + +V R SA + + LAR
Sbjct: 121 --EPKRRPSKLEDDSQDGMPLVGR-SAAMQEIYRVLAR 155


13  PP_1103  PP_1119  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
PP_1103   1       16       3.043171   DEAD/DEAH box helicase
PP_1104   0       17       2.429930   succinylglutamate desuccinylase/aspartoacylase
PP_1105   0       15       1.929507   ATP-dependent DNA ligase
PP_1106   -1      14       0.829592   RNA processing exonuclease
PP_1107   -1      12       -0.011118  hypothetical protein
PP_1108   -1      12       -0.007325  acylase
PP_1109   1       16       -2.109950  GntR family transcriptional regulator
PP_1110   1       17       -2.585880  serine O-acetyltransferase
PP_1111   -1      17       -3.420378  synthetase
PP_1112   0       23       -5.102905  major facilitator superfamily protein
PP_1113   2       29       -7.224905  pyridoxal-phosphate dependent enzyme family
PP_1114   3       32       -7.523457  SEC-C domain-containing protein
PP_1115   2       30       -7.034349  lipoprotein
PP_1116   2       30       -6.500921  resolvase site-specific recombinase
PP_1117   1       25       -4.708994  hypothetical protein
PP_1118   0       17       -2.720059  recombinase-like protein
PP_1119   2       14       1.073571   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_1112   TCRTETB  33     0.002    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 33.3 bits (76), Expect = 0.002
Identities = 23/101 (22%), Positives = 39/101 (38%), Gaps = 3/101 (2%)

Query: 53 LCLMLATYPVSRLMSRIGRKKAFMLGAIPLALSGVSGFLAVEHQHFPTLVLSHSALGV-Y 111
L + T +L ++G K+ + G I V GF V H F L+++ G
Sbjct: 60 LTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGF--VGHSFFSLLIMARFIQGAGA 117

Query: 112 IAFANFNRFAATDNLSQALKPKALSLVVAGGVIAAVVGPTL 152
AF + + + KA L+ + + VGP +
Sbjct: 118 AAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAI 158


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_1114   SECA  58     6e-14    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 57.6 bits (139), Expect = 6e-14
Identities = 19/42 (45%), Positives = 22/42 (52%), Gaps = 1/42 (2%)

Query: 23 GHVHGPHCNHGHQEPIRNALKDVGRNDPCPCGSEKKYKKCHG 64
H + + VGRNDPCPCGS KKYK+CHG
Sbjct: 858 SHQDDDS-AAAAALAAQTGERKVGRNDPCPCGSGKKYKQCHG 898


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
PP_1115   PERTACTIN  32     7e-04    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 32.4 bits (73), Expect = 7e-04
Identities = 17/48 (35%), Positives = 23/48 (47%), Gaps = 5/48 (10%)

Query: 62 HLRVDNPNDSRLFIRNVSYAIRLNDLLLVQDEAS----VW-RSVGGHA 104
L VD S LF NV + L+D L+V +AS +W R+ G
Sbjct: 468 VLMVDTLAGSGLFRMNVFADLGLSDKLVVMRDASGQHRLWVRNSGSEP 515


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_1116   PYOCINKILLER  31     0.006    Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 30.5 bits (68), Expect = 0.006
Identities = 20/97 (20%), Positives = 39/97 (40%), Gaps = 5/97 (5%)

Query: 90 AQLMEQVDFKVATMPQADKFQLHLFAALAQQEREFIATRTKEALASLQRRAD---AGDAV 146
A+ + + + + + + L + + EA++SLQ R + A A
Sbjct: 154 AEEIGEQAVREGNINGPEAYMRFLDREMEGLTAAYNVKLFTEAISSLQIRMNTLTAAKAS 213

Query: 147 AQQKVANRA--EAQAKGRAVADITAANNIRMSKINTY 181
+ AN+A +A A+ + A+ A + NTY
Sbjct: 214 IEAAAANKAREQAAAEAKRKAEEQARQQAAIRAANTY 250


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
PP_1117   BLACTAMASEA  32     0.003    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 32.5 bits (74), Expect = 0.003
Identities = 13/65 (20%), Positives = 27/65 (41%), Gaps = 11/65 (16%)

Query: 169 SPSAVVEKVIKYLGKDLVQEVVKACKIKPTQRSQLTEWIIDNYNPSDESHRAALPDPEKV 228
+P+++ + K L + + QL +W++D+ + R+ LP +
Sbjct: 178 TPASMAATLRKLLTSQRLS---------ARSQRQLLQWMVDD-RVAGPLIRSVLPAGWFI 227

Query: 229 HDLKT 233
D KT
Sbjct: 228 AD-KT 231


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_1119   SECA  48     4e-09    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 47.9 bits (114), Expect = 4e-09
Identities = 14/20 (70%), Positives = 16/20 (80%)

Query: 134 KAGRNDPCPCASGHKFKKCC 153
K GRNDPCPC SG K+K+C
Sbjct: 878 KVGRNDPCPCGSGKKYKQCH 897



Score = 28.7 bits (64), Expect = 0.011
Identities = 8/14 (57%), Positives = 8/14 (57%)

Query: 7 CPCGSGNLLDACCG 20
CPCGSG C G
Sbjct: 885 CPCGSGKKYKQCHG 898


14  PP_1171  PP_1176  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
PP_1171   3       11       -0.478044  NAD-dependent epimerase/dehydratase
PP_1172   2       13       -1.900971  hypothetical protein
PP_1173   2       12       -2.414293  porin
PP_1174   3       25       -1.627653  hypothetical protein
PP_1175   2       26       -1.591757  hypothetical protein
PP_1176   2       25       -0.854654  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_1171   NUCEPIMERASE  69     2e-15    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 68.7 bits (168), Expect = 2e-15
Identities = 41/179 (22%), Positives = 77/179 (43%), Gaps = 23/179 (12%)

Query: 8 RLLLTGAAGGLGKVLRERL-KGYAEVLRLSDISP----------MAPAAGPHEEVITCDL 56
+ L+TGAAG +G + +RL + +V+ + +++ + A P + DL
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 57 ADKAAVHTLVE--GVDAIIHFG---GV--STE--HAFEEILGPNICGVFHVYEAARKHGV 107
AD+ + L + + V S E HA+ + N+ G ++ E R + +
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADS---NLTGFLNILEGCRHNKI 118

Query: 108 KRIIFASSNHTIGFYRQDERIDAHAPRRPDSYYGLSKCYGEDVASFYFDRYGIETVSIR 166
+ +++ASS+ G R+ + P S Y +K E +A Y YG+ +R
Sbjct: 119 QHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLR 177


15  PP_1208  PP_1221  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
PP_1208   3       18       -1.421228  hypothetical protein
PP_1209   4       20       -1.265161  cold-shock domain-contain protein
PP_1210   3       19       -0.728833  DNA-binding stress protein
PP_1211   2       20       -0.225752  hypothetical protein
PP_1212   2       19       0.223265   hypothetical protein
PP_1213   3       17       0.154649   aspartyl-tRNA synthetase
PP_1214   0       14       0.571983   hypothetical protein
PP_1215   0       13       0.677236   Holliday junction resolvase
PP_1216   3       12       -0.220687  Holliday junction DNA helicase RuvA
PP_1217   1       11       -0.870900  Holliday junction DNA helicase RuvB
PP_1218   4       15       -1.828528  hypothetical protein
PP_1219   2       16       -1.751584  biopolymer transport protein TolQ
PP_1220   2       15       -2.163855  biopolymer transport protein TolR
PP_1221   3       15       -1.700942  biopolymer transport protein TolA
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
PP_1210   HELNAPAPROT  156    9e-52    Helicobacter neutrophil-activating protein A family ...
		>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature.

Length = 153

Score = 156 bits (395), Expect = 9e-52
Identities = 50/147 (34%), Positives = 79/147 (53%)

Query: 8 SEEDRKSIVDGLSRLLSDTYVLYLKTHNFHWNVTGPSFRTLHLMFEEQYNELALAVDSIA 67
++ ++ + + L+ LS+ ++LY K H FHW V GP F TLH FEE Y+ A VD+IA
Sbjct: 6 AKTNQTLVENSLNTQLSNWFLLYSKLHRFHWYVKGPHFFTLHEKFEELYDHAAETVDTIA 65

Query: 68 ERIRALGFPAPGSYAFYARHSSIKEEEGVPPADEMIRQLVQGQEAVVRTARSIFPVVDKV 127
ER+ A+G + Y H+SI + A EM++ LV + + ++ + + ++
Sbjct: 66 ERLLAIGGQPVATVKEYTEHASITDGGNETSASEMVQALVNDYKQISSESKFVIGLAEEN 125

Query: 128 SDEPTADLLTQRMQVHEKTAWMLRVLL 154
D TADL ++ EK WML L
Sbjct: 126 QDNATADLFVGLIEEVEKQVWMLSSYL 152


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
PP_1221   IGASERPTASE  63     7e-13    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 63.2 bits (153), Expect = 7e-13
Identities = 34/230 (14%), Positives = 74/230 (32%), Gaps = 4/230 (1%)

Query: 37 TPELPPSKPIVQATLYQLKSKSQATTQTNQKIAGEAKKTASRQTEVEQLEQKKVEQEAVK 96
T P+ ++ A + A T S TE K+ + K
Sbjct: 995 TNITTPNNIQADVPSVPSNNEEIARVD-EAPVPPPAPATPSETTETVAENSKQESKTVEK 1053

Query: 97 AAEQKKADAAQKAEEAREAAEAKKAEDAAKAAEAAKAAEAKKAAEAKKADEAKKAAEKQQ 156
+ AQ E A+EA + + E A++ K + + E ++++
Sbjct: 1054 NEQDATETTAQNREVAKEAK--SNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEK 1111

Query: 157 ADIAKKKAEDEAKKKAEEEAKKAAAEEAKKKAAEDAKKKAAEEAKKKAAEDAKKKAAAED 216
A + +K ++ K ++ K+ +E + AE A++ K+ A E
Sbjct: 1112 AKVETEKTQEVPKVTSQVSPKQEQSETVQP-QAEPARENDPTVNIKEPQSQTNTTADTEQ 1170

Query: 217 AKKKAAEEAKKKAAADAQKKKAQEAARKAAEDKKAQALAELLSDTTERQQ 266
K+ + ++ A + S+++ + +
Sbjct: 1171 PAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPK 1220



Score = 61.2 bits (148), Expect = 3e-12
Identities = 36/201 (17%), Positives = 71/201 (35%), Gaps = 9/201 (4%)

Query: 69 AGEAKKTASRQTEVEQLEQKKVEQEAVKAAEQKKADAAQKAEEAREAAEAKKAEDAAKAA 128
+ Q +V + E V A A +E AE K E
Sbjct: 994 TTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEK 1053

Query: 129 EAAKAAEAKK-AAEAKKADEAKKAAEKQQADIAKKKAEDEAKKKAEEEAKKAAAEEAKKK 187
A E E K ++ A Q ++A+ +E K+ E K+ A E ++K
Sbjct: 1054 NEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSE--TKETQTTETKETATVEKEEK 1111

Query: 188 AAEDAKKKAAEEAKKKAAEDAKKKAAAEDAKKKAAEEAKKKAAADAQKKKAQE----AAR 243
A + +K +E K ++ + K+ +E + +A + + ++ ++Q
Sbjct: 1112 AKVETEKT--QEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTE 1169

Query: 244 KAAEDKKAQALAELLSDTTER 264
+ A++ + + TT
Sbjct: 1170 QPAKETSSNVEQPVTESTTVN 1190


16  PP_1301  PP_1324  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
PP_1301   0       12       -3.037869  2-alkenal reductase
PP_1302   0       13       -3.698675  hypothetical protein
PP_1303   -1      15       -3.626121  sulfate adenylyltransferase subunit 2
PP_1304   -2      19       -3.612770  bifunctional sulfate adenylyltransferase subunit
PP_1305   0       28       -4.432579  Pyocin S-type immunity protein
PP_1306   -2      17       -2.394091  pyocin S-type Killer domain-containing protein
PP_1307   -2      12       -0.244724  AsnC family transcriptional regulator
PP_1308   -2      14       -0.189466  methionine gamma-lyase
PP_1309   -1      18       0.339100   hypothetical protein
PP_1310   -1      18       -0.215687  hypothetical protein
PP_1311   1       19       -0.885283  tryptophanyl-tRNA synthetase
PP_1312   2       17       -1.473821  AFG1 family ATPase
PP_1313   0       16       -2.110539  AraC family transcriptional regulator
PP_1314   2       18       -2.831197  aldo/keto reductase
PP_1315   3       18       -3.892849  50S ribosomal protein L13
PP_1316   2       16       -3.457201  30S ribosomal protein S9
PP_1317   0       12       -3.022973  ubiquinol-cytochrome c reductase, iron-sulfur
PP_1318   -1      12       -3.292475  ubiquinol--cytochrome c reductase, cytochrome b
PP_1319   -2      13       -2.491035  ubiquinol--cytochrome c reductase, cytochrome
PP_1320   0       16       1.190071   stringent starvation protein A
PP_1321   1       13       1.767082   ClpXP protease specificity-enhancing factor
PP_1322   2       12       1.837138   transport-associated protein
PP_1323   3       11       1.957986   phosphoheptose isomerase
PP_1324   4       11       1.980223   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
PP_1301   V8PROTEASE  67     2e-14    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 67.3 bits (164), Expect = 2e-14
Identities = 37/194 (19%), Positives = 64/194 (32%), Gaps = 38/194 (19%)

Query: 119 ESSLGSAVIMSPEGYLLTNNHVTSGADQIVVALK------------DGRETLARVIGSDP 166
+ + S V++ LLTN HV ALK +G T ++
Sbjct: 100 GTFIASGVVVGK-DTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSG 158

Query: 167 ETDLAVLKIDL--------KNLPAITIGRSDTIHIGDVSLAIGNPFGVGQTVTMGIISAT 218
E DLA++K + + T+ + + G P TM +
Sbjct: 159 EGDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGD-KPVATMW--ESK 215

Query: 219 GRNQLGLNNYEDFIQTDAAINPGNSGGALVDANGNLIGINTAIFSKSGGSQGIGFAIP-- 276
G+ L +Q D + GNSG + + +IGI+ G+
Sbjct: 216 GK-ITYLKGE--AMQYDLSTTGGNSGSPVFNEKNEVIGIHWG---------GVPNEFNGA 263

Query: 277 VKLALEVMKSIVEH 290
V + V + ++
Sbjct: 264 VFINENVRNFLKQN 277


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_1304  TCRTETOQM  73  1e-15  Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 73.0 bits (179), Expect = 1e-15
Identities = 51/151 (33%), Positives = 69/151 (45%), Gaps = 19/151 (12%)

Query: 33 VDDGKSTLIGRLLHDSKMIYEDHLEAITRDSKKVGTTGEEVDLALLV-DGLQAEREQGIT 91
VD GK+TL LL++S I ++G VD D ER++GIT
Sbjct: 12 VDAGKTTLTESLLYNSGAI------------TELG----SVDKGTTRTDNTLLERQRGIT 55

Query: 92 IDVAYRYFSTAKRKFIIADTPGHEQYTRNMATGASTCDLAIILVDARYGVQTQTRRHSYI 151
I F K I DTPGH + + S D AI+L+ A+ GVQ QTR +
Sbjct: 56 IQTGITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHA 115

Query: 152 ASLLGIKHIVVAVNKMDLKGFD-EGVFESIK 181
+GI I +NK+D G D V++ IK
Sbjct: 116 LRKMGIPTIFF-INKIDQNGIDLSTVYQDIK 145


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_1306  PYOCINKILLER  253  7e-77  Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 253 bits (648), Expect = 7e-77
Identities = 152/441 (34%), Positives = 221/441 (50%), Gaps = 43/441 (9%)

Query: 351 ATQNLMQRAAEVENLQAQLAAQEEA-ARQQAEAERLA-----------AEAERQRQEALR 398
A N+ + +LQ ++ A A +A A A AE + ++Q A+R
Sbjct: 186 AAYNVKLFTEAISSLQIRMNTLTAAKASIEAAAANKAREQAAAEAKRKAEEQARQQAAIR 245

Query: 399 RSVSYINEARLAVSTPA----VIPIGATTFAVAEAAYSALAESIAAALTRLVATTVPSVA 454
+ +Y A +V A +I + ++A+A A+A L R++A+ +A
Sbjct: 246 AANTYAMPANGSVVATAAGRGLIQVAQGAASLAQAISDAIA-----VLGRVLASAPSVMA 300

Query: 455 VGTLAMA--------WPSTLGNSERQYLISTPLDSLSPAGGPDLAALAASSTSIDLPYLL 506
VG ++ W +Y + L +L A+A +S ++DLP L
Sbjct: 301 VGFASLTYSSRTAEQWQD-QTPDSVRYALGMDAAKLGLPPSVNLNAVAKASGTVDLPMRL 359

Query: 507 AGVENENELDLYVVPSG-----KPVAVRAATFDSERQVY-----SLALDNPQRILTWTPA 556
N L VV + K V VR A +++ +Y S + P ILTWTPA
Sbjct: 360 TNEARGNTTTLSVVSTDGVSVPKAVPVRMAAYNATTGLYEVTVPSTTAEAPPLILTWTPA 419

Query: 557 SAPGGDEGNSTSLPLVPPGTVVYTGSSLNPVVTEQEGYPALDLLDQERLIITFPMDSGLP 616
S PG +ST+ P+VP VY G++L PV E YP + L ++ LII FP DSG+
Sbjct: 420 SPPGNQNPSSTT-PVVPKPVPVYEGATLTPVKATPETYPGVITLPED-LIIGFPADSGIK 477

Query: 617 PILVVFKSPRYEAGTSTGHGAQVSDTWRKEAASLEGAPIPAQIAELLKSREFRNFDAFRR 676
PI V+F+ PR G +TG G VS W A+ EGAPIP+QIA+ L+ + F+N+ FR
Sbjct: 478 PIYVMFRDPRDVPGAATGKGQPVSGNWLGAASQGEGAPIPSQIADKLRGKTFKNWRDFRE 537

Query: 677 QFWKAVANDPELSKQFDEMSLSRMRKNGYSPIVDFPDSHLSQKTFILHHVIPISEGGGVY 736
QFW AVANDPELSKQF+ SL+ MR G +P V + + +HH + +++GGGVY
Sbjct: 538 QFWIAVANDPELSKQFNPGSLAVMRD-GGAPYVRESEQAGGRIKIEIHHKVRVADGGGVY 596

Query: 737 DMDNIRIVTPLSHNSIHYGTK 757
+M N+ VTP H IH G K
Sbjct: 597 NMGNLVAVTPKRHIEIHKGGK 617


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_1310  PREPILNPTASE  30  0.005  Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family

signature.
Length = 290

Score = 30.2 bits (68), Expect = 0.005
Identities = 20/68 (29%), Positives = 26/68 (38%), Gaps = 5/68 (7%)

Query: 59 AGYVTLRFNYRGVGQSAGSHDMGAGEVADAEAAAAWLRARHPGLPLVLMGFSFGG---FV 115
AGY+ L Y G MG G+ A AWL + LP+VL+ S G +
Sbjct: 190 AGYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQ--ALPIVLLLSSLVGAFMGI 247

Query: 116 ATSLAGRL 123
L
Sbjct: 248 GLILLRNH 255


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_1314  HELNAPAPROT  29  0.015  Helicobacter neutrophil-activating protein A family ...
		>HELNAPAPROT#Helicobacter neutrophil-activating protein A family

signature.
Length = 153

Score = 29.1 bits (65), Expect = 0.015
Identities = 20/96 (20%), Positives = 38/96 (39%), Gaps = 16/96 (16%)

Query: 104 KHNRQHIVAALEESLKRLQTDRIDLY----QLHWPERSTNFFGKLGYQHLPQDHFTPLEE 159
N + +E SL ++ LY + HW + +FF ++ + E
Sbjct: 3 TENAKTNQTLVENSLNTQLSNWFLLYSKLHRFHWYVKGPHFF---TLHEKFEELYDHAAE 59

Query: 160 TLEVLDEQVRAGKIRHIGLSNETPWGTMK-FLQLAE 194
T++ + E++ A IG P T+K + + A
Sbjct: 60 TVDTIAERLLA-----IGGQ---PVATVKEYTEHAS 87


17  PP_1531  PP_1581  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
PP_1531  2  24  -1.582302  arsenate reductase
PP_1532  4  26  -1.694452  phage integrase
PP_1533  4  26  -0.995108  excisionase
PP_1534  3  28  -1.663590  hypothetical protein
PP_1535  3  28  -1.341016  methyltransferase
PP_1536  3  31  -0.371023  hypothetical protein
PP_1537  2  30  -0.706354  hypothetical protein
PP_1538  2  28  -0.485078  hypothetical protein
PP_1539  4  29  -0.966192  hypothetical protein
PP_1540  3  27  -0.340621  hypothetical protein
PP_1541  3  27  -0.431951  methyltransferase
PP_1542  4  27  -0.962907  hypothetical protein
PP_1543  5  24  -1.143293  hypothetical protein
PP_1544  3  27  -3.086879  hypothetical protein
PP_1545  2  25  -3.108484  hypothetical protein
PP_1546  2  21  -3.470409  hypothetical protein
PP_1547  1  18  -2.136095  hypothetical protein
PP_1548  3  17  -1.760539  hypothetical protein
PP_1549  1  16  -1.274336  Cro/CI family transcriptional regulator
PP_1550  1  14  1.044822  Cro/CI family transcriptional regulator
PP_1551  0  12  1.089004  phage replication protein O
PP_1552  -2  15  1.995048  IstB domain-containing protein ATP-binding
PP_1553  -1  15  1.013173  hypothetical protein
PP_1554  2  17  0.711979  hypothetical protein
PP_1555  3  19  0.365327  phage integrase
PP_1556  3  27  -0.361886  hypothetical protein
PP_1557  2  26  0.458452  hypothetical protein
PP_1558  2  20  -0.519290  antitermination protein Q
PP_1559  1  18  -0.119396  phage holin
PP_1560  2  20  0.152871  hypothetical protein
PP_1561  1  19  0.253026  phage holin
PP_1562  3  19  0.227393  phage terminase small subunit
PP_1563  3  19  -0.176849  phage terminase, large subunit
PP_1564  3  22  0.399202  hypothetical protein
PP_1565  3  22  0.420979  HK97 family phage portal protein
PP_1566  3  22  0.232496  head maturation protease
PP_1567  3  20  0.531489  HK97 family phage major capsid protein
PP_1568  0  21  0.785725  hypothetical protein
PP_1569  2  22  0.948421  hypothetical protein
PP_1570  2  22  1.002637  head-tail adaptor
PP_1571  2  26  -1.219448  hypothetical protein
PP_1572  2  27  -2.945349  hypothetical protein
PP_1573  2  26  -3.052401  major tail protein
PP_1574  2  21  -1.352952  hypothetical protein
PP_1575  2  21  -1.613011  hypothetical protein
PP_1576  3  22  -2.538826  immunity protein
PP_1577  3  21  -2.257039  lambda family phage tail tape measure protein
PP_1578  5  19  -1.631131  hypothetical protein
PP_1579  4  20  -1.307940  hypothetical protein
PP_1580  4  26  -3.282940  hypothetical protein
PP_1581  3  26  -3.314306  BNR repeat-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_1574  TYPE3OMBPROT  27  0.037  Type III secretion system outer membrane B protein ...
		>TYPE3OMBPROT#Type III secretion system outer membrane B protein

family signature.
Length = 538

Score = 27.0 bits (59), Expect = 0.037
Identities = 13/49 (26%), Positives = 25/49 (51%), Gaps = 4/49 (8%)

Query: 50 VISAASKQ-DSIAARIAASICDEHGNPVFSSPLDITHGPLDPVELEKDP 97
+ A++++ D IA + + D+ G +FS I HG + L+K+
Sbjct: 192 ICCASTRESDHIANMWLSKVVDDEGKEIFSG---IRHGVISAYGLKKNS 237


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_1577  PF02370  29  0.036  M protein repeat
		>PF02370#M protein repeat

Length = 168

Score = 29.3 bits (65), Expect = 0.036
Identities = 20/88 (22%), Positives = 37/88 (42%), Gaps = 5/88 (5%)

Query: 408 KKLLGQFDTAEEGYKRQIALINTETDKRKEATEVAKLQFELESGNLAGLSAKQQERLKGL 467
+ L+G+ + + I +RKE E + + + E + QE+ K
Sbjct: 51 RALMGENQDLRKREGQYQDKIEELEKERKEKQERPERREKFERQHQD---KHYQEQQKKH 107

Query: 468 AAELDQLKKLKQAKEDDKEVAGFDASVK 495
E QL+ KQ +K+++ DAS +
Sbjct: 108 QQEQQQLEAEKQKLAKEKQIS--DASRQ 133


18  PP_1666  PP_1681  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
PP_1666  4  12  1.743210  hypothetical protein
PP_1667  2  15  0.654354  hypothetical protein
PP_1668  0  13  2.323037  DNA replication initiation factor
PP_1669  0  12  2.780378  NLP/P60 protein
PP_1670  2  16  4.220897  NLP/P60 protein
PP_1671  2  16  5.177803  hypothetical protein
PP_1672  1  14  4.232174  cob(I)yrinic acid a,c-diamide
PP_1673  2  14  4.470560  cobyrinic acid a,c-diamide synthase
PP_1674  2  13  4.280769  cob(II)yrinic acid a,c-diamide reductase
PP_1675  3  14  4.388980  cobalamin biosynthesis protein
PP_1676  3  14  3.718234  threonine-phosphate decarboxylase
PP_1677  2  16  2.602998  cobyric acid synthase
PP_1678  1  17  2.993553  adenosylcobinamide kinase
PP_1679  1  17  2.678059  nicotinate-nucleotide--dimethylbenzimidazole
PP_1680  2  19  1.986113  alpha-ribazole-5'-phosphate phosphatase
PP_1681  2  20  0.831525  cobalamin synthase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_1670  PF04619  30  0.002  Dr-family adhesin
		>PF04619#Dr-family adhesin

Length = 160

Score = 30.3 bits (68), Expect = 0.002
Identities = 20/66 (30%), Positives = 30/66 (45%), Gaps = 4/66 (6%)

Query: 94 REMIVMRAPTVDASSLQSGDLVFFATSGGSQVSHAGIYVGDGRFVHAPSTGGTVRLDYLS 153
R+ + + D S+ + + VF+ GS GIYV DG+ + P T+ L
Sbjct: 99 RDKLYVNIRPTDNSAWTTDNGVFYKNDVGSWGGIIGIYV-DGQQTNTPPGNYTLTLT--- 154

Query: 154 NSYWAK 159
YWAK
Sbjct: 155 GGYWAK 160


19  PP_1764  PP_1815  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
PP_1764  2  22  1.253049  phosphoglycolate phosphatase
PP_1765  1  20  1.509928  3-demethylubiquinone-9 3-methyltransferase
PP_1766  2  18  1.327740  methylthioribose-1-phosphate isomerase
PP_1767  2  21  -0.136330  DNA gyrase subunit A
PP_1768  3  21  -0.600401  phosphoserine aminotransferase
PP_1769  3  19  -0.772823  chorismate mutase
PP_1770  0  18  -0.956169  bifunctional cyclohexadienyl dehydrogenase/
PP_1771  0  18  -3.534692  cytidylate kinase
PP_1772  -1  20  -4.330600  30S ribosomal protein S1
PP_1773  -1  25  -5.255356  integration host factor subunit beta
PP_1774  -1  26  -5.886051  hypothetical protein
PP_1775  -2  26  -5.755208  beta-lactamase domain-containing protein
PP_1776  -1  25  -6.104235  mannose-6-phosphate isomerase
PP_1777  -1  25  -6.065541  phosphomannomutase
PP_1778  -1  24  -6.137924  lipopolysaccharide ABC export system, permease
PP_1779  -1  24  -5.318040  lipopolysaccharide ABC export system,
PP_1780  -1  23  -4.854431  mannosyltransferase
PP_1781  0  31  -6.991254  O-acyltransferase
PP_1782  2  39  -9.080228  dTDP-4-dehydrorhamnose 3,5-epimerase
PP_1783  3  44  -11.429872  glucose-1-phosphate thymidylyltransferase
PP_1784  4  48  -12.198177  dTDP-4-dehydrorhamnose reductase
PP_1785  5  50  -13.585220  dTDP-glucose 4,6-dehydratase
PP_1786  4  52  -13.758641  glycosyl transferase family protein
PP_1787  4  52  -12.483417  hypothetical protein
PP_1788  2  45  -9.664149  hypothetical protein
PP_1789  0  37  -6.935773  HAD superfamily hydrolase
PP_1790  0  34  -5.879358  acylneuraminate cytidylyltransferase
PP_1791  -1  29  -4.904416  aldolase
PP_1792  -1  25  -4.046583  glycosyl transferase family protein
PP_1793  0  19  -3.839308  glycosyl transferase family protein
PP_1794  0  16  -3.185655  hypothetical protein
PP_1795  0  16  -2.940870  hypothetical protein
PP_1797  0  19  -3.524995  HlyD family secretion protein
PP_1798  1  21  -2.835960  outer membrane efflux protein
PP_1799  0  22  -2.651777  GDP-mannose 4,6-dehydratase
PP_1800  0  26  -3.136599  oxidoreductase Rmd
PP_1801  1  28  -4.084828  glycosyl transferase WbpY
PP_1802  2  28  -4.810507  glycosyl transferase WbpZ
PP_1803  1  19  -3.903614  UDP-sugar epimerase
PP_1804  1  19  -4.645716  glycosyl transferase WbpL
PP_1805  -1  18  -5.158033  polysaccharide biosynthesis protein CapD
PP_1806  0  13  -3.663710  KpsF/GutQ family protein
PP_1807  -1  12  -2.303362  2-dehydro-3-deoxyphosphooctonate aldolase
PP_1808  0  14  -1.167548  glucose-6-phosphate isomerase
PP_1809  2  20  -1.319415  hypothetical protein
PP_1810  2  18  -0.085466  hypothetical protein
PP_1811  3  18  0.856561  UDP-N-acetylglucosamine 2-epimerase
PP_1812  1  18  1.449432  hypothetical protein
PP_1813  3  18  -0.445428  competence protein ComEA
PP_1814  2  16  0.988783  hypothetical protein
PP_1815  2  16  1.543950  orotidine 5'-phosphate decarboxylase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_1773  DNABINDINGHU  116  7e-38  Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 116 bits (292), Expect = 7e-38
Identities = 34/89 (38%), Positives = 53/89 (59%), Gaps = 1/89 (1%)

Query: 2 TKSELIERIVTHQGLLSSKDVELAIKTMLEQMSQCLATGDRIEIRGFGSFSLHYRAPRVG 61
K +LI + V L+ KD A+ + +S LA G+++++ GFG+F + RA R G
Sbjct: 3 NKQDLIAK-VAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKG 61

Query: 62 RNPKTGQSVSLEGKFVPHFKPGKELRDRV 90
RNP+TG+ + ++ VP FK GK L+D V
Sbjct: 62 RNPQTGEEIKIKASKVPAFKAGKALKDAV 90


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_1775  FLGPRINGFLGI  30  0.017  Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 30.3 bits (68), Expect = 0.017
Identities = 15/43 (34%), Positives = 21/43 (48%)

Query: 403 LDGQMYEIRAKVVTVGGFSGHADQAGLVGFVNAMSRVPGRVVL 445
DGQ+Y + + V GFS D A L V +RVP ++
Sbjct: 137 ADGQIYAVAQGALIVNGFSAQGDAATLTQGVTTSARVPNGAII 179


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_1780  RTXTOXIND  31  0.037  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 31.3 bits (71), Expect = 0.037
Identities = 16/87 (18%), Positives = 34/87 (39%), Gaps = 6/87 (6%)

Query: 284 FQQVGESVESCHSALSQVNLELGQHQAALVDLSQEVERHRAELESLKQVVNGLNHHLSDT 343
FQ V E ++L + Q+Q +++ RAE ++ +N +
Sbjct: 176 FQNVSEEEVLRLTSLIKEQFSTWQNQKYQ--KELNLDKKRAERLTVLARINRYENLSRVE 233

Query: 344 QQRLA----LADERMAATQAWIDKQQA 366
+ RL L ++ A A ++++
Sbjct: 234 KSRLDDFSSLLHKQAIAKHAVLEQENK 260


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_1784  NUCEPIMERASE  43  7e-07  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 43.2 bits (102), Expect = 7e-07
Identities = 43/200 (21%), Positives = 70/200 (35%), Gaps = 32/200 (16%)

Query: 1 MKILLLGKNGQVGWELQRALAPLG-EVIALD-----------RQGAEGLC--------GD 40
MK L+ G G +G+ + + L G +V+ +D + E L D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 41 LSNLDGLAATIRQLAPDVIVNAAAYTAVDKA-ESDQALAAMINAAAPAVLARETAALGAW 99
L++ +G+ + + + AV + E+ A A +L
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 100 LIHYSTDYVFDGSGSQRWEETAPTG-PLSVYGRTKLEGE-HAILAS---GAKAVVLRTSW 154
L++ S+ V+ + + P+S+Y TK E A S G A LR
Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFFT 180

Query: 155 VYAARG------HNFAKTML 168
VY G F K ML
Sbjct: 181 VYGPWGRPDMALFKFTKAML 200


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_1785  NUCEPIMERASE  178  4e-55  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 178 bits (452), Expect = 4e-55
Identities = 91/358 (25%), Positives = 146/358 (40%), Gaps = 43/358 (12%)

Query: 2 ILVTGGAGFIGSNFVLQWCAHNEEPVLNLDALT--YAGNL--ANLQPLEGNPQHRFVQGN 57
LVTG AGFIG + + + V+ +D L Y +L A L+ L P +F + +
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQ-VVGIDNLNDYYDVSLKQARLELL-AQPGFQFHKID 60

Query: 58 ICDAALLTKLFAEHRPRAVVHFAAESHVDRSITGPEAFVETNVMGTFRLLEAARAHWNSL 117
+ D +T LFA V V S+ P A+ ++N+ G +LE R N +
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRH--NKI 118

Query: 118 EGAEKEAFRFLHVSTDEVYGTLGPNDPAFTETTPYAPNSPYSASKAASDHLVRSYFHTYG 177
+ L+ S+ VYG L P T+ + P S Y+A+K A++ + +Y H YG
Sbjct: 119 Q-------HLLYASSSSVYG-LNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYG 170

Query: 178 MPVLTTNCSNNYGPLHFPEKLIPLMIVNALAGKALPVYGDGQQIRDWLYVEDHCSGIRRV 237
+P YGP P+ + L GK++ VY G+ RD+ Y++D I R+
Sbjct: 171 LPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRL 230

Query: 238 LEAGAFGETYNIGGWNEKANIDIVRTLCSLLDEMAPAASRQVINQKTGEPVE--QYAELI 295
+ +T A A A +V N PVE Y + +
Sbjct: 231 QDVIPHADTQWTVETGTPA---------------ASIAPYRVYNIGNSSPVELMDYIQAL 275

Query: 296 A----------YVTDRPGHDRRYAIDARKIERELGWKPAETFETGIRKTVAWYLANQK 343
+ +PG + D + + +G+ P T + G++ V WY K
Sbjct: 276 EDALGIEAKKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDFYK 333


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_1786  SYCDCHAPRONE  40  2e-05  Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD

chaperone signature.
Length = 168

Score = 39.5 bits (92), Expect = 2e-05
Identities = 21/105 (20%), Positives = 38/105 (36%), Gaps = 12/105 (11%)

Query: 211 KLGMHYLSRHKVQEGRILLEAALSIAPS-AEVFNRLGGSYMEDGHFSIALQYFNAAAQL- 268
L + K ++ + +A + + F LG G + +A+ ++ A +
Sbjct: 41 SLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMD 100

Query: 269 ---PNPPVWTAFNQAHCLAKLHQLDEAIRVLSAG---IATNPNHR 307
P P F+ A CL + +L EA L IA +
Sbjct: 101 IKEPRFP----FHAAECLLQKGELAEAESGLFLAQELIADKTEFK 141


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_1791  adhesinb  33  0.003  Adhesin B signature.
		>adhesinb#Adhesin B signature.

Length = 310

Score = 32.5 bits (74), Expect = 0.003
Identities = 9/35 (25%), Positives = 16/35 (45%)

Query: 280 YLAGKYGIHPTYIQEMLGDSRFGEEDILAVIEYLR 314
Y + Y + YI E+ + + I ++E LR
Sbjct: 211 YFSKAYNVPSAYIWEINTEEEGTPDQIKTLVEKLR 245


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_1794  RTXTOXINA  48  7e-08  Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family

signature.
Length = 1024

Score = 48.0 bits (114), Expect = 7e-08
Identities = 29/113 (25%), Positives = 47/113 (41%), Gaps = 11/113 (9%)

Query: 136 NGDDVITVKGDQNTLIDAGDGNDTIVTGNGDNVVIAGAGNNNVTTGTGDDTI-------- 187
+G+D + N + G+G+D + G+G++ +I AGNN + G GDD
Sbjct: 753 DGNDRLY-GDKGNDTLSGGNGDDQLYGGDGNDKLIGVAGNNYLNGGDGDDEFQVQGNSLA 811

Query: 188 --ILSGSNHADIVNAGAGYDVVQLDGSRDDYAFAVGNNFNVNLTGNQTASITD 238
+L G D + G D++ D GN+ L+G I D
Sbjct: 812 KNVLFGGKGNDKLYGSEGADLLDGGEGDDLLKGGYGNDIYRYLSGYGHHIIDD 864



Score = 47.7 bits (113), Expect = 8e-08
Identities = 27/94 (28%), Positives = 44/94 (46%), Gaps = 2/94 (2%)

Query: 145 GDQNTLIDAGDGNDTIVTGNGDNVVIAGAGNNNVTTGTGDDTIILSGSNHADIVNAGAGY 204
D + LI+ DGND + G++ + G G++ + G G+D L G + +N G G
Sbjct: 743 ADGDDLIEGNDGNDRLYGDKGNDTLSGGNGDDQLYGGDGNDK--LIGVAGNNYLNGGDGD 800

Query: 205 DVVQLDGSRDDYAFAVGNNFNVNLTGNQTASITD 238
D Q+ G+ G N L G++ A + D
Sbjct: 801 DEFQVQGNSLAKNVLFGGKGNDKLYGSEGADLLD 834


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_1797  RTXTOXIND  323  e-108  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 323 bits (829), Expect = e-108
Identities = 96/422 (22%), Positives = 176/422 (41%), Gaps = 7/422 (1%)

Query: 24 RRIGLTVVFVTFGIFGTWAAFAPLSNAVHGTGVVTVQNYRKTVQHLEGGIVKELHARDGD 83
R + ++ I + + G +T K ++ +E IVKE+ ++G+
Sbjct: 58 RLVAYFIMGF-LVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGE 116

Query: 84 LVKKGDPLIVLDESQLSAEYESTRNQLIVARYKEARLRA-----ERDGLDSIPSVIMEGT 138
V+KGD L+ L A+ T++ L+ AR ++ R + E + L +
Sbjct: 117 SVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYF 176

Query: 139 DSDRAQEALAGEQQVFKARHDSLLGEISVNRERIQQMQQQIAGLNDMIRTKAGLNKSYSG 198
+ +E L + K + + + + + + + + I L++
Sbjct: 177 QNVSEEEVLR-LTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKS 235

Query: 199 EIKQLKELLAEGFVDNQRLLEQERKLDMLKTEVADHQSSITKTKLQIGETELQIVQLKKK 258
+ LL + + +LEQE K E+ ++S + + + +I + + + +
Sbjct: 236 RLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQL 295

Query: 259 FDADVAKELSDVQAQVFDLQEKEAALRDRLSRVVIRAPESGMVLDMKVHTIGGVVSAATP 318
F ++ +L + L + A +R VIRAP S V +KVHT GGVV+ A
Sbjct: 296 FKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET 355

Query: 319 LLDIVPAQSDLVVEAKVAPRDIDRLELGKTADVRFSAFNQATTPVIEGKLTRISADSLVE 378
L+ IVP L V A V +DI + +G+ A ++ AF + GK+ I+ D++ +
Sbjct: 356 LMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIED 415

Query: 379 ERSGDQYYLVRVKVTEDGMKKLGNRKLQPGMPAEVLINAGDRTMLQYLLKPARNMFAESL 438
+R G + ++ N L GM I G R+++ YLL P ESL
Sbjct: 416 QRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVISYLLSPLEESVTESL 475

Query: 439 IE 440
E
Sbjct: 476 RE 477


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_1799  NUCEPIMERASE  110  8e-30  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 110 bits (277), Expect = 8e-30
Identities = 73/354 (20%), Positives = 122/354 (34%), Gaps = 65/354 (18%)

Query: 13 MKAIVTGITGQDGAYLAELLLEKGYTVYG-----TYRRTSSVNFWRIEELGIHTNPNLHL 67
MK +VTG G G ++++ LLE G+ V G Y S + R+E L P
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVS-LKQARLELLA---QPGFQF 56

Query: 68 VEYDLTDLSASIRLLQTTEATEVYNLAAQSFVGVSFEQPLTTAEITGLGAVNLLEAIRIV 127
+ DL D L + V+ + V S E P A+ G +N+LE R
Sbjct: 57 HKIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHN 116

Query: 128 NPKARFYQASTSEMFGKVQEIPQVETTPF-YPRSPYGVAKLYAHWMTINYRESYNLFATS 186
+ Y AS+S ++G +++P +P S Y K M Y Y L AT
Sbjct: 117 KIQHLLY-ASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATG 175

Query: 187 GILFNHESPLRGRE-----FVTRKITDSVAKIKLGLLDKLELGNLDAKRDWGFAKEYVEG 241
F P GR T+ + + + I + KRD+ + + E
Sbjct: 176 LRFFTVYGP-WGRPDMALFKFTKAMLEGKS-IDV-------YNYGKMKRDFTYIDDIAEA 226

Query: 242 MWRMLQADEPDT-------------------FVLATNRTETVRDFVTMAFKAAGIEINWS 282
+ R+ + + + + D++ A GIE
Sbjct: 227 IIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAK-- 284

Query: 283 GKDEAEQGTCAASGKVLVAINPKFYRPAEVELLIGNPAKAKDVLGWEPKTNLEE 336
N +P +V + +V+G+ P+T +++
Sbjct: 285 -------------------KNMLPLQPGDVLETSADTKALYEVIGFTPETTVKD 319


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_1800  NUCEPIMERASE  93  5e-24  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 93.3 bits (232), Expect = 5e-24
Identities = 56/237 (23%), Positives = 96/237 (40%), Gaps = 25/237 (10%)

Query: 7 RALITGVHGFTGRYMAAELRAGGYEVFGTGS--------------QPLPAADYR--QVDL 50
+ L+TG GF G +++ L G++V G + + L ++ ++DL
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 51 TDGQGLRALLAELQPDVVVHLAAIAFVGHGTAD--AFYKVNLIGTRNLLEAIAACGKTPE 108
D +G+ L A + V V + + A+ NL G N+LE +
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNK--IQ 119

Query: 109 CVLIASSANVYG-NVSEGMLGEQTPPAPANDYAVSKLAMEYMARLWCD--RLPIVITRPF 165
+L ASS++VYG N + + P + YA +K A E MA + LP R F
Sbjct: 120 HLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFF 179

Query: 166 NYTGVGQAENFLLPKIVSHFRRKADTIEL-GNLDVWRDFSDVRAVVQAYRGLIEARP 221
G + L K + +I++ + RDF+ + + +A L + P
Sbjct: 180 TVYGPWGRPDMALFKFTKAM-LEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIP 235


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_1803  NUCEPIMERASE  74  6e-17  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 73.7 bits (181), Expect = 6e-17
Identities = 62/343 (18%), Positives = 118/343 (34%), Gaps = 42/343 (12%)

Query: 5 TILVTGASGFVGGALCRQLATLGSFAI-------------RAASRDLGGASVAGIQAVTV 51
LVTGA+GF+G + ++L G + + A +L + +
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 52 ADLSATTDWARALSGVDLVVHAAARVHVMKETASDSL---AEFRRVNVDGTLNLARQAAA 108
AD TD A + V + R+ V SL + N+ G LN+
Sbjct: 62 ADREGMTD-LFASGHFERVFISPHRLAV-----RYSLENPHAYADSNLTGFLNILEGCRH 115

Query: 109 AGVRRFIFISSIKVNGESSQPGQPLRADDSPA-PQDAYGVSKHEAEQGLRQLAAATGMEV 167
++ ++ SS V G + + P DDS P Y +K E + G+
Sbjct: 116 NKIQHLLYASSSSVYGLNRKM--PFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPA 173

Query: 168 VVIRPVLVYGPGVKAN--FHSMMRWLQRGVPLP-FGAVCNRRSLVSLANLVDLVVTCIDH 224
+R VYGP + + + + G + + +R + ++ + ++ D
Sbjct: 174 TGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDV 233

Query: 225 PRAANQTFLASDGDDVSLTQLLRALGLALGRPARLLPVPAGLLRGAVLLIGRRDLAQRLF 284
A+ + G + R + P L+ L +G A++
Sbjct: 234 IPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALED----ALGIE--AKKNM 287

Query: 285 GTLQ--------VDIEKNRQLLGWYPPCTLEQGLNMTARSFLG 319
LQ D + +++G+ P T++ G+ +
Sbjct: 288 LPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRD 330


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_1805  NUCEPIMERASE  53  1e-09  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 53.2 bits (128), Expect = 1e-09
Identities = 46/248 (18%), Positives = 91/248 (36%), Gaps = 38/248 (15%)

Query: 323 TVLVTGAGGSIGSELCRQIIGLGPTTLLLFDHSEYNLYAILSELEQRVARESLPVRLLPI 382
LVTGA G IG + ++++ G + + N Y +S + R+ + P
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGI---DNLNDYYDVSLKQARLELLAQP-GFQFH 57

Query: 383 LGSVRNQQHLAHVMSTWRVATVYHAAAYKHVPMVEHNMAEGILNNVFGTLCTAQAALQSG 442
+ +++ + + ++ V+ + V N +N+ G L + +
Sbjct: 58 KIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNK 117

Query: 443 VANFVLIST---------------DKAVRPTNVMGGTKRLSELILQALSREAAPVMYGDS 487
+ + + S+ D P ++ TK+ +EL+ S +YG
Sbjct: 118 IQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSH-----LYG-- 170

Query: 488 SKIARVNKTRFTMVRFGNVLGSSGS---VIPLFHKQIKAGGPITV-THPKITRYFMTIPE 543
T +RF V G G + F K + G I V + K+ R F I +
Sbjct: 171 --------LPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDD 222

Query: 544 AAQLVVQA 551
A+ +++
Sbjct: 223 IAEAIIRL 230


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_1810  ANTHRAXTOXNA  31  0.005  Anthrax toxin LF subunit signature.
		>ANTHRAXTOXNA#Anthrax toxin LF subunit signature.

Length = 800

Score = 30.9 bits (69), Expect = 0.005
Identities = 20/56 (35%), Positives = 31/56 (55%), Gaps = 4/56 (7%)

Query: 88 EIEQLIEQNDALRAE-LERERAERLKLEASLKPRALTPQAHEAFKALAGELKAKTL 142
E++ E L+ E +E++R + LK E +LK L P+ +AFK +A EL L
Sbjct: 275 GFEKISES---LKKEGVEKDRIDVLKGEKALKASGLVPEHADAFKKIARELNTYIL 327


20  PP_1826  PP_1832  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
PP_1826  0  17  3.005001  isochorismatase superfamily hydrolase
PP_1827  0  17  3.359891  N5-glutamine S-adenosyl-L-methionine-dependent
PP_1828  2  15  3.115649  hypothetical protein
PP_1829  2  15  3.529075  alpha/beta hydrolase
PP_1830  1  12  2.192412  chorismate synthase
PP_1831  3  16  2.450020  major facilitator superfamily protein
PP_1832  4  22  1.535766  oxidase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_1826  ISCHRISMTASE  55  2e-11  Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 55.0 bits (132), Expect = 2e-11
Identities = 43/167 (25%), Positives = 68/167 (40%), Gaps = 19/167 (11%)

Query: 11 TGRDYPPAKL------SHASLIIIDAQKEYLSG-PLKLSGMDEAVANIARLLDAARKSGR 63
T D P K+ + A L+I D Q ++ S + E ANI +L + + G
Sbjct: 13 TASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANIRKLKNQCVQLGI 72

Query: 64 PIIHVRHLGTV-----GGRFDPQGPA-------GQFIPGLEPLEGEIVIEKRMPNAFKNT 111
P+++ G+ D GP + I L P + ++V+ K +AFK T
Sbjct: 73 PVVYTAQPGSQNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLVLTKWRYSAFKRT 132

Query: 112 KLHETLQELGHLDLIVCGFMSHSSVSTTVRRAKDYGYRCTLVEDASA 158
L E +++ G LI+ G +H T A + V DA A
Sbjct: 133 NLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVA 179


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_1831  TCRTETA  34  9e-04  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 34.0 bits (78), Expect = 9e-04
Identities = 65/338 (19%), Positives = 115/338 (34%), Gaps = 31/338 (9%)

Query: 25 PFLALYFDHLGFSPARIGELVAIPMLMRCVAPNLWGWLGDRTGQRLLIVRLGALCTLATF 84
P L H A G L+A+ LM+ + G L DR G+R +++
Sbjct: 29 PGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLL----------V 78

Query: 85 ALIFFGKSYAWLALVMALHAFFWHAVLPQFEVIT----LAHLHGQTSRYSQVRLWG---- 136
+L YA +A L + ++ T A++ T + R +G
Sbjct: 79 SLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGFMSA 138

Query: 137 --SIGFILTVVGLGRLFEW-LSLDIYPVALVTIMAGIVAASLWVPNAQ-----PVEQGGR 188
G + V G + + + A + + + +P + P+ +
Sbjct: 139 CFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGC-FLLPESHKGERRPLRREAL 197

Query: 189 SGTGGFLQQLRAPGVIAFYVCVALMQLSHGPYYTFLTLHLEH-LGYTRGAIGL-LWALGV 246
+ F V A +MQL + E + IG+ L A G+
Sbjct: 198 NPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGI 257

Query: 247 VAEVLMFMVMSRLFARFSVQRVLLVSFLLAALRWLLLGNLAQESAVLIFAQLLHAATFGC 306
+ + M+ + AR +R L++ + ++LL + LL A+ G
Sbjct: 258 LHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLL--ASGGI 315

Query: 307 FHAASIAFVQASFGARQQGQGQALYAALSGTGGALGAL 344
A A + +QGQ Q AAL+ +G L
Sbjct: 316 GMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPL 353


21  PP_1847  PP_1860  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
PP_1847  2  14  2.163624  TonB-dependent siderophore receptor
PP_1848  2  17  2.650967  hypothetical protein
PP_1849  1  16  2.756648  hypothetical protein
PP_1850  2  15  2.720053  major facilitator superfamily protein
PP_1851  2  16  2.186463  hypothetical protein
PP_1852  3  18  1.973138  3-ketoacyl-ACP reductase
PP_1853  1  16  0.723331  LysR family transcriptional regulator
PP_1854  0  12  0.101547  hypothetical protein
PP_1855  2  12  -0.339197  hypothetical protein
PP_1856  0  12  1.164064  hypothetical protein
PP_1857  0  13  1.311585  hypothetical protein
PP_1858  -1  16  0.676487  elongation factor P
PP_1859  1  17  1.728831  OsmC family protein
PP_1860  2  15  -0.160801  MarR family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_1850  TCRTETA  43  1e-06  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 42.9 bits (101), Expect = 1e-06
Identities = 71/370 (19%), Positives = 129/370 (34%), Gaps = 16/370 (4%)

Query: 30 PLLHSIAQQFGLSTASAGSIVIAAQLSYGAGLLLLAPLG----DLFEQRRLITMMTVVST 85
P+L + + S I L Y AP+ D F +R ++ + +
Sbjct: 26 PVLPGLLRDLVHSNDVTAHYGILLAL-YALMQFACAPVLGALSDRFGRRPVLLVSLAGAA 84

Query: 86 LGLVISACAPSLPWLLLGTALTGLFSVVAQILVPMAATLSEPHQRGRAVGTLMSGLLLGI 145
+ I A AP L L +G + G+ + A +++ +R R G + + G+
Sbjct: 85 VDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGM 144

Query: 146 LLARTAAGFMAELGGWRSIYVLAAALMALTALALYRSLPQHHSHAGLKYPALIGSVFRLF 205
+ G M + + AAAL L L LP+ H + F
Sbjct: 145 VAGPVLGGLMGGFSP-HAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASF 203

Query: 206 IEEPVLRLRSLLGLLAFSLFALFWTPLAF--LLANGPYHYSDAVIGL-FGLAGAAGAL-S 261
+ + + L + F + + P A + +H+ IG+ G +L
Sbjct: 204 RWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQ 263

Query: 262 ANWAGRLADRGKGSLGTTVGLVVLLLSWVPLGFAEQSLLALLLGVLMLDLAVQLVHVSNQ 321
A G +A R +G++ ++ L FA + +A + VL+ + + +
Sbjct: 264 AMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQAM 323

Query: 322 NAVIALRPEARTRLNAGYITCYFIGGALGSLLGTQLF-----QRQGWMGIVVAGLVIGGL 376
+ E + +L + +G LL T ++ GW I A L + L
Sbjct: 324 LSRQV-DEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASITTWNGWAWIAGAALYLLCL 382

Query: 377 GLLVWGLAER 386
L GL
Sbjct: 383 PALRRGLWSG 392


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_1852  DHBDHDRGNASE  125  4e-37  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 125 bits (315), Expect = 4e-37
Identities = 84/256 (32%), Positives = 128/256 (50%), Gaps = 21/256 (8%)

Query: 7 LEGKVALVQGGSRGIGAAIVRRLAREGAQVAFTYVSSAGPAEELAREITENGGKALALRA 66
+EGK+A + G ++GIG A+ R LA +GA +A + E++ + A A A
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIA-AVDYNPEKLEKVVSSLKAEARHAEAFPA 64

Query: 67 DSADAAAVQLAVDDTEKALGRLDILVNNAGVLAVAPVTEFDLADFDHMLAVNVRSVFVAS 126
D D+AA+ E+ +G +DILVN AGVL + +++ +VN VF AS
Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNAS 124

Query: 127 QAAARYM--GQGGRIINIGSTNAERMPFAGGAPYAMSKSALVGLTRGMARDLGPQGITVN 184
++ ++YM + G I+ +GS N +P A YA SK+A V T+ + +L I N
Sbjct: 125 RSVSKYMMDRRSGSIVTVGS-NPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183

Query: 185 NVQPGPVDTDMN--------------PASGEFAESLIPLMAIGRYGEPEEIASFVAYLAG 230
V PG +TDM S E ++ IPL + +P +IA V +L
Sbjct: 184 IVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPL---KKLAKPSDIADAVLFLVS 240

Query: 231 PEAGYITGASLTVDGG 246
+AG+IT +L VDGG
Sbjct: 241 GQAGHITMHNLCVDGG 256


22  PP_1897  PP_1905  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
PP_1897  0  17  3.672537  DNA internalization-related competence protein
PP_1898  -1  16  2.475638  MotA/TolQ/ExbB proton channel
PP_1899  0  16  2.399299  biopolymer transport protein ExbD/TolR
PP_1900  2  19  2.894886  tetraacyldisaccharide 4'-kinase
PP_1901  2  17  2.385915  hypothetical protein
PP_1902  2  15  2.088720  3-deoxy-manno-octulosonate cytidylyltransferase
PP_1903  3  16  1.744765  protein tyrosine phosphatase
PP_1904  3  15  0.969138  UDP-N-acetylenolpyruvoylglucosamine reductase
PP_1905  5  17  0.949634  Rne/Rng family ribonuclease
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
PP_1905   IGASERPTASE  68     2e-13    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 68.2 bits (166), Expect = 2e-13
Identities = 65/320 (20%), Positives = 107/320 (33%), Gaps = 22/320 (6%)

Query: 732 PREERAPREERAPREERAPREERAPREERAPREERAPREERAPRPPREERQPRAAEEATE 791
P E+ + + + EE A R + AP PP P E TE
Sbjct: 983 PEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIA-RVDEAPVPPPAPATP---SETTE 1038

Query: 792 QAAELAEEQLPGEELLQDEQEGTDGERPRRRSRGQRRRSNRRERQRNANGELIDGSEEEG 851
AE ++++ E ++EQ+ T+ R + + + + Q N + GSE +
Sbjct: 1039 TVAENSKQESKTVE--KNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQS--GSETKE 1094

Query: 852 SEEQPQQHQATELGAELAAGLAVTAAVASSNISSDAEAQANQQAERATAEVAAVAE-TDN 910
++ + AT E A S + Q + + AE A + T N
Sbjct: 1095 TQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVN 1154

Query: 911 SEAAQPVEQVEAATKAQEASVAPAVEQPVTEPVAAAEATAEPVVEVAPQPVAEDAPAVEP 970
+ Q A T+ + VEQPVTE T V P +P
Sbjct: 1155 IKEPQSQTNTTADTEQPAKETSSNVEQPVTE-----STTVNTGNSVVENPENTTPATTQP 1209

Query: 971 VVVAETAVTETPAEAPVVEAGEIEQAPAVVEVAPVAAQPAPVVEAQPEVVAEPAPAVVEP 1030
T +E+ + + P VE + A ++ + AV+
Sbjct: 1210 -----TVNSESSNKPKNRHRRSVRSVPHNVE-PATTSSNDRSTVALCDLTSTNTNAVLSD 1263

Query: 1031 APAVVEPATVMLANGRAPND 1050
A A + V L G+A +
Sbjct: 1264 ARA--KAQFVALNVGKAVSQ 1281



Score = 62.0 bits (150), Expect = 1e-11
Identities = 47/300 (15%), Positives = 85/300 (28%), Gaps = 22/300 (7%)

Query: 494 NNQSSYEIAAAEAEEAPQPTATRTLVRQEAAVKTAPARANAPVPAAAEEPQAAAPVAPAP 553
N Y++ E E+ Q + T P A VP+ P +A
Sbjct: 973 NVNGRYDLYNPEVEKRNQTV--------DTTNITTPNNIQADVPS---VPSNNEEIARVD 1021

Query: 554 SAPEPSLFKGLVKSLVSLFAGKDEPAAAPVAPAAEKPAAERSPRNEERRNGRQQSRNRNG 613
AP P S + ++ + E+ A E + +N E + + N
Sbjct: 1022 EAPVPPPAPAT-PSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANT 1080

Query: 614 RRDEERKPREERAERAPREERAPREERAPREERAPREERAPREERAPREERAPRQPREDR 673
+ +E + E E E + EE+A E +E + +P+Q +
Sbjct: 1081 QTNEVAQSGSETKETQTTETKETATVEK--EEKAKVETEKTQEVPKVTSQVSPKQ-EQSE 1137

Query: 674 RGNRGEERVRELREPLDATPPAEREERQPREERVAREERA---PREERAPREERAPREER 730
E RE ++ P + E+ A+E + +
Sbjct: 1138 TVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVE 1197

Query: 731 APREERAPREERAPREERAPREERAPREERAPREERAPREERAPRPPREERQPRAAEEAT 790
P + E + + + R P +R A + T
Sbjct: 1198 NPENTTPATTQPTVNSESSNKPKNRHRRSVRSV----PHNVEPATTSSNDRSTVALCDLT 1253



Score = 61.6 bits (149), Expect = 2e-11
Identities = 42/286 (14%), Positives = 86/286 (30%), Gaps = 24/286 (8%)

Query: 676 NRGEERVRELREPLDATPPAEREERQP----REERVAREERAPREERAPREERAPREERA 731
N E+ + + + T P + P E +AR + AP AP E A
Sbjct: 982 NPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVA 1041

Query: 732 --PREERAPREERAPREERAPREERAPREERAPREERAPREERAPRPPRE--ERQPRAAE 787
++E E+ + R +E + + + E E Q +
Sbjct: 1042 ENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETK 1101

Query: 788 EATEQAAELAEEQLPGEELLQDEQEGTDGERPRRRSRGQRRRSNRRERQRNANGELIDGS 847
E E E+ E Q+ + T P++ + R+ + + +
Sbjct: 1102 ETATVEKE--EKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQ 1159

Query: 848 EEEGSE---EQPQQHQATELGAELAAGLAVTAAVASSNISSDAEAQANQQAERATAEVAA 904
+ + EQP + ++ + + V ++ + + AT +
Sbjct: 1160 SQTNTTADTEQPAKETSSNVEQPVTESTTVNT--------GNSVVENPENTTPATTQPTV 1211

Query: 905 VAETDNSEAAQPVEQVEAATKAQEASVAPAVEQPVTEPVAAAEATA 950
+E+ N + V + E VA + T+
Sbjct: 1212 NSESSNKPKNRHRRSVRSVPHNVE---PATTSSNDRSTVALCDLTS 1254



Score = 40.4 bits (94), Expect = 5e-05
Identities = 32/197 (16%), Positives = 55/197 (27%), Gaps = 27/197 (13%)

Query: 483 QRLRDDNPEVLNNQSSYEIA--AAEAEEAPQPTATRTLVRQEAAVKT--------APARA 532
+ ++ V N + E+A +E +E Q T T+ E K +
Sbjct: 1067 EVAKEAKSNVKANTQTNEVAQSGSETKET-QTTETKETATVEKEEKAKVETEKTQEVPKV 1125

Query: 533 NAPVPAAAEEPQAAAPVAPAPSAPEPSLFKGLVKSLVSLFAGKDEPAAAPVAPAAEKPAA 592
+ V E+ + P A +P++ EP + A + A
Sbjct: 1126 TSQVSPKQEQSETVQPQAEPARENDPTV-------------NIKEPQSQTNTTADTEQPA 1172

Query: 593 ERSPRNEERRNGRQQSRNRNGRRDEERKPREERAERAPREERAPREERAPREERAPREER 652
+ + N E+ + N G E A P + R R+ R
Sbjct: 1173 KETSSNVEQPVTESTTVNT-GNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSV- 1230

Query: 653 APREERAPREERAPRQP 669
P R
Sbjct: 1231 -PHNVEPATTSSNDRST 1246


23  PP_1915  PP_1963  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
PP_1915   2       25       -5.197727   acyl carrier protein
PP_1916   2       30       -6.687943   3-oxoacyl-ACP synthase
PP_1917   3       36       -8.314103   4-amino-4-deoxychorismate lyase
PP_1918   3       39       -9.191688   aminodeoxychorismate lyase
PP_1920   4       46       -9.833183   hypothetical protein
PP_1921   4       43       -7.537097   hypothetical protein
PP_1922   4       34       -5.905746   hypothetical protein
PP_1923   3       27       -3.289480   hypothetical protein
PP_1924   3       19       -1.665878   phosphinothricin N-acetyltransferase
PP_1925   2       17       -1.384395   monooxygenase
PP_1926   2       18       -3.265873   phosphatase
PP_1927   2       18       -3.825850   arsenical resistance protein ArsH
PP_1928   2       21       -3.839291   arsenate reductase
PP_1929   2       24       -4.504977   arsenite efflux transporter
PP_1930   3       33       -6.916735   arsenic resistance transcriptional regulator
PP_1931   6       52       -12.899452  hypothetical protein
PP_1932   7       53       -12.178641  hypothetical protein
PP_1933   6       53       -12.075033  hypothetical protein
PP_1934   5       55       -12.554272  hypothetical protein
PP_1935   6       57       -12.965915  Cro/CI family transcriptional regulator
PP_1936   6       57       -12.808093  hypothetical protein
PP_1937   4       53       -10.895802  helicase
PP_1938   3       54       -11.669256  hypothetical protein
PP_1940   2       53       -10.864747  methyl-accepting chemotaxis transducer
PP_1941   2       48       -9.016857   hypothetical protein
PP_1942   3       47       -8.659031   LysR family transcriptional regulator
PP_1943   3       42       -7.703291   formyltetrahydrofolate deformylase
PP_1944   4       44       -7.190954   glycine cleavage system protein T
PP_1945   3       47       -7.670316   5,10-methylene-tetrahydrofolate dehydrogenase
PP_1946   3       48       -8.028132   short chain dehydrogenase/reductase
PP_1947   4       48       -8.330125   hypothetical protein
PP_1948   3       45       -7.645246   benzaldehyde dehydrogenase
PP_1949   2       48       -7.832346   GMC family oxidoreductase
PP_1950   1       45       -7.632291   hypothetical protein
PP_1951   1       44       -7.533137   short chain dehydrogenase/reductase
PP_1952   1       42       -7.715082   metallo-beta-lactamase
PP_1953   1       41       -7.521216   short chain dehydrogenase/reductase
PP_1954   3       46       -9.697775   hypothetical protein
PP_1955   4       47       -11.214353  cytochrome P450 family protein
PP_1956   5       49       -12.550993  hypothetical protein
PP_1957   6       48       -12.899946  Pdr/VanB family oxidoreductase
PP_1958   6       50       -13.898460  hypothetical protein
PP_1959   6       52       -14.377958  hypothetical protein
PP_1960   5       44       -11.933368  hypothetical protein
PP_1961   5       39       -10.049270  hypothetical protein
PP_1962   4       34       -8.773262   phage integrase site specific recombinase
PP_1963   2       25       -5.783074   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_1915   ACRIFLAVINRP  25     0.034    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 25.2 bits (55), Expect = 0.034
Identities = 12/46 (26%), Positives = 18/46 (39%), Gaps = 5/46 (10%)

Query: 34 GADSLDTVELVMALEEEFETEIPDEEAEKIT-----TVQAAIDYVK 74
GA++LDT + + A E + P VQ +I V
Sbjct: 296 GANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVV 341


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_1918   PF05616  32     0.004    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 32.4 bits (73), Expect = 0.004
Identities = 17/55 (30%), Positives = 25/55 (45%), Gaps = 3/55 (5%)

Query: 367 QASGADDAQRPEPPAADPSATDEPTAQPAPDEAPAQDA---PAGGAQAAERPTPD 418
Q + D Q P P + + P AQP P+ +PA++ PA RP P+
Sbjct: 300 QGNTTVDVQVIPRPDLTPGSAEAPNAQPLPEVSPAENPANNPAPNENPGTRPNPE 354



Score = 31.3 bits (70), Expect = 0.008
Identities = 17/65 (26%), Positives = 28/65 (43%), Gaps = 3/65 (4%)

Query: 354 PQQEAKPGADDAEQASGADDAQRPEPPAADPSATDEPTAQPAPDEAPAQDA---PAGGAQ 410
P+ + PG+ +A A + E PA +P+ + P +P P+ P + P Q
Sbjct: 311 PRPDLTPGSAEAPNAQPLPEVSPAENPANNPAPNENPGTRPNPEPDPDLNPDANPDTDGQ 370

Query: 411 AAERP 415
RP
Sbjct: 371 PGTRP 375


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_1924   FLGFLIH  31     0.003    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 30.5 bits (68), Expect = 0.003
Identities = 20/73 (27%), Positives = 29/73 (39%)

Query: 1 MHSGIDIRVARPEDAEEIQIIYAPIVLNTAISFEEAVPSVEQMRERISTTLQTYPYLVAV 60
M + + P+D Q + PIV EEA PS+EQ ++ Y +
Sbjct: 1 MSDNLPWKTWTPDDLAPPQAEFVPIVEPEETIIEEAEPSLEQQLAQLQMQAHEQGYQAGI 60

Query: 61 REGRVVGYAYASQ 73
EGR G+ Q
Sbjct: 61 AEGRQQGHKQGYQ 73


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_1926   BACYPHPHTASE  27     0.047    Salmonella/Yersinia modular tyrosine phosphatase si...
		>BACYPHPHTASE#Salmonella/Yersinia modular tyrosine phosphatase signature.

Length = 468

Score = 26.7 bits (58), Expect = 0.047
Identities = 7/20 (35%), Positives = 10/20 (50%)

Query: 113 IHCKGGSGRTGLFAARLLIE 132
IHC+ G GRT + +
Sbjct: 401 IHCRAGVGRTAQLIGAMCMN 420


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_1941   HTHTETR  52     2e-10    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 51.6 bits (123), Expect = 2e-10
Identities = 20/83 (24%), Positives = 34/83 (40%), Gaps = 1/83 (1%)

Query: 10 ATLRAEQQQETREKLLRATIETISYKGYQSATIDNITSHAGTGRATFYLHFRSKPEALMA 69
A ++ QETR+ +L + S +G S ++ I AG R Y HF+ K L +
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDK-SDLFS 60

Query: 70 GWQEIYMPQMVNILQNLDESYPA 92
E+ + + +P
Sbjct: 61 EIWELSESNIGELELEYQAKFPG 83


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_1942   PF05043  32     0.004    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 31.8 bits (72), Expect = 0.004
Identities = 17/54 (31%), Positives = 25/54 (46%)

Query: 8 LNLLLLLDELYREQNLSAAARRLGMSQPMASASLRRLREYFEDQLFLSTGRGMR 61
L LL LL E R + S A L ++ L ++ F D +F S+ G+R
Sbjct: 13 LELLELLFEHKRWFHRSELAELLNCTERAVKDDLSHVKSAFPDLIFHSSTNGIR 66


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_1946   DHBDHDRGNASE  127    1e-37    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 127 bits (319), Expect = 1e-37
Identities = 74/254 (29%), Positives = 112/254 (44%), Gaps = 19/254 (7%)

Query: 9 GKVVLVTGAGSGIGRATALAFAQSGASVAVADISTDHGLKTVELVKAEGGEATFFHVDVG 68
GK+ +TGA GIG A A A GA +A D + + K V +KAE A F DV
Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVR 67

Query: 69 SEPSVQSMLAGVVAHYGGLDIAHNNAGIEANIVPLAELDSDNWRRVIDVNLSSVFYCLKG 128
++ + A + G +DI N AG+ + L + W VN + VF +
Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGV-LRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126

Query: 129 EIPLMLKRGGGAIVNTASASGLIGGYRLSGYTATKHGVVGLTKAAAIDYANQNIRINAVC 188
M+ R G+IV S + ++ Y ++K V TK ++ A NIR N V
Sbjct: 127 VSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186

Query: 189 PGPVDSPFLADMPQPM--------------RDRLLFGTPIGRLATAEEIARSVLWLCSDD 234
PG ++ DM + + G P+ +LA +IA +VL+L S
Sbjct: 187 PGSTET----DMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 235 AKYVVGHSMSVDGG 248
A ++ H++ VDGG
Sbjct: 243 AGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_1948   PF03944  30     0.018    delta endotoxin
		>PF03944#delta endotoxin

Length = 633

Score = 30.4 bits (68), Expect = 0.018
Identities = 14/31 (45%), Positives = 17/31 (54%)

Query: 369 APSDKTGWYVRPTVYTNVNNSMRIAREEIFG 399
AP+D TG+ + P T VNN R E FG
Sbjct: 487 APNDYTGFTISPIHATQVNNQTRTFISEKFG 517


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_1949   PF07520  31     0.010    Virulence protein SrfB
		>PF07520#Virulence protein SrfB

Length = 1041

Score = 31.5 bits (71), Expect = 0.010
Identities = 37/152 (24%), Positives = 53/152 (34%), Gaps = 22/152 (14%)

Query: 15 AGSAGCVLANRLSANPEHSVLLLEAGSRPKGLWASMP---------AGVSRVILPGPTNW 65
A +LA + E+SV L A W +P AG + PGP++W
Sbjct: 68 AERDAPILAATTPEDDEYSVRPLAALEPFLEKWVPIPVLRLKNQRGAGGEELYDPGPSSW 127

Query: 66 AY-----QSEPDPSLAG-RRIYVPRGKALGGSSAINGMAYLRGHREDYDHWVSLGCAGWG 119
A +PDP R+ + AL S Y+ R D +
Sbjct: 128 ARLRTVELPQPDPETGHTHRVQIALDTAL--SDQDQSAHYVAPERADSEKPREFRLV--S 183

Query: 120 WDDVLPFYKKFEHREEGDEAFRGRDGELWVTD 151
+ + F R E DE D +LWV+D
Sbjct: 184 DPGAMSW---FLQRLEADEDGNAVDLQLWVSD 212


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_1951   DHBDHDRGNASE  125    6e-37    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 125 bits (316), Expect = 6e-37
Identities = 76/271 (28%), Positives = 123/271 (45%), Gaps = 26/271 (9%)

Query: 9 VEGARVIVTGAASGLGLAFTEAMAESGAQVAMLDLNREALDAQFRRLRSLGYSVRSHVLD 68
+EG +TGAA G+G A +A GA +A +D N E L+ L++ + D
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 69 VTDRDAVDDTFNAVAAGFGGLDIVFANAGI-DPGPGFAALNAAGEREPANMLEEYSDHRW 127
V D A+D+ + G +DI+ AG+ PG + SD W
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGL----------------IHSLSDEEW 109

Query: 128 RKVISVSLDAVFYSIRAAARHMRANRSGSIIVTTSVSALRPAVTLGAAYAAAKAGAAQLV 187
SV+ VF + R+ +++M RSGSI+ S A P ++ AYA++KA A
Sbjct: 110 EATFSVNSTGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMA-AYASSKAAAVMFT 168

Query: 188 RATALELASDGVRVNAIAPGPFETDIGGGFMHNSEVRAKMAA--------GVPMGRIAEV 239
+ LELA +R N ++PG ETD+ + ++ G+P+ ++A+
Sbjct: 169 KCLGLELAEYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKP 228

Query: 240 EEIKPLALYLASKASSFVTGQQFVIDGGLSL 270
+I L+L S + +T +DGG +L
Sbjct: 229 SDIADAVLFLVSGQAGHITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_1953   DHBDHDRGNASE  129    3e-38    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 129 bits (324), Expect = 3e-38
Identities = 81/278 (29%), Positives = 125/278 (44%), Gaps = 43/278 (15%)

Query: 4 QNKIVVLTGAASGIGKATAQLLVEQGAHVVAMDLKSDLLQQAFGSEE----HVLCIPTDV 59
+ KI +TGAA GIG+A A+ L QGAH+ A+D + L++ S + H P DV
Sbjct: 7 EGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADV 66

Query: 60 SDSEAVRAAFQAVDAKFGRVDVIINAAGINAPTREANQKMVDANVAALDAMKSGRAPTFD 119
DS A+ ++ + G +D+++N AG+ + ++
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGV-------------LRPGLIHSL--------- 104

Query: 120 FLADTSDQDFRRVMEVNLFSQFYCIREGVPLMRRAGGGSIVNISSVAALLGVAMPLYYPA 179
SD+++ VN F R M GSIV + S A + Y +
Sbjct: 105 -----SDEEWEATFSVNSTGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYAS 159

Query: 180 SKAAVLGLTRAAAAELAPYNIRVNAIAPGSVDTPL-----MHEQPPEVV------QFLVS 228
SKAA + T+ ELA YNIR N ++PGS +T + E E V F
Sbjct: 160 SKAAAVMFTKCLGLELAEYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTG 219

Query: 229 MQPIKRLAQPEELAQSILFLAGEHSSFITGQTLSPNGG 266
+ P+K+LA+P ++A ++LFL + IT L +GG
Sbjct: 220 I-PLKKLAKPSDIADAVLFLVSGQAGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_1961   HTHTETR  29     0.016    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 29.2 bits (65), Expect = 0.016
Identities = 8/44 (18%), Positives = 24/44 (54%)

Query: 5 DTRRRLIELIAEHHASHPGKKLGIEELSRRAGISRQSFNRYYKD 48
+TR+ ++++ + + E+++ AG++R + ++KD
Sbjct: 11 ETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKD 54


24  PP_2020  PP_2031  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
PP_2020   4       15       2.789335    hypothetical protein
PP_2021   4       16       2.756612    hypothetical protein
PP_2022   5       17       3.292451    hypothetical protein
PP_2023   6       20       3.986879    glutathione S-transferase
PP_2024   6       22       4.232961    SMC domain-containing protein
PP_2025   4       23       4.783112    nuclease SbcCD subunit D
PP_2026   5       23       4.885860    hypothetical protein
PP_2027   4       23       4.886652    hypothetical protein
PP_2028   3       17       4.544787    hypothetical protein
PP_2029   0       13       4.054654    von Willebrand factor A
PP_2030   0       11       3.692377    hypothetical protein
PP_2031   -1      11       3.292689    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
PP_2024   GPOSANCHOR  49     1e-07    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 48.5 bits (115), Expect = 1e-07
Identities = 54/356 (15%), Positives = 123/356 (34%), Gaps = 12/356 (3%)

Query: 708 RRLDEEISQDEKRQSALLALQRDAARLNQQLQAAHDAQQQAQRHLEQQHQALANDEQLLQ 767
L + S AL + + ++ + Q L + L+
Sbjct: 67 NTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLE 126

Query: 768 QGLNDLAGVLPEEALKALNDDPANAFLALDQQIAQRLQQLEQRKDELEEQQARQTQLDKL 827
+ L A + A++ + + + A ++ L
Sbjct: 127 KALEGA-----MNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTL 181

Query: 828 RDQQQARVQGQQQLQQKLAALDEQRQQALASLAELLGEHASAEAWQQHMDTALEHARTLD 887
++ A Q +L++ L A + L E A+ A + ++ ALE A
Sbjct: 182 EAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFS 241

Query: 888 ADTAQRLQDLRTQGVQLASELKANTQQQQALDAECQQLQAQIAQWRSEHPELDD--AGLD 945
+ +++ L + L + + + A+I +E L+ A L+
Sbjct: 242 TADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLE 301

Query: 946 RLLAMDDAQVNELRQRLQGAEKAIEQGRVLLQEREQRLQHHAAQMT-----VDTTVQALE 1000
+ +A LR+ L + +A +Q Q+ E++ + A +D + +A +
Sbjct: 302 HQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKK 361

Query: 1001 QALAELRERLANHEQQCAELRAQQADDQRRQQAHQALAAEIEQAHQQWQRWARLNA 1056
Q AE ++ ++ A ++ + D ++A + + +E+A+ + +LN
Sbjct: 362 QLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQVEKALEEANSKLAALEKLNK 417



Score = 42.4 bits (99), Expect = 1e-05
Identities = 46/306 (15%), Positives = 110/306 (35%), Gaps = 15/306 (4%)

Query: 637 QKQVETLNSKLVELRTQLGVVNAQLKDFQQQQQRLGEQLQPLVAQVQAHSLWP---ALAP 693
+K +E + ++ + A+ ++ L + L+ + A S
Sbjct: 126 EKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEK 185

Query: 694 QDDKARSAWLDSQLRRLDEEISQDEKRQSALLALQRDAARLNQQLQAAHDAQQQAQRHLE 753
+AR A L+ L + D + L A + A L+ A +
Sbjct: 186 AALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADS 245

Query: 754 QQHQALANDEQLLQQGLNDLAGVLPEEALKALNDDPANAFLALDQQIAQRLQQLEQRKDE 813
+ + L ++ L+ +L E+AL+ + + A++ ++ D
Sbjct: 246 AKIKTLEAEKAALEARQAEL-----EKALEGAMNFSTADSAKIKTLEAEKAALEAEKADL 300

Query: 814 LEEQQARQTQLDKLRDQQQARVQGQQQLQQKLAALDEQRQQALASLAELLGEHASAEAWQ 873
+ Q LR A + ++QL+ + L+EQ + + AS L + ++ +
Sbjct: 301 EHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAK 360

Query: 874 QHMDTALEHARTLDADTAQRLQDLRTQGVQLASELKANTQQQQALDAECQQLQAQIAQWR 933
+ ++ + + + Q LR L A+ + ++ ++ ++ +++A
Sbjct: 361 KQLEAEHQKLEEQNKISEASRQSLRRD-------LDASREAKKQVEKALEEANSKLAALE 413

Query: 934 SEHPEL 939
+ EL
Sbjct: 414 KLNKEL 419



Score = 38.5 bits (89), Expect = 2e-04
Identities = 46/294 (15%), Positives = 94/294 (31%), Gaps = 14/294 (4%)

Query: 314 RQALSAQLAPVAAKIAEQQQQQAELQVRTRELEQALDTARQALADRQAEHGENAPRLRQA 373
L + + ++ + EL ++ L ++L+++ ++ E R
Sbjct: 66 NNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADL 125

Query: 374 FAAQDTLARLDQELAAQRSISQQAQQQVADGQQQLQQ-----LEDNQQRSVQQLALIDTA 428
A + +A+ + + +A + L++ + + S + L
Sbjct: 126 EKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEK 185

Query: 429 LADSQHLAGLANAWHAYLPQLKQVMLIGGRLTKGREELPGLQAQASQANARLQAERDAYD 488
A A L A + L + L +A +A A
Sbjct: 186 AALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADS 245

Query: 489 LLFREAKAEPQALAEQIDLLGGMLQDNRKQQRAVEEMSRLHGREQELRQQLDALRERQQQ 548
+ +AE AL + L + + A+ + + + L + AL +
Sbjct: 246 AKIKTLEAEKAALEARQAEL------EKALEGAMNFSTADSAKIKTLEAEKAALEAEKAD 299

Query: 549 AMLQRQQLITEGTAAKAELEAAEQA---LTLTRQLLERQRLARNTSVEELRNQL 599
Q Q L + + +L+A+ +A L Q LE Q S + LR L
Sbjct: 300 LEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDL 353



Score = 38.5 bits (89), Expect = 2e-04
Identities = 50/313 (15%), Positives = 98/313 (31%), Gaps = 16/313 (5%)

Query: 464 EELPGLQAQASQANARLQAERDAYDLLFREAKAEPQALAEQIDLLGGMLQDNRKQQRAVE 523
+ L + S A +L+ + + + A+ L G + + ++
Sbjct: 85 DHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIK 144

Query: 524 EMSRLHGREQELRQQLDALRERQQQAMLQRQQLITEGTAAKAELEAAEQALTLTRQLLER 583
+ + L+ E I A KA LEA + L +
Sbjct: 145 TLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMN 204

Query: 584 QRLARNTSVEELRNQLRDGEPCPVCGSAEHPFHQPEALLQSLGRHDQAEEHAAQKQVETL 643
A + ++ L + +A L+ A +++TL
Sbjct: 205 FSTADSAKIKTLEAEKAALA-------------ARKADLEKALEGAMNFSTADSAKIKTL 251

Query: 644 NSKLVELRTQLGVVNAQLKDFQQQQQRLGEQLQPLVAQVQAHSLWPALAPQDDKARSAWL 703
++ L + + L+ +++ L A+ A A + +A
Sbjct: 252 EAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANR 311

Query: 704 DSQLRRLD---EEISQDEKRQSALLALQRDAARLNQQLQAAHDAQQQAQRHLEQQHQALA 760
S R LD E Q E L + + Q L+ DA ++A++ LE +HQ L
Sbjct: 312 QSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLE 371

Query: 761 NDEQLLQQGLNDL 773
++ + L
Sbjct: 372 EQNKISEASRQSL 384



Score = 35.4 bits (81), Expect = 0.002
Identities = 38/289 (13%), Positives = 87/289 (30%), Gaps = 12/289 (4%)

Query: 315 QALSAQLAPVAAKIAEQQQQQAELQVRTRELEQALDTARQALADRQAEHGENAPRLRQAF 374
+ + + + + + ++L + L+ D + L++ + + +N L +
Sbjct: 53 EKVQERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKA 112

Query: 375 AAQDTLARLDQELAAQRSISQQAQQQVADGQQQLQQLEDNQQRSVQQLALIDTALADSQH 434
+ L +L + + + L+ + L +
Sbjct: 113 SKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADL---------EKA 163

Query: 435 LAGLANAWHAYLPQLKQVMLIGGRLTKGREELPGLQAQASQANARLQAERDAYDLLFREA 494
L G N A ++K + L + EL A + A+ +
Sbjct: 164 LEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAAL 223

Query: 495 KAEPQALAEQIDLLGGMLQDNRKQQRAVEEMSRLHGREQELRQQLDALRERQQQAMLQRQ 554
A L + ++ + + + +E + + +L+ E
Sbjct: 224 AARKADLEKALEGAMNFSTADSAKIKTLEA---EKAALEARQAELEKALEGAMNFSTADS 280

Query: 555 QLITEGTAAKAELEAAEQALTLTRQLLERQRLARNTSVEELRNQLRDGE 603
I A KA LEA + L Q+L R + ++ R + E
Sbjct: 281 AKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLE 329



Score = 32.3 bits (73), Expect = 0.013
Identities = 53/310 (17%), Positives = 94/310 (30%), Gaps = 21/310 (6%)

Query: 202 QRAFSKAREAGEAHNALKERASHLLPMAAEARAELDQRLEQAQQQFKADQAGERQLEQQR 261
A S + EA A L A E + +A++A LE ++
Sbjct: 136 STADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKA---ALEARQ 192

Query: 262 NWLNEQRQLQAQHTEASTTLQAAELDWQQLAEPRLDLARLERLAPQRHQFHRRQALSAQL 321
L + + + A + + R +
Sbjct: 193 AELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLE 252

Query: 322 APVAAKIAEQQQQQAELQVRTRELEQALDTARQALADRQAEHGENAPRLRQAFAAQDTLA 381
A AA A Q + + L+ + A++ A E A Q+
Sbjct: 253 AEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQ 312

Query: 382 RLDQELAAQRSISQQAQQQVADGQQQLQQLEDNQQRSVQQLALIDTALADSQHLAGLANA 441
L ++L A R + + + Q+LE+ + S + L S+ A
Sbjct: 313 SLRRDLDASR-------EAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEA 365

Query: 442 WHAYLPQLKQVML-----IGGRLTKGREELPGLQAQASQANARLQA------ERDAYDLL 490
H L + ++ + L RE ++ +AN++L A E + L
Sbjct: 366 EHQKLEEQNKISEASRQSLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEESKKL 425

Query: 491 FREAKAEPQA 500
+ KAE QA
Sbjct: 426 TEKEKAELQA 435


25  PP_2058  PP_2068  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
PP_2058   2       24       -3.715237   porin
PP_2059   1       24       -1.293724   CsbD family protein
PP_2060   1       20       0.570957    transcriptional regulator SoxR
PP_2061   1       21       0.954276    hypothetical protein
PP_2062   1       19       1.313722    hypothetical protein
PP_2063   2       17       2.926535    hypothetical protein
PP_2064   1       14       3.438999    RND family efflux transporter MFP subunit
PP_2065   1       13       3.163152    acriflavin resistance protein
PP_2066   1       12       3.177710    GntR family transcriptional regulator
PP_2067   0       13       3.027609    EmrB/QacA family drug resistance transporter
PP_2068   1       14       3.148934    multidrug efflux MFS membrane fusion protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
PP_2064   RTXTOXIND  40     1e-05    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 39.8 bits (93), Expect = 1e-05
Identities = 21/136 (15%), Positives = 53/136 (38%), Gaps = 2/136 (1%)

Query: 57 ASGELEAVNQVQ-VAAEMPGRITRIAFESGQTVAAGQLLVQLNDAPEQALRVQLQARLRN 115
A+G+L + + + + I + G++V G +L++L +A ++ Q+ L
Sbjct: 86 ANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQ 145

Query: 116 ADVVLQRSRKL-RAMNAVSQELLDNAATAVDVARGELQHVEALIAQKAIRAPFAGKLGIR 174
A + R + L R++ L E + + K + + + +
Sbjct: 146 ARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQK 205

Query: 175 RVHQGQYLAAGETIVS 190
++ + A T+++
Sbjct: 206 ELNLDKKRAERLTVLA 221



Score = 31.0 bits (70), Expect = 0.007
Identities = 27/135 (20%), Positives = 47/135 (34%), Gaps = 13/135 (9%)

Query: 91 GQLLVQLNDAPEQALRVQLQARLRNADVVLQRSRKLRAMNAVSQELLDNAATAVDVARGE 150
QL + L + + +L + KLR L E
Sbjct: 269 RVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLL-----------TLE 317

Query: 151 LQHVEALIAQKAIRAPFAGKLGIRRVH-QGQYLAAGETIVSLA-DISQLHVNFALGEQAA 208
L E IRAP + K+ +VH +G + ET++ + + L V + +
Sbjct: 318 LAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDI 377

Query: 209 PEVHAGQVLALTVDA 223
++ GQ + V+A
Sbjct: 378 GFINVGQNAIIKVEA 392


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_2065   ACRIFLAVINRP  778    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 778 bits (2011), Expect = 0.0
Identities = 313/1029 (30%), Positives = 521/1029 (50%), Gaps = 29/1029 (2%)

Query: 5 DVFVRRPVLALVVSSLIILMGLFAMGKLPIRQYPLLESSTITISTEYPGASAELMQGFVT 64
+ F+RRP+ A V++ ++++ G A+ +LP+ QYP + +++S YPGA A+ +Q VT
Sbjct: 3 NFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVT 62

Query: 65 QPITQAVSSVEGIDYLSSSSQQ-GRSLITLRMVLNRDSTQALAETMAKVNQVRYRLPEKA 123
Q I Q ++ ++ + Y+SS+S G ITL D A + K+ LP++
Sbjct: 63 QVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEV 122

Query: 124 YDPVVELSAGDSTAVAYVGFASDS--LSIPELSDYLSRVVEPQFSGIDGVAKVQSFGGQR 181
+ + S+ + GF SD+ + ++SDY++ V+ S ++GV VQ FG Q
Sbjct: 123 QQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY 182

Query: 182 LAMRLWLDSEQMAGRGVTAADVAQAVRANNYQATPGQV------RGQYVLADIQVDTDLT 235
AMR+WLD++ + +T DV ++ N Q GQ+ GQ + A I T
Sbjct: 183 -AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241

Query: 236 RVEDFRELIIR-NDGTDLVRLRDIGTVELSAAATQTSATMDGKPAVHLGLFPTPSGNPLV 294
E+F ++ +R N +VRL+D+ VEL A ++GKPA LG+ N L
Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301

Query: 295 IVEGIRQLLPQIQQTLPPGVYVALAYETARFIDASIHEVLRTLVEAMLIVVLVIWLCLGS 354
+ I+ L ++Q P G+ V Y+T F+ SIHEV++TL EA+++V LV++L L +
Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361

Query: 355 LRSVLIAVVAIPLSMLGAAGLMLMFGFSLNLLTLLAMVLAIGLVVDDAIVVVENVHRHIE 414
+R+ LI +A+P+ +LG ++ FG+S+N LT+ MVLAIGL+VDDAIVVVENV R +
Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421

Query: 415 EGKS-PIAAALAGAREIAGPVIAMTLTLAAVYAPIGLMGGLTGTLFREFALTLAGAVIVS 473
E K P A +I G ++ + + L+AV+ P+ GG TG ++R+F++T+ A+ +S
Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481

Query: 474 GIVALTLSPVMSSLLLQPGQQH-----GAMAAIADRLFGTLSGVYGRVLAYTLAHRWISG 528
+VAL L+P + + LL+P G + F Y + L
Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541

Query: 529 GVALLVCLSLPWLYLLPQRELAPPEDQAAVLTAIKSPQHASLEYAERFALK-LDQVMKSI 587
+ L+ + L+L P EDQ LT I+ P A+ E ++ + D +K+
Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601

Query: 588 AET-----TDTWIINGTDGPAASFGGINLSAWQAR---ERSAAQVQAQLQQAVADIEGSS 639
T A ++L W+ R E SA V + + + I
Sbjct: 602 KANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGF 661

Query: 640 IFAFQVA--SLPGSSGGLPVQMVLRSAQDYPELFQTMEVLKQRARDS-GLFAVVDSDLDY 696
+ F + G++ G +++ ++ + L Q L A V +
Sbjct: 662 VIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGLE 721

Query: 697 NNPVVKVRVDRAKAASLGISMQAIGESLGVLVGEQYLNRFALFGRSYDVIPQSIQDQRLT 756
+ K+ VD+ KA +LG+S+ I +++ +G Y+N F GR + Q+ R+
Sbjct: 722 DTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRML 781

Query: 757 PAALSRQYVRAEDGSLVPLATLVRLDIEVAPNRLLQFDQQNASTLQAIPAPGVSMGNAVA 816
P + + YVR+ +G +VP + RL +++ + +Q APG S G+A+A
Sbjct: 782 PEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAMA 841

Query: 817 FLEQLTAELPPGFSHDWQSESRQYVQEGFALMWAFLAALVVIYLVLAAQYESLVDPLIIL 876
+E L ++LP G +DW S Q G + VV++L LAA YES P+ ++
Sbjct: 842 LMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVM 901

Query: 877 VTVPLSICGALLPLALGWATLNIYTQIGLVTLIGLISKHGILMVAFANEIQVRDNLDRAA 936
+ VPL I G LL L ++Y +GL+T IGL +K+ IL+V FA ++ ++
Sbjct: 902 LVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVE 961

Query: 937 AIVRAAQIRLRPVLMTTAAMTFGVLPLLFASGAGANSRFGLGVVIVCGMLVGTLFTLFVL 996
A + A ++RLRP+LMT+ A GVLPL ++GAG+ ++ +G+ ++ GM+ TL +F +
Sbjct: 962 ATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFV 1021

Query: 997 PTIYAWLAR 1005
P + + R
Sbjct: 1022 PVFFVVIRR 1030


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_2067   TCRTETB  98     7e-24    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 97.6 bits (243), Expect = 7e-24
Identities = 71/394 (18%), Positives = 156/394 (39%), Gaps = 17/394 (4%)

Query: 46 FMAGMNVHVTSAALPEIRGSLGASFEEGSWISTAYLVAEIVMIPLTAWLVDVFSLRRVMW 105
F + +N V + +LP+I +W++TA+++ + + L D ++R++
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 106 TGSLIFLIASVACSWAPN-LEAMIVIRVIQGAAGAVLIPLSFQLIITELPASKMAMGMAL 164
G +I SV + +I+ R IQGA A L ++ +P L
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 165 FSLANSVAQAAGPSIGGWLTDAYSWRWIFYLQLFPGIALLLAIAWSIEAKPMKLELLRKG 224
++ + GP+IGG + W ++ + + I + + + +K
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHF---- 199

Query: 225 DWLGIAAMVIGLGGLQIVLEEGGRLDWFGSPLIVGMSVVAAIALVVFVVTQLFGQRAFIN 284
D GI M +G+ + +L F + + +V+ ++ ++FV F++
Sbjct: 200 DIKGIILMSVGI--VFFML--------FTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVD 249

Query: 285 LRLLGHYNFGVASVAMFIFGAATFGLVFLVPNYLSQLQGFSAHDVGVALIAYGVVQLLL- 343
L + F + + I G V +VP + + S ++G +I G + +++
Sbjct: 250 PGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIF 309

Query: 344 APLMPRLMGWTSAKFMVASGFLIMALGCWLGAGLSADSADNVIIPSTVVRGIGQPFIMVA 403
+ L+ +++ G +++ +L A ++ + V G F
Sbjct: 310 GYIGGILVDRRGPLYVLNIGVTFLSVS-FLTASFLLETTSWFMTIIIVFVLGGLSFTKTV 368

Query: 404 LSVLAVAGLDKREAGSASAVFSMLRNLGGAIGTA 437
+S + + L ++EAG+ ++ + L G A
Sbjct: 369 ISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIA 402


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
PP_2068   RTXTOXIND  127    1e-34    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 127 bits (321), Expect = 1e-34
Identities = 67/428 (15%), Positives = 134/428 (31%), Gaps = 77/428 (17%)

Query: 52 PSPAETEQRPSAKTRRRLAVIASGSLAAITLLAFTSYWFSTGRY---LETTDDAYVRADW 108
P+ E + P ++ R +A + ++AF G+
Sbjct: 43 PAHLELIETPVSRRPRLVAYF----IMGFLVIAFI--LSVLGQVEIVATANGKLTHSGRS 96

Query: 109 VALSPRVAGYVAKVEVADDQPVKAGDVLVRLQNRDYRARLDQARAGVTEAQA-------- 160
+ P V ++ V + + V+ GDVL++L A + ++ + +A+
Sbjct: 97 KEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQIL 156

Query: 161 -------------------------------------ALAAAQASQQVATERIDQQQQAI 183
+ Q + +D+++
Sbjct: 157 SRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAER 216

Query: 184 LQAEAVVRSATAEQRRSELDVQRYRGLVRDDAATVQRLETASAHASQAQAALQGAQAALR 243
L A + R + + + L+ A + +A L+ ++ L
Sbjct: 217 LTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLE 276

Query: 244 QQRSQLAMAKARAAQAEAELQQRAAALARAQAHQQL---------AEQDEQDTVIRAPIT 294
Q S++ AK + Q + E+ +Q +VIRAP++
Sbjct: 277 QIESEILSAKEEYQLVTQLFKNEILD-KLRQTTDNIGLLTLELAKNEERQQASVIRAPVS 335

Query: 295 GVVGQRRVRT-GQYVVPGQPLLAVVPLQQAYVV-ANYKETQLARMRPGQPVEIRVDSFAS 352
V Q +V T G V + L+ +VP V A + + + GQ I+V++F
Sbjct: 336 VKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPY 395

Query: 353 Q---PLHGHVASFSPASGNVFALLPSDNATGNFTKIVQRFPVRILLDKPLDGPQVLPGMS 409
L G V + + + D G ++ L + GM+
Sbjct: 396 TRYGYLVGKVKNINLDA-------IEDQRLGLVFNVIISIEENCLSTG-NKNIPLSSGMA 447

Query: 410 VVSTVDTR 417
V + + T
Sbjct: 448 VTAEIKTG 455


26  PP_2131  PP_2142  Y  N  Y  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias    Product
PP_2131   2       13       -0.761004  ABC transporter ATP-binding protein
PP_2132   2       16       -1.311229  universal stress protein
PP_2133   2       16       -1.538324  hypothetical protein
PP_2134   2       16       -1.296400  ISPpu10, transposase
PP_2135   2       19       -0.990178  hypothetical protein
PP_2136   1       19       -1.001459  multifunctional fatty acid oxidation complex
PP_2137   -1      14       -0.867877  3-ketoacyl-CoA thiolase
PP_2138   2       13       -0.897091  hypothetical protein
PP_2139   3       14       -0.404471  DNA topoisomerase I
PP_2140   3       13       0.820743   hypothetical protein
PP_2141   2       12       1.756953   hypothetical protein
PP_2142   3       14       1.955443   cell division inhibitor-like protein
27  PP_2214  PP_2230  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
PP_2214   2       13       -0.362837  short-chain dehydrogenase/reductase SDR
PP_2215   1       14       -0.961946  acetyl-CoA acetyltransferase
PP_2216   2       17       -2.624955  acyl-CoA dehydrogenase
PP_2217   3       21       -3.605238  enoyl-CoA hydratase
PP_2218   2       27       -4.924115  ISPpu8, transposase
PP_2219   1       25       -4.056602  hypothetical protein
PP_2220   1       24       -3.167600  C4-type zinc finger DksA/TraR family protein
PP_2221   2       22       -2.907642  hypothetical protein
PP_2222   2       18       -1.704718  hypothetical protein
PP_2223   2       15       0.269259   hypothetical protein
PP_2224   4       15       1.232254   hypothetical protein
PP_2225   3       14       1.551045   monovalent cation/H+ antiporter subunit G
PP_2226   3       14       1.575539   monovalent cation/H+ antiporter subunit F
PP_2227   3       14       1.646401   monovalent cation/H+ antiporter subunit E
PP_2228   3       14       1.795705   monovalent cation/H+ antiporter subunit D
PP_2229   2       12       1.899888   monovalent cation/H+ antiporter subunit C
PP_2230   2       12       1.882368   monovalent cation/H+ antiporter subunit A
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_2214   DHBDHDRGNASE  82     9e-21    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.
Length = 261

Score = 82.0 bits (202), Expect = 9e-21
Identities = 61/201 (30%), Positives = 81/201 (40%), Gaps = 17/201 (8%)

Query: 3 IANKHFIVSGAASGLGAATAQMLVEAGAKVMLVDLNAQAVEAKARELGDNARFA---VAD 59
I K ++GAA G+G A A+ L GA + VD N + +E L AR A AD
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 60 ISDEQAAQSAVDAAVSAFGSLHGLVNCAGI--VGAEKVLGKQGPHGLASFAKVINVNLIG 117
+ D A G + LVN AG+ G L + + +VN G
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDE------EWEATFSVNSTG 119

Query: 118 SFNLLRLAAAAMAEGAADESGERGVIINTASIAAYDGQIGQAAYAASKGAIASLTLPAAR 177
FN R + M G I+ S A + AAYA+SK A T
Sbjct: 120 VFNASRSVSKYM------MDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGL 173

Query: 178 ELARFGIRVMTIAPGIFETPM 198
ELA + IR ++PG ET M
Sbjct: 174 ELAEYNIRCNIVSPGSTETDM 194


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_2219   CHANLCOLICIN  27     0.033    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 26.6 bits (58), Expect = 0.033
Identities = 9/20 (45%), Positives = 14/20 (70%)

Query: 35 INGNGNGNGNGNGNGNGNSR 54
+NG +G+G+G G G G S+
Sbjct: 25 LNGTPDGSGSGGGGGKGGSK 44


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_2227   PF05272  28     0.016    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.1 bits (62), Expect = 0.016
Identities = 18/44 (40%), Positives = 23/44 (52%), Gaps = 1/44 (2%)

Query: 76 LLVARGVLRADKQPPRSAFVHIPLALRDPHGLAALSMITTVVPG 119
LL A+ L DKQP + A + I LRD HG + M+ PG
Sbjct: 309 LLAAKPYLPFDKQPGQKAMLGIGEVLRDTHG-CTVQMLPIDKPG 351


28  PP_2264  PP_2316  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
PP_2264   2       14       -1.196860  ABC transporter substrate-binding protein
PP_2265   2       17       -2.117069  bifunctional 5,10-methylene-tetrahydrofolate
PP_2266   2       16       -2.754230  ***DNA-directed RNA polymerase, bacteriophage and
PP_2267   5       21       -4.363451  phage single-stranded DNA-binding protein
PP_2268   4       22       -3.421619  phage endodeoxyribonuclease I
PP_2269   4       22       -3.036941  N-acetylmuramoyl-L-alanine amidase
PP_2270   4       22       -3.132125  DNA primase/helicase
PP_2271   6       23       -2.501241  hypothetical protein
PP_2272   7       27       -6.494334  hypothetical protein
PP_2273   8       29       -7.121223  DNA polymerase
PP_2274   8       33       -8.500357  hypothetical protein
PP_2275   8       34       -8.619570  hypothetical protein
PP_2276   8       32       -7.326776  phage exonuclease
PP_2277   5       30       -6.676221  hypothetical protein
PP_2278   2       25       -2.921752  hypothetical protein
PP_2279   4       22       -2.770751  head-to-tail joining protein
PP_2280   5       23       -2.379787  hypothetical protein
PP_2281   5       21       -2.140343  capsid assembly protein
PP_2282   5       22       -1.878525  minor capsid protein 10
PP_2283   6       25       -1.889904  tail tubular protein A
PP_2284   6       24       -1.816388  tail tubular protein B
PP_2285   5       25       -1.616674  hypothetical protein
PP_2286   5       25       -1.785927  hypothetical protein
PP_2287   4       24       -2.823752  phage internal core protein
PP_2288   3       26       -4.435960  phage tail fiber protein
PP_2289   2       30       -6.599554  hypothetical protein
PP_2290   2       29       -6.952534  hypothetical protein
PP_2291   2       29       -7.133535  hypothetical protein
PP_2292   1       28       -6.480439  hypothetical protein
PP_2293   1       24       -4.819501  DNA maturase B
PP_2294   5       20       -5.041605  hypothetical protein
PP_2295   3       18       -3.526661  antirestriction protein
PP_2296   4       17       -3.560804  hypothetical protein
PP_2297   4       17       -2.618255  integrative genetic element Ppu40, integrase
PP_2298   5       19       -2.638530  *hypothetical protein
PP_2299   5       19       -2.299202  trigger factor
PP_2300   2       15       -1.847544  ATP-dependent Clp protease proteolytic subunit
PP_2301   3       13       -1.797182  ATP-dependent protease ATP-binding subunit ClpX
PP_2302   2       12       -0.933222  ATP-dependent protease La
PP_2303   1       11       -0.139851  histone family protein DNA-binding protein
PP_2304   0       12       0.137662   PpiC-type peptidyl-prolyl cis-trans isomerase
PP_2305   0       14       0.970517   patatin
PP_2306   3       15       1.740416   lipoprotein
PP_2307   2       14       1.812481   CHAD domain-containing protein
PP_2308   0       12       1.612752   acyl-CoA thioesterase
PP_2309   0       10       1.545643   hypothetical protein
PP_2310   -1      9        0.922168   methyl-accepting chemotaxis sensory transducer
PP_2311   2       14       2.498295   TatD family hydrolase
PP_2312   1       10       2.686695   lytic transglycosylase
PP_2313   2       14       2.427057   DoxX family protein
PP_2314   2       14       2.830018   hypothetical protein
PP_2315   0       11       2.291387   transcription elongation factor GreB
PP_2316   2       12       2.735601   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_2279   56KDTSANTIGN  30     0.038    Rickettsia 56kDa type-specific antigen protein sign...
		>56KDTSANTIGN#Rickettsia 56kDa type-specific antigen protein signature.
Length = 533

Score = 29.5 bits (66), Expect = 0.038
Identities = 11/31 (35%), Positives = 16/31 (51%)

Query: 482 QEVQQEQQQQQMQQAMQSGVAPAVQAAGRMM 512
+ QQ+Q Q Q QQA + AA R++
Sbjct: 338 PQAQQQQGQGQQQQAQATAQEAVAAAAVRLL 368


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
PP_2287   TONBPROTEIN  31     0.034    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 30.7 bits (69), Expect = 0.034
Identities = 12/42 (28%), Positives = 15/42 (35%)

Query: 436 EEPKTVDLPPEEPRAPEEPPAAGEPRDEPQEPSVGTPGRKQP 477
EP PP EP EP P + P V + +P
Sbjct: 55 LEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKP 96


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_2302   PF05272  31     0.024    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.8 bits (69), Expect = 0.024
Identities = 13/83 (15%), Positives = 29/83 (34%), Gaps = 6/83 (7%)

Query: 292 DWLVQVPWKAQSKVRLDLAKAEEILDADHYGLEEVKERILEYLAVQKRVKKIRGP----- 346
DW+ W ++ L D+ +++ + V ++ P
Sbjct: 537 DWVKAQQWDEVPRLEKWLVHVLGKTPDDYKPRRLRYLQLVGKYILMGHVARVMEPGCKFD 596

Query: 347 -VLCLVGPPGVGKTSLAESIAAA 368
+ L G G+GK++L ++
Sbjct: 597 YSVVLEGTGGIGKSTLINTLVGL 619


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_2303   DNABINDINGHU  120    1e-39    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 120 bits (303), Expect = 1e-39
Identities = 48/88 (54%), Positives = 64/88 (72%)

Query: 2 NKSELIDAIAASADIPKAVAGRALDAVIESVTGALKQGDDVVLVGFGTFSVKERAERTGR 61
NK +LI +A + ++ K + A+DAV +V+ L +G+ V L+GFG F V+ERA R GR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 62 NPQTGKAIKIEAAKVPGFKAGKGLKDAV 89
NPQTG+ IKI+A+KVP FKAGK LKDAV
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAV 90


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_2304   2FE2SRDCTASE  31     0.012    Ferric iron reductase signature.
		>2FE2SRDCTASE#Ferric iron reductase signature.

Length = 262

Score = 30.8 bits (69), Expect = 0.012
Identities = 13/38 (34%), Positives = 19/38 (50%)

Query: 536 GEDGIDPAELQALFRLGKPQAKDKPVYGSVVLRDGSLV 573
GE ++ F +D P++ +VVLRDG LV
Sbjct: 203 GEATVESLRHALFFEKTLTNGEDNPLWRTVVLRDGLLV 240


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_2309   ACRIFLAVINRP  30     0.003    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 29.8 bits (67), Expect = 0.003
Identities = 14/39 (35%), Positives = 24/39 (61%), Gaps = 3/39 (7%)

Query: 30 LIAVPLFILGTLLVLSGLFGFDLGQIAVGVIALIAALGL 68
IAVP+ +LGT +L+ FG+ + + + + L A+GL
Sbjct: 369 TIAVPVVLLGTFAILA-AFGYSINTLTMFGMVL--AIGL 404


29  PP_2355  PP_2362  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
PP_2355   2       11       1.458751   response regulator receiver protein
PP_2356   1       11       1.908220   phytochrome family protein
PP_2357   5       16       2.170153   spore coat U domain-containing protein
PP_2358   5       17       1.910619   spore coat U domain-containing protein
PP_2359   5       16       1.780245   spore coat U domain-containing protein
PP_2360   4       16       1.477982   spore coat U domain-containing protein
PP_2361   2       13       1.635110   type 1 pili usher pathway chaperone CsuC
PP_2362   3       12       1.771257   fimbrial biogenesis outer membrane usher
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
PP_2355   HTHFIS  49     8e-10    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 49.4 bits (118), Expect = 8e-10
Identities = 21/123 (17%), Positives = 52/123 (42%), Gaps = 13/123 (10%)

Query: 5 ILLVEDNPRDLELTLLALERSQLANEVIVLRDGADALDYLLRRNAYAERDDGNPAVLLLD 64
IL+ +D+ + AL R +V + + A ++ G+ +++ D
Sbjct: 6 ILVADDDAAIRTVLNQALSR--AGYDVRITSNAATLWRWI---------AAGDGDLVVTD 54

Query: 65 LKLPKVDGLEVLKEVRATAELRSIPTVMLTSSREEPDLLRAYELGVNAYVVKPVEFKEFV 124
+ +P + ++L ++ +P +++++ ++A E G Y+ KP + E +
Sbjct: 55 VVMPDENAFDLLPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELI 112

Query: 125 TAI 127
I
Sbjct: 113 GII 115


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_2356   PF06580  40     2e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 40.2 bits (94), Expect = 2e-05
Identities = 28/143 (19%), Positives = 50/143 (34%), Gaps = 28/143 (19%)

Query: 591 LLNFSQMGRSALRLSDVDLNAL---VEAIRSELAPD---YEGR-AIVWDIAPLPKVIGDP 643
L + S++ R +LR S+ +L + + S L +E R I P + P
Sbjct: 197 LTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVP 256

Query: 644 AFINMALHNLIANAIKY--TRGRTPARIEISAVQHPEETEVCIRDNGVGFDMAYANKLFG 701
+ + L+ N IK+ + +I + + + + + G
Sbjct: 257 PML---VQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTG------------- 300

Query: 702 VFQRLHRMEDFEGTGIGLASVRR 724
L E TG GL +VR
Sbjct: 301 ---SLALKNTKESTGTGLQNVRE 320


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_2362   PF00577  509    e-170    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 509 bits (1311), Expect = e-170
Identities = 152/811 (18%), Positives = 265/811 (32%), Gaps = 80/811 (9%)

Query: 58 TLYLDLVVNQMPR----VDLIPVQQRAG-RLYLDSDVLRAAGVSLPGNPQGEVALDS--- 109
T +D+ +N V G L L + G++ + D
Sbjct: 77 TYRVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACV 136

Query: 110 -----IAGLHTDYDSQNQRLLLQVPPAWLPEQQVGDRNLYPASDARSSFGALFNYDLYIN 164
I D QRL L +P A++ + G + P L NY+ N
Sbjct: 137 PLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGY--IPPELWDPGINAGLLNYNFSGN 194

Query: 165 DTD--EGGTYLAAWNELRLFDSWGTFSSTGQWRQSFKGAQADDTRRGFMRYDTTWRFTDE 222
GG A+ L+ + G + S+ + + + ++ TW D
Sbjct: 195 SVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDI 254

Query: 223 QRLL-TYEAGDFVTGALPWSSSVRVGGVQLSRDFAARPDLVTYPLPAFAGEAAVPTSLDL 281
L GD T + + G QL+ D PD P G A + +
Sbjct: 255 IPLRSRLTLGDGYTQGDIFDG-INFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTI 313

Query: 282 FINGFKSSTTELQPGPYTLTNVPFINGVGEAVVVTTDALGRQVSTTLPFYVTSSLLQKGL 341
NG+ + + PGP+T+ ++ G+ V +A G T+P+ L ++G
Sbjct: 314 KQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGH 373

Query: 342 SDYSVAAGSLRRDYGVRDFSYGPGIASGSLRYGLSDMFTLETHAETAESLMLGGLGGNMR 401
+ YS+ AG R ++ P +L +GL +T+ + A+ G
Sbjct: 374 TRYSITAGEYRSGNAQQE---KPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKN 430

Query: 402 VGNFGVLNAALAQSR--FEGDKGHQ-------------------VALGYQYNSQR-IGFG 439
+G G L+ + Q+ D H +GY+Y++ F
Sbjct: 431 MGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFA 490

Query: 440 YQRLQRHGDYADLSRVVSPDMQL-----------SKSSEQVTLSVNLNEYGSIGAGYFDV 488
R Y ++ ++ + Q+T++ L ++
Sbjct: 491 DTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQ 550

Query: 489 R-AGDGTRTRLINLSYSKPL-WGSSSVYLSANREVGDSQWAVQAQLVIPFDL-------- 538
G + + ++ S + L +
Sbjct: 551 TYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDS 610

Query: 539 -----HGTLALSMERSNEGETLQRVNYSRAVPAGVGVGYNL--GYAAGSD--RDAYRQAD 589
H + + SM G + + Y++ GYA G D + A
Sbjct: 611 KSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYAT 670

Query: 590 VTWRLQSVQLQAGTYGSSGEMTRWADASGSLVWMDAGVFAANRIDDAFVVVSTAGYADVP 649
+ +R G S + SG ++ GV ++D V+V G D
Sbjct: 671 LNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAK 730

Query: 650 VRYENQEIGRTDAKGHLLVPYSSGYYRGKYEIDPMNLPPDVLAPDVEQRVAVRRGSGYLL 709
V ENQ RTD +G+ ++PY++ Y + +D L +V + V RG+
Sbjct: 731 V--ENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRA 788

Query: 710 EFPLKRVMAASVELVDGNQQVLKLGSRVTHAESGTQAVVGWDGLVYLENLSSHNRLQVAL 769
EF + + + N + L G+ VT S + +V +G VYL + ++QV
Sbjct: 789 EFKARVG-IKLLMTLTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKW 847

Query: 770 --EGGGHCEVAFDLPEAQGSVPLIG-PLVCR 797
E HC + LP L CR
Sbjct: 848 GEEENAHCVANYQLPPESQQQLLTQLSAECR 878


30  PP_2422  PP_2427  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
PP_2422   2       11       2.013372   alkylhydroperoxidase
PP_2423   2       10       1.666546   hypothetical protein
PP_2424   5       14       2.693160   2'-5' RNA ligase
PP_2425   4       11       2.896918   AraC family transcriptional regulator
PP_2426   4       11       2.953793   D-isomer specific 2-hydroxyacid dehydrogenase
PP_2427   2       13       2.720233   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_2425   HTHTETR  30     0.007    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 30.4 bits (68), Expect = 0.007
Identities = 8/37 (21%), Positives = 15/37 (40%)

Query: 197 IGAALAHLREHYAEPLSVEALAARANMSVSTFHEHFK 233
+ AL + S+ +A A ++ + HFK
Sbjct: 17 LDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFK 53


31  PP_2590  PP_2611  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
PP_2590   2       13       1.792777   outer membrane ferric siderophore receptor
PP_2591   5       14       2.833775   ferric siderophore ABC transporter
PP_2592   4       13       2.657025   ferric siderophore ABC transporter ATP-binding
PP_2593   4       13       2.870545   ferric siderophore ABC transporter permease
PP_2594   2       12       2.630942   ferric siderophore ABC transporter permease
PP_2595   2       12       2.906088   ABC transporter ATP-binding protein/permease
PP_2596   -1      12       2.860418   ABC transporter ATP-binding protein/permease
PP_2597   -1      13       2.633266   hypothetical protein
PP_2598   -1      12       2.963983   hypothetical protein
PP_2599   -1      11       2.363590   metal dependent phosphohydrolase
PP_2600   2       11       2.808000   FAD-binding dehydrogenase
PP_2601   0       11       2.234168   IclR family transcriptional regulator
PP_2602   1       10       2.304005   oxidoreductase domain-containing protein
PP_2603   1       10       2.393682   xylose isomerase
PP_2604   2       10       1.981961   major facilitator family transporter
PP_2605   1       11       2.310387   fumarate reductase/succinate dehydrogenase
PP_2606   1       17       -0.792712  NIPSNAP family containing protein
PP_2607   1       18       -2.009945  fumarate reductase/succinate dehydrogenase
PP_2608   1       24       -3.723873  shikimate dehydrogenase
PP_2609   0       23       -3.893400  IclR family transcriptional regulator
PP_2610   0       23       -4.389706  hypothetical protein
PP_2611   0       22       -4.291777  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_2591   FERRIBNDNGPP  50     3e-09    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 50.3 bits (120), Expect = 3e-09
Identities = 45/201 (22%), Positives = 74/201 (36%), Gaps = 19/201 (9%)

Query: 56 TPLTVQHKLGTTVISQLPQRTVALDMNEVDFLDQLGVPVAGMPKDFVPDFLARYKD---- 111
+PL Q + P R VAL+ V+ L LG+ G V D Y+
Sbjct: 19 SPLLWQMNTAHAA-AIDPNRIVALEWLPVELLLALGIVPYG-----VAD-TINYRLWVSE 71

Query: 112 ---AGQTADVGSIVQPNLERVHAARPDLILITSLQANHYDELSEMAPTLHFDVDYRDSET 168
DVG +PNLE + +P ++ ++ + L+ +AP F+
Sbjct: 72 PPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSPEMLARIAPGRGFNFSDGKQP- 130

Query: 169 GHVAMVRQHLLSLGQVFGKQGLAQRKADALEAKLAQARS--VTRDRPERALVVLHNNGAF 226
+AM R+ L + + Q A+ E + + V R L L +
Sbjct: 131 --LAMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHM 188

Query: 227 SAFGVQSRYGFIFNDLGVKPA 247
FG S + I ++ G+ A
Sbjct: 189 LVFGPNSLFQEILDEYGIPNA 209


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_2597   TETREPRESSOR  38     2e-05    Tetracycline repressor protein signature.
		>TETREPRESSOR#Tetracycline repressor protein signature.

Length = 218

Score = 37.6 bits (87), Expect = 2e-05
Identities = 34/141 (24%), Positives = 55/141 (39%), Gaps = 13/141 (9%)

Query: 35 ITRERIADVSI----AIGLPNLTFVGVAAALGVSHMALYKHVPNIEALKCLVAEEIFQR- 89
+ RE + D ++ G+ LT +A LG+ LY HV N AL +A EI R
Sbjct: 4 LNRESVIDAALELLNETGIDGLTTRKLAQKLGIEQPTLYWHVKNKRALLDALAVEILARH 63

Query: 90 --WQIPRADGNCVEGLQSYLTRFATSVQAFVKAHPGLTPYVIRRLAATEAMIDKINDHQA 147
+ +P A E QS+L A S + + + + + + Q
Sbjct: 64 HDYSLPAAG----ESWQSFLRNNAMSFRRALLRYRDGAKVHLGTRPDEKQYDTV--ETQL 117

Query: 148 HIAQAYGLSLEEARWLLSTVA 168
G SL + + +S V+
Sbjct: 118 RFMTENGFSLRDGLYAISAVS 138


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_2602   TYPE3IMSPROT  40     1e-05    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.
Length = 354

Score = 39.7 bits (93), Expect = 1e-05
Identities = 25/108 (23%), Positives = 43/108 (39%), Gaps = 25/108 (23%)

Query: 57 RQMLEQVRPEAVIVANPNTLHVATAL--DCVEAGVPVLVEKPVGVNLDEVRALVEASRRR 114
R M E V+ +V+VANP H+A + E +P++ K + +V+ + + +
Sbjct: 248 RNMRENVKRSSVVVANPT--HIAIGILYKRGETPLPLVTFK--YTDA-QVQTVRKIAEEE 302

Query: 115 GVPVLVGHHRRHNPLIAKAHQVINEGKLGRLINVTALWQLQKPDSYFE 162
GVP+L PL A + + + I P E
Sbjct: 303 GVPILQRI-----PL---ARALYWDALVDHYI----------PAEQIE 332


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_2604   TCRTETB  40     2e-05    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 40.2 bits (94), Expect = 2e-05
Identities = 36/156 (23%), Positives = 67/156 (42%), Gaps = 3/156 (1%)

Query: 80 FATTLNYIDRAALGIMQPVLAKEMSWTAMDYANINFWFQVGYAIGFLLQGRLIDKVGVKR 139
+ + ++ L + P +A + + +N F + ++IG + G+L D++G+KR
Sbjct: 21 ILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKR 80

Query: 140 AFFFAVLLWSLATGAHGLATSAAGFMV-CRFILGLTEAANYPACVKTVRLWF-PAGERAI 197
F +++ + + S ++ RFI G AA +PA V V + P R
Sbjct: 81 LLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAA-FPALVMVVVARYIPKENRGK 139

Query: 198 ATGLFNAGTNVGAMVTPALLPLILAVWGWQAAFIAM 233
A GL + +G V PA+ +I W +
Sbjct: 140 AFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIP 175


32  PP_2629  PP_2650  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
PP_2629   3       14       1.513812   hypothetical protein
PP_2630   2       12       2.426942   hypothetical protein
PP_2631   2       13       2.269233   hypothetical protein
PP_2632   2       13       2.502975   hypothetical protein
PP_2633   3       13       2.862210   hypothetical protein
PP_2634   2       13       2.622783   cellulose synthase
PP_2635   2       13       2.418034   cellulose synthase catalytic subunit
PP_2636   2       13       2.045786   cellulose synthase regulator protein
PP_2637   2       15       1.905405   endo-1,4-D-glucanase
PP_2638   2       14       1.486770   cellulose synthase subunit BcsC
PP_2639   2       14       0.529660   dihydrodipicolinate synthase
PP_2640   0       9        0.515202   acetyltransferase
PP_2641   0       10       0.704355   (Fe-S)-binding protein
PP_2642   0       9        0.747374   GntR family transcriptional regulator
PP_2643   0       8        0.592474   methyl-accepting chemotaxis sensory transducer
PP_2644   1       10       0.569404   hypothetical protein
PP_2645   2       10       0.749391   magnesium-translocating P-type ATPase
PP_2646   2       11       1.006547   hypothetical protein
PP_2647   2       11       0.832484   major facilitator family transporter
PP_2648   2       12       1.097161   universal stress protein
PP_2649   2       12       1.019773   LysR family transcriptional regulator
PP_2650   2       13       0.886342   iron-containing alcohol dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_2640   SACTRNSFRASE  36     2e-05    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 36.1 bits (83), Expect = 2e-05
Identities = 22/123 (17%), Positives = 52/123 (42%), Gaps = 31/123 (25%)

Query: 34 GIETFTQVSAPQAFAERMQGDNLML--------ACFV---EGAIAGLIELKEG------- 75
G+ T+T+ + + ++ + D++ + A F+ E G I+++
Sbjct: 33 GVWTYTEERFSKPYFKQYEDDDMDVSYVEEEGKAAFLYYLENNCIGRIKIRSNWNGYALI 92

Query: 76 RHIAMLFIAPGLQRQGIGKRLMNAALEHASA--------EVVTVKASLSSVPAYQRYGFT 127
IA +A +++G+G L++ A+E A E + ++S+ Y ++ F
Sbjct: 93 EDIA---VAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDI--NISACHFYAKHHFI 147

Query: 128 LAG 130
+
Sbjct: 148 IGA 150


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
PP_2643   IGASERPTASE  33     0.005    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 32.7 bits (74), Expect = 0.005
Identities = 35/213 (16%), Positives = 64/213 (30%), Gaps = 21/213 (9%)

Query: 284 AAATALNAVTEESANNLRQQGQELEQAATAVTEMTTAVEEVARNAITTSQTTSE---SNQ 340
A N N + Q G E ++ T T+ T VE+ + + T +T ++Q
Sbjct: 1069 AKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQ 1128

Query: 341 LAAQSRRQVSENIDGTEAMTREIQTSSAHLQQLVGQVRDIGKVLEVIRS-----VSEQTN 395
++ + + + A + + Q D + + S V+E T
Sbjct: 1129 VSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTT 1188

Query: 396 LLALNAAIE-------AARAGEAGRGFAVVADEVRTLAYRTQQSTQEIEQMIGSVQAGTE 448
+ N+ +E A + + R+ E S T
Sbjct: 1189 VNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVE-PATTSSNDRSTV 1247

Query: 449 AAVASMQASTNRAQS-----TLDVTLASGQVLE 476
A +TN S V L G+ +
Sbjct: 1248 ALCDLTSTNTNAVLSDARAKAQFVALNVGKAVS 1280


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_2647   TCRTETA  31     0.011    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 30.6 bits (69), Expect = 0.011
Identities = 60/278 (21%), Positives = 98/278 (35%), Gaps = 20/278 (7%)

Query: 62 GLMVTLPGIMAALAAPLISVGVGALDRRYLLIGLTLIMIIANAIVAYAGDFNLLLVGRVL 121
G+++ L +M AP++ RR +L+ + AI+A A +L +GR++
Sbjct: 46 GILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIV 105

Query: 122 LGISIGGFWATAIALSGRLAPDGMGVAKANSIIMAGVTLATVVGVPVGTWLSGLMGWRMT 181
GI+ G A A A + G A+ + A V G +G + G
Sbjct: 106 AGIT-GATGAVAGAYIADITD-GDERARHFGFMSACFGFGMVAGPVLGGLMGGF-SPHAP 162

Query: 182 FLVTALVGVPVLLAQVFLLPRLMPEKAIRIRDLPALFINPQARVGLIAVLLIGLAHFAAY 241
F A + L FLLP K R R L +NP A + + A A +
Sbjct: 163 FFAAAALNGLNFLTGCFLLPE--SHKGER-RPLRREALNPLASFRWARGMTVVAALMAVF 219

Query: 242 -----------TYVAPFFKQNAGFDGPTIGSLLLLYGVAG-FMGNIFAGFAANRSVRHTL 289
F + +D TIG L +G+ + G A R
Sbjct: 220 FIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRA 279

Query: 290 MLVALMIAVSTALFPHFATGMTGAAMLIALWGFAFGAF 327
+++ ++ + + FAT G + A G
Sbjct: 280 LMLGMIADGTGYILLAFAT--RGWMAFPIMVLLASGGI 315


33  PP_2666  PP_2671  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
PP_2666   3       13       2.577804   hypothetical protein
PP_2667   3       15       2.601300   ABC transporter
PP_2668   5       16       2.986689   ABC transporter ATP-binding protein
PP_2669   4       13       2.285572   hypothetical protein
PP_2670   3       12       2.497424   hypothetical protein
PP_2671   2       14       2.165939   integral membrane sensor signal transduction
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_2667   ABC2TRNSPORT  42     9e-07    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.
Length = 262

Score = 42.2 bits (99), Expect = 9e-07
Identities = 27/116 (23%), Positives = 51/116 (43%), Gaps = 7/116 (6%)

Query: 139 PAAGLLMALPALLLVAFMLSALGLLLSNAIRQLENFAGVMNFVIFPLFFLSSALYPLWKM 198
LL ALP + L ++LG++++ + F VI P+ FLS A++P+ ++
Sbjct: 143 QWLSLLYALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQL 202

Query: 199 REASQWLYWLCAVNPFTHAVELVRFALYER----LNLLALAVCLGLTALFTLLAIL 250
Q P +H+++L+R + + A+C+ + F L L
Sbjct: 203 PIVFQTAARFL---PLSHSIDLIRPIMLGHPVVDVCQHVGALCIYIVIPFFLSTAL 255


34  PP_2715  PP_2729  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
PP_2715   2       15       -0.408938  arsH protein
PP_2716   2       16       -0.753642  arsenate reductase
PP_2717   2       19       -1.989830  arsenite efflux transporter
PP_2718   1       23       -3.431501  arsenic resistance transcriptional regulator
PP_2719   2       22       -3.759533  hypothetical protein
PP_2720   0       21       -3.679475  hypothetical protein
PP_2721   -1      20       -3.951354  hypothetical protein
PP_2722   -1      18       -2.998698  hemerythrin HHE cation binding domain-containing
PP_2723   -2      17       -2.175023  short chain dehydrogenase/reductase
PP_2724   -3      23       -2.318087  hypothetical protein
PP_2725   -3      23       -1.576857  protease PfpI
PP_2726   -3      21       -0.778883  hypothetical protein
PP_2727   -1      18       0.244703   C-factor
PP_2728   1       17       0.798641   hypothetical protein
PP_2729   2       13       1.270898   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_2723   DHBDHDRGNASE  112    7e-32    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.
Length = 261

Score = 112 bits (282), Expect = 7e-32
Identities = 75/253 (29%), Positives = 114/253 (45%), Gaps = 13/253 (5%)

Query: 40 LAGKIALITGADSGIGRAVAIAYAREGADVAIAYLNEHDDAQETARWVKAAGRQCLLLPG 99
+ GKIA ITGA GIG AVA A +GA +A N + ++ +KA R P
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNP-EKLEKVVSSLKAEARHAEAFPA 64

Query: 100 DLAQKQHCHDIVDKTVAQFGRIDILVNNAAFQMAHESLDDIDDDEWVKTFDTNITAIFRI 159
D+ +I + + G IDILVN A + + + D+EW TF N T +F
Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAGV-LRPGLIHSLSDEEWEATFSVNSTGVFNA 123

Query: 160 CQRALPSMP--KGGSIINTSSVNSDDPSPSLLAYAATKGAIANFTAGLAQLLGKQGIRVN 217
+ M + GSI+ S + P S+ AYA++K A FT L L + IR N
Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183

Query: 218 SVAPGPI-----WTPLIPATMPDEAVRNFGS----GYPMGRPGQPVEVAPIYVLLGSDEA 268
V+PG W+ ++ ++ G P+ + +P ++A + L S +A
Sbjct: 184 IVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQA 243

Query: 269 SYISGSRYAVTGG 281
+I+ V GG
Sbjct: 244 GHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_2727   DHBDHDRGNASE  51     9e-10    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.
Length = 261

Score = 51.2 bits (122), Expect = 9e-10
Identities = 49/247 (19%), Positives = 91/247 (36%), Gaps = 38/247 (15%)

Query: 12 NVLVCGASQGIGLALCTQLLARDDIGLVFAVSRRATTSPALDTLFTEHARRLVRLDCDAR 71
+ GA+QGIG A+ L ++ + AV + + AR D R
Sbjct: 10 IAFITGAAQGIGEAVARTLASQG--AHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVR 67

Query: 72 SEQALQTLAAQVGACCDQLNLVFSTLGVLQEGPARAEKALTQLDMAGLLSSFTTNCFAPV 131
A+ + A++ ++++ + GVL+ G + L ++F+ N
Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGL------IHSLSDEEWEATFSVNSTGVF 121

Query: 132 LLLKHLLPLLRRHPMAFVALSARVGSIGDNHLG----GWYSYRASKAALNQLLRTASIEL 187
+ + + S + ++G N G +Y +SKAA + +EL
Sbjct: 122 NASRSVSKYMMDRR------SGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLEL 175

Query: 188 KRLNPASTVLALHPGTTDTQLSRP------------------FQGNVPPGKLFSPAFAAT 229
N +++ PG+T+T + F+ +P KL P+ A
Sbjct: 176 AEYNIRCNIVS--PGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIAD 233

Query: 230 CILALVS 236
+L LVS
Sbjct: 234 AVLFLVS 240


35  PP_2778  PP_2799  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
PP_2778   2       22       0.068566   3-oxoacyl-ACP synthase
PP_2779   2       22       0.046453   beta-ketoacyl synthase
PP_2780   3       20       0.041428   3-oxoacyl-ACP synthase
PP_2781   2       17       -0.733256  beta-ketoacyl synthase
PP_2782   3       15       -2.168053  pyridoxalphosphate dependent aminotransferase,
PP_2783   2       14       -2.619153  3-oxoacyl-ACP reductase
PP_2784   1       13       -2.945318  short chain dehydrogenase/reductase
PP_2785   -1      12       -1.808100  hypothetical protein
PP_2786   -2      11       -1.321072  hypothetical protein
PP_2787   -2      12       -0.739623  transporter
PP_2788   -1      11       1.657662   MerR family transcriptional regulator
PP_2789   -2      11       2.997171   oxidoreductase
PP_2790   -1      11       3.733179   Fis family transcriptional regulator
PP_2791   0       15       4.020180   aminoglycoside phosphotransferase
PP_2792   1       16       3.236855   hypothetical protein
PP_2793   0       15       2.711538   acyl-CoA dehydrogenase
PP_2794   0       14       1.923774   short chain dehydrogenase/reductase
PP_2795   0       13       1.639772   acyl-CoA synthetase
PP_2796   0       13       1.467966   hypothetical protein
PP_2797   0       13       1.841291   acetate permease
PP_2798   2       15       2.005187   oxidoreductase
PP_2799   2       16       2.437786   aminotransferase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_2783   DHBDHDRGNASE  135    4e-41    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.
Length = 261

Score = 135 bits (341), Expect = 4e-41
Identities = 83/249 (33%), Positives = 122/249 (48%), Gaps = 12/249 (4%)

Query: 4 KIAVVTGGSRGIGKSIVLALAGAGYQVAFSYVRDEASAAALQAQVEGLGRDCLAVQCDVK 63
KIA +TG ++GIG+++ LA G +A E + + + R A DV+
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSL-KAEARHAEAFPADVR 67

Query: 64 EAPSIQAFFERVEQRFERIDLLVNNAGITRDGLLATQSLNDITEVIQTNLVGTLLCCQQV 123
++ +I R+E+ ID+LVN AG+ R GL+ + S + N G + V
Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 124 LPCMMRQRSGCIVNLSSVAAQKPGKGQSNYAAAKGGVEALTRALAVELAPRNIRVNAVAP 183
MM +RSG IV + S A P + YA++K T+ L +ELA NIR N V+P
Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187

Query: 184 GIVSTDMSQAL---VGAHEQEI-----QSRLLI--KRFARPEEIADAVLYLA-ERGLYIT 232
G TDM +L EQ I + I K+ A+P +IADAVL+L + +IT
Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHIT 247

Query: 233 GEVLSVNGG 241
L V+GG
Sbjct: 248 MHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_2784   DHBDHDRGNASE  96     5e-26    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.
Length = 261

Score = 96.3 bits (239), Expect = 5e-26
Identities = 76/258 (29%), Positives = 114/258 (44%), Gaps = 13/258 (5%)

Query: 5 RTIVITGAANGIGRAVAESFAAQAEHLLILLDRDLATLQGWVTEGEFAARIETHQANIAD 64
+ ITGAA GIG AVA + A+Q H+ + + + A E A++ D
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 65 LASLQLLFKGLADRVGFVDVLVNSAGVCDENEPEDL--DNWHKVISINLNGTFYVTSLCL 122
A++ + + +G +D+LVN AGV L + W S+N G F +
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 123 PLMAD--GGRIVNMSSILGRAGKVRNTAYCASKHGIIGMTKALALDLAPRRITVNAILPA 180
M D G IV + S + AY +SK + TK L L+LA I N + P
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 181 WIDTPMLQ----GELAAQARIAGITHEQILRNAKKKLPLRRFIQGDEVAAMVRYLASPQA 236
+T M E A+ I G L K +PL++ + ++A V +L S QA
Sbjct: 189 STETDMQWSLWADENGAEQVIKGS-----LETFKTGIPLKKLAKPSDIADAVLFLVSGQA 243

Query: 237 SGVTAQSLMIDGGAGLGM 254
+T +L +DGGA LG+
Sbjct: 244 GHITMHNLCVDGGATLGV 261


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_2787   ACRIFLAVINRP  55     1e-09    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 54.8 bits (132), Expect = 1e-09
Identities = 46/343 (13%), Positives = 114/343 (33%), Gaps = 38/343 (11%)

Query: 171 EMASMADLENISLSADGELWIHKTLHALDMDPIKVE--AQIMGNEQMVGGVVSADKK--V 226
+ + ++L + D ++++ A++ + + + K
Sbjct: 239 RFKNPEEFGKVTLRVNS-----------DGSVVRLKDVARVELGGENYNVIARINGKPAA 287

Query: 227 AMVVAELGTKQDDAQAQLRAYHQVREIIAKYQAAHPEFTDEVFIAGMPIFIAAQQEIIDH 286
+ + A A L ++ +A+ Q P+ ++ F+ +
Sbjct: 288 GLGIK----LATGANA-LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVK 342

Query: 287 DLAMLFPIVFLLVTSLLVFFFRKPLGVVLPLFNILFCTIWTLGLMALLRVPMDLLTSVLP 346
L +VFL++ F + ++P + + T ++A ++ LT
Sbjct: 343 TLFEAIMLVFLVM----YLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGM 398

Query: 347 VFLFTICCADAIHVMAEYYEQLNSGKS-FREANRETQRLMVTPVVLTTVTTIATFL-IST 404
V + DAI V+ + K +EA ++ + +V + A F+ ++
Sbjct: 399 VLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAF 458

Query: 405 TNNIVSI--RNFGVFMSIGLTAALIISLLLIPAWISIWGKDAVPRKVQLKESLISHYLVV 462
R F + + + +++++L+L PA + K + K +
Sbjct: 459 FGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTT 518

Query: 463 F----------CAWLIRWRKPVLLVTLPLLAMMTVFTFKVDIE 495
F ++ LL+ ++A M V ++
Sbjct: 519 FDHSVNHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSS 561



Score = 52.9 bits (127), Expect = 5e-09
Identities = 36/203 (17%), Positives = 88/203 (43%), Gaps = 18/203 (8%)

Query: 670 PANLQVTHAGTPYIWTGVLQEITQGQVLSFSLALLAVTLMMMFWLKSVRLGILGMLTLLT 729
P ++V PY T +Q V + A++ V L+M +L+++R ++ + +
Sbjct: 318 PQGMKVL---YPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPV 374

Query: 730 TSVTVYGSMYLLDIELNIGTTLVTFLVVG-VVDYAVHLLSRI-KMLVQKGIEIDEAILAA 787
+ + + +N T L +G +VD A+ ++ + +++++ + EA +
Sbjct: 375 VLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKS 434

Query: 788 MQGVGRSTVVNVVIFSMGFVALLFSA------YKPVIDLGVLVILALSSSGFMTILLVTL 841
M + + V ++ S F+ + F Y+ + ++ A++ S + ++L
Sbjct: 435 MSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQ---FSITIVSAMALSVLVALILT-- 489

Query: 842 ISPWFFASIVPQPAVQEGEQPGG 864
P A+++ + + E GG
Sbjct: 490 --PALCATLLKPVSAEHHENKGG 510



Score = 49.8 bits (119), Expect = 4e-08
Identities = 36/199 (18%), Positives = 82/199 (41%), Gaps = 19/199 (9%)

Query: 649 SVAGDYQAMLDKLDAWLAINKPANLQVTHAGTPYIWTGVLQEITQGQVLSFSLALLAVTL 708
+ +GD A+++ L + L PA + G Y + +++ + V L
Sbjct: 834 TSSGDAMALMENLASKL----PAGIGYDWTGMSY----QERLSGNQAPALVAISFVVVFL 885

Query: 709 MMMFWLKSVRLGILGMLTLLTTSVTVYGSMYLLDIELNIGTTLVTFLVVGVVDY-AVHLL 767
+ +S + + ML + V V + L + + ++ + +G+ A+ ++
Sbjct: 886 CLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIV 945

Query: 768 SRIK-MLVQKGIEIDEAILAAMQGVGRSTVVNVVIFSMGFVALLFS------AYKPVIDL 820
K ++ ++G + EA L A++ R ++ + F +G + L S A +
Sbjct: 946 EFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNA---V 1002

Query: 821 GVLVILALSSSGFMTILLV 839
G+ V+ + S+ + I V
Sbjct: 1003 GIGVMGGMVSATLLAIFFV 1021



Score = 49.5 bits (118), Expect = 6e-08
Identities = 29/153 (18%), Positives = 60/153 (39%), Gaps = 6/153 (3%)

Query: 288 LAMLFPIVFLLVTSLLVFFFRKPLGVVLPLFNILFCTIWTLGLMALLRVPMDLLTSVLPV 347
L I F++V L + V + + + L L D+ V +
Sbjct: 872 APALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLL 931

Query: 348 FLFTICCADAIHVMAEYYEQL--NSGKSFREANRETQRLMVTPVVLTTVTTIATFL---I 402
+ +AI ++ E+ + L GK EA R+ + P+++T++ I L I
Sbjct: 932 TTIGLSAKNAI-LIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAI 990

Query: 403 STTNNIVSIRNFGVFMSIGLTAALIISLLLIPA 435
S + G+ + G+ +A ++++ +P
Sbjct: 991 SNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPV 1023


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
PP_2790   HTHFIS  335    e-110    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 335 bits (861), Expect = e-110
Identities = 134/364 (36%), Positives = 193/364 (53%), Gaps = 24/364 (6%)

Query: 304 ITVVQRADQRIRSTRRPGAFTARYRLDQLNGNSKANREMLQLAKRFATSHSTILITGESG 363
I ++ RA + L G S A +E+ ++ R + T++ITGESG
Sbjct: 112 IGIIGRALAEPKRRPSKLE-DDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESG 170

Query: 364 TGKELLAQGIHNESPRRQGPFVAINCAAFPESLLESELFGYEEGAFSGSRKGGKPGLFEA 423
TGKEL+A+ +H+ RR GPFVAIN AA P L+ESELFG+E+GAF+G+ + G FE
Sbjct: 171 TGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGA-QTRSTGRFEQ 229

Query: 424 AHRGTLFLDEIGDMPVSLQTRLLRVLQEREVLRLGSTEPIAIDVRIIAATHKDLRSAMDD 483
A GTLFLDEIGDMP+ QTRLLRVLQ+ E +G PI DVRI+AAT+KDL+ +++
Sbjct: 230 AEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQ 289

Query: 484 GDFRTDLYFRLNILRLQTTPLRERPEDIALICRGISQRLLVQGQPPGAADIPAALLPYLE 543
G FR DLY+RLN++ L+ PLR+R EDI + R Q+ +G L ++
Sbjct: 290 GLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDV--KRFDQEALELMK 347

Query: 544 RYAWPGNVRELENVIERAMLSARELLEEHRVNEQYLARVLPELCEGPPPSPARKKS---- 599
+ WPGNVRELEN++ R + + + E L +P+ + + S
Sbjct: 348 AHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQA 407

Query: 600 ----------------SGETDLHTIGKVAQLRHVKETLESCRGNLDEAARRLGISRTTLW 643
+ + + L + RGN +AA LG++R TL
Sbjct: 408 VEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLR 467

Query: 644 RRLR 647
+++R
Sbjct: 468 KKIR 471


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_2794   DHBDHDRGNASE  112    5e-32    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.
Length = 261

Score = 112 bits (280), Expect = 5e-32
Identities = 72/253 (28%), Positives = 123/253 (48%), Gaps = 14/253 (5%)

Query: 10 ALDGRRALVTGASSGLGRHFAMTLAAAGAEVVVTARRQAPLQALVEAIEVAGGRAQAFAL 69
++G+ A +TGA+ G+G A TLA+ GA + L+ +V +++ A+AF
Sbjct: 5 GIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPA 64

Query: 70 DVTS----REDICRVLDAAGPLDVLVNNAGVSDSQPLLACDDQTWDHVLDTNLKGAWAVA 125
DV E R+ GP+D+LVN AGV + + D+ W+ N G + +
Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNAS 124

Query: 126 QESARRMVVAGKGGSLINVTSILASRVAGAVGPYLAAKAGLAHLTRAMALELARHGIRVN 185
+ ++ M+ + GS++ V S A ++ Y ++KA T+ + LELA + IR N
Sbjct: 125 RSVSKYMM-DRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183

Query: 186 ALAPGYVMTDLNEAFLASEAGDKLRSR---------IPSRRFSVPSDLDGALLLLASDAG 236
++PG TD+ + A E G + + IP ++ + PSD+ A+L L S
Sbjct: 184 IVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQA 243

Query: 237 RAMSGAEIVVDGG 249
++ + VDGG
Sbjct: 244 GHITMHNLCVDGG 256


36  PP_2835  PP_2847  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
PP_2835        2       11   2.439678  Ldh family oxidoreductase
PP_2836        1       13   2.229479  fumarylacetoacetate hydrolase
PP_2837        1       12   2.217029  major facilitator family transporter
PP_2838        2       12   2.649109  VRR-NUC domain-containing protein
PP_2839        0       13   2.559371  DEAD/DEAH box helicase
PP_2840        2       13   2.841381  NnrS family protein
PP_2841        3       13   2.841356  phage integrase site specific recombinase
PP_2842        3       11   3.152305  urease accessory protein UreD
PP_2843        3       12   3.402896  urease subunit gamma
PP_2844        1       12   3.058183  urease subunit beta
PP_2845        1       11   3.092575  urease subunit alpha
PP_2846        2       14   2.280136  urease accessory protein UreE
PP_2847        3       11   1.903389  HupE/UreJ protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_2837   TCRTETB  43     2e-06    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 42.9 bits (101), Expect = 2e-06
Identities = 31/181 (17%), Positives = 72/181 (39%), Gaps = 4/181 (2%)

Query: 12 VVFLLLIGIVNYLDRSALSIANTSIQKDMMISPSQMGILLSAFSIAYAFAQLPMGMIIDR 71
+++L ++ + L+ L+++ I D P+ + +AF + ++ G + D+
Sbjct: 16 LIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ 75

Query: 72 LGSK--IALGASLLGWSVAQAAFGMVNSFAGFMGLRVLLGIGEAPMFPSAAKALSEWFDA 129
LG K + G + + G + F+ + R + G G A ++ +
Sbjct: 76 LGIKRLLLFGIIINCFGSVIGFVGH-SFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134

Query: 130 NERGTPTGVVWSSTCLGPCLAPPLLTLFMVNFGWRGMFIITGVIGVVLALCWLTFYKSKA 189
RG G++ S +G + P + + W + +I +I ++ + K +
Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIP-MITIITVPFLMKLLKKEV 193

Query: 190 R 190
R
Sbjct: 194 R 194


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
PP_2845   UREASE  1062   0.0      Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 1062 bits (2748), Expect = 0.0
Identities = 404/566 (71%), Positives = 471/566 (83%), Gaps = 2/566 (0%)

Query: 4 ISRQAYADMFGPTVGDRVRLADTALWVEVEKDFTIYGEEVKFGGGKVIRDGMGQGQML-A 62
+SR AYA+MFGPTVGD+VRLADT L++EVEKDFT +GEEVKFGGGKVIRDGMGQ Q+
Sbjct: 5 MSRAAYANMFGPTVGDKVRLADTELFIEVEKDFTTHGEEVKFGGGKVIRDGMGQSQVTRE 64

Query: 63 AEAMDLVLTNALIIDHWGIVKADIGIKHGRIAVIGKAGNPDVQPGVNVPVGPGTEVIAAE 122
A+D V+TNALI+DHWGIVKADIG+K GRIA IGKAGNPD+QPGV + VGPGTEVIA E
Sbjct: 65 GGAVDTVITNALILDHWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGE 124

Query: 123 GKIVTAGGVDSHIHFICPQQVDEALNSGVTTFIGGGTGPATGTNATTCTPGPWYLARMLQ 182
GKIVTAGG+DSHIHFICPQQ++EAL SG+T +GGGTGPA GT ATTCTPGPW++ARM++
Sbjct: 125 GKIVTAGGMDSHIHFICPQQIEEALMSGLTCMLGGGTGPAHGTLATTCTPGPWHIARMIE 184

Query: 183 AADSLPINIGLLGKGNASRPDALREQIAAGAVGLKLHEDWGSTPAAIDCCLGVAEEMDIQ 242
AAD+ P+N+ GKGNAS P AL E + GA LKLHEDWG+TPAAIDCCL VA+E D+Q
Sbjct: 185 AADAFPMNLAFAGKGNASLPGALVEMVLGGATSLKLHEDWGTTPAAIDCCLSVADEYDVQ 244

Query: 243 VAIHTDTLNESGCIEDTLAAIGDRTIHTFHTEGAGGGHAPDIIRAAGQANVLPSSTNPTL 302
V IHTDTLNESG +EDT+AAI RTIH +HTEGAGGGHAPDIIR GQ NV+PSSTNPT
Sbjct: 245 VMIHTDTLNESGFVEDTIAAIKGRTIHAYHTEGAGGGHAPDIIRICGQPNVIPSSTNPTR 304

Query: 303 PYTINTVDEHLDMLMVCHHLDPSIAEDVAFAESRIRRETIAAEDILHDMGAFAMTSSDSQ 362
PYT+NT+ EHLDMLMVCHHL P+I ED+AFAESRIR+ETIAAEDILHD+GAF++ SSDSQ
Sbjct: 305 PYTVNTLAEHLDMLMVCHHLSPTIPEDIAFAESRIRKETIAAEDILHDIGAFSIISSDSQ 364

Query: 363 AMGRVGEVVLRTWQVAHQMKLRRGPLAPDTPYSDNFRVKRYIAKYTINPALTHGIGHEVG 422
AMGRVGEV +RTWQ A +MK +RG L +T +DNFRVKRYIAKYTINPA+ HG+ HE+G
Sbjct: 365 AMGRVGEVAIRTWQTADKMKRQRGRLKEETGDNDNFRVKRYIAKYTINPAIAHGLSHEIG 424

Query: 423 SVEVGKLADLVLWSPAFFAVKPALVLKGGMIVTAPMGDINGSIPTPQPVHYRPMFGALGA 482
S+EVGK ADLVLW+PAFF VKP +VL GG I APMGD N SIPTPQPVHYRPMFGA G
Sbjct: 425 SLEVGKRADLVLWNPAFFGVKPDMVLLGGTIAAAPMGDPNASIPTPQPVHYRPMFGAYGR 484

Query: 483 ARHATRMTFLPQAAMDRGLAEELNLRSLIGVVNGCR-RVRKPDMVHNTLQPLIEVDAQTY 541
+R + +TF+ QA++D GLA L + + V R + K M+HN+L P IEVD +TY
Sbjct: 485 SRTNSSVTFVSQASLDAGLAGRLGVAKELVAVQNTRGGIGKASMIHNSLTPHIEVDPETY 544

Query: 542 QVRADGELLVCEPASELPLAQRYFLF 567
+VRADGELL CEPA+ LP+AQRYFLF
Sbjct: 545 EVRADGELLTCEPATVLPMAQRYFLF 570


37  PP_2871  PP_2886  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
PP_2871        2       13   0.201570  aldolase
PP_2872        4       11  -0.087416  hypothetical protein
PP_2873        3       13  -0.121929  hypothetical protein
PP_2874        2       14   0.383422  hypothetical protein
PP_2875        3       15   0.381110  hypothetical protein
PP_2876        2       15   0.736581  beta-lactamase
PP_2877        2       18   0.305637  bile acid/Na+ symporter family protein
PP_2878        1       16   1.304428  hypothetical protein
PP_2879       -2       15   1.421818  hypothetical protein
PP_2880        1       14   2.711260  TetR family transcriptional regulator
PP_2881        2       14   2.800176  short chain dehydrogenase/reductase
PP_2882        3       15   3.685341  DSBA oxidoreductase
PP_2883        3       16   4.077528  hypothetical protein
PP_2884        4       14   4.273355  XRE family transcriptional regulator
PP_2885        4       15   4.071828  major facilitator superfamily transporter
PP_2886        2       14   3.075953  cytochrome B561
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_2880   HTHTETR  52     2e-10    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 51.6 bits (123), Expect = 2e-10
Identities = 36/186 (19%), Positives = 61/186 (32%), Gaps = 17/186 (9%)

Query: 1 MRYSNEHKQQTRERLLASSGALAKRGGFASTGVAGLMKAIGLTGGAFYNHFPSKDDLFTE 60
R + + Q+TR+ +L + L + G +ST + + KA G+T GA Y HF K DLF+E
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 61 VVRQELSNSPLARLASKGA----NRERLGRCLQQYLSLVHLRNAEGGCPLPPLGVEIARA 116
+ SN L + L L L
Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVG 121

Query: 117 DTPVREVAEHWLVELHRAWSTTL-------------EDEQLAWVLISQCVGALLVGRMLA 163
+ V + A+ L + A +++ + L+ + A
Sbjct: 122 EMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFA 181

Query: 164 SESVQA 169
+S
Sbjct: 182 PQSFDL 187


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_2881   DHBDHDRGNASE  70     2e-16    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.
Length = 261

Score = 69.7 bits (170), Expect = 2e-16
Identities = 49/231 (21%), Positives = 93/231 (40%), Gaps = 15/231 (6%)

Query: 6 KVVLVIGAGDATGGEIAKRFAREGYIACVTRRQADKLQPLLEEIHAAGGQAYGFGSDARK 65
K+ + GA G +A+ A +G +KL+ ++ + A A F +D R
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 66 EEEVAELVETIERDIGPIEAFVFNIGANVPCSILEETPRKYFKIWEMACFAGFLTAQAVA 125
+ E+ IER++GPI+ V G P I + ++ + + F +++V+
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 126 RRMVQRERGTILFTGATAGTRGAAGFAAFAGAKHGLRALAQSMARELGPRNIHVAHVVVD 185
+ M+ R G+I+ G+ AA+A +K + + EL NI ++V
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIR-CNIVSP 187

Query: 186 GAIDTAFIRDSFPERYALKDQ--------------DGILDPAHIADSYWFL 222
G+ +T + + + + P+ IAD+ FL
Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFL 238


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_2885   TCRTETA  94     3e-23    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 93.7 bits (233), Expect = 3e-23
Identities = 83/337 (24%), Positives = 125/337 (37%), Gaps = 37/337 (10%)

Query: 62 GAAVTVAGVVWVLLARPWGRAADRLGRRRILLLGSAGFTLAYWLLCLFVEGALRWMPGAT 121
G + + ++ A G +DR GRR +LL+ AG + Y + W+
Sbjct: 46 GILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAI---MATAPFLWV---- 98

Query: 122 LAFIGLMIARGCIGAFYAAIPVGYNALIADHVEPQRRARAMASLGAANAVGLVVGPALAA 181
+IG ++A G GA A A IAD + RAR + A G+V GP L
Sbjct: 99 -LYIGRIVA-GITGATGAVA----GAYIADITDGDERARHFGFMSACFGFGMVAGPVLGG 152

Query: 182 LLARHSLSLPFHIMSLLPATAFLVLFFTLKPQALPHSHAPSPVRLNDP---------RLR 232
L+ S PF + L FL F L H P+R R
Sbjct: 153 LMGGFSPHAPFFAAAALNGLNFLTGCFLLPE---SHKGERRPLRREALNPLASFRWARGM 209

Query: 233 RP----LLVAFSAMLSVTVSQIIVGFFALDRLHLGPAEAAQAAGIALTTVGVALMLAQVI 288
+ V F L V + F DR H GI+L G+ LAQ +
Sbjct: 210 TVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDAT----TIGISLAAFGILHSLAQAM 265

Query: 289 LRQL---EWPPLKMIRVGATVSALGFACGSLATTAPWLWACYFVAAAGMGFVFPAFSALA 345
+ + + +G G+ + AT + + A+G G PA A+
Sbjct: 266 ITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASG-GIGMPALQAML 324

Query: 346 ANAMQASEQGATAGSIGAAQGMGAVIGPLAGTLVYAL 382
+ + QG GS+ A + +++GPL T +YA
Sbjct: 325 SRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAA 361



Score = 37.1 bits (86), Expect = 1e-04
Identities = 39/129 (30%), Positives = 49/129 (37%), Gaps = 8/129 (6%)

Query: 271 AGIALTTVGVALMLAQVILRQLEWPPLKMIRVGATVSALGFACGSLATTAPWLWACYF-- 328
A AL A +L + R P L + GA V A TAP+LW Y
Sbjct: 50 ALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMA------TAPFLWVLYIGR 103

Query: 329 VAAAGMGFVFPAFSALAANAMQASEQGATAGSIGAAQGMGAVIGPLAGTLVYALDPRLPF 388
+ A G A A+ E+ G + A G G V GP+ G L+ P PF
Sbjct: 104 IVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPF 163

Query: 389 LAVAVLLLL 397
A A L L
Sbjct: 164 FAAAALNGL 172


38  PP_2930  PP_2947  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
PP_2930       -1        9   3.336535  L-serine dehydratase
PP_2931       -1       10   2.233143  hypothetical protein
PP_2932       -1       12   1.976521  amidase
PP_2933       -3        9  -0.131429  glutathione S-transferase
PP_2934       -2       11  -0.461943  hypothetical protein
PP_2935       -1       10  -1.939169  major facilitator superfamily transporter
PP_2936        0       12  -2.897454  ABC transporter ATP-binding protein
PP_2937       -1       20  -3.438518  integrase
PP_2938       -1       21  -2.893197  OsmC family protein
PP_2939       -1       20  -2.488096  hypothetical protein
PP_2940        0       20  -2.813727  hypothetical protein
PP_2941        0       20  -2.593562  hypothetical protein
PP_2942        0       20  -3.419692  response regulator
PP_2943        1       25  -3.805351  cytochrome c551 peroxidase
PP_2944        0       28  -4.240597  sensor histidine kinase
PP_2945        1       28  -4.508807  sensor histidine kinase/response regulator
PP_2946        1       35  -5.413915  peptidyl-tRNA hydrolase domain-containing
PP_2947        0       32  -4.154008  transcriptional regulator MvaT, P16 subunit
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_2935   TCRTETA  41     4e-06    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 41.3 bits (97), Expect = 4e-06
Identities = 80/369 (21%), Positives = 126/369 (34%), Gaps = 40/369 (10%)

Query: 18 QILSIVFYTFIAFLCIGLPIAVLPSYVHDQLGFGAVIA--GVTIGLQYLATLLSRPFAGR 75
++ I+ + + IGL + VLP + D + V A G+ + L L P G
Sbjct: 6 PLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGA 65

Query: 76 VADTLGGKQAIRFGLLGIAGCGVLTLLSAWTLTLPLLSLALLLGGRLLLGIAQGLIGVAT 135
++D G + + L G A V + A L +L + GR++ GI VA
Sbjct: 66 LSDRFGRRPVLLVSLAGAA---VDYAIMATAPFLWVLYI-----GRIVAGITGATGAVAG 117

Query: 136 LSWGISQVGPVHT-ARVISWNGIASYGAIAIGAPV--GVLAVDGLDFSVLGP-----ALL 187
I+ + AR + A +G + PV G++ FS P AL
Sbjct: 118 AY--IADITDGDERARHFGFMS-ACFGFGMVAGPVLGGLMG----GFSPHAPFFAAAALN 170

Query: 188 VLATLALLVLRKRPDVVVVRGERL----PFWSAFGRVAPCGLGLTLAS------IGYGTL 237
L L L R R P S + +A +G
Sbjct: 171 GLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPA 230

Query: 238 TTFVTLYYLERGWAGA--AWCLSAFGVCFIISRLLFVNAVNRFGGYNVAVAC-MATEVLG 294
+V W L+AFG+ +++ + V G A+ M + G
Sbjct: 231 ALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTG 290

Query: 295 LSLLWLAPSPPWALVGAGLTGFGLSLVYPALGVEAIKQVPSSSRGAGLGAYAVFFDMALA 354
LL A A L G + PAL +QV +G G+ A + +
Sbjct: 291 YILLAFATRGWMAFPIMVLLASG-GIGMPALQAMLSRQVDEERQGQLQGSLAALTSLT-S 348

Query: 355 IAGPVMGAV 363
I GP++
Sbjct: 349 IVGPLLFTA 357


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_2936   PF05272  31     0.017    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.8 bits (69), Expect = 0.017
Identities = 17/38 (44%), Positives = 21/38 (55%), Gaps = 7/38 (18%)

Query: 352 GPNGIGKTTLLRTLVG-----EMTPDAGSVKWTDSAEV 384
G GIGK+TL+ TLVG + D G+ K DS E
Sbjct: 603 GTGGIGKSTLINTLVGLDFFSDTHFDIGTGK--DSYEQ 638


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
PP_2942   HTHFIS  114    1e-29    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 114 bits (286), Expect = 1e-29
Identities = 36/136 (26%), Positives = 67/136 (49%), Gaps = 3/136 (2%)

Query: 12 RPTVLLVDDEESILSSLRRLLRGQPYDVKLATSGEQALAQMAEGPVDLVMSDARMPGMDG 71
T+L+ DD+ +I + L + L YDV++ ++ +A G DLV++D MP +
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 72 ATLLAQINQHHPSTVRILLTGYADPSAIIKAVNDGQIHRYISKPWNDDELLMTLRQALEH 131
LL +I + P ++++ IKA G + Y+ KP++ EL+ + +AL
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKG-AYDYLPKPFDLTELIGIIGRALAE 121

Query: 132 QHSERERQRLELLARR 147
+R +LE ++
Sbjct: 122 P--KRRPSKLEDDSQD 135


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
PP_2943   TYPE4SSCAGA  32     0.003    Type IV secretion system CagA exotoxin signature.
		>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature.

Length = 1147

Score = 32.4 bits (73), Expect = 0.003
Identities = 24/92 (26%), Positives = 41/92 (44%), Gaps = 10/92 (10%)

Query: 125 ESLEEQGDAVITSAHEMGGDWR------VIEQRIAADVRY---RQAFEDAYPDAVTKDNI 175
ESL+E+ +A + GGDW + +++ ++DV+ ++ PD T
Sbjct: 188 ESLKERQEAE-KNGEPTGGDWLDIFLSFIFDKKQSSDVKEAINQEPVPHVQPDIATTTTD 246

Query: 176 LSALADYQRTLLTPGARFDRYLQGDTEALTLE 207
+ L R LL F ++ GD E L +E
Sbjct: 247 IQGLPPEARDLLDERGNFSKFTLGDMEMLDVE 278


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
PP_2945   HTHFIS  55     4e-10    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 54.8 bits (132), Expect = 4e-10
Identities = 33/196 (16%), Positives = 62/196 (31%), Gaps = 48/196 (24%)

Query: 46 RILLIDDMPTIHEDFRKILAPARSQNSELDEMEGLLFGEQVKNERPVFELDSAYGGEEGL 105
IL+ DD I + L R +++
Sbjct: 5 TILVADDDAAIRTVLNQAL------------------------SRAGYDVRITSNAATLW 40

Query: 106 GLLKRALQASKPYALAFVDMRMPGGWDGAQTIEHLWEEDPLLQVVVCTAYSDY-SWDELL 164
+ A+ L D+ MP + + + + P L V+V +A + + + +
Sbjct: 41 RWI-----AAGDGDLVVTDVVMPDE-NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKAS 94

Query: 165 DRLQAHDRLLILKKPFDNIEVQQMASTLLTKWEMTQRASLKMHQLEQRVERRTQQLTQA- 223
A+D L KPFD E+ + RA + + ++E +Q
Sbjct: 95 -EKGAYDYLP---KPFDLTELI----------GIIGRALAEPKRRPSKLEDDSQDGMPLV 140

Query: 224 --SEALQQEIEERKQL 237
S A+Q+ +L
Sbjct: 141 GRSAAMQEIYRVLARL 156


39  PP_2960  PP_3033  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
PP_2960        2       23  -3.022829  AraC family transcriptional regulator
PP_2961        2       24  -3.112706  LysR family transcriptional regulator
PP_2962        3       26  -3.521858  zinc-containing alcohol dehydrogenase
PP_2963        4       23  -3.634229  hypothetical protein
PP_2964        4       21  -3.744271  Tn4652, transposase
PP_2965        4       22  -3.140194  Tn4652, tnpA repressor protein TnpC
PP_2966        1       18  -2.539315  hypothetical protein
PP_2967        1       19  -2.447277  hypothetical protein
PP_2968        1       21  -3.582273  hypothetical protein
PP_2969        0       31  -5.978415  hypothetical protein
PP_2970        0       30  -5.770894  hypothetical protein
PP_2971        1       30  -5.525304  IS1246 transposase
PP_2972        0       33  -6.100190  hypothetical protein
PP_2973        0       36  -6.340945  diacylglycerol kinase
PP_2974        0       34  -6.013581  hypothetical protein
PP_2975        0       31  -4.888380  transposase
PP_2976       -1       32  -4.247059  Tn4652, transposase subunit A
PP_2977        1       31  -4.099239  Tn4652, transposase subunit B
PP_2978        2       29  -3.746294  hypothetical protein
PP_2979        3       27  -3.182198  hypothetical protein
PP_2980        2       27  -3.470725  hypothetical protein
PP_2981        0       27  -3.166517  Tn4652, cointegrate resolution protein S
PP_2982        0       31  -3.760805  Tn4652, cointegrate resolution protein T
PP_2983       -1       30  -3.866184  hypothetical protein
PP_2984       -1       28  -3.275617  hypothetical protein
PP_2985       -1       31  -3.657468  hypothetical protein
PP_2986        0       28  -3.844722  oxidoreductase
PP_2987        0       28  -4.494564  hypothetical protein
PP_2988        0       28  -4.174309  zinc-containing alcohol dehydrogenase
PP_2989        2       29  -4.276410  short chain dehydrogenase/reductase
PP_2990        2       31  -4.759547  MerR family transcriptional regulator
PP_2991        2       29  -4.710322  hypothetical protein
PP_2992        2       23  -2.597643  hypothetical protein
PP_2993        2       18  -0.368081  hypothetical protein
PP_2994        1       15  -0.033279  NADH:flavin oxidoreductase
PP_2995        1       17   1.347516  DNA topology modulation kinase FlaR
PP_2996        1       15   2.655498  hypothetical protein
PP_2997        2       16   3.769684  LysR family transcriptional regulator
PP_2998        2       17   3.454609  2-dehydropantoate 2-reductase
PP_2999        1       15   2.262633  glyoxalase
PP_3000       -1       13   2.549964  acyl dehydratase MaoC
PP_3001        0       10   2.315977  CAIB/BAIF family protein
PP_3002       -2       12   1.341370  shikimate 5-dehydrogenase
PP_3003       -1       15  -0.569517  3-dehydroquinate dehydratase
PP_3004       -1       18  -0.840289  hypothetical protein
PP_3005        1       20  -0.890624  hypothetical protein
PP_3006        1       27  -3.663459  RNA polymerase sigma factor
PP_3007        1       32  -5.384595  hypothetical protein
PP_3008        1       42  -8.108373  hypothetical protein
PP_3009        1       39  -6.582185  hypothetical protein
PP_3010        1       37  -5.719366  hypothetical protein
PP_3011        1       26  -3.958418  hypothetical protein
PP_3012        2       22  -1.823874  hypothetical protein
PP_3013        1       21  -1.071995  hypothetical protein
PP_3014        1       17   0.401023  hypothetical protein
PP_3015        2       17   1.118445  medium chain acyl-CoA ligase
PP_3016        3       17   1.668374  lipopolysaccharide core biosynthesis protein
PP_3017        4       19   2.695152  methylated-DNA--protein-cysteine
PP_3018        4       20   2.553252  hypothetical protein
PP_3019        3       26  -2.177890  nitrilase/cyanide hydratase and apolipoprotein
PP_3020        4       33  -4.607229  serine/threonine protein phosphatase
PP_3021        3       38  -5.883373  amino acid transporter LysE
PP_3022        3       43  -6.889717  AraC family transcriptional regulator
PP_3023        2       44  -8.260002  amino acid efflux protein
PP_3024        1       32  -5.910796  hypothetical protein
PP_3025       -1       28  -3.439756  amino acid transporter LysE
PP_3026       -2       23  -2.432350  phage recombinase
PP_3027       -4       17  -0.952255  hypothetical protein
PP_3028       -2       21  -2.163710  hypothetical protein
PP_3029       -1       23  -2.784255  DNA cytosine methyltransferase
PP_3030        1       40  -6.136485  hypothetical protein
PP_3031        1       38  -6.862696  pyocin R2_PP, transcriptional activator prtN
PP_3032        2       43  -6.382448  DNA-binding protein Roi-like protein
PP_3033        1       29  -3.778761  transcriptional repressor pyocin R2_PP
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_2979   ACRIFLAVINRP  27     0.034    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 26.7 bits (59), Expect = 0.034
Identities = 11/38 (28%), Positives = 16/38 (42%), Gaps = 2/38 (5%)

Query: 24 FSCRHPQTPWLPKVIAMIVIAYALS--PIDLIPDFIPV 59
F R P W+ +I M+ A A+ P+ P P
Sbjct: 4 FFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPP 41


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
PP_2982   RTXTOXIND  37     1e-04    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 36.7 bits (85), Expect = 1e-04
Identities = 23/156 (14%), Positives = 58/156 (37%), Gaps = 4/156 (2%)

Query: 63 PLSEQLANLVGQLADQLEEDAQATVAQEREQLQRERLDYQNQARLAESRIQQLESQSSGL 122
P + ++ L ++ +T ++ Q + + + +RI + E+ S
Sbjct: 174 PYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVE 233

Query: 123 TEQLQAVQQALQQEQQQRQQTEVENARLAQANGDQEVRLQDRDSQIRSLEEKHQHARDAL 182
+L L ++ + + + +A + V SQ+ +E + A++
Sbjct: 234 KSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYK----SQLEQIESEILSAKEEY 289

Query: 183 EHYRQASKEQREQEQRRHESQVQQLQLELRQLQQTL 218
+ Q K + + R+ + L LEL + ++
Sbjct: 290 QLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQ 325



Score = 30.6 bits (69), Expect = 0.009
Identities = 23/193 (11%), Positives = 63/193 (32%), Gaps = 9/193 (4%)

Query: 77 DQLEEDAQATVAQER-EQLQRERLDYQNQARLAESRIQQLESQSSGLTEQLQAVQQALQQ 135
L +A Q Q + E+ YQ +R E Q + ++ L+
Sbjct: 128 TALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRL 187

Query: 136 EQQQRQQTEVENARLAQANGDQEVRLQDRDSQIRSLEEKHQHARDALEHYRQASKEQREQ 195
++Q + Q E+ L + ++ ++ + + + +
Sbjct: 188 TSLIKEQFSTWQNQKYQK----ELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSL 243

Query: 196 EQRRH--ESQVQQLQLELRQLQQTLIIKQDELTQLNRDNARLLTEARQLQKEQHAQQQLL 253
++ + V + + + + L + + +L Q+ L + Q + ++L
Sbjct: 244 LHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIES--EILSAKEEYQLVTQLFKNEIL 301

Query: 254 AQKNQAMEALQSV 266
+ Q + + +
Sbjct: 302 DKLRQTTDNIGLL 314



Score = 29.4 bits (66), Expect = 0.019
Identities = 25/189 (13%), Positives = 60/189 (31%), Gaps = 6/189 (3%)

Query: 142 QTEVENARLAQANGDQEVRLQDRDSQIRSLEEKHQHARD--ALEHYRQASKEQREQEQRR 199
E + + + + RS+E +++ S+E+ +
Sbjct: 131 GAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSL 190

Query: 200 HESQVQQLQLELRQLQQTLIIKQDELTQLNRDNARLLTEARQLQKEQHAQQQLLAQK--- 256
+ Q Q + Q + L K+ E + R +R + LL ++
Sbjct: 191 IKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIA 250

Query: 257 NQAMEALQSVLAGSERSNEALEQRCRTLQEEVSRLGEASATLAQQAQGLQ-ERLVEANTQ 315
A+ ++ + + + ++ E+ E + Q + ++L +
Sbjct: 251 KHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDN 310

Query: 316 LKLLRAPLA 324
+ LL LA
Sbjct: 311 IGLLTLELA 319


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_2986   NUCEPIMERASE  46     9e-08    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 46.3 bits (110), Expect = 9e-08
Identities = 27/127 (21%), Positives = 50/127 (39%), Gaps = 12/127 (9%)

Query: 13 VTGATGLLGNNLVRELVARGYTVKGL--------VRSKAKGEQQFNNLPGVELVVGDMAE 64
VTGA G +G ++ + L+ G+ V G+ V K + PG + D+A+
Sbjct: 5 VTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQ-PGFQFHKIDLAD 63

Query: 65 VDAFAA--SLQGCDTVFHTASFFRDNYKGGSHWKELEQINVSGTRRLLEQAYGAGIRRFI 122
+ + + VF + Y + N++G +LE I+ +
Sbjct: 64 REGMTDLFASGHFERVFISPHRLAVRYSLENPHA-YADSNLTGFLNILEGCRHNKIQHLL 122

Query: 123 HTSSIAV 129
+ SS +V
Sbjct: 123 YASSSSV 129


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_2989   DHBDHDRGNASE  77     7e-19    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.
Length = 261

Score = 77.0 bits (189), Expect = 7e-19
Identities = 52/185 (28%), Positives = 93/185 (50%), Gaps = 2/185 (1%)

Query: 7 VLITGASSGIGATYAERFARRGHDLILVARDTSRMEALALRLREESHVAVEVLPADLTSS 66
ITGA+ GIG A A +G + V + ++E + L+ E+ A E PAD+ S
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHA-EAFPADVRDS 69

Query: 67 ADLSVLESRL-RDDANIGVLINNAGMAQSGGFLDQSAEAIERLVTLNTTALTRLAAAIAP 125
A + + +R+ R+ I +L+N AG+ + G S E E ++N+T + + +++
Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSK 129

Query: 126 RLAQSGTGAIVNVGSVVGFAPEFGMSIYGATKAFVLFLSQGLSQELSPKGVYVQAVLPAA 185
+ +G+IV VGS P M+ Y ++KA + ++ L EL+ + V P +
Sbjct: 130 YMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGS 189

Query: 186 TRTEI 190
T T++
Sbjct: 190 TETDM 194


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
PP_2995   HTHFIS  27     0.030    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 27.5 bits (61), Expect = 0.030
Identities = 13/33 (39%), Positives = 19/33 (57%), Gaps = 3/33 (9%)

Query: 4 VMIIGQPGSGKSTLAR---KLGERTGLPVVHID 33
+MI G+ G+GK +AR G+R P V I+
Sbjct: 163 LMITGESGTGKELVARALHDYGKRRNGPFVAIN 195


40  PP_3045  PP_3050  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias    %GCBias  Product
PP_3045        2       25  -1.102365  ClpP protease
PP_3046        2       23  -0.709520  hypothetical protein
PP_3047        2       23  -0.618512  hypothetical protein
PP_3048        2       23   0.596403  hypothetical protein
PP_3049        2       21   1.383143  hypothetical protein
PP_3050        2       19   2.444136  hypothetical protein
41  PP_3095  PP_3117  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
PP_3095        2       21   1.362577  chaperone-associated ATPase
PP_3096        2       16   0.147806  hypothetical protein
PP_3097        3       15  -0.918742  hypothetical protein
PP_3098        2       12  -1.495063  hypothetical protein
PP_3099        2       13  -2.690250  hypothetical protein
PP_3100        1       25  -5.166472  hypothetical protein
PP_3101        1       24  -5.535909  ADP-ribosylglycohydrolase
PP_3102        1       23  -5.710478  hypothetical protein
PP_3103        0       13  -3.629630  hypothetical protein
PP_3104        1       13  -4.034134  hypothetical protein
PP_3105        1       12  -3.832800  hypothetical protein
PP_3106        1       12  -3.025365  hypothetical protein
PP_3107        1       13  -1.657239  hypothetical protein
PP_3108        1       13  -2.072851  rhs-like protein
PP_3109        1       24  -3.658784  hypothetical protein
PP_3110        3       23  -3.331577  hypothetical protein
PP_3111        2       20  -1.980151  hypothetical protein
PP_3112        1       18  -0.467594  hypothetical protein
PP_3113        1       16   0.713430  ISPpu13, transposase Orf1
PP_3114        1       13   2.125925  ISPpu13, transposase Orf2
PP_3115        1       13   3.552557  ISPpu13, transposase Orf3
PP_3116        2       11   3.856893  LexA repressor
PP_3117        2       11   3.448421  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_3104   SECA  29     0.040    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 28.7 bits (64), Expect = 0.040
Identities = 12/38 (31%), Positives = 18/38 (47%)

Query: 165 ASRGKPGSVSAARAMKGRLKLVRMGGNLYFEIPANSQA 202
A G P +V+ A M GR + +GG+ E+ A
Sbjct: 492 AQAGYPAAVTIATNMAGRGTDIVLGGSWQAEVAALENP 529


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_3108   PYOCINKILLER  53     8e-09    Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 52.9 bits (126), Expect = 8e-09
Identities = 27/87 (31%), Positives = 46/87 (52%), Gaps = 11/87 (12%)

Query: 1278 WSTARKNYWKAEAKAP--TRAYSPTNLVRMAEGKAPKMTVEVISRKTDKISIREYALELH 1335
W R+ +W A A P ++ ++P +L M +G AP R++++ R +E+H
Sbjct: 532 WRDFREQFWIAVANDPELSKQFNPGSLAVMRDGGAPY------VRESEQAGGRI-KIEIH 584

Query: 1336 HNDIPQRVGGAGVHDSSNLLALTPWEH 1362
H + GG GV++ NL+A+TP H
Sbjct: 585 H-KVRVADGG-GVYNMGNLVAVTPKRH 609


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
PP_3112   ALARACEMASE  29     0.009    Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 28.6 bits (64), Expect = 0.009
Identities = 16/70 (22%), Positives = 26/70 (37%), Gaps = 1/70 (1%)

Query: 55 VLVAEELARPKPDPLPYLTGLQRLGVEAGQALAFEDSLPGTAAASGAGIFTVGVATTQTP 114
L L P L +G+ RLG + + L L A + + A + P
Sbjct: 108 ALQNARLKAPLDIYLKVNSGMNRLGFQPDRVLTVWQQLRAMANVGEMTLMS-HFAEAEHP 166

Query: 115 ERLLAAGARL 124
+ + A AR+
Sbjct: 167 DGISGAMARI 176


42  PP_3216  PP_3251  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
PP_3216       -3       13   3.639345  hypothetical protein
PP_3217       -3       13   3.907120  aliphatic sulfonate ABC transporter
PP_3218       -2       15   3.731448  NtaA/SnaA/SoxA family monooxygenase
PP_3219       -1       16   3.844779  alkansulfonate monooxygenase
PP_3220        0       17   4.097178  ABC transporter ATP-binding protein
PP_3221        0       16   4.285045  ABC transporter permease
PP_3222       -1       15   3.418132  ABC transporter permease
PP_3223       -2       15   3.561760  ABC transporter substrate-binding protein
PP_3224       -2       13   3.807437  aldolase
PP_3225       -2       13   4.137746  D-isomer specific 2-hydroxyacid dehydrogenase
PP_3226       -3       11   2.904047  acyl-CoA dehydrogenase
PP_3227       -2       11   1.892234  LysR family transcriptional regulator
PP_3228       -2       14   1.649053  aliphatic sulfonate ABC transporter
PP_3229        0       15   0.329236  aliphatic sulfonate ABC transporter
PP_3230        2       18  -0.310657  phosphoribosyl transferase domain-containing
PP_3231        1       17  -0.719674  hypothetical protein
PP_3232        1       19  -0.239268  acetyltransferase
PP_3233        1       19  -0.094041  Crp/Fnr family transcriptional regulator
PP_3234        1       14   0.123439  heat shock protein 20
PP_3235        2       16   0.636624  hypothetical protein
PP_3236        1       14  -0.165565  lipoprotein OprI
PP_3237       -2       14   0.133805  universal stress protein
PP_3238       -2       17  -0.050538  transcriptional regulator PyrR
PP_3239       -1       19   0.245402  Tn4652, cointegrate resolution protein T
PP_3240       -2       20   0.036113  hypothetical protein
PP_3241       -1       19   0.208583  hypothetical protein
PP_3242        1       19   0.730550  diguanylate cyclase
PP_3243        2       16   1.339807  acetyltransferase
PP_3244        2       12   1.344708  magnesium transporter MgtC family protein
PP_3245        3       13   1.321685  hypothetical protein
PP_3246        1       11   1.172156  fatty acid hydroxylase
PP_3247        0       11   1.240305  bile acid/Na+ symporter family protein
PP_3248        0       13   1.532144  Dyp-type peroxidase
PP_3249        1       14   0.770031  aldo/keto reductase
PP_3250        0       16   0.430298  major facilitator superfamily transporter
PP_3251        2       17   0.606741  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
PP_3239	GPOSANCHOR	35	3e-04	Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 35.0 bits (80), Expect = 3e-04
Identities = 30/250 (12%), Positives = 75/250 (30%), Gaps = 5/250 (2%)

Query: 77 TQLQDEADLKIEQAESTFTQQREQLEAQLEIARQALAAAHQQHKIDAAALAAETEKLLST 136
+ + + +E A + T +++ LE + ALAA + + +
Sbjct: 119 EARKADLEKALEGAMNFSTADSAKIK-TLEAEKAALAARKADLEKALEGAMNFSTADSAK 177

Query: 137 QSTLQAEQLRSASLNQSLGELQVRLADKDEQVKSLEDKHRHARDALEHYRNASREQREQE 196
TL+AE+ + L + + + + AL + + E
Sbjct: 178 IKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEG- 236

Query: 197 QRRHEAQLQQMQVELRQLQQGMIVKQDELTRLHRDNERLLGEHRQAASECRAQDELLEQR 256
+++ L+ + L + E + +++ + +
Sbjct: 237 ---AMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAAL 293

Query: 257 DAQIQGLRTILAQAQGASEEMRRQLDVQAQSLEARRDVCAEQARQLQWLEEQLKARDEAL 316
+A+ L + +RR LD ++ + + Q + E ++ L
Sbjct: 294 EAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDL 353

Query: 317 AQCRAQLASV 326
R +
Sbjct: 354 DASREAKKQL 363



Score = 30.0 bits (67), Expect = 0.014
Identities = 31/213 (14%), Positives = 77/213 (36%), Gaps = 1/213 (0%)

Query: 78 QLQDEADLKIEQAESTFTQQREQLEAQLEIARQALAAAHQQHKIDAAALAAETEKLLSTQ 137
+ T ++ L A+ +AL A D+A + + + +
Sbjct: 200 EGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALE 259

Query: 138 STLQAEQLRSASLNQSLGELQVRLADKDEQVKSLEDKHRHARDALEHYRNASREQREQEQ 197
+ + ++ + + +LE + + NA+R+ ++
Sbjct: 260 ARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQ-VLNANRQSLRRDL 318

Query: 198 RRHEAQLQQMQVELRQLQQGMIVKQDELTRLHRDNERLLGEHRQAASECRAQDELLEQRD 257
+Q++ E ++L++ + + L RD + +Q +E + +E + +
Sbjct: 319 DASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISE 378

Query: 258 AQIQGLRTILAQAQGASEEMRRQLDVQAQSLEA 290
A Q LR L ++ A +++ + L+ L A
Sbjct: 379 ASRQSLRRDLDASREAKKQVEKALEEANSKLAA 411


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
PP_3243	SACTRNSFRASE	31	0.001	Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 31.5 bits (71), Expect = 0.001
Identities = 14/60 (23%), Positives = 30/60 (50%), Gaps = 2/60 (3%)

Query: 77 LHLHEISVRQEAQGQGVGRRLLQQVVDAGRCAGVRELTL-TTFVDVPWNAPFYARFGFEM 135
+ +I+V ++ + +GVG LL + ++ + L L T +++ FYA+ F +
Sbjct: 90 ALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINIS-ACHFYAKHHFII 148


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
PP_3250	TCRTETA	53	1e-09	Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 52.5 bits (126), Expect = 1e-09
Identities = 70/328 (21%), Positives = 117/328 (35%), Gaps = 21/328 (6%)

Query: 21 AVIAGLLLFYLLFTGYFMLRPVRETMGVAGGVENLQWLFTGTFIATLA-----CLPLFGW 75
+I L L G ++ PV + N G +A A C P+ G
Sbjct: 6 PLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGA 65

Query: 76 LASRVQRRHILPWTYGFFASNLLLFAALLAGNPDDLWTARAFYIWLSVFNLLTISLAWSV 135
L+ R RR +L + A + + A A L+ R ++ T ++A +
Sbjct: 66 LSDRFGRRPVLLVSLAGAAVDYAIMA--TAPFLWVLYIGRI----VAGITGATGAVAGAY 119

Query: 136 LADLFSTAQGKRLFGLLAAGASLGGLSGPVLGTLLVAPLGHAGLLVLAAVLLLGSIGATL 195
+AD+ + R FG ++A G ++GPVLG L+ HA AA+ L +
Sbjct: 120 IADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCF 179

Query: 196 FLQRWRARQPIAMQTEHQGSRPLGGNPFTGASAVLRSPYLLGIALFVVLLASVSTFLYFE 255
L P + + E + R NP + +A + + +
Sbjct: 180 LL-------PESHKGERRPLRREALNPLASFRWAR---GMTVVAALMAVFFIMQLVGQVP 229

Query: 256 QARIVSETFTDRTRQTQVFGLIDTVVQALAILTQVFLTGRLARRLGVGVLLVAVPLIMAA 315
A V G+ L L Q +TG +A RLG L+ +
Sbjct: 230 AALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGT 289

Query: 316 GFLWLALAPVFALFVVVMVVRRAGEYAL 343
G++ LA A + +MV+ +G +
Sbjct: 290 GYILLAFATRGWMAFPIMVLLASGGIGM 317


43	PP_3292	PP_3318	Y	Y	N	Pathogenicity Island (biased-composition)
LocusTag	DNBias	CDNBias	%GCBias	Product
PP_3292	-1	26	-3.242168	hypothetical protein
PP_3293	-1	30	-3.865890	hypothetical protein
PP_3294	-2	37	-3.752489	universal stress protein
PP_3295	0	39	-4.863690	hypothetical protein
PP_3296	0	39	-4.027499	hypothetical protein
PP_3297	-1	30	-3.753555	hypothetical protein
PP_3298	-1	30	-2.706935	CinA domain-containing protein
PP_3299	-1	30	-2.926938	outer membrane lipoprotein
PP_3300	0	30	-3.896780	TetR family transcriptional regulator
PP_3301	-1	30	-3.486554	RND efflux membrane fusion protein
PP_3302	1	28	-3.825805	RND efflux transporter
PP_3303	1	33	-3.822556	3-oxoacyl-ACP synthase
PP_3304	0	30	-4.229941	Bcr/CflA family multidrug resistance
PP_3305	0	20	-2.000500	TerC family membrane protein
PP_3306	-1	16	-0.008470	hypothetical protein
PP_3307	-1	16	-0.009832	hypothetical protein
PP_3308	-1	17	-0.016057	oxidoreductase, small subunit
PP_3309	-2	17	-0.115029	oxidoreductase, molybdopterin-binding subunit
PP_3310	0	18	-0.513672	oxidoreductase, large subunit
PP_3311	2	20	-0.173776	glutathione-regulated potassium-proton
PP_3312	5	20	-0.017288	heat shock protein
PP_3313	5	20	-0.114608	heat shock protein
PP_3314	4	22	-2.217300	heat shock protein 20
PP_3315	3	21	-2.413177	hypothetical protein
PP_3316	2	20	-2.448955	chaperone-associated ATPase
PP_3317	1	24	-4.099132	hypothetical protein
PP_3318	0	21	-3.700401	hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
PP_3299	RTXTOXIND	29	0.037	Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 29.4 bits (66), Expect = 0.037
Identities = 30/194 (15%), Positives = 56/194 (28%), Gaps = 38/194 (19%)

Query: 65 GDPL--LSRLVTEALGQNLQLAQAQARVAQARAALGSATAALVPSAGINGQAARSRQSVE 122
GD L L+ L EA Q + QAR+ Q R Q +
Sbjct: 121 GDVLLKLTALGAEADTLKTQSSLLQARLEQTRY-----------------QILSRSIELN 163

Query: 123 TPLGQLLNSTPDYDRYGNSYELNLQASWEIDLFGGLRRDRQAAVGEYQASEAGAIATRLA 182
L P + L R ++ + L
Sbjct: 164 KLPELKLPDEPYFQNVSEEEVL---------------RLTSLIKEQFSTWQNQKYQKELN 208

Query: 183 VAAQTADIYTTVRGLQARLAIAQNQVKTQQDLLAKVNLLNRKGLAPDYEVRQTEGELSQV 242
+ + A+ + AR+ +N + ++ L + L K + V + E + +
Sbjct: 209 LDKKRAERL----TVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEA 264

Query: 243 EATVPVLRAGLDAA 256
+ V ++ L+
Sbjct: 265 VNELRVYKSQLEQI 278


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
PP_3300	HTHTETR	61	8e-14	TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 61.2 bits (148), Expect = 8e-14
Identities = 27/197 (13%), Positives = 62/197 (31%), Gaps = 12/197 (6%)

Query: 17 DVRDQIIQAAMEHFAHYGYDKTTVSDLAKSIGFSKAYIYKFFESKQAIGEVICSSRLALI 76
+ R I+ A+ F+ G T++ ++AK+ G ++ IY F+ K + I + I
Sbjct: 11 ETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNI 70

Query: 77 MQRIATATSDAPTASEKLRRLFRAIAEGGADLFFHDRKLYDIAAVASRDQ-----WSSVK 131
+ + P + R I + + + + + + V+
Sbjct: 71 GELELEYQAKFP---GDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQ 127

Query: 132 SHEANIS----KVILEILTQGRDAGEFERKTPLDELTLAIFLIMRPYVNAALLQHNLDTL 187
+ N+ I + L +A + + + + L L
Sbjct: 128 QAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDL 187

Query: 188 QDAVVQLPALILRSLAP 204
+ A++L
Sbjct: 188 KKEARDYVAILLEMYLL 204


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
PP_3301	RTXTOXIND	45	3e-07	Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 44.8 bits (106), Expect = 3e-07
Identities = 15/104 (14%), Positives = 36/104 (34%), Gaps = 7/104 (6%)

Query: 67 VSGKILQRLVDTGQTVKRGQPLMRMDPVDLN-----LQARAQQEAVTAARARAKQTG--D 119
+ + + +V G++V++G L+++ + Q+ Q + R +
Sbjct: 103 ENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIEL 162

Query: 120 DEARYRGLVADGAVSASSYDQIKAAADAAKAQLSAAQAQADVAR 163
++ L + S +++ K Q S Q Q
Sbjct: 163 NKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKE 206


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
PP_3302	ACRIFLAVINRP	449	e-143	Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 449 bits (1157), Expect = e-143
Identities = 233/1045 (22%), Positives = 432/1045 (41%), Gaps = 59/1045 (5%)

Query: 8 LSALAVRERSITLFLIVLIAFAGTLAFFKLGRAEDPPFTVKQMTIITAWPGATAQEMQDL 67
++ +R L +++ AG LA +L A+ P +++ +PGA AQ +QD
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 68 VAEPLEKRMQELRWYDRTETYT-RPGLAFTMVSLQDKTPPSAVQEEFYQARKKAGDQAKL 126
V + +E+ M + + + G ++ Q T P Q + + A L
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLA---TPL 117

Query: 127 MPAGVIGPML-NDEFSDVTFAVYALKA-KGEPQRQLVRD--AETLRQQLLHVPGVKKVNI 182
+P V + ++ S V + + + D A ++ L + GV V +
Sbjct: 118 LPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQL 177

Query: 183 IGEQ-AERIFVSFSHDRLATLGITPQDIFSALDNQNALSPSGSVET------QGPQVVVR 235
G Q A RI++ D L +TP D+ + L QN +G + Q +
Sbjct: 178 FGAQYAMRIWLDA--DLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASII 235

Query: 236 VDGAFDQLAKIRETPVVAQ--GRPLKLSDVADVERGYEDPATFLVRNDGEPALLLGIVMR 293
F + + + G ++L DVA VE G E+ R +G+PA LGI +
Sbjct: 236 AQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVI-ARINGKPAAGLGIKLA 294

Query: 294 EGWNGLDLGKALEAETAKINEGMPLGMTLSKVTDQAVNITSSVDEFMIKFFVALLVVMLV 353
G N LD KA++A+ A++ P GM + D + S+ E + F A+++V LV
Sbjct: 295 TGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLV 354

Query: 354 CFLSMG-WRVGVVVAAAVPLTLAIVFVVMAATGKNFDRITLGSLILALGLLVDDAIIAIE 412
+L + R ++ AVP+ L F ++AA G + + +T+ ++LA+GLLVDDAI+ +E
Sbjct: 355 MYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVE 414

Query: 413 MMV-VKMEEGYDRIKASAYAWSHTAAPMLSGTLVTAIGFMPNGFAQSTAGEYTSNMFWIV 471
+ V ME+ +A+ + S ++ +V + F+P F + G +
Sbjct: 415 NVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITI 474

Query: 472 GIALIASWVVAVAFTPYLGVKLL----PRIKTIEGGHAAIYNTRHY---NRFRALLGWVI 524
A+ S +VA+ TP L LL +GG +NT N + +G ++
Sbjct: 475 VSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKIL 534

Query: 525 AHKWLVAGTVVSTFVAAVLGMGLVKKQFFPTSDRPEVLVELQMPYGTSIEQTNATAIKVE 584
V+ + F P D+ L +Q+P G + E+T +V
Sbjct: 535 GSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVT 594

Query: 585 SWLRQQEEAKIVTTYIGQGPPRFFLAMAPELPDPSFAKIVV--LTENQGARE---ALKHR 639
+ + E+A + + + G + + + + A + + E G A+ HR
Sbjct: 595 DYYLKNEKANVESVFTVNG-----FSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHR 649

Query: 640 LREAASE-----GLAPGAQVRVTQLVFGPYSPYPVAYRVMGPDASQ--LRQIAARVQSVL 692
+ + + V + + +G DA Q+
Sbjct: 650 AKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAA--- 706

Query: 693 QASPMMKTVNTDWGPLVPTLHFSLNQDRLQSVGLTSASVSQQLQFLLTGVPITSVREDIR 752
Q + +V + ++Q++ Q++G++ + ++Q + L G + + R
Sbjct: 707 QHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGR 766

Query: 753 SVQVVGRAAGQIRLDPAQIENFTLVGSNGQRVPVSQIGDVSIRMEDPILRRRDRTPTMTV 812
++ +A + R+ P ++ + +NG+ VP S P L R + P+M +
Sbjct: 767 VKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEI 826

Query: 813 RGDIAEGLQPPDVSTAIWKDLQPIVTQLPAGYKIEMAGSIEESAKASQAIVPLLPIMIAL 872
+G+ A G D + + + ++LPAG + G + + L+ I +
Sbjct: 827 QGEAAPGTSSGDAMALM----ENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVV 882

Query: 873 TLLIIILQVRSISAMVMVFLTSPLGLIGVVPVLLLFGQPFGINALVGLIALSGILMRNTL 932
L + S S V V L PLG++GV+ LF Q + +VGL+ G+ +N +
Sbjct: 883 VFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAI 942

Query: 933 ILIGQIDHNQL-EGLAPFDAVVEATVQRARPVLLTALAAILAFIPLTHSVFWGT-----L 986
+++ EG +A + A R RP+L+T+LA IL +PL S G+ +
Sbjct: 943 LIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAV 1002

Query: 987 AYTLIGGTFVGTIMTLVFLPAMYSI 1011
++GG T++ + F+P + +
Sbjct: 1003 GIGVMGGMVSATLLAIFFVPVFFVV 1027



Score = 86.8 bits (215), Expect = 2e-19
Identities = 61/320 (19%), Positives = 130/320 (40%), Gaps = 14/320 (4%)

Query: 712 LHFSLNQDRLQSVGLTSASVSQQLQF----LLTGVPITSVREDIRSVQVVGRAAGQIRLD 767
+ L+ D L LT V QL+ + G + + + A + + +
Sbjct: 184 MRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK-N 242

Query: 768 PAQIENFTL-VGSNGQRVPVSQIGDVSIRMED-PILRRRDRTPTMTVRGDIAEGLQPPDV 825
P + TL V S+G V + + V + E+ ++ R + P + +A G D
Sbjct: 243 PEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALDT 302

Query: 826 STAIWKDLQPIVTQLPAGYKIEMAGSIEESAKAS-QAIVPLLPIMIALTLLIIILQVRSI 884
+ AI L + P G K+ + S +V L I L L++ L ++++
Sbjct: 303 AKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNM 362

Query: 885 SAMVMVFLTSPLGLIGVVPVLLLFGQPFGINALVGLIALSGILMRNTLILIGQIDHNQLE 944
A ++ + P+ L+G +L FG + G++ G+L+ + ++++ ++ +E
Sbjct: 363 RATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMME 422

Query: 945 -GLAPFDAVVEATVQRARPVLLTALAAILAFIPL-----THSVFWGTLAYTLIGGTFVGT 998
L P +A ++ Q ++ A+ FIP+ + + + T++ +
Sbjct: 423 DKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSV 482

Query: 999 IMTLVFLPAMYSIWFKIRPN 1018
++ L+ PA+ + K
Sbjct: 483 LVALILTPALCATLLKPVSA 502


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
PP_3304	TCRTETB	74	2e-16	Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 73.8 bits (181), Expect = 2e-16
Identities = 74/399 (18%), Positives = 148/399 (37%), Gaps = 53/399 (13%)

Query: 18 RANVLTAKVILLLAALAAISNLSTNIILPAFPEMARQLNVSSQELGLTLSSFFITFAFAQ 77
++N+ ++++ L L+ S L+ ++ + P++A N ++F +TF+
Sbjct: 7 QSNLRHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGT 66

Query: 78 LLVGPLADRYGRKRLVVGGLMIFVVGTFWAA-NAATLDMLILGRVIQAIGVCAAAVLARA 136
+ G L+D+ G KRL++ G++I G+ + +LI+ R IQ G A L
Sbjct: 67 AVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMV 126

Query: 137 IARDLYEGENLARALSLTMIAAATAPGFSPLIGSMLNTTLGWRALFIAVGMSAILIALFY 196
+ EN +A L A G P IG M+ + W L + +I + +
Sbjct: 127 VVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLI--PMITIITVPF 184

Query: 197 LRGIGETLPAHRRVTQSVPAILIAYG---------------------------------- 222
L + + + IL++ G
Sbjct: 185 LMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVT 244

Query: 223 ------KLASNRLFILPALATSLLMSGLFASFAAAPSILMEGMGLSSLQVG--LYFAATV 274
L N F++ L ++ + + P ++ + LS+ ++G + F T+
Sbjct: 245 DPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTM 304

Query: 275 FVVFAAGLAAPRLAHRWGSRAITLSGLATACTAGALLLVGPSNPSLGWYSLSMVLFLWG- 333
V+ G L R G + + + + L + W+ +++F+ G
Sbjct: 305 SVII-FGYIGGILVDRRGPLYVLN--IGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGG 361

Query: 334 -MGIANPLGTALTMTPFGKEAGLASALL---GFLTMAIG 368
+ T ++ + +EAG +LL FL+ G
Sbjct: 362 LSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTG 400


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
PP_3316	HTHFIS	34	0.002	FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 34.4 bits (79), Expect = 0.002
Identities = 40/186 (21%), Positives = 63/186 (33%), Gaps = 34/186 (18%)

Query: 614 TVEEREKLLHLEQRLHER----------LVGQDEAVRAV-ADAVRLSRAGLREGSKPVAT 662
+ + L +R + LVG+ A++ + RL + T
Sbjct: 111 LIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTD--------LT 162

Query: 663 FLFLGSTGVGKTELAKALAETIYGDESALLRIDMSEYGERHSVARLVGAPPGYVGYDEGG 722
+ G +G GK +A+AL + + I+M+ + L G E G
Sbjct: 163 LMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGH--------EKG 214

Query: 723 QLTEKVRRKPYSV-------LLLDEIEKAHADVYNILLQVFDDGRLTDGKGRVVDFTNTI 775
T R L LDEI D LL+V G T GR ++
Sbjct: 215 AFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVR 274

Query: 776 IIATSN 781
I+A +N
Sbjct: 275 IVAATN 280


44	PP_3331	PP_3354	Y	Y	N	Pathogenicity Island (biased-composition)
LocusTag	DNBias	CDNBias	%GCBias	Product
PP_3331	2	12	2.719670	hypothetical protein
PP_3332	3	14	0.805531	cytochrome c-type protein
PP_3335	0	10	0.275348	hypothetical protein
PP_3336	1	11	0.437598	hypothetical protein
PP_3337	0	13	0.975328	hypothetical protein
PP_3338	0	13	1.297409	ubiquinol oxidase subunit II
PP_3339	0	13	1.800451	hypothetical protein
PP_3340	0	13	1.938770	TonB-dependent receptor
PP_3341	4	12	3.311452	nickel responsive regulator
PP_3342	4	13	3.196854	nickel ABC transporter substrate-binding
PP_3343	5	13	2.364525	nickel ABC transporter permease
PP_3344	4	14	2.623611	nickel transporter permease NikC
PP_3345	0	12	2.047050	nickel transporter ATP-binding protein NikD
PP_3346	-1	13	1.736497	nickel transporter ATP-binding protein NikE
PP_3347	-1	11	1.609022	hypothetical protein
PP_3348	-2	11	2.076626	diguanylate cyclase
PP_3349	-1	11	3.307667	3-hydroxyphenylpropionic transporter MhpT
PP_3350	-1	11	3.600862	hypothetical protein
PP_3351	-1	12	4.017767	hypothetical protein
PP_3352	-1	12	4.408367	arylsulfatase
PP_3353	-1	12	4.278367	hypothetical protein
PP_3354	-1	11	3.834355	acyl-CoA dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
PP_3349	TCRTETB	61	3e-12	Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 60.7 bits (147), Expect = 3e-12
Identities = 76/336 (22%), Positives = 126/336 (37%), Gaps = 15/336 (4%)

Query: 15 IGLCFLVALLEGLDLQATGIAAPHMAKAFNLTPAMLGWVFSAGLLGLLPGALIGGWLADR 74
I LC L L+ ++ P +A FN PA WV +A +L G + G L+D+
Sbjct: 17 IWLCILS-FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ 75

Query: 75 FGRKAILIVAVLLFGGFSLGTAHAQTYDSLLI-ARLMTGLGLGAALPILIALA-SEAAPE 132
G K +L+ +++ S+ ++ SLLI AR + G G AA P L+ + + P+
Sbjct: 76 LGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAG-AAAFPALVMVVVARYIPK 134

Query: 133 RLRSTAVSLTYCGVPLGGAVASLI-GMAGVGDGWRTVFYVGGIAPIVIAFVLMIWLKE-- 189
R A L V +G V I GM W + + I I + F++ + KE
Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVR 194

Query: 190 -SQAFRAQGVAKAGSEGVLAQLFGPQQASRTLLLWVACFFTLTVLYMLLNWLPSLLIGQG 248
F +G+ S G++ + S + L+ F + V ++ P + G G
Sbjct: 195 IKGHFDIKGIILM-SVGIVFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLG 253

Query: 249 FSRPQAGAVQILFNLGGAAGSF--LTGRMMDRGFAGRAVLIAYAGMLASLAGLGLSSSFG 306
+ P V + G F + MM I + + + G
Sbjct: 254 KNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIG 313

Query: 307 LMLLAGFTAGYCAIGGQLVL----YALAPTLYSTQV 338
+L+ Y G L + L +T
Sbjct: 314 GILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSW 349


45	PP_3392	PP_3397	Y	N	N	Genomic Island
LocusTag	DNBias	CDNBias	%GCBias	Product
PP_3392	3	11	1.153763	anhydrase 3 protein
PP_3393	2	10	1.298874	CAIB/BAIF family protein
PP_3394	3	10	1.897731	3-hydroxy-3-methylglutaryl-CoA lyase
PP_3395	3	11	1.704724	LysR family transcriptional regulator
PP_3396	4	12	1.743832	hypothetical protein
PP_3397	3	11	1.656259	hypothetical protein
46	PP_3462	PP_3484	Y	Y	N	Pathogenicity Island (biased-composition)
LocusTag	DNBias	CDNBias	%GCBias	Product
PP_3462	-1	11	3.057117	hypothetical protein
PP_3463	0	10	3.237340	aldehyde dehydrogenase
PP_3464	0	11	1.050469	hypothetical protein
PP_3465	-1	19	-1.221033	hypothetical protein
PP_3466	-2	18	-1.528562	ABC transporter ATP-binding protein
PP_3467	-1	19	-2.763336	Fis family transcriptional regulator
PP_3468	0	28	-6.270525	hypothetical protein
PP_3469	1	25	-5.499319	hypothetical protein
PP_3471	-2	19	-2.574976	hypothetical protein
PP_3472	-1	20	1.574067	curli production protein CsgG
PP_3473	-1	23	1.914516	curli fiber protein CsgF
PP_3474	-2	18	2.006953	curli assembly protein CsgE
PP_3475	-2	17	2.299053	hypothetical protein
PP_3476	-1	17	2.578937	type II secretion system protein G
PP_3477	0	17	3.169927	hypothetical protein
PP_3478	0	16	2.958937	type II and III secretion system protein
PP_3479	0	16	2.684336	hypothetical protein
PP_3480	0	14	2.474856	hypothetical protein
PP_3481	0	12	3.450243	hypothetical protein
PP_3482	-1	11	3.288073	hypothetical protein
PP_3483	-2	10	2.776212	type II secretion system protein E
PP_3484	-2	19	3.040716	response regulator receiver protein
ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
PP_3467	HTHFIS	318	e-104	FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 318 bits (816), Expect = e-104
Identities = 130/370 (35%), Positives = 189/370 (51%), Gaps = 54/370 (14%)

Query: 306 RALQLPRHG-RVTPSTPSSKPALSKQSPALDALAGGDQRLARNLRMARQGLGNGLPVLLL 364
RAL P+ L +S A+ + R+ + + L +++
Sbjct: 117 RALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEI----------YRVLARLMQTDLTLMIT 166

Query: 365 GETGTGKEVVARALHQASPRADKPFVAVNCAAIPEGLIESELFGYREGAFTGSRRGGMVG 424
GE+GTGKE+VARALH R + PFVA+N AAIP LIESELFG+ +GAFTG++ G
Sbjct: 167 GESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRST-G 225

Query: 425 RLMQAHGGTLFLDEIGDMPLALQARLLRVLQERRVAPLGAGDEQDIDVALICATHRDLKR 484
R QA GGTLFLDEIGDMP+ Q RLLRVLQ+ +G DV ++ AT++DLK+
Sbjct: 226 RFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQ 285

Query: 485 LVQEQHFREDLFYRVNGVSLRLPALRER-DDLAAIIQGLLDKADARGV---TLDPALTAL 540
+ + FREDL+YR+N V LRLP LR+R +D+ +++ + +A+ G+ D L
Sbjct: 286 SINQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALEL 345

Query: 541 LEGFDWPGNIRQLEMVVRTALAMREDGEQVLTLDHLTDCLLDELASSAAPSG-------- 592
++ WPGN+R+LE +VR A+ V+T + + + L E+ S
Sbjct: 346 MKAHPWPGNVRELENLVRRLTALYPQD--VITREIIENELRSEIPDSPIEKAAARSGSLS 403

Query: 593 ----------------------------SLKDSELEQIRGALARHQGNVSAAAAALGISR 624
L + E I AL +GN AA LG++R
Sbjct: 404 ISQAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNR 463

Query: 625 ATLYRKLKQL 634
TL +K+++L
Sbjct: 464 NTLRKKIREL 473


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
PP_3476	BCTERIALGSPG	66	4e-17	Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 66.4 bits (162), Expect = 4e-17
Identities = 40/142 (28%), Positives = 61/142 (42%), Gaps = 25/142 (17%)

Query: 4 AMKRSKGFTLIELLVVMAIIATLMTIAMPRYFNSLESSREATLRQSLAVLREALDHYYGD 63
A + +GFTL+E++VV+ II L ++ +P + E + + + L ALD Y D
Sbjct: 3 ATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLD 62

Query: 64 TGHYPDS---LEQLVEQRYL----RNTPVDPITER--SDAW----QLVPP---------- 100
HYP + LE LVE L N + +R +D W LV P
Sbjct: 63 NHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDLLS 122

Query: 101 --PEGVAGGVADIKSGATGRAR 120
P+G G DI + + +
Sbjct: 123 AGPDGEMGTEDDITNWGLSKKK 144


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
PP_3477	BCTERIALGSPG	49	4e-10	Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 48.7 bits (116), Expect = 4e-10
Identities = 28/113 (24%), Positives = 53/113 (46%), Gaps = 15/113 (13%)

Query: 1 MSARRRMQGFSLIEVVLTLALLGLLASMAAPLTETVVRRGKEQQLREALYQIRDAIDAYK 60
M A + +GF+L+E+++ + ++G+LAS+ P + +Q+ + + +A+D YK
Sbjct: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYK 60

Query: 61 RAFDAGYIEKRVDSSGYPPNLQVLVEGVRDVRSAKGAKFY----FLRRIPHDP 109
+D+ YP Q L V A Y +++R+P DP
Sbjct: 61 -----------LDNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADP 102


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
PP_3478	BCTERIALGSPD	144	2e-38	Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 144 bits (364), Expect = 2e-38
Identities = 74/321 (23%), Positives = 139/321 (43%), Gaps = 29/321 (9%)

Query: 303 DERLNTLTMRDTSDAVRMAEKLLQSQDQSNPEVVLEVEVMEVATSRILDLGLQWPNTFGV 362
+ N L + D + E+++ D P+V++E + EV + L+LG+QW N
Sbjct: 315 HGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWANKNAG 374

Query: 363 LSDDGN---------------------PVSVLDQLKGINSSRISIA-PAPQAKINA--QD 398
++ N S+ L N + A
Sbjct: 375 MTQFTNSGLPISTAIAGANQYNKDGTVSSSLASALSSFNGIAAGFYQGNWAMLLTALSSS 434

Query: 399 KDINTLASPVIRVSNREQARIHIGQRVPIISATSVPSTQGPVITESVTYLDVGLKLEVQP 458
+ LA+P I + +A ++GQ VP+++ + +T G I +V VG+KL+V+P
Sbjct: 435 TKNDILATPSIVTLDNMEATFNVGQEVPVLTGSQ--TTSGDNIFNTVERKTVGIKLKVKP 492

Query: 459 TVHLNNEVAIKVALEVSNATPLEATRQGTIPVQVDTRNAQTSLRLHDGETQVLAGLVRND 518
++ + V +++ EVS+ ++ + +TR ++ + GET V+ GL+
Sbjct: 493 QINEGDSVLLEIEQEVSSVADAASSTSSDLGATFNTRTVNNAVLVGSGETVVVGGLLDKS 552

Query: 519 HNASGNKIPGLGDIPGLGRLFGSNKDDMSKSELVLAITPRIVRNL-PYQSPSDMEFSTGT 577
+ + +K+P LGDIP +G LF S +SK L+L I P ++R+ Y+ S +++
Sbjct: 553 VSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRDEYRQASSGQYT--A 610

Query: 578 ESAMQVRQMAPLPPMDVPGNA 598
+ Q +Q +
Sbjct: 611 FNDAQSKQRGKENNDAMLNQD 631


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
PP_3484	HTHFIS	74	3e-18	FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 74.5 bits (183), Expect = 3e-18
Identities = 34/123 (27%), Positives = 55/123 (44%), Gaps = 1/123 (0%)

Query: 3 RVLVVDDEQTLAQNLQAYLQAQGLEVHVAHDGASGIEQAESLAPQVVVLDYRLPDMEGFQ 62
+LV DD+ + L L G +V + + A+ + +VV D +PD F
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 63 VLEAVRKNR-QCHFVLITAHPTVEVRERAAELGVSHVLFKPFPLMELARAIFDLLGIERR 121
+L ++K R ++++A T +A+E G L KPF L EL I L +R
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 122 RRA 124
R +
Sbjct: 125 RPS 127


47	PP_3502	PP_3520	Y	Y	Y	Pathogenicity Island (biased-composition)
LocusTag	DNBias	CDNBias	%GCBias	Product
PP_3502	-1	8	3.029573	ISPpu10, transposase
PP_3503	-1	10	3.930328	Fis family transcriptional regulator
PP_3504	-1	11	4.288228	hypothetical protein
PP_3505	-1	11	4.302535	hypothetical protein
PP_3506	-3	14	3.658755	magnesium chelatase
PP_3507	-3	14	2.808439	cobaltochelatase subunit CobN
PP_3508	-3	12	0.628946	cobalamin biosynthesis protein CobW
PP_3509	-1	11	0.503977	glyoxalase
PP_3510	1	12	1.872954	hypothetical protein
PP_3511	0	12	1.965688	branched-chain amino acid aminotransferase
PP_3512	2	9	2.195525	transmembrane pair domain-containing protein
PP_3513	2	9	2.938549	LysR family transcriptional regulator
PP_3514	2	9	3.029214	hydantoinase B/oxoprolinase
PP_3515	1	10	3.025951	5-oxoprolinase
PP_3516	2	10	2.099440	AraC family transcriptional regulator
PP_3517	1	10	2.168039	hypothetical protein
PP_3518	2	12	1.688130	hypothetical protein
PP_3519	2	16	-0.348838	lipoprotein
PP_3520	2	15	-0.234685	hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
PP_3503	HTHFIS	401	e-139	FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 401 bits (1031), Expect = e-139
Identities = 148/379 (39%), Positives = 196/379 (51%), Gaps = 36/379 (9%)

Query: 98 FDFHTLPFDVSRVQVTLGRAFGMARLRGKGAVKVDEATHELLGESRPIRELRKLLGKLAP 157
+D+ PFD++ + +GRA + R + L+G S ++E+ ++L +L
Sbjct: 99 YDYLPKPFDLTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQ 158

Query: 158 TESPVLIRGESGTGKELVARTLHRQSQRSEQPFIAINCGAIPEHLIQSELFGHEKGAFTG 217
T+ ++I GESGTGKELVAR LH +R PF+AIN AIP LI+SELFGHEKGAFTG
Sbjct: 159 TDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTG 218

Query: 218 AHQRKTGRIEAADGGTLFLDEIGDLPLELQANLLRFLQEKHIERVGGSQPIPVDVRVLAA 277
A R TGR E A+GGTLFLDEIGD+P++ Q LLR LQ+ VGG PI DVR++AA
Sbjct: 219 AQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAA 278

Query: 278 THVDLERAIEQGRFREDLYYRLNVLQVVTAPLRDRHGDLSMLANHFAHFYSVETGRRPRS 337
T+ DL+++I QG FREDLYYRLNV+ + PLRDR D+ L HF E G +
Sbjct: 279 TNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKE-GLDVKR 337

Query: 338 FSDHALAAMGRHDWPGNVRELANRVRRGLVLAEGRQIEAQDLGLQLLD------------ 385
F AL M H WPGNVREL N VRR L I + + +L
Sbjct: 338 FDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAA 397

Query: 386 -----------------------PEQQPLGTLEEYKQRAERQALCDVLNRHSDNLSVAAK 422
P G + E + L N AA
Sbjct: 398 RSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAAD 457

Query: 423 VLGISRPTFYRLLHKHQIR 441
+LG++R T + + + +
Sbjct: 458 LLGLNRNTLRKKIRELGVS 476


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
PP_3506	HTHFIS	46	1e-07	FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 46.0 bits (109), Expect = 1e-07
Identities = 40/149 (26%), Positives = 58/149 (38%), Gaps = 24/149 (16%)

Query: 34 VLIEGPRGMAKSTLARGLADL--LGEGPFVTLPLGASEERLVGTLDLDAAL-GQGKAQFS 90
++I G G K +AR L D GPFV + + A L +++ L G K F+
Sbjct: 163 LMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDL-----IESELFGHEKGAFT 217

Query: 91 ------PGVLAQADGGVLYVDEVNLLPDTLVDLLLDVAASGTNRIERDGISHRHSARFVL 144
G QA+GG L++DE+ +P LL V G G + +
Sbjct: 218 GAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGE--YTTVGGRTPIRSDVRI 275

Query: 145 IGTMNP------EEGELRPQLLDRFGLNV 167
+ N +G R L R LNV
Sbjct: 276 VAATNKDLKQSINQGLFREDLYYR--LNV 302


48	PP_3604	PP_3616	Y	Y	N	Pathogenicity Island (biased-composition)
LocusTag	DNBias	CDNBias	%GCBias	Product
PP_3604	3	22	1.666120	hypothetical protein
PP_3605	3	20	1.955600	LysR family transcriptional regulator
PP_3606	3	17	1.715436	quinone oxidoreductase
PP_3607	1	10	0.244507	hypothetical protein
PP_3608	0	9	0.013390	LysR family transcriptional regulator
PP_3609	0	9	-0.300469	hypothetical protein
PP_3610	-1	9	-0.589053	hypothetical protein
PP_3611	0	10	-1.312041	hypothetical protein
PP_3612	0	12	-2.037314	TonB-dependent siderophore receptor
PP_3613	3	19	-3.026434	L-sorbosone dehydrogenase
PP_3614	4	28	-4.936795	hypothetical protein
PP_3615	2	19	-4.286456	hypothetical protein
PP_3616	2	18	-3.950767	hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
PP_3614	PF04335	27	0.022	VirB8 type IV secretion protein
		>PF04335#VirB8 type IV secretion protein

Length = 227

Score = 27.1 bits (60), Expect = 0.022
Identities = 12/44 (27%), Positives = 19/44 (43%), Gaps = 10/44 (22%)

Query: 51 AWLIAGALVFCGLALLFALANL----------IRAERKGGRATL 84
AW++AG A + A+A L I +R G A++
Sbjct: 35 AWVVAGVAGALATAGVVAVAALTPLKTVEPYVITVDRNTGEASI 78


49	PP_3669	PP_3707	Y	Y	N	Pathogenicity Island (biased-composition)
LocusTag	DNBias	CDNBias	%GCBias	Product
PP_3669	2	28	-6.586607	LysR family transcriptional regulator
PP_3670	4	46	-12.223871	hypothetical protein
PP_3671	5	57	-15.125689	aldo/keto reductase
PP_3672	6	64	-17.274599	hypothetical protein
PP_3675	5	63	-15.940680	hypothetical protein
PP_3676	5	58	-14.278395	hypothetical protein
PP_3677	4	52	-12.728832	hypothetical protein
PP_3678	5	44	-9.998452	hypothetical protein
PP_3679	5	40	-7.721538	hypothetical protein
PP_3680	5	40	-7.865440	hypothetical protein
PP_3681	5	40	-8.574570	helicase
PP_3682	5	50	-11.658778	hypothetical protein
PP_3683	3	47	-10.504506	hypothetical protein
PP_3684	4	52	-11.128692	hypothetical protein
PP_3685	5	44	-7.124371	hypothetical protein
PP_3686	5	43	-7.258597	hypothetical protein
PP_3688	5	43	-7.301356	hypothetical protein
PP_3689	6	42	-7.098203	serine/threonine protein phosphatase
PP_3690	6	43	-7.041435	hypothetical protein
PP_3691	6	42	-6.722834	DNA helicase-like protein
PP_3692	6	44	-8.366738	hypothetical protein
PP_3693	6	46	-7.998186	transcriptional regulator MvaT, P16 subunit
PP_3694	4	46	-8.054538	hypothetical protein
PP_3695	4	47	-8.151202	hypothetical protein
PP_3696	4	46	-8.405305	hypothetical protein
PP_3697	4	47	-8.491762	hypothetical protein
PP_3698	3	47	-8.888377	hypothetical protein
PP_3699	2	47	-9.965931	hypothetical protein
PP_3700	4	58	-12.852039	hypothetical protein
PP_3701	3	58	-12.919018	hypothetical protein
PP_3702	4	60	-14.274894	hypothetical protein
PP_3703	4	60	-13.672091	hypothetical protein
PP_3704	5	63	-14.006204	hypothetical protein
PP_3705	5	63	-13.672061	hypothetical protein
PP_3706	1	44	-8.992500	hypothetical protein
PP_3707	1	37	-7.646423	hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_3707  RTXTOXINC  29  0.024  Gram-negative bacterial RTX toxin-activating protein C...
		>RTXTOXINC#Gram-negative bacterial RTX toxin-activating protein C signature.

Length = 170

Score = 29.1 bits (65), Expect = 0.024
Identities = 25/114 (21%), Positives = 49/114 (42%), Gaps = 18/114 (15%)

Query: 21 AFGLWGINDMGLVCGNSYFID-----GDDW-SELRSFMVVSQSEDYISVFDIDGN----- 69
A+ W ++ L Y D +DW S R + + D+I+ F +G
Sbjct: 55 AYCSWA--NLSLENEIKYLNDVTSLVAEDWTSGDRKWFI-----DWIAPFGDNGALYKYM 107

Query: 70 AQELPREVLPSLRLHPISYDRSSKKFKAYKFVQDVASSLIRQYEQVGICELIEK 123
++ P E+ ++R+ P ++ +F K + +A+ + +QY I E+ K
Sbjct: 108 RKKFPDELFRAIRVDPKTHVGKVSEFHGGKIDKQLANKIFKQYHHELITEVKRK 161


50  PP_3720  PP_3751  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
PP_3720  0  15  3.511257  ribosyldihydronicotinamide dehydrogenase
PP_3721  0  14  3.164869  aspartate aminotransferase
PP_3722  -1  12  2.654272  alanine racemase
PP_3724  -1  12  2.801859  acyl-CoA synthetase
PP_3725  -1  14  2.503660  acyl-CoA dehydrogenase
PP_3726  -1  14  2.771022  enoyl-CoA hydratase
PP_3727  0  12  2.164462  amino acid transporter
PP_3728  0  13  2.624020  multi-sensor hybrid histidine kinase
PP_3729  3  14  3.271398  amino acid ABC transporter substrate-binding
PP_3730  4  14  3.585622  transcriptional regulator
PP_3731  3  14  3.217906  TetR family transcriptional regulator
PP_3732  3  15  2.935380  enoyl-CoA hydratase
PP_3733  1  14  2.798697  ABC transporter
PP_3734  3  12  2.747032  hypothetical protein
PP_3735  0  11  2.056079  ABC transporter ATP-binding protein
PP_3736  1  11  1.390904  Rieske (2Fe-2S) domain-containing protein
PP_3737  1  12  1.562928  ferredoxin
PP_3738  0  12  1.626603  GntR family transcriptional regulator
PP_3740  -1  11  2.123088  major facilitator family transporter
PP_3741  -2  12  2.748420  penicillin-binding protein 2
PP_3742  -2  10  3.230700  glutathione S-transferase
PP_3743  -2  11  3.920101  hypothetical protein
PP_3744  -2  11  3.497699  DNA-binding transcriptional regulator GlcC
PP_3745  -1  14  3.909178  glycolate oxidase subunit GlcD
PP_3746  2  11  4.487856  glycolate oxidase FAD binding subunit
PP_3747  2  12  4.112082  glycolate oxidase iron-sulfur subunit
PP_3748  1  12  4.110291  hypothetical protein
PP_3749  1  12  3.742314  hypothetical protein
PP_3750  2  12  3.890340  GntR family transcriptional regulator
PP_3751  2  14  3.217992  major facilitator superfamily transporter
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_3722  ALARACEMASE  210  1e-66  Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 210 bits (535), Expect = 1e-66
Identities = 102/344 (29%), Positives = 158/344 (45%), Gaps = 35/344 (10%)

Query: 44 AWVEVSASALQHNIRTLQAELAGKSKLCAVLKADAYGHGIGLVMPSIIAQGVPCVAVASN 103
+ AL+ N+ ++ + A +++ +V+KA+AYGHGI + +I A A+ +
Sbjct: 5 IQASLDLQALKQNLSIVR-QAATHARVWSVVKANAYGHGIERIWSAIGATD--GFALLNL 61

Query: 104 EEARVVRASGFTGQ-LVRVRLASLSELEDGLQYDMEELVGSAEFARQADAIA-ARHGKTL 161
EEA +R G+ G L+ +LE Q+ + V S Q A+ AR L
Sbjct: 62 EEAITLRERGWKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNW---QLKALQNARLKAPL 118

Query: 162 RIHMALNSSGMSRNGVE----MATWSGRGEALQITDQKHLKLVALMTHFA-VEDKDDVRK 216
I++ +NS GM+R G + + W Q+ ++ + LM+HFA E D +
Sbjct: 119 DIYLKVNS-GMNRLGFQPDRVLTVWQ------QLRAMANVGEMTLMSHFAEAEHPDGISG 171

Query: 217 GLAAFNEQTDWLIKHARLDRSKLTLHAANSFATLEVPEARLDMVRTGGALFGDT------ 270
+A + + L + +NS ATL PEA D VR G L+G +
Sbjct: 172 AMARIEQAAEGL---------ECRRSLSNSAATLWHPEAHFDWVRPGIILYGASPSGQWR 222

Query: 271 VPARTEYKRAMQFKSHVAAVHSYPAGNTVGYDRTFTLARDSRLANITVGYSDGYRRVFTN 330
A T + M S + V + AG VGY +T + R+ + GY+DGY R
Sbjct: 223 DIANTGLRPVMTLSSEIIGVQTLKAGERVGYGGRYTARDEQRIGIVAAGYADGYPRHAPT 282

Query: 331 KGHVLINGHRVPVVGKVSMNTLMVDVTDFPDVKGGNEVVLFGKQ 374
VL++G R VG VSM+ L VD+T P G V L+GK+
Sbjct: 283 GTPVLVDGVRTMTVGTVSMDMLAVDLTPCPQAGIGTPVELWGKE 326


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_3728  HTHFIS  62  5e-12  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 61.8 bits (150), Expect = 5e-12
Identities = 28/114 (24%), Positives = 44/114 (38%), Gaps = 3/114 (2%)

Query: 514 ILVVEDVALNREVAGGLLMRDGHRVSFAEDASQALQACAQRRFDLVLLDVHLPGMSGVAL 573
ILV +D A R V L R G+ V +A+ + A DLV+ DV +P + L
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 574 CRQLRSSPGPNRHSRILALTAGVQPGQVAGYLDAGMQGVLAKPLRLDSLRKALA 627
+++ +L ++A + G L KP L L +
Sbjct: 66 LPRIKK---ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_3730  HTHFIS  100  2e-26  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 100 bits (250), Expect = 2e-26
Identities = 33/130 (25%), Positives = 61/130 (46%), Gaps = 1/130 (0%)

Query: 3 PRVLIVDDDPLVRDLLQAYLSREGYDVHCADTAERAEALLGSQDVDLVLLDIRLPGKDGL 62
+L+ DDD +R +L LSR GYDV A + + D DLV+ D+ +P ++
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 63 TLTRELR-VRSEVGIILITGRNDDIDRIVGLECGADDYVIKPLNPRELVSRAKNLIRRVR 121
L ++ R ++ +++++ +N + I E GA DY+ KP + EL+ + +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 122 HAREAHPAPA 131
+
Sbjct: 124 RRPSKLEDDS 133


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_3731  HTHTETR  61  1e-13  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 60.8 bits (147), Expect = 1e-13
Identities = 35/210 (16%), Positives = 73/210 (34%), Gaps = 7/210 (3%)

Query: 5 ARYHRMLPELRKANLVEATLVCLKRHGFQGASIRKISAEAGVSVGLISHHYAGKDELVAE 64
AR + + + ++++ L + G S+ +I+ AGV+ G I H+ K +L +E
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 65 AYMAVTGRVMGLLREAMAQAAPNARERLSAFFRASFCAELLDPQ---LLDAWLAFWGAVK 121
+ + L E A+ + L + + + + L++ V
Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVG 121

Query: 122 TADAINQVHDHSYGEYRNELGQLLAR-LAEEEGWQGFDADLAAISLSALLDGLWLESGLN 180
+ Q + E + + Q L + + AAI + + GL
Sbjct: 122 EMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFA 181

Query: 181 PGTFTPEQGVIICEAWVDGLQAGGRRRFSL 210
P +F ++ +V L +L
Sbjct: 182 PQSFDLKK---EARDYVAILLEMYLLCPTL 208


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_3740  TCRTETB  51  6e-09  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 50.6 bits (121), Expect = 6e-09
Identities = 31/172 (18%), Positives = 76/172 (44%), Gaps = 2/172 (1%)

Query: 27 ILNMIDGFDVLVMAFTAASVSAEWGLNGAQVGLLLSAGLFGMAAGSLFIAPWADRFGRRP 86
IL+ + +V+ + ++ ++ A + +A + + G+ +D+ G +
Sbjct: 21 ILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKR 80

Query: 87 LILLCLALSGIGMLLSALSQSPLQLALL-RGLTGLGIGGILASSNVIASEYASERWRGLA 145
L+L + ++ G ++ + S L ++ R + G G A V+ + Y + RG A
Sbjct: 81 LLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKA 140

Query: 146 VSLQSTGYALGATLGGLLAVWLLGHWGWRSVFLFGGVVTVLVIPLVLLWLPE 197
L + A+G +G + + + W + L ++T++ +P ++ L +
Sbjct: 141 FGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLI-PMITIITVPFLMKLLKK 191


51  PP_3766  PP_3791  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
PP_3766  2  14  0.701061  lactoylglutathione lyase
PP_3767  1  17  0.259114  hypothetical protein
PP_3768  1  18  -0.244221  shikimate 5-dehydrogenase
PP_3769  1  17  -1.468356  hypothetical protein
PP_3770  -1  20  -2.708317  hypothetical protein
PP_3771  -1  24  -3.088383  hypothetical protein
PP_3772  0  24  -3.139390  phage repressor
PP_3773  -1  30  -3.278460  hypothetical protein
PP_3774  0  31  -3.861402  alginate lyase 2
PP_3775  0  35  -4.037742  sarcosine oxidase
PP_3776  1  33  -4.921901  rarD protein
PP_3777  3  34  -5.592199  hypothetical protein
PP_3778  4  33  -5.290315  pyrroline-5-carboxylate reductase
PP_3779  4  33  -5.491096  LysR family transcriptional regulator
PP_3780  2  31  -5.204607  hypothetical protein
PP_3781  1  29  -5.009866  oxygen-independent coproporphyrinogen III
PP_3782  2  27  -4.137630  hypothetical protein
PP_3783  1  25  -3.630077  hypothetical protein
PP_3784  0  26  -3.783591  hypothetical protein
PP_3785  -1  26  -3.967211  hypothetical protein
PP_3786  0  28  -4.048422  aminotransferase
PP_3787  0  30  -4.235405  hypothetical protein
PP_3788  -1  29  -4.960519  non-ribosomal peptide synthetase
PP_3789  -2  25  -4.964068  efflux transporter
PP_3790  -2  24  -4.857835  diaminopimelate epimerase
PP_3791  -2  14  -3.050999  phage integrase site specific recombinase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_3789  TCRTETA  44  6e-07  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 44.0 bits (104), Expect = 6e-07
Identities = 42/202 (20%), Positives = 68/202 (33%), Gaps = 19/202 (9%)

Query: 52 TGALLGISMLLATLMSLPAGLLFDRFARLHLATITLLLMTLAVGLLPFAQI--ILLVSLL 109
G LL + L+ + G L DRF R + ++L + ++ A +L + +
Sbjct: 45 YGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRI 104

Query: 110 LVFMEFAAALFGIGLKALLADFVSVKQRVSAFSYRYILTNVAFAIGPVLGVRLAEVSLTL 169
+ + A G A +AD +R F + GPVLG + S
Sbjct: 105 VAGITGAT---GAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHA 161

Query: 170 ALLVAAGACG-AAMVVMMCLGASQGQPRTSLKTGAPSLADALKVLGNDRNLVLYTLGSFF 228
AA G + L S R L+ A + + + +
Sbjct: 162 PFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRW-------------ARG 208

Query: 229 NTVVHGRFTFFLSLWLLYQYPA 250
TVV F + L+ Q PA
Sbjct: 209 MTVVAALMAVFFIMQLVGQVPA 230


52  PP_3836  PP_3852  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
PP_3836  -1  21  -3.227788  hypothetical protein
PP_3837  -1  28  -3.808523  *SURF4 domain-containing protein
PP_3838  -1  31  -3.724189  hypothetical protein
PP_3839  -1  24  -3.219755  alcohol dehydrogenase
PP_3840  -1  28  -4.624311  hypothetical protein
PP_3841  0  28  -4.358472  MerR family transcriptional regulator
PP_3842  -1  26  -4.283219  hypothetical protein
PP_3843  0  23  -4.098901  hypothetical protein
PP_3844  2  36  -8.197472  D-aminopeptidase
PP_3845  2  40  -9.231552  polyamine ABC transporter substrate-binding
PP_3846  4  43  -9.263770  carbon-nitrogen hydrolase
PP_3847  5  49  -10.502486  LuxR family transcriptional regulator
PP_3848  6  51  -10.226404  hypothetical protein
PP_3849  5  46  -9.169386  calcium-binding protein, hemolysin-type
PP_3850  7  37  -4.851708  hypothetical protein
PP_3851  4  35  -4.932535  hypothetical protein
PP_3852  2  29  -3.789743  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_3842  NUCEPIMERASE  42  3e-06  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 41.7 bits (98), Expect = 3e-06
Identities = 27/130 (20%), Positives = 49/130 (37%), Gaps = 20/130 (15%)

Query: 6 FVTGGSGFVGQHLLAGLTAQGHKTWVLMRSPGNIE-----RLKEQVGQLGGNPEYIHAVE 60
VTG +GF+G H+ L GH+ + N+ LK+ +L P + +
Sbjct: 4 LVTGAAGFIGFHVSKRLLEAGHQVVGI----DNLNDYYDVSLKQARLELLAQPGF-QFHK 58

Query: 61 GDIS-QEGLGLSEADKERVTSAAVFFHLAAQ----FSWGLTPERARTVNVQGALSVARLA 115
D++ +EG+ A F + +S P N+ G L++
Sbjct: 59 IDLADREGMTDLFASGH----FERVFISPHRLAVRYSLE-NPHAYADSNLTGFLNILEGC 113

Query: 116 ASQRIRLLMV 125
+I+ L+
Sbjct: 114 RHNKIQHLLY 123


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_3849  RTXTOXINA  81  9e-18  Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 81.2 bits (200), Expect = 9e-18
Identities = 55/164 (33%), Positives = 74/164 (45%), Gaps = 16/164 (9%)

Query: 347 TSAGDSFSGSSRADWIVGGDGNDELKGLSGDDMLEGGDGDDILDGGTGADIMIGGQGNDT 406
T+ D F GS D G DG+D ++G G+D L G G+D L GG G D + GG GND
Sbjct: 725 TTRADKFFGSKFTDIFHGADGDDLIEGNDGNDRLYGDKGNDTLSGGNGDDQLYGGDGNDK 784

Query: 407 YYVDWGNDKVIETSSAGGIDTVISSVSRTLGAYQENLVLTGSSAINGTGNNLANTLTGND 466
GN+ + + G D G VL G G GN + L G++
Sbjct: 785 LIGVAGNNYL----NGGDGDDEFQ----VQGNSLAKNVLFG-----GKGN---DKLYGSE 828

Query: 467 ADNVLNGGAGADIMVGGLGNDTYYVDWGNDKVIETSANGGIDTI 510
++L+GG G D++ GG GND Y G I G D +
Sbjct: 829 GADLLDGGEGDDLLKGGYGNDIYRYLSGYGHHIIDDDGGKEDKL 872



Score = 71.9 bits (176), Expect = 6e-15
Identities = 73/328 (22%), Positives = 116/328 (35%), Gaps = 65/328 (19%)

Query: 381 EGGDGDDILDGGTGADIMIGGQGNDTYYVDWGNDKVIETSSAGGIDTVISSVSRTLGAYQ 440
GDGDD + G+ + G+G+D Y D + G T+ + + G Y
Sbjct: 615 HLGDGDDKVFLSAGSANIYAGKGHDVVYYDKTDT---------GYLTIDGTKATEAGNYT 665

Query: 441 ENLVLTGSSAINGTGNNLANTLTGNDADNVLNGGAGADIMVGGLGNDTYYVDWGNDKVIE 500
VL G + G + Y + +
Sbjct: 666 VTRVLGGDVKVLQEVVKEQEVSVGKRTEKT------------------QYRSYEFTHI-- 705

Query: 501 TSANGGIDTI--ISSVSRTLGDYQENLVLTGTAALYGNGNNLANTLTGNNGDNVLNGGAG 558
+ + SV +G + + +G + + + GN+G++ L G G
Sbjct: 706 --NGKNLTETDNLYSVEELIGTTRADKFFGSKFTDIFHGADGDDLIEGNDGNDRLYGDKG 763

Query: 559 ADTMIGGLGNDTYYVDWGNDKVIETSANGGIDTVISSVSRTLGDHQENLVLTGTKANYGT 618
DT+ GG G+D Y GNDK+I + N ++ G
Sbjct: 764 NDTLSGGNGDDQLYGGDGNDKLIGVAGNNYLNG-------------------------GD 798

Query: 619 GNSLDNTLTGNGADNLLNGGAGNDTLVGGAGNDRLVGDLGKDVLTGGAGNDVFAFNTTKE 678
G+ + A N+L GG GND L G G D L G G D+L GG GND++ + +
Sbjct: 799 GDDEFQVQGNSLAKNVLFGGKGNDKLYGSEGADLLDGGEGDDLLKGGYGNDIYRYLSG-- 856

Query: 679 SGLTSTAWDVITDFKRGSDKIDVSGIDA 706
+I D DK+ ++ ID
Sbjct: 857 -----YGHHIIDDDGGKEDKLSLADIDF 879



Score = 60.4 bits (146), Expect = 2e-11
Identities = 46/206 (22%), Positives = 76/206 (36%), Gaps = 15/206 (7%)

Query: 470 VLNGGAGADIMVGGLGNDTYYVDWGNDKVIETSANGGIDTIISSVSRTLGDYQENLVLTG 529
+ G G D + G+ Y G+D V + G TI + + G+Y VL G
Sbjct: 613 ESHLGDGDDKVFLSAGSANIYAGKGHDVVYYDKTDTGYLTIDGTKATEAGNYTVTRVLGG 672

Query: 530 TAALYGNGNNLANTLTGNNGDNVLNGGAGADTMIGGLGNDTYYVDWGNDKVIETSANGGI 589
+ G + T+ + ET +
Sbjct: 673 DVKVLQEVVKEQEVSVGKRTEKTQYRSY----------EFTHI---NGKNLTETDNLYSV 719

Query: 590 DTVISSVSRTL--GDHQENLVLTGTKANYGTGNSLDNTLTGNGADNLLNGGAGNDTLVGG 647
+ +I + G ++ + GN ++ L G+ ++ L+GG G+D L GG
Sbjct: 720 EELIGTTRADKFFGSKFTDIFHGADGDDLIEGNDGNDRLYGDKGNDTLSGGNGDDQLYGG 779

Query: 648 AGNDRLVGDLGKDVLTGGAGNDVFAF 673
GND+L+G G + L GG G+D F
Sbjct: 780 DGNDKLIGVAGNNYLNGGDGDDEFQV 805


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_3850  PF00577  26  0.014  Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 26.4 bits (58), Expect = 0.014
Identities = 17/64 (26%), Positives = 26/64 (40%), Gaps = 5/64 (7%)

Query: 4 QGRMMAGVPQAWLAELGDHVALVTDPDGRAAVLSVMAYAARRRNDV--DDNDLVDMLELT 61
++ P A A++ + + TD G A + Y R N V D N L D ++L
Sbjct: 716 DTVVLVKAPGAKDAKVENQTGVRTDWRGYAVLPYATEY---RENRVALDTNTLADNVDLD 772

Query: 62 EAAR 65
A
Sbjct: 773 NAVA 776


53  PP_3881  PP_3917  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
PP_3881  2  25  -2.672281  phage terminase, large subunit
PP_3882  2  28  -2.693737  phage terminase small subunit
PP_3883  2  30  -2.908237  phage holin
PP_3884  1  18  -1.010156  phage holin
PP_3885  1  16  -0.417823  hypothetical protein
PP_3886  1  15  0.582415  hypothetical protein
PP_3887  1  13  0.983143  hypothetical protein
PP_3888  3  16  1.467258  hypothetical protein
PP_3889  1  15  0.406736  phage integrase
PP_3890  1  19  -0.698273  hypothetical protein
PP_3891  2  22  -2.083078  hypothetical protein
PP_3892  3  22  -3.120872  hypothetical protein
PP_3893  3  23  -3.114691  phage DNA helicase
PP_3894  3  28  -5.063194  phage replication protein O
PP_3895  5  33  -5.234622  regulatory protein Cro
PP_3896  5  29  -4.787165  Cro/CI family transcriptional regulator
PP_3897  5  29  -4.557622  hypothetical protein
PP_3898  6  26  -4.049032  hypothetical protein
PP_3899  4  28  -4.907442  hicB protein
PP_3900  3  25  -3.165377  hicA protein
PP_3901  4  25  -2.195665  hypothetical protein
PP_3902  2  24  -1.852678  hypothetical protein
PP_3903  3  25  -1.775141  hypothetical protein
PP_3904  4  24  -2.054714  hypothetical protein
PP_3905  4  21  -0.948719  LuxR family transcriptional regulator
PP_3906  3  21  -1.604848  hypothetical protein
PP_3907  0  21  0.087425  hypothetical protein
PP_3908  0  23  -0.264999  hypothetical protein
PP_3909  -1  22  -0.154169  hypothetical protein
PP_3910  -1  22  0.040481  hypothetical protein
PP_3911  -2  23  0.485359  hypothetical protein
PP_3912  -1  26  0.782721  DNA-cytosine methyltransferase
PP_3913  2  25  -0.336767  hypothetical protein
PP_3914  3  23  0.798889  hypothetical protein
PP_3915  2  23  -0.608228  hypothetical protein
PP_3916  3  24  -0.880656  hypothetical protein
PP_3917  3  17  -1.944454  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_3882  PERTACTIN  27  0.027  Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 27.4 bits (60), Expect = 0.027
Identities = 15/56 (26%), Positives = 22/56 (39%)

Query: 6 PTPTELKLVRGNPGKRPINKNEPQPAKRIPSAPDHLSSDGQVAWGRLTVLLDRMGV 61
P P +L L G G+ I E P S P ++ Q W T +D + +
Sbjct: 376 PEPVKLTLAGGAQGQGDIVATELPPIPGASSGPLDVALASQARWTGATRAVDSLSI 431


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_3885  FbpA_PF05833  27  0.002  Fibronectin-binding protein
		>FbpA_PF05833#Fibronectin-binding protein

Length = 577

Score = 26.8 bits (59), Expect = 0.002
Identities = 7/24 (29%), Positives = 16/24 (66%)

Query: 6 TGQLRSMLIGGQIEKVVKPCRSTV 29
+L++ +I G+I+KV +P + +
Sbjct: 12 IDELKNTIINGKIDKVNQPEKDEI 35


54  PP_3962  PP_3989  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
PP_3962  -1  32  -4.301023  lipoprotein
PP_3963  1  33  -4.874700  hypothetical protein
PP_3964  0  31  -4.891492  ISPpu14, transposase Orf3
PP_3965  3  33  -5.146912  ISPpu14, transposase Orf2
PP_3966  4  34  -5.436308  ISPpu14, transposase Orf1
PP_3967  4  35  -5.179896  MutT/nudix family protein
PP_3968  4  34  -5.096893  sensor histidine kinase
PP_3969  2  32  -3.864489  response regulator
PP_3970  1  30  -3.633122  molecular chaperone GroES
PP_3971  0  30  -3.503736  hypothetical protein
PP_3972  3  33  -5.242197  short chain dehydrogenase
PP_3973  2  30  -5.518717  hypothetical protein
PP_3974  1  32  -5.454516  LysR family transcriptional regulator
PP_3975  1  32  -5.839059  hypothetical protein
PP_3976  0  28  -5.263124  isochorismatase superfamily hydrolase
PP_3977  1  37  -6.044740  hypothetical protein
PP_3978  0  37  -5.540247  hypothetical protein
PP_3979  0  37  -5.814409  ISPpu14, transposase Orf1
PP_3980  1  33  -5.324115  ISPpu14, transposase Orf2
PP_3981  2  33  -5.632933  ISPpu14, transposase Orf3
PP_3982  3  38  -6.478630  hypothetical protein
PP_3983  5  43  -9.565401  hypothetical protein
PP_3984  5  49  -10.940491  ISPpu13, transposase Orf3
PP_3985  4  43  -10.204527  ISPpu13, transposase Orf2
PP_3986  4  52  -12.071932  ISPpu13, transposase Orf1
PP_3987  2  42  -9.719455  phage integrase site specific recombinase
PP_3988  2  39  -8.988623  hypothetical protein
PP_3989  -1  22  -4.431638  DNA-cytosine methyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_3969  HTHFIS  71  6e-18  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 71.4 bits (175), Expect = 6e-18
Identities = 29/119 (24%), Positives = 53/119 (44%), Gaps = 5/119 (4%)

Query: 5 VLVVEDDEVLRWLMTEAVTHLGHTVTDCASADDALAILEKIPDLSLVITDIQMPGHIDGL 64
+LV +DD +R ++ +A++ G+ V ++A + LV+TD+ MP +
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD-GDLVVTDVVMPD-ENAF 63

Query: 65 ELAKAIWWEHPELPVIIVSGHVVFS--PSSLPANAR-FIKKPCTLDSLSRAMQELLPTR 120
+L I P+LPV+++S F + A ++ KP L L + L
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_3972  DHBDHDRGNASE  65  4e-14  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 65.5 bits (159), Expect = 4e-14
Identities = 53/189 (28%), Positives = 78/189 (41%), Gaps = 2/189 (1%)

Query: 122 LRGKVVVITGASSGIGRAAAHAFACKGARLVLAARDEEALFDVLDECTDCGTDAIAVTTD 181
+ GK+ ITGA+ GIG A A A +GA + + E L V+ A A D
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 182 VTHSDQVQALAAQASAFGHGRIDIWVNNAGVGAVGNFEDTPLEAHEQVIQTDLIGYLRGA 241
V S + + A+ G IDI VN AGV G E E + G +
Sbjct: 66 VRDSAAIDEITARIER-EMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNAS 124

Query: 242 YVALPFFKAQGSGILINTLSLGSWVAQPYAAAYSASKFGLRGLTEALRGELTKFPNIHVC 301
+ + SG ++ S + V + AAY++SK T+ L EL ++ NI
Sbjct: 125 RSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEY-NIRCN 183

Query: 302 DIYPAVMDT 310
+ P +T
Sbjct: 184 IVSPGSTET 192


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_3974  HTHFIS  32  0.003  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 32.1 bits (73), Expect = 0.003
Identities = 14/83 (16%), Positives = 38/83 (45%), Gaps = 7/83 (8%)

Query: 3 VVKPSPGSAQSIPANANGPKWDVGEAVSQLSWDDLRIIKTLSDCS-NRAATAKKLGINVS 61
V+ + + +A P ++++ + I+ L+ N+ A LG+N +
Sbjct: 407 AVEENMRQYFASFGDALPPSGLYDRVLAEMEYP--LILAALTATRGNQIKAADLLGLNRN 464

Query: 62 TVSRRVAQVEKTLGVALFDHRKA 84
T+ +++ + LGV+++ ++
Sbjct: 465 TLRKKI----RELGVSVYRSSRS 483


55  PP_4079  PP_4092  Y  N  Y  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
PP_4079  0  20  -3.244499  hypothetical protein
PP_4080  0  23  -4.231756  hypothetical protein
PP_4081  1  28  -5.543070  hypothetical protein
PP_4082  1  26  -5.451204  hcp protein
PP_4084  2  26  -5.441218  hypothetical protein
PP_4085  2  28  -6.322098  RHS family protein
PP_4086  3  37  -9.651495  hypothetical protein
PP_4087  2  31  -6.945166  hypothetical protein
PP_4091  0  24  -4.921691  ISPpu15, transposase Orf2
PP_4092  1  16  -3.227135  ISPpu15, transposase Orf1
56  PP_4153  PP_4164  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
PP_4153  0  10  3.497097  hypothetical protein
PP_4154  0  10  3.666682  hypothetical protein
PP_4155  0  11  3.023485  hypothetical protein
PP_4156  1  12  3.390918  LysR family transcriptional regulator
PP_4157  1  11  3.417389  winged helix family two component
PP_4158  1  10  3.672672  osmosensitive K+ channel signal transduction
PP_4159  1  10  2.529535  potassium-transporting ATPase subunit C
PP_4161  1  11  2.187685  potassium-transporting ATPase subunit A
PP_4162  2  13  2.657763  hypothetical protein
PP_4163  2  14  2.774269  hypothetical protein
PP_4164  2  16  2.330228  alpha/beta hydrolase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_4157  HTHFIS  95  1e-24  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 94.9 bits (236), Expect = 1e-24
Identities = 45/159 (28%), Positives = 74/159 (46%), Gaps = 4/159 (2%)

Query: 3 QSATLLVIDDEPQIRKFLRISLASQGYKVLEAATGAEGLAQAALGKPDLVVLDLGLPDMD 62
AT+LV DD+ IR L +L+ GY V + A A G DLVV D+ +PD +
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 63 GQQVLRELREWSA-VPVMVLSVRASEVQKVDALDGGANDYVTKPFGIQEFLARV-RALLR 120
+L +++ +PV+V+S + + + + A + GA DY+ KPF + E + + RAL
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 121 QVPQAGGSEVAASFGPLTV--DFAFRRVTLDGVEVALTR 157
+ E + G V A + + + T
Sbjct: 122 PKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTD 160


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_4162  PREPILNPTASE  32  0.002  Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 32.5 bits (74), Expect = 0.002
Identities = 23/79 (29%), Positives = 36/79 (45%), Gaps = 6/79 (7%)

Query: 5 DRLLVQILLVALLGAGLWVMAPFISALLWGAILAFASWPLMRFLTRLLGGRETLAAG--- 61
D+L + +L LL L A+ GA+ + + + +LL G+E + G
Sbjct: 159 DQLTLPLLWGGLLFNLLGGFVSLGDAV-IGAMAGYLVLWSLYWAFKLLTGKEGMGYGDFK 217

Query: 62 LLTM--AWILIVALPLVWL 78
LL AW+ ALP+V L
Sbjct: 218 LLAALGAWLGWQALPIVLL 236


57  PP_4185  PP_4191  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
PP_4185  2  29  -1.368372  succinyl-CoA synthetase subunit alpha
PP_4186  3  29  -1.023860  succinyl-CoA synthetase subunit beta
PP_4187  3  29  -0.659543  dihydrolipoamide dehydrogenase
PP_4188  4  28  -0.896143  dihydrolipoamide succinyltransferase
PP_4189  4  26  -1.338771  2-oxoglutarate dehydrogenase E1
PP_4190  4  26  -1.437865  succinate dehydrogenase iron-sulfur subunit
PP_4191  2  22  -1.649369  succinate dehydrogenase flavoprotein subunit
58  PP_4230  PP_4261  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
PP_4230  0  12  3.426218  hypothetical protein
PP_4231  0  15  3.305371  xanthine dehydrogenase accessory factor
PP_4232  1  15  3.443682  gluconate 2-dehydrogenase
PP_4233  3  15  3.310629  (2Fe-2S)-binding protein
PP_4235  4  19  -1.694452  protein-disulfide reductase
PP_4236  3  27  -6.138200  redoxin domain-containing protein
PP_4237  5  43  -10.488566  disulfide isomerase/thiol-disulfide oxidase
PP_4238  1  8  1.061318  hypothetical protein
PP_4239  1  9  0.888917  hypothetical protein
PP_4240  1  9  0.935988  microcin b17 processing protein mcbd
PP_4241  2  11  2.351260  hypothetical protein
PP_4242  2  11  2.611966  hypothetical protein
PP_4243  3  14  3.261906  peptide synthase
PP_4244  0  16  -0.223753  extracytoplasmic-function sigma-70 factor
PP_4245  -1  13  0.109801  siderophore biosynthesis protein
PP_4246  -1  14  0.284403  hypothetical protein
PP_4247  0  17  0.512273  exonuclease
PP_4248  0  15  -0.021918  hypothetical protein
PP_4249  0  14  -0.417823  hypothetical protein
PP_4250  2  21  -0.427900  cbb3-type cytochrome c oxidase subunit I
PP_4251  2  20  -0.371790  cbb3-type cytochrome c oxidase subunit II
PP_4252  2  17  -0.714970  cytochrome c oxidase, cbb3-type, CcoQ subunit
PP_4253  1  17  -0.480821  cytochrome c oxidase, cbb3-type subunit III
PP_4254  2  19  -0.628578  hypothetical protein
PP_4255  0  22  -0.017050  cbb3-type cytochrome c oxidase subunit I
PP_4256  -2  18  1.963535  cbb3-type cytochrome c oxidase subunit II
PP_4257  -3  19  2.126574  cbb3-type cytochrome oxidase subunit
PP_4258  -1  19  2.818176  cytochrome c oxidase, cbb3-type subunit III
PP_4259  -1  21  2.819166  (Fe-S)-binding protein
PP_4260  0  20  3.348422  hypothetical protein
PP_4261  -2  19  3.399488  heavy metal translocating P-type ATPase
59  PP_4342  PP_4360  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
PP_4342  -1  21  -4.415647  flagellar number regulator FleN
PP_4343  0  28  -6.542411  flagellar biosynthesis regulator FlhF
PP_4344  1  39  -9.548951  flagellar biosynthesis protein FlhA
PP_4345  4  59  -13.995677  GntR family transcriptional regulator
PP_4346  2  48  -11.727846  D-alanine--D-alanine ligase
PP_4347  2  42  -10.907120  hypothetical protein
PP_4348  0  35  -9.741331  cystathionine beta-lyase
PP_4349  1  23  -7.008438  hypothetical protein
PP_4350  0  17  -3.846340  aminotransferase
PP_4352  4  20  0.153042  flagellar biosynthesis protein FlhB
PP_4353  5  20  0.498443  flagellar biosynthesis protein FliR
PP_4354  4  22  0.684374  flagellar biosynthesis protein FliQ
PP_4355  4  21  0.673991  flagellar biosynthesis protein FliP
PP_4356  0  13  1.695349  flagellar assembly protein FliO
PP_4357  0  11  0.757661  flagellar motor switch protein
PP_4358  0  13  0.817397  flagellar motor switch protein FliM
PP_4359  1  15  0.504924  flagellar basal body protein FliL
PP_4360  2  16  0.942464  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_4352  TYPE3IMSPROT  324  e-111  Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 324 bits (831), Expect = e-111
Identities = 103/349 (29%), Positives = 188/349 (53%), Gaps = 3/349 (0%)

Query: 9 DKTEDPTEKRKRDAREKGEVARSKELNTVAVTLAGAGGLLAFGGHVAETLLALMRMNFSL 68
+KTE PT K+ RDAR+KG+VA+SKE+ + A+ +A + L+ + E LM
Sbjct: 4 EKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLML--IPA 61

Query: 69 TRDIIVDERAMGAFLLASGKMAIWAVQPVLILLFVVSFVAPIALSGFLFSGSLLQPKFSR 128
+ + +A+ + + P+L + +++ + + GFL SG ++P +
Sbjct: 62 EQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIKK 121

Query: 129 MNPLSGIKRMFSMQALTELLKALAKFFVILVVAVVVLSGDRQALLSIANEPLEQAIIHSL 188
+NP+ G KR+FS+++L E LK++ K ++ ++ +++ G+ LL + +E
Sbjct: 122 INPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLLG 181

Query: 189 QVVGWSALWMSAGLLLIAAADVPFQLYQTHKKMKMTKQEVRDEYKDSEGKPEVKQRIRQL 248
Q++ + + G ++I+ AD F+ YQ K++KM+K E++ EYK+ EG PE+K + RQ
Sbjct: 182 QILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQF 241

Query: 249 QREVSQRRMMAAVPDADVIITNPTHYAVALQYDPEKGGAAPLLLAKGSDFMALKIREIGV 308
+E+ R M V + V++ NPTH A+ + Y + PL+ K +D +R+I
Sbjct: 242 HQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETP-LPLVTFKYTDAQVQTVRKIAE 300

Query: 309 EHNIQILESPALARAIYYSTELEQEIPAGLYLAVAQVLAYVFQIRQYRA 357
E + IL+ LARA+Y+ ++ IPA A A+VL ++ + +
Sbjct: 301 EEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQNIEKQ 349


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_4353  TYPE3IMRPROT  136  3e-41  Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.

Length = 261

Score = 136 bits (344), Expect = 3e-41
Identities = 94/255 (36%), Positives = 152/255 (59%), Gaps = 2/255 (0%)

Query: 1 MLELTDTQIGTWVATFILPLFRVTAVLMTMPIFGTRMLPARIRLYVAVAVTVVIVPALPP 60
ML++T Q +W+ + PL RV A++ T PI R +P R++L +A+ +T I P+LP
Sbjct: 1 MLQVTSEQWLSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA 60

Query: 61 LPEFDPLSLRGMLLCAEQIIVGALFGLALQLLFQAFVVAGQIVAVQMGMAFASMVDPANG 120
S + L +QI++G G +Q F A AG+I+ +QMG++FA+ VDPA+
Sbjct: 61 NDVPV-FSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASH 119

Query: 121 VNVTVISQFMTMLVSVLFLLMNGHLVVFEVLTESFTTLPVGSALVVNHFWELAGRMGW-V 179
+N+ V+++ M ML +LFL NGHL + +L ++F TLP+G + ++ + + G +
Sbjct: 120 LNMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLI 179

Query: 180 FGAGLLLILPVIAALLVVNIAFGVMTRAAPQLNIFSIGFPLTLVMGMFIFWVGLADVLSH 239
F GL+L LP+I LL +N+A G++ R APQL+IF IGFPLTL +G+ + + +
Sbjct: 180 FLNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPF 239

Query: 240 YQALASETLQWLREL 254
+ L SE L ++
Sbjct: 240 CEHLFSEIFNLLADI 254


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_4354  TYPE3IMQPROT  53  3e-13  Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 53.2 bits (128), Expect = 3e-13
Identities = 21/74 (28%), Positives = 38/74 (51%)

Query: 7 VDLFRDALWLTTLMVAVLVVPSLLVGLVVAMFQAATQINEQTLSFLPRLLVMLITLIIAG 66
V AL+L ++ + + ++GL+V +FQ TQ+ EQTL F +LL + + L +
Sbjct: 5 VFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFLLS 64

Query: 67 PWLVQKFMEYITTL 80
W + + Y +
Sbjct: 65 GWYGEVLLSYGRQV 78


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_4355  FLGBIOSNFLIP  269  2e-93  Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature.

Length = 245

Score = 269 bits (688), Expect = 2e-93
Identities = 137/244 (56%), Positives = 186/244 (76%), Gaps = 1/244 (0%)

Query: 5 LRTLLTLALLLAAPLALAADPLSIPAITLSNTPDGQQEYSVSLQILLIMTALSFIPAFVI 64
+R LL++A +L + A +P IT P G Q +S+ +Q L+ +T+L+FIPA ++
Sbjct: 1 MRRLLSVAPVLLWLITPLAFA-QLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILL 59

Query: 65 LMTSFTRIIIVFSILRQALGLQQTPSNQLLTGMALFLTMFIMAPVFDRVNQDALQPYLKE 124
+MTSFTRIIIVF +LR ALG P NQ+L G+ALFLT FIM+PV D++ DA QP+ +E
Sbjct: 60 MMTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEE 119

Query: 125 QMTAQQAIDKAQGPLKDFMLAQTRQSDLDLFMRLSKRTDIAGPDQVPLTILVPAFVTSEL 184
+++ Q+A++K PL++FML QTR++DL LF RL+ + GP+ VP+ IL+PA+VTSEL
Sbjct: 120 KISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSEL 179

Query: 185 KTAFQIGFMIFIPFLIIDMVVASVLMAMGMMMLSPLIISLPFKIMLFVLVDGWALIMGTL 244
KTAFQIGF IFIPFLIID+V+ASVLMA+GMMM+ P I+LPFK+MLFVLVDGW L++G+L
Sbjct: 180 KTAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSL 239

Query: 245 ASSF 248
A SF
Sbjct: 240 AQSF 243


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_4357  FLGMOTORFLIN  120  1e-37  Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 120 bits (301), Expect = 1e-37
Identities = 69/159 (43%), Positives = 96/159 (60%), Gaps = 29/159 (18%)

Query: 7 MANENEITSPEDQALADEWAAALEE-----TGSAGQADIDALLGGDTGSSSGPGRLPMEE 61
M++ N + AL D WA AL E T SA A L GGD SG +
Sbjct: 1 MSDMNNPSDENTGALDDLWADALNEQKATTTKSAADAVFQQLGGGDV---SGAMQ----- 52

Query: 62 FASSPKPNENVSLEGPNLDVILDIPVNISMEVGSTEINIRNLLQLNQGSVIELDRLAGEP 121
++D+I+DIPV +++E+G T + I+ LL+L QGSV+ LD LAGEP
Sbjct: 53 ----------------DIDLIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEP 96

Query: 122 LDVLVNGTLIAHGEVVVVNEKFGIRLTDVISPSERIKKL 160
LD+L+NG LIA GEVVVV +K+G+R+TD+I+PSER+++L
Sbjct: 97 LDILINGYLIAQGEVVVVADKYGVRITDIITPSERMRRL 135


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_4358    FLGMOTORFLIM    257    2e-86    Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 257 bits (657), Expect = 2e-86
Identities = 95/324 (29%), Positives = 164/324 (50%), Gaps = 9/324 (2%)

Query: 5 DLLSQDEIDALLHGVDDGLVQTESASEPGSIKS---YDLTSQDRIVRGRMPTLEMINERF 61
++LSQDEID LL + G E A + YD D+ + +M TL +++E F
Sbjct: 3 EVLSQDEIDQLLTAISSGDASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLMHETF 62

Query: 62 ARYTRISMFNLLRRSADVAVGGVQVMKFGEYVHSLYVPTSLNLVKIKPLRGTSLFILDAK 121
AR T S+ LR V V V + + E++ S+ P++L ++ + PL+G ++ +D
Sbjct: 63 ARLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVLEVDPS 122

Query: 122 LVFKLVDNFFGGDGRHAKIEGREFTPTELRVVRMVLDQCFVDLKEAWQAIMPVSFEYMNS 181
+ F ++D FGG G+ AK++ R+ T E V+ V+ + +++E+W ++ +
Sbjct: 123 ITFSIIDRLFGGTGQAAKVQ-RDLTDIENSVMEGVIVRILANVRESWTQVIDLRPRLGQI 181

Query: 182 EVNPAMANIVGPSEAVVVSTFHIELDGGGGDLHVTMPYSMIEPVREMLDAGF--QSDLDD 239
E NP A IV PSE VV+ T ++ G ++ +PY IEP+ L + F S
Sbjct: 182 ETNPQFAQIVPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFSSVRRS 241

Query: 240 QDERWVKALREDVLDVAVPMTATVARRQLKLRDILHMQPGDVIPVE---LPEHLVLRANG 296
+++ LR+ + V + + A V +L +RDIL ++ GD+I + + + VL
Sbjct: 242 STTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFVLSIGN 301

Query: 297 VPAFKARLGSHKGNLALQIIDPIE 320
F + G +A QI++ IE
Sbjct: 302 RKKFLCQPGVVGKKIAAQILERIE 325


60    PP_4403    PP_4471    Y    Y    Y    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
PP_4403-112-3.877432branched-chain alpha-keto acid dehydrogenase
PP_4404132-9.772870dihydrolipoamide dehydrogenase
PP_4405238-11.666429PAS/PAC sensor-containing diguanylate cyclase
PP_4406444-12.955081hypothetical protein
PP_4407448-13.480694amino acid ABC transporter permease
PP_4409655-15.255011phage integrase site specific recombinase
PP_4410654-14.344132hypothetical protein
PP_4411645-10.235495hypothetical protein
PP_4412647-10.754892hypothetical protein
PP_4413748-10.211027hypothetical protein
PP_4415743-9.489670hypothetical protein
PP_4414539-7.087032hypothetical protein
PP_4416531-6.410389hypothetical protein
PP_4417332-5.775679hypothetical protein
PP_4418235-6.675228hypothetical protein
PP_4419237-7.302466transposase, OrfA
PP_4420137-7.364580transposase, OrfB
PP_4421139-8.109662hypothetical protein
PP_4422244-8.913250succinate-semialdehyde dehydrogenase
PP_4423345-9.544010hypothetical protein
PP_4424344-8.043074AsnC family transcriptional regulator
PP_4425343-6.863415amino acid ABC transporter ATP-binding protein
PP_4426346-6.932623amino acid ABC transporter permease
PP_4427341-5.907962amino acid ABC transporter permease
PP_4428242-6.918809amino acid ABC transporter substrate-binding
PP_4429245-7.675138GntR family transcriptional regulator
PP_4430245-8.191709threonine dehydratase
PP_4431245-8.634222ectoine utilization protein EutC
PP_4432142-8.322151peptidase, M24 family protein
PP_4433141-7.647363amino acid MFS transporter
PP_4434136-6.026696D-amino acid dehydrogenase small subunit
PP_4435129-3.704747hypothetical protein
PP_4437227-3.207463ISPpu14, transposase Orf1
PP_4438125-3.114079ISPpu14, transposase Orf2
PP_4439025-3.463409ISPpu14, transposase Orf3
PP_4441030-4.207432ISPpu14, transposase Orf1
PP_4442234-7.320717ISPpu14, transposase Orf2
PP_4443339-10.084600ISPpu14, transposase Orf3
PP_4444445-11.906239transposase
PP_4445446-11.675216IS3 family transposase
PP_4446553-14.094662group II intron-encoding maturase
PP_4447557-15.046875hypothetical protein
PP_4448353-12.312568hypothetical protein
PP_4449253-10.160409hypothetical protein
PP_4450252-9.796239phosphoglycerate mutase
PP_4451152-9.027954hypothetical protein
PP_4452048-6.771649NAD/NADP octopine/nopaline dehydrogenase
PP_4453046-6.274339opine ABC transporter ATP-binding protein
PP_4454-143-6.029825opine ABC transporter permease
PP_4455044-7.011483opine ABC transporter substrate-binding protein
PP_4456146-7.647953nopaline dehydrogenase
PP_4457246-8.532769nopaline dehydrogenase
PP_4458248-9.367569opine ABC transporter substrate-binding protein
PP_4459153-10.511955transposase
PP_4460261-11.906081LysR family transcriptional regulator
PP_4461258-11.208637major facilitator family transporter
PP_4462356-11.2547714-hydroxy-4-methyl-2-oxoglutarate aldolase
PP_4463354-11.013492carbon-nitrogen hydrolase
PP_4464353-9.629301LysR family transcriptional regulator
PP_4465346-8.348431porin
PP_4466227-5.041005TauD/TfdA family dioxygenase
PP_4467224-4.231829LysR family transcriptional regulator
PP_4468115-2.471487Cro/CI family transcriptional regulator
PP_4469114-0.901592**phosphonate metabolism
PP_4470214-1.117032Arc domain-containing protein DNA binding
PP_4471214-1.145832magnesium transporter
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_4409    LUXSPROTEIN    28    0.042    Bacterial autoinducer-2 (AI-2) production protein Lu...
		>LUXSPROTEIN#Bacterial autoinducer-2 (AI-2) production protein LuxS signature.
Length = 171

Score = 28.3 bits (63), Expect = 0.042
Identities = 11/28 (39%), Positives = 18/28 (64%), Gaps = 1/28 (3%)

Query: 367 PNQQHLASKLGLHSLRHSNIFFLKNYLE 394
PN+ L+ K G+H+L H F++N+L
Sbjct: 42 PNKDILSEK-GIHTLEHLYAGFMRNHLN 68


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_4433    TCRTETA    40    2e-05    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 39.8 bits (93), Expect = 2e-05
Identities = 40/151 (26%), Positives = 59/151 (39%), Gaps = 31/151 (20%)

Query: 69 LATFAIS-FLIRPLGGMFWGPLGDRIGRKRVLAMTVLMMSVATFIIGILPTYQSVGWIAP 127
LA +A+ F P+ G L DR GR+ VL +++ +V I+ P W+
Sbjct: 49 LALYALMQFACAPVLGA----LSDRFGRRPVLLVSLAGAAVDYAIMATAPFL----WV-- 98

Query: 128 VALIALRLIQGFSTGGEYGGAATFLAEYAPDKRR----GFYGSFLEFGSLAGFSLGALVT 183
L R++ G TG A ++A+ R GF + FG +AG LG L
Sbjct: 99 --LYIGRIVAGI-TGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGL-- 153

Query: 184 LSVSVVIGDAAMYEWGWRVPFLIAAPLGVIA 214
M + PF AA L +
Sbjct: 154 -----------MGGFSPHAPFFAAAALNGLN 173



Score = 30.6 bits (69), Expect = 0.015
Identities = 17/71 (23%), Positives = 34/71 (47%), Gaps = 4/71 (5%)

Query: 261 VLVAALNITYYILLAYMPTYMHKEVGASENMSLLAPLVGMLAMMMFI--PFAGRISDVVG 318
V + A+ I +++ +P + V +++ + L+ + A+M F P G +SD G
Sbjct: 14 VALDAVGIG--LIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFG 71

Query: 319 FKKMWFFSLIG 329
+ + SL G
Sbjct: 72 RRPVLLVSLAG 82


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_4458    RTXTOXINC    29    0.036    Gram-negative bacterial RTX toxin-activating protein C...
		>RTXTOXINC#Gram-negative bacterial RTX toxin-activating protein C signature.
Length = 170

Score = 28.7 bits (64), Expect = 0.036
Identities = 15/42 (35%), Positives = 23/42 (54%), Gaps = 11/42 (26%)

Query: 159 KYPDTAF----------LGRVSNYHGGQIVSKAAAEKLGERY 190
K+PD F +G+VS +HGG+I K A K+ ++Y
Sbjct: 110 KFPDELFRAIRVDPKTHVGKVSEFHGGKI-DKQLANKIFKQY 150


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_4461    TCRTETA    37    1e-04    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 37.1 bits (86), Expect = 1e-04
Identities = 45/270 (16%), Positives = 91/270 (33%), Gaps = 36/270 (13%)

Query: 72 VIGRIGDKRGRKPAMLLTIICMAIGSIGIGLIPTYETIGVGAPILLVMLRCLQGFAAGGE 131
V+G + D+ GR+P +L+++ A+ Y + + ++ + + G
Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVD---------YAIMATAPFLWVLYIGRIVAGITGAT 112

Query: 132 WGTSASYIVEWSPAGRKGFFGSFQSVSSSGGALLASLVASALLLIPAEDLLDWGWRVPFI 191
+ +YI + + + F S G + ++ + PF
Sbjct: 113 GAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSP--------HAPFF 164

Query: 192 AGG-LAIFAFSL-FLRAHAEETPEYVNSKAEVVNPTDSKPYVLGLQAFGFTIFWTTLSYL 249
A L F E + E +NP S + G+ + + L
Sbjct: 165 AAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQL 224

Query: 250 VS----AYMVTYTQNHAGLTRTEALIS-SNIALLLQITLIPVAGALSDRFGRKP------ 298
V A V + ++ T IS + +L + + G ++ R G +
Sbjct: 225 VGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGM 284

Query: 299 ------LLLLACLGTATLAYPILNLMSGGA 322
+LLA +A+PI+ L++ G
Sbjct: 285 IADGTGYILLAFATRGWMAFPIMVLLASGG 314



Score = 36.3 bits (84), Expect = 2e-04
Identities = 20/38 (52%), Positives = 27/38 (71%), Gaps = 1/38 (2%)

Query: 278 LLQITLIPVAGALSDRFGRKPLLLLACLGTATLAYPIL 315
L+Q PV GALSDRFGR+P+LL++ G A + Y I+
Sbjct: 54 LMQFACAPVLGALSDRFGRRPVLLVSLAG-AAVDYAIM 90


61    PP_4521    PP_4530    Y    Y    N    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
PP_45212111.929286aerotaxis receptor
PP_45223141.636329LysR family transcriptional regulator
PP_45232141.439028agmatinase
PP_45244161.056251Na+/solute symporter
PP_45251190.377083LysR family transcriptional regulator
PP_4526221-0.565224hypothetical protein
PP_4527011-2.017717hypothetical protein
PP_4528011-1.610024hypothetical protein
PP_4529111-1.969238hypothetical protein
PP_4530210-3.703435hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_4521    RTXTOXINA    30    0.023    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.
Length = 1024

Score = 30.3 bits (68), Expect = 0.023
Identities = 20/86 (23%), Positives = 38/86 (44%), Gaps = 11/86 (12%)

Query: 281 SATAIAQMAATIQQVTHNVQS---TAHAASDADQLAQQ--------GSELALKSLKDMGS 329
A I + Q+ TA ++ D+L ++ SELA S++ +
Sbjct: 131 GAENIGDNLGKAGGILSTFQNFLGTALSSMKIDELIKKQKSGGNVSSSELAKASIELINQ 190

Query: 330 MSDAVNDIGQAVNALAEQTQSIGSVV 355
+ D V + VN+ ++Q ++GSV+
Sbjct: 191 LVDTVASLNNNVNSFSQQLNTLGSVL 216


62    PP_4737    PP_4743    Y    N    N    Genomic Island
LocusTag    DNBias    CDNBias    %GCBias    Product
PP_4737019-3.281728D-lactate dehydrogenase
PP_4738230-9.085859hypothetical protein
PP_4739128-8.499655hypothetical protein
PP_4740228-8.510954type I restriction-modification system, R
PP_4741230-8.272176type I restriction-modification system, M
PP_4742233-8.271686type I restriction-modification system subunit
PP_4743124-6.929612hypothetical protein
63    PP_4759    PP_4784    Y    Y    N    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
PP_4759-1163.359844GntR family transcriptional regulator
PP_47600143.638576zinc-containing alcohol dehydrogenase
PP_4761-1143.258854HAD superfamily hydrolase
PP_4762-1142.940488acyl-CoA thioesterase
PP_47630153.002582acetyltransferase
PP_4764-1142.716186histone deacetylase superfamily protein
PP_47651162.669462hypothetical protein
PP_47661182.111058DEAD/DEAH box helicase
PP_47671143.326211hypothetical protein
PP_47681123.030245DNA polymerase III subunit epsilon
PP_47692123.122912hypothetical protein
PP_47701133.203776periplasmic ligand-binding sensor protein
PP_47711122.735515hypothetical protein
PP_47720122.660609ATP-dependent helicase HrpB
PP_47733100.887396hypothetical protein
PP_47742111.719762cation diffusion facilitator family transporter
PP_47752121.011179hypothetical protein
PP_47760101.400759AsnC family transcriptional regulator
PP_47770102.073458hypothetical protein
PP_4779-1122.447557AMP nucleosidase
PP_4780-2112.474295acyl-CoA dehydrogenase
PP_4781-3121.820069integral membrane sensor signal transduction
PP_47820232.263178phosphomethylpyrimidine kinase
PP_47831201.260529thiamine-phosphate pyrophosphorylase
PP_4784217-0.202539glutamate-1-semialdehyde aminotransferase
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_4768    RTXTOXINA    29    0.013    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.
Length = 1024

Score = 29.2 bits (65), Expect = 0.013
Identities = 12/35 (34%), Positives = 20/35 (57%), Gaps = 2/35 (5%)

Query: 46 GVPVPAFVAGLTGITTAMLRSA--PPVEKVMNEVA 78
G PV A V +TGI + +L ++ E V +++A
Sbjct: 392 GAPVSALVGAVTGIISGILEASKQAMFEHVASKMA 426


64    PP_4824    PP_4830    Y    Y    N    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
PP_4824272.525470integral membrane sensor hybrid histidine
PP_4825184.203592multiple antibiotic resistance protein MarC
PP_4826294.734449precorrin-3B C(17)-methyltransferase
PP_48272114.840861precorrin-2 C(20)-methyltransferase
PP_48283115.073213precorrin-8X methylmutase
PP_48292114.582151precorrin-3B synthase
PP_48301113.976341precorrin-6y C5,15-methyltransferase subunit
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_4824    HTHFIS    74    7e-16    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 74.5 bits (183), Expect = 7e-16
Identities = 31/142 (21%), Positives = 50/142 (35%), Gaps = 11/142 (7%)

Query: 641 ARVLVVDDNDTCRKVLVQQCSAWGMNVSAVPSGKEALALLRTKAHLRDYFDAVLLDQNMP 700
A +LV DD+ R VL Q S G +V + + D V+ D MP
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIA-----AGDGDLVVTDVVMP 58

Query: 701 GMTGMQLAAKIKEDPSLNHDILVVMLTGISNAPSKVIARNAGVKRILAKPVAGYTLKTTL 760
L +IK+ D+ V++++ + + + A G L KP L +
Sbjct: 59 DENAFDLLPRIKK---ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFD---LTELI 112

Query: 761 AEELAQRGREQAMPASLPGIPQ 782
+ P+ L Q
Sbjct: 113 GIIGRALAEPKRRPSKLEDDSQ 134



Score = 64.1 bits (156), Expect = 1e-12
Identities = 26/129 (20%), Positives = 51/129 (39%), Gaps = 5/129 (3%)

Query: 791 RVLVAEDNSISTKVIRSMLGKLNLKPDTACNGEEALQAMKAQHYDLVLMDCEMPILDGFS 850
+LVA+D++ V+ L + N + + A DLV+ D MP + F
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 851 ATQQLRAWEAENQRQRTPVVALTAHILAEHKERARLAGMDGHMAKPVELSQLRELIQYWA 910
+++ R PV+ ++A +A G ++ KP +L++L +I
Sbjct: 65 LLPRIKKA-----RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119

Query: 911 NHREAKVDP 919
+ +
Sbjct: 120 AEPKRRPSK 128


65    PP_5354    PP_5360    Y    N    N    Genomic Island
LocusTag    DNBias    CDNBias    %GCBias    Product
PP_5354-114-3.236085hypothetical protein
PP_5355017-3.088760sodium/hydrogen exchanger
PP_5356023-3.305074thioesterase
PP_5357026-3.394799pyridoxamine kinase
PP_5358029-4.405584hypothetical protein
PP_5359-125-4.089489CobW/P47K family protein
PP_5360-220-3.606123hypothetical protein
66    PP_0043    PP_0051    N    Y    N    Pathogenicity Island (unbiased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
PP_0043225-4.901261CzcA family cobalt/zinc/cadmium efflux
PP_0044334-5.963012CzcB family cobalt/zinc/cadmium efflux
PP_0045136-6.612485CzcC family cobalt/zinc/cadmium efflux
PP_0046032-6.080910porin
PP_0047134-5.461829DNA-binding heavy metal response regulator
PP_0048-127-3.458940hypothetical protein
PP_0049-124-1.929478hypothetical protein
PP_00500150.281483hypothetical protein
PP_0051-1130.474817Fis family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_0043    ACRIFLAVINRP    806    0.0    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 806 bits (2084), Expect = 0.0
Identities = 234/1064 (21%), Positives = 433/1064 (40%), Gaps = 59/1064 (5%)

Query: 5 IIRFAIEQRIVVMIAVLIMAGIGIYSYQKLPIDAVPDITNVQVQINTAAPGYSPLETEQR 64
+ F I + I + +I+ G + +LP+ P I V ++ PG +
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 65 ITFPVETAMAGLPGLQQTRSLSRS-GLSQVTVIFKDGTDIFFARQLINERLQVAKEQLPE 123
+T +E M G+ L S S S G +T+ F+ GTD A+ + +LQ+A LP+
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 124 GVEAVMGPVSTGLGEIFLWTVEAEDGAVKEDGTPYTPTDLRVIQDWIIKPQLRNVPGVAE 183
V+ V +L D T D+ +K L + GV +
Sbjct: 121 EVQQQGISVEKSSS-SYLMVA-----GFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGD 174

Query: 184 INTIGGYAKQFLVAPDPKRLATYKLTLNDLVAALESNNANVGAGYI------ERNGEQLL 237
+ G + D L YKLT D++ L+ N + AG +
Sbjct: 175 VQLFGA-QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNAS 233

Query: 238 IRAPGQVGNIEDIANIVI-TSVDGAPIRISSVADVSIGKELRTGAATENGREVVLGTVFM 296
I A + N E+ + + + DG+ +R+ VA V +G E A NG+ + +
Sbjct: 234 IIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKL 293

Query: 297 LIGENSRTVSQAVAAKLADINRTLPKGVVAVTVYDRTNLVEKAIATVKKNLVEGAILVIA 356
G N+ ++A+ AKLA++ P+G+ + YD T V+ +I V K L E +LV
Sbjct: 294 ATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFL 353

Query: 357 ILFLFLGNIRAALITAMVIPLSMLFTFTGMFNNKVSANLMSLG--ALDFGIIVDGAVVIV 414
+++LFL N+RA LI + +P+ +L TF + S N +++ L G++VD A+V+V
Sbjct: 354 VMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVV 413

Query: 415 ENAIRRLAHAQHKHGRMLTKTERFHEVFAAAREARRPLIFGQLIIMVVYLPIFALTGVEG 474
EN R + K + + + L+ +++ V++P+ G G
Sbjct: 414 ENVERVMMED---------KLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTG 464

Query: 475 KMFHPMAFTVVMALLGAMVLSVTFVPAAIAMFVTGKVKEEEGVVMRTARL---------- 524
++ + T+V A+ ++++++ PA A + E
Sbjct: 465 AIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVN 524

Query: 525 RYEPVLQWVLGHRNIAFSAAVALVVLSGLLASRMGSEFIPSLSEGDFAMQAMRVPGTSL- 583
Y + +LG +V +L R+ S F+P +G F G +
Sbjct: 525 HYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQE 584

Query: 584 -TQSVEMQQRLEKAVIAQVPEVERMFARSGTAEIASDPMPPNASDAYIMLKPQDQWPNPK 642
TQ V + Q + + + VE +F +G + NA A++ LKP ++ +
Sbjct: 585 RTQKV-LDQVTDYYLKNEKANVESVFTVNG---FSFSGQAQNAGMAFVSLKPWEERNGDE 640

Query: 643 KPRDELIAEVQKAAAGVPGSNYELSQPIQLRFNELISGVRSDVA-VKVFGDDMDVLNNTA 701
+ +I + + EL + D + G D L
Sbjct: 641 NSAEAVIHRAKMELGKIRDGFVIPFNM--PAIVELGTATGFDFELIDQAGLGHDALTQAR 698

Query: 702 NKIAAALKAVPGS-SEVKVEQTSGLPVLTINIDREKAARYGLNIADVQNSIAIAVGGRQA 760
N++ P S V+ + +D+EKA G++++D+ +I+ A+GG
Sbjct: 699 NQLLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYV 758

Query: 761 GTLYEGDRRFDMVVRLPETVRTDVAGMSSLLIPVPANAAQGANQIGFIPLSQVANLDLQL 820
+ R + V+ R + L V + + +P S
Sbjct: 759 NDFIDRGRVKKLYVQADAKFRMLPEDVDKL--YVRSANGE------MVPFSAFTTSHWVY 810

Query: 821 GPNQISRENGKRLVIVSANVRGRDLGSFVEEATASLDK-KVQIPAGYWTTWGGQFEQLQS 879
G ++ R NG + + G+ +A A ++ ++PAG W G Q +
Sbjct: 811 GSPRLERYNGLPSMEIQGEAAP---GTSSGDAMALMENLASKLPAGIGYDWTGMSYQERL 867

Query: 880 AAKRLQIVVPVALLLVMTLLFLMFNNLKDGMLVFTGIPFALTGGVVALWLRDIPLSISAG 939
+ + +V ++ ++V L ++ + + V +P + G ++A L + +
Sbjct: 868 SGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFM 927

Query: 940 VGFIALSGVAVLNGLVMIAFIRGLRE-EGRTLRQAVDEGALTRLRPVLMTALVASLGFIP 998
VG + G++ N ++++ F + L E EG+ + +A RLRP+LMT+L LG +P
Sbjct: 928 VGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLP 987

Query: 999 MALATGTGAEVQRPLATVVIGGILSSTALTLLVLPALYHWAHRK 1042
+A++ G G+ Q + V+GG++S+T L + +P + R
Sbjct: 988 LAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRRC 1031


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_0044    RTXTOXIND    47    8e-08    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 47.1 bits (112), Expect = 8e-08
Identities = 24/139 (17%), Positives = 53/139 (38%), Gaps = 16/139 (11%)

Query: 149 ASQQISDLRSEQQAAQRRVELARVTFEREKQLWQDKISAEQDYLQARQALQEAEISLANA 208
A ++ +S+ + + + A+ ++ QL++++I Q + + LA
Sbjct: 264 AVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEIL--DKLRQTTDNIGLLTLELAKN 321

Query: 209 KQKVGAIGASVNSVGGNRYELRAPFDAVVVE-KHLTVGEVVSEATNAFILSDLNQV-WAT 266
+++ +RAP V + K T G VV+ A ++ + T
Sbjct: 322 EERQQ------------ASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVT 369

Query: 267 FAVPPTDLGKVTTGRAVKV 285
V D+G + G+ +
Sbjct: 370 ALVQNKDIGFINVGQNAII 388



Score = 39.4 bits (92), Expect = 2e-05
Identities = 21/130 (16%), Positives = 44/130 (33%), Gaps = 13/130 (10%)

Query: 88 AGVALEAAAPRDLGTVVSFPGEIRFDEDRTAHVVPRVPGVVEAVQANLGETVKKGQVLAV 147
+A + + V + G++ + P +V+ + GE+V+KG VL
Sbjct: 68 LVIAFILSVLGQVEIVATANGKLTHSGRSKE-IKPIENSIVKEIIVKEGESVRKGDVLLK 126

Query: 148 IASQQISDLRSEQQAAQRRVELARVTFER---------EKQLWQDKISAEQDYLQARQAL 198
+ + ++ Q + AR+ R +L + K+ E + +
Sbjct: 127 LTALG---AEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEE 183

Query: 199 QEAEISLANA 208
SL
Sbjct: 184 VLRLTSLIKE 193


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_0045    IGASERPTASE    32    0.008    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 31.6 bits (71), Expect = 0.008
Identities = 25/174 (14%), Positives = 54/174 (31%), Gaps = 8/174 (4%)

Query: 170 GRVRAGKSSPVEATRAQVQLAEAQLQVRRAETEKATAYQQLAQITGSSVTVFDRLESPTL 229
V S V+A ++A++ + + +T + + + + V E P +
Sbjct: 1066 REVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKV 1125

Query: 230 SPGLPPRTEDLLAKLDQTAEMRQ--AVVQIDKSDASLGSEKAQRIPNLTVSVGSQYDRSV 287
+ + P+ E Q R+ V I + + + P S +V
Sbjct: 1126 TSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETS------SNV 1179

Query: 288 RERVNTVGLSMPLPLFDRNQGNILSASRRADQARDQRNAVELRLRTETQTALNQ 341
+ V N N A+ + + N + R R ++ +
Sbjct: 1180 EQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHN 1233


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_0047    HTHFIS    77    1e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 76.8 bits (189), Expect = 1e-18
Identities = 30/129 (23%), Positives = 62/129 (48%), Gaps = 1/129 (0%)

Query: 2 RILVIEDEVKTAEYVRQGLTECGYVVDCVHTGSDGLFLAKQHEYELIILDINLPEMDGWQ 61
ILV +D+ + Q L+ GY V + + +L++ D+ +P+ + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VLELLRRKNCPSRIMMLTARSRLADKVRGLENGADDYLIKPFEFPELLARV-RALMRRSD 120
+L +++ +++++A++ ++ E GA DYL KPF+ EL+ + RAL
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 121 HPASVEVIR 129
P+ +E
Sbjct: 125 RPSKLEDDS 133


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_0051    HTHFIS    354    e-121    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 354 bits (910), Expect = e-121
Identities = 123/367 (33%), Positives = 192/367 (52%), Gaps = 43/367 (11%)

Query: 106 ILDEQGKVVAFVERLTTITLASAQPQEQGLVGRAPTFKAALASLQRAAPAQIPVLLQGES 165
++ G+ +A +R + +Q LVGR+ + L R + +++ GES
Sbjct: 111 LIGIIGRALAEPKRRPSKLEDDSQ-DGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGES 169

Query: 166 GTGKELFARAVHMGSPRANGPLVVVDCTGLTESLFESELFGYEKGAFTGANQRKIGLAEA 225
GTGKEL ARA+H R NGP V ++ + L ESELFG+EKGAFTGA R G E
Sbjct: 170 GTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQ 229

Query: 226 AHGGTLFLDEIGEVPLAMQVKLLRLIESGSFRPVGSLRTVHSDFRLISATHKPLKQMVAD 285
A GGTLFLDEIG++P+ Q +LLR+++ G + VG + SD R+++AT+K LKQ +
Sbjct: 230 AEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQ 289

Query: 286 GSFREDLYYRISGFPIRLPALRERVKDLPLLAQSLLQRMA--GQPTPRLSEDALQQLALH 343
G FREDLYYR++ P+RLP LR+R +D+P L + +Q+ G R ++AL+ + H
Sbjct: 290 GLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAH 349

Query: 344 PFPGNIRELRNILERARLFADDGVIRPEHLPEDT-------------------GLTGAAK 384
P+PGN+REL N++ R VI E + + ++ A +
Sbjct: 350 PWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVE 409

Query: 385 GSGRR------------NDLGE---------LAQALEQFKGSRSELASHLGMSERTLYRR 423
+ R+ + AL +G++ + A LG++ TL ++
Sbjct: 410 ENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKK 469

Query: 424 LKALGLN 430
++ LG++
Sbjct: 470 IRELGVS 476
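
Several hits in this report print the E-value without a leading mantissa (`Expect = e-121`, `e-154`, `e-106`), which a plain `float()` call rejects. The following is a minimal Python sketch for reading the `Score = ... bits (...), Expect = ...` line; `parse_score_line` is a hypothetical name, and treating a bare `e-NNN` as `1e-NNN` is an assumption made here for illustration.

```python
import re

# Hypothetical helper; the pattern assumes lines shaped like
# "Score = 354 bits (910), Expect = e-121".
STAT = re.compile(r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)")


def parse_score_line(line):
    """Return (bit_score, raw_score, e_value) parsed from one Score/Expect line."""
    m = STAT.search(line)
    if m is None:
        raise ValueError("not a Score/Expect line: %r" % line)
    expect_text = m.group(3)
    if expect_text.startswith("e"):
        # Assumption: a bare exponent such as "e-121" is read as 1e-121.
        expect_text = "1" + expect_text
    return float(m.group(1)), int(m.group(2)), float(expect_text)


print(parse_score_line("Score = 354 bits (910), Expect = e-121"))
print(parse_score_line("Score = 28.3 bits (63), Expect = 0.042"))
```

With values parsed this way, hits can be compared numerically, for instance to separate strong matches such as PP_0051 from borderline ones such as PP_4409.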


67    PP_0129    PP_0134    N    Y    N    Pathogenicity Island (unbiased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
PP_0129-1100.116621diguanylate cyclase
PP_0130-312-0.030682N-acetylmuramoyl-L-alanine amidase
PP_0131-3130.149810diguanylate phosphodiesterase
PP_0132-211-0.435764multi-sensor signal transduction histidine
PP_0133-213-0.384139Fis family transcriptional regulator
PP_0134-115-0.242663transport-associated protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_0129    PF03544    30    0.030    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 29.6 bits (66), Expect = 0.030
Identities = 17/95 (17%), Positives = 29/95 (30%), Gaps = 5/95 (5%)

Query: 185 APVALPAAEVVPQAAGSAEPSRHDDVDALQPLPAPAVPTAEVAEAPELELEAFGPALIET 244
PV P E P E ++ +P P P + E P+ +++
Sbjct: 71 EPVVEPEPEPEPIPEPPKEAPV--VIEKPKPKPKPKPKPVKKVEQPKRDVKPVESR---P 125

Query: 245 AEMPPAPLPRDKTPLPEAEVSASESLAEAEPLQAL 279
A P T ++ + A +AL
Sbjct: 126 ASPFENTAPARPTSSTATAATSKPVTSVASGPRAL 160


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_0131    SECYTRNLCASE    31    0.015    Preprotein translocase SecY subunit signature.
		>SECYTRNLCASE#Preprotein translocase SecY subunit signature.

Length = 437

Score = 30.9 bits (70), Expect = 0.015
Identities = 13/56 (23%), Positives = 25/56 (44%), Gaps = 2/56 (3%)

Query: 6 AFIALLRQIFYRPWMLATLAALASAAVLLSASIGIALQQMKQSESEQMNAQGERFL 61
IAL+ + + + ++L+ +G+ L+ +KQ ES+ E FL
Sbjct: 383 GLIALVPTMALVGFGASQNFPFGGTSILI--IVGVGLETVKQIESQLQQRNYEGFL 436


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_0132    PF06580    53    2e-09    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 52.6 bits (126), Expect = 2e-09
Identities = 34/187 (18%), Positives = 66/187 (35%), Gaps = 39/187 (20%)

Query: 415 LETIG----EEMQRLTQLINDLLNFSRYQSGLQKLELAPCA-----IDDLLDHAQSRFAE 465
L I E+ + +++ L RY A +D L A +F +
Sbjct: 179 LNNIRALILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFED 238

Query: 466 QAAHKQIELIKELDPPLPRIQADVAQLDRVLDNLLHNAIRH----TANGGRIRLHARRHA 521
+ ++ +++P + +Q V + ++ L+ N I+H GG+I L +
Sbjct: 239 R-----LQFENQINPAIMDVQ--VPPM--LVQTLVENGIKHGIAQLPQGGKILLKGTKDN 289

Query: 522 ERVIISVEDNGEGISYGQQARIFEPFVQVGRKKGGAGLGLALCKE-IVQLHGGRMGVF-- 578
V + VE+ G + K G GL +E + L+G +
Sbjct: 290 GTVTLEVENTGSL--------------ALKNTKESTGTGLQNVRERLQMLYGTEAQIKLS 335

Query: 579 SRPGQGT 585
+ G+
Sbjct: 336 EKQGKVN 342


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_0133    HTHFIS    440    e-154    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 440 bits (1134), Expect = e-154
Identities = 162/476 (34%), Positives = 239/476 (50%), Gaps = 40/476 (8%)

Query: 9 GRILLVDDESAILRTFRYCLEDEGYSVATANSAAQAETLLQRQVFDLCFLDLRLGEDNGL 68
IL+ DD++AI L GY V ++AA + DL D+ + ++N
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 69 DVLAQMRIQAPWMRVVIVTAHSAIDTAVDAIQAGAADYLVKPCSPDQLRLATAKQLEVRQ 128
D+L +++ P + V++++A + TA+ A + GA DYL KP +L + L +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 129 LSARLEALEGEIRKPKDGLDSHSPAMMAVLETARQVAITDANILILGESGTGKGELARAI 188
R LE + + L S AM + ++ TD ++I GESGTGK +ARA+
Sbjct: 124 --RRPSKLEDDSQDG-MPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARAL 180

Query: 189 HGWSKRARKACVTINCPSLNAELMESELFGHTRGAFTGASESTLGRVSQADGGTLFLDEI 248
H + KR V IN ++ +L+ESELFGH +GAFTGA + GR QA+GGTLFLDEI
Sbjct: 181 HDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEI 240

Query: 249 GDFPLTLQPKLLRFIQDKEYERVGDPVTRRADVRILAATNLNLEEMVRESRFREDLLYRL 308
GD P+ Q +LLR +Q EY VG R+DVRI+AATN +L++ + + FREDL YRL
Sbjct: 241 GDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRL 300

Query: 309 NVITLHLPPLRERSEDILILADRFLARFVKEYSRPARGFSDEARTALLNYRWPGNIRELR 368
NV+ L LPPLR+R+EDI L F+ + KE + F EA + + WPGN+REL
Sbjct: 301 NVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEG-LDVKRFDQEALELMKAHPWPGNVRELE 359

Query: 369 NVVERASIICPQERVEISHL----------------------------------GMGEQP 394
N+V R + + PQ+ + +
Sbjct: 360 NLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASF 419

Query: 395 AGSAPRVGA-ALSLDELERAHIGAVLA-ASDTLDQAAKTLGIDASTLYRKRKQYNL 448
+ P G L E+E I A L +AA LG++ +TL +K ++ +
Sbjct: 420 GDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_0134    PF07201    28    0.030    Hypersensitivity response secretion protein HrpJ
		>PF07201#Hypersensitivity response secretion protein HrpJ

Length = 293

Score = 28.3 bits (63), Expect = 0.030
Identities = 11/42 (26%), Positives = 19/42 (45%), Gaps = 1/42 (2%)

Query: 70 ERQLAEDVARATRGIEQVENLLQLNAQLVERPQELRAYAQRL 111
+R+L++ AR + EQV L +E+ Q + L
Sbjct: 70 KRKLSDSQARVSDVEEQVNQYLSK-VPELEQKQNVSELLSLL 110


68    PP_0161    PP_0168    N    Y    N    Pathogenicity Island (unbiased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
PP_0161015-1.160296FecR anti-FecI sigma factor
PP_0162010-0.632399ECF subfamily RNA polymerase sigma factor
PP_0163821-1.176190GntR family transcriptional regulator
PP_0164821-1.177597hypothetical protein
PP_0165821-1.115511diguanylate cyclase
PP_0166922-1.179862HlyD family type I secretion membrane fusion
PP_0167922-1.186757toxin secretion ATP-binding protein
PP_0168922-1.331955surface adhesion protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_0161    TYPE3OMGPROT    29    0.033    Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature.
Length = 607

Score = 28.7 bits (64), Expect = 0.033
Identities = 11/54 (20%), Positives = 23/54 (42%), Gaps = 1/54 (1%)

Query: 247 IVTQDMRLSNFLAQVSRYRHGYLGCSNEIADLRLSGVFRLEDPEQLLRLLPQTL 300
V + L + L + S++I D ++SG F ++P+ L+ +
Sbjct: 38 YVAKGESLRDLLTDFGANYDATVVVSDKIND-KVSGQFEHDNPQDFLQHIASLY 90


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_0162    PF00577    29    0.010    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 29.0 bits (65), Expect = 0.010
Identities = 10/38 (26%), Positives = 18/38 (47%)

Query: 36 ADAADLAQDTFVRLLQRREQLQLNAPRAFLRTVARGLV 73
+ D +L +++L L P+AF+ ARG +
Sbjct: 136 VPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYI 173


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_0166    RTXTOXIND    318    e-106    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 318 bits (816), Expect = e-106
Identities = 107/426 (25%), Positives = 201/426 (47%), Gaps = 9/426 (2%)

Query: 41 PRVVRLTIWGVILFFVFLIVWASVAPIDEVTRGEGKAIPSSKVQKIQNLEGGIVAEIFAK 100
R RL + ++ F V + + + ++ V GK S + ++I+ +E IV EI K
Sbjct: 54 SRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVK 113

Query: 101 EGQIVEVGQPLLRLDETRFASNVGETEADRLAMALRVERLSA-----EVEDRPLII---D 152
EG+ V G LL+L ++ +T++ L L R E+ P + +
Sbjct: 114 EGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDE 173

Query: 153 EKLRKAAPNQAASEESLYQSRRQQLQDEIGGLQQQLVQRQQELREYSSKRTQYANSLELL 212
+ + + SL + + Q++ + L +++ E ++ +Y N +
Sbjct: 174 PYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVE 233

Query: 213 RKEISMSEPLVATGAISQVEVLRLRRAEVENRGQLDSTALAIPRAEAAIREVQSKIEETR 272
+ + L+ AI++ VL VE +L + + E+ I + + +
Sbjct: 234 KSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVT 293

Query: 273 GKFRSEALTQLNEARTELNKATATSKALDDRVHRTLVTSPVRGIVKQLLVNTIGGVIQPG 332
F++E L +L + + T ++R +++ +PV V+QL V+T GGV+
Sbjct: 294 QLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTA 353

Query: 333 SDIIEVVPLDDTLVIEAKILPKDIAFLHPGQEATVKFTAYDYTIYGGLKAKLEQIGADTI 392
++ +VP DDTL + A + KDI F++ GQ A +K A+ YT YG L K++ I D I
Sbjct: 354 ETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAI 413

Query: 393 TDEDKKTTYYLIKLRTDRSHLGTDEKPLLIIPGMVATVDIMTGKKTIMSYLLKPIMKARS 452
D+ + + + + + + L T K + + GM T +I TG ++++SYLL P+ ++ +
Sbjct: 414 EDQ-RLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVISYLLSPLEESVT 472

Query: 453 EALRER 458
E+LRER
Sbjct: 473 ESLRER 478


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
PP_0168   CABNDNGRPT  91     5e-20    NodO calcium binding signature.
		>CABNDNGRPT#NodO calcium binding signature.

Length = 479

Score = 90.8 bits (225), Expect = 5e-20
Identities = 55/214 (25%), Positives = 86/214 (40%), Gaps = 11/214 (5%)

Query: 8451 GADTIDSGNGNDIIFGDLITLNGVVSEGYQALQTYVAQKSGVEVGAVTTSNVHQYITEHY 8510
T +G+ + ++ +AL V G + + + +Q I +
Sbjct: 260 ANMTTRTGDSVYGFNSNTDRDFYTATDSSKALIFSVWDAGGTDTFDFSGYSNNQRINLNE 319

Query: 8511 TEFDISGAKDGNDILSGGNGNDILFGQGGSDTLNGGKGNDILLGGTGNDTLIGGQGDDIL 8570
F G GN ++ G + G G+D L G ++IL GG GND L GG G D L
Sbjct: 320 GSFSDVGGLKGNVSIAHGVTIENAIGGSGNDILVGNSADNILQGGAGNDVLYGGAGADTL 379

Query: 8571 IGGSGADTFVWKAGD----VGNDVIKDFNKAEGDRIDLKDLLQGEKGSTIDNYLKLTTVE 8626
GG+G DTFV+ +G D I DF D+IDL + S + + T
Sbjct: 380 YGGAGRDTFVYGSGQDSTVAAYDWIADFQ-KGIDKIDLSAFRNEGQLSFVQDQ--FTGKG 436

Query: 8627 GTTTLQVSSEGKL----NAEGGIANADVTIKLEG 8656
LQ + + E G ++ D +++ G
Sbjct: 437 QEVMLQWDAANSITNLWLHEAGHSSVDFLVRIVG 470


69  PP_0482  PP_0488  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
PP_0482   -2      8        -0.855013  bacterioferritin
PP_0483   -2      7        -0.631339  excinuclease ABC subunit A
PP_0484   0       9        -0.209642  major facilitator family transporter
PP_0485   1       9        -0.236938  single-stranded DNA-binding protein
PP_0486   1       11       0.136296   GntR family transcriptional regulator
PP_0487   1       11       1.111002   hypothetical protein
PP_0488   2       13       2.294913   short chain dehydrogenase/reductase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
PP_0482   HELNAPAPROT  38     5e-06    Helicobacter neutrophil-activating protein A family ...
		>HELNAPAPROT#Helicobacter neutrophil-activating protein A family

signature.
Length = 153

Score = 37.5 bits (87), Expect = 5e-06
Identities = 19/114 (16%), Positives = 43/114 (37%), Gaps = 17/114 (14%)

Query: 39 KLYERINHEMEEETQHADALMRRILMLEGTP---------DMRADDLEVGSTVPEMIEAD 89
L+E+ + + D + R+L + G P D ++ EM++A
Sbjct: 45 TLHEKFEELYDHAAETVDTIAERLLAIGGQPVATVKEYTEHASITDGGNETSASEMVQAL 104

Query: 90 LKLEYKVRGALCKGIELCELHKDYVSRDILRAQLADTEEDHTYWLEKQQGLIKA 143
+ ++ I L E ++D + D+ + + +EKQ ++ +
Sbjct: 105 VNDYKQISSESKFVIGLAEENQDNATADLFVGLIEE--------VEKQVWMLSS 150


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_0484   TCRTETB  79     8e-18    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 78.8 bits (194), Expect = 8e-18
Identities = 74/375 (19%), Positives = 141/375 (37%), Gaps = 58/375 (15%)

Query: 33 MVLPV-LATYGMDLAGATPALIGLAIGAYGLTQAVLQIPFGMISDRIGRRPVIYLGLVIF 91
MVL V L D PA A+ LT ++ +G +SD++G + ++ G++I
Sbjct: 31 MVLNVSLPDIANDF-NKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIIN 89

Query: 92 ALGSVLAAQADSIWGV-IAGRILQGAG--AISAAVMALLSDLTREQHRTKAMAMIGMSIG 148
GSV+ S + + I R +QGAG A A VM +++ +++R KA +IG +
Sbjct: 90 CFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVA 149

Query: 149 LSFAVAMVVGPLLTSAFGLSGLFLVTAGLALVGILLIAFVVPSTHSILQHRESGVARQAI 208
+ V +G ++ S L L+ + L+ +
Sbjct: 150 MGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEV---------------- 193

Query: 209 GPTLRHPDLLRLDVSIFILHAVLMASFVALPLAFVERGGLPKEQHWWVYLTALFISFFAM 268
R+ I +LM+ + + F + +L +SF
Sbjct: 194 ----------RIKGHFDIKGIILMSVGIVFFMLFT-------TSYSISFLIVSVLSFLIF 236

Query: 269 VPFIIYGEKKRKMKRVLAGAVSVLLLTEIYFWEWADGLRGLVIGTVVFFT--AFNLLEAS 326
V + +++V V L I F + G++ G ++F T F +
Sbjct: 237 V---------KHIRKVTDPFVDPGLGKNIPF------MIGVLCGGIIFGTVAGFVSMVPY 281

Query: 327 LPSLVSKVSPAGGKGTAMGVYSTSQFLGAALGGILGGWLFQHGGLNTVFLGSAVLCAIWL 386
+ V ++S A + + S + +GGIL + + G L + +G L +L
Sbjct: 282 MMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGIL---VDRRGPLYVLNIGVTFLSVSFL 338

Query: 387 IVALRMNEPPYVTSL 401
+ + + ++
Sbjct: 339 TASFLLETTSWFMTI 353


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
PP_0485   PERTACTIN  31     0.003    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 30.8 bits (69), Expect = 0.003
Identities = 22/72 (30%), Positives = 30/72 (41%), Gaps = 3/72 (4%)

Query: 96 YTTEIIVDINGTMQLLGGRPQGQQQGGDPYNQGGGNYGGGQQQQYNQAPPRQQAQRPQQA 155
Y + + NG L+G + + P Q G G Q P Q Q PQ+
Sbjct: 548 YRYRLAANGNGQWSLVGAKAPPAPK---PAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQ 604

Query: 156 PQRPAPQQPAPQ 167
P+ PAPQ PA +
Sbjct: 605 PEAPAPQPPAGR 616


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_0488   DHBDHDRGNASE  80     4e-20    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 80.5 bits (198), Expect = 4e-20
Identities = 62/247 (25%), Positives = 106/247 (42%), Gaps = 16/247 (6%)

Query: 2 KTAFVTGASSGFGRAICCTLIGKGYRVIG---GARRMDKLKALEAELGVNFIPLALDVTD 58
K AF+TGA+ G G A+ TL +G + +++K+ + + DV D
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 59 SVSLDKAVEQMREASLQIDLLVNNAGLALGVDRAQTSSAANWQQMIDTNITGLAMVTHKI 118
S ++D+ ++ ID+LVN AG+ L + S W+ N TG+ + +
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGV-LRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 119 LPQMVEADSGMIINIGSIAGTYPYPGGNVYGASKAFVRQFSLNLRADLAGTRVRVSNIEP 178
M++ SG I+ +GS P Y +SKA F+ L +LA +R + + P
Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187

Query: 179 GLCSGTDFSVVRLNGDMDAVQALYRDVEALL----------PEDIAATVAW-VAEQPAHV 227
G + TD + A Q + +E P DIA V + V+ Q H+
Sbjct: 188 G-STETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHI 246

Query: 228 NINTIEI 234
++ + +
Sbjct: 247 TMHNLCV 253


70  PP_0690  PP_0707  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias   Product
PP_0690   3       21       1.791421  GTPase ObgE
PP_0691   2       18       2.379761  gamma-glutamyl kinase
PP_0692   2       19       1.497463  CreA family protein
PP_0693   2       20       1.944547  hypothetical protein
PP_0694   -1      16       1.993738  hypothetical protein
PP_0695   0       12       1.906606  hypothetical protein
PP_0696   -1      10       0.882492  hypothetical protein
PP_0697   0       12       0.353119  acetyltransferase
PP_0698   1       13       1.167948  LysR family transcriptional regulator
PP_0699   1       13       0.935367  amino acid transporter LysE
PP_0700   0       14       1.492861  *FecR anti-FecI sigma factor
PP_0701   0       12       1.943288  MFS efflux transporter
PP_0702   0       10       2.366819  major facilitator family transporter
PP_0703   -1      10       2.553252  FecR anti-FecI sigma factor
PP_0704   -2      8        2.471123  ECF subfamily RNA polymerase sigma factor
PP_0705   -2      9        2.182526  DNA-3-methyladenine glycosylase II
PP_0706   -2      9        1.601952  adaptive response regulator protein
PP_0707   -2      9        0.773483  mechanosensitive ion channel protein MscS
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_0690   PF07201  30     0.015    Hypersensitivity response secretion protein HrpJ
		>PF07201#Hypersensitivity response secretion protein HrpJ

Length = 293

Score = 30.2 bits (68), Expect = 0.015
Identities = 32/169 (18%), Positives = 54/169 (31%), Gaps = 34/169 (20%)

Query: 245 VDIAPLDESSPADAAEVIVNELT-----RFSPSLAERE-------RWLVLNKA----DMV 288
V I S AD AE E+T R SL +R+ V + V
Sbjct: 39 VQIVSGTLQSIADMAE----EVTFVFSERKELSLDKRKLSDSQARVSDVEEQVNQYLSKV 94

Query: 289 MDDERDERVQEVIDRLEWEGPVYVISAISK----QGTDKLSHDLMRYLEDRADRLANDPA 344
+ E+ + V E++ L P +S + + + M L D L P
Sbjct: 95 PELEQKQNVSELLSLLS-NSPNISLSQLKAYLEGKSEEPSEQFKM--LCGLRDALKGRPE 151

Query: 345 YAEELADLDQRIED-------EARAQLQALDDARTLRRTGVKSVHDIGD 386
A ++Q + + +A ++GV + + D
Sbjct: 152 LAHLSHLVEQALVSMAEEQGETIVLGARITPEAYRESQSGVNPLQPLRD 200


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_0691   CARBMTKINASE  43     9e-07    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 43.3 bits (102), Expect = 9e-07
Identities = 39/147 (26%), Positives = 60/147 (40%), Gaps = 19/147 (12%)

Query: 124 TLRTLVDLGV---------VPVINENDTVVTDEIRFGDNDTLAALVANLVEADLLVILTD 174
T++ LV+ GV VPVI E+ + E D D +A V AD+ +ILTD
Sbjct: 178 TIKKLVERGVIVIASGGGGVPVILEDGEIKGVEAVI-DKDLAGEKLAEEVNADIFMILTD 236

Query: 175 RDGMFDADPRNNPEAQLIYEARADDPSLDAVAGGTGGALGRGGMQTKLRAARLAARSGAH 234
+G + Q + E + ++ G G M K+ AA G
Sbjct: 237 VNGAALY--YGTEKEQWLREVKVEELRKYYEEGH----FKAGSMGPKVLAAIRFIEWGGE 290

Query: 235 TIIIGGRIERVLDRLKAGERLGTLLSP 261
II +E+ ++ L G+ GT + P
Sbjct: 291 RAII-AHLEKAVEAL-EGKT-GTQVLP 314


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_0693   CHANLCOLICIN  39     9e-05    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 38.9 bits (90), Expect = 9e-05
Identities = 40/254 (15%), Positives = 86/254 (33%), Gaps = 27/254 (10%)

Query: 465 AIDLTHIDPPALQALADRAALRDQKERLEKELKQLKTQQAVAADRSASKAQTETLYQEVL 524
A +L H + A+QA +R L +E+ KE + +A KA +QE
Sbjct: 112 ATELAHANNAAMQAEDERLRLAKAEEKARKEAE------------AAEKA-----FQE-- 152

Query: 525 DAQKALEDFRRSQTLAAEEPEKLEQLSQLEAAQDELKRSSDAFTERVQQLSAKLQL-VGR 583
A++ ++ R + + + E + AA E ++ + Q+ + Q V +
Sbjct: 153 -AEQRRKEIEREKAETERQLKLAEAEEKRLAALSEEAKAVEI----AQKKLSAAQSEVVK 207

Query: 584 QLGDLESKQRTLEDALRRRQLLPADLPYGTPYMEAIDDSMDNLLPLLNDYQDSWQSLQRV 643
G++++ L ++ R L + L L+ +
Sbjct: 208 MDGEIKTLNSRLSSSIHARDAEMKTLAGKRNELAQASAKYKELDELVKKLSPRANDPLQN 267

Query: 644 DNQIEALYAQVRLKGVAKFDSEDD--MERRLQLLVNAYAHRTDEALTLAKARRAAVTDIA 701
EA +V + + + E R+ + ++ R A + +
Sbjct: 268 RPFFEATRRRVGAGKIREEKQKQVTASETRINRINADITQIQKAISQVSNNRNAGIARVH 327

Query: 702 RTLRNIRSDYDSLE 715
N++ ++L
Sbjct: 328 EAEENLKKAQNNLL 341


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_0697   SACTRNSFRASE  30     0.002    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 30.3 bits (68), Expect = 0.002
Identities = 15/59 (25%), Positives = 26/59 (44%)

Query: 64 DEAHLLNITVKPENQGCGLGLRLLKHLMARAYQLNGRECFLEVRASNQSAYRLYERYGF 122
A + +I V + + G+G LL + A + + LE + N SA Y ++ F
Sbjct: 88 GYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_0700   TYPE3OMGPROT  30     0.012    Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein

family signature.
Length = 607

Score = 30.2 bits (68), Expect = 0.012
Identities = 15/56 (26%), Positives = 22/56 (39%), Gaps = 2/56 (3%)

Query: 261 ATDMPLRQVLERLAGYQGQRLWMMDEQVAHRRVSGDFNLDQPGQSLQSLAAAQQLQ 316
A LR +L + + D + +VSG F D P LQ +A+ L
Sbjct: 40 AKGESLRDLLTDFGANYDATVVVSD--KINDKVSGQFEHDNPQDFLQHIASLYNLV 93


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_0701   TCRTETA  37     1e-04    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 37.1 bits (86), Expect = 1e-04
Identities = 54/270 (20%), Positives = 100/270 (37%), Gaps = 14/270 (5%)

Query: 38 LIQSVLPAIYPMLKANYDLSFAQIGMITLTFQITASLLQPWVGFFTDRRPTPNLLPLGTL 97
LI VLP + L + D++ A G++ + + P +G +DR +L +
Sbjct: 23 LIMPVLPGLLRDLVHSNDVT-AHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLA 81

Query: 98 CTLVGIVMLAFVGSFPMILLASALVGIGSSTFHPETSRIARLASGGR----FGLAQSTFQ 153
V ++A ++ + + GI +T + IA + G FG + F
Sbjct: 82 GAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFG 141

Query: 154 VGGNTGSALGPLLAAAIV-IPFGQTHVAWFGLAGLFFLGVTLMLRGWYKEHLNQAKARKA 212
G G LG L+ PF A L GL FL +L +K + R+A
Sbjct: 142 FGMVAGPVLGGLMGGFSPHAPF----FAAAALNGLNFLTGCFLLPESHKGERRPLR-REA 196

Query: 213 VQATHGISRNRVIAALIVLGLLVFSKYFYMASFTSYFTFYLIEKFGVSVASSQLHLFLF- 271
+ R + + L + F + + + ++F + + L F
Sbjct: 197 LNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFG 256

Query: 272 -LGAVAAGTFFGGPIGDRIGRKAVIWFSIL 300
L ++A GP+ R+G + + ++
Sbjct: 257 ILHSLAQAMIT-GPVAARLGERRALMLGMI 285



Score = 31.3 bits (71), Expect = 0.007
Identities = 21/90 (23%), Positives = 35/90 (38%)

Query: 281 FGGPIGDRIGRKAVIWFSILGVAPFTLALPYADLFWTTVLSVVIGFILASAFSAIVVYAQ 340
G + DR GR+ V+ S+ G A + A W + ++ I + + Y
Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIA 121

Query: 341 ELVPGSVGMIAGIFFGLMFGFGGIGAALLG 370
++ G F FGFG + +LG
Sbjct: 122 DITDGDERARHFGFMSACFGFGMVAGPVLG 151


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_0702   TCRTETA  44     7e-07    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 43.7 bits (103), Expect = 7e-07
Identities = 76/341 (22%), Positives = 131/341 (38%), Gaps = 21/341 (6%)

Query: 24 LPLVSLRLHEAGASTLEIGIISAIPAAGMMLSAFLVDACCRHLTRRTIYLLCFSLCTVSI 83
LP + L + T GI+ A+ A A ++ A RR + L+ + V
Sbjct: 28 LPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDY 87

Query: 84 ALLESAFGSLWLLALLRLGLGL-GMGIAIILGESWVNELCPEHNRGKIMALYATSFTGFQ 142
A++ +A LW+L + R+ G+ G A+ +++ ++ R + + F
Sbjct: 88 AIMATA-PFLWVLYIGRIVAGITGATGAVAG--AYIADITDGDERARHFGFMSACFGFGM 144

Query: 143 VLGPAMLAVLGADSPWITGVV-TVCYGLALLCIVLTVPNDHVEHEEGEKSFG---LAGFF 198
V GP + ++G SP GL L +P H + LA F
Sbjct: 145 VAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFR 204

Query: 199 RVAPALCMAVLFFSFFDAVVLSLLP----VYATSHGFA--VGVAALMVTVVFAGDMLFQL 252
+A L FF ++ +P V F + + L Q
Sbjct: 205 WARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQA 264

Query: 253 PL-GWLADRV-ERTGLHLACGLVAMVIGIGLPWLLNLTWLLWPLLVVLGAVAGGIYTLAL 310
+ G +A R+ ER L L G++A G L W+ +P++V+L +GGI AL
Sbjct: 265 MITGPVAARLGERRALML--GMIADGTGYILLAFATRGWMAFPIMVLLA--SGGIGMPAL 320

Query: 311 -VLIGQRFKGQDLVTANASVGLLWGVGSLVGPLVSGAAMNV 350
++ ++ + S+ L + S+VGPL+ A
Sbjct: 321 QAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAA 361


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_0707   ACRIFLAVINRP  30     0.040    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 30.2 bits (68), Expect = 0.040
Identities = 29/154 (18%), Positives = 48/154 (31%), Gaps = 8/154 (5%)

Query: 382 DEVASALGWFPKVLALALMVMMTMERMNSVIGASLALTVAVNGLTALVVALTFAGALLCY 441
++ AL VL+ + M I ++T+ +++VAL A LC
Sbjct: 436 SQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPA-LCA 494

Query: 442 R--RTLRKHDLERPTGLAGLIPFVMVIWLTLILLALLTGYL--TLAYFLTAKLLWVSLVI 497
+ + E G G + L T Y L L+ +V+
Sbjct: 495 TLLKPVSAEHHENKGGFFGWF-NTTFDHSVNHYTNSVGKILGSTGRYLLIYALIVAGMVV 553

Query: 498 TCAYLLTTFF--GDLCETLLSPRQPGGLALASTL 529
L ++F D L + P G T
Sbjct: 554 LFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQ 587


71  PP_0802  PP_0807  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
PP_0802   5       13       3.919242   chemotaxis protein CheV
PP_0803   5       13       3.908313   HlyD family type I secretion membrane fusion
PP_0804   5       13       3.936367   protein secretion ABC efflux system, permease
PP_0805   5       13       3.699038   TolC family type I secretion outer membrane
PP_0806   6       14       3.482045   surface adhesion protein
PP_0807   -1      9        -0.475321  anaerobic nitric oxide reductase transcriptional
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
PP_0802   HTHFIS  59     6e-12    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 58.7 bits (142), Expect = 6e-12
Identities = 23/109 (21%), Positives = 47/109 (43%), Gaps = 7/109 (6%)

Query: 169 AANILVVDDSQVALQQSVHTLRNLGIECHTARSAKDAINVLLELQGTAQEINIIVSDIEM 228
A ILV DD L G + +A + A + +++V+D+ M
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWI-----AAGDGDLVVTDVVM 57

Query: 229 SEMDGYAFTRTLRETPDFQHLYVLLHTSLDSAMSSEKATQAGANAILTK 277
+ + + +++ L VL+ ++ ++ M++ KA++ GA L K
Sbjct: 58 PDENAFDLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPK 104


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
PP_0803   RTXTOXIND  256    3e-83    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 256 bits (655), Expect = 3e-83
Identities = 92/426 (21%), Positives = 172/426 (40%), Gaps = 58/426 (13%)

Query: 21 RAGRIITLCALMLAAFLAWAAWFEVTEVSTGTGKVIPSSREQVIQSFEGGIVAQMSVAEG 80
I L + +V V+T GK+ S R + I+ E IV ++ V EG
Sbjct: 59 LVAYFIMGF---LVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEG 115

Query: 81 DLVERGQVLAQLDPTKTASSVGESEAKYRAARASQARLQAEVTG---------KPLTFPE 131
+ V +G VL +L + ++++ AR Q R Q K P
Sbjct: 116 ESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPY 175

Query: 132 SLRDSPDLIDAETALYQTRRR---------------------GLEQTLAGIQDSLQLVRS 170
S + + T+L + + + + ++ ++ +S
Sbjct: 176 FQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKS 235

Query: 171 ELKITENLAKMGASSRVEVI---------------------RLNRQRSELELKANEARSD 209
L +L A ++ V+ ++ + + +
Sbjct: 236 RLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQL 295

Query: 210 YLVRAREELAKASAEADALSEVIRGRSDSLTRLTLRSPVRGIVKDIEVNTLGGVVQPGGQ 269
+ ++L + + L+ + + +R+PV V+ ++V+T GGVV
Sbjct: 296 FKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET 355

Query: 270 VMKIVPMDERLLIETRIAPRDIAFIHPDQAAKVKISAYDYSVYGGLDGKVVGISPDTLQD 329
+M IVP D+ L + + +DI FI+ Q A +K+ A+ Y+ YG L GKV I+ D ++D
Sbjct: 356 LMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIED 415

Query: 330 EVKPEIYYYRVFIRTEQDSLQNKAGKRFAIVPGMIATVDIRTGEKTILDYLIKPL-NRAK 388
+ + + + V I E++ L + K + GM T +I+TG ++++ YL+ PL
Sbjct: 416 Q-RLGLVFN-VIISIEENCL-STGNKNIPLSSGMAVTAEIKTGMRSVISYLLSPLEESVT 472

Query: 389 EALRER 394
E+LRER
Sbjct: 473 ESLRER 478


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
PP_0806   RTXTOXINA  50     5e-07    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family

signature.
Length = 1024

Score = 49.6 bits (118), Expect = 5e-07
Identities = 24/72 (33%), Positives = 32/72 (44%), Gaps = 10/72 (13%)

Query: 6158 DVIAGTDGNDHLDGSQG--------GHITLQGGAGDDTLVVVDQNFAS--VDGGSGTDTL 6207
D ++G +G+D L G G G+ L GG GDD V + A + GG G D L
Sbjct: 765 DTLSGGNGDDQLYGGDGNDKLIGVAGNNYLNGGDGDDEFQVQGNSLAKNVLFGGKGNDKL 824

Query: 6208 LWGGGDASIDLG 6219
G +D G
Sbjct: 825 YGSEGADLLDGG 836



Score = 40.7 bits (95), Expect = 2e-04
Identities = 24/62 (38%), Positives = 30/62 (48%), Gaps = 11/62 (17%)

Query: 6158 DVIAGTDGNDHLDGSQGGHITLQGGAGDDTLVVVDQNFASVDGGSGTDTLLWGGGDASID 6217
D+I G DGND L G +G L GG GDD L GG G D L+ G+ ++
Sbjct: 747 DLIEGNDGNDRLYGDKGNDT-LSGGNGDDQL----------YGGDGNDKLIGVAGNNYLN 795

Query: 6218 LG 6219
G
Sbjct: 796 GG 797



Score = 34.2 bits (78), Expect = 0.023
Identities = 27/87 (31%), Positives = 36/87 (41%), Gaps = 2/87 (2%)

Query: 6128 DSAAGLTATTSLLADTGDESAALASLAAATDVIAGTDGNDHLDGSQGGHITLQGGAGDDT 6187
D G+ L GD+ + + A +V+ G GND L GS+G + L GG GDD
Sbjct: 783 DKLIGVAGNNYLNGGDGDDEFQVQGNSLAKNVLFGGKGNDKLYGSEGADL-LDGGEGDDL 841

Query: 6188 LVVVDQNFASVDG-GSGTDTLLWGGGD 6213
L N G G + GG
Sbjct: 842 LKGGYGNDIYRYLSGYGHHIIDDDGGK 868


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
PP_0807   HTHFIS  379    e-129    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 379 bits (974), Expect = e-129
Identities = 140/369 (37%), Positives = 195/369 (52%), Gaps = 15/369 (4%)

Query: 164 ERIEHLALRAEDEHHRAELYRQASGQD-RELIGQSPAHKRLVEEIRLVGSSDLTVLITGE 222
+ + RA E R + QD L+G+S A + + + + +DLT++ITGE
Sbjct: 109 TELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGE 168

Query: 223 TGVGKELVAQALHRASSRADKPLVSLNCAALPDTLVESELFGHVRGAFTGAHGERRGKFE 282
+G GKELVA+ALH R + P V++N AA+P L+ESELFGH +GAFTGA G+FE
Sbjct: 169 SGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFE 228

Query: 283 LANGGTLFLDEVGELPLTVQAKLLRVLQSGQLQRLGSDREHRVDVRLIAATNRDLAAEVR 342
A GGTLFLDE+G++P+ Q +LLRVLQ G+ +G R DVR++AATN+DL +
Sbjct: 229 QAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSIN 288

Query: 343 TGNFRADFYHRLSVYPLHVPPLRERGRDVLLLAGYFLEQNRSRLGLNSLRLSHEAQAALI 402
G FR D Y+RL+V PL +PPLR+R D+ L +F++Q + GL+ R EA +
Sbjct: 289 QGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAE-KEGLDVKRFDQEALELMK 347

Query: 403 AYDWPGNVRELEHLIGRSALKALGQHPDRPRILTL-------------EAIDLDLRVSAT 449
A+ WPGNVRELE+L+ R R I A L +S
Sbjct: 348 AHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQA 407

Query: 450 TPGTLPSPAAPLQVVTPPEGGLREAVDSYQRQVIEACLQRHQDNWAAAARELGLDRANLS 509
+ A PP G + + +I A L + N AA LGL+R L
Sbjct: 408 VEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLR 467

Query: 510 RLARRLGLR 518
+ R LG+
Sbjct: 468 KKIRELGVS 476


72  PP_0828  PP_0836  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
PP_0828   0       12       -0.783588  hypothetical protein
PP_0829   0       10       -1.193371  hypothetical protein
PP_0830   -1      13       -0.650351  acetyltransferase
PP_0831   0       15       -0.961579  hypothetical protein
PP_0832   1       16       -1.064304  *S-adenosylmethionine--tRNA
PP_0833   0       18       -1.555346  queuine tRNA-ribosyltransferase
PP_0834   -2      17       -1.156160  preprotein translocase subunit YajC
PP_0835   -1      16       -0.687047  preprotein translocase subunit SecD
PP_0836   -1      14       -0.398970  preprotein translocase subunit SecF
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_0828   PREPILNPTASE  29     0.029    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family

signature.
Length = 290

Score = 28.6 bits (64), Expect = 0.029
Identities = 29/108 (26%), Positives = 45/108 (41%), Gaps = 16/108 (14%)

Query: 16 SCIPRVS-RYPRVKDVIARKYRLVVKTIGY-----IGWSLFWLLIWDVLVTIDFMLFLNS 69
C +S RYP V+ + A V T+ L W+L+ + +D ML L
Sbjct: 101 GCQAPISARYPLVELLTALLSVAVAMTLAPGWGTLAALLLTWVLVALTFIDLDKML-LPD 159

Query: 70 KFTLPLIPLTLLGSALVVLVSFRNS---------SAYSRWWEARTLWG 108
+ TLPL+ LL + L VS ++ +S +W + L G
Sbjct: 160 QLTLPLLWGGLLFNLLGGFVSLGDAVIGAMAGYLVLWSLYWAFKLLTG 207


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_0830   SACTRNSFRASE  33     3e-04    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 33.0 bits (75), Expect = 3e-04
Identities = 11/60 (18%), Positives = 26/60 (43%), Gaps = 1/60 (1%)

Query: 80 NVVVAPGARGLGVARYLVTAMIDLARQQYSAREVWVSCFNHNTAGLLLYPQLGFMPFGIE 139
++ VA R GV L+ I+ A++ + + + + N + Y + F+ ++
Sbjct: 94 DIAVAKDYRKKGVGTALLHKAIEWAKENHFC-GLMLETQDINISACHFYAKHHFIIGAVD 152


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
PP_0831   V8PROTEASE  32     0.003    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 31.5 bits (71), Expect = 0.003
Identities = 14/39 (35%), Positives = 20/39 (51%), Gaps = 1/39 (2%)

Query: 154 HDGPITKGMSGGPVFADDGKVVGINVAIIPTDEINAGKR 192
+D T G SG PVF + +V+GI+ +P E N
Sbjct: 228 YDLSTTGGNSGSPVFNEKNEVIGIHWGGVPN-EFNGAVF 265


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_0835   SECFTRNLCASE  75     7e-17    Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 75.3 bits (185), Expect = 7e-17
Identities = 38/169 (22%), Positives = 79/169 (46%), Gaps = 13/169 (7%)

Query: 446 TIGPSLGADNITKGIDASLWGMLFVSLFIIAIY------RAFGVLATIALAGNMVLLLAL 499
++GP + + + + + L + +I Y F + A +AL +++L + L
Sbjct: 142 SVGPKVSGELVWTAVWS-----LLAATVVIMFYIWVRFEWQFALGAVVALVHDVLLTVGL 196

Query: 500 MSLLGATLTLPGIAGIVLTMGMAVDANVLIFSRIREEIAA--GMSVQRAIHEGFNRAYSA 557
++L L +A ++ G +++ V++F R+RE + M ++ ++ N S
Sbjct: 197 FAVLQLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRDVMNLSVNETLSR 256

Query: 558 IVDANLTTLLVGGILFAMGTGPVKGFAVTMSLGIFTSMFTAVMVTRALV 606
V +TTLL + G ++GF M G+FT +++V V + +V
Sbjct: 257 TVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAKNIV 305


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_0836   SECFTRNLCASE  296    e-102    Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 296 bits (759), Expect = e-102
Identities = 104/302 (34%), Positives = 160/302 (52%), Gaps = 20/302 (6%)

Query: 2 KTINFMGVRNVAFAVTVLLTVLALFSWWQKGLNFGLDFTGGTLIELTYERPADVKAVRAE 61
+F + F +++ + ++ GLNFG+DF GGT I DV RA
Sbjct: 12 TNFDFFRWQWATFGAAIVMMIASVILPLVIGLNFGIDFKGGTTIRTESTTAIDVGVYRAA 71

Query: 62 LVESGFNEAVVQSFG------ATTDLLVRMP------------GDDPMLGNKVAEALQKA 103
L + ++ ++R+ L NKV AL
Sbjct: 72 LEPLELGDVIISEVRDPSFREDQHVAMIRIQMQEDGQGAEGQGAQGQELVNKVETALTAV 131

Query: 104 GGDNPAKVKRVEFVGPQVGEELRDQGGIGMLLALGGILIYLAFRFQWKFAVGAIVSLIHD 163
K+ E VGP+V EL +L A I+ Y+ RF+W+FA+GA+V+L+HD
Sbjct: 132 DPA--LKITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVVALVHD 189

Query: 164 VVVTLGILSFFQITFDLTVLAAVLAIIGYSLNDTIVVFDRVRENFRVMRKASLIENINVS 223
V++T+G+ + Q+ FDLT +AA+L I GYS+NDT+VVFDR+REN + L + +N+S
Sbjct: 190 VLLTVGLFAVLQLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRDVMNLS 249

Query: 224 TTQTLLRTVATSVSTLLAIAALLFFGGDNLFGFSLALFIGVMAGTYSSIYIANVVLIWLN 283
+TL RTV T ++TLLA+ +L +GGD + GF A+ GV GTYSS+Y+A +++++
Sbjct: 250 VNETLSRTVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAKNIVLFIG 309

Query: 284 LS 285
L
Sbjct: 310 LD 311


73  PP_1046  PP_1052  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias   Product
PP_1046   2       15       3.773189  general secretion pathway protein D
PP_1047   5       16       4.470594  type II secretion system protein E
PP_1048   7       14       4.675414  general secretion pathway protein F
PP_1049   7       14       4.257006  general secretion pathway protein G
PP_1050   9       16       4.316477  general secretion pathway protein H
PP_1051   8       17       3.492159  type II secretion system protein I/J
PP_1052   6       16       3.420954  type II secretion pathway protein XcpW
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_1046   BCTERIALGSPD  473    e-162    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D

signature.
Length = 660

Score = 473 bits (1219), Expect = e-162
Identities = 195/629 (31%), Positives = 311/629 (49%), Gaps = 97/629 (15%)

Query: 10 ALSLALSMAYAQEPVFDDNGTPMYEVNFVDTELGEFIDSVSRITGTTFIVDPRVKGKVTV 69
S +L++ +F + +F T++ EFI++VS+ T I+DP V+G +TV
Sbjct: 7 IRSFSLTLLIFAALLFRPAAAEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITV 66

Query: 70 RTVDLHDADAIYDIFLAQLRAQGYATVDLPNGSVKIVPDQAARLEPVPV----------- 118
R+ D+ + + Y FL+ L G+A +++ NG +K+V + A+ VPV
Sbjct: 67 RSYDMLNEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGIGDE 126

Query: 119 ---------------------EAGGQQGEGS----DSVATRVFSVRNAASEQVLGILKPL 153
+ G GS + + + R A +++L I++ +
Sbjct: 127 VVTRVVPLTNVAARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERV 186

Query: 154 IDP--RVGVITPYPAAHQL-------------------------VVTDWRSNL------- 179
+ R V P A VV D R+N
Sbjct: 187 DNAGDRSVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEP 246

Query: 180 ---ERIASLLRQLDRPQEVPGSGSTQVIYLRHANAGEVVKVLRGLSQEGAVPVEGAGEAE 236
+RI ++++QLDR Q G+T+VIYL++A A ++V+VL G+S + A
Sbjct: 247 NSRQRIIAMIKQLDRQQAT--QGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAKPVA 304

Query: 237 GKDRPVVPAAGGSGIRLEYEEGTNAVVMVGPDSELAAYRAIVEQLDIRRAQVVVEAIIAE 296
D+ ++ A G TNA+++ + ++ QLDIRR QV+VEAIIAE
Sbjct: 305 ALDKNIIIKAHGQ---------TNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAE 355

Query: 297 VSDSSAQELGVQWLFADEKFGAGIVNFGSNGVNIASIAGAAASGDNEALGDLLSTTTGAT 356
V D+ LG+QW AG+ F ++G+ I++ A + + G + S+ A
Sbjct: 356 VQDADGLNLGIQWANK----NAGMTQFTNSGLPISTAIAGANQYNKD--GTVSSSLASAL 409

Query: 357 AGIGHFGGGF---NFAMLVNALKGKSGFNLLSTPTLLTLDNAEASILVGQEVPFVTGSVT 413
+ GF N+AML+ AL + ++L+TP+++TLDN EA+ VGQEVP +TGS T
Sbjct: 410 SSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTGSQT 469

Query: 414 QNNANPYQTIERKEVGVKLRIKPQINIDNSVRLDIVQEVSSIADTSSASD----VITNKR 469
+ N + T+ERK VG+KL++KPQIN +SV L+I QEVSS+AD +S++ N R
Sbjct: 470 TSGDNIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSVADAASSTSSDLGATFNTR 529

Query: 470 EIKTKVMVEDNGLVILGGLISDELSTSDQRVPLLGDIPYLGRLFRSDATRNTKQNLMVFI 529
+ V+V V++GGL+ +S + +VPLLGDIP +G LFRS + + +K+NLM+FI
Sbjct: 530 TVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFI 589

Query: 530 RPRILRDGPSLAGLSEDKYRTLQQTTPLQ 558
RP ++RD S +Y Q
Sbjct: 590 RPTVIRDRDEYRQASSGQYTAFNDAQSKQ 618


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_1048   BCTERIALGSPF  452    e-161    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F

signature.
Length = 408

Score = 452 bits (1165), Expect = e-161
Identities = 176/404 (43%), Positives = 248/404 (61%), Gaps = 8/404 (1%)

Query: 1 MPTYRYQAVDLAGKSHKASLQADSERHARQLLREQGLF--------ARQLQRHDAGSRQP 52
M Y YQA+D GK + + +ADS R ARQLLRE+GL Q + G
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 53 RRQRLSRAQLCELTRQLATLTGAGIPLVDALATLERQLRQPALHSVLVALRGSLAEGLGL 112
R+ RLS + L LTRQLATL A +PL +AL + +Q +P L ++ A+R + EG L
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 113 ARSLARQGAPFTGLYCALVEAGERSGHLAQVLTRLADHLEQVQRQQHKARTALIYPAVLM 172
A ++ F LYCA+V AGE SGHL VL RLAD+ EQ Q+ + + + A+IYP VL
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180

Query: 173 GVSLAVVIGLMTFVVPKLTEQFAHAGQSLPLITSLLIGLSQGLVHAGPWLLGVALLLGGL 232
V++AVV L++ VVPK+ EQF H Q+LPL T +L+G+S + GPW+L L
Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240

Query: 233 AGWLLRKPHWCLRRDQLLLRLPRIGNLLQVLESARLARSLAILCGSGVALLEALQVATET 292
+LR+ + + LL LP IG + + L +AR AR+L+IL S V LL+A++++ +
Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300

Query: 293 IGNRRIRLAMEQVRQHVQGGTSLHRALDASQQFPPLLVNMVGSGEASGTLADMLERVADD 352
+ N R + V+ G SLH+AL+ + FPP++ +M+ SGE SG L MLER AD+
Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360

Query: 353 QERGFARQVDTAMALFEPLMILVMGAVVLFIVLAVLLPIMQLNQ 396
Q+R F+ Q+ A+ LFEPL+++ M AVVLFIVLA+L PI+QLN
Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNT 404


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_1049   BCTERIALGSPG  217    3e-76    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 217 bits (555), Expect = 3e-76
Identities = 71/141 (50%), Positives = 96/141 (68%), Gaps = 3/141 (2%)

Query: 11 RRNRQRGFTLMEIMVVIFIIGLLIAVVAPSVLGNQDKAMKQKVMADLATLEQALDMYRLD 70
++QRGFTL+EIMVVI IIG+L ++V P+++GN++KA KQK ++D+ LE ALDMY+LD
Sbjct: 3 ATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLD 62

Query: 71 NLRFPSNEQGLAALVKKPAQEPLPRAWRSDGYVRRLPEDPWGTPYQYRMPGEHGRVDVYS 130
N +P+ QGL +LV+ P PL + +GY++RLP DPWG Y PGEHG D+ S
Sbjct: 63 NHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDLLS 122

Query: 131 LGADGLPGGEGQDADLGNWAL 151
G DG G E D+ NW L
Sbjct: 123 AGPDGEMGTED---DITNWGL 140


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_1050   BCTERIALGSPH  37     6e-06    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H

signature.
Length = 170

Score = 37.2 bits (86), Expect = 6e-06
Identities = 22/89 (24%), Positives = 38/89 (42%), Gaps = 1/89 (1%)

Query: 4 QRGFSLIELLVVLAIAGLMTGLAVAGLGNG-QASVEQALQRLAVKVRGQAALARHAGQLR 62
QRGF+L+E++++L + G+ G+ + S Q L R ++R GQ
Sbjct: 3 QRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTGQFF 62

Query: 63 GLRWNGQRPEFVRREGNAWVVEAVPLGDW 91
G+ + R +F+ E A W
Sbjct: 63 GVSVHPDRWQFLVLEARDGADPAPADDGW 91


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_1051   PilS_PF08805  32     5e-04    PilS N terminal
		>PilS_PF08805#PilS N terminal

Length = 185

Score = 31.8 bits (72), Expect = 5e-04
Identities = 9/39 (23%), Positives = 19/39 (48%)

Query: 15 KQAQRGFTLLEVTVALAIAAVLAVITSQVLHQRLAVQDN 53
K+ +G TL+EV + + + VLA ++ + +
Sbjct: 22 KEQDKGATLMEVLLVVGVIVVLAASAYKLYSMVQSNIQS 60


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_1052   BCTERIALGSPG  29     0.010    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 28.7 bits (64), Expect = 0.010
Identities = 12/28 (42%), Positives = 20/28 (71%), Gaps = 4/28 (14%)

Query: 4 RQTGLTLIELMVALALTAVLGIMLAALV 31
+Q G TL+E+MV + ++G+ LA+LV
Sbjct: 6 KQRGFTLLEIMVVI---VIIGV-LASLV 29


74  PP_1112  PP_1128  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
PP_1112   0       23       -5.102905  major facilitator superfamily protein
PP_1113   2       29       -7.224905  pyridoxal-phosphate dependent enzyme family
PP_1114   3       32       -7.523457  SEC-C domain-containing protein
PP_1115   2       30       -7.034349  lipoprotein
PP_1116   2       30       -6.500921  resolvase site-specific recombinase
PP_1117   1       25       -4.708994  hypothetical protein
PP_1118   0       17       -2.720059  recombinase-like protein
PP_1119   2       14       1.073571   hypothetical protein
PP_1120   0       12       1.256300   hypothetical protein
PP_1121   -1      11       1.415986   OmpA/MotB domain-containing protein
PP_1122   -1      12       1.599085   OmpA/MotB domain-containing protein
PP_1123   0       11       1.215074   hypothetical protein
PP_1124   0       12       1.491618   hypothetical protein
PP_1125   0       12       1.367204   ATP-dependent DNA helicase DinG
PP_1126   0       15       1.240329   Beta-agarase
PP_1127   0       14       0.874674   beta-lactamase
PP_1128   0       17       -1.550511  OmpA/MotB domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_1112   TCRTETB  33     0.002    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 33.3 bits (76), Expect = 0.002
Identities = 23/101 (22%), Positives = 39/101 (38%), Gaps = 3/101 (2%)

Query: 53 LCLMLATYPVSRLMSRIGRKKAFMLGAIPLALSGVSGFLAVEHQHFPTLVLSHSALGV-Y 111
L + T +L ++G K+ + G I V GF V H F L+++ G
Sbjct: 60 LTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGF--VGHSFFSLLIMARFIQGAGA 117

Query: 112 IAFANFNRFAATDNLSQALKPKALSLVVAGGVIAAVVGPTL 152
AF + + + KA L+ + + VGP +
Sbjct: 118 AAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAI 158


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_1114   SECA  58     6e-14    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 57.6 bits (139), Expect = 6e-14
Identities = 19/42 (45%), Positives = 22/42 (52%), Gaps = 1/42 (2%)

Query: 23 GHVHGPHCNHGHQEPIRNALKDVGRNDPCPCGSEKKYKKCHG 64
H + + VGRNDPCPCGS KKYK+CHG
Sbjct: 858 SHQDDDS-AAAAALAAQTGERKVGRNDPCPCGSGKKYKQCHG 898


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
PP_1115   PERTACTIN  32     7e-04    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 32.4 bits (73), Expect = 7e-04
Identities = 17/48 (35%), Positives = 23/48 (47%), Gaps = 5/48 (10%)

Query: 62 HLRVDNPNDSRLFIRNVSYAIRLNDLLLVQDEAS----VW-RSVGGHA 104
L VD S LF NV + L+D L+V +AS +W R+ G
Sbjct: 468 VLMVDTLAGSGLFRMNVFADLGLSDKLVVMRDASGQHRLWVRNSGSEP 515


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_1116   PYOCINKILLER  31     0.006    Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 30.5 bits (68), Expect = 0.006
Identities = 20/97 (20%), Positives = 39/97 (40%), Gaps = 5/97 (5%)

Query: 90 AQLMEQVDFKVATMPQADKFQLHLFAALAQQEREFIATRTKEALASLQRRAD---AGDAV 146
A+ + + + + + + L + + EA++SLQ R + A A
Sbjct: 154 AEEIGEQAVREGNINGPEAYMRFLDREMEGLTAAYNVKLFTEAISSLQIRMNTLTAAKAS 213

Query: 147 AQQKVANRA--EAQAKGRAVADITAANNIRMSKINTY 181
+ AN+A +A A+ + A+ A + NTY
Sbjct: 214 IEAAAANKAREQAAAEAKRKAEEQARQQAAIRAANTY 250


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
PP_1117   BLACTAMASEA  32     0.003    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 32.5 bits (74), Expect = 0.003
Identities = 13/65 (20%), Positives = 27/65 (41%), Gaps = 11/65 (16%)

Query: 169 SPSAVVEKVIKYLGKDLVQEVVKACKIKPTQRSQLTEWIIDNYNPSDESHRAALPDPEKV 228
+P+++ + K L + + QL +W++D+ + R+ LP +
Sbjct: 178 TPASMAATLRKLLTSQRLS---------ARSQRQLLQWMVDD-RVAGPLIRSVLPAGWFI 227

Query: 229 HDLKT 233
D KT
Sbjct: 228 AD-KT 231


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
PP_1119   SECA  48     4e-09    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 47.9 bits (114), Expect = 4e-09
Identities = 14/20 (70%), Positives = 16/20 (80%)

Query: 134 KAGRNDPCPCASGHKFKKCC 153
K GRNDPCPC SG K+K+C
Sbjct: 878 KVGRNDPCPCGSGKKYKQCH 897



Score = 28.7 bits (64), Expect = 0.011
Identities = 8/14 (57%), Positives = 8/14 (57%)

Query: 7 CPCGSGNLLDACCG 20
CPCGSG C G
Sbjct: 885 CPCGSGKKYKQCHG 898


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
PP_1121   OMPADOMAIN  119    3e-34    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 119 bits (299), Expect = 3e-34
Identities = 47/139 (33%), Positives = 69/139 (49%), Gaps = 11/139 (7%)

Query: 101 PPEPVAVVEEVVVQKEEVIVIRDVHFEFDSARLTASDKERLNTIATRLKQ-EAPSARLSV 159
P A VQ + + DV F F+ A L + L+ + ++L + + V
Sbjct: 198 PVVAPAPAPAPEVQTKHFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVV 257

Query: 160 SGHTDSVGSDSYNQKLSERRAHSVTDYLVESGVPRSSFVSVVGAGETQPVADNATAEGR- 218
G+TD +GSD+YNQ LSERRA SV DYL+ G+P +S G GE+ PV N +
Sbjct: 258 LGYTDRIGSDAYNQGLSERRAQSVVDYLISKGIPADK-ISARGMGESNPVTGNTCDNVKQ 316

Query: 219 --------AMNRRTEIKIQ 229
A +RR EI+++
Sbjct: 317 RAALIDCLAPDRRVEIEVK 335


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
PP_1122   OMPADOMAIN  127    6e-37    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 127 bits (320), Expect = 6e-37
Identities = 57/140 (40%), Positives = 77/140 (55%), Gaps = 13/140 (9%)

Query: 138 PAAPPASAPEPSPEVIT--LDDNGAVMFAFDSAELTPAAQQRLQGLVEKL--NSPTVAKV 193
A A AP P+PEV T V+F F+ A L P Q L L +L P V
Sbjct: 196 AAPVVAPAPAPAPEVQTKHFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSV 255

Query: 194 RVIGHTDSVGSDSYNQALSERRASSVAEYLIGQGLEMGKVTSQGRGESEPVTDNETEEGR 253
V+G+TD +GSD+YNQ LSERRA SV +YLI +G+ K++++G GES PVT N + +
Sbjct: 256 VVLGYTDRIGSDAYNQGLSERRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVK 315

Query: 254 AR---------NRRVELHLN 264
R +RRVE+ +
Sbjct: 316 QRAALIDCLAPDRRVEIEVK 335


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_1123   ENTSNTHTASED  28     0.004    Enterobactin synthetase component D signature.
		>ENTSNTHTASED#Enterobactin synthetase component D signature.

Length = 234

Score = 28.1 bits (62), Expect = 0.004
Identities = 13/46 (28%), Positives = 19/46 (41%)

Query: 43 FGLHLLEVLFFNRSLRGRSHRWFDRLQILLTGIFHVMSIPRPREAP 88
LHLL + R WF R ++T + + +P R AP
Sbjct: 180 ISLHLLPAFAATMAERTVRTEWFQRDNSVITLVSAITRVPHDRSAP 225


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
PP_1128   OMPADOMAIN  55     1e-10    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 55.3 bits (133), Expect = 1e-10
Identities = 28/81 (34%), Positives = 40/81 (49%), Gaps = 9/81 (11%)

Query: 177 PKTAVLVLGHADSSGAAVANQKLSLERAASVSAIFRLSGLQRDRLTLKGMGSVMPRAAN- 235
+V+VLG+ D G+ NQ LS RA SV G+ D+++ +GMG P N
Sbjct: 251 KDGSVVVLGYTDRIGSDAYNQGLSERRAQSVVDYLISKGIPADKISARGMGESNPVTGNT 310

Query: 236 -DSAEGR-------ALNRRVE 248
D+ + R A +RRVE
Sbjct: 311 CDNVKQRAALIDCLAPDRRVE 331


75  PP_1181  PP_1192  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
PP_1181   -1      14       2.273656   winged helix family two component
PP_1182   0       11       1.981575   integral membrane sensor signal transduction
PP_1183   -1      12       0.997811   4'-phosphopantetheinyl transferase
PP_1184   0       12       0.849223   dienelactone hydrolase
PP_1185   0       15       0.816965   outer membrane protein H1
PP_1186   -1      14       1.174714   winged helix family two component
PP_1187   0       13       0.977180   integral membrane sensor signal transduction
PP_1188   1       18       1.260727   C4-dicarboxylate transporter DctA
PP_1189   0       18       0.887758   hypothetical protein
PP_1190   -1      18       0.111537   hypothetical protein
PP_1191   0       20       0.002978   S4 domain-containing protein
PP_1192   0       15       -0.382421  acetyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
PP_1181   HTHFIS  77     2e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 76.8 bits (189), Expect = 2e-18
Identities = 31/137 (22%), Positives = 63/137 (45%), Gaps = 2/137 (1%)

Query: 6 PRILIVEDDQRLAELTAEYLQANGFEVAVEADGARAARRIIDSQPDLVILDLMLPGEDGL 65
IL+ +DD + + + L G++V + ++ A R I DLV+ D+++P E+
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 66 SICRRVRSQYQG-PILMLTARSDELDQVQGLDLGADDYVCKP-VRPRLLLARIQALLRRS 123
+ R++ P+L+++A++ + ++ + GA DY+ KP L+ +AL
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 124 ETVDSKRQDLAFGALRI 140
D G +
Sbjct: 124 RRPSKLEDDSQDGMPLV 140


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_1182   PF06580  37     2e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 36.8 bits (85), Expect = 2e-04
Identities = 21/106 (19%), Positives = 37/106 (34%), Gaps = 23/106 (21%)

Query: 430 LQNLVGNAMRHA------EGEVRLSYQLGQQRCRIDVEDDGPGIPEGFWDRIFTPFTRLD 483
+Q LV N ++H G++ L ++VE+ G +
Sbjct: 260 VQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKN------------- 306

Query: 484 DSRTRASGGHGLGLSIVRRIIYWHAGRATVGRSEALGGACFSLNWP 529
T+ S G GL ++ R+ + A + SE G + P
Sbjct: 307 ---TKESTGTGL-QNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_1183   ENTSNTHTASED  96     2e-26    Enterobactin synthetase component D signature.
		>ENTSNTHTASED#Enterobactin synthetase component D signature.

Length = 234

Score = 96.3 bits (239), Expect = 2e-26
Identities = 69/224 (30%), Positives = 106/224 (47%), Gaps = 14/224 (6%)

Query: 11 LRHHWPLPRPLPGAVLVSCAFDPAHLATDDFQRAGIVPSASLQRSVAKRQAEYLAGRVCA 70
L H+PLP G L FD + D + L+ + KR+AE+LAGR+ A
Sbjct: 2 LTSHFPLP--FAGHRLHIVDFDASSFREHDLLW--LPHHDRLRSAGRKRKAEHLAGRIAA 57

Query: 71 RAALQHLDGRDYVPGTHEDRSPIWPAGIHGSITHGKGWAAAVVAGENSCQGLGLDQEALL 130
AL+ + G VPG + R P+WP G+ GSI+H A AV+ S Q +G+D E ++
Sbjct: 58 VHALREV-GVRTVPGMGDKRQPLWPDGLFGSISHCATTALAVI----SRQRIGIDIEKIM 112

Query: 131 DDERAERLMGEILIPAELERLDRRQLG--LTVTLTFSLKESLFKTLYPLTRQRFYFEHAE 188
A L I+ E + L L L +TL FS KES++K + F A+
Sbjct: 113 SQHTATELAPSIIDSDERQILQASLLPFPLALTLAFSAKESVYKA-FSDRVTLPGFNSAK 171

Query: 189 VLDWSAEGLARLRLLTDLSPQWQQGAELQGQFCLQDGHLLSLVS 232
V +A L LL + + ++ ++ +D +++LVS
Sbjct: 172 VTSLTA-THISLHLLPAFAATMAE-RTVRTEWFQRDNSVITLVS 213


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
PP_1185   OMPADOMAIN  28     0.024    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 28.0 bits (62), Expect = 0.024
Identities = 31/171 (18%), Positives = 56/171 (32%), Gaps = 14/171 (8%)

Query: 2 KTFNTLLAAMAVCAAGITTAQAADDNFASLTYGQTS----DKVRKSGLLQRNTDHLNADG 57
KT + A+A A A + + G + + +G N A G
Sbjct: 3 KTAIAIAVALAGFATVAQAAPKDNTWYTGAKLGWSQYHDTGFINNNGPTHENQLGAGAFG 62

Query: 58 IIGKDDTWGVRLGKINDQGRYYMTYDNVSGDHS--GLKLRQENLLGSYDLFLPVGDTTKL 115
+ G +G + GR +G + G++L Y P+ D +
Sbjct: 63 GYQVNPYVGFEMG-YDWLGRMPYKGSVENGAYKAQGVQL---TAKLGY----PITDDLDI 114

Query: 116 FGGGSLGVTKLTQDSPGASRDTDYGYAYGLQAGVIQDITDKASVELGYRYL 166
+ V + S ++ D G + GV IT + + L Y++
Sbjct: 115 YTRLGGMVWRADTKSNVYGKNHDTGVSPVFAGGVEYAITPEIATRLEYQWT 165


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
PP_1186   HTHFIS  84     3e-21    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 84.5 bits (209), Expect = 3e-21
Identities = 30/120 (25%), Positives = 55/120 (45%), Gaps = 1/120 (0%)

Query: 2 KLLVVEDEALLRHHLYTRLGESGHVVEAVADAEEALYQAEQYHFDLAIIDLGLPGISGLE 61
+LV +D+A +R L L +G+ V ++A DL + D+ +P + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 LITRLRSQDKTFPILILTARGNWQDKVEGLAAGADDYLVKPFQFEE-LEARLNALLRRSS 120
L+ R++ P+L+++A+ + ++ GA DYL KPF E + AL
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_1187   PF06580  30     0.022    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.8 bits (67), Expect = 0.022
Identities = 22/104 (21%), Positives = 38/104 (36%), Gaps = 29/104 (27%)

Query: 326 EKRVDVVLELPDA---AQVPMEQGALLEMLGNLLENAYR------LSLGQVRVSLMKAPG 376
E R+ ++ A QVP +L + L+EN + G++ + K G
Sbjct: 237 EDRLQFENQINPAIMDVQVP----PML--VQTLVENGIKHGIAQLPQGGKILLKGTKDNG 290

Query: 377 YLTLCIEDDGPGVPVDQRERILERGERLDSQHPGQGIGLAVVKD 420
+TL +E+ G L + G GL V++
Sbjct: 291 TVTLEVENTGSLA--------------LKNTKESTGTGLQNVRE 320


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_1192   SACTRNSFRASE  33     4e-04    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 32.6 bits (74), Expect = 4e-04
Identities = 17/59 (28%), Positives = 24/59 (40%), Gaps = 5/59 (8%)

Query: 86 VDMLFVAPGYRGQGVGKRLLRYAIS-----ELNAEYLDVNEQNPKALGFYLHEGFEVIG 139
++ + VA YR +GVG LL AI L+ + N A FY F +
Sbjct: 92 IEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGA 150


76  PP_1299  PP_1306  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
PP_1299   1       18       -0.088876  polar amino acid ABC transporter inner membrane
PP_1300   1       15       -1.064304  general amino acid ABC transporter ATP-binding
PP_1301   0       12       -3.037869  2-alkenal reductase
PP_1302   0       13       -3.698675  hypothetical protein
PP_1303   -1      15       -3.626121  sulfate adenylyltransferase subunit 2
PP_1304   -2      19       -3.612770  bifunctional sulfate adenylyltransferase subunit
PP_1305   0       28       -4.432579  Pyocin S-type immunity protein
PP_1306   -2      17       -2.394091  pyocin S-type Killer domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_1299   2FE2SRDCTASE  28     0.038    Ferric iron reductase signature.
		>2FE2SRDCTASE#Ferric iron reductase signature.

Length = 262

Score = 28.5 bits (63), Expect = 0.038
Identities = 16/62 (25%), Positives = 27/62 (43%), Gaps = 18/62 (29%)

Query: 9 DMPPPVKTVGVLAWMRANLFSSWL------------------NTLLTLFAIYLVWLIVPP 50
D P P+ + + W N+ SS L L++L+A + + L+VPP
Sbjct: 47 DEPAPLNAMTLAQWSSPNVLSSLLAVYSDHIYRNQPMMIRENKPLISLWAQWYIGLMVPP 106

Query: 51 LL 52
L+
Sbjct: 107 LM 108


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
PP_1301   V8PROTEASE  67     2e-14    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 67.3 bits (164), Expect = 2e-14
Identities = 37/194 (19%), Positives = 64/194 (32%), Gaps = 38/194 (19%)

Query: 119 ESSLGSAVIMSPEGYLLTNNHVTSGADQIVVALK------------DGRETLARVIGSDP 166
+ + S V++ LLTN HV ALK +G T ++
Sbjct: 100 GTFIASGVVVGK-DTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSG 158

Query: 167 ETDLAVLKIDL--------KNLPAITIGRSDTIHIGDVSLAIGNPFGVGQTVTMGIISAT 218
E DLA++K + + T+ + + G P TM +
Sbjct: 159 EGDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGD-KPVATMW--ESK 215

Query: 219 GRNQLGLNNYEDFIQTDAAINPGNSGGALVDANGNLIGINTAIFSKSGGSQGIGFAIP-- 276
G+ L +Q D + GNSG + + +IGI+ G+
Sbjct: 216 GK-ITYLKGE--AMQYDLSTTGGNSGSPVFNEKNEVIGIHWG---------GVPNEFNGA 263

Query: 277 VKLALEVMKSIVEH 290
V + V + ++
Sbjct: 264 VFINENVRNFLKQN 277


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
PP_1304   TCRTETOQM  73     1e-15    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 73.0 bits (179), Expect = 1e-15
Identities = 51/151 (33%), Positives = 69/151 (45%), Gaps = 19/151 (12%)

Query: 33 VDDGKSTLIGRLLHDSKMIYEDHLEAITRDSKKVGTTGEEVDLALLV-DGLQAEREQGIT 91
VD GK+TL LL++S I ++G VD D ER++GIT
Sbjct: 12 VDAGKTTLTESLLYNSGAI------------TELG----SVDKGTTRTDNTLLERQRGIT 55

Query: 92 IDVAYRYFSTAKRKFIIADTPGHEQYTRNMATGASTCDLAIILVDARYGVQTQTRRHSYI 151
I F K I DTPGH + + S D AI+L+ A+ GVQ QTR +
Sbjct: 56 IQTGITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHA 115

Query: 152 ASLLGIKHIVVAVNKMDLKGFD-EGVFESIK 181
+GI I +NK+D G D V++ IK
Sbjct: 116 LRKMGIPTIFF-INKIDQNGIDLSTVYQDIK 145


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_1306   PYOCINKILLER  253    7e-77    Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 253 bits (648), Expect = 7e-77
Identities = 152/441 (34%), Positives = 221/441 (50%), Gaps = 43/441 (9%)

Query: 351 ATQNLMQRAAEVENLQAQLAAQEEA-ARQQAEAERLA-----------AEAERQRQEALR 398
A N+ + +LQ ++ A A +A A A AE + ++Q A+R
Sbjct: 186 AAYNVKLFTEAISSLQIRMNTLTAAKASIEAAAANKAREQAAAEAKRKAEEQARQQAAIR 245

Query: 399 RSVSYINEARLAVSTPA----VIPIGATTFAVAEAAYSALAESIAAALTRLVATTVPSVA 454
+ +Y A +V A +I + ++A+A A+A L R++A+ +A
Sbjct: 246 AANTYAMPANGSVVATAAGRGLIQVAQGAASLAQAISDAIA-----VLGRVLASAPSVMA 300

Query: 455 VGTLAMA--------WPSTLGNSERQYLISTPLDSLSPAGGPDLAALAASSTSIDLPYLL 506
VG ++ W +Y + L +L A+A +S ++DLP L
Sbjct: 301 VGFASLTYSSRTAEQWQD-QTPDSVRYALGMDAAKLGLPPSVNLNAVAKASGTVDLPMRL 359

Query: 507 AGVENENELDLYVVPSG-----KPVAVRAATFDSERQVY-----SLALDNPQRILTWTPA 556
N L VV + K V VR A +++ +Y S + P ILTWTPA
Sbjct: 360 TNEARGNTTTLSVVSTDGVSVPKAVPVRMAAYNATTGLYEVTVPSTTAEAPPLILTWTPA 419

Query: 557 SAPGGDEGNSTSLPLVPPGTVVYTGSSLNPVVTEQEGYPALDLLDQERLIITFPMDSGLP 616
S PG +ST+ P+VP VY G++L PV E YP + L ++ LII FP DSG+
Sbjct: 420 SPPGNQNPSSTT-PVVPKPVPVYEGATLTPVKATPETYPGVITLPED-LIIGFPADSGIK 477

Query: 617 PILVVFKSPRYEAGTSTGHGAQVSDTWRKEAASLEGAPIPAQIAELLKSREFRNFDAFRR 676
PI V+F+ PR G +TG G VS W A+ EGAPIP+QIA+ L+ + F+N+ FR
Sbjct: 478 PIYVMFRDPRDVPGAATGKGQPVSGNWLGAASQGEGAPIPSQIADKLRGKTFKNWRDFRE 537

Query: 677 QFWKAVANDPELSKQFDEMSLSRMRKNGYSPIVDFPDSHLSQKTFILHHVIPISEGGGVY 736
QFW AVANDPELSKQF+ SL+ MR G +P V + + +HH + +++GGGVY
Sbjct: 538 QFWIAVANDPELSKQFNPGSLAVMRD-GGAPYVRESEQAGGRIKIEIHHKVRVADGGGVY 596

Query: 737 DMDNIRIVTPLSHNSIHYGTK 757
+M N+ VTP H IH G K
Sbjct: 597 NMGNLVAVTPKRHIEIHKGGK 617


77  PP_1385  PP_1392  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias   Product
PP_1385   -3      13       0.505419  hydrophobe/amphiphile efflux-1 (HAE1) family
PP_1386   0       9        1.282683  RND family efflux transporter MFP subunit
PP_1387   2       10       1.381601  TetR family transcriptional regulator
PP_1388   3       12       1.934070  EmrB/QacA family drug resistance transporter
PP_1389   1       15       1.393188  carboxyphosphonoenolpyruvate phosphonomutase
PP_1390   1       17       1.535458  hypothetical protein
PP_1391   1       18       1.441929  LysR family transcriptional regulator
PP_1392   0       19       1.540506  NAD-dependent epimerase/dehydratase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_1385   ACRIFLAVINRP  1318   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1318 bits (3412), Expect = 0.0
Identities = 668/1033 (64%), Positives = 827/1033 (80%), Gaps = 4/1033 (0%)

Query: 1 MSKFFIDRPIFAWVIALVIMLVGALSILKLPINQYPSIAPPAIAIAVTYPGASAQTVQDT 60
M+ FFI RPIFAWV+A+++M+ GAL+IL+LP+ QYP+IAPPA++++ YPGA AQTVQDT
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VVQVIEQQLNGIDNLRYVSSESNSDGSMTITATFEQGTNPDTAQVQVQNKLNLATPLLPQ 120
V QVIEQ +NGIDNL Y+SS S+S GS+TIT TF+ GT+PD AQVQVQNKL LATPLLPQ
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 121 EVQQQGIRVTKAVKNFLLVIGLVSEDGSMTKDDLANYIVSNMQDPISRTAGVGDFQVFGA 180
EVQQQGI V K+ ++L+V G VS++ T+DD+++Y+ SN++D +SR GVGD Q+FGA
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 181 QYAMRIWLDPAKLNKFQLTPVDVKTAVAAQNVQVSSGQLGGLPALPGTQLNATIIGKTRL 240
QYAMRIWLD LNK++LTPVDV + QN Q+++GQLGG PALPG QLNA+II +TR
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 241 QTAEQFESILLKVNKDGSQVRLGDVAQVGLGGENYAVSAQFNGKPASGLAVKLATGANAL 300
+ E+F + L+VN DGS VRL DVA+V LGGENY V A+ NGKPA+GL +KLATGANAL
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 301 DTAKALRETIKGLEPFFPPGVKAVFPYDTTPVVTESISGVIHTLIEAVVLVFLVMYLFLQ 360
DTAKA++ + L+PFFP G+K ++PYDTTP V SI V+ TL EA++LVFLVMYLFLQ
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 361 NFRATIITTMTVPVVLLGTFGILAAAGFSINTLTMFAMVLAIGLLVDDAIVVVENVERVM 420
N RAT+I T+ VPVVLLGTF ILAA G+SINTLTMF MVLAIGLLVDDAIVVVENVERVM
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 421 SEEGLPPKEATKRSMEQIQGALVGIALVLSAVLLPMAFFGGSTGVIYRQFSITIVSAMGL 480
E+ LPPKEAT++SM QIQGALVGIA+VLSAV +PMAFFGGSTG IYRQFSITIVSAM L
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 481 SVLVALIFTPALCATMLKPLKKGEHHTAKGGFFGWFNRNFDRSVNGYERSVGAILRNKVP 540
SVLVALI TPALCAT+LKP+ EHH KGGFFGWFN FD SVN Y SVG IL +
Sbjct: 481 SVLVALILTPALCATLLKPVSA-EHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 541 FLLAYALIVVGMIWLFARIPTAFLPEEDQGVLFAQVQTPAGSSAERTQVVVDQMREYLLK 600
+LL YALIV GM+ LF R+P++FLPEEDQGV +Q PAG++ ERTQ V+DQ+ +Y LK
Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 601 DEADTVSSVFTVNGFNFAGRGQSSGMAFIMLKPWDERS-KENSVFALAQRAQQHFFTFRD 659
+E V SVFTVNGF+F+G+ Q++GMAF+ LKPW+ER+ ENS A+ RA+ RD
Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659

Query: 660 AMVFAFAPPAVLELGNATGFDVFLQDRGGVGHEKLMEARNQFLAKAAQSKI-LSAVRPNG 718
V F PA++ELG ATGFD L D+ G+GH+ L +ARNQ L AAQ L +VRPNG
Sbjct: 660 GFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNG 719

Query: 719 LNDEPQYQLTIDDERASALGVTIADINNTLSIALGASYVNDFIDRGRVKKVYIQGEPSAR 778
L D Q++L +D E+A ALGV+++DIN T+S ALG +YVNDFIDRGRVKK+Y+Q + R
Sbjct: 720 LEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFR 779

Query: 779 MSPEDLQKWYVRNGAGEMVPFSSFAKGEWTYGSPKLSRYNGVEAMEILGAPAPGYSTGEA 838
M PED+ K YVR+ GEMVPFS+F W YGSP+L RYNG+ +MEI G APG S+G+A
Sbjct: 780 MLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDA 839

Query: 839 MAEVERIAGELPSGIGFSWTGMSYEEKLSGSQMPALFALSVLFVFLCLAALYESWSIPIA 898
MA +E +A +LP+GIG+ WTGMSY+E+LSG+Q PAL A+S + VFLCLAALYESWSIP++
Sbjct: 840 MALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVS 899

Query: 899 VVLVVPLGIIGALIATSLRGLSNDVYFLVGLLTTIGLAAKNAILIVEFAKELHE-QGRSL 957
V+LVVPLGI+G L+A +L NDVYF+VGLLTTIGL+AKNAILIVEFAK+L E +G+ +
Sbjct: 900 VMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGV 959

Query: 958 YDAAIEACRMRLRPIIMTSLAFILGVVPLTIASGAGAGSQHAIGTGVIGGMISATVLAIF 1017
+A + A RMRLRPI+MTSLAFILGV+PL I++GAG+G+Q+A+G GV+GGM+SAT+LAIF
Sbjct: 960 VEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIF 1019

Query: 1018 WVPLFFVAVSSLF 1030
+VP+FFV + F
Sbjct: 1020 FVPVFFVVIRRCF 1032


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
PP_1386   RTXTOXIND  43     1e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 43.3 bits (102), Expect = 1e-06
Identities = 37/226 (16%), Positives = 81/226 (35%), Gaps = 23/226 (10%)

Query: 73 ILKRLFKEGS----EVKEGQQLY---QIDPAVYEATLANAKANLLATRSLAERYKQLIDE 125
L + + V E + Y + VY++ L ++ +L+ + + QL
Sbjct: 239 DFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKN 298

Query: 126 QAVSKQEYDDANAKRLQAEASLKSAQIDLRYTKVLAPISGRI-GRSSFTEGALVSNGQTD 184
+ + K L + + + + AP+S ++ TEG +V+ +T
Sbjct: 299 EILDK--LRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET- 355

Query: 185 AMATIQQLDPIYVDVTQSTAELLKLRRDL------ESGQLQKAGDNAASVQLVLEDGSLF 238
M + + D + V ++ + E+ + G V+ + D
Sbjct: 356 LMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIED 415

Query: 239 KQEGRLEFSEVAVDETTGSVTLRALFPNPDHTLLPGMFVHARLKAG 284
++ G + ++++E S + + L GM V A +K G
Sbjct: 416 QRLGLVFNVIISIEENCLSTGNKNIP------LSSGMAVTAEIKTG 455



Score = 42.9 bits (101), Expect = 1e-06
Identities = 21/96 (21%), Positives = 37/96 (38%), Gaps = 2/96 (2%)

Query: 61 RVAEVRPQVNGIILKRLFKEGSEVKEGQQLYQIDPAVYEATLANAKANLLATRSLAERYK 120
R E++P N I+ + + KEG V++G L ++ EA +++LL R RY+
Sbjct: 95 RSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQ 154

Query: 121 QLIDEQAVSKQEYDDANAKR--LQAEASLKSAQIDL 154
L ++K + L
Sbjct: 155 ILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSL 190


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_1387   HTHTETR  140    6e-44    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 140 bits (354), Expect = 6e-44
Identities = 79/209 (37%), Positives = 121/209 (57%)

Query: 1 MVRRTKEEAQETRAQIIEAAERAFYKRGVARTTLADIAELAGVTRGAIYWHFNNKAELVQ 60
M R+TK+EAQETR I++ A R F ++GV+ T+L +IA+ AGVTRGAIYWHF +K++L
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 61 ALLDSLHETHDHLARASESEDELDPLGCMRKLLLQVFNELVLDARTRRINEILHHKCEFT 120
+ + L +++ DPL +R++L+ V V + R R + EI+ HKCEF
Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120

Query: 121 DDMCEIRQQRQSAVLDCHKGITLALANAVRRGQLPGELDVERAAVAMFAYVDGLIGRWLL 180
+M ++Q +++ L+ + I L + + LP +L RAA+ M Y+ GL+ WL
Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180

Query: 181 LPDSVDLLGDVEKWVDTGLDMLRLSPALR 209
P S DL + +V L+M L P LR
Sbjct: 181 APQSFDLKKEARDYVAILLEMYLLCPTLR 209


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_1388   TCRTETB  138    2e-38    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 138 bits (350), Expect = 2e-38
Identities = 89/412 (21%), Positives = 172/412 (41%), Gaps = 25/412 (6%)

Query: 13 VLTALMLAIFLGALDQTIVAVSLPAISAQFSDVG-LLAWVISGYMVAMTVAVPIYGKLGD 71
+L L + F L++ ++ VSLP I+ F+ WV + +M+ ++ +YGKL D
Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74

Query: 72 LYGRRRMILTGISLFTLASIACAMAQDM-PQLVLARVLQGIGAGGMVSVSQAIIGDFVPP 130
G +R++L GI + S+ + L++AR +QG GA ++ ++ ++P
Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134

Query: 131 RERGRYQGYFSSMYAAASVAGPVLGGWLTEYLSWRWVFWINLPLGLVALWAIRRALADMP 190
RG+ G S+ A GP +GG + Y+ W ++ I + + + ++
Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLL---KK 191

Query: 191 VQRREAQVDYLGAMLLILGLGSLLLGITLVGQGHAWADPAVLALFGCALLGLALFIAHER 250
R + D G +L+ +G+ +L T L + +L +F+ H R
Sbjct: 192 EVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISF-------LIVS---VLSFLIFVKHIR 241

Query: 251 RCPEPLLPLSLFGNR---VAVLCWAVIFFASFQSISLTMLMPLRYQGITGAGADSAALHL 307
+ +P + L N + VLC +IF +S+ M ++ A S +
Sbjct: 242 KVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVI--I 299

Query: 308 LPLAMGLPMGAFTGGRMTSRTGRYKPQILAGALLMPVAIFAMALTPPQSALLSALFMLLT 367
P M + + + GG + R G + G + V+ + ++ + ++
Sbjct: 300 FPGTMSVIIFGYIGGILVDRRGPLY-VLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFV 358

Query: 368 GIACGLQFPTSLVGT--QSAVASKDIGVATSTTNLFRSLGGAMGVACMSSLL 417
GL F +++ T S++ ++ G S N L G+A + LL
Sbjct: 359 --LGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLL 408


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_1392   NUCEPIMERASE  36     5e-05    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 36.3 bits (84), Expect = 5e-05
Identities = 27/122 (22%), Positives = 42/122 (34%), Gaps = 29/122 (23%)

Query: 3 KIAIIGATGRAGSQLLEEALRRGHRVLAI-----ARDPS------TLEGREGVTVKSLDA 51
K + GA G G + + L GH+V+ I D S L + G +D
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 52 TDSAALQA--AVAGMDAVLSAAH-----FSTMQPHA-----------IIEPVKRAGVKRL 93
D + A + V + H +S PHA I+E + ++ L
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121

Query: 94 LV 95
L
Sbjct: 122 LY 123


78  PP_1492  PP_1497  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias   Product
PP_1492   -2      11       1.491618  chemotaxis protein CheA
PP_1493   -2      15       0.411248  chemotaxis-specific methylesterase
PP_1494   -1      15       0.369545  response regulator/GGDEF domain-containing
PP_1496   -1      16       0.521375  lysyl-tRNA synthetase
PP_1497   -2      13       0.780470  TetR family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
PP_1492   HTHFIS  74     8e-16    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 73.7 bits (181), Expect = 8e-16
Identities = 32/116 (27%), Positives = 56/116 (48%), Gaps = 3/116 (2%)

Query: 638 RKRILVVDDSLTVRELQRKLLGNRGYDVAVAVDGMDGWNALRSEDFDLLITDIDMPRMDG 697
ILV DD +R + + L GYDV + + W + + D DL++TD+ MP +
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 698 IELVTLVRRDQRLQSLPVMVVSYKDREEDRRRGLDAGADYYLAKASFHDDALLDAV 753
+L+ +++ LPV+V+S ++ + + GA YL K F L+ +
Sbjct: 63 FDLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPK-PFDLTELIGII 115


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
PP_1493   HTHFIS  46     1e-07    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 46.4 bits (110), Expect = 1e-07
Identities = 32/173 (18%), Positives = 55/173 (31%), Gaps = 11/173 (6%)

Query: 2 KIAIVNDMPLAVEALRRAVALEPAHQVVWVASNGAEAVQRCTEQLPDLILMDLIMPVMDG 61
I + +D L +A L A V + SN A + DL++ D++MP +
Sbjct: 5 TILVADDDAAIRTVLNQA--LSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 62 VEATRRIMAETPCAIVIVTVDRKQNVHRVFEAMGHGALD-VVDTPALGAGDAREAAAPLL 120
+ RI P V+V + +A GA D + L A
Sbjct: 63 FDLLPRIKKARPDLPVLVMSA-QNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 121 RKILNIGWLVGQQRAPAARSVAAPLREASQRRGLVAIGSSAGGPAALEVLLKG 173
K Q +A ++E + + L +++ G
Sbjct: 122 PKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLM-------QTDLTLMITG 167


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
PP_1494   HTHFIS  62     6e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 62.2 bits (151), Expect = 6e-13
Identities = 35/162 (21%), Positives = 61/162 (37%), Gaps = 15/162 (9%)

Query: 20 VLLVDDQAMIGEAVRRGLANEDNIDFHFCADPHQAVAQAMRIKPTVILQDLIMPGLDGLT 79
+L+ DD A I + + L+ D ++ +++ D++MP +
Sbjct: 6 ILVADDDAAIRTVLNQALSRAG-YDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 80 LVREYRNNPVTQDIPIIVLSTKEDPLVKSAAFAAGANDYLVKLPDTIELVARIRYHSRSY 139
L+ + D+P++V+S + + A GA DYL K D EL+ I
Sbjct: 65 LLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA---- 118

Query: 140 LTLLQRDEAYRALRVSQQQLL--DSNLMLQ------RLMNSD 173
L +R + L S M + RLM +D
Sbjct: 119 LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTD 160


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_1497   HTHTETR  52     4e-10    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 51.6 bits (123), Expect = 4e-10
Identities = 22/90 (24%), Positives = 39/90 (43%)

Query: 27 KTARQGSEQRRQLILDAAMRIVVRDGVRGVRHRAVAAEAGVPLSATTYYFKDIEDLLTDT 86
+ +Q +++ RQ ILD A+R+ + GV +A AGV A ++FKD DL ++
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 87 FAQYVERSAAYMAKLWANTEVVLRQLLAQG 116
+ + A +L +
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREI 92


79  PP_1694  PP_1698  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias   Product
PP_1694   0       12       1.759906  hypothetical protein
PP_1695   -2      12       1.406260  integral membrane sensor hybrid histidine
PP_1696   -2      11       0.808511  major facilitator superfamily protein
PP_1697   -2      13       0.472777  GntR family transcriptional regulator
PP_1698   0       10       1.587370  major facilitator family transporter
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
PP_1694   RTXTOXIND  41     9e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 41.0 bits (96), Expect = 9e-06
Identities = 19/150 (12%), Positives = 52/150 (34%), Gaps = 8/150 (5%)

Query: 25 QVQRRQGARQAEQALLEERLNAAQLAQAGLQAQLDASRDEVSDLSEANAVKQAQLAAQSR 84
+V R + + + + + +L +A+ ++ + V++++L
Sbjct: 183 EVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDD--- 239

Query: 85 ELELLQIDRDNARDAAHAWSLERANREAELRRLEAQTARLDAELREQQESHQQRLEDLQE 144
L + A+ A + ELR ++Q ++++E+ +E +Q + +
Sbjct: 240 -FSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKN 298

Query: 145 ARDTLRAQFADMATKIFDEREQRFAQTSQQ 174
Q T A+ ++
Sbjct: 299 EILDKLRQ----TTDNIGLLTLELAKNEER 324



Score = 30.6 bits (69), Expect = 0.014
Identities = 32/184 (17%), Positives = 66/184 (35%), Gaps = 16/184 (8%)

Query: 56 AQLDASRDEVSDLSEANAVKQAQLAAQSRELELLQIDRDNARDAAHAWSLERANREAELR 115
+L A E L +++ QA+L ++ I+ + + +
Sbjct: 125 LKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLP------DEPYFQN 178

Query: 116 RLEAQTARLDAELREQQESHQQRLEDLQEARDTLRAQFADMATKIFDEREQRFAQTSQQH 175
E + RL + ++EQ + Q + + D RA+ + +I R + ++ +
Sbjct: 179 VSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARI--NRYENLSRVEKSR 236

Query: 176 LGQLLDPLKE-----RIQAFEKRVEESYQQEARERFSLGKELERLQQLNLRLSDEATNLT 230
L L + E+ E Y + E +LE+++ L +E +T
Sbjct: 237 L-DDFSSLLHKQAIAKHAVLEQ--ENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVT 293

Query: 231 QALK 234
Q K
Sbjct: 294 QLFK 297


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
PP_1695   HTHFIS  59     5e-11    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 59.5 bits (144), Expect = 5e-11
Identities = 31/121 (25%), Positives = 51/121 (42%), Gaps = 4/121 (3%)

Query: 1040 VLCVDNEDSILIGMNSLLSRWGCQVWTARNQAECEALLAKGMRPHLALVDYHLDDGETGT 1099
+L D++ +I +N LSR G V N A +A G L + D + D
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAG-DGDLVVTDVVMPDE-NAF 63

Query: 1100 GLMGWLRARLGEPVPGVVISADGSKET-IALVHASGLDYLAKPVKPAALRALLNRHLSLA 1158
L+ ++ + +P +V+SA + T I DYL KP L ++ R L+
Sbjct: 64 DLLPRIKKARPD-LPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 1159 Q 1159
+
Sbjct: 123 K 123


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_1696   TCRTETA  51     3e-09    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 51.0 bits (122), Expect = 3e-09
Identities = 82/394 (20%), Positives = 136/394 (34%), Gaps = 17/394 (4%)

Query: 11 TVRLLLTTTFTLTVARALTLPYLVVYLAD--NFQLPISQIGLLIGGALIVASLLSLYGGH 68
+ ++L+T V L +P L L D + + G+L+ ++ + G
Sbjct: 6 PLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGA 65

Query: 69 LVDTLSNHTLVSASTLLFALAFVGAVASRSALPFFFCLVLINLALAVVDIAAKAGFCALL 128
L D ++ S A+ + +A+ L + ++ A A +
Sbjct: 66 LSDRFGRRPVLLVSLAGAAVDYA-IMATAPFLWVLYIGRIVAGITGATGAVAGAYIADIT 124

Query: 129 PVDERAEVFAIKYTLSNVGYAAGPLLGVAMLELNDHVPFLASALL-GLAMCLAYWRLGDR 187
DERA F G AGP+LG M + H PF A+A L GL + L +
Sbjct: 125 DGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPES 184

Query: 188 SLQASAPDKPAAGFGQVALGLARDRRLVCFTVGGVLSAVVFGQFTAYLSQYLVVTSNPAE 247
P + A + AR +V + + GQ A L +
Sbjct: 185 HKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAAL-WVIFGEDRFHW 243

Query: 248 AARLIGYLVTTNAVTVIALQ-YLIGRRISRQRLMPWLLAGMGLFIAGLLGFALAGSVLAW 306
A IG + + Q + G +R L+ GM G + A A
Sbjct: 244 DATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMA 303

Query: 307 CLAMLVFTLGEIIVIPAEYMFIDLIAPEHLRGVYYGA-QNLSNLGAALGPVMVGFALVHL 365
M++ G I +PA + E +G G+ L++L + +GP++
Sbjct: 304 FPIMVLLASGGIG-MPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAAS 362

Query: 366 WP---------GAVFYLLVLSVILAGVFYWLGTR 390
GA YLL L + G++ G R
Sbjct: 363 ITTWNGWAWIAGAALYLLCLPALRRGLWSGAGQR 396


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_1698   TCRTETB  48     3e-08    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 48.3 bits (115), Expect = 3e-08
Identities = 77/397 (19%), Positives = 138/397 (34%), Gaps = 59/397 (14%)

Query: 16 FWACFGGWSLDALEVQMFGLAIPALIAAFSLTKGDAGLISGLTLVTSAIGGWLGGTLSDR 75
W C + L + +++P + F+ ++ ++T +IG + G LSD+
Sbjct: 17 IWLCILSF-FSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ 75

Query: 76 YGRVRTLQWMILWFSFFTFLSAFVTGFYPLL-FVKAMQGFGIGGEWAAGAVLMAETINPK 134
G R L + I+ F + + F+ LL + +QG G A V++A I +
Sbjct: 76 LGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKE 135

Query: 135 YRGKVMGTVQSAWAVGWGLAVALFMLIYSLVPQEFAWRVMFFVGLLPSLLIIWVRRNVPE 194
RGK G + S A+G G+ I ++ W + L+P + II V +
Sbjct: 136 NRGKAFGLIGSIVAMGEGVGP----AIGGMIAHYIHWSYLL---LIPMITIITVPFLMKL 188

Query: 195 PDSFQRLQKEKAIPSSFLQSMA-----------------------GIFRPELLRVT---- 227
R++ I L S+ IF + +VT
Sbjct: 189 LKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFV 248

Query: 228 ----------LLGGLLGLGAHGGYHAVMTWLPTFLKTERNLSVLNSG------GYLAVII 271
++G L G G ++ +P +K LS G G ++VII
Sbjct: 249 DPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVII 308

Query: 272 LAFWCGCVVSGLLIDRIGRRKNILLFALCCVLTVQAYVFFPLTNTQMLFLGFPLGF-FAA 330
+ + G+L+DR G + + ++ F T + + + +
Sbjct: 309 FGY-----IGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLS 363

Query: 331 GIPASLGAFFNELYPADVRGAGVGFCYNFGRVLSAVF 367
+ + GAG+ NF LS
Sbjct: 364 FTKTVISTIVSSSLKQQEAGAGMSL-LNFTSFLSEGT 399


80  PP_1780  PP_1805  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
PP_1780   -1      23       -4.854431   mannosyltransferase
PP_1781   0       31       -6.991254   O-acyltransferase
PP_1782   2       39       -9.080228   dTDP-4-dehydrorhamnose 3,5-epimerase
PP_1783   3       44       -11.429872  glucose-1-phosphate thymidylyltransferase
PP_1784   4       48       -12.198177  dTDP-4-dehydrorhamnose reductase
PP_1785   5       50       -13.585220  dTDP-glucose 4,6-dehydratase
PP_1786   4       52       -13.758641  glycosyl transferase family protein
PP_1787   4       52       -12.483417  hypothetical protein
PP_1788   2       45       -9.664149   hypothetical protein
PP_1789   0       37       -6.935773   HAD superfamily hydrolase
PP_1790   0       34       -5.879358   acylneuraminate cytidylyltransferase
PP_1791   -1      29       -4.904416   aldolase
PP_1792   -1      25       -4.046583   glycosyl transferase family protein
PP_1793   0       19       -3.839308   glycosyl transferase family protein
PP_1794   0       16       -3.185655   hypothetical protein
PP_1795   0       16       -2.940870   hypothetical protein
PP_1797   0       19       -3.524995   HlyD family secretion protein
PP_1798   1       21       -2.835960   outer membrane efflux protein
PP_1799   0       22       -2.651777   GDP-mannose 4,6-dehydratase
PP_1800   0       26       -3.136599   oxidoreductase Rmd
PP_1801   1       28       -4.084828   glycosyl transferase WbpY
PP_1802   2       28       -4.810507   glycosyl transferase WbpZ
PP_1803   1       19       -3.903614   UDP-sugar epimerase
PP_1804   1       19       -4.645716   glycosyl transferase WbpL
PP_1805   -1      18       -5.158033   polysaccharide biosynthesis protein CapD
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
PP_1780   RTXTOXIND  31     0.037    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 31.3 bits (71), Expect = 0.037
Identities = 16/87 (18%), Positives = 34/87 (39%), Gaps = 6/87 (6%)

Query: 284 FQQVGESVESCHSALSQVNLELGQHQAALVDLSQEVERHRAELESLKQVVNGLNHHLSDT 343
FQ V E ++L + Q+Q +++ RAE ++ +N +
Sbjct: 176 FQNVSEEEVLRLTSLIKEQFSTWQNQKYQ--KELNLDKKRAERLTVLARINRYENLSRVE 233

Query: 344 QQRLA----LADERMAATQAWIDKQQA 366
+ RL L ++ A A ++++
Sbjct: 234 KSRLDDFSSLLHKQAIAKHAVLEQENK 260


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_1784   NUCEPIMERASE  43     7e-07    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 43.2 bits (102), Expect = 7e-07
Identities = 43/200 (21%), Positives = 70/200 (35%), Gaps = 32/200 (16%)

Query: 1 MKILLLGKNGQVGWELQRALAPLG-EVIALD-----------RQGAEGLC--------GD 40
MK L+ G G +G+ + + L G +V+ +D + E L D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 41 LSNLDGLAATIRQLAPDVIVNAAAYTAVDKA-ESDQALAAMINAAAPAVLARETAALGAW 99
L++ +G+ + + + AV + E+ A A +L
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 100 LIHYSTDYVFDGSGSQRWEETAPTG-PLSVYGRTKLEGE-HAILAS---GAKAVVLRTSW 154
L++ S+ V+ + + P+S+Y TK E A S G A LR
Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFFT 180

Query: 155 VYAARG------HNFAKTML 168
VY G F K ML
Sbjct: 181 VYGPWGRPDMALFKFTKAML 200


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_1785   NUCEPIMERASE  178    4e-55    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 178 bits (452), Expect = 4e-55
Identities = 91/358 (25%), Positives = 146/358 (40%), Gaps = 43/358 (12%)

Query: 2 ILVTGGAGFIGSNFVLQWCAHNEEPVLNLDALT--YAGNL--ANLQPLEGNPQHRFVQGN 57
LVTG AGFIG + + + V+ +D L Y +L A L+ L P +F + +
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQ-VVGIDNLNDYYDVSLKQARLELL-AQPGFQFHKID 60

Query: 58 ICDAALLTKLFAEHRPRAVVHFAAESHVDRSITGPEAFVETNVMGTFRLLEAARAHWNSL 117
+ D +T LFA V V S+ P A+ ++N+ G +LE R N +
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRH--NKI 118

Query: 118 EGAEKEAFRFLHVSTDEVYGTLGPNDPAFTETTPYAPNSPYSASKAASDHLVRSYFHTYG 177
+ L+ S+ VYG L P T+ + P S Y+A+K A++ + +Y H YG
Sbjct: 119 Q-------HLLYASSSSVYG-LNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYG 170

Query: 178 MPVLTTNCSNNYGPLHFPEKLIPLMIVNALAGKALPVYGDGQQIRDWLYVEDHCSGIRRV 237
+P YGP P+ + L GK++ VY G+ RD+ Y++D I R+
Sbjct: 171 LPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRL 230

Query: 238 LEAGAFGETYNIGGWNEKANIDIVRTLCSLLDEMAPAASRQVINQKTGEPVE--QYAELI 295
+ +T A A A +V N PVE Y + +
Sbjct: 231 QDVIPHADTQWTVETGTPA---------------ASIAPYRVYNIGNSSPVELMDYIQAL 275

Query: 296 A----------YVTDRPGHDRRYAIDARKIERELGWKPAETFETGIRKTVAWYLANQK 343
+ +PG + D + + +G+ P T + G++ V WY K
Sbjct: 276 EDALGIEAKKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDFYK 333


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_1786   SYCDCHAPRONE  40     2e-05    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD

chaperone signature.
Length = 168

Score = 39.5 bits (92), Expect = 2e-05
Identities = 21/105 (20%), Positives = 38/105 (36%), Gaps = 12/105 (11%)

Query: 211 KLGMHYLSRHKVQEGRILLEAALSIAPS-AEVFNRLGGSYMEDGHFSIALQYFNAAAQL- 268
L + K ++ + +A + + F LG G + +A+ ++ A +
Sbjct: 41 SLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMD 100

Query: 269 ---PNPPVWTAFNQAHCLAKLHQLDEAIRVLSAG---IATNPNHR 307
P P F+ A CL + +L EA L IA +
Sbjct: 101 IKEPRFP----FHAAECLLQKGELAEAESGLFLAQELIADKTEFK 141


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits      Score  E-value  Comments
PP_1791   adhesinb  33     0.003    Adhesin B signature.
		>adhesinb#Adhesin B signature.

Length = 310

Score = 32.5 bits (74), Expect = 0.003
Identities = 9/35 (25%), Positives = 16/35 (45%)

Query: 280 YLAGKYGIHPTYIQEMLGDSRFGEEDILAVIEYLR 314
Y + Y + YI E+ + + I ++E LR
Sbjct: 211 YFSKAYNVPSAYIWEINTEEEGTPDQIKTLVEKLR 245


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
PP_1794   RTXTOXINA  48     7e-08    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family

signature.
Length = 1024

Score = 48.0 bits (114), Expect = 7e-08
Identities = 29/113 (25%), Positives = 47/113 (41%), Gaps = 11/113 (9%)

Query: 136 NGDDVITVKGDQNTLIDAGDGNDTIVTGNGDNVVIAGAGNNNVTTGTGDDTI-------- 187
+G+D + N + G+G+D + G+G++ +I AGNN + G GDD
Sbjct: 753 DGNDRLY-GDKGNDTLSGGNGDDQLYGGDGNDKLIGVAGNNYLNGGDGDDEFQVQGNSLA 811

Query: 188 --ILSGSNHADIVNAGAGYDVVQLDGSRDDYAFAVGNNFNVNLTGNQTASITD 238
+L G D + G D++ D GN+ L+G I D
Sbjct: 812 KNVLFGGKGNDKLYGSEGADLLDGGEGDDLLKGGYGNDIYRYLSGYGHHIIDD 864



Score = 47.7 bits (113), Expect = 8e-08
Identities = 27/94 (28%), Positives = 44/94 (46%), Gaps = 2/94 (2%)

Query: 145 GDQNTLIDAGDGNDTIVTGNGDNVVIAGAGNNNVTTGTGDDTIILSGSNHADIVNAGAGY 204
D + LI+ DGND + G++ + G G++ + G G+D L G + +N G G
Sbjct: 743 ADGDDLIEGNDGNDRLYGDKGNDTLSGGNGDDQLYGGDGNDK--LIGVAGNNYLNGGDGD 800

Query: 205 DVVQLDGSRDDYAFAVGNNFNVNLTGNQTASITD 238
D Q+ G+ G N L G++ A + D
Sbjct: 801 DEFQVQGNSLAKNVLFGGKGNDKLYGSEGADLLD 834


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
PP_1797   RTXTOXIND  323    e-108    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 323 bits (829), Expect = e-108
Identities = 96/422 (22%), Positives = 176/422 (41%), Gaps = 7/422 (1%)

Query: 24 RRIGLTVVFVTFGIFGTWAAFAPLSNAVHGTGVVTVQNYRKTVQHLEGGIVKELHARDGD 83
R + ++ I + + G +T K ++ +E IVKE+ ++G+
Sbjct: 58 RLVAYFIMGF-LVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGE 116

Query: 84 LVKKGDPLIVLDESQLSAEYESTRNQLIVARYKEARLRA-----ERDGLDSIPSVIMEGT 138
V+KGD L+ L A+ T++ L+ AR ++ R + E + L +
Sbjct: 117 SVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYF 176

Query: 139 DSDRAQEALAGEQQVFKARHDSLLGEISVNRERIQQMQQQIAGLNDMIRTKAGLNKSYSG 198
+ +E L + K + + + + + + + + I L++
Sbjct: 177 QNVSEEEVLR-LTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKS 235

Query: 199 EIKQLKELLAEGFVDNQRLLEQERKLDMLKTEVADHQSSITKTKLQIGETELQIVQLKKK 258
+ LL + + +LEQE K E+ ++S + + + +I + + + +
Sbjct: 236 RLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQL 295

Query: 259 FDADVAKELSDVQAQVFDLQEKEAALRDRLSRVVIRAPESGMVLDMKVHTIGGVVSAATP 318
F ++ +L + L + A +R VIRAP S V +KVHT GGVV+ A
Sbjct: 296 FKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET 355

Query: 319 LLDIVPAQSDLVVEAKVAPRDIDRLELGKTADVRFSAFNQATTPVIEGKLTRISADSLVE 378
L+ IVP L V A V +DI + +G+ A ++ AF + GK+ I+ D++ +
Sbjct: 356 LMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIED 415

Query: 379 ERSGDQYYLVRVKVTEDGMKKLGNRKLQPGMPAEVLINAGDRTMLQYLLKPARNMFAESL 438
+R G + ++ N L GM I G R+++ YLL P ESL
Sbjct: 416 QRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVISYLLSPLEESVTESL 475

Query: 439 IE 440
E
Sbjct: 476 RE 477


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_1799   NUCEPIMERASE  110    8e-30    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 110 bits (277), Expect = 8e-30
Identities = 73/354 (20%), Positives = 122/354 (34%), Gaps = 65/354 (18%)

Query: 13 MKAIVTGITGQDGAYLAELLLEKGYTVYG-----TYRRTSSVNFWRIEELGIHTNPNLHL 67
MK +VTG G G ++++ LLE G+ V G Y S + R+E L P
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVS-LKQARLELLA---QPGFQF 56

Query: 68 VEYDLTDLSASIRLLQTTEATEVYNLAAQSFVGVSFEQPLTTAEITGLGAVNLLEAIRIV 127
+ DL D L + V+ + V S E P A+ G +N+LE R
Sbjct: 57 HKIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHN 116

Query: 128 NPKARFYQASTSEMFGKVQEIPQVETTPF-YPRSPYGVAKLYAHWMTINYRESYNLFATS 186
+ Y AS+S ++G +++P +P S Y K M Y Y L AT
Sbjct: 117 KIQHLLY-ASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATG 175

Query: 187 GILFNHESPLRGRE-----FVTRKITDSVAKIKLGLLDKLELGNLDAKRDWGFAKEYVEG 241
F P GR T+ + + + I + KRD+ + + E
Sbjct: 176 LRFFTVYGP-WGRPDMALFKFTKAMLEGKS-IDV-------YNYGKMKRDFTYIDDIAEA 226

Query: 242 MWRMLQADEPDT-------------------FVLATNRTETVRDFVTMAFKAAGIEINWS 282
+ R+ + + + + D++ A GIE
Sbjct: 227 IIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAK-- 284

Query: 283 GKDEAEQGTCAASGKVLVAINPKFYRPAEVELLIGNPAKAKDVLGWEPKTNLEE 336
N +P +V + +V+G+ P+T +++
Sbjct: 285 -------------------KNMLPLQPGDVLETSADTKALYEVIGFTPETTVKD 319


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_1800   NUCEPIMERASE  93     5e-24    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 93.3 bits (232), Expect = 5e-24
Identities = 56/237 (23%), Positives = 96/237 (40%), Gaps = 25/237 (10%)

Query: 7 RALITGVHGFTGRYMAAELRAGGYEVFGTGS--------------QPLPAADYR--QVDL 50
+ L+TG GF G +++ L G++V G + + L ++ ++DL
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 51 TDGQGLRALLAELQPDVVVHLAAIAFVGHGTAD--AFYKVNLIGTRNLLEAIAACGKTPE 108
D +G+ L A + V V + + A+ NL G N+LE +
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNK--IQ 119

Query: 109 CVLIASSANVYG-NVSEGMLGEQTPPAPANDYAVSKLAMEYMARLWCD--RLPIVITRPF 165
+L ASS++VYG N + + P + YA +K A E MA + LP R F
Sbjct: 120 HLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFF 179

Query: 166 NYTGVGQAENFLLPKIVSHFRRKADTIEL-GNLDVWRDFSDVRAVVQAYRGLIEARP 221
G + L K + +I++ + RDF+ + + +A L + P
Sbjct: 180 TVYGPWGRPDMALFKFTKAM-LEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIP 235


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_1803   NUCEPIMERASE  74     6e-17    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 73.7 bits (181), Expect = 6e-17
Identities = 62/343 (18%), Positives = 118/343 (34%), Gaps = 42/343 (12%)

Query: 5 TILVTGASGFVGGALCRQLATLGSFAI-------------RAASRDLGGASVAGIQAVTV 51
LVTGA+GF+G + ++L G + + A +L + +
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 52 ADLSATTDWARALSGVDLVVHAAARVHVMKETASDSL---AEFRRVNVDGTLNLARQAAA 108
AD TD A + V + R+ V SL + N+ G LN+
Sbjct: 62 ADREGMTD-LFASGHFERVFISPHRLAV-----RYSLENPHAYADSNLTGFLNILEGCRH 115

Query: 109 AGVRRFIFISSIKVNGESSQPGQPLRADDSPA-PQDAYGVSKHEAEQGLRQLAAATGMEV 167
++ ++ SS V G + + P DDS P Y +K E + G+
Sbjct: 116 NKIQHLLYASSSSVYGLNRKM--PFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPA 173

Query: 168 VVIRPVLVYGPGVKAN--FHSMMRWLQRGVPLP-FGAVCNRRSLVSLANLVDLVVTCIDH 224
+R VYGP + + + + G + + +R + ++ + ++ D
Sbjct: 174 TGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDV 233

Query: 225 PRAANQTFLASDGDDVSLTQLLRALGLALGRPARLLPVPAGLLRGAVLLIGRRDLAQRLF 284
A+ + G + R + P L+ L +G A++
Sbjct: 234 IPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALED----ALGIE--AKKNM 287

Query: 285 GTLQ--------VDIEKNRQLLGWYPPCTLEQGLNMTARSFLG 319
LQ D + +++G+ P T++ G+ +
Sbjct: 288 LPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRD 330


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_1805   NUCEPIMERASE  53     1e-09    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 53.2 bits (128), Expect = 1e-09
Identities = 46/248 (18%), Positives = 91/248 (36%), Gaps = 38/248 (15%)

Query: 323 TVLVTGAGGSIGSELCRQIIGLGPTTLLLFDHSEYNLYAILSELEQRVARESLPVRLLPI 382
LVTGA G IG + ++++ G + + N Y +S + R+ + P
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGI---DNLNDYYDVSLKQARLELLAQP-GFQFH 57

Query: 383 LGSVRNQQHLAHVMSTWRVATVYHAAAYKHVPMVEHNMAEGILNNVFGTLCTAQAALQSG 442
+ +++ + + ++ V+ + V N +N+ G L + +
Sbjct: 58 KIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNK 117

Query: 443 VANFVLIST---------------DKAVRPTNVMGGTKRLSELILQALSREAAPVMYGDS 487
+ + + S+ D P ++ TK+ +EL+ S +YG
Sbjct: 118 IQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSH-----LYG-- 170

Query: 488 SKIARVNKTRFTMVRFGNVLGSSGS---VIPLFHKQIKAGGPITV-THPKITRYFMTIPE 543
T +RF V G G + F K + G I V + K+ R F I +
Sbjct: 171 --------LPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDD 222

Query: 544 AAQLVVQA 551
A+ +++
Sbjct: 223 IAEAIIRL 230


81  PP_1941  PP_1953  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
PP_1941   2       48       -9.016857  hypothetical protein
PP_1942   3       47       -8.659031  LysR family transcriptional regulator
PP_1943   3       42       -7.703291  formyltetrahydrofolate deformylase
PP_1944   4       44       -7.190954  glycine cleavage system protein T
PP_1945   3       47       -7.670316  5,10-methylene-tetrahydrofolate dehydrogenase
PP_1946   3       48       -8.028132  short chain dehydrogenase/reductase
PP_1947   4       48       -8.330125  hypothetical protein
PP_1948   3       45       -7.645246  benzaldehyde dehydrogenase
PP_1949   2       48       -7.832346  GMC family oxidoreductase
PP_1950   1       45       -7.632291  hypothetical protein
PP_1951   1       44       -7.533137  short chain dehydrogenase/reductase
PP_1952   1       42       -7.715082  metallo-beta-lactamase
PP_1953   1       41       -7.521216  short chain dehydrogenase/reductase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_1941   HTHTETR  52     2e-10    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 51.6 bits (123), Expect = 2e-10
Identities = 20/83 (24%), Positives = 34/83 (40%), Gaps = 1/83 (1%)

Query: 10 ATLRAEQQQETREKLLRATIETISYKGYQSATIDNITSHAGTGRATFYLHFRSKPEALMA 69
A ++ QETR+ +L + S +G S ++ I AG R Y HF+ K L +
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDK-SDLFS 60

Query: 70 GWQEIYMPQMVNILQNLDESYPA 92
E+ + + +P
Sbjct: 61 EIWELSESNIGELELEYQAKFPG 83


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_1942   PF05043  32     0.004    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 31.8 bits (72), Expect = 0.004
Identities = 17/54 (31%), Positives = 25/54 (46%)

Query: 8 LNLLLLLDELYREQNLSAAARRLGMSQPMASASLRRLREYFEDQLFLSTGRGMR 61
L LL LL E R + S A L ++ L ++ F D +F S+ G+R
Sbjct: 13 LELLELLFEHKRWFHRSELAELLNCTERAVKDDLSHVKSAFPDLIFHSSTNGIR 66


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_1946   DHBDHDRGNASE  127    1e-37    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 127 bits (319), Expect = 1e-37
Identities = 74/254 (29%), Positives = 112/254 (44%), Gaps = 19/254 (7%)

Query: 9 GKVVLVTGAGSGIGRATALAFAQSGASVAVADISTDHGLKTVELVKAEGGEATFFHVDVG 68
GK+ +TGA GIG A A A GA +A D + + K V +KAE A F DV
Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVR 67

Query: 69 SEPSVQSMLAGVVAHYGGLDIAHNNAGIEANIVPLAELDSDNWRRVIDVNLSSVFYCLKG 128
++ + A + G +DI N AG+ + L + W VN + VF +
Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGV-LRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126

Query: 129 EIPLMLKRGGGAIVNTASASGLIGGYRLSGYTATKHGVVGLTKAAAIDYANQNIRINAVC 188
M+ R G+IV S + ++ Y ++K V TK ++ A NIR N V
Sbjct: 127 VSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186

Query: 189 PGPVDSPFLADMPQPM--------------RDRLLFGTPIGRLATAEEIARSVLWLCSDD 234
PG ++ DM + + G P+ +LA +IA +VL+L S
Sbjct: 187 PGSTET----DMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 235 AKYVVGHSMSVDGG 248
A ++ H++ VDGG
Sbjct: 243 AGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_1948   PF03944  30     0.018    delta endotoxin
		>PF03944#delta endotoxin

Length = 633

Score = 30.4 bits (68), Expect = 0.018
Identities = 14/31 (45%), Positives = 17/31 (54%)

Query: 369 APSDKTGWYVRPTVYTNVNNSMRIAREEIFG 399
AP+D TG+ + P T VNN R E FG
Sbjct: 487 APNDYTGFTISPIHATQVNNQTRTFISEKFG 517


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_1949   PF07520  31     0.010    Virulence protein SrfB
		>PF07520#Virulence protein SrfB

Length = 1041

Score = 31.5 bits (71), Expect = 0.010
Identities = 37/152 (24%), Positives = 53/152 (34%), Gaps = 22/152 (14%)

Query: 15 AGSAGCVLANRLSANPEHSVLLLEAGSRPKGLWASMP---------AGVSRVILPGPTNW 65
A +LA + E+SV L A W +P AG + PGP++W
Sbjct: 68 AERDAPILAATTPEDDEYSVRPLAALEPFLEKWVPIPVLRLKNQRGAGGEELYDPGPSSW 127

Query: 66 AY-----QSEPDPSLAG-RRIYVPRGKALGGSSAINGMAYLRGHREDYDHWVSLGCAGWG 119
A +PDP R+ + AL S Y+ R D +
Sbjct: 128 ARLRTVELPQPDPETGHTHRVQIALDTAL--SDQDQSAHYVAPERADSEKPREFRLV--S 183

Query: 120 WDDVLPFYKKFEHREEGDEAFRGRDGELWVTD 151
+ + F R E DE D +LWV+D
Sbjct: 184 DPGAMSW---FLQRLEADEDGNAVDLQLWVSD 212


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_1951   DHBDHDRGNASE  125    6e-37    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 125 bits (316), Expect = 6e-37
Identities = 76/271 (28%), Positives = 123/271 (45%), Gaps = 26/271 (9%)

Query: 9 VEGARVIVTGAASGLGLAFTEAMAESGAQVAMLDLNREALDAQFRRLRSLGYSVRSHVLD 68
+EG +TGAA G+G A +A GA +A +D N E L+ L++ + D
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 69 VTDRDAVDDTFNAVAAGFGGLDIVFANAGI-DPGPGFAALNAAGEREPANMLEEYSDHRW 127
V D A+D+ + G +DI+ AG+ PG + SD W
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGL----------------IHSLSDEEW 109

Query: 128 RKVISVSLDAVFYSIRAAARHMRANRSGSIIVTTSVSALRPAVTLGAAYAAAKAGAAQLV 187
SV+ VF + R+ +++M RSGSI+ S A P ++ AYA++KA A
Sbjct: 110 EATFSVNSTGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMA-AYASSKAAAVMFT 168

Query: 188 RATALELASDGVRVNAIAPGPFETDIGGGFMHNSEVRAKMAA--------GVPMGRIAEV 239
+ LELA +R N ++PG ETD+ + ++ G+P+ ++A+
Sbjct: 169 KCLGLELAEYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKP 228

Query: 240 EEIKPLALYLASKASSFVTGQQFVIDGGLSL 270
+I L+L S + +T +DGG +L
Sbjct: 229 SDIADAVLFLVSGQAGHITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_1953   DHBDHDRGNASE  129    3e-38    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 129 bits (324), Expect = 3e-38
Identities = 81/278 (29%), Positives = 125/278 (44%), Gaps = 43/278 (15%)

Query: 4 QNKIVVLTGAASGIGKATAQLLVEQGAHVVAMDLKSDLLQQAFGSEE----HVLCIPTDV 59
+ KI +TGAA GIG+A A+ L QGAH+ A+D + L++ S + H P DV
Sbjct: 7 EGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADV 66

Query: 60 SDSEAVRAAFQAVDAKFGRVDVIINAAGINAPTREANQKMVDANVAALDAMKSGRAPTFD 119
DS A+ ++ + G +D+++N AG+ + ++
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGV-------------LRPGLIHSL--------- 104

Query: 120 FLADTSDQDFRRVMEVNLFSQFYCIREGVPLMRRAGGGSIVNISSVAALLGVAMPLYYPA 179
SD+++ VN F R M GSIV + S A + Y +
Sbjct: 105 -----SDEEWEATFSVNSTGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYAS 159

Query: 180 SKAAVLGLTRAAAAELAPYNIRVNAIAPGSVDTPL-----MHEQPPEVV------QFLVS 228
SKAA + T+ ELA YNIR N ++PGS +T + E E V F
Sbjct: 160 SKAAAVMFTKCLGLELAEYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTG 219

Query: 229 MQPIKRLAQPEELAQSILFLAGEHSSFITGQTLSPNGG 266
+ P+K+LA+P ++A ++LFL + IT L +GG
Sbjct: 220 I-PLKKLAKPSDIADAVLFLVSGQAGHITMHNLCVDGG 256


82  PP_1974  PP_1981  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias   Product
PP_1974   -2      12       0.989692  excinuclease ABC subunit B
PP_1975   -1      10       1.095466  EmrB/QacA family drug resistance transporter
PP_1976   -1      12       2.088166  secretion protein HlyD family protein
PP_1977   0       14       1.438930  glutamyl-tRNA synthetase
PP_1978   -1      11       1.909900  ******TetR family transcriptional regulator
PP_1979   -2      10       1.741430  hydrolase
PP_1980   -1      11       1.454057  thioesterase
PP_1981   -1      11       1.043469  NifR3/Smm1 family protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
PP_1974   RTXTOXIND  31     0.016    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 31.0 bits (70), Expect = 0.016
Identities = 13/62 (20%), Positives = 23/62 (37%), Gaps = 6/62 (9%)

Query: 612 AKAAEESARYEAELRTPGEITKRIKQLEEKMMQFARDLEFEAAAQLRD---EIAQLRERL 668
+A E Y+++L +I I +E+ + + E +LR I L L
Sbjct: 262 VEAVNELRVYKSQL---EQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLEL 318

Query: 669 IS 670

Sbjct: 319 AK 320


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_1975   TCRTETB  102    2e-25    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 102 bits (256), Expect = 2e-25
Identities = 85/412 (20%), Positives = 169/412 (41%), Gaps = 24/412 (5%)

Query: 18 WIAVMSVMLGAFMAVLDIQITNSSLKDIQGALSATLEEGSWISTSYLVAEIIMIPLTAWL 77
W+ ++S F +VL+ + N SL DI + +W++T++++ I + L
Sbjct: 18 WLCILS-----FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKL 72

Query: 78 VQLLSARRLAVWVSGGFLLSSLLCSMAWNLESMILF-RALQGFTGGALIPLAFTLTLIKL 136
L +RL ++ S++ + + S+++ R +QG A L + +
Sbjct: 73 SDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYI 132

Query: 137 PEHHRAKGMAMFAMTATFAPSIGPTLGGWLTENWGWEYIFYINIPPGLLMIAGLLYGLEK 196
P+ +R K + +GP +GG + W Y+ I + ++ + L+ L+K
Sbjct: 133 PKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMIT-IITVPFLMKLLKK 191

Query: 197 KEAHWELLKSTDYAGIVTLGLGLGCLQVFLEEGHRKDWLESHLIVGLGSVALVSLITFVI 256
+ D GI+ + +G+ +F + S LI V+++S + FV
Sbjct: 192 EVRIKGHF---DIKGIILMSVGIVFFMLFT-----TSYSISFLI-----VSVLSFLIFVK 238

Query: 257 LQFSKPHPLINLRILGNRNFGLSSIASLGMGVGLYGSIYLLPLYLAQVQGYNALQIGEVI 316
P ++ + N F + + + + G + ++P + V + +IG VI
Sbjct: 239 HIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVI 298

Query: 317 MWMG-IPQLFLIPLVPQLMKVISPKVLCALGFCLFGAASFGSGVLNPDFAGPQFNHIQII 375
++ G + + + L+ P + +G + + L F I I+
Sbjct: 299 IFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFL--LETTSWFMTIIIV 356

Query: 376 RALG-QPMIMVTISLIATAYIQQQDAGSASSLFNILRNLGGAIGIALLATLL 426
LG IS I ++ ++QQ+AG+ SL N L GIA++ LL
Sbjct: 357 FVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLL 408


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
PP_1976   RTXTOXIND  152    6e-44    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 152 bits (386), Expect = 6e-44
Identities = 67/411 (16%), Positives = 139/411 (33%), Gaps = 94/411 (22%)

Query: 43 RRLTLFFALVAIIALAFLGHWYFKGRFYESTDNAYVQGEIT------RISSQLGARIETV 96
R + F +IA G+ E A G++T I + ++ +
Sbjct: 58 RLVAYFIMGFLVIAFI----LSVLGQ-VEIV--ATANGKLTHSGRSKEIKPIENSIVKEI 110

Query: 97 PVEDNQHVSKGDLLVR------------LEAADFELAVERAR------------------ 126
V++ + V KGD+L++ +++ + +E+ R
Sbjct: 111 IVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKL 170

Query: 127 ----------------------AALATREAEYAQAQSRLTQQGSLIAAGQAQLAATQATF 164
+T + + Q + L ++ + A++ +
Sbjct: 171 PDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLS 230

Query: 165 DRSRLDLSRAEKLRKPGFVS-------EERVTTLSADSHVAGSQVDKARADLQSQRQQVT 217
+ L L ++ E + + V SQ+++ +++ S +++
Sbjct: 231 RVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQ 290

Query: 218 ALNAELKRL--------DAQIANARTDLAQAELNLTRCEIHAPISGTIGQRNAR-NGQVV 268
+ K I +LA+ E I AP+S + Q G VV
Sbjct: 291 LVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVV 350

Query: 269 QAGAYLLSIVPDED-IWVQANFKETQIGRMHPGQRAELLFDSYPDT---PIEGRVDSLFA 324
L+ IVP++D + V A + IG ++ GQ A + +++P T + G+V ++
Sbjct: 351 TTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINL 410

Query: 325 ASGAQFSLLPPDNATGNFTKVVQRIPIKLTFHADNPLHGRIRPGMSVTATV 375
+ D G V+ I + + + GM+VTA +
Sbjct: 411 DA-------IEDQRLGLVFNVIISIEENCLSTGNKNI--PLSSGMAVTAEI 452


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_1978   HTHTETR  50     4e-10    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 50.0 bits (119), Expect = 4e-10
Identities = 22/85 (25%), Positives = 33/85 (38%)

Query: 1 MSDKKSRTRERILEAARTALIQHGPAEPSVSQVMGAAGLTVGGFYAHFDSKDELMLEAFR 60
+ TR+ IL+ A Q G + S+ ++ AAG+T G Y HF K +L E +
Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64

Query: 61 QLLGERRALLAQIDPNLDGAGRRAL 85
L + G L
Sbjct: 65 LSESNIGELELEYQAKFPGDPLSVL 89


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_1981   PREPILNPTASE  29     0.020    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family

signature.
Length = 290

Score = 29.4 bits (66), Expect = 0.020
Identities = 17/71 (23%), Positives = 26/71 (36%), Gaps = 11/71 (15%)

Query: 208 WRR-CREVSGAEDIMLGRGLVSRPDLGLQIAAARDGRDYQPMSWHDLLPLLREFW----- 261
W+ R +D V P L + + P++ + +PLL W
Sbjct: 45 WQAEYRSYFNPDDEG-----VDEPPYNLMVPRSCCPHCNHPITALENIPLLSWLWLRGRC 99

Query: 262 RQAQAKLSPRY 272
R QA +S RY
Sbjct: 100 RGCQAPISARY 110


83  PP_2064  PP_2073  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
PP_2064   1       14       3.438999   RND family efflux transporter MFP subunit
PP_2065   1       13       3.163152   acriflavin resistance protein
PP_2066   1       12       3.177710   GntR family transcriptional regulator
PP_2067   0       13       3.027609   EmrB/QacA family drug resistance transporter
PP_2068   1       14       3.148934   multidrug efflux MFS membrane fusion protein
PP_2069   1       11       2.477396   multidrug efflux MFS outer membrane protein
PP_2070   -2      16       -0.664622  AraC family transcriptional regulator
PP_2071   -3      17       -1.705029  hypothetical protein
PP_2072   -4      12       -0.713431  AraC family transcriptional regulator
PP_2073   -4      13       -0.725879  acetyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
PP_2064   RTXTOXIND  40     1e-05    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 39.8 bits (93), Expect = 1e-05
Identities = 21/136 (15%), Positives = 53/136 (38%), Gaps = 2/136 (1%)

Query: 57 ASGELEAVNQVQ-VAAEMPGRITRIAFESGQTVAAGQLLVQLNDAPEQALRVQLQARLRN 115
A+G+L + + + + I + G++V G +L++L +A ++ Q+ L
Sbjct: 86 ANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQ 145

Query: 116 ADVVLQRSRKL-RAMNAVSQELLDNAATAVDVARGELQHVEALIAQKAIRAPFAGKLGIR 174
A + R + L R++ L E + + K + + + +
Sbjct: 146 ARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQK 205

Query: 175 RVHQGQYLAAGETIVS 190
++ + A T+++
Sbjct: 206 ELNLDKKRAERLTVLA 221



Score = 31.0 bits (70), Expect = 0.007
Identities = 27/135 (20%), Positives = 47/135 (34%), Gaps = 13/135 (9%)

Query: 91 GQLLVQLNDAPEQALRVQLQARLRNADVVLQRSRKLRAMNAVSQELLDNAATAVDVARGE 150
QL + L + + +L + KLR L E
Sbjct: 269 RVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLL-----------TLE 317

Query: 151 LQHVEALIAQKAIRAPFAGKLGIRRVH-QGQYLAAGETIVSLA-DISQLHVNFALGEQAA 208
L E IRAP + K+ +VH +G + ET++ + + L V + +
Sbjct: 318 LAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDI 377

Query: 209 PEVHAGQVLALTVDA 223
++ GQ + V+A
Sbjct: 378 GFINVGQNAIIKVEA 392


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_2065   ACRIFLAVINRP  778    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 778 bits (2011), Expect = 0.0
Identities = 313/1029 (30%), Positives = 521/1029 (50%), Gaps = 29/1029 (2%)

Query: 5 DVFVRRPVLALVVSSLIILMGLFAMGKLPIRQYPLLESSTITISTEYPGASAELMQGFVT 64
+ F+RRP+ A V++ ++++ G A+ +LP+ QYP + +++S YPGA A+ +Q VT
Sbjct: 3 NFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVT 62

Query: 65 QPITQAVSSVEGIDYLSSSSQQ-GRSLITLRMVLNRDSTQALAETMAKVNQVRYRLPEKA 123
Q I Q ++ ++ + Y+SS+S G ITL D A + K+ LP++
Sbjct: 63 QVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEV 122

Query: 124 YDPVVELSAGDSTAVAYVGFASDS--LSIPELSDYLSRVVEPQFSGIDGVAKVQSFGGQR 181
+ + S+ + GF SD+ + ++SDY++ V+ S ++GV VQ FG Q
Sbjct: 123 QQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY 182

Query: 182 LAMRLWLDSEQMAGRGVTAADVAQAVRANNYQATPGQV------RGQYVLADIQVDTDLT 235
AMR+WLD++ + +T DV ++ N Q GQ+ GQ + A I T
Sbjct: 183 -AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241

Query: 236 RVEDFRELIIR-NDGTDLVRLRDIGTVELSAAATQTSATMDGKPAVHLGLFPTPSGNPLV 294
E+F ++ +R N +VRL+D+ VEL A ++GKPA LG+ N L
Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301

Query: 295 IVEGIRQLLPQIQQTLPPGVYVALAYETARFIDASIHEVLRTLVEAMLIVVLVIWLCLGS 354
+ I+ L ++Q P G+ V Y+T F+ SIHEV++TL EA+++V LV++L L +
Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361

Query: 355 LRSVLIAVVAIPLSMLGAAGLMLMFGFSLNLLTLLAMVLAIGLVVDDAIVVVENVHRHIE 414
+R+ LI +A+P+ +LG ++ FG+S+N LT+ MVLAIGL+VDDAIVVVENV R +
Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421

Query: 415 EGKS-PIAAALAGAREIAGPVIAMTLTLAAVYAPIGLMGGLTGTLFREFALTLAGAVIVS 473
E K P A +I G ++ + + L+AV+ P+ GG TG ++R+F++T+ A+ +S
Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481

Query: 474 GIVALTLSPVMSSLLLQPGQQH-----GAMAAIADRLFGTLSGVYGRVLAYTLAHRWISG 528
+VAL L+P + + LL+P G + F Y + L
Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541

Query: 529 GVALLVCLSLPWLYLLPQRELAPPEDQAAVLTAIKSPQHASLEYAERFALK-LDQVMKSI 587
+ L+ + L+L P EDQ LT I+ P A+ E ++ + D +K+
Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601

Query: 588 AET-----TDTWIINGTDGPAASFGGINLSAWQAR---ERSAAQVQAQLQQAVADIEGSS 639
T A ++L W+ R E SA V + + + I
Sbjct: 602 KANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGF 661

Query: 640 IFAFQVA--SLPGSSGGLPVQMVLRSAQDYPELFQTMEVLKQRARDS-GLFAVVDSDLDY 696
+ F + G++ G +++ ++ + L Q L A V +
Sbjct: 662 VIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGLE 721

Query: 697 NNPVVKVRVDRAKAASLGISMQAIGESLGVLVGEQYLNRFALFGRSYDVIPQSIQDQRLT 756
+ K+ VD+ KA +LG+S+ I +++ +G Y+N F GR + Q+ R+
Sbjct: 722 DTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRML 781

Query: 757 PAALSRQYVRAEDGSLVPLATLVRLDIEVAPNRLLQFDQQNASTLQAIPAPGVSMGNAVA 816
P + + YVR+ +G +VP + RL +++ + +Q APG S G+A+A
Sbjct: 782 PEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAMA 841

Query: 817 FLEQLTAELPPGFSHDWQSESRQYVQEGFALMWAFLAALVVIYLVLAAQYESLVDPLIIL 876
+E L ++LP G +DW S Q G + VV++L LAA YES P+ ++
Sbjct: 842 LMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVM 901

Query: 877 VTVPLSICGALLPLALGWATLNIYTQIGLVTLIGLISKHGILMVAFANEIQVRDNLDRAA 936
+ VPL I G LL L ++Y +GL+T IGL +K+ IL+V FA ++ ++
Sbjct: 902 LVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVE 961

Query: 937 AIVRAAQIRLRPVLMTTAAMTFGVLPLLFASGAGANSRFGLGVVIVCGMLVGTLFTLFVL 996
A + A ++RLRP+LMT+ A GVLPL ++GAG+ ++ +G+ ++ GM+ TL +F +
Sbjct: 962 ATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFV 1021

Query: 997 PTIYAWLAR 1005
P + + R
Sbjct: 1022 PVFFVVIRR 1030


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_2067   TCRTETB  98     7e-24    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 97.6 bits (243), Expect = 7e-24
Identities = 71/394 (18%), Positives = 156/394 (39%), Gaps = 17/394 (4%)

Query: 46 FMAGMNVHVTSAALPEIRGSLGASFEEGSWISTAYLVAEIVMIPLTAWLVDVFSLRRVMW 105
F + +N V + +LP+I +W++TA+++ + + L D ++R++
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 106 TGSLIFLIASVACSWAPN-LEAMIVIRVIQGAAGAVLIPLSFQLIITELPASKMAMGMAL 164
G +I SV + +I+ R IQGA A L ++ +P L
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 165 FSLANSVAQAAGPSIGGWLTDAYSWRWIFYLQLFPGIALLLAIAWSIEAKPMKLELLRKG 224
++ + GP+IGG + W ++ + + I + + + +K
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHF---- 199

Query: 225 DWLGIAAMVIGLGGLQIVLEEGGRLDWFGSPLIVGMSVVAAIALVVFVVTQLFGQRAFIN 284
D GI M +G+ + +L F + + +V+ ++ ++FV F++
Sbjct: 200 DIKGIILMSVGI--VFFML--------FTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVD 249

Query: 285 LRLLGHYNFGVASVAMFIFGAATFGLVFLVPNYLSQLQGFSAHDVGVALIAYGVVQLLL- 343
L + F + + I G V +VP + + S ++G +I G + +++
Sbjct: 250 PGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIF 309

Query: 344 APLMPRLMGWTSAKFMVASGFLIMALGCWLGAGLSADSADNVIIPSTVVRGIGQPFIMVA 403
+ L+ +++ G +++ +L A ++ + V G F
Sbjct: 310 GYIGGILVDRRGPLYVLNIGVTFLSVS-FLTASFLLETTSWFMTIIIVFVLGGLSFTKTV 368

Query: 404 LSVLAVAGLDKREAGSASAVFSMLRNLGGAIGTA 437
+S + + L ++EAG+ ++ + L G A
Sbjct: 369 ISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIA 402


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
PP_2068   RTXTOXIND  127    1e-34    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 127 bits (321), Expect = 1e-34
Identities = 67/428 (15%), Positives = 134/428 (31%), Gaps = 77/428 (17%)

Query: 52 PSPAETEQRPSAKTRRRLAVIASGSLAAITLLAFTSYWFSTGRY---LETTDDAYVRADW 108
P+ E + P ++ R +A + ++AF G+
Sbjct: 43 PAHLELIETPVSRRPRLVAYF----IMGFLVIAFI--LSVLGQVEIVATANGKLTHSGRS 96

Query: 109 VALSPRVAGYVAKVEVADDQPVKAGDVLVRLQNRDYRARLDQARAGVTEAQA-------- 160
+ P V ++ V + + V+ GDVL++L A + ++ + +A+
Sbjct: 97 KEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQIL 156

Query: 161 -------------------------------------ALAAAQASQQVATERIDQQQQAI 183
+ Q + +D+++
Sbjct: 157 SRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAER 216

Query: 184 LQAEAVVRSATAEQRRSELDVQRYRGLVRDDAATVQRLETASAHASQAQAALQGAQAALR 243
L A + R + + + L+ A + +A L+ ++ L
Sbjct: 217 LTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLE 276

Query: 244 QQRSQLAMAKARAAQAEAELQQRAAALARAQAHQQL---------AEQDEQDTVIRAPIT 294
Q S++ AK + Q + E+ +Q +VIRAP++
Sbjct: 277 QIESEILSAKEEYQLVTQLFKNEILD-KLRQTTDNIGLLTLELAKNEERQQASVIRAPVS 335

Query: 295 GVVGQRRVRT-GQYVVPGQPLLAVVPLQQAYVV-ANYKETQLARMRPGQPVEIRVDSFAS 352
V Q +V T G V + L+ +VP V A + + + GQ I+V++F
Sbjct: 336 VKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPY 395

Query: 353 Q---PLHGHVASFSPASGNVFALLPSDNATGNFTKIVQRFPVRILLDKPLDGPQVLPGMS 409
L G V + + + D G ++ L + GM+
Sbjct: 396 TRYGYLVGKVKNINLDA-------IEDQRLGLVFNVIISIEENCLSTG-NKNIPLSSGMA 447

Query: 410 VVSTVDTR 417
V + + T
Sbjct: 448 VTAEIKTG 455


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
PP_2069   RTXTOXIND  36     3e-04    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 36.0 bits (83), Expect = 3e-04
Identities = 11/102 (10%), Positives = 32/102 (31%)

Query: 367 VSKAEALQRAQVARYRGVALSALKDVRQALARYDGERQRLQALDAALAHSQHSFALAQGN 426
V + +L + Q + ++ ++ + A R+ + +
Sbjct: 184 VLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSL 243

Query: 427 YRAGTVDGLALLDSEREMISLRANHTEARGRLAQAQVNLFRA 468
+ A+L+ E + + + +L Q + + A
Sbjct: 244 LHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSA 285


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_2073   SACTRNSFRASE  31     8e-04    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 31.5 bits (71), Expect = 8e-04
Identities = 14/74 (18%), Positives = 29/74 (39%), Gaps = 10/74 (13%)

Query: 82 DDQPLGFVGFNEN-----HVEMLFVDPARHRQGIGRALLDFGRQ---SRSAMSVDVNEQN 133
++ +G + N +E + V ++G+G ALL + + + Q+
Sbjct: 73 ENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQD 132

Query: 134 PQATA--FYQHYGF 145
+A FY + F
Sbjct: 133 INISACHFYAKHHF 146


84  PP_2089  PP_2093  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias   Product
PP_2089   1       16       2.555587  OmpF family protein
PP_2090   2       16       3.527247  uroporphyrin-III C-methyltransferase
PP_2091   0       16       3.191887  serine/threonine-protein kinase
PP_2092   -2      10       2.789759  nitrite transporter
PP_2093   -3      11       2.253686  response regulator receiver and ANTAR
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
PP_2089   OMPADOMAIN  133    3e-38    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 133 bits (335), Expect = 3e-38
Identities = 80/365 (21%), Positives = 129/365 (35%), Gaps = 74/365 (20%)

Query: 15 VAATSIGAMAQGQGAVETEIFY------KKEFFDSQRDFKNDGN-------LFGGSIGYF 61
+A G Q A + +Y ++ D+ F N+ G GY
Sbjct: 8 IAVALAGFATVAQAAPKDNTWYTGAKLGWSQYHDTG--FINNNGPTHENQLGAGAFGGYQ 65

Query: 62 LTDDVELRLGYDEVHNARGEDGKN-----IKGSNTALDAVYHFNNPYDAIRPYVSAGFSH 116
+ V +GYD + + +G Y + D Y G
Sbjct: 66 VNPYVGFEMGYDWLGRMPYKGSVENGAYKAQGVQLTAKLGYPITDDLDI---YTRLGGMV 122

Query: 117 -QSLGQTGRGGRDHSTFAN--VGAGAKWYITDMFYARAGVEAQYNI-DQGDTEWAP---- 168
++ ++ G++H T + G ++ IT R + NI D P
Sbjct: 123 WRADTKSNVYGKNHDTGVSPVFAGGVEYAITPEIATRLEYQWTNNIGDAHTIGTRPDNGM 182

Query: 169 -SVGVGLNFGGSPKQAEAAPAPVAEVCSDSDNDGVCDNVDKCPDTPANVTVDADGCPAVA 227
S+GV FG APAP
Sbjct: 183 LSLGVSYRFGQGEAAPVVAPAPAPAP------------------------------EVQT 212

Query: 228 EVVRVELDVKFDFDKSVVKPNSYGDIKNLADFMKQY--PQTTTVVEGHTDSVGPDAYNQK 285
+ ++ DV F+F+K+ +KP + L + + VV G+TD +G DAYNQ
Sbjct: 213 KHFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQG 272

Query: 286 LSERRANAVKQVLTQQYGVESSRVDSVGYGETRPVADNATEEGR---------AVNRRVE 336
LSERRA +V L + G+ + ++ + G GE+ PV N + + A +RRVE
Sbjct: 273 LSERRAQSVVDYLISK-GIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVE 331

Query: 337 AQVEA 341
+V+
Sbjct: 332 IEVKG 336


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_2091   YERSSTKINASE  38     1e-04    Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 37.8 bits (87), Expect = 1e-04
Identities = 35/111 (31%), Positives = 52/111 (46%), Gaps = 11/111 (9%)

Query: 362 VARQLLQAVGVLHRRNLLHRDIKPDNLHLGR-DGQLRLLDFGLAYCPGLSEDPLHELPG- 419
+A +LL L + ++H DIKP N+ R G+ ++D GL G E P
Sbjct: 250 IAHRLLDVTNHLAKAGVVHNDIKPGNVVFDRASGEPVVIDLGLHSRSG-------EQPKG 302

Query: 420 -TPSYIAPEAFDGH-PPSPRQDLYAVGVTLYHLLTGHYPYGEVEAFQRPRF 468
T S+ APE G+ S + D++ V TL H + G E++ Q RF
Sbjct: 303 FTESFKAPELGVGNLGASEKSDVFLVVSTLLHCIEGFEKNPEIKPNQGLRF 353


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_2092   TCRTETB  39     3e-05    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 38.7 bits (90), Expect = 3e-05
Identities = 80/415 (19%), Positives = 142/415 (34%), Gaps = 81/415 (19%)

Query: 46 IAADLQLSAQQRGLMVATPILAGAILRFAMGVLVDRLSPKTAGLIGQVVVIVALAAAWHL 105
IA D + +L +I G L D+L K L G ++I +
Sbjct: 40 IANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFG--IIINCFGSVIGF 97

Query: 106 GVHSYEQALLLGVFL-GFAGASF-AVSLPLASQWYPPQHQGKAMG-IAGAGNSGTVFAAL 162
HS+ L++ F+ G A+F A+ + + +++ P +++GKA G I G
Sbjct: 98 VGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPA 157

Query: 163 LAPALAAGFGWNNVFGFALIPLSLALVVFALLARNAPQRPKPKAMADYLKAL-------- 214
+ +A W+ + LIP+ + V L + + + K D +
Sbjct: 158 IGGMIAHYIHWSYLL---LIPMITIITVP-FLMKLLKKEVRIKGHFDIKGIILMSVGIVF 213

Query: 215 ----GDRDSWWFMFFYSVTFGGFI------------------------------------ 234
S F+ ++F F+
Sbjct: 214 FMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVA 273

Query: 235 GLASALPGYFSDQYGLSPVTAGYYTAACVFAGSL----MRPLGGALADRFGGIRTLLGMY 290
G S +P D + LS G + +F G++ +GG L DR G + L
Sbjct: 274 GFVSMVPYMMKDVHQLSTAEIG---SVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGV 330

Query: 291 GVAAICIAAVGFNLPSAAAALALFVSAMLG-LGAGNGAVFQLVPQRFR-QEIGVMTGLI- 347
++ F L + + + + + +LG L + +V + QE G L+
Sbjct: 331 TFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLN 390

Query: 348 -----GMAGGIG--GFLLAAGL-------GTIKQHTGDYQLGLWLFASLGLLAWF 388
GI G LL+ L + Q T Y L LF+ + +++W
Sbjct: 391 FTSFLSEGTGIAIVGGLLSIPLLDQRLLPMEVDQSTYLYSNLLLLFSGIIVISWL 445


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
PP_2093   HTHFIS  49     2e-09    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 49.4 bits (118), Expect = 2e-09
Identities = 25/124 (20%), Positives = 54/124 (43%), Gaps = 2/124 (1%)

Query: 3 RILLIDDTQSKLGRLRAALSEAGFEIIEAPDLTIDLPACVETVRPDVVLIDTDSPDRDVM 62
IL+ DD + L ALS AG+++ + L + D+V+ D PD +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAA-TLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 63 EQVVLVSRDQPR-PIVLFTDEHDPGMMRQAIQAGVSAYIVEGIHAARLQPILDVAMARFE 121
+ + + + +P P+++ + ++ +A + G Y+ + L I+ A+A +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 122 SDQA 125
+
Sbjct: 124 RRPS 127


85  PP_2302  PP_2309  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
PP_2302   2       12       -0.933222  ATP-dependent protease La
PP_2303   1       11       -0.139851  histone family protein DNA-binding protein
PP_2304   0       12       0.137662   PpiC-type peptidyl-prolyl cis-trans isomerase
PP_2305   0       14       0.970517   patatin
PP_2306   3       15       1.740416   lipoprotein
PP_2307   2       14       1.812481   CHAD domain-containing protein
PP_2308   0       12       1.612752   acyl-CoA thioesterase
PP_2309   0       10       1.545643   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_2302   PF05272  31     0.024    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.8 bits (69), Expect = 0.024
Identities = 13/83 (15%), Positives = 29/83 (34%), Gaps = 6/83 (7%)

Query: 292 DWLVQVPWKAQSKVRLDLAKAEEILDADHYGLEEVKERILEYLAVQKRVKKIRGP----- 346
DW+ W ++ L D+ +++ + V ++ P
Sbjct: 537 DWVKAQQWDEVPRLEKWLVHVLGKTPDDYKPRRLRYLQLVGKYILMGHVARVMEPGCKFD 596

Query: 347 -VLCLVGPPGVGKTSLAESIAAA 368
+ L G G+GK++L ++
Sbjct: 597 YSVVLEGTGGIGKSTLINTLVGL 619


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_2303   DNABINDINGHU  120    1e-39    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 120 bits (303), Expect = 1e-39
Identities = 48/88 (54%), Positives = 64/88 (72%)

Query: 2 NKSELIDAIAASADIPKAVAGRALDAVIESVTGALKQGDDVVLVGFGTFSVKERAERTGR 61
NK +LI +A + ++ K + A+DAV +V+ L +G+ V L+GFG F V+ERA R GR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 62 NPQTGKAIKIEAAKVPGFKAGKGLKDAV 89
NPQTG+ IKI+A+KVP FKAGK LKDAV
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAV 90


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_2304   2FE2SRDCTASE  31     0.012    Ferric iron reductase signature.
		>2FE2SRDCTASE#Ferric iron reductase signature.

Length = 262

Score = 30.8 bits (69), Expect = 0.012
Identities = 13/38 (34%), Positives = 19/38 (50%)

Query: 536 GEDGIDPAELQALFRLGKPQAKDKPVYGSVVLRDGSLV 573
GE ++ F +D P++ +VVLRDG LV
Sbjct: 203 GEATVESLRHALFFEKTLTNGEDNPLWRTVVLRDGLLV 240


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_2309   ACRIFLAVINRP  30     0.003    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 29.8 bits (67), Expect = 0.003
Identities = 14/39 (35%), Positives = 24/39 (61%), Gaps = 3/39 (7%)

Query: 30 LIAVPLFILGTLLVLSGLFGFDLGQIAVGVIALIAALGL 68
IAVP+ +LGT +L+ FG+ + + + + L A+GL
Sbjct: 369 TIAVPVVLLGTFAILA-AFGYSINTLTMFGMVL--AIGL 404


86  PP_2409  PP_2416  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias   Product
PP_2409   -2      11       1.748799  cobalt-zinc-cadmium resistance protein CzcB
PP_2410   -1      12       1.138907  cobalt-zinc-cadmium resistance protein CzcA
PP_2411   0       10       1.475467  major facilitator family transporter
PP_2412   0       10       1.998286  ParA family protein
PP_2413   -1      9        2.865663  diguanylate cyclase
PP_2414   0       11       2.726471  hypothetical protein
PP_2415   -1      10       2.285003  acetyltransferase
PP_2416   -1      11       2.362504  iron ABC transporter ATP-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
PP_2409   RTXTOXIND  44     9e-07    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 43.7 bits (103), Expect = 9e-07
Identities = 26/136 (19%), Positives = 49/136 (36%), Gaps = 13/136 (9%)

Query: 140 ASQQISDLRSEQQAAQRRLELARLTFQREQQLWQERISAEQDYLQARQALQEAEIALANA 199
A ++ +S+ + + + A+ +Q QL++ I + L E+A
Sbjct: 264 AVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEE 323

Query: 200 RQKVAAVGPAGAGNRYELRAPFDAVVVE-KHLTVGEVVDETSNAFTLS-DLSRVWATFAV 257
RQ +RAP V + K T G VV + + + T V
Sbjct: 324 RQ-----------QASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALV 372

Query: 258 APRDLGKVVTGREVTV 273
+D+G + G+ +
Sbjct: 373 QNKDIGFINVGQNAII 388



Score = 35.6 bits (82), Expect = 4e-04
Identities = 19/131 (14%), Positives = 46/131 (35%), Gaps = 13/131 (9%)

Query: 79 AGIQLAAAGPRELGTAISFPGEIRFDEDRTAHVVPRVPGVVEAVQAELGQAVKRGQVLAV 138
I + ++ + G++ + P +V+ + + G++V++G VL
Sbjct: 68 LVIAFILSVLGQVEIVATANGKLTHSGRSKE-IKPIENSIVKEIIVKEGESVRKGDVLLK 126

Query: 139 IASQQISDLRSEQQAAQRRLELARLTFQREQ---------QLWQERISAEQDYLQARQAL 189
+ + ++ Q L ARL R Q +L + ++ E + +
Sbjct: 127 LTALG---AEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEE 183

Query: 190 QEAEIALANAR 200
+L +
Sbjct: 184 VLRLTSLIKEQ 194


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_2410   ACRIFLAVINRP  780    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 780 bits (2016), Expect = 0.0
Identities = 228/1062 (21%), Positives = 427/1062 (40%), Gaps = 55/1062 (5%)

Query: 5 LIQFAIEQRLVVMLAVVLMAAVGIHSYQKLPIDAVPDITNVQVQINTAAPGYSPLETEQR 64
+ F I + + + +++ G + +LP+ P I V ++ PG +
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 65 ITFAIETAMAGLPGLKQTRSLSRS-GLSQVTVIFDDGTDIFFARQLVNERLQVAREQLPE 123
+T IE M G+ L S S S G +T+ F GTD A+ V +LQ+A LP+
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 124 GIEAGMGPISTGLGEIFLWTVEAQEGALKEDGTPYTPTDLRVIQDWIIKPQLRNVPGVAE 183
++ + +L D T D+ +K L + GV +
Sbjct: 121 EVQQQGISVEKSSS-SYLMVA-----GFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGD 174

Query: 184 VNSIGGHAKQYLIAPEPKRLAAYKLTLNDLIAALERNNANIGAGYI------ERNGEQLL 237
V G I + L YKLT D+I L+ N I AG +
Sbjct: 175 VQLFGAQ-YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNAS 233

Query: 238 IRAPGQVASAEDIANIVI-SSVDGTPIRVSHVAQVGLGEELRSGAATENGREVVLGTVFM 296
I A + + E+ + + + DG+ +R+ VA+V LG E + A NG+ + +
Sbjct: 234 IIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKL 293

Query: 297 LIGENSRTVSQAVAAKLVEINRNLPKGVVAVTVYDRTNLVEKAIATVKKNLIEGAILVIA 356
G N+ ++A+ AKL E+ P+G+ + YD T V+ +I V K L E +LV
Sbjct: 294 ATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFL 353

Query: 357 VLFLFLGNIRAALITAMVIPLSMLFTFTGMFSNKVSANLMSLG--ALDFGIIVDGAVVIV 414
V++LFL N+RA LI + +P+ +L TF + + S N +++ L G++VD A+V+V
Sbjct: 354 VMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVV 413

Query: 415 ENAIRRLAHAQQRHGRMLTRGERFHEVFAAAREARRPLIYGQLIIMVVYLPIFALTGVEG 474
EN R + + + + + L+ +++ V++P+ G G
Sbjct: 414 ENVERVMMEDKLP---------PKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTG 464

Query: 475 KMFHPMAFTVVMALLGAMILSVTFVPAAIALFVTGKVKEEEGL----------VMRTARQ 524
++ + T+V A+ ++++++ PA A + E +
Sbjct: 465 AIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVN 524

Query: 525 RYAPVLAWVLGRRKLACAAAAALVLLSGVMASRMGSEFIPSLSEGDFALQALRVPGTSLS 584
Y + +LG A +V V+ R+ S F+P +G F G +
Sbjct: 525 HYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQE 584

Query: 585 QSVD-MQQRLEQAIIAQVPEVERVFARTGTAEIASDPMPPNISDAYVMLRPREQWVDPGK 643
++ + Q + + + VE VF G + N A+V L+P E+
Sbjct: 585 RTQKVLDQVTDYYLKNEKANVESVFTVNG---FSFSGQAQNAGMAFVSLKPWEERNGDEN 641

Query: 644 PRDELIAQVQRAAASVPGSNYELSQPIQLRFNELISGVRSDVA-VKLFGDDMEVLNRTAA 702
+ +I + + + EL + D + G + L +
Sbjct: 642 SAEAVIHRAKMELGKIRDGFVIPFNM--PAIVELGTATGFDFELIDQAGLGHDALTQARN 699

Query: 703 QIAASLQGVPGA-SEVKVEQTTGLPVLTIDIDRDKAARHGLNVGDVQDAIAIAVGGRTAG 761
Q+ P + V+ +++D++KA G+++ D+ I+ A+GG
Sbjct: 700 QLLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVN 759

Query: 762 TLYEGDRRFDMVVRLSETLRTDVDGLASLLIPVPASAAERAGQIGFIPLSQVATLNLQLG 821
+ R + V+ R + + L + G+ +P S T + G
Sbjct: 760 DFIDRGRVKKLYVQADAKFRMLPEDVDKLYVR------SANGE--MVPFSAFTTSHWVYG 811

Query: 822 PNQVSREDGKRVVVVSANVRGRDLGSFVQEAEQALIDQVQVPPGYWTRWGGQFEQLQSAA 881
++ R +G + + + L ++P G W G Q + +
Sbjct: 812 SPRLERYNGLPSMEIQGEAAPGTSSGDAMALMENL--ASKLPAGIGYDWTGMSYQERLSG 869

Query: 882 ERLQVVVPVALLLVMALLLMMFNNLRDGLLVFTGIPFALTGGVLALWARDIPLSISAGVG 941
+ +V ++ ++V L ++ + + V +P + G +LA + + VG
Sbjct: 870 NQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVG 929

Query: 942 FIALSGVAVLNGLVMIAFIRGLRE-EGRTLRAAVEEGALTRLRPVLMTALVASLGFIPMA 1000
+ G++ N ++++ F + L E EG+ + A RLRP+LMT+L LG +P+A
Sbjct: 930 LLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLA 989

Query: 1001 LATGTGAEVQRPLATVVIGGILSSTALTLLVLPALYQWAYRR 1042
++ G G+ Q + V+GG++S+T L + +P + R
Sbjct: 990 ISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRRC 1031


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_2411   TCRTETA  37     1e-04    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 37.1 bits (86), Expect = 1e-04
Identities = 67/321 (20%), Positives = 111/321 (34%), Gaps = 58/321 (18%)

Query: 52 VALLKTFAVFAVAFALRPLGGIVFGALGDRLGRKRILSLTILLMAGSTTLIGLLPTYASI 111
LL +A+ A A P+ G AL DR GR+ +L +++ A ++ P
Sbjct: 46 GILLALYALMQFACA--PVLG----ALSDRFGRRPVLLVSLAGAAVDYAIMATAPFL--- 96

Query: 112 GLAAPALLTLARCLQGFSAGGEYAGACAYLMEHAPDDKRAFYGSFVPVSTFSAFACAAVI 171
+L + R + G + G A A AY+ + D+RA + F+ V+
Sbjct: 97 -----WVLYIGRIVAGIT-GATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVL 150

Query: 172 AYGLEASLSTEAMNAWGWRIPFLIAAPLGLVGLYLRWRMEETPAFREAVAQGKEHEHSPL 231
GL S PF AA L + + E+ + E PL
Sbjct: 151 G-GLMGGFSP--------HAPFFAAAALNGLNFLTGCFL-----LPES----HKGERRPL 192

Query: 232 KETLRHHGRVIRNLGAFISLTALSFYMFTTYFATYLQLVGNLTRAQSLLVT--------- 282
+ + R + AL F +QLVG + A ++
Sbjct: 193 RREALNPLASFRWARGMTVVAALMAVFFI------MQLVGQVPAALWVIFGEDRFHWDAT 246

Query: 283 TVALLFAAVGCP-------LAGAFSDRVGRRKTIGFTCLWVMLCVFPAYWLASSGSMSGA 335
T+ + AA G + G + R+G R+ + + LA + A
Sbjct: 247 TIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYI---LLAFATRGWMA 303

Query: 336 LLGVILLAVGALCSGVVTAAL 356
++LLA G + + A L
Sbjct: 304 FPIMVLLASGGIGMPALQAML 324


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_2416   PF05272  28     0.045    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.1 bits (62), Expect = 0.045
Identities = 9/22 (40%), Positives = 13/22 (59%)

Query: 40 LGIVGPNGSGKSSLLKLLAGLR 61
+ + G G GKS+L+ L GL
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLD 620


87  PP_2783  PP_2790  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
PP_2783   2       14       -2.619153  3-oxoacyl-ACP reductase
PP_2784   1       13       -2.945318  short chain dehydrogenase/reductase
PP_2785   -1      12       -1.808100  hypothetical protein
PP_2786   -2      11       -1.321072  hypothetical protein
PP_2787   -2      12       -0.739623  transporter
PP_2788   -1      11       1.657662   MerR family transcriptional regulator
PP_2789   -2      11       2.997171   oxidoreductase
PP_2790   -1      11       3.733179   Fis family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_2783   DHBDHDRGNASE  135    4e-41    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 135 bits (341), Expect = 4e-41
Identities = 83/249 (33%), Positives = 122/249 (48%), Gaps = 12/249 (4%)

Query: 4 KIAVVTGGSRGIGKSIVLALAGAGYQVAFSYVRDEASAAALQAQVEGLGRDCLAVQCDVK 63
KIA +TG ++GIG+++ LA G +A E + + + R A DV+
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSL-KAEARHAEAFPADVR 67

Query: 64 EAPSIQAFFERVEQRFERIDLLVNNAGITRDGLLATQSLNDITEVIQTNLVGTLLCCQQV 123
++ +I R+E+ ID+LVN AG+ R GL+ + S + N G + V
Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 124 LPCMMRQRSGCIVNLSSVAAQKPGKGQSNYAAAKGGVEALTRALAVELAPRNIRVNAVAP 183
MM +RSG IV + S A P + YA++K T+ L +ELA NIR N V+P
Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187

Query: 184 GIVSTDMSQAL---VGAHEQEI-----QSRLLI--KRFARPEEIADAVLYLA-ERGLYIT 232
G TDM +L EQ I + I K+ A+P +IADAVL+L + +IT
Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHIT 247

Query: 233 GEVLSVNGG 241
L V+GG
Sbjct: 248 MHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_2784   DHBDHDRGNASE  96     5e-26    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 96.3 bits (239), Expect = 5e-26
Identities = 76/258 (29%), Positives = 114/258 (44%), Gaps = 13/258 (5%)

Query: 5 RTIVITGAANGIGRAVAESFAAQAEHLLILLDRDLATLQGWVTEGEFAARIETHQANIAD 64
+ ITGAA GIG AVA + A+Q H+ + + + A E A++ D
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 65 LASLQLLFKGLADRVGFVDVLVNSAGVCDENEPEDL--DNWHKVISINLNGTFYVTSLCL 122
A++ + + +G +D+LVN AGV L + W S+N G F +
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 123 PLMAD--GGRIVNMSSILGRAGKVRNTAYCASKHGIIGMTKALALDLAPRRITVNAILPA 180
M D G IV + S + AY +SK + TK L L+LA I N + P
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 181 WIDTPMLQ----GELAAQARIAGITHEQILRNAKKKLPLRRFIQGDEVAAMVRYLASPQA 236
+T M E A+ I G L K +PL++ + ++A V +L S QA
Sbjct: 189 STETDMQWSLWADENGAEQVIKGS-----LETFKTGIPLKKLAKPSDIADAVLFLVSGQA 243

Query: 237 SGVTAQSLMIDGGAGLGM 254
+T +L +DGGA LG+
Sbjct: 244 GHITMHNLCVDGGATLGV 261


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_2787   ACRIFLAVINRP  55     1e-09    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 54.8 bits (132), Expect = 1e-09
Identities = 46/343 (13%), Positives = 114/343 (33%), Gaps = 38/343 (11%)

Query: 171 EMASMADLENISLSADGELWIHKTLHALDMDPIKVE--AQIMGNEQMVGGVVSADKK--V 226
+ + ++L + D ++++ A++ + + + K
Sbjct: 239 RFKNPEEFGKVTLRVNS-----------DGSVVRLKDVARVELGGENYNVIARINGKPAA 287

Query: 227 AMVVAELGTKQDDAQAQLRAYHQVREIIAKYQAAHPEFTDEVFIAGMPIFIAAQQEIIDH 286
+ + A A L ++ +A+ Q P+ ++ F+ +
Sbjct: 288 GLGIK----LATGANA-LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVK 342

Query: 287 DLAMLFPIVFLLVTSLLVFFFRKPLGVVLPLFNILFCTIWTLGLMALLRVPMDLLTSVLP 346
L +VFL++ F + ++P + + T ++A ++ LT
Sbjct: 343 TLFEAIMLVFLVM----YLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGM 398

Query: 347 VFLFTICCADAIHVMAEYYEQLNSGKS-FREANRETQRLMVTPVVLTTVTTIATFL-IST 404
V + DAI V+ + K +EA ++ + +V + A F+ ++
Sbjct: 399 VLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAF 458

Query: 405 TNNIVSI--RNFGVFMSIGLTAALIISLLLIPAWISIWGKDAVPRKVQLKESLISHYLVV 462
R F + + + +++++L+L PA + K + K +
Sbjct: 459 FGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTT 518

Query: 463 F----------CAWLIRWRKPVLLVTLPLLAMMTVFTFKVDIE 495
F ++ LL+ ++A M V ++
Sbjct: 519 FDHSVNHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSS 561



Score = 52.9 bits (127), Expect = 5e-09
Identities = 36/203 (17%), Positives = 88/203 (43%), Gaps = 18/203 (8%)

Query: 670 PANLQVTHAGTPYIWTGVLQEITQGQVLSFSLALLAVTLMMMFWLKSVRLGILGMLTLLT 729
P ++V PY T +Q V + A++ V L+M +L+++R ++ + +
Sbjct: 318 PQGMKVL---YPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPV 374

Query: 730 TSVTVYGSMYLLDIELNIGTTLVTFLVVG-VVDYAVHLLSRI-KMLVQKGIEIDEAILAA 787
+ + + +N T L +G +VD A+ ++ + +++++ + EA +
Sbjct: 375 VLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKS 434

Query: 788 MQGVGRSTVVNVVIFSMGFVALLFSA------YKPVIDLGVLVILALSSSGFMTILLVTL 841
M + + V ++ S F+ + F Y+ + ++ A++ S + ++L
Sbjct: 435 MSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQ---FSITIVSAMALSVLVALILT-- 489

Query: 842 ISPWFFASIVPQPAVQEGEQPGG 864
P A+++ + + E GG
Sbjct: 490 --PALCATLLKPVSAEHHENKGG 510



Score = 49.8 bits (119), Expect = 4e-08
Identities = 36/199 (18%), Positives = 82/199 (41%), Gaps = 19/199 (9%)

Query: 649 SVAGDYQAMLDKLDAWLAINKPANLQVTHAGTPYIWTGVLQEITQGQVLSFSLALLAVTL 708
+ +GD A+++ L + L PA + G Y + +++ + V L
Sbjct: 834 TSSGDAMALMENLASKL----PAGIGYDWTGMSY----QERLSGNQAPALVAISFVVVFL 885

Query: 709 MMMFWLKSVRLGILGMLTLLTTSVTVYGSMYLLDIELNIGTTLVTFLVVGVVDY-AVHLL 767
+ +S + + ML + V V + L + + ++ + +G+ A+ ++
Sbjct: 886 CLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIV 945

Query: 768 SRIK-MLVQKGIEIDEAILAAMQGVGRSTVVNVVIFSMGFVALLFS------AYKPVIDL 820
K ++ ++G + EA L A++ R ++ + F +G + L S A +
Sbjct: 946 EFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNA---V 1002

Query: 821 GVLVILALSSSGFMTILLV 839
G+ V+ + S+ + I V
Sbjct: 1003 GIGVMGGMVSATLLAIFFV 1021



Score = 49.5 bits (118), Expect = 6e-08
Identities = 29/153 (18%), Positives = 60/153 (39%), Gaps = 6/153 (3%)

Query: 288 LAMLFPIVFLLVTSLLVFFFRKPLGVVLPLFNILFCTIWTLGLMALLRVPMDLLTSVLPV 347
L I F++V L + V + + + L L D+ V +
Sbjct: 872 APALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLL 931

Query: 348 FLFTICCADAIHVMAEYYEQL--NSGKSFREANRETQRLMVTPVVLTTVTTIATFL---I 402
+ +AI ++ E+ + L GK EA R+ + P+++T++ I L I
Sbjct: 932 TTIGLSAKNAI-LIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAI 990

Query: 403 STTNNIVSIRNFGVFMSIGLTAALIISLLLIPA 435
S + G+ + G+ +A ++++ +P
Sbjct: 991 SNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPV 1023


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
PP_2790   HTHFIS  335    e-110    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 335 bits (861), Expect = e-110
Identities = 134/364 (36%), Positives = 193/364 (53%), Gaps = 24/364 (6%)

Query: 304 ITVVQRADQRIRSTRRPGAFTARYRLDQLNGNSKANREMLQLAKRFATSHSTILITGESG 363
I ++ RA + L G S A +E+ ++ R + T++ITGESG
Sbjct: 112 IGIIGRALAEPKRRPSKLE-DDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESG 170

Query: 364 TGKELLAQGIHNESPRRQGPFVAINCAAFPESLLESELFGYEEGAFSGSRKGGKPGLFEA 423
TGKEL+A+ +H+ RR GPFVAIN AA P L+ESELFG+E+GAF+G+ + G FE
Sbjct: 171 TGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGA-QTRSTGRFEQ 229

Query: 424 AHRGTLFLDEIGDMPVSLQTRLLRVLQEREVLRLGSTEPIAIDVRIIAATHKDLRSAMDD 483
A GTLFLDEIGDMP+ QTRLLRVLQ+ E +G PI DVRI+AAT+KDL+ +++
Sbjct: 230 AEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQ 289

Query: 484 GDFRTDLYFRLNILRLQTTPLRERPEDIALICRGISQRLLVQGQPPGAADIPAALLPYLE 543
G FR DLY+RLN++ L+ PLR+R EDI + R Q+ +G L ++
Sbjct: 290 GLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDV--KRFDQEALELMK 347

Query: 544 RYAWPGNVRELENVIERAMLSARELLEEHRVNEQYLARVLPELCEGPPPSPARKKS---- 599
+ WPGNVRELEN++ R + + + E L +P+ + + S
Sbjct: 348 AHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQA 407

Query: 600 ----------------SGETDLHTIGKVAQLRHVKETLESCRGNLDEAARRLGISRTTLW 643
+ + + L + RGN +AA LG++R TL
Sbjct: 408 VEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLR 467

Query: 644 RRLR 647
+++R
Sbjct: 468 KKIR 471
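
The per-hit statistics lines in this report (Score, Expect, Identities) follow a fixed pattern, so they can be extracted from a saved copy of the page with a couple of regular expressions. The following is a minimal Python sketch, assuming the report text is available as a list of lines. The names `parse_evalue`, `parse_hsps`, `SAMPLE` and `EVALUE_CUTOFF` are illustrative and not part of PredictBias; the 0.05 cutoff simply mirrors the E-value (RPSBlast) input parameter shown on this page.

```python
import re

# Per-hit statistics lines as they appear in the report.
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
IDENT_RE = re.compile(r"Identities\s*=\s*(\d+)/(\d+)")


def parse_evalue(text):
    """Convert an Expect string to float; values like 1e-110 are printed as 'e-110'."""
    return float("1" + text if text.startswith("e") else text)


def parse_hsps(lines):
    """Collect one dict per 'Score = ...' line and attach the Identities counts."""
    hsps = []
    for line in lines:
        m = SCORE_RE.search(line)
        if m:
            hsps.append({
                "bits": float(m.group(1)),
                "raw_score": int(m.group(2)),
                "evalue": parse_evalue(m.group(3)),
            })
            continue
        m = IDENT_RE.search(line)
        if m and hsps:
            hsps[-1]["identical"] = int(m.group(1))
            hsps[-1]["aligned"] = int(m.group(2))
    return hsps


# Two statistics blocks copied from this report (the PP_2790 and PP_2815 hits).
SAMPLE = [
    "Score = 335 bits (861), Expect = e-110",
    "Identities = 134/364 (36%), Positives = 193/364 (53%), Gaps = 24/364 (6%)",
    "Score = 30.2 bits (68), Expect = 0.015",
    "Identities = 20/113 (17%), Positives = 36/113 (31%), Gaps = 9/113 (7%)",
]

if __name__ == "__main__":
    EVALUE_CUTOFF = 0.05  # assumed cutoff, mirroring the page's input parameter
    for hsp in parse_hsps(SAMPLE):
        if hsp["evalue"] <= EVALUE_CUTOFF:
            print(hsp)
```

Keeping the identical and aligned counts as integers, rather than the rounded percentage printed in the report, lets the identity fraction be recomputed at whatever precision is needed.
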


88  PP_2812  PP_2830  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
PP_2812   -2      15       1.652030   transporter
PP_2813   -1      14       2.675063   BNR repeat-containing protein
PP_2814   -1      13       3.216524   hypothetical protein
PP_2815   0       15       3.720351   hypothetical protein
PP_2816   -1      15       3.663971   NfxB family transcriptional regulator
PP_2817   -1      12       3.413513   multidrug efflux RND membrane fusion protein
PP_2818   -1      11       2.514352   multidrug efflux RND transporter MexD
PP_2819   0       13       1.854834   outer membrane protein OprJ
PP_2820   0       18       0.370163   NfxB family transcriptional regulator
PP_2821   0       16       0.521375   hypothetical protein
PP_2822   -1      13       0.696567   hypothetical protein
PP_2823   0       13       1.051704   methyl-accepting chemotaxis transducer
PP_2824   1       19       0.972855   TetR family transcriptional regulator
PP_2825   0       18       1.302947   integrase
PP_2826   1       13       1.566202   transcriptional regulator MexT
PP_2827   1       13       1.414355   zinc-containing alcohol dehydrogenase
PP_2828   1       14       1.407737   hypothetical protein
PP_2829   0       10       1.684898   GntR family transcriptional regulator
PP_2830   -1      10       2.157290   major facilitator family transporter
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_2812   ACRIFLAVINRP  75     6e-16    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 75.3 bits (185), Expect = 6e-16
Identities = 34/209 (16%), Positives = 80/209 (38%), Gaps = 9/209 (4%)

Query: 590 TTINRVVDAAKAFRSDYPQPGLSIRLASGNAGVLAAINEEVEKSETPMLLYVYAAIALLV 649
T + + +PQ G+ + + EV K+ L + L++
Sbjct: 301 DTAKAIKAKLAELQPFFPQ-GMKVLYPYDTTPFVQLSIHEVVKT----LFEAIMLVFLVM 355

Query: 650 FAVYRDLRAVLVCCLPLTIGTFIGYWFMKELQIGLTIATLPVMVLAVGIGVDYAFYIYNR 709
+ +++RA L+ + + + + + + T+ MVLA+G+ VD A +
Sbjct: 356 YLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVEN 415

Query: 710 LQLHLAHGQSITK-AVEHALLEVGVATIFTAITLAVGVATWAF---SELKFQADMGKLLA 765
++ + + K A E ++ ++ A + A+ L+ AF S +
Sbjct: 416 VERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIV 475

Query: 766 FMFIVNMVMAMTVLPAFAVWLERVFPRKR 794
+++++A+ + PA L + +
Sbjct: 476 SAMALSVLVALILTPALCATLLKPVSAEH 504



Score = 41.7 bits (98), Expect = 1e-05
Identities = 38/243 (15%), Positives = 84/243 (34%), Gaps = 18/243 (7%)

Query: 208 KEIRQQFEDGEFEVQIIGFAKQIGDIADGASAVLEFCLLALLLTAGAVYWYCHSLRFTLL 267
E++ F G ++++ + V++ A++L +Y + ++R TL+
Sbjct: 311 AELQPFFPQG---MKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLI 367

Query: 268 ALVCSLASLVWQFGSLRLLGYGLDPLAVLVPFLVFAIGVSHGVQQINFIVREIAIGKS-- 325
+ L+ F L GY ++ L + L + V + + + R + K
Sbjct: 368 PTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPP 427

Query: 326 --AEEAARSSFTGLLIPGTLALVTALVSFVTLLLIPIPMVREVAITASLGVAYKIITNLL 383
A E + S G L+ + L + + R+ +IT +A ++ L+
Sbjct: 428 KEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALI 487

Query: 384 MLPLLASMLRVDDRYAAAQEVSRQRRTRWLRGLARLAEW------RNAQWVLGVALVVFL 437
+ P L + L + E + + + + +LG L
Sbjct: 488 LTPALCATL----LKPVSAEHHENKG-GFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLL 542

Query: 438 VAI 440
+
Sbjct: 543 IYA 545


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
PP_2815   HTHFIS  30     0.015    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.2 bits (68), Expect = 0.015
Identities = 20/113 (17%), Positives = 36/113 (31%), Gaps = 9/113 (7%)

Query: 276 RAAIGSAGAGMNGFRRSHLEALTTQRLMGRLAGSPAVATIDQVRMVSLMTQDARAARQFV 335
R + R +E + + A A + + + ++ R
Sbjct: 364 RLTALYPQDVI---TREIIE-NELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASF 419

Query: 336 LSTLGRLATEPTVL-----QHSLHAFLANGCNITQTAEVLGTHRNTLLRRLER 383
L VL L A A N + A++LG +RNTL +++
Sbjct: 420 GDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRE 472


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_2816   HTHTETR  39     5e-06    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 38.8 bits (90), Expect = 5e-06
Identities = 20/145 (13%), Positives = 45/145 (31%), Gaps = 16/145 (11%)

Query: 29 ASMGELAVLAGISRATLHRYCGTRDNL-DSQLEQHAKDTLMHILDNSGTLACREPLAALR 87
S+GE+A AG++R ++ + + +L E + L+ +PL+ LR
Sbjct: 32 TSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQAKFP-GDPLSVLR 90

Query: 88 QLIHAHLAQGELIA--FLASRYPVHMSVQHDLRFYLERLD------------ALFASGQR 133
+++ L L H +++
Sbjct: 91 EILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCLESYDRIEQTLKHCIE 150

Query: 134 QGVFRADVTAALLTEMFVSLLHGMV 158
+ AD+ + + G++
Sbjct: 151 AKMLPADLMTRRAAIIMRGYISGLM 175


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
PP_2817   RTXTOXIND  43     1e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 43.3 bits (102), Expect = 1e-06
Identities = 31/161 (19%), Positives = 57/161 (35%), Gaps = 14/161 (8%)

Query: 37 PVEIVTLAPEQVALAAELPGRVEPMRVAEVRARVPGIVLHKRFEEGADVKAGDVLFQIDP 96
VEIV A ++ + R E++ IV +EG V+ GDVL ++
Sbjct: 79 QVEIVATANGKLTHSG---------RSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTA 129

Query: 97 APFKAALARAEADLARAQAVQQEAQARVKRYE--PLVKIEAVSQQDFDSATAELRSAGAA 154
+A + ++ L +A+ Q Q + E L +++ + F + + E +
Sbjct: 130 LGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTS 189

Query: 155 VRSAQADV---QAARLNLGYATVTAPISGRIGRALATEGAL 192
+ Q Q + L A + R E
Sbjct: 190 LIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLS 230



Score = 42.5 bits (100), Expect = 2e-06
Identities = 19/103 (18%), Positives = 39/103 (37%), Gaps = 4/103 (3%)

Query: 100 KAALARAEADLARAQAVQQEAQARVKRYEPLVKIEAVSQQDFDSATAELRSAGAAVRSAQ 159
+ A +L ++ Q Q + + + V+Q + +LR +
Sbjct: 258 ENKYVEAVNELRVYKS--QLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLT 315

Query: 160 ADVQAARLNLGYATVTAPISGRI-GRALATEGALVGQGEATLM 201
++ + + AP+S ++ + TEG +V E TLM
Sbjct: 316 LELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE-TLM 357


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_2818   ACRIFLAVINRP  1138   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1138 bits (2946), Expect = 0.0
Identities = 512/1034 (49%), Positives = 701/1034 (67%), Gaps = 7/1034 (0%)

Query: 1 MSRFFIHRPNFAWVVALFISLAGLLVIPSLPVAQYPNVAPPQISITASYPGASAKVMVES 60
M+ FFI RP FAWV+A+ + +AG L I LPVAQYP +APP +S++A+YPGA A+ + ++
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTSIIEQSLNGAKGLLYYESTNNSNGVAEVMVTFEPGTDPDMAQVDVQNRLKQAEARMPQ 120
VT +IEQ++NG L+Y ST++S G + +TF+ GTDPD+AQV VQN+L+ A +PQ
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 121 AVLTQGLKVEQASSGFLLIYALTSTAGNRGDTTALADYAARNINNELLRVPGVGKLQFFA 180
V QG+ VE++SS +L++ S ++DY A N+ + L R+ GVG +Q F
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGT-TQDDISDYVASNVKDTLSRLNGVGDVQLFG 179

Query: 181 SEAAMRVWVDPQKLVGYGLSIDDINSAIRGQNVQVPAGSFGSTPGASEQELTATLAVQGT 240
++ AMR+W+D L Y L+ D+ + ++ QN Q+ AG G TP Q+L A++ Q
Sbjct: 180 AQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 241 LDTPEAFAGIVLRANPDGSSVRLGDVARMAIGSENYNLSARLNGHPAVAGAVQLAPGANA 300
PE F + LR N DGS VRL DVAR+ +G ENYN+ AR+NG PA ++LA GANA
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 301 IQTATLVKERLAELSQFFPEGVEYSVPYDTSRFVDVAIEKVIHTLIEAMVLVFLVMFLFL 360
+ TA +K +LAEL FFP+G++ PYDT+ FV ++I +V+ TL EA++LVFLVM+LFL
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 361 QNVRYTLVPSIVVPVCLLGTLMIMKLLGFSVNMMTMFGMVLAIGILVDDAIVVVENVERL 420
QN+R TL+P+I VPV LLGT I+ G+S+N +TMFGMVLAIG+LVDDAIVVVENVER+
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 421 MAEEGLSPVEATIKAMGQVSGAIIGITLVLAAVFLPLAFMSGSVGVIYQQFSVSLAVSIL 480
M E+ L P EAT K+M Q+ GA++GI +VL+AVF+P+AF GS G IY+QFS+++ ++
Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479

Query: 481 FSGFLALTFTPALCATLLKPVPHGHHE-KAGFFGAFNRGFARVTERYSLLNSELVARAGR 539
S +AL TPALCATLLKPV HHE K GFFG FN F Y+ +++ GR
Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 540 WMLAYVGILVVLGYSYLRLPEAFVPAEDLGYSVVDVQLPPGASRVRTDHTAEALEKFLM- 598
++L Y I+ + +LRLP +F+P ED G + +QLP GA++ RT + + + +
Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 599 -SRDAVANSFIVSGFSFSGQGDNAALAFPTFKDWSQRDKA-QSAEAETAAINAQFAANGD 656
+ V + F V+GFSFSGQ NA +AF + K W +R+ SAEA + D
Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659

Query: 657 GAITAVMPPPIDGLGNSGGFALRLMDRGGLGREALLAARDQLLARANGNPVILYAMM-EG 715
G + P I LG + GF L+D+ GLG +AL AR+QLL A +P L ++ G
Sbjct: 660 GFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNG 719

Query: 716 LAEAPQLRLHIDREKARALGVSFEAINSTLATAFGSAVINDFTNAGRQQRVVVQAEQGER 775
L + Q +L +D+EKA+ALGVS IN T++TA G +NDF + GR +++ VQA+ R
Sbjct: 720 LEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFR 779

Query: 776 MTPESVLRLYAPNANGEQVPFSAFVTTQWEEGPVQLVRYNGYPSIRIAGDASPGHSTGQA 835
M PE V +LY +ANGE VPFSAF T+ W G +L RYNG PS+ I G+A+PG S+G A
Sbjct: 780 MLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDA 839

Query: 836 MAEMERLVSELPPGIGYAWTGLSYQEKVSSGQAASLFALAILVVFLLLVALYESWAIPLT 895
MA ME L S+LP GIGY WTG+SYQE++S QA +L A++ +VVFL L ALYESW+IP++
Sbjct: 840 MALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVS 899

Query: 896 VMLIVPIGALGAVWAVTLTGMPNDVYFKVGLITIIGLAAKNAILIVEFAKELWEK-GYSL 954
VML+VP+G +G + A TL NDVYF VGL+T IGL+AKNAILIVEFAK+L EK G +
Sbjct: 900 VMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGV 959

Query: 955 CDAAIEAARLRFRPIVMTSMAFILGVVPLAIASGAGAASQRAIGTGVIGGMLSATLLGVV 1014
+A + A R+R RPI+MTS+AFILGV+PLAI++GAG+ +Q A+G GV+GGM+SATLL +
Sbjct: 960 VEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIF 1019

Query: 1015 FVPVCFVWVLTLLK 1028
FVPV FV + K
Sbjct: 1020 FVPVFFVVIRRCFK 1033



Score = 84.9 bits (210), Expect = 9e-19
Identities = 88/528 (16%), Positives = 176/528 (33%), Gaps = 60/528 (11%)

Query: 540 WMLAYVGILVVLGYSYLRLPEAFVPAEDLGYSVVDVQLP-PGAS-RVRTDHTAEALEKFL 597
W+LA + +++ + L+LP A P + V V PGA + D + +E+ +
Sbjct: 13 WVLA-IILMMAGALAILQLPVAQYP--TIAPPAVSVSANYPGADAQTVQDTVTQVIEQNM 69

Query: 598 MSRDAVANSFIVSGFSFSGQGDNAALAFPTFKDWSQRDKAQS-AEAETAAINAQFAANGD 656
D + +S S S L TF+ + D AQ + +
Sbjct: 70 NGIDNLMY---MSSTSDSAGSVTITL---TFQSGTDPDIAQVQVQNKLQLATPLLPQ--- 120

Query: 657 GAITAVMPPPIDGLGNSGGFALRL--------MDRGGLGREALLAARDQLLARANGNPVI 708
V I +S + + + + +D L
Sbjct: 121 ----EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTL---------- 166

Query: 709 LYAMMEGLAEAP------QLRLHIDREKARALGVSFEAINSTLATA---FGSAVINDFTN 759
+ + G+ + +R+ +D + ++ + + L + +
Sbjct: 167 --SRLNGVGDVQLFGAQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPA 224

Query: 760 AGRQQRVVVQAEQGERMTPESVLRLY-APNANGEQVPFS--AFVTTQWEEGPVQLVRYNG 816
QQ Q PE ++ N++G V A V E + R NG
Sbjct: 225 LPGQQLNASIIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGEN-YNVIARING 283

Query: 817 YPSIRIAGDASPGHST----GQAMAEMERLVSELPPGIGYAWTGLSYQEKVSSGQAASLF 872
P+ + + G + A++ L P G+ + V +
Sbjct: 284 KPAAGLGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYP-YDTTPFVQLSIHEVVK 342

Query: 873 AL--AILVVFLLLVALYESWAIPLTVMLIVPIGALGAVWAVTLTGMPNDVYFKVGLITII 930
L AI++VFL++ ++ L + VP+ LG + G + G++ I
Sbjct: 343 TLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAI 402

Query: 931 GLAAKNAILIVE-FAKELWEKGYSLCDAAIEAARLRFRPIVMTSMAFILGVVPLAIASGA 989
GL +AI++VE + + E +A ++ +V +M +P+A G+
Sbjct: 403 GLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGS 462

Query: 990 GAASQRAIGTGVIGGMLSATLLGVVFVPVCFVWVLTLLKRKPSPVQQA 1037
A R ++ M + L+ ++ P +L + + +
Sbjct: 463 TGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGG 510


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_2820   HTHTETR  33     6e-04    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 32.7 bits (74), Expect = 6e-04
Identities = 19/162 (11%), Positives = 56/162 (34%), Gaps = 9/162 (5%)

Query: 4 PTDERLLKALADAIVVHP--RATLKELAEAAGVSKATLHRFCGTRDNLVNMLESYGDQVL 61
T + +L +L E+A+AAGV++ ++ + +L + + + +
Sbjct: 11 ETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNI 70

Query: 62 TQVIGNADLHGSDPTAALHRLIVEHL-------KHREMMIFLLFQYRPDSFDDSETNRPW 114
++ ++ R I+ H+ + R +++ ++F + + +
Sbjct: 71 GELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQ 130

Query: 115 RAYADALDAFFLRGQQAGAFRIDISAAIFTEMFLSMVYGIVD 156
R + + + A + T ++ G +
Sbjct: 131 RNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYIS 172


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
PP_2823   RTXTOXINA  33     0.003    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.
Length = 1024

Score = 33.0 bits (75), Expect = 0.003
Identities = 46/223 (20%), Positives = 87/223 (39%), Gaps = 37/223 (16%)

Query: 316 TMTNSIQEVAQRSEQASEQASAAARQANEA-RHNIDSLSLSIGD----LGNSVLSSVQAM 370
T+ ++ Q A + A + A ++A E R+ + L L I G+S+ V+
Sbjct: 12 TLQSAKQSAANKLHSAGQSTKDALKKAAEQTRNAGNRLILLIPKDYKGQGSSLNDLVRTA 71

Query: 371 EQLEAETQHVGSVLTVI--RSIAEQTNLLALNAAIEAARAGEQGRGFAVVADEVRNLAQK 428
++L E Q+ T I + L+ L RG + A ++ L QK
Sbjct: 72 DELGIEVQYDEKNGTAITKQVFGTAEKLIGLTE-----------RGVTIFAPQLDKLLQK 120

Query: 429 --------------TTASTAQIQDIIQRLQNSASSVLHAMNLN----GEKARSSIQRSEH 470
+ + I+ QN + L +M ++ +K+ ++ SE
Sbjct: 121 YQKAGNILGGGAENIGDNLGKAGGILSTFQNFLGTALSSMKIDELIKKQKSGGNVSSSEL 180

Query: 471 ATQTLEAITGAVRQIDELNAGIARFTNEQIGLSRSIQQDTERL 513
A ++E I V + LN + F+ +Q+ S+ +T+ L
Sbjct: 181 AKASIELINQLVDTVASLNNNVNSFS-QQLNTLGSVLSNTKHL 222


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_2824   HTHTETR  74     3e-18    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 73.9 bits (181), Expect = 3e-18
Identities = 27/212 (12%), Positives = 69/212 (32%), Gaps = 17/212 (8%)

Query: 24 LARRGRPVGDRDAKRSELLAAAIAVIAQEGYAGATMRKVAQHAGCTTGAVTYYFANKEEM 83
+AR+ + + +L A+ + +Q+G + ++ ++A+ AG T GA+ ++F +K ++
Sbjct: 1 MARKTKQEAQETRQH--ILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDL 58

Query: 84 VSAVAQTLFDKVDALLDIDLDQV---DIKSLIEQWHQWISLD-EPGNWLAWLQLLTH--- 136
S + + + L + + L E + ++++ H
Sbjct: 59 FSEIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCE 118

Query: 137 -ARHQPAFASVIKQRYTNFREVATSVLEAGQRQGQIREDVPADVLADHIAAFSDG----W 191
+ + L+ + D+ A + + G W
Sbjct: 119 FVGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENW 178

Query: 192 LMMLPFESEPVSAERGKALIDAFIVMISPPKT 223
L + + + + M T
Sbjct: 179 LF---APQSFDLKKEARDYVAILLEMYLLCPT 207


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_2826   PF05043  29     0.031    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 28.8 bits (64), Expect = 0.031
Identities = 17/65 (26%), Positives = 35/65 (53%), Gaps = 6/65 (9%)

Query: 1 MNRNDLRRVDLNLLIVFETLMHERSVTRA--AEKLFLGQPAISAALSRLRNLFDDPLFVR 58
+++ R+++L L ++FE H+R R+ AE L + A+ LS +++ F D +F
Sbjct: 5 LSKKSHRQLEL-LELLFE---HKRWFHRSELAELLNCTERAVKDDLSHVKSAFPDLIFHS 60

Query: 59 TGRSM 63
+ +
Sbjct: 61 STNGI 65


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_2830   TCRTETB  41     1e-05    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 40.6 bits (95), Expect = 1e-05
Identities = 31/197 (15%), Positives = 72/197 (36%), Gaps = 18/197 (9%)

Query: 262 QMFRDRQIWLAIAVYFVHQITIYTVIFFLPGIIGTYAALSPFQVGLLTAVPWIAAAIGAA 321
+ ++ + + + T+ + +P ++ LS ++G + P + I
Sbjct: 251 GLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFG 310

Query: 322 TLPRLATSPRRCRTLLFLGLLTMAAGLLLASLT---NSFIGLIGFSLTALMLFVVQSIIF 378
+ + R +L +G+ ++ L AS S+ I L +++I
Sbjct: 311 YIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVIS 370

Query: 379 VFPSSRLSGSALAAGLAFVTTCGLFGGFVGPSVMGL---------------IEQTTGSTR 423
SS L AG++ + G +++G ++Q+T
Sbjct: 371 TIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQRLLPMEVDQSTYLYS 430

Query: 424 NGLWIIAALLVCAALVS 440
N L + + ++V + LV+
Sbjct: 431 NLLLLFSGIIVISWLVT 447


89  PP_3200  PP_3206  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
PP_3200   0       13       3.307508   hypothetical protein
PP_3201   2       12       4.275392   BNR repeat-containing protein
PP_3202   0       13       3.234318   transporter
PP_3203   -1      15       2.967834   major facilitator family transporter
PP_3204   -1      13       2.919585   hypothetical protein
PP_3205   -1      15       3.020835   fumarylacetoacetate hydrolase
PP_3206   -1      13       2.523064   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
PP_3200   BORPETOXINA  30     0.023    Bordetella pertussis toxin A subunit signature.
		>BORPETOXINA#Bordetella pertussis toxin A subunit signature.

Length = 269

Score = 29.8 bits (66), Expect = 0.023
Identities = 15/38 (39%), Positives = 20/38 (52%)

Query: 247 VGAYYQFEWEANRLPAAGSYFSTDDFFGDGAERMFVGA 284
+G Y+ + N AA SYF D +GD A R+ GA
Sbjct: 119 IGYIYEVRADNNFYGAASSYFEYVDTYGDNAGRILAGA 156


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_3202   ACRIFLAVINRP  60     3e-11    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 60.2 bits (146), Expect = 3e-11
Identities = 32/175 (18%), Positives = 72/175 (41%), Gaps = 9/175 (5%)

Query: 618 IEAATNQVVKQANRDMLWWVYGAVIVLCLVTFRSWRAVLCAVLPLVLTSILCEALMVALG 677
++ + ++VVK L+ V ++ + ++ RA L + + + + A++ A G
Sbjct: 333 VQLSIHEVVKT-----LFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFG 387

Query: 678 IGVKVATLPVIALGVGIGVDYALYVM-SIVLAQLRQGASLSQAYYRALLFTGKVVMLTGI 736
+ T+ + L +G+ VD A+ V+ ++ + +A +++ ++ +
Sbjct: 388 YSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAM 447

Query: 737 TLAIGVGTWIF---SPIKFQADMGVLLAFMFVWNMVGALILLPALAYFLLPHRKA 788
L+ F S + + +++ ALIL PAL LL A
Sbjct: 448 VLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSA 502



Score = 35.6 bits (82), Expect = 0.001
Identities = 25/154 (16%), Positives = 65/154 (42%), Gaps = 12/154 (7%)

Query: 247 AIAITAAVLYWYTRCVRSTALVVVCSLVAVIWQLGLLPLLGYALDPYSVLVPFLVFAIGM 306
AI + V+Y + + +R+T + + V ++ +L GY+++ ++ +V AIG+
Sbjct: 347 AIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFG--MVLAIGL 404

Query: 307 SHGAQKMNGIMQDIGRGMH-----RLVAARFTFRRLFLAGLTALLCDAVGFAVLMIIQIQ 361
+ +++++ R M A + ++ A + + + F +
Sbjct: 405 LVDDAIV--VVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGS 462

Query: 362 ---VIQDLAVIASLGVAVLIFTNLILLPVLLSYV 392
+ + ++ +A+ + LIL P L + +
Sbjct: 463 TGAIYRQFSITIVSAMALSVLVALILTPALCATL 496


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_3203   TCRTETA  73     3e-16    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 72.9 bits (179), Expect = 3e-16
Identities = 92/398 (23%), Positives = 150/398 (37%), Gaps = 19/398 (4%)

Query: 11 QAALLLFGSCLPVLGAVLIAPVLPRMHAYFAETPGVAVLVPVALTLPALVIALLAPLAGV 70
++L L +G LI PVLP + + V + L L AL+ AP+ G
Sbjct: 6 PLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGA 65

Query: 71 LADRVGRRALLLASMLLYSLCGLLPLWLDSLVLIVASRAGIGLAEAGIMTCCTTLMGDYF 130
L+DR GRR +LL S+ ++ + L ++ R G+ A + D
Sbjct: 66 LSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGA-TGAVAGAYIADIT 124

Query: 131 DGQRRARLFALQMVVTSLAAALFMGVGGALGESDWRTPFTLYAVGALCLPLMALLLW--- 187
DG RAR F +GG +G PF A L L
Sbjct: 125 DGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPES 184

Query: 188 ---EPRACQAASEASVDSRFPWAALTPLYLLTVLA-GVSLFIVPVQAGYVL---QLLHID 240
E R + + + S +T + L + + L A +V+ H D
Sbjct: 185 HKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWD 244

Query: 241 APRQVGLAM-GANQLGVLAGALAF-RLLARLPAGRLLAMGFATAGLGGGLMALASSHAPV 298
+G+++ L LA A+ + ARL R L +G G G L+A A + +
Sbjct: 245 -ATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFA-TRGWM 302

Query: 299 VLAVLINGLGVGLLLPTLITLVMQQVGFDQRGRATGGFTSAIFAGEFVSPLIVLALTAGV 358
+++ G+ +P L ++ +QV +++G+ G + V PL+ A+
Sbjct: 303 AFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAI---Y 359

Query: 359 ATHLPHALLLVAIGQLLLAPVCLSLMRHRRAVTVAGAR 396
A + I L +CL +R R + AG R
Sbjct: 360 AASITTWNGWAWIAGAALYLLCLPALR-RGLWSGAGQR 396


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_3206   NUCEPIMERASE  75     2e-17    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 75.2 bits (185), Expect = 2e-17
Identities = 47/167 (28%), Positives = 71/167 (42%), Gaps = 22/167 (13%)

Query: 1 MRVMVTGANGFVGRQLVQRLLDLGELRGRRIEALLVLDQALDGLPEDARLRR-------- 52
M+ +VTGA GF+G + +RLL+ G ++ + L+ D + ARL
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLE----AGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQF 56

Query: 53 HHGSVTDAALLRRVLADG-VDVVFHLVSVPGGAAETQYERGY-QVNLQASLELLDQLRNP 110
H + D + + A G + VF + Y NL L +L+ R+
Sbjct: 57 HKIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRH- 115

Query: 111 LCPPVLVYASSVAVYGG--KLPTRMDEG--QPASPQLSYAAHKRMVE 153
L+YASS +VYG K+P D+ P S YAA K+ E
Sbjct: 116 NKIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSL---YAATKKANE 159


90  PP_3299  PP_3304  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
PP_3299   -1      30       -2.926938  outer membrane lipoprotein
PP_3300   0       30       -3.896780  TetR family transcriptional regulator
PP_3301   -1      30       -3.486554  RND efflux membrane fusion protein
PP_3302   1       28       -3.825805  RND efflux transporter
PP_3303   1       33       -3.822556  3-oxoacyl-ACP synthase
PP_3304   0       30       -4.229941  Bcr/CflA family multidrug resistance
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
PP_3299   RTXTOXIND  29     0.037    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 29.4 bits (66), Expect = 0.037
Identities = 30/194 (15%), Positives = 56/194 (28%), Gaps = 38/194 (19%)

Query: 65 GDPL--LSRLVTEALGQNLQLAQAQARVAQARAALGSATAALVPSAGINGQAARSRQSVE 122
GD L L+ L EA Q + QAR+ Q R Q +
Sbjct: 121 GDVLLKLTALGAEADTLKTQSSLLQARLEQTRY-----------------QILSRSIELN 163

Query: 123 TPLGQLLNSTPDYDRYGNSYELNLQASWEIDLFGGLRRDRQAAVGEYQASEAGAIATRLA 182
L P + L R ++ + L
Sbjct: 164 KLPELKLPDEPYFQNVSEEEVL---------------RLTSLIKEQFSTWQNQKYQKELN 208

Query: 183 VAAQTADIYTTVRGLQARLAIAQNQVKTQQDLLAKVNLLNRKGLAPDYEVRQTEGELSQV 242
+ + A+ + AR+ +N + ++ L + L K + V + E + +
Sbjct: 209 LDKKRAERL----TVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEA 264

Query: 243 EATVPVLRAGLDAA 256
+ V ++ L+
Sbjct: 265 VNELRVYKSQLEQI 278


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_3300   HTHTETR  61     8e-14    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 61.2 bits (148), Expect = 8e-14
Identities = 27/197 (13%), Positives = 62/197 (31%), Gaps = 12/197 (6%)

Query: 17 DVRDQIIQAAMEHFAHYGYDKTTVSDLAKSIGFSKAYIYKFFESKQAIGEVICSSRLALI 76
+ R I+ A+ F+ G T++ ++AK+ G ++ IY F+ K + I + I
Sbjct: 11 ETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNI 70

Query: 77 MQRIATATSDAPTASEKLRRLFRAIAEGGADLFFHDRKLYDIAAVASRDQ-----WSSVK 131
+ + P + R I + + + + + + V+
Sbjct: 71 GELELEYQAKFP---GDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQ 127

Query: 132 SHEANIS----KVILEILTQGRDAGEFERKTPLDELTLAIFLIMRPYVNAALLQHNLDTL 187
+ N+ I + L +A + + + + L L
Sbjct: 128 QAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDL 187

Query: 188 QDAVVQLPALILRSLAP 204
+ A++L
Sbjct: 188 KKEARDYVAILLEMYLL 204


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
PP_3301   RTXTOXIND  45     3e-07    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 44.8 bits (106), Expect = 3e-07
Identities = 15/104 (14%), Positives = 36/104 (34%), Gaps = 7/104 (6%)

Query: 67 VSGKILQRLVDTGQTVKRGQPLMRMDPVDLN-----LQARAQQEAVTAARARAKQTG--D 119
+ + + +V G++V++G L+++ + Q+ Q + R +
Sbjct: 103 ENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIEL 162

Query: 120 DEARYRGLVADGAVSASSYDQIKAAADAAKAQLSAAQAQADVAR 163
++ L + S +++ K Q S Q Q
Sbjct: 163 NKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKE 206


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_3302   ACRIFLAVINRP  449    e-143    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 449 bits (1157), Expect = e-143
Identities = 233/1045 (22%), Positives = 432/1045 (41%), Gaps = 59/1045 (5%)

Query: 8 LSALAVRERSITLFLIVLIAFAGTLAFFKLGRAEDPPFTVKQMTIITAWPGATAQEMQDL 67
++ +R L +++ AG LA +L A+ P +++ +PGA AQ +QD
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 68 VAEPLEKRMQELRWYDRTETYT-RPGLAFTMVSLQDKTPPSAVQEEFYQARKKAGDQAKL 126
V + +E+ M + + + G ++ Q T P Q + + A L
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLA---TPL 117

Query: 127 MPAGVIGPML-NDEFSDVTFAVYALKA-KGEPQRQLVRD--AETLRQQLLHVPGVKKVNI 182
+P V + ++ S V + + + D A ++ L + GV V +
Sbjct: 118 LPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQL 177

Query: 183 IGEQ-AERIFVSFSHDRLATLGITPQDIFSALDNQNALSPSGSVET------QGPQVVVR 235
G Q A RI++ D L +TP D+ + L QN +G + Q +
Sbjct: 178 FGAQYAMRIWLDA--DLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASII 235

Query: 236 VDGAFDQLAKIRETPVVAQ--GRPLKLSDVADVERGYEDPATFLVRNDGEPALLLGIVMR 293
F + + + G ++L DVA VE G E+ R +G+PA LGI +
Sbjct: 236 AQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVI-ARINGKPAAGLGIKLA 294

Query: 294 EGWNGLDLGKALEAETAKINEGMPLGMTLSKVTDQAVNITSSVDEFMIKFFVALLVVMLV 353
G N LD KA++A+ A++ P GM + D + S+ E + F A+++V LV
Sbjct: 295 TGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLV 354

Query: 354 CFLSMG-WRVGVVVAAAVPLTLAIVFVVMAATGKNFDRITLGSLILALGLLVDDAIIAIE 412
+L + R ++ AVP+ L F ++AA G + + +T+ ++LA+GLLVDDAI+ +E
Sbjct: 355 MYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVE 414

Query: 413 MMV-VKMEEGYDRIKASAYAWSHTAAPMLSGTLVTAIGFMPNGFAQSTAGEYTSNMFWIV 471
+ V ME+ +A+ + S ++ +V + F+P F + G +
Sbjct: 415 NVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITI 474

Query: 472 GIALIASWVVAVAFTPYLGVKLL----PRIKTIEGGHAAIYNTRHY---NRFRALLGWVI 524
A+ S +VA+ TP L LL +GG +NT N + +G ++
Sbjct: 475 VSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKIL 534

Query: 525 AHKWLVAGTVVSTFVAAVLGMGLVKKQFFPTSDRPEVLVELQMPYGTSIEQTNATAIKVE 584
V+ + F P D+ L +Q+P G + E+T +V
Sbjct: 535 GSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVT 594

Query: 585 SWLRQQEEAKIVTTYIGQGPPRFFLAMAPELPDPSFAKIVV--LTENQGARE---ALKHR 639
+ + E+A + + + G + + + + A + + E G A+ HR
Sbjct: 595 DYYLKNEKANVESVFTVNG-----FSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHR 649

Query: 640 LREAASE-----GLAPGAQVRVTQLVFGPYSPYPVAYRVMGPDASQ--LRQIAARVQSVL 692
+ + + V + + +G DA Q+
Sbjct: 650 AKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAA--- 706

Query: 693 QASPMMKTVNTDWGPLVPTLHFSLNQDRLQSVGLTSASVSQQLQFLLTGVPITSVREDIR 752
Q + +V + ++Q++ Q++G++ + ++Q + L G + + R
Sbjct: 707 QHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGR 766

Query: 753 SVQVVGRAAGQIRLDPAQIENFTLVGSNGQRVPVSQIGDVSIRMEDPILRRRDRTPTMTV 812
++ +A + R+ P ++ + +NG+ VP S P L R + P+M +
Sbjct: 767 VKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEI 826

Query: 813 RGDIAEGLQPPDVSTAIWKDLQPIVTQLPAGYKIEMAGSIEESAKASQAIVPLLPIMIAL 872
+G+ A G D + + + ++LPAG + G + + L+ I +
Sbjct: 827 QGEAAPGTSSGDAMALM----ENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVV 882

Query: 873 TLLIIILQVRSISAMVMVFLTSPLGLIGVVPVLLLFGQPFGINALVGLIALSGILMRNTL 932
L + S S V V L PLG++GV+ LF Q + +VGL+ G+ +N +
Sbjct: 883 VFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAI 942

Query: 933 ILIGQIDHNQL-EGLAPFDAVVEATVQRARPVLLTALAAILAFIPLTHSVFWGT-----L 986
+++ EG +A + A R RP+L+T+LA IL +PL S G+ +
Sbjct: 943 LIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAV 1002

Query: 987 AYTLIGGTFVGTIMTLVFLPAMYSI 1011
++GG T++ + F+P + +
Sbjct: 1003 GIGVMGGMVSATLLAIFFVPVFFVV 1027



Score = 86.8 bits (215), Expect = 2e-19
Identities = 61/320 (19%), Positives = 130/320 (40%), Gaps = 14/320 (4%)

Query: 712 LHFSLNQDRLQSVGLTSASVSQQLQF----LLTGVPITSVREDIRSVQVVGRAAGQIRLD 767
+ L+ D L LT V QL+ + G + + + A + + +
Sbjct: 184 MRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK-N 242

Query: 768 PAQIENFTL-VGSNGQRVPVSQIGDVSIRMED-PILRRRDRTPTMTVRGDIAEGLQPPDV 825
P + TL V S+G V + + V + E+ ++ R + P + +A G D
Sbjct: 243 PEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALDT 302

Query: 826 STAIWKDLQPIVTQLPAGYKIEMAGSIEESAKAS-QAIVPLLPIMIALTLLIIILQVRSI 884
+ AI L + P G K+ + S +V L I L L++ L ++++
Sbjct: 303 AKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNM 362

Query: 885 SAMVMVFLTSPLGLIGVVPVLLLFGQPFGINALVGLIALSGILMRNTLILIGQIDHNQLE 944
A ++ + P+ L+G +L FG + G++ G+L+ + ++++ ++ +E
Sbjct: 363 RATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMME 422

Query: 945 -GLAPFDAVVEATVQRARPVLLTALAAILAFIPL-----THSVFWGTLAYTLIGGTFVGT 998
L P +A ++ Q ++ A+ FIP+ + + + T++ +
Sbjct: 423 DKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSV 482

Query: 999 IMTLVFLPAMYSIWFKIRPN 1018
++ L+ PA+ + K
Sbjct: 483 LVALILTPALCATLLKPVSA 502


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_3304   TCRTETB  74     2e-16    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 73.8 bits (181), Expect = 2e-16
Identities = 74/399 (18%), Positives = 148/399 (37%), Gaps = 53/399 (13%)

Query: 18 RANVLTAKVILLLAALAAISNLSTNIILPAFPEMARQLNVSSQELGLTLSSFFITFAFAQ 77
++N+ ++++ L L+ S L+ ++ + P++A N ++F +TF+
Sbjct: 7 QSNLRHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGT 66

Query: 78 LLVGPLADRYGRKRLVVGGLMIFVVGTFWAA-NAATLDMLILGRVIQAIGVCAAAVLARA 136
+ G L+D+ G KRL++ G++I G+ + +LI+ R IQ G A L
Sbjct: 67 AVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMV 126

Query: 137 IARDLYEGENLARALSLTMIAAATAPGFSPLIGSMLNTTLGWRALFIAVGMSAILIALFY 196
+ EN +A L A G P IG M+ + W L + +I + +
Sbjct: 127 VVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLI--PMITIITVPF 184

Query: 197 LRGIGETLPAHRRVTQSVPAILIAYG---------------------------------- 222
L + + + IL++ G
Sbjct: 185 LMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVT 244

Query: 223 ------KLASNRLFILPALATSLLMSGLFASFAAAPSILMEGMGLSSLQVG--LYFAATV 274
L N F++ L ++ + + P ++ + LS+ ++G + F T+
Sbjct: 245 DPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTM 304

Query: 275 FVVFAAGLAAPRLAHRWGSRAITLSGLATACTAGALLLVGPSNPSLGWYSLSMVLFLWG- 333
V+ G L R G + + + + L + W+ +++F+ G
Sbjct: 305 SVII-FGYIGGILVDRRGPLYVLN--IGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGG 361

Query: 334 -MGIANPLGTALTMTPFGKEAGLASALL---GFLTMAIG 368
+ T ++ + +EAG +LL FL+ G
Sbjct: 362 LSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTG 400


91  PP_3406  PP_3413  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
PP_3406   -2      16       1.145385   acetyltransferase
PP_3407   0       14       0.453582   hypothetical protein
PP_3409   -1      10       0.288005   cobalamin biosynthesis protein cobE
PP_3410   -1      8        0.403854   precorrin-4 C(11)-methyltransferase
PP_3411   0       8        0.413343   hypothetical protein
PP_3412   0       6        0.245136   LuxR family transcriptional regulator
PP_3413   -1      7        0.497559   Hpt sensor hybrid histidine kinase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_3406   SACTRNSFRASE  42     1e-07    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 42.2 bits (99), Expect = 1e-07
Identities = 20/64 (31%), Positives = 25/64 (39%), Gaps = 2/64 (3%)

Query: 73 STWLGRNGIYLEDLYVTPEHRGDGAGRQLLQHIAREAVANNCGRLEWSVLDWNEPAIGFY 132
S W G +ED+ V ++R G G LL A N+ L D N A FY
Sbjct: 84 SNWNGY--ALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFY 141

Query: 133 QKLG 136
K
Sbjct: 142 AKHH 145


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
PP_3410   LCRVANTIGEN  30     0.011    Low calcium response V antigen signature.
		>LCRVANTIGEN#Low calcium response V antigen signature.

Length = 326

Score = 29.7 bits (66), Expect = 0.011
Identities = 13/46 (28%), Positives = 20/46 (43%), Gaps = 5/46 (10%)

Query: 53 SAELHLEQIIAAMRSAHEKGQDVARVHSG-----DPSLYGAIGEQI 93
+AEL + +I A + H +H D +LYG E+I
Sbjct: 161 TAELKIYSVIQAEINKHLSSSGTINIHDKSINLMDKNLYGYTDEEI 206


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
PP_3412   HTHFIS  66     5e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 66.0 bits (161), Expect = 5e-15
Identities = 21/102 (20%), Positives = 46/102 (45%), Gaps = 1/102 (0%)

Query: 2 TTVLIVDDHPIVRLSLRLLLERERFHVIGEVGNGSEVAQVARELRPDVVILDIGLPGLDG 61
T+L+ DD +R L L R + V N + + + D+V+ D+ +P +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 62 MEVIKRLQSLEPVPKFMVLTGQATDLYVRRCLDAGIGAFVTK 103
+++ R++ P +V++ Q T + + + G ++ K
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPK 104


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
PP_3413   HTHFIS  59     4e-11    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 59.5 bits (144), Expect = 4e-11
Identities = 21/66 (31%), Positives = 29/66 (43%)

Query: 848 ILVVDDYPANLLLLERQLQTLGHHVTLAENGEIALARWQEARFDLVITDCSMPVMDGHEL 907
ILV DD A +L + L G+ V + N DLV+TD MP + +L
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 908 TRRIRS 913
RI+
Sbjct: 66 LPRIKK 71


92  PP_3419  PP_3432  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
PP_3419   -1      9        0.459974   Fis family transcriptional regulator
PP_3420   -3      9        1.472073   sensor histidine kinase
PP_3421   -3      14       1.520384   sensor histidine kinase
PP_3422   -2      18       2.587264   lytic transglycosylase
PP_3423   -2      17       2.843874   general secretion pathway protein G
PP_3424   -3      15       2.492214   type II secretion system protein
PP_3425   -4      9        1.624562   RND family efflux transporter MFP subunit
PP_3426   -4      7        0.842216   hydrophobe/amphiphile efflux-1 (HAE1) family
PP_3427   -2      10       0.364915   NodT family RND efflux system outer membrane
PP_3428   -2      11       -1.079207  hypothetical protein
PP_3429   -1      14       -2.075790  histidine kinase
PP_3430   -2      13       -1.886531  PAS/PAC sensor hybrid histidine kinase
PP_3431   -2      12       -2.047512  ThiJ/PfpI domain-containing protein
PP_3432   -1      10       -0.949296  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
PP_3419   HTHFIS  461    e-162    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 461 bits (1189), Expect = e-162
Identities = 168/491 (34%), Positives = 256/491 (52%), Gaps = 35/491 (7%)

Query: 4 SILVVEDDEILADNIRTYLSLKGYEVIVCHSAELALEQIKRAQPDAVLTDNSLPGMSGHD 63
+ILV +DD + + LS GY+V + +A I D V+TD +P + D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 64 LLRTLVAQAPDLKVIMMTGYGNVEDAVQAMKEGAFHYLTKPVVLAELKLTLDKALATERM 123
LL + PDL V++M+ A++A ++GA+ YL KP L EL + +ALA +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 124 ERTLSFYQEREAQKSGLQALIGESPVMLTLKHTLRQVLDAERRMASDDLPPVLIEGETGT 183
+ E L+G S M + L +++ + ++I GE+GT
Sbjct: 125 RP-----SKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDL--------TLMITGESGT 171

Query: 184 GKELVARALHFDGSRSKGPFIEFNCASIPANLLEAELFGHEKGAFTDAKERRVGLVEAAD 243
GKELVARALH G R GPF+ N A+IP +L+E+ELFGHEKGAFT A+ R G E A+
Sbjct: 172 GKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAE 231

Query: 244 GGTLFLDEIGEMDLVLQAKLLKLLEDRSIRRIGAVKERKVDLRVISATNCNLEQMVQQGK 303
GGTLFLDEIG+M + Q +LL++L+ +G + D+R+++ATN +L+Q + QG
Sbjct: 232 GGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGL 291

Query: 304 FRRDLFFRLRIIALKVPRLYSRGQDILLLARHFLAHHSRRYGKPNLRFSAEAESLLLGYS 363
FR DL++RL ++ L++P L R +DI L RHF+ + + G RF EA L+ +
Sbjct: 292 FREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQ-QAEKEGLDVKRFDQEALELMKAHP 350

Query: 364 WPGNVRELRNMLEQTVLLAPNEVVQAHQLNLCM--TLVDEPLAQ---------------- 405
WPGNVREL N++ + L P +V+ + + + D P+ +
Sbjct: 351 WPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEE 410

Query: 406 ---QPMAAMFEMPRHEPEPGTSLPDMERDLVCKTLDRTDWNVTKSARMLGLSRDMLRYRI 462
Q A+ + L +ME L+ L T N K+A +LGL+R+ LR +I
Sbjct: 411 NMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKI 470

Query: 463 EKLGLTRPDKR 473
+LG++
Sbjct: 471 RELGVSVYRSS 481


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_3423   BCTERIALGSPG  190    3e-65    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 190 bits (485), Expect = 3e-65
Identities = 59/141 (41%), Positives = 88/141 (62%), Gaps = 7/141 (4%)

Query: 31 RRTNPQRGFTLLELLVVLVVLGLLAGIVAPKYFSQLGRSEAKVARAQIEGLSKALDLYRL 90
R T+ QRGFTLLE++VV+V++G+LA +V P +++ + A + I L ALD+Y+L
Sbjct: 2 RATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKL 61

Query: 91 EVGHYPNSEQGLQALVVAPS---GEARWTGPYLQKAVPQDPWGRPYIYRQPGENGGEYDL 147
+ HYP + QGL++LV AP+ A + K +P DPWG Y+ PGE+G YDL
Sbjct: 62 DNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGA-YDL 120

Query: 148 LSMGKDGQPGGDGENAEVTSW 168
LS G DG+ G + ++T+W
Sbjct: 121 LSAGPDGEMGTED---DITNW 138


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_3424   BCTERIALGSPF  258    9e-85    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 258 bits (660), Expect = 9e-85
Identities = 124/403 (30%), Positives = 208/403 (51%), Gaps = 10/403 (2%)

Query: 10 YSLKALGRQG-VVQLQIDAEDADQARRQAEDQGLRVLSLRSSGGALR-----GMAWRREA 63
Y +AL QG + +A+ A QAR+ ++GL LS+ + G + G++ RR+
Sbjct: 4 YHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLRRKI 63

Query: 64 AF---DLVLFSQELSTLLNAGLPLIDALESLAEKSPAATARKVLAELVRQLYEGRSLSQA 120
DL L +++L+TL+ A +PL +AL+++A++S +++A + ++ EG SL+ A
Sbjct: 64 RLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLADA 123

Query: 121 LGQQPRVFPPLYVALVQSSERTGALGDALTRYISYRQRLDLVRQKLVGASVYPLLLLLVG 180
+ P F LY A+V + E +G L L R Y ++ +R ++ A +YP +L +V
Sbjct: 124 MKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLTVVA 183

Query: 181 GGVVLFLLGYVVPRFSQVFEGMGTELPWLSRVLMQVGLFLHAQQAPLALGTVGGVAALWL 240
VV LL VVP+ + F M LP +RVLM + + + L + G A +
Sbjct: 184 IAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMAFRV 243

Query: 241 LRRHPRVRYWASCQLRRLPALHQRLMMYELARFYRSLGILLQGGIPILTAMGMARGLLGN 300
+ R + R +L LP + + AR+ R+L IL +P+L AM ++ ++ N
Sbjct: 244 MLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDVMSN 303

Query: 301 AAA-QGLEQASQRVGEGLPLSDALAAGHLVTPVSLRLLRAGEQSGNLGEMLERCADFHDQ 359
A L A+ V EG+ L AL L P+ ++ +GE+SG L MLER AD D+
Sbjct: 304 DYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADNQDR 363

Query: 360 EIGRWVEWFVKLFEPLLMTFIGLLIGLIVILMYMPIFELASSI 402
E + + LFEPLL+ + ++ IV+ + PI +L + +
Sbjct: 364 EFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
PP_3425   RTXTOXIND  55     2e-10    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 55.2 bits (133), Expect = 2e-10
Identities = 19/102 (18%), Positives = 43/102 (42%)

Query: 65 EVRPRVSGQIDQVAFTEGAQVKKGDLLFQIDPRPFQAEVRRLEAQLQQAKATAIRSANEA 124
E++P + + ++ EG V+KGD+L ++ +A+ + ++ L QA+ R +
Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILS 157

Query: 125 RRGERLRDSNAISAELAESRSSAAAEARAGVDAIQAQLDLAR 166
R E + + ++ + E I+ Q +
Sbjct: 158 RSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQ 199



Score = 40.6 bits (95), Expect = 9e-06
Identities = 16/115 (13%), Positives = 36/115 (31%), Gaps = 9/115 (7%)

Query: 104 RRLEAQLQQAKATAIRSANEARRGERLRDSNAISAELAESR-------SSAAAEARAGVD 156
LE + + +A +++ + + + E + +
Sbjct: 253 AVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIG 312

Query: 157 AIQAQLDLARLNLSFTRVTAPISGRVSR-AEFTAGNIVTADVTPLTSVVSTDKVY 210
+ +L + + AP+S +V + T G +VT L +V D
Sbjct: 313 LLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTA-ETLMVIVPEDDTL 366


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_3426   ACRIFLAVINRP  1101   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1101 bits (2849), Expect = 0.0
Identities = 434/1048 (41%), Positives = 647/1048 (61%), Gaps = 25/1048 (2%)

Query: 4 SKFFITRPIFAAVLSLVLLIAGSISLFQLPISEYPEVVPPTVVVRANFPGANPKVIGETV 63
+ FFI RPIFA VL+++L++AG++++ QLP+++YP + PP V V AN+PGA+ + + +TV
Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61

Query: 64 AAPLEQAITGVENMLYMSSQSTADGKLTLTITFALGTDLDNAQVQVQNRVTRTQPKLPEE 123
+EQ + G++N++YMSS S + G +T+T+TF GTD D AQVQVQN++ P LP+E
Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121

Query: 124 VTRIGITVDKASPDLTMVVHLTSPDNRYDMLYLSNYAILNIKDELARLGGVGDVQLFGMG 183
V + GI+V+K+S MV S + +S+Y N+KD L+RL GVGDVQLFG
Sbjct: 122 VQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG-A 180

Query: 184 DYSLRVWLDPNKTASRNLTASDVVAAIREQNRQVAAGQLGAPPAPGSTSFQLSINTQGRL 243
Y++R+WLD + LT DV+ ++ QN Q+AAGQLG PA SI Q R
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 244 VNEEEFENIIIRAGADGEITRLKDIARVELGSSQYALRSLLNNQPAVAIPIFQRPGSNAI 303
N EEF + +R +DG + RLKD+ARVELG Y + + +N +PA + I G+NA+
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 304 EISDEVRAKMAELKKDFPEGMDYSIVYDPTIFVRGSIEAVVHTLFEALVLVVLVVILFLQ 363
+ + ++AK+AEL+ FP+GM YD T FV+ SI VV TLFEA++LV LV+ LFLQ
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 364 TWRASIIPLLAVPVSLIGTFAVMHLFGFSLNALSLFGLVLAIGIVVDDAIVVVENVER-N 422
RA++IP +AVPV L+GTFA++ FG+S+N L++FG+VLAIG++VDDAIVVVENVER
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 423 IGLGLKPLEATQKAMSEVTGPIIATALVLCAVFVPAAFISGLTGQFYKQFALTIAISTVI 482
+ L P EAT+K+MS++ G ++ A+VL AVF+P AF G TG Y+QF++TI + +
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 483 SAFNSLTLSPALAAVLLK----DHHAPKDRFSRFLDKLLGSWLFSPFNRFFDRASHSYVG 538
S +L L+PAL A LLK +HH K F F FN FD + + Y
Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGF------------FGWFNTTFDHSVNHYTN 528

Query: 539 GVRRVIRSSGIALFVYAGLMGLTYLGFSSTPTGFVPAQDKQYLVAFAQLPDAASLDRTEA 598
V +++ S+G L +YA ++ + F P+ F+P +D+ + QLP A+ +RT+
Sbjct: 529 SVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQK 588

Query: 599 VIKRMSEIALKQPGVADSVAF--PGLSINGFTNSPNSGIVFTPLKPFDERKDPSQSAAAI 656
V+ ++++ LK F G S +G + N+G+ F LKP++ER SA A+
Sbjct: 589 VLDQVTDYYLKNEKANVESVFTVNGFSFSG--QAQNAGMAFVSLKPWEERNGDENSAEAV 646

Query: 657 AAALNAQFADIQDAYIAIFPPPPVQGLGTIGGFRLQIEDRGNLGYEALYKETQNIIAK-S 715
+ I+D ++ F P + LGT GF ++ D+ LG++AL + ++ +
Sbjct: 647 IHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAA 706

Query: 716 HNVPELAGLFTSYQVNVPQVDAAIDREKAKTHGVAITDIFDTLQVYLGSLYTNDFNRFGR 775
+ L + + + Q +D+EKA+ GV+++DI T+ LG Y NDF GR
Sbjct: 707 QHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGR 766

Query: 776 TYQVNVQAEQQFRLDAEQIGQLKVRNNLGEMIPLATFLKVSDTSGPDRVMHYNGFITAEI 835
++ VQA+ +FR+ E + +L VR+ GEM+P + F G R+ YNG + EI
Sbjct: 767 VKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEI 826

Query: 836 NGAAAPGYSSGQAEAAIEKLLKEELPNGMTFEWTDLTYQQILSGNTALLVFPLCVLLAFL 895
G AAPG SSG A A +E L +LP G+ ++WT ++YQ+ LSGN A + + ++ FL
Sbjct: 827 QGEAAPGTSSGDAMALMEN-LASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFL 885

Query: 896 VLAAQYESWSLPLAVILIVPMTLLSAITGVIVSGGDNNIFTQIGLIVLVGLACKNAILIV 955
LAA YESWS+P++V+L+VP+ ++ + + N+++ +GL+ +GL+ KNAILIV
Sbjct: 886 CLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIV 945

Query: 956 EFAKDEQAK-GLDPLAAVLEACRLRLRPILMTSIAFIMGVVPLVFSSGAGSEMRHAMGVA 1014
EFAKD K G + A L A R+RLRPILMTS+AFI+GV+PL S+GAGS ++A+G+
Sbjct: 946 EFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIG 1005

Query: 1015 VFSGMIGVTVFGLFLTPVFFFLIRRFVE 1042
V GM+ T+ +F PVFF +IRR +
Sbjct: 1006 VMGGMVSATLLAIFFVPVFFVVIRRCFK 1033



Score = 84.1 bits (208), Expect = 2e-18
Identities = 65/322 (20%), Positives = 126/322 (39%), Gaps = 20/322 (6%)

Query: 739 IDREKAKTHGVAITDIFDTL-----QVYLGSLYTNDFNRFGRTYQVNVQAEQQFRLDAEQ 793
+D + + + D+ + L Q+ G L G+ ++ A+ +F+ + E+
Sbjct: 188 LDADLLNKYKLTPVDVINQLKVQNDQIAAGQL-GGTPALPGQQLNASIIAQTRFK-NPEE 245

Query: 794 IGQLKVRNNL-GEMIPLATFLKVSDTSGPDRVM-HYNGFITAEINGAAAPGYSSGQ-AEA 850
G++ +R N G ++ L +V V+ NG A + A G ++ A+A
Sbjct: 246 FGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALDTAKA 305

Query: 851 AIEKL--LKEELPNGM----TFEWTDLTYQQILSGNTALLVFPLCVLLAFLVLAAQYESW 904
KL L+ P GM ++ T I L ++L FLV+ ++
Sbjct: 306 IKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLF---EAIMLVFLVMYLFLQNM 362

Query: 905 SLPLAVILIVPMTLLSAITGVIVSGGDNNIFTQIGLIVLVGLACKNAILIVEFAKDEQAK 964
L + VP+ LL + G N T G+++ +GL +AI++VE + +
Sbjct: 363 RATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMME 422

Query: 965 -GLDPLAAVLEACRLRLRPILMTSIAFIMGVVPLVFSSGAGSEMRHAMGVAVFSGMIGVT 1023
L P A ++ ++ ++ +P+ F G+ + + + S M
Sbjct: 423 DKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSV 482

Query: 1024 VFGLFLTPVFFFLIRRFVERRQ 1045
+ L LTP + + V
Sbjct: 483 LVALILTPALCATLLKPVSAEH 504


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits      Score  E-value  Comments
PP_3428   adhesinb  33     0.002    Adhesin B signature.
		>adhesinb#Adhesin B signature.

Length = 310

Score = 32.5 bits (74), Expect = 0.002
Identities = 13/47 (27%), Positives = 20/47 (42%), Gaps = 4/47 (8%)

Query: 318 VELDPDNAD-YRYTLAVTLHELDQLDAAQKQLETVLNRQPANRRARV 363
E DP N + Y L + +L LD K+ + N P ++ V
Sbjct: 160 SEKDPANKETYEKNLKAYVEKLSALD---KEAKEKFNNIPGEKKMIV 203


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
PP_3429   HTHFIS  91     1e-21    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 91.0 bits (226), Expect = 1e-21
Identities = 33/119 (27%), Positives = 56/119 (47%), Gaps = 1/119 (0%)

Query: 420 HVLVVEDDPHVRQLLCQALGENGFPCQSAADANEGLKVLRSAQPVDLLITDVGLPGMNGR 479
+LV +DD +R +L QAL G+ + ++A + + + DL++TDV +P N
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDG-DLVVTDVVMPDENAF 63

Query: 480 QLAEIARSLRPRLPVLFITGYAETAMAREGFLGADMHLICKPFELQQLQARVTHILGKP 538
L + RP LPVL ++ A + + KPF+L +L + L +P
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
PP_3430   HTHFIS  73     2e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 72.6 bits (178), Expect = 2e-15
Identities = 29/128 (22%), Positives = 51/128 (39%), Gaps = 13/128 (10%)

Query: 582 MARGERLLLVDDELDLRAVMREYLTERGFDVTDVGDANSALERFRHGGPFDLVITDIGLP 641
M +L+ DD+ +R V+ + L+ G+DV +A + G DLV+TD+ +P
Sbjct: 1 MTGA-TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAA-GDGDLVVTDVVMP 58

Query: 642 GGFSGRQVAKAMRMQLAQQKILFITGYAD-----QSIEAQLLDQPGTALLNKPFSLAHLA 696
+ + ++ +L ++ ++ E D L KPF L L
Sbjct: 59 DE-NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDY-----LPKPFDLTELI 112

Query: 697 DEALRLLD 704
R L
Sbjct: 113 GIIGRALA 120


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_3432   TYPE3IMQPROT  26     0.049    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 26.3 bits (58), Expect = 0.049
Identities = 14/56 (25%), Positives = 24/56 (42%), Gaps = 10/56 (17%)

Query: 1 MSKIMHAGRSMVELLLLIA--VALVPVVSGLLVMAFQLEAKLAENASISVQEAVFS 54
M ++ AG + L+L+++ +V + GLLV FQ +QE
Sbjct: 1 MDDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQ--------TVTQLQEQTLP 48


93  PP_3451  PP_3456  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
PP_3451   0       16       2.788848   hypothetical protein
PP_3452   -1      16       2.677536   diguanylate cyclase
PP_3453   0       16       2.996458   integral membrane sensor signal transduction
PP_3454   0       17       2.468523   winged helix family two component
PP_3455   0       17       2.598777   RND family efflux transporter MFP subunit
PP_3456   -1      17       1.589972   hydrophobe/amphiphile efflux-1 (HAE1) family
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_3451   TYPE3OMGPROT  31     0.005    Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature.

Length = 607

Score = 31.4 bits (71), Expect = 0.005
Identities = 13/59 (22%), Positives = 24/59 (40%), Gaps = 4/59 (6%)

Query: 153 RRDALYSQLQAFNRQLDKPLHISAFSTGKLTPRVNA----VWLDQLAGLGLTVWWQDGA 207
+ ++L L F D + +S K++ + +L +A L VW+ DG
Sbjct: 41 KGESLRDLLTDFGANYDATVVVSDKINDKVSGQFEHDNPQDFLQHIASLYNLVWYYDGN 99


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_3453   PF06580  29     0.026    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.4 bits (66), Expect = 0.026
Identities = 43/305 (14%), Positives = 101/305 (33%), Gaps = 80/305 (26%)

Query: 136 NVLSWGVTVLIGAAMLGCLLLWVWPHWRDLERLK-ETARRLGQGQMAE----RTHISPHS 190
LS V++ M L W +++ ++ + + + Q A+ + I+PH
Sbjct: 116 LALSIIFNVVVVTFMWSLLYF-GWHFFKNYKQAEIDQWKMASMAQEAQLMALKAQINPHF 174

Query: 191 NIGELAGVFDTMASDLERHVNQQRELLNAVSHELRTPLTRLDFGLVLLFDEVPPASRKRL 250
+ + + + + + RE+L ++S +R L + V L DE+
Sbjct: 175 ----MFNALNNIRALILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADEL-------- 222

Query: 251 LELVGHVRELDELVLELLSYSRLYNADQARERVEVSLLELVDSVLGGFAEELDGRGIQWE 310
+ + L+L S + + Q ++ +++++
Sbjct: 223 --------TVVDSYLQLASI-QFEDRLQFENQINPAIMDV-------------------- 253

Query: 311 VRAEGELPRFVLDPRLTARAVQNLVRNAMRYCDESLLLRLRLEADGACL-LTVEDDGIGV 369
++P ++ V+N +++ + + + L+ D + L VE+ G
Sbjct: 254 -----QVPPMLVQT-----LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLA 303

Query: 370 PVEERERVFQPFYRLDRSRDRNTGGFGLGLAISRRAIE---GQGGTLTLAQSALGGAQFR 426
+E G GL R ++ G + L+ G
Sbjct: 304 LKNTKE------------------STGTGLQNVRERLQMLYGTEAQIKLS-EKQGKVNAM 344

Query: 427 IRLPA 431
+ +P
Sbjct: 345 VLIPG 349


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
PP_3454   HTHFIS  84     2e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 83.7 bits (207), Expect = 2e-20
Identities = 29/136 (21%), Positives = 62/136 (45%)

Query: 48 PNIFLVEDDSALSELIASYLQRNDFHVQVIARGDHVLDEYRRQRPDLVILDLMLPGIDGL 107
I + +DD+A+ ++ L R + V++ + + DLV+ D+++P +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 108 QLCRLLRQESQTLPILMLTARDDSHDQVLGLEMGADDYVTKPCEPRVLLARVRTLLRRSS 167
L +++ LP+L+++A++ + E GA DY+ KP + L+ + L
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 168 VNEPRLDNDLILIGGL 183
+L++D L
Sbjct: 124 RRPSKLEDDSQDGMPL 139


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
PP_3455   RTXTOXIND  44     4e-07    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 44.4 bits (105), Expect = 4e-07
Identities = 39/214 (18%), Positives = 80/214 (37%), Gaps = 31/214 (14%)

Query: 64 RTAEVRARVAGVVLKRVYREGSDVKQGDVLFLIDPAPFKADHDSARATL--AKAEATRYQ 121
R+ E++ +V + + +EG V++GDVL + +AD +++L A+ E TRYQ
Sbjct: 95 RSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQ 154

Query: 122 A-------------RLQEQRYRELVDDKAVSRQEYDNAKASFLQADAEVAEARAALERAR 168
+L ++ Y + V ++ V R K F + + L++ R
Sbjct: 155 ILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLT-SLIKEQFSTWQNQKYQKELNLDKKR 213

Query: 169 LNLGYATVTAPISGRIGRAQVTEGAL-----------VGQNETTPLATIQQLDPIHADVT 217
TV A I+ ++V + L + ++ L + ++
Sbjct: 214 AER--LTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAV--LEQENKYVEAVNELR 269

Query: 218 QSTRELNALRRALRAGELQQVGDGQARATLIQDD 251
+L + + + + + Q I D
Sbjct: 270 VYKSQLEQIESEILSAKEEYQLVTQLFKNEILDK 303



Score = 37.9 bits (88), Expect = 6e-05
Identities = 20/92 (21%), Positives = 38/92 (41%), Gaps = 4/92 (4%)

Query: 113 AKAEATRYQARLQ--EQRYRELVDDKAVSRQEYDNAKASFL-QADAEVAEARAALERARL 169
A E Y+++L+ E ++ + Q + N L Q + L +
Sbjct: 264 AVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEE 323

Query: 170 NLGYATVTAPISGRIGRAQV-TEGALVGQNET 200
+ + AP+S ++ + +V TEG +V ET
Sbjct: 324 RQQASVIRAPVSVKVQQLKVHTEGGVVTTAET 355


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_3456   ACRIFLAVINRP  1136   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1136 bits (2939), Expect = 0.0
Identities = 539/1031 (52%), Positives = 736/1031 (71%), Gaps = 7/1031 (0%)

Query: 1 MPQFFIDRPVFAWVVALFILLAGALAIPQLPVAQYPNVAPPQVEIYAVYPGASAATMDES 60
M FFI RP+FAWV+A+ +++AGALAI QLPVAQYP +APP V + A YPGA A T+ ++
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VVSLIEQELNGADNLLYFESQS-SLGSATITATFAPGTHPDLAQVDVQNRLKVVESRLPR 119
V +IEQ +NG DNL+Y S S S GS TIT TF GT PD+AQV VQN+L++ LP+
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 120 PVTQQGLQVEKVSTGFLLLATLTSEDGKLDETALSDILARNVMDEIRRLKGVGKAQLYGS 179
V QQG+ VEK S+ +L++A S++ + +SD +A NV D + RL GVG QL+G+
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 180 ERAMRIWIDPRKLIGFNLTPNDVAEAIAAQNAQVAPGSIGDLPSRSTQEITANVVVKGQL 239
+ AMRIW+D L + LTP DV + QN Q+A G +G P+ Q++ A+++ + +
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 240 SSPDEFAAIVLRANPDGSTVTIGDVARVEIGAQEYQYGTRLNGKPATAFSVQLSPGANAM 299
+P+EF + LR N DGS V + DVARVE+G + Y R+NGKPA ++L+ GANA+
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 300 ETATLVRAKMQDLARYFPEGVKYDIPYDTSPFVKVSIEQVINTLFEAMLLVFAVMFLFLQ 359
+TA ++AK+ +L +FP+G+K PYDT+PFV++SI +V+ TLFEA++LVF VM+LFLQ
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 360 NLRYTLIPTLVVPVALMGTFAVMLAMGFSVNVLTLFGMVLAIGILVDDAIVVVENVERIM 419
N+R TLIPT+ VPV L+GTFA++ A G+S+N LT+FGMVLAIG+LVDDAIVVVENVER+M
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 420 AEEGLPPKQATRKAMGQISGAIVGITLVLVAVFLPMAFMQGSVGVIYQQFSLSMAVSILF 479
E+ LPPK+AT K+M QI GA+VGI +VL AVF+PMAF GS G IY+QFS+++ ++
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 480 SAFLALSLTPALCATLLKPVAKGEHHERKGFFGWFNRRFESMSNGYQRWVVQALKRSGRY 539
S +AL LTPALCATLLKPV+ H + GFFGWFN F+ N Y V + L +GRY
Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540

Query: 540 LLVYAVLLAVLGYGFSQLPTAFLPTEDQGYTITDIQLPPGASRMRTEQVAAQIE--AHNA 597
LL+YA+++A + F +LP++FLP EDQG +T IQLP GA++ RT++V Q+
Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600

Query: 598 EEPGVGNTTLILGFSFSGSGQNAALAFTTLKDWSER-GADDSAQSIADRATMAFTQLKDA 656
E+ V + + GFSFSG QNA +AF +LK W ER G ++SA+++ RA M +++D
Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660

Query: 657 IAYSVLPPPIDGLGESTGFEFRLQDRGGMGHAELMAARDQLLESASKSKV-LTNVREASL 715
P I LG +TGF+F L D+ G+GH L AR+QLL A++ L +VR L
Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720

Query: 716 AESPQVQLEIDRRQANALGVSFADIGTVLDVAVGSSYVNDFPNQGRMQRVVVQAEGDQRS 775
++ Q +LE+D+ +A ALGVS +DI + A+G +YVNDF ++GR++++ VQA+ R
Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780

Query: 776 QVEDLLNIHVRNDSGKMVPLGAFVQARWVSGPVQLTRYNGYPAVSISGEPAAGYSSGEAM 835
ED+ ++VR+ +G+MVP AF + WV G +L RYNG P++ I GE A G SSG+AM
Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840

Query: 836 AEVERLVAQLPAGTGLEWTGLSLQERLSGSQAPLLMALSLLVVFLCLAALYESWSIPTAV 895
A +E L ++LPAG G +WTG+S QERLSG+QAP L+A+S +VVFLCLAALYESWSIP +V
Sbjct: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900

Query: 896 LLVVPLGVLGAVLAVTLRGMPNDVFFKVGLITLIGLSAKNAILIIEFAKHLVD-QGVDAA 954
+LVVPLG++G +LA TL NDV+F VGL+T IGLSAKNAILI+EFAK L++ +G
Sbjct: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960

Query: 955 DAAVQAARLRLRPIVMTSLAFILGVVPLAIASGASSASQQAIGTGVIGGMLSAT-LAVVF 1013
+A + A R+RLRPI+MTSLAFILGV+PLAI++GA S +Q A+G GV+GGM+SAT LA+ F
Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020

Query: 1014 VPVFFVVVMRL 1024
VPVFFVV+ R
Sbjct: 1021 VPVFFVVIRRC 1031


94  PP_3545  PP_3552  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
PP_3545   -1      11       1.321685   PAS/PAC sensor hybrid histidine kinase
PP_3546   -2      12       1.519905   PAS/PAC sensor hybrid histidine kinase
PP_3547   -1      12       2.563848   short chain dehydrogenase/reductase
PP_3548   0       12       2.394240   EmrB/QacA family drug resistance transporter
PP_3549   0       12       2.964554   secretion protein HlyD family protein
PP_3550   0       13       2.957113   MarR family transcriptional regulator
PP_3551   1       13       2.512585   LuxR family transcriptional regulator
PP_3552   0       14       2.515549   PAS/PAC sensor signal transduction histidine
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
PP_3545   HTHFIS  73     6e-16    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 73.3 bits (180), Expect = 6e-16
Identities = 29/124 (23%), Positives = 58/124 (46%), Gaps = 2/124 (1%)

Query: 430 KVMLVEDEPALRLVMLEVLLDQGHEVQAFEDGRQAYKALQEAPAPDLLITDVGLPGGIDG 489
+++ +D+ A+R V+ + L G++V+ + ++ + DL++TDV +P +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD-GDLVVTDVVMPDE-NA 62

Query: 490 YQLADACRGIAPHAAVLLITGYDLAHSTAKARPHRRTELLAKPFDLQALAQALERLLGST 549
+ L + P VL+++ + + KA + L KPFDL L + R L
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 550 AQSP 553
+ P
Sbjct: 123 KRRP 126


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
PP_3546   HTHFIS  79     2e-17    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 78.7 bits (194), Expect = 2e-17
Identities = 31/119 (26%), Positives = 53/119 (44%), Gaps = 2/119 (1%)

Query: 570 RILLVEDQTALRLVIGEVLEELGYRVDAFENGPSALTHLQSGERPDLLLSDVGLPGGLNG 629
IL+ +D A+R V+ + L GY V N + + +G+ DL+++DV +P N
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD-GDLVVTDVVMPDE-NA 62

Query: 630 RQVAERFRERYPDLKVLLITGYDESAAFSDGQPLQGTLVLTKPFELEALAERVRELLEP 688
+ R ++ PDL VL+++ + L KPF+L L + L
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_3547   DHBDHDRGNASE  94     4e-25    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 94.0 bits (233), Expect = 4e-25
Identities = 72/254 (28%), Positives = 117/254 (46%), Gaps = 17/254 (6%)

Query: 4 VIVITGGSRGIGAATALLAARQGYRICINYHADDQAAEAILSQVRALGAEAIAVRADVSV 63
+ ITG ++GIG A A A QG I + + E ++S ++A A A ADV
Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHI-AAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 64 EDEIIQLFLRVDDELGPVTALVNNAGTIGQQSRVEDMSEFRLLNVMKTNVVGPMLCAKHA 123
I ++ R++ E+GP+ LVN AG + + + +S+ N G ++
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVL-RPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 124 LLRMARRHGGQGGAIVNVSSVAARLGSPNEYVD-YAASKGALDTFTIGLAKEVAGEGVRV 182
M R + G+IV V S A G P + YA+SK A FT L E+A +R
Sbjct: 128 SKYMMDR---RSGSIVTVGSNPA--GVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 183 NGVRPGYIHTGFH-----ALSGDPDRV----SKLEPGLPMGRGGRPEEVAEAILWLLSDK 233
N V PG T +G + + G+P+ + +P ++A+A+L+L+S +
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 234 ASYATGSLIDLGGG 247
A + T + + GG
Sbjct: 243 AGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_3548   TCRTETB  120    1e-31    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 120 bits (303), Expect = 1e-31
Identities = 82/403 (20%), Positives = 162/403 (40%), Gaps = 28/403 (6%)

Query: 19 IGLSLATFMQVLDTTIANVALPTISGNLGVSYEQGTWVITSFAVSNAIALPLTGWLSRRF 78
I L + +F VL+ + NV+LP I+ + WV T+F ++ +I + G LS +
Sbjct: 17 IWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQL 76

Query: 79 GEVKLFIWATLLFVLASFLCGIAQSMPELVGF-RVLQGVVAGPLYPMTQTLLIAVY-PPA 136
G +L ++ ++ S + + S L+ R +QG +P +++A Y P
Sbjct: 77 GIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQG-AGAAAFPALVMVVVARYIPKE 135

Query: 137 KRGMALALLAMVTVVAPIAGPILGGWITDSYSWPWIFF---INVPIGLFAAAVVRQQMRT 193
RG A L+ + + GP +GG I W ++ I + F ++++++R
Sbjct: 136 NRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRI 195

Query: 194 RPVVTSRQPMDYIGLLTLIIGVGALQVVLDKGNDLDWFESSFIIVGSLISVVFLAVFVIW 253
+ D G++ + +G+ + F +S+ I ++SV+ +FV
Sbjct: 196 ------KGHFDIKGIILMSVGIVFFML----------FTTSYSISFLIVSVLSFLIFVKH 239

Query: 254 ELTDRHPVVNLRLFVHRNFRIGTIVLVGGYAGFFGINLILPQWLQTQMGYTATWAGLAVA 313
P V+ L + F IG + + G ++P ++ + G +
Sbjct: 240 IRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVII 299

Query: 314 PIGLLPVIMS-PFVGKYAHRFDLRVLA--GLAFLAIGTSCYMRAGFTSEVDFQHVALVQL 370
G + VI+ G R + G+ FL++ ++ A F E + ++ +
Sbjct: 300 FPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVS---FLTASFLLETTSWFMTIIIV 356

Query: 371 FMGIGVALFFMPTLSILLSDLPPHQIADGSGLATFLRTLGGSF 413
F+ G++ +I+ S L + G L F L
Sbjct: 357 FVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGT 399


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
PP_3549   RTXTOXIND  88     3e-21    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 88.3 bits (219), Expect = 3e-21
Identities = 54/412 (13%), Positives = 117/412 (28%), Gaps = 90/412 (21%)

Query: 15 EPSRKRKAWLLGLLLLLILAGVGTWAWYSIVGRWHESTDDAYVNGNVVEITPLVAGTVTS 74
E R+ L+ ++ L + V + +G EI P+ V
Sbjct: 50 ETPVSRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKE 109

Query: 75 IGADDGDLVHAGQVLLQFDPADSEVALQSAEAKLARSVRQVRGLYSNVDSL--------- 125
I +G+ V G VLL+ +E ++ L ++ + S+
Sbjct: 110 IIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELK 169

Query: 126 ----------------------KAQLETRQAELRKAQQDFNRR----------------- 146
K Q T Q + + + + +++
Sbjct: 170 LPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENL 229

Query: 147 -----------KVLADSGAIAA-------EELSHARDDLSVAQAAVNSARQQLSTS---- 184
L AIA + A ++L V ++ + ++ ++
Sbjct: 230 SRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEY 289

Query: 185 ---SALVDDTVVSSHPDVMAAAADLRQ----AYLDHARTTLVAPVTGYVAKRTVQ-LGQR 236
+ L + ++ L + + APV+ V + V G
Sbjct: 290 QLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGV 349

Query: 237 LQPGTATMAVIPLDQV-WIDANFKETQLREMRIGQPVEITADVYGSEV--KYSGTVDSLG 293
+ M ++P D + A + + + +GQ I + + G V ++
Sbjct: 350 VTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNIN 409

Query: 294 AGTGSAFALLPAQNATGNWIKIVQRVPVRIHLSPDQLKDHPLRIGLSTVVEV 345
G ++ + + K+ PL G++ E+
Sbjct: 410 LDA-------IEDQRLGLVFNVIISIEE--NCLSTGNKNIPLSSGMAVTAEI 452


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
PP_3551   HTHFIS  101    3e-27    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 101 bits (253), Expect = 3e-27
Identities = 27/150 (18%), Positives = 56/150 (37%)

Query: 1 MLQAKVYVVDDDQGMRDSTVWLLQSVGLQALPFASGQAFLDACVDDGPACVLLDVRMPGL 60
M A + V DDD +R L G ++ V+ DV MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 GGLAVQQAMRERGLMVPVIFVSGHADVPIVVRAFKAGACDFIEKPYNDQLLLDSVQAALE 120
+ +++ +PV+ +S ++A + GA D++ KP++ L+ + AL
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 121 HAGLARQGDQALALVQARIDGLTPRERDVF 150
+ + + G + ++++
Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQEIY 150


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_3552   PF06580  36     3e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 35.6 bits (82), Expect = 3e-04
Identities = 31/127 (24%), Positives = 58/127 (45%), Gaps = 11/127 (8%)

Query: 291 ISEQATHAAEVIRRLRAFLRKGPRRLQALDVAELAGEAMRLCAW---EAAR--DQVQVEL 345
I E T A E++ L +R R A V+ LA E + ++ + + D++Q E
Sbjct: 186 ILEDPTKAREMLTSLSELMRYSLRYSNARQVS-LADELTVVDSYLQLASIQFEDRLQFEN 244

Query: 346 RMSAQLPLVYADRVLLEQVLLNLLRNAIDANREQQGERPSRILLCAARDGDGVLVEVADQ 405
+++ + V +L++ ++ N +++ I A Q G +ILL +D V +EV +
Sbjct: 245 QINPAIMDVQVPPMLVQTLVENGIKHGI-AQLPQGG----KILLKGTKDNGTVTLEVENT 299

Query: 406 GPGVAPE 412
G
Sbjct: 300 GSLALKN 306


95    PP_3582    PP_3588    N    Y    Y    Pathogenicity Island (unbiased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
PP_3582    -1    8    -0.579882    RND efflux transporter
PP_3583    -3    7    -0.766713    acriflavin resistance protein
PP_3585    -2    11    -1.327262    RND family efflux transporter MFP subunit
PP_3586    -2    12    -0.995302    ISPpu9, transposase
PP_3587    -2    11    0.359170    redoxin domain-containing protein
PP_3588    -2    10    0.074912    Bcr/CflA family multidrug resistance
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_3582    RTXTOXIND    34    0.002    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 33.6 bits (77), Expect = 0.002
Identities = 32/214 (14%), Positives = 63/214 (29%), Gaps = 33/214 (15%)

Query: 92 RSNQTVAQSEAQYRQA-------QALVRSSRAALFPSLDLSTSKNRSAQGTGSSSSSLSN 144
+ ++++ QA Q L RS P L L S
Sbjct: 131 GAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSL 190

Query: 145 NSSGIRNTYNAQLGVSWEIDLWGKLRETLNANEASAEASFA----DLASIR--------- 191
N + +D R T+ A E L
Sbjct: 191 IKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIA 250

Query: 192 ----LSQQSELVQNYLQLRVIDEQKRLLEATVAAYERSLRMNENQYRAGVAGPDAVAQAR 247
L Q+++ V+ +LRV Q +E+ + + + ++ ++ +
Sbjct: 251 KHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKN---------EIL 301

Query: 248 TQLKSTQADLIDLIWQRAQFENAIAVLLGKAPAD 281
+L+ T ++ L + A+ E + +AP
Sbjct: 302 DKLRQTTDNIGLLTLELAKNEERQQASVIRAPVS 335


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_3583    ACRIFLAVINRP    803    0.0    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 803 bits (2076), Expect = 0.0
Identities = 299/1037 (28%), Positives = 523/1037 (50%), Gaps = 30/1037 (2%)

Query: 3 LSGPFIRRPVATMLLSLAIMLLGGVSFGLLPVAPLPQMDFPVIVVSANLSGASPEVMAST 62
++ FIRRP+ +L++ +M+ G ++ LPVA P + P + VSAN GA + + T
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 63 VATPLERKLGSIAGVTTLTSSS-NQGSTRVVIGFELGRDIDGAAREVQAAINATRNLLPS 121
V +E+ + I + ++S+S + GS + + F+ G D D A +VQ + LLP
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 122 GMRSMPTYKKINPSQAPIMVLSLTSD--VLQKGQLYDLADTILSQSLAQVSGVGEVQIGG 179
++ + S + +MV SD + + D + + +L++++GVG+VQ+ G
Sbjct: 121 EVQQQGISVE-KSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG 179

Query: 180 SSLPAVRIAVEPQLLNQYNLSLDEVRTAVSNANQRRPMGFV------EDAERNWQVRAND 233
+ A+RI ++ LLN+Y L+ +V + N + G + + N + A
Sbjct: 180 AQY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQT 238

Query: 234 QLESAKDYEPVVIR-QQNGTILRLSDVATVTDGVENRYNSGFFNDQAAVLLVVNRQTGAN 292
+ ++ +++ V +R +G+++RL DVA V G EN N + A L + TGAN
Sbjct: 239 RFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGAN 298

Query: 293 IIETVDQIKAQLPALQSLLPASVQLNVAMDRSPVIKATLKEAEHTLLIAVVLVILVVYLF 352
++T IKA+L LQ P +++ D +P ++ ++ E TL A++LV LV+YLF
Sbjct: 299 ALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLF 358

Query: 353 LGSLRASLIPSLAVPVSLVGTFAVMYVCGFSLNNLSLMALILATGLVVDDAIVVLENISR 412
L ++RA+LIP++AVPV L+GTFA++ G+S+N L++ ++LA GL+VDDAIVV+EN+ R
Sbjct: 359 LQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVER 418

Query: 413 HI-ENGQPPMKAAFLGAKEVGFTLLSMNVSLVAVFVSILFMGGIVRNLFQEFSITLAAAI 471
+ E+ PP +A ++ L+ + + L AVF+ + F GG ++++FSIT+ +A+
Sbjct: 419 VMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAM 478

Query: 472 IVSLVVSLTLTPMLCARWLKPQQAEQTRLQR----WSDTLHQRMVAAYDHSLGWALRHKR 527
+S++V+L LTP LCA LKP AE + W +T V Y +S+G L
Sbjct: 479 ALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTG 538

Query: 528 LTLLSLLATIGINIALYVVVPKTLMPQQDTGQLMGFIRGDDGLSFTVMQPKMEIYRRALL 587
LL + + L++ +P + +P++D G + I+ G + Q ++ L
Sbjct: 539 RYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYL 598

Query: 588 ADP-----AVQSVAGFIGGNSGTNNAFVLVRLKPISERKID---AQKVIERLRKELPKVP 639
+ +V +V GF N V LKP ER D A+ VI R + EL K+
Sbjct: 599 KNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIR 658

Query: 640 GGRLFLMADQDLQLGGGGRDQTSSQYLYTLQSGDLAALREWFPKVVAALRALP-ELTAID 698
G + + G L AL + +++ P L ++
Sbjct: 659 DGFVIPFNMPAIV--ELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVR 716

Query: 699 ARDGAGTQQVTLVVDRDQAKRLGIDMDMVTAVLNNAYSQRQISTIYDSLNQYQVVLEINP 758
T Q L VD+++A+ LG+ + + ++ A ++ D ++ ++ +
Sbjct: 717 PNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADA 776

Query: 759 KYAWDPSTLEQVQVITADGARVPLSTIAHYENSLANDRVSHEGQFASEDIAFDVAEGYSP 818
K+ P ++++ V +A+G VP S + R+ S +I + A G S
Sbjct: 777 KFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSS 836

Query: 819 DQAMAALERAVAKLGLPEEVIAKLGGTADAFAQTQQGQPFMILGALLLVYLVLGILYESY 878
AMA +E +K LP + G + + P ++ + ++V+L L LYES+
Sbjct: 837 GDAMALMENLASK--LPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESW 894

Query: 879 IHPLTILSTLPSAGVGALLALYVTGGEFSLISLLGLFLLIGVVKKNAILMIDLALQLERH 938
P++++ +P VG LLA + + + ++GL IG+ KNAIL+++ A L
Sbjct: 895 SIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEK 954

Query: 939 QGFSPEESIRRACLLRLRPILMTTLAAILGALPLLLSQAEGAEMRQPLGLTIIGGLVFSQ 998
+G E+ A +RLRPILMT+LA ILG LPL +S G+ + +G+ ++GG+V +
Sbjct: 955 EGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSAT 1014

Query: 999 ILTLYTTPVVYLYLDRL 1015
+L ++ PV ++ + R
Sbjct: 1015 LLAIFFVPVFFVVIRRC 1031



Score = 98.0 bits (244), Expect = 1e-22
Identities = 83/511 (16%), Positives = 177/511 (34%), Gaps = 41/511 (8%)

Query: 2 NLSGPFIRRPVATMLLSLAIMLLGGVSFGLLPVAPLPQMDFPVIVVSANL-SGASPEVMA 60
N G + +L+ I+ V F LP + LP+ D V + L +GA+ E
Sbjct: 528 NSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQ 587

Query: 61 STVATPLERKL-------GSIAGVTTLTSSSN-QGSTRVVIGFELGRDIDGAAREVQAAI 112
+ + L S+ V + S Q + + + + +G +A I
Sbjct: 588 KVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVI 647

Query: 113 NATRNLLPSGMRSMPTYKKINPSQAPIMVLSLTS-------DVLQKG--QLYDLADTILS 163
+ + L + I + I+ L + D G L + +L
Sbjct: 648 HRAKMELG----KIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLG 703

Query: 164 QSLAQVSGVGEVQIGGSS-LPAVRIAVEPQLLNQYNLSLDEVRTAVSNANQRRPMGFVED 222
+ + + V+ G ++ V+ + +SL ++ +S A + D
Sbjct: 704 MAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFID 763

Query: 223 AERNWQVRA---NDQLESAKDYEPVVIRQQNGTILRLSDVATVTDG----VENRYNSGFF 275
R ++ +D + + +R NG ++ S T RYN
Sbjct: 764 RGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYN---- 819

Query: 276 NDQAAVLLVVNRQTGANIIETVDQIKAQLPALQSLLPASVQLNVAMDRSPVIKATLKEAE 335
L + Q A + A + L S LPA + + S + + +A
Sbjct: 820 -----GLPSMEIQGEAAPGTSSGDAMALMENLASKLPAGIGYDW-TGMSYQERLSGNQAP 873

Query: 336 HTLLIAVVLVILVVYLFLGSLRASLIPSLAVPVSLVGTFAVMYVCGFSLNNLSLMALILA 395
+ I+ V+V L + S + L VP+ +VG + + ++ L+
Sbjct: 874 ALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTT 933

Query: 396 TGLVVDDAIVVLENI-SRHIENGQPPMKAAFLGAKEVGFTLLSMNVSLVAVFVSILFMGG 454
GL +AI+++E + G+ ++A + + +L +++ + + + G
Sbjct: 934 IGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNG 993

Query: 455 IVRNLFQEFSITLAAAIIVSLVVSLTLTPML 485
I + ++ + ++++ P+
Sbjct: 994 AGSGAQNAVGIGVMGGMVSATLLAIFFVPVF 1024



Score = 76.4 bits (188), Expect = 4e-16
Identities = 65/411 (15%), Positives = 146/411 (35%), Gaps = 24/411 (5%)

Query: 624 AQKVIERLRKELPKVPGGRLFLMADQDLQLGGGGRDQTSSQYL-YTLQSGDLAALREWFP 682
+V +L+ P +P Q++Q G +++SS YL D +
Sbjct: 104 QVQVQNKLQLATPLLP---------QEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDI 154

Query: 683 KVVAALRALPELTAI----DARDGAGTQQVTLVVDRDQAKRLGIDMDMVTAVLNNAYSQ- 737
A L+ + D + + + +D D + + V L Q
Sbjct: 155 SDYVASNVKDTLSRLNGVGDVQLFGAQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQI 214

Query: 738 ---RQISTIYDSLNQYQVVLEINPKYAWDPSTLEQVQV-ITADGARVPLSTIAHYENSLA 793
+ T Q + ++ +P +V + + +DG+ V L +A E
Sbjct: 215 AAGQLGGTPALPGQQLNASIIAQTRFK-NPEEFGKVTLRVNSDGSVVRLKDVARVELGGE 273

Query: 794 NDR--VSHEGQFASEDIAFDVAEGYSPDQAMAALER-AVAKLGLPEEV-IAKLGGTADAF 849
N G+ A+ + D A A + A + P+ + + T
Sbjct: 274 NYNVIARINGKPAAGLGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFV 333

Query: 850 AQTQQGQPFMILGALLLVYLVLGILYESYIHPLTILSTLPSAGVGALLALYVTGGEFSLI 909
+ + A++LV+LV+ + ++ L +P +G L G + +
Sbjct: 334 QLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTL 393

Query: 910 SLLGLFLLIGVVKKNAILMIDLALQLERHQGFSPEESIRRACLLRLRPILMTTLAAILGA 969
++ G+ L IG++ +AI++++ ++ P+E+ ++ ++ +
Sbjct: 394 TMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVF 453

Query: 970 LPLLLSQAEGAEMRQPLGLTIIGGLVFSQILTLYTTPVVYLYLDRLRHRFN 1020
+P+ + + +TI+ + S ++ L TP + L + +
Sbjct: 454 IPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEH 504


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_3585    RTXTOXIND    47    1e-07    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 46.7 bits (111), Expect = 1e-07
Identities = 31/132 (23%), Positives = 54/132 (40%), Gaps = 22/132 (16%)

Query: 83 ALGTVT-ATNTVNVRSRVAGELVKIHFKEGQRVKAGDLLAEIDPRPYRIALQQAEGTLAQ 141
A G +T + + ++ + +I KEG+ V+ GD+L ++ AE +
Sbjct: 86 ANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTAL-------GAEADTLK 138

Query: 142 NQAQLKNAQVDLARYKGLYAEDSIAKQTLDT----------AEAQVAQFQGLVK----TN 187
Q+ L A+++ RY+ L + K +E +V + L+K T
Sbjct: 139 TQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTW 198

Query: 188 QAQVNDARLNLD 199
Q Q LNLD
Sbjct: 199 QNQKYQKELNLD 210



Score = 37.9 bits (88), Expect = 6e-05
Identities = 18/102 (17%), Positives = 41/102 (40%), Gaps = 10/102 (9%)

Query: 132 LQQAEGTLAQNQAQLKNAQVDLARYKGLYAEDSIAKQTLDTAEAQVAQFQGLVKTNQAQV 191
L+ + L Q ++++ +A+ + L+ + + K L + ++
Sbjct: 268 LRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDK--LRQTTDNIGLLT-------LEL 318

Query: 192 NDARLNLDFTQIRAPISGRV-GLRQLDLGNLVAANDTTALVV 232
+ IRAP+S +V L+ G +V +T ++V
Sbjct: 319 AKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIV 360


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_3588    TCRTETB    54    6e-10    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 53.7 bits (129), Expect = 6e-10
Identities = 80/399 (20%), Positives = 147/399 (36%), Gaps = 55/399 (13%)

Query: 16 LLILLCLLGVF-PLDVIL--PSFPALSDEFRVDTKQIAYSVSFFAVGVAMAQIVIGPLSD 72
+LI LC+L F L+ ++ S P ++++F + + F + ++ V G LSD
Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74

Query: 73 GIGRKRLLLAGLSVSIVGAL-GCVFSTHYETFMAFRLVQALGCGSLV-LGQALVQDLYSG 130
+G KRLLL G+ ++ G++ G V + + + R +Q G + L +V
Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134

Query: 131 TQRNAMRILLTSASGLFISLSPLAGAFLQQSFGWEASFTVFVIIAAIVSLLSCVLLHDTP 190
R L+ S + + P G + W S+ + + + I+++ + L
Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHW--SYLLLIPMITIITVPFLMKLLKKE 192

Query: 191 ASHDRA--------PSMSSYRVMLRDTDY----LAHSMLSSLAFACHFS----------- 227
S+ ML T Y L S+LS L F H
Sbjct: 193 VRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGL 252

Query: 228 ----------------------FIVIAPLLLMGRLELTAYQFSLVFIGYG-LAYIVGGMA 264
F+ + P ++ +L+ + V I G ++ I+ G
Sbjct: 253 GKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYI 312

Query: 265 AAYLNSRVSPQTQIKAGLLLISTAGITLLMWEWVAGLSVLGALLPMIVCTTGTTLVRPAA 324
L R P + G+ +S + +T + ++ ++ + T V
Sbjct: 313 GGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTI 372

Query: 325 TTQALARYPRQAGAAASLNTTLLFAGAGLTSSVVAGLES 363
+ +L ++AGA SL F G ++V GL S
Sbjct: 373 VSSSLK--QQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409


96    PP_3756    PP_3762    N    Y    N    Pathogenicity Island (unbiased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
PP_3756    1    12    -0.050459    TetR family transcriptional regulator
PP_3757    1    12    -0.242304    response regulator receiver protein
PP_3758    1    13    0.229738    response regulator receiver sensor signal
PP_3759    0    11    -0.209593    chemotaxis protein CheB
PP_3760    -1    10    -0.731021    chemotaxis protein CheR
PP_3761    -1    11    -0.491927    multi-sensor hybrid histidine kinase
PP_3762    -2    12    -1.371129    response regulator receiver protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_3756    HTHTETR    57    2e-12    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 57.3 bits (138), Expect = 2e-12
Identities = 26/125 (20%), Positives = 49/125 (39%), Gaps = 2/125 (1%)

Query: 18 DRAMALFAEKGFGQVSMRELAAHVGLTAGSLYHHFPSKQDLLYDLIEELYEEL-QATLDQ 76
D A+ LF+++G S+ E+A G+T G++Y HF K DL ++ E + + L+
Sbjct: 18 DVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEY 77

Query: 77 ARRAMARGASALSCLIAAHWQLHAERPLQFRLAERDL-CCLSEAQQAHLASLRKRYEAGL 135
+ S L ++ + + L E C + A + ++
Sbjct: 78 QAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCLES 137

Query: 136 LRLIA 140
I
Sbjct: 138 YDRIE 142


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_3757    HTHFIS    70    3e-17    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 69.9 bits (171), Expect = 3e-17
Identities = 37/121 (30%), Positives = 55/121 (45%), Gaps = 12/121 (9%)

Query: 9 VLVVEDEPAIRMILRDYLAGEGYHVLVAEDGEQAFAILASKPHLDLMVTDFRLPGGISGV 68
+LV +D+ AIR +L L+ GY V + + + +A+ DL+VTD +P +
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAG-DGDLVVTDVVMPDE-NAF 63

Query: 69 DIAEPAVKLRPDLKVIFISGYP-----AEILESGSPITRKAPILAKPFDLDTLHEQIQSL 123
D+ K RPDL V+ +S + E G+ L KPFDL L I
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGA-----YDYLPKPFDLTELIGIIGRA 118

Query: 124 L 124
L
Sbjct: 119 L 119


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_3758    HTHFIS    71    1e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 71.4 bits (175), Expect = 1e-15
Identities = 43/197 (21%), Positives = 77/197 (39%), Gaps = 21/197 (10%)

Query: 5 TTAKLLIVDDLPENLLALDALIQGEDREVHQAQSAEAALSLLLEHEFALAILDVQMPGMN 64
T A +L+ DD L+ + +V +A + + L + DV MP N
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 65 GFELAELMRGTDKTKHIPIVFVSAAGREMNYAFKGYESGAVDFLHKPLDTLAVKSKVSVF 124
F+L ++ +P++ +SA M A K E GA D+L KP D
Sbjct: 62 AFDLLPRIKKAR--PDLPVLVMSAQNTFMT-AIKASEKGAYDYLPKPFD----------- 107

Query: 125 VDLFRQRKVLGRQLEALEQSRQEQELLLSQLQVARCELEHAVRMRDDFMSIVSHEVRTPL 184
+++G AL + ++ L Q + + M++ + + ++T L
Sbjct: 108 -----LTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLAR-LMQTDL 161

Query: 185 NGLIL-ETQLRKMHLAR 200
+I E+ K +AR
Sbjct: 162 TLMITGESGTGKELVAR 178


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_3761    HTHFIS    81    1e-17    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 80.6 bits (199), Expect = 1e-17
Identities = 37/123 (30%), Positives = 57/123 (46%), Gaps = 3/123 (2%)

Query: 1032 KVLLVDDDVRNIFALTSALEHKGAIVEIGRNGREAIERLEQHDDIDLVLMDVMMPEMDGF 1091
+L+ DDD L AL G V I N + D DLV+ DV+MP+ + F
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAA-GDGDLVVTDVVMPDENAF 63

Query: 1092 EATRLIRQQPRWRKLPIIAVTAKAMKDDQQRCLQAGANDYLAKPIDLDRLFSLIRVWLPQ 1151
+ I++ LP++ ++A+ + + GA DYL KP DL L +I L +
Sbjct: 64 DLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 1152 LER 1154
+R
Sbjct: 122 PKR 124



Score = 71.4 bits (175), Expect = 8e-15
Identities = 34/169 (20%), Positives = 63/169 (37%), Gaps = 16/169 (9%)

Query: 765 ILVIEDEPNFARILFDLAHELGYSCLVAHGADEGFELAAQYIPDAILLDMRLPDHSGLTV 824
ILV +D+ +L GY + A + A D ++ D+ +PD + +
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 825 LQRLKEQASTRHIPVHIISVEDRVE---AAMHMGAVGYAVKPTSREELKEVFARLEAKLT 881
L R+K+ +PV ++S ++ A GA Y KP EL + R A+
Sbjct: 66 LPRIKKARP--DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 882 QKLKHILLVEDDDLQRESIARLIGD-----DDVEITAVAMAQDALALLR 925
++ + + L+G + + A M D ++
Sbjct: 124 RRPSKLE------DDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMIT 166



Score = 63.7 bits (155), Expect = 2e-12
Identities = 16/81 (19%), Positives = 33/81 (40%), Gaps = 2/81 (2%)

Query: 886 HILLVEDDDLQRESIARLIGDDDVEITAVAMAQDALALLRQNIYDCMIIDLKLPDMLGNE 945
IL+ +DD R + + + ++ + A + D ++ D+ +PD +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 946 LLKRMTAEDIRAFPPVIVYTG 966
LL R+ PV+V +
Sbjct: 65 LLPRIKKARPDL--PVLVMSA 83


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_3762    HTHFIS    69    6e-17    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 68.7 bits (168), Expect = 6e-17
Identities = 31/120 (25%), Positives = 52/120 (43%), Gaps = 7/120 (5%)

Query: 2 HLLVVEDDDIVRMLMVEVLDDLGYKVIEAEDATAALRVLEDPSQALALMMTDVGLPDMRG 61
+LV +DD +R ++ + L GY V +A R + + L++TDV +PD
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIA--AGDGDLVVTDVVMPDENA 62

Query: 62 EELAGKARELRPLLPVLFASGYADSFNVPEGMHL-----IGKPFSIDQLRDTVVGILGTP 116
+L + ++ RP LPVL S + + KPF + +L + L P
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122


97    PP_4110    PP_4115    N    Y    N    Pathogenicity Island (unbiased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
PP_4110    -1    8    -0.034107    hypothetical protein
PP_4111    0    9    -0.293016    elongation factor G
PP_4113    -2    10    0.245230    hypothetical protein
PP_4114    -1    16    -0.271477    hypothetical protein
PP_4115    -1    19    -0.584767    NolW domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_4110    FLGFLIH    32    0.002    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 31.7 bits (71), Expect = 0.002
Identities = 17/42 (40%), Positives = 24/42 (57%), Gaps = 1/42 (2%)

Query: 67 GQAAGRQQGQQARVQGGLGEGLEDGQVQ-RTDQVEISLGVRQ 107
G A GRQQG + Q GL +GLE G + ++ Q I ++Q
Sbjct: 59 GIAEGRQQGHKQGYQEGLAQGLEQGLAEAKSQQAPIHARMQQ 100


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_4111    TCRTETOQM    574    0.0    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 574 bits (1482), Expect = 0.0
Identities = 171/695 (24%), Positives = 299/695 (43%), Gaps = 79/695 (11%)

Query: 11 RNIGIVAHVDAGKTTTTERILFYTGVNHKMGEVHDGAATMDWMAQEQERGITITSAATTA 70
NIG++AHVDAGKTT TE +L+ +G ++G V G D E++RGITI + T+
Sbjct: 4 INIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSF 63

Query: 71 FWQGSTKQFAHKYRFNIIDTPGHVDFTIEVERSLRVLDGAVVVFSGADGVEPQSETVWRQ 130
W+ + + NIIDTPGH+DF EV RSL VLDGA+++ S DGV+ Q+ ++
Sbjct: 64 QWENT--------KVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHA 115

Query: 131 ANKYHVPRLAYINKMDRQGADFLRVVKQIDQRLGHHPVPIQLAIGSEENFMGQIDLVKMK 190
K +P + +INK+D+ G D V + I ++L V Q
Sbjct: 116 LRKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQ------------------- 156

Query: 191 AIYWNDADQGTSYREEEIPAELKALADEWRAHMIEAAAEANDELTMKFLDGEELSIEEIK 250
+ T++ E E +W + E ND+L K++ G+ L E++
Sbjct: 157 KVELYPNMCVTNFTESE----------QW-----DTVIEGNDDLLEKYMSGKSLEALELE 201

Query: 251 AGLRQRTIANEIVPTILGSSFKNKGVPLMLDAVIDYLPAPSEIPAIRGTDPDDEEKHLER 310
R + P GS+ N G+ +++ + + + +
Sbjct: 202 QEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITNKFYSSTH------------------ 243

Query: 311 HADDKEPFSALAFKIATDPFVGTLTFARVYSGVLSSGNAVLNSVKGKKERIGRMVQMHAN 370
+ FKI L + R+YSGVL ++V S K K +I M
Sbjct: 244 --RGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKI-KITEMYTSING 300

Query: 371 QRAEIKDVCAGDIAALIG----MKDVTTGDTLCDMDKPIILERMDFPDPVISVAVEPKTK 426
+ +I +G+I L + V GDT + ER++ P P++ VEP
Sbjct: 301 ELCKIDKAYSGEIVILQNEFLKLNSV-LGDTKLLPQR----ERIENPLPLLQTTVEPSKP 355

Query: 427 ADQEKMGIALGKLAQEDPSFRVRTDEETGQTIISGMGELHLDIIVDRMRREFNVEANIGK 486
+E + AL +++ DP R D T + I+S +G++ +++ ++ +++VE I +
Sbjct: 356 QQREMLLDALLEISDSDPLLRYYVDSATHEIILSFLGKVQMEVTCALLQEKYHVEIEIKE 415

Query: 487 PQVAYREKIRNTCEIEGRFVRQSGGRGQYGHCWIRFAPGDEGKEGLEFINEIVGGVVPRE 546
P V Y E+ E + + + +P G G+++ + + G + +
Sbjct: 416 PTVIYMERPLKK--AEYTIHIEVPPNPFWASIGLSVSPLPLGS-GMQYESSVSLGYLNQS 472

Query: 547 YIPAIQKGIEEQMKNGVLAGYPLINLKAAVFDGSYHDVDSNEMAYKIAASMATKQLSQKG 606
+ A+ +GI + G L G+ + + K G Y+ S +++ A + +Q+ +K
Sbjct: 473 FQNAVMEGIRYGCEQG-LYGWNVTDCKICFKYGLYYSPVSTPADFRMLAPIVLEQVLKKA 531

Query: 607 GAVLLEPVMKVEVVTPEEYQGDILGDLSRRRGMIQDGDETPAGKVIRAEVPLGEMFGYAT 666
G LLEP + ++ P+EY D + I D ++ E+P + Y +
Sbjct: 532 GTELLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKNNEVILSGEIPARCIQEYRS 591

Query: 667 SMRSMTQGRASFSMEFTRYAEAPASIADGIVKKSR 701
+ T GR+ E Y + + + R
Sbjct: 592 DLTFFTNGRSVCLTELKGYHVT---TGEPVCQPRR 623


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_4113    SUBTILISIN    78    6e-18    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 78.3 bits (193), Expect = 6e-18
Identities = 62/285 (21%), Positives = 101/285 (35%), Gaps = 58/285 (20%)

Query: 248 VRVGVIEREVDFDAPGFS-HYRGG----CEQGQTCLYARDGNRPASHGSHVAGILAAQGE 302
V+V V++ D D P GG + +D N HG+HVAG +AA
Sbjct: 43 VKVAVLDTGCDADHPDLKARIIGGRNFTDDDEGDPEIFKDYNG---HGTHVAGTIAATEN 99

Query: 303 GF-LPGLGKHSPGFEVMVERNSDAGITANIAASVN-LVEDGVRVLNWSWGIHRVGARDVN 360
+ G+ + + V +G I + +E V +++ S G +
Sbjct: 100 ENGVVGVAPEADLLIIKVLNKQGSGQYDWIIQGIYYAIEQKVDIISMSLG------GPED 153

Query: 361 GDEVDSLVRSGLAMSGYEELLEEFFLWLRKEHPDVVVVNSAGN-GASWSGTDEYRLPSSF 419
E+ V+ +A ++V+ +AGN G TDE P +
Sbjct: 154 VPELHEAVKKAVA-------------------SQILVMCAAGNEGDGDDRTDELGYPGCY 194

Query: 420 ITEQLLVVGGHQRSGKATAVEAHDHVRKRHSSNIDMRVDVSAAACI----RPVSAEAAHC 475
+++ VG A+ SN + VD+ A P A
Sbjct: 195 --NEVISVGAINFDRHAS-----------EFSNSNNEVDLVAPGEDILSTVPGGKYATFS 241

Query: 476 GTSYATPLVTATVATMLSINPA-----LTPEQVRMLLRRSALTLG 515
GTS ATP V +A + + A LT ++ L + + LG
Sbjct: 242 GTSMATPHVAGALALIKQLANASFERDLTEPELYAQLIKRTIPLG 286


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_4115    PHAGEIV    43    4e-07    Gene IV protein signature.
		>PHAGEIV#Gene IV protein signature.

Length = 426

Score = 43.4 bits (102), Expect = 4e-07
Identities = 51/258 (19%), Positives = 91/258 (35%), Gaps = 58/258 (22%)

Query: 44 ATEVVPLQHRSSAELLPALQAFI----GKDGTVSAFE--NKLIVNASPERIDDLRALLQQ 97
T+ + + + +L+ ++ F+ K V + + N L+V+A + +D+L L
Sbjct: 131 VTQTFKINNVRAKDLIRVVELFVKSNTSKSSNVLSVDGSNLLVVSAPKDILDNLPQFLST 190

Query: 98 LDTAPKRLLIS--------VDNNDSNFQDNRGNARVIHYGTSNRDGGMQQVQAS------ 143
+D ++LI D D +F V ++R +
Sbjct: 191 VDLPTDQILIEGLIFEVQQGDALDFSFAAGSQRGTVAGGVNTDRLTSVLSSAGGSFGIFN 250

Query: 144 ----------------------------EGQPALIQVGQSIPITSTSTDGYGRIQSN--- 172
GQ I VGQ++P + G +N
Sbjct: 251 GDVLGLSVRALKTNSHSKILSVPRILTLSGQKGSISVGQNVPFITGRVTGESANVNNPFQ 310

Query: 173 -TEYRNVTQGFYVTPSL-SGETVRLQISSNNDRI--SHERADVVKVQ-STDTTVTGKLGE 227
E +NV V P +G + L I+S D + S + +DV+ Q S TTV + G+
Sbjct: 311 TVERQNVGISMSVFPVAMAGGNIVLDITSKADSLSSSTQASDVITNQRSIATTVNLRDGQ 370

Query: 228 WITLAGF--NQQSQADRS 243
+ L G + + D
Sbjct: 371 TLLLGGLTDYKNTSQDSG 388


98    PP_4206    PP_4212    N    Y    N    Pathogenicity Island (unbiased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
PP_4206    1    12    2.708290    major facilitator family transporter
PP_4207    -1    14    2.593063    hypothetical protein
PP_4208    -2    13    2.564053    RNA polymerase sigma factor
PP_4209    -2    15    2.837254    RND family efflux transporter MFP subunit
PP_4210    -2    15    2.836808    ABC transporter ATP-binding protein
PP_4211    -3    15    2.480063    RND efflux transporter
PP_4212    0    16    1.467933    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_4206    TCRTETB    35    5e-04    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 34.9 bits (80), Expect = 5e-04
Identities = 62/337 (18%), Positives = 111/337 (32%), Gaps = 44/337 (13%)

Query: 67 VTGY-LARPLGGILMAHFADHLGRKRVFSLSILMMALPCLLIGVMPTYADIGYAAPLILL 125
T + L +G + +D LG KR+ I++ ++ +G++ +L+
Sbjct: 55 NTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVI-------GFVGHSFFSLLI 107

Query: 126 ALRILQGAAVGGEVPSAWTFVAEHAPNGRRGYALGFLQA----GLTFGYLLGALTA---- 177
R +QGA VA + P RG A G + + G G +G + A
Sbjct: 108 MARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIH 167

Query: 178 --------------TLLAQLFTPQEI-----LDYAWRYPFLLGGVFGVIGVWLRRW--LS 216
+E+ D +G VF ++ L
Sbjct: 168 WSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLI 227

Query: 217 ETPVFLALRARQEQPVTFPLRQVLGEHRRALIPAALLTCVLTSAVVVLVVITPTVMQQRF 276
+ + + + + VT P + L ++ V V + P +M+
Sbjct: 228 VSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVH 287

Query: 277 GMS---AGHTFALSSVGIVFLNIGCVLAGLLVDRVGAWRALMLYSLLLPLG-IGALYASL 332
+S G G + + I + G+LVDR G L + L + + A +
Sbjct: 288 QLSTAEIGSVIIF--PGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLE 345

Query: 333 VGQWGMTW-LAYALAGLSCGVVGVVPSVMVGLFPAQI 368
W MT + + L GLS + V L +
Sbjct: 346 TTSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQEA 382


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_4207    HTHFIS    31    0.002    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.6 bits (69), Expect = 0.002
Identities = 15/64 (23%), Positives = 27/64 (42%), Gaps = 3/64 (4%)

Query: 7 RILIADAHPCQRLQLERLLNGLGYYRIAPVASFEELQRLVQCALQPFHLLLGNIELASHC 66
IL+AD R L + L+ G Y + ++ L R + A L++ ++ +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAG-YDVRITSNAATLWRWI--AAGDGDLVVTDVVMPDEN 61

Query: 67 GVDL 70
DL
Sbjct: 62 AFDL 65


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_4209    RTXTOXIND    57    4e-11    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 57.1 bits (138), Expect = 4e-11
Identities = 27/183 (14%), Positives = 57/183 (31%), Gaps = 39/183 (21%)

Query: 99 QAKLDAGRFSIDNLKAQLAEQRAQLKLAQQQLKRQRDLAAVG--------------ATRE 144
+ LD R + A++ ++ + +L L
Sbjct: 206 ELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAV 265

Query: 145 EDLQTAEAQLNVTQARIDMYQAQIRQANASLRSD----------------------EAEL 182
+L+ ++QL ++ I + + + +++ E
Sbjct: 266 NELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQ 325

Query: 183 GYTRIFAPMDGTVVAVDAR-EGQTLNAQQQTPLILRIAKLSPMTVWAQVSEADIGKIQPG 241
+ I AP+ V + EG + + L++ + + + V A V DIG I G
Sbjct: 326 QASVIRAPVSVKVQQLKVHTEGGVVTTAE--TLMVIVPEDDTLEVTALVQNKDIGFINVG 383

Query: 242 MTA 244
A
Sbjct: 384 QNA 386



Score = 57.1 bits (138), Expect = 5e-11
Identities = 32/198 (16%), Positives = 68/198 (34%), Gaps = 33/198 (16%)

Query: 6 HTRRRLLLGGLGLLGLGSLLAWTSLPFGAQPVSTVAVTRADIESSVTALGTLQPR-RYVD 64
+RR L+ + L + L +E TA G L R +
Sbjct: 53 VSRRPRLVAYFIMGFLVIAFILSVL--------------GQVEIVATANGKLTHSGRSKE 98

Query: 65 VGAQASGQIRNLHVEVGDQVHKGQLLVEIDPSTQQAKLDAGRFSIDNLKAQLAEQRAQLK 124
+ + ++ + V+ G+ V KG +L+++ +A + S L+A+L + R Q+
Sbjct: 99 IKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSS--LLQARLEQTRYQIL 156

Query: 125 LAQQ----------------QLKRQRDLAAVGATREEDLQTAEAQLNVTQARIDMYQAQI 168
Q + ++ + + +E T + Q + +D +A+
Sbjct: 157 SRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAER 216

Query: 169 RQANASLRSDEAELGYTR 186
A + E +
Sbjct: 217 LTVLARINRYENLSRVEK 234


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_4212    ECOLNEIPORIN    29    0.050    E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 29.0 bits (65), Expect = 0.050
Identities = 16/69 (23%), Positives = 26/69 (37%), Gaps = 8/69 (11%)

Query: 10 AGVALAGVTVPGALYVQRQLTREEFPETPGEAVVELADTATQ-------QLGDTLRGIWR 62
A V L G T+ + R + E + D ++ LG+ L+ IW+
Sbjct: 19 ADVTLYG-TIKAGVETSRSVAHNGAQAASVETGTGIVDLGSKIGFKGQEDLGNGLKAIWQ 77

Query: 63 WSFKGAQAG 71
K + AG
Sbjct: 78 VEQKASIAG 86


99    PP_4327    PP_4340    N    Y    N    Pathogenicity Island (unbiased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
PP_4327    0    11    1.375783    cytochrome c biogenesis protein CcmA
PP_4328    -1    12    1.513768    hypothetical protein
PP_4329    -2    11    0.716262    FlhB domain-containing protein
PP_4330    -2    9    0.448892    hypothetical protein
PP_4331    -3    12    0.951901    hypothetical protein
PP_4332    -2    13    1.323447    purine-binding chemotaxis protein CheW
PP_4333    -2    11    1.922619    chemotaxis protein CheW
PP_4334    -1    11    0.811732    ParA family protein
PP_4335    1    11    0.313209    flagellar motor protein MotD
PP_4336    1    11    0.468401    flagellar motor protein
PP_4337    0    11    0.510702    chemotaxis-specific methylesterase
PP_4338    -1    10    0.517399    chemotaxis protein CheA
PP_4339    -1    11    -0.054577    chemotaxis protein CheZ
PP_4340    -2    10    -1.126707    response regulator receiver protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_4327    PF05272    28    0.027    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.1 bits (62), Expect = 0.027
Identities = 13/41 (31%), Positives = 20/41 (48%), Gaps = 1/41 (2%)

Query: 32 MLQISGPNGSGKTSLLRLLAGLMQPTAGQILLG-GKPLAEQ 71
+ + G G GK++L+ L GL + +G GK EQ
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQ 638


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_4329    TYPE3IMSPROT    64    1e-15    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 64.4 bits (157), Expect = 1e-15
Identities = 18/73 (24%), Positives = 29/73 (39%), Gaps = 3/73 (4%)

Query: 9 AIALNYDG--HQAPTLTAKGDEDLAEAILALAREHEVPIYENAELVR-LLARLELGEQIP 65
AI + Y P +T K + + + +A E VPI + L R L + IP
Sbjct: 268 AIGILYKRGETPLPLVTFKYTDAQVQTVRKIAEEEGVPILQRIPLARALYWDALVDHYIP 327

Query: 66 EALYLTIAEIIAF 78
AE++ +
Sbjct: 328 AEQIEATAEVLRW 340


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_4331    PF06580    26    0.034    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 26.4 bits (58), Expect = 0.034
Identities = 12/49 (24%), Positives = 22/49 (44%), Gaps = 10/49 (20%)

Query: 5 VAVIFLALAWALSLWFFLNYSKR---------QRELAAQQAEGDALRDQ 44
V+ + W+L L+F ++ K + AQ+A+ AL+ Q
Sbjct: 122 FNVVVVTFMWSL-LYFGWHFFKNYKQAEIDQWKMASMAQEAQLMALKAQ 169


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_4333    IGASERPTASE    33    0.001    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 33.1 bits (75), Expect = 0.001
Identities = 21/107 (19%), Positives = 40/107 (37%), Gaps = 8/107 (7%)

Query: 3 QTRQTSTRPQMALQSYLDGLLQEATEAKDLSEQPAVADEFAEAVREEQARDARQPARPEP 62
+T +T P++ Q QE +E +PA ++ ++E Q++ +P
Sbjct: 1115 ETEKTQEVPKVTSQVSPK---QEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQP 1171

Query: 63 AEAAAASFAPRPFAEPRLAVLPSVMPVEAPVVTV---VEQEVVAEAS 106
A+ ++ V VE P T + V +E+S
Sbjct: 1172 AKETSS--NVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESS 1216


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_4335    OMPADOMAIN    72    2e-16    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 71.9 bits (176), Expect = 2e-16
Identities = 35/122 (28%), Positives = 54/122 (44%), Gaps = 16/122 (13%)

Query: 134 LNSSLLFGSGDAMPSDKAFAIIEKVASILK---PFANPVHVEGFTDNLPIRTAQYPTNWE 190
L S +LF A + A ++++ S L P V V G+TD I + Y N
Sbjct: 217 LKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDR--IGSDAY--NQG 272

Query: 191 LSSARAASIVRLLAMEGVNPARMASVGYGEYQPVASNDTAEGRAR---------NRRVVL 241
LS RA S+V L +G+ ++++ G GE PV N + R +RRV +
Sbjct: 273 LSERRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEI 332

Query: 242 VI 243
+
Sbjct: 333 EV 334


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_4337    HTHFIS    57    4e-11    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 57.1 bits (138), Expect = 4e-11
Identities = 31/122 (25%), Positives = 49/122 (40%), Gaps = 6/122 (4%)

Query: 2 AVKVLVVDDSGFFRRRVSEILSADPTIQVVGTATNGKEAIDQALALKPDVITMDYEMPMM 61
+LV DD R +++ LS V +N A D++ D MP
Sbjct: 3 GATILVADDDAAIRTVLNQALS-RAGYDVR-ITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 62 DGITAVRHIMQRCP-TPVLMFSSLTHEGARVTLDALDAGAVDYLPKNFEDISRNPDKVKQ 120
+ + I + P PVL+ S+ + A + GA DYLPK F D++ + +
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAI--KASEKGAYDYLPKPF-DLTELIGIIGR 117

Query: 121 LL 122
L
Sbjct: 118 AL 119


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_4338   PF06580  43     2e-06    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 43.3 bits (102), Expect = 2e-06
Identities = 22/122 (18%), Positives = 49/122 (40%), Gaps = 22/122 (18%)

Query: 455 ETDLDKNLVEALADPLV--HLVRNAVDHGIEMPDEREASGKARTGRVVLSAEQEGDHILL 512
E ++ +++ P++ LV N + HGI + G+++L ++ + L
Sbjct: 243 ENQINPAIMDVQVPPMLVQTLVENGIKHGIA--------QLPQGGKILLKGTKDNGTVTL 294

Query: 513 SISDDGKGMDPNILRAKAVEKGLMDKDAAERLSESDCYNLIFAPGFSTKTEISDVSGRGV 572
+ + G N + GL + ERL +++ G + ++S+ G+
Sbjct: 295 EVENTGSLALKNTKESTGT--GLQNVR--ERL------QMLY--GTEAQIKLSEKQGKVN 342

Query: 573 GM 574
M
Sbjct: 343 AM 344


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
PP_4340   HTHFIS  90     2e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 90.3 bits (224), Expect = 2e-24
Identities = 31/120 (25%), Positives = 55/120 (45%), Gaps = 3/120 (2%)

Query: 2 KILIVDDFSTMRRIIKNLLRDLGFTNTDEADDGTTALPMLENGHYDFLVTDWNMPGMSGI 61
IL+ DD + +R ++ L G+ + T + G D +VTD MP +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYD-VRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 DLLRKVRASDKLKSMPVLMVTAEAKRDQIIEAAQAGVNGYVVKPFTAQVLKEKIEKIFER 121
DLL +++ + +PVL+++A+ I+A++ G Y+ KPF L I +
Sbjct: 64 DLLPRIKKAR--PDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


S.No  Start    End      Bias  Virulence  Insertion elements  Prediction
100   PP_4352  PP_4390  N     Y          N                   Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
PP_4352   4       20       0.153042   flagellar biosynthesis protein FlhB
PP_4353   5       20       0.498443   flagellar biosynthesis protein FliR
PP_4354   4       22       0.684374   flagellar biosynthesis protein FliQ
PP_4355   4       21       0.673991   flagellar biosynthesis protein FliP
PP_4356   0       13       1.695349   flagellar assembly protein FliO
PP_4357   0       11       0.757661   flagellar motor switch protein
PP_4358   0       13       0.817397   flagellar motor switch protein FliM
PP_4359   1       15       0.504924   flagellar basal body protein FliL
PP_4360   2       16       0.942464   hypothetical protein
PP_4361   1       15       1.452471   flagellar hook-length control protein
PP_4362   0       14       0.894038   Hpt protein
PP_4363   1       14       1.113060   response regulator receiver protein
PP_4364   -1      14       1.810343   anti-sigma-factor antagonist
PP_4365   -1      14       1.874474   flagellar biosynthesis chaperone
PP_4366   -1      13       2.342396   flagellum-specific ATP synthase
PP_4367   -1      13       2.639037   flagellar assembly protein H
PP_4368   -2      13       2.130456   flagellar motor switch protein G
PP_4369   -2      10       1.931099   flagellar MS-ring protein
PP_4370   0       12       1.103388   flagellar hook-basal body protein FliE
PP_4371   -2      15       -0.585966  Fis family transcriptional regulator
PP_4372   -2      19       -2.464184  PAS/PAC sensor signal transduction histidine
PP_4373   -1      25       -4.615104  Fis family transcriptional regulator
PP_4374   -1      27       -5.149002  hypothetical protein
PP_4375   -1      22       -4.182022  flagellar protein FliS
PP_4376   0       17       -2.858195  flagellar cap protein FliD
PP_4377   0       15       -1.594914  flagellin FlaG
PP_4378   -1      14       -1.051137  flagellin FliC
PP_4379   -1      15       0.263143   beta-ketoacyl-ACP synthase
PP_4380   0       14       0.401305   flagellar hook-associated protein FlgL
PP_4381   0       16       0.886680   flagellar hook-associated protein FlgK
PP_4382   1       16       0.833571   flagellar rod assembly protein FlgJ
PP_4383   3       18       -0.097715  flagellar basal body P-ring biosynthesis protein
PP_4384   2       17       -0.480821  flagellar basal body L-ring protein
PP_4385   1       17       -1.008261  flagellar basal body rod protein FlgG
PP_4386   1       16       -1.295902  flagellar basal body rod protein FlgF
PP_4387   0       15       -1.863011  hypothetical protein
PP_4388   0       14       -2.367439  flagellar hook protein FlgE
PP_4389   0       13       -1.493691  flagellar basal body rod modification protein
PP_4390   0       12       -1.313030  flagellar basal body rod protein FlgC
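
For readers who want to work with the bias table above programmatically, the following is a minimal Python sketch that splits one row into typed fields. It is not part of PredictBias. The column widths are an assumption inferred from the rows shown here: a one-digit DNBias (optionally negative), a two-digit CDNBias, and a %GCBias with exactly six decimals. The name `parse_bias_row` is hypothetical. The pattern accepts rows with or without whitespace between the columns.

```python
import re

# Assumed layout (inferred from the rows above, not documented by the tool):
# LocusTag, one-digit DNBias (may be negative), two-digit CDNBias,
# %GCBias with six decimals, then the free-text Product.
ROW = re.compile(
    r"^(PP_\d{4})\s*(-?\d)\s*(\d{2})\s*(-?\d+\.\d{6})\s*(.+)$"
)

def parse_bias_row(line):
    """Split one bias-table row into typed fields; return None if it does not match."""
    m = ROW.match(line.strip())
    if m is None:
        return None
    tag, dn, cdn, gc, product = m.groups()
    return {
        "locus_tag": tag,
        "dn_bias": int(dn),
        "cdn_bias": int(cdn),
        "gc_bias": float(gc),
        "product": product.strip(),
    }

# One row without column separators and one with them.
rows = [
    parse_bias_row("PP_43524200.153042flagellar biosynthesis protein FlhB"),
    parse_bias_row("PP_4371   -2   15   -0.585966  Fis family transcriptional regulator"),
]
```

Rows that violate the assumed widths (for example a two-digit DNBias) would be split incorrectly, so the widths should be checked against the full report before relying on this.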
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_4352   TYPE3IMSPROT  324    e-111    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 324 bits (831), Expect = e-111
Identities = 103/349 (29%), Positives = 188/349 (53%), Gaps = 3/349 (0%)

Query: 9 DKTEDPTEKRKRDAREKGEVARSKELNTVAVTLAGAGGLLAFGGHVAETLLALMRMNFSL 68
+KTE PT K+ RDAR+KG+VA+SKE+ + A+ +A + L+ + E LM
Sbjct: 4 EKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLML--IPA 61

Query: 69 TRDIIVDERAMGAFLLASGKMAIWAVQPVLILLFVVSFVAPIALSGFLFSGSLLQPKFSR 128
+ + +A+ + + P+L + +++ + + GFL SG ++P +
Sbjct: 62 EQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIKK 121

Query: 129 MNPLSGIKRMFSMQALTELLKALAKFFVILVVAVVVLSGDRQALLSIANEPLEQAIIHSL 188
+NP+ G KR+FS+++L E LK++ K ++ ++ +++ G+ LL + +E
Sbjct: 122 INPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLLG 181

Query: 189 QVVGWSALWMSAGLLLIAAADVPFQLYQTHKKMKMTKQEVRDEYKDSEGKPEVKQRIRQL 248
Q++ + + G ++I+ AD F+ YQ K++KM+K E++ EYK+ EG PE+K + RQ
Sbjct: 182 QILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQF 241

Query: 249 QREVSQRRMMAAVPDADVIITNPTHYAVALQYDPEKGGAAPLLLAKGSDFMALKIREIGV 308
+E+ R M V + V++ NPTH A+ + Y + PL+ K +D +R+I
Sbjct: 242 HQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETP-LPLVTFKYTDAQVQTVRKIAE 300

Query: 309 EHNIQILESPALARAIYYSTELEQEIPAGLYLAVAQVLAYVFQIRQYRA 357
E + IL+ LARA+Y+ ++ IPA A A+VL ++ + +
Sbjct: 301 EEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQNIEKQ 349
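
The "Score" and "Identities" lines of each hit can be read mechanically. The Python sketch below is illustrative only and not part of PredictBias; the names `parse_expect` and `parse_hit` are hypothetical. It assumes the legacy BLAST text layout shown in this report, where an Expect value such as `e-111` is written without a leading `1`.

```python
import re

STATS = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
IDENT = re.compile(r"Identities\s*=\s*(\d+)/(\d+)")

def parse_expect(text):
    """Convert an Expect string to float; 'e-111' is treated as 1e-111 (assumption)."""
    return float("1" + text if text.startswith("e") else text)

def parse_hit(score_line, identities_line):
    """Extract bit score, raw score, E-value and identity fraction for one hit."""
    s = STATS.search(score_line)
    i = IDENT.search(identities_line)
    matched, length = int(i.group(1)), int(i.group(2))
    return {
        "bits": float(s.group(1)),
        "raw": int(s.group(2)),
        "expect": parse_expect(s.group(3)),
        "identity": matched / length,  # fraction of identical positions
    }

# Values copied from the PP_4352 hit above.
hit = parse_hit(
    "Score = 324 bits (831), Expect = e-111",
    "Identities = 103/349 (29%), Positives = 188/349 (53%), Gaps = 3/349 (0%)",
)
```

The identity fraction is computed from the raw counts, so it may differ slightly from the whole-number percentage printed in the report.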


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_4353   TYPE3IMRPROT  136    3e-41    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.

Length = 261

Score = 136 bits (344), Expect = 3e-41
Identities = 94/255 (36%), Positives = 152/255 (59%), Gaps = 2/255 (0%)

Query: 1 MLELTDTQIGTWVATFILPLFRVTAVLMTMPIFGTRMLPARIRLYVAVAVTVVIVPALPP 60
ML++T Q +W+ + PL RV A++ T PI R +P R++L +A+ +T I P+LP
Sbjct: 1 MLQVTSEQWLSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA 60

Query: 61 LPEFDPLSLRGMLLCAEQIIVGALFGLALQLLFQAFVVAGQIVAVQMGMAFASMVDPANG 120
S + L +QI++G G +Q F A AG+I+ +QMG++FA+ VDPA+
Sbjct: 61 NDVPV-FSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASH 119

Query: 121 VNVTVISQFMTMLVSVLFLLMNGHLVVFEVLTESFTTLPVGSALVVNHFWELAGRMGW-V 179
+N+ V+++ M ML +LFL NGHL + +L ++F TLP+G + ++ + + G +
Sbjct: 120 LNMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLI 179

Query: 180 FGAGLLLILPVIAALLVVNIAFGVMTRAAPQLNIFSIGFPLTLVMGMFIFWVGLADVLSH 239
F GL+L LP+I LL +N+A G++ R APQL+IF IGFPLTL +G+ + + +
Sbjct: 180 FLNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPF 239

Query: 240 YQALASETLQWLREL 254
+ L SE L ++
Sbjct: 240 CEHLFSEIFNLLADI 254


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_4354   TYPE3IMQPROT  53     3e-13    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 53.2 bits (128), Expect = 3e-13
Identities = 21/74 (28%), Positives = 38/74 (51%)

Query: 7 VDLFRDALWLTTLMVAVLVVPSLLVGLVVAMFQAATQINEQTLSFLPRLLVMLITLIIAG 66
V AL+L ++ + + ++GL+V +FQ TQ+ EQTL F +LL + + L +
Sbjct: 5 VFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFLLS 64

Query: 67 PWLVQKFMEYITTL 80
W + + Y +
Sbjct: 65 GWYGEVLLSYGRQV 78


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_4355   FLGBIOSNFLIP  269    2e-93    Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature.

Length = 245

Score = 269 bits (688), Expect = 2e-93
Identities = 137/244 (56%), Positives = 186/244 (76%), Gaps = 1/244 (0%)

Query: 5 LRTLLTLALLLAAPLALAADPLSIPAITLSNTPDGQQEYSVSLQILLIMTALSFIPAFVI 64
+R LL++A +L + A +P IT P G Q +S+ +Q L+ +T+L+FIPA ++
Sbjct: 1 MRRLLSVAPVLLWLITPLAFA-QLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILL 59

Query: 65 LMTSFTRIIIVFSILRQALGLQQTPSNQLLTGMALFLTMFIMAPVFDRVNQDALQPYLKE 124
+MTSFTRIIIVF +LR ALG P NQ+L G+ALFLT FIM+PV D++ DA QP+ +E
Sbjct: 60 MMTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEE 119

Query: 125 QMTAQQAIDKAQGPLKDFMLAQTRQSDLDLFMRLSKRTDIAGPDQVPLTILVPAFVTSEL 184
+++ Q+A++K PL++FML QTR++DL LF RL+ + GP+ VP+ IL+PA+VTSEL
Sbjct: 120 KISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSEL 179

Query: 185 KTAFQIGFMIFIPFLIIDMVVASVLMAMGMMMLSPLIISLPFKIMLFVLVDGWALIMGTL 244
KTAFQIGF IFIPFLIID+V+ASVLMA+GMMM+ P I+LPFK+MLFVLVDGW L++G+L
Sbjct: 180 KTAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSL 239

Query: 245 ASSF 248
A SF
Sbjct: 240 AQSF 243


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_4357   FLGMOTORFLIN  120    1e-37    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 120 bits (301), Expect = 1e-37
Identities = 69/159 (43%), Positives = 96/159 (60%), Gaps = 29/159 (18%)

Query: 7 MANENEITSPEDQALADEWAAALEE-----TGSAGQADIDALLGGDTGSSSGPGRLPMEE 61
M++ N + AL D WA AL E T SA A L GGD SG +
Sbjct: 1 MSDMNNPSDENTGALDDLWADALNEQKATTTKSAADAVFQQLGGGDV---SGAMQ----- 52

Query: 62 FASSPKPNENVSLEGPNLDVILDIPVNISMEVGSTEINIRNLLQLNQGSVIELDRLAGEP 121
++D+I+DIPV +++E+G T + I+ LL+L QGSV+ LD LAGEP
Sbjct: 53 ----------------DIDLIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEP 96

Query: 122 LDVLVNGTLIAHGEVVVVNEKFGIRLTDVISPSERIKKL 160
LD+L+NG LIA GEVVVV +K+G+R+TD+I+PSER+++L
Sbjct: 97 LDILINGYLIAQGEVVVVADKYGVRITDIITPSERMRRL 135


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_4358   FLGMOTORFLIM  257    2e-86    Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 257 bits (657), Expect = 2e-86
Identities = 95/324 (29%), Positives = 164/324 (50%), Gaps = 9/324 (2%)

Query: 5 DLLSQDEIDALLHGVDDGLVQTESASEPGSIKS---YDLTSQDRIVRGRMPTLEMINERF 61
++LSQDEID LL + G E A + YD D+ + +M TL +++E F
Sbjct: 3 EVLSQDEIDQLLTAISSGDASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLMHETF 62

Query: 62 ARYTRISMFNLLRRSADVAVGGVQVMKFGEYVHSLYVPTSLNLVKIKPLRGTSLFILDAK 121
AR T S+ LR V V V + + E++ S+ P++L ++ + PL+G ++ +D
Sbjct: 63 ARLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVLEVDPS 122

Query: 122 LVFKLVDNFFGGDGRHAKIEGREFTPTELRVVRMVLDQCFVDLKEAWQAIMPVSFEYMNS 181
+ F ++D FGG G+ AK++ R+ T E V+ V+ + +++E+W ++ +
Sbjct: 123 ITFSIIDRLFGGTGQAAKVQ-RDLTDIENSVMEGVIVRILANVRESWTQVIDLRPRLGQI 181

Query: 182 EVNPAMANIVGPSEAVVVSTFHIELDGGGGDLHVTMPYSMIEPVREMLDAGF--QSDLDD 239
E NP A IV PSE VV+ T ++ G ++ +PY IEP+ L + F S
Sbjct: 182 ETNPQFAQIVPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFSSVRRS 241

Query: 240 QDERWVKALREDVLDVAVPMTATVARRQLKLRDILHMQPGDVIPVE---LPEHLVLRANG 296
+++ LR+ + V + + A V +L +RDIL ++ GD+I + + + VL
Sbjct: 242 STTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFVLSIGN 301

Query: 297 VPAFKARLGSHKGNLALQIIDPIE 320
F + G +A QI++ IE
Sbjct: 302 RKKFLCQPGVVGKKIAAQILERIE 325


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
PP_4361   FLGHOOKFLIK  50     6e-09    Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 50.2 bits (119), Expect = 6e-09
Identities = 46/164 (28%), Positives = 75/164 (45%), Gaps = 5/164 (3%)

Query: 259 TAKTANAVPATANPLHQPLPMNQNAWAEGLVNRVMYLSSQNLKSADIQLEPAELGRLDIR 318
T +P A P+ P+ + W + L + + Q +SA+++L P +LG + I
Sbjct: 216 TPHQTQPLPTVAAPVLSA-PLGSHEWQQSLSQHISLFTRQGQQSAELRLHPQDLGEVQIS 274

Query: 319 VNVAADQATQVTFISGHAGVRDALDSQVHRLRELFAQQGLAQPDVNVADQSRGQQQQQGQ 378
+ V +QA Q+ +S H VR AL++ + LR A+ G+ N++ +S QQQ
Sbjct: 275 LKVDDNQA-QIQMVSPHQHVRAALEAALPVLRTQLAESGIQLGQSNISGESFSGQQQAAS 333

Query: 379 AQGSNLSGVAARRAEQGGAEAVDSARPLE-QQVVVGDSAVDYYA 421
Q S A G + P+ Q V G+S VD +A
Sbjct: 334 QQQQ--SQRTANHEPLAGEDDDTLPVPVSLQGRVTGNSGVDIFA 375


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
PP_4363   HTHFIS  77     5e-17    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 77.2 bits (190), Expect = 5e-17
Identities = 31/137 (22%), Positives = 60/137 (43%), Gaps = 3/137 (2%)

Query: 5 QALTVLVAEDSAVDRLLLAQIVRRQGHQVFTAENGEQAVALYLERRPQLVLLDALMPVMD 64
T+LVA+D A R +L Q + R G+ V N LV+ D +MP +
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 65 GFEAARQIKALAGEALVPIIFLTSLNEEEGLVRCLEAGGDDFMAKPYSA-VILAAKIRAM 123
F+ +IK + +P++ +++ N ++ E G D++ KP+ ++ RA+
Sbjct: 62 AFDLLPRIKKARPD--LPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119

Query: 124 DRLRRLQATVLEQRDQI 140
+R + + +
Sbjct: 120 AEPKRRPSKLEDDSQDG 136


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_4365   FLGFLIJ  53     1e-11    Flagellar FliJ protein signature.
		>FLGFLIJ#Flagellar FliJ protein signature.

Length = 147

Score = 52.9 bits (126), Expect = 1e-11
Identities = 40/140 (28%), Positives = 75/140 (53%)

Query: 10 LAPVVDMAEEAERKAAQRLGHFQQQLATAQAKLAELERFREDYQLQWINRGGQGVNGSWL 69
LA + D+AE+ AA+ LG ++ A+ +L L ++ +Y+ + G+ +
Sbjct: 7 LATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAGITSNRW 66

Query: 70 VNYQRFLGQLETAMTQQRQSLVWHQNNLNNARGTWQQAYARVEGLRKLVQRYQDEARRAE 129
+NYQ+F+ LE A+TQ RQ L ++ A +W++ R++ + L +R A AE
Sbjct: 67 INYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQSTAALLAE 126

Query: 130 DKREQRLLDELSQRLPRQNP 149
++ +Q+ +DE +QR + P
Sbjct: 127 NRLDQKKMDEFAQRAAMRKP 146


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_4367   FLGFLIH  60     6e-13    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 59.8 bits (144), Expect = 6e-13
Identities = 50/209 (23%), Positives = 95/209 (45%), Gaps = 11/209 (5%)

Query: 22 VWTLPSFDPEPEP-EPEPEPEPEVIEEVEEVPLEEVQPLTLEELEAIRQEAYNEGFATGE 80
WT P P EPE +IEE E +++ L ++ E Q EG G
Sbjct: 9 TWTPDDLAPPQAEFVPIVEPEETIIEEAEPSLEQQLAQLQMQAHEQGYQAGIAEGRQQGH 68

Query: 81 REGFHSTQLKVRQEAEEALKTKLD----SLERLMANLMEPIAEQDTQIEKSLVHLIAHMS 136
++G+ + ++ K++ +++L++ + D+ I L+ + +
Sbjct: 69 KQGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRLMQMALEAA 128

Query: 137 RQVIGRELRNDSSQITQVLREALKLLPMGADNIRIHLNPQDF----ELAKALRERHEENW 192
RQVIG+ D+S + + +++ L+ P+ + ++ ++P D ++ A H W
Sbjct: 129 RQVIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGATLSLH--GW 186

Query: 193 RLLEDSTLLPGGCRIETAHSRIDATMETR 221
RL D TL PGGC++ +DA++ TR
Sbjct: 187 RLRGDPTLHPGGCKVSADEGDLDASVATR 215


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_4368   FLGMOTORFLIG  303    e-104    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 303 bits (777), Expect = e-104
Identities = 104/330 (31%), Positives = 205/330 (62%)

Query: 10 KLSRVDKAAILLLSLGETDAAQVLRHMGPKEVQRVGVAMAQMGNVHRDQVEQVMSEFVDI 69
L+ KAAILL+S+G +++V +++ +E++ + +A++ + + + V+ EF ++
Sbjct: 14 ALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFKEL 73

Query: 70 VGDQTSLGVGSDAYIRKMLNQALGEDKANGLVDRILLGGNTSGLDSLKWMEPRAVADVIR 129
+ Q + G Y R++L ++LG KA +++ + + + ++ +P + + I+
Sbjct: 74 MMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFVRRADPANILNFIQ 133

Query: 130 YEHPQIQAIVVAYLDPDQAGEVLSNFDHKVRLDIVLRVSSLNTVQPAALKELNQILEKQF 189
EHPQ A++++YLDP +A +LS+ +V+ ++ R++ ++ P ++E+ ++LEK+
Sbjct: 134 QEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVLEKKL 193

Query: 190 SGNSNAARTTLGGIKRAADIMNFLDSSVEGALMDAIREIDSDLSEQIEDLMFVFNNLADV 249
+ S+ T+ GG+ +I+N D E +++++ E D +L+E+I+ MFVF ++ +
Sbjct: 194 ASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFEDIVLL 253

Query: 250 DDRGIQALLREVSSDVLVVSLKGADERVKDKIFKNMSKRASELLRDDLEAKGPVRVSDVE 309
DDR IQ +LRE+ L +LK D V++KIFKNMSKRA+ +L++D+E GP R DVE
Sbjct: 254 DDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTRRKDVE 313

Query: 310 TAQKEILTIARRMAEAGEIVLGGKGAEEMI 339
+Q++I+++ R++ E GEIV+ G E+++
Sbjct: 314 ESQQKIVSLIRKLEEQGEIVISRGGEEDVL 343


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_4369   FLGMRINGFLIF  533    0.0      Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 533 bits (1374), Expect = 0.0
Identities = 206/572 (36%), Positives = 307/572 (53%), Gaps = 35/572 (6%)

Query: 28 LENISQMPMLRQIGLLVGLAASVAIGFAVVLWSQQPDYRPLYGSLSGMDTKQVMDTLAAA 87
LE ++++ +I L+V +A+VAI A+VLW++ PDYR L+ +LS D ++ L
Sbjct: 13 LEWLNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQM 72

Query: 88 DIPYNVEPNSGALLVKADDLSRARLKLAAAGVAPSDGNVGFELLDKEQGLGTSQFMEATR 147
+IPY SGA+ V AD + RL+LA G+ P G VGFELLD+E+ G SQF E
Sbjct: 73 NIPYRFANGSGAIEVPADKVHELRLRLAQQGL-PKGGAVGFELLDQEK-FGISQFSEQVN 130

Query: 148 YRRSLEGELARTVSSLNNVKAARVHLAIPKSSVFVRDERKPSASVLVELYPGRALEAGQV 207
Y+R+LEGELART+ +L VK+ARVHLA+PK S+FVR+++ PSASV V L PGRAL+ GQ+
Sbjct: 131 YQRALEGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALDEGQI 190

Query: 208 MAIVNLVATSVPELDKSQVTVVDQKGNLLSEQIQDSALTQAGKQFDYSRRVESMLTQRVH 267
A+V+LV+++V L VT+VDQ G+LL++ S Q ++ VES + +R+
Sbjct: 191 SAVVHLVSSAVAGLPPGNVTLVDQSGHLLTQS-NTSGRDLNDAQLKFANDVESRIQRRIE 249

Query: 268 NILQPVLGNDRYKAEVSADLDFSAVESTSEQFNPDQPA----LRSEQSVDEQRASSQGPQ 323
IL P++GN A+V+A LDF+ E T E ++P+ A LRS Q ++ + P
Sbjct: 250 AILSPIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVGAGYPG 309

Query: 324 GVPGALSNQPPGAASAPQTTGGAATPAAAIQPGQPLVDANGQQIMDPATGQPMLAPYPSD 383
GVPGALSNQP AP T P N Q +T + P
Sbjct: 310 GVPGALSNQPAPPNEAPIAT-------------PPTNQQNAQNTPQTSTSTNSNSAGPRS 356

Query: 384 KRQQSTKNFELDRSISHTRQQQGRMARLSVAVVVDDQVKIDPATGDTTRAPWGAEDLARF 443
++ T N+E+DR+I HT+ G + RLSVAVVV+ + D P A+ + +
Sbjct: 357 TQRNETSNYEVDRTIRHTKMNVGDIERLSVAVVVNYKTL-----ADGKPLPLTADQMKQI 411

Query: 444 TRLVQDAVGFDASRGDSVTVINVPFAADRGEEIADIAFYQQPWFWDIVKQVLGVVFILVL 503
L ++A+GF RGD++ V+N PF+A ++ F+QQ F D + + +LV+
Sbjct: 412 EDLTREAMGFSDKRGDTLNVVNSPFSA-VDNTGGELPFWQQQSFIDQLLAAGRWLLVLVV 470

Query: 504 VF----GVLRPVLNNITGGGKQAAPDSDMELGGMMGLDGELANDRVSLGGPTSILLPSPS 559
+ +RP L K A + + ++ L+ D + L
Sbjct: 471 AWILWRKAVRPQLTRRVEEAKAAQEQAQVRQETEEAVEVRLSKDEQLQQRRANQRL---- 526

Query: 560 EGYEAQLNAIKGLVAEDPGRVAQVVKDWINAD 591
G E I+ + DP VA V++ W++ D
Sbjct: 527 -GAEVMSQRIREMSDNDPRVVALVIRQWMSND 557


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
PP_4370   FLGHOOKFLIE  79     1e-22    Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature.

Length = 103

Score = 78.5 bits (193), Expect = 1e-22
Identities = 43/94 (45%), Positives = 56/94 (59%), Gaps = 3/94 (3%)

Query: 17 MQADAMSLPKVTAAPELAPGQSTFADMLGQAIGKVHETQQASTQLANAFEIGKSGVDLTD 76
+QA AMS + P+ +FA L A+ ++ +TQ A+ A F +G+ GV L D
Sbjct: 13 LQATAMSARAQESLPQ---PTISFAGQLHAALDRISDTQTAARTQAEKFTLGEPGVALND 69

Query: 77 VMIASQKASVSMQAMTQVRNKLVQAYQDIMQMPV 110
VM QKASVSMQ QVRNKLV AYQ++M M V
Sbjct: 70 VMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 103


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
PP_4371   HTHFIS  478    e-169    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 478 bits (1231), Expect = e-169
Identities = 174/472 (36%), Positives = 255/472 (54%), Gaps = 36/472 (7%)

Query: 2 AIKVLLVEDDRVLRQALGDTLEIGGFAYQAVGSAEEALEAVLEDAFSLVVSDVNMPGMDG 61
+L+ +DD +R L L G+ + +A + LVV+DV MP +
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 62 HQLLSQLRRQQPQLPVLLMTAHAAVERAVEAMRQGAADYLVKPFEP--------KALLNL 113
LL ++++ +P LPVL+M+A A++A +GA DYL KPF+ +AL
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 114 VERHAAGRVTGEEGP--VACEPASRQLLELAARVARSDSTVLISGESGTGKEVLARYIHQ 171
R + ++G V A +++ + AR+ ++D T++I+GESGTGKE++AR +H
Sbjct: 123 KRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHD 182

Query: 172 QSPRAAQPFVAINCAAIPDNMLEATLFGHEKGAFTGAIAAQAGKFEQAEGGTLLLDEISE 231
R PFVAIN AAIP +++E+ LFGHEKGAFTGA G+FEQAEGGTL LDEI +
Sbjct: 183 YGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGD 242

Query: 232 MPMALQAKLLRVLQEREVERVGGRKPISLDIRVLATTNRDLAGEVAAGRFREDLYYRLSV 291
MPM Q +LLRVLQ+ E VGGR PI D+R++A TN+DL + G FREDLYYRL+V
Sbjct: 243 MPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNV 302

Query: 292 FPLAWRPLRERPGDILQLAERLLARHVAKMKHASVRLSPAARACLQAYAWPGNVRELDNA 351
PL PLR+R DI L + + K R A ++A+ WPGNVREL+N
Sbjct: 303 VPLRLPPLRDRAEDIPDLVRHFVQQ-AEKEGLDVKRFDQEALELMKAHPWPGNVRELENL 361

Query: 352 LQRALILQQGGVIEAADFCL-----------------AGAIPLSAGTEPSL--------E 386
++R L VI +G++ +S E ++ +
Sbjct: 362 VRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGD 421

Query: 387 VVADAGGLGDDMRRHEYQMIIDTLRAERGRRKEAAERLGISPRTLRYKLAQM 438
+ +G + EY +I+ L A RG + +AA+ LG++ TLR K+ ++
Sbjct: 422 ALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIREL 473


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
PP_4373   HTHFIS  506    e-179    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 506 bits (1304), Expect = e-179
Identities = 177/488 (36%), Positives = 255/488 (52%), Gaps = 10/488 (2%)

Query: 5 TKILLIDDDSARRRDLAVVLNFLGEENLACASHDWQQAVEPLSSSREVLCVLIGTVNAPG 64
IL+ DDD+A R L L+ G + ++ +++ + V+ V
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNA--ATLWRWIAAG-DGDLVVTDVVMPDE 60

Query: 65 NLLGLLKTVATWDEFLPVLLLGEISSAELP-EDLRRRVLSNLEMPPSYSQLLDSLHRAQV 123
N LL + LPVL++ ++ + + L P ++L+ + RA
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 124 YREMYDQARERGRQREPNLFRSLVGTSRAIQHVRQMMQQVADTDASVLILGESGTGKEVV 183
+ R + + LVG S A+Q + +++ ++ TD +++I GESGTGKE+V
Sbjct: 121 EP----KRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELV 176

Query: 184 ARNLHYHSKRREAPFVPVNCGAIPAELLESELFGHEKGAFTGAITSRAGRFELANGGTLF 243
AR LH + KRR PFV +N AIP +L+ESELFGHEKGAFTGA T GRFE A GGTLF
Sbjct: 177 ARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLF 236

Query: 244 LDEIGDMPLPMQVKLLRVLQERTFERVGSNKTQSIDVRIIAATHKNLETMIEDGTFREDL 303
LDEIGDMP+ Q +LLRVLQ+ + VG DVRI+AAT+K+L+ I G FREDL
Sbjct: 237 LDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDL 296

Query: 304 YYRLNVFPIEMAPLRERVEDIPLLMNELISRMEHEKRGSIRFNSASIMSLCRHGWPGNVR 363
YYRLNV P+ + PLR+R EDIP L+ + + E E RF+ ++ + H WPGNVR
Sbjct: 297 YYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGNVR 356

Query: 364 ELANLVERMAIMHPYGVIGVSELPKKFRY-VDDEDEQMVDSLRSDLEERVAINGHTPN-F 421
EL NLV R+ ++P VI + + R + D + + L A+ + F
Sbjct: 357 ELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYF 416

Query: 422 SNHAMLPPEGLDLKDYLGSLEQGLIQQALDDANGIVARAAERLRIRRTTLVEKMRKYGMS 481
++ P L +E LI AL G +AA+ L + R TL +K+R+ G+S
Sbjct: 417 ASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476

Query: 482 RQGGEDQA 489
A
Sbjct: 477 VYRSSRSA 484


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_4377   FIMBRIALPAPE  27     0.010    Escherichia coli: P pili tip fibrillum papE protein...
		>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein signature.

Length = 173

Score = 27.3 bits (60), Expect = 0.010
Identities = 24/80 (30%), Positives = 35/80 (43%), Gaps = 11/80 (13%)

Query: 35 VPTKEVQRPELEKAVSDIQEYVKASQRQLDFSVDD----STGTLVVKVIATGSGEVIRQL 90
+P VQ E+ +IQ V++ Q DF+VD S GT+ V + + G
Sbjct: 36 IPACTVQNAEVNWGDIEIQNLVQSGGNQKDFTVDMNCPYSLGTMKVTITSNGQ------- 88

Query: 91 PSEAALSLARSLAEGDGFLL 110
+ L S A GDG L+
Sbjct: 89 TGNSILVPNTSTASGDGLLI 108


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
PP_4378   FLAGELLIN  139    2e-37    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 139 bits (352), Expect = 2e-37
Identities = 110/421 (26%), Positives = 166/421 (39%), Gaps = 11/421 (2%)

Query: 2 ALTVNTNITSMSVQKNLNKSSDALGTTMGRLSSGLKINSAKDDAAGLQISNRLTTQIKGL 61
A +NTN S+ Q NLNKS +L + + RLSSGL+INSAKDDAAG I+NR T+ IKGL
Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60

Query: 62 SVAVKNANDGISIAQTAEGAMATSGNIMQRMRELALQSANGSNSDDDRASMQQEFTALSG 121
+ A +NANDGISIAQT EGA+ N +QR+REL++Q+ NG+NSD D S+Q E
Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120

Query: 122 ELTRIANTTTFGGRNLLDGTFSGSSFQVGANSNESISFGMKDVSATSMKGNYNEASVAGG 181
E+ R++N T F G +L QVGAN E+I+ ++ + S+ + + G
Sbjct: 121 EIDRVSNQTQFNGVKVLSQDNQM-KIQVGANDGETITIDLQKIDVKSLGLDGFNVN---G 176

Query: 182 VATLQASVTGAAGKFGTNNAGSTSASVVGTAGAGVFDKPTIGAAAGNLVLNVGTTTTTIA 241
++ K T + + V A
Sbjct: 177 PKEATVGDLKSSFKNVTGYDTYAVGANKYRVDVN-----SGAVVTDTTAPTVPDKVYVNA 231

Query: 242 AAAGDTLQDVVDNINLETSNTGVTASIDSATGALKLDGTQAFTIDASTDDVLSTALGLAE 301
A T D +N ++ T + + + A+ D ++ +
Sbjct: 232 ANGQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKT 291

Query: 302 AGGAQLSKTGTANLRDGVLGAGGAGNLTLGSTNIALVATDTLSSVVGKVNAQTGTTGVTA 361
+ T N L + A + + VN Q T
Sbjct: 292 GNDGNGKVSTTINGEKVTLTVADITAGA--ANVDAATLQSSKNVYTSVVNGQFTFDDKTK 349

Query: 362 SIDSATGQLKLNSAAGFDVGGTAGTLTGLGLTAGSVAIAPQTTGLASAASIDINGTTFNF 421
+ + L+ N+A + T AG T + ++
Sbjct: 350 NESAKLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINED 409

Query: 422 A 422
A
Sbjct: 410 A 410



Score = 94.7 bits (235), Expect = 2e-22
Identities = 70/334 (20%), Positives = 113/334 (33%), Gaps = 1/334 (0%)

Query: 353 QTGTTGVTASIDSATGQLKLNSAAGFDVGGTAGTLTGLGLTAGSVAIAPQTTGLASAASI 412
G T ++ + G + A+
Sbjct: 174 VNGPKEATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAAN 233

Query: 413 DINGTTFNFAQGDDLDAIVDNINNNGAGAVGSGTALTGVTAKNDNGRLVLTSANGQDIKL 472
T A A A+ G + +T
Sbjct: 234 GQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGN 293

Query: 473 DNGSGVTGQGALAAVGLNSGTTKAGLVADTSISLNGVEVKFKKGDDMDSIAASINAASTG 532
D V+ V L AG + +L + + + +
Sbjct: 294 DGNGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESA 353

Query: 533 VNASVVVNAGSSTLSLFADQDITVADGSNGTGLAALGLTAVAGKTSAVEMESTVSNLNIT 592
+ + N S + G + G T K +A + + ++
Sbjct: 354 KLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDK-TASGVSTLINEDAAA 412

Query: 593 DAQSAQQAIQVLDGAMQSLDSQRSQLGAVQNRFDSTVANLQSISENSTAARSRIQDADFA 652
+S + +D A+ +D+ RS LGA+QNRFDS + NL + N +ARSRI+DAD+A
Sbjct: 413 AKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYA 472

Query: 653 AETAELSKQQTLQQASTAILSQANQLPSSVLKLL 686
E + +SK Q LQQA T++L+QANQ+P +VL LL
Sbjct: 473 TEVSNMSKAQILQQAGTSVLAQANQVPQNVLSLL 506


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
PP_4380   FLAGELLIN  75     2e-16    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 74.7 bits (183), Expect = 2e-16
Identities = 95/505 (18%), Positives = 162/505 (32%), Gaps = 30/505 (5%)

Query: 18 QNYSNLAKITEQVTSKSRIQSAGDDPVGAARLLVLQQQSALLEQYSGNMTTVKNALLQEE 77
++ S+L+ E+++S RI SA DD G A L Q S N + E
Sbjct: 19 KSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQASRNANDGISIAQTTE 78

Query: 78 SVLSTINDALQRASELAIQAGNGGLSDADRTSIASEIKEIEANVFGLLNSRDANGDYMFG 137
L+ IN+ LQR EL++QA NG SD+D SI EI++ + + N NG +
Sbjct: 79 GALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDRVSNQTQFNGVKVLS 138

Query: 138 GSKSSTPPYVRNADGTYSYQGDQTQLSLQVSDTLRLATNDTGYSIFDQATNNSRTQSSLV 197
N T + + +L L + + + ++
Sbjct: 139 QDNQMKIQVGANDGETITIDLQKID-----VKSLGLDGFNVNGPKEATVGDLKSSFKNVT 193

Query: 198 APSPDDGKVLLSGGLLTSATTYGNQFAAGQPYSVTFTSATEYSVRDAAGNDITNETAGKG 257
+ S + A P V +A D A N+ +
Sbjct: 194 GYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAENNTAVDLFKTT 253

Query: 258 KFDVNTEGGKMISLR------GVDFEINLNLAEGDDADAVVAGHVFTLEAKPDTLTATRS 311
K T K I+ G F+ D + + +T T +
Sbjct: 254 KSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTTINGEKVTLTVA 313

Query: 312 AGNPSTAQVTGTSVTDPEAYRSTFPSNGAVIRFTSDTEYSLYAQPYSADSKPIGSGTVGA 371
A V ++ + ++ NG S A++ G +
Sbjct: 314 DITAGAANVDAATLQSSKNVYTSV-VNGQFTFDDKTKNESAKLSDLEANNAVKGESKITV 372

Query: 372 GNAVTVAGVTYQFDSAPKQGDQFSVNANTHRTQNVLDTLGQLRSALEKPLDTPEAQAALK 431
A A + + A+ T D AA K
Sbjct: 373 NGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDA------------------AAAK 414

Query: 432 TATDSAISNLASARDRIDITRGSIGARGNSLDIQAQENESLGLANQSTQSAIGDTDMASA 491
+T + ++++ SA ++D R S+GA N D + S +S I D D A+
Sbjct: 415 KSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATE 474

Query: 492 TIQLTLQQAMLEASQLAFARISQLS 516
++ Q + +A A+ +Q+
Sbjct: 475 VSNMSKAQILQQAGTSVLAQANQVP 499


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
PP_4381   FLGHOOKAP1  218    4e-65    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 218 bits (557), Expect = 4e-65
Identities = 135/465 (29%), Positives = 237/465 (50%), Gaps = 14/465 (3%)

Query: 2 SSLISIGLSGLSASQAALSVTSNNIANAATSGYSRQQTVQAAGPSHNIGAGFLGTGTTLA 61
SSLI+ +SGL+A+QAAL+ SNNI++ +GY+RQ T+ A S G++G G ++
Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS 60

Query: 62 DVRRIYSSYLDNQLQTATSLQADSVTFQDQITSVDKLLADRDTGISSVLTAFFSALQTAA 121
V+R Y +++ NQL+ A + + +Q++ +D +L+ + +++ + FF++LQT
Sbjct: 61 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV 120

Query: 122 AKPGDVASRQLLLTQAQTLSNRFNAVSTQLTQQNATINSQLDTMAGQVNKLTANIAEYNK 181
+ D A+RQ L+ +++ L N+F L Q+ +N + Q+N IA N
Sbjct: 121 SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND 180

Query: 182 QI--TAASASGNTPNSLLDARSEAVRQLNELVGVTVQER-DGNYDVYLGSGQSLVTGTKA 238
QI +G +PN+LLD R + V +LN++VGV V + G Y++ + +G SLV G+ A
Sbjct: 181 QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA 240

Query: 239 NTLSVEPGVADKSQVALRINYDSFSSDVT--SVVSGGAIGGLLRYRQDVLTPSMNELGRV 296
L+ P AD S+ + + + +++ G++GG+L +R L + N LG++
Sbjct: 241 RQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQL 300

Query: 297 ALVVADSINSQLGQGLDANGQFGSSLFSSINSATAIAQRSLASSNNSAGSGNLDVTIANS 356
AL A++ N+Q G DANG G F+ I + ++ + + G + T+ ++
Sbjct: 301 ALAFAEAFNTQHKAGFDANGDAGEDFFA-------IGKPAVLQNTKNKGDVAIGATVTDA 353

Query: 357 GALTTYDYEVKFTSASQYSVRRSDGTDMGSFDLNADPAPVIDGFSLALKGGGLAAGDSFK 416
A+ DY++ F +Q+ V R + +A+ DG L G A DSF
Sbjct: 354 SAVLATDYKISF-DNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTF-TGTPAVNDSFT 411

Query: 417 VIPTRAAAGSITTTLTDANKLAFAGPISAAAGSGNSGTGTITQPT 461
+ P A ++ +TD K+A A A +G + +
Sbjct: 412 LKPVSDAIVNMDVLITDEAKIAMASEEDAGDSDNRNGQALLDLQS 456



Score = 78.1 bits (192), Expect = 4e-17
Identities = 55/151 (36%), Positives = 74/151 (49%), Gaps = 26/151 (17%)

Query: 551 ETTIGGSPAANDSFTL-----------------------SFNADGKADNRNANALLDLQT 587
E T G+PA NDSFTL S G +DNRN ALLDLQ+
Sbjct: 397 ELTFTGTPAVNDSFTLKPVSDAIVNMDVLITDEAKIAMASEEDAGDSDNRNGQALLDLQS 456

Query: 588 KSTVGTNSGTGTSFTSAYASLVERVGAKASQATIDTTATQAVLKSATESRSAVSGVNLDD 647
S G SF AYASLV +G K + + V+ + + ++SGVNLD+
Sbjct: 457 NSKT---VGGAKSFNDAYASLVSDIGNKTATLKTSSATQGNVVTQLSNQQQSISGVNLDE 513

Query: 648 EAASLVKFQHYYTASSQIIKAAQETFSTLIN 678
E +L +FQ YY A++Q+++ A F LIN
Sbjct: 514 EYGNLQRFQQYYLANAQVLQTANAIFDALIN 544


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_4382   FLGFLGJ  146    4e-43    Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 146 bits (369), Expect = 4e-43
Identities = 70/163 (42%), Positives = 102/163 (62%), Gaps = 1/163 (0%)

Query: 222 DSDAFVATMLPMAEQAARRIGVDPRYLVAQAALETGWGKSVMRNSDGSSSHNLFGIKATG 281
DS AF+A + A+ A+++ GV ++AQAALE+GWG+ +R +G S+NLFG+KA+G
Sbjct: 148 DSKAFLAQLSLPAQLASQQSGVPHHLILAQAALESGWGQRQIRRENGEPSYNLFGVKASG 207

Query: 282 NWQGEQARAITSEFRDGQFVKETAAFRSYDSYQDSFHDLVTLLQSNARYQGALDAADNPE 341
NW+G T+E+ +G+ K A FR Y SY ++ D V LL N RY A+ A + E
Sbjct: 208 NWKGPVTEITTTEYENGEAKKVKAKFRVYSSYLEALSDYVGLLTRNPRY-AAVTTAASAE 266

Query: 342 QFARELQKAGYATDPGYAKKIISIAQQMQSTPQYAMAGRTTNL 384
Q A+ LQ AGYATDP YA+K+ ++ QQM+S + N+
Sbjct: 267 QGAQALQDAGYATDPHYARKLTNMIQQMKSISDKVSKTYSMNI 309



Score = 68.2 bits (166), Expect = 7e-15
Identities = 38/104 (36%), Positives = 60/104 (57%), Gaps = 6/104 (5%)

Query: 19 LNRLSALKHGDRDSEANVRKVAQEFESLFISEMLKASRKASDVLADDNPMNTETVKQYRD 78
LN L A K G+ D AN+R VA++ E +F+ MLK+ R D L D ++E + Y
Sbjct: 18 LNELKA-KAGE-DPAANIRPVARQVEGMFVQMMLKSMR---DALPKDGLFSSEHTRLYTS 72

Query: 79 MYDQQLAVSMSREGGGIGLQDVLVRQLTKGRSASINTSPFPRVD 122
MYDQQ+A M+ G G+GL +++V+Q+T + ++P +
Sbjct: 73 MYDQQIAQQMT-AGKGLGLAEMMVKQMTPEQPLPEESTPAAPMK 115


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_4383   FLGPRINGFLGI  449    e-160    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 449 bits (1156), Expect = e-160
Identities = 166/366 (45%), Positives = 223/366 (60%), Gaps = 10/366 (2%)

Query: 7 LIATTLLLSCAFAAQAERLKDIASISGVRSNQLIGYGLVVGLNGTGDQTTQTPFTLQTFN 66
A L + A R+KDIAS+ R NQLIGYGLVVGL GTGD +PFT Q+
Sbjct: 13 FSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQSMR 72

Query: 67 NMLSQFGIKVPAGSGNVQLKNVAAVSVHADLPPFAKPGQVVDITVSSIGNSKSLRGGSLL 126
ML GI G N KN+AAV V A+LPPFA PG VD+TVSS+G++ SLRGG+L+
Sbjct: 73 AMLQNLGITTQGGQSNA--KNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNLI 130

Query: 127 MTPLKGIDGNVYAIAQGNLVVGGFDAEGRDGSKITVNVPSAGRIPGGASVERAVPSGFNQ 186
MT L G DG +YA+AQG L+V GF A+G D + +T V ++ R+P GA +ER +PS F
Sbjct: 131 MTSLSGADGQIYAVAQGALIVNGFSAQG-DAATLTQGVTTSARVPNGAIIERELPSKFKD 189

Query: 187 GNTLTLNLNRPDFTTAKRIVDKVNDL----LGPGVAQAVDGGSVRVSAPMDPSQRVDYLS 242
L L L PDF+TA R+ D VN G +A+ D + V P + ++
Sbjct: 190 SVNLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPR-VADLTRLMA 248

Query: 243 ILENLEIDPGQAVAKVIINSRTGTIVIGQNVKVSPAAVTHGSLTVTITEDPIVSQPGAFS 302
+ENL ++ AKV+IN RTGTIVIG +V++S AV++G+LTV +TE P V QP FS
Sbjct: 249 EIENLTVET-DTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQPAPFS 307

Query: 303 NGQTAVVPRSRVNAEQEAKPMFKFGPGTTLDEIVRAVNQVGAAPGDLMAILEALKQAGAL 362
GQTAV P++ + A QE + G L +V +N +G ++AIL+ +K AGAL
Sbjct: 308 RGQTAVQPQTDIMAMQEGSKVA-IVEGPDLRTLVAGLNSIGLKADGIIAILQGIKSAGAL 366

Query: 363 QADLIV 368
QA+L++
Sbjct: 367 QAELVL 372


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_4384   FLGLRINGFLGH  199    3e-66    Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 199 bits (506), Expect = 3e-66
Identities = 84/221 (38%), Positives = 113/221 (51%), Gaps = 15/221 (6%)

Query: 16 LAGCVAPTPKPNDPYYAPVLPRTPLPAAANNGSIYQAGF-----EQNLYSDRKAFRVGDI 70
L GC P P P P NGSI+Q+ Q L+ DR+ +GD
Sbjct: 19 LTGCAWIPSTPLVQGATSAQP-VPGPTPVANGSIFQSAQPINYGYQPLFEDRRPRNIGDT 77

Query: 71 ITITLNERTSASKNAGSQIAKTSKTDIGLTSLFGSTPN-TNNPFGGGDLSLEAGYSGDRA 129
+TI L E SASK++ + ++ KT+ G F + P FG +E SG
Sbjct: 78 LTIVLQENVSASKSSSANASRDGKTNFG----FDTVPRYLQGLFGNARADVE--ASGGNT 131

Query: 130 TKGDSKATQGNTLTGSITVTVAEVLPNGIIAVRGEKWLTLNTGEELVRIAGMVRADDIAT 189
G A NT +G++TVTV +VL NG + V GEK + +N G E +R +G+V I+
Sbjct: 132 FNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRFSGVVNPRTISG 191

Query: 190 DNTVPSTRVADARITYSGTGSFADASQPGWLDRFFI--SPL 228
NTVPST+VADARI Y G G +A GWL RFF+ SP+
Sbjct: 192 SNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM 232


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
PP_4385   FLGHOOKAP1  43     8e-07    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 42.6 bits (100), Expect = 8e-07
Identities = 12/44 (27%), Positives = 21/44 (47%)

Query: 216 QQTLENSNVSTVEELVNMITTQRAYEMNSKVISTADQMLSFVTQ 259
Q S V+ EE N+ Q+ Y N++V+ TA+ + +
Sbjct: 501 NQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALIN 544



Score = 39.6 bits (92), Expect = 9e-06
Identities = 19/77 (24%), Positives = 33/77 (42%), Gaps = 14/77 (18%)

Query: 5 LWVAKTGLSAQDTNLTVISNNLANVSTTGFKRDRAEFADLLYQIKRQPGAQSTQDSELPS 64
+ A +GL+A L SNN+++ + G+ R + +S L +
Sbjct: 4 INNAMSGLNAAQAALNTASNNISSYNVAGYTRQTT--------------IMAQANSTLGA 49

Query: 65 GLQVGTGVRIVGTQKSF 81
G VG GV + G Q+ +
Sbjct: 50 GGWVGNGVYVSGVQREY 66


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
PP_4388   FLGHOOKAP1  41     6e-06    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 41.5 bits (97), Expect = 6e-06
Identities = 20/61 (32%), Positives = 26/61 (42%)

Query: 2 SFNIGLSGLYAANKALNVTGNNIANVATTGFKSSRAEFADQYSNSIRGTSAGKTVIGTGV 61
N +SGL AA ALN NNI++ G+ A S G G V +GV
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSGV 62

Query: 62 K 62
+
Sbjct: 63 Q 63



Score = 37.6 bits (87), Expect = 8e-05
Identities = 16/48 (33%), Positives = 24/48 (50%)

Query: 392 QISGGALEDSNVDLTGELVNLIKAQSNYQANAKTISTESTIMQTIIQM 439
Q+S S V+L E NL + Q Y ANA+ + T + I +I +
Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
PP_4390   FLGHOOKAP1  35     7e-05    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 34.9 bits (80), Expect = 7e-05
Identities = 8/38 (21%), Positives = 20/38 (52%)

Query: 108 NVNVVEEMADMISASRAFQTNAELMNTAKNMMQKVLTL 145
VN+ EE ++ + + NA+++ TA + ++ +
Sbjct: 508 GVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545



Score = 31.9 bits (72), Expect = 8e-04
Identities = 28/152 (18%), Positives = 54/152 (35%), Gaps = 25/152 (16%)

Query: 4 SSVFNIAGSGMSAQNTRLNTVASNIANAETVSSSIDQTYRARHPVFATTFQNAQAGSSQS 63
SS+ N A SG++A LNT ++NI++ + R +
Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYT-------RQTTIMAQANSTLGA---- 49

Query: 64 LFEDQGEAGQGVQVKGI--VEDQSTLEARYEPNHPAANKDGYVYYPNVNVVEEMADMISA 121
G G GV V G+ D ++ Y ++ ++ M ++
Sbjct: 50 ----GGWVGNGVYVSGVQREYDAFITNQLRAAQTQSSGLTA--RYEQMSKIDNMLSTSTS 103

Query: 122 SRA------FQTNAELMNTAKNMMQKVLTLGQ 147
S A F + L++ A++ + +G+
Sbjct: 104 SLATQMQDFFTSLQTLVSNAEDPAARQALIGK 135


101  PP_4502  PP_4505  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
PP_4502   1       14       2.442398   hypothetical protein
PP_4503   1       12       2.538215   winged helix family two component
PP_4504   0       12       2.022988   hypothetical protein
PP_4505   -1      13       1.819191   integral membrane sensor signal transduction
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
PP_4502   adhesinmafb  27     0.011    Neisseria meningitidis: adhesin MafB signature.
		>adhesinmafb#Neisseria meningitidis: adhesin MafB signature.

Length = 467

Score = 27.3 bits (60), Expect = 0.011
Identities = 11/44 (25%), Positives = 15/44 (34%)

Query: 54 AGFSGSLIVAEFESLAAAQTWADADPYIAAGVYDKVVVKPFKQV 97
G GS+ E + A W +P A V V +V
Sbjct: 279 IGGLGSVAGFEKNTREAVDRWIQENPNAAETVEAVFNVAAAAKV 322


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
PP_4503   HTHFIS  99     3e-26    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 99.1 bits (247), Expect = 3e-26
Identities = 37/116 (31%), Positives = 60/116 (51%)

Query: 4 LLLIDDDQELCELLGSWLTQEGFSVRACHDGQSARLALAEHAPAAVVLDVMLPDGSGLEL 63
+L+ DDD + +L L++ G+ VR + + +A VV DV++PD + +L
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 64 LKQLRSEHAELPVLMLSARGEPLDRILGLELGADDYLAKPCDPRELTARLRAVLRR 119
L +++ +LPVL++SA+ + I E GA DYL KP D EL + L
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
PP_4504   NEISSPPORIN  28     0.012    Neisseria sp. porin signature.
		>NEISSPPORIN#Neisseria sp. porin signature.

Length = 348

Score = 28.0 bits (62), Expect = 0.012
Identities = 13/20 (65%), Positives = 15/20 (75%), Gaps = 1/20 (5%)

Query: 1 MRKTLIALMFAAALPTVAMA 20
M+K+LIAL AALP AMA
Sbjct: 1 MKKSLIALTL-AALPVAAMA 19


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_4505   PF06580  38     4e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 38.3 bits (89), Expect = 4e-05
Identities = 18/115 (15%), Positives = 41/115 (35%), Gaps = 24/115 (20%)

Query: 327 PGLSLQGWPTLIERAVDNLLRNALRFNPAGQPVEVSAAREQDRIVISVRDHGPGAAEEHL 386
P + +Q TL+E + ++ + P G + + ++ + + V + G A +
Sbjct: 256 PPMLVQ---TLVENGI----KHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKN-- 306

Query: 387 AQLGEPFFRAPGQEAPGHGLGLA-IARKAAERHGGSLMLE-NHPQGGFVARLELP 439
G GL + + +G ++ + QG A + +P
Sbjct: 307 -------------TKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348


102  PP_4641  PP_4648  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
PP_4641   1       12       -0.209517  carbon starvation protein CstA
PP_4642   0       10       -1.386516  type IV pilus assembly PilZ
PP_4643   -2      11       -0.261279  xanthine/uracil permease
PP_4644   -3      14       -0.545967  DNA repair protein RadA
PP_4645   -3      12       -1.485650  large-conductance mechanosensitive channel
PP_4646   -1      12       -0.980173  oxidoreductase FAD/NAD(P)-binding
PP_4647   -1      11       0.169250   LuxR family transcriptional regulator
PP_4648   1       12       1.337140   rRNA (guanine-N(2)-)-methyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_4641   ACRIFLAVINRP  31     0.029    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 30.6 bits (69), Expect = 0.029
Identities = 11/66 (16%), Positives = 27/66 (40%)

Query: 168 FGCFLIMIIILAVLALIVVKALAESPWGMFTVMATIPIAMFMGIYMRYIRPGRIGEISVV 227
++ I V+ + + AL ES +VM +P+ + + + + +V
Sbjct: 869 GNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMV 928

Query: 228 GVVLLL 233
G++ +
Sbjct: 929 GLLTTI 934


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_4643   ACRIFLAVINRP  31     0.014    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 31.0 bits (70), Expect = 0.014
Identities = 21/142 (14%), Positives = 44/142 (30%), Gaps = 21/142 (14%)

Query: 114 VLGAVMAASLIGFLITPVFSRIT-------KFFPPLVTGIVI-----TTIGLTLMPVAAR 161
+ GA++ +++ ++ VF + + IV + L L P
Sbjct: 438 IQGALVGIAMV---LSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCA 494

Query: 162 WVMGGNSASPE-----FGSMANIGLAALTFAIVLLLSKLGSATISRLSILLAMVVGTLIA 216
++ SA F N + K+ +T L I +V G ++
Sbjct: 495 TLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLLIYALIVAGMVVL 554

Query: 217 WA-LGMTDFSKVGEGPMFAFPT 237
+ L + + +G
Sbjct: 555 FLRLPSSFLPEEDQGVFLTMIQ 576


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
PP_4645   MECHCHANNEL  178    3e-61    Bacterial mechano-sensitive ion channel signature.
		>MECHCHANNEL#Bacterial mechano-sensitive ion channel signature.

Length = 136

Score = 178 bits (454), Expect = 3e-61
Identities = 89/134 (66%), Positives = 108/134 (80%), Gaps = 1/134 (0%)

Query: 1 MGMLNEFKAFAVKGNVVDMAVGIIIGAAFGKIVSSFVGDVIMPPLGLLIGGVDFSDLAIT 60
M ++ EF+ FA++GNVVD+AVG+IIGAAFGKIVSS V D+IMPPLGLLIGG+DF A+T
Sbjct: 1 MSIIKEFREFAMRGNVVDLAVGVIIGAAFGKIVSSLVADIIMPPLGLLIGGIDFKQFAVT 60

Query: 61 LKAAEGDVPAVVLAYGKFIQTVIDFVIVAFAIFMGVKAINKLKREEAVAPTTPPVPSAEE 120
L+ A+GD+PAVV+ YG FIQ V DF+IVAFAIFM +K INKL R++ P P P+ EE
Sbjct: 61 LRDAQGDIPAVVMHYGVFIQNVFDFLIVAFAIFMAIKLINKLNRKKE-EPAAAPAPTKEE 119

Query: 121 TLLTEIRDLLKTQN 134
LLTEIRDLLK QN
Sbjct: 120 VLLTEIRDLLKEQN 133


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
PP_4648   RTXTOXIND  29     0.025    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 29.4 bits (66), Expect = 0.025
Identities = 25/127 (19%), Positives = 48/127 (37%), Gaps = 12/127 (9%)

Query: 49 VLVLNDSFGALAASLAGQLQVVSSGDSHLGHLALEKNLARNGLPFDSVPFVPASEHWQGP 108
VL+ + GA A +L Q ++ + + L +++ N LP +P P ++
Sbjct: 123 VLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEE 182

Query: 109 FDRVLVRVPKTLALLEEQLIRLQGQLAPGAQVIAGAMIKHLPRAAGDLMEKYIGPVQASL 168
V + +L++EQ Q Q + RA + I +
Sbjct: 183 E------VLRLTSLIKEQFSTWQNQKYQKELNLDKK------RAERLTVLARINRYENLS 230

Query: 169 ALKKARL 175
++K+RL
Sbjct: 231 RVEKSRL 237


103  PP_4988  PP_4994  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
PP_4988   1       13       2.678369   chemotaxis protein CheA
PP_4989   -1      19       1.799314   methyl-accepting chemotaxis sensory transducer
PP_4990   -2      16       1.785168   type IV pili signal transduction protein PilI
PP_4991   -1      17       1.419126   response regulator receiver protein
PP_4992   -1      17       2.039537   response regulator receiver protein
PP_4993   -1      18       2.261682   glutathione synthetase
PP_4994   -2      19       3.720478   TonB family protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
PP_4988   HTHFIS  73     4e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 72.9 bits (179), Expect = 4e-15
Identities = 25/102 (24%), Positives = 49/102 (48%), Gaps = 2/102 (1%)

Query: 1527 VMVVDDSVTVRKVTSRLLERHGMSVLTAKDGVDAMALLEEHRPDVLLLDIEMPRMDGFEV 1586
++V DD +R V ++ L R G V + + D+++ D+ MP + F++
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 1587 ATRIRRDARLKDLPIIMITSRTGQKHRDRAMAIGVNEYLGKP 1628
RI+ DLP+++++++ +A G +YL KP
Sbjct: 66 LPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKP 105


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
PP_4991   HTHFIS  81     4e-21    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 80.6 bits (199), Expect = 4e-21
Identities = 37/121 (30%), Positives = 56/121 (46%), Gaps = 4/121 (3%)

Query: 2 ARVLIVDDSPTEMYRLTEWLEKHGYQVLKASNGADGVALARQDKPDAVLMDIVMPGMNGF 61
A +L+ DD L + L + GY V SN A D V+ D+VMP N F
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 QATRQLSK-DPDTSAIPVIVVTTKDQETDRIWATRQGARDFLTKPVEEEALIAKLKEVLG 120
++ K PD +PV+V++ ++ I A+ +GA D+L KP + LI + L
Sbjct: 64 DLLPRIKKARPD---LPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 121 A 121

Sbjct: 121 E 121


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
PP_4992   HTHFIS  71     1e-17    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 71.0 bits (174), Expect = 1e-17
Identities = 28/115 (24%), Positives = 49/115 (42%), Gaps = 4/115 (3%)

Query: 6 KVMVIDDSRTIRRTAQMLLGEAGCEVITASDGFDALAKIVDHQPSIIFVDVLMPRLDGYQ 65
++V DD IR L AG +V S+ I ++ DV+MP + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 66 TCAVIKH-NSAFKDTPVILLSSRDGLFDKARGRVVGSDQFLTKPFSKEELLDAIR 119
++ A D PV+++S+++ + G+ +L KPF EL+ I
Sbjct: 65 ---LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_4994   PF03544  62     2e-13    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 61.9 bits (150), Expect = 2e-13
Identities = 30/171 (17%), Positives = 56/171 (32%), Gaps = 12/171 (7%)

Query: 106 ITPPPAARP---EVVPPPPPKKSAVVTTAPKPHKVEPKPKESKAQPKPAAPTPDFDSSQL 162
+ PP A +P VV P P + P +E + K +PKP
Sbjct: 60 LEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVE------- 112

Query: 163 SSQIASLEAELSNEQQMYAKRPRIHRLNAASTMRDKGAWYKEEWRKKVERVGNLNYPDEA 222
+ E P + A+ K + + R YP A
Sbjct: 113 QPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRN-QPQYPARA 171

Query: 223 RRQQIYGNLRMMVSINRDGSLYEVLVLESSGQPVLDQAAQRIVRLAAPFAP 273
+ +I G +++ + DG + V +L + + ++ + +R + P
Sbjct: 172 QALRIEGQVKVKFDVTPDGRVDNVQILSAKPANMFEREVKNAMR-RWRYEP 221


104  PP_5077  PP_5083  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
PP_5077   -1      18       0.598017   sporulation domain-containing protein
PP_5078   -1      21       0.573061   3-dehydroquinate synthase
PP_5079   -2      12       0.974151   shikimate kinase
PP_5080   -3      10       0.207072   type IV pilus secretin PilQ
PP_5081   -3      12       0.447558   type IV pili biogenesis protein
PP_5082   -3      12       -0.205234  fimbrial assembly family protein
PP_5083   -3      11       1.038208   type IV pili biogenesis protein PilM
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
PP_5077   PERTACTIN  37     3e-04    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 36.6 bits (84), Expect = 3e-04
Identities = 28/79 (35%), Positives = 32/79 (40%), Gaps = 3/79 (3%)

Query: 364 VTTIAPPQGVPAGPAPAPAQPVQSVAPAPKPVATQPAKPVAPAKPAPAPTQVAVAK--PA 421
V APP PA P P P Q P P QP +P APAP A + A
Sbjct: 563 VGAKAPPAPKPA-PQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAGRELSAA 621

Query: 422 AKAAEKPAAAGAGNSGWYA 440
A AA G ++ WYA
Sbjct: 622 ANAAVNTGGVGLASTLWYA 640


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_5079   PF05272  28     0.017    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.5 bits (63), Expect = 0.017
Identities = 11/33 (33%), Positives = 16/33 (48%), Gaps = 2/33 (6%)

Query: 4 LILVGPMGAGKSTIGRLLAKELRLLFKDSDKEI 36
++L G G GKST+ L F D+ +I
Sbjct: 599 VVLEGTGGIGKSTLINTLVGL--DFFSDTHFDI 629


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_5080   BCTERIALGSPD  240    7e-75    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.
Length = 660

Score = 240 bits (613), Expect = 7e-75
Identities = 91/403 (22%), Positives = 162/403 (40%), Gaps = 57/403 (14%)

Query: 68 DVPWDQALDLVLRSKGLSRRQEGNVLLVAPAAEFAAQ--------SADARVSQALDAHLQ 119
+ W A D+V L++ + L + A A S + Q + A ++
Sbjct: 198 PLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNSRQRIIAMIK 257

Query: 120 PLRRE--------LLPIHHAKAADLAQLLLAGLE--------GDGIP----PASLSVDER 159
L R+ ++ + +AKA+DL ++L + + +
Sbjct: 258 QLDRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAKPVAALDKNIIIKAHGQ 317

Query: 160 TNTLVVHLPADRLAEMRQLVAQLDVPVRQVAIEARIVEANVDYEKSLGVRWGGPLYGEK- 218
TN L+V D + ++ +++AQLD+ QV +EA I E +LG++W G
Sbjct: 318 TNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWANKNAGMTQ 377

Query: 219 ------------------LRPGKELFVDLGLERAGSSIGLGLLRGDVLLDLELSAMEKSG 260
+ G + + I G +G+ + L+A+ S
Sbjct: 378 FTNSGLPISTAIAGANQYNKDGTVSSSLASALSSFNGIAAGFYQGN--WAMLLTALSSST 435

Query: 261 NGEIISQPKVVTADKETARILKGTEVPYQETSQSG-----ATSVSFREASLSLEVTPQIT 315
+I++ P +VT D A G EVP SQ+ +V + + L+V PQI
Sbjct: 436 KNDILATPSIVTLDNMEATFNVGQEVPVLTGSQTTSGDNIFNTVERKTVGIKLKVKPQIN 495

Query: 316 PDNKVIMTVRVTKDEPDYVNALNN---VPPIRKNEVNAKVRVADGETIVIGGVYSTSQNN 372
+ V++ + + + VN V V GET+V+GG+ S ++
Sbjct: 496 EGDSVLLEIEQEVSSVADAASSTSSDLGATFNTRTVNNAVLVGSGETVVVGGLLDKSVSD 555

Query: 373 VVDKVPFFGDLPYVGRLFRRDALQEKKSELLVFLTPRIMSDQA 415
DKVP GD+P +G LFR + + K L++F+ P ++ D+
Sbjct: 556 TADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRD 598



Score = 53.0 bits (127), Expect = 1e-09
Identities = 46/245 (18%), Positives = 96/245 (39%), Gaps = 15/245 (6%)

Query: 5 MKYLLATLLSIPLASATPYQGEPLSLNFQDVEVRAVLQVLADYAGVNLVASDAVQGSVTL 64
++ TLL P E S +F+ +++ + ++ ++ +V+G++T+
Sbjct: 7 IRSFSLTLLIFAALLFRPAAAEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITV 66

Query: 65 RLQD-VPWDQALDL---VLRSKGLSR-RQEGNVLLVAPAAEFAAQSADARVSQALDAHLQ 119
R D + +Q VL G + VL V + + A +A S A
Sbjct: 67 RSYDMLNEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRSKD-AKTAAVPVASDAAPGIGD 125

Query: 120 PLRRELLPIHHAKAADLAQLLLAGLEGDGIPPASLSVDERTNTLVVHLPADRLAEMRQLV 179
+ ++P+ + A DLA LL + G+ S+ E +N L++ A + + +V
Sbjct: 126 EVVTRVVPLTNVAARDLAPLLRQLNDNAGV--GSVVHYEPSNVLLMTGRAAVIKRLLTIV 183

Query: 180 AQLDVPVRQVAIEARIVEANVDYEKSLGVRWGGPLYGEKLR---PGKELFVDLGLERAGS 236
++D + + + A+ V+ L + + PG + + ER +
Sbjct: 184 ERVDNAGDRSVVTVPLSWASAAD----VVKLVTELNKDTSKSALPGSMVANVVADERTNA 239

Query: 237 SIGLG 241
+ G
Sbjct: 240 VLVSG 244


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_5083   AUTOINDCRSYN  29     0.021    Autoinducer synthesis protein signature.
		>AUTOINDCRSYN#Autoinducer synthesis protein signature.

Length = 216

Score = 28.7 bits (64), Expect = 0.021
Identities = 18/88 (20%), Positives = 33/88 (37%), Gaps = 11/88 (12%)

Query: 40 AQELFQAPLADDWAQAPG------VVVAALQRAYRRSGLRQRRVALALPASQVICKLCHL 93
+ LF + + ++++ G +V + +RSG R V L + L L
Sbjct: 120 SSMLFLSMI--NYSKDKGYDGIYTIVSHPMLTILKRSGWGIRVVEQGLSEKEERVYLVFL 177

Query: 94 PVEQSGAQLEAQLLADAERLFPFPLEDL 121
PV+ + + L R F +L
Sbjct: 178 PVD---DENQEALARRINRSGTFMSNEL 202


105  PP_5239  PP_5248  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
PP_5239   1       10       2.794202   magnesium chelatase subunit D/I family protein
PP_5240   0       11       2.288007   response regulator
PP_5241   0       11       2.255354   LuxR family transcriptional regulator
PP_5242   0       10       1.939383   sensor histidine kinase/GAF domain-containing
PP_5243   0       13       0.222184   isochorismatase superfamily hydrolase
PP_5244   0       11       0.567209   AraC family transcriptional regulator
PP_5245   -1      12       1.105237   AraC family transcriptional regulator
PP_5246   0       13       1.464102   potassium efflux system protein
PP_5247   -1      11       1.236606   hypothetical protein
PP_5248   -1      13       1.582644   isochorismatase superfamily hydrolase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
PP_5239   HTHFIS  35     5e-04    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 35.2 bits (81), Expect = 5e-04
Identities = 37/165 (22%), Positives = 54/165 (32%), Gaps = 48/165 (29%)

Query: 198 LAAKRALLLAAAGAHNLLFTGPPGTGKTLLASRLPGLLPPLDEHEALEVAAIQSVSGKAP 257
R L L+ TG GTGK L+A A+ +
Sbjct: 147 QEIYRVLARLMQTDLTLMITGESGTGKELVAR------------------ALHDYGKRR- 187

Query: 258 LNSWPQRPFRHPHHSASGP------ALVG-------GSSRPQPGEITLAHHGVLFLDEL- 303
PF + A+ P L G G+ G A G LFLDE+
Sbjct: 188 -----NGPF-VAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIG 241

Query: 304 ---PEFERRVLEVLREPLESGEIVIARARDKVRFPARFQLVAAMN 345
+ + R+L VL++ GE + + ++VAA N
Sbjct: 242 DMPMDAQTRLLRVLQQ----GE--YTTVGGRTPIRSDVRIVAATN 280


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
PP_5240   HTHFIS  92     4e-25    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 91.8 bits (228), Expect = 4e-25
Identities = 34/120 (28%), Positives = 54/120 (45%), Gaps = 1/120 (0%)

Query: 1 MSRG-VCIVDDDASVRKSLANLLRSAGFETLSFSAGHAFLASPLAGEAGCVLLDLKMPGM 59
M+ + + DDDA++R L L AG++ S AG+ V+ D+ MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 60 SGLEVQRELAQRGWRLPVICMSAHWDDGSVRAAMGLGALACLGKPFSEEVLLKVVEEALA 119
+ ++ + + LPV+ MSA + A GA L KPF L+ ++ ALA
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
PP_5241   HTHFIS  90     5e-22    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 89.5 bits (222), Expect = 5e-22
Identities = 25/112 (22%), Positives = 53/112 (47%)

Query: 127 VLVVDDDSSVRTALGRLLRSQDIPHHLFASAEALSEARLETPCACLLLDMHLPGTSGLEV 186
+LV DDD+++RT L + L + ++A L ++ D+ +P + ++
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 187 QDALCRLALPWPIVFMTGFGTIPMTVQAMRAGAVEFLTKPFDEDQLLTLLQT 238
+ + P++ M+ T ++A GA ++L KPFD +L+ ++
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_5242   PF06580  38     2e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 38.3 bits (89), Expect = 2e-04
Identities = 52/281 (18%), Positives = 98/281 (34%), Gaps = 64/281 (22%)

Query: 1412 LEILASQAAVSLQTAKFYTRLAEENQIRTQMEAELRRSRAE-LARSAHLQAMNELSASIA 1470
L I+ + V+ + Y + +AE+ + + +A+ A L A L A I
Sbjct: 118 LSIIFNVVVVTFMWSLLYFGWHFFKNYK---QAEIDQWKMASMAQEAQLMA---LKAQI- 170

Query: 1471 HEISQP--LLGIASNAAASLRWLKRAKPDLEEAIAGLEDIRNDSERAGNIVRAL----RS 1524
P + NA ++R L I D +A ++ +L R
Sbjct: 171 ----NPHFMF----NALNNIRAL----------------ILEDPTKAREMLTSLSELMRY 206

Query: 1525 LAKQSPMQLKAVKLDEL--IREVVRLTSA---DAAKGKVDVQTRLKAGVCVTADPVQLQQ 1579
+ S + ++ DEL + ++L S D + + + + V P+ +Q
Sbjct: 207 SLRYSNARQVSLA-DELTVVDSYLQLASIQFEDRLQFENQINPAIMD---VQVPPMLVQT 262

Query: 1580 LVFNLITNALEALAGYRCDGVLKITSAVVEDEVEICVDDNGPGIAADERERVFDAFHTTK 1639
LV N I + + L G + + V + V++ G + +E
Sbjct: 263 LVENGIKHGIAQLPQ---GGKILLKGTKDNGTVTLEVENTGSLALKNTKE---------- 309

Query: 1640 TGGMGMGLA-ICSSVAQAHGGQLQ-ALVSQLGGCRIRVSLP 1678
G GL + + +G + Q L + G V +P
Sbjct: 310 --STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_5243   ISCHRISMTASE  41     1e-06    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 41.2 bits (96), Expect = 1e-06
Identities = 31/159 (19%), Positives = 56/159 (35%), Gaps = 20/159 (12%)

Query: 8 RLNKDDAVVLLVDHQTGLISLVQDFSP--NEFKNNVLALGDLAKFFGLPTILTTS-FEQG 64
+ + AV+L+ D Q + + E N+ L + G+P + T Q
Sbjct: 25 VPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANIRKLKNQCVQLGIPVVYTAQPGSQN 84

Query: 65 PNGPLV------PELKEMFPDAPYIAR----PGQI-------NAWDNEDFVKAIKATGRK 107
P+ + P L + I + +A+ + ++ ++ GR
Sbjct: 85 PDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLVLTKWRYSAFKRTNLLEMMRKEGRD 144

Query: 108 QLIIAGVVTDVCVAFPTLSALAEGFEVFVVTDASGTFNE 146
QLII G+ + A E + F V DA F+
Sbjct: 145 QLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVADFSL 183


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_5248   ISCHRISMTASE  38     1e-05    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 38.1 bits (88), Expect = 1e-05
Identities = 31/166 (18%), Positives = 59/166 (35%), Gaps = 26/166 (15%)

Query: 13 DAAVLLV-DHQAGLLSLVRDIEP--DKFKNNVLALADLAKFFNLPTILTTS-FEQGPNGP 68
+ AVLL+ D Q + + N+ L + +P + T Q P+
Sbjct: 29 NRAVLLIHDMQNYFVDAFTAGASPVTELSANIRKLKNQCVQLGIPVVYTAQPGSQNPDDR 88

Query: 69 LV------PELKALFPDAPYIAR----PGQI-------NAWDNEDFVKAVKATGKKQLII 111
+ P L + + I + +A+ + ++ ++ G+ QLII
Sbjct: 89 ALLTDFWGPGLNSGPYEEKIITELAPEDDDLVLTKWRYSAFKRTNLLEMMRKEGRDQLII 148

Query: 112 AGVVTEVCVAFPALSALEEEFEVFVVTDASGTFNEMTRDAAHDRMS 157
G+ + A A E+ + F V DA F+ +M+
Sbjct: 149 TGIYAHIGCLVTACEAFMEDIKAFFVGDAVADFSL-----EKHQMA 189


106  PP_5320  PP_5324  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
PP_5320   -2      13       0.139283   transcriptional regulator
PP_5321   -2      13       -0.242493  PAS/PAC sensor signal transduction histidine
PP_5322   -3      15       0.373824   hypothetical protein
PP_5323   -2      16       0.195675   M24/M37 family peptidase
PP_5324   -1      20       -0.274508  response regulator receiver protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
PP_5320   HTHFIS  98     1e-25    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 98.0 bits (244), Expect = 1e-25
Identities = 39/124 (31%), Positives = 63/124 (50%), Gaps = 2/124 (1%)

Query: 14 MVGRNILIVDDEAPIREMIAVALEMAGYDCLEAENSQQAHAIIVDRKPDLILLDWMLPGT 73
M G IL+ DD+A IR ++ AL AGYD N+ I DL++ D ++P
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 74 SGIELARRLKRDELTGDIPIIMLTAKGEEDNKIQGLEVGADDYITKPFSPRELVARLKAV 133
+ +L R+K + D+P+++++A+ I+ E GA DY+ KPF EL+ +
Sbjct: 61 NAFDLLPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 134 LRRT 137
L
Sbjct: 119 LAEP 122


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
PP_5321   PF06580  29     0.026    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.4 bits (66), Expect = 0.026
Identities = 20/99 (20%), Positives = 34/99 (34%), Gaps = 25/99 (25%)

Query: 329 LVFNAVKY----TQDEGSIRIRWWADAQGAHLSVQDSGVGIDAKHLPRLTERFYRVDSSR 384
LV N +K+ G I ++ D L V+++G + K+
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTG-SLALKNTKE------------ 309

Query: 385 ASNTGGTGLGLAIVKHVLMRHRGK---LEISSVPGHGST 420
TG GL V+ L G +++S G +
Sbjct: 310 -----STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNA 343


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
PP_5323   RTXTOXIND  29     0.020    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 29.4 bits (66), Expect = 0.020
Identities = 11/42 (26%), Positives = 18/42 (42%), Gaps = 7/42 (16%)

Query: 211 PSGNFVRILHPDGTMGVYLHLMRGSVVVAEGQRVRQGQMLAK 252
SG I + + ++ V EG+ VR+G +L K
Sbjct: 92 HSGRSKEIKPIEN--SIVKEII-----VKEGESVRKGDVLLK 126


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
PP_5324   HTHFIS  84     3e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 83.7 bits (207), Expect = 3e-20
Identities = 31/124 (25%), Positives = 59/124 (47%), Gaps = 4/124 (3%)

Query: 1 MSKVNVLVVDDAPFIRDLVRKCLRNAFPGMAIDDAINGRKAMAMLGKEAFDLVLCDWEMP 60
M+ +LV DD IR ++ + L A G + N + DLV+ D MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRA--GYDVRITSNAATLWRWIAAGDGDLVVTDVVMP 58

Query: 61 EMSGLELLTWCRQQPALKNLQFIMVTSRGDKENVIQAIQAGVSDFVGKPFTNEQLLTKVK 120
+ + +LL ++ A +L ++++++ I+A + G D++ KPF +L+ +
Sbjct: 59 DENAFDLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116

Query: 121 KALT 124
+AL
Sbjct: 117 RALA 120


107  PP_5379  PP_5387  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
PP_5379   4       32       -6.554209  copper resistance B
PP_5380   1       29       -5.135509  copper resistance protein A
PP_5381   -1      32       -4.081313  hypothetical protein
PP_5382   -2      24       -2.750958  hypothetical protein
PP_5383   -2      25       -2.633980  two component heavy metal response
PP_5384   -2      23       -2.612904  heavy metal sensor signal transduction histidine
PP_5385   -3      20       -1.350748  CzcC family heavy metal RND efflux outer
PP_5386   0       21       -2.260849  CzcB family heavy metal RND efflux membrane
PP_5387   1       22       -3.094542  CzcA family heavy metal RND efflux protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_5379   CHLAMIDIAOMP  30     0.018    Chlamydia major outer membrane protein signature.
		>CHLAMIDIAOMP#Chlamydia major outer membrane protein signature.

Length = 393

Score = 30.0 bits (67), Expect = 0.018
Identities = 15/34 (44%), Positives = 19/34 (55%), Gaps = 2/34 (5%)

Query: 317 EVGLRLRYEIVRQFAPYIGVTWNRSYGKTADLIR 350
+ L L Y + F PYIGV W+R+ AD IR
Sbjct: 272 QASLALSYRL-NMFTPYIGVKWSRA-SFDADTIR 303


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
PP_5380   ICENUCLEATIN  43     5e-06    Ice nucleation protein signature.
		>ICENUCLEATIN#Ice nucleation protein signature.

Length = 1258

Score = 42.8 bits (100), Expect = 5e-06
Identities = 33/122 (27%), Positives = 44/122 (36%)

Query: 377 MAGMDHGNMAGMDHSKMAGMDHGNMAGMDHSKMAGMDHGNMTGMDHSKMAGMDHGSMAGM 436
G + AG + S +AG A + MAG S AG SMAG
Sbjct: 913 TTGYGSTSTAGYESSLIAGYGSTQTASFKSTLMAGYGSSQTAREQSSLTAGYGSTSMAGY 972

Query: 437 DHSKMAEMDHGGMAGMDHSKMAGMDQGGMADMDHSKMAGMDQGGMADMDHSSMEGMGGAM 496
D S +A AG + AG A+ + AG A D S + G G ++
Sbjct: 973 DSSLIAGYGSTQTAGYQSTLTAGYGSTQTAEHSSTLTAGYGSTATAGADSSLIAGYGSSL 1032

Query: 497 QS 498
S
Sbjct: 1033 TS 1034



Score = 41.3 bits (96), Expect = 1e-05
Identities = 36/138 (26%), Positives = 45/138 (32%)

Query: 366 SMSDMGMDHGSMAGMDHGNMAGMDHSKMAGMDHGNMAGMDHSKMAGMDHGNMTGMDHSKM 425
S G D +AG AG + S+MAG AG G D S +
Sbjct: 198 STGTAGADSTLVAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLI 257

Query: 426 AGMDHGSMAGMDHSKMAEMDHGGMAGMDHSKMAGMDQGGMADMDHSKMAGMDQGGMADMD 485
AG AG D S A A AG G A D S +AG A +
Sbjct: 258 AGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEE 317

Query: 486 HSSMEGMGGAMQSHPASE 503
+ G G + S+
Sbjct: 318 STQTAGYGSTQTAQKGSD 335



Score = 40.9 bits (95), Expect = 2e-05
Identities = 39/118 (33%), Positives = 47/118 (39%), Gaps = 2/118 (1%)

Query: 377 MAGMDHGNMAGMDHSKMAGMDHGNMAGMDHSKMAGMDHGNMTGMDHSKM-AGMDHGSMAG 435
+AG AG D + +AG AG + S+MAG TGM S + AG AG
Sbjct: 193 IAGYGSTGTAGADSTLVAGYGSTQTAGEESSQMAGYG-STQTGMKGSDLTAGYGSTGTAG 251

Query: 436 MDHSKMAEMDHGGMAGMDHSKMAGMDQGGMADMDHSKMAGMDQGGMADMDHSSMEGMG 493
D S +A AG D S AG A AG G A D S + G G
Sbjct: 252 DDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYG 309



Score = 39.4 bits (91), Expect = 5e-05
Identities = 31/133 (23%), Positives = 43/133 (32%)

Query: 366 SMSDMGMDHGSMAGMDHGNMAGMDHSKMAGMDHGNMAGMDHSKMAGMDHGNMTGMDHSKM 425
S MAG A S AG +MAG D S +AG G +
Sbjct: 934 STQTASFKSTLMAGYGSSQTAREQSSLTAGYGSTSMAGYDSSLIAGYGSTQTAGYQSTLT 993

Query: 426 AGMDHGSMAGMDHSKMAEMDHGGMAGMDHSKMAGMDQGGMADMDHSKMAGMDQGGMADMD 485
AG A + A AG D S +AG + + AG ++ +
Sbjct: 994 AGYGSTQTAEHSSTLTAGYGSTATAGADSSLIAGYGSSLTSGIRSFLTAGYGSTLISGLR 1053

Query: 486 HSSMEGMGGAMQS 498
G G ++ S
Sbjct: 1054 SVLTAGYGSSLIS 1066



Score = 39.4 bits (91), Expect = 6e-05
Identities = 33/133 (24%), Positives = 44/133 (33%)

Query: 366 SMSDMGMDHGSMAGMDHGNMAGMDHSKMAGMDHGNMAGMDHSKMAGMDHGNMTGMDHSKM 425
S S G + +AG A + MAG A S AG +M G D S +
Sbjct: 918 STSTAGYESSLIAGYGSTQTASFKSTLMAGYGSSQTAREQSSLTAGYGSTSMAGYDSSLI 977

Query: 426 AGMDHGSMAGMDHSKMAEMDHGGMAGMDHSKMAGMDQGGMADMDHSKMAGMDQGGMADMD 485
AG AG + A A + AG A D S +AG + +
Sbjct: 978 AGYGSTQTAGYQSTLTAGYGSTQTAEHSSTLTAGYGSTATAGADSSLIAGYGSSLTSGIR 1037

Query: 486 HSSMEGMGGAMQS 498
G G + S
Sbjct: 1038 SFLTAGYGSTLIS 1050



Score = 37.8 bits (87), Expect = 1e-04
Identities = 37/138 (26%), Positives = 45/138 (32%)

Query: 366 SMSDMGMDHGSMAGMDHGNMAGMDHSKMAGMDHGNMAGMDHSKMAGMDHGNMTGMDHSKM 425
S G D +AG AG D S AG A AG G D S +
Sbjct: 246 STGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLI 305

Query: 426 AGMDHGSMAGMDHSKMAEMDHGGMAGMDHSKMAGMDQGGMADMDHSKMAGMDQGGMADMD 485
AG AG + ++ A A AG G A D S +AG A D
Sbjct: 306 AGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGED 365

Query: 486 HSSMEGMGGAMQSHPASE 503
S G G + S+
Sbjct: 366 SSLTAGYGSTQTAQKGSD 383



Score = 37.0 bits (85), Expect = 3e-04
Identities = 32/117 (27%), Positives = 40/117 (34%)

Query: 377 MAGMDHGNMAGMDHSKMAGMDHGNMAGMDHSKMAGMDHGNMTGMDHSKMAGMDHGSMAGM 436
+AG AG + +AG AG D + +AG G + S+MAG
Sbjct: 177 IAGYGSTETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTAGEESSQMAGYGSTQTGMK 236

Query: 437 DHSKMAEMDHGGMAGMDHSKMAGMDQGGMADMDHSKMAGMDQGGMADMDHSSMEGMG 493
A G AG D S +AG A D S AG A G G
Sbjct: 237 GSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYG 293



Score = 36.7 bits (84), Expect = 4e-04
Identities = 32/131 (24%), Positives = 49/131 (37%)

Query: 366 SMSDMGMDHGSMAGMDHGNMAGMDHSKMAGMDHGNMAGMDHSKMAGMDHGNMTGMDHSKM 425
S G S AG D +AG ++ AG + AG ++ A + TG +
Sbjct: 862 SDLTTGYGSTSTAGYDSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTST 921

Query: 426 AGMDHGSMAGMDHSKMAEMDHGGMAGMDHSKMAGMDQGGMADMDHSKMAGMDQGGMADMD 485
AG + +AG ++ A MAG S+ A A + MAG D +A
Sbjct: 922 AGYESSLIAGYGSTQTASFKSTLMAGYGSSQTAREQSSLTAGYGSTSMAGYDSSLIAGYG 981

Query: 486 HSSMEGMGGAM 496
+ G +
Sbjct: 982 STQTAGYQSTL 992



Score = 35.5 bits (81), Expect = 8e-04
Identities = 34/121 (28%), Positives = 41/121 (33%)

Query: 378 AGMDHGNMAGMDHSKMAGMDHGNMAGMDHSKMAGMDHGNMTGMDHSKMAGMDHGSMAGMD 437
AG AG D S +AG AG + ++ AG AG AG D
Sbjct: 290 AGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDD 349

Query: 438 HSKMAEMDHGGMAGMDHSKMAGMDQGGMADMDHSKMAGMDQGGMADMDHSSMEGMGGAMQ 497
S +A AG D S AG A AG G A D S + G G
Sbjct: 350 SSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQT 409

Query: 498 S 498
+
Sbjct: 410 A 410



Score = 35.5 bits (81), Expect = 8e-04
Identities = 33/137 (24%), Positives = 44/137 (32%)

Query: 366 SMSDMGMDHGSMAGMDHGNMAGMDHSKMAGMDHGNMAGMDHSKMAGMDHGNMTGMDHSKM 425
S S G D +AG AG + AG A + G + G + S +
Sbjct: 870 STSTAGYDSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYESSLI 929

Query: 426 AGMDHGSMAGMDHSKMAEMDHGGMAGMDHSKMAGMDQGGMADMDHSKMAGMDQGGMADMD 485
AG A + MA A S AG MA D S +AG A
Sbjct: 930 AGYGSTQTASFKSTLMAGYGSSQTAREQSSLTAGYGSTSMAGYDSSLIAGYGSTQTAGYQ 989

Query: 486 HSSMEGMGGAMQSHPAS 502
+ G G + +S
Sbjct: 990 STLTAGYGSTQTAEHSS 1006



Score = 35.1 bits (80), Expect = 0.001
Identities = 28/116 (24%), Positives = 37/116 (31%)

Query: 378 AGMDHGNMAGMDHSKMAGMDHGNMAGMDHSKMAGMDHGNMTGMDHSKMAGMDHGSMAGMD 437
AG A + G + AG + S +AG + MAG A
Sbjct: 898 AGYGSTQTAQENSDLTTGYGSTSTAGYESSLIAGYGSTQTASFKSTLMAGYGSSQTAREQ 957

Query: 438 HSKMAEMDHGGMAGMDHSKMAGMDQGGMADMDHSKMAGMDQGGMADMDHSSMEGMG 493
S A MAG D S +AG A + AG A+ + G G
Sbjct: 958 SSLTAGYGSTSMAGYDSSLIAGYGSTQTAGYQSTLTAGYGSTQTAEHSSTLTAGYG 1013



Score = 34.3 bits (78), Expect = 0.002
Identities = 30/116 (25%), Positives = 37/116 (31%)

Query: 378 AGMDHGNMAGMDHSKMAGMDHGNMAGMDHSKMAGMDHGNMTGMDHSKMAGMDHGSMAGMD 437
G + AG D S +AG AG + AG AG S AG D
Sbjct: 626 TGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGAD 685

Query: 438 HSKMAEMDHGGMAGMDHSKMAGMDQGGMADMDHSKMAGMDQGGMADMDHSSMEGMG 493
S +A AG + AG A +G A D S + G G
Sbjct: 686 SSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYG 741



Score = 33.6 bits (76), Expect = 0.003
Identities = 31/137 (22%), Positives = 43/137 (31%)

Query: 366 SMSDMGMDHGSMAGMDHGNMAGMDHSKMAGMDHGNMAGMDHSKMAGMDHGNMTGMDHSKM 425
S S G D +AG AG + AG A + G + G D S +
Sbjct: 822 STSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYDSSLI 881

Query: 426 AGMDHGSMAGMDHSKMAEMDHGGMAGMDHSKMAGMDQGGMADMDHSKMAGMDQGGMADMD 485
AG AG + A A + G A + S +AG A
Sbjct: 882 AGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYESSLIAGYGSTQTASFK 941

Query: 486 HSSMEGMGGAMQSHPAS 502
+ M G G + + S
Sbjct: 942 STLMAGYGSSQTAREQS 958



Score = 33.2 bits (75), Expect = 0.004
Identities = 25/101 (24%), Positives = 40/101 (39%)

Query: 376 SMAGMDHGNMAGMDHSKMAGMDHGNMAGMDHSKMAGMDHGNMTGMDHSKMAGMDHGSMAG 435
SMAG D +AG ++ AG AG ++ A G + AG D +AG
Sbjct: 968 SMAGYDSSLIAGYGSTQTAGYQSTLTAGYGSTQTAEHSSTLTAGYGSTATAGADSSLIAG 1027

Query: 436 MDHSKMAEMDHGGMAGMDHSKMAGMDQGGMADMDHSKMAGM 476
S + + AG + ++G+ A S ++G
Sbjct: 1028 YGSSLTSGIRSFLTAGYGSTLISGLRSVLTAGYGSSLISGR 1068



Score = 32.8 bits (74), Expect = 0.005
Identities = 36/161 (22%), Positives = 55/161 (34%), Gaps = 9/161 (5%)

Query: 352 MQAAVPAVDPR-PLISMSDMGMDHGSMAGMDH-------GNMAGMDHSKM-AGMDHGNMA 402
+ + V + P+ D ++ GS ++G S++ AG A
Sbjct: 127 VTSEVKVGNRSLPVTDDIDATIESGSTQPTQTIEIATYGSTLSGTHQSQLIAGYGSTETA 186

Query: 403 GMDHSKMAGMDHGNMTGMDHSKMAGMDHGSMAGMDHSKMAEMDHGGMAGMDHSKMAGMDQ 462
G + +AG G D + +AG AG + S+MA AG
Sbjct: 187 GDSSTLIAGYGSTGTAGADSTLVAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGS 246

Query: 463 GGMADMDHSKMAGMDQGGMADMDHSSMEGMGGAMQSHPASE 503
G A D S +AG A D S G G + S+
Sbjct: 247 TGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSD 287



Score = 32.8 bits (74), Expect = 0.005
Identities = 30/121 (24%), Positives = 37/121 (30%)

Query: 378 AGMDHGNMAGMDHSKMAGMDHGNMAGMDHSKMAGMDHGNMTGMDHSKMAGMDHGSMAGMD 437
AG + AG D S +AG AG + AG +G S AG D
Sbjct: 674 AGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGAD 733

Query: 438 HSKMAEMDHGGMAGMDHSKMAGMDQGGMADMDHSKMAGMDQGGMADMDHSSMEGMGGAMQ 497
S +A A S AG A G A D S + G G
Sbjct: 734 SSLIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQT 793

Query: 498 S 498
+
Sbjct: 794 A 794



Score = 32.0 bits (72), Expect = 0.008
Identities = 36/146 (24%), Positives = 55/146 (37%), Gaps = 10/146 (6%)

Query: 368 SDMGMDHGSM--AGMDHGNMAGMDHSKMAGMDHGNMAGMDHSKMAGMDHGNMTGMDHSKM 425
SD+ +GS AG D +AG ++ AG + AG ++ A G +
Sbjct: 286 SDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGT 345

Query: 426 AGMDHGSMAGMDHSKMAEMDHGGMAGMDHSK--------MAGMDQGGMADMDHSKMAGMD 477
AG D +AG ++ A D AG ++ AG G A D S +AG
Sbjct: 346 AGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYG 405

Query: 478 QGGMADMDHSSMEGMGGAMQSHPASE 503
A + + G G + S+
Sbjct: 406 STQTAGEESTQTAGYGSTQTAQKGSD 431



Score = 32.0 bits (72), Expect = 0.009
Identities = 29/126 (23%), Positives = 42/126 (33%)

Query: 366 SMSDMGMDHGSMAGMDHGNMAGMDHSKMAGMDHGNMAGMDHSKMAGMDHGNMTGMDHSKM 425
S S G D +AG AG + AG A + AG G D S +
Sbjct: 966 STSMAGYDSSLIAGYGSTQTAGYQSTLTAGYGSTQTAEHSSTLTAGYGSTATAGADSSLI 1025

Query: 426 AGMDHGSMAGMDHSKMAEMDHGGMAGMDHSKMAGMDQGGMADMDHSKMAGMDQGGMADMD 485
AG +G+ A ++G+ AG ++ S AG +A
Sbjct: 1026 AGYGSSLTSGIRSFLTAGYGSTLISGLRSVLTAGYGSSLISGRRSSLTAGYGSNQIASHR 1085

Query: 486 HSSMEG 491
S + G
Sbjct: 1086 SSLIAG 1091



Score = 32.0 bits (72), Expect = 0.010
Identities = 28/108 (25%), Positives = 45/108 (41%), Gaps = 3/108 (2%)

Query: 368 SDMGMDHGS--MAGMDHGNMAGMDHSKMAGMDHGNMAGMDHSKMAGMDHGNMTGMDHSKM 425
S+ H S +AG + + G +AG AG + ++G D M G +
Sbjct: 1078 SNQIASHRSSLIAGPESTQITGNRSMLIAGKGSSQTAGYRSTLISGADSVQMAGERGKLI 1137

Query: 426 AGMDHGSMAGMDHSKMAEMDHGGMAGMDHSKMAGMDQGGMADMDHSKM 473
AG D AG D SK+ ++ + D SK+ + + D SK+
Sbjct: 1138 AGADSTQTAG-DRSKLLAGNNSYLTAGDRSKLTAGNDCILMAGDRSKL 1184



Score = 31.6 bits (71), Expect = 0.012
Identities = 29/116 (25%), Positives = 34/116 (29%)

Query: 378 AGMDHGNMAGMDHSKMAGMDHGNMAGMDHSKMAGMDHGNMTGMDHSKMAGMDHGSMAGMD 437
+G + AG D S +AG A S AG G S AG D
Sbjct: 722 SGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGAD 781

Query: 438 HSKMAEMDHGGMAGMDHSKMAGMDQGGMADMDHSKMAGMDQGGMADMDHSSMEGMG 493
S +A AG AG A G A D S + G G
Sbjct: 782 SSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDLTTGYGSTSTAGADSSLIAGYG 837



Score = 31.6 bits (71), Expect = 0.013
Identities = 34/146 (23%), Positives = 51/146 (34%), Gaps = 10/146 (6%)

Query: 368 SDMGMDHGSM--AGMDHGNMAGMDHSKMAGMDHGNMAGMDHSKMAGMDHGNMTGMDHSKM 425
S + +GS AG D +AG ++ AG + AG ++ A G +
Sbjct: 622 SVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTST 681

Query: 426 AGMDHGSMAGMDHSKMAEMDHGGMAGMDHSKMA--------GMDQGGMADMDHSKMAGMD 477
AG D +AG ++ A + AG ++ A G A D S +AG
Sbjct: 682 AGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYG 741

Query: 478 QGGMADMDHSSMEGMGGAMQSHPASE 503
A S G G + S
Sbjct: 742 STQTASYHSSLTAGYGSTQTAREQSV 767



Score = 30.5 bits (68), Expect = 0.025
Identities = 31/138 (22%), Positives = 39/138 (28%)

Query: 366 SMSDMGMDHGSMAGMDHGNMAGMDHSKMAGMDHGNMAGMDHSKMAGMDHGNMTGMDHSKM 425
S G D +AG A S AG A G + G D S +
Sbjct: 582 STGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLI 641

Query: 426 AGMDHGSMAGMDHSKMAEMDHGGMAGMDHSKMAGMDQGGMADMDHSKMAGMDQGGMADMD 485
AG AG + A A AG A D S +AG A +
Sbjct: 642 AGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYN 701

Query: 486 HSSMEGMGGAMQSHPASE 503
G G + S+
Sbjct: 702 SILTAGYGSTQTAQEGSD 719



Score = 30.5 bits (68), Expect = 0.029
Identities = 29/128 (22%), Positives = 38/128 (29%)

Query: 366 SMSDMGMDHGSMAGMDHGNMAGMDHSKMAGMDHGNMAGMDHSKMAGMDHGNMTGMDHSKM 425
S G + AG A + G + AG D S +AG G +
Sbjct: 838 STQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYDSSLIAGYGSTQTAGYNSILT 897

Query: 426 AGMDHGSMAGMDHSKMAEMDHGGMAGMDHSKMAGMDQGGMADMDHSKMAGMDQGGMADMD 485
AG A + AG + S +AG A + MAG A
Sbjct: 898 AGYGSTQTAQENSDLTTGYGSTSTAGYESSLIAGYGSTQTASFKSTLMAGYGSSQTAREQ 957

Query: 486 HSSMEGMG 493
S G G
Sbjct: 958 SSLTAGYG 965



Score = 30.1 bits (67), Expect = 0.032
Identities = 29/84 (34%), Positives = 41/84 (48%), Gaps = 3/84 (3%)

Query: 377 MAGMDHGNMAGMDHSKMAGMDHGNMAGMDHSKMAGMDHGNMTGMDHSKM-AGMDHGSMAG 435
++G D MAG +AG D AG D SK+ ++ +T D SK+ AG D MAG
Sbjct: 1121 ISGADSVQMAGERGKLIAGADSTQTAG-DRSKLLAGNNSYLTAGDRSKLTAGNDCILMAG 1179

Query: 436 MDHSKMAEMDHGGMAGMDHSKMAG 459
D SK+ + + SK+ G
Sbjct: 1180 -DRSKLTAGINSILTAGCRSKLIG 1202


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_5383    HTHFIS    87    5e-22    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 87.2 bits (216), Expect = 5e-22
Identities = 36/117 (30%), Positives = 60/117 (51%)

Query: 2 KLLVAEDEPKTGAYLQQGLAEAGFTVDRVLTGTDALQHALSESYDLLILDVMMPGLEGWE 61
+LVA+D+ L Q L+ AG+ V + + DL++ DV+MP ++
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VLRMVRAAGKDVPVLFLTARDGVDDRVKGLELGADDYLIKPFAFSELLARVRTLLRR 118
+L ++ A D+PVL ++A++ +K E GA DYL KPF +EL+ + L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_5385    RTXTOXIND    31    0.009    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 31.0 bits (70), Expect = 0.009
Identities = 20/120 (16%), Positives = 36/120 (30%), Gaps = 15/120 (12%)

Query: 296 QLPLFSGSRQDPMIAARRAQV---RQLEDEQEAALREHTAQLEADL----AEYQRLQRAV 348
+LP + +V L EQ + + Q E +L AE + +
Sbjct: 164 KLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARI 223

Query: 349 TRSRDTLLPLAEQRVSLALADYRAGKSALEEVLTARRQRVEARLQDIDLQGQLAATAARL 408
R + + + + A VL + VEA +L ++L
Sbjct: 224 NRYENLSRVEKSRLDDFS-SLLHKQAIAKHAVLEQENKYVEA-------VNELRVYKSQL 275


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_5386    RTXTOXIND    48    7e-08    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 47.5 bits (113), Expect = 7e-08
Identities = 31/144 (21%), Positives = 53/144 (36%), Gaps = 15/144 (10%)

Query: 194 ELVDRVARSGKVQNQVTLVAPSAGVIQALDLR-PGMTVTPGATLARINGIANV-WLEAAV 251
L +A++ + Q + AP + +Q L + G VT TL I + + A V
Sbjct: 313 LLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALV 372

Query: 252 PEAQASGLHEGQAVEAHLPAYPGET---VQGKLTALLADADLQSRT---LRLRIELP--- 302
++ GQ + A+P + GK+ + DA R + I +
Sbjct: 373 QNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEENC 432

Query: 303 ----NKDGRLRPGMTAQVSLRPGE 322
NK+ L GM ++ G
Sbjct: 433 LSTGNKNIPLSSGMAVTAEIKTGM 456


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
PP_5387    ACRIFLAVINRP    680    0.0    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 680 bits (1756), Expect = 0.0
Identities = 215/1063 (20%), Positives = 432/1063 (40%), Gaps = 61/1063 (5%)

Query: 5 LIRWSVSNRMLVLLATLFLMAWGIFSLRSLPIDALPDLSDAQVIIRTSYPGQAPQIVENQ 64
+ + + + + + LM G ++ LP+ P ++ V + +YPG Q V++
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 65 VTYPLTTTMLSVPGAKTVRGFSA-FGDSFVYVIFEDGTDLYWARSRVLEYLSQVQSRLPA 123
VT + M + + S G + + F+ GTD A+ +V L LP
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 124 TAK-PALGPDATGVGWIYQYALVDRSGRHDLAQLRSLQDWFLRFELKTVPDVAEVASIGG 182
+ + + + ++ V + + ++ L + V +V G
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 183 MVKQYQVVLDPLRMASLGITQGEVANAIGKANQETGGG------VLEQGEAEFMVRASGY 236
++ LD + +T +V N + N + G L + + A
Sbjct: 181 Q-YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 237 LKTLDDFRAIPLRLAANGAPVMLGDVARVQLGPEARRGIGELDGEGEAVGGVVILRSGKN 296
K ++F + LR+ ++G+ V L DVARV+LG E I ++G+ A G + L +G N
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGK-PAAGLGIKLATGAN 298

Query: 297 AREAITHVKDKLETLKKSLPAGVELVTTYDRSQLIDRAVDNLSHKLIEEFIVVALVCAAF 356
A + +K KL L+ P G++++ YD + + ++ + L E ++V LV F
Sbjct: 299 ALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLF 358

Query: 357 LWHLRSSLVAIVSLPVGVLIALIIMRHQGINANIMSLGGIAIAIGAMVDAAVVMIENAHK 416
L ++R++L+ +++PV +L I+ G + N +++ G+ +AIG +VD A+V++EN +
Sbjct: 359 LQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVER 418

Query: 417 RVEAWHARYPGQVLRGAEHWRVMTDAAVEVGPALFFSLMIITLSFVPVFTLQAQEGRLFA 476
+ + ++ AL M+++ F+P+ G ++
Sbjct: 419 VMMEDKLPPKEATEKSMS----------QIQGALVGIAMVLSAVFIPMAFFGGSTGAIYR 468

Query: 477 PLAFTKTYAMAAAAGLAVTLVPVLMGYWIR--GPLPAEERNPLNRGLIRVYRPALE---- 530
+ T AMA + +A+ L P L ++ E + + ++
Sbjct: 469 QFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTN 528

Query: 531 ---VVLRRPWMTLLGALAILLSSLWPLSHLGGEFLPPLDEGDLLYMPSALPGLSAQKASE 587
+L LL I+ + L FLP D+G L M G + ++ +
Sbjct: 529 SVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQK 588

Query: 588 LLQRTDR--LIRTVPEVASVFGKAGRAESATDPAPLEMFETIVRLKPKDQW-RPGMSSEK 644
+L + L V SVF G + S F V LKP ++ S+E
Sbjct: 589 VLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAF---VSLKPWEERNGDENSAEA 645

Query: 645 LVEELDRTVRVPGLTNIWIPPIR-NRIDMLATGIKSPIGVKVAGSDLAQIDHTTLA---- 699
++ + + + + ++ P I L T G D A + H L
Sbjct: 646 VIHRA--KMELGKIRDGFVIPFNMPAIVELGTA----TGFDFELIDQAGLGHDALTQARN 699

Query: 700 --IERVAKSVPGVTSALAERLTGGRYVDLDIDRQAAARYGLNIADVQAIVAGAIGGETIG 757
+ A+ + S L L++D++ A G++++D+ ++ A+GG +
Sbjct: 700 QLLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVN 759

Query: 758 ETVEGLARYPISVRYPREWRDSVDALRQLPVYTAQGGQITLGTVARVGIAEGPPMLKSEN 817
+ ++ + V+ ++R + + +L V +A G + G P L+ N
Sbjct: 760 DFIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYN 819

Query: 818 ARPSGWVYIDVRGRDLS-SVVADLRRAVDR-EVKLNPGMSLSYSGQFEYLERANARLAWV 875
PS ++++G + D ++ KL G+ ++G + + +
Sbjct: 820 GLPS----MEIQGEAAPGTSSGDAMALMENLASKLPAGIGYDWTGMSYQERLSGNQAPAL 875

Query: 876 VPATLAIIFVLLYLTFGRMGEALLIMGTLPFALTGGVWLLYLMGFNLSVATGVGFIALAG 935
V + ++F+ L + + +M +P + G + L V VG + G
Sbjct: 876 VAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIG 935

Query: 936 VAAEFGVIMLIYLNNAWAERQANSEHTQDALLEAIREGAVQRIRPKAMTVAVIVAGLLPI 995
++A+ ++++ + + E ++EA R+RP MT + G+LP+
Sbjct: 936 LSAKNAILIVEFAKDLM-------EKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPL 988

Query: 996 LWSNGTGSEVMSRIAVPMVGGMLTAPLLSLFVIPAAYWLVRRR 1038
SNG GS + + + ++GGM++A LL++F +P + ++RR
Sbjct: 989 AISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRRC 1031



Score = 75.6 bits (186), Expect = 7e-16
Identities = 93/522 (17%), Positives = 180/522 (34%), Gaps = 48/522 (9%)

Query: 3 EKLIRWSVSNRMLVLLATLFLMAWGIFSLRSLPIDALPDLSDAQVIIRTSYPGQAP---- 58
+ + + LL ++A + LP LP+ + P A
Sbjct: 527 TNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERT 586

Query: 59 QIVENQVT-YPLTTTMLSVPGAKTVRGFSAFG----DSFVYVIFEDGTDLYWARSRVLEY 113
Q V +QVT Y L +V TV GFS G +V + + +
Sbjct: 587 QKVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAV 646

Query: 114 LSQVQSRLPATAKPALGP------DATGVGWIYQYALVDRSGRHDLAQLRSLQDWFLRFE 167
+ + + L + P G + + L+D++G L ++ L
Sbjct: 647 IHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAG-LGHDALTQARNQLLGMA 705

Query: 168 LKTVPDVAEV-ASIGGMVKQYQVVLDPLRMASLGITQGEVANAIGKA-NQETGGGVLEQG 225
+ + V + Q+++ +D + +LG++ ++ I A +++G
Sbjct: 706 AQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRG 765

Query: 226 EA-EFMVRA-SGYLKTLDDFRAIPLRLAANGAPVMLGDVARVQLGPEARR-----GIGEL 278
+ V+A + + +D + +R +ANG V + R G+ +
Sbjct: 766 RVKKLYVQADAKFRMLPEDVDKLYVR-SANGEMVPFSAFTTSHWVYGSPRLERYNGLPSM 824

Query: 279 DGEGEAVGGVVILRSGKNAREAITHVKDKLETLKKSLPAGVELVTTYDRSQLIDRAVDNL 338
+ +GEA G +E L LPAG+ S + +
Sbjct: 825 EIQGEAAPGTS-----------SGDAMALMENLASKLPAGIGY-DWTGMSYQERLSGNQA 872

Query: 339 SHKLIEEFIVVALVCAAFLWHLRSSLVAIVSLPVGVLIALIIMRHQGINANIMSLGGIAI 398
+ F+VV L AA + ++ +P+G++ L+ ++ + G+
Sbjct: 873 PALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLT 932

Query: 399 AIGAMVDAAVVMIENAHKRVEAWHARYPGQVLRGAEHWRVMTDAAVEVGPALFFSLMIIT 458
IG A++++E A +E G+ + A + + + P L SL I
Sbjct: 933 TIGLSAKNAILIVEFAKDLMEK-----EGKGVVEA----TLMAVRMRLRPILMTSLAFI- 982

Query: 459 LSFVPVFTLQAQEGRLFAPLAFTKTYAMAAAAGLAVTLVPVL 500
L +P+ + M +A LA+ VPV
Sbjct: 983 LGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVF 1024



 
Contact Sachin Pundhir for Bugs/Comments.